TOP ACTIVATIONS
MAX = 0.099

loved going on pic n ics with his family . His
. They are for wood cut ters ." They took Lily
! She loved pic n ics and spending time with her
to go on pic n ics with her family . One
who really enjoyed pic n ics . They loved packing lots
. He loved pic n ics and couldn 't wait .
loved to have pic n ics with her family in the
. They had wasted their net and bucket , their scissors
Lily realized that pic n ics were not so scary after
other people having pic n ics . Jack and
to make more pic n ics and looked in the fridge
She had heard about att ics and wanted to explore one
little boy getting a buzz cut and a lady getting a
he wanted to play ' cut the boat '. The little
time there was a wood cut ter . He had an
to have more pic n ics in the future . <|endoftext|>
Lily who loved pic n ics . She would always pack
adventures , had pic n ics , and even danced together
time there was a wood cut ter who was strong and
of people having pic n ics and walking their dogs .

MIN ACTIVATIONS
MIN = -0.108

wants a ring like the ear on TV . She thinks
alone and always counted the days until something special happened .
days were sunny and some days were rainy . Her mum
the summer and why some days were sunny and some days
and pay on the enormous ear we will find !" Tom
the ball and rubbed the ear on the ground . The
the reply from the other ear . John was
and found the coin 's rightful place . Lily was happy
came to see the attractive ear of corn . They were
park they found the biggest ear Tom had ever seen !
, " We count the days , weeks and months and
bow and the 3 - year - old followed him ,
and she was three - years - old . She wanted
uff led the three - year - old 's hair .
?" asked the 3 - year - old child .
" ch oo cho o cho o ". They
a little girl named Kay lee . She really wanted to
?" asked the three - year - old girl .
d ough to make the ear . She makes it very
was shining brightly and the days got hotter and hotter .

INTERVAL 0.047 - 0.099
CONTAINS 0.120%

and Bella . Jake loved raft ing and said to Bella
snacks . After wards , Daddy hugged his little
bath in the green bath tub . They both had a
Make sure you carry your net so you don ’
use the veil to play fairy and prince . They have

INTERVAL -0.005 - 0.047
CONTAINS 65.260%

liked Spot . They took the rake and threw a stick
cover the pillow was a pillow case . Her mom my
up toy guns . They waved to their mom and said
had been nice . But it was too late . They
" You 're welcome , Tim my ," his mom

INTERVAL -0.056 - -0.005
CONTAINS 34.557%

. That is the most important thing . Now , go
mistake . They wished they had listened to Jen and not
. Lily said , " Mom my , that tree is
out and play , but it was still too cold .
to help her mom my , so she went to the

INTERVAL -0.108 - -0.056
CONTAINS 0.063%

stepped closer and saw the body was still , so he
. A few days later , there was a
side and the cat the other . Finally , the pile
naughty . " B izzy ," M ummy said
happy that the three - year - old had changed their
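
The four INTERVAL blocks above are consistent with splitting the range from MIN (-0.108) to MAX (0.099) into four equal-width bins and reporting the share of token activations that falls in each. The dashboard's own binning code is not shown here, so the following is only a minimal sketch under that equal-width assumption; the function names `bin_edges` and `interval_shares` are hypothetical.

```python
from typing import List, Tuple


def bin_edges(lo: float, hi: float, n_bins: int = 4) -> List[float]:
    """Return the n_bins + 1 edges of equal-width bins spanning [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


def interval_shares(acts: List[float], n_bins: int = 4) -> List[Tuple[float, float, float]]:
    """Return (low, high, percent) per bin, lowest bin first.

    Assumption: bins are equal-width between min(acts) and max(acts),
    which matches the interval edges reported above up to rounding.
    """
    lo, hi = min(acts), max(acts)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for a in acts:
        # Clamp the index so the maximum activation lands in the top bin.
        idx = min(int((a - lo) / width), n_bins - 1) if width > 0 else 0
        counts[idx] += 1
    return [
        (lo + i * width, lo + (i + 1) * width, 100.0 * c / len(acts))
        for i, c in enumerate(counts)
    ]


# Edges implied by the reported MIN and MAX of this feature.
edges = bin_edges(-0.108, 0.099)

# Toy activations (not taken from the dashboard) to show the share computation.
toy_shares = [share for _, _, share in interval_shares([-0.108, -0.06, 0.0, 0.01, 0.099])]
```

With the reported MIN and MAX, the computed edges agree with the interval boundaries listed above to within rounding, and the four CONTAINS percentages sum to 100%.
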
Token loved
Token ID6151
Feature activation+1.89e-02

Pos logit contributions
Deal+0.032
 if+0.031
pp+0.029
mun+0.029
 who+0.028
Neg logit contributions
 tow-0.049
lee-0.045
 delight-0.040
 excitement-0.040
 recess-0.040
Token going
Token ID1016
Feature activation+2.64e-02

Pos logit contributions
 who+0.019
 if+0.017
Deal+0.016
 but+0.015
 with+0.015
Neg logit contributions
 tow-0.045
lee-0.041
 delight-0.040
 recess-0.039
 excitement-0.038
Token on
Token ID319
Feature activation+5.41e-03

Pos logit contributions
Deal+0.034
 who+0.032
mun+0.031
 if+0.029
pp+0.028
Neg logit contributions
 tow-0.053
 recess-0.047
lee-0.046
 excitement-0.043
 delight-0.043
Token pic
Token ID8301
Feature activation-2.56e-03

Pos logit contributions
 who+0.008
 if+0.008
Deal+0.008
 but+0.008
 with+0.008
Neg logit contributions
 tow-0.009
lee-0.009
 delight-0.008
 recess-0.008
 response-0.008
Tokenn
Token ID77
Feature activation-2.92e-03

Pos logit contributions
 tow+0.005
lee+0.005
 delight+0.005
 recess+0.005
 excitement+0.005
Neg logit contributions
 who-0.003
Deal-0.003
 if-0.003
 with-0.003
 but-0.003
Tokenics
Token ID873
Feature activation+9.86e-02

Pos logit contributions
 tow+0.005
lee+0.004
 recess+0.004
ived+0.004
lights+0.004
Neg logit contributions
Deal-0.005
 who-0.004
iw-0.004
 if-0.004
pp-0.004
Token with
Token ID351
Feature activation+1.34e-02

Pos logit contributions
ouncing+0.120
pp+0.117
Acc+0.115
Deal+0.115
etermined+0.106
Neg logit contributions
 recess-0.171
 tow-0.160
 vacation-0.156
 his-0.152
 sail-0.150
Token his
Token ID465
Feature activation-1.06e-02

Pos logit contributions
 who+0.020
 with+0.018
 if+0.018
 but+0.016
 and+0.016
Neg logit contributions
chie-0.025
 tow-0.024
ived-0.024
lee-0.023
asm-0.023
Token family
Token ID1641
Feature activation-3.69e-02

Pos logit contributions
 tow+0.017
lee+0.017
alled+0.015
 accuracy+0.015
andel+0.015
Neg logit contributions
 who-0.017
Deal-0.016
 but-0.016
 if-0.016
 with-0.016
Token.
Token ID13
Feature activation+2.58e-03

Pos logit contributions
 tow+0.074
lee+0.065
 delight+0.062
 recess+0.061
 excitement+0.061
Neg logit contributions
Deal-0.043
mun-0.039
pp-0.037
 who-0.036
 if-0.034
Token His
Token ID2399
Feature activation+1.18e-02

Pos logit contributions
 who+0.003
 if+0.003
 but+0.003
Deal+0.003
 with+0.003
Neg logit contributions
 tow-0.005
lee-0.005
 recess-0.005
 delight-0.005
 excitement-0.005
Token.
Token ID13
Feature activation+9.66e-03

Pos logit contributions
Deal+0.021
 who+0.019
mun+0.018
 if+0.017
pp+0.016
Neg logit contributions
 tow-0.051
lee-0.048
 recess-0.046
 delight-0.046
 excitement-0.045
Token They
Token ID1119
Feature activation-2.14e-03

Pos logit contributions
Deal+0.014
 who+0.014
 if+0.013
mun+0.012
 but+0.012
Neg logit contributions
 tow-0.018
lee-0.016
 delight-0.015
 recess-0.015
 excitement-0.015
Token are
Token ID389
Feature activation+1.65e-02

Pos logit contributions
 tow+0.004
 pride+0.003
 excitement+0.003
 delight+0.003
lee+0.003
Neg logit contributions
Deal-0.003
but-0.003
pp-0.003
mun-0.003
 if-0.003
Token for
Token ID329
Feature activation+6.17e-03

Pos logit contributions
Deal+0.018
mun+0.017
pp+0.016
 if+0.016
 who+0.016
Neg logit contributions
 tow-0.035
lee-0.032
 excitement-0.030
 delight-0.030
 recess-0.030
Token wood
Token ID4898
Feature activation+1.53e-02

Pos logit contributions
Deal+0.007
 who+0.007
 if+0.006
 but+0.006
mun+0.006
Neg logit contributions
 tow-0.013
lee-0.012
 recess-0.012
 delight-0.011
 excitement-0.011
Tokencut
Token ID8968
Feature activation+9.84e-02

Pos logit contributions
Deal+0.015
mun+0.013
 who+0.012
pp+0.011
You+0.010
Neg logit contributions
 tow-0.034
lee-0.032
lights-0.030
 tentacles-0.030
 passengers-0.029
Tokenters
Token ID1010
Feature activation+4.64e-02

Pos logit contributions
Deal+0.137
 who+0.128
 if+0.119
 without+0.116
mun+0.113
Neg logit contributions
lee-0.173
 tow-0.161
 recess-0.149
ened-0.148
alks-0.148
Token."
Token ID526
Feature activation+3.39e-03

Pos logit contributions
Deal+0.051
mun+0.042
pp+0.038
obe+0.035
iw+0.035
Neg logit contributions
 tow-0.098
lee-0.091
 delight-0.084
 recess-0.084
 tentacles-0.084
Token They
Token ID1119
Feature activation-5.92e-03

Pos logit contributions
Deal+0.004
 who+0.004
 if+0.004
mun+0.004
pp+0.004
Neg logit contributions
 tow-0.007
lee-0.006
 delight-0.006
 recess-0.006
 excitement-0.006
Token took
Token ID1718
Feature activation+1.40e-02

Pos logit contributions
 tow+0.012
lee+0.011
 delight+0.010
 recess+0.010
 excitement+0.010
Neg logit contributions
 who-0.007
Deal-0.007
 if-0.007
 but-0.007
 without-0.007
Token Lily
Token ID20037
Feature activation+2.65e-02

Pos logit contributions
 with+0.017
 if+0.016
 who+0.015
 without+0.014
 by+0.013
Neg logit contributions
lee-0.030
 tow-0.028
 accuracy-0.026
alks-0.026
 response-0.026
Token!
Token ID0
Feature activation-5.28e-03

Pos logit contributions
Deal+0.023
pp+0.019
mun+0.019
iw+0.018
Acc+0.017
Neg logit contributions
lee-0.018
ived-0.013
 excitement-0.013
 recess-0.013
 tow-0.013
Token She
Token ID1375
Feature activation+2.33e-03

Pos logit contributions
 tow+0.012
lee+0.011
 delight+0.011
 recess+0.010
 excitement+0.010
Neg logit contributions
 who-0.005
Deal-0.005
 if-0.005
mun-0.004
 without-0.004
Token loved
Token ID6151
Feature activation+1.47e-02

Pos logit contributions
 who+0.003
 if+0.003
 but+0.003
Deal+0.003
 with+0.003
Neg logit contributions
 tow-0.005
 recess-0.004
lee-0.004
 delight-0.004
 response-0.004
Token pic
Token ID8301
Feature activation-7.31e-03

Pos logit contributions
 who+0.019
 if+0.018
Deal+0.017
 but+0.017
 with+0.017
Neg logit contributions
 tow-0.030
lee-0.027
 delight-0.026
 recess-0.026
 response-0.025
Tokenn
Token ID77
Feature activation-3.87e-03

Pos logit contributions
 tow+0.015
lee+0.014
 recess+0.013
 delight+0.013
 excitement+0.013
Neg logit contributions
 who-0.009
Deal-0.009
 if-0.008
 but-0.008
 with-0.008
Tokenics
Token ID873
Feature activation+9.77e-02

Pos logit contributions
 tow+0.007
lee+0.007
ived+0.006
 delight+0.006
 excitement+0.006
Neg logit contributions
Deal-0.006
 who-0.005
 if-0.005
iw-0.005
 but-0.005
Token and
Token ID290
Feature activation-3.02e-03

Pos logit contributions
Deal+0.106
mun+0.100
pp+0.096
our+0.090
ouncing+0.088
Neg logit contributions
 tow-0.188
 excitement-0.177
 delight-0.177
 recess-0.172
 anticipation-0.166
Token spending
Token ID4581
Feature activation-9.93e-03

Pos logit contributions
 tow+0.006
lee+0.005
 delight+0.005
 excitement+0.005
 recess+0.005
Neg logit contributions
 who-0.004
Deal-0.004
 if-0.004
 without-0.004
 but-0.004
Token time
Token ID640
Feature activation-4.05e-03

Pos logit contributions
 tow+0.014
lee+0.012
 delight+0.011
 recess+0.010
 response+0.010
Neg logit contributions
 who-0.019
 if-0.018
Deal-0.018
 but-0.018
 with-0.018
Token with
Token ID351
Feature activation+1.07e-02

Pos logit contributions
 tow+0.009
 recess+0.009
 excitement+0.009
lee+0.009
 delight+0.009
Neg logit contributions
Deal-0.003
 who-0.003
mun-0.003
 if-0.003
pp-0.002
Token her
Token ID607
Feature activation-1.21e-02

Pos logit contributions
 who+0.007
 if+0.006
 with+0.006
 without+0.005
 where+0.005
Neg logit contributions
 tow-0.029
ived-0.028
chie-0.028
 accuracy-0.028
 seventeen-0.027
Token to
Token ID284
Feature activation+1.37e-02

Pos logit contributions
 who+0.018
 if+0.017
Deal+0.017
 but+0.016
 without+0.015
Neg logit contributions
 tow-0.041
lee-0.038
 delight-0.037
 recess-0.036
 excitement-0.035
Token go
Token ID467
Feature activation+2.41e-02

Pos logit contributions
 who+0.019
 if+0.018
Deal+0.017
 but+0.017
 with+0.016
Neg logit contributions
 tow-0.026
lee-0.024
 delight-0.023
 recess-0.023
 excitement-0.022
Token on
Token ID319
Feature activation-3.09e-03

Pos logit contributions
Deal+0.027
 who+0.025
 if+0.023
mun+0.022
 based+0.021
Neg logit contributions
 tow-0.053
lee-0.048
 recess-0.047
 delight-0.045
 excitement-0.044
Token pic
Token ID8301
Feature activation-1.09e-02

Pos logit contributions
 tow+0.005
lee+0.005
 delight+0.004
 recess+0.004
 response+0.004
Neg logit contributions
 who-0.005
 if-0.005
 but-0.005
Deal-0.005
 with-0.004
Tokenn
Token ID77
Feature activation-5.38e-03

Pos logit contributions
 tow+0.023
lee+0.022
 delight+0.021
 recess+0.020
lights+0.020
Neg logit contributions
 who-0.012
 with-0.011
 if-0.011
Deal-0.011
 but-0.011
Tokenics
Token ID873
Feature activation+9.59e-02

Pos logit contributions
 tow+0.008
lee+0.008
lights+0.007
 recess+0.007
ived+0.007
Neg logit contributions
Deal-0.009
 who-0.008
iw-0.008
 if-0.008
mun-0.007
Token with
Token ID351
Feature activation+7.10e-03

Pos logit contributions
ouncing+0.130
Deal+0.125
iw+0.117
pp+0.117
Acc+0.116
Neg logit contributions
 recess-0.158
 vacation-0.138
 excitement-0.133
 holiday-0.132
 tow-0.130
Token her
Token ID607
Feature activation-1.34e-02

Pos logit contributions
 who+0.005
 with+0.005
 if+0.004
 but+0.003
Deal+0.003
Neg logit contributions
 tow-0.019
chie-0.018
lee-0.017
 accuracy-0.017
ived-0.017
Token family
Token ID1641
Feature activation-4.61e-02

Pos logit contributions
 tow+0.021
lee+0.019
 delight+0.018
 accuracy+0.017
 recess+0.017
Neg logit contributions
 who-0.024
 if-0.022
Deal-0.022
 but-0.022
 with-0.022
Token.
Token ID13
Feature activation+3.12e-03

Pos logit contributions
 tow+0.089
lee+0.080
 delight+0.076
 recess+0.076
 excitement+0.075
Neg logit contributions
Deal-0.054
mun-0.050
 who-0.045
pp-0.045
iw-0.044
Token One
Token ID1881
Feature activation+4.72e-03

Pos logit contributions
 who+0.005
 if+0.004
Deal+0.004
 but+0.004
 without+0.004
Neg logit contributions
 tow-0.006
lee-0.005
 delight-0.005
 recess-0.005
 excitement-0.005
Token who
Token ID508
Feature activation-1.25e-02

Pos logit contributions
 tow+0.079
lee+0.070
 July+0.066
 recess+0.064
 delight+0.064
Neg logit contributions
Deal-0.040
mun-0.031
 if-0.030
pp-0.029
ows-0.029
Token really
Token ID1107
Feature activation+2.16e-03

Pos logit contributions
 tow+0.024
lee+0.021
 delight+0.020
 excitement+0.020
 recess+0.020
Neg logit contributions
Deal-0.015
 if-0.015
 who-0.015
 but-0.015
 without-0.014
Token enjoyed
Token ID8359
Feature activation-1.46e-02

Pos logit contributions
 who+0.002
Deal+0.002
 if+0.002
 but+0.002
 with+0.002
Neg logit contributions
 tow-0.005
lee-0.005
 delight-0.005
 recess-0.005
 response-0.004
Token pic
Token ID8301
Feature activation-5.24e-03

Pos logit contributions
 tow+0.029
lee+0.027
 recess+0.026
 delight+0.026
 excitement+0.025
Neg logit contributions
Deal-0.018
 who-0.018
 if-0.018
 but-0.016
mun-0.016
Tokenn
Token ID77
Feature activation-1.42e-02

Pos logit contributions
 tow+0.011
lee+0.010
 recess+0.009
 delight+0.009
 excitement+0.009
Neg logit contributions
 who-0.006
Deal-0.006
 if-0.006
 but-0.006
 with-0.005
Tokenics
Token ID873
Feature activation+9.45e-02

Pos logit contributions
 tow+0.026
lee+0.025
 recess+0.022
 delight+0.022
ived+0.022
Neg logit contributions
Deal-0.020
 who-0.019
 if-0.018
 but-0.017
iw-0.017
Token.
Token ID13
Feature activation+1.87e-03

Pos logit contributions
Deal+0.135
mun+0.124
ouncing+0.124
etermined+0.123
iw+0.122
Neg logit contributions
 tow-0.147
 recess-0.133
 excitement-0.131
 sailors-0.127
 delight-0.125
Token They
Token ID1119
Feature activation-4.95e-03

Pos logit contributions
 who+0.002
 if+0.002
 but+0.002
Deal+0.002
 with+0.002
Neg logit contributions
 tow-0.004
lee-0.004
 delight-0.004
 recess-0.004
 excitement-0.003
Token loved
Token ID6151
Feature activation+1.43e-02

Pos logit contributions
 tow+0.011
lee+0.010
 delight+0.010
 recess+0.010
 excitement+0.009
Neg logit contributions
 who-0.005
 if-0.005
Deal-0.005
 but-0.005
 with-0.004
Token packing
Token ID24157
Feature activation+1.64e-02

Pos logit contributions
 who+0.016
Deal+0.015
 if+0.015
 but+0.014
 without+0.013
Neg logit contributions
 tow-0.031
lee-0.029
 delight-0.027
 recess-0.027
 excitement-0.027
Token lots
Token ID6041
Feature activation-1.60e-02

Pos logit contributions
Deal+0.031
 who+0.030
 if+0.029
mun+0.028
pp+0.027
Neg logit contributions
 tow-0.022
lee-0.020
 excitement-0.019
 delight-0.019
 recess-0.018
Token.
Token ID13
Feature activation-1.98e-03

Pos logit contributions
 tow+0.073
lee+0.068
 recess+0.068
 delight+0.065
alks+0.064
Neg logit contributions
 who-0.029
 if-0.027
 but-0.025
 with-0.025
 without-0.023
Token He
Token ID679
Feature activation+7.90e-04

Pos logit contributions
 tow+0.004
 recess+0.003
lee+0.003
 delight+0.003
 response+0.003
Neg logit contributions
 who-0.003
 if-0.003
 but-0.003
Deal-0.003
 with-0.003
Token loved
Token ID6151
Feature activation+1.24e-02

Pos logit contributions
 who+0.001
 if+0.001
Deal+0.001
 but+0.001
 without+0.001
Neg logit contributions
 tow-0.002
lee-0.001
 delight-0.001
 recess-0.001
 excitement-0.001
Token pic
Token ID8301
Feature activation-1.08e-02

Pos logit contributions
 who+0.018
 if+0.017
 but+0.016
 with+0.016
Deal+0.016
Neg logit contributions
 tow-0.024
lee-0.021
 recess-0.020
 delight-0.020
 response-0.019
Tokenn
Token ID77
Feature activation-8.29e-03

Pos logit contributions
 tow+0.022
lee+0.021
 recess+0.020
 delight+0.019
 excitement+0.019
Neg logit contributions
 who-0.013
Deal-0.013
 if-0.012
 but-0.012
 with-0.011
Tokenics
Token ID873
Feature activation+9.44e-02

Pos logit contributions
 tow+0.016
lee+0.015
 delight+0.014
 excitement+0.013
ived+0.013
Neg logit contributions
Deal-0.011
 who-0.010
 if-0.010
 but-0.009
 without-0.009
Token and
Token ID290
Feature activation-3.86e-03

Pos logit contributions
Deal+0.091
mun+0.082
pp+0.081
 who+0.074
iw+0.071
Neg logit contributions
 tow-0.210
 delight-0.190
 excitement-0.189
lee-0.184
 anticipation-0.183
Token couldn
Token ID3521
Feature activation-2.17e-02

Pos logit contributions
 tow+0.007
lee+0.006
 delight+0.006
 excitement+0.006
 recess+0.006
Neg logit contributions
 who-0.006
Deal-0.006
 if-0.006
 but-0.005
 without-0.005
Token't
Token ID470
Feature activation+8.04e-03

Pos logit contributions
 tow+0.044
lee+0.039
 delight+0.037
 recess+0.037
 excitement+0.036
Neg logit contributions
 who-0.028
Deal-0.027
 if-0.027
 but-0.025
mun-0.024
Token wait
Token ID4043
Feature activation-1.39e-02

Pos logit contributions
 who+0.011
 if+0.010
Deal+0.010
 but+0.010
 without+0.009
Neg logit contributions
 tow-0.016
lee-0.015
 delight-0.014
 recess-0.014
 excitement-0.013
Token.
Token ID13
Feature activation+6.70e-03

Pos logit contributions
 tow+0.032
lee+0.030
 delight+0.028
 recess+0.028
 excitement+0.028
Neg logit contributions
 who-0.014
 if-0.013
Deal-0.013
 but-0.012
 without-0.011
Token loved
Token ID6151
Feature activation+1.89e-02

Pos logit contributions
 tow+0.014
lee+0.013
 delight+0.012
 recess+0.012
 excitement+0.012
Neg logit contributions
 who-0.007
 if-0.007
Deal-0.006
 but-0.006
 without-0.006
Token to
Token ID284
Feature activation+1.49e-02

Pos logit contributions
 who+0.019
 if+0.017
Deal+0.017
 but+0.016
 with+0.015
Neg logit contributions
 tow-0.044
lee-0.041
 delight-0.039
 recess-0.039
 excitement-0.038
Token have
Token ID423
Feature activation+8.19e-03

Pos logit contributions
 who+0.020
 if+0.019
Deal+0.018
 but+0.018
 with+0.017
Neg logit contributions
 tow-0.029
lee-0.027
 recess-0.025
 delight-0.025
 excitement-0.024
Token pic
Token ID8301
Feature activation-5.26e-03

Pos logit contributions
 who+0.011
 if+0.010
 but+0.010
 with+0.010
Deal+0.009
Neg logit contributions
 tow-0.017
lee-0.014
alks-0.014
 response-0.014
lights-0.014
Tokenn
Token ID77
Feature activation-8.30e-03

Pos logit contributions
 tow+0.012
lee+0.011
 delight+0.010
 recess+0.010
 excitement+0.010
Neg logit contributions
 who-0.005
Deal-0.005
 if-0.005
 with-0.005
 but-0.005
Tokenics
Token ID873
Feature activation+9.28e-02

Pos logit contributions
 tow+0.013
lee+0.012
ived+0.011
 recess+0.011
lights+0.010
Neg logit contributions
Deal-0.014
 who-0.013
iw-0.012
 if-0.012
 but-0.012
Token with
Token ID351
Feature activation+1.04e-02

Pos logit contributions
ouncing+0.137
mun+0.129
pp+0.120
 costs+0.117
atching+0.116
Neg logit contributions
 recess-0.144
 tow-0.128
 hours-0.123
 excitement-0.119
 response-0.117
Token her
Token ID607
Feature activation-1.00e-02

Pos logit contributions
 who+0.013
 if+0.012
Deal+0.012
 but+0.011
 without+0.011
Neg logit contributions
 tow-0.022
lee-0.020
 delight-0.019
 recess-0.019
 excitement-0.018
Token family
Token ID1641
Feature activation-3.74e-02

Pos logit contributions
 tow+0.014
lee+0.012
 delight+0.011
 recess+0.011
 excitement+0.011
Neg logit contributions
 who-0.019
 if-0.018
Deal-0.018
 but-0.018
 without-0.017
Token in
Token ID287
Feature activation+8.96e-03

Pos logit contributions
 tow+0.070
lee+0.062
 delight+0.059
 recess+0.058
 excitement+0.058
Neg logit contributions
Deal-0.047
mun-0.043
pp-0.039
ouncing-0.038
using-0.038
Token the
Token ID262
Feature activation+1.51e-02

Pos logit contributions
 who+0.013
 if+0.012
 with+0.012
 but+0.011
 without+0.011
Neg logit contributions
lee-0.017
 tow-0.017
ived-0.016
 accuracy-0.015
 bullet-0.015
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
Deal+0.027
mun+0.024
pp+0.023
iw+0.022
obe+0.020
Neg logit contributions
 tow-0.045
lee-0.040
 recess-0.039
 response-0.037
lights-0.037
Token They
Token ID1119
Feature activation+6.76e-03

Pos logit contributions
 who+0.021
 if+0.020
Deal+0.019
 but+0.019
 with+0.018
Neg logit contributions
 tow-0.038
lee-0.036
 recess-0.034
 delight-0.034
 excitement-0.033
Token had
Token ID550
Feature activation+1.55e-03

Pos logit contributions
 if+0.010
Deal+0.010
 who+0.009
 but+0.009
 without+0.009
Neg logit contributions
 tow-0.012
lee-0.011
 delight-0.010
 excitement-0.010
 pride-0.010
Token wasted
Token ID18359
Feature activation+1.72e-02

Pos logit contributions
 who+0.002
 if+0.002
Deal+0.002
 but+0.002
 without+0.001
Neg logit contributions
 tow-0.003
lee-0.003
 delight-0.003
 recess-0.003
 excitement-0.003
Token their
Token ID511
Feature activation+2.78e-02

Pos logit contributions
 who+0.031
Deal+0.030
 if+0.030
 but+0.028
mun+0.028
Neg logit contributions
 tow-0.026
lee-0.022
 delight-0.021
 recess-0.021
 excitement-0.021
Token net
Token ID2010
Feature activation+8.87e-02

Pos logit contributions
 who+0.047
 if+0.046
Deal+0.045
 but+0.043
 without+0.042
Neg logit contributions
 tow-0.044
lee-0.040
 delight-0.037
 recess-0.037
 excitement-0.036
Token and
Token ID290
Feature activation+2.15e-02

Pos logit contributions
Deal+0.105
mun+0.088
You+0.076
iw+0.072
 who+0.069
Neg logit contributions
 tow-0.188
lee-0.170
 excitement-0.165
lights-0.162
 delight-0.162
Token bucket
Token ID19236
Feature activation+3.26e-02

Pos logit contributions
Deal+0.031
 who+0.031
 if+0.030
 but+0.029
mun+0.028
Neg logit contributions
 tow-0.038
lee-0.035
 recess-0.033
 delight-0.033
 excitement-0.032
Token,
Token ID11
Feature activation+9.34e-03

Pos logit contributions
Deal+0.030
 who+0.027
mun+0.025
 if+0.024
pp+0.022
Neg logit contributions
 tow-0.077
lee-0.071
lights-0.067
 delight-0.066
 excitement-0.066
Token their
Token ID511
Feature activation-5.89e-03

Pos logit contributions
Deal+0.009
mun+0.008
pp+0.007
obe+0.006
our+0.006
Neg logit contributions
 tow-0.022
 excitement-0.020
lee-0.020
 delight-0.019
 response-0.019
Token scissors
Token ID41786
Feature activation+2.33e-02

Pos logit contributions
 tow+0.009
lee+0.008
 recess+0.007
 delight+0.007
 excitement+0.007
Neg logit contributions
 who-0.011
 if-0.010
Deal-0.010
 but-0.010
 with-0.010
Token Lily
Token ID20037
Feature activation+3.14e-02

Pos logit contributions
 who+0.008
 if+0.007
Deal+0.007
 but+0.007
 without+0.007
Neg logit contributions
 tow-0.010
lee-0.009
 delight-0.009
 recess-0.008
 excitement-0.008
Token realized
Token ID6939
Feature activation+7.64e-03

Pos logit contributions
Deal+0.041
 if+0.038
 who+0.038
 but+0.034
pp+0.034
Neg logit contributions
lee-0.060
 tow-0.058
 delight-0.054
 excitement-0.052
 recess-0.050
Token that
Token ID326
Feature activation+2.49e-02

Pos logit contributions
Deal+0.009
 who+0.008
pp+0.008
mun+0.008
 if+0.007
Neg logit contributions
 tow-0.016
lee-0.015
 delight-0.014
 recess-0.014
 excitement-0.014
Token pic
Token ID8301
Feature activation-1.80e-02

Pos logit contributions
Deal+0.038
ows+0.035
pp+0.034
our+0.032
 who+0.031
Neg logit contributions
 tow-0.042
 delight-0.041
 excitement-0.040
 recess-0.038
lee-0.037
Tokenn
Token ID77
Feature activation-1.66e-02

Pos logit contributions
 tow+0.035
lee+0.032
 delight+0.030
 recess+0.030
 excitement+0.029
Neg logit contributions
 who-0.024
Deal-0.023
 if-0.023
 but-0.021
 without-0.021
Tokenics
Token ID873
Feature activation+8.80e-02

Pos logit contributions
 tow+0.035
lee+0.032
 recess+0.030
 delight+0.030
 excitement+0.029
Neg logit contributions
 who-0.020
Deal-0.019
 if-0.019
 but-0.017
 without-0.017
Token were
Token ID547
Feature activation+4.15e-03

Pos logit contributions
pp+0.109
Deal+0.109
 who+0.107
mun+0.102
iw+0.100
Neg logit contributions
 tow-0.158
lee-0.140
 recess-0.138
 delight-0.137
 excitement-0.135
Token not
Token ID407
Feature activation-2.94e-03

Pos logit contributions
 who+0.005
Deal+0.005
 if+0.005
 but+0.005
mun+0.004
Neg logit contributions
 tow-0.008
lee-0.008
 delight-0.007
 recess-0.007
 excitement-0.007
Token so
Token ID523
Feature activation+2.52e-02

Pos logit contributions
 tow+0.005
lee+0.004
 excitement+0.004
 delight+0.004
 recess+0.004
Neg logit contributions
Deal-0.005
 who-0.004
mun-0.004
 if-0.004
pp-0.004
Token scary
Token ID14343
Feature activation+2.20e-02

Pos logit contributions
 who+0.041
Deal+0.040
 if+0.039
 but+0.038
mun+0.037
Neg logit contributions
 tow-0.040
lee-0.037
 delight-0.034
 recess-0.034
 excitement-0.033
Token after
Token ID706
Feature activation-1.51e-02

Pos logit contributions
Deal+0.027
pp+0.026
mun+0.021
 who+0.021
iw+0.020
Neg logit contributions
lee-0.041
 recess-0.040
 tow-0.039
ened-0.038
 delight-0.038
Token other
Token ID584
Feature activation-1.24e-02

Pos logit contributions
Deal+0.040
 who+0.038
pp+0.037
 if+0.037
mun+0.036
Neg logit contributions
 tow-0.038
lee-0.034
 delight-0.033
 excitement-0.031
 recess-0.030
Token people
Token ID661
Feature activation-2.05e-02

Pos logit contributions
 tow+0.018
lee+0.016
 delight+0.016
 excitement+0.015
 recess+0.015
Neg logit contributions
Deal-0.022
 who-0.021
pp-0.021
 if-0.021
 without-0.020
Token having
Token ID1719
Feature activation-1.77e-02

Pos logit contributions
 tow+0.040
lee+0.038
 delight+0.036
 excitement+0.035
 response+0.034
Neg logit contributions
Deal-0.028
 if-0.021
mons-0.021
mun-0.020
 who-0.020
Token pic
Token ID8301
Feature activation-2.59e-02

Pos logit contributions
 tow+0.029
lee+0.027
 delight+0.026
 recess+0.025
 excitement+0.025
Neg logit contributions
 who-0.028
Deal-0.027
 if-0.027
 but-0.025
 without-0.025
Tokenn
Token ID77
Feature activation-2.22e-02

Pos logit contributions
 tow+0.054
lee+0.050
 delight+0.047
 recess+0.047
 excitement+0.046
Neg logit contributions
 who-0.030
Deal-0.029
 if-0.028
 but-0.027
 with-0.026
Tokenics
Token ID873
Feature activation+8.78e-02

Pos logit contributions
 tow+0.042
lee+0.041
 recess+0.036
ived+0.036
 delight+0.035
Neg logit contributions
Deal-0.030
 who-0.030
 if-0.027
 but-0.026
 without-0.025
Token.
Token ID13
Feature activation+5.30e-03

Pos logit contributions
Deal+0.095
mun+0.091
 who+0.083
pp+0.080
 if+0.076
Neg logit contributions
 tow-0.178
 recess-0.165
lee-0.161
 delight-0.157
 excitement-0.156
Token\n
Token ID198
Feature activation+6.92e-03

Pos logit contributions
 who+0.007
 if+0.007
Deal+0.007
 but+0.006
 without+0.006
Neg logit contributions
 tow-0.010
lee-0.009
 delight-0.009
 recess-0.009
 excitement-0.009
Token\n
Token ID198
Feature activation+6.56e-04

Pos logit contributions
 who+0.008
 if+0.007
Deal+0.007
 but+0.007
 with+0.007
Neg logit contributions
 tow-0.015
lee-0.014
 recess-0.013
 delight-0.013
 excitement-0.013
TokenJack
Token ID14295
Feature activation+1.37e-02

Pos logit contributions
 who+0.001
 if+0.001
Deal+0.001
 but+0.001
 without+0.001
Neg logit contributions
 tow-0.001
lee-0.001
 delight-0.001
 recess-0.001
 excitement-0.001
Token and
Token ID290
Feature activation+5.80e-03

Pos logit contributions
Deal+0.014
 who+0.013
 if+0.013
 but+0.011
 without+0.011
Neg logit contributions
 tow-0.030
lee-0.029
 delight-0.027
 excitement-0.026
 recess-0.026
Token to
Token ID284
Feature activation+1.58e-03

Pos logit contributions
Deal+0.008
 who+0.008
mun+0.007
pp+0.007
 if+0.007
Neg logit contributions
 tow-0.012
lee-0.012
 delight-0.011
 excitement-0.011
 farewell-0.011
Token make
Token ID787
Feature activation+1.40e-02

Pos logit contributions
 who+0.002
Deal+0.002
 if+0.002
mun+0.002
 but+0.002
Neg logit contributions
 tow-0.003
 delight-0.002
lee-0.002
 excitement-0.002
 recess-0.002
Token more
Token ID517
Feature activation-7.91e-03

Pos logit contributions
Deal+0.022
 who+0.021
pp+0.020
 if+0.020
 but+0.019
Neg logit contributions
 tow-0.023
 delight-0.022
lee-0.022
 excitement-0.021
 recess-0.020
Token pic
Token ID8301
Feature activation-2.84e-02

Pos logit contributions
 excitement+0.010
 tow+0.010
 delight+0.010
 anticipation+0.009
lee+0.009
Neg logit contributions
Deal-0.015
pp-0.014
 who-0.013
but-0.013
just-0.012
Tokenn
Token ID77
Feature activation-1.91e-02

Pos logit contributions
 tow+0.053
lee+0.048
 delight+0.045
 recess+0.045
 excitement+0.044
Neg logit contributions
 who-0.041
 if-0.039
Deal-0.039
 but-0.037
 without-0.036
Tokenics
Token ID873
Feature activation+8.50e-02

Pos logit contributions
 tow+0.031
lee+0.031
ived+0.028
 recess+0.026
lights+0.026
Neg logit contributions
Deal-0.033
 who-0.028
iw-0.027
 if-0.026
 but-0.025
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
Deal+0.092
mun+0.078
pp+0.075
 who+0.071
ouncing+0.071
Neg logit contributions
 tow-0.182
 delight-0.167
 excitement-0.165
 recess-0.162
lee-0.157
Token looked
Token ID3114
Feature activation-1.65e-02

Pos logit contributions
 tow+0.021
 delight+0.018
lee+0.018
 excitement+0.018
 recess+0.017
Neg logit contributions
 who-0.017
Deal-0.017
 if-0.016
mun-0.016
 but-0.016
Token in
Token ID287
Feature activation-1.01e-02

Pos logit contributions
 tow+0.036
lee+0.034
 delight+0.032
 recess+0.032
 excitement+0.032
Neg logit contributions
 who-0.017
Deal-0.017
 if-0.016
 but-0.015
mun-0.014
Token the
Token ID262
Feature activation+6.44e-03

Pos logit contributions
 tow+0.017
lee+0.016
 recess+0.014
ived+0.014
 delight+0.014
Neg logit contributions
 who-0.016
 if-0.016
 but-0.015
 with-0.015
Deal-0.015
Token fridge
Token ID25772
Feature activation+1.31e-02

Pos logit contributions
 who+0.011
 if+0.011
Deal+0.011
 but+0.010
 without+0.010
Neg logit contributions
 tow-0.010
lee-0.009
 delight-0.008
 recess-0.008
 excitement-0.008
Token She
Token ID1375
Feature activation-5.09e-03

Pos logit contributions
 who+0.012
Deal+0.012
 if+0.012
 but+0.011
 without+0.011
Neg logit contributions
 tow-0.019
lee-0.018
 delight-0.017
 recess-0.016
 excitement-0.016
Token had
Token ID550
Feature activation-1.34e-02

Pos logit contributions
 tow+0.009
lee+0.009
 delight+0.008
 excitement+0.007
 farewell+0.007
Neg logit contributions
Deal-0.007
pp-0.007
 if-0.007
 but-0.007
 who-0.007
Token heard
Token ID2982
Feature activation+1.03e-02

Pos logit contributions
 tow+0.028
lee+0.026
 delight+0.025
 recess+0.025
 excitement+0.024
Neg logit contributions
 who-0.016
 if-0.015
Deal-0.015
 but-0.014
 without-0.013
Token about
Token ID546
Feature activation-1.85e-02

Pos logit contributions
 who+0.014
 if+0.014
Deal+0.013
 but+0.013
 without+0.012
Neg logit contributions
 tow-0.020
lee-0.018
 recess-0.017
 delight-0.017
 excitement-0.016
Token att
Token ID708
Feature activation-3.27e-02

Pos logit contributions
 tow+0.032
 excitement+0.029
lee+0.029
 delight+0.029
 response+0.028
Neg logit contributions
Deal-0.029
pp-0.027
mun-0.024
ll-0.024
 who-0.023
Tokenics
Token ID873
Feature activation+8.28e-02

Pos logit contributions
 tow+0.061
lee+0.055
 delight+0.052
 recess+0.052
 excitement+0.051
Neg logit contributions
 who-0.047
 if-0.045
Deal-0.044
 but-0.042
 with-0.041
Token and
Token ID290
Feature activation-2.58e-02

Pos logit contributions
Deal+0.086
mun+0.083
pp+0.071
iw+0.071
obe+0.069
Neg logit contributions
 tow-0.170
lee-0.153
 recess-0.153
 response-0.151
 delight-0.150
Token wanted
Token ID2227
Feature activation-4.51e-02

Pos logit contributions
 tow+0.050
lee+0.047
 delight+0.044
 excitement+0.044
 recess+0.043
Neg logit contributions
Deal-0.035
 who-0.031
 if-0.030
pp-0.029
 but-0.028
Token to
Token ID284
Feature activation-1.78e-02

Pos logit contributions
 tow+0.096
lee+0.088
 delight+0.084
 recess+0.084
 excitement+0.083
Neg logit contributions
Deal-0.051
 who-0.051
 if-0.047
 but-0.044
mun-0.043
Token explore
Token ID7301
Feature activation-1.34e-02

Pos logit contributions
 tow+0.031
lee+0.027
 delight+0.027
 excitement+0.026
 recess+0.026
Neg logit contributions
 who-0.027
Deal-0.027
 if-0.026
 but-0.024
mun-0.024
Token one
Token ID530
Feature activation+1.04e-02

Pos logit contributions
 tow+0.022
lee+0.019
 delight+0.018
 recess+0.018
 excitement+0.018
Neg logit contributions
Deal-0.022
pp-0.020
ouncing-0.019
mun-0.019
 who-0.019
Token little
Token ID1310
Feature activation+2.43e-02

Pos logit contributions
 who+0.005
 if+0.004
Deal+0.004
 but+0.004
 without+0.004
Neg logit contributions
 tow-0.004
lee-0.004
 delight-0.003
 recess-0.003
 excitement-0.003
Token boy
Token ID2933
Feature activation+1.88e-02

Pos logit contributions
Deal+0.042
 who+0.038
 if+0.037
pp+0.035
mun+0.034
Neg logit contributions
 tow-0.040
 delight-0.034
lee-0.034
 excitement-0.033
 response-0.032
Token getting
Token ID1972
Feature activation-8.94e-03

Pos logit contributions
Deal+0.022
pp+0.016
mons+0.015
iders+0.014
 if+0.014
Neg logit contributions
 tow-0.043
lee-0.039
 delight-0.037
 response-0.035
 polar-0.035
Token a
Token ID257
Feature activation+4.63e-03

Pos logit contributions
 tow+0.017
lee+0.016
 recess+0.015
 delight+0.015
 farewell+0.015
Neg logit contributions
 who-0.012
 if-0.012
 with-0.011
Deal-0.011
 but-0.011
Token buzz
Token ID14713
Feature activation+7.99e-03

Pos logit contributions
 who+0.008
Deal+0.008
 if+0.007
 but+0.007
mun+0.007
Neg logit contributions
 tow-0.008
lee-0.006
 delight-0.006
 recess-0.006
 excitement-0.006
Tokencut
Token ID8968
Feature activation+8.24e-02

Pos logit contributions
Deal+0.011
 who+0.010
 if+0.009
mun+0.009
iw+0.008
Neg logit contributions
 tow-0.015
lee-0.014
ived-0.014
 delight-0.013
ened-0.013
Token and
Token ID290
Feature activation-1.65e-02

Pos logit contributions
Deal+0.114
 who+0.092
 if+0.091
mun+0.090
pp+0.089
Neg logit contributions
 tow-0.148
lee-0.139
ened-0.135
 response-0.131
 delight-0.128
Token a
Token ID257
Feature activation-1.26e-02

Pos logit contributions
 tow+0.029
lee+0.027
 delight+0.025
 excitement+0.025
 recess+0.025
Neg logit contributions
Deal-0.024
 who-0.024
 if-0.023
 but-0.022
 without-0.021
Token lady
Token ID10846
Feature activation+5.29e-03

Pos logit contributions
 tow+0.021
lee+0.018
 delight+0.017
 recess+0.017
 excitement+0.017
Neg logit contributions
 who-0.021
Deal-0.020
 if-0.020
 but-0.019
 without-0.018
Token getting
Token ID1972
Feature activation-2.91e-02

Pos logit contributions
Deal+0.007
 if+0.006
 who+0.006
 but+0.005
 without+0.005
Neg logit contributions
 tow-0.011
lee-0.010
 delight-0.009
 response-0.009
 excitement-0.009
Token a
Token ID257
Feature activation-5.45e-03

Pos logit contributions
 tow+0.058
lee+0.055
 recess+0.052
 delight+0.051
 excitement+0.049
Neg logit contributions
 who-0.037
 if-0.037
Deal-0.033
 with-0.033
 but-0.033
Token he
Token ID339
Feature activation-6.79e-03

Pos logit contributions
 who+0.001
 if+0.001
Deal+0.001
 but+0.001
 without+0.001
Neg logit contributions
 tow-0.002
lee-0.002
 delight-0.002
 recess-0.002
 excitement-0.002
Token wanted
Token ID2227
Feature activation-1.78e-02

Pos logit contributions
 tow+0.013
lee+0.012
 delight+0.011
 excitement+0.010
 recess+0.010
Neg logit contributions
Deal-0.009
 who-0.009
 if-0.009
 but-0.008
mun-0.008
Token to
Token ID284
Feature activation-8.82e-04

Pos logit contributions
 tow+0.041
lee+0.039
 delight+0.037
 recess+0.037
 excitement+0.036
Neg logit contributions
 who-0.018
 if-0.017
 but-0.015
Deal-0.015
 with-0.015
Token play
Token ID711
Feature activation-7.69e-03

Pos logit contributions
 tow+0.002
lee+0.001
 delight+0.001
 recess+0.001
 excitement+0.001
Neg logit contributions
 who-0.001
 if-0.001
Deal-0.001
 but-0.001
 without-0.001
Token '
Token ID705
Feature activation+4.61e-02

Pos logit contributions
 tow+0.018
 recess+0.016
 delight+0.016
 excitement+0.016
lee+0.016
Neg logit contributions
Deal-0.007
pp-0.006
mun-0.006
 who-0.006
ll-0.005
Tokencut
Token ID8968
Feature activation+8.19e-02

Pos logit contributions
 who+0.067
Deal+0.064
 if+0.064
 but+0.061
mun+0.058
Neg logit contributions
 tow-0.081
lee-0.072
 recess-0.070
 delight-0.068
 excitement-0.067
Token the
Token ID262
Feature activation+2.95e-03

Pos logit contributions
Deal+0.115
 who+0.101
 if+0.096
pp+0.093
iw+0.093
Neg logit contributions
lee-0.137
 tow-0.137
 delight-0.126
 response-0.125
ened-0.122
Token boat
Token ID8848
Feature activation+1.80e-02

Pos logit contributions
Deal+0.005
 who+0.005
 if+0.005
pp+0.005
 without+0.005
Neg logit contributions
 tow-0.004
 excitement-0.004
 delight-0.004
 recess-0.004
 response-0.003
Token'.
Token ID4458
Feature activation+1.85e-02

Pos logit contributions
Deal+0.020
mun+0.018
 who+0.017
iw+0.015
pp+0.015
Neg logit contributions
 tow-0.038
lee-0.033
 delight-0.033
 response-0.032
 farewell-0.032
Token The
Token ID383
Feature activation+1.69e-03

Pos logit contributions
Deal+0.030
 who+0.030
 if+0.029
pp+0.029
mun+0.029
Neg logit contributions
 tow-0.030
 delight-0.026
lee-0.026
 excitement-0.024
 farewell-0.023
Token little
Token ID1310
Feature activation+1.63e-02

Pos logit contributions
 who+0.003
 if+0.003
 but+0.003
 with+0.003
Deal+0.003
Neg logit contributions
 tow-0.003
lee-0.003
alks-0.002
 recess-0.002
ived-0.002
Token time
Token ID640
Feature activation+5.32e-03

Pos logit contributions
 if+0.025
 who+0.025
Deal+0.025
pp+0.025
mun+0.024
Neg logit contributions
 tow-0.009
 delight-0.006
lee-0.006
 excitement-0.006
 response-0.006
Token there
Token ID612
Feature activation+7.41e-03

Pos logit contributions
Deal+0.006
 who+0.006
 if+0.005
mun+0.005
pp+0.005
Neg logit contributions
 tow-0.011
lee-0.010
 delight-0.010
 recess-0.010
 excitement-0.010
Token was
Token ID373
Feature activation+1.68e-03

Pos logit contributions
 who+0.007
 if+0.007
Deal+0.007
 but+0.006
 without+0.006
Neg logit contributions
 tow-0.017
lee-0.016
 delight-0.015
 recess-0.015
 excitement-0.015
Token a
Token ID257
Feature activation-8.74e-04

Pos logit contributions
 who+0.002
Deal+0.002
 if+0.002
 but+0.002
 without+0.002
Neg logit contributions
 tow-0.004
lee-0.003
 delight-0.003
 recess-0.003
 excitement-0.003
Token wood
Token ID4898
Feature activation+2.31e-03

Pos logit contributions
 tow+0.001
lee+0.001
 delight+0.001
 recess+0.001
 excitement+0.001
Neg logit contributions
 who-0.002
 if-0.001
Deal-0.001
 but-0.001
 with-0.001
Tokencut
Token ID8968
Feature activation+8.17e-02

Pos logit contributions
Deal+0.003
 if+0.003
mun+0.003
 based+0.003
pp+0.003
Neg logit contributions
 tow-0.004
lee-0.004
 captain-0.004
 giant-0.004
 tentacles-0.004
Tokenter
Token ID353
Feature activation+1.31e-02

Pos logit contributions
Deal+0.110
 if+0.094
 who+0.090
mun+0.088
 without+0.088
Neg logit contributions
 tow-0.152
lee-0.146
 response-0.128
 delight-0.127
ened-0.127
Token.
Token ID13
Feature activation+2.94e-03

Pos logit contributions
Deal+0.017
idents+0.012
obe+0.012
mun+0.012
 based+0.011
Neg logit contributions
 tow-0.025
lee-0.024
 polar-0.023
 crew-0.022
 tentacles-0.022
Token He
Token ID679
Feature activation-1.78e-03

Pos logit contributions
 who+0.005
 if+0.005
Deal+0.005
 but+0.004
 without+0.004
Neg logit contributions
 tow-0.005
lee-0.004
 delight-0.004
 recess-0.004
 excitement-0.004
Token had
Token ID550
Feature activation+1.04e-03

Pos logit contributions
 tow+0.003
lee+0.003
 delight+0.003
 excitement+0.003
 recess+0.003
Neg logit contributions
 if-0.002
Deal-0.002
 who-0.002
 but-0.002
 without-0.002
Token an
Token ID281
Feature activation+1.84e-03

Pos logit contributions
 who+0.001
Deal+0.001
 if+0.001
 but+0.001
mun+0.001
Neg logit contributions
 tow-0.002
lee-0.002
 delight-0.002
 excitement-0.002
 recess-0.002
Token to
Token ID284
Feature activation+1.02e-02

Pos logit contributions
 tow+0.000
lee+0.000
 recess+0.000
 delight+0.000
 excitement+0.000
Neg logit contributions
Deal-0.000
 who-0.000
mun-0.000
 if-0.000
pp-0.000
Token have
Token ID423
Feature activation+7.38e-03

Pos logit contributions
 who+0.015
 if+0.014
Deal+0.014
 but+0.013
 with+0.013
Neg logit contributions
 tow-0.018
lee-0.017
 recess-0.016
 delight-0.016
 excitement-0.016
Token more
Token ID517
Feature activation-2.04e-03

Pos logit contributions
 who+0.008
 if+0.007
 with+0.007
 but+0.006
 without+0.006
Neg logit contributions
 tow-0.017
lee-0.016
alks-0.015
opher-0.015
 response-0.015
Token pic
Token ID8301
Feature activation-2.55e-02

Pos logit contributions
 tow+0.003
 delight+0.002
 excitement+0.002
lee+0.002
 recess+0.002
Neg logit contributions
 who-0.004
Deal-0.004
 if-0.004
pp-0.003
 but-0.003
Tokenn
Token ID77
Feature activation-1.55e-02

Pos logit contributions
 tow+0.051
lee+0.047
 delight+0.045
 recess+0.044
 excitement+0.043
Neg logit contributions
 who-0.032
Deal-0.031
 if-0.031
 but-0.029
 with-0.028
Tokenics
Token ID873
Feature activation+8.15e-02

Pos logit contributions
 tow+0.028
lee+0.028
 recess+0.025
ived+0.024
lights+0.024
Neg logit contributions
Deal-0.023
 who-0.021
 if-0.020
 but-0.019
iw-0.019
Token in
Token ID287
Feature activation-7.35e-03

Pos logit contributions
Deal+0.087
mun+0.084
ouncing+0.078
pp+0.076
 who+0.076
Neg logit contributions
 tow-0.162
 recess-0.157
lee-0.140
 delight-0.140
 excitement-0.139
Token the
Token ID262
Feature activation+7.32e-03

Pos logit contributions
 tow+0.013
lee+0.012
 delight+0.011
 accuracy+0.011
 excitement+0.011
Neg logit contributions
 who-0.011
 if-0.011
 with-0.010
 without-0.010
 but-0.010
Token future
Token ID2003
Feature activation-2.24e-02

Pos logit contributions
 who+0.010
Deal+0.009
 if+0.009
 but+0.009
 with+0.009
Neg logit contributions
 tow-0.014
lee-0.013
 delight-0.012
 recess-0.012
 excitement-0.012
Token.
Token ID13
Feature activation+2.34e-03

Pos logit contributions
 tow+0.048
lee+0.044
 recess+0.044
 delight+0.042
 excitement+0.041
Neg logit contributions
Deal-0.024
 who-0.023
 if-0.021
mun-0.021
pp-0.020
Token<|endoftext|>
Token ID50256
Feature activation+8.57e-03

Pos logit contributions
 who+0.002
 if+0.002
Deal+0.002
 but+0.002
 with+0.002
Neg logit contributions
 tow-0.005
lee-0.005
 delight-0.005
 recess-0.005
 response-0.005
Token Lily
Token ID20037
Feature activation-1.54e-02

Pos logit contributions
 tow+0.047
lee+0.043
 delight+0.040
 recess+0.040
 excitement+0.039
Neg logit contributions
 who-0.041
 if-0.039
Deal-0.039
 but-0.037
 without-0.036
Token who
Token ID508
Feature activation-2.46e-02

Pos logit contributions
 tow+0.029
lee+0.027
 recess+0.023
 response+0.023
 delight+0.023
Neg logit contributions
Deal-0.021
mun-0.018
pp-0.017
mons-0.017
iders-0.016
Token loved
Token ID6151
Feature activation-2.94e-03

Pos logit contributions
 tow+0.052
lee+0.047
 delight+0.045
 recess+0.045
 excitement+0.045
Neg logit contributions
 who-0.026
 if-0.026
 but-0.025
Deal-0.025
mun-0.023
Token pic
Token ID8301
Feature activation-2.01e-02

Pos logit contributions
 tow+0.007
lee+0.007
 delight+0.006
 recess+0.006
 excitement+0.006
Neg logit contributions
 who-0.003
 if-0.002
Deal-0.002
 with-0.002
 but-0.002
Tokenn
Token ID77
Feature activation-2.10e-02

Pos logit contributions
 tow+0.043
lee+0.040
 recess+0.038
 delight+0.037
 excitement+0.037
Neg logit contributions
 who-0.022
Deal-0.022
 if-0.020
 with-0.020
 but-0.020
Tokenics
Token ID873
Feature activation+8.15e-02

Pos logit contributions
 tow+0.038
lee+0.036
 recess+0.032
ived+0.031
 delight+0.031
Neg logit contributions
Deal-0.030
 who-0.028
 if-0.027
 but-0.026
iw-0.025
Token.
Token ID13
Feature activation+4.21e-03

Pos logit contributions
ouncing+0.132
mun+0.126
pp+0.124
Deal+0.123
ll+0.118
Neg logit contributions
 tow-0.104
 recess-0.097
 excitement-0.086
 delight-0.084
 response-0.084
Token She
Token ID1375
Feature activation-4.95e-03

Pos logit contributions
 who+0.004
Deal+0.004
 if+0.004
mun+0.003
 without+0.003
Neg logit contributions
 tow-0.010
lee-0.009
 delight-0.009
 recess-0.009
 excitement-0.008
Token would
Token ID561
Feature activation+2.28e-02

Pos logit contributions
 tow+0.009
lee+0.008
 delight+0.007
 recess+0.007
 excitement+0.007
Neg logit contributions
Deal-0.007
 who-0.007
 if-0.007
 but-0.006
 without-0.006
Token always
Token ID1464
Feature activation+2.51e-02

Pos logit contributions
Deal+0.037
 who+0.036
 if+0.036
pp+0.035
mun+0.034
Neg logit contributions
 tow-0.036
 delight-0.032
 excitement-0.031
 farewell-0.030
 response-0.030
Token pack
Token ID2353
Feature activation+2.35e-02

Pos logit contributions
but+0.042
mons+0.041
ouncing+0.039
pp+0.039
iders+0.039
Neg logit contributions
 tow-0.039
 delight-0.034
 excitement-0.034
 response-0.033
 anticipation-0.031
Token adventures
Token ID17545
Feature activation+1.31e-02

Pos logit contributions
 who+0.009
 if+0.009
Deal+0.009
 but+0.008
 without+0.008
Neg logit contributions
 tow-0.012
lee-0.011
 delight-0.010
 recess-0.010
 excitement-0.010
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
Deal+0.015
 yourself+0.013
pp+0.013
mage+0.013
atching+0.013
Neg logit contributions
 tow-0.024
 recess-0.023
 hours-0.023
 excitement-0.022
lee-0.022
Token had
Token ID550
Feature activation+1.11e-04

Pos logit contributions
Deal+0.016
 if+0.013
 who+0.013
?+0.013
/+0.012
Neg logit contributions
 tow-0.036
lee-0.034
 recess-0.031
 excitement-0.031
 delight-0.031
Token pic
Token ID8301
Feature activation-1.53e-02

Pos logit contributions
 who+0.000
 if+0.000
 but+0.000
Deal+0.000
 with+0.000
Neg logit contributions
 tow-0.000
lee-0.000
 recess-0.000
 delight-0.000
 response-0.000
Tokenn
Token ID77
Feature activation-9.45e-03

Pos logit contributions
 tow+0.032
lee+0.030
 recess+0.028
 delight+0.028
 excitement+0.027
Neg logit contributions
 who-0.018
Deal-0.017
 if-0.017
 but-0.016
 with-0.016
Tokenics
Token ID873
Feature activation+8.14e-02

Pos logit contributions
 tow+0.019
lee+0.018
 recess+0.016
 delight+0.016
 excitement+0.016
Neg logit contributions
Deal-0.012
 who-0.012
 if-0.011
 but-0.010
 without-0.010
Token,
Token ID11
Feature activation-4.11e-03

Pos logit contributions
Deal+0.069
mun+0.068
pp+0.062
our+0.061
obe+0.059
Neg logit contributions
 tow-0.178
 recess-0.163
 excitement-0.155
 delight-0.154
 farewell-0.151
Token and
Token ID290
Feature activation-3.54e-03

Pos logit contributions
 tow+0.010
lee+0.009
 excitement+0.009
 delight+0.009
 recess+0.009
Neg logit contributions
Deal-0.004
 if-0.003
 who-0.003
mun-0.003
obe-0.003
Token even
Token ID772
Feature activation-7.86e-03

Pos logit contributions
 tow+0.007
lee+0.006
 delight+0.006
 recess+0.006
 excitement+0.006
Neg logit contributions
 who-0.005
 if-0.005
Deal-0.005
 but-0.004
 without-0.004
Token danced
Token ID39480
Feature activation-1.36e-02

Pos logit contributions
 tow+0.014
lee+0.013
 recess+0.012
 delight+0.012
 excitement+0.012
Neg logit contributions
Deal-0.011
 who-0.011
 if-0.010
 but-0.010
mun-0.010
Token together
Token ID1978
Feature activation+5.55e-04

Pos logit contributions
 tow+0.031
lee+0.029
 recess+0.029
 delight+0.028
 excitement+0.028
Neg logit contributions
Deal-0.012
 who-0.011
 if-0.010
mun-0.010
iw-0.009
Token time
Token ID640
Feature activation+5.03e-03

Pos logit contributions
 who+0.021
 if+0.021
pp+0.021
Deal+0.021
mun+0.021
Neg logit contributions
 tow-0.008
 delight-0.005
lee-0.005
 excitement-0.005
 response-0.005
Token there
Token ID612
Feature activation+6.38e-03

Pos logit contributions
Deal+0.006
 who+0.006
mun+0.005
pp+0.005
 if+0.005
Neg logit contributions
 tow-0.010
lee-0.009
 recess-0.009
 delight-0.009
 excitement-0.009
Token was
Token ID373
Feature activation-2.08e-04

Pos logit contributions
 who+0.006
 if+0.005
Deal+0.005
 but+0.005
 with+0.005
Neg logit contributions
 tow-0.015
lee-0.014
 recess-0.014
 delight-0.014
 excitement-0.013
Token a
Token ID257
Feature activation-2.00e-03

Pos logit contributions
 tow+0.000
lee+0.000
 delight+0.000
 recess+0.000
 excitement+0.000
Neg logit contributions
 who-0.000
Deal-0.000
 if-0.000
 but-0.000
 without-0.000
Token wood
Token ID4898
Feature activation+2.75e-03

Pos logit contributions
 tow+0.003
lee+0.003
 recess+0.003
 delight+0.003
 excitement+0.003
Neg logit contributions
 who-0.003
 if-0.003
Deal-0.003
 but-0.003
 with-0.003
Tokencut
Token ID8968
Feature activation+8.06e-02

Pos logit contributions
Deal+0.004
 if+0.003
mun+0.003
pp+0.003
 based+0.003
Neg logit contributions
 tow-0.005
lee-0.005
 captain-0.004
 bullet-0.004
 crew-0.004
Tokenter
Token ID353
Feature activation+1.30e-02

Pos logit contributions
Deal+0.110
 if+0.092
 who+0.089
mun+0.088
 without+0.087
Neg logit contributions
 tow-0.148
lee-0.143
ened-0.124
 response-0.124
 bullet-0.123
Token who
Token ID508
Feature activation-1.25e-02

Pos logit contributions
Deal+0.016
idents+0.012
obe+0.011
iders+0.011
mun+0.011
Neg logit contributions
 tow-0.026
lee-0.024
 polar-0.023
 crew-0.022
 tentacles-0.022
Token was
Token ID373
Feature activation-6.59e-03

Pos logit contributions
 tow+0.020
lee+0.017
ened+0.015
 delight+0.015
 excitement+0.015
Neg logit contributions
 but-0.019
Deal-0.019
but-0.018
 if-0.018
 who-0.018
Token strong
Token ID1913
Feature activation-1.26e-02

Pos logit contributions
 tow+0.013
lee+0.011
 July+0.011
 delight+0.011
 excitement+0.011
Neg logit contributions
Deal-0.009
 if-0.008
 who-0.008
mun-0.007
pp-0.007
Token and
Token ID290
Feature activation-7.69e-03

Pos logit contributions
 tow+0.027
lee+0.024
 delight+0.024
 excitement+0.023
ened+0.023
Neg logit contributions
Deal-0.015
mun-0.012
pp-0.012
but-0.011
mons-0.011
Token of
Token ID286
Feature activation-1.68e-02

Pos logit contributions
 tow+0.051
lee+0.046
 recess+0.043
 delight+0.043
 excitement+0.041
Neg logit contributions
 who-0.054
 if-0.051
Deal-0.050
 but-0.049
 with-0.048
Token people
Token ID661
Feature activation-2.82e-02

Pos logit contributions
 tow+0.025
lee+0.023
 delight+0.021
 excitement+0.021
 recess+0.021
Neg logit contributions
 who-0.029
Deal-0.029
 if-0.028
 without-0.027
 but-0.027
Token having
Token ID1719
Feature activation-2.79e-02

Pos logit contributions
 tow+0.054
lee+0.052
 excitement+0.046
 delight+0.046
 recess+0.045
Neg logit contributions
Deal-0.041
pp-0.029
iw-0.029
 if-0.028
obe-0.027
Token pic
Token ID8301
Feature activation-3.10e-02

Pos logit contributions
 tow+0.048
lee+0.044
 delight+0.041
 recess+0.041
 excitement+0.039
Neg logit contributions
 who-0.044
 if-0.042
Deal-0.041
 but-0.040
 without-0.038
Tokenn
Token ID77
Feature activation-3.11e-02

Pos logit contributions
 tow+0.065
lee+0.060
 delight+0.057
 recess+0.056
 excitement+0.055
Neg logit contributions
 who-0.035
Deal-0.034
 if-0.033
 but-0.032
 with-0.031
Tokenics
Token ID873
Feature activation+8.01e-02

Pos logit contributions
 tow+0.063
lee+0.060
 recess+0.055
 delight+0.054
 excitement+0.053
Neg logit contributions
Deal-0.039
 who-0.038
 if-0.036
 but-0.034
 without-0.032
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
Deal+0.095
mun+0.092
pp+0.086
iw+0.080
 who+0.078
Neg logit contributions
 tow-0.150
 recess-0.142
lee-0.133
 excitement-0.131
 delight-0.127
Token walking
Token ID6155
Feature activation-3.14e-02

Pos logit contributions
 tow+0.040
lee+0.037
 delight+0.035
 recess+0.035
 excitement+0.034
Neg logit contributions
 who-0.022
 if-0.021
Deal-0.021
 but-0.019
 without-0.019
Token their
Token ID511
Feature activation-2.05e-02

Pos logit contributions
 tow+0.069
lee+0.064
 delight+0.061
 recess+0.061
 excitement+0.060
Neg logit contributions
 who-0.034
Deal-0.032
 if-0.032
 but-0.029
 without-0.028
Token dogs
Token ID6844
Feature activation-3.04e-02

Pos logit contributions
 tow+0.035
lee+0.032
 delight+0.030
 recess+0.029
 excitement+0.028
Neg logit contributions
 who-0.033
 if-0.031
Deal-0.031
 but-0.030
 with-0.029
Token.
Token ID13
Feature activation-4.87e-04

Pos logit contributions
 tow+0.064
lee+0.059
 recess+0.057
 delight+0.055
 excitement+0.055
Neg logit contributions
Deal-0.034
 if-0.027
 who-0.027
mun-0.026
pp-0.026
Token wants
Token ID3382
Feature activation-4.38e-02

Pos logit contributions
 who+0.017
 if+0.015
 with+0.014
Deal+0.014
 but+0.014
Neg logit contributions
 tow-0.036
lee-0.034
 recess-0.033
 delight-0.032
ived-0.031
Token a
Token ID257
Feature activation-2.42e-02

Pos logit contributions
 tow+0.095
lee+0.088
 delight+0.084
 recess+0.084
 excitement+0.082
Neg logit contributions
 who-0.048
Deal-0.046
 if-0.045
 but-0.042
 without-0.040
Token ring
Token ID5858
Feature activation-9.19e-03

Pos logit contributions
 tow+0.038
lee+0.035
 delight+0.032
 recess+0.032
 excitement+0.031
Neg logit contributions
 who-0.041
 if-0.040
Deal-0.039
 but-0.038
 without-0.037
Token like
Token ID588
Feature activation-1.22e-02

Pos logit contributions
 tow+0.020
 response+0.017
lee+0.017
 delight+0.017
 recess+0.017
Neg logit contributions
mun-0.010
Deal-0.010
 who-0.009
 if-0.008
obe-0.008
Token the
Token ID262
Feature activation-1.72e-02

Pos logit contributions
 tow+0.026
lee+0.024
 delight+0.023
 recess+0.023
 excitement+0.022
Neg logit contributions
 who-0.015
 if-0.014
Deal-0.013
 but-0.013
 with-0.012
Token ear
Token ID1027
Feature activation-1.08e-01

Pos logit contributions
 tow+0.026
 response+0.021
 recess+0.021
 delight+0.021
 excitement+0.020
Neg logit contributions
Deal-0.030
 who-0.028
 if-0.028
ouncing-0.027
mun-0.027
Token on
Token ID319
Feature activation-3.49e-02

Pos logit contributions
 tow+0.218
lee+0.194
 response+0.185
 delight+0.184
 recess+0.182
Neg logit contributions
Deal-0.135
mun-0.122
obe-0.110
 who-0.109
 if-0.101
Token TV
Token ID3195
Feature activation+3.37e-03

Pos logit contributions
 tow+0.060
lee+0.056
 recess+0.051
 delight+0.051
 excitement+0.049
Neg logit contributions
 who-0.055
 if-0.053
Deal-0.050
 but-0.050
 with-0.049
Token.
Token ID13
Feature activation-9.20e-03

Pos logit contributions
Deal+0.004
mun+0.004
 who+0.003
pp+0.003
 if+0.003
Neg logit contributions
 tow-0.007
 recess-0.006
 delight-0.006
lee-0.006
 excitement-0.006
Token She
Token ID1375
Feature activation-8.53e-03

Pos logit contributions
 tow+0.019
lee+0.017
 delight+0.016
 recess+0.015
 excitement+0.015
Neg logit contributions
Deal-0.011
 who-0.011
 if-0.010
mun-0.010
pp-0.010
Token thinks
Token ID6834
Feature activation-2.12e-02

Pos logit contributions
 tow+0.015
lee+0.015
 delight+0.014
 excitement+0.013
 farewell+0.013
Neg logit contributions
Deal-0.012
 if-0.011
 who-0.011
pp-0.011
but-0.011
Token alone
Token ID3436
Feature activation+2.85e-03

Pos logit contributions
 tow+0.015
 excitement+0.015
 farewell+0.015
 delight+0.014
ened+0.014
Neg logit contributions
Deal-0.011
mun-0.011
obe-0.011
pp-0.011
/-0.010
Token and
Token ID290
Feature activation-5.72e-03

Pos logit contributions
Deal+0.004
mun+0.004
pp+0.004
iw+0.004
ll+0.003
Neg logit contributions
 hours-0.004
ened-0.004
 recess-0.004
 excitement-0.004
 tow-0.004
Token always
Token ID1464
Feature activation-2.84e-03

Pos logit contributions
 tow+0.009
lee+0.009
 excitement+0.009
 delight+0.009
 recess+0.008
Neg logit contributions
Deal-0.009
 who-0.008
pp-0.008
mun-0.008
 if-0.008
Token counted
Token ID14789
Feature activation-4.58e-02

Pos logit contributions
 tow+0.005
lee+0.005
 excitement+0.004
 delight+0.004
 recess+0.004
Neg logit contributions
Deal-0.004
 who-0.004
 if-0.004
pp-0.004
mun-0.004
Token the
Token ID262
Feature activation-2.29e-02

Pos logit contributions
 tow+0.091
lee+0.086
 delight+0.081
 recess+0.079
 excitement+0.079
Neg logit contributions
Deal-0.057
 who-0.054
mun-0.051
 if-0.050
pp-0.047
Token days
Token ID1528
Feature activation-1.01e-01

Pos logit contributions
 tow+0.038
lee+0.034
 delight+0.032
 recess+0.031
 excitement+0.030
Neg logit contributions
 who-0.038
 if-0.037
Deal-0.036
 but-0.035
 without-0.034
Token until
Token ID1566
Feature activation-3.18e-02

Pos logit contributions
 tow+0.233
lee+0.219
 recess+0.208
 delight+0.204
 excitement+0.200
Neg logit contributions
Deal-0.096
 who-0.091
 if-0.084
mun-0.080
 but-0.077
Token something
Token ID1223
Feature activation-2.75e-02

Pos logit contributions
 tow+0.080
lee+0.076
 delight+0.071
 recess+0.070
 response+0.069
Neg logit contributions
 who-0.025
 if-0.024
Deal-0.022
 but-0.021
 without-0.020
Token special
Token ID2041
Feature activation-1.94e-02

Pos logit contributions
 tow+0.051
lee+0.047
 recess+0.045
 delight+0.044
 July+0.043
Neg logit contributions
 who-0.040
 if-0.038
Deal-0.037
 but-0.037
 with-0.036
Token happened
Token ID3022
Feature activation-1.82e-02

Pos logit contributions
 tow+0.033
lee+0.030
 delight+0.028
 recess+0.027
 excitement+0.027
Neg logit contributions
 who-0.031
 if-0.030
Deal-0.030
 but-0.028
 without-0.028
Token.
Token ID13
Feature activation-8.77e-04

Pos logit contributions
 tow+0.034
lee+0.033
 recess+0.030
 response+0.029
 farewell+0.029
Neg logit contributions
Deal-0.024
mun-0.022
 who-0.021
pp-0.019
iw-0.019
Token days
Token ID1528
Feature activation-9.80e-02

Pos logit contributions
 tow+0.036
lee+0.033
 delight+0.031
 recess+0.030
 excitement+0.030
Neg logit contributions
 who-0.036
Deal-0.035
 if-0.035
 but-0.033
 without-0.032
Token were
Token ID547
Feature activation-1.29e-02

Pos logit contributions
 tow+0.189
lee+0.182
 recess+0.165
 delight+0.163
 excitement+0.160
Neg logit contributions
Deal-0.125
 who-0.110
mun-0.103
 if-0.100
 but-0.096
Token sunny
Token ID27737
Feature activation-1.51e-02

Pos logit contributions
 tow+0.027
lee+0.026
 delight+0.024
 recess+0.024
 excitement+0.024
Neg logit contributions
Deal-0.015
 who-0.014
 if-0.013
mun-0.012
 but-0.012
Token and
Token ID290
Feature activation-2.57e-02

Pos logit contributions
 tow+0.030
lee+0.029
 excitement+0.028
 delight+0.028
 recess+0.028
Neg logit contributions
Deal-0.018
pp-0.016
 who-0.015
 based-0.014
 if-0.014
Token some
Token ID617
Feature activation-1.72e-02

Pos logit contributions
 tow+0.041
lee+0.038
 delight+0.036
 excitement+0.035
 recess+0.033
Neg logit contributions
 who-0.042
Deal-0.042
 if-0.041
 but-0.038
 without-0.038
Token days
Token ID1528
Feature activation-9.95e-02

Pos logit contributions
 tow+0.028
lee+0.025
 delight+0.024
 recess+0.023
 excitement+0.023
Neg logit contributions
 who-0.028
 if-0.027
Deal-0.027
 but-0.026
 without-0.025
Token were
Token ID547
Feature activation-1.11e-02

Pos logit contributions
 tow+0.207
lee+0.200
 delight+0.180
 recess+0.178
 excitement+0.177
Neg logit contributions
Deal-0.114
 who-0.099
 if-0.094
mun-0.091
 but-0.089
Token rainy
Token ID37259
Feature activation-2.75e-02

Pos logit contributions
 tow+0.023
lee+0.022
 delight+0.020
 recess+0.020
 excitement+0.020
Neg logit contributions
 who-0.013
Deal-0.012
 if-0.012
 but-0.012
 without-0.011
Token.
Token ID13
Feature activation-7.35e-03

Pos logit contributions
lights+0.046
 recess+0.045
lee+0.044
 tow+0.043
 excitement+0.042
Neg logit contributions
Deal-0.037
 who-0.035
pp-0.035
 if-0.033
iw-0.032
Token Her
Token ID2332
Feature activation+1.49e-02

Pos logit contributions
 tow+0.014
lee+0.013
 delight+0.013
 recess+0.012
 excitement+0.012
Neg logit contributions
 who-0.010
Deal-0.009
 if-0.009
 but-0.009
 without-0.008
Token mum
Token ID25682
Feature activation+1.68e-02

Pos logit contributions
 who+0.031
 if+0.030
Deal+0.029
 but+0.029
 with+0.028
Neg logit contributions
 tow-0.019
lee-0.016
 recess-0.015
 delight-0.014
 excitement-0.014
Token the
Token ID262
Feature activation+1.65e-02

Feature not active on prev token; no predictions were affected.
Token summer
Token ID3931
Feature activation-1.45e-02

Pos logit contributions
Deal+0.029
 who+0.028
 if+0.028
mun+0.026
pp+0.026
Neg logit contributions
 tow-0.025
lee-0.021
 delight-0.021
 excitement-0.021
 recess-0.020
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
 tow+0.025
lee+0.024
 delight+0.022
 recess+0.022
 excitement+0.022
Neg logit contributions
Deal-0.022
 who-0.021
 if-0.020
mun-0.019
pp-0.018
Token why
Token ID1521
Feature activation-3.33e-02

Pos logit contributions
 tow+0.026
lee+0.025
 delight+0.025
 excitement+0.025
 anticipation+0.023
Neg logit contributions
Deal-0.025
 who-0.023
pp-0.023
 if-0.022
mun-0.022
Token some
Token ID617
Feature activation-2.23e-02

Pos logit contributions
 tow+0.052
lee+0.047
 delight+0.044
 excitement+0.043
 recess+0.042
Neg logit contributions
Deal-0.056
 who-0.053
mun-0.050
 if-0.050
pp-0.049
Token days
Token ID1528
Feature activation-9.80e-02

Pos logit contributions
 tow+0.036
lee+0.033
 delight+0.031
 recess+0.030
 excitement+0.030
Neg logit contributions
 who-0.036
Deal-0.035
 if-0.035
 but-0.033
 without-0.032
Token were
Token ID547
Feature activation-1.29e-02

Pos logit contributions
 tow+0.189
lee+0.182
 recess+0.165
 delight+0.163
 excitement+0.160
Neg logit contributions
Deal-0.125
 who-0.110
mun-0.103
 if-0.100
 but-0.096
Token sunny
Token ID27737
Feature activation-1.51e-02

Pos logit contributions
 tow+0.027
lee+0.026
 delight+0.024
 recess+0.024
 excitement+0.024
Neg logit contributions
Deal-0.015
 who-0.014
 if-0.013
mun-0.012
 but-0.012
Token and
Token ID290
Feature activation-2.57e-02

Pos logit contributions
 tow+0.030
lee+0.029
 excitement+0.028
 delight+0.028
 recess+0.028
Neg logit contributions
Deal-0.018
pp-0.016
 who-0.015
 based-0.014
 if-0.014
Token some
Token ID617
Feature activation-1.72e-02

Pos logit contributions
 tow+0.041
lee+0.038
 delight+0.036
 excitement+0.035
 recess+0.033
Neg logit contributions
 who-0.042
Deal-0.042
 if-0.041
 but-0.038
 without-0.038
Token days
Token ID1528
Feature activation-9.95e-02

Pos logit contributions
 tow+0.028
lee+0.025
 delight+0.024
 recess+0.023
 excitement+0.023
Neg logit contributions
 who-0.028
 if-0.027
Deal-0.027
 but-0.026
 without-0.025
Token and
Token ID290
Feature activation-8.50e-03

Pos logit contributions
Deal+0.023
mun+0.022
pp+0.019
 who+0.018
obe+0.018
Neg logit contributions
 tow-0.040
lee-0.036
 recess-0.035
 delight-0.034
 excitement-0.033
Token pay
Token ID1414
Feature activation+2.65e-03

Pos logit contributions
 tow+0.014
 delight+0.011
 recess+0.011
 excitement+0.011
lee+0.011
Neg logit contributions
Deal-0.014
pp-0.013
mun-0.013
 who-0.013
 if-0.012
Token on
Token ID319
Feature activation-1.61e-02

Pos logit contributions
Deal+0.003
 who+0.003
mun+0.003
pp+0.002
 if+0.002
Neg logit contributions
 tow-0.006
lee-0.005
 recess-0.005
 delight-0.005
 excitement-0.005
Token the
Token ID262
Feature activation-1.78e-03

Pos logit contributions
 tow+0.030
lee+0.028
 delight+0.026
 recess+0.026
 response+0.025
Neg logit contributions
 who-0.024
 if-0.022
 but-0.021
Deal-0.021
 with-0.021
Token enormous
Token ID9812
Feature activation-1.95e-02

Pos logit contributions
 tow+0.003
lee+0.003
 delight+0.003
 excitement+0.002
 recess+0.002
Neg logit contributions
 who-0.003
Deal-0.003
 if-0.003
 without-0.003
 but-0.003
Token ear
Token ID1027
Feature activation-9.78e-02

Pos logit contributions
 tow+0.036
lee+0.033
 delight+0.031
 recess+0.031
 excitement+0.030
Neg logit contributions
 who-0.028
 if-0.027
Deal-0.027
 but-0.025
 without-0.025
Token we
Token ID356
Feature activation-5.66e-03

Pos logit contributions
 tow+0.199
lee+0.180
 recess+0.169
 delight+0.168
ived+0.164
Neg logit contributions
Deal-0.120
mun-0.111
 who-0.101
obe-0.094
 if-0.093
Token will
Token ID481
Feature activation+1.27e-02

Pos logit contributions
 tow+0.011
lee+0.010
 recess+0.010
 delight+0.010
 excitement+0.009
Neg logit contributions
 who-0.007
Deal-0.007
 if-0.007
 but-0.006
 without-0.006
Token find
Token ID1064
Feature activation-3.56e-03

Pos logit contributions
Deal+0.019
 who+0.018
mun+0.017
 if+0.017
pp+0.016
Neg logit contributions
 tow-0.024
lee-0.020
 delight-0.020
 recess-0.019
 excitement-0.019
Token!"
Token ID2474
Feature activation-6.35e-03

Pos logit contributions
 tow+0.007
lee+0.007
 delight+0.006
 recess+0.006
 excitement+0.006
Neg logit contributions
Deal-0.004
 who-0.004
mun-0.004
 if-0.004
 but-0.004
Token Tom
Token ID4186
Feature activation+1.13e-03

Pos logit contributions
 tow+0.009
lee+0.008
 delight+0.008
 excitement+0.008
 recess+0.007
Neg logit contributions
pp-0.011
Deal-0.011
 who-0.011
 if-0.010
mun-0.010
Token the
Token ID262
Feature activation+6.97e-03

Pos logit contributions
 who+0.044
Deal+0.043
 if+0.041
 but+0.039
mun+0.039
Neg logit contributions
 tow-0.041
lee-0.037
 delight-0.035
 recess-0.034
 excitement-0.034
Token ball
Token ID2613
Feature activation+3.12e-02

Pos logit contributions
 who+0.010
Deal+0.010
 if+0.010
mun+0.009
pp+0.009
Neg logit contributions
 tow-0.013
lee-0.011
 delight-0.011
 excitement-0.011
 response-0.011
Token and
Token ID290
Feature activation-3.20e-03

Pos logit contributions
Deal+0.031
mun+0.028
 who+0.026
pp+0.024
 if+0.022
Neg logit contributions
 tow-0.072
lee-0.064
 delight-0.064
ived-0.063
 excitement-0.062
Token rubbed
Token ID31862
Feature activation-1.36e-02

Pos logit contributions
 tow+0.005
lee+0.005
 delight+0.005
 recess+0.005
 excitement+0.004
Neg logit contributions
 who-0.005
Deal-0.005
 if-0.005
 but-0.005
 without-0.005
Token the
Token ID262
Feature activation-8.59e-03

Pos logit contributions
 tow+0.022
lee+0.020
 delight+0.019
 excitement+0.018
 recess+0.018
Neg logit contributions
 who-0.022
Deal-0.021
 if-0.020
mun-0.019
 but-0.019
Token ear
Token ID1027
Feature activation-9.75e-02

Pos logit contributions
 tow+0.016
lee+0.014
 delight+0.014
 excitement+0.014
 response+0.013
Neg logit contributions
 who-0.012
 if-0.012
Deal-0.012
pp-0.011
mun-0.011
Token on
Token ID319
Feature activation-2.18e-02

Pos logit contributions
 tow+0.204
lee+0.186
 delight+0.177
 response+0.172
 recess+0.172
Neg logit contributions
Deal-0.116
 who-0.105
mun-0.104
 if-0.098
 but-0.088
Token the
Token ID262
Feature activation+1.94e-03

Pos logit contributions
 tow+0.038
lee+0.035
 delight+0.032
 recess+0.032
 excitement+0.031
Neg logit contributions
 who-0.034
 if-0.033
Deal-0.033
 but-0.031
 without-0.030
Token ground
Token ID2323
Feature activation-9.10e-03

Pos logit contributions
 who+0.003
 if+0.003
Deal+0.003
 but+0.003
 without+0.003
Neg logit contributions
 tow-0.003
lee-0.003
 delight-0.003
 recess-0.003
 excitement-0.003
Token.
Token ID13
Feature activation+9.13e-03

Pos logit contributions
 tow+0.018
 delight+0.016
lee+0.016
 excitement+0.016
 response+0.015
Neg logit contributions
Deal-0.012
mun-0.011
 who-0.010
pp-0.010
 if-0.009
Token The
Token ID383
Feature activation+3.04e-03

Pos logit contributions
 who+0.012
Deal+0.012
 if+0.012
 but+0.011
 without+0.011
Neg logit contributions
 tow-0.018
lee-0.016
 delight-0.015
 excitement-0.015
 recess-0.015
Token the
Token ID262
Feature activation-2.80e-02

Pos logit contributions
 tow+0.039
lee+0.036
lights+0.036
 delight+0.035
 recess+0.035
Neg logit contributions
 if-0.028
 who-0.027
 with-0.027
 but-0.024
 without-0.023
Token reply
Token ID10971
Feature activation-3.22e-02

Pos logit contributions
 tow+0.048
lee+0.046
 recess+0.041
 delight+0.041
 excitement+0.040
Neg logit contributions
 who-0.045
 if-0.041
Deal-0.041
 but-0.040
 with-0.039
Token from
Token ID422
Feature activation-2.19e-02

Pos logit contributions
 tow+0.083
lee+0.078
 recess+0.075
 delight+0.075
 excitement+0.073
Neg logit contributions
 who-0.024
 if-0.023
 but-0.019
 with-0.019
Deal-0.019
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
lee+0.037
 tow+0.037
ived+0.035
text+0.034
alks+0.034
Neg logit contributions
 who-0.036
 if-0.034
 with-0.033
 but-0.031
 without-0.030
Token other
Token ID584
Feature activation-4.73e-02

Pos logit contributions
 tow+0.025
lee+0.023
 delight+0.022
 recess+0.022
 excitement+0.021
Neg logit contributions
 who-0.021
 if-0.020
Deal-0.020
 but-0.019
 without-0.019
Token ear
Token ID1027
Feature activation-9.67e-02

Pos logit contributions
 tow+0.082
lee+0.074
 delight+0.069
 recess+0.069
 excitement+0.067
Neg logit contributions
 who-0.074
Deal-0.071
 if-0.071
 but-0.067
 without-0.065
Token.
Token ID13
Feature activation-1.94e-02

Pos logit contributions
 tow+0.203
lee+0.184
 recess+0.176
 delight+0.175
ived+0.172
Neg logit contributions
Deal-0.111
mun-0.097
 who-0.089
obe-0.084
 if-0.083
Token\n
Token ID198
Feature activation-1.63e-02

Pos logit contributions
 tow+0.041
lee+0.039
 recess+0.037
 delight+0.037
 excitement+0.036
Neg logit contributions
 who-0.022
 if-0.021
Deal-0.020
 but-0.020
 with-0.019
Token\n
Token ID198
Feature activation-1.82e-02

Pos logit contributions
lee+0.040
 tow+0.039
 recess+0.039
 delight+0.038
ened+0.038
Neg logit contributions
 who-0.014
 with-0.011
 if-0.011
 but-0.010
Deal-0.009
TokenJohn
Token ID7554
Feature activation-1.94e-02

Pos logit contributions
lee+0.030
 tow+0.029
 recess+0.028
alks+0.026
 delight+0.026
Neg logit contributions
 who-0.032
 if-0.029
 with-0.028
Deal-0.028
 but-0.027
Token was
Token ID373
Feature activation-3.46e-03

Pos logit contributions
 tow+0.039
lee+0.037
 delight+0.035
 recess+0.034
 excitement+0.033
Neg logit contributions
Deal-0.024
 who-0.023
 if-0.023
 but-0.021
 without-0.020
Token and
Token ID290
Feature activation-4.36e-03

Pos logit contributions
Deal+0.012
mun+0.011
You+0.010
pp+0.009
w+0.009
Neg logit contributions
 tow-0.022
 recess-0.020
 delight-0.019
lee-0.019
 excitement-0.019
Token found
Token ID1043
Feature activation-6.53e-03

Pos logit contributions
 tow+0.008
lee+0.007
 delight+0.007
 recess+0.007
 excitement+0.006
Neg logit contributions
 who-0.006
Deal-0.006
 if-0.006
 but-0.006
 without-0.006
Token the
Token ID262
Feature activation-1.54e-02

Pos logit contributions
 tow+0.014
lee+0.013
 delight+0.012
 recess+0.012
 excitement+0.012
Neg logit contributions
 who-0.008
Deal-0.007
 if-0.007
 but-0.007
 without-0.007
Token coin
Token ID10752
Feature activation-1.28e-02

Pos logit contributions
 tow+0.023
 delight+0.019
 excitement+0.018
 response+0.017
 pride+0.017
Neg logit contributions
 who-0.027
Deal-0.027
 if-0.027
mun-0.027
pp-0.026
Token's
Token ID338
Feature activation-2.28e-03

Pos logit contributions
 tow+0.026
lee+0.024
 delight+0.021
lights+0.021
 recess+0.021
Neg logit contributions
Deal-0.016
mun-0.015
obe-0.013
 who-0.012
 if-0.012
Token rightful
Token ID43133
Feature activation-9.65e-02

Pos logit contributions
 tow+0.004
lee+0.003
 delight+0.003
 excitement+0.003
 recess+0.003
Neg logit contributions
 who-0.004
Deal-0.004
 if-0.004
mun-0.003
 but-0.003
Token place
Token ID1295
Feature activation-1.67e-02

Pos logit contributions
 tow+0.162
lee+0.147
 delight+0.137
 recess+0.136
 excitement+0.132
Neg logit contributions
 who-0.155
 if-0.149
Deal-0.148
 but-0.141
 without-0.137
Token.
Token ID13
Feature activation-5.39e-03

Pos logit contributions
 tow+0.035
lee+0.032
 delight+0.029
 excitement+0.029
 recess+0.029
Neg logit contributions
Deal-0.021
mun-0.019
 who-0.018
 if-0.016
pp-0.016
Token Lily
Token ID20037
Feature activation+2.23e-02

Pos logit contributions
 tow+0.009
lee+0.009
 delight+0.008
 excitement+0.008
 recess+0.008
Neg logit contributions
 who-0.008
Deal-0.008
 if-0.008
mun-0.008
 but-0.007
Token was
Token ID373
Feature activation+4.40e-03

Pos logit contributions
Deal+0.028
 who+0.027
 if+0.027
 but+0.024
mun+0.024
Neg logit contributions
 tow-0.044
lee-0.042
 delight-0.039
 excitement-0.038
 farewell-0.036
Token happy
Token ID3772
Feature activation-3.14e-03

Pos logit contributions
Deal+0.005
 who+0.005
 if+0.005
mun+0.005
 but+0.005
Neg logit contributions
 tow-0.009
lee-0.008
 delight-0.008
 excitement-0.008
 recess-0.008
Token came
Token ID1625
Feature activation-2.44e-03

Pos logit contributions
 tow+0.026
lee+0.024
 delight+0.021
 recess+0.021
 excitement+0.021
Neg logit contributions
Deal-0.023
 if-0.020
 who-0.020
 but-0.019
 without-0.018
Token to
Token ID284
Feature activation-1.13e-02

Pos logit contributions
 tow+0.005
lee+0.004
 response+0.004
 farewell+0.004
 delight+0.004
Neg logit contributions
Deal-0.004
 who-0.003
 if-0.003
mun-0.003
pp-0.003
Token see
Token ID766
Feature activation+1.54e-02

Pos logit contributions
 tow+0.019
 delight+0.017
 excitement+0.016
 response+0.016
 recess+0.016
Neg logit contributions
Deal-0.018
 who-0.017
pp-0.017
 if-0.017
ouncing-0.016
Token the
Token ID262
Feature activation-1.36e-02

Pos logit contributions
Deal+0.026
pp+0.023
obe+0.023
mun+0.022
but+0.022
Neg logit contributions
 tow-0.024
 delight-0.021
lee-0.020
 excitement-0.020
 response-0.019
Token attractive
Token ID10966
Feature activation-3.37e-02

Pos logit contributions
 tow+0.020
 delight+0.018
 excitement+0.017
 response+0.017
 anticipation+0.016
Neg logit contributions
 who-0.023
Deal-0.023
 if-0.023
pp-0.022
mun-0.022
Token ear
Token ID1027
Feature activation-9.63e-02

Pos logit contributions
 tow+0.051
 delight+0.046
lee+0.044
 excitement+0.044
 response+0.043
Neg logit contributions
Deal-0.058
 who-0.057
 if-0.055
mun-0.052
 without-0.051
Token of
Token ID286
Feature activation-2.32e-02

Pos logit contributions
 tow+0.208
lee+0.182
 delight+0.176
 response+0.173
 fingertips+0.171
Neg logit contributions
Deal-0.122
mun-0.099
obe-0.095
 if-0.083
 who-0.081
Token corn
Token ID11676
Feature activation-1.01e-02

Pos logit contributions
 tow+0.040
lee+0.037
 recess+0.034
 delight+0.034
 excitement+0.033
Neg logit contributions
 who-0.036
 if-0.035
Deal-0.034
 but-0.033
 with-0.032
Token.
Token ID13
Feature activation-2.87e-03

Pos logit contributions
 tow+0.019
lee+0.017
 delight+0.017
 excitement+0.016
 anticipation+0.016
Neg logit contributions
Deal-0.014
mun-0.011
 if-0.011
obe-0.011
 who-0.011
Token They
Token ID1119
Feature activation-5.14e-03

Pos logit contributions
 tow+0.005
lee+0.004
 delight+0.004
 excitement+0.004
 recess+0.004
Neg logit contributions
 who-0.005
Deal-0.005
 if-0.004
 but-0.004
 without-0.004
Token were
Token ID547
Feature activation+4.70e-03

Pos logit contributions
 tow+0.010
lee+0.010
 delight+0.009
 recess+0.009
 excitement+0.009
Neg logit contributions
 who-0.006
 if-0.006
Deal-0.006
 but-0.006
 without-0.005
Token park
Token ID3952
Feature activation+2.83e-02

Pos logit contributions
 who+0.006
Deal+0.005
 if+0.005
 with+0.005
 but+0.005
Neg logit contributions
lee-0.014
 tow-0.013
alks-0.013
lights-0.012
ened-0.012
Token they
Token ID484
Feature activation-5.73e-03

Pos logit contributions
Deal+0.029
 who+0.029
 if+0.027
mun+0.024
pp+0.023
Neg logit contributions
 tow-0.064
lee-0.058
 delight-0.056
 recess-0.056
 excitement-0.054
Token found
Token ID1043
Feature activation+9.68e-03

Pos logit contributions
 tow+0.011
lee+0.010
 delight+0.010
 recess+0.010
 excitement+0.009
Neg logit contributions
 who-0.008
 if-0.007
Deal-0.007
 but-0.007
 without-0.007
Token the
Token ID262
Feature activation-4.59e-03

Pos logit contributions
 who+0.012
Deal+0.012
 if+0.012
 but+0.011
mun+0.011
Neg logit contributions
 tow-0.020
lee-0.018
 delight-0.017
 recess-0.017
 excitement-0.016
Token biggest
Token ID4094
Feature activation+1.51e-02

Pos logit contributions
 tow+0.008
 delight+0.007
 excitement+0.007
lee+0.007
 recess+0.007
Neg logit contributions
 who-0.007
 if-0.007
Deal-0.007
mun-0.006
pp-0.006
Token ear
Token ID1027
Feature activation-9.44e-02

Pos logit contributions
Deal+0.022
 who+0.021
 if+0.021
pp+0.020
mun+0.019
Neg logit contributions
 tow-0.028
 delight-0.025
 excitement-0.024
lee-0.024
 recess-0.023
Token Tom
Token ID4186
Feature activation-8.56e-03

Pos logit contributions
 tow+0.201
lee+0.182
 delight+0.170
 recess+0.168
 excitement+0.166
Neg logit contributions
Deal-0.112
mun-0.100
 who-0.098
 if-0.095
obe-0.087
Token had
Token ID550
Feature activation-2.32e-02

Pos logit contributions
 tow+0.016
lee+0.015
 delight+0.014
 recess+0.014
 excitement+0.013
Neg logit contributions
 who-0.012
 if-0.012
Deal-0.011
 but-0.011
 without-0.011
Token ever
Token ID1683
Feature activation-2.58e-02

Pos logit contributions
 tow+0.037
lee+0.034
 recess+0.032
 delight+0.031
 excitement+0.030
Neg logit contributions
 who-0.038
Deal-0.038
 if-0.037
 but-0.035
mun-0.034
Token seen
Token ID1775
Feature activation+1.41e-02

Pos logit contributions
 tow+0.044
lee+0.041
 recess+0.038
 delight+0.038
 excitement+0.037
Neg logit contributions
Deal-0.040
 who-0.039
 if-0.038
 but-0.036
 without-0.034
Token!
Token ID0
Feature activation-2.79e-03

Pos logit contributions
Deal+0.017
 who+0.015
mun+0.014
pp+0.014
 if+0.014
Neg logit contributions
 tow-0.029
lee-0.027
 recess-0.025
 delight-0.025
 excitement-0.024
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
 tow+0.000
lee+0.000
 delight+0.000
 excitement+0.000
 farewell+0.000
Neg logit contributions
Deal-0.000
 who-0.000
 if-0.000
mun-0.000
pp-0.000
Token "
Token ID366
Feature activation-2.24e-03

Pos logit contributions
 tow+0.017
lee+0.015
 delight+0.014
 excitement+0.014
 response+0.013
Neg logit contributions
Deal-0.019
 who-0.018
 if-0.017
mun-0.017
pp-0.016
TokenWe
Token ID1135
Feature activation-1.39e-02

Pos logit contributions
 tow+0.004
lee+0.003
 delight+0.003
 response+0.003
 recess+0.003
Neg logit contributions
 who-0.003
 if-0.003
Deal-0.003
 without-0.003
 but-0.003
Token count
Token ID954
Feature activation-2.04e-02

Pos logit contributions
 tow+0.028
lee+0.025
 delight+0.024
 recess+0.024
 excitement+0.024
Neg logit contributions
 who-0.017
Deal-0.017
 if-0.016
 but-0.016
 without-0.015
Token the
Token ID262
Feature activation-3.18e-02

Pos logit contributions
 tow+0.039
lee+0.037
 delight+0.035
 recess+0.035
 excitement+0.034
Neg logit contributions
Deal-0.026
 who-0.024
mun-0.024
 if-0.023
pp-0.022
Token days
Token ID1528
Feature activation-9.43e-02

Pos logit contributions
 tow+0.051
lee+0.045
 delight+0.042
 recess+0.042
 excitement+0.041
Neg logit contributions
 who-0.054
Deal-0.052
 if-0.052
 but-0.049
 without-0.048
Token,
Token ID11
Feature activation-2.58e-02

Pos logit contributions
 tow+0.201
lee+0.187
 recess+0.179
 delight+0.177
 July+0.172
Neg logit contributions
Deal-0.102
 who-0.092
mun-0.089
 if-0.084
 but-0.080
Token weeks
Token ID2745
Feature activation-5.88e-02

Pos logit contributions
 tow+0.050
 delight+0.045
lee+0.045
 excitement+0.044
 anticipation+0.042
Neg logit contributions
Deal-0.034
mun-0.031
 who-0.031
pp-0.030
 if-0.028
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
 tow+0.126
lee+0.113
 recess+0.110
 delight+0.109
 excitement+0.105
Neg logit contributions
Deal-0.066
 who-0.062
mun-0.058
 if-0.057
 but-0.054
Token months
Token ID1933
Feature activation-5.11e-02

Pos logit contributions
 tow+0.032
 delight+0.029
lee+0.028
 excitement+0.028
 recess+0.027
Neg logit contributions
Deal-0.023
mun-0.022
pp-0.021
 who-0.021
 if-0.019
Token and
Token ID290
Feature activation-1.60e-02

Pos logit contributions
 tow+0.111
lee+0.100
 recess+0.097
 delight+0.096
 excitement+0.093
Neg logit contributions
Deal-0.055
 who-0.054
 if-0.050
mun-0.048
 but-0.047
Token bow
Token ID9563
Feature activation+3.12e-03

Pos logit contributions
 tow+0.011
 delight+0.009
lee+0.009
 excitement+0.009
 response+0.009
Neg logit contributions
Deal-0.009
 who-0.009
 if-0.009
mun-0.009
 but-0.008
Token and
Token ID290
Feature activation-1.66e-02

Pos logit contributions
Deal+0.003
 who+0.003
 if+0.003
mun+0.003
 but+0.003
Neg logit contributions
 tow-0.007
lee-0.006
 delight-0.006
 recess-0.006
 excitement-0.006
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
 tow+0.026
lee+0.024
 delight+0.022
 recess+0.022
 excitement+0.022
Neg logit contributions
 who-0.028
Deal-0.027
 if-0.027
 but-0.025
 without-0.025
Token 3
Token ID513
Feature activation-5.71e-02

Pos logit contributions
 tow+0.024
lee+0.020
 delight+0.020
 excitement+0.020
 recess+0.019
Neg logit contributions
 if-0.028
 who-0.028
Deal-0.028
 but-0.026
pp-0.026
Token-
Token ID12
Feature activation-6.06e-02

Pos logit contributions
 tow+0.093
lee+0.085
 delight+0.081
 excitement+0.079
 recess+0.076
Neg logit contributions
Deal-0.090
 if-0.089
 who-0.088
pp-0.083
 but-0.083
Tokenyear
Token ID1941
Feature activation-9.41e-02

Pos logit contributions
 tow+0.098
lee+0.089
 recess+0.082
 delight+0.081
 excitement+0.078
Neg logit contributions
 who-0.100
Deal-0.096
 if-0.096
 but-0.093
 without-0.090
Token-
Token ID12
Feature activation-4.49e-02

Pos logit contributions
 tow+0.221
 recess+0.199
 July+0.193
lee+0.189
 response+0.187
Neg logit contributions
 who-0.083
 with-0.083
 if-0.080
 without-0.073
 and-0.070
Tokenold
Token ID727
Feature activation-9.30e-04

Pos logit contributions
 tow+0.077
lee+0.071
 recess+0.066
 delight+0.065
 excitement+0.063
Neg logit contributions
 who-0.069
Deal-0.066
 if-0.066
 but-0.064
 without-0.062
Token followed
Token ID3940
Feature activation-1.40e-02

Pos logit contributions
 tow+0.002
lee+0.002
 delight+0.002
 recess+0.002
 excitement+0.002
Neg logit contributions
 who-0.001
Deal-0.001
 if-0.001
 but-0.001
 without-0.001
Token him
Token ID683
Feature activation-9.75e-04

Pos logit contributions
 tow+0.025
lee+0.023
 recess+0.021
 delight+0.021
 excitement+0.020
Neg logit contributions
 who-0.022
 if-0.021
 but-0.020
 with-0.019
 without-0.019
Token,
Token ID11
Feature activation+8.74e-03

Pos logit contributions
 tow+0.002
lee+0.002
 delight+0.002
 excitement+0.002
 recess+0.002
Neg logit contributions
Deal-0.001
mun-0.001
 who-0.001
 if-0.001
pp-0.001
Token and
Token ID290
Feature activation-3.02e-02

Pos logit contributions
Deal+0.015
mun+0.012
mons+0.012
iw+0.011
 if+0.011
Neg logit contributions
 tow-0.026
lee-0.026
 delight-0.024
 excitement-0.023
 response-0.023
Token she
Token ID673
Feature activation-2.07e-02

Pos logit contributions
 tow+0.056
lee+0.053
 delight+0.049
 recess+0.048
 excitement+0.048
Neg logit contributions
Deal-0.041
 who-0.041
 if-0.039
 but-0.037
 without-0.037
Token was
Token ID373
Feature activation-1.69e-02

Pos logit contributions
 tow+0.043
lee+0.040
 delight+0.038
 excitement+0.037
 recess+0.036
Neg logit contributions
Deal-0.024
 who-0.024
 if-0.023
 but-0.022
 without-0.021
Token three
Token ID1115
Feature activation-2.65e-02

Pos logit contributions
 tow+0.036
lee+0.034
 delight+0.032
 excitement+0.031
 recess+0.030
Neg logit contributions
Deal-0.020
 who-0.018
 if-0.018
 but-0.016
 without-0.016
Token-
Token ID12
Feature activation-6.53e-02

Pos logit contributions
 tow+0.050
lee+0.046
 delight+0.043
 recess+0.042
 excitement+0.042
Neg logit contributions
 who-0.037
Deal-0.036
 if-0.036
 but-0.033
 without-0.032
Tokenyears
Token ID19002
Feature activation-9.37e-02

Pos logit contributions
 tow+0.082
lee+0.072
 delight+0.066
 recess+0.065
 excitement+0.062
Neg logit contributions
 who-0.132
 if-0.128
Deal-0.128
 but-0.123
 without-0.120
Token-
Token ID12
Feature activation-4.26e-02

Pos logit contributions
 tow+0.173
lee+0.166
 delight+0.152
 recess+0.149
 excitement+0.147
Neg logit contributions
 who-0.129
Deal-0.125
 if-0.124
 but-0.117
 without-0.113
Tokenold
Token ID727
Feature activation+3.76e-03

Pos logit contributions
 tow+0.070
lee+0.065
 recess+0.059
 delight+0.058
 excitement+0.056
Neg logit contributions
 who-0.067
Deal-0.066
 if-0.065
 but-0.064
 without-0.062
Token.
Token ID13
Feature activation-1.67e-02

Pos logit contributions
Deal+0.004
pp+0.004
 if+0.003
 who+0.003
mun+0.003
Neg logit contributions
 tow-0.008
lee-0.007
 delight-0.007
 excitement-0.007
 response-0.007
Token She
Token ID1375
Feature activation-1.12e-02

Pos logit contributions
 tow+0.035
lee+0.032
 delight+0.030
 recess+0.030
 excitement+0.029
Neg logit contributions
 who-0.020
 if-0.019
Deal-0.019
 but-0.018
 without-0.017
Token wanted
Token ID2227
Feature activation-1.65e-02

Pos logit contributions
 tow+0.023
lee+0.022
 delight+0.020
 recess+0.020
 excitement+0.020
Neg logit contributions
 who-0.013
Deal-0.013
 if-0.013
 but-0.012
 without-0.011
Tokenuff
Token ID1648
Feature activation-9.26e-03

Pos logit contributions
 tow+0.022
lee+0.021
 recess+0.019
 delight+0.019
 excitement+0.018
Neg logit contributions
 who-0.018
 if-0.017
Deal-0.017
 but-0.017
 without-0.016
Tokenled
Token ID992
Feature activation-1.83e-02

Pos logit contributions
 tow+0.015
lee+0.014
 delight+0.013
 recess+0.013
 excitement+0.013
Neg logit contributions
 who-0.015
Deal-0.014
 if-0.014
 but-0.013
 without-0.013
Token the
Token ID262
Feature activation-9.35e-03

Pos logit contributions
 tow+0.037
lee+0.035
 delight+0.033
 recess+0.033
 excitement+0.032
Neg logit contributions
 who-0.022
Deal-0.021
 if-0.021
 but-0.020
 without-0.019
Token three
Token ID1115
Feature activation-4.22e-02

Pos logit contributions
 tow+0.017
lee+0.015
 delight+0.014
 recess+0.014
 excitement+0.014
Neg logit contributions
 who-0.014
 if-0.014
Deal-0.014
 but-0.013
 without-0.013
Token-
Token ID12
Feature activation-6.12e-02

Pos logit contributions
 tow+0.072
lee+0.064
 recess+0.061
 delight+0.060
 excitement+0.057
Neg logit contributions
 who-0.068
 if-0.065
Deal-0.063
 but-0.062
 with-0.060
Tokenyear
Token ID1941
Feature activation-9.32e-02

Pos logit contributions
 tow+0.086
lee+0.078
 recess+0.071
 delight+0.068
 farewell+0.066
Neg logit contributions
 who-0.112
 if-0.108
 but-0.107
Deal-0.107
 without-0.102
Token-
Token ID12
Feature activation-4.70e-02

Pos logit contributions
 tow+0.206
 recess+0.185
lee+0.181
 delight+0.177
 response+0.174
Neg logit contributions
 who-0.099
 if-0.094
 with-0.091
 without-0.088
 but-0.087
Tokenold
Token ID727
Feature activation-1.30e-03

Pos logit contributions
 tow+0.078
lee+0.073
 recess+0.066
 delight+0.064
 farewell+0.063
Neg logit contributions
 who-0.074
 but-0.071
 if-0.071
Deal-0.070
 without-0.066
Token's
Token ID338
Feature activation+1.68e-03

Pos logit contributions
 tow+0.003
lee+0.002
 delight+0.002
 recess+0.002
 excitement+0.002
Neg logit contributions
 who-0.002
Deal-0.002
 if-0.002
 but-0.002
 without-0.001
Token hair
Token ID4190
Feature activation-4.41e-02

Pos logit contributions
 who+0.003
 if+0.003
Deal+0.003
 but+0.003
 without+0.003
Neg logit contributions
 tow-0.003
lee-0.002
 delight-0.002
 recess-0.002
 excitement-0.002
Token.
Token ID13
Feature activation+2.94e-03

Pos logit contributions
 tow+0.094
lee+0.089
 delight+0.083
 excitement+0.081
 recess+0.080
Neg logit contributions
Deal-0.048
 who-0.044
 if-0.042
mun-0.042
pp-0.036
Token?"
Token ID1701
Feature activation-5.28e-03

Pos logit contributions
Deal+0.002
mun+0.001
 who+0.001
Acc+0.001
obe+0.001
Neg logit contributions
 tow-0.002
 giant-0.002
 delight-0.002
 excitement-0.002
 response-0.002
Token asked
Token ID1965
Feature activation-1.17e-02

Pos logit contributions
 tow+0.009
lee+0.008
 delight+0.008
 recess+0.008
 excitement+0.008
Neg logit contributions
 who-0.008
Deal-0.008
 if-0.008
 but-0.007
 without-0.007
Token the
Token ID262
Feature activation-2.06e-02

Pos logit contributions
 tow+0.021
lee+0.020
 recess+0.019
 delight+0.019
 accuracy+0.019
Neg logit contributions
 who-0.017
 if-0.017
 with-0.016
 but-0.015
 without-0.015
Token 3
Token ID513
Feature activation-6.63e-02

Pos logit contributions
 tow+0.033
lee+0.029
 delight+0.027
 recess+0.027
 excitement+0.026
Neg logit contributions
 who-0.036
 if-0.034
Deal-0.034
 but-0.033
 without-0.032
Token-
Token ID12
Feature activation-6.14e-02

Pos logit contributions
 tow+0.111
lee+0.107
 delight+0.099
 excitement+0.096
 response+0.092
Neg logit contributions
Deal-0.102
 who-0.099
 if-0.097
 but-0.091
 without-0.090
Tokenyear
Token ID1941
Feature activation-9.31e-02

Pos logit contributions
 tow+0.084
lee+0.075
 delight+0.069
 recess+0.069
 excitement+0.066
Neg logit contributions
 who-0.118
 if-0.115
Deal-0.113
 but-0.107
 with-0.107
Token-
Token ID12
Feature activation-4.75e-02

Pos logit contributions
 tow+0.196
 recess+0.176
lee+0.173
 delight+0.169
 excitement+0.164
Neg logit contributions
 who-0.111
 if-0.106
Deal-0.099
 with-0.099
 but-0.096
Tokenold
Token ID727
Feature activation-7.11e-03

Pos logit contributions
 tow+0.082
lee+0.074
 delight+0.069
 recess+0.069
 excitement+0.067
Neg logit contributions
 who-0.074
Deal-0.071
 if-0.071
 but-0.068
 without-0.066
Token child
Token ID1200
Feature activation-5.27e-02

Pos logit contributions
lee+0.013
 tow+0.012
 delight+0.011
 excitement+0.011
 anticipation+0.011
Neg logit contributions
Deal-0.011
pp-0.009
 who-0.009
mun-0.009
 if-0.008
Token.
Token ID13
Feature activation-1.55e-02

Pos logit contributions
lee+0.105
 tow+0.103
 delight+0.095
 excitement+0.090
 response+0.089
Neg logit contributions
Deal-0.075
mun-0.057
pp-0.054
mons-0.053
You-0.052
Token\n
Token ID198
Feature activation-1.21e-02

Pos logit contributions
 tow+0.035
lee+0.033
 delight+0.031
 recess+0.031
 excitement+0.030
Neg logit contributions
 who-0.016
 if-0.015
Deal-0.015
 but-0.014
 without-0.013
Token "
Token ID366
Feature activation-4.82e-03

Pos logit contributions
 tow+0.025
lee+0.023
 recess+0.022
 delight+0.021
 excitement+0.021
Neg logit contributions
Deal-0.015
 who-0.015
 if-0.014
mun-0.013
 but-0.013
Tokench
Token ID354
Feature activation-5.88e-02

Pos logit contributions
 tow+0.009
lee+0.008
 delight+0.008
 recess+0.008
 excitement+0.007
Neg logit contributions
 who-0.007
 if-0.006
Deal-0.006
 but-0.006
 without-0.006
Tokenoo
Token ID2238
Feature activation-6.51e-02

Pos logit contributions
 tow+0.108
lee+0.097
 recess+0.092
 delight+0.090
ived+0.090
Neg logit contributions
Deal-0.084
 who-0.079
 if-0.074
mun-0.071
pp-0.069
Token cho
Token ID1727
Feature activation-9.13e-02

Pos logit contributions
 tow+0.123
lee+0.116
 delight+0.108
 farewell+0.106
 recess+0.104
Neg logit contributions
Deal-0.086
 who-0.085
 if-0.080
mun-0.075
 but-0.074
Tokeno
Token ID78
Feature activation-6.63e-02

Pos logit contributions
 tow+0.156
lee+0.143
 recess+0.135
ived+0.131
 farewell+0.129
Neg logit contributions
 who-0.136
Deal-0.132
 if-0.130
 but-0.125
mun-0.119
Token cho
Token ID1727
Feature activation-9.30e-02

Pos logit contributions
 tow+0.122
lee+0.111
 delight+0.104
 farewell+0.104
ived+0.104
Neg logit contributions
Deal-0.091
 who-0.089
 if-0.083
mun-0.081
 but-0.079
Tokeno
Token ID78
Feature activation-6.47e-02

Pos logit contributions
 tow+0.167
lee+0.153
 recess+0.144
 delight+0.139
 farewell+0.137
Neg logit contributions
 who-0.134
Deal-0.129
 if-0.128
 but-0.122
 without-0.116
Token".
Token ID1911
Feature activation-2.56e-02

Pos logit contributions
 tow+0.120
lee+0.110
 delight+0.103
 farewell+0.102
 recess+0.101
Neg logit contributions
Deal-0.088
 who-0.087
 if-0.082
mun-0.078
 but-0.077
Token\n
Token ID198
Feature activation-1.81e-02

Pos logit contributions
 tow+0.048
lee+0.043
 delight+0.041
 excitement+0.040
 recess+0.039
Neg logit contributions
Deal-0.036
 who-0.035
 if-0.034
mun-0.031
pp-0.031
Token\n
Token ID198
Feature activation-1.90e-02

Pos logit contributions
 tow+0.042
lee+0.040
 delight+0.039
 recess+0.038
 excitement+0.038
Neg logit contributions
 who-0.017
 if-0.016
Deal-0.015
 but-0.014
 with-0.013
TokenThey
Token ID2990
Feature activation-2.66e-02

Pos logit contributions
 tow+0.034
lee+0.033
 delight+0.031
 recess+0.030
 excitement+0.030
Neg logit contributions
 who-0.028
 if-0.026
Deal-0.026
 but-0.025
 with-0.024
Token a
Token ID257
Feature activation-7.33e-03

Pos logit contributions
 tow+0.006
lee+0.006
 delight+0.006
 excitement+0.006
 recess+0.006
Neg logit contributions
Deal-0.004
 who-0.004
 if-0.004
pp-0.003
 without-0.003
Token little
Token ID1310
Feature activation+1.31e-02

Pos logit contributions
 tow+0.012
lee+0.011
 recess+0.010
 delight+0.010
 excitement+0.010
Neg logit contributions
 who-0.013
 if-0.012
Deal-0.012
 but-0.011
 with-0.011
Token girl
Token ID2576
Feature activation-2.96e-03

Pos logit contributions
 who+0.018
Deal+0.018
 if+0.018
 but+0.017
 without+0.016
Neg logit contributions
 tow-0.024
lee-0.022
 delight-0.021
 recess-0.021
 excitement-0.020
Token named
Token ID3706
Feature activation-2.52e-02

Pos logit contributions
 tow+0.005
lee+0.005
 recess+0.004
 delight+0.004
 July+0.004
Neg logit contributions
Deal-0.005
mun-0.004
pp-0.004
 if-0.004
ows-0.004
Token Kay
Token ID17356
Feature activation-4.37e-02

Pos logit contributions
 tow+0.038
lee+0.035
 recess+0.032
 delight+0.032
 excitement+0.030
Neg logit contributions
Deal-0.045
 who-0.043
 if-0.041
mun-0.041
 but-0.039
Tokenlee
Token ID7197
Feature activation-9.30e-02

Pos logit contributions
 tow+0.108
lee+0.100
 delight+0.097
 recess+0.097
 excitement+0.095
Neg logit contributions
 who-0.037
 if-0.033
Deal-0.032
 but-0.029
 with-0.028
Token.
Token ID13
Feature activation-8.18e-04

Pos logit contributions
lee+0.187
 tow+0.185
 delight+0.158
 recess+0.153
 farewell+0.153
Neg logit contributions
Deal-0.117
 if-0.100
mun-0.093
iw-0.086
 without-0.084
Token She
Token ID1375
Feature activation-6.10e-03

Pos logit contributions
 tow+0.002
lee+0.002
 delight+0.002
 recess+0.002
 excitement+0.002
Neg logit contributions
 who-0.001
 if-0.001
Deal-0.001
 but-0.001
 with-0.001
Token really
Token ID1107
Feature activation+2.18e-02

Pos logit contributions
 tow+0.014
lee+0.013
 delight+0.012
 recess+0.012
 excitement+0.012
Neg logit contributions
 who-0.006
 if-0.006
Deal-0.006
 but-0.005
 without-0.005
Token wanted
Token ID2227
Feature activation-5.39e-03

Pos logit contributions
 who+0.020
 if+0.019
Deal+0.018
 but+0.017
 without+0.017
Neg logit contributions
 tow-0.050
lee-0.047
 delight-0.045
 excitement-0.045
 recess-0.044
Token to
Token ID284
Feature activation+3.61e-03

Pos logit contributions
 tow+0.012
lee+0.011
 recess+0.010
 delight+0.010
 excitement+0.010
Neg logit contributions
 who-0.006
Deal-0.006
 if-0.005
 but-0.005
mun-0.005
Token?"
Token ID1701
Feature activation+1.10e-03

Pos logit contributions
 tow+0.009
 delight+0.008
 recess+0.008
lee+0.008
 excitement+0.008
Neg logit contributions
Deal-0.005
mun-0.005
 who-0.004
 based-0.004
Acc-0.003
Token asked
Token ID1965
Feature activation-1.29e-02

Pos logit contributions
 who+0.002
 if+0.002
Deal+0.002
 but+0.002
 without+0.002
Neg logit contributions
 tow-0.002
lee-0.002
 delight-0.001
 recess-0.001
 excitement-0.001
Token the
Token ID262
Feature activation-2.19e-02

Pos logit contributions
 tow+0.020
lee+0.019
 recess+0.017
 delight+0.017
 excitement+0.016
Neg logit contributions
 who-0.022
 if-0.021
Deal-0.021
 but-0.020
 with-0.020
Token three
Token ID1115
Feature activation-5.57e-02

Pos logit contributions
 tow+0.034
lee+0.031
 recess+0.029
 delight+0.029
 excitement+0.027
Neg logit contributions
 who-0.038
 if-0.036
Deal-0.036
 but-0.035
 with-0.034
Token-
Token ID12
Feature activation-6.55e-02

Pos logit contributions
 tow+0.105
lee+0.096
 recess+0.091
 delight+0.091
 excitement+0.088
Neg logit contributions
 who-0.078
 if-0.075
Deal-0.074
 but-0.070
 with-0.068
Tokenyear
Token ID1941
Feature activation-9.22e-02

Pos logit contributions
 tow+0.086
lee+0.076
 delight+0.069
 recess+0.068
 excitement+0.066
Neg logit contributions
 who-0.130
 if-0.126
Deal-0.125
 but-0.119
 without-0.118
Token-
Token ID12
Feature activation-4.45e-02

Pos logit contributions
 tow+0.201
 recess+0.180
lee+0.180
 delight+0.176
 excitement+0.171
Neg logit contributions
 who-0.101
 if-0.096
Deal-0.090
 with-0.090
 but-0.086
Tokenold
Token ID727
Feature activation-8.41e-03

Pos logit contributions
 tow+0.075
lee+0.068
 recess+0.063
 delight+0.063
 excitement+0.060
Neg logit contributions
 who-0.070
Deal-0.067
 if-0.067
 but-0.066
 without-0.063
Token girl
Token ID2576
Feature activation-2.78e-02

Pos logit contributions
 tow+0.013
lee+0.013
 delight+0.011
 excitement+0.011
 recess+0.011
Neg logit contributions
Deal-0.014
pp-0.012
mun-0.012
 who-0.012
 if-0.012
Token.
Token ID13
Feature activation-8.96e-03

Pos logit contributions
 tow+0.058
lee+0.057
 delight+0.054
 excitement+0.052
 anticipation+0.050
Neg logit contributions
Deal-0.034
mun-0.027
pp-0.026
 who-0.024
 if-0.024
Token\n
Token ID198
Feature activation-6.21e-03

Pos logit contributions
 tow+0.020
lee+0.019
 recess+0.019
lights+0.018
 tentacles+0.018
Neg logit contributions
 who-0.009
 if-0.008
 but-0.008
 with-0.008
Deal-0.007
Tokend
Token ID67
Feature activation-1.72e-02

Pos logit contributions
 tow+0.064
lee+0.057
 recess+0.054
 farewell+0.051
 delight+0.051
Neg logit contributions
Deal-0.059
 who-0.058
 if-0.056
 but-0.056
 without-0.053
Tokenough
Token ID619
Feature activation-1.61e-02

Pos logit contributions
 tow+0.030
lee+0.027
 recess+0.025
 delight+0.025
 excitement+0.024
Neg logit contributions
 who-0.026
Deal-0.026
 if-0.025
mun-0.024
 but-0.024
Token to
Token ID284
Feature activation-1.85e-02

Pos logit contributions
 tow+0.035
lee+0.033
 recess+0.032
 delight+0.031
alks+0.030
Neg logit contributions
Deal-0.017
 who-0.016
 if-0.014
mun-0.014
 but-0.013
Token make
Token ID787
Feature activation+3.30e-03

Pos logit contributions
 tow+0.035
lee+0.031
 delight+0.030
 recess+0.030
 excitement+0.029
Neg logit contributions
 who-0.026
Deal-0.025
 if-0.025
 but-0.023
 without-0.023
Token the
Token ID262
Feature activation-9.38e-03

Pos logit contributions
 who+0.005
Deal+0.005
 if+0.005
 but+0.004
 without+0.004
Neg logit contributions
 tow-0.006
lee-0.005
 delight-0.005
 recess-0.005
 excitement-0.005
Token ear
Token ID1027
Feature activation-9.14e-02

Pos logit contributions
 tow+0.013
lee+0.011
 delight+0.011
 recess+0.011
 excitement+0.010
Neg logit contributions
 who-0.018
 if-0.017
Deal-0.017
 but-0.016
 without-0.016
Token.
Token ID13
Feature activation+3.56e-03

Pos logit contributions
 tow+0.170
lee+0.151
 delight+0.141
 response+0.140
 excitement+0.136
Neg logit contributions
Deal-0.137
mun-0.122
 who-0.117
 if-0.113
obe-0.108
Token She
Token ID1375
Feature activation-1.14e-03

Pos logit contributions
Deal+0.004
 who+0.004
pp+0.004
mun+0.004
 if+0.004
Neg logit contributions
 tow-0.007
lee-0.007
 delight-0.006
 recess-0.006
 excitement-0.006
Token makes
Token ID1838
Feature activation+9.63e-03

Pos logit contributions
 tow+0.002
lee+0.002
 delight+0.002
 recess+0.002
 excitement+0.002
Neg logit contributions
Deal-0.001
 who-0.001
 if-0.001
 but-0.001
 without-0.001
Token it
Token ID340
Feature activation-7.35e-03

Pos logit contributions
 who+0.012
 if+0.012
 with+0.011
 but+0.011
Deal+0.010
Neg logit contributions
 tow-0.020
lee-0.018
 recess-0.017
 delight-0.017
 response-0.016
Token very
Token ID845
Feature activation+2.54e-03

Pos logit contributions
 tow+0.013
lee+0.011
 delight+0.011
 recess+0.011
 excitement+0.011
Neg logit contributions
 who-0.011
Deal-0.011
 if-0.011
 but-0.010
mun-0.010
Token was
Token ID373
Feature activation-1.06e-02

Pos logit contributions
 tow+0.045
lee+0.042
 recess+0.040
 delight+0.039
 excitement+0.038
Neg logit contributions
 who-0.033
 if-0.032
Deal-0.031
 but-0.030
 with-0.029
Token shining
Token ID22751
Feature activation-8.94e-03

Pos logit contributions
 tow+0.020
lee+0.018
 delight+0.018
 excitement+0.017
 response+0.017
Neg logit contributions
Deal-0.014
 who-0.014
 if-0.013
mun-0.013
 but-0.012
Token brightly
Token ID35254
Feature activation-2.38e-02

Pos logit contributions
 tow+0.020
lee+0.019
 delight+0.018
 excitement+0.018
ened+0.017
Neg logit contributions
Deal-0.009
 who-0.008
mun-0.008
 if-0.007
pp-0.007
Token and
Token ID290
Feature activation-1.20e-02

Pos logit contributions
lee+0.045
ened+0.044
 delight+0.043
lights+0.043
 tow+0.042
Neg logit contributions
mun-0.028
Deal-0.028
 who-0.025
 if-0.023
 Or-0.023
Token the
Token ID262
Feature activation-1.30e-02

Pos logit contributions
 tow+0.018
lee+0.016
 delight+0.015
 excitement+0.015
 recess+0.014
Neg logit contributions
 who-0.021
Deal-0.021
 if-0.020
pp-0.019
mun-0.019
Token days
Token ID1528
Feature activation-9.14e-02

Pos logit contributions
 tow+0.013
 delight+0.011
 excitement+0.011
lee+0.011
 response+0.010
Neg logit contributions
 who-0.029
Deal-0.028
 if-0.028
pp-0.027
mun-0.027
Token got
Token ID1392
Feature activation-8.32e-03

Pos logit contributions
 tow+0.174
lee+0.160
 delight+0.150
 recess+0.150
 excitement+0.146
Neg logit contributions
 who-0.126
 if-0.120
Deal-0.119
 but-0.113
 without-0.109
Token hotter
Token ID37546
Feature activation+1.01e-02

Pos logit contributions
 tow+0.015
lee+0.014
 delight+0.013
 recess+0.013
 excitement+0.013
Neg logit contributions
 who-0.012
 if-0.012
 but-0.011
Deal-0.011
 without-0.011
Token and
Token ID290
Feature activation-1.48e-02

Pos logit contributions
Deal+0.016
mun+0.014
 who+0.013
pp+0.013
 based+0.012
Neg logit contributions
lee-0.016
 tow-0.016
lights-0.016
 delight-0.015
 excitement-0.015
Token hotter
Token ID37546
Feature activation+4.06e-03

Pos logit contributions
 tow+0.023
lee+0.020
 delight+0.019
 recess+0.019
 excitement+0.018
Neg logit contributions
 who-0.025
Deal-0.025
 if-0.023
mun-0.023
 but-0.022
Token.
Token ID13
Feature activation-5.21e-03

Pos logit contributions
Deal+0.006
 who+0.005
mun+0.005
pp+0.005
 if+0.005
Neg logit contributions
lee-0.007
 tow-0.007
 delight-0.006
lights-0.006
 excitement-0.006
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
lee+0.012
 tow+0.012
 response+0.010
 delight+0.010
 July+0.010
Neg logit contributions
Deal-0.006
mun-0.004
iw-0.004
 if-0.004
mons-0.004
Token Bella
Token ID39304
Feature activation-1.01e-02

Pos logit contributions
 tow+0.018
lee+0.017
 delight+0.016
alks+0.016
lights+0.016
Neg logit contributions
 who-0.017
 if-0.017
Deal-0.016
 but-0.015
 with-0.015
Token.
Token ID13
Feature activation+2.04e-03

Pos logit contributions
 tow+0.025
lee+0.023
 delight+0.022
 recess+0.022
 excitement+0.021
Neg logit contributions
 who-0.008
Deal-0.008
 if-0.008
 but-0.007
 without-0.007
Token Jake
Token ID14757
Feature activation+6.79e-03

Pos logit contributions
 but+0.002
 who+0.002
 if+0.002
 with+0.002
Deal+0.002
Neg logit contributions
 tow-0.005
lee-0.005
 farewell-0.004
 delight-0.004
 response-0.004
Token loved
Token ID6151
Feature activation+1.63e-02

Pos logit contributions
Deal+0.009
 if+0.008
mun+0.008
pp+0.008
but+0.008
Neg logit contributions
 tow-0.012
lee-0.012
 delight-0.010
 excitement-0.010
 recess-0.010
Token raft
Token ID31812
Feature activation+7.87e-02

Pos logit contributions
 who+0.016
 if+0.014
Deal+0.013
 but+0.013
 with+0.013
Neg logit contributions
 tow-0.040
lee-0.036
 delight-0.035
 recess-0.034
 excitement-0.034
Tokening
Token ID278
Feature activation+3.33e-02

Pos logit contributions
Deal+0.094
 who+0.082
mun+0.080
Acc+0.078
using+0.077
Neg logit contributions
 tow-0.145
lee-0.133
 recess-0.126
 excitement-0.123
lights-0.123
Token and
Token ID290
Feature activation-4.55e-03

Pos logit contributions
 who+0.022
 if+0.020
Deal+0.020
 but+0.018
 without+0.016
Neg logit contributions
 tow-0.088
lee-0.082
 delight-0.079
 recess-0.079
 excitement-0.077
Token said
Token ID531
Feature activation-6.53e-03

Pos logit contributions
 tow+0.009
lee+0.008
 delight+0.008
 recess+0.008
 excitement+0.008
Neg logit contributions
 who-0.006
 if-0.006
Deal-0.005
 but-0.005
 with-0.005
Token to
Token ID284
Feature activation-1.57e-02

Pos logit contributions
 tow+0.015
 recess+0.014
lee+0.014
 response+0.013
 delight+0.013
Neg logit contributions
 if-0.006
 who-0.006
 but-0.006
 with-0.006
Deal-0.006
Token Bella
Token ID39304
Feature activation+5.58e-03

Pos logit contributions
 tow+0.027
lee+0.023
 delight+0.023
 recess+0.023
 anticipation+0.022
Neg logit contributions
 who-0.026
 if-0.025
 but-0.024
 with-0.023
Deal-0.023
Token snacks
Token ID28487
Feature activation+2.63e-02

Pos logit contributions
 who+0.019
Deal+0.019
 if+0.018
 but+0.018
 with+0.017
Neg logit contributions
 tow-0.022
lee-0.020
 recess-0.019
 delight-0.019
 response-0.018
Token.
Token ID13
Feature activation+8.79e-03

Pos logit contributions
Deal+0.028
 who+0.026
mun+0.025
 if+0.024
pp+0.021
Neg logit contributions
 tow-0.057
lee-0.052
 delight-0.050
 excitement-0.050
 recess-0.049
Token
Token ID198
Feature activation+6.42e-03

Pos logit contributions
 who+0.009
 if+0.009
Deal+0.008
 but+0.008
 with+0.007
Neg logit contributions
 tow-0.020
lee-0.018
 recess-0.018
 delight-0.018
 response-0.017
Token
Token ID198
Feature activation-5.52e-06

Pos logit contributions
 who+0.005
 if+0.004
 with+0.004
 but+0.003
 without+0.003
Neg logit contributions
 tow-0.016
 recess-0.015
ened-0.015
lee-0.015
 response-0.015
TokenAfter
Token ID3260
Feature activation-1.49e-02

Feature not active on prev token; no predictions were affected.
Tokenwards
Token ID2017
Feature activation+5.00e-02

Pos logit contributions
 tow+0.031
lee+0.029
 response+0.026
 delight+0.026
lights+0.026
Neg logit contributions
 who-0.019
 if-0.018
 with-0.017
 but-0.016
 without-0.016
Token,
Token ID11
Feature activation+1.36e-02

Pos logit contributions
 who+0.047
Deal+0.044
 if+0.044
 but+0.039
 without+0.038
Neg logit contributions
 tow-0.117
lee-0.109
 delight-0.105
 recess-0.104
 excitement-0.102
Token Daddy
Token ID26926
Feature activation+2.35e-02

Pos logit contributions
 who+0.024
 if+0.023
Deal+0.023
 but+0.022
 with+0.022
Neg logit contributions
 tow-0.021
lee-0.019
 recess-0.017
 delight-0.017
 excitement-0.016
Token hugged
Token ID41130
Feature activation-2.65e-03

Pos logit contributions
 who+0.023
 if+0.022
Deal+0.021
 but+0.020
 without+0.019
Neg logit contributions
 tow-0.054
lee-0.050
 recess-0.048
 delight-0.048
 excitement-0.047
Token his
Token ID465
Feature activation+1.46e-02

Pos logit contributions
 tow+0.005
lee+0.005
 recess+0.005
 delight+0.005
 excitement+0.004
Neg logit contributions
 who-0.004
 if-0.003
Deal-0.003
 but-0.003
 without-0.003
Token little
Token ID1310
Feature activation+3.54e-02

Pos logit contributions
 who+0.025
 if+0.024
Deal+0.023
 but+0.023
 with+0.023
Neg logit contributions
 tow-0.023
lee-0.021
 recess-0.020
 delight-0.019
alks-0.018
Token bath
Token ID7837
Feature activation+7.03e-03

Pos logit contributions
 who+0.015
 if+0.014
 but+0.014
 with+0.014
 without+0.013
Neg logit contributions
lee-0.018
 tow-0.017
 recess-0.016
 delight-0.015
alks-0.015
Token in
Token ID287
Feature activation+1.19e-02

Pos logit contributions
Deal+0.006
 who+0.006
mun+0.005
 if+0.005
 based+0.005
Neg logit contributions
 tow-0.016
 recess-0.015
lee-0.015
 delight-0.015
 excitement-0.014
Token the
Token ID262
Feature activation+2.08e-02

Pos logit contributions
 who+0.018
Deal+0.018
 if+0.017
 but+0.016
mun+0.016
Neg logit contributions
 tow-0.021
lee-0.019
 delight-0.018
 recess-0.018
 excitement-0.018
Token green
Token ID4077
Feature activation-1.14e-02

Pos logit contributions
 who+0.037
 with+0.034
 if+0.034
 but+0.034
Deal+0.033
Neg logit contributions
 tow-0.031
lee-0.029
 recess-0.026
 accuracy-0.025
alks-0.025
Token bath
Token ID7837
Feature activation-4.42e-03

Pos logit contributions
 tow+0.015
lee+0.014
 recess+0.012
 delight+0.012
 excitement+0.012
Neg logit contributions
 who-0.022
 if-0.021
Deal-0.021
 but-0.021
 with-0.021
Tokentub
Token ID37995
Feature activation+5.27e-02

Pos logit contributions
 tow+0.007
lee+0.007
 recess+0.006
 delight+0.006
lights+0.006
Neg logit contributions
Deal-0.007
 who-0.007
 if-0.007
mun-0.006
 based-0.006
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
Deal+0.062
mun+0.053
pp+0.045
obe+0.044
 who+0.043
Neg logit contributions
 tow-0.108
lee-0.106
 delight-0.103
 excitement-0.102
 anticipation-0.099
Token They
Token ID1119
Feature activation+3.13e-03

Pos logit contributions
 who+0.012
Deal+0.012
 if+0.012
mun+0.011
 but+0.011
Neg logit contributions
 tow-0.024
lee-0.022
 delight-0.021
 recess-0.020
 excitement-0.020
Token both
Token ID1111
Feature activation+1.14e-02

Pos logit contributions
 who+0.004
 if+0.004
 but+0.003
 with+0.003
 like+0.003
Neg logit contributions
 tow-0.007
 recess-0.006
alks-0.006
lee-0.006
 accuracy-0.006
Token had
Token ID550
Feature activation+2.58e-03

Pos logit contributions
 who+0.016
 if+0.014
 but+0.013
Deal+0.013
 with+0.013
Neg logit contributions
 tow-0.023
 recess-0.020
lee-0.020
alks-0.020
 response-0.019
Token a
Token ID257
Feature activation+1.62e-02

Pos logit contributions
 who+0.003
 if+0.002
 but+0.002
 with+0.002
 without+0.002
Neg logit contributions
 tow-0.006
lee-0.005
 recess-0.005
alks-0.005
 response-0.005
Token Make
Token ID6889
Feature activation+2.38e-02

Pos logit contributions
Deal+0.014
 who+0.013
 if+0.013
mun+0.012
 without+0.012
Neg logit contributions
 tow-0.021
lee-0.019
 delight-0.019
 excitement-0.018
 recess-0.018
Token sure
Token ID1654
Feature activation-1.18e-02

Pos logit contributions
 who+0.033
Deal+0.031
 if+0.031
 but+0.029
 without+0.028
Neg logit contributions
 tow-0.045
lee-0.042
 delight-0.040
 recess-0.039
 excitement-0.038
Token you
Token ID345
Feature activation+7.02e-03

Pos logit contributions
 tow+0.023
lee+0.020
 delight+0.020
 recess+0.019
 excitement+0.019
Neg logit contributions
Deal-0.016
 who-0.014
mun-0.014
pp-0.014
 but-0.013
Token carry
Token ID3283
Feature activation+2.29e-02

Pos logit contributions
Deal+0.009
 who+0.009
 if+0.008
mun+0.008
 but+0.008
Neg logit contributions
 tow-0.013
lee-0.012
 delight-0.012
 excitement-0.012
 farewell-0.011
Token your
Token ID534
Feature activation+9.38e-03

Pos logit contributions
Deal+0.033
 who+0.033
 if+0.031
mun+0.030
 but+0.029
Neg logit contributions
 tow-0.041
lee-0.037
 delight-0.036
 recess-0.035
 excitement-0.035
Token net
Token ID2010
Feature activation+6.35e-02

Pos logit contributions
 who+0.016
 but+0.015
 if+0.015
 with+0.014
Deal+0.014
Neg logit contributions
 tow-0.015
lee-0.014
 recess-0.013
lights-0.012
 July-0.012
Token so
Token ID523
Feature activation+2.24e-02

Pos logit contributions
Deal+0.074
mun+0.070
 who+0.067
obe+0.056
 if+0.056
Neg logit contributions
 tow-0.129
lee-0.121
 delight-0.117
 excitement-0.115
 farewell-0.114
Token you
Token ID345
Feature activation+1.17e-02

Pos logit contributions
 who+0.026
 if+0.024
Deal+0.024
 but+0.022
 without+0.022
Neg logit contributions
 tow-0.048
lee-0.045
 recess-0.042
 delight-0.042
 excitement-0.041
Token don
Token ID836
Feature activation+2.43e-02

Pos logit contributions
 who+0.014
Deal+0.014
 if+0.013
mun+0.013
 but+0.013
Neg logit contributions
 tow-0.023
lee-0.021
 delight-0.020
 excitement-0.020
 recess-0.019
Tokenâ
Token ID22940
Feature activation+5.43e-02

Pos logit contributions
Deal+0.035
 who+0.035
 if+0.032
 but+0.030
mun+0.029
Neg logit contributions
 tow-0.045
lee-0.041
 delight-0.039
 excitement-0.037
 recess-0.036
Token€
Token ID26391
Feature activation-1.40e-02

Pos logit contributions
 who+0.106
 if+0.100
 but+0.095
mun+0.091
 without+0.090
Neg logit contributions
 tow-0.072
 recess-0.063
lee-0.061
 delight-0.061
 farewell-0.059
Token use
Token ID779
Feature activation+2.94e-02

Pos logit contributions
 who+0.003
Deal+0.003
 if+0.003
mun+0.003
 but+0.003
Neg logit contributions
 tow-0.005
 delight-0.004
lee-0.004
 recess-0.004
 excitement-0.004
Token the
Token ID262
Feature activation+6.71e-03

Pos logit contributions
 if+0.041
 who+0.041
 with+0.039
 but+0.038
 without+0.036
Neg logit contributions
 tow-0.055
lee-0.054
chie-0.049
ived-0.049
 recess-0.048
Token veil
Token ID30615
Feature activation+7.31e-03

Pos logit contributions
 who+0.012
Deal+0.012
 if+0.012
mun+0.011
 without+0.011
Neg logit contributions
 tow-0.009
lee-0.008
 delight-0.008
 excitement-0.008
 recess-0.008
Token to
Token ID284
Feature activation-3.10e-03

Pos logit contributions
Deal+0.008
 who+0.007
mun+0.007
 if+0.007
 but+0.006
Neg logit contributions
 tow-0.016
lee-0.015
 recess-0.014
 delight-0.014
 excitement-0.014
Token play
Token ID711
Feature activation-9.97e-03

Pos logit contributions
 tow+0.006
lee+0.005
 delight+0.005
 recess+0.005
 excitement+0.005
Neg logit contributions
Deal-0.004
 who-0.004
 if-0.004
mun-0.004
 but-0.004
Token fairy
Token ID25607
Feature activation+5.13e-02

Pos logit contributions
 tow+0.023
lee+0.022
 recess+0.022
 delight+0.021
 excitement+0.021
Neg logit contributions
Deal-0.009
 who-0.008
 if-0.008
mun-0.007
pp-0.007
Token and
Token ID290
Feature activation-9.36e-04

Pos logit contributions
Deal+0.071
 if+0.064
 who+0.063
mun+0.061
 but+0.060
Neg logit contributions
 tow-0.094
lee-0.088
 recess-0.084
 delight-0.083
 response-0.079
Token prince
Token ID19716
Feature activation+3.80e-03

Pos logit contributions
 tow+0.002
lee+0.002
 delight+0.002
 recess+0.002
 excitement+0.002
Neg logit contributions
 who-0.001
 if-0.001
Deal-0.001
 but-0.001
 without-0.001
Token.
Token ID13
Feature activation+5.31e-03

Pos logit contributions
 who+0.003
Deal+0.003
 if+0.003
 but+0.003
 without+0.003
Neg logit contributions
 tow-0.009
lee-0.008
 recess-0.008
 delight-0.008
 excitement-0.008
Token They
Token ID1119
Feature activation-2.45e-03

Pos logit contributions
 who+0.005
 if+0.005
 but+0.005
 with+0.004
Deal+0.004
Neg logit contributions
 tow-0.012
lee-0.012
 response-0.011
 recess-0.011
 delight-0.011
Token have
Token ID423
Feature activation-2.26e-03

Pos logit contributions
 tow+0.005
lee+0.004
 delight+0.004
 recess+0.004
 excitement+0.004
Neg logit contributions
 who-0.003
Deal-0.003
 if-0.003
 but-0.003
mun-0.003
Token liked
Token ID8288
Feature activation+2.92e-03

Pos logit contributions
 tow+0.011
lee+0.010
 delight+0.009
 excitement+0.009
 recess+0.009
Neg logit contributions
 who-0.006
 if-0.006
Deal-0.006
 without-0.006
 but-0.006
Token Spot
Token ID15899
Feature activation+2.47e-02

Pos logit contributions
 who+0.004
 if+0.004
 but+0.003
Deal+0.003
 with+0.003
Neg logit contributions
 tow-0.006
lee-0.005
 delight-0.005
 recess-0.005
 excitement-0.005
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
Deal+0.027
pp+0.025
 if+0.024
but+0.024
 who+0.023
Neg logit contributions
 tow-0.049
lee-0.045
 delight-0.044
 excitement-0.043
 response-0.042
Token They
Token ID1119
Feature activation-3.59e-03

Pos logit contributions
 who+0.015
 if+0.014
Deal+0.014
 but+0.014
 with+0.013
Neg logit contributions
 tow-0.028
lee-0.028
 recess-0.026
 delight-0.025
 excitement-0.025
Token took
Token ID1718
Feature activation+1.61e-02

Pos logit contributions
 tow+0.008
lee+0.007
 delight+0.007
 excitement+0.007
 recess+0.007
Neg logit contributions
 who-0.004
 if-0.004
Deal-0.004
 without-0.003
 but-0.003
Token the
Token ID262
Feature activation+1.10e-02

Pos logit contributions
 with+0.020
 if+0.020
 who+0.020
 without+0.018
 but+0.017
Neg logit contributions
lee-0.033
 tow-0.031
alks-0.029
 recess-0.029
lights-0.029
Token rake
Token ID47885
Feature activation+2.17e-02

Pos logit contributions
 who+0.019
Deal+0.019
 if+0.018
 without+0.017
 but+0.017
Neg logit contributions
 tow-0.017
lee-0.014
 delight-0.014
 recess-0.014
 excitement-0.014
Token and
Token ID290
Feature activation-5.00e-03

Pos logit contributions
Deal+0.020
 who+0.018
mun+0.017
 if+0.015
pp+0.014
Neg logit contributions
 tow-0.052
lee-0.047
 recess-0.046
 delight-0.045
 excitement-0.044
Token threw
Token ID9617
Feature activation-7.44e-04

Pos logit contributions
 tow+0.009
lee+0.008
 delight+0.008
 recess+0.008
 response+0.008
Neg logit contributions
 who-0.007
 if-0.007
Deal-0.007
 but-0.007
 with-0.007
Token a
Token ID257
Feature activation-1.19e-02

Pos logit contributions
 tow+0.001
lee+0.001
 recess+0.001
 delight+0.001
 July+0.001
Neg logit contributions
 who-0.001
 if-0.001
 with-0.001
 but-0.001
 without-0.001
Token stick
Token ID4859
Feature activation+1.28e-02

Pos logit contributions
lee+0.023
 tow+0.022
 recess+0.021
 delight+0.020
 anticipation+0.020
Neg logit contributions
 who-0.016
 if-0.015
 but-0.015
 with-0.015
 without-0.014
Token cover
Token ID3002
Feature activation-5.03e-03

Pos logit contributions
 who+0.000
Deal+0.000
 if+0.000
mun+0.000
 but+0.000
Neg logit contributions
 tow-0.000
lee-0.000
 delight-0.000
 excitement-0.000
 recess-0.000
Token the
Token ID262
Feature activation+1.96e-03

Pos logit contributions
 tow+0.008
lee+0.007
 delight+0.007
 excitement+0.006
 recess+0.006
Neg logit contributions
Deal-0.009
 who-0.008
mun-0.008
 if-0.008
pp-0.008
Token pillow
Token ID28774
Feature activation-4.85e-03

Pos logit contributions
 who+0.004
Deal+0.004
 if+0.004
mun+0.003
 but+0.003
Neg logit contributions
 tow-0.003
 delight-0.002
lee-0.002
 excitement-0.002
 recess-0.002
Token was
Token ID373
Feature activation-2.30e-03

Pos logit contributions
 tow+0.010
 delight+0.009
 excitement+0.009
lee+0.009
 anticipation+0.009
Neg logit contributions
Deal-0.006
mun-0.006
 who-0.005
iw-0.005
obe-0.005
Token a
Token ID257
Feature activation+2.68e-03

Pos logit contributions
 tow+0.005
lee+0.004
 delight+0.004
 excitement+0.004
 recess+0.004
Neg logit contributions
 who-0.003
Deal-0.003
 if-0.003
 but-0.003
mun-0.003
Token pillow
Token ID28774
Feature activation+8.59e-03

Pos logit contributions
 who+0.004
 if+0.004
Deal+0.004
 but+0.004
 without+0.004
Neg logit contributions
 tow-0.004
lee-0.004
 recess-0.004
 delight-0.004
 excitement-0.004
Tokencase
Token ID7442
Feature activation+1.15e-02

Pos logit contributions
Deal+0.010
mun+0.009
 who+0.008
obe+0.008
 if+0.008
Neg logit contributions
 tow-0.018
 delight-0.016
lee-0.016
 excitement-0.015
 response-0.015
Token.
Token ID13
Feature activation+5.76e-03

Pos logit contributions
mun+0.015
Deal+0.015
mons+0.012
ouncing+0.012
 who+0.012
Neg logit contributions
 tow-0.024
 delight-0.020
 excitement-0.020
 response-0.020
 anticipation-0.020
Token Her
Token ID2332
Feature activation+1.80e-02

Pos logit contributions
 who+0.007
Deal+0.007
 if+0.007
mun+0.006
pp+0.006
Neg logit contributions
 tow-0.012
lee-0.011
 delight-0.010
 excitement-0.010
 recess-0.010
Token mom
Token ID1995
Feature activation+2.07e-02

Pos logit contributions
 who+0.034
 if+0.032
Deal+0.032
 but+0.031
 with+0.031
Neg logit contributions
 tow-0.026
lee-0.023
 recess-0.022
 July-0.020
alks-0.020
Tokenmy
Token ID1820
Feature activation+1.86e-02

Pos logit contributions
Deal+0.026
 if+0.023
 who+0.022
 without+0.022
 but+0.021
Neg logit contributions
 tow-0.042
lee-0.039
 delight-0.035
 excitement-0.034
ened-0.034
Token up
Token ID510
Feature activation-6.36e-04

Pos logit contributions
 tow+0.030
lee+0.027
 delight+0.027
 excitement+0.026
 recess+0.026
Neg logit contributions
Deal-0.018
 who-0.018
 if-0.016
 but-0.016
mun-0.015
Token toy
Token ID13373
Feature activation-4.66e-03

Pos logit contributions
 tow+0.001
lee+0.001
 delight+0.001
 recess+0.001
 excitement+0.001
Neg logit contributions
 who-0.001
Deal-0.001
 if-0.001
 but-0.001
 without-0.001
Token guns
Token ID6541
Feature activation+1.61e-02

Pos logit contributions
 tow+0.007
lee+0.007
 recess+0.006
 delight+0.006
 excitement+0.006
Neg logit contributions
 who-0.008
Deal-0.008
 if-0.008
 but-0.007
 without-0.007
Token.
Token ID13
Feature activation+7.71e-03

Pos logit contributions
Deal+0.017
mun+0.015
 who+0.013
iw+0.012
obe+0.012
Neg logit contributions
 tow-0.035
lee-0.032
 delight-0.031
 recess-0.030
 farewell-0.030
Token They
Token ID1119
Feature activation-1.48e-03

Pos logit contributions
 who+0.009
 if+0.008
Deal+0.008
 but+0.008
 with+0.007
Neg logit contributions
 tow-0.017
lee-0.016
 delight-0.015
 recess-0.015
 response-0.014
Token waved
Token ID26834
Feature activation+6.31e-03

Pos logit contributions
 tow+0.003
lee+0.003
 delight+0.003
 recess+0.003
 response+0.002
Neg logit contributions
 who-0.002
 if-0.002
Deal-0.002
 but-0.002
 with-0.002
Token to
Token ID284
Feature activation-4.98e-03

Pos logit contributions
 who+0.008
 if+0.008
 but+0.008
 with+0.007
 without+0.007
Neg logit contributions
 tow-0.014
lee-0.013
 recess-0.012
alks-0.012
 July-0.012
Token their
Token ID511
Feature activation-4.37e-03

Pos logit contributions
 tow+0.009
lee+0.008
 recess+0.008
ived+0.007
 delight+0.007
Neg logit contributions
 who-0.008
 if-0.007
 but-0.007
 with-0.007
Deal-0.007
Token mom
Token ID1995
Feature activation+3.12e-02

Pos logit contributions
 tow+0.007
lee+0.006
alled+0.006
isel+0.006
ived+0.006
Neg logit contributions
 who-0.008
 with-0.007
 but-0.007
 if-0.007
Deal-0.007
Token and
Token ID290
Feature activation-8.58e-03

Pos logit contributions
Deal+0.019
 who+0.017
 if+0.016
mun+0.014
 but+0.013
Neg logit contributions
 tow-0.083
lee-0.078
 delight-0.074
 recess-0.073
 excitement-0.073
Token said
Token ID531
Feature activation-2.03e-03

Pos logit contributions
 tow+0.014
lee+0.013
 recess+0.012
 delight+0.012
 response+0.012
Neg logit contributions
 who-0.014
 if-0.014
Deal-0.013
 but-0.013
 with-0.013
Token had
Token ID550
Feature activation-9.61e-03

Pos logit contributions
 who+0.002
 if+0.002
Deal+0.002
 but+0.001
 without+0.001
Neg logit contributions
 tow-0.003
lee-0.002
 delight-0.002
 recess-0.002
 excitement-0.002
Token been
Token ID587
Feature activation-8.86e-03

Pos logit contributions
 tow+0.023
lee+0.022
 delight+0.021
 recess+0.021
 excitement+0.020
Neg logit contributions
 who-0.009
 if-0.008
Deal-0.008
 but-0.007
 with-0.007
Token nice
Token ID3621
Feature activation+1.93e-02

Pos logit contributions
 tow+0.019
lee+0.017
alks+0.016
 delight+0.016
lights+0.016
Neg logit contributions
 who-0.012
 if-0.011
 with-0.011
Deal-0.010
 without-0.010
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
Deal+0.018
 who+0.018
 if+0.016
mun+0.016
pp+0.015
Neg logit contributions
 tow-0.044
lee-0.041
 recess-0.040
 delight-0.039
 excitement-0.038
Token But
Token ID887
Feature activation+2.83e-03

Pos logit contributions
 who+0.021
Deal+0.021
 if+0.020
mun+0.019
 but+0.019
Neg logit contributions
 tow-0.038
lee-0.034
 delight-0.032
 recess-0.032
 excitement-0.032
Token it
Token ID340
Feature activation+3.21e-03

Pos logit contributions
 who+0.004
Deal+0.004
 if+0.004
mun+0.004
 but+0.004
Neg logit contributions
 tow-0.005
lee-0.005
 delight-0.004
 recess-0.004
 excitement-0.004
Token was
Token ID373
Feature activation+1.03e-02

Pos logit contributions
 who+0.004
Deal+0.004
 if+0.004
pp+0.003
 but+0.003
Neg logit contributions
 tow-0.007
lee-0.006
 delight-0.006
 excitement-0.006
 recess-0.006
Token too
Token ID1165
Feature activation+2.24e-02

Pos logit contributions
 who+0.010
 if+0.009
 but+0.008
Deal+0.008
 without+0.008
Neg logit contributions
 tow-0.025
lee-0.024
 delight-0.023
 recess-0.022
 excitement-0.022
Token late
Token ID2739
Feature activation+3.26e-03

Pos logit contributions
 who+0.024
 if+0.022
Deal+0.021
 but+0.020
 without+0.020
Neg logit contributions
 tow-0.050
lee-0.047
 delight-0.045
 recess-0.045
 excitement-0.044
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
mun+0.005
Deal+0.005
pp+0.004
 who+0.004
iw+0.004
Neg logit contributions
 tow-0.005
 recess-0.005
lee-0.005
 farewell-0.005
 response-0.005
Token They
Token ID1119
Feature activation+6.51e-03

Pos logit contributions
Deal+0.029
 who+0.029
 if+0.028
mun+0.026
 but+0.026
Neg logit contributions
 tow-0.039
lee-0.035
 delight-0.033
 recess-0.032
 excitement-0.032
Token
Token ID198
Feature activation-4.13e-03

Pos logit contributions
 tow+0.004
lee+0.004
 recess+0.004
 delight+0.004
 anticipation+0.003
Neg logit contributions
 who-0.002
 if-0.002
 with-0.002
Deal-0.001
 but-0.001
Token"
Token ID1
Feature activation-5.46e-03

Pos logit contributions
 tow+0.008
lee+0.008
 recess+0.007
 delight+0.007
 excitement+0.007
Neg logit contributions
 who-0.006
Deal-0.006
 if-0.006
 with-0.005
 but-0.005
TokenYou
Token ID1639
Feature activation-2.77e-02

Pos logit contributions
 tow+0.012
lee+0.011
 delight+0.011
 recess+0.010
 excitement+0.010
Neg logit contributions
 who-0.006
 if-0.006
Deal-0.005
 but-0.005
 without-0.005
Token're
Token ID821
Feature activation+2.52e-02

Pos logit contributions
 tow+0.053
lee+0.050
 delight+0.047
 excitement+0.046
 pride+0.046
Neg logit contributions
 who-0.036
Deal-0.036
 if-0.034
 but-0.032
mun-0.032
Token welcome
Token ID7062
Feature activation+4.19e-02

Pos logit contributions
 who+0.035
 if+0.034
Deal+0.033
 but+0.031
 with+0.031
Neg logit contributions
 tow-0.048
lee-0.045
 recess-0.042
 delight-0.042
 excitement-0.040
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
Deal+0.041
 who+0.040
 if+0.037
mun+0.036
 but+0.033
Neg logit contributions
 tow-0.096
lee-0.089
 delight-0.085
 recess-0.083
 excitement-0.083
Token Tim
Token ID5045
Feature activation-2.25e-03

Pos logit contributions
 who+0.022
 if+0.021
Deal+0.021
 but+0.020
 without+0.020
Neg logit contributions
 tow-0.022
lee-0.020
 delight-0.018
 recess-0.018
 excitement-0.018
Tokenmy
Token ID1820
Feature activation+1.54e-02

Pos logit contributions
 tow+0.005
lee+0.005
 response+0.004
 delight+0.004
 recess+0.004
Neg logit contributions
Deal-0.002
 who-0.002
mun-0.002
mons-0.002
 without-0.002
Token,"
Token ID553
Feature activation+5.61e-03

Pos logit contributions
Deal+0.018
mun+0.017
 who+0.016
but+0.015
mons+0.015
Neg logit contributions
 tow-0.029
lee-0.028
 delight-0.026
 excitement-0.026
 response-0.025
Token his
Token ID465
Feature activation-1.63e-02

Pos logit contributions
 who+0.011
Deal+0.011
 if+0.011
 but+0.010
 without+0.010
Neg logit contributions
 tow-0.007
lee-0.007
 delight-0.006
 recess-0.006
 excitement-0.006
Token mom
Token ID1995
Feature activation-4.03e-03

Pos logit contributions
growth+0.028
 recess+0.028
andel+0.028
ived+0.028
alks+0.028
Neg logit contributions
 who-0.024
 with-0.022
 if-0.022
 but-0.022
 not-0.021
Token.
Token ID13
Feature activation+6.88e-03

Pos logit contributions
 tow+0.056
 delight+0.053
 excitement+0.052
 farewell+0.052
 pride+0.051
Neg logit contributions
Deal-0.038
mons-0.035
iw-0.035
mun-0.035
pp-0.034
Token That
Token ID1320
Feature activation+6.65e-03

Pos logit contributions
 who+0.007
Deal+0.007
 if+0.007
mun+0.006
 without+0.006
Neg logit contributions
 tow-0.015
lee-0.014
 delight-0.013
 recess-0.013
 excitement-0.013
Token is
Token ID318
Feature activation+2.33e-02

Pos logit contributions
Deal+0.008
but+0.008
pp+0.008
iw+0.008
 who+0.008
Neg logit contributions
 tow-0.012
 delight-0.011
 excitement-0.011
 giant-0.010
 aliens-0.010
Token the
Token ID262
Feature activation+8.65e-03

Pos logit contributions
 who+0.031
 if+0.029
Deal+0.029
 but+0.027
 without+0.026
Neg logit contributions
 tow-0.046
lee-0.043
 delight-0.040
 recess-0.040
 excitement-0.039
Token most
Token ID749
Feature activation+3.84e-02

Pos logit contributions
 who+0.012
Deal+0.012
 if+0.011
pp+0.011
mun+0.011
Neg logit contributions
 tow-0.016
 delight-0.013
 response-0.013
 excitement-0.013
 pride-0.013
Token important
Token ID1593
Feature activation-2.61e-02

Pos logit contributions
Deal+0.063
 who+0.063
pp+0.061
 if+0.061
mun+0.060
Neg logit contributions
 tow-0.060
lee-0.050
 delight-0.050
 excitement-0.048
 recess-0.045
Token thing
Token ID1517
Feature activation+6.99e-03

Pos logit contributions
 tow+0.035
 excitement+0.026
 delight+0.026
 passengers+0.026
 farewell+0.025
Neg logit contributions
Deal-0.049
pp-0.048
 who-0.047
mun-0.047
 if-0.045
Token.
Token ID13
Feature activation+6.67e-03

Pos logit contributions
Deal+0.008
mun+0.007
obe+0.007
pp+0.007
 who+0.007
Neg logit contributions
 tow-0.014
lee-0.012
 recess-0.012
 farewell-0.011
 delight-0.011
Token Now
Token ID2735
Feature activation-3.56e-03

Pos logit contributions
 who+0.007
Deal+0.007
 if+0.007
 but+0.006
mun+0.006
Neg logit contributions
 tow-0.015
lee-0.013
 delight-0.013
 recess-0.012
 excitement-0.012
Token,
Token ID11
Feature activation+2.11e-02

Pos logit contributions
 tow+0.007
 delight+0.007
lee+0.006
 excitement+0.006
 farewell+0.006
Neg logit contributions
Deal-0.004
mun-0.004
 who-0.004
pp-0.004
iw-0.004
Token go
Token ID467
Feature activation+2.32e-02

Pos logit contributions
Deal+0.028
mun+0.028
pp+0.026
 who+0.026
obe+0.024
Neg logit contributions
 tow-0.040
 delight-0.034
 response-0.033
 excitement-0.033
 recess-0.032
Token mistake
Token ID7457
Feature activation+2.19e-02

Pos logit contributions
 tow+0.006
 delight+0.005
lee+0.005
 recess+0.005
 excitement+0.005
Neg logit contributions
Deal-0.005
 who-0.005
 if-0.005
pp-0.004
mun-0.004
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
Deal+0.030
mun+0.025
iw+0.024
pp+0.024
 who+0.022
Neg logit contributions
 tow-0.042
lee-0.039
 recess-0.036
lights-0.035
 delight-0.035
Token They
Token ID1119
Feature activation+6.15e-04

Pos logit contributions
 who+0.013
 if+0.012
Deal+0.012
 but+0.011
 without+0.011
Neg logit contributions
 tow-0.022
lee-0.021
 delight-0.020
 recess-0.020
 excitement-0.019
Token wished
Token ID16555
Feature activation-1.89e-02

Pos logit contributions
 who+0.001
 if+0.001
Deal+0.001
 but+0.001
 without+0.001
Neg logit contributions
 tow-0.001
lee-0.001
 delight-0.001
 recess-0.001
 excitement-0.001
Token they
Token ID484
Feature activation+1.82e-04

Pos logit contributions
 tow+0.034
lee+0.031
 recess+0.029
 delight+0.028
 excitement+0.028
Neg logit contributions
Deal-0.028
 who-0.027
 if-0.026
pp-0.025
mun-0.025
Token had
Token ID550
Feature activation-1.03e-02

Pos logit contributions
 who+0.000
 if+0.000
Deal+0.000
 but+0.000
 without+0.000
Neg logit contributions
 tow-0.000
lee-0.000
 delight-0.000
 recess-0.000
 excitement-0.000
Token listened
Token ID16399
Feature activation-1.23e-02

Pos logit contributions
 tow+0.024
lee+0.022
 delight+0.021
 recess+0.021
 excitement+0.021
Neg logit contributions
 who-0.010
 if-0.009
Deal-0.009
 but-0.009
 without-0.008
Token to
Token ID284
Feature activation+5.41e-03

Pos logit contributions
 tow+0.027
lee+0.026
 delight+0.024
 recess+0.024
 excitement+0.024
Neg logit contributions
 who-0.013
Deal-0.012
 if-0.012
 but-0.011
 without-0.010
Token Jen
Token ID13364
Feature activation+1.65e-02

Pos logit contributions
 who+0.010
 if+0.010
 but+0.009
Deal+0.009
 with+0.009
Neg logit contributions
 tow-0.008
lee-0.007
 recess-0.007
 delight-0.007
ived-0.006
Token and
Token ID290
Feature activation+7.04e-03

Pos logit contributions
Deal+0.022
iw+0.018
mun+0.017
mons+0.017
iders+0.015
Neg logit contributions
 tow-0.031
lee-0.030
 response-0.028
 passengers-0.027
 farewell-0.027
Token not
Token ID407
Feature activation-9.06e-03

Pos logit contributions
Deal+0.011
 if+0.010
 who+0.010
pp+0.010
mun+0.010
Neg logit contributions
 tow-0.012
lee-0.010
 recess-0.010
 delight-0.010
 excitement-0.010
Token.
Token ID13
Feature activation-6.66e-03

Pos logit contributions
 tow+0.028
lee+0.026
stretched+0.024
lights+0.024
ened+0.023
Neg logit contributions
Deal-0.018
mun-0.014
 if-0.014
 who-0.013
pp-0.013
Token Lily
Token ID20037
Feature activation+1.96e-02

Pos logit contributions
 tow+0.012
lee+0.011
 recess+0.010
 delight+0.010
 excitement+0.010
Neg logit contributions
 who-0.010
 if-0.010
Deal-0.010
 but-0.009
 without-0.009
Token said
Token ID531
Feature activation+7.17e-03

Pos logit contributions
Deal+0.030
 if+0.027
pp+0.026
mun+0.025
iw+0.025
Neg logit contributions
lee-0.034
 tow-0.031
 delight-0.029
 excitement-0.028
 tentacles-0.026
Token,
Token ID11
Feature activation+2.96e-03

Pos logit contributions
 who+0.007
Deal+0.007
 if+0.007
mun+0.006
 but+0.006
Neg logit contributions
 tow-0.016
lee-0.015
 delight-0.014
 excitement-0.014
 recess-0.014
Token "
Token ID366
Feature activation+2.70e-03

Pos logit contributions
Deal+0.005
 who+0.004
 if+0.004
mun+0.004
 without+0.004
Neg logit contributions
 tow-0.005
lee-0.005
 delight-0.004
 excitement-0.004
 response-0.004
TokenMom
Token ID29252
Feature activation-2.97e-02

Pos logit contributions
 who+0.004
 if+0.004
Deal+0.004
 but+0.004
 without+0.004
Neg logit contributions
 tow-0.005
lee-0.004
 delight-0.004
 recess-0.004
 excitement-0.004
Tokenmy
Token ID1820
Feature activation-1.22e-02

Pos logit contributions
 tow+0.066
lee+0.063
 delight+0.058
 response+0.057
 excitement+0.055
Neg logit contributions
Deal-0.031
 who-0.028
mun-0.026
 if-0.024
 without-0.024
Token,
Token ID11
Feature activation-3.52e-02

Pos logit contributions
 tow+0.023
lee+0.022
 delight+0.020
 excitement+0.019
 response+0.019
Neg logit contributions
Deal-0.016
 who-0.015
mun-0.014
but-0.014
 if-0.014
Token that
Token ID326
Feature activation-1.24e-02

Pos logit contributions
 tow+0.075
lee+0.068
 delight+0.066
 recess+0.065
 excitement+0.064
Neg logit contributions
Deal-0.040
 who-0.039
 if-0.037
mun-0.035
 but-0.034
Token tree
Token ID5509
Feature activation+1.57e-02

Pos logit contributions
 tow+0.017
 giant+0.015
 excitement+0.015
 delight+0.015
 gust+0.015
Neg logit contributions
pp-0.021
Deal-0.021
 who-0.019
but-0.019
obe-0.019
Token is
Token ID318
Feature activation+1.06e-02

Pos logit contributions
Deal+0.017
 who+0.015
 if+0.015
mun+0.015
pp+0.014
Neg logit contributions
 tow-0.033
lee-0.030
lights-0.028
 delight-0.028
 gust-0.028
Token out
Token ID503
Feature activation+8.40e-03

Pos logit contributions
Deal+0.024
mun+0.021
 who+0.020
pp+0.019
 if+0.018
Neg logit contributions
 tow-0.033
 recess-0.030
lee-0.029
 excitement-0.027
 delight-0.027
Token and
Token ID290
Feature activation+5.96e-03

Pos logit contributions
Deal+0.010
mun+0.008
pp+0.007
 who+0.007
mons+0.006
Neg logit contributions
 tow-0.019
lee-0.018
 recess-0.017
 excitement-0.017
 delight-0.017
Token play
Token ID711
Feature activation-2.87e-02

Pos logit contributions
Deal+0.010
 who+0.010
 if+0.009
pp+0.009
mun+0.009
Neg logit contributions
 tow-0.010
 delight-0.008
lee-0.008
 excitement-0.008
 recess-0.008
Token,
Token ID11
Feature activation+8.06e-03

Pos logit contributions
 tow+0.060
 recess+0.057
 delight+0.056
 excitement+0.055
lee+0.052
Neg logit contributions
pp-0.031
Deal-0.029
ll-0.028
mun-0.028
iw-0.027
Token but
Token ID475
Feature activation-2.57e-03

Pos logit contributions
Deal+0.010
pp+0.009
mun+0.008
w+0.007
iw+0.007
Neg logit contributions
 excitement-0.017
 tow-0.016
 delight-0.016
 anticipation-0.015
 response-0.015
Token it
Token ID340
Feature activation-1.24e-02

Pos logit contributions
 tow+0.006
lee+0.005
 delight+0.005
 excitement+0.005
 recess+0.005
Neg logit contributions
 who-0.003
Deal-0.003
mun-0.002
pp-0.002
 if-0.002
Token was
Token ID373
Feature activation-8.04e-03

Pos logit contributions
 tow+0.025
lee+0.022
 delight+0.021
 excitement+0.021
 recess+0.020
Neg logit contributions
 who-0.016
 if-0.014
Deal-0.014
pp-0.014
 but-0.013
Token still
Token ID991
Feature activation+1.49e-03

Pos logit contributions
 tow+0.015
lee+0.014
 recess+0.013
 delight+0.013
 excitement+0.013
Neg logit contributions
 who-0.011
Deal-0.010
 if-0.010
mun-0.009
 but-0.009
Token too
Token ID1165
Feature activation+1.01e-02

Pos logit contributions
 who+0.002
Deal+0.002
mun+0.002
 if+0.002
pp+0.002
Neg logit contributions
 tow-0.003
lee-0.002
 recess-0.002
 excitement-0.002
 delight-0.002
Token cold
Token ID4692
Feature activation-9.51e-03

Pos logit contributions
 who+0.013
 if+0.013
Deal+0.012
 but+0.012
 without+0.012
Neg logit contributions
 tow-0.020
lee-0.019
 delight-0.018
 recess-0.017
 excitement-0.017
Token.
Token ID13
Feature activation+2.32e-03

Pos logit contributions
 recess+0.015
 tow+0.014
lee+0.013
lights+0.013
ived+0.013
Neg logit contributions
Deal-0.015
mun-0.014
iw-0.013
 who-0.013
pp-0.013
Token to
Token ID284
Feature activation-1.49e-02

Pos logit contributions
 tow+0.015
lee+0.015
 delight+0.014
 excitement+0.014
 recess+0.013
Neg logit contributions
Deal-0.008
 who-0.007
 if-0.007
mun-0.006
pp-0.006
Token help
Token ID1037
Feature activation-6.25e-03

Pos logit contributions
 tow+0.028
lee+0.026
 recess+0.025
lights+0.025
 delight+0.024
Neg logit contributions
 who-0.021
 if-0.020
 with-0.018
 but-0.018
 without-0.017
Token her
Token ID607
Feature activation-1.21e-02

Pos logit contributions
 tow+0.011
lee+0.010
 delight+0.010
 recess+0.009
 excitement+0.009
Neg logit contributions
Deal-0.009
 who-0.009
 if-0.009
mun-0.008
 but-0.008
Token mom
Token ID1995
Feature activation-1.59e-03

Pos logit contributions
 tow+0.017
lee+0.015
 delight+0.014
 excitement+0.014
 recess+0.013
Neg logit contributions
Deal-0.023
 who-0.022
 if-0.021
mun-0.021
pp-0.021
Tokenmy
Token ID1820
Feature activation-1.39e-03

Pos logit contributions
 tow+0.003
lee+0.003
 delight+0.003
 response+0.003
 excitement+0.003
Neg logit contributions
Deal-0.002
mun-0.002
iw-0.002
mons-0.002
w-0.001
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
 tow+0.003
lee+0.002
 delight+0.002
 excitement+0.002
ened+0.002
Neg logit contributions
Deal-0.002
mun-0.002
iw-0.002
pp-0.001
mons-0.001
Token so
Token ID523
Feature activation+7.49e-03

Pos logit contributions
 tow+0.034
lee+0.031
 delight+0.030
 excitement+0.030
 response+0.029
Neg logit contributions
Deal-0.013
pp-0.010
mun-0.010
obe-0.009
w-0.009
Token she
Token ID673
Feature activation-8.96e-03

Pos logit contributions
Deal+0.007
 who+0.007
 if+0.007
mun+0.006
 but+0.006
Neg logit contributions
 tow-0.017
lee-0.016
 delight-0.015
 excitement-0.015
 recess-0.015
Token went
Token ID1816
Feature activation-1.53e-05

Pos logit contributions
 tow+0.016
lee+0.015
 delight+0.013
 excitement+0.013
 recess+0.013
Neg logit contributions
 who-0.013
Deal-0.013
 if-0.013
 but-0.012
 without-0.012
Token to
Token ID284
Feature activation-9.63e-03

Pos logit contributions
 tow+0.000
lee+0.000
 recess+0.000
 delight+0.000
 excitement+0.000
Neg logit contributions
Deal-0.000
 who-0.000
 if-0.000
mun-0.000
 but-0.000
Token the
Token ID262
Feature activation+4.97e-03

Pos logit contributions
 tow+0.017
lee+0.015
 delight+0.014
 recess+0.014
 excitement+0.014
Neg logit contributions
 who-0.015
 if-0.014
Deal-0.014
 but-0.014
 without-0.013
Token stepped
Token ID10764
Feature activation-2.36e-03

Pos logit contributions
 tow+0.031
lee+0.029
 delight+0.027
 recess+0.027
 excitement+0.026
Neg logit contributions
 who-0.018
Deal-0.017
 if-0.017
 but-0.016
 without-0.015
Token closer
Token ID5699
Feature activation-3.66e-04

Pos logit contributions
 tow+0.004
lee+0.004
 delight+0.003
 recess+0.003
 excitement+0.003
Neg logit contributions
 who-0.004
Deal-0.004
 if-0.004
 but-0.003
mun-0.003
Token and
Token ID290
Feature activation+3.33e-03

Pos logit contributions
lee+0.001
 tow+0.001
 delight+0.001
 excitement+0.001
 farewell+0.001
Neg logit contributions
Deal-0.000
mun-0.000
pp-0.000
/-0.000
 who-0.000
Token saw
Token ID2497
Feature activation-4.60e-03

Pos logit contributions
Deal+0.005
 who+0.005
mun+0.005
 if+0.005
pp+0.005
Neg logit contributions
 tow-0.005
lee-0.005
 recess-0.005
 delight-0.005
 excitement-0.004
Token the
Token ID262
Feature activation-1.61e-02

Pos logit contributions
 tow+0.010
lee+0.009
 recess+0.009
 delight+0.009
 excitement+0.008
Neg logit contributions
 who-0.005
 if-0.005
Deal-0.005
 but-0.005
 with-0.005
Token body
Token ID1767
Feature activation-5.62e-02

Pos logit contributions
 tow+0.025
lee+0.023
 delight+0.021
 recess+0.021
 excitement+0.020
Neg logit contributions
 who-0.028
 if-0.027
Deal-0.027
 but-0.026
 without-0.025
Token was
Token ID373
Feature activation-2.07e-02

Pos logit contributions
 tow+0.117
lee+0.110
 delight+0.104
 excitement+0.101
 response+0.098
Neg logit contributions
Deal-0.069
 who-0.062
 if-0.060
mun-0.056
pp-0.054
Token still
Token ID991
Feature activation-1.09e-02

Pos logit contributions
 tow+0.044
lee+0.042
 recess+0.040
 delight+0.039
 farewell+0.038
Neg logit contributions
 who-0.024
 if-0.023
 but-0.022
 with-0.021
Deal-0.021
Token,
Token ID11
Feature activation-1.35e-03

Pos logit contributions
 tow+0.023
lee+0.022
 delight+0.021
 excitement+0.020
 recess+0.020
Neg logit contributions
Deal-0.013
 who-0.012
 if-0.011
mun-0.011
 but-0.010
Token so
Token ID523
Feature activation+1.38e-02

Pos logit contributions
 excitement+0.003
 delight+0.003
 tow+0.003
 anticipation+0.003
lee+0.003
Neg logit contributions
pp-0.002
Deal-0.002
mun-0.001
ll-0.001
mons-0.001
Token he
Token ID339
Feature activation-3.55e-03

Pos logit contributions
 who+0.019
 if+0.018
 but+0.017
 with+0.017
Deal+0.017
Neg logit contributions
 tow-0.027
lee-0.025
 recess-0.024
 delight-0.023
 response-0.023
Token.
Token ID13
Feature activation-7.97e-03

Pos logit contributions
 tow+0.043
lee+0.041
 delight+0.039
 recess+0.038
 excitement+0.038
Neg logit contributions
 who-0.019
Deal-0.018
 if-0.017
 but-0.016
mun-0.015
Token\n
Token ID198
Feature activation-8.29e-03

Pos logit contributions
 tow+0.016
lee+0.015
 delight+0.014
 excitement+0.014
 recess+0.014
Neg logit contributions
 who-0.010
Deal-0.010
 if-0.009
mun-0.009
pp-0.009
Token\n
Token ID198
Feature activation-1.27e-02

Pos logit contributions
 tow+0.017
lee+0.015
 delight+0.015
 recess+0.015
 excitement+0.014
Neg logit contributions
Deal-0.010
 who-0.010
 if-0.009
 but-0.008
pp-0.008
TokenA
Token ID32
Feature activation-2.29e-02

Pos logit contributions
 tow+0.022
lee+0.020
 delight+0.019
 recess+0.019
 excitement+0.018
Neg logit contributions
 who-0.019
Deal-0.018
 if-0.018
 but-0.017
mun-0.017
Token few
Token ID1178
Feature activation-1.98e-03

Pos logit contributions
lee+0.045
alks+0.041
alled+0.040
 tow+0.040
ened+0.040
Neg logit contributions
 who-0.034
 if-0.031
 with-0.030
 but-0.030
Deal-0.029
Token days
Token ID1528
Feature activation-6.98e-02

Pos logit contributions
 tow+0.003
alks+0.003
lee+0.003
 accuracy+0.002
 mast+0.002
Neg logit contributions
 who-0.004
 if-0.004
 with-0.004
 and-0.003
 without-0.003
Token later
Token ID1568
Feature activation-2.02e-02

Pos logit contributions
 tow+0.131
lee+0.122
 delight+0.114
 recess+0.112
 excitement+0.111
Neg logit contributions
 who-0.098
 if-0.095
Deal-0.091
 but-0.088
 without-0.087
Token,
Token ID11
Feature activation+1.16e-03

Pos logit contributions
 tow+0.045
lee+0.041
 recess+0.040
 delight+0.040
 excitement+0.039
Neg logit contributions
 who-0.021
Deal-0.020
 if-0.019
 but-0.017
mun-0.017
Token there
Token ID612
Feature activation+4.24e-03

Pos logit contributions
Deal+0.002
pp+0.002
 who+0.002
owing+0.002
mun+0.002
Neg logit contributions
 tow-0.001
 recess-0.001
 delight-0.001
 excitement-0.001
lee-0.001
Token was
Token ID373
Feature activation-1.98e-03

Pos logit contributions
Deal+0.005
 who+0.005
 if+0.004
pp+0.004
 but+0.004
Neg logit contributions
 tow-0.009
lee-0.008
 delight-0.008
 excitement-0.008
 recess-0.008
Token a
Token ID257
Feature activation-6.55e-03

Pos logit contributions
 tow+0.004
lee+0.004
 delight+0.004
 recess+0.004
 excitement+0.004
Neg logit contributions
 who-0.002
Deal-0.002
 if-0.002
 but-0.002
mun-0.002
Token side
Token ID1735
Feature activation-1.66e-02

Pos logit contributions
 who+0.017
Deal+0.017
 if+0.017
mun+0.016
 but+0.016
Neg logit contributions
 tow-0.017
lee-0.015
 delight-0.014
 recess-0.014
 excitement-0.014
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
 tow+0.037
lee+0.034
 delight+0.032
 recess+0.032
 excitement+0.032
Neg logit contributions
 who-0.018
Deal-0.018
 if-0.016
mun-0.015
 but-0.014
Token the
Token ID262
Feature activation-2.21e-02

Pos logit contributions
 tow+0.035
lee+0.031
 delight+0.030
 recess+0.029
 excitement+0.029
Neg logit contributions
 who-0.033
Deal-0.033
 if-0.032
 but-0.030
mun-0.029
Token cat
Token ID3797
Feature activation-3.69e-02

Pos logit contributions
 tow+0.039
lee+0.038
 recess+0.035
 delight+0.034
 farewell+0.032
Neg logit contributions
 who-0.034
 if-0.032
Deal-0.032
 but-0.031
 with-0.029
Token the
Token ID262
Feature activation-2.44e-02

Pos logit contributions
 tow+0.065
lee+0.060
 delight+0.057
 excitement+0.055
 recess+0.054
Neg logit contributions
 who-0.054
Deal-0.053
 if-0.052
 but-0.048
mun-0.047
Token other
Token ID584
Feature activation-5.87e-02

Pos logit contributions
 tow+0.041
lee+0.038
 recess+0.035
 delight+0.035
 excitement+0.033
Neg logit contributions
 who-0.039
 if-0.038
Deal-0.038
 but-0.036
 without-0.035
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
 tow+0.104
lee+0.093
 delight+0.088
 recess+0.085
 excitement+0.085
Neg logit contributions
 who-0.089
Deal-0.087
 if-0.085
 but-0.080
 without-0.078
Token Finally
Token ID9461
Feature activation-1.41e-03

Pos logit contributions
 tow+0.025
lee+0.023
 recess+0.022
 delight+0.022
 excitement+0.021
Neg logit contributions
 who-0.018
 if-0.017
Deal-0.017
 but-0.016
 with-0.015
Token,
Token ID11
Feature activation+4.15e-03

Pos logit contributions
 tow+0.004
lee+0.004
 accuracy+0.004
 recess+0.004
lights+0.004
Neg logit contributions
 if-0.001
 but-0.001
 with-0.001
 who-0.001
 without-0.001
Token the
Token ID262
Feature activation-1.00e-03

Pos logit contributions
 who+0.006
 if+0.006
Deal+0.006
 but+0.006
 without+0.005
Neg logit contributions
 tow-0.007
lee-0.007
 delight-0.006
 recess-0.006
 excitement-0.006
Token pile
Token ID14540
Feature activation-2.02e-02

Pos logit contributions
lee+0.002
 tow+0.002
 recess+0.002
 delight+0.001
alks+0.001
Neg logit contributions
 who-0.002
 if-0.001
Deal-0.001
 but-0.001
 with-0.001
Token naughty
Token ID45128
Feature activation-8.30e-03

Pos logit contributions
 tow+0.028
lee+0.026
 delight+0.024
 excitement+0.024
 recess+0.023
Neg logit contributions
Deal-0.025
 who-0.024
 if-0.023
mun-0.022
pp-0.022
Token.
Token ID13
Feature activation-4.43e-03

Pos logit contributions
 tow+0.020
lee+0.018
 delight+0.017
 recess+0.017
 excitement+0.017
Neg logit contributions
 who-0.007
Deal-0.007
 if-0.007
 but-0.006
mun-0.006
Token\n
Token ID198
Feature activation-1.45e-03

Pos logit contributions
 tow+0.009
lee+0.008
 delight+0.008
 excitement+0.008
 recess+0.008
Neg logit contributions
 who-0.005
Deal-0.005
 if-0.005
 but-0.005
 without-0.005
Token\n
Token ID198
Feature activation-7.88e-03

Pos logit contributions
 tow+0.003
lee+0.003
 recess+0.003
 delight+0.003
 excitement+0.003
Neg logit contributions
 who-0.001
 if-0.001
 but-0.001
Deal-0.001
 with-0.001
Token"
Token ID1
Feature activation-8.48e-03

Pos logit contributions
 tow+0.016
lee+0.015
 recess+0.014
 delight+0.014
 excitement+0.013
Neg logit contributions
 who-0.010
 if-0.010
 with-0.009
 but-0.009
Deal-0.009
TokenB
Token ID33
Feature activation-5.66e-02

Pos logit contributions
 tow+0.016
lee+0.015
 recess+0.014
 delight+0.014
 farewell+0.013
Neg logit contributions
 who-0.012
 if-0.012
Deal-0.011
 but-0.011
 with-0.011
Tokenizzy
Token ID40593
Feature activation-4.46e-02

Pos logit contributions
 tow+0.088
lee+0.083
 recess+0.074
 delight+0.073
 response+0.070
Neg logit contributions
 who-0.097
Deal-0.094
 if-0.092
 but-0.088
 without-0.086
Token,"
Token ID553
Feature activation-8.42e-03

Pos logit contributions
 tow+0.091
lee+0.089
 delight+0.079
 excitement+0.077
 recess+0.077
Neg logit contributions
Deal-0.054
 who-0.049
 if-0.048
mun-0.046
 but-0.043
Token M
Token ID337
Feature activation-3.25e-02

Pos logit contributions
 tow+0.016
lee+0.015
 delight+0.015
 excitement+0.014
 anticipation+0.014
Neg logit contributions
Deal-0.011
 who-0.011
 if-0.010
mun-0.010
 without-0.010
Tokenummy
Token ID13513
Feature activation+4.30e-03

Pos logit contributions
 tow+0.050
lee+0.042
 recess+0.042
 delight+0.041
 excitement+0.039
Neg logit contributions
 who-0.057
 if-0.056
Deal-0.054
 but-0.053
 with-0.051
Token said
Token ID531
Feature activation-3.36e-02

Pos logit contributions
Deal+0.005
 who+0.005
 if+0.005
 but+0.005
 without+0.005
Neg logit contributions
 tow-0.009
lee-0.008
 delight-0.007
 recess-0.007
 excitement-0.007
Token happy
Token ID3772
Feature activation-4.03e-03

Pos logit contributions
 tow+0.003
lee+0.003
 recess+0.003
alks+0.003
 response+0.003
Neg logit contributions
 who-0.001
 if-0.001
 with-0.001
 but-0.001
Deal-0.001
Token that
Token ID326
Feature activation+1.68e-02

Pos logit contributions
 tow+0.008
lee+0.008
 delight+0.008
 excitement+0.008
 recess+0.008
Neg logit contributions
Deal-0.004
pp-0.004
mun-0.004
 who-0.004
 if-0.003
Token the
Token ID262
Feature activation-1.49e-03

Pos logit contributions
 who+0.025
 if+0.024
Deal+0.024
 but+0.023
 without+0.022
Neg logit contributions
 tow-0.030
lee-0.028
 delight-0.026
 recess-0.026
 excitement-0.025
Token three
Token ID1115
Feature activation-2.96e-02

Pos logit contributions
 tow+0.002
lee+0.002
 recess+0.002
alks+0.002
 delight+0.002
Neg logit contributions
 who-0.003
 if-0.002
Deal-0.002
 but-0.002
 with-0.002
Token-
Token ID12
Feature activation-5.29e-02

Pos logit contributions
 tow+0.056
lee+0.051
 recess+0.050
 delight+0.048
 response+0.047
Neg logit contributions
 who-0.043
 if-0.039
 but-0.037
Deal-0.037
 with-0.037
Tokenyear
Token ID1941
Feature activation-8.30e-02

Pos logit contributions
 tow+0.070
lee+0.062
 delight+0.056
 recess+0.056
 excitement+0.053
Neg logit contributions
 who-0.103
 if-0.100
Deal-0.100
 but-0.096
 without-0.094
Token-
Token ID12
Feature activation-4.02e-02

Pos logit contributions
 tow+0.190
 recess+0.172
lee+0.168
 delight+0.163
 response+0.163
Neg logit contributions
 who-0.081
 with-0.077
 if-0.074
Deal-0.070
 but-0.070
Tokenold
Token ID727
Feature activation+1.15e-02

Pos logit contributions
 tow+0.068
lee+0.062
 recess+0.057
 delight+0.057
 excitement+0.055
Neg logit contributions
 who-0.063
 if-0.061
Deal-0.061
 but-0.059
 without-0.057
Token had
Token ID550
Feature activation-1.52e-02

Pos logit contributions
 who+0.013
Deal+0.013
 if+0.012
 but+0.012
 without+0.011
Neg logit contributions
 tow-0.024
lee-0.022
 delight-0.021
 recess-0.021
 excitement-0.020
Token changed
Token ID3421
Feature activation+1.44e-02

Pos logit contributions
 tow+0.031
lee+0.028
lights+0.027
 recess+0.027
 delight+0.027
Neg logit contributions
 who-0.020
 if-0.018
 with-0.018
 but-0.018
 without-0.018
Token their
Token ID511
Feature activation-1.20e-03

Pos logit contributions
Deal+0.017
 who+0.016
 if+0.015
mun+0.014
pp+0.014
Neg logit contributions
 tow-0.030
lee-0.028
 delight-0.027
 excitement-0.026
 recess-0.026