TOP ACTIVATIONS
MAX = 0.330

- a box of mult icol ored bubbles !
grandma 's house for safe keeping . When
it to me for safe keeping . He told me to
and Ben sit at their desks . They look at the
The teacher comes to their desks . She smiles at their
saw lots of books , desks and chairs . Everyone was
friends sitting at the new desks . She sees her old
. They sit at their desks and draw with cr ay
, where there were many desks and chairs , and a
clos ets , and two desks . It also has a
door and saw lots of desks and people working . Mary
. They sit at their desks and wait for the exam
, some buttons , some speakers , and some masks .
! All the chairs and desks were empty . Carol was
classroom . All of the desks were in dis array and
grown - ups sitting at desks and typing on computers .
She saw a lot of desks , chairs , papers ,
. She has made their beds . She has left a
to look at the other magazines together , and find more
. They have their own beds in the same room .

MIN ACTIVATIONS
MIN = -0.224

it back ! It 'll only bring us more trouble if
for help , but he only bar ks and l icks
about the town . They only remember their friendship . <|endoftext|>
is a big one for only two dollars . â
no matches . She has only one card left . She
use my rod . But only for a little while .
You only get this cake for one
Sarah reminded him that he only had a little bit of
carefully . It is the only one we have ."
with at school . She only had one toy , a
too big . We can only look at it ."
He says yes , but only if they can measure his
It â s only shiny rocks . â
does not move . It only makes a mess .
Not only that , being wealthy also
V icky . She was only 3 years old and loved
with the desk , but only with the toys and books
remember , a bike is only a gift . A friend
not real . It is only a picture . Do you
the trail again . They only painted on paper . They

INTERVAL 0.192 - 0.330
CONTAINS 0.223%

a great time in the attic , even though it was
and felt very happy . From that day forward , Tom
She leapt out of her bed and ran outside . She
to be friends again . From then on , they always
and hugged each other . From that day on , they

INTERVAL 0.053 - 0.192
CONTAINS 11.787%

: Try hard and ask for help when you need it
. We can glue the beads and wrap the cater pillar
, and a secret door appeared ! The rabbit hopped inside
Tom my loved playing catch with Dad and he
a little bit into their cups . They cl ink ed

INTERVAL -0.085 - 0.053
CONTAINS 81.463%

tell her mom my because she was afraid she would get
a special pink medicine that will make you feel better right
. <|endoftext|> Once upon a time , there was a little
back to his house quickly and share the fish with his
too quiet ! Dr ums are fun and loud !" Tom

INTERVAL -0.224 - -0.085
CONTAINS 6.528%

every day . Teddy was no longer lonely because he had
a new dinosaur ." <|endoftext|> Once upon a time , there
Har ley didn 't like this answer . He
frame , the screw wouldn 't go in the wall .
named Stretch y . Stretch y was made of glue and
Token -
Token ID532
Feature activation+9.89e-02

Pos logit contributions
 loser+0.038
in+0.038
 thing+0.036
 problem+0.035
 am+0.034
Neg logit contributions
 magazines-0.092
keeping-0.092
icol-0.086
 parcels-0.085
 envelop-0.084
Token a
Token ID257
Feature activation+3.16e-02

Pos logit contributions
keeping+0.237
 magazines+0.224
icol+0.210
 envelop+0.203
 parcels+0.202
Neg logit contributions
 loser-0.131
 thing-0.130
in-0.129
 best-0.126
 friend-0.124
Token box
Token ID3091
Feature activation+4.35e-02

Pos logit contributions
 magazines+0.073
keeping+0.072
icol+0.066
 envelop+0.066
 parcels+0.065
Neg logit contributions
 loser-0.043
in-0.043
 problem-0.039
 thing-0.038
 am-0.038
Token of
Token ID286
Feature activation+2.13e-03

Pos logit contributions
 magazines+0.105
keeping+0.104
icol+0.095
 parcels+0.094
 computers+0.094
Neg logit contributions
in-0.055
 loser-0.055
 thing-0.051
 problem-0.051
 am-0.051
Token mult
Token ID1963
Feature activation+1.57e-01

Pos logit contributions
keeping+0.005
icol+0.004
 magazines+0.004
 parcels+0.004
aries+0.004
Neg logit contributions
 thing-0.003
 problem-0.003
 best-0.003
 loser-0.003
 frog-0.003
Tokenicol
Token ID27045
Feature activation+3.30e-01

Pos logit contributions
keeping+0.126
 magazines+0.122
nam+0.106
 computers+0.098
 parcels+0.079
Neg logit contributions
 thing-0.434
 friend-0.432
 loser-0.428
 problem-0.424
in-0.419
Tokenored
Token ID1850
Feature activation+7.61e-02

Pos logit contributions
 magazines+0.586
keeping+0.581
 computers+0.514
 parcels+0.505
 videos+0.504
Neg logit contributions
in-0.660
 loser-0.628
 problem-0.597
 thing-0.586
a-0.580
Token bubbles
Token ID25037
Feature activation+2.18e-03

Pos logit contributions
keeping+0.152
 magazines+0.146
icol+0.141
nam+0.137
 parcels+0.131
Neg logit contributions
 thing-0.129
in-0.118
 problem-0.118
 friend-0.115
 loser-0.112
Token!
Token ID0
Feature activation+2.15e-02

Pos logit contributions
 magazines+0.005
keeping+0.005
icol+0.005
 envelop+0.005
 parcels+0.005
Neg logit contributions
 loser-0.003
in-0.003
 thing-0.003
 problem-0.003
 am-0.002
Token
Token ID198
Feature activation-3.38e-02

Pos logit contributions
keeping+0.052
 magazines+0.050
 envelop+0.048
 parcels+0.048
 computers+0.047
Neg logit contributions
'm-0.025
Only-0.024
 friend-0.024
 am-0.023
 thing-0.023
Token
Token ID198
Feature activation-3.38e-02

Pos logit contributions
in+0.029
 am+0.026
 loser+0.024
 friend+0.023
a+0.023
Neg logit contributions
 magazines-0.098
keeping-0.097
icol-0.091
 envelop-0.090
 parcels-0.090
Token grandma
Token ID49890
Feature activation+1.72e-01

Pos logit contributions
 loser+0.050
in+0.048
 problem+0.043
 thing+0.042
 am+0.041
Neg logit contributions
 magazines-0.154
keeping-0.154
icol-0.142
 envelop-0.141
 parcels-0.141
Token's
Token ID338
Feature activation-4.86e-02

Pos logit contributions
keeping+0.480
 magazines+0.469
icol+0.462
 parcels+0.438
bons+0.437
Neg logit contributions
in-0.148
 friend-0.141
 loser-0.135
 problem-0.129
 best-0.127
Token house
Token ID2156
Feature activation+7.86e-02

Pos logit contributions
 loser+0.062
 friend+0.062
 problem+0.061
in+0.059
 thing+0.059
Neg logit contributions
keeping-0.115
 magazines-0.113
icol-0.106
 parcels-0.103
bons-0.103
Token for
Token ID329
Feature activation+6.00e-02

Pos logit contributions
 magazines+0.203
keeping+0.202
icol+0.196
 parcels+0.188
 envelop+0.187
Neg logit contributions
 loser-0.079
in-0.078
 friend-0.075
 thing-0.075
 problem-0.071
Token safe
Token ID3338
Feature activation+6.55e-02

Pos logit contributions
icol+0.146
 magazines+0.138
keeping+0.135
 illustrations+0.129
 counters+0.128
Neg logit contributions
 problem-0.069
 first-0.068
 friend-0.068
 thing-0.067
 best-0.066
Tokenkeeping
Token ID19934
Feature activation+3.29e-01

Pos logit contributions
 magazines+0.132
icol+0.129
 illustrations+0.125
 parcels+0.124
nam+0.122
Neg logit contributions
 thing-0.101
 friend-0.094
 loser-0.092
 best-0.090
 am-0.089
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
 magazines+0.874
keeping+0.843
icol+0.814
 parcels+0.801
 envelop+0.795
Neg logit contributions
 loser-0.336
in-0.317
 am-0.294
 thing-0.293
 friend-0.287
Token
Token ID220
Feature activation+1.16e-02

Pos logit contributions
 friend+0.020
'm+0.019
 am+0.018
 thing+0.018
Only+0.018
Neg logit contributions
 magazines-0.036
keeping-0.035
 envelop-0.032
 candles-0.032
 speakers-0.032
Token
Token ID198
Feature activation-3.56e-02

Pos logit contributions
 magazines+0.032
keeping+0.032
icol+0.030
 envelop+0.029
 parcels+0.029
Neg logit contributions
in-0.011
 loser-0.010
 am-0.009
 friend-0.009
a-0.009
Token
Token ID198
Feature activation-3.64e-02

Pos logit contributions
in+0.034
 loser+0.034
 am+0.030
 friend+0.029
 thing+0.029
Neg logit contributions
 magazines-0.097
keeping-0.096
icol-0.089
 envelop-0.088
 parcels-0.088
TokenWhen
Token ID2215
Feature activation+1.91e-02

Pos logit contributions
 loser+0.051
in+0.051
 thing+0.046
 am+0.046
 friend+0.045
Neg logit contributions
 magazines-0.084
keeping-0.083
icol-0.076
 envelop-0.075
 parcels-0.074
Token it
Token ID340
Feature activation-3.40e-02

Pos logit contributions
in+0.003
 thing+0.003
 loser+0.003
 friend+0.003
a+0.003
Neg logit contributions
 magazines-0.005
keeping-0.005
icol-0.004
 parcels-0.004
 computers-0.004
Token to
Token ID284
Feature activation+2.91e-02

Pos logit contributions
 friend+0.036
 thing+0.035
in+0.035
 loser+0.034
 best+0.031
Neg logit contributions
keeping-0.090
icol-0.085
 magazines-0.085
 computers-0.080
 parcels-0.079
Token me
Token ID502
Feature activation+1.33e-02

Pos logit contributions
keeping+0.065
 magazines+0.063
icol+0.062
 parcels+0.056
 computers+0.055
Neg logit contributions
 loser-0.043
in-0.043
 friend-0.042
 thing-0.040
 person-0.039
Token for
Token ID329
Feature activation+5.77e-02

Pos logit contributions
keeping+0.034
 magazines+0.033
icol+0.032
 parcels+0.030
 envelop+0.030
Neg logit contributions
in-0.015
 loser-0.015
 friend-0.014
 thing-0.014
a-0.013
Token safe
Token ID3338
Feature activation+6.56e-02

Pos logit contributions
icol+0.133
 magazines+0.126
keeping+0.125
 parcels+0.113
 counters+0.113
Neg logit contributions
 loser-0.081
 friend-0.077
 thing-0.075
 problem-0.075
in-0.074
Tokenkeeping
Token ID19934
Feature activation+3.18e-01

Pos logit contributions
icol+0.100
 magazines+0.097
 brand+0.091
 parcels+0.087
 illustrations+0.086
Neg logit contributions
 thing-0.137
 loser-0.135
 friend-0.129
in-0.128
 person-0.126
Token.
Token ID13
Feature activation-3.60e-03

Pos logit contributions
 magazines+0.807
keeping+0.794
icol+0.742
 parcels+0.732
 envelop+0.732
Neg logit contributions
 loser-0.364
in-0.348
 thing-0.318
 problem-0.316
 am-0.315
Token He
Token ID679
Feature activation-1.70e-02

Pos logit contributions
 loser+0.005
 am+0.004
 thing+0.004
'm+0.004
in+0.004
Neg logit contributions
keeping-0.009
 magazines-0.009
icol-0.008
 envelop-0.008
 computers-0.008
Token told
Token ID1297
Feature activation+1.32e-02

Pos logit contributions
in+0.022
 loser+0.022
 am+0.021
 friend+0.021
 problem+0.021
Neg logit contributions
keeping-0.040
 magazines-0.039
icol-0.036
 computers-0.035
 envelop-0.035
Token me
Token ID502
Feature activation+1.30e-02

Pos logit contributions
keeping+0.029
 magazines+0.029
icol+0.028
 parcels+0.026
 computers+0.025
Neg logit contributions
in-0.019
 thing-0.019
 loser-0.019
 friend-0.019
 problem-0.018
Token to
Token ID284
Feature activation+2.91e-02

Pos logit contributions
keeping+0.032
icol+0.031
 magazines+0.031
 envelop+0.029
 parcels+0.029
Neg logit contributions
in-0.017
 loser-0.016
 thing-0.015
 friend-0.015
 am-0.015
Token and
Token ID290
Feature activation+5.30e-02

Pos logit contributions
 loser+0.012
in+0.012
 problem+0.011
 am+0.011
 friend+0.010
Neg logit contributions
keeping-0.024
 magazines-0.024
 envelop-0.021
icol-0.021
 parcels-0.021
Token Ben
Token ID3932
Feature activation-2.94e-03

Pos logit contributions
keeping+0.115
 magazines+0.112
icol+0.104
 parcels+0.104
bons+0.101
Neg logit contributions
 loser-0.082
in-0.078
 friend-0.075
 thing-0.075
 problem-0.075
Token sit
Token ID1650
Feature activation-1.52e-03

Pos logit contributions
 loser+0.004
in+0.004
 problem+0.004
 thing+0.004
 friend+0.003
Neg logit contributions
 magazines-0.007
keeping-0.007
 envelop-0.006
icol-0.006
 parcels-0.006
Token at
Token ID379
Feature activation-2.83e-02

Pos logit contributions
 loser+0.002
in+0.002
 thing+0.002
 problem+0.002
 friend+0.002
Neg logit contributions
 magazines-0.004
keeping-0.004
icol-0.003
 envelop-0.003
 parcels-0.003
Token their
Token ID511
Feature activation+6.99e-02

Pos logit contributions
 friend+0.033
 thing+0.033
 loser+0.033
 frog+0.031
 end+0.031
Neg logit contributions
icol-0.066
keeping-0.066
 magazines-0.064
 parcels-0.064
 counters-0.063
Token desks
Token ID44301
Feature activation+2.99e-01

Pos logit contributions
keeping+0.149
 magazines+0.147
icol+0.134
 envelop+0.132
 parcels+0.132
Neg logit contributions
 loser-0.108
in-0.106
 problem-0.101
 thing-0.100
 friend-0.099
Token.
Token ID13
Feature activation+6.66e-03

Pos logit contributions
keeping+0.829
 magazines+0.822
icol+0.818
 parcels+0.773
 envelop+0.754
Neg logit contributions
 loser-0.250
in-0.248
 thing-0.217
 friend-0.213
 best-0.212
Token They
Token ID1119
Feature activation+6.48e-02

Pos logit contributions
keeping+0.015
 magazines+0.014
icol+0.014
bons+0.014
years+0.013
Neg logit contributions
 loser-0.009
 frog-0.009
 thing-0.008
 am-0.008
 friend-0.008
Token look
Token ID804
Feature activation-2.69e-02

Pos logit contributions
 magazines+0.159
keeping+0.158
icol+0.144
 envelop+0.144
 parcels+0.143
Neg logit contributions
 loser-0.080
in-0.078
 problem-0.071
 thing-0.070
 am-0.069
Token at
Token ID379
Feature activation-3.29e-02

Pos logit contributions
in+0.025
 problem+0.024
 loser+0.023
 best+0.023
 thing+0.023
Neg logit contributions
keeping-0.078
 magazines-0.072
icol-0.069
bons-0.069
 parcels-0.068
Token the
Token ID262
Feature activation+1.80e-02

Pos logit contributions
in+0.048
 loser+0.048
 thing+0.047
 friend+0.045
 best+0.044
Neg logit contributions
keeping-0.072
 magazines-0.069
icol-0.068
 parcels-0.065
 envelop-0.064
TokenThe
Token ID464
Feature activation+1.92e-02

Pos logit contributions
 loser+0.027
 problem+0.025
in+0.024
 best+0.024
 thing+0.023
Neg logit contributions
keeping-0.051
 magazines-0.051
 envelop-0.047
 computers-0.047
 parcels-0.047
Token teacher
Token ID4701
Feature activation-6.40e-03

Pos logit contributions
 magazines+0.049
keeping+0.047
 envelop+0.044
 computers+0.044
 parcels+0.043
Neg logit contributions
 loser-0.022
in-0.022
 am-0.019
 problem-0.019
 thing-0.019
Token comes
Token ID2058
Feature activation+5.29e-03

Pos logit contributions
 loser+0.010
in+0.009
 thing+0.009
 problem+0.009
 am+0.009
Neg logit contributions
 magazines-0.014
keeping-0.014
 envelop-0.012
icol-0.012
 parcels-0.012
Token to
Token ID284
Feature activation+3.50e-02

Pos logit contributions
 magazines+0.013
keeping+0.013
icol+0.012
 parcels+0.011
 envelop+0.011
Neg logit contributions
 loser-0.007
in-0.007
 friend-0.006
 thing-0.006
 problem-0.006
Token their
Token ID511
Feature activation+5.77e-02

Pos logit contributions
 magazines+0.078
keeping+0.076
icol+0.071
 parcels+0.069
bons+0.068
Neg logit contributions
 loser-0.051
in-0.051
 am-0.048
 thing-0.048
 friend-0.047
Token desks
Token ID44301
Feature activation+2.90e-01

Pos logit contributions
keeping+0.109
 magazines+0.106
icol+0.101
bons+0.096
 parcels+0.096
Neg logit contributions
 loser-0.100
 friend-0.100
in-0.098
 thing-0.098
 problem-0.098
Token.
Token ID13
Feature activation-8.09e-04

Pos logit contributions
keeping+0.777
 magazines+0.773
icol+0.744
 parcels+0.717
 envelop+0.705
Neg logit contributions
 loser-0.285
in-0.280
 problem-0.247
 thing-0.246
 am-0.244
Token She
Token ID1375
Feature activation+5.67e-02

Pos logit contributions
 loser+0.001
 am+0.001
in+0.001
 thing+0.001
 problem+0.001
Neg logit contributions
keeping-0.002
 magazines-0.002
icol-0.002
 envelop-0.002
 parcels-0.002
Token smiles
Token ID21845
Feature activation+9.89e-02

Pos logit contributions
keeping+0.140
 magazines+0.137
icol+0.126
 envelop+0.124
 parcels+0.123
Neg logit contributions
 loser-0.071
in-0.070
 problem-0.065
 am-0.064
 thing-0.062
Token at
Token ID379
Feature activation-3.69e-02

Pos logit contributions
 magazines+0.227
keeping+0.217
 envelop+0.202
 videos+0.198
icol+0.197
Neg logit contributions
 loser-0.140
in-0.135
 problem-0.122
 am-0.122
 thing-0.119
Token their
Token ID511
Feature activation+5.45e-02

Pos logit contributions
in+0.060
 loser+0.059
 thing+0.055
 friend+0.054
 am+0.054
Neg logit contributions
 magazines-0.076
keeping-0.076
icol-0.071
 parcels-0.069
 envelop-0.068
Token saw
Token ID2497
Feature activation+9.26e-02

Pos logit contributions
in+0.102
 loser+0.101
 am+0.091
 problem+0.091
 friend+0.090
Neg logit contributions
keeping-0.226
 magazines-0.226
icol-0.206
 envelop-0.204
 parcels-0.202
Token lots
Token ID6041
Feature activation+9.81e-02

Pos logit contributions
keeping+0.203
 magazines+0.195
icol+0.190
bons+0.178
 parcels+0.176
Neg logit contributions
in-0.138
 thing-0.132
 loser-0.131
 friend-0.130
 best-0.129
Token of
Token ID286
Feature activation-1.54e-03

Pos logit contributions
keeping+0.271
 magazines+0.266
icol+0.252
 envelop+0.246
 parcels+0.244
Neg logit contributions
in-0.093
 loser-0.091
 thing-0.081
 problem-0.080
 friend-0.077
Token books
Token ID3835
Feature activation+1.85e-01

Pos logit contributions
in+0.002
 problem+0.002
 best+0.002
 thing+0.002
 loser+0.002
Neg logit contributions
keeping-0.004
icol-0.004
 magazines-0.003
bons-0.003
 envelop-0.003
Token,
Token ID11
Feature activation-2.03e-02

Pos logit contributions
 magazines+0.450
keeping+0.450
icol+0.414
 envelop+0.408
 parcels+0.406
Neg logit contributions
 loser-0.229
in-0.227
 problem-0.205
 thing-0.203
 am-0.202
Token desks
Token ID44301
Feature activation+2.84e-01

Pos logit contributions
 best+0.028
 thing+0.027
 loser+0.027
in+0.026
 friend+0.026
Neg logit contributions
keeping-0.045
icol-0.044
 magazines-0.041
bons-0.039
 envelop-0.038
Token and
Token ID290
Feature activation+3.67e-02

Pos logit contributions
keeping+0.759
 magazines+0.750
icol+0.717
 parcels+0.688
 envelop+0.686
Neg logit contributions
in-0.270
 loser-0.263
 thing-0.238
 problem-0.238
 best-0.233
Token chairs
Token ID18791
Feature activation+2.07e-01

Pos logit contributions
icol+0.064
keeping+0.063
etti+0.052
bons+0.049
ulum+0.047
Neg logit contributions
 best-0.070
 thing-0.068
 friend-0.068
 problem-0.066
 other-0.066
Token.
Token ID13
Feature activation-1.72e-02

Pos logit contributions
 magazines+0.533
keeping+0.529
icol+0.499
 parcels+0.485
 envelop+0.484
Neg logit contributions
in-0.218
 loser-0.216
 problem-0.196
 thing-0.194
 am-0.193
Token Everyone
Token ID11075
Feature activation+4.75e-02

Pos logit contributions
 loser+0.021
 thing+0.020
 friend+0.019
 best+0.019
 problem+0.019
Neg logit contributions
keeping-0.040
 magazines-0.039
icol-0.038
 envelop-0.036
bons-0.035
Token was
Token ID373
Feature activation+1.23e-02

Pos logit contributions
 magazines+0.112
keeping+0.109
 computers+0.101
 envelop+0.101
 parcels+0.100
Neg logit contributions
 loser-0.068
in-0.057
 problem-0.056
 am-0.054
 thing-0.054
Token friends
Token ID2460
Feature activation-2.23e-02

Pos logit contributions
keeping+0.133
 magazines+0.128
icol+0.121
 parcels+0.117
 envelop+0.115
Neg logit contributions
 loser-0.117
 problem-0.112
in-0.111
 thing-0.111
 friend-0.110
Token sitting
Token ID5586
Feature activation+3.82e-02

Pos logit contributions
 loser+0.034
in+0.034
 problem+0.031
 thing+0.031
 am+0.031
Neg logit contributions
 magazines-0.048
keeping-0.048
icol-0.043
 envelop-0.043
 parcels-0.042
Token at
Token ID379
Feature activation-4.00e-02

Pos logit contributions
keeping+0.099
 magazines+0.098
icol+0.093
 parcels+0.090
 envelop+0.090
Neg logit contributions
in-0.044
 loser-0.043
 thing-0.038
 problem-0.038
 best-0.037
Token the
Token ID262
Feature activation+6.39e-03

Pos logit contributions
 end+0.052
 first+0.051
 thing+0.050
 am+0.048
 loser+0.047
Neg logit contributions
icol-0.094
keeping-0.091
 parcels-0.089
 counters-0.087
 magazines-0.086
Token new
Token ID649
Feature activation+2.99e-02

Pos logit contributions
keeping+0.014
 magazines+0.013
icol+0.013
 parcels+0.012
 envelop+0.012
Neg logit contributions
 loser-0.010
 thing-0.009
 problem-0.009
in-0.009
 friend-0.009
Token desks
Token ID44301
Feature activation+2.83e-01

Pos logit contributions
keeping+0.054
 magazines+0.053
icol+0.048
 parcels+0.048
 envelop+0.047
Neg logit contributions
 loser-0.056
in-0.055
 thing-0.053
 problem-0.053
 friend-0.052
Token.
Token ID13
Feature activation-2.26e-03

Pos logit contributions
keeping+0.728
 magazines+0.723
icol+0.689
 parcels+0.665
 envelop+0.661
Neg logit contributions
 loser-0.311
in-0.306
 thing-0.274
 problem-0.274
 am-0.269
Token She
Token ID1375
Feature activation+5.26e-02

Pos logit contributions
 loser+0.004
in+0.004
 am+0.004
 thing+0.004
 problem+0.004
Neg logit contributions
keeping-0.004
 magazines-0.004
icol-0.004
 envelop-0.004
 parcels-0.004
Token sees
Token ID7224
Feature activation+6.85e-02

Pos logit contributions
 magazines+0.123
keeping+0.121
icol+0.111
 envelop+0.110
 parcels+0.109
Neg logit contributions
 loser-0.072
in-0.070
 thing-0.064
 problem-0.064
 am-0.063
Token her
Token ID607
Feature activation+5.25e-03

Pos logit contributions
keeping+0.139
 magazines+0.133
icol+0.125
 parcels+0.120
bons+0.119
Neg logit contributions
in-0.115
 loser-0.113
 thing-0.107
 problem-0.106
 friend-0.105
Token old
Token ID1468
Feature activation+6.75e-02

Pos logit contributions
 magazines+0.011
keeping+0.011
 envelop+0.010
 computers+0.010
icol+0.010
Neg logit contributions
 loser-0.008
in-0.008
 am-0.007
 problem-0.007
 thing-0.007
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
keeping+0.370
 magazines+0.351
icol+0.345
 parcels+0.328
bons+0.327
Neg logit contributions
in-0.110
 loser-0.106
 friend-0.101
 best-0.099
 thing-0.099
Token They
Token ID1119
Feature activation+5.01e-02

Pos logit contributions
 loser+0.022
 am+0.020
 thing+0.020
in+0.020
 friend+0.020
Neg logit contributions
keeping-0.023
 magazines-0.023
icol-0.021
 parcels-0.020
 envelop-0.020
Token sit
Token ID1650
Feature activation-1.38e-02

Pos logit contributions
keeping+0.131
 magazines+0.130
icol+0.120
 envelop+0.119
 parcels+0.118
Neg logit contributions
 loser-0.055
in-0.053
 am-0.049
 thing-0.047
 problem-0.047
Token at
Token ID379
Feature activation-4.01e-02

Pos logit contributions
in+0.015
 loser+0.014
 friend+0.013
 thing+0.013
 best+0.013
Neg logit contributions
keeping-0.037
 magazines-0.036
icol-0.035
 parcels-0.034
 envelop-0.033
Token their
Token ID511
Feature activation+5.12e-02

Pos logit contributions
 thing+0.052
 friend+0.051
 loser+0.051
in+0.051
 best+0.049
Neg logit contributions
keeping-0.092
icol-0.091
 magazines-0.087
 parcels-0.084
 counters-0.083
Token desks
Token ID44301
Feature activation+2.83e-01

Pos logit contributions
keeping+0.100
 magazines+0.095
icol+0.087
 envelop+0.086
 parcels+0.086
Neg logit contributions
 loser-0.089
 problem-0.087
in-0.087
 friend-0.086
 thing-0.085
Token and
Token ID290
Feature activation+4.04e-02

Pos logit contributions
keeping+0.810
icol+0.777
 magazines+0.764
 parcels+0.730
nam+0.711
Neg logit contributions
in-0.232
 loser-0.228
 best-0.212
 thing-0.212
 problem-0.209
Token draw
Token ID3197
Feature activation+8.63e-02

Pos logit contributions
keeping+0.099
 magazines+0.095
icol+0.092
bons+0.086
 envelop+0.086
Neg logit contributions
 loser-0.050
 am-0.047
 thing-0.046
in-0.045
 problem-0.044
Token with
Token ID351
Feature activation-9.98e-03

Pos logit contributions
keeping+0.201
icol+0.181
nam+0.157
 magazines+0.157
 parcels+0.156
Neg logit contributions
 best-0.138
in-0.134
 thing-0.131
a-0.123
 friend-0.122
Token cr
Token ID1067
Feature activation+6.62e-02

Pos logit contributions
in+0.017
 thing+0.016
 loser+0.016
 best+0.016
 friend+0.016
Neg logit contributions
keeping-0.021
icol-0.019
 magazines-0.018
 parcels-0.017
bons-0.016
Tokenay
Token ID323
Feature activation+7.05e-02

Pos logit contributions
keeping+0.154
 magazines+0.146
nam+0.143
icol+0.139
 parcels+0.133
Neg logit contributions
in-0.092
 loser-0.085
a-0.081
 am-0.080
 friend-0.079
Token,
Token ID11
Feature activation-1.17e-02

Pos logit contributions
keeping+0.397
 magazines+0.390
icol+0.371
 parcels+0.358
 envelop+0.357
Neg logit contributions
 loser-0.171
in-0.169
 thing-0.163
 problem-0.156
 am-0.156
Token where
Token ID810
Feature activation+3.17e-03

Pos logit contributions
 best+0.013
 thing+0.013
 friend+0.013
 only+0.013
 loser+0.012
Neg logit contributions
keeping-0.031
icol-0.029
 magazines-0.026
bons-0.026
 parcels-0.025
Token there
Token ID612
Feature activation-4.34e-02

Pos logit contributions
keeping+0.008
icol+0.008
 magazines+0.007
bons+0.007
 envelop+0.007
Neg logit contributions
in-0.004
 loser-0.004
 thing-0.004
 best-0.004
 problem-0.004
Token were
Token ID547
Feature activation+5.06e-02

Pos logit contributions
 loser+0.061
in+0.058
 problem+0.053
 thing+0.052
 friend+0.051
Neg logit contributions
 magazines-0.101
keeping-0.099
 envelop-0.091
 computers-0.090
 parcels-0.090
Token many
Token ID867
Feature activation+7.80e-02

Pos logit contributions
keeping+0.107
icol+0.104
 magazines+0.098
bons+0.090
 parcels+0.090
Neg logit contributions
 best-0.082
 loser-0.081
 problem-0.078
 thing-0.077
 friend-0.076
Token desks
Token ID44301
Feature activation+2.82e-01

Pos logit contributions
keeping+0.143
icol+0.132
 magazines+0.117
 parcels+0.106
bons+0.106
Neg logit contributions
 problem-0.154
 thing-0.152
in-0.150
 best-0.149
 loser-0.148
Token and
Token ID290
Feature activation+4.45e-02

Pos logit contributions
keeping+0.788
 magazines+0.763
icol+0.759
nam+0.717
 parcels+0.717
Neg logit contributions
in-0.232
 loser-0.227
 am-0.215
 thing-0.214
 problem-0.209
Token chairs
Token ID18791
Feature activation+2.06e-01

Pos logit contributions
icol+0.075
keeping+0.073
amas+0.060
ulum+0.057
etti+0.057
Neg logit contributions
 thing-0.083
 am-0.079
 problem-0.077
 best-0.077
 one-0.077
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
 magazines+0.544
keeping+0.542
icol+0.521
 parcels+0.504
 envelop+0.499
Neg logit contributions
in-0.197
 loser-0.197
 am-0.180
 thing-0.179
 problem-0.177
Token and
Token ID290
Feature activation+4.49e-02

Pos logit contributions
 best+0.015
 thing+0.015
 loser+0.015
 only+0.014
 frog+0.014
Neg logit contributions
keeping-0.024
icol-0.023
 magazines-0.020
years-0.020
bons-0.020
Token a
Token ID257
Feature activation+3.52e-02

Pos logit contributions
icol+0.082
keeping+0.082
etti+0.066
bons+0.065
amas+0.065
Neg logit contributions
 best-0.081
 thing-0.080
 problem-0.079
 loser-0.079
 am-0.078
Token clos
Token ID3542
Feature activation+1.53e-01

Pos logit contributions
 loser+0.282
 problem+0.280
 best+0.273
 thing+0.268
in+0.265
Neg logit contributions
keeping-0.246
icol-0.216
 magazines-0.213
bons-0.193
 parcels-0.187
Tokenets
Token ID1039
Feature activation+7.64e-02

Pos logit contributions
keeping+0.281
 magazines+0.269
nam+0.264
icol+0.262
 brand+0.256
Neg logit contributions
 loser-0.265
 problem-0.258
in-0.242
 best-0.240
 winner-0.239
Token,
Token ID11
Feature activation-1.60e-02

Pos logit contributions
 magazines+0.205
keeping+0.203
icol+0.198
nam+0.193
 parcels+0.189
Neg logit contributions
 loser-0.065
 thing-0.064
 problem-0.063
 best-0.063
 am-0.062
Token and
Token ID290
Feature activation+4.30e-02

Pos logit contributions
 loser+0.027
 best+0.027
 thing+0.026
 frog+0.026
 only+0.026
Neg logit contributions
keeping-0.032
icol-0.030
 magazines-0.026
years-0.025
 brand-0.025
Token two
Token ID734
Feature activation-1.40e-01

Pos logit contributions
keeping+0.079
icol+0.078
 magazines+0.068
bons+0.065
amas+0.063
Neg logit contributions
 best-0.076
 loser-0.075
 thing-0.075
 problem-0.073
 am-0.073
Token desks
Token ID44301
Feature activation+2.82e-01

Pos logit contributions
 loser+0.275
 problem+0.272
 best+0.263
in+0.263
 thing+0.261
Neg logit contributions
keeping-0.248
 magazines-0.225
icol-0.221
bons-0.200
 parcels-0.196
Token.
Token ID13
Feature activation-5.84e-03

Pos logit contributions
keeping+0.803
 magazines+0.786
icol+0.778
 parcels+0.737
nam+0.730
Neg logit contributions
 loser-0.217
in-0.206
 problem-0.195
 am-0.194
 thing-0.190
Token It
Token ID632
Feature activation+2.51e-02

Pos logit contributions
 loser+0.008
 am+0.007
 frog+0.006
 only+0.006
 friend+0.006
Neg logit contributions
keeping-0.014
 magazines-0.013
icol-0.013
bons-0.013
 parcels-0.013
Token also
Token ID635
Feature activation-1.21e-01

Pos logit contributions
 magazines+0.061
keeping+0.058
 computers+0.055
 envelop+0.055
 parcels+0.054
Neg logit contributions
 loser-0.033
in-0.032
 thing-0.028
 friend-0.027
 problem-0.027
Token has
Token ID468
Feature activation+4.69e-02

Pos logit contributions
 am+0.148
 best+0.139
 problem+0.136
 loser+0.134
 thing+0.130
Neg logit contributions
keeping-0.311
icol-0.292
 magazines-0.287
 parcels-0.268
bons-0.268
Token a
Token ID257
Feature activation+3.10e-02

Pos logit contributions
icol+0.109
keeping+0.108
 magazines+0.099
 parcels+0.093
nam+0.092
Neg logit contributions
 loser-0.064
 best-0.062
 only-0.060
 thing-0.060
 friend-0.058
Token door
Token ID3420
Feature activation+1.16e-01

Pos logit contributions
keeping+0.013
 magazines+0.012
icol+0.011
bons+0.011
 videos+0.011
Neg logit contributions
 problem-0.010
 loser-0.009
 thing-0.009
in-0.009
 friend-0.009
Token and
Token ID290
Feature activation+3.73e-02

Pos logit contributions
keeping+0.298
 magazines+0.296
icol+0.273
 envelop+0.267
 parcels+0.266
Neg logit contributions
in-0.127
 loser-0.124
 thing-0.120
 problem-0.118
 friend-0.116
Token saw
Token ID2497
Feature activation+9.28e-02

Pos logit contributions
 magazines+0.093
keeping+0.092
icol+0.083
bons+0.083
 videos+0.083
Neg logit contributions
in-0.047
a-0.043
 friend-0.042
 thing-0.041
 am-0.041
Token lots
Token ID6041
Feature activation+9.71e-02

Pos logit contributions
keeping+0.204
 magazines+0.193
icol+0.189
bons+0.175
 videos+0.174
Neg logit contributions
in-0.139
 best-0.136
 thing-0.136
 friend-0.132
a-0.130
Token of
Token ID286
Feature activation+7.44e-04

Pos logit contributions
keeping+0.293
 magazines+0.276
icol+0.276
 parcels+0.258
nam+0.257
Neg logit contributions
in-0.075
 thing-0.066
 problem-0.063
 loser-0.062
 friend-0.059
Token desks
Token ID44301
Feature activation+2.82e-01

Pos logit contributions
keeping+0.002
icol+0.002
etti+0.001
bons+0.001
aries+0.001
Neg logit contributions
 problem-0.001
 thing-0.001
 best-0.001
in-0.001
 friend-0.001
Token and
Token ID290
Feature activation+3.76e-02

Pos logit contributions
keeping+0.754
 magazines+0.743
icol+0.726
 parcels+0.686
bons+0.681
Neg logit contributions
in-0.260
 loser-0.240
 thing-0.235
 problem-0.232
 best-0.231
Token people
Token ID661
Feature activation+6.69e-02

Pos logit contributions
keeping+0.056
icol+0.056
bons+0.043
etti+0.042
 magazines+0.041
Neg logit contributions
 best-0.078
 thing-0.078
 other-0.076
 problem-0.076
 friend-0.076
Token working
Token ID1762
Feature activation-1.47e-03

Pos logit contributions
 magazines+0.141
keeping+0.141
icol+0.132
bons+0.125
 envelop+0.125
Neg logit contributions
in-0.104
 loser-0.100
 problem-0.095
 am-0.095
 thing-0.094
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
in+0.002
 best+0.002
 thing+0.002
 problem+0.002
 loser+0.002
Neg logit contributions
 magazines-0.004
keeping-0.004
icol-0.003
 parcels-0.003
 illustrations-0.003
Token Mary
Token ID5335
Feature activation+4.76e-02

Pos logit contributions
 loser+0.019
 am+0.019
in+0.018
 thing+0.018
'm+0.018
Neg logit contributions
keeping-0.029
 magazines-0.029
icol-0.027
 envelop-0.026
 parcels-0.025
Token.
Token ID13
Feature activation-2.15e-03

Pos logit contributions
in+0.007
 loser+0.007
 friend+0.006
 thing+0.006
 problem+0.006
Neg logit contributions
keeping-0.021
 magazines-0.021
icol-0.021
 parcels-0.020
bons-0.019
Token They
Token ID1119
Feature activation+5.58e-02

Pos logit contributions
 loser+0.003
 am+0.003
 friend+0.003
 best+0.003
 only+0.003
Neg logit contributions
keeping-0.005
 magazines-0.005
 parcels-0.004
bons-0.004
 envelop-0.004
Token sit
Token ID1650
Feature activation-1.46e-02

Pos logit contributions
 magazines+0.126
keeping+0.118
 envelop+0.114
 parcels+0.111
 computers+0.110
Neg logit contributions
 loser-0.080
in-0.079
 thing-0.074
 problem-0.072
a-0.066
Token at
Token ID379
Feature activation-4.30e-02

Pos logit contributions
in+0.018
 loser+0.018
 thing+0.016
 friend+0.016
 best+0.015
Neg logit contributions
keeping-0.037
 magazines-0.036
icol-0.034
 computers-0.033
 parcels-0.033
Token their
Token ID511
Feature activation+4.89e-02

Pos logit contributions
 loser+0.058
in+0.057
 thing+0.057
 friend+0.056
 end+0.054
Neg logit contributions
keeping-0.096
icol-0.094
 magazines-0.093
 parcels-0.089
 counters-0.088
Token desks
Token ID44301
Feature activation+2.82e-01

Pos logit contributions
keeping+0.080
 magazines+0.077
icol+0.069
 parcels+0.068
 envelop+0.067
Neg logit contributions
 loser-0.101
in-0.098
 problem-0.097
 thing-0.095
 friend-0.094
Token and
Token ID290
Feature activation+4.59e-02

Pos logit contributions
keeping+0.758
 magazines+0.743
icol+0.726
 parcels+0.694
 computers+0.677
Neg logit contributions
in-0.275
 loser-0.274
 thing-0.239
 problem-0.235
 friend-0.235
Token wait
Token ID4043
Feature activation-1.17e-01

Pos logit contributions
keeping+0.103
 magazines+0.100
icol+0.096
 parcels+0.091
bons+0.090
Neg logit contributions
 loser-0.065
 am-0.061
in-0.060
 thing-0.060
 problem-0.058
Token for
Token ID329
Feature activation+5.74e-02

Pos logit contributions
in+0.124
 best+0.122
 loser+0.122
 thing+0.116
 friend+0.115
Neg logit contributions
keeping-0.313
 magazines-0.304
 parcels-0.282
bons-0.279
 envelop-0.278
Token the
Token ID262
Feature activation+6.81e-03

Pos logit contributions
keeping+0.146
icol+0.140
 magazines+0.131
 parcels+0.125
 counters+0.124
Neg logit contributions
 end-0.071
 friend-0.069
 thing-0.067
 loser-0.065
 first-0.065
Token exam
Token ID2814
Feature activation+2.04e-02

Pos logit contributions
keeping+0.016
 magazines+0.016
icol+0.014
 envelop+0.014
 parcels+0.014
Neg logit contributions
 loser-0.009
in-0.009
 problem-0.008
 thing-0.008
 am-0.008
Token,
Token ID11
Feature activation-3.74e-03

Pos logit contributions
 magazines+0.231
keeping+0.226
 envelop+0.207
 computers+0.206
 parcels+0.203
Neg logit contributions
 loser-0.154
in-0.146
 problem-0.135
 am-0.135
 friend-0.132
Token some
Token ID617
Feature activation+7.23e-03

Pos logit contributions
 thing+0.005
 loser+0.005
in+0.005
 am+0.005
 best+0.005
Neg logit contributions
keeping-0.009
icol-0.008
 magazines-0.008
 illustrations-0.007
bons-0.007
Token buttons
Token ID12163
Feature activation+5.15e-02

Pos logit contributions
 magazines+0.015
keeping+0.014
icol+0.013
 envelop+0.013
 parcels+0.013
Neg logit contributions
 loser-0.012
in-0.012
 problem-0.011
 thing-0.011
 am-0.011
Token,
Token ID11
Feature activation-3.54e-03

Pos logit contributions
 magazines+0.123
keeping+0.120
 envelop+0.112
 computers+0.111
 parcels+0.110
Neg logit contributions
 loser-0.067
in-0.064
 problem-0.057
 friend-0.056
 thing-0.056
Token some
Token ID617
Feature activation+5.54e-03

Pos logit contributions
 loser+0.004
 thing+0.004
 am+0.004
in+0.004
 best+0.004
Neg logit contributions
keeping-0.009
 magazines-0.008
icol-0.008
 parcels-0.007
 envelop-0.007
Token speakers
Token ID11636
Feature activation+2.82e-01

Pos logit contributions
keeping+0.011
 magazines+0.011
icol+0.010
 envelop+0.009
 parcels+0.009
Neg logit contributions
 loser-0.010
in-0.009
 problem-0.009
 thing-0.009
 am-0.009
Token,
Token ID11
Feature activation-5.67e-03

Pos logit contributions
keeping+0.746
 magazines+0.746
icol+0.690
 envelop+0.680
 parcels+0.677
Neg logit contributions
 loser-0.287
in-0.277
 thing-0.250
 problem-0.248
 am-0.246
Token and
Token ID290
Feature activation+5.06e-02

Pos logit contributions
 loser+0.007
 thing+0.007
 am+0.007
 best+0.007
 problem+0.007
Neg logit contributions
keeping-0.013
 magazines-0.012
icol-0.012
 parcels-0.011
 envelop-0.011
Token some
Token ID617
Feature activation+3.39e-03

Pos logit contributions
keeping+0.104
 magazines+0.098
icol+0.096
bons+0.087
 parcels+0.086
Neg logit contributions
 loser-0.081
 thing-0.079
 problem-0.078
 am-0.078
 best-0.077
Token masks
Token ID20680
Feature activation+1.49e-01

Pos logit contributions
keeping+0.007
 magazines+0.007
icol+0.006
 envelop+0.006
 parcels+0.006
Neg logit contributions
 loser-0.006
in-0.005
 problem-0.005
 thing-0.005
 am-0.005
Token.
Token ID13
Feature activation+4.11e-03

Pos logit contributions
 magazines+0.372
keeping+0.370
icol+0.340
 envelop+0.337
 parcels+0.336
Neg logit contributions
 loser-0.177
in-0.172
 problem-0.156
 thing-0.155
 am-0.154
Token!
Token ID0
Feature activation+1.84e-02

Pos logit contributions
keeping+0.034
 magazines+0.033
 envelop+0.032
 parcels+0.031
icol+0.031
Neg logit contributions
in-0.010
 thing-0.010
 problem-0.010
 best-0.010
 loser-0.009
Token All
Token ID1439
Feature activation-8.38e-02

Pos logit contributions
keeping+0.036
 magazines+0.033
 envelop+0.030
icol+0.030
 parcels+0.029
Neg logit contributions
 loser-0.032
 problem-0.031
 thing-0.031
 am-0.030
 best-0.030
Token the
Token ID262
Feature activation+2.51e-03

Pos logit contributions
 loser+0.129
in+0.121
 problem+0.112
 am+0.111
 thing+0.110
Neg logit contributions
 magazines-0.185
keeping-0.182
 computers-0.165
 envelop-0.164
 parcels-0.163
Token chairs
Token ID18791
Feature activation+2.05e-01

Pos logit contributions
keeping+0.005
 magazines+0.005
icol+0.005
 envelop+0.005
 parcels+0.005
Neg logit contributions
 loser-0.004
in-0.004
 problem-0.004
 thing-0.003
 friend-0.003
Token and
Token ID290
Feature activation+3.50e-02

Pos logit contributions
 magazines+0.484
keeping+0.481
icol+0.441
 envelop+0.437
 parcels+0.433
Neg logit contributions
 loser-0.271
in-0.266
 problem-0.245
 thing-0.244
 am-0.243
Token desks
Token ID44301
Feature activation+2.81e-01

Pos logit contributions
keeping+0.054
icol+0.053
 magazines+0.046
bons+0.044
etti+0.041
Neg logit contributions
 thing-0.074
 best-0.072
 problem-0.071
 loser-0.070
 person-0.069
Token were
Token ID547
Feature activation+4.40e-02

Pos logit contributions
 magazines+0.690
keeping+0.688
icol+0.651
 envelop+0.627
 parcels+0.625
Neg logit contributions
in-0.324
 loser-0.323
 am-0.303
 thing-0.302
 problem-0.299
Token empty
Token ID6565
Feature activation+1.23e-02

Pos logit contributions
keeping+0.097
 magazines+0.097
icol+0.093
bons+0.086
 parcels+0.086
Neg logit contributions
 loser-0.063
 problem-0.061
 best-0.060
in-0.060
 friend-0.059
Token.
Token ID13
Feature activation-2.05e-02

Pos logit contributions
 magazines+0.034
keeping+0.034
icol+0.033
 parcels+0.032
 envelop+0.032
Neg logit contributions
 problem-0.010
 thing-0.010
in-0.010
 loser-0.009
 best-0.009
Token Carol
Token ID5074
Feature activation+1.23e-01

Pos logit contributions
 loser+0.036
 problem+0.035
 thing+0.035
 best+0.035
 am+0.034
Neg logit contributions
keeping-0.039
 magazines-0.035
icol-0.033
 envelop-0.032
 brand-0.031
Token was
Token ID373
Feature activation+1.12e-02

Pos logit contributions
 magazines+0.319
keeping+0.316
icol+0.291
 envelop+0.287
 parcels+0.284
Neg logit contributions
in-0.137
 loser-0.129
a-0.125
 friend-0.124
 problem-0.120
Token classroom
Token ID15806
Feature activation+1.54e-01

Pos logit contributions
keeping+0.011
 magazines+0.011
 parcels+0.010
 envelop+0.010
icol+0.010
Neg logit contributions
 loser-0.007
in-0.006
 thing-0.006
 problem-0.006
 friend-0.006
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
keeping+0.429
 magazines+0.413
icol+0.406
 parcels+0.395
 envelop+0.387
Neg logit contributions
 thing-0.148
 loser-0.137
in-0.134
 friend-0.134
 am-0.125
Token All
Token ID1439
Feature activation-8.50e-02

Pos logit contributions
 thing+0.028
 loser+0.026
 am+0.025
 person+0.024
'm+0.024
Neg logit contributions
icol-0.047
keeping-0.046
 brand-0.042
 envelop-0.040
 magazines-0.039
Token of
Token ID286
Feature activation+4.06e-04

Pos logit contributions
 loser+0.126
in+0.124
 thing+0.114
 problem+0.114
 am+0.113
Neg logit contributions
 magazines-0.188
keeping-0.188
icol-0.171
 envelop-0.169
 parcels-0.167
Token the
Token ID262
Feature activation+8.28e-05

Pos logit contributions
icol+0.001
keeping+0.001
 magazines+0.001
 parcels+0.001
 counters+0.001
Neg logit contributions
in-0.001
 thing-0.001
 loser-0.001
 friend-0.000
 frog-0.000
Token desks
Token ID44301
Feature activation+2.81e-01

Pos logit contributions
keeping+0.000
 magazines+0.000
icol+0.000
 envelop+0.000
 parcels+0.000
Neg logit contributions
 loser-0.000
in-0.000
 thing-0.000
 best-0.000
 problem-0.000
Token were
Token ID547
Feature activation+4.17e-02

Pos logit contributions
keeping+0.688
 magazines+0.674
icol+0.648
 envelop+0.620
 parcels+0.617
Neg logit contributions
 loser-0.346
in-0.345
 am-0.321
 thing-0.320
 problem-0.319
Token in
Token ID287
Feature activation-1.61e-02

Pos logit contributions
keeping+0.110
icol+0.106
 magazines+0.104
bons+0.098
 parcels+0.097
Neg logit contributions
 loser-0.048
 problem-0.047
 best-0.046
 friend-0.046
in-0.045
Token dis
Token ID595
Feature activation+3.15e-02

Pos logit contributions
 thing+0.023
 loser+0.023
 problem+0.022
 am+0.021
 person+0.020
Neg logit contributions
keeping-0.038
icol-0.038
 magazines-0.035
 parcels-0.033
 counters-0.032
Tokenarray
Token ID18747
Feature activation+2.07e-01

Pos logit contributions
keeping+0.041
 magazines+0.039
icol+0.038
 parcels+0.037
 envelop+0.037
Neg logit contributions
 problem-0.068
in-0.068
 best-0.065
m-0.065
 thing-0.064
Token and
Token ID290
Feature activation+3.59e-02

Pos logit contributions
keeping+0.604
 magazines+0.581
icol+0.580
 envelop+0.556
 parcels+0.553
Neg logit contributions
 thing-0.156
 problem-0.147
 best-0.144
in-0.139
 loser-0.135
Token grown
Token ID7334
Feature activation+4.94e-02

Pos logit contributions
keeping+0.003
icol+0.003
 magazines+0.003
bons+0.003
etti+0.003
Neg logit contributions
 problem-0.002
in-0.002
 thing-0.002
 best-0.002
 friend-0.002
Token-
Token ID12
Feature activation+1.38e-02

Pos logit contributions
 magazines+0.125
keeping+0.125
icol+0.119
 parcels+0.113
 envelop+0.113
Neg logit contributions
in-0.056
 loser-0.051
 problem-0.050
 thing-0.049
a-0.049
Tokenups
Token ID4739
Feature activation+2.96e-02

Pos logit contributions
 magazines+0.030
keeping+0.030
icol+0.029
 parcels+0.027
 envelop+0.027
Neg logit contributions
in-0.021
 loser-0.020
a-0.019
 thing-0.018
 problem-0.018
Token sitting
Token ID5586
Feature activation+4.06e-02

Pos logit contributions
 magazines+0.056
keeping+0.056
 envelop+0.050
icol+0.049
 computers+0.049
Neg logit contributions
 loser-0.053
in-0.052
 thing-0.048
 problem-0.048
 am-0.048
Token at
Token ID379
Feature activation-3.90e-02

Pos logit contributions
keeping+0.105
 magazines+0.103
icol+0.101
 parcels+0.096
 envelop+0.096
Neg logit contributions
in-0.047
 loser-0.044
 thing-0.041
 best-0.041
 problem-0.040
Token desks
Token ID44301
Feature activation+2.80e-01

Pos logit contributions
 thing+0.051
 am+0.050
 loser+0.049
 problem+0.048
 best+0.047
Neg logit contributions
icol-0.089
keeping-0.086
 magazines-0.085
 parcels-0.082
 illustrations-0.081
Token and
Token ID290
Feature activation+3.74e-02

Pos logit contributions
icol+0.777
keeping+0.764
 magazines+0.758
 parcels+0.725
 envelop+0.703
Neg logit contributions
in-0.230
 loser-0.223
 thing-0.201
 problem-0.194
a-0.191
Token typing
Token ID19720
Feature activation+1.67e-01

Pos logit contributions
icol+0.046
keeping+0.043
 magazines+0.040
bons+0.039
 illustrations+0.035
Neg logit contributions
 problem-0.084
 thing-0.083
 loser-0.081
 friend-0.081
 person-0.081
Token on
Token ID319
Feature activation-2.10e-02

Pos logit contributions
keeping+0.445
icol+0.445
 magazines+0.427
 parcels+0.409
bons+0.405
Neg logit contributions
in-0.151
 thing-0.150
 loser-0.137
 best-0.134
 problem-0.131
Token computers
Token ID9061
Feature activation+2.58e-01

Pos logit contributions
 thing+0.030
 problem+0.028
in+0.027
 best+0.027
 friend+0.026
Neg logit contributions
keeping-0.046
icol-0.045
 magazines-0.040
 parcels-0.039
bons-0.039
Token.
Token ID13
Feature activation-1.66e-02

Pos logit contributions
keeping+0.717
icol+0.717
 magazines+0.704
 parcels+0.667
bons+0.654
Neg logit contributions
in-0.184
 loser-0.182
 problem-0.173
 thing-0.172
 best-0.166
Token She
Token ID1375
Feature activation+4.60e-02

Pos logit contributions
 loser+0.015
 frog+0.015
 thing+0.015
 friend+0.014
'm+0.014
Neg logit contributions
keeping-0.019
 magazines-0.018
 envelop-0.017
 parcels-0.016
bons-0.016
Token saw
Token ID2497
Feature activation+9.31e-02

Pos logit contributions
 magazines+0.109
keeping+0.109
icol+0.099
 envelop+0.098
 parcels+0.097
Neg logit contributions
 loser-0.060
in-0.060
 problem-0.055
 thing-0.054
 am-0.054
Token a
Token ID257
Feature activation+3.37e-02

Pos logit contributions
keeping+0.181
 magazines+0.181
icol+0.163
 envelop+0.160
 parcels+0.159
Neg logit contributions
 loser-0.162
in-0.159
 thing-0.149
 problem-0.148
 am-0.147
Token lot
Token ID1256
Feature activation+2.73e-02

Pos logit contributions
 magazines+0.085
keeping+0.082
 envelop+0.077
icol+0.077
 parcels+0.076
Neg logit contributions
 loser-0.039
in-0.038
 am-0.034
 thing-0.033
 problem-0.033
Token of
Token ID286
Feature activation+2.92e-03

Pos logit contributions
 magazines+0.072
keeping+0.072
icol+0.066
 envelop+0.066
 parcels+0.065
Neg logit contributions
 loser-0.029
in-0.028
 problem-0.025
 thing-0.025
 am-0.025
Token desks
Token ID44301
Feature activation+2.79e-01

Pos logit contributions
keeping+0.007
icol+0.006
 magazines+0.006
etti+0.006
bons+0.005
Neg logit contributions
 problem-0.005
 thing-0.004
in-0.004
 best-0.004
 loser-0.004
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
keeping+0.708
 magazines+0.703
icol+0.653
 envelop+0.641
 parcels+0.640
Neg logit contributions
 loser-0.319
in-0.312
 problem-0.284
 thing-0.284
 am-0.278
Token chairs
Token ID18791
Feature activation+2.04e-01

Pos logit contributions
 best+0.024
 only+0.023
 thing+0.023
 loser+0.023
 friend+0.022
Neg logit contributions
keeping-0.027
icol-0.026
 magazines-0.022
etti-0.021
bons-0.021
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
 magazines+0.548
keeping+0.543
 envelop+0.501
icol+0.500
 computers+0.497
Neg logit contributions
 loser-0.207
in-0.201
 problem-0.173
 thing-0.172
 am-0.171
Token papers
Token ID9473
Feature activation+1.46e-01

Pos logit contributions
 best+0.013
 only+0.012
 thing+0.012
 first+0.011
 loser+0.011
Neg logit contributions
keeping-0.039
icol-0.037
 magazines-0.034
 parcels-0.032
bons-0.032
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
 magazines+0.386
keeping+0.385
icol+0.355
 envelop+0.352
 parcels+0.350
Neg logit contributions
 loser-0.151
in-0.146
 problem-0.131
 thing-0.130
 am-0.129
Token.
Token ID13
Feature activation+7.58e-03

Pos logit contributions
icol+0.562
keeping+0.560
 magazines+0.560
 parcels+0.529
 computers+0.521
Neg logit contributions
in-0.198
 loser-0.194
 am-0.183
 thing-0.182
 problem-0.174
Token She
Token ID1375
Feature activation+7.08e-02

Pos logit contributions
keeping+0.018
 magazines+0.018
icol+0.016
 envelop+0.016
 parcels+0.016
Neg logit contributions
 loser-0.010
 am-0.009
 frog-0.009
in-0.009
 friend-0.009
Token has
Token ID468
Feature activation+6.40e-02

Pos logit contributions
keeping+0.183
 magazines+0.175
icol+0.162
 envelop+0.159
bons+0.158
Neg logit contributions
in-0.082
 am-0.079
 loser-0.077
a-0.077
 problem-0.076
Token made
Token ID925
Feature activation-5.30e-02

Pos logit contributions
keeping+0.162
icol+0.159
etti+0.147
 magazines+0.144
 videos+0.142
Neg logit contributions
 best-0.074
 only-0.073
 problem-0.069
 loser-0.068
 friend-0.067
Token their
Token ID511
Feature activation+7.26e-02

Pos logit contributions
 best+0.071
 friend+0.071
a+0.071
 thing+0.071
in+0.070
Neg logit contributions
keeping-0.123
icol-0.117
 magazines-0.113
nam-0.104
 parcels-0.103
Token beds
Token ID20237
Feature activation+2.78e-01

Pos logit contributions
keeping+0.145
 magazines+0.138
icol+0.132
 parcels+0.126
bons+0.123
Neg logit contributions
 loser-0.123
 friend-0.119
 problem-0.119
 thing-0.117
 best-0.116
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
keeping+0.717
 magazines+0.714
icol+0.682
 parcels+0.663
 envelop+0.660
Neg logit contributions
 loser-0.298
in-0.292
 best-0.276
 thing-0.271
 am-0.267
Token She
Token ID1375
Feature activation+6.54e-02

Pos logit contributions
keeping+0.027
 magazines+0.026
icol+0.025
 parcels+0.024
 envelop+0.024
Neg logit contributions
 loser-0.017
 am-0.016
in-0.016
 thing-0.015
'm-0.015
Token has
Token ID468
Feature activation+5.94e-02

Pos logit contributions
keeping+0.179
 magazines+0.166
icol+0.157
nam+0.151
bons+0.151
Neg logit contributions
 am-0.072
in-0.069
a-0.066
 problem-0.066
 loser-0.065
Token left
Token ID1364
Feature activation-3.07e-02

Pos logit contributions
keeping+0.144
icol+0.142
 magazines+0.128
etti+0.127
 videos+0.126
Neg logit contributions
 best-0.076
 only-0.073
 problem-0.071
 am-0.068
 friend-0.068
Token a
Token ID257
Feature activation+4.16e-02

Pos logit contributions
 thing+0.040
 loser+0.038
in+0.038
 best+0.036
 person+0.036
Neg logit contributions
icol-0.073
keeping-0.072
 magazines-0.067
 parcels-0.064
 illustrations-0.063
Token to
Token ID284
Feature activation+3.72e-02

Pos logit contributions
 magazines+0.212
keeping+0.210
 envelop+0.194
icol+0.194
 parcels+0.192
Neg logit contributions
 loser-0.100
in-0.098
 problem-0.091
 friend-0.091
 thing-0.091
Token look
Token ID804
Feature activation-2.74e-02

Pos logit contributions
 magazines+0.090
keeping+0.089
icol+0.082
 envelop+0.081
 parcels+0.081
Neg logit contributions
 loser-0.047
in-0.046
 problem-0.042
 thing-0.042
 am-0.041
Token at
Token ID379
Feature activation-2.60e-02

Pos logit contributions
in+0.037
 loser+0.036
 problem+0.034
 thing+0.034
 friend+0.033
Neg logit contributions
keeping-0.065
 magazines-0.063
icol-0.058
 parcels-0.057
 envelop-0.057
Token the
Token ID262
Feature activation+1.59e-02

Pos logit contributions
in+0.040
 loser+0.039
 thing+0.038
 friend+0.037
 am+0.037
Neg logit contributions
 magazines-0.055
keeping-0.054
icol-0.051
 envelop-0.049
 parcels-0.049
Token other
Token ID584
Feature activation-5.75e-02

Pos logit contributions
keeping+0.035
 magazines+0.034
icol+0.031
 envelop+0.031
 parcels+0.031
Neg logit contributions
 loser-0.024
in-0.024
 problem-0.022
 thing-0.022
 friend-0.021
Token magazines
Token ID16695
Feature activation+2.78e-01

Pos logit contributions
 loser+0.087
in+0.085
 thing+0.080
 problem+0.079
 friend+0.078
Neg logit contributions
keeping-0.124
 magazines-0.124
icol-0.114
 envelop-0.111
 parcels-0.111
Token together
Token ID1978
Feature activation+8.44e-04

Pos logit contributions
 magazines+0.683
keeping+0.680
icol+0.625
 envelop+0.620
 parcels+0.616
Neg logit contributions
 loser-0.340
in-0.331
 problem-0.300
 thing-0.298
 am-0.295
Token,
Token ID11
Feature activation-6.38e-04

Pos logit contributions
 magazines+0.002
keeping+0.002
 envelop+0.002
 computers+0.002
 parcels+0.002
Neg logit contributions
 loser-0.001
in-0.001
 problem-0.001
 am-0.001
 thing-0.001
Token and
Token ID290
Feature activation+5.14e-02

Pos logit contributions
 loser+0.001
in+0.001
 thing+0.001
 friend+0.001
 best+0.001
Neg logit contributions
keeping-0.002
 magazines-0.002
icol-0.002
bons-0.001
 envelop-0.001
Token find
Token ID1064
Feature activation+9.55e-03

Pos logit contributions
keeping+0.118
 magazines+0.118
icol+0.111
bons+0.106
 envelop+0.106
Neg logit contributions
 loser-0.067
in-0.065
 thing-0.064
 friend-0.063
 am-0.062
Token more
Token ID517
Feature activation-3.59e-02

Pos logit contributions
keeping+0.021
 magazines+0.020
icol+0.020
 parcels+0.018
 envelop+0.018
Neg logit contributions
in-0.014
 best-0.014
 thing-0.014
 loser-0.014
 friend-0.013
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
 magazines+0.274
keeping+0.272
 envelop+0.244
icol+0.240
 parcels+0.239
Neg logit contributions
 loser-0.168
in-0.155
 problem-0.150
 thing-0.146
 friend-0.143
Token They
Token ID1119
Feature activation+7.38e-02

Pos logit contributions
 loser+0.020
in+0.019
 am+0.019
 friend+0.018
 frog+0.018
Neg logit contributions
 magazines-0.033
keeping-0.033
icol-0.030
 envelop-0.030
 parcels-0.029
Token have
Token ID423
Feature activation-1.24e-02

Pos logit contributions
keeping+0.175
 magazines+0.174
icol+0.160
 envelop+0.157
 parcels+0.156
Neg logit contributions
 loser-0.097
in-0.095
 am-0.088
 problem-0.087
 thing-0.086
Token their
Token ID511
Feature activation+7.02e-02

Pos logit contributions
 best+0.016
 loser+0.016
 problem+0.016
 friend+0.015
in+0.015
Neg logit contributions
keeping-0.030
icol-0.028
 magazines-0.028
 illustrations-0.026
 parcels-0.026
Token own
Token ID898
Feature activation-4.15e-02

Pos logit contributions
keeping+0.149
 magazines+0.146
icol+0.135
 parcels+0.131
 envelop+0.130
Neg logit contributions
 loser-0.111
in-0.107
 problem-0.102
 thing-0.101
 friend-0.100
Token beds
Token ID20237
Feature activation+2.77e-01

Pos logit contributions
 loser+0.063
 thing+0.060
in+0.060
 problem+0.060
 friend+0.059
Neg logit contributions
keeping-0.090
 magazines-0.087
icol-0.083
 parcels-0.079
 envelop-0.078
Token in
Token ID287
Feature activation-4.72e-03

Pos logit contributions
 magazines+0.720
keeping+0.715
icol+0.686
 parcels+0.671
 envelop+0.669
Neg logit contributions
 loser-0.290
in-0.285
 am-0.261
 best-0.258
 thing-0.257
Token the
Token ID262
Feature activation+1.93e-02

Pos logit contributions
 thing+0.007
 loser+0.007
 am+0.006
 problem+0.006
 friend+0.006
Neg logit contributions
keeping-0.010
 magazines-0.010
icol-0.010
 parcels-0.010
 counters-0.009
Token same
Token ID976
Feature activation-6.86e-02

Pos logit contributions
keeping+0.043
 magazines+0.039
icol+0.036
 parcels+0.036
nam+0.036
Neg logit contributions
 loser-0.030
 problem-0.030
 best-0.030
 thing-0.029
in-0.029
Token room
Token ID2119
Feature activation+1.52e-01

Pos logit contributions
 thing+0.087
 loser+0.086
in+0.086
 problem+0.086
 friend+0.085
Neg logit contributions
keeping-0.163
 magazines-0.157
icol-0.150
nam-0.145
 parcels-0.144
Token.
Token ID13
Feature activation+7.51e-03

Pos logit contributions
 magazines+0.408
keeping+0.406
icol+0.387
 envelop+0.378
 parcels+0.377
Neg logit contributions
 loser-0.145
in-0.142
 thing-0.132
 problem-0.131
 best-0.130
Token it
Token ID340
Feature activation-3.85e-02

Pos logit contributions
in+0.017
 loser+0.017
 thing+0.016
 problem+0.016
 am+0.015
Neg logit contributions
keeping-0.025
 magazines-0.025
icol-0.023
 computers-0.022
 parcels-0.022
Token back
Token ID736
Feature activation-7.23e-02

Pos logit contributions
in+0.045
 loser+0.045
 thing+0.040
 friend+0.039
 problem+0.039
Neg logit contributions
 magazines-0.097
keeping-0.096
icol-0.090
 envelop-0.089
 parcels-0.088
Token!
Token ID0
Feature activation+1.45e-02

Pos logit contributions
in+0.088
 loser+0.082
 thing+0.078
a+0.077
 friend+0.076
Neg logit contributions
 magazines-0.181
keeping-0.178
icol-0.173
 envelop-0.167
 parcels-0.166
Token It
Token ID632
Feature activation+2.09e-02

Pos logit contributions
 magazines+0.030
keeping+0.029
 envelop+0.027
 computers+0.026
 desks+0.026
Neg logit contributions
 loser-0.025
in-0.023
 am-0.022
 friend-0.022
 thing-0.022
Token'll
Token ID1183
Feature activation+1.01e-03

Pos logit contributions
 magazines+0.046
keeping+0.045
 parcels+0.043
 envelop+0.042
bons+0.042
Neg logit contributions
 loser-0.031
in-0.027
 thing-0.024
 problem-0.023
 slider-0.023
Token only
Token ID691
Feature activation-2.24e-01

Pos logit contributions
keeping+0.003
 magazines+0.003
 envelop+0.002
icol+0.002
bons+0.002
Neg logit contributions
in-0.001
 loser-0.001
 problem-0.001
 thing-0.001
a-0.001
Token bring
Token ID2222
Feature activation-5.22e-02

Pos logit contributions
 loser+0.283
in+0.273
 problem+0.244
 thing+0.242
 am+0.231
Neg logit contributions
 magazines-0.540
keeping-0.538
 envelop-0.492
 parcels-0.489
icol-0.487
Token us
Token ID514
Feature activation+6.32e-02

Pos logit contributions
 thing+0.074
in+0.074
 loser+0.072
 friend+0.070
 best+0.069
Neg logit contributions
 magazines-0.114
keeping-0.114
icol-0.109
 computers-0.102
 parcels-0.101
Token more
Token ID517
Feature activation-5.38e-02

Pos logit contributions
 magazines+0.146
keeping+0.145
icol+0.132
 envelop+0.131
 parcels+0.130
Neg logit contributions
 loser-0.087
in-0.085
 thing-0.078
 problem-0.078
 am-0.077
Token trouble
Token ID5876
Feature activation+8.10e-02

Pos logit contributions
 loser+0.078
in+0.077
 problem+0.071
 thing+0.070
 am+0.070
Neg logit contributions
 magazines-0.120
keeping-0.119
icol-0.108
 envelop-0.108
 parcels-0.107
Token if
Token ID611
Feature activation-7.28e-02

Pos logit contributions
keeping+0.164
 magazines+0.164
 envelop+0.145
icol+0.145
 parcels+0.144
Neg logit contributions
 loser-0.134
in-0.131
 problem-0.122
 thing-0.121
 am-0.120
Token for
Token ID329
Feature activation+5.67e-02

Pos logit contributions
 loser+0.032
in+0.031
 problem+0.028
 thing+0.028
 am+0.028
Neg logit contributions
 magazines-0.051
keeping-0.051
 envelop-0.046
icol-0.046
 parcels-0.045
Token help
Token ID1037
Feature activation-1.13e-01

Pos logit contributions
keeping+0.183
icol+0.181
 magazines+0.180
 illustrations+0.168
bons+0.167
Neg logit contributions
 loser-0.024
 problem-0.021
 thing-0.021
 first-0.019
 end-0.019
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
 loser+0.129
in+0.125
 problem+0.113
 thing+0.113
 am+0.111
Neg logit contributions
 magazines-0.288
keeping-0.287
icol-0.265
 envelop-0.262
 parcels-0.261
Token but
Token ID475
Feature activation-4.00e-02

Pos logit contributions
 loser+0.015
in+0.014
 thing+0.013
 friend+0.013
 problem+0.013
Neg logit contributions
keeping-0.037
 magazines-0.036
icol-0.034
 envelop-0.033
 parcels-0.033
Token he
Token ID339
Feature activation-9.12e-02

Pos logit contributions
 loser+0.059
in+0.059
 problem+0.053
 am+0.052
ching+0.051
Neg logit contributions
 magazines-0.088
keeping-0.086
 envelop-0.079
 computers-0.078
 parcels-0.078
Token only
Token ID691
Feature activation-2.23e-01

Pos logit contributions
 loser+0.135
in+0.120
 thing+0.118
 winner+0.106
 moment+0.105
Neg logit contributions
 magazines-0.199
 computers-0.178
keeping-0.178
 envelop-0.176
 parcels-0.175
Token bar
Token ID2318
Feature activation-1.07e-01

Pos logit contributions
 loser+0.318
in+0.307
 thing+0.283
 problem+0.281
 am+0.275
Neg logit contributions
 magazines-0.507
keeping-0.498
 envelop-0.456
icol-0.452
 parcels-0.452
Tokenks
Token ID591
Feature activation+1.72e-03

Pos logit contributions
 loser+0.226
 frog+0.225
 thing+0.222
ching+0.216
 problem+0.210
Neg logit contributions
keeping-0.148
 magazines-0.145
 desks-0.132
 envelop-0.128
 computers-0.126
Token and
Token ID290
Feature activation+4.47e-02

Pos logit contributions
 magazines+0.004
keeping+0.004
 envelop+0.004
icol+0.004
 computers+0.004
Neg logit contributions
 loser-0.002
in-0.002
 problem-0.002
 am-0.002
 thing-0.002
Token l
Token ID300
Feature activation+7.99e-04

Pos logit contributions
 magazines+0.099
keeping+0.096
 envelop+0.089
icol+0.089
 parcels+0.089
Neg logit contributions
in-0.066
 loser-0.065
 thing-0.058
a-0.057
 friend-0.056
Tokenicks
Token ID3378
Feature activation-3.79e-02

Pos logit contributions
 magazines+0.001
keeping+0.001
 envelop+0.001
 parcels+0.001
 computers+0.001
Neg logit contributions
 loser-0.002
in-0.002
 thing-0.002
 problem-0.002
 friend-0.002
Token about
Token ID546
Feature activation+1.12e-01

Pos logit contributions
in+0.076
 friend+0.076
 thing+0.075
 best+0.075
 problem+0.073
Neg logit contributions
keeping-0.088
 magazines-0.081
icol-0.080
bons-0.078
 parcels-0.076
Token the
Token ID262
Feature activation+4.76e-03

Pos logit contributions
keeping+0.227
 magazines+0.226
icol+0.205
 parcels+0.201
 envelop+0.201
Neg logit contributions
 loser-0.186
in-0.182
 thing-0.173
 problem-0.173
 am-0.169
Token town
Token ID3240
Feature activation+7.38e-02

Pos logit contributions
 magazines+0.011
keeping+0.010
 envelop+0.009
 computers+0.009
 parcels+0.009
Neg logit contributions
 loser-0.007
in-0.007
a-0.006
 am-0.006
'm-0.006
Token.
Token ID13
Feature activation-3.33e-03

Pos logit contributions
 magazines+0.183
keeping+0.182
 envelop+0.166
icol+0.165
 parcels+0.165
Neg logit contributions
 loser-0.089
in-0.086
 problem-0.078
 thing-0.077
 am-0.077
Token They
Token ID1119
Feature activation+5.47e-02

Pos logit contributions
 loser+0.006
in+0.005
 am+0.005
 thing+0.005
 friend+0.005
Neg logit contributions
keeping-0.007
 magazines-0.007
icol-0.006
 envelop-0.006
 parcels-0.006
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
 magazines+0.128
keeping+0.124
 envelop+0.116
 parcels+0.114
icol+0.114
Neg logit contributions
 loser-0.072
in-0.072
 problem-0.066
 thing-0.065
a-0.061
Token remember
Token ID3505
Feature activation-4.31e-02

Pos logit contributions
 loser+0.265
in+0.256
 problem+0.232
 thing+0.231
 am+0.230
Neg logit contributions
 magazines-0.558
keeping-0.557
icol-0.510
 envelop-0.507
 parcels-0.503
Token their
Token ID511
Feature activation+5.07e-02

Pos logit contributions
in+0.063
 loser+0.062
 thing+0.060
 problem+0.059
 friend+0.059
Neg logit contributions
keeping-0.097
 magazines-0.095
icol-0.088
 envelop-0.086
 parcels-0.086
Token friendship
Token ID14738
Feature activation-2.05e-02

Pos logit contributions
 magazines+0.122
keeping+0.118
 envelop+0.111
 computers+0.110
 parcels+0.109
Neg logit contributions
in-0.066
 loser-0.065
a-0.057
ching-0.055
 am-0.055
Token.
Token ID13
Feature activation-3.13e-03

Pos logit contributions
 loser+0.027
in+0.025
 problem+0.023
 am+0.023
 thing+0.022
Neg logit contributions
 magazines-0.050
keeping-0.049
 envelop-0.045
 computers-0.044
 parcels-0.044
Token<|endoftext|>
Token ID50256
Feature activation+1.18e-01

Pos logit contributions
 loser+0.004
in+0.004
 am+0.004
 thing+0.004
 problem+0.004
Neg logit contributions
keeping-0.007
 magazines-0.007
icol-0.007
 envelop-0.007
 parcels-0.007
Token is
Token ID318
Feature activation-1.37e-02

Pos logit contributions
 loser+0.073
in+0.071
 problem+0.065
 thing+0.064
 am+0.064
Neg logit contributions
 magazines-0.135
keeping-0.135
icol-0.123
 envelop-0.122
 parcels-0.121
Token a
Token ID257
Feature activation+2.76e-02

Pos logit contributions
in+0.021
 loser+0.021
 thing+0.020
 am+0.019
 friend+0.019
Neg logit contributions
keeping-0.029
 magazines-0.029
icol-0.026
 envelop-0.026
 parcels-0.026
Token big
Token ID1263
Feature activation+3.28e-03

Pos logit contributions
 magazines+0.061
icol+0.057
keeping+0.057
 brand+0.057
 envelop+0.057
Neg logit contributions
in-0.036
 loser-0.034
umber-0.032
ching-0.031
 am-0.030
Token one
Token ID530
Feature activation-2.14e-01

Pos logit contributions
 magazines+0.007
keeping+0.007
 envelop+0.006
icol+0.006
 parcels+0.006
Neg logit contributions
in-0.005
 loser-0.004
ching-0.004
a-0.004
umber-0.004
Token for
Token ID329
Feature activation+5.66e-02

Pos logit contributions
 loser+0.364
ching+0.332
in+0.327
 problem+0.318
 am+0.317
Neg logit contributions
 magazines-0.387
keeping-0.372
 envelop-0.340
 parcels-0.338
 presents-0.337
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
keeping+0.126
 magazines+0.124
icol+0.119
 envelop+0.112
 parcels+0.111
Neg logit contributions
 loser-0.083
in-0.081
 best-0.076
 problem-0.076
 thing-0.075
Token two
Token ID734
Feature activation-1.42e-01

Pos logit contributions
 loser+0.214
in+0.211
 problem+0.176
 am+0.173
ching+0.171
Neg logit contributions
 magazines-0.592
keeping-0.591
 envelop-0.547
 parcels-0.542
icol-0.537
Token dollars
Token ID5054
Feature activation-2.45e-02

Pos logit contributions
 loser+0.230
in+0.217
 am+0.204
ching+0.199
 thing+0.198
Neg logit contributions
 magazines-0.293
keeping-0.278
 envelop-0.263
 parcels-0.262
 computers-0.256
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
 loser+0.036
in+0.035
 problem+0.033
 thing+0.032
 am+0.032
Neg logit contributions
 magazines-0.054
keeping-0.053
 envelop-0.048
icol-0.048
 parcels-0.048
Tokenâ
Token ID22940
Feature activation+4.10e-03

Pos logit contributions
 loser+0.030
in+0.028
 am+0.027
'm+0.027
 friend+0.027
Neg logit contributions
keeping-0.044
 magazines-0.044
 envelop-0.039
icol-0.039
 computers-0.039
Token
Token ID26391
Feature activation+4.33e-02

Pos logit contributions
 magazines+0.008
keeping+0.008
 envelop+0.008
 computers+0.007
 parcels+0.007
Neg logit contributions
 loser-0.007
in-0.007
 am-0.006
 thing-0.006
 problem-0.006
Token no
Token ID645
Feature activation-1.14e-01

Pos logit contributions
 magazines+0.110
keeping+0.110
icol+0.100
 envelop+0.100
 parcels+0.099
Neg logit contributions
 loser-0.060
in-0.059
 problem-0.053
 thing-0.053
 am-0.053
Token matches
Token ID7466
Feature activation-2.88e-02

Pos logit contributions
 loser+0.203
 problem+0.191
 thing+0.189
 friend+0.189
in+0.188
Neg logit contributions
keeping-0.228
 magazines-0.218
icol-0.204
 parcels-0.192
 computers-0.192
Token.
Token ID13
Feature activation-6.23e-03

Pos logit contributions
 loser+0.040
in+0.038
 problem+0.036
 am+0.035
 thing+0.034
Neg logit contributions
keeping-0.065
 magazines-0.065
 envelop-0.058
 computers-0.057
 videos-0.055
Token She
Token ID1375
Feature activation+5.13e-02

Pos logit contributions
 loser+0.011
in+0.010
 am+0.010
 thing+0.010
 friend+0.009
Neg logit contributions
keeping-0.013
 magazines-0.012
icol-0.011
 parcels-0.011
 envelop-0.011
Token has
Token ID468
Feature activation+4.62e-02

Pos logit contributions
keeping+0.134
 magazines+0.130
icol+0.120
 parcels+0.119
 envelop+0.119
Neg logit contributions
 loser-0.058
in-0.057
 am-0.054
 problem-0.054
 thing-0.050
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
keeping+0.112
 magazines+0.111
icol+0.104
 parcels+0.101
 envelop+0.100
Neg logit contributions
 loser-0.059
in-0.056
 problem-0.054
 thing-0.053
 am-0.052
Token one
Token ID530
Feature activation-2.12e-01

Pos logit contributions
 loser+0.314
in+0.300
 thing+0.290
 problem+0.288
 friend+0.282
Neg logit contributions
keeping-0.511
 magazines-0.501
icol-0.476
 parcels-0.449
 videos-0.443
Token card
Token ID2657
Feature activation+1.69e-02

Pos logit contributions
 loser+0.348
in+0.342
'm+0.330
 am+0.325
ching+0.318
Neg logit contributions
 magazines-0.420
 envelop-0.386
keeping-0.374
 candles-0.359
 presents-0.358
Token left
Token ID1364
Feature activation-3.99e-02

Pos logit contributions
 magazines+0.040
keeping+0.039
 envelop+0.036
 computers+0.035
 parcels+0.035
Neg logit contributions
 loser-0.022
in-0.022
 problem-0.020
 am-0.020
 thing-0.019
Token.
Token ID13
Feature activation-5.19e-03

Pos logit contributions
 loser+0.040
in+0.039
 thing+0.039
 friend+0.035
 best+0.034
Neg logit contributions
icol-0.105
keeping-0.105
 magazines-0.103
 parcels-0.099
 computers-0.096
Token She
Token ID1375
Feature activation+5.17e-02

Pos logit contributions
 loser+0.008
in+0.007
 am+0.007
 thing+0.007
 friend+0.007
Neg logit contributions
keeping-0.012
 magazines-0.011
icol-0.010
 parcels-0.010
 envelop-0.010
Token use
Token ID779
Feature activation-5.87e-02

Pos logit contributions
 loser+0.043
in+0.042
worm+0.041
a+0.040
 problem+0.040
Neg logit contributions
 magazines-0.044
bons-0.044
 envelop-0.042
keeping-0.041
 videos-0.041
Token my
Token ID616
Feature activation-5.93e-03

Pos logit contributions
 loser+0.106
in+0.105
 problem+0.099
 thing+0.099
 am+0.097
Neg logit contributions
keeping-0.110
 magazines-0.109
icol-0.098
 envelop-0.095
 parcels-0.095
Token rod
Token ID15299
Feature activation-9.95e-02

Pos logit contributions
in+0.009
 loser+0.009
a+0.009
Only+0.008
ching+0.008
Neg logit contributions
 magazines-0.012
 parcels-0.011
 envelop-0.011
 computers-0.011
keeping-0.011
Token.
Token ID13
Feature activation-3.82e-03

Pos logit contributions
 loser+0.139
in+0.133
 problem+0.126
 am+0.124
 thing+0.121
Neg logit contributions
 magazines-0.225
keeping-0.223
 envelop-0.200
 computers-0.197
 parcels-0.196
Token But
Token ID887
Feature activation-1.88e-02

Pos logit contributions
 loser+0.006
in+0.006
 thing+0.006
 am+0.006
 problem+0.006
Neg logit contributions
keeping-0.008
 magazines-0.008
 envelop-0.007
icol-0.007
 computers-0.007
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
ching+0.029
in+0.028
 loser+0.027
a+0.025
umber+0.025
Neg logit contributions
 magazines-0.036
nam-0.035
 computers-0.033
 envelop-0.033
 parcels-0.032
Token for
Token ID329
Feature activation+5.77e-02

Pos logit contributions
in+0.291
 loser+0.284
umber+0.257
ching+0.256
 am+0.252
Neg logit contributions
 magazines-0.515
keeping-0.480
 envelop-0.470
nam-0.462
 computers-0.458
Token a
Token ID257
Feature activation+3.08e-02

Pos logit contributions
keeping+0.128
 magazines+0.128
icol+0.120
 envelop+0.114
 parcels+0.113
Neg logit contributions
 loser-0.087
in-0.081
 problem-0.080
 thing-0.078
 am-0.076
Token little
Token ID1310
Feature activation-5.26e-02

Pos logit contributions
 magazines+0.064
 envelop+0.062
icol+0.062
keeping+0.061
 brand+0.059
Neg logit contributions
in-0.040
 am-0.039
umber-0.037
m-0.035
 loser-0.034
Token while
Token ID981
Feature activation-3.27e-02

Pos logit contributions
in+0.078
umber+0.072
 am+0.067
a+0.067
 loser+0.066
Neg logit contributions
 magazines-0.108
keeping-0.106
 envelop-0.100
 videos-0.100
icol-0.098
Token.
Token ID13
Feature activation-4.83e-03

Pos logit contributions
 loser+0.035
in+0.032
 thing+0.030
 problem+0.030
 friend+0.029
Neg logit contributions
 magazines-0.087
keeping-0.087
icol-0.081
 envelop-0.080
 parcels-0.079
Token
Token ID95
Feature activation+9.17e-02

Pos logit contributions
keeping+0.061
 magazines+0.061
 envelop+0.055
 parcels+0.053
 computers+0.053
Neg logit contributions
 loser-0.052
in-0.052
 am-0.048
 problem-0.047
 thing-0.047
Token
Token ID26391
Feature activation+4.44e-02

Pos logit contributions
 magazines+0.171
keeping+0.166
 envelop+0.151
nam+0.148
 computers+0.145
Neg logit contributions
 loser-0.172
in-0.168
 am-0.162
 thing-0.158
 winner-0.153
Token
Token ID129
Feature activation+2.04e-01

Pos logit contributions
 magazines+0.094
keeping+0.092
 envelop+0.084
 parcels+0.082
 computers+0.081
Neg logit contributions
 loser-0.071
in-0.070
 am-0.067
a-0.063
'm-0.063
Token
Token ID241
Feature activation-2.93e-02

Pos logit contributions
 magazines+0.443
keeping+0.439
nam+0.405
 envelop+0.400
 parcels+0.390
Neg logit contributions
 loser-0.297
in-0.292
 thing-0.278
 problem-0.277
 am-0.276
TokenYou
Token ID1639
Feature activation-5.38e-02

Pos logit contributions
 loser+0.041
in+0.040
 problem+0.037
 thing+0.036
 am+0.036
Neg logit contributions
 magazines-0.067
keeping-0.067
icol-0.061
 envelop-0.060
 parcels-0.060
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
 loser+0.066
in+0.064
 problem+0.058
 thing+0.058
 am+0.056
Neg logit contributions
 magazines-0.132
keeping-0.131
 envelop-0.120
icol-0.120
 parcels-0.120
Token get
Token ID651
Feature activation-6.26e-02

Pos logit contributions
 loser+0.279
in+0.268
 am+0.255
 thing+0.250
 problem+0.248
Neg logit contributions
 magazines-0.542
keeping-0.541
icol-0.501
 envelop-0.490
 computers-0.486
Token this
Token ID428
Feature activation-4.15e-02

Pos logit contributions
in+0.088
 loser+0.088
 problem+0.081
 thing+0.079
 am+0.079
Neg logit contributions
keeping-0.143
 magazines-0.143
icol-0.132
 envelop-0.128
 computers-0.127
Token cake
Token ID12187
Feature activation-6.05e-03

Pos logit contributions
 loser+0.059
in+0.057
 problem+0.051
 am+0.051
 thing+0.050
Neg logit contributions
 magazines-0.094
keeping-0.093
 envelop-0.085
icol-0.084
 parcels-0.084
Token for
Token ID329
Feature activation+5.56e-02

Pos logit contributions
 loser+0.009
in+0.008
 problem+0.008
 thing+0.008
 am+0.008
Neg logit contributions
 magazines-0.014
keeping-0.014
icol-0.012
 envelop-0.012
 parcels-0.012
Token one
Token ID530
Feature activation-2.13e-01

Pos logit contributions
keeping+0.120
 magazines+0.118
icol+0.117
 envelop+0.110
 parcels+0.107
Neg logit contributions
 loser-0.083
in-0.081
 best-0.078
 thing-0.076
 friend-0.076
Token Sarah
Token ID10490
Feature activation+1.54e-01

Pos logit contributions
 loser+0.064
in+0.062
 thing+0.060
 problem+0.059
 friend+0.058
Neg logit contributions
keeping-0.088
 magazines-0.088
icol-0.080
 envelop-0.078
 parcels-0.078
Token reminded
Token ID14516
Feature activation-2.22e-02

Pos logit contributions
 magazines+0.303
keeping+0.292
 computers+0.272
 envelop+0.267
 parcels+0.263
Neg logit contributions
 loser-0.268
in-0.231
 winner-0.220
 thing-0.217
 am-0.217
Token him
Token ID683
Feature activation-2.24e-02

Pos logit contributions
in+0.036
 friend+0.036
 thing+0.035
 best+0.034
 problem+0.033
Neg logit contributions
keeping-0.044
icol-0.043
 magazines-0.043
 parcels-0.040
 illustrations-0.039
Token that
Token ID326
Feature activation-8.70e-02

Pos logit contributions
 loser+0.029
in+0.028
 thing+0.027
 friend+0.026
 problem+0.026
Neg logit contributions
 magazines-0.053
keeping-0.053
icol-0.050
 envelop-0.049
 parcels-0.048
Token he
Token ID339
Feature activation-9.14e-02

Pos logit contributions
 loser+0.125
in+0.122
 thing+0.114
 problem+0.114
 am+0.112
Neg logit contributions
keeping-0.194
 magazines-0.194
icol-0.178
 envelop-0.174
 parcels-0.174
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
 loser+0.146
in+0.133
 thing+0.125
 problem+0.122
 winner+0.121
Neg logit contributions
 magazines-0.194
keeping-0.188
 computers-0.171
 parcels-0.169
 envelop-0.168
Token had
Token ID550
Feature activation+5.01e-02

Pos logit contributions
 loser+0.296
in+0.292
 thing+0.273
 problem+0.273
 am+0.271
Neg logit contributions
keeping-0.520
 magazines-0.517
icol-0.483
 envelop-0.468
 computers-0.464
Token a
Token ID257
Feature activation+2.99e-02

Pos logit contributions
keeping+0.114
 magazines+0.111
icol+0.111
 videos+0.101
 illustrations+0.100
Neg logit contributions
 loser-0.067
 problem-0.067
 best-0.066
 friend-0.066
 thing-0.065
Token little
Token ID1310
Feature activation-5.18e-02

Pos logit contributions
 magazines+0.068
 brand+0.063
 envelop+0.063
icol+0.061
 parcels+0.060
Neg logit contributions
in-0.042
 loser-0.040
ching-0.036
umber-0.035
 am-0.035
Token bit
Token ID1643
Feature activation-1.06e-01

Pos logit contributions
in+0.068
 loser+0.066
a+0.057
 am+0.057
ching+0.055
Neg logit contributions
 magazines-0.125
keeping-0.119
 envelop-0.113
 parcels-0.111
icol-0.109
Token of
Token ID286
Feature activation-2.13e-03

Pos logit contributions
 loser+0.136
in+0.130
 am+0.119
 thing+0.116
 problem+0.114
Neg logit contributions
 magazines-0.256
keeping-0.252
 envelop-0.231
 parcels-0.228
 computers-0.228
Token carefully
Token ID7773
Feature activation+1.37e-02

Pos logit contributions
 magazines+0.200
keeping+0.197
 envelop+0.179
 computers+0.177
icol+0.176
Neg logit contributions
 loser-0.116
in-0.112
 problem-0.104
 am-0.104
 thing-0.102
Token.
Token ID13
Feature activation-4.60e-03

Pos logit contributions
 magazines+0.034
keeping+0.033
icol+0.031
 parcels+0.030
 envelop+0.030
Neg logit contributions
 loser-0.017
in-0.016
 problem-0.015
 thing-0.015
 am-0.015
Token It
Token ID632
Feature activation+2.75e-02

Pos logit contributions
 loser+0.008
 am+0.007
in+0.007
 thing+0.007
'm+0.007
Neg logit contributions
keeping-0.009
 magazines-0.009
icol-0.008
 computers-0.008
 envelop-0.008
Token is
Token ID318
Feature activation+1.96e-03

Pos logit contributions
 magazines+0.056
 videos+0.050
 envelop+0.049
 computers+0.049
 presents+0.049
Neg logit contributions
 loser-0.044
in-0.039
 thing-0.038
umber-0.035
 friend-0.035
Token the
Token ID262
Feature activation+4.88e-03

Pos logit contributions
keeping+0.005
 magazines+0.005
icol+0.005
 parcels+0.004
 illustrations+0.004
Neg logit contributions
 loser-0.002
 best-0.002
 problem-0.002
 thing-0.002
 friend-0.002
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
 magazines+0.012
keeping+0.012
 envelop+0.011
 computers+0.011
icol+0.011
Neg logit contributions
in-0.006
 loser-0.006
 am-0.005
 problem-0.005
a-0.005
Token one
Token ID530
Feature activation-2.12e-01

Pos logit contributions
 loser+0.303
in+0.289
 thing+0.288
 problem+0.277
 friend+0.273
Neg logit contributions
keeping-0.514
 magazines-0.510
icol-0.472
 parcels-0.457
 envelop-0.455
Token we
Token ID356
Feature activation+1.63e-03

Pos logit contributions
 loser+0.317
in+0.289
 problem+0.269
 am+0.268
ching+0.267
Neg logit contributions
 magazines-0.467
keeping-0.441
 envelop-0.416
 videos-0.409
 computers-0.409
Token have
Token ID423
Feature activation-2.37e-02

Pos logit contributions
 magazines+0.004
keeping+0.004
 envelop+0.004
icol+0.004
 parcels+0.003
Neg logit contributions
 loser-0.002
in-0.002
 problem-0.002
 thing-0.002
 am-0.002
Token."
Token ID526
Feature activation+5.09e-02

Pos logit contributions
 loser+0.030
in+0.029
 problem+0.027
 thing+0.027
 friend+0.027
Neg logit contributions
keeping-0.058
 magazines-0.057
icol-0.055
 parcels-0.052
 envelop-0.051
Token
Token ID198
Feature activation-2.56e-02

Pos logit contributions
 magazines+0.136
keeping+0.134
icol+0.125
 parcels+0.124
 envelop+0.124
Neg logit contributions
 loser-0.053
in-0.051
 am-0.046
 thing-0.045
 friend-0.044
Token with
Token ID351
Feature activation-1.22e-02

Pos logit contributions
 magazines+0.054
keeping+0.052
 envelop+0.047
 computers+0.047
icol+0.047
Neg logit contributions
 loser-0.037
in-0.036
 problem-0.034
 am-0.033
 thing-0.032
Token at
Token ID379
Feature activation-4.25e-02

Pos logit contributions
in+0.014
 loser+0.013
 thing+0.012
 am+0.012
 friend+0.012
Neg logit contributions
keeping-0.032
 magazines-0.031
icol-0.030
 parcels-0.029
 envelop-0.029
Token school
Token ID1524
Feature activation+1.26e-01

Pos logit contributions
 loser+0.073
in+0.072
 thing+0.068
 problem+0.067
 am+0.067
Neg logit contributions
 magazines-0.083
keeping-0.083
icol-0.076
 parcels-0.074
 envelop-0.074
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
keeping+0.340
 magazines+0.327
icol+0.317
 parcels+0.307
 envelop+0.304
Neg logit contributions
 loser-0.128
in-0.126
 problem-0.113
 thing-0.113
 friend-0.111
Token She
Token ID1375
Feature activation+4.68e-02

Pos logit contributions
 loser+0.021
in+0.021
 am+0.020
 thing+0.020
 problem+0.019
Neg logit contributions
keeping-0.032
 magazines-0.031
icol-0.029
 parcels-0.028
 envelop-0.028
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
keeping+0.117
 magazines+0.113
icol+0.106
 envelop+0.104
 parcels+0.104
Neg logit contributions
 loser-0.058
in-0.057
 am-0.055
 problem-0.053
 friend-0.051
Token had
Token ID550
Feature activation+4.89e-02

Pos logit contributions
 loser+0.342
in+0.334
 problem+0.309
 thing+0.307
 am+0.304
Neg logit contributions
 magazines-0.478
keeping-0.474
icol-0.428
 envelop-0.427
 parcels-0.422
Token one
Token ID530
Feature activation-2.12e-01

Pos logit contributions
keeping+0.114
icol+0.106
 magazines+0.104
 parcels+0.100
 counters+0.096
Neg logit contributions
 best-0.071
 loser-0.070
 friend-0.067
 problem-0.066
 thing-0.064
Token toy
Token ID13373
Feature activation-1.11e-01

Pos logit contributions
in+0.256
 loser+0.256
ching+0.245
'm+0.239
umber+0.238
Neg logit contributions
 magazines-0.507
 presents-0.435
 videos-0.435
 envelop-0.434
keeping-0.431
Token,
Token ID11
Feature activation-1.81e-02

Pos logit contributions
 loser+0.156
in+0.152
 problem+0.139
 am+0.139
 thing+0.138
Neg logit contributions
 magazines-0.252
keeping-0.240
 envelop-0.222
 computers-0.221
 videos-0.218
Token a
Token ID257
Feature activation+2.84e-02

Pos logit contributions
 loser+0.023
in+0.022
 thing+0.020
 problem+0.020
 am+0.020
Neg logit contributions
keeping-0.044
 magazines-0.043
icol-0.040
 envelop-0.039
 parcels-0.039
Token too
Token ID1165
Feature activation-1.29e-01

Pos logit contributions
 magazines+0.101
keeping+0.099
icol+0.091
 envelop+0.090
 parcels+0.090
Neg logit contributions
 loser-0.058
in-0.057
 thing-0.050
 problem-0.050
 friend-0.050
Token big
Token ID1263
Feature activation+8.02e-03

Pos logit contributions
 loser+0.193
in+0.180
 winner+0.168
 am+0.167
'm+0.163
Neg logit contributions
 magazines-0.281
keeping-0.265
 computers-0.249
 envelop-0.244
 parcels-0.243
Token.
Token ID13
Feature activation-3.09e-03

Pos logit contributions
keeping+0.021
 magazines+0.021
icol+0.019
 parcels+0.019
 envelop+0.019
Neg logit contributions
 loser-0.009
in-0.008
 problem-0.008
 thing-0.008
 am-0.008
Token We
Token ID775
Feature activation+9.02e-02

Pos logit contributions
 loser+0.005
 thing+0.005
 am+0.005
in+0.005
 frog+0.004
Neg logit contributions
keeping-0.007
 magazines-0.006
 envelop-0.006
icol-0.006
 computers-0.006
Token can
Token ID460
Feature activation-2.75e-02

Pos logit contributions
 magazines+0.191
keeping+0.184
 envelop+0.178
 computers+0.168
 parcels+0.166
Neg logit contributions
in-0.128
 loser-0.112
 problem-0.107
 best-0.107
 thing-0.105
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
in+0.045
 loser+0.042
 problem+0.042
 thing+0.042
a+0.039
Neg logit contributions
 magazines-0.053
keeping-0.050
 videos-0.048
 computers-0.048
 cartoons-0.048
Token look
Token ID804
Feature activation-3.97e-02

Pos logit contributions
 loser+0.294
in+0.288
 problem+0.264
 thing+0.259
 best+0.252
Neg logit contributions
 magazines-0.521
keeping-0.518
 envelop-0.473
 computers-0.468
 parcels-0.467
Token at
Token ID379
Feature activation-4.23e-02

Pos logit contributions
in+0.039
 loser+0.038
 problem+0.035
 thing+0.034
 best+0.033
Neg logit contributions
keeping-0.111
 magazines-0.107
icol-0.100
 parcels-0.098
bons-0.098
Token it
Token ID340
Feature activation-3.56e-02

Pos logit contributions
in+0.061
 loser+0.061
 thing+0.057
 problem+0.056
 best+0.054
Neg logit contributions
keeping-0.094
 magazines-0.094
icol-0.088
 envelop-0.084
 parcels-0.084
Token."
Token ID526
Feature activation+4.94e-02

Pos logit contributions
 loser+0.051
in+0.049
 am+0.046
 problem+0.045
'm+0.044
Neg logit contributions
 magazines-0.080
keeping-0.076
 computers-0.070
 envelop-0.070
 videos-0.069
Token
Token ID198
Feature activation-2.63e-02

Pos logit contributions
 magazines+0.107
keeping+0.106
icol+0.096
 envelop+0.095
 videos+0.094
Neg logit contributions
 loser-0.073
in-0.071
 problem-0.068
 thing-0.065
 winner-0.065
Token He
Token ID679
Feature activation-1.43e-02

Pos logit contributions
keeping+0.004
 magazines+0.004
icol+0.004
bons+0.004
 parcels+0.004
Neg logit contributions
 loser-0.003
 friend-0.003
 am-0.003
in-0.003
 thing-0.003
Token says
Token ID1139
Feature activation-5.20e-03

Pos logit contributions
 loser+0.025
in+0.023
 thing+0.022
 problem+0.021
ching+0.021
Neg logit contributions
 magazines-0.029
 envelop-0.026
 parcels-0.025
keeping-0.025
 computers-0.025
Token yes
Token ID3763
Feature activation+1.37e-02

Pos logit contributions
 loser+0.008
in+0.008
 thing+0.007
 problem+0.007
 friend+0.007
Neg logit contributions
keeping-0.012
 magazines-0.011
icol-0.011
 parcels-0.010
 envelop-0.010
Token,
Token ID11
Feature activation-9.57e-03

Pos logit contributions
 magazines+0.030
keeping+0.029
 envelop+0.026
 videos+0.026
 computers+0.026
Neg logit contributions
 loser-0.021
in-0.020
 winner-0.018
ching-0.017
umber-0.017
Token but
Token ID475
Feature activation-3.70e-02

Pos logit contributions
 loser+0.010
in+0.009
 thing+0.008
 problem+0.008
 friend+0.008
Neg logit contributions
keeping-0.026
 magazines-0.025
icol-0.024
 envelop-0.023
 parcels-0.023
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
in+0.052
 loser+0.051
ching+0.047
umber+0.045
a+0.044
Neg logit contributions
 magazines-0.085
keeping-0.082
nam-0.076
 envelop-0.075
 parcels-0.074
Token if
Token ID611
Feature activation-6.65e-02

Pos logit contributions
in+0.294
 loser+0.293
 am+0.261
 problem+0.259
umber+0.253
Neg logit contributions
 magazines-0.523
keeping-0.500
 envelop-0.466
 videos-0.463
nam-0.460
Token they
Token ID484
Feature activation-1.14e-02

Pos logit contributions
 loser+0.075
in+0.073
 problem+0.066
 thing+0.065
 am+0.065
Neg logit contributions
 magazines-0.170
keeping-0.169
icol-0.156
 envelop-0.155
 parcels-0.154
Token can
Token ID460
Feature activation-2.68e-02

Pos logit contributions
in+0.017
 loser+0.016
 thing+0.016
 problem+0.015
 best+0.015
Neg logit contributions
 magazines-0.024
keeping-0.023
 envelop-0.021
 parcels-0.021
 computers-0.020
Token measure
Token ID3953
Feature activation-1.66e-01

Pos logit contributions
in+0.050
 problem+0.049
 thing+0.049
 loser+0.047
 mosquito+0.043
Neg logit contributions
 magazines-0.042
keeping-0.040
 videos-0.037
 envelop-0.037
bons-0.036
Token his
Token ID465
Feature activation-5.46e-02

Pos logit contributions
 loser+0.295
in+0.278
 problem+0.267
 am+0.264
 winner+0.261
Neg logit contributions
 magazines-0.325
keeping-0.307
 envelop-0.284
 computers-0.282
 videos-0.276
Token It
Token ID632
Feature activation+2.41e-02

Pos logit contributions
 magazines+0.028
keeping+0.028
icol+0.025
 envelop+0.025
 computers+0.024
Neg logit contributions
 loser-0.027
in-0.026
 am-0.024
 thing-0.024
 problem-0.024
Tokenâ
Token ID22940
Feature activation+5.49e-03

Pos logit contributions
 magazines+0.049
keeping+0.047
 parcels+0.046
 computers+0.045
 envelop+0.045
Neg logit contributions
 loser-0.037
in-0.033
 thing-0.030
 friend-0.029
umber-0.029
Token
Token ID26391
Feature activation+4.53e-02

Pos logit contributions
 magazines+0.011
keeping+0.010
icol+0.009
 envelop+0.009
 parcels+0.009
Neg logit contributions
 loser-0.010
in-0.010
 problem-0.009
 thing-0.009
 am-0.009
Token
Token ID8151
Feature activation+3.56e-02

Pos logit contributions
 magazines+0.095
keeping+0.093
 envelop+0.086
 parcels+0.084
icol+0.084
Neg logit contributions
 loser-0.074
in-0.070
 am-0.068
 thing-0.065
 problem-0.065
Tokens
Token ID82
Feature activation-3.38e-02

Pos logit contributions
 magazines+0.092
keeping+0.092
 computers+0.083
 envelop+0.082
 desks+0.082
Neg logit contributions
 loser-0.036
 thing-0.031
in-0.030
 problem-0.030
 friend-0.028
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
 loser+0.044
in+0.043
 problem+0.039
 thing+0.039
 am+0.038
Neg logit contributions
 magazines-0.080
keeping-0.080
 envelop-0.072
icol-0.072
 parcels-0.072
Token shiny
Token ID22441
Feature activation+1.59e-02

Pos logit contributions
 loser+0.309
in+0.301
 problem+0.277
 thing+0.275
 am+0.273
Neg logit contributions
 magazines-0.507
keeping-0.504
icol-0.458
 envelop-0.456
 parcels-0.452
Token rocks
Token ID12586
Feature activation-1.48e-02

Pos logit contributions
 magazines+0.038
keeping+0.038
icol+0.035
 envelop+0.034
 parcels+0.034
Neg logit contributions
 loser-0.021
in-0.020
 problem-0.018
 thing-0.018
 am-0.018
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
 loser+0.021
in+0.020
 problem+0.018
 thing+0.018
 friend+0.018
Neg logit contributions
 magazines-0.033
keeping-0.033
 envelop-0.030
 parcels-0.030
 computers-0.029
Tokenâ
Token ID22940
Feature activation+5.56e-03

Pos logit contributions
 loser+0.028
in+0.027
 am+0.025
 thing+0.025
'm+0.025
Neg logit contributions
 magazines-0.039
keeping-0.038
 envelop-0.035
icol-0.035
 computers-0.034
Token
Token ID26391
Feature activation+4.51e-02

Pos logit contributions
 magazines+0.011
keeping+0.010
icol+0.009
 envelop+0.009
 parcels+0.009
Neg logit contributions
 loser-0.010
in-0.010
 problem-0.009
 thing-0.009
 am-0.009
Token does
Token ID857
Feature activation+8.42e-03

Pos logit contributions
 magazines+0.020
keeping+0.019
 beds+0.019
 shelves+0.018
 computers+0.018
Neg logit contributions
 loser-0.013
in-0.013
 friend-0.011
a-0.011
 winner-0.011
Token not
Token ID407
Feature activation-9.60e-02

Pos logit contributions
 magazines+0.013
 videos+0.012
 brand+0.012
keeping+0.012
 computers+0.012
Neg logit contributions
in-0.017
 loser-0.016
umber-0.015
 winner-0.014
m-0.014
Token move
Token ID1445
Feature activation-1.25e-01

Pos logit contributions
in+0.161
 loser+0.146
a+0.138
umber+0.135
op+0.132
Neg logit contributions
 magazines-0.175
keeping-0.163
 videos-0.162
 blankets-0.159
 envelop-0.159
Token.
Token ID13
Feature activation-3.91e-03

Pos logit contributions
 loser+0.193
in+0.180
 winner+0.176
 friend+0.173
 am+0.171
Neg logit contributions
 magazines-0.253
keeping-0.252
 envelop-0.225
 computers-0.224
 videos-0.224
Token It
Token ID632
Feature activation+2.54e-02

Pos logit contributions
 loser+0.006
in+0.006
 am+0.006
 thing+0.005
 friend+0.005
Neg logit contributions
keeping-0.009
 magazines-0.008
icol-0.008
 parcels-0.008
 envelop-0.008
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
 beds+0.044
 magazines+0.044
 candles+0.042
 shelves+0.041
 videos+0.041
Neg logit contributions
 loser-0.042
in-0.041
umber-0.039
 friend-0.036
 winner-0.034
Token makes
Token ID1838
Feature activation-4.44e-02

Pos logit contributions
 loser+0.347
in+0.332
 friend+0.296
 thing+0.295
umber+0.292
Neg logit contributions
 magazines-0.467
keeping-0.449
 computers-0.421
 envelop-0.419
 parcels-0.413
Token a
Token ID257
Feature activation+3.12e-02

Pos logit contributions
 loser+0.071
in+0.071
 thing+0.067
 problem+0.067
 friend+0.066
Neg logit contributions
keeping-0.092
 magazines-0.091
icol-0.084
 parcels-0.082
 envelop-0.081
Token mess
Token ID2085
Feature activation+4.02e-02

Pos logit contributions
 magazines+0.050
icol+0.048
 videos+0.047
 envelop+0.047
 brand+0.046
Neg logit contributions
in-0.059
umber-0.055
 loser-0.055
 am-0.051
 winner-0.051
Token.
Token ID13
Feature activation-4.11e-03

Pos logit contributions
 magazines+0.095
keeping+0.094
 envelop+0.085
 computers+0.085
 videos+0.084
Neg logit contributions
 loser-0.052
in-0.049
 problem-0.047
 am-0.046
 friend-0.046
Token
Token ID198
Feature activation-2.54e-02

Pos logit contributions
 loser+0.005
in+0.004
 am+0.004
 thing+0.004
 friend+0.004
Neg logit contributions
keeping-0.011
 magazines-0.010
icol-0.010
 parcels-0.010
 envelop-0.010
Token
Token ID95
Feature activation+9.24e-02

Pos logit contributions
keeping+0.063
 magazines+0.063
 envelop+0.056
 parcels+0.055
 computers+0.055
Neg logit contributions
 loser-0.052
in-0.052
 am-0.049
 problem-0.048
 thing-0.048
Token
Token ID26391
Feature activation+4.42e-02

Pos logit contributions
 magazines+0.181
keeping+0.176
 envelop+0.162
nam+0.160
 parcels+0.155
Neg logit contributions
 loser-0.162
 am-0.156
in-0.155
 thing-0.150
 winner-0.146
Token
Token ID129
Feature activation+2.05e-01

Pos logit contributions
 magazines+0.088
keeping+0.087
 envelop+0.079
 parcels+0.077
nam+0.077
Neg logit contributions
 loser-0.074
 am-0.073
in-0.071
'm-0.069
a-0.068
Token
Token ID241
Feature activation-2.99e-02

Pos logit contributions
 magazines+0.446
keeping+0.440
nam+0.407
 envelop+0.400
 parcels+0.391
Neg logit contributions
 loser-0.293
 problem-0.282
in-0.281
 thing-0.279
 am-0.278
TokenNot
Token ID3673
Feature activation-5.07e-02

Pos logit contributions
 loser+0.040
in+0.039
 problem+0.036
 thing+0.035
 am+0.034
Neg logit contributions
keeping-0.070
 magazines-0.070
icol-0.064
 envelop-0.063
 parcels-0.063
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
 loser+0.071
in+0.067
 problem+0.065
 thing+0.065
 am+0.064
Neg logit contributions
 magazines-0.117
keeping-0.116
 envelop-0.106
icol-0.105
 parcels-0.105
Token that
Token ID326
Feature activation-8.95e-02

Pos logit contributions
 loser+0.280
in+0.269
 problem+0.249
 thing+0.249
 am+0.245
Neg logit contributions
 magazines-0.538
keeping-0.537
icol-0.492
 envelop-0.486
 parcels-0.483
Token,
Token ID11
Feature activation-2.31e-02

Pos logit contributions
 loser+0.123
in+0.120
 am+0.106
 problem+0.106
 thing+0.106
Neg logit contributions
 magazines-0.207
keeping-0.205
 envelop-0.186
icol-0.185
 computers-0.185
Token being
Token ID852
Feature activation-5.27e-02

Pos logit contributions
 loser+0.033
in+0.032
 thing+0.031
 problem+0.030
 am+0.030
Neg logit contributions
keeping-0.053
 magazines-0.052
 envelop-0.047
icol-0.047
 parcels-0.046
Token wealthy
Token ID11574
Feature activation+4.06e-02

Pos logit contributions
in+0.068
 loser+0.068
 best+0.062
a+0.062
 problem+0.061
Neg logit contributions
keeping-0.127
 magazines-0.126
icol-0.119
 envelop-0.113
 parcels-0.112
Token also
Token ID635
Feature activation-1.26e-01

Pos logit contributions
 magazines+0.099
keeping+0.098
icol+0.090
 envelop+0.089
 parcels+0.089
Neg logit contributions
 loser-0.051
in-0.049
 problem-0.045
 thing-0.044
 am-0.044
Token V
Token ID569
Feature activation+5.18e-02

Pos logit contributions
 magazines+0.258
keeping+0.250
 parcels+0.242
icol+0.236
 brand+0.235
Neg logit contributions
 loser-0.187
 friend-0.177
 thing-0.172
 frog-0.172
a-0.172
Tokenicky
Token ID17479
Feature activation-1.41e-01

Pos logit contributions
 magazines+0.128
keeping+0.128
icol+0.117
 parcels+0.117
 envelop+0.117
Neg logit contributions
 loser-0.063
in-0.063
 am-0.055
 thing-0.055
 problem-0.055
Token.
Token ID13
Feature activation-2.42e-02

Pos logit contributions
 loser+0.167
in+0.162
 problem+0.147
 thing+0.145
 am+0.144
Neg logit contributions
 magazines-0.352
keeping-0.350
icol-0.320
 envelop-0.319
 parcels-0.317
Token She
Token ID1375
Feature activation+3.79e-02

Pos logit contributions
 loser+0.039
 am+0.037
in+0.036
 thing+0.035
 frog+0.034
Neg logit contributions
keeping-0.053
 magazines-0.050
 parcels-0.047
icol-0.047
 envelop-0.047
Token was
Token ID373
Feature activation+8.61e-03

Pos logit contributions
keeping+0.094
 magazines+0.092
 parcels+0.087
 envelop+0.086
icol+0.083
Neg logit contributions
 am-0.051
in-0.049
 thing-0.048
a-0.047
'm-0.046
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
keeping+0.023
 magazines+0.022
icol+0.021
 parcels+0.020
 envelop+0.020
Neg logit contributions
 best-0.010
 loser-0.010
 friend-0.010
 only-0.009
 thing-0.009
Token 3
Token ID513
Feature activation+2.25e-02

Pos logit contributions
in+0.331
 loser+0.323
 problem+0.298
 am+0.297
 thing+0.285
Neg logit contributions
 magazines-0.484
keeping-0.466
 envelop-0.429
 computers-0.427
icol-0.425
Token years
Token ID812
Feature activation+2.07e-01

Pos logit contributions
keeping+0.030
 magazines+0.029
icol+0.028
 parcels+0.026
 envelop+0.026
Neg logit contributions
in-0.053
 loser-0.052
 thing-0.051
 friend-0.050
 problem-0.050
Token old
Token ID1468
Feature activation+6.59e-02

Pos logit contributions
keeping+0.511
 magazines+0.479
icol+0.468
 parcels+0.456
bons+0.453
Neg logit contributions
 problem-0.268
in-0.264
 friend-0.258
 thing-0.249
 loser-0.248
Token and
Token ID290
Feature activation+3.26e-02

Pos logit contributions
keeping+0.168
 magazines+0.166
icol+0.155
 envelop+0.153
 parcels+0.153
Neg logit contributions
 loser-0.075
in-0.073
 thing-0.068
 problem-0.067
 friend-0.066
Token loved
Token ID6151
Feature activation+1.17e-01

Pos logit contributions
keeping+0.072
 magazines+0.069
icol+0.066
 envelop+0.065
 parcels+0.063
Neg logit contributions
in-0.049
 loser-0.049
 am-0.049
 thing-0.049
 best-0.048
Token with
Token ID351
Feature activation-1.05e-02

Pos logit contributions
keeping+0.060
 magazines+0.059
icol+0.055
 envelop+0.053
 parcels+0.053
Neg logit contributions
 loser-0.036
in-0.036
 thing-0.033
 am-0.032
 problem-0.032
Token the
Token ID262
Feature activation+7.30e-03

Pos logit contributions
in+0.016
 loser+0.016
 thing+0.016
 problem+0.015
 friend+0.015
Neg logit contributions
keeping-0.023
icol-0.022
 magazines-0.021
 illustrations-0.020
 parcels-0.019
Token desk
Token ID6915
Feature activation+1.36e-01

Pos logit contributions
keeping+0.014
 magazines+0.014
icol+0.012
 envelop+0.012
 parcels+0.012
Neg logit contributions
 loser-0.013
in-0.013
 problem-0.012
 thing-0.012
 am-0.012
Token,
Token ID11
Feature activation-1.50e-02

Pos logit contributions
keeping+0.339
 magazines+0.339
icol+0.314
 envelop+0.309
 parcels+0.308
Neg logit contributions
 loser-0.162
in-0.157
 problem-0.142
 thing-0.141
 am-0.139
Token but
Token ID475
Feature activation-3.86e-02

Pos logit contributions
 loser+0.018
in+0.017
 thing+0.016
 problem+0.016
 am+0.016
Neg logit contributions
keeping-0.037
 magazines-0.037
icol-0.034
 envelop-0.034
 parcels-0.034
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
 loser+0.064
in+0.063
ching+0.059
a+0.056
 problem+0.055
Neg logit contributions
 magazines-0.076
keeping-0.071
 computers-0.067
 videos-0.067
nam-0.066
Token with
Token ID351
Feature activation-1.07e-02

Pos logit contributions
 loser+0.322
in+0.322
 problem+0.286
 am+0.285
 thing+0.282
Neg logit contributions
 magazines-0.492
keeping-0.478
 envelop-0.438
 computers-0.434
 parcels-0.432
Token the
Token ID262
Feature activation+6.68e-03

Pos logit contributions
 loser+0.017
in+0.016
 thing+0.016
 problem+0.016
 best+0.016
Neg logit contributions
keeping-0.023
 magazines-0.022
icol-0.021
 parcels-0.020
 illustrations-0.020
Token toys
Token ID14958
Feature activation+4.68e-02

Pos logit contributions
keeping+0.014
 magazines+0.013
icol+0.012
 parcels+0.012
 envelop+0.012
Neg logit contributions
 loser-0.011
in-0.011
 problem-0.010
 thing-0.010
 am-0.010
Token and
Token ID290
Feature activation+4.44e-02

Pos logit contributions
 magazines+0.112
keeping+0.110
 envelop+0.100
 computers+0.099
 parcels+0.098
Neg logit contributions
 loser-0.061
in-0.058
 problem-0.054
 thing-0.053
 am-0.052
Token books
Token ID3835
Feature activation+1.86e-01

Pos logit contributions
keeping+0.093
icol+0.093
 magazines+0.084
 illustrations+0.082
 parcels+0.080
Neg logit contributions
 loser-0.067
 thing-0.065
 am-0.065
 problem-0.063
 friend-0.063
Token remember
Token ID3505
Feature activation-4.20e-02

Pos logit contributions
in+0.030
 loser+0.030
ching+0.028
a+0.026
umber+0.026
Neg logit contributions
 magazines-0.038
keeping-0.036
nam-0.035
 envelop-0.034
 parcels-0.034
Token,
Token ID11
Feature activation-1.68e-02

Pos logit contributions
 loser+0.038
in+0.037
 problem+0.034
 thing+0.033
 am+0.033
Neg logit contributions
 magazines-0.116
keeping-0.116
icol-0.108
 envelop-0.107
 parcels-0.106
Token a
Token ID257
Feature activation+3.07e-02

Pos logit contributions
in+0.027
 loser+0.027
 problem+0.024
 am+0.024
ching+0.023
Neg logit contributions
 magazines-0.034
keeping-0.034
 envelop-0.031
 computers-0.030
 parcels-0.030
Token bike
Token ID7161
Feature activation+1.51e-02

Pos logit contributions
 magazines+0.072
 envelop+0.065
keeping+0.065
 brand+0.065
icol+0.064
Neg logit contributions
in-0.038
 loser-0.035
umber-0.034
 am-0.031
 neither-0.030
Token is
Token ID318
Feature activation-6.16e-04

Pos logit contributions
 magazines+0.037
keeping+0.035
 envelop+0.033
 computers+0.033
 parcels+0.033
Neg logit contributions
 loser-0.019
in-0.018
 problem-0.016
 thing-0.016
 friend-0.015
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
 loser+0.001
in+0.001
 problem+0.001
 thing+0.001
 am+0.001
Neg logit contributions
 magazines-0.001
keeping-0.001
icol-0.001
 envelop-0.001
 parcels-0.001
Token a
Token ID257
Feature activation+3.11e-02

Pos logit contributions
 loser+0.336
in+0.329
 problem+0.312
 thing+0.312
 best+0.304
Neg logit contributions
keeping-0.478
 magazines-0.473
icol-0.438
 envelop-0.423
 parcels-0.420
Token gift
Token ID6979
Feature activation-2.27e-02

Pos logit contributions
 magazines+0.073
keeping+0.068
 envelop+0.067
 brand+0.065
icol+0.065
Neg logit contributions
in-0.039
 loser-0.034
umber-0.034
 am-0.031
m-0.031
Token.
Token ID13
Feature activation-7.00e-03

Pos logit contributions
 loser+0.033
in+0.032
 problem+0.030
 am+0.029
 thing+0.029
Neg logit contributions
 magazines-0.051
keeping-0.050
 envelop-0.045
 parcels-0.045
 computers-0.044
Token A
Token ID317
Feature activation+8.89e-02

Pos logit contributions
 loser+0.012
 am+0.011
in+0.011
'm+0.011
 thing+0.011
Neg logit contributions
keeping-0.014
 magazines-0.014
icol-0.013
 computers-0.012
 envelop-0.012
Token friend
Token ID1545
Feature activation-1.11e-01

Pos logit contributions
 magazines+0.205
 envelop+0.188
keeping+0.188
 parcels+0.184
 computers+0.180
Neg logit contributions
in-0.115
 loser-0.115
umber-0.098
 am-0.098
 neither-0.096
Token not
Token ID407
Feature activation-9.60e-02

Pos logit contributions
 loser+0.000
in+0.000
 problem+0.000
 thing+0.000
 best+0.000
Neg logit contributions
keeping-0.000
 magazines-0.000
icol-0.000
 parcels-0.000
 computers-0.000
Token real
Token ID1103
Feature activation-5.26e-02

Pos logit contributions
 loser+0.136
in+0.132
 problem+0.124
 thing+0.124
 friend+0.122
Neg logit contributions
 magazines-0.218
keeping-0.218
icol-0.203
 parcels-0.195
 envelop-0.195
Token.
Token ID13
Feature activation-5.19e-03

Pos logit contributions
in+0.075
 loser+0.074
a+0.067
 am+0.066
umber+0.064
Neg logit contributions
 magazines-0.118
keeping-0.115
 envelop-0.106
 computers-0.105
 videos-0.105
Token It
Token ID632
Feature activation+2.56e-02

Pos logit contributions
 loser+0.009
in+0.008
 am+0.008
 thing+0.008
 friend+0.008
Neg logit contributions
 magazines-0.011
keeping-0.010
 envelop-0.009
icol-0.009
 parcels-0.009
Token is
Token ID318
Feature activation+6.81e-05

Pos logit contributions
 magazines+0.055
keeping+0.052
 computers+0.050
 videos+0.049
 envelop+0.049
Neg logit contributions
 loser-0.038
in-0.033
 thing-0.030
umber-0.030
 slider-0.029
Token only
Token ID691
Feature activation-2.22e-01

Pos logit contributions
keeping+0.000
 magazines+0.000
icol+0.000
 parcels+0.000
 envelop+0.000
Neg logit contributions
 loser-0.000
in-0.000
 problem-0.000
 thing-0.000
 best-0.000
Token a
Token ID257
Feature activation+3.22e-02

Pos logit contributions
 thing+0.334
 loser+0.333
 best+0.326
 problem+0.324
in+0.322
Neg logit contributions
keeping-0.472
 magazines-0.458
icol-0.443
 parcels-0.405
 illustrations-0.402
Token picture
Token ID4286
Feature activation+3.22e-02

Pos logit contributions
 magazines+0.073
keeping+0.071
 envelop+0.068
icol+0.066
 computers+0.066
Neg logit contributions
in-0.044
 loser-0.042
 am-0.038
umber-0.038
a-0.037
Token.
Token ID13
Feature activation-4.71e-03

Pos logit contributions
 magazines+0.082
keeping+0.082
icol+0.075
 envelop+0.075
 parcels+0.074
Neg logit contributions
 loser-0.037
in-0.035
 problem-0.032
 thing-0.032
 am-0.031
Token Do
Token ID2141
Feature activation+1.33e-01

Pos logit contributions
 loser+0.008
 am+0.007
'm+0.006
 thing+0.006
in+0.006
Neg logit contributions
keeping-0.010
 magazines-0.010
 envelop-0.009
icol-0.009
 desks-0.009
Token you
Token ID345
Feature activation-8.12e-03

Pos logit contributions
 magazines+0.267
keeping+0.256
 computers+0.235
 envelop+0.232
 videos+0.229
Neg logit contributions
 loser-0.227
in-0.222
 problem-0.205
 thing-0.204
 winner-0.199
Token the
Token ID262
Feature activation+8.01e-03

Pos logit contributions
 thing+0.031
in+0.030
 loser+0.030
 problem+0.029
 best+0.029
Neg logit contributions
keeping-0.047
icol-0.044
 magazines-0.043
 parcels-0.041
 illustrations-0.040
Token trail
Token ID8025
Feature activation+8.76e-02

Pos logit contributions
keeping+0.019
 magazines+0.018
icol+0.017
 envelop+0.017
 parcels+0.017
Neg logit contributions
 loser-0.011
in-0.010
 problem-0.010
 thing-0.010
 best-0.010
Token again
Token ID757
Feature activation-6.84e-02

Pos logit contributions
 magazines+0.211
keeping+0.208
 envelop+0.190
 computers+0.187
 videos+0.186
Neg logit contributions
 loser-0.106
in-0.103
 am-0.096
 problem-0.093
'm-0.091
Token.
Token ID13
Feature activation-6.59e-03

Pos logit contributions
 loser+0.091
in+0.089
 problem+0.080
 am+0.079
 winner+0.078
Neg logit contributions
 magazines-0.162
keeping-0.161
 envelop-0.146
 computers-0.145
 parcels-0.142
Token They
Token ID1119
Feature activation+5.33e-02

Pos logit contributions
 loser+0.010
 am+0.009
in+0.009
 thing+0.009
 friend+0.009
Neg logit contributions
 magazines-0.014
keeping-0.014
 envelop-0.013
 parcels-0.013
icol-0.013
Token only
Token ID691
Feature activation-2.21e-01

Pos logit contributions
 magazines+0.107
keeping+0.106
 envelop+0.095
icol+0.094
 parcels+0.093
Neg logit contributions
 loser-0.090
in-0.088
 problem-0.083
 thing-0.082
 am-0.080
Token painted
Token ID13055
Feature activation+6.38e-02

Pos logit contributions
 loser+0.354
in+0.346
 am+0.335
 problem+0.326
 thing+0.322
Neg logit contributions
keeping-0.469
 magazines-0.457
icol-0.427
 parcels-0.411
 envelop-0.405
Token on
Token ID319
Feature activation-2.22e-02

Pos logit contributions
keeping+0.145
 magazines+0.136
icol+0.131
 parcels+0.126
 envelop+0.125
Neg logit contributions
in-0.096
 loser-0.092
 best-0.091
 thing-0.091
 problem-0.085
Token paper
Token ID3348
Feature activation+5.58e-02

Pos logit contributions
 thing+0.032
in+0.029
 best+0.029
 problem+0.029
 loser+0.028
Neg logit contributions
keeping-0.052
icol-0.046
 magazines-0.045
 parcels-0.044
 illustrations-0.043
Token.
Token ID13
Feature activation-5.73e-03

Pos logit contributions
keeping+0.135
 magazines+0.134
icol+0.125
 parcels+0.122
 envelop+0.121
Neg logit contributions
 loser-0.068
in-0.067
 thing-0.062
 problem-0.062
 am-0.060
Token They
Token ID1119
Feature activation+5.34e-02

Pos logit contributions
 loser+0.009
in+0.009
 am+0.008
 thing+0.008
 friend+0.008
Neg logit contributions
keeping-0.012
 magazines-0.012
 parcels-0.011
 envelop-0.011
icol-0.011
Token a
Token ID257
Feature activation+3.35e-02

Pos logit contributions
keeping+0.129
 magazines+0.122
icol+0.122
 parcels+0.116
 envelop+0.115
Neg logit contributions
 best-0.071
 friend-0.062
 only-0.057
 good-0.056
 am-0.055
Token great
Token ID1049
Feature activation-8.77e-03

Pos logit contributions
 magazines+0.070
 brand+0.065
icol+0.064
 envelop+0.062
 videos+0.061
Neg logit contributions
in-0.052
 loser-0.050
umber-0.046
one-0.043
worm-0.043
Token time
Token ID640
Feature activation-9.14e-02

Pos logit contributions
in+0.010
 loser+0.010
umber+0.008
a+0.008
 winner+0.008
Neg logit contributions
 magazines-0.022
icol-0.021
keeping-0.021
 videos-0.020
bons-0.019
Token in
Token ID287
Feature activation-1.33e-02

Pos logit contributions
 loser+0.132
in+0.131
 problem+0.121
 thing+0.120
 friend+0.119
Neg logit contributions
 magazines-0.205
keeping-0.204
icol-0.187
 parcels-0.184
 envelop-0.183
Token the
Token ID262
Feature activation+6.10e-03

Pos logit contributions
 loser+0.024
in+0.023
 problem+0.022
 thing+0.021
 am+0.021
Neg logit contributions
 magazines-0.026
keeping-0.025
icol-0.023
 envelop-0.023
 computers-0.022
Token attic
Token ID46113
Feature activation+2.07e-01

Pos logit contributions
keeping+0.013
 magazines+0.013
icol+0.012
 parcels+0.012
 envelop+0.012
Neg logit contributions
 loser-0.010
in-0.009
 problem-0.009
 thing-0.009
 am-0.009
Token,
Token ID11
Feature activation-1.10e-02

Pos logit contributions
 magazines+0.513
keeping+0.510
icol+0.467
 envelop+0.465
 parcels+0.462
Neg logit contributions
 loser-0.248
in-0.241
 problem-0.218
 thing-0.217
 am-0.214
Token even
Token ID772
Feature activation+5.05e-03

Pos logit contributions
 loser+0.012
in+0.012
 friend+0.011
 best+0.011
 thing+0.011
Neg logit contributions
keeping-0.028
 magazines-0.028
 parcels-0.026
icol-0.026
 envelop-0.026
Token though
Token ID996
Feature activation-7.30e-02

Pos logit contributions
 magazines+0.012
keeping+0.011
 envelop+0.010
 videos+0.010
 computers+0.010
Neg logit contributions
 loser-0.007
in-0.007
 problem-0.006
 am-0.006
 thing-0.006
Token it
Token ID340
Feature activation-3.43e-02

Pos logit contributions
 loser+0.119
in+0.115
 problem+0.106
 thing+0.104
 am+0.104
Neg logit contributions
 magazines-0.151
keeping-0.149
 computers-0.133
icol-0.132
 envelop-0.132
Token was
Token ID373
Feature activation+1.53e-02

Pos logit contributions
 loser+0.062
in+0.054
umber+0.052
 winner+0.048
 slider+0.047
Neg logit contributions
 magazines-0.064
 computers-0.059
keeping-0.058
 videos-0.058
 beds-0.057
Token and
Token ID290
Feature activation+3.89e-02

Pos logit contributions
 magazines+0.062
keeping+0.059
 computers+0.052
icol+0.051
 videos+0.051
Neg logit contributions
 loser-0.087
in-0.084
a-0.078
 am-0.077
 winner-0.076
Token felt
Token ID2936
Feature activation+4.96e-02

Pos logit contributions
 magazines+0.095
keeping+0.091
icol+0.088
 envelop+0.087
 parcels+0.084
Neg logit contributions
 loser-0.047
in-0.046
 thing-0.045
 am-0.044
 problem-0.044
Token very
Token ID845
Feature activation+2.96e-02

Pos logit contributions
keeping+0.118
 magazines+0.118
icol+0.109
 envelop+0.107
 computers+0.106
Neg logit contributions
 loser-0.065
in-0.064
 thing-0.060
 problem-0.059
 am-0.058
Token happy
Token ID3772
Feature activation+3.13e-02

Pos logit contributions
 magazines+0.067
keeping+0.067
icol+0.061
 parcels+0.060
 envelop+0.060
Neg logit contributions
 loser-0.041
in-0.039
 thing-0.035
 problem-0.035
 am-0.035
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
 magazines+0.079
keeping+0.078
 envelop+0.072
icol+0.072
 parcels+0.072
Neg logit contributions
 loser-0.036
in-0.036
 thing-0.034
 problem-0.033
 friend-0.032
Token From
Token ID3574
Feature activation+2.08e-01

Pos logit contributions
 loser+0.016
in+0.016
 am+0.015
 friend+0.015
 thing+0.014
Neg logit contributions
 magazines-0.039
keeping-0.038
 envelop-0.036
icol-0.035
 parcels-0.035
Token that
Token ID326
Feature activation-8.41e-02

Pos logit contributions
keeping+0.525
 magazines+0.517
 envelop+0.485
 parcels+0.476
 computers+0.474
Neg logit contributions
in-0.258
 thing-0.235
 friend-0.229
 problem-0.229
 best-0.224
Token day
Token ID1110
Feature activation-7.62e-02

Pos logit contributions
 loser+0.108
in+0.103
umber+0.096
Only+0.086
a+0.085
Neg logit contributions
 magazines-0.203
keeping-0.187
 videos-0.180
 envelop-0.179
 computers-0.177
Token forward
Token ID2651
Feature activation-2.75e-02

Pos logit contributions
 loser+0.113
in+0.112
 frog+0.104
 problem+0.104
 friend+0.103
Neg logit contributions
 magazines-0.159
keeping-0.155
 envelop-0.143
 parcels-0.141
 computers-0.140
Token,
Token ID11
Feature activation-1.69e-02

Pos logit contributions
 loser+0.028
in+0.027
 problem+0.024
 thing+0.023
 am+0.023
Neg logit contributions
keeping-0.073
 magazines-0.073
icol-0.067
 envelop-0.067
 parcels-0.066
Token Tom
Token ID4186
Feature activation+2.30e-02

Pos logit contributions
 loser+0.025
in+0.024
 problem+0.022
 am+0.022
 thing+0.022
Neg logit contributions
 magazines-0.038
keeping-0.037
 envelop-0.034
 computers-0.033
 parcels-0.033
Token She
Token ID1375
Feature activation+4.02e-02

Pos logit contributions
 am+0.019
 thing+0.019
'm+0.019
 frog+0.018
 friend+0.017
Neg logit contributions
keeping-0.042
 magazines-0.041
 envelop-0.040
 parcels-0.039
 computers-0.038
Token leapt
Token ID43713
Feature activation+6.15e-02

Pos logit contributions
keeping+0.114
 magazines+0.107
 envelop+0.101
 parcels+0.100
bons+0.099
Neg logit contributions
in-0.043
 friend-0.042
 am-0.040
a-0.040
 frog-0.039
Token out
Token ID503
Feature activation-7.61e-03

Pos logit contributions
keeping+0.168
 magazines+0.161
 computers+0.150
 parcels+0.149
 envelop+0.148
Neg logit contributions
in-0.064
 thing-0.062
 frog-0.058
 loser-0.058
 best-0.058
Token of
Token ID286
Feature activation+2.15e-03

Pos logit contributions
in+0.007
 loser+0.006
 thing+0.006
 friend+0.005
 problem+0.005
Neg logit contributions
keeping-0.022
 magazines-0.022
icol-0.021
 parcels-0.020
 computers-0.020
Token her
Token ID607
Feature activation-8.37e-04

Pos logit contributions
keeping+0.005
icol+0.005
 parcels+0.005
 counters+0.005
isions+0.005
Neg logit contributions
 thing-0.003
 friend-0.002
 frog-0.002
 end-0.002
 step-0.002
Token bed
Token ID3996
Feature activation+2.14e-01

Pos logit contributions
 loser+0.001
in+0.001
 problem+0.001
 thing+0.001
 friend+0.001
Neg logit contributions
keeping-0.002
 magazines-0.002
icol-0.001
 parcels-0.001
 envelop-0.001
Token and
Token ID290
Feature activation+3.62e-02

Pos logit contributions
keeping+0.570
 magazines+0.557
 parcels+0.521
 envelop+0.519
icol+0.518
Neg logit contributions
in-0.208
 loser-0.208
 friend-0.200
 thing-0.199
 problem-0.196
Token ran
Token ID4966
Feature activation+3.73e-02

Pos logit contributions
keeping+0.089
 magazines+0.088
bons+0.080
 envelop+0.080
 parcels+0.079
Neg logit contributions
in-0.044
 friend-0.043
 thing-0.041
 frog-0.041
a-0.040
Token outside
Token ID2354
Feature activation+4.72e-02

Pos logit contributions
 magazines+0.091
keeping+0.089
 parcels+0.085
 envelop+0.084
 computers+0.083
Neg logit contributions
 thing-0.045
in-0.044
 loser-0.043
 friend-0.042
 am-0.042
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
 magazines+0.114
keeping+0.113
icol+0.104
 envelop+0.103
 parcels+0.103
Neg logit contributions
 loser-0.060
in-0.059
 thing-0.054
 am-0.054
 problem-0.053
Token She
Token ID1375
Feature activation+4.08e-02

Pos logit contributions
'm+0.017
 am+0.017
 thing+0.016
 friend+0.016
 frog+0.015
Neg logit contributions
keeping-0.045
 magazines-0.044
 envelop-0.042
 parcels-0.041
 computers-0.040
Token to
Token ID284
Feature activation+2.92e-02

Pos logit contributions
 magazines+0.166
keeping+0.163
icol+0.152
 envelop+0.150
 parcels+0.149
Neg logit contributions
 loser-0.078
in-0.077
 thing-0.073
 problem-0.072
 friend-0.071
Token be
Token ID307
Feature activation-1.03e-01

Pos logit contributions
 magazines+0.068
keeping+0.067
icol+0.061
 envelop+0.061
 parcels+0.060
Neg logit contributions
 loser-0.040
in-0.039
 thing-0.036
 problem-0.035
 am-0.035
Token friends
Token ID2460
Feature activation-2.30e-02

Pos logit contributions
 loser+0.123
in+0.121
 friend+0.112
 best+0.112
 thing+0.111
Neg logit contributions
 magazines-0.260
keeping-0.259
icol-0.241
 computers-0.231
 parcels-0.231
Token again
Token ID757
Feature activation-6.90e-02

Pos logit contributions
 loser+0.029
in+0.028
 problem+0.025
 thing+0.025
 am+0.025
Neg logit contributions
 magazines-0.056
keeping-0.056
icol-0.051
 envelop-0.051
 parcels-0.051
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
 loser+0.090
in+0.086
 problem+0.079
 thing+0.078
 am+0.077
Neg logit contributions
keeping-0.165
 magazines-0.165
 envelop-0.150
 computers-0.148
 parcels-0.148
Token From
Token ID3574
Feature activation+2.07e-01

Pos logit contributions
 loser+0.020
in+0.019
 friend+0.018
 am+0.018
 frog+0.018
Neg logit contributions
 magazines-0.020
keeping-0.020
 envelop-0.017
icol-0.017
 computers-0.017
Token then
Token ID788
Feature activation-1.28e-02

Pos logit contributions
keeping+0.454
 magazines+0.453
icol+0.409
 envelop+0.408
 parcels+0.404
Neg logit contributions
in-0.313
 loser-0.310
 problem-0.288
 thing-0.287
 friend-0.282
Token on
Token ID319
Feature activation-2.15e-02

Pos logit contributions
 loser+0.022
in+0.021
 frog+0.020
 friend+0.020
 coward+0.019
Neg logit contributions
 magazines-0.023
 videos-0.021
keeping-0.021
 envelop-0.021
bons-0.020
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
 loser+0.028
in+0.027
ching+0.023
 winner+0.023
 am+0.023
Neg logit contributions
 magazines-0.051
keeping-0.048
 envelop-0.046
 computers-0.046
 videos-0.045
Token they
Token ID484
Feature activation-1.72e-02

Pos logit contributions
 loser+0.032
in+0.031
 problem+0.029
 am+0.028
a+0.028
Neg logit contributions
 magazines-0.035
keeping-0.033
 videos-0.031
 envelop-0.031
 computers-0.030
Token always
Token ID1464
Feature activation-7.49e-02

Pos logit contributions
 loser+0.028
in+0.027
 problem+0.026
 thing+0.026
 am+0.025
Neg logit contributions
 magazines-0.035
keeping-0.035
 envelop-0.031
icol-0.031
 parcels-0.031
Token and
Token ID290
Feature activation+3.56e-02

Pos logit contributions
 magazines+0.067
keeping+0.066
 envelop+0.061
 parcels+0.061
icol+0.061
Neg logit contributions
in-0.033
 loser-0.033
 thing-0.030
 friend-0.030
 problem-0.030
Token hugged
Token ID41130
Feature activation+1.07e-01

Pos logit contributions
 magazines+0.090
bons+0.081
keeping+0.081
 computers+0.081
 envelop+0.081
Neg logit contributions
 best-0.044
 friend-0.042
 person-0.038
 thing-0.038
 frog-0.037
Token each
Token ID1123
Feature activation-1.01e-01

Pos logit contributions
 magazines+0.256
keeping+0.254
icol+0.240
 parcels+0.236
 envelop+0.234
Neg logit contributions
 thing-0.137
 loser-0.132
in-0.131
 friend-0.129
 best-0.122
Token other
Token ID584
Feature activation-7.65e-02

Pos logit contributions
 loser+0.091
in+0.086
 problem+0.076
 am+0.075
 winner+0.066
Neg logit contributions
 magazines-0.286
keeping-0.278
 envelop-0.265
 computers-0.258
 parcels-0.258
Token.
Token ID13
Feature activation-1.90e-02

Pos logit contributions
 loser+0.088
in+0.087
 thing+0.079
 friend+0.078
 best+0.077
Neg logit contributions
 magazines-0.192
keeping-0.191
icol-0.178
 envelop-0.175
 parcels-0.175
Token From
Token ID3574
Feature activation+2.10e-01

Pos logit contributions
 friend+0.027
 frog+0.027
 am+0.027
'm+0.027
 best+0.027
Neg logit contributions
keeping-0.040
 magazines-0.040
 computers-0.036
 envelop-0.035
 desks-0.034
Token that
Token ID326
Feature activation-8.48e-02

Pos logit contributions
keeping+0.505
 magazines+0.502
 envelop+0.458
icol+0.458
 parcels+0.454
Neg logit contributions
in-0.280
 loser-0.267
 thing-0.254
 problem-0.252
 friend-0.248
Token day
Token ID1110
Feature activation-7.47e-02

Pos logit contributions
 loser+0.125
umber+0.121
in+0.121
 slider+0.103
a+0.100
Neg logit contributions
 magazines-0.187
 videos-0.168
keeping-0.164
 envelop-0.163
 computers-0.161
Token on
Token ID319
Feature activation-2.27e-02

Pos logit contributions
in+0.118
 loser+0.117
 frog+0.113
 problem+0.108
 friend+0.107
Neg logit contributions
 magazines-0.142
keeping-0.132
 envelop-0.125
 videos-0.123
 computers-0.121
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
 loser+0.027
in+0.026
a+0.022
 winner+0.022
ching+0.022
Neg logit contributions
 magazines-0.057
keeping-0.056
icol-0.052
 envelop-0.052
 computers-0.051
Token they
Token ID484
Feature activation-1.74e-02

Pos logit contributions
 loser+0.027
in+0.026
 problem+0.024
 thing+0.024
 am+0.024
Neg logit contributions
 magazines-0.041
keeping-0.041
icol-0.037
 envelop-0.037
 parcels-0.036
Token:
Token ID25
Feature activation+4.62e-02

Pos logit contributions
 loser+0.007
in+0.007
 thing+0.006
 problem+0.006
 winner+0.006
Neg logit contributions
 magazines-0.008
keeping-0.008
 computers-0.007
 videos-0.007
 envelop-0.007
Token Try
Token ID9993
Feature activation+3.04e-02

Pos logit contributions
 magazines+0.095
keeping+0.092
 computers+0.084
 envelop+0.084
 videos+0.084
Neg logit contributions
 loser-0.075
in-0.074
 problem-0.068
 thing-0.067
 am-0.066
Token hard
Token ID1327
Feature activation-5.12e-02

Pos logit contributions
 magazines+0.069
keeping+0.068
icol+0.062
 envelop+0.062
 computers+0.061
Neg logit contributions
 loser-0.044
in-0.042
 problem-0.039
 thing-0.039
 am-0.038
Token and
Token ID290
Feature activation+4.06e-02

Pos logit contributions
 loser+0.070
in+0.068
 thing+0.061
 problem+0.061
 am+0.061
Neg logit contributions
 magazines-0.120
keeping-0.119
icol-0.108
 envelop-0.108
 computers-0.107
Token ask
Token ID1265
Feature activation-5.82e-02

Pos logit contributions
 magazines+0.095
keeping+0.094
 computers+0.086
 parcels+0.086
 envelop+0.085
Neg logit contributions
 loser-0.054
in-0.054
a-0.046
 thing-0.045
 problem-0.045
Token for
Token ID329
Feature activation+5.86e-02

Pos logit contributions
 loser+0.079
in+0.077
 thing+0.076
 problem+0.075
 friend+0.074
Neg logit contributions
keeping-0.136
 magazines-0.130
icol-0.126
 parcels-0.119
 envelop-0.118
Token help
Token ID1037
Feature activation-1.12e-01

Pos logit contributions
keeping+0.149
 magazines+0.149
icol+0.138
 envelop+0.136
 parcels+0.135
Neg logit contributions
 loser-0.067
in-0.065
 problem-0.060
 thing-0.059
 am-0.058
Token when
Token ID618
Feature activation-5.92e-02

Pos logit contributions
 loser+0.145
in+0.139
 problem+0.126
 am+0.125
 thing+0.123
Neg logit contributions
 magazines-0.270
keeping-0.267
 computers-0.244
 envelop-0.243
 parcels-0.240
Token you
Token ID345
Feature activation-6.91e-03

Pos logit contributions
 loser+0.090
in+0.087
 problem+0.081
 thing+0.081
 am+0.080
Neg logit contributions
 magazines-0.128
keeping-0.128
icol-0.116
 envelop-0.115
 parcels-0.114
Token need
Token ID761
Feature activation-9.46e-03

Pos logit contributions
 loser+0.009
in+0.008
 thing+0.008
 problem+0.007
a+0.007
Neg logit contributions
 magazines-0.017
keeping-0.016
 computers-0.015
 parcels-0.015
 envelop-0.015
Token it
Token ID340
Feature activation-3.42e-02

Pos logit contributions
 loser+0.013
in+0.013
 problem+0.012
 thing+0.012
 am+0.012
Neg logit contributions
keeping-0.021
 magazines-0.021
icol-0.020
 envelop-0.019
 parcels-0.019
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
 loser+0.108
in+0.102
 problem+0.094
 thing+0.092
 am+0.092
Neg logit contributions
 magazines-0.186
keeping-0.186
 envelop-0.168
 computers-0.165
 parcels-0.164
Token We
Token ID775
Feature activation+9.19e-02

Pos logit contributions
 loser+0.017
 am+0.016
 thing+0.016
in+0.016
 friend+0.015
Neg logit contributions
 magazines-0.021
keeping-0.020
 envelop-0.018
 computers-0.018
 parcels-0.018
Token can
Token ID460
Feature activation-3.22e-02

Pos logit contributions
 magazines+0.236
keeping+0.230
 envelop+0.219
 parcels+0.214
 computers+0.211
Neg logit contributions
 loser-0.093
in-0.092
 thing-0.079
 problem-0.076
 frog-0.076
Token glue
Token ID22749
Feature activation-6.52e-03

Pos logit contributions
 loser+0.055
in+0.052
 problem+0.050
 thing+0.050
a+0.048
Neg logit contributions
 magazines-0.063
keeping-0.061
 envelop-0.057
bons-0.056
 parcels-0.055
Token the
Token ID262
Feature activation+3.65e-03

Pos logit contributions
 loser+0.009
in+0.009
 thing+0.009
 problem+0.008
 am+0.008
Neg logit contributions
 magazines-0.015
keeping-0.015
icol-0.014
 parcels-0.013
 computers-0.013
Token beads
Token ID36116
Feature activation+6.94e-02

Pos logit contributions
 magazines+0.008
keeping+0.008
icol+0.007
 parcels+0.007
 envelop+0.007
Neg logit contributions
 loser-0.006
in-0.005
 problem-0.005
 thing-0.005
 am-0.005
Token and
Token ID290
Feature activation+4.10e-02

Pos logit contributions
 magazines+0.151
keeping+0.151
 envelop+0.136
 parcels+0.130
 computers+0.130
Neg logit contributions
 loser-0.103
in-0.098
 problem-0.094
 am-0.091
 thing-0.091
Token wrap
Token ID14441
Feature activation+1.52e-02

Pos logit contributions
keeping+0.101
icol+0.097
 magazines+0.091
 illustrations+0.086
 counters+0.086
Neg logit contributions
 loser-0.049
 am-0.048
 friend-0.048
 thing-0.048
 stick-0.046
Token the
Token ID262
Feature activation+4.17e-03

Pos logit contributions
 magazines+0.036
keeping+0.036
icol+0.034
 computers+0.033
 parcels+0.032
Neg logit contributions
in-0.020
 thing-0.019
 loser-0.019
a-0.018
 am-0.018
Token cater
Token ID20825
Feature activation-3.91e-02

Pos logit contributions
keeping+0.008
 magazines+0.008
icol+0.007
 parcels+0.007
 computers+0.007
Neg logit contributions
 loser-0.007
in-0.007
 problem-0.007
 thing-0.007
 am-0.007
Tokenpillar
Token ID41643
Feature activation-8.10e-02

Pos logit contributions
 loser+0.036
in+0.032
 problem+0.030
 thing+0.029
 am+0.028
Neg logit contributions
 magazines-0.109
keeping-0.108
icol-0.100
 envelop-0.100
 parcels-0.098
Token,
Token ID11
Feature activation-1.89e-02

Pos logit contributions
in+0.055
 loser+0.054
 thing+0.051
 problem+0.049
 friend+0.049
Neg logit contributions
 magazines-0.108
keeping-0.108
icol-0.099
 envelop-0.098
 parcels-0.098
Token and
Token ID290
Feature activation+3.65e-02

Pos logit contributions
in+0.019
 loser+0.018
 thing+0.018
 frog+0.017
 best+0.017
Neg logit contributions
keeping-0.051
 magazines-0.050
icol-0.045
bons-0.045
 envelop-0.045
Token a
Token ID257
Feature activation+2.97e-02

Pos logit contributions
 magazines+0.082
keeping+0.082
icol+0.074
 envelop+0.073
 parcels+0.073
Neg logit contributions
 loser-0.053
in-0.052
 thing-0.048
 problem-0.048
 am-0.047
Token secret
Token ID3200
Feature activation-1.13e-02

Pos logit contributions
 magazines+0.076
keeping+0.075
 envelop+0.069
icol+0.069
 parcels+0.069
Neg logit contributions
 loser-0.033
in-0.032
 am-0.029
 problem-0.028
 thing-0.028
Token door
Token ID3420
Feature activation+1.14e-01

Pos logit contributions
 loser+0.018
in+0.018
 am+0.016
 problem+0.016
 thing+0.015
Neg logit contributions
keeping-0.023
 magazines-0.023
 envelop-0.021
icol-0.021
 parcels-0.021
Token appeared
Token ID4120
Feature activation+1.21e-01

Pos logit contributions
 magazines+0.267
keeping+0.263
icol+0.243
nam+0.235
bons+0.235
Neg logit contributions
 thing-0.149
 am-0.148
 problem-0.147
in-0.146
 friend-0.139
Token!
Token ID0
Feature activation+1.84e-02

Pos logit contributions
 magazines+0.337
keeping+0.324
icol+0.310
 parcels+0.309
 computers+0.308
Neg logit contributions
in-0.114
 loser-0.109
 friend-0.105
 thing-0.103
 best-0.101
Token The
Token ID383
Feature activation+6.35e-02

Pos logit contributions
keeping+0.044
 magazines+0.041
icol+0.040
 envelop+0.038
bons+0.038
Neg logit contributions
 frog-0.022
'm-0.021
 person-0.021
 loser-0.021
 thing-0.021
Token rabbit
Token ID22746
Feature activation-8.04e-02

Pos logit contributions
 magazines+0.156
keeping+0.153
 envelop+0.140
icol+0.140
 parcels+0.140
Neg logit contributions
 loser-0.081
in-0.079
 am-0.070
 problem-0.069
 thing-0.069
Token hopped
Token ID45230
Feature activation+1.40e-02

Pos logit contributions
in+0.085
 loser+0.084
 problem+0.076
 thing+0.075
 friend+0.075
Neg logit contributions
 magazines-0.210
keeping-0.209
icol-0.193
 envelop-0.191
 computers-0.189
Token inside
Token ID2641
Feature activation+5.03e-02

Pos logit contributions
 magazines+0.036
keeping+0.035
 computers+0.033
icol+0.033
 envelop+0.032
Neg logit contributions
 thing-0.016
in-0.016
 loser-0.015
 frog-0.014
 friend-0.014
Token
Token ID198
Feature activation-3.71e-02

Pos logit contributions
 magazines+0.021
keeping+0.021
 envelop+0.019
 parcels+0.019
 computers+0.018
Neg logit contributions
 loser-0.016
 winner-0.014
 problem-0.014
in-0.014
 thing-0.014
Token
Token ID198
Feature activation-3.74e-02

Pos logit contributions
 loser+0.052
 problem+0.046
in+0.046
 thing+0.044
 am+0.043
Neg logit contributions
 magazines-0.086
keeping-0.085
 envelop-0.078
 parcels-0.078
 computers-0.077
TokenTom
Token ID13787
Feature activation+4.03e-02

Pos logit contributions
 loser+0.047
in+0.044
 problem+0.042
 am+0.041
 thing+0.040
Neg logit contributions
 magazines-0.091
keeping-0.090
 envelop-0.083
 parcels-0.083
icol-0.082
Tokenmy
Token ID1820
Feature activation-1.69e-02

Pos logit contributions
 magazines+0.081
keeping+0.079
icol+0.072
 envelop+0.072
 computers+0.071
Neg logit contributions
 loser-0.070
in-0.063
 thing-0.061
 problem-0.061
 am-0.060
Token loved
Token ID6151
Feature activation+1.20e-01

Pos logit contributions
 loser+0.030
in+0.026
 am+0.026
 problem+0.026
 thing+0.025
Neg logit contributions
 magazines-0.034
keeping-0.032
icol-0.030
 parcels-0.030
 envelop-0.030
Token playing
Token ID2712
Feature activation+6.36e-02

Pos logit contributions
keeping+0.267
 magazines+0.251
icol+0.249
 envelop+0.237
 parcels+0.236
Neg logit contributions
in-0.172
 thing-0.170
 friend-0.167
 problem-0.164
 frog-0.163
Token catch
Token ID4929
Feature activation-6.36e-02

Pos logit contributions
 magazines+0.123
keeping+0.122
icol+0.109
 envelop+0.109
 parcels+0.107
Neg logit contributions
 loser-0.112
in-0.110
 problem-0.103
 thing-0.102
 am-0.102
Token with
Token ID351
Feature activation-1.20e-02

Pos logit contributions
in+0.087
 loser+0.082
 thing+0.081
 frog+0.077
 friend+0.077
Neg logit contributions
keeping-0.149
 magazines-0.145
icol-0.139
 parcels-0.134
 envelop-0.133
Token Dad
Token ID17415
Feature activation+3.18e-02

Pos logit contributions
in+0.018
 loser+0.017
 thing+0.017
 friend+0.017
 frog+0.017
Neg logit contributions
keeping-0.027
 magazines-0.025
icol-0.025
 parcels-0.023
 envelop-0.023
Token and
Token ID290
Feature activation+3.39e-02

Pos logit contributions
keeping+0.083
 magazines+0.082
icol+0.078
 envelop+0.075
 parcels+0.075
Neg logit contributions
in-0.034
 loser-0.033
 thing-0.030
 friend-0.029
 problem-0.029
Token he
Token ID339
Feature activation-9.12e-02

Pos logit contributions
 magazines+0.082
keeping+0.081
icol+0.075
 envelop+0.074
bons+0.073
Neg logit contributions
 loser-0.042
in-0.041
 friend-0.040
 thing-0.039
 am-0.038
Token a
Token ID257
Feature activation+3.71e-02

Pos logit contributions
 magazines+0.092
keeping+0.090
icol+0.083
 envelop+0.083
 parcels+0.082
Neg logit contributions
 loser-0.058
in-0.054
 thing-0.050
 am-0.050
 problem-0.050
Token little
Token ID1310
Feature activation-4.23e-02

Pos logit contributions
 magazines+0.091
keeping+0.089
icol+0.083
 envelop+0.083
 parcels+0.082
Neg logit contributions
 loser-0.045
in-0.044
 am-0.039
 problem-0.038
 thing-0.038
Token bit
Token ID1643
Feature activation-8.59e-02

Pos logit contributions
 loser+0.036
in+0.035
 am+0.030
 problem+0.029
 thing+0.028
Neg logit contributions
 magazines-0.119
keeping-0.118
 envelop-0.109
icol-0.109
 parcels-0.108
Token into
Token ID656
Feature activation+3.46e-02

Pos logit contributions
 loser+0.124
in+0.113
 thing+0.108
 am+0.106
umber+0.104
Neg logit contributions
 magazines-0.176
keeping-0.172
 videos-0.161
 speakers-0.158
icol-0.156
Token their
Token ID511
Feature activation+6.54e-02

Pos logit contributions
 magazines+0.072
keeping+0.071
icol+0.067
 envelop+0.064
 parcels+0.064
Neg logit contributions
 loser-0.055
in-0.054
 thing-0.051
 problem-0.051
 am-0.050
Token cups
Token ID14180
Feature activation+1.28e-01

Pos logit contributions
keeping+0.132
 magazines+0.130
icol+0.119
 parcels+0.116
 envelop+0.115
Neg logit contributions
 loser-0.107
in-0.104
 problem-0.103
 thing-0.102
 friend-0.102
Token.
Token ID13
Feature activation-1.83e-03

Pos logit contributions
 magazines+0.313
keeping+0.313
 envelop+0.284
icol+0.283
 parcels+0.281
Neg logit contributions
 loser-0.157
in-0.151
 problem-0.138
 thing-0.137
 am-0.135
Token They
Token ID1119
Feature activation+6.42e-02

Pos logit contributions
 loser+0.003
 frog+0.003
in+0.003
 friend+0.002
 am+0.002
Neg logit contributions
keeping-0.004
 magazines-0.004
icol-0.004
 envelop-0.004
 computers-0.004
Token cl
Token ID537
Feature activation+2.46e-03

Pos logit contributions
keeping+0.150
 magazines+0.150
icol+0.136
 envelop+0.135
 parcels+0.134
Neg logit contributions
 loser-0.086
in-0.084
 problem-0.077
 am-0.077
 thing-0.077
Tokenink
Token ID676
Feature activation+8.08e-02

Pos logit contributions
 magazines+0.006
keeping+0.006
 computers+0.005
 envelop+0.005
icol+0.005
Neg logit contributions
 loser-0.004
in-0.004
 am-0.003
 friend-0.003
a-0.003
Tokened
Token ID276
Feature activation+1.12e-02

Pos logit contributions
 magazines+0.189
keeping+0.188
 envelop+0.172
 parcels+0.170
icol+0.169
Neg logit contributions
 loser-0.110
in-0.096
 am-0.095
 problem-0.095
 thing-0.091
Token tell
Token ID1560
Feature activation-1.00e-02

Pos logit contributions
keeping+0.092
 magazines+0.089
 videos+0.081
 envelop+0.081
 computers+0.080
Neg logit contributions
in-0.058
 loser-0.057
 problem-0.052
 thing-0.050
a-0.049
Token her
Token ID607
Feature activation+2.42e-02

Pos logit contributions
 loser+0.016
in+0.016
 thing+0.015
 problem+0.015
 friend+0.015
Neg logit contributions
keeping-0.021
 magazines-0.020
icol-0.019
 parcels-0.018
 envelop-0.018
Token mom
Token ID1995
Feature activation+2.50e-02

Pos logit contributions
keeping+0.063
 magazines+0.063
icol+0.059
 envelop+0.058
 parcels+0.058
Neg logit contributions
 loser-0.026
in-0.025
 problem-0.023
 friend-0.023
 thing-0.023
Tokenmy
Token ID1820
Feature activation+2.14e-03

Pos logit contributions
keeping+0.060
 magazines+0.060
icol+0.056
 envelop+0.054
 parcels+0.054
Neg logit contributions
 loser-0.032
in-0.032
 problem-0.029
 thing-0.029
 am-0.028
Token because
Token ID780
Feature activation-3.62e-02

Pos logit contributions
keeping+0.005
 magazines+0.005
icol+0.005
 envelop+0.005
 parcels+0.005
Neg logit contributions
in-0.003
 loser-0.002
 problem-0.002
 thing-0.002
 am-0.002
Token she
Token ID673
Feature activation-1.72e-03

Pos logit contributions
 thing+0.037
 problem+0.036
 best+0.036
 friend+0.035
in+0.035
Neg logit contributions
keeping-0.097
icol-0.094
 magazines-0.092
bons-0.087
 envelop-0.087
Token was
Token ID373
Feature activation+2.92e-02

Pos logit contributions
 loser+0.002
in+0.002
 problem+0.002
 thing+0.002
 winner+0.002
Neg logit contributions
 magazines-0.004
keeping-0.004
 computers-0.004
icol-0.004
 parcels-0.003
Token afraid
Token ID7787
Feature activation-6.58e-02

Pos logit contributions
keeping+0.069
 magazines+0.069
icol+0.065
 parcels+0.062
 envelop+0.062
Neg logit contributions
 loser-0.039
in-0.038
 problem-0.036
 friend-0.035
 best-0.035
Token she
Token ID673
Feature activation-2.43e-03

Pos logit contributions
 loser+0.074
in+0.073
 problem+0.068
 thing+0.068
 friend+0.065
Neg logit contributions
keeping-0.167
 magazines-0.165
icol-0.156
 envelop-0.153
 parcels-0.152
Token would
Token ID561
Feature activation+4.34e-02

Pos logit contributions
 loser+0.003
in+0.003
 problem+0.003
 thing+0.002
 am+0.002
Neg logit contributions
 magazines-0.006
keeping-0.006
icol-0.006
 envelop-0.006
 parcels-0.005
Token get
Token ID651
Feature activation-4.86e-02

Pos logit contributions
keeping+0.095
 magazines+0.090
 envelop+0.083
 computers+0.081
 parcels+0.081
Neg logit contributions
in-0.069
 loser-0.062
 problem-0.057
 thing-0.056
umber-0.054
Token a
Token ID257
Feature activation+3.11e-02

Pos logit contributions
in+0.009
 thing+0.009
 friend+0.009
 loser+0.009
 best+0.009
Neg logit contributions
keeping-0.014
 magazines-0.013
icol-0.013
 computers-0.012
 envelop-0.012
Token special
Token ID2041
Feature activation-2.47e-02

Pos logit contributions
 magazines+0.065
keeping+0.061
 envelop+0.059
icol+0.059
 parcels+0.058
Neg logit contributions
 loser-0.049
in-0.048
umber-0.041
 am-0.041
 thing-0.041
Token pink
Token ID11398
Feature activation-1.81e-03

Pos logit contributions
 loser+0.035
in+0.034
 problem+0.030
 am+0.029
a+0.029
Neg logit contributions
 magazines-0.057
keeping-0.055
icol-0.051
 envelop-0.051
 parcels-0.050
Token medicine
Token ID9007
Feature activation-2.10e-02

Pos logit contributions
 loser+0.003
in+0.003
 thing+0.002
 problem+0.002
 am+0.002
Neg logit contributions
keeping-0.004
 magazines-0.004
icol-0.004
 envelop-0.004
 parcels-0.003
Token that
Token ID326
Feature activation-8.40e-02

Pos logit contributions
 loser+0.032
in+0.031
 problem+0.028
 thing+0.028
 am+0.028
Neg logit contributions
 magazines-0.046
keeping-0.045
 envelop-0.041
icol-0.041
 parcels-0.040
Token will
Token ID481
Feature activation-4.15e-02

Pos logit contributions
 loser+0.119
in+0.111
 problem+0.097
 friend+0.094
 winner+0.092
Neg logit contributions
 magazines-0.198
keeping-0.189
 computers-0.176
 parcels-0.176
 videos-0.175
Token make
Token ID787
Feature activation-6.62e-02

Pos logit contributions
in+0.056
 loser+0.055
 thing+0.048
 problem+0.047
a+0.046
Neg logit contributions
 magazines-0.096
keeping-0.096
 envelop-0.088
 parcels-0.087
bons-0.085
Token you
Token ID345
Feature activation-5.74e-03

Pos logit contributions
 loser+0.099
in+0.099
 problem+0.092
 thing+0.092
 am+0.090
Neg logit contributions
keeping-0.143
 magazines-0.142
icol-0.131
 envelop-0.127
 parcels-0.127
Token feel
Token ID1254
Feature activation+1.75e-02

Pos logit contributions
 loser+0.006
in+0.006
 am+0.005
 problem+0.005
 thing+0.005
Neg logit contributions
keeping-0.015
 magazines-0.015
icol-0.014
 envelop-0.014
 computers-0.014
Token better
Token ID1365
Feature activation-6.08e-02

Pos logit contributions
keeping+0.049
 magazines+0.049
icol+0.046
 computers+0.045
 envelop+0.045
Neg logit contributions
 loser-0.015
in-0.015
 problem-0.013
 thing-0.013
 am-0.013
Token right
Token ID826
Feature activation-1.41e-01

Pos logit contributions
 loser+0.105
in+0.102
 am+0.095
 problem+0.095
 thing+0.094
Neg logit contributions
keeping-0.119
 magazines-0.118
 envelop-0.103
 parcels-0.102
 computers-0.101
Token.
Token ID13
Feature activation-4.36e-04

Pos logit contributions
in+0.067
 loser+0.065
 thing+0.060
 am+0.059
 problem+0.059
Neg logit contributions
 magazines-0.174
keeping-0.173
icol-0.167
 envelop-0.160
 parcels-0.160
Token<|endoftext|>
Token ID50256
Feature activation+1.20e-01

Pos logit contributions
 loser+0.000
in+0.000
 friend+0.000
 am+0.000
'm+0.000
Neg logit contributions
keeping-0.001
 magazines-0.001
 envelop-0.001
icol-0.001
 parcels-0.001
TokenOnce
Token ID7454
Feature activation-8.50e-02

Pos logit contributions
 magazines+0.319
keeping+0.316
icol+0.294
 envelop+0.292
 parcels+0.291
Neg logit contributions
 loser-0.124
in-0.122
 am-0.109
 thing-0.107
 problem-0.105
Token upon
Token ID2402
Feature activation+9.36e-02

Pos logit contributions
 loser+0.163
 winner+0.146
ching+0.138
 thing+0.137
 am+0.136
Neg logit contributions
 magazines-0.146
keeping-0.130
nam-0.126
 envelop-0.125
 brand-0.123
Token a
Token ID257
Feature activation+3.71e-02

Pos logit contributions
keeping+0.228
icol+0.223
 magazines+0.222
 parcels+0.215
 envelop+0.210
Neg logit contributions
in-0.120
 loser-0.119
 friend-0.115
 thing-0.113
 am-0.112
Token time
Token ID640
Feature activation-8.49e-02

Pos logit contributions
 magazines+0.084
icol+0.078
 envelop+0.075
 apr+0.074
 computers+0.074
Neg logit contributions
 loser-0.048
in-0.044
 thing-0.042
 am-0.042
 winner-0.040
Token,
Token ID11
Feature activation-1.12e-02

Pos logit contributions
 loser+0.130
in+0.122
 winner+0.114
 thing+0.112
ching+0.112
Neg logit contributions
 magazines-0.164
keeping-0.155
icol-0.149
 Emmy-0.145
nam-0.144
Token there
Token ID612
Feature activation-3.39e-02

Pos logit contributions
 loser+0.019
 problem+0.017
 am+0.017
 dare+0.017
 winner+0.017
Neg logit contributions
 magazines-0.020
nam-0.019
keeping-0.019
 envelop-0.018
 computers-0.017
Token was
Token ID373
Feature activation+1.80e-02

Pos logit contributions
 loser+0.060
 winner+0.054
 problem+0.052
in+0.051
 leader+0.047
Neg logit contributions
keeping-0.062
 magazines-0.061
 beds-0.055
 computers-0.054
 envelop-0.053
Token a
Token ID257
Feature activation+3.68e-02

Pos logit contributions
keeping+0.046
 parcels+0.043
 magazines+0.042
icol+0.042
 computers+0.040
Neg logit contributions
 best-0.023
in-0.022
 loser-0.021
 friend-0.021
a-0.021
Token little
Token ID1310
Feature activation-4.62e-02

Pos logit contributions
 magazines+0.090
icol+0.081
 envelop+0.081
keeping+0.081
 videos+0.080
Neg logit contributions
 loser-0.043
in-0.039
 am-0.039
 problem-0.037
ching-0.036
Token back
Token ID736
Feature activation-7.04e-02

Pos logit contributions
 loser+0.083
in+0.081
 problem+0.074
 thing+0.073
 am+0.073
Neg logit contributions
 magazines-0.145
keeping-0.144
icol-0.131
 envelop-0.131
 parcels-0.130
Token to
Token ID284
Feature activation+2.92e-02

Pos logit contributions
in+0.087
 loser+0.085
 friend+0.079
 thing+0.078
 am+0.078
Neg logit contributions
 magazines-0.173
keeping-0.169
icol-0.158
 envelop-0.157
 parcels-0.155
Token his
Token ID465
Feature activation-5.76e-02

Pos logit contributions
 magazines+0.065
keeping+0.064
icol+0.059
 parcels+0.057
 envelop+0.057
Neg logit contributions
 loser-0.042
in-0.042
 friend-0.040
 thing-0.040
 problem-0.040
Token house
Token ID2156
Feature activation+7.86e-02

Pos logit contributions
 loser+0.081
in+0.076
 am+0.071
a+0.069
ching+0.069
Neg logit contributions
 magazines-0.133
keeping-0.132
icol-0.120
 envelop-0.120
 parcels-0.119
Token quickly
Token ID2952
Feature activation+4.20e-02

Pos logit contributions
 magazines+0.210
keeping+0.204
icol+0.201
 parcels+0.194
 envelop+0.192
Neg logit contributions
 loser-0.075
 friend-0.074
in-0.074
 thing-0.069
 best-0.066
Token and
Token ID290
Feature activation+3.70e-02

Pos logit contributions
 magazines+0.108
keeping+0.106
icol+0.099
 envelop+0.098
 parcels+0.098
Neg logit contributions
 loser-0.046
in-0.044
 problem-0.042
 thing-0.041
 am-0.041
Token share
Token ID2648
Feature activation-3.32e-02

Pos logit contributions
 magazines+0.093
keeping+0.091
icol+0.084
 envelop+0.083
 computers+0.083
Neg logit contributions
 loser-0.042
in-0.041
 friend-0.040
 am-0.040
 thing-0.040
Token the
Token ID262
Feature activation+2.39e-03

Pos logit contributions
in+0.047
 thing+0.045
 loser+0.044
 friend+0.043
 frog+0.042
Neg logit contributions
keeping-0.076
 magazines-0.074
icol-0.072
 computers-0.067
 parcels-0.067
Token fish
Token ID5916
Feature activation-1.26e-01

Pos logit contributions
 magazines+0.005
keeping+0.005
 envelop+0.004
 parcels+0.004
icol+0.004
Neg logit contributions
 loser-0.004
in-0.004
 problem-0.003
 am-0.003
a-0.003
Token with
Token ID351
Feature activation-1.14e-02

Pos logit contributions
 loser+0.194
in+0.179
 problem+0.178
 am+0.171
 thing+0.165
Neg logit contributions
keeping-0.278
 magazines-0.274
 envelop-0.244
 parcels-0.241
 videos-0.238
Token his
Token ID465
Feature activation-5.74e-02

Pos logit contributions
 loser+0.018
in+0.018
 thing+0.016
 problem+0.016
 friend+0.016
Neg logit contributions
 magazines-0.024
keeping-0.024
icol-0.022
 envelop-0.022
 parcels-0.022
Token too
Token ID1165
Feature activation-1.30e-01

Pos logit contributions
in+0.036
 loser+0.035
 am+0.032
 thing+0.031
 problem+0.031
Neg logit contributions
 magazines-0.066
keeping-0.064
 computers-0.059
 envelop-0.059
 videos-0.059
Token quiet
Token ID5897
Feature activation-2.83e-03

Pos logit contributions
in+0.189
 loser+0.188
 am+0.186
umber+0.179
 winner+0.175
Neg logit contributions
 magazines-0.254
keeping-0.242
 computers-0.239
 desks-0.233
 videos-0.230
Token!
Token ID0
Feature activation+2.39e-02

Pos logit contributions
 loser+0.003
in+0.003
 problem+0.003
 thing+0.003
 am+0.003
Neg logit contributions
 magazines-0.007
keeping-0.007
icol-0.006
 parcels-0.006
 envelop-0.006
Token Dr
Token ID1583
Feature activation+1.34e-01

Pos logit contributions
 magazines+0.045
keeping+0.045
 envelop+0.040
icol+0.040
 parcels+0.039
Neg logit contributions
 loser-0.044
in-0.042
 thing-0.040
 problem-0.039
 am-0.039
Tokenums
Token ID5700
Feature activation-4.23e-02

Pos logit contributions
keeping+0.313
 magazines+0.303
 envelop+0.274
icol+0.271
 computers+0.270
Neg logit contributions
in-0.182
 loser-0.179
 problem-0.166
ching-0.159
 thing-0.157
Token are
Token ID389
Feature activation-2.79e-02

Pos logit contributions
 loser+0.065
in+0.062
Only+0.061
 winner+0.058
 problem+0.058
Neg logit contributions
 magazines-0.082
keeping-0.080
 envelop-0.072
bons-0.070
 videos-0.069
Token fun
Token ID1257
Feature activation+2.04e-02

Pos logit contributions
 loser+0.034
in+0.033
 problem+0.030
 thing+0.030
 am+0.030
Neg logit contributions
 magazines-0.069
keeping-0.069
icol-0.063
 envelop-0.063
 parcels-0.062
Token and
Token ID290
Feature activation+3.75e-02

Pos logit contributions
keeping+0.046
 magazines+0.046
 envelop+0.041
icol+0.041
 computers+0.041
Neg logit contributions
 loser-0.029
in-0.029
 am-0.026
 problem-0.026
 thing-0.026
Token loud
Token ID7812
Feature activation+4.87e-02

Pos logit contributions
 magazines+0.079
keeping+0.078
icol+0.071
 envelop+0.070
 computers+0.070
Neg logit contributions
 loser-0.059
in-0.058
 problem-0.053
 thing-0.053
 am-0.053
Token!"
Token ID2474
Feature activation+8.77e-02

Pos logit contributions
 magazines+0.117
keeping+0.117
 parcels+0.108
icol+0.108
 envelop+0.107
Neg logit contributions
 loser-0.062
in-0.060
 problem-0.057
 thing-0.057
 am-0.055
Token Tom
Token ID4186
Feature activation+2.02e-02

Pos logit contributions
 magazines+0.159
keeping+0.159
icol+0.140
 envelop+0.139
 parcels+0.138
Neg logit contributions
 loser-0.165
in-0.162
 problem-0.152
 thing-0.152
 am-0.151
Token every
Token ID790
Feature activation-1.01e-01

Pos logit contributions
 loser+0.014
in+0.014
 problem+0.013
 am+0.012
 thing+0.012
Neg logit contributions
 magazines-0.025
keeping-0.025
 envelop-0.023
icol-0.023
 computers-0.023
Token day
Token ID1110
Feature activation-7.54e-02

Pos logit contributions
 loser+0.118
in+0.108
 am+0.098
umber+0.097
a+0.093
Neg logit contributions
 magazines-0.253
keeping-0.246
 videos-0.230
 envelop-0.223
icol-0.222
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
 loser+0.067
in+0.067
 thing+0.060
 best+0.059
 friend+0.059
Neg logit contributions
 magazines-0.206
keeping-0.205
icol-0.194
 parcels-0.192
 envelop-0.189
Token Teddy
Token ID29345
Feature activation+8.32e-02

Pos logit contributions
 friend+0.016
 best+0.016
 am+0.016
in+0.016
 problem+0.015
Neg logit contributions
keeping-0.036
 magazines-0.031
 parcels-0.030
 desks-0.030
nam-0.030
Token was
Token ID373
Feature activation+1.20e-02

Pos logit contributions
 magazines+0.188
keeping+0.186
icol+0.170
 envelop+0.168
 computers+0.167
Neg logit contributions
 loser-0.120
in-0.114
 problem-0.106
 thing-0.105
 am-0.104
Token no
Token ID645
Feature activation-1.10e-01

Pos logit contributions
 magazines+0.027
keeping+0.027
icol+0.024
 envelop+0.024
 parcels+0.024
Neg logit contributions
 loser-0.017
in-0.017
 problem-0.016
 thing-0.016
 am-0.016
Token longer
Token ID2392
Feature activation-2.21e-02

Pos logit contributions
 am+0.191
in+0.185
 loser+0.181
'm+0.180
umber+0.178
Neg logit contributions
 magazines-0.180
 videos-0.168
 cartoons-0.167
 presents-0.155
 brand-0.154
Token lonely
Token ID21757
Feature activation+2.23e-02

Pos logit contributions
 loser+0.026
in+0.026
 problem+0.023
 thing+0.023
 am+0.023
Neg logit contributions
 magazines-0.055
keeping-0.055
icol-0.050
 envelop-0.050
 parcels-0.050
Token because
Token ID780
Feature activation-4.44e-02

Pos logit contributions
keeping+0.051
 magazines+0.051
 computers+0.044
icol+0.044
 envelop+0.044
Neg logit contributions
 loser-0.031
in-0.030
 am-0.028
 problem-0.028
a-0.027
Token he
Token ID339
Feature activation-9.01e-02

Pos logit contributions
 loser+0.053
in+0.051
 problem+0.047
 am+0.046
 thing+0.046
Neg logit contributions
 magazines-0.111
keeping-0.110
 envelop-0.100
icol-0.100
 computers-0.099
Token had
Token ID550
Feature activation+4.88e-02

Pos logit contributions
 loser+0.154
 problem+0.137
in+0.134
 thing+0.133
 winner+0.126
Neg logit contributions
 magazines-0.175
keeping-0.174
 computers-0.156
icol-0.154
 videos-0.153
Token a
Token ID257
Feature activation+3.15e-02

Pos logit contributions
in+0.009
 friend+0.009
 best+0.009
 thing+0.009
 loser+0.009
Neg logit contributions
keeping-0.015
icol-0.015
 magazines-0.015
 envelop-0.014
 desks-0.014
Token new
Token ID649
Feature activation+3.06e-02

Pos logit contributions
 magazines+0.063
 brand+0.062
icol+0.060
 envelop+0.058
 presents+0.058
Neg logit contributions
 loser-0.046
in-0.046
umber-0.043
 am-0.041
 neither-0.040
Token dinosaur
Token ID28257
Feature activation-5.47e-02

Pos logit contributions
 magazines+0.080
keeping+0.077
icol+0.074
 envelop+0.073
 computers+0.071
Neg logit contributions
in-0.032
 loser-0.031
umber-0.027
ching-0.026
 am-0.025
Token."
Token ID526
Feature activation+4.03e-02

Pos logit contributions
 loser+0.085
in+0.084
 problem+0.079
 am+0.078
 thing+0.076
Neg logit contributions
 magazines-0.114
keeping-0.114
 envelop-0.101
 parcels-0.099
 computers-0.099
Token<|endoftext|>
Token ID50256
Feature activation+1.17e-01

Pos logit contributions
 magazines+0.099
keeping+0.095
 parcels+0.091
 computers+0.090
 envelop+0.090
Neg logit contributions
'm-0.048
 friend-0.048
in-0.047
 am-0.047
 frog-0.045
TokenOnce
Token ID7454
Feature activation-9.01e-02

Pos logit contributions
 magazines+0.307
keeping+0.305
icol+0.282
 envelop+0.280
 parcels+0.279
Neg logit contributions
 loser-0.125
in-0.122
 am-0.108
 thing-0.108
 problem-0.108
Token upon
Token ID2402
Feature activation+8.85e-02

Pos logit contributions
 loser+0.179
 winner+0.160
ching+0.150
umber+0.149
 thing+0.148
Neg logit contributions
 magazines-0.152
 beds-0.130
 envelop-0.129
keeping-0.129
 brand-0.128
Token a
Token ID257
Feature activation+2.78e-02

Pos logit contributions
keeping+0.216
icol+0.208
 magazines+0.206
 parcels+0.200
 envelop+0.195
Neg logit contributions
in-0.117
 am-0.114
 friend-0.112
 loser-0.111
 thing-0.111
Token time
Token ID640
Feature activation-9.13e-02

Pos logit contributions
 magazines+0.066
icol+0.062
 envelop+0.059
keeping+0.059
 computers+0.058
Neg logit contributions
 loser-0.035
in-0.032
 thing-0.029
 winner-0.029
 am-0.028
Token,
Token ID11
Feature activation-1.81e-02

Pos logit contributions
 loser+0.140
in+0.130
 winner+0.122
umber+0.119
ching+0.118
Neg logit contributions
 magazines-0.183
keeping-0.170
icol-0.164
 beds-0.159
 computers-0.159
Token there
Token ID612
Feature activation-4.24e-02

Pos logit contributions
 loser+0.031
umber+0.028
 winner+0.028
in+0.028
 problem+0.028
Neg logit contributions
 magazines-0.032
nam-0.029
keeping-0.028
 envelop-0.027
 videos-0.027
Token
Token ID198
Feature activation-3.50e-02

Pos logit contributions
 magazines+0.086
keeping+0.085
icol+0.077
 envelop+0.077
 parcels+0.077
Neg logit contributions
 loser-0.049
in-0.048
 problem-0.043
 thing-0.043
 am-0.042
Token
Token ID198
Feature activation-3.52e-02

Pos logit contributions
 loser+0.051
in+0.046
 thing+0.045
 problem+0.045
 winner+0.044
Neg logit contributions
 magazines-0.080
keeping-0.078
 envelop-0.073
 parcels-0.071
 computers-0.070
TokenHar
Token ID13587
Feature activation+1.05e-01

Pos logit contributions
 loser+0.074
 frog+0.070
in+0.068
 thing+0.068
 problem+0.067
Neg logit contributions
 magazines-0.060
 envelop-0.052
keeping-0.052
 parcels-0.051
 computers-0.049
Tokenley
Token ID1636
Feature activation+2.48e-03

Pos logit contributions
keeping+0.208
 magazines+0.207
 envelop+0.184
icol+0.184
 parcels+0.182
Neg logit contributions
 loser-0.181
in-0.176
 problem-0.166
 thing-0.165
 am-0.164
Token didn
Token ID1422
Feature activation+5.59e-02

Pos logit contributions
 magazines+0.006
keeping+0.006
icol+0.006
 envelop+0.006
 parcels+0.006
Neg logit contributions
 loser-0.003
in-0.003
 problem-0.003
 thing-0.003
 am-0.003
Token't
Token ID470
Feature activation-1.30e-01

Pos logit contributions
 magazines+0.129
keeping+0.126
 videos+0.116
 computers+0.116
icol+0.113
Neg logit contributions
 loser-0.075
in-0.072
 thing-0.067
 problem-0.066
 friend-0.066
Token like
Token ID588
Feature activation-2.65e-02

Pos logit contributions
in+0.226
 problem+0.199
 frog+0.193
ig+0.192
 loser+0.191
Neg logit contributions
keeping-0.245
 magazines-0.226
 videos-0.214
 envelop-0.202
 parcels-0.194
Token this
Token ID428
Feature activation-4.13e-02

Pos logit contributions
 loser+0.046
in+0.045
 problem+0.042
 thing+0.042
 am+0.042
Neg logit contributions
 magazines-0.052
keeping-0.052
icol-0.046
 envelop-0.046
 parcels-0.045
Token answer
Token ID3280
Feature activation-1.42e-01

Pos logit contributions
 loser+0.072
in+0.066
a+0.060
 winner+0.060
umber+0.059
Neg logit contributions
 magazines-0.087
 envelop-0.074
keeping-0.074
 videos-0.073
 computers-0.072
Token.
Token ID13
Feature activation-1.73e-02

Pos logit contributions
 loser+0.155
in+0.150
 problem+0.133
 am+0.131
 thing+0.130
Neg logit contributions
 magazines-0.369
keeping-0.365
 envelop-0.335
icol-0.333
 computers-0.333
Token He
Token ID679
Feature activation-2.48e-02

Pos logit contributions
 loser+0.031
in+0.030
 problem+0.029
 thing+0.028
 am+0.028
Neg logit contributions
 magazines-0.033
keeping-0.033
icol-0.029
 envelop-0.029
 parcels-0.029
Token frame
Token ID5739
Feature activation+9.50e-02

Pos logit contributions
 magazines+0.004
keeping+0.004
 envelop+0.004
icol+0.004
 parcels+0.004
Neg logit contributions
 loser-0.003
in-0.003
 problem-0.003
 thing-0.003
 am-0.003
Token,
Token ID11
Feature activation-2.07e-02

Pos logit contributions
 magazines+0.211
keeping+0.207
 envelop+0.192
 computers+0.188
bons+0.185
Neg logit contributions
 loser-0.139
in-0.131
 am-0.116
a-0.114
 problem-0.113
Token the
Token ID262
Feature activation+2.34e-03

Pos logit contributions
 loser+0.031
in+0.030
 problem+0.028
 am+0.027
 thing+0.027
Neg logit contributions
 magazines-0.046
keeping-0.045
 envelop-0.041
icol-0.041
 parcels-0.040
Token screw
Token ID9580
Feature activation-5.95e-02

Pos logit contributions
keeping+0.005
 magazines+0.005
icol+0.005
 parcels+0.005
 envelop+0.004
Neg logit contributions
 loser-0.004
in-0.003
 problem-0.003
 thing-0.003
 friend-0.003
Token wouldn
Token ID3636
Feature activation+6.24e-03

Pos logit contributions
 loser+0.087
in+0.084
 problem+0.076
 am+0.076
 thing+0.074
Neg logit contributions
keeping-0.129
 magazines-0.129
 envelop-0.117
 computers-0.116
 parcels-0.114
Token't
Token ID470
Feature activation-1.31e-01

Pos logit contributions
keeping+0.013
 magazines+0.013
 videos+0.012
icol+0.012
 computers+0.012
Neg logit contributions
 loser-0.009
in-0.009
 problem-0.008
 thing-0.008
 friend-0.008
Token go
Token ID467
Feature activation-6.53e-02

Pos logit contributions
in+0.191
 loser+0.188
a+0.166
 thing+0.159
umber+0.158
Neg logit contributions
 magazines-0.276
keeping-0.270
 envelop-0.255
 computers-0.251
 videos-0.250
Token in
Token ID287
Feature activation-1.52e-02

Pos logit contributions
 loser+0.076
in+0.074
 problem+0.068
 thing+0.067
 am+0.065
Neg logit contributions
 magazines-0.165
keeping-0.164
icol-0.151
 envelop-0.150
 parcels-0.150
Token the
Token ID262
Feature activation+1.64e-03

Pos logit contributions
 loser+0.021
in+0.020
 am+0.018
 problem+0.018
a+0.018
Neg logit contributions
keeping-0.035
 magazines-0.035
 envelop-0.032
 computers-0.031
icol-0.031
Token wall
Token ID3355
Feature activation+6.47e-02

Pos logit contributions
keeping+0.004
 magazines+0.004
icol+0.003
 parcels+0.003
 computers+0.003
Neg logit contributions
 loser-0.002
 problem-0.002
in-0.002
 thing-0.002
 best-0.002
Token.
Token ID13
Feature activation-1.82e-02

Pos logit contributions
keeping+0.150
 magazines+0.150
 envelop+0.136
 computers+0.135
icol+0.134
Neg logit contributions
 loser-0.086
in-0.082
 am-0.075
 problem-0.075
 thing-0.074
Token named
Token ID3706
Feature activation+1.65e-01

Pos logit contributions
 loser+0.140
in+0.137
 thing+0.131
 friend+0.130
 problem+0.126
Neg logit contributions
keeping-0.247
 magazines-0.238
icol-0.223
 parcels-0.222
 envelop-0.218
Token Stretch
Token ID44489
Feature activation+7.42e-03

Pos logit contributions
keeping+0.355
 magazines+0.329
 parcels+0.298
 counters+0.293
 desks+0.290
Neg logit contributions
 loser-0.269
 friend-0.264
a-0.261
 thing-0.259
in-0.254
Tokeny
Token ID88
Feature activation-1.19e-01

Pos logit contributions
 magazines+0.017
keeping+0.017
 envelop+0.016
 computers+0.015
 parcels+0.015
Neg logit contributions
 loser-0.010
 problem-0.009
in-0.009
 winner-0.009
 am-0.008
Token.
Token ID13
Feature activation-9.67e-03

Pos logit contributions
in+0.128
 loser+0.126
 thing+0.116
 friend+0.116
 problem+0.109
Neg logit contributions
keeping-0.316
 magazines-0.311
icol-0.288
 parcels-0.287
 envelop-0.286
Token Stretch
Token ID44489
Feature activation+7.36e-03

Pos logit contributions
 loser+0.017
 thing+0.016
in+0.016
 am+0.016
 problem+0.016
Neg logit contributions
keeping-0.019
 magazines-0.018
icol-0.016
 envelop-0.016
 computers-0.016
Tokeny
Token ID88
Feature activation-1.20e-01

Pos logit contributions
 magazines+0.017
keeping+0.017
 envelop+0.015
icol+0.015
 parcels+0.015
Neg logit contributions
 loser-0.010
in-0.009
 problem-0.009
 thing-0.009
 am-0.009
Token was
Token ID373
Feature activation+1.77e-02

Pos logit contributions
 loser+0.155
in+0.155
 am+0.145
 thing+0.144
 friend+0.143
Neg logit contributions
keeping-0.285
 magazines-0.284
icol-0.262
 envelop-0.260
 parcels-0.258
Token made
Token ID925
Feature activation-6.22e-02

Pos logit contributions
keeping+0.048
 magazines+0.046
icol+0.044
 parcels+0.042
 envelop+0.042
Neg logit contributions
 best-0.021
 loser-0.020
 friend-0.019
 thing-0.018
in-0.018
Token of
Token ID286
Feature activation+4.05e-03

Pos logit contributions
 thing+0.066
 friend+0.066
in+0.064
 loser+0.060
 best+0.060
Neg logit contributions
keeping-0.170
 magazines-0.164
icol-0.163
nam-0.150
 envelop-0.149
Token glue
Token ID22749
Feature activation-2.25e-04

Pos logit contributions
keeping+0.009
 magazines+0.008
icol+0.008
 parcels+0.007
 computers+0.007
Neg logit contributions
 loser-0.006
 thing-0.006
 problem-0.006
 friend-0.006
 best-0.006
Token and
Token ID290
Feature activation+4.26e-02

Pos logit contributions
 loser+0.000
in+0.000
 am+0.000
 problem+0.000
 thing+0.000
Neg logit contributions
 magazines-0.001
keeping-0.001
 envelop-0.000
 computers-0.000
 parcels-0.000