TOP ACTIVATIONS
MAX = 15.392

from the tank . Hel io was thankful and thanked T
. The st eg osaurus came out of the
D ott ie . D ott ie wanted to visit a
! The chip m unk was excited and
in a river . The hipp o was very happy and
". A less io bl ushed and said ,
." The prince went to the battlefield with
named Ab bie . Ab bie was only three years old
white hipp o . The hipp o was so cute and
named Pe anut . Pe anut lived in an airport .
as Hel io . Hel io explained that their oxygen was
. The hipp o found some vegetables and
was so attractive . The hipp o was excited to meet
you ." Mar iah was excited . What could
One day Aless io was walking down the street
he wanted . <|endoftext|> Ak io was a 3 year old
named Ab bie . Ab bie liked counting things . One
. The chip m unk told the boy
big hipp o . The hipp o lived near a river
The end . <|endoftext|> D ennis was feeling very lonely .

MIN ACTIVATIONS
MIN = -10.540

and prepare , you would fail the test . Next time
beauty around her without always needing to touch it . <|endoftext|>
helpless when it came to wanting toys . So , she
, but his mustache kept getting in the way and preventing
the mud or you 'll ruin your dress ."
you have instead of always wanting more . The end .
she dared to try to reach it . She tried and
that it would have to delay . Billy was
to help the people by stopping them from destroying the stadium
sugar if she helped him reach the highest branch of the
need a little help to reach tall things . <|endoftext|> Once
you have instead of always wanting more . From that day
message and hopes it will reach someone special out in the
was so happy to have figured out this mighty trick and
, not from having and keeping . Why don 't you
milk ? I think you deserve a treat ."
The fish said they would complete the mission of helping Fred
. She promised to never delay going to school again and
to you instead of always wanting something else .
We are going to reach the beach and have so

INTERVAL 8.909 - 15.392
CONTAINS 3.838%

my !" Mom my smiled back . "
. It is dangerous ." Sue says . But
folded his hands together . Mama closed her eyes and said
asers to sell . The man felt more and more guilty
tough and scary . The dogs got very scared and stopped

INTERVAL 2.426 - 8.909
CONTAINS 19.632%

!" Lily said . " The t eddy bear is nice
It was a mushroom ! It was small ,
they saw some big red trucks coming . They had s
that day on , the bright king became even more famous
and waved at John as he left . John

INTERVAL -4.057 - 2.426
CONTAINS 62.556%

. It was too lovely not to pick up !"
the soap , he gave him a big hug . From
to have had such an adventure . <|endoftext|> Once there was
and came to Lily 's room . She saw the broken
. <|endoftext|> Once upon a time , there was a nice

INTERVAL -10.540 - -4.057
CONTAINS 13.974%

be generous too . She takes the book off her image
She asked him why he wanted to send the tuna away
friends who liked to play near the sea . They saw
learned that it 's not about how tall you are ,
young girl left the mine with the special thing . They
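
NOTE ON INTERVALS
The four INTERVAL ranges above are consistent with splitting the span from MIN (-10.540) to MAX (15.392) into four equal-width bins, and the CONTAINS percentages sum to 100. The sketch below only re-derives those edges from the printed values as a sanity check. The function name and the choice of four bins are assumptions read off this listing, not a description of how the dashboard itself computes them.

```python
def equal_width_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return n_bins + 1 equally spaced edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


# MIN and MAX as printed in the listing; 4 bins is an assumption
# inferred from the four INTERVAL blocks.
edges = equal_width_edges(-10.540, 15.392, 4)

# CONTAINS percentages as printed, ordered from the lowest interval to the highest.
shares = [13.974, 62.556, 19.632, 3.838]
total_share = sum(shares)

for low, high, share in zip(edges[:-1], edges[1:], shares):
    print(f"{low:8.3f} to {high:8.3f}: {share:6.3f}%")
```

If the printed edges and the recomputed ones ever disagree, the intervals were probably not built with equal widths and the assumption above should be dropped.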
Token from
Token ID422
Feature activation-4.35e+00

Pos logit contributions
oise+0.077
anda+0.075
mun+0.073
ny+0.071
ll+0.070
Neg logit contributions
 its-0.070
 yours-0.069
 our-0.068
 such-0.065
 your-0.064
Token the
Token ID262
Feature activation+1.35e+00

Pos logit contributions
 yours+0.045
 our+0.041
 ourselves+0.038
 its+0.038
 yourselves+0.038
Neg logit contributions
oise-0.109
my-0.106
ny-0.104
mo-0.101
y-0.101
Token tank
Token ID6873
Feature activation+4.29e+00

Pos logit contributions
oise+0.026
ny+0.025
my+0.024
's+0.024
anda+0.024
Neg logit contributions
 its-0.024
 theirs-0.024
 yours-0.023
 itself-0.022
 our-0.022
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
oise+0.090
anda+0.088
mun+0.086
ny+0.082
ony+0.081
Neg logit contributions
 its-0.075
 yours-0.072
 our-0.071
 such-0.069
 itself-0.069
Token Hel
Token ID5053
Feature activation+4.82e+00

Pos logit contributions
 yours+0.003
 its+0.003
 our+0.003
 such+0.003
 theirs+0.003
Neg logit contributions
oise-0.003
anda-0.003
mun-0.003
ny-0.003
my-0.003
Tokenio
Token ID952
Feature activation+1.54e+01

Pos logit contributions
oise+0.066
mun+0.066
anda+0.060
ll+0.059
my+0.059
Neg logit contributions
 yours-0.117
 its-0.116
 our-0.113
 itself-0.112
 theirs-0.111
Token was
Token ID373
Feature activation-1.34e+00

Pos logit contributions
oise+0.247
mun+0.247
 salon+0.214
pit+0.209
anda+0.205
Neg logit contributions
 its-0.345
 itself-0.319
 our-0.311
 an-0.294
 such-0.293
Token thankful
Token ID25535
Feature activation-3.29e+00

Pos logit contributions
 yours+0.025
 our+0.024
 its+0.023
 electricity+0.023
 theirs+0.023
Neg logit contributions
oise-0.025
anda-0.025
a-0.024
my-0.024
mun-0.023
Token and
Token ID290
Feature activation-5.65e-01

Pos logit contributions
 yours+0.057
 its+0.054
 our+0.054
 theirs+0.052
 yourselves+0.052
Neg logit contributions
anda-0.065
oise-0.065
ll-0.064
my-0.063
ny-0.062
Token thanked
Token ID26280
Feature activation-4.60e+00

Pos logit contributions
 yours+0.013
 its+0.013
 our+0.012
 such+0.012
 theirs+0.012
Neg logit contributions
oise-0.009
anda-0.008
mun-0.008
ny-0.008
ll-0.008
Token T
Token ID309
Feature activation+6.45e+00

Pos logit contributions
 yours+0.067
 our+0.061
 its+0.058
 yourselves+0.058
 theirs+0.057
Neg logit contributions
oise-0.103
my-0.102
anda-0.100
mun-0.100
ony-0.098
Token.
Token ID13
Feature activation-7.02e-01

Pos logit contributions
oise+0.018
anda+0.018
mun+0.017
ny+0.017
ony+0.017
Neg logit contributions
 its-0.015
 yours-0.015
 our-0.015
 such-0.014
 your-0.014
Token "\n"
Token ID198
Feature activation-5.08e-01

Pos logit contributions
 yours+0.016
 electricity+0.015
 our+0.015
 such+0.015
 theirs+0.015
Neg logit contributions
ny-0.009
's-0.009
anda-0.009
my-0.009
a-0.009
Token "\n"
Token ID198
Feature activation-1.04e-01

Pos logit contributions
 its+0.009
 yours+0.009
 our+0.009
 such+0.008
 your+0.008
Neg logit contributions
oise-0.010
anda-0.010
mun-0.010
ny-0.010
ll-0.010
TokenThe
Token ID464
Feature activation+4.12e+00

Pos logit contributions
 its+0.002
 our+0.002
 yours+0.002
 your+0.002
 itself+0.002
Neg logit contributions
oise-0.002
anda-0.002
mun-0.002
ony-0.002
ny-0.002
Token st
Token ID336
Feature activation+7.07e+00

Pos logit contributions
oise+0.083
anda+0.079
mun+0.077
ll+0.075
ony+0.075
Neg logit contributions
 yours-0.075
 its-0.075
 our-0.075
 your-0.071
 such-0.071
Tokeneg
Token ID1533
Feature activation+1.54e+01

Pos logit contributions
oise+0.099
mun+0.094
anda+0.090
ll+0.085
ny+0.084
Neg logit contributions
 its-0.170
 yours-0.168
 our-0.166
 itself-0.162
 such-0.160
Tokenosaurus
Token ID47650
Feature activation+1.19e+01

Pos logit contributions
mun+0.291
oise+0.286
ited+0.279
sie+0.266
bi+0.263
Neg logit contributions
 its-0.236
 our-0.236
 such-0.231
 your-0.228
 my-0.228
Token came
Token ID1625
Feature activation-1.92e+00

Pos logit contributions
oise+0.214
mun+0.185
anda+0.176
ony+0.169
mo+0.168
Neg logit contributions
 its-0.261
 our-0.244
 their-0.229
 itself-0.229
 your-0.229
Token out
Token ID503
Feature activation-1.98e+00

Pos logit contributions
 its+0.031
 our+0.029
 yours+0.028
 itself+0.027
 such+0.027
Neg logit contributions
oise-0.044
anda-0.043
mun-0.043
ony-0.041
ll-0.040
Token of
Token ID286
Feature activation-5.82e+00

Pos logit contributions
 yours+0.048
 our+0.044
 its+0.044
 theirs+0.043
 yourselves+0.043
Neg logit contributions
oise-0.028
y-0.027
anda-0.027
ny-0.026
my-0.026
Token the
Token ID262
Feature activation+2.40e+00

Pos logit contributions
 yours+0.076
 ourselves+0.069
 disl+0.068
 yourselves+0.063
 awesome+0.061
Neg logit contributions
my-0.134
oise-0.125
y-0.125
ny-0.122
 aisle-0.119
Token D
Token ID360
Feature activation+8.02e+00

Pos logit contributions
 yourselves+0.025
 its+0.025
 yours+0.025
 itself+0.025
 electricity+0.024
Neg logit contributions
ny-0.029
my-0.029
's-0.028
ches-0.027
ppy-0.027
Tokenott
Token ID1252
Feature activation+1.41e+01

Pos logit contributions
oise+0.129
mun+0.127
anda+0.124
mo+0.119
cases+0.117
Neg logit contributions
 yours-0.169
 its-0.164
 our-0.159
 your-0.158
 itself-0.155
Tokenie
Token ID494
Feature activation+8.97e+00

Pos logit contributions
mun+0.136
cases+0.113
ird+0.110
oise+0.107
m+0.101
Neg logit contributions
 its-0.398
 yours-0.396
 itself-0.385
 theirs-0.377
 your-0.367
Token.
Token ID13
Feature activation+5.90e-01

Pos logit contributions
oise+0.192
anda+0.185
mun+0.184
ll+0.177
ony+0.175
Neg logit contributions
 its-0.155
 yours-0.147
 our-0.145
 your-0.139
 itself-0.138
Token D
Token ID360
Feature activation+9.03e+00

Pos logit contributions
ny+0.012
anda+0.011
my+0.011
y+0.011
oise+0.011
Neg logit contributions
 yours-0.011
 our-0.010
 electricity-0.010
 yourselves-0.010
 its-0.010
Tokenott
Token ID1252
Feature activation+1.51e+01

Pos logit contributions
oise+0.132
mun+0.130
anda+0.118
mo+0.114
ches+0.114
Neg logit contributions
 its-0.208
 yours-0.208
 itself-0.203
 your-0.200
 such-0.200
Tokenie
Token ID494
Feature activation+1.04e+01

Pos logit contributions
mun+0.172
ird+0.147
cases+0.141
oise+0.138
 salon+0.128
Neg logit contributions
 its-0.408
 itself-0.403
 yours-0.401
 your-0.376
 theirs-0.372
Token wanted
Token ID2227
Feature activation-4.95e+00

Pos logit contributions
oise+0.221
mun+0.204
pit+0.198
anda+0.193
ll+0.192
Neg logit contributions
 its-0.206
 your-0.187
 our-0.185
 an-0.169
 their-0.169
Token to
Token ID284
Feature activation-3.95e+00

Pos logit contributions
 yours+0.084
 yourselves+0.083
 ourselves+0.080
 itself+0.077
 our+0.076
Neg logit contributions
's-0.102
my-0.097
ny-0.092
m-0.088
mun-0.087
Token visit
Token ID3187
Feature activation-4.71e+00

Pos logit contributions
 yours+0.074
 its+0.074
 our+0.072
 theirs+0.069
 itself+0.069
Neg logit contributions
oise-0.073
anda-0.071
mun-0.069
ny-0.069
my-0.067
Token a
Token ID257
Feature activation+1.27e+00

Pos logit contributions
 electricity+0.061
 yourselves+0.057
 yours+0.055
 theirs+0.055
 ourselves+0.051
Neg logit contributions
my-0.115
's-0.109
ie-0.108
ches-0.102
oise-0.097
Token!
Token ID0
Feature activation-5.07e-01

Pos logit contributions
oise+0.024
anda+0.024
mun+0.023
ony+0.023
mo+0.022
Neg logit contributions
 its-0.018
 yours-0.018
 our-0.018
 such-0.017
 your-0.017
Token " "
Token ID220
Feature activation-1.16e+00

Pos logit contributions
 its+0.007
 our+0.006
 yours+0.006
 such+0.006
 your+0.006
Neg logit contributions
oise-0.013
anda-0.013
mun-0.012
ll-0.012
ppy-0.012
Token "\n"
Token ID198
Feature activation-9.61e-01

Pos logit contributions
 its+0.020
 yours+0.020
 our+0.019
 such+0.019
 your+0.018
Neg logit contributions
oise-0.024
anda-0.023
mun-0.023
ny-0.022
ll-0.022
Token "\n"
Token ID198
Feature activation-8.44e-01

Pos logit contributions
 its+0.014
 our+0.013
 such+0.012
 your+0.012
 itself+0.012
Neg logit contributions
oise-0.024
anda-0.023
mo-0.022
mun-0.021
ony-0.021
TokenThe
Token ID464
Feature activation+3.83e+00

Pos logit contributions
 its+0.012
 our+0.011
 yours+0.011
 such+0.011
 your+0.010
Neg logit contributions
anda-0.021
oise-0.021
ony-0.020
mun-0.020
mo-0.020
Token chip
Token ID11594
Feature activation+1.50e+01

Pos logit contributions
oise+0.080
anda+0.076
ppy+0.075
ches+0.074
ll+0.073
Neg logit contributions
 its-0.065
 our-0.065
 yours-0.062
 such-0.062
 your-0.062
Tokenm
Token ID76
Feature activation+5.31e+00

Pos logit contributions
cases+0.183
anda+0.181
jo+0.178
ches+0.169
azz+0.169
Neg logit contributions
 its-0.342
 the-0.338
 such-0.332
 our-0.324
 your-0.323
Tokenunk
Token ID2954
Feature activation+1.01e+01

Pos logit contributions
oise+0.100
mun+0.099
my+0.087
mie+0.087
ny+0.086
Neg logit contributions
 its-0.100
 itself-0.095
 yours-0.093
 our-0.092
 electricity-0.091
Token was
Token ID373
Feature activation-1.90e+00

Pos logit contributions
mun+0.156
oise+0.156
anda+0.143
ird+0.133
suits+0.131
Neg logit contributions
 its-0.241
 itself-0.214
 such-0.214
 our-0.210
 the-0.204
Token excited
Token ID6568
Feature activation-2.36e+00

Pos logit contributions
 yours+0.035
 electricity+0.034
 our+0.033
 its+0.033
 yourselves+0.033
Neg logit contributions
oise-0.035
anda-0.034
my-0.033
a-0.033
ny-0.033
Token and
Token ID290
Feature activation+2.32e-01

Pos logit contributions
 its+0.049
 yours+0.049
 our+0.048
 such+0.045
 your+0.045
Neg logit contributions
oise-0.041
anda-0.040
mun-0.038
ny-0.038
ll-0.037
Token in
Token ID287
Feature activation-1.61e+00

Pos logit contributions
 yours+0.035
 its+0.034
 our+0.034
 electricity+0.033
 theirs+0.033
Neg logit contributions
oise-0.032
anda-0.030
mun-0.030
ll-0.030
a-0.029
Token a
Token ID257
Feature activation+2.00e+00

Pos logit contributions
 yours+0.026
 theirs+0.023
 yourselves+0.021
 ours+0.021
 electricity+0.020
Neg logit contributions
's-0.032
my-0.030
ny-0.029
a-0.028
c-0.027
Token river
Token ID7850
Feature activation+4.65e+00

Pos logit contributions
oise+0.046
anda+0.045
mun+0.043
ny+0.043
my+0.042
Neg logit contributions
 yours-0.030
 its-0.030
 our-0.028
 such-0.028
 theirs-0.028
Token.
Token ID13
Feature activation+4.95e-01

Pos logit contributions
oise+0.099
anda+0.096
mun+0.093
ll+0.091
ony+0.091
Neg logit contributions
 its-0.079
 yours-0.076
 our-0.075
 your-0.071
 such-0.071
Token The
Token ID383
Feature activation+4.13e+00

Pos logit contributions
ny+0.010
oise+0.010
anda+0.010
my+0.010
mun+0.010
Neg logit contributions
 yours-0.008
 our-0.008
 its-0.008
 ourselves-0.008
 such-0.007
Token hipp
Token ID18568
Feature activation+1.50e+01

Pos logit contributions
oise+0.075
anda+0.073
mun+0.070
ny+0.070
my+0.068
Neg logit contributions
 its-0.082
 yours-0.082
 our-0.079
 such-0.077
 itself-0.076
Tokeno
Token ID78
Feature activation+1.13e+01

Pos logit contributions
ches+0.210
ppy+0.205
ede+0.201
ird+0.200
 salon+0.196
Neg logit contributions
 its-0.325
 our-0.320
 such-0.297
 your-0.292
 their-0.291
Token was
Token ID373
Feature activation-9.32e-01

Pos logit contributions
oise+0.178
mun+0.172
ird+0.160
ony+0.160
anda+0.160
Neg logit contributions
 its-0.251
 our-0.246
 itself-0.228
 your-0.227
 an-0.219
Token very
Token ID845
Feature activation-1.07e+00

Pos logit contributions
 ourselves+0.022
 itself+0.022
 yours+0.021
 yourselves+0.021
 electricity+0.021
Neg logit contributions
my-0.012
ny-0.012
a-0.011
anda-0.011
's-0.011
Token happy
Token ID3772
Feature activation-1.06e+00

Pos logit contributions
 yours+0.019
 itself+0.019
 such+0.019
 its+0.018
 electricity+0.018
Neg logit contributions
anda-0.022
ny-0.021
ll-0.020
oise-0.020
a-0.020
Token and
Token ID290
Feature activation+7.73e-01

Pos logit contributions
 yours+0.025
 our+0.024
 theirs+0.024
 electricity+0.024
 its+0.023
Neg logit contributions
y-0.014
oise-0.014
my-0.014
ll-0.014
anda-0.013
Token".
Token ID1911
Feature activation-5.18e-01

Pos logit contributions
oise+0.108
anda+0.105
mun+0.102
ny+0.100
ll+0.099
Neg logit contributions
 its-0.079
 yours-0.078
 our-0.076
 such-0.073
 your-0.072
Token "\n"
Token ID198
Feature activation-6.67e-01

Pos logit contributions
 its+0.008
 such+0.008
 our+0.008
 itself+0.007
 yours+0.007
Neg logit contributions
oise-0.012
anda-0.012
mun-0.011
ny-0.011
ony-0.011
Token "\n"
Token ID198
Feature activation-5.16e-01

Pos logit contributions
 its+0.010
 our+0.009
 yours+0.009
 itself+0.009
 your+0.009
Neg logit contributions
oise-0.016
anda-0.015
mun-0.015
ny-0.015
y-0.014
TokenA
Token ID32
Feature activation+1.05e+00

Pos logit contributions
 its+0.008
 our+0.008
 itself+0.007
 such+0.007
 your+0.007
Neg logit contributions
anda-0.012
oise-0.012
mun-0.011
ny-0.011
mo-0.011
Tokenless
Token ID1203
Feature activation+1.03e+00

Pos logit contributions
oise+0.022
anda+0.021
mun+0.021
ll+0.020
ony+0.020
Neg logit contributions
 its-0.018
 our-0.017
 yours-0.016
 such-0.016
 your-0.016
Tokenio
Token ID952
Feature activation+1.50e+01

Pos logit contributions
oise+0.016
anda+0.015
ony+0.015
ches+0.015
ppy+0.014
Neg logit contributions
 its-0.024
 such-0.022
 itself-0.022
 our-0.022
 yours-0.022
Token bl
Token ID698
Feature activation+7.79e+00

Pos logit contributions
oise+0.269
mun+0.258
sc+0.246
suits+0.234
izzy+0.234
Neg logit contributions
 its-0.305
 the-0.265
 our-0.257
 that-0.257
 their-0.252
Tokenushed
Token ID7474
Feature activation-1.03e+00

Pos logit contributions
mun+0.070
oise+0.068
ny+0.058
my+0.058
mo+0.053
Neg logit contributions
 its-0.219
 yours-0.212
 our-0.207
 your-0.205
 itself-0.204
Token and
Token ID290
Feature activation-5.41e-01

Pos logit contributions
 yours+0.023
 our+0.023
 its+0.023
 your+0.022
 theirs+0.021
Neg logit contributions
oise-0.016
anda-0.015
ny-0.015
mun-0.015
my-0.015
Token said
Token ID531
Feature activation-1.69e+00

Pos logit contributions
 its+0.013
 yours+0.013
 our+0.013
 such+0.013
 your+0.013
Neg logit contributions
oise-0.007
anda-0.007
mun-0.007
ny-0.006
ll-0.006
Token,
Token ID11
Feature activation-9.39e-01

Pos logit contributions
 yours+0.041
 yourselves+0.040
 electricity+0.040
 theirs+0.039
 ours+0.039
Neg logit contributions
anda-0.022
a-0.021
y-0.021
's-0.020
ny-0.020
Token."
Token ID526
Feature activation+1.53e+00

Pos logit contributions
 its+0.010
 yours+0.010
 our+0.010
 such+0.009
 itself+0.009
Neg logit contributions
oise-0.018
anda-0.017
mun-0.017
ny-0.017
ll-0.016
Token " "
Token ID220
Feature activation+8.22e-01

Pos logit contributions
oise+0.037
anda+0.036
a+0.034
mo+0.034
ony+0.033
Neg logit contributions
 its-0.024
 our-0.022
 such-0.022
 your-0.021
 itself-0.021
Token "\n"
Token ID198
Feature activation+4.09e-02

Pos logit contributions
oise+0.018
anda+0.018
mun+0.017
ny+0.017
ony+0.017
Neg logit contributions
 its-0.013
 yours-0.013
 our-0.013
 itself-0.012
 your-0.012
Token "\n"
Token ID198
Feature activation-9.94e-02

Pos logit contributions
oise+0.001
anda+0.001
mo+0.001
mun+0.001
ony+0.001
Neg logit contributions
 its-0.001
 our-0.001
 yours-0.001
 your-0.001
 such-0.001
TokenThe
Token ID464
Feature activation+3.74e+00

Pos logit contributions
 its+0.002
 our+0.002
 your+0.002
 yours+0.002
 such+0.002
Neg logit contributions
anda-0.002
oise-0.002
mo-0.002
ony-0.002
ppy-0.002
Token prince
Token ID19716
Feature activation+1.50e+01

Pos logit contributions
oise+0.070
anda+0.068
mun+0.065
ony+0.063
ll+0.063
Neg logit contributions
 yours-0.074
 its-0.074
 our-0.073
 your-0.069
 such-0.068
Token went
Token ID1816
Feature activation-2.15e+00

Pos logit contributions
anda+0.273
mun+0.273
oise+0.271
ears+0.256
itty+0.256
Neg logit contributions
 its-0.300
 the-0.265
 an-0.263
 itself-0.255
 our-0.252
Token to
Token ID284
Feature activation-4.25e+00

Pos logit contributions
 its+0.025
 yours+0.024
 our+0.023
 your+0.022
 such+0.022
Neg logit contributions
oise-0.058
anda-0.057
mun-0.056
ony-0.054
ny-0.054
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
 yours+0.046
 electricity+0.043
 itself+0.041
 yourselves+0.041
 ourselves+0.039
Neg logit contributions
my-0.103
oise-0.103
mun-0.101
's-0.099
ny-0.099
Token battlefield
Token ID13480
Feature activation+2.45e+00

Pos logit contributions
oise+0.020
ny+0.019
anda+0.019
mun+0.019
's+0.019
Neg logit contributions
 its-0.024
 yours-0.023
 itself-0.022
 theirs-0.022
 our-0.022
Token with
Token ID351
Feature activation-3.41e+00

Pos logit contributions
oise+0.047
anda+0.046
mun+0.044
ony+0.042
ll+0.042
Neg logit contributions
 its-0.048
 yours-0.046
 our-0.045
 itself-0.044
 such-0.044
Token named
Token ID3706
Feature activation-7.57e-01

Pos logit contributions
oise+0.141
anda+0.136
mun+0.134
ll+0.130
ny+0.127
Neg logit contributions
 its-0.206
 our-0.197
 yours-0.196
 your-0.193
 such-0.186
Token Ab
Token ID2275
Feature activation+1.25e+01

Pos logit contributions
 its+0.010
 yours+0.010
 our+0.009
 such+0.009
 itself+0.009
Neg logit contributions
oise-0.019
anda-0.019
mun-0.018
ny-0.018
my-0.018
Tokenbie
Token ID12590
Feature activation+1.33e+01

Pos logit contributions
cases+0.132
mun+0.126
case+0.109
board+0.109
ny+0.107
Neg logit contributions
 yours-0.325
 its-0.323
 our-0.322
 your-0.321
 such-0.303
Token.
Token ID13
Feature activation+8.33e-01

Pos logit contributions
oise+0.337
mun+0.325
anda+0.313
cases+0.307
ny+0.306
Neg logit contributions
 its-0.189
 our-0.167
 your-0.161
 yours-0.160
 their-0.151
Token Ab
Token ID2275
Feature activation+1.21e+01

Pos logit contributions
oise+0.015
anda+0.015
my+0.014
mun+0.014
ny+0.014
Neg logit contributions
 yours-0.016
 our-0.016
 its-0.016
 your-0.015
 such-0.015
Tokenbie
Token ID12590
Feature activation+1.49e+01

Pos logit contributions
mun+0.136
cases+0.127
ny+0.121
oise+0.115
case+0.109
Neg logit contributions
 yours-0.307
 its-0.304
 your-0.304
 our-0.301
 such-0.291
Token was
Token ID373
Feature activation+1.97e-01

Pos logit contributions
oise+0.332
mun+0.306
pit+0.294
cases+0.290
ata+0.286
Neg logit contributions
 its-0.265
 our-0.243
 your-0.242
 the-0.225
 their-0.224
Token only
Token ID691
Feature activation+1.36e+00

Pos logit contributions
oise+0.004
anda+0.004
mun+0.004
my+0.004
ny+0.004
Neg logit contributions
 yours-0.003
 its-0.003
 itself-0.003
 our-0.003
 theirs-0.003
Token three
Token ID1115
Feature activation+3.56e+00

Pos logit contributions
oise+0.033
anda+0.032
mun+0.031
ll+0.031
ny+0.031
Neg logit contributions
 its-0.019
 yours-0.018
 our-0.018
 such-0.018
 your-0.017
Token years
Token ID812
Feature activation-5.17e-01

Pos logit contributions
oise+0.082
ll+0.077
ny+0.076
zag+0.073
ppy+0.073
Neg logit contributions
 its-0.058
 our-0.055
 your-0.053
 such-0.052
 their-0.049
Token old
Token ID1468
Feature activation+3.75e+00

Pos logit contributions
 its+0.007
 yours+0.007
 our+0.006
 such+0.006
 itself+0.006
Neg logit contributions
anda-0.013
oise-0.013
ny-0.012
ony-0.012
my-0.012
Token white
Token ID2330
Feature activation+5.40e+00

Pos logit contributions
oise+0.073
ony+0.070
ll+0.069
mun+0.069
anda+0.069
Neg logit contributions
 its-0.053
 our-0.053
 your-0.050
 yours-0.048
 my-0.045
Token hipp
Token ID18568
Feature activation+1.22e+01

Pos logit contributions
oise+0.100
ll+0.098
mun+0.097
anda+0.089
ony+0.089
Neg logit contributions
 its-0.111
 our-0.100
 yours-0.100
 your-0.099
 itself-0.096
Tokeno
Token ID78
Feature activation+9.47e+00

Pos logit contributions
oise+0.125
ches+0.124
cases+0.121
mun+0.121
mie+0.119
Neg logit contributions
 its-0.330
 our-0.302
 such-0.301
 your-0.298
 yours-0.296
Token.
Token ID13
Feature activation+8.40e-01

Pos logit contributions
oise+0.174
anda+0.170
mun+0.168
ll+0.163
ny+0.160
Neg logit contributions
 its-0.192
 our-0.179
 yours-0.176
 your-0.173
 itself-0.170
Token The
Token ID383
Feature activation+4.25e+00

Pos logit contributions
ny+0.019
anda+0.018
my+0.018
oise+0.018
y+0.018
Neg logit contributions
 yours-0.013
 our-0.012
 ourselves-0.012
 its-0.012
 yourselves-0.012
Token hipp
Token ID18568
Feature activation+1.49e+01

Pos logit contributions
oise+0.077
anda+0.076
mun+0.073
ny+0.072
my+0.070
Neg logit contributions
 its-0.084
 yours-0.084
 our-0.081
 such-0.079
 itself-0.078
Tokeno
Token ID78
Feature activation+1.16e+01

Pos logit contributions
ird+0.201
ches+0.198
cases+0.195
 salon+0.192
ppy+0.191
Neg logit contributions
 its-0.348
 our-0.325
 your-0.309
 such-0.308
 their-0.303
Token was
Token ID373
Feature activation-7.31e-01

Pos logit contributions
oise+0.209
mun+0.198
ird+0.186
anda+0.183
ony+0.178
Neg logit contributions
 its-0.249
 our-0.232
 your-0.222
 itself-0.219
 yours-0.215
Token so
Token ID523
Feature activation-7.74e-01

Pos logit contributions
 itself+0.016
 yours+0.016
 ourselves+0.016
 electricity+0.016
 yourselves+0.016
Neg logit contributions
my-0.010
anda-0.010
a-0.010
ny-0.010
oise-0.010
Token cute
Token ID13779
Feature activation+1.96e+00

Pos logit contributions
 yours+0.013
 itself+0.012
 such+0.012
 its+0.012
 electricity+0.012
Neg logit contributions
anda-0.016
oise-0.016
ny-0.015
ll-0.015
a-0.015
Token and
Token ID290
Feature activation+8.25e-01

Pos logit contributions
oise+0.040
anda+0.039
mun+0.038
ny+0.038
y+0.037
Neg logit contributions
 yours-0.034
 its-0.033
 our-0.033
 theirs-0.031
 such-0.031
Token named
Token ID3706
Feature activation-1.48e+00

Pos logit contributions
oise+0.151
mun+0.145
anda+0.144
ll+0.144
ny+0.137
Neg logit contributions
 its-0.189
 our-0.174
 yours-0.172
 your-0.171
 such-0.166
Token Pe
Token ID2631
Feature activation+8.17e+00

Pos logit contributions
 yourselves+0.022
 yours+0.021
 its+0.021
 theirs+0.021
 electricity+0.021
Neg logit contributions
my-0.032
ny-0.032
's-0.031
anda-0.031
ppy-0.031
Tokenanut
Token ID20651
Feature activation+1.30e+01

Pos logit contributions
mun+0.129
ny+0.122
anda+0.121
oise+0.121
board+0.112
Neg logit contributions
 yours-0.182
 our-0.174
 its-0.171
 your-0.170
 theirs-0.165
Token.
Token ID13
Feature activation+5.23e-01

Pos logit contributions
oise+0.318
mun+0.314
cases+0.314
ited+0.307
anda+0.307
Neg logit contributions
 its-0.179
 our-0.161
 your-0.156
 yours-0.154
 such-0.144
Token Pe
Token ID2631
Feature activation+8.17e+00

Pos logit contributions
anda+0.010
ny+0.010
oise+0.010
my+0.010
ony+0.010
Neg logit contributions
 yours-0.009
 our-0.009
 its-0.009
 electricity-0.008
 itself-0.008
Tokenanut
Token ID20651
Feature activation+1.48e+01

Pos logit contributions
mun+0.115
oise+0.115
ny+0.106
anda+0.105
ird+0.098
Neg logit contributions
 yours-0.199
 its-0.192
 our-0.189
 your-0.187
 itself-0.185
Token lived
Token ID5615
Feature activation-2.20e+00

Pos logit contributions
oise+0.324
cases+0.318
ird+0.314
ited+0.313
mun+0.313
Neg logit contributions
 its-0.248
 our-0.228
 your-0.226
 their-0.209
 the-0.205
Token in
Token ID287
Feature activation-1.66e+00

Pos logit contributions
 yours+0.042
 its+0.041
 our+0.040
 theirs+0.039
 electricity+0.038
Neg logit contributions
oise-0.042
anda-0.040
mun-0.039
ny-0.039
ll-0.039
Token an
Token ID281
Feature activation+8.50e-01

Pos logit contributions
 yours+0.024
 theirs+0.022
 ourselves+0.020
 yourselves+0.020
 itself+0.019
Neg logit contributions
's-0.035
my-0.034
ny-0.034
a-0.032
ony-0.030
Token airport
Token ID9003
Feature activation+2.75e+00

Pos logit contributions
oise+0.020
anda+0.020
ny+0.019
my+0.019
mun+0.019
Neg logit contributions
 yours-0.012
 its-0.012
 our-0.011
 such-0.011
 theirs-0.011
Token.
Token ID13
Feature activation+3.79e-01

Pos logit contributions
oise+0.053
anda+0.050
mun+0.050
ny+0.049
my+0.049
Neg logit contributions
 yours-0.051
 its-0.049
 our-0.049
 theirs-0.048
 electricity-0.047
Token as
Token ID355
Feature activation-4.39e+00

Pos logit contributions
oise+0.000
anda+0.000
ny+0.000
mun+0.000
ll+0.000
Neg logit contributions
 its-0.000
 yours-0.000
 our-0.000
 your-0.000
 such-0.000
Token Hel
Token ID5053
Feature activation+3.10e+00

Pos logit contributions
 yours+0.053
 yourselves+0.052
 our+0.051
 its+0.048
 ourselves+0.048
Neg logit contributions
a-0.099
oise-0.098
ll-0.098
anda-0.095
my-0.095
Tokenio
Token ID952
Feature activation+1.10e+01

Pos logit contributions
oise+0.036
anda+0.034
mun+0.033
ny+0.031
ll+0.031
Neg logit contributions
 its-0.082
 yours-0.082
 our-0.080
 such-0.078
 itself-0.078
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
mun+0.152
oise+0.150
anda+0.136
m+0.126
ll+0.124
Neg logit contributions
 its-0.265
 our-0.253
 yours-0.251
 itself-0.249
 such-0.244
Token Hel
Token ID5053
Feature activation+4.75e+00

Pos logit contributions
 yours+0.004
 its+0.004
 our+0.004
 theirs+0.004
 such+0.004
Neg logit contributions
oise-0.004
anda-0.004
ny-0.004
mun-0.004
my-0.004
Tokenio
Token ID952
Feature activation+1.48e+01

Pos logit contributions
oise+0.066
mun+0.066
anda+0.062
ll+0.061
my+0.060
Neg logit contributions
 yours-0.113
 its-0.113
 our-0.109
 itself-0.108
 theirs-0.107
Token explained
Token ID4893
Feature activation-3.86e+00

Pos logit contributions
mun+0.244
oise+0.232
 salon+0.205
pit+0.198
m+0.196
Neg logit contributions
 its-0.331
 itself-0.312
 our-0.292
 such-0.279
 an-0.279
Token that
Token ID326
Feature activation-2.11e+00

Pos logit contributions
 yours+0.056
 yourselves+0.050
 our+0.049
 ourselves+0.048
 ours+0.048
Neg logit contributions
anda-0.085
's-0.084
a-0.082
ony-0.082
y-0.081
Token their
Token ID511
Feature activation+2.24e+00

Pos logit contributions
 yours+0.031
 our+0.030
 its+0.030
 yourselves+0.028
 itself+0.028
Neg logit contributions
oise-0.047
mun-0.046
anda-0.045
my-0.045
's-0.044
Token oxygen
Token ID11863
Feature activation+4.77e+00

Pos logit contributions
anda+0.046
oise+0.044
ny+0.043
's+0.042
ll+0.042
Neg logit contributions
 yours-0.038
 its-0.038
 such-0.036
 theirs-0.035
 yourselves-0.035
Token was
Token ID373
Feature activation-3.57e+00

Pos logit contributions
oise+0.060
anda+0.057
mun+0.055
ny+0.052
ony+0.051
Neg logit contributions
 its-0.122
 yours-0.120
 our-0.119
 such-0.115
 itself-0.115
Token.
Token ID13
Feature activation+3.71e-01

Pos logit contributions
oise+0.003
anda+0.003
mun+0.003
ny+0.003
ll+0.003
Neg logit contributions
 its-0.005
 yours-0.005
 our-0.004
 such-0.004
 your-0.004
Token " "
Token ID220
Feature activation-2.29e-01

Pos logit contributions
oise+0.009
anda+0.009
mun+0.008
ll+0.008
ny+0.008
Neg logit contributions
 its-0.006
 yours-0.005
 our-0.005
 such-0.005
 your-0.005
Token "\n"
Token ID198
Feature activation-1.52e-01

Pos logit contributions
 its+0.004
 yours+0.004
 our+0.004
 such+0.004
 your+0.004
Neg logit contributions
oise-0.005
anda-0.005
mun-0.005
ny-0.004
ony-0.004
Token "\n"
Token ID198
Feature activation+1.35e-01

Pos logit contributions
 its+0.002
 our+0.002
 yours+0.002
 your+0.002
 itself+0.002
Neg logit contributions
oise-0.004
anda-0.004
mun-0.003
mo-0.003
ppy-0.003
TokenThe
Token ID464
Feature activation+4.19e+00

Pos logit contributions
anda+0.003
oise+0.003
mun+0.003
ony+0.003
ll+0.003
Neg logit contributions
 its-0.002
 our-0.002
 yours-0.002
 your-0.002
 such-0.002
Token hipp
Token ID18568
Feature activation+1.48e+01

Pos logit contributions
oise+0.081
anda+0.078
mun+0.076
ny+0.074
ony+0.074
Neg logit contributions
 its-0.079
 yours-0.078
 our-0.078
 your-0.074
 such-0.074
Tokeno
Token ID78
Feature activation+1.10e+01

Pos logit contributions
ches+0.228
cases+0.205
ede+0.204
ppy+0.203
ata+0.202
Neg logit contributions
 its-0.301
 our-0.291
 such-0.277
 their-0.266
 your-0.263
Token found
Token ID1043
Feature activation-3.71e+00

Pos logit contributions
oise+0.180
mun+0.174
anda+0.172
ird+0.161
 salon+0.155
Neg logit contributions
 its-0.238
 our-0.224
 itself-0.210
 their-0.205
 the-0.205
Token some
Token ID617
Feature activation+3.51e-02

Pos logit contributions
 yours+0.054
 yourselves+0.048
 purpose+0.047
 outer+0.046
 theirs+0.045
Neg logit contributions
's-0.083
my-0.081
a-0.077
m-0.077
ny-0.074
Token vegetables
Token ID13701
Feature activation+4.54e+00

Pos logit contributions
oise+0.001
anda+0.001
mun+0.001
ny+0.001
ony+0.001
Neg logit contributions
 yours-0.001
 its-0.001
 our-0.001
 itself-0.001
 theirs-0.000
Token and
Token ID290
Feature activation-5.52e-01

Pos logit contributions
oise+0.082
anda+0.080
mun+0.077
ny+0.075
ony+0.073
Neg logit contributions
 its-0.092
 yours-0.090
 our-0.089
 such-0.086
 your-0.085
Token was
Token ID373
Feature activation-1.45e+00

Pos logit contributions
oise+0.145
mun+0.135
anda+0.131
ll+0.123
mo+0.115
Neg logit contributions
 its-0.255
 yours-0.239
 such-0.237
 our-0.237
 itself-0.235
Token so
Token ID523
Feature activation-1.84e+00

Pos logit contributions
 electricity+0.023
 yours+0.023
 yourselves+0.022
 itself+0.022
 purpose+0.021
Neg logit contributions
's-0.028
my-0.026
a-0.026
ny-0.026
y-0.026
Token attractive
Token ID10966
Feature activation-6.83e-01

Pos logit contributions
 yours+0.026
 its+0.025
 our+0.024
 such+0.024
 itself+0.024
Neg logit contributions
anda-0.043
oise-0.043
mun-0.042
ny-0.041
ll-0.041
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
 yours+0.013
 its+0.013
 our+0.013
 electricity+0.012
 theirs+0.012
Neg logit contributions
anda-0.012
oise-0.012
y-0.011
ll-0.011
mun-0.011
Token The
Token ID383
Feature activation+4.55e+00

Pos logit contributions
oise+0.026
anda+0.025
mun+0.025
ny+0.024
my+0.024
Neg logit contributions
 yours-0.023
 its-0.022
 our-0.022
 such-0.021
 theirs-0.021
Token hipp
Token ID18568
Feature activation+1.47e+01

Pos logit contributions
oise+0.090
anda+0.085
mun+0.083
ony+0.081
ny+0.081
Neg logit contributions
 its-0.084
 yours-0.083
 our-0.083
 your-0.078
 such-0.078
Tokeno
Token ID78
Feature activation+1.10e+01

Pos logit contributions
ches+0.212
ird+0.210
ede+0.203
ppy+0.203
 salon+0.201
Neg logit contributions
 its-0.310
 our-0.297
 such-0.286
 your-0.276
 their-0.271
Token was
Token ID373
Feature activation-1.33e+00

Pos logit contributions
oise+0.185
ird+0.174
mun+0.171
anda+0.157
ony+0.154
Neg logit contributions
 its-0.240
 our-0.226
 your-0.214
 itself-0.210
 the-0.209
Token excited
Token ID6568
Feature activation-1.36e+00

Pos logit contributions
 yours+0.028
 electricity+0.027
 ourselves+0.027
 itself+0.027
 yourselves+0.026
Neg logit contributions
a-0.020
anda-0.020
my-0.020
's-0.019
oise-0.019
Token to
Token ID284
Feature activation-2.97e+00

Pos logit contributions
 its+0.024
 yours+0.024
 our+0.023
 such+0.022
 your+0.022
Neg logit contributions
oise-0.028
anda-0.027
mun-0.026
ny-0.026
ll-0.026
Token meet
Token ID1826
Feature activation-3.31e+00

Pos logit contributions
 yours+0.058
 electricity+0.056
 yourselves+0.056
 our+0.055
 such+0.055
Neg logit contributions
oise-0.048
my-0.046
's-0.045
ny-0.045
anda-0.044
Token you
Token ID345
Feature activation+1.60e+00

Pos logit contributions
 theirs+0.067
 electricity+0.063
 its+0.060
 disl+0.060
 yours+0.058
Neg logit contributions
ie-0.110
me-0.106
's-0.106
ird-0.102
my-0.101
Token."
Token ID526
Feature activation+1.38e+00

Pos logit contributions
oise+0.038
anda+0.038
mun+0.036
ny+0.036
ll+0.036
Neg logit contributions
 its-0.022
 yours-0.022
 our-0.021
 such-0.020
 itself-0.020
Token\n
Token ID198
Feature activation+1.45e-01

Pos logit contributions
oise+0.028
anda+0.027
mun+0.026
ny+0.025
ll+0.025
Neg logit contributions
 its-0.025
 yours-0.025
 our-0.024
 such-0.023
 your-0.023
Token\n
Token ID198
Feature activation+1.24e-01

Pos logit contributions
oise+0.003
anda+0.003
mun+0.003
ny+0.003
ll+0.003
Neg logit contributions
 yours-0.003
 its-0.003
 our-0.003
 such-0.002
 theirs-0.002
TokenMar
Token ID7676
Feature activation+6.98e+00

Pos logit contributions
oise+0.002
anda+0.002
mun+0.002
ny+0.002
ll+0.002
Neg logit contributions
 its-0.003
 yours-0.003
 our-0.002
 such-0.002
 your-0.002
Tokeniah
Token ID9520
Feature activation+1.47e+01

Pos logit contributions
oise+0.088
anda+0.082
mun+0.082
ny+0.076
ll+0.073
Neg logit contributions
 its-0.177
 yours-0.176
 our-0.174
 your-0.169
 such-0.168
Token was
Token ID373
Feature activation-8.01e-01

Pos logit contributions
oise+0.253
mun+0.252
pillar+0.232
anda+0.223
pit+0.218
Neg logit contributions
 its-0.308
 the-0.283
 your-0.280
 an-0.277
 their-0.275
Token excited
Token ID6568
Feature activation-1.83e+00

Pos logit contributions
 yours+0.016
 its+0.015
 our+0.015
 itself+0.015
 theirs+0.015
Neg logit contributions
oise-0.014
anda-0.014
mun-0.014
my-0.014
ny-0.013
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
 its+0.030
 yours+0.030
 our+0.029
 such+0.028
 your+0.028
Neg logit contributions
oise-0.040
anda-0.039
mun-0.038
ny-0.037
ll-0.036
Token What
Token ID1867
Feature activation-6.71e-01

Pos logit contributions
anda+0.002
my+0.002
oise+0.002
ny+0.002
y+0.002
Neg logit contributions
 yours-0.003
 our-0.003
 its-0.003
 theirs-0.002
 such-0.002
Token could
Token ID714
Feature activation-4.08e+00

Pos logit contributions
 its+0.011
 yours+0.011
 our+0.010
 such+0.010
 your+0.010
Neg logit contributions
oise-0.015
anda-0.014
mun-0.014
ny-0.014
ll-0.014
Token\n
Token ID198
Feature activation-1.48e+00

Pos logit contributions
 its+0.014
 yours+0.014
 our+0.013
 such+0.013
 your+0.013
Neg logit contributions
oise-0.023
anda-0.023
mun-0.022
ny-0.022
ll-0.022
Token\n
Token ID198
Feature activation-1.32e+00

Pos logit contributions
 its+0.024
 yours+0.024
 our+0.023
 your+0.022
 itself+0.022
Neg logit contributions
oise-0.032
anda-0.032
mun-0.031
ny-0.030
ony-0.030
TokenOne
Token ID3198
Feature activation+1.46e+00

Pos logit contributions
 its+0.024
 yours+0.023
 our+0.023
 your+0.023
 itself+0.022
Neg logit contributions
oise-0.026
anda-0.026
mun-0.025
ony-0.024
ll-0.024
Token day
Token ID1110
Feature activation-1.76e+00

Pos logit contributions
mun+0.033
oise+0.033
ll+0.032
anda+0.031
ppy+0.031
Neg logit contributions
 its-0.024
 our-0.023
 yours-0.022
 your-0.022
 theirs-0.020
Token Aless
Token ID47319
Feature activation+5.28e+00

Pos logit contributions
 yours+0.038
 its+0.036
 our+0.036
 itself+0.036
 theirs+0.035
Neg logit contributions
oise-0.028
anda-0.027
mun-0.027
ny-0.026
y-0.026
Tokenio
Token ID952
Feature activation+1.47e+01

Pos logit contributions
oise+0.079
mun+0.078
ny+0.071
anda+0.069
my+0.068
Neg logit contributions
 yours-0.123
 its-0.122
 our-0.120
 itself-0.118
 your-0.116
Token was
Token ID373
Feature activation-1.53e+00

Pos logit contributions
mun+0.234
oise+0.222
sie+0.188
itty+0.188
sc+0.184
Neg logit contributions
 its-0.303
 an-0.295
 our-0.288
 your-0.286
 yours-0.278
Token walking
Token ID6155
Feature activation-1.94e+00

Pos logit contributions
 its+0.025
 yours+0.025
 our+0.024
 such+0.023
 your+0.023
Neg logit contributions
oise-0.033
anda-0.033
mun-0.032
ny-0.031
ll-0.031
Token down
Token ID866
Feature activation-5.19e+00

Pos logit contributions
 its+0.031
 yours+0.030
 our+0.029
 such+0.028
 your+0.028
Neg logit contributions
oise-0.044
anda-0.043
mun-0.042
ll-0.041
ony-0.040
Token the
Token ID262
Feature activation+1.49e+00

Pos logit contributions
 yours+0.064
 ourselves+0.052
 ours+0.052
 our+0.050
 yourselves+0.050
Neg logit contributions
my-0.130
ny-0.130
oise-0.130
y-0.127
mun-0.124
Token street
Token ID4675
Feature activation+1.43e+00

Pos logit contributions
oise+0.030
mun+0.029
ird+0.028
ppy+0.028
anda+0.028
Neg logit contributions
 our-0.028
 your-0.027
 its-0.027
 yours-0.026
 electricity-0.025
Token he
Token ID339
Feature activation+3.19e+00

Pos logit contributions
 its+0.020
 yours+0.020
 our+0.020
 such+0.019
 your+0.019
Neg logit contributions
oise-0.017
anda-0.017
mun-0.016
ony-0.015
ny-0.015
Token wanted
Token ID2227
Feature activation-6.16e+00

Pos logit contributions
oise+0.048
anda+0.045
mun+0.045
ony+0.042
pit+0.041
Neg logit contributions
 its-0.071
 such-0.068
 yours-0.068
 our-0.067
 your-0.067
Token.
Token ID13
Feature activation-8.19e-01

Pos logit contributions
 our+0.103
 yours+0.102
 yourselves+0.101
 its+0.100
 ourselves+0.098
Neg logit contributions
my-0.119
's-0.115
ny-0.114
oise-0.113
y-0.112
Token<|endoftext|>
Token ID50256
Feature activation+3.49e+00

Pos logit contributions
 its+0.013
 yours+0.012
 our+0.012
 such+0.012
 your+0.012
Neg logit contributions
oise-0.019
anda-0.018
mun-0.018
ony-0.017
ny-0.017
TokenAk
Token ID33901
Feature activation+8.36e+00

Pos logit contributions
oise+0.080
anda+0.078
mun+0.074
ony+0.074
ppy+0.072
Neg logit contributions
 its-0.056
 such-0.052
 our-0.052
 yours-0.051
 itself-0.050
Tokenio
Token ID952
Feature activation+1.47e+01

Pos logit contributions
mun+0.105
oise+0.094
ny+0.089
anda+0.084
ird+0.080
Neg logit contributions
 yours-0.213
 its-0.206
 your-0.204
 our-0.204
 theirs-0.199
Token was
Token ID373
Feature activation-8.29e-01

Pos logit contributions
mun+0.252
oise+0.247
anda+0.222
ny+0.207
mo+0.205
Neg logit contributions
 its-0.300
 your-0.280
 our-0.272
 an-0.271
 such-0.264
Token a
Token ID257
Feature activation+1.95e+00

Pos logit contributions
 yours+0.013
 its+0.013
 our+0.013
 itself+0.012
 theirs+0.012
Neg logit contributions
oise-0.018
anda-0.018
mun-0.017
my-0.017
ny-0.017
Token 3
Token ID513
Feature activation+5.72e+00

Pos logit contributions
oise+0.043
ll+0.042
mun+0.042
ony+0.042
anda+0.042
Neg logit contributions
 its-0.031
 our-0.030
 your-0.029
 yours-0.028
 my-0.026
Token year
Token ID614
Feature activation-5.62e-01

Pos logit contributions
ll+0.118
ppy+0.108
oise+0.106
pp+0.104
ches+0.104
Neg logit contributions
 its-0.111
 our-0.097
 the-0.096
 it-0.096
 your-0.096
Token old
Token ID1468
Feature activation+3.99e+00

Pos logit contributions
 its+0.007
 yours+0.007
 our+0.006
 such+0.006
 itself+0.006
Neg logit contributions
oise-0.015
anda-0.014
mun-0.014
ny-0.014
ony-0.014
Token named
Token ID3706
Feature activation-5.63e-01

Pos logit contributions
oise+0.145
anda+0.139
ll+0.138
mun+0.138
ny+0.133
Neg logit contributions
 its-0.192
 our-0.181
 your-0.179
 yours-0.178
 such-0.171
Token Ab
Token ID2275
Feature activation+1.18e+01

Pos logit contributions
 its+0.007
 yours+0.007
 our+0.007
 such+0.007
 itself+0.006
Neg logit contributions
oise-0.014
anda-0.014
mun-0.013
ny-0.013
my-0.013
Tokenbie
Token ID12590
Feature activation+1.26e+01

Pos logit contributions
mun+0.117
cases+0.113
ny+0.110
board+0.093
oise+0.091
Neg logit contributions
 our-0.315
 your-0.310
 yours-0.309
 its-0.307
 such-0.294
Token.
Token ID13
Feature activation+5.54e-01

Pos logit contributions
oise+0.320
mun+0.304
anda+0.296
ny+0.293
ll+0.289
Neg logit contributions
 its-0.181
 our-0.161
 your-0.155
 yours-0.153
 their-0.146
Token Ab
Token ID2275
Feature activation+1.17e+01

Pos logit contributions
anda+0.010
oise+0.010
mun+0.009
ny+0.009
my+0.009
Neg logit contributions
 yours-0.011
 our-0.011
 its-0.011
 such-0.010
 your-0.010
Tokenbie
Token ID12590
Feature activation+1.47e+01

Pos logit contributions
mun+0.129
ny+0.118
cases+0.113
oise+0.112
ches+0.100
Neg logit contributions
 yours-0.298
 your-0.298
 our-0.297
 its-0.296
 such-0.285
Token liked
Token ID8288
Feature activation-3.83e+00

Pos logit contributions
oise+0.322
mun+0.293
pit+0.277
ppy+0.276
ches+0.274
Neg logit contributions
 its-0.269
 your-0.249
 our-0.248
 their-0.233
 the-0.233
Token counting
Token ID14143
Feature activation-1.97e+00

Pos logit contributions
 itself+0.062
 yours+0.060
 ourselves+0.058
 yourselves+0.057
 its+0.056
Neg logit contributions
my-0.078
's-0.075
mun-0.074
y-0.073
ll-0.072
Token things
Token ID1243
Feature activation+1.18e+00

Pos logit contributions
 yours+0.026
 its+0.024
 our+0.024
 theirs+0.023
 itself+0.023
Neg logit contributions
anda-0.047
oise-0.047
mun-0.046
y-0.046
my-0.046
Token.
Token ID13
Feature activation+3.53e-01

Pos logit contributions
oise+0.023
anda+0.022
mun+0.021
ny+0.021
my+0.021
Neg logit contributions
 yours-0.021
 its-0.021
 our-0.021
 such-0.020
 your-0.020
Token One
Token ID1881
Feature activation+3.38e+00

Pos logit contributions
oise+0.007
anda+0.007
mun+0.007
ny+0.007
ll+0.007
Neg logit contributions
 its-0.006
 yours-0.006
 our-0.006
 such-0.006
 your-0.006
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
 yours+0.059
 our+0.055
 yourselves+0.054
 its+0.052
 electricity+0.051
Neg logit contributions
my-0.074
oise-0.074
mun-0.073
's-0.073
ny-0.073
Token
Token ID220
Feature activation-1.75e+00

Pos logit contributions
 yours+0.023
 its+0.022
 our+0.022
 theirs+0.021
 such+0.021
Neg logit contributions
oise-0.026
anda-0.025
mun-0.025
ny-0.024
my-0.024
Token\n
Token ID198
Feature activation-1.53e+00

Pos logit contributions
 its+0.030
 yours+0.029
 our+0.029
 such+0.027
 your+0.027
Neg logit contributions
oise-0.037
anda-0.036
mun-0.035
ny-0.034
ony-0.034
Token\n
Token ID198
Feature activation-1.26e+00

Pos logit contributions
 its+0.026
 yours+0.025
 our+0.025
 such+0.024
 your+0.024
Neg logit contributions
oise-0.032
anda-0.032
mun-0.031
ny-0.030
ony-0.030
TokenThe
Token ID464
Feature activation+3.42e+00

Pos logit contributions
 its+0.021
 yours+0.021
 our+0.020
 such+0.019
 your+0.019
Neg logit contributions
oise-0.027
anda-0.027
mun-0.026
ny-0.025
ll-0.025
Token chip
Token ID11594
Feature activation+1.47e+01

Pos logit contributions
oise+0.064
anda+0.062
ppy+0.061
ony+0.060
ches+0.060
Neg logit contributions
 our-0.065
 its-0.063
 your-0.061
 electricity-0.061
 yours-0.061
Tokenm
Token ID76
Feature activation+5.28e+00

Pos logit contributions
cases+0.176
anda+0.176
ppy+0.169
jo+0.169
ony+0.167
Neg logit contributions
 the-0.334
 their-0.320
 its-0.320
 our-0.317
 your-0.314
Tokenunk
Token ID2954
Feature activation+9.57e+00

Pos logit contributions
oise+0.095
mun+0.088
anda+0.082
mie+0.082
ll+0.082
Neg logit contributions
 its-0.102
 our-0.098
 itself-0.097
 electricity-0.096
 yours-0.096
Token told
Token ID1297
Feature activation-4.15e+00

Pos logit contributions
oise+0.138
anda+0.134
mun+0.132
ony+0.126
ppy+0.121
Neg logit contributions
 its-0.222
 such-0.205
 our-0.199
 itself-0.197
 an-0.196
Token the
Token ID262
Feature activation+2.53e+00

Pos logit contributions
 yours+0.045
 theirs+0.035
 our+0.035
 ours+0.035
 its+0.034
Neg logit contributions
mun-0.106
my-0.106
ny-0.105
's-0.105
anda-0.105
Token boy
Token ID2933
Feature activation+7.63e+00

Pos logit contributions
oise+0.052
anda+0.050
ppy+0.048
ony+0.047
ll+0.046
Neg logit contributions
 its-0.044
 our-0.044
 such-0.043
 yours-0.043
 electricity-0.042
Token big
Token ID1263
Feature activation+2.58e+00

Pos logit contributions
oise+0.056
mun+0.053
anda+0.053
ony+0.052
ll+0.052
Neg logit contributions
 its-0.040
 our-0.040
 your-0.038
 yours-0.037
 my-0.033
Token hipp
Token ID18568
Feature activation+1.19e+01

Pos logit contributions
oise+0.050
anda+0.047
mun+0.047
ll+0.046
ony+0.046
Neg logit contributions
 its-0.049
 our-0.048
 yours-0.048
 your-0.046
 such-0.045
Tokeno
Token ID78
Feature activation+9.00e+00

Pos logit contributions
ches+0.122
oise+0.121
mie+0.121
mun+0.117
cases+0.113
Neg logit contributions
 its-0.317
 our-0.298
 such-0.293
 your-0.291
 yours-0.290
Token.
Token ID13
Feature activation+3.70e-01

Pos logit contributions
oise+0.159
anda+0.156
mun+0.155
ll+0.148
ony+0.146
Neg logit contributions
 its-0.186
 our-0.177
 yours-0.175
 your-0.171
 such-0.164
Token The
Token ID383
Feature activation+3.89e+00

Pos logit contributions
ny+0.008
anda+0.008
my+0.008
y+0.008
oise+0.007
Neg logit contributions
 yours-0.006
 our-0.005
 yourselves-0.005
 ourselves-0.005
 theirs-0.005
Token hipp
Token ID18568
Feature activation+1.47e+01

Pos logit contributions
anda+0.068
oise+0.068
mun+0.065
ny+0.065
my+0.063
Neg logit contributions
 yours-0.078
 its-0.078
 our-0.074
 such-0.074
 itself-0.073
Tokeno
Token ID78
Feature activation+1.13e+01

Pos logit contributions
ches+0.203
ppy+0.198
ede+0.192
ited+0.191
oise+0.191
Neg logit contributions
 its-0.333
 our-0.319
 such-0.299
 their-0.298
 your-0.298
Token lived
Token ID5615
Feature activation-2.05e+00

Pos logit contributions
oise+0.206
mun+0.197
ird+0.179
anda+0.179
ony+0.179
Neg logit contributions
 its-0.235
 our-0.227
 your-0.213
 yours-0.206
 itself-0.205
Token near
Token ID1474
Feature activation-2.02e+00

Pos logit contributions
 yours+0.039
 its+0.038
 our+0.037
 such+0.036
 itself+0.036
Neg logit contributions
oise-0.039
anda-0.037
mun-0.036
ny-0.036
ll-0.036
Token a
Token ID257
Feature activation+2.28e+00

Pos logit contributions
 yours+0.024
 yourselves+0.020
 electricity+0.020
 ourselves+0.019
 theirs+0.019
Neg logit contributions
's-0.050
my-0.050
ny-0.048
y-0.047
a-0.047
Token river
Token ID7850
Feature activation+4.96e+00

Pos logit contributions
oise+0.052
anda+0.052
mun+0.050
ny+0.050
ll+0.049
Neg logit contributions
 yours-0.034
 its-0.034
 our-0.032
 such-0.032
 itself-0.031
TokenThe
Token ID464
Feature activation+3.50e+00

Pos logit contributions
 its+0.013
 our+0.013
 yours+0.013
 such+0.012
 your+0.012
Neg logit contributions
oise-0.017
anda-0.016
ony-0.016
mun-0.015
ppy-0.015
Token end
Token ID886
Feature activation-2.05e-01

Pos logit contributions
oise+0.072
anda+0.069
mun+0.068
ny+0.066
my+0.066
Neg logit contributions
 yours-0.061
 its-0.061
 our-0.058
 theirs-0.057
 such-0.057
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 yours+0.004
 its+0.004
 our+0.004
 such+0.004
 your+0.004
Neg logit contributions
oise-0.004
anda-0.004
mun-0.004
ny-0.004
ll-0.004
Token<|endoftext|>
Token ID50256
Feature activation+3.38e+00

Pos logit contributions
 its+0.020
 yours+0.020
 our+0.019
 such+0.018
 your+0.018
Neg logit contributions
oise-0.024
anda-0.023
mun-0.023
ny-0.022
ony-0.022
TokenD
Token ID35
Feature activation+6.19e+00

Pos logit contributions
oise+0.073
anda+0.073
ony+0.069
mun+0.068
ll+0.067
Neg logit contributions
 its-0.057
 such-0.053
 our-0.053
 yours-0.053
 itself-0.051
Tokenennis
Token ID10679
Feature activation+1.46e+01

Pos logit contributions
oise+0.078
mun+0.077
anda+0.069
mo+0.068
ppy+0.063
Neg logit contributions
 yours-0.156
 our-0.153
 its-0.153
 your-0.149
 such-0.146
Token was
Token ID373
Feature activation-1.25e+00

Pos logit contributions
oise+0.242
mun+0.236
anda+0.234
ird+0.211
m+0.210
Neg logit contributions
 its-0.324
 our-0.305
 your-0.301
 such-0.287
 yours-0.284
Token feeling
Token ID4203
Feature activation-2.70e+00

Pos logit contributions
 yours+0.018
 its+0.017
 our+0.017
 itself+0.017
 theirs+0.016
Neg logit contributions
oise-0.029
anda-0.028
my-0.028
mun-0.028
ny-0.027
Token very
Token ID845
Feature activation-2.00e+00

Pos logit contributions
 yours+0.058
 itself+0.056
 yourselves+0.055
 ourselves+0.055
 theirs+0.055
Neg logit contributions
a-0.041
mun-0.041
ny-0.041
oise-0.040
's-0.040
Token lonely
Token ID21757
Feature activation-1.68e+00

Pos logit contributions
 yours+0.041
 itself+0.040
 its+0.040
 such+0.039
 our+0.039
Neg logit contributions
anda-0.033
oise-0.031
mun-0.031
a-0.031
ll-0.031
Token.
Token ID13
Feature activation+5.77e-01

Pos logit contributions
 its+0.032
 yours+0.032
 our+0.031
 such+0.030
 your+0.030
Neg logit contributions
oise-0.031
anda-0.030
mun-0.029
ny-0.029
ll-0.029
Token and
Token ID290
Feature activation-2.69e+00

Pos logit contributions
 its+0.082
 yours+0.081
 our+0.079
 itself+0.077
 theirs+0.077
Neg logit contributions
oise-0.070
y-0.067
ny-0.066
mun-0.065
anda-0.065
Token prepare
Token ID8335
Feature activation-6.88e+00

Pos logit contributions
 its+0.043
 yours+0.042
 theirs+0.042
 our+0.041
 itself+0.040
Neg logit contributions
oise-0.055
my-0.053
mun-0.053
ll-0.053
anda-0.052
Token,
Token ID11
Feature activation-3.36e+00

Pos logit contributions
 its+0.081
 theirs+0.081
 yours+0.078
 itself+0.072
 electricity+0.068
Neg logit contributions
my-0.155
's-0.154
a-0.151
ches-0.151
oise-0.151
Token you
Token ID345
Feature activation+9.98e-01

Pos logit contributions
 its+0.044
 yours+0.043
 our+0.041
 theirs+0.041
 itself+0.039
Neg logit contributions
oise-0.079
anda-0.078
my-0.077
mun-0.076
ll-0.075
Token would
Token ID561
Feature activation-4.36e+00

Pos logit contributions
oise+0.018
anda+0.018
mun+0.018
ony+0.016
ny+0.016
Neg logit contributions
 its-0.020
 yours-0.020
 our-0.020
 your-0.019
 such-0.019
Token fail
Token ID2038
Feature activation-9.24e+00

Pos logit contributions
 its+0.081
 yours+0.080
 our+0.078
 such+0.075
 theirs+0.075
Neg logit contributions
oise-0.084
anda-0.081
mun-0.079
ny-0.078
ll-0.077
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
 theirs+0.144
 yours+0.142
 its+0.139
 electricity+0.132
 our+0.131
Neg logit contributions
my-0.183
oise-0.180
y-0.175
a-0.174
's-0.173
Token test
Token ID1332
Feature activation+1.68e+00

Pos logit contributions
oise+0.022
anda+0.021
mun+0.020
ny+0.020
ll+0.020
Neg logit contributions
 its-0.021
 yours-0.021
 our-0.020
 theirs-0.020
 such-0.020
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
oise+0.036
anda+0.035
mun+0.034
ny+0.033
ony+0.033
Neg logit contributions
 its-0.028
 yours-0.028
 our-0.027
 your-0.026
 such-0.026
Token Next
Token ID7406
Feature activation-1.85e+00

Pos logit contributions
 its+0.014
 yours+0.014
 our+0.014
 such+0.013
 your+0.013
Neg logit contributions
oise-0.024
anda-0.024
mun-0.023
ny-0.023
ll-0.023
Token time
Token ID640
Feature activation-1.69e+00

Pos logit contributions
 its+0.032
 yours+0.032
 our+0.031
 such+0.030
 your+0.030
Neg logit contributions
oise-0.038
anda-0.037
mun-0.036
ny-0.035
ll-0.035
Token beauty
Token ID8737
Feature activation+8.46e-01

Pos logit contributions
oise+0.037
anda+0.036
mun+0.035
ny+0.034
my+0.034
Neg logit contributions
 its-0.023
 yours-0.023
 our-0.022
 such-0.021
 theirs-0.021
Token around
Token ID1088
Feature activation-3.50e+00

Pos logit contributions
oise+0.015
anda+0.015
mun+0.014
ny+0.014
y+0.014
Neg logit contributions
 yours-0.017
 its-0.017
 our-0.016
 your-0.016
 such-0.016
Token her
Token ID607
Feature activation+1.67e+00

Pos logit contributions
 yours+0.044
 our+0.040
 theirs+0.039
 electricity+0.038
 yourselves+0.038
Neg logit contributions
oise-0.085
y-0.084
ie-0.082
my-0.082
's-0.080
Token without
Token ID1231
Feature activation-6.14e+00

Pos logit contributions
oise+0.032
anda+0.032
mun+0.030
ny+0.030
ll+0.030
Neg logit contributions
 yours-0.031
 its-0.031
 our-0.030
 such-0.029
 your-0.029
Token always
Token ID1464
Feature activation-3.79e+00

Pos logit contributions
 our+0.091
 yours+0.091
 yourselves+0.085
 theirs+0.082
 its+0.081
Neg logit contributions
my-0.118
oise-0.117
y-0.117
a-0.115
ppy-0.114
Token needing
Token ID18139
Feature activation-9.18e+00

Pos logit contributions
 yours+0.072
 our+0.070
 its+0.068
 theirs+0.067
 electricity+0.066
Neg logit contributions
oise-0.069
my-0.065
mun-0.065
's-0.064
ny-0.064
Token to
Token ID284
Feature activation-5.17e+00

Pos logit contributions
 yourselves+0.112
 theirs+0.108
 yours+0.108
 our+0.106
 such+0.100
Neg logit contributions
's-0.191
a-0.186
ony-0.182
my-0.175
ches-0.174
Token touch
Token ID3638
Feature activation-6.21e+00

Pos logit contributions
 yours+0.102
 its+0.101
 our+0.100
 theirs+0.096
 such+0.095
Neg logit contributions
oise-0.091
anda-0.087
mun-0.084
ny-0.084
my-0.084
Token it
Token ID340
Feature activation-1.44e+00

Pos logit contributions
 yours+0.070
 our+0.062
 theirs+0.062
 electricity+0.061
 yourselves+0.060
Neg logit contributions
's-0.147
y-0.147
my-0.146
oise-0.145
ie-0.143
Token.
Token ID13
Feature activation-1.81e-01

Pos logit contributions
 yours+0.027
 its+0.027
 our+0.027
 theirs+0.025
 such+0.025
Neg logit contributions
oise-0.026
anda-0.026
ny-0.025
my-0.025
mun-0.024
Token<|endoftext|>
Token ID50256
Feature activation+3.31e+00

Pos logit contributions
 yours+0.003
 its+0.003
 our+0.003
 theirs+0.003
 yourselves+0.003
Neg logit contributions
oise-0.003
anda-0.003
my-0.003
mun-0.003
ll-0.003
Token helpless
Token ID21144
Feature activation-3.15e+00

Pos logit contributions
 electricity+0.045
 yours+0.045
 yourselves+0.044
 our+0.043
 ourselves+0.042
Neg logit contributions
a-0.046
my-0.046
's-0.046
oise-0.045
m-0.044
Token when
Token ID618
Feature activation-2.48e+00

Pos logit contributions
 yours+0.060
 its+0.060
 our+0.059
 your+0.056
 theirs+0.056
Neg logit contributions
oise-0.058
anda-0.057
mun-0.055
ny-0.055
y-0.054
Token it
Token ID340
Feature activation+1.66e+00

Pos logit contributions
 yours+0.043
 our+0.040
 its+0.040
 theirs+0.039
 itself+0.039
Neg logit contributions
oise-0.047
mun-0.047
my-0.046
anda-0.045
ll-0.045
Token came
Token ID1625
Feature activation-4.75e+00

Pos logit contributions
oise+0.023
anda+0.022
mun+0.021
ony+0.021
ny+0.019
Neg logit contributions
 its-0.041
 such-0.038
 our-0.038
 itself-0.037
 your-0.037
Token to
Token ID284
Feature activation-6.62e+00

Pos logit contributions
 yours+0.077
 our+0.073
 its+0.072
 theirs+0.071
 electricity+0.070
Neg logit contributions
my-0.099
oise-0.098
y-0.097
anda-0.096
ny-0.095
Token wanting
Token ID10291
Feature activation-8.93e+00

Pos logit contributions
 yours+0.089
 yourselves+0.080
 ours+0.080
 theirs+0.077
 such+0.075
Neg logit contributions
my-0.141
's-0.126
ie-0.126
ny-0.122
oise-0.120
Token toys
Token ID14958
Feature activation-9.21e-01

Pos logit contributions
 yours+0.126
 yourselves+0.124
 theirs+0.118
 our+0.114
 ours+0.111
Neg logit contributions
's-0.188
my-0.180
ie-0.172
a-0.169
ny-0.169
Token.
Token ID13
Feature activation-6.77e-01

Pos logit contributions
 its+0.016
 yours+0.016
 our+0.015
 such+0.015
 itself+0.015
Neg logit contributions
oise-0.019
anda-0.018
mun-0.017
ny-0.017
ll-0.017
Token So
Token ID1406
Feature activation-1.90e+00

Pos logit contributions
 its+0.011
 yours+0.011
 our+0.011
 such+0.010
 your+0.010
Neg logit contributions
oise-0.015
anda-0.014
mun-0.014
ny-0.013
ll-0.013
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
 yours+0.030
 its+0.030
 our+0.029
 theirs+0.028
 itself+0.027
Neg logit contributions
oise-0.041
anda-0.040
mun-0.039
ny-0.039
ll-0.038
Token she
Token ID673
Feature activation+4.47e+00

Pos logit contributions
 its+0.027
 yours+0.026
 our+0.026
 such+0.025
 your+0.024
Neg logit contributions
oise-0.046
anda-0.045
mun-0.044
ny-0.043
ony-0.043
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
 its+0.111
 yours+0.111
 our+0.108
 theirs+0.104
 itself+0.103
Neg logit contributions
oise-0.097
anda-0.093
my-0.092
ny-0.092
mun-0.092
Token but
Token ID475
Feature activation-1.23e+00

Pos logit contributions
 yours+0.029
 its+0.029
 our+0.028
 such+0.027
 itself+0.027
Neg logit contributions
oise-0.032
anda-0.031
mun-0.031
ny-0.030
my-0.030
Token his
Token ID465
Feature activation+2.54e+00

Pos logit contributions
 its+0.017
 yours+0.017
 our+0.017
 such+0.016
 theirs+0.016
Neg logit contributions
oise-0.029
anda-0.028
mun-0.028
ny-0.027
ll-0.027
Token mustache
Token ID49303
Feature activation+7.57e+00

Pos logit contributions
oise+0.051
anda+0.049
mun+0.048
ny+0.047
ll+0.046
Neg logit contributions
 its-0.046
 yours-0.045
 our-0.044
 such-0.042
 your-0.042
Token kept
Token ID4030
Feature activation-7.45e+00

Pos logit contributions
oise+0.111
anda+0.100
mun+0.098
ony+0.093
mo+0.092
Neg logit contributions
 its-0.186
 such-0.179
 yours-0.177
 itself-0.175
 your-0.173
Token getting
Token ID1972
Feature activation-8.88e+00

Pos logit contributions
 yours+0.114
 our+0.111
 yourselves+0.109
 ourselves+0.107
 theirs+0.106
Neg logit contributions
my-0.156
's-0.143
oise-0.142
y-0.141
ny-0.141
Token in
Token ID287
Feature activation-7.19e+00

Pos logit contributions
 yours+0.167
 its+0.161
 yourselves+0.161
 our+0.160
 itself+0.159
Neg logit contributions
my-0.162
ny-0.151
oise-0.148
m-0.146
's-0.145
Token the
Token ID262
Feature activation-9.94e-01

Pos logit contributions
 yours+0.100
 yourselves+0.087
 our+0.086
 ourselves+0.084
 theirs+0.083
Neg logit contributions
my-0.160
ny-0.151
y-0.150
oise-0.148
m-0.147
Token way
Token ID835
Feature activation-4.27e+00

Pos logit contributions
 its+0.017
 yours+0.017
 our+0.017
 such+0.016
 your+0.016
Neg logit contributions
oise-0.020
anda-0.020
mun-0.019
ny-0.019
ll-0.018
Token and
Token ID290
Feature activation-2.42e+00

Pos logit contributions
 its+0.076
 yours+0.076
 our+0.074
 theirs+0.071
 such+0.071
Neg logit contributions
oise-0.082
anda-0.080
my-0.079
ny-0.078
y-0.077
Token preventing
Token ID12174
Feature activation-7.71e+00

Pos logit contributions
 yours+0.045
 our+0.043
 its+0.043
 theirs+0.042
 such+0.041
Neg logit contributions
oise-0.045
my-0.044
mun-0.044
anda-0.043
ny-0.042
Token the
Token ID262
Feature activation+1.52e+00

Pos logit contributions
 theirs+0.058
 its+0.056
 yours+0.055
 such+0.052
 itself+0.050
Neg logit contributions
my-0.138
anda-0.137
a-0.134
ie-0.133
's-0.133
Token mud
Token ID17492
Feature activation+3.45e+00

Pos logit contributions
oise+0.031
anda+0.030
mun+0.029
ny+0.029
my+0.028
Neg logit contributions
 its-0.027
 yours-0.026
 our-0.026
 such-0.025
 itself-0.025
Token or
Token ID393
Feature activation-4.81e+00

Pos logit contributions
oise+0.074
anda+0.071
mun+0.071
ony+0.069
ny+0.068
Neg logit contributions
 yours-0.058
 its-0.057
 our-0.056
 your-0.053
 itself-0.053
Token you
Token ID345
Feature activation-2.48e-03

Pos logit contributions
 its+0.062
 yours+0.061
 itself+0.059
 such+0.058
 theirs+0.057
Neg logit contributions
my-0.109
oise-0.105
anda-0.105
's-0.103
mun-0.102
Token'll
Token ID1183
Feature activation-6.14e+00

Pos logit contributions
 its+0.000
 yours+0.000
 our+0.000
 your+0.000
 such+0.000
Neg logit contributions
oise-0.000
anda-0.000
mun-0.000
ny-0.000
ony-0.000
Token ruin
Token ID16866
Feature activation-8.86e+00

Pos logit contributions
 its+0.119
 yours+0.119
 our+0.115
 such+0.112
 theirs+0.112
Neg logit contributions
oise-0.110
anda-0.107
ny-0.104
mun-0.103
my-0.102
Token your
Token ID534
Feature activation+4.51e-01

Pos logit contributions
 theirs+0.064
 yours+0.062
 its+0.055
 electricity+0.052
 such+0.051
Neg logit contributions
's-0.239
my-0.238
anda-0.233
a-0.232
oise-0.231
Token dress
Token ID6576
Feature activation+8.70e-01

Pos logit contributions
oise+0.009
anda+0.009
mun+0.008
ny+0.008
ll+0.008
Neg logit contributions
 its-0.008
 yours-0.008
 our-0.008
 such-0.008
 itself-0.007
Token."
Token ID526
Feature activation-2.53e-01

Pos logit contributions
oise+0.020
anda+0.019
mun+0.019
ny+0.018
ony+0.018
Neg logit contributions
 its-0.014
 yours-0.014
 our-0.013
 your-0.013
 such-0.013
Token\n
Token ID198
Feature activation-1.30e+00

Pos logit contributions
 its+0.004
 our+0.004
 your+0.004
 yours+0.004
 such+0.004
Neg logit contributions
oise-0.006
anda-0.006
mun-0.005
ny-0.005
ll-0.005
Token\n
Token ID198
Feature activation-9.96e-01

Pos logit contributions
 its+0.021
 our+0.020
 yours+0.020
 your+0.019
 itself+0.019
Neg logit contributions
oise-0.029
anda-0.028
mun-0.027
ny-0.027
ony-0.026
Token you
Token ID345
Feature activation+2.43e+00

Pos logit contributions
 yours+0.042
 its+0.042
 our+0.040
 itself+0.039
 theirs+0.039
Neg logit contributions
oise-0.050
mun-0.050
anda-0.049
my-0.048
ll-0.047
Token have
Token ID423
Feature activation-3.61e+00

Pos logit contributions
anda+0.051
oise+0.051
mun+0.048
ony+0.046
mo+0.046
Neg logit contributions
 yours-0.041
 its-0.041
 our-0.040
 such-0.039
 your-0.038
Token instead
Token ID2427
Feature activation-2.40e+00

Pos logit contributions
 its+0.075
 yours+0.074
 itself+0.072
 electricity+0.072
 our+0.071
Neg logit contributions
ll-0.059
oise-0.058
mun-0.055
my-0.055
's-0.055
Token of
Token ID286
Feature activation-3.88e+00

Pos logit contributions
 its+0.045
 yours+0.044
 our+0.043
 such+0.041
 your+0.041
Neg logit contributions
oise-0.046
anda-0.045
mun-0.043
ny-0.043
ll-0.042
Token always
Token ID1464
Feature activation-2.71e+00

Pos logit contributions
 yours+0.063
 itself+0.061
 electricity+0.060
ores+0.060
 its+0.059
Neg logit contributions
's-0.067
mun-0.065
my-0.064
ll-0.063
m-0.060
Token wanting
Token ID10291
Feature activation-8.81e+00

Pos logit contributions
 its+0.054
 yours+0.053
 our+0.052
 theirs+0.050
 itself+0.050
Neg logit contributions
oise-0.048
mun-0.045
my-0.045
ny-0.045
anda-0.045
Token more
Token ID517
Feature activation-3.51e+00

Pos logit contributions
 theirs+0.147
 yours+0.147
 its+0.142
 yourselves+0.140
 electricity+0.138
Neg logit contributions
's-0.162
my-0.152
s-0.141
ie-0.139
y-0.137
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
 its+0.060
 yours+0.059
 our+0.057
 itself+0.057
 such+0.055
Neg logit contributions
anda-0.067
oise-0.066
mun-0.066
my-0.065
y-0.064
Token The
Token ID383
Feature activation+2.89e+00

Pos logit contributions
 its+0.006
 yours+0.006
 our+0.006
 such+0.006
 your+0.006
Neg logit contributions
oise-0.008
anda-0.008
mun-0.008
ny-0.007
ony-0.007
Token end
Token ID886
Feature activation-4.28e-01

Pos logit contributions
oise+0.054
anda+0.052
mun+0.051
ny+0.049
ll+0.049
Neg logit contributions
 its-0.057
 yours-0.056
 our-0.055
 such-0.053
 your-0.053
Token.
Token ID13
Feature activation-6.61e-01

Pos logit contributions
 yours+0.008
 its+0.008
 our+0.008
 such+0.007
 your+0.007
Neg logit contributions
oise-0.008
anda-0.008
mun-0.008
ny-0.008
my-0.008
Token she
Token ID673
Feature activation+3.60e+00

Pos logit contributions
 yours+0.040
 our+0.038
 its+0.038
 theirs+0.037
 itself+0.036
Neg logit contributions
oise-0.044
mun-0.044
anda-0.043
my-0.043
's-0.041
Token dared
Token ID28765
Feature activation-5.88e+00

Pos logit contributions
oise+0.061
mun+0.056
pit+0.054
anda+0.053
ny+0.052
Neg logit contributions
 its-0.078
 such-0.074
 yours-0.073
 our-0.073
 your-0.072
Token to
Token ID284
Feature activation-4.10e+00

Pos logit contributions
 its+0.070
 yours+0.070
 our+0.068
 theirs+0.064
 such+0.062
Neg logit contributions
oise-0.146
anda-0.145
my-0.142
ny-0.140
ony-0.140
Token try
Token ID1949
Feature activation-6.92e+00

Pos logit contributions
 its+0.076
 yours+0.075
 our+0.074
 such+0.071
 your+0.070
Neg logit contributions
oise-0.080
anda-0.078
mun-0.075
ny-0.074
ll-0.073
Token to
Token ID284
Feature activation-5.47e+00

Pos logit contributions
 yours+0.082
 its+0.076
 our+0.075
 theirs+0.075
 electricity+0.074
Neg logit contributions
my-0.166
oise-0.166
's-0.162
anda-0.162
mun-0.162
Token reach
Token ID3151
Feature activation-8.80e+00

Pos logit contributions
 its+0.104
 yours+0.103
 our+0.101
 such+0.097
 your+0.097
Neg logit contributions
oise-0.104
anda-0.101
mun-0.097
ny-0.095
ll-0.094
Token it
Token ID340
Feature activation-1.55e+00

Pos logit contributions
 yours+0.078
 its+0.074
 our+0.072
 theirs+0.072
 electricity+0.071
Neg logit contributions
oise-0.231
my-0.230
anda-0.226
ony-0.224
ches-0.221
Token.
Token ID13
Feature activation-5.52e-01

Pos logit contributions
 its+0.027
 yours+0.027
 our+0.026
 such+0.025
 your+0.025
Neg logit contributions
oise-0.032
anda-0.031
mun-0.031
ny-0.029
ll-0.029
Token She
Token ID1375
Feature activation+3.80e+00

Pos logit contributions
 its+0.009
 yours+0.009
 our+0.008
 your+0.008
 such+0.008
Neg logit contributions
oise-0.013
anda-0.012
mun-0.012
ll-0.011
ny-0.011
Token tried
Token ID3088
Feature activation-6.33e+00

Pos logit contributions
oise+0.065
anda+0.061
mun+0.061
m+0.058
ny+0.058
Neg logit contributions
 its-0.084
 an-0.074
 it-0.074
 itself-0.073
 the-0.070
Token and
Token ID290
Feature activation-1.70e+00

Pos logit contributions
 yours+0.090
 its+0.089
 our+0.087
 such+0.081
 theirs+0.081
Neg logit contributions
oise-0.149
anda-0.145
mun-0.142
ny-0.140
ll-0.140
Token that
Token ID326
Feature activation-2.07e+00

Pos logit contributions
oise+0.083
anda+0.080
mun+0.078
ny+0.076
ll+0.075
Neg logit contributions
 its-0.088
 yours-0.088
 our-0.085
 such-0.082
 your-0.082
Token it
Token ID340
Feature activation+2.21e+00

Pos logit contributions
 yours+0.033
 its+0.032
 our+0.031
 theirs+0.030
 such+0.030
Neg logit contributions
oise-0.044
anda-0.043
mun-0.043
my-0.042
ny-0.041
Token would
Token ID561
Feature activation-4.43e+00

Pos logit contributions
oise+0.032
anda+0.030
mun+0.029
ony+0.028
ny+0.027
Neg logit contributions
 its-0.054
 yours-0.053
 our-0.053
 your-0.051
 such-0.051
Token have
Token ID423
Feature activation-6.37e+00

Pos logit contributions
 yours+0.083
 its+0.081
 our+0.080
 such+0.079
 theirs+0.079
Neg logit contributions
oise-0.074
my-0.074
ny-0.073
anda-0.071
y-0.070
Token to
Token ID284
Feature activation-5.70e+00

Pos logit contributions
 yourselves+0.074
 yours+0.071
 theirs+0.066
 its+0.065
 our+0.065
Neg logit contributions
's-0.153
my-0.148
y-0.144
ny-0.142
anda-0.142
Token delay
Token ID5711
Feature activation-8.78e+00

Pos logit contributions
 yours+0.108
 its+0.106
 our+0.104
 such+0.102
 theirs+0.101
Neg logit contributions
oise-0.099
my-0.094
ny-0.093
anda-0.093
mun-0.092
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
 yours+0.129
 its+0.119
 theirs+0.118
 electricity+0.117
 such+0.114
Neg logit contributions
my-0.175
's-0.172
oise-0.167
mun-0.166
y-0.166
Token
Token ID198
Feature activation-9.19e-01

Pos logit contributions
 its+0.008
 yours+0.008
 our+0.008
 your+0.008
 such+0.008
Neg logit contributions
oise-0.012
anda-0.012
mun-0.011
ll-0.011
ny-0.011
Token
Token ID198
Feature activation-8.34e-01

Pos logit contributions
 its+0.015
 yours+0.014
 our+0.014
 itself+0.013
 your+0.013
Neg logit contributions
oise-0.021
anda-0.020
mun-0.020
ny-0.019
ony-0.019
TokenBilly
Token ID49640
Feature activation+1.07e+01

Pos logit contributions
 its+0.015
 our+0.014
 yours+0.014
 your+0.014
 itself+0.014
Neg logit contributions
anda-0.017
oise-0.017
mun-0.016
ony-0.016
ll-0.016
Token was
Token ID373
Feature activation-1.72e+00

Pos logit contributions
oise+0.173
anda+0.167
mun+0.153
ouse+0.150
ony+0.149
Neg logit contributions
 its-0.234
 the-0.219
 your-0.216
 our-0.214
 such-0.212
Token to
Token ID284
Feature activation-5.07e+00

Pos logit contributions
 yours+0.063
 its+0.061
 our+0.060
 electricity+0.059
 itself+0.058
Neg logit contributions
ll-0.103
oise-0.102
my-0.102
anda-0.100
's-0.100
Token help
Token ID1037
Feature activation-4.95e+00

Pos logit contributions
 its+0.094
 yours+0.092
 our+0.091
 such+0.088
 your+0.087
Neg logit contributions
oise-0.100
anda-0.100
mun-0.095
ny-0.093
ll-0.092
Token the
Token ID262
Feature activation+7.10e-01

Pos logit contributions
 its+0.067
 yours+0.067
 our+0.064
 such+0.060
 theirs+0.060
Neg logit contributions
oise-0.120
anda-0.117
mun-0.114
ny-0.113
my-0.112
Token people
Token ID661
Feature activation+3.66e+00

Pos logit contributions
oise+0.014
anda+0.014
mun+0.013
ny+0.013
ll+0.013
Neg logit contributions
 its-0.013
 yours-0.013
 our-0.013
 your-0.012
 such-0.012
Token by
Token ID416
Feature activation-3.51e+00

Pos logit contributions
anda+0.076
oise+0.075
mun+0.072
ny+0.070
ony+0.070
Neg logit contributions
 its-0.064
 yours-0.062
 our-0.062
 such-0.060
 your-0.060
Token stopping
Token ID12225
Feature activation-8.76e+00

Pos logit contributions
 yours+0.050
 our+0.046
 its+0.045
 theirs+0.043
 such+0.043
Neg logit contributions
oise-0.081
mun-0.078
my-0.077
y-0.077
anda-0.075
Token them
Token ID606
Feature activation-4.81e-01

Pos logit contributions
 yours+0.140
 its+0.129
 our+0.128
 electricity+0.127
 theirs+0.126
Neg logit contributions
my-0.173
oise-0.172
y-0.170
ny-0.168
's-0.167
Token from
Token ID422
Feature activation-5.97e+00

Pos logit contributions
 its+0.008
 yours+0.008
 our+0.008
 such+0.008
 your+0.008
Neg logit contributions
oise-0.010
anda-0.010
mun-0.009
ny-0.009
ony-0.009
Token destroying
Token ID13897
Feature activation-7.15e+00

Pos logit contributions
 yours+0.068
 our+0.062
 its+0.061
 itself+0.060
 theirs+0.059
Neg logit contributions
oise-0.141
my-0.136
ny-0.131
y-0.129
mun-0.129
Token the
Token ID262
Feature activation+5.07e-01

Pos logit contributions
 yours+0.072
 its+0.066
 our+0.062
 theirs+0.062
 itself+0.058
Neg logit contributions
oise-0.185
my-0.182
y-0.180
mun-0.180
anda-0.178
Token stadium
Token ID10308
Feature activation+2.26e+00

Pos logit contributions
oise+0.010
anda+0.010
mun+0.009
ny+0.009
ll+0.009
Neg logit contributions
 its-0.009
 yours-0.009
 our-0.009
 such-0.009
 your-0.009
Token sugar
Token ID7543
Feature activation+4.90e+00

Pos logit contributions
oise+0.020
anda+0.019
mun+0.019
ny+0.018
ll+0.018
Neg logit contributions
 its-0.018
 yours-0.018
 our-0.018
 your-0.017
 such-0.017
Token if
Token ID611
Feature activation-3.95e+00

Pos logit contributions
oise+0.093
anda+0.088
mun+0.086
ny+0.081
ony+0.081
Neg logit contributions
 its-0.096
 yours-0.094
 our-0.093
 such-0.091
 your-0.090
Token she
Token ID673
Feature activation+3.45e+00

Pos logit contributions
 yours+0.060
 its+0.059
 our+0.056
 theirs+0.055
 itself+0.055
Neg logit contributions
oise-0.084
anda-0.083
mun-0.081
my-0.080
ll-0.080
Token helped
Token ID4193
Feature activation-6.47e+00

Pos logit contributions
oise+0.058
mun+0.056
anda+0.051
pit+0.049
ll+0.048
Neg logit contributions
 its-0.073
 yours-0.071
 our-0.068
 your-0.068
 such-0.067
Token him
Token ID683
Feature activation+6.69e-03

Pos logit contributions
 yours+0.070
 its+0.068
 our+0.065
 theirs+0.061
 such+0.060
Neg logit contributions
oise-0.171
anda-0.166
mun-0.164
my-0.163
ny-0.162
Token reach
Token ID3151
Feature activation-8.70e+00

Pos logit contributions
oise+0.000
anda+0.000
mun+0.000
ny+0.000
ll+0.000
Neg logit contributions
 its-0.000
 yours-0.000
 our-0.000
 such-0.000
 your-0.000
Token the
Token ID262
Feature activation+6.13e-01

Pos logit contributions
 yours+0.084
 electricity+0.082
 yourselves+0.080
 its+0.077
 theirs+0.077
Neg logit contributions
my-0.220
oise-0.212
ie-0.211
ny-0.210
y-0.209
Token highest
Token ID4511
Feature activation+1.09e+00

Pos logit contributions
oise+0.013
anda+0.012
mun+0.012
ny+0.012
ll+0.012
Neg logit contributions
 yours-0.011
 its-0.011
 our-0.010
 such-0.010
 your-0.010
Token branch
Token ID8478
Feature activation+8.34e-01

Pos logit contributions
oise+0.022
anda+0.021
mun+0.021
ll+0.020
ony+0.020
Neg logit contributions
 its-0.020
 yours-0.020
 our-0.020
 such-0.019
 your-0.019
Token of
Token ID286
Feature activation-6.50e+00

Pos logit contributions
oise+0.018
anda+0.017
mun+0.017
ony+0.016
ll+0.016
Neg logit contributions
 its-0.014
 yours-0.014
 our-0.013
 itself-0.013
 such-0.013
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
 yours+0.063
 theirs+0.058
 our+0.058
 yourselves+0.056
 electricity+0.053
Neg logit contributions
my-0.153
mun-0.149
oise-0.148
's-0.146
ony-0.143
Token need
Token ID761
Feature activation-7.54e+00

Pos logit contributions
oise+0.032
anda+0.032
mun+0.031
ony+0.030
ny+0.029
Neg logit contributions
 its-0.033
 yours-0.033
 our-0.032
 your-0.031
 such-0.031
Token a
Token ID257
Feature activation-3.56e-01

Pos logit contributions
 theirs+0.108
 its+0.099
 itself+0.099
ores+0.097
 yourselves+0.093
Neg logit contributions
my-0.153
's-0.148
m-0.136
ird-0.131
oise-0.131
Token little
Token ID1310
Feature activation+1.40e+00

Pos logit contributions
 its+0.006
 yours+0.006
 our+0.006
 such+0.005
 your+0.005
Neg logit contributions
oise-0.008
anda-0.007
mun-0.007
ny-0.007
ll-0.007
Token help
Token ID1037
Feature activation-3.43e+00

Pos logit contributions
oise+0.024
anda+0.023
mun+0.023
ny+0.022
ll+0.022
Neg logit contributions
 its-0.029
 yours-0.029
 our-0.028
 such-0.027
 your-0.027
Token to
Token ID284
Feature activation-5.84e+00

Pos logit contributions
 its+0.061
 yours+0.060
 our+0.059
 theirs+0.057
 itself+0.056
Neg logit contributions
oise-0.067
anda-0.064
my-0.064
ny-0.063
mun-0.063
Token reach
Token ID3151
Feature activation-8.69e+00

Pos logit contributions
 yours+0.103
 its+0.101
 theirs+0.099
 our+0.095
 itself+0.094
Neg logit contributions
oise-0.105
my-0.104
mun-0.101
ll-0.100
's-0.098
Token tall
Token ID7331
Feature activation-2.52e+00

Pos logit contributions
 theirs+0.080
 its+0.078
 yours+0.077
 electricity+0.074
 yourselves+0.071
Neg logit contributions
my-0.219
's-0.212
ches-0.209
oise-0.208
ie-0.208
Token things
Token ID1243
Feature activation-1.14e+00

Pos logit contributions
 yours+0.050
 its+0.049
 our+0.048
 theirs+0.047
 itself+0.046
Neg logit contributions
oise-0.044
anda-0.043
mun-0.042
ny-0.041
my-0.040
Token.
Token ID13
Feature activation-3.56e-01

Pos logit contributions
 its+0.020
 yours+0.020
 our+0.019
 theirs+0.019
 such+0.019
Neg logit contributions
oise-0.022
anda-0.021
ll-0.021
ny-0.021
my-0.021
Token<|endoftext|>
Token ID50256
Feature activation+2.95e+00

Pos logit contributions
 its+0.006
 yours+0.006
 our+0.005
 your+0.005
 such+0.005
Neg logit contributions
oise-0.008
anda-0.008
mun-0.008
ll-0.007
ony-0.007
TokenOnce
Token ID7454
Feature activation+1.90e+00

Pos logit contributions
oise+0.065
anda+0.064
mun+0.061
ony+0.060
ny+0.059
Neg logit contributions
 its-0.048
 yours-0.046
 our-0.046
 such-0.045
 your-0.044
Token you
Token ID345
Feature activation+2.16e+00

Pos logit contributions
 its+0.040
 yours+0.040
 our+0.038
 theirs+0.037
 itself+0.037
Neg logit contributions
oise-0.057
anda-0.056
mun-0.056
my-0.054
ll-0.054
Token have
Token ID423
Feature activation-3.75e+00

Pos logit contributions
oise+0.044
anda+0.044
mun+0.043
ny+0.040
ony+0.040
Neg logit contributions
 yours-0.038
 its-0.038
 our-0.036
 such-0.036
 your-0.035
Token instead
Token ID2427
Feature activation-2.37e+00

Pos logit contributions
 its+0.075
 yours+0.074
 electricity+0.072
 our+0.072
 itself+0.071
Neg logit contributions
oise-0.063
ll-0.062
mun-0.059
my-0.059
anda-0.059
Token of
Token ID286
Feature activation-3.77e+00

Pos logit contributions
 its+0.044
 yours+0.044
 our+0.043
 such+0.041
 your+0.041
Neg logit contributions
oise-0.046
anda-0.045
mun-0.043
ny-0.042
ll-0.042
Token always
Token ID1464
Feature activation-2.86e+00

Pos logit contributions
 electricity+0.057
ores+0.055
 its+0.055
 yours+0.053
 itself+0.053
Neg logit contributions
my-0.074
mun-0.072
m-0.070
's-0.070
mie-0.070
Token wanting
Token ID10291
Feature activation-8.64e+00

Pos logit contributions
 its+0.056
 yours+0.055
 our+0.054
 itself+0.052
 theirs+0.052
Neg logit contributions
oise-0.051
my-0.048
ny-0.048
mun-0.048
anda-0.047
Token more
Token ID517
Feature activation-2.96e+00

Pos logit contributions
 electricity+0.143
 its+0.141
 theirs+0.141
 yours+0.140
ores+0.138
Neg logit contributions
's-0.154
my-0.148
s-0.140
y-0.139
ie-0.136
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
 its+0.047
 yours+0.045
 our+0.045
 itself+0.043
 such+0.042
Neg logit contributions
oise-0.063
anda-0.063
mun-0.061
my-0.060
ny-0.060
Token From
Token ID3574
Feature activation-1.78e+00

Pos logit contributions
 its+0.006
 yours+0.006
 our+0.005
 such+0.005
 your+0.005
Neg logit contributions
oise-0.007
anda-0.007
mun-0.007
ny-0.006
ll-0.006
Token that
Token ID326
Feature activation+1.20e+00

Pos logit contributions
 yours+0.027
 theirs+0.026
 its+0.025
 electricity+0.025
 our+0.025
Neg logit contributions
oise-0.039
anda-0.038
my-0.037
ll-0.037
y-0.036
Token day
Token ID1110
Feature activation+1.98e-01

Pos logit contributions
oise+0.024
anda+0.023
mun+0.023
ony+0.022
ll+0.022
Neg logit contributions
 its-0.022
 yours-0.022
 our-0.022
 your-0.021
 such-0.021
Token message
Token ID3275
Feature activation+5.39e-01

Pos logit contributions
oise+0.038
anda+0.037
mun+0.036
ny+0.035
my+0.035
Neg logit contributions
 its-0.035
 yours-0.035
 our-0.034
 such-0.033
 theirs-0.032
Token and
Token ID290
Feature activation-2.11e+00

Pos logit contributions
oise+0.011
anda+0.010
mun+0.010
ny+0.010
ll+0.010
Neg logit contributions
 its-0.010
 yours-0.010
 our-0.009
 such-0.009
 your-0.009
Token hopes
Token ID7176
Feature activation-5.13e+00

Pos logit contributions
 yours+0.038
 our+0.037
 its+0.036
 theirs+0.036
 itself+0.035
Neg logit contributions
oise-0.038
mun-0.037
my-0.037
's-0.036
anda-0.036
Token it
Token ID340
Feature activation+1.56e+00

Pos logit contributions
 yours+0.071
 our+0.067
 yourselves+0.066
 theirs+0.066
 its+0.063
Neg logit contributions
's-0.110
my-0.109
ie-0.107
anda-0.107
a-0.106
Token will
Token ID481
Feature activation-5.57e+00

Pos logit contributions
oise+0.019
anda+0.018
mun+0.017
ony+0.016
ny+0.016
Neg logit contributions
 its-0.042
 yours-0.040
 our-0.040
 such-0.039
 your-0.039
Token reach
Token ID3151
Feature activation-8.63e+00

Pos logit contributions
 yours+0.106
 yourselves+0.104
 its+0.102
 theirs+0.102
 our+0.101
Neg logit contributions
my-0.089
ll-0.089
oise-0.088
ny-0.087
's-0.084
Token someone
Token ID2130
Feature activation+1.23e+00

Pos logit contributions
 electricity+0.088
 yours+0.084
 yourselves+0.084
 theirs+0.082
 its+0.079
Neg logit contributions
my-0.209
ie-0.208
's-0.202
oise-0.199
y-0.198
Token special
Token ID2041
Feature activation-2.42e+00

Pos logit contributions
oise+0.023
anda+0.022
mun+0.022
ny+0.021
ony+0.021
Neg logit contributions
 yours-0.024
 its-0.024
 our-0.024
 your-0.023
 such-0.023
Token out
Token ID503
Feature activation-3.19e+00

Pos logit contributions
 its+0.043
 yours+0.042
 our+0.041
 theirs+0.040
 electricity+0.039
Neg logit contributions
oise-0.047
anda-0.044
mun-0.043
y-0.043
ny-0.043
Token in
Token ID287
Feature activation-4.93e+00

Pos logit contributions
 yours+0.062
 its+0.060
 our+0.060
 theirs+0.059
 electricity+0.059
Neg logit contributions
oise-0.054
anda-0.051
y-0.051
my-0.050
's-0.049
Token the
Token ID262
Feature activation-1.45e+00

Pos logit contributions
 electricity+0.055
 yours+0.052
 theirs+0.049
 itself+0.047
 yourselves+0.046
Neg logit contributions
my-0.112
oise-0.111
y-0.108
's-0.106
bug-0.103
Token was
Token ID373
Feature activation-2.55e+00

Pos logit contributions
oise+0.064
mun+0.061
anda+0.060
ony+0.055
ny+0.054
Neg logit contributions
 its-0.078
 it-0.068
 your-0.067
 such-0.067
 itself-0.066
Token so
Token ID523
Feature activation-2.75e+00

Pos logit contributions
 yours+0.047
 electricity+0.045
 our+0.045
 its+0.044
 ourselves+0.044
Neg logit contributions
oise-0.047
my-0.046
anda-0.045
a-0.045
's-0.044
Token happy
Token ID3772
Feature activation-2.98e+00

Pos logit contributions
 yours+0.053
 our+0.050
 its+0.050
 theirs+0.050
 electricity+0.049
Neg logit contributions
anda-0.050
oise-0.049
a-0.049
ll-0.048
my-0.047
Token to
Token ID284
Feature activation-3.77e+00

Pos logit contributions
 yours+0.049
 its+0.048
 our+0.047
 theirs+0.046
 electricity+0.045
Neg logit contributions
oise-0.061
anda-0.060
ll-0.059
mun-0.059
y-0.059
Token have
Token ID423
Feature activation-5.11e+00

Pos logit contributions
 yours+0.078
 our+0.076
 theirs+0.075
 its+0.075
 electricity+0.075
Neg logit contributions
oise-0.061
my-0.057
ny-0.056
anda-0.056
ll-0.055
Token figured
Token ID11638
Feature activation-8.62e+00

Pos logit contributions
 yours+0.079
 yourselves+0.077
 theirs+0.076
 electricity+0.076
 ourselves+0.076
Neg logit contributions
's-0.110
my-0.103
mie-0.099
oise-0.098
a-0.097
Token out
Token ID503
Feature activation-3.58e+00

Pos logit contributions
 yourselves+0.129
 our+0.119
 its+0.119
 ourselves+0.119
 yours+0.118
Neg logit contributions
's-0.177
a-0.176
my-0.175
ches-0.174
ny-0.170
Token this
Token ID428
Feature activation+1.62e+00

Pos logit contributions
 yours+0.037
 electricity+0.036
 ourselves+0.036
 yourselves+0.036
 itself+0.035
Neg logit contributions
's-0.086
my-0.086
oise-0.085
anda-0.084
a-0.084
Token mighty
Token ID18680
Feature activation+3.50e+00

Pos logit contributions
oise+0.035
anda+0.034
mun+0.033
ny+0.033
ll+0.032
Neg logit contributions
 its-0.027
 yours-0.026
 our-0.026
 such-0.024
 your-0.024
Token trick
Token ID6908
Feature activation+1.50e+00

Pos logit contributions
oise+0.067
anda+0.065
mun+0.063
ny+0.062
ll+0.061
Neg logit contributions
 its-0.066
 yours-0.065
 our-0.064
 such-0.061
 itself-0.061
Token and
Token ID290
Feature activation-2.39e+00

Pos logit contributions
oise+0.031
anda+0.030
mun+0.029
ony+0.028
ny+0.028
Neg logit contributions
 its-0.027
 yours-0.026
 our-0.026
 such-0.025
 itself-0.025
Token,
Token ID11
Feature activation-4.52e-01

Pos logit contributions
oise+0.033
anda+0.032
mun+0.032
my+0.032
ny+0.031
Neg logit contributions
 yours-0.015
 its-0.014
 our-0.014
 itself-0.013
 such-0.013
Token not
Token ID407
Feature activation-2.58e+00

Pos logit contributions
 its+0.007
 yours+0.007
 our+0.007
 such+0.006
 itself+0.006
Neg logit contributions
oise-0.010
anda-0.010
mun-0.010
ny-0.010
ll-0.009
Token from
Token ID422
Feature activation-5.04e+00

Pos logit contributions
 its+0.035
 yours+0.034
 our+0.033
 itself+0.033
 such+0.032
Neg logit contributions
oise-0.059
my-0.057
anda-0.057
mun-0.056
y-0.055
Token having
Token ID1719
Feature activation-6.10e+00

Pos logit contributions
 itself+0.051
 its+0.049
 yours+0.046
 theirs+0.045
 such+0.045
Neg logit contributions
my-0.116
oise-0.111
anda-0.110
ie-0.108
mo-0.107
Token and
Token ID290
Feature activation-1.87e+00

Pos logit contributions
 yours+0.075
 theirs+0.069
 its+0.067
 our+0.066
 itself+0.064
Neg logit contributions
's-0.143
my-0.141
mun-0.138
ny-0.133
anda-0.133
Token keeping
Token ID5291
Feature activation-8.61e+00

Pos logit contributions
 its+0.027
 yours+0.027
 our+0.026
 itself+0.026
 theirs+0.026
Neg logit contributions
oise-0.041
anda-0.040
my-0.040
mun-0.039
y-0.039
Token.
Token ID13
Feature activation-6.96e-01

Pos logit contributions
 yours+0.102
 theirs+0.092
 its+0.082
 electricity+0.081
 our+0.080
Neg logit contributions
my-0.202
's-0.198
mun-0.197
oise-0.197
mie-0.194
Token Why
Token ID4162
Feature activation-2.28e+00

Pos logit contributions
 yours+0.011
 its+0.011
 our+0.011
 such+0.010
 your+0.010
Neg logit contributions
oise-0.015
anda-0.015
mun-0.015
ny-0.014
ll-0.014
Token don
Token ID836
Feature activation-4.69e+00

Pos logit contributions
 its+0.043
 our+0.043
 yours+0.042
 your+0.041
 such+0.040
Neg logit contributions
oise-0.046
anda-0.044
ny-0.042
mun-0.042
ony-0.042
Token't
Token ID470
Feature activation-2.65e+00

Pos logit contributions
 its+0.073
 yours+0.073
 our+0.070
 such+0.067
 theirs+0.067
Neg logit contributions
oise-0.104
anda-0.101
mun-0.099
ny-0.097
ll-0.096
Token you
Token ID345
Feature activation-1.49e+00

Pos logit contributions
 its+0.032
 yours+0.032
 our+0.030
 theirs+0.029
 such+0.029
Neg logit contributions
oise-0.068
anda-0.066
mun-0.065
ny-0.064
my-0.064
Token milk
Token ID7545
Feature activation-1.13e+00

Pos logit contributions
 its+0.073
 yours+0.072
 itself+0.070
 theirs+0.067
 such+0.066
Neg logit contributions
my-0.079
mun-0.079
's-0.078
anda-0.078
oise-0.078
Token?
Token ID30
Feature activation-1.18e+00

Pos logit contributions
 its+0.022
 yours+0.022
 our+0.021
 itself+0.020
 such+0.020
Neg logit contributions
oise-0.021
anda-0.020
ny-0.019
mun-0.019
ll-0.019
Token I
Token ID314
Feature activation+1.38e+00

Pos logit contributions
 its+0.018
 yours+0.018
 our+0.016
 theirs+0.016
 itself+0.016
Neg logit contributions
oise-0.026
anda-0.025
ny-0.025
mun-0.025
my-0.025
Token think
Token ID892
Feature activation-5.29e+00

Pos logit contributions
anda+0.030
oise+0.029
mun+0.029
ony+0.028
pit+0.027
Neg logit contributions
 its-0.022
 our-0.022
 yours-0.022
 such-0.021
 your-0.021
Token you
Token ID345
Feature activation+2.75e+00

Pos logit contributions
 its+0.078
 itself+0.076
 yours+0.076
 theirs+0.072
 electricity+0.072
Neg logit contributions
anda-0.109
's-0.109
mun-0.107
my-0.104
oise-0.104
Token deserve
Token ID10925
Feature activation-8.57e+00

Pos logit contributions
anda+0.053
oise+0.052
mun+0.051
mo+0.048
ony+0.048
Neg logit contributions
 yours-0.052
 our-0.051
 its-0.051
 such-0.050
 your-0.048
Token a
Token ID257
Feature activation+2.53e-01

Pos logit contributions
 itself+0.099
 yourselves+0.098
 theirs+0.097
 ourselves+0.089
 its+0.087
Neg logit contributions
's-0.193
my-0.184
ches-0.182
a-0.175
ony-0.171
Token treat
Token ID2190
Feature activation-3.90e-01

Pos logit contributions
oise+0.006
anda+0.006
mun+0.005
ny+0.005
ony+0.005
Neg logit contributions
 its-0.004
 yours-0.004
 our-0.004
 your-0.004
 such-0.004
Token."
Token ID526
Feature activation-2.95e-01

Pos logit contributions
 its+0.007
 yours+0.006
 our+0.006
 itself+0.006
 electricity+0.006
Neg logit contributions
oise-0.008
anda-0.008
ny-0.008
my-0.008
y-0.007
Token
Token ID198
Feature activation-5.13e-01

Pos logit contributions
 yours+0.005
 its+0.005
 our+0.005
 such+0.005
 itself+0.005
Neg logit contributions
oise-0.006
anda-0.006
mun-0.006
ny-0.005
my-0.005
Token
Token ID198
Feature activation-3.43e-01

Pos logit contributions
 yours+0.010
 its+0.009
 our+0.009
 such+0.009
 theirs+0.009
Neg logit contributions
oise-0.009
anda-0.009
mun-0.009
my-0.009
ny-0.009
Token The
Token ID383
Feature activation+1.33e+00

Pos logit contributions
oise+0.050
anda+0.049
mun+0.047
ny+0.046
ony+0.045
Neg logit contributions
 its-0.055
 yours-0.055
 our-0.054
 such-0.052
 itself-0.051
Token fish
Token ID5916
Feature activation+9.68e+00

Pos logit contributions
oise+0.024
anda+0.023
mun+0.022
ll+0.022
ony+0.022
Neg logit contributions
 its-0.027
 yours-0.026
 our-0.026
 your-0.025
 itself-0.025
Token said
Token ID531
Feature activation-2.21e+00

Pos logit contributions
oise+0.140
anda+0.133
mun+0.126
ony+0.126
ird+0.122
Neg logit contributions
 its-0.239
 our-0.225
 yours-0.217
 your-0.216
 itself-0.213
Token they
Token ID484
Feature activation+1.39e+00

Pos logit contributions
 yours+0.045
 yourselves+0.044
 electricity+0.042
 ours+0.041
 our+0.041
Neg logit contributions
a-0.037
's-0.036
anda-0.035
y-0.035
ppy-0.034
Token would
Token ID561
Feature activation-4.46e+00

Pos logit contributions
oise+0.017
anda+0.016
mun+0.015
ony+0.014
ll+0.014
Neg logit contributions
 its-0.037
 yours-0.036
 our-0.036
 such-0.035
 theirs-0.035
Token complete
Token ID1844
Feature activation-8.56e+00

Pos logit contributions
 its+0.081
 yours+0.081
 our+0.079
 such+0.076
 itself+0.075
Neg logit contributions
oise-0.086
anda-0.083
mun-0.081
ny-0.081
ll-0.079
Token the
Token ID262
Feature activation-9.80e-02

Pos logit contributions
 yours+0.073
 electricity+0.066
 yourselves+0.064
 its+0.058
 theirs+0.058
Neg logit contributions
my-0.218
's-0.218
oise-0.218
a-0.214
y-0.214
Token mission
Token ID4365
Feature activation-2.55e+00

Pos logit contributions
 its+0.002
 yours+0.002
 our+0.002
 such+0.002
 theirs+0.002
Neg logit contributions
oise-0.002
anda-0.002
mun-0.002
ny-0.002
ll-0.002
Token of
Token ID286
Feature activation-5.32e+00

Pos logit contributions
 its+0.044
 yours+0.044
 our+0.043
 such+0.041
 your+0.040
Neg logit contributions
oise-0.053
anda-0.051
mun-0.050
ny-0.049
ll-0.048
Token helping
Token ID5742
Feature activation-5.89e+00

Pos logit contributions
 yours+0.058
 yourselves+0.054
 its+0.053
 such+0.053
 electricity+0.052
Neg logit contributions
oise-0.126
mun-0.123
my-0.120
's-0.117
ll-0.116
Token Fred
Token ID8559
Feature activation+5.94e+00

Pos logit contributions
 yours+0.058
 its+0.057
 our+0.052
 such+0.051
 theirs+0.050
Neg logit contributions
oise-0.159
anda-0.154
ny-0.152
mun-0.152
my-0.152
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 yours+0.034
 its+0.034
 our+0.033
 theirs+0.032
 such+0.031
Neg logit contributions
oise-0.029
anda-0.028
ny-0.027
y-0.027
mun-0.027
Token She
Token ID1375
Feature activation+3.97e+00

Pos logit contributions
 yours+0.019
 its+0.019
 our+0.019
 theirs+0.018
 such+0.018
Neg logit contributions
oise-0.023
anda-0.022
mun-0.021
my-0.021
ny-0.021
Token promised
Token ID8072
Feature activation-4.73e+00

Pos logit contributions
oise+0.079
anda+0.073
mun+0.073
pit+0.071
ony+0.070
Neg logit contributions
 its-0.080
 our-0.072
 your-0.070
 itself-0.070
 such-0.068
Token to
Token ID284
Feature activation-4.12e+00

Pos logit contributions
 yours+0.066
 its+0.062
 theirs+0.061
 our+0.061
 electricity+0.056
Neg logit contributions
ll-0.109
my-0.108
anda-0.107
oise-0.105
mun-0.104
Token never
Token ID1239
Feature activation-2.89e+00

Pos logit contributions
 yours+0.081
 its+0.081
 our+0.079
 theirs+0.079
 such+0.077
Neg logit contributions
oise-0.071
my-0.068
ll-0.068
anda-0.068
mun-0.067
Token delay
Token ID5711
Feature activation-8.55e+00

Pos logit contributions
 its+0.054
 yours+0.054
 our+0.053
 such+0.050
 your+0.050
Neg logit contributions
oise-0.056
anda-0.054
mun-0.052
ny-0.051
ll-0.051
Token going
Token ID1016
Feature activation-6.72e+00

Pos logit contributions
 yours+0.119
 its+0.111
 theirs+0.110
 our+0.107
 electricity+0.106
Neg logit contributions
my-0.181
mun-0.178
's-0.174
oise-0.173
y-0.168
Token to
Token ID284
Feature activation-6.89e+00

Pos logit contributions
 its+0.117
 yours+0.117
 our+0.110
 itself+0.108
 such+0.108
Neg logit contributions
oise-0.127
ny-0.124
my-0.123
ll-0.120
anda-0.120
Token school
Token ID1524
Feature activation-2.70e+00

Pos logit contributions
 yours+0.088
 theirs+0.079
 its+0.075
 our+0.070
 itself+0.070
Neg logit contributions
's-0.159
mun-0.154
oise-0.153
my-0.151
ny-0.146
Token again
Token ID757
Feature activation-4.10e+00

Pos logit contributions
 its+0.051
 yours+0.050
 our+0.049
 such+0.047
 your+0.047
Neg logit contributions
oise-0.052
anda-0.050
mun-0.049
ny-0.048
ll-0.047
Token and
Token ID290
Feature activation-3.61e+00

Pos logit contributions
 yours+0.085
 its+0.082
 our+0.081
 theirs+0.080
 electricity+0.078
Neg logit contributions
oise-0.070
my-0.064
mun-0.064
ny-0.064
anda-0.064
Token to
Token ID284
Feature activation-4.97e+00

Pos logit contributions
 yours+0.060
 its+0.060
 theirs+0.056
 our+0.055
 such+0.054
Neg logit contributions
ny-0.140
oise-0.139
my-0.139
anda-0.136
's-0.135
Token you
Token ID345
Feature activation-9.71e-01

Pos logit contributions
 its+0.054
 yours+0.053
 theirs+0.049
 electricity+0.049
 itself+0.048
Neg logit contributions
my-0.123
oise-0.121
's-0.120
mun-0.119
ll-0.119
Token instead
Token ID2427
Feature activation-2.45e+00

Pos logit contributions
 yours+0.019
 its+0.019
 our+0.018
 theirs+0.018
 such+0.018
Neg logit contributions
oise-0.017
anda-0.016
my-0.016
ny-0.016
ll-0.016
Token of
Token ID286
Feature activation-4.19e+00

Pos logit contributions
 its+0.047
 yours+0.047
 our+0.046
 such+0.044
 theirs+0.044
Neg logit contributions
oise-0.045
anda-0.044
mun-0.042
ny-0.042
my-0.042
Token always
Token ID1464
Feature activation-3.29e+00

Pos logit contributions
ores+0.069
 yours+0.068
 outer+0.066
 theirs+0.066
 itself+0.066
Neg logit contributions
's-0.073
my-0.070
mun-0.066
ll-0.063
m-0.062
Token wanting
Token ID10291
Feature activation-8.55e+00

Pos logit contributions
 its+0.067
 yours+0.067
 our+0.064
 theirs+0.064
 itself+0.063
Neg logit contributions
oise-0.054
ny-0.053
my-0.053
mun-0.052
's-0.052
Token something
Token ID1223
Feature activation-1.75e+00

Pos logit contributions
 theirs+0.129
 yours+0.126
 yourselves+0.122
 its+0.121
ores+0.119
Neg logit contributions
's-0.174
my-0.163
s-0.158
ny-0.154
y-0.151
Token else
Token ID2073
Feature activation-5.87e-01

Pos logit contributions
 its+0.025
 yours+0.024
 itself+0.023
 theirs+0.023
 yourselves+0.023
Neg logit contributions
anda-0.039
ll-0.038
's-0.038
mun-0.037
oise-0.037
Token.
Token ID13
Feature activation+2.60e-01

Pos logit contributions
 its+0.011
 yours+0.010
 our+0.010
 such+0.010
 itself+0.010
Neg logit contributions
oise-0.011
anda-0.011
ll-0.011
mun-0.011
my-0.011
Token
Token ID220
Feature activation-1.18e+00

Pos logit contributions
oise+0.006
anda+0.006
mun+0.006
ony+0.006
y+0.006
Neg logit contributions
 its-0.004
 yours-0.004
 our-0.004
 such-0.004
 your-0.004
Token
Token ID198
Feature activation-5.06e-01

Pos logit contributions
 its+0.019
 yours+0.019
 our+0.019
 your+0.018
 such+0.018
Neg logit contributions
oise-0.026
anda-0.025
mun-0.024
ny-0.024
ony-0.024
Token�
Token ID241
Feature activation+3.64e-01

Pos logit contributions
oise+0.025
anda+0.025
ny+0.024
mun+0.023
ony+0.023
Neg logit contributions
 its-0.034
 yours-0.034
 our-0.034
 theirs-0.032
 such-0.032
TokenWe
Token ID1135
Feature activation+1.27e+00

Pos logit contributions
oise+0.008
anda+0.007
mun+0.007
ny+0.007
my+0.007
Neg logit contributions
 its-0.006
 yours-0.006
 our-0.006
 such-0.006
 theirs-0.006
Token are
Token ID389
Feature activation-1.98e+00

Pos logit contributions
anda+0.023
oise+0.023
mun+0.023
ll+0.022
ony+0.022
Neg logit contributions
 its-0.025
 yours-0.024
 our-0.024
 your-0.023
 such-0.023
Token going
Token ID1016
Feature activation-4.64e+00

Pos logit contributions
 its+0.032
 yours+0.031
 our+0.030
 itself+0.030
 such+0.029
Neg logit contributions
oise-0.043
anda-0.042
mun-0.041
ny-0.040
my-0.040
Token to
Token ID284
Feature activation-6.00e+00

Pos logit contributions
 its+0.065
 yours+0.062
 itself+0.060
 theirs+0.057
 our+0.057
Neg logit contributions
oise-0.110
anda-0.106
ny-0.104
ll-0.104
my-0.103
Token reach
Token ID3151
Feature activation-8.54e+00

Pos logit contributions
 yours+0.102
 its+0.101
 theirs+0.095
 itself+0.094
 our+0.092
Neg logit contributions
oise-0.118
my-0.111
ny-0.110
anda-0.110
mun-0.109
Token the
Token ID262
Feature activation+6.70e-01

Pos logit contributions
 its+0.066
 yours+0.064
 itself+0.058
 theirs+0.057
 electricity+0.055
Neg logit contributions
oise-0.234
anda-0.231
my-0.230
's-0.228
a-0.227
Token beach
Token ID10481
Feature activation+6.43e-01

Pos logit contributions
oise+0.013
anda+0.013
mun+0.013
ll+0.012
ny+0.012
Neg logit contributions
 yours-0.012
 its-0.012
 our-0.012
 your-0.011
 such-0.011
Token and
Token ID290
Feature activation-2.34e+00

Pos logit contributions
oise+0.014
anda+0.013
mun+0.013
ny+0.013
ony+0.013
Neg logit contributions
 its-0.011
 yours-0.011
 our-0.010
 your-0.010
 such-0.010
Token have
Token ID423
Feature activation-6.17e+00

Pos logit contributions
 its+0.039
 yours+0.039
 our+0.039
 your+0.037
 such+0.036
Neg logit contributions
oise-0.050
anda-0.049
mun-0.048
ny-0.047
ll-0.046
Token so
Token ID523
Feature activation-3.27e+00

Pos logit contributions
 yours+0.067
 its+0.065
 yourselves+0.064
 theirs+0.062
 electricity+0.062
Neg logit contributions
's-0.153
my-0.146
a-0.143
ny-0.139
oise-0.139
Tokenmy
Token ID1820
Feature activation+5.29e+00

Pos logit contributions
oise+0.035
mun+0.030
anda+0.029
ony+0.026
ny+0.022
Neg logit contributions
 yours-0.214
 its-0.212
 our-0.212
 your-0.208
 such-0.206
Token!"
Token ID2474
Feature activation-1.17e-01

Pos logit contributions
oise+0.119
anda+0.115
mun+0.113
ony+0.110
ny+0.110
Neg logit contributions
 its-0.082
 yours-0.082
 our-0.080
 such-0.076
 your-0.076
Token
Token ID220
Feature activation-8.81e-01

Pos logit contributions
 its+0.002
 our+0.002
 yours+0.002
 your+0.002
 itself+0.002
Neg logit contributions
oise-0.003
anda-0.002
ny-0.002
mun-0.002
ony-0.002
Token
Token ID198
Feature activation-7.36e-01

Pos logit contributions
 its+0.015
 yours+0.015
 our+0.015
 such+0.014
 your+0.014
Neg logit contributions
oise-0.019
anda-0.018
mun-0.018
ny-0.017
ony-0.017
Token
Token ID198
Feature activation-7.85e-01

Pos logit contributions
 its+0.013
 yours+0.013
 our+0.013
 such+0.012
 your+0.012
Neg logit contributions
oise-0.015
anda-0.015
mun-0.014
ny-0.014
ll-0.014
TokenMom
Token ID29252
Feature activation+9.00e+00

Pos logit contributions
 its+0.015
 yours+0.014
 our+0.014
 your+0.014
 itself+0.013
Neg logit contributions
oise-0.016
anda-0.015
mun-0.015
ny-0.014
ony-0.014
Tokenmy
Token ID1820
Feature activation+8.91e+00

Pos logit contributions
oise+0.082
mun+0.074
anda+0.064
ony+0.060
ny+0.057
Neg logit contributions
 its-0.261
 our-0.251
 your-0.247
 such-0.245
 yours-0.245
Token smiled
Token ID13541
Feature activation-4.73e-02

Pos logit contributions
oise+0.158
mun+0.141
anda+0.138
ny+0.134
ony+0.131
Neg logit contributions
 its-0.194
 our-0.178
 itself-0.172
 the-0.171
 your-0.170
Token back
Token ID736
Feature activation+1.58e+00

Pos logit contributions
 yours+0.001
 our+0.001
 electricity+0.001
 its+0.001
 yourselves+0.001
Neg logit contributions
oise-0.001
y-0.001
anda-0.001
ny-0.001
my-0.001
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
oise+0.026
anda+0.025
mun+0.025
ny+0.024
ll+0.023
Neg logit contributions
 its-0.034
 yours-0.033
 our-0.033
 your-0.032
 such-0.032
Token "
Token ID366
Feature activation+2.77e-02

Pos logit contributions
 yours+0.002
 our+0.002
 its+0.002
 theirs+0.002
 such+0.002
Neg logit contributions
oise-0.003
anda-0.003
my-0.003
mun-0.003
ny-0.003
Token.
Token ID13
Feature activation-1.40e+00

Pos logit contributions
 its+0.013
 yours+0.012
 itself+0.012
 electricity+0.012
 our+0.012
Neg logit contributions
anda-0.013
oise-0.012
ll-0.012
ny-0.012
my-0.012
Token It
Token ID632
Feature activation+2.12e+00

Pos logit contributions
 yours+0.020
 its+0.020
 such+0.020
 purpose+0.019
 theirs+0.018
Neg logit contributions
anda-0.030
's-0.029
ny-0.029
oise-0.029
my-0.028
Token is
Token ID318
Feature activation-4.09e+00

Pos logit contributions
oise+0.046
mun+0.044
alia+0.042
pit+0.041
enny+0.041
Neg logit contributions
 our-0.042
 its-0.038
 your-0.038
 yours-0.038
 my-0.036
Token dangerous
Token ID4923
Feature activation-2.54e+00

Pos logit contributions
 ourselves+0.047
 itself+0.047
 yourselves+0.047
 purpose+0.046
 such+0.046
Neg logit contributions
a-0.089
ny-0.089
's-0.089
anda-0.088
my-0.088
Token."
Token ID526
Feature activation+4.00e-01

Pos logit contributions
 its+0.050
 yours+0.048
 our+0.047
 such+0.046
 itself+0.046
Neg logit contributions
ny-0.044
anda-0.043
y-0.043
oise-0.043
my-0.042
Token Sue
Token ID26089
Feature activation+1.21e+01

Pos logit contributions
oise+0.009
anda+0.009
mun+0.009
ny+0.009
my+0.009
Neg logit contributions
 yours-0.006
 its-0.006
 our-0.006
 such-0.005
 theirs-0.005
Token says
Token ID1139
Feature activation+2.30e-01

Pos logit contributions
ouse+0.339
oise+0.338
pp+0.317
ird+0.308
any+0.300
Neg logit contributions
 the-0.168
 your-0.163
 our-0.156
 its-0.156
 my-0.149
Token.
Token ID13
Feature activation-9.40e-01

Pos logit contributions
oise+0.005
anda+0.004
mun+0.004
ny+0.004
ll+0.004
Neg logit contributions
 its-0.004
 yours-0.004
 our-0.004
 such-0.004
 your-0.004
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
 yours+0.018
 our+0.017
 its+0.017
 such+0.016
 theirs+0.016
Neg logit contributions
oise-0.017
anda-0.017
mun-0.017
ny-0.017
my-0.017
Token
Token ID198
Feature activation-6.25e-01

Pos logit contributions
 yours+0.021
 its+0.021
 our+0.020
 such+0.020
 theirs+0.020
Neg logit contributions
oise-0.020
anda-0.019
ll-0.019
mun-0.019
my-0.019
TokenBut
Token ID1537
Feature activation+5.76e-02

Pos logit contributions
 its+0.012
 yours+0.011
 our+0.011
 such+0.011
 your+0.011
Neg logit contributions
oise-0.012
anda-0.012
mun-0.012
ny-0.011
ll-0.011
Token folded
Token ID24650
Feature activation-5.01e+00

Pos logit contributions
oise+0.011
anda+0.010
mun+0.010
ll+0.009
ny+0.009
Neg logit contributions
 its-0.020
 our-0.019
 yours-0.019
 itself-0.018
 your-0.018
Token his
Token ID465
Feature activation+1.87e+00

Pos logit contributions
 yours+0.067
 theirs+0.057
 our+0.056
 yourselves+0.055
 its+0.053
Neg logit contributions
ony-0.118
my-0.118
's-0.116
y-0.116
oise-0.116
Token hands
Token ID2832
Feature activation+1.08e+00

Pos logit contributions
oise+0.037
anda+0.036
mun+0.035
ny+0.035
ony+0.034
Neg logit contributions
 yours-0.033
 its-0.033
 our-0.032
 theirs-0.031
 such-0.031
Token together
Token ID1978
Feature activation-1.54e+00

Pos logit contributions
oise+0.022
anda+0.021
mun+0.021
ny+0.020
ll+0.020
Neg logit contributions
 its-0.019
 yours-0.019
 our-0.018
 such-0.018
 your-0.017
Token.
Token ID13
Feature activation-6.38e-01

Pos logit contributions
 yours+0.030
 our+0.029
 its+0.029
 your+0.028
 yourselves+0.028
Neg logit contributions
oise-0.027
anda-0.026
ny-0.026
mun-0.026
my-0.026
Token Mama
Token ID40084
Feature activation+1.02e+01

Pos logit contributions
 its+0.010
 yours+0.009
 our+0.009
 such+0.009
 your+0.009
Neg logit contributions
oise-0.015
anda-0.014
mun-0.014
ny-0.014
ll-0.014
Token closed
Token ID4838
Feature activation-4.55e+00

Pos logit contributions
oise+0.170
mun+0.158
anda+0.148
ll+0.141
ony+0.137
Neg logit contributions
 its-0.226
 our-0.207
 the-0.206
 it-0.206
 such-0.201
Token her
Token ID607
Feature activation+2.97e+00

Pos logit contributions
 yours+0.063
 theirs+0.050
 ourselves+0.050
 yourselves+0.050
 ours+0.050
Neg logit contributions
y-0.108
my-0.105
ie-0.105
ony-0.103
's-0.103
Token eyes
Token ID2951
Feature activation+1.07e+00

Pos logit contributions
oise+0.073
anda+0.072
mun+0.070
ny+0.069
ony+0.069
Neg logit contributions
 yours-0.039
 its-0.039
 our-0.037
 such-0.035
 theirs-0.035
Token and
Token ID290
Feature activation+2.90e-01

Pos logit contributions
oise+0.014
anda+0.013
ny+0.013
mun+0.012
my+0.012
Neg logit contributions
 yours-0.027
 its-0.026
 our-0.026
 your-0.025
 theirs-0.025
Token said
Token ID531
Feature activation-2.31e+00

Pos logit contributions
oise+0.004
anda+0.003
mun+0.003
ny+0.003
ll+0.003
Neg logit contributions
 its-0.008
 our-0.007
 yours-0.007
 such-0.007
 your-0.007
Tokenasers
Token ID19865
Feature activation+3.39e+00

Pos logit contributions
ny+0.069
mun+0.069
oise+0.068
anda+0.067
bi+0.064
Neg logit contributions
 yours-0.123
 such-0.118
 its-0.116
 your-0.114
 our-0.113
Token to
Token ID284
Feature activation-5.19e+00

Pos logit contributions
oise+0.070
anda+0.068
mun+0.066
ny+0.065
ll+0.064
Neg logit contributions
 its-0.058
 yours-0.057
 our-0.056
 such-0.053
 itself-0.053
Token sell
Token ID3677
Feature activation-5.08e+00

Pos logit contributions
 yours+0.098
 its+0.098
 our+0.097
 such+0.093
 electricity+0.093
Neg logit contributions
oise-0.089
my-0.086
anda-0.084
ny-0.084
ony-0.082
Token.
Token ID13
Feature activation-1.40e+00

Pos logit contributions
 its+0.080
 yours+0.078
 our+0.078
 electricity+0.075
 itself+0.074
Neg logit contributions
oise-0.104
my-0.102
ny-0.101
anda-0.101
y-0.100
Token The
Token ID383
Feature activation+2.94e+00

Pos logit contributions
 its+0.020
 yours+0.020
 our+0.019
 such+0.018
 your+0.018
Neg logit contributions
oise-0.033
anda-0.032
mun-0.032
ny-0.031
ll-0.031
Token man
Token ID582
Feature activation+1.10e+01

Pos logit contributions
oise+0.051
anda+0.048
ches+0.047
ll+0.047
mun+0.047
Neg logit contributions
 yours-0.060
 its-0.059
 our-0.058
 your-0.058
 such-0.057
Token felt
Token ID2936
Feature activation-3.83e+00

Pos logit contributions
oise+0.193
 pads+0.182
ppy+0.179
 pad+0.178
mun+0.176
Neg logit contributions
 its-0.224
 an-0.202
 the-0.200
 such-0.197
 it-0.196
Token more
Token ID517
Feature activation-1.81e+00

Pos logit contributions
 yours+0.084
 our+0.079
 ourselves+0.078
 yourselves+0.076
 its+0.075
Neg logit contributions
ny-0.058
mun-0.057
's-0.057
oise-0.056
anda-0.055
Token and
Token ID290
Feature activation-8.36e-01

Pos logit contributions
 its+0.033
 yours+0.032
 our+0.032
 such+0.031
 your+0.030
Neg logit contributions
oise-0.036
anda-0.035
mun-0.034
ll-0.033
ny-0.033
Token more
Token ID517
Feature activation-1.88e+00

Pos logit contributions
 its+0.017
 yours+0.016
 our+0.016
 such+0.015
 your+0.015
Neg logit contributions
oise-0.016
anda-0.015
mun-0.014
ny-0.014
ll-0.014
Token guilty
Token ID6717
Feature activation-2.05e+00

Pos logit contributions
 its+0.034
 yours+0.034
 our+0.033
 such+0.031
 your+0.031
Neg logit contributions
oise-0.038
anda-0.037
mun-0.036
ny-0.035
ll-0.035
Token tough
Token ID5802
Feature activation-1.27e+00

Pos logit contributions
 yourselves+0.045
 yours+0.044
 ourselves+0.044
 our+0.042
 itself+0.041
Neg logit contributions
ny-0.047
my-0.045
y-0.044
's-0.043
oise-0.042
Token and
Token ID290
Feature activation-6.69e-01

Pos logit contributions
 its+0.027
 yours+0.027
 our+0.026
 such+0.025
 your+0.025
Neg logit contributions
oise-0.021
anda-0.020
ny-0.020
mun-0.019
my-0.019
Token scary
Token ID14343
Feature activation-2.35e-01

Pos logit contributions
 yours+0.011
 yourselves+0.010
 such+0.010
 itself+0.010
 ourselves+0.010
Neg logit contributions
anda-0.013
oise-0.013
ny-0.013
my-0.013
y-0.012
Token.
Token ID13
Feature activation-7.63e-03

Pos logit contributions
 yours+0.005
 its+0.005
 our+0.005
 your+0.004
 theirs+0.004
Neg logit contributions
oise-0.004
anda-0.004
y-0.004
ny-0.004
my-0.004
Token The
Token ID383
Feature activation+3.58e+00

Pos logit contributions
 yours+0.000
 its+0.000
 theirs+0.000
 our+0.000
 itself+0.000
Neg logit contributions
anda-0.000
ny-0.000
oise-0.000
mun-0.000
my-0.000
Token dogs
Token ID6844
Feature activation+9.74e+00

Pos logit contributions
oise+0.067
anda+0.065
mun+0.064
ny+0.063
my+0.061
Neg logit contributions
 yours-0.069
 its-0.069
 our-0.067
 such-0.064
 your-0.064
Token got
Token ID1392
Feature activation-3.59e+00

Pos logit contributions
oise+0.175
anda+0.174
ony+0.163
alia+0.156
mo+0.155
Neg logit contributions
 its-0.210
 their-0.189
 it-0.184
 our-0.184
 such-0.182
Token very
Token ID845
Feature activation-1.80e+00

Pos logit contributions
 its+0.070
 yours+0.069
 our+0.068
 such+0.065
 your+0.065
Neg logit contributions
oise-0.067
anda-0.065
mun-0.063
ny-0.061
ll-0.061
Token scared
Token ID12008
Feature activation-3.78e+00

Pos logit contributions
 its+0.035
 yours+0.034
 our+0.034
 such+0.032
 your+0.032
Neg logit contributions
oise-0.034
anda-0.033
mun-0.032
ny-0.031
ll-0.031
Token and
Token ID290
Feature activation+1.91e-01

Pos logit contributions
 yours+0.097
 its+0.091
 our+0.090
 theirs+0.090
 yourselves+0.089
Neg logit contributions
y-0.046
my-0.044
ny-0.044
oise-0.043
anda-0.042
Token stopped
Token ID5025
Feature activation-5.49e+00

Pos logit contributions
oise+0.002
anda+0.002
mun+0.002
ny+0.002
ll+0.002
Neg logit contributions
 yours-0.005
 its-0.005
 our-0.005
 such-0.005
 itself-0.005
Token!"
Token ID2474
Feature activation+6.45e-01

Pos logit contributions
 its+0.045
 yours+0.044
 our+0.043
 theirs+0.043
 such+0.043
Neg logit contributions
oise-0.040
anda-0.040
my-0.039
mun-0.038
ny-0.038
Token Lily
Token ID20037
Feature activation+1.05e+01

Pos logit contributions
my+0.013
anda+0.012
mun+0.012
's+0.012
ny+0.012
Neg logit contributions
 yours-0.011
 theirs-0.010
 itself-0.010
 our-0.010
 such-0.010
Token said
Token ID531
Feature activation-3.14e-01

Pos logit contributions
oise+0.246
mo+0.229
anda+0.215
bi+0.211
sie+0.201
Neg logit contributions
 its-0.182
 an-0.175
 the-0.171
 a-0.170
 it-0.167
Token.
Token ID13
Feature activation-9.46e-01

Pos logit contributions
 yours+0.006
 its+0.006
 our+0.006
 such+0.006
 your+0.006
Neg logit contributions
oise-0.006
anda-0.006
mun-0.006
ny-0.005
my-0.005
Token "
Token ID366
Feature activation-6.48e-01

Pos logit contributions
 yours+0.016
 our+0.014
 theirs+0.014
 its+0.014
 such+0.014
Neg logit contributions
my-0.020
ny-0.020
oise-0.020
anda-0.020
mun-0.020
TokenThe
Token ID464
Feature activation+3.05e+00

Pos logit contributions
 its+0.010
 yours+0.010
 our+0.010
 such+0.010
 your+0.010
Neg logit contributions
oise-0.014
anda-0.014
mun-0.014
ny-0.013
ll-0.013
Token t
Token ID256
Feature activation+6.84e+00

Pos logit contributions
oise+0.042
anda+0.041
mun+0.039
ny+0.039
my+0.039
Neg logit contributions
 its-0.073
 yours-0.072
 our-0.071
 theirs-0.069
 such-0.069
Tokeneddy
Token ID21874
Feature activation+1.11e+01

Pos logit contributions
mun+0.159
oise+0.159
anda+0.146
ny+0.146
ony+0.144
Neg logit contributions
 yours-0.100
 its-0.097
 our-0.094
 your-0.090
 yourselves-0.088
Token bear
Token ID6842
Feature activation+9.52e+00

Pos logit contributions
oise+0.178
anda+0.168
mun+0.166
ird+0.166
cases+0.164
Neg logit contributions
 its-0.253
 our-0.246
 your-0.242
 yours-0.234
 my-0.233
Token is
Token ID318
Feature activation-2.58e+00

Pos logit contributions
oise+0.183
mun+0.173
anda+0.164
ony+0.160
mo+0.159
Neg logit contributions
 our-0.187
 its-0.186
 yours-0.185
 your-0.181
 such-0.171
Token nice
Token ID3621
Feature activation-1.16e+00

Pos logit contributions
 ourselves+0.042
 itself+0.042
 outer+0.040
 its+0.039
 electricity+0.039
Neg logit contributions
my-0.048
's-0.047
anda-0.046
ny-0.045
a-0.045
Token
Token ID198
Feature activation-4.55e-02

Pos logit contributions
 its+0.007
 yours+0.006
 our+0.006
 itself+0.006
 such+0.006
Neg logit contributions
oise-0.009
anda-0.008
mun-0.008
ny-0.008
ony-0.008
Token
Token ID198
Feature activation+7.23e-02

Pos logit contributions
 its+0.001
 yours+0.001
 our+0.001
 your+0.001
 such+0.001
Neg logit contributions
oise-0.001
anda-0.001
mun-0.001
ony-0.001
ll-0.001
TokenIt
Token ID1026
Feature activation+2.05e+00

Pos logit contributions
oise+0.001
anda+0.001
mun+0.001
ll+0.001
ony+0.001
Neg logit contributions
 its-0.001
 our-0.001
 yours-0.001
 your-0.001
 itself-0.001
Token was
Token ID373
Feature activation-3.88e+00

Pos logit contributions
oise+0.032
anda+0.030
mun+0.028
ll+0.028
ird+0.028
Neg logit contributions
 its-0.052
 itself-0.047
 our-0.046
 such-0.045
 it-0.045
Token a
Token ID257
Feature activation+2.10e+00

Pos logit contributions
 yours+0.050
 ourselves+0.046
 our+0.044
 yourselves+0.043
 ours+0.039
Neg logit contributions
my-0.092
ny-0.088
's-0.088
a-0.088
y-0.088
Token mushroom
Token ID28520
Feature activation+8.23e+00

Pos logit contributions
oise+0.049
anda+0.047
mun+0.047
ll+0.046
ony+0.045
Neg logit contributions
 its-0.032
 our-0.030
 yours-0.030
 your-0.029
 such-0.028
Token!
Token ID0
Feature activation-6.34e-01

Pos logit contributions
oise+0.204
mun+0.196
anda+0.193
ll+0.190
mo+0.184
Neg logit contributions
 its-0.117
 yours-0.106
 such-0.104
 our-0.103
 your-0.100
Token It
Token ID632
Feature activation+2.56e+00

Pos logit contributions
 yours+0.009
 our+0.008
 its+0.008
 theirs+0.008
 ourselves+0.007
Neg logit contributions
ny-0.015
oise-0.015
anda-0.015
's-0.015
my-0.015
Token was
Token ID373
Feature activation-3.52e+00

Pos logit contributions
oise+0.044
anda+0.039
mun+0.038
ll+0.037
ird+0.037
Neg logit contributions
 its-0.063
 itself-0.056
 such-0.055
 it-0.054
 our-0.054
Token small
Token ID1402
Feature activation+2.74e-02

Pos logit contributions
 yours+0.057
 ourselves+0.055
 yourselves+0.054
 our+0.054
 theirs+0.050
Neg logit contributions
my-0.069
y-0.068
a-0.068
's-0.067
anda-0.067
Token,
Token ID11
Feature activation-1.04e-01

Pos logit contributions
oise+0.000
anda+0.000
mun+0.000
ll+0.000
ny+0.000
Neg logit contributions
 its-0.001
 yours-0.001
 our-0.001
 such-0.001
 your-0.001
Token they
Token ID484
Feature activation+3.43e+00

Pos logit contributions
 yours+0.017
 its+0.016
 itself+0.015
 our+0.015
 theirs+0.015
Neg logit contributions
oise-0.023
mun-0.023
anda-0.023
my-0.023
ny-0.022
Token saw
Token ID2497
Feature activation-4.60e+00

Pos logit contributions
anda+0.059
oise+0.056
ony+0.055
mun+0.055
ll+0.052
Neg logit contributions
 its-0.077
 our-0.070
 their-0.069
 such-0.069
 theirs-0.067
Token some
Token ID617
Feature activation+2.02e-01

Pos logit contributions
 yours+0.063
 our+0.051
 yourselves+0.049
 ourselves+0.048
 your+0.048
Neg logit contributions
mun-0.111
anda-0.110
's-0.108
my-0.107
ny-0.106
Token big
Token ID1263
Feature activation+3.87e-01

Pos logit contributions
oise+0.004
anda+0.004
mun+0.004
ny+0.004
ll+0.004
Neg logit contributions
 its-0.003
 yours-0.003
 our-0.003
 such-0.003
 your-0.003
Token red
Token ID2266
Feature activation+6.51e-01

Pos logit contributions
oise+0.008
anda+0.007
mun+0.007
ny+0.007
my+0.007
Neg logit contributions
 yours-0.007
 its-0.007
 our-0.007
 such-0.007
 itself-0.007
Token trucks
Token ID13960
Feature activation+5.70e+00

Pos logit contributions
oise+0.013
anda+0.013
mun+0.013
ny+0.012
y+0.012
Neg logit contributions
 yours-0.011
 its-0.011
 our-0.011
 such-0.011
 your-0.010
Token coming
Token ID2406
Feature activation-3.63e+00

Pos logit contributions
oise+0.118
anda+0.118
ony+0.110
mun+0.110
ppy+0.108
Neg logit contributions
 its-0.103
 yours-0.098
 our-0.097
 such-0.095
 theirs-0.093
Token.
Token ID13
Feature activation-9.20e-01

Pos logit contributions
 yours+0.061
 its+0.061
 our+0.059
 such+0.056
 your+0.056
Neg logit contributions
oise-0.077
anda-0.075
mun-0.072
ny-0.072
my-0.070
Token They
Token ID1119
Feature activation+3.16e+00

Pos logit contributions
 yours+0.017
 our+0.016
 its+0.015
 ourselves+0.015
 your+0.015
Neg logit contributions
ny-0.018
anda-0.017
oise-0.016
my-0.016
mun-0.016
Token had
Token ID550
Feature activation-4.12e+00

Pos logit contributions
anda+0.056
oise+0.054
anna+0.051
ony+0.051
ig+0.050
Neg logit contributions
 its-0.070
 their-0.064
 an-0.062
 the-0.062
 our-0.060
Token s
Token ID264
Feature activation-6.73e-01

Pos logit contributions
 yours+0.063
 yourselves+0.058
 ourselves+0.057
 itself+0.055
 our+0.055
Neg logit contributions
's-0.087
my-0.084
ny-0.082
anda-0.080
a-0.078
Token that
Token ID326
Feature activation+1.68e+00

Pos logit contributions
 yours+0.027
 its+0.026
 theirs+0.025
 electricity+0.025
 our+0.025
Neg logit contributions
oise-0.036
anda-0.036
ll-0.034
my-0.034
ony-0.033
Token day
Token ID1110
Feature activation+4.69e-02

Pos logit contributions
oise+0.034
anda+0.034
mun+0.033
ony+0.032
ll+0.031
Neg logit contributions
 its-0.031
 yours-0.030
 our-0.030
 your-0.029
 such-0.028
Token on
Token ID319
Feature activation-1.59e+00

Pos logit contributions
oise+0.001
anda+0.001
ny+0.001
mun+0.001
ll+0.001
Neg logit contributions
 yours-0.001
 its-0.001
 our-0.001
 theirs-0.001
 itself-0.001
Token,
Token ID11
Feature activation-3.04e+00

Pos logit contributions
 yours+0.032
 its+0.032
 our+0.031
 theirs+0.030
 such+0.030
Neg logit contributions
oise-0.028
anda-0.027
mun-0.026
ny-0.026
ll-0.025
Token the
Token ID262
Feature activation+3.00e+00

Pos logit contributions
 its+0.033
 yours+0.033
 our+0.031
 such+0.029
 your+0.028
Neg logit contributions
oise-0.083
anda-0.081
mun-0.079
ny-0.078
ll-0.077
Token bright
Token ID6016
Feature activation+2.91e+00

Pos logit contributions
oise+0.054
anda+0.052
mun+0.051
ll+0.050
ny+0.050
Neg logit contributions
 its-0.060
 yours-0.060
 our-0.059
 your-0.057
 such-0.056
Token king
Token ID5822
Feature activation+1.04e+01

Pos logit contributions
oise+0.061
anda+0.059
mun+0.057
ny+0.056
ll+0.056
Neg logit contributions
 its-0.051
 yours-0.050
 our-0.049
 such-0.047
 your-0.046
Token became
Token ID2627
Feature activation-4.37e+00

Pos logit contributions
mun+0.171
anda+0.166
oise+0.163
ony+0.160
ny+0.155
Neg logit contributions
 its-0.223
 such-0.205
 your-0.194
 our-0.194
 itself-0.190
Token even
Token ID772
Feature activation-1.92e+00

Pos logit contributions
 yours+0.060
 our+0.055
 yourselves+0.055
 electricity+0.055
 itself+0.054
Neg logit contributions
oise-0.106
mun-0.100
ny-0.098
a-0.098
my-0.097
Token more
Token ID517
Feature activation-2.24e+00

Pos logit contributions
 its+0.026
 yours+0.025
 our+0.025
 such+0.024
 your+0.023
Neg logit contributions
oise-0.047
anda-0.047
mun-0.045
ll-0.044
ony-0.044
Token famous
Token ID5863
Feature activation-1.22e+00

Pos logit contributions
 its+0.037
 yours+0.037
 our+0.036
 such+0.034
 your+0.034
Neg logit contributions
oise-0.048
anda-0.047
mun-0.046
ny-0.045
ll-0.044
Token and
Token ID290
Feature activation-2.00e+00

Pos logit contributions
 yours+0.016
 its+0.016
 our+0.016
 such+0.015
 theirs+0.015
Neg logit contributions
oise-0.013
anda-0.013
mun-0.012
ny-0.012
my-0.012
Token waved
Token ID26834
Feature activation-3.92e+00

Pos logit contributions
 its+0.029
 yours+0.028
 our+0.028
 such+0.026
 your+0.026
Neg logit contributions
oise-0.047
anda-0.046
mun-0.045
ny-0.044
ll-0.044
Token at
Token ID379
Feature activation-5.60e+00

Pos logit contributions
 yours+0.064
 yourselves+0.062
 electricity+0.060
 ourselves+0.056
 theirs+0.055
Neg logit contributions
ny-0.079
y-0.079
's-0.076
a-0.076
anda-0.074
Token John
Token ID1757
Feature activation+6.24e+00

Pos logit contributions
 yours+0.053
 our+0.047
 its+0.046
 such+0.043
 your+0.042
Neg logit contributions
oise-0.157
anda-0.153
mun-0.149
my-0.149
y-0.148
Token as
Token ID355
Feature activation-2.39e+00

Pos logit contributions
oise+0.124
mun+0.120
anda+0.114
ird+0.107
ll+0.105
Neg logit contributions
 its-0.125
 our-0.118
 such-0.115
 yours-0.114
 itself-0.113
Token he
Token ID339
Feature activation+4.17e+00

Pos logit contributions
 yours+0.029
 our+0.027
 such+0.026
 its+0.026
 theirs+0.025
Neg logit contributions
oise-0.059
mun-0.057
anda-0.057
ny-0.057
a-0.056
Token left
Token ID1364
Feature activation-5.65e+00

Pos logit contributions
oise+0.063
anda+0.061
mun+0.060
my+0.056
ny+0.056
Neg logit contributions
 its-0.097
 such-0.089
 our-0.089
 itself-0.088
 your-0.084
Token.
Token ID13
Feature activation-5.76e-01

Pos logit contributions
 yours+0.097
 our+0.092
 electricity+0.088
 yourselves+0.087
 theirs+0.085
Neg logit contributions
oise-0.106
ny-0.103
my-0.101
anda-0.101
mun-0.100
Token
Token ID220
Feature activation-1.39e+00

Pos logit contributions
 yours+0.011
 its+0.011
 our+0.010
 theirs+0.010
 such+0.010
Neg logit contributions
oise-0.010
my-0.010
ny-0.010
mun-0.010
anda-0.010
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
 its+0.025
 yours+0.025
 our+0.024
 such+0.023
 your+0.023
Neg logit contributions
oise-0.028
anda-0.027
mun-0.026
ny-0.026
ll-0.026
TokenJohn
Token ID7554
Feature activation+9.74e+00

Pos logit contributions
 its+0.022
 yours+0.022
 our+0.021
 such+0.020
 your+0.020
Neg logit contributions
oise-0.024
anda-0.023
mun-0.023
ny-0.022
ll-0.022
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
oise+0.001
anda+0.001
mun+0.001
ny+0.001
ony+0.001
Neg logit contributions
 its-0.001
 yours-0.001
 our-0.001
 such-0.001
 your-0.001
Token It
Token ID632
Feature activation+2.21e+00

Pos logit contributions
 its+0.014
 yours+0.014
 theirs+0.014
 our+0.013
 such+0.013
Neg logit contributions
anda-0.023
oise-0.023
my-0.022
ny-0.022
mun-0.022
Token was
Token ID373
Feature activation-3.88e+00

Pos logit contributions
oise+0.035
mun+0.031
anda+0.031
ony+0.030
alia+0.030
Neg logit contributions
 our-0.052
 its-0.051
 your-0.050
 yours-0.050
 such-0.049
Token too
Token ID1165
Feature activation-2.27e+00

Pos logit contributions
 ourselves+0.059
 yourselves+0.057
 itself+0.053
 yours+0.052
 its+0.052
Neg logit contributions
my-0.076
's-0.073
a-0.071
ny-0.070
ches-0.068
Token lovely
Token ID14081
Feature activation-1.83e+00

Pos logit contributions
 yours+0.035
 such+0.035
 its+0.034
 itself+0.033
 our+0.033
Neg logit contributions
anda-0.049
a-0.049
oise-0.048
ny-0.048
y-0.048
Token not
Token ID407
Feature activation-2.85e+00

Pos logit contributions
 its+0.032
 yours+0.031
 our+0.031
 theirs+0.030
 itself+0.030
Neg logit contributions
oise-0.036
anda-0.036
ny-0.035
y-0.034
ll-0.034
Token to
Token ID284
Feature activation-4.09e+00

Pos logit contributions
 yourselves+0.029
 its+0.027
 itself+0.026
 ourselves+0.026
 electricity+0.025
Neg logit contributions
y-0.069
ny-0.069
my-0.068
a-0.068
anda-0.068
Token pick
Token ID2298
Feature activation-4.81e+00

Pos logit contributions
 its+0.078
 yours+0.078
 our+0.076
 such+0.073
 itself+0.072
Neg logit contributions
oise-0.076
anda-0.074
mun-0.071
ny-0.071
ony-0.069
Token up
Token ID510
Feature activation-5.57e+00

Pos logit contributions
 its+0.057
 yours+0.056
 theirs+0.055
 our+0.053
 electricity+0.052
Neg logit contributions
anda-0.113
's-0.112
y-0.112
ny-0.112
oise-0.111
Token!"
Token ID2474
Feature activation-4.41e-01

Pos logit contributions
 yours+0.082
 its+0.080
 electricity+0.080
 theirs+0.077
 itself+0.075
Neg logit contributions
anda-0.111
's-0.108
y-0.107
oise-0.106
ny-0.105
Token
Token ID198
Feature activation-1.32e+00

Pos logit contributions
 its+0.008
 yours+0.007
 our+0.007
 such+0.007
 your+0.007
Neg logit contributions
oise-0.010
anda-0.009
mun-0.009
ny-0.009
ony-0.009
Token the
Token ID262
Feature activation+2.00e+00

Pos logit contributions
 yours+0.052
 yourselves+0.051
 our+0.049
 ourselves+0.049
 its+0.044
Neg logit contributions
my-0.141
's-0.137
oise-0.133
a-0.132
mie-0.131
Token soap
Token ID19533
Feature activation+4.08e+00

Pos logit contributions
oise+0.046
ny+0.044
anda+0.044
mun+0.044
my+0.043
Neg logit contributions
 its-0.030
 yours-0.029
 our-0.028
 itself-0.027
 theirs-0.027
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
oise+0.075
anda+0.073
mun+0.071
ony+0.068
ny+0.068
Neg logit contributions
 its-0.081
 yours-0.080
 our-0.078
 such-0.076
 itself-0.075
Token he
Token ID339
Feature activation+4.00e+00

Pos logit contributions
 yours+0.028
 its+0.027
 our+0.027
 itself+0.026
 theirs+0.026
Neg logit contributions
oise-0.020
anda-0.019
my-0.019
mun-0.019
ny-0.018
Token gave
Token ID2921
Feature activation-5.10e+00

Pos logit contributions
oise+0.064
mun+0.060
anda+0.059
ony+0.056
mo+0.055
Neg logit contributions
 its-0.094
 our-0.089
 yours-0.088
 such-0.087
 your-0.086
Token him
Token ID683
Feature activation+9.19e-01

Pos logit contributions
 our+0.043
 yours+0.043
 its+0.042
 electricity+0.041
 yourselves+0.040
Neg logit contributions
my-0.143
ny-0.135
mie-0.134
oise-0.134
's-0.134
Token a
Token ID257
Feature activation-7.94e-02

Pos logit contributions
oise+0.027
anda+0.026
my+0.026
ny+0.026
ony+0.025
Neg logit contributions
 yours-0.008
 its-0.007
 our-0.007
 yourselves-0.006
 itself-0.006
Token big
Token ID1263
Feature activation+9.11e-01

Pos logit contributions
 its+0.001
 yours+0.001
 our+0.001
 such+0.001
 your+0.001
Neg logit contributions
oise-0.002
anda-0.002
mun-0.002
ny-0.002
ll-0.002
Token hug
Token ID16225
Feature activation+9.52e-02

Pos logit contributions
oise+0.015
anda+0.014
ny+0.014
mun+0.014
my+0.013
Neg logit contributions
 yours-0.019
 its-0.019
 our-0.019
 such-0.019
 itself-0.018
Token.
Token ID13
Feature activation-7.17e-01

Pos logit contributions
oise+0.002
anda+0.002
mun+0.002
ny+0.002
ll+0.002
Neg logit contributions
 its-0.002
 yours-0.002
 our-0.002
 such-0.002
 your-0.002
Token From
Token ID3574
Feature activation-1.41e+00

Pos logit contributions
 yours+0.013
 its+0.012
 our+0.012
 electricity+0.012
 itself+0.012
Neg logit contributions
anda-0.014
my-0.014
ny-0.013
oise-0.013
mun-0.013
Token to
Token ID284
Feature activation-3.76e+00

Pos logit contributions
 yours+0.045
 our+0.043
 its+0.042
 electricity+0.041
 theirs+0.041
Neg logit contributions
anda-0.065
oise-0.065
y-0.064
ll-0.064
mun-0.063
Token have
Token ID423
Feature activation-5.05e+00

Pos logit contributions
 yours+0.076
 our+0.073
 electricity+0.073
 theirs+0.073
 yourselves+0.072
Neg logit contributions
oise-0.062
ny-0.059
my-0.058
ll-0.057
anda-0.057
Token had
Token ID550
Feature activation-5.75e+00

Pos logit contributions
 yours+0.098
 our+0.095
 theirs+0.092
 electricity+0.092
 yourselves+0.091
Neg logit contributions
oise-0.088
my-0.086
's-0.085
mun-0.082
ny-0.082
Token such
Token ID884
Feature activation-4.98e+00

Pos logit contributions
 yours+0.075
 yourselves+0.073
 our+0.072
 electricity+0.072
 theirs+0.071
Neg logit contributions
's-0.128
my-0.126
oise-0.122
mie-0.121
a-0.120
Token an
Token ID281
Feature activation-7.74e-01

Pos logit contributions
 yours+0.052
 yourselves+0.052
 our+0.051
 ourselves+0.047
 electricity+0.046
Neg logit contributions
a-0.124
oise-0.117
anda-0.114
's-0.113
mie-0.113
Token adventure
Token ID8855
Feature activation-1.82e+00

Pos logit contributions
 its+0.010
 yours+0.010
 our+0.009
 such+0.009
 your+0.009
Neg logit contributions
oise-0.020
anda-0.019
mun-0.019
ny-0.018
my-0.018
Token.
Token ID13
Feature activation-1.32e-01

Pos logit contributions
 its+0.031
 yours+0.031
 our+0.030
 such+0.029
 your+0.029
Neg logit contributions
oise-0.037
anda-0.036
mun-0.035
ny-0.035
ll-0.034
Token<|endoftext|>
Token ID50256
Feature activation+3.25e+00

Pos logit contributions
 its+0.002
 yours+0.002
 our+0.002
 such+0.002
 your+0.002
Neg logit contributions
oise-0.003
anda-0.003
mun-0.003
ny-0.003
ll-0.003
TokenOnce
Token ID7454
Feature activation+1.64e+00

Pos logit contributions
oise+0.072
anda+0.070
mun+0.067
ony+0.066
ppy+0.065
Neg logit contributions
 its-0.054
 yours-0.050
 our-0.050
 such-0.050
 itself-0.049
Token there
Token ID612
Feature activation+4.43e+00

Pos logit contributions
oise+0.026
mun+0.024
anda+0.023
ny+0.023
ony+0.022
Neg logit contributions
 its-0.038
 yours-0.037
 our-0.037
 your-0.036
 theirs-0.035
Token was
Token ID373
Feature activation-2.23e+00

Pos logit contributions
oise+0.057
anda+0.052
ony+0.049
mun+0.049
ppy+0.049
Neg logit contributions
 its-0.117
 our-0.112
 yours-0.111
 such-0.111
 your-0.110
Token and
Token ID290
Feature activation+1.01e+00

Pos logit contributions
oise+0.018
anda+0.017
mun+0.016
ll+0.015
ony+0.015
Neg logit contributions
 its-0.019
 our-0.018
 such-0.017
 itself-0.017
 your-0.017
Token came
Token ID1625
Feature activation-2.13e+00

Pos logit contributions
oise+0.015
anda+0.014
mun+0.013
ll+0.013
ony+0.013
Neg logit contributions
 its-0.025
 our-0.024
 yours-0.023
 your-0.023
 itself-0.023
Token to
Token ID284
Feature activation-4.81e+00

Pos logit contributions
 its+0.026
 yours+0.026
 our+0.024
 such+0.023
 your+0.023
Neg logit contributions
oise-0.055
anda-0.054
mun-0.053
ny-0.052
ll-0.052
Token Lily
Token ID20037
Feature activation+6.12e+00

Pos logit contributions
 yours+0.085
 theirs+0.073
 ours+0.071
 electricity+0.071
 itself+0.070
Neg logit contributions
my-0.097
ny-0.088
cy-0.087
oise-0.087
's-0.086
Token's
Token ID338
Feature activation+2.26e+00

Pos logit contributions
oise+0.161
mun+0.155
anda+0.154
mo+0.147
int+0.145
Neg logit contributions
 its-0.082
 our-0.077
 such-0.077
 your-0.075
 yours-0.070
Token room
Token ID2119
Feature activation-1.05e-01

Pos logit contributions
oise+0.033
anda+0.032
mun+0.031
ny+0.030
ll+0.029
Neg logit contributions
 its-0.053
 yours-0.053
 our-0.052
 such-0.050
 your-0.050
Token.
Token ID13
Feature activation+7.30e-01

Pos logit contributions
 yours+0.002
 its+0.002
 our+0.002
 your+0.002
 such+0.002
Neg logit contributions
oise-0.002
anda-0.002
mun-0.002
ny-0.002
my-0.002
Token She
Token ID1375
Feature activation+4.13e+00

Pos logit contributions
my+0.011
anda+0.011
ny+0.010
's+0.010
y+0.010
Neg logit contributions
 yours-0.017
 theirs-0.016
 our-0.016
 such-0.016
 its-0.016
Token saw
Token ID2497
Feature activation-4.83e+00

Pos logit contributions
oise+0.066
anda+0.060
mun+0.059
ony+0.057
ny+0.057
Neg logit contributions
 its-0.096
 our-0.092
 your-0.089
 such-0.087
 yours-0.086
Token the
Token ID262
Feature activation+3.22e+00

Pos logit contributions
 yours+0.055
 its+0.047
 itself+0.047
 theirs+0.045
 our+0.045
Neg logit contributions
my-0.127
mun-0.123
anda-0.122
m-0.121
y-0.120
Token broken
Token ID5445
Feature activation+1.95e+00

Pos logit contributions
oise+0.051
anda+0.050
mun+0.049
ny+0.047
my+0.047
Neg logit contributions
 its-0.071
 yours-0.070
 our-0.068
 itself-0.067
 such-0.066
Token.
Token ID13
Feature activation-4.17e-01

Pos logit contributions
 yours+0.044
 electricity+0.039
 yourselves+0.038
 theirs+0.038
 our+0.037
Neg logit contributions
my-0.102
ie-0.100
's-0.099
ppy-0.096
ny-0.094
Token<|endoftext|>
Token ID50256
Feature activation+3.98e+00

Pos logit contributions
 yours+0.007
 our+0.006
 its+0.006
 theirs+0.006
 such+0.006
Neg logit contributions
anda-0.009
ny-0.009
oise-0.009
mun-0.009
my-0.008
TokenOnce
Token ID7454
Feature activation+2.32e+00

Pos logit contributions
oise+0.081
anda+0.079
mun+0.077
ny+0.075
ll+0.074
Neg logit contributions
 its-0.070
 yours-0.069
 our-0.068
 such-0.065
 your-0.064
Token upon
Token ID2402
Feature activation-6.10e-01

Pos logit contributions
oise+0.036
anda+0.033
mun+0.033
ny+0.032
ony+0.031
Neg logit contributions
 its-0.054
 yours-0.053
 our-0.052
 your-0.051
 theirs-0.050
Token a
Token ID257
Feature activation+3.87e+00

Pos logit contributions
 yours+0.005
 our+0.005
 its+0.005
 theirs+0.004
 itself+0.004
Neg logit contributions
oise-0.017
my-0.017
mun-0.017
ny-0.017
anda-0.017
Token time
Token ID640
Feature activation+1.89e+00

Pos logit contributions
oise+0.106
anda+0.101
mun+0.098
ony+0.096
mo+0.095
Neg logit contributions
 our-0.044
 its-0.044
 yours-0.042
 your-0.041
 an-0.037
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
oise+0.037
anda+0.036
mun+0.035
ny+0.034
ony+0.034
Neg logit contributions
 its-0.035
 yours-0.035
 our-0.034
 your-0.033
 such-0.032
Token there
Token ID612
Feature activation+4.70e+00

Pos logit contributions
oise+0.029
anda+0.028
ny+0.027
my+0.027
mun+0.027
Neg logit contributions
 yours-0.019
 its-0.018
 our-0.018
 such-0.017
 itself-0.017
Token was
Token ID373
Feature activation-2.31e+00

Pos logit contributions
oise+0.057
anda+0.052
mun+0.048
ony+0.048
ny+0.046
Neg logit contributions
 its-0.128
 our-0.123
 yours-0.122
 such-0.121
 your-0.120
Token a
Token ID257
Feature activation+3.14e+00

Pos logit contributions
 yours+0.023
 yourselves+0.021
 our+0.021
 itself+0.021
 its+0.021
Neg logit contributions
my-0.060
's-0.058
mun-0.058
ny-0.058
oise-0.058
Token nice
Token ID3621
Feature activation+4.10e+00

Pos logit contributions
oise+0.068
ony+0.065
anda+0.064
ll+0.064
mun+0.064
Neg logit contributions
 its-0.053
 our-0.053
 your-0.050
 yours-0.049
 my-0.044
Token be
Token ID307
Feature activation-3.04e+00

Pos logit contributions
 its+0.050
 yours+0.050
 our+0.049
 such+0.047
 theirs+0.047
Neg logit contributions
oise-0.058
anda-0.056
mun-0.055
ny-0.054
ll-0.054
Token generous
Token ID14431
Feature activation-9.17e-01

Pos logit contributions
 electricity+0.052
 yours+0.050
 itself+0.049
 theirs+0.048
 ourselves+0.047
Neg logit contributions
a-0.060
oise-0.058
my-0.057
bug-0.056
's-0.056
Token too
Token ID1165
Feature activation-2.45e+00

Pos logit contributions
 its+0.021
 yours+0.021
 our+0.020
 such+0.020
 your+0.020
Neg logit contributions
oise-0.014
anda-0.014
mun-0.013
ony-0.013
ll-0.013
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
 yours+0.047
 its+0.047
 our+0.046
 such+0.044
 itself+0.044
Neg logit contributions
oise-0.045
anda-0.044
mun-0.042
ny-0.042
ll-0.041
Token She
Token ID1375
Feature activation+5.04e+00

Pos logit contributions
 yours+0.025
 our+0.024
 theirs+0.024
 electricity+0.024
 its+0.024
Neg logit contributions
anda-0.019
ny-0.019
mun-0.018
my-0.018
oise-0.018
Token takes
Token ID2753
Feature activation-4.82e+00

Pos logit contributions
oise+0.109
op+0.096
mun+0.094
y+0.094
ird+0.094
Neg logit contributions
 our-0.089
 its-0.087
 your-0.087
 yours-0.084
 my-0.079
Token the
Token ID262
Feature activation+3.28e+00

Pos logit contributions
 yours+0.057
 its+0.053
 theirs+0.051
 itself+0.051
 our+0.051
Neg logit contributions
oise-0.121
ny-0.121
my-0.118
anda-0.117
y-0.117
Token book
Token ID1492
Feature activation+2.32e+00

Pos logit contributions
oise+0.065
anda+0.063
mun+0.061
ny+0.060
ll+0.059
Neg logit contributions
 its-0.061
 yours-0.060
 our-0.059
 such-0.056
 your-0.056
Token off
Token ID572
Feature activation-3.60e+00

Pos logit contributions
oise+0.051
mun+0.050
anda+0.049
mo+0.047
ony+0.046
Neg logit contributions
 its-0.041
 our-0.039
 yours-0.038
 your-0.037
 theirs-0.036
Token her
Token ID607
Feature activation+3.66e+00

Pos logit contributions
 yours+0.044
 its+0.042
 our+0.041
 such+0.038
 itself+0.038
Neg logit contributions
oise-0.092
anda-0.090
mun-0.087
ny-0.087
my-0.086
Token image
Token ID2939
Feature activation+2.52e+00

Pos logit contributions
oise+0.071
anda+0.068
mun+0.067
ny+0.064
ll+0.064
Neg logit contributions
 its-0.071
 yours-0.070
 our-0.069
 your-0.066
 itself-0.065
Token She
Token ID1375
Feature activation+4.16e+00

Pos logit contributions
anda+0.010
oise+0.009
ny+0.009
my+0.009
mun+0.009
Neg logit contributions
 yours-0.012
 our-0.012
 its-0.011
 theirs-0.011
 such-0.011
Token asked
Token ID1965
Feature activation-3.34e+00

Pos logit contributions
oise+0.067
mun+0.063
anda+0.061
mo+0.061
ny+0.058
Neg logit contributions
 its-0.098
 our-0.093
 your-0.088
 yours-0.088
 such-0.086
Token him
Token ID683
Feature activation+2.91e+00

Pos logit contributions
 yours+0.040
 its+0.039
 our+0.037
 such+0.035
 itself+0.035
Neg logit contributions
oise-0.085
anda-0.083
mun-0.083
my-0.082
ny-0.081
Token why
Token ID1521
Feature activation-3.82e+00

Pos logit contributions
oise+0.074
anda+0.072
mun+0.070
ny+0.069
ll+0.069
Neg logit contributions
 its-0.037
 yours-0.036
 our-0.035
 such-0.033
 itself-0.033
Token he
Token ID339
Feature activation+3.66e+00

Pos logit contributions
 yours+0.063
 our+0.060
 its+0.059
 itself+0.058
 such+0.057
Neg logit contributions
oise-0.078
anda-0.077
mun-0.077
my-0.075
y-0.075
Token wanted
Token ID2227
Feature activation-5.45e+00

Pos logit contributions
oise+0.058
mun+0.057
anda+0.056
mo+0.053
pit+0.052
Neg logit contributions
 its-0.082
 yours-0.077
 such-0.076
 our-0.075
 your-0.074
Token to
Token ID284
Feature activation-4.60e+00

Pos logit contributions
 yourselves+0.078
 ourselves+0.072
 yours+0.070
 itself+0.068
 theirs+0.065
Neg logit contributions
's-0.128
ches-0.119
my-0.118
ny-0.116
s-0.115
Token send
Token ID3758
Feature activation-4.99e+00

Pos logit contributions
 yours+0.094
 yourselves+0.091
 theirs+0.090
 electricity+0.089
 its+0.089
Neg logit contributions
's-0.067
my-0.066
oise-0.066
ny-0.064
ony-0.061
Token the
Token ID262
Feature activation+3.59e+00

Pos logit contributions
 yours+0.050
 yourselves+0.047
 theirs+0.045
 its+0.044
 our+0.043
Neg logit contributions
my-0.131
's-0.131
ny-0.123
ches-0.122
s-0.120
Token tuna
Token ID38883
Feature activation+5.34e+00

Pos logit contributions
oise+0.076
ny+0.075
anda+0.074
mun+0.074
my+0.071
Neg logit contributions
 its-0.060
 yours-0.058
 our-0.056
 itself-0.054
 theirs-0.054
Token away
Token ID1497
Feature activation-2.27e+00

Pos logit contributions
oise+0.120
anda+0.117
ny+0.115
mun+0.112
y+0.112
Neg logit contributions
 its-0.079
 yours-0.078
 our-0.076
 such-0.073
 itself-0.072
Token friends
Token ID2460
Feature activation+3.52e+00

Pos logit contributions
oise+0.014
anda+0.013
mun+0.013
ny+0.013
ony+0.013
Neg logit contributions
 its-0.009
 yours-0.009
 our-0.009
 your-0.009
 such-0.009
Token who
Token ID508
Feature activation+3.63e+00

Pos logit contributions
oise+0.073
anda+0.071
mun+0.069
ny+0.068
ony+0.067
Neg logit contributions
 its-0.062
 yours-0.060
 our-0.059
 such-0.057
 your-0.056
Token liked
Token ID8288
Feature activation-4.63e+00

Pos logit contributions
oise+0.064
anda+0.061
ony+0.058
mun+0.056
ll+0.055
Neg logit contributions
 its-0.078
 yours-0.073
 such-0.072
 our-0.072
 your-0.071
Token to
Token ID284
Feature activation-2.70e+00

Pos logit contributions
 yourselves+0.074
 yours+0.073
 itself+0.073
 ourselves+0.068
 electricity+0.068
Neg logit contributions
my-0.089
's-0.087
y-0.087
mun-0.086
anda-0.084
Token play
Token ID711
Feature activation-3.98e+00

Pos logit contributions
 its+0.045
 yours+0.044
 our+0.043
 your+0.041
 such+0.041
Neg logit contributions
oise-0.059
anda-0.058
mun-0.056
ll-0.054
ny-0.054
Token near
Token ID1474
Feature activation-4.86e+00

Pos logit contributions
 yours+0.076
 its+0.071
 our+0.070
 electricity+0.069
 theirs+0.069
Neg logit contributions
oise-0.075
mun-0.073
anda-0.069
ny-0.069
y-0.068
Token the
Token ID262
Feature activation+3.57e-01

Pos logit contributions
 yours+0.042
 its+0.033
 our+0.032
 theirs+0.032
 electricity+0.031
Neg logit contributions
oise-0.138
my-0.137
anda-0.136
mun-0.135
y-0.135
Token sea
Token ID5417
Feature activation+2.61e+00

Pos logit contributions
oise+0.007
anda+0.007
mun+0.006
ny+0.006
ll+0.006
Neg logit contributions
 its-0.007
 yours-0.007
 our-0.007
 such-0.006
 your-0.006
Token.
Token ID13
Feature activation-5.64e-01

Pos logit contributions
oise+0.060
anda+0.059
mun+0.057
ony+0.056
mo+0.056
Neg logit contributions
 its-0.041
 our-0.039
 yours-0.038
 itself-0.036
 your-0.036
Token They
Token ID1119
Feature activation+3.65e+00

Pos logit contributions
 yours+0.010
 its+0.010
 our+0.010
 itself+0.010
 such+0.010
Neg logit contributions
oise-0.011
anda-0.011
ny-0.010
my-0.010
mun-0.010
Token saw
Token ID2497
Feature activation-3.81e+00

Pos logit contributions
anda+0.057
oise+0.057
ony+0.054
mun+0.054
ches+0.049
Neg logit contributions
 its-0.083
 our-0.079
 yours-0.078
 their-0.077
 theirs-0.076
Token learned
Token ID4499
Feature activation-5.73e+00

Pos logit contributions
anda+0.104
oise+0.096
ony+0.095
mun+0.091
ll+0.088
Neg logit contributions
 its-0.135
 such-0.124
 our-0.122
 their-0.118
 itself-0.117
Token that
Token ID326
Feature activation-2.81e+00

Pos logit contributions
 yours+0.078
 yourselves+0.070
 our+0.064
 its+0.063
ocol+0.062
Neg logit contributions
's-0.130
a-0.125
s-0.125
y-0.124
ny-0.124
Token it
Token ID340
Feature activation+1.37e+00

Pos logit contributions
 yours+0.038
 its+0.035
 our+0.035
 yourselves+0.034
 such+0.033
Neg logit contributions
mun-0.065
oise-0.064
my-0.064
's-0.063
ll-0.062
Token's
Token ID338
Feature activation-2.34e+00

Pos logit contributions
oise+0.017
anda+0.016
pit+0.015
ony+0.015
bi+0.014
Neg logit contributions
 our-0.036
 its-0.036
 such-0.035
 your-0.035
 yours-0.034
Token not
Token ID407
Feature activation-2.72e+00

Pos logit contributions
 its+0.035
 yours+0.034
 our+0.032
 yourselves+0.032
 itself+0.031
Neg logit contributions
oise-0.051
my-0.050
ny-0.049
a-0.048
mun-0.047
Token about
Token ID546
Feature activation-6.00e+00

Pos logit contributions
 yourselves+0.041
 its+0.040
 itself+0.039
 yours+0.039
 ourselves+0.038
Neg logit contributions
ny-0.055
my-0.054
oise-0.053
's-0.052
y-0.052
Token how
Token ID703
Feature activation-4.66e+00

Pos logit contributions
 yours+0.077
 its+0.072
 theirs+0.072
ores+0.071
 yourselves+0.070
Neg logit contributions
's-0.130
my-0.127
mun-0.126
oise-0.122
ll-0.121
Token tall
Token ID7331
Feature activation-1.18e+00

Pos logit contributions
 yours+0.073
 theirs+0.069
 its+0.069
 such+0.068
 itself+0.066
Neg logit contributions
anda-0.100
ll-0.098
s-0.096
oise-0.095
's-0.094
Token you
Token ID345
Feature activation+9.18e-01

Pos logit contributions
 yours+0.019
 its+0.018
 our+0.018
 such+0.018
 theirs+0.017
Neg logit contributions
oise-0.024
anda-0.024
mun-0.024
ny-0.023
y-0.023
Token are
Token ID389
Feature activation-2.67e+00

Pos logit contributions
oise+0.017
anda+0.017
mun+0.017
ny+0.016
ony+0.016
Neg logit contributions
 its-0.018
 yours-0.017
 our-0.017
 such-0.017
 your-0.016
Token,
Token ID11
Feature activation-2.18e+00

Pos logit contributions
 yours+0.053
 its+0.053
 our+0.051
 theirs+0.049
 itself+0.049
Neg logit contributions
oise-0.047
anda-0.046
mun-0.044
ny-0.044
ll-0.044
Token young
Token ID1862
Feature activation+5.12e+00

Pos logit contributions
oise+0.058
anda+0.056
ll+0.054
mun+0.054
ony+0.053
Neg logit contributions
 its-0.059
 yours-0.058
 our-0.057
 your-0.055
 such-0.055
Token girl
Token ID2576
Feature activation+8.43e+00

Pos logit contributions
anda+0.067
oise+0.066
ll+0.066
ony+0.063
ppy+0.062
Neg logit contributions
 its-0.130
 yours-0.125
 our-0.125
 your-0.123
 such-0.121
Token left
Token ID1364
Feature activation-4.36e+00

Pos logit contributions
oise+0.144
mun+0.139
ny+0.130
anda+0.130
zag+0.125
Neg logit contributions
 its-0.176
 their-0.164
 our-0.162
 such-0.162
 the-0.159
Token the
Token ID262
Feature activation+1.92e+00

Pos logit contributions
 yours+0.059
 its+0.054
 our+0.052
 itself+0.051
 theirs+0.050
Neg logit contributions
oise-0.104
anda-0.103
mun-0.102
m-0.098
a-0.098
Token mine
Token ID6164
Feature activation+3.42e+00

Pos logit contributions
oise+0.038
anda+0.037
mun+0.036
ny+0.035
ll+0.035
Neg logit contributions
 its-0.035
 yours-0.035
 our-0.034
 such-0.033
 itself-0.032
Token with
Token ID351
Feature activation-4.52e+00

Pos logit contributions
oise+0.075
mun+0.073
anda+0.070
ny+0.069
ony+0.068
Neg logit contributions
 its-0.057
 yours-0.056
 our-0.055
 itself-0.053
 your-0.053
Token the
Token ID262
Feature activation+2.07e+00

Pos logit contributions
 yours+0.053
 yourselves+0.047
 our+0.045
 electricity+0.043
 itself+0.043
Neg logit contributions
's-0.106
anda-0.105
a-0.104
ony-0.104
my-0.104
Token special
Token ID2041
Feature activation+2.98e+00

Pos logit contributions
oise+0.043
anda+0.042
mun+0.041
ny+0.040
ony+0.039
Neg logit contributions
 its-0.036
 yours-0.035
 our-0.034
 such-0.033
 itself-0.033
Token thing
Token ID1517
Feature activation+2.97e+00

Pos logit contributions
oise+0.061
anda+0.060
mun+0.058
ny+0.057
ony+0.056
Neg logit contributions
 its-0.051
 yours-0.051
 our-0.050
 such-0.048
 itself-0.047
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
oise+0.065
anda+0.063
mun+0.062
ny+0.060
ll+0.059
Neg logit contributions
 its-0.048
 yours-0.048
 our-0.047
 such-0.044
 your-0.044
Token They
Token ID1119
Feature activation+2.85e+00

Pos logit contributions
 yours+0.015
 its+0.015
 our+0.015
 your+0.014
 itself+0.014
Neg logit contributions
oise-0.024
anda-0.023
ny-0.023
mun-0.023
my-0.022