TOP ACTIVATIONS
MAX = 14.806

her brown fur . <|endoftext|> Ark ie wanted his own equipment
that he wanted . <|endoftext|> Ak io was a 3 year
he said . <|endoftext|> L aur ie and Mike were walking
was full ! <|endoftext|> M agg ie was playing in the
years to come . <|endoftext|> Ell ie was feeling very lonely
it seemed magical ! <|endoftext|> Sus ie was very proud .
the church . <|endoftext|> M agg ie was a 3 year
fly again soon ! <|endoftext|> Jul ie and Joe were very
<|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> L ila and Ben are friends
more wonderful discoveries . <|endoftext|> Pat was a 3 - year
never forgot it . <|endoftext|> Jan ie and John were best
still bl ossom . <|endoftext|> Sus ie and her mom were
exploring the park . <|endoftext|> Kat ie was 3 years old
shine again . <|endoftext|> M agg ie was a little girl
of love ! <|endoftext|> M agg ie was so excited .
so mysterious anymore ! <|endoftext|> Pat ty was a creative 3
squash ." <|endoftext|> <|endoftext|> <|endoftext|> L ily and Sam went on
your heart . <|endoftext|> M agg ie and her brother Jack
The end . <|endoftext|> M agg ie was a brave girl
lot brighter . <|endoftext|> S oph ie and her dad were

MIN ACTIVATIONS
MIN = -13.081

not played with the ash tr ay . They wished they
will not touch the ash tr ay again ," Tim says
not play with the ash tr ay . It is dangerous
wanted to keep the ash tr ay . So he tried
should never touch an ash tr ay ." S
not played with the ash tr ay . They wish they
learned how it was made too . <|endoftext|> Once upon a
healthy food can be tasty too ." <|endoftext|> Once upon a
make the waiter drop the pizz as . They said sorry
, you should have the therm ometer in your mouth .
had not played with the ash tr ay . They wished
dull things can be fun too . <|endoftext|> S ara and
you have to keep the therm ometer in your mouth for
throw away the dirty ash tr ay . <|endoftext|> Once upon
ugly things can be good too . They are happy .
was cheaper than the other therm ometer . Bob was happy
strawberry , it is tasty too ," Tom says .
how big it had grown too . From then on ,
not keep things that are too expensive ," she warned .
not nice to keep the dish all to yourself ."

INTERVAL 7.834 - 14.806
CONTAINS 1.091%

The End . <|endoftext|> Tim and Kim were twins who
toy cars !" L ily was happy that her
get this huge again ." Mary nodded and started to help
glow coming from it . Johnny was excited and wanted to
€ Joe yelled � � € � �

INTERVAL 0.862 - 7.834
CONTAINS 35.938%

for the little girl to go home . The
<|endoftext|> Once upon a time there was a beautiful girl named
B eth and her mummy set to work , painting and
the trees . He even saw a cow and a horse
with the wrapped pastry . The wild bird saw them and

INTERVAL -6.109 - 0.862
CONTAINS 60.871%

, chased it , and had lots of fun .
" I want cheese and ham and pineapple ,"
day ! <|endoftext|> Bob was a little boy with a big
a purple candy . It was so small that everyone wanted
it ." Lucy thought for a moment and then gave the

INTERVAL -13.081 - -6.109
CONTAINS 2.100%

on the tray and put it in the stove . She
One day , he found something heavy in his closet -
learned to be happy with what she had . She learned
were clean before touching the page again . He finally turned
" Maybe we can fix it ," Ben said . He
Token her
Token ID607
Feature activation-2.72e+00

Pos logit contributions
atted+0.028
lee+0.028
ered+0.027
 shrugged+0.026
 asks+0.026
Neg logit contributions
tr-0.041
ometer-0.038
 steep-0.032
br-0.032
 cho-0.032
Token brown
Token ID7586
Feature activation-5.22e-01

Pos logit contributions
tr+0.079
ometer+0.073
ute+0.060
pre+0.058
t+0.057
Neg logit contributions
lee-0.074
atted-0.074
ered-0.073
 asking-0.072
 shrugged-0.069
Token fur
Token ID9230
Feature activation-5.47e+00

Pos logit contributions
tr+0.015
ometer+0.014
ute+0.012
pre+0.012
br+0.011
Neg logit contributions
ered-0.014
atted-0.014
lee-0.014
 asking-0.013
 Laura-0.013
Token.
Token ID13
Feature activation-4.36e+00

Pos logit contributions
tr+0.162
ometer+0.151
ute+0.126
pre+0.121
br+0.120
Neg logit contributions
lee-0.143
atted-0.141
 asking-0.141
ered-0.139
 shrugged-0.134
Token<|endoftext|>
Token ID50256
Feature activation+5.77e+00

Pos logit contributions
tr+0.127
ometer+0.116
ute+0.098
pre+0.094
br+0.092
Neg logit contributions
lee-0.122
atted-0.118
 asking-0.118
ered-0.114
 asks-0.112
TokenArk
Token ID42007
Feature activation+1.48e+01

Pos logit contributions
lee+0.176
atted+0.175
ered+0.173
 asking+0.172
 shrugged+0.167
Neg logit contributions
tr-0.148
ometer-0.135
ute-0.109
t-0.105
br-0.104
Tokenie
Token ID494
Feature activation+7.65e+00

Pos logit contributions
atted+0.332
 begged+0.332
lee+0.321
 blond+0.319
 Michelle+0.312
Neg logit contributions
tr-0.371
ometer-0.343
ute-0.322
boat-0.298
ney-0.281
Token wanted
Token ID2227
Feature activation+5.24e+00

Pos logit contributions
els+0.214
ered+0.207
atted+0.201
lee+0.198
 asking+0.196
Neg logit contributions
tr-0.191
ometer-0.174
pre-0.140
t-0.139
ding-0.135
Token his
Token ID465
Feature activation+2.83e+00

Pos logit contributions
atted+0.166
lee+0.166
ered+0.163
 asking+0.159
 shrugged+0.155
Neg logit contributions
tr-0.128
ometer-0.117
ute-0.091
pre-0.088
t-0.086
Token own
Token ID898
Feature activation+7.77e-01

Pos logit contributions
atted+0.071
ered+0.071
lee+0.071
 asking+0.069
 shrugged+0.066
Neg logit contributions
tr-0.087
ometer-0.082
ute-0.069
pre-0.066
t-0.063
Token equipment
Token ID5112
Feature activation-3.31e+00

Pos logit contributions
lee+0.022
atted+0.022
ered+0.022
 asking+0.022
 shrugged+0.021
Neg logit contributions
tr-0.021
ometer-0.020
ute-0.016
pre-0.015
br-0.015
Token that
Token ID326
Feature activation-6.34e+00

Pos logit contributions
tr+0.263
ometer+0.255
ute+0.214
pre+0.209
powers+0.201
Neg logit contributions
 asking-0.200
ered-0.195
lee-0.194
atted-0.190
 asked-0.190
Token he
Token ID339
Feature activation-4.59e+00

Pos logit contributions
tr+0.195
ometer+0.184
ute+0.157
pre+0.148
br+0.145
Neg logit contributions
 asking-0.159
lee-0.157
ered-0.155
atted-0.155
 shrugged-0.152
Token wanted
Token ID2227
Feature activation+5.41e-02

Pos logit contributions
tr+0.135
ometer+0.121
t+0.105
 steep+0.100
pre+0.099
Neg logit contributions
lee-0.121
atted-0.118
 asking-0.115
ered-0.113
els-0.112
Token.
Token ID13
Feature activation-4.60e+00

Pos logit contributions
atted+0.002
lee+0.002
ered+0.002
 asks+0.002
 asking+0.002
Neg logit contributions
tr-0.001
ometer-0.001
t-0.001
br-0.001
ute-0.001
Token<|endoftext|>
Token ID50256
Feature activation+5.87e+00

Pos logit contributions
tr+0.135
ometer+0.123
ute+0.106
pre+0.101
br+0.101
Neg logit contributions
lee-0.125
 asking-0.123
atted-0.123
ered-0.118
 asks-0.117
TokenAk
Token ID33901
Feature activation+1.43e+01

Pos logit contributions
lee+0.179
atted+0.179
 asking+0.176
ered+0.176
 shrugged+0.169
Neg logit contributions
tr-0.152
ometer-0.139
ute-0.111
t-0.106
pre-0.105
Tokenio
Token ID952
Feature activation+1.02e+01

Pos logit contributions
 asking+0.394
 begged+0.358
 Laura+0.346
 begging+0.339
 asks+0.336
Neg logit contributions
tr-0.382
ometer-0.371
ute-0.336
pre-0.320
t-0.318
Token was
Token ID373
Feature activation+6.07e+00

Pos logit contributions
 asking+0.253
ered+0.236
eling+0.236
 hesitation+0.233
atted+0.232
Neg logit contributions
tr-0.336
ometer-0.302
pre-0.243
ute-0.235
ite-0.233
Token a
Token ID257
Feature activation-8.78e-01

Pos logit contributions
lee+0.191
ered+0.181
els+0.181
atted+0.180
eling+0.172
Neg logit contributions
tr-0.160
ometer-0.131
ute-0.115
 steep-0.109
 much-0.106
Token 3
Token ID513
Feature activation+3.99e+00

Pos logit contributions
tr+0.034
ometer+0.032
ute+0.028
t+0.028
br+0.027
Neg logit contributions
lee-0.015
 asked-0.015
 asking-0.015
 shrugged-0.014
 Laura-0.014
Token year
Token ID614
Feature activation+2.46e+00

Pos logit contributions
atted+0.113
lee+0.112
 asking+0.111
ered+0.110
 asks+0.110
Neg logit contributions
tr-0.094
ometer-0.085
pre-0.073
t-0.073
ute-0.071
Token he
Token ID339
Feature activation+1.03e+00

Pos logit contributions
lee+0.050
ered+0.050
atted+0.050
 asking+0.048
els+0.046
Neg logit contributions
tr-0.054
ometer-0.050
ute-0.041
t-0.040
pre-0.039
Token said
Token ID531
Feature activation+5.05e+00

Pos logit contributions
lee+0.019
els+0.019
atted+0.018
 asking+0.018
ice+0.017
Neg logit contributions
tr-0.036
ometer-0.034
t-0.028
powers-0.028
ute-0.028
Token.
Token ID13
Feature activation-2.20e+00

Pos logit contributions
lee+0.145
atted+0.143
 asks+0.143
ice+0.141
els+0.140
Neg logit contributions
tr-0.132
ometer-0.125
ute-0.099
t-0.099
pre-0.095
Token<|endoftext|>
Token ID50256
Feature activation+5.78e+00

Pos logit contributions
tr+0.058
ometer+0.054
ute+0.042
t+0.041
pre+0.040
Neg logit contributions
ered-0.066
atted-0.065
lee-0.064
 asking-0.063
eling-0.062
TokenL
Token ID43
Feature activation+1.16e+01

Pos logit contributions
lee+0.177
atted+0.177
 asking+0.175
ered+0.174
 shrugged+0.167
Neg logit contributions
tr-0.149
ometer-0.135
ute-0.108
pre-0.103
t-0.103
Tokenaur
Token ID2899
Feature activation+1.40e+01

Pos logit contributions
 shrugged+0.231
atted+0.221
 begged+0.219
 asking+0.218
 asks+0.206
Neg logit contributions
tr-0.360
ometer-0.329
t-0.325
ute-0.309
ork-0.307
Tokenie
Token ID494
Feature activation+9.55e+00

Pos logit contributions
 hesitation+0.251
 asking+0.250
 Laura+0.247
 asks+0.246
 Michelle+0.244
Neg logit contributions
ometer-0.376
tr-0.366
t-0.354
ork-0.340
 cho-0.339
Token and
Token ID290
Feature activation+7.19e+00

Pos logit contributions
ered+0.256
els+0.249
atted+0.244
ps+0.239
eling+0.232
Neg logit contributions
tr-0.247
ometer-0.217
 cho-0.187
br-0.184
ch-0.183
Token Mike
Token ID4995
Feature activation+1.24e+01

Pos logit contributions
atted+0.136
lee+0.130
ered+0.125
ice+0.123
 asking+0.122
Neg logit contributions
tr-0.266
ometer-0.249
ute-0.214
t-0.209
pre-0.209
Token were
Token ID547
Feature activation+7.26e+00

Pos logit contributions
 asking+0.355
 hesitation+0.341
lee+0.319
 Michelle+0.318
 Marie+0.316
Neg logit contributions
tr-0.313
ometer-0.275
ding-0.250
ite-0.232
ute-0.230
Token walking
Token ID6155
Feature activation+5.40e+00

Pos logit contributions
ered+0.189
eling+0.174
 shrugged+0.174
ise+0.170
lee+0.170
Neg logit contributions
tr-0.212
ometer-0.192
ute-0.170
 much-0.163
 steep-0.157
Token was
Token ID373
Feature activation-4.14e-01

Pos logit contributions
lee+0.019
atted+0.019
 asking+0.018
ered+0.018
 shrugged+0.017
Neg logit contributions
tr-0.023
ometer-0.021
ute-0.018
pre-0.017
t-0.017
Token full
Token ID1336
Feature activation+1.38e-01

Pos logit contributions
tr+0.010
ometer+0.009
 steep+0.007
t+0.007
br+0.007
Neg logit contributions
lee-0.013
atted-0.013
ered-0.013
 asks-0.012
 asking-0.012
Token!
Token ID0
Feature activation-4.01e+00

Pos logit contributions
lee+0.004
atted+0.004
ered+0.004
 asking+0.004
 asks+0.004
Neg logit contributions
tr-0.004
ometer-0.003
t-0.003
br-0.003
ute-0.003
Token<|endoftext|>
Token ID50256
Feature activation+6.35e+00

Pos logit contributions
tr+0.132
ometer+0.123
ute+0.109
br+0.103
pre+0.103
Neg logit contributions
lee-0.101
 asking-0.093
atted-0.092
 asked-0.090
ered-0.088
TokenM
Token ID44
Feature activation+9.37e+00

Pos logit contributions
atted+0.190
lee+0.190
ered+0.188
 asking+0.186
 shrugged+0.180
Neg logit contributions
tr-0.167
ometer-0.152
ute-0.123
t-0.118
br-0.118
Tokenagg
Token ID9460
Feature activation+1.39e+01

Pos logit contributions
 asking+0.213
atted+0.206
ered+0.205
lee+0.201
 shrugged+0.199
Neg logit contributions
tr-0.312
ometer-0.290
ute-0.251
t-0.244
pre-0.235
Tokenie
Token ID494
Feature activation+8.56e+00

Pos logit contributions
 Jennifer+0.401
 Michelle+0.384
 asking+0.382
 pleading+0.354
 begging+0.350
Neg logit contributions
ute-0.247
tr-0.242
ometer-0.235
t-0.223
bye-0.209
Token was
Token ID373
Feature activation+5.88e+00

Pos logit contributions
ered+0.244
els+0.240
atted+0.230
eling+0.226
 hesitation+0.224
Neg logit contributions
tr-0.215
ometer-0.186
pre-0.159
 je-0.157
 cho-0.157
Token playing
Token ID2712
Feature activation+4.13e+00

Pos logit contributions
lee+0.163
ered+0.161
atted+0.157
els+0.154
ice+0.151
Neg logit contributions
tr-0.170
ometer-0.141
ute-0.128
 much-0.114
 Whe-0.114
Token in
Token ID287
Feature activation+4.56e+00

Pos logit contributions
lee+0.122
atted+0.120
ered+0.115
 asking+0.113
 asks+0.111
Neg logit contributions
tr-0.115
ometer-0.100
ute-0.089
pre-0.082
br-0.082
Token the
Token ID262
Feature activation+4.58e-01

Pos logit contributions
atted+0.117
lee+0.108
 shrugged+0.103
ered+0.102
ice+0.100
Neg logit contributions
tr-0.138
 road-0.118
ometer-0.114
 bike-0.112
 Whe-0.110
Token years
Token ID812
Feature activation-2.69e+00

Pos logit contributions
tr+0.033
ometer+0.029
br+0.028
 steep+0.027
powers+0.026
Neg logit contributions
lee-0.045
atted-0.045
 shrugged-0.044
 asking-0.043
 Laura-0.043
Token to
Token ID284
Feature activation-1.42e+00

Pos logit contributions
tr+0.080
ometer+0.075
ute+0.062
t+0.060
br+0.060
Neg logit contributions
atted-0.069
ered-0.069
lee-0.068
 asking-0.068
 shrugged-0.065
Token come
Token ID1282
Feature activation+1.48e+00

Pos logit contributions
tr+0.037
ometer+0.032
ute+0.028
br+0.027
 steep+0.026
Neg logit contributions
atted-0.044
lee-0.043
 asking-0.041
 shrugged-0.040
ice-0.040
Token.
Token ID13
Feature activation-3.77e+00

Pos logit contributions
ered+0.034
lee+0.034
 asking+0.034
atted+0.033
 asked+0.032
Neg logit contributions
tr-0.049
ometer-0.046
pre-0.038
br-0.038
ute-0.037
Token<|endoftext|>
Token ID50256
Feature activation+6.44e+00

Pos logit contributions
tr+0.123
ometer+0.112
ute+0.097
pre+0.096
br+0.094
Neg logit contributions
lee-0.091
atted-0.089
 asking-0.087
 Laura-0.085
 asks-0.085
TokenEll
Token ID30639
Feature activation+1.39e+01

Pos logit contributions
lee+0.187
atted+0.186
 asking+0.183
ered+0.183
 shrugged+0.175
Neg logit contributions
tr-0.176
ometer-0.161
ute-0.131
pre-0.125
br-0.124
Tokenie
Token ID494
Feature activation+8.78e+00

Pos logit contributions
 begged+0.253
 asking+0.245
atted+0.243
 asks+0.238
 shrugged+0.233
Neg logit contributions
tr-0.512
ometer-0.422
ute-0.399
Snow-0.376
 je-0.373
Token was
Token ID373
Feature activation+6.36e+00

Pos logit contributions
ered+0.245
els+0.243
atted+0.239
ps+0.231
eling+0.229
Neg logit contributions
tr-0.220
ometer-0.196
 je-0.165
 road-0.162
pre-0.159
Token feeling
Token ID4203
Feature activation+5.92e+00

Pos logit contributions
lee+0.205
ered+0.198
atted+0.197
els+0.197
eling+0.192
Neg logit contributions
tr-0.156
ometer-0.123
ute-0.115
 much-0.106
 steep-0.104
Token very
Token ID845
Feature activation-4.72e-01

Pos logit contributions
lee+0.194
atted+0.193
 asking+0.191
ered+0.190
 shrugged+0.183
Neg logit contributions
tr-0.140
ometer-0.126
ute-0.099
pre-0.094
br-0.092
Token lonely
Token ID21757
Feature activation+4.07e+00

Pos logit contributions
tr+0.013
ometer+0.012
ute+0.011
t+0.010
pre+0.010
Neg logit contributions
atted-0.012
ice-0.012
 asking-0.012
els-0.012
lee-0.012
Token it
Token ID340
Feature activation-6.22e+00

Pos logit contributions
tr+0.130
ometer+0.119
ute+0.097
pre+0.092
br+0.090
Neg logit contributions
lee-0.145
 asking-0.144
atted-0.144
ered-0.143
 shrugged-0.137
Token seemed
Token ID3947
Feature activation-3.19e+00

Pos logit contributions
tr+0.181
ometer+0.165
t+0.138
br+0.134
pre+0.134
Neg logit contributions
lee-0.169
atted-0.168
 asking-0.164
ered-0.160
els-0.153
Token magical
Token ID10883
Feature activation-1.36e+00

Pos logit contributions
tr+0.088
ometer+0.080
ute+0.066
pre+0.063
br+0.061
Neg logit contributions
lee-0.092
 asking-0.092
atted-0.091
ered-0.090
 shrugged-0.086
Token!
Token ID0
Feature activation-4.92e+00

Pos logit contributions
tr+0.038
ometer+0.035
ute+0.028
t+0.027
br+0.027
Neg logit contributions
lee-0.039
atted-0.038
 asking-0.037
ered-0.037
 shrugged-0.036
Token<|endoftext|>
Token ID50256
Feature activation+6.83e+00

Pos logit contributions
tr+0.151
ometer+0.139
ute+0.120
pre+0.116
br+0.111
Neg logit contributions
lee-0.125
 asking-0.124
atted-0.121
 asks-0.120
 asked-0.119
TokenSus
Token ID30746
Feature activation+1.39e+01

Pos logit contributions
lee+0.207
atted+0.207
ered+0.204
 asking+0.202
 shrugged+0.195
Neg logit contributions
tr-0.177
ometer-0.161
ute-0.129
br-0.125
t-0.125
Tokenie
Token ID494
Feature activation+8.51e+00

Pos logit contributions
 hesitation+0.328
 begged+0.327
 convincing+0.314
 Michelle+0.314
 Eva+0.313
Neg logit contributions
tr-0.367
ometer-0.344
ute-0.325
ite-0.300
ding-0.297
Token was
Token ID373
Feature activation+5.74e+00

Pos logit contributions
ered+0.235
els+0.219
eling+0.217
atted+0.215
 asking+0.209
Neg logit contributions
tr-0.227
ometer-0.206
pre-0.173
ute-0.168
 je-0.168
Token very
Token ID845
Feature activation-3.38e-01

Pos logit contributions
lee+0.172
ered+0.168
atted+0.167
els+0.160
 shrugged+0.158
Neg logit contributions
tr-0.155
ometer-0.128
ute-0.116
 steep-0.104
 much-0.103
Token proud
Token ID6613
Feature activation+4.40e+00

Pos logit contributions
ometer+0.008
tr+0.008
ute+0.007
 steep+0.007
t+0.007
Neg logit contributions
ice-0.009
ered-0.009
els-0.009
atted-0.009
lee-0.009
Token.
Token ID13
Feature activation-1.86e+00

Pos logit contributions
lee+0.110
atted+0.109
 asking+0.108
ered+0.107
 shrugged+0.102
Neg logit contributions
tr-0.138
ometer-0.128
ute-0.107
pre-0.104
br-0.102
Token the
Token ID262
Feature activation-3.03e+00

Pos logit contributions
atted+0.008
lee+0.008
 shrugged+0.007
ered+0.007
ice+0.007
Neg logit contributions
tr-0.008
ometer-0.008
ute-0.007
 license-0.007
t-0.007
Token church
Token ID4928
Feature activation-5.58e+00

Pos logit contributions
tr+0.089
ometer+0.083
ute+0.067
pre+0.065
br+0.065
Neg logit contributions
 asking-0.083
ered-0.080
lee-0.079
 asks-0.079
atted-0.079
Token.
Token ID13
Feature activation-4.40e+00

Pos logit contributions
tr+0.179
ometer+0.167
ute+0.138
t+0.136
pre+0.135
Neg logit contributions
lee-0.135
atted-0.133
ered-0.130
 asking-0.129
 shrugged-0.124
Token<|endoftext|>
Token ID50256
Feature activation+6.40e+00

Pos logit contributions
tr+0.132
ometer+0.120
ute+0.103
pre+0.098
br+0.097
Neg logit contributions
lee-0.120
 asking-0.116
atted-0.115
ered-0.109
 asks-0.109
TokenM
Token ID44
Feature activation+9.95e+00

Pos logit contributions
lee+0.193
atted+0.193
ered+0.190
 asking+0.189
 shrugged+0.182
Neg logit contributions
tr-0.168
ometer-0.153
ute-0.122
t-0.118
br-0.118
Tokenagg
Token ID9460
Feature activation+1.38e+01

Pos logit contributions
 asking+0.232
atted+0.230
ered+0.230
lee+0.228
 shrugged+0.218
Neg logit contributions
tr-0.324
ometer-0.303
ute-0.258
t-0.251
pre-0.243
Tokenie
Token ID494
Feature activation+8.95e+00

Pos logit contributions
 asking+0.346
 Jennifer+0.345
 Michelle+0.325
 hesitation+0.320
 pleading+0.313
Neg logit contributions
ometer-0.307
tr-0.304
ute-0.287
t-0.285
pre-0.271
Token was
Token ID373
Feature activation+6.16e+00

Pos logit contributions
ered+0.251
els+0.240
atted+0.235
eling+0.229
 asking+0.225
Neg logit contributions
tr-0.238
ometer-0.212
ding-0.178
pre-0.177
storm-0.175
Token a
Token ID257
Feature activation-8.22e-01

Pos logit contributions
lee+0.165
ered+0.164
atted+0.161
els+0.155
ice+0.150
Neg logit contributions
tr-0.184
ometer-0.158
ute-0.141
pre-0.130
ding-0.129
Token 3
Token ID513
Feature activation+4.17e+00

Pos logit contributions
tr+0.032
ometer+0.030
ute+0.026
t+0.026
br+0.025
Neg logit contributions
lee-0.014
 asked-0.014
 asking-0.014
 Laura-0.014
 shrugged-0.013
Token year
Token ID614
Feature activation+2.58e+00

Pos logit contributions
ered+0.117
lee+0.116
atted+0.116
 asking+0.115
 asks+0.112
Neg logit contributions
tr-0.100
ometer-0.091
pre-0.078
ute-0.077
t-0.076
Token fly
Token ID6129
Feature activation-3.16e-01

Pos logit contributions
tr+0.023
ometer+0.021
ute+0.017
pre+0.017
t+0.016
Neg logit contributions
ered-0.020
lee-0.020
 asking-0.020
atted-0.019
 asked-0.019
Token again
Token ID757
Feature activation-6.79e-01

Pos logit contributions
tr+0.011
ometer+0.010
pre+0.009
ute+0.008
t+0.008
Neg logit contributions
ered-0.007
 asking-0.007
lee-0.007
atted-0.007
 asked-0.006
Token soon
Token ID2582
Feature activation-4.40e-01

Pos logit contributions
tr+0.022
ometer+0.020
ute+0.018
br+0.017
t+0.017
Neg logit contributions
atted-0.016
lee-0.016
ered-0.016
 asking-0.015
 shrugged-0.015
Token!
Token ID0
Feature activation-3.24e+00

Pos logit contributions
tr+0.012
ometer+0.011
ute+0.009
t+0.009
br+0.009
Neg logit contributions
atted-0.012
lee-0.012
ered-0.012
 shrugged-0.011
 asking-0.011
Token<|endoftext|>
Token ID50256
Feature activation+6.14e+00

Pos logit contributions
tr+0.102
ometer+0.094
ute+0.082
pre+0.081
br+0.077
Neg logit contributions
 asking-0.080
lee-0.079
atted-0.078
ered-0.077
 Laura-0.076
TokenJul
Token ID16980
Feature activation+1.38e+01

Pos logit contributions
lee+0.182
atted+0.182
ered+0.179
 asking+0.179
 shrugged+0.172
Neg logit contributions
tr-0.163
ometer-0.149
ute-0.120
t-0.115
pre-0.115
Tokenie
Token ID494
Feature activation+8.28e+00

Pos logit contributions
 asking+0.305
 begging+0.282
 asks+0.276
 begged+0.275
 Michelle+0.249
Neg logit contributions
tr-0.420
ometer-0.388
ute-0.375
t-0.374
ork-0.353
Token and
Token ID290
Feature activation+5.34e+00

Pos logit contributions
ered+0.214
els+0.209
atted+0.207
eling+0.202
ps+0.199
Neg logit contributions
tr-0.227
ometer-0.211
ute-0.173
ding-0.171
pre-0.169
Token Joe
Token ID5689
Feature activation+1.04e+01

Pos logit contributions
atted+0.110
lee+0.105
ered+0.101
ice+0.101
eling+0.099
Neg logit contributions
tr-0.188
ometer-0.175
ute-0.153
pre-0.147
 bike-0.146
Token were
Token ID547
Feature activation+6.11e+00

Pos logit contributions
 asking+0.295
atted+0.283
lee+0.282
 Laura+0.272
ered+0.269
Neg logit contributions
tr-0.269
ometer-0.236
ding-0.207
ite-0.203
 je-0.196
Token very
Token ID845
Feature activation-1.10e+00

Pos logit contributions
ered+0.180
lee+0.169
eling+0.168
 shrugged+0.167
ise+0.163
Neg logit contributions
tr-0.162
ometer-0.147
ute-0.128
ding-0.117
 cho-0.116
Token<|endoftext|>
Token ID50256
Feature activation+4.89e+00

Pos logit contributions
tr+0.125
ometer+0.123
ute+0.109
pre+0.104
 steep+0.101
Neg logit contributions
 asking-0.088
 asks-0.085
lee-0.085
 asked-0.085
 Laura-0.084
Token<|endoftext|>
Token ID50256
Feature activation+7.22e+00

Pos logit contributions
lee+0.129
atted+0.128
 asking+0.128
ered+0.126
 asks+0.122
Neg logit contributions
tr-0.145
ometer-0.135
ute-0.111
pre-0.107
t-0.104
Token<|endoftext|>
Token ID50256
Feature activation+8.41e+00

Pos logit contributions
lee+0.204
atted+0.203
 asking+0.201
ered+0.200
 shrugged+0.191
Neg logit contributions
tr-0.203
ometer-0.186
ute-0.152
pre-0.146
br-0.144
Token<|endoftext|>
Token ID50256
Feature activation+8.96e+00

Pos logit contributions
lee+0.253
atted+0.253
ered+0.249
 asking+0.247
 shrugged+0.240
Neg logit contributions
tr-0.221
ometer-0.198
ute-0.162
br-0.156
pre-0.156
Token<|endoftext|>
Token ID50256
Feature activation+9.23e+00

Pos logit contributions
atted+0.283
lee+0.282
ered+0.279
 asking+0.273
 shrugged+0.269
Neg logit contributions
tr-0.224
ometer-0.194
ute-0.160
br-0.158
pre-0.154
TokenL
Token ID43
Feature activation+1.38e+01

Pos logit contributions
atted+0.299
lee+0.297
ered+0.295
 asking+0.287
 shrugged+0.285
Neg logit contributions
tr-0.223
ometer-0.189
br-0.157
ute-0.156
pre-0.152
Tokenila
Token ID10102
Feature activation+1.16e+01

Pos logit contributions
eling+0.333
 shrugged+0.325
atted+0.319
ened+0.313
ered+0.312
Neg logit contributions
tr-0.395
ork-0.359
ute-0.343
t-0.331
ometer-0.327
Token and
Token ID290
Feature activation+9.19e+00

Pos logit contributions
eling+0.405
ered+0.397
 hesitation+0.374
 asking+0.373
atted+0.368
Neg logit contributions
tr-0.247
ometer-0.193
pre-0.165
ute-0.164
 road-0.156
Token Ben
Token ID3932
Feature activation+1.28e+01

Pos logit contributions
atted+0.181
lee+0.177
ered+0.168
 asking+0.164
ice+0.164
Neg logit contributions
tr-0.337
ometer-0.306
ute-0.274
pre-0.268
t-0.266
Token are
Token ID389
Feature activation+8.08e+00

Pos logit contributions
atted+0.363
 asking+0.357
lee+0.357
ered+0.355
eling+0.347
Neg logit contributions
tr-0.351
ometer-0.299
pre-0.270
ute-0.250
t-0.246
Token friends
Token ID2460
Feature activation+5.88e+00

Pos logit contributions
ered+0.225
 shrugged+0.217
eling+0.213
atted+0.207
lee+0.204
Neg logit contributions
tr-0.231
ometer-0.207
ute-0.185
pre-0.175
t-0.172
Token more
Token ID517
Feature activation-4.89e+00

Pos logit contributions
atted+0.045
lee+0.044
 shrugged+0.042
 asking+0.042
ered+0.041
Neg logit contributions
tr-0.062
ometer-0.059
ute-0.050
br-0.048
pre-0.047
Token wonderful
Token ID7932
Feature activation+1.02e+00

Pos logit contributions
tr+0.145
ometer+0.135
ute+0.112
pre+0.108
br+0.107
Neg logit contributions
lee-0.129
atted-0.128
 asking-0.126
ered-0.126
 shrugged-0.121
Token discoveries
Token ID24362
Feature activation-1.74e+00

Pos logit contributions
lee+0.028
atted+0.028
 asking+0.028
ered+0.027
 shrugged+0.026
Neg logit contributions
tr-0.029
ometer-0.027
ute-0.022
pre-0.021
br-0.021
Token.
Token ID13
Feature activation-3.65e+00

Pos logit contributions
tr+0.056
ometer+0.053
ute+0.044
pre+0.043
t+0.041
Neg logit contributions
lee-0.042
 asking-0.041
atted-0.041
ered-0.041
 shrugged-0.038
Token<|endoftext|>
Token ID50256
Feature activation+6.06e+00

Pos logit contributions
tr+0.101
ometer+0.094
ute+0.075
t+0.072
pre+0.072
Neg logit contributions
atted-0.104
lee-0.103
ered-0.103
 asking-0.102
 shrugged-0.098
TokenPat
Token ID12130
Feature activation+1.38e+01

Pos logit contributions
lee+0.186
atted+0.186
ered+0.183
 asking+0.183
 shrugged+0.176
Neg logit contributions
tr-0.156
ometer-0.141
ute-0.113
t-0.108
br-0.108
Token was
Token ID373
Feature activation+7.57e+00

Pos logit contributions
 asking+0.273
 sisters+0.244
 Laura+0.242
 begged+0.235
atted+0.227
Neg logit contributions
tr-0.467
ometer-0.423
 steep-0.397
ute-0.387
t-0.373
Token a
Token ID257
Feature activation+5.04e-01

Pos logit contributions
lee+0.201
ered+0.200
atted+0.197
els+0.191
 shrugged+0.186
Neg logit contributions
tr-0.220
ometer-0.190
ute-0.178
ding-0.160
 steep-0.160
Token 3
Token ID513
Feature activation+4.66e+00

Pos logit contributions
lee+0.010
 asking+0.009
 asked+0.009
atted+0.009
 shrugged+0.009
Neg logit contributions
tr-0.019
ometer-0.018
t-0.015
ute-0.015
br-0.014
Token-
Token ID12
Feature activation+1.63e+00

Pos logit contributions
 asking+0.140
lee+0.138
atted+0.134
 asks+0.133
ered+0.133
Neg logit contributions
tr-0.101
ometer-0.090
pre-0.080
ute-0.075
t-0.074
Tokenyear
Token ID1941
Feature activation+4.48e+00

Pos logit contributions
lee+0.036
 shrugged+0.035
 asking+0.035
atted+0.034
 asks+0.034
Neg logit contributions
tr-0.053
ometer-0.049
t-0.043
pre-0.043
ute-0.043
Token never
Token ID1239
Feature activation+1.71e+00

Pos logit contributions
tr+0.044
ometer+0.040
ute+0.034
br+0.034
t+0.034
Neg logit contributions
atted-0.034
lee-0.032
 asking-0.032
ered-0.032
ice-0.031
Token forgot
Token ID16453
Feature activation+1.87e+00

Pos logit contributions
atted+0.036
ered+0.036
lee+0.035
els+0.035
 asking+0.034
Neg logit contributions
tr-0.059
ometer-0.054
t-0.047
 steep-0.046
br-0.045
Token it
Token ID340
Feature activation-6.49e+00

Pos logit contributions
atted+0.041
lee+0.040
ered+0.040
 asking+0.040
 shrugged+0.038
Neg logit contributions
tr-0.063
ometer-0.059
ute-0.050
br-0.048
pre-0.048
Token.
Token ID13
Feature activation-4.36e+00

Pos logit contributions
tr+0.178
ometer+0.161
ute+0.131
t+0.128
br+0.126
Neg logit contributions
atted-0.188
lee-0.186
ered-0.181
 asking-0.181
 asks-0.174
Token<|endoftext|>
Token ID50256
Feature activation+6.23e+00

Pos logit contributions
tr+0.140
ometer+0.130
ute+0.113
pre+0.108
br+0.106
Neg logit contributions
lee-0.111
atted-0.105
 asking-0.104
 Laura-0.100
 asks-0.100
TokenJan
Token ID12128
Feature activation+1.38e+01

Pos logit contributions
lee+0.190
atted+0.189
ered+0.186
 asking+0.186
 shrugged+0.179
Neg logit contributions
tr-0.161
ometer-0.146
ute-0.118
t-0.113
br-0.112
Tokenie
Token ID494
Feature activation+8.37e+00

Pos logit contributions
 asking+0.270
 begging+0.238
 begged+0.232
atted+0.226
 asks+0.223
Neg logit contributions
tr-0.461
ometer-0.396
 steep-0.362
ute-0.360
ite-0.354
Token and
Token ID290
Feature activation+5.47e+00

Pos logit contributions
ered+0.231
els+0.224
atted+0.224
 asking+0.214
eling+0.213
Neg logit contributions
tr-0.218
ometer-0.196
pre-0.162
ute-0.158
ding-0.157
Token John
Token ID1757
Feature activation+1.02e+01

Pos logit contributions
atted+0.092
lee+0.090
ered+0.084
 asking+0.083
ice+0.082
Neg logit contributions
tr-0.216
ometer-0.202
ute-0.179
pre-0.175
t-0.170
Token were
Token ID547
Feature activation+5.77e+00

Pos logit contributions
lee+0.291
 asking+0.288
atted+0.286
ered+0.274
 Laura+0.270
Neg logit contributions
tr-0.268
ometer-0.233
ding-0.199
pre-0.195
t-0.187
Token best
Token ID1266
Feature activation+3.30e+00

Pos logit contributions
ered+0.170
lee+0.161
 shrugged+0.158
eling+0.156
atted+0.155
Neg logit contributions
tr-0.150
ometer-0.137
ute-0.122
ding-0.111
 cho-0.106
Token still
Token ID991
Feature activation-1.73e+00

Pos logit contributions
tr+0.183
ometer+0.170
ute+0.141
pre+0.137
br+0.134
Neg logit contributions
lee-0.151
atted-0.149
 asking-0.149
ered-0.148
 shrugged-0.140
Token bl
Token ID698
Feature activation-4.14e+00

Pos logit contributions
tr+0.049
ometer+0.045
ute+0.037
pre+0.036
t+0.035
Neg logit contributions
lee-0.048
atted-0.048
 asking-0.047
ered-0.046
 shrugged-0.045
Tokenossom
Token ID25548
Feature activation-4.80e+00

Pos logit contributions
tr+0.102
ometer+0.087
 steep+0.070
 bomb+0.068
storm+0.068
Neg logit contributions
lee-0.137
atted-0.134
ered-0.127
 asking-0.120
ice-0.117
Token.
Token ID13
Feature activation-3.94e+00

Pos logit contributions
tr+0.147
ometer+0.136
ute+0.114
pre+0.110
br+0.108
Neg logit contributions
lee-0.123
atted-0.122
 asking-0.121
ered-0.120
 shrugged-0.114
Token<|endoftext|>
Token ID50256
Feature activation+6.47e+00

Pos logit contributions
tr+0.126
ometer+0.115
ute+0.102
pre+0.099
br+0.097
Neg logit contributions
lee-0.098
atted-0.094
 asking-0.094
 asked-0.090
ered-0.089
TokenSus
Token ID30746
Feature activation+1.37e+01

Pos logit contributions
lee+0.199
atted+0.199
ered+0.195
 asking+0.195
 shrugged+0.188
Neg logit contributions
tr-0.166
ometer-0.151
ute-0.120
t-0.116
br-0.115
Tokenie
Token ID494
Feature activation+8.99e+00

Pos logit contributions
 begged+0.351
 hesitation+0.350
 convincing+0.341
 tries+0.339
 Eva+0.337
Neg logit contributions
tr-0.323
ute-0.283
ometer-0.282
ding-0.255
ite-0.251
Token and
Token ID290
Feature activation+6.57e+00

Pos logit contributions
els+0.259
ered+0.252
atted+0.249
ps+0.243
ries+0.240
Neg logit contributions
tr-0.216
ometer-0.194
 je-0.157
 road-0.154
ding-0.153
Token her
Token ID607
Feature activation+6.29e+00

Pos logit contributions
atted+0.126
ice+0.119
eling+0.112
lee+0.112
ered+0.110
Neg logit contributions
tr-0.238
ometer-0.220
ute-0.198
 bike-0.195
 bicycle-0.192
Token mom
Token ID1995
Feature activation+7.50e+00

Pos logit contributions
lee+0.140
atted+0.136
 asking+0.135
ered+0.131
 shrugged+0.127
Neg logit contributions
tr-0.215
ometer-0.201
ute-0.171
t-0.167
br-0.167
Token were
Token ID547
Feature activation+6.59e+00

Pos logit contributions
gging+0.192
ered+0.187
lee+0.183
ps+0.181
 hes+0.181
Neg logit contributions
tr-0.229
 je-0.175
ometer-0.172
ding-0.169
ite-0.155
Token exploring
Token ID13504
Feature activation+3.26e+00

Pos logit contributions
tr+0.123
ometer+0.114
ute+0.099
br+0.095
pre+0.095
Neg logit contributions
lee-0.090
ered-0.090
atted-0.090
 asking-0.085
 shrugged-0.084
Token the
Token ID262
Feature activation-3.76e+00

Pos logit contributions
atted+0.061
lee+0.061
ered+0.058
 asking+0.058
 shrugged+0.055
Neg logit contributions
tr-0.121
ometer-0.114
ute-0.100
br-0.097
pre-0.097
Token park
Token ID3952
Feature activation-3.07e+00

Pos logit contributions
tr+0.107
ometer+0.104
t+0.084
ute+0.083
pre+0.080
Neg logit contributions
 asking-0.102
lee-0.099
 asked-0.095
 asks-0.094
ma-0.094
Token.
Token ID13
Feature activation-3.54e+00

Pos logit contributions
tr+0.093
ometer+0.086
ute+0.072
pre+0.069
t+0.069
Neg logit contributions
lee-0.080
atted-0.079
ered-0.078
 asking-0.077
 shrugged-0.074
Token<|endoftext|>
Token ID50256
Feature activation+6.01e+00

Pos logit contributions
tr+0.109
ometer+0.101
ute+0.088
pre+0.086
br+0.083
Neg logit contributions
lee-0.090
 asking-0.089
atted-0.089
 asks-0.085
ered-0.085
TokenKat
Token ID25881
Feature activation+1.37e+01

Pos logit contributions
lee+0.184
atted+0.183
 asking+0.180
ered+0.180
 shrugged+0.174
Neg logit contributions
tr-0.154
ometer-0.140
ute-0.113
t-0.108
pre-0.107
Tokenie
Token ID494
Feature activation+9.20e+00

Pos logit contributions
 hesitation+0.304
 tries+0.297
 begging+0.296
 convincing+0.292
 begged+0.289
Neg logit contributions
tr-0.358
ometer-0.346
ute-0.303
 je-0.288
ding-0.283
Token was
Token ID373
Feature activation+6.83e+00

Pos logit contributions
els+0.259
ered+0.255
atted+0.251
ps+0.249
eling+0.245
Neg logit contributions
tr-0.219
ometer-0.196
 je-0.171
 road-0.165
ute-0.160
Token 3
Token ID513
Feature activation+7.88e+00

Pos logit contributions
lee+0.175
ered+0.174
els+0.169
eling+0.167
atted+0.167
Neg logit contributions
tr-0.204
ometer-0.175
ute-0.166
ding-0.148
 much-0.146
Token years
Token ID812
Feature activation+3.50e+00

Pos logit contributions
 asks+0.198
ered+0.193
 asking+0.193
 Laura+0.185
 shrugged+0.184
Neg logit contributions
tr-0.184
ometer-0.169
 much-0.161
powers-0.151
 road-0.148
Token old
Token ID1468
Feature activation+6.30e+00

Pos logit contributions
ered+0.095
atted+0.090
eling+0.088
 shrugged+0.087
 begged+0.086
Neg logit contributions
ometer-0.096
tr-0.094
ute-0.079
 steep-0.076
 road-0.073
Token shine
Token ID18340
Feature activation-2.95e+00

Pos logit contributions
tr+0.177
ometer+0.169
ute+0.136
pre+0.133
t+0.125
Neg logit contributions
ered-0.164
lee-0.159
 asking-0.158
atted-0.156
 asked-0.150
Token again
Token ID757
Feature activation-2.18e+00

Pos logit contributions
tr+0.092
ometer+0.086
ute+0.071
pre+0.069
powers+0.067
Neg logit contributions
lee-0.075
ered-0.075
atted-0.073
 asking-0.073
 asked-0.069
Token.
Token ID13
Feature activation-3.49e+00

Pos logit contributions
tr+0.062
ometer+0.057
ute+0.047
pre+0.045
br+0.045
Neg logit contributions
lee-0.060
atted-0.060
 asking-0.059
ered-0.059
 shrugged-0.057
Token<|endoftext|>
Token ID50256
Feature activation+6.30e+00

Pos logit contributions
tr+0.106
ometer+0.098
ute+0.081
pre+0.078
t+0.077
Neg logit contributions
lee-0.091
atted-0.090
 asking-0.089
ered-0.089
 shrugged-0.085
TokenM
Token ID44
Feature activation+9.83e+00

Pos logit contributions
lee+0.189
atted+0.188
 asking+0.185
ered+0.185
 shrugged+0.177
Neg logit contributions
tr-0.166
ometer-0.152
ute-0.122
pre-0.117
t-0.116
Tokenagg
Token ID9460
Feature activation+1.37e+01

Pos logit contributions
 asking+0.218
ered+0.211
atted+0.208
lee+0.202
 shrugged+0.200
Neg logit contributions
tr-0.330
ometer-0.311
ute-0.271
t-0.266
pre-0.250
Tokenie
Token ID494
Feature activation+9.55e+00

Pos logit contributions
 Jennifer+0.354
 Michelle+0.335
 asking+0.332
 hesitation+0.315
 pleading+0.312
Neg logit contributions
tr-0.288
ute-0.285
ometer-0.282
t-0.262
pre-0.252
Token was
Token ID373
Feature activation+6.97e+00

Pos logit contributions
ered+0.260
els+0.256
atted+0.253
 asking+0.240
ps+0.239
Neg logit contributions
tr-0.250
ometer-0.224
ding-0.186
 steep-0.182
 cho-0.181
Token a
Token ID257
Feature activation-2.29e-01

Pos logit contributions
ered+0.198
lee+0.197
atted+0.193
els+0.189
 shrugged+0.182
Neg logit contributions
tr-0.192
ometer-0.163
ute-0.147
ding-0.136
 steep-0.133
Token little
Token ID1310
Feature activation+4.27e+00

Pos logit contributions
tr+0.009
ometer+0.008
t+0.007
ute+0.007
ometers+0.007
Neg logit contributions
lee-0.004
 asked-0.004
 asking-0.004
 Laura-0.004
 shrugged-0.004
Token girl
Token ID2576
Feature activation+3.38e+00

Pos logit contributions
 asking+0.096
 asked+0.094
lee+0.093
 shrugged+0.093
 asks+0.093
Neg logit contributions
tr-0.148
ometer-0.133
t-0.116
br-0.114
ute-0.112
Token of
Token ID286
Feature activation-1.62e-01

Pos logit contributions
tr+0.168
ometer+0.153
br+0.127
ute+0.126
t+0.126
Neg logit contributions
lee-0.145
atted-0.144
ered-0.143
 asking-0.143
 shrugged-0.136
Token love
Token ID1842
Feature activation+5.42e-02

Pos logit contributions
tr+0.003
ometer+0.002
powers+0.002
t+0.002
br+0.002
Neg logit contributions
atted-0.006
lee-0.006
ered-0.006
 asking-0.006
els-0.006
Token!
Token ID0
Feature activation-4.21e+00

Pos logit contributions
atted+0.002
lee+0.002
ered+0.002
 asking+0.002
 shrugged+0.001
Neg logit contributions
tr-0.001
ometer-0.001
ute-0.001
t-0.001
pre-0.001
Token<|endoftext|>
Token ID50256
Feature activation+6.37e+00

Pos logit contributions
tr+0.137
ometer+0.131
ute+0.117
powers+0.111
pre+0.111
Neg logit contributions
lee-0.097
 asking-0.094
atted-0.090
 asked-0.088
 Laura-0.087
TokenM
Token ID44
Feature activation+9.65e+00

Pos logit contributions
atted+0.196
lee+0.196
ered+0.193
 asking+0.192
 shrugged+0.185
Neg logit contributions
tr-0.164
ometer-0.148
ute-0.118
t-0.114
br-0.114
Tokenagg
Token ID9460
Feature activation+1.37e+01

Pos logit contributions
 asking+0.221
ered+0.218
atted+0.217
lee+0.212
 shrugged+0.208
Neg logit contributions
tr-0.319
ometer-0.297
ute-0.255
t-0.250
pre-0.241
Tokenie
Token ID494
Feature activation+8.65e+00

Pos logit contributions
 Jennifer+0.376
 asking+0.370
 Michelle+0.356
 pleading+0.344
 begging+0.343
Neg logit contributions
ute-0.248
tr-0.247
ometer-0.247
t-0.233
bye-0.225
Token was
Token ID373
Feature activation+6.29e+00

Pos logit contributions
ered+0.243
els+0.236
atted+0.229
eling+0.225
 hesitation+0.220
Neg logit contributions
tr-0.225
ometer-0.197
pre-0.164
ding-0.164
 cho-0.163
Token so
Token ID523
Feature activation-3.72e-01

Pos logit contributions
lee+0.161
ered+0.159
atted+0.156
els+0.150
 shrugged+0.146
Neg logit contributions
tr-0.195
ometer-0.168
ute-0.151
pre-0.141
 Whe-0.139
Token excited
Token ID6568
Feature activation+7.86e+00

Pos logit contributions
tr+0.011
ometer+0.011
t+0.009
ute+0.009
pre+0.009
Neg logit contributions
ered-0.009
ice-0.009
els-0.009
atted-0.009
 asking-0.009
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
lee+0.232
atted+0.231
ered+0.228
 asking+0.227
 shrugged+0.218
Neg logit contributions
tr-0.210
ometer-0.192
ute-0.155
pre-0.148
t-0.148
Token so
Token ID523
Feature activation-6.45e+00

Pos logit contributions
tr+0.065
ometer+0.060
 steep+0.048
ute+0.047
br+0.047
Neg logit contributions
atted-0.071
lee-0.069
ered-0.067
els-0.065
 asking-0.064
Token mysterious
Token ID11428
Feature activation-1.19e+00

Pos logit contributions
tr+0.155
ometer+0.144
t+0.111
ute+0.111
 steep+0.111
Neg logit contributions
atted-0.202
lee-0.200
 asking-0.197
ered-0.197
ice-0.189
Token anymore
Token ID7471
Feature activation-4.58e+00

Pos logit contributions
tr+0.036
ometer+0.033
ute+0.027
pre+0.026
br+0.026
Neg logit contributions
lee-0.031
atted-0.031
 asking-0.031
ered-0.030
 shrugged-0.029
Token!
Token ID0
Feature activation-3.41e+00

Pos logit contributions
tr+0.130
ometer+0.120
ute+0.098
pre+0.094
t+0.093
Neg logit contributions
lee-0.127
atted-0.127
ered-0.124
 asking-0.124
 shrugged-0.119
Token<|endoftext|>
Token ID50256
Feature activation+6.41e+00

Pos logit contributions
tr+0.105
ometer+0.095
ute+0.081
pre+0.079
br+0.077
Neg logit contributions
lee-0.090
atted-0.087
 asking-0.086
 asks-0.083
ered-0.083
TokenPat
Token ID12130
Feature activation+1.37e+01

Pos logit contributions
lee+0.189
atted+0.189
ered+0.186
 asking+0.185
 shrugged+0.179
Neg logit contributions
tr-0.171
ometer-0.156
ute-0.127
br-0.122
t-0.122
Tokenty
Token ID774
Feature activation+1.03e+01

Pos logit contributions
 asking+0.311
 sisters+0.268
 Laura+0.263
 begged+0.261
atted+0.256
Neg logit contributions
tr-0.425
ometer-0.398
ute-0.368
t-0.353
ite-0.343
Token was
Token ID373
Feature activation+6.99e+00

Pos logit contributions
atted+0.292
 hesitation+0.291
ered+0.289
 asking+0.289
els+0.284
Neg logit contributions
tr-0.246
ometer-0.214
pre-0.178
 je-0.176
 road-0.175
Token a
Token ID257
Feature activation-3.06e-01

Pos logit contributions
lee+0.185
ered+0.183
atted+0.180
els+0.177
ice+0.173
Neg logit contributions
tr-0.201
ometer-0.174
ute-0.163
 steep-0.145
 much-0.145
Token creative
Token ID7325
Feature activation+3.38e+00

Pos logit contributions
tr+0.012
ometer+0.011
t+0.010
ute+0.009
br+0.009
Neg logit contributions
lee-0.006
 asking-0.005
 asked-0.005
 shrugged-0.005
 Laura-0.005
Token 3
Token ID513
Feature activation+3.45e+00

Pos logit contributions
ered+0.093
atted+0.093
lee+0.092
 asking+0.091
 begged+0.085
Neg logit contributions
tr-0.097
ometer-0.090
ute-0.076
pre-0.072
br-0.070
Token squash
Token ID34613
Feature activation-5.68e+00

Pos logit contributions
tr+0.137
ometer+0.124
br+0.110
powers+0.109
ute+0.104
Neg logit contributions
lee-0.173
ered-0.172
atted-0.165
eled-0.156
eling-0.155
Token."
Token ID526
Feature activation-1.16e+00

Pos logit contributions
tr+0.166
ometer+0.154
ute+0.127
pre+0.121
br+0.119
Neg logit contributions
lee-0.152
 asking-0.151
atted-0.151
ered-0.148
 shrugged-0.142
Token<|endoftext|>
Token ID50256
Feature activation+6.71e+00

Pos logit contributions
tr+0.036
ometer+0.032
ute+0.027
pre+0.026
br+0.026
Neg logit contributions
lee-0.031
 asking-0.030
atted-0.030
 asks-0.030
 asked-0.029
Token<|endoftext|>
Token ID50256
Feature activation+8.53e+00

Pos logit contributions
lee+0.190
atted+0.188
 asking+0.186
ered+0.186
 shrugged+0.178
Neg logit contributions
tr-0.188
ometer-0.173
ute-0.142
pre-0.136
br-0.135
Token<|endoftext|>
Token ID50256
Feature activation+9.32e+00

Pos logit contributions
lee+0.265
atted+0.264
ered+0.262
 asking+0.255
 shrugged+0.252
Neg logit contributions
tr-0.216
ometer-0.192
ute-0.158
br-0.154
pre-0.151
TokenL
Token ID43
Feature activation+1.36e+01

Pos logit contributions
atted+0.309
lee+0.307
ered+0.306
 shrugged+0.295
 asking+0.293
Neg logit contributions
tr-0.216
ometer-0.184
br-0.155
ute-0.152
pre-0.147
Tokenily
Token ID813
Feature activation+1.21e+01

Pos logit contributions
eling+0.317
atted+0.312
 shrugged+0.309
ered+0.299
ened+0.297
Neg logit contributions
tr-0.382
ork-0.359
ute-0.344
t-0.342
ometer-0.339
Token and
Token ID290
Feature activation+8.55e+00

Pos logit contributions
ered+0.456
eling+0.452
atted+0.420
ps+0.402
 hesitation+0.402
Neg logit contributions
tr-0.256
ometer-0.194
 road-0.154
 bike-0.142
 steep-0.138
Token Sam
Token ID3409
Feature activation+1.25e+01

Pos logit contributions
atted+0.175
lee+0.173
ice+0.165
ered+0.161
 asking+0.157
Neg logit contributions
tr-0.302
ometer-0.273
ute-0.245
pre-0.241
t-0.232
Token went
Token ID1816
Feature activation+8.85e+00

Pos logit contributions
 asking+0.373
lee+0.364
atted+0.363
ered+0.353
eling+0.334
Neg logit contributions
tr-0.314
ometer-0.259
pre-0.239
t-0.215
ding-0.214
Token on
Token ID319
Feature activation+7.90e+00

Pos logit contributions
lee+0.259
atted+0.258
 asking+0.256
ered+0.253
 asks+0.241
Neg logit contributions
tr-0.238
ometer-0.219
ute-0.176
pre-0.171
br-0.166
Token your
Token ID534
Feature activation-2.00e+00

Pos logit contributions
lee+0.041
atted+0.041
ered+0.040
 asking+0.040
 shrugged+0.038
Neg logit contributions
tr-0.053
ometer-0.050
ute-0.042
pre-0.040
t-0.040
Token heart
Token ID2612
Feature activation-3.30e+00

Pos logit contributions
tr+0.069
ometer+0.063
ute+0.053
br+0.053
pre+0.052
Neg logit contributions
lee-0.046
atted-0.045
 asking-0.045
ered-0.044
 shrugged-0.042
Token.
Token ID13
Feature activation-4.47e+00

Pos logit contributions
tr+0.098
ometer+0.090
ute+0.075
pre+0.073
br+0.071
Neg logit contributions
lee-0.087
atted-0.087
 asking-0.086
ered-0.086
 shrugged-0.081
Token<|endoftext|>
Token ID50256
Feature activation+6.41e+00

Pos logit contributions
tr+0.134
ometer+0.121
ute+0.105
pre+0.102
br+0.099
Neg logit contributions
lee-0.118
 asking-0.117
atted-0.116
 asks-0.114
ered-0.113
TokenM
Token ID44
Feature activation+9.57e+00

Pos logit contributions
lee+0.188
atted+0.186
 asking+0.184
ered+0.184
 shrugged+0.176
Neg logit contributions
tr-0.174
ometer-0.159
ute-0.129
pre-0.123
br-0.122
Tokenagg
Token ID9460
Feature activation+1.36e+01

Pos logit contributions
 asking+0.227
atted+0.226
lee+0.225
ered+0.224
 shrugged+0.214
Neg logit contributions
tr-0.308
ometer-0.290
ute-0.245
t-0.237
pre-0.232
Tokenie
Token ID494
Feature activation+8.97e+00

Pos logit contributions
 Jennifer+0.300
 asking+0.299
 Michelle+0.287
 hesitation+0.276
 begging+0.265
Neg logit contributions
ometer-0.351
tr-0.341
ute-0.335
t-0.320
pre-0.310
Token and
Token ID290
Feature activation+5.77e+00

Pos logit contributions
ered+0.248
atted+0.237
els+0.236
eling+0.229
lee+0.226
Neg logit contributions
tr-0.238
ometer-0.213
ding-0.177
pre-0.175
ite-0.173
Token her
Token ID607
Feature activation+6.23e+00

Pos logit contributions
atted+0.125
ice+0.119
eling+0.118
lee+0.117
ered+0.117
Neg logit contributions
tr-0.194
ometer-0.181
ute-0.155
 bike-0.151
pre-0.150
Token brother
Token ID3956
Feature activation+6.01e+00

Pos logit contributions
lee+0.128
atted+0.123
 asking+0.122
ered+0.118
 shrugged+0.115
Neg logit contributions
tr-0.226
ometer-0.211
ute-0.181
br-0.177
t-0.177
Token Jack
Token ID3619
Feature activation+9.67e+00

Pos logit contributions
ered+0.156
atted+0.151
els+0.149
eling+0.143
lee+0.143
Neg logit contributions
tr-0.189
ometer-0.158
 cho-0.135
 je-0.130
ding-0.130
Token The
Token ID383
Feature activation+1.55e+00

Pos logit contributions
tr+0.109
ometer+0.101
ute+0.081
t+0.080
pre+0.077
Neg logit contributions
ered-0.093
atted-0.091
lee-0.091
 asking-0.090
eling-0.087
Token end
Token ID886
Feature activation-3.08e+00

Pos logit contributions
lee+0.041
atted+0.040
 asking+0.040
ered+0.039
 shrugged+0.038
Neg logit contributions
tr-0.048
ometer-0.044
ute-0.037
pre-0.035
br-0.035
Token.
Token ID13
Feature activation-3.78e+00

Pos logit contributions
tr+0.104
ometer+0.101
ute+0.091
pre+0.091
ding+0.087
Neg logit contributions
 asking-0.064
lee-0.063
atted-0.061
ered-0.060
 Laura-0.059
Token<|endoftext|>
Token ID50256
Feature activation+6.08e+00

Pos logit contributions
tr+0.116
ometer+0.108
ute+0.095
pre+0.092
br+0.088
Neg logit contributions
lee-0.095
 asking-0.095
atted-0.093
 Laura-0.092
 asked-0.091
TokenM
Token ID44
Feature activation+9.60e+00

Pos logit contributions
lee+0.183
atted+0.182
 asking+0.180
ered+0.179
 shrugged+0.172
Neg logit contributions
tr-0.160
ometer-0.146
ute-0.117
pre-0.112
t-0.111
Tokenagg
Token ID9460
Feature activation+1.36e+01

Pos logit contributions
 asking+0.229
atted+0.221
ered+0.221
lee+0.217
 shrugged+0.211
Neg logit contributions
tr-0.312
ometer-0.291
ute-0.249
t-0.244
br-0.230
Tokenie
Token ID494
Feature activation+9.12e+00

Pos logit contributions
 Jennifer+0.365
 Michelle+0.352
 asking+0.347
 hesitation+0.338
 pleading+0.335
Neg logit contributions
ute-0.262
tr-0.251
ometer-0.248
t-0.225
pre-0.224
Token was
Token ID373
Feature activation+6.80e+00

Pos logit contributions
ered+0.257
els+0.253
atted+0.248
ps+0.240
 hesitation+0.237
Neg logit contributions
tr-0.223
ometer-0.198
 je-0.174
ding-0.168
 road-0.168
Token a
Token ID257
Feature activation-3.98e-01

Pos logit contributions
lee+0.189
ered+0.183
atted+0.180
els+0.178
ice+0.173
Neg logit contributions
tr-0.195
ometer-0.164
ute-0.153
ding-0.137
 much-0.135
Token brave
Token ID14802
Feature activation+3.78e+00

Pos logit contributions
tr+0.015
ometer+0.014
t+0.012
ute+0.012
ometers+0.012
Neg logit contributions
lee-0.007
 asked-0.007
 asking-0.007
 Laura-0.007
 shrugged-0.007
Token girl
Token ID2576
Feature activation+2.84e+00

Pos logit contributions
ered+0.101
atted+0.100
lee+0.099
 asking+0.098
 shrugged+0.091
Neg logit contributions
tr-0.112
ometer-0.105
ute-0.087
pre-0.084
br-0.084
Token lot
Token ID1256
Feature activation-5.73e+00

Pos logit contributions
tr+0.206
ometer+0.201
ometers+0.189
ding+0.178
ANG+0.177
Neg logit contributions
 asking-0.109
 told-0.108
 asked-0.107
 Laura-0.099
 sisters-0.096
Token brighter
Token ID26083
Feature activation-1.02e+00

Pos logit contributions
tr+0.187
ometer+0.175
ute+0.147
pre+0.143
br+0.139
Neg logit contributions
lee-0.136
atted-0.135
 asking-0.134
ered-0.133
 shrugged-0.125
Token.
Token ID13
Feature activation-3.79e+00

Pos logit contributions
tr+0.023
ometer+0.021
ute+0.016
pre+0.015
br+0.015
Neg logit contributions
lee-0.034
atted-0.034
ered-0.033
 asking-0.033
 shrugged-0.032
Token<|endoftext|>
Token ID50256
Feature activation+6.67e+00

Pos logit contributions
tr+0.109
ometer+0.101
ute+0.083
pre+0.080
br+0.079
Neg logit contributions
lee-0.105
atted-0.104
 asking-0.103
ered-0.101
 asks-0.097
TokenS
Token ID50
Feature activation+1.21e+01

Pos logit contributions
lee+0.198
atted+0.196
ered+0.194
 asking+0.193
 shrugged+0.186
Neg logit contributions
tr-0.178
ometer-0.163
ute-0.132
pre-0.126
t-0.125
Tokenoph
Token ID2522
Feature activation+1.35e+01

Pos logit contributions
 asking+0.294
ered+0.289
atted+0.286
 shrugged+0.280
lee+0.273
Neg logit contributions
tr-0.378
ometer-0.342
t-0.304
ute-0.297
ding-0.297
Tokenie
Token ID494
Feature activation+8.83e+00

Pos logit contributions
 asking+0.295
 begging+0.275
 begged+0.273
 tries+0.264
 hesitation+0.259
Neg logit contributions
tr-0.413
ometer-0.387
ute-0.356
ork-0.347
t-0.333
Token and
Token ID290
Feature activation+5.69e+00

Pos logit contributions
ered+0.254
els+0.246
atted+0.244
eling+0.240
 hesitation+0.238
Neg logit contributions
tr-0.227
ometer-0.200
pre-0.164
 je-0.157
 road-0.155
Token her
Token ID607
Feature activation+6.10e+00

Pos logit contributions
atted+0.109
lee+0.103
ice+0.100
ered+0.097
eling+0.097
Neg logit contributions
tr-0.212
ometer-0.194
ute-0.172
pre-0.168
 bike-0.164
Token dad
Token ID9955
Feature activation+6.74e+00

Pos logit contributions
lee+0.128
atted+0.123
 asking+0.123
 shrugged+0.117
ered+0.116
Neg logit contributions
tr-0.216
ometer-0.202
ute-0.173
t-0.170
br-0.170
Token were
Token ID547
Feature activation+6.21e+00

Pos logit contributions
lee+0.201
atted+0.197
ered+0.189
els+0.186
aned+0.185
Neg logit contributions
tr-0.161
ding-0.129
ometer-0.127
 je-0.121
 cho-0.113
Token not
Token ID407
Feature activation-2.18e+00

Pos logit contributions
tr+0.052
ometer+0.045
pre+0.037
Snow+0.037
t+0.037
Neg logit contributions
lee-0.040
atted-0.039
ered-0.037
 Laura-0.036
els-0.036
Token played
Token ID2826
Feature activation-9.14e-01

Pos logit contributions
tr+0.079
ometer+0.069
pre+0.059
t+0.059
br+0.058
Neg logit contributions
lee-0.046
atted-0.045
 Laura-0.044
ered-0.043
 asking-0.042
Token with
Token ID351
Feature activation-1.42e-02

Pos logit contributions
tr+0.030
ometer+0.029
ute+0.024
t+0.023
pre+0.023
Neg logit contributions
 asking-0.021
ered-0.020
lee-0.019
atted-0.019
 asked-0.019
Token the
Token ID262
Feature activation-5.03e+00

Pos logit contributions
tr+0.000
ometer+0.000
ute+0.000
pre+0.000
br+0.000
Neg logit contributions
lee-0.000
atted-0.000
 asking-0.000
ered-0.000
 shrugged-0.000
Token ash
Token ID12530
Feature activation-1.20e+01

Pos logit contributions
tr+0.161
ometer+0.157
ute+0.138
powers+0.129
br+0.129
Neg logit contributions
 asking-0.117
lee-0.112
atted-0.110
ered-0.109
 asked-0.105
Tokentr
Token ID2213
Feature activation-1.31e+01

Pos logit contributions
ute+0.104
powers+0.099
 lifetime+0.092
 sax+0.075
arest+0.075
Neg logit contributions
ma-0.483
 asking-0.475
 asked-0.466
els-0.463
ice-0.458
Tokenay
Token ID323
Feature activation-3.62e+00

Pos logit contributions
ometer+0.305
 villains+0.302
tr+0.301
powers+0.296
ute+0.290
Neg logit contributions
ma-0.368
atted-0.364
els-0.346
lee-0.340
ered-0.338
Token.
Token ID13
Feature activation-5.16e+00

Pos logit contributions
tr+0.110
ometer+0.104
ute+0.091
powers+0.087
pre+0.086
Neg logit contributions
 asking-0.089
ered-0.089
atted-0.089
lee-0.088
ice-0.084
Token They
Token ID1119
Feature activation-1.91e+00

Pos logit contributions
tr+0.171
ometer+0.157
ute+0.142
br+0.141
pre+0.137
Neg logit contributions
 asks-0.117
 asked-0.116
 asking-0.113
lee-0.111
 Laura-0.109
Token wished
Token ID16555
Feature activation+1.68e+00

Pos logit contributions
tr+0.070
ometer+0.066
ute+0.058
pre+0.055
br+0.055
Neg logit contributions
lee-0.037
atted-0.037
 asking-0.037
ered-0.036
 shrugged-0.035
Token they
Token ID484
Feature activation-3.33e+00

Pos logit contributions
lee+0.054
 asking+0.054
atted+0.053
ered+0.052
 shrugged+0.051
Neg logit contributions
tr-0.042
ometer-0.038
ute-0.030
br-0.028
pre-0.028
Token will
Token ID481
Feature activation-2.59e+00

Pos logit contributions
tr+0.100
ometer+0.095
ute+0.081
br+0.076
ding+0.075
Neg logit contributions
atted-0.078
ered-0.078
 asking-0.077
lee-0.077
 asked-0.076
Token not
Token ID407
Feature activation-1.47e+00

Pos logit contributions
tr+0.079
ometer+0.073
ute+0.061
pre+0.058
br+0.058
Neg logit contributions
lee-0.068
atted-0.067
 asking-0.066
ered-0.066
 shrugged-0.063
Token touch
Token ID3638
Feature activation-4.13e-01

Pos logit contributions
tr+0.046
ometer+0.045
ute+0.038
pre+0.037
br+0.036
Neg logit contributions
 asking-0.035
lee-0.035
ered-0.035
 asked-0.034
 begged-0.033
Token the
Token ID262
Feature activation-5.22e+00

Pos logit contributions
tr+0.015
ometer+0.014
ute+0.012
pre+0.012
t+0.011
Neg logit contributions
lee-0.009
 asking-0.008
atted-0.008
ered-0.008
 asks-0.008
Token ash
Token ID12530
Feature activation-1.17e+01

Pos logit contributions
ometer+0.168
tr+0.161
ute+0.157
powers+0.146
t+0.143
Neg logit contributions
 asking-0.113
els-0.100
ice-0.099
 asked-0.099
lee-0.098
Tokentr
Token ID2213
Feature activation-1.27e+01

Pos logit contributions
powers+0.094
 lifetime+0.091
ute+0.081
 villains+0.073
 sax+0.067
Neg logit contributions
ma-0.473
 asking-0.465
els-0.465
ice-0.460
ered-0.459
Tokenay
Token ID323
Feature activation-3.43e+00

Pos logit contributions
 villains+0.321
powers+0.305
ometer+0.293
fire+0.291
 steep+0.287
Neg logit contributions
ma-0.349
atted-0.340
els-0.334
ered-0.317
ries-0.314
Token again
Token ID757
Feature activation-4.72e+00

Pos logit contributions
tr+0.107
ometer+0.101
ute+0.089
powers+0.085
 steep+0.085
Neg logit contributions
ered-0.082
 asking-0.081
atted-0.080
lee-0.078
ice-0.077
Token,"
Token ID553
Feature activation-2.81e-01

Pos logit contributions
tr+0.120
ometer+0.111
ute+0.088
t+0.086
 steep+0.084
Neg logit contributions
atted-0.145
lee-0.144
ered-0.140
 asking-0.137
 shrugged-0.136
Token Tim
Token ID5045
Feature activation+6.61e+00

Pos logit contributions
tr+0.008
ometer+0.008
ute+0.006
pre+0.006
br+0.006
Neg logit contributions
lee-0.008
atted-0.007
 asking-0.007
ered-0.007
 shrugged-0.007
Token says
Token ID1139
Feature activation+2.19e+00

Pos logit contributions
lee+0.148
ered+0.142
atted+0.138
eled+0.134
 asking+0.129
Neg logit contributions
tr-0.228
ometer-0.207
powers-0.169
 steep-0.165
ute-0.162
Token not
Token ID407
Feature activation-7.95e-01

Pos logit contributions
tr+0.028
ometer+0.025
ute+0.020
Snow+0.019
t+0.019
Neg logit contributions
atted-0.033
lee-0.033
 asking-0.031
 shrugged-0.030
ered-0.030
Token play
Token ID711
Feature activation-1.22e+00

Pos logit contributions
tr+0.025
ometer+0.025
ute+0.021
pre+0.020
br+0.020
Neg logit contributions
ered-0.019
 asking-0.018
 asked-0.018
lee-0.018
 asks-0.018
Token with
Token ID351
Feature activation+1.73e-01

Pos logit contributions
ometer+0.043
tr+0.042
t+0.036
ute+0.035
pre+0.035
Neg logit contributions
 asking-0.025
ered-0.024
 asked-0.023
 asks-0.022
els-0.022
Token the
Token ID262
Feature activation-4.86e+00

Pos logit contributions
lee+0.004
atted+0.004
 shrugged+0.004
ered+0.004
 asking+0.004
Neg logit contributions
tr-0.005
ometer-0.005
ute-0.004
pre-0.004
br-0.004
Token ash
Token ID12530
Feature activation-1.17e+01

Pos logit contributions
tr+0.155
ometer+0.151
ute+0.133
br+0.127
t+0.127
Neg logit contributions
 asking-0.111
atted-0.103
ered-0.102
lee-0.101
 asks-0.101
Tokentr
Token ID2213
Feature activation-1.25e+01

Pos logit contributions
ometer+0.114
 lifetime+0.107
ute+0.098
powers+0.094
Snow+0.092
Neg logit contributions
ma-0.442
els-0.428
 asking-0.428
ice-0.426
ered-0.417
Tokenay
Token ID323
Feature activation-3.46e+00

Pos logit contributions
 villains+0.315
ometer+0.309
powers+0.301
fire+0.299
Snow+0.294
Neg logit contributions
ma-0.336
atted-0.314
els-0.309
ered-0.293
ries-0.288
Token.
Token ID13
Feature activation-6.12e+00

Pos logit contributions
tr+0.109
ometer+0.105
ute+0.091
pre+0.089
powers+0.087
Neg logit contributions
ered-0.080
 asking-0.079
atted-0.078
lee-0.077
ma-0.076
Token It
Token ID632
Feature activation-6.55e+00

Pos logit contributions
tr+0.165
ometer+0.151
ute+0.127
br+0.120
t+0.119
Neg logit contributions
lee-0.177
 asking-0.173
atted-0.173
ered-0.170
 asks-0.166
Token is
Token ID318
Feature activation-3.49e+00

Pos logit contributions
tr+0.214
ometer+0.204
ute+0.177
pre+0.167
br+0.165
Neg logit contributions
ered-0.148
atted-0.148
lee-0.147
 asking-0.147
 asks-0.139
Token dangerous
Token ID4923
Feature activation-3.16e+00

Pos logit contributions
tr+0.103
ometer+0.098
ute+0.086
pre+0.081
t+0.080
Neg logit contributions
 asking-0.093
lee-0.089
ered-0.087
atted-0.087
 asked-0.085
Token wanted
Token ID2227
Feature activation+1.43e+00

Pos logit contributions
lee+0.043
atted+0.043
ered+0.042
els+0.042
 asking+0.041
Neg logit contributions
tr-0.056
ometer-0.054
t-0.043
 steep-0.043
pre-0.042
Token to
Token ID284
Feature activation-1.18e+00

Pos logit contributions
atted+0.046
lee+0.046
ered+0.045
 shrugged+0.043
 asking+0.043
Neg logit contributions
tr-0.034
ometer-0.031
t-0.023
ute-0.023
pre-0.022
Token keep
Token ID1394
Feature activation+8.73e-01

Pos logit contributions
tr+0.035
ometer+0.032
ute+0.027
t+0.026
pre+0.026
Neg logit contributions
atted-0.032
lee-0.032
 asking-0.031
ered-0.030
 shrugged-0.029
Token the
Token ID262
Feature activation-5.48e+00

Pos logit contributions
atted+0.023
 shrugged+0.022
lee+0.021
ited+0.021
ere+0.020
Neg logit contributions
ometer-0.022
tr-0.021
 license-0.018
 job-0.018
pre-0.017
Token ash
Token ID12530
Feature activation-1.14e+01

Pos logit contributions
tr+0.170
ometer+0.163
ute+0.143
br+0.135
ding+0.134
Neg logit contributions
 asking-0.133
ered-0.130
lee-0.128
atted-0.127
 asked-0.121
Tokentr
Token ID2213
Feature activation-1.24e+01

Pos logit contributions
ute+0.180
 sax+0.175
 Whe+0.175
powers+0.171
Ready+0.165
Neg logit contributions
ered-0.386
ma-0.371
 asking-0.363
 her-0.357
 asked-0.350
Tokenay
Token ID323
Feature activation-2.96e+00

Pos logit contributions
tr+0.285
ometer+0.279
 villains+0.276
powers+0.267
ute+0.263
Neg logit contributions
ma-0.357
atted-0.342
ered-0.339
els-0.325
lee-0.324
Token.
Token ID13
Feature activation-3.92e+00

Pos logit contributions
tr+0.093
ometer+0.087
ute+0.076
pre+0.073
powers+0.071
Neg logit contributions
ered-0.071
 asking-0.070
atted-0.069
lee-0.069
ma-0.065
Token So
Token ID1406
Feature activation+8.50e-01

Pos logit contributions
tr+0.134
ometer+0.123
ute+0.114
br+0.113
pre+0.112
Neg logit contributions
lee-0.089
atted-0.084
 asking-0.082
 asks-0.080
 asked-0.078
Token he
Token ID339
Feature activation+8.63e-01

Pos logit contributions
 asking+0.014
lee+0.014
atted+0.014
ered+0.014
 asked+0.013
Neg logit contributions
tr-0.034
ometer-0.032
ute-0.028
pre-0.028
br-0.027
Token tried
Token ID3088
Feature activation+1.56e+00

Pos logit contributions
atted+0.019
 Laura+0.019
lee+0.019
eling+0.019
 asking+0.018
Neg logit contributions
tr-0.029
ometer-0.026
pre-0.021
Snow-0.021
br-0.021
Token should
Token ID815
Feature activation-1.65e+00

Pos logit contributions
tr+0.050
ometer+0.046
ute+0.039
pre+0.037
br+0.036
Neg logit contributions
lee-0.049
atted-0.049
ered-0.049
 asking-0.049
 shrugged-0.047
Token never
Token ID1239
Feature activation+7.50e-01

Pos logit contributions
tr+0.045
ometer+0.040
ute+0.033
Snow+0.031
t+0.031
Neg logit contributions
atted-0.049
lee-0.049
ered-0.045
 shrugged-0.045
 asking-0.045
Token touch
Token ID3638
Feature activation-7.73e-01

Pos logit contributions
lee+0.019
atted+0.019
 asking+0.019
ered+0.019
 shrugged+0.018
Neg logit contributions
tr-0.023
ometer-0.022
ute-0.018
pre-0.017
br-0.017
Token an
Token ID281
Feature activation-6.46e+00

Pos logit contributions
tr+0.023
ometer+0.021
ute+0.017
pre+0.017
br+0.016
Neg logit contributions
lee-0.021
atted-0.021
 asking-0.021
ered-0.020
 shrugged-0.020
Token ash
Token ID12530
Feature activation-1.12e+01

Pos logit contributions
tr+0.182
ometer+0.165
ute+0.134
pre+0.129
t+0.125
Neg logit contributions
atted-0.185
lee-0.183
 asking-0.179
ered-0.178
 shrugged-0.174
Tokentr
Token ID2213
Feature activation-1.23e+01

Pos logit contributions
 lifetime+0.076
powers+0.076
 steep+0.069
ometer+0.066
ute+0.061
Neg logit contributions
 asking-0.468
ice-0.452
els-0.447
ma-0.447
 asked-0.444
Tokenay
Token ID323
Feature activation-3.30e+00

Pos logit contributions
 villains+0.305
powers+0.295
fire+0.292
ometer+0.283
 steep+0.281
Neg logit contributions
ma-0.333
atted-0.318
els-0.315
ered-0.308
lee-0.288
Token."
Token ID526
Feature activation-1.79e+00

Pos logit contributions
tr+0.099
ometer+0.093
ute+0.079
 steep+0.077
powers+0.077
Neg logit contributions
 asking-0.085
ered-0.083
atted-0.081
lee-0.081
 asked-0.078
Token
Token ID198
Feature activation-1.91e+00

Pos logit contributions
tr+0.042
ometer+0.039
ute+0.031
pre+0.029
t+0.029
Neg logit contributions
ered-0.058
atted-0.057
lee-0.057
 asking-0.056
 shrugged-0.054
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
tr+0.057
ometer+0.052
ute+0.042
pre+0.042
br+0.041
Neg logit contributions
lee-0.052
atted-0.051
 asking-0.051
ered-0.049
 asks-0.049
TokenS
Token ID50
Feature activation+6.28e+00

Pos logit contributions
tr+0.052
ometer+0.048
ute+0.040
pre+0.039
t+0.038
Neg logit contributions
lee-0.038
atted-0.038
 asking-0.038
ered-0.036
 asks-0.036
Token not
Token ID407
Feature activation-1.72e+00

Pos logit contributions
tr+0.049
ometer+0.042
t+0.034
pre+0.034
 cho+0.034
Neg logit contributions
lee-0.040
atted-0.038
ered-0.036
els-0.035
 asking-0.035
Token played
Token ID2826
Feature activation-6.96e-01

Pos logit contributions
tr+0.060
ometer+0.054
t+0.045
pre+0.045
ute+0.045
Neg logit contributions
lee-0.039
atted-0.038
ered-0.037
 asking-0.036
 Laura-0.035
Token with
Token ID351
Feature activation+3.13e-01

Pos logit contributions
tr+0.021
ometer+0.019
ute+0.016
pre+0.015
br+0.015
Neg logit contributions
 asking-0.018
lee-0.018
atted-0.018
ered-0.018
 shrugged-0.017
Token the
Token ID262
Feature activation-4.27e+00

Pos logit contributions
atted+0.007
lee+0.007
 asking+0.007
ered+0.007
 begged+0.007
Neg logit contributions
tr-0.010
ometer-0.009
ute-0.008
pre-0.008
 steep-0.007
Token ash
Token ID12530
Feature activation-1.14e+01

Pos logit contributions
tr+0.125
ometer+0.116
ute+0.097
pre+0.092
br+0.092
Neg logit contributions
 asking-0.115
lee-0.112
atted-0.112
ered-0.110
 asks-0.107
Tokentr
Token ID2213
Feature activation-1.23e+01

Pos logit contributions
ometer+0.303
tr+0.291
ute+0.287
powers+0.264
Snow+0.261
Neg logit contributions
 asking-0.264
 asked-0.251
atted-0.250
ma-0.247
 shrugged-0.244
Tokenay
Token ID323
Feature activation-3.27e+00

Pos logit contributions
tr+0.289
ometer+0.281
ute+0.261
Snow+0.257
powers+0.256
Neg logit contributions
atted-0.354
ma-0.349
els-0.327
lee-0.327
ered-0.322
Token.
Token ID13
Feature activation-4.71e+00

Pos logit contributions
tr+0.094
ometer+0.087
ute+0.071
pre+0.068
t+0.068
Neg logit contributions
lee-0.091
atted-0.090
 asking-0.089
ered-0.088
 shrugged-0.085
Token They
Token ID1119
Feature activation-1.49e+00

Pos logit contributions
tr+0.157
ometer+0.146
ute+0.134
br+0.129
pre+0.125
Neg logit contributions
lee-0.109
 asked-0.108
 asking-0.104
 asks-0.104
 Laura-0.099
Token wish
Token ID4601
Feature activation-1.09e-02

Pos logit contributions
tr+0.045
ometer+0.043
ute+0.037
br+0.035
pre+0.035
Neg logit contributions
ered-0.038
 asked-0.038
lee-0.038
 asks-0.038
 begged-0.038
Token they
Token ID484
Feature activation-3.12e+00

Pos logit contributions
tr+0.000
ometer+0.000
ute+0.000
pre+0.000
br+0.000
Neg logit contributions
lee-0.000
atted-0.000
 asking-0.000
ered-0.000
 shrugged-0.000
Token learned
Token ID4499
Feature activation+2.86e-01

Pos logit contributions
atted+0.002
lee+0.002
els+0.002
ice+0.002
ered+0.002
Neg logit contributions
tr-0.001
ometer-0.001
br-0.001
 steep-0.001
powers-0.001
Token how
Token ID703
Feature activation-7.61e+00

Pos logit contributions
ered+0.007
 asking+0.007
lee+0.007
atted+0.007
 shrugged+0.006
Neg logit contributions
tr-0.009
ometer-0.009
ute-0.007
pre-0.007
ding-0.007
Token it
Token ID340
Feature activation-7.33e+00

Pos logit contributions
tr+0.240
ometer+0.205
ute+0.173
pre+0.165
br+0.163
Neg logit contributions
lee-0.215
atted-0.209
 asking-0.205
ered-0.204
 asked-0.199
Token was
Token ID373
Feature activation-3.61e+00

Pos logit contributions
tr+0.245
ometer+0.224
ute+0.189
pre+0.187
t+0.186
Neg logit contributions
lee-0.171
atted-0.169
 asking-0.166
ered-0.161
 Laura-0.153
Token made
Token ID925
Feature activation-2.11e+00

Pos logit contributions
tr+0.093
ometer+0.083
t+0.067
br+0.066
ute+0.066
Neg logit contributions
atted-0.110
lee-0.109
ered-0.104
 Laura-0.102
els-0.101
Token too
Token ID1165
Feature activation-1.22e+01

Pos logit contributions
tr+0.064
ometer+0.059
ute+0.049
t+0.048
br+0.048
Neg logit contributions
lee-0.055
atted-0.054
 asking-0.052
ered-0.052
 shrugged-0.051
Token.
Token ID13
Feature activation-4.19e+00

Pos logit contributions
tr+0.256
ometer+0.222
ute+0.167
pre+0.155
br+0.146
Neg logit contributions
lee-0.446
atted-0.439
 asking-0.437
ered-0.436
 asked-0.422
Token<|endoftext|>
Token ID50256
Feature activation+6.44e+00

Pos logit contributions
tr+0.127
ometer+0.118
ute+0.098
pre+0.095
br+0.093
Neg logit contributions
lee-0.110
atted-0.109
 asking-0.108
ered-0.107
 shrugged-0.102
TokenOnce
Token ID7454
Feature activation+7.49e+00

Pos logit contributions
lee+0.198
atted+0.198
ered+0.195
 asking+0.194
 shrugged+0.187
Neg logit contributions
tr-0.165
ometer-0.150
ute-0.120
t-0.115
pre-0.114
Token upon
Token ID2402
Feature activation+6.62e+00

Pos logit contributions
 asking+0.195
ered+0.195
lee+0.194
atted+0.192
ice+0.183
Neg logit contributions
tr-0.214
ometer-0.207
t-0.174
ute-0.172
 steep-0.161
Token a
Token ID257
Feature activation+4.23e+00

Pos logit contributions
 asking+0.136
ered+0.134
lee+0.133
atted+0.129
 asked+0.128
Neg logit contributions
tr-0.235
ometer-0.226
ute-0.191
pre-0.185
br-0.180
Token healthy
Token ID5448
Feature activation-1.74e+00

Pos logit contributions
tr+0.122
ometer+0.113
 steep+0.089
t+0.089
ute+0.088
Neg logit contributions
atted-0.137
lee-0.136
ered-0.132
els-0.123
 asking-0.123
Token food
Token ID2057
Feature activation-7.26e+00

Pos logit contributions
tr+0.043
ometer+0.041
t+0.032
ute+0.032
br+0.032
Neg logit contributions
atted-0.053
lee-0.053
ered-0.052
 asking-0.048
 shrugged-0.048
Token can
Token ID460
Feature activation-7.12e+00

Pos logit contributions
tr+0.179
ometer+0.164
ute+0.129
t+0.126
br+0.122
Neg logit contributions
lee-0.231
atted-0.229
 asking-0.223
ered-0.223
 shrugged-0.216
Token be
Token ID307
Feature activation-4.80e+00

Pos logit contributions
tr+0.203
ometer+0.185
ute+0.153
t+0.152
pre+0.148
Neg logit contributions
atted-0.199
lee-0.199
 asking-0.191
ered-0.189
 shrugged-0.185
Token tasty
Token ID25103
Feature activation-2.67e+00

Pos logit contributions
tr+0.102
ometer+0.090
 steep+0.076
br+0.074
 cho+0.070
Neg logit contributions
atted-0.161
lee-0.159
ered-0.151
 shrugged-0.149
eled-0.146
Token too
Token ID1165
Feature activation-1.21e+01

Pos logit contributions
tr+0.077
ometer+0.072
ute+0.059
t+0.057
br+0.057
Neg logit contributions
lee-0.074
atted-0.074
ered-0.072
 asking-0.070
 shrugged-0.068
Token."
Token ID526
Feature activation-1.70e+00

Pos logit contributions
tr+0.278
ometer+0.266
t+0.212
 steep+0.208
ute+0.203
Neg logit contributions
atted-0.396
lee-0.389
ered-0.381
 asking-0.369
 shrugged-0.366
Token<|endoftext|>
Token ID50256
Feature activation+6.63e+00

Pos logit contributions
tr+0.049
ometer+0.045
ute+0.037
pre+0.035
br+0.035
Neg logit contributions
lee-0.048
atted-0.047
 asking-0.047
ered-0.046
 shrugged-0.044
TokenOnce
Token ID7454
Feature activation+7.42e+00

Pos logit contributions
lee+0.208
atted+0.208
ered+0.205
 asking+0.204
 shrugged+0.197
Neg logit contributions
tr-0.166
ometer-0.150
ute-0.119
t-0.115
br-0.113
Token upon
Token ID2402
Feature activation+6.50e+00

Pos logit contributions
 asking+0.194
ered+0.194
lee+0.193
atted+0.191
ice+0.184
Neg logit contributions
tr-0.210
ometer-0.204
t-0.171
ute-0.169
 steep-0.157
Token a
Token ID257
Feature activation+4.34e+00

Pos logit contributions
 asking+0.123
ered+0.121
lee+0.119
 asked+0.118
 Laura+0.117
Neg logit contributions
tr-0.241
ometer-0.233
ute-0.198
pre-0.191
br-0.187
Token make
Token ID787
Feature activation-4.11e-01

Pos logit contributions
tr+0.102
ometer+0.094
ute+0.077
pre+0.074
br+0.073
Neg logit contributions
lee-0.098
atted-0.097
 asking-0.097
ered-0.096
 shrugged-0.091
Token the
Token ID262
Feature activation-4.30e+00

Pos logit contributions
tr+0.016
ometer+0.015
ute+0.013
pre+0.013
br+0.012
Neg logit contributions
lee-0.008
atted-0.007
 asking-0.007
ered-0.007
 shrugged-0.007
Token waiter
Token ID46612
Feature activation-5.04e+00

Pos logit contributions
tr+0.124
ometer+0.114
ute+0.094
pre+0.091
t+0.088
Neg logit contributions
lee-0.121
atted-0.120
ered-0.119
 asking-0.116
 shrugged-0.112
Token drop
Token ID4268
Feature activation-9.03e-01

Pos logit contributions
tr+0.161
ometer+0.150
ute+0.127
pre+0.122
br+0.121
Neg logit contributions
lee-0.123
atted-0.122
 asking-0.122
ered-0.120
 shrugged-0.114
Token the
Token ID262
Feature activation-5.26e+00

Pos logit contributions
tr+0.035
ometer+0.033
ute+0.028
pre+0.028
t+0.028
Neg logit contributions
lee-0.016
atted-0.016
 asking-0.016
ered-0.015
 shrugged-0.014
Token pizz
Token ID38748
Feature activation-1.21e+01

Pos logit contributions
tr+0.155
ometer+0.143
ute+0.118
pre+0.114
br+0.112
Neg logit contributions
lee-0.144
atted-0.142
 asking-0.140
ered-0.140
 shrugged-0.134
Tokenas
Token ID292
Feature activation-4.50e+00

Pos logit contributions
tr+0.381
ometer+0.343
 steep+0.321
storm+0.298
Snow+0.296
Neg logit contributions
atted-0.268
ered-0.260
els-0.257
ma-0.256
 asking-0.253
Token.
Token ID13
Feature activation-4.50e+00

Pos logit contributions
tr+0.134
ometer+0.124
ute+0.103
pre+0.099
br+0.098
Neg logit contributions
lee-0.119
atted-0.118
 asking-0.118
ered-0.117
 shrugged-0.111
Token They
Token ID1119
Feature activation-1.55e+00

Pos logit contributions
tr+0.144
ometer+0.131
ute+0.119
br+0.115
pre+0.112
Neg logit contributions
lee-0.109
 asks-0.108
 asked-0.106
 asking-0.106
 Laura-0.103
Token said
Token ID531
Feature activation+2.66e+00

Pos logit contributions
tr+0.052
ometer+0.045
t+0.039
pre+0.039
 steep+0.037
Neg logit contributions
lee-0.035
atted-0.035
 Laura-0.034
 asking-0.034
els-0.032
Token sorry
Token ID7926
Feature activation+3.37e+00

Pos logit contributions
ered+0.085
lee+0.084
atted+0.083
 asking+0.082
eling+0.080
Neg logit contributions
tr-0.061
ometer-0.059
pre-0.044
t-0.043
ute-0.043
Token,
Token ID11
Feature activation+1.04e-01

Pos logit contributions
lee+0.139
ered+0.138
atted+0.136
eled+0.129
 Laura+0.126
Neg logit contributions
tr-0.133
ometer-0.126
t-0.104
pre-0.099
ute-0.097
Token you
Token ID345
Feature activation-3.97e-01

Pos logit contributions
lee+0.003
atted+0.003
 asking+0.003
ered+0.003
 shrugged+0.003
Neg logit contributions
tr-0.003
ometer-0.003
ute-0.002
pre-0.002
br-0.002
Token should
Token ID815
Feature activation-1.13e+00

Pos logit contributions
tr+0.011
ometer+0.010
t+0.008
ute+0.008
br+0.008
Neg logit contributions
lee-0.011
atted-0.011
 asking-0.011
ered-0.011
 Laura-0.011
Token have
Token ID423
Feature activation-8.09e-01

Pos logit contributions
tr+0.027
ometer+0.023
ute+0.020
t+0.019
Snow+0.018
Neg logit contributions
atted-0.037
lee-0.036
 shrugged-0.034
ice-0.034
 asking-0.034
Token the
Token ID262
Feature activation-4.07e+00

Pos logit contributions
tr+0.028
ometer+0.025
 cho+0.022
t+0.022
pre+0.021
Neg logit contributions
lee-0.018
atted-0.017
ered-0.016
 Laura-0.016
els-0.016
Token therm
Token ID21969
Feature activation-1.21e+01

Pos logit contributions
tr+0.124
ometer+0.110
br+0.097
t+0.096
ding+0.092
Neg logit contributions
 asking-0.108
 asks-0.102
 asked-0.098
 shrugged-0.096
 begged-0.096
Tokenometer
Token ID15635
Feature activation-7.88e+00

Pos logit contributions
tr+0.143
br+0.136
Ready+0.127
Whe+0.118
 Whe+0.111
Neg logit contributions
atted-0.461
pa-0.442
ise-0.441
ma-0.435
ps-0.429
Token in
Token ID287
Feature activation-1.08e+00

Pos logit contributions
tr+0.247
ometer+0.229
ute+0.194
pre+0.190
br+0.188
Neg logit contributions
 asking-0.192
lee-0.191
atted-0.190
ered-0.187
 shrugged-0.178
Token your
Token ID534
Feature activation-3.91e+00

Pos logit contributions
tr+0.035
ometer+0.032
ute+0.028
t+0.027
 license+0.027
Neg logit contributions
atted-0.027
lee-0.026
 shrugged-0.024
ered-0.024
 begged-0.023
Token mouth
Token ID5422
Feature activation-6.52e+00

Pos logit contributions
tr+0.135
ometer+0.126
ute+0.108
t+0.107
pre+0.106
Neg logit contributions
 asking-0.086
lee-0.085
atted-0.084
ered-0.081
 asks-0.080
Token.
Token ID13
Feature activation-6.09e+00

Pos logit contributions
tr+0.192
ometer+0.177
ute+0.147
pre+0.141
t+0.139
Neg logit contributions
lee-0.175
atted-0.174
 asking-0.172
ered-0.171
 shrugged-0.163
Token had
Token ID550
Feature activation-1.58e+00

Pos logit contributions
tr+0.106
ometer+0.100
ute+0.086
pre+0.082
br+0.081
Neg logit contributions
lee-0.070
atted-0.070
ered-0.069
 asking-0.069
 shrugged-0.067
Token not
Token ID407
Feature activation-2.18e+00

Pos logit contributions
tr+0.052
ometer+0.045
pre+0.037
Snow+0.037
t+0.037
Neg logit contributions
lee-0.040
atted-0.039
ered-0.037
 Laura-0.036
els-0.036
Token played
Token ID2826
Feature activation-9.14e-01

Pos logit contributions
tr+0.079
ometer+0.069
pre+0.059
t+0.059
br+0.058
Neg logit contributions
lee-0.046
atted-0.045
 Laura-0.044
ered-0.043
 asking-0.042
Token with
Token ID351
Feature activation-1.42e-02

Pos logit contributions
tr+0.030
ometer+0.029
ute+0.024
t+0.023
pre+0.023
Neg logit contributions
 asking-0.021
ered-0.020
lee-0.019
atted-0.019
 asked-0.019
Token the
Token ID262
Feature activation-5.03e+00

Pos logit contributions
tr+0.000
ometer+0.000
ute+0.000
pre+0.000
br+0.000
Neg logit contributions
lee-0.000
atted-0.000
 asking-0.000
ered-0.000
 shrugged-0.000
Token ash
Token ID12530
Feature activation-1.20e+01

Pos logit contributions
tr+0.161
ometer+0.157
ute+0.138
powers+0.129
br+0.129
Neg logit contributions
 asking-0.117
lee-0.112
atted-0.110
ered-0.109
 asked-0.105
Tokentr
Token ID2213
Feature activation-1.31e+01

Pos logit contributions
ute+0.104
powers+0.099
 lifetime+0.092
 sax+0.075
arest+0.075
Neg logit contributions
ma-0.483
 asking-0.475
 asked-0.466
els-0.463
ice-0.458
Tokenay
Token ID323
Feature activation-3.62e+00

Pos logit contributions
ometer+0.305
 villains+0.302
tr+0.301
powers+0.296
ute+0.290
Neg logit contributions
ma-0.368
atted-0.364
els-0.346
lee-0.340
ered-0.338
Token.
Token ID13
Feature activation-5.16e+00

Pos logit contributions
tr+0.110
ometer+0.104
ute+0.091
powers+0.087
pre+0.086
Neg logit contributions
 asking-0.089
ered-0.089
atted-0.089
lee-0.088
ice-0.084
Token They
Token ID1119
Feature activation-1.91e+00

Pos logit contributions
tr+0.171
ometer+0.157
ute+0.142
br+0.141
pre+0.137
Neg logit contributions
 asks-0.117
 asked-0.116
 asking-0.113
lee-0.111
 Laura-0.109
Token wished
Token ID16555
Feature activation+1.68e+00

Pos logit contributions
tr+0.070
ometer+0.066
ute+0.058
pre+0.055
br+0.055
Neg logit contributions
lee-0.037
atted-0.037
 asking-0.037
ered-0.036
 shrugged-0.035
Token dull
Token ID19222
Feature activation-2.13e+00

Pos logit contributions
tr+0.125
ometer+0.115
ute+0.094
pre+0.090
br+0.088
Neg logit contributions
lee-0.128
atted-0.127
 asking-0.126
ered-0.125
 shrugged-0.120
Token things
Token ID1243
Feature activation-7.61e+00

Pos logit contributions
tr+0.059
ometer+0.054
ute+0.044
pre+0.042
br+0.042
Neg logit contributions
lee-0.060
atted-0.060
ered-0.059
 asking-0.059
 Laura-0.056
Token can
Token ID460
Feature activation-8.15e+00

Pos logit contributions
tr+0.214
ometer+0.198
ute+0.161
pre+0.154
br+0.147
Neg logit contributions
ered-0.215
 asking-0.214
lee-0.213
atted-0.212
 asks-0.204
Token be
Token ID307
Feature activation-5.47e+00

Pos logit contributions
tr+0.261
ometer+0.253
ute+0.206
pre+0.203
ding+0.193
Neg logit contributions
 asking-0.198
ered-0.196
lee-0.191
 asked-0.191
 asks-0.188
Token fun
Token ID1257
Feature activation-4.84e+00

Pos logit contributions
tr+0.115
ometer+0.098
br+0.077
 steep+0.077
pre+0.072
Neg logit contributions
atted-0.190
lee-0.189
ered-0.186
 asking-0.180
 Laura-0.179
Token too
Token ID1165
Feature activation-1.19e+01

Pos logit contributions
tr+0.143
ometer+0.132
ute+0.109
pre+0.105
br+0.103
Neg logit contributions
lee-0.130
atted-0.129
 asking-0.127
ered-0.127
 shrugged-0.121
Token.
Token ID13
Feature activation-4.63e+00

Pos logit contributions
tr+0.340
ometer+0.313
ute+0.256
pre+0.246
br+0.242
Neg logit contributions
lee-0.335
atted-0.332
 asking-0.330
ered-0.327
 shrugged-0.313
Token<|endoftext|>
Token ID50256
Feature activation+6.15e+00

Pos logit contributions
tr+0.147
ometer+0.135
ute+0.118
pre+0.114
br+0.114
Neg logit contributions
lee-0.116
 asking-0.113
 asks-0.113
 asked-0.110
atted-0.110
TokenS
Token ID50
Feature activation+1.15e+01

Pos logit contributions
lee+0.168
atted+0.166
 asking+0.166
ered+0.164
 asks+0.158
Neg logit contributions
tr-0.177
ometer-0.164
ute-0.134
pre-0.129
br-0.126
Tokenara
Token ID3301
Feature activation+1.10e+01

Pos logit contributions
ered+0.281
 asking+0.278
atted+0.277
 shrugged+0.264
lee+0.264
Neg logit contributions
tr-0.359
ometer-0.327
t-0.287
ute-0.285
ding-0.279
Token and
Token ID290
Feature activation+6.43e+00

Pos logit contributions
lee+0.330
eling+0.329
 hesitation+0.328
ered+0.326
atted+0.318
Neg logit contributions
tr-0.300
ometer-0.261
pre-0.221
ding-0.205
t-0.204
Token you
Token ID345
Feature activation-9.39e-01

Pos logit contributions
lee+0.000
 asking+0.000
atted+0.000
ered+0.000
 shrugged+0.000
Neg logit contributions
tr-0.000
ometer-0.000
ute-0.000
pre-0.000
br-0.000
Token have
Token ID423
Feature activation-3.02e-01

Pos logit contributions
tr+0.026
ometer+0.024
ute+0.019
pre+0.018
t+0.018
Neg logit contributions
lee-0.027
atted-0.027
 asking-0.027
ered-0.027
 shrugged-0.026
Token to
Token ID284
Feature activation-1.42e+00

Pos logit contributions
tr+0.008
ometer+0.007
ute+0.006
pre+0.006
br+0.005
Neg logit contributions
lee-0.009
atted-0.009
 asking-0.009
ered-0.009
 shrugged-0.009
Token keep
Token ID1394
Feature activation+8.78e-01

Pos logit contributions
tr+0.033
ometer+0.027
ute+0.025
Snow+0.024
 job+0.024
Neg logit contributions
atted-0.049
lee-0.047
 shrugged-0.043
els-0.043
 asking-0.042
Token the
Token ID262
Feature activation-5.32e+00

Pos logit contributions
atted+0.022
 shrugged+0.022
lee+0.021
ited+0.021
ered+0.021
Neg logit contributions
tr-0.024
ometer-0.023
 cho-0.020
ch-0.020
ute-0.020
Token therm
Token ID21969
Feature activation-1.19e+01

Pos logit contributions
tr+0.154
ometer+0.143
ute+0.124
br+0.121
t+0.119
Neg logit contributions
 asking-0.145
 asks-0.130
 asked-0.129
ered-0.128
 begged-0.128
Tokenometer
Token ID15635
Feature activation-8.68e+00

Pos logit contributions
tr+0.100
br+0.092
 lifetime+0.083
Ready+0.079
 souven+0.079
Neg logit contributions
atted-0.478
ma-0.474
ice-0.472
ered-0.468
 asking-0.466
Token in
Token ID287
Feature activation-1.10e+00

Pos logit contributions
tr+0.270
ometer+0.252
ute+0.211
pre+0.211
br+0.206
Neg logit contributions
 asking-0.213
ered-0.210
lee-0.205
atted-0.204
 asked-0.196
Token your
Token ID534
Feature activation-3.58e+00

Pos logit contributions
tr+0.035
ometer+0.032
ute+0.027
 steep+0.027
Snow+0.026
Neg logit contributions
atted-0.029
lee-0.028
 shrugged-0.027
ered-0.025
 begged-0.024
Token mouth
Token ID5422
Feature activation-6.29e+00

Pos logit contributions
tr+0.121
ometer+0.113
ute+0.095
pre+0.091
br+0.089
Neg logit contributions
lee-0.083
atted-0.082
ered-0.081
 asking-0.080
 shrugged-0.077
Token for
Token ID329
Feature activation-1.81e+00

Pos logit contributions
tr+0.178
ometer+0.164
ute+0.135
pre+0.129
br+0.127
Neg logit contributions
lee-0.176
atted-0.174
 asking-0.173
ered-0.172
 shrugged-0.164
Token throw
Token ID3714
Feature activation+3.05e+00

Pos logit contributions
tr+0.054
ometer+0.048
pre+0.040
t+0.039
 steep+0.039
Neg logit contributions
lee-0.038
atted-0.037
ered-0.035
ice-0.034
els-0.034
Token away
Token ID1497
Feature activation+1.40e+00

Pos logit contributions
lee+0.054
atted+0.053
 asking+0.052
ered+0.051
 shrugged+0.048
Neg logit contributions
tr-0.118
ometer-0.111
ute-0.096
pre-0.095
br-0.094
Token the
Token ID262
Feature activation-4.27e+00

Pos logit contributions
atted+0.023
lee+0.023
 asking+0.022
ered+0.022
 shrugged+0.021
Neg logit contributions
tr-0.056
ometer-0.052
ute-0.045
pre-0.045
br-0.044
Token dirty
Token ID11841
Feature activation-3.65e+00

Pos logit contributions
tr+0.125
ometer+0.116
ute+0.096
pre+0.092
br+0.091
Neg logit contributions
lee-0.115
atted-0.115
 asking-0.114
ered-0.113
 shrugged-0.108
Token ash
Token ID12530
Feature activation-1.01e+01

Pos logit contributions
tr+0.107
ometer+0.099
ute+0.082
pre+0.079
br+0.078
Neg logit contributions
lee-0.099
atted-0.098
 asking-0.097
ered-0.096
 shrugged-0.092
Tokentr
Token ID2213
Feature activation-1.19e+01

Pos logit contributions
ute+0.044
ometer+0.039
powers+0.039
 lifetime+0.037
 Whe+0.027
Neg logit contributions
 asking-0.429
ered-0.426
ma-0.424
els-0.423
ice-0.422
Tokenay
Token ID323
Feature activation-1.99e+00

Pos logit contributions
 villains+0.270
ometer+0.270
tr+0.265
powers+0.262
ute+0.258
Neg logit contributions
ma-0.334
atted-0.329
els-0.316
ered-0.314
lee-0.307
Token.
Token ID13
Feature activation-4.16e+00

Pos logit contributions
tr+0.059
ometer+0.054
ute+0.045
pre+0.043
t+0.043
Neg logit contributions
lee-0.054
atted-0.053
 asking-0.052
ered-0.052
 shrugged-0.050
Token<|endoftext|>
Token ID50256
Feature activation+6.97e+00

Pos logit contributions
tr+0.139
ometer+0.134
ute+0.123
br+0.120
pre+0.118
Neg logit contributions
lee-0.090
 asking-0.087
atted-0.085
 asked-0.084
 begged-0.082
TokenOnce
Token ID7454
Feature activation+7.55e+00

Pos logit contributions
lee+0.222
atted+0.221
ered+0.217
 asking+0.216
 shrugged+0.209
Neg logit contributions
tr-0.172
ometer-0.155
ute-0.122
t-0.119
br-0.116
Token upon
Token ID2402
Feature activation+6.87e+00

Pos logit contributions
ered+0.195
 asking+0.194
lee+0.194
atted+0.193
 shrugged+0.183
Neg logit contributions
tr-0.220
ometer-0.210
t-0.177
ute-0.175
 steep-0.162
Token ugly
Token ID13400
Feature activation-1.15e+00

Pos logit contributions
tr+0.121
ometer+0.112
ute+0.096
br+0.092
t+0.092
Neg logit contributions
 asking-0.087
lee-0.084
atted-0.084
ered-0.083
 asks-0.080
Token things
Token ID1243
Feature activation-6.67e+00

Pos logit contributions
tr+0.030
ometer+0.027
ute+0.022
pre+0.021
br+0.021
Neg logit contributions
lee-0.035
atted-0.035
 asking-0.035
ered-0.034
 shrugged-0.033
Token can
Token ID460
Feature activation-7.04e+00

Pos logit contributions
tr+0.183
ometer+0.168
ute+0.136
pre+0.131
br+0.126
Neg logit contributions
lee-0.191
 asking-0.191
ered-0.191
atted-0.191
 asks-0.181
Token be
Token ID307
Feature activation-4.74e+00

Pos logit contributions
tr+0.213
ometer+0.201
ute+0.166
pre+0.161
br+0.155
Neg logit contributions
 asking-0.181
ered-0.181
lee-0.180
atted-0.177
 asks-0.171
Token good
Token ID922
Feature activation-1.06e+00

Pos logit contributions
tr+0.098
ometer+0.082
 steep+0.067
br+0.064
 cho+0.058
Neg logit contributions
lee-0.171
atted-0.168
 Laura-0.166
ered-0.163
 asking-0.159
Token too
Token ID1165
Feature activation-1.18e+01

Pos logit contributions
tr+0.026
ometer+0.024
ute+0.019
t+0.019
br+0.018
Neg logit contributions
lee-0.034
ered-0.033
atted-0.033
 Laura-0.032
eling-0.032
Token.
Token ID13
Feature activation-4.48e+00

Pos logit contributions
tr+0.326
ometer+0.303
ute+0.246
t+0.234
pre+0.234
Neg logit contributions
lee-0.340
atted-0.338
ered-0.332
 asking-0.331
 shrugged-0.319
Token They
Token ID1119
Feature activation-1.62e+00

Pos logit contributions
tr+0.148
ometer+0.132
br+0.118
ute+0.118
storm+0.115
Neg logit contributions
 asks-0.108
 asked-0.101
 asking-0.100
lee-0.100
atted-0.099
Token are
Token ID389
Feature activation-1.69e-01

Pos logit contributions
tr+0.052
ometer+0.048
ute+0.040
pre+0.039
br+0.038
Neg logit contributions
lee-0.040
atted-0.039
 asking-0.039
ered-0.039
 shrugged-0.037
Token happy
Token ID3772
Feature activation+1.87e+00

Pos logit contributions
tr+0.005
ometer+0.004
ute+0.003
 steep+0.003
br+0.003
Neg logit contributions
lee-0.005
atted-0.005
ered-0.005
 asking-0.004
 Laura-0.004
Token.
Token ID13
Feature activation-4.12e+00

Pos logit contributions
 asking+0.046
lee+0.045
atted+0.045
ered+0.043
 asks+0.042
Neg logit contributions
tr-0.062
ometer-0.057
ute-0.048
br-0.046
pre-0.046
Token was
Token ID373
Feature activation+1.38e+00

Pos logit contributions
tr+0.070
ometer+0.068
 steep+0.054
t+0.054
ute+0.052
Neg logit contributions
lee-0.069
atted-0.066
ice-0.064
ered-0.063
els-0.063
Token cheaper
Token ID11721
Feature activation-1.31e+00

Pos logit contributions
lee+0.042
atted+0.041
els+0.040
ered+0.039
ice+0.039
Neg logit contributions
tr-0.033
ometer-0.031
 steep-0.026
t-0.025
 much-0.024
Token than
Token ID621
Feature activation-4.24e+00

Pos logit contributions
tr+0.037
ometer+0.035
ute+0.028
pre+0.027
t+0.027
Neg logit contributions
lee-0.037
atted-0.036
ered-0.036
 asking-0.035
 shrugged-0.034
Token the
Token ID262
Feature activation-5.36e+00

Pos logit contributions
tr+0.133
ometer+0.126
t+0.103
ute+0.102
pre+0.101
Neg logit contributions
lee-0.103
atted-0.102
ered-0.099
 asking-0.097
 asks-0.095
Token other
Token ID584
Feature activation-5.06e+00

Pos logit contributions
tr+0.164
ometer+0.152
ute+0.131
pre+0.124
br+0.123
Neg logit contributions
 asking-0.139
lee-0.133
atted-0.131
ered-0.131
 asked-0.127
Token therm
Token ID21969
Feature activation-1.18e+01

Pos logit contributions
tr+0.158
ometer+0.147
ute+0.122
pre+0.118
br+0.116
Neg logit contributions
lee-0.128
atted-0.127
 asking-0.125
ered-0.125
 shrugged-0.119
Tokenometer
Token ID15635
Feature activation-8.29e+00

Pos logit contributions
tr+0.093
br+0.048
 fart+0.035
 steep+0.031
 sax+0.029
Neg logit contributions
ice-0.522
 asking-0.520
atted-0.514
ise-0.501
lee-0.498
Token.
Token ID13
Feature activation-4.13e+00

Pos logit contributions
tr+0.230
ometer+0.212
ute+0.167
t+0.165
pre+0.162
Neg logit contributions
lee-0.239
atted-0.238
ered-0.232
 asking-0.229
 shrugged-0.222
Token Bob
Token ID5811
Feature activation+6.81e+00

Pos logit contributions
tr+0.126
ometer+0.122
t+0.095
ute+0.094
 steep+0.092
Neg logit contributions
lee-0.103
ered-0.103
atted-0.102
 asking-0.101
eling-0.098
Token was
Token ID373
Feature activation+2.40e+00

Pos logit contributions
els+0.186
ries+0.179
eling+0.176
 asking+0.176
ered+0.176
Neg logit contributions
tr-0.177
ometer-0.176
t-0.138
pre-0.124
powers-0.122
Token happy
Token ID3772
Feature activation+3.37e+00

Pos logit contributions
els+0.079
lee+0.079
atted+0.076
ice+0.075
ere+0.075
Neg logit contributions
tr-0.049
ometer-0.044
 much-0.037
 steep-0.035
t-0.035
Token strawberry
Token ID41236
Feature activation-3.75e+00

Pos logit contributions
lee+0.060
atted+0.059
ered+0.058
 asking+0.058
 shrugged+0.056
Neg logit contributions
tr-0.066
ometer-0.061
ute-0.049
pre-0.048
br-0.048
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
tr+0.119
ometer+0.111
ute+0.093
pre+0.090
br+0.088
Neg logit contributions
lee-0.091
atted-0.091
 asking-0.090
ered-0.089
 shrugged-0.085
Token it
Token ID340
Feature activation-5.01e+00

Pos logit contributions
tr+0.048
ometer+0.044
ute+0.036
pre+0.035
br+0.034
Neg logit contributions
lee-0.045
atted-0.044
 asking-0.044
ered-0.044
 shrugged-0.042
Token is
Token ID318
Feature activation-4.75e+00

Pos logit contributions
tr+0.151
ometer+0.137
br+0.109
ute+0.108
t+0.107
Neg logit contributions
lee-0.142
atted-0.136
 asking-0.133
ered-0.132
 shrugged-0.127
Token tasty
Token ID25103
Feature activation-1.77e+00

Pos logit contributions
tr+0.122
ometer+0.110
 steep+0.090
br+0.086
ute+0.084
Neg logit contributions
atted-0.149
lee-0.146
ered-0.143
 shrugged-0.136
 asking-0.136
Token too
Token ID1165
Feature activation-1.18e+01

Pos logit contributions
tr+0.055
ometer+0.052
ute+0.043
pre+0.042
t+0.041
Neg logit contributions
lee-0.045
atted-0.044
ered-0.043
 asking-0.043
 shrugged-0.041
Token,"
Token ID553
Feature activation+7.29e-01

Pos logit contributions
tr+0.313
ometer+0.300
t+0.239
 steep+0.239
ute+0.238
Neg logit contributions
lee-0.347
atted-0.346
ered-0.335
 asking-0.331
 shrugged-0.319
Token Tom
Token ID4186
Feature activation+8.14e+00

Pos logit contributions
atted+0.016
ered+0.016
lee+0.015
eled+0.014
 asking+0.013
Neg logit contributions
tr-0.024
ometer-0.022
pre-0.019
ute-0.019
 bike-0.019
Token says
Token ID1139
Feature activation+3.17e+00

Pos logit contributions
lee+0.206
atted+0.205
ered+0.201
eled+0.201
ise+0.177
Neg logit contributions
tr-0.233
ometer-0.214
 steep-0.178
br-0.178
powers-0.176
Token.
Token ID13
Feature activation-3.49e+00

Pos logit contributions
lee+0.104
ered+0.102
atted+0.101
ere+0.101
 asking+0.100
Neg logit contributions
tr-0.068
ometer-0.066
t-0.050
ute-0.049
pre-0.048
Token
Token ID198
Feature activation-2.05e+00

Pos logit contributions
tr+0.094
ometer+0.086
ute+0.070
pre+0.067
br+0.066
Neg logit contributions
lee-0.103
atted-0.103
 asking-0.102
ered-0.101
 shrugged-0.097
Token how
Token ID703
Feature activation-7.39e+00

Pos logit contributions
lee+0.065
atted+0.064
 asking+0.063
ered+0.063
 shrugged+0.060
Neg logit contributions
tr-0.099
ometer-0.093
ute-0.079
pre-0.077
br-0.076
Token big
Token ID1263
Feature activation-6.14e-01

Pos logit contributions
tr+0.206
ometer+0.183
ute+0.148
br+0.139
pre+0.139
Neg logit contributions
lee-0.227
atted-0.221
 asking-0.216
ered-0.216
 asked-0.206
Token it
Token ID340
Feature activation-8.14e+00

Pos logit contributions
tr+0.016
ometer+0.015
pre+0.013
ute+0.013
br+0.013
Neg logit contributions
atted-0.017
ered-0.017
lee-0.017
 asking-0.017
 begged-0.016
Token had
Token ID550
Feature activation-4.00e+00

Pos logit contributions
tr+0.261
ometer+0.231
pre+0.198
t+0.197
br+0.197
Neg logit contributions
atted-0.203
lee-0.201
 asking-0.196
ered-0.186
 Laura-0.184
Token grown
Token ID7334
Feature activation-1.47e+00

Pos logit contributions
tr+0.120
ometer+0.105
t+0.092
br+0.089
pre+0.088
Neg logit contributions
atted-0.108
lee-0.108
 asking-0.101
ered-0.100
 asks-0.099
Token too
Token ID1165
Feature activation-1.18e+01

Pos logit contributions
tr+0.041
ometer+0.039
ute+0.032
pre+0.032
br+0.032
Neg logit contributions
atted-0.039
 asking-0.038
 shrugged-0.038
 begged-0.037
lee-0.037
Token.
Token ID13
Feature activation-4.15e+00

Pos logit contributions
tr+0.279
ometer+0.268
t+0.218
ute+0.210
 steep+0.210
Neg logit contributions
atted-0.362
lee-0.358
 asking-0.352
ered-0.351
 shrugged-0.343
Token From
Token ID3574
Feature activation+5.66e+00

Pos logit contributions
tr+0.134
ometer+0.125
ute+0.103
pre+0.099
t+0.099
Neg logit contributions
ered-0.103
atted-0.101
lee-0.100
 asking-0.100
 shrugged-0.095
Token then
Token ID788
Feature activation+9.85e-01

Pos logit contributions
lee+0.195
atted+0.195
ered+0.190
 asking+0.189
 begged+0.186
Neg logit contributions
tr-0.115
ometer-0.103
ute-0.088
 steep-0.074
pre-0.072
Token on
Token ID319
Feature activation+4.52e+00

Pos logit contributions
 asking+0.028
atted+0.028
lee+0.028
ered+0.027
 asks+0.026
Neg logit contributions
tr-0.027
ometer-0.025
ute-0.020
pre-0.020
powers-0.019
Token,
Token ID11
Feature activation+1.80e+00

Pos logit contributions
atted+0.130
lee+0.128
eling+0.127
ered+0.126
eled+0.126
Neg logit contributions
tr-0.115
ometer-0.110
t-0.095
br-0.093
ute-0.090
Token not
Token ID407
Feature activation-5.61e-01

Pos logit contributions
tr+0.131
ometer+0.124
ute+0.101
t+0.099
pre+0.097
Neg logit contributions
lee-0.124
atted-0.122
ered-0.116
 asking-0.116
ice-0.113
Token keep
Token ID1394
Feature activation-4.21e-03

Pos logit contributions
tr+0.018
ometer+0.017
ute+0.014
pre+0.014
br+0.014
Neg logit contributions
 asking-0.014
ered-0.013
lee-0.013
 asked-0.013
atted-0.013
Token things
Token ID1243
Feature activation-9.23e+00

Pos logit contributions
ometer+0.000
tr+0.000
 license+0.000
ute+0.000
 possible+0.000
Neg logit contributions
atted-0.000
lee-0.000
 shrugged-0.000
ited-0.000
ered-0.000
Token that
Token ID326
Feature activation-8.09e+00

Pos logit contributions
tr+0.267
ometer+0.241
ute+0.197
pre+0.193
br+0.184
Neg logit contributions
 asking-0.258
ered-0.254
lee-0.251
atted-0.250
 asked-0.243
Token are
Token ID389
Feature activation-5.67e+00

Pos logit contributions
tr+0.210
ometer+0.193
ute+0.155
pre+0.149
t+0.146
Neg logit contributions
lee-0.244
atted-0.242
 asking-0.239
ered-0.237
 shrugged-0.228
Token too
Token ID1165
Feature activation-1.18e+01

Pos logit contributions
tr+0.144
ometer+0.131
 steep+0.102
ute+0.102
br+0.101
Neg logit contributions
atted-0.176
lee-0.175
ered-0.167
 asking-0.165
 shrugged-0.162
Token expensive
Token ID5789
Feature activation-3.89e+00

Pos logit contributions
tr+0.257
ometer+0.247
t+0.187
ute+0.184
 steep+0.184
Neg logit contributions
lee-0.394
atted-0.393
ered-0.385
 asking-0.378
 Laura-0.370
Token,"
Token ID553
Feature activation+7.49e-01

Pos logit contributions
tr+0.120
ometer+0.113
ute+0.094
t+0.092
pre+0.090
Neg logit contributions
lee-0.099
atted-0.098
ered-0.095
 asking-0.093
 shrugged-0.090
Token she
Token ID673
Feature activation-5.08e-01

Pos logit contributions
atted+0.019
lee+0.019
ice+0.018
ered+0.018
ps+0.018
Neg logit contributions
tr-0.019
ometer-0.019
t-0.016
pre-0.016
ute-0.016
Token warned
Token ID7728
Feature activation+4.39e+00

Pos logit contributions
tr+0.018
ometer+0.017
t+0.015
ute+0.014
pre+0.014
Neg logit contributions
lee-0.011
atted-0.010
els-0.009
 asking-0.009
 Laura-0.009
Token.
Token ID13
Feature activation-3.64e+00

Pos logit contributions
atted+0.122
lee+0.121
ered+0.114
 asks+0.113
 asking+0.113
Neg logit contributions
tr-0.120
ometer-0.112
t-0.092
ute-0.090
pre-0.087
Token not
Token ID407
Feature activation-3.32e+00

Pos logit contributions
tr+0.091
ometer+0.081
ute+0.065
 steep+0.062
br+0.061
Neg logit contributions
lee-0.100
atted-0.098
ered-0.095
 asking-0.092
 shrugged-0.091
Token nice
Token ID3621
Feature activation-1.83e+00

Pos logit contributions
tr+0.098
ometer+0.091
ute+0.075
pre+0.072
br+0.071
Neg logit contributions
lee-0.089
atted-0.088
 asking-0.088
ered-0.087
 shrugged-0.084
Token to
Token ID284
Feature activation-4.52e+00

Pos logit contributions
tr+0.048
ometer+0.044
ute+0.035
pre+0.034
t+0.033
Neg logit contributions
lee-0.055
atted-0.054
ered-0.054
 asking-0.054
 shrugged-0.052
Token keep
Token ID1394
Feature activation-1.53e+00

Pos logit contributions
tr+0.130
ometer+0.116
ute+0.099
t+0.097
 road+0.093
Neg logit contributions
atted-0.127
lee-0.122
ered-0.114
 shrugged-0.113
 asking-0.111
Token the
Token ID262
Feature activation-6.69e+00

Pos logit contributions
tr+0.038
ometer+0.038
ute+0.030
 license+0.030
 cho+0.030
Neg logit contributions
atted-0.043
 shrugged-0.042
lee-0.041
ered-0.041
 begged-0.039
Token dish
Token ID9433
Feature activation-1.18e+01

Pos logit contributions
tr+0.228
ometer+0.213
ute+0.186
br+0.182
t+0.176
Neg logit contributions
 asking-0.152
lee-0.141
atted-0.138
 asks-0.136
 asked-0.134
Token all
Token ID477
Feature activation-5.62e+00

Pos logit contributions
tr+0.319
ometer+0.273
br+0.240
ute+0.237
Snow+0.225
Neg logit contributions
ma-0.326
 asking-0.324
ered-0.323
lee-0.322
atted-0.316
Token to
Token ID284
Feature activation-4.90e+00

Pos logit contributions
tr+0.156
ometer+0.142
ute+0.114
t+0.112
pre+0.110
Neg logit contributions
lee-0.165
atted-0.162
ered-0.158
 asking-0.156
 shrugged-0.153
Token yourself
Token ID3511
Feature activation-3.31e+00

Pos logit contributions
tr+0.139
ometer+0.124
ute+0.107
t+0.105
Snow+0.101
Neg logit contributions
atted-0.139
lee-0.135
 shrugged-0.125
 asking-0.123
ered-0.122
Token."
Token ID526
Feature activation-1.25e+00

Pos logit contributions
tr+0.093
ometer+0.086
ute+0.070
pre+0.068
br+0.066
Neg logit contributions
lee-0.093
atted-0.092
 asking-0.091
ered-0.091
 shrugged-0.087
Token
Token ID198
Feature activation-1.60e+00

Pos logit contributions
tr+0.041
ometer+0.037
ute+0.033
pre+0.031
storm+0.030
Neg logit contributions
lee-0.030
 asks-0.030
 asked-0.030
 asking-0.030
atted-0.030
Token
Token ID198
Feature activation-1.62e+00

Pos logit contributions
tr+0.063
ometer+0.058
ute+0.047
pre+0.045
br+0.044
Neg logit contributions
lee-0.065
atted-0.064
 asking-0.063
ered-0.063
 shrugged-0.060
TokenThe
Token ID464
Feature activation+1.77e+00

Pos logit contributions
tr+0.053
ometer+0.050
pre+0.040
ute+0.040
powers+0.039
Neg logit contributions
 asking-0.039
lee-0.038
atted-0.037
ered-0.037
 asked-0.037
Token End
Token ID5268
Feature activation-1.54e+00

Pos logit contributions
 asking+0.042
lee+0.041
atted+0.040
ered+0.040
 asks+0.039
Neg logit contributions
tr-0.057
ometer-0.056
ute-0.046
ding-0.044
pre-0.044
Token.
Token ID13
Feature activation-3.49e+00

Pos logit contributions
tr+0.058
ometer+0.054
ute+0.048
 fart+0.047
 roof+0.047
Neg logit contributions
 asked-0.027
lee-0.026
 asks-0.026
atted-0.025
 told-0.025
Token<|endoftext|>
Token ID50256
Feature activation+5.66e+00

Pos logit contributions
tr+0.108
ometer+0.104
ute+0.088
powers+0.085
pre+0.085
Neg logit contributions
lee-0.084
 asking-0.084
atted-0.082
 asks-0.081
 asked-0.080
TokenTim
Token ID14967
Feature activation+1.15e+01

Pos logit contributions
lee+0.154
atted+0.153
 asking+0.152
ered+0.150
 shrugged+0.144
Neg logit contributions
tr-0.164
ometer-0.152
ute-0.125
pre-0.120
br-0.118
Token and
Token ID290
Feature activation+6.07e+00

Pos logit contributions
 hesitation+0.343
ered+0.342
lee+0.323
eling+0.316
atted+0.316
Neg logit contributions
tr-0.284
ometer-0.228
ite-0.209
ding-0.206
 je-0.205
Token Kim
Token ID6502
Feature activation+1.08e+01

Pos logit contributions
lee+0.117
atted+0.116
 asking+0.114
ered+0.113
 shrugged+0.105
Neg logit contributions
tr-0.227
ometer-0.214
ute-0.185
pre-0.180
br-0.177
Token were
Token ID547
Feature activation+6.64e+00

Pos logit contributions
 asking+0.323
atted+0.319
ered+0.308
lee+0.306
 Laura+0.298
Neg logit contributions
tr-0.275
ometer-0.220
ding-0.204
pre-0.202
 je-0.192
Token twins
Token ID20345
Feature activation+2.85e+00

Pos logit contributions
ered+0.184
 shrugged+0.173
lee+0.170
eling+0.168
 begged+0.165
Neg logit contributions
tr-0.186
ometer-0.167
ute-0.153
 much-0.139
 cho-0.135
Token who
Token ID508
Feature activation+8.26e-01

Pos logit contributions
lee+0.075
ered+0.075
atted+0.074
 asking+0.073
 shrugged+0.069
Neg logit contributions
tr-0.085
ometer-0.078
ute-0.065
pre-0.063
t-0.062
Token toy
Token ID13373
Feature activation-5.77e+00

Pos logit contributions
tr+0.099
ometer+0.097
t+0.079
ding+0.077
 steep+0.076
Neg logit contributions
lee-0.100
atted-0.097
ered-0.096
 asking-0.095
els-0.091
Token cars
Token ID5006
Feature activation-5.74e+00

Pos logit contributions
tr+0.179
ometer+0.176
 steep+0.165
Snow+0.156
pre+0.152
Neg logit contributions
lee-0.119
 asking-0.114
ered-0.112
atted-0.112
 asked-0.111
Token!"
Token ID2474
Feature activation+5.12e-01

Pos logit contributions
tr+0.183
ometer+0.175
ute+0.146
pre+0.142
powers+0.138
Neg logit contributions
 asking-0.134
ered-0.132
lee-0.131
atted-0.130
 asked-0.126
Token
Token ID198
Feature activation-1.30e-01

Pos logit contributions
lee+0.014
atted+0.014
 asking+0.013
ered+0.013
 shrugged+0.013
Neg logit contributions
tr-0.015
ometer-0.014
ute-0.012
pre-0.011
br-0.011
Token
Token ID198
Feature activation-2.07e-02

Pos logit contributions
tr+0.004
ometer+0.003
ute+0.003
pre+0.003
br+0.003
Neg logit contributions
lee-0.004
atted-0.004
 asking-0.004
ered-0.003
 asked-0.003
TokenL
Token ID43
Feature activation+8.01e+00

Pos logit contributions
tr+0.001
ometer+0.001
ute+0.000
pre+0.000
br+0.000
Neg logit contributions
lee-0.001
 asking-0.001
ered-0.001
atted-0.001
 asked-0.001
Tokenily
Token ID813
Feature activation+7.39e+00

Pos logit contributions
 asking+0.173
lee+0.170
atted+0.165
 asks+0.157
 Laura+0.154
Neg logit contributions
tr-0.273
ometer-0.239
t-0.232
ute-0.226
pre-0.215
Token was
Token ID373
Feature activation+3.53e+00

Pos logit contributions
eling+0.187
 hesitation+0.182
 asking+0.182
ered+0.177
atted+0.177
Neg logit contributions
tr-0.235
ometer-0.199
t-0.162
ute-0.159
pre-0.158
Token happy
Token ID3772
Feature activation+4.05e+00

Pos logit contributions
atted+0.113
lee+0.111
ered+0.109
els+0.108
ice+0.108
Neg logit contributions
tr-0.084
ometer-0.072
ute-0.058
 steep-0.057
t-0.057
Token that
Token ID326
Feature activation-2.59e+00

Pos logit contributions
lee+0.110
 asking+0.110
atted+0.109
ered+0.106
 shrugged+0.103
Neg logit contributions
tr-0.121
ometer-0.112
ute-0.092
pre-0.089
br-0.088
Token her
Token ID607
Feature activation-2.37e-01

Pos logit contributions
tr+0.087
ometer+0.081
ute+0.069
pre+0.066
br+0.066
Neg logit contributions
lee-0.060
atted-0.060
 asking-0.059
ered-0.059
 shrugged-0.056
Token get
Token ID651
Feature activation-2.98e+00

Pos logit contributions
tr+0.048
ometer+0.045
ute+0.038
pre+0.037
br+0.035
Neg logit contributions
 asking-0.040
ered-0.040
lee-0.039
 asked-0.038
 begged-0.038
Token this
Token ID428
Feature activation-7.32e+00

Pos logit contributions
tr+0.081
ometer+0.075
ute+0.060
pre+0.058
t+0.057
Neg logit contributions
lee-0.086
atted-0.085
 asking-0.084
ered-0.084
 shrugged-0.081
Token huge
Token ID3236
Feature activation-3.24e+00

Pos logit contributions
tr+0.214
ometer+0.198
ute+0.163
pre+0.158
br+0.154
Neg logit contributions
lee-0.197
 asking-0.195
atted-0.194
ered-0.193
 shrugged-0.184
Token again
Token ID757
Feature activation-4.07e+00

Pos logit contributions
tr+0.094
ometer+0.088
ute+0.073
pre+0.070
br+0.069
Neg logit contributions
lee-0.087
atted-0.087
ered-0.085
 asking-0.083
 shrugged-0.080
Token."
Token ID526
Feature activation-9.30e-02

Pos logit contributions
tr+0.093
ometer+0.085
ute+0.067
t+0.066
br+0.063
Neg logit contributions
atted-0.136
lee-0.136
ered-0.129
 shrugged-0.128
 asking-0.126
Token Mary
Token ID5335
Feature activation+8.43e+00

Pos logit contributions
tr+0.003
ometer+0.003
ute+0.002
t+0.002
pre+0.002
Neg logit contributions
atted-0.002
lee-0.002
ered-0.002
 asking-0.002
eling-0.002
Token nodded
Token ID14464
Feature activation+7.56e+00

Pos logit contributions
 asking+0.207
lee+0.202
els+0.196
ries+0.189
 hesitation+0.187
Neg logit contributions
tr-0.260
ometer-0.239
t-0.204
pre-0.193
ute-0.185
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
atted+0.195
eling+0.194
 asks+0.193
ice+0.188
ere+0.186
Neg logit contributions
tr-0.201
ometer-0.179
 brightly-0.166
ute-0.155
pre-0.153
Token started
Token ID2067
Feature activation+2.17e+00

Pos logit contributions
els+0.029
lee+0.027
ice+0.026
atted+0.025
ere+0.025
Neg logit contributions
tr-0.039
ometer-0.038
pre-0.032
 we-0.030
t-0.030
Token to
Token ID284
Feature activation+8.24e-01

Pos logit contributions
ice+0.075
atted+0.073
ered+0.073
 shrugged+0.072
lee+0.072
Neg logit contributions
tr-0.041
ometer-0.040
ute-0.027
t-0.026
ding-0.026
Token help
Token ID1037
Feature activation+3.11e+00

Pos logit contributions
atted+0.026
lee+0.025
ice+0.024
els+0.024
riet+0.023
Neg logit contributions
tr-0.020
ometer-0.018
 job-0.014
 steep-0.014
 roof-0.014
Token glow
Token ID19634
Feature activation-3.42e+00

Pos logit contributions
tr+0.068
ometer+0.063
ute+0.051
t+0.051
pre+0.050
Neg logit contributions
lee-0.069
ered-0.069
 asking-0.069
atted-0.068
 shrugged-0.065
Token coming
Token ID2406
Feature activation-2.27e+00

Pos logit contributions
tr+0.106
ometer+0.101
ute+0.083
pre+0.083
t+0.080
Neg logit contributions
lee-0.082
 asking-0.082
ered-0.081
atted-0.080
 shrugged-0.076
Token from
Token ID422
Feature activation+2.45e+00

Pos logit contributions
tr+0.072
ometer+0.067
ute+0.058
pre+0.057
powers+0.056
Neg logit contributions
 asking-0.055
ered-0.055
lee-0.054
atted-0.052
 asked-0.050
Token it
Token ID340
Feature activation-5.30e+00

Pos logit contributions
lee+0.053
atted+0.052
 asking+0.051
ered+0.051
 shrugged+0.048
Neg logit contributions
tr-0.085
ometer-0.080
ute-0.068
pre-0.066
br-0.065
Token.
Token ID13
Feature activation-2.74e+00

Pos logit contributions
tr+0.141
ometer+0.128
ute+0.102
t+0.101
br+0.098
Neg logit contributions
atted-0.160
lee-0.159
ered-0.153
 asking-0.151
 shrugged-0.148
Token Johnny
Token ID15470
Feature activation+8.40e+00

Pos logit contributions
tr+0.092
ometer+0.086
ute+0.073
pre+0.071
t+0.070
Neg logit contributions
atted-0.062
lee-0.062
ered-0.061
 asking-0.060
 shrugged-0.057
Token was
Token ID373
Feature activation+2.85e+00

Pos logit contributions
atted+0.254
els+0.248
 hesitation+0.248
ries+0.244
 humor+0.236
Neg logit contributions
tr-0.206
ometer-0.182
t-0.153
ding-0.144
case-0.130
Token excited
Token ID6568
Feature activation+5.81e+00

Pos logit contributions
atted+0.096
els+0.096
ice+0.095
lee+0.095
pa+0.091
Neg logit contributions
tr-0.061
ometer-0.053
 steep-0.040
t-0.039
 much-0.038
Token and
Token ID290
Feature activation-7.72e-01

Pos logit contributions
atted+0.153
ered+0.152
lee+0.151
 asks+0.140
els+0.140
Neg logit contributions
tr-0.167
ometer-0.157
t-0.131
ute-0.128
ding-0.125
Token wanted
Token ID2227
Feature activation+2.45e+00

Pos logit contributions
ometer+0.023
tr+0.022
t+0.019
 steep+0.019
powers+0.019
Neg logit contributions
ice-0.019
els-0.018
atted-0.016
ma-0.016
ries-0.016
Token to
Token ID284
Feature activation-3.69e-01

Pos logit contributions
atted+0.081
lee+0.080
ered+0.078
 asking+0.077
 shrugged+0.076
Neg logit contributions
tr-0.056
ometer-0.052
ute-0.039
t-0.039
br-0.038
Token€
Token ID26391
Feature activation+2.88e-01

Pos logit contributions
atted+0.130
lee+0.130
 asking+0.128
ered+0.125
 shrugged+0.121
Neg logit contributions
tr-0.127
ometer-0.115
ute-0.094
pre-0.091
br-0.088
Token
Token ID220
Feature activation-1.57e+00

Pos logit contributions
lee+0.008
 asking+0.008
atted+0.008
 asked+0.007
 begged+0.007
Neg logit contributions
tr-0.008
ometer-0.007
ute-0.006
pre-0.006
br-0.006
Token
Token ID198
Feature activation-1.43e+00

Pos logit contributions
tr+0.045
ometer+0.041
ute+0.034
pre+0.033
br+0.033
Neg logit contributions
lee-0.043
 asking-0.042
atted-0.042
ered-0.041
 asks-0.040
Token
Token ID198
Feature activation-6.80e-01

Pos logit contributions
tr+0.038
ometer+0.034
ute+0.028
t+0.026
pre+0.026
Neg logit contributions
lee-0.043
atted-0.043
 asking-0.042
ered-0.042
 shrugged-0.041
TokenJoe
Token ID19585
Feature activation+7.93e+00

Pos logit contributions
tr+0.021
ometer+0.019
ute+0.016
pre+0.015
br+0.015
Neg logit contributions
lee-0.018
atted-0.018
 asking-0.017
ered-0.017
 shrugged-0.016
Token yelled
Token ID22760
Feature activation+8.12e+00

Pos logit contributions
 asking+0.175
ered+0.167
atted+0.163
els+0.163
 Laura+0.157
Neg logit contributions
tr-0.250
ometer-0.238
t-0.195
ute-0.190
 steep-0.188
Token �
Token ID6184
Feature activation+4.00e+00

Pos logit contributions
lee+0.182
atted+0.181
ered+0.179
 asking+0.179
 shrugged+0.168
Neg logit contributions
tr-0.273
ometer-0.258
ute-0.218
pre-0.209
t-0.208
Token�
Token ID95
Feature activation+1.68e+00

Pos logit contributions
lee+0.107
atted+0.103
ered+0.102
 asking+0.101
 shrugged+0.097
Neg logit contributions
tr-0.121
ometer-0.106
t-0.088
ute-0.086
br-0.085
Token€
Token ID26391
Feature activation-1.10e-01

Pos logit contributions
lee+0.044
 asking+0.043
ered+0.042
atted+0.041
ice+0.040
Neg logit contributions
tr-0.050
ometer-0.047
ute-0.040
pre-0.038
t-0.037
Token�
Token ID129
Feature activation+3.43e+00

Pos logit contributions
tr+0.003
ometer+0.003
ute+0.002
pre+0.002
 steep+0.002
Neg logit contributions
lee-0.003
atted-0.003
 asking-0.003
 asked-0.003
 begged-0.003
Token�
Token ID241
Feature activation-2.50e+00

Pos logit contributions
atted+0.082
lee+0.080
 asking+0.079
ered+0.079
 shrugged+0.077
Neg logit contributions
tr-0.110
ometer-0.104
ute-0.089
pre-0.083
 steep-0.082
Token for
Token ID329
Feature activation+3.72e-01

Pos logit contributions
tr+0.085
ometer+0.077
ute+0.064
pre+0.063
br+0.062
Neg logit contributions
 asking-0.084
lee-0.084
atted-0.082
ered-0.081
 shrugged-0.077
Token the
Token ID262
Feature activation-3.06e+00

Pos logit contributions
atted+0.009
ered+0.009
lee+0.009
 shrugged+0.009
 asking+0.008
Neg logit contributions
tr-0.011
ometer-0.010
ute-0.009
br-0.009
Snow-0.009
Token little
Token ID1310
Feature activation-1.54e+00

Pos logit contributions
tr+0.098
ometer+0.093
ute+0.077
pre+0.074
br+0.073
Neg logit contributions
 asking-0.076
lee-0.074
atted-0.072
ered-0.070
 asked-0.069
Token girl
Token ID2576
Feature activation-1.29e+00

Pos logit contributions
tr+0.053
ometer+0.049
ute+0.042
pre+0.040
ding+0.040
Neg logit contributions
 asking-0.036
lee-0.035
atted-0.034
 asked-0.034
 asks-0.033
Token to
Token ID284
Feature activation-1.08e+00

Pos logit contributions
tr+0.034
ometer+0.031
ute+0.025
pre+0.024
t+0.024
Neg logit contributions
lee-0.039
atted-0.039
ered-0.038
 asking-0.038
 shrugged-0.036
Token go
Token ID467
Feature activation+1.23e+00

Pos logit contributions
tr+0.023
ometer+0.018
ute+0.016
 steep+0.014
Snow+0.014
Neg logit contributions
atted-0.039
 asking-0.036
lee-0.036
ice-0.036
 shrugged-0.035
Token home
Token ID1363
Feature activation-4.93e-01

Pos logit contributions
lee+0.035
 asking+0.034
atted+0.034
ered+0.034
 asked+0.032
Neg logit contributions
tr-0.035
ometer-0.032
ute-0.026
pre-0.026
br-0.025
Token.
Token ID13
Feature activation-3.77e+00

Pos logit contributions
tr+0.015
ometer+0.013
ute+0.011
pre+0.011
br+0.011
Neg logit contributions
lee-0.013
atted-0.013
 asking-0.013
ered-0.013
 shrugged-0.012
Token
Token ID198
Feature activation-2.21e+00

Pos logit contributions
tr+0.106
ometer+0.097
ute+0.080
pre+0.077
br+0.076
Neg logit contributions
lee-0.107
atted-0.106
 asking-0.105
ered-0.104
 shrugged-0.100
Token
Token ID198
Feature activation-1.75e+00

Pos logit contributions
tr+0.057
ometer+0.052
ute+0.043
t+0.040
br+0.039
Neg logit contributions
atted-0.067
lee-0.067
 asking-0.066
ered-0.066
 shrugged-0.064
TokenThe
Token ID464
Feature activation+2.05e+00

Pos logit contributions
tr+0.046
ometer+0.042
ute+0.034
pre+0.032
br+0.031
Neg logit contributions
lee-0.053
atted-0.053
 asking-0.052
ered-0.052
 shrugged-0.050
Token<|endoftext|>
Token ID50256
Feature activation+6.29e+00

Pos logit contributions
tr+0.106
ometer+0.099
ute+0.079
t+0.076
pre+0.075
Neg logit contributions
ered-0.102
atted-0.101
lee-0.100
 asking-0.099
 shrugged-0.095
TokenOnce
Token ID7454
Feature activation+8.03e+00

Pos logit contributions
lee+0.195
atted+0.195
ered+0.192
 asking+0.191
 shrugged+0.185
Neg logit contributions
tr-0.159
ometer-0.145
ute-0.116
t-0.111
br-0.110
Token upon
Token ID2402
Feature activation+6.71e+00

Pos logit contributions
 asking+0.215
ered+0.215
lee+0.212
atted+0.211
ice+0.203
Neg logit contributions
tr-0.222
ometer-0.217
ute-0.181
t-0.180
 steep-0.167
Token a
Token ID257
Feature activation+4.97e+00

Pos logit contributions
 asking+0.127
ered+0.123
lee+0.122
 asked+0.121
 Laura+0.119
Neg logit contributions
tr-0.249
ometer-0.243
ute-0.205
pre-0.198
br-0.193
Token time
Token ID640
Feature activation+5.09e+00

Pos logit contributions
ered+0.140
lee+0.140
atted+0.137
 asking+0.137
els+0.129
Neg logit contributions
tr-0.145
ometer-0.125
br-0.107
ute-0.106
 steep-0.106
Token there
Token ID612
Feature activation+3.54e+00

Pos logit contributions
lee+0.101
atted+0.099
ered+0.097
 asking+0.096
 shrugged+0.091
Neg logit contributions
tr-0.186
ometer-0.174
ute-0.151
t-0.151
br-0.148
Token was
Token ID373
Feature activation+6.16e+00

Pos logit contributions
lee+0.107
atted+0.105
 asking+0.104
ered+0.097
eling+0.096
Neg logit contributions
tr-0.095
ometer-0.085
t-0.073
ute-0.067
pre-0.065
Token a
Token ID257
Feature activation+1.38e-01

Pos logit contributions
atted+0.168
lee+0.167
ered+0.165
 asking+0.162
 shrugged+0.157
Neg logit contributions
tr-0.182
ometer-0.162
ute-0.139
pre-0.134
br-0.133
Token beautiful
Token ID4950
Feature activation+2.15e+00

Pos logit contributions
lee+0.002
 asking+0.002
 asked+0.002
 Laura+0.002
 shrugged+0.002
Neg logit contributions
tr-0.006
ometer-0.005
t-0.005
ute-0.005
ometers-0.005
Token girl
Token ID2576
Feature activation+2.91e+00

Pos logit contributions
lee+0.055
 asking+0.054
atted+0.054
ered+0.052
 shrugged+0.051
Neg logit contributions
tr-0.066
ometer-0.061
ute-0.051
t-0.050
pre-0.049
Token named
Token ID3706
Feature activation+5.93e+00

Pos logit contributions
ered+0.057
atted+0.055
lee+0.054
 shrugged+0.051
 asking+0.051
Neg logit contributions
tr-0.111
ometer-0.101
ute-0.089
pre-0.087
br-0.085
TokenB
Token ID33
Feature activation+5.54e+00

Pos logit contributions
tr+0.017
ometer+0.015
ute+0.012
pre+0.012
t+0.012
Neg logit contributions
lee-0.019
atted-0.019
 asking-0.019
ered-0.019
 shrugged-0.018
Tokeneth
Token ID2788
Feature activation+6.87e+00

Pos logit contributions
lee+0.154
atted+0.153
ered+0.148
 asking+0.148
 shrugged+0.141
Neg logit contributions
tr-0.160
ometer-0.145
ute-0.119
pre-0.115
br-0.115
Token and
Token ID290
Feature activation+2.92e+00

Pos logit contributions
 hesitation+0.171
 asking+0.161
atted+0.156
eling+0.156
 asks+0.154
Neg logit contributions
tr-0.218
ometer-0.177
t-0.154
ding-0.144
pre-0.142
Token her
Token ID607
Feature activation+2.53e+00

Pos logit contributions
atted+0.054
ered+0.053
eling+0.051
lee+0.050
ice+0.050
Neg logit contributions
tr-0.105
ometer-0.102
ute-0.085
Snow-0.082
 bike-0.082
Token mummy
Token ID38567
Feature activation+5.29e+00

Pos logit contributions
ered+0.063
atted+0.061
 asking+0.061
lee+0.060
 asks+0.058
Neg logit contributions
tr-0.080
ometer-0.076
ute-0.063
pre-0.061
powers-0.059
Token set
Token ID900
Feature activation+5.27e+00

Pos logit contributions
riet+0.126
eling+0.125
 Eva+0.125
 Laura+0.124
gging+0.124
Neg logit contributions
tr-0.164
ometer-0.128
ding-0.121
br-0.116
pre-0.114
Token to
Token ID284
Feature activation+1.98e+00

Pos logit contributions
 shrugged+0.135
atted+0.133
lee+0.131
 asks+0.130
ice+0.128
Neg logit contributions
tr-0.153
ometer-0.147
ute-0.125
ding-0.116
br-0.116
Token work
Token ID670
Feature activation+6.32e-01

Pos logit contributions
atted+0.055
lee+0.052
 shrugged+0.050
 asking+0.048
ered+0.048
Neg logit contributions
tr-0.060
ometer-0.052
ute-0.045
br-0.043
pre-0.042
Token,
Token ID11
Feature activation+1.65e+00

Pos logit contributions
lee+0.017
atted+0.017
 asking+0.017
ered+0.017
 shrugged+0.016
Neg logit contributions
tr-0.018
ometer-0.017
ute-0.014
pre-0.014
br-0.013
Token painting
Token ID12036
Feature activation-1.86e-01

Pos logit contributions
atted+0.044
ered+0.044
lee+0.043
 asks+0.042
els+0.042
Neg logit contributions
tr-0.048
ometer-0.042
ding-0.036
 steep-0.034
pre-0.034
Token and
Token ID290
Feature activation-2.29e-01

Pos logit contributions
tr+0.007
ometer+0.006
ute+0.006
pre+0.005
t+0.005
Neg logit contributions
lee-0.004
atted-0.004
 asking-0.004
ered-0.003
 shrugged-0.003
Token the
Token ID262
Feature activation-3.22e+00

Pos logit contributions
atted+0.015
lee+0.014
ered+0.013
 shrugged+0.013
 asking+0.013
Neg logit contributions
tr-0.024
ometer-0.022
 steep-0.020
br-0.019
ute-0.019
Token trees
Token ID7150
Feature activation-3.89e+00

Pos logit contributions
tr+0.093
ometer+0.090
ute+0.075
t+0.069
pre+0.069
Neg logit contributions
lee-0.089
 asking-0.088
 asked-0.086
ered-0.084
atted-0.084
Token.
Token ID13
Feature activation-3.01e+00

Pos logit contributions
tr+0.111
ometer+0.103
ute+0.084
pre+0.081
br+0.080
Neg logit contributions
lee-0.107
atted-0.107
 asking-0.105
ered-0.105
 shrugged-0.100
Token He
Token ID679
Feature activation+2.07e-01

Pos logit contributions
tr+0.090
ometer+0.083
ute+0.070
pre+0.067
br+0.066
Neg logit contributions
lee-0.080
 asking-0.079
atted-0.079
ered-0.078
 shrugged-0.074
Token even
Token ID772
Feature activation+3.10e+00

Pos logit contributions
atted+0.005
lee+0.004
annah+0.004
els+0.004
 asking+0.004
Neg logit contributions
tr-0.007
ometer-0.006
 steep-0.005
t-0.005
br-0.005
Token saw
Token ID2497
Feature activation+5.50e+00

Pos logit contributions
annah+0.072
lee+0.069
els+0.069
ps+0.066
ries+0.065
Neg logit contributions
tr-0.092
ch-0.080
br-0.080
 steep-0.073
ometer-0.071
Token a
Token ID257
Feature activation-4.80e+00

Pos logit contributions
atted+0.153
lee+0.152
ered+0.148
 asking+0.148
 begged+0.143
Neg logit contributions
tr-0.152
ometer-0.137
br-0.110
ute-0.110
pre-0.110
Token cow
Token ID9875
Feature activation-1.81e+00

Pos logit contributions
tr+0.149
ometer+0.140
t+0.118
pre+0.116
ute+0.116
Neg logit contributions
 asking-0.117
 asked-0.112
lee-0.110
 Laura-0.105
 shrugged-0.104
Token and
Token ID290
Feature activation-3.56e-01

Pos logit contributions
tr+0.060
ometer+0.055
ute+0.047
pre+0.045
br+0.044
Neg logit contributions
lee-0.042
atted-0.042
ered-0.041
 asking-0.040
 shrugged-0.039
Token a
Token ID257
Feature activation-4.41e+00

Pos logit contributions
tr+0.011
ometer+0.010
br+0.008
powers+0.008
pre+0.008
Neg logit contributions
atted-0.009
lee-0.009
ered-0.008
els-0.008
ma-0.008
Token horse
Token ID8223
Feature activation-1.79e+00

Pos logit contributions
tr+0.135
ometer+0.127
ute+0.107
t+0.106
pre+0.104
Neg logit contributions
 asking-0.109
lee-0.108
 asked-0.102
atted-0.102
 Laura-0.100
Token with
Token ID351
Feature activation+4.94e+00

Pos logit contributions
atted+0.039
lee+0.039
ered+0.039
ice+0.037
 shrugged+0.037
Neg logit contributions
tr-0.046
ometer-0.043
ute-0.035
t-0.035
pre-0.034
Token the
Token ID262
Feature activation-1.57e+00

Pos logit contributions
atted+0.116
lee+0.114
ered+0.112
 asking+0.112
ice+0.110
Neg logit contributions
tr-0.148
ometer-0.140
 steep-0.122
ute-0.114
pre-0.114
Token wrapped
Token ID12908
Feature activation+5.90e-01

Pos logit contributions
tr+0.040
ometer+0.037
ute+0.028
pre+0.028
t+0.028
Neg logit contributions
lee-0.050
atted-0.049
ered-0.049
ise-0.045
ather-0.045
Token pastry
Token ID41738
Feature activation-6.56e+00

Pos logit contributions
lee+0.018
atted+0.016
 asking+0.016
ered+0.016
 shrugged+0.016
Neg logit contributions
tr-0.017
ometer-0.016
ute-0.013
pre-0.012
t-0.012
Token.
Token ID13
Feature activation-3.73e+00

Pos logit contributions
tr+0.214
ometer+0.197
ute+0.175
br+0.172
pre+0.169
Neg logit contributions
 asking-0.147
ered-0.145
atted-0.143
lee-0.143
 shrugged-0.138
Token The
Token ID383
Feature activation+1.21e+00

Pos logit contributions
tr+0.106
ometer+0.097
ute+0.079
pre+0.076
br+0.075
Neg logit contributions
lee-0.106
atted-0.105
 asking-0.104
ered-0.103
 shrugged-0.099
Token wild
Token ID4295
Feature activation-1.71e-01

Pos logit contributions
lee+0.038
atted+0.038
ered+0.037
 asking+0.036
 shrugged+0.034
Neg logit contributions
tr-0.032
ometer-0.028
ute-0.023
pre-0.023
br-0.022
Token bird
Token ID6512
Feature activation+4.77e-01

Pos logit contributions
tr+0.004
ometer+0.004
ute+0.003
pre+0.003
t+0.003
Neg logit contributions
lee-0.006
atted-0.005
ered-0.005
 asking-0.005
eling-0.005
Token saw
Token ID2497
Feature activation+4.77e+00

Pos logit contributions
lee+0.012
 asking+0.012
 Laura+0.012
riet+0.012
atted+0.012
Neg logit contributions
tr-0.015
ometer-0.013
br-0.011
t-0.011
ch-0.011
Token them
Token ID606
Feature activation-1.53e+00

Pos logit contributions
annah+0.128
lee+0.124
atted+0.123
ered+0.122
eling+0.122
Neg logit contributions
tr-0.133
ometer-0.117
 cho-0.112
pre-0.101
br-0.101
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
tr+0.048
ometer+0.043
ute+0.038
 steep+0.036
ding+0.036
Neg logit contributions
lee-0.038
atted-0.037
ered-0.035
 asks-0.035
 Laura-0.035
Token,
Token ID11
Feature activation-5.94e-03

Pos logit contributions
tr+0.153
ometer+0.149
pre+0.132
ute+0.129
ometers+0.121
Neg logit contributions
ered-0.089
 asking-0.086
atted-0.083
lee-0.079
 asked-0.079
Token chased
Token ID26172
Feature activation+5.63e+00

Pos logit contributions
tr+0.000
ometer+0.000
 steep+0.000
br+0.000
ding+0.000
Neg logit contributions
ered-0.000
lee-0.000
atted-0.000
ere-0.000
eling-0.000
Token it
Token ID340
Feature activation-4.68e+00

Pos logit contributions
 asking+0.108
lee+0.106
ered+0.102
atted+0.101
 asked+0.100
Neg logit contributions
tr-0.217
ometer-0.201
ute-0.176
pre-0.173
br-0.163
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
tr+0.163
ometer+0.157
pre+0.136
ute+0.136
ding+0.125
Neg logit contributions
 asking-0.097
ered-0.096
atted-0.093
lee-0.091
 asked-0.089
Token and
Token ID290
Feature activation-1.31e+00

Pos logit contributions
tr+0.035
ometer+0.030
 steep+0.027
ding+0.027
t+0.027
Neg logit contributions
lee-0.027
ered-0.026
atted-0.026
ere-0.025
els-0.024
Token had
Token ID550
Feature activation+3.79e-01

Pos logit contributions
tr+0.044
ometer+0.041
ute+0.035
pre+0.034
br+0.033
Neg logit contributions
lee-0.030
atted-0.029
 asking-0.029
ered-0.029
 shrugged-0.028
Token lots
Token ID6041
Feature activation-1.40e+00

Pos logit contributions
lee+0.013
atted+0.013
 asking+0.013
ered+0.013
 shrugged+0.012
Neg logit contributions
tr-0.008
ometer-0.007
ute-0.005
br-0.005
pre-0.005
Token of
Token ID286
Feature activation-3.59e-01

Pos logit contributions
tr+0.052
ometer+0.051
ute+0.043
pre+0.041
ometers+0.039
Neg logit contributions
atted-0.028
lee-0.027
 asking-0.027
ered-0.026
 begged-0.025
Token fun
Token ID1257
Feature activation-1.90e+00

Pos logit contributions
tr+0.007
ometer+0.006
ute+0.004
pre+0.004
br+0.004
Neg logit contributions
lee-0.014
atted-0.014
 asking-0.014
ered-0.014
 shrugged-0.013
Token.
Token ID13
Feature activation-3.62e+00

Pos logit contributions
tr+0.067
ometer+0.064
ute+0.051
pre+0.050
 steep+0.049
Neg logit contributions
 asking-0.044
atted-0.041
lee-0.041
ered-0.041
 asked-0.040
Token
Token ID198
Feature activation-1.43e+00

Pos logit contributions
tr+0.113
ometer+0.105
ute+0.091
pre+0.088
br+0.086
Neg logit contributions
lee-0.089
 asking-0.089
atted-0.088
ered-0.086
 asks-0.084
Token
Token ID198
Feature activation-2.25e+00

Pos logit contributions
tr+0.116
ometer+0.107
ute+0.095
pre+0.093
br+0.091
Neg logit contributions
 asks-0.097
 asked-0.093
 asking-0.092
lee-0.092
 begged-0.090
Token
Token ID198
Feature activation-1.68e+00

Pos logit contributions
tr+0.061
ometer+0.055
ute+0.045
pre+0.043
br+0.042
Neg logit contributions
lee-0.066
atted-0.065
 asking-0.065
ered-0.065
 shrugged-0.062
Token"
Token ID1
Feature activation-1.64e+00

Pos logit contributions
tr+0.047
ometer+0.044
ute+0.035
pre+0.035
powers+0.033
Neg logit contributions
 asking-0.048
ered-0.046
lee-0.046
atted-0.046
 asked-0.046
TokenI
Token ID40
Feature activation-8.58e-01

Pos logit contributions
ometer+0.046
tr+0.045
ute+0.038
powers+0.035
 steep+0.035
Neg logit contributions
 asked-0.044
 asking-0.044
 asks-0.043
 begged-0.043
 shrugged-0.040
Token want
Token ID765
Feature activation+1.79e+00

Pos logit contributions
tr+0.025
ometer+0.023
ute+0.018
pre+0.018
t+0.018
Neg logit contributions
lee-0.024
atted-0.024
 asking-0.023
ered-0.023
 shrugged-0.022
Token cheese
Token ID9891
Feature activation-5.28e+00

Pos logit contributions
atted+0.056
lee+0.056
ered+0.055
 asking+0.052
 shrugged+0.052
Neg logit contributions
tr-0.045
ometer-0.041
ute-0.031
t-0.031
br-0.031
Token and
Token ID290
Feature activation-4.18e+00

Pos logit contributions
tr+0.189
ometer+0.177
ute+0.153
pre+0.149
t+0.144
Neg logit contributions
 asking-0.105
lee-0.104
atted-0.104
ered-0.104
 shrugged-0.097
Token ham
Token ID8891
Feature activation-3.18e+00

Pos logit contributions
tr+0.125
ometer+0.116
br+0.093
pre+0.092
ute+0.092
Neg logit contributions
atted-0.111
lee-0.111
ered-0.110
 asking-0.105
 shrugged-0.101
Token and
Token ID290
Feature activation-3.91e+00

Pos logit contributions
tr+0.106
ometer+0.100
ute+0.084
pre+0.082
br+0.080
Neg logit contributions
lee-0.074
atted-0.073
 asking-0.071
ered-0.071
 shrugged-0.068
Token pineapple
Token ID45540
Feature activation-3.29e+00

Pos logit contributions
tr+0.117
ometer+0.106
br+0.086
pre+0.085
ute+0.084
Neg logit contributions
atted-0.106
lee-0.105
ered-0.105
 asking-0.099
 shrugged-0.096
Token,"
Token ID553
Feature activation+9.98e-01

Pos logit contributions
tr+0.105
ometer+0.098
ute+0.082
pre+0.080
br+0.080
Neg logit contributions
lee-0.081
atted-0.079
 asking-0.078
ered-0.078
 shrugged-0.074
Token day
Token ID1110
Feature activation-5.16e+00

Pos logit contributions
tr+0.062
ometer+0.057
ute+0.047
pre+0.045
br+0.045
Neg logit contributions
lee-0.059
atted-0.059
 asking-0.058
ered-0.057
 shrugged-0.055
Token!
Token ID0
Feature activation-3.76e+00

Pos logit contributions
tr+0.141
ometer+0.130
ute+0.107
t+0.104
pre+0.104
Neg logit contributions
atted-0.144
lee-0.144
ered-0.143
 asking-0.141
 shrugged-0.138
Token<|endoftext|>
Token ID50256
Feature activation+6.25e+00

Pos logit contributions
tr+0.120
ometer+0.109
ute+0.093
br+0.089
pre+0.089
Neg logit contributions
lee-0.096
atted-0.093
 asking-0.091
 asks-0.090
 Laura-0.089
TokenBob
Token ID18861
Feature activation+1.11e+01

Pos logit contributions
lee+0.189
atted+0.188
ered+0.185
 asking+0.185
 shrugged+0.177
Neg logit contributions
tr-0.163
ometer-0.149
ute-0.120
t-0.115
pre-0.114
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
ered+0.307
 hesitation+0.292
 asking+0.289
lee+0.287
eling+0.281
Neg logit contributions
tr-0.289
ometer-0.266
ite-0.215
pre-0.211
ding-0.211
Token a
Token ID257
Feature activation-1.17e+00

Pos logit contributions
lee+0.170
ered+0.164
atted+0.161
els+0.159
ice+0.154
Neg logit contributions
tr-0.149
ometer-0.125
ute-0.110
 steep-0.105
ding-0.103
Token little
Token ID1310
Feature activation+3.51e+00

Pos logit contributions
tr+0.044
ometer+0.041
t+0.035
ute+0.035
ometers+0.034
Neg logit contributions
 asked-0.022
lee-0.022
 asking-0.022
 Laura-0.021
 shrugged-0.021
Token boy
Token ID2933
Feature activation+3.43e+00

Pos logit contributions
 asked+0.092
 asking+0.091
 asks+0.090
lee+0.090
atted+0.089
Neg logit contributions
tr-0.111
ometer-0.096
t-0.083
br-0.082
ute-0.080
Token with
Token ID351
Feature activation+6.54e+00

Pos logit contributions
lee+0.089
atted+0.088
 asking+0.087
ered+0.086
 shrugged+0.082
Neg logit contributions
tr-0.104
ometer-0.097
ute-0.081
pre-0.078
br-0.077
Token a
Token ID257
Feature activation-2.03e+00

Pos logit contributions
atted+0.171
lee+0.171
ered+0.168
 asking+0.167
 shrugged+0.159
Neg logit contributions
tr-0.198
ometer-0.181
ute-0.150
pre-0.145
br-0.145
Token big
Token ID1263
Feature activation+1.58e+00

Pos logit contributions
tr+0.062
ometer+0.057
t+0.049
ute+0.047
pre+0.047
Neg logit contributions
lee-0.051
 shrugged-0.050
 asking-0.049
atted-0.049
 asked-0.048
Token a
Token ID257
Feature activation+6.01e-01

Pos logit contributions
lee+0.174
atted+0.173
 asking+0.170
ered+0.170
 shrugged+0.162
Neg logit contributions
tr-0.202
ometer-0.184
ute-0.157
pre-0.152
br-0.148
Token purple
Token ID14032
Feature activation+2.14e+00

Pos logit contributions
lee+0.008
 asked+0.008
 asking+0.008
 Laura+0.007
 asks+0.007
Neg logit contributions
tr-0.026
ometer-0.024
t-0.021
ometers-0.021
ute-0.021
Token candy
Token ID18550
Feature activation-2.15e+00

Pos logit contributions
lee+0.055
atted+0.054
 asking+0.053
 shrugged+0.052
 asked+0.051
Neg logit contributions
tr-0.063
ometer-0.060
t-0.049
powers-0.048
br-0.047
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
tr+0.066
ometer+0.064
powers+0.054
 steep+0.052
pre+0.051
Neg logit contributions
 asking-0.050
lee-0.049
 asked-0.048
 shrugged-0.047
atted-0.047
Token It
Token ID632
Feature activation-3.34e+00

Pos logit contributions
tr+0.041
ometer+0.038
ute+0.031
pre+0.030
br+0.030
Neg logit contributions
lee-0.038
atted-0.037
 asking-0.037
ered-0.036
 shrugged-0.035
Token was
Token ID373
Feature activation+8.14e-01

Pos logit contributions
tr+0.114
ometer+0.105
ute+0.089
pre+0.087
br+0.086
Neg logit contributions
lee-0.075
atted-0.075
 asking-0.074
ered-0.073
 shrugged-0.068
Token so
Token ID523
Feature activation-4.52e+00

Pos logit contributions
atted+0.029
lee+0.029
ered+0.028
eling+0.028
 asks+0.028
Neg logit contributions
tr-0.018
ometer-0.014
 steep-0.011
ute-0.011
 cho-0.010
Token small
Token ID1402
Feature activation+2.48e+00

Pos logit contributions
tr+0.102
ometer+0.094
ute+0.072
t+0.069
pre+0.068
Neg logit contributions
atted-0.149
 asking-0.149
ered-0.148
lee-0.147
 shrugged-0.142
Token that
Token ID326
Feature activation-3.18e+00

Pos logit contributions
lee+0.060
atted+0.060
 asking+0.059
ered+0.058
 shrugged+0.056
Neg logit contributions
tr-0.080
ometer-0.074
ute-0.063
pre-0.060
br-0.059
Token everyone
Token ID2506
Feature activation+6.45e-02

Pos logit contributions
tr+0.100
ometer+0.093
ute+0.078
pre+0.075
br+0.073
Neg logit contributions
lee-0.080
 asking-0.080
ered-0.079
atted-0.079
 shrugged-0.075
Token wanted
Token ID2227
Feature activation+1.74e+00

Pos logit contributions
lee+0.001
 asking+0.001
atted+0.001
ered+0.001
 Laura+0.001
Neg logit contributions
tr-0.002
ometer-0.002
ute-0.002
pre-0.002
t-0.002
Token it
Token ID340
Feature activation-7.04e+00

Pos logit contributions
lee+0.026
atted+0.026
 asking+0.026
ered+0.026
 shrugged+0.024
Neg logit contributions
tr-0.033
ometer-0.031
ute-0.026
pre-0.025
br-0.025
Token."
Token ID526
Feature activation-1.33e+00

Pos logit contributions
tr+0.183
ometer+0.165
ute+0.132
t+0.128
br+0.124
Neg logit contributions
lee-0.217
atted-0.215
 asking-0.210
ered-0.210
 shrugged-0.203
Token Lucy
Token ID22162
Feature activation+8.01e+00

Pos logit contributions
tr+0.050
ometer+0.048
t+0.040
ute+0.039
pre+0.039
Neg logit contributions
lee-0.025
ered-0.025
atted-0.025
 asking-0.024
els-0.023
Token thought
Token ID1807
Feature activation+3.17e+00

Pos logit contributions
els+0.224
ries+0.217
eling+0.210
irs+0.204
 asking+0.201
Neg logit contributions
tr-0.213
ometer-0.191
t-0.157
pre-0.149
Snow-0.137
Token for
Token ID329
Feature activation+2.58e+00

Pos logit contributions
atted+0.096
 asks+0.095
lee+0.094
ered+0.092
 asking+0.092
Neg logit contributions
tr-0.072
ometer-0.071
t-0.057
ute-0.055
 steep-0.054
Token a
Token ID257
Feature activation-4.45e+00

Pos logit contributions
atted+0.076
lee+0.075
ered+0.070
 asks+0.069
 asking+0.069
Neg logit contributions
tr-0.073
ometer-0.064
t-0.051
ute-0.051
Snow-0.051
Token moment
Token ID2589
Feature activation-1.55e+00

Pos logit contributions
tr+0.128
ometer+0.119
ute+0.106
pre+0.100
ometers+0.097
Neg logit contributions
 asking-0.121
 begged-0.115
 asked-0.115
 shrugged-0.114
 Laura-0.114
Token and
Token ID290
Feature activation-2.55e-01

Pos logit contributions
tr+0.053
ometer+0.049
t+0.042
ute+0.042
br+0.041
Neg logit contributions
lee-0.034
atted-0.034
ered-0.034
 asks-0.031
 asking-0.031
Token then
Token ID788
Feature activation+8.48e-01

Pos logit contributions
tr+0.008
ometer+0.007
pre+0.006
Snow+0.006
ding+0.006
Neg logit contributions
atted-0.006
els-0.006
lee-0.006
ered-0.005
ps-0.005
Token gave
Token ID2921
Feature activation+6.56e+00

Pos logit contributions
eling+0.017
els+0.017
ere+0.017
ice+0.017
atted+0.016
Neg logit contributions
tr-0.028
ometer-0.027
br-0.021
pre-0.021
 cho-0.021
Token the
Token ID262
Feature activation-1.21e+00

Pos logit contributions
lee+0.184
atted+0.172
eling+0.159
ather+0.155
ise+0.154
Neg logit contributions
tr-0.175
ometer-0.165
 cho-0.134
 helicopter-0.131
t-0.130
Token on
Token ID319
Feature activation+5.95e-01

Pos logit contributions
tr+0.123
ometer+0.111
ute+0.091
pre+0.090
t+0.085
Neg logit contributions
lee-0.118
atted-0.114
ered-0.112
 asking-0.111
 asks-0.108
Token the
Token ID262
Feature activation-4.15e+00

Pos logit contributions
atted+0.013
lee+0.012
ered+0.012
 asking+0.011
 shrugged+0.011
Neg logit contributions
tr-0.022
ometer-0.020
ute-0.017
br-0.017
t-0.017
Token tray
Token ID26473
Feature activation-5.54e+00

Pos logit contributions
tr+0.137
ometer+0.127
ute+0.107
pre+0.104
br+0.102
Neg logit contributions
lee-0.099
atted-0.098
ered-0.096
 asking-0.096
 shrugged-0.091
Token and
Token ID290
Feature activation-7.01e-01

Pos logit contributions
tr+0.187
ometer+0.170
ute+0.143
t+0.142
pre+0.137
Neg logit contributions
lee-0.135
atted-0.130
ered-0.127
 asking-0.125
eling-0.120
Token put
Token ID1234
Feature activation+3.26e+00

Pos logit contributions
tr+0.024
ometer+0.020
pre+0.017
br+0.017
 roof+0.016
Neg logit contributions
lee-0.017
atted-0.016
ere-0.016
ered-0.015
andy-0.014
Token it
Token ID340
Feature activation-6.31e+00

Pos logit contributions
lee+0.078
atted+0.076
 shrugged+0.072
ered+0.072
 asking+0.072
Neg logit contributions
tr-0.106
ometer-0.099
br-0.082
pre-0.081
t-0.081
Token in
Token ID287
Feature activation-5.40e-02

Pos logit contributions
tr+0.180
ometer+0.162
ute+0.131
pre+0.127
br+0.126
Neg logit contributions
lee-0.182
atted-0.177
 asking-0.176
ered-0.173
 shrugged-0.168
Token the
Token ID262
Feature activation-3.52e+00

Pos logit contributions
tr+0.002
ometer+0.002
Snow+0.001
 steep+0.001
t+0.001
Neg logit contributions
atted-0.001
lee-0.001
ered-0.001
 shrugged-0.001
eling-0.001
Token stove
Token ID27635
Feature activation-4.79e+00

Pos logit contributions
tr+0.138
ometer+0.131
ute+0.116
t+0.114
pre+0.112
Neg logit contributions
 asking-0.057
lee-0.057
atted-0.056
ered-0.054
 asks-0.052
Token.
Token ID13
Feature activation-3.68e+00

Pos logit contributions
tr+0.122
ometer+0.110
ute+0.084
pre+0.081
t+0.080
Neg logit contributions
lee-0.152
atted-0.149
ered-0.149
 asking-0.143
 shrugged-0.142
Token She
Token ID1375
Feature activation-1.23e+00

Pos logit contributions
tr+0.117
ometer+0.109
ute+0.090
pre+0.088
br+0.086
Neg logit contributions
atted-0.091
ered-0.091
lee-0.091
 asking-0.089
 shrugged-0.085
TokenOne
Token ID3198
Feature activation+8.30e-01

Pos logit contributions
tr+0.009
ometer+0.008
ute+0.007
pre+0.006
t+0.006
Neg logit contributions
lee-0.009
atted-0.009
 asking-0.009
ered-0.009
 shrugged-0.009
Token day
Token ID1110
Feature activation+7.80e-01

Pos logit contributions
lee+0.029
atted+0.029
 asking+0.028
ered+0.028
 shrugged+0.027
Neg logit contributions
tr-0.018
ometer-0.016
ute-0.012
pre-0.012
br-0.011
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
lee+0.018
atted+0.018
ered+0.017
eling+0.016
 asking+0.016
Neg logit contributions
tr-0.025
ometer-0.024
br-0.021
t-0.021
ute-0.020
Token he
Token ID339
Feature activation+1.35e+00

Pos logit contributions
lee+0.078
atted+0.077
 asking+0.077
ered+0.076
 shrugged+0.072
Neg logit contributions
tr-0.128
ometer-0.120
ute-0.103
pre-0.099
br-0.098
Token found
Token ID1043
Feature activation+5.41e+00

Pos logit contributions
atted+0.034
els+0.033
lee+0.032
ered+0.032
ries+0.031
Neg logit contributions
tr-0.039
ometer-0.033
t-0.029
 steep-0.029
ute-0.028
Token something
Token ID1223
Feature activation-6.63e+00

Pos logit contributions
lee+0.168
atted+0.164
 asking+0.157
ered+0.156
 asks+0.156
Neg logit contributions
tr-0.139
ometer-0.124
ute-0.097
pre-0.096
br-0.095
Token heavy
Token ID4334
Feature activation+1.06e+00

Pos logit contributions
tr+0.184
ometer+0.172
ute+0.139
pre+0.134
ding+0.129
Neg logit contributions
 asking-0.191
lee-0.186
ered-0.185
atted-0.183
 asked-0.180
Token in
Token ID287
Feature activation-2.09e-01

Pos logit contributions
lee+0.027
atted+0.027
ered+0.026
 asking+0.026
 shrugged+0.025
Neg logit contributions
tr-0.033
ometer-0.030
ute-0.025
pre-0.024
t-0.024
Token his
Token ID465
Feature activation-2.31e+00

Pos logit contributions
tr+0.007
ometer+0.006
 steep+0.005
 road+0.005
 license+0.005
Neg logit contributions
atted-0.005
lee-0.005
ered-0.005
 shrugged-0.004
 begged-0.004
Token closet
Token ID21615
Feature activation-3.82e+00

Pos logit contributions
tr+0.074
ometer+0.070
ute+0.059
pre+0.059
t+0.058
Neg logit contributions
lee-0.055
atted-0.054
 asking-0.053
ered-0.052
 shrugged-0.050
Token -
Token ID532
Feature activation-2.01e+00

Pos logit contributions
tr+0.104
ometer+0.091
t+0.073
ute+0.072
pre+0.069
Neg logit contributions
atted-0.116
lee-0.116
ered-0.113
 asking-0.111
 asks-0.109
Token learned
Token ID4499
Feature activation+2.09e+00

Pos logit contributions
tr+0.049
ometer+0.043
pre+0.038
ute+0.037
br+0.036
Neg logit contributions
 asking-0.030
lee-0.030
atted-0.030
 Laura-0.029
ered-0.028
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
 asking+0.049
atted+0.048
ered+0.048
lee+0.048
 asked+0.045
Neg logit contributions
tr-0.068
ometer-0.065
ding-0.055
storm-0.054
pre-0.054
Token be
Token ID307
Feature activation-1.32e+00

Pos logit contributions
tr+0.031
ometer+0.028
ute+0.024
pre+0.022
t+0.022
Neg logit contributions
lee-0.033
atted-0.032
 asking-0.030
ered-0.030
 shrugged-0.030
Token happy
Token ID3772
Feature activation+1.26e+00

Pos logit contributions
tr+0.039
ometer+0.035
ute+0.029
br+0.029
pre+0.029
Neg logit contributions
lee-0.036
atted-0.035
ered-0.034
 asking-0.033
 Laura-0.033
Token with
Token ID351
Feature activation+1.11e+00

Pos logit contributions
 asking+0.030
atted+0.029
lee+0.029
ered+0.028
 asked+0.028
Neg logit contributions
tr-0.042
ometer-0.039
ute-0.033
ding-0.032
pre-0.032
Token what
Token ID644
Feature activation-7.26e+00

Pos logit contributions
ited+0.022
atted+0.022
eled+0.020
lee+0.020
 shrugged+0.020
Neg logit contributions
tr-0.037
ometer-0.034
 much-0.030
 well-0.030
 valuable-0.030
Token she
Token ID673
Feature activation-5.02e+00

Pos logit contributions
tr+0.205
ometer+0.190
ute+0.154
t+0.152
pre+0.151
Neg logit contributions
lee-0.203
atted-0.201
ered-0.198
 asking-0.192
 shrugged-0.185
Token had
Token ID550
Feature activation-2.86e+00

Pos logit contributions
tr+0.164
ometer+0.148
pre+0.130
t+0.128
ute+0.126
Neg logit contributions
lee-0.120
atted-0.118
ered-0.114
 Laura-0.111
 asking-0.108
Token.
Token ID13
Feature activation-4.51e+00

Pos logit contributions
tr+0.070
ometer+0.063
t+0.055
ute+0.055
pre+0.051
Neg logit contributions
lee-0.090
atted-0.089
ered-0.085
 Laura-0.084
ice-0.081
Token She
Token ID1375
Feature activation-1.59e+00

Pos logit contributions
tr+0.133
ometer+0.123
ute+0.101
pre+0.098
br+0.096
Neg logit contributions
lee-0.122
atted-0.121
ered-0.120
 asking-0.119
 shrugged-0.114
Token learned
Token ID4499
Feature activation+2.15e+00

Pos logit contributions
tr+0.057
ometer+0.050
pre+0.044
ute+0.043
br+0.042
Neg logit contributions
 asking-0.034
lee-0.033
atted-0.033
ered-0.032
 Laura-0.032
Token were
Token ID547
Feature activation+6.22e-01

Pos logit contributions
tr+0.043
ometer+0.040
ute+0.033
t+0.033
pre+0.032
Neg logit contributions
lee-0.036
 asking-0.035
atted-0.035
 asks-0.034
ered-0.034
Token clean
Token ID3424
Feature activation+1.95e+00

Pos logit contributions
lee+0.020
 asks+0.020
atted+0.020
 Laura+0.020
riet+0.020
Neg logit contributions
tr-0.013
ometer-0.011
 steep-0.010
 much-0.009
 valuable-0.009
Token before
Token ID878
Feature activation-3.91e+00

Pos logit contributions
lee+0.051
atted+0.051
ered+0.050
 asking+0.050
 asks+0.048
Neg logit contributions
tr-0.059
ometer-0.054
ute-0.045
pre-0.043
t-0.043
Token touching
Token ID15241
Feature activation+9.66e-01

Pos logit contributions
tr+0.116
ometer+0.109
ute+0.090
pre+0.087
t+0.087
Neg logit contributions
atted-0.101
lee-0.101
ered-0.099
 asking-0.096
 shrugged-0.094
Token the
Token ID262
Feature activation-4.15e+00

Pos logit contributions
lee+0.020
atted+0.020
 asking+0.019
ered+0.019
 shrugged+0.018
Neg logit contributions
tr-0.034
ometer-0.032
ute-0.027
br-0.026
pre-0.026
Token page
Token ID2443
Feature activation-7.08e+00

Pos logit contributions
tr+0.142
ometer+0.134
ute+0.116
pre+0.110
br+0.109
Neg logit contributions
lee-0.090
 asking-0.090
atted-0.089
ered-0.087
 shrugged-0.083
Token again
Token ID757
Feature activation-1.43e+00

Pos logit contributions
tr+0.211
ometer+0.196
ute+0.165
pre+0.159
br+0.154
Neg logit contributions
lee-0.185
 asking-0.183
atted-0.182
ered-0.182
 shrugged-0.172
Token.
Token ID13
Feature activation-3.62e+00

Pos logit contributions
tr+0.035
ometer+0.032
ute+0.026
pre+0.024
br+0.024
Neg logit contributions
atted-0.045
lee-0.044
ered-0.043
 asking-0.042
 shrugged-0.042
Token He
Token ID679
Feature activation+2.23e-01

Pos logit contributions
tr+0.109
ometer+0.099
ute+0.085
pre+0.082
br+0.082
Neg logit contributions
lee-0.096
atted-0.094
 asking-0.093
ered-0.091
 asks-0.088
Token finally
Token ID3443
Feature activation+4.55e+00

Pos logit contributions
 asking+0.006
atted+0.005
lee+0.005
els+0.005
 Laura+0.005
Neg logit contributions
tr-0.007
ometer-0.006
pre-0.005
 steep-0.005
t-0.005
Token turned
Token ID2900
Feature activation+3.54e+00

Pos logit contributions
 asking+0.111
els+0.110
atted+0.109
ries+0.099
lee+0.097
Neg logit contributions
tr-0.134
br-0.116
 steep-0.115
ometer-0.112
t-0.110
Token"
Token ID1
Feature activation-1.88e+00

Pos logit contributions
tr+0.050
ometer+0.046
ute+0.040
br+0.037
pre+0.037
Neg logit contributions
lee-0.041
atted-0.041
ered-0.040
 asking-0.040
 shrugged-0.038
TokenMaybe
Token ID13300
Feature activation-5.73e-01

Pos logit contributions
tr+0.046
ometer+0.043
ute+0.034
br+0.031
 steep+0.031
Neg logit contributions
 asking-0.059
atted-0.058
lee-0.058
ered-0.057
 asked-0.057
Token we
Token ID356
Feature activation-7.85e-01

Pos logit contributions
tr+0.012
ometer+0.010
ute+0.008
pre+0.007
br+0.007
Neg logit contributions
lee-0.021
atted-0.021
ered-0.020
 asking-0.020
 shrugged-0.019
Token can
Token ID460
Feature activation-3.52e+00

Pos logit contributions
tr+0.021
ometer+0.019
ute+0.016
pre+0.015
br+0.015
Neg logit contributions
lee-0.023
ered-0.023
atted-0.023
 asking-0.023
 asked-0.023
Token fix
Token ID4259
Feature activation+1.65e+00

Pos logit contributions
tr+0.103
ometer+0.094
ute+0.077
t+0.074
pre+0.073
Neg logit contributions
lee-0.098
atted-0.098
 asking-0.095
ered-0.093
 shrugged-0.091
Token it
Token ID340
Feature activation-6.30e+00

Pos logit contributions
lee+0.035
atted+0.035
 asking+0.034
ered+0.034
 shrugged+0.032
Neg logit contributions
tr-0.058
ometer-0.054
ute-0.046
pre-0.045
br-0.044
Token,"
Token ID553
Feature activation+6.84e-01

Pos logit contributions
tr+0.204
ometer+0.189
ute+0.159
t+0.153
br+0.153
Neg logit contributions
lee-0.155
atted-0.153
 asking-0.148
ered-0.146
 shrugged-0.142
Token Ben
Token ID3932
Feature activation+7.85e+00

Pos logit contributions
atted+0.013
lee+0.013
ered+0.013
 asking+0.012
eling+0.011
Neg logit contributions
tr-0.025
ometer-0.024
ute-0.020
pre-0.020
br-0.019
Token said
Token ID531
Feature activation+3.62e+00

Pos logit contributions
eling+0.218
lee+0.210
ered+0.210
atted+0.208
els+0.200
Neg logit contributions
tr-0.217
ometer-0.194
t-0.164
br-0.151
 license-0.143
Token.
Token ID13
Feature activation-3.88e+00

Pos logit contributions
lee+0.110
atted+0.108
ered+0.107
 asking+0.107
eling+0.104
Neg logit contributions
tr-0.091
ometer-0.086
ute-0.066
t-0.066
pre-0.065
Token He
Token ID679
Feature activation-8.75e-01

Pos logit contributions
tr+0.110
ometer+0.101
ute+0.085
pre+0.081
br+0.078
Neg logit contributions
 asking-0.108
lee-0.108
atted-0.107
ered-0.104
 asks-0.103