TOP ACTIVATIONS
MAX = 0.176

before dinner . <|endoftext|> S aman tha was a lazy girl
. The end . <|endoftext|> Once upon a time , there
was no . <|endoftext|> S aman tha opened the door and
in the field . <|endoftext|> Once upon a time , there
many new friends . <|endoftext|> Once upon a time , there
fun playing together . <|endoftext|> Once upon a time , there
the dish washer . <|endoftext|> Once upon a time , there
. The end . <|endoftext|> Once upon a time , there
. He shouted , " Ar gh mate ys !"
happy and independent . <|endoftext|> Once upon a time , there
with Mitt ens . <|endoftext|> Once upon a time , there
and cried too . <|endoftext|> Once upon a time , there
. The end . <|endoftext|> Once upon a time , there
best of friends . <|endoftext|> Once upon a time , there
. The end . <|endoftext|> Once upon a time , there
. The end . <|endoftext|> Once upon a time , there
all day long . <|endoftext|> Once upon a time , there
its soft embrace . <|endoftext|> Once upon a time , there
. The end . <|endoftext|> Once upon a time , there
on adventures together . <|endoftext|> Once upon a time , there

MIN ACTIVATIONS
MIN = -0.181

the machine was a shred der , that could cut things
This made her even glad der and she smiled with joy
and Lily felt even sad der . From that day on
car made him even sad der . He asked everyone if
found something even we ir der ! There were little plastic
nothing made it p rou der . The
and it was even red der . Suddenly , she had
look even smaller and sad der . <|endoftext|> Once upon a
. Tim felt even sad der now , and he cried
said no , the mad der Sara got . Then ,
it made her even sad der . Suddenly ,
Then something even we ir der happened . The animal suddenly
girl felt even p rou der . She was happy that
. Red felt even sad der . But then
Tim my was even sad der now . He said ,
The ocean was a sad der place without Fin 's happy
. He felt even sad der . Max sat
sail and a red rud der . One day the boat
. He felt even sad der now . But then ,
. Lily felt even sad der than before . From that

INTERVAL 0.087 - 0.176
CONTAINS 5.734%

what it could be . " What 's that
at the fruit . " I want an apple
whole day playing together . The rabbit visited the
at the mess . " Oh , Ben ,
to me ?" L ily finally gave her

INTERVAL -0.002 - 0.087
CONTAINS 44.621%

upon a time , in a big sky , there was
and that even little kids can make a big difference .
their new friend and promised to always remember the happy times
He continued to hop back home , and
in the chairs did see Lily . They started to whisper

INTERVAL -0.092 - -0.002
CONTAINS 48.444%

. She thought it was not ordinary . "
here ?" Mom says . " Why are you fighting ?"
" I 'm sorry , kids . Rex is
and water and got in the car . They
. She was so excited - she loved spending time with

INTERVAL -0.181 - -0.092
CONTAINS 1.201%

mountain . The ant knew it was a big job ,
wrote ' friendly ' on it . He couldn 't wait
berry muff ins for you . They are in the kitchen
said , " Take this ! It 's special like you
. I did not know what that button did . Please
Token before
Token ID878
Feature activation-3.56e-02

Pos logit contributions
der+0.016
 that+0.010
 even+0.009
 valuable+0.009
 for+0.009
Neg logit contributions
een-0.038
loe-0.037
Laura-0.036
Jamie-0.036
Mary-0.036
Token dinner
Token ID8073
Feature activation-1.97e-02

Pos logit contributions
der+0.047
 that+0.034
 even+0.032
 he+0.031
 for+0.031
Neg logit contributions
een-0.082
loe-0.080
Laura-0.079
Jamie-0.078
Mary-0.077
Token.
Token ID13
Feature activation-6.58e-02

Pos logit contributions
der+0.025
 that+0.017
 even+0.016
 for+0.016
 valuable+0.016
Neg logit contributions
een-0.046
loe-0.046
Laura-0.045
Jamie-0.044
Mary-0.044
Token<|endoftext|>
Token ID50256
Feature activation+1.21e-01

Pos logit contributions
der+0.081
 he+0.055
 that+0.054
 even+0.052
 valuable+0.051
Neg logit contributions
een-0.158
loe-0.149
prise-0.146
Laura-0.145
Jamie-0.145
TokenS
Token ID50
Feature activation+1.05e-01

Pos logit contributions
een+0.286
ited+0.286
prise+0.285
elve+0.284
Laura+0.283
Neg logit contributions
 he-0.102
der-0.100
 that-0.100
.-0.100
 He-0.099
Tokenaman
Token ID10546
Feature activation+1.76e-01

Pos logit contributions
loe+0.145
een+0.144
Ali+0.141
e+0.135
icians+0.135
Neg logit contributions
 that-0.198
der-0.189
 for-0.184
 he-0.175
 back-0.171
Tokentha
Token ID12898
Feature activation+1.02e-01

Pos logit contributions
loe+0.227
Laura+0.222
een+0.222
 Deal+0.219
prise+0.216
Neg logit contributions
der-0.367
 that-0.358
 he-0.325
ly-0.309
 for-0.305
Token was
Token ID373
Feature activation+7.92e-02

Pos logit contributions
een+0.273
loe+0.273
Laura+0.265
Mary+0.261
Jamie+0.260
Neg logit contributions
der-0.084
 that-0.059
 even-0.053
 for-0.049
 he-0.048
Token a
Token ID257
Feature activation+3.66e-02

Pos logit contributions
een+0.185
loe+0.177
Laura+0.173
Mary+0.171
Jamie+0.170
Neg logit contributions
der-0.101
 even-0.077
 for-0.075
 that-0.074
 valuable-0.068
Token lazy
Token ID16931
Feature activation-5.50e-02

Pos logit contributions
een+0.073
loe+0.071
Laura+0.071
Jamie+0.069
Mary+0.068
Neg logit contributions
der-0.060
 valuable-0.042
 that-0.042
 even-0.041
 for-0.039
Token girl
Token ID2576
Feature activation+3.35e-02

Pos logit contributions
der+0.078
 valuable+0.057
attered+0.053
 smoothly+0.051
 even+0.048
Neg logit contributions
een-0.116
loe-0.113
Laura-0.112
iley-0.109
Mary-0.108
Token.
Token ID13
Feature activation-7.33e-02

Pos logit contributions
der+0.062
 valuable+0.032
 that+0.030
 even+0.029
 he+0.025
Neg logit contributions
een-0.177
loe-0.173
Laura-0.171
Jamie-0.169
Mary-0.168
Token The
Token ID383
Feature activation-2.87e-02

Pos logit contributions
der+0.154
ly+0.118
 valuable+0.116
 even+0.115
 helicopter+0.112
Neg logit contributions
loe-0.111
een-0.108
Laura-0.106
Mary-0.102
Jamie-0.099
Token end
Token ID886
Feature activation-1.17e-02

Pos logit contributions
der+0.045
 that+0.029
 valuable+0.029
ly+0.029
 even+0.028
Neg logit contributions
een-0.059
loe-0.058
Laura-0.057
Mary-0.056
Jamie-0.055
Token.
Token ID13
Feature activation-6.85e-02

Pos logit contributions
der+0.015
 ot+0.009
attered+0.009
 valuable+0.009
 yacht+0.009
Neg logit contributions
een-0.026
loe-0.026
Laura-0.026
Mary-0.026
e-0.025
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
der+0.142
 valuable+0.106
ly+0.105
 even+0.104
 that+0.102
Neg logit contributions
een-0.105
loe-0.104
Laura-0.102
Mary-0.098
Jamie-0.097
TokenOnce
Token ID7454
Feature activation+1.76e-01

Pos logit contributions
een+0.350
loe+0.344
prise+0.340
elve+0.338
Laura+0.335
Neg logit contributions
der-0.153
 that-0.137
 he-0.118
 for-0.115
 even-0.115
Token upon
Token ID2402
Feature activation+1.13e-01

Pos logit contributions
een+0.218
loe+0.205
Laura+0.199
Jamie+0.192
Mary+0.190
Neg logit contributions
der-0.419
 that-0.351
 valuable-0.343
 for-0.341
 even-0.341
Token a
Token ID257
Feature activation+6.25e-02

Pos logit contributions
een+0.252
loe+0.247
Laura+0.241
Jamie+0.237
Mary+0.235
Neg logit contributions
der-0.157
 that-0.107
 valuable-0.106
 even-0.105
 for-0.098
Token time
Token ID640
Feature activation+8.78e-02

Pos logit contributions
een+0.107
loe+0.104
icians+0.103
Jamie+0.100
igh+0.098
Neg logit contributions
der-0.109
 that-0.097
 for-0.096
 important-0.095
 even-0.094
Token,
Token ID11
Feature activation+5.50e-02

Pos logit contributions
Laura+0.253
Jamie+0.246
een+0.245
Josh+0.239
Mary+0.238
Neg logit contributions
 that-0.059
der-0.058
 for-0.056
 he-0.048
 they-0.045
Token there
Token ID612
Feature activation+3.87e-02

Pos logit contributions
een+0.098
loe+0.095
Laura+0.093
Jamie+0.091
Mary+0.090
Neg logit contributions
der-0.100
 that-0.084
 for-0.081
 even-0.081
 he-0.079
Token was
Token ID373
Feature activation+4.50e-02

Pos logit contributions
der+0.015
 valuable+0.011
 that+0.010
 even+0.010
 for+0.010
Neg logit contributions
een-0.022
loe-0.021
Laura-0.021
Jamie-0.020
Mary-0.020
Token no
Token ID645
Feature activation+1.70e-03

Pos logit contributions
een+0.096
loe+0.094
Laura+0.091
Jamie+0.090
Mary+0.089
Neg logit contributions
der-0.068
 that-0.050
 even-0.049
 valuable-0.049
 for-0.046
Token.
Token ID13
Feature activation-7.60e-02

Pos logit contributions
een+0.003
loe+0.003
Laura+0.003
Mary+0.003
Jamie+0.003
Neg logit contributions
der-0.003
attered-0.002
 valuable-0.002
 that-0.002
 missile-0.002
Token<|endoftext|>
Token ID50256
Feature activation+1.37e-01

Pos logit contributions
der+0.121
 valuable+0.086
 that+0.085
 he+0.084
 even+0.082
Neg logit contributions
een-0.157
loe-0.147
e-0.145
prise-0.145
Mary-0.143
TokenS
Token ID50
Feature activation+1.07e-01

Pos logit contributions
een+0.260
prise+0.255
Laura+0.251
elve+0.251
ighed+0.249
Neg logit contributions
der-0.212
 he-0.178
 that-0.174
 around-0.166
 He-0.165
Tokenaman
Token ID10546
Feature activation+1.75e-01

Pos logit contributions
een+0.143
ighed+0.137
e+0.136
loe+0.131
Ali+0.130
Neg logit contributions
 that-0.204
der-0.191
 back-0.186
 he-0.185
 for-0.185
Tokentha
Token ID12898
Feature activation+9.86e-02

Pos logit contributions
loe+0.190
een+0.187
Laura+0.181
Jamie+0.168
prise+0.165
Neg logit contributions
der-0.416
 that-0.386
 he-0.356
 for-0.347
ly-0.346
Token opened
Token ID4721
Feature activation+6.99e-02

Pos logit contributions
een+0.224
loe+0.220
Laura+0.215
Jamie+0.212
Mary+0.210
Neg logit contributions
der-0.132
 that-0.089
 even-0.087
 valuable-0.087
 for-0.081
Token the
Token ID262
Feature activation-1.88e-02

Pos logit contributions
een+0.132
loe+0.128
Laura+0.126
Jamie+0.123
Mary+0.122
Neg logit contributions
der-0.122
 valuable-0.090
 even-0.085
 that-0.084
 important-0.081
Token door
Token ID3420
Feature activation+1.28e-02

Pos logit contributions
der+0.036
 valuable+0.026
 even+0.025
 he+0.025
 smoothly+0.025
Neg logit contributions
een-0.031
Laura-0.030
loe-0.030
Jamie-0.029
Mary-0.028
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
een+0.029
loe+0.029
Laura+0.028
Jamie+0.028
Mary+0.028
Neg logit contributions
der-0.016
 valuable-0.011
 even-0.010
 that-0.010
 important-0.009
Token in
Token ID287
Feature activation+4.51e-02

Pos logit contributions
een+0.017
loe+0.017
Laura+0.016
Jamie+0.016
Mary+0.016
Neg logit contributions
der-0.007
 that-0.004
 even-0.004
 for-0.004
 valuable-0.004
Token the
Token ID262
Feature activation-3.91e-02

Pos logit contributions
een+0.115
loe+0.114
Laura+0.111
ighed+0.109
Mary+0.109
Neg logit contributions
der-0.043
 even-0.036
 that-0.035
 for-0.034
 valuable-0.031
Token field
Token ID2214
Feature activation+1.49e-02

Pos logit contributions
der+0.082
attered+0.056
 even+0.056
taker+0.055
 that+0.055
Neg logit contributions
een-0.064
Laura-0.063
Eva-0.059
loe-0.059
Mary-0.059
Token.
Token ID13
Feature activation-7.19e-02

Pos logit contributions
een+0.040
loe+0.039
Laura+0.038
Jamie+0.038
Mary+0.038
Neg logit contributions
der-0.014
 that-0.008
 even-0.008
 valuable-0.008
 for-0.007
Token<|endoftext|>
Token ID50256
Feature activation+1.49e-01

Pos logit contributions
der+0.145
ly+0.115
 worse+0.113
bike+0.113
 missile+0.112
Neg logit contributions
loe-0.105
een-0.103
Once-0.100
Laura-0.100
Mary-0.093
TokenOnce
Token ID7454
Feature activation+1.74e-01

Pos logit contributions
een+0.335
loe+0.330
Laura+0.321
prise+0.321
elve+0.316
Neg logit contributions
der-0.184
 that-0.154
 even-0.139
 he-0.137
 for-0.136
Token upon
Token ID2402
Feature activation+1.13e-01

Pos logit contributions
een+0.205
loe+0.196
Laura+0.188
Jamie+0.182
Mary+0.179
Neg logit contributions
der-0.426
 that-0.349
 valuable-0.348
 even-0.345
 for-0.338
Token a
Token ID257
Feature activation+6.26e-02

Pos logit contributions
een+0.237
loe+0.230
Laura+0.228
Jamie+0.224
Mary+0.222
Neg logit contributions
der-0.170
 valuable-0.118
 that-0.113
 even-0.112
 for-0.107
Token time
Token ID640
Feature activation+8.79e-02

Pos logit contributions
een+0.105
loe+0.102
icians+0.099
Jamie+0.098
igh+0.096
Neg logit contributions
der-0.113
 that-0.098
 for-0.097
 even-0.096
 important-0.096
Token,
Token ID11
Feature activation+5.35e-02

Pos logit contributions
Laura+0.246
een+0.239
Jamie+0.238
Josh+0.231
Mary+0.230
Neg logit contributions
der-0.065
 that-0.064
 for-0.059
 he-0.057
 they-0.052
Token there
Token ID612
Feature activation+3.80e-02

Pos logit contributions
een+0.093
loe+0.090
Laura+0.087
Jamie+0.085
Mary+0.085
Neg logit contributions
der-0.101
 that-0.083
 even-0.080
 for-0.079
 he-0.079
Token many
Token ID867
Feature activation+8.45e-03

Pos logit contributions
der+0.006
 valuable+0.005
 that+0.004
 missile+0.004
 even+0.004
Neg logit contributions
een-0.005
Laura-0.005
loe-0.005
Jamie-0.005
Mary-0.005
Token new
Token ID649
Feature activation-4.17e-02

Pos logit contributions
een+0.017
Laura+0.017
Jamie+0.017
Mary+0.016
loe+0.016
Neg logit contributions
der-0.013
attered-0.009
ometer-0.008
 that-0.008
 smoothly-0.008
Token friends
Token ID2460
Feature activation+1.48e-02

Pos logit contributions
der+0.079
 valuable+0.057
 that+0.057
 smoothly+0.056
attered+0.054
Neg logit contributions
een-0.072
Laura-0.068
loe-0.067
Jamie-0.067
Mary-0.066
Token.
Token ID13
Feature activation-7.44e-02

Pos logit contributions
een+0.033
e+0.032
Laura+0.031
Jamie+0.031
loe+0.031
Neg logit contributions
der-0.020
attered-0.012
 valuable-0.012
 smoothly-0.011
 worse-0.011
Token<|endoftext|>
Token ID50256
Feature activation+1.42e-01

Pos logit contributions
der+0.140
ly+0.104
 valuable+0.104
 even+0.102
 worse+0.101
Neg logit contributions
loe-0.130
een-0.127
Laura-0.124
Mary-0.120
Jamie-0.118
TokenOnce
Token ID7454
Feature activation+1.74e-01

Pos logit contributions
een+0.337
loe+0.330
prise+0.330
Laura+0.327
elve+0.325
Neg logit contributions
der-0.143
 that-0.129
 he-0.117
.-0.108
 around-0.107
Token upon
Token ID2402
Feature activation+1.13e-01

Pos logit contributions
een+0.176
loe+0.168
Laura+0.160
Jamie+0.153
Mary+0.150
Neg logit contributions
der-0.455
 that-0.378
 valuable-0.377
 even-0.374
 for-0.366
Token a
Token ID257
Feature activation+5.94e-02

Pos logit contributions
een+0.235
loe+0.228
Laura+0.228
Jamie+0.223
Mary+0.221
Neg logit contributions
der-0.174
 valuable-0.120
 that-0.116
 even-0.114
 for-0.110
Token time
Token ID640
Feature activation+8.77e-02

Pos logit contributions
een+0.105
loe+0.102
icians+0.101
used+0.097
igh+0.097
Neg logit contributions
der-0.099
 even-0.089
 for-0.087
 important-0.086
 that-0.086
Token,
Token ID11
Feature activation+5.39e-02

Pos logit contributions
Laura+0.242
een+0.236
Jamie+0.232
Josh+0.230
icians+0.229
Neg logit contributions
 that-0.066
der-0.063
 he-0.061
 for-0.059
 they-0.053
Token there
Token ID612
Feature activation+3.59e-02

Pos logit contributions
een+0.095
loe+0.093
Laura+0.091
Jamie+0.088
Mary+0.088
Neg logit contributions
der-0.099
 that-0.081
 even-0.078
 for-0.077
 valuable-0.077
Token fun
Token ID1257
Feature activation-9.56e-03

Pos logit contributions
een+0.102
loe+0.100
Laura+0.098
Jamie+0.096
Mary+0.096
Neg logit contributions
der-0.055
 that-0.033
 valuable-0.033
 even-0.032
 he-0.031
Token playing
Token ID2712
Feature activation-1.63e-02

Pos logit contributions
der+0.020
 valuable+0.016
ly+0.015
 smoothly+0.015
ometer+0.015
Neg logit contributions
een-0.014
e-0.013
loe-0.013
Jamie-0.012
prise-0.012
Token together
Token ID1978
Feature activation+2.62e-03

Pos logit contributions
der+0.028
 valuable+0.020
ly+0.019
attered+0.019
ometer+0.018
Neg logit contributions
e-0.030
een-0.030
loe-0.028
Laura-0.028
Rachel-0.028
Token.
Token ID13
Feature activation-6.74e-02

Pos logit contributions
een+0.007
loe+0.006
Laura+0.006
Jamie+0.006
Mary+0.006
Neg logit contributions
der-0.003
 that-0.002
 even-0.002
 valuable-0.002
 for-0.002
Token<|endoftext|>
Token ID50256
Feature activation+1.48e-01

Pos logit contributions
der+0.132
 helicopter+0.103
 missile+0.103
ly+0.102
 worse+0.100
Neg logit contributions
loe-0.104
een-0.104
Laura-0.096
Once-0.095
Mary-0.093
TokenOnce
Token ID7454
Feature activation+1.74e-01

Pos logit contributions
een+0.349
loe+0.343
prise+0.338
elve+0.336
Laura+0.334
Neg logit contributions
der-0.163
 that-0.145
 he-0.128
 even-0.124
 for-0.123
Token upon
Token ID2402
Feature activation+1.13e-01

Pos logit contributions
een+0.213
loe+0.201
Laura+0.194
Jamie+0.188
Mary+0.186
Neg logit contributions
der-0.417
 that-0.350
 valuable-0.342
 even-0.341
 for-0.340
Token a
Token ID257
Feature activation+6.19e-02

Pos logit contributions
een+0.241
loe+0.235
Laura+0.232
Jamie+0.227
Mary+0.226
Neg logit contributions
der-0.166
 valuable-0.114
 that-0.111
 even-0.110
 for-0.104
Token time
Token ID640
Feature activation+8.73e-02

Pos logit contributions
een+0.107
loe+0.104
icians+0.104
Jamie+0.101
igh+0.099
Neg logit contributions
der-0.108
 that-0.095
 for-0.094
 even-0.093
 important-0.092
Token,
Token ID11
Feature activation+5.46e-02

Pos logit contributions
Laura+0.248
een+0.241
Jamie+0.240
icians+0.234
Josh+0.233
Neg logit contributions
 that-0.061
der-0.059
 for-0.057
 he-0.055
 they-0.047
Token there
Token ID612
Feature activation+3.78e-02

Pos logit contributions
een+0.100
loe+0.098
Laura+0.095
Jamie+0.093
Mary+0.093
Neg logit contributions
der-0.095
 that-0.082
 for-0.079
 even-0.078
 he-0.077
Token the
Token ID262
Feature activation-5.05e-02

Pos logit contributions
der+0.059
 that+0.045
 even+0.044
 valuable+0.043
 for+0.042
Neg logit contributions
een-0.104
loe-0.103
Laura-0.100
Mary-0.099
Jamie-0.099
Token dish
Token ID9433
Feature activation-1.31e-02

Pos logit contributions
der+0.094
 that+0.068
 even+0.067
 valuable+0.067
 smoothly+0.063
Neg logit contributions
een-0.090
loe-0.087
Laura-0.084
Jamie-0.083
Mary-0.079
Tokenwasher
Token ID45146
Feature activation-4.64e-02

Pos logit contributions
der+0.020
 valuable+0.013
 helicopter+0.013
 arrow+0.013
 missile+0.013
Neg logit contributions
een-0.026
loe-0.025
Jamie-0.025
Laura-0.024
prise-0.024
Token.
Token ID13
Feature activation-7.41e-02

Pos logit contributions
der+0.057
 valuable+0.034
 missile+0.033
 ot+0.031
 helicopter+0.031
Neg logit contributions
een-0.105
loe-0.103
e-0.099
Laura-0.099
Jamie-0.098
Token<|endoftext|>
Token ID50256
Feature activation+1.45e-01

Pos logit contributions
der+0.143
 valuable+0.105
ly+0.105
 even+0.103
 missile+0.102
Neg logit contributions
loe-0.125
een-0.124
Laura-0.120
Mary-0.116
Jamie-0.114
TokenOnce
Token ID7454
Feature activation+1.74e-01

Pos logit contributions
een+0.344
loe+0.338
prise+0.336
elve+0.333
Laura+0.329
Neg logit contributions
der-0.155
 that-0.137
 he-0.123
 even-0.117
 for-0.117
Token upon
Token ID2402
Feature activation+1.11e-01

Pos logit contributions
een+0.192
loe+0.184
Laura+0.176
Jamie+0.169
Mary+0.167
Neg logit contributions
der-0.436
 that-0.359
 valuable-0.358
 even-0.355
 for-0.347
Token a
Token ID257
Feature activation+6.08e-02

Pos logit contributions
een+0.244
loe+0.239
Laura+0.234
Jamie+0.230
Mary+0.228
Neg logit contributions
der-0.157
 valuable-0.106
 that-0.106
 even-0.104
 for-0.098
Token time
Token ID640
Feature activation+8.50e-02

Pos logit contributions
een+0.105
loe+0.103
icians+0.102
igh+0.098
Jamie+0.097
Neg logit contributions
der-0.107
 that-0.093
 for-0.092
 even-0.091
 important-0.089
Token,
Token ID11
Feature activation+5.39e-02

Pos logit contributions
Laura+0.240
een+0.234
Jamie+0.232
icians+0.229
Josh+0.226
Neg logit contributions
 that-0.061
der-0.059
 for-0.055
 he-0.054
 they-0.048
Token there
Token ID612
Feature activation+3.66e-02

Pos logit contributions
een+0.098
loe+0.096
Laura+0.093
Jamie+0.091
Mary+0.090
Neg logit contributions
der-0.095
 that-0.082
 for-0.078
 even-0.078
 he-0.077
Token.
Token ID13
Feature activation-6.83e-02

Pos logit contributions
e+0.043
Once+0.036
een+0.035
loe+0.035
Mary+0.035
Neg logit contributions
der-0.031
 ot-0.026
 trigger-0.024
ometer-0.024
attered-0.024
Token The
Token ID383
Feature activation-2.75e-02

Pos logit contributions
der+0.167
ly+0.135
rier+0.134
 worse+0.129
 helicopter+0.127
Neg logit contributions
loe-0.077
een-0.077
Once-0.074
Jamie-0.069
Mary-0.068
Token end
Token ID886
Feature activation-6.54e-03

Pos logit contributions
der+0.051
attered+0.036
ly+0.036
 smoothly+0.034
ometer+0.034
Neg logit contributions
Laura-0.047
loe-0.046
een-0.046
Once-0.045
Jamie-0.045
Token.
Token ID13
Feature activation-6.58e-02

Pos logit contributions
der+0.011
 ot+0.008
attered+0.008
 yacht+0.008
rier+0.008
Neg logit contributions
ma-0.011
Mary-0.011
een-0.011
e-0.010
Laura-0.010
Token<|endoftext|>
Token ID50256
Feature activation+1.46e-01

Pos logit contributions
der+0.153
ly+0.119
 valuable+0.111
 helicopter+0.111
 smoothly+0.109
Neg logit contributions
een-0.085
loe-0.085
Laura-0.082
Mary-0.081
Jamie-0.080
TokenOnce
Token ID7454
Feature activation+1.74e-01

Pos logit contributions
een+0.329
loe+0.322
prise+0.315
Laura+0.315
elve+0.310
Neg logit contributions
der-0.181
 that-0.154
 he-0.138
 even-0.135
 for-0.134
Token upon
Token ID2402
Feature activation+1.12e-01

Pos logit contributions
een+0.201
loe+0.190
Laura+0.183
Jamie+0.176
Mary+0.174
Neg logit contributions
der-0.428
 that-0.356
 valuable-0.352
 even-0.349
 for-0.345
Token a
Token ID257
Feature activation+6.36e-02

Pos logit contributions
een+0.232
loe+0.225
Laura+0.224
Jamie+0.219
Mary+0.217
Neg logit contributions
der-0.172
 valuable-0.120
 that-0.114
 even-0.113
 for-0.109
Token time
Token ID640
Feature activation+8.69e-02

Pos logit contributions
een+0.105
loe+0.102
icians+0.099
Jamie+0.098
Laura+0.095
Neg logit contributions
der-0.118
 that-0.102
 for-0.101
 even-0.100
 important-0.100
Token,
Token ID11
Feature activation+5.52e-02

Pos logit contributions
Laura+0.235
een+0.230
Jamie+0.227
Mary+0.223
Josh+0.221
Neg logit contributions
der-0.075
 that-0.067
 he-0.062
 for-0.062
 they-0.055
Token there
Token ID612
Feature activation+3.83e-02

Pos logit contributions
een+0.096
loe+0.093
Laura+0.091
Jamie+0.088
Mary+0.088
Neg logit contributions
der-0.103
 that-0.086
 even-0.083
 for-0.083
 he-0.082
Token.
Token ID13
Feature activation-7.07e-02

Pos logit contributions
der+0.058
 valuable+0.037
 even+0.033
 that+0.031
 important+0.031
Neg logit contributions
een-0.122
loe-0.120
e-0.116
Laura-0.116
Jamie-0.115
Token He
Token ID679
Feature activation-1.72e-02

Pos logit contributions
der+0.091
 that+0.063
 he+0.062
 even+0.060
 valuable+0.058
Neg logit contributions
een-0.165
loe-0.157
Laura-0.155
Jamie-0.154
prise-0.153
Token shouted
Token ID17293
Feature activation+8.61e-02

Pos logit contributions
der+0.030
 even+0.022
 that+0.022
 valuable+0.022
 he+0.021
Neg logit contributions
een-0.033
loe-0.032
Laura-0.031
Jamie-0.031
Mary-0.030
Token,
Token ID11
Feature activation+3.28e-02

Pos logit contributions
een+0.163
loe+0.157
e+0.154
Laura+0.152
Jamie+0.149
Neg logit contributions
der-0.134
 valuable-0.103
attered-0.097
 important-0.091
 helicopter-0.091
Token "
Token ID366
Feature activation-1.10e-02

Pos logit contributions
een+0.079
loe+0.078
Laura+0.076
Jamie+0.075
Mary+0.074
Neg logit contributions
der-0.040
 that-0.026
 even-0.025
 valuable-0.025
 for-0.024
TokenAr
Token ID3163
Feature activation+1.74e-01

Pos logit contributions
der+0.018
 even+0.014
 he+0.014
 that+0.013
 valuable+0.013
Neg logit contributions
loe-0.021
een-0.021
Laura-0.020
Jamie-0.019
prise-0.019
Tokengh
Token ID456
Feature activation+7.13e-02

Pos logit contributions
Laura+0.235
 blonde+0.215
Elizabeth+0.213
rones+0.206
Rachel+0.201
Neg logit contributions
 for-0.341
ly-0.333
der-0.326
 that-0.319
 even-0.294
Token mate
Token ID16133
Feature activation+6.50e-02

Pos logit contributions
Laura+0.158
een+0.156
loe+0.156
Jamie+0.151
Mary+0.150
Neg logit contributions
der-0.097
 that-0.075
 for-0.073
 even-0.069
 he-0.065
Tokenys
Token ID893
Feature activation+3.96e-02

Pos logit contributions
Laura+0.141
loe+0.134
Elizabeth+0.134
een+0.133
Mary+0.132
Neg logit contributions
der-0.097
 for-0.076
 that-0.075
 even-0.069
 he-0.064
Token!"
Token ID2474
Feature activation-2.62e-02

Pos logit contributions
Laura+0.086
een+0.084
loe+0.084
Jamie+0.083
Mary+0.082
Neg logit contributions
der-0.058
 that-0.044
 for-0.042
 he-0.039
 even-0.039
Token
Token ID198
Feature activation+1.01e-01

Pos logit contributions
der+0.037
 that+0.027
 he+0.026
 even+0.025
 for+0.024
Neg logit contributions
een-0.058
loe-0.058
Laura-0.056
Jamie-0.055
prise-0.055
Token happy
Token ID3772
Feature activation-1.66e-02

Pos logit contributions
der+0.073
ometer+0.053
ly+0.051
attered+0.051
 missile+0.049
Neg logit contributions
e-0.067
Laura-0.063
een-0.061
loe-0.061
Mary-0.060
Token and
Token ID290
Feature activation-4.60e-02

Pos logit contributions
der+0.023
attered+0.022
 valuable+0.022
ometer+0.022
controlled+0.021
Neg logit contributions
e-0.025
een-0.024
iley-0.024
Once-0.024
Laura-0.023
Token independent
Token ID4795
Feature activation-5.11e-02

Pos logit contributions
der+0.084
 helicopter+0.056
 missile+0.055
 ot+0.054
 automobile+0.053
Neg logit contributions
e-0.082
een-0.081
loe-0.077
Laura-0.076
Mary-0.075
Token.
Token ID13
Feature activation-7.36e-02

Pos logit contributions
der+0.061
 valuable+0.054
 missile+0.048
attered+0.048
 arrow+0.047
Neg logit contributions
een-0.098
loe-0.097
e-0.095
iley-0.091
Laura-0.088
Token<|endoftext|>
Token ID50256
Feature activation+1.40e-01

Pos logit contributions
der+0.160
ly+0.130
 helicopter+0.124
 worse+0.123
attered+0.122
Neg logit contributions
loe-0.103
Laura-0.102
een-0.101
Mary-0.097
Jamie-0.094
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.314
loe+0.308
Laura+0.303
prise+0.302
elve+0.296
Neg logit contributions
der-0.170
 that-0.145
 he-0.129
 even-0.127
 for-0.125
Token upon
Token ID2402
Feature activation+1.12e-01

Pos logit contributions
een+0.174
loe+0.167
Laura+0.158
Jamie+0.152
Mary+0.149
Neg logit contributions
der-0.453
 valuable-0.374
 that-0.373
 even-0.371
 for-0.360
Token a
Token ID257
Feature activation+5.93e-02

Pos logit contributions
een+0.221
Laura+0.216
loe+0.212
Jamie+0.211
e+0.210
Neg logit contributions
der-0.184
 valuable-0.129
 that-0.121
attered-0.119
 even-0.119
Token time
Token ID640
Feature activation+8.66e-02

Pos logit contributions
een+0.100
loe+0.097
icians+0.095
Jamie+0.092
igh+0.090
Neg logit contributions
der-0.109
 even-0.093
 that-0.093
 important-0.092
 for-0.092
Token,
Token ID11
Feature activation+5.42e-02

Pos logit contributions
Laura+0.235
een+0.229
Jamie+0.226
Josh+0.222
Mary+0.222
Neg logit contributions
der-0.070
 that-0.069
 for-0.062
 he-0.062
 they-0.054
Token there
Token ID612
Feature activation+3.50e-02

Pos logit contributions
een+0.097
loe+0.095
Laura+0.092
Jamie+0.090
Mary+0.090
Neg logit contributions
der-0.098
 that-0.081
 even-0.078
 for-0.077
 valuable-0.076
Token with
Token ID351
Feature activation+8.53e-03

Pos logit contributions
een+0.009
loe+0.009
e+0.008
Laura+0.008
Mary+0.008
Neg logit contributions
der-0.005
 valuable-0.003
ly-0.003
attered-0.003
 that-0.003
Token Mitt
Token ID16627
Feature activation+1.15e-03

Pos logit contributions
een+0.015
loe+0.015
e+0.015
Laura+0.014
Mary+0.014
Neg logit contributions
der-0.015
 valuable-0.011
ly-0.010
 even-0.010
attered-0.010
Tokenens
Token ID641
Feature activation+1.31e-02

Pos logit contributions
een+0.002
loe+0.002
Laura+0.002
Mary+0.002
Jamie+0.002
Neg logit contributions
der-0.002
 valuable-0.001
 even-0.001
 that-0.001
 helicopter-0.001
Token.
Token ID13
Feature activation-7.11e-02

Pos logit contributions
een+0.029
e+0.029
loe+0.029
Laura+0.027
prise+0.027
Neg logit contributions
der-0.016
 valuable-0.010
 missile-0.009
 helicopter-0.009
 smoothly-0.009
Token<|endoftext|>
Token ID50256
Feature activation+1.45e-01

Pos logit contributions
der+0.140
 missile+0.111
 helicopter+0.108
 valuable+0.107
ly+0.106
Neg logit contributions
een-0.107
loe-0.106
Laura-0.101
Mary-0.097
Once-0.096
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.331
loe+0.326
prise+0.318
Laura+0.316
elve+0.315
Neg logit contributions
der-0.173
 that-0.149
 he-0.132
 even-0.131
 for-0.128
Token upon
Token ID2402
Feature activation+1.11e-01

Pos logit contributions
een+0.209
loe+0.195
Laura+0.189
Jamie+0.183
Mary+0.180
Neg logit contributions
der-0.421
 that-0.356
 for-0.347
 valuable-0.347
 even-0.346
Token a
Token ID257
Feature activation+6.18e-02

Pos logit contributions
een+0.241
loe+0.235
Laura+0.231
Jamie+0.227
Mary+0.225
Neg logit contributions
der-0.160
 valuable-0.109
 that-0.108
 even-0.106
 for-0.100
Token time
Token ID640
Feature activation+8.62e-02

Pos logit contributions
een+0.106
loe+0.103
icians+0.101
Jamie+0.098
igh+0.097
Neg logit contributions
der-0.110
 that-0.096
 even-0.095
 for-0.095
 important-0.094
Token,
Token ID11
Feature activation+5.47e-02

Pos logit contributions
Laura+0.241
een+0.235
Jamie+0.235
Josh+0.234
icians+0.229
Neg logit contributions
 that-0.062
der-0.060
 for-0.057
 they-0.053
 he-0.051
Token there
Token ID612
Feature activation+3.60e-02

Pos logit contributions
een+0.103
loe+0.100
Laura+0.097
Josh+0.096
Jamie+0.096
Neg logit contributions
der-0.093
 that-0.082
 for-0.079
 even-0.078
 he-0.075
Token and
Token ID290
Feature activation-4.11e-02

Pos logit contributions
attered+0.109
 missile+0.109
 valuable+0.108
der+0.107
cream+0.105
Neg logit contributions
een-0.163
loe-0.161
iley-0.151
Laura-0.150
Jamie-0.148
Token cried
Token ID16896
Feature activation+2.63e-02

Pos logit contributions
der+0.076
 valuable+0.055
 missile+0.053
 smoothly+0.051
ometer+0.051
Neg logit contributions
een-0.066
Laura-0.066
loe-0.066
e-0.064
Jamie-0.064
Token too
Token ID1165
Feature activation-6.16e-02

Pos logit contributions
loe+0.048
een+0.046
e+0.045
Laura+0.043
 blonde+0.042
Neg logit contributions
der-0.033
 missile-0.029
 valuable-0.027
cream-0.027
attered-0.027
Token.
Token ID13
Feature activation-7.68e-02

Pos logit contributions
der+0.073
attered+0.045
 missile+0.042
 helicopter+0.042
 ot+0.041
Neg logit contributions
loe-0.141
Laura-0.139
een-0.138
Mary-0.136
e-0.135
Token<|endoftext|>
Token ID50256
Feature activation+1.45e-01

Pos logit contributions
der+0.125
 that+0.091
 valuable+0.090
 even+0.089
 he+0.086
Neg logit contributions
een-0.154
loe-0.149
Laura-0.146
Jamie-0.143
e-0.141
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
prise+0.342
een+0.339
Laura+0.337
elve+0.334
ighed+0.334
Neg logit contributions
der-0.143
 that-0.136
 he-0.128
.-0.117
 around-0.117
Token upon
Token ID2402
Feature activation+1.12e-01

Pos logit contributions
een+0.204
loe+0.188
Laura+0.185
e+0.178
Jamie+0.176
Neg logit contributions
der-0.423
 that-0.365
 for-0.356
 he-0.353
 even-0.351
Token a
Token ID257
Feature activation+5.89e-02

Pos logit contributions
een+0.239
loe+0.233
Laura+0.230
Jamie+0.226
Mary+0.224
Neg logit contributions
der-0.166
 valuable-0.114
 that-0.111
 even-0.109
 for-0.104
Token time
Token ID640
Feature activation+8.39e-02

Pos logit contributions
icians+0.110
een+0.109
loe+0.104
igh+0.103
used+0.101
Neg logit contributions
der-0.091
 important-0.082
 that-0.082
 even-0.081
 for-0.081
Token,
Token ID11
Feature activation+5.20e-02

Pos logit contributions
Laura+0.236
een+0.229
Jamie+0.226
icians+0.224
Josh+0.222
Neg logit contributions
 that-0.061
 he-0.059
der-0.056
 for-0.053
 they-0.050
Token there
Token ID612
Feature activation+3.45e-02

Pos logit contributions
een+0.090
loe+0.087
Laura+0.086
Jamie+0.083
Mary+0.083
Neg logit contributions
der-0.097
 that-0.081
 even-0.079
 for-0.077
 he-0.077
Token.
Token ID13
Feature activation-7.34e-02

Pos logit contributions
der+0.024
 valuable+0.014
 even+0.012
 smoothly+0.012
attered+0.012
Neg logit contributions
een-0.054
loe-0.054
Laura-0.052
e-0.052
Mary-0.052
Token The
Token ID383
Feature activation-2.45e-02

Pos logit contributions
der+0.125
 valuable+0.092
 that+0.091
 even+0.090
 for+0.086
Neg logit contributions
een-0.141
loe-0.138
Laura-0.135
Jamie-0.132
Mary-0.131
Token end
Token ID886
Feature activation-1.19e-02

Pos logit contributions
der+0.032
 valuable+0.022
 that+0.021
 even+0.021
 for+0.020
Neg logit contributions
een-0.057
loe-0.056
Laura-0.054
Jamie-0.054
Mary-0.053
Token.
Token ID13
Feature activation-6.62e-02

Pos logit contributions
der+0.016
 ot+0.011
attered+0.011
 yacht+0.011
 helicopter+0.010
Neg logit contributions
Laura-0.026
loe-0.026
een-0.026
Mary-0.025
Jamie-0.025
Token<|endoftext|>
Token ID50256
Feature activation+1.38e-01

Pos logit contributions
der+0.121
 valuable+0.091
 that+0.091
 even+0.090
 for+0.086
Neg logit contributions
een-0.119
loe-0.116
Laura-0.113
Jamie-0.110
Mary-0.109
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.334
prise+0.326
loe+0.323
elve+0.322
Laura+0.320
Neg logit contributions
der-0.136
 that-0.123
 he-0.111
.-0.106
 around-0.104
Token upon
Token ID2402
Feature activation+1.13e-01

Pos logit contributions
een+0.185
loe+0.173
Laura+0.166
Jamie+0.160
Mary+0.158
Neg logit contributions
der-0.444
 that-0.375
 valuable-0.369
 even-0.367
 for-0.364
Token a
Token ID257
Feature activation+5.93e-02

Pos logit contributions
een+0.239
loe+0.232
Laura+0.230
Jamie+0.226
Mary+0.224
Neg logit contributions
der-0.172
 valuable-0.119
 that-0.115
 even-0.113
 for-0.109
Token time
Token ID640
Feature activation+8.72e-02

Pos logit contributions
een+0.106
loe+0.105
icians+0.103
used+0.098
igh+0.098
Neg logit contributions
der-0.097
 even-0.087
 important-0.086
 that-0.085
 for-0.084
Token,
Token ID11
Feature activation+5.40e-02

Pos logit contributions
Laura+0.243
een+0.237
Jamie+0.233
icians+0.228
Mary+0.228
Neg logit contributions
 that-0.063
der-0.061
 he-0.059
 for-0.056
 they-0.048
Token there
Token ID612
Feature activation+3.56e-02

Pos logit contributions
een+0.097
loe+0.094
Laura+0.092
Jamie+0.089
Mary+0.089
Neg logit contributions
der-0.097
 that-0.080
 even-0.078
 for-0.077
 valuable-0.075
Token best
Token ID1266
Feature activation+1.70e-03

Pos logit contributions
der+0.073
 that+0.051
 valuable+0.051
 even+0.049
 for+0.048
Neg logit contributions
een-0.091
loe-0.089
Laura-0.087
Jamie-0.085
Mary-0.085
Token of
Token ID286
Feature activation+6.39e-03

Pos logit contributions
loe+0.003
een+0.002
e+0.002
iley+0.002
Mary+0.002
Neg logit contributions
der-0.003
 valuable-0.003
 smoothly-0.003
attered-0.002
cream-0.002
Token friends
Token ID2460
Feature activation+1.37e-02

Pos logit contributions
een+0.010
loe+0.010
Laura+0.009
e+0.009
Mary+0.009
Neg logit contributions
der-0.013
 valuable-0.010
 that-0.010
 even-0.010
ly-0.010
Token.
Token ID13
Feature activation-7.55e-02

Pos logit contributions
e+0.028
loe+0.024
een+0.024
Mary+0.022
igh+0.022
Neg logit contributions
der-0.021
 trigger-0.017
 valuable-0.016
attered-0.016
 arrow-0.015
Token<|endoftext|>
Token ID50256
Feature activation+1.47e-01

Pos logit contributions
der+0.156
ly+0.127
 worse+0.118
 helicopter+0.117
 valuable+0.116
Neg logit contributions
loe-0.113
een-0.110
Laura-0.104
Mary-0.102
Once-0.101
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.324
loe+0.320
Laura+0.311
prise+0.310
elve+0.305
Neg logit contributions
der-0.187
 that-0.157
 even-0.141
 he-0.139
 for-0.138
Token upon
Token ID2402
Feature activation+1.14e-01

Pos logit contributions
een+0.194
loe+0.186
Laura+0.178
Jamie+0.171
Mary+0.169
Neg logit contributions
der-0.432
 that-0.355
 valuable-0.354
 even-0.351
 for-0.343
Token a
Token ID257
Feature activation+6.31e-02

Pos logit contributions
een+0.237
loe+0.229
Laura+0.228
Jamie+0.223
Mary+0.222
Neg logit contributions
der-0.175
 valuable-0.121
 that-0.116
 even-0.114
 for-0.110
Token time
Token ID640
Feature activation+8.66e-02

Pos logit contributions
een+0.102
loe+0.100
icians+0.096
Jamie+0.094
igh+0.093
Neg logit contributions
der-0.118
 that-0.102
 for-0.101
 even-0.101
 valuable-0.100
Token,
Token ID11
Feature activation+5.29e-02

Pos logit contributions
Laura+0.237
een+0.231
Jamie+0.228
Mary+0.223
Josh+0.223
Neg logit contributions
der-0.066
 that-0.066
 for-0.060
 he-0.056
 they-0.053
Token there
Token ID612
Feature activation+3.76e-02

Pos logit contributions
een+0.093
loe+0.090
Laura+0.088
Jamie+0.085
Mary+0.085
Neg logit contributions
der-0.097
 that-0.081
 even-0.079
 for-0.078
 valuable-0.077
Token.
Token ID13
Feature activation-7.54e-02

Pos logit contributions
der+0.039
 that+0.019
 even+0.018
 valuable+0.018
 for+0.016
Neg logit contributions
een-0.124
loe-0.123
Laura-0.120
Jamie-0.119
Mary-0.118
Token The
Token ID383
Feature activation-2.95e-02

Pos logit contributions
der+0.174
ly+0.134
 helicopter+0.131
 missile+0.130
 worse+0.129
Neg logit contributions
loe-0.098
een-0.097
Laura-0.095
Once-0.089
e-0.089
Token end
Token ID886
Feature activation-1.17e-02

Pos logit contributions
der+0.056
attered+0.040
ometer+0.040
 smoothly+0.038
ly+0.038
Neg logit contributions
Once-0.047
een-0.047
loe-0.046
Laura-0.046
Eva-0.044
Token.
Token ID13
Feature activation-7.02e-02

Pos logit contributions
der+0.016
attered+0.010
 ot+0.009
 helicopter+0.009
bike+0.009
Neg logit contributions
een-0.026
loe-0.026
Laura-0.025
Mary-0.025
e-0.025
Token<|endoftext|>
Token ID50256
Feature activation+1.45e-01

Pos logit contributions
der+0.147
ly+0.110
 valuable+0.109
 even+0.107
attered+0.106
Neg logit contributions
een-0.106
loe-0.105
Laura-0.103
Mary-0.099
Jamie-0.098
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.347
prise+0.341
loe+0.340
elve+0.338
Laura+0.332
Neg logit contributions
der-0.149
 that-0.132
 he-0.116
 even-0.114
.-0.113
Token upon
Token ID2402
Feature activation+1.11e-01

Pos logit contributions
een+0.194
loe+0.182
Laura+0.176
Jamie+0.169
Mary+0.168
Neg logit contributions
der-0.433
 that-0.364
 valuable-0.358
 even-0.356
 for-0.353
Token a
Token ID257
Feature activation+6.23e-02

Pos logit contributions
een+0.247
loe+0.243
Laura+0.237
Jamie+0.233
Mary+0.231
Neg logit contributions
der-0.156
 that-0.106
 valuable-0.106
 even-0.104
 for-0.098
Token time
Token ID640
Feature activation+8.61e-02

Pos logit contributions
een+0.103
loe+0.101
icians+0.098
Jamie+0.095
igh+0.093
Neg logit contributions
der-0.115
 that-0.101
 for-0.099
 even-0.099
 valuable-0.097
Token,
Token ID11
Feature activation+5.36e-02

Pos logit contributions
Laura+0.234
een+0.230
Jamie+0.226
Mary+0.222
loe+0.220
Neg logit contributions
der-0.071
 that-0.066
 for-0.061
 he-0.061
 they-0.053
Token there
Token ID612
Feature activation+3.77e-02

Pos logit contributions
een+0.092
loe+0.090
Laura+0.088
Jamie+0.085
Mary+0.085
Neg logit contributions
der-0.101
 that-0.084
 even-0.081
 for-0.080
 he-0.080
Token.
Token ID13
Feature activation-7.42e-02

Pos logit contributions
een+0.031
loe+0.031
Laura+0.031
Jamie+0.030
Mary+0.030
Neg logit contributions
der-0.008
 that-0.004
 for-0.003
 even-0.003
 valuable-0.003
Token The
Token ID383
Feature activation-3.06e-02

Pos logit contributions
der+0.147
 valuable+0.110
ly+0.109
 helicopter+0.106
 even+0.106
Neg logit contributions
een-0.119
loe-0.119
Laura-0.113
Mary-0.113
Jamie-0.108
Token end
Token ID886
Feature activation-9.06e-03

Pos logit contributions
der+0.038
 valuable+0.025
 that+0.025
 even+0.024
 for+0.023
Neg logit contributions
een-0.073
loe-0.072
Laura-0.070
Jamie-0.069
Mary-0.069
Token.
Token ID13
Feature activation-7.12e-02

Pos logit contributions
der+0.012
attered+0.008
 even+0.007
bike+0.007
 smoothly+0.007
Neg logit contributions
een-0.020
loe-0.020
Laura-0.019
Mary-0.019
e-0.019
Token<|endoftext|>
Token ID50256
Feature activation+1.46e-01

Pos logit contributions
der+0.132
 valuable+0.099
 that+0.098
 even+0.098
 for+0.094
Neg logit contributions
een-0.125
loe-0.123
Laura-0.119
Jamie-0.116
Mary-0.116
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.346
prise+0.339
loe+0.338
Laura+0.334
elve+0.332
Neg logit contributions
der-0.157
 that-0.138
 he-0.121
 for-0.117
 even-0.117
Token upon
Token ID2402
Feature activation+1.13e-01

Pos logit contributions
een+0.210
loe+0.196
Laura+0.191
Jamie+0.185
Mary+0.182
Neg logit contributions
der-0.416
 that-0.355
 for-0.346
 valuable-0.344
 even-0.344
Token a
Token ID257
Feature activation+6.28e-02

Pos logit contributions
een+0.245
loe+0.239
Laura+0.235
Jamie+0.230
Mary+0.229
Neg logit contributions
der-0.163
 valuable-0.111
 that-0.110
 even-0.108
 for-0.102
Token time
Token ID640
Feature activation+8.75e-02

Pos logit contributions
een+0.109
loe+0.106
icians+0.105
Jamie+0.101
igh+0.099
Neg logit contributions
der-0.108
 that-0.096
 for-0.096
 even-0.095
 important-0.094
Token,
Token ID11
Feature activation+5.56e-02

Pos logit contributions
Laura+0.243
een+0.236
Jamie+0.235
Mary+0.229
Josh+0.229
Neg logit contributions
der-0.066
 that-0.064
 for-0.059
 he-0.055
 they-0.051
Token there
Token ID612
Feature activation+3.77e-02

Pos logit contributions
een+0.105
loe+0.101
Laura+0.100
Jamie+0.098
Josh+0.098
Neg logit contributions
der-0.093
 that-0.083
 for-0.079
 even-0.078
 he-0.076
Token all
Token ID477
Feature activation-6.36e-02

Pos logit contributions
e+0.022
een+0.021
loe+0.019
Mary+0.018
ighed+0.018
Neg logit contributions
der-0.018
 valuable-0.015
attered-0.014
ometer-0.013
ly-0.013
Token day
Token ID1110
Feature activation+2.02e-02

Pos logit contributions
der+0.114
 valuable+0.073
 missile+0.070
 helicopter+0.068
attered+0.067
Neg logit contributions
een-0.127
loe-0.123
e-0.123
Mary-0.120
Jamie-0.116
Token long
Token ID890
Feature activation-1.99e-02

Pos logit contributions
een+0.047
loe+0.046
e+0.046
Laura+0.046
Jamie+0.045
Neg logit contributions
der-0.024
 valuable-0.014
 helicopter-0.013
attered-0.013
 even-0.012
Token.
Token ID13
Feature activation-6.39e-02

Pos logit contributions
der+0.019
 valuable+0.012
 even+0.010
 that+0.009
 important+0.009
Neg logit contributions
een-0.051
loe-0.050
e-0.049
Laura-0.048
Mary-0.048
Token<|endoftext|>
Token ID50256
Feature activation+1.46e-01

Pos logit contributions
der+0.119
 missile+0.086
ly+0.085
 valuable+0.082
 helicopter+0.082
Neg logit contributions
loe-0.110
een-0.107
Mary-0.101
Laura-0.099
Once-0.099
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.356
loe+0.351
prise+0.349
elve+0.344
Laura+0.341
Neg logit contributions
der-0.143
 that-0.130
 he-0.120
 even-0.112
.-0.111
Token upon
Token ID2402
Feature activation+1.13e-01

Pos logit contributions
een+0.219
loe+0.203
Laura+0.200
Jamie+0.192
Mary+0.189
Neg logit contributions
der-0.407
 that-0.347
 for-0.338
 he-0.335
 valuable-0.334
Token a
Token ID257
Feature activation+6.30e-02

Pos logit contributions
een+0.249
loe+0.244
Laura+0.239
Jamie+0.235
Mary+0.233
Neg logit contributions
der-0.160
 valuable-0.109
 that-0.108
 even-0.107
 for-0.100
Token time
Token ID640
Feature activation+8.74e-02

Pos logit contributions
een+0.108
loe+0.104
icians+0.104
used+0.099
igh+0.099
Neg logit contributions
der-0.111
 that-0.096
 even-0.096
 for-0.095
 important-0.094
Token,
Token ID11
Feature activation+5.32e-02

Pos logit contributions
Laura+0.252
een+0.243
Jamie+0.242
icians+0.238
Josh+0.236
Neg logit contributions
 that-0.059
der-0.057
 for-0.054
 he-0.050
 they-0.043
Token there
Token ID612
Feature activation+3.91e-02

Pos logit contributions
een+0.097
loe+0.094
Laura+0.093
Jamie+0.091
Mary+0.090
Neg logit contributions
der-0.094
 that-0.079
 for-0.076
 even-0.076
 he-0.074
Token its
Token ID663
Feature activation-3.29e-02

Pos logit contributions
een+0.114
loe+0.111
ighed+0.109
Laura+0.109
ael+0.109
Neg logit contributions
der-0.038
 that-0.037
 even-0.035
 for-0.033
 he-0.029
Token soft
Token ID2705
Feature activation-7.58e-03

Pos logit contributions
der+0.050
 valuable+0.033
 that+0.033
 even+0.033
 he+0.030
Neg logit contributions
een-0.070
loe-0.069
Laura-0.068
Jamie-0.066
Mary-0.066
Token embrace
Token ID12553
Feature activation-2.25e-02

Pos logit contributions
der+0.012
 that+0.009
 valuable+0.008
 even+0.008
 for+0.008
Neg logit contributions
een-0.015
loe-0.015
Laura-0.015
Jamie-0.015
Mary-0.014
Token.
Token ID13
Feature activation-7.19e-02

Pos logit contributions
der+0.027
 valuable+0.016
 missile+0.014
attered+0.014
 helicopter+0.014
Neg logit contributions
een-0.053
loe-0.052
e-0.051
Laura-0.051
Mary-0.050
Token<|endoftext|>
Token ID50256
Feature activation+1.41e-01

Pos logit contributions
der+0.158
ly+0.127
 missile+0.121
 helicopter+0.120
 worse+0.119
Neg logit contributions
loe-0.099
een-0.098
Laura-0.097
Mary-0.095
Jamie-0.090
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.327
loe+0.319
prise+0.317
Laura+0.315
elve+0.312
Neg logit contributions
der-0.155
 that-0.138
 he-0.121
 even-0.115
 for-0.115
Token upon
Token ID2402
Feature activation+1.12e-01

Pos logit contributions
een+0.174
loe+0.167
Laura+0.159
Jamie+0.152
Mary+0.150
Neg logit contributions
der-0.450
 valuable-0.372
 that-0.371
 even-0.369
 for-0.359
Token a
Token ID257
Feature activation+5.86e-02

Pos logit contributions
een+0.227
Laura+0.221
loe+0.219
Jamie+0.216
Mary+0.214
Neg logit contributions
der-0.175
 valuable-0.122
 that-0.116
 even-0.114
 for-0.112
Token time
Token ID640
Feature activation+8.53e-02

Pos logit contributions
een+0.100
loe+0.098
icians+0.097
igh+0.092
used+0.092
Neg logit contributions
der-0.103
 even-0.089
 for-0.088
 that-0.088
 important-0.088
Token,
Token ID11
Feature activation+5.31e-02

Pos logit contributions
Laura+0.232
een+0.228
Jamie+0.222
icians+0.219
loe+0.219
Neg logit contributions
 that-0.068
der-0.065
 he-0.063
 for-0.059
 they-0.053
Token there
Token ID612
Feature activation+3.39e-02

Pos logit contributions
een+0.092
loe+0.090
Laura+0.087
Jamie+0.085
Mary+0.085
Neg logit contributions
der-0.099
 that-0.082
 even-0.080
 for-0.079
 he-0.078
Token.
Token ID13
Feature activation-7.01e-02

Pos logit contributions
der+0.010
 valuable+0.006
 that+0.005
 even+0.005
 he+0.005
Neg logit contributions
een-0.024
loe-0.024
Laura-0.023
e-0.023
Jamie-0.023
Token The
Token ID383
Feature activation-2.81e-02

Pos logit contributions
der+0.145
 missile+0.110
 helicopter+0.109
ly+0.108
 valuable+0.108
Neg logit contributions
loe-0.107
een-0.106
Laura-0.103
Mary-0.099
Jamie-0.097
Token end
Token ID886
Feature activation-5.83e-03

Pos logit contributions
der+0.048
attered+0.035
 smoothly+0.033
 valuable+0.032
ly+0.032
Neg logit contributions
loe-0.052
Laura-0.052
een-0.050
Jamie-0.049
prise-0.049
Token.
Token ID13
Feature activation-6.71e-02

Pos logit contributions
der+0.009
attered+0.007
 yacht+0.007
 ot+0.007
bike+0.006
Neg logit contributions
Laura-0.011
loe-0.011
een-0.011
Mary-0.011
Ali-0.011
Token<|endoftext|>
Token ID50256
Feature activation+1.44e-01

Pos logit contributions
der+0.141
ly+0.105
 valuable+0.105
 even+0.102
 helicopter+0.102
Neg logit contributions
een-0.100
loe-0.099
Laura-0.098
Mary-0.095
Jamie-0.093
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.339
prise+0.330
loe+0.329
elve+0.328
Laura+0.324
Neg logit contributions
der-0.159
 that-0.138
 he-0.121
 even-0.118
 for-0.117
Token upon
Token ID2402
Feature activation+1.11e-01

Pos logit contributions
een+0.199
loe+0.188
Laura+0.181
Jamie+0.175
Mary+0.172
Neg logit contributions
der-0.427
 that-0.356
 valuable-0.351
 even-0.349
 for-0.345
Token a
Token ID257
Feature activation+6.21e-02

Pos logit contributions
een+0.242
loe+0.237
Laura+0.233
Jamie+0.229
Mary+0.227
Neg logit contributions
der-0.159
 valuable-0.109
 that-0.107
 even-0.106
 for-0.100
Token time
Token ID640
Feature activation+8.54e-02

Pos logit contributions
een+0.106
icians+0.103
loe+0.103
Jamie+0.099
igh+0.097
Neg logit contributions
der-0.111
 that-0.097
 for-0.096
 even-0.095
 important-0.093
Token,
Token ID11
Feature activation+5.47e-02

Pos logit contributions
Laura+0.237
een+0.232
Jamie+0.230
Josh+0.225
icians+0.224
Neg logit contributions
 that-0.063
der-0.062
 for-0.058
 he-0.056
 they-0.050
Token there
Token ID612
Feature activation+3.68e-02

Pos logit contributions
een+0.100
loe+0.097
Laura+0.095
Jamie+0.094
Josh+0.093
Neg logit contributions
der-0.096
 that-0.084
 for-0.080
 even-0.080
 he-0.079
Token on
Token ID319
Feature activation+5.62e-02

Pos logit contributions
e+0.097
een+0.097
loe+0.094
Mary+0.089
Elizabeth+0.088
Neg logit contributions
der-0.083
 valuable-0.056
taker-0.051
 missile-0.050
ly-0.049
Token adventures
Token ID17545
Feature activation-1.24e-02

Pos logit contributions
een+0.110
loe+0.108
Laura+0.108
Mary+0.105
e+0.104
Neg logit contributions
der-0.092
ly-0.061
 valuable-0.061
 that-0.060
attered-0.059
Token together
Token ID1978
Feature activation+4.84e-03

Pos logit contributions
der+0.020
 valuable+0.014
attered+0.013
ly+0.012
 important+0.012
Neg logit contributions
een-0.024
e-0.024
loe-0.023
Laura-0.023
Mary-0.023
Token.
Token ID13
Feature activation-7.12e-02

Pos logit contributions
een+0.014
loe+0.013
Laura+0.013
Jamie+0.013
Mary+0.013
Neg logit contributions
der-0.004
 that-0.002
 even-0.002
 for-0.002
 valuable-0.002
Token<|endoftext|>
Token ID50256
Feature activation+1.45e-01

Pos logit contributions
der+0.163
ly+0.133
 worse+0.122
attered+0.121
 valuable+0.120
Neg logit contributions
loe-0.093
een-0.088
Mary-0.086
e-0.085
Once-0.082
TokenOnce
Token ID7454
Feature activation+1.73e-01

Pos logit contributions
een+0.334
loe+0.324
prise+0.324
Laura+0.320
elve+0.317
Neg logit contributions
der-0.165
 that-0.145
 he-0.128
 even-0.123
 for-0.122
Token upon
Token ID2402
Feature activation+1.13e-01

Pos logit contributions
een+0.189
loe+0.178
Laura+0.171
Jamie+0.165
Mary+0.162
Neg logit contributions
der-0.436
 that-0.366
 valuable-0.361
 even-0.359
 for-0.355
Token a
Token ID257
Feature activation+6.20e-02

Pos logit contributions
een+0.243
loe+0.237
Laura+0.233
Jamie+0.229
Mary+0.227
Neg logit contributions
der-0.165
 valuable-0.113
 that-0.110
 even-0.109
 for-0.103
Token time
Token ID640
Feature activation+8.74e-02

Pos logit contributions
een+0.104
loe+0.100
icians+0.096
Jamie+0.093
igh+0.093
Neg logit contributions
der-0.112
 even-0.099
 important-0.097
 that-0.097
 valuable-0.096
Token,
Token ID11
Feature activation+5.33e-02

Pos logit contributions
Laura+0.247
een+0.241
Jamie+0.239
icians+0.235
ited+0.235
Neg logit contributions
 that-0.062
 for-0.051
 he-0.051
der-0.050
 they-0.048
Token there
Token ID612
Feature activation+3.67e-02

Pos logit contributions
een+0.099
loe+0.096
Laura+0.094
Jamie+0.091
Josh+0.091
Neg logit contributions
der-0.091
 that-0.080
 even-0.076
 for-0.076
 he-0.074
Token the
Token ID262
Feature activation-5.17e-02

Pos logit contributions
der+0.118
 valuable+0.082
attered+0.078
ly+0.076
 smoothly+0.073
Neg logit contributions
een-0.159
loe-0.153
Jamie-0.151
Laura-0.149
Mary-0.147
Token machine
Token ID4572
Feature activation-5.00e-02

Pos logit contributions
der+0.092
attered+0.064
 even+0.063
 valuable+0.062
ly+0.062
Neg logit contributions
een-0.095
loe-0.090
Laura-0.089
Jamie-0.088
Josh-0.087
Token was
Token ID373
Feature activation+3.24e-02

Pos logit contributions
der+0.078
 valuable+0.056
 that+0.054
 even+0.053
 for+0.051
Neg logit contributions
een-0.102
loe-0.100
Laura-0.096
Jamie-0.095
Mary-0.094
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
een+0.069
loe+0.067
Laura+0.065
Jamie+0.065
Mary+0.064
Neg logit contributions
der-0.048
 valuable-0.032
 that-0.032
 even-0.031
ly-0.030
Token shred
Token ID21163
Feature activation-6.54e-02

Pos logit contributions
der+0.019
attered+0.012
ly+0.012
cream+0.012
 that+0.011
Neg logit contributions
een-0.024
Laura-0.023
prise-0.023
loe-0.023
Jamie-0.023
Tokender
Token ID1082
Feature activation-1.67e-01

Pos logit contributions
der+0.089
 even+0.082
 valuable+0.070
 lesson+0.069
 habit+0.068
Neg logit contributions
omed-0.091
een-0.091
Sean-0.090
loe-0.089
Josh-0.088
Token,
Token ID11
Feature activation-5.18e-03

Pos logit contributions
der+0.203
 helicopter+0.161
 valuable+0.152
 missile+0.145
attered+0.144
Neg logit contributions
een-0.346
Jamie-0.334
e-0.325
loe-0.321
Laura-0.316
Token that
Token ID326
Feature activation-8.53e-02

Pos logit contributions
der+0.008
 valuable+0.006
attered+0.006
 helicopter+0.005
ly+0.005
Neg logit contributions
een-0.010
loe-0.010
Laura-0.010
Jamie-0.010
e-0.010
Token could
Token ID714
Feature activation+5.35e-02

Pos logit contributions
der+0.133
 valuable+0.094
 that+0.088
 even+0.087
ly+0.085
Neg logit contributions
een-0.174
loe-0.170
Jamie-0.166
Laura-0.165
Mary-0.162
Token cut
Token ID2005
Feature activation-1.73e-02

Pos logit contributions
een+0.099
loe+0.097
Laura+0.093
Jamie+0.093
Mary+0.092
Neg logit contributions
der-0.094
 valuable-0.070
 that-0.065
 even-0.064
ly-0.063
Token things
Token ID1243
Feature activation-5.63e-02

Pos logit contributions
der+0.025
 valuable+0.018
 even+0.016
 that+0.016
 helicopter+0.015
Neg logit contributions
een-0.037
loe-0.036
e-0.035
Jamie-0.035
Laura-0.035
Token This
Token ID770
Feature activation-2.61e-02

Pos logit contributions
der+0.094
 that+0.063
 even+0.061
 valuable+0.059
 he+0.059
Neg logit contributions
een-0.169
loe-0.165
Laura-0.160
Jamie-0.159
prise-0.158
Token made
Token ID925
Feature activation-4.00e-03

Pos logit contributions
der+0.036
 that+0.028
 even+0.027
 valuable+0.026
 he+0.026
Neg logit contributions
een-0.059
loe-0.059
Laura-0.057
Jamie-0.057
prise-0.055
Token her
Token ID607
Feature activation+5.85e-03

Pos logit contributions
der+0.005
 that+0.005
 even+0.004
 for+0.004
 he+0.004
Neg logit contributions
een-0.009
ighed-0.009
loe-0.009
Laura-0.009
Jamie-0.009
Token even
Token ID772
Feature activation-6.24e-02

Pos logit contributions
een+0.013
loe+0.013
Laura+0.013
Jamie+0.012
Mary+0.012
Neg logit contributions
der-0.008
 even-0.006
 that-0.006
 valuable-0.006
 for-0.006
Token glad
Token ID9675
Feature activation-3.26e-02

Pos logit contributions
der+0.077
 that+0.044
 valuable+0.044
attered+0.040
 missile+0.039
Neg logit contributions
een-0.148
Laura-0.143
Jamie-0.139
Mary-0.139
loe-0.139
Tokender
Token ID1082
Feature activation-1.66e-01

Pos logit contributions
der+0.038
 valuable+0.025
 even+0.022
 that+0.021
 smoothly+0.021
Neg logit contributions
een-0.078
loe-0.076
Laura-0.075
Mary-0.074
Jamie-0.074
Token and
Token ID290
Feature activation-4.56e-02

Pos logit contributions
der+0.191
 valuable+0.125
 helicopter+0.121
 missile+0.119
 car+0.116
Neg logit contributions
een-0.380
loe-0.370
Jamie-0.364
Laura-0.363
Mary-0.362
Token she
Token ID673
Feature activation-5.14e-02

Pos logit contributions
der+0.072
 valuable+0.051
 that+0.051
 even+0.050
 for+0.048
Neg logit contributions
een-0.094
loe-0.092
Laura-0.090
Jamie-0.088
Mary-0.087
Token smiled
Token ID13541
Feature activation+6.71e-03

Pos logit contributions
der+0.082
 even+0.059
 that+0.059
 valuable+0.059
 for+0.055
Neg logit contributions
een-0.104
loe-0.103
Laura-0.100
Jamie-0.098
Mary-0.097
Token with
Token ID351
Feature activation+1.01e-02

Pos logit contributions
een+0.012
Laura+0.012
e+0.012
Jamie+0.012
Jul+0.012
Neg logit contributions
der-0.009
 valuable-0.007
 missile-0.007
attered-0.006
chair-0.006
Token joy
Token ID8716
Feature activation+4.92e-03

Pos logit contributions
een+0.017
loe+0.017
Laura+0.016
Jamie+0.016
Mary+0.016
Neg logit contributions
der-0.019
 that-0.015
 even-0.015
 valuable-0.015
 for-0.014
Token and
Token ID290
Feature activation-5.57e-02

Pos logit contributions
der+0.147
 valuable+0.100
 even+0.089
 smoothly+0.087
 helicopter+0.087
Neg logit contributions
een-0.297
loe-0.290
Laura-0.284
Jamie-0.284
Mary-0.283
Token Lily
Token ID20037
Feature activation+6.64e-03

Pos logit contributions
der+0.105
 valuable+0.078
 even+0.075
attered+0.072
 that+0.071
Neg logit contributions
een-0.094
loe-0.090
Laura-0.090
Mary-0.089
Jamie-0.088
Token felt
Token ID2936
Feature activation-3.12e-02

Pos logit contributions
een+0.014
loe+0.014
Laura+0.013
Jamie+0.013
Mary+0.013
Neg logit contributions
der-0.010
 that-0.007
 even-0.007
 valuable-0.007
 he-0.007
Token even
Token ID772
Feature activation-6.83e-02

Pos logit contributions
der+0.046
 automobile+0.033
ometer+0.033
 missile+0.032
attered+0.032
Neg logit contributions
e-0.060
loe-0.057
Laura-0.056
een-0.056
Jamie-0.055
Token sad
Token ID6507
Feature activation-9.50e-02

Pos logit contributions
der+0.103
 valuable+0.068
 that+0.066
 missile+0.066
 helicopter+0.064
Neg logit contributions
een-0.140
Laura-0.136
loe-0.133
Jamie-0.133
e-0.131
Tokender
Token ID1082
Feature activation-1.65e-01

Pos logit contributions
 missile+0.086
der+0.086
 valuable+0.085
 helicopter+0.080
attered+0.079
Neg logit contributions
loe-0.190
een-0.189
Laura-0.179
Jamie-0.177
iley-0.176
Token.
Token ID13
Feature activation-7.75e-02

Pos logit contributions
der+0.181
 valuable+0.118
 helicopter+0.114
 missile+0.113
 car+0.108
Neg logit contributions
een-0.386
loe-0.381
Laura-0.373
Jamie-0.373
Mary-0.369
Token From
Token ID3574
Feature activation+4.30e-02

Pos logit contributions
der+0.121
 that+0.087
 valuable+0.084
 even+0.084
 he+0.082
Neg logit contributions
een-0.162
loe-0.156
Laura-0.153
Jamie-0.150
Mary-0.149
Token that
Token ID326
Feature activation-6.65e-02

Pos logit contributions
een+0.096
Mary+0.094
Laura+0.094
loe+0.093
iley+0.092
Neg logit contributions
der-0.054
 valuable-0.035
attered-0.032
ometer-0.031
 smoothly-0.030
Token day
Token ID1110
Feature activation+2.58e-02

Pos logit contributions
der+0.089
 valuable+0.054
 that+0.053
 even+0.052
ly+0.048
Neg logit contributions
een-0.154
loe-0.153
Laura-0.149
Jamie-0.146
Mary-0.146
Token on
Token ID319
Feature activation+4.93e-02

Pos logit contributions
een+0.057
loe+0.057
Laura+0.055
Jamie+0.054
Mary+0.054
Neg logit contributions
der-0.036
 valuable-0.024
 that-0.024
 even-0.024
 for-0.022
Token car
Token ID1097
Feature activation-1.54e-02

Pos logit contributions
der+0.081
 that+0.054
 valuable+0.053
attered+0.052
 even+0.052
Neg logit contributions
een-0.105
loe-0.102
Laura-0.101
e-0.098
Jamie-0.098
Token made
Token ID925
Feature activation-1.03e-02

Pos logit contributions
der+0.018
 valuable+0.012
attered+0.009
 important+0.009
 even+0.009
Neg logit contributions
een-0.035
e-0.035
loe-0.034
Laura-0.033
Jamie-0.032
Token him
Token ID683
Feature activation-3.14e-03

Pos logit contributions
der+0.015
 valuable+0.010
 that+0.009
 even+0.009
 helicopter+0.009
Neg logit contributions
een-0.022
loe-0.022
Laura-0.021
Jamie-0.021
e-0.021
Token even
Token ID772
Feature activation-6.71e-02

Pos logit contributions
der+0.006
attered+0.004
 valuable+0.004
ly+0.004
 smoothly+0.003
Neg logit contributions
e-0.006
een-0.005
Laura-0.005
loe-0.005
ighed-0.005
Token sad
Token ID6507
Feature activation-9.95e-02

Pos logit contributions
der+0.082
attered+0.047
 valuable+0.046
 that+0.045
ometer+0.044
Neg logit contributions
een-0.155
Laura-0.150
Jamie-0.146
e-0.145
Mary-0.145
Tokender
Token ID1082
Feature activation-1.64e-01

Pos logit contributions
der+0.102
 valuable+0.085
attered+0.074
 smoothly+0.069
 missile+0.068
Neg logit contributions
een-0.217
loe-0.209
e-0.203
Laura-0.201
Jamie-0.200
Token.
Token ID13
Feature activation-7.57e-02

Pos logit contributions
der+0.176
 valuable+0.114
 helicopter+0.109
 missile+0.107
 automobile+0.100
Neg logit contributions
een-0.384
loe-0.376
Jamie-0.372
Laura-0.370
e-0.368
Token He
Token ID679
Feature activation-1.88e-02

Pos logit contributions
der+0.116
 that+0.085
 he+0.085
 even+0.081
 valuable+0.081
Neg logit contributions
een-0.160
loe-0.151
prise-0.150
Laura-0.150
Jamie-0.147
Token asked
Token ID1965
Feature activation+6.91e-02

Pos logit contributions
der+0.032
 valuable+0.023
 that+0.022
 for+0.022
 missile+0.021
Neg logit contributions
een-0.035
loe-0.034
Laura-0.034
Jamie-0.033
e-0.033
Token everyone
Token ID2506
Feature activation-1.38e-02

Pos logit contributions
een+0.163
loe+0.160
Laura+0.158
Jamie+0.155
Mary+0.154
Neg logit contributions
der-0.089
 that-0.062
 even-0.061
 for-0.059
 valuable-0.057
Token if
Token ID611
Feature activation-2.07e-02

Pos logit contributions
der+0.020
 valuable+0.013
attered+0.013
 smoothly+0.012
 even+0.012
Neg logit contributions
een-0.029
loe-0.028
Laura-0.028
Jamie-0.027
Mary-0.027
Token found
Token ID1043
Feature activation+2.57e-02

Pos logit contributions
der+0.033
 even+0.029
 that+0.026
 he+0.026
 for+0.025
Neg logit contributions
loe-0.046
een-0.045
Mary-0.043
Laura-0.042
Elizabeth-0.041
Token something
Token ID1223
Feature activation-1.02e-01

Pos logit contributions
een+0.057
loe+0.056
Laura+0.054
Jamie+0.053
Mary+0.053
Neg logit contributions
der-0.037
 valuable-0.025
 that-0.025
 even-0.025
 for-0.023
Token even
Token ID772
Feature activation-7.59e-02

Pos logit contributions
der+0.148
 smoothly+0.092
attered+0.090
bike+0.089
 helicopter+0.087
Neg logit contributions
een-0.209
Laura-0.204
Jamie-0.201
Mary-0.200
prise-0.199
Token we
Token ID356
Feature activation-4.10e-02

Pos logit contributions
der+0.097
 valuable+0.058
 that+0.056
 missile+0.055
 helicopter+0.055
Neg logit contributions
een-0.174
Laura-0.167
loe-0.164
Jamie-0.163
Mary-0.161
Tokenir
Token ID343
Feature activation+1.16e-02

Pos logit contributions
der+0.065
 that+0.047
 even+0.047
 valuable+0.046
 for+0.044
Neg logit contributions
een-0.084
loe-0.083
Laura-0.081
Mary-0.079
Jamie-0.079
Tokender
Token ID1082
Feature activation-1.63e-01

Pos logit contributions
een+0.019
loe+0.018
Laura+0.017
e+0.017
Jamie+0.017
Neg logit contributions
der-0.022
 valuable-0.018
 even-0.018
 that-0.017
 helicopter-0.017
Token!
Token ID0
Feature activation-8.21e-02

Pos logit contributions
der+0.191
 valuable+0.120
 even+0.114
 that+0.113
 helicopter+0.106
Neg logit contributions
een-0.394
loe-0.383
Laura-0.378
Jamie-0.375
Mary-0.370
Token There
Token ID1318
Feature activation+1.69e-02

Pos logit contributions
der+0.087
 He+0.066
 that+0.065
 he+0.064
 They+0.062
Neg logit contributions
een-0.199
ited-0.193
ighed-0.189
elve-0.188
ael-0.187
Token were
Token ID547
Feature activation+8.27e-02

Pos logit contributions
een+0.037
loe+0.036
Laura+0.036
Jamie+0.035
Mary+0.035
Neg logit contributions
der-0.023
 that-0.018
 even-0.018
 he-0.017
 valuable-0.017
Token little
Token ID1310
Feature activation+1.38e-02

Pos logit contributions
een+0.176
loe+0.172
Laura+0.167
Jamie+0.163
Mary+0.162
Neg logit contributions
der-0.126
 that-0.092
 even-0.091
 valuable-0.090
 for-0.085
Token plastic
Token ID7309
Feature activation-2.14e-02

Pos logit contributions
een+0.025
loe+0.025
Laura+0.024
Jamie+0.024
Mary+0.023
Neg logit contributions
der-0.024
 valuable-0.018
 that-0.017
 even-0.017
 important-0.016
Token nothing
Token ID2147
Feature activation-1.07e-01

Pos logit contributions
der+0.061
 valuable+0.040
 that+0.037
 even+0.036
attered+0.035
Neg logit contributions
een-0.094
loe-0.091
Laura-0.091
Jamie-0.090
Mary-0.090
Token made
Token ID925
Feature activation-1.97e-02

Pos logit contributions
der+0.192
 automobile+0.138
 missile+0.135
 helicopter+0.135
attered+0.135
Neg logit contributions
een-0.186
e-0.172
Mary-0.172
Laura-0.172
Jamie-0.171
Token it
Token ID340
Feature activation-9.26e-02

Pos logit contributions
der+0.026
 that+0.019
 even+0.018
 valuable+0.018
 for+0.018
Neg logit contributions
een-0.045
loe-0.044
Laura-0.044
Jamie-0.043
Mary-0.042
Token p
Token ID279
Feature activation+2.25e-03

Pos logit contributions
der+0.154
 valuable+0.103
 missile+0.101
attered+0.100
 automobile+0.099
Neg logit contributions
een-0.164
e-0.161
Laura-0.157
Jamie-0.157
Mary-0.156
Tokenrou
Token ID472
Feature activation-4.07e-02

Pos logit contributions
een+0.004
loe+0.004
Laura+0.004
Jamie+0.004
Mary+0.004
Neg logit contributions
der-0.004
 that-0.003
 valuable-0.003
 even-0.003
 for-0.003
Tokender
Token ID1082
Feature activation-1.63e-01

Pos logit contributions
 valuable+0.012
 automobile+0.007
der+0.007
 car+0.006
 boy+0.006
Neg logit contributions
loe-0.113
e-0.111
een-0.111
ink-0.105
Mary-0.104
Token.
Token ID13
Feature activation-7.62e-02

Pos logit contributions
der+0.192
 valuable+0.125
 missile+0.124
 helicopter+0.122
 car+0.117
Neg logit contributions
een-0.371
loe-0.359
Jamie-0.359
Laura-0.356
Mary-0.355
Token
Token ID220
Feature activation+7.45e-02

Pos logit contributions
der+0.125
 that+0.092
 valuable+0.090
 even+0.089
 he+0.087
Neg logit contributions
een-0.153
loe-0.147
Laura-0.144
Jamie-0.142
Mary-0.139
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
een+0.158
loe+0.149
Laura+0.149
prise+0.146
Jamie+0.145
Neg logit contributions
der-0.110
 that-0.083
 valuable-0.078
 he-0.077
 even-0.077
Token
Token ID198
Feature activation+1.01e-01

Pos logit contributions
een+0.246
prise+0.232
loe+0.231
ael+0.229
igh+0.228
Neg logit contributions
der-0.105
 that-0.087
 he-0.087
 even-0.085
 valuable-0.085
TokenThe
Token ID464
Feature activation-1.56e-02

Pos logit contributions
een+0.214
loe+0.205
ael+0.192
Laura+0.191
ighed+0.190
Neg logit contributions
der-0.147
 even-0.117
 valuable-0.117
 that-0.113
 he-0.108
Token and
Token ID290
Feature activation-5.40e-02

Pos logit contributions
der+0.116
 valuable+0.070
 that+0.066
 even+0.065
 for+0.059
Neg logit contributions
een-0.249
loe-0.244
Laura-0.239
Jamie-0.235
Mary-0.234
Token it
Token ID340
Feature activation-8.94e-02

Pos logit contributions
der+0.077
 valuable+0.053
 that+0.047
 even+0.046
 missile+0.046
Neg logit contributions
een-0.117
Laura-0.113
loe-0.112
Jamie-0.112
Mary-0.111
Token was
Token ID373
Feature activation+3.33e-02

Pos logit contributions
der+0.155
 valuable+0.111
 that+0.106
 missile+0.104
 for+0.104
Neg logit contributions
een-0.164
loe-0.157
Jamie-0.155
Laura-0.155
e-0.152
Token even
Token ID772
Feature activation-6.61e-02

Pos logit contributions
een+0.079
loe+0.078
Laura+0.076
Jamie+0.074
Mary+0.073
Neg logit contributions
der-0.043
 even-0.032
 that-0.031
 valuable-0.030
 for-0.029
Token red
Token ID2266
Feature activation-2.42e-02

Pos logit contributions
der+0.086
 missile+0.050
attered+0.048
ometer+0.048
 valuable+0.047
Neg logit contributions
een-0.145
Laura-0.143
Jamie-0.141
Mary-0.138
prise-0.137
Tokender
Token ID1082
Feature activation-1.62e-01

Pos logit contributions
der+0.026
 valuable+0.026
 missile+0.021
 important+0.020
 smoothly+0.020
Neg logit contributions
een-0.049
loe-0.046
e-0.044
aney-0.044
Mary-0.044
Token.
Token ID13
Feature activation-7.83e-02

Pos logit contributions
der+0.197
 missile+0.142
 helicopter+0.142
ometer+0.133
 automobile+0.132
Neg logit contributions
Jamie-0.341
een-0.340
Mary-0.334
Laura-0.332
loe-0.327
Token Suddenly
Token ID24975
Feature activation+2.75e-02

Pos logit contributions
der+0.122
 that+0.087
 even+0.084
 valuable+0.084
 he+0.081
Neg logit contributions
een-0.165
loe-0.157
Laura-0.153
e-0.153
prise-0.151
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
een+0.056
loe+0.055
Laura+0.054
Jamie+0.053
Mary+0.052
Neg logit contributions
der-0.043
 that-0.031
 valuable-0.031
 even-0.031
 for-0.029
Token she
Token ID673
Feature activation-3.09e-02

Pos logit contributions
een+0.047
loe+0.046
Laura+0.046
Jamie+0.045
Mary+0.045
Neg logit contributions
der-0.031
 valuable-0.021
 that-0.019
 even-0.019
 for-0.018
Token had
Token ID550
Feature activation+2.21e-02

Pos logit contributions
der+0.054
 even+0.042
 that+0.040
 he+0.038
 valuable+0.038
Neg logit contributions
loe-0.060
een-0.060
Laura-0.056
Mary-0.055
Jamie-0.054
Token look
Token ID804
Feature activation-4.64e-02

Pos logit contributions
der+0.126
 missile+0.088
 trigger+0.085
 ot+0.084
 arrow+0.084
Neg logit contributions
Laura-0.111
Jamie-0.110
een-0.109
loe-0.108
e-0.108
Token even
Token ID772
Feature activation-6.80e-02

Pos logit contributions
der+0.069
ometer+0.047
bike+0.046
 missile+0.042
cream+0.040
Neg logit contributions
een-0.088
Laura-0.085
e-0.084
Jamie-0.083
prise-0.083
Token smaller
Token ID4833
Feature activation-5.96e-02

Pos logit contributions
der+0.087
ometer+0.058
attered+0.053
 missile+0.052
bike+0.051
Neg logit contributions
een-0.145
Laura-0.140
prise-0.137
ited-0.136
Mary-0.136
Token and
Token ID290
Feature activation-5.81e-02

Pos logit contributions
der+0.072
 valuable+0.056
 missile+0.052
 helicopter+0.052
 motor+0.049
Neg logit contributions
een-0.122
e-0.118
loe-0.116
Mary-0.115
Jamie-0.115
Token sad
Token ID6507
Feature activation-9.32e-02

Pos logit contributions
der+0.082
attered+0.054
ly+0.052
 valuable+0.052
ometer+0.052
Neg logit contributions
een-0.117
Laura-0.115
Mary-0.114
Jamie-0.113
prise-0.109
Tokender
Token ID1082
Feature activation-1.61e-01

Pos logit contributions
der+0.094
 valuable+0.089
 missile+0.082
attered+0.081
bike+0.077
Neg logit contributions
een-0.182
loe-0.180
Laura-0.171
Mary-0.170
Jamie-0.170
Token.
Token ID13
Feature activation-7.54e-02

Pos logit contributions
der+0.168
 valuable+0.104
 missile+0.103
 helicopter+0.100
 car+0.094
Neg logit contributions
een-0.384
loe-0.378
Jamie-0.374
Laura-0.373
Mary-0.369
Token<|endoftext|>
Token ID50256
Feature activation+1.27e-01

Pos logit contributions
der+0.124
 that+0.093
 valuable+0.091
 even+0.090
 he+0.088
Neg logit contributions
een-0.151
loe-0.140
Laura-0.138
Jamie-0.138
e-0.137
TokenOnce
Token ID7454
Feature activation+1.61e-01

Pos logit contributions
een+0.295
prise+0.290
elve+0.286
Laura+0.284
ited+0.283
Neg logit contributions
der-0.133
 that-0.116
 he-0.112
.-0.105
 around-0.104
Token upon
Token ID2402
Feature activation+1.04e-01

Pos logit contributions
een+0.177
loe+0.164
Laura+0.158
Jamie+0.151
Mary+0.150
Neg logit contributions
der-0.406
 that-0.345
 valuable-0.338
 even-0.337
 for-0.335
Token a
Token ID257
Feature activation+5.29e-02

Pos logit contributions
een+0.225
loe+0.220
Laura+0.217
Jamie+0.213
Mary+0.211
Neg logit contributions
der-0.152
 valuable-0.104
 that-0.102
 even-0.101
 for-0.096
Token.
Token ID13
Feature activation-7.85e-02

Pos logit contributions
der+0.035
 valuable+0.022
 even+0.020
 that+0.020
 important+0.019
Neg logit contributions
een-0.063
loe-0.062
Laura-0.060
e-0.059
Jamie-0.059
Token Tim
Token ID5045
Feature activation+5.87e-02

Pos logit contributions
der+0.105
 that+0.071
 valuable+0.069
 even+0.069
 he+0.068
Neg logit contributions
een-0.180
loe-0.173
Laura-0.172
Jamie-0.169
Mary-0.166
Token felt
Token ID2936
Feature activation-2.43e-02

Pos logit contributions
een+0.129
loe+0.127
Laura+0.125
Jamie+0.122
Mary+0.122
Neg logit contributions
der-0.082
 even-0.060
 that-0.060
 valuable-0.056
 for-0.055
Token even
Token ID772
Feature activation-6.58e-02

Pos logit contributions
der+0.041
ometer+0.029
attered+0.028
 automobile+0.027
 missile+0.027
Neg logit contributions
e-0.046
een-0.044
Laura-0.043
loe-0.043
Jamie-0.042
Token sad
Token ID6507
Feature activation-9.37e-02

Pos logit contributions
der+0.082
attered+0.043
 valuable+0.043
 that+0.042
 missile+0.041
Neg logit contributions
een-0.156
Laura-0.152
e-0.149
Jamie-0.148
loe-0.146
Tokender
Token ID1082
Feature activation-1.61e-01

Pos logit contributions
 valuable+0.105
 missile+0.105
attered+0.105
 automobile+0.102
cream+0.099
Neg logit contributions
een-0.158
loe-0.156
iley-0.147
Once-0.142
Laura-0.141
Token now
Token ID783
Feature activation-4.01e-02

Pos logit contributions
der+0.176
 valuable+0.120
 helicopter+0.117
 missile+0.117
attered+0.113
Neg logit contributions
een-0.366
loe-0.359
Jamie-0.354
Laura-0.352
e-0.351
Token,
Token ID11
Feature activation+7.34e-03

Pos logit contributions
der+0.044
attered+0.039
arp+0.034
ometer+0.034
cream+0.031
Neg logit contributions
e-0.079
loe-0.077
een-0.077
Once-0.076
Laura-0.076
Token and
Token ID290
Feature activation-3.95e-02

Pos logit contributions
een+0.015
loe+0.015
Laura+0.015
Mary+0.015
Jamie+0.014
Neg logit contributions
der-0.010
 valuable-0.007
attered-0.007
ly-0.007
 missile-0.006
Token he
Token ID339
Feature activation-3.41e-02

Pos logit contributions
der+0.051
 valuable+0.032
attered+0.028
 helicopter+0.027
 smoothly+0.027
Neg logit contributions
een-0.087
Jamie-0.086
Laura-0.086
loe-0.085
Mary-0.084
Token cried
Token ID16896
Feature activation+1.55e-02

Pos logit contributions
der+0.055
 valuable+0.039
 that+0.038
 for+0.037
 even+0.036
Neg logit contributions
een-0.067
loe-0.065
Laura-0.064
Jamie-0.063
e-0.063
Token said
Token ID531
Feature activation+6.92e-02

Pos logit contributions
der+0.062
 even+0.046
 that+0.046
 valuable+0.045
 for+0.043
Neg logit contributions
een-0.070
loe-0.069
Laura-0.067
Jamie-0.065
Mary-0.065
Token no
Token ID645
Feature activation+8.31e-03

Pos logit contributions
loe+0.121
een+0.119
Laura+0.113
e+0.112
Mary+0.112
Neg logit contributions
der-0.115
 valuable-0.089
attered-0.086
 smoothly-0.082
 even-0.082
Token,
Token ID11
Feature activation+1.31e-02

Pos logit contributions
een+0.016
iley+0.015
loe+0.015
Mary+0.014
e+0.014
Neg logit contributions
der-0.011
 helicopter-0.009
 missile-0.009
taker-0.009
attered-0.009
Token the
Token ID262
Feature activation-4.30e-02

Pos logit contributions
een+0.027
loe+0.027
Laura+0.026
Mary+0.026
Jamie+0.026
Neg logit contributions
der-0.019
 valuable-0.014
attered-0.012
 that-0.012
 even-0.012
Token mad
Token ID8805
Feature activation-7.74e-02

Pos logit contributions
der+0.068
 valuable+0.047
 that+0.047
 even+0.046
 for+0.044
Neg logit contributions
een-0.085
loe-0.083
Laura-0.083
Jamie-0.080
Mary-0.079
Tokender
Token ID1082
Feature activation-1.60e-01

Pos logit contributions
der+0.094
 valuable+0.076
attered+0.068
 smoothly+0.066
 even+0.063
Neg logit contributions
een-0.158
loe-0.155
Laura-0.149
ael-0.149
Mary-0.149
Token Sara
Token ID24799
Feature activation+3.20e-02

Pos logit contributions
der+0.179
 valuable+0.119
 helicopter+0.115
attered+0.113
 missile+0.111
Neg logit contributions
een-0.364
loe-0.359
Mary-0.352
Jamie-0.351
Laura-0.347
Token got
Token ID1392
Feature activation+8.59e-03

Pos logit contributions
een+0.071
loe+0.070
Laura+0.068
Jamie+0.067
Mary+0.066
Neg logit contributions
der-0.045
 valuable-0.030
 that-0.030
 even-0.029
 for-0.027
Token.
Token ID13
Feature activation-8.20e-02

Pos logit contributions
een+0.018
loe+0.017
e+0.017
Laura+0.017
Mary+0.017
Neg logit contributions
der-0.013
 valuable-0.008
 that-0.008
 missile-0.008
 even-0.008
Token Then
Token ID3244
Feature activation+2.00e-02

Pos logit contributions
der+0.093
 that+0.053
 valuable+0.050
 even+0.049
 he+0.049
Neg logit contributions
een-0.209
loe-0.204
Laura-0.200
Jamie-0.197
e-0.196
Token,
Token ID11
Feature activation+2.02e-02

Pos logit contributions
een+0.049
loe+0.048
Laura+0.047
Jamie+0.046
Mary+0.046
Neg logit contributions
der-0.024
 valuable-0.014
 that-0.014
 even-0.014
 for-0.013
Token it
Token ID340
Feature activation-7.77e-02

Pos logit contributions
der+0.057
 valuable+0.039
 that+0.037
 even+0.036
 important+0.035
Neg logit contributions
een-0.084
loe-0.081
Laura-0.081
Jamie-0.080
Mary-0.080
Token made
Token ID925
Feature activation-1.07e-02

Pos logit contributions
der+0.123
 that+0.088
 valuable+0.088
 even+0.087
 for+0.082
Neg logit contributions
een-0.159
loe-0.156
Laura-0.151
Jamie-0.149
Mary-0.147
Token her
Token ID607
Feature activation-2.72e-03

Pos logit contributions
der+0.017
 that+0.013
 even+0.013
 for+0.012
 valuable+0.012
Neg logit contributions
een-0.022
loe-0.021
ighed-0.020
Laura-0.020
Jamie-0.020
Token even
Token ID772
Feature activation-6.73e-02

Pos logit contributions
der+0.005
 valuable+0.003
 that+0.003
 even+0.003
ly+0.003
Neg logit contributions
een-0.005
loe-0.005
Laura-0.005
Jamie-0.005
Mary-0.005
Token sad
Token ID6507
Feature activation-9.89e-02

Pos logit contributions
der+0.081
 that+0.047
 valuable+0.047
attered+0.042
 missile+0.041
Neg logit contributions
een-0.161
Laura-0.158
Jamie-0.154
loe-0.153
Mary-0.152
Tokender
Token ID1082
Feature activation-1.60e-01

Pos logit contributions
der+0.105
 valuable+0.079
attered+0.074
 smoothly+0.073
 missile+0.071
Neg logit contributions
een-0.218
loe-0.215
Laura-0.211
Jamie-0.207
Mary-0.206
Token.
Token ID13
Feature activation-7.87e-02

Pos logit contributions
der+0.167
 missile+0.123
 helicopter+0.119
 automobile+0.110
 valuable+0.110
Neg logit contributions
een-0.360
Jamie-0.356
loe-0.355
Laura-0.352
Mary-0.350
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
der+0.115
 that+0.083
 valuable+0.079
 he+0.078
 even+0.078
Neg logit contributions
een-0.173
prise-0.162
loe-0.161
e-0.161
Jamie-0.158
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
een+0.253
ael+0.238
ited+0.234
igh+0.233
elve+0.231
Neg logit contributions
der-0.103
 that-0.095
 valuable-0.081
 even-0.081
 he-0.080
TokenSuddenly
Token ID38582
Feature activation+2.36e-02

Pos logit contributions
een+0.157
loe+0.143
igh+0.135
ael+0.135
Jamie+0.129
Neg logit contributions
der-0.207
 that-0.176
 even-0.175
 valuable-0.174
 important-0.163
Token,
Token ID11
Feature activation+2.45e-02

Pos logit contributions
een+0.049
loe+0.047
Laura+0.046
Jamie+0.046
Mary+0.045
Neg logit contributions
der-0.037
 that-0.028
 even-0.027
 valuable-0.026
 for-0.026
TokenThen
Token ID6423
Feature activation+4.21e-02

Pos logit contributions
een+0.176
igh+0.162
loe+0.160
ael+0.158
elve+0.155
Neg logit contributions
der-0.151
 he-0.134
 even-0.134
 valuable-0.133
 that-0.129
Token something
Token ID1223
Feature activation-9.27e-02

Pos logit contributions
een+0.098
loe+0.096
Laura+0.094
Jamie+0.092
ighed+0.091
Neg logit contributions
der-0.051
 that-0.040
 he-0.040
 even-0.039
 for-0.037
Token even
Token ID772
Feature activation-6.21e-02

Pos logit contributions
der+0.153
attered+0.102
ometer+0.097
 smoothly+0.095
bike+0.094
Neg logit contributions
een-0.173
Laura-0.169
Mary-0.166
loe-0.165
Jamie-0.165
Token we
Token ID356
Feature activation-3.73e-02

Pos logit contributions
der+0.096
 that+0.069
 valuable+0.069
 even+0.069
 for+0.065
Neg logit contributions
een-0.129
loe-0.127
Laura-0.123
Jamie-0.121
Mary-0.120
Tokenir
Token ID343
Feature activation+1.81e-02

Pos logit contributions
der+0.058
 even+0.043
 that+0.043
 valuable+0.041
 for+0.040
Neg logit contributions
een-0.077
loe-0.075
Laura-0.074
Mary-0.072
Jamie-0.072
Tokender
Token ID1082
Feature activation-1.59e-01

Pos logit contributions
een+0.030
loe+0.029
Laura+0.028
Jamie+0.027
Mary+0.027
Neg logit contributions
der-0.036
 valuable-0.028
 that-0.027
 even-0.027
 for-0.026
Token happened
Token ID3022
Feature activation+1.91e-02

Pos logit contributions
der+0.212
 valuable+0.140
 that+0.138
 even+0.137
 for+0.127
Neg logit contributions
een-0.363
loe-0.356
Laura-0.348
Jamie-0.343
Mary-0.340
Token.
Token ID13
Feature activation-7.79e-02

Pos logit contributions
een+0.048
loe+0.048
Laura+0.047
Jamie+0.046
Mary+0.046
Neg logit contributions
der-0.020
 valuable-0.012
 that-0.011
 even-0.011
 for-0.010
Token The
Token ID383
Feature activation-2.55e-02

Pos logit contributions
der+0.110
 that+0.077
 even+0.074
 he+0.073
 valuable+0.072
Neg logit contributions
een-0.170
loe-0.163
Laura-0.162
Jamie-0.159
Mary-0.158
Token animal
Token ID5044
Feature activation+1.03e-02

Pos logit contributions
der+0.039
 valuable+0.027
 that+0.027
 even+0.026
 for+0.025
Neg logit contributions
een-0.051
loe-0.050
Laura-0.049
Jamie-0.048
Mary-0.048
Token suddenly
Token ID6451
Feature activation-2.58e-02

Pos logit contributions
een+0.021
loe+0.021
Laura+0.020
Jamie+0.020
Mary+0.020
Neg logit contributions
der-0.016
 that-0.012
 valuable-0.012
 even-0.012
 for-0.011
Token girl
Token ID2576
Feature activation+1.32e-02

Pos logit contributions
der+0.087
 valuable+0.062
 that+0.061
 even+0.059
 smoothly+0.056
Neg logit contributions
een-0.094
loe-0.092
Laura-0.091
Jamie-0.088
e-0.085
Token felt
Token ID2936
Feature activation-3.22e-02

Pos logit contributions
een+0.028
loe+0.028
Laura+0.027
Jamie+0.026
Mary+0.026
Neg logit contributions
der-0.020
 that-0.014
 even-0.014
 valuable-0.014
 for-0.013
Token even
Token ID772
Feature activation-6.65e-02

Pos logit contributions
der+0.051
 missile+0.034
ometer+0.034
 automobile+0.033
 helicopter+0.033
Neg logit contributions
e-0.063
een-0.063
Laura-0.062
loe-0.062
Jamie-0.061
Token p
Token ID279
Feature activation+1.13e-02

Pos logit contributions
der+0.084
 missile+0.047
 valuable+0.046
 that+0.045
ometer+0.045
Neg logit contributions
een-0.155
Laura-0.151
Jamie-0.148
loe-0.146
e-0.146
Tokenrou
Token ID472
Feature activation-3.59e-02

Pos logit contributions
een+0.018
loe+0.017
Laura+0.017
Jamie+0.017
Mary+0.016
Neg logit contributions
der-0.023
 valuable-0.018
 that-0.018
 even-0.018
 for-0.017
Tokender
Token ID1082
Feature activation-1.59e-01

Pos logit contributions
 valuable+0.020
 automobile+0.015
 car+0.013
der+0.013
 important+0.012
Neg logit contributions
loe-0.093
e-0.093
een-0.090
ink-0.087
ir-0.087
Token.
Token ID13
Feature activation-8.07e-02

Pos logit contributions
der+0.179
 missile+0.128
 helicopter+0.127
 car+0.120
 automobile+0.119
Neg logit contributions
een-0.356
Jamie-0.350
loe-0.348
Laura-0.345
Mary-0.343
Token She
Token ID1375
Feature activation-2.56e-02

Pos logit contributions
der+0.108
 that+0.074
 he+0.070
 valuable+0.070
 even+0.070
Neg logit contributions
een-0.186
loe-0.175
e-0.173
prise-0.173
Jamie-0.171
Token was
Token ID373
Feature activation+4.75e-02

Pos logit contributions
der+0.039
 that+0.028
 even+0.028
 valuable+0.027
 for+0.026
Neg logit contributions
een-0.054
loe-0.053
Laura-0.051
Jamie-0.050
Mary-0.050
Token happy
Token ID3772
Feature activation-2.33e-02

Pos logit contributions
een+0.113
loe+0.110
Laura+0.107
Mary+0.104
Jamie+0.104
Neg logit contributions
der-0.060
 even-0.045
 that-0.044
 valuable-0.041
 for-0.039
Token that
Token ID326
Feature activation-9.12e-02

Pos logit contributions
der+0.026
 valuable+0.018
attered+0.016
 missile+0.015
 helicopter+0.015
Neg logit contributions
een-0.054
Laura-0.053
loe-0.053
Mary-0.052
Jamie-0.052
Token.
Token ID13
Feature activation-7.60e-02

Pos logit contributions
der+0.028
 valuable+0.019
 helicopter+0.017
 missile+0.017
 ot+0.016
Neg logit contributions
een-0.042
loe-0.040
Laura-0.040
Jamie-0.039
e-0.039
Token Red
Token ID2297
Feature activation+3.95e-02

Pos logit contributions
der+0.113
 even+0.078
 valuable+0.076
 that+0.073
 for+0.072
Neg logit contributions
loe-0.159
een-0.159
Laura-0.152
Mary-0.148
Jamie-0.147
Token felt
Token ID2936
Feature activation-2.45e-02

Pos logit contributions
een+0.078
loe+0.075
Laura+0.074
Jamie+0.073
Mary+0.072
Neg logit contributions
der-0.066
 valuable-0.047
 that-0.045
 even-0.045
 for-0.044
Token even
Token ID772
Feature activation-6.32e-02

Pos logit contributions
der+0.043
ometer+0.033
bike+0.029
attered+0.029
 automobile+0.028
Neg logit contributions
e-0.042
Once-0.040
een-0.037
loe-0.037
 blonde-0.036
Token sad
Token ID6507
Feature activation-9.54e-02

Pos logit contributions
der+0.089
ometer+0.054
attered+0.051
 missile+0.049
 valuable+0.047
Neg logit contributions
een-0.135
Laura-0.131
prise-0.131
e-0.128
ited-0.128
Tokender
Token ID1082
Feature activation-1.59e-01

Pos logit contributions
 missile+0.130
 helicopter+0.129
 valuable+0.129
attered+0.127
 automobile+0.123
Neg logit contributions
een-0.129
iley-0.121
loe-0.121
Once-0.118
re-0.115
Token.
Token ID13
Feature activation-7.63e-02

Pos logit contributions
der+0.180
 helicopter+0.131
 valuable+0.127
 missile+0.126
attered+0.123
Neg logit contributions
een-0.345
loe-0.337
Jamie-0.335
e-0.333
Laura-0.331
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
der+0.122
 even+0.087
 valuable+0.085
 that+0.084
 for+0.083
Neg logit contributions
een-0.148
loe-0.147
Laura-0.140
Mary-0.137
Jamie-0.136
Token
Token ID198
Feature activation+1.03e-01

Pos logit contributions
een+0.214
loe+0.206
Laura+0.202
Jamie+0.197
Mary+0.193
Neg logit contributions
der-0.162
 that-0.128
 valuable-0.125
 even-0.122
 he-0.118
TokenBut
Token ID1537
Feature activation+1.08e-02

Pos logit contributions
een+0.147
loe+0.146
Laura+0.139
Mary+0.139
e+0.135
Neg logit contributions
der-0.224
 for-0.170
 even-0.162
 that-0.161
 valuable-0.159
Token then
Token ID788
Feature activation-1.42e-02

Pos logit contributions
een+0.021
loe+0.020
e+0.020
Mary+0.020
Laura+0.019
Neg logit contributions
der-0.017
 valuable-0.012
attered-0.011
taker-0.010
ometer-0.010
Token Tim
Token ID5045
Feature activation+6.46e-02

Pos logit contributions
der+0.126
 valuable+0.091
 even+0.089
 helicopter+0.088
 airplane+0.086
Neg logit contributions
loe-0.147
een-0.145
Mary-0.133
Laura-0.132
Jim-0.130
Tokenmy
Token ID1820
Feature activation+1.77e-02

Pos logit contributions
een+0.120
loe+0.117
Laura+0.115
Jamie+0.114
Mary+0.114
Neg logit contributions
der-0.107
 even-0.090
 that-0.087
 for-0.086
 valuable-0.083
Token was
Token ID373
Feature activation+4.50e-02

Pos logit contributions
een+0.037
loe+0.037
Laura+0.036
Jamie+0.035
Mary+0.035
Neg logit contributions
der-0.027
 that-0.019
 even-0.019
 valuable-0.019
 for-0.018
Token even
Token ID772
Feature activation-6.25e-02

Pos logit contributions
een+0.100
loe+0.099
Laura+0.097
e+0.096
Jamie+0.095
Neg logit contributions
der-0.062
 valuable-0.038
ly-0.036
 that-0.035
 smoothly-0.035
Token sad
Token ID6507
Feature activation-9.75e-02

Pos logit contributions
der+0.078
 valuable+0.044
 that+0.043
 missile+0.039
 even+0.039
Neg logit contributions
een-0.149
Laura-0.144
loe-0.142
Jamie-0.140
e-0.139
Tokender
Token ID1082
Feature activation-1.59e-01

Pos logit contributions
 valuable+0.115
 helicopter+0.107
 missile+0.107
 arrow+0.105
 automobile+0.105
Neg logit contributions
een-0.165
loe-0.164
iley-0.158
Once-0.154
Jamie-0.152
Token now
Token ID783
Feature activation-4.10e-02

Pos logit contributions
der+0.169
 helicopter+0.124
 valuable+0.120
 missile+0.118
 automobile+0.114
Neg logit contributions
een-0.356
loe-0.349
Jamie-0.349
Laura-0.342
Mary-0.342
Token.
Token ID13
Feature activation-8.09e-02

Pos logit contributions
der+0.049
attered+0.039
ometer+0.035
cream+0.034
arp+0.034
Neg logit contributions
e-0.081
loe-0.079
een-0.076
Once-0.076
Laura-0.072
Token He
Token ID679
Feature activation-2.07e-02

Pos logit contributions
der+0.107
 helicopter+0.078
 airplane+0.073
 even+0.072
ly+0.071
Neg logit contributions
loe-0.173
een-0.169
Laura-0.164
Jamie-0.162
Mary-0.160
Token said
Token ID531
Feature activation+7.40e-02

Pos logit contributions
der+0.039
 helicopter+0.027
 valuable+0.027
 missile+0.027
 smoothly+0.026
Neg logit contributions
een-0.035
e-0.034
Laura-0.034
loe-0.033
Jamie-0.033
Token,
Token ID11
Feature activation+2.98e-02

Pos logit contributions
loe+0.108
e+0.101
een+0.101
Laura+0.099
prise+0.098
Neg logit contributions
der-0.146
attered-0.116
 valuable-0.108
 smoothly-0.107
 automobile-0.104
Token The
Token ID383
Feature activation-3.21e-02

Pos logit contributions
der+0.110
 valuable+0.076
 even+0.074
 that+0.073
 for+0.069
Neg logit contributions
een-0.163
loe-0.160
Laura-0.154
Mary-0.152
Jamie-0.151
Token ocean
Token ID9151
Feature activation+2.96e-02

Pos logit contributions
der+0.051
attered+0.034
 valuable+0.031
 that+0.031
 smoothly+0.031
Neg logit contributions
een-0.065
Laura-0.062
loe-0.061
e-0.061
Jamie-0.060
Token was
Token ID373
Feature activation+4.60e-02

Pos logit contributions
een+0.055
loe+0.053
e+0.052
Laura+0.051
Jamie+0.050
Neg logit contributions
der-0.050
 valuable-0.036
 that-0.034
 even-0.033
attered-0.033
Token a
Token ID257
Feature activation-1.12e-03

Pos logit contributions
een+0.098
loe+0.096
Laura+0.094
Mary+0.092
Jamie+0.092
Neg logit contributions
der-0.068
 valuable-0.044
 that-0.042
 even-0.042
ly-0.041
Token sad
Token ID6507
Feature activation-9.21e-02

Pos logit contributions
der+0.002
attered+0.001
ly+0.001
cream+0.001
bike+0.001
Neg logit contributions
Laura-0.002
een-0.002
prise-0.002
Mary-0.002
loe-0.002
Tokender
Token ID1082
Feature activation-1.56e-01

Pos logit contributions
attered+0.128
cream+0.126
der+0.123
powered+0.112
controlled+0.111
Neg logit contributions
een-0.144
 question-0.140
ona-0.137
iley-0.136
loe-0.135
Token place
Token ID1295
Feature activation-6.33e-03

Pos logit contributions
der+0.182
attered+0.134
 valuable+0.128
 helicopter+0.122
 automobile+0.122
Neg logit contributions
een-0.334
loe-0.324
e-0.320
Jamie-0.319
Mary-0.317
Token without
Token ID1231
Feature activation-1.17e-02

Pos logit contributions
der+0.008
 valuable+0.005
 even+0.005
attered+0.005
 helicopter+0.005
Neg logit contributions
een-0.014
loe-0.014
e-0.013
Laura-0.013
Mary-0.013
Token Fin
Token ID4463
Feature activation+1.60e-02

Pos logit contributions
der+0.016
 that+0.012
 even+0.012
 valuable+0.011
 for+0.011
Neg logit contributions
loe-0.027
een-0.027
Laura-0.026
Jamie-0.025
Mary-0.025
Token's
Token ID338
Feature activation-6.74e-03

Pos logit contributions
een+0.039
loe+0.039
Laura+0.038
Jamie+0.037
Mary+0.037
Neg logit contributions
der-0.019
 that-0.012
 valuable-0.012
 even-0.011
 for-0.011
Token happy
Token ID3772
Feature activation-3.85e-02

Pos logit contributions
der+0.011
 valuable+0.008
 that+0.008
 for+0.008
 even+0.008
Neg logit contributions
loe-0.014
een-0.014
Laura-0.014
Jamie-0.014
Mary-0.014
Token.
Token ID13
Feature activation-7.78e-02

Pos logit contributions
der+0.084
 valuable+0.052
attered+0.048
 smoothly+0.047
 even+0.046
Neg logit contributions
een-0.162
loe-0.158
Mary-0.152
Laura-0.152
Jamie-0.151
Token He
Token ID679
Feature activation-1.76e-02

Pos logit contributions
der+0.110
 that+0.075
 valuable+0.075
 even+0.073
 he+0.069
Neg logit contributions
een-0.173
loe-0.169
Laura-0.165
Jamie-0.163
Mary-0.162
Token felt
Token ID2936
Feature activation-2.27e-02

Pos logit contributions
der+0.027
 even+0.020
 that+0.020
 valuable+0.019
 for+0.018
Neg logit contributions
een-0.037
loe-0.036
Laura-0.035
Jamie-0.034
Mary-0.034
Token even
Token ID772
Feature activation-6.03e-02

Pos logit contributions
der+0.042
ometer+0.030
attered+0.028
 automobile+0.028
bike+0.027
Neg logit contributions
e-0.041
een-0.038
Laura-0.036
Once-0.036
Jamie-0.035
Token sad
Token ID6507
Feature activation-9.47e-02

Pos logit contributions
der+0.079
 valuable+0.043
attered+0.042
ometer+0.042
 missile+0.042
Neg logit contributions
een-0.140
Laura-0.136
Mary-0.132
Jamie-0.131
prise-0.131
Tokender
Token ID1082
Feature activation-1.56e-01

Pos logit contributions
 valuable+0.137
 missile+0.134
 helicopter+0.134
 automobile+0.131
attered+0.130
Neg logit contributions
een-0.132
iley-0.122
loe-0.117
Once-0.116
e-0.114
Token.
Token ID13
Feature activation-7.57e-02

Pos logit contributions
der+0.177
 helicopter+0.139
 missile+0.134
 valuable+0.130
 automobile+0.128
Neg logit contributions
een-0.338
Jamie-0.329
Mary-0.326
e-0.324
loe-0.324
Token
Token ID198
Feature activation+1.06e-01

Pos logit contributions
der+0.121
 that+0.089
 valuable+0.087
 he+0.085
 even+0.085
Neg logit contributions
een-0.155
loe-0.150
Laura-0.147
Jamie-0.145
Mary-0.144
Token
Token ID198
Feature activation+1.04e-01

Pos logit contributions
een+0.229
loe+0.219
Laura+0.215
prise+0.210
ael+0.210
Neg logit contributions
der-0.145
 that-0.121
 valuable-0.111
 he-0.110
 even-0.109
TokenMax
Token ID11518
Feature activation+6.32e-02

Pos logit contributions
een+0.165
loe+0.163
Laura+0.159
Jamie+0.154
Mary+0.154
Neg logit contributions
der-0.215
 valuable-0.163
 that-0.162
 even-0.162
 for-0.159
Token sat
Token ID3332
Feature activation+2.85e-02

Pos logit contributions
een+0.130
loe+0.127
Laura+0.125
Jamie+0.122
Mary+0.121
Neg logit contributions
der-0.099
 valuable-0.070
 that-0.070
 even-0.069
 for-0.065
Token sail
Token ID14936
Feature activation-2.56e-02

Pos logit contributions
der+0.029
 valuable+0.020
 even+0.018
 worse+0.018
 important+0.017
Neg logit contributions
een-0.046
loe-0.044
e-0.044
Jamie-0.042
Laura-0.042
Token and
Token ID290
Feature activation-4.54e-02

Pos logit contributions
der+0.038
 valuable+0.025
 even+0.022
 important+0.022
 worse+0.022
Neg logit contributions
e-0.054
een-0.054
loe-0.050
igh-0.048
ighed-0.047
Token a
Token ID257
Feature activation-9.95e-04

Pos logit contributions
der+0.076
 valuable+0.053
ly+0.052
attered+0.050
 for+0.049
Neg logit contributions
een-0.085
Jamie-0.081
Laura-0.080
e-0.080
loe-0.079
Token red
Token ID2266
Feature activation-1.41e-02

Pos logit contributions
der+0.002
ly+0.001
 that+0.001
attered+0.001
 valuable+0.001
Neg logit contributions
een-0.002
Laura-0.002
Jamie-0.002
loe-0.002
Mary-0.002
Token rud
Token ID28906
Feature activation-5.42e-02

Pos logit contributions
der+0.019
 valuable+0.015
 important+0.014
 worse+0.013
attered+0.013
Neg logit contributions
een-0.029
e-0.028
ighed-0.027
loe-0.026
igh-0.026
Tokender
Token ID1082
Feature activation-1.56e-01

Pos logit contributions
der+0.075
 valuable+0.071
 important+0.065
 he+0.063
 even+0.063
Neg logit contributions
een-0.103
loe-0.101
e-0.097
igh-0.094
Jamie-0.090
Token.
Token ID13
Feature activation-7.59e-02

Pos logit contributions
der+0.189
 valuable+0.129
 helicopter+0.126
attered+0.123
 missile+0.119
Neg logit contributions
een-0.319
e-0.316
Jamie-0.310
loe-0.297
Mary-0.293
Token One
Token ID1881
Feature activation+2.31e-02

Pos logit contributions
der+0.068
 that+0.039
 he+0.036
 even+0.033
 valuable+0.030
Neg logit contributions
een-0.209
loe-0.200
Laura-0.199
prise-0.196
Jamie-0.195
Token day
Token ID1110
Feature activation+3.63e-02

Pos logit contributions
een+0.047
loe+0.045
e+0.045
Jamie+0.044
Mary+0.042
Neg logit contributions
der-0.039
 valuable-0.024
 even-0.023
 smoothly-0.023
attered-0.023
Token the
Token ID262
Feature activation-3.67e-02

Pos logit contributions
een+0.077
loe+0.075
Laura+0.074
Jamie+0.072
Mary+0.071
Neg logit contributions
der-0.055
 that-0.041
 even-0.039
 he-0.039
 valuable-0.038
Token boat
Token ID8848
Feature activation-5.98e-03

Pos logit contributions
der+0.047
 valuable+0.027
 that+0.025
 even+0.024
 for+0.024
Neg logit contributions
een-0.087
loe-0.084
Laura-0.084
Jamie-0.083
Mary-0.081
Token.
Token ID13
Feature activation-7.57e-02

Pos logit contributions
der+0.026
 valuable+0.017
 helicopter+0.016
 missile+0.015
cream+0.015
Neg logit contributions
een-0.039
loe-0.037
Laura-0.037
Jamie-0.036
e-0.036
Token He
Token ID679
Feature activation-1.82e-02

Pos logit contributions
der+0.087
 valuable+0.053
 even+0.052
 that+0.052
 for+0.047
Neg logit contributions
een-0.187
loe-0.185
Laura-0.181
Jamie-0.177
Mary-0.176
Token felt
Token ID2936
Feature activation-2.31e-02

Pos logit contributions
der+0.028
 even+0.021
 that+0.021
 valuable+0.020
 he+0.019
Neg logit contributions
een-0.038
loe-0.038
Laura-0.036
Jamie-0.035
Mary-0.035
Token even
Token ID772
Feature activation-5.98e-02

Pos logit contributions
der+0.040
ometer+0.030
bike+0.028
 automobile+0.027
attered+0.027
Neg logit contributions
e-0.041
een-0.038
Once-0.038
loe-0.037
Laura-0.037
Token sad
Token ID6507
Feature activation-9.53e-02

Pos logit contributions
der+0.080
ometer+0.046
 missile+0.044
attered+0.044
 valuable+0.044
Neg logit contributions
een-0.134
Laura-0.131
prise-0.128
Mary-0.127
e-0.127
Tokender
Token ID1082
Feature activation-1.55e-01

Pos logit contributions
 valuable+0.110
 missile+0.108
 helicopter+0.107
 automobile+0.106
attered+0.104
Neg logit contributions
een-0.162
loe-0.153
iley-0.148
Jamie-0.147
Laura-0.143
Token now
Token ID783
Feature activation-3.63e-02

Pos logit contributions
der+0.172
 helicopter+0.122
 missile+0.120
 valuable+0.119
 automobile+0.114
Neg logit contributions
een-0.347
Jamie-0.339
loe-0.338
Mary-0.335
Laura-0.334
Token.
Token ID13
Feature activation-7.60e-02

Pos logit contributions
der+0.046
attered+0.039
ometer+0.037
arp+0.035
eddy+0.034
Neg logit contributions
e-0.067
Once-0.065
 hesitation-0.062
een-0.062
loe-0.061
Token But
Token ID887
Feature activation+3.60e-04

Pos logit contributions
der+0.099
 that+0.064
 valuable+0.064
 even+0.063
 for+0.059
Neg logit contributions
een-0.178
loe-0.174
Laura-0.171
Jamie-0.168
Mary-0.167
Token then
Token ID788
Feature activation-1.67e-02

Pos logit contributions
een+0.001
loe+0.001
e+0.001
Laura+0.001
Jamie+0.001
Neg logit contributions
der-0.000
 valuable-0.000
attered-0.000
ly-0.000
 smoothly-0.000
Token,
Token ID11
Feature activation+2.12e-02

Pos logit contributions
der+0.026
 valuable+0.018
attered+0.018
 smoothly+0.016
 important+0.016
Neg logit contributions
een-0.034
loe-0.033
Laura-0.032
Jamie-0.032
e-0.032
Token.
Token ID13
Feature activation-7.65e-02

Pos logit contributions
der+0.021
 that+0.015
 even+0.015
 for+0.014
 valuable+0.014
Neg logit contributions
een-0.037
loe-0.036
Laura-0.035
Jamie-0.035
Mary-0.035
Token Lily
Token ID20037
Feature activation+2.63e-02

Pos logit contributions
der+0.133
 valuable+0.098
 even+0.097
 that+0.097
 for+0.092
Neg logit contributions
een-0.144
loe-0.141
Laura-0.138
Jamie-0.134
Mary-0.134
Token felt
Token ID2936
Feature activation-2.31e-02

Pos logit contributions
een+0.054
loe+0.053
Laura+0.052
Jamie+0.051
Mary+0.050
Neg logit contributions
der-0.041
 valuable-0.029
 that-0.028
 even-0.027
 for-0.027
Token even
Token ID772
Feature activation-6.14e-02

Pos logit contributions
der+0.039
ometer+0.033
bike+0.031
 automobile+0.029
attered+0.029
Neg logit contributions
e-0.040
Once-0.038
loe-0.033
Laura-0.033
prise-0.033
Token sad
Token ID6507
Feature activation-9.09e-02

Pos logit contributions
der+0.082
attered+0.046
ometer+0.045
 valuable+0.043
 missile+0.042
Neg logit contributions
een-0.136
Laura-0.134
prise-0.133
e-0.132
Jamie-0.128
Tokender
Token ID1082
Feature activation-1.54e-01

Pos logit contributions
 missile+0.121
attered+0.118
cream+0.117
bike+0.116
 automobile+0.115
Neg logit contributions
loe-0.131
een-0.131
Once-0.130
iley-0.125
Laura-0.118
Token than
Token ID621
Feature activation-3.19e-02

Pos logit contributions
der+0.174
 missile+0.128
 helicopter+0.127
 valuable+0.122
 automobile+0.120
Neg logit contributions
een-0.337
loe-0.334
Laura-0.328
Jamie-0.328
Mary-0.327
Token before
Token ID878
Feature activation-4.46e-02

Pos logit contributions
der+0.055
 valuable+0.040
 that+0.038
 even+0.037
attered+0.037
Neg logit contributions
een-0.060
loe-0.059
Laura-0.058
Jamie-0.057
Mary-0.056
Token.
Token ID13
Feature activation-7.38e-02

Pos logit contributions
der+0.038
 that+0.020
 even+0.018
 valuable+0.017
 he+0.016
Neg logit contributions
een-0.127
loe-0.124
Laura-0.122
Jamie-0.121
Mary-0.119
Token From
Token ID3574
Feature activation+4.62e-02

Pos logit contributions
der+0.126
 valuable+0.093
 that+0.092
 even+0.091
 for+0.087
Neg logit contributions
een-0.140
loe-0.138
Laura-0.135
Jamie-0.131
Mary-0.130
Token that
Token ID326
Feature activation-6.53e-02

Pos logit contributions
een+0.100
iley+0.098
Mary+0.098
Elizabeth+0.097
ma+0.097
Neg logit contributions
der-0.059
 valuable-0.038
attered-0.038
ometer-0.036
 smoothly-0.033
Token what
Token ID644
Feature activation-9.51e-02

Pos logit contributions
een+0.018
Jamie+0.017
Laura+0.017
loe+0.017
Mary+0.017
Neg logit contributions
der-0.021
attered-0.017
 valuable-0.017
 smoothly-0.016
 helicopter-0.015
Token it
Token ID340
Feature activation-1.07e-01

Pos logit contributions
der+0.135
 valuable+0.079
attered+0.078
 even+0.076
 smoothly+0.075
Neg logit contributions
een-0.214
loe-0.207
Laura-0.206
Jamie-0.206
Mary-0.204
Token could
Token ID714
Feature activation+4.19e-02

Pos logit contributions
der+0.145
 that+0.103
 even+0.102
 valuable+0.099
 he+0.097
Neg logit contributions
loe-0.243
een-0.242
Laura-0.237
Jamie-0.230
Mary-0.230
Token be
Token ID307
Feature activation-1.12e-02

Pos logit contributions
een+0.076
loe+0.073
e+0.070
Laura+0.070
Jamie+0.070
Neg logit contributions
der-0.076
 valuable-0.056
 that-0.053
 even-0.052
 helicopter-0.051
Token.
Token ID13
Feature activation-7.51e-02

Pos logit contributions
der+0.013
 valuable+0.007
 helicopter+0.006
 that+0.006
attered+0.006
Neg logit contributions
een-0.028
e-0.027
loe-0.026
Laura-0.026
Jamie-0.026
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
der+0.108
 that+0.074
 valuable+0.074
 even+0.072
 he+0.069
Neg logit contributions
een-0.166
loe-0.162
Laura-0.158
Jamie-0.155
Mary-0.154
Token
Token ID198
Feature activation+1.04e-01

Pos logit contributions
een+0.253
loe+0.241
prise+0.237
ael+0.235
ited+0.233
Neg logit contributions
der-0.110
 that-0.098
 he-0.096
 even-0.088
 valuable-0.087
Token"
Token ID1
Feature activation-1.62e-02

Pos logit contributions
een+0.227
loe+0.223
Laura+0.209
prise+0.204
e+0.202
Neg logit contributions
der-0.150
 valuable-0.112
 even-0.111
 that-0.110
 he-0.107
TokenWhat
Token ID2061
Feature activation-5.25e-02

Pos logit contributions
der+0.024
 even+0.020
 he+0.018
 valuable+0.017
 that+0.017
Neg logit contributions
loe-0.034
een-0.032
elve-0.030
igh-0.030
prise-0.030
Token's
Token ID338
Feature activation-9.68e-03

Pos logit contributions
der+0.072
 that+0.053
 even+0.052
 valuable+0.052
 he+0.048
Neg logit contributions
een-0.117
loe-0.116
Laura-0.112
Mary-0.109
Jamie-0.109
Token that
Token ID326
Feature activation-9.47e-02

Pos logit contributions
der+0.013
attered+0.007
 valuable+0.007
ly+0.007
 smoothly+0.007
Neg logit contributions
een-0.022
loe-0.021
e-0.021
prise-0.021
Jamie-0.021
Token at
Token ID379
Feature activation+6.14e-02

Pos logit contributions
der+0.021
bike+0.015
ometer+0.013
 valuable+0.013
 missile+0.013
Neg logit contributions
een-0.023
Laura-0.023
ighed-0.022
prise-0.022
e-0.022
Token the
Token ID262
Feature activation-4.80e-02

Pos logit contributions
een+0.129
loe+0.125
Laura+0.123
Jamie+0.122
Mary+0.120
Neg logit contributions
der-0.092
 valuable-0.066
ly-0.058
 important-0.057
 smoothly-0.056
Token fruit
Token ID8234
Feature activation-3.71e-03

Pos logit contributions
der+0.086
 valuable+0.064
 that+0.062
 even+0.062
 smoothly+0.060
Neg logit contributions
een-0.084
Laura-0.082
loe-0.079
Jamie-0.078
Mary-0.078
Token.
Token ID13
Feature activation-7.07e-02

Pos logit contributions
der+0.005
 valuable+0.004
 important+0.003
 even+0.003
 smoothly+0.003
Neg logit contributions
een-0.008
loe-0.008
Laura-0.008
Jamie-0.008
e-0.008
Token
Token ID198
Feature activation+1.06e-01

Pos logit contributions
der+0.075
 that+0.049
 valuable+0.044
 he+0.043
 even+0.042
Neg logit contributions
een-0.181
loe-0.176
Laura-0.170
e-0.170
prise-0.169
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
een+0.264
ited+0.257
loe+0.252
igh+0.250
ael+0.248
Neg logit contributions
der-0.103
 that-0.096
 he-0.085
 even-0.082
 they-0.079
Token"
Token ID1
Feature activation-1.77e-02

Pos logit contributions
een+0.187
loe+0.185
igh+0.170
Laura+0.169
ael+0.163
Neg logit contributions
der-0.181
 that-0.158
 even-0.150
 he-0.147
 valuable-0.146
TokenI
Token ID40
Feature activation-1.49e-02

Pos logit contributions
der+0.029
 even+0.022
 valuable+0.021
 that+0.021
 he+0.020
Neg logit contributions
loe-0.035
een-0.034
Laura-0.031
igh-0.031
e-0.031
Token want
Token ID765
Feature activation-9.06e-03

Pos logit contributions
der+0.018
 even+0.017
 that+0.014
 for+0.014
 valuable+0.014
Neg logit contributions
loe-0.035
een-0.034
Laura-0.033
Mary-0.032
Jamie-0.032
Token an
Token ID281
Feature activation-1.52e-02

Pos logit contributions
der+0.011
 that+0.007
 valuable+0.007
 even+0.007
 for+0.006
Neg logit contributions
een-0.022
loe-0.022
Laura-0.021
Jamie-0.021
Mary-0.021
Token apple
Token ID17180
Feature activation-1.20e-02

Pos logit contributions
der+0.021
 valuable+0.014
 that+0.014
 smoothly+0.014
 he+0.013
Neg logit contributions
een-0.032
loe-0.031
Laura-0.031
e-0.030
Mary-0.030
Token whole
Token ID2187
Feature activation-1.56e-02

Pos logit contributions
der+0.075
 valuable+0.054
 even+0.053
 that+0.053
 for+0.050
Neg logit contributions
een-0.126
loe-0.124
Laura-0.121
Jamie-0.119
Mary-0.117
Token day
Token ID1110
Feature activation+1.96e-02

Pos logit contributions
der+0.019
 valuable+0.012
 that+0.012
 even+0.012
 for+0.011
Neg logit contributions
een-0.038
loe-0.037
Laura-0.037
Jamie-0.036
Mary-0.036
Token playing
Token ID2712
Feature activation-2.20e-02

Pos logit contributions
een+0.034
loe+0.033
Laura+0.032
Jamie+0.031
Mary+0.031
Neg logit contributions
der-0.037
 valuable-0.028
 that-0.028
 even-0.027
 for-0.026
Token together
Token ID1978
Feature activation+1.80e-03

Pos logit contributions
der+0.034
 valuable+0.025
attered+0.022
 that+0.022
 even+0.022
Neg logit contributions
een-0.046
e-0.044
loe-0.043
Laura-0.041
Mary-0.041
Token.
Token ID13
Feature activation-7.13e-02

Pos logit contributions
een+0.005
loe+0.005
Laura+0.004
Jamie+0.004
Mary+0.004
Neg logit contributions
der-0.002
 valuable-0.001
 that-0.001
 even-0.001
 for-0.001
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
der+0.120
 that+0.088
 valuable+0.087
 even+0.086
 for+0.082
Neg logit contributions
een-0.139
loe-0.136
Laura-0.133
Jamie-0.130
Mary-0.129
Token
Token ID198
Feature activation+1.06e-01

Pos logit contributions
een+0.239
loe+0.231
Laura+0.224
prise+0.218
Jamie+0.217
Neg logit contributions
der-0.146
 that-0.118
 even-0.112
 valuable-0.111
 he-0.104
TokenThe
Token ID464
Feature activation-1.28e-02

Pos logit contributions
een+0.218
loe+0.211
Laura+0.199
prise+0.196
ighed+0.193
Neg logit contributions
der-0.162
 even-0.134
 that-0.129
 valuable-0.128
 he-0.117
Token rabbit
Token ID22746
Feature activation+2.97e-02

Pos logit contributions
der+0.019
 that+0.012
 valuable+0.012
 even+0.012
 for+0.012
Neg logit contributions
een-0.028
loe-0.027
Laura-0.027
Jamie-0.026
Mary-0.026
Token visited
Token ID8672
Feature activation+6.29e-02

Pos logit contributions
een+0.066
loe+0.065
Laura+0.064
Jamie+0.063
Mary+0.062
Neg logit contributions
der-0.041
 even-0.029
 that-0.029
 valuable-0.028
 for-0.027
Token the
Token ID262
Feature activation-3.94e-02

Pos logit contributions
een+0.140
loe+0.138
Laura+0.135
Jamie+0.133
Mary+0.131
Neg logit contributions
der-0.088
 that-0.061
 valuable-0.060
 even-0.060
 for-0.057
Token at
Token ID379
Feature activation+5.48e-02

Pos logit contributions
der+0.032
bike+0.024
ometer+0.023
ger+0.023
bye+0.022
Neg logit contributions
ighed-0.024
prise-0.023
e-0.023
eks-0.023
een-0.022
Token the
Token ID262
Feature activation-4.92e-02

Pos logit contributions
een+0.108
loe+0.103
Jamie+0.101
Laura+0.101
e+0.099
Neg logit contributions
der-0.090
 valuable-0.074
 important-0.065
attered-0.063
 even-0.061
Token mess
Token ID2085
Feature activation-2.60e-02

Pos logit contributions
der+0.088
 valuable+0.068
 smoothly+0.064
 even+0.064
bike+0.062
Neg logit contributions
Laura-0.082
een-0.080
Jamie-0.078
loe-0.077
Eva-0.077
Token.
Token ID13
Feature activation-7.64e-02

Pos logit contributions
der+0.030
 valuable+0.023
 helicopter+0.020
 missile+0.019
 even+0.019
Neg logit contributions
een-0.058
e-0.058
loe-0.058
Jamie-0.056
Laura-0.055
Token
Token ID198
Feature activation+1.04e-01

Pos logit contributions
der+0.087
 valuable+0.052
 even+0.051
 that+0.051
 for+0.046
Neg logit contributions
een-0.190
loe-0.187
Laura-0.185
Jamie-0.182
Mary-0.180
Token
Token ID198
Feature activation+1.03e-01

Pos logit contributions
een+0.219
loe+0.211
Laura+0.205
Jamie+0.198
ael+0.198
Neg logit contributions
der-0.154
 that-0.125
 valuable-0.115
 even-0.114
 he-0.113
Token"
Token ID1
Feature activation-1.72e-02

Pos logit contributions
een+0.173
loe+0.167
Laura+0.157
Jamie+0.152
Mary+0.151
Neg logit contributions
der-0.195
 that-0.161
 valuable-0.155
 even-0.154
 he-0.153
TokenOh
Token ID5812
Feature activation+4.19e-02

Pos logit contributions
der+0.032
 even+0.024
 that+0.024
 valuable+0.024
 he+0.023
Neg logit contributions
een-0.030
loe-0.030
Laura-0.028
Jamie-0.027
Mary-0.027
Token,
Token ID11
Feature activation+9.68e-03

Pos logit contributions
een+0.095
loe+0.092
Laura+0.092
Jamie+0.089
Mary+0.088
Neg logit contributions
der-0.057
 that-0.047
 even-0.041
 for-0.040
 he-0.040
Token Ben
Token ID3932
Feature activation+4.08e-02

Pos logit contributions
een+0.020
loe+0.019
Laura+0.019
Jamie+0.018
Mary+0.018
Neg logit contributions
der-0.015
 valuable-0.011
 even-0.011
 smoothly-0.010
 that-0.010
Token,
Token ID11
Feature activation+5.00e-03

Pos logit contributions
een+0.084
loe+0.083
Laura+0.081
Jamie+0.079
Mary+0.078
Neg logit contributions
der-0.064
 that-0.046
 valuable-0.045
 even-0.045
 for-0.043
Token to
Token ID284
Feature activation+4.91e-02

Pos logit contributions
der+0.121
 valuable+0.069
attered+0.062
bike+0.058
 helicopter+0.057
Neg logit contributions
een-0.218
loe-0.210
prise-0.207
e-0.206
Laura-0.203
Token me
Token ID502
Feature activation+4.13e-02

Pos logit contributions
loe+0.085
een+0.085
Jamie+0.083
Laura+0.083
Mary+0.081
Neg logit contributions
der-0.088
 that-0.081
 even-0.075
 for-0.075
 he-0.069
Token?"
Token ID1701
Feature activation+1.37e-03

Pos logit contributions
een+0.080
loe+0.079
Laura+0.078
Jamie+0.077
Mary+0.076
Neg logit contributions
der-0.069
 that-0.056
 for-0.054
 even-0.054
 valuable-0.050
Token
Token ID220
Feature activation+8.91e-02

Pos logit contributions
een+0.002
loe+0.002
Laura+0.002
Jamie+0.002
Mary+0.002
Neg logit contributions
der-0.002
 that-0.002
 even-0.002
 he-0.002
 valuable-0.002
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
een+0.168
loe+0.163
Laura+0.159
Jamie+0.156
Mary+0.154
Neg logit contributions
der-0.155
 that-0.116
 valuable-0.115
 even-0.114
 for-0.110
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
een+0.253
loe+0.246
ael+0.239
igh+0.238
Laura+0.235
Neg logit contributions
der-0.135
 that-0.117
 even-0.106
 he-0.104
 for-0.102
TokenL
Token ID43
Feature activation+9.05e-02

Pos logit contributions
een+0.198
loe+0.193
Laura+0.187
Jamie+0.184
Mary+0.182
Neg logit contributions
der-0.196
 valuable-0.148
 that-0.148
 even-0.146
 for-0.139
Tokenily
Token ID813
Feature activation+2.17e-02

Pos logit contributions
een+0.191
loe+0.188
Laura+0.185
Jamie+0.182
Mary+0.181
Neg logit contributions
der-0.140
 that-0.096
 even-0.093
 valuable-0.093
 for-0.091
Token finally
Token ID3443
Feature activation+4.05e-02

Pos logit contributions
loe+0.049
prise+0.047
een+0.047
Jamie+0.046
Laura+0.046
Neg logit contributions
der-0.032
 even-0.025
 that-0.024
 he-0.021
ly-0.021
Token gave
Token ID2921
Feature activation+4.84e-02

Pos logit contributions
loe+0.074
een+0.074
Laura+0.070
Jamie+0.069
Mary+0.068
Neg logit contributions
der-0.072
 that-0.057
 even-0.056
 valuable-0.055
 he-0.053
Token her
Token ID607
Feature activation+1.58e-02

Pos logit contributions
een+0.094
loe+0.091
Laura+0.089
Jamie+0.087
Mary+0.087
Neg logit contributions
der-0.084
 valuable-0.061
 even-0.057
 that-0.057
 he-0.054
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
een+0.212
loe+0.195
Laura+0.192
Jamie+0.184
Mary+0.183
Neg logit contributions
der-0.378
 that-0.333
 for-0.325
 he-0.322
 even-0.318
Token a
Token ID257
Feature activation+5.68e-02

Pos logit contributions
een+0.234
loe+0.229
Laura+0.225
Jamie+0.221
Mary+0.219
Neg logit contributions
der-0.152
 valuable-0.103
 that-0.103
 even-0.101
 for-0.095
Token time
Token ID640
Feature activation+8.47e-02

Pos logit contributions
een+0.099
loe+0.098
icians+0.095
igh+0.093
used+0.092
Neg logit contributions
der-0.092
 even-0.087
 important-0.084
 that-0.084
 for-0.083
Token,
Token ID11
Feature activation+4.99e-02

Pos logit contributions
Laura+0.242
een+0.232
icians+0.231
Jamie+0.230
Josh+0.226
Neg logit contributions
 that-0.060
 he-0.053
 for-0.053
der-0.051
 they-0.046
Token in
Token ID287
Feature activation+8.45e-02

Pos logit contributions
een+0.095
loe+0.092
Laura+0.091
Jamie+0.088
Mary+0.088
Neg logit contributions
der-0.084
 that-0.068
 even-0.066
 for-0.065
 valuable-0.064
Token a
Token ID257
Feature activation+4.29e-02

Pos logit contributions
een+0.222
loe+0.221
Jamie+0.214
prise+0.214
Laura+0.212
Neg logit contributions
der-0.067
 that-0.064
 even-0.051
 for-0.048
 he-0.046
Token big
Token ID1263
Feature activation+2.44e-02

Pos logit contributions
een+0.098
loe+0.097
Laura+0.091
Jamie+0.090
Mary+0.090
Neg logit contributions
der-0.052
 even-0.041
 that-0.041
 valuable-0.041
 important-0.038
Token sky
Token ID6766
Feature activation+3.40e-02

Pos logit contributions
een+0.050
loe+0.049
Laura+0.048
Jamie+0.047
Mary+0.047
Neg logit contributions
der-0.036
 that-0.029
 even-0.028
 valuable-0.027
 for-0.027
Token,
Token ID11
Feature activation+3.34e-02

Pos logit contributions
loe+0.076
een+0.076
Laura+0.076
Jamie+0.073
Mary+0.072
Neg logit contributions
der-0.045
 that-0.037
 he-0.032
 for-0.032
 even-0.032
Token there
Token ID612
Feature activation+2.86e-02

Pos logit contributions
een+0.038
loe+0.037
Jamie+0.036
Laura+0.035
e+0.035
Neg logit contributions
der-0.086
 valuable-0.066
ly-0.063
attered-0.062
 smoothly-0.062
Token was
Token ID373
Feature activation+6.51e-02

Pos logit contributions
Laura+0.068
een+0.068
loe+0.068
prise+0.065
Mary+0.065
Neg logit contributions
der-0.033
 that-0.025
 even-0.024
 he-0.023
 for-0.020
Token and
Token ID290
Feature activation-4.85e-02

Pos logit contributions
der+0.130
 helicopter+0.085
 missile+0.083
 valuable+0.083
attered+0.075
Neg logit contributions
een-0.219
loe-0.213
Laura-0.203
e-0.200
prise-0.198
Token that
Token ID326
Feature activation-8.30e-02

Pos logit contributions
der+0.070
attered+0.046
 valuable+0.044
ly+0.044
ger+0.043
Neg logit contributions
een-0.096
loe-0.094
prise-0.094
Laura-0.094
e-0.091
Token even
Token ID772
Feature activation-6.67e-02

Pos logit contributions
der+0.119
ly+0.077
attered+0.073
 valuable+0.073
 smoothly+0.071
Neg logit contributions
een-0.176
loe-0.172
Laura-0.170
Jamie-0.167
e-0.167
Token little
Token ID1310
Feature activation-3.44e-04

Pos logit contributions
der+0.100
attered+0.068
 valuable+0.066
 helicopter+0.065
 missile+0.064
Neg logit contributions
een-0.132
Laura-0.126
loe-0.125
e-0.122
Jamie-0.121
Token kids
Token ID3988
Feature activation+3.63e-02

Pos logit contributions
der+0.001
 valuable+0.000
 that+0.000
 smoothly+0.000
 even+0.000
Neg logit contributions
een-0.001
loe-0.001
Laura-0.001
Jamie-0.001
Mary-0.001
Token can
Token ID460
Feature activation+1.93e-02

Pos logit contributions
een+0.074
loe+0.073
e+0.071
Laura+0.070
Jamie+0.069
Neg logit contributions
der-0.057
 valuable-0.038
 even-0.036
 that-0.035
 helicopter-0.035
Token make
Token ID787
Feature activation-3.41e-02

Pos logit contributions
een+0.030
loe+0.030
e+0.028
Jamie+0.027
Laura+0.027
Neg logit contributions
der-0.039
 valuable-0.030
attered-0.029
ly-0.029
 smoothly-0.027
Token a
Token ID257
Feature activation-1.11e-02

Pos logit contributions
der+0.053
 valuable+0.036
 that+0.035
 even+0.034
 he+0.034
Neg logit contributions
een-0.071
loe-0.069
Laura-0.068
Jamie-0.067
Mary-0.066
Token big
Token ID1263
Feature activation-1.30e-02

Pos logit contributions
der+0.021
attered+0.015
ly+0.015
bike+0.014
cream+0.014
Neg logit contributions
prise-0.018
Laura-0.018
Jamie-0.016
een-0.016
Mary-0.016
Token difference
Token ID3580
Feature activation-4.17e-02

Pos logit contributions
der+0.024
 valuable+0.017
attered+0.017
cream+0.017
 smoothly+0.016
Neg logit contributions
een-0.021
prise-0.021
Laura-0.021
loe-0.020
e-0.020
Token.
Token ID13
Feature activation-7.26e-02

Pos logit contributions
der+0.044
 valuable+0.026
 even+0.023
 that+0.022
 he+0.020
Neg logit contributions
een-0.107
loe-0.105
Laura-0.103
Jamie-0.101
e-0.101
Token their
Token ID511
Feature activation-3.68e-03

Pos logit contributions
een+0.085
e+0.081
loe+0.080
Laura+0.079
ffee+0.076
Neg logit contributions
der-0.100
attered-0.079
 valuable-0.079
ometer-0.074
ly-0.071
Token new
Token ID649
Feature activation-4.46e-02

Pos logit contributions
der+0.006
 valuable+0.004
 that+0.004
 even+0.004
 he+0.004
Neg logit contributions
een-0.007
loe-0.007
Laura-0.007
Jamie-0.007
Mary-0.007
Token friend
Token ID1545
Feature activation+7.86e-03

Pos logit contributions
der+0.080
 valuable+0.060
 that+0.059
 even+0.059
 for+0.056
Neg logit contributions
een-0.083
loe-0.081
Laura-0.079
Jamie-0.077
Mary-0.076
Token and
Token ID290
Feature activation-4.20e-02

Pos logit contributions
een+0.020
loe+0.019
Laura+0.019
Jamie+0.019
Mary+0.018
Neg logit contributions
der-0.009
 that-0.005
 valuable-0.005
 even-0.005
 for-0.005
Token promised
Token ID8072
Feature activation+6.30e-03

Pos logit contributions
der+0.073
 valuable+0.052
attered+0.049
ly+0.046
 important+0.046
Neg logit contributions
een-0.078
Laura-0.076
loe-0.076
e-0.076
Mary-0.075
Token to
Token ID284
Feature activation+4.92e-02

Pos logit contributions
een+0.013
Mary+0.013
loe+0.013
Laura+0.012
Jamie+0.012
Neg logit contributions
der-0.009
attered-0.006
 missile-0.006
 valuable-0.006
 helicopter-0.006
Token always
Token ID1464
Feature activation+4.57e-03

Pos logit contributions
een+0.097
loe+0.095
Laura+0.094
Jamie+0.091
Mary+0.088
Neg logit contributions
der-0.079
 that-0.068
 even-0.068
 for-0.062
 he-0.060
Token remember
Token ID3505
Feature activation-2.75e-02

Pos logit contributions
loe+0.010
Laura+0.010
een+0.010
Jamie+0.010
Mary+0.009
Neg logit contributions
der-0.006
 even-0.005
 that-0.005
 for-0.005
 he-0.005
Token the
Token ID262
Feature activation-6.26e-02

Pos logit contributions
der+0.039
 valuable+0.026
 that+0.026
 even+0.025
 for+0.024
Neg logit contributions
een-0.061
loe-0.060
Laura-0.059
Jamie-0.058
Mary-0.057
Token happy
Token ID3772
Feature activation-4.11e-02

Pos logit contributions
der+0.089
 that+0.058
 valuable+0.058
 even+0.057
 for+0.054
Neg logit contributions
een-0.139
loe-0.136
Laura-0.134
Jamie-0.131
Mary-0.131
Token times
Token ID1661
Feature activation-4.81e-02

Pos logit contributions
der+0.056
 valuable+0.037
 that+0.036
 even+0.036
 for+0.034
Neg logit contributions
een-0.092
loe-0.090
Laura-0.089
Mary-0.087
Jamie-0.087
Token
Token ID220
Feature activation+7.60e-02

Pos logit contributions
der+0.124
 that+0.092
 he+0.092
 even+0.089
 valuable+0.089
Neg logit contributions
een-0.146
loe-0.142
Laura-0.140
prise-0.138
Jamie-0.136
Token
Token ID198
Feature activation+1.06e-01

Pos logit contributions
een+0.161
Laura+0.154
loe+0.154
prise+0.151
Mary+0.149
Neg logit contributions
der-0.114
 that-0.084
 he-0.082
 even-0.079
 valuable-0.079
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
prise+0.276
een+0.274
ited+0.269
loe+0.266
igh+0.263
Neg logit contributions
 he-0.078
der-0.077
 that-0.068
 He-0.068
 even-0.064
TokenHe
Token ID1544
Feature activation-4.72e-03

Pos logit contributions
een+0.220
loe+0.216
prise+0.201
Laura+0.199
ighed+0.198
Neg logit contributions
der-0.152
 even-0.123
 valuable-0.121
 he-0.118
 that-0.116
Token continued
Token ID3767
Feature activation+9.19e-03

Pos logit contributions
der+0.008
 even+0.006
 that+0.006
 valuable+0.006
 he+0.006
Neg logit contributions
loe-0.009
een-0.009
Laura-0.009
Jamie-0.009
Mary-0.009
Token to
Token ID284
Feature activation+5.64e-02

Pos logit contributions
een+0.020
loe+0.020
Laura+0.019
Jamie+0.019
Mary+0.019
Neg logit contributions
der-0.013
 that-0.010
 for-0.010
 even-0.010
 he-0.010
Token hop
Token ID1725
Feature activation+5.33e-04

Pos logit contributions
loe+0.107
een+0.106
Laura+0.104
Jamie+0.101
Mary+0.101
Neg logit contributions
der-0.091
 even-0.074
 that-0.074
 he-0.070
 for-0.069
Token back
Token ID736
Feature activation-2.53e-02

Pos logit contributions
een+0.001
loe+0.001
e+0.001
Mary+0.001
Jamie+0.001
Neg logit contributions
der-0.001
attered-0.001
 valuable-0.000
 important-0.000
 missile-0.000
Token home
Token ID1363
Feature activation+8.45e-03

Pos logit contributions
der+0.030
 valuable+0.018
 that+0.018
 even+0.018
 for+0.016
Neg logit contributions
een-0.062
loe-0.061
Laura-0.059
Jamie-0.059
Mary-0.058
Token,
Token ID11
Feature activation+1.49e-02

Pos logit contributions
een+0.019
loe+0.018
Laura+0.018
Jamie+0.018
Mary+0.017
Neg logit contributions
der-0.012
 that-0.008
 even-0.008
 valuable-0.008
 for-0.008
Token and
Token ID290
Feature activation-3.48e-02

Pos logit contributions
loe+0.030
een+0.030
Laura+0.029
Jamie+0.028
Mary+0.028
Neg logit contributions
der-0.023
 even-0.018
 that-0.018
 for-0.017
 valuable-0.017
Token in
Token ID287
Feature activation+5.55e-02

Pos logit contributions
een+0.011
loe+0.011
Laura+0.010
Jamie+0.010
Mary+0.010
Neg logit contributions
der-0.008
 that-0.006
 even-0.006
 for-0.006
 valuable-0.006
Token the
Token ID262
Feature activation-3.41e-02

Pos logit contributions
lor+0.159
prise+0.159
een+0.156
ona+0.155
reetings+0.153
Neg logit contributions
 that-0.034
 old-0.025
der-0.023
 each-0.019
 even-0.019
Token chairs
Token ID18791
Feature activation+2.67e-02

Pos logit contributions
der+0.059
 that+0.046
 valuable+0.045
 even+0.045
 for+0.044
Neg logit contributions
een-0.063
loe-0.062
Laura-0.059
Mary-0.059
Jamie-0.058
Token did
Token ID750
Feature activation+4.64e-02

Pos logit contributions
loe+0.065
Laura+0.064
een+0.064
Jamie+0.062
Mary+0.062
Neg logit contributions
der-0.034
 that-0.025
 even-0.023
 for-0.022
 he-0.021
Token see
Token ID766
Feature activation+1.85e-02

Pos logit contributions
loe+0.096
een+0.096
Laura+0.092
Jamie+0.089
Mary+0.088
Neg logit contributions
der-0.073
 that-0.057
 even-0.055
 valuable-0.053
 for-0.052
Token Lily
Token ID20037
Feature activation+1.37e-02

Pos logit contributions
een+0.038
loe+0.037
Laura+0.036
Jamie+0.036
Mary+0.035
Neg logit contributions
der-0.029
 that-0.021
 valuable-0.021
 even-0.020
 for-0.019
Token.
Token ID13
Feature activation-7.09e-02

Pos logit contributions
loe+0.034
een+0.034
Laura+0.033
Jamie+0.033
Mary+0.033
Neg logit contributions
der-0.015
 that-0.011
 even-0.010
 for-0.010
 valuable-0.009
Token They
Token ID1119
Feature activation-3.22e-02

Pos logit contributions
der+0.080
 valuable+0.048
 even+0.046
 that+0.045
ly+0.041
Neg logit contributions
een-0.174
loe-0.173
Laura-0.169
Mary-0.165
Jamie-0.165
Token started
Token ID2067
Feature activation+2.57e-03

Pos logit contributions
der+0.054
 even+0.042
 that+0.041
 he+0.039
 for+0.038
Neg logit contributions
een-0.064
loe-0.063
Laura-0.061
Mary-0.059
Jamie-0.059
Token to
Token ID284
Feature activation+5.38e-02

Pos logit contributions
een+0.007
loe+0.007
Laura+0.007
Jamie+0.007
Mary+0.007
Neg logit contributions
der-0.002
 that-0.001
 even-0.001
 for-0.001
 valuable-0.001
Token whisper
Token ID31992
Feature activation-2.82e-02

Pos logit contributions
loe+0.106
een+0.101
Laura+0.097
Mary+0.095
Jamie+0.094
Neg logit contributions
der-0.084
 even-0.074
 that-0.068
 for-0.064
 better-0.064
Token.
Token ID13
Feature activation-7.71e-02

Pos logit contributions
der+0.073
 valuable+0.044
 that+0.044
 even+0.043
 for+0.039
Neg logit contributions
een-0.154
loe-0.152
Laura-0.149
Jamie-0.146
Mary-0.146
Token She
Token ID1375
Feature activation-3.07e-02

Pos logit contributions
der+0.101
 that+0.074
 even+0.067
 he+0.066
 valuable+0.064
Neg logit contributions
een-0.180
loe-0.175
Jamie-0.168
Laura-0.167
prise-0.167
Token thought
Token ID1807
Feature activation-1.52e-02

Pos logit contributions
der+0.050
 even+0.039
 that+0.036
 he+0.035
 valuable+0.034
Neg logit contributions
loe-0.065
een-0.063
Laura-0.059
prise-0.059
Mary-0.058
Token it
Token ID340
Feature activation-9.09e-02

Pos logit contributions
der+0.018
 valuable+0.011
 that+0.011
 even+0.011
 for+0.010
Neg logit contributions
een-0.037
loe-0.036
Laura-0.035
Jamie-0.035
Mary-0.034
Token was
Token ID373
Feature activation+3.09e-02

Pos logit contributions
der+0.109
 even+0.078
 that+0.076
 valuable+0.072
 he+0.070
Neg logit contributions
loe-0.220
een-0.217
Laura-0.215
Jamie-0.210
Mary-0.210
Token not
Token ID407
Feature activation-1.72e-02

Pos logit contributions
een+0.070
loe+0.069
Laura+0.067
Jamie+0.066
Mary+0.066
Neg logit contributions
der-0.042
 valuable-0.028
 that-0.028
 even-0.028
 for-0.026
Token ordinary
Token ID8850
Feature activation-8.14e-02

Pos logit contributions
der+0.025
 valuable+0.016
 that+0.015
 even+0.015
 smoothly+0.015
Neg logit contributions
een-0.037
loe-0.036
Laura-0.035
e-0.035
Jamie-0.035
Token.
Token ID13
Feature activation-8.02e-02

Pos logit contributions
der+0.089
 valuable+0.056
 even+0.051
 that+0.050
 smoothly+0.048
Neg logit contributions
een-0.199
loe-0.193
Laura-0.188
e-0.185
Jamie-0.184
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
der+0.105
 that+0.074
 even+0.069
 he+0.067
 valuable+0.066
Neg logit contributions
een-0.189
loe-0.184
Laura-0.176
Jamie-0.176
prise-0.175
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
een+0.205
loe+0.199
Laura+0.192
Jamie+0.187
e+0.184
Neg logit contributions
der-0.165
 that-0.127
 valuable-0.121
 even-0.120
 he-0.117
Token"
Token ID1
Feature activation-2.06e-02

Pos logit contributions
een+0.156
loe+0.153
Laura+0.145
Jamie+0.141
Mary+0.137
Neg logit contributions
der-0.215
 that-0.173
 valuable-0.170
 even-0.168
 he-0.163
Token here
Token ID994
Feature activation+5.85e-03

Pos logit contributions
een+0.059
ighed+0.058
Laura+0.058
Mary+0.058
Jul+0.057
Neg logit contributions
 that-0.048
der-0.048
 for-0.044
 he-0.040
 car-0.040
Token?"
Token ID1701
Feature activation+1.96e-02

Pos logit contributions
loe+0.012
Laura+0.012
Mary+0.012
een+0.012
Jamie+0.012
Neg logit contributions
der-0.008
 that-0.008
 for-0.008
 even-0.007
 he-0.006
Token Mom
Token ID11254
Feature activation+4.67e-02

Pos logit contributions
een+0.048
loe+0.046
Laura+0.045
e+0.045
Jamie+0.044
Neg logit contributions
der-0.024
 that-0.017
 he-0.016
 even-0.016
 valuable-0.015
Token says
Token ID1139
Feature activation+6.65e-02

Pos logit contributions
een+0.084
loe+0.082
Laura+0.079
Mary+0.078
Jamie+0.077
Neg logit contributions
der-0.083
 that-0.067
 for-0.064
 even-0.064
 he-0.063
Token.
Token ID13
Feature activation-6.39e-02

Pos logit contributions
een+0.156
loe+0.151
Laura+0.150
e+0.146
Jamie+0.146
Neg logit contributions
der-0.076
 valuable-0.055
 even-0.049
 helicopter-0.047
 missile-0.047
Token "
Token ID366
Feature activation-1.55e-02

Pos logit contributions
der+0.077
 that+0.049
 even+0.044
 he+0.043
 valuable+0.042
Neg logit contributions
een-0.161
loe-0.158
e-0.152
Laura-0.152
prise-0.151
TokenWhy
Token ID5195
Feature activation-3.88e-02

Pos logit contributions
der+0.027
 that+0.020
 valuable+0.020
 even+0.019
 he+0.019
Neg logit contributions
een-0.030
loe-0.029
Laura-0.028
Jamie-0.027
Mary-0.027
Token are
Token ID389
Feature activation+5.43e-02

Pos logit contributions
der+0.065
 that+0.050
 even+0.048
 for+0.047
 he+0.047
Neg logit contributions
een-0.076
loe-0.074
Laura-0.073
Mary-0.071
Jamie-0.071
Token you
Token ID345
Feature activation-9.05e-03

Pos logit contributions
een+0.100
loe+0.099
Laura+0.096
Jamie+0.095
Mary+0.093
Neg logit contributions
der-0.097
 valuable-0.071
 even-0.069
 that-0.066
 smoothly-0.065
Token fighting
Token ID4330
Feature activation-6.62e-02

Pos logit contributions
der+0.012
 even+0.011
 that+0.011
 for+0.010
 because+0.010
Neg logit contributions
Laura-0.020
een-0.020
Mary-0.020
Eva-0.020
loe-0.020
Token?"
Token ID1701
Feature activation-1.29e-03

Pos logit contributions
der+0.104
 that+0.075
 even+0.074
 valuable+0.073
 for+0.072
Neg logit contributions
een-0.136
loe-0.134
Laura-0.131
Jamie-0.128
Mary-0.128
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
der+0.092
 that+0.062
 valuable+0.060
 even+0.059
 he+0.059
Neg logit contributions
een-0.159
loe-0.155
Laura-0.151
Jamie-0.149
Mary-0.147
Token
Token ID198
Feature activation+1.07e-01

Pos logit contributions
een+0.252
loe+0.241
ael+0.234
Laura+0.230
prise+0.230
Neg logit contributions
der-0.129
 that-0.111
 he-0.103
 valuable-0.097
 even-0.094
Token"
Token ID1
Feature activation-1.17e-02

Pos logit contributions
een+0.173
loe+0.168
Laura+0.154
Jamie+0.151
ael+0.149
Neg logit contributions
der-0.210
 that-0.177
 he-0.173
 valuable-0.171
 even-0.166
TokenI
Token ID40
Feature activation-1.24e-02

Pos logit contributions
der+0.020
 that+0.014
 valuable+0.014
 even+0.014
 he+0.014
Neg logit contributions
een-0.023
loe-0.023
Laura-0.022
Jamie-0.021
Mary-0.021
Token'm
Token ID1101
Feature activation+3.61e-02

Pos logit contributions
der+0.019
 even+0.019
 that+0.017
 better+0.017
 for+0.017
Neg logit contributions
Laura-0.024
loe-0.024
een-0.023
Jamie-0.022
Eva-0.022
Token sorry
Token ID7926
Feature activation-5.16e-02

Pos logit contributions
een+0.089
loe+0.086
Laura+0.085
Jamie+0.083
Mary+0.082
Neg logit contributions
der-0.041
 that-0.031
 even-0.029
 he-0.028
 for-0.028
Token,
Token ID11
Feature activation+2.06e-03

Pos logit contributions
der+0.107
attered+0.092
 valuable+0.089
 smoothly+0.085
 missile+0.080
Neg logit contributions
loe-0.074
een-0.072
Mary-0.068
Laura-0.067
Jamie-0.067
Token kids
Token ID3988
Feature activation+5.04e-02

Pos logit contributions
een+0.004
loe+0.004
Laura+0.004
Jamie+0.004
Mary+0.004
Neg logit contributions
der-0.003
 valuable-0.002
 even-0.002
 that-0.002
attered-0.002
Token.
Token ID13
Feature activation-7.92e-02

Pos logit contributions
een+0.108
loe+0.106
e+0.105
Laura+0.102
Mary+0.102
Neg logit contributions
der-0.072
 valuable-0.052
 even-0.045
attered-0.044
 smoothly-0.044
Token Rex
Token ID17853
Feature activation+4.22e-02

Pos logit contributions
der+0.147
 even+0.117
 valuable+0.113
 important+0.103
 smoothly+0.101
Neg logit contributions
loe-0.139
een-0.136
e-0.134
Josh-0.127
Laura-0.126
Token is
Token ID318
Feature activation+2.20e-02

Pos logit contributions
een+0.080
loe+0.077
e+0.074
Laura+0.074
Jamie+0.072
Neg logit contributions
der-0.073
 valuable-0.053
 even-0.051
 that-0.049
 smoothly-0.049
Token and
Token ID290
Feature activation-4.48e-02

Pos logit contributions
een+0.076
loe+0.073
Laura+0.071
Jamie+0.071
Mary+0.070
Neg logit contributions
der-0.047
 valuable-0.032
 important-0.028
 even-0.028
 helicopter-0.027
Token water
Token ID1660
Feature activation-4.08e-02

Pos logit contributions
der+0.074
 valuable+0.053
 that+0.052
 even+0.051
 for+0.049
Neg logit contributions
een-0.089
loe-0.086
Laura-0.084
Jamie-0.083
Mary-0.082
Token and
Token ID290
Feature activation-4.60e-02

Pos logit contributions
der+0.043
 that+0.026
 even+0.024
 valuable+0.024
 for+0.023
Neg logit contributions
een-0.105
loe-0.104
Laura-0.102
Jamie-0.100
Mary-0.100
Token got
Token ID1392
Feature activation+1.61e-02

Pos logit contributions
der+0.080
 valuable+0.058
 that+0.054
 even+0.053
 missile+0.052
Neg logit contributions
een-0.087
loe-0.083
Laura-0.083
Jamie-0.081
Mary-0.081
Token in
Token ID287
Feature activation+4.31e-02

Pos logit contributions
een+0.032
loe+0.031
Laura+0.031
Jamie+0.031
e+0.030
Neg logit contributions
der-0.026
 valuable-0.017
 that-0.016
 missile-0.016
 even-0.015
Token the
Token ID262
Feature activation-4.64e-02

Pos logit contributions
een+0.108
loe+0.107
Laura+0.104
Jamie+0.102
Mary+0.100
Neg logit contributions
der-0.045
 that-0.034
 even-0.033
 for-0.032
 valuable-0.031
Token car
Token ID1097
Feature activation-6.46e-03

Pos logit contributions
der+0.094
attered+0.066
 valuable+0.066
 even+0.062
ly+0.062
Neg logit contributions
een-0.075
Laura-0.074
Jamie-0.074
loe-0.071
Eva-0.070
Token.
Token ID13
Feature activation-6.93e-02

Pos logit contributions
der+0.008
 valuable+0.005
 important+0.004
 even+0.004
 missile+0.004
Neg logit contributions
een-0.016
loe-0.015
e-0.015
Jamie-0.015
Laura-0.015
Token
Token ID198
Feature activation+1.09e-01

Pos logit contributions
der+0.107
 that+0.076
 valuable+0.076
 even+0.075
 for+0.071
Neg logit contributions
een-0.145
loe-0.141
Laura-0.138
Jamie-0.136
Mary-0.135
Token
Token ID198
Feature activation+1.07e-01

Pos logit contributions
een+0.256
loe+0.247
igh+0.241
ael+0.240
prise+0.238
Neg logit contributions
der-0.123
 that-0.106
 even-0.098
 he-0.097
 valuable-0.095
TokenThey
Token ID2990
Feature activation-6.57e-03

Pos logit contributions
een+0.176
loe+0.174
Laura+0.164
Jamie+0.161
Mary+0.158
Neg logit contributions
der-0.212
 valuable-0.169
 that-0.168
 even-0.167
 he-0.159
Token.
Token ID13
Feature activation-7.04e-02

Pos logit contributions
een+0.015
loe+0.014
Laura+0.014
Jamie+0.014
Mary+0.014
Neg logit contributions
der-0.007
 that-0.004
 valuable-0.004
 even-0.004
 for-0.004
Token She
Token ID1375
Feature activation-2.10e-02

Pos logit contributions
der+0.094
 that+0.063
 even+0.062
 he+0.062
 valuable+0.061
Neg logit contributions
een-0.162
loe-0.156
Laura-0.152
Jamie-0.151
prise-0.149
Token was
Token ID373
Feature activation+5.38e-02

Pos logit contributions
der+0.036
 valuable+0.026
 that+0.025
 even+0.024
ly+0.024
Neg logit contributions
een-0.040
loe-0.039
Laura-0.038
Jamie-0.038
Mary-0.037
Token so
Token ID523
Feature activation-2.29e-03

Pos logit contributions
een+0.125
loe+0.123
Laura+0.119
Jamie+0.117
Mary+0.115
Neg logit contributions
der-0.068
 even-0.054
 that-0.052
 valuable-0.049
 for-0.049
Token excited
Token ID6568
Feature activation-4.44e-03

Pos logit contributions
der+0.002
 valuable+0.002
 even+0.002
 that+0.002
 important+0.002
Neg logit contributions
een-0.006
loe-0.006
Laura-0.006
Jamie-0.005
Josh-0.005
Token -
Token ID532
Feature activation-1.99e-02

Pos logit contributions
der+0.005
 valuable+0.004
 missile+0.003
 helicopter+0.003
 smoothly+0.003
Neg logit contributions
een-0.010
Mary-0.010
loe-0.010
Laura-0.010
Jamie-0.009
Token she
Token ID673
Feature activation-3.49e-02

Pos logit contributions
der+0.027
 valuable+0.018
 that+0.016
 even+0.016
ly+0.015
Neg logit contributions
een-0.045
loe-0.044
Laura-0.043
Mary-0.043
Jamie-0.043
Token loved
Token ID6151
Feature activation+2.63e-02

Pos logit contributions
der+0.051
 even+0.038
 that+0.037
 valuable+0.036
 he+0.035
Neg logit contributions
een-0.075
loe-0.074
Laura-0.072
Jamie-0.070
Mary-0.070
Token spending
Token ID4581
Feature activation-4.53e-02

Pos logit contributions
een+0.061
loe+0.060
Laura+0.059
Jamie+0.058
Mary+0.057
Neg logit contributions
der-0.035
 that-0.024
 even-0.023
 valuable-0.023
 for-0.022
Token time
Token ID640
Feature activation+5.09e-02

Pos logit contributions
der+0.091
 that+0.081
 even+0.077
 for+0.075
 valuable+0.074
Neg logit contributions
loe-0.070
een-0.069
Laura-0.067
prise-0.067
Jamie-0.066
Token with
Token ID351
Feature activation+2.14e-02

Pos logit contributions
een+0.112
loe+0.110
Laura+0.107
Jamie+0.105
Mary+0.105
Neg logit contributions
der-0.072
 that-0.051
 even-0.050
 valuable-0.050
 for-0.047
Token mountain
Token ID8598
Feature activation-5.07e-03

Pos logit contributions
der+0.006
 valuable+0.004
 that+0.004
 even+0.004
 for+0.004
Neg logit contributions
een-0.007
loe-0.007
Laura-0.007
Jamie-0.007
Mary-0.007
Token.
Token ID13
Feature activation-6.70e-02

Pos logit contributions
der+0.006
 valuable+0.003
 even+0.003
 that+0.003
attered+0.003
Neg logit contributions
een-0.013
loe-0.012
Laura-0.012
e-0.012
Jamie-0.012
Token The
Token ID383
Feature activation-2.00e-02

Pos logit contributions
der+0.056
 he+0.032
 that+0.031
 He+0.028
 valuable+0.027
Neg logit contributions
een-0.186
loe-0.184
prise-0.182
Laura-0.180
Jamie-0.177
Token ant
Token ID1885
Feature activation+8.33e-03

Pos logit contributions
der+0.035
 that+0.025
 valuable+0.025
 even+0.025
 for+0.024
Neg logit contributions
een-0.038
loe-0.037
Laura-0.036
Jamie-0.036
Mary-0.035
Token knew
Token ID2993
Feature activation+1.32e-03

Pos logit contributions
een+0.017
loe+0.017
Laura+0.017
Jamie+0.016
Mary+0.016
Neg logit contributions
der-0.013
 that-0.010
 even-0.010
 valuable-0.009
 he-0.009
Token it
Token ID340
Feature activation-9.23e-02

Pos logit contributions
een+0.003
loe+0.003
Laura+0.003
Jamie+0.003
Mary+0.003
Neg logit contributions
der-0.002
 valuable-0.001
 even-0.001
 that-0.001
 smoothly-0.001
Token was
Token ID373
Feature activation+3.52e-02

Pos logit contributions
der+0.139
 valuable+0.092
 that+0.091
 even+0.090
 for+0.084
Neg logit contributions
een-0.197
loe-0.192
Laura-0.186
Jamie-0.184
Mary-0.183
Token a
Token ID257
Feature activation-1.09e-02

Pos logit contributions
een+0.091
loe+0.090
Laura+0.088
Jamie+0.087
Mary+0.086
Neg logit contributions
der-0.036
 valuable-0.023
 that-0.022
 even-0.022
 for-0.021
Token big
Token ID1263
Feature activation-1.56e-02

Pos logit contributions
der+0.016
 that+0.010
 even+0.010
 valuable+0.010
ly+0.010
Neg logit contributions
een-0.023
loe-0.023
Laura-0.023
Jamie-0.022
Mary-0.022
Token job
Token ID1693
Feature activation-6.25e-02

Pos logit contributions
der+0.028
attered+0.020
 valuable+0.020
 smoothly+0.019
 that+0.018
Neg logit contributions
een-0.027
loe-0.026
Laura-0.026
e-0.025
Jamie-0.025
Token,
Token ID11
Feature activation+1.19e-02

Pos logit contributions
der+0.090
 valuable+0.061
 that+0.059
 even+0.059
 for+0.055
Neg logit contributions
een-0.136
loe-0.133
Laura-0.130
Jamie-0.128
Mary-0.127
Token wrote
Token ID2630
Feature activation+1.24e-02

Pos logit contributions
der+0.079
 valuable+0.058
 that+0.055
 even+0.054
 important+0.053
Neg logit contributions
een-0.085
loe-0.081
Laura-0.081
Jamie-0.079
Mary-0.079
Token '
Token ID705
Feature activation-1.54e-02

Pos logit contributions
een+0.029
loe+0.028
Laura+0.028
Jamie+0.027
Mary+0.027
Neg logit contributions
der-0.016
 valuable-0.011
 that-0.011
 even-0.010
 for-0.010
Tokenfriendly
Token ID13120
Feature activation-4.57e-02

Pos logit contributions
der+0.026
 that+0.022
 even+0.021
 valuable+0.021
 smoothly+0.020
Neg logit contributions
loe-0.028
een-0.027
Laura-0.026
Jamie-0.026
Josh-0.025
Token'
Token ID6
Feature activation-1.64e-02

Pos logit contributions
der+0.076
 that+0.063
 valuable+0.059
 for+0.059
 even+0.057
Neg logit contributions
een-0.087
loe-0.086
Laura-0.085
Jamie-0.084
Mary-0.082
Token on
Token ID319
Feature activation+3.52e-02

Pos logit contributions
der+0.019
 valuable+0.011
 that+0.011
 even+0.011
 for+0.010
Neg logit contributions
een-0.040
loe-0.040
Laura-0.039
Jamie-0.038
Mary-0.038
Token it
Token ID340
Feature activation-1.05e-01

Pos logit contributions
een+0.081
loe+0.079
Laura+0.077
Jamie+0.076
prise+0.076
Neg logit contributions
der-0.046
 that-0.037
 valuable-0.031
 even-0.031
 for-0.030
Token.
Token ID13
Feature activation-8.52e-02

Pos logit contributions
der+0.158
 that+0.117
 even+0.113
 valuable+0.113
 for+0.111
Neg logit contributions
een-0.225
loe-0.219
Laura-0.214
Jamie-0.212
Mary-0.210
Token He
Token ID679
Feature activation-2.97e-02

Pos logit contributions
der+0.095
 that+0.061
 he+0.059
 even+0.055
 valuable+0.055
Neg logit contributions
een-0.214
loe-0.206
Laura-0.204
Jamie-0.200
Mary-0.199
Token couldn
Token ID3521
Feature activation+2.69e-02

Pos logit contributions
der+0.048
 even+0.036
 that+0.036
 valuable+0.035
 he+0.033
Neg logit contributions
een-0.060
loe-0.060
Laura-0.058
Jamie-0.056
Mary-0.056
Token't
Token ID470
Feature activation-3.51e-03

Pos logit contributions
een+0.038
Laura+0.036
loe+0.036
Jamie+0.035
Mary+0.034
Neg logit contributions
der-0.059
 that-0.049
 even-0.047
 valuable-0.047
 he-0.046
Token wait
Token ID4043
Feature activation-1.88e-02

Pos logit contributions
der+0.007
 valuable+0.005
 that+0.005
 important+0.005
 for+0.005
Neg logit contributions
een-0.006
loe-0.006
Laura-0.006
e-0.006
Jamie-0.005
Tokenberry
Token ID8396
Feature activation+1.15e-02

Pos logit contributions
der+0.019
 that+0.015
 for+0.015
 even+0.015
 he+0.014
Neg logit contributions
Laura-0.013
een-0.013
loe-0.012
Jamie-0.012
Josh-0.012
Token muff
Token ID27563
Feature activation+3.26e-02

Pos logit contributions
een+0.019
loe+0.019
Laura+0.018
Jamie+0.018
Mary+0.018
Neg logit contributions
der-0.022
 that-0.017
 valuable-0.017
 even-0.017
 for-0.016
Tokenins
Token ID1040
Feature activation+1.16e-02

Pos logit contributions
een+0.055
loe+0.054
Laura+0.052
Josh+0.051
Mary+0.050
Neg logit contributions
der-0.064
 that-0.053
 even-0.051
 for-0.050
 valuable-0.049
Token for
Token ID329
Feature activation-1.44e-02

Pos logit contributions
een+0.029
loe+0.028
e+0.027
Laura+0.027
Jamie+0.027
Neg logit contributions
der-0.013
 valuable-0.008
 even-0.007
 helicopter-0.007
 that-0.007
Token you
Token ID345
Feature activation-2.93e-02

Pos logit contributions
der+0.019
 that+0.018
 for+0.016
 even+0.016
 he+0.014
Neg logit contributions
ighed-0.032
een-0.031
loe-0.031
Josh-0.030
Laura-0.030
Token.
Token ID13
Feature activation-9.39e-02

Pos logit contributions
der+0.023
 that+0.014
 for+0.013
 even+0.013
 valuable+0.011
Neg logit contributions
een-0.084
loe-0.084
Laura-0.081
Jamie-0.080
Mary-0.080
Token They
Token ID1119
Feature activation-4.37e-02

Pos logit contributions
der+0.121
 that+0.078
 valuable+0.077
 even+0.076
 for+0.071
Neg logit contributions
een-0.220
loe-0.215
Laura-0.211
Jamie-0.208
Mary-0.207
Token are
Token ID389
Feature activation+5.89e-02

Pos logit contributions
der+0.076
 valuable+0.056
 that+0.056
 even+0.056
 for+0.053
Neg logit contributions
een-0.082
loe-0.080
Laura-0.078
Jamie-0.077
Mary-0.076
Token in
Token ID287
Feature activation+5.40e-02

Pos logit contributions
een+0.157
loe+0.154
Laura+0.147
Jamie+0.145
Mary+0.145
Neg logit contributions
der-0.058
 even-0.045
 that-0.045
 for-0.044
 valuable-0.043
Token the
Token ID262
Feature activation-3.75e-02

Pos logit contributions
een+0.143
loe+0.139
ighed+0.134
Mary+0.132
Jamie+0.132
Neg logit contributions
 that-0.049
der-0.047
 for-0.046
 even-0.045
 valuable-0.042
Token kitchen
Token ID9592
Feature activation+4.54e-03

Pos logit contributions
der+0.062
 that+0.050
 even+0.049
 valuable+0.049
 for+0.049
Neg logit contributions
een-0.072
loe-0.071
Laura-0.067
Mary-0.067
Jamie-0.067
Token said
Token ID531
Feature activation+7.72e-02

Pos logit contributions
der+0.080
 valuable+0.057
 important+0.052
 that+0.051
 even+0.050
Neg logit contributions
een-0.098
Laura-0.094
Jamie-0.093
loe-0.092
Mary-0.092
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
een+0.142
loe+0.139
Laura+0.134
Jamie+0.132
Mary+0.132
Neg logit contributions
der-0.134
 valuable-0.099
 even-0.096
 that-0.094
attered-0.092
Token "
Token ID366
Feature activation-2.35e-02

Pos logit contributions
een+0.052
loe+0.051
Laura+0.050
Jamie+0.049
Mary+0.049
Neg logit contributions
der-0.027
 that-0.017
 valuable-0.017
 even-0.017
 for-0.016
TokenTake
Token ID12322
Feature activation-8.27e-03

Pos logit contributions
der+0.037
 even+0.027
 valuable+0.027
 that+0.026
 he+0.025
Neg logit contributions
een-0.047
loe-0.047
Laura-0.044
Jamie-0.043
prise-0.043
Token this
Token ID428
Feature activation-7.95e-02

Pos logit contributions
der+0.009
 that+0.007
 even+0.006
 for+0.006
 valuable+0.006
Neg logit contributions
een-0.021
loe-0.021
Laura-0.021
Jamie-0.020
Mary-0.020
Token!
Token ID0
Feature activation-1.03e-01

Pos logit contributions
der+0.118
 valuable+0.080
 that+0.080
 even+0.079
 for+0.074
Neg logit contributions
een-0.171
loe-0.167
Laura-0.163
Jamie-0.160
Mary-0.159
Token It
Token ID632
Feature activation-8.43e-02

Pos logit contributions
der+0.132
 that+0.090
 even+0.082
 he+0.082
 valuable+0.081
Neg logit contributions
een-0.239
Laura-0.234
Jamie-0.231
Mary-0.228
loe-0.228
Token's
Token ID338
Feature activation-6.91e-03

Pos logit contributions
der+0.119
 valuable+0.081
 that+0.081
 even+0.079
 for+0.075
Neg logit contributions
een-0.187
loe-0.183
Laura-0.179
Jamie-0.176
Mary-0.175
Token special
Token ID2041
Feature activation-5.64e-02

Pos logit contributions
der+0.008
 even+0.007
 that+0.007
 for+0.006
 valuable+0.006
Neg logit contributions
een-0.017
loe-0.016
Laura-0.016
Jamie-0.016
Mary-0.016
Token like
Token ID588
Feature activation-2.97e-02

Pos logit contributions
der+0.080
 valuable+0.055
 that+0.054
 even+0.054
 for+0.050
Neg logit contributions
een-0.124
loe-0.121
Laura-0.119
Jamie-0.117
Mary-0.116
Token you
Token ID345
Feature activation-3.14e-02

Pos logit contributions
der+0.041
 that+0.030
 valuable+0.029
 even+0.029
 for+0.028
Neg logit contributions
een-0.067
loe-0.065
Laura-0.064
Jamie-0.063
Mary-0.062
Token.
Token ID13
Feature activation-8.18e-02

Pos logit contributions
der+0.037
 valuable+0.027
 important+0.023
attered+0.023
 even+0.022
Neg logit contributions
een-0.062
e-0.061
loe-0.060
Laura-0.058
ighed-0.057
Token I
Token ID314
Feature activation-2.27e-02

Pos logit contributions
der+0.144
 even+0.116
 valuable+0.112
 for+0.107
ly+0.106
Neg logit contributions
loe-0.144
e-0.138
een-0.137
Laura-0.136
 Deal-0.134
Token did
Token ID750
Feature activation+4.60e-02

Pos logit contributions
der+0.044
 helicopter+0.033
 airplane+0.033
 missile+0.032
 valuable+0.031
Neg logit contributions
e-0.038
een-0.035
ighed-0.034
loe-0.033
prise-0.033
Token not
Token ID407
Feature activation-1.21e-02

Pos logit contributions
e+0.052
prise+0.045
een+0.045
Laura+0.043
Eva+0.042
Neg logit contributions
der-0.115
attered-0.098
ly-0.093
 ot-0.089
lessly-0.087
Token know
Token ID760
Feature activation+4.56e-04

Pos logit contributions
der+0.021
 valuable+0.014
 that+0.014
 even+0.014
 helicopter+0.013
Neg logit contributions
een-0.024
loe-0.023
Laura-0.022
e-0.022
Mary-0.022
Token what
Token ID644
Feature activation-9.76e-02

Pos logit contributions
een+0.001
loe+0.001
Jamie+0.001
Laura+0.001
Mary+0.001
Neg logit contributions
der-0.001
 valuable-0.001
attered-0.001
 smoothly-0.000
ometer-0.000
Token that
Token ID326
Feature activation-9.79e-02

Pos logit contributions
der+0.153
 valuable+0.103
 even+0.102
 that+0.097
attered+0.096
Neg logit contributions
een-0.201
loe-0.197
Laura-0.194
Jamie-0.191
Mary-0.189
Token button
Token ID4936
Feature activation-4.06e-02

Pos logit contributions
der+0.163
 valuable+0.117
 even+0.113
 that+0.112
 smoothly+0.107
Neg logit contributions
een-0.192
loe-0.185
Laura-0.181
Jamie-0.178
Mary-0.177
Token did
Token ID750
Feature activation+3.06e-02

Pos logit contributions
der+0.069
 valuable+0.055
 smoothly+0.049
 even+0.049
attered+0.048
Neg logit contributions
een-0.072
e-0.069
loe-0.068
Jamie-0.064
Laura-0.063
Token.
Token ID13
Feature activation-8.10e-02

Pos logit contributions
een+0.058
e+0.056
loe+0.055
Mary+0.053
Laura+0.053
Neg logit contributions
der-0.046
attered-0.034
 ot-0.031
bike-0.031
 helicopter-0.031
Token Please
Token ID4222
Feature activation+5.80e-02

Pos logit contributions
der+0.131
 even+0.100
 valuable+0.099
 for+0.093
ly+0.092
Neg logit contributions
loe-0.154
een-0.153
Laura-0.151
e-0.147
Jamie-0.146