TOP ACTIVATIONS
MAX = 0.315

was still sad and kept wanting to play with the big
do something he 'd been wanting to do his whole life
, the restless kitten kept wanting to venture away .
it , but Jack kept wanting to touch it . Jack
are not meant to be kept in jars , Lily .
them but it must be kept very safe . As they
shook because it didn 't want you to climb up ,
still mad and didn 't want to listen . Lily felt
the toy he had been wanting for so long . His
'm sorry I did not want to play with you ,"
being selfish and didn 't want to share his jelly .
candy choice and did not want to share her mint candy
a book he had been wanting to read . The man
'm sorry I did not want to count with you ,"
I 'm sorry for not wanting to take a bath .
the house and didn 't want to share it with anyone
you have and not always want more . Come on ,
chocolate and we didn 't want to sleep ."
being selfish and didn 't want to share the spring with
toy that he had been wanting for a long time .

MIN ACTIVATIONS
MIN = -0.410

what is that ?" The girl said , " It 's
bedroom . The next day Pen elope went outside to pick
, Oak Tree !" The girl put her hands out and
which she loved . Everywhere Daisy went she took her net
to make music ," the girl said . " It will
. One night , El sie was alone in her room
something bad happened . The girl got lost . She was
by this special lamp that Pen elope had !
amazing things . The little girl listened to his stories with
a little girl . The girl was very scared , because
sun is strong ." The girl said , " Yes mummy
time together . The little girl always thought it was the
to her and comfort ed Pen elope . " I told
gray sky later ." The baby looked up and said ,
the thing apart . The girl explained to the kitten that
talking and laughing . The girl was curious and she wanted
I 'm sorry ." The girl smiled and said " It
swing with you ?" The girl and the boy say ,
surprise ." The girl thought this sounded fun and
her . The girl hugged the heavy pillow close

INTERVAL 0.134 - 0.315
CONTAINS 15.658%

and rolls over . She looks up at Ben and says
Anna and Ben do not have passports . They are too
in case another magical surprise was waiting for him . <|endoftext|>
When they boy woke up , he felt so
wished she had someone to talk to . She decided to

INTERVAL -0.047 - 0.134
CONTAINS 50.171%

girl played all sorts of games with the robot , like
face . But one day , it got very cold and
you ." Lily was happy to hear that .
reached a dark cre vice in the forest . He tried
her garden , Penny got very thirsty . She wanted to

INTERVAL -0.228 - -0.047
CONTAINS 32.945%

clean the train . The conductor was happy to have some
did a good job and they can go outside now .
. Tim was sad . The pigeon wanted to help Tim
the family was happy that they had prevented the bad thing
that day on , Brown ie and his friends always shared

INTERVAL -0.410 - -0.228
CONTAINS 1.226%

little girl named Lily . Lily loved to play outside in
day . Wally ran around the garden , smiling and laughing
quickly realized how boring the girl was . He told her
Every day Bob tried and tried , but
time to go home . Tim forgot to remove the hat
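
The four INTERVAL blocks above look like equal-width bins spanning MIN (-0.410) to MAX (0.315). The sketch below is a minimal check of that reading, not the dashboard's actual code: the helper name `interval_edges` is hypothetical, and the numbers are copied from the MIN, MAX, INTERVAL, and CONTAINS lines. It recomputes the bin edges and confirms that the CONTAINS percentages add up to 100.

```python
def interval_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return n_bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Values copied from the dump (assumed to be rounded for display).
MIN_ACT, MAX_ACT = -0.410, 0.315
printed_edges = [-0.410, -0.228, -0.047, 0.134, 0.315]
contains_pct = [1.226, 32.945, 50.171, 15.658]  # lowest interval first

edges = interval_edges(MIN_ACT, MAX_ACT, 4)

# Largest gap between a recomputed edge and the edge printed in the dump.
max_edge_gap = max(abs(e - p) for e, p in zip(edges, printed_edges))

# Total share of tokens covered by the four intervals.
total_pct = sum(contains_pct)

print([round(e, 4) for e in edges], max_edge_gap, total_pct)
```

The recomputed edges agree with the printed ones to within about 0.001, which is consistent with MIN and MAX having been rounded before display. Roughly 83% of tokens fall in the two middle intervals, so activations near either extreme are rare.
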
Token was
Token ID373
Feature activation+1.42e-01

Pos logit contributions
suit+0.544
issy+0.529
azz+0.506
acker+0.505
pp+0.502
Neg logit contributions
 not-0.339
 going-0.330
 helping-0.324
 reaching-0.321
 having-0.315
Token still
Token ID991
Feature activation+1.43e-01

Pos logit contributions
 saving+0.152
 insisting+0.150
 reaching+0.142
 Listen+0.140
 going+0.138
Neg logit contributions
suit-0.309
ppy-0.297
coat-0.288
kin-0.287
aky-0.285
Token sad
Token ID6507
Feature activation+1.18e-01

Pos logit contributions
 reaching+0.121
 saving+0.120
 insisting+0.115
 helping+0.114
 going+0.112
Neg logit contributions
suit-0.379
issy-0.354
azz-0.346
 folder-0.344
uffy-0.343
Token and
Token ID290
Feature activation+5.77e-02

Pos logit contributions
 saving+0.161
 insisting+0.160
 reaching+0.158
 going+0.155
 succeeding+0.153
Neg logit contributions
suit-0.253
y-0.242
issy-0.242
azz-0.240
uffy-0.238
Token kept
Token ID4030
Feature activation+2.84e-01

Pos logit contributions
 saving+0.080
 helping+0.079
 reaching+0.078
 going+0.075
 resisting+0.075
Neg logit contributions
suit-0.124
issy-0.119
azz-0.113
ppy-0.112
uffy-0.111
Token wanting
Token ID10291
Feature activation+3.15e-01

Pos logit contributions
 reaching+0.204
 Listen+0.200
 himself+0.200
 Then+0.197
 next+0.187
Neg logit contributions
 folder-0.652
 skirt-0.625
suit-0.612
x-0.606
 tant-0.603
Token to
Token ID284
Feature activation+1.40e-01

Pos logit contributions
 insisting+0.336
 reaching+0.324
 going+0.313
 resisting+0.301
 ordering+0.290
Neg logit contributions
 folder-0.710
 tant-0.705
suit-0.703
issy-0.671
 skirt-0.669
Token play
Token ID711
Feature activation+1.78e-01

Pos logit contributions
 going+0.196
 reaching+0.196
 helping+0.195
 saving+0.195
 insisting+0.193
Neg logit contributions
suit-0.306
issy-0.301
azz-0.286
acker-0.282
pp-0.281
Token with
Token ID351
Feature activation+1.71e-01

Pos logit contributions
 going+0.141
 reaching+0.138
 not+0.135
 helping+0.135
 saving+0.133
Neg logit contributions
suit-0.509
issy-0.492
azz-0.475
acker-0.471
pp-0.469
Token the
Token ID262
Feature activation-1.65e-01

Pos logit contributions
 going+0.092
 reaching+0.087
 helping+0.087
 saving+0.084
 insisting+0.084
Neg logit contributions
suit-0.524
issy-0.514
azz-0.494
acker-0.492
pp-0.491
Token big
Token ID1263
Feature activation-9.28e-02

Pos logit contributions
suit+0.244
issy+0.235
azz+0.219
acker+0.212
pp+0.212
Neg logit contributions
 going-0.364
 not-0.363
 helping-0.362
 reaching-0.357
 having-0.356
Token do
Token ID466
Feature activation+2.14e-01

Pos logit contributions
 not+0.127
 going+0.126
 helping+0.123
 reaching+0.121
 having+0.118
Neg logit contributions
suit-0.240
issy-0.231
azz-0.222
pp-0.221
acker-0.220
Token something
Token ID1223
Feature activation-2.26e-03

Pos logit contributions
 reaching+0.228
 insisting+0.226
 helping+0.221
 saving+0.218
 resisting+0.216
Neg logit contributions
suit-0.504
issy-0.491
azz-0.472
 folder-0.465
uffy-0.464
Token he
Token ID339
Feature activation-8.04e-02

Pos logit contributions
suit+0.005
issy+0.005
azz+0.005
 folder+0.005
uffy+0.005
Neg logit contributions
 saving-0.003
 insisting-0.003
 reaching-0.003
 resisting-0.003
 having-0.003
Token'd
Token ID1549
Feature activation+1.94e-01

Pos logit contributions
suit+0.152
issy+0.147
azz+0.138
acker+0.136
pp+0.135
Neg logit contributions
 helping-0.140
 reaching-0.140
 going-0.139
 not-0.139
 saving-0.138
Token been
Token ID587
Feature activation+1.61e-01

Pos logit contributions
 saving+0.264
 helping+0.262
 insisting+0.259
 reaching+0.257
 resisting+0.255
Neg logit contributions
suit-0.408
issy-0.391
coat-0.372
acker-0.371
azz-0.368
Token wanting
Token ID10291
Feature activation+3.15e-01

Pos logit contributions
 reaching+0.112
 insisting+0.105
 resisting+0.103
 saving+0.099
 himself+0.099
Neg logit contributions
suit-0.440
issy-0.427
pp-0.417
azz-0.407
 folder-0.407
Token to
Token ID284
Feature activation+1.41e-01

Pos logit contributions
 insisting+0.246
 resisting+0.237
 reaching+0.224
 saving+0.223
 succeeding+0.223
Neg logit contributions
suit-0.825
issy-0.789
 folder-0.768
azz-0.768
ppy-0.754
Token do
Token ID466
Feature activation+2.42e-01

Pos logit contributions
 insisting+0.195
 saving+0.195
 reaching+0.190
 helping+0.189
 resisting+0.189
Neg logit contributions
issy-0.294
suit-0.294
acker-0.277
azz-0.276
pp-0.273
Token his
Token ID465
Feature activation-1.18e-01

Pos logit contributions
 reaching+0.271
 insisting+0.271
 resisting+0.261
 saving+0.255
 helping+0.254
Neg logit contributions
suit-0.568
issy-0.538
kin-0.515
azz-0.514
 folder-0.511
Token whole
Token ID2187
Feature activation-4.44e-02

Pos logit contributions
suit+0.215
issy+0.200
uffy+0.200
 folder+0.197
acker+0.193
Neg logit contributions
 insisting-0.193
 saving-0.192
 reaching-0.190
 resisting-0.189
 helping-0.185
Token life
Token ID1204
Feature activation-1.43e-02

Pos logit contributions
suit+0.082
issy+0.078
azz+0.074
acker+0.073
pp+0.072
Neg logit contributions
 going-0.080
 not-0.079
 helping-0.079
 reaching-0.079
 saving-0.079
Token,
Token ID11
Feature activation+3.62e-02

Pos logit contributions
 reaching+0.100
 going+0.100
 saving+0.099
 insisting+0.098
 helping+0.096
Neg logit contributions
issy-0.166
suit-0.162
uffy-0.159
pp-0.157
azz-0.155
Token the
Token ID262
Feature activation-1.83e-01

Pos logit contributions
 saving+0.026
 reaching+0.025
 going+0.025
 insisting+0.024
 succeeding+0.024
Neg logit contributions
issy-0.094
suit-0.093
uffy-0.090
 folder-0.089
cy-0.088
Token restless
Token ID42537
Feature activation+1.65e-03

Pos logit contributions
suit+0.224
issy+0.217
azz+0.203
pp+0.199
acker+0.199
Neg logit contributions
 not-0.452
 going-0.451
 helping-0.450
 reaching-0.442
 having-0.440
Token kitten
Token ID42143
Feature activation-2.26e-01

Pos logit contributions
 next+0.004
 not+0.004
 following+0.004
 helping+0.004
 taking+0.003
Neg logit contributions
acker-0.003
issy-0.003
azz-0.003
pp-0.003
suit-0.002
Token kept
Token ID4030
Feature activation+2.80e-01

Pos logit contributions
suit+0.445
issy+0.430
azz+0.399
uffy+0.398
ppy+0.395
Neg logit contributions
 reaching-0.367
 saving-0.366
 going-0.365
 helping-0.365
 insisting-0.361
Token wanting
Token ID10291
Feature activation+3.12e-01

Pos logit contributions
 Listen+0.194
 not+0.160
 Then+0.155
 himself+0.152
 saving+0.150
Neg logit contributions
x-0.628
pp-0.622
c-0.620
 folder-0.617
suit-0.616
Token to
Token ID284
Feature activation+1.39e-01

Pos logit contributions
 insisting+0.284
 reaching+0.267
 having+0.258
 going+0.258
 resisting+0.255
Neg logit contributions
issy-0.747
suit-0.731
 tant-0.726
 folder-0.716
affe-0.707
Token venture
Token ID13189
Feature activation+1.81e-01

Pos logit contributions
 insisting+0.203
 saving+0.196
 reaching+0.196
 helping+0.195
 resisting+0.193
Neg logit contributions
issy-0.285
suit-0.282
azz-0.265
pp-0.265
acker-0.264
Token away
Token ID1497
Feature activation+4.44e-02

Pos logit contributions
 teaching+0.224
 helping+0.223
 resisting+0.219
 succeeding+0.218
 reaching+0.218
Neg logit contributions
suit-0.426
issy-0.388
ppy-0.378
azz-0.371
 folder-0.370
Token.
Token ID13
Feature activation+1.31e-03

Pos logit contributions
 insisting+0.051
 going+0.048
 teaching+0.047
 reaching+0.047
 encouraging+0.047
Neg logit contributions
suit-0.102
y-0.094
issy-0.093
ppy-0.092
pp-0.091
Token\n
Token ID198
Feature activation-2.69e-02

Pos logit contributions
 going+0.001
 helping+0.001
 reaching+0.001
 saving+0.001
 not+0.001
Neg logit contributions
suit-0.003
issy-0.003
azz-0.003
acker-0.003
uffy-0.003
Token it
Token ID340
Feature activation-9.04e-02

Pos logit contributions
 encouraging+0.070
 reaching+0.069
 succeeding+0.065
 teaching+0.064
 resisting+0.064
Neg logit contributions
issy-0.222
suit-0.220
x-0.215
 folder-0.215
 tant-0.211
Token,
Token ID11
Feature activation+4.19e-02

Pos logit contributions
suit+0.232
issy+0.225
acker+0.213
azz+0.213
uffy+0.211
Neg logit contributions
 reaching-0.094
 going-0.093
 saving-0.092
 insisting-0.091
 helping-0.090
Token but
Token ID475
Feature activation+9.14e-02

Pos logit contributions
 succeeding+0.059
 saving+0.059
 reaching+0.058
 resisting+0.057
 helping+0.057
Neg logit contributions
suit-0.084
issy-0.078
azz-0.076
coat-0.074
 folder-0.073
Token Jack
Token ID3619
Feature activation-1.80e-01

Pos logit contributions
 saving+0.131
 reaching+0.130
 going+0.128
 helping+0.128
 insisting+0.124
Neg logit contributions
suit-0.194
issy-0.189
azz-0.178
acker-0.175
uffy-0.173
Token kept
Token ID4030
Feature activation+2.79e-01

Pos logit contributions
suit+0.402
issy+0.391
azz+0.377
acker+0.371
pp+0.370
Neg logit contributions
 not-0.260
 going-0.256
 helping-0.252
 reaching-0.250
 saving-0.246
Token wanting
Token ID10291
Feature activation+3.12e-01

Pos logit contributions
 Listen+0.271
 succeeding+0.224
 teaching+0.207
 Then+0.196
 saving+0.190
Neg logit contributions
suit-0.585
c-0.572
coat-0.571
 scarf-0.569
 sweater-0.565
Token to
Token ID284
Feature activation+1.37e-01

Pos logit contributions
 insisting+0.318
 resisting+0.302
 reaching+0.302
 teaching+0.298
 succeeding+0.297
Neg logit contributions
suit-0.690
 tant-0.685
 vest-0.667
issy-0.660
 folder-0.657
Token touch
Token ID3638
Feature activation+1.89e-01

Pos logit contributions
 insisting+0.214
 resisting+0.206
 succeeding+0.202
 teaching+0.201
 saving+0.201
Neg logit contributions
issy-0.251
coat-0.247
suit-0.241
brush-0.233
uffy-0.231
Token it
Token ID340
Feature activation-6.23e-02

Pos logit contributions
 saving+0.137
 helping+0.135
 reaching+0.135
 insisting+0.134
 resisting+0.133
Neg logit contributions
suit-0.524
issy-0.513
azz-0.490
acker-0.489
 tant-0.484
Token.
Token ID13
Feature activation-1.47e-03

Pos logit contributions
suit+0.154
issy+0.147
acker+0.140
azz+0.140
ppy+0.139
Neg logit contributions
 saving-0.069
 resisting-0.066
 insisting-0.066
 teaching-0.065
 helping-0.065
Token Jack
Token ID3619
Feature activation-2.04e-01

Pos logit contributions
suit+0.004
issy+0.004
acker+0.003
azz+0.003
uffy+0.003
Neg logit contributions
 going-0.002
 saving-0.002
 helping-0.002
 reaching-0.002
 resisting-0.002
Token are
Token ID389
Feature activation+1.29e-01

Pos logit contributions
 not+0.096
 going+0.095
 helping+0.094
 reaching+0.093
 saving+0.093
Neg logit contributions
suit-0.117
issy-0.113
azz-0.107
pp-0.106
acker-0.106
Token not
Token ID407
Feature activation+1.83e-01

Pos logit contributions
 insisting+0.104
 reaching+0.100
 resisting+0.091
 succeeding+0.086
 helping+0.083
Neg logit contributions
suit-0.337
issy-0.331
ocado-0.327
ppy-0.322
azz-0.314
Token meant
Token ID4001
Feature activation+1.95e-01

Pos logit contributions
 reaching+0.152
 going+0.151
 not+0.151
 helping+0.150
 saving+0.148
Neg logit contributions
suit-0.507
issy-0.495
azz-0.477
pp-0.473
acker-0.472
Token to
Token ID284
Feature activation+1.35e-01

Pos logit contributions
 insisting+0.166
 reaching+0.147
 succeeding+0.147
 resisting+0.144
 going+0.140
Neg logit contributions
suit-0.499
uffy-0.482
issy-0.476
 folder-0.470
ppy-0.469
Token be
Token ID307
Feature activation+1.54e-01

Pos logit contributions
 going+0.161
 not+0.159
 helping+0.159
 reaching+0.157
 saving+0.156
Neg logit contributions
suit-0.335
issy-0.326
azz-0.312
acker-0.309
pp-0.308
Token kept
Token ID4030
Feature activation+3.12e-01

Pos logit contributions
 going+0.179
 reaching+0.177
 insisting+0.176
 helping+0.170
 saving+0.167
Neg logit contributions
suit-0.367
issy-0.351
ppy-0.342
azz-0.338
pp-0.333
Token in
Token ID287
Feature activation+1.02e-01

Pos logit contributions
 reaching+0.265
 succeeding+0.260
 insisting+0.248
 resisting+0.241
 helping+0.241
Neg logit contributions
suit-0.761
pp-0.728
issy-0.726
ppy-0.724
 folder-0.723
Token jars
Token ID33008
Feature activation-1.22e-02

Pos logit contributions
 reaching+0.114
 going+0.113
 insisting+0.107
 saving+0.107
 helping+0.106
Neg logit contributions
suit-0.236
issy-0.227
 folder-0.224
 tant-0.223
acker-0.219
Token,
Token ID11
Feature activation+6.00e-02

Pos logit contributions
suit+0.033
issy+0.031
acker+0.030
azz+0.030
pp+0.030
Neg logit contributions
 going-0.011
 reaching-0.011
 helping-0.011
 insisting-0.011
 saving-0.011
Token Lily
Token ID20037
Feature activation-1.89e-01

Pos logit contributions
 going+0.077
 not+0.076
 helping+0.076
 reaching+0.076
 saving+0.075
Neg logit contributions
suit-0.142
issy-0.138
azz-0.132
acker-0.130
pp-0.130
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
suit+0.520
issy+0.508
azz+0.488
acker+0.483
pp+0.483
Neg logit contributions
 going-0.167
 helping-0.164
 not-0.163
 reaching-0.163
 saving-0.160
Token them
Token ID606
Feature activation-6.96e-02

Pos logit contributions
 going+0.148
 not+0.146
 helping+0.145
 reaching+0.143
 saving+0.141
Neg logit contributions
suit-0.524
issy-0.511
azz-0.492
acker-0.488
pp-0.487
Token but
Token ID475
Feature activation+1.05e-01

Pos logit contributions
suit+0.195
issy+0.190
azz+0.182
acker+0.181
pp+0.181
Neg logit contributions
 going-0.058
 reaching-0.058
 helping-0.057
 not-0.057
 saving-0.057
Token it
Token ID340
Feature activation-7.47e-02

Pos logit contributions
 going+0.105
 reaching+0.103
 helping+0.103
 saving+0.102
 not+0.100
Neg logit contributions
suit-0.276
issy-0.268
azz-0.256
pp-0.254
acker-0.253
Token must
Token ID1276
Feature activation+1.84e-01

Pos logit contributions
suit+0.182
issy+0.175
azz+0.174
acker+0.171
pp+0.170
Neg logit contributions
 not-0.103
 going-0.097
 helping-0.092
 having-0.090
 reaching-0.090
Token be
Token ID307
Feature activation+1.65e-01

Pos logit contributions
 not+0.227
 going+0.209
 helping+0.201
 reaching+0.199
 having+0.196
Neg logit contributions
suit-0.470
issy-0.453
azz-0.443
pp-0.437
acker-0.434
Token kept
Token ID4030
Feature activation+3.11e-01

Pos logit contributions
 reaching+0.196
 resisting+0.184
 saving+0.183
 helping+0.183
 succeeding+0.182
Neg logit contributions
suit-0.399
issy-0.371
ppy-0.363
coat-0.356
pp-0.354
Token very
Token ID845
Feature activation+1.54e-01

Pos logit contributions
 reaching+0.268
 saving+0.252
 helping+0.248
 not+0.245
 succeeding+0.244
Neg logit contributions
suit-0.813
issy-0.797
pp-0.766
uffy-0.750
ppy-0.745
Token safe
Token ID3338
Feature activation+1.16e-01

Pos logit contributions
 saving+0.224
 teaching+0.221
 reaching+0.212
 helping+0.211
 having+0.210
Neg logit contributions
suit-0.344
kin-0.305
 tant-0.301
issy-0.300
ler-0.299
Token.
Token ID13
Feature activation+3.73e-03

Pos logit contributions
 reaching+0.116
 going+0.111
 helping+0.111
 not+0.110
 insisting+0.110
Neg logit contributions
suit-0.304
issy-0.285
pp-0.276
azz-0.271
cy-0.270
Token As
Token ID1081
Feature activation+8.97e-02

Pos logit contributions
 not+0.003
 going+0.003
 having+0.003
 helping+0.003
 reaching+0.003
Neg logit contributions
suit-0.011
issy-0.011
pp-0.010
azz-0.010
acker-0.010
Token they
Token ID484
Feature activation-9.77e-02

Pos logit contributions
 saving+0.095
 reaching+0.094
 helping+0.093
 going+0.093
 insisting+0.092
Neg logit contributions
suit-0.224
issy-0.222
azz-0.211
uffy-0.209
pp-0.207
Token shook
Token ID14682
Feature activation+2.34e-01

Pos logit contributions
suit+0.199
issy+0.191
azz+0.183
acker+0.180
pp+0.178
Neg logit contributions
 not-0.167
 going-0.166
 helping-0.164
 reaching-0.162
 saving-0.162
Token because
Token ID780
Feature activation+6.96e-02

Pos logit contributions
 reaching+0.220
 going+0.220
 helping+0.215
 saving+0.215
 insisting+0.207
Neg logit contributions
suit-0.616
issy-0.596
acker-0.570
azz-0.569
pp-0.568
Token it
Token ID340
Feature activation-8.08e-02

Pos logit contributions
 reaching+0.067
 insisting+0.066
 going+0.064
 saving+0.064
 succeeding+0.061
Neg logit contributions
issy-0.173
suit-0.171
pp-0.162
 folder-0.161
uffy-0.160
Token didn
Token ID1422
Feature activation+2.51e-01

Pos logit contributions
suit+0.165
issy+0.156
azz+0.151
acker+0.148
pp+0.146
Neg logit contributions
 not-0.136
 going-0.134
 helping-0.132
 saving-0.130
 having-0.130
Token't
Token ID470
Feature activation+1.57e-01

Pos logit contributions
 saving+0.241
 helping+0.234
 reaching+0.232
 insisting+0.229
 succeeding+0.226
Neg logit contributions
issy-0.653
azz-0.608
suit-0.601
uffy-0.599
acker-0.593
Token want
Token ID765
Feature activation+3.11e-01

Pos logit contributions
 going+0.236
 not+0.235
 helping+0.232
 reaching+0.228
 saving+0.225
Neg logit contributions
suit-0.351
issy-0.335
azz-0.321
uffy-0.317
pp-0.317
Token you
Token ID345
Feature activation-1.67e-02

Pos logit contributions
 succeeding+0.248
 insisting+0.247
 reaching+0.236
 resisting+0.232
 saving+0.227
Neg logit contributions
issy-0.818
suit-0.799
 folder-0.788
acker-0.775
azz-0.769
Token to
Token ID284
Feature activation+1.27e-01

Pos logit contributions
suit+0.052
issy+0.050
acker+0.049
ppy+0.049
azz+0.049
Neg logit contributions
 insisting-0.008
 reaching-0.008
 succeeding-0.008
 saving-0.007
 resisting-0.007
Token climb
Token ID12080
Feature activation+1.59e-01

Pos logit contributions
 saving+0.167
 insisting+0.164
 helping+0.163
 going+0.163
 reaching+0.163
Neg logit contributions
suit-0.285
issy-0.281
acker-0.268
azz-0.267
pp-0.266
Token up
Token ID510
Feature activation+1.06e-01

Pos logit contributions
 helping+0.143
 reaching+0.142
 saving+0.138
 going+0.137
 not+0.136
Neg logit contributions
suit-0.426
issy-0.406
pp-0.395
acker-0.392
uffy-0.391
Token,
Token ID11
Feature activation+5.74e-02

Pos logit contributions
 saving+0.111
 teaching+0.109
 encouraging+0.109
 insisting+0.109
 helping+0.109
Neg logit contributions
suit-0.253
 tant-0.232
issy-0.231
 folder-0.230
acker-0.229
Token still
Token ID991
Feature activation+1.50e-01

Pos logit contributions
 reaching+0.165
 insisting+0.164
 adding+0.163
 saving+0.161
 Listen+0.153
Neg logit contributions
suit-0.324
ppy-0.323
kin-0.317
coat-0.309
issy-0.307
Token mad
Token ID8805
Feature activation+1.42e-01

Pos logit contributions
 reaching+0.165
 saving+0.158
 going+0.153
 helping+0.152
 insisting+0.152
Neg logit contributions
suit-0.366
issy-0.341
uffy-0.335
azz-0.333
 folder-0.332
Token and
Token ID290
Feature activation+7.04e-02

Pos logit contributions
 saving+0.181
 reaching+0.174
 insisting+0.171
 helping+0.170
 resisting+0.168
Neg logit contributions
suit-0.322
azz-0.316
issy-0.310
y-0.307
ppy-0.294
Token didn
Token ID1422
Feature activation+2.45e-01

Pos logit contributions
 reaching+0.103
 saving+0.100
 helping+0.098
 himself+0.097
 succeeding+0.096
Neg logit contributions
suit-0.142
issy-0.140
uffy-0.131
azz-0.130
pp-0.130
Token't
Token ID470
Feature activation+1.56e-01

Pos logit contributions
 saving+0.244
 helping+0.238
 reaching+0.230
 asking+0.225
 resisting+0.220
Neg logit contributions
issy-0.642
azz-0.610
suit-0.599
uffy-0.599
acker-0.582
Token want
Token ID765
Feature activation+3.10e-01

Pos logit contributions
 going+0.213
 not+0.211
 helping+0.209
 reaching+0.207
 saving+0.204
Neg logit contributions
suit-0.362
issy-0.352
azz-0.334
acker-0.332
pp-0.331
Token to
Token ID284
Feature activation+1.42e-01

Pos logit contributions
 reaching+0.193
 saving+0.184
 insisting+0.174
 resisting+0.171
 helping+0.170
Neg logit contributions
issy-0.917
azz-0.881
suit-0.878
 folder-0.865
 tant-0.835
Token listen
Token ID6004
Feature activation+2.30e-01

Pos logit contributions
 saving+0.195
 reaching+0.194
 helping+0.191
 insisting+0.190
 going+0.189
Neg logit contributions
suit-0.309
issy-0.306
azz-0.293
uffy-0.285
acker-0.284
Token.
Token ID13
Feature activation+8.46e-03

Pos logit contributions
 saving+0.313
 reaching+0.311
 succeeding+0.300
 resisting+0.298
 releasing+0.290
Neg logit contributions
suit-0.487
y-0.477
 tant-0.448
azz-0.445
issy-0.431
Token Lily
Token ID20037
Feature activation-2.03e-01

Pos logit contributions
 reaching+0.011
 going+0.011
 helping+0.011
 saving+0.011
 insisting+0.010
Neg logit contributions
suit-0.020
issy-0.020
azz-0.019
uffy-0.018
acker-0.018
Token felt
Token ID2936
Feature activation+1.72e-01

Pos logit contributions
suit+0.433
issy+0.425
azz+0.401
uffy+0.393
pp+0.390
Neg logit contributions
 reaching-0.299
 helping-0.298
 going-0.297
 saving-0.295
 having-0.286
Token the
Token ID262
Feature activation-1.60e-01

Pos logit contributions
 going+0.123
 not+0.121
 reaching+0.120
 helping+0.120
 saving+0.116
Neg logit contributions
suit-0.537
issy-0.528
azz-0.506
acker-0.502
pp-0.501
Token toy
Token ID13373
Feature activation-1.28e-01

Pos logit contributions
suit+0.298
issy+0.291
azz+0.276
acker+0.272
pp+0.270
Neg logit contributions
 going-0.289
 not-0.286
 helping-0.286
 reaching-0.282
 saving-0.281
Token he
Token ID339
Feature activation-1.10e-01

Pos logit contributions
suit+0.288
issy+0.281
azz+0.270
acker+0.269
pp+0.266
Neg logit contributions
 going-0.180
 not-0.178
 helping-0.178
 saving-0.175
 reaching-0.174
Token had
Token ID550
Feature activation+1.30e-01

Pos logit contributions
suit+0.210
issy+0.203
azz+0.189
acker+0.187
pp+0.185
Neg logit contributions
 reaching-0.189
 helping-0.189
 going-0.188
 not-0.188
 saving-0.187
Token been
Token ID587
Feature activation+1.57e-01

Pos logit contributions
 insisting+0.219
 resisting+0.218
 helping+0.216
 reaching+0.213
 encouraging+0.212
Neg logit contributions
suit-0.227
issy-0.219
pp-0.206
acker-0.202
coat-0.200
Token wanting
Token ID10291
Feature activation+3.10e-01

Pos logit contributions
 Listen+0.171
 reaching+0.160
 resisting+0.153
 insisting+0.151
 himself+0.151
Neg logit contributions
suit-0.373
issy-0.342
pp-0.336
ppy-0.333
coat-0.332
Token for
Token ID329
Feature activation+1.85e-01

Pos logit contributions
 insisting+0.254
 reaching+0.253
 resisting+0.252
 saving+0.243
 succeeding+0.233
Neg logit contributions
suit-0.810
issy-0.784
 folder-0.754
azz-0.749
acker-0.732
Token so
Token ID523
Feature activation+9.51e-02

Pos logit contributions
 reaching+0.200
 insisting+0.188
 resisting+0.185
 going+0.182
 saving+0.180
Neg logit contributions
suit-0.425
issy-0.415
acker-0.393
kin-0.390
azz-0.388
Token long
Token ID890
Feature activation+2.80e-02

Pos logit contributions
 saving+0.123
 reaching+0.123
 showing+0.123
 insisting+0.122
 teaching+0.121
Neg logit contributions
azz-0.170
igsaw-0.169
issy-0.166
uffy-0.164
suit-0.162
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 not+0.028
 saving+0.027
 reaching+0.027
 going+0.027
 helping+0.027
Neg logit contributions
suit-0.072
issy-0.069
azz-0.066
pp-0.065
coat-0.064
Token His
Token ID2399
Feature activation-1.34e-01

Pos logit contributions
suit+0.024
issy+0.023
azz+0.023
acker+0.022
cy+0.021
Neg logit contributions
 going-0.013
 reaching-0.013
 saving-0.013
 helping-0.013
 not-0.013
Token'm
Token ID1101
Feature activation+1.37e-01

Pos logit contributions
 not+0.031
 going+0.031
 helping+0.031
 reaching+0.030
 saving+0.030
Neg logit contributions
suit-0.046
issy-0.045
azz-0.042
acker-0.042
pp-0.042
Token sorry
Token ID7926
Feature activation+1.58e-01

Pos logit contributions
 insisting+0.146
 reaching+0.143
 resisting+0.127
 next+0.127
 succeeding+0.127
Neg logit contributions
issy-0.311
uffy-0.301
azz-0.295
ppy-0.291
suit-0.289
Token I
Token ID314
Feature activation+2.95e-02

Pos logit contributions
 going+0.178
 helping+0.175
 reaching+0.175
 not+0.174
 saving+0.174
Neg logit contributions
suit-0.397
issy-0.387
azz-0.371
acker-0.366
uffy-0.365
Token did
Token ID750
Feature activation+2.39e-01

Pos logit contributions
 going+0.046
 helping+0.046
 reaching+0.046
 not+0.045
 saving+0.045
Neg logit contributions
suit-0.061
issy-0.060
azz-0.057
acker-0.056
pp-0.056
Token not
Token ID407
Feature activation+1.87e-01

Pos logit contributions
 insisting+0.274
 reaching+0.274
 resisting+0.261
 saving+0.249
 giving+0.248
Neg logit contributions
hell-0.506
 folder-0.497
issy-0.497
uffy-0.487
affe-0.485
Token want
Token ID765
Feature activation+3.10e-01

Pos logit contributions
 going+0.263
 helping+0.260
 reaching+0.260
 not+0.259
 saving+0.257
Neg logit contributions
suit-0.417
issy-0.408
azz-0.388
acker-0.384
pp-0.381
Token to
Token ID284
Feature activation+1.46e-01

Pos logit contributions
 insisting+0.210
 reaching+0.209
 succeeding+0.207
 resisting+0.202
 saving+0.198
Neg logit contributions
issy-0.851
suit-0.846
azz-0.830
 folder-0.817
acker-0.804
Token play
Token ID711
Feature activation+1.87e-01

Pos logit contributions
 insisting+0.195
 reaching+0.193
 saving+0.193
 helping+0.192
 going+0.191
Neg logit contributions
suit-0.320
issy-0.320
azz-0.308
uffy-0.297
acker-0.297
Token with
Token ID351
Feature activation+1.80e-01

Pos logit contributions
 not+0.152
 going+0.152
 helping+0.151
 reaching+0.147
 saving+0.146
Neg logit contributions
suit-0.534
issy-0.524
azz-0.503
acker-0.499
pp-0.498
Token you
Token ID345
Feature activation-4.83e-02

Pos logit contributions
 not+0.175
 going+0.173
 helping+0.171
 saving+0.168
 reaching+0.168
Neg logit contributions
suit-0.492
issy-0.478
azz-0.459
pp-0.458
acker-0.456
Token,"
Token ID553
Feature activation+3.94e-02

Pos logit contributions
suit+0.115
issy+0.110
azz+0.106
acker+0.106
pp+0.105
Neg logit contributions
 going-0.061
 helping-0.060
 saving-0.060
 reaching-0.059
 not-0.059
Token being
Token ID852
Feature activation+1.53e-01

Pos logit contributions
 insisting+0.191
 reaching+0.190
 saving+0.190
 deciding+0.181
 resisting+0.178
Neg logit contributions
suit-0.287
ppy-0.276
ety-0.274
y-0.267
coat-0.264
Token selfish
Token ID20363
Feature activation+1.08e-01

Pos logit contributions
 reaching+0.248
 insisting+0.239
 resisting+0.235
 saving+0.233
 going+0.231
Neg logit contributions
suit-0.291
issy-0.261
ppy-0.260
ocado-0.257
 folder-0.254
Token and
Token ID290
Feature activation+6.63e-02

Pos logit contributions
 reaching+0.117
 saving+0.113
 helping+0.110
 insisting+0.110
 resisting+0.108
Neg logit contributions
suit-0.280
issy-0.264
azz-0.258
y-0.255
acker-0.253
Token didn
Token ID1422
Feature activation+2.41e-01

Pos logit contributions
 reaching+0.105
 saving+0.103
 helping+0.101
 going+0.100
 insisting+0.100
Neg logit contributions
suit-0.133
issy-0.129
azz-0.121
pp-0.119
uffy-0.118
Token't
Token ID470
Feature activation+1.55e-01

Pos logit contributions
 saving+0.259
 helping+0.248
 reaching+0.247
 asking+0.240
 examining+0.234
Neg logit contributions
issy-0.604
azz-0.582
uffy-0.566
suit-0.560
acker-0.559
Token want
Token ID765
Feature activation+3.10e-01

Pos logit contributions
 saving+0.228
 going+0.227
 reaching+0.226
 helping+0.226
 not+0.224
Neg logit contributions
suit-0.332
issy-0.321
azz-0.308
acker-0.306
pp-0.303
Token to
Token ID284
Feature activation+1.42e-01

Pos logit contributions
 reaching+0.262
 insisting+0.257
 succeeding+0.256
 resisting+0.250
 believing+0.249
Neg logit contributions
issy-0.777
azz-0.767
 folder-0.766
suit-0.764
ety-0.758
Token share
Token ID2648
Feature activation+2.00e-01

Pos logit contributions
 saving+0.204
 insisting+0.198
 reaching+0.198
 going+0.196
 helping+0.196
Neg logit contributions
suit-0.298
issy-0.292
acker-0.284
azz-0.277
ppy-0.277
Token his
Token ID465
Feature activation-1.48e-01

Pos logit contributions
 going+0.148
 reaching+0.144
 helping+0.143
 saving+0.140
 not+0.139
Neg logit contributions
suit-0.579
issy-0.560
azz-0.544
acker-0.541
pp-0.539
Token jelly
Token ID27644
Feature activation-1.20e-01

Pos logit contributions
suit+0.248
issy+0.232
acker+0.218
azz+0.217
pp+0.215
Neg logit contributions
 going-0.294
 helping-0.290
 reaching-0.290
 not-0.289
 saving-0.288
Token.
Token ID13
Feature activation+7.34e-03

Pos logit contributions
suit+0.351
issy+0.346
azz+0.333
acker+0.329
pp+0.328
Neg logit contributions
 not-0.088
 going-0.086
 helping-0.086
 reaching-0.084
 saving-0.082
Token candy
Token ID18550
Feature activation-5.63e-02

Pos logit contributions
suit+0.159
issy+0.145
uffy+0.144
acker+0.143
 tant+0.141
Neg logit contributions
 saving-0.147
 going-0.146
 reaching-0.144
 resisting-0.142
 insisting-0.141
Token choice
Token ID3572
Feature activation+7.96e-02

Pos logit contributions
issy+0.162
suit+0.162
azz+0.156
acker+0.153
pp+0.153
Neg logit contributions
 not-0.046
 going-0.044
 helping-0.043
 reaching-0.041
 to-0.040
Token and
Token ID290
Feature activation+5.66e-02

Pos logit contributions
 reaching+0.081
 insisting+0.080
 saving+0.080
 helping+0.080
 going+0.080
Neg logit contributions
suit-0.201
issy-0.196
azz-0.190
pp-0.185
acker-0.183
Token did
Token ID750
Feature activation+2.37e-01

Pos logit contributions
 going+0.075
 not+0.075
 helping+0.074
 reaching+0.073
 saving+0.072
Neg logit contributions
suit-0.133
issy-0.129
azz-0.124
acker-0.123
pp-0.122
Token not
Token ID407
Feature activation+1.85e-01

Pos logit contributions
 insisting+0.177
 saving+0.165
 helping+0.165
 reaching+0.164
 resisting+0.164
Neg logit contributions
suit-0.611
uffy-0.594
hell-0.589
 folder-0.588
acker-0.579
Token want
Token ID765
Feature activation+3.09e-01

Pos logit contributions
 saving+0.266
 helping+0.260
 reaching+0.259
 going+0.259
 insisting+0.256
Neg logit contributions
suit-0.404
issy-0.389
azz-0.375
acker-0.374
pp-0.368
Token to
Token ID284
Feature activation+1.43e-01

Pos logit contributions
 saving+0.294
 reaching+0.285
 resisting+0.284
 succeeding+0.281
 insisting+0.280
Neg logit contributions
suit-0.754
ety-0.739
issy-0.729
 tant-0.729
 folder-0.727
Token share
Token ID2648
Feature activation+2.00e-01

Pos logit contributions
 saving+0.221
 insisting+0.210
 resisting+0.208
 helping+0.206
 reaching+0.205
Neg logit contributions
suit-0.281
issy-0.274
acker-0.272
cy-0.262
ety-0.262
Token her
Token ID607
Feature activation-1.42e-01

Pos logit contributions
 going+0.158
 helping+0.155
 reaching+0.154
 not+0.153
 saving+0.151
Neg logit contributions
suit-0.570
issy-0.554
azz-0.535
acker-0.532
pp-0.531
Token mint
Token ID21043
Feature activation-5.75e-02

Pos logit contributions
suit+0.228
issy+0.210
acker+0.199
pp+0.199
uffy+0.196
Neg logit contributions
 going-0.288
 reaching-0.285
 helping-0.284
 saving-0.283
 not-0.283
Token candy
Token ID18550
Feature activation-3.78e-02

Pos logit contributions
issy+0.153
suit+0.152
azz+0.146
acker+0.142
iet+0.140
Neg logit contributions
 not-0.058
 helping-0.056
 going-0.055
 reaching-0.055
 having-0.054
Token a
Token ID257
Feature activation-9.61e-02

Pos logit contributions
 insisting+0.088
 reaching+0.088
 resisting+0.085
 helping+0.083
 saving+0.083
Neg logit contributions
suit-0.244
issy-0.234
azz-0.228
pp-0.223
 folder-0.222
Token book
Token ID1492
Feature activation-1.02e-01

Pos logit contributions
issy+0.212
pp+0.204
suit+0.202
azz+0.201
ppy+0.197
Neg logit contributions
 helping-0.151
 not-0.148
 going-0.146
 giving-0.141
 to-0.140
Token he
Token ID339
Feature activation-1.18e-01

Pos logit contributions
suit+0.262
issy+0.259
acker+0.249
pp+0.249
azz+0.247
Neg logit contributions
 going-0.114
 not-0.113
 helping-0.108
 reaching-0.108
 saving-0.106
Token had
Token ID550
Feature activation+1.24e-01

Pos logit contributions
suit+0.223
issy+0.217
azz+0.202
acker+0.199
pp+0.196
Neg logit contributions
 reaching-0.204
 helping-0.203
 going-0.201
 saving-0.201
 not-0.199
Token been
Token ID587
Feature activation+1.48e-01

Pos logit contributions
 insisting+0.176
 helping+0.174
 resisting+0.174
 succeeding+0.169
 encouraging+0.169
Neg logit contributions
issy-0.252
suit-0.244
azz-0.228
pp-0.227
acker-0.227
Token wanting
Token ID10291
Feature activation+3.09e-01

Pos logit contributions
 Listen+0.145
 reaching+0.140
 himself+0.135
 insisting+0.134
 succeeding+0.134
Neg logit contributions
suit-0.363
issy-0.347
pp-0.343
coat-0.334
ppy-0.332
Token to
Token ID284
Feature activation+1.34e-01

Pos logit contributions
 insisting+0.225
 reaching+0.223
 resisting+0.218
 succeeding+0.209
 saving+0.202
Neg logit contributions
suit-0.846
issy-0.823
azz-0.793
 folder-0.787
acker-0.781
Token read
Token ID1100
Feature activation+2.31e-01

Pos logit contributions
 saving+0.174
 going+0.174
 insisting+0.173
 helping+0.173
 reaching+0.172
Neg logit contributions
issy-0.300
suit-0.300
azz-0.281
acker-0.281
pp-0.276
Token.
Token ID13
Feature activation-2.09e-03

Pos logit contributions
 going+0.173
 not+0.170
 helping+0.167
 reaching+0.162
 saving+0.161
Neg logit contributions
suit-0.673
issy-0.660
azz-0.634
acker-0.631
pp-0.630
Token The
Token ID383
Feature activation-1.55e-01

Pos logit contributions
suit+0.004
issy+0.004
azz+0.004
acker+0.004
ocado+0.004
Neg logit contributions
 going-0.004
 reaching-0.004
 saving-0.004
 helping-0.004
 insisting-0.003
Token man
Token ID582
Feature activation-1.79e-01

Pos logit contributions
suit+0.274
issy+0.263
azz+0.248
acker+0.239
 folder+0.239
Neg logit contributions
 reaching-0.285
 helping-0.281
 saving-0.281
 resisting-0.281
 insisting-0.281
Token'm
Token ID1101
Feature activation+1.42e-01

Pos logit contributions
 reaching+0.042
 going+0.041
 helping+0.041
 insisting+0.041
 saving+0.041
Neg logit contributions
suit-0.051
issy-0.049
acker-0.046
azz-0.046
uffy-0.046
Token sorry
Token ID7926
Feature activation+1.60e-01

Pos logit contributions
 insisting+0.151
 reaching+0.151
 succeeding+0.141
 asking+0.139
 resisting+0.137
Neg logit contributions
issy-0.325
uffy-0.322
ppy-0.312
suit-0.311
kin-0.304
Token I
Token ID314
Feature activation+3.13e-02

Pos logit contributions
 going+0.179
 reaching+0.177
 helping+0.176
 saving+0.176
 not+0.174
Neg logit contributions
suit-0.404
issy-0.394
azz-0.379
uffy-0.374
acker-0.372
Token did
Token ID750
Feature activation+2.39e-01

Pos logit contributions
 reaching+0.050
 going+0.050
 helping+0.049
 saving+0.049
 not+0.048
Neg logit contributions
suit-0.064
issy-0.063
azz-0.059
acker-0.059
uffy-0.058
Token not
Token ID407
Feature activation+1.86e-01

Pos logit contributions
 reaching+0.212
 insisting+0.205
 resisting+0.190
 asking+0.186
 giving+0.183
Neg logit contributions
uffy-0.574
 folder-0.569
issy-0.568
suit-0.567
hell-0.563
Token want
Token ID765
Feature activation+3.09e-01

Pos logit contributions
 going+0.260
 helping+0.257
 not+0.257
 reaching+0.256
 saving+0.253
Neg logit contributions
suit-0.420
issy-0.409
azz-0.389
acker-0.385
pp-0.383
Token to
Token ID284
Feature activation+1.45e-01

Pos logit contributions
 succeeding+0.253
 insisting+0.251
 reaching+0.246
 resisting+0.240
 encouraging+0.235
Neg logit contributions
suit-0.802
issy-0.801
azz-0.787
 folder-0.784
ppy-0.767
Token count
Token ID954
Feature activation+1.94e-01

Pos logit contributions
 reaching+0.206
 insisting+0.206
 saving+0.203
 going+0.202
 helping+0.202
Neg logit contributions
suit-0.306
issy-0.304
azz-0.290
uffy-0.284
acker-0.280
Token with
Token ID351
Feature activation+1.71e-01

Pos logit contributions
 helping+0.172
 going+0.171
 reaching+0.171
 not+0.168
 saving+0.167
Neg logit contributions
suit-0.521
issy-0.506
azz-0.493
uffy-0.482
acker-0.480
Token you
Token ID345
Feature activation-5.36e-02

Pos logit contributions
 going+0.180
 reaching+0.179
 helping+0.177
 not+0.174
 saving+0.173
Neg logit contributions
suit-0.436
issy-0.426
azz-0.408
acker-0.406
uffy-0.402
Token,"
Token ID553
Feature activation+4.12e-02

Pos logit contributions
suit+0.129
issy+0.122
acker+0.120
azz+0.119
uffy+0.118
Neg logit contributions
 helping-0.063
 going-0.062
 saving-0.062
 reaching-0.062
 insisting-0.061
Token I
Token ID314
Feature activation+3.44e-04

Pos logit contributions
 going+0.048
 not+0.047
 helping+0.047
 reaching+0.045
 saving+0.045
Neg logit contributions
suit-0.100
issy-0.097
azz-0.093
acker-0.093
pp-0.093
Token'm
Token ID1101
Feature activation+1.41e-01

Pos logit contributions
 not+0.001
 going+0.000
 helping+0.000
 having+0.000
 saving+0.000
Neg logit contributions
suit-0.001
issy-0.001
azz-0.001
pp-0.001
acker-0.001
Token sorry
Token ID7926
Feature activation+1.57e-01

Pos logit contributions
 himself+0.130
 reaching+0.124
 insisting+0.119
 saving+0.117
 advice+0.117
Neg logit contributions
issy-0.309
uffy-0.304
kin-0.301
suit-0.296
coat-0.295
Token for
Token ID329
Feature activation+1.64e-01

Pos logit contributions
 saving+0.184
 reaching+0.184
 insisting+0.179
 going+0.177
 helping+0.176
Neg logit contributions
suit-0.380
issy-0.372
azz-0.359
uffy-0.356
acker-0.347
Token not
Token ID407
Feature activation+1.57e-01

Pos logit contributions
 fascinated+0.171
 inspired+0.171
 Then+0.165
 reaching+0.163
 tempted+0.162
Neg logit contributions
 skirt-0.325
 sweater-0.320
uffy-0.316
ppy-0.309
 folder-0.306
Token wanting
Token ID10291
Feature activation+3.09e-01

Pos logit contributions
 reaching+0.124
 saving+0.113
 insisting+0.111
 Then+0.110
 succeeding+0.107
Neg logit contributions
suit-0.404
issy-0.387
coat-0.384
azz-0.382
ety-0.382
Token to
Token ID284
Feature activation+1.34e-01

Pos logit contributions
 reaching+0.258
 succeeding+0.251
 saving+0.249
 insisting+0.247
 resisting+0.242
Neg logit contributions
suit-0.774
 skirt-0.772
 folder-0.758
ppy-0.745
 sweater-0.743
Token take
Token ID1011
Feature activation+2.33e-01

Pos logit contributions
 saving+0.242
 reaching+0.240
 insisting+0.234
 succeeding+0.233
 resisting+0.232
Neg logit contributions
suit-0.229
issy-0.227
uffy-0.212
azz-0.212
ppy-0.210
Token a
Token ID257
Feature activation-4.44e-02

Pos logit contributions
 reaching+0.284
 saving+0.271
 helping+0.269
 resisting+0.266
 going+0.265
Neg logit contributions
suit-0.551
issy-0.535
azz-0.511
acker-0.506
uffy-0.504
Token bath
Token ID7837
Feature activation-4.67e-02

Pos logit contributions
suit+0.066
issy+0.065
azz+0.060
pp+0.059
acker+0.059
Neg logit contributions
 going-0.095
 helping-0.095
 not-0.095
 reaching-0.093
 saving-0.092
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
suit+0.125
issy+0.124
azz+0.119
acker+0.117
pp+0.117
Neg logit contributions
 going-0.044
 not-0.044
 helping-0.044
 reaching-0.043
 saving-0.043
Token the
Token ID262
Feature activation-2.17e-01

Pos logit contributions
 saving+0.088
 not+0.088
 reaching+0.087
 going+0.087
 insisting+0.086
Neg logit contributions
suit-0.239
issy-0.237
 folder-0.230
azz-0.225
 tant-0.224
Token house
Token ID2156
Feature activation-5.25e-02

Pos logit contributions
suit+0.422
issy+0.406
azz+0.383
acker+0.380
pp+0.375
Neg logit contributions
 going-0.370
 not-0.367
 helping-0.366
 reaching-0.366
 saving-0.364
Token and
Token ID290
Feature activation+4.93e-02

Pos logit contributions
suit+0.132
issy+0.126
pp+0.121
coat+0.121
azz+0.120
Neg logit contributions
 insisting-0.053
 saving-0.051
 reaching-0.051
 going-0.051
 resisting-0.050
Token didn
Token ID1422
Feature activation+2.36e-01

Pos logit contributions
 reaching+0.040
 warning+0.039
 himself+0.038
 saving+0.037
 insisting+0.037
Neg logit contributions
 skirt-0.115
suit-0.114
issy-0.114
 folder-0.113
ems-0.111
Token't
Token ID470
Feature activation+1.51e-01

Pos logit contributions
 saving+0.261
 asking+0.250
 helping+0.247
 insisting+0.237
 reaching+0.232
Neg logit contributions
issy-0.595
azz-0.580
suit-0.553
acker-0.552
uffy-0.546
Token want
Token ID765
Feature activation+3.09e-01

Pos logit contributions
 going+0.212
 not+0.210
 helping+0.210
 reaching+0.208
 saving+0.206
Neg logit contributions
suit-0.342
issy-0.332
azz-0.316
acker-0.313
pp-0.312
Token to
Token ID284
Feature activation+1.41e-01

Pos logit contributions
 saving+0.237
 insisting+0.234
 resisting+0.214
 reaching+0.207
 succeeding+0.204
Neg logit contributions
issy-0.822
azz-0.812
 folder-0.797
suit-0.793
pillar-0.781
Token share
Token ID2648
Feature activation+1.98e-01

Pos logit contributions
 saving+0.203
 helping+0.194
 insisting+0.194
 reaching+0.193
 going+0.192
Neg logit contributions
suit-0.302
issy-0.296
azz-0.284
acker-0.281
ppy-0.277
Token it
Token ID340
Feature activation-5.51e-02

Pos logit contributions
 going+0.156
 not+0.156
 helping+0.153
 reaching+0.151
 saving+0.148
Neg logit contributions
suit-0.571
issy-0.559
azz-0.536
acker-0.532
pp-0.531
Token with
Token ID351
Feature activation+1.77e-01

Pos logit contributions
suit+0.150
issy+0.144
acker+0.139
azz+0.139
pp+0.139
Neg logit contributions
 going-0.047
 saving-0.046
 insisting-0.046
 reaching-0.046
 helping-0.044
Token anyone
Token ID2687
Feature activation-7.49e-02

Pos logit contributions
 going+0.186
 helping+0.182
 reaching+0.182
 not+0.181
 saving+0.179
Neg logit contributions
suit-0.458
issy-0.449
azz-0.429
acker-0.426
pp-0.423
Token you
Token ID345
Feature activation-6.81e-02

Pos logit contributions
 reaching+0.107
 insisting+0.102
 saving+0.101
 resisting+0.099
 going+0.099
Neg logit contributions
issy-0.167
suit-0.164
uffy-0.158
azz-0.157
ems-0.153
Token have
Token ID423
Feature activation+1.57e-01

Pos logit contributions
suit+0.147
issy+0.142
azz+0.135
acker+0.134
pp+0.133
Neg logit contributions
 going-0.101
 helping-0.101
 reaching-0.100
 not-0.100
 saving-0.099
Token and
Token ID290
Feature activation+7.32e-02

Pos logit contributions
 reaching+0.209
 insisting+0.202
 succeeding+0.193
 resisting+0.192
 climbing+0.187
Neg logit contributions
suit-0.338
pp-0.336
ems-0.328
issy-0.321
azz-0.315
Token not
Token ID407
Feature activation+1.51e-01

Pos logit contributions
 reaching+0.065
 going+0.062
 insisting+0.061
 helping+0.061
 saving+0.060
Neg logit contributions
suit-0.193
issy-0.191
azz-0.181
pp-0.181
uffy-0.179
Token always
Token ID1464
Feature activation+1.75e-01

Pos logit contributions
 reaching+0.194
 saving+0.192
 helping+0.190
 going+0.190
 insisting+0.188
Neg logit contributions
suit-0.340
issy-0.330
azz-0.316
acker-0.314
pp-0.313
Token want
Token ID765
Feature activation+3.09e-01

Pos logit contributions
 reaching+0.235
 saving+0.226
 insisting+0.224
 helping+0.224
 going+0.220
Neg logit contributions
suit-0.369
issy-0.348
azz-0.341
pp-0.341
acker-0.338
Token more
Token ID517
Feature activation+1.09e-01

Pos logit contributions
 reaching+0.253
 succeeding+0.249
 insisting+0.247
 resisting+0.236
 saving+0.233
Neg logit contributions
issy-0.814
suit-0.790
azz-0.779
 folder-0.773
acker-0.756
Token.
Token ID13
Feature activation+2.46e-02

Pos logit contributions
 reaching+0.170
 not+0.168
 insisting+0.168
 saving+0.166
 resisting+0.164
Neg logit contributions
suit-0.206
issy-0.193
 tant-0.191
azz-0.190
coat-0.183
Token Come
Token ID7911
Feature activation+1.22e-01

Pos logit contributions
 going+0.032
 helping+0.032
 reaching+0.032
 not+0.032
 saving+0.031
Neg logit contributions
suit-0.057
issy-0.056
azz-0.053
acker-0.053
pp-0.052
Token on
Token ID319
Feature activation+8.59e-02

Pos logit contributions
 reaching+0.110
 going+0.105
 resisting+0.105
 insisting+0.104
 helping+0.103
Neg logit contributions
suit-0.334
issy-0.316
uffy-0.310
azz-0.309
cy-0.305
Token,
Token ID11
Feature activation+5.86e-02

Pos logit contributions
 reaching+0.113
 going+0.110
 saving+0.109
 insisting+0.107
 succeeding+0.106
Neg logit contributions
suit-0.193
issy-0.193
uffy-0.187
azz-0.179
ppy-0.176
Token chocolate
Token ID11311
Feature activation-2.37e-02

Pos logit contributions
 going+0.206
 reaching+0.206
 helping+0.205
 not+0.202
 saving+0.201
Neg logit contributions
suit-0.569
issy-0.552
acker-0.530
pp-0.529
azz-0.528
Token and
Token ID290
Feature activation+4.93e-02

Pos logit contributions
suit+0.062
issy+0.060
azz+0.057
pp+0.057
acker+0.057
Neg logit contributions
 going-0.024
 helping-0.024
 not-0.024
 reaching-0.024
 saving-0.024
Token we
Token ID356
Feature activation-7.41e-03

Pos logit contributions
 going+0.083
 not+0.083
 helping+0.083
 reaching+0.082
 saving+0.082
Neg logit contributions
suit-0.096
issy-0.093
azz-0.088
acker-0.087
pp-0.087
Token didn
Token ID1422
Feature activation+2.45e-01

Pos logit contributions
suit+0.016
issy+0.016
azz+0.015
pp+0.015
acker+0.015
Neg logit contributions
 not-0.011
 going-0.011
 helping-0.011
 having-0.011
 reaching-0.010
Token't
Token ID470
Feature activation+1.55e-01

Pos logit contributions
 saving+0.252
 helping+0.242
 reaching+0.240
 succeeding+0.236
 asking+0.236
Neg logit contributions
issy-0.624
azz-0.601
suit-0.585
uffy-0.584
acker-0.570
Token want
Token ID765
Feature activation+3.09e-01

Pos logit contributions
 going+0.214
 not+0.213
 helping+0.210
 reaching+0.207
 having+0.203
Neg logit contributions
suit-0.363
issy-0.349
azz-0.333
pp-0.330
uffy-0.329
Token to
Token ID284
Feature activation+1.42e-01

Pos logit contributions
 succeeding+0.237
 resisting+0.223
 reaching+0.220
 saving+0.217
 believing+0.215
Neg logit contributions
suit-0.835
issy-0.834
acker-0.798
azz-0.794
 folder-0.792
Token sleep
Token ID3993
Feature activation+1.50e-01

Pos logit contributions
 saving+0.216
 insisting+0.211
 resisting+0.207
 succeeding+0.207
 helping+0.206
Neg logit contributions
suit-0.278
issy-0.274
acker-0.266
cy-0.261
uffy-0.260
Token."
Token ID526
Feature activation+5.88e-03

Pos logit contributions
 reaching+0.158
 saving+0.150
 going+0.149
 helping+0.149
 resisting+0.148
Neg logit contributions
suit-0.389
issy-0.361
kin-0.352
uffy-0.350
ppy-0.348
Token\n
Token ID198
Feature activation-3.49e-02

Pos logit contributions
 going+0.006
 reaching+0.006
 saving+0.006
 succeeding+0.006
 helping+0.006
Neg logit contributions
suit-0.015
issy-0.015
azz-0.015
uffy-0.015
acker-0.014
Token\n
Token ID198
Feature activation-4.06e-02

Pos logit contributions
issy+0.081
azz+0.079
uffy+0.078
suit+0.078
ara+0.074
Neg logit contributions
 reaching-0.047
 going-0.046
 succeeding-0.045
 resisting-0.043
 saving-0.042
Token being
Token ID852
Feature activation+1.51e-01

Pos logit contributions
 insisting+0.163
 saving+0.162
 reaching+0.154
 succeeding+0.153
 helping+0.151
Neg logit contributions
suit-0.342
ppy-0.324
y-0.315
ety-0.311
issy-0.310
Token selfish
Token ID20363
Feature activation+1.04e-01

Pos logit contributions
 reaching+0.203
 insisting+0.200
 going+0.197
 saving+0.194
 resisting+0.193
Neg logit contributions
suit-0.335
issy-0.308
ppy-0.306
azz-0.300
coat-0.297
Token and
Token ID290
Feature activation+6.54e-02

Pos logit contributions
 reaching+0.108
 saving+0.105
 insisting+0.103
 helping+0.102
 going+0.102
Neg logit contributions
suit-0.274
issy-0.258
azz-0.252
y-0.248
ppy-0.247
Token didn
Token ID1422
Feature activation+2.39e-01

Pos logit contributions
 reaching+0.094
 saving+0.092
 insisting+0.090
 helping+0.090
 resisting+0.088
Neg logit contributions
suit-0.138
issy-0.131
azz-0.125
ppy-0.125
pp-0.125
Token't
Token ID470
Feature activation+1.53e-01

Pos logit contributions
 saving+0.255
 helping+0.246
 asking+0.240
 reaching+0.237
 succeeding+0.235
Neg logit contributions
issy-0.599
azz-0.576
uffy-0.566
suit-0.564
acker-0.559
Token want
Token ID765
Feature activation+3.08e-01

Pos logit contributions
 going+0.214
 saving+0.213
 helping+0.212
 reaching+0.211
 not+0.210
Neg logit contributions
suit-0.340
issy-0.330
azz-0.316
acker-0.314
pp-0.311
Token to
Token ID284
Feature activation+1.40e-01

Pos logit contributions
 succeeding+0.247
 reaching+0.244
 insisting+0.243
 resisting+0.241
 saving+0.221
Neg logit contributions
suit-0.795
ety-0.776
azz-0.776
issy-0.766
 tant-0.751
Token share
Token ID2648
Feature activation+1.99e-01

Pos logit contributions
 saving+0.208
 insisting+0.205
 reaching+0.200
 resisting+0.199
 going+0.199
Neg logit contributions
suit-0.285
issy-0.279
acker-0.270
azz-0.270
ppy-0.269
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
 going+0.158
 reaching+0.155
 helping+0.152
 saving+0.149
 insisting+0.148
Neg logit contributions
suit-0.563
issy-0.538
azz-0.527
acker-0.523
pp-0.521
Token spring
Token ID6076
Feature activation+3.17e-02

Pos logit contributions
suit+0.319
issy+0.304
azz+0.288
acker+0.287
pp+0.285
Neg logit contributions
 going-0.254
 helping-0.252
 not-0.252
 reaching-0.250
 saving-0.250
Token with
Token ID351
Feature activation+1.61e-01

Pos logit contributions
 not+0.022
 going+0.022
 helping+0.021
 reaching+0.021
 having+0.020
Neg logit contributions
issy-0.093
suit-0.093
azz-0.089
acker-0.089
pp-0.088
Token toy
Token ID13373
Feature activation-1.34e-01

Pos logit contributions
issy+0.114
suit+0.113
azz+0.108
acker+0.108
ppy+0.107
Neg logit contributions
 helping-0.149
 going-0.149
 not-0.149
 to-0.146
 saving-0.143
Token that
Token ID326
Feature activation-4.42e-02

Pos logit contributions
suit+0.316
issy+0.308
azz+0.294
acker+0.292
pp+0.290
Neg logit contributions
 going-0.175
 not-0.172
 helping-0.172
 reaching-0.170
 saving-0.169
Token he
Token ID339
Feature activation-1.31e-01

Pos logit contributions
suit+0.104
issy+0.102
azz+0.097
acker+0.096
pp+0.096
Neg logit contributions
 going-0.057
 not-0.057
 helping-0.056
 reaching-0.056
 saving-0.055
Token had
Token ID550
Feature activation+1.17e-01

Pos logit contributions
suit+0.279
issy+0.272
azz+0.260
pp+0.258
acker+0.257
Neg logit contributions
 not-0.199
 going-0.198
 helping-0.195
 reaching-0.192
 saving-0.191
Token been
Token ID587
Feature activation+1.50e-01

Pos logit contributions
 insisting+0.169
 resisting+0.163
 reaching+0.162
 helping+0.160
 encouraging+0.159
Neg logit contributions
suit-0.250
issy-0.241
azz-0.224
ppy-0.220
acker-0.220
Token wanting
Token ID10291
Feature activation+3.08e-01

Pos logit contributions
 reaching+0.155
 Listen+0.152
 insisting+0.150
 succeeding+0.145
 resisting+0.145
Neg logit contributions
suit-0.373
issy-0.347
ppy-0.341
pp-0.338
coat-0.334
Token for
Token ID329
Feature activation+1.81e-01

Pos logit contributions
 insisting+0.278
 reaching+0.277
 resisting+0.274
 succeeding+0.256
 saving+0.250
Neg logit contributions
suit-0.800
issy-0.766
 folder-0.740
azz-0.736
acker-0.718
Token a
Token ID257
Feature activation-8.10e-02

Pos logit contributions
 reaching+0.205
 insisting+0.198
 encouraging+0.195
 resisting+0.191
 teaching+0.180
Neg logit contributions
suit-0.402
issy-0.396
acker-0.373
ppy-0.368
kin-0.368
Token long
Token ID890
Feature activation-4.79e-02

Pos logit contributions
suit+0.163
issy+0.160
pp+0.152
azz+0.150
acker+0.148
Neg logit contributions
 helping-0.137
 going-0.136
 not-0.134
 saving-0.129
 reaching-0.129
Token time
Token ID640
Feature activation-1.25e-02

Pos logit contributions
pp+0.114
issy+0.111
suit+0.111
azz+0.109
acker+0.107
Neg logit contributions
 to-0.066
 not-0.066
 going-0.066
 next-0.064
 helping-0.064
Token.
Token ID13
Feature activation-4.31e-03

Pos logit contributions
suit+0.031
issy+0.029
azz+0.029
pp+0.028
ppy+0.028
Neg logit contributions
 reaching-0.013
 insisting-0.013
 not-0.013
 resisting-0.012
 saving-0.012
Token what
Token ID644
Feature activation+1.39e-02

Pos logit contributions
 going+0.024
 not+0.024
 helping+0.024
 reaching+0.024
 having+0.023
Neg logit contributions
suit-0.050
issy-0.049
acker-0.047
pp-0.047
azz-0.046
Token is
Token ID318
Feature activation+1.23e-01

Pos logit contributions
 going+0.020
 not+0.020
 helping+0.020
 reaching+0.020
 saving+0.019
Neg logit contributions
suit-0.032
issy-0.030
pp-0.029
azz-0.029
acker-0.029
Token that
Token ID326
Feature activation-3.01e-02

Pos logit contributions
 reaching+0.111
 insisting+0.108
 helping+0.107
 resisting+0.106
 saving+0.106
Neg logit contributions
suit-0.317
issy-0.304
uffy-0.292
azz-0.291
pp-0.288
Token?"
Token ID1701
Feature activation-8.79e-03

Pos logit contributions
suit+0.070
issy+0.068
azz+0.065
acker+0.064
pp+0.064
Neg logit contributions
 going-0.038
 helping-0.038
 reaching-0.038
 saving-0.038
 not-0.037
Token The
Token ID383
Feature activation-2.34e-01

Pos logit contributions
suit+0.021
issy+0.021
azz+0.020
uffy+0.020
acker+0.019
Neg logit contributions
 going-0.011
 saving-0.010
 helping-0.010
 reaching-0.010
 not-0.010
Token girl
Token ID2576
Feature activation-3.12e-01

Pos logit contributions
suit+0.429
issy+0.426
acker+0.405
pp+0.401
azz+0.401
Neg logit contributions
 helping-0.428
 going-0.427
 not-0.427
 next-0.422
 reaching-0.416
Token said
Token ID531
Feature activation+4.48e-02

Pos logit contributions
issy+0.567
suit+0.566
azz+0.539
uffy+0.526
ppy+0.525
Neg logit contributions
 saving-0.532
 helping-0.514
 going-0.507
 insisting-0.494
 resisting-0.493
Token,
Token ID11
Feature activation+3.91e-03

Pos logit contributions
 insisting+0.072
 resisting+0.071
 having+0.071
 going+0.070
 succeeding+0.069
Neg logit contributions
suit-0.077
y-0.075
kin-0.074
ppy-0.073
pp-0.073
Token "
Token ID366
Feature activation-2.53e-02

Pos logit contributions
 succeeding+0.004
 saving+0.004
 resisting+0.004
 himself+0.004
 insisting+0.004
Neg logit contributions
suit-0.008
pp-0.008
ppy-0.008
y-0.008
azz-0.008
TokenIt
Token ID1026
Feature activation-4.42e-02

Pos logit contributions
suit+0.057
issy+0.056
azz+0.053
acker+0.052
pp+0.052
Neg logit contributions
 going-0.036
 not-0.036
 helping-0.035
 reaching-0.035
 saving-0.035
Token's
Token ID338
Feature activation-6.56e-02

Pos logit contributions
suit+0.079
issy+0.078
azz+0.072
uffy+0.072
acker+0.072
Neg logit contributions
 reaching-0.079
 going-0.078
 helping-0.078
 insisting-0.078
 saving-0.078
Token bedroom
Token ID14043
Feature activation-8.91e-02

Pos logit contributions
suit+0.444
issy+0.438
azz+0.427
acker+0.414
pp+0.407
Neg logit contributions
 helping-0.388
 not-0.387
 going-0.386
 saving-0.378
 having-0.377
Token.
Token ID13
Feature activation-3.68e-02

Pos logit contributions
suit+0.229
issy+0.217
pp+0.208
acker+0.205
uffy+0.204
Neg logit contributions
 going-0.094
 insisting-0.093
 reaching-0.093
 helping-0.091
 saving-0.091
Token The
Token ID383
Feature activation-1.99e-01

Pos logit contributions
suit+0.070
issy+0.067
azz+0.062
uffy+0.062
aire+0.062
Neg logit contributions
 saving-0.060
 going-0.059
 helping-0.059
 resisting-0.059
 succeeding-0.058
Token next
Token ID1306
Feature activation-4.72e-02

Pos logit contributions
suit+0.419
issy+0.405
azz+0.399
pp+0.389
acker+0.387
Neg logit contributions
 going-0.325
 helping-0.323
 not-0.320
 next-0.318
 reaching-0.314
Token day
Token ID1110
Feature activation-9.02e-02

Pos logit contributions
suit+0.106
issy+0.103
pp+0.102
azz+0.101
acker+0.099
Neg logit contributions
 helping-0.069
 going-0.069
 next-0.068
 not-0.067
 to-0.067
Token Pen
Token ID7507
Feature activation-3.12e-01

Pos logit contributions
suit+0.218
issy+0.212
uffy+0.201
azz+0.200
acker+0.199
Neg logit contributions
 going-0.107
 helping-0.105
 not-0.105
 saving-0.105
 insisting-0.103
Tokenelope
Token ID47329
Feature activation-2.34e-01

Pos logit contributions
suit+0.451
issy+0.437
azz+0.422
pp+0.401
acker+0.398
Neg logit contributions
 not-0.693
 next-0.679
 going-0.678
 reaching-0.677
 saving-0.674
Token went
Token ID1816
Feature activation+1.55e-01

Pos logit contributions
suit+0.458
issy+0.451
azz+0.417
pp+0.414
uffy+0.412
Neg logit contributions
 reaching-0.385
 helping-0.384
 not-0.380
 saving-0.379
 going-0.375
Token outside
Token ID2354
Feature activation+7.37e-02

Pos logit contributions
 succeeding+0.131
 resisting+0.131
 reaching+0.131
 not+0.128
 insisting+0.127
Neg logit contributions
suit-0.416
issy-0.405
azz-0.390
 folder-0.389
coat-0.383
Token to
Token ID284
Feature activation+6.70e-02

Pos logit contributions
 not+0.086
 insisting+0.083
 succeeding+0.083
 saving+0.083
 reaching+0.082
Neg logit contributions
suit-0.169
issy-0.163
y-0.158
cy-0.157
azz-0.156
Token pick
Token ID2298
Feature activation+1.80e-01

Pos logit contributions
 reaching+0.096
 going+0.096
 not+0.095
 helping+0.095
 saving+0.095
Neg logit contributions
suit-0.144
issy-0.142
azz-0.133
acker-0.133
pp-0.132
Token,
Token ID11
Feature activation-2.44e-03

Pos logit contributions
 reaching+0.154
 saving+0.153
 helping+0.144
 insisting+0.142
 resisting+0.141
Neg logit contributions
suit-0.560
issy-0.551
azz-0.534
acker-0.528
 folder-0.516
Token Oak
Token ID9121
Feature activation-1.87e-01

Pos logit contributions
suit+0.006
issy+0.006
acker+0.006
azz+0.006
pp+0.006
Neg logit contributions
 going-0.003
 not-0.003
 helping-0.002
 reaching-0.002
 saving-0.002
Token Tree
Token ID12200
Feature activation-1.17e-01

Pos logit contributions
suit+0.422
issy+0.409
azz+0.393
acker+0.393
pp+0.380
Neg logit contributions
 going-0.261
 not-0.256
 helping-0.251
 reaching-0.250
 saving-0.249
Token!"
Token ID2474
Feature activation-1.24e-02

Pos logit contributions
suit+0.270
issy+0.261
azz+0.249
acker+0.248
pp+0.244
Neg logit contributions
 going-0.158
 not-0.156
 helping-0.156
 reaching-0.154
 saving-0.153
Token The
Token ID383
Feature activation-2.14e-01

Pos logit contributions
suit+0.030
issy+0.029
acker+0.028
uffy+0.027
pp+0.027
Neg logit contributions
 going-0.016
 helping-0.016
 not-0.015
 reaching-0.015
 having-0.015
Token girl
Token ID2576
Feature activation-3.10e-01

Pos logit contributions
suit+0.409
issy+0.386
azz+0.370
acker+0.365
pp+0.365
Neg logit contributions
 going-0.393
 helping-0.389
 not-0.380
 reaching-0.378
 next-0.375
Token put
Token ID1234
Feature activation+1.49e-01

Pos logit contributions
suit+0.605
issy+0.599
azz+0.563
pp+0.554
uffy+0.553
Neg logit contributions
 saving-0.498
 helping-0.490
 going-0.488
 insisting-0.481
 reaching-0.473
Token her
Token ID607
Feature activation-1.89e-01

Pos logit contributions
 going+0.086
 helping+0.083
 reaching+0.079
 saving+0.079
 not+0.077
Neg logit contributions
suit-0.469
issy-0.457
azz-0.447
acker-0.444
uffy-0.436
Token hands
Token ID2832
Feature activation-3.08e-02

Pos logit contributions
suit+0.342
issy+0.331
azz+0.311
acker+0.307
pp+0.305
Neg logit contributions
 going-0.350
 not-0.348
 helping-0.347
 reaching-0.345
 saving-0.343
Token out
Token ID503
Feature activation+2.40e-03

Pos logit contributions
suit+0.088
issy+0.085
azz+0.082
pp+0.082
acker+0.081
Neg logit contributions
 going-0.025
 not-0.024
 helping-0.024
 reaching-0.024
 saving-0.024
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
 going+0.003
 not+0.003
 saving+0.002
 reaching+0.002
 helping+0.002
Neg logit contributions
suit-0.006
issy-0.006
azz-0.006
pp-0.006
uffy-0.005
Token which
Token ID543
Feature activation-4.81e-02

Pos logit contributions
suit+0.363
issy+0.351
pp+0.338
azz+0.336
acker+0.334
Neg logit contributions
 going-0.105
 reaching-0.105
 not-0.104
 insisting-0.103
 saving-0.103
Token she
Token ID673
Feature activation-1.68e-01

Pos logit contributions
suit+0.114
issy+0.111
azz+0.107
acker+0.106
pp+0.105
Neg logit contributions
 not-0.064
 going-0.064
 helping-0.063
 reaching-0.061
 having-0.061
Token loved
Token ID6151
Feature activation+2.10e-01

Pos logit contributions
suit+0.364
issy+0.355
azz+0.338
acker+0.334
pp+0.332
Neg logit contributions
 going-0.248
 not-0.246
 helping-0.244
 reaching-0.241
 saving-0.240
Token.
Token ID13
Feature activation-3.33e-02

Pos logit contributions
 not+0.106
 reaching+0.105
 saving+0.104
 insisting+0.102
 going+0.100
Neg logit contributions
suit-0.641
issy-0.620
azz-0.600
pp-0.599
acker-0.593
Token Everywhere
Token ID48201
Feature activation-6.01e-02

Pos logit contributions
suit+0.081
issy+0.079
pp+0.075
azz+0.075
acker+0.074
Neg logit contributions
 going-0.041
 not-0.040
 helping-0.040
 reaching-0.040
 having-0.040
Token Daisy
Token ID40355
Feature activation-3.09e-01

Pos logit contributions
suit+0.163
issy+0.157
azz+0.153
acker+0.151
pp+0.150
Neg logit contributions
 going-0.064
 not-0.063
 helping-0.061
 reaching-0.060
 to-0.059
Token went
Token ID1816
Feature activation+8.60e-02

Pos logit contributions
acker+0.854
azz+0.851
igsaw+0.838
issy+0.838
suit+0.835
Neg logit contributions
 to-0.278
 going-0.275
 next-0.253
 not-0.246
 the-0.243
Token she
Token ID673
Feature activation-1.78e-01

Pos logit contributions
 going+0.093
 not+0.092
 helping+0.092
 reaching+0.091
 saving+0.090
Neg logit contributions
suit-0.222
issy-0.216
azz-0.207
acker-0.205
pp-0.205
Token took
Token ID1718
Feature activation+1.81e-01

Pos logit contributions
suit+0.394
issy+0.388
azz+0.374
acker+0.370
uffy+0.364
Neg logit contributions
 going-0.262
 not-0.252
 helping-0.250
 reaching-0.248
 having-0.245
Token her
Token ID607
Feature activation-1.65e-01

Pos logit contributions
 not+0.136
 going+0.136
 saving+0.134
 helping+0.134
 reaching+0.134
Neg logit contributions
suit-0.514
issy-0.503
azz-0.483
acker-0.479
pp-0.476
Token net
Token ID2010
Feature activation-1.02e-01

Pos logit contributions
suit+0.247
issy+0.234
azz+0.217
acker+0.214
pp+0.212
Neg logit contributions
 going-0.357
 helping-0.355
 not-0.354
 reaching-0.353
 saving-0.352
Token to
Token ID284
Feature activation+9.69e-02

Pos logit contributions
 succeeding+0.207
 resisting+0.205
 reaching+0.205
 saving+0.200
 encouraging+0.198
Neg logit contributions
suit-0.677
issy-0.653
ite-0.648
azz-0.637
coat-0.631
Token make
Token ID787
Feature activation+2.28e-01

Pos logit contributions
 saving+0.108
 reaching+0.108
 helping+0.108
 going+0.106
 insisting+0.105
Neg logit contributions
issy-0.231
suit-0.231
azz-0.219
acker-0.215
pp-0.214
Token music
Token ID2647
Feature activation+4.31e-02

Pos logit contributions
 reaching+0.177
 helping+0.174
 saving+0.173
 going+0.173
 not+0.170
Neg logit contributions
suit-0.634
issy-0.630
azz-0.601
uffy-0.594
acker-0.592
Token,"
Token ID553
Feature activation-7.73e-03

Pos logit contributions
 going+0.038
 reaching+0.037
 helping+0.037
 not+0.037
 saving+0.036
Neg logit contributions
suit-0.119
issy-0.116
azz-0.112
acker-0.110
pp-0.110
Token the
Token ID262
Feature activation-2.39e-01

Pos logit contributions
suit+0.019
issy+0.018
azz+0.018
acker+0.017
pp+0.017
Neg logit contributions
 going-0.009
 helping-0.009
 reaching-0.009
 not-0.009
 saving-0.009
Token girl
Token ID2576
Feature activation-3.07e-01

Pos logit contributions
suit+0.479
issy+0.464
acker+0.451
pp+0.445
azz+0.444
Neg logit contributions
 helping-0.402
 going-0.400
 not-0.399
 saving-0.387
 reaching-0.386
Token said
Token ID531
Feature activation+3.18e-02

Pos logit contributions
issy+0.590
suit+0.580
azz+0.546
uffy+0.533
ppy+0.528
Neg logit contributions
 saving-0.520
 helping-0.496
 going-0.491
 reaching-0.487
 resisting-0.481
Token.
Token ID13
Feature activation-4.78e-02

Pos logit contributions
 saving+0.042
 resisting+0.042
 going+0.042
 succeeding+0.041
 catching+0.041
Neg logit contributions
suit-0.065
y-0.061
issy-0.061
ppy-0.061
kin-0.060
Token "
Token ID366
Feature activation-4.67e-02

Pos logit contributions
suit+0.125
issy+0.122
azz+0.119
acker+0.115
uffy+0.115
Neg logit contributions
 going-0.049
 saving-0.048
 helping-0.047
 not-0.047
 reaching-0.047
TokenIt
Token ID1026
Feature activation-5.86e-02

Pos logit contributions
suit+0.112
issy+0.110
azz+0.104
pp+0.104
acker+0.104
Neg logit contributions
 not-0.058
 going-0.058
 helping-0.057
 reaching-0.057
 saving-0.056
Token will
Token ID481
Feature activation+1.46e-01

Pos logit contributions
suit+0.116
issy+0.112
azz+0.106
acker+0.105
pp+0.105
Neg logit contributions
 going-0.097
 not-0.097
 helping-0.097
 reaching-0.096
 saving-0.095
Token.
Token ID13
Feature activation-3.91e-02

Pos logit contributions
suit+0.241
issy+0.235
azz+0.226
acker+0.225
pp+0.224
Neg logit contributions
 going-0.074
 helping-0.072
 not-0.072
 reaching-0.071
 saving-0.070
Token One
Token ID1881
Feature activation-6.02e-02

Pos logit contributions
suit+0.102
issy+0.098
pp+0.095
acker+0.094
azz+0.093
Neg logit contributions
 going-0.041
 not-0.041
 helping-0.041
 having-0.040
 reaching-0.040
Token night
Token ID1755
Feature activation-7.30e-02

Pos logit contributions
suit+0.138
issy+0.132
pp+0.131
azz+0.130
acker+0.129
Neg logit contributions
 going-0.088
 not-0.087
 helping-0.086
 next-0.083
 reaching-0.083
Token,
Token ID11
Feature activation-8.14e-03

Pos logit contributions
suit+0.193
issy+0.189
azz+0.181
acker+0.180
pp+0.180
Neg logit contributions
 going-0.074
 not-0.073
 helping-0.073
 reaching-0.072
 saving-0.071
Token El
Token ID2574
Feature activation-2.20e-01

Pos logit contributions
suit+0.023
issy+0.022
acker+0.022
pp+0.022
azz+0.022
Neg logit contributions
 going-0.008
 not-0.008
 helping-0.007
 having-0.007
 reaching-0.007
Tokensie
Token ID44524
Feature activation-3.07e-01

Pos logit contributions
suit+0.270
issy+0.262
azz+0.253
acker+0.238
pp+0.234
Neg logit contributions
 going-0.527
 reaching-0.523
 not-0.518
 helping-0.513
 next-0.513
Token was
Token ID373
Feature activation+1.00e-01

Pos logit contributions
suit+0.697
issy+0.677
azz+0.645
acker+0.639
pp+0.637
Neg logit contributions
 going-0.422
 not-0.419
 helping-0.416
 reaching-0.413
 saving-0.409
Token alone
Token ID3436
Feature activation+6.35e-02

Pos logit contributions
 saving+0.092
 succeeding+0.091
 himself+0.091
 resisting+0.089
 reaching+0.086
Neg logit contributions
suit-0.228
issy-0.221
coat-0.219
uffy-0.213
ppy-0.213
Token in
Token ID287
Feature activation-7.07e-03

Pos logit contributions
 reaching+0.069
 saving+0.069
 insisting+0.068
 going+0.068
 not+0.067
Neg logit contributions
suit-0.156
issy-0.148
coat-0.141
azz-0.140
pp-0.140
Token her
Token ID607
Feature activation-2.37e-01

Pos logit contributions
suit+0.020
issy+0.020
azz+0.019
 folder+0.019
acker+0.019
Neg logit contributions
 reaching-0.005
 saving-0.004
 helping-0.004
 going-0.004
 not-0.004
Token room
Token ID2119
Feature activation-9.29e-02

Pos logit contributions
suit+0.427
issy+0.409
azz+0.383
acker+0.378
uffy+0.377
Neg logit contributions
 going-0.436
 reaching-0.433
 helping-0.433
 not-0.431
 saving-0.430
Token something
Token ID1223
Feature activation-7.37e-02

Pos logit contributions
suit+0.111
issy+0.106
azz+0.101
uffy+0.099
acker+0.098
Neg logit contributions
 going-0.132
 not-0.131
 helping-0.129
 reaching-0.129
 saving-0.127
Token bad
Token ID2089
Feature activation-2.36e-02

Pos logit contributions
suit+0.178
issy+0.171
azz+0.162
acker+0.162
pp+0.161
Neg logit contributions
 saving-0.088
 helping-0.086
 reaching-0.086
 going-0.086
 insisting-0.085
Token happened
Token ID3022
Feature activation+1.10e-01

Pos logit contributions
suit+0.060
issy+0.059
azz+0.056
acker+0.056
pp+0.055
Neg logit contributions
 going-0.026
 not-0.026
 helping-0.025
 reaching-0.025
 saving-0.025
Token.
Token ID13
Feature activation-4.18e-02

Pos logit contributions
 reaching+0.108
 saving+0.106
 helping+0.105
 going+0.103
 resisting+0.102
Neg logit contributions
suit-0.275
issy-0.258
azz-0.251
pp-0.250
y-0.248
Token The
Token ID383
Feature activation-2.21e-01

Pos logit contributions
suit+0.102
issy+0.100
azz+0.096
acker+0.094
pp+0.093
Neg logit contributions
 going-0.050
 helping-0.049
 reaching-0.049
 not-0.049
 saving-0.049
Token girl
Token ID2576
Feature activation-3.07e-01

Pos logit contributions
suit+0.412
issy+0.408
pp+0.393
azz+0.390
acker+0.386
Neg logit contributions
 not-0.400
 helping-0.399
 going-0.399
 next-0.393
 saving-0.387
Token got
Token ID1392
Feature activation+1.66e-01

Pos logit contributions
suit+0.667
issy+0.650
azz+0.614
acker+0.606
pp+0.602
Neg logit contributions
 going-0.446
 helping-0.442
 reaching-0.438
 saving-0.438
 not-0.433
Token lost
Token ID2626
Feature activation+9.95e-02

Pos logit contributions
 succeeding+0.203
 reaching+0.201
 seeing+0.199
 conquering+0.193
 resisting+0.189
Neg logit contributions
suit-0.352
cy-0.345
 tant-0.343
 folder-0.339
issy-0.338
Token.
Token ID13
Feature activation-3.15e-02

Pos logit contributions
 going+0.104
 not+0.103
 helping+0.103
 reaching+0.101
 saving+0.100
Neg logit contributions
suit-0.261
issy-0.257
azz-0.247
acker-0.245
pp-0.244
Token She
Token ID1375
Feature activation-1.46e-01

Pos logit contributions
suit+0.067
issy+0.067
azz+0.064
acker+0.062
uffy+0.062
Neg logit contributions
 going-0.047
 helping-0.046
 saving-0.046
 reaching-0.046
 not-0.045
Token was
Token ID373
Feature activation+1.17e-01

Pos logit contributions
suit+0.321
issy+0.312
azz+0.296
acker+0.292
pp+0.290
Neg logit contributions
 going-0.210
 helping-0.209
 reaching-0.208
 saving-0.206
 not-0.205
Token by
Token ID416
Feature activation-1.79e-03

Pos logit contributions
 saving+0.188
 going+0.186
 helping+0.185
 not+0.185
 reaching+0.184
Neg logit contributions
suit-0.514
issy-0.497
azz-0.483
pp-0.473
acker-0.471
Token this
Token ID428
Feature activation-1.76e-01

Pos logit contributions
suit+0.005
issy+0.005
azz+0.005
uffy+0.005
pp+0.005
Neg logit contributions
 going-0.001
 reaching-0.001
 helping-0.001
 not-0.001
 saving-0.001
Token special
Token ID2041
Feature activation-7.34e-02

Pos logit contributions
suit+0.445
issy+0.428
azz+0.410
acker+0.405
pp+0.404
Neg logit contributions
 reaching-0.188
 saving-0.187
 going-0.187
 helping-0.185
 not-0.182
Token lamp
Token ID20450
Feature activation-1.69e-01

Pos logit contributions
suit+0.139
issy+0.138
azz+0.131
acker+0.129
pp+0.128
Neg logit contributions
 going-0.129
 helping-0.129
 not-0.128
 reaching-0.126
 saving-0.125
Token that
Token ID326
Feature activation-7.11e-02

Pos logit contributions
suit+0.450
issy+0.440
azz+0.425
acker+0.422
pp+0.418
Neg logit contributions
 not-0.169
 going-0.169
 helping-0.167
 reaching-0.161
 saving-0.161
Token Pen
Token ID7507
Feature activation-3.06e-01

Pos logit contributions
suit+0.191
issy+0.186
azz+0.179
acker+0.178
pp+0.177
Neg logit contributions
 going-0.071
 not-0.070
 helping-0.069
 reaching-0.068
 saving-0.067
Tokenelope
Token ID47329
Feature activation-2.26e-01

Pos logit contributions
suit+0.484
issy+0.467
azz+0.446
pp+0.430
acker+0.428
Neg logit contributions
 not-0.632
 going-0.623
 helping-0.619
 having-0.619
 reaching-0.615
Token had
Token ID550
Feature activation+1.13e-01

Pos logit contributions
suit+0.466
issy+0.452
azz+0.426
acker+0.422
pp+0.421
Neg logit contributions
 going-0.352
 helping-0.350
 reaching-0.349
 not-0.348
 saving-0.345
Token!
Token ID0
Feature activation-1.45e-02

Pos logit contributions
 insisting+0.143
 reaching+0.142
 helping+0.140
 resisting+0.137
 saving+0.136
Neg logit contributions
suit-0.255
issy-0.252
azz-0.236
pp-0.232
acker-0.231
Token (space)
Token ID220
Feature activation-4.57e-02

Pos logit contributions
suit+0.038
issy+0.037
azz+0.035
acker+0.035
pp+0.035
Neg logit contributions
 going-0.015
 not-0.015
 helping-0.015
 reaching-0.015
 saving-0.015
Token (newline)
Token ID198
Feature activation-5.50e-02

Pos logit contributions
suit+0.129
issy+0.127
azz+0.122
acker+0.121
uffy+0.120
Neg logit contributions
 going-0.037
 helping-0.037
 reaching-0.036
 not-0.036
 saving-0.036
Token amazing
Token ID4998
Feature activation+8.21e-03

Pos logit contributions
 reaching+0.122
 going+0.119
 saving+0.117
 not+0.116
 helping+0.116
Neg logit contributions
suit-0.489
issy-0.482
azz-0.468
acker-0.459
pp-0.458
Token things
Token ID1243
Feature activation-7.16e-02

Pos logit contributions
 going+0.012
 not+0.012
 helping+0.012
 reaching+0.012
 saving+0.012
Neg logit contributions
suit-0.018
issy-0.017
azz-0.016
acker-0.016
pp-0.016
Token.
Token ID13
Feature activation-5.31e-02

Pos logit contributions
suit+0.186
issy+0.180
pp+0.173
azz+0.172
acker+0.170
Neg logit contributions
 reaching-0.069
 saving-0.068
 insisting-0.067
 helping-0.067
 not-0.067
Token The
Token ID383
Feature activation-2.21e-01

Pos logit contributions
suit+0.142
issy+0.137
pp+0.131
acker+0.131
azz+0.130
Neg logit contributions
 going-0.053
 not-0.052
 helping-0.052
 reaching-0.051
 having-0.051
Token little
Token ID1310
Feature activation-2.25e-01

Pos logit contributions
suit+0.448
issy+0.435
azz+0.419
acker+0.419
pp+0.418
Neg logit contributions
 going-0.370
 helping-0.369
 next-0.368
 not-0.364
 having-0.356
Token girl
Token ID2576
Feature activation-3.04e-01

Pos logit contributions
issy+0.348
acker+0.344
suit+0.340
azz+0.331
pp+0.323
Neg logit contributions
 next-0.503
 helping-0.488
 to-0.482
 going-0.479
 having-0.474
Token listened
Token ID16399
Feature activation+1.47e-01

Pos logit contributions
suit+0.648
issy+0.635
azz+0.599
pp+0.588
acker+0.587
Neg logit contributions
 going-0.441
 helping-0.441
 saving-0.441
 reaching-0.440
 not-0.427
Token to
Token ID284
Feature activation+6.23e-02

Pos logit contributions
 saving+0.174
 resisting+0.161
 succeeding+0.154
 insisting+0.152
 reaching+0.151
Neg logit contributions
suit-0.346
y-0.337
pp-0.318
azz-0.316
issy-0.315
Token his
Token ID465
Feature activation-1.96e-01

Pos logit contributions
 saving+0.060
 reaching+0.057
 insisting+0.055
 resisting+0.054
 helping+0.054
Neg logit contributions
issy-0.148
 folder-0.139
iet-0.139
 skirt-0.138
azz-0.138
Token stories
Token ID3923
Feature activation-3.20e-02

Pos logit contributions
suit+0.413
issy+0.394
uffy+0.385
 folder+0.378
azz+0.377
Neg logit contributions
 saving-0.288
 reaching-0.283
 going-0.281
 helping-0.276
 having-0.273
Token with
Token ID351
Feature activation+1.15e-01

Pos logit contributions
suit+0.083
issy+0.081
azz+0.078
acker+0.077
pp+0.077
Neg logit contributions
 going-0.033
 helping-0.032
 not-0.032
 reaching-0.032
 saving-0.032
Token a
Token ID257
Feature activation-1.52e-01

Pos logit contributions
 going+0.038
 not+0.037
 helping+0.036
 reaching+0.035
 saving+0.035
Neg logit contributions
suit-0.241
issy-0.236
azz-0.228
pp-0.227
acker-0.226
Token little
Token ID1310
Feature activation-1.95e-01

Pos logit contributions
issy+0.350
pp+0.338
azz+0.336
suit+0.336
acker+0.330
Neg logit contributions
 helping-0.231
 going-0.228
 not-0.225
 taking-0.218
 next-0.217
Token girl
Token ID2576
Feature activation-2.64e-01

Pos logit contributions
issy+0.321
acker+0.306
azz+0.304
abs+0.302
pp+0.284
Neg logit contributions
 helping-0.424
 next-0.419
 going-0.405
 trying-0.404
 to-0.401
Token.
Token ID13
Feature activation-3.84e-02

Pos logit contributions
suit+0.494
issy+0.483
azz+0.451
pp+0.451
uffy+0.441
Neg logit contributions
 reaching-0.450
 not-0.447
 saving-0.447
 going-0.446
 helping-0.443
Token The
Token ID383
Feature activation-2.13e-01

Pos logit contributions
suit+0.086
issy+0.084
azz+0.080
acker+0.079
pp+0.079
Neg logit contributions
 going-0.054
 not-0.054
 helping-0.054
 reaching-0.053
 saving-0.052
Token girl
Token ID2576
Feature activation-3.04e-01

Pos logit contributions
suit+0.368
issy+0.361
azz+0.357
acker+0.350
pp+0.347
Neg logit contributions
 next-0.424
 going-0.422
 helping-0.419
 not-0.413
 reaching-0.402
Token was
Token ID373
Feature activation+1.00e-01

Pos logit contributions
suit+0.690
issy+0.669
azz+0.639
acker+0.633
pp+0.630
Neg logit contributions
 going-0.420
 not-0.418
 helping-0.415
 reaching-0.411
 saving-0.406
Token very
Token ID845
Feature activation+8.33e-02

Pos logit contributions
 saving+0.093
 reaching+0.092
 insisting+0.089
 resisting+0.089
 himself+0.088
Neg logit contributions
suit-0.248
issy-0.245
uffy-0.233
coat-0.229
pp-0.229
Token scared
Token ID12008
Feature activation+1.57e-01

Pos logit contributions
 saving+0.106
 insisting+0.098
 reaching+0.098
 going+0.097
 resisting+0.096
Neg logit contributions
suit-0.196
uffy-0.182
issy-0.182
cy-0.179
ocado-0.176
Token,
Token ID11
Feature activation+2.93e-02

Pos logit contributions
 saving+0.181
 not+0.178
 reaching+0.176
 going+0.176
 helping+0.174
Neg logit contributions
suit-0.378
issy-0.371
azz-0.358
uffy-0.346
acker-0.345
Token because
Token ID780
Feature activation+3.13e-02

Pos logit contributions
 saving+0.038
 succeeding+0.037
 helping+0.037
 resisting+0.036
 teaching+0.036
Neg logit contributions
suit-0.065
issy-0.061
azz-0.059
coat-0.059
pp-0.057
Token sun
Token ID4252
Feature activation-1.51e-01

Pos logit contributions
suit+0.476
issy+0.465
pp+0.450
azz+0.449
acker+0.449
Neg logit contributions
 going-0.432
 helping-0.427
 not-0.424
 next-0.418
 saving-0.414
Token is
Token ID318
Feature activation+1.16e-01

Pos logit contributions
suit+0.320
issy+0.310
azz+0.293
pp+0.290
acker+0.289
Neg logit contributions
 going-0.228
 helping-0.227
 not-0.227
 reaching-0.225
 saving-0.224
Token strong
Token ID1913
Feature activation+7.20e-02

Pos logit contributions
 saving+0.127
 succeeding+0.123
 Listen+0.122
 himself+0.122
 resisting+0.121
Neg logit contributions
suit-0.258
issy-0.248
coat-0.245
y-0.242
iet-0.240
Token."
Token ID526
Feature activation-3.46e-02

Pos logit contributions
 going+0.078
 not+0.077
 helping+0.077
 reaching+0.076
 saving+0.076
Neg logit contributions
suit-0.184
issy-0.179
azz-0.172
acker-0.170
pp-0.170
Token The
Token ID383
Feature activation-2.20e-01

Pos logit contributions
suit+0.089
issy+0.087
azz+0.084
acker+0.083
pp+0.082
Neg logit contributions
 going-0.037
 helping-0.036
 not-0.036
 reaching-0.036
 saving-0.035
Token girl
Token ID2576
Feature activation-3.04e-01

Pos logit contributions
suit+0.442
issy+0.429
azz+0.413
acker+0.412
pp+0.411
Neg logit contributions
 going-0.375
 not-0.374
 helping-0.370
 next-0.364
 reaching-0.361
Token said
Token ID531
Feature activation+4.59e-02

Pos logit contributions
suit+0.590
issy+0.579
azz+0.541
uffy+0.541
pp+0.532
Neg logit contributions
 saving-0.494
 helping-0.489
 going-0.485
 reaching-0.476
 insisting-0.470
Token,
Token ID11
Feature activation+6.95e-03

Pos logit contributions
 insisting+0.072
 resisting+0.072
 having+0.070
 going+0.070
 saving+0.070
Neg logit contributions
suit-0.078
kin-0.077
y-0.072
uffy-0.071
ppy-0.071
Token "
Token ID366
Feature activation-2.65e-02

Pos logit contributions
 succeeding+0.008
 saving+0.007
 resisting+0.007
 insisting+0.007
 going+0.007
Neg logit contributions
suit-0.015
pp-0.014
coat-0.014
ppy-0.014
azz-0.014
TokenYes
Token ID5297
Feature activation+3.90e-02

Pos logit contributions
suit+0.064
issy+0.062
azz+0.059
acker+0.059
pp+0.059
Neg logit contributions
 not-0.034
 reaching-0.033
 going-0.033
 helping-0.033
 saving-0.032
Token mummy
Token ID38567
Feature activation-1.55e-01

Pos logit contributions
 going+0.040
 not+0.040
 helping+0.040
 reaching+0.039
 saving+0.038
Neg logit contributions
suit-0.103
issy-0.100
azz-0.096
acker-0.096
pp-0.096
Token time
Token ID640
Feature activation-2.58e-02

Pos logit contributions
issy+0.214
suit+0.211
azz+0.204
acker+0.202
pp+0.198
Neg logit contributions
 going-0.094
 helping-0.090
 not-0.089
 to-0.087
 reaching-0.087
Token together
Token ID1978
Feature activation+1.70e-02

Pos logit contributions
suit+0.067
issy+0.065
azz+0.063
pp+0.062
uffy+0.061
Neg logit contributions
 not-0.025
 helping-0.025
 reaching-0.025
 saving-0.024
 going-0.024
Token.
Token ID13
Feature activation-5.27e-02

Pos logit contributions
 not+0.017
 going+0.017
 reaching+0.017
 helping+0.017
 saving+0.017
Neg logit contributions
suit-0.044
issy-0.043
azz-0.041
acker-0.041
pp-0.041
Token The
Token ID383
Feature activation-2.22e-01

Pos logit contributions
suit+0.145
issy+0.139
pp+0.135
acker+0.133
uffy+0.132
Neg logit contributions
 not-0.049
 going-0.048
 helping-0.048
 having-0.048
 reaching-0.046
Token little
Token ID1310
Feature activation-2.29e-01

Pos logit contributions
suit+0.465
issy+0.452
azz+0.443
pp+0.441
acker+0.435
Neg logit contributions
 next-0.361
 going-0.358
 helping-0.355
 not-0.353
 taking-0.340
Token girl
Token ID2576
Feature activation-3.04e-01

Pos logit contributions
issy+0.367
acker+0.354
suit+0.353
azz+0.352
abs+0.341
Neg logit contributions
 next-0.506
 helping-0.488
 to-0.480
 going-0.479
 having-0.471
Token always
Token ID1464
Feature activation+7.97e-02

Pos logit contributions
suit+0.661
issy+0.646
azz+0.610
acker+0.602
pp+0.602
Neg logit contributions
 going-0.435
 reaching-0.432
 helping-0.432
 saving-0.428
 not-0.424
Token thought
Token ID1807
Feature activation+1.55e-01

Pos logit contributions
 reaching+0.120
 not+0.120
 going+0.119
 helping+0.119
 saving+0.118
Neg logit contributions
suit-0.167
issy-0.161
azz-0.152
pp-0.151
acker-0.150
Token it
Token ID340
Feature activation-1.07e-01

Pos logit contributions
 going+0.128
 saving+0.128
 reaching+0.128
 helping+0.127
 insisting+0.125
Neg logit contributions
suit-0.422
issy-0.419
azz-0.402
acker-0.392
pp-0.391
Token was
Token ID373
Feature activation+1.51e-01

Pos logit contributions
suit+0.242
issy+0.234
azz+0.225
acker+0.222
pp+0.222
Neg logit contributions
 going-0.151
 not-0.150
 helping-0.148
 reaching-0.146
 saving-0.144
Token the
Token ID262
Feature activation-1.69e-01

Pos logit contributions
 reaching+0.138
 saving+0.135
 helping+0.131
 insisting+0.131
 going+0.129
Neg logit contributions
suit-0.397
issy-0.392
azz-0.370
acker-0.368
pp-0.365
Token to
Token ID284
Feature activation+5.49e-02

Pos logit contributions
 reaching+0.050
 insisting+0.049
 saving+0.049
 helping+0.049
 resisting+0.049
Neg logit contributions
suit-0.086
issy-0.085
azz-0.080
acker-0.079
uffy-0.079
Token her
Token ID607
Feature activation-2.07e-01

Pos logit contributions
 going+0.061
 helping+0.060
 not+0.060
 reaching+0.059
 saving+0.059
Neg logit contributions
suit-0.139
issy-0.135
azz-0.130
acker-0.128
pp-0.128
Token and
Token ID290
Feature activation+1.09e-02

Pos logit contributions
suit+0.453
issy+0.439
azz+0.417
acker+0.412
pp+0.411
Neg logit contributions
 going-0.300
 helping-0.297
 reaching-0.296
 not-0.296
 saving-0.294
Token comfort
Token ID4467
Feature activation+8.58e-02

Pos logit contributions
 going+0.013
 helping+0.013
 not+0.013
 reaching+0.013
 saving+0.013
Neg logit contributions
suit-0.026
issy-0.025
azz-0.024
acker-0.024
pp-0.024
Tokened
Token ID276
Feature activation+9.36e-02

Pos logit contributions
 not+0.064
 going+0.062
 to+0.062
 helping+0.060
 himself+0.059
Neg logit contributions
suit-0.249
issy-0.244
acker-0.240
azz-0.236
pp-0.233
Token Pen
Token ID7507
Feature activation-3.03e-01

Pos logit contributions
 going+0.084
 reaching+0.082
 not+0.082
 helping+0.082
 saving+0.081
Neg logit contributions
suit-0.256
issy-0.249
azz-0.239
pp-0.236
acker-0.235
Tokenelope
Token ID47329
Feature activation-2.18e-01

Pos logit contributions
suit+0.550
issy+0.532
azz+0.501
acker+0.493
pp+0.490
Neg logit contributions
 going-0.554
 not-0.551
 helping-0.550
 reaching-0.547
 saving-0.543
Token.
Token ID13
Feature activation-4.72e-02

Pos logit contributions
suit+0.480
issy+0.452
uffy+0.436
ppy+0.436
pp+0.436
Neg logit contributions
 not-0.293
 reaching-0.293
 going-0.290
 succeeding-0.286
 saving-0.284
Token "
Token ID366
Feature activation-3.77e-02

Pos logit contributions
suit+0.115
issy+0.112
azz+0.108
acker+0.107
uffy+0.106
Neg logit contributions
 going-0.057
 helping-0.055
 not-0.055
 reaching-0.055
 saving-0.055
TokenI
Token ID40
Feature activation-5.52e-03

Pos logit contributions
suit+0.093
issy+0.089
uffy+0.085
azz+0.084
acker+0.083
Neg logit contributions
 not-0.047
 next-0.047
 reaching-0.046
 helping-0.046
 insisting-0.046
Token told
Token ID1297
Feature activation+1.83e-01

Pos logit contributions
suit+0.011
issy+0.011
azz+0.010
acker+0.010
pp+0.010
Neg logit contributions
 going-0.009
 helping-0.009
 reaching-0.009
 not-0.009
 saving-0.009
Token gray
Token ID12768
Feature activation-1.26e-01

Pos logit contributions
suit+0.480
issy+0.478
azz+0.464
pp+0.456
acker+0.444
Neg logit contributions
 going-0.449
 helping-0.442
 not-0.438
 next-0.429
 having-0.429
Token sky
Token ID6766
Feature activation-1.16e-01

Pos logit contributions
issy+0.251
azz+0.245
pp+0.240
acker+0.236
suit+0.233
Neg logit contributions
 to-0.226
 next-0.215
 helping-0.214
 going-0.211
 making-0.203
Token later
Token ID1568
Feature activation-2.08e-02

Pos logit contributions
suit+0.304
issy+0.291
pp+0.280
acker+0.278
azz+0.277
Neg logit contributions
 going-0.114
 saving-0.113
 helping-0.113
 insisting-0.113
 reaching-0.112
Token."
Token ID526
Feature activation-3.59e-02

Pos logit contributions
suit+0.054
issy+0.051
uffy+0.049
azz+0.049
pp+0.048
Neg logit contributions
 saving-0.021
 insisting-0.021
 helping-0.021
 reaching-0.021
 resisting-0.021
Token The
Token ID383
Feature activation-2.15e-01

Pos logit contributions
suit+0.095
issy+0.093
pp+0.089
acker+0.089
azz+0.088
Neg logit contributions
 not-0.036
 helping-0.036
 going-0.036
 reaching-0.035
 having-0.035
Token baby
Token ID5156
Feature activation-3.02e-01

Pos logit contributions
suit+0.430
issy+0.419
acker+0.409
pp+0.407
azz+0.399
Neg logit contributions
 helping-0.364
 going-0.363
 not-0.359
 next-0.358
 giving-0.348
Token looked
Token ID3114
Feature activation+1.57e-01

Pos logit contributions
suit+0.720
issy+0.688
acker+0.681
azz+0.668
pp+0.665
Neg logit contributions
 not-0.405
 helping-0.384
 going-0.383
 next-0.377
 having-0.375
Token up
Token ID510
Feature activation+7.31e-02

Pos logit contributions
 saving+0.155
 reaching+0.151
 helping+0.149
 not+0.147
 going+0.146
Neg logit contributions
suit-0.421
issy-0.394
azz-0.384
 tant-0.380
acker-0.378
Token and
Token ID290
Feature activation+3.10e-02

Pos logit contributions
 saving+0.115
 succeeding+0.109
 resisting+0.107
 helping+0.103
 having+0.103
Neg logit contributions
suit-0.152
y-0.144
cy-0.141
 tant-0.132
ara-0.132
Token said
Token ID531
Feature activation+7.11e-02

Pos logit contributions
 saving+0.051
 helping+0.051
 going+0.050
 reaching+0.050
 not+0.049
Neg logit contributions
suit-0.061
issy-0.058
azz-0.055
pp-0.055
cy-0.054
Token,
Token ID11
Feature activation+2.32e-02

Pos logit contributions
 insisting+0.107
 saving+0.106
 going+0.106
 resisting+0.105
 having+0.102
Neg logit contributions
suit-0.142
azz-0.128
uffy-0.128
kin-0.128
y-0.126
Token the
Token ID262
Feature activation-2.26e-01

Pos logit contributions
 not+0.123
 going+0.119
 having+0.113
 helping+0.113
 himself+0.111
Neg logit contributions
suit-0.631
issy-0.628
pp-0.605
acker-0.603
azz-0.602
Token thing
Token ID1517
Feature activation-1.44e-01

Pos logit contributions
issy+0.422
suit+0.417
azz+0.410
acker+0.399
pp+0.397
Neg logit contributions
 going-0.407
 helping-0.406
 not-0.404
 next-0.396
 saving-0.395
Token apart
Token ID5475
Feature activation-2.16e-02

Pos logit contributions
suit+0.411
issy+0.400
azz+0.385
acker+0.382
pp+0.380
Neg logit contributions
 going-0.112
 helping-0.110
 not-0.109
 reaching-0.108
 saving-0.108
Token.
Token ID13
Feature activation-4.86e-02

Pos logit contributions
suit+0.057
issy+0.055
azz+0.053
acker+0.052
pp+0.052
Neg logit contributions
 going-0.021
 not-0.021
 reaching-0.020
 helping-0.020
 saving-0.020
Token The
Token ID383
Feature activation-2.12e-01

Pos logit contributions
suit+0.121
issy+0.119
azz+0.114
acker+0.112
uffy+0.111
Neg logit contributions
 going-0.056
 helping-0.054
 reaching-0.054
 saving-0.054
 not-0.053
Token girl
Token ID2576
Feature activation-3.01e-01

Pos logit contributions
suit+0.392
issy+0.388
azz+0.370
pp+0.368
acker+0.364
Neg logit contributions
 helping-0.388
 going-0.385
 not-0.385
 reaching-0.378
 saving-0.378
Token explained
Token ID4893
Feature activation+1.12e-01

Pos logit contributions
suit+0.633
issy+0.624
azz+0.582
uffy+0.575
pp+0.568
Neg logit contributions
 going-0.450
 reaching-0.441
 helping-0.439
 saving-0.438
 insisting-0.425
Token to
Token ID284
Feature activation+4.94e-02

Pos logit contributions
 reaching+0.077
 having+0.074
 going+0.073
 saving+0.073
 insisting+0.069
Neg logit contributions
issy-0.306
azz-0.302
suit-0.299
 tant-0.293
affe-0.291
Token the
Token ID262
Feature activation-2.20e-01

Pos logit contributions
 going+0.040
 not+0.039
 helping+0.039
 reaching+0.038
 saving+0.038
Neg logit contributions
suit-0.143
issy-0.139
azz-0.134
acker-0.133
pp-0.133
Token kitten
Token ID42143
Feature activation-2.67e-01

Pos logit contributions
suit+0.383
issy+0.377
azz+0.354
pp+0.349
acker+0.349
Neg logit contributions
 going-0.423
 not-0.421
 helping-0.421
 reaching-0.416
 saving-0.414
Token that
Token ID326
Feature activation-7.95e-02

Pos logit contributions
suit+0.705
issy+0.694
azz+0.664
pp+0.655
uffy+0.654
Neg logit contributions
 going-0.252
 reaching-0.250
 helping-0.246
 saving-0.244
 not-0.242
Token talking
Token ID3375
Feature activation+1.69e-01

Pos logit contributions
suit+0.248
issy+0.241
azz+0.232
acker+0.232
pp+0.227
Neg logit contributions
 going-0.094
 helping-0.092
 not-0.091
 saving-0.089
 having-0.089
Token and
Token ID290
Feature activation+2.76e-02

Pos logit contributions
 going+0.151
 not+0.149
 helping+0.149
 reaching+0.147
 saving+0.145
Neg logit contributions
suit-0.464
issy-0.453
azz-0.435
acker-0.432
pp-0.431
Token laughing
Token ID14376
Feature activation+1.75e-01

Pos logit contributions
 not+0.022
 reaching+0.022
 saving+0.021
 insisting+0.020
 resisting+0.020
Neg logit contributions
suit-0.073
issy-0.071
pp-0.069
azz-0.068
cy-0.067
Token.
Token ID13
Feature activation-3.43e-02

Pos logit contributions
 going+0.149
 not+0.147
 helping+0.147
 reaching+0.146
 saving+0.144
Neg logit contributions
suit-0.490
issy-0.476
azz-0.459
acker-0.455
pp-0.454
Token The
Token ID383
Feature activation-2.06e-01

Pos logit contributions
suit+0.087
issy+0.084
azz+0.081
acker+0.080
pp+0.080
Neg logit contributions
 going-0.038
 not-0.038
 helping-0.038
 reaching-0.037
 saving-0.037
Token girl
Token ID2576
Feature activation-3.01e-01

Pos logit contributions
suit+0.432
issy+0.426
acker+0.415
azz+0.413
pp+0.411
Neg logit contributions
 not-0.327
 next-0.327
 going-0.326
 helping-0.322
 having-0.315
Token was
Token ID373
Feature activation+9.73e-02

Pos logit contributions
suit+0.647
issy+0.629
azz+0.597
acker+0.585
pp+0.585
Neg logit contributions
 helping-0.434
 going-0.434
 saving-0.434
 reaching-0.432
 not-0.422
Token curious
Token ID11040
Feature activation+1.30e-01

Pos logit contributions
 saving+0.082
 insisting+0.078
 reaching+0.078
 resisting+0.076
 succeeding+0.076
Neg logit contributions
suit-0.238
issy-0.229
coat-0.222
y-0.222
azz-0.221
Token and
Token ID290
Feature activation+1.31e-02

Pos logit contributions
 going+0.140
 not+0.139
 helping+0.138
 reaching+0.138
 saving+0.138
Neg logit contributions
suit-0.330
issy-0.322
azz-0.309
acker-0.305
pp-0.303
Token she
Token ID673
Feature activation-1.56e-01

Pos logit contributions
 saving+0.020
 resisting+0.018
 reaching+0.018
 helping+0.018
 insisting+0.018
Neg logit contributions
suit-0.026
issy-0.024
azz-0.023
ems-0.023
y-0.023
Token wanted
Token ID2227
Feature activation+2.48e-01

Pos logit contributions
suit+0.352
issy+0.341
azz+0.325
pp+0.317
acker+0.317
Neg logit contributions
 saving-0.214
 helping-0.214
 going-0.212
 reaching-0.211
 not-0.206
Token I
Token ID314
Feature activation-3.02e-02

Pos logit contributions
 not+0.027
 going+0.026
 helping+0.026
 reaching+0.025
 having+0.025
Neg logit contributions
suit-0.064
issy-0.062
acker-0.060
pp-0.060
azz-0.059
Token'm
Token ID1101
Feature activation+1.26e-01

Pos logit contributions
suit+0.065
issy+0.063
azz+0.060
acker+0.059
pp+0.059
Neg logit contributions
 not-0.047
 going-0.046
 helping-0.044
 having-0.043
 saving-0.043
Token sorry
Token ID7926
Feature activation+1.44e-01

Pos logit contributions
 reaching+0.097
 insisting+0.091
 saving+0.090
 next+0.088
 helping+0.086
Neg logit contributions
issy-0.328
suit-0.320
ppy-0.309
azz-0.307
uffy-0.302
Token."
Token ID526
Feature activation-3.44e-02

Pos logit contributions
 reaching+0.155
 saving+0.153
 going+0.152
 helping+0.150
 insisting+0.150
Neg logit contributions
suit-0.366
issy-0.357
azz-0.344
uffy-0.341
acker-0.334
Token The
Token ID383
Feature activation-2.06e-01

Pos logit contributions
suit+0.087
issy+0.085
azz+0.083
uffy+0.081
acker+0.080
Neg logit contributions
 going-0.038
 saving-0.036
 reaching-0.036
 helping-0.036
 not-0.034
Token girl
Token ID2576
Feature activation-3.01e-01

Pos logit contributions
suit+0.394
issy+0.373
pp+0.362
acker+0.362
azz+0.361
Neg logit contributions
 going-0.382
 helping-0.379
 not-0.374
 next-0.370
 saving-0.362
Token smiled
Token ID13541
Feature activation+9.58e-02

Pos logit contributions
suit+0.593
issy+0.576
azz+0.544
uffy+0.539
acker+0.525
Neg logit contributions
 going-0.487
 helping-0.486
 saving-0.486
 reaching-0.481
 insisting-0.469
Token and
Token ID290
Feature activation+1.08e-02

Pos logit contributions
 saving+0.148
 resisting+0.142
 reaching+0.141
 going+0.141
 insisting+0.141
Neg logit contributions
suit-0.201
y-0.190
azz-0.189
issy-0.183
ppy-0.181
Token said
Token ID531
Feature activation+6.61e-02

Pos logit contributions
 reaching+0.017
 going+0.017
 saving+0.017
 helping+0.017
 insisting+0.016
Neg logit contributions
suit-0.022
issy-0.021
pp-0.020
azz-0.020
uffy-0.019
Token "
Token ID366
Feature activation-6.08e-03

Pos logit contributions
 insisting+0.103
 going+0.102
 having+0.101
 resisting+0.101
 saving+0.101
Neg logit contributions
suit-0.125
uffy-0.118
pp-0.116
kin-0.115
azz-0.113
TokenIt
Token ID1026
Feature activation-3.15e-02

Pos logit contributions
suit+0.013
issy+0.013
azz+0.012
acker+0.012
pp+0.012
Neg logit contributions
 going-0.009
 not-0.009
 helping-0.009
 reaching-0.009
 saving-0.009
Token swing
Token ID9628
Feature activation+5.23e-02

Pos logit contributions
suit+0.110
issy+0.106
acker+0.101
pp+0.100
azz+0.099
Neg logit contributions
 reaching-0.087
 saving-0.086
 helping-0.086
 going-0.085
 insisting-0.084
Token with
Token ID351
Feature activation+1.35e-01

Pos logit contributions
 going+0.044
 not+0.044
 helping+0.043
 reaching+0.042
 saving+0.041
Neg logit contributions
suit-0.147
issy-0.144
azz-0.138
acker-0.137
pp-0.136
Token you
Token ID345
Feature activation-9.82e-02

Pos logit contributions
 going+0.145
 not+0.143
 helping+0.143
 reaching+0.142
 saving+0.140
Neg logit contributions
suit-0.348
issy-0.339
azz-0.325
acker-0.322
pp-0.321
Token?"
Token ID1701
Feature activation-1.07e-02

Pos logit contributions
suit+0.254
issy+0.246
azz+0.237
acker+0.236
pp+0.235
Neg logit contributions
 going-0.101
 reaching-0.101
 helping-0.101
 saving-0.100
 not-0.099
Token The
Token ID383
Feature activation-2.18e-01

Pos logit contributions
suit+0.025
issy+0.024
azz+0.023
uffy+0.023
acker+0.023
Neg logit contributions
 going-0.014
 saving-0.014
 reaching-0.013
 helping-0.013
 not-0.013
Token girl
Token ID2576
Feature activation-3.01e-01

Pos logit contributions
suit+0.408
issy+0.407
acker+0.396
pp+0.387
azz+0.381
Neg logit contributions
 next-0.403
 going-0.399
 not-0.398
 helping-0.393
 taking-0.383
Token and
Token ID290
Feature activation-1.39e-02

Pos logit contributions
suit+0.581
issy+0.560
azz+0.543
uffy+0.525
pp+0.519
Neg logit contributions
 saving-0.502
 helping-0.489
 going-0.478
 resisting-0.467
 reaching-0.467
Token the
Token ID262
Feature activation-2.42e-01

Pos logit contributions
suit+0.044
acker+0.043
issy+0.042
abs+0.042
pp+0.042
Neg logit contributions
 not-0.008
 the-0.008
 going-0.008
 helping-0.008
 having-0.008
Token boy
Token ID2933
Feature activation-2.79e-01

Pos logit contributions
issy+0.413
suit+0.409
acker+0.404
azz+0.389
pp+0.387
Neg logit contributions
 next-0.483
 helping-0.478
 going-0.477
 not-0.474
 reaching-0.457
Token say
Token ID910
Feature activation+1.02e-01

Pos logit contributions
suit+0.603
issy+0.587
azz+0.559
pp+0.552
acker+0.547
Neg logit contributions
 helping-0.399
 saving-0.398
 going-0.391
 reaching-0.390
 not-0.389
Token,
Token ID11
Feature activation+2.25e-02

Pos logit contributions
 reaching+0.139
 insisting+0.138
 resisting+0.137
 going+0.136
 saving+0.134
Neg logit contributions
suit-0.220
issy-0.208
azz-0.204
kin-0.201
uffy-0.201
Token surprise
Token ID5975
Feature activation+1.49e-02

Pos logit contributions
suit+0.248
issy+0.241
azz+0.229
acker+0.227
pp+0.227
Neg logit contributions
 going-0.133
 helping-0.132
 not-0.131
 reaching-0.131
 saving-0.130
Token."
Token ID526
Feature activation-4.98e-02

Pos logit contributions
 going+0.015
 not+0.014
 helping+0.014
 reaching+0.014
 saving+0.014
Neg logit contributions
suit-0.040
issy-0.039
azz-0.037
acker-0.037
pp-0.037
Token
Token ID198
Feature activation-8.10e-02

Pos logit contributions
suit+0.135
issy+0.132
acker+0.126
azz+0.126
pp+0.126
Neg logit contributions
 not-0.047
 going-0.046
 helping-0.046
 reaching-0.045
 saving-0.044
Token
Token ID198
Feature activation-7.80e-02

Pos logit contributions
suit+0.236
issy+0.230
azz+0.221
pp+0.220
acker+0.220
Neg logit contributions
 not-0.060
 going-0.060
 helping-0.059
 reaching-0.058
 saving-0.057
TokenThe
Token ID464
Feature activation-2.26e-01

Pos logit contributions
suit+0.170
issy+0.160
pp+0.155
acker+0.149
uffy+0.149
Neg logit contributions
 not-0.124
 helping-0.118
 reaching-0.116
 next-0.116
 insisting-0.115
Token girl
Token ID2576
Feature activation-3.00e-01

Pos logit contributions
suit+0.444
issy+0.437
pp+0.422
acker+0.420
azz+0.418
Neg logit contributions
 next-0.399
 helping-0.393
 going-0.392
 not-0.391
 taking-0.375
Token thought
Token ID1807
Feature activation+1.46e-01

Pos logit contributions
suit+0.636
issy+0.617
azz+0.583
uffy+0.572
acker+0.571
Neg logit contributions
 going-0.450
 helping-0.446
 saving-0.443
 reaching-0.438
 not-0.432
Token this
Token ID428
Feature activation-1.71e-01

Pos logit contributions
 going+0.110
 helping+0.108
 not+0.107
 reaching+0.107
 saving+0.105
Neg logit contributions
suit-0.421
issy-0.413
azz-0.397
acker-0.393
pp-0.392
Token sounded
Token ID14846
Feature activation+7.60e-02

Pos logit contributions
suit+0.457
pp+0.445
issy+0.438
azz+0.437
acker+0.434
Neg logit contributions
 not-0.186
 going-0.184
 helping-0.176
 next-0.173
 to-0.171
Token fun
Token ID1257
Feature activation+8.72e-02

Pos logit contributions
 saving+0.108
 reaching+0.106
 insisting+0.106
 resisting+0.105
 helping+0.104
Neg logit contributions
suit-0.168
issy-0.155
acker-0.148
ocado-0.146
azz-0.145
Token and
Token ID290
Feature activation+2.46e-02

Pos logit contributions
 saving+0.079
 reaching+0.079
 going+0.078
 helping+0.077
 insisting+0.076
Neg logit contributions
suit-0.235
issy-0.227
azz-0.218
acker-0.214
uffy-0.212
Token her
Token ID607
Feature activation-2.18e-01

Pos logit contributions
suit+0.459
issy+0.440
acker+0.433
pp+0.429
azz+0.427
Neg logit contributions
 going-0.194
 helping-0.191
 reaching-0.189
 not-0.187
 having-0.184
Token.
Token ID13
Feature activation-4.30e-02

Pos logit contributions
suit+0.491
issy+0.484
azz+0.463
acker+0.458
pp+0.456
Neg logit contributions
 not-0.307
 going-0.307
 helping-0.301
 reaching-0.295
 having-0.295
Token
Token ID198
Feature activation-7.35e-02

Pos logit contributions
suit+0.109
issy+0.107
pp+0.102
acker+0.102
azz+0.101
Neg logit contributions
 going-0.047
 not-0.047
 helping-0.047
 having-0.046
 reaching-0.046
Token
Token ID198
Feature activation-7.34e-02

Pos logit contributions
suit+0.205
issy+0.201
azz+0.193
acker+0.191
pp+0.190
Neg logit contributions
 going-0.063
 helping-0.062
 not-0.061
 reaching-0.061
 saving-0.060
TokenThe
Token ID464
Feature activation-2.28e-01

Pos logit contributions
suit+0.164
issy+0.156
pp+0.151
acker+0.147
uffy+0.146
Neg logit contributions
 not-0.110
 helping-0.106
 going-0.105
 reaching-0.105
 having-0.103
Token girl
Token ID2576
Feature activation-3.00e-01

Pos logit contributions
suit+0.466
issy+0.457
acker+0.443
azz+0.442
pp+0.441
Neg logit contributions
 next-0.388
 going-0.381
 helping-0.381
 not-0.375
 taking-0.363
Token hugged
Token ID41130
Feature activation+1.69e-01

Pos logit contributions
suit+0.657
issy+0.639
azz+0.606
acker+0.596
pp+0.595
Neg logit contributions
 going-0.430
 helping-0.427
 reaching-0.424
 saving-0.423
 not-0.421
Token the
Token ID262
Feature activation-2.31e-01

Pos logit contributions
 going+0.102
 not+0.101
 helping+0.099
 reaching+0.098
 having+0.094
Neg logit contributions
suit-0.525
issy-0.513
acker-0.497
azz-0.494
pp-0.493
Token heavy
Token ID4334
Feature activation-6.74e-02

Pos logit contributions
issy+0.378
azz+0.364
suit+0.361
acker+0.353
pp+0.352
Neg logit contributions
 next-0.483
 going-0.481
 helping-0.475
 not-0.467
 taking-0.461
Token pillow
Token ID28774
Feature activation-1.92e-01

Pos logit contributions
issy+0.126
azz+0.120
acker+0.119
pp+0.116
suit+0.114
Neg logit contributions
 to-0.128
 reaching-0.122
 helping-0.122
 next-0.122
 going-0.121
Token close
Token ID1969
Feature activation+5.18e-02

Pos logit contributions
suit+0.493
issy+0.475
azz+0.454
pp+0.453
acker+0.451
Neg logit contributions
 going-0.203
 reaching-0.198
 saving-0.198
 helping-0.197
 not-0.195
Token and
Token ID290
Feature activation+3.20e-02

Pos logit contributions
suit+0.260
issy+0.245
azz+0.239
acker+0.236
pp+0.235
Neg logit contributions
 going-0.128
 helping-0.128
 saving-0.126
 insisting-0.125
 resisting-0.122
Token rolls
Token ID14088
Feature activation+1.34e-01

Pos logit contributions
 helping+0.049
 saving+0.048
 going+0.047
 not+0.047
 resisting+0.047
Neg logit contributions
suit-0.066
issy-0.061
azz-0.060
cy-0.059
ems-0.059
Token over
Token ID625
Feature activation+1.30e-01

Pos logit contributions
 helping+0.130
 saving+0.126
 resisting+0.124
 going+0.121
 succeeding+0.119
Neg logit contributions
suit-0.369
azz-0.331
issy-0.330
coat-0.329
cy-0.328
Token.
Token ID13
Feature activation+6.93e-03

Pos logit contributions
 saving+0.100
 not+0.098
 helping+0.097
 going+0.096
 insisting+0.094
Neg logit contributions
suit-0.362
issy-0.345
azz-0.344
cy-0.341
uffy-0.333
Token She
Token ID1375
Feature activation-1.09e-01

Pos logit contributions
 saving+0.011
 going+0.011
 helping+0.011
 reaching+0.011
 insisting+0.011
Neg logit contributions
suit-0.014
azz-0.014
issy-0.013
uffy-0.013
acker-0.013
Token looks
Token ID3073
Feature activation+1.83e-01

Pos logit contributions
suit+0.212
issy+0.197
azz+0.196
uffy+0.189
acker+0.188
Neg logit contributions
 helping-0.185
 saving-0.181
 reaching-0.178
 insisting-0.176
 going-0.176
Token up
Token ID510
Feature activation+1.09e-01

Pos logit contributions
 saving+0.198
 helping+0.193
 resisting+0.188
 reaching+0.188
 not+0.184
Neg logit contributions
suit-0.477
azz-0.432
issy-0.431
cy-0.427
uffy-0.423
Token at
Token ID379
Feature activation+1.21e-01

Pos logit contributions
 saving+0.207
 succeeding+0.200
 insisting+0.190
 rescuing+0.189
 resisting+0.189
Neg logit contributions
y-0.177
suit-0.163
ara-0.162
cy-0.155
ite-0.153
Token Ben
Token ID3932
Feature activation-1.83e-01

Pos logit contributions
 helping+0.168
 going+0.167
 saving+0.167
 reaching+0.166
 not+0.166
Neg logit contributions
suit-0.267
issy-0.260
azz-0.250
acker-0.245
uffy-0.244
Token and
Token ID290
Feature activation+2.20e-02

Pos logit contributions
suit+0.415
issy+0.389
azz+0.387
pp+0.384
ppy+0.381
Neg logit contributions
 saving-0.232
 helping-0.230
 insisting-0.227
 going-0.226
 resisting-0.225
Token says
Token ID1139
Feature activation+8.63e-02

Pos logit contributions
 helping+0.037
 saving+0.037
 reaching+0.036
 resisting+0.036
 going+0.035
Neg logit contributions
suit-0.041
azz-0.038
ems-0.037
cy-0.037
issy-0.037
Token Anna
Token ID11735
Feature activation-2.20e-01

Pos logit contributions
 going+0.011
 helping+0.010
 reaching+0.010
 not+0.010
 saving+0.010
Neg logit contributions
suit-0.018
issy-0.018
azz-0.017
acker-0.017
uffy-0.017
Token and
Token ID290
Feature activation+8.91e-03

Pos logit contributions
suit+0.503
issy+0.484
azz+0.470
acker+0.456
pp+0.456
Neg logit contributions
 helping-0.296
 saving-0.293
 going-0.293
 reaching-0.292
 not-0.285
Token Ben
Token ID3932
Feature activation-1.97e-01

Pos logit contributions
 helping+0.011
 saving+0.011
 going+0.011
 reaching+0.011
 not+0.011
Neg logit contributions
suit-0.021
issy-0.020
azz-0.019
acker-0.019
pp-0.019
Token do
Token ID466
Feature activation+1.75e-01

Pos logit contributions
suit+0.391
issy+0.377
azz+0.356
acker+0.354
pp+0.351
Neg logit contributions
 helping-0.323
 going-0.322
 reaching-0.320
 saving-0.319
 not-0.319
Token not
Token ID407
Feature activation+1.44e-01

Pos logit contributions
 helping+0.091
 reaching+0.089
 saving+0.086
 resisting+0.086
 insisting+0.085
Neg logit contributions
suit-0.507
issy-0.493
iet-0.484
azz-0.481
ems-0.475
Token have
Token ID423
Feature activation+1.68e-01

Pos logit contributions
 going+0.183
 not+0.183
 helping+0.174
 reaching+0.171
 having+0.170
Neg logit contributions
suit-0.355
issy-0.345
uffy-0.334
pp-0.329
acker-0.329
Token passports
Token ID33052
Feature activation+6.15e-03

Pos logit contributions
 reaching+0.131
 helping+0.129
 saving+0.125
 insisting+0.123
 resisting+0.123
Neg logit contributions
suit-0.469
issy-0.457
azz-0.442
acker-0.434
ppy-0.431
Token.
Token ID13
Feature activation+5.69e-03

Pos logit contributions
 reaching+0.006
 helping+0.006
 insisting+0.005
 saving+0.005
 resisting+0.005
Neg logit contributions
suit-0.017
issy-0.016
azz-0.015
pp-0.015
acker-0.015
Token They
Token ID1119
Feature activation-6.03e-02

Pos logit contributions
 going+0.009
 reaching+0.009
 helping+0.009
 saving+0.009
 resisting+0.009
Neg logit contributions
suit-0.012
issy-0.011
azz-0.011
acker-0.011
uffy-0.011
Token are
Token ID389
Feature activation+1.04e-01

Pos logit contributions
suit+0.145
issy+0.141
azz+0.135
pp+0.135
uffy+0.133
Neg logit contributions
 not-0.082
 going-0.081
 reaching-0.075
 helping-0.073
 having-0.073
Token too
Token ID1165
Feature activation+9.47e-02

Pos logit contributions
 reaching+0.092
 insisting+0.088
 helping+0.087
 saving+0.087
 resisting+0.087
Neg logit contributions
suit-0.275
issy-0.260
azz-0.250
uffy-0.247
ppy-0.246
Token in
Token ID287
Feature activation+3.46e-02

Pos logit contributions
 saving+0.078
 insisting+0.077
 helping+0.077
 not+0.076
 resisting+0.075
Neg logit contributions
suit-0.135
issy-0.125
ppy-0.120
pp-0.120
ems-0.120
Token case
Token ID1339
Feature activation-4.24e-02

Pos logit contributions
 going+0.041
 not+0.041
 helping+0.041
 reaching+0.040
 saving+0.040
Neg logit contributions
suit-0.085
issy-0.083
azz-0.079
acker-0.079
pp-0.078
Token another
Token ID1194
Feature activation-1.33e-01

Pos logit contributions
suit+0.120
issy+0.115
acker+0.112
pp+0.112
azz+0.111
Neg logit contributions
 not-0.038
 going-0.037
 helping-0.036
 saving-0.035
 reaching-0.035
Token magical
Token ID10883
Feature activation-8.19e-02

Pos logit contributions
suit+0.297
issy+0.291
pp+0.281
azz+0.278
uffy+0.276
Neg logit contributions
 helping-0.197
 going-0.194
 reaching-0.194
 not-0.188
 having-0.184
Token surprise
Token ID5975
Feature activation-1.25e-03

Pos logit contributions
issy+0.164
suit+0.164
azz+0.158
acker+0.156
pp+0.155
Neg logit contributions
 helping-0.135
 reaching-0.134
 going-0.134
 not-0.132
 to-0.129
Token was
Token ID373
Feature activation+1.44e-01

Pos logit contributions
suit+0.003
issy+0.003
azz+0.003
acker+0.003
pp+0.003
Neg logit contributions
 going-0.001
 not-0.001
 helping-0.001
 reaching-0.001
 saving-0.001
Token waiting
Token ID4953
Feature activation+1.57e-01

Pos logit contributions
 reaching+0.120
 insisting+0.119
 helping+0.117
 saving+0.115
 resisting+0.114
Neg logit contributions
suit-0.374
issy-0.369
pp-0.347
azz-0.346
coat-0.342
Token for
Token ID329
Feature activation+1.72e-01

Pos logit contributions
 reaching+0.128
 helping+0.127
 insisting+0.126
 resisting+0.124
 saving+0.124
Neg logit contributions
suit-0.434
issy-0.412
pp-0.392
azz-0.390
uffy-0.387
Token him
Token ID683
Feature activation-7.10e-02

Pos logit contributions
 going+0.129
 not+0.127
 reaching+0.127
 helping+0.126
 saving+0.123
Neg logit contributions
suit-0.487
issy-0.481
azz-0.460
acker-0.456
pp-0.453
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
suit+0.190
issy+0.182
azz+0.178
pp+0.175
ppy+0.174
Neg logit contributions
 reaching-0.062
 helping-0.061
 insisting-0.061
 saving-0.061
 resisting-0.059
Token<|endoftext|>
Token ID50256
Feature activation-4.99e-02

Pos logit contributions
suit+0.038
issy+0.037
pp+0.036
acker+0.035
uffy+0.035
Neg logit contributions
 not-0.008
 going-0.008
 helping-0.008
 having-0.008
 reaching-0.008
Token
Token ID198
Feature activation-4.30e-02

Pos logit contributions
suit+0.049
issy+0.049
azz+0.047
acker+0.046
uffy+0.045
Neg logit contributions
 going-0.022
 reaching-0.021
 helping-0.021
 saving-0.021
 not-0.020
Token
Token ID198
Feature activation-5.03e-02

Pos logit contributions
issy+0.096
azz+0.094
suit+0.093
acker+0.092
uffy+0.091
Neg logit contributions
 going-0.057
 reaching-0.056
 helping-0.056
 saving-0.055
 giving-0.054
TokenWhen
Token ID2215
Feature activation+2.50e-02

Pos logit contributions
suit+0.115
issy+0.112
azz+0.107
acker+0.106
pp+0.104
Neg logit contributions
 going-0.069
 helping-0.068
 reaching-0.068
 not-0.067
 saving-0.067
Token they
Token ID484
Feature activation-1.21e-01

Pos logit contributions
 reaching+0.021
 going+0.020
 helping+0.020
 saving+0.020
 insisting+0.020
Neg logit contributions
suit-0.069
issy-0.068
azz-0.064
uffy-0.064
acker-0.064
Token boy
Token ID2933
Feature activation-1.93e-01

Pos logit contributions
suit+0.227
issy+0.216
azz+0.204
pp+0.203
uffy+0.201
Neg logit contributions
 going-0.219
 not-0.218
 reaching-0.214
 helping-0.211
 having-0.209
Token woke
Token ID19092
Feature activation+1.88e-01

Pos logit contributions
suit+0.361
issy+0.348
azz+0.328
acker+0.324
pp+0.323
Neg logit contributions
 going-0.342
 not-0.340
 helping-0.339
 reaching-0.337
 saving-0.335
Token up
Token ID510
Feature activation+9.81e-02

Pos logit contributions
 helping+0.136
 reaching+0.131
 saving+0.130
 giving+0.123
 teaching+0.121
Neg logit contributions
suit-0.524
issy-0.515
uffy-0.514
azz-0.496
pp-0.493
Token,
Token ID11
Feature activation+3.62e-02

Pos logit contributions
 saving+0.131
 succeeding+0.129
 not+0.128
 resisting+0.128
 insisting+0.126
Neg logit contributions
suit-0.201
issy-0.182
cy-0.182
y-0.181
pp-0.180
Token he
Token ID339
Feature activation-1.24e-01

Pos logit contributions
 going+0.025
 not+0.025
 helping+0.024
 reaching+0.024
 saving+0.024
Neg logit contributions
suit-0.108
issy-0.106
azz-0.102
acker-0.102
pp-0.102
Token felt
Token ID2936
Feature activation+1.68e-01

Pos logit contributions
suit+0.246
issy+0.236
azz+0.223
ppy+0.220
pp+0.220
Neg logit contributions
 helping-0.201
 reaching-0.200
 saving-0.199
 going-0.198
 insisting-0.196
Token so
Token ID523
Feature activation+7.99e-02

Pos logit contributions
 counting+0.234
 asking+0.222
 resisting+0.221
 adding+0.220
 reaching+0.220
Neg logit contributions
suit-0.334
kin-0.305
 skirt-0.303
issy-0.302
acker-0.300
Token wished
Token ID16555
Feature activation+2.24e-01

Pos logit contributions
suit+0.269
issy+0.261
azz+0.249
acker+0.246
pp+0.246
Neg logit contributions
 going-0.170
 not-0.169
 helping-0.168
 reaching-0.166
 saving-0.165
Token she
Token ID673
Feature activation-8.47e-02

Pos logit contributions
 reaching+0.301
 insisting+0.296
 saving+0.296
 going+0.295
 helping+0.291
Neg logit contributions
issy-0.485
suit-0.468
azz-0.456
uffy-0.448
pp-0.437
Token had
Token ID550
Feature activation+1.42e-01

Pos logit contributions
suit+0.178
issy+0.172
pp+0.167
azz+0.167
acker+0.166
Neg logit contributions
 not-0.143
 going-0.138
 having-0.132
 helping-0.132
 saving-0.130
Token someone
Token ID2130
Feature activation-6.29e-02

Pos logit contributions
 reaching+0.167
 insisting+0.162
 resisting+0.158
 encouraging+0.155
 helping+0.155
Neg logit contributions
suit-0.314
issy-0.310
ety-0.299
ppy-0.297
coat-0.296
Token to
Token ID284
Feature activation+9.12e-02

Pos logit contributions
suit+0.198
issy+0.195
azz+0.188
pp+0.186
acker+0.186
Neg logit contributions
 reaching-0.029
 going-0.028
 helping-0.028
 not-0.027
 saving-0.027
Token talk
Token ID1561
Feature activation+1.63e-01

Pos logit contributions
 reaching+0.122
 insisting+0.122
 saving+0.122
 resisting+0.120
 having+0.120
Neg logit contributions
issy-0.191
suit-0.189
acker-0.180
azz-0.178
pp-0.177
Token to
Token ID284
Feature activation+9.86e-02

Pos logit contributions
 reaching+0.080
 going+0.075
 insisting+0.074
 helping+0.074
 saving+0.073
Neg logit contributions
suit-0.502
issy-0.491
azz-0.480
acker-0.466
pp-0.465
Token.
Token ID13
Feature activation+2.47e-03

Pos logit contributions
 reaching+0.087
 saving+0.087
 helping+0.086
 going+0.086
 insisting+0.085
Neg logit contributions
issy-0.260
suit-0.258
azz-0.247
pp-0.243
acker-0.243
Token She
Token ID1375
Feature activation-1.11e-01

Pos logit contributions
 going+0.004
 reaching+0.004
 helping+0.003
 saving+0.003
 not+0.003
Neg logit contributions
suit-0.005
issy-0.005
azz-0.005
acker-0.005
uffy-0.005
Token decided
Token ID3066
Feature activation+1.61e-01

Pos logit contributions
suit+0.258
issy+0.252
azz+0.240
acker+0.236
pp+0.235
Neg logit contributions
 going-0.147
 helping-0.147
 reaching-0.146
 saving-0.144
 not-0.144
Token to
Token ID284
Feature activation+1.02e-01

Pos logit contributions
 reaching+0.125
 succeeding+0.123
 resisting+0.117
 saving+0.112
 helping+0.111
Neg logit contributions
 folder-0.401
suit-0.399
issy-0.393
azz-0.393
ems-0.389
Token girl
Token ID2576
Feature activation-2.67e-01

Pos logit contributions
suit+0.286
issy+0.273
pp+0.269
acker+0.263
azz+0.258
Neg logit contributions
 helping-0.387
 going-0.381
 next-0.379
 not-0.376
 having-0.375
Token played
Token ID2826
Feature activation+1.63e-01

Pos logit contributions
suit+0.574
issy+0.561
azz+0.525
pp+0.520
uffy+0.513
Neg logit contributions
 going-0.385
 reaching-0.382
 saving-0.380
 helping-0.378
 insisting-0.376
Token all
Token ID477
Feature activation-3.78e-03

Pos logit contributions
 going+0.135
 reaching+0.133
 insisting+0.132
 not+0.130
 helping+0.128
Neg logit contributions
suit-0.450
issy-0.432
azz-0.415
acker-0.410
pp-0.408
Token sorts
Token ID10524
Feature activation+5.77e-02

Pos logit contributions
suit+0.008
issy+0.008
azz+0.008
acker+0.008
pp+0.008
Neg logit contributions
 going-0.006
 helping-0.005
 not-0.005
 reaching-0.005
 saving-0.005
Token of
Token ID286
Feature activation+1.21e-02

Pos logit contributions
 going+0.088
 insisting+0.088
 reaching+0.087
 not+0.085
 saving+0.085
Neg logit contributions
suit-0.118
issy-0.115
azz-0.108
pp-0.107
affe-0.105
Token games
Token ID1830
Feature activation-4.77e-03

Pos logit contributions
 going+0.019
 helping+0.019
 not+0.019
 reaching+0.018
 having+0.018
Neg logit contributions
suit-0.026
issy-0.026
azz-0.024
pp-0.024
acker-0.024
Token with
Token ID351
Feature activation+1.24e-01

Pos logit contributions
suit+0.012
issy+0.012
azz+0.011
acker+0.011
pp+0.011
Neg logit contributions
 going-0.005
 not-0.005
 helping-0.005
 reaching-0.005
 saving-0.005
Token the
Token ID262
Feature activation-2.02e-01

Pos logit contributions
 going+0.072
 helping+0.071
 not+0.070
 reaching+0.068
 having+0.067
Neg logit contributions
suit-0.388
issy-0.379
azz-0.364
pp-0.364
acker-0.364
Token robot
Token ID9379
Feature activation-1.79e-01

Pos logit contributions
suit+0.301
issy+0.284
pp+0.275
acker+0.265
ppy+0.264
Neg logit contributions
 going-0.453
 helping-0.452
 having-0.447
 not-0.442
 saving-0.437
Token,
Token ID11
Feature activation+2.09e-02

Pos logit contributions
suit+0.470
issy+0.461
azz+0.441
acker+0.440
pp+0.437
Neg logit contributions
 going-0.187
 helping-0.185
 not-0.183
 reaching-0.179
 saving-0.178
Token like
Token ID588
Feature activation+1.67e-01

Pos logit contributions
 going+0.020
 not+0.019
 helping+0.019
 reaching+0.019
 saving+0.019
Neg logit contributions
suit-0.057
issy-0.055
azz-0.053
acker-0.053
pp-0.053
Token face
Token ID1986
Feature activation+4.11e-02

Pos logit contributions
suit+0.456
issy+0.455
acker+0.441
azz+0.437
pp+0.432
Neg logit contributions
 not-0.300
 going-0.295
 helping-0.289
 having-0.285
 to-0.281
Token.
Token ID13
Feature activation-6.75e-03

Pos logit contributions
 saving+0.040
 insisting+0.040
 resisting+0.039
 not+0.039
 helping+0.039
Neg logit contributions
suit-0.108
issy-0.106
azz-0.101
pp-0.101
acker-0.099
Token But
Token ID887
Feature activation+5.38e-02

Pos logit contributions
suit+0.018
issy+0.018
pp+0.017
ler+0.017
ppy+0.017
Neg logit contributions
 going-0.007
 not-0.007
 having-0.007
 to-0.007
 helping-0.006
Token one
Token ID530
Feature activation-1.38e-01

Pos logit contributions
 not+0.050
 going+0.049
 helping+0.047
 reaching+0.046
 having+0.045
Neg logit contributions
suit-0.155
issy-0.149
acker-0.144
azz-0.144
pp-0.142
Token day
Token ID1110
Feature activation-3.85e-02

Pos logit contributions
suit+0.342
issy+0.330
pp+0.328
azz+0.318
acker+0.311
Neg logit contributions
 helping-0.184
 going-0.180
 to-0.178
 next-0.177
 not-0.177
Token,
Token ID11
Feature activation+2.19e-02

Pos logit contributions
suit+0.102
issy+0.100
azz+0.095
acker+0.095
pp+0.095
Neg logit contributions
 going-0.039
 not-0.039
 helping-0.039
 reaching-0.038
 saving-0.038
Token it
Token ID340
Feature activation-1.27e-01

Pos logit contributions
 not+0.025
 going+0.024
 helping+0.024
 reaching+0.023
 saving+0.023
Neg logit contributions
suit-0.058
issy-0.056
acker-0.054
azz-0.054
pp-0.054
Token got
Token ID1392
Feature activation+2.06e-01

Pos logit contributions
suit+0.302
issy+0.293
azz+0.281
acker+0.278
pp+0.278
Neg logit contributions
 going-0.163
 not-0.162
 helping-0.160
 reaching-0.159
 saving-0.157
Token very
Token ID845
Feature activation+1.12e-01

Pos logit contributions
 saving+0.270
 reaching+0.270
 helping+0.266
 succeeding+0.262
 teaching+0.262
Neg logit contributions
suit-0.463
issy-0.458
acker-0.426
pp-0.424
azz-0.420
Token cold
Token ID4692
Feature activation+5.98e-02

Pos logit contributions
 saving+0.167
 helping+0.165
 reaching+0.165
 going+0.163
 having+0.161
Neg logit contributions
suit-0.242
issy-0.231
uffy-0.219
acker-0.218
pp-0.216
Token and
Token ID290
Feature activation+2.95e-02

Pos logit contributions
 not+0.064
 helping+0.063
 going+0.063
 reaching+0.063
 saving+0.063
Neg logit contributions
suit-0.154
issy-0.149
azz-0.142
acker-0.142
pp-0.141
Token you
Token ID345
Feature activation-6.36e-02

Pos logit contributions
 succeeding+0.326
 insisting+0.324
 reaching+0.321
 resisting+0.321
 saving+0.310
Neg logit contributions
issy-0.406
ppy-0.372
suit-0.358
uffy-0.350
x-0.350
Token."
Token ID526
Feature activation+8.30e-03

Pos logit contributions
issy+0.132
suit+0.128
pp+0.127
ppy+0.123
acker+0.121
Neg logit contributions
 insisting-0.097
 resisting-0.093
 reaching-0.090
 saving-0.088
 succeeding-0.088
Token Lily
Token ID20037
Feature activation-2.25e-01

Pos logit contributions
 not+0.006
 having+0.006
 going+0.006
 helping+0.006
 reaching+0.006
Neg logit contributions
suit-0.024
issy-0.024
azz-0.023
pp-0.023
acker-0.023
Token was
Token ID373
Feature activation+1.14e-01

Pos logit contributions
suit+0.522
issy+0.506
azz+0.488
acker+0.482
pp+0.478
Neg logit contributions
 not-0.305
 going-0.301
 helping-0.298
 reaching-0.293
 saving-0.290
Token happy
Token ID3772
Feature activation+1.17e-01

Pos logit contributions
 reaching+0.141
 Listen+0.136
 resisting+0.134
 himself+0.133
 insisting+0.132
Neg logit contributions
coat-0.204
suit-0.196
kin-0.194
 folder-0.193
house-0.192
Token to
Token ID284
Feature activation+7.53e-02

Pos logit contributions
 succeeding+0.139
 saving+0.139
 insisting+0.139
 reaching+0.137
 Out+0.134
Neg logit contributions
y-0.233
 skirt-0.228
 folder-0.227
suit-0.221
cy-0.217
Token hear
Token ID3285
Feature activation+2.26e-01

Pos logit contributions
 insisting+0.089
 resisting+0.088
 saving+0.084
 reaching+0.084
 having+0.082
Neg logit contributions
issy-0.183
suit-0.169
kin-0.166
ppy-0.162
 folder-0.161
Token that
Token ID326
Feature activation-2.12e-02

Pos logit contributions
 reaching+0.206
 resisting+0.193
 saving+0.191
 succeeding+0.190
 insisting+0.190
Neg logit contributions
issy-0.589
azz-0.563
 folder-0.551
suit-0.543
 skirt-0.534
Token.
Token ID13
Feature activation-2.32e-03

Pos logit contributions
issy+0.047
 folder+0.045
suit+0.044
 skirt+0.042
azz+0.042
Neg logit contributions
 saving-0.027
 reaching-0.026
 himself-0.026
 insisting-0.025
 resisting-0.025
Token
Token ID198
Feature activation-3.02e-02

Pos logit contributions
suit+0.006
issy+0.006
azz+0.006
acker+0.006
pp+0.006
Neg logit contributions
 going-0.002
 not-0.002
 helping-0.002
 reaching-0.002
 saving-0.002
Token
Token ID198
Feature activation-3.80e-02

Pos logit contributions
suit+0.085
issy+0.084
azz+0.081
acker+0.080
pp+0.079
Neg logit contributions
 going-0.024
 helping-0.024
 reaching-0.024
 not-0.024
 saving-0.024
Token reached
Token ID4251
Feature activation+2.04e-01

Pos logit contributions
suit+0.286
issy+0.274
azz+0.259
acker+0.257
pp+0.254
Neg logit contributions
 helping-0.207
 going-0.204
 reaching-0.204
 saving-0.203
 not-0.199
Token a
Token ID257
Feature activation-1.50e-01

Pos logit contributions
 helping+0.176
 resisting+0.168
 teaching+0.165
 saving+0.164
 giving+0.159
Neg logit contributions
suit-0.544
issy-0.511
cy-0.501
 folder-0.484
azz-0.484
Token dark
Token ID3223
Feature activation-8.68e-02

Pos logit contributions
issy+0.343
suit+0.324
azz+0.323
pp+0.318
ems+0.312
Neg logit contributions
 helping-0.217
 going-0.216
 not-0.216
 reaching-0.212
 to-0.211
Token cre
Token ID1126
Feature activation-7.08e-03

Pos logit contributions
issy+0.216
pp+0.213
azz+0.208
abs+0.207
acker+0.207
Neg logit contributions
 to-0.112
 reaching-0.105
 helping-0.105
 next-0.103
 going-0.101
Tokenvice
Token ID28281
Feature activation-2.78e-02

Pos logit contributions
suit+0.009
issy+0.009
azz+0.008
uffy+0.008
pp+0.008
Neg logit contributions
 reaching-0.017
 going-0.016
 helping-0.016
 not-0.016
 insisting-0.016
Token in
Token ID287
Feature activation-5.33e-03

Pos logit contributions
suit+0.071
issy+0.067
y+0.064
pp+0.064
kin+0.063
Neg logit contributions
 insisting-0.029
 helping-0.029
 going-0.028
 saving-0.028
 resisting-0.028
Token the
Token ID262
Feature activation-2.47e-01

Pos logit contributions
suit+0.015
issy+0.014
acker+0.014
cy+0.014
azz+0.013
Neg logit contributions
 going-0.004
 helping-0.004
 saving-0.004
 resisting-0.004
 insisting-0.004
Token forest
Token ID8222
Feature activation-1.22e-01

Pos logit contributions
suit+0.518
issy+0.512
azz+0.505
pp+0.501
uffy+0.487
Neg logit contributions
 going-0.378
 not-0.376
 helping-0.375
 saving-0.372
 reaching-0.369
Token.
Token ID13
Feature activation-3.78e-02

Pos logit contributions
suit+0.306
issy+0.287
pp+0.275
azz+0.274
acker+0.273
Neg logit contributions
 insisting-0.127
 helping-0.124
 going-0.124
 resisting-0.121
 saving-0.121
Token He
Token ID679
Feature activation-1.39e-01

Pos logit contributions
suit+0.095
issy+0.092
azz+0.088
acker+0.087
pp+0.087
Neg logit contributions
 going-0.043
 not-0.043
 helping-0.043
 reaching-0.042
 saving-0.042
Token tried
Token ID3088
Feature activation+2.40e-01

Pos logit contributions
suit+0.301
issy+0.290
azz+0.275
acker+0.271
pp+0.269
Neg logit contributions
 going-0.205
 helping-0.204
 saving-0.201
 reaching-0.200
 not-0.198
Token her
Token ID607
Feature activation-1.94e-01

Pos logit contributions
 going+0.024
 not+0.024
 reaching+0.023
 helping+0.023
 saving+0.023
Neg logit contributions
suit-0.120
issy-0.118
azz-0.113
acker-0.112
uffy-0.111
Token garden
Token ID11376
Feature activation-5.73e-02

Pos logit contributions
suit+0.344
issy+0.340
pp+0.329
azz+0.321
acker+0.321
Neg logit contributions
 not-0.374
 going-0.373
 helping-0.371
 having-0.361
 next-0.360
Token,
Token ID11
Feature activation+3.40e-02

Pos logit contributions
suit+0.138
issy+0.135
uffy+0.128
azz+0.127
acker+0.126
Neg logit contributions
 going-0.067
 insisting-0.066
 saving-0.066
 helping-0.065
 not-0.065
Token Penny
Token ID25965
Feature activation-2.33e-01

Pos logit contributions
 reaching+0.047
 saving+0.047
 going+0.047
 helping+0.047
 not+0.046
Neg logit contributions
suit-0.076
issy-0.074
azz-0.070
acker-0.069
pp-0.069
Token got
Token ID1392
Feature activation+2.07e-01

Pos logit contributions
suit+0.515
issy+0.499
acker+0.484
azz+0.482
pp+0.480
Neg logit contributions
 going-0.343
 not-0.342
 helping-0.327
 reaching-0.322
 having-0.321
Token very
Token ID845
Feature activation+1.15e-01

Pos logit contributions
 reaching+0.215
 saving+0.203
 giving+0.195
 resisting+0.193
 helping+0.192
Neg logit contributions
issy-0.506
suit-0.497
 folder-0.479
 skirt-0.471
cy-0.466
Token thirsty
Token ID47124
Feature activation+1.07e-01

Pos logit contributions
 saving+0.166
 reaching+0.164
 helping+0.159
 teaching+0.157
 giving+0.156
Neg logit contributions
suit-0.254
issy-0.237
uffy-0.235
 tant-0.230
cy-0.228
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
 saving+0.110
 not+0.106
 reaching+0.106
 insisting+0.105
 helping+0.105
Neg logit contributions
suit-0.272
issy-0.259
acker-0.248
pp-0.246
uffy-0.245
Token She
Token ID1375
Feature activation-1.25e-01

Pos logit contributions
suit+0.022
issy+0.021
azz+0.020
acker+0.020
pp+0.019
Neg logit contributions
 saving-0.017
 going-0.017
 reaching-0.016
 helping-0.016
 not-0.016
Token wanted
Token ID2227
Feature activation+2.44e-01

Pos logit contributions
suit+0.292
issy+0.285
azz+0.274
pp+0.272
acker+0.271
Neg logit contributions
 not-0.165
 going-0.165
 helping-0.161
 reaching-0.160
 saving-0.156
Token to
Token ID284
Feature activation+1.12e-01

Pos logit contributions
 succeeding+0.184
 reaching+0.168
 insisting+0.167
 saving+0.165
 resisting+0.163
Neg logit contributions
issy-0.637
 folder-0.634
 skirt-0.626
suit-0.625
 sweater-0.613
Token clean
Token ID3424
Feature activation+1.15e-01

Pos logit contributions
 reaching+0.180
 saving+0.177
 insisting+0.176
 going+0.174
 resisting+0.174
Neg logit contributions
issy-0.527
suit-0.527
azz-0.494
uffy-0.490
pp-0.489
Token the
Token ID262
Feature activation-1.98e-01

Pos logit contributions
 going+0.053
 not+0.051
 helping+0.050
 reaching+0.049
 saving+0.048
Neg logit contributions
suit-0.369
issy-0.361
azz-0.350
acker-0.348
pp-0.346
Token train
Token ID4512
Feature activation-1.66e-01

Pos logit contributions
suit+0.218
issy+0.206
azz+0.184
acker+0.178
pp+0.177
Neg logit contributions
 going-0.503
 not-0.502
 reaching-0.501
 helping-0.501
 saving-0.496
Token.
Token ID13
Feature activation-1.97e-02

Pos logit contributions
suit+0.438
issy+0.428
azz+0.409
pp+0.404
acker+0.404
Neg logit contributions
 going-0.166
 helping-0.164
 reaching-0.164
 not-0.163
 saving-0.161
Token The
Token ID383
Feature activation-1.81e-01

Pos logit contributions
suit+0.043
issy+0.041
azz+0.039
uffy+0.039
cy+0.038
Neg logit contributions
 going-0.029
 reaching-0.028
 helping-0.028
 succeeding-0.028
 resisting-0.028
Token conductor
Token ID39206
Feature activation-1.41e-01

Pos logit contributions
suit+0.232
issy+0.220
azz+0.201
acker+0.197
pp+0.196
Neg logit contributions
 going-0.432
 not-0.429
 helping-0.429
 reaching-0.427
 saving-0.425
Token was
Token ID373
Feature activation+1.27e-01

Pos logit contributions
suit+0.321
issy+0.312
azz+0.297
acker+0.294
pp+0.293
Neg logit contributions
 going-0.194
 not-0.192
 helping-0.191
 reaching-0.190
 saving-0.188
Token happy
Token ID3772
Feature activation+1.20e-01

Pos logit contributions
 Listen+0.152
 succeeding+0.147
 reaching+0.146
 resisting+0.144
 conquering+0.139
Neg logit contributions
suit-0.249
coat-0.244
kin-0.239
aky-0.230
c-0.223
Token to
Token ID284
Feature activation+7.83e-02

Pos logit contributions
 insisting+0.131
 saving+0.124
 succeeding+0.123
 reaching+0.121
 going+0.120
Neg logit contributions
suit-0.263
y-0.261
azz-0.245
cy-0.243
issy-0.239
Token have
Token ID423
Feature activation+1.52e-01

Pos logit contributions
 saving+0.087
 reaching+0.086
 resisting+0.085
 insisting+0.085
 succeeding+0.083
Neg logit contributions
issy-0.186
suit-0.183
ppy-0.173
kin-0.171
cy-0.171
Token some
Token ID617
Feature activation-1.33e-02

Pos logit contributions
 reaching+0.180
 insisting+0.164
 resisting+0.161
 succeeding+0.156
 saving+0.156
Neg logit contributions
issy-0.361
suit-0.356
azz-0.333
acker-0.331
 folder-0.327
Token did
Token ID750
Feature activation+2.02e-01

Pos logit contributions
suit+0.212
issy+0.203
azz+0.191
pp+0.190
uffy+0.188
Neg logit contributions
 going-0.218
 not-0.218
 having-0.210
 helping-0.207
 saving-0.206
Token a
Token ID257
Feature activation-8.78e-02

Pos logit contributions
 insisting+0.186
 resisting+0.181
 reaching+0.173
 helping+0.170
 asking+0.162
Neg logit contributions
suit-0.486
azz-0.474
issy-0.473
iet-0.473
 folder-0.467
Token good
Token ID922
Feature activation-4.93e-03

Pos logit contributions
suit+0.226
issy+0.224
pp+0.214
azz+0.213
acker+0.210
Neg logit contributions
 not-0.102
 going-0.101
 helping-0.099
 reaching-0.092
 saving-0.092
Token job
Token ID1693
Feature activation+2.30e-02

Pos logit contributions
suit+0.010
issy+0.010
azz+0.009
acker+0.009
pp+0.009
Neg logit contributions
 going-0.008
 not-0.008
 helping-0.008
 reaching-0.008
 saving-0.008
Token and
Token ID290
Feature activation+3.72e-02

Pos logit contributions
 insisting+0.023
 reaching+0.023
 saving+0.022
 resisting+0.022
 himself+0.022
Neg logit contributions
suit-0.055
issy-0.052
azz-0.050
pp-0.050
y-0.050
Token they
Token ID484
Feature activation-1.19e-01

Pos logit contributions
 reaching+0.045
 helping+0.045
 going+0.045
 saving+0.044
 insisting+0.043
Neg logit contributions
suit-0.089
issy-0.086
azz-0.082
pp-0.080
acker-0.080
Token can
Token ID460
Feature activation+1.76e-01

Pos logit contributions
suit+0.300
issy+0.287
azz+0.279
pp+0.277
uffy+0.277
Neg logit contributions
 not-0.143
 going-0.141
 having-0.126
 helping-0.125
 reaching-0.124
Token go
Token ID467
Feature activation+1.88e-01

Pos logit contributions
 saving+0.222
 helping+0.221
 going+0.219
 insisting+0.219
 reaching+0.218
Neg logit contributions
suit-0.402
issy-0.401
azz-0.384
acker-0.379
uffy-0.375
Token outside
Token ID2354
Feature activation+1.05e-01

Pos logit contributions
 succeeding+0.188
 resisting+0.187
 reaching+0.187
 helping+0.182
 insisting+0.180
Neg logit contributions
suit-0.502
issy-0.462
azz-0.456
pp-0.455
cy-0.454
Token now
Token ID783
Feature activation+1.02e-01

Pos logit contributions
 reaching+0.125
 insisting+0.125
 saving+0.125
 succeeding+0.124
 resisting+0.124
Neg logit contributions
suit-0.236
issy-0.222
cy-0.220
y-0.220
azz-0.215
Token.
Token ID13
Feature activation-8.76e-03

Pos logit contributions
 saving+0.099
 reaching+0.098
 helping+0.098
 going+0.096
 insisting+0.096
Neg logit contributions
suit-0.265
issy-0.254
azz-0.244
uffy-0.242
acker-0.241
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
 reaching+0.053
 helping+0.052
 resisting+0.050
 insisting+0.050
 succeeding+0.050
Neg logit contributions
suit-0.164
issy-0.154
pp-0.151
azz-0.149
acker-0.147
Token Tim
Token ID5045
Feature activation-2.26e-01

Pos logit contributions
suit+0.043
issy+0.042
pp+0.042
acker+0.040
iet+0.040
Neg logit contributions
 not-0.016
 having-0.015
 going-0.015
 helping-0.015
 reaching-0.015
Token was
Token ID373
Feature activation+1.03e-01

Pos logit contributions
suit+0.433
issy+0.406
pp+0.392
azz+0.389
acker+0.385
Neg logit contributions
 not-0.411
 going-0.404
 having-0.399
 insisting-0.390
 reaching-0.388
Token sad
Token ID6507
Feature activation+1.05e-01

Pos logit contributions
 reaching+0.135
 helping+0.132
 saving+0.130
 succeeding+0.130
 resisting+0.125
Neg logit contributions
suit-0.197
 tant-0.183
issy-0.178
coat-0.177
ppy-0.177
Token.
Token ID13
Feature activation-2.30e-02

Pos logit contributions
 saving+0.143
 reaching+0.140
 going+0.140
 insisting+0.137
 succeeding+0.136
Neg logit contributions
suit-0.229
issy-0.221
azz-0.218
uffy-0.216
y-0.211
Token The
Token ID383
Feature activation-1.92e-01

Pos logit contributions
suit+0.051
issy+0.050
azz+0.048
uffy+0.047
acker+0.047
Neg logit contributions
 going-0.032
 helping-0.032
 saving-0.031
 reaching-0.031
 insisting-0.030
Token pigeon
Token ID46387
Feature activation-2.25e-01

Pos logit contributions
suit+0.418
issy+0.407
azz+0.388
pp+0.385
acker+0.384
Neg logit contributions
 going-0.287
 not-0.285
 helping-0.284
 reaching-0.280
 saving-0.278
Token wanted
Token ID2227
Feature activation+2.31e-01

Pos logit contributions
suit+0.477
issy+0.462
azz+0.437
acker+0.432
pp+0.431
Neg logit contributions
 going-0.335
 helping-0.333
 reaching-0.331
 saving-0.330
 not-0.325
Token to
Token ID284
Feature activation+9.68e-02

Pos logit contributions
 reaching+0.135
 succeeding+0.134
 insisting+0.134
 resisting+0.128
 going+0.117
Neg logit contributions
suit-0.662
issy-0.660
azz-0.632
 folder-0.624
affe-0.621
Token help
Token ID1037
Feature activation+1.91e-01

Pos logit contributions
 going+0.112
 helping+0.111
 reaching+0.111
 saving+0.110
 insisting+0.108
Neg logit contributions
suit-0.237
issy-0.232
azz-0.221
acker-0.220
pp-0.219
Token Tim
Token ID5045
Feature activation-1.85e-01

Pos logit contributions
 going+0.205
 insisting+0.203
 reaching+0.200
 helping+0.199
 saving+0.198
Neg logit contributions
suit-0.468
issy-0.464
azz-0.439
acker-0.433
uffy-0.432
Token the
Token ID262
Feature activation-2.29e-01

Pos logit contributions
 going+0.007
 not+0.007
 helping+0.007
 reaching+0.006
 saving+0.006
Neg logit contributions
suit-0.039
issy-0.038
azz-0.037
acker-0.037
pp-0.036
Token family
Token ID1641
Feature activation-1.82e-01

Pos logit contributions
suit+0.463
issy+0.446
azz+0.433
pp+0.432
acker+0.430
Neg logit contributions
 going-0.394
 not-0.389
 helping-0.387
 next-0.375
 having-0.374
Token was
Token ID373
Feature activation+1.26e-01

Pos logit contributions
suit+0.383
issy+0.375
azz+0.351
pp+0.347
acker+0.346
Neg logit contributions
 going-0.274
 reaching-0.274
 helping-0.273
 not-0.271
 saving-0.270
Token happy
Token ID3772
Feature activation+1.11e-01

Pos logit contributions
 reaching+0.140
 succeeding+0.137
 resisting+0.133
 Listen+0.133
 insisting+0.133
Neg logit contributions
suit-0.258
issy-0.257
c-0.247
y-0.245
uffy-0.245
Token that
Token ID326
Feature activation-4.11e-02

Pos logit contributions
 insisting+0.124
 reaching+0.115
 saving+0.112
 resisting+0.111
 succeeding+0.110
Neg logit contributions
suit-0.251
y-0.246
issy-0.235
uffy-0.233
cy-0.232
Token they
Token ID484
Feature activation-1.23e-01

Pos logit contributions
suit+0.109
issy+0.108
azz+0.102
uffy+0.101
acker+0.100
Neg logit contributions
 reaching-0.036
 saving-0.035
 going-0.034
 not-0.034
 helping-0.034
Token had
Token ID550
Feature activation+1.07e-01

Pos logit contributions
suit+0.260
issy+0.249
azz+0.241
acker+0.237
uffy+0.236
Neg logit contributions
 going-0.196
 not-0.193
 helping-0.190
 saving-0.186
 having-0.186
Token prevented
Token ID13351
Feature activation+2.61e-01

Pos logit contributions
 reaching+0.161
 insisting+0.157
 resisting+0.151
 encouraging+0.148
 succeeding+0.147
Neg logit contributions
suit-0.213
issy-0.202
acker-0.191
ocado-0.189
azz-0.188
Token the
Token ID262
Feature activation-1.29e-01

Pos logit contributions
 reaching+0.183
 helping+0.176
 saving+0.169
 going+0.165
 insisting+0.160
Neg logit contributions
issy-0.717
suit-0.716
azz-0.683
uffy-0.683
ppy-0.671
Token bad
Token ID2089
Feature activation-1.92e-02

Pos logit contributions
suit+0.227
issy+0.216
azz+0.206
pp+0.204
acker+0.203
Neg logit contributions
 going-0.255
 not-0.250
 helping-0.249
 reaching-0.245
 saving-0.243
Token thing
Token ID1517
Feature activation-8.53e-02

Pos logit contributions
suit+0.031
issy+0.030
azz+0.028
acker+0.027
pp+0.027
Neg logit contributions
 going-0.039
 not-0.039
 helping-0.039
 reaching-0.039
 saving-0.038
Token that
Token ID326
Feature activation-4.57e-02

Pos logit contributions
 reaching+0.068
 succeeding+0.061
 saving+0.061
 having+0.061
 insisting+0.060
Neg logit contributions
suit-0.165
issy-0.163
ite-0.160
ara-0.158
uffy-0.158
Token day
Token ID1110
Feature activation-1.82e-02

Pos logit contributions
issy+0.066
suit+0.065
acker+0.062
y+0.062
pp+0.061
Neg logit contributions
 reaching-0.093
 saving-0.092
 himself-0.089
 going-0.089
 resisting-0.088
Token on
Token ID319
Feature activation+4.20e-02

Pos logit contributions
issy+0.045
suit+0.042
azz+0.042
uffy+0.041
ppy+0.041
Neg logit contributions
 reaching-0.020
 succeeding-0.018
 resisting-0.018
 having-0.018
 insisting-0.018
Token,
Token ID11
Feature activation+2.26e-02

Pos logit contributions
 going+0.047
 reaching+0.046
 not+0.046
 helping+0.046
 saving+0.045
Neg logit contributions
suit-0.106
issy-0.104
azz-0.099
uffy-0.098
acker-0.097
Token Brown
Token ID4373
Feature activation-1.83e-01

Pos logit contributions
 going+0.036
 reaching+0.036
 not+0.035
 helping+0.035
 saving+0.035
Neg logit contributions
suit-0.046
issy-0.045
azz-0.042
pp-0.042
acker-0.042
Tokenie
Token ID494
Feature activation-1.90e-01

Pos logit contributions
suit+0.246
azz+0.187
coat+0.186
issy+0.185
acker+0.184
Neg logit contributions
 helping-0.459
 not-0.454
 going-0.451
 having-0.450
 saving-0.440
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
issy+0.382
suit+0.366
pp+0.356
azz+0.355
pillar+0.355
Neg logit contributions
 reaching-0.295
 insisting-0.286
 saving-0.276
 going-0.272
 resisting-0.268
Token his
Token ID465
Feature activation-2.03e-01

Pos logit contributions
 helping+0.006
 going+0.006
 not+0.005
 having+0.005
 saving+0.005
Neg logit contributions
suit-0.038
issy-0.036
acker-0.036
azz-0.035
pp-0.035
Token friends
Token ID2460
Feature activation-1.06e-01

Pos logit contributions
suit+0.489
issy+0.475
uffy+0.457
acker+0.449
azz+0.449
Neg logit contributions
 going-0.244
 reaching-0.242
 not-0.239
 saving-0.236
 helping-0.234
Token always
Token ID1464
Feature activation+1.36e-01

Pos logit contributions
suit+0.219
issy+0.212
azz+0.201
acker+0.199
pp+0.198
Neg logit contributions
 going-0.170
 not-0.168
 helping-0.168
 reaching-0.167
 saving-0.165
Token shared
Token ID4888
Feature activation+2.01e-01

Pos logit contributions
 reaching+0.219
 going+0.210
 saving+0.210
 insisting+0.210
 not+0.209
Neg logit contributions
suit-0.269
issy-0.250
pp-0.249
cy-0.243
acker-0.242
Token little
Token ID1310
Feature activation-1.89e-01

Pos logit contributions
issy+0.370
pp+0.360
suit+0.360
azz+0.356
ems+0.355
Neg logit contributions
 helping-0.177
 not-0.175
 going-0.174
 next-0.171
 taking-0.162
Token girl
Token ID2576
Feature activation-2.63e-01

Pos logit contributions
issy+0.399
abs+0.384
azz+0.379
ems+0.374
acker+0.372
Neg logit contributions
 next-0.330
 helping-0.330
 the-0.320
 to-0.308
 looking-0.295
Token named
Token ID3706
Feature activation+2.48e-02

Pos logit contributions
suit+0.550
issy+0.527
azz+0.503
acker+0.502
pp+0.494
Neg logit contributions
 going-0.421
 helping-0.420
 not-0.419
 reaching-0.409
 saving-0.409
Token Lily
Token ID20037
Feature activation-2.43e-01

Pos logit contributions
 not+0.041
 going+0.041
 having+0.040
 helping+0.039
 reaching+0.039
Neg logit contributions
suit-0.052
issy-0.049
pp-0.047
azz-0.047
acker-0.047
Token.
Token ID13
Feature activation-3.09e-02

Pos logit contributions
suit+0.715
issy+0.692
azz+0.667
acker+0.665
pp+0.661
Neg logit contributions
 going-0.187
 not-0.184
 helping-0.181
 reaching-0.173
 saving-0.172
Token Lily
Token ID20037
Feature activation-2.44e-01

Pos logit contributions
suit+0.074
issy+0.070
pp+0.068
azz+0.066
acker+0.066
Neg logit contributions
 having-0.041
 helping-0.041
 not-0.040
 going-0.040
 reaching-0.039
Token loved
Token ID6151
Feature activation+1.94e-01

Pos logit contributions
suit+0.615
issy+0.588
azz+0.575
acker+0.571
pp+0.568
Neg logit contributions
 not-0.307
 going-0.301
 helping-0.288
 having-0.287
 saving-0.272
Token to
Token ID284
Feature activation+5.90e-02

Pos logit contributions
 resisting+0.116
 reaching+0.116
 not+0.114
 insisting+0.114
 succeeding+0.111
Neg logit contributions
issy-0.523
 folder-0.515
 skirt-0.510
suit-0.509
coat-0.507
Token play
Token ID711
Feature activation+1.32e-01

Pos logit contributions
 saving+0.090
 going+0.089
 reaching+0.089
 not+0.089
 helping+0.089
Neg logit contributions
suit-0.123
issy-0.121
azz-0.114
acker-0.112
pp-0.112
Token outside
Token ID2354
Feature activation+6.42e-02

Pos logit contributions
 going+0.116
 helping+0.115
 not+0.114
 reaching+0.111
 saving+0.110
Neg logit contributions
suit-0.373
issy-0.365
azz-0.350
acker-0.349
pp-0.346
Token in
Token ID287
Feature activation+9.25e-03

Pos logit contributions
 insisting+0.069
 saving+0.069
 resisting+0.068
 not+0.067
 succeeding+0.067
Neg logit contributions
suit-0.156
issy-0.151
cy-0.146
y-0.145
uffy-0.143
Token day
Token ID1110
Feature activation-3.08e-02

Pos logit contributions
issy+0.118
azz+0.114
ppy+0.111
pp+0.110
acker+0.110
Neg logit contributions
 going-0.070
 helping-0.069
 to-0.069
 next-0.067
 reaching-0.066
Token.
Token ID13
Feature activation-1.84e-02

Pos logit contributions
suit+0.077
issy+0.073
azz+0.069
y+0.068
acker+0.068
Neg logit contributions
 insisting-0.033
 resisting-0.033
 not-0.032
 reaching-0.032
 succeeding-0.032
Token Wally
Token ID50117
Feature activation-2.25e-01

Pos logit contributions
suit+0.034
acker+0.029
azz+0.029
affe+0.028
pillar+0.028
Neg logit contributions
 saving-0.035
 helping-0.034
 going-0.034
 reaching-0.033
 resisting-0.033
Token ran
Token ID4966
Feature activation+1.37e-01

Pos logit contributions
suit+0.503
issy+0.491
azz+0.467
acker+0.463
pp+0.460
Neg logit contributions
 going-0.317
 not-0.315
 helping-0.311
 reaching-0.309
 saving-0.304
Token around
Token ID1088
Feature activation+6.29e-02

Pos logit contributions
 resisting+0.154
 saving+0.152
 helping+0.152
 succeeding+0.151
 insisting+0.149
Neg logit contributions
suit-0.352
coat-0.313
suits-0.308
cy-0.306
azz-0.303
Token the
Token ID262
Feature activation-2.42e-01

Pos logit contributions
 insisting+0.088
 resisting+0.084
 not+0.083
 succeeding+0.082
 saving+0.082
Neg logit contributions
suit-0.115
cy-0.109
y-0.108
ro-0.101
 skirt-0.097
Token garden
Token ID11376
Feature activation-1.08e-01

Pos logit contributions
suit+0.426
issy+0.414
azz+0.389
pp+0.383
acker+0.383
Neg logit contributions
 going-0.461
 not-0.456
 helping-0.456
 reaching-0.452
 saving-0.448
Token,
Token ID11
Feature activation+1.97e-02

Pos logit contributions
suit+0.224
y+0.207
coat+0.204
suits+0.192
cy+0.191
Neg logit contributions
 insisting-0.159
 resisting-0.149
 succeeding-0.146
 going-0.138
 Listen-0.137
Token smiling
Token ID16755
Feature activation+1.11e-01

Pos logit contributions
 succeeding+0.018
 Listen+0.018
 insisting+0.017
 resisting+0.017
 himself+0.017
Neg logit contributions
suit-0.049
coat-0.046
 pad-0.043
y-0.043
pp-0.043
Token and
Token ID290
Feature activation+3.05e-02

Pos logit contributions
 insisting+0.134
 saving+0.134
 resisting+0.133
 reaching+0.132
 helping+0.132
Neg logit contributions
suit-0.275
issy-0.245
azz-0.241
y-0.239
pp-0.239
Token laughing
Token ID14376
Feature activation+1.65e-01

Pos logit contributions
 warning+0.029
 resisting+0.026
 saving+0.026
 Listen+0.026
 insisting+0.026
Neg logit contributions
suit-0.075
y-0.066
cy-0.066
pp-0.066
coat-0.065
Token quickly
Token ID2952
Feature activation+6.53e-02

Pos logit contributions
suit+0.467
issy+0.425
acker+0.423
azz+0.421
 tant+0.413
Neg logit contributions
 not-0.249
 going-0.243
 having-0.238
 next-0.233
 helping-0.232
Token realized
Token ID6939
Feature activation+1.63e-01

Pos logit contributions
 going+0.093
 not+0.092
 helping+0.092
 reaching+0.091
 saving+0.090
Neg logit contributions
suit-0.145
issy-0.142
azz-0.134
acker-0.134
pp-0.133
Token how
Token ID703
Feature activation+8.10e-02

Pos logit contributions
 going+0.108
 helping+0.106
 not+0.106
 reaching+0.105
 saving+0.103
Neg logit contributions
suit-0.483
issy-0.474
azz-0.457
pp-0.451
acker-0.450
Token boring
Token ID14262
Feature activation+3.09e-02

Pos logit contributions
 reaching+0.101
 saving+0.097
 insisting+0.095
 helping+0.091
 going+0.091
Neg logit contributions
suit-0.186
issy-0.185
azz-0.179
uffy-0.173
acker-0.171
Token the
Token ID262
Feature activation-2.00e-01

Pos logit contributions
 going+0.025
 not+0.024
 helping+0.023
 having+0.022
 to+0.022
Neg logit contributions
suit-0.092
acker-0.091
issy-0.090
pp-0.088
azz-0.087
Token girl
Token ID2576
Feature activation-2.32e-01

Pos logit contributions
issy+0.386
suit+0.386
pp+0.371
azz+0.369
acker+0.365
Neg logit contributions
 going-0.354
 helping-0.351
 next-0.346
 not-0.344
 having-0.342
Token was
Token ID373
Feature activation+1.31e-01

Pos logit contributions
suit+0.559
issy+0.539
acker+0.526
azz+0.523
pp+0.523
Neg logit contributions
 going-0.307
 not-0.307
 having-0.296
 helping-0.294
 reaching-0.285
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
 reaching+0.120
 saving+0.116
 insisting+0.115
 resisting+0.109
 succeeding+0.109
Neg logit contributions
suit-0.332
issy-0.327
azz-0.316
pp-0.309
 folder-0.309
Token He
Token ID679
Feature activation-1.27e-01

Pos logit contributions
suit+0.048
issy+0.047
azz+0.045
acker+0.044
pp+0.043
Neg logit contributions
 going-0.033
 helping-0.033
 not-0.033
 reaching-0.032
 saving-0.032
Token told
Token ID1297
Feature activation+1.67e-01

Pos logit contributions
suit+0.292
issy+0.284
azz+0.271
acker+0.267
pp+0.266
Neg logit contributions
 going-0.170
 helping-0.168
 reaching-0.167
 not-0.166
 saving-0.166
Token her
Token ID607
Feature activation-1.67e-01

Pos logit contributions
 going+0.232
 helping+0.229
 not+0.228
 reaching+0.228
 saving+0.226
Neg logit contributions
suit-0.375
issy-0.368
azz-0.350
pp-0.344
acker-0.343
Token␣
Token ID220
Feature activation-3.70e-02

Pos logit contributions
suit+0.019
acker+0.018
uffy+0.018
issy+0.017
azz+0.017
Neg logit contributions
 helping-0.012
 going-0.012
 saving-0.011
 insisting-0.011
 giving-0.011
Token\n
Token ID198
Feature activation-3.34e-02

Pos logit contributions
suit+0.099
issy+0.098
azz+0.095
acker+0.094
uffy+0.094
Neg logit contributions
 going-0.034
 helping-0.034
 reaching-0.033
 saving-0.033
 insisting-0.032
Token\n
Token ID198
Feature activation-4.09e-02

Pos logit contributions
suit+0.087
issy+0.085
azz+0.083
acker+0.083
uffy+0.082
Neg logit contributions
 going-0.034
 helping-0.033
 saving-0.033
 reaching-0.032
 insisting-0.031
TokenEvery
Token ID6109
Feature activation-2.35e-03

Pos logit contributions
suit+0.072
issy+0.067
acker+0.066
azz+0.064
uffy+0.064
Neg logit contributions
 helping-0.076
 going-0.076
 saving-0.076
 giving-0.074
 insisting-0.074
Token day
Token ID1110
Feature activation-3.61e-02

Pos logit contributions
suit+0.004
issy+0.004
acker+0.004
azz+0.004
uffy+0.004
Neg logit contributions
 saving-0.004
 reaching-0.004
 going-0.004
 helping-0.004
 not-0.004
Token Bob
Token ID5811
Feature activation-2.37e-01

Pos logit contributions
suit+0.078
cy+0.070
y+0.070
issy+0.069
affe+0.069
Neg logit contributions
 insisting-0.052
 resisting-0.051
 saving-0.051
 succeeding-0.050
 helping-0.050
Token tried
Token ID3088
Feature activation+2.48e-01

Pos logit contributions
suit+0.464
issy+0.441
azz+0.419
acker+0.417
igsaw+0.411
Neg logit contributions
 going-0.417
 to-0.415
 not-0.413
 helping-0.401
 reaching-0.398
Token and
Token ID290
Feature activation+7.18e-02

Pos logit contributions
 insisting+0.174
 saving+0.172
 encouraging+0.162
 succeeding+0.160
 ordering+0.154
Neg logit contributions
y-0.581
 skirt-0.563
azz-0.561
 pad-0.553
 tant-0.552
Token tried
Token ID3088
Feature activation+2.68e-01

Pos logit contributions
 saving+0.107
 not+0.102
 helping+0.099
 teaching+0.099
 encouraging+0.099
Neg logit contributions
issy-0.138
ppy-0.134
suit-0.133
pp-0.132
acker-0.130
Token,
Token ID11
Feature activation+7.84e-02

Pos logit contributions
 insisting+0.256
 recommending+0.244
 saving+0.244
 ordering+0.233
 encouraging+0.232
Neg logit contributions
y-0.550
 pad-0.540
azz-0.532
 tant-0.525
suit-0.523
Token but
Token ID475
Feature activation+9.93e-02

Pos logit contributions
 saving+0.142
 teaching+0.135
 insisting+0.134
 succeeding+0.134
 helping+0.133
Neg logit contributions
suit-0.114
ppy-0.108
 pad-0.107
issy-0.107
azz-0.106
Token time
Token ID640
Feature activation+2.54e-02

Pos logit contributions
 saving+0.260
 resisting+0.251
 encouraging+0.250
 teaching+0.248
 succeeding+0.246
Neg logit contributions
suit-0.175
issy-0.172
kin-0.163
coat-0.155
acker-0.154
Token to
Token ID284
Feature activation+6.71e-02

Pos logit contributions
 succeeding+0.029
 resisting+0.025
 himself+0.024
 saving+0.024
 encouraging+0.024
Neg logit contributions
suit-0.062
 pad-0.057
y-0.056
suits-0.056
coat-0.054
Token go
Token ID467
Feature activation+1.70e-01

Pos logit contributions
 going+0.084
 not+0.082
 helping+0.079
 reaching+0.079
 saving+0.076
Neg logit contributions
suit-0.168
issy-0.162
azz-0.156
pp-0.154
acker-0.154
Token home
Token ID1363
Feature activation+1.58e-02

Pos logit contributions
 reaching+0.196
 succeeding+0.190
 helping+0.189
 not+0.189
 resisting+0.189
Neg logit contributions
suit-0.428
issy-0.404
azz-0.391
uffy-0.384
pp-0.383
Token.
Token ID13
Feature activation-2.71e-02

Pos logit contributions
 reaching+0.018
 succeeding+0.018
 insisting+0.017
 helping+0.017
 saving+0.017
Neg logit contributions
suit-0.038
issy-0.035
uffy-0.034
pp-0.034
azz-0.033
Token Tim
Token ID5045
Feature activation-2.32e-01

Pos logit contributions
suit+0.061
issy+0.060
pp+0.060
acker+0.058
uffy+0.056
Neg logit contributions
 not-0.037
 having-0.036
 going-0.036
 helping-0.036
 reaching-0.035
Token forgot
Token ID16453
Feature activation+1.09e-01

Pos logit contributions
suit+0.382
issy+0.356
pp+0.340
azz+0.339
acker+0.335
Neg logit contributions
 going-0.481
 not-0.478
 helping-0.465
 having-0.463
 insisting-0.459
Token to
Token ID284
Feature activation+4.84e-02

Pos logit contributions
 reaching+0.114
 saving+0.104
 insisting+0.102
 resisting+0.102
 helping+0.101
Neg logit contributions
suit-0.256
issy-0.252
azz-0.248
 folder-0.243
 skirt-0.240
Token remove
Token ID4781
Feature activation+1.62e-01

Pos logit contributions
 going+0.062
 not+0.062
 helping+0.060
 reaching+0.060
 saving+0.059
Neg logit contributions
suit-0.116
issy-0.114
azz-0.109
acker-0.107
pp-0.107
Token the
Token ID262
Feature activation-1.81e-01

Pos logit contributions
 not+0.061
 going+0.061
 helping+0.057
 reaching+0.056
 saving+0.054
Neg logit contributions
suit-0.536
issy-0.528
azz-0.511
acker-0.508
pp-0.508
Token hat
Token ID6877
Feature activation-1.14e-01

Pos logit contributions
issy+0.333
suit+0.327
azz+0.321
acker+0.311
pp+0.309
Neg logit contributions
 going-0.341
 helping-0.334
 not-0.332
 saving-0.327
 reaching-0.326