TOP ACTIVATIONS
MAX = 0.263

, " I knew you 'd like it !" Emily was
He couldn 't believe he 'd been so adventurous .
Bob a treat , he 'd give a little shrug .
the big caul iflower she 'd eaten . The friend was
and all his friends he 'd meet in the story .
and very grateful . She 'd gotten her dinner after all
the best Halloween night he 'd ever had . <|endoftext|> Once
his heart , knowing he 'd never see him again .
very happy and knew he 'd never leave the house without
, mom my . I 'd love that !"
Sally so happy - she 'd found something no one else
mom my and decided he 'd never touch the bug again
can help you if you 'd like ." The shell was
bear was so happy he 'd been able to help the
She was so glad she 'd found her hoop , and
strawberries and I thought I 'd surprise you !" The bear
and was organized , he 'd get a reward . The
. Mom my said she 'd wait for Sally at the
was feeling tired , he 'd take a short break to
out that the noise he 'd heard was actually a baby

MIN ACTIVATIONS
MIN = -0.265

it and posted it on social media . L
box and saw a j igsaw puzzle inside . He paused
It was a big j igsaw puzzle and she wanted to
would take out a j igsaw puzzle and try to solve
there is a rh in oc eros in the park .
wait until the rh in oc eros is gone ."
to see the rh in oc eros anymore . He is
It is a rh in oc eros . It is bright
not see the rh in oc eros . They asked a
to enjoy eating her av oc ados with a smile on
to help the rh in oc eros . The zo oke
there were no other av oc ados . The
there was a rh in oc eros . The rh in
Finally , the rh in oc eros remembered that animals often
animal called a rh in oc eros . He had a
Mom says they are av oc ados . Sara wants to
They heard the rh in oc eros sn ore .
eros . The rh in oc eros looked sad and scared
away . The rh in oc eros felt che ery and
Lily loved to eat av oc ados , they were her

INTERVAL 0.131 - 0.263
CONTAINS 13.524%

different directions , but we must take turns . First ,
mom my if she could have it . Her mom my
park . One day , they saw a big dog .
Ben are happy too . They like to play with the
Lily was upset , but she couldn 't stay mad at

INTERVAL -0.001 - 0.131
CONTAINS 29.341%

, Mr . Bear . Would you like to taste these
the woods . It started to rain really hard . Sally
all the colourful flowers . Sometimes , he even saw a
under the blankets . He found his toy and w agged
was too big for him to carry and he screamed louder

INTERVAL -0.133 - -0.001
CONTAINS 53.859%

Sure , I can manage that ," replied the bald man
One day , Tim my went to the
egg . She was very sad and angry at Tim and
it was complete . Tom said , " That
a big adventure . He hopped and skipped

INTERVAL -0.265 - -0.133
CONTAINS 3.277%

the sack and a million tiny colorful butterflies flew out !
adventures . They used their m ixtures to make magic and
has brown feathers and a red thing on its head .
â s a special meeting . I have to
bird was happy . The wise owl helped the bird fix
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
igsaw+0.083
 disl+0.083
ummy+0.078
 batch+0.075
 chip+0.073
Neg logit contributions
'd-0.070
 managed-0.062
 doomed-0.058
pped-0.058
 beginning-0.056
Token "
Token ID366
Feature activation+3.94e-02

Pos logit contributions
 disl+0.240
igsaw+0.237
ummy+0.216
 batch+0.210
 chip+0.204
Neg logit contributions
'd-0.235
 managed-0.206
pped-0.196
 doomed-0.193
 beginning-0.193
TokenI
Token ID40
Feature activation+2.02e-01

Pos logit contributions
'd+0.059
 managed+0.049
 beginning+0.045
pped+0.045
 doomed+0.045
Neg logit contributions
igsaw-0.092
 disl-0.091
ummy-0.084
 batch-0.082
 chip-0.079
Token knew
Token ID2993
Feature activation+7.70e-02

Pos logit contributions
pped+0.260
'd+0.253
 beginning+0.252
 doomed+0.235
pered+0.233
Neg logit contributions
 disl-0.463
igsaw-0.401
 batch-0.392
oc-0.385
 chip-0.359
Token you
Token ID345
Feature activation+1.55e-01

Pos logit contributions
'd+0.138
 managed+0.119
 beginning+0.112
pped+0.111
 doomed+0.111
Neg logit contributions
igsaw-0.156
 disl-0.155
ummy-0.141
 batch-0.137
 chip-0.132
Token'd
Token ID1549
Feature activation+2.60e-01

Pos logit contributions
pped+0.172
amed+0.162
pered+0.157
 beginning+0.155
 doomed+0.154
Neg logit contributions
 disl-0.364
igsaw-0.329
ummy-0.327
mmm-0.295
 batch-0.295
Token like
Token ID588
Feature activation+3.34e-03

Pos logit contributions
pped+0.396
iers+0.353
'd+0.350
 forward+0.342
amed+0.338
Neg logit contributions
 disl-0.553
igsaw-0.474
 batch-0.449
 purpose-0.442
 common-0.435
Token it
Token ID340
Feature activation+1.14e-01

Pos logit contributions
'd+0.005
 managed+0.005
 beginning+0.004
pped+0.004
 doomed+0.004
Neg logit contributions
igsaw-0.007
 disl-0.007
ummy-0.007
 batch-0.006
 chip-0.006
Token!"
Token ID2474
Feature activation-4.53e-02

Pos logit contributions
'd+0.229
 managed+0.204
pped+0.193
 doomed+0.191
 beginning+0.190
Neg logit contributions
igsaw-0.202
 disl-0.199
ummy-0.185
 batch-0.176
 chip-0.170
Token Emily
Token ID17608
Feature activation+1.18e-02

Pos logit contributions
igsaw+0.100
 disl+0.099
ummy+0.088
 batch+0.085
mm+0.081
Neg logit contributions
'd-0.070
 managed-0.062
 finally-0.062
 never-0.061
 found-0.061
Token was
Token ID373
Feature activation+2.00e-01

Pos logit contributions
'd+0.021
 managed+0.018
pped+0.017
 beginning+0.017
 doomed+0.017
Neg logit contributions
 disl-0.024
igsaw-0.024
ummy-0.022
 batch-0.021
 chip-0.021
Token He
Token ID679
Feature activation+1.59e-01

Pos logit contributions
igsaw+0.074
 disl+0.072
 batch+0.065
ummy+0.065
 chip+0.061
Neg logit contributions
'd-0.069
 managed-0.059
 never-0.056
 finally-0.056
 beginning-0.055
Token couldn
Token ID3521
Feature activation+1.20e-01

Pos logit contributions
'd+0.203
pped+0.176
 beginning+0.175
 managed+0.167
 doomed+0.167
Neg logit contributions
 disl-0.377
igsaw-0.356
 batch-0.337
ummy-0.327
 chip-0.317
Token't
Token ID470
Feature activation+1.10e-01

Pos logit contributions
'd+0.178
 managed+0.150
 beginning+0.143
 remembered+0.138
 doomed+0.137
Neg logit contributions
igsaw-0.280
 disl-0.273
ummy-0.254
 chip-0.244
 batch-0.239
Token believe
Token ID1975
Feature activation-9.43e-02

Pos logit contributions
'd+0.202
pped+0.183
 managed+0.182
 doomed+0.174
amed+0.174
Neg logit contributions
 disl-0.204
igsaw-0.200
ummy-0.167
 chip-0.161
mmm-0.154
Token he
Token ID339
Feature activation+1.49e-01

Pos logit contributions
igsaw+0.202
 disl+0.201
ummy+0.184
 batch+0.179
 chip+0.172
Neg logit contributions
'd-0.158
 managed-0.135
 beginning-0.126
pped-0.125
 doomed-0.124
Token'd
Token ID1549
Feature activation+2.58e-01

Pos logit contributions
 doomed+0.163
iers+0.161
hes+0.159
pped+0.157
pered+0.154
Neg logit contributions
 disl-0.298
igsaw-0.293
ummy-0.283
 grand-0.269
 batch-0.268
Token been
Token ID587
Feature activation+2.12e-01

Pos logit contributions
iers+0.267
 forward+0.251
pped+0.249
 beginning+0.230
eper+0.230
Neg logit contributions
 disl-0.649
igsaw-0.568
 common-0.518
grand-0.486
 bou-0.481
Token so
Token ID523
Feature activation-5.76e-02

Pos logit contributions
'd+0.314
pped+0.251
 managed+0.225
 beginning+0.223
 doomed+0.223
Neg logit contributions
 disl-0.463
igsaw-0.449
 batch-0.397
ummy-0.383
 bou-0.368
Token adventurous
Token ID39770
Feature activation-2.10e-01

Pos logit contributions
 disl+0.122
igsaw+0.121
ummy+0.109
 batch+0.103
 chip+0.100
Neg logit contributions
'd-0.100
 managed-0.084
pped-0.081
 beginning-0.079
 doomed-0.078
Token.
Token ID13
Feature activation-2.59e-02

Pos logit contributions
igsaw+0.393
 disl+0.383
ummy+0.360
 batch+0.342
 chip+0.326
Neg logit contributions
'd-0.402
 managed-0.354
pped-0.337
 doomed-0.331
 beginning-0.330
Token
Token ID220
Feature activation+4.42e-02

Pos logit contributions
 disl+0.056
igsaw+0.055
 batch+0.050
ummy+0.050
 chip+0.048
Neg logit contributions
'd-0.042
 managed-0.035
 finally-0.034
 never-0.034
 stumbled-0.033
Token Bob
Token ID5811
Feature activation-1.48e-02

Pos logit contributions
'd+0.034
 managed+0.029
 beginning+0.027
pped+0.027
 never+0.027
Neg logit contributions
igsaw-0.047
 disl-0.046
ummy-0.042
 batch-0.041
 chip-0.040
Token a
Token ID257
Feature activation-6.89e-02

Pos logit contributions
igsaw+0.034
 disl+0.034
ummy+0.032
 batch+0.031
 chip+0.030
Neg logit contributions
'd-0.022
 managed-0.018
pped-0.017
 doomed-0.017
 beginning-0.017
Token treat
Token ID2190
Feature activation-1.32e-02

Pos logit contributions
igsaw+0.123
 disl+0.121
ummy+0.109
 batch+0.106
 chip+0.101
Neg logit contributions
'd-0.140
 managed-0.124
pped-0.117
 beginning-0.117
 doomed-0.116
Token,
Token ID11
Feature activation-1.23e-01

Pos logit contributions
igsaw+0.028
 disl+0.027
ummy+0.025
 batch+0.025
 chip+0.024
Neg logit contributions
'd-0.023
 managed-0.020
pped-0.018
 doomed-0.018
 beginning-0.018
Token he
Token ID339
Feature activation+1.50e-01

Pos logit contributions
igsaw+0.256
 disl+0.255
ummy+0.233
 batch+0.229
 chip+0.220
Neg logit contributions
'd-0.209
 managed-0.179
pped-0.169
 beginning-0.168
 doomed-0.168
Token'd
Token ID1549
Feature activation+2.58e-01

Pos logit contributions
'd+0.169
pped+0.157
 beginning+0.156
 doomed+0.151
 forward+0.139
Neg logit contributions
 disl-0.359
igsaw-0.344
 batch-0.337
ummy-0.326
 chip-0.324
Token give
Token ID1577
Feature activation+1.64e-02

Pos logit contributions
pped+0.323
'd+0.320
 doomed+0.317
 beginning+0.315
 forward+0.303
Neg logit contributions
 disl-0.579
igsaw-0.516
 batch-0.495
ummy-0.454
 chip-0.449
Token a
Token ID257
Feature activation-6.78e-02

Pos logit contributions
'd+0.027
 managed+0.023
 beginning+0.021
pped+0.021
 doomed+0.021
Neg logit contributions
igsaw-0.036
 disl-0.036
ummy-0.033
 batch-0.032
 chip-0.031
Token little
Token ID1310
Feature activation-1.78e-01

Pos logit contributions
 disl+0.126
igsaw+0.125
ummy+0.114
 batch+0.108
 chip+0.104
Neg logit contributions
'd-0.134
 managed-0.116
 beginning-0.109
 heard-0.107
 doomed-0.107
Token shrug
Token ID32545
Feature activation-5.52e-02

Pos logit contributions
igsaw+0.342
 disl+0.339
ummy+0.307
 batch+0.298
 chip+0.285
Neg logit contributions
'd-0.336
 managed-0.293
 beginning-0.275
pped-0.274
 doomed-0.273
Token.
Token ID13
Feature activation-2.53e-02

Pos logit contributions
igsaw+0.119
 disl+0.118
ummy+0.108
 batch+0.105
 chip+0.101
Neg logit contributions
'd-0.092
 managed-0.078
 beginning-0.073
pped-0.072
 doomed-0.072
Token the
Token ID262
Feature activation-1.04e-01

Pos logit contributions
igsaw+0.086
 disl+0.085
ummy+0.078
 batch+0.076
 chip+0.073
Neg logit contributions
'd-0.069
 managed-0.058
 beginning-0.056
 never-0.055
 finally-0.055
Token big
Token ID1263
Feature activation-1.86e-01

Pos logit contributions
igsaw+0.180
 disl+0.178
ummy+0.159
 batch+0.156
 chip+0.151
Neg logit contributions
'd-0.214
 managed-0.181
 beginning-0.181
 found-0.179
 heard-0.179
Token caul
Token ID46087
Feature activation-1.43e-01

Pos logit contributions
 disl+0.337
igsaw+0.331
ummy+0.290
 chip+0.289
 batch+0.288
Neg logit contributions
'd-0.367
 found-0.319
 heard-0.317
 came-0.316
 back-0.314
Tokeniflower
Token ID42642
Feature activation-2.32e-02

Pos logit contributions
 disl+0.215
igsaw+0.212
ummy+0.177
 batch+0.173
 chip+0.167
Neg logit contributions
'd-0.334
 managed-0.295
pped-0.282
 beginning-0.281
 never-0.280
Token she
Token ID673
Feature activation+1.64e-01

Pos logit contributions
igsaw+0.047
 disl+0.046
ummy+0.043
 batch+0.042
 chip+0.040
Neg logit contributions
'd-0.040
 managed-0.035
pped-0.034
 doomed-0.033
 beginning-0.033
Token'd
Token ID1549
Feature activation+2.58e-01

Pos logit contributions
 doomed+0.143
pped+0.140
'd+0.138
 beginning+0.136
 forward+0.132
Neg logit contributions
 disl-0.410
igsaw-0.390
 batch-0.371
ummy-0.367
con-0.353
Token eaten
Token ID17065
Feature activation+1.23e-02

Pos logit contributions
'd+0.238
 doomed+0.225
 beginning+0.222
 forward+0.222
pped+0.219
Neg logit contributions
 disl-0.676
igsaw-0.614
 batch-0.563
 bou-0.541
ummy-0.540
Token.
Token ID13
Feature activation-2.66e-02

Pos logit contributions
'd+0.021
 managed+0.018
 doomed+0.017
pped+0.017
 remembered+0.017
Neg logit contributions
igsaw-0.025
 disl-0.024
ummy-0.024
 batch-0.022
 chip-0.021
Token The
Token ID383
Feature activation-1.19e-01

Pos logit contributions
 disl+0.057
igsaw+0.056
ummy+0.051
 batch+0.050
 chip+0.048
Neg logit contributions
'd-0.041
 finally-0.037
 managed-0.035
 never-0.034
 found-0.034
Token friend
Token ID1545
Feature activation-6.03e-02

Pos logit contributions
 disl+0.227
igsaw+0.225
ummy+0.200
 batch+0.194
 chip+0.185
Neg logit contributions
'd-0.230
 managed-0.198
 beginning-0.191
 doomed-0.185
 never-0.185
Token was
Token ID373
Feature activation+1.99e-01

Pos logit contributions
igsaw+0.153
 disl+0.152
ummy+0.142
 batch+0.140
 chip+0.136
Neg logit contributions
'd-0.075
 managed-0.060
 beginning-0.056
pped-0.055
 doomed-0.055
Token and
Token ID290
Feature activation-6.59e-02

Pos logit contributions
igsaw+0.184
 disl+0.183
ummy+0.177
 chip+0.162
con+0.160
Neg logit contributions
'd-0.149
pped-0.133
 managed-0.130
 doomed-0.128
 beginning-0.126
Token all
Token ID477
Feature activation+9.45e-03

Pos logit contributions
 disl+0.148
igsaw+0.135
ummy+0.123
 batch+0.120
 chip+0.118
Neg logit contributions
'd-0.102
pped-0.091
 managed-0.085
 beginning-0.080
 doomed-0.080
Token his
Token ID465
Feature activation+1.34e-02

Pos logit contributions
'd+0.018
 managed+0.015
pped+0.015
 beginning+0.014
 doomed+0.014
Neg logit contributions
 disl-0.018
igsaw-0.018
ummy-0.016
 batch-0.016
 chip-0.015
Token friends
Token ID2460
Feature activation-5.01e-04

Pos logit contributions
'd+0.027
 managed+0.024
pped+0.024
 doomed+0.023
 beginning+0.023
Neg logit contributions
 disl-0.023
igsaw-0.023
ummy-0.021
 batch-0.020
 chip-0.019
Token he
Token ID339
Feature activation+1.53e-01

Pos logit contributions
 disl+0.001
igsaw+0.001
ummy+0.001
 batch+0.001
 chip+0.001
Neg logit contributions
'd-0.001
pped-0.001
 doomed-0.000
 managed-0.000
 beginning-0.000
Token'd
Token ID1549
Feature activation+2.57e-01

Pos logit contributions
pped+0.153
 beginning+0.149
pered+0.142
'd+0.135
 doomed+0.132
Neg logit contributions
 disl-0.357
igsaw-0.312
oc-0.307
ummy-0.296
 batch-0.295
Token meet
Token ID1826
Feature activation+2.86e-02

Pos logit contributions
pped+0.277
'd+0.262
 forward+0.242
 doomed+0.241
 beginning+0.238
Neg logit contributions
 disl-0.632
igsaw-0.572
 batch-0.497
ummy-0.493
 common-0.490
Token in
Token ID287
Feature activation-3.94e-02

Pos logit contributions
'd+0.046
 managed+0.039
pped+0.037
 beginning+0.036
 doomed+0.036
Neg logit contributions
igsaw-0.063
 disl-0.062
ummy-0.057
 batch-0.056
 chip-0.054
Token the
Token ID262
Feature activation-1.07e-01

Pos logit contributions
igsaw+0.079
 disl+0.079
ummy+0.072
 batch+0.070
 chip+0.067
Neg logit contributions
'd-0.071
 managed-0.062
pped-0.058
 doomed-0.057
 beginning-0.057
Token story
Token ID1621
Feature activation+3.66e-03

Pos logit contributions
igsaw+0.195
 disl+0.193
ummy+0.174
 batch+0.169
 chip+0.161
Neg logit contributions
'd-0.214
 managed-0.189
pped-0.179
 doomed-0.174
 beginning-0.174
Token.
Token ID13
Feature activation-2.79e-02

Pos logit contributions
'd+0.006
 managed+0.006
pped+0.005
 doomed+0.005
 beginning+0.005
Neg logit contributions
igsaw-0.008
 disl-0.007
ummy-0.007
 batch-0.007
 chip-0.006
Token and
Token ID290
Feature activation-5.28e-02

Pos logit contributions
 disl+0.240
igsaw+0.237
ummy+0.216
 batch+0.210
 chip+0.204
Neg logit contributions
'd-0.235
 managed-0.206
pped-0.196
 doomed-0.193
 beginning-0.193
Token very
Token ID845
Feature activation-1.07e-02

Pos logit contributions
 disl+0.111
igsaw+0.108
ummy+0.099
 batch+0.099
 chip+0.096
Neg logit contributions
'd-0.088
 managed-0.075
pped-0.073
 beginning-0.071
 doomed-0.071
Token grateful
Token ID14066
Feature activation-9.65e-02

Pos logit contributions
 disl+0.023
igsaw+0.022
ummy+0.020
 batch+0.020
 chip+0.019
Neg logit contributions
'd-0.018
pped-0.015
 managed-0.015
 beginning-0.014
 doomed-0.014
Token.
Token ID13
Feature activation-3.24e-02

Pos logit contributions
igsaw+0.206
 disl+0.205
ummy+0.188
 batch+0.183
 chip+0.176
Neg logit contributions
'd-0.162
 managed-0.139
pped-0.130
 beginning-0.129
 doomed-0.128
Token She
Token ID1375
Feature activation+1.68e-01

Pos logit contributions
 disl+0.066
igsaw+0.066
 batch+0.060
ummy+0.058
 chip+0.055
Neg logit contributions
'd-0.055
 managed-0.046
 finally-0.046
 never-0.046
 stumbled-0.045
Token'd
Token ID1549
Feature activation+2.56e-01

Pos logit contributions
'd+0.213
 beginning+0.201
pped+0.192
 forward+0.188
 doomed+0.188
Neg logit contributions
 disl-0.384
igsaw-0.352
 batch-0.344
ummy-0.332
 chip-0.326
Token gotten
Token ID7891
Feature activation+1.15e-01

Pos logit contributions
pped+0.321
iers+0.299
 beginning+0.279
'd+0.274
 doomed+0.269
Neg logit contributions
 disl-0.606
igsaw-0.527
 batch-0.495
grand-0.470
 chip-0.465
Token her
Token ID607
Feature activation+3.99e-02

Pos logit contributions
'd+0.195
 managed+0.150
pped+0.143
 doomed+0.140
 beginning+0.138
Neg logit contributions
 disl-0.245
igsaw-0.243
ummy-0.230
 batch-0.226
 chip-0.217
Token dinner
Token ID8073
Feature activation-4.82e-02

Pos logit contributions
'd+0.076
 beginning+0.065
 managed+0.064
 back+0.064
 never+0.063
Neg logit contributions
 disl-0.077
igsaw-0.076
ummy-0.067
 batch-0.065
 chip-0.062
Token after
Token ID706
Feature activation-6.68e-02

Pos logit contributions
igsaw+0.089
 disl+0.084
ummy+0.083
 batch+0.080
 chip+0.076
Neg logit contributions
'd-0.091
 managed-0.079
pped-0.078
 doomed-0.077
 beginning-0.075
Token all
Token ID477
Feature activation+1.30e-02

Pos logit contributions
igsaw+0.136
 disl+0.134
ummy+0.127
 batch+0.122
 chip+0.117
Neg logit contributions
'd-0.121
 managed-0.104
 doomed-0.095
 beginning-0.094
pped-0.094
Token the
Token ID262
Feature activation-1.07e-01

Pos logit contributions
'd+0.304
 managed+0.250
pped+0.242
 doomed+0.232
 heard+0.227
Neg logit contributions
 disl-0.435
igsaw-0.427
ummy-0.405
 batch-0.401
 chip-0.393
Token best
Token ID1266
Feature activation-7.61e-02

Pos logit contributions
igsaw+0.223
 disl+0.220
ummy+0.210
 batch+0.203
 chip+0.201
Neg logit contributions
'd-0.175
 managed-0.156
pped-0.151
 doomed-0.143
 stumbled-0.139
Token Halloween
Token ID16032
Feature activation-1.15e-01

Pos logit contributions
igsaw+0.121
 disl+0.120
ummy+0.105
 batch+0.102
 chip+0.097
Neg logit contributions
'd-0.171
 managed-0.152
 beginning-0.144
pped-0.143
 doomed-0.143
Token night
Token ID1755
Feature activation-9.46e-02

Pos logit contributions
igsaw+0.286
ummy+0.278
 disl+0.274
 batch+0.261
 chip+0.258
Neg logit contributions
'd-0.133
pped-0.114
 managed-0.112
 doomed-0.112
 beginning-0.107
Token he
Token ID339
Feature activation+1.54e-01

Pos logit contributions
igsaw+0.199
 disl+0.195
ummy+0.188
 batch+0.178
 chip+0.175
Neg logit contributions
'd-0.151
 managed-0.131
pped-0.127
 doomed-0.124
 beginning-0.120
Token'd
Token ID1549
Feature activation+2.56e-01

Pos logit contributions
 beginning+0.150
pped+0.137
pered+0.134
 doomed+0.131
iers+0.122
Neg logit contributions
 disl-0.357
igsaw-0.337
 batch-0.326
ummy-0.322
corn-0.322
Token ever
Token ID1683
Feature activation+6.66e-02

Pos logit contributions
iers+0.269
'd+0.258
pped+0.256
 beginning+0.251
 forward+0.243
Neg logit contributions
 disl-0.620
igsaw-0.560
 batch-0.516
grand-0.490
 common-0.490
Token had
Token ID550
Feature activation+2.19e-01

Pos logit contributions
pped+0.079
'd+0.075
 doomed+0.072
pered+0.070
 forward+0.070
Neg logit contributions
 disl-0.154
igsaw-0.146
 batch-0.134
ummy-0.133
 chip-0.127
Token.
Token ID13
Feature activation-2.73e-02

Pos logit contributions
'd+0.367
 managed+0.305
 beginning+0.299
 doomed+0.299
pped+0.293
Neg logit contributions
 disl-0.453
igsaw-0.447
ummy-0.411
 batch-0.403
 chip-0.377
Token<|endoftext|>
Token ID50256
Feature activation-8.26e-02

Pos logit contributions
 disl+0.058
igsaw+0.057
ummy+0.052
 batch+0.052
 chip+0.049
Neg logit contributions
'd-0.046
 managed-0.039
 never-0.038
 finally-0.038
 been-0.037
TokenOnce
Token ID7454
Feature activation-1.11e-01

Pos logit contributions
igsaw+0.132
 disl+0.124
ummy+0.117
 batch+0.109
 chip+0.106
Neg logit contributions
'd-0.181
 managed-0.163
 beginning-0.155
 doomed-0.153
pped-0.152
Token his
Token ID465
Feature activation+1.34e-02

Pos logit contributions
igsaw+0.072
 disl+0.070
ummy+0.063
 batch+0.062
 chip+0.059
Neg logit contributions
'd-0.070
 managed-0.061
pped-0.057
 beginning-0.056
 doomed-0.056
Token heart
Token ID2612
Feature activation-1.59e-02

Pos logit contributions
'd+0.024
 managed+0.021
 back+0.021
 found+0.021
 beginning+0.021
Neg logit contributions
 disl-0.026
igsaw-0.026
ummy-0.024
 batch-0.022
 chip-0.022
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
igsaw+0.033
 disl+0.033
ummy+0.030
 batch+0.029
 chip+0.028
Neg logit contributions
'd-0.028
 managed-0.024
 beginning-0.022
pped-0.022
 never-0.022
Token knowing
Token ID6970
Feature activation+2.73e-02

Pos logit contributions
 disl+0.226
igsaw+0.218
 batch+0.200
ummy+0.198
 chip+0.196
Neg logit contributions
'd-0.232
 managed-0.200
pped-0.199
 doomed-0.192
 beginning-0.189
Token he
Token ID339
Feature activation+1.53e-01

Pos logit contributions
'd+0.058
 managed+0.051
pped+0.049
 doomed+0.049
 beginning+0.049
Neg logit contributions
 disl-0.045
igsaw-0.044
ummy-0.040
 batch-0.038
 chip-0.037
Token'd
Token ID1549
Feature activation+2.56e-01

Pos logit contributions
pered+0.233
iers+0.214
pped+0.210
bridge+0.194
amed+0.192
Neg logit contributions
oc-0.255
 disl-0.236
grand-0.229
 grand-0.222
corn-0.222
Token never
Token ID1239
Feature activation+1.40e-01

Pos logit contributions
pped+0.304
 forward+0.283
iers+0.281
amed+0.269
pered+0.260
Neg logit contributions
 disl-0.602
igsaw-0.502
 common-0.483
grand-0.470
 grand-0.460
Token see
Token ID766
Feature activation+6.98e-02

Pos logit contributions
pped+0.171
'd+0.170
 forward+0.154
amed+0.149
bridge+0.142
Neg logit contributions
 disl-0.318
igsaw-0.291
ummy-0.259
 chip-0.255
 batch-0.254
Token him
Token ID683
Feature activation+1.28e-01

Pos logit contributions
'd+0.111
 managed+0.093
pped+0.086
 beginning+0.085
 doomed+0.085
Neg logit contributions
igsaw-0.155
 disl-0.155
ummy-0.143
 batch-0.139
 chip-0.134
Token again
Token ID757
Feature activation-1.33e-02

Pos logit contributions
'd+0.224
 managed+0.201
pped+0.193
 doomed+0.184
amed+0.180
Neg logit contributions
igsaw-0.240
 disl-0.233
ummy-0.223
 batch-0.206
 chip-0.201
Token.
Token ID13
Feature activation-2.89e-02

Pos logit contributions
 disl+0.026
igsaw+0.026
ummy+0.024
 batch+0.023
 chip+0.022
Neg logit contributions
'd-0.024
 managed-0.021
pped-0.020
 doomed-0.019
 beginning-0.019
Token very
Token ID845
Feature activation-1.08e-02

Pos logit contributions
'd+0.321
pped+0.252
 managed+0.236
 stumbled+0.224
 doomed+0.224
Neg logit contributions
 disl-0.382
 batch-0.365
igsaw-0.359
 chip-0.355
ummy-0.340
Token happy
Token ID3772
Feature activation-1.03e-01

Pos logit contributions
 disl+0.022
igsaw+0.021
ummy+0.020
 batch+0.019
 chip+0.019
Neg logit contributions
'd-0.019
pped-0.016
 managed-0.016
 beginning-0.016
 doomed-0.015
Token and
Token ID290
Feature activation-6.37e-02

Pos logit contributions
igsaw+0.200
 disl+0.199
ummy+0.182
 batch+0.176
 chip+0.169
Neg logit contributions
'd-0.192
 managed-0.166
pped-0.157
 beginning-0.156
 doomed-0.155
Token knew
Token ID2993
Feature activation+6.94e-02

Pos logit contributions
 disl+0.139
igsaw+0.131
ummy+0.123
 batch+0.119
 chip+0.119
Neg logit contributions
'd-0.102
pped-0.087
 beginning-0.084
 managed-0.083
 doomed-0.082
Token he
Token ID339
Feature activation+1.53e-01

Pos logit contributions
'd+0.136
 managed+0.118
pped+0.112
 beginning+0.112
 doomed+0.111
Neg logit contributions
igsaw-0.130
 disl-0.129
ummy-0.117
 batch-0.112
 chip-0.108
Token'd
Token ID1549
Feature activation+2.56e-01

Pos logit contributions
pered+0.203
pped+0.188
iers+0.184
 beginning+0.182
 doomed+0.168
Neg logit contributions
 disl-0.296
oc-0.269
corn-0.267
grand-0.262
ci-0.260
Token never
Token ID1239
Feature activation+1.38e-01

Pos logit contributions
pped+0.307
 forward+0.303
iers+0.299
amed+0.279
pered+0.278
Neg logit contributions
 disl-0.618
igsaw-0.537
 batch-0.503
 chip-0.487
 common-0.485
Token leave
Token ID2666
Feature activation+8.04e-03

Pos logit contributions
'd+0.196
pped+0.190
 forward+0.179
amed+0.176
 doomed+0.170
Neg logit contributions
 disl-0.304
igsaw-0.286
 batch-0.251
 chip-0.250
ummy-0.249
Token the
Token ID262
Feature activation-1.07e-01

Pos logit contributions
'd+0.014
 managed+0.012
pped+0.011
 doomed+0.011
 beginning+0.011
Neg logit contributions
igsaw-0.017
 disl-0.017
ummy-0.016
 batch-0.015
 chip-0.014
Token house
Token ID2156
Feature activation+4.56e-03

Pos logit contributions
 disl+0.188
igsaw+0.186
ummy+0.165
 batch+0.159
 chip+0.150
Neg logit contributions
'd-0.220
 managed-0.191
 beginning-0.186
 never-0.182
 doomed-0.182
Token without
Token ID1231
Feature activation+2.42e-02

Pos logit contributions
'd+0.008
 managed+0.007
pped+0.006
 doomed+0.006
 remembered+0.006
Neg logit contributions
igsaw-0.009
 disl-0.009
ummy-0.008
 batch-0.008
 chip-0.008
Token,
Token ID11
Feature activation-1.28e-01

Pos logit contributions
'd+0.300
 managed+0.261
 beginning+0.250
 never+0.248
 finally+0.244
Neg logit contributions
 disl-0.280
igsaw-0.276
ummy-0.241
 batch-0.238
 chip-0.227
Token mom
Token ID1995
Feature activation-2.05e-02

Pos logit contributions
igsaw+0.220
 disl+0.220
ummy+0.189
 batch+0.184
con+0.171
Neg logit contributions
'd-0.270
 managed-0.234
 never-0.228
 finally-0.226
 back-0.223
Tokenmy
Token ID1820
Feature activation+7.10e-02

Pos logit contributions
 disl+0.039
igsaw+0.038
ummy+0.035
 batch+0.034
 chip+0.033
Neg logit contributions
'd-0.039
 managed-0.034
 doomed-0.033
pped-0.033
 beginning-0.032
Token.
Token ID13
Feature activation-3.31e-02

Pos logit contributions
'd+0.148
 managed+0.132
pped+0.127
 doomed+0.127
 beginning+0.127
Neg logit contributions
 disl-0.119
igsaw-0.115
ummy-0.108
 batch-0.103
 chip-0.101
Token I
Token ID314
Feature activation+2.04e-01

Pos logit contributions
igsaw+0.072
 disl+0.070
 batch+0.067
ummy+0.063
con+0.061
Neg logit contributions
'd-0.054
 never-0.043
 managed-0.042
 could-0.041
 finally-0.040
Token'd
Token ID1549
Feature activation+2.56e-01

Pos logit contributions
pped+0.308
'd+0.296
 beginning+0.295
 doomed+0.289
 forward+0.280
Neg logit contributions
 disl-0.447
igsaw-0.377
 batch-0.363
oc-0.349
 chip-0.345
Token love
Token ID1842
Feature activation+2.25e-03

Pos logit contributions
'd+0.427
pped+0.426
iers+0.403
 managed+0.381
 doomed+0.379
Neg logit contributions
 disl-0.517
igsaw-0.425
 batch-0.412
 bou-0.380
 chip-0.371
Token that
Token ID326
Feature activation-3.55e-02

Pos logit contributions
'd+0.003
 never+0.003
 been+0.003
 managed+0.003
 back+0.003
Neg logit contributions
igsaw-0.005
 disl-0.005
ummy-0.005
 batch-0.005
mm-0.004
Token!"
Token ID2474
Feature activation-5.08e-02

Pos logit contributions
igsaw+0.072
 disl+0.071
ummy+0.064
 batch+0.062
 chip+0.059
Neg logit contributions
'd-0.065
 managed-0.056
 beginning-0.053
 doomed-0.052
 never-0.052
Token
Token ID198
Feature activation-2.99e-02

Pos logit contributions
igsaw+0.119
 disl+0.113
ummy+0.101
 batch+0.095
izing+0.094
Neg logit contributions
'd-0.070
 finally-0.067
 came-0.067
 managed-0.064
 found-0.064
Token
Token ID198
Feature activation-2.97e-02

Pos logit contributions
igsaw+0.075
 disl+0.073
ummy+0.065
 batch+0.064
mm+0.062
Neg logit contributions
'd-0.038
 never-0.031
 finally-0.030
 found-0.030
 heard-0.030
Token Sally
Token ID25737
Feature activation+2.07e-02

Pos logit contributions
'd+0.129
 managed+0.109
 beginning+0.101
 finally+0.101
 never+0.101
Neg logit contributions
 disl-0.195
igsaw-0.194
ummy-0.176
 batch-0.173
 chip-0.167
Token so
Token ID523
Feature activation-5.97e-02

Pos logit contributions
'd+0.030
 managed+0.026
pped+0.026
 doomed+0.025
 beginning+0.024
Neg logit contributions
igsaw-0.047
 disl-0.047
ummy-0.043
 batch-0.043
 chip-0.042
Token happy
Token ID3772
Feature activation-1.01e-01

Pos logit contributions
 disl+0.127
igsaw+0.124
ummy+0.113
 batch+0.110
 chip+0.105
Neg logit contributions
'd-0.103
 managed-0.087
pped-0.087
 beginning-0.083
 doomed-0.081
Token -
Token ID532
Feature activation-2.70e-02

Pos logit contributions
igsaw+0.204
 disl+0.203
ummy+0.184
 batch+0.179
 chip+0.172
Neg logit contributions
'd-0.184
 managed-0.159
 beginning-0.149
pped-0.148
 doomed-0.148
Token she
Token ID673
Feature activation+1.62e-01

Pos logit contributions
igsaw+0.056
 disl+0.056
ummy+0.051
 batch+0.050
 chip+0.048
Neg logit contributions
'd-0.046
 managed-0.040
pped-0.038
 doomed-0.037
 beginning-0.037
Token'd
Token ID1549
Feature activation+2.56e-01

Pos logit contributions
 beginning+0.163
pped+0.161
 doomed+0.154
'd+0.148
iers+0.147
Neg logit contributions
 disl-0.387
igsaw-0.344
 batch-0.335
ummy-0.332
oc-0.324
Token found
Token ID1043
Feature activation+1.05e-01

Pos logit contributions
iers+0.298
pped+0.287
 forward+0.271
pered+0.259
amed+0.257
Neg logit contributions
 disl-0.620
igsaw-0.538
 batch-0.482
 common-0.473
grand-0.471
Token something
Token ID1223
Feature activation+2.14e-02

Pos logit contributions
'd+0.163
 managed+0.136
 beginning+0.126
pped+0.126
 doomed+0.125
Neg logit contributions
igsaw-0.239
 disl-0.237
ummy-0.219
 batch-0.214
 chip-0.206
Token no
Token ID645
Feature activation+1.38e-02

Pos logit contributions
'd+0.040
 managed+0.035
 beginning+0.033
 never+0.033
 quite+0.032
Neg logit contributions
igsaw-0.042
 disl-0.042
ummy-0.037
 batch-0.036
 chip-0.035
Token one
Token ID530
Feature activation-4.78e-02

Pos logit contributions
'd+0.030
 managed+0.027
pped+0.025
 doomed+0.025
 beginning+0.024
Neg logit contributions
igsaw-0.023
 disl-0.023
ummy-0.021
 batch-0.020
 chip-0.019
Token else
Token ID2073
Feature activation-3.52e-02

Pos logit contributions
 disl+0.109
igsaw+0.109
ummy+0.101
 batch+0.099
 chip+0.095
Neg logit contributions
'd-0.067
pped-0.058
 managed-0.057
 doomed-0.054
 beginning-0.054
Token mom
Token ID1995
Feature activation-1.94e-02

Pos logit contributions
'd+0.027
 back+0.023
 beginning+0.023
 managed+0.023
 finally+0.022
Neg logit contributions
 disl-0.028
igsaw-0.027
 batch-0.024
ummy-0.024
mm-0.023
Tokenmy
Token ID1820
Feature activation+7.46e-02

Pos logit contributions
igsaw+0.038
 disl+0.037
ummy+0.035
 batch+0.034
 chip+0.033
Neg logit contributions
'd-0.036
 managed-0.031
pped-0.030
 doomed-0.030
 beginning-0.029
Token and
Token ID290
Feature activation-5.93e-02

Pos logit contributions
'd+0.134
 managed+0.116
pped+0.110
 doomed+0.109
 beginning+0.107
Neg logit contributions
igsaw-0.150
 disl-0.148
ummy-0.137
 batch-0.133
 chip-0.129
Token decided
Token ID3066
Feature activation+1.07e-01

Pos logit contributions
 disl+0.126
igsaw+0.117
 batch+0.109
 chip+0.108
ummy+0.106
Neg logit contributions
'd-0.097
pped-0.083
 beginning-0.082
 doomed-0.081
 managed-0.077
Token he
Token ID339
Feature activation+1.53e-01

Pos logit contributions
'd+0.200
 managed+0.173
pped+0.169
 doomed+0.167
 beginning+0.164
Neg logit contributions
 disl-0.207
igsaw-0.197
ummy-0.178
 chip-0.170
 batch-0.169
Token'd
Token ID1549
Feature activation+2.55e-01

Pos logit contributions
 beginning+0.173
 doomed+0.163
pped+0.161
 forward+0.147
pered+0.147
Neg logit contributions
 disl-0.318
 chip-0.289
con-0.286
 batch-0.283
oc-0.277
Token never
Token ID1239
Feature activation+1.39e-01

Pos logit contributions
iers+0.339
pped+0.337
amed+0.326
 doomed+0.306
 beginning+0.300
Neg logit contributions
 disl-0.534
igsaw-0.467
 common-0.424
 batch-0.420
 bou-0.403
Token touch
Token ID3638
Feature activation-2.83e-02

Pos logit contributions
pped+0.208
'd+0.199
amed+0.191
 forward+0.185
 doomed+0.185
Neg logit contributions
 disl-0.284
igsaw-0.260
 chip-0.224
ummy-0.220
 pige-0.220
Token the
Token ID262
Feature activation-1.06e-01

Pos logit contributions
igsaw+0.065
 disl+0.064
ummy+0.059
 batch+0.058
 chip+0.055
Neg logit contributions
'd-0.043
 managed-0.036
 beginning-0.034
pped-0.034
 never-0.034
Token bug
Token ID5434
Feature activation-5.54e-02

Pos logit contributions
 disl+0.182
igsaw+0.181
ummy+0.159
 batch+0.154
mm+0.144
Neg logit contributions
'd-0.224
 beginning-0.193
 managed-0.192
 back-0.186
 heard-0.186
Token again
Token ID757
Feature activation-1.25e-02

Pos logit contributions
igsaw+0.122
 disl+0.120
ummy+0.112
 batch+0.111
 chip+0.107
Neg logit contributions
'd-0.088
 managed-0.075
 doomed-0.069
pped-0.068
 beginning-0.067
Token can
Token ID460
Feature activation+1.46e-01

Pos logit contributions
'd+0.246
pped+0.239
 beginning+0.236
 doomed+0.224
pered+0.206
Neg logit contributions
 disl-0.473
igsaw-0.435
 batch-0.403
oc-0.385
 chip-0.376
Token help
Token ID1037
Feature activation-2.53e-02

Pos logit contributions
'd+0.218
 managed+0.183
 beginning+0.169
 doomed+0.168
pped+0.167
Neg logit contributions
igsaw-0.339
 disl-0.338
ummy-0.312
 batch-0.303
 chip-0.295
Token you
Token ID345
Feature activation+1.52e-01

Pos logit contributions
igsaw+0.056
 disl+0.055
 batch+0.049
ummy+0.047
 chip+0.044
Neg logit contributions
'd-0.040
 never-0.034
 back-0.034
 managed-0.034
 quite-0.033
Token if
Token ID611
Feature activation-1.98e-02

Pos logit contributions
'd+0.264
 managed+0.226
 doomed+0.213
pped+0.211
 beginning+0.211
Neg logit contributions
igsaw-0.314
 disl-0.313
ummy-0.286
 batch-0.275
 chip-0.268
Token you
Token ID345
Feature activation+1.54e-01

Pos logit contributions
 disl+0.030
igsaw+0.029
ummy+0.027
 chip+0.025
 batch+0.025
Neg logit contributions
'd-0.044
 managed-0.040
pped-0.038
 beginning-0.038
 doomed-0.037
Token'd
Token ID1549
Feature activation+2.55e-01

Pos logit contributions
'd+0.203
 doomed+0.183
 beginning+0.181
pped+0.180
 managed+0.175
Neg logit contributions
 disl-0.349
igsaw-0.331
ummy-0.309
 batch-0.290
 chip-0.287
Token like
Token ID588
Feature activation+4.28e-04

Pos logit contributions
'd+0.361
pped+0.354
amed+0.316
 beginning+0.310
 doomed+0.308
Neg logit contributions
 disl-0.549
igsaw-0.493
 batch-0.459
 common-0.433
 chip-0.433
Token."
Token ID526
Feature activation-5.69e-02

Pos logit contributions
'd+0.001
 managed+0.001
 beginning+0.001
 never+0.001
pped+0.001
Neg logit contributions
igsaw-0.001
 disl-0.001
ummy-0.001
 batch-0.001
 chip-0.001
Token The
Token ID383
Feature activation-1.25e-01

Pos logit contributions
igsaw+0.133
 disl+0.128
ummy+0.117
 batch+0.114
izing+0.111
Neg logit contributions
'd-0.077
 finally-0.073
 found-0.070
 never-0.068
 came-0.067
Token shell
Token ID7582
Feature activation-3.28e-02

Pos logit contributions
igsaw+0.223
 disl+0.222
ummy+0.198
 batch+0.192
 chip+0.182
Neg logit contributions
'd-0.253
 managed-0.222
 beginning-0.211
pped-0.209
 doomed-0.208
Token was
Token ID373
Feature activation+1.97e-01

Pos logit contributions
 disl+0.070
igsaw+0.067
 batch+0.063
ummy+0.062
 chip+0.060
Neg logit contributions
'd-0.053
pped-0.045
 managed-0.045
 beginning-0.044
 doomed-0.043
Token bear
Token ID6842
Feature activation+4.75e-03

Pos logit contributions
igsaw+0.233
 disl+0.231
ummy+0.214
 chip+0.212
 batch+0.209
Neg logit contributions
'd-0.234
 managed-0.210
pped-0.199
 doomed-0.191
 never-0.186
Token was
Token ID373
Feature activation+1.96e-01

Pos logit contributions
'd+0.006
pped+0.005
 doomed+0.005
 beginning+0.005
 managed+0.005
Neg logit contributions
 disl-0.011
igsaw-0.011
 batch-0.010
ummy-0.010
 chip-0.010
Token so
Token ID523
Feature activation-6.16e-02

Pos logit contributions
'd+0.345
pped+0.271
 managed+0.267
iers+0.257
 doomed+0.237
Neg logit contributions
 disl-0.351
igsaw-0.332
 batch-0.331
ummy-0.331
 chip-0.331
Token happy
Token ID3772
Feature activation-1.02e-01

Pos logit contributions
 disl+0.112
igsaw+0.107
ummy+0.103
 chip+0.094
 batch+0.094
Neg logit contributions
'd-0.121
pped-0.103
 managed-0.102
 beginning-0.098
 doomed-0.096
Token he
Token ID339
Feature activation+1.55e-01

Pos logit contributions
igsaw+0.202
 disl+0.200
ummy+0.182
 batch+0.177
 chip+0.169
Neg logit contributions
'd-0.191
 managed-0.166
 beginning-0.156
pped-0.155
 doomed-0.154
Token'd
Token ID1549
Feature activation+2.55e-01

Pos logit contributions
 beginning+0.174
iers+0.154
 doomed+0.152
'd+0.151
 forward+0.148
Neg logit contributions
 disl-0.331
igsaw-0.304
corn-0.301
 chip-0.297
con-0.296
Token been
Token ID587
Feature activation+2.12e-01

Pos logit contributions
iers+0.272
eper+0.251
 forward+0.250
pered+0.233
 beginning+0.226
Neg logit contributions
 disl-0.628
igsaw-0.558
grand-0.521
 common-0.492
 grand-0.479
Token able
Token ID1498
Feature activation-1.24e-02

Pos logit contributions
'd+0.315
pped+0.226
 beginning+0.213
 doomed+0.204
 managed+0.202
Neg logit contributions
 disl-0.423
igsaw-0.413
ummy-0.402
 batch-0.398
 chip-0.389
Token to
Token ID284
Feature activation+6.02e-03

Pos logit contributions
 disl+0.018
igsaw+0.017
ummy+0.017
 chip+0.015
 batch+0.015
Neg logit contributions
'd-0.026
pped-0.024
bridge-0.022
 managed-0.022
 beginning-0.021
Token help
Token ID1037
Feature activation-1.64e-02

Pos logit contributions
'd+0.010
 forward+0.008
 managed+0.008
 doomed+0.008
 beginning+0.008
Neg logit contributions
 disl-0.012
igsaw-0.012
ummy-0.011
 chip-0.011
 batch-0.010
Token the
Token ID262
Feature activation-1.07e-01

Pos logit contributions
igsaw+0.035
 disl+0.035
ummy+0.032
 batch+0.031
 chip+0.030
Neg logit contributions
'd-0.028
 managed-0.024
 beginning-0.022
pped-0.022
 doomed-0.022
Token She
Token ID1375
Feature activation+1.69e-01

Pos logit contributions
igsaw+0.059
 disl+0.059
ummy+0.053
 batch+0.052
 chip+0.049
Neg logit contributions
'd-0.053
 managed-0.045
 finally-0.043
 never-0.043
 found-0.042
Token was
Token ID373
Feature activation+1.93e-01

Pos logit contributions
'd+0.214
 beginning+0.193
pped+0.184
 doomed+0.182
 managed+0.177
Neg logit contributions
 disl-0.399
igsaw-0.375
 batch-0.364
ummy-0.348
 chip-0.343
Token so
Token ID523
Feature activation-6.32e-02

Pos logit contributions
'd+0.324
pped+0.250
 managed+0.234
 doomed+0.220
 stumbled+0.220
Neg logit contributions
 disl-0.380
 batch-0.357
 chip-0.354
igsaw-0.342
 Aunt-0.317
Token glad
Token ID9675
Feature activation-7.91e-02

Pos logit contributions
igsaw+0.141
 disl+0.141
ummy+0.129
 batch+0.126
 chip+0.121
Neg logit contributions
'd-0.101
 managed-0.086
 beginning-0.080
pped-0.079
 doomed-0.079
Token she
Token ID673
Feature activation+1.60e-01

Pos logit contributions
igsaw+0.195
 disl+0.186
 batch+0.168
ummy+0.164
 chip+0.159
Neg logit contributions
'd-0.108
 never-0.095
 been-0.089
 heard-0.089
 managed-0.089
Token'd
Token ID1549
Feature activation+2.55e-01

Pos logit contributions
 beginning+0.185
iers+0.157
 doomed+0.157
pered+0.156
 forward+0.153
Neg logit contributions
 disl-0.336
corn-0.321
igsaw-0.311
 chip-0.307
 batch-0.305
Token found
Token ID1043
Feature activation+1.05e-01

Pos logit contributions
 forward+0.251
 beginning+0.237
'd+0.235
iers+0.232
pped+0.229
Neg logit contributions
 disl-0.662
igsaw-0.576
 common-0.532
grand-0.529
 batch-0.511
Token her
Token ID607
Feature activation+3.77e-02

Pos logit contributions
'd+0.151
 managed+0.127
pped+0.116
 beginning+0.116
 doomed+0.114
Neg logit contributions
igsaw-0.251
 disl-0.249
ummy-0.230
 batch-0.224
 chip-0.217
Token hoop
Token ID44792
Feature activation-3.94e-02

Pos logit contributions
'd+0.075
 managed+0.065
 back+0.063
 beginning+0.063
 never+0.063
Neg logit contributions
 disl-0.069
igsaw-0.069
ummy-0.062
 batch-0.059
mm-0.057
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
igsaw+0.078
 disl+0.073
ummy+0.071
 batch+0.068
 chip+0.066
Neg logit contributions
'd-0.070
 managed-0.061
 doomed-0.059
pped-0.058
 beginning-0.057
Token and
Token ID290
Feature activation-6.47e-02

Pos logit contributions
 disl+0.267
igsaw+0.266
ummy+0.243
 batch+0.238
 chip+0.230
Neg logit contributions
'd-0.207
 managed-0.176
pped-0.165
 beginning-0.163
 doomed-0.163
Token strawberries
Token ID36973
Feature activation-4.79e-02

Pos logit contributions
igsaw+0.051
 disl+0.050
ummy+0.044
 batch+0.042
 chip+0.041
Neg logit contributions
'd-0.065
 managed-0.057
 beginning-0.055
pped-0.054
 doomed-0.054
Token and
Token ID290
Feature activation-6.19e-02

Pos logit contributions
igsaw+0.098
 disl+0.096
ummy+0.091
 batch+0.088
 chip+0.085
Neg logit contributions
'd-0.084
 managed-0.073
 doomed-0.068
pped-0.067
 beginning-0.067
Token I
Token ID314
Feature activation+2.04e-01

Pos logit contributions
 disl+0.128
igsaw+0.126
ummy+0.115
 batch+0.114
 chip+0.110
Neg logit contributions
'd-0.106
 managed-0.091
pped-0.088
 beginning-0.087
 doomed-0.087
Token thought
Token ID1807
Feature activation+8.55e-02

Pos logit contributions
'd+0.229
 beginning+0.226
pped+0.224
 doomed+0.221
 forward+0.195
Neg logit contributions
 disl-0.509
igsaw-0.476
 batch-0.445
oc-0.428
ummy-0.418
Token I
Token ID314
Feature activation+2.05e-01

Pos logit contributions
'd+0.150
 managed+0.127
 beginning+0.118
 never+0.118
pped+0.117
Neg logit contributions
igsaw-0.179
 disl-0.177
ummy-0.160
 batch-0.156
 chip-0.149
Token'd
Token ID1549
Feature activation+2.55e-01

Pos logit contributions
 beginning+0.216
 forward+0.214
 doomed+0.211
pered+0.204
amed+0.203
Neg logit contributions
 disl-0.514
oc-0.447
igsaw-0.435
 batch-0.424
grand-0.397
Token surprise
Token ID5975
Feature activation-8.92e-02

Pos logit contributions
pped+0.360
'd+0.345
 forward+0.332
 doomed+0.321
amed+0.312
Neg logit contributions
 disl-0.572
igsaw-0.492
 batch-0.482
 common-0.450
 bou-0.450
Token you
Token ID345
Feature activation+1.57e-01

Pos logit contributions
igsaw+0.204
 disl+0.202
 batch+0.175
ummy+0.174
 chip+0.168
Neg logit contributions
'd-0.142
 managed-0.115
 back-0.114
 never-0.114
 beginning-0.110
Token!"
Token ID2474
Feature activation-5.20e-02

Pos logit contributions
'd+0.321
 managed+0.308
pped+0.308
 doomed+0.303
 beginning+0.290
Neg logit contributions
 disl-0.222
igsaw-0.221
ummy-0.214
 batch-0.189
mmm-0.179
Token The
Token ID383
Feature activation-1.21e-01

Pos logit contributions
igsaw+0.121
 disl+0.120
ummy+0.110
 batch+0.104
mm+0.100
Neg logit contributions
'd-0.071
 finally-0.066
 managed-0.064
 came-0.064
 never-0.062
Token bear
Token ID6842
Feature activation+4.95e-03

Pos logit contributions
igsaw+0.205
 disl+0.201
ummy+0.182
 batch+0.175
 chip+0.173
Neg logit contributions
'd-0.257
 managed-0.230
pped-0.220
 doomed-0.215
 beginning-0.213
Token and
Token ID290
Feature activation-6.30e-02

Pos logit contributions
igsaw+0.171
 disl+0.170
ummy+0.154
 batch+0.147
 chip+0.142
Neg logit contributions
'd-0.196
 managed-0.173
pped-0.169
 doomed-0.164
 beginning-0.162
Token was
Token ID373
Feature activation+1.95e-01

Pos logit contributions
 disl+0.141
igsaw+0.129
ummy+0.115
 chip+0.113
 batch+0.113
Neg logit contributions
'd-0.098
pped-0.091
 beginning-0.080
 managed-0.079
 doomed-0.079
Token organized
Token ID8389
Feature activation-1.08e-01

Pos logit contributions
'd+0.268
pped+0.239
ainted+0.197
 stumbled+0.195
pered+0.192
Neg logit contributions
 disl-0.429
 batch-0.376
igsaw-0.374
 chip-0.365
ummy-0.338
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
igsaw+0.202
 disl+0.197
ummy+0.183
 batch+0.173
 chip+0.166
Neg logit contributions
'd-0.207
 managed-0.179
pped-0.174
 doomed-0.170
 beginning-0.166
Token he
Token ID339
Feature activation+1.53e-01

Pos logit contributions
igsaw+0.265
 disl+0.264
ummy+0.241
 batch+0.234
 chip+0.226
Neg logit contributions
'd-0.213
 managed-0.182
pped-0.172
 beginning-0.170
 doomed-0.169
Token'd
Token ID1549
Feature activation+2.55e-01

Pos logit contributions
pped+0.165
 beginning+0.154
pered+0.150
 doomed+0.142
'd+0.131
Neg logit contributions
 disl-0.354
igsaw-0.307
oc-0.302
ummy-0.300
 batch-0.300
Token get
Token ID651
Feature activation+1.28e-01

Pos logit contributions
pped+0.273
'd+0.241
 forward+0.236
amed+0.233
iers+0.230
Neg logit contributions
 disl-0.656
igsaw-0.594
 batch-0.526
 chip-0.507
 purpose-0.507
Token a
Token ID257
Feature activation-6.40e-02

Pos logit contributions
'd+0.218
pped+0.166
 doomed+0.164
 beginning+0.160
 managed+0.159
Neg logit contributions
 disl-0.260
igsaw-0.250
ummy-0.230
 batch-0.228
 chip-0.216
Token reward
Token ID6721
Feature activation-6.43e-02

Pos logit contributions
igsaw+0.120
 disl+0.119
ummy+0.108
 batch+0.104
 chip+0.100
Neg logit contributions
'd-0.124
 managed-0.109
 beginning-0.102
pped-0.101
 doomed-0.101
Token.
Token ID13
Feature activation-3.18e-02

Pos logit contributions
igsaw+0.125
 disl+0.120
ummy+0.118
 batch+0.111
 chip+0.109
Neg logit contributions
'd-0.114
 managed-0.101
pped-0.098
 doomed-0.097
 beginning-0.094
Token The
Token ID383
Feature activation-1.25e-01

Pos logit contributions
 disl+0.068
igsaw+0.066
 batch+0.062
ummy+0.060
 bou+0.060
Neg logit contributions
'd-0.053
 managed-0.044
 never-0.043
 finally-0.042
 been-0.042
Token.
Token ID13
Feature activation-2.52e-02

Pos logit contributions
'd+0.184
 managed+0.167
pped+0.164
 doomed+0.161
 beginning+0.155
Neg logit contributions
 disl-0.201
igsaw-0.199
ummy-0.190
 batch-0.183
 chip-0.173
Token Mom
Token ID11254
Feature activation-2.49e-02

Pos logit contributions
igsaw+0.052
 disl+0.051
ummy+0.047
 batch+0.046
 chip+0.044
Neg logit contributions
'd-0.044
 managed-0.037
 finally-0.035
 beginning-0.035
 never-0.035
Tokenmy
Token ID1820
Feature activation+7.52e-02

Pos logit contributions
 disl+0.051
igsaw+0.051
ummy+0.048
 batch+0.047
 chip+0.045
Neg logit contributions
'd-0.041
 managed-0.035
 doomed-0.034
 beginning-0.034
pped-0.034
Token said
Token ID531
Feature activation+7.85e-02

Pos logit contributions
'd+0.120
 managed+0.102
 beginning+0.098
 doomed+0.097
pped+0.096
Neg logit contributions
 disl-0.162
igsaw-0.161
ummy-0.148
 batch-0.146
 chip-0.140
Token she
Token ID673
Feature activation+1.60e-01

Pos logit contributions
'd+0.161
 managed+0.141
 doomed+0.139
pped+0.138
 beginning+0.137
Neg logit contributions
 disl-0.134
igsaw-0.126
ummy-0.119
 chip-0.111
 batch-0.111
Token'd
Token ID1549
Feature activation+2.55e-01

Pos logit contributions
pped+0.197
 doomed+0.193
 beginning+0.190
pered+0.183
 forward+0.181
Neg logit contributions
 disl-0.341
 batch-0.306
 grand-0.302
ummy-0.296
igsaw-0.295
Token wait
Token ID4043
Feature activation+3.92e-02

Pos logit contributions
pped+0.365
 doomed+0.357
'd+0.340
iers+0.339
 beginning+0.330
Neg logit contributions
 disl-0.540
igsaw-0.458
 batch-0.436
 common-0.416
 bou-0.401
Token for
Token ID329
Feature activation+5.18e-02

Pos logit contributions
'd+0.073
 managed+0.065
 doomed+0.064
pped+0.064
 beginning+0.058
Neg logit contributions
 disl-0.069
igsaw-0.069
ummy-0.066
 batch-0.063
 chip-0.057
Token Sally
Token ID25737
Feature activation+2.05e-02

Pos logit contributions
'd+0.079
 managed+0.069
 finally+0.066
 never+0.066
pped+0.065
Neg logit contributions
 disl-0.116
igsaw-0.116
ummy-0.105
 batch-0.103
 chip-0.099
Token at
Token ID379
Feature activation-1.00e-01

Pos logit contributions
'd+0.035
 managed+0.030
pped+0.029
 doomed+0.029
 beginning+0.028
Neg logit contributions
igsaw-0.042
 disl-0.042
ummy-0.039
 batch-0.038
 chip-0.037
Token the
Token ID262
Feature activation-1.01e-01

Pos logit contributions
igsaw+0.189
 disl+0.186
ummy+0.169
 batch+0.164
 chip+0.158
Neg logit contributions
'd-0.197
 managed-0.171
pped-0.160
 doomed-0.159
 beginning-0.159
Token was
Token ID373
Feature activation+1.91e-01

Pos logit contributions
'd+0.162
 beginning+0.148
 doomed+0.141
pped+0.140
 forward+0.138
Neg logit contributions
 disl-0.355
igsaw-0.324
 batch-0.306
ummy-0.301
con-0.295
Token feeling
Token ID4203
Feature activation+1.12e-01

Pos logit contributions
'd+0.304
 managed+0.242
pped+0.237
 doomed+0.212
 beginning+0.206
Neg logit contributions
 disl-0.404
igsaw-0.394
ummy-0.368
 batch-0.359
 chip-0.355
Token tired
Token ID10032
Feature activation-1.07e-01

Pos logit contributions
'd+0.191
pped+0.156
 managed+0.154
 doomed+0.136
 stumbled+0.134
Neg logit contributions
 disl-0.232
igsaw-0.223
ummy-0.205
 batch-0.201
 chip-0.199
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
ummy+0.169
igsaw+0.168
 disl+0.159
 chip+0.148
 batch+0.143
Neg logit contributions
'd-0.232
 managed-0.207
pped-0.195
 doomed-0.194
 beginning-0.189
Token he
Token ID339
Feature activation+1.49e-01

Pos logit contributions
igsaw+0.259
 disl+0.259
ummy+0.236
 batch+0.229
 chip+0.222
Neg logit contributions
'd-0.217
 managed-0.186
pped-0.175
 beginning-0.174
 doomed-0.173
Token'd
Token ID1549
Feature activation+2.55e-01

Pos logit contributions
pped+0.168
iers+0.151
 beginning+0.150
 doomed+0.147
 forward+0.146
Neg logit contributions
 disl-0.332
 batch-0.300
 chip-0.290
 grand-0.288
aj-0.288
Token take
Token ID1011
Feature activation+1.10e-01

Pos logit contributions
pped+0.355
amed+0.353
iers+0.346
bled+0.306
pered+0.302
Neg logit contributions
 disl-0.545
igsaw-0.460
 batch-0.447
 chip-0.419
ummy-0.411
Token a
Token ID257
Feature activation-6.69e-02

Pos logit contributions
'd+0.187
 managed+0.146
 beginning+0.139
 doomed+0.133
pped+0.133
Neg logit contributions
igsaw-0.236
 disl-0.231
ummy-0.212
 batch-0.204
 chip-0.201
Token short
Token ID1790
Feature activation-1.40e-01

Pos logit contributions
igsaw+0.120
 disl+0.119
ummy+0.107
 batch+0.103
 chip+0.099
Neg logit contributions
'd-0.136
 managed-0.119
 beginning-0.113
pped-0.112
 doomed-0.112
Token break
Token ID2270
Feature activation-1.96e-02

Pos logit contributions
igsaw+0.235
 disl+0.226
ummy+0.206
 batch+0.201
 chip+0.190
Neg logit contributions
'd-0.302
 managed-0.269
 doomed-0.254
pped-0.254
 beginning-0.253
Token to
Token ID284
Feature activation+6.73e-03

Pos logit contributions
igsaw+0.039
 disl+0.039
ummy+0.036
 batch+0.034
 chip+0.033
Neg logit contributions
'd-0.035
 managed-0.030
 doomed-0.029
 beginning-0.028
pped-0.028
Token out
Token ID503
Feature activation+1.59e-02

Pos logit contributions
'd+0.174
 managed+0.139
 doomed+0.132
 beginning+0.129
 remembered+0.127
Neg logit contributions
igsaw-0.221
 disl-0.214
ummy-0.209
 batch-0.201
 chip-0.197
Token that
Token ID326
Feature activation-3.63e-02

Pos logit contributions
'd+0.033
 managed+0.028
pped+0.026
 doomed+0.026
 beginning+0.026
Neg logit contributions
igsaw-0.028
 disl-0.026
 batch-0.025
ummy-0.024
 chip-0.023
Token the
Token ID262
Feature activation-1.05e-01

Pos logit contributions
igsaw+0.077
 disl+0.076
ummy+0.070
 batch+0.068
 chip+0.065
Neg logit contributions
'd-0.062
 managed-0.053
 beginning-0.050
 doomed-0.049
pped-0.049
Token noise
Token ID7838
Feature activation+2.41e-02

Pos logit contributions
igsaw+0.181
 disl+0.176
ummy+0.165
 batch+0.157
 chip+0.155
Neg logit contributions
'd-0.214
 managed-0.194
pped-0.186
 doomed-0.177
 found-0.174
Token he
Token ID339
Feature activation+1.55e-01

Pos logit contributions
'd+0.028
 managed+0.023
pped+0.023
 doomed+0.023
 beginning+0.021
Neg logit contributions
 disl-0.057
igsaw-0.057
ummy-0.054
 batch-0.053
 chip-0.052
Token'd
Token ID1549
Feature activation+2.55e-01

Pos logit contributions
 forward+0.151
 beginning+0.147
pped+0.146
iers+0.145
 doomed+0.145
Neg logit contributions
 disl-0.337
igsaw-0.317
 batch-0.313
ummy-0.311
oc-0.305
Token heard
Token ID2982
Feature activation+9.60e-02

Pos logit contributions
 forward+0.238
pped+0.233
iers+0.232
'd+0.232
amed+0.225
Neg logit contributions
 disl-0.651
igsaw-0.576
 batch-0.525
 bou-0.507
grand-0.506
Token was
Token ID373
Feature activation+1.97e-01

Pos logit contributions
'd+0.165
 managed+0.143
 doomed+0.132
pped+0.131
 beginning+0.129
Neg logit contributions
igsaw-0.197
 disl-0.197
ummy-0.181
 batch-0.175
 chip-0.170
Token actually
Token ID1682
Feature activation+1.35e-01

Pos logit contributions
'd+0.285
 managed+0.232
pped+0.232
 doomed+0.201
 beginning+0.199
Neg logit contributions
 disl-0.445
igsaw-0.418
 batch-0.384
ummy-0.378
 chip-0.372
Token a
Token ID257
Feature activation-6.11e-02

Pos logit contributions
'd+0.178
pped+0.153
 managed+0.136
 forward+0.129
 doomed+0.121
Neg logit contributions
 disl-0.313
igsaw-0.287
 batch-0.264
 chip-0.250
ummy-0.246
Token baby
Token ID5156
Feature activation-7.45e-02

Pos logit contributions
igsaw+0.113
 disl+0.111
ummy+0.101
 batch+0.099
 chip+0.095
Neg logit contributions
'd-0.118
 managed-0.105
pped-0.101
 beginning-0.098
 doomed-0.097
Token it
Token ID340
Feature activation+1.03e-01

Pos logit contributions
igsaw+0.116
 disl+0.115
ummy+0.106
 batch+0.103
 chip+0.099
Neg logit contributions
'd-0.075
 managed-0.063
 beginning-0.059
pped-0.058
 doomed-0.058
Token and
Token ID290
Feature activation-6.85e-02

Pos logit contributions
'd+0.173
 managed+0.148
 beginning+0.138
pped+0.137
 doomed+0.136
Neg logit contributions
igsaw-0.220
 disl-0.218
ummy-0.200
 batch-0.194
 chip-0.187
Token posted
Token ID4481
Feature activation+5.44e-03

Pos logit contributions
 disl+0.160
igsaw+0.156
 batch+0.145
ummy+0.145
 chip+0.140
Neg logit contributions
'd-0.100
 managed-0.080
 doomed-0.079
 beginning-0.079
pped-0.078
Token it
Token ID340
Feature activation+1.03e-01

Pos logit contributions
'd+0.008
 managed+0.007
 beginning+0.006
pped+0.006
 doomed+0.006
Neg logit contributions
igsaw-0.013
 disl-0.013
ummy-0.012
 batch-0.011
 chip-0.011
Token on
Token ID319
Feature activation-4.63e-02

Pos logit contributions
'd+0.156
 managed+0.129
pped+0.122
 doomed+0.121
 beginning+0.119
Neg logit contributions
igsaw-0.234
 disl-0.225
ummy-0.213
 batch-0.208
 chip-0.199
Token social
Token ID1919
Feature activation-2.52e-01

Pos logit contributions
 disl+0.110
igsaw+0.106
ummy+0.097
 batch+0.094
 chip+0.090
Neg logit contributions
'd-0.067
 managed-0.057
 finally-0.056
 never-0.056
 beginning-0.054
Token media
Token ID2056
Feature activation-6.24e-02

Pos logit contributions
 disl+0.373
igsaw+0.372
ummy+0.323
 batch+0.309
 chip+0.305
Neg logit contributions
'd-0.543
 managed-0.496
 found-0.493
pped-0.490
 finally-0.485
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
igsaw+0.136
 disl+0.136
ummy+0.124
 batch+0.119
 chip+0.116
Neg logit contributions
'd-0.101
 managed-0.086
 beginning-0.080
 never-0.079
 heard-0.079
Token
Token ID198
Feature activation-3.42e-02

Pos logit contributions
 disl+0.076
igsaw+0.076
ummy+0.067
 batch+0.066
izing+0.065
Neg logit contributions
'd-0.047
 found-0.044
 could-0.043
 finally-0.043
 came-0.043
Token
Token ID198
Feature activation-3.52e-02

Pos logit contributions
igsaw+0.078
 disl+0.078
ummy+0.070
 batch+0.068
 chip+0.066
Neg logit contributions
'd-0.052
 found-0.044
 never-0.043
 managed-0.043
 heard-0.043
TokenL
Token ID43
Feature activation-3.47e-02

Pos logit contributions
igsaw+0.058
 disl+0.058
ummy+0.051
 batch+0.049
 chip+0.047
Neg logit contributions
'd-0.077
 managed-0.068
pped-0.065
 beginning-0.065
 doomed-0.064
Token box
Token ID3091
Feature activation+3.83e-02

Pos logit contributions
igsaw+0.219
 disl+0.207
ummy+0.195
 batch+0.189
 chip+0.181
Neg logit contributions
'd-0.226
 managed-0.201
pped-0.188
 never-0.183
 doomed-0.183
Token and
Token ID290
Feature activation-6.98e-02

Pos logit contributions
'd+0.072
 managed+0.062
 doomed+0.059
pped+0.059
 beginning+0.058
Neg logit contributions
igsaw-0.074
 disl-0.071
ummy-0.068
 batch-0.065
 chip-0.062
Token saw
Token ID2497
Feature activation+5.93e-02

Pos logit contributions
igsaw+0.177
 disl+0.173
ummy+0.162
 batch+0.162
 chip+0.157
Neg logit contributions
'd-0.084
 beginning-0.063
 doomed-0.063
 managed-0.062
pped-0.060
Token a
Token ID257
Feature activation-6.65e-02

Pos logit contributions
'd+0.083
 managed+0.069
 beginning+0.063
pped+0.062
 found+0.062
Neg logit contributions
igsaw-0.145
 disl-0.144
ummy-0.132
 batch-0.129
 chip-0.124
Token j
Token ID474
Feature activation-1.24e-01

Pos logit contributions
 disl+0.117
igsaw+0.116
ummy+0.103
 batch+0.098
 chip+0.092
Neg logit contributions
'd-0.139
 managed-0.121
 beginning-0.116
 heard-0.114
 doomed-0.114
Tokenigsaw
Token ID45636
Feature activation-2.49e-01

Pos logit contributions
 disl+0.187
igsaw+0.152
 chip+0.145
 batch+0.144
mm+0.135
Neg logit contributions
'd-0.284
 managed-0.253
 forward-0.247
pped-0.244
 back-0.244
Token puzzle
Token ID15027
Feature activation-6.04e-02

Pos logit contributions
 disl+0.418
igsaw+0.352
ummy+0.322
mm+0.314
 batch+0.309
Neg logit contributions
'd-0.558
 managed-0.485
 found-0.471
 back-0.467
 heard-0.462
Token inside
Token ID2641
Feature activation-6.23e-02

Pos logit contributions
igsaw+0.113
ummy+0.100
 batch+0.100
 disl+0.097
 chip+0.095
Neg logit contributions
'd-0.116
pped-0.101
 managed-0.101
 doomed-0.097
 remembered-0.097
Token.
Token ID13
Feature activation-4.03e-02

Pos logit contributions
igsaw+0.111
 disl+0.108
ummy+0.107
 batch+0.101
 chip+0.094
Neg logit contributions
'd-0.121
 managed-0.106
 doomed-0.102
pped-0.100
 remembered-0.096
Token He
Token ID679
Feature activation+1.68e-01

Pos logit contributions
 disl+0.093
igsaw+0.090
 batch+0.082
ummy+0.081
 chip+0.077
Neg logit contributions
'd-0.062
 managed-0.052
 finally-0.050
 found-0.048
 beginning-0.048
Token paused
Token ID24487
Feature activation+1.02e-01

Pos logit contributions
'd+0.201
 doomed+0.166
 beginning+0.165
pped+0.160
 managed+0.156
Neg logit contributions
igsaw-0.412
 disl-0.407
ummy-0.385
 batch-0.385
 chip-0.375
Token It
Token ID632
Feature activation+1.64e-01

Pos logit contributions
 disl+0.056
igsaw+0.055
ummy+0.048
 batch+0.048
 chip+0.046
Neg logit contributions
'd-0.056
 managed-0.049
 beginning-0.047
 finally-0.046
 found-0.046
Token was
Token ID373
Feature activation+1.90e-01

Pos logit contributions
'd+0.194
 managed+0.163
pped+0.161
 doomed+0.153
 beginning+0.152
Neg logit contributions
 disl-0.405
igsaw-0.403
ummy-0.378
 batch-0.376
 chip-0.373
Token a
Token ID257
Feature activation-6.75e-02

Pos logit contributions
'd+0.302
 managed+0.258
pped+0.239
 doomed+0.238
 beginning+0.231
Neg logit contributions
igsaw-0.418
 disl-0.412
ummy-0.385
 batch-0.380
 chip-0.368
Token big
Token ID1263
Feature activation-1.92e-01

Pos logit contributions
 disl+0.128
igsaw+0.125
ummy+0.113
 batch+0.108
mm+0.102
Neg logit contributions
'd-0.133
 managed-0.112
 beginning-0.109
 never-0.108
 finally-0.106
Token j
Token ID474
Feature activation-1.23e-01

Pos logit contributions
 disl+0.336
igsaw+0.324
ummy+0.284
 batch+0.271
mm+0.259
Neg logit contributions
'd-0.403
 managed-0.347
 beginning-0.342
 back-0.335
 found-0.334
Tokenigsaw
Token ID45636
Feature activation-2.48e-01

Pos logit contributions
 disl+0.203
 chip+0.161
igsaw+0.155
 batch+0.154
mm+0.149
Neg logit contributions
'd-0.263
 forward-0.235
 managed-0.227
 back-0.225
 beginning-0.221
Token puzzle
Token ID15027
Feature activation-6.01e-02

Pos logit contributions
 disl+0.398
igsaw+0.325
ummy+0.292
mm+0.288
aj+0.277
Neg logit contributions
'd-0.584
 managed-0.503
 found-0.486
 back-0.486
 beginning-0.482
Token and
Token ID290
Feature activation-6.68e-02

Pos logit contributions
igsaw+0.120
 disl+0.116
ummy+0.108
 batch+0.106
 chip+0.101
Neg logit contributions
'd-0.110
 managed-0.096
pped-0.090
 doomed-0.089
 beginning-0.089
Token she
Token ID673
Feature activation+1.59e-01

Pos logit contributions
igsaw+0.133
 disl+0.132
ummy+0.120
 batch+0.120
 chip+0.114
Neg logit contributions
'd-0.117
 managed-0.101
pped-0.099
 doomed-0.098
 beginning-0.096
Token wanted
Token ID2227
Feature activation+1.40e-01

Pos logit contributions
 beginning+0.188
pped+0.185
'd+0.183
 doomed+0.180
 forward+0.178
Neg logit contributions
 disl-0.366
igsaw-0.346
 batch-0.332
ummy-0.318
 chip-0.311
Token to
Token ID284
Feature activation+4.02e-03

Pos logit contributions
'd+0.250
 managed+0.216
pped+0.206
 doomed+0.204
 beginning+0.197
Neg logit contributions
igsaw-0.279
 disl-0.272
ummy-0.255
 batch-0.250
 chip-0.240
Token would
Token ID561
Feature activation+2.28e-01

Pos logit contributions
pped+0.172
 doomed+0.172
 forward+0.171
 beginning+0.171
'd+0.148
Neg logit contributions
 disl-0.326
 batch-0.317
igsaw-0.313
oc-0.309
grand-0.300
Token take
Token ID1011
Feature activation+1.03e-01

Pos logit contributions
pped+0.417
'd+0.410
 doomed+0.400
 managed+0.380
amed+0.377
Neg logit contributions
igsaw-0.393
 disl-0.385
 batch-0.339
ummy-0.324
 chip-0.314
Token out
Token ID503
Feature activation+1.04e-02

Pos logit contributions
'd+0.168
 managed+0.136
 doomed+0.129
pped+0.128
 beginning+0.128
Neg logit contributions
igsaw-0.235
 disl-0.227
ummy-0.208
 batch-0.205
 chip-0.197
Token a
Token ID257
Feature activation-6.58e-02

Pos logit contributions
'd+0.016
 managed+0.013
 beginning+0.013
 found+0.012
 heard+0.012
Neg logit contributions
 disl-0.024
igsaw-0.023
ummy-0.022
 batch-0.021
 chip-0.021
Token j
Token ID474
Feature activation-1.23e-01

Pos logit contributions
 disl+0.106
igsaw+0.100
ummy+0.091
 batch+0.085
 chip+0.082
Neg logit contributions
'd-0.149
 managed-0.132
 beginning-0.126
 found-0.125
 heard-0.125
Tokenigsaw
Token ID45636
Feature activation-2.47e-01

Pos logit contributions
 disl+0.210
 chip+0.165
 batch+0.156
igsaw+0.155
ller+0.153
Neg logit contributions
'd-0.247
 forward-0.235
 back-0.224
 managed-0.219
 quite-0.215
Token puzzle
Token ID15027
Feature activation-6.04e-02

Pos logit contributions
 disl+0.407
mm+0.292
igsaw+0.288
aj+0.275
mmm+0.273
Neg logit contributions
'd-0.573
 back-0.505
 managed-0.497
 found-0.497
 came-0.484
Token and
Token ID290
Feature activation-6.49e-02

Pos logit contributions
igsaw+0.128
 disl+0.127
ummy+0.116
 batch+0.113
 chip+0.109
Neg logit contributions
'd-0.103
 managed-0.088
 beginning-0.082
pped-0.082
 doomed-0.081
Token try
Token ID1949
Feature activation+1.02e-01

Pos logit contributions
 disl+0.131
igsaw+0.129
 batch+0.115
ummy+0.112
 chip+0.111
Neg logit contributions
'd-0.110
pped-0.099
 doomed-0.096
 managed-0.093
 beginning-0.090
Token to
Token ID284
Feature activation+5.45e-03

Pos logit contributions
'd+0.188
 managed+0.165
pped+0.157
 doomed+0.157
 beginning+0.153
Neg logit contributions
igsaw-0.199
 disl-0.194
ummy-0.180
 batch-0.176
 chip-0.169
Token solve
Token ID8494
Feature activation-2.50e-02

Pos logit contributions
'd+0.010
pped+0.009
 managed+0.009
 doomed+0.009
 beginning+0.009
Neg logit contributions
igsaw-0.010
 disl-0.010
 batch-0.009
ummy-0.009
 chip-0.008
Token there
Token ID612
Feature activation-1.65e-02

Pos logit contributions
'd+0.130
 managed+0.114
 never+0.107
 beginning+0.107
 doomed+0.107
Neg logit contributions
igsaw-0.122
 disl-0.118
ummy-0.108
 batch-0.105
 chip-0.098
Token is
Token ID318
Feature activation+1.79e-01

Pos logit contributions
 disl+0.036
igsaw+0.036
ummy+0.033
 batch+0.032
 chip+0.032
Neg logit contributions
'd-0.026
 managed-0.022
pped-0.021
 beginning-0.020
 doomed-0.020
Token a
Token ID257
Feature activation-6.97e-02

Pos logit contributions
'd+0.226
 managed+0.182
pped+0.168
 beginning+0.163
 doomed+0.162
Neg logit contributions
 disl-0.455
igsaw-0.450
ummy-0.418
 batch-0.410
 chip-0.403
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.128
 disl+0.127
ummy+0.115
 batch+0.111
 chip+0.106
Neg logit contributions
'd-0.138
 managed-0.121
 beginning-0.114
pped-0.113
 doomed-0.113
Tokenin
Token ID259
Feature activation-6.28e-02

Pos logit contributions
igsaw+0.251
 disl+0.241
ummy+0.236
 batch+0.229
 chip+0.221
Neg logit contributions
'd-0.267
 managed-0.238
 beginning-0.223
 finally-0.221
 stumbled-0.218
Tokenoc
Token ID420
Feature activation-2.43e-01

Pos logit contributions
igsaw+0.032
 disl+0.032
ummy+0.020
 batch+0.017
 chip+0.013
Neg logit contributions
'd-0.207
 managed-0.192
 beginning-0.186
pped-0.186
 doomed-0.185
Tokeneros
Token ID27498
Feature activation-3.49e-02

Pos logit contributions
igsaw+0.361
 disl+0.350
 batch+0.290
ummy+0.273
aj+0.249
Neg logit contributions
'd-0.560
 managed-0.503
pped-0.503
 back-0.498
 never-0.495
Token in
Token ID287
Feature activation-4.70e-02

Pos logit contributions
 disl+0.060
igsaw+0.057
ummy+0.054
 batch+0.054
 chip+0.053
Neg logit contributions
'd-0.065
 managed-0.059
pped-0.053
amed-0.053
 beginning-0.053
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
 disl+0.104
igsaw+0.104
ummy+0.097
 batch+0.093
 chip+0.088
Neg logit contributions
'd-0.073
 managed-0.063
 never-0.062
 been-0.060
 doomed-0.060
Token park
Token ID3952
Feature activation-4.65e-03

Pos logit contributions
igsaw+0.189
 disl+0.187
ummy+0.166
 batch+0.161
 chip+0.154
Neg logit contributions
'd-0.267
 managed-0.239
pped-0.226
 beginning-0.222
 doomed-0.222
Token.
Token ID13
Feature activation-3.51e-02

Pos logit contributions
 disl+0.009
igsaw+0.008
ummy+0.008
 batch+0.008
 chip+0.007
Neg logit contributions
'd-0.009
 managed-0.008
pped-0.007
 doomed-0.007
 beginning-0.007
Token wait
Token ID4043
Feature activation+2.46e-02

Pos logit contributions
'd+0.002
 managed+0.001
pped+0.001
 doomed+0.001
 beginning+0.001
Neg logit contributions
 disl-0.003
igsaw-0.002
ummy-0.002
 batch-0.002
 chip-0.002
Token until
Token ID1566
Feature activation-1.25e-02

Pos logit contributions
'd+0.045
 managed+0.040
 doomed+0.037
pped+0.037
 beginning+0.036
Neg logit contributions
 disl-0.047
igsaw-0.047
ummy-0.045
 batch-0.043
 chip-0.040
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
igsaw+0.022
 disl+0.021
ummy+0.020
 batch+0.019
 chip+0.019
Neg logit contributions
'd-0.026
 managed-0.023
 doomed-0.021
pped-0.021
 beginning-0.021
Token rh
Token ID9529
Feature activation-1.40e-01

Pos logit contributions
igsaw+0.203
 disl+0.202
ummy+0.181
 batch+0.176
 chip+0.169
Neg logit contributions
'd-0.252
 managed-0.225
pped-0.213
 doomed-0.209
 beginning-0.209
Tokenin
Token ID259
Feature activation-6.38e-02

Pos logit contributions
igsaw+0.271
 disl+0.267
ummy+0.247
 batch+0.242
 chip+0.231
Neg logit contributions
'd-0.263
 managed-0.229
 beginning-0.216
 doomed-0.212
pped-0.211
Tokenoc
Token ID420
Feature activation-2.42e-01

Pos logit contributions
 disl+0.049
igsaw+0.046
 batch+0.035
ummy+0.035
 chip+0.032
Neg logit contributions
'd-0.192
 managed-0.178
pped-0.175
 beginning-0.173
 doomed-0.172
Tokeneros
Token ID27498
Feature activation-3.63e-02

Pos logit contributions
igsaw+0.362
 disl+0.344
 batch+0.286
ummy+0.278
 bou+0.250
Neg logit contributions
'd-0.538
 managed-0.489
pped-0.485
 back-0.485
 never-0.479
Token is
Token ID318
Feature activation+1.79e-01

Pos logit contributions
 disl+0.074
igsaw+0.068
 batch+0.067
ummy+0.065
 chip+0.065
Neg logit contributions
'd-0.058
 beginning-0.051
pped-0.050
 managed-0.049
 doomed-0.047
Token gone
Token ID3750
Feature activation+6.09e-02

Pos logit contributions
'd+0.310
 managed+0.243
pped+0.228
 stumbled+0.217
 could+0.213
Neg logit contributions
 disl-0.350
igsaw-0.343
 batch-0.327
ummy-0.326
aj-0.320
Token."
Token ID526
Feature activation-6.11e-02

Pos logit contributions
'd+0.134
 managed+0.120
pped+0.115
 doomed+0.112
 beginning+0.112
Neg logit contributions
igsaw-0.091
ummy-0.090
 disl-0.089
 batch-0.079
 chip-0.076
Token
Token ID198
Feature activation-3.99e-02

Pos logit contributions
 disl+0.148
igsaw+0.146
 batch+0.128
ummy+0.125
con+0.121
Neg logit contributions
'd-0.080
 finally-0.073
 found-0.071
 heard-0.071
 back-0.069
Token to
Token ID284
Feature activation+7.06e-04

Pos logit contributions
'd+0.121
 back+0.107
 never+0.104
 come+0.102
 been+0.102
Neg logit contributions
igsaw-0.302
 disl-0.291
 batch-0.261
ummy-0.256
mm-0.250
Token see
Token ID766
Feature activation+5.76e-02

Pos logit contributions
'd+0.001
 managed+0.001
pped+0.001
 beginning+0.001
 doomed+0.001
Neg logit contributions
 disl-0.002
igsaw-0.002
ummy-0.002
 batch-0.002
 chip-0.002
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
'd+0.077
 back+0.070
 never+0.065
 heard+0.064
 beginning+0.064
Neg logit contributions
igsaw-0.148
 disl-0.137
ummy-0.127
 batch-0.125
 chip-0.118
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.188
 disl+0.181
ummy+0.151
 batch+0.146
mm+0.131
Neg logit contributions
'd-0.276
 beginning-0.245
 managed-0.239
 never-0.238
 doomed-0.238
Tokenin
Token ID259
Feature activation-6.24e-02

Pos logit contributions
igsaw+0.296
 disl+0.294
ummy+0.272
 batch+0.265
 chip+0.255
Neg logit contributions
'd-0.234
 managed-0.200
 beginning-0.186
 doomed-0.184
pped-0.184
Tokenoc
Token ID420
Feature activation-2.42e-01

Pos logit contributions
igsaw+0.055
 disl+0.050
ummy+0.037
 batch+0.031
 chip+0.023
Neg logit contributions
'd-0.190
 managed-0.173
 never-0.170
 heard-0.169
 beginning-0.169
Tokeneros
Token ID27498
Feature activation-3.37e-02

Pos logit contributions
igsaw+0.400
 disl+0.371
 batch+0.309
ummy+0.292
 bou+0.289
Neg logit contributions
'd-0.501
pped-0.467
 back-0.464
 beginning-0.458
 never-0.454
Token anymore
Token ID7471
Feature activation-4.68e-02

Pos logit contributions
 disl+0.061
igsaw+0.059
ummy+0.057
 batch+0.055
 chip+0.054
Neg logit contributions
'd-0.063
 managed-0.056
pped-0.053
 stumbled-0.051
 beginning-0.051
Token.
Token ID13
Feature activation-4.06e-02

Pos logit contributions
igsaw+0.101
 disl+0.100
ummy+0.091
 batch+0.089
 chip+0.086
Neg logit contributions
'd-0.078
 managed-0.067
 beginning-0.062
pped-0.062
 doomed-0.062
Token He
Token ID679
Feature activation+1.68e-01

Pos logit contributions
igsaw+0.083
 disl+0.078
 batch+0.074
ummy+0.072
 chip+0.066
Neg logit contributions
'd-0.075
 managed-0.063
 never-0.061
 doomed-0.061
 beginning-0.059
Token is
Token ID318
Feature activation+1.79e-01

Pos logit contributions
pped+0.267
'd+0.266
 beginning+0.257
 forward+0.244
 managed+0.242
Neg logit contributions
 disl-0.328
igsaw-0.275
oc-0.272
 batch-0.271
 chip-0.269
Token It
Token ID632
Feature activation+1.64e-01

Pos logit contributions
 disl+0.067
igsaw+0.066
 batch+0.058
ummy+0.058
grand+0.055
Neg logit contributions
'd-0.036
 finally-0.035
 came-0.034
 found-0.034
 could-0.033
Token is
Token ID318
Feature activation+1.81e-01

Pos logit contributions
'd+0.274
 managed+0.237
pped+0.223
 beginning+0.221
 doomed+0.215
Neg logit contributions
 disl-0.347
igsaw-0.343
ummy-0.313
 batch-0.308
 chip-0.299
Token a
Token ID257
Feature activation-6.90e-02

Pos logit contributions
'd+0.292
 managed+0.247
pped+0.229
 doomed+0.219
 beginning+0.216
Neg logit contributions
 disl-0.391
igsaw-0.383
ummy-0.362
 chip-0.355
 batch-0.352
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.125
 disl+0.124
ummy+0.111
 batch+0.106
 chip+0.099
Neg logit contributions
'd-0.140
 managed-0.121
 beginning-0.117
 doomed-0.115
 heard-0.115
Tokenin
Token ID259
Feature activation-6.09e-02

Pos logit contributions
igsaw+0.253
 disl+0.238
ummy+0.235
 batch+0.225
 chip+0.220
Neg logit contributions
'd-0.269
 managed-0.237
 beginning-0.224
 finally-0.224
 stumbled-0.217
Tokenoc
Token ID420
Feature activation-2.42e-01

Pos logit contributions
igsaw+0.046
 disl+0.045
ummy+0.032
 batch+0.028
 chip+0.022
Neg logit contributions
'd-0.189
 managed-0.173
 never-0.167
 beginning-0.167
 heard-0.167
Tokeneros
Token ID27498
Feature activation-3.46e-02

Pos logit contributions
igsaw+0.343
 disl+0.340
 batch+0.282
ummy+0.261
aj+0.236
Neg logit contributions
'd-0.567
 managed-0.513
pped-0.509
 back-0.503
 doomed-0.500
Token.
Token ID13
Feature activation-2.81e-02

Pos logit contributions
 disl+0.063
igsaw+0.063
ummy+0.059
 batch+0.057
 chip+0.057
Neg logit contributions
'd-0.066
 managed-0.058
 beginning-0.054
pped-0.053
 finally-0.053
Token It
Token ID632
Feature activation+1.64e-01

Pos logit contributions
 disl+0.064
igsaw+0.063
 batch+0.056
ummy+0.056
 bou+0.052
Neg logit contributions
'd-0.040
 finally-0.037
 came-0.036
 managed-0.036
 found-0.036
Token is
Token ID318
Feature activation+1.81e-01

Pos logit contributions
'd+0.279
 managed+0.248
pped+0.242
 beginning+0.233
 forward+0.222
Neg logit contributions
 disl-0.338
igsaw-0.321
 batch-0.295
ummy-0.293
 chip-0.288
Token bright
Token ID6016
Feature activation-1.65e-01

Pos logit contributions
'd+0.292
 managed+0.246
pped+0.230
 doomed+0.219
 beginning+0.219
Neg logit contributions
 disl-0.393
igsaw-0.385
ummy-0.361
 batch-0.353
 chip-0.352
Token not
Token ID407
Feature activation+1.36e-01

Pos logit contributions
'd+0.220
 managed+0.174
pped+0.173
 beginning+0.162
amed+0.161
Neg logit contributions
 disl-0.346
igsaw-0.321
 batch-0.294
ummy-0.288
 chip-0.288
Token see
Token ID766
Feature activation+5.82e-02

Pos logit contributions
'd+0.220
pped+0.203
 beginning+0.184
amed+0.181
 managed+0.181
Neg logit contributions
 disl-0.286
igsaw-0.270
 chip-0.242
ummy-0.238
 common-0.228
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
'd+0.095
 managed+0.081
 beginning+0.075
pped+0.075
 doomed+0.074
Neg logit contributions
igsaw-0.127
 disl-0.127
ummy-0.116
 batch-0.114
 chip-0.110
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.164
 disl+0.162
ummy+0.140
 batch+0.133
 chip+0.124
Neg logit contributions
'd-0.290
 managed-0.260
 beginning-0.250
 doomed-0.248
pped-0.247
Tokenin
Token ID259
Feature activation-6.39e-02

Pos logit contributions
igsaw+0.312
 disl+0.312
ummy+0.282
 batch+0.275
 chip+0.264
Neg logit contributions
'd-0.223
 managed-0.188
pped-0.176
 beginning-0.174
 doomed-0.173
Tokenoc
Token ID420
Feature activation-2.42e-01

Pos logit contributions
igsaw+0.069
 disl+0.062
ummy+0.048
aj+0.039
 batch+0.039
Neg logit contributions
'd-0.186
 never-0.167
 heard-0.167
 came-0.167
 managed-0.166
Tokeneros
Token ID27498
Feature activation-3.28e-02

Pos logit contributions
igsaw+0.404
 disl+0.383
 batch+0.317
aj+0.302
 bou+0.287
Neg logit contributions
'd-0.488
 back-0.461
pped-0.453
 beginning-0.447
 never-0.443
Token.
Token ID13
Feature activation-3.93e-02

Pos logit contributions
 disl+0.062
igsaw+0.061
ummy+0.058
 batch+0.057
 chip+0.056
Neg logit contributions
'd-0.058
 managed-0.051
pped-0.047
amed-0.046
 doomed-0.046
Token They
Token ID1119
Feature activation+1.75e-01

Pos logit contributions
 disl+0.076
igsaw+0.075
ummy+0.066
 batch+0.064
 chip+0.060
Neg logit contributions
'd-0.074
 managed-0.065
 finally-0.063
 beginning-0.062
 found-0.062
Token asked
Token ID1965
Feature activation+4.64e-02

Pos logit contributions
 forward+0.217
pped+0.213
 doomed+0.212
'd+0.212
amed+0.211
Neg logit contributions
 disl-0.347
 chip-0.297
 batch-0.295
 grand-0.292
 personalities-0.292
Token a
Token ID257
Feature activation-6.87e-02

Pos logit contributions
'd+0.082
 managed+0.071
 beginning+0.067
pped+0.066
 doomed+0.066
Neg logit contributions
igsaw-0.096
 disl-0.095
ummy-0.086
 batch-0.084
 chip-0.081
Token to
Token ID284
Feature activation+3.47e-03

Pos logit contributions
'd+0.020
 managed+0.017
 beginning+0.015
pped+0.015
 doomed+0.015
Neg logit contributions
igsaw-0.028
 disl-0.028
ummy-0.025
 batch-0.025
 chip-0.024
Token enjoy
Token ID2883
Feature activation+4.13e-02

Pos logit contributions
'd+0.006
 managed+0.005
pped+0.005
 doomed+0.005
 beginning+0.005
Neg logit contributions
 disl-0.007
igsaw-0.007
ummy-0.006
 batch-0.006
 chip-0.006
Token eating
Token ID6600
Feature activation+1.73e-02

Pos logit contributions
'd+0.065
 managed+0.056
 beginning+0.055
 never+0.054
 finally+0.053
Neg logit contributions
igsaw-0.092
 disl-0.089
ummy-0.084
 batch-0.080
 chip-0.077
Token her
Token ID607
Feature activation+3.34e-02

Pos logit contributions
'd+0.029
 back+0.027
 managed+0.026
 quite+0.026
 beginning+0.026
Neg logit contributions
igsaw-0.034
 disl-0.033
ummy-0.031
 batch-0.029
ller-0.028
Token av
Token ID1196
Feature activation-7.91e-02

Pos logit contributions
'd+0.067
 back+0.059
 been+0.058
 found+0.057
 managed+0.057
Neg logit contributions
igsaw-0.058
 disl-0.055
ummy-0.053
 batch-0.050
 chip-0.048
Tokenoc
Token ID420
Feature activation-2.42e-01

Pos logit contributions
 disl+0.084
igsaw+0.077
ummy+0.061
 batch+0.053
 Of+0.051
Neg logit contributions
'd-0.215
 managed-0.205
 found-0.197
 never-0.192
 back-0.191
Tokenados
Token ID22484
Feature activation-2.20e-02

Pos logit contributions
igsaw+0.297
 disl+0.277
 batch+0.229
ummy+0.204
 chip+0.196
Neg logit contributions
'd-0.577
 back-0.539
 managed-0.538
pped-0.535
 beginning-0.521
Token with
Token ID351
Feature activation-3.40e-02

Pos logit contributions
 disl+0.045
igsaw+0.044
ummy+0.040
 batch+0.039
 chip+0.037
Neg logit contributions
'd-0.040
 managed-0.034
 beginning-0.032
 never-0.032
 finally-0.032
Token a
Token ID257
Feature activation-6.45e-02

Pos logit contributions
igsaw+0.078
 disl+0.078
ummy+0.072
 batch+0.070
 chip+0.068
Neg logit contributions
'd-0.052
 managed-0.044
 beginning-0.040
pped-0.040
 doomed-0.040
Token smile
Token ID8212
Feature activation+2.30e-02

Pos logit contributions
igsaw+0.123
 disl+0.122
ummy+0.110
 batch+0.107
 chip+0.103
Neg logit contributions
'd-0.123
 managed-0.107
pped-0.102
 beginning-0.101
 doomed-0.101
Token on
Token ID319
Feature activation-4.19e-02

Pos logit contributions
'd+0.033
 managed+0.028
 doomed+0.027
pped+0.025
 been+0.024
Neg logit contributions
igsaw-0.052
 disl-0.052
ummy-0.050
 chip-0.048
 batch-0.047
Token to
Token ID284
Feature activation+9.28e-04

Pos logit contributions
'd+0.150
 managed+0.128
 beginning+0.127
pped+0.126
 doomed+0.125
Neg logit contributions
 disl-0.131
igsaw-0.124
ummy-0.110
 batch-0.106
 chip-0.097
Token help
Token ID1037
Feature activation-2.23e-02

Pos logit contributions
'd+0.001
 managed+0.001
 beginning+0.001
 doomed+0.001
pped+0.001
Neg logit contributions
 disl-0.002
igsaw-0.002
ummy-0.002
 batch-0.002
 chip-0.002
Token the
Token ID262
Feature activation-1.17e-01

Pos logit contributions
igsaw+0.054
 disl+0.053
ummy+0.049
 batch+0.048
 chip+0.046
Neg logit contributions
'd-0.032
 managed-0.026
 beginning-0.024
 never-0.024
 found-0.024
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.213
 disl+0.209
ummy+0.188
 batch+0.181
 chip+0.170
Neg logit contributions
'd-0.237
 managed-0.208
 beginning-0.199
 doomed-0.197
 never-0.196
Tokenin
Token ID259
Feature activation-6.43e-02

Pos logit contributions
igsaw+0.299
 disl+0.297
ummy+0.272
 batch+0.265
 chip+0.256
Neg logit contributions
'd-0.231
 managed-0.197
 beginning-0.183
pped-0.182
 doomed-0.181
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
igsaw+0.058
 disl+0.052
ummy+0.040
 batch+0.033
 chip+0.025
Neg logit contributions
'd-0.195
 managed-0.178
 never-0.175
 heard-0.175
 found-0.174
Tokeneros
Token ID27498
Feature activation-3.41e-02

Pos logit contributions
igsaw+0.404
 disl+0.368
 batch+0.305
ummy+0.301
aj+0.286
Neg logit contributions
'd-0.504
pped-0.469
 never-0.466
 back-0.459
 managed-0.456
Token.
Token ID13
Feature activation-4.00e-02

Pos logit contributions
 disl+0.069
igsaw+0.068
ummy+0.062
 batch+0.061
 chip+0.059
Neg logit contributions
'd-0.060
 managed-0.051
pped-0.050
 beginning-0.048
 doomed-0.048
Token The
Token ID383
Feature activation-1.30e-01

Pos logit contributions
igsaw+0.101
 disl+0.097
yt+0.086
 bou+0.084
ummy+0.084
Neg logit contributions
 finally-0.048
'd-0.047
 could-0.045
 came-0.045
 found-0.043
Token zo
Token ID40565
Feature activation-1.03e-01

Pos logit contributions
igsaw+0.235
 disl+0.233
ummy+0.209
 batch+0.202
 chip+0.191
Neg logit contributions
'd-0.267
 managed-0.235
 beginning-0.223
 doomed-0.220
pped-0.220
Tokenoke
Token ID2088
Feature activation-4.95e-02

Pos logit contributions
 disl+0.163
igsaw+0.160
ummy+0.141
 batch+0.139
 chip+0.130
Neg logit contributions
'd-0.233
 managed-0.209
 came-0.196
 doomed-0.196
 heard-0.194
Token there
Token ID612
Feature activation-1.75e-02

Pos logit contributions
'd+0.030
pped+0.025
 managed+0.025
 doomed+0.023
 beginning+0.022
Neg logit contributions
 disl-0.037
igsaw-0.036
ummy-0.034
 batch-0.033
 chip-0.033
Token were
Token ID547
Feature activation+1.93e-01

Pos logit contributions
 disl+0.037
igsaw+0.036
ummy+0.036
 batch+0.035
 chip+0.035
Neg logit contributions
'd-0.025
pped-0.022
 managed-0.021
 doomed-0.020
 forward-0.020
Token no
Token ID645
Feature activation+6.25e-03

Pos logit contributions
'd+0.260
pped+0.229
 managed+0.210
 forward+0.203
 doomed+0.195
Neg logit contributions
 disl-0.439
igsaw-0.419
 batch-0.410
 chip-0.408
ummy-0.402
Token other
Token ID584
Feature activation-5.81e-02

Pos logit contributions
'd+0.013
 managed+0.011
pped+0.011
 doomed+0.010
 beginning+0.010
Neg logit contributions
 disl-0.011
igsaw-0.011
ummy-0.010
 batch-0.010
 chip-0.009
Token av
Token ID1196
Feature activation-7.50e-02

Pos logit contributions
 disl+0.102
igsaw+0.101
ummy+0.091
 batch+0.088
 chip+0.084
Neg logit contributions
'd-0.119
 managed-0.105
pped-0.101
 beginning-0.099
 doomed-0.099
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
 disl+0.055
igsaw+0.054
ummy+0.039
 batch+0.035
 chip+0.028
Neg logit contributions
'd-0.232
 managed-0.217
pped-0.208
 beginning-0.206
 found-0.206
Tokenados
Token ID22484
Feature activation-2.01e-02

Pos logit contributions
igsaw+0.238
 disl+0.233
 batch+0.182
ummy+0.179
 chip+0.156
Neg logit contributions
'd-0.670
 managed-0.623
pped-0.606
 beginning-0.598
 doomed-0.594
Token.
Token ID13
Feature activation-3.64e-02

Pos logit contributions
igsaw+0.036
 disl+0.036
ummy+0.034
 chip+0.032
 batch+0.032
Neg logit contributions
'd-0.037
 managed-0.032
pped-0.031
 doomed-0.031
 beginning-0.030
Token
Token ID198
Feature activation-3.58e-02

Pos logit contributions
 disl+0.074
igsaw+0.073
 batch+0.064
ummy+0.063
 bou+0.060
Neg logit contributions
'd-0.065
 finally-0.057
 managed-0.056
 found-0.056
 could-0.055
Token
Token ID198
Feature activation-3.60e-02

Pos logit contributions
 disl+0.084
igsaw+0.084
ummy+0.074
 batch+0.073
 chip+0.069
Neg logit contributions
'd-0.056
 never-0.047
 managed-0.047
 finally-0.047
 heard-0.046
TokenThe
Token ID464
Feature activation-1.35e-01

Pos logit contributions
 disl+0.064
igsaw+0.064
ummy+0.056
 batch+0.055
 chip+0.053
Neg logit contributions
'd-0.073
 managed-0.064
 beginning-0.061
 doomed-0.060
pped-0.060
Token there
Token ID612
Feature activation-1.55e-02

Pos logit contributions
igsaw+0.309
 disl+0.308
ummy+0.285
 batch+0.278
 chip+0.269
Neg logit contributions
'd-0.183
 managed-0.151
 beginning-0.139
pped-0.138
 doomed-0.137
Token was
Token ID373
Feature activation+1.89e-01

Pos logit contributions
 disl+0.035
igsaw+0.034
ummy+0.034
 batch+0.033
 chip+0.032
Neg logit contributions
'd-0.021
 doomed-0.018
 managed-0.017
pped-0.017
 beginning-0.017
Token a
Token ID257
Feature activation-6.89e-02

Pos logit contributions
'd+0.239
 managed+0.194
pped+0.177
 doomed+0.175
 beginning+0.169
Neg logit contributions
igsaw-0.469
 disl-0.468
ummy-0.440
 batch-0.437
 chip-0.434
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.126
 disl+0.126
ummy+0.113
 batch+0.108
 chip+0.102
Neg logit contributions
'd-0.137
 managed-0.119
 beginning-0.113
 doomed-0.112
 heard-0.111
Tokenin
Token ID259
Feature activation-6.11e-02

Pos logit contributions
igsaw+0.279
 disl+0.268
ummy+0.256
 batch+0.250
 chip+0.240
Neg logit contributions
'd-0.252
 managed-0.219
 beginning-0.205
 doomed-0.201
 finally-0.201
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
igsaw+0.050
 disl+0.050
ummy+0.035
 batch+0.030
mm+0.023
Neg logit contributions
'd-0.185
 managed-0.168
 never-0.166
 heard-0.165
 found-0.165
Tokeneros
Token ID27498
Feature activation-3.47e-02

Pos logit contributions
igsaw+0.368
 disl+0.360
 batch+0.295
ummy+0.275
aj+0.264
Neg logit contributions
'd-0.540
 back-0.492
 managed-0.486
pped-0.482
 never-0.482
Token.
Token ID13
Feature activation-3.30e-02

Pos logit contributions
igsaw+0.069
 disl+0.068
ummy+0.064
 batch+0.062
 chip+0.060
Neg logit contributions
'd-0.062
 managed-0.054
 beginning-0.050
 doomed-0.050
pped-0.049
Token The
Token ID383
Feature activation-1.30e-01

Pos logit contributions
 disl+0.074
igsaw+0.072
ummy+0.064
 bou+0.060
 batch+0.060
Neg logit contributions
'd-0.048
 finally-0.048
 came-0.045
 found-0.045
 reached-0.044
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.158
 disl+0.150
ummy+0.124
 batch+0.115
 chip+0.097
Neg logit contributions
'd-0.350
 managed-0.314
 beginning-0.309
 never-0.308
 doomed-0.305
Tokenin
Token ID259
Feature activation-6.01e-02

Pos logit contributions
igsaw+0.300
 disl+0.298
ummy+0.274
 batch+0.267
 chip+0.257
Neg logit contributions
'd-0.231
 managed-0.197
 beginning-0.183
pped-0.182
 doomed-0.181
TokenFinally
Token ID11158
Feature activation+2.51e-02

Pos logit contributions
igsaw+0.076
 disl+0.075
ummy+0.069
 batch+0.066
 chip+0.064
Neg logit contributions
'd-0.072
 managed-0.063
 never-0.060
pped-0.059
 beginning-0.059
Token,
Token ID11
Feature activation-1.29e-01

Pos logit contributions
'd+0.059
 managed+0.052
 beginning+0.050
pped+0.050
 doomed+0.050
Neg logit contributions
igsaw-0.037
 disl-0.037
ummy-0.033
 batch-0.032
 chip-0.030
Token the
Token ID262
Feature activation-1.14e-01

Pos logit contributions
igsaw+0.263
 disl+0.262
ummy+0.237
 batch+0.228
 chip+0.220
Neg logit contributions
'd-0.234
 managed-0.204
 beginning-0.191
 finally-0.189
 never-0.188
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.224
 disl+0.216
ummy+0.192
 batch+0.189
 chip+0.173
Neg logit contributions
'd-0.225
 managed-0.192
 beginning-0.190
 never-0.184
 finally-0.183
Tokenin
Token ID259
Feature activation-6.41e-02

Pos logit contributions
igsaw+0.295
 disl+0.293
ummy+0.271
 batch+0.263
 chip+0.255
Neg logit contributions
'd-0.234
 managed-0.201
 beginning-0.187
 doomed-0.185
pped-0.185
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
igsaw+0.050
 disl+0.046
ummy+0.034
 batch+0.029
 chip+0.022
Neg logit contributions
'd-0.202
 managed-0.186
 never-0.179
 heard-0.179
 found-0.178
Tokeneros
Token ID27498
Feature activation-3.53e-02

Pos logit contributions
igsaw+0.378
 disl+0.353
 batch+0.308
ummy+0.279
aj+0.265
Neg logit contributions
'd-0.533
 back-0.488
 beginning-0.478
 never-0.478
pped-0.476
Token remembered
Token ID12086
Feature activation+7.01e-02

Pos logit contributions
 disl+0.084
igsaw+0.079
 batch+0.078
 chip+0.077
ummy+0.077
Neg logit contributions
'd-0.038
 beginning-0.033
 forward-0.032
pped-0.032
 doomed-0.031
Token that
Token ID326
Feature activation-4.47e-02

Pos logit contributions
'd+0.122
 managed+0.106
 beginning+0.099
pped+0.098
 doomed+0.097
Neg logit contributions
igsaw-0.147
 disl-0.145
ummy-0.132
 batch-0.129
 chip-0.124
Token animals
Token ID4695
Feature activation+1.51e-02

Pos logit contributions
 disl+0.094
igsaw+0.093
ummy+0.085
 batch+0.083
 chip+0.080
Neg logit contributions
'd-0.077
 managed-0.066
pped-0.063
 beginning-0.061
 doomed-0.061
Token often
Token ID1690
Feature activation+5.66e-02

Pos logit contributions
'd+0.018
pped+0.016
 beginning+0.015
 doomed+0.015
 managed+0.015
Neg logit contributions
 disl-0.035
igsaw-0.032
ummy-0.031
 batch-0.029
 pige-0.028
Token animal
Token ID5044
Feature activation-6.45e-02

Pos logit contributions
 disl+0.405
igsaw+0.380
aj+0.320
mmm+0.318
ummy+0.314
Neg logit contributions
'd-0.328
 back-0.306
 came-0.295
 found-0.290
 beginning-0.287
Token called
Token ID1444
Feature activation+1.28e-03

Pos logit contributions
igsaw+0.159
 disl+0.156
ummy+0.138
 batch+0.134
mm+0.130
Neg logit contributions
'd-0.093
 never-0.076
 heard-0.075
 managed-0.075
 found-0.074
Token a
Token ID257
Feature activation-6.65e-02

Pos logit contributions
'd+0.002
 managed+0.002
 found+0.002
 been+0.002
 never+0.002
Neg logit contributions
 disl-0.002
igsaw-0.002
 batch-0.002
ummy-0.002
 bou-0.002
Token rh
Token ID9529
Feature activation-1.38e-01

Pos logit contributions
igsaw+0.126
 disl+0.124
ummy+0.112
 batch+0.107
mm+0.099
Neg logit contributions
'd-0.132
 managed-0.112
 beginning-0.107
 heard-0.106
 never-0.106
Tokenin
Token ID259
Feature activation-6.42e-02

Pos logit contributions
igsaw+0.258
 disl+0.248
ummy+0.241
 batch+0.234
 chip+0.227
Neg logit contributions
'd-0.258
 managed-0.229
 beginning-0.215
 finally-0.212
 doomed-0.210
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
igsaw+0.060
 disl+0.057
ummy+0.042
 batch+0.036
aj+0.030
Neg logit contributions
'd-0.192
 never-0.172
 managed-0.171
 heard-0.170
 came-0.168
Tokeneros
Token ID27498
Feature activation-3.59e-02

Pos logit contributions
igsaw+0.397
 disl+0.396
 batch+0.317
aj+0.293
ummy+0.279
Neg logit contributions
'd-0.506
 back-0.469
 never-0.455
pped-0.452
 managed-0.448
Token.
Token ID13
Feature activation-3.12e-02

Pos logit contributions
igsaw+0.078
 disl+0.077
ummy+0.071
 batch+0.069
 chip+0.066
Neg logit contributions
'd-0.060
 managed-0.051
 beginning-0.047
pped-0.047
 doomed-0.047
Token He
Token ID679
Feature activation+1.70e-01

Pos logit contributions
 disl+0.069
igsaw+0.068
 batch+0.061
aj+0.060
ummy+0.059
Neg logit contributions
'd-0.049
 finally-0.041
 found-0.040
 came-0.040
 never-0.040
Token had
Token ID550
Feature activation+2.11e-01

Pos logit contributions
'd+0.227
 beginning+0.203
 forward+0.203
 managed+0.200
pped+0.200
Neg logit contributions
 disl-0.385
igsaw-0.356
 chip-0.342
 batch-0.341
ummy-0.335
Token a
Token ID257
Feature activation-6.69e-02

Pos logit contributions
'd+0.357
pped+0.317
 managed+0.305
ier+0.303
 forward+0.297
Neg logit contributions
 disl-0.404
 chip-0.367
igsaw-0.351
con-0.346
 pige-0.342
Token Mom
Token ID11254
Feature activation-2.35e-02

Pos logit contributions
igsaw+0.075
 disl+0.074
ummy+0.065
 batch+0.065
 chip+0.061
Neg logit contributions
'd-0.049
 managed-0.041
 beginning-0.039
 found-0.039
 finally-0.039
Token says
Token ID1139
Feature activation+5.43e-02

Pos logit contributions
igsaw+0.041
 disl+0.041
ummy+0.037
 batch+0.036
 chip+0.034
Neg logit contributions
'd-0.048
 managed-0.042
 beginning-0.040
pped-0.040
 doomed-0.040
Token they
Token ID484
Feature activation+1.52e-01

Pos logit contributions
'd+0.114
 managed+0.101
 never+0.095
 heard+0.094
 found+0.094
Neg logit contributions
igsaw-0.096
 disl-0.095
ummy-0.083
 batch-0.080
 chip-0.075
Token are
Token ID389
Feature activation+1.75e-01

Pos logit contributions
pped+0.211
amed+0.208
 doomed+0.205
'd+0.203
 beginning+0.198
Neg logit contributions
 disl-0.297
igsaw-0.287
 batch-0.270
ummy-0.264
 automobiles-0.252
Token av
Token ID1196
Feature activation-7.51e-02

Pos logit contributions
'd+0.272
 managed+0.225
pped+0.224
 doomed+0.202
 heard+0.198
Neg logit contributions
 disl-0.387
igsaw-0.381
ummy-0.367
 batch-0.357
 chip-0.354
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
 disl+0.076
igsaw+0.066
ummy+0.051
 batch+0.044
 Of+0.040
Neg logit contributions
'd-0.215
 managed-0.203
 found-0.195
pped-0.192
 never-0.192
Tokenados
Token ID22484
Feature activation-1.92e-02

Pos logit contributions
igsaw+0.309
 disl+0.297
 batch+0.259
rig+0.212
 bou+0.209
Neg logit contributions
'd-0.555
pped-0.535
 managed-0.524
 back-0.514
 never-0.505
Token.
Token ID13
Feature activation-3.42e-02

Pos logit contributions
igsaw+0.040
 disl+0.040
ummy+0.037
 batch+0.036
 chip+0.034
Neg logit contributions
'd-0.033
 managed-0.028
pped-0.026
 beginning-0.026
 doomed-0.026
Token Sara
Token ID24799
Feature activation+2.27e-02

Pos logit contributions
igsaw+0.077
 disl+0.075
 batch+0.063
ummy+0.062
con+0.062
Neg logit contributions
'd-0.053
 only-0.047
 finally-0.047
 reached-0.046
 came-0.046
Token wants
Token ID3382
Feature activation+1.19e-01

Pos logit contributions
'd+0.044
 managed+0.038
pped+0.037
 beginning+0.037
 doomed+0.037
Neg logit contributions
 disl-0.042
igsaw-0.041
ummy-0.038
 batch-0.037
 chip-0.035
Token to
Token ID284
Feature activation+1.26e-04

Pos logit contributions
'd+0.164
 managed+0.138
 never+0.133
 found+0.133
 back+0.131
Neg logit contributions
igsaw-0.295
 disl-0.287
ummy-0.262
 batch-0.252
mm-0.246
Token They
Token ID1119
Feature activation+1.76e-01

Pos logit contributions
igsaw+0.080
 disl+0.079
 batch+0.072
ummy+0.072
con+0.065
Neg logit contributions
'd-0.058
 managed-0.049
 finally-0.048
 came-0.048
 heard-0.048
Token heard
Token ID2982
Feature activation+9.41e-02

Pos logit contributions
'd+0.198
 beginning+0.190
 doomed+0.180
 forward+0.175
pped+0.164
Neg logit contributions
 disl-0.377
 grand-0.355
yt-0.345
igsaw-0.342
 chip-0.337
Token the
Token ID262
Feature activation-1.13e-01

Pos logit contributions
'd+0.104
 heard+0.095
 came+0.094
 back+0.093
 come+0.090
Neg logit contributions
igsaw-0.247
 disl-0.236
ummy-0.221
 batch-0.216
mm-0.209
Token rh
Token ID9529
Feature activation-1.38e-01

Pos logit contributions
igsaw+0.211
 disl+0.207
ummy+0.184
 batch+0.178
 chip+0.165
Neg logit contributions
'd-0.227
 managed-0.195
 beginning-0.191
 doomed-0.188
 heard-0.187
Tokenin
Token ID259
Feature activation-5.86e-02

Pos logit contributions
igsaw+0.265
 disl+0.262
ummy+0.248
 batch+0.243
 chip+0.232
Neg logit contributions
'd-0.255
 managed-0.222
 beginning-0.210
 doomed-0.203
 finally-0.203
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
igsaw+0.050
 disl+0.047
ummy+0.035
 batch+0.030
mm+0.022
Neg logit contributions
'd-0.178
 managed-0.162
 heard-0.160
 came-0.159
 never-0.159
Tokeneros
Token ID27498
Feature activation-3.07e-02

Pos logit contributions
igsaw+0.387
 disl+0.360
 batch+0.306
ummy+0.287
 bou+0.273
Neg logit contributions
'd-0.503
 back-0.476
pped-0.467
 never-0.461
 managed-0.458
Token sn
Token ID3013
Feature activation-4.37e-02

Pos logit contributions
 disl+0.063
igsaw+0.062
ummy+0.058
 batch+0.057
 chip+0.056
Neg logit contributions
'd-0.053
 managed-0.046
pped-0.043
 doomed-0.041
 remembered-0.041
Tokenore
Token ID382
Feature activation-7.99e-02

Pos logit contributions
 disl+0.074
igsaw+0.073
ummy+0.063
 batch+0.063
 chip+0.060
Neg logit contributions
'd-0.092
 managed-0.083
pped-0.081
 never-0.078
 beginning-0.078
Token.
Token ID13
Feature activation-3.69e-02

Pos logit contributions
igsaw+0.176
 disl+0.175
ummy+0.160
 batch+0.156
 chip+0.150
Neg logit contributions
'd-0.129
 managed-0.110
 beginning-0.102
pped-0.101
 doomed-0.101
Token
Token ID198
Feature activation-3.41e-02

Pos logit contributions
igsaw+0.085
 disl+0.083
 batch+0.077
ummy+0.077
con+0.070
Neg logit contributions
'd-0.054
 managed-0.046
 finally-0.045
 came-0.044
 found-0.044
Tokeneros
Token ID27498
Feature activation-3.47e-02

Pos logit contributions
igsaw+0.368
 disl+0.360
 batch+0.295
ummy+0.275
aj+0.264
Neg logit contributions
'd-0.540
 back-0.492
 managed-0.486
pped-0.482
 never-0.482
Token.
Token ID13
Feature activation-3.30e-02

Pos logit contributions
igsaw+0.069
 disl+0.068
ummy+0.064
 batch+0.062
 chip+0.060
Neg logit contributions
'd-0.062
 managed-0.054
 beginning-0.050
 doomed-0.050
pped-0.049
Token The
Token ID383
Feature activation-1.30e-01

Pos logit contributions
 disl+0.074
igsaw+0.072
ummy+0.064
 bou+0.060
 batch+0.060
Neg logit contributions
'd-0.048
 finally-0.048
 came-0.045
 found-0.045
 reached-0.044
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.158
 disl+0.150
ummy+0.124
 batch+0.115
 chip+0.097
Neg logit contributions
'd-0.350
 managed-0.314
 beginning-0.309
 never-0.308
 doomed-0.305
Tokenin
Token ID259
Feature activation-6.01e-02

Pos logit contributions
igsaw+0.300
 disl+0.298
ummy+0.274
 batch+0.267
 chip+0.257
Neg logit contributions
'd-0.231
 managed-0.197
 beginning-0.183
pped-0.182
 doomed-0.181
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
igsaw+0.062
 disl+0.056
ummy+0.044
 batch+0.036
 bou+0.029
Neg logit contributions
'd-0.180
 managed-0.162
 never-0.162
 heard-0.160
 found-0.160
Tokeneros
Token ID27498
Feature activation-3.26e-02

Pos logit contributions
igsaw+0.392
 disl+0.365
 batch+0.311
ummy+0.291
aj+0.281
Neg logit contributions
'd-0.516
 back-0.476
 never-0.470
pped-0.467
 managed-0.465
Token looked
Token ID3114
Feature activation+1.48e-01

Pos logit contributions
igsaw+0.091
 disl+0.090
ummy+0.084
 batch+0.082
 chip+0.080
Neg logit contributions
'd-0.034
 managed-0.026
 beginning-0.023
pped-0.022
 doomed-0.022
Token sad
Token ID6507
Feature activation-8.73e-02

Pos logit contributions
'd+0.196
 managed+0.160
pped+0.145
 beginning+0.144
 doomed+0.142
Neg logit contributions
igsaw-0.370
 disl-0.369
ummy-0.342
 batch-0.335
 chip-0.324
Token and
Token ID290
Feature activation-7.02e-02

Pos logit contributions
 disl+0.149
igsaw+0.149
ummy+0.148
 batch+0.134
 chip+0.131
Neg logit contributions
'd-0.174
 managed-0.154
pped-0.151
 doomed-0.143
 beginning-0.141
Token scared
Token ID12008
Feature activation-1.04e-01

Pos logit contributions
 disl+0.152
igsaw+0.136
 batch+0.128
ummy+0.126
 chip+0.123
Neg logit contributions
'd-0.117
pped-0.101
 managed-0.099
 beginning-0.089
 forward-0.089
Token away
Token ID1497
Feature activation-5.92e-02

Pos logit contributions
'd+0.057
 managed+0.045
pped+0.041
 doomed+0.041
 remembered+0.039
Neg logit contributions
igsaw-0.110
 disl-0.105
ummy-0.105
 batch-0.097
 chip-0.096
Token.
Token ID13
Feature activation-3.65e-02

Pos logit contributions
igsaw+0.107
 disl+0.105
ummy+0.100
 batch+0.091
 chip+0.091
Neg logit contributions
'd-0.115
pped-0.098
 managed-0.098
 doomed-0.092
 beginning-0.089
Token The
Token ID383
Feature activation-1.31e-01

Pos logit contributions
 disl+0.066
igsaw+0.065
ummy+0.058
 batch+0.058
 chip+0.054
Neg logit contributions
'd-0.074
 managed-0.066
 finally-0.062
 never-0.062
 beginning-0.061
Token rh
Token ID9529
Feature activation-1.39e-01

Pos logit contributions
igsaw+0.207
 disl+0.204
ummy+0.177
 batch+0.170
 chip+0.153
Neg logit contributions
'd-0.302
 managed-0.266
 beginning-0.261
 never-0.256
 doomed-0.255
Tokenin
Token ID259
Feature activation-6.39e-02

Pos logit contributions
igsaw+0.310
 disl+0.309
ummy+0.282
 batch+0.275
 chip+0.265
Neg logit contributions
'd-0.222
 managed-0.189
pped-0.175
 beginning-0.174
 doomed-0.173
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
igsaw+0.056
 disl+0.052
ummy+0.040
 batch+0.033
 chip+0.025
Neg logit contributions
'd-0.196
 managed-0.180
 never-0.176
 heard-0.174
 remembered-0.173
Tokeneros
Token ID27498
Feature activation-3.26e-02

Pos logit contributions
igsaw+0.382
 disl+0.354
 batch+0.316
ummy+0.290
aj+0.283
Neg logit contributions
'd-0.515
pped-0.479
 back-0.478
 managed-0.472
 never-0.470
Token felt
Token ID2936
Feature activation+1.64e-01

Pos logit contributions
 disl+0.074
igsaw+0.071
 batch+0.069
 chip+0.068
ummy+0.067
Neg logit contributions
'd-0.041
pped-0.034
 beginning-0.034
 forward-0.033
 doomed-0.032
Token che
Token ID1125
Feature activation-4.20e-02

Pos logit contributions
'd+0.298
 managed+0.234
pped+0.233
iers+0.230
 stumbled+0.217
Neg logit contributions
 disl-0.322
igsaw-0.287
ummy-0.263
 chip-0.247
 batch-0.246
Tokenery
Token ID1924
Feature activation-1.17e-02

Pos logit contributions
igsaw+0.080
 disl+0.077
 batch+0.074
ummy+0.073
mm+0.072
Neg logit contributions
'd-0.070
 come-0.058
 beginning-0.058
 back-0.056
 doomed-0.055
Token and
Token ID290
Feature activation-7.14e-02

Pos logit contributions
igsaw+0.022
ummy+0.022
 chip+0.020
 disl+0.019
 batch+0.019
Neg logit contributions
'd-0.018
pped-0.016
aked-0.015
 managed-0.015
 stumbled-0.015
Token Lily
Token ID20037
Feature activation+3.88e-02

Pos logit contributions
 disl+0.072
igsaw+0.071
 batch+0.064
ummy+0.064
 chip+0.061
Neg logit contributions
'd-0.046
 managed-0.038
 found-0.036
 beginning-0.036
 finally-0.035
Token loved
Token ID6151
Feature activation+2.31e-02

Pos logit contributions
'd+0.045
 managed+0.036
 beginning+0.033
pped+0.033
 doomed+0.032
Neg logit contributions
 disl-0.102
igsaw-0.101
ummy-0.094
 batch-0.092
 chip-0.090
Token to
Token ID284
Feature activation+4.31e-03

Pos logit contributions
'd+0.026
 been+0.022
 came+0.022
 never+0.022
 back+0.022
Neg logit contributions
igsaw-0.062
 disl-0.058
ummy-0.054
mmm-0.052
mm-0.052
Token eat
Token ID4483
Feature activation-2.86e-03

Pos logit contributions
'd+0.007
 managed+0.006
 finally+0.006
 found+0.006
 never+0.006
Neg logit contributions
igsaw-0.009
 disl-0.009
ummy-0.009
 batch-0.008
 chip-0.008
Token av
Token ID1196
Feature activation-7.43e-02

Pos logit contributions
igsaw+0.006
 disl+0.006
acher+0.005
rig+0.005
aj+0.005
Neg logit contributions
 very-0.005
 really-0.004
 came-0.004
 out-0.004
 back-0.004
Tokenoc
Token ID420
Feature activation-2.41e-01

Pos logit contributions
 disl+0.086
igsaw+0.082
ummy+0.056
 batch+0.052
aj+0.049
Neg logit contributions
'd-0.196
 found-0.185
 managed-0.184
 never-0.181
pped-0.177
Tokenados
Token ID22484
Feature activation-1.84e-02

Pos logit contributions
igsaw+0.310
 disl+0.305
 batch+0.251
rig+0.233
aj+0.232
Neg logit contributions
pped-0.547
'd-0.534
 managed-0.515
 back-0.511
 finally-0.502
Token,
Token ID11
Feature activation-1.27e-01

Pos logit contributions
 disl+0.047
igsaw+0.046
 batch+0.039
ummy+0.039
aj+0.038
Neg logit contributions
'd-0.025
 quite-0.021
 never-0.021
 back-0.021
 managed-0.020
Token they
Token ID484
Feature activation+1.52e-01

Pos logit contributions
igsaw+0.358
 disl+0.346
ummy+0.321
 batch+0.310
 chip+0.300
Neg logit contributions
'd-0.147
 managed-0.110
 never-0.105
 finally-0.101
 found-0.100
Token were
Token ID547
Feature activation+1.96e-01

Pos logit contributions
 doomed+0.177
'd+0.177
pped+0.165
 beginning+0.161
 managed+0.159
Neg logit contributions
 disl-0.339
 batch-0.331
ummy-0.324
 chip-0.321
igsaw-0.319
Token her
Token ID607
Feature activation+3.11e-02

Pos logit contributions
'd+0.345
 managed+0.297
pped+0.277
 beginning+0.277
 doomed+0.275
Neg logit contributions
igsaw-0.402
 disl-0.400
ummy-0.366
 batch-0.356
 chip-0.343
Token different
Token ID1180
Feature activation-1.38e-01

Pos logit contributions
igsaw+0.072
 disl+0.071
ummy+0.065
 batch+0.063
 chip+0.060
Neg logit contributions
'd-0.064
 managed-0.055
 beginning-0.052
 doomed-0.052
pped-0.052
Token directions
Token ID11678
Feature activation-7.01e-02

Pos logit contributions
igsaw+0.247
 disl+0.242
ummy+0.218
 batch+0.213
 chip+0.208
Neg logit contributions
'd-0.281
 managed-0.247
 beginning-0.239
 heard-0.234
 found-0.234
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
igsaw+0.132
 disl+0.131
ummy+0.124
 batch+0.114
 chip+0.111
Neg logit contributions
'd-0.129
 managed-0.115
pped-0.109
 doomed-0.108
 beginning-0.104
Token but
Token ID475
Feature activation+2.64e-02

Pos logit contributions
 disl+0.262
igsaw+0.259
ummy+0.236
 batch+0.232
 chip+0.225
Neg logit contributions
'd-0.208
 managed-0.177
pped-0.169
 doomed-0.163
 beginning-0.162
Token we
Token ID356
Feature activation+1.73e-01

Pos logit contributions
'd+0.041
 managed+0.035
pped+0.033
 beginning+0.032
 doomed+0.032
Neg logit contributions
igsaw-0.059
 disl-0.059
ummy-0.054
 batch-0.053
 chip-0.051
Token must
Token ID1276
Feature activation+1.34e-01

Pos logit contributions
'd+0.199
pped+0.192
 beginning+0.183
 doomed+0.173
amed+0.170
Neg logit contributions
 disl-0.413
igsaw-0.386
ummy-0.343
 batch-0.336
 chip-0.322
Token take
Token ID1011
Feature activation+1.05e-01

Pos logit contributions
'd+0.177
 managed+0.144
pped+0.138
 doomed+0.134
 beginning+0.133
Neg logit contributions
 disl-0.335
igsaw-0.319
ummy-0.294
 batch-0.281
 chip-0.280
Token turns
Token ID4962
Feature activation+3.35e-02

Pos logit contributions
'd+0.180
 managed+0.150
 beginning+0.140
pped+0.138
 doomed+0.138
Neg logit contributions
igsaw-0.225
 disl-0.224
ummy-0.203
 batch-0.197
 chip-0.192
Token.
Token ID13
Feature activation-3.11e-02

Pos logit contributions
'd+0.057
 managed+0.048
 beginning+0.045
 doomed+0.045
pped+0.044
Neg logit contributions
 disl-0.071
igsaw-0.071
ummy-0.064
 batch-0.063
 chip-0.061
Token First
Token ID3274
Feature activation-1.81e-02

Pos logit contributions
 disl+0.065
igsaw+0.064
ummy+0.058
 batch+0.058
 chip+0.054
Neg logit contributions
'd-0.053
 managed-0.045
 finally-0.043
 never-0.043
 beginning-0.043
Token,
Token ID11
Feature activation-1.30e-01

Pos logit contributions
igsaw+0.029
 disl+0.029
ummy+0.026
 batch+0.025
 chip+0.024
Neg logit contributions
'd-0.040
 managed-0.035
 beginning-0.033
pped-0.033
 doomed-0.033
Token mom
Token ID1995
Feature activation-2.19e-02

Pos logit contributions
'd+0.070
 back+0.060
 managed+0.059
 found+0.059
 finally+0.058
Neg logit contributions
 disl-0.075
igsaw-0.073
ummy-0.063
 batch-0.062
mm-0.060
Tokenmy
Token ID1820
Feature activation+6.89e-02

Pos logit contributions
igsaw+0.046
 disl+0.045
ummy+0.043
 batch+0.042
 chip+0.040
Neg logit contributions
'd-0.036
 managed-0.032
 doomed-0.030
pped-0.030
 beginning-0.029
Token if
Token ID611
Feature activation-2.51e-02

Pos logit contributions
'd+0.119
 managed+0.101
 beginning+0.095
 never+0.093
 found+0.093
Neg logit contributions
igsaw-0.145
 disl-0.145
ummy-0.129
 batch-0.126
 chip-0.122
Token she
Token ID673
Feature activation+1.63e-01

Pos logit contributions
 disl+0.039
igsaw+0.036
ummy+0.035
 chip+0.034
 batch+0.034
Neg logit contributions
'd-0.053
pped-0.048
 managed-0.047
 doomed-0.047
 beginning-0.046
Token could
Token ID714
Feature activation+1.76e-01

Pos logit contributions
 beginning+0.139
 doomed+0.132
pped+0.124
 forward+0.120
'd+0.118
Neg logit contributions
igsaw-0.415
ummy-0.414
 disl-0.413
 chip-0.402
 batch-0.397
Token have
Token ID423
Feature activation+1.89e-01

Pos logit contributions
'd+0.303
 managed+0.266
 doomed+0.261
 beginning+0.251
pped+0.247
Neg logit contributions
igsaw-0.357
 disl-0.349
ummy-0.341
 batch-0.321
 chip-0.318
Token it
Token ID340
Feature activation+1.09e-01

Pos logit contributions
'd+0.303
 managed+0.251
 beginning+0.235
 doomed+0.234
pped+0.228
Neg logit contributions
igsaw-0.420
 disl-0.418
ummy-0.383
 batch-0.380
 chip-0.365
Token.
Token ID13
Feature activation-3.09e-02

Pos logit contributions
'd+0.199
 managed+0.172
pped+0.166
 doomed+0.166
 beginning+0.162
Neg logit contributions
igsaw-0.211
 disl-0.204
ummy-0.196
 batch-0.188
 chip-0.181
Token Her
Token ID2332
Feature activation+2.72e-02

Pos logit contributions
igsaw+0.071
 disl+0.071
con+0.059
ummy+0.058
izing+0.058
Neg logit contributions
 finally-0.043
'd-0.043
 back-0.040
 found-0.040
 came-0.040
Token mom
Token ID1995
Feature activation-2.35e-02

Pos logit contributions
'd+0.053
 managed+0.045
 finally+0.044
 back+0.044
 never+0.043
Neg logit contributions
igsaw-0.053
 disl-0.053
 batch-0.045
ummy-0.044
con-0.043
Tokenmy
Token ID1820
Feature activation+6.72e-02

Pos logit contributions
igsaw+0.049
 disl+0.049
ummy+0.046
 batch+0.045
 chip+0.044
Neg logit contributions
'd-0.039
 managed-0.033
 doomed-0.032
 beginning-0.031
pped-0.031
Token park
Token ID3952
Feature activation-3.73e-03

Pos logit contributions
 disl+0.253
igsaw+0.245
mmm+0.220
mm+0.216
ummy+0.215
Neg logit contributions
'd-0.185
 back-0.170
 beginning-0.165
 found-0.163
 never-0.156
Token.
Token ID13
Feature activation-3.11e-02

Pos logit contributions
igsaw+0.008
 disl+0.008
ummy+0.007
 batch+0.007
 chip+0.007
Neg logit contributions
'd-0.006
 managed-0.005
 beginning-0.005
 never-0.005
pped-0.005
Token One
Token ID1881
Feature activation-2.33e-02

Pos logit contributions
igsaw+0.069
 disl+0.067
ummy+0.060
 batch+0.060
 chip+0.057
Neg logit contributions
'd-0.053
 found-0.043
 finally-0.043
 beginning-0.043
 managed-0.042
Token day
Token ID1110
Feature activation-7.97e-02

Pos logit contributions
 disl+0.054
igsaw+0.053
con+0.046
 batch+0.045
ummy+0.045
Neg logit contributions
'd-0.038
 found-0.032
 came-0.031
 heard-0.031
 been-0.031
Token,
Token ID11
Feature activation-1.29e-01

Pos logit contributions
igsaw+0.158
 disl+0.153
 batch+0.138
ummy+0.132
 chip+0.127
Neg logit contributions
'd-0.153
 managed-0.139
 came-0.134
 found-0.133
 heard-0.132
Token they
Token ID484
Feature activation+1.51e-01

Pos logit contributions
igsaw+0.270
 disl+0.259
 batch+0.224
ummy+0.222
aj+0.210
Neg logit contributions
'd-0.239
 finally-0.221
 managed-0.220
 came-0.217
 found-0.215
Token saw
Token ID2497
Feature activation+6.00e-02

Pos logit contributions
 doomed+0.184
'd+0.176
 beginning+0.162
ps+0.156
pped+0.153
Neg logit contributions
ummy-0.315
 batch-0.312
 disl-0.310
igsaw-0.308
 pige-0.299
Token a
Token ID257
Feature activation-6.84e-02

Pos logit contributions
'd+0.068
 managed+0.053
 beginning+0.047
pped+0.047
 doomed+0.046
Neg logit contributions
igsaw-0.162
 disl-0.161
ummy-0.150
 batch-0.147
 chip-0.143
Token big
Token ID1263
Feature activation-1.93e-01

Pos logit contributions
 disl+0.143
igsaw+0.135
ummy+0.122
mm+0.116
 batch+0.113
Neg logit contributions
'd-0.125
 managed-0.103
 heard-0.103
 beginning-0.101
 found-0.100
Token dog
Token ID3290
Feature activation-3.38e-02

Pos logit contributions
 disl+0.386
igsaw+0.351
aj+0.306
grand+0.301
ummy+0.300
Neg logit contributions
'd-0.345
 back-0.322
 found-0.313
 came-0.309
 heard-0.309
Token.
Token ID13
Feature activation-3.37e-02

Pos logit contributions
igsaw+0.070
 disl+0.069
ummy+0.065
 batch+0.064
 chip+0.062
Neg logit contributions
'd-0.058
 managed-0.050
 beginning-0.045
 doomed-0.045
pped-0.045
Token Ben
Token ID3932
Feature activation+9.38e-03

Pos logit contributions
igsaw+0.130
 disl+0.128
ummy+0.117
 batch+0.114
 chip+0.108
Neg logit contributions
'd-0.121
 managed-0.106
 beginning-0.100
pped-0.099
 doomed-0.099
Token are
Token ID389
Feature activation+1.79e-01

Pos logit contributions
'd+0.015
 managed+0.013
pped+0.012
 doomed+0.012
 beginning+0.012
Neg logit contributions
 disl-0.021
igsaw-0.020
ummy-0.018
 batch-0.018
 chip-0.018
Token happy
Token ID3772
Feature activation-1.06e-01

Pos logit contributions
'd+0.391
 managed+0.311
pped+0.308
 stumbled+0.297
 beginning+0.281
Neg logit contributions
 disl-0.300
 chip-0.271
igsaw-0.259
ummy-0.256
 batch-0.253
Token too
Token ID1165
Feature activation+7.15e-02

Pos logit contributions
igsaw+0.247
 disl+0.244
ummy+0.224
 batch+0.220
 chip+0.211
Neg logit contributions
'd-0.159
 managed-0.134
 beginning-0.124
 doomed-0.122
 never-0.122
Token.
Token ID13
Feature activation-3.42e-02

Pos logit contributions
'd+0.060
 back+0.054
 never+0.054
 very+0.052
 quite+0.051
Neg logit contributions
igsaw-0.212
 disl-0.204
 batch-0.190
con-0.182
 bou-0.177
Token They
Token ID1119
Feature activation+1.77e-01

Pos logit contributions
igsaw+0.086
 batch+0.080
 disl+0.079
con+0.072
ummy+0.072
Neg logit contributions
'd-0.044
 could-0.040
 found-0.039
 came-0.039
 back-0.038
Token like
Token ID588
Feature activation-7.72e-03

Pos logit contributions
'd+0.282
 managed+0.245
pped+0.240
 beginning+0.237
 doomed+0.237
Neg logit contributions
 disl-0.383
igsaw-0.370
ummy-0.331
 batch-0.328
 chip-0.324
Token to
Token ID284
Feature activation+1.48e-03

Pos logit contributions
igsaw+0.018
 disl+0.017
ummy+0.016
 batch+0.016
mmm+0.015
Neg logit contributions
'd-0.012
 managed-0.010
 never-0.010
 back-0.010
 found-0.009
Token play
Token ID711
Feature activation+1.64e-02

Pos logit contributions
'd+0.003
 managed+0.002
 beginning+0.002
pped+0.002
 doomed+0.002
Neg logit contributions
 disl-0.003
igsaw-0.003
ummy-0.003
 batch-0.003
 chip-0.002
Token with
Token ID351
Feature activation-4.21e-02

Pos logit contributions
'd+0.034
 managed+0.030
 beginning+0.028
 doomed+0.028
pped+0.028
Neg logit contributions
 disl-0.029
igsaw-0.028
 batch-0.025
ummy-0.025
 chip-0.024
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
igsaw+0.106
 disl+0.096
ummy+0.093
 batch+0.090
mmm+0.088
Neg logit contributions
'd-0.053
 back-0.051
 found-0.050
 heard-0.050
 never-0.048
Token Lily
Token ID20037
Feature activation+4.17e-02

Pos logit contributions
igsaw+0.068
 disl+0.066
ummy+0.059
 batch+0.056
mmm+0.053
Neg logit contributions
'd-0.045
 finally-0.039
 found-0.039
 managed-0.039
 came-0.038
Token was
Token ID373
Feature activation+1.93e-01

Pos logit contributions
'd+0.051
 beginning+0.043
 doomed+0.042
 forward+0.042
pped+0.040
Neg logit contributions
 disl-0.100
igsaw-0.095
 batch-0.093
 chip-0.092
ummy-0.090
Token upset
Token ID9247
Feature activation-8.81e-02

Pos logit contributions
'd+0.375
pped+0.283
 managed+0.266
bridge+0.249
 stumbled+0.249
Neg logit contributions
 disl-0.363
 chip-0.349
 batch-0.321
igsaw-0.313
 viol-0.290
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
igsaw+0.172
 disl+0.171
ummy+0.153
 batch+0.150
 chip+0.143
Neg logit contributions
'd-0.167
 managed-0.146
 beginning-0.137
pped-0.137
 doomed-0.136
Token but
Token ID475
Feature activation+2.13e-02

Pos logit contributions
 disl+0.190
igsaw+0.184
ummy+0.168
 batch+0.164
 chip+0.160
Neg logit contributions
'd-0.282
 managed-0.252
pped-0.241
 doomed-0.237
 beginning-0.235
Token she
Token ID673
Feature activation+1.59e-01

Pos logit contributions
'd+0.041
 managed+0.034
pped+0.033
 doomed+0.032
 beginning+0.032
Neg logit contributions
 disl-0.036
 chip-0.034
ummy-0.033
igsaw-0.032
 batch-0.032
Token couldn
Token ID3521
Feature activation+1.17e-01

Pos logit contributions
'd+0.182
 forward+0.182
 beginning+0.175
 doomed+0.164
iers+0.163
Neg logit contributions
 disl-0.347
 chip-0.336
 batch-0.332
ummy-0.322
igsaw-0.314
Token't
Token ID470
Feature activation+1.06e-01

Pos logit contributions
'd+0.193
 managed+0.168
 beginning+0.158
 remembered+0.157
 doomed+0.152
Neg logit contributions
igsaw-0.254
ummy-0.240
 disl-0.236
 chip-0.229
 batch-0.219
Token stay
Token ID2652
Feature activation+9.55e-02

Pos logit contributions
'd+0.166
amed+0.148
 doomed+0.146
 beginning+0.144
aked+0.143
Neg logit contributions
 disl-0.215
igsaw-0.212
 chip-0.186
ummy-0.185
mmm-0.174
Token mad
Token ID8805
Feature activation-7.40e-02

Pos logit contributions
'd+0.156
 managed+0.113
pped+0.104
 beginning+0.103
 been+0.102
Neg logit contributions
 disl-0.213
igsaw-0.213
ummy-0.194
 batch-0.194
 chip-0.187
Token at
Token ID379
Feature activation-1.08e-01

Pos logit contributions
igsaw+0.139
ummy+0.138
 disl+0.135
 batch+0.120
 chip+0.119
Neg logit contributions
'd-0.144
 managed-0.123
 doomed-0.113
pped-0.113
 beginning-0.109
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
 disl+0.014
igsaw+0.014
 batch+0.012
grand+0.011
ummy+0.011
Neg logit contributions
'd-0.011
 back-0.010
 came-0.010
 heard-0.010
 found-0.010
Token Mr
Token ID1770
Feature activation-3.98e-02

Pos logit contributions
igsaw+0.258
 disl+0.250
 batch+0.219
ummy+0.218
con+0.201
Neg logit contributions
'd-0.237
 managed-0.196
 never-0.194
 beginning-0.194
 back-0.192
Token.
Token ID13
Feature activation-3.55e-02

Pos logit contributions
igsaw+0.088
 disl+0.088
ummy+0.080
 batch+0.079
 chip+0.075
Neg logit contributions
'd-0.064
 managed-0.054
 beginning-0.050
pped-0.050
 doomed-0.050
Token Bear
Token ID14732
Feature activation-2.19e-03

Pos logit contributions
igsaw+0.063
 disl+0.062
 batch+0.054
ummy+0.054
 chip+0.049
Neg logit contributions
'd-0.075
 managed-0.065
 beginning-0.062
 never-0.061
 doomed-0.061
Token.
Token ID13
Feature activation-3.57e-02

Pos logit contributions
 disl+0.004
igsaw+0.004
ummy+0.004
 batch+0.004
 chip+0.003
Neg logit contributions
'd-0.004
 managed-0.004
pped-0.004
 beginning-0.004
 doomed-0.004
Token Would
Token ID10928
Feature activation+1.11e-01

Pos logit contributions
igsaw+0.078
 disl+0.077
 batch+0.071
ummy+0.069
 chip+0.064
Neg logit contributions
'd-0.060
 managed-0.048
 beginning-0.046
 doomed-0.046
 never-0.045
Token you
Token ID345
Feature activation+1.54e-01

Pos logit contributions
'd+0.217
 managed+0.188
 beginning+0.178
pped+0.178
 never+0.177
Neg logit contributions
igsaw-0.212
 disl-0.210
ummy-0.186
 batch-0.183
 chip-0.174
Token like
Token ID588
Feature activation-4.33e-03

Pos logit contributions
'd+0.228
 doomed+0.218
 managed+0.206
pped+0.201
 beginning+0.189
Neg logit contributions
igsaw-0.325
 disl-0.324
ummy-0.297
 batch-0.294
 chip-0.275
Token to
Token ID284
Feature activation+4.50e-03

Pos logit contributions
igsaw+0.011
 disl+0.010
ummy+0.009
 batch+0.009
 chip+0.009
Neg logit contributions
'd-0.006
 never-0.005
 found-0.005
 come-0.005
pped-0.005
Token taste
Token ID6938
Feature activation+3.94e-02

Pos logit contributions
'd+0.006
 managed+0.005
 finally+0.005
 never+0.005
 come+0.005
Neg logit contributions
igsaw-0.011
 disl-0.011
ummy-0.010
 batch-0.010
 chip-0.009
Token these
Token ID777
Feature activation-6.38e-02

Pos logit contributions
 back+0.046
 really+0.045
'd+0.044
 come+0.044
 quite+0.043
Neg logit contributions
igsaw-0.103
 disl-0.096
ummy-0.083
con-0.082
aj-0.081
Token the
Token ID262
Feature activation-1.17e-01

Pos logit contributions
 disl+0.099
igsaw+0.098
ummy+0.090
 batch+0.086
 chip+0.083
Neg logit contributions
'd-0.075
 managed-0.064
 beginning-0.063
 finally-0.062
 never-0.062
Token woods
Token ID16479
Feature activation+7.45e-03

Pos logit contributions
 disl+0.226
igsaw+0.226
ummy+0.201
 batch+0.197
 chip+0.187
Neg logit contributions
'd-0.221
 managed-0.191
 beginning-0.187
 never-0.182
 doomed-0.181
Token.
Token ID13
Feature activation-3.04e-02

Pos logit contributions
'd+0.015
 managed+0.013
pped+0.013
 doomed+0.013
 remembered+0.012
Neg logit contributions
 disl-0.012
igsaw-0.012
ummy-0.012
 batch-0.011
 chip-0.011
Token It
Token ID632
Feature activation+1.62e-01

Pos logit contributions
igsaw+0.060
 disl+0.059
ummy+0.055
 batch+0.052
 chip+0.051
Neg logit contributions
'd-0.056
 managed-0.049
pped-0.046
 doomed-0.046
 beginning-0.045
Token started
Token ID2067
Feature activation+1.30e-01

Pos logit contributions
pped+0.241
'd+0.240
 forward+0.222
 doomed+0.218
 beginning+0.209
Neg logit contributions
 disl-0.298
 chip-0.291
igsaw-0.285
 batch-0.284
ummy-0.277
Token to
Token ID284
Feature activation+2.37e-03

Pos logit contributions
'd+0.234
 managed+0.200
pped+0.193
 doomed+0.188
 beginning+0.180
Neg logit contributions
igsaw-0.258
 disl-0.258
ummy-0.242
 batch-0.233
 chip-0.228
Token rain
Token ID6290
Feature activation-5.26e-02

Pos logit contributions
'd+0.004
 managed+0.004
pped+0.004
 forward+0.004
 doomed+0.004
Neg logit contributions
 disl-0.004
igsaw-0.004
 chip-0.004
 batch-0.004
ummy-0.004
Token really
Token ID1107
Feature activation+3.18e-02

Pos logit contributions
igsaw+0.097
 disl+0.095
ummy+0.089
 batch+0.085
 chip+0.084
Neg logit contributions
'd-0.103
 managed-0.090
 doomed-0.085
pped-0.085
 beginning-0.084
Token hard
Token ID1327
Feature activation-1.14e-01

Pos logit contributions
'd+0.055
 managed+0.047
pped+0.044
 beginning+0.043
 doomed+0.043
Neg logit contributions
igsaw-0.067
 disl-0.066
ummy-0.061
 batch-0.059
 chip-0.058
Token.
Token ID13
Feature activation-3.05e-02

Pos logit contributions
igsaw+0.225
 disl+0.219
ummy+0.207
 batch+0.197
 chip+0.190
Neg logit contributions
'd-0.210
 managed-0.182
pped-0.171
 doomed-0.168
 beginning-0.168
Token Sally
Token ID25737
Feature activation+1.77e-02

Pos logit contributions
igsaw+0.059
 disl+0.058
ummy+0.052
 batch+0.051
 chip+0.049
Neg logit contributions
'd-0.058
 managed-0.051
 beginning-0.048
 finally-0.047
 found-0.047
Token all
Token ID477
Feature activation+1.06e-02

Pos logit contributions
igsaw+0.233
 disl+0.232
ummy+0.212
 batch+0.207
 chip+0.200
Neg logit contributions
'd-0.175
 managed-0.149
 beginning-0.139
pped-0.138
 doomed-0.137
Token the
Token ID262
Feature activation-1.08e-01

Pos logit contributions
'd+0.017
 beginning+0.015
 managed+0.015
 never+0.014
 been+0.014
Neg logit contributions
igsaw-0.023
 disl-0.023
ummy-0.021
 batch-0.021
mm-0.020
Token colourful
Token ID44799
Feature activation-1.57e-01

Pos logit contributions
igsaw+0.190
 disl+0.186
ummy+0.168
 batch+0.162
 chip+0.155
Neg logit contributions
'd-0.229
 managed-0.198
 beginning-0.192
 heard-0.189
 found-0.188
Token flowers
Token ID12734
Feature activation-1.96e-02

Pos logit contributions
igsaw+0.291
 disl+0.282
ummy+0.257
 batch+0.252
 chip+0.241
Neg logit contributions
'd-0.311
 managed-0.268
 heard-0.259
 beginning-0.258
 found-0.257
Token.
Token ID13
Feature activation-2.80e-02

Pos logit contributions
igsaw+0.038
 disl+0.037
ummy+0.035
 batch+0.033
 chip+0.032
Neg logit contributions
'd-0.035
 managed-0.031
pped-0.029
 doomed-0.029
 remembered-0.028
Token Sometimes
Token ID8975
Feature activation+2.26e-02

Pos logit contributions
 disl+0.058
igsaw+0.058
ummy+0.051
 batch+0.051
 chip+0.048
Neg logit contributions
'd-0.049
 managed-0.042
 finally-0.040
 beginning-0.040
 never-0.040
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
'd+0.041
 managed+0.035
 never+0.033
 beginning+0.033
 finally+0.032
Neg logit contributions
igsaw-0.047
 disl-0.046
ummy-0.042
 batch-0.041
 chip-0.039
Token he
Token ID339
Feature activation+1.53e-01

Pos logit contributions
igsaw+0.267
 disl+0.265
ummy+0.240
 batch+0.235
 chip+0.223
Neg logit contributions
'd-0.213
 managed-0.183
 beginning-0.172
 never-0.170
 finally-0.168
Token even
Token ID772
Feature activation+2.67e-02

Pos logit contributions
pped+0.139
'd+0.131
 doomed+0.124
 beginning+0.123
 forward+0.111
Neg logit contributions
 disl-0.388
igsaw-0.357
 batch-0.342
ummy-0.335
 chip-0.332
Token saw
Token ID2497
Feature activation+6.28e-02

Pos logit contributions
'd+0.029
pped+0.025
 doomed+0.024
 beginning+0.023
 managed+0.022
Neg logit contributions
 disl-0.068
igsaw-0.065
 batch-0.061
 chip-0.059
con-0.058
Token a
Token ID257
Feature activation-6.54e-02

Pos logit contributions
'd+0.099
 managed+0.084
pped+0.078
 doomed+0.077
 beginning+0.077
Neg logit contributions
 disl-0.139
igsaw-0.138
ummy-0.127
 batch-0.124
 chip-0.120
Token under
Token ID739
Feature activation-5.95e-02

Pos logit contributions
'd+0.216
 managed+0.179
pped+0.164
 beginning+0.163
 doomed+0.162
Neg logit contributions
igsaw-0.346
 disl-0.345
ummy-0.321
 batch-0.313
 chip-0.298
Token the
Token ID262
Feature activation-1.12e-01

Pos logit contributions
 disl+0.140
igsaw+0.137
ummy+0.129
 batch+0.125
mm+0.124
Neg logit contributions
'd-0.084
 finally-0.074
 managed-0.074
 back-0.073
 heard-0.073
Token blankets
Token ID34794
Feature activation-4.46e-02

Pos logit contributions
 disl+0.202
igsaw+0.190
 batch+0.167
ummy+0.163
mm+0.162
Neg logit contributions
'd-0.227
 back-0.209
 beginning-0.201
 heard-0.196
 found-0.193
Token.
Token ID13
Feature activation-3.20e-02

Pos logit contributions
igsaw+0.082
ummy+0.078
 disl+0.075
 batch+0.071
 chip+0.070
Neg logit contributions
'd-0.085
 managed-0.074
 doomed-0.071
 remembered-0.069
pped-0.069
Token He
Token ID679
Feature activation+1.71e-01

Pos logit contributions
igsaw+0.067
 disl+0.066
ummy+0.060
 batch+0.059
 chip+0.056
Neg logit contributions
'd-0.056
 managed-0.048
 beginning-0.045
pped-0.045
 doomed-0.044
Token found
Token ID1043
Feature activation+1.02e-01

Pos logit contributions
'd+0.209
 doomed+0.195
 beginning+0.189
 forward+0.186
pped+0.177
Neg logit contributions
 disl-0.348
 batch-0.341
igsaw-0.334
 grand-0.333
aj-0.332
Token his
Token ID465
Feature activation+1.03e-02

Pos logit contributions
'd+0.169
 managed+0.134
 beginning+0.124
 doomed+0.122
 stumbled+0.119
Neg logit contributions
igsaw-0.220
 disl-0.215
ummy-0.207
 chip-0.203
 batch-0.202
Token toy
Token ID13373
Feature activation-9.50e-02

Pos logit contributions
'd+0.021
 managed+0.018
 beginning+0.017
 back+0.017
 never+0.017
Neg logit contributions
igsaw-0.020
 disl-0.019
ummy-0.017
 batch-0.017
mmm-0.016
Token and
Token ID290
Feature activation-6.75e-02

Pos logit contributions
igsaw+0.169
ummy+0.163
 disl+0.161
 chip+0.149
 batch+0.149
Neg logit contributions
'd-0.183
 managed-0.164
 doomed-0.152
 beginning-0.150
pped-0.149
Token w
Token ID266
Feature activation-9.73e-02

Pos logit contributions
 disl+0.156
igsaw+0.149
 batch+0.144
 chip+0.142
ummy+0.139
Neg logit contributions
'd-0.092
 beginning-0.076
 doomed-0.074
 forward-0.072
pped-0.071
Tokenagged
Token ID14655
Feature activation+4.03e-02

Pos logit contributions
igsaw+0.168
ummy+0.159
 disl+0.154
aj+0.138
 batch+0.137
Neg logit contributions
'd-0.208
 beginning-0.175
 managed-0.175
 stumbled-0.174
 decided-0.173
Token was
Token ID373
Feature activation+1.91e-01

Pos logit contributions
igsaw+0.205
 disl+0.199
ummy+0.191
 chip+0.191
 batch+0.188
Neg logit contributions
'd-0.134
pped-0.110
 beginning-0.109
 managed-0.108
 doomed-0.107
Token too
Token ID1165
Feature activation+6.99e-02

Pos logit contributions
'd+0.243
pped+0.180
 managed+0.175
 doomed+0.162
bridge+0.162
Neg logit contributions
igsaw-0.453
 disl-0.431
 chip-0.426
 batch-0.420
ummy-0.411
Token big
Token ID1263
Feature activation-1.91e-01

Pos logit contributions
'd+0.108
 managed+0.097
 never+0.088
 heard+0.088
 found+0.087
Neg logit contributions
 disl-0.160
igsaw-0.157
ummy-0.142
 batch-0.142
mm-0.138
Token for
Token ID329
Feature activation+4.29e-02

Pos logit contributions
igsaw+0.313
ummy+0.305
 disl+0.302
 batch+0.273
 chip+0.271
Neg logit contributions
'd-0.386
 managed-0.342
pped-0.325
 doomed-0.322
 beginning-0.319
Token him
Token ID683
Feature activation+1.24e-01

Pos logit contributions
'd+0.064
 managed+0.051
 doomed+0.046
 beginning+0.045
 remembered+0.044
Neg logit contributions
igsaw-0.102
 disl-0.100
ummy-0.097
 batch-0.091
 chip-0.089
Token to
Token ID284
Feature activation+4.12e-03

Pos logit contributions
'd+0.236
 managed+0.203
pped+0.200
 beginning+0.199
 doomed+0.199
Neg logit contributions
 disl-0.220
igsaw-0.219
ummy-0.204
 chip-0.189
 batch-0.189
Token carry
Token ID3283
Feature activation+2.37e-02

Pos logit contributions
'd+0.006
 managed+0.005
 doomed+0.005
 beginning+0.005
pped+0.005
Neg logit contributions
 disl-0.009
igsaw-0.009
ummy-0.008
 batch-0.008
 chip-0.008
Token and
Token ID290
Feature activation-6.74e-02

Pos logit contributions
'd+0.044
 managed+0.036
 beginning+0.034
 doomed+0.034
 remembered+0.034
Neg logit contributions
igsaw-0.047
 disl-0.045
ummy-0.043
 batch-0.040
 chip-0.040
Token he
Token ID339
Feature activation+1.51e-01

Pos logit contributions
 disl+0.143
igsaw+0.138
ummy+0.124
 batch+0.124
 chip+0.123
Neg logit contributions
'd-0.113
 managed-0.092
pped-0.091
 beginning-0.091
 doomed-0.088
Token screamed
Token ID25421
Feature activation+5.71e-02

Pos logit contributions
 beginning+0.182
 forward+0.179
'd+0.177
 doomed+0.168
pped+0.168
Neg logit contributions
 chip-0.297
 batch-0.290
con-0.285
 disl-0.285
ummy-0.273
Token louder
Token ID27089
Feature activation-2.05e-02

Pos logit contributions
'd+0.116
 managed+0.100
 doomed+0.093
 been+0.093
pped+0.092
Neg logit contributions
igsaw-0.095
ummy-0.092
 disl-0.091
 chip-0.085
 batch-0.082
TokenSure
Token ID19457
Feature activation-1.09e-02

Pos logit contributions
'd+0.066
 managed+0.059
 doomed+0.058
 beginning+0.057
pped+0.057
Neg logit contributions
 disl-0.049
igsaw-0.048
ummy-0.041
 batch-0.041
 chip-0.040
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
igsaw+0.018
 disl+0.018
ummy+0.016
 batch+0.015
 chip+0.014
Neg logit contributions
'd-0.024
 managed-0.021
 never-0.020
 beginning-0.020
 heard-0.020
Token I
Token ID314
Feature activation+2.05e-01

Pos logit contributions
igsaw+0.308
 disl+0.301
ummy+0.273
 batch+0.273
 chip+0.256
Neg logit contributions
'd-0.188
 managed-0.152
 never-0.142
 finally-0.139
 beginning-0.138
Token can
Token ID460
Feature activation+1.45e-01

Pos logit contributions
pped+0.288
 doomed+0.272
 beginning+0.266
pered+0.255
amed+0.251
Neg logit contributions
 disl-0.432
igsaw-0.392
oc-0.375
 batch-0.374
ummy-0.371
Token manage
Token ID6687
Feature activation+2.03e-02

Pos logit contributions
'd+0.203
 managed+0.164
 never+0.151
pped+0.150
 finally+0.149
Neg logit contributions
igsaw-0.360
 disl-0.353
ummy-0.326
 batch-0.321
 chip-0.305
Token that
Token ID326
Feature activation-4.12e-02

Pos logit contributions
'd+0.025
 back+0.023
 never+0.022
pped+0.022
 really+0.021
Neg logit contributions
igsaw-0.050
 disl-0.049
ummy-0.044
mm-0.044
 batch-0.043
Token,"
Token ID553
Feature activation-5.84e-02

Pos logit contributions
igsaw+0.077
 disl+0.076
ummy+0.069
 batch+0.067
 chip+0.064
Neg logit contributions
'd-0.080
 managed-0.070
 beginning-0.066
pped-0.066
 doomed-0.066
Token replied
Token ID8712
Feature activation+5.94e-02

Pos logit contributions
igsaw+0.164
 disl+0.139
ummy+0.138
aj+0.133
yt+0.124
Neg logit contributions
'd-0.070
 found-0.069
 came-0.069
 decided-0.068
 finally-0.068
Token the
Token ID262
Feature activation-1.13e-01

Pos logit contributions
 finally+0.066
 back+0.065
 came+0.062
 found+0.058
 only+0.057
Neg logit contributions
igsaw-0.165
 disl-0.157
aj-0.147
ummy-0.142
yt-0.142
Token bald
Token ID28273
Feature activation-1.51e-01

Pos logit contributions
 disl+0.196
igsaw+0.192
 chip+0.179
ummy+0.177
 batch+0.171
Neg logit contributions
'd-0.232
 managed-0.209
pped-0.197
 doomed-0.192
 stumbled-0.188
Token man
Token ID582
Feature activation+1.66e-02

Pos logit contributions
igsaw+0.253
 disl+0.250
ummy+0.223
 batch+0.215
 chip+0.203
Neg logit contributions
'd-0.326
 managed-0.288
 beginning-0.273
pped-0.272
 doomed-0.271
Token
Token ID220
Feature activation+4.50e-02

Pos logit contributions
 disl+0.063
igsaw+0.063
 batch+0.057
ummy+0.057
 chip+0.054
Neg logit contributions
'd-0.052
 managed-0.043
 finally-0.042
 found-0.041
 never-0.041
Token
Token ID198
Feature activation-2.77e-02

Pos logit contributions
'd+0.081
 managed+0.069
 found+0.065
 never+0.065
 finally+0.065
Neg logit contributions
igsaw-0.090
 disl-0.090
 batch-0.080
ummy-0.080
 chip-0.076
Token
Token ID198
Feature activation-2.71e-02

Pos logit contributions
igsaw+0.058
 disl+0.057
ummy+0.053
 batch+0.051
 chip+0.049
Neg logit contributions
'd-0.048
 managed-0.041
 never-0.039
 finally-0.039
 found-0.039
TokenOne
Token ID3198
Feature activation-4.57e-02

Pos logit contributions
igsaw+0.054
 disl+0.054
ummy+0.050
 batch+0.048
 chip+0.046
Neg logit contributions
'd-0.050
 managed-0.043
 never-0.040
pped-0.040
 beginning-0.040
Token day
Token ID1110
Feature activation-6.92e-02

Pos logit contributions
 disl+0.091
igsaw+0.089
ummy+0.081
 batch+0.079
 chip+0.076
Neg logit contributions
'd-0.086
 managed-0.073
 beginning-0.071
 never-0.069
 found-0.069
Token,
Token ID11
Feature activation-1.23e-01

Pos logit contributions
 disl+0.124
igsaw+0.123
ummy+0.110
 batch+0.108
 chip+0.100
Neg logit contributions
'd-0.141
 managed-0.127
 finally-0.120
 found-0.119
 heard-0.119
Token Tim
Token ID5045
Feature activation+1.27e-02

Pos logit contributions
 disl+0.251
igsaw+0.249
ummy+0.226
 batch+0.219
 chip+0.203
Neg logit contributions
'd-0.223
 managed-0.199
 finally-0.194
 found-0.187
 never-0.187
Tokenmy
Token ID1820
Feature activation+8.53e-02

Pos logit contributions
'd+0.023
 doomed+0.022
 beginning+0.021
pped+0.021
 forward+0.021
Neg logit contributions
 disl-0.020
igsaw-0.020
ummy-0.020
 batch-0.019
 chip-0.018
Token went
Token ID1816
Feature activation+1.22e-01

Pos logit contributions
'd+0.120
 beginning+0.101
 doomed+0.098
 managed+0.097
pped+0.096
Neg logit contributions
 disl-0.195
igsaw-0.189
ummy-0.180
 batch-0.175
 chip-0.171
Token to
Token ID284
Feature activation+8.53e-03

Pos logit contributions
'd+0.227
 managed+0.197
pped+0.185
 doomed+0.178
 remembered+0.174
Neg logit contributions
igsaw-0.246
 disl-0.225
ummy-0.216
 batch-0.211
con-0.203
Token the
Token ID262
Feature activation-1.06e-01

Pos logit contributions
'd+0.013
 finally+0.011
 managed+0.011
 found+0.011
 heard+0.011
Neg logit contributions
 disl-0.019
igsaw-0.019
ummy-0.018
 batch-0.017
mm-0.017
Token egg
Token ID5935
Feature activation-5.27e-02

Pos logit contributions
igsaw+0.108
 disl+0.107
ummy+0.096
 batch+0.092
 chip+0.088
Neg logit contributions
'd-0.121
 managed-0.106
 beginning-0.101
pped-0.100
 doomed-0.100
Token.
Token ID13
Feature activation-2.87e-02

Pos logit contributions
ummy+0.098
igsaw+0.095
 disl+0.095
 chip+0.094
 batch+0.093
Neg logit contributions
'd-0.092
 managed-0.080
 doomed-0.077
pped-0.074
 stumbled-0.072
Token She
Token ID1375
Feature activation+1.74e-01

Pos logit contributions
igsaw+0.069
 disl+0.066
 batch+0.060
ummy+0.060
mm+0.057
Neg logit contributions
'd-0.039
 finally-0.032
 managed-0.032
 stumbled-0.032
 never-0.032
Token was
Token ID373
Feature activation+1.98e-01

Pos logit contributions
'd+0.262
 beginning+0.228
 forward+0.225
 doomed+0.218
 managed+0.212
Neg logit contributions
 disl-0.372
 batch-0.348
igsaw-0.348
 chip-0.338
ummy-0.335
Token very
Token ID845
Feature activation-7.04e-03

Pos logit contributions
'd+0.345
 managed+0.260
pped+0.243
 forward+0.232
 stumbled+0.232
Neg logit contributions
 disl-0.412
 batch-0.388
 chip-0.388
igsaw-0.383
ummy-0.380
Token sad
Token ID6507
Feature activation-7.73e-02

Pos logit contributions
 disl+0.016
igsaw+0.015
ummy+0.014
 batch+0.014
 chip+0.014
Neg logit contributions
'd-0.011
 managed-0.009
pped-0.009
 beginning-0.009
 doomed-0.008
Token and
Token ID290
Feature activation-6.24e-02

Pos logit contributions
igsaw+0.161
 disl+0.160
ummy+0.144
 batch+0.142
 chip+0.136
Neg logit contributions
'd-0.135
 managed-0.116
 beginning-0.109
pped-0.108
 never-0.108
Token angry
Token ID7954
Feature activation-1.06e-01

Pos logit contributions
 disl+0.136
igsaw+0.117
 batch+0.116
ummy+0.113
 chip+0.110
Neg logit contributions
'd-0.104
pped-0.086
 managed-0.083
 beginning-0.080
 forward-0.076
Token at
Token ID379
Feature activation-1.02e-01

Pos logit contributions
 disl+0.207
igsaw+0.203
ummy+0.192
 batch+0.180
 chip+0.178
Neg logit contributions
'd-0.201
 managed-0.173
pped-0.161
 beginning-0.158
 doomed-0.157
Token Tim
Token ID5045
Feature activation+2.00e-02

Pos logit contributions
igsaw+0.202
 disl+0.202
ummy+0.187
 batch+0.178
 chip+0.172
Neg logit contributions
'd-0.191
 managed-0.165
pped-0.150
 doomed-0.149
 beginning-0.147
Token and
Token ID290
Feature activation-6.23e-02

Pos logit contributions
'd+0.035
 managed+0.030
pped+0.028
 doomed+0.028
 beginning+0.028
Neg logit contributions
 disl-0.040
igsaw-0.040
ummy-0.037
 batch-0.036
 chip-0.035
Token it
Token ID340
Feature activation+1.04e-01

Pos logit contributions
'd+0.004
 managed+0.003
 beginning+0.003
pped+0.003
 doomed+0.003
Neg logit contributions
igsaw-0.004
 disl-0.004
ummy-0.003
 batch-0.003
 chip-0.003
Token was
Token ID373
Feature activation+1.91e-01

Pos logit contributions
'd+0.143
 forward+0.130
 beginning+0.126
pped+0.125
 doomed+0.123
Neg logit contributions
igsaw-0.218
 chip-0.208
 batch-0.206
ummy-0.205
 disl-0.204
Token complete
Token ID1844
Feature activation-7.27e-02

Pos logit contributions
'd+0.319
 doomed+0.260
 managed+0.257
pped+0.255
 heard+0.243
Neg logit contributions
igsaw-0.397
ummy-0.370
 disl-0.370
 batch-0.357
 chip-0.342
Token.
Token ID13
Feature activation-3.48e-02

Pos logit contributions
igsaw+0.137
 disl+0.128
ummy+0.123
 batch+0.119
 chip+0.114
Neg logit contributions
'd-0.138
 managed-0.117
 doomed-0.116
pped-0.116
 beginning-0.113
Token
Token ID198
Feature activation-3.27e-02

Pos logit contributions
 disl+0.075
igsaw+0.072
ummy+0.064
 batch+0.061
izing+0.060
Neg logit contributions
'd-0.053
 finally-0.050
 managed-0.047
 found-0.047
 came-0.046
Token
Token ID198
Feature activation-3.30e-02

Pos logit contributions
 disl+0.078
igsaw+0.077
ummy+0.067
 batch+0.066
 chip+0.063
Neg logit contributions
'd-0.049
 found-0.041
 finally-0.041
 managed-0.041
 never-0.041
TokenTom
Token ID13787
Feature activation+1.21e-02

Pos logit contributions
igsaw+0.057
 disl+0.057
ummy+0.051
 batch+0.049
 chip+0.046
Neg logit contributions
'd-0.070
 managed-0.062
pped-0.058
 never-0.058
 beginning-0.058
Token said
Token ID531
Feature activation+6.81e-02

Pos logit contributions
'd+0.016
 managed+0.013
 found+0.012
 never+0.012
 heard+0.012
Neg logit contributions
igsaw-0.031
 disl-0.031
ummy-0.028
 batch-0.028
 chip-0.027
Token,
Token ID11
Feature activation-1.27e-01

Pos logit contributions
'd+0.145
 managed+0.130
 finally+0.121
 found+0.121
 never+0.121
Neg logit contributions
igsaw-0.115
 disl-0.114
ummy-0.101
 batch-0.099
 chip-0.093
Token "
Token ID366
Feature activation+2.62e-02

Pos logit contributions
 disl+0.249
igsaw+0.249
ummy+0.227
 batch+0.220
 chip+0.215
Neg logit contributions
'd-0.231
 managed-0.202
pped-0.190
 doomed-0.189
 beginning-0.188
TokenThat
Token ID2504
Feature activation+7.16e-02

Pos logit contributions
'd+0.051
 managed+0.045
 beginning+0.042
pped+0.042
 doomed+0.042
Neg logit contributions
igsaw-0.049
 disl-0.049
ummy-0.044
 batch-0.043
 chip-0.041
Token a
Token ID257
Feature activation-6.53e-02

Pos logit contributions
 disl+0.094
igsaw+0.094
ummy+0.087
 batch+0.084
 chip+0.082
Neg logit contributions
'd-0.062
 managed-0.053
 beginning-0.049
 never-0.049
 finally-0.048
Token big
Token ID1263
Feature activation-1.91e-01

Pos logit contributions
igsaw+0.117
 disl+0.117
ummy+0.105
 batch+0.101
 chip+0.096
Neg logit contributions
'd-0.133
 managed-0.117
 beginning-0.110
 doomed-0.109
pped-0.109
Token adventure
Token ID8855
Feature activation-8.79e-02

Pos logit contributions
 disl+0.352
igsaw+0.330
ummy+0.296
mm+0.285
 batch+0.273
Neg logit contributions
'd-0.390
 back-0.340
 managed-0.338
 found-0.336
 never-0.329
Token.
Token ID13
Feature activation-3.01e-02

Pos logit contributions
igsaw+0.186
 disl+0.183
ummy+0.170
 batch+0.165
 chip+0.159
Neg logit contributions
'd-0.150
 managed-0.128
pped-0.120
 doomed-0.119
 beginning-0.118
Token
Token ID220
Feature activation+4.17e-02

Pos logit contributions
 disl+0.062
igsaw+0.061
ummy+0.055
 batch+0.054
 chip+0.051
Neg logit contributions
'd-0.054
 managed-0.047
 finally-0.044
 beginning-0.043
 found-0.043
Token
Token ID198
Feature activation-2.91e-02

Pos logit contributions
'd+0.083
 managed+0.072
 beginning+0.068
pped+0.068
 doomed+0.068
Neg logit contributions
igsaw-0.076
 disl-0.076
ummy-0.068
 batch-0.066
 chip-0.063
Token
Token ID198
Feature activation-2.96e-02

Pos logit contributions
igsaw+0.051
 disl+0.050
ummy+0.045
 batch+0.044
 chip+0.042
Neg logit contributions
'd-0.059
 managed-0.052
 beginning-0.051
 doomed-0.050
pped-0.049
TokenHe
Token ID1544
Feature activation+1.62e-01

Pos logit contributions
igsaw+0.051
 disl+0.051
 batch+0.044
ummy+0.043
 chip+0.043
Neg logit contributions
'd-0.059
 managed-0.052
 beginning-0.051
 doomed-0.051
 back-0.050
Token hopped
Token ID45230
Feature activation+9.01e-02

Pos logit contributions
'd+0.176
pped+0.167
 doomed+0.162
 beginning+0.160
 forward+0.147
Neg logit contributions
 disl-0.363
igsaw-0.355
 batch-0.349
 chip-0.333
oc-0.333
Token and
Token ID290
Feature activation-6.55e-02

Pos logit contributions
'd+0.150
 managed+0.124
 doomed+0.122
pped+0.118
 remembered+0.117
Neg logit contributions
igsaw-0.190
 disl-0.174
ummy-0.170
 batch-0.162
 chip-0.162
Token skipped
Token ID26684
Feature activation+7.37e-02

Pos logit contributions
igsaw+0.163
 disl+0.161
 batch+0.146
ummy+0.146
 chip+0.143
Neg logit contributions
'd-0.082
 managed-0.063
pped-0.063
 doomed-0.063
 beginning-0.063
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
'd+0.004
 back+0.004
 found+0.004
 finally+0.004
 managed+0.004
Neg logit contributions
 disl-0.008
igsaw-0.008
ummy-0.007
mm-0.007
 batch-0.007
Token sack
Token ID23704
Feature activation+8.01e-03

Pos logit contributions
igsaw+0.205
 disl+0.196
ummy+0.183
 batch+0.175
 chip+0.166
Neg logit contributions
'd-0.227
 managed-0.200
pped-0.189
 doomed-0.185
 beginning-0.184
Token and
Token ID290
Feature activation-6.68e-02

Pos logit contributions
'd+0.016
 managed+0.014
 doomed+0.014
pped+0.014
 remembered+0.013
Neg logit contributions
igsaw-0.013
ummy-0.013
 disl-0.012
 batch-0.012
 chip-0.011
Token a
Token ID257
Feature activation-6.59e-02

Pos logit contributions
 disl+0.175
igsaw+0.175
 batch+0.164
ummy+0.163
 chip+0.158
Neg logit contributions
'd-0.074
 managed-0.054
 beginning-0.053
 doomed-0.053
pped-0.051
Token million
Token ID1510
Feature activation-3.80e-02

Pos logit contributions
igsaw+0.117
 disl+0.117
ummy+0.105
 batch+0.101
 chip+0.096
Neg logit contributions
'd-0.134
 managed-0.118
 beginning-0.112
 doomed-0.110
pped-0.110
Token tiny
Token ID7009
Feature activation-1.75e-01

Pos logit contributions
igsaw+0.062
 disl+0.061
ummy+0.056
 batch+0.053
 chip+0.049
Neg logit contributions
'd-0.084
 managed-0.073
 beginning-0.070
pped-0.069
 doomed-0.069
Token colorful
Token ID20239
Feature activation-1.68e-01

Pos logit contributions
igsaw+0.285
 disl+0.272
ummy+0.255
 batch+0.242
 chip+0.232
Neg logit contributions
'd-0.387
 managed-0.341
pped-0.328
 never-0.326
 doomed-0.325
Token butterflies
Token ID44571
Feature activation-3.75e-02

Pos logit contributions
igsaw+0.245
 disl+0.238
ummy+0.216
 batch+0.206
 chip+0.193
Neg logit contributions
'd-0.396
 managed-0.356
 doomed-0.340
pped-0.337
 never-0.336
Token flew
Token ID13112
Feature activation+4.89e-02

Pos logit contributions
 disl+0.065
igsaw+0.063
ummy+0.060
 batch+0.059
con+0.057
Neg logit contributions
'd-0.067
 managed-0.059
 doomed-0.057
 beginning-0.055
 forward-0.055
Token out
Token ID503
Feature activation+1.23e-02

Pos logit contributions
'd+0.076
 managed+0.063
pped+0.059
 remembered+0.058
 doomed+0.058
Neg logit contributions
igsaw-0.108
 disl-0.105
ummy-0.100
 batch-0.095
 chip-0.093
Token!
Token ID0
Feature activation-3.23e-02

Pos logit contributions
'd+0.023
 managed+0.020
pped+0.019
 doomed+0.018
 beginning+0.018
Neg logit contributions
igsaw-0.024
 disl-0.023
ummy-0.022
 batch-0.021
 chip-0.020
Token adventures
Token ID17545
Feature activation-8.25e-02

Pos logit contributions
'd+0.338
pped+0.290
 managed+0.275
 beginning+0.267
 doomed+0.267
Neg logit contributions
 disl-0.466
igsaw-0.450
ummy-0.405
 batch-0.401
 chip-0.392
Token.
Token ID13
Feature activation-2.94e-02

Pos logit contributions
igsaw+0.173
 disl+0.171
ummy+0.158
 batch+0.154
 chip+0.147
Neg logit contributions
'd-0.139
 managed-0.120
pped-0.113
 doomed-0.111
 beginning-0.110
Token They
Token ID1119
Feature activation+1.71e-01

Pos logit contributions
 disl+0.059
igsaw+0.059
 batch+0.053
ummy+0.053
 chip+0.050
Neg logit contributions
'd-0.053
 managed-0.043
 finally-0.043
 never-0.042
 beginning-0.042
Token used
Token ID973
Feature activation+3.22e-02

Pos logit contributions
'd+0.185
pped+0.171
 doomed+0.166
 beginning+0.162
 managed+0.156
Neg logit contributions
 disl-0.414
igsaw-0.387
 batch-0.359
ummy-0.351
 chip-0.348
Token their
Token ID511
Feature activation-1.57e-02

Pos logit contributions
'd+0.058
 managed+0.051
 never+0.047
 beginning+0.047
 found+0.047
Neg logit contributions
igsaw-0.065
 disl-0.064
ummy-0.060
 batch-0.057
 chip-0.054
Token m
Token ID285
Feature activation-1.42e-01

Pos logit contributions
igsaw+0.028
 disl+0.027
ummy+0.025
 batch+0.024
 chip+0.023
Neg logit contributions
'd-0.033
 managed-0.029
 beginning-0.027
 doomed-0.027
pped-0.027
Tokenixtures
Token ID25506
Feature activation+4.23e-02

Pos logit contributions
igsaw+0.181
 disl+0.176
 batch+0.138
ummy+0.130
 chip+0.123
Neg logit contributions
'd-0.369
 managed-0.326
 been-0.317
 never-0.317
 beginning-0.316
Token to
Token ID284
Feature activation+8.64e-04

Pos logit contributions
'd+0.067
 managed+0.057
 beginning+0.052
pped+0.052
 doomed+0.052
Neg logit contributions
igsaw-0.094
 disl-0.094
ummy-0.086
 batch-0.084
 chip-0.081
Token make
Token ID787
Feature activation+9.30e-02

Pos logit contributions
'd+0.001
 managed+0.001
pped+0.001
 beginning+0.001
 doomed+0.001
Neg logit contributions
 disl-0.002
igsaw-0.002
ummy-0.002
 batch-0.002
 chip-0.002
Token magic
Token ID5536
Feature activation-1.39e-01

Pos logit contributions
'd+0.141
 managed+0.120
 beginning+0.111
 never+0.110
 doomed+0.109
Neg logit contributions
igsaw-0.214
 disl-0.210
ummy-0.194
 batch-0.188
 chip-0.181
Token and
Token ID290
Feature activation-6.62e-02

Pos logit contributions
igsaw+0.293
 disl+0.291
ummy+0.262
 batch+0.258
 chip+0.245
Neg logit contributions
'd-0.239
 managed-0.203
 beginning-0.192
 never-0.189
 back-0.188
Token has
Token ID468
Feature activation+1.97e-01

Pos logit contributions
pped+0.211
'd+0.209
 forward+0.190
 beginning+0.187
 managed+0.181
Neg logit contributions
 disl-0.367
igsaw-0.331
 batch-0.329
 chip-0.321
ummy-0.305
Token brown
Token ID7586
Feature activation-1.70e-01

Pos logit contributions
'd+0.336
 managed+0.284
pped+0.276
 forward+0.270
 doomed+0.261
Neg logit contributions
 disl-0.406
igsaw-0.383
ummy-0.350
 chip-0.348
 batch-0.340
Token feathers
Token ID27400
Feature activation-3.70e-02

Pos logit contributions
igsaw+0.319
 disl+0.315
ummy+0.284
 batch+0.277
 chip+0.263
Neg logit contributions
'd-0.330
 managed-0.288
 beginning-0.273
pped-0.272
 doomed-0.269
Token and
Token ID290
Feature activation-5.90e-02

Pos logit contributions
 disl+0.067
igsaw+0.065
ummy+0.064
 chip+0.059
 batch+0.058
Neg logit contributions
'd-0.070
 managed-0.062
 doomed-0.060
 beginning-0.057
pped-0.056
Token a
Token ID257
Feature activation-6.96e-02

Pos logit contributions
 disl+0.106
igsaw+0.102
ummy+0.092
 batch+0.090
 chip+0.089
Neg logit contributions
'd-0.119
 managed-0.105
pped-0.100
 beginning-0.100
 doomed-0.099
Token red
Token ID2266
Feature activation-1.84e-01

Pos logit contributions
 disl+0.106
igsaw+0.104
ummy+0.092
 batch+0.090
 chip+0.089
Neg logit contributions
'd-0.155
 managed-0.141
pped-0.136
 doomed-0.132
 beginning-0.132
Token thing
Token ID1517
Feature activation-7.16e-02

Pos logit contributions
 disl+0.324
igsaw+0.317
ummy+0.298
 batch+0.286
 chip+0.281
Neg logit contributions
'd-0.364
 managed-0.330
 doomed-0.310
pped-0.305
 beginning-0.300
Token on
Token ID319
Feature activation-4.06e-02

Pos logit contributions
 disl+0.147
igsaw+0.143
ummy+0.138
 batch+0.131
 chip+0.131
Neg logit contributions
'd-0.118
 managed-0.105
pped-0.098
 doomed-0.094
 beginning-0.094
Token its
Token ID663
Feature activation-1.12e-02

Pos logit contributions
igsaw+0.098
 disl+0.097
ummy+0.088
 batch+0.087
 chip+0.082
Neg logit contributions
'd-0.057
 managed-0.047
 never-0.046
pped-0.045
 beginning-0.045
Token head
Token ID1182
Feature activation+4.44e-02

Pos logit contributions
igsaw+0.025
 disl+0.025
ummy+0.023
 batch+0.023
 chip+0.022
Neg logit contributions
'd-0.017
 managed-0.015
 beginning-0.014
pped-0.013
 doomed-0.013
Token.
Token ID13
Feature activation-2.36e-02

Pos logit contributions
'd+0.084
 managed+0.076
 doomed+0.074
 beginning+0.069
pped+0.069
Neg logit contributions
 disl-0.079
igsaw-0.074
ummy-0.073
 chip-0.068
 batch-0.066
Tokenâ
Token ID22940
Feature activation-2.23e-02

Pos logit contributions
'd+0.266
pped+0.245
 managed+0.233
 beginning+0.226
 doomed+0.221
Neg logit contributions
 disl-0.437
igsaw-0.428
ummy-0.397
 batch-0.388
 chip-0.383
Token
Token ID26391
Feature activation-4.89e-02

Pos logit contributions
igsaw+0.043
 disl+0.042
ummy+0.037
 batch+0.036
 chip+0.035
Neg logit contributions
'd-0.043
 managed-0.038
 doomed-0.036
 beginning-0.035
pped-0.035
Token
Token ID8151
Feature activation+3.84e-03

Pos logit contributions
igsaw+0.092
 disl+0.091
ummy+0.083
 batch+0.081
 chip+0.077
Neg logit contributions
'd-0.095
 managed-0.083
pped-0.077
 never-0.077
 found-0.076
Tokens
Token ID82
Feature activation+2.62e-02

Pos logit contributions
'd+0.007
 managed+0.006
pped+0.006
 never+0.006
 beginning+0.006
Neg logit contributions
 disl-0.008
igsaw-0.007
 batch-0.006
ummy-0.006
 chip-0.006
Token a
Token ID257
Feature activation-6.31e-02

Pos logit contributions
'd+0.039
 managed+0.034
pped+0.033
 doomed+0.031
 beginning+0.030
Neg logit contributions
 disl-0.059
igsaw-0.059
ummy-0.055
 batch-0.053
 chip-0.051
Token special
Token ID2041
Feature activation-1.62e-01

Pos logit contributions
igsaw+0.119
 disl+0.118
ummy+0.105
 batch+0.104
 chip+0.100
Neg logit contributions
'd-0.119
 managed-0.107
pped-0.104
 doomed-0.100
 beginning-0.099
Token meeting
Token ID3249
Feature activation-2.56e-02

Pos logit contributions
igsaw+0.283
 disl+0.280
ummy+0.251
 batch+0.243
 chip+0.232
Neg logit contributions
'd-0.335
 managed-0.296
pped-0.280
 beginning-0.280
 doomed-0.278
Token.
Token ID13
Feature activation-3.60e-02

Pos logit contributions
igsaw+0.050
 disl+0.050
ummy+0.045
 batch+0.044
 chip+0.042
Neg logit contributions
'd-0.048
 managed-0.041
 beginning-0.039
pped-0.039
 doomed-0.039
Token I
Token ID314
Feature activation+2.07e-01

Pos logit contributions
 disl+0.074
igsaw+0.072
ummy+0.066
 batch+0.065
 chip+0.061
Neg logit contributions
'd-0.063
 managed-0.053
 beginning-0.051
 finally-0.050
 never-0.050
Token have
Token ID423
Feature activation+1.86e-01

Pos logit contributions
pped+0.266
 beginning+0.263
 doomed+0.262
'd+0.243
 forward+0.228
Neg logit contributions
 disl-0.475
igsaw-0.432
 batch-0.384
oc-0.380
 common-0.371
Token to
Token ID284
Feature activation+4.85e-03

Pos logit contributions
'd+0.286
 managed+0.242
pped+0.241
 doomed+0.236
 beginning+0.230
Neg logit contributions
 disl-0.402
igsaw-0.391
 batch-0.354
ummy-0.353
 chip-0.335
Token bird
Token ID6512
Feature activation-2.71e-02

Pos logit contributions
 disl+0.188
igsaw+0.178
 chip+0.169
ummy+0.166
 batch+0.159
Neg logit contributions
'd-0.268
 managed-0.242
pped-0.241
 doomed-0.226
 forward-0.223
Token was
Token ID373
Feature activation+1.96e-01

Pos logit contributions
 disl+0.063
igsaw+0.059
 batch+0.058
 chip+0.057
ummy+0.056
Neg logit contributions
'd-0.036
 beginning-0.030
pped-0.029
 forward-0.029
 doomed-0.029
Token happy
Token ID3772
Feature activation-1.03e-01

Pos logit contributions
'd+0.364
pped+0.272
 managed+0.271
 stumbled+0.258
 beginning+0.251
Neg logit contributions
 disl-0.374
 chip-0.342
 batch-0.340
igsaw-0.336
ummy-0.331
Token.
Token ID13
Feature activation-3.32e-02

Pos logit contributions
 disl+0.207
igsaw+0.203
ummy+0.192
 batch+0.183
 chip+0.179
Neg logit contributions
'd-0.186
 managed-0.156
pped-0.153
 beginning-0.147
 doomed-0.146
Token The
Token ID383
Feature activation-1.23e-01

Pos logit contributions
igsaw+0.075
 disl+0.074
 batch+0.068
ummy+0.066
 chip+0.063
Neg logit contributions
'd-0.050
 managed-0.044
 finally-0.042
 never-0.041
 could-0.040
Token wise
Token ID10787
Feature activation-1.67e-01

Pos logit contributions
 disl+0.196
igsaw+0.189
 chip+0.178
ummy+0.175
 batch+0.167
Neg logit contributions
'd-0.260
 managed-0.235
pped-0.232
 doomed-0.218
 stumbled-0.214
Token owl
Token ID39610
Feature activation+6.05e-03

Pos logit contributions
igsaw+0.285
 disl+0.284
ummy+0.256
 batch+0.244
 chip+0.238
Neg logit contributions
'd-0.351
 managed-0.311
pped-0.297
 beginning-0.293
 doomed-0.292
Token helped
Token ID4193
Feature activation-1.55e-02

Pos logit contributions
'd+0.009
 beginning+0.008
 forward+0.007
pped+0.007
 doomed+0.007
Neg logit contributions
 disl-0.013
igsaw-0.012
 batch-0.012
 chip-0.012
ummy-0.012
Token the
Token ID262
Feature activation-1.09e-01

Pos logit contributions
igsaw+0.035
 disl+0.035
ummy+0.031
 batch+0.031
 chip+0.029
Neg logit contributions
'd-0.025
 managed-0.021
 never-0.020
 finally-0.020
 found-0.020
Token bird
Token ID6512
Feature activation-2.71e-02

Pos logit contributions
 disl+0.177
igsaw+0.171
ummy+0.157
 chip+0.154
 batch+0.151
Neg logit contributions
'd-0.237
 managed-0.211
pped-0.208
 doomed-0.198
 beginning-0.196
Token fix
Token ID4259
Feature activation-2.73e-02

Pos logit contributions
 disl+0.059
igsaw+0.057
ummy+0.053
 batch+0.053
 chip+0.052
Neg logit contributions
'd-0.043
pped-0.036
 managed-0.035
 doomed-0.034
 beginning-0.034