TOP ACTIVATIONS
MAX = 17.173

. She loved to have pic n ics with her family
a family who really enjoyed pic n ics . They loved
fun . Lily realized that pic n ics were not so
family were ready to go pic n icking again . But
even see other people having pic n ics .
favourite place to go for pic n ics . <|endoftext|> Once
They missed their walks and pic n ics near the cliff
see the bird and have pic n ics . <|endoftext|> Once
liked to play and have pic n ics . One day
They decided to make more pic n ics and looked in
and it was going on pic n ics !
saw lots of people having pic n ics and walking their
be there to support her pic n ics -- no matter
and nodded . He loved pic n ics and couldn 't
so excited ! She loved pic n ics and spending time
Tim my loved going on pic n ics with his family
to be careful when having pic n ics and not to
children running around , families pic n icking , couples st
girl named Lily who loved pic n ics . She would
in the garden and have pic n ics together .

MIN ACTIVATIONS
MIN = -16.032

felt safe knowing that her guardian angel was there to watch
" I 'm the guardian of the valley ," said
thought he might be a guardian of the b erry bush
could , but eventually the guardian caught up with him and
The little girl thanked her guardian angel , and knew that
I am the guardian of the hidden forest .
again . The guardian became so angry that it
cute and belonged to a wizard . Every day the wizard
angel was your mom 's guardian angel . She was watching
that Aunt was really her guardian angel and she 'd always
of the underground . The guardian warned the man never to
wise man had shot a guardian of the underground . The
tail . It was a guardian of the cave .
from behind her : A guardian of the me adow !
The next morning the poor wizard decided to trust the voice
caught a glimpse of her guardian angel who was much bigger
" I 'm the guardian of the apples ! It
James . I 'm the guardian of this spring . I
than anything to have her guardian angel nearby to make her
One day , a grand wizard came to the village .

INTERVAL 8.872 - 17.173
CONTAINS 0.537%

around and saw many matt resses . Some were soft ,
Lily and her brother played together every day and enjoyed their
sure it was sturdy . Then , she pretended to sail
knew he wouldn â t get it back again
they both decided to measure together . They took turns and

INTERVAL 0.571 - 8.872
CONTAINS 36.190%

race to the slide !" Max says . He is competitive
Oh , I see . I am sorry . I did
a big gun and wanted to catch some animals .
day , Lily played a game with her friends . She
Ben says , " OK , Lily . I

INTERVAL -7.731 - 0.571
CONTAINS 62.086%

" Thank you , Mouse !" Bob said with a big
." The man smiled back and said " It
smiled and felt proud of herself for being such a good
pad . Sure enough , he saw a pale brush .
horse . T oby asked the horse if he

INTERVAL -16.032 - -7.731
CONTAINS 1.187%

went on his way to find something else to eat .
All of these perf umes smell so good . Take your
The moral of the story is that it 's important
more careful . He didn 't want to wreck his toys
my gasped . He couldn 't believe his ears . He
Token.
Token ID13
Feature activation+1.05e-01

Pos logit contributions
ashed+0.248
 tanks+0.246
ksh+0.244
 sizes+0.240
 rods+0.227
Neg logit contributions
 guardian-0.328
 wizard-0.304
 note-0.298
 fairy-0.295
ibrarian-0.291
Token She
Token ID1375
Feature activation-4.19e+00

Pos logit contributions
 rods+0.010
ksh+0.009
 tanks+0.009
ashed+0.009
 topp+0.009
Neg logit contributions
he-0.003
 fairy-0.003
 answer-0.003
 mirror-0.003
 guardian-0.003
Token loved
Token ID6151
Feature activation+9.37e-02

Pos logit contributions
 guardian+0.288
he+0.268
 answer+0.267
 wizard+0.264
 fairy+0.263
Neg logit contributions
ashed-0.228
ksh-0.216
 tanks-0.214
 rods-0.208
 pic-0.195
Token to
Token ID284
Feature activation+2.54e+00

Pos logit contributions
ksh+0.007
ashed+0.007
suits+0.006
 rods+0.006
 tanks+0.006
Neg logit contributions
 owl-0.005
 mirror-0.004
 answer-0.004
 fairy-0.004
 guardian-0.004
Token have
Token ID423
Feature activation-4.46e+00

Pos logit contributions
ashed+0.158
ksh+0.157
 rods+0.157
 tanks+0.156
 mouths+0.146
Neg logit contributions
 guardian-0.147
he-0.138
 fairy-0.135
 wizard-0.133
 note-0.132
Token pic
Token ID8301
Feature activation+1.72e+01

Pos logit contributions
 mirror+0.283
 owl+0.280
 veil+0.279
 fairy+0.277
he+0.276
Neg logit contributions
ksh-0.246
ashed-0.244
 rods-0.228
 tanks-0.221
 mouths-0.214
Tokenn
Token ID77
Feature activation+1.20e+00

Pos logit contributions
 rods+1.022
ashed+1.020
ksh+1.017
 tanks+1.001
 pic+0.943
Neg logit contributions
 guardian-1.031
 fairy-0.975
 figure-0.960
he-0.958
 wizard-0.955
Tokenics
Token ID873
Feature activation-2.81e-01

Pos logit contributions
ashed+0.057
 rods+0.054
 tanks+0.052
ksh+0.051
 pic+0.049
Neg logit contributions
 guardian-0.090
 fairy-0.086
 elf-0.084
 wizard-0.084
 mirror-0.082
Token with
Token ID351
Feature activation-2.75e+00

Pos logit contributions
 guardian+0.014
 figure+0.013
 fairy+0.013
he+0.013
 wizard+0.013
Neg logit contributions
ashed-0.020
ksh-0.020
 rods-0.020
 tanks-0.019
 pic-0.018
Token her
Token ID607
Feature activation-4.25e+00

Pos logit contributions
 answer+0.049
 fairy+0.049
 owl+0.046
 note+0.044
he+0.043
Neg logit contributions
ksh-0.287
ashed-0.285
 rods-0.271
 tanks-0.267
suits-0.266
Token family
Token ID1641
Feature activation-6.25e-01

Pos logit contributions
 fairy+0.176
 guardian+0.170
 owl+0.167
 note+0.164
 angel+0.158
Neg logit contributions
ksh-0.357
ashed-0.346
 rods-0.328
 tanks-0.322
 sizes-0.311
Token a
Token ID257
Feature activation-1.57e-01

Pos logit contributions
ksh+0.047
ashed+0.046
 rods+0.045
 tanks+0.045
 mouths+0.043
Neg logit contributions
 guardian-0.023
he-0.022
 fairy-0.022
 answer-0.021
 figure-0.021
Token family
Token ID1641
Feature activation-7.82e-01

Pos logit contributions
 guardian+0.008
he+0.007
 fairy+0.007
 answer+0.007
 figure+0.007
Neg logit contributions
ksh-0.011
ashed-0.011
 rods-0.011
 tanks-0.011
 mouths-0.010
Token who
Token ID508
Feature activation-9.55e-01

Pos logit contributions
 guardian+0.040
he+0.037
 fairy+0.036
 answer+0.036
 note+0.036
Neg logit contributions
 tanks-0.054
ashed-0.052
 rods-0.052
ksh-0.051
 mouths-0.048
Token really
Token ID1107
Feature activation-1.36e+00

Pos logit contributions
 guardian+0.050
he+0.050
 figure+0.050
 fairy+0.049
 answer+0.049
Neg logit contributions
ksh-0.065
 rods-0.064
ashed-0.062
 tanks-0.060
 mouths-0.058
Token enjoyed
Token ID8359
Feature activation+5.48e+00

Pos logit contributions
 guardian+0.073
he+0.072
 figure+0.072
 answer+0.071
 fairy+0.070
Neg logit contributions
ksh-0.091
 rods-0.089
 tanks-0.087
ashed-0.086
 mouths-0.082
Token pic
Token ID8301
Feature activation+1.71e+01

Pos logit contributions
ksh+0.440
ashed+0.438
suits+0.394
 sizes+0.383
 rods+0.376
Neg logit contributions
 owl-0.239
 note-0.231
 answer-0.231
 figure-0.229
 fairy-0.227
Tokenn
Token ID77
Feature activation+1.19e+00

Pos logit contributions
ksh+1.020
 rods+1.011
ashed+1.011
 tanks+0.996
 pic+0.931
Neg logit contributions
 guardian-1.023
 fairy-0.967
 figure-0.953
he-0.944
 wizard-0.942
Tokenics
Token ID873
Feature activation-4.05e-01

Pos logit contributions
ashed+0.067
 rods+0.066
ksh+0.066
 tanks+0.065
 sizes+0.062
Neg logit contributions
 guardian-0.078
 fairy-0.075
 wizard-0.072
 mirror-0.071
 owl-0.070
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
he+0.014
 guardian+0.014
 fairy+0.013
 figure+0.013
 answer+0.013
Neg logit contributions
ksh-0.035
ashed-0.035
 rods-0.033
 tanks-0.033
 mouths-0.032
Token They
Token ID1119
Feature activation+7.63e+00

Pos logit contributions
ashed+0.014
ksh+0.013
 rods+0.012
 appet+0.011
 topp+0.011
Neg logit contributions
 answer-0.017
he-0.017
 figure-0.016
 voice-0.016
 mirror-0.016
Token loved
Token ID6151
Feature activation+1.06e-01

Pos logit contributions
ksh+0.556
 rods+0.526
ashed+0.502
 mouths+0.493
 tanks+0.489
Neg logit contributions
he-0.395
 answer-0.380
 fairy-0.380
 guardian-0.377
 figure-0.376
Token fun
Token ID1257
Feature activation+8.62e+00

Pos logit contributions
ksh+0.172
 tanks+0.153
 rods+0.150
ashed+0.150
 mouths+0.141
Neg logit contributions
he-0.214
 answer-0.213
 note-0.204
 fairy-0.204
 mirror-0.195
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
ksh+0.820
 tanks+0.764
ashed+0.764
 rods+0.752
suits+0.740
Neg logit contributions
he-0.254
 figure-0.230
 note-0.219
 fairy-0.218
 answer-0.210
Token Lily
Token ID20037
Feature activation+4.86e+00

Pos logit contributions
 rods+0.021
suits+0.019
 topp+0.019
girls+0.018
 tanks+0.018
Neg logit contributions
he-0.014
 mirror-0.014
 fairy-0.014
 answer-0.013
 voice-0.013
Token realized
Token ID6939
Feature activation-2.03e+00

Pos logit contributions
ksh+0.349
ashed+0.342
 rods+0.338
 tanks+0.335
 mouths+0.315
Neg logit contributions
 guardian-0.239
 fairy-0.226
he-0.226
 figure-0.226
 answer-0.221
Token that
Token ID326
Feature activation-5.55e+00

Pos logit contributions
 guardian+0.090
he+0.084
 figure+0.083
 fairy+0.083
 answer+0.082
Neg logit contributions
ksh-0.154
ashed-0.154
 rods-0.152
 tanks-0.152
 mouths-0.143
Token pic
Token ID8301
Feature activation+1.71e+01

Pos logit contributions
 guardian+0.298
 fairy+0.288
 figure+0.285
he+0.284
 answer+0.283
Neg logit contributions
ksh-0.377
ashed-0.372
 rods-0.361
 tanks-0.354
suits-0.340
Tokenn
Token ID77
Feature activation+1.27e+00

Pos logit contributions
ksh+1.167
ashed+1.153
 rods+1.132
 tanks+1.128
 mouths+1.057
Neg logit contributions
 guardian-0.899
he-0.861
 fairy-0.849
 figure-0.845
 answer-0.839
Tokenics
Token ID873
Feature activation-2.40e-01

Pos logit contributions
ashed+0.106
 rods+0.096
ksh+0.096
 tanks+0.095
suits+0.091
Neg logit contributions
 guardian-0.054
 wizard-0.051
 owl-0.049
 fairy-0.049
 figure-0.049
Token were
Token ID547
Feature activation+4.53e+00

Pos logit contributions
 guardian+0.011
 fairy+0.011
he+0.011
 figure+0.011
 answer+0.011
Neg logit contributions
ksh-0.018
ashed-0.017
 rods-0.017
 tanks-0.017
 mouths-0.016
Token not
Token ID407
Feature activation-5.46e+00

Pos logit contributions
ksh+0.361
ashed+0.349
 rods+0.343
suits+0.331
 tanks+0.331
Neg logit contributions
 guardian-0.206
 fairy-0.198
 answer-0.197
 owl-0.190
 note-0.184
Token so
Token ID523
Feature activation+3.75e+00

Pos logit contributions
 answer+0.245
 owl+0.235
 fairy+0.232
 guardian+0.217
 angel+0.212
Neg logit contributions
ksh-0.433
 rods-0.424
ashed-0.415
suits-0.406
 mouths-0.390
Token family
Token ID1641
Feature activation-6.81e-01

Pos logit contributions
 guardian+0.230
 fairy+0.218
 figure+0.214
he+0.214
 answer+0.212
Neg logit contributions
ksh-0.327
ashed-0.321
 rods-0.314
 tanks-0.312
 mouths-0.292
Token were
Token ID547
Feature activation+4.45e+00

Pos logit contributions
 guardian+0.043
he+0.042
 fairy+0.041
 figure+0.041
 answer+0.041
Neg logit contributions
ksh-0.040
ashed-0.038
 rods-0.038
 tanks-0.038
 mouths-0.035
Token ready
Token ID3492
Feature activation+6.06e+00

Pos logit contributions
ashed+0.296
ksh+0.289
 tanks+0.285
 rods+0.283
 pic+0.268
Neg logit contributions
 guardian-0.238
 figure-0.227
 fairy-0.226
 answer-0.223
 wizard-0.221
Token to
Token ID284
Feature activation+2.75e+00

Pos logit contributions
ksh+0.393
ashed+0.386
 tanks+0.377
 rods+0.377
 mouths+0.352
Neg logit contributions
 guardian-0.344
 answer-0.330
he-0.329
 figure-0.327
 fairy-0.327
Token go
Token ID467
Feature activation+4.30e+00

Pos logit contributions
 rods+0.164
 tanks+0.163
 pic+0.162
ksh+0.158
ashed+0.157
Neg logit contributions
 guardian-0.158
 fairy-0.145
he-0.145
 wizard-0.145
 owl-0.143
Token pic
Token ID8301
Feature activation+1.71e+01

Pos logit contributions
ashed+0.284
 tanks+0.281
 rods+0.280
ksh+0.276
 pic+0.264
Neg logit contributions
 guardian-0.236
 fairy-0.217
 figure-0.215
 wizard-0.213
 note-0.213
Tokenn
Token ID77
Feature activation+1.12e+00

Pos logit contributions
ksh+1.036
 rods+1.031
ashed+1.031
 tanks+1.013
 pic+0.949
Neg logit contributions
 guardian-1.000
 fairy-0.949
he-0.936
 figure-0.931
 wizard-0.927
Tokenicking
Token ID7958
Feature activation+2.87e+00

Pos logit contributions
ashed+0.070
ksh+0.069
 rods+0.068
 tanks+0.068
 pic+0.064
Neg logit contributions
 guardian-0.066
 fairy-0.063
 wizard-0.061
 figure-0.061
 mirror-0.060
Token again
Token ID757
Feature activation+3.34e+00

Pos logit contributions
ksh+0.232
ashed+0.231
 rods+0.222
 tanks+0.220
 mouths+0.210
Neg logit contributions
 guardian-0.111
he-0.111
 answer-0.110
 fairy-0.110
 figure-0.108
Token.
Token ID13
Feature activation+4.59e-01

Pos logit contributions
ashed+0.280
ksh+0.278
 tanks+0.267
 rods+0.266
 mouths+0.256
Neg logit contributions
 guardian-0.131
 figure-0.126
 fairy-0.123
 answer-0.123
he-0.120
Token But
Token ID887
Feature activation+1.33e+00

Pos logit contributions
 rods+0.027
ashed+0.025
 tanks+0.025
ksh+0.024
 topp+0.024
Neg logit contributions
he-0.028
 guardian-0.027
 voice-0.026
 answer-0.026
 fairy-0.025
Token even
Token ID772
Feature activation-6.31e-01

Pos logit contributions
ashed+0.498
 pic+0.490
 rods+0.486
 mouths+0.480
 tails+0.471
Neg logit contributions
 guardian-0.423
 wizard-0.411
 fairy-0.410
 note-0.399
mother-0.387
Token see
Token ID766
Feature activation-6.66e+00

Pos logit contributions
 guardian+0.030
 fairy+0.027
he+0.027
 wizard+0.026
 figure+0.026
Neg logit contributions
ksh-0.046
ashed-0.046
 rods-0.045
 tanks-0.045
 mouths-0.043
Token other
Token ID584
Feature activation+1.73e+00

Pos logit contributions
 guardian+0.309
 fairy+0.298
he+0.294
 answer+0.291
 figure+0.290
Neg logit contributions
ksh-0.510
ashed-0.504
 rods-0.476
 tanks-0.474
suits-0.448
Token people
Token ID661
Feature activation-3.54e-01

Pos logit contributions
ksh+0.118
ashed+0.117
 rods+0.110
 tanks+0.110
suits+0.101
Neg logit contributions
 guardian-0.094
 answer-0.092
he-0.091
 fairy-0.090
 figure-0.090
Token having
Token ID1719
Feature activation+2.41e+00

Pos logit contributions
 guardian+0.018
 fairy+0.017
he+0.017
 wizard+0.017
 figure+0.017
Neg logit contributions
 tanks-0.023
ksh-0.023
ashed-0.023
 rods-0.023
 sizes-0.022
Token pic
Token ID8301
Feature activation+1.70e+01

Pos logit contributions
ksh+0.145
ashed+0.142
 tanks+0.129
 rods+0.127
suits+0.119
Neg logit contributions
 answer-0.151
he-0.148
 guardian-0.147
 fairy-0.145
 note-0.145
Tokenn
Token ID77
Feature activation+1.00e+00

Pos logit contributions
ksh+1.188
ashed+1.167
 tanks+1.146
 rods+1.142
 mouths+1.079
Neg logit contributions
 guardian-0.873
he-0.845
 answer-0.828
 figure-0.825
 fairy-0.821
Tokenics
Token ID873
Feature activation-5.06e-01

Pos logit contributions
ashed+0.063
ksh+0.062
 rods+0.061
 tanks+0.059
 sizes+0.056
Neg logit contributions
 guardian-0.059
 fairy-0.057
 wizard-0.055
 figure-0.055
 owl-0.054
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
 guardian+0.019
he+0.019
 fairy+0.018
 answer+0.018
 figure+0.018
Neg logit contributions
ksh-0.043
ashed-0.042
 rods-0.040
 tanks-0.040
 mouths-0.040
Token
Token ID198
Feature activation-1.48e+00

Pos logit contributions
ksh+0.011
 tanks+0.011
ashed+0.011
 rods+0.011
 trucks+0.011
Neg logit contributions
he-0.012
 figure-0.011
 answer-0.011
 guardian-0.011
 fairy-0.011
Token
Token ID198
Feature activation-1.52e+00

Pos logit contributions
 guardian+0.093
 angel+0.088
 owl+0.086
 fairy+0.083
 wizard+0.082
Neg logit contributions
 rods-0.093
ashed-0.090
 tanks-0.088
 pic-0.085
ksh-0.082
Token favourite
Token ID12507
Feature activation+2.22e+00

Pos logit contributions
ksh+0.477
ashed+0.467
 rods+0.448
 tanks+0.444
suits+0.412
Neg logit contributions
 guardian-0.352
 fairy-0.329
 answer-0.326
 figure-0.322
 note-0.315
Token place
Token ID1295
Feature activation+7.01e-01

Pos logit contributions
ksh+0.164
ashed+0.160
 rods+0.154
 tanks+0.151
 mouths+0.143
Neg logit contributions
 guardian-0.113
 fairy-0.111
 answer-0.109
 figure-0.107
 note-0.105
Token to
Token ID284
Feature activation+2.71e+00

Pos logit contributions
ksh+0.053
ashed+0.052
 rods+0.051
 tanks+0.051
 mouths+0.048
Neg logit contributions
 guardian-0.033
he-0.031
 fairy-0.031
 figure-0.030
 answer-0.030
Token go
Token ID467
Feature activation+4.29e+00

Pos logit contributions
ksh+0.177
 tanks+0.175
 rods+0.175
ashed+0.175
 pic+0.167
Neg logit contributions
 guardian-0.142
he-0.131
 fairy-0.130
 note-0.129
 figure-0.129
Token for
Token ID329
Feature activation-3.82e-01

Pos logit contributions
ksh+0.321
ashed+0.318
 rods+0.312
 tanks+0.310
 mouths+0.293
Neg logit contributions
 guardian-0.198
he-0.187
 fairy-0.186
 figure-0.185
 answer-0.182
Token pic
Token ID8301
Feature activation+1.70e+01

Pos logit contributions
 fairy+0.017
 guardian+0.017
 answer+0.017
he+0.017
 figure+0.017
Neg logit contributions
ashed-0.029
ksh-0.029
 rods-0.027
 tanks-0.027
suits-0.026
Tokenn
Token ID77
Feature activation+1.15e+00

Pos logit contributions
ksh+1.090
ashed+1.080
 rods+1.060
 tanks+1.050
 mouths+0.974
Neg logit contributions
 guardian-0.963
 fairy-0.910
he-0.906
 figure-0.904
 answer-0.889
Tokenics
Token ID873
Feature activation-4.64e-01

Pos logit contributions
ashed+0.077
ksh+0.076
 tanks+0.074
 rods+0.074
 pic+0.070
Neg logit contributions
 guardian-0.062
 fairy-0.059
 wizard-0.058
 figure-0.057
 mirror-0.056
Token.
Token ID13
Feature activation+3.05e-01

Pos logit contributions
 guardian+0.019
he+0.018
 fairy+0.018
 figure+0.017
 answer+0.017
Neg logit contributions
ksh-0.038
ashed-0.037
 rods-0.037
 tanks-0.036
 mouths-0.035
Token<|endoftext|>
Token ID50256
Feature activation-4.87e+00

Pos logit contributions
ashed+0.026
ksh+0.026
 rods+0.025
 tanks+0.025
 tails+0.024
Neg logit contributions
he-0.011
 guardian-0.010
 fairy-0.009
 figure-0.009
 answer-0.009
TokenOnce
Token ID7454
Feature activation-7.76e-01

Pos logit contributions
 guardian+0.239
 figure+0.221
 fairy+0.220
 answer+0.217
he+0.216
Neg logit contributions
ksh-0.350
ashed-0.347
 rods-0.342
 tanks-0.340
 mouths-0.317
Token They
Token ID1119
Feature activation+7.35e+00

Pos logit contributions
ashed+0.009
ksh+0.009
 rods+0.009
 tanks+0.008
suits+0.008
Neg logit contributions
 guardian-0.006
he-0.006
 fairy-0.006
 figure-0.006
 answer-0.006
Token missed
Token ID6825
Feature activation-6.51e-01

Pos logit contributions
ashed+0.455
ksh+0.442
 tanks+0.437
 rods+0.430
 mouths+0.407
Neg logit contributions
 guardian-0.425
he-0.402
 figure-0.397
 fairy-0.397
 wizard-0.396
Token their
Token ID511
Feature activation+6.36e+00

Pos logit contributions
 owl+0.031
 answer+0.031
 guardian+0.031
 figure+0.031
 fairy+0.031
Neg logit contributions
ksh-0.050
ashed-0.049
 rods-0.045
suits-0.044
 tanks-0.044
Token walks
Token ID11114
Feature activation+3.88e+00

Pos logit contributions
ksh+0.474
ashed+0.459
 rods+0.446
 tanks+0.443
 sizes+0.413
Neg logit contributions
 guardian-0.326
 fairy-0.305
 figure-0.304
 answer-0.300
 note-0.293
Token and
Token ID290
Feature activation-5.40e-01

Pos logit contributions
ksh+0.309
 rods+0.291
ashed+0.287
 tanks+0.285
 mouths+0.282
Neg logit contributions
 guardian-0.175
he-0.174
 fairy-0.165
 figure-0.162
 note-0.156
Token pic
Token ID8301
Feature activation+1.70e+01

Pos logit contributions
 guardian+0.029
 fairy+0.029
 figure+0.029
 owl+0.028
 answer+0.028
Neg logit contributions
ksh-0.039
 rods-0.036
ashed-0.035
 tanks-0.034
suits-0.034
Tokenn
Token ID77
Feature activation+1.04e+00

Pos logit contributions
ksh+1.177
ashed+1.156
 rods+1.139
 tanks+1.138
 mouths+1.070
Neg logit contributions
 guardian-0.891
he-0.858
 fairy-0.839
 figure-0.838
 answer-0.835
Tokenics
Token ID873
Feature activation-5.04e-01

Pos logit contributions
ashed+0.066
ksh+0.062
 rods+0.062
 tanks+0.061
 mouths+0.057
Neg logit contributions
 guardian-0.061
 wizard-0.058
 fairy-0.058
 figure-0.058
 answer-0.057
Token near
Token ID1474
Feature activation-5.94e-01

Pos logit contributions
 guardian+0.026
he+0.026
 fairy+0.026
 figure+0.025
 note+0.025
Neg logit contributions
ksh-0.037
 rods-0.034
 tanks-0.034
ashed-0.033
 mouths-0.032
Token the
Token ID262
Feature activation+8.13e-01

Pos logit contributions
 guardian+0.027
 figure+0.026
 fairy+0.026
 answer+0.026
he+0.026
Neg logit contributions
ksh-0.046
ashed-0.046
 rods-0.043
 tanks-0.043
suits-0.041
Token cliff
Token ID19516
Feature activation-2.53e+00

Pos logit contributions
 tanks+0.049
 rods+0.049
ksh+0.046
ashed+0.045
 pic+0.045
Neg logit contributions
 guardian-0.045
he-0.044
ibrarian-0.044
 note-0.042
 figure-0.042
Token see
Token ID766
Feature activation-6.67e+00

Pos logit contributions
ksh+0.182
ashed+0.180
 rods+0.179
 tanks+0.178
 pic+0.169
Neg logit contributions
 guardian-0.140
he-0.132
 fairy-0.131
 figure-0.129
 note-0.129
Token the
Token ID262
Feature activation+9.53e-01

Pos logit contributions
 guardian+0.312
 fairy+0.303
 answer+0.298
 owl+0.296
he+0.291
Neg logit contributions
ashed-0.513
ksh-0.512
 rods-0.487
 tanks-0.477
suits-0.465
Token bird
Token ID6512
Feature activation-5.57e+00

Pos logit contributions
ksh+0.065
 rods+0.064
 tanks+0.063
 pic+0.062
ashed+0.062
Neg logit contributions
ibrarian-0.045
 guardian-0.045
he-0.045
 figure-0.043
 wizard-0.042
Token and
Token ID290
Feature activation-4.19e-01

Pos logit contributions
 guardian+0.294
he+0.275
 figure+0.272
 wizard+0.269
 answer+0.267
Neg logit contributions
ksh-0.374
ashed-0.368
 tanks-0.366
 rods-0.355
 mouths-0.346
Token have
Token ID423
Feature activation-4.60e+00

Pos logit contributions
 guardian+0.023
 fairy+0.022
 figure+0.022
 owl+0.022
 answer+0.021
Neg logit contributions
ksh-0.030
ashed-0.028
 rods-0.028
 tanks-0.027
suits-0.026
Token pic
Token ID8301
Feature activation+1.70e+01

Pos logit contributions
 fairy+0.207
 owl+0.204
 guardian+0.203
 note+0.198
 answer+0.198
Neg logit contributions
ksh-0.343
ashed-0.317
 tanks-0.312
suits-0.306
 rods-0.299
Tokenn
Token ID77
Feature activation+1.08e+00

Pos logit contributions
ksh+1.103
ashed+1.095
 rods+1.073
 tanks+1.062
 mouths+0.993
Neg logit contributions
 guardian-0.948
he-0.894
 fairy-0.894
 figure-0.891
 answer-0.874
Tokenics
Token ID873
Feature activation-4.64e-01

Pos logit contributions
ashed+0.066
ksh+0.062
 rods+0.061
 tanks+0.060
 tails+0.056
Neg logit contributions
 guardian-0.066
 fairy-0.062
 wizard-0.062
 figure-0.062
 answer-0.060
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
he+0.025
 guardian+0.025
 fairy+0.024
 answer+0.023
 note+0.023
Neg logit contributions
ksh-0.032
 rods-0.031
ashed-0.030
 tanks-0.030
 mouths-0.029
Token<|endoftext|>
Token ID50256
Feature activation-4.96e+00

Pos logit contributions
 rods+0.011
ksh+0.011
ashed+0.011
 tanks+0.011
suits+0.010
Neg logit contributions
he-0.004
 guardian-0.004
 fairy-0.004
 figure-0.004
 answer-0.004
TokenOnce
Token ID7454
Feature activation-8.27e-01

Pos logit contributions
 guardian+0.266
 figure+0.240
 fairy+0.240
 wizard+0.238
 owl+0.238
Neg logit contributions
ksh-0.335
 rods-0.333
ashed-0.332
 tanks-0.331
 pic-0.314
Token liked
Token ID8288
Feature activation+2.02e+00

Pos logit contributions
ksh+0.488
 rods+0.470
 tanks+0.443
 pic+0.436
ashed+0.428
Neg logit contributions
he-0.435
 answer-0.432
 figure-0.432
 fairy-0.428
 guardian-0.426
Token to
Token ID284
Feature activation+2.34e+00

Pos logit contributions
ksh+0.192
ashed+0.179
suits+0.179
 appet+0.167
 sizes+0.162
Neg logit contributions
 owl-0.069
 answer-0.063
 crying-0.056
 figure-0.056
 witch-0.052
Token play
Token ID711
Feature activation+3.37e+00

Pos logit contributions
ksh+0.166
ashed+0.160
 rods+0.155
 tanks+0.154
 mouths+0.145
Neg logit contributions
 guardian-0.121
 figure-0.120
 answer-0.119
 fairy-0.117
he-0.115
Token and
Token ID290
Feature activation-5.18e-01

Pos logit contributions
ksh+0.195
ashed+0.175
 sizes+0.171
 mouths+0.167
 tanks+0.161
Neg logit contributions
 figure-0.231
he-0.230
 fairy-0.223
 answer-0.216
 owl-0.215
Token have
Token ID423
Feature activation-4.70e+00

Pos logit contributions
 figure+0.020
 answer+0.019
 guardian+0.019
 fairy+0.018
 owl+0.018
Neg logit contributions
ksh-0.044
ashed-0.043
 rods-0.041
 tanks-0.040
suits-0.039
Token pic
Token ID8301
Feature activation+1.70e+01

Pos logit contributions
 answer+0.229
 owl+0.211
 note+0.209
 figure+0.206
 fairy+0.205
Neg logit contributions
ksh-0.343
ashed-0.315
 mouths-0.286
iked-0.283
 tanks-0.279
Tokenn
Token ID77
Feature activation+1.03e+00

Pos logit contributions
ksh+1.120
ashed+1.109
 rods+1.088
 tanks+1.080
 mouths+1.011
Neg logit contributions
 guardian-0.930
he-0.882
 fairy-0.878
 figure-0.874
 answer-0.864
Tokenics
Token ID873
Feature activation-5.39e-01

Pos logit contributions
ashed+0.066
ksh+0.064
 rods+0.064
 tanks+0.063
 sizes+0.059
Neg logit contributions
 guardian-0.060
 fairy-0.057
 wizard-0.056
 figure-0.055
 answer-0.054
Token.
Token ID13
Feature activation-7.18e-02

Pos logit contributions
he+0.028
 note+0.026
 answer+0.026
 guardian+0.026
 fairy+0.026
Neg logit contributions
ksh-0.040
ashed-0.037
 tanks-0.036
 mouths-0.036
 rods-0.036
Token One
Token ID1881
Feature activation+2.20e+00

Pos logit contributions
he+0.003
 answer+0.003
 figure+0.003
 mirror+0.003
 fairy+0.003
Neg logit contributions
ksh-0.005
ashed-0.005
 tanks-0.005
 rods-0.005
suits-0.004
Token day
Token ID1110
Feature activation+2.69e+00

Pos logit contributions
ksh+0.141
ashed+0.136
 rods+0.134
 tanks+0.133
 mouths+0.125
Neg logit contributions
 guardian-0.128
he-0.124
 fairy-0.122
 figure-0.122
 answer-0.122
TokenThey
Token ID2990
Feature activation+6.20e+00

Pos logit contributions
 guardian+0.080
 fairy+0.076
 answer+0.075
 figure+0.075
he+0.074
Neg logit contributions
ksh-0.082
ashed-0.082
 rods-0.081
 tanks-0.080
 pic-0.074
Token decided
Token ID3066
Feature activation+5.61e+00

Pos logit contributions
ksh+0.364
ashed+0.357
 rods+0.350
 tanks+0.347
 mouths+0.322
Neg logit contributions
 guardian-0.387
he-0.371
 fairy-0.369
 figure-0.368
 answer-0.364
Token to
Token ID284
Feature activation+2.56e+00

Pos logit contributions
ksh+0.409
ashed+0.383
 tanks+0.372
 rods+0.372
suits+0.353
Neg logit contributions
 guardian-0.293
 answer-0.283
 figure-0.283
 note-0.281
 fairy-0.277
Token make
Token ID787
Feature activation-2.18e+00

Pos logit contributions
 rods+0.168
ashed+0.168
ksh+0.167
 tanks+0.167
 mouths+0.157
Neg logit contributions
 guardian-0.133
he-0.125
 fairy-0.123
 wizard-0.122
 owl-0.121
Token more
Token ID517
Feature activation+3.91e+00

Pos logit contributions
 guardian+0.108
 fairy+0.105
 figure+0.105
he+0.104
 answer+0.104
Neg logit contributions
ashed-0.154
ksh-0.154
 tanks-0.146
 rods-0.145
suits-0.139
Token pic
Token ID8301
Feature activation+1.70e+01

Pos logit contributions
ksh+0.221
ashed+0.214
 rods+0.194
 tanks+0.194
suits+0.179
Neg logit contributions
 guardian-0.260
 answer-0.258
 fairy-0.254
 figure-0.252
 note-0.251
Tokenn
Token ID77
Feature activation+1.04e+00

Pos logit contributions
ksh+1.090
ashed+1.085
 rods+1.072
 tanks+1.056
 mouths+0.980
Neg logit contributions
 guardian-0.951
 fairy-0.895
he-0.893
 figure-0.887
 wizard-0.873
Tokenics
Token ID873
Feature activation-5.80e-01

Pos logit contributions
ashed+0.064
 rods+0.064
ksh+0.063
 tanks+0.062
 sizes+0.058
Neg logit contributions
 guardian-0.061
 fairy-0.058
 wizard-0.056
 mirror-0.056
 figure-0.056
Token and
Token ID290
Feature activation-3.48e-01

Pos logit contributions
 guardian+0.032
he+0.030
 fairy+0.030
 figure+0.030
 answer+0.030
Neg logit contributions
ksh-0.039
ashed-0.038
 rods-0.037
 tanks-0.037
 mouths-0.035
Token looked
Token ID3114
Feature activation-1.12e+00

Pos logit contributions
 figure+0.017
 fairy+0.017
 guardian+0.017
 answer+0.017
 note+0.016
Neg logit contributions
ksh-0.027
ashed-0.025
 rods-0.024
 tanks-0.024
suits-0.023
Token in
Token ID287
Feature activation-6.16e-01

Pos logit contributions
 answer+0.047
 guardian+0.047
 fairy+0.045
he+0.044
 wizard+0.042
Neg logit contributions
ksh-0.088
 rods-0.085
ashed-0.083
 tanks-0.081
girls-0.079
Token and
Token ID290
Feature activation-3.85e-01

Pos logit contributions
ksh+0.821
ashed+0.799
 rods+0.779
 tanks+0.773
 mouths+0.732
Neg logit contributions
 guardian-0.462
 answer-0.443
 figure-0.441
he-0.441
 fairy-0.437
Token it
Token ID340
Feature activation-1.78e+00

Pos logit contributions
 figure+0.022
 answer+0.021
 guardian+0.020
 fairy+0.020
 owl+0.020
Neg logit contributions
ksh-0.027
ashed-0.024
 rods-0.024
suits-0.023
 tanks-0.023
Token was
Token ID373
Feature activation+4.83e-01

Pos logit contributions
 guardian+0.087
 fairy+0.086
 figure+0.085
 answer+0.083
he+0.081
Neg logit contributions
ksh-0.131
 rods-0.127
ashed-0.125
 tanks-0.122
suits-0.118
Token going
Token ID1016
Feature activation+3.56e+00

Pos logit contributions
ksh+0.041
ashed+0.041
 rods+0.039
suits+0.038
 tanks+0.038
Neg logit contributions
 guardian-0.018
 answer-0.018
 fairy-0.017
 owl-0.017
 figure-0.016
Token on
Token ID319
Feature activation+1.71e-01

Pos logit contributions
ksh+0.298
 mouths+0.279
ashed+0.277
 rods+0.274
 pic+0.266
Neg logit contributions
 answer-0.135
 owl-0.126
 note-0.123
 figure-0.120
he-0.118
Token pic
Token ID8301
Feature activation+1.70e+01

Pos logit contributions
ashed+0.016
ksh+0.016
 mouths+0.015
suits+0.014
 tanks+0.014
Neg logit contributions
 answer-0.005
 note-0.005
 figure-0.005
he-0.005
 fairy-0.004
Tokenn
Token ID77
Feature activation+9.68e-01

Pos logit contributions
ksh+1.189
ashed+1.173
 rods+1.141
 tanks+1.140
 mouths+1.083
Neg logit contributions
 guardian-0.871
he-0.834
 figure-0.826
 answer-0.822
 fairy-0.818
Tokenics
Token ID873
Feature activation-4.91e-01

Pos logit contributions
ashed+0.069
ksh+0.068
 rods+0.067
 tanks+0.067
 sizes+0.062
Neg logit contributions
 guardian-0.048
 fairy-0.045
 figure-0.044
 wizard-0.044
 owl-0.043
Token!
Token ID0
Feature activation-2.11e+00

Pos logit contributions
he+0.018
 guardian+0.017
 answer+0.017
 figure+0.017
 note+0.017
Neg logit contributions
ksh-0.043
ashed-0.041
 rods-0.039
 tanks-0.039
 mouths-0.039
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
he+0.073
 figure+0.067
 answer+0.067
 guardian+0.065
 fairy+0.063
Neg logit contributions
ksh-0.177
ashed-0.174
 tanks-0.165
 rods-0.162
 sizes-0.161
Token
Token ID198
Feature activation-1.46e+00

Pos logit contributions
 guardian+0.092
 angel+0.084
 fairy+0.083
 owl+0.083
 wizard+0.081
Neg logit contributions
 rods-0.090
ashed-0.086
 tanks-0.086
ksh-0.083
 pic-0.081
Token saw
Token ID2497
Feature activation-4.94e+00

Pos logit contributions
ksh+0.014
 rods+0.014
ashed+0.014
 tanks+0.014
 mouths+0.013
Neg logit contributions
 guardian-0.010
 figure-0.009
 fairy-0.009
he-0.009
 answer-0.009
Token lots
Token ID6041
Feature activation+4.86e+00

Pos logit contributions
 guardian+0.238
he+0.223
 fairy+0.223
 figure+0.221
 answer+0.219
Neg logit contributions
ksh-0.364
ashed-0.360
 rods-0.353
 tanks-0.350
 mouths-0.331
Token of
Token ID286
Feature activation-2.76e+00

Pos logit contributions
ksh+0.434
ashed+0.424
 rods+0.416
 tanks+0.412
 mouths+0.397
Neg logit contributions
 guardian-0.162
he-0.154
 fairy-0.151
 figure-0.148
 answer-0.147
Token people
Token ID661
Feature activation-4.11e-01

Pos logit contributions
 guardian+0.145
he+0.138
 fairy+0.138
 figure+0.138
 answer+0.138
Neg logit contributions
ksh-0.191
ashed-0.188
 rods-0.182
 tanks-0.180
 mouths-0.171
Token having
Token ID1719
Feature activation+2.45e+00

Pos logit contributions
 guardian+0.023
 fairy+0.021
he+0.021
 figure+0.021
 wizard+0.021
Neg logit contributions
ashed-0.027
ksh-0.027
 tanks-0.026
 rods-0.026
 mouths-0.024
Token pic
Token ID8301
Feature activation+1.69e+01

Pos logit contributions
ksh+0.145
ashed+0.141
 rods+0.125
 tanks+0.125
 mouths+0.118
Neg logit contributions
 answer-0.158
 note-0.151
 voice-0.151
he-0.149
 figure-0.149
Tokenn
Token ID77
Feature activation+1.04e+00

Pos logit contributions
ksh+1.222
ashed+1.194
 tanks+1.177
 rods+1.167
 mouths+1.119
Neg logit contributions
 guardian-0.829
he-0.814
 answer-0.802
 figure-0.784
 fairy-0.773
Tokenics
Token ID873
Feature activation-5.48e-01

Pos logit contributions
ashed+0.066
ksh+0.064
 rods+0.063
 tanks+0.063
 sizes+0.058
Neg logit contributions
 guardian-0.061
 fairy-0.058
 wizard-0.057
 figure-0.056
 owl-0.055
Token and
Token ID290
Feature activation-4.50e-01

Pos logit contributions
he+0.018
 guardian+0.018
 answer+0.017
 fairy+0.017
 figure+0.017
Neg logit contributions
ksh-0.050
ashed-0.047
 rods-0.046
 tanks-0.046
 mouths-0.045
Token walking
Token ID6155
Feature activation+7.67e-01

Pos logit contributions
 owl+0.021
 figure+0.021
 guardian+0.021
 fairy+0.021
 answer+0.020
Neg logit contributions
ksh-0.036
ashed-0.032
 rods-0.030
suits-0.030
 appet-0.029
Token their
Token ID511
Feature activation+6.44e+00

Pos logit contributions
ksh+0.066
 mouths+0.062
 rods+0.062
ashed+0.061
 tanks+0.060
Neg logit contributions
 answer-0.027
he-0.027
 guardian-0.026
 figure-0.025
 note-0.025
Token be
Token ID307
Feature activation-1.04e+00

Pos logit contributions
 guardian+0.107
he+0.106
 fairy+0.100
 figure+0.099
 wizard+0.097
Neg logit contributions
ksh-0.158
ashed-0.156
 rods-0.156
 tanks-0.156
 mouths-0.148
Token there
Token ID612
Feature activation+7.28e-01

Pos logit contributions
 guardian+0.049
 fairy+0.047
 answer+0.045
 owl+0.044
 wizard+0.043
Neg logit contributions
ksh-0.079
 rods-0.079
ashed-0.077
 tanks-0.076
 mouths-0.073
Token to
Token ID284
Feature activation+2.55e+00

Pos logit contributions
ksh+0.051
ashed+0.050
 rods+0.049
 tanks+0.049
 mouths+0.046
Neg logit contributions
 guardian-0.040
 fairy-0.038
 answer-0.038
he-0.037
 figure-0.036
Token support
Token ID1104
Feature activation-3.04e-01

Pos logit contributions
 tanks+0.155
 rods+0.155
 pic+0.152
ksh+0.149
ashed+0.145
Neg logit contributions
 guardian-0.132
he-0.124
ibrarian-0.122
 fairy-0.121
 note-0.121
Token her
Token ID607
Feature activation-4.56e+00

Pos logit contributions
 guardian+0.013
 fairy+0.013
 figure+0.012
he+0.012
 answer+0.012
Neg logit contributions
ksh-0.024
ashed-0.024
 rods-0.023
 tanks-0.023
 mouths-0.021
Token pic
Token ID8301
Feature activation+1.69e+01

Pos logit contributions
 guardian+0.226
 fairy+0.213
 figure+0.212
 answer+0.209
he+0.205
Neg logit contributions
ksh-0.339
ashed-0.338
 rods-0.325
 tanks-0.318
 mouths-0.299
Tokenn
Token ID77
Feature activation+9.40e-01

Pos logit contributions
ksh+1.086
ashed+1.074
 rods+1.057
 tanks+1.048
 mouths+0.974
Neg logit contributions
 guardian-0.952
he-0.904
 fairy-0.899
 figure-0.895
 answer-0.879
Tokenics
Token ID873
Feature activation-6.45e-01

Pos logit contributions
ashed+0.059
ksh+0.058
 rods+0.057
 tanks+0.057
 sizes+0.053
Neg logit contributions
 guardian-0.053
 fairy-0.051
 wizard-0.050
 figure-0.049
 mirror-0.049
Token--
Token ID438
Feature activation-1.95e+00

Pos logit contributions
 guardian+0.033
he+0.031
 fairy+0.031
 figure+0.031
 answer+0.031
Neg logit contributions
ksh-0.045
ashed-0.045
 rods-0.044
 tanks-0.043
 mouths-0.041
Tokenno
Token ID3919
Feature activation-2.38e-01

Pos logit contributions
 guardian+0.100
 figure+0.095
 fairy+0.095
 answer+0.093
he+0.092
Neg logit contributions
ksh-0.134
ashed-0.132
 rods-0.130
 tanks-0.128
 pic-0.121
Token matter
Token ID2300
Feature activation+1.98e+00

Pos logit contributions
 guardian+0.011
 fairy+0.010
he+0.010
 figure+0.010
 answer+0.010
Neg logit contributions
ksh-0.019
ashed-0.018
 rods-0.017
 tanks-0.017
suits-0.017
Token and
Token ID290
Feature activation-4.56e-01

Pos logit contributions
 guardian+0.046
he+0.043
 fairy+0.043
 figure+0.042
 answer+0.042
Neg logit contributions
ksh-0.077
ashed-0.076
 rods-0.075
 tanks-0.074
 mouths-0.070
Token nodded
Token ID14464
Feature activation+2.25e+00

Pos logit contributions
 figure+0.014
 guardian+0.014
 fairy+0.014
 owl+0.014
 voice+0.013
Neg logit contributions
ksh-0.042
 rods-0.040
 tanks-0.039
ashed-0.038
 sizes-0.038
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
ashed+0.185
ksh+0.185
 rods+0.182
 tanks+0.179
 pic+0.171
Neg logit contributions
 guardian-0.089
 answer-0.086
 figure-0.085
 fairy-0.085
 voice-0.081
Token He
Token ID679
Feature activation+9.34e-02

Pos logit contributions
ashed+0.011
ksh+0.011
girls+0.011
 pic+0.011
 rods+0.011
Neg logit contributions
 voice-0.006
 answer-0.006
 guardian-0.005
 figure-0.005
 mirror-0.005
Token loved
Token ID6151
Feature activation+1.07e-02

Pos logit contributions
ksh+0.007
 tanks+0.006
 rods+0.006
ashed+0.006
 mouths+0.006
Neg logit contributions
 guardian-0.005
 figure-0.005
 fairy-0.004
 answer-0.004
 owl-0.004
Token pic
Token ID8301
Feature activation+1.69e+01

Pos logit contributions
ksh+0.001
ashed+0.001
suits+0.001
 rods+0.001
 appet+0.001
Neg logit contributions
 owl-0.000
 answer-0.000
 fairy-0.000
 wizard-0.000
 guardian-0.000
Tokenn
Token ID77
Feature activation+9.71e-01

Pos logit contributions
ksh+0.925
ashed+0.916
 rods+0.914
 tanks+0.881
 pic+0.824
Neg logit contributions
 guardian-1.079
 fairy-1.025
 wizard-1.005
 figure-0.997
 owl-0.995
Tokenics
Token ID873
Feature activation-5.56e-01

Pos logit contributions
ashed+0.054
ksh+0.052
 rods+0.052
 tanks+0.050
 sizes+0.048
Neg logit contributions
 guardian-0.062
 fairy-0.060
 wizard-0.059
 mirror-0.058
 witch-0.057
Token and
Token ID290
Feature activation-5.05e-01

Pos logit contributions
 guardian+0.023
he+0.022
 figure+0.021
 fairy+0.021
 answer+0.021
Neg logit contributions
ksh-0.045
ashed-0.045
 rods-0.043
 tanks-0.043
 mouths-0.042
Token couldn
Token ID3521
Feature activation+6.55e+00

Pos logit contributions
 figure+0.024
 owl+0.024
 fairy+0.024
 answer+0.023
 guardian+0.023
Neg logit contributions
ksh-0.038
 rods-0.034
ashed-0.033
suits-0.032
 tanks-0.031
Token't
Token ID470
Feature activation-9.08e+00

Pos logit contributions
ksh+0.573
ashed+0.561
 rods+0.560
 tanks+0.554
 pic+0.532
Neg logit contributions
 guardian-0.216
 fairy-0.192
 figure-0.186
 wizard-0.186
 owl-0.182
Token so
Token ID523
Feature activation+3.48e+00

Pos logit contributions
ksh+0.022
ashed+0.022
 tanks+0.022
 rods+0.021
 mouths+0.020
Neg logit contributions
 guardian-0.021
he-0.020
 fairy-0.020
 wizard-0.020
 figure-0.020
Token excited
Token ID6568
Feature activation+1.93e+00

Pos logit contributions
ksh+0.245
ashed+0.243
 rods+0.238
 tanks+0.236
 mouths+0.222
Neg logit contributions
 guardian-0.178
he-0.168
 fairy-0.167
 figure-0.166
 answer-0.164
Token!
Token ID0
Feature activation-2.14e+00

Pos logit contributions
ksh+0.131
ashed+0.130
 rods+0.127
 tanks+0.127
 mouths+0.119
Neg logit contributions
 guardian-0.102
he-0.097
 fairy-0.097
 figure-0.096
 answer-0.095
Token She
Token ID1375
Feature activation-4.45e+00

Pos logit contributions
 guardian+0.079
he+0.077
 figure+0.074
 fairy+0.074
 answer+0.073
Neg logit contributions
ksh-0.178
ashed-0.175
 rods-0.175
 tanks-0.174
 mouths-0.164
Token loved
Token ID6151
Feature activation-6.41e-02

Pos logit contributions
 figure+0.207
 fairy+0.198
 guardian+0.197
 voice+0.192
 answer+0.190
Neg logit contributions
ksh-0.338
 rods-0.320
 tanks-0.309
 sizes-0.303
ashed-0.302
Token pic
Token ID8301
Feature activation+1.68e+01

Pos logit contributions
 owl+0.003
 answer+0.003
 note+0.003
 veil+0.003
 witch+0.003
Neg logit contributions
ksh-0.005
ashed-0.005
suits-0.004
 rods-0.004
 tanks-0.004
Tokenn
Token ID77
Feature activation+9.06e-01

Pos logit contributions
ksh+0.964
ashed+0.954
 rods+0.949
 tanks+0.923
 pic+0.878
Neg logit contributions
 guardian-1.048
 fairy-0.990
 wizard-0.971
 figure-0.969
 owl-0.957
Tokenics
Token ID873
Feature activation-6.36e-01

Pos logit contributions
ashed+0.049
ksh+0.047
 rods+0.047
 tanks+0.045
 sizes+0.042
Neg logit contributions
 guardian-0.061
 fairy-0.059
 wizard-0.057
 mirror-0.055
 figure-0.055
Token and
Token ID290
Feature activation-5.38e-01

Pos logit contributions
 guardian+0.028
he+0.027
 fairy+0.026
 figure+0.026
 answer+0.026
Neg logit contributions
ksh-0.049
ashed-0.048
 rods-0.048
 tanks-0.047
 mouths-0.045
Token spending
Token ID4581
Feature activation+6.21e+00

Pos logit contributions
 figure+0.024
 fairy+0.024
 answer+0.023
 owl+0.023
 mirror+0.023
Neg logit contributions
ksh-0.042
 rods-0.038
ashed-0.036
suits-0.035
 tanks-0.034
Token time
Token ID640
Feature activation+3.37e+00

Pos logit contributions
ksh+0.479
ashed+0.444
 tanks+0.441
 rods+0.439
 mouths+0.419
Neg logit contributions
he-0.285
 answer-0.272
 note-0.268
 figure-0.265
 fairy-0.262
Token Tim
Token ID5045
Feature activation+7.12e+00

Pos logit contributions
he+0.004
 answer+0.004
ad+0.004
 figure+0.004
 owl+0.004
Neg logit contributions
ashed-0.004
ksh-0.003
 rods-0.003
suits-0.003
 tails-0.003
Tokenmy
Token ID1820
Feature activation-6.25e+00

Pos logit contributions
ashed+0.534
 tanks+0.523
ksh+0.518
 sizes+0.497
 rods+0.495
Neg logit contributions
 guardian-0.346
 wizard-0.307
 fairy-0.304
 note-0.290
 figure-0.286
Token loved
Token ID6151
Feature activation-1.85e-01

Pos logit contributions
 guardian+0.387
he+0.364
 fairy+0.364
 answer+0.361
 figure+0.359
Neg logit contributions
ashed-0.370
ksh-0.368
 tanks-0.360
 rods-0.358
 mouths-0.332
Token going
Token ID1016
Feature activation+3.33e+00

Pos logit contributions
 owl+0.009
 mirror+0.008
 answer+0.008
 figure+0.008
 note+0.007
Neg logit contributions
ksh-0.015
ashed-0.014
suits-0.014
 sizes-0.013
 appet-0.013
Token on
Token ID319
Feature activation-7.24e-02

Pos logit contributions
ksh+0.234
ashed+0.228
 rods+0.222
 tanks+0.219
 mouths+0.213
Neg logit contributions
 guardian-0.174
he-0.166
 answer-0.165
 figure-0.163
 fairy-0.162
Token pic
Token ID8301
Feature activation+1.68e+01

Pos logit contributions
he+0.004
 guardian+0.004
 answer+0.004
 note+0.004
 figure+0.004
Neg logit contributions
ashed-0.005
ksh-0.004
 rods-0.004
 tanks-0.004
suits-0.004
Tokenn
Token ID77
Feature activation+7.75e-01

Pos logit contributions
 rods+0.998
ksh+0.990
ashed+0.986
 tanks+0.983
 pic+0.915
Neg logit contributions
 guardian-1.003
 fairy-0.958
he-0.930
 wizard-0.926
 figure-0.924
Tokenics
Token ID873
Feature activation-7.82e-01

Pos logit contributions
ashed+0.040
 rods+0.040
 tanks+0.039
ksh+0.039
 pic+0.037
Neg logit contributions
 guardian-0.053
 fairy-0.051
 wizard-0.048
 mirror-0.048
 elf-0.047
Token with
Token ID351
Feature activation-3.09e+00

Pos logit contributions
 guardian+0.032
he+0.032
 answer+0.030
 figure+0.030
 fairy+0.030
Neg logit contributions
ksh-0.063
ashed-0.062
 rods-0.060
 tanks-0.059
 mouths-0.057
Token his
Token ID465
Feature activation-8.48e-02

Pos logit contributions
 answer+0.093
 mirror+0.091
 owl+0.091
 fairy+0.087
he+0.086
Neg logit contributions
ashed-0.281
ksh-0.278
suits-0.265
 appet-0.256
 sizes-0.249
Token family
Token ID1641
Feature activation-1.15e+00

Pos logit contributions
 fairy+0.003
 guardian+0.003
 note+0.003
 owl+0.003
 voice+0.003
Neg logit contributions
ksh-0.008
ashed-0.007
 sizes-0.007
 rods-0.007
 appet-0.007
Token to
Token ID284
Feature activation+2.46e+00

Pos logit contributions
 guardian+0.058
 fairy+0.054
he+0.053
 figure+0.053
 answer+0.053
Neg logit contributions
ksh-0.114
ashed-0.112
 rods-0.109
 tanks-0.108
 mouths-0.103
Token be
Token ID307
Feature activation-1.25e+00

Pos logit contributions
 rods+0.153
ashed+0.152
 tanks+0.152
ksh+0.151
 tails+0.143
Neg logit contributions
 guardian-0.133
he-0.126
 fairy-0.124
 wizard-0.123
 note-0.119
Token careful
Token ID8161
Feature activation+5.44e+00

Pos logit contributions
 guardian+0.061
 fairy+0.057
 answer+0.057
 owl+0.057
 figure+0.055
Neg logit contributions
ksh-0.096
 rods-0.090
ashed-0.090
 tanks-0.089
suits-0.085
Token when
Token ID618
Feature activation+5.95e+00

Pos logit contributions
ksh+0.404
ashed+0.401
 rods+0.395
 tanks+0.393
 mouths+0.370
Neg logit contributions
 guardian-0.253
he-0.237
 fairy-0.236
 figure-0.234
 answer-0.231
Token having
Token ID1719
Feature activation+2.33e+00

Pos logit contributions
ksh+0.410
ashed+0.369
suits+0.341
 rods+0.306
 tanks+0.302
Neg logit contributions
 figure-0.379
 fairy-0.376
 answer-0.373
 guardian-0.371
 wizard-0.370
Token pic
Token ID8301
Feature activation+1.68e+01

Pos logit contributions
ksh+0.174
ashed+0.169
suits+0.163
 rods+0.153
 tanks+0.151
Neg logit contributions
 answer-0.117
 note-0.111
 voice-0.110
 owl-0.108
 fairy-0.107
Tokenn
Token ID77
Feature activation+8.56e-01

Pos logit contributions
ksh+1.159
ashed+1.141
 rods+1.119
 tanks+1.118
 mouths+1.050
Neg logit contributions
 guardian-0.874
he-0.841
 answer-0.825
 figure-0.824
 fairy-0.822
Tokenics
Token ID873
Feature activation-7.18e-01

Pos logit contributions
ksh+0.058
ashed+0.058
 rods+0.057
 tanks+0.056
 mouths+0.053
Neg logit contributions
 guardian-0.045
he-0.043
 fairy-0.042
 figure-0.042
 answer-0.042
Token and
Token ID290
Feature activation-6.21e-01

Pos logit contributions
 figure+0.025
he+0.025
 guardian+0.024
 answer+0.024
 fairy+0.024
Neg logit contributions
ksh-0.065
ashed-0.062
 rods-0.059
 tanks-0.058
 mouths-0.057
Token not
Token ID407
Feature activation-5.84e+00

Pos logit contributions
 guardian+0.030
 owl+0.029
 figure+0.029
 answer+0.029
 fairy+0.029
Neg logit contributions
ksh-0.048
ashed-0.045
 rods-0.043
suits-0.043
 tanks-0.042
Token to
Token ID284
Feature activation+2.39e+00

Pos logit contributions
 answer+0.254
 guardian+0.250
 figure+0.245
 owl+0.242
 note+0.239
Neg logit contributions
ksh-0.464
ashed-0.441
 rods-0.440
 tanks-0.427
suits-0.422
Token children
Token ID1751
Feature activation+7.53e-01

Pos logit contributions
ksh+0.270
ashed+0.256
 rods+0.238
 tanks+0.233
suits+0.227
Neg logit contributions
 guardian-0.234
 answer-0.223
 fairy-0.222
 figure-0.216
 voice-0.214
Token running
Token ID2491
Feature activation+3.64e+00

Pos logit contributions
ashed+0.048
ksh+0.047
 tanks+0.047
 rods+0.047
 mouths+0.043
Neg logit contributions
 guardian-0.043
he-0.040
 fairy-0.040
 figure-0.040
 answer-0.040
Token around
Token ID1088
Feature activation-4.71e+00

Pos logit contributions
ksh+0.327
 mouths+0.308
 rods+0.307
ashed+0.301
 topp+0.297
Neg logit contributions
he-0.117
 guardian-0.115
 answer-0.111
 fairy-0.110
 figure-0.110
Token,
Token ID11
Feature activation+1.59e+00

Pos logit contributions
 figure+0.179
 guardian+0.176
 answer+0.175
 fairy+0.173
 wizard+0.164
Neg logit contributions
ksh-0.411
ashed-0.398
 rods-0.372
 mouths-0.367
 tanks-0.362
Token families
Token ID4172
Feature activation+6.89e+00

Pos logit contributions
ksh+0.118
ashed+0.114
 rods+0.113
 tanks+0.109
 mouths+0.105
Neg logit contributions
 guardian-0.077
 figure-0.073
 fairy-0.072
 answer-0.071
 owl-0.070
Token pic
Token ID8301
Feature activation+1.67e+01

Pos logit contributions
ksh+0.491
ashed+0.484
 rods+0.475
 tanks+0.471
 mouths+0.445
Neg logit contributions
 guardian-0.344
he-0.325
 fairy-0.323
 figure-0.322
 answer-0.318
Tokenn
Token ID77
Feature activation+8.00e-01

Pos logit contributions
ksh+1.120
ashed+1.108
 rods+1.085
 tanks+1.079
 mouths+1.014
Neg logit contributions
 guardian-0.906
he-0.860
 fairy-0.854
 figure-0.851
 answer-0.842
Tokenicking
Token ID7958
Feature activation+2.54e+00

Pos logit contributions
ashed+0.051
ksh+0.048
 rods+0.048
 tanks+0.048
suits+0.044
Neg logit contributions
 guardian-0.047
 fairy-0.045
 wizard-0.044
 figure-0.044
 answer-0.044
Token,
Token ID11
Feature activation+1.58e+00

Pos logit contributions
ksh+0.214
ashed+0.203
 rods+0.193
 tanks+0.187
suits+0.182
Neg logit contributions
 answer-0.099
 guardian-0.097
 figure-0.095
 fairy-0.095
he-0.091
Token couples
Token ID11886
Feature activation-1.65e+00

Pos logit contributions
ksh+0.125
ashed+0.121
 rods+0.118
 tanks+0.113
 mouths+0.110
Neg logit contributions
 guardian-0.070
 fairy-0.067
 figure-0.066
 owl-0.065
 answer-0.065
Token st
Token ID336
Feature activation+4.29e+00

Pos logit contributions
 guardian+0.076
 figure+0.075
 fairy+0.074
he+0.074
 answer+0.072
Neg logit contributions
ksh-0.128
ashed-0.122
 rods-0.117
 tanks-0.113
 mouths-0.112
Token girl
Token ID2576
Feature activation-2.17e+00

Pos logit contributions
ksh+0.215
ashed+0.212
 rods+0.209
 tanks+0.207
 mouths+0.196
Neg logit contributions
 guardian-0.124
 fairy-0.116
he-0.116
 figure-0.115
 answer-0.113
Token named
Token ID3706
Feature activation-1.48e+00

Pos logit contributions
 guardian+0.148
he+0.145
 answer+0.142
 note+0.139
 wizard+0.138
Neg logit contributions
ashed-0.108
 tanks-0.104
ksh-0.101
 rods-0.100
 pic-0.094
Token Lily
Token ID20037
Feature activation+4.63e+00

Pos logit contributions
 guardian+0.110
 fairy+0.107
he+0.107
 answer+0.106
 figure+0.106
Neg logit contributions
ksh-0.070
 rods-0.069
 tanks-0.068
ashed-0.067
 mouths-0.061
Token who
Token ID508
Feature activation-1.22e+00

Pos logit contributions
ashed+0.244
 tanks+0.241
ksh+0.238
 sizes+0.227
 rods+0.221
Neg logit contributions
 guardian-0.303
 wizard-0.280
 note-0.276
 fairy-0.273
ibrarian-0.272
Token loved
Token ID6151
Feature activation-2.11e-01

Pos logit contributions
 guardian+0.084
 wizard+0.078
he+0.078
 figure+0.077
 answer+0.077
Neg logit contributions
ashed-0.065
ksh-0.062
 tanks-0.060
 rods-0.057
 pic-0.055
Token pic
Token ID8301
Feature activation+1.67e+01

Pos logit contributions
 owl+0.010
 mirror+0.009
 fairy+0.009
 answer+0.009
 note+0.009
Neg logit contributions
ksh-0.016
ashed-0.016
suits-0.015
 rods-0.015
 tanks-0.014
Tokenn
Token ID77
Feature activation+7.90e-01

Pos logit contributions
ksh+0.869
 rods+0.863
ashed+0.862
 tanks+0.831
 pic+0.821
Neg logit contributions
 guardian-1.097
 fairy-1.032
 wizard-1.029
 figure-1.013
 owl-1.003
Tokenics
Token ID873
Feature activation-7.41e-01

Pos logit contributions
ashed+0.039
ksh+0.039
 rods+0.037
 tanks+0.036
 sizes+0.035
Neg logit contributions
 guardian-0.056
 fairy-0.054
 wizard-0.053
 mirror-0.051
 elf-0.051
Token.
Token ID13
Feature activation-4.06e-02

Pos logit contributions
 guardian+0.032
he+0.031
 fairy+0.030
 figure+0.030
 answer+0.030
Neg logit contributions
ksh-0.057
ashed-0.057
 rods-0.056
 tanks-0.055
 mouths-0.052
Token She
Token ID1375
Feature activation-4.58e+00

Pos logit contributions
 answer+0.001
he+0.001
 guardian+0.001
 fairy+0.001
 figure+0.001
Neg logit contributions
 rods-0.003
ksh-0.003
ashed-0.003
 tanks-0.003
 mouths-0.003
Token would
Token ID561
Feature activation+8.17e+00

Pos logit contributions
 fairy+0.186
 figure+0.176
 owl+0.172
 answer+0.172
 guardian+0.171
Neg logit contributions
ksh-0.365
 rods-0.340
 mouths-0.323
 tanks-0.322
 topp-0.314
Token in
Token ID287
Feature activation-9.12e-01

Pos logit contributions
ksh+0.281
ashed+0.262
 mouths+0.257
 rods+0.255
 tanks+0.254
Neg logit contributions
 figure-0.117
he-0.116
 fairy-0.116
 answer-0.109
 note-0.104
Token the
Token ID262
Feature activation+4.61e-01

Pos logit contributions
 guardian+0.042
 figure+0.041
 fairy+0.041
he+0.040
 answer+0.040
Neg logit contributions
ksh-0.070
ashed-0.069
 rods-0.065
 tanks-0.065
 mouths-0.062
Token garden
Token ID11376
Feature activation-2.54e+00

Pos logit contributions
ksh+0.039
ashed+0.038
 rods+0.037
 tanks+0.037
 mouths+0.035
Neg logit contributions
 guardian-0.018
 fairy-0.016
he-0.016
 figure-0.016
 answer-0.016
Token and
Token ID290
Feature activation-6.91e-01

Pos logit contributions
he+0.079
 guardian+0.077
 figure+0.077
 answer+0.077
 fairy+0.076
Neg logit contributions
ksh-0.236
ashed-0.225
 rods-0.215
 tanks-0.214
 mouths-0.208
Token have
Token ID423
Feature activation-4.91e+00

Pos logit contributions
 guardian+0.028
 figure+0.027
 answer+0.027
 fairy+0.026
 owl+0.026
Neg logit contributions
ksh-0.057
ashed-0.056
 rods-0.054
 tanks-0.053
suits-0.051
Token pic
Token ID8301
Feature activation+1.66e+01

Pos logit contributions
 answer+0.270
 note+0.254
 fairy+0.253
 veil+0.249
 owl+0.248
Neg logit contributions
ksh-0.331
ashed-0.306
 tanks-0.280
 mouths-0.273
 rods-0.269
Tokenn
Token ID77
Feature activation+6.91e-01

Pos logit contributions
ksh+1.140
ashed+1.123
 rods+1.098
 tanks+1.097
 mouths+1.034
Neg logit contributions
 guardian-0.870
he-0.834
 figure-0.823
 fairy-0.821
 answer-0.820
Tokenics
Token ID873
Feature activation-9.08e-01

Pos logit contributions
ashed+0.043
ksh+0.042
 rods+0.042
 tanks+0.041
 sizes+0.038
Neg logit contributions
 guardian-0.042
 fairy-0.039
 wizard-0.039
 figure-0.038
 owl-0.038
Token together
Token ID1978
Feature activation+1.00e+01

Pos logit contributions
he+0.031
 note+0.029
 answer+0.029
 fairy+0.029
 guardian+0.028
Neg logit contributions
ksh-0.082
ashed-0.078
 mouths-0.076
 rods-0.075
 tanks-0.075
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
ksh+1.006
ashed+0.979
 tanks+0.959
 rods+0.950
 mouths+0.921
Neg logit contributions
 answer-0.243
he-0.240
 note-0.213
 fairy-0.200
 figure-0.197
Token
Token ID220
Feature activation-6.15e-01

Pos logit contributions
he+0.010
 answer+0.009
 guardian+0.008
 figure+0.008
 owl+0.008
Neg logit contributions
ashed-0.025
ksh-0.024
 rods-0.023
 tanks-0.022
suits-0.022
Token felt
Token ID2936
Feature activation-2.03e+00

Pos logit contributions
 guardian+0.259
he+0.252
 answer+0.244
 figure+0.243
 note+0.243
Neg logit contributions
ashed-0.281
ksh-0.274
 tanks-0.270
 rods-0.269
 mouths-0.252
Token safe
Token ID3338
Feature activation+7.36e-01

Pos logit contributions
 guardian+0.112
 figure+0.111
he+0.110
 note+0.110
 fairy+0.105
Neg logit contributions
ashed-0.127
ksh-0.119
 tanks-0.119
 rods-0.115
 pic-0.111
Token knowing
Token ID6970
Feature activation-2.10e+00

Pos logit contributions
ksh+0.056
ashed+0.056
 rods+0.055
 tanks+0.054
 mouths+0.051
Neg logit contributions
 guardian-0.033
he-0.031
 fairy-0.031
 figure-0.031
 answer-0.030
Token that
Token ID326
Feature activation-6.42e+00

Pos logit contributions
 guardian+0.091
he+0.086
 figure+0.084
 fairy+0.084
 answer+0.083
Neg logit contributions
ksh-0.162
ashed-0.161
 rods-0.158
 tanks-0.158
 mouths-0.149
Token her
Token ID607
Feature activation-5.10e+00

Pos logit contributions
 guardian+0.274
he+0.257
 fairy+0.254
 figure+0.253
 answer+0.249
Neg logit contributions
ksh-0.501
ashed-0.497
 rods-0.489
 tanks-0.486
 mouths-0.460
Token guardian
Token ID21688
Feature activation-1.51e+01

Pos logit contributions
he+0.264
iver+0.209
ad+0.206
ibrarian+0.203
 figure+0.203
Neg logit contributions
 mouths-0.329
 tails-0.326
 tanks-0.325
 rods-0.324
 pic-0.322
Token angel
Token ID18304
Feature activation-1.21e+01

Pos logit contributions
he+0.828
 guardian+0.775
 figure+0.772
 note+0.747
 mirror+0.739
Neg logit contributions
ashed-0.882
 tanks-0.875
 sizes-0.874
ksh-0.852
 rods-0.842
Token was
Token ID373
Feature activation-3.44e-01

Pos logit contributions
 guardian+0.703
he+0.684
 figure+0.666
 fairy+0.664
 answer+0.661
Neg logit contributions
ashed-0.732
ksh-0.730
 tanks-0.711
 rods-0.709
 mouths-0.665
Token there
Token ID612
Feature activation+1.56e-01

Pos logit contributions
he+0.018
 guardian+0.018
 figure+0.018
 note+0.017
 fairy+0.017
Neg logit contributions
ashed-0.021
 tanks-0.020
ksh-0.020
 rods-0.020
 pic-0.020
Token to
Token ID284
Feature activation+1.85e+00

Pos logit contributions
ksh+0.009
ashed+0.009
 rods+0.009
 tanks+0.009
 mouths+0.008
Neg logit contributions
 guardian-0.009
he-0.009
 figure-0.009
 fairy-0.009
 wizard-0.009
Token watch
Token ID2342
Feature activation-2.45e+00

Pos logit contributions
 pic+0.099
 tails+0.094
 tanks+0.093
 rods+0.092
 sizes+0.091
Neg logit contributions
ibrarian-0.099
 guardian-0.095
he-0.095
 mirror-0.092
 note-0.091
Token
Token ID198
Feature activation-2.08e+00

Pos logit contributions
 guardian+0.025
he+0.024
 note+0.024
 fairy+0.024
 answer+0.023
Neg logit contributions
ksh-0.026
ashed-0.026
 tanks-0.025
 rods-0.025
suits-0.023
Token"
Token ID1
Feature activation+1.73e+00

Pos logit contributions
 guardian+0.121
ocket+0.115
 note+0.114
 wizard+0.113
 answer+0.113
Neg logit contributions
 rods-0.108
 tanks-0.105
suits-0.105
ksh-0.105
 pic-0.103
TokenI
Token ID40
Feature activation-1.48e+00

Pos logit contributions
ksh+0.108
ashed+0.106
 rods+0.104
 tanks+0.102
 sizes+0.095
Neg logit contributions
 guardian-0.091
 answer-0.090
 figure-0.088
 wizard-0.087
 fairy-0.086
Token'm
Token ID1101
Feature activation-2.16e+00

Pos logit contributions
 angel+0.085
 guardian+0.085
 owl+0.084
 mirror+0.083
he+0.083
Neg logit contributions
ashed-0.085
 pic-0.073
 tanks-0.072
suits-0.072
ksh-0.071
Token the
Token ID262
Feature activation+3.08e-01

Pos logit contributions
he+0.126
 note+0.113
orn+0.110
 guardian+0.106
 mirror+0.104
Neg logit contributions
 tanks-0.127
ksh-0.121
 pic-0.117
 rods-0.115
 they-0.115
Token guardian
Token ID21688
Feature activation-1.50e+01

Pos logit contributions
 pic+0.022
 tanks+0.021
 sizes+0.021
 trucks+0.021
 tails+0.020
Neg logit contributions
he-0.012
on-0.011
iver-0.011
orn-0.010
ad-0.010
Token of
Token ID286
Feature activation-3.26e+00

Pos logit contributions
 figure+0.686
he+0.681
 guardian+0.670
iver+0.652
 mirror+0.644
Neg logit contributions
ksh-0.941
 tanks-0.928
 sizes-0.895
ashed-0.882
 trucks-0.870
Token the
Token ID262
Feature activation+2.93e-01

Pos logit contributions
 guardian+0.178
he+0.173
 figure+0.169
 note+0.166
 fairy+0.165
Neg logit contributions
 tanks-0.203
 rods-0.201
ksh-0.197
ashed-0.194
 pic-0.189
Token valley
Token ID19272
Feature activation+1.88e+00

Pos logit contributions
 tanks+0.017
 pic+0.016
 rods+0.016
ashed+0.016
suits+0.015
Neg logit contributions
he-0.016
ibrarian-0.016
ad-0.015
 figure-0.015
iver-0.015
Token,"
Token ID553
Feature activation-1.77e+00

Pos logit contributions
ksh+0.129
ashed+0.127
 tanks+0.126
 rods+0.126
suits+0.117
Neg logit contributions
 guardian-0.094
 figure-0.089
he-0.088
 fairy-0.088
 answer-0.087
Token said
Token ID531
Feature activation-3.29e+00

Pos logit contributions
 guardian+0.105
he+0.100
 answer+0.094
iver+0.094
ocket+0.094
Neg logit contributions
 rods-0.095
 tanks-0.095
ksh-0.093
 they-0.090
 mouths-0.089
Token thought
Token ID1807
Feature activation-3.55e+00

Pos logit contributions
 guardian+0.022
he+0.022
 answer+0.020
 note+0.020
 fairy+0.020
Neg logit contributions
ashed-0.024
ksh-0.023
 tanks-0.023
 rods-0.022
 mouths-0.021
Token he
Token ID339
Feature activation-7.22e-01

Pos logit contributions
 guardian+0.181
he+0.170
 fairy+0.169
 figure+0.168
 answer+0.165
Neg logit contributions
ksh-0.248
ashed-0.247
 rods-0.243
 tanks-0.242
 mouths-0.227
Token might
Token ID1244
Feature activation+5.22e+00

Pos logit contributions
 guardian+0.047
he+0.047
 wizard+0.044
 fairy+0.044
 note+0.044
Neg logit contributions
ashed-0.040
ksh-0.039
 tanks-0.038
 rods-0.037
 mouths-0.035
Token be
Token ID307
Feature activation-1.53e+00

Pos logit contributions
 rods+0.278
 tanks+0.275
ashed+0.275
 sizes+0.271
 tails+0.271
Neg logit contributions
he-0.308
ibrarian-0.291
 angel-0.281
 mirror-0.278
 guardian-0.270
Token a
Token ID257
Feature activation-7.71e-01

Pos logit contributions
 guardian+0.064
he+0.062
 figure+0.060
 fairy+0.060
 answer+0.058
Neg logit contributions
ksh-0.119
ashed-0.119
 tanks-0.117
 rods-0.116
 mouths-0.110
Token guardian
Token ID21688
Feature activation-1.50e+01

Pos logit contributions
 guardian+0.042
he+0.041
 fairy+0.040
 figure+0.040
 answer+0.039
Neg logit contributions
ksh-0.051
ashed-0.050
 rods-0.049
 tanks-0.049
 mouths-0.046
Token of
Token ID286
Feature activation-3.17e+00

Pos logit contributions
 guardian+0.726
he+0.703
 figure+0.676
 fairy+0.668
 answer+0.661
Neg logit contributions
ksh-1.060
ashed-1.044
 tanks-1.033
 rods-1.031
 mouths-0.968
Token the
Token ID262
Feature activation+3.92e-01

Pos logit contributions
 guardian+0.150
 fairy+0.143
 answer+0.141
 figure+0.140
he+0.139
Neg logit contributions
ksh-0.239
ashed-0.235
 rods-0.228
 tanks-0.226
suits-0.215
Token b
Token ID275
Feature activation+4.07e+00

Pos logit contributions
 tanks+0.025
ksh+0.025
ashed+0.024
 rods+0.024
 pic+0.023
Neg logit contributions
he-0.021
 guardian-0.021
ibrarian-0.021
 figure-0.020
 wizard-0.020
Tokenerry
Token ID6996
Feature activation+1.66e+00

Pos logit contributions
ashed+0.234
ksh+0.229
 rods+0.223
 tanks+0.220
 mouths+0.205
Neg logit contributions
 guardian-0.263
he-0.253
 figure-0.252
 fairy-0.251
 answer-0.248
Token bush
Token ID24484
Feature activation-2.74e+00

Pos logit contributions
ksh+0.130
ashed+0.127
 rods+0.124
 tanks+0.122
 mouths+0.115
Neg logit contributions
 guardian-0.074
 fairy-0.072
 figure-0.070
he-0.070
 owl-0.069
Token could
Token ID714
Feature activation+7.42e+00

Pos logit contributions
 figure+0.047
 guardian+0.046
 fairy+0.046
 wizard+0.044
 answer+0.043
Neg logit contributions
ksh-0.049
 tanks-0.044
 rods-0.044
ashed-0.044
 pic-0.042
Token,
Token ID11
Feature activation+1.38e+00

Pos logit contributions
ashed+0.431
 rods+0.422
 tanks+0.410
ksh+0.387
 mouths+0.374
Neg logit contributions
 guardian-0.439
 mirror-0.424
he-0.424
 angel-0.417
 fairy-0.411
Token but
Token ID475
Feature activation+1.59e+00

Pos logit contributions
ksh+0.082
ashed+0.082
 rods+0.080
 tanks+0.080
 mouths+0.074
Neg logit contributions
 guardian-0.084
he-0.080
 fairy-0.079
 figure-0.079
 answer-0.078
Token eventually
Token ID4191
Feature activation+4.14e+00

Pos logit contributions
ksh+0.099
ashed+0.099
 rods+0.097
 tanks+0.097
 mouths+0.090
Neg logit contributions
 guardian-0.092
he-0.087
 fairy-0.087
 figure-0.086
 answer-0.086
Token the
Token ID262
Feature activation+3.80e-01

Pos logit contributions
ksh+0.317
ashed+0.295
 rods+0.286
 tanks+0.284
suits+0.275
Neg logit contributions
 guardian-0.212
 fairy-0.205
 figure-0.202
 wizard-0.200
 answer-0.197
Token guardian
Token ID21688
Feature activation-1.49e+01

Pos logit contributions
 rods+0.028
 tanks+0.028
ashed+0.026
ksh+0.026
 tails+0.025
Neg logit contributions
he-0.018
 guardian-0.015
 mirror-0.014
iver-0.014
 note-0.014
Token caught
Token ID4978
Feature activation-3.36e+00

Pos logit contributions
he+0.684
 guardian+0.650
 mirror+0.647
 answer+0.629
 note+0.616
Neg logit contributions
ashed-1.038
 tanks-1.026
 rods-1.017
ksh-0.982
 sizes-0.934
Token up
Token ID510
Feature activation-6.17e+00

Pos logit contributions
 guardian+0.169
he+0.159
 fairy+0.158
 figure+0.157
 answer+0.155
Neg logit contributions
ksh-0.237
ashed-0.235
 rods-0.232
 tanks-0.231
 mouths-0.216
Token with
Token ID351
Feature activation-3.32e+00

Pos logit contributions
 guardian+0.295
he+0.278
 fairy+0.276
 figure+0.275
 answer+0.271
Neg logit contributions
ksh-0.452
ashed-0.448
 rods-0.440
 tanks-0.437
 mouths-0.412
Token him
Token ID683
Feature activation-8.92e-01

Pos logit contributions
 guardian+0.172
he+0.163
 figure+0.160
 fairy+0.160
 wizard+0.157
Neg logit contributions
ksh-0.225
ashed-0.224
 rods-0.224
 tanks-0.223
 mouths-0.209
Token and
Token ID290
Feature activation-9.14e-01

Pos logit contributions
 guardian+0.040
he+0.037
 fairy+0.037
 figure+0.037
 answer+0.036
Neg logit contributions
ksh-0.068
ashed-0.067
 rods-0.066
 tanks-0.066
 mouths-0.062
TokenThe
Token ID464
Feature activation-1.32e+00

Pos logit contributions
 guardian+0.110
 answer+0.107
 figure+0.105
 wizard+0.103
 owl+0.101
Neg logit contributions
ksh-0.113
ashed-0.112
 pic-0.110
 rods-0.110
 tanks-0.105
Token little
Token ID1310
Feature activation+2.65e+00

Pos logit contributions
iver+0.062
he+0.062
 figure+0.059
 guardian+0.059
ad+0.058
Neg logit contributions
 tanks-0.079
ksh-0.078
ashed-0.078
 mouths-0.076
 tails-0.076
Token girl
Token ID2576
Feature activation-2.32e+00

Pos logit contributions
ashed+0.163
ksh+0.160
 tanks+0.152
 mouths+0.151
 sizes+0.150
Neg logit contributions
he-0.137
 guardian-0.132
ibrarian-0.130
 figure-0.130
iver-0.128
Token thanked
Token ID26280
Feature activation+5.00e-01

Pos logit contributions
 guardian+0.130
he+0.127
 answer+0.124
 figure+0.122
 fairy+0.121
Neg logit contributions
ashed-0.145
ksh-0.141
 rods-0.138
 tanks-0.137
 mouths-0.132
Token her
Token ID607
Feature activation-4.94e+00

Pos logit contributions
 pic+0.026
 tanks+0.026
 rods+0.026
ksh+0.026
 mouths+0.025
Neg logit contributions
 guardian-0.030
he-0.029
 note-0.027
 figure-0.027
orn-0.027
Token guardian
Token ID21688
Feature activation-1.49e+01

Pos logit contributions
he+0.260
iver+0.222
 mirror+0.208
 figure+0.206
 note+0.202
Neg logit contributions
 rods-0.315
 tanks-0.314
 tails-0.308
ashed-0.305
 mouths-0.303
Token angel
Token ID18304
Feature activation-1.19e+01

Pos logit contributions
he+0.870
 guardian+0.852
 figure+0.835
 note+0.806
 answer+0.792
Neg logit contributions
ksh-0.866
ashed-0.853
 tanks-0.849
 rods-0.843
 mouths-0.796
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
 guardian+0.692
 fairy+0.658
 figure+0.646
he+0.645
 answer+0.644
Neg logit contributions
ksh-0.754
ashed-0.749
 rods-0.733
 tanks-0.722
 mouths-0.674
Token and
Token ID290
Feature activation-9.46e-01

Pos logit contributions
ksh+0.088
ashed+0.088
 tanks+0.084
 rods+0.084
 mouths+0.080
Neg logit contributions
 guardian-0.066
he-0.063
 figure-0.062
 answer-0.061
 fairy-0.061
Token knew
Token ID2993
Feature activation-3.15e+00

Pos logit contributions
 guardian+0.052
 fairy+0.050
he+0.049
 figure+0.049
 answer+0.048
Neg logit contributions
ksh-0.063
ashed-0.062
 rods-0.061
 tanks-0.061
 mouths-0.057
Token that
Token ID326
Feature activation-6.23e+00

Pos logit contributions
 guardian+0.146
he+0.144
 figure+0.139
 fairy+0.135
 note+0.134
Neg logit contributions
ashed-0.218
ksh-0.217
 rods-0.216
 tanks-0.215
 pic-0.205
Token
Token ID129
Feature activation+5.36e+00

Pos logit contributions
ksh+0.038
ashed+0.037
 rods+0.037
 tanks+0.036
 mouths+0.034
Neg logit contributions
 guardian-0.046
 answer-0.045
he-0.045
 fairy-0.045
 figure-0.044
Token
Token ID241
Feature activation+1.32e+00

Pos logit contributions
ksh+0.298
 rods+0.281
 tanks+0.276
ashed+0.275
 tails+0.263
Neg logit contributions
 fairy-0.350
he-0.340
 guardian-0.340
 note-0.334
 mirror-0.331
TokenI
Token ID40
Feature activation-1.22e+00

Pos logit contributions
ksh+0.089
ashed+0.088
 rods+0.087
 tanks+0.085
 pic+0.079
Neg logit contributions
 guardian-0.070
 fairy-0.067
he-0.066
 figure-0.065
 answer-0.065
Token am
Token ID716
Feature activation-3.93e+00

Pos logit contributions
 angel+0.067
 owl+0.067
 guardian+0.065
 mirror+0.064
 note+0.064
Neg logit contributions
ashed-0.077
 pic-0.068
 rods-0.065
 tanks-0.065
ksh-0.065
Token the
Token ID262
Feature activation+5.84e-01

Pos logit contributions
he+0.197
 guardian+0.193
 note+0.187
 figure+0.184
 fairy+0.182
Neg logit contributions
ksh-0.266
 tanks-0.265
ashed-0.263
 rods-0.260
 pic-0.244
Token guardian
Token ID21688
Feature activation-1.48e+01

Pos logit contributions
 sizes+0.037
ksh+0.036
 tanks+0.036
 rods+0.035
 tails+0.035
Neg logit contributions
he-0.030
ibrarian-0.025
 note-0.025
ocket-0.024
mother-0.024
Token of
Token ID286
Feature activation-2.93e+00

Pos logit contributions
he+0.635
 note+0.592
 guardian+0.587
 figure+0.581
iver+0.559
Neg logit contributions
ksh-1.020
 tanks-1.000
 sizes-0.964
ashed-0.957
 rods-0.949
Token the
Token ID262
Feature activation+5.59e-01

Pos logit contributions
 guardian+0.150
he+0.142
 fairy+0.140
 figure+0.140
 answer+0.138
Neg logit contributions
ksh-0.204
ashed-0.202
 rods-0.199
 tanks-0.198
 mouths-0.186
Token hidden
Token ID7104
Feature activation-5.71e+00

Pos logit contributions
 tanks+0.034
 sizes+0.033
 tails+0.033
 pic+0.033
 rods+0.032
Neg logit contributions
he-0.029
ibrarian-0.027
ocket-0.025
 note-0.024
 guardian-0.024
Token forest
Token ID8222
Feature activation-2.61e+00

Pos logit contributions
he+0.250
 guardian+0.222
 note+0.211
 figure+0.197
 answer+0.194
Neg logit contributions
 tanks-0.420
 rods-0.419
ashed-0.403
ksh-0.394
 pic-0.391
Token.
Token ID13
Feature activation-3.27e-01

Pos logit contributions
 guardian+0.115
 fairy+0.108
 figure+0.106
he+0.105
 answer+0.105
Neg logit contributions
ksh-0.206
ashed-0.204
 rods-0.197
 tanks-0.194
 mouths-0.188
Token again
Token ID757
Feature activation+2.92e+00

Pos logit contributions
 guardian+0.116
 answer+0.111
 figure+0.111
 fairy+0.110
 owl+0.106
Neg logit contributions
ksh-0.193
ashed-0.191
 rods-0.183
 tanks-0.182
 pic-0.174
Token.
Token ID13
Feature activation-7.89e-02

Pos logit contributions
ksh+0.244
ashed+0.241
 rods+0.231
 tanks+0.229
 mouths+0.220
Neg logit contributions
 guardian-0.117
 figure-0.112
 fairy-0.109
 answer-0.108
he-0.105
Token
Token ID198
Feature activation-1.73e+00

Pos logit contributions
he+0.004
 guardian+0.004
 wizard+0.004
 answer+0.004
 figure+0.004
Neg logit contributions
ashed-0.005
ksh-0.005
 rods-0.005
 tanks-0.005
 tails-0.005
Token
Token ID198
Feature activation-1.75e+00

Pos logit contributions
 guardian+0.116
 angel+0.112
 fairy+0.107
 mirror+0.105
 answer+0.103
Neg logit contributions
 rods-0.097
ashed-0.093
 tanks-0.093
 pic-0.091
suits-0.089
TokenThe
Token ID464
Feature activation-1.20e+00

Pos logit contributions
 guardian+0.093
 fairy+0.088
 answer+0.087
 figure+0.086
 wizard+0.085
Neg logit contributions
ksh-0.118
ashed-0.116
 rods-0.116
 tanks-0.114
 pic-0.106
Token guardian
Token ID21688
Feature activation-1.47e+01

Pos logit contributions
he+0.058
iver+0.048
 mirror+0.045
 guardian+0.045
bird+0.045
Neg logit contributions
 tanks-0.086
 rods-0.085
 trucks-0.078
ashed-0.076
 tails-0.074
Token became
Token ID2627
Feature activation+3.20e+00

Pos logit contributions
he+0.680
 guardian+0.676
 answer+0.637
 figure+0.626
 mirror+0.623
Neg logit contributions
ashed-1.050
 tanks-1.033
 rods-1.032
ksh-1.020
 mouths-0.954
Token so
Token ID523
Feature activation+3.30e+00

Pos logit contributions
ksh+0.251
ashed+0.247
 rods+0.242
 tanks+0.240
 mouths+0.229
Neg logit contributions
 guardian-0.141
 fairy-0.131
 figure-0.130
he-0.128
 answer-0.128
Token angry
Token ID7954
Feature activation-5.07e+00

Pos logit contributions
ashed+0.272
ksh+0.271
 rods+0.268
 tanks+0.267
 mouths+0.252
Neg logit contributions
 guardian-0.125
he-0.119
 fairy-0.115
 figure-0.115
 answer-0.113
Token that
Token ID326
Feature activation-6.00e+00

Pos logit contributions
 guardian+0.195
 fairy+0.184
 figure+0.183
he+0.181
 owl+0.179
Neg logit contributions
ksh-0.427
ashed-0.417
 rods-0.409
 tanks-0.405
 mouths-0.392
Token it
Token ID340
Feature activation-2.10e+00

Pos logit contributions
 guardian+0.370
he+0.352
 fairy+0.350
 figure+0.349
 answer+0.346
Neg logit contributions
ksh-0.353
ashed-0.349
 rods-0.344
 tanks-0.344
 mouths-0.318
Token cute
Token ID13779
Feature activation-3.53e+00

Pos logit contributions
 answer+0.154
 fairy+0.153
 guardian+0.152
 figure+0.150
 mirror+0.145
Neg logit contributions
ksh-0.168
ashed-0.155
 rods-0.145
 tanks-0.140
suits-0.133
Token and
Token ID290
Feature activation-3.19e+00

Pos logit contributions
 guardian+0.174
he+0.164
 fairy+0.163
 figure+0.162
 answer+0.160
Neg logit contributions
ksh-0.251
ashed-0.249
 rods-0.244
 tanks-0.243
 mouths-0.229
Token belonged
Token ID19611
Feature activation-2.51e+00

Pos logit contributions
 guardian+0.159
 fairy+0.155
 figure+0.151
 answer+0.151
 owl+0.149
Neg logit contributions
ksh-0.229
ashed-0.223
 rods-0.220
 tanks-0.214
suits-0.203
Token to
Token ID284
Feature activation+9.45e-01

Pos logit contributions
 guardian+0.144
 fairy+0.137
he+0.136
 figure+0.135
 answer+0.134
Neg logit contributions
ksh-0.162
ashed-0.158
 rods-0.156
 tanks-0.154
 mouths-0.145
Token a
Token ID257
Feature activation-2.01e+00

Pos logit contributions
ksh+0.069
ashed+0.068
 rods+0.065
 tanks+0.064
suits+0.062
Neg logit contributions
 guardian-0.048
 fairy-0.046
 answer-0.046
 figure-0.046
 owl-0.045
Token wizard
Token ID18731
Feature activation-1.46e+01

Pos logit contributions
 guardian+0.094
 fairy+0.088
he+0.088
 figure+0.088
 answer+0.086
Neg logit contributions
ksh-0.149
ashed-0.148
 rods-0.145
 tanks-0.144
 mouths-0.136
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 guardian+0.707
he+0.670
 fairy+0.653
 figure+0.651
 answer+0.650
Neg logit contributions
ksh-1.029
ashed-1.020
 tanks-1.012
 rods-1.005
 mouths-0.936
Token Every
Token ID3887
Feature activation+5.39e-02

Pos logit contributions
 guardian+0.059
 fairy+0.056
he+0.056
 wizard+0.055
 figure+0.055
Neg logit contributions
ashed-0.081
ksh-0.080
 rods-0.079
 tanks-0.077
 mouths-0.074
Token day
Token ID1110
Feature activation+2.26e+00

Pos logit contributions
ashed+0.003
 rods+0.003
 tanks+0.003
ksh+0.003
 pic+0.003
Neg logit contributions
 guardian-0.004
he-0.003
 figure-0.003
 fairy-0.003
 answer-0.003
Token the
Token ID262
Feature activation+2.80e-01

Pos logit contributions
ksh+0.190
ashed+0.182
 rods+0.175
suits+0.168
 tanks+0.167
Neg logit contributions
 fairy-0.106
 wizard-0.100
 guardian-0.098
 answer-0.098
 figure-0.097
Token wizard
Token ID18731
Feature activation-1.39e+01

Pos logit contributions
 tanks+0.025
 rods+0.024
ashed+0.023
 trucks+0.023
 topp+0.022
Neg logit contributions
he-0.008
iver-0.006
 guardian-0.005
She-0.005
 note-0.004
Token angel
Token ID18304
Feature activation-1.15e+01

Pos logit contributions
 pic+0.055
 mouths+0.055
 trucks+0.054
 games+0.054
 sizes+0.054
Neg logit contributions
iver-0.019
 figure-0.018
he-0.018
ad-0.018
orn-0.016
Token was
Token ID373
Feature activation+2.87e-01

Pos logit contributions
 guardian+0.699
he+0.674
 figure+0.659
 answer+0.645
 fairy+0.645
Neg logit contributions
ashed-0.667
ksh-0.664
 tanks-0.651
 rods-0.639
 mouths-0.611
Token your
Token ID534
Feature activation+1.29e+00

Pos logit contributions
 tanks+0.017
 pic+0.017
ksh+0.016
 tails+0.016
ashed+0.016
Neg logit contributions
he-0.016
 figure-0.014
 guardian-0.014
orn-0.014
 note-0.014
Token mom
Token ID1995
Feature activation-1.36e+00

Pos logit contributions
ksh+0.081
ashed+0.080
 tanks+0.080
 rods+0.080
 mouths+0.078
Neg logit contributions
he-0.069
 figure-0.062
 guardian-0.062
orn-0.060
ad-0.060
Token's
Token ID338
Feature activation-4.69e-01

Pos logit contributions
 guardian+0.072
 figure+0.066
 wizard+0.066
 fairy+0.065
 answer+0.065
Neg logit contributions
ksh-0.089
 tanks-0.086
ashed-0.086
 rods-0.082
 mouths-0.082
Token guardian
Token ID21688
Feature activation-1.46e+01

Pos logit contributions
he+0.024
orn+0.021
ad+0.020
ibrarian+0.020
iver+0.020
Neg logit contributions
 tails-0.028
 pic-0.028
 mouths-0.028
 tanks-0.027
 sizes-0.027
Token angel
Token ID18304
Feature activation-1.16e+01

Pos logit contributions
 guardian+0.694
he+0.661
 figure+0.655
 fairy+0.627
 note+0.625
Neg logit contributions
ksh-1.023
ashed-1.000
 tanks-0.999
 rods-0.983
 mouths-0.938
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
 guardian+0.512
he+0.479
 fairy+0.479
 figure+0.474
 answer+0.468
Neg logit contributions
ksh-0.885
ashed-0.879
 rods-0.864
 tanks-0.857
 mouths-0.810
Token She
Token ID1375
Feature activation-4.81e+00

Pos logit contributions
 guardian+0.006
 fairy+0.006
 answer+0.006
 figure+0.006
he+0.006
Neg logit contributions
ashed-0.008
ksh-0.008
 rods-0.008
 tanks-0.008
 mouths-0.007
Token was
Token ID373
Feature activation+2.73e-01

Pos logit contributions
he+0.301
 guardian+0.294
 figure+0.283
 note+0.282
 answer+0.279
Neg logit contributions
ashed-0.261
ksh-0.260
 tanks-0.256
 rods-0.243
 mouths-0.238
Token watching
Token ID4964
Feature activation+5.79e-01

Pos logit contributions
 pic+0.015
 tanks+0.014
ksh+0.014
 they+0.013
 tails+0.013
Neg logit contributions
he-0.017
orn-0.015
 figure-0.015
 note-0.015
iver-0.014
Token that
Token ID326
Feature activation-5.72e+00

Pos logit contributions
 guardian+0.121
he+0.119
 figure+0.113
 fairy+0.111
 answer+0.111
Neg logit contributions
ashed-0.186
ksh-0.185
 rods-0.184
 tanks-0.183
 pic-0.174
Token Aunt
Token ID38074
Feature activation-8.79e-01

Pos logit contributions
 guardian+0.273
 fairy+0.260
 figure+0.256
 answer+0.254
 owl+0.248
Neg logit contributions
ksh-0.438
ashed-0.433
 rods-0.419
 tanks-0.414
suits-0.394
Token was
Token ID373
Feature activation+4.08e-01

Pos logit contributions
 guardian+0.044
he+0.042
 figure+0.042
 fairy+0.041
 answer+0.041
Neg logit contributions
ksh-0.061
ashed-0.061
 tanks-0.059
 rods-0.059
 mouths-0.056
Token really
Token ID1107
Feature activation-1.43e+00

Pos logit contributions
 tanks+0.025
ashed+0.024
ksh+0.024
 rods+0.023
 pic+0.023
Neg logit contributions
he-0.022
orn-0.020
 guardian-0.020
 figure-0.020
 note-0.020
Token her
Token ID607
Feature activation-4.48e+00

Pos logit contributions
he+0.068
 guardian+0.067
 figure+0.063
 fairy+0.063
 owl+0.062
Neg logit contributions
ashed-0.100
 tanks-0.098
ksh-0.097
 rods-0.096
 sizes-0.091
Token guardian
Token ID21688
Feature activation-1.46e+01

Pos logit contributions
 guardian+0.225
 fairy+0.210
 figure+0.210
 answer+0.208
he+0.205
Neg logit contributions
ksh-0.324
ashed-0.321
 rods-0.313
 tanks-0.311
 mouths-0.293
Token angel
Token ID18304
Feature activation-1.15e+01

Pos logit contributions
 guardian+0.651
he+0.616
 figure+0.605
 fairy+0.602
 answer+0.594
Neg logit contributions
ksh-1.096
ashed-1.086
 rods-1.068
 tanks-1.066
 mouths-1.005
Token and
Token ID290
Feature activation-4.14e-01

Pos logit contributions
 guardian+0.523
 fairy+0.495
 figure+0.484
 answer+0.474
he+0.474
Neg logit contributions
ksh-0.891
ashed-0.881
 rods-0.870
 tanks-0.855
 mouths-0.807
Token she
Token ID673
Feature activation-3.90e+00

Pos logit contributions
 guardian+0.021
 fairy+0.019
 figure+0.019
he+0.019
 answer+0.019
Neg logit contributions
ksh-0.030
ashed-0.029
 rods-0.029
 tanks-0.029
 mouths-0.027
Token'd
Token ID1549
Feature activation+3.10e+00

Pos logit contributions
 guardian+0.227
he+0.220
 figure+0.215
 fairy+0.214
 answer+0.213
Neg logit contributions
ksh-0.239
ashed-0.239
 tanks-0.232
 rods-0.230
 mouths-0.215
Token always
Token ID1464
Feature activation-2.18e+00

Pos logit contributions
ashed+0.204
 tanks+0.200
 rods+0.199
ksh+0.197
 pic+0.194
Neg logit contributions
he-0.158
 guardian-0.152
 note-0.143
 angel-0.143
 owl-0.143
Token of
Token ID286
Feature activation-2.80e+00

Pos logit contributions
he+0.694
 guardian+0.627
 mirror+0.592
iver+0.574
 note+0.569
Neg logit contributions
 tanks-1.015
 rods-0.993
ksh-0.972
ashed-0.932
 tails-0.922
Token the
Token ID262
Feature activation+8.33e-01

Pos logit contributions
 guardian+0.147
he+0.143
 fairy+0.136
 figure+0.136
 note+0.134
Neg logit contributions
 tanks-0.187
 rods-0.187
ksh-0.186
ashed-0.184
 mouths-0.173
Token underground
Token ID11447
Feature activation-2.00e+00

Pos logit contributions
 tanks+0.060
 rods+0.057
 tails+0.052
 trucks+0.052
ksh+0.051
Neg logit contributions
he-0.044
ibrarian-0.038
 guardian-0.037
ad-0.036
iver-0.034
Token.
Token ID13
Feature activation+9.44e-02

Pos logit contributions
 guardian+0.090
 fairy+0.085
 figure+0.084
he+0.083
 answer+0.082
Neg logit contributions
ksh-0.154
ashed-0.152
 rods-0.148
 tanks-0.146
 mouths-0.140
Token The
Token ID383
Feature activation+1.21e-01

Pos logit contributions
ashed+0.006
ksh+0.006
 rods+0.006
 tanks+0.006
 mouths+0.006
Neg logit contributions
 guardian-0.005
 figure-0.005
 fairy-0.005
 answer-0.005
 wizard-0.005
Token guardian
Token ID21688
Feature activation-1.46e+01

Pos logit contributions
 tanks+0.009
 rods+0.009
 trucks+0.008
ashed+0.008
 tails+0.008
Neg logit contributions
he-0.006
iver-0.004
 guardian-0.004
 mirror-0.004
ibrarian-0.004
Token warned
Token ID7728
Feature activation-8.02e-01

Pos logit contributions
he+0.698
 guardian+0.658
 mirror+0.639
 answer+0.630
 note+0.607
Neg logit contributions
 tanks-1.012
ashed-1.009
 rods-0.995
ksh-0.949
 mouths-0.914
Token the
Token ID262
Feature activation+7.70e-01

Pos logit contributions
 guardian+0.041
he+0.039
 fairy+0.039
 figure+0.039
 answer+0.038
Neg logit contributions
ksh-0.056
ashed-0.055
 rods-0.054
 tanks-0.054
 mouths-0.051
Token man
Token ID582
Feature activation-7.49e+00

Pos logit contributions
 tanks+0.056
 rods+0.055
ksh+0.052
ashed+0.051
 sizes+0.050
Neg logit contributions
he-0.038
 guardian-0.031
 mirror-0.031
iver-0.030
ibrarian-0.029
Token never
Token ID1239
Feature activation-4.96e+00

Pos logit contributions
 guardian+0.411
he+0.392
 fairy+0.388
 figure+0.386
 answer+0.381
Neg logit contributions
ksh-0.489
ashed-0.486
 rods-0.478
 tanks-0.476
 mouths-0.445
Token to
Token ID284
Feature activation+2.37e+00

Pos logit contributions
 guardian+0.303
he+0.294
 fairy+0.287
 figure+0.284
 answer+0.280
Neg logit contributions
ashed-0.293
ksh-0.292
 rods-0.286
 tanks-0.286
 mouths-0.263
Token wise
Token ID10787
Feature activation-3.30e+00

Pos logit contributions
he+0.041
 guardian+0.037
mother+0.033
 mirror+0.033
bird+0.033
Neg logit contributions
 tanks-0.065
 rods-0.063
 trucks-0.058
ashed-0.056
 tails-0.055
Token man
Token ID582
Feature activation-7.40e+00

Pos logit contributions
he+0.149
 guardian+0.132
 fairy+0.122
 mirror+0.121
 note+0.117
Neg logit contributions
 tanks-0.239
 rods-0.237
ashed-0.222
 trucks-0.218
 sizes-0.215
Token had
Token ID550
Feature activation-8.52e-02

Pos logit contributions
 guardian+0.420
he+0.414
 answer+0.390
 fairy+0.390
 mirror+0.386
Neg logit contributions
ashed-0.466
 tanks-0.460
 rods-0.457
ksh-0.453
 mouths-0.420
Token shot
Token ID2823
Feature activation-2.36e+00

Pos logit contributions
 guardian+0.005
he+0.005
 fairy+0.004
 figure+0.004
 wizard+0.004
Neg logit contributions
ashed-0.006
 rods-0.006
ksh-0.006
 tanks-0.006
 pic-0.005
Token a
Token ID257
Feature activation-3.59e-01

Pos logit contributions
 guardian+0.110
 fairy+0.103
 figure+0.103
 answer+0.102
he+0.101
Neg logit contributions
ksh-0.176
ashed-0.175
 rods-0.171
 tanks-0.169
 mouths-0.161
Token guardian
Token ID21688
Feature activation-1.45e+01

Pos logit contributions
he+0.019
 guardian+0.018
ibrarian+0.016
 answer+0.016
 mirror+0.016
Neg logit contributions
 tanks-0.024
 rods-0.024
ksh-0.024
ashed-0.023
 mouths-0.022
Token of
Token ID286
Feature activation-2.80e+00

Pos logit contributions
he+0.694
 guardian+0.627
 mirror+0.592
iver+0.574
 note+0.569
Neg logit contributions
 tanks-1.015
 rods-0.993
ksh-0.972
ashed-0.932
 tails-0.922
Token the
Token ID262
Feature activation+8.33e-01

Pos logit contributions
 guardian+0.147
he+0.143
 fairy+0.136
 figure+0.136
 note+0.134
Neg logit contributions
 tanks-0.187
 rods-0.187
ksh-0.186
ashed-0.184
 mouths-0.173
Token underground
Token ID11447
Feature activation-2.00e+00

Pos logit contributions
 tanks+0.060
 rods+0.057
 tails+0.052
 trucks+0.052
ksh+0.051
Neg logit contributions
he-0.044
ibrarian-0.038
 guardian-0.037
ad-0.036
iver-0.034
Token.
Token ID13
Feature activation+9.44e-02

Pos logit contributions
 guardian+0.090
 fairy+0.085
 figure+0.084
he+0.083
 answer+0.082
Neg logit contributions
ksh-0.154
ashed-0.152
 rods-0.148
 tanks-0.146
 mouths-0.140
Token The
Token ID383
Feature activation+1.21e-01

Pos logit contributions
ashed+0.006
ksh+0.006
 rods+0.006
 tanks+0.006
 mouths+0.006
Neg logit contributions
 guardian-0.005
 figure-0.005
 fairy-0.005
 answer-0.005
 wizard-0.005
Token tail
Token ID7894
Feature activation+2.09e+00

Pos logit contributions
he+0.025
ibrarian+0.022
 guardian+0.022
mother+0.022
father+0.021
Neg logit contributions
 mouths-0.028
 rods-0.028
 tails-0.028
 tanks-0.028
ashed-0.026
Token.
Token ID13
Feature activation-6.55e-02

Pos logit contributions
ashed+0.143
ksh+0.139
 rods+0.138
 tanks+0.138
 mouths+0.131
Neg logit contributions
 guardian-0.107
he-0.102
 fairy-0.096
 wizard-0.096
 figure-0.095
Token It
Token ID632
Feature activation-2.13e+00

Pos logit contributions
 guardian+0.003
he+0.003
 fairy+0.003
 figure+0.003
 answer+0.003
Neg logit contributions
ksh-0.005
ashed-0.005
 tanks-0.005
 rods-0.005
 mouths-0.005
Token was
Token ID373
Feature activation+3.07e-01

Pos logit contributions
 guardian+0.121
he+0.117
 figure+0.113
 fairy+0.113
 answer+0.112
Neg logit contributions
ksh-0.135
ashed-0.135
 rods-0.131
 tanks-0.131
 mouths-0.123
Token a
Token ID257
Feature activation-4.68e-01

Pos logit contributions
ksh+0.022
 tanks+0.022
ashed+0.021
 rods+0.021
 mouths+0.020
Neg logit contributions
he-0.015
 guardian-0.015
 figure-0.014
 fairy-0.014
 note-0.014
Token guardian
Token ID21688
Feature activation-1.45e+01

Pos logit contributions
he+0.022
 guardian+0.021
 answer+0.020
 figure+0.019
ibrarian+0.019
Neg logit contributions
 tanks-0.033
 rods-0.033
ksh-0.032
ashed-0.032
 mouths-0.031
Token of
Token ID286
Feature activation-2.75e+00

Pos logit contributions
he+0.549
 guardian+0.508
iver+0.489
 note+0.463
 figure+0.459
Neg logit contributions
 tanks-1.101
ksh-1.082
 rods-1.065
ashed-1.041
 sizes-1.033
Token the
Token ID262
Feature activation+7.17e-01

Pos logit contributions
 guardian+0.140
he+0.132
 fairy+0.131
 figure+0.131
 answer+0.129
Neg logit contributions
ksh-0.193
ashed-0.191
 rods-0.188
 tanks-0.187
 mouths-0.175
Token cave
Token ID11527
Feature activation-3.35e+00

Pos logit contributions
 tanks+0.050
 rods+0.049
 mouths+0.048
 tails+0.047
 trucks+0.047
Neg logit contributions
he-0.035
iver-0.032
ibrarian-0.031
 guardian-0.030
alla-0.030
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
 guardian+0.154
he+0.145
 fairy+0.143
 figure+0.143
 answer+0.141
Neg logit contributions
ksh-0.247
ashed-0.246
 rods-0.242
 tanks-0.241
 mouths-0.228
Token
Token ID198
Feature activation-1.76e+00

Pos logit contributions
 guardian+0.006
 fairy+0.006
 figure+0.006
he+0.006
 answer+0.006
Neg logit contributions
ksh-0.009
ashed-0.008
 rods-0.008
 tanks-0.008
 mouths-0.008
Token from
Token ID422
Feature activation-1.59e+00

Pos logit contributions
he+0.033
 guardian+0.031
ibrarian+0.030
 note+0.030
 wizard+0.030
Neg logit contributions
ksh-0.037
ashed-0.035
 tanks-0.035
 rods-0.034
 tails-0.032
Token behind
Token ID2157
Feature activation-4.15e+00

Pos logit contributions
 guardian+0.095
 note+0.089
he+0.087
 figure+0.085
ibrarian+0.084
Neg logit contributions
 tanks-0.092
 rods-0.089
ksh-0.087
ashed-0.086
 pic-0.084
Token her
Token ID607
Feature activation-4.33e+00

Pos logit contributions
 guardian+0.235
 figure+0.211
 wizard+0.208
he+0.208
 owl+0.207
Neg logit contributions
ashed-0.267
ksh-0.267
 rods-0.262
 tanks-0.261
 mouths-0.250
Token:
Token ID25
Feature activation-3.23e+00

Pos logit contributions
he+0.229
 guardian+0.219
 wizard+0.211
 answer+0.207
orn+0.206
Neg logit contributions
 rods-0.274
 tanks-0.273
 mouths-0.267
ksh-0.262
ashed-0.261
Token A
Token ID317
Feature activation+1.25e+00

Pos logit contributions
he+0.251
 guardian+0.245
 figure+0.236
 answer+0.234
 note+0.233
Neg logit contributions
ksh-0.127
 tanks-0.124
 rods-0.120
ashed-0.117
 mouths-0.107
Token guardian
Token ID21688
Feature activation-1.45e+01

Pos logit contributions
 tanks+0.092
 rods+0.088
ashed+0.084
suits+0.084
ksh+0.083
Neg logit contributions
he-0.055
iver-0.047
 guardian-0.046
ad-0.045
 answer-0.045
Token of
Token ID286
Feature activation-2.57e+00

Pos logit contributions
iver+0.608
he+0.605
 guardian+0.540
 note+0.517
 answer+0.497
Neg logit contributions
 tanks-1.004
 rods-0.955
ashed-0.934
ksh-0.934
 they-0.928
Token the
Token ID262
Feature activation+8.98e-01

Pos logit contributions
 guardian+0.143
he+0.138
 figure+0.134
 note+0.131
 wizard+0.130
Neg logit contributions
 tanks-0.163
 rods-0.161
ashed-0.159
ksh-0.158
 mouths-0.150
Token me
Token ID502
Feature activation-1.97e+00

Pos logit contributions
 tanks+0.059
 rods+0.059
 tails+0.058
 pic+0.056
ashed+0.056
Neg logit contributions
he-0.044
ibrarian-0.040
ad-0.040
 guardian-0.038
iver-0.038
Tokenadow
Token ID4584
Feature activation-4.13e-01

Pos logit contributions
 fairy+0.093
 guardian+0.093
 wizard+0.091
 owl+0.090
 angel+0.087
Neg logit contributions
ashed-0.140
 rods-0.131
ksh-0.131
suits-0.130
 tanks-0.129
Token!
Token ID0
Feature activation-1.99e+00

Pos logit contributions
 guardian+0.021
 answer+0.020
he+0.019
 figure+0.019
 fairy+0.019
Neg logit contributions
ashed-0.028
ksh-0.027
 tanks-0.027
 rods-0.027
 mouths-0.025
TokenThe
Token ID464
Feature activation-5.06e+00

Pos logit contributions
ksh+0.105
ashed+0.104
 rods+0.103
 tanks+0.102
 mouths+0.096
Neg logit contributions
 guardian-0.062
 fairy-0.058
he-0.058
 answer-0.058
 figure-0.057
Token next
Token ID1306
Feature activation-1.36e+00

Pos logit contributions
 guardian+0.231
he+0.230
 figure+0.211
 answer+0.211
 fairy+0.206
Neg logit contributions
 rods-0.365
 tanks-0.364
ksh-0.361
ashed-0.355
 mouths-0.341
Token morning
Token ID3329
Feature activation+1.52e+00

Pos logit contributions
 guardian+0.079
 fairy+0.075
 figure+0.075
he+0.075
 answer+0.074
Neg logit contributions
ksh-0.087
ashed-0.085
 tanks-0.083
 rods-0.083
 mouths-0.077
Token the
Token ID262
Feature activation-5.37e-01

Pos logit contributions
ksh+0.099
ashed+0.099
 rods+0.096
 tanks+0.096
 mouths+0.090
Neg logit contributions
 guardian-0.082
he-0.078
 fairy-0.077
 figure-0.077
 answer-0.076
Token poor
Token ID3595
Feature activation-2.93e+00

Pos logit contributions
 guardian+0.024
he+0.023
 figure+0.022
 answer+0.021
 fairy+0.021
Neg logit contributions
ksh-0.039
 tanks-0.039
 rods-0.039
ashed-0.039
 mouths-0.036
Token wizard
Token ID18731
Feature activation-1.44e+01

Pos logit contributions
 guardian+0.130
he+0.124
 figure+0.119
 fairy+0.119
 answer+0.118
Neg logit contributions
ksh-0.218
ashed-0.218
 rods-0.215
 tanks-0.215
 mouths-0.202
Token decided
Token ID3066
Feature activation+4.80e+00

Pos logit contributions
 guardian+0.764
he+0.724
 fairy+0.712
 figure+0.709
 answer+0.701
Neg logit contributions
ksh-0.964
ashed-0.961
 rods-0.937
 tanks-0.937
 mouths-0.877
Token to
Token ID284
Feature activation+1.90e+00

Pos logit contributions
ksh+0.280
ashed+0.278
 rods+0.272
 tanks+0.270
 mouths+0.250
Neg logit contributions
 guardian-0.297
he-0.285
 fairy-0.282
 figure-0.281
 answer-0.277
Token trust
Token ID3774
Feature activation-5.29e+00

Pos logit contributions
 tanks+0.119
 rods+0.118
 pic+0.115
ashed+0.112
ksh+0.111
Neg logit contributions
 guardian-0.092
he-0.090
 fairy-0.083
 note-0.083
 owl-0.082
Token the
Token ID262
Feature activation+4.00e-01

Pos logit contributions
 guardian+0.256
 fairy+0.242
he+0.241
 figure+0.241
 answer+0.238
Neg logit contributions
ksh-0.386
ashed-0.384
 rods-0.373
 tanks-0.370
 mouths-0.349
Token voice
Token ID3809
Feature activation-9.70e+00

Pos logit contributions
 tanks+0.032
 rods+0.031
ksh+0.030
ashed+0.030
 mouths+0.029
Neg logit contributions
he-0.015
 guardian-0.012
iver-0.012
ibrarian-0.011
ad-0.011
Token caught
Token ID4978
Feature activation-2.85e+00

Pos logit contributions
he+0.224
 guardian+0.224
 answer+0.217
 note+0.209
 wizard+0.209
Neg logit contributions
ashed-0.232
ksh-0.221
 tanks-0.214
 rods-0.211
 mouths-0.205
Token a
Token ID257
Feature activation-2.81e-01

Pos logit contributions
 guardian+0.155
he+0.144
 wizard+0.143
 figure+0.142
iver+0.138
Neg logit contributions
ksh-0.174
 rods-0.169
ashed-0.169
 tanks-0.167
 mouths-0.159
Token glimpse
Token ID19350
Feature activation-7.66e+00

Pos logit contributions
he+0.014
ibrarian+0.013
ad+0.013
 guardian+0.013
 answer+0.012
Neg logit contributions
ksh-0.018
 tanks-0.018
 rods-0.018
ashed-0.017
suits-0.017
Token of
Token ID286
Feature activation-2.58e+00

Pos logit contributions
 guardian+0.324
 figure+0.298
he+0.297
 answer+0.292
 note+0.291
Neg logit contributions
ashed-0.588
ksh-0.581
 rods-0.571
 tanks-0.568
 mouths-0.543
Token her
Token ID607
Feature activation-4.34e+00

Pos logit contributions
 guardian+0.130
he+0.126
 figure+0.121
 note+0.118
 fairy+0.117
Neg logit contributions
 tanks-0.173
 rods-0.173
ksh-0.172
ashed-0.171
 mouths-0.164
Token guardian
Token ID21688
Feature activation-1.44e+01

Pos logit contributions
he+0.203
 guardian+0.191
 answer+0.181
 wizard+0.180
 figure+0.179
Neg logit contributions
ashed-0.294
ksh-0.294
 rods-0.294
 mouths-0.294
 tanks-0.294
Token angel
Token ID18304
Feature activation-1.14e+01

Pos logit contributions
he+0.659
 guardian+0.630
 figure+0.575
 note+0.574
iver+0.574
Neg logit contributions
 tanks-0.960
ksh-0.957
 rods-0.945
 mouths-0.934
ashed-0.933
Token who
Token ID508
Feature activation-9.21e-01

Pos logit contributions
 guardian+0.534
he+0.514
 answer+0.494
 figure+0.490
 fairy+0.490
Neg logit contributions
ksh-0.819
ashed-0.812
 tanks-0.796
 rods-0.796
 mouths-0.758
Token was
Token ID373
Feature activation+4.11e-01

Pos logit contributions
 guardian+0.055
 answer+0.053
he+0.053
 note+0.052
 owl+0.052
Neg logit contributions
ashed-0.049
 tanks-0.046
 sizes-0.044
 rods-0.044
 pic-0.044
Token much
Token ID881
Feature activation+3.06e+00

Pos logit contributions
ashed+0.024
 tanks+0.024
ksh+0.024
 rods+0.024
 pic+0.024
Neg logit contributions
he-0.021
orn-0.019
 guardian-0.019
 figure-0.019
ibrarian-0.019
Token bigger
Token ID5749
Feature activation+5.96e-01

Pos logit contributions
 tanks+0.180
ashed+0.180
ksh+0.180
 rods+0.178
 mouths+0.171
Neg logit contributions
he-0.179
 guardian-0.176
 figure-0.167
ibrarian-0.163
 owl-0.161
Token
Token ID198
Feature activation-1.54e+00

Pos logit contributions
 guardian+0.086
 angel+0.081
 answer+0.080
 figure+0.079
 fairy+0.079
Neg logit contributions
 rods-0.091
ksh-0.089
ashed-0.088
 tanks-0.087
suits-0.083
Token"
Token ID1
Feature activation+2.20e+00

Pos logit contributions
 guardian+0.075
 answer+0.072
 fairy+0.071
he+0.070
 figure+0.070
Neg logit contributions
ksh-0.107
 rods-0.106
ashed-0.105
 tanks-0.103
suits-0.097
TokenI
Token ID40
Feature activation-9.89e-01

Pos logit contributions
ksh+0.136
 rods+0.133
ashed+0.133
 tanks+0.132
 sizes+0.124
Neg logit contributions
 guardian-0.118
 answer-0.118
 figure-0.113
he-0.111
 wizard-0.111
Token'm
Token ID1101
Feature activation-1.42e+00

Pos logit contributions
 guardian+0.056
 owl+0.055
 angel+0.055
he+0.054
 note+0.053
Neg logit contributions
ashed-0.060
 tanks-0.054
ksh-0.053
 rods-0.052
 pic-0.051
Token the
Token ID262
Feature activation+7.32e-01

Pos logit contributions
he+0.079
 guardian+0.071
 note+0.071
 figure+0.069
orn+0.067
Neg logit contributions
 tanks-0.089
ksh-0.086
 rods-0.083
ashed-0.083
 pic-0.082
Token guardian
Token ID21688
Feature activation-1.43e+01

Pos logit contributions
 tanks+0.055
 sizes+0.055
 pic+0.054
 tails+0.054
 rods+0.053
Neg logit contributions
he-0.027
iver-0.019
ad-0.016
on-0.015
orn-0.015
Token of
Token ID286
Feature activation-2.70e+00

Pos logit contributions
he+0.674
 guardian+0.654
 figure+0.651
 note+0.620
iver+0.617
Neg logit contributions
 tanks-0.908
ksh-0.904
 sizes-0.869
ashed-0.865
 rods-0.859
Token the
Token ID262
Feature activation+7.47e-01

Pos logit contributions
 guardian+0.142
he+0.136
 figure+0.133
 fairy+0.132
 answer+0.130
Neg logit contributions
ksh-0.180
ashed-0.178
 tanks-0.178
 rods-0.178
 mouths-0.166
Token apples
Token ID22514
Feature activation+9.20e+00

Pos logit contributions
 tanks+0.052
 rods+0.050
 pic+0.049
 mouths+0.049
 tails+0.047
Neg logit contributions
he-0.034
ibrarian-0.030
iver-0.029
 guardian-0.027
ad-0.027
Token!
Token ID0
Feature activation-2.10e+00

Pos logit contributions
 tanks+0.554
 rods+0.549
ksh+0.548
ashed+0.538
 sizes+0.510
Neg logit contributions
 guardian-0.512
 fairy-0.458
 figure-0.456
 wizard-0.452
 mirror-0.448
Token It
Token ID632
Feature activation-2.03e+00

Pos logit contributions
 guardian+0.130
 fairy+0.124
he+0.124
 figure+0.123
 answer+0.122
Neg logit contributions
ksh-0.123
ashed-0.122
 rods-0.120
 tanks-0.119
 mouths-0.110
Token James
Token ID3700
Feature activation+7.05e-01

Pos logit contributions
 guardian+0.147
 fairy+0.142
 owl+0.139
 figure+0.137
 wizard+0.137
Neg logit contributions
ashed-0.113
ksh-0.111
 rods-0.108
 tanks-0.103
suits-0.099
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
ashed+0.055
ksh+0.055
 rods+0.054
 tanks+0.053
 mouths+0.050
Neg logit contributions
 guardian-0.030
 fairy-0.029
he-0.028
 figure-0.028
 answer-0.028
Token I
Token ID314
Feature activation+1.82e+00

Pos logit contributions
ashed+0.013
 rods+0.013
ksh+0.013
 tanks+0.012
 mouths+0.012
Neg logit contributions
 guardian-0.012
 fairy-0.011
 owl-0.011
 figure-0.011
 wizard-0.011
Token'm
Token ID1101
Feature activation-1.47e+00

Pos logit contributions
ashed+0.118
 tanks+0.108
ksh+0.104
 rods+0.102
suits+0.102
Neg logit contributions
 guardian-0.097
 owl-0.095
 angel-0.094
he-0.093
 note-0.093
Token the
Token ID262
Feature activation+8.68e-01

Pos logit contributions
he+0.082
 note+0.076
 guardian+0.073
 figure+0.072
orn+0.071
Neg logit contributions
 tanks-0.093
ksh-0.090
ashed-0.086
 pic-0.086
 rods-0.086
Token guardian
Token ID21688
Feature activation-1.43e+01

Pos logit contributions
 tanks+0.057
ksh+0.057
 rods+0.054
 sizes+0.053
 pic+0.052
Neg logit contributions
he-0.046
 figure-0.040
 guardian-0.040
 note-0.039
ibrarian-0.039
Token of
Token ID286
Feature activation-2.51e+00

Pos logit contributions
 guardian+0.588
he+0.587
 figure+0.569
 note+0.554
 answer+0.541
Neg logit contributions
ksh-1.057
 tanks-1.050
ashed-1.021
 rods-1.008
 sizes-0.983
Token this
Token ID428
Feature activation-2.32e+00

Pos logit contributions
 guardian+0.128
he+0.121
 figure+0.120
 fairy+0.120
 answer+0.118
Neg logit contributions
ksh-0.174
ashed-0.172
 rods-0.171
 tanks-0.170
 mouths-0.159
Token spring
Token ID6076
Feature activation+1.10e+00

Pos logit contributions
he+0.110
 guardian+0.104
 figure+0.102
 note+0.099
ibrarian+0.098
Neg logit contributions
 tanks-0.161
ksh-0.154
 rods-0.153
ashed-0.150
 sizes-0.147
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
ksh+0.083
ashed+0.081
 rods+0.080
 tanks+0.079
 mouths+0.076
Neg logit contributions
 guardian-0.051
 fairy-0.050
he-0.049
 answer-0.048
 figure-0.048
Token I
Token ID314
Feature activation+1.83e+00

Pos logit contributions
ashed+0.012
 rods+0.011
ksh+0.011
 tanks+0.011
 mouths+0.011
Neg logit contributions
 guardian-0.011
 fairy-0.010
 figure-0.010
 answer-0.010
 owl-0.010
Token than
Token ID621
Feature activation-4.88e-01

Pos logit contributions
ksh+0.279
ashed+0.275
 tanks+0.275
 rods+0.273
 mouths+0.259
Neg logit contributions
 guardian-0.203
he-0.196
 figure-0.189
 fairy-0.188
 wizard-0.185
Token anything
Token ID1997
Feature activation+1.12e+00

Pos logit contributions
 guardian+0.028
he+0.026
 fairy+0.026
 figure+0.026
 answer+0.026
Neg logit contributions
ksh-0.031
ashed-0.031
 rods-0.030
 tanks-0.030
 mouths-0.028
Token to
Token ID284
Feature activation+2.71e+00

Pos logit contributions
ksh+0.091
ashed+0.088
 rods+0.088
 tanks+0.087
 mouths+0.082
Neg logit contributions
 guardian-0.047
 fairy-0.045
he-0.044
 figure-0.044
 answer-0.044
Token have
Token ID423
Feature activation-4.37e+00

Pos logit contributions
 pic+0.143
 tails+0.142
 tanks+0.139
 rods+0.135
girls+0.135
Neg logit contributions
ibrarian-0.155
 guardian-0.153
 wizard-0.142
mother-0.140
ocket-0.139
Token her
Token ID607
Feature activation-4.29e+00

Pos logit contributions
 guardian+0.189
he+0.178
 fairy+0.178
 figure+0.176
 answer+0.176
Neg logit contributions
ksh-0.342
ashed-0.335
 rods-0.331
 tanks-0.328
 mouths-0.311
Token guardian
Token ID21688
Feature activation-1.42e+01

Pos logit contributions
 guardian+0.225
he+0.220
 fairy+0.212
 figure+0.211
 wizard+0.209
Neg logit contributions
ksh-0.283
ashed-0.282
 rods-0.278
 tanks-0.278
 mouths-0.264
Token angel
Token ID18304
Feature activation-1.12e+01

Pos logit contributions
he+0.734
 guardian+0.724
 figure+0.683
 wizard+0.666
 note+0.663
Neg logit contributions
 tanks-0.886
ksh-0.876
ashed-0.861
 rods-0.860
 mouths-0.840
Token nearby
Token ID6716
Feature activation-1.78e+00

Pos logit contributions
 guardian+0.538
he+0.519
 figure+0.502
 fairy+0.499
 answer+0.494
Neg logit contributions
ksh-0.798
ashed-0.794
 tanks-0.781
 rods-0.778
 mouths-0.737
Token to
Token ID284
Feature activation+2.69e+00

Pos logit contributions
 guardian+0.074
he+0.070
 fairy+0.069
 figure+0.069
 answer+0.069
Neg logit contributions
ksh-0.143
ashed-0.141
 rods-0.139
 tanks-0.137
 mouths-0.130
Token make
Token ID787
Feature activation-2.07e+00

Pos logit contributions
 pic+0.151
 tanks+0.142
 tails+0.139
 rods+0.137
 games+0.135
Neg logit contributions
 guardian-0.145
ibrarian-0.137
 wizard-0.131
he-0.130
 mirror-0.128
Token her
Token ID607
Feature activation-4.29e+00

Pos logit contributions
 guardian+0.093
he+0.079
 figure+0.078
 wizard+0.077
 fairy+0.075
Neg logit contributions
ksh-0.152
 tanks-0.150
 rods-0.149
ashed-0.147
 pic-0.147
Token One
Token ID1881
Feature activation+1.70e+00

Pos logit contributions
 guardian+0.033
 figure+0.031
 fairy+0.031
he+0.030
 note+0.030
Neg logit contributions
ksh-0.031
ashed-0.030
 tanks-0.030
 rods-0.030
 mouths-0.028
Token day
Token ID1110
Feature activation+2.16e+00

Pos logit contributions
ashed+0.070
ksh+0.064
 tanks+0.063
 rods+0.062
 pic+0.058
Neg logit contributions
 guardian-0.132
 wizard-0.127
 fairy-0.123
ibrarian-0.122
he-0.122
Token,
Token ID11
Feature activation+1.14e+00

Pos logit contributions
ksh+0.174
ashed+0.163
 rods+0.162
 tanks+0.161
 sizes+0.152
Neg logit contributions
 guardian-0.095
 fairy-0.094
 figure-0.092
 answer-0.091
he-0.091
Token a
Token ID257
Feature activation-1.01e+00

Pos logit contributions
ashed+0.087
ksh+0.086
 rods+0.085
 tanks+0.085
 mouths+0.082
Neg logit contributions
 guardian-0.050
he-0.045
 figure-0.044
 fairy-0.044
 answer-0.044
Token grand
Token ID4490
Feature activation+2.46e+00

Pos logit contributions
 guardian+0.047
he+0.045
 figure+0.043
 fairy+0.043
 answer+0.043
Neg logit contributions
ksh-0.074
ashed-0.074
 rods-0.073
 tanks-0.073
 mouths-0.068
Token wizard
Token ID18731
Feature activation-1.42e+01

Pos logit contributions
ksh+0.158
 pic+0.154
 tanks+0.154
 trucks+0.153
 rods+0.149
Neg logit contributions
he-0.112
 guardian-0.110
She-0.101
 figure-0.098
 note-0.096
Token came
Token ID1625
Feature activation-1.61e+00

Pos logit contributions
 guardian+0.796
he+0.763
 answer+0.736
 fairy+0.729
 note+0.726
Neg logit contributions
ashed-0.916
 tanks-0.894
ksh-0.888
 rods-0.885
 mouths-0.812
Token to
Token ID284
Feature activation+1.73e+00

Pos logit contributions
 guardian+0.116
he+0.115
ibrarian+0.113
 fairy+0.111
ocket+0.110
Neg logit contributions
 tanks-0.074
ksh-0.070
ashed-0.068
 rods-0.067
 trucks-0.064
Token the
Token ID262
Feature activation+1.60e-01

Pos logit contributions
 tanks+0.106
 pic+0.103
 rods+0.103
 trucks+0.098
ksh+0.096
Neg logit contributions
 guardian-0.095
he-0.089
ibrarian-0.086
orn-0.082
 note-0.082
Token village
Token ID7404
Feature activation-1.39e+00

Pos logit contributions
 tanks+0.011
 rods+0.011
ksh+0.011
 pic+0.010
ashed+0.010
Neg logit contributions
he-0.007
 guardian-0.007
ibrarian-0.006
ad-0.006
 answer-0.006
Token.
Token ID13
Feature activation-5.81e-01

Pos logit contributions
 guardian+0.074
 fairy+0.069
he+0.068
 answer+0.067
 figure+0.066
Neg logit contributions
 tanks-0.092
 rods-0.091
ashed-0.090
ksh-0.090
 mouths-0.083
Token around
Token ID1088
Feature activation-4.81e+00

Pos logit contributions
 answer+0.047
he+0.047
 guardian+0.046
 figure+0.044
 mirror+0.043
Neg logit contributions
ksh-0.101
 rods-0.098
ashed-0.095
girls-0.094
 sizes-0.092
Token and
Token ID290
Feature activation-6.82e-01

Pos logit contributions
 answer+0.196
 guardian+0.191
 figure+0.188
he+0.188
 fairy+0.186
Neg logit contributions
ksh-0.397
ashed-0.392
 rods-0.370
 tanks-0.367
 mouths-0.354
Token saw
Token ID2497
Feature activation-5.15e+00

Pos logit contributions
 figure+0.020
he+0.020
 guardian+0.019
 fairy+0.018
 answer+0.018
Neg logit contributions
ksh-0.062
 rods-0.062
ashed-0.060
 tanks-0.060
 pic-0.057
Token many
Token ID867
Feature activation+2.30e+00

Pos logit contributions
 guardian+0.220
 fairy+0.218
 owl+0.215
 answer+0.215
he+0.215
Neg logit contributions
ksh-0.408
ashed-0.394
 rods-0.361
suits-0.354
 tanks-0.354
Token matt
Token ID23963
Feature activation+1.39e+01

Pos logit contributions
ksh+0.144
ashed+0.142
 rods+0.133
 tanks+0.127
 mouths+0.123
Neg logit contributions
 guardian-0.135
 answer-0.135
he-0.135
 fairy-0.134
 figure-0.134
Tokenresses
Token ID16746
Feature activation+9.78e+00

Pos logit contributions
ksh+0.419
ashed+0.409
 rods+0.385
 tanks+0.379
 mouths+0.336
Neg logit contributions
 guardian-1.264
he-1.241
 figure-1.225
 fairy-1.208
 answer-1.201
Token.
Token ID13
Feature activation-2.49e-01

Pos logit contributions
ksh+0.770
ashed+0.748
 rods+0.739
 tanks+0.733
 mouths+0.702
Neg logit contributions
 guardian-0.417
he-0.407
 figure-0.399
 answer-0.394
 fairy-0.391
Token Some
Token ID2773
Feature activation+1.17e+00

Pos logit contributions
he+0.009
 voice+0.009
 figure+0.009
 mirror+0.008
 answer+0.008
Neg logit contributions
ashed-0.020
ksh-0.019
 rods-0.019
 tails-0.019
 sizes-0.019
Token were
Token ID547
Feature activation+3.96e+00

Pos logit contributions
ksh+0.094
 rods+0.084
ashed+0.084
 appet+0.078
girls+0.077
Neg logit contributions
he-0.054
 answer-0.054
 fairy-0.052
 figure-0.051
 voice-0.050
Token soft
Token ID2705
Feature activation+1.42e+00

Pos logit contributions
ksh+0.315
ashed+0.305
 rods+0.293
 tanks+0.284
 mouths+0.276
Neg logit contributions
 guardian-0.173
 answer-0.166
 figure-0.164
 fairy-0.161
he-0.158
Token,
Token ID11
Feature activation+1.43e+00

Pos logit contributions
ksh+0.133
ashed+0.131
 rods+0.128
 pic+0.119
 tanks+0.118
Neg logit contributions
 fairy-0.039
 answer-0.037
 mirror-0.037
 owl-0.035
 voice-0.035
Token Lily
Token ID20037
Feature activation+4.79e+00

Pos logit contributions
ksh+0.128
suits+0.120
 rods+0.119
ashed+0.115
 tanks+0.108
Neg logit contributions
 fairy-0.100
he-0.100
 figure-0.093
 answer-0.092
 wizard-0.089
Token and
Token ID290
Feature activation-3.75e-01

Pos logit contributions
ksh+0.325
ashed+0.323
 rods+0.316
 tanks+0.315
 mouths+0.295
Neg logit contributions
 guardian-0.252
he-0.239
 fairy-0.237
 figure-0.236
 answer-0.234
Token her
Token ID607
Feature activation-4.62e+00

Pos logit contributions
 guardian+0.016
he+0.015
 fairy+0.015
 figure+0.015
 answer+0.015
Neg logit contributions
ksh-0.029
ashed-0.029
 rods-0.029
 tanks-0.028
 mouths-0.027
Token brother
Token ID3956
Feature activation+6.98e-01

Pos logit contributions
 guardian+0.232
 fairy+0.220
 figure+0.218
he+0.217
 answer+0.216
Neg logit contributions
ksh-0.331
ashed-0.328
 rods-0.322
 tanks-0.317
 mouths-0.299
Token played
Token ID2826
Feature activation+8.77e+00

Pos logit contributions
ksh+0.051
 rods+0.046
suits+0.043
ashed+0.042
lights+0.042
Neg logit contributions
 figure-0.038
he-0.036
 owl-0.036
 fairy-0.035
 answer-0.035
Token together
Token ID1978
Feature activation+1.05e+01

Pos logit contributions
ksh+0.411
ashed+0.370
 tanks+0.343
 rods+0.333
 mouths+0.329
Neg logit contributions
he-0.702
 figure-0.689
 answer-0.671
 note-0.669
 fairy-0.666
Token every
Token ID790
Feature activation+5.01e+00

Pos logit contributions
ksh+0.689
ashed+0.645
 tanks+0.625
 rods+0.620
suits+0.588
Neg logit contributions
he-0.616
 figure-0.610
 answer-0.605
 guardian-0.598
 fairy-0.596
Token day
Token ID1110
Feature activation+2.81e+00

Pos logit contributions
ksh+0.342
ashed+0.327
 rods+0.324
 tanks+0.318
 mouths+0.309
Neg logit contributions
he-0.270
 figure-0.269
 guardian-0.269
 fairy-0.266
 answer-0.265
Token and
Token ID290
Feature activation-4.35e-01

Pos logit contributions
ksh+0.244
ashed+0.237
 rods+0.231
 tanks+0.230
suits+0.219
Neg logit contributions
he-0.106
 figure-0.103
 fairy-0.094
 guardian-0.093
 owl-0.090
Token enjoyed
Token ID8359
Feature activation+5.32e+00

Pos logit contributions
 figure+0.023
 fairy+0.022
 wizard+0.022
 owl+0.021
he+0.021
Neg logit contributions
ksh-0.032
suits-0.029
 rods-0.029
ashed-0.028
 tanks-0.027
Token their
Token ID511
Feature activation+6.33e+00

Pos logit contributions
ksh+0.450
ashed+0.429
suits+0.400
 tanks+0.380
 rods+0.376
Neg logit contributions
 figure-0.225
he-0.220
 note-0.215
 fairy-0.207
 wizard-0.207
Token sure
Token ID1654
Feature activation-1.41e+00

Pos logit contributions
 guardian+0.115
he+0.109
 fairy+0.109
 figure+0.108
 answer+0.107
Neg logit contributions
ksh-0.168
ashed-0.166
 rods-0.162
 tanks-0.162
 mouths-0.152
Token it
Token ID340
Feature activation-1.88e+00

Pos logit contributions
 guardian+0.070
 fairy+0.068
 answer+0.067
 figure+0.067
 note+0.066
Neg logit contributions
ksh-0.108
ashed-0.100
 rods-0.096
 tanks-0.096
suits-0.093
Token was
Token ID373
Feature activation+2.88e-01

Pos logit contributions
 figure+0.086
 fairy+0.082
 answer+0.082
 guardian+0.081
he+0.080
Neg logit contributions
ksh-0.143
 rods-0.138
ashed-0.137
 tanks-0.135
 pic-0.132
Token sturdy
Token ID28055
Feature activation+5.97e+00

Pos logit contributions
ksh+0.024
ashed+0.023
 rods+0.023
 tanks+0.022
suits+0.022
Neg logit contributions
 answer-0.012
 guardian-0.011
 fairy-0.011
 figure-0.011
 ring-0.010
Token.
Token ID13
Feature activation-4.80e-02

Pos logit contributions
ashed+0.523
ksh+0.506
 rods+0.481
 mouths+0.474
 tanks+0.466
Neg logit contributions
 guardian-0.222
 figure-0.218
 answer-0.212
 fairy-0.209
 owl-0.205
Token Then
Token ID3244
Feature activation+9.31e+00

Pos logit contributions
 guardian+0.002
he+0.002
 fairy+0.002
 figure+0.002
 answer+0.002
Neg logit contributions
ksh-0.004
 rods-0.004
ashed-0.004
 tanks-0.004
 mouths-0.003
Token,
Token ID11
Feature activation+1.69e+00

Pos logit contributions
ksh+0.847
 rods+0.793
ashed+0.785
 tanks+0.777
suits+0.740
Neg logit contributions
 fairy-0.371
 figure-0.345
 guardian-0.338
 owl-0.328
 mirror-0.327
Token she
Token ID673
Feature activation-3.98e+00

Pos logit contributions
ksh+0.140
ashed+0.137
 rods+0.136
 tanks+0.135
 mouths+0.127
Neg logit contributions
 guardian-0.066
 fairy-0.063
he-0.062
 figure-0.062
 answer-0.060
Token pretended
Token ID36766
Feature activation+6.35e+00

Pos logit contributions
 figure+0.201
 fairy+0.195
 guardian+0.195
 wizard+0.188
he+0.187
Neg logit contributions
ksh-0.288
 rods-0.266
ashed-0.266
 tanks-0.263
 mouths-0.253
Token to
Token ID284
Feature activation+2.32e+00

Pos logit contributions
ashed+0.376
ksh+0.372
 rods+0.369
 tanks+0.367
 mouths+0.342
Neg logit contributions
 guardian-0.387
he-0.369
 figure-0.363
 fairy-0.363
 answer-0.358
Token sail
Token ID14936
Feature activation+2.26e+00

Pos logit contributions
 rods+0.157
 tanks+0.156
ksh+0.155
ashed+0.153
 mouths+0.144
Neg logit contributions
 guardian-0.119
he-0.114
 fairy-0.111
 note-0.108
 wizard-0.108
Token knew
Token ID2993
Feature activation-2.62e+00

Pos logit contributions
 guardian+0.033
he+0.031
 fairy+0.031
 figure+0.031
 answer+0.030
Neg logit contributions
ksh-0.035
ashed-0.034
 rods-0.034
 tanks-0.034
 mouths-0.031
Token he
Token ID339
Feature activation-4.46e-01

Pos logit contributions
 guardian+0.127
 fairy+0.120
 figure+0.118
 answer+0.117
 wizard+0.116
Neg logit contributions
ksh-0.199
ashed-0.194
 rods-0.188
 tanks-0.186
suits-0.178
Token wouldn
Token ID3636
Feature activation+7.28e+00

Pos logit contributions
 guardian+0.029
he+0.028
 fairy+0.028
 figure+0.027
 answer+0.027
Neg logit contributions
ashed-0.025
ksh-0.025
 tanks-0.024
 rods-0.024
 mouths-0.022
Tokenâ
Token ID22940
Feature activation-6.43e-01

Pos logit contributions
 tanks+0.444
ashed+0.443
 they+0.433
 Finn+0.420
ksh+0.415
Neg logit contributions
 guardian-0.359
 mirror-0.348
 note-0.340
 witch-0.338
 wizard-0.337
Token
Token ID26391
Feature activation+1.28e+00

Pos logit contributions
 guardian+0.037
 fairy+0.036
 wizard+0.034
 owl+0.034
 figure+0.034
Neg logit contributions
ksh-0.041
ashed-0.040
 rods-0.038
 tanks-0.037
suits-0.035
Token
Token ID8151
Feature activation+1.11e+01

Pos logit contributions
ksh+0.076
ashed+0.075
 rods+0.074
 tanks+0.073
 mouths+0.068
Neg logit contributions
 guardian-0.078
he-0.075
 figure-0.074
 fairy-0.074
 answer-0.073
Tokent
Token ID83
Feature activation-4.19e+00

Pos logit contributions
ksh+0.751
ashed+0.744
 rods+0.728
 tanks+0.724
 mouths+0.679
Neg logit contributions
 guardian-0.590
 fairy-0.556
he-0.555
 figure-0.554
 answer-0.547
Token get
Token ID651
Feature activation-2.05e+00

Pos logit contributions
 guardian+0.211
he+0.207
ibrarian+0.197
 fairy+0.192
 wizard+0.190
Neg logit contributions
ashed-0.274
 tanks-0.269
 sizes-0.257
ksh-0.254
 rods-0.253
Token it
Token ID340
Feature activation-1.60e+00

Pos logit contributions
 guardian+0.103
he+0.101
 figure+0.096
 fairy+0.094
 wizard+0.092
Neg logit contributions
ashed-0.144
ksh-0.141
 tanks-0.141
 rods-0.139
 mouths-0.133
Token back
Token ID736
Feature activation-1.25e+00

Pos logit contributions
 guardian+0.067
he+0.064
 fairy+0.063
 answer+0.063
 figure+0.062
Neg logit contributions
ksh-0.128
ashed-0.125
 rods-0.125
 tanks-0.123
 mouths-0.118
Token again
Token ID757
Feature activation+3.62e+00

Pos logit contributions
 guardian+0.061
he+0.058
 fairy+0.057
 figure+0.057
 answer+0.056
Neg logit contributions
ksh-0.091
ashed-0.090
 rods-0.088
 tanks-0.087
 mouths-0.083
Token they
Token ID484
Feature activation+7.59e+00

Pos logit contributions
 guardian+0.054
 fairy+0.052
 figure+0.051
 answer+0.050
 wizard+0.050
Neg logit contributions
ksh-0.063
ashed-0.061
 rods-0.060
 tanks-0.059
suits-0.055
Token both
Token ID1111
Feature activation+2.51e+00

Pos logit contributions
ksh+0.446
ashed+0.439
 rods+0.431
 tanks+0.427
 mouths+0.396
Neg logit contributions
 guardian-0.473
he-0.452
 fairy-0.450
 figure-0.449
 answer-0.444
Token decided
Token ID3066
Feature activation+5.34e+00

Pos logit contributions
ksh+0.156
ashed+0.153
 rods+0.152
 tanks+0.150
 mouths+0.139
Neg logit contributions
 guardian-0.147
he-0.141
 fairy-0.140
 figure-0.139
 answer-0.138
Token to
Token ID284
Feature activation+2.48e+00

Pos logit contributions
ksh+0.434
ashed+0.412
 rods+0.398
 tanks+0.390
suits+0.380
Neg logit contributions
 answer-0.241
 note-0.239
 figure-0.230
 guardian-0.228
 owl-0.226
Token measure
Token ID3953
Feature activation-5.44e+00

Pos logit contributions
ashed+0.150
 tanks+0.150
 rods+0.149
ksh+0.144
 mouths+0.144
Neg logit contributions
 guardian-0.135
he-0.129
 wizard-0.125
 fairy-0.122
 owl-0.121
Token together
Token ID1978
Feature activation+1.06e+01

Pos logit contributions
 answer+0.275
he+0.271
 figure+0.268
 guardian+0.264
 note+0.261
Neg logit contributions
ashed-0.389
ksh-0.374
 tanks-0.336
suits-0.330
 rods-0.330
Token.
Token ID13
Feature activation-6.87e-02

Pos logit contributions
ksh+0.926
ashed+0.915
 tanks+0.877
 rods+0.868
 pic+0.834
Neg logit contributions
he-0.374
 answer-0.374
 guardian-0.368
 figure-0.364
 note-0.349
Token They
Token ID1119
Feature activation+7.37e+00

Pos logit contributions
 guardian+0.003
 wizard+0.003
he+0.003
 fairy+0.003
 voice+0.003
Neg logit contributions
ksh-0.005
ashed-0.005
 rods-0.005
 tanks-0.005
 sizes-0.005
Token took
Token ID1718
Feature activation+6.82e-01

Pos logit contributions
ksh+0.455
ashed+0.449
 rods+0.440
 tanks+0.437
 mouths+0.407
Neg logit contributions
 guardian-0.438
he-0.419
 fairy-0.416
 figure-0.415
 answer-0.410
Token turns
Token ID4962
Feature activation+7.95e+00

Pos logit contributions
ksh+0.046
ashed+0.044
 sizes+0.042
 appet+0.041
 tanks+0.041
Neg logit contributions
he-0.035
 answer-0.035
 figure-0.033
 fairy-0.033
 note-0.032
Token and
Token ID290
Feature activation-6.02e-01

Pos logit contributions
ksh+0.514
ashed+0.511
 rods+0.502
 tanks+0.498
 mouths+0.465
Neg logit contributions
 guardian-0.446
he-0.421
 fairy-0.421
 figure-0.418
 answer-0.412
Token race
Token ID3234
Feature activation+5.19e+00

Pos logit contributions
 guardian+0.016
he+0.015
 figure+0.015
 answer+0.015
 fairy+0.015
Neg logit contributions
ksh-0.024
ashed-0.024
 rods-0.023
 tanks-0.023
 mouths-0.022
Token to
Token ID284
Feature activation+2.71e+00

Pos logit contributions
ksh+0.415
ashed+0.406
 tanks+0.385
 rods+0.376
 mouths+0.373
Neg logit contributions
he-0.217
 answer-0.215
 guardian-0.214
 figure-0.213
 note-0.204
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
ksh+0.192
ashed+0.190
 rods+0.186
 tanks+0.185
 mouths+0.174
Neg logit contributions
 guardian-0.137
he-0.130
 fairy-0.129
 figure-0.128
 answer-0.127
Token slide
Token ID10649
Feature activation-1.28e-01

Pos logit contributions
 rods+0.070
 tanks+0.068
ksh+0.068
ashed+0.068
 mouths+0.064
Neg logit contributions
 guardian-0.057
 fairy-0.053
 wizard-0.053
he-0.052
 figure-0.052
Token!"
Token ID2474
Feature activation-2.17e-01

Pos logit contributions
he+0.005
 answer+0.004
 figure+0.004
 guardian+0.004
 note+0.004
Neg logit contributions
ksh-0.011
ashed-0.011
 tanks-0.010
 rods-0.010
 mouths-0.010
Token Max
Token ID5436
Feature activation+4.48e+00

Pos logit contributions
 voice+0.013
 figure+0.013
 fairy+0.013
 answer+0.012
 owl+0.012
Neg logit contributions
ksh-0.014
ashed-0.013
 rods-0.012
 tanks-0.012
 sizes-0.012
Token says
Token ID1139
Feature activation-3.85e+00

Pos logit contributions
ksh+0.368
ashed+0.366
 rods+0.357
 tanks+0.354
 mouths+0.338
Neg logit contributions
 guardian-0.170
 answer-0.168
 figure-0.163
 fairy-0.159
he-0.156
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
 answer+0.123
 figure+0.115
 voice+0.112
 guardian+0.111
 fairy+0.108
Neg logit contributions
ksh-0.349
 rods-0.343
 tanks-0.341
 pic-0.337
ashed-0.335
Token He
Token ID679
Feature activation+3.56e-01

Pos logit contributions
ksh+0.023
ashed+0.022
 rods+0.022
 sizes+0.021
 pic+0.021
Neg logit contributions
 voice-0.014
 fairy-0.013
 answer-0.013
 figure-0.013
 guardian-0.013
Token is
Token ID318
Feature activation-2.03e+00

Pos logit contributions
ksh+0.024
ashed+0.022
 tanks+0.022
 rods+0.022
 mouths+0.020
Neg logit contributions
 answer-0.019
 guardian-0.019
 figure-0.019
 fairy-0.018
he-0.018
Token competitive
Token ID7606
Feature activation+5.28e+00

Pos logit contributions
 guardian+0.090
 answer+0.084
 figure+0.083
he+0.082
 fairy+0.082
Neg logit contributions
ksh-0.158
ashed-0.155
 rods-0.153
 tanks-0.152
 mouths-0.145
TokenOh
Token ID5812
Feature activation-1.80e+00

Pos logit contributions
ksh+0.322
ashed+0.317
 rods+0.311
 tanks+0.306
 mouths+0.288
Neg logit contributions
 guardian-0.301
 fairy-0.284
he-0.284
 figure-0.282
 answer-0.282
Token,
Token ID11
Feature activation+1.28e+00

Pos logit contributions
 guardian+0.075
 fairy+0.072
 answer+0.070
 note+0.069
 figure+0.068
Neg logit contributions
ksh-0.148
ashed-0.147
 tanks-0.142
 rods-0.140
suits-0.133
Token I
Token ID314
Feature activation+1.39e+00

Pos logit contributions
ksh+0.079
ashed+0.078
 rods+0.077
 tanks+0.076
 mouths+0.071
Neg logit contributions
 guardian-0.076
he-0.072
 fairy-0.072
 figure-0.071
 answer-0.071
Token see
Token ID766
Feature activation-7.06e+00

Pos logit contributions
ashed+0.086
 rods+0.081
ksh+0.080
 tanks+0.078
suits+0.077
Neg logit contributions
 guardian-0.084
 angel-0.081
 owl-0.080
 fairy-0.079
 wizard-0.077
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
 guardian+0.269
 fairy+0.263
 answer+0.260
 figure+0.258
he+0.256
Neg logit contributions
ashed-0.584
ksh-0.579
 tanks-0.561
 rods-0.559
suits-0.536
Token I
Token ID314
Feature activation+1.37e+00

Pos logit contributions
 guardian+0.014
 fairy+0.014
 figure+0.013
 answer+0.013
he+0.013
Neg logit contributions
ashed-0.016
ksh-0.016
 tanks-0.016
 rods-0.016
suits-0.015
Token am
Token ID716
Feature activation-4.10e+00

Pos logit contributions
ashed+0.086
ksh+0.079
 rods+0.078
 tanks+0.076
suits+0.073
Neg logit contributions
 guardian-0.085
 owl-0.081
 angel-0.080
 wizard-0.078
 fairy-0.077
Token sorry
Token ID7926
Feature activation-1.89e+00

Pos logit contributions
he+0.240
 guardian+0.233
 fairy+0.217
 figure+0.216
 wizard+0.210
Neg logit contributions
 rods-0.249
 tanks-0.248
ksh-0.248
ashed-0.246
 mouths-0.242
Token.
Token ID13
Feature activation-2.93e-01

Pos logit contributions
 guardian+0.077
 fairy+0.076
 answer+0.075
 figure+0.074
 owl+0.072
Neg logit contributions
ashed-0.155
ksh-0.154
 rods-0.149
 tanks-0.146
suits-0.141
Token I
Token ID314
Feature activation+1.36e+00

Pos logit contributions
 guardian+0.016
 fairy+0.015
 figure+0.015
 answer+0.015
 owl+0.015
Neg logit contributions
ashed-0.020
 rods-0.020
ksh-0.020
 tanks-0.020
suits-0.018
Token did
Token ID750
Feature activation+4.55e+00

Pos logit contributions
ashed+0.083
ksh+0.080
 rods+0.077
 tanks+0.076
 mouths+0.072
Neg logit contributions
 guardian-0.085
he-0.079
 owl-0.079
 fairy-0.079
 wizard-0.079
Token a
Token ID257
Feature activation-6.93e-01

Pos logit contributions
 guardian+0.017
 fairy+0.016
 figure+0.016
he+0.016
 answer+0.015
Neg logit contributions
ksh-0.030
ashed-0.029
 rods-0.029
 tanks-0.029
 mouths-0.027
Token big
Token ID1263
Feature activation-3.31e+00

Pos logit contributions
he+0.035
 guardian+0.032
 fairy+0.030
 answer+0.030
 figure+0.030
Neg logit contributions
 rods-0.049
 tanks-0.049
ashed-0.048
ksh-0.047
 mouths-0.046
Token gun
Token ID2485
Feature activation-2.28e+00

Pos logit contributions
he+0.178
 guardian+0.173
 fairy+0.162
 figure+0.158
 wizard+0.157
Neg logit contributions
 tanks-0.220
 rods-0.220
ashed-0.216
ksh-0.210
 mouths-0.207
Token and
Token ID290
Feature activation-7.31e-01

Pos logit contributions
 guardian+0.113
he+0.107
 fairy+0.105
 figure+0.103
 answer+0.102
Neg logit contributions
ashed-0.161
 tanks-0.158
ksh-0.158
 rods-0.157
 mouths-0.147
Token wanted
Token ID2227
Feature activation+2.27e+00

Pos logit contributions
 fairy+0.036
 figure+0.035
 owl+0.035
 guardian+0.035
 wizard+0.034
Neg logit contributions
ksh-0.057
ashed-0.052
 rods-0.050
 tanks-0.049
suits-0.048
Token to
Token ID284
Feature activation+2.07e+00

Pos logit contributions
ksh+0.169
ashed+0.160
 rods+0.146
suits+0.145
 tanks+0.145
Neg logit contributions
 guardian-0.117
 answer-0.115
 owl-0.114
 fairy-0.114
 figure-0.112
Token catch
Token ID4929
Feature activation-6.80e-01

Pos logit contributions
 tanks+0.120
 rods+0.119
ashed+0.111
 tails+0.105
 pic+0.104
Neg logit contributions
he-0.128
 guardian-0.125
 mirror-0.121
ibrarian-0.119
 wizard-0.119
Token some
Token ID617
Feature activation+2.29e+00

Pos logit contributions
 guardian+0.041
he+0.040
 mirror+0.037
 wizard+0.037
 fairy+0.037
Neg logit contributions
 tanks-0.042
 rods-0.042
ksh-0.040
ashed-0.040
 tails-0.037
Token animals
Token ID4695
Feature activation+2.12e+00

Pos logit contributions
ksh+0.176
ashed+0.168
 rods+0.153
suits+0.151
 mouths+0.150
Neg logit contributions
 answer-0.112
 figure-0.108
 fairy-0.108
 owl-0.105
 guardian-0.105
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
ksh+0.161
ashed+0.159
 rods+0.156
 tanks+0.155
 mouths+0.147
Neg logit contributions
 guardian-0.096
he-0.090
 fairy-0.090
 figure-0.089
 answer-0.088
Token
Token ID198
Feature activation-1.83e+00

Pos logit contributions
he+0.007
 voice+0.007
 figure+0.006
 owl+0.006
 wizard+0.006
Neg logit contributions
ksh-0.018
ashed-0.018
 rods-0.017
 sizes-0.017
 tails-0.017
Token day
Token ID1110
Feature activation+2.89e+00

Pos logit contributions
ashed+0.026
ksh+0.026
 tanks+0.025
 rods+0.025
 pic+0.022
Neg logit contributions
 guardian-0.037
 wizard-0.035
 figure-0.034
he-0.034
 fairy-0.034
Token,
Token ID11
Feature activation+1.90e+00

Pos logit contributions
ksh+0.278
 rods+0.253
 tanks+0.251
 sizes+0.250
girls+0.242
Neg logit contributions
 fairy-0.094
he-0.088
 figure-0.086
 mirror-0.086
 answer-0.085
Token Lily
Token ID20037
Feature activation+4.91e+00

Pos logit contributions
ksh+0.122
 rods+0.115
 tanks+0.112
ashed+0.107
 sizes+0.105
Neg logit contributions
 fairy-0.119
he-0.115
 figure-0.113
 answer-0.111
 mirror-0.110
Token played
Token ID2826
Feature activation+8.64e+00

Pos logit contributions
ashed+0.327
 tanks+0.311
ksh+0.310
 rods+0.305
 pic+0.290
Neg logit contributions
 guardian-0.276
he-0.255
 answer-0.249
 fairy-0.248
 wizard-0.248
Token a
Token ID257
Feature activation-2.57e-01

Pos logit contributions
ashed+0.499
 tanks+0.497
 rods+0.492
ksh+0.475
 pic+0.460
Neg logit contributions
 guardian-0.532
 wizard-0.454
 fairy-0.449
he-0.445
 answer-0.443
Token game
Token ID983
Feature activation+6.73e+00

Pos logit contributions
 guardian+0.019
he+0.017
 answer+0.017
 owl+0.017
 wizard+0.017
Neg logit contributions
ksh-0.012
ashed-0.012
 rods-0.012
 tanks-0.012
 pic-0.011
Token with
Token ID351
Feature activation-2.77e+00

Pos logit contributions
ashed+0.469
 rods+0.460
ksh+0.460
 tanks+0.456
 mouths+0.422
Neg logit contributions
 guardian-0.343
 fairy-0.317
he-0.315
 figure-0.311
 answer-0.309
Token her
Token ID607
Feature activation-4.46e+00

Pos logit contributions
 guardian+0.116
 wizard+0.101
he+0.099
 figure+0.097
 answer+0.094
Neg logit contributions
ashed-0.221
 rods-0.218
 tanks-0.217
 mouths-0.209
ksh-0.208
Token friends
Token ID2460
Feature activation+1.75e+00

Pos logit contributions
 guardian+0.262
he+0.251
 figure+0.243
 wizard+0.242
 answer+0.242
Neg logit contributions
ksh-0.268
 tanks-0.267
ashed-0.266
 rods-0.266
 mouths-0.252
Token.
Token ID13
Feature activation+3.79e-01

Pos logit contributions
ksh+0.151
 rods+0.143
ashed+0.142
 tanks+0.139
 mouths+0.139
Neg logit contributions
 figure-0.067
 answer-0.064
he-0.063
 owl-0.062
 fairy-0.061
Token She
Token ID1375
Feature activation-4.32e+00

Pos logit contributions
 rods+0.033
ksh+0.033
 pic+0.031
 topp+0.031
 tanks+0.031
Neg logit contributions
 answer-0.012
 voice-0.012
 figure-0.012
 fairy-0.011
 mirror-0.011
Token
Token ID198
Feature activation-1.40e+00

Pos logit contributions
ksh+0.319
ashed+0.317
 rods+0.312
 tanks+0.310
 mouths+0.293
Neg logit contributions
 guardian-0.216
he-0.202
 fairy-0.200
 figure-0.200
 answer-0.197
Token
Token ID198
Feature activation-1.42e+00

Pos logit contributions
 guardian+0.103
 angel+0.094
 fairy+0.089
 wizard+0.089
 figure+0.089
Neg logit contributions
 rods-0.075
 tanks-0.069
 They-0.066
 Finn-0.065
 pic-0.065
TokenBen
Token ID11696
Feature activation+1.21e+00

Pos logit contributions
 guardian+0.102
 angel+0.096
 answer+0.096
 fairy+0.095
 figure+0.093
Neg logit contributions
 rods-0.074
 tanks-0.067
ashed-0.066
ksh-0.065
 tails-0.063
Token says
Token ID1139
Feature activation-3.93e+00

Pos logit contributions
ksh+0.084
ashed+0.083
 rods+0.082
 tanks+0.081
 mouths+0.076
Neg logit contributions
 guardian-0.063
he-0.059
 fairy-0.059
 figure-0.058
 answer-0.057
Token,
Token ID11
Feature activation+1.65e+00

Pos logit contributions
 fairy+0.145
 answer+0.144
 voice+0.141
 note+0.141
 guardian+0.135
Neg logit contributions
ksh-0.357
 tanks-0.328
 rods-0.326
 pic-0.323
ashed-0.323
Token "
Token ID366
Feature activation+5.48e+00

Pos logit contributions
ksh+0.106
ashed+0.104
 rods+0.103
 tanks+0.100
 sizes+0.096
Neg logit contributions
 fairy-0.097
 guardian-0.096
 wizard-0.095
 answer-0.094
 figure-0.094
TokenOK
Token ID11380
Feature activation+6.40e+00

Pos logit contributions
ksh+0.354
ashed+0.350
 rods+0.343
 tanks+0.341
 mouths+0.318
Neg logit contributions
 guardian-0.314
he-0.299
 fairy-0.297
 figure-0.296
 answer-0.293
Token,
Token ID11
Feature activation+1.66e+00

Pos logit contributions
ksh+0.525
ashed+0.519
 rods+0.505
 tanks+0.499
 pic+0.468
Neg logit contributions
 answer-0.269
 guardian-0.269
 fairy-0.263
 figure-0.260
 voice-0.257
Token Lily
Token ID20037
Feature activation+4.90e+00

Pos logit contributions
ksh+0.097
ashed+0.096
 rods+0.092
 tanks+0.091
suits+0.086
Neg logit contributions
 guardian-0.106
 fairy-0.106
 answer-0.104
 figure-0.103
 wizard-0.101
Token.
Token ID13
Feature activation+2.97e-01

Pos logit contributions
ksh+0.435
ashed+0.429
 rods+0.421
 tanks+0.415
 pic+0.397
Neg logit contributions
 guardian-0.170
 fairy-0.166
 answer-0.165
 figure-0.162
 owl-0.159
Token I
Token ID314
Feature activation+1.87e+00

Pos logit contributions
ashed+0.023
 rods+0.023
ksh+0.023
 tanks+0.022
 pic+0.021
Neg logit contributions
 guardian-0.014
 fairy-0.013
 answer-0.012
 owl-0.012
 figure-0.012
Token"
Token ID1
Feature activation+2.04e+00

Pos logit contributions
 guardian+0.074
he+0.071
 fairy+0.068
 figure+0.068
 answer+0.066
Neg logit contributions
ksh-0.134
ashed-0.133
 tanks-0.130
 rods-0.130
 mouths-0.126
TokenThank
Token ID10449
Feature activation+1.23e+00

Pos logit contributions
ksh+0.125
ashed+0.124
 rods+0.121
 tanks+0.121
 mouths+0.113
Neg logit contributions
 guardian-0.123
he-0.117
 fairy-0.117
 figure-0.116
 answer-0.115
Token you
Token ID345
Feature activation+1.91e+00

Pos logit contributions
ksh+0.079
ashed+0.077
 rods+0.077
 tanks+0.076
 mouths+0.071
Neg logit contributions
 guardian-0.069
he-0.065
 figure-0.065
 fairy-0.065
 answer-0.064
Token,
Token ID11
Feature activation+1.58e+00

Pos logit contributions
ashed+0.137
ksh+0.136
 rods+0.133
 tanks+0.131
 mouths+0.125
Neg logit contributions
 guardian-0.096
 fairy-0.091
he-0.090
 answer-0.088
 figure-0.088
Token Mouse
Token ID21839
Feature activation+7.01e-01

Pos logit contributions
ksh+0.079
 tanks+0.077
 rods+0.077
ashed+0.076
 pic+0.070
Neg logit contributions
 guardian-0.109
he-0.106
 figure-0.104
 fairy-0.104
 answer-0.104
Token!"
Token ID2474
Feature activation-6.11e-01

Pos logit contributions
ksh+0.051
ashed+0.051
 rods+0.050
 tanks+0.050
 mouths+0.047
Neg logit contributions
 guardian-0.033
he-0.031
 fairy-0.031
 figure-0.031
 answer-0.030
Token Bob
Token ID5811
Feature activation+4.51e+00

Pos logit contributions
 guardian+0.034
he+0.033
 fairy+0.032
 figure+0.032
 answer+0.032
Neg logit contributions
ksh-0.040
ashed-0.039
 rods-0.039
 tanks-0.038
 mouths-0.036
Token said
Token ID531
Feature activation-3.04e+00

Pos logit contributions
ashed+0.318
ksh+0.318
 rods+0.311
 tanks+0.309
 mouths+0.290
Neg logit contributions
 guardian-0.225
he-0.213
 answer-0.207
 fairy-0.207
 figure-0.207
Token with
Token ID351
Feature activation-2.99e+00

Pos logit contributions
 guardian+0.138
he+0.130
 fairy+0.129
 figure+0.129
 answer+0.127
Neg logit contributions
ksh-0.229
ashed-0.227
 rods-0.223
 tanks-0.222
 mouths-0.210
Token a
Token ID257
Feature activation-5.43e-01

Pos logit contributions
 guardian+0.177
 figure+0.164
 wizard+0.163
he+0.162
 owl+0.158
Neg logit contributions
 tanks-0.178
 rods-0.177
ashed-0.174
ksh-0.166
 mouths-0.166
Token big
Token ID1263
Feature activation-3.11e+00

Pos logit contributions
he+0.025
 guardian+0.023
ibrarian+0.023
mother+0.022
 angel+0.022
Neg logit contributions
 tanks-0.038
 rods-0.037
ashed-0.037
ksh-0.036
 mouths-0.036
Token."
Token ID526
Feature activation+4.14e+00

Pos logit contributions
ashed+0.154
ksh+0.152
 rods+0.148
 tanks+0.146
 mouths+0.140
Neg logit contributions
 guardian-0.111
 fairy-0.106
he-0.102
 answer-0.102
 figure-0.101
Token
Token ID198
Feature activation-1.48e+00

Pos logit contributions
ksh+0.316
ashed+0.312
 rods+0.311
 tanks+0.304
 sizes+0.295
Neg logit contributions
he-0.187
 guardian-0.165
 fairy-0.165
 owl-0.163
 wizard-0.162
Token
Token ID198
Feature activation-1.51e+00

Pos logit contributions
 guardian+0.084
 answer+0.076
 figure+0.076
 fairy+0.076
 wizard+0.074
Neg logit contributions
 rods-0.097
 tanks-0.095
ashed-0.095
ksh-0.094
 pic-0.090
TokenThe
Token ID464
Feature activation-9.99e-01

Pos logit contributions
 guardian+0.069
he+0.066
 fairy+0.065
 figure+0.065
 answer+0.063
Neg logit contributions
ksh-0.115
ashed-0.114
 rods-0.111
 tanks-0.111
 mouths-0.106
Token man
Token ID582
Feature activation-7.47e+00

Pos logit contributions
he+0.049
 guardian+0.045
iver+0.043
ibrarian+0.041
ocket+0.041
Neg logit contributions
 rods-0.068
 tanks-0.065
ashed-0.064
 tails-0.063
 mouths-0.062
Token smiled
Token ID13541
Feature activation-1.09e+00

Pos logit contributions
 guardian+0.400
 fairy+0.379
he+0.378
 figure+0.377
 answer+0.371
Neg logit contributions
ksh-0.505
ashed-0.496
 rods-0.489
 tanks-0.486
 mouths-0.455
Token back
Token ID736
Feature activation-1.69e+00

Pos logit contributions
 guardian+0.049
he+0.046
 fairy+0.046
 figure+0.046
 answer+0.045
Neg logit contributions
ksh-0.083
ashed-0.082
 rods-0.081
 tanks-0.080
 mouths-0.076
Token and
Token ID290
Feature activation-4.99e-01

Pos logit contributions
 guardian+0.067
he+0.062
 fairy+0.062
 figure+0.061
 answer+0.061
Neg logit contributions
ksh-0.140
ashed-0.136
 rods-0.135
 tanks-0.135
 sizes-0.128
Token said
Token ID531
Feature activation-2.85e+00

Pos logit contributions
 fairy+0.021
 guardian+0.021
 owl+0.020
 figure+0.020
 wizard+0.020
Neg logit contributions
ksh-0.041
 rods-0.039
 tanks-0.038
ashed-0.037
suits-0.037
Token "
Token ID366
Feature activation+5.44e+00

Pos logit contributions
 fairy+0.162
 guardian+0.162
 wizard+0.159
 answer+0.156
 note+0.154
Neg logit contributions
ksh-0.194
 rods-0.180
ashed-0.180
 tanks-0.178
 sizes-0.172
TokenIt
Token ID1026
Feature activation-4.92e+00

Pos logit contributions
ksh+0.383
ashed+0.377
 rods+0.369
 tanks+0.366
 mouths+0.343
Neg logit contributions
 guardian-0.277
 figure-0.259
 fairy-0.259
he-0.258
 answer-0.256
Token smiled
Token ID13541
Feature activation-1.72e+00

Pos logit contributions
 guardian+0.298
he+0.282
 fairy+0.278
 figure+0.278
 answer+0.277
Neg logit contributions
ashed-0.353
ksh-0.350
 rods-0.341
 tanks-0.337
 mouths-0.321
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
 guardian+0.093
he+0.092
 figure+0.090
ibrarian+0.088
 wizard+0.088
Neg logit contributions
ashed-0.110
ksh-0.110
 tanks-0.101
 rods-0.098
 mouths-0.096
Token felt
Token ID2936
Feature activation-1.97e+00

Pos logit contributions
 guardian+0.049
 fairy+0.048
he+0.048
 figure+0.048
 owl+0.046
Neg logit contributions
ksh-0.084
 tanks-0.082
 rods-0.082
ashed-0.078
 sizes-0.076
Token proud
Token ID6613
Feature activation-6.36e+00

Pos logit contributions
 guardian+0.102
 figure+0.096
 fairy+0.094
 owl+0.094
 wizard+0.093
Neg logit contributions
ashed-0.135
ksh-0.133
 tanks-0.128
 rods-0.127
 pic-0.121
Token of
Token ID286
Feature activation-3.28e+00

Pos logit contributions
 guardian+0.237
 fairy+0.225
he+0.219
 answer+0.218
 figure+0.216
Neg logit contributions
ksh-0.542
 rods-0.528
ashed-0.528
 tanks-0.523
 mouths-0.497
Token herself
Token ID5223
Feature activation-6.10e+00

Pos logit contributions
 guardian+0.133
 figure+0.114
he+0.113
 wizard+0.112
 fairy+0.109
Neg logit contributions
 tanks-0.246
 rods-0.243
ashed-0.242
 pic-0.238
 tails-0.237
Token for
Token ID329
Feature activation-1.05e+00

Pos logit contributions
 guardian+0.328
 fairy+0.311
he+0.311
 figure+0.311
 answer+0.307
Neg logit contributions
ksh-0.411
ashed-0.408
 rods-0.400
 tanks-0.397
 mouths-0.372
Token being
Token ID852
Feature activation-5.09e-01

Pos logit contributions
 guardian+0.055
 figure+0.052
 fairy+0.052
 answer+0.051
he+0.051
Neg logit contributions
ksh-0.074
ashed-0.074
 rods-0.072
 tanks-0.071
suits-0.068
Token such
Token ID884
Feature activation-1.13e+00

Pos logit contributions
 guardian+0.027
he+0.026
 figure+0.025
 fairy+0.025
 answer+0.025
Neg logit contributions
ksh-0.034
ashed-0.034
 rods-0.033
 tanks-0.033
 mouths-0.031
Token a
Token ID257
Feature activation-9.94e-01

Pos logit contributions
 guardian+0.053
he+0.050
 fairy+0.050
 figure+0.050
 answer+0.049
Neg logit contributions
ksh-0.084
ashed-0.083
 rods-0.081
 tanks-0.081
 mouths-0.076
Token good
Token ID922
Feature activation+5.12e+00

Pos logit contributions
 guardian+0.053
 fairy+0.050
 figure+0.048
 wizard+0.048
 note+0.047
Neg logit contributions
ksh-0.074
ashed-0.072
 rods-0.071
 tanks-0.070
 mouths-0.067
Token pad
Token ID14841
Feature activation+4.14e+00

Pos logit contributions
 guardian+0.289
 fairy+0.275
 figure+0.275
he+0.274
 answer+0.269
Neg logit contributions
ksh-0.352
ashed-0.348
 rods-0.341
 tanks-0.340
 mouths-0.316
Token.
Token ID13
Feature activation-3.71e-01

Pos logit contributions
ashed+0.273
 rods+0.273
ksh+0.267
 tanks+0.261
suits+0.253
Neg logit contributions
 guardian-0.217
 fairy-0.202
 wizard-0.198
 mirror-0.197
he-0.197
Token Sure
Token ID10889
Feature activation+1.19e+00

Pos logit contributions
 guardian+0.018
 fairy+0.017
 wizard+0.017
 owl+0.017
 answer+0.016
Neg logit contributions
 rods-0.027
ksh-0.026
 tanks-0.026
ashed-0.026
 mouths-0.025
Token enough
Token ID1576
Feature activation+4.65e+00

Pos logit contributions
ashed+0.055
ksh+0.054
 rods+0.053
 tanks+0.052
 mouths+0.048
Neg logit contributions
 guardian-0.089
he-0.087
 figure-0.085
 fairy-0.085
 answer-0.084
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
ksh+0.373
ashed+0.352
 tanks+0.339
 rods+0.334
 sizes+0.319
Neg logit contributions
 guardian-0.215
 answer-0.209
 fairy-0.209
 figure-0.206
 wizard-0.201
Token he
Token ID339
Feature activation-8.87e-01

Pos logit contributions
ksh+0.093
ashed+0.090
 tanks+0.086
 rods+0.086
 mouths+0.083
Neg logit contributions
 guardian-0.057
 fairy-0.055
 figure-0.055
 wizard-0.053
 answer-0.052
Token saw
Token ID2497
Feature activation-5.40e+00

Pos logit contributions
 guardian+0.054
he+0.052
 answer+0.051
 angel+0.050
 note+0.050
Neg logit contributions
ashed-0.054
 rods-0.051
 tanks-0.050
ksh-0.050
 mouths-0.047
Token a
Token ID257
Feature activation-9.35e-01

Pos logit contributions
 guardian+0.280
he+0.270
 figure+0.260
 answer+0.258
 fairy+0.256
Neg logit contributions
 rods-0.368
ashed-0.367
 tanks-0.365
ksh-0.363
 mouths-0.341
Token pale
Token ID14005
Feature activation-5.24e+00

Pos logit contributions
he+0.046
 guardian+0.044
ibrarian+0.041
 answer+0.041
ad+0.040
Neg logit contributions
 rods-0.064
 tanks-0.063
ashed-0.062
ksh-0.062
 pic-0.059
Token brush
Token ID14093
Feature activation-5.03e+00

Pos logit contributions
he+0.276
 guardian+0.255
ibrarian+0.248
mother+0.228
 answer+0.227
Neg logit contributions
ashed-0.345
 rods-0.341
 tanks-0.336
ksh-0.324
 mouths-0.321
Token.
Token ID13
Feature activation-4.56e-01

Pos logit contributions
 guardian+0.279
 wizard+0.246
 fairy+0.245
 answer+0.243
he+0.240
Neg logit contributions
ashed-0.315
 rods-0.309
ksh-0.307
 tanks-0.304
 sizes-0.288
Token horse
Token ID8223
Feature activation-1.92e+00

Pos logit contributions
ksh+0.136
ashed+0.134
 rods+0.131
 tanks+0.130
 mouths+0.123
Neg logit contributions
 guardian-0.084
 fairy-0.081
 figure-0.080
he-0.079
 answer-0.078
Token.
Token ID13
Feature activation-7.98e-01

Pos logit contributions
 guardian+0.091
he+0.086
 fairy+0.085
 figure+0.085
 answer+0.084
Neg logit contributions
ksh-0.141
ashed-0.139
 rods-0.137
 tanks-0.135
 mouths-0.128
Token
Token ID198
Feature activation-2.33e+00

Pos logit contributions
 guardian+0.040
he+0.039
 fairy+0.038
 owl+0.038
 wizard+0.037
Neg logit contributions
ashed-0.055
 rods-0.055
ksh-0.053
 tanks-0.051
suits-0.050
Token
Token ID198
Feature activation-2.36e+00

Pos logit contributions
 guardian+0.125
 fairy+0.115
 angel+0.115
 answer+0.114
 figure+0.114
Neg logit contributions
 rods-0.157
 tanks-0.154
ashed-0.152
ksh-0.151
 pic-0.148
TokenT
Token ID51
Feature activation+2.24e+00

Pos logit contributions
 guardian+0.127
 fairy+0.120
 answer+0.119
he+0.119
 figure+0.119
Neg logit contributions
ksh-0.158
ashed-0.156
 rods-0.154
 tanks-0.153
 pic-0.141
Tokenoby
Token ID26730
Feature activation-1.48e+00

Pos logit contributions
ksh+0.144
ashed+0.142
 tanks+0.133
 rods+0.131
suits+0.122
Neg logit contributions
 guardian-0.125
 figure-0.118
 note-0.116
he-0.115
 answer-0.113
Token asked
Token ID1965
Feature activation-9.11e-01

Pos logit contributions
 guardian+0.069
 figure+0.067
 fairy+0.067
he+0.066
 answer+0.065
Neg logit contributions
ksh-0.110
 rods-0.106
ashed-0.106
 tanks-0.105
 mouths-0.098
Token the
Token ID262
Feature activation+4.53e-01

Pos logit contributions
 guardian+0.043
 fairy+0.042
 answer+0.042
 figure+0.042
 wizard+0.041
Neg logit contributions
ksh-0.071
ashed-0.068
 rods-0.066
 tanks-0.064
 mouths-0.063
Token horse
Token ID8223
Feature activation-7.54e-01

Pos logit contributions
ksh+0.033
 tanks+0.033
 rods+0.033
ashed+0.032
 mouths+0.030
Neg logit contributions
he-0.021
 guardian-0.020
 figure-0.019
 note-0.018
 fairy-0.018
Token if
Token ID611
Feature activation+3.84e+00

Pos logit contributions
 guardian+0.043
he+0.043
 figure+0.041
 fairy+0.041
 answer+0.040
Neg logit contributions
ashed-0.046
ksh-0.046
 tanks-0.046
 rods-0.046
 mouths-0.042
Token he
Token ID339
Feature activation-7.67e-01

Pos logit contributions
ksh+0.302
ashed+0.289
 rods+0.270
suits+0.266
 sizes+0.263
Neg logit contributions
 fairy-0.194
 guardian-0.192
 wizard-0.191
 figure-0.185
 owl-0.184
Token went
Token ID1816
Feature activation+6.59e+00

Pos logit contributions
 fairy+0.033
 guardian+0.033
 figure+0.033
he+0.031
 owl+0.031
Neg logit contributions
ksh-0.044
 rods-0.042
 tanks-0.041
 sizes-0.040
ashed-0.039
Token on
Token ID319
Feature activation+2.15e-02

Pos logit contributions
ashed+0.472
 rods+0.454
 tanks+0.452
ksh+0.447
 pic+0.420
Neg logit contributions
 guardian-0.326
 figure-0.302
he-0.301
 fairy-0.299
 owl-0.294
Token his
Token ID465
Feature activation-1.40e-01

Pos logit contributions
ksh+0.002
ashed+0.002
 rods+0.002
 tanks+0.002
suits+0.001
Neg logit contributions
 guardian-0.001
 answer-0.001
 fairy-0.001
 figure-0.001
he-0.001
Token way
Token ID835
Feature activation+1.31e+00

Pos logit contributions
 guardian+0.007
 figure+0.007
 answer+0.007
 fairy+0.007
 note+0.006
Neg logit contributions
ksh-0.011
ashed-0.010
 tanks-0.010
 rods-0.010
 sizes-0.009
Token to
Token ID284
Feature activation+2.49e+00

Pos logit contributions
ashed+0.076
 rods+0.073
 tanks+0.073
ksh+0.070
 they+0.068
Neg logit contributions
 guardian-0.075
ibrarian-0.074
he-0.073
 fairy-0.071
 note-0.071
Token find
Token ID1064
Feature activation-8.58e+00

Pos logit contributions
 pic+0.147
 rods+0.146
 tanks+0.145
ashed+0.139
 mouths+0.138
Neg logit contributions
 guardian-0.141
he-0.139
 fairy-0.132
ibrarian-0.132
 note-0.131
Token something
Token ID1223
Feature activation-9.42e-01

Pos logit contributions
 guardian+0.464
he+0.449
 figure+0.436
 fairy+0.433
 note+0.426
Neg logit contributions
ksh-0.563
 rods-0.559
ashed-0.559
 tanks-0.555
 mouths-0.519
Token else
Token ID2073
Feature activation+1.80e+00

Pos logit contributions
 guardian+0.042
 fairy+0.040
 answer+0.039
 figure+0.039
 wizard+0.037
Neg logit contributions
ksh-0.075
ashed-0.073
 rods-0.072
 tanks-0.071
 mouths-0.068
Token to
Token ID284
Feature activation+2.45e+00

Pos logit contributions
ksh+0.140
ashed+0.132
 rods+0.129
 tanks+0.128
 mouths+0.122
Neg logit contributions
 guardian-0.085
 figure-0.081
 fairy-0.081
 answer-0.080
 wizard-0.078
Token eat
Token ID4483
Feature activation+1.06e+00

Pos logit contributions
 rods+0.140
 pic+0.139
 tanks+0.135
ashed+0.134
ksh+0.129
Neg logit contributions
 guardian-0.147
 wizard-0.135
ibrarian-0.133
 fairy-0.133
he-0.131
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
ksh+0.097
ashed+0.096
 tanks+0.090
 rods+0.090
suits+0.087
Neg logit contributions
he-0.033
 answer-0.032
 figure-0.031
 fairy-0.030
 note-0.030
Token All
Token ID1439
Feature activation-1.01e+00

Pos logit contributions
 guardian+0.020
 figure+0.018
 answer+0.018
 owl+0.018
 fairy+0.018
Neg logit contributions
ashed-0.020
 rods-0.019
 tanks-0.019
suits-0.018
ksh-0.018
Token of
Token ID286
Feature activation-3.19e+00

Pos logit contributions
 guardian+0.044
 fairy+0.043
 figure+0.043
he+0.042
 answer+0.042
Neg logit contributions
ksh-0.081
ashed-0.078
 rods-0.076
 tanks-0.075
suits-0.073
Token these
Token ID777
Feature activation+7.84e-01

Pos logit contributions
 guardian+0.148
 fairy+0.144
 figure+0.142
 answer+0.141
he+0.139
Neg logit contributions
ksh-0.245
ashed-0.243
 tanks-0.226
 rods-0.225
suits-0.218
Token perf
Token ID23035
Feature activation+2.89e+00

Pos logit contributions
ksh+0.052
ashed+0.048
 rods+0.047
 tanks+0.047
 mouths+0.042
Neg logit contributions
 fairy-0.044
 answer-0.044
 guardian-0.044
 figure-0.043
he-0.043
Tokenumes
Token ID8139
Feature activation+1.37e+00

Pos logit contributions
ksh+0.171
ashed+0.169
 rods+0.165
 tanks+0.164
 mouths+0.152
Neg logit contributions
 guardian-0.179
he-0.171
 fairy-0.170
 figure-0.169
 answer-0.167
Token smell
Token ID8508
Feature activation-8.06e+00

Pos logit contributions
ksh+0.101
ashed+0.097
 rods+0.094
 tanks+0.094
 mouths+0.087
Neg logit contributions
 guardian-0.068
 fairy-0.065
he-0.065
 answer-0.065
 figure-0.065
Token so
Token ID523
Feature activation+3.11e+00

Pos logit contributions
 fairy+0.366
 answer+0.361
he+0.356
 guardian+0.355
 note+0.351
Neg logit contributions
ksh-0.622
ashed-0.589
 rods-0.587
 tanks-0.582
suits-0.536
Token good
Token ID922
Feature activation+5.34e+00

Pos logit contributions
ksh+0.228
ashed+0.210
 rods+0.209
 tanks+0.207
suits+0.192
Neg logit contributions
 fairy-0.162
 guardian-0.159
 note-0.155
 answer-0.153
 figure-0.152
Token.
Token ID13
Feature activation-3.99e-01

Pos logit contributions
ksh+0.439
 rods+0.414
ashed+0.406
 tanks+0.402
 mouths+0.377
Neg logit contributions
 answer-0.239
 fairy-0.228
 angel-0.213
 mirror-0.212
 owl-0.211
Token Take
Token ID7214
Feature activation-1.07e+00

Pos logit contributions
 guardian+0.024
 owl+0.022
 figure+0.022
 fairy+0.022
 answer+0.022
Neg logit contributions
ashed-0.026
 rods-0.025
 tanks-0.023
suits-0.023
ksh-0.022
Token your
Token ID534
Feature activation+9.15e-01

Pos logit contributions
 guardian+0.057
 fairy+0.055
 figure+0.054
 answer+0.054
he+0.054
Neg logit contributions
ksh-0.072
ashed-0.071
 rods-0.070
 tanks-0.069
 mouths-0.065
Token
Token ID198
Feature activation-2.00e+00

Pos logit contributions
 guardian+0.129
 answer+0.119
 figure+0.118
 angel+0.116
 note+0.115
Neg logit contributions
 rods-0.114
 tanks-0.110
 pic-0.107
ashed-0.105
ksh-0.105
TokenThe
Token ID464
Feature activation-1.31e+00

Pos logit contributions
 guardian+0.116
 answer+0.113
he+0.109
 fairy+0.108
 figure+0.108
Neg logit contributions
 rods-0.118
ksh-0.118
ashed-0.116
 tanks-0.115
 pic-0.109
Token moral
Token ID6573
Feature activation-2.84e+00

Pos logit contributions
he+0.058
 guardian+0.057
 answer+0.051
 figure+0.051
 note+0.050
Neg logit contributions
 tanks-0.096
 rods-0.095
ksh-0.094
ashed-0.094
 mouths-0.088
Token of
Token ID286
Feature activation-3.12e+00

Pos logit contributions
 guardian+0.134
he+0.132
 figure+0.124
 wizard+0.122
 fairy+0.119
Neg logit contributions
 rods-0.200
 tanks-0.194
ashed-0.191
ksh-0.183
 pic-0.180
Token the
Token ID262
Feature activation+3.80e-01

Pos logit contributions
 guardian+0.202
he+0.188
 figure+0.182
 note+0.182
 answer+0.181
Neg logit contributions
 tanks-0.173
 rods-0.171
ksh-0.157
 pic-0.155
ashed-0.153
Token story
Token ID1621
Feature activation-8.89e+00

Pos logit contributions
 tanks+0.036
 rods+0.036
ksh+0.035
ashed+0.035
 pic+0.035
Neg logit contributions
he-0.009
 guardian-0.009
 figure-0.007
 answer-0.006
 mirror-0.006
Token is
Token ID318
Feature activation-2.46e+00

Pos logit contributions
 guardian+0.397
he+0.371
 figure+0.368
 fairy+0.367
 answer+0.360
Neg logit contributions
ksh-0.669
ashed-0.669
 rods-0.658
 tanks-0.654
 mouths-0.620
Token that
Token ID326
Feature activation-5.92e+00

Pos logit contributions
he+0.116
 guardian+0.114
 fairy+0.107
 figure+0.107
 answer+0.106
Neg logit contributions
 rods-0.175
ashed-0.174
 tanks-0.174
ksh-0.170
 mouths-0.162
Token it
Token ID340
Feature activation-1.98e+00

Pos logit contributions
 guardian+0.308
 figure+0.306
 fairy+0.305
 owl+0.296
 answer+0.291
Neg logit contributions
ksh-0.457
ashed-0.445
suits-0.409
 rods-0.407
 tanks-0.391
Token's
Token ID338
Feature activation-6.71e-01

Pos logit contributions
 guardian+0.099
he+0.097
 fairy+0.092
 figure+0.092
 answer+0.091
Neg logit contributions
ashed-0.136
ksh-0.135
 tanks-0.134
 rods-0.133
 mouths-0.126
Token important
Token ID1593
Feature activation+2.04e+00

Pos logit contributions
he+0.040
 guardian+0.040
 figure+0.038
 fairy+0.038
 wizard+0.037
Neg logit contributions
ksh-0.039
ashed-0.039
 tanks-0.039
 rods-0.038
 sizes-0.035
Token more
Token ID517
Feature activation+3.76e+00

Pos logit contributions
 guardian+0.050
 answer+0.047
 fairy+0.047
 figure+0.047
 owl+0.045
Neg logit contributions
ksh-0.094
 rods-0.088
ashed-0.088
 tanks-0.086
 mouths-0.083
Token careful
Token ID8161
Feature activation+5.50e+00

Pos logit contributions
ksh+0.258
ashed+0.250
 rods+0.246
 tanks+0.242
 mouths+0.232
Neg logit contributions
 guardian-0.209
 figure-0.199
 answer-0.196
he-0.196
 fairy-0.195
Token.
Token ID13
Feature activation-3.43e-02

Pos logit contributions
ksh+0.414
ashed+0.409
 rods+0.398
 tanks+0.394
 pic+0.374
Neg logit contributions
 guardian-0.255
he-0.246
 answer-0.244
 figure-0.243
 fairy-0.237
Token He
Token ID679
Feature activation+2.10e-03

Pos logit contributions
 guardian+0.002
 answer+0.002
 voice+0.002
 figure+0.002
he+0.002
Neg logit contributions
ksh-0.002
 rods-0.002
 tanks-0.002
suits-0.002
ashed-0.002
Token didn
Token ID1422
Feature activation+4.68e+00

Pos logit contributions
ksh+0.000
 rods+0.000
 tanks+0.000
ashed+0.000
 mouths+0.000
Neg logit contributions
 figure-0.000
 guardian-0.000
 fairy-0.000
 owl-0.000
he-0.000
Token't
Token ID470
Feature activation-9.07e+00

Pos logit contributions
ksh+0.397
 rods+0.393
 tanks+0.391
ashed+0.388
 pic+0.367
Neg logit contributions
 guardian-0.167
 fairy-0.149
 wizard-0.145
 owl-0.141
 figure-0.139
Token want
Token ID765
Feature activation-1.67e+00

Pos logit contributions
 guardian+0.442
he+0.412
 fairy+0.409
 wizard+0.397
 note+0.392
Neg logit contributions
ashed-0.652
ksh-0.648
 tanks-0.638
 rods-0.631
 sizes-0.597
Token to
Token ID284
Feature activation+2.50e+00

Pos logit contributions
 guardian+0.088
 answer+0.087
 fairy+0.085
 figure+0.083
 owl+0.083
Neg logit contributions
ksh-0.120
ashed-0.115
suits-0.108
 rods-0.108
 tanks-0.106
Token wreck
Token ID15228
Feature activation-3.81e+00

Pos logit contributions
ksh+0.187
ashed+0.186
 rods+0.183
 tanks+0.183
 mouths+0.172
Neg logit contributions
 guardian-0.114
he-0.107
 fairy-0.106
 figure-0.104
 wizard-0.103
Token his
Token ID465
Feature activation-5.66e-02

Pos logit contributions
he+0.163
 answer+0.161
 guardian+0.161
 figure+0.159
 fairy+0.156
Neg logit contributions
ksh-0.301
ashed-0.293
 rods-0.278
 tanks-0.272
 pic-0.264
Token toys
Token ID14958
Feature activation+6.43e+00

Pos logit contributions
 figure+0.003
 guardian+0.003
 answer+0.003
 fairy+0.003
 voice+0.003
Neg logit contributions
ksh-0.004
ashed-0.004
 rods-0.003
 tanks-0.003
 pic-0.003
Tokenmy
Token ID1820
Feature activation-5.71e+00

Pos logit contributions
ashed+0.103
ksh+0.102
 tanks+0.096
suits+0.095
 rods+0.095
Neg logit contributions
 guardian-0.051
 wizard-0.048
 fairy-0.047
 figure-0.046
he-0.046
Token gasped
Token ID45236
Feature activation-3.04e+00

Pos logit contributions
 guardian+0.322
he+0.306
 fairy+0.299
 wizard+0.296
 answer+0.296
Neg logit contributions
ashed-0.368
ksh-0.363
 rods-0.350
 tanks-0.349
 mouths-0.328
Token.
Token ID13
Feature activation-5.48e-02

Pos logit contributions
 guardian+0.162
he+0.147
 figure+0.146
 fairy+0.145
 answer+0.144
Neg logit contributions
ksh-0.207
 tanks-0.201
ashed-0.200
 rods-0.197
 mouths-0.186
Token He
Token ID679
Feature activation+6.46e-02

Pos logit contributions
 guardian+0.003
he+0.003
 fairy+0.003
 figure+0.003
 answer+0.003
Neg logit contributions
ksh-0.003
ashed-0.003
 rods-0.003
 tanks-0.003
 mouths-0.003
Token couldn
Token ID3521
Feature activation+6.55e+00

Pos logit contributions
ashed+0.004
 tanks+0.004
ksh+0.003
 rods+0.003
 mouths+0.003
Neg logit contributions
he-0.004
 guardian-0.004
 answer-0.004
 wizard-0.004
 note-0.004
Token't
Token ID470
Feature activation-9.01e+00

Pos logit contributions
ksh+0.441
 rods+0.429
 tanks+0.426
ashed+0.422
 they+0.418
Neg logit contributions
 guardian-0.328
 mirror-0.294
 wizard-0.285
 veil-0.284
 owl-0.284
Token believe
Token ID1975
Feature activation-8.08e+00

Pos logit contributions
oma+0.497
 guardian+0.491
orn+0.490
 note+0.477
 mirror+0.465
Neg logit contributions
ashed-0.449
 they-0.431
 Finn-0.430
 Nem-0.413
ksh-0.413
Token his
Token ID465
Feature activation-1.14e-01

Pos logit contributions
 guardian+0.403
 fairy+0.383
he+0.383
 figure+0.380
 answer+0.377
Neg logit contributions
ksh-0.581
ashed-0.576
 rods-0.561
 tanks-0.554
 mouths-0.523
Token ears
Token ID11368
Feature activation+3.24e+00

Pos logit contributions
he+0.009
 guardian+0.008
 wizard+0.008
 figure+0.007
ibrarian+0.007
Neg logit contributions
 rods-0.005
 tanks-0.005
 mouths-0.005
ashed-0.005
ksh-0.005
Token.
Token ID13
Feature activation-3.66e-02

Pos logit contributions
ksh+0.247
ashed+0.245
 rods+0.240
 tanks+0.239
 mouths+0.226
Neg logit contributions
 guardian-0.144
he-0.134
 fairy-0.134
 figure-0.133
 answer-0.131
Token He
Token ID679
Feature activation+3.15e-02

Pos logit contributions
 guardian+0.002
he+0.002
 answer+0.002
 fairy+0.002
 wizard+0.002
Neg logit contributions
ashed-0.003
ksh-0.003
 rods-0.002
 tanks-0.002
 mouths-0.002