TOP ACTIVATIONS
MAX = 17.130

to build towers and houses and cars with the blocks .
, she searched the maze and finally found the way out
You have to keep trying and learning and having fun .
my saw what had happened and told him that he should
They pretended to be knights and shot the arrow at imaginary
he looked he saw trees and mountains and rivers . And
mum saw what had happened and she rushed to Jem ma
't wait to tell mummy and daddy what aunt had whispered
jet tried to turn around and go back home , but
help him when he stumbled and fell . Sam
, she saw people talking and having fun , but Judy
wanted to keep it clean and warm always . The end
She lly thought and thought and then she had an idea
why don 't you go and hug each other and say
he saw trees and mountains and rivers . And then he
. Many cars were stuck and could not move . Lily
Sarah saw what was happening and yelled at the new boy
find it . He searched and searched , but it was
Ralph was trying to do and said , " Let me
liked to play with cars and robots . He saw Angel

MIN ACTIVATIONS
MIN = -19.326

smiled . " Looks like the frog likes you
table . " Looks like mom has some mail
laughed and said , " Looks like that apple made you
's mom my . " Looks like the wave destroyed all
smiled and said , " Looks like you like the wardrobe
the owner smiled . " Looks like you two are friends
made her spark le and gle am . She felt so
for the toast and smiled gle efully . His siblings were
sign ' happy ', ' birth day ', and ' you
garden . He was immediately aw est ruck .
garden , Molly noticed something gle aming on the ground .
the sky . Lily felt aw ed , with her eyes
and it spark led and gle amed . He ran around
the bottom . He laughed gle efully as he went .
curtain moved . Johnny laughed gle efully as he watched it
The young girl smiled gle efully as the net filled
up at the stars , aw est ruck that the night
. Everything was wet and gle aming , with rain still
was finished , the trophy gle amed and spark led in
was a toy car that gle amed in the morning sun

INTERVAL 8.016 - 17.130
CONTAINS 12.759%

he didn 't know what to do . He just stayed
tried to stand up , but he was too scared .
and didn 't know what to do . Then
dog runs to the little house . Lily and Tom are
She took their paper things and threw them in the trash

INTERVAL -1.098 - 8.016
CONTAINS 39.792%

L ily liked to design things . She used paper ,
" I am sorry , Ben . I should not have
little boy said , " Mom , can I rub the
walking near her house when she noticed her elderly neighbor .
and a spark ly blue light appeared . An elf popped

INTERVAL -10.212 - -1.098
CONTAINS 46.787%

But Mia didn 't listen . When they got
for a tutor . He looked high and low and eventually
on the table and grabbed the yarn . The yarn rolled
they couldn 't even see the shore anymore . Tim suddenly
thinks sorting is boring and wants to finish quickly .

INTERVAL -19.326 - -10.212
CONTAINS 0.662%

her face . <|endoftext|> Once upon a time there was a
new friends . <|endoftext|> Once upon a time , there was
her face . <|endoftext|> Once upon a time , there was
in moderation . <|endoftext|> Once upon a time , there was
guitar for Lily and they became good friends . <|endoftext|> Once
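
The four INTERVAL blocks above appear to be equal-width bins spanning MIN to MAX. The sketch below is an illustration only, not the dashboard's own code: it assumes equal-width binning, and the helper name `interval_edges` is hypothetical. It recomputes the bin boundaries from the reported MIN and MAX and checks that the CONTAINS percentages account for all sampled tokens.

```python
# Minimal sketch, assuming the dashboard splits [MIN, MAX] into equal-width bins.
# `interval_edges` is a hypothetical helper, not part of any library.

def interval_edges(lo, hi, n_bins):
    """Return the n_bins + 1 boundaries of equal-width bins over [lo, hi]."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]

# MIN and MAX as reported in the dump.
edges = interval_edges(-19.326, 17.130, 4)

# CONTAINS percentages as reported, ordered from the lowest interval to the highest.
shares = [0.662, 46.787, 39.792, 12.759]
total = round(sum(shares), 3)

print(edges)
print(total)
```

Under this assumption each bin is 9.114 wide, the computed edges match the INTERVAL boundaries listed above, and the four shares sum to 100%.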
Token to
Token ID284
Feature activation+1.35e+01

Pos logit contributions
 best+0.170
 two+0.168
 biggest+0.151
 most+0.151
 certainly+0.139
Neg logit contributions
ho-0.155
anged-0.117
 spine-0.111
keeping-0.110
 inspection-0.110
Token build
Token ID1382
Feature activation-1.54e+00

Pos logit contributions
ho+0.368
 inspection+0.288
olph+0.281
keeping+0.268
 throats+0.261
Neg logit contributions
 best-0.314
 two-0.308
 exact-0.297
 biggest-0.290
 certainly-0.282
Token towers
Token ID18028
Feature activation+6.76e+00

Pos logit contributions
 best+0.033
 two+0.032
 exact+0.031
amed+0.031
 biggest+0.031
Neg logit contributions
ho-0.042
 inspection-0.034
olph-0.033
keeping-0.032
 throats-0.031
Token and
Token ID290
Feature activation+1.67e+01

Pos logit contributions
ho+0.165
 inspection+0.127
olph+0.123
keeping+0.117
 throats+0.115
Neg logit contributions
 best-0.174
 two-0.167
 exact-0.163
 biggest-0.162
amed-0.159
Token houses
Token ID7777
Feature activation+9.42e+00

Pos logit contributions
ho+0.433
 inspection+0.335
olph+0.328
keeping+0.312
 throats+0.308
Neg logit contributions
 best-0.407
 two-0.395
 exact-0.382
 biggest-0.380
amed-0.370
Token and
Token ID290
Feature activation+1.65e+01

Pos logit contributions
ho+0.294
olph+0.239
 inspection+0.239
 throats+0.229
ipop+0.225
Neg logit contributions
 best-0.184
 two-0.177
 exact-0.163
 biggest-0.162
amed-0.159
Token cars
Token ID5006
Feature activation+5.25e+00

Pos logit contributions
ho+0.466
olph+0.366
 inspection+0.361
ipop+0.353
 throats+0.342
Neg logit contributions
 best-0.374
 two-0.374
 biggest-0.347
 exact-0.342
 most-0.327
Token with
Token ID351
Feature activation-8.76e-01

Pos logit contributions
ho+0.152
 inspection+0.123
olph+0.119
keeping+0.114
 throats+0.113
Neg logit contributions
 best-0.111
 two-0.106
 exact-0.103
 biggest-0.102
amed-0.100
Token the
Token ID262
Feature activation-4.90e+00

Pos logit contributions
 two+0.019
 best+0.019
 biggest+0.017
 exact+0.017
 most+0.016
Neg logit contributions
ho-0.026
olph-0.021
 inspection-0.020
round-0.020
ipop-0.020
Token blocks
Token ID7021
Feature activation+6.63e+00

Pos logit contributions
 best+0.115
amed+0.111
 two+0.110
 exact+0.110
ceed+0.108
Neg logit contributions
ho-0.126
 inspection-0.097
olph-0.092
keeping-0.089
 throats-0.088
Token.
Token ID13
Feature activation+4.83e-01

Pos logit contributions
ho+0.209
 inspection+0.174
olph+0.172
 throats+0.162
keeping+0.161
Neg logit contributions
 best-0.128
 two-0.122
 biggest-0.113
 exact-0.113
amed-0.108
Token,
Token ID11
Feature activation+1.32e+01

Pos logit contributions
 best+0.115
 two+0.112
 exact+0.105
 biggest+0.104
amed+0.101
Neg logit contributions
ho-0.155
 inspection-0.124
olph-0.122
keeping-0.117
round-0.114
Token she
Token ID673
Feature activation+3.89e+00

Pos logit contributions
ho+0.343
 inspection+0.267
olph+0.260
keeping+0.254
 throats+0.249
Neg logit contributions
 best-0.322
 two-0.316
 exact-0.297
 biggest-0.295
 certainly-0.283
Token searched
Token ID16499
Feature activation-8.52e-01

Pos logit contributions
ho+0.100
 inspection+0.074
olph+0.073
 throats+0.068
ipop+0.067
Neg logit contributions
 best-0.093
 exact-0.091
 biggest-0.090
ceed-0.087
amed-0.086
Token the
Token ID262
Feature activation-5.20e+00

Pos logit contributions
 best+0.019
 two+0.018
 biggest+0.017
 exact+0.017
 most+0.016
Neg logit contributions
ho-0.025
olph-0.020
keeping-0.019
round-0.019
 throats-0.019
Token maze
Token ID31237
Feature activation+5.19e+00

Pos logit contributions
 best+0.112
 two+0.107
 biggest+0.104
 exact+0.104
amed+0.097
Neg logit contributions
ho-0.150
 inspection-0.125
olph-0.121
keeping-0.115
 throats-0.113
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
ho+0.123
 inspection+0.094
olph+0.089
keeping+0.086
 throats+0.083
Neg logit contributions
 best-0.135
 two-0.130
 exact-0.128
 biggest-0.127
amed-0.126
Token finally
Token ID3443
Feature activation+6.28e+00

Pos logit contributions
ho+0.477
olph+0.381
 inspection+0.374
keeping+0.363
 throats+0.362
Neg logit contributions
 best-0.348
 two-0.342
 exact-0.326
 biggest-0.323
amed-0.316
Token found
Token ID1043
Feature activation-6.43e+00

Pos logit contributions
ho+0.146
 inspection+0.126
olph+0.097
ipop+0.095
 he+0.094
Neg logit contributions
 best-0.164
 exact-0.160
 biggest-0.159
ceed-0.157
 teaches-0.155
Token the
Token ID262
Feature activation-5.90e+00

Pos logit contributions
 best+0.144
 exact+0.143
amed+0.142
 teaches+0.140
ceed+0.137
Neg logit contributions
ho-0.161
 inspection-0.136
keeping-0.123
 Gig-0.117
olph-0.112
Token way
Token ID835
Feature activation+6.51e+00

Pos logit contributions
amed+0.114
 best+0.111
ceed+0.108
 exact+0.107
 two+0.105
Neg logit contributions
ho-0.172
 inspection-0.135
keeping-0.129
olph-0.126
 throats-0.123
Token out
Token ID503
Feature activation+3.50e+00

Pos logit contributions
ho+0.165
 inspection+0.127
olph+0.126
keeping+0.118
 throats+0.118
Neg logit contributions
 best-0.163
 two-0.156
 exact-0.152
 biggest-0.151
amed-0.147
Token You
Token ID921
Feature activation-5.29e+00

Pos logit contributions
amed+0.133
ceed+0.123
 certainly+0.121
icious+0.118
 best+0.118
Neg logit contributions
ho-0.129
 inspection-0.094
 yacht-0.087
ipop-0.085
 throats-0.085
Token have
Token ID423
Feature activation-3.94e+00

Pos logit contributions
amed+0.107
 exact+0.103
 best+0.101
 biggest+0.101
ceed+0.098
Neg logit contributions
ho-0.148
 inspection-0.112
olph-0.109
ipop-0.106
 he-0.103
Token to
Token ID284
Feature activation+1.37e+01

Pos logit contributions
 best+0.127
 exact+0.125
amed+0.123
 biggest+0.122
 two+0.122
Neg logit contributions
ho-0.065
 inspection-0.043
olph-0.042
 throats-0.038
keeping-0.038
Token keep
Token ID1394
Feature activation-4.61e+00

Pos logit contributions
ho+0.374
 inspection+0.294
olph+0.287
keeping+0.278
 throats+0.272
Neg logit contributions
 best-0.306
 two-0.288
 biggest-0.288
 exact-0.286
amed-0.285
Token trying
Token ID2111
Feature activation+2.07e+00

Pos logit contributions
 best+0.111
 two+0.109
 biggest+0.100
 exact+0.098
amed+0.094
Neg logit contributions
ho-0.132
 inspection-0.104
olph-0.093
ipop-0.092
 throats-0.092
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
ho+0.052
 inspection+0.039
olph+0.038
keeping+0.036
 throats+0.036
Neg logit contributions
 best-0.053
 two-0.051
 exact-0.050
 biggest-0.049
amed-0.048
Token learning
Token ID4673
Feature activation+5.42e-01

Pos logit contributions
ho+0.449
 inspection+0.355
olph+0.344
keeping+0.331
 throats+0.325
Neg logit contributions
 best-0.373
 two-0.356
 exact-0.349
 biggest-0.346
amed-0.343
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
ho+0.013
 inspection+0.011
keeping+0.010
olph+0.010
 throats+0.010
Neg logit contributions
amed-0.012
 exact-0.012
 best-0.012
 two-0.011
 biggest-0.011
Token having
Token ID1719
Feature activation-6.58e+00

Pos logit contributions
ho+0.476
 inspection+0.383
olph+0.374
keeping+0.357
 throats+0.353
Neg logit contributions
 best-0.342
 two-0.327
 exact-0.317
 biggest-0.313
amed-0.305
Token fun
Token ID1257
Feature activation+2.65e+00

Pos logit contributions
amed+0.118
 exact+0.115
 best+0.114
ceed+0.109
 biggest+0.108
Neg logit contributions
ho-0.185
 inspection-0.164
keeping-0.158
olph-0.149
 throats-0.148
Token.
Token ID13
Feature activation+4.64e-01

Pos logit contributions
ho+0.068
 inspection+0.055
olph+0.054
keeping+0.053
 throats+0.050
Neg logit contributions
 best-0.059
 exact-0.058
amed-0.057
 biggest-0.057
 two-0.057
Tokenmy
Token ID1820
Feature activation+2.65e+00

Pos logit contributions
 best+0.092
 two+0.087
 biggest+0.079
 exact+0.077
 nic+0.070
Neg logit contributions
ho-0.213
 inspection-0.186
keeping-0.183
 throats-0.174
olph-0.173
Token saw
Token ID2497
Feature activation-6.57e+00

Pos logit contributions
ho+0.072
 inspection+0.057
olph+0.056
keeping+0.053
 throats+0.053
Neg logit contributions
 best-0.059
 exact-0.056
 two-0.056
 biggest-0.056
amed-0.055
Token what
Token ID644
Feature activation+5.09e-01

Pos logit contributions
 best+0.106
 two+0.102
 exact+0.096
 biggest+0.095
amed+0.091
Neg logit contributions
ho-0.225
 inspection-0.186
olph-0.181
keeping-0.176
 throats-0.175
Token had
Token ID550
Feature activation-4.24e+00

Pos logit contributions
ho+0.015
 inspection+0.012
olph+0.011
keeping+0.011
 throats+0.011
Neg logit contributions
 best-0.011
 two-0.011
 biggest-0.010
 exact-0.010
 certainly-0.010
Token happened
Token ID3022
Feature activation-3.71e+00

Pos logit contributions
 best+0.099
 exact+0.094
 biggest+0.093
 two+0.092
amed+0.091
Neg logit contributions
ho-0.111
 inspection-0.088
olph-0.084
keeping-0.083
 throats-0.079
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
 best+0.086
 two+0.083
 exact+0.080
 biggest+0.079
amed+0.078
Neg logit contributions
ho-0.101
 inspection-0.080
olph-0.077
keeping-0.074
 throats-0.073
Token told
Token ID1297
Feature activation-4.58e+00

Pos logit contributions
ho+0.460
 inspection+0.367
olph+0.364
round+0.361
keeping+0.354
Neg logit contributions
 two-0.361
 best-0.352
 exact-0.324
amed-0.319
 biggest-0.318
Token him
Token ID683
Feature activation+1.28e+00

Pos logit contributions
 two+0.114
 best+0.111
 certainly+0.098
 biggest+0.098
 teaches+0.097
Neg logit contributions
ho-0.126
 inspection-0.095
round-0.091
olph-0.090
ipop-0.087
Token that
Token ID326
Feature activation+4.65e+00

Pos logit contributions
ho+0.033
olph+0.025
 inspection+0.024
keeping+0.023
 throats+0.023
Neg logit contributions
 best-0.034
 two-0.033
 certainly-0.030
 biggest-0.030
 exact-0.030
Token he
Token ID339
Feature activation+3.76e+00

Pos logit contributions
ho+0.106
round+0.083
 inspection+0.082
olph+0.078
 throats+0.078
Neg logit contributions
 two-0.138
 best-0.135
 most-0.121
 biggest-0.115
 teaches-0.114
Token should
Token ID815
Feature activation+4.01e+00

Pos logit contributions
ho+0.100
 inspection+0.076
olph+0.076
keeping+0.070
 throats+0.070
Neg logit contributions
 best-0.084
 exact-0.082
 biggest-0.081
amed-0.081
 two-0.081
Token They
Token ID1119
Feature activation+1.12e+00

Pos logit contributions
ho+0.004
 inspection+0.003
olph+0.003
keeping+0.003
 throats+0.003
Neg logit contributions
 best-0.003
 two-0.003
 exact-0.002
 biggest-0.002
amed-0.002
Token pretended
Token ID36766
Feature activation+1.03e+00

Pos logit contributions
ho+0.032
 inspection+0.025
olph+0.025
keeping+0.024
 throats+0.023
Neg logit contributions
 best-0.024
 two-0.023
 exact-0.023
 biggest-0.022
amed-0.022
Token to
Token ID284
Feature activation+1.35e+01

Pos logit contributions
ho+0.024
olph+0.019
anged+0.018
keeping+0.017
away+0.017
Neg logit contributions
 best-0.028
 two-0.028
 biggest-0.025
 most-0.025
 certainly-0.024
Token be
Token ID307
Feature activation-6.02e+00

Pos logit contributions
ho+0.398
 inspection+0.322
olph+0.312
keeping+0.302
 throats+0.297
Neg logit contributions
 best-0.275
 two-0.262
 exact-0.256
 biggest-0.253
amed-0.248
Token knights
Token ID27536
Feature activation+7.27e+00

Pos logit contributions
 best+0.123
 two+0.118
 exact+0.116
 biggest+0.114
amed+0.112
Neg logit contributions
ho-0.177
 inspection-0.143
olph-0.139
keeping-0.134
 throats-0.132
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
ho+0.179
 inspection+0.143
olph+0.134
 throats+0.130
keeping+0.127
Neg logit contributions
 best-0.185
 two-0.178
amed-0.165
 exact-0.165
 biggest-0.165
Token shot
Token ID2823
Feature activation-3.61e+00

Pos logit contributions
ho+0.475
 inspection+0.393
olph+0.380
keeping+0.370
round+0.362
Neg logit contributions
 two-0.353
 best-0.352
 biggest-0.317
 most-0.301
 teaches-0.299
Token the
Token ID262
Feature activation-5.32e+00

Pos logit contributions
 two+0.078
 best+0.078
 biggest+0.069
 most+0.065
amed+0.064
Neg logit contributions
ho-0.106
olph-0.086
ipop-0.084
 inspection-0.083
round-0.081
Token arrow
Token ID15452
Feature activation+4.47e+00

Pos logit contributions
 best+0.132
 two+0.127
 biggest+0.123
 exact+0.121
amed+0.116
Neg logit contributions
ho-0.137
 inspection-0.109
olph-0.105
keeping-0.100
 throats-0.097
Token at
Token ID379
Feature activation+5.25e+00

Pos logit contributions
ho+0.130
 inspection+0.111
olph+0.109
keeping+0.103
 throats+0.101
Neg logit contributions
 best-0.096
 two-0.092
 biggest-0.084
 exact-0.081
amed-0.080
Token imaginary
Token ID26726
Feature activation-1.69e+00

Pos logit contributions
ho+0.147
 inspection+0.118
olph+0.116
ipop+0.110
 throats+0.109
Neg logit contributions
 best-0.118
 two-0.114
 biggest-0.107
 exact-0.104
amed-0.101
Token he
Token ID339
Feature activation+5.23e+00

Pos logit contributions
ho+0.083
 inspection+0.051
olph+0.048
keeping+0.044
 throats+0.043
Neg logit contributions
 best-0.191
 two-0.186
 exact-0.183
 biggest-0.181
amed-0.178
Token looked
Token ID3114
Feature activation-3.99e+00

Pos logit contributions
ho+0.133
 inspection+0.103
olph+0.100
keeping+0.096
 throats+0.094
Neg logit contributions
 best-0.128
 two-0.123
 exact-0.121
 biggest-0.119
amed-0.118
Token he
Token ID339
Feature activation+4.89e+00

Pos logit contributions
 best+0.100
 two+0.096
 exact+0.094
 biggest+0.093
amed+0.090
Neg logit contributions
ho-0.102
olph-0.077
 inspection-0.076
keeping-0.076
 throats-0.074
Token saw
Token ID2497
Feature activation-6.61e+00

Pos logit contributions
ho+0.122
 inspection+0.095
olph+0.089
keeping+0.081
anged+0.080
Neg logit contributions
 best-0.123
 two-0.116
 biggest-0.115
 exact-0.115
ceed-0.113
Token trees
Token ID7150
Feature activation+9.36e+00

Pos logit contributions
 best+0.120
 two+0.113
 exact+0.110
 biggest+0.108
amed+0.106
Neg logit contributions
ho-0.212
 inspection-0.174
olph-0.170
keeping-0.164
 throats-0.162
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
ho+0.245
olph+0.206
 inspection+0.198
keeping+0.194
 throats+0.192
Neg logit contributions
 best-0.225
 two-0.222
 exact-0.208
 biggest-0.203
 most-0.195
Token mountains
Token ID12269
Feature activation+6.82e+00

Pos logit contributions
ho+0.497
olph+0.406
 inspection+0.393
ipop+0.388
keeping+0.387
Neg logit contributions
 two-0.340
 best-0.330
 biggest-0.307
 exact-0.305
 most-0.294
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
ho+0.188
olph+0.157
 inspection+0.152
 throats+0.151
ipop+0.146
Neg logit contributions
 best-0.151
 two-0.146
 exact-0.138
 biggest-0.136
amed-0.134
Token rivers
Token ID18180
Feature activation+3.46e+00

Pos logit contributions
ho+0.457
 inspection+0.364
olph+0.355
keeping+0.340
 throats+0.335
Neg logit contributions
 best-0.360
 two-0.345
 exact-0.336
 biggest-0.333
amed-0.325
Token.
Token ID13
Feature activation+4.32e-01

Pos logit contributions
ho+0.099
olph+0.080
 inspection+0.080
 throats+0.077
keeping+0.075
Neg logit contributions
 best-0.073
 two-0.071
 exact-0.067
 biggest-0.066
amed-0.066
Token And
Token ID843
Feature activation+7.38e+00

Pos logit contributions
ho+0.012
 inspection+0.010
olph+0.010
keeping+0.009
 throats+0.009
Neg logit contributions
 best-0.009
 two-0.009
 exact-0.009
 biggest-0.009
amed-0.009
Token mum
Token ID25682
Feature activation+1.08e+01

Pos logit contributions
ho+0.300
 inspection+0.219
olph+0.218
keeping+0.207
 throats+0.197
Neg logit contributions
 best-0.347
 two-0.336
 biggest-0.319
 exact-0.318
 certainly-0.308
Token saw
Token ID2497
Feature activation-6.06e+00

Pos logit contributions
ho+0.281
 inspection+0.219
olph+0.213
keeping+0.200
 throats+0.199
Neg logit contributions
 best-0.254
 exact-0.242
 two-0.241
 biggest-0.240
amed-0.236
Token what
Token ID644
Feature activation+5.30e-01

Pos logit contributions
 two+0.100
 best+0.100
 exact+0.089
 biggest+0.089
amed+0.084
Neg logit contributions
ho-0.206
 inspection-0.169
olph-0.167
 throats-0.161
keeping-0.160
Token had
Token ID550
Feature activation-4.06e+00

Pos logit contributions
ho+0.014
 inspection+0.011
olph+0.011
keeping+0.011
round+0.010
Neg logit contributions
 best-0.013
 two-0.013
 exact-0.012
 biggest-0.012
 certainly-0.012
Token happened
Token ID3022
Feature activation-3.87e+00

Pos logit contributions
 best+0.086
 two+0.081
 exact+0.081
 biggest+0.081
amed+0.078
Neg logit contributions
ho-0.115
 inspection-0.093
olph-0.089
keeping-0.088
 throats-0.084
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
 best+0.089
 two+0.086
 exact+0.082
 biggest+0.080
amed+0.079
Neg logit contributions
ho-0.106
 inspection-0.084
olph-0.082
 throats-0.078
keeping-0.077
Token she
Token ID673
Feature activation+3.01e+00

Pos logit contributions
ho+0.451
olph+0.359
 inspection+0.358
keeping+0.350
round+0.350
Neg logit contributions
 two-0.365
 best-0.358
 exact-0.334
amed-0.327
 certainly-0.326
Token rushed
Token ID13999
Feature activation-3.42e+00

Pos logit contributions
ho+0.080
 inspection+0.062
olph+0.060
keeping+0.057
 throats+0.057
Neg logit contributions
 best-0.071
 two-0.068
 exact-0.067
 biggest-0.066
amed-0.065
Token to
Token ID284
Feature activation+1.32e+01

Pos logit contributions
 two+0.068
amed+0.059
 right+0.058
 best+0.055
 correct+0.048
Neg logit contributions
ho-0.109
 throats-0.094
 inspection-0.093
 tongues-0.091
fish-0.090
Token Jem
Token ID48199
Feature activation-2.05e-01

Pos logit contributions
ho+0.421
olph+0.327
round+0.325
 throats+0.322
 inspection+0.322
Neg logit contributions
 two-0.273
 best-0.244
 exact-0.233
 certainly-0.232
 correct-0.221
Tokenma
Token ID2611
Feature activation+8.15e+00

Pos logit contributions
 best+0.005
 biggest+0.005
 exact+0.004
ceed+0.004
amed+0.004
Neg logit contributions
ho-0.005
 inspection-0.004
olph-0.004
keeping-0.004
ipop-0.004
Token't
Token ID470
Feature activation+2.70e+00

Pos logit contributions
ho+0.129
olph+0.106
 inspection+0.105
 Gig+0.099
ipop+0.098
Neg logit contributions
 best-0.087
 two-0.084
 exact-0.072
 biggest-0.070
 teaches-0.069
Token wait
Token ID4043
Feature activation+2.96e+00

Pos logit contributions
ho+0.080
 inspection+0.064
olph+0.062
keeping+0.060
 throats+0.059
Neg logit contributions
 best-0.056
 two-0.054
 exact-0.052
 biggest-0.051
amed-0.050
Token to
Token ID284
Feature activation+1.36e+01

Pos logit contributions
ho+0.074
 inspection+0.056
olph+0.055
keeping+0.052
 throats+0.051
Neg logit contributions
 best-0.076
 two-0.073
 exact-0.071
 biggest-0.070
amed-0.067
Token tell
Token ID1560
Feature activation-2.94e+00

Pos logit contributions
ho+0.419
 inspection+0.338
olph+0.328
 throats+0.310
ipop+0.307
Neg logit contributions
 two-0.281
 best-0.279
 exact-0.255
 biggest-0.250
 certainly-0.242
Token mummy
Token ID38567
Feature activation+1.06e+01

Pos logit contributions
 best+0.070
 two+0.069
 biggest+0.063
 exact+0.061
 certainly+0.059
Neg logit contributions
ho-0.082
 inspection-0.063
olph-0.062
keeping-0.058
 throats-0.057
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
ho+0.299
 inspection+0.238
olph+0.231
keeping+0.222
 throats+0.219
Neg logit contributions
 best-0.235
 two-0.226
 exact-0.217
 biggest-0.215
amed-0.210
Token daddy
Token ID35822
Feature activation+8.58e+00

Pos logit contributions
ho+0.422
 inspection+0.338
round+0.317
olph+0.311
keeping+0.301
Neg logit contributions
 two-0.410
 best-0.408
 biggest-0.363
 exact-0.356
 most-0.353
Token what
Token ID644
Feature activation-7.61e-02

Pos logit contributions
ho+0.246
 inspection+0.198
olph+0.192
keeping+0.185
 throats+0.183
Neg logit contributions
 best-0.185
 two-0.177
 exact-0.170
 biggest-0.168
amed-0.165
Token aunt
Token ID25949
Feature activation+1.04e+01

Pos logit contributions
 best+0.002
 two+0.002
 exact+0.002
 biggest+0.002
 certainly+0.002
Neg logit contributions
ho-0.002
 inspection-0.001
olph-0.001
keeping-0.001
round-0.001
Token had
Token ID550
Feature activation-4.99e+00

Pos logit contributions
ho+0.306
 inspection+0.248
olph+0.237
keeping+0.234
 throats+0.233
Neg logit contributions
 best-0.223
 two-0.213
 exact-0.200
 biggest-0.197
 certainly-0.197
Token whispered
Token ID25029
Feature activation-3.87e+00

Pos logit contributions
 best+0.104
 biggest+0.101
 exact+0.100
 two+0.097
ceed+0.094
Neg logit contributions
ho-0.142
 inspection-0.114
olph-0.111
keeping-0.109
 throats-0.103
Token jet
Token ID12644
Feature activation+6.67e+00

Pos logit contributions
ho+0.170
 inspection+0.150
keeping+0.142
 throats+0.140
olph+0.136
Neg logit contributions
 two-0.113
 best-0.110
 exact-0.098
 teaches-0.093
 biggest-0.093
Token tried
Token ID3088
Feature activation-1.15e+00

Pos logit contributions
ho+0.172
 inspection+0.131
olph+0.126
keeping+0.124
round+0.117
Neg logit contributions
 best-0.158
 exact-0.154
 two-0.152
 biggest-0.150
ceed-0.146
Token to
Token ID284
Feature activation+1.39e+01

Pos logit contributions
 best+0.035
 two+0.034
 biggest+0.032
 most+0.031
 exact+0.030
Neg logit contributions
ho-0.024
olph-0.018
round-0.017
ipop-0.017
 inspection-0.017
Token turn
Token ID1210
Feature activation+3.72e-01

Pos logit contributions
ho+0.379
 inspection+0.300
olph+0.290
keeping+0.279
 throats+0.274
Neg logit contributions
 best-0.319
 two-0.304
 exact-0.299
 biggest-0.295
amed-0.291
Token around
Token ID1088
Feature activation+8.01e+00

Pos logit contributions
ho+0.009
olph+0.007
 inspection+0.007
keeping+0.007
 throats+0.006
Neg logit contributions
 best-0.010
 two-0.010
 exact-0.009
 biggest-0.009
amed-0.009
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
ho+0.225
olph+0.177
 inspection+0.173
ipop+0.170
 throats+0.170
Neg logit contributions
 best-0.182
 two-0.179
 exact-0.165
 biggest-0.160
 most-0.157
Token go
Token ID467
Feature activation+1.94e+00

Pos logit contributions
ho+0.443
olph+0.350
 inspection+0.340
ipop+0.323
keeping+0.322
Neg logit contributions
 two-0.384
 best-0.384
 exact-0.362
 biggest-0.358
 certainly-0.343
Token back
Token ID736
Feature activation+7.63e+00

Pos logit contributions
ho+0.050
 inspection+0.039
olph+0.038
keeping+0.036
 throats+0.036
Neg logit contributions
 best-0.048
 two-0.046
 exact-0.045
 biggest-0.044
amed-0.044
Token home
Token ID1363
Feature activation+8.79e+00

Pos logit contributions
ho+0.174
olph+0.126
 inspection+0.124
 throats+0.120
ipop+0.113
Neg logit contributions
 best-0.218
 two-0.210
 biggest-0.201
 exact-0.200
amed-0.194
Token,
Token ID11
Feature activation+1.33e+01

Pos logit contributions
ho+0.256
 inspection+0.206
olph+0.200
keeping+0.191
 throats+0.190
Neg logit contributions
 best-0.186
 two-0.177
 exact-0.171
 biggest-0.169
amed-0.166
Token but
Token ID475
Feature activation+8.28e+00

Pos logit contributions
ho+0.386
olph+0.305
 inspection+0.301
ipop+0.288
keeping+0.287
Neg logit contributions
 best-0.304
 two-0.293
 exact-0.278
 most-0.269
 biggest-0.266
Token help
Token ID1037
Feature activation+3.82e+00

Pos logit contributions
ho+0.356
 inspection+0.283
olph+0.274
keeping+0.262
 throats+0.259
Neg logit contributions
 best-0.280
 two-0.273
 exact-0.261
 biggest-0.258
amed-0.249
Token him
Token ID683
Feature activation+1.66e+00

Pos logit contributions
ho+0.120
 inspection+0.094
olph+0.090
ipop+0.090
anged+0.087
Neg logit contributions
 best-0.079
 two-0.079
 biggest-0.070
 exact-0.070
 certainly-0.067
Token when
Token ID618
Feature activation+7.39e+00

Pos logit contributions
ho+0.047
 inspection+0.037
olph+0.036
keeping+0.035
 throats+0.034
Neg logit contributions
 best-0.036
 two-0.035
 exact-0.034
 biggest-0.034
amed-0.033
Token he
Token ID339
Feature activation+4.38e+00

Pos logit contributions
ho+0.150
 inspection+0.100
olph+0.096
round+0.095
 throats+0.091
Neg logit contributions
 two-0.234
 best-0.232
 exact-0.212
 biggest-0.211
 most-0.208
Token stumbled
Token ID24241
Feature activation-3.63e-02

Pos logit contributions
ho+0.122
 inspection+0.096
olph+0.094
 throats+0.088
keeping+0.088
Neg logit contributions
 best-0.097
 exact-0.092
 two-0.092
 biggest-0.091
amed-0.090
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
 best+0.001
 exact+0.001
 two+0.001
 biggest+0.001
amed+0.001
Neg logit contributions
ho-0.001
 inspection-0.001
olph-0.001
keeping-0.001
ipop-0.001
Token fell
Token ID3214
Feature activation+7.48e-01

Pos logit contributions
ho+0.412
 inspection+0.319
olph+0.310
 throats+0.297
round+0.296
Neg logit contributions
 two-0.410
 best-0.401
 exact-0.373
amed-0.370
 biggest-0.367
Token.
Token ID13
Feature activation+8.56e-01

Pos logit contributions
ho+0.019
 inspection+0.015
olph+0.015
 throats+0.014
keeping+0.014
Neg logit contributions
 best-0.018
 two-0.018
amed-0.017
 biggest-0.017
 exact-0.016
Token\n
Token ID198
Feature activation-4.84e+00

Pos logit contributions
ho+0.025
 inspection+0.020
olph+0.019
keeping+0.019
 throats+0.019
Neg logit contributions
 best-0.018
 two-0.017
 exact-0.016
 biggest-0.016
 certainly-0.016
Token\n
Token ID198
Feature activation-5.06e+00

Pos logit contributions
 best+0.096
 biggest+0.090
 exact+0.088
 nic+0.084
amed+0.082
Neg logit contributions
ho-0.147
 inspection-0.126
olph-0.121
keeping-0.115
aro-0.110
TokenSam
Token ID16305
Feature activation+2.68e+00

Pos logit contributions
 best+0.100
 biggest+0.093
 exact+0.091
 two+0.090
amed+0.086
Neg logit contributions
ho-0.155
 inspection-0.127
olph-0.125
keeping-0.119
round-0.117
Token,
Token ID11
Feature activation+1.32e+01

Pos logit contributions
ho+0.018
 inspection+0.014
olph+0.014
 throats+0.013
keeping+0.013
Neg logit contributions
 best-0.018
 two-0.017
 biggest-0.016
 exact-0.016
 most-0.015
Token she
Token ID673
Feature activation+3.89e+00

Pos logit contributions
ho+0.343
 inspection+0.267
olph+0.260
keeping+0.254
 throats+0.249
Neg logit contributions
 best-0.322
 two-0.316
 exact-0.297
 biggest-0.295
 certainly-0.283
Token saw
Token ID2497
Feature activation-6.38e+00

Pos logit contributions
ho+0.100
 inspection+0.074
olph+0.073
 throats+0.068
ipop+0.067
Neg logit contributions
 best-0.093
 exact-0.091
 biggest-0.090
ceed-0.087
amed-0.086
Token people
Token ID661
Feature activation+5.97e+00

Pos logit contributions
 best+0.093
 two+0.089
 biggest+0.082
 exact+0.081
amed+0.078
Neg logit contributions
ho-0.229
 inspection-0.192
olph-0.188
keeping-0.183
 throats-0.181
Token talking
Token ID3375
Feature activation-1.42e+00

Pos logit contributions
ho+0.161
 inspection+0.133
keeping+0.130
olph+0.129
 throats+0.126
Neg logit contributions
 best-0.141
 two-0.136
 exact-0.125
 biggest-0.121
 most-0.119
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
 best+0.032
 two+0.030
 biggest+0.028
amed+0.027
 exact+0.027
Neg logit contributions
ho-0.040
olph-0.033
 inspection-0.032
keeping-0.031
 throats-0.031
Token having
Token ID1719
Feature activation-6.79e+00

Pos logit contributions
ho+0.416
 inspection+0.320
olph+0.318
round+0.308
ipop+0.307
Neg logit contributions
 best-0.402
 two-0.397
 exact-0.364
 biggest-0.363
 certainly-0.353
Token fun
Token ID1257
Feature activation+2.39e+00

Pos logit contributions
 best+0.109
 exact+0.103
amed+0.102
 two+0.102
 biggest+0.100
Neg logit contributions
ho-0.222
 inspection-0.188
olph-0.179
keeping-0.178
 throats-0.173
Token,
Token ID11
Feature activation+1.34e+01

Pos logit contributions
ho+0.075
 inspection+0.060
olph+0.059
 throats+0.056
ipop+0.056
Neg logit contributions
 best-0.048
 two-0.045
 exact-0.042
 biggest-0.042
amed-0.040
Token but
Token ID475
Feature activation+8.36e+00

Pos logit contributions
ho+0.379
olph+0.311
 inspection+0.295
ipop+0.289
keeping+0.287
Neg logit contributions
 best-0.314
 two-0.297
 exact-0.278
 most-0.277
 biggest-0.275
Token Judy
Token ID26395
Feature activation+4.09e+00

Pos logit contributions
ho+0.189
olph+0.150
 inspection+0.149
keeping+0.144
round+0.143
Neg logit contributions
 best-0.232
 two-0.231
 most-0.205
 exact-0.205
 biggest-0.200
Token wanted
Token ID2227
Feature activation-6.03e+00

Pos logit contributions
 best+0.112
 two+0.106
 biggest+0.102
 exact+0.099
 nic+0.093
Neg logit contributions
ho-0.166
 inspection-0.139
olph-0.132
keeping-0.125
round-0.124
Token to
Token ID284
Feature activation+1.40e+01

Pos logit contributions
 best+0.184
 two+0.182
 biggest+0.163
 most+0.161
 certainly+0.155
Neg logit contributions
ho-0.136
olph-0.093
 inspection-0.091
oing-0.088
keeping-0.087
Token keep
Token ID1394
Feature activation-4.62e+00

Pos logit contributions
ho+0.395
 inspection+0.313
olph+0.304
keeping+0.291
 throats+0.288
Neg logit contributions
 best-0.309
 two-0.299
 exact-0.289
 biggest-0.285
amed-0.275
Token it
Token ID340
Feature activation+3.13e+00

Pos logit contributions
 best+0.119
 two+0.116
 biggest+0.105
 exact+0.102
 most+0.100
Neg logit contributions
ho-0.131
 inspection-0.100
olph-0.089
keeping-0.089
ipop-0.089
Token clean
Token ID3424
Feature activation+4.84e+00

Pos logit contributions
ho+0.092
 inspection+0.074
olph+0.072
 throats+0.068
keeping+0.068
Neg logit contributions
 best-0.068
 two-0.064
 exact-0.062
 biggest-0.061
amed-0.059
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
ho+0.136
 inspection+0.109
olph+0.105
 throats+0.099
keeping+0.099
Neg logit contributions
 best-0.109
 two-0.104
 exact-0.101
 biggest-0.100
amed-0.096
Token warm
Token ID5814
Feature activation-4.19e-01

Pos logit contributions
ho+0.447
 inspection+0.359
olph+0.341
keeping+0.331
 throats+0.328
Neg logit contributions
 best-0.360
 two-0.344
amed-0.336
 exact-0.335
 biggest-0.332
Token always
Token ID1464
Feature activation+3.82e+00

Pos logit contributions
 best+0.008
 two+0.008
 biggest+0.007
 exact+0.007
 most+0.007
Neg logit contributions
ho-0.013
 inspection-0.011
olph-0.010
ipop-0.010
anged-0.010
Token.
Token ID13
Feature activation+4.52e-01

Pos logit contributions
ho+0.097
olph+0.078
 inspection+0.078
keeping+0.076
 throats+0.073
Neg logit contributions
 best-0.082
ceed-0.081
 biggest-0.080
amed-0.080
 two-0.080
Token The
Token ID383
Feature activation-8.46e+00

Pos logit contributions
ho+0.014
 inspection+0.012
olph+0.012
keeping+0.011
 throats+0.011
Neg logit contributions
 best-0.008
 two-0.008
 exact-0.007
 biggest-0.007
amed-0.007
Token end
Token ID886
Feature activation-1.63e+00

Pos logit contributions
 best+0.172
 two+0.163
 exact+0.161
 biggest+0.158
amed+0.158
Neg logit contributions
ho-0.250
 inspection-0.201
olph-0.196
keeping-0.188
 throats-0.186
TokenShe
Token ID3347
Feature activation+1.06e+00

Pos logit contributions
ho+0.088
 inspection+0.070
olph+0.068
keeping+0.066
 throats+0.064
Neg logit contributions
 best-0.073
 two-0.070
 exact-0.068
 biggest-0.067
amed-0.066
Tokenlly
Token ID12810
Feature activation-2.17e+00

Pos logit contributions
ho+0.029
 inspection+0.023
olph+0.022
 throats+0.021
keeping+0.021
Neg logit contributions
 best-0.024
 exact-0.023
 biggest-0.023
 two-0.022
amed-0.022
Token thought
Token ID1807
Feature activation-3.35e+00

Pos logit contributions
 best+0.046
 two+0.044
 exact+0.043
 biggest+0.043
amed+0.042
Neg logit contributions
ho-0.062
 inspection-0.050
olph-0.048
keeping-0.047
 throats-0.046
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
 best+0.094
 two+0.091
 biggest+0.084
 exact+0.082
 certainly+0.082
Neg logit contributions
ho-0.078
olph-0.060
keeping-0.059
 throats-0.056
round-0.056
Token thought
Token ID1807
Feature activation-3.84e+00

Pos logit contributions
ho+0.440
 inspection+0.346
olph+0.337
keeping+0.326
 throats+0.320
Neg logit contributions
 best-0.376
 two-0.367
 exact-0.351
 biggest-0.346
amed-0.343
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
 best+0.097
 two+0.092
 biggest+0.085
 exact+0.083
 certainly+0.083
Neg logit contributions
ho-0.102
olph-0.081
keeping-0.080
round-0.076
 throats-0.075
Token then
Token ID788
Feature activation+8.44e+00

Pos logit contributions
ho+0.434
 inspection+0.341
olph+0.331
keeping+0.317
 throats+0.312
Neg logit contributions
 best-0.385
 two-0.370
 exact-0.361
 biggest-0.357
amed-0.351
Token she
Token ID673
Feature activation+3.12e+00

Pos logit contributions
ho+0.179
olph+0.129
 inspection+0.127
keeping+0.123
round+0.120
Neg logit contributions
 best-0.247
 two-0.246
 exact-0.229
 biggest-0.227
amed-0.224
Token had
Token ID550
Feature activation-4.92e+00

Pos logit contributions
ho+0.079
 inspection+0.059
olph+0.057
 throats+0.053
anged+0.053
Neg logit contributions
 best-0.078
 exact-0.075
 biggest-0.075
ceed-0.072
 two-0.071
Token an
Token ID281
Feature activation-7.40e+00

Pos logit contributions
 best+0.084
 exact+0.078
 biggest+0.078
amed+0.077
ceed+0.076
Neg logit contributions
ho-0.154
 inspection-0.130
keeping-0.123
olph-0.122
 throats-0.119
Token idea
Token ID2126
Feature activation+1.11e+00

Pos logit contributions
amed+0.100
ceed+0.098
 two+0.093
 best+0.091
 biggest+0.084
Neg logit contributions
ho-0.240
 inspection-0.214
olph-0.196
away-0.195
 Gig-0.194
Token why
Token ID1521
Feature activation+4.14e+00

Pos logit contributions
ho+0.185
 inspection+0.149
olph+0.146
keeping+0.139
 throats+0.137
Neg logit contributions
 best-0.138
 two-0.132
 exact-0.128
 biggest-0.127
amed-0.124
Token don
Token ID836
Feature activation-2.01e+00

Pos logit contributions
ho+0.111
 inspection+0.088
keeping+0.085
olph+0.085
round+0.081
Neg logit contributions
 best-0.100
 two-0.095
 biggest-0.090
 exact-0.089
 certainly-0.087
Token't
Token ID470
Feature activation+2.00e+00

Pos logit contributions
amed+0.052
 best+0.051
 exact+0.050
 biggest+0.049
 nic+0.047
Neg logit contributions
ho-0.047
olph-0.034
 inspection-0.034
 he-0.031
othe-0.031
Token you
Token ID345
Feature activation+1.15e+00

Pos logit contributions
ho+0.060
 inspection+0.051
keeping+0.047
olph+0.047
ipop+0.045
Neg logit contributions
 best-0.042
 two-0.040
 exact-0.036
 biggest-0.036
 most-0.034
Token go
Token ID467
Feature activation+2.49e+00

Pos logit contributions
ho+0.033
 inspection+0.027
olph+0.026
keeping+0.025
 throats+0.025
Neg logit contributions
 best-0.024
 two-0.023
 exact-0.023
 biggest-0.022
amed-0.022
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
ho+0.061
 inspection+0.044
olph+0.042
 throats+0.041
ipop+0.040
Neg logit contributions
 best-0.070
 two-0.068
 biggest-0.064
 exact-0.063
 most-0.061
Token hug
Token ID16225
Feature activation+1.19e-01

Pos logit contributions
ho+0.490
 inspection+0.403
olph+0.400
keeping+0.374
ipop+0.370
Neg logit contributions
 best-0.344
 two-0.335
 biggest-0.312
 exact-0.311
 nic-0.298
Token each
Token ID1123
Feature activation-8.31e-01

Pos logit contributions
ho+0.004
 inspection+0.003
olph+0.003
ipop+0.003
anged+0.003
Neg logit contributions
 best-0.003
 two-0.002
 biggest-0.002
 most-0.002
 exact-0.002
Token other
Token ID584
Feature activation-6.36e-01

Pos logit contributions
amed+0.012
 best+0.011
 exact+0.011
 certainly+0.011
ceed+0.010
Neg logit contributions
ho-0.029
 inspection-0.024
olph-0.024
round-0.023
keeping-0.023
Token and
Token ID290
Feature activation+1.54e+01

Pos logit contributions
 best+0.014
 exact+0.013
 two+0.013
amed+0.013
 biggest+0.013
Neg logit contributions
ho-0.018
 inspection-0.014
olph-0.014
keeping-0.013
 throats-0.013
Token say
Token ID910
Feature activation-6.35e+00

Pos logit contributions
ho+0.468
 inspection+0.384
olph+0.373
keeping+0.349
round+0.346
Neg logit contributions
 best-0.316
 two-0.302
 exact-0.283
 biggest-0.282
amed-0.272
Token he
Token ID339
Feature activation+4.89e+00

Pos logit contributions
 best+0.100
 two+0.096
 exact+0.094
 biggest+0.093
amed+0.090
Neg logit contributions
ho-0.102
olph-0.077
 inspection-0.076
keeping-0.076
 throats-0.074
Token saw
Token ID2497
Feature activation-6.61e+00

Pos logit contributions
ho+0.122
 inspection+0.095
olph+0.089
keeping+0.081
anged+0.080
Neg logit contributions
 best-0.123
 two-0.116
 biggest-0.115
 exact-0.115
ceed-0.113
Token trees
Token ID7150
Feature activation+9.36e+00

Pos logit contributions
 best+0.120
 two+0.113
 exact+0.110
 biggest+0.108
amed+0.106
Neg logit contributions
ho-0.212
 inspection-0.174
olph-0.170
keeping-0.164
 throats-0.162
Token and
Token ID290
Feature activation+1.64e+01

Pos logit contributions
ho+0.245
olph+0.206
 inspection+0.198
keeping+0.194
 throats+0.192
Neg logit contributions
 best-0.225
 two-0.222
 exact-0.208
 biggest-0.203
 most-0.195
Token mountains
Token ID12269
Feature activation+6.82e+00

Pos logit contributions
ho+0.497
olph+0.406
 inspection+0.393
ipop+0.388
keeping+0.387
Neg logit contributions
 two-0.340
 best-0.330
 biggest-0.307
 exact-0.305
 most-0.294
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
ho+0.188
olph+0.157
 inspection+0.152
 throats+0.151
ipop+0.146
Neg logit contributions
 best-0.151
 two-0.146
 exact-0.138
 biggest-0.136
amed-0.134
Token rivers
Token ID18180
Feature activation+3.46e+00

Pos logit contributions
ho+0.457
 inspection+0.364
olph+0.355
keeping+0.340
 throats+0.335
Neg logit contributions
 best-0.360
 two-0.345
 exact-0.336
 biggest-0.333
amed-0.325
Token.
Token ID13
Feature activation+4.32e-01

Pos logit contributions
ho+0.099
olph+0.080
 inspection+0.080
 throats+0.077
keeping+0.075
Neg logit contributions
 best-0.073
 two-0.071
 exact-0.067
 biggest-0.066
amed-0.066
Token And
Token ID843
Feature activation+7.38e+00

Pos logit contributions
ho+0.012
 inspection+0.010
olph+0.010
keeping+0.009
 throats+0.009
Neg logit contributions
 best-0.009
 two-0.009
 exact-0.009
 biggest-0.009
amed-0.009
Token then
Token ID788
Feature activation+7.98e+00

Pos logit contributions
ho+0.178
olph+0.147
round+0.139
 inspection+0.134
 throats+0.134
Neg logit contributions
 best-0.205
 two-0.199
 most-0.187
 exact-0.181
 biggest-0.181
Token he
Token ID339
Feature activation+3.49e+00

Pos logit contributions
ho+0.178
olph+0.134
keeping+0.129
round+0.127
 throats+0.126
Neg logit contributions
 best-0.235
 two-0.231
 exact-0.209
 biggest-0.209
 most-0.205
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
ho+0.394
keeping+0.345
 inspection+0.340
away+0.325
olph+0.320
Neg logit contributions
 two-0.271
 best-0.246
 most-0.218
 nic-0.211
 teaches-0.211
Token Many
Token ID4650
Feature activation-7.51e+00

Pos logit contributions
ho+0.004
 inspection+0.003
olph+0.003
keeping+0.003
 throats+0.003
Neg logit contributions
 best-0.003
 two-0.003
 exact-0.002
 biggest-0.002
amed-0.002
Token cars
Token ID5006
Feature activation+5.17e+00

Pos logit contributions
 best+0.141
 two+0.134
 biggest+0.121
 most+0.115
 exact+0.113
Neg logit contributions
ho-0.248
 inspection-0.203
keeping-0.201
olph-0.199
round-0.191
Token were
Token ID547
Feature activation-8.28e+00

Pos logit contributions
ho+0.151
 inspection+0.125
olph+0.120
keeping+0.119
 throats+0.117
Neg logit contributions
 best-0.106
 two-0.103
 exact-0.094
 biggest-0.094
amed-0.094
Token stuck
Token ID7819
Feature activation-1.90e+00

Pos logit contributions
 best+0.189
 two+0.180
 exact+0.175
 biggest+0.174
amed+0.171
Neg logit contributions
ho-0.227
 inspection-0.181
olph-0.175
keeping-0.168
 throats-0.165
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
 best+0.048
 two+0.046
 exact+0.045
 biggest+0.045
amed+0.044
Neg logit contributions
ho-0.047
 inspection-0.036
olph-0.035
keeping-0.033
 throats-0.032
Token could
Token ID714
Feature activation+1.74e+00

Pos logit contributions
ho+0.452
 inspection+0.370
olph+0.362
keeping+0.352
round+0.351
Neg logit contributions
 best-0.356
 two-0.354
 biggest-0.323
 exact-0.322
amed-0.313
Token not
Token ID407
Feature activation+2.12e-01

Pos logit contributions
ho+0.049
 inspection+0.039
olph+0.038
keeping+0.036
 throats+0.036
Neg logit contributions
 best-0.038
 two-0.036
 exact-0.036
 biggest-0.036
amed-0.035
Token move
Token ID1445
Feature activation-3.47e+00

Pos logit contributions
ho+0.006
 inspection+0.004
olph+0.004
keeping+0.004
 throats+0.004
Neg logit contributions
 best-0.005
 two-0.005
 exact-0.005
amed-0.005
 biggest-0.005
Token.
Token ID13
Feature activation+5.16e-01

Pos logit contributions
 best+0.069
 two+0.066
 exact+0.061
 biggest+0.061
 most+0.056
Neg logit contributions
ho-0.107
 inspection-0.087
olph-0.086
 throats-0.083
ipop-0.082
Token Lily
Token ID20037
Feature activation-1.91e+00

Pos logit contributions
ho+0.014
 inspection+0.011
olph+0.011
keeping+0.010
 throats+0.010
Neg logit contributions
 best-0.012
 two-0.011
 exact-0.011
 biggest-0.011
amed-0.011
TokenSarah
Token ID29284
Feature activation+1.82e+00

Pos logit contributions
 best+0.188
 two+0.178
 certainly+0.166
 biggest+0.160
 teaches+0.156
Neg logit contributions
ho-0.191
keeping-0.168
 inspection-0.165
olph-0.161
round-0.150
Token saw
Token ID2497
Feature activation-6.63e+00

Pos logit contributions
ho+0.050
 inspection+0.039
olph+0.039
keeping+0.037
 throats+0.036
Neg logit contributions
 best-0.040
 biggest-0.038
 exact-0.038
 two-0.037
ceed-0.037
Token what
Token ID644
Feature activation+2.29e-01

Pos logit contributions
 best+0.124
 two+0.117
 exact+0.114
 biggest+0.113
amed+0.110
Neg logit contributions
ho-0.208
 inspection-0.170
olph-0.166
keeping-0.160
 throats-0.158
Token was
Token ID373
Feature activation-7.40e+00

Pos logit contributions
ho+0.006
 inspection+0.005
olph+0.004
keeping+0.004
 throats+0.004
Neg logit contributions
 best-0.005
 two-0.005
 exact-0.005
 biggest-0.005
amed-0.005
Token happening
Token ID5836
Feature activation-4.04e-01

Pos logit contributions
 best+0.143
 two+0.136
 exact+0.132
 biggest+0.131
amed+0.128
Neg logit contributions
ho-0.229
 inspection-0.187
olph-0.182
keeping-0.176
 throats-0.173
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
 best+0.010
 two+0.009
 exact+0.009
 biggest+0.009
amed+0.009
Neg logit contributions
ho-0.011
 inspection-0.008
olph-0.008
 throats-0.008
keeping-0.008
Token yelled
Token ID22760
Feature activation-6.71e+00

Pos logit contributions
ho+0.449
olph+0.359
 inspection+0.349
keeping+0.343
round+0.339
Neg logit contributions
 two-0.366
 best-0.362
 exact-0.338
 biggest-0.333
amed-0.330
Token at
Token ID379
Feature activation+5.28e+00

Pos logit contributions
 best+0.146
 two+0.142
 biggest+0.127
amed+0.127
 exact+0.125
Neg logit contributions
ho-0.194
 inspection-0.158
olph-0.154
keeping-0.154
round-0.145
Token the
Token ID262
Feature activation-5.78e+00

Pos logit contributions
ho+0.140
 inspection+0.109
olph+0.105
 throats+0.100
keeping+0.100
Neg logit contributions
 best-0.126
 two-0.124
 exact-0.118
 biggest-0.116
amed-0.114
Token new
Token ID649
Feature activation-4.93e+00

Pos logit contributions
 best+0.136
 two+0.131
 exact+0.128
 biggest+0.127
amed+0.122
Neg logit contributions
ho-0.153
 inspection-0.123
olph-0.118
keeping-0.113
 throats-0.111
Token boy
Token ID2933
Feature activation+4.67e+00

Pos logit contributions
 best+0.121
 two+0.118
 biggest+0.113
 exact+0.110
 most+0.105
Neg logit contributions
ho-0.127
 inspection-0.105
olph-0.099
 throats-0.097
keeping-0.094
Token find
Token ID1064
Feature activation-2.81e+00

Pos logit contributions
ho+0.425
olph+0.352
 inspection+0.350
ipop+0.339
round+0.337
Neg logit contributions
 best-0.324
 two-0.322
 teaches-0.287
amed-0.282
 exact-0.281
Token it
Token ID340
Feature activation+3.52e+00

Pos logit contributions
 best+0.056
 exact+0.053
 two+0.053
amed+0.052
 biggest+0.051
Neg logit contributions
ho-0.082
 inspection-0.068
olph-0.065
keeping-0.064
 throats-0.061
Token.
Token ID13
Feature activation+9.75e-01

Pos logit contributions
ho+0.090
 inspection+0.071
olph+0.065
keeping+0.065
 throats+0.061
Neg logit contributions
 best-0.081
amed-0.079
 exact-0.079
 two-0.079
ceed-0.078
Token He
Token ID679
Feature activation+1.49e+00

Pos logit contributions
ho+0.029
 inspection+0.023
olph+0.022
keeping+0.022
 throats+0.021
Neg logit contributions
 best-0.020
 two-0.019
 exact-0.019
 biggest-0.019
amed-0.018
Token searched
Token ID16499
Feature activation-1.58e+00

Pos logit contributions
ho+0.041
 inspection+0.033
olph+0.032
keeping+0.030
 throats+0.030
Neg logit contributions
 best-0.034
 two-0.032
 exact-0.031
 biggest-0.031
amed-0.031
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
 best+0.036
 two+0.034
 biggest+0.031
 exact+0.030
 most+0.030
Neg logit contributions
ho-0.047
olph-0.038
keeping-0.038
round-0.037
 throats-0.036
Token searched
Token ID16499
Feature activation-1.66e+00

Pos logit contributions
ho+0.463
olph+0.370
keeping+0.363
 throats+0.359
round+0.356
Neg logit contributions
 best-0.352
 two-0.351
 exact-0.336
 biggest-0.329
amed-0.322
Token,
Token ID11
Feature activation+1.33e+01

Pos logit contributions
 best+0.042
 two+0.040
 biggest+0.038
 exact+0.038
 most+0.036
Neg logit contributions
ho-0.043
olph-0.033
keeping-0.032
round-0.032
 throats-0.032
Token but
Token ID475
Feature activation+8.44e+00

Pos logit contributions
ho+0.389
olph+0.319
 throats+0.313
round+0.311
keeping+0.311
Neg logit contributions
 best-0.290
 two-0.282
 exact-0.267
 most-0.265
 biggest-0.260
Token it
Token ID340
Feature activation+2.30e+00

Pos logit contributions
ho+0.173
round+0.163
keeping+0.150
oing+0.145
olph+0.142
Neg logit contributions
 two-0.243
 best-0.239
 most-0.221
 biggest-0.205
 exact-0.202
Token was
Token ID373
Feature activation-8.07e+00

Pos logit contributions
ho+0.071
 inspection+0.058
olph+0.056
keeping+0.054
 throats+0.053
Neg logit contributions
 best-0.044
 two-0.042
 exact-0.041
 biggest-0.041
amed-0.040
Token Ralph
Token ID20993
Feature activation+6.39e+00

Pos logit contributions
 best+0.000
 two+0.000
 biggest+0.000
 exact+0.000
 certainly+0.000
Neg logit contributions
ho-0.000
 inspection-0.000
olph-0.000
keeping-0.000
 throats-0.000
Token was
Token ID373
Feature activation-7.13e+00

Pos logit contributions
ho+0.185
 inspection+0.148
olph+0.144
keeping+0.139
 throats+0.136
Neg logit contributions
 best-0.134
 two-0.128
 exact-0.125
 biggest-0.124
amed-0.122
Token trying
Token ID2111
Feature activation+1.72e+00

Pos logit contributions
 best+0.147
 two+0.140
 biggest+0.139
 exact+0.139
amed+0.137
Neg logit contributions
ho-0.201
 inspection-0.160
olph-0.156
keeping-0.152
 throats-0.144
Token to
Token ID284
Feature activation+1.35e+01

Pos logit contributions
ho+0.037
olph+0.026
 inspection+0.024
keeping+0.023
round+0.023
Neg logit contributions
 best-0.053
 two-0.052
 exact-0.049
 biggest-0.049
 certainly-0.047
Token do
Token ID466
Feature activation-1.02e+00

Pos logit contributions
ho+0.371
 inspection+0.296
olph+0.285
keeping+0.278
 throats+0.270
Neg logit contributions
 best-0.302
 two-0.281
 exact-0.280
 biggest-0.280
amed-0.280
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
 best+0.023
 two+0.022
 exact+0.021
 biggest+0.021
amed+0.020
Neg logit contributions
ho-0.028
 inspection-0.022
olph-0.022
keeping-0.021
 throats-0.021
Token said
Token ID531
Feature activation-5.77e+00

Pos logit contributions
ho+0.440
 inspection+0.344
olph+0.342
keeping+0.324
 throats+0.317
Neg logit contributions
 best-0.379
 two-0.373
 exact-0.356
 biggest-0.347
 certainly-0.341
Token,
Token ID11
Feature activation+1.31e+01

Pos logit contributions
 best+0.152
 two+0.149
 biggest+0.135
 exact+0.134
 certainly+0.132
Neg logit contributions
ho-0.140
keeping-0.109
olph-0.109
 inspection-0.105
round-0.103
Token "
Token ID366
Feature activation-2.25e+00

Pos logit contributions
ho+0.418
 inspection+0.343
olph+0.336
keeping+0.327
 throats+0.321
Neg logit contributions
 best-0.239
 two-0.229
 exact-0.219
 biggest-0.216
amed-0.211
TokenLet
Token ID5756
Feature activation+4.23e-01

Pos logit contributions
 best+0.043
 biggest+0.039
 two+0.039
 exact+0.038
amed+0.037
Neg logit contributions
ho-0.071
 inspection-0.058
olph-0.057
keeping-0.055
 throats-0.052
Token me
Token ID502
Feature activation-2.93e-01

Pos logit contributions
ho+0.013
 inspection+0.011
olph+0.010
keeping+0.010
 yacht+0.010
Neg logit contributions
amed-0.007
 best-0.007
 exact-0.007
 biggest-0.007
ceed-0.006
Token liked
Token ID8288
Feature activation-6.24e+00

Pos logit contributions
ho+0.027
 inspection+0.021
olph+0.021
 throats+0.019
ipop+0.018
Neg logit contributions
 best-0.023
 exact-0.023
 biggest-0.023
ceed-0.021
 nic-0.020
Token to
Token ID284
Feature activation+1.35e+01

Pos logit contributions
 best+0.170
 two+0.168
 biggest+0.151
 most+0.151
 certainly+0.139
Neg logit contributions
ho-0.155
anged-0.117
 spine-0.111
keeping-0.110
 inspection-0.110
Token play
Token ID711
Feature activation-1.67e+00

Pos logit contributions
ho+0.368
 inspection+0.288
olph+0.281
keeping+0.268
 throats+0.261
Neg logit contributions
 best-0.314
 two-0.308
 exact-0.297
 biggest-0.290
 certainly-0.282
Token with
Token ID351
Feature activation-5.30e-01

Pos logit contributions
 best+0.042
 two+0.041
 exact+0.039
 biggest+0.039
amed+0.038
Neg logit contributions
ho-0.042
 inspection-0.033
olph-0.031
keeping-0.030
 throats-0.030
Token cars
Token ID5006
Feature activation+5.54e+00

Pos logit contributions
 best+0.012
 two+0.012
 biggest+0.011
 most+0.010
 teaches+0.010
Neg logit contributions
ho-0.016
 inspection-0.012
keeping-0.012
olph-0.012
 spine-0.011
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
ho+0.137
 inspection+0.106
olph+0.101
keeping+0.097
 throats+0.095
Neg logit contributions
 best-0.143
 two-0.138
 exact-0.133
 biggest-0.132
amed-0.131
Token robots
Token ID14193
Feature activation+6.70e+00

Pos logit contributions
ho+0.407
ipop+0.303
olph+0.298
 inspection+0.297
keeping+0.290
Neg logit contributions
 two-0.435
 best-0.427
 biggest-0.385
 teaches-0.382
 most-0.374
Token.
Token ID13
Feature activation+7.61e-01

Pos logit contributions
ho+0.195
 inspection+0.157
olph+0.152
keeping+0.147
 throats+0.144
Neg logit contributions
 best-0.141
 two-0.134
 exact-0.131
 biggest-0.129
amed-0.127
Token He
Token ID679
Feature activation+1.47e+00

Pos logit contributions
ho+0.022
 inspection+0.018
olph+0.018
keeping+0.017
 throats+0.017
Neg logit contributions
 best-0.015
amed-0.015
 exact-0.014
 biggest-0.014
 two-0.014
Token saw
Token ID2497
Feature activation-6.88e+00

Pos logit contributions
ho+0.040
olph+0.031
 inspection+0.031
 throats+0.028
round+0.028
Neg logit contributions
 best-0.033
 biggest-0.031
 exact-0.031
amed-0.031
 two-0.030
Token Angel
Token ID3905
Feature activation+4.85e+00

Pos logit contributions
 best+0.114
 exact+0.107
 two+0.105
 biggest+0.102
amed+0.102
Neg logit contributions
ho-0.228
 inspection-0.190
olph-0.185
keeping-0.178
 throats-0.176
Token smiled
Token ID13541
Feature activation-9.25e+00

Pos logit contributions
ho+0.176
 inspection+0.147
keeping+0.143
 throats+0.138
olph+0.137
Neg logit contributions
 two-0.125
 best-0.124
amed-0.116
 certainly-0.113
 exact-0.111
Token.
Token ID13
Feature activation-1.80e+00

Pos logit contributions
 best+0.258
 exact+0.254
 biggest+0.248
 two+0.247
ceed+0.246
Neg logit contributions
ho-0.200
 inspection-0.144
olph-0.136
 throats-0.128
keeping-0.123
Token
Token ID198
Feature activation-7.05e+00

Pos logit contributions
 best+0.035
 exact+0.034
 biggest+0.033
amed+0.033
ceed+0.032
Neg logit contributions
ho-0.054
 inspection-0.045
olph-0.042
 throats-0.040
 Gig-0.040
Token
Token ID198
Feature activation-7.07e+00

Pos logit contributions
 best+0.169
 exact+0.165
 nic+0.165
 biggest+0.162
 coolest+0.151
Neg logit contributions
ho-0.182
 inspection-0.157
olph-0.143
keeping-0.136
 Gig-0.135
Token"
Token ID1
Feature activation-1.93e+00

Pos logit contributions
 best+0.167
 biggest+0.158
 nic+0.153
 exact+0.149
 coolest+0.144
Neg logit contributions
ho-0.195
olph-0.150
 inspection-0.148
away-0.140
round-0.139
TokenLooks
Token ID41102
Feature activation-1.93e+01

Pos logit contributions
 best+0.043
 exact+0.041
 biggest+0.039
amed+0.038
 correct+0.037
Neg logit contributions
ho-0.055
olph-0.042
 inspection-0.040
round-0.039
 crab-0.038
Token like
Token ID588
Feature activation-1.30e+00

Pos logit contributions
amed+0.464
ceed+0.451
 exact+0.442
 two+0.429
 best+0.429
Neg logit contributions
ho-0.405
 inspection-0.318
 throats-0.310
 Gig-0.289
 tongues-0.289
Token the
Token ID262
Feature activation-7.10e+00

Pos logit contributions
 best+0.028
 two+0.027
 biggest+0.025
 exact+0.024
 certainly+0.023
Neg logit contributions
ho-0.038
 inspection-0.031
olph-0.030
keeping-0.030
 throats-0.029
Token frog
Token ID21264
Feature activation+3.14e+00

Pos logit contributions
amed+0.215
 exact+0.208
 best+0.207
ceed+0.206
 certainly+0.202
Neg logit contributions
ho-0.136
 inspection-0.091
olph-0.090
 crab-0.082
ipop-0.080
Token likes
Token ID7832
Feature activation-7.76e+00

Pos logit contributions
ho+0.087
 inspection+0.071
olph+0.067
keeping+0.066
 throats+0.064
Neg logit contributions
 best-0.071
 two-0.068
 biggest-0.064
 exact-0.064
 certainly-0.062
Token you
Token ID345
Feature activation-1.08e+00

Pos logit contributions
 best+0.187
 two+0.182
 biggest+0.169
 most+0.161
 exact+0.160
Neg logit contributions
ho-0.203
 inspection-0.163
olph-0.160
keeping-0.157
anged-0.152
Token table
Token ID3084
Feature activation+1.15e+00

Pos logit contributions
 best+0.149
 two+0.143
 exact+0.139
 biggest+0.139
amed+0.132
Neg logit contributions
ho-0.187
 inspection-0.151
olph-0.146
keeping-0.138
 throats-0.137
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
ho+0.035
 inspection+0.029
olph+0.028
keeping+0.026
 throats+0.026
Neg logit contributions
 best-0.023
 two-0.022
 exact-0.021
 biggest-0.020
amed-0.020
Token
Token ID198
Feature activation-6.62e+00

Pos logit contributions
 best+0.034
 two+0.032
 exact+0.032
 biggest+0.031
amed+0.031
Neg logit contributions
ho-0.035
 inspection-0.027
olph-0.026
keeping-0.025
 throats-0.025
Token
Token ID198
Feature activation-6.66e+00

Pos logit contributions
 best+0.143
 exact+0.136
 biggest+0.133
 nic+0.133
amed+0.127
Neg logit contributions
ho-0.187
 inspection-0.159
olph-0.153
keeping-0.145
othe-0.141
Token"
Token ID1
Feature activation-1.80e+00

Pos logit contributions
 best+0.122
 biggest+0.113
 exact+0.111
 nic+0.107
 certainly+0.105
Neg logit contributions
ho-0.215
 inspection-0.177
olph-0.176
keeping-0.165
round-0.160
TokenLooks
Token ID41102
Feature activation-1.92e+01

Pos logit contributions
 best+0.036
 biggest+0.033
 exact+0.033
 two+0.032
 certainly+0.032
Neg logit contributions
ho-0.054
olph-0.044
 inspection-0.044
keeping-0.042
 throats-0.041
Token like
Token ID588
Feature activation-9.32e-01

Pos logit contributions
amed+0.433
ceed+0.426
 exact+0.423
 best+0.415
 certainly+0.409
Neg logit contributions
ho-0.429
 throats-0.344
 inspection-0.343
 tongues-0.316
 Gig-0.307
Token mom
Token ID1995
Feature activation+9.37e+00

Pos logit contributions
 best+0.020
 two+0.020
 biggest+0.018
 exact+0.018
 teaches+0.017
Neg logit contributions
ho-0.028
 inspection-0.022
olph-0.022
keeping-0.021
round-0.021
Token has
Token ID468
Feature activation-7.08e+00

Pos logit contributions
ho+0.240
 inspection+0.187
olph+0.182
keeping+0.173
 throats+0.169
Neg logit contributions
 best-0.226
 two-0.216
 exact-0.215
 biggest-0.213
amed-0.210
Token some
Token ID617
Feature activation-6.61e+00

Pos logit contributions
 best+0.138
 exact+0.130
 two+0.129
 biggest+0.127
amed+0.126
Neg logit contributions
ho-0.212
 inspection-0.173
olph-0.169
keeping-0.165
 throats-0.163
Token mail
Token ID6920
Feature activation+2.96e-01

Pos logit contributions
 best+0.165
 two+0.158
 exact+0.152
 biggest+0.152
amed+0.148
Neg logit contributions
ho-0.171
 inspection-0.134
olph-0.128
keeping-0.124
 throats-0.119
Token laughed
Token ID13818
Feature activation-8.01e+00

Pos logit contributions
ho+0.052
 inspection+0.041
olph+0.040
keeping+0.038
 throats+0.038
Neg logit contributions
 best-0.041
 two-0.039
 exact-0.038
 biggest-0.038
amed-0.037
Token and
Token ID290
Feature activation+1.41e+01

Pos logit contributions
 best+0.229
 exact+0.227
 biggest+0.222
 two+0.221
amed+0.218
Neg logit contributions
ho-0.161
 inspection-0.116
olph-0.110
 throats-0.100
round-0.096
Token said
Token ID531
Feature activation-7.37e+00

Pos logit contributions
ho+0.397
 inspection+0.317
olph+0.306
keeping+0.296
 throats+0.291
Neg logit contributions
 best-0.312
 two-0.299
 exact-0.290
 biggest-0.287
amed-0.283
Token,
Token ID11
Feature activation+1.16e+01

Pos logit contributions
 best+0.178
 two+0.173
 biggest+0.154
 certainly+0.151
 most+0.146
Neg logit contributions
ho-0.190
keeping-0.171
olph-0.151
anged-0.149
round-0.147
Token "
Token ID366
Feature activation-3.69e+00

Pos logit contributions
ho+0.394
 inspection+0.329
keeping+0.327
olph+0.323
 throats+0.314
Neg logit contributions
 best-0.182
 two-0.177
 exact-0.162
 biggest-0.160
 certainly-0.155
TokenLooks
Token ID41102
Feature activation-1.91e+01

Pos logit contributions
 best+0.081
 biggest+0.073
 two+0.072
 exact+0.070
 correct+0.067
Neg logit contributions
ho-0.103
round-0.080
olph-0.080
 inspection-0.080
keeping-0.074
Token like
Token ID588
Feature activation-7.83e-01

Pos logit contributions
amed+0.451
 exact+0.438
ceed+0.436
 two+0.436
 best+0.424
Neg logit contributions
ho-0.414
 inspection-0.327
 throats-0.319
 tongues-0.298
 Gig-0.296
Token that
Token ID326
Feature activation+2.96e+00

Pos logit contributions
 best+0.017
 two+0.016
 biggest+0.015
 exact+0.014
 certainly+0.014
Neg logit contributions
ho-0.024
 inspection-0.019
olph-0.019
keeping-0.019
anged-0.018
Token apple
Token ID17180
Feature activation+5.70e+00

Pos logit contributions
ho+0.078
 inspection+0.061
olph+0.058
keeping+0.056
 throats+0.055
Neg logit contributions
 best-0.068
amed-0.066
 exact-0.066
 two-0.066
 biggest-0.064
Token made
Token ID925
Feature activation-8.58e+00

Pos logit contributions
ho+0.166
 inspection+0.136
olph+0.132
keeping+0.128
 throats+0.124
Neg logit contributions
 best-0.122
 two-0.115
 biggest-0.110
 exact-0.109
 certainly-0.106
Token you
Token ID345
Feature activation-8.06e-01

Pos logit contributions
 exact+0.218
 best+0.210
amed+0.210
 biggest+0.203
 nic+0.200
Neg logit contributions
ho-0.194
 inspection-0.147
olph-0.140
 throats-0.136
keeping-0.134
Token's
Token ID338
Feature activation-6.30e+00

Pos logit contributions
ho+0.018
 inspection+0.015
olph+0.014
keeping+0.014
 throats+0.014
Neg logit contributions
 best-0.013
 two-0.013
 exact-0.012
amed-0.012
 biggest-0.012
Token mom
Token ID1995
Feature activation+9.70e+00

Pos logit contributions
 best+0.159
 two+0.152
 exact+0.150
 biggest+0.148
amed+0.147
Neg logit contributions
ho-0.157
 inspection-0.121
olph-0.117
keeping-0.111
 throats-0.110
Tokenmy
Token ID1820
Feature activation+2.14e+00

Pos logit contributions
ho+0.330
 inspection+0.277
olph+0.269
keeping+0.262
 throats+0.260
Neg logit contributions
 best-0.155
 two-0.148
 exact-0.140
 biggest-0.137
amed-0.135
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
ho+0.063
 inspection+0.052
olph+0.050
keeping+0.049
 throats+0.048
Neg logit contributions
 best-0.044
 two-0.043
 exact-0.040
amed-0.039
 biggest-0.038
Token "
Token ID366
Feature activation-3.59e+00

Pos logit contributions
 best+0.027
 exact+0.025
 two+0.025
 biggest+0.025
amed+0.024
Neg logit contributions
ho-0.044
 inspection-0.036
olph-0.035
keeping-0.033
 throats-0.033
TokenLooks
Token ID41102
Feature activation-1.89e+01

Pos logit contributions
 best+0.055
 two+0.052
 exact+0.050
 biggest+0.049
amed+0.048
Neg logit contributions
ho-0.125
 inspection-0.105
olph-0.102
keeping-0.099
 throats-0.098
Token like
Token ID588
Feature activation-6.71e-01

Pos logit contributions
amed+0.403
ceed+0.393
 exact+0.390
 best+0.387
 two+0.386
Neg logit contributions
ho-0.461
 inspection-0.367
 throats-0.365
 tongues-0.340
othe-0.330
Token the
Token ID262
Feature activation-6.71e+00

Pos logit contributions
 best+0.016
 two+0.015
 biggest+0.014
 exact+0.014
 most+0.014
Neg logit contributions
ho-0.019
 inspection-0.015
olph-0.015
round-0.015
keeping-0.015
Token wave
Token ID6769
Feature activation-3.59e+00

Pos logit contributions
 best+0.180
amed+0.178
ceed+0.174
 exact+0.174
 two+0.174
Neg logit contributions
ho-0.147
 inspection-0.109
olph-0.101
keeping-0.096
 throats-0.096
Token destroyed
Token ID6572
Feature activation-4.06e+00

Pos logit contributions
 best+0.078
 two+0.074
 exact+0.071
 biggest+0.071
amed+0.068
Neg logit contributions
ho-0.102
 inspection-0.083
olph-0.081
keeping-0.078
 throats-0.077
Token all
Token ID477
Feature activation+9.11e-01

Pos logit contributions
 best+0.083
 two+0.080
 exact+0.076
 biggest+0.075
amed+0.071
Neg logit contributions
ho-0.122
 inspection-0.100
olph-0.097
 throats-0.092
keeping-0.092
Token smiled
Token ID13541
Feature activation-8.27e+00

Pos logit contributions
ho+0.248
 inspection+0.193
olph+0.186
keeping+0.178
 throats+0.175
Neg logit contributions
 best-0.236
 two-0.227
 exact-0.222
 biggest-0.220
amed-0.217
Token and
Token ID290
Feature activation+1.45e+01

Pos logit contributions
 best+0.228
 exact+0.223
 biggest+0.219
ceed+0.218
 two+0.217
Neg logit contributions
ho-0.180
 inspection-0.131
olph-0.124
 throats-0.117
keeping-0.115
Token said
Token ID531
Feature activation-6.75e+00

Pos logit contributions
ho+0.429
 inspection+0.349
olph+0.337
keeping+0.332
 throats+0.322
Neg logit contributions
 best-0.297
 two-0.288
 exact-0.270
 biggest-0.269
amed-0.268
Token,
Token ID11
Feature activation+1.18e+01

Pos logit contributions
 best+0.185
 two+0.179
 biggest+0.170
 exact+0.169
amed+0.167
Neg logit contributions
ho-0.154
 inspection-0.116
olph-0.113
keeping-0.113
 throats-0.107
Token "
Token ID366
Feature activation-3.40e+00

Pos logit contributions
ho+0.387
 inspection+0.321
olph+0.312
keeping+0.305
 throats+0.299
Neg logit contributions
 best-0.203
 two-0.194
 exact-0.186
 biggest-0.183
amed-0.179
TokenLooks
Token ID41102
Feature activation-1.87e+01

Pos logit contributions
 best+0.067
 two+0.060
 biggest+0.060
amed+0.059
 exact+0.058
Neg logit contributions
ho-0.104
 inspection-0.082
olph-0.080
round-0.078
keeping-0.077
Token like
Token ID588
Feature activation-2.89e-01

Pos logit contributions
amed+0.437
ceed+0.422
 two+0.418
 exact+0.414
 best+0.412
Neg logit contributions
ho-0.424
 inspection-0.336
 throats-0.322
 Gig-0.303
 tongues-0.301
Token you
Token ID345
Feature activation-3.60e-01

Pos logit contributions
 best+0.006
 two+0.006
 biggest+0.006
 exact+0.006
 certainly+0.006
Neg logit contributions
ho-0.009
 inspection-0.007
olph-0.007
keeping-0.007
round-0.007
Token like
Token ID588
Feature activation-2.59e-01

Pos logit contributions
 best+0.008
 exact+0.008
 biggest+0.007
amed+0.007
ceed+0.007
Neg logit contributions
ho-0.010
 inspection-0.008
olph-0.008
 throats-0.007
ipop-0.007
Token the
Token ID262
Feature activation-6.30e+00

Pos logit contributions
 best+0.006
 two+0.006
 biggest+0.005
 certainly+0.005
 exact+0.005
Neg logit contributions
ho-0.007
 inspection-0.006
anged-0.006
olph-0.006
round-0.005
Token wardrobe
Token ID35593
Feature activation+6.73e+00

Pos logit contributions
amed+0.110
 best+0.105
ceed+0.104
 two+0.103
 exact+0.101
Neg logit contributions
ho-0.195
 inspection-0.158
olph-0.154
keeping-0.151
 throats-0.147
Token the
Token ID262
Feature activation-6.28e+00

Pos logit contributions
ho+0.424
 inspection+0.342
keeping+0.333
olph+0.324
 throats+0.313
Neg logit contributions
 best-0.318
 two-0.310
 biggest-0.278
amed-0.277
 certainly-0.275
Token owner
Token ID4870
Feature activation+6.77e+00

Pos logit contributions
 exact+0.209
amed+0.204
 best+0.204
 certainly+0.201
ceed+0.200
Neg logit contributions
ho-0.097
olph-0.057
 inspection-0.056
 throats-0.049
 crab-0.048
Token smiled
Token ID13541
Feature activation-8.24e+00

Pos logit contributions
ho+0.185
 inspection+0.149
olph+0.142
keeping+0.140
 throats+0.136
Neg logit contributions
 best-0.155
 two-0.149
 exact-0.142
 biggest-0.141
amed-0.141
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
 exact+0.193
 best+0.193
ceed+0.190
 biggest+0.185
 teaches+0.185
Neg logit contributions
ho-0.209
 inspection-0.159
olph-0.153
 throats-0.144
keeping-0.140
Token "
Token ID366
Feature activation-3.31e+00

Pos logit contributions
 best+0.024
 exact+0.023
 two+0.023
 biggest+0.022
amed+0.022
Neg logit contributions
ho-0.032
 inspection-0.026
olph-0.025
keeping-0.024
 throats-0.024
TokenLooks
Token ID41102
Feature activation-1.85e+01

Pos logit contributions
 best+0.063
 two+0.057
 biggest+0.056
amed+0.055
 exact+0.055
Neg logit contributions
ho-0.102
olph-0.080
 inspection-0.079
round-0.078
 Gig-0.075
Token like
Token ID588
Feature activation-7.63e-02

Pos logit contributions
amed+0.446
ceed+0.434
 exact+0.433
 two+0.431
 best+0.419
Neg logit contributions
ho-0.384
 inspection-0.304
 throats-0.286
 Gig-0.279
away-0.273
Token you
Token ID345
Feature activation-1.16e-01

Pos logit contributions
 best+0.002
 two+0.002
 biggest+0.001
 most+0.001
 certainly+0.001
Neg logit contributions
ho-0.002
 inspection-0.002
olph-0.002
keeping-0.002
anged-0.002
Token two
Token ID734
Feature activation-3.99e+00

Pos logit contributions
 exact+0.002
 best+0.002
 biggest+0.002
amed+0.002
ceed+0.002
Neg logit contributions
ho-0.003
olph-0.003
 inspection-0.002
ipop-0.002
round-0.002
Token are
Token ID389
Feature activation-9.38e+00

Pos logit contributions
 best+0.084
 exact+0.081
 two+0.080
amed+0.080
 biggest+0.079
Neg logit contributions
ho-0.112
 inspection-0.090
olph-0.088
keeping-0.083
 throats-0.083
Token friends
Token ID2460
Feature activation+5.10e+00

Pos logit contributions
 exact+0.218
 best+0.215
amed+0.211
 biggest+0.209
 two+0.208
Neg logit contributions
ho-0.238
olph-0.184
 inspection-0.181
keeping-0.171
 Gig-0.169
Token made
Token ID925
Feature activation-8.70e+00

Pos logit contributions
ho+0.022
 inspection+0.017
olph+0.016
 throats+0.016
keeping+0.015
Neg logit contributions
 best-0.018
 exact-0.017
 two-0.017
 biggest-0.017
ceed-0.016
Token her
Token ID607
Feature activation-4.92e+00

Pos logit contributions
 best+0.204
 exact+0.199
amed+0.193
 biggest+0.192
 two+0.192
Neg logit contributions
ho-0.220
 inspection-0.175
olph-0.162
keeping-0.159
 throats-0.158
Token spark
Token ID9009
Feature activation-4.56e+00

Pos logit contributions
 best+0.101
 two+0.096
 exact+0.093
 biggest+0.093
amed+0.089
Neg logit contributions
ho-0.147
 inspection-0.120
olph-0.118
keeping-0.112
 throats-0.109
Tokenle
Token ID293
Feature activation+2.20e+00

Pos logit contributions
 best+0.101
 two+0.100
ceed+0.100
amed+0.100
 smallest+0.100
Neg logit contributions
ho-0.111
 inspection-0.091
away-0.083
 throats-0.080
ipop-0.079
Token and
Token ID290
Feature activation+1.40e+01

Pos logit contributions
ho+0.064
 inspection+0.052
olph+0.050
keeping+0.048
 throats+0.047
Neg logit contributions
 best-0.047
 two-0.044
 exact-0.043
 biggest-0.043
amed-0.042
Token gle
Token ID26852
Feature activation-1.67e+01

Pos logit contributions
ho+0.400
 inspection+0.321
olph+0.316
keeping+0.300
round+0.293
Neg logit contributions
 best-0.309
 two-0.296
 biggest-0.283
 exact-0.282
amed-0.275
Tokenam
Token ID321
Feature activation-4.11e+00

Pos logit contributions
 best+0.242
 teaches+0.242
 biggest+0.232
 exact+0.226
ceed+0.213
Neg logit contributions
ho-0.544
 throats-0.462
anged-0.456
keeping-0.451
 tongues-0.435
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
 best+0.107
 two+0.103
 exact+0.102
 biggest+0.101
amed+0.098
Neg logit contributions
ho-0.099
 inspection-0.075
olph-0.070
keeping-0.069
 throats-0.065
Token She
Token ID1375
Feature activation-8.87e-01

Pos logit contributions
 best+0.040
 exact+0.037
 two+0.036
 biggest+0.036
amed+0.036
Neg logit contributions
ho-0.051
 inspection-0.041
olph-0.040
keeping-0.039
 throats-0.038
Token felt
Token ID2936
Feature activation-1.26e+01

Pos logit contributions
 best+0.019
 two+0.018
 exact+0.017
amed+0.017
 biggest+0.017
Neg logit contributions
ho-0.026
 inspection-0.021
olph-0.020
keeping-0.020
 throats-0.019
Token so
Token ID523
Feature activation+6.36e+00

Pos logit contributions
 best+0.248
 two+0.236
ceed+0.235
 exact+0.232
 biggest+0.227
Neg logit contributions
ho-0.354
 inspection-0.290
 throats-0.266
olph-0.265
keeping-0.264
Token for
Token ID329
Feature activation-3.64e+00

Pos logit contributions
ho+0.005
 inspection+0.004
olph+0.004
keeping+0.004
 throats+0.003
Neg logit contributions
 best-0.004
 two-0.004
 exact-0.004
 biggest-0.004
amed-0.004
Token the
Token ID262
Feature activation-6.99e+00

Pos logit contributions
 best+0.084
 two+0.081
 exact+0.078
 biggest+0.077
amed+0.077
Neg logit contributions
ho-0.099
 inspection-0.078
olph-0.075
keeping-0.072
 throats-0.071
Token toast
Token ID27805
Feature activation+4.22e+00

Pos logit contributions
amed+0.161
 best+0.159
 two+0.154
 exact+0.154
ceed+0.153
Neg logit contributions
ho-0.178
 inspection-0.136
olph-0.131
 throats-0.125
ipop-0.124
Token and
Token ID290
Feature activation+1.40e+01

Pos logit contributions
ho+0.121
 inspection+0.100
olph+0.100
keeping+0.093
anged+0.092
Neg logit contributions
 best-0.097
 two-0.089
 certainly-0.082
 biggest-0.082
 exact-0.081
Token smiled
Token ID13541
Feature activation-8.94e+00

Pos logit contributions
ho+0.397
 inspection+0.330
olph+0.309
away+0.293
 throats+0.293
Neg logit contributions
 two-0.321
 best-0.319
 certainly-0.279
 teaches-0.279
amed-0.276
Token gle
Token ID26852
Feature activation-1.66e+01

Pos logit contributions
 best+0.211
 exact+0.209
ceed+0.206
 two+0.204
amed+0.204
Neg logit contributions
ho-0.223
 inspection-0.169
olph-0.160
 throats-0.154
keeping-0.153
Tokenefully
Token ID7549
Feature activation+3.68e-02

Pos logit contributions
 best+0.204
 biggest+0.192
 two+0.191
 exact+0.189
ceed+0.176
Neg logit contributions
ho-0.614
keeping-0.502
 throats-0.497
olph-0.496
anged-0.493
Token.
Token ID13
Feature activation-1.92e+00

Pos logit contributions
ho+0.001
 inspection+0.001
olph+0.001
keeping+0.001
 throats+0.001
Neg logit contributions
 best-0.001
 two-0.001
 exact-0.001
 biggest-0.001
amed-0.001
Token His
Token ID2399
Feature activation-7.67e+00

Pos logit contributions
 best+0.055
 biggest+0.052
 exact+0.051
amed+0.051
 two+0.051
Neg logit contributions
ho-0.040
 inspection-0.031
olph-0.029
keeping-0.028
round-0.026
Token siblings
Token ID20569
Feature activation+2.82e+00

Pos logit contributions
 best+0.183
 two+0.174
 biggest+0.168
 exact+0.168
amed+0.162
Neg logit contributions
ho-0.208
 inspection-0.165
olph-0.158
keeping-0.153
 throats-0.148
Token were
Token ID547
Feature activation-1.02e+01

Pos logit contributions
ho+0.075
 inspection+0.058
olph+0.057
keeping+0.054
 throats+0.053
Neg logit contributions
 best-0.066
 exact-0.062
 biggest-0.062
 two-0.062
amed-0.060
Token sign
Token ID1051
Feature activation+1.38e+00

Pos logit contributions
ho+0.353
 inspection+0.280
olph+0.272
keeping+0.262
 throats+0.260
Neg logit contributions
 best-0.268
 two-0.252
 exact-0.250
 biggest-0.249
amed-0.246
Token '
Token ID705
Feature activation-1.00e+00

Pos logit contributions
ho+0.040
olph+0.032
 inspection+0.032
keeping+0.030
round+0.029
Neg logit contributions
 best-0.031
 two-0.030
 biggest-0.028
 exact-0.027
 teaches-0.027
Tokenhappy
Token ID34191
Feature activation-3.75e+00

Pos logit contributions
 best+0.021
 two+0.021
 exact+0.020
 biggest+0.019
amed+0.019
Neg logit contributions
ho-0.029
 inspection-0.023
olph-0.022
 throats-0.021
keeping-0.021
Token',
Token ID3256
Feature activation+2.60e+00

Pos logit contributions
 best+0.070
 two+0.066
 exact+0.065
 biggest+0.065
amed+0.063
Neg logit contributions
ho-0.117
 inspection-0.095
olph-0.092
keeping-0.090
 throats-0.089
Token '
Token ID705
Feature activation-9.25e-01

Pos logit contributions
ho+0.053
 inspection+0.040
 throats+0.038
olph+0.036
keeping+0.034
Neg logit contributions
 best-0.072
 exact-0.071
amed-0.070
ceed-0.070
 biggest-0.069
Tokenbirth
Token ID24280
Feature activation-1.66e+01

Pos logit contributions
 best+0.019
 two+0.018
 exact+0.017
 biggest+0.017
amed+0.017
Neg logit contributions
ho-0.028
 inspection-0.022
olph-0.022
keeping-0.021
 throats-0.021
Tokenday
Token ID820
Feature activation+6.61e+00

Pos logit contributions
 best+0.360
 exact+0.359
 biggest+0.354
 certainly+0.328
 two+0.321
Neg logit contributions
ho-0.448
olph-0.350
keeping-0.342
 inspection-0.328
 throats-0.327
Token',
Token ID3256
Feature activation+2.77e+00

Pos logit contributions
ho+0.180
 inspection+0.140
olph+0.138
keeping+0.133
 throats+0.133
Neg logit contributions
 best-0.144
 biggest-0.138
amed-0.137
 exact-0.136
 two-0.132
Token and
Token ID290
Feature activation+1.49e+01

Pos logit contributions
ho+0.062
 inspection+0.047
 throats+0.044
olph+0.044
keeping+0.041
Neg logit contributions
 best-0.073
amed-0.069
 biggest-0.069
 exact-0.069
ceed-0.068
Token '
Token ID705
Feature activation-7.73e-01

Pos logit contributions
ho+0.395
 inspection+0.310
olph+0.300
keeping+0.287
 throats+0.284
Neg logit contributions
 best-0.350
 two-0.333
 exact-0.328
 biggest-0.326
amed-0.321
Tokenyou
Token ID5832
Feature activation-3.64e+00

Pos logit contributions
 best+0.015
 two+0.014
 exact+0.014
 biggest+0.014
amed+0.014
Neg logit contributions
ho-0.023
 inspection-0.019
olph-0.019
keeping-0.018
 throats-0.018
Token garden
Token ID11376
Feature activation+8.04e+00

Pos logit contributions
 best+0.146
 two+0.139
 biggest+0.135
 exact+0.133
 most+0.126
Neg logit contributions
ho-0.154
 inspection-0.126
olph-0.121
keeping-0.120
 throats-0.115
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
ho+0.222
 inspection+0.192
olph+0.189
keeping+0.176
 throats+0.174
Neg logit contributions
 best-0.173
 two-0.169
 exact-0.165
 biggest-0.157
amed-0.149
Token He
Token ID679
Feature activation-6.80e-02

Pos logit contributions
 best+0.032
 two+0.030
 exact+0.029
 biggest+0.029
amed+0.029
Neg logit contributions
ho-0.044
 inspection-0.035
olph-0.034
keeping-0.033
 throats-0.032
Token was
Token ID373
Feature activation-9.42e+00

Pos logit contributions
 best+0.001
 two+0.001
 exact+0.001
amed+0.001
 biggest+0.001
Neg logit contributions
ho-0.002
 inspection-0.002
olph-0.002
keeping-0.001
 throats-0.001
Token immediately
Token ID3393
Feature activation-3.00e-01

Pos logit contributions
 best+0.180
 exact+0.170
 two+0.170
 biggest+0.167
amed+0.162
Neg logit contributions
ho-0.286
 inspection-0.231
olph-0.224
keeping-0.216
 throats-0.214
Token aw
Token ID3253
Feature activation-1.66e+01

Pos logit contributions
 best+0.006
 two+0.006
 exact+0.006
 biggest+0.006
amed+0.006
Neg logit contributions
ho-0.009
olph-0.007
 inspection-0.007
round-0.007
keeping-0.007
Tokenest
Token ID395
Feature activation-4.83e+00

Pos logit contributions
 best+0.431
 exact+0.421
 two+0.407
 biggest+0.405
 certainly+0.405
Neg logit contributions
ho-0.379
 inspection-0.277
olph-0.265
keeping-0.260
away-0.247
Tokenruck
Token ID30915
Feature activation+5.45e+00

Pos logit contributions
 best+0.097
 two+0.094
 biggest+0.087
 exact+0.082
 most+0.080
Neg logit contributions
ho-0.151
olph-0.124
 inspection-0.121
keeping-0.117
ipop-0.114
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
ho+0.158
olph+0.142
 inspection+0.138
ipop+0.127
keeping+0.127
Neg logit contributions
 best-0.121
 two-0.114
 biggest-0.104
 exact-0.103
 most-0.101
Token\n
Token ID198
Feature activation-6.74e+00

Pos logit contributions
 best+0.032
 two+0.029
 exact+0.029
 biggest+0.029
amed+0.028
Neg logit contributions
ho-0.044
 inspection-0.036
olph-0.035
keeping-0.034
 throats-0.033
Token\n
Token ID198
Feature activation-6.76e+00

Pos logit contributions
 best+0.149
 exact+0.140
 biggest+0.139
 nic+0.135
amed+0.128
Neg logit contributions
ho-0.183
 inspection-0.162
olph-0.152
keeping-0.145
round-0.139
Token garden
Token ID11376
Feature activation+7.97e+00

Pos logit contributions
 best+0.146
 two+0.140
 biggest+0.136
 exact+0.135
amed+0.128
Neg logit contributions
ho-0.214
olph-0.174
 inspection-0.174
keeping-0.163
 throats-0.161
Token,
Token ID11
Feature activation+1.16e+01

Pos logit contributions
ho+0.185
olph+0.143
 inspection+0.141
keeping+0.134
round+0.131
Neg logit contributions
 best-0.218
 two-0.211
 biggest-0.202
 exact-0.200
amed-0.192
Token Molly
Token ID30236
Feature activation+2.17e+00

Pos logit contributions
ho+0.283
olph+0.211
keeping+0.210
 inspection+0.208
round+0.203
Neg logit contributions
 best-0.309
 two-0.309
 biggest-0.288
 exact-0.283
 most-0.275
Token noticed
Token ID6810
Feature activation-7.75e+00

Pos logit contributions
ho+0.062
 inspection+0.051
olph+0.049
keeping+0.048
 throats+0.047
Neg logit contributions
 best-0.045
 two-0.045
amed-0.041
 exact-0.041
 biggest-0.041
Token something
Token ID1223
Feature activation-3.93e+00

Pos logit contributions
 best+0.115
 two+0.113
 exact+0.102
 biggest+0.102
amed+0.097
Neg logit contributions
ho-0.276
 inspection-0.230
olph-0.226
keeping-0.220
 throats-0.218
Token gle
Token ID26852
Feature activation-1.65e+01

Pos logit contributions
 best+0.065
 two+0.060
 exact+0.058
 biggest+0.057
 certainly+0.053
Neg logit contributions
ho-0.134
olph-0.112
 inspection-0.112
keeping-0.111
 throats-0.107
Tokenaming
Token ID3723
Feature activation-1.89e+00

Pos logit contributions
 best+0.251
 teaches+0.238
 biggest+0.234
 two+0.229
 exact+0.227
Neg logit contributions
ho-0.571
olph-0.465
keeping-0.455
 inspection-0.450
away-0.441
Token on
Token ID319
Feature activation+4.96e+00

Pos logit contributions
 best+0.043
 two+0.041
 exact+0.041
 biggest+0.040
amed+0.038
Neg logit contributions
ho-0.051
olph-0.043
 inspection-0.043
keeping-0.040
 throats-0.040
Token the
Token ID262
Feature activation-7.05e+00

Pos logit contributions
ho+0.174
anged+0.148
 inspection+0.145
olph+0.144
ipop+0.142
Neg logit contributions
 two-0.093
 exact-0.072
 best-0.072
 biggest-0.063
 teaches-0.062
Token ground
Token ID2323
Feature activation+4.10e+00

Pos logit contributions
 best+0.127
 biggest+0.126
 two+0.125
 exact+0.118
 most+0.110
Neg logit contributions
ho-0.222
olph-0.199
 inspection-0.199
anged-0.189
ipop-0.181
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
ho+0.124
 inspection+0.103
olph+0.102
keeping+0.096
 throats+0.095
Neg logit contributions
 best-0.080
 two-0.077
 exact-0.074
 biggest-0.072
amed-0.070
Token the
Token ID262
Feature activation-6.45e+00

Pos logit contributions
ho+0.094
 inspection+0.075
keeping+0.071
olph+0.070
 throats+0.067
Neg logit contributions
 best-0.090
amed-0.086
 exact-0.085
 biggest-0.084
ceed-0.084
Token sky
Token ID6766
Feature activation+5.44e+00

Pos logit contributions
ceed+0.135
 certainly+0.134
 Deal+0.132
 best+0.131
 teaches+0.125
Neg logit contributions
ho-0.164
round-0.130
away-0.128
 yacht-0.127
keeping-0.126
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
ho+0.139
 inspection+0.106
keeping+0.100
olph+0.099
 throats+0.097
Neg logit contributions
 best-0.126
 exact-0.122
amed-0.122
ceed-0.120
 biggest-0.120
Token Lily
Token ID20037
Feature activation-2.81e+00

Pos logit contributions
 best+0.036
 two+0.033
 exact+0.033
amed+0.033
 biggest+0.033
Neg logit contributions
ho-0.034
 inspection-0.027
olph-0.026
 throats-0.025
keeping-0.024
Token felt
Token ID2936
Feature activation-1.22e+01

Pos logit contributions
 best+0.063
 exact+0.060
 biggest+0.059
 two+0.058
ceed+0.057
Neg logit contributions
ho-0.078
 inspection-0.061
olph-0.060
keeping-0.056
 throats-0.055
Token aw
Token ID3253
Feature activation-1.64e+01

Pos logit contributions
ceed+0.269
 best+0.259
icious+0.257
 exact+0.256
amed+0.255
Neg logit contributions
ho-0.285
 inspection-0.235
 he-0.225
 Gig-0.221
 crab-0.215
Tokened
Token ID276
Feature activation-2.59e+00

Pos logit contributions
 exact+0.396
 best+0.396
 certainly+0.378
 nic+0.374
 Deal+0.374
Neg logit contributions
ho-0.392
 inspection-0.285
keeping-0.282
olph-0.273
away-0.273
Token,
Token ID11
Feature activation+1.18e+01

Pos logit contributions
 best+0.057
 two+0.055
 exact+0.049
 biggest+0.049
 most+0.048
Neg logit contributions
ho-0.076
olph-0.065
ipop-0.063
 inspection-0.063
 throats-0.060
Token with
Token ID351
Feature activation-2.72e+00

Pos logit contributions
ho+0.346
olph+0.275
 inspection+0.272
keeping+0.261
anged+0.259
Neg logit contributions
 best-0.256
 two-0.248
 exact-0.237
 biggest-0.237
 most-0.227
Token her
Token ID607
Feature activation-4.36e+00

Pos logit contributions
 best+0.055
 two+0.054
 biggest+0.050
 exact+0.050
amed+0.049
Neg logit contributions
ho-0.083
 inspection-0.066
olph-0.066
keeping-0.063
 throats-0.062
Token eyes
Token ID2951
Feature activation+7.20e-01

Pos logit contributions
 best+0.102
 two+0.097
 biggest+0.094
 exact+0.093
amed+0.089
Neg logit contributions
ho-0.120
 inspection-0.096
olph-0.093
keeping-0.090
 throats-0.086
Token and
Token ID290
Feature activation+1.45e+01

Pos logit contributions
ho+0.021
 inspection+0.018
olph+0.017
aro+0.016
keeping+0.016
Neg logit contributions
 two-0.018
 best-0.018
 biggest-0.015
amed-0.015
 exact-0.015
Token it
Token ID340
Feature activation+1.29e+00

Pos logit contributions
ho+0.454
 inspection+0.374
olph+0.362
round+0.355
keeping+0.349
Neg logit contributions
 two-0.303
 best-0.279
 nic-0.251
amed-0.249
 most-0.242
Token spark
Token ID9009
Feature activation-4.21e+00

Pos logit contributions
ho+0.038
 inspection+0.030
olph+0.029
keeping+0.028
 throats+0.028
Neg logit contributions
 best-0.027
 two-0.026
 exact-0.025
 biggest-0.025
amed-0.024
Tokenled
Token ID992
Feature activation-5.67e+00

Pos logit contributions
 best+0.124
 two+0.120
 exact+0.118
 biggest+0.117
amed+0.116
Neg logit contributions
ho-0.087
 inspection-0.063
olph-0.060
keeping-0.057
 throats-0.055
Token and
Token ID290
Feature activation+1.43e+01

Pos logit contributions
 best+0.133
 two+0.127
 exact+0.121
 biggest+0.121
amed+0.118
Neg logit contributions
ho-0.154
 inspection-0.123
olph-0.123
keeping-0.117
 throats-0.114
Token gle
Token ID26852
Feature activation-1.64e+01

Pos logit contributions
ho+0.408
 inspection+0.328
olph+0.323
keeping+0.305
 throats+0.303
Neg logit contributions
 best-0.323
 two-0.312
 biggest-0.294
 exact-0.289
amed-0.285
Tokenamed
Token ID2434
Feature activation-6.48e+00

Pos logit contributions
 teaches+0.238
 best+0.235
 biggest+0.222
 exact+0.211
 two+0.204
Neg logit contributions
ho-0.555
anged-0.457
keeping-0.453
 throats-0.445
olph-0.442
Token.
Token ID13
Feature activation-1.56e+00

Pos logit contributions
 two+0.148
 best+0.148
amed+0.138
 exact+0.137
 biggest+0.136
Neg logit contributions
ho-0.175
 inspection-0.142
olph-0.137
 throats-0.133
ipop-0.130
Token He
Token ID679
Feature activation-3.39e-01

Pos logit contributions
 best+0.035
 two+0.033
 exact+0.033
 biggest+0.032
amed+0.032
Neg logit contributions
ho-0.043
 inspection-0.034
olph-0.033
keeping-0.032
 throats-0.031
Token ran
Token ID4966
Feature activation-2.59e+00

Pos logit contributions
 best+0.008
 exact+0.007
 two+0.007
 biggest+0.007
amed+0.007
Neg logit contributions
ho-0.009
 inspection-0.007
olph-0.007
keeping-0.007
 throats-0.007
Token around
Token ID1088
Feature activation+5.82e+00

Pos logit contributions
 two+0.053
amed+0.049
 best+0.048
 right+0.046
 biggest+0.043
Neg logit contributions
ho-0.082
 throats-0.069
olph-0.067
 tongues-0.065
 inspection-0.062
Token the
Token ID262
Feature activation-6.75e+00

Pos logit contributions
ho+0.249
 inspection+0.206
olph+0.178
keeping+0.173
round+0.170
Neg logit contributions
 best-0.318
amed-0.315
ceed-0.294
 exact-0.293
 two-0.292
Token bottom
Token ID4220
Feature activation-2.28e+00

Pos logit contributions
amed+0.179
 best+0.171
 teaches+0.171
ceed+0.168
 certainly+0.164
Neg logit contributions
ho-0.151
 inspection-0.108
round-0.098
 sled-0.098
 throats-0.098
Token.
Token ID13
Feature activation-1.69e+00

Pos logit contributions
 best+0.044
 two+0.043
 biggest+0.040
 exact+0.040
amed+0.038
Neg logit contributions
ho-0.071
 inspection-0.058
olph-0.058
 throats-0.055
keeping-0.055
Token He
Token ID679
Feature activation-2.47e-01

Pos logit contributions
 best+0.040
 two+0.039
 exact+0.038
 biggest+0.038
amed+0.037
Neg logit contributions
ho-0.044
 inspection-0.034
olph-0.033
keeping-0.032
 throats-0.032
Token laughed
Token ID13818
Feature activation-7.79e+00

Pos logit contributions
 best+0.005
 two+0.005
 exact+0.005
 biggest+0.005
amed+0.005
Neg logit contributions
ho-0.007
 inspection-0.005
olph-0.005
keeping-0.005
 throats-0.005
Token gle
Token ID26852
Feature activation-1.64e+01

Pos logit contributions
 best+0.208
 two+0.201
 exact+0.195
 biggest+0.194
amed+0.191
Neg logit contributions
ho-0.185
 inspection-0.140
olph-0.136
keeping-0.130
 throats-0.126
Tokenefully
Token ID7549
Feature activation+2.18e-01

Pos logit contributions
 best+0.203
 exact+0.202
 biggest+0.194
 teaches+0.183
 two+0.182
Neg logit contributions
ho-0.584
anged-0.493
 throats-0.484
keeping-0.480
olph-0.475
Token as
Token ID355
Feature activation+4.88e+00

Pos logit contributions
ho+0.006
olph+0.004
 inspection+0.004
keeping+0.004
 throats+0.004
Neg logit contributions
 best-0.005
 two-0.005
 exact-0.005
 biggest-0.005
amed-0.005
Token he
Token ID339
Feature activation+2.10e+00

Pos logit contributions
ho+0.088
olph+0.059
 inspection+0.058
 throats+0.058
keeping+0.057
Neg logit contributions
 best-0.163
 two-0.160
 biggest-0.153
 exact-0.148
 most-0.145
Token went
Token ID1816
Feature activation-1.10e+00

Pos logit contributions
ho+0.050
 inspection+0.037
olph+0.037
keeping+0.033
 throats+0.033
Neg logit contributions
 best-0.055
 exact-0.053
 biggest-0.053
 two-0.052
ceed-0.050
Token.
Token ID13
Feature activation-1.63e+00

Pos logit contributions
 best+0.028
 two+0.027
 exact+0.026
 biggest+0.026
amed+0.025
Neg logit contributions
ho-0.028
 inspection-0.021
olph-0.021
 throats-0.020
keeping-0.020
Token curtain
Token ID29461
Feature activation+3.96e+00

Pos logit contributions
 best+0.155
 two+0.148
 exact+0.147
amed+0.144
 biggest+0.144
Neg logit contributions
ho-0.174
 inspection-0.136
olph-0.131
keeping-0.125
 throats-0.125
Token moved
Token ID3888
Feature activation-7.76e+00

Pos logit contributions
ho+0.110
 inspection+0.090
olph+0.089
keeping+0.086
 throats+0.083
Neg logit contributions
 best-0.086
 two-0.084
 exact-0.079
 biggest-0.078
amed-0.078
Token.
Token ID13
Feature activation-1.69e+00

Pos logit contributions
 best+0.163
 two+0.158
 exact+0.150
 biggest+0.147
amed+0.145
Neg logit contributions
ho-0.227
olph-0.186
 inspection-0.185
 throats-0.176
ipop-0.174
Token Johnny
Token ID15470
Feature activation+2.21e+00

Pos logit contributions
 best+0.047
 exact+0.044
 two+0.044
amed+0.044
 biggest+0.043
Neg logit contributions
ho-0.037
 inspection-0.028
olph-0.027
keeping-0.026
 throats-0.025
Token laughed
Token ID13818
Feature activation-7.59e+00

Pos logit contributions
ho+0.069
 inspection+0.057
olph+0.055
keeping+0.054
 throats+0.053
Neg logit contributions
 best-0.041
 two-0.040
amed-0.038
 exact-0.038
 biggest-0.037
Token gle
Token ID26852
Feature activation-1.64e+01

Pos logit contributions
 best+0.201
 two+0.194
 exact+0.189
 biggest+0.188
amed+0.185
Neg logit contributions
ho-0.181
 inspection-0.138
olph-0.133
keeping-0.128
 throats-0.124
Tokenefully
Token ID7549
Feature activation+5.39e-01

Pos logit contributions
 best+0.215
 exact+0.208
 biggest+0.204
 two+0.193
 teaches+0.188
Neg logit contributions
ho-0.578
anged-0.479
 throats-0.476
keeping-0.470
olph-0.470
Token as
Token ID355
Feature activation+5.00e+00

Pos logit contributions
ho+0.016
olph+0.013
 inspection+0.012
keeping+0.012
 throats+0.012
Neg logit contributions
 best-0.012
 two-0.012
 exact-0.011
 biggest-0.010
 certainly-0.010
Token he
Token ID339
Feature activation+2.25e+00

Pos logit contributions
ho+0.096
olph+0.064
 inspection+0.064
keeping+0.063
 throats+0.063
Neg logit contributions
 best-0.165
 two-0.163
 biggest-0.153
 exact-0.148
 most-0.147
Token watched
Token ID7342
Feature activation-5.20e+00

Pos logit contributions
ho+0.055
 inspection+0.041
olph+0.040
keeping+0.037
 throats+0.037
Neg logit contributions
 best-0.058
 exact-0.056
 biggest-0.055
 two-0.055
amed-0.053
Token it
Token ID340
Feature activation+1.12e+00

Pos logit contributions
 two+0.117
 best+0.113
 biggest+0.103
 exact+0.100
 most+0.095
Neg logit contributions
ho-0.157
olph-0.121
 inspection-0.120
keeping-0.117
ipop-0.114
Token\n
Token ID198
Feature activation-6.79e+00

Pos logit contributions
 best+0.157
 exact+0.153
 nic+0.149
 biggest+0.148
 coolest+0.139
Neg logit contributions
ho-0.179
 inspection-0.155
olph-0.145
 Gig-0.133
keeping-0.133
TokenThe
Token ID464
Feature activation-8.97e+00

Pos logit contributions
 best+0.166
 biggest+0.153
 nic+0.143
 exact+0.139
 coolest+0.137
Neg logit contributions
ho-0.181
 inspection-0.145
olph-0.137
round-0.132
aro-0.132
Token young
Token ID1862
Feature activation+7.58e-01

Pos logit contributions
amed+0.186
 certainly+0.180
 exact+0.175
ceed+0.174
 best+0.172
Neg logit contributions
ho-0.233
 crab-0.187
 yacht-0.187
olph-0.184
 inspection-0.176
Token girl
Token ID2576
Feature activation+3.19e+00

Pos logit contributions
ho+0.019
olph+0.015
 inspection+0.015
keeping+0.014
 crab+0.014
Neg logit contributions
 certainly-0.017
amed-0.017
 best-0.017
ceed-0.017
 exact-0.016
Token smiled
Token ID13541
Feature activation-8.51e+00

Pos logit contributions
ho+0.096
 inspection+0.079
olph+0.076
keeping+0.074
 throats+0.073
Neg logit contributions
 best-0.064
 two-0.061
 exact-0.058
 biggest-0.057
amed-0.057
Token gle
Token ID26852
Feature activation-1.64e+01

Pos logit contributions
 best+0.231
 exact+0.229
ceed+0.226
 two+0.224
amed+0.223
Neg logit contributions
ho-0.185
 inspection-0.133
olph-0.123
 throats-0.119
away-0.113
Tokenefully
Token ID7549
Feature activation+5.64e-01

Pos logit contributions
 best+0.203
 exact+0.199
 biggest+0.190
 two+0.182
 certainly+0.178
Neg logit contributions
ho-0.594
 throats-0.488
anged-0.484
keeping-0.476
olph-0.475
Token as
Token ID355
Feature activation+5.04e+00

Pos logit contributions
ho+0.015
 inspection+0.013
olph+0.012
keeping+0.012
 throats+0.011
Neg logit contributions
 best-0.013
 two-0.013
 exact-0.012
 biggest-0.012
amed-0.011
Token the
Token ID262
Feature activation-6.73e+00

Pos logit contributions
ho+0.100
 inspection+0.072
olph+0.068
keeping+0.064
 throats+0.062
Neg logit contributions
 best-0.152
 two-0.147
 exact-0.145
 biggest-0.144
amed-0.142
Token net
Token ID2010
Feature activation+1.93e+00

Pos logit contributions
amed+0.168
 certainly+0.165
 exact+0.165
 best+0.163
ceed+0.162
Neg logit contributions
ho-0.150
 crab-0.112
 yacht-0.109
olph-0.104
 inspection-0.103
Token filled
Token ID5901
Feature activation-5.02e+00

Pos logit contributions
ho+0.053
 inspection+0.042
olph+0.041
keeping+0.039
 throats+0.039
Neg logit contributions
 best-0.043
 two-0.041
 exact-0.041
 biggest-0.040
amed-0.039
Token up
Token ID510
Feature activation+5.44e-01

Pos logit contributions
 best+0.093
 two+0.088
 biggest+0.085
 exact+0.084
amed+0.081
Neg logit contributions
ho-0.130
olph-0.103
keeping-0.102
 inspection-0.101
 throats-0.095
Token at
Token ID379
Feature activation+4.08e+00

Pos logit contributions
ho+0.016
olph+0.013
keeping+0.012
 inspection+0.012
round+0.012
Neg logit contributions
 best-0.012
 two-0.012
 exact-0.011
 biggest-0.011
amed-0.010
Token the
Token ID262
Feature activation-6.69e+00

Pos logit contributions
ho+0.116
olph+0.092
 inspection+0.091
round+0.089
keeping+0.087
Neg logit contributions
 best-0.091
 two-0.090
 biggest-0.083
 exact-0.082
 most-0.078
Token stars
Token ID5788
Feature activation+3.09e+00

Pos logit contributions
 best+0.178
amed+0.177
ceed+0.175
 exact+0.172
 teaches+0.171
Neg logit contributions
ho-0.147
 inspection-0.106
olph-0.101
 throats-0.098
keeping-0.092
Token,
Token ID11
Feature activation+1.17e+01

Pos logit contributions
ho+0.094
olph+0.080
 inspection+0.079
keeping+0.078
round+0.073
Neg logit contributions
 best-0.061
 two-0.059
 exact-0.054
 biggest-0.053
 certainly-0.050
Token aw
Token ID3253
Feature activation-1.63e+01

Pos logit contributions
ho+0.334
olph+0.281
 inspection+0.269
keeping+0.269
ipop+0.265
Neg logit contributions
 two-0.262
 best-0.260
 exact-0.241
 most-0.240
 biggest-0.238
Tokenest
Token ID395
Feature activation-4.56e+00

Pos logit contributions
 best+0.317
 exact+0.311
 teaches+0.294
 two+0.293
ceed+0.292
Neg logit contributions
ho-0.473
 inspection-0.368
olph-0.356
keeping-0.355
 throats-0.353
Tokenruck
Token ID30915
Feature activation+5.68e+00

Pos logit contributions
 best+0.098
 two+0.095
 biggest+0.090
 exact+0.087
amed+0.085
Neg logit contributions
ho-0.135
 inspection-0.106
olph-0.106
keeping-0.103
ipop-0.098
Token that
Token ID326
Feature activation+3.20e+00

Pos logit contributions
ho+0.163
olph+0.147
 inspection+0.141
keeping+0.134
 Gig+0.130
Neg logit contributions
 best-0.127
 two-0.119
 biggest-0.111
 exact-0.107
 most-0.104
Token the
Token ID262
Feature activation-6.59e+00

Pos logit contributions
ho+0.092
olph+0.074
 inspection+0.074
keeping+0.072
round+0.071
Neg logit contributions
 best-0.073
 two-0.069
 biggest-0.064
 exact-0.064
 certainly-0.061
Token night
Token ID1755
Feature activation+7.11e+00

Pos logit contributions
 best+0.164
 two+0.158
 exact+0.156
amed+0.155
 biggest+0.154
Neg logit contributions
ho-0.163
 inspection-0.126
olph-0.122
keeping-0.116
 throats-0.114
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
 best+0.110
 two+0.105
 exact+0.098
 biggest+0.096
 most+0.093
Neg logit contributions
ho-0.154
olph-0.126
 inspection-0.123
ipop-0.118
round-0.118
Token Everything
Token ID11391
Feature activation-4.50e+00

Pos logit contributions
 best+0.032
 two+0.030
 exact+0.029
 biggest+0.029
amed+0.029
Neg logit contributions
ho-0.043
 inspection-0.035
olph-0.034
keeping-0.032
 throats-0.032
Token was
Token ID373
Feature activation-9.62e+00

Pos logit contributions
 best+0.081
 two+0.076
 exact+0.071
 biggest+0.070
amed+0.069
Neg logit contributions
ho-0.148
 inspection-0.122
olph-0.121
keeping-0.119
 throats-0.115
Token wet
Token ID9583
Feature activation-1.46e+00

Pos logit contributions
 best+0.181
 exact+0.178
 biggest+0.178
ceed+0.177
 two+0.171
Neg logit contributions
ho-0.271
 inspection-0.221
olph-0.212
keeping-0.204
 throats-0.202
Token and
Token ID290
Feature activation+1.44e+01

Pos logit contributions
 best+0.037
 two+0.036
 exact+0.033
 biggest+0.033
amed+0.032
Neg logit contributions
ho-0.039
 inspection-0.030
olph-0.030
ipop-0.028
 throats-0.028
Token gle
Token ID26852
Feature activation-1.63e+01

Pos logit contributions
ho+0.428
 inspection+0.351
olph+0.350
keeping+0.336
round+0.332
Neg logit contributions
 best-0.307
 two-0.297
 biggest-0.275
 exact-0.274
 most-0.266
Tokenaming
Token ID3723
Feature activation-1.63e+00

Pos logit contributions
 best+0.212
 teaches+0.204
 exact+0.202
 biggest+0.198
ceed+0.188
Neg logit contributions
ho-0.568
 throats-0.478
olph-0.478
anged-0.472
keeping-0.471
Token,
Token ID11
Feature activation+1.15e+01

Pos logit contributions
 best+0.039
 two+0.037
 exact+0.035
 biggest+0.034
amed+0.033
Neg logit contributions
ho-0.045
 inspection-0.036
olph-0.036
keeping-0.035
 throats-0.033
Token with
Token ID351
Feature activation-2.69e+00

Pos logit contributions
ho+0.325
olph+0.255
 inspection+0.252
keeping+0.241
ipop+0.236
Neg logit contributions
 best-0.270
 two-0.265
 exact-0.249
 biggest-0.243
 certainly-0.242
Token rain
Token ID6290
Feature activation+2.52e+00

Pos logit contributions
 best+0.049
 two+0.046
 exact+0.045
 biggest+0.045
amed+0.044
Neg logit contributions
ho-0.086
 inspection-0.071
olph-0.069
keeping-0.066
 throats-0.066
Token still
Token ID991
Feature activation+1.72e+00

Pos logit contributions
ho+0.063
 inspection+0.050
olph+0.049
keeping+0.045
 throats+0.045
Neg logit contributions
 best-0.063
 two-0.061
amed-0.058
 exact-0.057
 biggest-0.057
Token was
Token ID373
Feature activation-9.11e+00

Pos logit contributions
ho+0.075
 inspection+0.060
olph+0.058
keeping+0.056
 throats+0.055
Neg logit contributions
 best-0.055
 two-0.053
 exact-0.052
 biggest-0.051
amed-0.050
Token finished
Token ID5201
Feature activation+6.48e-02

Pos logit contributions
 best+0.205
 biggest+0.204
 exact+0.200
 teaches+0.194
ceed+0.194
Neg logit contributions
ho-0.210
 inspection-0.179
olph-0.173
ipop-0.163
 throats-0.153
Token,
Token ID11
Feature activation+1.18e+01

Pos logit contributions
ho+0.002
round+0.002
keeping+0.001
 throats+0.001
 inspection+0.001
Neg logit contributions
 two-0.002
 best-0.002
 exact-0.001
 most-0.001
 biggest-0.001
Token the
Token ID262
Feature activation-6.74e+00

Pos logit contributions
ho+0.299
 inspection+0.223
olph+0.221
round+0.216
 throats+0.214
Neg logit contributions
 two-0.338
 best-0.328
 most-0.297
 biggest-0.290
 exact-0.281
Token trophy
Token ID16383
Feature activation+2.71e+00

Pos logit contributions
 best+0.186
 two+0.179
 exact+0.175
 biggest+0.174
amed+0.171
Neg logit contributions
ho-0.154
 inspection-0.116
olph-0.111
keeping-0.106
 throats-0.103
Token gle
Token ID26852
Feature activation-1.63e+01

Pos logit contributions
ho+0.081
 inspection+0.067
olph+0.064
keeping+0.063
 throats+0.062
Neg logit contributions
 best-0.054
 two-0.052
 exact-0.050
 biggest-0.049
amed-0.049
Tokenamed
Token ID2434
Feature activation-6.31e+00

Pos logit contributions
 best+0.188
 biggest+0.172
 teaches+0.172
 exact+0.167
 two+0.157
Neg logit contributions
ho-0.589
anged-0.505
olph-0.500
keeping-0.499
 throats-0.492
Token and
Token ID290
Feature activation+1.45e+01

Pos logit contributions
 two+0.141
 best+0.136
amed+0.131
 exact+0.126
 biggest+0.123
Neg logit contributions
ho-0.175
 inspection-0.146
olph-0.141
 throats-0.139
keeping-0.137
Token spark
Token ID9009
Feature activation-4.06e+00

Pos logit contributions
ho+0.404
 inspection+0.325
olph+0.312
keeping+0.305
 throats+0.299
Neg logit contributions
 best-0.326
 two-0.318
 exact-0.299
 biggest-0.299
amed-0.297
Tokenled
Token ID992
Feature activation-5.65e+00

Pos logit contributions
 best+0.121
 two+0.117
 exact+0.116
 biggest+0.115
amed+0.113
Neg logit contributions
ho-0.082
 inspection-0.059
olph-0.056
keeping-0.052
 throats-0.051
Token in
Token ID287
Feature activation+3.83e+00

Pos logit contributions
 best+0.134
 two+0.129
 exact+0.123
 biggest+0.122
amed+0.119
Neg logit contributions
ho-0.153
 inspection-0.121
olph-0.121
keeping-0.115
 throats-0.112
Token was
Token ID373
Feature activation-8.49e+00

Pos logit contributions
ho+0.108
 inspection+0.090
olph+0.088
keeping+0.085
 throats+0.084
Neg logit contributions
 best-0.071
 two-0.068
 biggest-0.063
 exact-0.063
amed-0.062
Token a
Token ID257
Feature activation-5.89e+00

Pos logit contributions
 best+0.110
 exact+0.101
 two+0.099
amed+0.097
 biggest+0.097
Neg logit contributions
ho-0.306
 inspection-0.256
olph-0.250
 throats-0.243
keeping-0.242
Token toy
Token ID13373
Feature activation+4.38e+00

Pos logit contributions
 best+0.114
 two+0.104
 biggest+0.102
 exact+0.098
 most+0.092
Neg logit contributions
ho-0.188
 inspection-0.166
keeping-0.162
olph-0.158
anged-0.155
Token car
Token ID1097
Feature activation+3.67e+00

Pos logit contributions
ho+0.113
 inspection+0.098
keeping+0.089
olph+0.088
 throats+0.083
Neg logit contributions
 best-0.107
 two-0.103
 biggest-0.097
 exact-0.096
 most-0.091
Token that
Token ID326
Feature activation+3.61e+00

Pos logit contributions
ho+0.100
 inspection+0.080
olph+0.077
keeping+0.074
 throats+0.074
Neg logit contributions
 best-0.084
 two-0.081
 exact-0.078
 biggest-0.077
amed-0.076
Token gle
Token ID26852
Feature activation-1.62e+01

Pos logit contributions
ho+0.095
 inspection+0.077
olph+0.073
keeping+0.070
 throats+0.070
Neg logit contributions
 best-0.087
 two-0.085
 biggest-0.080
 exact-0.079
 nic-0.077
Tokenamed
Token ID2434
Feature activation-6.10e+00

Pos logit contributions
 teaches+0.237
 best+0.229
 biggest+0.212
 certainly+0.209
 exact+0.206
Neg logit contributions
ho-0.549
olph-0.466
keeping-0.450
anged-0.448
round-0.432
Token in
Token ID287
Feature activation+4.09e+00

Pos logit contributions
 two+0.138
 best+0.137
 exact+0.127
amed+0.127
 biggest+0.125
Neg logit contributions
ho-0.168
 inspection-0.139
olph-0.132
 throats-0.129
ipop-0.126
Token the
Token ID262
Feature activation-6.60e+00

Pos logit contributions
ho+0.106
 inspection+0.083
olph+0.080
keeping+0.077
 throats+0.075
Neg logit contributions
 best-0.100
 two-0.094
 exact-0.093
 biggest-0.093
amed-0.092
Token morning
Token ID3329
Feature activation+7.77e+00

Pos logit contributions
 best+0.138
 two+0.132
 exact+0.128
 biggest+0.127
amed+0.124
Neg logit contributions
ho-0.193
 inspection-0.156
olph-0.151
keeping-0.146
 throats-0.144
Token sun
Token ID4252
Feature activation+9.00e+00

Pos logit contributions
ho+0.240
 inspection+0.203
olph+0.192
aro+0.181
ipop+0.180
Neg logit contributions
 two-0.154
 best-0.153
 exact-0.141
 biggest-0.139
amed-0.132
Token he
Token ID339
Feature activation+2.36e+00

Pos logit contributions
 two+0.058
 best+0.057
 exact+0.048
 certainly+0.048
 biggest+0.047
Neg logit contributions
ho-0.088
 throats-0.070
keeping-0.068
olph-0.067
aro-0.066
Token didn
Token ID1422
Feature activation-8.18e+00

Pos logit contributions
ho+0.062
 inspection+0.050
olph+0.047
keeping+0.047
 throats+0.046
Neg logit contributions
 best-0.055
 two-0.054
 exact-0.052
 certainly-0.051
amed-0.051
Token't
Token ID470
Feature activation-9.84e-01

Pos logit contributions
 best+0.204
 exact+0.194
 biggest+0.194
amed+0.192
 nic+0.191
Neg logit contributions
ho-0.203
 inspection-0.154
olph-0.146
keeping-0.139
othe-0.135
Token know
Token ID760
Feature activation-4.60e+00

Pos logit contributions
 best+0.016
 two+0.016
 exact+0.015
 biggest+0.015
 certainly+0.014
Neg logit contributions
ho-0.034
olph-0.027
 throats-0.027
 inspection-0.026
keeping-0.026
Token what
Token ID644
Feature activation-1.51e+00

Pos logit contributions
 best+0.096
 two+0.093
 biggest+0.086
 exact+0.086
 certainly+0.080
Neg logit contributions
ho-0.140
olph-0.111
 inspection-0.109
keeping-0.107
 throats-0.106
Token to
Token ID284
Feature activation+1.18e+01

Pos logit contributions
 best+0.046
 two+0.044
 biggest+0.042
 exact+0.041
 certainly+0.041
Neg logit contributions
ho-0.033
 throats-0.023
keeping-0.023
olph-0.022
round-0.022
Token do
Token ID466
Feature activation-2.98e+00

Pos logit contributions
ho+0.372
 inspection+0.309
olph+0.294
keeping+0.285
 throats+0.276
Neg logit contributions
 best-0.218
amed-0.205
 two-0.204
 exact-0.201
 biggest-0.200
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
 best+0.062
 two+0.060
 exact+0.057
 biggest+0.056
amed+0.055
Neg logit contributions
ho-0.088
 inspection-0.071
olph-0.069
 throats-0.066
keeping-0.066
Token He
Token ID679
Feature activation-9.86e-02

Pos logit contributions
 best+0.024
 two+0.023
 exact+0.022
 biggest+0.022
amed+0.022
Neg logit contributions
ho-0.038
 inspection-0.031
olph-0.030
keeping-0.029
 throats-0.028
Token just
Token ID655
Feature activation+2.39e+00

Pos logit contributions
 best+0.002
 two+0.002
 exact+0.002
 biggest+0.002
amed+0.002
Neg logit contributions
ho-0.003
 inspection-0.002
olph-0.002
keeping-0.002
 throats-0.002
Token stayed
Token ID9658
Feature activation-5.97e+00

Pos logit contributions
ho+0.066
keeping+0.054
olph+0.054
 inspection+0.053
round+0.051
Neg logit contributions
 two-0.052
 best-0.050
 exact-0.047
amed-0.046
 certainly-0.046
Token tried
Token ID3088
Feature activation-3.43e+00

Pos logit contributions
ho+0.033
 inspection+0.027
olph+0.025
keeping+0.024
 throats+0.024
Neg logit contributions
 best-0.025
 two-0.024
 exact-0.024
 biggest-0.024
amed-0.023
Token to
Token ID284
Feature activation+1.29e+01

Pos logit contributions
 two+0.101
 best+0.097
 right+0.085
 biggest+0.083
 most+0.082
Neg logit contributions
ho-0.085
olph-0.064
keeping-0.063
 throats-0.060
round-0.060
Token stand
Token ID1302
Feature activation+5.16e-01

Pos logit contributions
ho+0.349
 inspection+0.274
olph+0.268
keeping+0.254
 throats+0.252
Neg logit contributions
 best-0.300
 two-0.296
 exact-0.284
 biggest-0.278
amed-0.273
Token up
Token ID510
Feature activation+1.51e+00

Pos logit contributions
ho+0.015
olph+0.013
keeping+0.013
 inspection+0.013
 throats+0.012
Neg logit contributions
 best-0.011
 two-0.010
 biggest-0.010
 exact-0.010
 most-0.009
Token,
Token ID11
Feature activation+1.30e+01

Pos logit contributions
ho+0.044
olph+0.038
keeping+0.037
 throats+0.037
anged+0.035
Neg logit contributions
 two-0.034
 best-0.033
 right-0.029
 most-0.029
 exact-0.028
Token but
Token ID475
Feature activation+8.04e+00

Pos logit contributions
ho+0.348
keeping+0.267
olph+0.265
 inspection+0.249
 throats+0.245
Neg logit contributions
 two-0.322
 best-0.316
 exact-0.308
 biggest-0.291
 most-0.286
Token he
Token ID339
Feature activation+3.48e+00

Pos logit contributions
ho+0.185
keeping+0.145
 inspection+0.141
olph+0.140
round+0.136
Neg logit contributions
 two-0.228
 best-0.222
 most-0.199
 biggest-0.194
 exact-0.194
Token was
Token ID373
Feature activation-8.55e+00

Pos logit contributions
ho+0.099
 inspection+0.083
keeping+0.081
olph+0.078
 throats+0.076
Neg logit contributions
 best-0.074
 two-0.072
 certainly-0.066
amed-0.065
 biggest-0.064
Token too
Token ID1165
Feature activation+5.37e+00

Pos logit contributions
 best+0.173
 exact+0.167
 biggest+0.166
amed+0.163
ceed+0.160
Neg logit contributions
ho-0.240
 inspection-0.194
olph-0.190
 throats-0.178
ipop-0.176
Token scared
Token ID12008
Feature activation-2.06e+00

Pos logit contributions
ho+0.154
 inspection+0.123
olph+0.119
keeping+0.113
 throats+0.112
Neg logit contributions
 best-0.114
 exact-0.107
 two-0.107
 biggest-0.106
amed-0.105
Token.
Token ID13
Feature activation-4.05e-01

Pos logit contributions
 best+0.037
 two+0.036
 exact+0.031
 biggest+0.030
 right+0.029
Neg logit contributions
ho-0.069
olph-0.057
 throats-0.056
keeping-0.055
 inspection-0.055
Token and
Token ID290
Feature activation+1.47e+01

Pos logit contributions
 best+0.129
 two+0.123
 exact+0.115
 biggest+0.115
 most+0.111
Neg logit contributions
ho-0.122
 inspection-0.091
keeping-0.089
olph-0.088
 throats-0.084
Token didn
Token ID1422
Feature activation-8.12e+00

Pos logit contributions
ho+0.417
 inspection+0.339
keeping+0.324
olph+0.323
round+0.314
Neg logit contributions
 best-0.329
 two-0.319
 exact-0.298
 biggest-0.293
amed-0.286
Token't
Token ID470
Feature activation-7.88e-01

Pos logit contributions
 best+0.212
amed+0.208
 exact+0.208
 biggest+0.204
 nic+0.203
Neg logit contributions
ho-0.179
 inspection-0.130
olph-0.130
othe-0.122
keeping-0.117
Token know
Token ID760
Feature activation-4.35e+00

Pos logit contributions
 best+0.016
 two+0.015
 exact+0.014
 biggest+0.014
amed+0.014
Neg logit contributions
ho-0.024
 inspection-0.020
olph-0.019
keeping-0.018
 throats-0.018
Token what
Token ID644
Feature activation-1.42e+00

Pos logit contributions
 best+0.081
 two+0.079
 biggest+0.068
 exact+0.066
 most+0.063
Neg logit contributions
ho-0.148
keeping-0.121
olph-0.117
 throats-0.116
round-0.114
Token to
Token ID284
Feature activation+1.20e+01

Pos logit contributions
 best+0.045
 two+0.043
 biggest+0.041
 exact+0.040
 smallest+0.039
Neg logit contributions
ho-0.030
keeping-0.021
round-0.020
 throats-0.020
 inspection-0.019
Token do
Token ID466
Feature activation-2.66e+00

Pos logit contributions
ho+0.371
 inspection+0.308
olph+0.299
keeping+0.286
 throats+0.282
Neg logit contributions
 best-0.222
amed-0.210
 two-0.209
 biggest-0.207
 exact-0.206
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
 best+0.059
 two+0.056
 exact+0.056
 biggest+0.056
amed+0.055
Neg logit contributions
ho-0.072
 inspection-0.058
olph-0.056
keeping-0.054
 throats-0.052
Token\n
Token ID198
Feature activation-6.67e+00

Pos logit contributions
 best+0.021
 two+0.021
 biggest+0.020
 exact+0.020
amed+0.019
Neg logit contributions
ho-0.034
 inspection-0.027
olph-0.027
keeping-0.026
 throats-0.026
Token\n
Token ID198
Feature activation-6.70e+00

Pos logit contributions
 best+0.129
 exact+0.127
 biggest+0.122
 nic+0.119
amed+0.119
Neg logit contributions
ho-0.199
 inspection-0.171
olph-0.163
keeping-0.151
othe-0.148
TokenThen
Token ID6423
Feature activation+2.86e+00

Pos logit contributions
 best+0.131
 two+0.123
 exact+0.121
 biggest+0.120
amed+0.117
Neg logit contributions
ho-0.206
 inspection-0.168
olph-0.164
keeping-0.157
 throats-0.154
Token dog
Token ID3290
Feature activation+7.67e+00

Pos logit contributions
 best+0.210
 two+0.203
 exact+0.196
 biggest+0.195
amed+0.189
Neg logit contributions
ho-0.224
 inspection-0.177
olph-0.170
keeping-0.164
 throats-0.160
Token runs
Token ID4539
Feature activation-1.43e+00

Pos logit contributions
ho+0.250
keeping+0.221
 inspection+0.215
olph+0.213
round+0.200
Neg logit contributions
 best-0.126
 two-0.125
 exact-0.110
 teaches-0.109
 biggest-0.108
Token to
Token ID284
Feature activation+1.29e+01

Pos logit contributions
 two+0.034
 best+0.033
 right+0.030
amed+0.030
 same+0.029
Neg logit contributions
ho-0.035
olph-0.030
ipop-0.030
 throats-0.029
keeping-0.029
Token the
Token ID262
Feature activation-5.78e+00

Pos logit contributions
ho+0.348
 inspection+0.268
olph+0.266
 throats+0.254
keeping+0.252
Neg logit contributions
 two-0.297
 best-0.295
 exact-0.275
 biggest-0.273
 certainly-0.263
Token little
Token ID1310
Feature activation+2.61e+00

Pos logit contributions
 best+0.142
 two+0.141
 biggest+0.135
 exact+0.133
 most+0.125
Neg logit contributions
ho-0.146
 inspection-0.121
olph-0.114
 throats-0.105
ipop-0.105
Token house
Token ID2156
Feature activation+1.08e+01

Pos logit contributions
ho+0.080
 inspection+0.067
olph+0.063
 throats+0.063
keeping+0.061
Neg logit contributions
 two-0.050
 best-0.049
 exact-0.046
 biggest-0.046
 most-0.042
Token.
Token ID13
Feature activation-7.70e-01

Pos logit contributions
ho+0.300
olph+0.240
 inspection+0.240
keeping+0.224
 throats+0.224
Neg logit contributions
 best-0.237
 two-0.233
 exact-0.221
 biggest-0.213
amed-0.210
Token Lily
Token ID20037
Feature activation-2.53e+00

Pos logit contributions
 best+0.017
 two+0.016
 exact+0.016
 biggest+0.015
amed+0.015
Neg logit contributions
ho-0.022
 inspection-0.018
olph-0.017
keeping-0.016
 throats-0.016
Token and
Token ID290
Feature activation+1.46e+01

Pos logit contributions
 best+0.046
 two+0.045
 exact+0.042
 biggest+0.041
amed+0.040
Neg logit contributions
ho-0.080
 inspection-0.067
olph-0.065
keeping-0.064
 throats-0.062
Token Tom
Token ID4186
Feature activation+8.25e-01

Pos logit contributions
ho+0.454
 inspection+0.376
keeping+0.363
olph+0.362
round+0.361
Neg logit contributions
 two-0.302
 best-0.292
 biggest-0.254
 exact-0.238
 teaches-0.234
Token are
Token ID389
Feature activation-8.73e+00

Pos logit contributions
ho+0.026
 inspection+0.021
olph+0.021
keeping+0.020
 throats+0.020
Neg logit contributions
 best-0.016
 two-0.015
 exact-0.014
 biggest-0.014
amed-0.014
TokenShe
Token ID3347
Feature activation+4.35e-01

Pos logit contributions
 best+0.111
 biggest+0.102
 exact+0.101
 two+0.098
 nic+0.096
Neg logit contributions
ho-0.202
 inspection-0.166
olph-0.163
keeping-0.157
 throats-0.152
Token took
Token ID1718
Feature activation-9.60e+00

Pos logit contributions
ho+0.011
olph+0.008
 inspection+0.008
ipop+0.007
 throats+0.007
Neg logit contributions
 best-0.011
 biggest-0.011
 exact-0.011
 two-0.010
ceed-0.010
Token their
Token ID511
Feature activation-3.59e+00

Pos logit contributions
 best+0.208
 two+0.204
 biggest+0.191
 exact+0.188
amed+0.187
Neg logit contributions
ho-0.278
 inspection-0.224
olph-0.217
keeping-0.208
 throats-0.205
Token paper
Token ID3348
Feature activation+3.60e+00

Pos logit contributions
 two+0.082
 best+0.082
 biggest+0.074
 exact+0.069
 most+0.069
Neg logit contributions
ho-0.106
 inspection-0.088
round-0.084
 Gig-0.081
olph-0.081
Token things
Token ID1243
Feature activation+4.88e+00

Pos logit contributions
ho+0.101
 inspection+0.092
olph+0.086
round+0.082
 Gig+0.082
Neg logit contributions
 two-0.082
 best-0.077
amed-0.069
 biggest-0.067
 nic-0.064
Token and
Token ID290
Feature activation+1.45e+01

Pos logit contributions
ho+0.140
 inspection+0.128
olph+0.126
keeping+0.120
 throats+0.118
Neg logit contributions
 two-0.102
 best-0.102
 right-0.092
amed-0.082
 most-0.081
Token threw
Token ID9617
Feature activation-6.03e+00

Pos logit contributions
ho+0.483
 inspection+0.405
round+0.387
olph+0.382
ule+0.366
Neg logit contributions
 two-0.285
 best-0.260
amed-0.229
 teaches-0.228
 nic-0.222
Token them
Token ID606
Feature activation+1.25e-01

Pos logit contributions
 two+0.131
 best+0.126
 biggest+0.114
 exact+0.108
amed+0.107
Neg logit contributions
ho-0.179
olph-0.146
 inspection-0.143
keeping-0.138
round-0.137
Token in
Token ID287
Feature activation+3.83e+00

Pos logit contributions
ho+0.004
keeping+0.003
olph+0.003
 inspection+0.003
ule+0.003
Neg logit contributions
 best-0.003
 two-0.003
 right-0.002
amed-0.002
 biggest-0.002
Token the
Token ID262
Feature activation-6.43e+00

Pos logit contributions
ho+0.107
 inspection+0.085
olph+0.082
keeping+0.079
 throats+0.077
Neg logit contributions
 best-0.086
 two-0.084
 exact-0.080
 biggest-0.079
amed-0.078
Token trash
Token ID13913
Feature activation-7.05e-01

Pos logit contributions
 best+0.128
 two+0.123
 exact+0.119
 biggest+0.118
amed+0.112
Neg logit contributions
ho-0.199
 inspection-0.164
olph-0.159
keeping-0.152
 throats-0.146
TokenL
Token ID43
Feature activation-9.22e+00

Pos logit contributions
 exact+0.089
 best+0.087
 biggest+0.085
amed+0.083
 nic+0.083
Neg logit contributions
ho-0.131
olph-0.112
keeping-0.106
 inspection-0.105
 crab-0.095
Tokenily
Token ID813
Feature activation-8.76e-01

Pos logit contributions
 best+0.198
 exact+0.194
 biggest+0.191
 nic+0.186
ceed+0.186
Neg logit contributions
ho-0.259
 inspection-0.214
olph-0.205
keeping-0.195
ipop-0.189
Token liked
Token ID8288
Feature activation-8.20e+00

Pos logit contributions
 best+0.017
 two+0.017
 exact+0.015
amed+0.015
 teaches+0.015
Neg logit contributions
ho-0.027
 inspection-0.023
keeping-0.021
 throats-0.021
olph-0.021
Token to
Token ID284
Feature activation+1.17e+01

Pos logit contributions
 two+0.209
 best+0.191
 biggest+0.174
 four+0.169
 a+0.167
Neg logit contributions
ho-0.214
 inspection-0.190
 spine-0.179
anged-0.173
keeping-0.170
Token design
Token ID1486
Feature activation-1.93e+00

Pos logit contributions
ho+0.344
 inspection+0.266
olph+0.258
keeping+0.254
 throats+0.241
Neg logit contributions
 two-0.262
 best-0.249
 exact-0.238
 biggest-0.235
 teaches-0.227
Token things
Token ID1243
Feature activation+4.64e+00

Pos logit contributions
amed+0.054
 exact+0.052
 certainly+0.052
ceed+0.051
 smallest+0.050
Neg logit contributions
ho-0.032
 inspection-0.027
 he-0.024
 throats-0.024
 they-0.023
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
ho+0.120
 inspection+0.093
olph+0.090
keeping+0.086
 throats+0.084
Neg logit contributions
 best-0.108
 two-0.104
 exact-0.104
amed-0.104
 biggest-0.103
Token She
Token ID1375
Feature activation-3.81e-01

Pos logit contributions
 best+0.030
amed+0.029
 two+0.028
 biggest+0.028
 exact+0.028
Neg logit contributions
ho-0.035
 inspection-0.028
olph-0.026
 throats-0.026
keeping-0.025
Token used
Token ID973
Feature activation-8.00e+00

Pos logit contributions
 best+0.009
 exact+0.009
 biggest+0.009
 Deal+0.009
ceed+0.009
Neg logit contributions
ho-0.010
olph-0.008
 inspection-0.007
 throats-0.006
anged-0.006
Token paper
Token ID3348
Feature activation+3.45e+00

Pos logit contributions
 best+0.188
 two+0.183
 biggest+0.173
 exact+0.173
amed+0.169
Neg logit contributions
ho-0.217
 inspection-0.169
olph-0.166
keeping-0.160
round-0.156
Token,
Token ID11
Feature activation+1.20e+01

Pos logit contributions
ho+0.078
olph+0.062
 inspection+0.060
keeping+0.060
round+0.059
Neg logit contributions
 best-0.098
 two-0.096
 biggest-0.089
 most-0.085
amed-0.084
Token"
Token ID1
Feature activation-1.67e+00

Pos logit contributions
 best+0.119
 biggest+0.111
 exact+0.110
 two+0.105
 nic+0.103
Neg logit contributions
ho-0.209
olph-0.169
 inspection-0.168
keeping-0.159
 throats-0.158
TokenI
Token ID40
Feature activation-4.43e+00

Pos logit contributions
 best+0.029
 exact+0.027
 biggest+0.027
 two+0.026
amed+0.026
Neg logit contributions
ho-0.055
olph-0.045
 inspection-0.044
 throats-0.042
keeping-0.042
Token am
Token ID716
Feature activation-7.47e+00

Pos logit contributions
 best+0.093
 exact+0.090
 biggest+0.089
 two+0.088
amed+0.088
Neg logit contributions
ho-0.129
 inspection-0.101
olph-0.100
 throats-0.094
keeping-0.094
Token sorry
Token ID7926
Feature activation-1.77e+00

Pos logit contributions
 exact+0.167
amed+0.159
 biggest+0.158
 best+0.157
ceed+0.153
Neg logit contributions
ho-0.183
olph-0.151
 Gig-0.141
 throats-0.139
 inspection-0.138
Token,
Token ID11
Feature activation+1.19e+01

Pos logit contributions
 best+0.049
 two+0.047
 biggest+0.045
 exact+0.045
 certainly+0.043
Neg logit contributions
ho-0.041
 inspection-0.031
olph-0.030
keeping-0.030
 throats-0.028
Token Ben
Token ID3932
Feature activation+2.58e-01

Pos logit contributions
ho+0.284
 inspection+0.203
 throats+0.192
olph+0.191
ipop+0.183
Neg logit contributions
amed-0.317
 exact-0.302
ceed-0.299
 best-0.298
 biggest-0.288
Token.
Token ID13
Feature activation-1.40e+00

Pos logit contributions
ho+0.006
olph+0.005
 inspection+0.005
keeping+0.005
 throats+0.004
Neg logit contributions
 best-0.006
amed-0.006
 exact-0.006
 biggest-0.006
ceed-0.006
Token I
Token ID314
Feature activation-2.26e+00

Pos logit contributions
 best+0.025
 two+0.024
 biggest+0.023
 exact+0.022
 teaches+0.021
Neg logit contributions
ho-0.046
 inspection-0.040
olph-0.037
keeping-0.036
round-0.035
Token should
Token ID815
Feature activation+2.43e+00

Pos logit contributions
 best+0.046
 two+0.044
 exact+0.044
 biggest+0.044
amed+0.042
Neg logit contributions
ho-0.067
 inspection-0.053
olph-0.052
keeping-0.049
 throats-0.049
Token not
Token ID407
Feature activation-1.56e+00

Pos logit contributions
ho+0.074
 inspection+0.059
olph+0.059
keeping+0.056
 throats+0.056
Neg logit contributions
 best-0.047
amed-0.044
 exact-0.044
 biggest-0.044
 two-0.044
Token have
Token ID423
Feature activation-6.79e+00

Pos logit contributions
 best+0.031
 two+0.030
 exact+0.029
 biggest+0.028
amed+0.028
Neg logit contributions
ho-0.047
 inspection-0.039
olph-0.037
keeping-0.036
 throats-0.035
Token little
Token ID1310
Feature activation+1.70e+00

Pos logit contributions
 best+0.154
 two+0.149
 biggest+0.139
 exact+0.131
amed+0.123
Neg logit contributions
ho-0.324
 inspection-0.277
olph-0.263
keeping-0.261
 throats-0.256
Token boy
Token ID2933
Feature activation+3.46e+00

Pos logit contributions
ho+0.055
 inspection+0.046
keeping+0.044
 throats+0.043
olph+0.043
Neg logit contributions
 two-0.033
 best-0.033
 biggest-0.030
 exact-0.028
 most-0.027
Token said
Token ID531
Feature activation-7.46e+00

Pos logit contributions
ho+0.117
 inspection+0.100
keeping+0.099
 throats+0.094
olph+0.093
Neg logit contributions
 best-0.059
 two-0.058
 biggest-0.049
amed-0.048
 exact-0.048
Token,
Token ID11
Feature activation+1.16e+01

Pos logit contributions
 best+0.182
 two+0.173
 biggest+0.157
 most+0.144
 exact+0.137
Neg logit contributions
ho-0.196
keeping-0.186
olph-0.164
round-0.163
 throats-0.160
Token "
Token ID366
Feature activation-3.74e+00

Pos logit contributions
ho+0.430
keeping+0.384
 inspection+0.367
 throats+0.366
away+0.362
Neg logit contributions
 two-0.152
 best-0.143
 biggest-0.117
 exact-0.111
 certainly-0.109
TokenMom
Token ID29252
Feature activation+5.86e+00

Pos logit contributions
 best+0.069
 two+0.063
 exact+0.062
 biggest+0.062
amed+0.059
Neg logit contributions
ho-0.121
 inspection-0.098
olph-0.097
keeping-0.093
round-0.091
Token,
Token ID11
Feature activation+1.16e+01

Pos logit contributions
ho+0.162
 inspection+0.125
olph+0.124
keeping+0.113
ipop+0.113
Neg logit contributions
 best-0.126
 exact-0.125
amed-0.122
 biggest-0.121
ceed-0.119
Token can
Token ID460
Feature activation+2.95e+00

Pos logit contributions
ho+0.374
keeping+0.308
 inspection+0.306
olph+0.306
 throats+0.297
Neg logit contributions
 best-0.222
 two-0.215
 biggest-0.201
 exact-0.193
 certainly-0.182
Token I
Token ID314
Feature activation-2.72e+00

Pos logit contributions
ho+0.081
 inspection+0.065
keeping+0.062
olph+0.062
 throats+0.059
Neg logit contributions
 best-0.068
 two-0.067
 biggest-0.063
 exact-0.063
 certainly-0.060
Token rub
Token ID6437
Feature activation-5.84e-01

Pos logit contributions
 best+0.056
 two+0.054
 exact+0.051
 biggest+0.051
 certainly+0.049
Neg logit contributions
ho-0.081
 inspection-0.068
olph-0.065
keeping-0.063
 throats-0.063
Token the
Token ID262
Feature activation-6.94e+00

Pos logit contributions
 best+0.013
 two+0.012
 biggest+0.011
 exact+0.011
 most+0.011
Neg logit contributions
ho-0.017
 inspection-0.014
keeping-0.014
olph-0.014
anged-0.013
Token walking
Token ID6155
Feature activation-1.32e+00

Pos logit contributions
 best+0.160
 exact+0.159
amed+0.158
ceed+0.157
 teaches+0.156
Neg logit contributions
ho-0.288
olph-0.228
 inspection-0.223
 throats-0.215
 Gig-0.212
Token near
Token ID1474
Feature activation+1.96e+00

Pos logit contributions
 two+0.027
 best+0.026
 most+0.023
amed+0.023
 biggest+0.022
Neg logit contributions
ho-0.042
 inspection-0.035
 throats-0.035
 tongues-0.035
ipop-0.034
Token her
Token ID607
Feature activation-4.38e+00

Pos logit contributions
ho+0.056
 inspection+0.047
olph+0.045
keeping+0.044
 throats+0.042
Neg logit contributions
 best-0.040
 exact-0.037
amed-0.037
 biggest-0.037
 certainly-0.036
Token house
Token ID2156
Feature activation+9.80e+00

Pos logit contributions
 best+0.091
 two+0.085
 biggest+0.081
 exact+0.076
 most+0.074
Neg logit contributions
ho-0.133
 inspection-0.117
olph-0.111
keeping-0.106
ipop-0.104
Token when
Token ID618
Feature activation+5.32e+00

Pos logit contributions
ho+0.257
 inspection+0.202
olph+0.196
keeping+0.188
 throats+0.184
Neg logit contributions
 best-0.237
 two-0.228
 exact-0.221
 biggest-0.219
amed-0.215
Token she
Token ID673
Feature activation+1.56e+00

Pos logit contributions
ho+0.125
round+0.095
olph+0.092
ipop+0.089
keeping+0.089
Neg logit contributions
 two-0.151
 best-0.150
 biggest-0.132
 most-0.131
 exact-0.129
Token noticed
Token ID6810
Feature activation-6.89e+00

Pos logit contributions
ho+0.044
 inspection+0.036
keeping+0.036
olph+0.035
 throats+0.034
Neg logit contributions
 best-0.034
 two-0.033
 biggest-0.031
 exact-0.030
amed-0.030
Token her
Token ID607
Feature activation-4.23e+00

Pos logit contributions
 two+0.101
 best+0.099
 biggest+0.086
 exact+0.083
 most+0.078
Neg logit contributions
ho-0.254
 inspection-0.211
olph-0.210
keeping-0.205
 throats-0.202
Token elderly
Token ID13830
Feature activation-2.62e+00

Pos logit contributions
 best+0.095
 two+0.086
 biggest+0.081
 most+0.078
 nic+0.073
Neg logit contributions
ho-0.126
 inspection-0.116
olph-0.106
round-0.104
keeping-0.101
Token neighbor
Token ID4780
Feature activation+8.71e+00

Pos logit contributions
 best+0.061
 two+0.059
 biggest+0.054
 exact+0.053
 smallest+0.051
Neg logit contributions
ho-0.073
 inspection-0.061
keeping-0.054
olph-0.054
 throats-0.053
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
ho+0.256
 inspection+0.215
olph+0.203
keeping+0.202
 throats+0.197
Neg logit contributions
 best-0.188
 two-0.182
 exact-0.171
 biggest-0.163
 most-0.155
Token and
Token ID290
Feature activation+1.47e+01

Pos logit contributions
 best+0.122
 two+0.116
 exact+0.116
 biggest+0.114
amed+0.112
Neg logit contributions
ho-0.126
 inspection-0.098
olph-0.094
keeping-0.091
 throats-0.088
Token a
Token ID257
Feature activation-6.60e+00

Pos logit contributions
ho+0.388
 inspection+0.305
olph+0.291
keeping+0.276
 throats+0.272
Neg logit contributions
 best-0.349
 exact-0.329
 two-0.326
 biggest-0.326
amed-0.321
Token spark
Token ID9009
Feature activation-3.76e+00

Pos logit contributions
 best+0.133
 two+0.127
 exact+0.125
amed+0.123
 biggest+0.122
Neg logit contributions
ho-0.195
 inspection-0.157
olph-0.153
keeping-0.146
 throats-0.145
Tokenly
Token ID306
Feature activation+3.71e+00

Pos logit contributions
 best+0.096
 two+0.093
 exact+0.091
 biggest+0.091
amed+0.090
Neg logit contributions
ho-0.092
 inspection-0.070
olph-0.068
keeping-0.063
 throats-0.062
Token blue
Token ID4171
Feature activation-2.13e+00

Pos logit contributions
ho+0.112
 inspection+0.091
olph+0.088
keeping+0.085
 throats+0.083
Neg logit contributions
 best-0.073
 two-0.069
 exact-0.068
 biggest-0.067
amed-0.067
Token light
Token ID1657
Feature activation+2.85e-01

Pos logit contributions
 best+0.046
 two+0.044
 exact+0.043
amed+0.043
 biggest+0.043
Neg logit contributions
ho-0.060
 inspection-0.047
olph-0.046
keeping-0.044
 throats-0.044
Token appeared
Token ID4120
Feature activation-8.91e+00

Pos logit contributions
ho+0.008
 inspection+0.006
olph+0.006
keeping+0.006
 throats+0.006
Neg logit contributions
 best-0.006
 two-0.006
 exact-0.006
 biggest-0.006
amed-0.006
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
 best+0.224
 exact+0.220
 biggest+0.216
 teaches+0.213
 nic+0.212
Neg logit contributions
ho-0.216
 inspection-0.155
keeping-0.151
olph-0.140
away-0.138
Token An
Token ID1052
Feature activation-6.66e+00

Pos logit contributions
 best+0.029
amed+0.027
 two+0.026
 biggest+0.026
 exact+0.026
Neg logit contributions
ho-0.032
 inspection-0.026
keeping-0.025
olph-0.025
 Gig-0.024
Token elf
Token ID23878
Feature activation+7.79e+00

Pos logit contributions
 best+0.135
 two+0.128
amed+0.127
 biggest+0.124
ceed+0.123
Neg logit contributions
ho-0.189
olph-0.154
 inspection-0.151
round-0.147
away-0.144
Token popped
Token ID22928
Feature activation-5.66e+00

Pos logit contributions
ho+0.215
 inspection+0.173
 throats+0.167
keeping+0.166
olph+0.166
Neg logit contributions
 best-0.170
 two-0.168
 exact-0.157
amed-0.153
 certainly-0.153
Token But
Token ID887
Feature activation+2.47e+00

Pos logit contributions
 best+0.026
 exact+0.024
 two+0.024
 biggest+0.024
amed+0.023
Neg logit contributions
ho-0.034
 inspection-0.028
olph-0.027
keeping-0.026
 throats-0.026
Token Mia
Token ID32189
Feature activation-2.25e-01

Pos logit contributions
ho+0.065
keeping+0.058
olph+0.054
round+0.053
 inspection+0.051
Neg logit contributions
 best-0.061
 two-0.059
 most-0.052
 biggest-0.051
 certainly-0.050
Token didn
Token ID1422
Feature activation-8.10e+00

Pos logit contributions
 best+0.005
 two+0.004
 exact+0.004
 biggest+0.004
amed+0.004
Neg logit contributions
ho-0.007
 inspection-0.006
olph-0.005
keeping-0.005
 throats-0.005
Token't
Token ID470
Feature activation-8.25e-01

Pos logit contributions
 best+0.210
 exact+0.205
amed+0.205
 biggest+0.202
 nic+0.199
Neg logit contributions
ho-0.184
 inspection-0.135
olph-0.132
round-0.121
othe-0.121
Token listen
Token ID6004
Feature activation-4.70e-01

Pos logit contributions
 best+0.020
 two+0.019
 exact+0.018
 biggest+0.017
 certainly+0.017
Neg logit contributions
ho-0.023
 inspection-0.018
olph-0.017
keeping-0.017
 throats-0.016
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
 best+0.010
 two+0.010
 exact+0.009
 biggest+0.008
 teaches+0.008
Neg logit contributions
ho-0.014
olph-0.011
 inspection-0.011
aro-0.011
keeping-0.010
Token
Token ID198
Feature activation-6.64e+00

Pos logit contributions
 best+0.026
 exact+0.025
amed+0.024
 biggest+0.024
 two+0.024
Neg logit contributions
ho-0.033
 inspection-0.027
olph-0.025
keeping-0.024
 throats-0.024
Token
Token ID198
Feature activation-6.67e+00

Pos logit contributions
 best+0.130
 exact+0.129
 biggest+0.124
 nic+0.121
amed+0.120
Neg logit contributions
ho-0.198
 inspection-0.170
olph-0.160
keeping-0.149
othe-0.148
TokenWhen
Token ID2215
Feature activation+2.09e+00

Pos logit contributions
 best+0.128
 biggest+0.120
 exact+0.119
 two+0.112
 nic+0.112
Neg logit contributions
ho-0.209
 inspection-0.168
olph-0.160
 throats-0.153
round-0.152
Token they
Token ID484
Feature activation+2.29e+00

Pos logit contributions
ho+0.056
keeping+0.050
round+0.049
olph+0.047
gie+0.045
Neg logit contributions
 two-0.055
 best-0.055
 biggest-0.045
 most-0.044
 certainly-0.042
Token got
Token ID1392
Feature activation-8.37e+00

Pos logit contributions
ho+0.059
 inspection+0.046
olph+0.044
keeping+0.042
 throats+0.041
Neg logit contributions
 best-0.057
 two-0.054
 exact-0.053
 biggest-0.053
amed-0.052
Token for
Token ID329
Feature activation-2.98e+00

Pos logit contributions
 best+0.083
 two+0.080
 exact+0.077
 biggest+0.076
amed+0.074
Neg logit contributions
ho-0.108
 inspection-0.085
olph-0.083
keeping-0.081
 throats-0.079
Token a
Token ID257
Feature activation-6.72e+00

Pos logit contributions
 best+0.071
 two+0.070
 biggest+0.065
 exact+0.064
 certainly+0.060
Neg logit contributions
ho-0.085
 inspection-0.062
olph-0.062
round-0.059
keeping-0.059
Token tutor
Token ID48680
Feature activation+9.17e+00

Pos logit contributions
 best+0.126
 two+0.115
 biggest+0.112
 exact+0.105
 most+0.097
Neg logit contributions
ho-0.221
keeping-0.185
olph-0.183
 throats-0.183
 inspection-0.183
Token.
Token ID13
Feature activation-7.94e-01

Pos logit contributions
ho+0.280
 inspection+0.229
olph+0.221
 throats+0.218
keeping+0.207
Neg logit contributions
 best-0.183
 two-0.177
 exact-0.169
 biggest-0.160
amed-0.155
Token He
Token ID679
Feature activation+1.08e-01

Pos logit contributions
 best+0.016
 two+0.016
 exact+0.015
 biggest+0.015
amed+0.014
Neg logit contributions
ho-0.024
 inspection-0.019
olph-0.018
 throats-0.018
keeping-0.018
Token looked
Token ID3114
Feature activation-6.44e+00

Pos logit contributions
ho+0.003
 inspection+0.002
olph+0.002
keeping+0.002
 throats+0.002
Neg logit contributions
 best-0.002
 two-0.002
 exact-0.002
amed-0.002
 biggest-0.002
Token high
Token ID1029
Feature activation+1.53e-01

Pos logit contributions
 best+0.138
 two+0.130
 biggest+0.121
 exact+0.119
 most+0.118
Neg logit contributions
ho-0.202
keeping-0.174
olph-0.166
 throats-0.160
 inspection-0.157
Token and
Token ID290
Feature activation+1.49e+01

Pos logit contributions
ho+0.004
olph+0.003
 inspection+0.003
 throats+0.003
aro+0.003
Neg logit contributions
 best-0.004
 two-0.004
 exact-0.003
 biggest-0.003
 most-0.003
Token low
Token ID1877
Feature activation-2.57e+00

Pos logit contributions
ho+0.417
olph+0.332
 inspection+0.332
keeping+0.328
 throats+0.319
Neg logit contributions
 best-0.331
 two-0.321
 exact-0.307
 biggest-0.305
 nic-0.296
Token and
Token ID290
Feature activation+1.47e+01

Pos logit contributions
 best+0.066
 two+0.064
 exact+0.059
 biggest+0.058
 most+0.056
Neg logit contributions
ho-0.066
olph-0.051
 throats-0.049
keeping-0.048
round-0.048
Token eventually
Token ID4191
Feature activation+4.31e+00

Pos logit contributions
ho+0.384
olph+0.306
 inspection+0.304
keeping+0.302
 throats+0.295
Neg logit contributions
 best-0.353
 two-0.348
 exact-0.325
 biggest-0.324
 nic-0.322
Token on
Token ID319
Feature activation+5.00e+00

Pos logit contributions
 best+0.156
 two+0.154
amed+0.139
 biggest+0.139
 exact+0.136
Neg logit contributions
ho-0.190
 inspection-0.156
olph-0.152
keeping-0.152
 throats-0.145
Token the
Token ID262
Feature activation-6.70e+00

Pos logit contributions
ho+0.165
olph+0.139
keeping+0.136
 inspection+0.135
anged+0.135
Neg logit contributions
 two-0.097
 best-0.087
 biggest-0.076
 most-0.072
 exact-0.071
Token table
Token ID3084
Feature activation+1.56e+00

Pos logit contributions
 best+0.164
 exact+0.158
amed+0.156
 certainly+0.153
 two+0.150
Neg logit contributions
ho-0.161
olph-0.118
 inspection-0.118
 throats-0.114
keeping-0.111
Token and
Token ID290
Feature activation+1.41e+01

Pos logit contributions
ho+0.038
 inspection+0.030
olph+0.030
keeping+0.028
 throats+0.027
Neg logit contributions
 best-0.040
 two-0.039
 exact-0.037
 biggest-0.036
amed-0.036
Token grabbed
Token ID13176
Feature activation-8.42e+00

Pos logit contributions
ho+0.408
keeping+0.360
 inspection+0.356
away+0.346
round+0.337
Neg logit contributions
 two-0.290
 best-0.255
 nic-0.232
 most-0.224
amed-0.217
Token the
Token ID262
Feature activation-6.62e+00

Pos logit contributions
 two+0.172
 best+0.163
 biggest+0.147
 exact+0.142
 most+0.136
Neg logit contributions
ho-0.267
olph-0.212
 inspection-0.211
keeping-0.209
round-0.203
Token yarn
Token ID21181
Feature activation+7.51e+00

Pos logit contributions
amed+0.180
 exact+0.170
ceed+0.170
 certainly+0.169
 best+0.165
Neg logit contributions
ho-0.128
 chips-0.089
olph-0.088
cone-0.087
ipop-0.086
Token.
Token ID13
Feature activation-1.73e+00

Pos logit contributions
ho+0.235
 inspection+0.212
olph+0.204
keeping+0.187
round+0.187
Neg logit contributions
 best-0.141
 two-0.137
 biggest-0.120
 exact-0.118
amed-0.111
Token The
Token ID383
Feature activation-9.11e+00

Pos logit contributions
 best+0.042
 two+0.042
 biggest+0.040
 exact+0.039
 certainly+0.038
Neg logit contributions
ho-0.044
 inspection-0.034
olph-0.034
keeping-0.033
 throats-0.031
Token yarn
Token ID21181
Feature activation+7.51e+00

Pos logit contributions
 exact+0.248
amed+0.246
 certainly+0.236
ceed+0.229
 best+0.228
Neg logit contributions
ho-0.180
 throats-0.126
olph-0.119
 chips-0.115
cone-0.109
Token rolled
Token ID11686
Feature activation-4.41e+00

Pos logit contributions
ho+0.228
 inspection+0.197
olph+0.191
keeping+0.190
round+0.186
Neg logit contributions
 best-0.146
 two-0.142
 biggest-0.126
amed-0.124
 certainly-0.124
Token they
Token ID484
Feature activation+2.04e+00

Pos logit contributions
ho+0.103
 inspection+0.081
olph+0.081
keeping+0.076
 throats+0.075
Neg logit contributions
 best-0.080
 two-0.077
 biggest-0.073
 exact-0.073
amed-0.070
Token couldn
Token ID3521
Feature activation-3.33e+00

Pos logit contributions
ho+0.058
 inspection+0.048
olph+0.046
keeping+0.044
 throats+0.044
Neg logit contributions
 best-0.044
 two-0.042
 exact-0.040
 biggest-0.040
amed-0.039
Token't
Token ID470
Feature activation-9.14e-01

Pos logit contributions
 best+0.083
amed+0.080
 exact+0.080
 biggest+0.079
 nic+0.077
Neg logit contributions
ho-0.083
 inspection-0.062
olph-0.060
keeping-0.056
aro-0.054
Token even
Token ID772
Feature activation+5.93e+00

Pos logit contributions
 best+0.018
 two+0.018
 exact+0.017
 biggest+0.017
 certainly+0.016
Neg logit contributions
ho-0.027
 inspection-0.022
olph-0.022
keeping-0.021
 throats-0.021
Token see
Token ID766
Feature activation-2.26e+00

Pos logit contributions
ho+0.166
olph+0.132
 inspection+0.127
round+0.123
aro+0.123
Neg logit contributions
 best-0.135
 two-0.134
 exact-0.125
 biggest-0.123
 correct-0.116
Token the
Token ID262
Feature activation-6.65e+00

Pos logit contributions
 best+0.053
 two+0.051
 biggest+0.048
 exact+0.047
 certainly+0.045
Neg logit contributions
ho-0.063
 inspection-0.049
olph-0.049
keeping-0.046
 throats-0.046
Token shore
Token ID15191
Feature activation+5.48e+00

Pos logit contributions
 best+0.144
 two+0.138
 exact+0.135
amed+0.134
 biggest+0.133
Neg logit contributions
ho-0.188
 inspection-0.149
olph-0.144
keeping-0.139
 throats-0.137
Token anymore
Token ID7471
Feature activation+6.13e+00

Pos logit contributions
ho+0.175
 inspection+0.149
olph+0.146
 throats+0.138
keeping+0.135
Neg logit contributions
 best-0.101
 two-0.096
 biggest-0.090
 exact-0.089
 most-0.082
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
ho+0.217
 inspection+0.179
olph+0.174
 throats+0.168
round+0.168
Neg logit contributions
 best-0.099
 two-0.098
 certainly-0.077
 biggest-0.076
 exact-0.076
Token Tim
Token ID5045
Feature activation+3.60e-01

Pos logit contributions
 best+0.035
 two+0.035
 biggest+0.033
 exact+0.033
 certainly+0.032
Neg logit contributions
ho-0.030
 inspection-0.022
olph-0.022
keeping-0.021
 throats-0.020
Token suddenly
Token ID6451
Feature activation+1.25e+00

Pos logit contributions
ho+0.010
 inspection+0.008
olph+0.008
keeping+0.008
 throats+0.008
Neg logit contributions
 best-0.008
 two-0.008
amed-0.007
 exact-0.007
 biggest-0.007
Token thinks
Token ID6834
Feature activation-5.84e+00

Pos logit contributions
 best+0.005
amed+0.005
 biggest+0.005
 exact+0.005
ceed+0.005
Neg logit contributions
ho-0.008
 inspection-0.007
olph-0.006
 throats-0.006
ipop-0.006
Token sorting
Token ID29407
Feature activation-9.79e-01

Pos logit contributions
 best+0.161
 two+0.157
 biggest+0.141
 most+0.134
 teaches+0.134
Neg logit contributions
ho-0.147
keeping-0.123
olph-0.110
round-0.105
 inspection-0.103
Token is
Token ID318
Feature activation-8.45e+00

Pos logit contributions
 best+0.022
 two+0.021
 exact+0.020
 biggest+0.020
amed+0.019
Neg logit contributions
ho-0.028
 inspection-0.022
olph-0.022
keeping-0.021
 throats-0.020
Token boring
Token ID14262
Feature activation-1.69e+00

Pos logit contributions
 best+0.141
 two+0.134
 exact+0.134
amed+0.132
 biggest+0.130
Neg logit contributions
ho-0.272
 inspection-0.225
olph-0.219
keeping-0.212
 throats-0.210
Token and
Token ID290
Feature activation+1.41e+01

Pos logit contributions
 best+0.040
 two+0.037
 biggest+0.035
 exact+0.035
 most+0.033
Neg logit contributions
ho-0.049
 inspection-0.038
olph-0.037
 throats-0.035
ipop-0.035
Token wants
Token ID3382
Feature activation-8.45e+00

Pos logit contributions
ho+0.449
 inspection+0.370
olph+0.360
keeping+0.349
 throats+0.342
Neg logit contributions
 best-0.261
 two-0.248
 exact-0.240
 biggest-0.236
amed-0.227
Token to
Token ID284
Feature activation+1.18e+01

Pos logit contributions
 best+0.263
 two+0.259
 biggest+0.244
 most+0.233
 exact+0.231
Neg logit contributions
ho-0.178
 inspection-0.124
olph-0.124
round-0.116
keeping-0.115
Token finish
Token ID5461
Feature activation-1.41e+00

Pos logit contributions
ho+0.284
 inspection+0.220
olph+0.212
keeping+0.203
 throats+0.199
Neg logit contributions
 best-0.296
amed-0.283
 two-0.278
 exact-0.278
 biggest-0.277
Token quickly
Token ID2952
Feature activation+2.45e+00

Pos logit contributions
 best+0.033
 two+0.032
 exact+0.031
 biggest+0.030
amed+0.029
Neg logit contributions
ho-0.039
 inspection-0.030
olph-0.029
keeping-0.028
 throats-0.028
Token.
Token ID13
Feature activation-1.56e+00

Pos logit contributions
ho+0.071
 inspection+0.057
olph+0.056
keeping+0.054
 throats+0.053
Neg logit contributions
 best-0.051
 two-0.049
 exact-0.048
 biggest-0.047
amed-0.046
Token
Token ID198
Feature activation-6.71e+00

Pos logit contributions
 best+0.031
amed+0.029
 exact+0.029
 biggest+0.028
 two+0.028
Neg logit contributions
ho-0.047
 inspection-0.039
olph-0.038
 Gig-0.035
keeping-0.035
Token her
Token ID607
Feature activation-4.88e+00

Pos logit contributions
ho+0.137
 inspection+0.108
olph+0.106
keeping+0.102
anged+0.099
Neg logit contributions
 two-0.114
 best-0.113
 exact-0.106
 biggest-0.105
 certainly-0.104
Token face
Token ID1986
Feature activation+4.20e-01

Pos logit contributions
 best+0.102
 two+0.097
 biggest+0.093
 exact+0.092
 certainly+0.087
Neg logit contributions
ho-0.150
 inspection-0.122
olph-0.117
keeping-0.113
round-0.110
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
ho+0.013
 inspection+0.011
olph+0.011
keeping+0.011
 throats+0.010
Neg logit contributions
 best-0.008
 two-0.008
 exact-0.007
 biggest-0.007
 certainly-0.006
Token<|endoftext|>
Token ID50256
Feature activation-4.25e+00

Pos logit contributions
 best+0.038
 exact+0.036
 biggest+0.035
amed+0.034
 two+0.033
Neg logit contributions
ho-0.048
 inspection-0.039
olph-0.038
 Gig-0.037
keeping-0.037
TokenOnce
Token ID7454
Feature activation+4.48e+00

Pos logit contributions
 best+0.084
 exact+0.084
 biggest+0.082
amed+0.079
 nic+0.079
Neg logit contributions
ho-0.123
olph-0.102
 inspection-0.100
keeping-0.099
 Gig-0.089
Token upon
Token ID2402
Feature activation-1.21e+01

Pos logit contributions
ho+0.133
 inspection+0.108
olph+0.104
keeping+0.098
 throats+0.097
Neg logit contributions
 best-0.090
 two-0.086
 exact-0.086
 biggest-0.084
amed-0.084
Token a
Token ID257
Feature activation-7.35e+00

Pos logit contributions
 best+0.218
amed+0.215
 exact+0.211
 teaches+0.208
 biggest+0.204
Neg logit contributions
ho-0.326
 inspection-0.295
 he-0.283
olph-0.279
keeping-0.272
Token time
Token ID640
Feature activation+7.94e+00

Pos logit contributions
 best+0.126
amed+0.125
 biggest+0.125
 two+0.123
 exact+0.123
Neg logit contributions
ho-0.230
 inspection-0.192
olph-0.185
 throats-0.174
ipop-0.171
Token there
Token ID612
Feature activation+2.70e-01

Pos logit contributions
ho+0.192
keeping+0.158
 throats+0.151
olph+0.147
 inspection+0.146
Neg logit contributions
 best-0.207
 two-0.199
 biggest-0.181
 exact-0.179
 certainly-0.176
Token was
Token ID373
Feature activation-9.54e+00

Pos logit contributions
ho+0.008
 inspection+0.007
olph+0.007
keeping+0.006
 throats+0.006
Neg logit contributions
 best-0.005
 exact-0.005
 two-0.005
 biggest-0.005
amed-0.005
Token a
Token ID257
Feature activation-7.30e+00

Pos logit contributions
 best+0.126
 exact+0.125
ceed+0.119
 teaches+0.118
 biggest+0.118
Neg logit contributions
ho-0.325
olph-0.275
 inspection-0.274
 Gig-0.263
keeping-0.260
Token new
Token ID649
Feature activation-5.77e+00

Pos logit contributions
amed+0.157
 exact+0.153
 teaches+0.137
ceed+0.136
 certainly+0.133
Neg logit contributions
ho-0.240
 inspection-0.204
keeping-0.199
 throats-0.198
 Gig-0.194
Token friends
Token ID2460
Feature activation+4.53e+00

Pos logit contributions
 best+0.085
 two+0.081
 exact+0.081
amed+0.078
 biggest+0.077
Neg logit contributions
ho-0.199
 inspection-0.167
olph-0.162
keeping-0.160
 throats-0.158
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
ho+0.119
 inspection+0.095
olph+0.091
keeping+0.089
 throats+0.086
Neg logit contributions
 best-0.101
 two-0.099
 exact-0.098
amed-0.098
 biggest-0.096
Token<|endoftext|>
Token ID50256
Feature activation-4.48e+00

Pos logit contributions
 best+0.024
 exact+0.023
 two+0.022
amed+0.022
 biggest+0.022
Neg logit contributions
ho-0.045
 inspection-0.037
olph-0.036
keeping-0.035
 throats-0.034
TokenOnce
Token ID7454
Feature activation+4.44e+00

Pos logit contributions
 exact+0.107
 best+0.103
 biggest+0.103
amed+0.101
 nic+0.101
Neg logit contributions
ho-0.107
olph-0.089
 inspection-0.088
keeping-0.087
 he-0.078
Token upon
Token ID2402
Feature activation-1.21e+01

Pos logit contributions
ho+0.126
 inspection+0.102
olph+0.097
ipop+0.091
 throats+0.088
Neg logit contributions
 best-0.093
 exact-0.093
amed-0.091
 two-0.090
ceed-0.090
Token a
Token ID257
Feature activation-7.32e+00

Pos logit contributions
 best+0.223
amed+0.223
 exact+0.222
 teaches+0.213
 biggest+0.210
Neg logit contributions
ho-0.322
 inspection-0.284
 he-0.281
olph-0.277
keeping-0.261
Token time
Token ID640
Feature activation+8.07e+00

Pos logit contributions
amed+0.137
ceed+0.134
 biggest+0.133
 exact+0.133
 two+0.131
Neg logit contributions
ho-0.218
 inspection-0.181
olph-0.171
 throats-0.162
ipop-0.159
Token,
Token ID11
Feature activation+1.18e+01

Pos logit contributions
ho+0.177
 inspection+0.131
olph+0.126
keeping+0.123
 throats+0.119
Neg logit contributions
 best-0.229
 two-0.220
 exact-0.213
 biggest-0.212
amed-0.208
Token there
Token ID612
Feature activation+4.43e-01

Pos logit contributions
ho+0.256
 inspection+0.194
olph+0.179
ipop+0.154
 they+0.150
Neg logit contributions
ceed-0.320
 exact-0.315
amed-0.314
 teaches-0.313
 biggest-0.310
Token was
Token ID373
Feature activation-9.58e+00

Pos logit contributions
ho+0.013
 inspection+0.010
olph+0.010
keeping+0.009
 throats+0.009
Neg logit contributions
amed-0.009
 exact-0.009
 best-0.009
 biggest-0.009
ceed-0.009
Token her
Token ID607
Feature activation-4.32e+00

Pos logit contributions
ho+0.152
olph+0.121
anged+0.119
 inspection+0.117
round+0.116
Neg logit contributions
 two-0.114
 best-0.108
 exact-0.102
 biggest-0.099
 certainly-0.098
Token face
Token ID1986
Feature activation+8.73e-01

Pos logit contributions
 best+0.091
 two+0.086
 biggest+0.083
 exact+0.081
amed+0.077
Neg logit contributions
ho-0.131
 inspection-0.107
olph-0.105
keeping-0.100
round-0.096
Token.
Token ID13
Feature activation-1.58e+00

Pos logit contributions
ho+0.028
 inspection+0.023
olph+0.023
keeping+0.022
 throats+0.022
Neg logit contributions
 best-0.016
 two-0.016
 exact-0.014
 biggest-0.014
 certainly-0.013
Token<|endoftext|>
Token ID50256
Feature activation-3.77e+00

Pos logit contributions
 best+0.036
 exact+0.034
amed+0.033
 biggest+0.033
 two+0.032
Neg logit contributions
ho-0.042
 inspection-0.035
keeping-0.032
 Gig-0.032
olph-0.031
TokenOnce
Token ID7454
Feature activation+4.86e+00

Pos logit contributions
 best+0.075
 exact+0.073
 biggest+0.072
amed+0.071
 nic+0.070
Neg logit contributions
ho-0.109
 inspection-0.090
olph-0.090
keeping-0.087
 Gig-0.080
Token upon
Token ID2402
Feature activation-1.16e+01

Pos logit contributions
ho+0.150
 inspection+0.121
olph+0.120
keeping+0.118
 throats+0.116
Neg logit contributions
 best-0.093
 two-0.088
 exact-0.084
 biggest-0.084
amed-0.081
Token a
Token ID257
Feature activation-6.96e+00

Pos logit contributions
 best+0.205
amed+0.202
 exact+0.195
 teaches+0.195
 biggest+0.191
Neg logit contributions
ho-0.324
 inspection-0.295
 he-0.279
olph-0.273
keeping-0.270
Token time
Token ID640
Feature activation+8.27e+00

Pos logit contributions
 best+0.118
amed+0.117
 biggest+0.116
 two+0.114
 exact+0.113
Neg logit contributions
ho-0.220
 inspection-0.186
olph-0.177
 throats-0.166
round-0.163
Token,
Token ID11
Feature activation+1.18e+01

Pos logit contributions
ho+0.203
keeping+0.169
 throats+0.163
olph+0.157
round+0.155
Neg logit contributions
 best-0.212
 two-0.204
 biggest-0.184
 exact-0.182
 certainly-0.179
Token there
Token ID612
Feature activation+4.76e-01

Pos logit contributions
ho+0.298
 inspection+0.235
olph+0.224
keeping+0.202
 throats+0.200
Neg logit contributions
 best-0.288
 exact-0.276
 biggest-0.275
amed-0.273
 two-0.272
Token was
Token ID373
Feature activation-9.20e+00

Pos logit contributions
ho+0.015
 inspection+0.012
olph+0.012
keeping+0.012
 throats+0.011
Neg logit contributions
 best-0.009
 two-0.008
 exact-0.008
 biggest-0.008
amed-0.008
Token in
Token ID287
Feature activation+3.61e+00

Pos logit contributions
ho+0.121
 inspection+0.110
olph+0.106
anged+0.104
gie+0.098
Neg logit contributions
 best-0.087
 two-0.077
 most-0.069
 right-0.068
 biggest-0.068
Token moderation
Token ID34401
Feature activation-2.42e+00

Pos logit contributions
ho+0.098
 inspection+0.078
olph+0.075
keeping+0.072
 throats+0.071
Neg logit contributions
 best-0.083
 two-0.080
 exact-0.078
 biggest-0.077
amed-0.076
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
 best+0.043
 two+0.039
 biggest+0.036
 most+0.033
 exact+0.033
Neg logit contributions
ho-0.082
 inspection-0.071
olph-0.070
anged-0.066
aro-0.066
Token<|endoftext|>
Token ID50256
Feature activation-4.75e+00

Pos logit contributions
 best+0.027
 exact+0.026
amed+0.026
 biggest+0.025
 two+0.025
Neg logit contributions
ho-0.038
 inspection-0.031
 Gig-0.030
olph-0.030
keeping-0.029
TokenOnce
Token ID7454
Feature activation+4.29e+00

Pos logit contributions
 exact+0.115
 best+0.110
 biggest+0.109
 nic+0.108
amed+0.106
Neg logit contributions
ho-0.111
olph-0.095
keeping-0.092
 inspection-0.091
 he-0.082
Token upon
Token ID2402
Feature activation-1.21e+01

Pos logit contributions
ho+0.122
 inspection+0.099
olph+0.094
ipop+0.089
 throats+0.086
Neg logit contributions
 best-0.090
 exact-0.089
amed-0.087
 two-0.086
 biggest-0.086
Token a
Token ID257
Feature activation-7.33e+00

Pos logit contributions
 best+0.214
 exact+0.211
amed+0.204
 biggest+0.200
 teaches+0.195
Neg logit contributions
ho-0.342
 inspection-0.302
olph-0.296
 he-0.291
keeping-0.281
Token time
Token ID640
Feature activation+8.01e+00

Pos logit contributions
 best+0.124
amed+0.123
 biggest+0.123
 exact+0.122
 two+0.122
Neg logit contributions
ho-0.230
 inspection-0.193
olph-0.186
 throats-0.176
ipop-0.173
Token,
Token ID11
Feature activation+1.18e+01

Pos logit contributions
ho+0.182
 inspection+0.136
keeping+0.134
olph+0.132
 throats+0.129
Neg logit contributions
 best-0.222
 two-0.214
 biggest-0.202
 exact-0.202
amed-0.196
Token there
Token ID612
Feature activation+3.69e-01

Pos logit contributions
ho+0.261
 inspection+0.201
olph+0.191
ipop+0.163
 he+0.156
Neg logit contributions
 exact-0.309
ceed-0.306
amed-0.304
 best-0.303
 teaches-0.303
Token was
Token ID373
Feature activation-9.50e+00

Pos logit contributions
ho+0.011
 inspection+0.009
olph+0.009
keeping+0.008
 throats+0.008
Neg logit contributions
 best-0.007
 exact-0.007
amed-0.007
 biggest-0.007
 two-0.007
Token guitar
Token ID10047
Feature activation+2.40e+00

Pos logit contributions
 exact+0.133
amed+0.133
 best+0.132
 two+0.131
ceed+0.129
Neg logit contributions
ho-0.116
olph-0.088
 inspection-0.087
 throats-0.081
 yacht-0.079
Token for
Token ID329
Feature activation-3.69e+00

Pos logit contributions
ho+0.067
 inspection+0.055
olph+0.053
keeping+0.051
 throats+0.050
Neg logit contributions
 best-0.057
 two-0.053
 biggest-0.050
amed-0.049
 most-0.048
Token Lily
Token ID20037
Feature activation-3.34e+00

Pos logit contributions
 best+0.067
 two+0.065
 exact+0.062
 biggest+0.061
amed+0.059
Neg logit contributions
ho-0.119
 inspection-0.097
olph-0.095
keeping-0.091
 throats-0.091
Token and
Token ID290
Feature activation+1.40e+01

Pos logit contributions
 best+0.072
 two+0.064
 biggest+0.058
 most+0.057
 exact+0.056
Neg logit contributions
ho-0.101
 inspection-0.089
olph-0.082
 throats-0.081
keeping-0.080
Token they
Token ID484
Feature activation+1.75e+00

Pos logit contributions
ho+0.354
 inspection+0.292
round+0.284
 throats+0.278
 tongues+0.267
Neg logit contributions
 best-0.355
 two-0.347
 teaches-0.305
 most-0.295
amed-0.287
Token became
Token ID2627
Feature activation-1.07e+01

Pos logit contributions
ho+0.049
 inspection+0.040
olph+0.038
keeping+0.038
 throats+0.037
Neg logit contributions
 best-0.039
 two-0.037
amed-0.035
 biggest-0.035
 exact-0.035
Token good
Token ID922
Feature activation+1.25e+00

Pos logit contributions
 exact+0.135
amed+0.128
ceed+0.123
 certainly+0.119
 two+0.119
Neg logit contributions
ho-0.375
olph-0.312
 inspection-0.311
 Gig-0.303
keeping-0.301
Token friends
Token ID2460
Feature activation+4.31e+00

Pos logit contributions
ho+0.047
 inspection+0.040
olph+0.039
keeping+0.038
 throats+0.037
Neg logit contributions
 best-0.017
 two-0.015
 biggest-0.014
 exact-0.014
amed-0.014
Token.
Token ID13
Feature activation-1.64e+00

Pos logit contributions
ho+0.140
 inspection+0.118
olph+0.111
 throats+0.107
keeping+0.106
Neg logit contributions
 best-0.082
 two-0.076
 exact-0.067
 biggest-0.067
 certainly-0.066
Token<|endoftext|>
Token ID50256
Feature activation-4.91e+00

Pos logit contributions
 best+0.032
 exact+0.030
amed+0.030
 two+0.030
 biggest+0.029
Neg logit contributions
ho-0.049
 inspection-0.041
olph-0.039
keeping-0.038
 throats-0.037
TokenOnce
Token ID7454
Feature activation+4.00e+00

Pos logit contributions
 exact+0.114
 best+0.111
 biggest+0.109
 nic+0.108
amed+0.106
Neg logit contributions
ho-0.121
 inspection-0.101
olph-0.101
keeping-0.100
 he-0.089