TOP ACTIVATIONS
MAX = 16.134

get it ." Hel en was a bit scared
of her ! Hel en decided to use the
Hel ping others is the kind
its nest . Hel en felt sorry for the
found me ." Hel en smiled and gently placed
her . Hel en was very proud of
said her mum . " Hel ping others can be a
with me ?" Hel i landed next to Tim
our game ". Hel en was so happy she
much better . Hel en had worked so hard
be together . Hel en 's friends were very
marry him ?" Hel en smiled and said ,
can find !" Hel en opened the cabinet and
. " Hel en , that is not
beautiful land full of rain bows and unic orns .
animals and stars and rain bows and people and houses and
flowers , animals and rain bows . The little girl was
colorful , like bright rain bows . One day
gifted - it had rain bows , flowers , animals and
said Alice , " rain bows are the best !" <|endoftext|>

MIN ACTIVATIONS
MIN = -14.420

bite . The apple belonged to little Jack .
squirrel . She ate her apple and they continued on their
Lily wanted to taste the apple because it looked very y
They both enjoyed the tasty apple and thanked the tree .
my wanted to eat an apple , but he accidentally knocked
He said , " This apple looks y ummy !" Mom
He did not like the apple sauce . He wanted to
" Don 't eat the apple , it 's bad ."
The polite raven ate the apple and was soon feeling full
. She bit into the apple and it was so sweet
happy . He shared the apple with Mr . Squirrel .
on , Jimmy and the apple were always together . <|endoftext|>
juice . Amy had an apple . They ate their lunch
tree . He got the apple and gave it to Tim
wanted to look at the apple some more . His mom
ue sp its out the apple . She sp its out
pick a fresh , delicious apple from the tree in her
, Fl uffy ate the apple and it was very y
the kitchen and found an apple . She grabbed it ,
he got from the magical apple . The

INTERVAL 8.496 - 16.134
CONTAINS 0.116%

Kevin !" Kevin was very excited and ran
a lot of love and care . The zoo will take
The water was much cal mer now . Everyone was able
listen . He did not care . He said , "
my castle is ?" El iza guessed , " Is

INTERVAL 0.857 - 8.496
CONTAINS 31.724%

mirror , she would always feel very special because of how
. I 'm glad you like the rh in oc eros
Tim my smiled and they continued to play in the
The next day , the man saw the same bird again
happy to remember something so funny . <|endoftext|> Once upon a

INTERVAL -6.781 - 0.857
CONTAINS 67.180%

through the trees . What he saw was a great big
lonely because it didn 't have any bunny friends to play
, but right as she was about to take a bite
, thank you ! You have saved our town !"
He thinks they are more cool and calm .

INTERVAL -14.420 - -6.781
CONTAINS 0.980%

are a bad dragon . Go away ." Mia said ,
the hurt bird was all better and could fly again .
of things to do . Every day he looked at the
Sally said , " Let 's discuss what we can do
grass . It was Lily 's lost toy ! Lily was
Token get
Token ID651
Feature activation-3.79e+00

Pos logit contributions
bel+0.180
Fre+0.180
Hel+0.167
Kay+0.166
ora+0.165
Neg logit contributions
omm-0.164
 His-0.148
 DVD-0.145
 magnet-0.145
oscope-0.145
Token it
Token ID340
Feature activation+1.11e-01

Pos logit contributions
omm+0.260
 His+0.247
oscope+0.247
 x+0.244
 magnet+0.241
Neg logit contributions
bel-0.260
Fre-0.257
Hel-0.244
Kay-0.235
ora-0.233
Token."
Token ID526
Feature activation-2.59e+00

Pos logit contributions
bel+0.007
Fre+0.007
ies+0.007
Hel+0.006
ora+0.006
Neg logit contributions
omm-0.008
 x-0.007
 His-0.007
oscope-0.007
 DVD-0.007
Token
Token ID198
Feature activation+2.59e-01

Pos logit contributions
 His+0.131
penter+0.108
 magnet+0.105
 From+0.103
omm+0.102
Neg logit contributions
Fre-0.235
bel-0.235
uttering-0.226
ymes-0.208
Kay-0.208
Token
Token ID198
Feature activation+2.73e-01

Pos logit contributions
bel+0.026
uttering+0.025
Fre+0.024
ies+0.024
mons+0.023
Neg logit contributions
 His-0.010
 From-0.008
 teaches-0.008
penter-0.007
 x-0.007
TokenHel
Token ID12621
Feature activation+1.61e+01

Pos logit contributions
uttering+0.014
bel+0.014
mons+0.014
owers+0.014
ies+0.013
Neg logit contributions
-0.021
 teaches-0.021
penter-0.020
 His-0.020
 From-0.020
Tokenen
Token ID268
Feature activation+2.80e+00

Pos logit contributions
Fre+1.069
Hel+1.033
bel+1.022
ora+1.022
Kay+0.991
Neg logit contributions
omm-1.141
 microwave-1.124
 x-1.124
 His-1.111
 magnet-1.106
Token was
Token ID373
Feature activation-3.51e-01

Pos logit contributions
Fre+0.189
bel+0.186
Hel+0.178
Kay+0.174
ora+0.171
Neg logit contributions
omm-0.192
 His-0.189
oscope-0.188
 x-0.187
 magnet-0.184
Token a
Token ID257
Feature activation-7.28e-01

Pos logit contributions
 magnet+0.028
omm+0.028
icle+0.028
oscope+0.027
 x+0.027
Neg logit contributions
Fre-0.020
bel-0.018
Hel-0.017
Kay-0.017
uttering-0.016
Token bit
Token ID1643
Feature activation-5.37e-01

Pos logit contributions
omm+0.050
 x+0.049
oscope+0.049
 His+0.049
 magnet+0.047
Neg logit contributions
Fre-0.050
bel-0.049
Hel-0.047
Kay-0.046
ora-0.045
Token scared
Token ID12008
Feature activation+2.90e+00

Pos logit contributions
omm+0.035
 x+0.034
 His+0.034
oscope+0.034
 magnet+0.034
Neg logit contributions
Fre-0.038
bel-0.038
Hel-0.036
Kay-0.036
ora-0.035
Token of
Token ID286
Feature activation-3.21e+00

Pos logit contributions
Fre+0.112
bel+0.109
Hel+0.108
Kay+0.105
ora+0.104
Neg logit contributions
 His-0.046
omm-0.045
 magnet-0.043
 apple-0.042
 x-0.042
Token her
Token ID607
Feature activation+1.81e+00

Pos logit contributions
 His+0.190
omm+0.176
 x+0.175
 teaches+0.174
 apple+0.172
Neg logit contributions
bel-0.266
Fre-0.243
uttering-0.230
Hel-0.227
ora-0.226
Token!
Token ID0
Feature activation+2.28e+00

Pos logit contributions
bel+0.116
Fre+0.110
Hel+0.104
uttering+0.102
ies+0.101
Neg logit contributions
omm-0.133
oscope-0.129
 seventy-0.126
 x-0.125
 His-0.125
Token
Token ID198
Feature activation+4.06e-01

Pos logit contributions
Fre+0.163
Hel+0.157
Kay+0.155
bel+0.153
ora+0.150
Neg logit contributions
 x-0.153
oscope-0.152
omm-0.145
 magnet-0.144
 boo-0.143
Token
Token ID198
Feature activation+4.03e-01

Pos logit contributions
Fre+0.032
bel+0.032
Hel+0.031
Kay+0.030
ora+0.030
Neg logit contributions
omm-0.023
 x-0.022
 His-0.022
oscope-0.022
 magnet-0.022
TokenHel
Token ID12621
Feature activation+1.61e+01

Pos logit contributions
bel+0.029
Fre+0.028
ora+0.026
uttering+0.026
ies+0.026
Neg logit contributions
omm-0.025
 His-0.025
penter-0.024
 x-0.024
oscope-0.024
Tokenen
Token ID268
Feature activation+2.80e+00

Pos logit contributions
Fre+0.967
Hel+0.966
bel+0.940
ora+0.923
ies+0.909
Neg logit contributions
omm-1.201
 microwave-1.198
 His-1.189
 x-1.181
 magnet-1.175
Token decided
Token ID3066
Feature activation-5.50e-01

Pos logit contributions
Fre+0.194
bel+0.192
Hel+0.183
Kay+0.179
ora+0.177
Neg logit contributions
omm-0.187
 His-0.184
oscope-0.184
 x-0.182
 magnet-0.179
Token to
Token ID284
Feature activation+2.52e+00

Pos logit contributions
 His+0.037
 apple+0.035
omm+0.035
 x+0.035
 magnet+0.034
Neg logit contributions
Fre-0.038
bel-0.037
Hel-0.036
ora-0.036
uttering-0.035
Token use
Token ID779
Feature activation-6.51e-01

Pos logit contributions
Fre+0.170
bel+0.164
ora+0.161
Kay+0.159
Hel+0.159
Neg logit contributions
omm-0.176
 DVD-0.169
 microwave-0.163
oscope-0.161
 x-0.161
Token the
Token ID262
Feature activation+6.08e-01

Pos logit contributions
omm+0.043
 x+0.042
 His+0.042
oscope+0.042
 magnet+0.041
Neg logit contributions
Fre-0.046
bel-0.046
Hel-0.044
Kay-0.043
ora-0.042
Token
Token ID6184
Feature activation+1.67e-01

Pos logit contributions
Fre+0.064
Kay+0.060
Hel+0.060
Ag+0.060
bel+0.060
Neg logit contributions
 x-0.043
oscope-0.043
 magnet-0.040
omm-0.040
 boo-0.038
Token
Token ID95
Feature activation+3.76e+00

Pos logit contributions
Fre+0.011
bel+0.010
Hel+0.010
ora+0.010
Kay+0.010
Neg logit contributions
omm-0.012
oscope-0.012
 His-0.012
 x-0.012
 magnet-0.012
Token
Token ID26391
Feature activation+3.96e-01

Pos logit contributions
Fre+0.274
Hel+0.272
Kay+0.265
Ag+0.257
ora+0.256
Neg logit contributions
 magnet-0.218
 apple-0.212
 seventy-0.211
oll-0.210
 boo-0.210
Token
Token ID129
Feature activation+7.25e+00

Pos logit contributions
Fre+0.027
bel+0.027
ora+0.026
Hel+0.026
Kay+0.025
Neg logit contributions
omm-0.027
 His-0.026
oscope-0.026
 magnet-0.025
penter-0.025
Token
Token ID241
Feature activation+1.16e-03

Pos logit contributions
Fre+0.513
Kay+0.478
Hel+0.477
ora+0.469
Ag+0.466
Neg logit contributions
 magnet-0.506
omm-0.469
 seventy-0.461
 x-0.459
 microwave-0.458
TokenHel
Token ID12621
Feature activation+1.61e+01

Pos logit contributions
bel+0.000
Fre+0.000
ies+0.000
Hel+0.000
ora+0.000
Neg logit contributions
 His-0.000
oscope-0.000
 DVD-0.000
omm-0.000
icle-0.000
Tokenping
Token ID13886
Feature activation+5.43e-01

Pos logit contributions
Fre+1.125
ora+1.084
ies+1.041
Kay+1.001
Hel+0.987
Neg logit contributions
 magnet-1.082
 microwave-1.077
 bone-1.062
 His-1.058
 DVD-1.045
Token others
Token ID1854
Feature activation+5.57e+00

Pos logit contributions
Fre+0.041
bel+0.040
Hel+0.039
Kay+0.038
ora+0.037
Neg logit contributions
omm-0.033
 His-0.032
oscope-0.032
 x-0.032
 magnet-0.032
Token is
Token ID318
Feature activation-1.61e+00

Pos logit contributions
Fre+0.405
bel+0.400
Kay+0.380
Hel+0.378
Ag+0.373
Neg logit contributions
oscope-0.352
 magnet-0.349
 His-0.349
omm-0.345
 x-0.341
Token the
Token ID262
Feature activation+6.86e-01

Pos logit contributions
omm+0.105
 x+0.102
oscope+0.102
 His+0.100
 magnet+0.100
Neg logit contributions
Fre-0.117
bel-0.114
Hel-0.109
Kay-0.107
ora-0.106
Token kind
Token ID1611
Feature activation+9.48e-01

Pos logit contributions
Fre+0.050
bel+0.048
Hel+0.046
Kay+0.045
ora+0.045
Neg logit contributions
omm-0.044
 His-0.044
oscope-0.042
 x-0.042
penter-0.041
Token its
Token ID663
Feature activation-9.33e-01

Pos logit contributions
omm+0.213
 x+0.208
 His+0.208
oscope+0.208
 magnet+0.204
Neg logit contributions
Fre-0.235
bel-0.232
Hel-0.222
Kay-0.217
ora-0.215
Token nest
Token ID16343
Feature activation-2.45e+00

Pos logit contributions
omm+0.052
 x+0.051
oscope+0.050
 His+0.050
 magnet+0.049
Neg logit contributions
Fre-0.076
bel-0.075
Hel-0.072
Kay-0.071
ora-0.070
Token.
Token ID13
Feature activation+7.65e-01

Pos logit contributions
oscope+0.155
omm+0.153
 His+0.152
 x+0.151
 magnet+0.151
Neg logit contributions
Fre-0.179
bel-0.171
Hel-0.169
Kay-0.165
Ag-0.163
Token
Token ID198
Feature activation+3.91e-01

Pos logit contributions
Fre+0.062
Kay+0.061
Hel+0.060
Ag+0.058
bel+0.058
Neg logit contributions
oscope-0.045
 x-0.043
omm-0.040
 boo-0.040
 magnet-0.040
Token
Token ID198
Feature activation+4.08e-01

Pos logit contributions
Fre+0.029
bel+0.028
Hel+0.027
Kay+0.027
ora+0.026
Neg logit contributions
omm-0.025
oscope-0.024
 x-0.024
 His-0.024
 magnet-0.024
TokenHel
Token ID12621
Feature activation+1.60e+01

Pos logit contributions
Fre+0.025
bel+0.025
Hel+0.024
Kay+0.023
ora+0.023
Neg logit contributions
omm-0.031
 x-0.030
oscope-0.030
 His-0.030
 magnet-0.030
Tokenen
Token ID268
Feature activation+2.78e+00

Pos logit contributions
Fre+1.071
bel+1.050
Hel+1.047
Kay+1.010
ora+1.005
Neg logit contributions
omm-1.112
 His-1.093
 x-1.088
 microwave-1.086
 magnet-1.075
Token felt
Token ID2936
Feature activation+5.78e+00

Pos logit contributions
Fre+0.194
bel+0.188
Hel+0.177
uttering+0.176
Kay+0.175
Neg logit contributions
 His-0.186
oscope-0.184
omm-0.183
 x-0.177
penter-0.177
Token sorry
Token ID7926
Feature activation-4.80e+00

Pos logit contributions
Fre+0.387
bel+0.380
Hel+0.365
Kay+0.355
Ag+0.351
Neg logit contributions
omm-0.413
oscope-0.390
 x-0.388
 magnet-0.384
icle-0.383
Token for
Token ID329
Feature activation-3.35e+00

Pos logit contributions
omm+0.251
oscope+0.248
 x+0.240
 DVD+0.233
 microwave+0.233
Neg logit contributions
bel-0.402
Fre-0.392
Hel-0.387
Ag-0.372
Kay-0.372
Token the
Token ID262
Feature activation+7.23e-01

Pos logit contributions
omm+0.217
oscope+0.211
 His+0.211
 x+0.211
 magnet+0.207
Neg logit contributions
Fre-0.242
bel-0.239
Hel-0.229
Kay-0.225
ora-0.221
Token found
Token ID1043
Feature activation+2.57e+00

Pos logit contributions
Fre+0.098
bel+0.097
Hel+0.092
Kay+0.091
ora+0.089
Neg logit contributions
oscope-0.099
omm-0.098
 His-0.098
 x-0.097
 magnet-0.097
Token me
Token ID502
Feature activation-6.96e-01

Pos logit contributions
Fre+0.191
bel+0.187
Hel+0.182
Kay+0.178
ora+0.175
Neg logit contributions
omm-0.161
oscope-0.153
 His-0.152
 x-0.149
penter-0.148
Token."
Token ID526
Feature activation-2.65e+00

Pos logit contributions
omm+0.045
 x+0.043
 His+0.043
 magnet+0.042
oscope+0.042
Neg logit contributions
Fre-0.051
bel-0.051
Hel-0.048
ora-0.047
Kay-0.047
Token
Token ID198
Feature activation+3.72e-01

Pos logit contributions
omm+0.164
 x+0.160
oscope+0.159
 His+0.158
 magnet+0.156
Neg logit contributions
Fre-0.198
bel-0.195
Hel-0.188
Kay-0.184
ora-0.182
Token
Token ID198
Feature activation+3.63e-01

Pos logit contributions
Fre+0.029
bel+0.029
Hel+0.027
ora+0.027
Kay+0.027
Neg logit contributions
omm-0.022
 His-0.021
 x-0.021
oscope-0.021
 magnet-0.021
TokenHel
Token ID12621
Feature activation+1.60e+01

Pos logit contributions
bel+0.025
ora+0.024
Fre+0.024
uttering+0.024
ies+0.024
Neg logit contributions
penter-0.022
 His-0.022
 teaches-0.021
 magnet-0.021
oscope-0.021
Tokenen
Token ID268
Feature activation+2.66e+00

Pos logit contributions
Fre+1.036
Hel+1.024
ora+0.991
bel+0.977
Kay+0.967
Neg logit contributions
 microwave-1.157
 His-1.152
 x-1.132
omm-1.132
 bone-1.119
Token smiled
Token ID13541
Feature activation+1.34e+00

Pos logit contributions
Fre+0.179
bel+0.176
Hel+0.168
Kay+0.164
ora+0.162
Neg logit contributions
omm-0.183
 His-0.180
oscope-0.180
 x-0.178
 magnet-0.176
Token and
Token ID290
Feature activation-9.31e-01

Pos logit contributions
Fre+0.099
bel+0.099
Hel+0.095
Kay+0.092
ora+0.092
Neg logit contributions
omm-0.084
 x-0.083
oscope-0.083
 magnet-0.079
 His-0.079
Token gently
Token ID15165
Feature activation-3.88e+00

Pos logit contributions
omm+0.064
 x+0.063
oscope+0.062
 His+0.061
 magnet+0.061
Neg logit contributions
Fre-0.064
bel-0.063
Hel-0.060
Kay-0.059
ora-0.059
Token placed
Token ID4624
Feature activation-3.17e+00

Pos logit contributions
 His+0.237
oscope+0.233
omm+0.225
 x+0.225
penter+0.222
Neg logit contributions
Fre-0.297
bel-0.286
Hel-0.276
Kay-0.272
ora-0.269
Token her
Token ID607
Feature activation+1.61e+00

Pos logit contributions
 x+0.070
 His+0.070
oscope+0.068
 apple+0.067
omm+0.066
Neg logit contributions
bel-0.087
Fre-0.085
uttering-0.081
Ag-0.078
Hel-0.077
Token.
Token ID13
Feature activation+4.87e-01

Pos logit contributions
Fre+0.107
bel+0.106
Hel+0.100
Kay+0.098
ora+0.098
Neg logit contributions
omm-0.113
 x-0.110
 His-0.110
oscope-0.110
 magnet-0.108
Token
Token ID220
Feature activation-1.41e+00

Pos logit contributions
bel+0.042
Fre+0.042
uttering+0.039
ora+0.039
Kay+0.039
Neg logit contributions
 His-0.025
omm-0.024
 magnet-0.022
penter-0.022
 x-0.022
Token
Token ID198
Feature activation+1.13e-01

Pos logit contributions
oscope+0.097
omm+0.096
 x+0.096
 magnet+0.094
 boo+0.093
Neg logit contributions
Fre-0.095
bel-0.094
Hel-0.091
Kay-0.089
ora-0.087
Token
Token ID198
Feature activation+1.24e-01

Pos logit contributions
bel+0.011
ies+0.011
uttering+0.010
Fre+0.010
ora+0.010
Neg logit contributions
 His-0.004
 From-0.004
 teaches-0.004
omm-0.003
 He-0.003
TokenHel
Token ID12621
Feature activation+1.60e+01

Pos logit contributions
bel+0.007
ies+0.007
uttering+0.007
mons+0.007
owers+0.006
Neg logit contributions
 teaches-0.009
-0.009
 His-0.009
penter-0.009
 From-0.008
Tokenen
Token ID268
Feature activation+2.64e+00

Pos logit contributions
Fre+1.107
bel+1.077
Hel+1.066
ora+1.037
Kay+1.030
Neg logit contributions
omm-1.080
 x-1.063
 His-1.059
 magnet-1.051
 microwave-1.044
Token was
Token ID373
Feature activation-4.91e-01

Pos logit contributions
Fre+0.181
bel+0.179
Hel+0.171
Kay+0.167
ora+0.166
Neg logit contributions
omm-0.178
 x-0.174
 His-0.174
oscope-0.174
 magnet-0.171
Token very
Token ID845
Feature activation-1.97e-01

Pos logit contributions
omm+0.032
 x+0.031
 magnet+0.031
oscope+0.031
 microwave+0.030
Neg logit contributions
Fre-0.037
bel-0.035
Kay-0.034
Hel-0.034
Ag-0.032
Token proud
Token ID6613
Feature activation+8.75e-01

Pos logit contributions
 His+0.012
 boo+0.011
omm+0.011
 knee+0.011
 magnet+0.010
Neg logit contributions
Fre-0.015
Hel-0.015
bel-0.015
ora-0.014
Kay-0.014
Token of
Token ID286
Feature activation-3.41e+00

Pos logit contributions
Fre+0.081
bel+0.079
Hel+0.077
Kay+0.076
ora+0.075
Neg logit contributions
 His-0.040
omm-0.039
 x-0.037
 magnet-0.037
 bone-0.037
Token said
Token ID531
Feature activation+8.43e-01

Pos logit contributions
omm+0.046
 His+0.045
 x+0.045
oscope+0.045
 magnet+0.044
Neg logit contributions
Fre-0.041
bel-0.041
Hel-0.039
Kay-0.038
ora-0.037
Token her
Token ID607
Feature activation+1.52e+00

Pos logit contributions
bel+0.064
Fre+0.064
Hel+0.061
Kay+0.060
ora+0.059
Neg logit contributions
omm-0.051
oscope-0.049
 x-0.049
 His-0.048
 magnet-0.047
Token mum
Token ID25682
Feature activation-3.88e-01

Pos logit contributions
Fre+0.127
bel+0.125
Hel+0.122
Kay+0.120
ora+0.120
Neg logit contributions
oscope-0.079
omm-0.079
 x-0.077
 His-0.075
 microwave-0.074
Token.
Token ID13
Feature activation+4.00e-01

Pos logit contributions
omm+0.034
 x+0.032
 microwave+0.032
 DVD+0.032
 teaches+0.032
Neg logit contributions
bel-0.019
ies-0.017
Fre-0.017
Hel-0.017
Kay-0.017
Token "
Token ID366
Feature activation-3.22e+00

Pos logit contributions
Hel+0.030
Fre+0.030
bel+0.029
Kay+0.029
Ag+0.028
Neg logit contributions
 x-0.025
oscope-0.024
omm-0.023
icle-0.023
oll-0.023
TokenHel
Token ID12621
Feature activation+1.60e+01

Pos logit contributions
omm+0.257
oscope+0.252
 His+0.252
 x+0.252
 magnet+0.248
Neg logit contributions
Fre-0.182
bel-0.180
Hel-0.169
Kay-0.165
ora-0.163
Tokenping
Token ID13886
Feature activation+4.31e-01

Pos logit contributions
Fre+1.209
Hel+1.175
ora+1.147
Kay+1.131
bel+1.125
Neg logit contributions
 His-0.969
 microwave-0.959
 bone-0.943
 DVD-0.942
 magnet-0.934
Token others
Token ID1854
Feature activation+5.54e+00

Pos logit contributions
Fre+0.033
bel+0.032
Hel+0.031
Kay+0.030
ora+0.030
Neg logit contributions
omm-0.026
 His-0.025
oscope-0.025
 x-0.025
 magnet-0.025
Token can
Token ID460
Feature activation-9.19e-01

Pos logit contributions
Fre+0.420
bel+0.414
Kay+0.393
Hel+0.387
Ag+0.382
Neg logit contributions
oscope-0.337
 magnet-0.333
 His-0.330
omm-0.326
 boo-0.322
Token be
Token ID307
Feature activation+3.86e-01

Pos logit contributions
omm+0.051
oscope+0.050
 His+0.050
 x+0.050
 magnet+0.049
Neg logit contributions
Fre-0.074
bel-0.074
Hel-0.070
Kay-0.069
ora-0.068
Token a
Token ID257
Feature activation-7.23e-01

Pos logit contributions
Fre+0.030
bel+0.030
ora+0.028
Kay+0.028
Hel+0.028
Neg logit contributions
 His-0.022
oscope-0.022
 magnet-0.022
 x-0.022
 boo-0.021
Token with
Token ID351
Feature activation+6.76e-01

Pos logit contributions
omm+0.022
oscope+0.020
oll+0.020
 His+0.020
 microwave+0.020
Neg logit contributions
bel-0.015
Fre-0.014
Hel-0.014
Ag-0.013
Jay-0.013
Token me
Token ID502
Feature activation-7.87e-01

Pos logit contributions
Fre+0.056
bel+0.055
Hel+0.054
Kay+0.053
Ag+0.051
Neg logit contributions
omm-0.037
oscope-0.035
 x-0.034
 His-0.034
 magnet-0.034
Token?"
Token ID1701
Feature activation-3.42e+00

Pos logit contributions
omm+0.049
oscope+0.046
 His+0.045
 x+0.045
 microwave+0.044
Neg logit contributions
bel-0.060
Fre-0.058
Hel-0.056
ora-0.055
uttering-0.053
Token
Token ID198
Feature activation+4.92e-02

Pos logit contributions
omm+0.204
oscope+0.198
oll+0.196
 x+0.196
 microwave+0.190
Neg logit contributions
Fre-0.265
Hel-0.264
bel-0.263
Kay-0.259
Ag-0.249
Token
Token ID198
Feature activation+7.10e-02

Pos logit contributions
Fre+0.004
bel+0.004
ora+0.004
Hel+0.004
uttering+0.004
Neg logit contributions
 His-0.003
omm-0.003
 x-0.003
oscope-0.003
 magnet-0.003
TokenHel
Token ID12621
Feature activation+1.60e+01

Pos logit contributions
Fre+0.005
bel+0.005
ora+0.005
uttering+0.005
Hel+0.004
Neg logit contributions
 His-0.005
penter-0.005
omm-0.005
 x-0.005
oscope-0.004
Tokeni
Token ID72
Feature activation+4.59e+00

Pos logit contributions
Fre+0.955
ora+0.915
ies+0.862
Hel+0.856
Kay+0.850
Neg logit contributions
 magnet-1.266
 x-1.251
 DVD-1.250
 microwave-1.249
 bone-1.231
Token landed
Token ID11406
Feature activation-1.89e+00

Pos logit contributions
Fre+0.294
bel+0.291
Hel+0.280
Kay+0.272
ora+0.270
Neg logit contributions
omm-0.331
 x-0.325
oscope-0.321
 His-0.317
 magnet-0.314
Token next
Token ID1306
Feature activation-3.34e+00

Pos logit contributions
omm+0.124
 x+0.120
oscope+0.120
 His+0.120
 magnet+0.118
Neg logit contributions
Fre-0.136
bel-0.134
Hel-0.128
Kay-0.126
ora-0.124
Token to
Token ID284
Feature activation+2.39e+00

Pos logit contributions
omm+0.271
 x+0.265
oscope+0.265
 His+0.265
 magnet+0.261
Neg logit contributions
Fre-0.186
bel-0.184
Hel-0.173
Kay-0.168
ora-0.167
Token Tim
Token ID5045
Feature activation-3.05e+00

Pos logit contributions
Fre+0.201
bel+0.200
Hel+0.192
Kay+0.189
ora+0.187
Neg logit contributions
omm-0.125
 x-0.122
 His-0.121
oscope-0.121
 magnet-0.118
Token our
Token ID674
Feature activation-1.37e+00

Pos logit contributions
omm+0.204
oscope+0.197
 His+0.196
 x+0.194
 magnet+0.191
Neg logit contributions
Fre-0.256
bel-0.246
Hel-0.242
Kay-0.237
ora-0.235
Token game
Token ID983
Feature activation-5.65e-01

Pos logit contributions
omm+0.095
 x+0.093
 His+0.093
oscope+0.093
 magnet+0.092
Neg logit contributions
Fre-0.092
bel-0.091
Hel-0.087
Kay-0.085
ora-0.084
Token".
Token ID1911
Feature activation-2.13e+00

Pos logit contributions
omm+0.033
 x+0.032
oscope+0.032
 His+0.032
 magnet+0.031
Neg logit contributions
Fre-0.044
bel-0.044
Hel-0.042
Kay-0.041
ora-0.041
Token
Token ID198
Feature activation+3.09e-01

Pos logit contributions
 His+0.113
 From+0.088
 magnet+0.088
omm+0.087
 Your+0.086
Neg logit contributions
bel-0.195
Fre-0.191
uttering-0.179
ies-0.176
ora-0.174
Token
Token ID198
Feature activation+3.07e-01

Pos logit contributions
bel+0.027
Fre+0.027
ora+0.025
uttering+0.025
ies+0.025
Neg logit contributions
 His-0.015
omm-0.014
 x-0.014
 DVD-0.014
 magnet-0.013
TokenHel
Token ID12621
Feature activation+1.59e+01

Pos logit contributions
ies+0.014
bel+0.014
uttering+0.013
owers+0.012
mons+0.012
Neg logit contributions
 teaches-0.027
 His-0.026
-0.026
penter-0.025
 From-0.025
Tokenen
Token ID268
Feature activation+2.60e+00

Pos logit contributions
Fre+1.105
bel+1.074
Hel+1.072
ora+1.032
Kay+1.027
Neg logit contributions
omm-1.078
 x-1.062
 His-1.049
 magnet-1.037
 microwave-1.037
Token was
Token ID373
Feature activation-4.22e-01

Pos logit contributions
Fre+0.182
bel+0.179
Hel+0.170
Kay+0.167
ora+0.165
Neg logit contributions
omm-0.171
 His-0.169
oscope-0.168
 x-0.165
 magnet-0.164
Token so
Token ID523
Feature activation-1.33e+00

Pos logit contributions
 x+0.029
omm+0.028
 magnet+0.028
oscope+0.028
 microwave+0.027
Neg logit contributions
Fre-0.030
bel-0.029
Kay-0.028
Hel-0.027
Ag-0.026
Token happy
Token ID3772
Feature activation-7.50e-01

Pos logit contributions
 His+0.089
omm+0.089
 x+0.086
 magnet+0.085
oscope+0.084
Neg logit contributions
Fre-0.092
bel-0.091
Hel-0.088
ora-0.085
Kay-0.085
Token she
Token ID673
Feature activation+3.31e+00

Pos logit contributions
 His+0.037
omm+0.034
 bone+0.033
penter+0.033
 magnet+0.033
Neg logit contributions
Fre-0.068
bel-0.066
Hel-0.065
ora-0.062
Kay-0.062
Token much
Token ID881
Feature activation-2.09e+00

Pos logit contributions
Fre+0.271
bel+0.270
Hel+0.257
Kay+0.252
ora+0.250
Neg logit contributions
omm-0.231
oscope-0.225
 x-0.224
 His-0.223
 magnet-0.219
Token better
Token ID1365
Feature activation-7.17e+00

Pos logit contributions
omm+0.090
oscope+0.087
 His+0.086
 x+0.085
 magnet+0.081
Neg logit contributions
bel-0.194
Fre-0.194
Hel-0.186
Kay-0.184
ora-0.183
Token.
Token ID13
Feature activation+7.22e-01

Pos logit contributions
 knee+0.315
 apple+0.315
 bone+0.314
 magnet+0.305
oll+0.304
Neg logit contributions
Fre-0.686
Hel-0.664
Kay-0.637
bel-0.623
Jay-0.612
Token
Token ID198
Feature activation+2.98e-01

Pos logit contributions
bel+0.069
Fre+0.068
ora+0.063
ies+0.063
uttering+0.062
Neg logit contributions
 His-0.035
omm-0.029
 Your-0.028
 From-0.028
 magnet-0.027
Token
Token ID198
Feature activation+3.04e-01

Pos logit contributions
bel+0.029
Fre+0.027
ies+0.026
ora+0.026
uttering+0.026
Neg logit contributions
 His-0.013
 x-0.011
omm-0.011
 From-0.010
 teaches-0.010
TokenHel
Token ID12621
Feature activation+1.59e+01

Pos logit contributions
bel+0.019
ies+0.018
mons+0.017
owers+0.017
uttering+0.017
Neg logit contributions
-0.021
 His-0.021
 teaches-0.020
 From-0.020
penter-0.020
Tokenen
Token ID268
Feature activation+2.60e+00

Pos logit contributions
Fre+1.121
bel+1.105
Hel+1.076
Kay+1.045
ora+1.042
Neg logit contributions
omm-1.058
 His-1.036
 x-1.031
 magnet-1.015
oscope-1.010
Token had
Token ID550
Feature activation-4.96e-01

Pos logit contributions
Fre+0.195
bel+0.189
Hel+0.180
uttering+0.180
Kay+0.178
Neg logit contributions
oscope-0.160
 His-0.159
omm-0.159
 x-0.154
penter-0.154
Token worked
Token ID3111
Feature activation-8.12e-02

Pos logit contributions
 His+0.027
omm+0.027
 x+0.026
oscope+0.026
 magnet+0.026
Neg logit contributions
bel-0.041
Fre-0.041
ora-0.039
Hel-0.038
uttering-0.038
Token so
Token ID523
Feature activation-1.35e+00

Pos logit contributions
 x+0.004
 magnet+0.004
 His+0.004
omm+0.004
 bone+0.004
Neg logit contributions
Fre-0.007
ora-0.007
bel-0.007
Hel-0.007
Kay-0.007
Token hard
Token ID1327
Feature activation-2.59e-01

Pos logit contributions
 His+0.080
 x+0.078
omm+0.078
 magnet+0.076
 apple+0.075
Neg logit contributions
Fre-0.105
bel-0.103
ora-0.098
Hel-0.097
Kay-0.097
Token be
Token ID307
Feature activation+2.57e-01

Pos logit contributions
Fre+0.187
bel+0.185
Hel+0.178
Kay+0.176
ora+0.174
Neg logit contributions
omm-0.130
 x-0.125
oscope-0.125
 His-0.125
 magnet-0.122
Token together
Token ID1978
Feature activation+4.42e+00

Pos logit contributions
Fre+0.021
bel+0.019
ora+0.019
Hel+0.019
Kay+0.019
Neg logit contributions
 His-0.014
 x-0.014
penter-0.014
oscope-0.014
 magnet-0.014
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
Fre+0.324
bel+0.316
Hel+0.305
Kay+0.301
ora+0.298
Neg logit contributions
 His-0.274
oscope-0.273
 x-0.270
omm-0.268
 magnet-0.268
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
bel+0.047
Fre+0.046
uttering+0.043
ymes+0.043
ora+0.042
Neg logit contributions
 His-0.025
omm-0.020
 Your-0.019
 From-0.019
 magnet-0.019
Token
Token ID198
Feature activation+2.16e-01

Pos logit contributions
bel+0.022
Fre+0.020
uttering+0.020
ies+0.020
ymes+0.020
Neg logit contributions
 His-0.009
 From-0.006
 He-0.006
omm-0.006
 teaches-0.006
TokenHel
Token ID12621
Feature activation+1.59e+01

Pos logit contributions
bel+0.014
owers+0.013
ymes+0.013
ies+0.013
uttering+0.013
Neg logit contributions
 His-0.015
-0.014
 From-0.013
penter-0.013
 teaches-0.013
Tokenen
Token ID268
Feature activation+2.54e+00

Pos logit contributions
Fre+1.159
bel+1.145
Hel+1.099
Kay+1.076
ora+1.068
Neg logit contributions
omm-1.016
 x-0.992
 His-0.988
oscope-0.985
 magnet-0.970
Token's
Token ID338
Feature activation-7.32e+00

Pos logit contributions
Fre+0.176
bel+0.174
uttering+0.162
Hel+0.159
Ag+0.157
Neg logit contributions
 His-0.175
oscope-0.165
omm-0.164
penter-0.163
 boo-0.156
Token friends
Token ID2460
Feature activation+4.18e+00

Pos logit contributions
omm+0.508
oscope+0.500
 x+0.497
 His+0.494
 magnet+0.486
Neg logit contributions
Fre-0.490
bel-0.487
Hel-0.461
Kay-0.453
ora-0.450
Token were
Token ID547
Feature activation+5.13e+00

Pos logit contributions
bel+0.295
Fre+0.282
uttering+0.259
 common+0.256
ora+0.254
Neg logit contributions
 His-0.290
penter-0.274
oscope-0.270
 magnet-0.269
 boo-0.267
Token very
Token ID845
Feature activation-1.90e-01

Pos logit contributions
Fre+0.380
bel+0.379
Hel+0.360
ora+0.353
Kay+0.352
Neg logit contributions
omm-0.320
 x-0.313
 His-0.313
oscope-0.309
 magnet-0.302
Token marry
Token ID12479
Feature activation+1.45e+00

Pos logit contributions
Fre+0.099
bel+0.098
Hel+0.094
Kay+0.092
ora+0.092
Neg logit contributions
omm-0.081
 x-0.077
oscope-0.077
 His-0.077
 microwave-0.075
Token him
Token ID683
Feature activation-6.55e-01

Pos logit contributions
Fre+0.127
bel+0.124
uttering+0.121
Ag+0.119
Kay+0.119
Neg logit contributions
 His-0.071
 x-0.070
oscope-0.069
omm-0.068
 magnet-0.067
Token?"
Token ID1701
Feature activation-3.38e+00

Pos logit contributions
omm+0.039
oscope+0.038
 His+0.037
 x+0.037
 microwave+0.036
Neg logit contributions
Fre-0.050
bel-0.049
Hel-0.046
ora-0.046
Kay-0.044
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
 His+0.207
 From+0.156
 He+0.156
 It+0.151
 Your+0.149
Neg logit contributions
bel-0.305
uttering-0.290
Fre-0.287
ymes-0.280
as-0.262
Token
Token ID198
Feature activation+1.58e-01

Pos logit contributions
bel+0.017
ies+0.016
uttering+0.016
ymes+0.016
bows+0.015
Neg logit contributions
 His-0.006
 From-0.004
 He-0.004
 It-0.003
 Your-0.003
TokenHel
Token ID12621
Feature activation+1.58e+01

Pos logit contributions
bel+0.008
uttering+0.008
owers+0.008
ies+0.008
mons+0.007
Neg logit contributions
 His-0.013
-0.013
 From-0.012
penter-0.012
 teaches-0.012
Tokenen
Token ID268
Feature activation+2.49e+00

Pos logit contributions
Fre+1.095
bel+1.060
Hel+1.053
ora+1.024
Kay+1.015
Neg logit contributions
omm-1.069
 x-1.051
 His-1.046
 magnet-1.032
 microwave-1.030
Token smiled
Token ID13541
Feature activation+1.13e+00

Pos logit contributions
Fre+0.181
bel+0.179
uttering+0.166
Hel+0.165
Kay+0.162
Neg logit contributions
 His-0.163
omm-0.156
oscope-0.153
penter-0.151
 magnet-0.148
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
Fre+0.091
bel+0.090
Hel+0.086
Kay+0.085
ora+0.084
Neg logit contributions
omm-0.065
 His-0.063
 x-0.062
oscope-0.062
 magnet-0.061
Token said
Token ID531
Feature activation+7.81e-01

Pos logit contributions
 His+0.082
omm+0.077
penter+0.076
 magnet+0.075
 x+0.075
Neg logit contributions
Fre-0.088
bel-0.087
Hel-0.082
uttering-0.081
Kay-0.079
Token,
Token ID11
Feature activation+1.41e+00

Pos logit contributions
Fre+0.060
bel+0.059
Hel+0.057
Kay+0.056
ora+0.056
Neg logit contributions
 His-0.048
omm-0.046
 magnet-0.044
 seventy-0.044
 x-0.043
Token can
Token ID460
Feature activation-1.26e+00

Pos logit contributions
 magnet+0.023
oscope+0.022
 x+0.022
 His+0.022
omm+0.022
Neg logit contributions
Fre-0.037
bel-0.036
Kay-0.035
Hel-0.034
ora-0.034
Token find
Token ID1064
Feature activation-8.04e-01

Pos logit contributions
omm+0.072
 x+0.071
 His+0.071
 magnet+0.071
oscope+0.069
Neg logit contributions
Fre-0.099
bel-0.096
Hel-0.095
Kay-0.093
Ag-0.091
Token!"
Token ID2474
Feature activation-5.99e-01

Pos logit contributions
omm+0.042
 x+0.042
oscope+0.041
 His+0.041
 magnet+0.041
Neg logit contributions
Fre-0.067
bel-0.065
Hel-0.064
Kay-0.063
ora-0.062
Token
Token ID198
Feature activation+7.42e-02

Pos logit contributions
 His+0.024
 That+0.021
 From+0.021
 Your+0.020
 It+0.019
Neg logit contributions
bel-0.062
Fre-0.059
uttering-0.057
as-0.053
ies-0.052
Token
Token ID198
Feature activation+6.94e-02

Pos logit contributions
bel+0.008
uttering+0.008
mons+0.008
Fre+0.007
ies+0.007
Neg logit contributions
 His-0.002
 From-0.002
 teaches-0.002
-0.002
 It-0.002
TokenHel
Token ID12621
Feature activation+1.58e+01

Pos logit contributions
bel+0.004
uttering+0.004
mons+0.003
owers+0.003
Fre+0.003
Neg logit contributions
-0.006
 teaches-0.005
's-0.005
 From-0.005
 His-0.005
Tokenen
Token ID268
Feature activation+2.44e+00

Pos logit contributions
Fre+1.136
bel+1.122
Hel+1.079
Kay+1.057
ora+1.048
Neg logit contributions
omm-1.021
 x-0.996
 His-0.995
oscope-0.990
 magnet-0.976
Token opened
Token ID4721
Feature activation-2.19e+00

Pos logit contributions
Fre+0.173
bel+0.165
Hel+0.157
uttering+0.156
Ag+0.154
Neg logit contributions
 His-0.165
omm-0.163
oscope-0.160
penter-0.156
 x-0.155
Token the
Token ID262
Feature activation+1.94e-01

Pos logit contributions
 His+0.128
omm+0.128
 x+0.126
 magnet+0.126
 apple+0.123
Neg logit contributions
Fre-0.169
bel-0.169
Hel-0.161
Ag-0.157
Kay-0.156
Token cabinet
Token ID13447
Feature activation-1.27e+00

Pos logit contributions
Fre+0.017
bel+0.017
Hel+0.016
ora+0.016
Kay+0.016
Neg logit contributions
omm-0.010
oscope-0.009
 His-0.009
 x-0.009
penter-0.009
Token and
Token ID290
Feature activation-1.32e+00

Pos logit contributions
omm+0.078
oscope+0.076
 x+0.076
 His+0.076
 magnet+0.074
Neg logit contributions
bel-0.095
Fre-0.095
Hel-0.090
Kay-0.088
ora-0.088
Token.
Token ID13
Feature activation+4.03e-01

Pos logit contributions
omm+0.247
oscope+0.241
 x+0.231
 microwave+0.229
penter+0.227
Neg logit contributions
bel-0.217
ora-0.215
Fre-0.213
Kay-0.207
ies-0.201
Token
Token ID220
Feature activation-1.51e+00

Pos logit contributions
bel+0.038
Fre+0.038
Hel+0.035
ora+0.035
Kay+0.035
Neg logit contributions
 His-0.018
omm-0.018
 x-0.017
oscope-0.016
 magnet-0.016
Token
Token ID198
Feature activation+9.41e-02

Pos logit contributions
oscope+0.107
omm+0.105
 x+0.104
 magnet+0.103
penter+0.101
Neg logit contributions
Fre-0.100
bel-0.098
Hel-0.096
Kay-0.094
ora-0.091
Token
Token ID198
Feature activation+1.01e-01

Pos logit contributions
bel+0.008
Fre+0.008
ora+0.008
ies+0.008
uttering+0.008
Neg logit contributions
 His-0.005
omm-0.004
 x-0.004
 DVD-0.004
oscope-0.004
Token"
Token ID1
Feature activation-1.94e+00

Pos logit contributions
bel+0.007
Fre+0.007
ies+0.007
ora+0.007
uttering+0.006
Neg logit contributions
 His-0.006
 teaches-0.006
penter-0.006
-0.006
 Your-0.006
TokenHel
Token ID12621
Feature activation+1.58e+01

Pos logit contributions
omm+0.152
 x+0.149
 His+0.149
oscope+0.149
 magnet+0.147
Neg logit contributions
Fre-0.113
bel-0.112
Hel-0.106
Kay-0.103
ora-0.102
Tokenen
Token ID268
Feature activation+2.40e+00

Pos logit contributions
Fre+1.037
ora+0.985
Hel+0.979
ies+0.965
Kay+0.964
Neg logit contributions
 microwave-1.138
 His-1.120
 DVD-1.116
 magnet-1.104
 bone-1.084
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
Fre+0.166
bel+0.166
ora+0.156
Hel+0.156
Kay+0.153
Neg logit contributions
omm-0.159
 x-0.156
 His-0.154
oscope-0.154
 magnet-0.150
Token that
Token ID326
Feature activation-1.48e+00

Pos logit contributions
Fre+0.092
bel+0.085
Kay+0.083
ora+0.083
Hel+0.083
Neg logit contributions
omm-0.093
penter-0.090
 His-0.090
oscope-0.089
 DVD-0.087
Token is
Token ID318
Feature activation-2.04e+00

Pos logit contributions
omm+0.086
penter+0.079
 trailer+0.078
 His+0.077
 x+0.077
Neg logit contributions
Fre-0.113
bel-0.110
ies-0.108
Hel-0.107
ora-0.107
Token not
Token ID407
Feature activation-4.58e-01

Pos logit contributions
oscope+0.165
 His+0.151
omm+0.150
 DVD+0.150
penter+0.149
Neg logit contributions
Fre-0.119
 common-0.108
bel-0.106
Jud-0.103
Kay-0.103
Token beautiful
Token ID4950
Feature activation+5.00e+00

Pos logit contributions
 x+0.043
 DVD+0.042
 magnet+0.042
 microwave+0.041
 bone+0.041
Neg logit contributions
bel-0.044
Fre-0.044
ora-0.042
Hel-0.041
Kay-0.041
Token land
Token ID1956
Feature activation+6.45e+00

Pos logit contributions
bel+0.361
Fre+0.357
Hel+0.323
Kay+0.322
Ag+0.319
Neg logit contributions
 magnet-0.337
 x-0.323
 apple-0.322
 His-0.318
 bone-0.316
Token full
Token ID1336
Feature activation+5.15e-03

Pos logit contributions
Fre+0.505
Hel+0.474
bel+0.464
Kay+0.461
Ag+0.458
Neg logit contributions
oscope-0.378
penter-0.377
 DVD-0.365
 His-0.364
 x-0.364
Token of
Token ID286
Feature activation-3.21e+00

Pos logit contributions
Fre+0.000
bel+0.000
Hel+0.000
Kay+0.000
ora+0.000
Neg logit contributions
omm-0.000
oscope-0.000
 x-0.000
 His-0.000
 magnet-0.000
Token rain
Token ID6290
Feature activation+2.64e+00

Pos logit contributions
 x+0.219
 apple+0.218
 magnet+0.213
 His+0.210
 bone+0.209
Neg logit contributions
bel-0.223
Fre-0.218
Hel-0.202
Kay-0.202
ora-0.200
Tokenbows
Token ID25435
Feature activation+1.49e+01

Pos logit contributions
Fre+0.200
Hel+0.178
Jay+0.177
Kay+0.175
Jud+0.173
Neg logit contributions
oscope-0.186
penter-0.177
 DVD-0.169
 bone-0.168
 x-0.167
Token and
Token ID290
Feature activation-8.91e-01

Pos logit contributions
Fre+1.226
bel+1.180
Hel+1.153
Kay+1.149
ora+1.116
Neg logit contributions
 magnet-0.824
oscope-0.823
 DVD-0.820
 apple-0.813
penter-0.807
Token unic
Token ID28000
Feature activation-8.55e-01

Pos logit contributions
oscope+0.060
 apple+0.060
 x+0.060
 magnet+0.060
 boo+0.059
Neg logit contributions
Fre-0.060
uttering-0.058
ora-0.058
bel-0.056
Kay-0.055
Tokenorns
Token ID19942
Feature activation+3.47e+00

Pos logit contributions
oscope+0.044
oll+0.043
 His+0.041
penter+0.040
 teaches+0.040
Neg logit contributions
bel-0.075
Fre-0.071
uttering-0.065
Kay-0.065
Ag-0.063
Token.
Token ID13
Feature activation+6.11e-01

Pos logit contributions
Fre+0.295
Kay+0.274
bel+0.273
Hel+0.270
Jay+0.268
Neg logit contributions
oscope-0.192
 bone-0.180
penter-0.179
 magnet-0.175
 apple-0.175
Token
Token ID198
Feature activation+4.13e-01

Pos logit contributions
Hel+0.041
Fre+0.041
Kay+0.040
Ag+0.040
bel+0.040
Neg logit contributions
oscope-0.044
 x-0.042
omm-0.040
 magnet-0.040
 boo-0.040
Token animals
Token ID4695
Feature activation+5.60e+00

Pos logit contributions
oscope+0.057
 apple+0.057
 x+0.056
 magnet+0.056
 boo+0.055
Neg logit contributions
Fre-0.054
uttering-0.051
Kay-0.051
ora-0.050
Ag-0.050
Token and
Token ID290
Feature activation-8.11e-01

Pos logit contributions
Fre+0.451
bel+0.433
Kay+0.425
Ag+0.416
Hel+0.414
Neg logit contributions
oscope-0.324
 magnet-0.320
 DVD-0.314
 His-0.312
penter-0.310
Token stars
Token ID5788
Feature activation+4.99e+00

Pos logit contributions
oscope+0.057
 magnet+0.056
 apple+0.056
 x+0.056
 boo+0.055
Neg logit contributions
Fre-0.057
uttering-0.053
Kay-0.053
ora-0.052
Ag-0.051
Token and
Token ID290
Feature activation-8.13e-01

Pos logit contributions
Fre+0.399
bel+0.381
Kay+0.377
Hel+0.369
Ag+0.367
Neg logit contributions
oscope-0.288
 magnet-0.285
 DVD-0.284
 His-0.280
 apple-0.278
Token rain
Token ID6290
Feature activation+2.66e+00

Pos logit contributions
oscope+0.056
 apple+0.055
 x+0.055
 magnet+0.055
 boo+0.054
Neg logit contributions
Fre-0.058
uttering-0.054
Kay-0.053
ora-0.053
Jay-0.052
Tokenbows
Token ID25435
Feature activation+1.48e+01

Pos logit contributions
Fre+0.140
Jay+0.113
Kay+0.109
Jud+0.109
Hel+0.107
Neg logit contributions
oscope-0.252
penter-0.242
 DVD-0.239
 x-0.235
icle-0.234
Token and
Token ID290
Feature activation-8.28e-01

Pos logit contributions
Fre+1.224
bel+1.152
Kay+1.138
Hel+1.127
Ag+1.114
Neg logit contributions
oscope-0.842
 DVD-0.837
 magnet-0.832
 apple-0.824
penter-0.809
Token people
Token ID661
Feature activation+7.44e+00

Pos logit contributions
oscope+0.057
 x+0.057
 apple+0.057
 magnet+0.056
 boo+0.056
Neg logit contributions
Fre-0.058
uttering-0.055
ora-0.054
Kay-0.054
bel-0.053
Token and
Token ID290
Feature activation-8.35e-01

Pos logit contributions
Fre+0.603
bel+0.558
Kay+0.554
ora+0.550
Ag+0.548
Neg logit contributions
oscope-0.443
 DVD-0.433
 magnet-0.426
 His-0.419
 x-0.414
Token houses
Token ID7777
Feature activation+7.12e+00

Pos logit contributions
 magnet+0.057
oscope+0.057
 boo+0.057
 x+0.056
 apple+0.056
Neg logit contributions
Fre-0.060
uttering-0.056
Kay-0.056
ora-0.055
Jay-0.054
Token and
Token ID290
Feature activation-8.21e-01

Pos logit contributions
Fre+0.625
ora+0.574
Kay+0.568
Hel+0.566
Ag+0.564
Neg logit contributions
oscope-0.378
 magnet-0.364
penter-0.361
 DVD-0.361
 x-0.359
Token flowers
Token ID12734
Feature activation+6.27e+00

Pos logit contributions
 magnet+0.205
 x+0.203
 apple+0.197
 His+0.195
 bone+0.193
Neg logit contributions
Fre-0.235
bel-0.227
ora-0.219
uttering-0.216
Hel-0.212
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
Fre+0.556
bel+0.537
Hel+0.525
Kay+0.522
ora+0.511
Neg logit contributions
 magnet-0.306
penter-0.298
 His-0.296
oscope-0.296
 apple-0.295
Token animals
Token ID4695
Feature activation+5.44e+00

Pos logit contributions
Fre+0.107
bel+0.104
ora+0.101
uttering+0.099
Hel+0.098
Neg logit contributions
 x-0.111
 magnet-0.111
oscope-0.110
omm-0.110
 His-0.109
Token and
Token ID290
Feature activation-9.70e-01

Pos logit contributions
Fre+0.481
bel+0.464
Kay+0.457
ora+0.454
Hel+0.448
Neg logit contributions
oscope-0.263
 magnet-0.259
penter-0.252
 His-0.251
 bone-0.245
Token rain
Token ID6290
Feature activation+2.58e+00

Pos logit contributions
 His+0.062
 x+0.061
 magnet+0.061
oscope+0.061
 apple+0.061
Neg logit contributions
Fre-0.070
uttering-0.067
ora-0.066
bel-0.066
Hel-0.064
Tokenbows
Token ID25435
Feature activation+1.48e+01

Pos logit contributions
Fre+0.134
Hel+0.105
ora+0.104
Jay+0.101
Jud+0.099
Neg logit contributions
oscope-0.242
penter-0.239
icle-0.233
omm-0.231
 DVD-0.226
Token.
Token ID13
Feature activation+6.59e-01

Pos logit contributions
Fre+1.287
Hel+1.193
Kay+1.171
ora+1.170
bel+1.167
Neg logit contributions
 magnet-0.785
 DVD-0.783
oscope-0.776
 apple-0.774
penter-0.768
Token The
Token ID383
Feature activation-2.09e+00

Pos logit contributions
Fre+0.055
bel+0.054
Hel+0.053
Kay+0.052
ora+0.051
Neg logit contributions
oscope-0.035
 x-0.035
omm-0.034
 magnet-0.034
 boo-0.033
Token little
Token ID1310
Feature activation-1.32e+00

Pos logit contributions
omm+0.125
 x+0.124
oscope+0.122
 magnet+0.120
 His+0.119
Neg logit contributions
Fre-0.164
bel-0.161
Hel-0.154
Kay-0.152
ora-0.150
Token girl
Token ID2576
Feature activation+3.79e+00

Pos logit contributions
 apple+0.082
 magnet+0.079
 x+0.078
 knee+0.077
 boo+0.077
Neg logit contributions
Fre-0.105
bel-0.104
Hel-0.098
Kay-0.096
Ag-0.096
Token was
Token ID373
Feature activation-3.56e-01

Pos logit contributions
Fre+0.277
bel+0.272
Hel+0.260
Kay+0.256
ora+0.254
Neg logit contributions
omm-0.241
 His-0.236
oscope-0.236
 x-0.234
 magnet-0.230
Token colorful
Token ID20239
Feature activation+4.20e+00

Pos logit contributions
omm+0.050
oll+0.049
 His+0.049
 apple+0.047
 boo+0.047
Neg logit contributions
Hel-0.095
 perspective-0.095
Fre-0.094
Jay-0.090
Jud-0.089
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
Fre+0.382
bel+0.373
Hel+0.371
Kay+0.360
ora+0.351
Neg logit contributions
 bone-0.197
omm-0.195
 apple-0.193
 magnet-0.192
 x-0.191
Token like
Token ID588
Feature activation+3.70e+00

Pos logit contributions
Fre+0.129
bel+0.125
Hel+0.123
ora+0.121
Kay+0.119
Neg logit contributions
 apple-0.104
omm-0.103
 x-0.103
 His-0.103
 magnet-0.102
Token bright
Token ID6016
Feature activation+6.62e+00

Pos logit contributions
Fre+0.279
bel+0.272
ora+0.268
Hel+0.265
uttering+0.257
Neg logit contributions
omm-0.222
 His-0.221
 x-0.221
 apple-0.220
 magnet-0.218
Token rain
Token ID6290
Feature activation+2.62e+00

Pos logit contributions
Fre+0.543
bel+0.534
Hel+0.505
Kay+0.503
uttering+0.500
Neg logit contributions
 apple-0.381
 magnet-0.380
omm-0.370
 bone-0.366
 DVD-0.366
Tokenbows
Token ID25435
Feature activation+1.48e+01

Pos logit contributions
Fre+0.127
Hel+0.109
bel+0.106
Kay+0.103
Jay+0.102
Neg logit contributions
oscope-0.244
omm-0.242
penter-0.241
icle-0.238
 bone-0.237
Token.
Token ID13
Feature activation+8.80e-01

Pos logit contributions
Fre+1.352
bel+1.302
Hel+1.280
Kay+1.265
Jay+1.235
Neg logit contributions
 apple-0.723
 magnet-0.702
 bone-0.692
 DVD-0.692
oscope-0.681
Token
Token ID198
Feature activation+4.33e-01

Pos logit contributions
Fre+0.063
Hel+0.061
bel+0.060
Kay+0.059
ora+0.058
Neg logit contributions
 x-0.057
oscope-0.057
omm-0.057
 magnet-0.055
penter-0.054
Token
Token ID198
Feature activation+4.28e-01

Pos logit contributions
bel+0.037
Fre+0.037
Hel+0.034
ora+0.034
Ag+0.034
Neg logit contributions
 His-0.022
omm-0.021
 x-0.021
 DVD-0.020
oscope-0.020
TokenOne
Token ID3198
Feature activation+1.94e-01

Pos logit contributions
bel+0.034
Fre+0.031
ies+0.030
ora+0.029
uttering+0.029
Neg logit contributions
omm-0.025
 His-0.025
 x-0.025
oscope-0.025
penter-0.024
Token day
Token ID1110
Feature activation-1.75e+00

Pos logit contributions
Fre+0.013
bel+0.013
Hel+0.012
Kay+0.012
ora+0.012
Neg logit contributions
omm-0.014
oscope-0.014
 x-0.014
 His-0.014
 magnet-0.013
Token gifted
Token ID22527
Feature activation-5.22e-02

Pos logit contributions
omm+0.022
oscope+0.021
 x+0.021
 His+0.020
 magnet+0.020
Neg logit contributions
bel-0.018
Fre-0.018
Hel-0.016
Kay-0.016
uttering-0.016
Token -
Token ID532
Feature activation+1.47e+00

Pos logit contributions
 His+0.003
 bone+0.003
omm+0.003
 magnet+0.003
 x+0.003
Neg logit contributions
Fre-0.004
bel-0.004
Hel-0.004
Kay-0.004
Jud-0.004
Token it
Token ID340
Feature activation+1.94e-01

Pos logit contributions
Fre+0.092
bel+0.090
Hel+0.086
Kay+0.084
ora+0.082
Neg logit contributions
omm-0.110
oscope-0.107
 x-0.106
 His-0.106
 magnet-0.104
Token had
Token ID550
Feature activation-2.82e-01

Pos logit contributions
Fre+0.015
bel+0.015
Kay+0.014
Hel+0.014
ora+0.014
Neg logit contributions
 His-0.011
oscope-0.011
omm-0.011
 magnet-0.011
 x-0.011
Token rain
Token ID6290
Feature activation+2.58e+00

Pos logit contributions
 His+0.017
 apple+0.017
 x+0.016
oscope+0.016
 bone+0.016
Neg logit contributions
bel-0.021
Fre-0.021
ora-0.021
uttering-0.020
Jay-0.020
Tokenbows
Token ID25435
Feature activation+1.48e+01

Pos logit contributions
Fre+0.197
Hel+0.176
ora+0.173
Kay+0.171
Jay+0.171
Neg logit contributions
oscope-0.171
penter-0.165
 bone-0.164
 x-0.163
 trailer-0.157
Token,
Token ID11
Feature activation+1.65e+00

Pos logit contributions
Fre+1.252
bel+1.195
Hel+1.190
Kay+1.174
ora+1.153
Neg logit contributions
 magnet-0.790
oscope-0.782
 apple-0.782
 bone-0.776
 DVD-0.762
Token flowers
Token ID12734
Feature activation+6.39e+00

Pos logit contributions
Fre+0.109
bel+0.106
uttering+0.104
Hel+0.103
ora+0.103
Neg logit contributions
 apple-0.115
oscope-0.115
 magnet-0.114
 His-0.112
 bone-0.112
Token,
Token ID11
Feature activation+1.60e+00

Pos logit contributions
Fre+0.563
bel+0.560
Hel+0.539
Kay+0.537
Ag+0.530
Neg logit contributions
oscope-0.300
 apple-0.296
 magnet-0.294
 bone-0.293
 His-0.292
Token animals
Token ID4695
Feature activation+5.69e+00

Pos logit contributions
Fre+0.131
bel+0.129
Hel+0.124
Kay+0.124
ora+0.124
Neg logit contributions
oscope-0.087
 magnet-0.084
 apple-0.084
 x-0.083
 His-0.083
Token and
Token ID290
Feature activation-9.46e-01

Pos logit contributions
bel+0.512
Fre+0.503
Kay+0.494
Ag+0.489
Hel+0.481
Neg logit contributions
oscope-0.270
 bone-0.256
 magnet-0.254
 apple-0.248
 His-0.247
Token said
Token ID531
Feature activation+1.20e+00

Pos logit contributions
omm+0.015
 His+0.015
 x+0.015
oscope+0.015
 magnet+0.014
Neg logit contributions
Fre-0.011
bel-0.011
Hel-0.010
Kay-0.010
ora-0.010
Token Alice
Token ID14862
Feature activation+5.90e-01

Pos logit contributions
Fre+0.099
bel+0.098
Kay+0.094
Hel+0.094
ora+0.093
Neg logit contributions
oscope-0.064
omm-0.064
 x-0.063
 magnet-0.061
 His-0.061
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
bel+0.036
Fre+0.035
Hel+0.033
ora+0.032
Kay+0.032
Neg logit contributions
omm-0.045
 x-0.045
oscope-0.044
 magnet-0.043
 microwave-0.043
Token "
Token ID366
Feature activation-3.02e+00

Pos logit contributions
bel+0.130
Fre+0.128
Hel+0.123
Kay+0.122
Ag+0.117
Neg logit contributions
 x-0.107
omm-0.106
oscope-0.102
 magnet-0.102
 bone-0.099
Tokenrain
Token ID3201
Feature activation-1.03e-01

Pos logit contributions
oscope+0.215
omm+0.214
 seventy+0.213
 His+0.213
icle+0.209
Neg logit contributions
Fre-0.182
bel-0.173
ies-0.168
ora-0.167
Hel-0.162
Tokenbows
Token ID25435
Feature activation+1.48e+01

Pos logit contributions
omm+0.007
 x+0.007
 His+0.006
oscope+0.006
 magnet+0.006
Neg logit contributions
Fre-0.007
bel-0.007
Hel-0.007
Kay-0.007
ora-0.007
Token are
Token ID389
Feature activation+3.19e+00

Pos logit contributions
Fre+1.165
bel+1.134
Hel+1.108
Kay+1.092
Ag+1.079
Neg logit contributions
 teaches-0.833
 magnet-0.826
oscope-0.819
omm-0.815
 apple-0.815
Token the
Token ID262
Feature activation+6.20e-01

Pos logit contributions
Fre+0.229
bel+0.228
Hel+0.219
Kay+0.213
ora+0.213
Neg logit contributions
omm-0.203
 x-0.199
 His-0.197
oscope-0.197
 magnet-0.196
Token best
Token ID1266
Feature activation+1.00e+00

Pos logit contributions
Fre+0.045
bel+0.045
Hel+0.043
Kay+0.042
ora+0.041
Neg logit contributions
omm-0.039
 x-0.039
oscope-0.039
 magnet-0.038
 His-0.038
Token!"
Token ID2474
Feature activation-2.05e-01

Pos logit contributions
Fre+0.078
bel+0.076
Hel+0.074
Kay+0.071
ora+0.070
Neg logit contributions
omm-0.060
 x-0.059
 magnet-0.058
oscope-0.057
 apple-0.057
Token<|endoftext|>
Token ID50256
Feature activation-4.40e+00

Pos logit contributions
 x+0.011
oscope+0.011
omm+0.011
 magnet+0.011
penter+0.011
Neg logit contributions
Fre-0.017
bel-0.016
Hel-0.016
Kay-0.016
ora-0.016
Token bite
Token ID13197
Feature activation-7.60e+00

Pos logit contributions
Fre+0.098
bel+0.097
Kay+0.092
ora+0.092
Hel+0.091
Neg logit contributions
 His-0.067
 magnet-0.066
 x-0.065
 apple-0.065
 bone-0.064
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 His+0.476
omm+0.476
oscope+0.467
 x+0.466
 magnet+0.462
Neg logit contributions
Fre-0.560
bel-0.550
Hel-0.531
Kay-0.520
Ag-0.510
Token
Token ID198
Feature activation-7.04e-01

Pos logit contributions
 x+0.070
oscope+0.069
omm+0.066
oll+0.063
 magnet+0.063
Neg logit contributions
Fre-0.095
bel-0.092
Hel-0.091
Kay-0.090
Ag-0.088
Token
Token ID198
Feature activation-5.64e-01

Pos logit contributions
omm+0.047
oscope+0.046
 x+0.046
 magnet+0.046
penter+0.045
Neg logit contributions
Fre-0.050
bel-0.048
Hel-0.048
Kay-0.047
ora-0.045
TokenThe
Token ID464
Feature activation+1.20e+00

Pos logit contributions
omm+0.045
 x+0.044
oscope+0.044
 His+0.044
 magnet+0.043
Neg logit contributions
Fre-0.033
bel-0.032
Hel-0.030
Kay-0.030
ora-0.029
Token apple
Token ID17180
Feature activation-1.31e+01

Pos logit contributions
bel+0.088
Fre+0.088
Hel+0.083
ora+0.081
Kay+0.081
Neg logit contributions
omm-0.075
 His-0.075
oscope-0.071
penter-0.068
 x-0.068
Token belonged
Token ID19611
Feature activation-6.02e+00

Pos logit contributions
omm+0.914
 x+0.888
 His+0.876
oscope+0.873
 DVD+0.864
Neg logit contributions
bel-0.874
Fre-0.850
Hel-0.802
ora-0.801
Kay-0.785
Token to
Token ID284
Feature activation+2.03e+00

Pos logit contributions
omm+0.458
 His+0.452
oscope+0.448
 x+0.448
 magnet+0.446
Neg logit contributions
Fre-0.367
bel-0.357
Hel-0.341
Kay-0.332
ora-0.328
Token little
Token ID1310
Feature activation-1.72e+00

Pos logit contributions
Fre+0.155
bel+0.152
Hel+0.147
Kay+0.145
ora+0.143
Neg logit contributions
omm-0.123
oscope-0.120
 x-0.118
 His-0.117
 magnet-0.116
Token Jack
Token ID3619
Feature activation-8.64e-01

Pos logit contributions
 apple+0.090
 magnet+0.090
 x+0.089
 bone+0.088
oscope+0.084
Neg logit contributions
Fre-0.148
bel-0.148
Kay-0.140
uttering-0.140
Hel-0.139
Token.
Token ID13
Feature activation+3.07e-01

Pos logit contributions
omm+0.070
icle+0.065
 x+0.065
 microwave+0.063
 His+0.062
Neg logit contributions
bel-0.048
Fre-0.048
ora-0.046
Hel-0.046
ies-0.045
Token squirrel
Token ID33039
Feature activation+1.46e-01

Pos logit contributions
omm+0.106
 His+0.104
 x+0.103
oscope+0.103
 magnet+0.100
Neg logit contributions
Fre-0.165
bel-0.161
Hel-0.156
Kay-0.154
ora-0.152
Token.
Token ID13
Feature activation-9.90e-01

Pos logit contributions
Fre+0.010
bel+0.010
Hel+0.010
Kay+0.010
ora+0.009
Neg logit contributions
 His-0.010
omm-0.010
oscope-0.009
 x-0.009
 magnet-0.009
Token She
Token ID1375
Feature activation+1.46e+00

Pos logit contributions
 x+0.059
oscope+0.056
omm+0.055
 boo+0.053
oll+0.052
Neg logit contributions
Fre-0.080
Hel-0.077
Kay-0.076
bel-0.075
Ag-0.074
Token ate
Token ID15063
Feature activation-6.12e+00

Pos logit contributions
Fre+0.094
bel+0.093
Hel+0.088
Kay+0.086
ora+0.085
Neg logit contributions
omm-0.104
 x-0.102
oscope-0.102
 His-0.102
 magnet-0.100
Token her
Token ID607
Feature activation+1.19e+00

Pos logit contributions
omm+0.388
 x+0.378
oscope+0.378
 His+0.376
 magnet+0.369
Neg logit contributions
Fre-0.449
bel-0.444
Hel-0.424
Kay-0.416
ora-0.412
Token apple
Token ID17180
Feature activation-1.30e+01

Pos logit contributions
bel+0.091
Fre+0.087
ora+0.084
ies+0.083
Hel+0.079
Neg logit contributions
omm-0.073
penter-0.070
oscope-0.068
 x-0.067
 His-0.065
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
omm+0.881
 x+0.873
oscope+0.841
 His+0.838
 DVD+0.830
Neg logit contributions
bel-0.905
Fre-0.866
ora-0.821
Hel-0.799
Kay-0.791
Token they
Token ID484
Feature activation+3.14e+00

Pos logit contributions
 His+0.094
omm+0.093
oscope+0.093
 magnet+0.092
 x+0.091
Neg logit contributions
Fre-0.120
bel-0.118
Hel-0.114
Kay-0.112
ora-0.111
Token continued
Token ID3767
Feature activation-1.15e+00

Pos logit contributions
Fre+0.233
bel+0.225
Kay+0.217
Hel+0.216
uttering+0.214
Neg logit contributions
oscope-0.195
 His-0.194
 magnet-0.192
 apple-0.188
omm-0.188
Token on
Token ID319
Feature activation-1.32e+00

Pos logit contributions
omm+0.087
 His+0.086
oscope+0.085
 x+0.085
 magnet+0.085
Neg logit contributions
Fre-0.070
bel-0.068
Hel-0.065
Kay-0.064
ora-0.063
Token their
Token ID511
Feature activation+3.46e+00

Pos logit contributions
 His+0.082
 apple+0.081
 bone+0.080
 magnet+0.079
 x+0.079
Neg logit contributions
bel-0.101
Fre-0.099
ora-0.095
Hel-0.093
Kay-0.093
Token Lily
Token ID20037
Feature activation-3.32e+00

Pos logit contributions
 x+0.038
oscope+0.037
omm+0.037
 boo+0.035
 magnet+0.035
Neg logit contributions
Fre-0.048
Hel-0.046
bel-0.045
Kay-0.045
Ag-0.044
Token wanted
Token ID2227
Feature activation-3.11e-01

Pos logit contributions
omm+0.225
oscope+0.220
 His+0.220
 x+0.220
 magnet+0.216
Neg logit contributions
Fre-0.227
bel-0.224
Hel-0.213
Kay-0.209
ora-0.206
Token to
Token ID284
Feature activation+2.01e+00

Pos logit contributions
omm+0.024
 x+0.024
 His+0.024
oscope+0.024
 magnet+0.023
Neg logit contributions
Fre-0.018
bel-0.018
Hel-0.017
Kay-0.017
ora-0.017
Token taste
Token ID6938
Feature activation+1.24e+00

Pos logit contributions
Fre+0.143
bel+0.142
Hel+0.135
Kay+0.133
ora+0.131
Neg logit contributions
omm-0.133
oscope-0.129
 x-0.128
 His-0.127
 magnet-0.125
Token the
Token ID262
Feature activation+1.03e-01

Pos logit contributions
Fre+0.085
bel+0.084
Hel+0.080
Kay+0.079
ora+0.078
Neg logit contributions
omm-0.084
oscope-0.082
 x-0.082
 His-0.082
 magnet-0.080
Token apple
Token ID17180
Feature activation-1.29e+01

Pos logit contributions
Fre+0.008
bel+0.008
Hel+0.008
Kay+0.008
ora+0.008
Neg logit contributions
omm-0.006
 x-0.006
oscope-0.006
 magnet-0.006
 His-0.006
Token because
Token ID780
Feature activation-1.39e+00

Pos logit contributions
omm+0.893
oscope+0.873
 His+0.873
 x+0.871
 DVD+0.845
Neg logit contributions
Fre-0.856
bel-0.847
ora-0.793
Hel-0.786
Kay-0.779
Token it
Token ID340
Feature activation-4.28e-01

Pos logit contributions
oscope+0.110
omm+0.107
 x+0.106
penter+0.104
 DVD+0.104
Neg logit contributions
Fre-0.082
bel-0.078
Hel-0.073
Kay-0.073
ora-0.072
Token looked
Token ID3114
Feature activation+3.33e+00

Pos logit contributions
omm+0.030
 x+0.029
 His+0.029
oscope+0.029
 magnet+0.029
Neg logit contributions
Fre-0.028
bel-0.028
Hel-0.027
Kay-0.026
ora-0.026
Token very
Token ID845
Feature activation-4.28e-01

Pos logit contributions
Fre+0.254
bel+0.250
Hel+0.244
Kay+0.239
ora+0.234
Neg logit contributions
omm-0.200
 His-0.200
 x-0.198
 magnet-0.196
 apple-0.193
Token y
Token ID331
Feature activation-3.25e+00

Pos logit contributions
omm+0.026
 His+0.026
 magnet+0.025
 x+0.025
 apple+0.025
Neg logit contributions
Fre-0.032
bel-0.032
Hel-0.032
Kay-0.031
Jay-0.030
Token They
Token ID1119
Feature activation+1.41e+00

Pos logit contributions
 x+0.127
oscope+0.123
omm+0.121
 boo+0.115
 magnet+0.115
Neg logit contributions
Fre-0.164
Hel-0.160
bel-0.159
Kay-0.156
Ag-0.153
Token both
Token ID1111
Feature activation-7.09e-01

Pos logit contributions
Fre+0.098
bel+0.096
uttering+0.093
Hel+0.091
Kay+0.091
Neg logit contributions
oscope-0.094
 His-0.093
 magnet-0.091
 x-0.090
penter-0.090
Token enjoyed
Token ID8359
Feature activation+2.08e+00

Pos logit contributions
oscope+0.048
 His+0.048
 boo+0.046
 teaches+0.045
penter+0.045
Neg logit contributions
Fre-0.049
bel-0.048
uttering-0.047
Ag-0.046
Hel-0.045
Token the
Token ID262
Feature activation-5.27e-02

Pos logit contributions
Fre+0.159
bel+0.157
ora+0.155
Hel+0.149
Kay+0.148
Neg logit contributions
 His-0.125
 DVD-0.119
 apple-0.118
 magnet-0.116
 knee-0.116
Token tasty
Token ID25103
Feature activation-2.28e+00

Pos logit contributions
 x+0.003
omm+0.003
oscope+0.003
 magnet+0.003
 His+0.003
Neg logit contributions
Fre-0.004
bel-0.004
Hel-0.004
Kay-0.004
ora-0.003
Token apple
Token ID17180
Feature activation-1.29e+01

Pos logit contributions
omm+0.117
 x+0.116
oscope+0.115
 His+0.114
 magnet+0.114
Neg logit contributions
Fre-0.195
bel-0.192
Hel-0.188
Kay-0.184
Ag-0.180
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
 magnet+0.937
oscope+0.932
 apple+0.930
omm+0.929
 His+0.927
Neg logit contributions
Fre-0.812
bel-0.770
Hel-0.766
Kay-0.738
ora-0.718
Token thanked
Token ID26280
Feature activation-1.39e+00

Pos logit contributions
 His+0.089
oscope+0.088
omm+0.087
 magnet+0.087
 apple+0.086
Neg logit contributions
Fre-0.094
bel-0.091
Hel-0.088
uttering-0.087
ora-0.086
Token the
Token ID262
Feature activation+3.81e-01

Pos logit contributions
omm+0.098
 His+0.098
 x+0.097
oscope+0.096
 magnet+0.095
Neg logit contributions
Fre-0.092
bel-0.091
Hel-0.086
Kay-0.084
ora-0.083
Token tree
Token ID5509
Feature activation-2.76e+00

Pos logit contributions
Fre+0.028
bel+0.027
Hel+0.026
Kay+0.026
ora+0.025
Neg logit contributions
omm-0.024
 His-0.024
oscope-0.024
 x-0.023
 magnet-0.023
Token.
Token ID13
Feature activation+4.16e-01

Pos logit contributions
omm+0.175
 x+0.167
 magnet+0.163
oscope+0.161
 boo+0.160
Neg logit contributions
bel-0.210
Fre-0.200
ora-0.191
Hel-0.190
Kay-0.190
Tokenmy
Token ID1820
Feature activation-5.34e+00

Pos logit contributions
omm+0.276
 x+0.258
 magnet+0.246
 DVD+0.245
 His+0.244
Neg logit contributions
Fre-0.410
bel-0.401
Hel-0.398
ora-0.396
Kay-0.385
Token wanted
Token ID2227
Feature activation-6.30e-01

Pos logit contributions
omm+0.407
 x+0.396
oscope+0.384
 magnet+0.381
 His+0.380
Neg logit contributions
Fre-0.324
bel-0.322
Hel-0.307
Kay-0.297
ora-0.297
Token to
Token ID284
Feature activation+1.70e+00

Pos logit contributions
omm+0.052
oscope+0.051
 His+0.050
 x+0.050
 magnet+0.050
Neg logit contributions
Fre-0.034
bel-0.034
Hel-0.032
Kay-0.031
ora-0.030
Token eat
Token ID4483
Feature activation-5.67e+00

Pos logit contributions
Fre+0.119
bel+0.118
Kay+0.112
Hel+0.111
ora+0.110
Neg logit contributions
omm-0.115
oscope-0.109
 DVD-0.107
 x-0.107
 microwave-0.106
Token an
Token ID281
Feature activation+1.29e+00

Pos logit contributions
omm+0.365
oscope+0.355
penter+0.345
 His+0.334
 DVD+0.330
Neg logit contributions
Fre-0.411
bel-0.399
ora-0.378
Hel-0.374
Kay-0.372
Token apple
Token ID17180
Feature activation-1.28e+01

Pos logit contributions
Fre+0.105
 insects+0.104
ies+0.103
ora+0.100
uttering+0.099
Neg logit contributions
penter-0.061
 DVD-0.061
 Promise-0.060
omm-0.059
oscope-0.059
Token,
Token ID11
Feature activation+8.94e-01

Pos logit contributions
omm+0.978
oscope+0.944
 x+0.938
 DVD+0.919
 His+0.916
Neg logit contributions
bel-0.772
Fre-0.751
ora-0.708
ies-0.679
uttering-0.667
Token but
Token ID475
Feature activation+2.64e+00

Pos logit contributions
Fre+0.057
bel+0.053
Hel+0.051
Kay+0.051
Ag+0.050
Neg logit contributions
omm-0.068
oscope-0.066
 DVD-0.064
 x-0.063
penter-0.063
Token he
Token ID339
Feature activation-2.41e+00

Pos logit contributions
bel+0.168
Fre+0.165
Kay+0.160
J+0.158
ies+0.157
Neg logit contributions
omm-0.187
oscope-0.186
 x-0.184
 DVD-0.181
 teaches-0.172
Token accidentally
Token ID14716
Feature activation-1.56e+00

Pos logit contributions
 DVD+0.191
omm+0.189
 x+0.188
oscope+0.186
 microwave+0.180
Neg logit contributions
Fre-0.137
bel-0.130
ora-0.126
Hel-0.123
Kay-0.123
Token knocked
Token ID13642
Feature activation-2.49e+00

Pos logit contributions
omm+0.120
 DVD+0.110
 x+0.110
penter+0.110
 His+0.109
Neg logit contributions
bel-0.095
Fre-0.089
J-0.083
Hel-0.083
Kay-0.082
Token He
Token ID679
Feature activation-3.92e+00

Pos logit contributions
 x+0.076
oscope+0.073
omm+0.070
 boo+0.069
oll+0.067
Neg logit contributions
Fre-0.090
Hel-0.087
Kay-0.085
bel-0.085
Ag-0.083
Token said
Token ID531
Feature activation+4.21e-01

Pos logit contributions
omm+0.298
 x+0.287
oscope+0.281
 His+0.278
icle+0.277
Neg logit contributions
bel-0.239
Fre-0.239
Hel-0.221
ora-0.219
Kay-0.217
Token,
Token ID11
Feature activation+1.16e+00

Pos logit contributions
Fre+0.029
bel+0.029
Hel+0.027
Kay+0.027
ora+0.026
Neg logit contributions
omm-0.028
oscope-0.028
 x-0.028
 His-0.027
 magnet-0.027
Token "
Token ID366
Feature activation-3.58e+00

Pos logit contributions
bel+0.088
Fre+0.087
Hel+0.083
Kay+0.080
Ag+0.079
Neg logit contributions
 x-0.068
oscope-0.067
omm-0.067
oll-0.065
penter-0.062
TokenThis
Token ID1212
Feature activation-6.76e-01

Pos logit contributions
omm+0.291
 x+0.286
oscope+0.282
 His+0.282
 magnet+0.279
Neg logit contributions
Fre-0.201
bel-0.198
Hel-0.188
Kay-0.183
ora-0.180
Token apple
Token ID17180
Feature activation-1.28e+01

Pos logit contributions
oscope+0.046
omm+0.044
 x+0.044
 DVD+0.044
penter+0.044
Neg logit contributions
Fre-0.044
ora-0.041
bel-0.039
Kay-0.039
ies-0.038
Token looks
Token ID3073
Feature activation+5.06e-01

Pos logit contributions
omm+0.810
 x+0.781
oscope+0.778
 His+0.775
 DVD+0.766
Neg logit contributions
bel-0.916
Fre-0.879
ora-0.853
ies-0.838
Kay-0.791
Token y
Token ID331
Feature activation-3.11e+00

Pos logit contributions
Fre+0.033
bel+0.033
Hel+0.031
Kay+0.030
ora+0.030
Neg logit contributions
omm-0.036
oscope-0.035
 x-0.034
 His-0.034
 magnet-0.034
Tokenummy
Token ID13513
Feature activation-2.86e+00

Pos logit contributions
oscope+0.161
 x+0.157
 His+0.154
omm+0.154
penter+0.152
Neg logit contributions
Fre-0.261
bel-0.255
ora-0.252
uttering-0.244
Hel-0.241
Token!"
Token ID2474
Feature activation-5.87e-01

Pos logit contributions
omm+0.195
 His+0.192
 x+0.190
oscope+0.187
penter+0.182
Neg logit contributions
Fre-0.194
bel-0.190
ora-0.181
Hel-0.175
Kay-0.174
Token Mom
Token ID11254
Feature activation-3.47e+00

Pos logit contributions
 x+0.036
omm+0.035
oscope+0.035
 magnet+0.034
 seventy+0.033
Neg logit contributions
bel-0.045
Fre-0.044
Hel-0.042
Kay-0.042
ora-0.042
Token He
Token ID679
Feature activation-3.68e+00

Pos logit contributions
 x+0.033
oscope+0.033
omm+0.031
 boo+0.030
 magnet+0.030
Neg logit contributions
Fre-0.041
bel-0.040
Hel-0.040
Kay-0.039
Ag-0.039
Token did
Token ID750
Feature activation+2.27e+00

Pos logit contributions
omm+0.261
 x+0.250
oscope+0.248
 His+0.246
icle+0.244
Neg logit contributions
bel-0.242
Fre-0.239
Hel-0.223
ora-0.218
Kay-0.217
Token not
Token ID407
Feature activation-5.92e-01

Pos logit contributions
bel+0.178
Fre+0.176
Kay+0.165
Hel+0.165
ora+0.161
Neg logit contributions
omm-0.133
oscope-0.130
 x-0.127
 magnet-0.125
 His-0.125
Token like
Token ID588
Feature activation+3.17e+00

Pos logit contributions
oscope+0.038
 x+0.038
 His+0.037
 magnet+0.037
 boo+0.037
Neg logit contributions
Fre-0.043
bel-0.042
Hel-0.041
Kay-0.040
uttering-0.040
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
bel+0.219
Fre+0.219
Hel+0.205
Kay+0.201
ora+0.197
Neg logit contributions
omm-0.212
oscope-0.205
 His-0.204
 x-0.203
penter-0.199
Token apple
Token ID17180
Feature activation-1.28e+01

Pos logit contributions
Fre+0.001
bel+0.001
Hel+0.001
Kay+0.001
ora+0.001
Neg logit contributions
omm-0.001
 x-0.001
oscope-0.001
 His-0.001
 magnet-0.001
Token sauce
Token ID10746
Feature activation-3.72e+00

Pos logit contributions
omm+0.872
 x+0.844
 His+0.841
oscope+0.840
 DVD+0.814
Neg logit contributions
bel-0.869
Fre-0.863
Hel-0.807
ora-0.796
Kay-0.794
Token.
Token ID13
Feature activation+3.62e-01

Pos logit contributions
omm+0.256
 x+0.249
 His+0.248
oscope+0.248
 magnet+0.241
Neg logit contributions
bel-0.248
Fre-0.248
Hel-0.232
ora-0.228
Kay-0.227
Token He
Token ID679
Feature activation-3.23e+00

Pos logit contributions
J+0.031
Fre+0.030
Kay+0.030
Hel+0.030
bel+0.029
Neg logit contributions
oscope-0.018
 x-0.017
 boo-0.016
 forehead-0.016
 seventy-0.016
Token wanted
Token ID2227
Feature activation+1.42e-01

Pos logit contributions
omm+0.246
 x+0.238
oscope+0.236
 His+0.236
 magnet+0.234
Neg logit contributions
Fre-0.194
bel-0.193
Hel-0.180
Kay-0.176
ora-0.176
Token to
Token ID284
Feature activation+2.32e+00

Pos logit contributions
bel+0.007
Fre+0.007
Hel+0.007
Kay+0.007
J+0.006
Neg logit contributions
omm-0.012
oscope-0.012
penter-0.011
 seventy-0.011
 DVD-0.011
Token "
Token ID366
Feature activation-5.23e+00

Pos logit contributions
 His+0.124
 apple+0.119
omm+0.115
 bone+0.111
 Your+0.107
Neg logit contributions
bel-0.198
Fre-0.189
Hel-0.188
Jay-0.183
Kay-0.182
TokenDon
Token ID3987
Feature activation+1.62e+00

Pos logit contributions
omm+0.430
 x+0.421
 His+0.421
oscope+0.420
 magnet+0.414
Neg logit contributions
Fre-0.285
bel-0.281
Hel-0.264
Kay-0.257
ora-0.254
Token't
Token ID470
Feature activation-1.47e+00

Pos logit contributions
Kay+0.088
Hel+0.088
Ag+0.087
Fre+0.086
bel+0.086
Neg logit contributions
 microwave-0.126
oscope-0.126
 trailer-0.123
urger-0.123
 DVD-0.123
Token eat
Token ID4483
Feature activation-5.74e+00

Pos logit contributions
omm+0.101
 x+0.098
oscope+0.097
 His+0.097
 DVD+0.095
Neg logit contributions
Fre-0.101
bel-0.101
Hel-0.096
Kay-0.095
ora-0.093
Token the
Token ID262
Feature activation-2.23e-02

Pos logit contributions
omm+0.390
oscope+0.383
 His+0.377
penter+0.374
 x+0.368
Neg logit contributions
Fre-0.391
bel-0.377
ora-0.357
Hel-0.354
Kay-0.352
Token apple
Token ID17180
Feature activation-1.28e+01

Pos logit contributions
omm+0.001
 His+0.001
oscope+0.001
penter+0.001
 x+0.001
Neg logit contributions
bel-0.002
Fre-0.002
ora-0.002
Hel-0.002
Kay-0.002
Token,
Token ID11
Feature activation+9.26e-01

Pos logit contributions
omm+0.902
 x+0.888
 His+0.881
oscope+0.879
 DVD+0.854
Neg logit contributions
bel-0.839
Fre-0.828
ora-0.776
Hel-0.749
Kay-0.748
Token it
Token ID340
Feature activation-2.71e-01

Pos logit contributions
Fre+0.055
bel+0.051
Kay+0.050
Ag+0.048
Hel+0.048
Neg logit contributions
oscope-0.072
omm-0.069
 DVD-0.067
 x-0.066
oll-0.065
Token's
Token ID338
Feature activation-7.44e+00

Pos logit contributions
omm+0.016
 x+0.015
 His+0.015
 DVD+0.014
icle+0.014
Neg logit contributions
bel-0.020
Fre-0.020
ora-0.020
ies-0.019
Hel-0.019
Token bad
Token ID2089
Feature activation-5.01e-01

Pos logit contributions
omm+0.534
penter+0.527
 His+0.527
 DVD+0.524
oscope+0.516
Neg logit contributions
 common-0.428
ies-0.424
Fre-0.419
bel-0.408
uttering-0.380
Token."
Token ID526
Feature activation-2.88e+00

Pos logit contributions
omm+0.033
 x+0.032
 His+0.032
oscope+0.032
penter+0.031
Neg logit contributions
bel-0.034
Fre-0.034
ora-0.032
Hel-0.032
Kay-0.031
TokenThe
Token ID464
Feature activation+1.36e+00

Pos logit contributions
omm+0.041
 His+0.040
 x+0.040
oscope+0.040
 magnet+0.039
Neg logit contributions
Fre-0.030
bel-0.030
Hel-0.028
Kay-0.027
ora-0.027
Token polite
Token ID23507
Feature activation+5.35e-01

Pos logit contributions
Fre+0.102
bel+0.101
Hel+0.097
ora+0.095
Kay+0.094
Neg logit contributions
omm-0.083
 His-0.083
oscope-0.079
 x-0.076
icle-0.075
Token raven
Token ID37735
Feature activation-9.57e-01

Pos logit contributions
Fre+0.039
bel+0.038
Hel+0.037
ora+0.036
Kay+0.036
Neg logit contributions
omm-0.034
 x-0.033
 His-0.033
oscope-0.033
 magnet-0.032
Token ate
Token ID15063
Feature activation-5.62e+00

Pos logit contributions
omm+0.068
 x+0.066
 His+0.065
oscope+0.065
 magnet+0.064
Neg logit contributions
Fre-0.063
bel-0.062
Hel-0.059
ora-0.058
Kay-0.058
Token the
Token ID262
Feature activation+2.98e-01

Pos logit contributions
omm+0.347
 x+0.338
oscope+0.337
 His+0.337
 magnet+0.330
Neg logit contributions
Fre-0.421
bel-0.416
Hel-0.399
Kay-0.391
ora-0.388
Token apple
Token ID17180
Feature activation-1.28e+01

Pos logit contributions
bel+0.022
Fre+0.022
Hel+0.021
ora+0.021
Kay+0.020
Neg logit contributions
omm-0.019
 His-0.018
penter-0.017
oscope-0.017
 x-0.017
Token and
Token ID290
Feature activation-1.39e+00

Pos logit contributions
omm+0.887
 x+0.861
oscope+0.838
 His+0.836
 DVD+0.821
Neg logit contributions
bel-0.872
Fre-0.846
ora-0.794
Hel-0.790
Kay-0.777
Token was
Token ID373
Feature activation-5.61e-01

Pos logit contributions
 His+0.089
oscope+0.089
omm+0.088
 magnet+0.088
 x+0.087
Neg logit contributions
Fre-0.100
bel-0.097
Hel-0.093
Kay-0.092
uttering-0.091
Token soon
Token ID2582
Feature activation+1.05e+00

Pos logit contributions
omm+0.043
 x+0.042
 magnet+0.041
oscope+0.041
 microwave+0.040
Neg logit contributions
bel-0.034
Fre-0.034
Kay-0.032
Hel-0.031
uttering-0.030
Token feeling
Token ID4203
Feature activation+1.99e+00

Pos logit contributions
Fre+0.085
bel+0.083
Hel+0.081
Kay+0.079
ora+0.079
Neg logit contributions
 His-0.057
omm-0.057
 magnet-0.056
oscope-0.056
 x-0.055
Token full
Token ID1336
Feature activation-3.50e-01

Pos logit contributions
bel+0.142
Fre+0.142
Hel+0.135
Kay+0.131
ora+0.129
Neg logit contributions
omm-0.130
 x-0.126
oscope-0.125
 His-0.124
 magnet-0.122
Token.
Token ID13
Feature activation+2.50e-01

Pos logit contributions
 His+0.032
 apple+0.031
omm+0.030
 x+0.029
penter+0.029
Neg logit contributions
Fre-0.057
Hel-0.054
bel-0.053
Jay-0.051
Kay-0.050
Token She
Token ID1375
Feature activation+2.13e+00

Pos logit contributions
Fre+0.016
bel+0.015
Hel+0.015
Kay+0.015
ora+0.015
Neg logit contributions
oscope-0.018
 x-0.018
 magnet-0.017
 boo-0.017
 DVD-0.017
Token bit
Token ID1643
Feature activation-8.47e-01

Pos logit contributions
Fre+0.144
bel+0.141
Hel+0.135
Kay+0.132
ora+0.131
Neg logit contributions
omm-0.144
 His-0.144
oscope-0.143
 x-0.142
 magnet-0.140
Token into
Token ID656
Feature activation+5.64e-01

Pos logit contributions
 His+0.052
 boo+0.051
 x+0.051
omm+0.051
 apple+0.050
Neg logit contributions
Fre-0.064
bel-0.064
Hel-0.062
Kay-0.060
Ag-0.059
Token the
Token ID262
Feature activation+9.62e-02

Pos logit contributions
bel+0.046
Fre+0.046
ora+0.045
Hel+0.044
Ag+0.044
Neg logit contributions
 x-0.032
 His-0.030
 apple-0.030
 magnet-0.030
oscope-0.029
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
Fre+0.007
bel+0.007
Hel+0.007
Kay+0.006
ora+0.006
Neg logit contributions
omm-0.006
 x-0.006
oscope-0.006
 His-0.006
 magnet-0.006
Token and
Token ID290
Feature activation-1.36e+00

Pos logit contributions
omm+0.767
 x+0.748
oscope+0.746
 His+0.746
 magnet+0.732
Neg logit contributions
Fre-0.969
bel-0.958
Hel-0.919
Kay-0.902
ora-0.893
Token it
Token ID340
Feature activation-3.33e-01

Pos logit contributions
omm+0.090
 His+0.090
oscope+0.090
 x+0.089
 magnet+0.089
Neg logit contributions
Fre-0.094
bel-0.091
Hel-0.089
Kay-0.086
uttering-0.085
Token was
Token ID373
Feature activation-6.36e-01

Pos logit contributions
omm+0.022
oscope+0.022
 His+0.022
 magnet+0.022
 x+0.022
Neg logit contributions
Fre-0.023
bel-0.023
Hel-0.022
Kay-0.022
uttering-0.021
Token so
Token ID523
Feature activation-1.48e+00

Pos logit contributions
omm+0.042
oscope+0.041
 x+0.041
 His+0.040
 magnet+0.040
Neg logit contributions
Fre-0.045
bel-0.044
Hel-0.042
Kay-0.042
ora-0.040
Token sweet
Token ID6029
Feature activation+2.43e+00

Pos logit contributions
 His+0.087
 apple+0.087
omm+0.087
 magnet+0.084
 x+0.082
Neg logit contributions
Fre-0.113
Hel-0.112
bel-0.112
Kay-0.107
Jay-0.107
Token happy
Token ID3772
Feature activation-1.69e+00

Pos logit contributions
omm+0.076
 His+0.075
 x+0.075
oscope+0.074
 magnet+0.073
Neg logit contributions
Fre-0.093
bel-0.091
Hel-0.088
Kay-0.086
ora-0.085
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
 His+0.097
omm+0.096
oscope+0.094
 magnet+0.094
 x+0.093
Neg logit contributions
Fre-0.136
bel-0.130
Hel-0.128
Kay-0.125
ora-0.124
Token He
Token ID679
Feature activation-3.33e+00

Pos logit contributions
Fre+0.007
Kay+0.007
Hel+0.007
Ag+0.007
J+0.007
Neg logit contributions
oscope-0.007
 x-0.007
 boo-0.006
oll-0.006
 forehead-0.006
Token shared
Token ID4888
Feature activation-6.85e-02

Pos logit contributions
omm+0.252
 x+0.246
oscope+0.244
 His+0.238
 magnet+0.236
Neg logit contributions
Fre-0.207
bel-0.204
Hel-0.196
Kay-0.191
ora-0.188
Token the
Token ID262
Feature activation+2.44e-01

Pos logit contributions
omm+0.003
oscope+0.003
 x+0.003
 His+0.003
 magnet+0.003
Neg logit contributions
Fre-0.006
bel-0.006
Hel-0.006
Kay-0.006
ora-0.006
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
bel+0.018
Fre+0.018
Hel+0.017
ora+0.016
Kay+0.016
Neg logit contributions
omm-0.016
 His-0.015
oscope-0.015
penter-0.015
 x-0.014
Token with
Token ID351
Feature activation+4.23e-01

Pos logit contributions
omm+0.892
 x+0.874
oscope+0.863
 His+0.848
 DVD+0.829
Neg logit contributions
bel-0.861
Fre-0.842
ora-0.790
Hel-0.784
Kay-0.779
Token Mr
Token ID1770
Feature activation-6.77e+00

Pos logit contributions
Fre+0.037
bel+0.037
Hel+0.036
Kay+0.035
ora+0.035
Neg logit contributions
omm-0.020
oscope-0.020
 x-0.020
 seventy-0.019
 DVD-0.019
Token.
Token ID13
Feature activation+3.34e-01

Pos logit contributions
omm+0.477
 x+0.467
 His+0.467
oscope+0.466
 magnet+0.459
Neg logit contributions
Fre-0.450
bel-0.446
Hel-0.422
Kay-0.414
ora-0.407
Token Squirrel
Token ID44259
Feature activation-1.44e+00

Pos logit contributions
Fre+0.027
Hel+0.026
Kay+0.025
Jay+0.025
Ag+0.024
Neg logit contributions
oscope-0.018
 teaches-0.017
 x-0.017
oll-0.017
 forehead-0.016
Token.
Token ID13
Feature activation+3.93e-01

Pos logit contributions
omm+0.098
 x+0.096
oscope+0.093
 His+0.091
 magnet+0.091
Neg logit contributions
bel-0.100
Fre-0.100
Hel-0.094
ora-0.093
Kay-0.093
Token on
Token ID319
Feature activation-2.45e+00

Pos logit contributions
omm+0.525
 His+0.517
 x+0.515
oscope+0.514
 magnet+0.505
Neg logit contributions
Fre-0.565
bel-0.554
Hel-0.531
Kay-0.521
ora-0.515
Token,
Token ID11
Feature activation+2.69e-02

Pos logit contributions
omm+0.155
 His+0.154
 x+0.152
 magnet+0.150
oscope+0.150
Neg logit contributions
Fre-0.181
bel-0.177
Hel-0.170
ora-0.166
Kay-0.166
Token Jimmy
Token ID12963
Feature activation-3.14e+00

Pos logit contributions
Fre+0.002
bel+0.002
Hel+0.002
Kay+0.002
ora+0.002
Neg logit contributions
omm-0.002
 His-0.002
 x-0.002
oscope-0.002
 magnet-0.002
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
omm+0.209
 x+0.204
oscope+0.201
 His+0.200
 magnet+0.199
Neg logit contributions
Fre-0.219
bel-0.218
Hel-0.207
Kay-0.202
ora-0.202
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
omm+0.094
 x+0.089
oscope+0.087
 magnet+0.083
 microwave+0.082
Neg logit contributions
bel-0.141
Fre-0.137
Hel-0.131
Kay-0.130
ora-0.127
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
omm+0.009
 His+0.009
 x+0.009
oscope+0.009
 magnet+0.009
Neg logit contributions
Fre-0.012
bel-0.012
Hel-0.011
Kay-0.011
ora-0.011
Token were
Token ID547
Feature activation+5.05e+00

Pos logit contributions
omm+0.876
 His+0.867
oscope+0.867
 x+0.860
 magnet+0.856
Neg logit contributions
Fre-0.860
bel-0.829
Hel-0.805
Kay-0.789
ora-0.776
Token always
Token ID1464
Feature activation+6.68e-01

Pos logit contributions
bel+0.359
Fre+0.358
Hel+0.339
Kay+0.334
Ag+0.326
Neg logit contributions
omm-0.327
oscope-0.320
 x-0.319
 His-0.316
 magnet-0.305
Token together
Token ID1978
Feature activation+4.21e+00

Pos logit contributions
Fre+0.051
bel+0.048
Hel+0.048
Kay+0.047
ora+0.047
Neg logit contributions
 x-0.041
omm-0.041
 magnet-0.041
oscope-0.040
 His-0.040
Token.
Token ID13
Feature activation+4.51e-01

Pos logit contributions
Fre+0.309
bel+0.299
Hel+0.292
Kay+0.288
ora+0.284
Neg logit contributions
 His-0.264
oscope-0.260
omm-0.260
 magnet-0.259
 x-0.255
Token<|endoftext|>
Token ID50256
Feature activation-4.77e+00

Pos logit contributions
Fre+0.027
Kay+0.026
Hel+0.025
Ag+0.024
Jay+0.024
Neg logit contributions
oscope-0.036
 x-0.035
 boo-0.032
 knee-0.032
 apple-0.030
Token juice
Token ID13135
Feature activation-6.56e+00

Pos logit contributions
 apple+0.210
 x+0.210
oscope+0.208
 magnet+0.205
 His+0.204
Neg logit contributions
Fre-0.279
Hel-0.272
bel-0.269
Kay-0.266
Jay-0.259
Token.
Token ID13
Feature activation-5.57e-01

Pos logit contributions
omm+0.405
oscope+0.393
 x+0.393
 His+0.393
 magnet+0.386
Neg logit contributions
Fre-0.490
bel-0.483
Hel-0.465
Kay-0.456
ora-0.450
Token Amy
Token ID14235
Feature activation-3.15e-01

Pos logit contributions
 x+0.037
oscope+0.037
omm+0.034
 boo+0.034
 magnet+0.034
Neg logit contributions
Fre-0.040
Hel-0.038
Kay-0.038
Ag-0.037
bel-0.037
Token had
Token ID550
Feature activation-7.30e-01

Pos logit contributions
omm+0.021
 x+0.021
 His+0.021
oscope+0.021
 magnet+0.020
Neg logit contributions
Fre-0.022
bel-0.021
Hel-0.020
ora-0.020
Kay-0.020
Token an
Token ID281
Feature activation+1.39e+00

Pos logit contributions
 His+0.041
 x+0.040
omm+0.040
oscope+0.040
 boo+0.039
Neg logit contributions
Fre-0.059
bel-0.058
ora-0.056
Hel-0.056
Kay-0.055
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
Fre+0.058
ora+0.055
uttering+0.054
 insects+0.049
ies+0.049
Neg logit contributions
osaurus-0.117
 His-0.116
 DVD-0.115
oscope-0.114
bar-0.113
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
omm+0.919
 x+0.898
oscope+0.895
 His+0.894
 DVD+0.869
Neg logit contributions
bel-0.810
Fre-0.780
ora-0.742
Kay-0.712
ies-0.707
Token They
Token ID1119
Feature activation+2.62e+00

Pos logit contributions
Fre+0.017
Kay+0.017
Hel+0.017
J+0.016
Jud+0.016
Neg logit contributions
oscope-0.025
 x-0.023
 boo-0.023
 knee-0.022
bar-0.022
Token ate
Token ID15063
Feature activation-5.56e+00

Pos logit contributions
Fre+0.171
bel+0.167
Hel+0.160
Kay+0.156
ora+0.154
Neg logit contributions
omm-0.186
 His-0.185
oscope-0.184
 x-0.182
 magnet-0.182
Token their
Token ID511
Feature activation+3.46e+00

Pos logit contributions
omm+0.363
 His+0.355
 x+0.354
oscope+0.351
 magnet+0.349
Neg logit contributions
Fre-0.398
bel-0.393
Hel-0.377
Kay-0.369
ora-0.364
Token lunch
Token ID9965
Feature activation-3.61e+00

Pos logit contributions
bel+0.272
Fre+0.263
ora+0.251
Kay+0.245
Hel+0.244
Neg logit contributions
omm-0.199
oscope-0.196
 x-0.191
penter-0.191
 His-0.187
Token tree
Token ID5509
Feature activation-2.75e+00

Pos logit contributions
bel+0.020
Fre+0.020
ora+0.018
Kay+0.018
 perspective+0.018
Neg logit contributions
 DVD-0.017
omm-0.017
oscope-0.017
penter-0.016
 His-0.016
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
omm+0.238
 x+0.224
oscope+0.222
 boo+0.220
 Promise+0.219
Neg logit contributions
bel-0.150
Fre-0.132
ora-0.128
Kay-0.126
uttering-0.126
Token He
Token ID679
Feature activation-3.29e+00

Pos logit contributions
bel+0.019
Fre+0.019
Kay+0.017
Ag+0.017
Hel+0.017
Neg logit contributions
oscope-0.019
 x-0.016
oll-0.016
omm-0.016
 seventy-0.016
Token got
Token ID1392
Feature activation-1.23e+00

Pos logit contributions
omm+0.228
 x+0.223
oscope+0.222
 His+0.218
 DVD+0.216
Neg logit contributions
Fre-0.223
bel-0.222
Hel-0.206
ora-0.204
Kay-0.203
Token the
Token ID262
Feature activation+2.61e-01

Pos logit contributions
omm+0.090
oscope+0.089
penter+0.085
 x+0.084
 DVD+0.083
Neg logit contributions
bel-0.082
Fre-0.080
Hel-0.073
Kay-0.072
ora-0.072
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
bel+0.025
Fre+0.025
ora+0.024
uttering+0.023
Hel+0.023
Neg logit contributions
oscope-0.010
omm-0.010
 DVD-0.010
 x-0.010
penter-0.010
Token and
Token ID290
Feature activation-1.42e+00

Pos logit contributions
omm+0.973
 x+0.956
 DVD+0.955
oscope+0.951
 His+0.909
Neg logit contributions
bel-0.756
Fre-0.701
ora-0.681
ies-0.679
uttering-0.672
Token gave
Token ID2921
Feature activation-6.65e+00

Pos logit contributions
omm+0.102
 x+0.097
oscope+0.096
 DVD+0.093
 His+0.091
Neg logit contributions
bel-0.097
Fre-0.092
Hel-0.087
ora-0.086
Kay-0.085
Token it
Token ID340
Feature activation-3.85e-01

Pos logit contributions
omm+0.508
oscope+0.506
 x+0.485
 DVD+0.478
penter+0.475
Neg logit contributions
bel-0.388
Fre-0.385
Kay-0.380
Hel-0.375
ora-0.374
Token to
Token ID284
Feature activation+2.10e+00

Pos logit contributions
omm+0.034
oscope+0.033
 x+0.033
penter+0.032
 DVD+0.032
Neg logit contributions
bel-0.020
Fre-0.018
Hel-0.017
ora-0.017
J-0.016
Token Tim
Token ID5045
Feature activation-3.47e+00

Pos logit contributions
Fre+0.143
Kay+0.142
ora+0.135
bel+0.134
Hel+0.131
Neg logit contributions
oscope-0.140
oll-0.134
omm-0.132
 DVD-0.127
 x-0.124
Token wanted
Token ID2227
Feature activation-1.36e+00

Pos logit contributions
omm+0.342
 x+0.336
 His+0.335
oscope+0.333
 magnet+0.329
Neg logit contributions
Fre-0.322
bel-0.320
Hel-0.302
Kay-0.296
ora-0.295
Token to
Token ID284
Feature activation+1.41e+00

Pos logit contributions
omm+0.111
 x+0.108
oscope+0.108
 His+0.108
 magnet+0.107
Neg logit contributions
Fre-0.076
bel-0.075
Hel-0.070
Kay-0.069
ora-0.068
Token look
Token ID804
Feature activation+4.39e+00

Pos logit contributions
Fre+0.102
bel+0.099
Kay+0.095
Hel+0.095
ora+0.093
Neg logit contributions
omm-0.094
oscope-0.089
 x-0.087
 His-0.086
 DVD-0.086
Token at
Token ID379
Feature activation-1.01e+00

Pos logit contributions
Fre+0.327
bel+0.323
Hel+0.309
Kay+0.303
ora+0.300
Neg logit contributions
omm-0.272
 x-0.266
 His-0.265
oscope-0.265
 magnet-0.260
Token the
Token ID262
Feature activation-9.05e-02

Pos logit contributions
omm+0.062
 His+0.062
 x+0.061
oscope+0.061
 magnet+0.060
Neg logit contributions
Fre-0.076
bel-0.075
Hel-0.072
Kay-0.070
ora-0.069
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
omm+0.006
 x+0.006
oscope+0.006
 His+0.006
 magnet+0.005
Neg logit contributions
Fre-0.007
bel-0.007
Hel-0.006
Kay-0.006
ora-0.006
Token some
Token ID617
Feature activation-1.99e+00

Pos logit contributions
omm+0.929
 His+0.899
oscope+0.890
 x+0.887
 DVD+0.861
Neg logit contributions
Fre-0.803
bel-0.799
ora-0.750
Hel-0.734
Kay-0.727
Token more
Token ID517
Feature activation-2.63e+00

Pos logit contributions
 x+0.128
 magnet+0.126
 apple+0.126
oscope+0.126
 His+0.125
Neg logit contributions
Fre-0.145
bel-0.142
Hel-0.139
Kay-0.135
ora-0.132
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
 apple+0.168
 magnet+0.166
 His+0.165
 x+0.165
oscope+0.164
Neg logit contributions
Fre-0.191
Hel-0.188
bel-0.187
Ag-0.176
Kay-0.176
Token His
Token ID2399
Feature activation-7.24e+00

Pos logit contributions
Fre+0.011
Kay+0.010
Hel+0.010
bel+0.010
ora+0.010
Neg logit contributions
 x-0.008
oscope-0.008
omm-0.008
 magnet-0.007
 boo-0.007
Token mom
Token ID1995
Feature activation-2.64e+00

Pos logit contributions
oscope+0.405
omm+0.401
 DVD+0.392
 x+0.391
penter+0.387
Neg logit contributions
bel-0.571
Fre-0.565
ora-0.533
Kay-0.529
Hel-0.528
Tokenue
Token ID518
Feature activation-1.84e+00

Pos logit contributions
Fre+0.269
bel+0.262
Hel+0.254
Kay+0.252
ora+0.247
Neg logit contributions
omm-0.195
 His-0.192
oscope-0.191
 x-0.191
 magnet-0.188
Token sp
Token ID599
Feature activation+3.77e+00

Pos logit contributions
omm+0.119
 His+0.119
oscope+0.119
 x+0.116
penter+0.114
Neg logit contributions
Fre-0.129
bel-0.128
Hel-0.123
Kay-0.119
Ag-0.118
Tokenits
Token ID896
Feature activation+1.92e+00

Pos logit contributions
Fre+0.235
Hel+0.235
bel+0.221
Kay+0.217
Ag+0.214
Neg logit contributions
omm-0.290
 His-0.274
oll-0.271
 x-0.267
oscope-0.262
Token out
Token ID503
Feature activation-1.41e+00

Pos logit contributions
Fre+0.145
bel+0.144
Hel+0.138
Kay+0.135
ora+0.134
Neg logit contributions
omm-0.117
 x-0.116
 His-0.116
oscope-0.114
 magnet-0.112
Token the
Token ID262
Feature activation-5.55e-02

Pos logit contributions
 His+0.079
 x+0.077
 apple+0.075
omm+0.075
 magnet+0.074
Neg logit contributions
Fre-0.115
bel-0.112
Hel-0.111
Ag-0.109
ora-0.108
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
omm+0.003
 His+0.003
oscope+0.003
 x+0.003
 magnet+0.003
Neg logit contributions
Fre-0.004
bel-0.004
Hel-0.004
ora-0.004
Kay-0.004
Token.
Token ID13
Feature activation+2.46e-01

Pos logit contributions
omm+0.895
 x+0.870
 His+0.864
oscope+0.856
 DVD+0.829
Neg logit contributions
bel-0.833
Fre-0.817
ora-0.766
Hel-0.756
Kay-0.746
Token She
Token ID1375
Feature activation+1.99e+00

Pos logit contributions
Fre+0.017
Kay+0.016
bel+0.016
Hel+0.016
ora+0.016
Neg logit contributions
oscope-0.017
 x-0.017
 boo-0.016
 DVD-0.016
omm-0.016
Token sp
Token ID599
Feature activation+4.81e+00

Pos logit contributions
Fre+0.134
bel+0.132
Hel+0.124
ora+0.123
Kay+0.122
Neg logit contributions
omm-0.137
oscope-0.133
 x-0.133
 His-0.129
icle-0.129
Tokenits
Token ID896
Feature activation+1.73e+00

Pos logit contributions
Fre+0.332
Hel+0.327
bel+0.322
Kay+0.307
Ag+0.304
Neg logit contributions
omm-0.337
oll-0.322
 His-0.316
 x-0.316
oscope-0.303
Token out
Token ID503
Feature activation-8.82e-01

Pos logit contributions
Fre+0.125
bel+0.124
Hel+0.119
Kay+0.116
ora+0.115
Neg logit contributions
omm-0.111
oscope-0.108
 x-0.108
 His-0.108
 magnet-0.106
Token pick
Token ID2298
Feature activation-2.82e+00

Pos logit contributions
 His+0.186
oscope+0.182
 x+0.181
 magnet+0.181
 boo+0.179
Neg logit contributions
Fre-0.188
bel-0.178
Hel-0.174
Kay-0.173
ora-0.170
Token a
Token ID257
Feature activation-1.77e+00

Pos logit contributions
omm+0.160
 x+0.156
 His+0.156
oscope+0.155
 magnet+0.152
Neg logit contributions
Fre-0.225
bel-0.223
Hel-0.214
Kay-0.210
ora-0.208
Token fresh
Token ID4713
Feature activation-7.14e-01

Pos logit contributions
omm+0.119
oscope+0.116
 His+0.116
 x+0.113
penter+0.112
Neg logit contributions
Fre-0.123
bel-0.122
Hel-0.115
Kay-0.114
ora-0.113
Token,
Token ID11
Feature activation+6.48e-01

Pos logit contributions
omm+0.050
 His+0.050
oscope+0.048
 x+0.048
penter+0.048
Neg logit contributions
Fre-0.046
bel-0.044
ora-0.044
Hel-0.042
Kay-0.041
Token delicious
Token ID12625
Feature activation-3.50e+00

Pos logit contributions
bel+0.041
Fre+0.041
Hel+0.039
Kay+0.038
Ag+0.037
Neg logit contributions
omm-0.048
oscope-0.046
 His-0.045
 x-0.045
penter-0.044
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
omm+0.202
 His+0.199
oscope+0.196
 x+0.192
penter+0.190
Neg logit contributions
Fre-0.274
bel-0.271
ora-0.257
Hel-0.255
Kay-0.255
Token from
Token ID422
Feature activation-4.21e+00

Pos logit contributions
omm+0.902
 x+0.878
 His+0.877
oscope+0.865
 magnet+0.836
Neg logit contributions
bel-0.836
Fre-0.827
ora-0.769
Hel-0.764
Kay-0.757
Token the
Token ID262
Feature activation+1.52e-01

Pos logit contributions
omm+0.299
oscope+0.287
 His+0.279
 x+0.278
 magnet+0.273
Neg logit contributions
Fre-0.281
bel-0.275
Kay-0.261
Hel-0.261
ora-0.260
Token tree
Token ID5509
Feature activation-2.76e+00

Pos logit contributions
Fre+0.011
bel+0.011
Hel+0.010
ora+0.010
Kay+0.010
Neg logit contributions
 His-0.009
omm-0.009
oscope-0.009
 x-0.009
penter-0.009
Token in
Token ID287
Feature activation+8.03e-01

Pos logit contributions
omm+0.211
 x+0.204
 His+0.200
oscope+0.198
 magnet+0.197
Neg logit contributions
bel-0.169
Fre-0.165
Hel-0.155
ora-0.152
Kay-0.152
Token her
Token ID607
Feature activation+1.31e+00

Pos logit contributions
Fre+0.057
bel+0.056
Hel+0.053
Kay+0.053
ora+0.052
Neg logit contributions
omm-0.054
oscope-0.052
 x-0.052
 His-0.050
penter-0.050
Token,
Token ID11
Feature activation+1.03e+00

Pos logit contributions
omm+0.290
 His+0.288
 x+0.283
oscope+0.281
 magnet+0.279
Neg logit contributions
Fre-0.298
bel-0.295
Hel-0.279
Kay-0.275
ora-0.271
Token Fl
Token ID1610
Feature activation-4.23e+00

Pos logit contributions
bel+0.067
Fre+0.066
Hel+0.063
Kay+0.061
Ag+0.061
Neg logit contributions
omm-0.076
oscope-0.075
 x-0.073
 magnet-0.071
 His-0.071
Tokenuffy
Token ID15352
Feature activation+4.20e+00

Pos logit contributions
omm+0.318
 x+0.304
 His+0.300
oscope+0.299
 magnet+0.295
Neg logit contributions
Fre-0.272
bel-0.268
Hel-0.249
Kay-0.246
Ag-0.238
Token ate
Token ID15063
Feature activation-5.73e+00

Pos logit contributions
Fre+0.261
Hel+0.250
bel+0.249
ora+0.242
Kay+0.235
Neg logit contributions
omm-0.311
 x-0.306
 DVD-0.305
oscope-0.296
icle-0.294
Token the
Token ID262
Feature activation+1.77e-01

Pos logit contributions
oscope+0.480
omm+0.462
penter+0.455
 x+0.446
 DVD+0.436
Neg logit contributions
Fre-0.313
bel-0.298
ora-0.285
Kay-0.277
J-0.277
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
bel+0.017
Fre+0.017
uttering+0.016
ora+0.016
Kay+0.016
Neg logit contributions
oscope-0.007
omm-0.006
 DVD-0.006
penter-0.006
 x-0.006
Token and
Token ID290
Feature activation-1.59e+00

Pos logit contributions
omm+0.930
 x+0.912
oscope+0.886
 DVD+0.881
 His+0.863
Neg logit contributions
bel-0.824
Fre-0.773
ora-0.745
uttering-0.724
ies-0.716
Token it
Token ID340
Feature activation-4.17e-01

Pos logit contributions
omm+0.110
 x+0.108
oscope+0.106
 His+0.105
 magnet+0.103
Neg logit contributions
Fre-0.108
bel-0.107
Hel-0.102
Kay-0.100
ora-0.099
Token was
Token ID373
Feature activation-9.10e-01

Pos logit contributions
omm+0.031
 x+0.030
oscope+0.030
 His+0.030
 magnet+0.029
Neg logit contributions
bel-0.026
Fre-0.026
Hel-0.024
Kay-0.024
ora-0.024
Token very
Token ID845
Feature activation-5.56e-01

Pos logit contributions
oscope+0.079
 x+0.077
omm+0.074
penter+0.073
 DVD+0.071
Neg logit contributions
Fre-0.046
bel-0.045
Kay-0.043
ies-0.042
J-0.042
Token y
Token ID331
Feature activation-3.32e+00

Pos logit contributions
oscope+0.041
omm+0.040
 x+0.040
penter+0.039
 DVD+0.038
Neg logit contributions
Fre-0.036
bel-0.036
ora-0.033
Kay-0.033
Hel-0.033
Token the
Token ID262
Feature activation+2.40e-01

Pos logit contributions
Fre+0.109
bel+0.103
Kay+0.102
Hel+0.099
ora+0.098
Neg logit contributions
omm-0.183
oscope-0.180
oll-0.174
penter-0.173
 DVD-0.173
Token kitchen
Token ID9592
Feature activation-1.76e+00

Pos logit contributions
bel+0.015
Hel+0.015
Fre+0.015
ora+0.014
Kay+0.014
Neg logit contributions
icle-0.016
oscope-0.016
omm-0.016
oll-0.015
 His-0.015
Token and
Token ID290
Feature activation-1.36e+00

Pos logit contributions
omm+0.095
 His+0.094
oscope+0.092
 x+0.092
 magnet+0.091
Neg logit contributions
Fre-0.148
Hel-0.142
bel-0.141
Kay-0.138
Ag-0.136
Token found
Token ID1043
Feature activation+2.17e+00

Pos logit contributions
omm+0.101
 x+0.098
oscope+0.098
 His+0.097
 magnet+0.096
Neg logit contributions
Fre-0.085
bel-0.085
Hel-0.080
Kay-0.078
ora-0.078
Token an
Token ID281
Feature activation+1.32e+00

Pos logit contributions
Fre+0.145
bel+0.141
Hel+0.134
Kay+0.133
ora+0.130
Neg logit contributions
oscope-0.151
omm-0.150
 His-0.145
penter-0.143
 x-0.142
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
Fre+0.096
 insects+0.094
uttering+0.094
ies+0.089
 perspective+0.089
Neg logit contributions
 DVD-0.067
oscope-0.066
 His-0.062
osaurus-0.059
icle-0.058
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
omm+0.970
 His+0.955
oscope+0.942
 x+0.938
 DVD+0.911
Neg logit contributions
bel-0.746
Fre-0.707
ora-0.687
ies-0.667
uttering-0.661
Token She
Token ID1375
Feature activation+2.02e+00

Pos logit contributions
Fre+0.016
bel+0.015
Hel+0.015
Kay+0.015
ora+0.014
Neg logit contributions
oscope-0.015
 x-0.015
 DVD-0.014
penter-0.014
 magnet-0.014
Token grabbed
Token ID13176
Feature activation-1.87e+00

Pos logit contributions
bel+0.118
Fre+0.116
Kay+0.106
ora+0.105
Hel+0.105
Neg logit contributions
omm-0.157
 x-0.155
 DVD-0.154
oscope-0.154
icle-0.148
Token it
Token ID340
Feature activation-3.58e-01

Pos logit contributions
oscope+0.149
omm+0.148
penter+0.143
 His+0.141
 DVD+0.139
Neg logit contributions
Fre-0.105
bel-0.100
Hel-0.095
Kay-0.095
ora-0.094
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
omm+0.027
oscope+0.027
 DVD+0.027
 x+0.026
 microwave+0.026
Neg logit contributions
bel-0.021
Fre-0.020
uttering-0.020
ora-0.019
Hel-0.019
Token he
Token ID339
Feature activation-4.15e+00

Pos logit contributions
omm+0.285
oscope+0.268
 His+0.265
 x+0.265
 magnet+0.259
Neg logit contributions
Fre-0.287
bel-0.281
Hel-0.269
ora-0.264
Kay-0.262
Token got
Token ID1392
Feature activation-2.13e+00

Pos logit contributions
omm+0.310
 x+0.302
 His+0.300
oscope+0.299
 magnet+0.295
Neg logit contributions
Fre-0.256
bel-0.254
Hel-0.240
Kay-0.234
ora-0.232
Token from
Token ID422
Feature activation-4.85e+00

Pos logit contributions
 His+0.129
omm+0.129
 x+0.128
oscope+0.125
 magnet+0.125
Neg logit contributions
Fre-0.160
bel-0.156
Hel-0.153
Kay-0.149
ora-0.148
Token the
Token ID262
Feature activation-2.88e-01

Pos logit contributions
omm+0.288
 x+0.285
 His+0.283
oscope+0.283
 magnet+0.278
Neg logit contributions
Fre-0.372
bel-0.367
Hel-0.353
Kay-0.347
ora-0.344
Token magical
Token ID10883
Feature activation+6.05e-01

Pos logit contributions
omm+0.017
 His+0.017
oscope+0.016
 x+0.015
penter+0.015
Neg logit contributions
bel-0.022
Fre-0.022
Hel-0.021
ora-0.021
Kay-0.021
Token apple
Token ID17180
Feature activation-1.27e+01

Pos logit contributions
Fre+0.045
bel+0.044
Hel+0.042
ora+0.041
Kay+0.041
Neg logit contributions
omm-0.038
 His-0.037
oscope-0.036
 x-0.036
 magnet-0.035
Token.
Token ID13
Feature activation+1.55e-01

Pos logit contributions
omm+0.921
 x+0.891
oscope+0.873
 His+0.870
 DVD+0.856
Neg logit contributions
bel-0.825
Fre-0.798
ora-0.744
Hel-0.739
Kay-0.724
Token
Token ID220
Feature activation-1.81e+00

Pos logit contributions
Fre+0.013
Kay+0.012
Hel+0.012
Ag+0.012
ora+0.012
Neg logit contributions
 x-0.009
oscope-0.009
 boo-0.008
omm-0.008
 knee-0.008
Token
Token ID198
Feature activation-1.46e-01

Pos logit contributions
oscope+0.141
 x+0.140
 boo+0.138
omm+0.137
 magnet+0.134
Neg logit contributions
bel-0.106
Fre-0.106
Hel-0.104
Kay-0.102
ora-0.097
Token
Token ID198
Feature activation-1.46e-01

Pos logit contributions
omm+0.010
 x+0.010
oscope+0.010
 magnet+0.010
 boo+0.010
Neg logit contributions
Fre-0.010
bel-0.010
Hel-0.010
Kay-0.010
ora-0.009
TokenThe
Token ID464
Feature activation+1.65e+00

Pos logit contributions
omm+0.012
 x+0.011
oscope+0.011
 magnet+0.011
 His+0.011
Neg logit contributions
Fre-0.009
bel-0.008
Hel-0.008
Kay-0.008
Ag-0.008
Token Kevin
Token ID7939
Feature activation+3.51e+00

Pos logit contributions
Fre+0.117
bel+0.110
Hel+0.107
Kay+0.106
ora+0.106
Neg logit contributions
omm-0.099
oscope-0.099
 x-0.098
 DVD-0.095
penter-0.095
Token!"
Token ID2474
Feature activation-2.94e-01

Pos logit contributions
Fre+0.258
bel+0.255
Hel+0.245
Kay+0.240
ora+0.237
Neg logit contributions
omm-0.220
 x-0.214
oscope-0.214
 His-0.214
 magnet-0.210
Token
Token ID220
Feature activation-1.28e+00

Pos logit contributions
omm+0.015
 His+0.015
 x+0.015
oscope+0.015
 magnet+0.015
Neg logit contributions
Fre-0.025
bel-0.025
Hel-0.024
Kay-0.023
ora-0.023
Token
Token ID198
Feature activation+2.46e-01

Pos logit contributions
oscope+0.095
 x+0.091
omm+0.090
 magnet+0.089
 DVD+0.088
Neg logit contributions
Fre-0.081
bel-0.080
Hel-0.077
Kay-0.076
ora-0.075
Token
Token ID198
Feature activation+2.56e-01

Pos logit contributions
bel+0.020
Fre+0.019
uttering+0.018
Ag+0.018
Hel+0.018
Neg logit contributions
 His-0.014
omm-0.014
 x-0.013
 magnet-0.013
oscope-0.012
TokenKevin
Token ID23865
Feature activation+9.61e+00

Pos logit contributions
bel+0.014
Fre+0.013
uttering+0.013
ies+0.012
Ag+0.012
Neg logit contributions
 His-0.021
omm-0.020
 magnet-0.019
penter-0.019
oscope-0.019
Token was
Token ID373
Feature activation-3.46e-01

Pos logit contributions
bel+0.616
Fre+0.615
Hel+0.581
ora+0.575
Kay+0.570
Neg logit contributions
omm-0.693
 x-0.677
oscope-0.657
 magnet-0.657
 DVD-0.651
Token very
Token ID845
Feature activation-5.48e-02

Pos logit contributions
omm+0.023
 magnet+0.022
oscope+0.022
 x+0.022
 His+0.021
Neg logit contributions
Fre-0.025
bel-0.024
Kay-0.024
Hel-0.023
Ag-0.022
Token excited
Token ID6568
Feature activation+8.20e-02

Pos logit contributions
 His+0.003
omm+0.003
 boo+0.003
 x+0.003
 knee+0.003
Neg logit contributions
Fre-0.004
bel-0.004
Hel-0.004
Kay-0.004
ora-0.004
Token and
Token ID290
Feature activation-9.71e-01

Pos logit contributions
Fre+0.007
bel+0.006
Hel+0.006
Kay+0.006
ora+0.006
Neg logit contributions
 His-0.005
omm-0.005
 x-0.005
oscope-0.005
 magnet-0.004
Token ran
Token ID4966
Feature activation+2.53e-01

Pos logit contributions
omm+0.066
 x+0.064
oscope+0.064
 His+0.063
 magnet+0.062
Neg logit contributions
Fre-0.067
bel-0.067
Hel-0.064
Kay-0.062
ora-0.062
Token a
Token ID257
Feature activation-8.80e-01

Pos logit contributions
omm+0.408
 x+0.402
 His+0.401
oscope+0.401
 magnet+0.394
Neg logit contributions
Fre-0.378
bel-0.374
Hel-0.355
Kay-0.348
ora-0.345
Token lot
Token ID1256
Feature activation+3.60e+00

Pos logit contributions
omm+0.065
 His+0.061
oscope+0.061
penter+0.059
oll+0.059
Neg logit contributions
bel-0.055
Fre-0.055
Hel-0.051
Kay-0.050
 common-0.049
Token of
Token ID286
Feature activation-3.42e+00

Pos logit contributions
Fre+0.276
bel+0.274
Hel+0.262
Kay+0.257
ora+0.256
Neg logit contributions
omm-0.216
 His-0.205
oscope-0.204
 x-0.204
 magnet-0.202
Token love
Token ID1842
Feature activation+2.46e+00

Pos logit contributions
omm+0.282
 x+0.276
 His+0.276
oscope+0.276
 magnet+0.272
Neg logit contributions
Fre-0.186
bel-0.183
Hel-0.172
Kay-0.168
ora-0.166
Token and
Token ID290
Feature activation-1.21e+00

Pos logit contributions
Fre+0.153
bel+0.152
Hel+0.142
ora+0.139
Kay+0.139
Neg logit contributions
omm-0.181
 His-0.175
 x-0.175
oscope-0.174
penter-0.170
Token care
Token ID1337
Feature activation+8.63e+00

Pos logit contributions
omm+0.081
 x+0.078
oscope+0.078
 His+0.078
 magnet+0.076
Neg logit contributions
Fre-0.085
bel-0.084
Hel-0.080
Kay-0.078
ora-0.078
Token.
Token ID13
Feature activation+3.86e-01

Pos logit contributions
bel+0.626
Fre+0.612
Hel+0.590
Kay+0.567
ora+0.566
Neg logit contributions
omm-0.559
 x-0.537
oscope-0.532
 His-0.524
 magnet-0.519
Token The
Token ID383
Feature activation-2.10e+00

Pos logit contributions
Fre+0.025
Hel+0.024
Kay+0.023
Jay+0.022
J+0.022
Neg logit contributions
oscope-0.027
 boo-0.027
 x-0.026
 knee-0.026
 forehead-0.026
Token zoo
Token ID26626
Feature activation+1.60e+00

Pos logit contributions
omm+0.128
 His+0.128
 Your+0.118
oll+0.117
 seventy+0.116
Neg logit contributions
Fre-0.157
bel-0.148
ora-0.146
Hel-0.145
Kay-0.141
Token will
Token ID481
Feature activation+1.78e+00

Pos logit contributions
Fre+0.132
bel+0.128
Hel+0.125
Kay+0.123
Ag+0.122
Neg logit contributions
omm-0.087
oscope-0.086
 x-0.085
 His-0.085
 magnet-0.084
Token take
Token ID1011
Feature activation-3.43e+00

Pos logit contributions
Fre+0.141
bel+0.139
Hel+0.134
Kay+0.131
ora+0.130
Neg logit contributions
omm-0.102
 x-0.099
 His-0.099
oscope-0.099
 magnet-0.097
Token The
Token ID383
Feature activation-2.19e+00

Pos logit contributions
Fre+0.051
Hel+0.051
bel+0.050
Kay+0.049
Ag+0.048
Neg logit contributions
oscope-0.028
 x-0.027
omm-0.026
 seventy-0.026
 magnet-0.026
Token water
Token ID1660
Feature activation+1.08e+00

Pos logit contributions
omm+0.149
 His+0.144
oscope+0.144
 x+0.144
 magnet+0.140
Neg logit contributions
Fre-0.151
bel-0.149
Hel-0.142
Kay-0.139
ora-0.138
Token was
Token ID373
Feature activation-4.09e-01

Pos logit contributions
Fre+0.076
bel+0.074
Hel+0.071
Kay+0.070
ora+0.069
Neg logit contributions
omm-0.071
 His-0.071
oscope-0.070
 magnet-0.069
 x-0.069
Token much
Token ID881
Feature activation-2.05e+00

Pos logit contributions
omm+0.029
oscope+0.028
 x+0.028
 His+0.027
 magnet+0.027
Neg logit contributions
bel-0.027
Fre-0.027
Hel-0.026
Kay-0.025
uttering-0.025
Token cal
Token ID2386
Feature activation-1.43e+00

Pos logit contributions
 x+0.094
omm+0.093
 magnet+0.092
 His+0.092
oscope+0.090
Neg logit contributions
Fre-0.187
bel-0.182
Hel-0.177
ora-0.175
Kay-0.175
Tokenmer
Token ID647
Feature activation+9.27e+00

Pos logit contributions
penter+0.132
oscope+0.127
 x+0.123
 DVD+0.123
 magnet+0.119
Neg logit contributions
Fre-0.069
bel-0.061
Kay-0.061
Hel-0.059
Jay-0.057
Token now
Token ID783
Feature activation-3.98e+00

Pos logit contributions
Fre+0.685
bel+0.677
Hel+0.647
Kay+0.633
ora+0.625
Neg logit contributions
omm-0.579
oscope-0.573
 His-0.567
 x-0.566
 magnet-0.553
Token.
Token ID13
Feature activation+6.30e-01

Pos logit contributions
omm+0.254
 His+0.253
oscope+0.252
 x+0.252
 magnet+0.249
Neg logit contributions
Fre-0.289
bel-0.284
Hel-0.270
Kay-0.266
ora-0.264
Token Everyone
Token ID11075
Feature activation+3.54e+00

Pos logit contributions
Fre+0.052
Hel+0.051
Kay+0.050
bel+0.049
Ag+0.048
Neg logit contributions
oscope-0.035
 x-0.034
 seventy-0.033
 magnet-0.032
omm-0.032
Token was
Token ID373
Feature activation-3.71e-01

Pos logit contributions
Fre+0.271
bel+0.262
Hel+0.250
Kay+0.250
ora+0.248
Neg logit contributions
 His-0.218
oscope-0.207
 magnet-0.205
 DVD-0.203
penter-0.202
Token able
Token ID1498
Feature activation-6.03e+00

Pos logit contributions
omm+0.024
 x+0.023
 His+0.023
oscope+0.023
 magnet+0.023
Neg logit contributions
Fre-0.027
bel-0.027
Hel-0.026
Kay-0.025
ora-0.025
Token listen
Token ID6004
Feature activation+2.70e+00

Pos logit contributions
oscope+0.006
 x+0.006
 boo+0.006
 magnet+0.006
 His+0.006
Neg logit contributions
Fre-0.010
bel-0.010
Hel-0.009
Ag-0.009
uttering-0.009
Token.
Token ID13
Feature activation+6.24e-01

Pos logit contributions
Fre+0.210
bel+0.197
Hel+0.195
Kay+0.191
ora+0.189
Neg logit contributions
 His-0.169
omm-0.163
oscope-0.162
 x-0.161
 magnet-0.156
Token He
Token ID679
Feature activation-2.87e+00

Pos logit contributions
bel+0.059
Hel+0.058
J+0.058
Fre+0.057
Kay+0.056
Neg logit contributions
oscope-0.026
omm-0.023
 x-0.023
 seventy-0.023
penter-0.023
Token did
Token ID750
Feature activation+2.74e+00

Pos logit contributions
omm+0.202
oscope+0.199
 x+0.199
 His+0.198
 magnet+0.195
Neg logit contributions
Fre-0.189
bel-0.186
Hel-0.177
Kay-0.173
ora-0.171
Token not
Token ID407
Feature activation-1.36e-01

Pos logit contributions
bel+0.219
Fre+0.219
Hel+0.208
Kay+0.206
ora+0.203
Neg logit contributions
omm-0.155
oscope-0.150
 x-0.148
 His-0.148
 magnet-0.146
Token care
Token ID1337
Feature activation+8.79e+00

Pos logit contributions
oscope+0.008
 x+0.008
 boo+0.008
 magnet+0.008
 His+0.007
Neg logit contributions
Fre-0.011
bel-0.011
uttering-0.010
Ag-0.010
Hel-0.010
Token.
Token ID13
Feature activation+6.33e-01

Pos logit contributions
bel+0.571
Fre+0.562
Hel+0.540
Kay+0.523
ora+0.513
Neg logit contributions
omm-0.626
oscope-0.611
 magnet-0.598
 x-0.596
 seventy-0.593
Token He
Token ID679
Feature activation-2.89e+00

Pos logit contributions
bel+0.061
Fre+0.060
Hel+0.060
J+0.059
Kay+0.059
Neg logit contributions
oscope-0.025
omm-0.023
 x-0.023
penter-0.023
 seventy-0.022
Token said
Token ID531
Feature activation+1.18e+00

Pos logit contributions
omm+0.201
oscope+0.199
 x+0.198
 His+0.198
 magnet+0.193
Neg logit contributions
Fre-0.193
bel-0.190
Hel-0.181
Kay-0.177
ora-0.175
Token,
Token ID11
Feature activation+1.69e+00

Pos logit contributions
Fre+0.085
bel+0.084
Hel+0.080
Kay+0.079
ora+0.078
Neg logit contributions
omm-0.076
 x-0.074
oscope-0.074
 His-0.074
 magnet-0.072
Token "
Token ID366
Feature activation-2.94e+00

Pos logit contributions
bel+0.133
Hel+0.128
Fre+0.127
J+0.119
Ag+0.117
Neg logit contributions
omm-0.097
 x-0.090
oscope-0.088
oll-0.087
 magnet-0.087
Token my
Token ID616
Feature activation-5.25e+00

Pos logit contributions
Fre+0.347
bel+0.347
Hel+0.328
ora+0.324
Kay+0.319
Neg logit contributions
omm-0.269
oscope-0.259
 His-0.257
 x-0.257
penter-0.251
Token castle
Token ID16669
Feature activation+2.12e+00

Pos logit contributions
omm+0.330
 His+0.321
 x+0.321
oscope+0.321
 magnet+0.313
Neg logit contributions
Fre-0.386
bel-0.382
Hel-0.365
Kay-0.357
ora-0.355
Token is
Token ID318
Feature activation-1.66e+00

Pos logit contributions
bel+0.163
Fre+0.151
ora+0.144
ies+0.143
Hel+0.143
Neg logit contributions
omm-0.125
 DVD-0.121
 His-0.119
oscope-0.119
 microwave-0.117
Token?"
Token ID1701
Feature activation-3.19e+00

Pos logit contributions
omm+0.098
oscope+0.096
 His+0.093
 DVD+0.093
 x+0.091
Neg logit contributions
Fre-0.126
bel-0.126
Hel-0.118
uttering-0.115
ora-0.114
Token
Token ID198
Feature activation+3.52e-01

Pos logit contributions
 His+0.196
omm+0.180
 x+0.177
oscope+0.171
penter+0.168
Neg logit contributions
uttering-0.251
bel-0.250
Fre-0.248
Hel-0.237
Ag-0.234
TokenEl
Token ID9527
Feature activation+9.94e+00

Pos logit contributions
uttering+0.033
bel+0.033
ies+0.031
Fre+0.031
Ag+0.030
Neg logit contributions
 His-0.016
omm-0.014
 x-0.013
penter-0.012
 seventy-0.012
Tokeniza
Token ID23638
Feature activation+6.78e+00

Pos logit contributions
bel+0.621
Fre+0.603
Hel+0.584
Kay+0.561
Ag+0.556
Neg logit contributions
oscope-0.746
 x-0.740
omm-0.739
 His-0.709
 boo-0.701
Token guessed
Token ID25183
Feature activation+2.65e+00

Pos logit contributions
Fre+0.451
bel+0.445
Hel+0.425
Kay+0.416
ora+0.413
Neg logit contributions
omm-0.471
 x-0.462
oscope-0.460
 His-0.457
 magnet-0.451
Token,
Token ID11
Feature activation+1.68e+00

Pos logit contributions
Fre+0.185
bel+0.183
Hel+0.174
Kay+0.171
ora+0.169
Neg logit contributions
omm-0.177
 x-0.173
 His-0.173
oscope-0.172
 magnet-0.169
Token "
Token ID366
Feature activation-3.08e+00

Pos logit contributions
Fre+0.134
bel+0.129
Hel+0.128
Kay+0.124
Ag+0.121
Neg logit contributions
omm-0.091
 x-0.091
oscope-0.090
 DVD-0.088
 magnet-0.088
TokenIs
Token ID3792
Feature activation-2.71e+00

Pos logit contributions
 His+0.209
oscope+0.202
 magnet+0.200
omm+0.199
icle+0.199
Neg logit contributions
Fre-0.195
ies-0.190
bel-0.185
uttering-0.182
ora-0.182
Token mirror
Token ID10162
Feature activation+2.41e+00

Pos logit contributions
omm+0.017
 x+0.017
oscope+0.016
 magnet+0.016
 His+0.016
Neg logit contributions
Fre-0.019
bel-0.019
Hel-0.018
Kay-0.018
Ag-0.018
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
Fre+0.190
Hel+0.178
bel+0.175
Kay+0.174
Ag+0.173
Neg logit contributions
 x-0.147
omm-0.146
oscope-0.146
 His-0.144
 magnet-0.140
Token she
Token ID673
Feature activation+3.04e+00

Pos logit contributions
Fre+0.063
bel+0.059
uttering+0.058
Hel+0.057
ora+0.057
Neg logit contributions
 His-0.080
omm-0.076
 x-0.076
 magnet-0.075
 apple-0.074
Token would
Token ID561
Feature activation-8.99e-01

Pos logit contributions
Fre+0.184
bel+0.173
Hel+0.167
uttering+0.166
Ag+0.165
Neg logit contributions
 His-0.230
omm-0.229
oscope-0.226
 x-0.224
 magnet-0.223
Token always
Token ID1464
Feature activation+5.01e-01

Pos logit contributions
omm+0.057
 His+0.056
oscope+0.056
 x+0.056
 magnet+0.055
Neg logit contributions
Fre-0.065
bel-0.064
Hel-0.061
Kay-0.060
ora-0.060
Token feel
Token ID1254
Feature activation+5.60e+00

Pos logit contributions
Fre+0.038
bel+0.037
Hel+0.036
Kay+0.035
ora+0.035
Neg logit contributions
oscope-0.030
 His-0.030
 x-0.030
 magnet-0.030
omm-0.030
Token very
Token ID845
Feature activation-5.07e-01

Pos logit contributions
Fre+0.433
bel+0.424
Hel+0.410
Kay+0.404
ora+0.400
Neg logit contributions
 His-0.329
 x-0.327
omm-0.326
oscope-0.324
 magnet-0.321
Token special
Token ID2041
Feature activation-1.34e+00

Pos logit contributions
 His+0.028
 boo+0.028
omm+0.027
 knee+0.027
 x+0.027
Neg logit contributions
Fre-0.042
Hel-0.040
bel-0.040
Kay-0.039
Jay-0.038
Token because
Token ID780
Feature activation-1.12e+00

Pos logit contributions
oscope+0.080
omm+0.079
 His+0.079
 x+0.079
 magnet+0.078
Neg logit contributions
Fre-0.105
Hel-0.100
bel-0.100
Kay-0.098
ora-0.095
Token of
Token ID286
Feature activation-3.58e+00

Pos logit contributions
omm+0.090
 x+0.088
 His+0.088
oscope+0.088
 magnet+0.087
Neg logit contributions
Fre-0.062
bel-0.061
Hel-0.058
Kay-0.056
ora-0.056
Token how
Token ID703
Feature activation-4.71e+00

Pos logit contributions
omm+0.225
 x+0.225
 His+0.223
 magnet+0.219
oscope+0.217
Neg logit contributions
bel-0.266
Fre-0.261
Hel-0.249
ora-0.243
Kay-0.241
Token.
Token ID13
Feature activation+6.01e-01

Pos logit contributions
Fre+0.020
bel+0.020
Hel+0.019
ora+0.019
Kay+0.019
Neg logit contributions
omm-0.028
 x-0.028
 His-0.027
oscope-0.027
 magnet-0.027
Token I
Token ID314
Feature activation-3.88e+00

Pos logit contributions
Fre+0.054
Hel+0.053
ora+0.052
Kay+0.051
Ag+0.050
Neg logit contributions
 x-0.028
 magnet-0.027
oscope-0.027
 boo-0.026
 motorcycle-0.026
Token'm
Token ID1101
Feature activation+7.13e+00

Pos logit contributions
oscope+0.283
 teaches+0.278
 His+0.272
 x+0.270
 boo+0.268
Neg logit contributions
bel-0.248
Fre-0.239
uttering-0.230
Hel-0.224
Kay-0.216
Token glad
Token ID9675
Feature activation-1.95e+00

Pos logit contributions
Fre+0.523
bel+0.491
Hel+0.486
Kay+0.485
Ag+0.473
Neg logit contributions
omm-0.464
oscope-0.461
 magnet-0.458
 His-0.446
 x-0.443
Token you
Token ID345
Feature activation+1.47e+00

Pos logit contributions
omm+0.110
 His+0.108
 x+0.103
 apple+0.102
 seventy+0.100
Neg logit contributions
bel-0.160
Fre-0.155
Hel-0.151
Kay-0.147
uttering-0.146
Token like
Token ID588
Feature activation+3.54e+00

Pos logit contributions
bel+0.103
Fre+0.103
Kay+0.095
Hel+0.095
uttering+0.094
Neg logit contributions
oscope-0.098
 x-0.095
 His-0.095
 magnet-0.093
omm-0.093
Token the
Token ID262
Feature activation+7.13e-01

Pos logit contributions
Fre+0.259
bel+0.257
Hel+0.245
Kay+0.241
ora+0.238
Neg logit contributions
omm-0.226
 x-0.220
 His-0.220
oscope-0.220
 magnet-0.215
Token rh
Token ID9529
Feature activation-4.48e+00

Pos logit contributions
Fre+0.061
bel+0.058
Hel+0.057
Kay+0.056
 perspective+0.055
Neg logit contributions
omm-0.036
 His-0.035
oll-0.034
oscope-0.033
 knee-0.032
Tokenin
Token ID259
Feature activation+1.26e+00

Pos logit contributions
omm+0.351
 x+0.344
 His+0.344
oscope+0.344
 magnet+0.338
Neg logit contributions
Fre-0.262
bel-0.258
Hel-0.244
Kay-0.238
ora-0.235
Tokenoc
Token ID420
Feature activation-4.88e+00

Pos logit contributions
bel+0.112
Fre+0.107
ies+0.102
Hel+0.102
ora+0.100
Neg logit contributions
omm-0.062
 seventy-0.054
 boo-0.052
 x-0.052
 His-0.051
Tokeneros
Token ID27498
Feature activation-1.43e+00

Pos logit contributions
 His+0.216
 DVD+0.209
 Tomorrow+0.203
 microwave+0.202
 x+0.200
Neg logit contributions
Fre-0.428
bel-0.424
ora-0.403
Hel-0.393
Kay-0.389
Token
Token ID198
Feature activation+2.32e-01

Pos logit contributions
Fre+0.015
Hel+0.015
bel+0.014
Kay+0.014
J+0.014
Neg logit contributions
oscope-0.016
omm-0.016
 x-0.015
 magnet-0.015
 seventy-0.015
TokenTim
Token ID14967
Feature activation+3.14e-01

Pos logit contributions
Fre+0.013
bel+0.013
Hel+0.012
Kay+0.012
ora+0.012
Neg logit contributions
omm-0.018
 His-0.018
 x-0.018
oscope-0.018
 magnet-0.018
Tokenmy
Token ID1820
Feature activation-3.91e+00

Pos logit contributions
Fre+0.028
bel+0.028
Hel+0.027
ora+0.027
Kay+0.027
Neg logit contributions
omm-0.014
 x-0.014
 DVD-0.014
 magnet-0.013
 His-0.013
Token smiled
Token ID13541
Feature activation+1.28e+00

Pos logit contributions
omm+0.322
 x+0.320
 DVD+0.316
 magnet+0.311
oscope+0.310
Neg logit contributions
Fre-0.205
bel-0.203
Hel-0.198
Kay-0.193
ora-0.191
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
bel+0.079
Hel+0.075
Fre+0.074
uttering+0.073
ora+0.072
Neg logit contributions
oscope-0.096
 x-0.096
omm-0.094
penter-0.090
 microwave-0.090
Token they
Token ID484
Feature activation+3.61e+00

Pos logit contributions
omm+0.083
 x+0.081
oscope+0.081
 magnet+0.079
 His+0.079
Neg logit contributions
Fre-0.064
bel-0.064
Hel-0.061
Kay-0.060
ora-0.059
Token continued
Token ID3767
Feature activation-8.75e-01

Pos logit contributions
Fre+0.271
bel+0.266
Kay+0.246
Hel+0.246
ora+0.245
Neg logit contributions
 His-0.220
oscope-0.219
omm-0.216
 magnet-0.213
penter-0.213
Token to
Token ID284
Feature activation+2.35e+00

Pos logit contributions
 His+0.062
 x+0.058
 DVD+0.058
oscope+0.057
 apple+0.057
Neg logit contributions
Fre-0.061
ora-0.058
bel-0.057
Kay-0.057
Hel-0.055
Token play
Token ID711
Feature activation-2.55e-01

Pos logit contributions
bel+0.171
Fre+0.170
Hel+0.160
Kay+0.158
ora+0.158
Neg logit contributions
 His-0.148
 x-0.148
oscope-0.148
omm-0.145
 magnet-0.145
Token in
Token ID287
Feature activation+1.15e+00

Pos logit contributions
 knee+0.014
 x+0.013
 His+0.013
 apple+0.013
 DVD+0.013
Neg logit contributions
Fre-0.023
Kay-0.021
bel-0.021
ora-0.020
Hel-0.020
Token the
Token ID262
Feature activation+5.75e-01

Pos logit contributions
Fre+0.098
bel+0.097
Hel+0.094
ora+0.094
Ag+0.093
Neg logit contributions
 x-0.060
 His-0.059
 magnet-0.058
 DVD-0.057
 apple-0.056
TokenThe
Token ID464
Feature activation+1.80e+00

Pos logit contributions
bel+0.017
Fre+0.017
Hel+0.016
Kay+0.016
ora+0.016
Neg logit contributions
 His-0.020
omm-0.020
oscope-0.019
 x-0.019
penter-0.019
Token next
Token ID1306
Feature activation-3.35e+00

Pos logit contributions
bel+0.124
Fre+0.123
Hel+0.117
Kay+0.114
ora+0.114
Neg logit contributions
omm-0.122
 His-0.121
 DVD-0.115
oscope-0.115
icle-0.112
Token day
Token ID1110
Feature activation-1.92e+00

Pos logit contributions
omm+0.213
 His+0.210
 x+0.210
oscope+0.208
 magnet+0.206
Neg logit contributions
Fre-0.247
bel-0.243
Hel-0.232
Kay-0.229
ora-0.225
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
oscope+0.150
omm+0.150
 x+0.146
 DVD+0.142
penter+0.142
Neg logit contributions
Fre-0.111
bel-0.109
Hel-0.105
Kay-0.101
ora-0.101
Token the
Token ID262
Feature activation+4.45e-01

Pos logit contributions
Fre+0.097
bel+0.096
Hel+0.091
Kay+0.088
ora+0.087
Neg logit contributions
omm-0.107
oscope-0.105
 x-0.103
 His-0.100
 DVD-0.098
Token man
Token ID582
Feature activation+2.58e+00

Pos logit contributions
bel+0.033
Fre+0.032
Hel+0.031
Kay+0.030
ora+0.030
Neg logit contributions
omm-0.029
 His-0.028
oscope-0.027
 DVD-0.027
 x-0.027
Token saw
Token ID2497
Feature activation+5.36e+00

Pos logit contributions
bel+0.149
Fre+0.144
Hel+0.135
ora+0.134
ies+0.132
Neg logit contributions
omm-0.212
 DVD-0.197
 x-0.196
 His-0.193
icle-0.191
Token the
Token ID262
Feature activation+4.20e-01

Pos logit contributions
Fre+0.296
bel+0.279
Hel+0.271
J+0.266
Kay+0.255
Neg logit contributions
oscope-0.446
omm-0.444
 DVD-0.415
 seventy-0.409
oll-0.409
Token same
Token ID976
Feature activation+2.75e+00

Pos logit contributions
Fre+0.034
bel+0.034
Hel+0.032
Kay+0.032
ora+0.032
Neg logit contributions
omm-0.023
 x-0.023
oscope-0.023
 His-0.023
 magnet-0.022
Token bird
Token ID6512
Feature activation-4.20e-01

Pos logit contributions
Fre+0.222
bel+0.219
Hel+0.211
Kay+0.207
Ag+0.204
Neg logit contributions
omm-0.155
 x-0.154
oscope-0.152
 magnet-0.150
 His-0.150
Token again
Token ID757
Feature activation-3.18e+00

Pos logit contributions
omm+0.029
 His+0.026
 x+0.026
 microwave+0.025
icle+0.025
Neg logit contributions
bel-0.030
Fre-0.028
ies-0.028
Hel-0.028
uttering-0.028
Token happy
Token ID3772
Feature activation-6.80e-01

Pos logit contributions
Fre+0.118
bel+0.117
Hel+0.111
Kay+0.109
ora+0.108
Neg logit contributions
omm-0.108
 x-0.105
oscope-0.105
 His-0.105
 magnet-0.103
Token to
Token ID284
Feature activation+2.48e+00

Pos logit contributions
 His+0.034
 knee+0.033
omm+0.032
 apple+0.031
 boo+0.030
Neg logit contributions
Fre-0.061
Hel-0.057
bel-0.056
Jud-0.055
Kay-0.055
Token remember
Token ID3505
Feature activation-4.44e-01

Pos logit contributions
Fre+0.188
bel+0.184
Hel+0.178
Kay+0.176
ora+0.174
Neg logit contributions
omm-0.152
oscope-0.147
 x-0.146
 His-0.145
 magnet-0.144
Token something
Token ID1223
Feature activation+4.64e+00

Pos logit contributions
 His+0.025
omm+0.025
 x+0.024
oscope+0.024
 magnet+0.023
Neg logit contributions
bel-0.036
Fre-0.035
Hel-0.034
ora-0.033
uttering-0.032
Token so
Token ID523
Feature activation-1.24e+00

Pos logit contributions
bel+0.359
Fre+0.350
Hel+0.341
Kay+0.325
uttering+0.323
Neg logit contributions
 His-0.279
omm-0.271
 x-0.271
 magnet-0.266
 apple-0.262
Token funny
Token ID8258
Feature activation+3.10e+00

Pos logit contributions
 His+0.078
 x+0.073
omm+0.072
 magnet+0.071
oll+0.070
Neg logit contributions
Fre-0.094
bel-0.094
Hel-0.090
Kay-0.085
Jay-0.085
Token.
Token ID13
Feature activation+6.02e-01

Pos logit contributions
Fre+0.231
bel+0.226
Hel+0.222
Kay+0.215
ora+0.210
Neg logit contributions
 His-0.190
omm-0.189
oscope-0.186
 x-0.186
 magnet-0.184
Token<|endoftext|>
Token ID50256
Feature activation-4.41e+00

Pos logit contributions
Fre+0.043
Kay+0.041
Hel+0.040
bel+0.040
Ag+0.039
Neg logit contributions
oscope-0.041
 x-0.041
 magnet-0.040
penter-0.039
 boo-0.039
TokenOnce
Token ID7454
Feature activation-2.05e+00

Pos logit contributions
 His+0.340
omm+0.333
 x+0.328
 magnet+0.325
 DVD+0.325
Neg logit contributions
bel-0.269
Fre-0.260
Hel-0.237
Kay-0.236
uttering-0.233
Token upon
Token ID2402
Feature activation+6.10e+00

Pos logit contributions
 His+0.192
omm+0.189
 magnet+0.182
 x+0.181
 microwave+0.177
Neg logit contributions
Fre-0.091
bel-0.088
Hel-0.082
Ag-0.077
Kay-0.076
Token a
Token ID257
Feature activation-7.41e-01

Pos logit contributions
Fre+0.465
bel+0.461
Hel+0.444
Kay+0.435
ora+0.428
Neg logit contributions
omm-0.369
oscope-0.354
 x-0.351
 His-0.349
 magnet-0.346
Token through
Token ID832
Feature activation+4.14e+00

Pos logit contributions
Fre+0.240
bel+0.233
Hel+0.225
Kay+0.222
ora+0.221
Neg logit contributions
 x-0.190
omm-0.190
oscope-0.189
 His-0.189
 magnet-0.184
Token the
Token ID262
Feature activation+4.44e-01

Pos logit contributions
Fre+0.371
bel+0.355
ora+0.353
Ag+0.350
Hel+0.349
Neg logit contributions
 His-0.210
 x-0.208
oscope-0.195
 magnet-0.195
 bone-0.194
Token trees
Token ID7150
Feature activation+4.88e+00

Pos logit contributions
Fre+0.030
bel+0.030
Hel+0.028
Kay+0.028
ora+0.028
Neg logit contributions
omm-0.030
 x-0.030
oscope-0.030
 His-0.030
 magnet-0.029
Token.
Token ID13
Feature activation+6.41e-01

Pos logit contributions
Fre+0.326
bel+0.322
Hel+0.307
Kay+0.301
ora+0.297
Neg logit contributions
omm-0.340
 x-0.332
 His-0.331
oscope-0.331
 magnet-0.325
Token What
Token ID1867
Feature activation+8.25e-01

Pos logit contributions
bel+0.064
Fre+0.063
Hel+0.062
Kay+0.061
Jay+0.060
Neg logit contributions
omm-0.023
oscope-0.022
 boo-0.021
 seventy-0.021
 x-0.021
Token he
Token ID339
Feature activation-2.08e+00

Pos logit contributions
Fre+0.060
bel+0.059
Hel+0.056
ora+0.056
Kay+0.055
Neg logit contributions
 His-0.054
 x-0.053
omm-0.052
 magnet-0.052
oscope-0.051
Token saw
Token ID2497
Feature activation+5.34e+00

Pos logit contributions
omm+0.147
 x+0.146
 His+0.145
oscope+0.145
 magnet+0.143
Neg logit contributions
Fre-0.137
bel-0.135
Hel-0.127
Kay-0.126
ora-0.124
Token was
Token ID373
Feature activation-4.68e-01

Pos logit contributions
bel+0.344
Fre+0.344
Hel+0.328
Kay+0.319
ora+0.312
Neg logit contributions
omm-0.384
oscope-0.366
 His-0.364
 x-0.362
 magnet-0.360
Token a
Token ID257
Feature activation-8.45e-01

Pos logit contributions
omm+0.031
oscope+0.030
 His+0.029
 x+0.029
 magnet+0.029
Neg logit contributions
Fre-0.034
bel-0.033
Hel-0.032
Kay-0.031
Ag-0.030
Token great
Token ID1049
Feature activation+2.69e+00

Pos logit contributions
omm+0.052
 x+0.052
 magnet+0.051
oscope+0.051
 His+0.051
Neg logit contributions
Fre-0.062
bel-0.062
Hel-0.059
Kay-0.058
ora-0.058
Token big
Token ID1263
Feature activation+1.97e+00

Pos logit contributions
Fre+0.201
bel+0.200
Hel+0.190
Kay+0.187
ora+0.185
Neg logit contributions
omm-0.167
 x-0.165
oscope-0.163
 His-0.163
 magnet-0.161
Token lonely
Token ID21757
Feature activation+4.70e-01

Pos logit contributions
Fre+0.400
bel+0.396
Hel+0.379
Kay+0.371
ora+0.369
Neg logit contributions
omm-0.314
oscope-0.304
 x-0.304
 His-0.303
 magnet-0.297
Token because
Token ID780
Feature activation-1.13e+00

Pos logit contributions
Fre+0.038
bel+0.037
Hel+0.036
Kay+0.035
ora+0.035
Neg logit contributions
omm-0.026
oscope-0.026
 x-0.026
 His-0.026
 magnet-0.026
Token it
Token ID340
Feature activation-3.55e-01

Pos logit contributions
omm+0.081
oscope+0.079
 x+0.078
 microwave+0.075
penter+0.075
Neg logit contributions
Fre-0.073
bel-0.072
Hel-0.069
Kay-0.066
ora-0.065
Token didn
Token ID1422
Feature activation-6.65e-01

Pos logit contributions
 His+0.022
oscope+0.021
omm+0.021
 magnet+0.021
 x+0.021
Neg logit contributions
Fre-0.027
bel-0.026
Kay-0.025
Hel-0.025
uttering-0.025
Token't
Token ID470
Feature activation-1.06e+00

Pos logit contributions
oscope+0.051
 microwave+0.050
 trailer+0.050
urger+0.049
 magnet+0.049
Neg logit contributions
Fre-0.039
bel-0.037
Hel-0.037
Kay-0.036
Ag-0.035
Token have
Token ID423
Feature activation-2.40e+00

Pos logit contributions
oscope+0.056
 x+0.056
 His+0.056
 magnet+0.054
 boo+0.054
Neg logit contributions
Fre-0.089
bel-0.089
Hel-0.083
Kay-0.082
uttering-0.082
Token any
Token ID597
Feature activation+7.95e-03

Pos logit contributions
 His+0.137
 x+0.134
omm+0.132
oscope+0.131
 boo+0.129
Neg logit contributions
bel-0.193
Fre-0.192
Hel-0.180
Kay-0.179
ora-0.178
Token bunny
Token ID44915
Feature activation-4.96e+00

Pos logit contributions
Fre+0.000
bel+0.000
Hel+0.000
Kay+0.000
ora+0.000
Neg logit contributions
 His-0.001
omm-0.001
 x-0.001
oscope-0.001
 magnet-0.001
Token friends
Token ID2460
Feature activation+3.93e+00

Pos logit contributions
oscope+0.317
 His+0.316
 magnet+0.308
 x+0.308
 bone+0.308
Neg logit contributions
Fre-0.362
bel-0.354
Kay-0.338
Hel-0.337
Jay-0.327
Token to
Token ID284
Feature activation+2.02e+00

Pos logit contributions
Fre+0.298
bel+0.292
Hel+0.278
Kay+0.274
ora+0.270
Neg logit contributions
oscope-0.239
 His-0.238
 magnet-0.233
 bone-0.232
 x-0.232
Token play
Token ID711
Feature activation-6.29e-01

Pos logit contributions
Fre+0.164
bel+0.162
Hel+0.153
uttering+0.151
Kay+0.150
Neg logit contributions
 His-0.115
oscope-0.112
 x-0.112
 magnet-0.110
 boo-0.110
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
Fre+0.014
bel+0.014
Hel+0.014
Jay+0.013
Kay+0.013
Neg logit contributions
 His-0.008
omm-0.008
 apple-0.008
 x-0.007
oscope-0.007
Token but
Token ID475
Feature activation+3.28e+00

Pos logit contributions
Fre+0.114
bel+0.114
Hel+0.107
Kay+0.107
Ag+0.105
Neg logit contributions
omm-0.120
 x-0.117
oscope-0.116
 magnet-0.113
 His-0.112
Token right
Token ID826
Feature activation-1.50e+00

Pos logit contributions
Fre+0.227
bel+0.220
Kay+0.213
Hel+0.212
ora+0.208
Neg logit contributions
omm-0.223
oscope-0.222
 x-0.221
 magnet-0.215
 DVD-0.215
Token as
Token ID355
Feature activation-3.35e+00

Pos logit contributions
 apple+0.069
omm+0.069
 magnet+0.068
 x+0.067
 His+0.067
Neg logit contributions
Fre-0.137
bel-0.136
Hel-0.133
Ag-0.129
Kay-0.126
Token she
Token ID673
Feature activation+3.42e+00

Pos logit contributions
 x+0.246
omm+0.246
 magnet+0.243
 His+0.243
 apple+0.242
Neg logit contributions
bel-0.207
Fre-0.202
Hel-0.195
uttering-0.187
ora-0.187
Token was
Token ID373
Feature activation-3.25e-01

Pos logit contributions
Fre+0.230
bel+0.227
ora+0.212
Hel+0.211
Kay+0.208
Neg logit contributions
omm-0.235
 x-0.229
 DVD-0.225
oscope-0.225
 His-0.225
Token about
Token ID546
Feature activation-4.43e+00

Pos logit contributions
 magnet+0.022
omm+0.022
oscope+0.021
 x+0.021
 His+0.021
Neg logit contributions
Fre-0.023
bel-0.021
Kay-0.020
uttering-0.020
ora-0.020
Token to
Token ID284
Feature activation+2.44e+00

Pos logit contributions
 apple+0.279
 His+0.258
 bone+0.257
 magnet+0.256
 x+0.251
Neg logit contributions
Fre-0.334
bel-0.331
Hel-0.328
ora-0.324
Kay-0.320
Token take
Token ID1011
Feature activation-3.40e+00

Pos logit contributions
Fre+0.177
ora+0.166
Kay+0.164
bel+0.162
 notice+0.161
Neg logit contributions
omm-0.158
 DVD-0.151
oscope-0.148
penter-0.145
 His-0.144
Token a
Token ID257
Feature activation-6.76e-01

Pos logit contributions
omm+0.217
oscope+0.211
 His+0.210
 x+0.209
 magnet+0.206
Neg logit contributions
Fre-0.249
bel-0.245
Hel-0.235
Kay-0.231
ora-0.228
Token bite
Token ID13197
Feature activation-5.33e+00

Pos logit contributions
omm+0.038
 His+0.036
oscope+0.036
 x+0.036
penter+0.035
Neg logit contributions
bel-0.055
Fre-0.055
Hel-0.052
Kay-0.051
ora-0.051
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
Fre+0.081
bel+0.075
Hel+0.074
Kay+0.072
J+0.071
Neg logit contributions
oscope-0.082
 boo-0.082
 magnet-0.080
 x-0.078
 seventy-0.077
Token thank
Token ID5875
Feature activation-4.97e+00

Pos logit contributions
Fre+0.094
Kay+0.086
Hel+0.083
Jay+0.081
Ag+0.081
Neg logit contributions
oscope-0.093
 DVD-0.088
omm-0.085
 microwave-0.082
 His-0.081
Token you
Token ID345
Feature activation+1.20e+00

Pos logit contributions
omm+0.285
 x+0.285
 His+0.280
oscope+0.276
 magnet+0.267
Neg logit contributions
bel-0.386
Fre-0.382
Hel-0.361
Kay-0.359
uttering-0.358
Token!
Token ID0
Feature activation+1.80e+00

Pos logit contributions
Fre+0.091
bel+0.088
Hel+0.085
Kay+0.083
ora+0.083
Neg logit contributions
 magnet-0.072
oscope-0.072
omm-0.071
 x-0.071
 boo-0.071
Token You
Token ID921
Feature activation-3.37e+00

Pos logit contributions
Hel+0.144
Fre+0.143
Ag+0.139
Kay+0.135
bel+0.130
Neg logit contributions
 microwave-0.103
 x-0.102
 boo-0.100
oscope-0.099
 apple-0.098
Token have
Token ID423
Feature activation-2.25e+00

Pos logit contributions
omm+0.230
 x+0.225
oscope+0.225
 His+0.224
 magnet+0.221
Neg logit contributions
Fre-0.231
bel-0.228
Hel-0.217
Kay-0.213
ora-0.211
Token saved
Token ID7448
Feature activation-4.09e+00

Pos logit contributions
omm+0.146
 His+0.143
 x+0.143
oscope+0.143
 magnet+0.140
Neg logit contributions
Fre-0.162
bel-0.160
Hel-0.152
Kay-0.150
ora-0.149
Token our
Token ID674
Feature activation-1.63e+00

Pos logit contributions
omm+0.247
 His+0.241
oscope+0.240
 x+0.232
 magnet+0.232
Neg logit contributions
Fre-0.316
bel-0.308
Hel-0.299
Kay-0.294
ora-0.290
Token town
Token ID3240
Feature activation+7.85e+00

Pos logit contributions
 magnet+0.102
 x+0.101
 apple+0.101
 boo+0.097
 bone+0.096
Neg logit contributions
Fre-0.119
bel-0.119
J-0.112
Kay-0.111
Jay-0.111
Token!"
Token ID2474
Feature activation-6.48e-01

Pos logit contributions
Fre+0.604
bel+0.601
Hel+0.572
Kay+0.562
ora+0.557
Neg logit contributions
omm-0.468
 x-0.454
 His-0.454
oscope-0.451
 magnet-0.442
Token
Token ID198
Feature activation+1.00e-02

Pos logit contributions
omm+0.039
 x+0.039
oscope+0.037
 boo+0.035
 seventy+0.035
Neg logit contributions
bel-0.052
Hel-0.050
Fre-0.050
Kay-0.048
ora-0.047
Token He
Token ID679
Feature activation-2.94e+00

Pos logit contributions
Fre+0.021
bel+0.020
ora+0.020
Kay+0.020
Hel+0.019
Neg logit contributions
oscope-0.011
 x-0.010
penter-0.010
omm-0.010
 magnet-0.010
Token thinks
Token ID6834
Feature activation+2.52e+00

Pos logit contributions
omm+0.184
oscope+0.179
 x+0.179
 His+0.179
 magnet+0.174
Neg logit contributions
Fre-0.216
bel-0.213
Hel-0.205
ora-0.200
Kay-0.200
Token they
Token ID484
Feature activation+3.69e+00

Pos logit contributions
Fre+0.151
bel+0.149
Hel+0.137
Kay+0.136
J+0.135
Neg logit contributions
oscope-0.191
omm-0.185
penter-0.182
 x-0.177
 magnet-0.177
Token are
Token ID389
Feature activation+3.02e+00

Pos logit contributions
Fre+0.301
bel+0.294
Kay+0.282
ora+0.277
uttering+0.277
Neg logit contributions
oscope-0.215
 magnet-0.215
 x-0.214
 His-0.203
 boo-0.202
Token more
Token ID517
Feature activation-2.16e+00

Pos logit contributions
Fre+0.179
bel+0.176
Hel+0.167
Kay+0.163
ora+0.160
Neg logit contributions
omm-0.234
oscope-0.230
 His-0.226
 x-0.225
 magnet-0.220
Token cool
Token ID3608
Feature activation+8.25e-01

Pos logit contributions
 x+0.150
 teaches+0.147
 magnet+0.146
oll+0.146
 His+0.146
Neg logit contributions
bel-0.143
Fre-0.143
Hel-0.135
Kay-0.130
ora-0.130
Token and
Token ID290
Feature activation-1.20e+00

Pos logit contributions
Fre+0.081
Kay+0.077
Hel+0.076
Jud+0.074
bel+0.073
Neg logit contributions
 magnet-0.036
 bone-0.036
 apple-0.036
 knee-0.035
oscope-0.035
Token calm
Token ID9480
Feature activation+2.97e+00

Pos logit contributions
 His+0.092
 x+0.091
 magnet+0.091
oscope+0.089
omm+0.089
Neg logit contributions
Fre-0.074
bel-0.069
Hel-0.067
Kay-0.066
ora-0.065
Token.
Token ID13
Feature activation+1.96e-01

Pos logit contributions
Fre+0.264
Hel+0.245
Kay+0.243
bel+0.242
Jud+0.239
Neg logit contributions
 His-0.160
 knee-0.152
 apple-0.149
 bone-0.145
 magnet-0.145
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
Fre+0.013
bel+0.013
ora+0.012
Kay+0.012
Hel+0.012
Neg logit contributions
oscope-0.014
 x-0.014
omm-0.013
penter-0.013
 magnet-0.013
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
Fre+0.009
bel+0.008
Hel+0.008
Kay+0.008
ora+0.008
Neg logit contributions
omm-0.007
oscope-0.007
 x-0.007
 His-0.007
 magnet-0.007
Token are
Token ID389
Feature activation+2.95e+00

Pos logit contributions
omm+0.267
 His+0.252
oscope+0.249
 x+0.247
 DVD+0.246
Neg logit contributions
bel-0.190
Fre-0.185
Hel-0.179
Kay-0.168
ora-0.167
Token a
Token ID257
Feature activation-7.76e-01

Pos logit contributions
Fre+0.172
Hel+0.170
bel+0.170
Ag+0.160
J+0.160
Neg logit contributions
omm-0.214
 His-0.211
oscope-0.209
 microwave-0.205
icle-0.202
Token bad
Token ID2089
Feature activation-3.18e-01

Pos logit contributions
omm+0.044
 His+0.043
oscope+0.043
 x+0.043
 magnet+0.042
Neg logit contributions
Fre-0.062
bel-0.061
Hel-0.059
Kay-0.058
ora-0.057
Token dragon
Token ID10441
Feature activation-1.77e+00

Pos logit contributions
omm+0.016
oscope+0.016
 His+0.015
 x+0.015
 magnet+0.015
Neg logit contributions
Fre-0.027
bel-0.027
Hel-0.026
ora-0.026
Kay-0.025
Token.
Token ID13
Feature activation+3.44e-01

Pos logit contributions
omm+0.134
 His+0.130
 x+0.127
 magnet+0.124
 microwave+0.124
Neg logit contributions
bel-0.108
Fre-0.106
ora-0.098
Hel-0.098
Kay-0.094
Token Go
Token ID1514
Feature activation-7.17e+00

Pos logit contributions
Hel+0.026
Fre+0.025
bel+0.024
Kay+0.024
ora+0.023
Neg logit contributions
 motorcycle-0.019
 magnet-0.019
 trailer-0.019
 microwave-0.018
 pistol-0.018
Token away
Token ID1497
Feature activation+1.62e+00

Pos logit contributions
omm+0.480
oscope+0.474
 x+0.473
 His+0.472
 magnet+0.463
Neg logit contributions
Fre-0.500
bel-0.491
Hel-0.470
Kay-0.461
ora-0.455
Token."
Token ID526
Feature activation-2.89e+00

Pos logit contributions
Fre+0.151
Hel+0.147
Ag+0.145
Kay+0.145
bel+0.145
Neg logit contributions
 x-0.068
omm-0.067
 magnet-0.066
 bone-0.066
oscope-0.065
Token Mia
Token ID32189
Feature activation+3.46e+00

Pos logit contributions
 His+0.142
omm+0.117
 He+0.108
 microwave+0.105
 x+0.103
Neg logit contributions
Fre-0.287
uttering-0.277
bel-0.273
Hel-0.272
Ag-0.266
Token said
Token ID531
Feature activation+9.52e-01

Pos logit contributions
bel+0.195
Fre+0.193
Hel+0.183
Kay+0.179
ora+0.178
Neg logit contributions
omm-0.276
 x-0.266
 magnet-0.264
oscope-0.263
 His-0.262
Token,
Token ID11
Feature activation+1.45e+00

Pos logit contributions
Fre+0.069
bel+0.067
Hel+0.065
Kay+0.063
ora+0.063
Neg logit contributions
omm-0.063
 His-0.062
 x-0.061
oscope-0.060
 magnet-0.060
Token the
Token ID262
Feature activation+4.63e-01

Pos logit contributions
Fre+0.120
bel+0.119
Hel+0.114
Kay+0.112
Ag+0.109
Neg logit contributions
omm-0.101
oscope-0.097
 x-0.094
 His-0.091
 magnet-0.089
Token hurt
Token ID5938
Feature activation+2.36e-01

Pos logit contributions
Fre+0.034
bel+0.033
Hel+0.032
Kay+0.031
ora+0.031
Neg logit contributions
omm-0.029
 His-0.029
oscope-0.028
 x-0.028
 DVD-0.027
Token bird
Token ID6512
Feature activation-4.28e-01

Pos logit contributions
bel+0.018
Fre+0.018
Hel+0.017
Kay+0.017
ora+0.017
Neg logit contributions
omm-0.014
 x-0.014
oscope-0.014
 His-0.014
 DVD-0.013
Token was
Token ID373
Feature activation-4.42e-01

Pos logit contributions
omm+0.027
oscope+0.027
 His+0.027
 x+0.026
 magnet+0.026
Neg logit contributions
Fre-0.031
bel-0.031
Hel-0.029
Kay-0.029
ora-0.029
Token all
Token ID477
Feature activation+6.95e-01

Pos logit contributions
omm+0.028
 x+0.027
oscope+0.027
 His+0.027
 magnet+0.026
Neg logit contributions
Fre-0.033
bel-0.032
Hel-0.031
Kay-0.031
uttering-0.030
Token better
Token ID1365
Feature activation-7.24e+00

Pos logit contributions
bel+0.054
Fre+0.052
Hel+0.050
Kay+0.049
Ag+0.048
Neg logit contributions
omm-0.043
 x-0.041
oscope-0.040
 His-0.040
 magnet-0.038
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
 knee+0.355
 bone+0.334
 apple+0.319
oscope+0.316
 arm+0.310
Neg logit contributions
Fre-0.649
Kay-0.616
Jay-0.573
Hel-0.571
Ag-0.564
Token could
Token ID714
Feature activation-7.23e-01

Pos logit contributions
omm+0.073
 x+0.071
oscope+0.071
 His+0.070
 magnet+0.069
Neg logit contributions
bel-0.071
Fre-0.071
Hel-0.067
Kay-0.066
ora-0.065
Token fly
Token ID6129
Feature activation-4.71e-01

Pos logit contributions
omm+0.046
 x+0.045
oscope+0.044
 His+0.044
 magnet+0.043
Neg logit contributions
Fre-0.053
bel-0.053
Hel-0.050
Kay-0.049
ora-0.049
Token again
Token ID757
Feature activation-3.13e+00

Pos logit contributions
 bone+0.020
 magnet+0.019
oscope+0.019
 His+0.019
 apple+0.019
Neg logit contributions
Fre-0.045
Kay-0.041
uttering-0.041
ora-0.041
Jay-0.040
Token.
Token ID13
Feature activation+7.41e-01

Pos logit contributions
 His+0.152
 bone+0.149
oscope+0.148
 apple+0.147
 magnet+0.146
Neg logit contributions
Fre-0.274
bel-0.263
Kay-0.262
Ag-0.256
ora-0.255
Token of
Token ID286
Feature activation-3.47e+00

Pos logit contributions
Fre+0.270
bel+0.268
Hel+0.257
Kay+0.253
ora+0.250
Neg logit contributions
omm-0.191
 x-0.186
oscope-0.185
 His-0.185
 magnet-0.181
Token things
Token ID1243
Feature activation+7.21e+00

Pos logit contributions
 x+0.262
 His+0.253
oscope+0.251
 magnet+0.249
omm+0.248
Neg logit contributions
Fre-0.230
bel-0.222
Hel-0.213
Kay-0.211
ora-0.210
Token to
Token ID284
Feature activation+2.25e+00

Pos logit contributions
Fre+0.484
bel+0.468
Hel+0.453
Kay+0.445
ora+0.440
Neg logit contributions
omm-0.499
 His-0.499
 x-0.499
oscope-0.497
 magnet-0.491
Token do
Token ID466
Feature activation+2.11e+00

Pos logit contributions
Fre+0.169
bel+0.165
Hel+0.159
Kay+0.155
Ag+0.153
Neg logit contributions
 x-0.144
 His-0.143
oscope-0.139
omm-0.139
 magnet-0.138
Token.
Token ID13
Feature activation+4.88e-01

Pos logit contributions
Fre+0.184
ora+0.167
Kay+0.166
Hel+0.166
Jud+0.162
Neg logit contributions
 His-0.120
oscope-0.114
 x-0.114
 magnet-0.109
 teaches-0.108
Token Every
Token ID3887
Feature activation-6.91e+00

Pos logit contributions
Kay+0.034
J+0.033
Hel+0.031
Fre+0.031
Ag+0.031
Neg logit contributions
oscope-0.032
 x-0.029
 boo-0.029
oll-0.028
penter-0.028
Token day
Token ID1110
Feature activation-2.09e+00

Pos logit contributions
omm+0.623
 seventy+0.593
penter+0.575
 x+0.571
oscope+0.569
Neg logit contributions
Fre-0.308
bel-0.300
Hel-0.285
ora-0.281
 perspective-0.277
Token he
Token ID339
Feature activation-2.24e+00

Pos logit contributions
omm+0.157
oscope+0.156
 x+0.155
 magnet+0.150
penter+0.148
Neg logit contributions
bel-0.126
Fre-0.124
Hel-0.118
Kay-0.117
ora-0.114
Token looked
Token ID3114
Feature activation+3.43e+00

Pos logit contributions
omm+0.166
oscope+0.164
 x+0.164
 His+0.164
 magnet+0.161
Neg logit contributions
Fre-0.139
bel-0.137
Hel-0.129
Kay-0.128
ora-0.125
Token at
Token ID379
Feature activation-5.45e-01

Pos logit contributions
bel+0.261
Fre+0.259
Hel+0.248
ora+0.240
Kay+0.240
Neg logit contributions
omm-0.211
oscope-0.202
 His-0.199
 x-0.198
 magnet-0.193
Token the
Token ID262
Feature activation+3.26e-01

Pos logit contributions
omm+0.037
oscope+0.037
penter+0.034
 seventy+0.034
 x+0.034
Neg logit contributions
Fre-0.036
bel-0.036
Kay-0.035
Hel-0.034
Jay-0.033
Token Sally
Token ID25737
Feature activation+2.09e+00

Pos logit contributions
 x+0.206
oscope+0.202
omm+0.195
 DVD+0.194
 seventy+0.191
Neg logit contributions
Fre-0.237
bel-0.232
Kay-0.229
Hel-0.227
ora-0.225
Token said
Token ID531
Feature activation+5.14e-01

Pos logit contributions
Fre+0.119
bel+0.114
Hel+0.114
Kay+0.111
ora+0.111
Neg logit contributions
 x-0.161
omm-0.161
 DVD-0.160
oscope-0.159
 magnet-0.155
Token,
Token ID11
Feature activation+9.74e-01

Pos logit contributions
bel+0.029
Fre+0.029
Hel+0.027
Kay+0.027
uttering+0.026
Neg logit contributions
 x-0.040
omm-0.040
oscope-0.039
penter-0.038
 microwave-0.038
Token "
Token ID366
Feature activation-3.60e+00

Pos logit contributions
bel+0.074
Fre+0.073
Hel+0.073
Kay+0.069
Ag+0.069
Neg logit contributions
 x-0.057
omm-0.055
oll-0.054
oscope-0.053
 magnet-0.053
TokenLet
Token ID5756
Feature activation+9.44e-01

Pos logit contributions
 His+0.243
omm+0.238
oscope+0.236
penter+0.232
 boo+0.231
Neg logit contributions
bel-0.235
Fre-0.234
uttering-0.214
ora-0.213
ies-0.212
Token's
Token ID338
Feature activation-7.61e+00

Pos logit contributions
Hel+0.088
Fre+0.088
Kay+0.087
Ag+0.086
ora+0.085
Neg logit contributions
oscope-0.038
 DVD-0.037
 seventy-0.037
omm-0.036
penter-0.035
Token discuss
Token ID2112
Feature activation-3.51e-01

Pos logit contributions
omm+0.451
 His+0.430
oscope+0.424
 x+0.420
 DVD+0.417
Neg logit contributions
Fre-0.591
bel-0.577
Hel-0.560
Kay-0.553
ora-0.550
Token what
Token ID644
Feature activation-4.73e-01

Pos logit contributions
omm+0.022
oscope+0.021
 His+0.021
 x+0.020
 magnet+0.020
Neg logit contributions
Fre-0.026
bel-0.026
Hel-0.025
Kay-0.025
ora-0.024
Token we
Token ID356
Feature activation-5.88e-01

Pos logit contributions
omm+0.032
 x+0.032
 His+0.031
oscope+0.031
 magnet+0.031
Neg logit contributions
Fre-0.032
bel-0.032
Hel-0.031
Kay-0.030
ora-0.030
Token can
Token ID460
Feature activation-1.43e+00

Pos logit contributions
 x+0.034
oscope+0.034
 magnet+0.034
 His+0.033
 boo+0.033
Neg logit contributions
Fre-0.046
bel-0.046
Kay-0.044
Hel-0.043
ora-0.043
Token do
Token ID466
Feature activation+1.92e+00

Pos logit contributions
 x+0.083
omm+0.082
 His+0.082
oscope+0.081
 magnet+0.080
Neg logit contributions
Fre-0.113
bel-0.112
Hel-0.107
Kay-0.104
Ag-0.103
Token grass
Token ID8701
Feature activation-8.26e-01

Pos logit contributions
Fre+0.018
bel+0.018
Hel+0.017
ora+0.017
Kay+0.017
Neg logit contributions
omm-0.020
 His-0.020
oscope-0.019
penter-0.019
 x-0.019
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
omm+0.055
oscope+0.054
 His+0.054
 x+0.054
 magnet+0.053
Neg logit contributions
Fre-0.057
bel-0.057
Hel-0.054
Kay-0.053
Ag-0.052
Token It
Token ID632
Feature activation-2.79e+00

Pos logit contributions
Fre+0.013
Hel+0.013
Ag+0.013
Kay+0.013
bel+0.012
Neg logit contributions
oscope-0.015
 x-0.014
penter-0.014
icle-0.013
 seventy-0.013
Token was
Token ID373
Feature activation-5.08e-01

Pos logit contributions
omm+0.175
 His+0.173
 x+0.172
oscope+0.170
 magnet+0.169
Neg logit contributions
Fre-0.207
bel-0.205
Hel-0.195
Kay-0.191
ora-0.190
Token Lily
Token ID20037
Feature activation-2.27e+00

Pos logit contributions
oscope+0.035
omm+0.034
 His+0.033
penter+0.032
 DVD+0.032
Neg logit contributions
Fre-0.035
bel-0.034
Kay-0.032
Hel-0.032
Ag-0.032
Token's
Token ID338
Feature activation-7.12e+00

Pos logit contributions
omm+0.067
 His+0.062
 x+0.061
oscope+0.060
 magnet+0.058
Neg logit contributions
bel-0.243
Fre-0.241
Hel-0.235
Kay-0.233
ora-0.230
Token lost
Token ID2626
Feature activation+3.95e-01

Pos logit contributions
omm+0.417
oscope+0.410
 His+0.407
 x+0.399
penter+0.395
Neg logit contributions
bel-0.552
Fre-0.552
Hel-0.517
Kay-0.517
ora-0.513
Token toy
Token ID13373
Feature activation-5.67e+00

Pos logit contributions
bel+0.027
Fre+0.027
Hel+0.025
ora+0.025
Kay+0.025
Neg logit contributions
omm-0.027
 His-0.026
oscope-0.025
 x-0.025
penter-0.025
Token!
Token ID0
Feature activation+1.99e+00

Pos logit contributions
omm+0.445
 His+0.430
oscope+0.427
 x+0.422
 seventy+0.415
Neg logit contributions
bel-0.320
Fre-0.316
ora-0.295
Hel-0.293
ies-0.290
Token Lily
Token ID20037
Feature activation-2.28e+00

Pos logit contributions
Ag+0.122
Kay+0.119
J+0.119
Hel+0.119
Fre+0.113
Neg logit contributions
oscope-0.158
 x-0.148
icle-0.140
penter-0.139
urger-0.138
Token was
Token ID373
Feature activation-4.97e-01

Pos logit contributions
omm+0.165
 x+0.163
oscope+0.161
 DVD+0.160
icle+0.159
Neg logit contributions
Fre-0.144
bel-0.141
Hel-0.137
Kay-0.134
ora-0.131