TOP ACTIVATIONS
MAX = 30.694

and they said goodbye by giving each other a wink and
around Fre ck les , giving her hugs and pet ting
day and the trees were giving a pleasant shade . Mom
, pushing her away and giving Lily all the sunshine .
me , thank you for giving me such good advice !
king 's tum my by giving him too many ch ocol
that was very nice and giving . Later that
way of thanking yoga for giving them a fun time .
Lisa !" Dad said , giving her the doll and the
hungry . Would you mind giving me some of your ol
and keep wandering , never stopping for too long at one
the right thing instead of giving in to his greed .
these ." He said , giving them two life jackets .
that he was serious about giving the toy to Jim .
proud of himself for not giving up . Tiny knew that
store with the perfume , giving it a little hug .
Sam learned the lesson of giving , taking and the importance
brave , and strong without giving up . B inky realized
, even if it means giving up something you really want
, even if it means giving up something that you have

MIN ACTIVATIONS
MIN = -26.888

put . Su zie tried again and again ,
. Su zie smiled . She and her
ended ! Su zie was sad that she no
being fair . <|endoftext|> Su zie loved to swim . She
. Su zie was so disappointed and tried
too . Su zie loved the magazine so much
sky ? â asked Sunny . The girl
all along . <|endoftext|> Su zie was a 3 year old
said . Su zie 's dad smiled and said
Lily . And this is Sunny . We are happy to
From that day on , Sunny knew he could go on
" Mom ," said Sunny . " I was mean
â it said . Sunny looked up and saw a
they saw something terrible . Sunny was gone . Someone had
some food ." So , Betsy and the thief went on
time . Su zie loved her magazine and kept
the garden again to see Sunny . But they saw something
You sound very pretty ," Sunny said . "
, just like her . Betsy was sad that she had
to name the sun flower Sunny . The next

INTERVAL 16.299 - 30.694
CONTAINS 4.107%

, " Thank you , thank you . This is my
the market and saw it getting busy . People were bringing
car outside . He was pushing it around and crashing it
and said , " I love you both the same .
remembered to be careful and help others be safe too .

INTERVAL 1.903 - 16.299
CONTAINS 43.423%

my was mad . She was mad at Tom for not
pond . Molly had had so much fun that she didn
mom cook . She wanted to be a good chef when
bottle ! He was so excited . The spark le granted
never knew that notes could glow ! She smiled and decided

INTERVAL -12.492 - 1.903
CONTAINS 34.457%

now and have a hot chocolate !" M olly
hurt her mom 's feelings . Later that day
always follow . <|endoftext|> Once upon a time there was a
reliable , like an exit ! <|endoftext|> Once upon a time
their own cars and trucks . They are smaller and safer

INTERVAL -26.888 - -12.492
CONTAINS 18.013%

was a small girl and she was very eager . She
together . The two of them were the best of friends
wear a special veil . He went to the market with
excited . M andy opened the note and found five smooth
higher . But soon , it became too hard . Her
Token and
Token ID290
Feature activation+6.58e+00

Pos logit contributions
 reaching+0.065
 pointing+0.059
 giving+0.059
 understanding+0.059
 encouraging+0.058
Neg logit contributions
issy-0.106
ppy-0.096
uffy-0.092
ens-0.085
isk-0.084
Token they
Token ID484
Feature activation-1.12e+01

Pos logit contributions
 reaching+0.096
 giving+0.092
 helping+0.086
 understanding+0.086
 counting+0.086
Neg logit contributions
issy-0.181
ppy-0.163
uffy-0.153
azz-0.149
pillar-0.147
Token said
Token ID531
Feature activation+8.61e+00

Pos logit contributions
issy+0.284
ppy+0.247
uffy+0.239
azz+0.236
pillar+0.232
Neg logit contributions
 reaching-0.184
 giving-0.180
 helping-0.171
 understanding-0.170
 counting-0.167
Token goodbye
Token ID24829
Feature activation+1.09e+01

Pos logit contributions
 reaching+0.131
 giving+0.114
 counting+0.114
 insisting+0.110
 testing+0.110
Neg logit contributions
issy-0.213
ppy-0.197
uffy-0.190
azz-0.176
affe-0.175
Token by
Token ID416
Feature activation+6.77e+00

Pos logit contributions
 reaching+0.148
 giving+0.137
 succeeding+0.130
 counting+0.129
 encouraging+0.128
Neg logit contributions
issy-0.304
ppy-0.283
uffy-0.267
azz-0.256
ens-0.248
Token giving
Token ID3501
Feature activation+2.80e+01

Pos logit contributions
 reaching+0.074
 mesmer+0.062
 encouraging+0.061
 succeeding+0.060
 intrigued+0.060
Neg logit contributions
issy-0.179
ppy-0.167
isk-0.164
uffy-0.162
x-0.151
Token each
Token ID1123
Feature activation-4.50e+00

Pos logit contributions
 reaching+0.292
 giving+0.282
 understanding+0.262
 helping+0.260
 listening+0.251
Neg logit contributions
issy-0.896
ppy-0.814
uffy-0.768
azz-0.763
pillar-0.751
Token other
Token ID584
Feature activation-4.42e+00

Pos logit contributions
issy+0.131
ppy+0.119
uffy+0.112
azz+0.109
pillar+0.108
Neg logit contributions
 reaching-0.060
 giving-0.058
 helping-0.054
 understanding-0.054
 counting-0.053
Token a
Token ID257
Feature activation-5.06e+00

Pos logit contributions
issy+0.147
ppy+0.132
uffy+0.128
azz+0.125
acker+0.125
Neg logit contributions
 giving-0.040
 understanding-0.038
 reaching-0.038
 listening-0.037
 helping-0.037
Token wink
Token ID41955
Feature activation+6.65e+00

Pos logit contributions
issy+0.134
ppy+0.119
uffy+0.114
azz+0.110
pillar+0.109
Neg logit contributions
 reaching-0.081
 giving-0.076
 counting-0.072
 helping-0.071
 pointing-0.071
Token and
Token ID290
Feature activation+4.23e+00

Pos logit contributions
 reaching+0.087
 giving+0.085
 helping+0.079
 understanding+0.077
 counting+0.077
Neg logit contributions
issy-0.195
ppy-0.174
uffy-0.166
azz-0.162
acker-0.160
Token around
Token ID1088
Feature activation+1.22e+01

Pos logit contributions
 reaching+0.178
 giving+0.169
 understanding+0.155
 helping+0.154
 pointing+0.150
Neg logit contributions
issy-0.492
ppy-0.453
uffy-0.423
azz-0.422
ens-0.416
Token Fre
Token ID4848
Feature activation-6.92e+00

Pos logit contributions
 reaching+0.122
 giving+0.115
 understanding+0.107
 nodding+0.098
 encouraging+0.098
Neg logit contributions
issy-0.351
ppy-0.326
uffy-0.310
ens-0.307
isk-0.306
Tokenck
Token ID694
Feature activation+9.76e+00

Pos logit contributions
issy+0.150
ppy+0.133
uffy+0.120
azz+0.115
pillar+0.115
Neg logit contributions
 reaching-0.140
 giving-0.139
 helping-0.134
 understanding-0.131
 pointing-0.129
Tokenles
Token ID829
Feature activation-3.58e-01

Pos logit contributions
 reaching+0.181
 giving+0.170
 pointing+0.165
 counting+0.163
 understanding+0.163
Neg logit contributions
issy-0.237
ppy-0.202
uffy-0.190
acker-0.188
azz-0.187
Token,
Token ID11
Feature activation+6.82e+00

Pos logit contributions
issy+0.010
ppy+0.009
uffy+0.009
pillar+0.008
azz+0.008
Neg logit contributions
 reaching-0.005
 giving-0.005
 understanding-0.004
 helping-0.004
 counting-0.004
Token giving
Token ID3501
Feature activation+2.76e+01

Pos logit contributions
 reaching+0.074
 giving+0.071
 understanding+0.067
 helping+0.066
 counting+0.065
Neg logit contributions
issy-0.193
ppy-0.182
pillar-0.168
uffy-0.166
azz-0.165
Token her
Token ID607
Feature activation-1.47e+01

Pos logit contributions
 reaching+0.356
 giving+0.350
 helping+0.330
 understanding+0.321
 counting+0.317
Neg logit contributions
issy-0.788
ppy-0.717
uffy-0.670
pillar-0.658
azz-0.657
Token hugs
Token ID40657
Feature activation+1.60e+01

Pos logit contributions
issy+0.429
ppy+0.389
uffy+0.367
azz+0.358
pillar+0.355
Neg logit contributions
 reaching-0.194
 giving-0.186
 helping-0.173
 understanding-0.171
 counting-0.170
Token and
Token ID290
Feature activation+4.65e+00

Pos logit contributions
 reaching+0.193
 giving+0.188
 helping+0.172
 understanding+0.169
 showing+0.165
Neg logit contributions
issy-0.491
ppy-0.444
uffy-0.426
azz-0.412
acker-0.408
Token pet
Token ID4273
Feature activation+8.66e+00

Pos logit contributions
 reaching+0.071
 giving+0.063
 insisting+0.061
 testing+0.061
 succeeding+0.060
Neg logit contributions
issy-0.117
ppy-0.104
uffy-0.099
ens-0.097
pillar-0.096
Tokenting
Token ID889
Feature activation+9.57e+00

Pos logit contributions
 reaching+0.130
 giving+0.125
 helping+0.116
 understanding+0.115
 counting+0.115
Neg logit contributions
issy-0.239
ppy-0.214
uffy-0.201
azz-0.196
acker-0.194
Token day
Token ID1110
Feature activation+4.87e+00

Pos logit contributions
 reaching+0.107
 giving+0.104
 helping+0.099
 understanding+0.098
 counting+0.098
Neg logit contributions
issy-0.148
ppy-0.132
uffy-0.122
azz-0.119
pillar-0.117
Token and
Token ID290
Feature activation+5.62e+00

Pos logit contributions
 reaching+0.073
 understanding+0.069
 giving+0.067
 succeeding+0.067
 insisting+0.066
Neg logit contributions
issy-0.122
ppy-0.113
uffy-0.103
pillar-0.100
azz-0.100
Token the
Token ID262
Feature activation-1.27e+01

Pos logit contributions
 reaching+0.085
 giving+0.080
 counting+0.075
 helping+0.075
 understanding+0.075
Neg logit contributions
issy-0.152
ppy-0.137
uffy-0.127
azz-0.124
pillar-0.123
Token trees
Token ID7150
Feature activation-4.23e+00

Pos logit contributions
issy+0.320
ppy+0.287
uffy+0.263
azz+0.262
acker+0.257
Neg logit contributions
 reaching-0.217
 giving-0.215
 helping-0.205
 understanding-0.203
 counting-0.200
Token were
Token ID547
Feature activation+1.05e+01

Pos logit contributions
issy+0.103
ppy+0.093
uffy+0.086
azz+0.082
pillar+0.082
Neg logit contributions
 reaching-0.071
 giving-0.069
 understanding-0.068
 counting-0.066
 helping-0.066
Token giving
Token ID3501
Feature activation+2.74e+01

Pos logit contributions
 reaching+0.133
 Listen+0.127
 giving+0.125
 understanding+0.125
 succeeding+0.122
Neg logit contributions
issy-0.279
ppy-0.252
uffy-0.241
que-0.224
acky-0.222
Token a
Token ID257
Feature activation-5.44e+00

Pos logit contributions
 reaching+0.325
 giving+0.309
 helping+0.284
 counting+0.283
 understanding+0.282
Neg logit contributions
issy-0.833
ppy-0.759
uffy-0.717
azz-0.698
pillar-0.695
Token pleasant
Token ID15497
Feature activation+6.29e+00

Pos logit contributions
issy+0.135
ppy+0.118
uffy+0.117
pillar+0.112
azz+0.110
Neg logit contributions
 reaching-0.094
 giving-0.085
 counting-0.083
 insisting-0.083
 asking-0.082
Token shade
Token ID17979
Feature activation-7.14e-01

Pos logit contributions
 reaching+0.110
 giving+0.102
 insisting+0.100
 succeeding+0.100
 counting+0.099
Neg logit contributions
issy-0.147
ppy-0.125
uffy-0.123
pillar-0.121
que-0.117
Token.
Token ID13
Feature activation-1.63e+00

Pos logit contributions
issy+0.022
ppy+0.020
uffy+0.019
azz+0.018
pillar+0.018
Neg logit contributions
 reaching-0.008
 giving-0.008
 helping-0.007
 counting-0.007
 understanding-0.007
Token Mom
Token ID11254
Feature activation-1.59e+01

Pos logit contributions
issy+0.047
ppy+0.043
uffy+0.040
azz+0.039
acker+0.039
Neg logit contributions
 reaching-0.022
 giving-0.021
 helping-0.019
 pointing-0.019
 counting-0.019
Token,
Token ID11
Feature activation+7.33e+00

Pos logit contributions
issy+0.585
ppy+0.518
uffy+0.493
azz+0.483
acker+0.477
Neg logit contributions
 reaching-0.279
 giving-0.278
 helping-0.257
 understanding-0.254
 showing-0.253
Token pushing
Token ID7796
Feature activation+2.40e+01

Pos logit contributions
 reaching+0.100
 giving+0.095
 helping+0.089
 understanding+0.088
 counting+0.088
Neg logit contributions
issy-0.208
ppy-0.189
uffy-0.176
azz-0.174
pillar-0.172
Token her
Token ID607
Feature activation-1.42e+01

Pos logit contributions
 reaching+0.264
 giving+0.252
 pushing+0.249
 asking+0.247
 counting+0.246
Neg logit contributions
issy-0.731
uffy-0.634
ppy-0.629
ase-0.624
acker-0.622
Token away
Token ID1497
Feature activation+6.17e+00

Pos logit contributions
issy+0.403
ppy+0.359
uffy+0.336
azz+0.330
acker+0.327
Neg logit contributions
 reaching-0.206
 giving-0.199
 helping-0.187
 counting-0.186
 understanding-0.184
Token and
Token ID290
Feature activation+4.85e+00

Pos logit contributions
 reaching+0.080
 giving+0.077
 helping+0.072
 understanding+0.071
 pointing+0.071
Neg logit contributions
issy-0.174
ppy-0.159
uffy-0.149
azz-0.146
pillar-0.144
Token giving
Token ID3501
Feature activation+2.74e+01

Pos logit contributions
 reaching+0.061
 giving+0.057
 packing+0.057
 understanding+0.055
 spending+0.054
Neg logit contributions
issy-0.132
ppy-0.122
uffy-0.113
ens-0.112
azz-0.112
Token Lily
Token ID20037
Feature activation-2.23e+01

Pos logit contributions
 reaching+0.266
 giving+0.265
 understanding+0.242
 helping+0.237
 encouraging+0.235
Neg logit contributions
issy-0.874
ppy-0.795
uffy-0.764
acker-0.743
azz-0.739
Token all
Token ID477
Feature activation+3.27e+00

Pos logit contributions
issy+0.841
ppy+0.755
uffy+0.748
azz+0.724
acker+0.717
Neg logit contributions
 giving-0.101
 to-0.094
 understanding-0.090
 reaching-0.089
 listening-0.085
Token the
Token ID262
Feature activation-1.31e+01

Pos logit contributions
 reaching+0.033
 giving+0.031
 helping+0.028
 understanding+0.028
 counting+0.028
Neg logit contributions
issy-0.106
ppy-0.097
uffy-0.092
azz-0.090
pillar-0.089
Token sunshine
Token ID34488
Feature activation-3.13e+00

Pos logit contributions
issy+0.361
ppy+0.326
uffy+0.305
azz+0.298
acker+0.290
Neg logit contributions
 reaching-0.194
 giving-0.190
 helping-0.179
 understanding-0.179
 counting-0.177
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
issy+0.095
ppy+0.086
uffy+0.081
azz+0.079
acker+0.078
Neg logit contributions
 reaching-0.038
 giving-0.037
 helping-0.034
 counting-0.033
 understanding-0.033
Token me
Token ID502
Feature activation-6.93e+00

Pos logit contributions
 reaching+0.263
 giving+0.251
 helping+0.229
 understanding+0.229
 counting+0.227
Neg logit contributions
issy-0.711
ppy-0.648
uffy-0.612
azz-0.597
pillar-0.591
Token,
Token ID11
Feature activation+7.47e+00

Pos logit contributions
issy+0.188
ppy+0.175
uffy+0.156
isk+0.150
ches+0.150
Neg logit contributions
 reaching-0.096
 giving-0.089
 pointing-0.088
 insisting-0.086
 understanding-0.086
Token thank
Token ID5875
Feature activation+2.08e+01

Pos logit contributions
 reaching+0.110
 giving+0.105
 counting+0.098
 pointing+0.097
 understanding+0.097
Neg logit contributions
issy-0.204
ppy-0.184
uffy-0.170
azz-0.167
pillar-0.164
Token you
Token ID345
Feature activation-8.57e+00

Pos logit contributions
 reaching+0.367
 giving+0.357
 understanding+0.337
 pointing+0.335
 counting+0.334
Neg logit contributions
issy-0.503
ppy-0.456
uffy-0.418
ens-0.402
azz-0.400
Token for
Token ID329
Feature activation+1.52e+01

Pos logit contributions
issy+0.227
ppy+0.213
uffy+0.191
azz+0.186
pillar+0.183
Neg logit contributions
 reaching-0.128
 giving-0.121
 pointing-0.117
 counting-0.114
 understanding-0.114
Token giving
Token ID3501
Feature activation+2.73e+01

Pos logit contributions
 mesmer+0.188
 intrigued+0.152
 reaching+0.149
 fascinated+0.143
 pointing+0.136
Neg logit contributions
issy-0.383
isk-0.376
ppy-0.361
que-0.339
Mic-0.336
Token me
Token ID502
Feature activation-7.67e+00

Pos logit contributions
 reaching+0.326
 giving+0.319
 helping+0.298
 understanding+0.295
 counting+0.286
Neg logit contributions
issy-0.832
ppy-0.750
uffy-0.713
azz-0.710
acker-0.691
Token such
Token ID884
Feature activation+5.90e+00

Pos logit contributions
issy+0.264
ppy+0.240
uffy+0.231
azz+0.228
pillar+0.224
Neg logit contributions
 reaching-0.060
 giving-0.058
 helping-0.052
 understanding-0.051
 counting-0.049
Token good
Token ID922
Feature activation+9.27e+00

Pos logit contributions
 reaching+0.065
 giving+0.057
 counting+0.055
 pointing+0.053
 listening+0.050
Neg logit contributions
issy-0.181
ppy-0.166
uffy-0.155
pillar-0.151
ocado-0.151
Token advice
Token ID5608
Feature activation+1.20e+01

Pos logit contributions
 reaching+0.153
 giving+0.141
 pointing+0.141
 mesmer+0.138
 counting+0.135
Neg logit contributions
issy-0.224
ppy-0.205
uffy-0.193
ens-0.188
pillar-0.188
Token!
Token ID0
Feature activation+2.19e-01

Pos logit contributions
 reaching+0.168
 giving+0.161
 understanding+0.150
 helping+0.150
 counting+0.150
Neg logit contributions
issy-0.339
ppy-0.308
uffy-0.288
azz-0.281
pillar-0.278
Token king
Token ID5822
Feature activation-9.82e+00

Pos logit contributions
issy+0.311
ppy+0.276
uffy+0.256
azz+0.250
pillar+0.244
Neg logit contributions
 reaching-0.230
 giving-0.225
 helping-0.213
 counting-0.211
 understanding-0.211
Token's
Token ID338
Feature activation-2.64e+00

Pos logit contributions
issy+0.301
ppy+0.262
uffy+0.257
azz+0.251
acky+0.249
Neg logit contributions
 reaching-0.120
 giving-0.117
 helping-0.107
 counting-0.106
 listening-0.103
Token tum
Token ID11814
Feature activation+7.01e+00

Pos logit contributions
issy+0.065
ppy+0.058
uffy+0.054
azz+0.052
pillar+0.052
Neg logit contributions
 reaching-0.046
 giving-0.044
 helping-0.042
 understanding-0.042
 counting-0.041
Tokenmy
Token ID1820
Feature activation-1.27e+01

Pos logit contributions
 reaching+0.175
 giving+0.172
 helping+0.166
 listening+0.166
 eating+0.163
Neg logit contributions
issy-0.113
ppy-0.091
uffy-0.086
azz-0.083
acky-0.080
Token by
Token ID416
Feature activation+5.56e+00

Pos logit contributions
issy+0.369
ppy+0.334
uffy+0.315
azz+0.307
pillar+0.304
Neg logit contributions
 reaching-0.169
 giving-0.162
 helping-0.151
 understanding-0.150
 counting-0.149
Token giving
Token ID3501
Feature activation+2.70e+01

Pos logit contributions
 reaching+0.052
 understanding+0.051
 encouraging+0.050
 mesmer+0.049
 pointing+0.047
Neg logit contributions
issy-0.153
ppy-0.149
isk-0.144
uffy-0.134
que-0.132
Token him
Token ID683
Feature activation-1.26e+01

Pos logit contributions
 reaching+0.361
 giving+0.355
 understanding+0.331
 helping+0.329
 asking+0.324
Neg logit contributions
issy-0.779
ppy-0.695
uffy-0.663
azz-0.653
acker-0.632
Token too
Token ID1165
Feature activation+5.31e+00

Pos logit contributions
issy+0.415
ppy+0.379
uffy+0.361
azz+0.355
pillar+0.350
Neg logit contributions
 reaching-0.117
 giving-0.112
 helping-0.101
 understanding-0.099
 counting-0.098
Token many
Token ID867
Feature activation+5.80e+00

Pos logit contributions
 reaching+0.096
 pointing+0.093
 showing+0.093
 giving+0.091
 joining+0.090
Neg logit contributions
issy-0.116
ppy-0.107
hell-0.102
acker-0.100
uffy-0.100
Token ch
Token ID442
Feature activation+1.31e+01

Pos logit contributions
 reaching+0.090
 giving+0.084
 pointing+0.082
 insisting+0.079
 showing+0.078
Neg logit contributions
issy-0.151
ppy-0.136
uffy-0.126
pillar-0.125
que-0.122
Tokenocol
Token ID4668
Feature activation+4.06e+00

Pos logit contributions
 reaching+0.285
 giving+0.277
 helping+0.266
 understanding+0.266
 counting+0.264
Neg logit contributions
issy-0.271
ppy-0.231
uffy-0.213
azz-0.206
pillar-0.202
Token that
Token ID326
Feature activation-4.60e-01

Pos logit contributions
issy+0.052
ppy+0.046
uffy+0.043
pillar+0.042
azz+0.042
Neg logit contributions
 reaching-0.032
 giving-0.031
 counting-0.029
 understanding-0.029
 helping-0.029
Token was
Token ID373
Feature activation+9.33e+00

Pos logit contributions
issy+0.012
ppy+0.010
uffy+0.010
azz+0.009
pillar+0.009
Neg logit contributions
 reaching-0.008
 giving-0.008
 helping-0.007
 understanding-0.007
 counting-0.007
Token very
Token ID845
Feature activation+9.38e+00

Pos logit contributions
 reaching+0.111
 giving+0.102
 understanding+0.098
 counting+0.098
 pointing+0.095
Neg logit contributions
issy-0.261
ppy-0.238
uffy-0.217
isk-0.211
ocado-0.210
Token nice
Token ID3621
Feature activation+6.02e+00

Pos logit contributions
 reaching+0.152
 giving+0.146
 joining+0.140
 counting+0.139
 spending+0.138
Neg logit contributions
issy-0.229
ppy-0.206
uffy-0.196
ocado-0.191
pillar-0.187
Token and
Token ID290
Feature activation+4.51e+00

Pos logit contributions
 reaching+0.086
 giving+0.080
 understanding+0.077
 pointing+0.076
 counting+0.075
Neg logit contributions
issy-0.164
ppy-0.149
uffy-0.139
pillar-0.137
azz-0.135
Token giving
Token ID3501
Feature activation+2.70e+01

Pos logit contributions
 reaching+0.066
 counting+0.060
 giving+0.059
 packing+0.058
 pointing+0.058
Neg logit contributions
issy-0.112
ppy-0.101
ens-0.094
uffy-0.093
ocado-0.093
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 reaching+0.206
 giving+0.196
 understanding+0.173
 helping+0.170
 counting+0.163
Neg logit contributions
issy-0.934
ppy-0.858
uffy-0.820
azz-0.807
acker-0.792
Token
Token ID198
Feature activation-3.22e+00

Pos logit contributions
issy+0.033
ppy+0.030
uffy+0.028
azz+0.027
pillar+0.027
Neg logit contributions
 reaching-0.016
 giving-0.015
 understanding-0.014
 helping-0.014
 counting-0.014
Token
Token ID198
Feature activation-3.26e+00

Pos logit contributions
issy+0.097
ppy+0.089
uffy+0.083
azz+0.080
acker+0.080
Neg logit contributions
 reaching-0.038
 giving-0.036
 understanding-0.035
 helping-0.034
 counting-0.033
TokenLater
Token ID18602
Feature activation+2.66e+00

Pos logit contributions
issy+0.083
ppy+0.074
uffy+0.070
ens+0.066
azz+0.066
Neg logit contributions
 reaching-0.054
 giving-0.052
 understanding-0.052
 helping-0.050
 encouraging-0.050
Token that
Token ID326
Feature activation-1.69e+00

Pos logit contributions
 reaching+0.031
 giving+0.029
 understanding+0.027
 helping+0.027
 counting+0.026
Neg logit contributions
issy-0.082
ppy-0.074
uffy-0.071
pillar-0.068
azz-0.068
Token way
Token ID835
Feature activation+9.80e+00

Pos logit contributions
issy+0.360
ppy+0.325
uffy+0.316
azz+0.299
pillar+0.298
Neg logit contributions
 reaching-0.196
 giving-0.187
 helping-0.175
 understanding-0.172
 listening-0.172
Token of
Token ID286
Feature activation+1.08e+01

Pos logit contributions
 reaching+0.105
 giving+0.103
 counting+0.096
 understanding+0.095
 teaching+0.094
Neg logit contributions
issy-0.292
ppy-0.273
uffy-0.254
azz-0.251
pillar-0.245
Token thanking
Token ID41892
Feature activation+2.01e+01

Pos logit contributions
 reaching+0.109
 giving+0.102
 mesmer+0.098
 helping+0.095
 understanding+0.094
Neg logit contributions
issy-0.297
ppy-0.274
ens-0.266
isk-0.264
uffy-0.260
Token yoga
Token ID20351
Feature activation+9.48e-01

Pos logit contributions
 reaching+0.234
 giving+0.223
 helping+0.204
 understanding+0.202
 pointing+0.199
Neg logit contributions
issy-0.605
ppy-0.554
uffy-0.519
azz-0.509
pillar-0.498
Token for
Token ID329
Feature activation+1.52e+01

Pos logit contributions
 reaching+0.013
 giving+0.013
 helping+0.012
 understanding+0.012
 counting+0.012
Neg logit contributions
issy-0.027
ppy-0.024
uffy-0.023
azz-0.022
pillar-0.022
Token giving
Token ID3501
Feature activation+2.69e+01

Pos logit contributions
 mesmer+0.160
 reaching+0.159
 pointing+0.141
 fascinated+0.136
 giving+0.134
Neg logit contributions
issy-0.410
ppy-0.385
isk-0.359
ens-0.351
que-0.336
Token them
Token ID606
Feature activation-1.37e+01

Pos logit contributions
 reaching+0.287
 giving+0.275
 understanding+0.252
 helping+0.251
 counting+0.247
Neg logit contributions
issy-0.856
ppy-0.776
uffy-0.735
azz-0.726
pillar-0.714
Token a
Token ID257
Feature activation-5.35e+00

Pos logit contributions
issy+0.477
ppy+0.437
uffy+0.418
azz+0.410
pillar+0.407
Neg logit contributions
 reaching-0.104
 giving-0.098
 helping-0.087
 understanding-0.086
 counting-0.084
Token fun
Token ID1257
Feature activation+9.40e+00

Pos logit contributions
issy+0.143
ppy+0.129
uffy+0.123
pillar+0.120
azz+0.119
Neg logit contributions
 reaching-0.082
 giving-0.077
 counting-0.072
 helping-0.072
 pointing-0.072
Token time
Token ID640
Feature activation+4.90e+00

Pos logit contributions
 reaching+0.158
 giving+0.153
 helping+0.144
 understanding+0.143
 counting+0.143
Neg logit contributions
issy-0.239
ppy-0.213
uffy-0.199
azz-0.194
pillar-0.192
Token.
Token ID13
Feature activation-1.98e+00

Pos logit contributions
 reaching+0.073
 giving+0.068
 understanding+0.066
 pointing+0.065
 counting+0.064
Neg logit contributions
issy-0.126
ppy-0.116
uffy-0.107
azz-0.105
pillar-0.103
Token Lisa
Token ID15378
Feature activation-2.26e+01

Pos logit contributions
 reaching+0.095
 giving+0.092
 helping+0.085
 understanding+0.085
 counting+0.084
Neg logit contributions
issy-0.199
ppy-0.180
uffy-0.169
azz-0.165
pillar-0.163
Token!"
Token ID2474
Feature activation+3.38e+00

Pos logit contributions
issy+0.614
ppy+0.555
uffy+0.520
azz+0.503
pillar+0.499
Neg logit contributions
 reaching-0.338
 giving-0.323
 understanding-0.306
 helping-0.304
 counting-0.299
Token Dad
Token ID17415
Feature activation-1.35e+01

Pos logit contributions
 reaching+0.056
 giving+0.054
 helping+0.051
 understanding+0.050
 counting+0.050
Neg logit contributions
issy-0.087
ppy-0.078
uffy-0.072
azz-0.071
pillar-0.070
Token said
Token ID531
Feature activation+6.92e+00

Pos logit contributions
issy+0.313
ppy+0.278
azz+0.251
uffy+0.242
ches+0.237
Neg logit contributions
 reaching-0.251
 giving-0.239
 understanding-0.238
 helping-0.233
 succeeding-0.230
Token,
Token ID11
Feature activation+5.61e+00

Pos logit contributions
 reaching+0.116
 diving+0.106
 giving+0.105
 spending+0.105
 testing+0.104
Neg logit contributions
issy-0.164
ppy-0.152
uffy-0.134
azz-0.129
ens-0.127
Token giving
Token ID3501
Feature activation+2.68e+01

Pos logit contributions
 succeeding+0.047
 spending+0.046
 example+0.046
 reaching+0.044
 Listen+0.044
Neg logit contributions
ppy-0.149
issy-0.146
isk-0.129
azz-0.127
illo-0.126
Token her
Token ID607
Feature activation-1.57e+01

Pos logit contributions
 reaching+0.351
 giving+0.326
 counting+0.319
 understanding+0.315
 helping+0.314
Neg logit contributions
issy-0.777
ppy-0.702
uffy-0.678
pillar-0.652
azz-0.650
Token the
Token ID262
Feature activation-1.32e+01

Pos logit contributions
issy+0.542
ppy+0.499
uffy+0.475
azz+0.466
pillar+0.462
Neg logit contributions
 reaching-0.122
 giving-0.115
 helping-0.101
 understanding-0.100
 counting-0.098
Token doll
Token ID3654
Feature activation-1.56e+01

Pos logit contributions
issy+0.320
ppy+0.283
uffy+0.265
azz+0.255
pillar+0.255
Neg logit contributions
 reaching-0.237
 giving-0.228
 helping-0.216
 understanding-0.215
 counting-0.214
Token and
Token ID290
Feature activation+3.47e+00

Pos logit contributions
issy+0.490
ppy+0.442
uffy+0.422
azz+0.415
pillar+0.412
Neg logit contributions
 reaching-0.168
 giving-0.167
 counting-0.155
 helping-0.152
 understanding-0.148
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
 reaching+0.053
 giving+0.048
 understanding+0.046
 helping+0.046
 counting+0.046
Neg logit contributions
issy-0.091
ppy-0.081
uffy-0.075
azz-0.074
ens-0.073
Token hungry
Token ID14720
Feature activation+1.12e+01

Pos logit contributions
 reaching+0.145
 giving+0.129
 teaching+0.127
 showing+0.127
 joining+0.127
Neg logit contributions
issy-0.244
uffy-0.235
ppy-0.234
izzy-0.212
acky-0.212
Token.
Token ID13
Feature activation+3.49e-02

Pos logit contributions
 reaching+0.180
 giving+0.175
 understanding+0.170
 pointing+0.170
 showing+0.168
Neg logit contributions
issy-0.274
ppy-0.256
uffy-0.231
pillar-0.228
ens-0.226
Token Would
Token ID10928
Feature activation+1.95e+01

Pos logit contributions
 reaching+0.001
 giving+0.001
 understanding+0.001
 helping+0.001
 listening+0.001
Neg logit contributions
issy-0.001
ppy-0.001
uffy-0.001
azz-0.001
pillar-0.001
Token you
Token ID345
Feature activation-9.14e+00

Pos logit contributions
 reaching+0.392
 giving+0.376
 understanding+0.371
 mesmer+0.370
 pointing+0.369
Neg logit contributions
issy-0.411
ppy-0.364
uffy-0.336
que-0.326
ens-0.316
Token mind
Token ID2000
Feature activation+1.20e+01

Pos logit contributions
ppy+0.192
issy+0.190
acker+0.159
que+0.157
azz+0.152
Neg logit contributions
 reaching-0.174
 pointing-0.172
 testing-0.169
 pushing-0.169
 insisting-0.168
Token giving
Token ID3501
Feature activation+2.67e+01

Pos logit contributions
 reaching+0.178
 understanding+0.175
 succeeding+0.174
 encouraging+0.172
 mesmer+0.168
Neg logit contributions
issy-0.286
ppy-0.282
uffy-0.250
que-0.243
ens-0.238
Token me
Token ID502
Feature activation-7.73e+00

Pos logit contributions
 reaching+0.392
 giving+0.379
 helping+0.356
 understanding+0.351
 counting+0.350
Neg logit contributions
issy-0.743
ppy-0.663
uffy-0.625
azz-0.616
pillar-0.598
Token some
Token ID617
Feature activation+3.03e+00

Pos logit contributions
issy+0.239
ppy+0.221
uffy+0.206
azz+0.201
pillar+0.200
Neg logit contributions
 reaching-0.087
 giving-0.081
 pointing-0.075
 understanding-0.074
 counting-0.074
Token of
Token ID286
Feature activation+1.01e+01

Pos logit contributions
 reaching+0.052
 pointing+0.050
 giving+0.049
 counting+0.048
 insisting+0.047
Neg logit contributions
issy-0.073
ppy-0.065
uffy-0.060
que-0.060
pillar-0.060
Token your
Token ID534
Feature activation-1.09e+01

Pos logit contributions
 reaching+0.134
 giving+0.123
 pointing+0.120
 understanding+0.118
 counting+0.116
Neg logit contributions
issy-0.283
ppy-0.258
uffy-0.239
ens-0.235
pillar-0.234
Token ol
Token ID25776
Feature activation-5.82e+00

Pos logit contributions
issy+0.280
ppy+0.250
uffy+0.234
azz+0.228
pillar+0.225
Neg logit contributions
 reaching-0.180
 giving-0.174
 helping-0.165
 understanding-0.164
 counting-0.163
Token and
Token ID290
Feature activation+6.37e+00

Pos logit contributions
issy+0.020
ppy+0.017
uffy+0.016
azz+0.016
isk+0.016
Neg logit contributions
 reaching-0.015
 giving-0.015
 listening-0.014
 helping-0.014
 joining-0.014
Token keep
Token ID1394
Feature activation+2.32e+01

Pos logit contributions
 reaching+0.090
 giving+0.086
 helping+0.080
 understanding+0.080
 counting+0.079
Neg logit contributions
issy-0.177
ppy-0.161
uffy-0.150
azz-0.147
pillar-0.145
Token wandering
Token ID24504
Feature activation+1.73e+01

Pos logit contributions
 reaching+0.255
 mesmer+0.237
 Listen+0.222
 giving+0.219
 counting+0.215
Neg logit contributions
issy-0.628
ppy-0.586
uffy-0.554
x-0.547
ens-0.540
Token,
Token ID11
Feature activation+7.96e+00

Pos logit contributions
 reaching+0.210
 giving+0.201
 helping+0.188
 understanding+0.183
 counting+0.182
Neg logit contributions
issy-0.524
ppy-0.473
uffy-0.450
azz-0.441
pillar-0.434
Token never
Token ID1239
Feature activation+1.18e+01

Pos logit contributions
 reaching+0.097
 giving+0.091
 understanding+0.085
 counting+0.084
 helping+0.084
Neg logit contributions
issy-0.235
ppy-0.214
uffy-0.202
azz-0.197
pillar-0.195
Token stopping
Token ID12225
Feature activation+2.67e+01

Pos logit contributions
 reaching+0.136
 giving+0.127
 packing+0.123
 counting+0.123
 encouraging+0.123
Neg logit contributions
issy-0.336
ppy-0.316
uffy-0.286
pillar-0.285
azz-0.285
Token for
Token ID329
Feature activation+1.58e+01

Pos logit contributions
 reaching+0.316
 understanding+0.299
 giving+0.299
 encouraging+0.285
 teaching+0.284
Neg logit contributions
issy-0.720
ppy-0.690
uffy-0.632
azz-0.614
pillar-0.611
Token too
Token ID1165
Feature activation+6.20e+00

Pos logit contributions
 reaching+0.191
 giving+0.181
 pointing+0.179
 mesmer+0.175
 showing+0.171
Neg logit contributions
issy-0.435
ppy-0.399
ens-0.369
que-0.368
uffy-0.360
Token long
Token ID890
Feature activation+3.45e+00

Pos logit contributions
 giving+0.092
 reaching+0.089
 showing+0.089
 teaching+0.088
 pointing+0.087
Neg logit contributions
issy-0.157
ppy-0.145
uffy-0.135
pillar-0.133
ens-0.132
Token at
Token ID379
Feature activation+1.74e+01

Pos logit contributions
 reaching+0.048
 giving+0.047
 understanding+0.046
 pointing+0.045
 encouraging+0.044
Neg logit contributions
issy-0.093
ppy-0.086
uffy-0.078
azz-0.077
pillar-0.077
Token one
Token ID530
Feature activation-6.51e+00

Pos logit contributions
 understanding+0.190
 reaching+0.187
 encouraging+0.180
 giving+0.179
 pointing+0.177
Neg logit contributions
issy-0.495
ppy-0.455
uffy-0.423
isk-0.419
que-0.414
Token the
Token ID262
Feature activation-1.27e+01

Pos logit contributions
 reaching+0.151
 giving+0.146
 understanding+0.134
 helping+0.134
 pointing+0.133
Neg logit contributions
issy-0.329
ppy-0.299
uffy-0.279
azz-0.271
pillar-0.270
Token right
Token ID826
Feature activation+1.17e+01

Pos logit contributions
issy+0.315
ppy+0.280
uffy+0.261
azz+0.253
pillar+0.252
Neg logit contributions
 reaching-0.220
 giving-0.213
 helping-0.201
 understanding-0.200
 counting-0.199
Token thing
Token ID1517
Feature activation-2.95e+00

Pos logit contributions
 giving+0.210
 reaching+0.200
 teaching+0.198
 pointing+0.197
 encouraging+0.196
Neg logit contributions
issy-0.258
ppy-0.230
uffy-0.219
ase-0.210
que-0.206
Token instead
Token ID2427
Feature activation+8.34e+00

Pos logit contributions
issy+0.089
ppy+0.081
uffy+0.076
azz+0.075
acker+0.074
Neg logit contributions
 reaching-0.036
 giving-0.034
 helping-0.032
 counting-0.032
 understanding-0.031
Token of
Token ID286
Feature activation+1.05e+01

Pos logit contributions
 reaching+0.117
 giving+0.113
 understanding+0.106
 helping+0.105
 counting+0.104
Neg logit contributions
issy-0.235
ppy-0.212
uffy-0.199
azz-0.194
pillar-0.193
Token giving
Token ID3501
Feature activation+2.65e+01

Pos logit contributions
 mesmer+0.099
 reaching+0.095
 giving+0.091
 understanding+0.089
 pointing+0.087
Neg logit contributions
issy-0.285
ppy-0.267
uffy-0.252
ens-0.251
isk-0.250
Token in
Token ID287
Feature activation+8.64e+00

Pos logit contributions
 reaching+0.310
 giving+0.294
 helping+0.277
 counting+0.273
 understanding+0.272
Neg logit contributions
issy-0.822
ppy-0.737
uffy-0.700
azz-0.693
acker-0.676
Token to
Token ID284
Feature activation+1.09e+01

Pos logit contributions
 reaching+0.099
 giving+0.092
 pointing+0.087
 listening+0.083
 showing+0.082
Neg logit contributions
issy-0.256
ppy-0.236
uffy-0.220
pillar-0.214
azz-0.213
Token his
Token ID465
Feature activation-1.46e+01

Pos logit contributions
 reaching+0.118
 giving+0.111
 pointing+0.100
 helping+0.100
 understanding+0.100
Neg logit contributions
issy-0.340
ppy-0.311
uffy-0.293
azz-0.286
pillar-0.284
Token greed
Token ID25474
Feature activation+9.09e+00

Pos logit contributions
issy+0.374
ppy+0.334
uffy+0.332
pillar+0.308
azz+0.301
Neg logit contributions
 reaching-0.227
 giving-0.216
 pointing-0.202
 listening-0.198
 spending-0.197
Token.
Token ID13
Feature activation-1.74e+00

Pos logit contributions
 reaching+0.123
 giving+0.120
 helping+0.115
 understanding+0.114
 listening+0.110
Neg logit contributions
issy-0.259
uffy-0.227
ppy-0.225
azz-0.216
acker-0.209
Token these
Token ID777
Feature activation-4.23e+00

Pos logit contributions
 reaching+0.188
 giving+0.184
 helping+0.170
 understanding+0.168
 counting+0.166
Neg logit contributions
issy-0.486
ppy-0.440
uffy-0.417
azz-0.406
pillar-0.404
Token."
Token ID526
Feature activation+2.24e-01

Pos logit contributions
issy+0.105
ppy+0.093
uffy+0.086
azz+0.084
pillar+0.083
Neg logit contributions
 reaching-0.074
 giving-0.072
 helping-0.069
 counting-0.068
 understanding-0.068
Token He
Token ID679
Feature activation-1.33e+01

Pos logit contributions
 reaching+0.003
 giving+0.003
 understanding+0.003
 helping+0.003
 counting+0.003
Neg logit contributions
issy-0.006
ppy-0.006
uffy-0.005
acker-0.005
pillar-0.005
Token said
Token ID531
Feature activation+7.20e+00

Pos logit contributions
issy+0.345
ppy+0.310
uffy+0.288
azz+0.282
pillar+0.277
Neg logit contributions
 reaching-0.217
 giving-0.208
 helping-0.199
 understanding-0.195
 counting-0.195
Token,
Token ID11
Feature activation+5.46e+00

Pos logit contributions
 reaching+0.130
 mesmer+0.120
 planning+0.119
 counting+0.118
 insisting+0.118
Neg logit contributions
issy-0.148
ppy-0.143
uffy-0.127
hell-0.124
azz-0.120
Token giving
Token ID3501
Feature activation+2.65e+01

Pos logit contributions
 succeeding+0.072
 reaching+0.071
 anticipation+0.071
 example+0.070
 planning+0.069
Neg logit contributions
ppy-0.121
issy-0.118
azz-0.107
isk-0.104
uffy-0.103
Token them
Token ID606
Feature activation-1.35e+01

Pos logit contributions
 reaching+0.253
 giving+0.238
 helping+0.215
 counting+0.214
 understanding+0.212
Neg logit contributions
issy-0.876
ppy-0.798
uffy-0.760
azz-0.746
pillar-0.741
Token two
Token ID734
Feature activation-6.92e+00

Pos logit contributions
issy+0.498
ppy+0.455
uffy+0.443
pillar+0.434
azz+0.430
Neg logit contributions
 reaching-0.067
 giving-0.067
 understanding-0.059
 helping-0.058
 counting-0.057
Token life
Token ID1204
Feature activation+1.61e+00

Pos logit contributions
issy+0.191
ppy+0.171
uffy+0.161
azz+0.156
acker+0.155
Neg logit contributions
 reaching-0.103
 giving-0.102
 understanding-0.096
 helping-0.095
 counting-0.094
Token jackets
Token ID35129
Feature activation-3.88e+00

Pos logit contributions
 giving+0.017
 reaching+0.016
 helping+0.015
 counting+0.015
 understanding+0.014
Neg logit contributions
issy-0.052
ppy-0.047
uffy-0.044
acker-0.044
azz-0.044
Token.
Token ID13
Feature activation-1.59e+00

Pos logit contributions
issy+0.120
ppy+0.109
uffy+0.103
azz+0.101
pillar+0.100
Neg logit contributions
 reaching-0.044
 giving-0.043
 helping-0.040
 understanding-0.038
 counting-0.038
Token that
Token ID326
Feature activation-4.87e-01

Pos logit contributions
 reaching+0.170
 giving+0.162
 helping+0.147
 understanding+0.145
 counting+0.144
Neg logit contributions
issy-0.530
ppy-0.485
uffy-0.460
azz-0.449
pillar-0.446
Token he
Token ID339
Feature activation-1.44e+01

Pos logit contributions
issy+0.014
ppy+0.013
uffy+0.012
azz+0.012
pillar+0.012
Neg logit contributions
 reaching-0.006
 giving-0.006
 helping-0.006
 understanding-0.006
 counting-0.006
Token was
Token ID373
Feature activation+9.38e+00

Pos logit contributions
issy+0.355
ppy+0.314
uffy+0.296
azz+0.286
acker+0.282
Neg logit contributions
 reaching-0.259
 giving-0.255
 understanding-0.244
 helping-0.242
 pointing-0.239
Token serious
Token ID2726
Feature activation+9.12e+00

Pos logit contributions
 reaching+0.128
 spending+0.119
 diving+0.119
 Listen+0.119
 giving+0.115
Neg logit contributions
issy-0.218
ppy-0.196
uffy-0.191
isk-0.188
que-0.182
Token about
Token ID546
Feature activation+1.71e+01

Pos logit contributions
 reaching+0.146
 giving+0.137
 understanding+0.131
 diving+0.131
 pointing+0.129
Neg logit contributions
issy-0.223
ppy-0.202
uffy-0.184
ens-0.182
azz-0.181
Token giving
Token ID3501
Feature activation+2.64e+01

Pos logit contributions
 reaching+0.174
 giving+0.156
 understanding+0.145
 encouraging+0.142
 pointing+0.140
Neg logit contributions
issy-0.522
ppy-0.481
uffy-0.452
azz-0.443
ens-0.442
Token the
Token ID262
Feature activation-1.36e+01

Pos logit contributions
 reaching+0.282
 giving+0.268
 counting+0.252
 helping+0.250
 understanding+0.249
Neg logit contributions
issy-0.845
ppy-0.765
uffy-0.724
azz-0.715
acker-0.710
Token toy
Token ID13373
Feature activation-6.78e+00

Pos logit contributions
issy+0.334
ppy+0.296
uffy+0.277
azz+0.268
pillar+0.266
Neg logit contributions
 reaching-0.241
 giving-0.232
 helping-0.220
 understanding-0.219
 counting-0.218
Token to
Token ID284
Feature activation+1.06e+01

Pos logit contributions
issy+0.191
ppy+0.173
uffy+0.161
acker+0.159
azz+0.158
Neg logit contributions
 reaching-0.095
 giving-0.095
 helping-0.090
 counting-0.088
 showing-0.087
Token Jim
Token ID5395
Feature activation-2.23e+01

Pos logit contributions
 reaching+0.129
 giving+0.122
 helping+0.112
 counting+0.112
 understanding+0.111
Neg logit contributions
issy-0.320
ppy-0.290
uffy-0.273
azz-0.267
pillar-0.263
Token.
Token ID13
Feature activation-1.65e+00

Pos logit contributions
issy+0.625
ppy+0.565
uffy+0.530
azz+0.517
pillar+0.511
Neg logit contributions
 reaching-0.316
 giving-0.304
 helping-0.284
 understanding-0.283
 counting-0.281
Token proud
Token ID6613
Feature activation+1.36e+01

Pos logit contributions
 giving+0.107
 reaching+0.107
 insisting+0.101
 warning+0.097
 asking+0.097
Neg logit contributions
issy-0.257
uffy-0.230
ppy-0.229
ocado-0.228
acky-0.219
Token of
Token ID286
Feature activation+1.01e+01

Pos logit contributions
 reaching+0.224
 insisting+0.217
 giving+0.210
 pointing+0.208
 succeeding+0.207
Neg logit contributions
issy-0.321
ppy-0.273
azz-0.271
pillar-0.270
affe-0.255
Token himself
Token ID2241
Feature activation-1.21e+01

Pos logit contributions
 mesmer+0.132
 reaching+0.122
 giving+0.120
 intrigued+0.117
 insisting+0.115
Neg logit contributions
issy-0.208
hell-0.186
ppy-0.181
ens-0.180
isk-0.176
Token for
Token ID329
Feature activation+1.44e+01

Pos logit contributions
issy+0.330
ppy+0.299
uffy+0.278
azz+0.271
pillar+0.269
Neg logit contributions
 reaching-0.173
 giving-0.162
 understanding-0.155
 pointing-0.152
 helping-0.150
Token not
Token ID407
Feature activation+8.82e+00

Pos logit contributions
 mesmer+0.148
 intrigued+0.142
 fascinated+0.127
 Listen+0.127
 Asked+0.114
Neg logit contributions
issy-0.304
ppy-0.290
isk-0.284
x-0.262
lar-0.260
Token giving
Token ID3501
Feature activation+2.64e+01

Pos logit contributions
 Listen+0.058
 reaching+0.058
 encouraging+0.057
 mesmer+0.055
 insisting+0.055
Neg logit contributions
issy-0.244
ppy-0.233
isk-0.216
ens-0.214
ety-0.213
Token up
Token ID510
Feature activation+7.38e+00

Pos logit contributions
 reaching+0.244
 giving+0.230
 helping+0.207
 understanding+0.204
 counting+0.203
Neg logit contributions
issy-0.873
ppy-0.800
uffy-0.760
azz-0.745
pillar-0.738
Token.
Token ID13
Feature activation-1.56e+00

Pos logit contributions
 reaching+0.102
 giving+0.097
 pointing+0.090
 understanding+0.090
 helping+0.090
Neg logit contributions
issy-0.199
ppy-0.182
uffy-0.171
azz-0.167
ens-0.165
Token Tiny
Token ID20443
Feature activation-2.36e+01

Pos logit contributions
issy+0.044
ppy+0.040
uffy+0.037
azz+0.036
pillar+0.036
Neg logit contributions
 reaching-0.022
 giving-0.022
 helping-0.020
 counting-0.020
 understanding-0.020
Token knew
Token ID2993
Feature activation+1.37e+01

Pos logit contributions
issy+0.608
ppy+0.531
uffy+0.508
azz+0.496
acker+0.490
Neg logit contributions
 reaching-0.392
 giving-0.386
 understanding-0.360
 helping-0.359
 counting-0.358
Token that
Token ID326
Feature activation-1.89e+00

Pos logit contributions
 reaching+0.175
 giving+0.161
 pointing+0.150
 counting+0.149
 mesmer+0.147
Neg logit contributions
issy-0.392
ppy-0.353
uffy-0.332
azz-0.327
pillar-0.320
Token store
Token ID3650
Feature activation+8.97e+00

Pos logit contributions
issy+0.345
ppy+0.310
azz+0.282
uffy+0.280
acker+0.274
Neg logit contributions
 giving-0.197
 reaching-0.194
 counting-0.189
 showing-0.187
 helping-0.186
Token with
Token ID351
Feature activation+1.67e+01

Pos logit contributions
 reaching+0.131
 giving+0.124
 understanding+0.119
 helping+0.117
 insisting+0.116
Neg logit contributions
issy-0.235
ppy-0.210
uffy-0.202
isk-0.193
azz-0.192
Token the
Token ID262
Feature activation-1.30e+01

Pos logit contributions
 reaching+0.180
 giving+0.172
 helping+0.157
 understanding+0.155
 counting+0.155
Neg logit contributions
issy-0.534
ppy-0.489
uffy-0.463
azz-0.453
pillar-0.448
Token perfume
Token ID40094
Feature activation-2.75e+00

Pos logit contributions
issy+0.326
ppy+0.294
azz+0.266
uffy+0.266
acker+0.257
Neg logit contributions
 reaching-0.226
 giving-0.226
 helping-0.215
 counting-0.213
 understanding-0.211
Token,
Token ID11
Feature activation+5.60e+00

Pos logit contributions
issy+0.076
ppy+0.068
uffy+0.064
azz+0.063
acker+0.062
Neg logit contributions
 reaching-0.041
 giving-0.040
 helping-0.038
 showing-0.037
 counting-0.037
Token giving
Token ID3501
Feature activation+2.63e+01

Pos logit contributions
 Listen+0.054
 understanding+0.052
 succeeding+0.051
 reaching+0.051
 example+0.050
Neg logit contributions
issy-0.146
ppy-0.139
isk-0.135
ens-0.131
uffy-0.129
Token it
Token ID340
Feature activation-1.15e+01

Pos logit contributions
 reaching+0.195
 giving+0.181
 helping+0.158
 understanding+0.155
 counting+0.154
Neg logit contributions
issy-0.922
ppy-0.849
uffy-0.808
azz-0.794
pillar-0.787
Token a
Token ID257
Feature activation-5.59e+00

Pos logit contributions
issy+0.399
ppy+0.367
uffy+0.350
azz+0.343
pillar+0.340
Neg logit contributions
 reaching-0.089
 giving-0.083
 helping-0.073
 understanding-0.072
 counting-0.071
Token little
Token ID1310
Feature activation-5.61e+00

Pos logit contributions
issy+0.141
ppy+0.126
uffy+0.122
pillar+0.121
acker+0.120
Neg logit contributions
 reaching-0.092
 counting-0.082
 giving-0.082
 insisting-0.081
 asking-0.080
Token hug
Token ID16225
Feature activation+1.33e+01

Pos logit contributions
issy+0.124
ppy+0.110
pillar+0.104
uffy+0.103
ens+0.099
Neg logit contributions
 reaching-0.107
 giving-0.098
 spending-0.096
 counting-0.096
 succeeding-0.096
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
 reaching+0.173
 giving+0.167
 helping+0.155
 understanding+0.152
 counting+0.152
Neg logit contributions
issy-0.389
ppy-0.352
uffy-0.333
azz-0.325
pillar-0.321
Token Sam
Token ID3409
Feature activation-2.18e+01

Pos logit contributions
issy+0.009
ppy+0.008
uffy+0.007
acker+0.007
pillar+0.007
Neg logit contributions
 reaching-0.004
 giving-0.004
 counting-0.003
 understanding-0.003
 helping-0.003
Token learned
Token ID4499
Feature activation+1.74e+01

Pos logit contributions
issy+0.573
ppy+0.502
uffy+0.479
azz+0.468
acker+0.461
Neg logit contributions
 reaching-0.357
 giving-0.346
 counting-0.329
 understanding-0.325
 helping-0.324
Token the
Token ID262
Feature activation-1.31e+01

Pos logit contributions
 reaching+0.219
 giving+0.185
 pointing+0.182
 counting+0.178
 mesmer+0.174
Neg logit contributions
issy-0.499
ppy-0.443
azz-0.434
ens-0.417
que-0.416
Token lesson
Token ID11483
Feature activation+7.74e+00

Pos logit contributions
issy+0.344
ppy+0.304
uffy+0.291
pillar+0.288
azz+0.281
Neg logit contributions
 reaching-0.200
 giving-0.186
 helping-0.175
 pointing-0.172
 understanding-0.172
Token of
Token ID286
Feature activation+1.01e+01

Pos logit contributions
 reaching+0.087
 giving+0.084
 helping+0.077
 understanding+0.076
 counting+0.076
Neg logit contributions
issy-0.241
ppy-0.220
uffy-0.208
azz-0.203
pillar-0.201
Token giving
Token ID3501
Feature activation+2.62e+01

Pos logit contributions
 reaching+0.093
 mesmer+0.089
 pointing+0.077
 giving+0.077
 insisting+0.072
Neg logit contributions
issy-0.301
ppy-0.279
isk-0.275
ens-0.261
uffy-0.261
Token,
Token ID11
Feature activation+5.47e+00

Pos logit contributions
 reaching+0.243
 giving+0.239
 understanding+0.222
 helping+0.221
 encouraging+0.212
Neg logit contributions
issy-0.862
ppy-0.783
uffy-0.752
azz-0.734
acker-0.731
Token taking
Token ID2263
Feature activation+2.20e+01

Pos logit contributions
 reaching+0.083
 giving+0.077
 counting+0.076
 pointing+0.076
 packing+0.072
Neg logit contributions
issy-0.140
ppy-0.126
azz-0.117
pillar-0.115
uffy-0.114
Token and
Token ID290
Feature activation+4.24e+00

Pos logit contributions
 reaching+0.307
 giving+0.294
 helping+0.274
 counting+0.272
 understanding+0.271
Neg logit contributions
issy-0.628
ppy-0.567
uffy-0.532
azz-0.521
pillar-0.516
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
 reaching+0.037
 mesmer+0.034
 packing+0.029
 pointing+0.029
 intrigued+0.028
Neg logit contributions
issy-0.116
ppy-0.110
isk-0.107
ens-0.102
x-0.102
Token importance
Token ID6817
Feature activation+1.62e+01

Pos logit contributions
issy+0.342
ppy+0.304
uffy+0.284
azz+0.276
pillar+0.274
Neg logit contributions
 reaching-0.228
 giving-0.221
 helping-0.209
 understanding-0.207
 counting-0.207
Token brave
Token ID14802
Feature activation+7.10e+00

Pos logit contributions
 reaching+0.082
 giving+0.076
 counting+0.073
 pointing+0.071
 helping+0.071
Neg logit contributions
issy-0.158
ppy-0.143
azz-0.133
uffy-0.133
pillar-0.130
Token,
Token ID11
Feature activation+5.48e+00

Pos logit contributions
 reaching+0.102
 giving+0.102
 helping+0.096
 counting+0.094
 understanding+0.094
Neg logit contributions
issy-0.198
ppy-0.176
uffy-0.167
acker-0.164
azz-0.161
Token and
Token ID290
Feature activation+3.98e+00

Pos logit contributions
 reaching+0.082
 giving+0.077
 counting+0.074
 understanding+0.073
 pointing+0.073
Neg logit contributions
issy-0.143
ppy-0.129
azz-0.120
uffy-0.119
pillar-0.118
Token strong
Token ID1913
Feature activation+6.09e+00

Pos logit contributions
 reaching+0.059
 counting+0.051
 giving+0.051
 insisting+0.051
 packing+0.051
Neg logit contributions
issy-0.098
ppy-0.088
azz-0.080
ens-0.080
ocado-0.079
Token without
Token ID1231
Feature activation+1.44e+01

Pos logit contributions
 reaching+0.088
 giving+0.084
 helping+0.079
 understanding+0.078
 counting+0.078
Neg logit contributions
issy-0.171
ppy-0.154
uffy-0.145
azz-0.141
pillar-0.139
Token giving
Token ID3501
Feature activation+2.61e+01

Pos logit contributions
 reaching+0.130
 mesmer+0.124
 giving+0.108
 encouraging+0.108
 Listen+0.107
Neg logit contributions
issy-0.410
ppy-0.401
ens-0.363
isk-0.362
x-0.350
Token up
Token ID510
Feature activation+7.29e+00

Pos logit contributions
 reaching+0.260
 giving+0.249
 helping+0.228
 counting+0.226
 pointing+0.225
Neg logit contributions
issy-0.849
ppy-0.764
uffy-0.721
azz-0.721
acker-0.714
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
 reaching+0.098
 giving+0.093
 helping+0.086
 understanding+0.086
 counting+0.085
Neg logit contributions
issy-0.205
ppy-0.188
uffy-0.177
azz-0.171
pillar-0.170
Token B
Token ID347
Feature activation-1.36e+01

Pos logit contributions
issy+0.037
ppy+0.033
uffy+0.031
acker+0.030
pillar+0.030
Neg logit contributions
 reaching-0.018
 giving-0.017
 counting-0.016
 understanding-0.016
 pointing-0.016
Tokeninky
Token ID29246
Feature activation-1.54e+01

Pos logit contributions
issy+0.226
Mic+0.182
ppy+0.181
acker+0.171
ocado+0.170
Neg logit contributions
 giving-0.283
 to-0.267
 listening-0.264
 reaching-0.263
 encouraging-0.262
Token realized
Token ID6939
Feature activation+1.22e+01

Pos logit contributions
issy+0.393
ppy+0.343
uffy+0.326
azz+0.320
acker+0.315
Neg logit contributions
 reaching-0.258
 giving-0.254
 understanding-0.239
 helping-0.238
 counting-0.237
Token,
Token ID11
Feature activation+5.89e+00

Pos logit contributions
issy+0.320
ppy+0.291
uffy+0.273
azz+0.266
pillar+0.264
Neg logit contributions
 reaching-0.145
 giving-0.139
 helping-0.129
 understanding-0.129
 counting-0.128
Token even
Token ID772
Feature activation+6.32e+00

Pos logit contributions
 reaching+0.079
 giving+0.073
 counting+0.070
 pointing+0.069
 understanding+0.067
Neg logit contributions
issy-0.165
ppy-0.150
uffy-0.139
azz-0.138
pillar-0.136
Token if
Token ID611
Feature activation+7.46e+00

Pos logit contributions
 reaching+0.088
 counting+0.078
 giving+0.078
 pointing+0.078
 packing+0.077
Neg logit contributions
issy-0.168
ppy-0.155
ens-0.138
que-0.137
azz-0.136
Token it
Token ID340
Feature activation-1.15e+01

Pos logit contributions
 reaching+0.120
 mesmer+0.110
 giving+0.106
 counting+0.104
 pointing+0.103
Neg logit contributions
issy-0.178
ppy-0.155
uffy-0.153
ens-0.145
que-0.142
Token means
Token ID1724
Feature activation+9.81e+00

Pos logit contributions
issy+0.253
ppy+0.220
uffy+0.202
azz+0.198
pillar+0.192
Neg logit contributions
 reaching-0.236
 giving-0.231
 helping-0.220
 counting-0.219
 understanding-0.217
Token giving
Token ID3501
Feature activation+2.60e+01

Pos logit contributions
 reaching+0.114
 mesmer+0.107
 insisting+0.094
 encouraging+0.093
 pointing+0.093
Neg logit contributions
issy-0.247
ppy-0.227
uffy-0.225
que-0.206
ens-0.204
Token up
Token ID510
Feature activation+7.33e+00

Pos logit contributions
 reaching+0.292
 giving+0.287
 helping+0.266
 understanding+0.266
 counting+0.252
Neg logit contributions
issy-0.805
ppy-0.728
uffy-0.694
azz-0.685
acker-0.666
Token something
Token ID1223
Feature activation-2.63e+00

Pos logit contributions
 reaching+0.092
 giving+0.087
 helping+0.080
 counting+0.080
 pointing+0.080
Neg logit contributions
issy-0.215
ppy-0.197
uffy-0.185
azz-0.180
pillar-0.179
Token you
Token ID345
Feature activation-9.92e+00

Pos logit contributions
issy+0.065
ppy+0.059
uffy+0.056
pillar+0.055
ches+0.053
Neg logit contributions
 reaching-0.041
 counting-0.037
 pointing-0.037
 giving-0.036
 insisting-0.036
Token really
Token ID1107
Feature activation+9.20e+00

Pos logit contributions
issy+0.216
ppy+0.194
uffy+0.171
azz+0.167
pillar+0.166
Neg logit contributions
 reaching-0.202
 giving-0.194
 counting-0.186
 pointing-0.185
 helping-0.185
Token want
Token ID765
Feature activation+2.26e+01

Pos logit contributions
 reaching+0.174
 giving+0.164
 counting+0.159
 pointing+0.156
 helping+0.155
Neg logit contributions
issy-0.210
ppy-0.193
uffy-0.172
acker-0.167
ens-0.167
Token,
Token ID11
Feature activation+5.96e+00

Pos logit contributions
issy+0.204
ppy+0.190
uffy+0.172
pillar+0.169
azz+0.166
Neg logit contributions
 reaching-0.122
 giving-0.111
 counting-0.110
 pointing-0.108
 insisting-0.106
Token even
Token ID772
Feature activation+6.22e+00

Pos logit contributions
 reaching+0.081
 giving+0.075
 counting+0.072
 pointing+0.072
 helping+0.070
Neg logit contributions
issy-0.167
ppy-0.151
uffy-0.140
azz-0.140
pillar-0.137
Token if
Token ID611
Feature activation+7.40e+00

Pos logit contributions
 reaching+0.086
 counting+0.078
 giving+0.077
 insisting+0.076
 packing+0.076
Neg logit contributions
issy-0.164
ppy-0.153
que-0.137
azz-0.136
ens-0.136
Token it
Token ID340
Feature activation-1.14e+01

Pos logit contributions
 reaching+0.126
 mesmer+0.119
 counting+0.113
 giving+0.112
 insisting+0.111
Neg logit contributions
issy-0.163
ppy-0.143
uffy-0.137
ens-0.132
que-0.130
Token means
Token ID1724
Feature activation+9.82e+00

Pos logit contributions
issy+0.250
ppy+0.219
uffy+0.201
azz+0.195
pillar+0.191
Neg logit contributions
 reaching-0.234
 giving-0.228
 helping-0.218
 understanding-0.217
 counting-0.217
Token giving
Token ID3501
Feature activation+2.59e+01

Pos logit contributions
 reaching+0.111
 mesmer+0.106
 insisting+0.092
 pointing+0.091
 cheering+0.091
Neg logit contributions
issy-0.243
ppy-0.224
uffy-0.218
que-0.206
pillar-0.202
Token up
Token ID510
Feature activation+7.36e+00

Pos logit contributions
 reaching+0.298
 giving+0.291
 understanding+0.269
 helping+0.269
 counting+0.258
Neg logit contributions
issy-0.797
ppy-0.722
uffy-0.687
azz-0.674
acker-0.659
Token something
Token ID1223
Feature activation-2.56e+00

Pos logit contributions
 reaching+0.093
 giving+0.087
 counting+0.080
 pointing+0.080
 helping+0.080
Neg logit contributions
issy-0.215
ppy-0.197
uffy-0.185
azz-0.180
pillar-0.179
Token that
Token ID326
Feature activation-1.65e+00

Pos logit contributions
issy+0.065
ppy+0.059
uffy+0.056
pillar+0.055
que+0.053
Neg logit contributions
 reaching-0.038
 counting-0.035
 pointing-0.034
 giving-0.034
 packing-0.033
Token you
Token ID345
Feature activation-1.00e+01

Pos logit contributions
issy+0.038
ppy+0.034
uffy+0.031
azz+0.030
pillar+0.030
Neg logit contributions
 reaching-0.031
 giving-0.030
 understanding-0.029
 helping-0.029
 counting-0.029
Token have
Token ID423
Feature activation+9.96e+00

Pos logit contributions
issy+0.222
ppy+0.196
uffy+0.178
azz+0.173
pillar+0.171
Neg logit contributions
 reaching-0.202
 giving-0.196
 helping-0.187
 counting-0.187
 understanding-0.186
Token put
Token ID1234
Feature activation+1.64e+01

Pos logit contributions
 reaching+0.166
 giving+0.161
 counting+0.146
 understanding+0.145
 listening+0.144
Neg logit contributions
issy-0.336
ppy-0.307
uffy-0.288
acky-0.274
azz-0.272
Token.
Token ID13
Feature activation-2.44e+00

Pos logit contributions
 reaching+0.177
 giving+0.168
 helping+0.156
 counting+0.151
 showing+0.150
Neg logit contributions
issy-0.525
ppy-0.481
uffy-0.457
azz-0.453
pillar-0.442
Token
Token ID198
Feature activation-4.17e+00

Pos logit contributions
issy+0.063
ppy+0.053
uffy+0.052
azz+0.051
pillar+0.049
Neg logit contributions
 reaching-0.041
 giving-0.040
 listening-0.038
 helping-0.038
 packing-0.037
Token
Token ID198
Feature activation-4.12e+00

Pos logit contributions
issy+0.116
ppy+0.103
uffy+0.098
azz+0.096
pillar+0.094
Neg logit contributions
 reaching-0.061
 giving-0.059
 helping-0.055
 counting-0.055
 understanding-0.054
TokenSu
Token ID5606
Feature activation-1.64e+01

Pos logit contributions
issy+0.099
ppy+0.087
uffy+0.080
pillar+0.078
azz+0.078
Neg logit contributions
 reaching-0.074
 giving-0.074
 counting-0.069
 helping-0.069
 pointing-0.068
Tokenzie
Token ID49746
Feature activation-2.69e+01

Pos logit contributions
issy+0.261
ppy+0.228
ocado+0.218
azz+0.209
pillar+0.209
Neg logit contributions
 reaching-0.406
 giving-0.390
 counting-0.380
 asking-0.375
 listening-0.374
Token tried
Token ID3088
Feature activation+1.77e+01

Pos logit contributions
issy+0.619
ppy+0.549
uffy+0.503
pillar+0.494
azz+0.488
Neg logit contributions
 reaching-0.506
 giving-0.486
 helping-0.469
 understanding-0.463
 counting-0.457
Token again
Token ID757
Feature activation+3.46e+00

Pos logit contributions
 reaching+0.209
 packing+0.203
 offering+0.199
 loading+0.197
 giving+0.197
Neg logit contributions
issy-0.394
azz-0.373
ppy-0.373
que-0.359
ens-0.357
Token and
Token ID290
Feature activation+2.38e+00

Pos logit contributions
 reaching+0.057
 giving+0.056
 insisting+0.055
 packing+0.054
 spending+0.054
Neg logit contributions
issy-0.076
ppy-0.071
pillar-0.064
azz-0.064
uffy-0.062
Token again
Token ID757
Feature activation+3.50e+00

Pos logit contributions
 packing+0.034
 reaching+0.034
 giving+0.032
 warning+0.031
 insisting+0.031
Neg logit contributions
issy-0.059
ppy-0.053
azz-0.049
pillar-0.048
ens-0.048
Token,
Token ID11
Feature activation+3.47e+00

Pos logit contributions
 reaching+0.054
 giving+0.052
 packing+0.051
 insisting+0.050
 spending+0.050
Neg logit contributions
issy-0.087
ppy-0.079
uffy-0.072
azz-0.071
pillar-0.071
Token.
Token ID13
Feature activation-2.75e+00

Pos logit contributions
issy+0.402
ppy+0.373
uffy+0.335
azz+0.330
ens+0.321
Neg logit contributions
 reaching-0.237
 giving-0.222
 spending-0.214
 helping-0.213
 testing-0.211
Token
Token ID220
Feature activation-3.59e+00

Pos logit contributions
issy+0.067
ppy+0.057
uffy+0.056
azz+0.056
pillar+0.052
Neg logit contributions
 reaching-0.045
 listening-0.044
 giving-0.044
 packing-0.043
 succeeding-0.043
Token
Token ID198
Feature activation-4.33e+00

Pos logit contributions
issy+0.108
ppy+0.099
uffy+0.092
pillar+0.091
azz+0.090
Neg logit contributions
 reaching-0.043
 giving-0.041
 helping-0.038
 pointing-0.038
 understanding-0.037
Token
Token ID198
Feature activation-4.31e+00

Pos logit contributions
issy+0.124
ppy+0.111
uffy+0.105
azz+0.103
pillar+0.101
Neg logit contributions
 reaching-0.060
 giving-0.058
 helping-0.054
 counting-0.053
 understanding-0.053
TokenSu
Token ID5606
Feature activation-1.67e+01

Pos logit contributions
issy+0.098
ppy+0.086
uffy+0.079
azz+0.077
pillar+0.076
Neg logit contributions
 reaching-0.084
 giving-0.083
 counting-0.078
 helping-0.078
 listening-0.077
Tokenzie
Token ID49746
Feature activation-2.69e+01

Pos logit contributions
issy+0.269
ocado+0.237
ppy+0.235
uffy+0.226
azz+0.221
Neg logit contributions
 reaching-0.393
 giving-0.379
 asking-0.367
 encouraging-0.365
 counting-0.365
Token smiled
Token ID13541
Feature activation+7.43e+00

Pos logit contributions
issy+0.686
ppy+0.616
uffy+0.570
azz+0.554
pillar+0.551
Neg logit contributions
 reaching-0.447
 giving-0.432
 helping-0.410
 understanding-0.405
 counting-0.405
Token.
Token ID13
Feature activation-2.69e+00

Pos logit contributions
 reaching+0.137
 counting+0.131
 testing+0.129
 giving+0.128
 spending+0.127
Neg logit contributions
issy-0.157
ppy-0.151
azz-0.130
uffy-0.128
que-0.126
Token She
Token ID1375
Feature activation-1.63e+01

Pos logit contributions
issy+0.065
ppy+0.055
uffy+0.054
azz+0.053
pillar+0.052
Neg logit contributions
 reaching-0.046
 giving-0.044
 listening-0.043
 warning-0.042
 understanding-0.042
Token and
Token ID290
Feature activation+2.17e+00

Pos logit contributions
issy+0.385
ppy+0.345
uffy+0.313
pillar+0.307
azz+0.303
Neg logit contributions
 reaching-0.301
 giving-0.290
 helping-0.278
 counting-0.277
 pointing-0.272
Token her
Token ID607
Feature activation-1.71e+01

Pos logit contributions
 reaching+0.030
 giving+0.028
 counting+0.026
 helping+0.026
 understanding+0.026
Neg logit contributions
issy-0.062
ppy-0.056
uffy-0.052
azz-0.051
ens-0.050
Token ended
Token ID4444
Feature activation+1.42e+01

Pos logit contributions
 reaching+0.144
 understanding+0.135
 giving+0.134
 packing+0.134
 counting+0.133
Neg logit contributions
issy-0.193
ppy-0.176
uffy-0.156
ens-0.156
azz-0.154
Token!
Token ID0
Feature activation-1.40e+00

Pos logit contributions
 reaching+0.176
 giving+0.168
 understanding+0.158
 helping+0.156
 counting+0.152
Neg logit contributions
issy-0.417
ppy-0.382
uffy-0.364
azz-0.347
pillar-0.345
Token
Token ID198
Feature activation-4.20e+00

Pos logit contributions
issy+0.042
ppy+0.039
uffy+0.036
azz+0.035
pillar+0.035
Neg logit contributions
 reaching-0.017
 giving-0.016
 understanding-0.015
 helping-0.015
 counting-0.015
Token
Token ID198
Feature activation-4.18e+00

Pos logit contributions
issy+0.118
ppy+0.107
uffy+0.100
azz+0.098
pillar+0.097
Neg logit contributions
 reaching-0.060
 giving-0.058
 helping-0.054
 understanding-0.054
 counting-0.053
TokenSu
Token ID5606
Feature activation-1.66e+01

Pos logit contributions
issy+0.103
ppy+0.091
uffy+0.085
azz+0.082
pillar+0.081
Neg logit contributions
 reaching-0.075
 giving-0.073
 helping-0.069
 understanding-0.069
 counting-0.068
Tokenzie
Token ID49746
Feature activation-2.69e+01

Pos logit contributions
issy+0.265
ocado+0.237
ppy+0.235
azz+0.221
uffy+0.221
Neg logit contributions
 reaching-0.387
 giving-0.377
 listening-0.362
 understanding-0.359
 encouraging-0.359
Token was
Token ID373
Feature activation+7.70e+00

Pos logit contributions
issy+0.675
ppy+0.601
uffy+0.560
azz+0.544
pillar+0.538
Neg logit contributions
 reaching-0.461
 giving-0.447
 helping-0.423
 understanding-0.421
 counting-0.419
Token sad
Token ID6507
Feature activation+6.24e+00

Pos logit contributions
 reaching+0.100
 counting+0.091
 pointing+0.089
 giving+0.087
 asking+0.087
Neg logit contributions
issy-0.199
ppy-0.184
uffy-0.167
c-0.164
isk-0.163
Token that
Token ID326
Feature activation-2.44e+00

Pos logit contributions
 reaching+0.098
 giving+0.093
 pointing+0.090
 showing+0.090
 counting+0.090
Neg logit contributions
issy-0.156
ppy-0.142
uffy-0.136
azz-0.129
ens-0.128
Token she
Token ID673
Feature activation-1.74e+01

Pos logit contributions
issy+0.062
ppy+0.054
uffy+0.051
ens+0.050
azz+0.049
Neg logit contributions
 reaching-0.039
 giving-0.036
 counting-0.035
 joining-0.034
 pointing-0.034
Token no
Token ID645
Feature activation+3.50e+00

Pos logit contributions
issy+0.423
ppy+0.366
uffy+0.351
azz+0.347
acker+0.340
Neg logit contributions
 giving-0.307
 reaching-0.306
 helping-0.292
 understanding-0.290
 showing-0.289
Token being
Token ID852
Feature activation+6.96e+00

Pos logit contributions
 reaching+0.094
 mesmer+0.085
 pointing+0.082
 insisting+0.080
 nodding+0.079
Neg logit contributions
issy-0.216
ppy-0.202
isk-0.196
que-0.184
hell-0.182
Token fair
Token ID3148
Feature activation+3.01e+00

Pos logit contributions
 reaching+0.103
 counting+0.092
 packing+0.090
 insisting+0.089
 pointing+0.089
Neg logit contributions
issy-0.174
ppy-0.170
ocado-0.150
pillar-0.146
c-0.142
Token.
Token ID13
Feature activation-2.50e+00

Pos logit contributions
 reaching+0.044
 giving+0.041
 understanding+0.039
 helping+0.039
 counting+0.039
Neg logit contributions
issy-0.083
ppy-0.076
uffy-0.071
azz-0.069
pillar-0.068
Token<|endoftext|>
Token ID50256
Feature activation-4.93e+00

Pos logit contributions
issy+0.078
ppy+0.074
uffy+0.069
azz+0.066
x+0.066
Neg logit contributions
 reaching-0.025
 giving-0.025
 understanding-0.023
 helping-0.023
 spending-0.022
TokenSu
Token ID5606
Feature activation-1.66e+01

Pos logit contributions
issy+0.142
ppy+0.131
uffy+0.118
x+0.116
azz+0.115
Neg logit contributions
 reaching-0.065
 giving-0.062
 pointing-0.060
 counting-0.060
 spending-0.058
Tokenzie
Token ID49746
Feature activation-2.68e+01

Pos logit contributions
issy+0.262
ppy+0.236
ocado+0.223
acky+0.207
uffy+0.206
Neg logit contributions
 reaching-0.403
 giving-0.393
 counting-0.382
 asking-0.377
 listening-0.374
Token loved
Token ID6151
Feature activation+1.79e+01

Pos logit contributions
issy+0.733
ppy+0.660
uffy+0.618
azz+0.613
acker+0.602
Neg logit contributions
 reaching-0.400
 giving-0.390
 helping-0.365
 counting-0.363
 pointing-0.357
Token to
Token ID284
Feature activation+9.75e+00

Pos logit contributions
 reaching+0.207
 understanding+0.183
 warning+0.174
 encouraging+0.173
 succeeding+0.171
Neg logit contributions
issy-0.475
ens-0.417
ppy-0.412
pillar-0.406
uffy-0.402
Token swim
Token ID9422
Feature activation+5.93e+00

Pos logit contributions
 reaching+0.156
 giving+0.146
 insisting+0.144
 understanding+0.139
 succeeding+0.138
Neg logit contributions
issy-0.239
ppy-0.211
azz-0.191
ens-0.188
pillar-0.186
Token.
Token ID13
Feature activation-1.94e+00

Pos logit contributions
 reaching+0.061
 giving+0.058
 helping+0.053
 understanding+0.052
 counting+0.051
Neg logit contributions
issy-0.191
ppy-0.174
uffy-0.165
azz-0.162
pillar-0.160
Token She
Token ID1375
Feature activation-1.60e+01

Pos logit contributions
issy+0.051
ppy+0.047
uffy+0.044
azz+0.042
pillar+0.042
Neg logit contributions
 reaching-0.031
 giving-0.029
 counting-0.028
 helping-0.028
 spending-0.027
Token.
Token ID13
Feature activation-2.96e+00

Pos logit contributions
 reaching+0.033
 giving+0.030
 understanding+0.030
 helping+0.029
 counting+0.029
Neg logit contributions
issy-0.062
ppy-0.055
uffy-0.052
ens-0.051
azz-0.051
Token
Token ID220
Feature activation-3.57e+00

Pos logit contributions
issy+0.079
ppy+0.070
uffy+0.066
azz+0.064
pillar+0.063
Neg logit contributions
 reaching-0.047
 giving-0.045
 helping-0.042
 understanding-0.042
 counting-0.042
Token
Token ID198
Feature activation-4.39e+00

Pos logit contributions
issy+0.109
ppy+0.101
pillar+0.093
uffy+0.092
acker+0.091
Neg logit contributions
 reaching-0.041
 giving-0.039
 helping-0.036
 counting-0.036
 pointing-0.036
Token
Token ID198
Feature activation-4.33e+00

Pos logit contributions
issy+0.123
ppy+0.110
uffy+0.104
azz+0.102
pillar+0.100
Neg logit contributions
 reaching-0.063
 giving-0.061
 helping-0.057
 counting-0.056
 understanding-0.056
TokenSu
Token ID5606
Feature activation-1.64e+01

Pos logit contributions
issy+0.099
ppy+0.086
uffy+0.080
azz+0.077
pillar+0.076
Neg logit contributions
 reaching-0.085
 giving-0.083
 helping-0.079
 counting-0.079
 pointing-0.078
Tokenzie
Token ID49746
Feature activation-2.68e+01

Pos logit contributions
issy+0.266
ppy+0.236
ocado+0.228
azz+0.225
pillar+0.219
Neg logit contributions
 reaching-0.390
 giving-0.379
 listening-0.364
 asking-0.364
 understanding-0.363
Token was
Token ID373
Feature activation+7.54e+00

Pos logit contributions
issy+0.653
ppy+0.579
uffy+0.538
azz+0.522
pillar+0.516
Neg logit contributions
 reaching-0.481
 giving-0.467
 helping-0.444
 understanding-0.440
 counting-0.439
Token so
Token ID523
Feature activation+7.12e+00

Pos logit contributions
 reaching+0.096
 diving+0.086
 Listen+0.086
 succeeding+0.086
 giving+0.086
Neg logit contributions
issy-0.182
ppy-0.169
uffy-0.157
illo-0.151
isk-0.150
Token disappointed
Token ID11679
Feature activation+9.74e+00

Pos logit contributions
 reaching+0.089
 counting+0.088
 showing+0.082
 giving+0.082
 adding+0.081
Neg logit contributions
issy-0.181
ppy-0.167
uffy-0.165
ocado-0.160
acky-0.152
Token and
Token ID290
Feature activation+2.23e+00

Pos logit contributions
 reaching+0.146
 giving+0.141
 helping+0.132
 counting+0.132
 showing+0.132
Neg logit contributions
issy-0.257
ppy-0.230
uffy-0.222
azz-0.211
ens-0.209
Token tried
Token ID3088
Feature activation+1.77e+01

Pos logit contributions
 reaching+0.035
 giving+0.033
 counting+0.032
 helping+0.031
 joining+0.031
Neg logit contributions
issy-0.057
ppy-0.051
uffy-0.047
ens-0.047
azz-0.045
Token too
Token ID1165
Feature activation+2.72e+00

Pos logit contributions
issy+0.147
ppy+0.134
uffy+0.123
azz+0.123
ens+0.121
Neg logit contributions
 reaching-0.085
 understanding-0.081
 giving-0.081
 insisting-0.078
 helping-0.076
Token.
Token ID13
Feature activation-2.77e+00

Pos logit contributions
 reaching+0.046
 giving+0.043
 insisting+0.043
 understanding+0.043
 helping+0.041
Neg logit contributions
issy-0.065
ppy-0.058
uffy-0.055
azz-0.054
pillar-0.053
Token
Token ID198
Feature activation-4.05e+00

Pos logit contributions
issy+0.071
ppy+0.063
uffy+0.059
azz+0.058
pillar+0.057
Neg logit contributions
 reaching-0.046
 giving-0.045
 helping-0.042
 understanding-0.042
 counting-0.042
Token
Token ID198
Feature activation-3.99e+00

Pos logit contributions
issy+0.103
ppy+0.090
uffy+0.087
azz+0.084
pillar+0.082
Neg logit contributions
 giving-0.066
 reaching-0.065
 helping-0.061
 showing-0.061
 counting-0.060
TokenSu
Token ID5606
Feature activation-1.65e+01

Pos logit contributions
issy+0.086
ppy+0.073
uffy+0.066
pillar+0.065
azz+0.064
Neg logit contributions
 giving-0.081
 reaching-0.079
 helping-0.076
 counting-0.076
 packing-0.076
Tokenzie
Token ID49746
Feature activation-2.68e+01

Pos logit contributions
issy+0.264
ppy+0.224
azz+0.212
ocado+0.211
pillar+0.207
Neg logit contributions
 reaching-0.416
 giving-0.395
 counting-0.387
 listening-0.382
 asking-0.381
Token loved
Token ID6151
Feature activation+1.77e+01

Pos logit contributions
issy+0.649
ppy+0.586
uffy+0.534
azz+0.523
pillar+0.519
Neg logit contributions
 reaching-0.473
 giving-0.455
 helping-0.437
 understanding-0.433
 counting-0.426
Token the
Token ID262
Feature activation-1.42e+01

Pos logit contributions
 reaching+0.189
 understanding+0.175
 insisting+0.163
 encouraging+0.157
 succeeding+0.154
Neg logit contributions
issy-0.471
ppy-0.434
ens-0.425
azz-0.420
isk-0.410
Token magazine
Token ID7093
Feature activation-9.44e+00

Pos logit contributions
issy+0.283
ppy+0.249
uffy+0.225
azz+0.214
pillar+0.206
Neg logit contributions
 reaching-0.321
 giving-0.319
 helping-0.307
 showing-0.303
 understanding-0.303
Token so
Token ID523
Feature activation+7.01e+00

Pos logit contributions
issy+0.281
ppy+0.255
uffy+0.241
azz+0.236
pillar+0.233
Neg logit contributions
 reaching-0.119
 giving-0.114
 helping-0.106
 understanding-0.104
 counting-0.104
Token much
Token ID881
Feature activation+3.79e+00

Pos logit contributions
 reaching+0.093
 giving+0.089
 helping+0.083
 understanding+0.082
 counting+0.082
Neg logit contributions
issy-0.206
ppy-0.187
uffy-0.175
azz-0.172
pillar-0.169
Token sky
Token ID6766
Feature activation-6.37e+00

Pos logit contributions
issy+0.350
ppy+0.314
uffy+0.286
azz+0.283
acker+0.274
Neg logit contributions
 reaching-0.271
 giving-0.263
 helping-0.251
 understanding-0.250
 counting-0.250
Token?
Token ID30
Feature activation-2.36e+00

Pos logit contributions
issy+0.167
ppy+0.152
uffy+0.141
azz+0.137
pillar+0.135
Neg logit contributions
 reaching-0.099
 giving-0.096
 understanding-0.091
 helping-0.090
 pointing-0.090
Tokenâ
Token ID22940
Feature activation-7.16e+00

Pos logit contributions
issy+0.059
ppy+0.053
uffy+0.049
azz+0.048
pillar+0.048
Neg logit contributions
 reaching-0.040
 giving-0.039
 helping-0.037
 counting-0.037
 understanding-0.037
Token
Token ID26391
Feature activation-1.65e+01

Pos logit contributions
issy+0.137
ppy+0.119
uffy+0.112
azz+0.108
acker+0.100
Neg logit contributions
 reaching-0.162
 giving-0.157
 listening-0.152
 understanding-0.151
 encouraging-0.150
Token asked
Token ID1965
Feature activation+1.09e+01

Pos logit contributions
issy+0.408
ppy+0.362
que+0.321
acky+0.320
acker+0.318
Neg logit contributions
 reaching-0.292
 giving-0.278
 counting-0.273
 pointing-0.272
 helping-0.268
Token Sunny
Token ID32241
Feature activation-2.68e+01

Pos logit contributions
 reaching+0.136
 giving+0.130
 helping+0.121
 understanding+0.119
 counting+0.118
Neg logit contributions
issy-0.329
ppy-0.296
uffy-0.280
azz-0.276
pillar-0.272
Token.
Token ID13
Feature activation-3.02e+00

Pos logit contributions
issy+0.782
ppy+0.733
uffy+0.681
azz+0.662
pillar+0.648
Neg logit contributions
 reaching-0.337
 giving-0.320
 understanding-0.298
 helping-0.294
 showing-0.292
Token
Token ID198
Feature activation-4.43e+00

Pos logit contributions
issy+0.079
ppy+0.072
uffy+0.067
azz+0.066
pillar+0.065
Neg logit contributions
 reaching-0.047
 giving-0.045
 helping-0.043
 understanding-0.043
 counting-0.043
Token
Token ID198
Feature activation-4.38e+00

Pos logit contributions
issy+0.122
ppy+0.110
uffy+0.104
azz+0.102
pillar+0.101
Neg logit contributions
 reaching-0.065
 giving-0.063
 helping-0.059
 counting-0.059
 understanding-0.058
TokenThe
Token ID464
Feature activation-1.46e+01

Pos logit contributions
issy+0.097
ppy+0.086
uffy+0.079
azz+0.076
pillar+0.076
Neg logit contributions
 reaching-0.088
 giving-0.086
 helping-0.082
 counting-0.082
 understanding-0.081
Token girl
Token ID2576
Feature activation-2.15e+01

Pos logit contributions
issy+0.346
ppy+0.306
uffy+0.282
azz+0.275
pillar+0.269
Neg logit contributions
 reaching-0.278
 giving-0.271
 helping-0.258
 understanding-0.257
 counting-0.256
Token all
Token ID477
Feature activation+9.02e-01

Pos logit contributions
issy+0.484
ppy+0.437
uffy+0.409
azz+0.400
pillar+0.396
Neg logit contributions
 reaching-0.249
 giving-0.239
 helping-0.224
 understanding-0.223
 counting-0.222
Token along
Token ID1863
Feature activation+9.41e+00

Pos logit contributions
 reaching+0.013
 giving+0.012
 pointing+0.011
 counting+0.011
 understanding+0.011
Neg logit contributions
issy-0.024
ppy-0.022
pillar-0.020
uffy-0.020
ens-0.020
Token.
Token ID13
Feature activation-3.13e+00

Pos logit contributions
 reaching+0.103
 giving+0.095
 pointing+0.091
 understanding+0.088
 packing+0.087
Neg logit contributions
issy-0.276
ppy-0.253
pillar-0.235
uffy-0.235
isk-0.231
Token<|endoftext|>
Token ID50256
Feature activation-5.09e+00

Pos logit contributions
issy+0.092
ppy+0.084
uffy+0.079
azz+0.076
acker+0.075
Neg logit contributions
 reaching-0.040
 giving-0.039
 understanding-0.037
 helping-0.036
 counting-0.035
TokenSu
Token ID5606
Feature activation-1.67e+01

Pos logit contributions
issy+0.144
ppy+0.131
uffy+0.118
azz+0.116
x+0.115
Neg logit contributions
 reaching-0.073
 giving-0.069
 pointing-0.066
 counting-0.066
 helping-0.065
Tokenzie
Token ID49746
Feature activation-2.68e+01

Pos logit contributions
issy+0.264
ppy+0.234
ocado+0.218
pillar+0.203
acky+0.201
Neg logit contributions
 reaching-0.418
 giving-0.403
 counting-0.394
 asking-0.390
 insisting-0.387
Token was
Token ID373
Feature activation+7.20e+00

Pos logit contributions
issy+0.706
ppy+0.632
uffy+0.590
azz+0.579
acker+0.568
Neg logit contributions
 reaching-0.427
 giving-0.415
 helping-0.391
 counting-0.387
 understanding-0.384
Token a
Token ID257
Feature activation-6.68e+00

Pos logit contributions
 reaching+0.081
 giving+0.078
 understanding+0.072
 warning+0.069
 succeeding+0.068
Neg logit contributions
issy-0.202
ppy-0.184
uffy-0.180
acky-0.167
illo-0.166
Token 3
Token ID513
Feature activation-3.14e+00

Pos logit contributions
issy+0.201
ppy+0.181
uffy+0.170
azz+0.170
acker+0.166
Neg logit contributions
 reaching-0.082
 giving-0.080
 understanding-0.076
 helping-0.076
 counting-0.073
Token year
Token ID614
Feature activation+2.63e+00

Pos logit contributions
issy+0.073
ppy+0.065
uffy+0.060
azz+0.058
pillar+0.057
Neg logit contributions
 reaching-0.060
 giving-0.059
 helping-0.056
 understanding-0.056
 counting-0.055
Token old
Token ID1468
Feature activation-7.59e+00

Pos logit contributions
 reaching+0.034
 giving+0.032
 helping+0.030
 counting+0.029
 understanding+0.029
Neg logit contributions
issy-0.077
ppy-0.070
uffy-0.066
azz-0.065
pillar-0.064
Token said
Token ID531
Feature activation+4.94e+00

Pos logit contributions
issy+0.467
ppy+0.423
pillar+0.382
azz+0.372
uffy+0.369
Neg logit contributions
 reaching-0.381
 spending-0.369
 giving-0.365
 packing-0.362
 helping-0.357
Token.
Token ID13
Feature activation-2.57e+00

Pos logit contributions
 reaching+0.085
 diving+0.084
 spending+0.084
 counting+0.082
 packing+0.081
Neg logit contributions
issy-0.105
ppy-0.098
uffy-0.085
hell-0.083
azz-0.082
Token
Token ID198
Feature activation-4.17e+00

Pos logit contributions
issy+0.058
ppy+0.048
uffy+0.047
azz+0.045
pillar+0.045
Neg logit contributions
 reaching-0.050
 giving-0.049
 listening-0.047
 packing-0.047
 diving-0.046
Token
Token ID198
Feature activation-4.14e+00

Pos logit contributions
issy+0.118
ppy+0.105
uffy+0.100
azz+0.097
pillar+0.096
Neg logit contributions
 reaching-0.060
 giving-0.058
 helping-0.054
 counting-0.053
 understanding-0.053
TokenSu
Token ID5606
Feature activation-1.62e+01

Pos logit contributions
issy+0.095
ppy+0.083
uffy+0.076
pillar+0.075
azz+0.074
Neg logit contributions
 reaching-0.080
 giving-0.079
 counting-0.074
 helping-0.074
 packing-0.073
Tokenzie
Token ID49746
Feature activation-2.67e+01

Pos logit contributions
issy+0.250
ppy+0.215
ocado+0.195
azz+0.195
uffy+0.194
Neg logit contributions
 reaching-0.417
 giving-0.401
 counting-0.391
 helping-0.388
 understanding-0.387
Token's
Token ID338
Feature activation-4.99e+00

Pos logit contributions
issy+0.641
ppy+0.575
uffy+0.525
pillar+0.519
azz+0.511
Neg logit contributions
 reaching-0.472
 giving-0.456
 helping-0.438
 spending-0.429
 counting-0.427
Token dad
Token ID9955
Feature activation-1.43e+01

Pos logit contributions
issy+0.139
ppy+0.125
uffy+0.121
pillar+0.117
azz+0.115
Neg logit contributions
 reaching-0.070
 giving-0.066
 counting-0.060
 pointing-0.060
 listening-0.060
Token smiled
Token ID13541
Feature activation+7.76e+00

Pos logit contributions
issy+0.349
ppy+0.312
uffy+0.282
azz+0.279
pillar+0.279
Neg logit contributions
 reaching-0.250
 giving-0.239
 helping-0.229
 counting-0.227
 spending-0.226
Token and
Token ID290
Feature activation+2.29e+00

Pos logit contributions
 reaching+0.139
 counting+0.134
 testing+0.131
 giving+0.130
 spending+0.130
Neg logit contributions
issy-0.169
ppy-0.160
azz-0.141
uffy-0.137
pillar-0.136
Token said
Token ID531
Feature activation+4.82e+00

Pos logit contributions
 reaching+0.044
 counting+0.041
 giving+0.040
 packing+0.040
 insisting+0.040
Neg logit contributions
issy-0.051
ppy-0.045
uffy-0.041
azz-0.040
ens-0.040
Token Lily
Token ID20037
Feature activation-2.47e+01

Pos logit contributions
 reaching+0.123
 giving+0.123
 helping+0.117
 counting+0.115
 pointing+0.115
Neg logit contributions
issy-0.174
ppy-0.152
uffy-0.140
acker-0.138
ase-0.136
Token.
Token ID13
Feature activation-1.91e+00

Pos logit contributions
issy+0.819
ppy+0.696
uffy+0.686
acker+0.686
pillar+0.680
Neg logit contributions
 giving-0.238
 helping-0.226
 reaching-0.221
 to-0.204
 listening-0.204
Token And
Token ID843
Feature activation+1.58e+00

Pos logit contributions
issy+0.054
ppy+0.048
uffy+0.045
acker+0.044
acky+0.044
Neg logit contributions
 giving-0.026
 reaching-0.026
 helping-0.024
 pointing-0.024
 counting-0.024
Token this
Token ID428
Feature activation-1.16e+01

Pos logit contributions
 reaching+0.024
 giving+0.023
 helping+0.021
 counting+0.021
 understanding+0.021
Neg logit contributions
issy-0.042
ppy-0.038
uffy-0.036
azz-0.035
pillar-0.034
Token is
Token ID318
Feature activation+6.95e+00

Pos logit contributions
issy+0.285
ppy+0.253
uffy+0.228
azz+0.222
acker+0.220
Neg logit contributions
 giving-0.211
 reaching-0.209
 showing-0.197
 helping-0.196
 counting-0.195
Token Sunny
Token ID32241
Feature activation-2.67e+01

Pos logit contributions
 reaching+0.085
 giving+0.082
 helping+0.075
 counting+0.074
 understanding+0.074
Neg logit contributions
issy-0.211
ppy-0.191
uffy-0.181
azz-0.176
pillar-0.176
Token.
Token ID13
Feature activation-2.01e+00

Pos logit contributions
issy+0.816
ppy+0.710
azz+0.684
uffy+0.679
pillar+0.668
Neg logit contributions
 reaching-0.324
 giving-0.324
 helping-0.306
 listening-0.291
 showing-0.290
Token We
Token ID775
Feature activation-3.54e+00

Pos logit contributions
issy+0.056
ppy+0.049
uffy+0.047
ase+0.045
x+0.045
Neg logit contributions
 giving-0.029
 reaching-0.029
 helping-0.026
 counting-0.026
 pointing-0.026
Token are
Token ID389
Feature activation+6.47e+00

Pos logit contributions
issy+0.095
uffy+0.081
ppy+0.079
ase+0.078
azz+0.077
Neg logit contributions
 giving-0.053
 reaching-0.053
 helping-0.050
 showing-0.049
 counting-0.049
Token happy
Token ID3772
Feature activation+7.06e+00

Pos logit contributions
 reaching+0.088
 giving+0.081
 understanding+0.079
 insisting+0.076
 counting+0.076
Neg logit contributions
issy-0.180
ppy-0.167
uffy-0.155
azz-0.151
pillar-0.146
Token to
Token ID284
Feature activation+9.37e+00

Pos logit contributions
 reaching+0.107
 insisting+0.100
 pointing+0.099
 understanding+0.097
 mesmer+0.097
Neg logit contributions
issy-0.161
ppy-0.158
uffy-0.148
ens-0.140
pillar-0.140
Token From
Token ID3574
Feature activation+5.65e+00

Pos logit contributions
issy+0.078
ppy+0.070
uffy+0.065
azz+0.065
pillar+0.063
Neg logit contributions
 reaching-0.033
 giving-0.033
 helping-0.031
 understanding-0.030
 pointing-0.029
Token that
Token ID326
Feature activation-2.65e+00

Pos logit contributions
 reaching+0.066
 giving+0.063
 counting+0.058
 helping+0.057
 pointing+0.056
Neg logit contributions
issy-0.171
ppy-0.157
uffy-0.150
ens-0.143
pillar-0.143
Token day
Token ID1110
Feature activation+1.22e+00

Pos logit contributions
issy+0.062
ppy+0.054
uffy+0.051
pillar+0.049
ens+0.048
Neg logit contributions
 reaching-0.050
 giving-0.048
 pointing-0.046
 understanding-0.046
 counting-0.045
Token on
Token ID319
Feature activation+1.10e+01

Pos logit contributions
 reaching+0.013
 giving+0.013
 understanding+0.012
 counting+0.012
 showing+0.011
Neg logit contributions
issy-0.038
ppy-0.035
uffy-0.033
azz-0.032
pillar-0.032
Token,
Token ID11
Feature activation+3.68e+00

Pos logit contributions
 reaching+0.137
 giving+0.131
 helping+0.124
 counting+0.121
 pointing+0.117
Neg logit contributions
issy-0.334
ppy-0.303
azz-0.282
uffy-0.280
acker-0.275
Token Sunny
Token ID32241
Feature activation-2.67e+01

Pos logit contributions
 reaching+0.061
 giving+0.059
 understanding+0.056
 counting+0.056
 helping+0.056
Neg logit contributions
issy-0.093
ppy-0.084
uffy-0.079
pillar-0.076
azz-0.076
Token knew
Token ID2993
Feature activation+1.24e+01

Pos logit contributions
issy+0.687
ppy+0.604
uffy+0.571
azz+0.562
acker+0.546
Neg logit contributions
 reaching-0.444
 giving-0.435
 helping-0.414
 understanding-0.403
 showing-0.402
Token he
Token ID339
Feature activation-1.64e+01

Pos logit contributions
 reaching+0.174
 giving+0.155
 pointing+0.155
 mesmer+0.153
 counting+0.151
Neg logit contributions
issy-0.323
ppy-0.281
uffy-0.279
que-0.263
azz-0.261
Token could
Token ID714
Feature activation+1.24e+01

Pos logit contributions
issy+0.372
ppy+0.317
uffy+0.313
azz+0.313
acker+0.295
Neg logit contributions
 reaching-0.307
 giving-0.306
 listening-0.298
 helping-0.297
 spending-0.288
Token go
Token ID467
Feature activation+1.45e+01

Pos logit contributions
 pointing+0.200
 giving+0.199
 reaching+0.199
 understanding+0.195
 insisting+0.193
Neg logit contributions
issy-0.300
ppy-0.277
uffy-0.256
que-0.250
acker-0.239
Token on
Token ID319
Feature activation+1.10e+01

Pos logit contributions
 pointing+0.192
 understanding+0.190
 giving+0.185
 reaching+0.183
 mesmer+0.176
Neg logit contributions
issy-0.369
ppy-0.347
que-0.322
uffy-0.320
c-0.318
Token
Token ID198
Feature activation-4.41e+00

Pos logit contributions
issy+0.127
ppy+0.115
uffy+0.108
azz+0.105
pillar+0.104
Neg logit contributions
 reaching-0.062
 giving-0.060
 helping-0.056
 understanding-0.055
 counting-0.055
Token"
Token ID1
Feature activation-4.82e+00

Pos logit contributions
issy+0.108
ppy+0.096
uffy+0.089
azz+0.087
pillar+0.085
Neg logit contributions
 reaching-0.079
 giving-0.077
 understanding-0.074
 helping-0.074
 counting-0.072
TokenMom
Token ID29252
Feature activation-1.43e+01

Pos logit contributions
issy+0.121
ppy+0.108
uffy+0.101
azz+0.098
acker+0.096
Neg logit contributions
 reaching-0.084
 giving-0.082
 understanding-0.078
 helping-0.077
 pointing-0.076
Token,"
Token ID553
Feature activation+1.71e+00

Pos logit contributions
issy+0.329
ppy+0.291
uffy+0.270
azz+0.259
pillar+0.254
Neg logit contributions
 reaching-0.273
 giving-0.264
 helping-0.251
 understanding-0.250
 showing-0.249
Token said
Token ID531
Feature activation+4.79e+00

Pos logit contributions
 reaching+0.027
 giving+0.026
 helping+0.024
 counting+0.024
 understanding+0.024
Neg logit contributions
issy-0.046
ppy-0.041
uffy-0.038
azz-0.037
pillar-0.037
Token Sunny
Token ID32241
Feature activation-2.66e+01

Pos logit contributions
 reaching+0.083
 giving+0.079
 counting+0.076
 understanding+0.075
 helping+0.075
Neg logit contributions
issy-0.118
ppy-0.107
uffy-0.099
ens-0.095
azz-0.094
Token.
Token ID13
Feature activation-2.65e+00

Pos logit contributions
issy+0.741
ppy+0.679
uffy+0.629
azz+0.614
pillar+0.605
Neg logit contributions
 reaching-0.380
 giving-0.363
 helping-0.341
 understanding-0.338
 counting-0.338
Token "
Token ID366
Feature activation-2.02e+00

Pos logit contributions
issy+0.068
ppy+0.063
uffy+0.061
azz+0.059
pillar+0.059
Neg logit contributions
 reaching-0.035
 listening-0.033
 succeeding-0.033
 spending-0.033
 giving-0.032
TokenI
Token ID40
Feature activation-2.57e+00

Pos logit contributions
issy+0.056
ppy+0.049
uffy+0.048
azz+0.045
acker+0.045
Neg logit contributions
 reaching-0.031
 giving-0.030
 understanding-0.029
 helping-0.028
 pointing-0.028
Token was
Token ID373
Feature activation+7.63e+00

Pos logit contributions
issy+0.066
ppy+0.059
uffy+0.055
azz+0.054
pillar+0.053
Neg logit contributions
 reaching-0.042
 giving-0.041
 understanding-0.039
 helping-0.039
 counting-0.038
Token mean
Token ID1612
Feature activation+3.75e+00

Pos logit contributions
 reaching+0.108
 anticipation+0.100
 diving+0.094
 shore+0.093
 mesmer+0.093
Neg logit contributions
issy-0.179
ppy-0.163
uffy-0.155
itty-0.149
pillar-0.148
Tokenâ
Token ID22940
Feature activation-6.37e+00

Pos logit contributions
 reaching+0.053
 giving+0.050
 helping+0.047
 counting+0.047
 understanding+0.046
Neg logit contributions
issy-0.124
ppy-0.112
uffy-0.105
azz-0.104
pillar-0.102
Token
Token ID26391
Feature activation-1.54e+01

Pos logit contributions
issy+0.123
ppy+0.107
uffy+0.102
azz+0.097
pillar+0.093
Neg logit contributions
 reaching-0.144
 giving-0.139
 listening-0.134
 counting-0.134
 pointing-0.133
Token it
Token ID340
Feature activation-1.28e+01

Pos logit contributions
issy+0.387
ppy+0.346
acky+0.308
que+0.306
pillar+0.304
Neg logit contributions
 reaching-0.264
 giving-0.252
 pointing-0.249
 counting-0.245
 packing-0.238
Token said
Token ID531
Feature activation+5.31e+00

Pos logit contributions
issy+0.299
ppy+0.265
uffy+0.244
azz+0.240
pillar+0.235
Neg logit contributions
 reaching-0.244
 giving-0.238
 helping-0.226
 counting-0.224
 pointing-0.223
Token.
Token ID13
Feature activation-2.49e+00

Pos logit contributions
 reaching+0.067
 giving+0.064
 understanding+0.059
 helping+0.059
 counting+0.058
Neg logit contributions
issy-0.155
ppy-0.141
uffy-0.134
azz-0.130
pillar-0.128
Token Sunny
Token ID32241
Feature activation-2.66e+01

Pos logit contributions
issy+0.062
ppy+0.056
azz+0.052
uffy+0.052
pillar+0.050
Neg logit contributions
 reaching-0.042
 giving-0.040
 helping-0.039
 understanding-0.038
 counting-0.038
Token looked
Token ID3114
Feature activation+1.32e+01

Pos logit contributions
issy+0.680
ppy+0.612
uffy+0.568
azz+0.551
pillar+0.547
Neg logit contributions
 reaching-0.444
 giving-0.430
 helping-0.408
 understanding-0.404
 counting-0.403
Token up
Token ID510
Feature activation+6.26e+00

Pos logit contributions
 reaching+0.177
 giving+0.172
 helping+0.159
 counting+0.158
 listening+0.155
Neg logit contributions
issy-0.368
ppy-0.336
uffy-0.312
azz-0.308
pillar-0.304
Token and
Token ID290
Feature activation+2.74e+00

Pos logit contributions
 giving+0.127
 succeeding+0.127
 teaching+0.122
 joining+0.122
 spending+0.120
Neg logit contributions
issy-0.099
ppy-0.092
azz-0.083
uffy-0.083
pillar-0.082
Token saw
Token ID2497
Feature activation+1.44e+01

Pos logit contributions
 reaching+0.052
 giving+0.051
 helping+0.049
 counting+0.048
 understanding+0.047
Neg logit contributions
issy-0.062
ppy-0.055
uffy-0.050
azz-0.050
pillar-0.048
Token a
Token ID257
Feature activation-6.50e+00

Pos logit contributions
 reaching+0.135
 giving+0.125
 helping+0.113
 understanding+0.110
 counting+0.109
Neg logit contributions
issy-0.471
ppy-0.430
uffy-0.408
pillar-0.402
azz-0.402
Token they
Token ID484
Feature activation-1.38e+01

Pos logit contributions
 reaching+0.029
 understanding+0.027
 giving+0.027
 encouraging+0.026
 pushing+0.026
Neg logit contributions
issy-0.057
ppy-0.054
uffy-0.052
ens-0.048
que-0.046
Token saw
Token ID2497
Feature activation+1.41e+01

Pos logit contributions
issy+0.327
ppy+0.284
uffy+0.265
azz+0.260
pillar+0.256
Neg logit contributions
 reaching-0.265
 giving-0.257
 helping-0.244
 understanding-0.243
 counting-0.241
Token something
Token ID1223
Feature activation-3.37e+00

Pos logit contributions
 reaching+0.150
 giving+0.140
 counting+0.130
 understanding+0.126
 helping+0.125
Neg logit contributions
issy-0.430
ppy-0.396
uffy-0.378
pillar-0.368
ens-0.366
Token terrible
Token ID7818
Feature activation+1.30e+00

Pos logit contributions
issy+0.086
ppy+0.079
uffy+0.075
pillar+0.073
azz+0.072
Neg logit contributions
 reaching-0.049
 giving-0.047
 understanding-0.045
 counting-0.044
 spending-0.044
Token.
Token ID13
Feature activation-2.76e+00

Pos logit contributions
 reaching+0.018
 giving+0.018
 understanding+0.017
 helping+0.016
 counting+0.016
Neg logit contributions
issy-0.036
ppy-0.033
uffy-0.030
azz-0.030
pillar-0.030
Token Sunny
Token ID32241
Feature activation-2.66e+01

Pos logit contributions
issy+0.071
ppy+0.064
uffy+0.059
azz+0.058
pillar+0.057
Neg logit contributions
 reaching-0.046
 giving-0.044
 helping-0.042
 understanding-0.042
 counting-0.041
Token was
Token ID373
Feature activation+7.44e+00

Pos logit contributions
issy+0.674
ppy+0.583
uffy+0.548
azz+0.539
acker+0.529
Neg logit contributions
 reaching-0.470
 giving-0.456
 helping-0.429
 pointing-0.428
 counting-0.426
Token gone
Token ID3750
Feature activation+8.26e+00

Pos logit contributions
 reaching+0.090
 understanding+0.087
 giving+0.086
 encouraging+0.083
 spending+0.082
Neg logit contributions
issy-0.194
ppy-0.186
uffy-0.170
isk-0.164
azz-0.163
Token.
Token ID13
Feature activation-2.81e+00

Pos logit contributions
 reaching+0.125
 giving+0.120
 understanding+0.114
 helping+0.113
 counting+0.112
Neg logit contributions
issy-0.222
ppy-0.201
uffy-0.190
azz-0.183
pillar-0.181
Token Someone
Token ID17877
Feature activation-1.08e+01

Pos logit contributions
issy+0.076
ppy+0.068
uffy+0.063
azz+0.061
x+0.060
Neg logit contributions
 reaching-0.044
 giving-0.043
 helping-0.040
 pointing-0.040
 understanding-0.040
Token had
Token ID550
Feature activation+6.85e+00

Pos logit contributions
issy+0.253
ppy+0.221
uffy+0.207
azz+0.203
acker+0.200
Neg logit contributions
 reaching-0.211
 giving-0.206
 helping-0.197
 showing-0.194
 pointing-0.193
Token some
Token ID617
Feature activation+1.84e+00

Pos logit contributions
 reaching+0.133
 giving+0.125
 counting+0.113
 understanding+0.113
 helping+0.112
Neg logit contributions
issy-0.384
ppy-0.352
uffy-0.332
pillar-0.323
azz-0.322
Token food
Token ID2057
Feature activation-3.72e+00

Pos logit contributions
 reaching+0.031
 pointing+0.031
 mesmer+0.029
 giving+0.029
 counting+0.029
Neg logit contributions
issy-0.043
ppy-0.039
que-0.037
ens-0.036
pillar-0.036
Token."
Token ID526
Feature activation-1.91e+00

Pos logit contributions
issy+0.112
ppy+0.103
uffy+0.097
azz+0.094
pillar+0.093
Neg logit contributions
 reaching-0.045
 giving-0.043
 understanding-0.039
 helping-0.039
 counting-0.039
Token So
Token ID1406
Feature activation+3.22e+00

Pos logit contributions
issy+0.049
ppy+0.044
uffy+0.041
azz+0.040
pillar+0.040
Neg logit contributions
 reaching-0.032
 giving-0.030
 counting-0.029
 helping-0.029
 understanding-0.028
Token,
Token ID11
Feature activation+3.47e+00

Pos logit contributions
 reaching+0.051
 counting+0.048
 giving+0.047
 pointing+0.046
 testing+0.045
Neg logit contributions
issy-0.083
ppy-0.074
uffy-0.070
ens-0.069
pillar-0.067
Token Betsy
Token ID45775
Feature activation-2.66e+01

Pos logit contributions
 reaching+0.057
 counting+0.053
 giving+0.052
 pointing+0.051
 testing+0.051
Neg logit contributions
issy-0.087
ppy-0.077
uffy-0.073
ens-0.072
pillar-0.070
Token and
Token ID290
Feature activation+2.40e+00

Pos logit contributions
issy+0.722
ppy+0.646
uffy+0.611
azz+0.596
pillar+0.585
Neg logit contributions
 reaching-0.405
 giving-0.393
 helping-0.368
 understanding-0.363
 counting-0.362
Token the
Token ID262
Feature activation-1.43e+01

Pos logit contributions
 reaching+0.027
 giving+0.025
 counting+0.024
 understanding+0.023
 helping+0.023
Neg logit contributions
issy-0.074
ppy-0.067
uffy-0.063
ens-0.062
pillar-0.061
Token thief
Token ID25906
Feature activation-1.25e+01

Pos logit contributions
issy+0.276
ppy+0.237
uffy+0.215
azz+0.206
pillar+0.204
Neg logit contributions
 reaching-0.332
 giving-0.324
 helping-0.311
 counting-0.310
 understanding-0.309
Token went
Token ID1816
Feature activation+1.34e+01

Pos logit contributions
issy+0.313
ppy+0.278
uffy+0.264
azz+0.258
pillar+0.254
Neg logit contributions
 reaching-0.216
 giving-0.210
 helping-0.200
 showing-0.197
 counting-0.196
Token on
Token ID319
Feature activation+1.11e+01

Pos logit contributions
 reaching+0.159
 giving+0.151
 understanding+0.147
 mesmer+0.139
 pointing+0.138
Neg logit contributions
issy-0.393
ppy-0.360
uffy-0.347
ens-0.341
azz-0.335
Token time
Token ID640
Feature activation+3.52e+00

Pos logit contributions
 reaching+0.145
 giving+0.142
 helping+0.136
 counting+0.135
 pointing+0.134
Neg logit contributions
issy-0.162
ppy-0.144
uffy-0.129
azz-0.129
pillar-0.128
Token.
Token ID13
Feature activation-2.42e+00

Pos logit contributions
 giving+0.052
 reaching+0.052
 understanding+0.051
 insisting+0.050
 encouraging+0.049
Neg logit contributions
issy-0.086
ppy-0.080
uffy-0.071
pillar-0.071
azz-0.071
Token
Token ID198
Feature activation-3.74e+00

Pos logit contributions
issy+0.062
ppy+0.053
uffy+0.050
azz+0.049
pillar+0.048
Neg logit contributions
 reaching-0.041
 giving-0.040
 helping-0.038
 understanding-0.038
 showing-0.038
Token
Token ID198
Feature activation-3.73e+00

Pos logit contributions
issy+0.098
ppy+0.085
uffy+0.082
azz+0.079
pillar+0.079
Neg logit contributions
 reaching-0.059
 giving-0.059
 helping-0.054
 counting-0.054
 showing-0.054
TokenSu
Token ID5606
Feature activation-1.61e+01

Pos logit contributions
issy+0.086
ppy+0.072
uffy+0.067
pillar+0.067
azz+0.066
Neg logit contributions
 giving-0.070
 reaching-0.070
 counting-0.066
 pointing-0.066
 helping-0.066
Tokenzie
Token ID49746
Feature activation-2.66e+01

Pos logit contributions
issy+0.258
ppy+0.224
azz+0.210
ocado+0.208
uffy+0.203
Neg logit contributions
 reaching-0.403
 giving-0.384
 counting-0.375
 listening-0.373
 helping-0.369
Token loved
Token ID6151
Feature activation+1.82e+01

Pos logit contributions
issy+0.618
ppy+0.551
uffy+0.504
azz+0.488
pillar+0.485
Neg logit contributions
 reaching-0.502
 giving-0.485
 helping-0.463
 understanding-0.462
 counting-0.455
Token her
Token ID607
Feature activation-1.64e+01

Pos logit contributions
 reaching+0.177
 understanding+0.171
 insisting+0.159
 mesmer+0.156
 intrigued+0.156
Neg logit contributions
issy-0.438
ens-0.408
ppy-0.408
isk-0.391
pillar-0.391
Token magazine
Token ID7093
Feature activation-8.96e+00

Pos logit contributions
issy+0.336
ppy+0.288
uffy+0.270
pillar+0.259
azz+0.256
Neg logit contributions
 reaching-0.354
 giving-0.338
 understanding-0.324
 counting-0.321
 helping-0.321
Token and
Token ID290
Feature activation+2.61e+00

Pos logit contributions
issy+0.273
ppy+0.249
uffy+0.235
azz+0.231
pillar+0.227
Neg logit contributions
 reaching-0.107
 giving-0.102
 helping-0.095
 counting-0.094
 understanding-0.093
Token kept
Token ID4030
Feature activation+2.03e+01

Pos logit contributions
 reaching+0.046
 insisting+0.042
 giving+0.042
 warning+0.042
 understanding+0.041
Neg logit contributions
issy-0.056
ppy-0.048
ens-0.044
pillar-0.044
ems-0.043
Token the
Token ID262
Feature activation-1.46e+01

Pos logit contributions
 reaching+0.118
 giving+0.102
 understanding+0.100
 encouraging+0.099
 insisting+0.095
Neg logit contributions
issy-0.274
ppy-0.246
uffy-0.240
ens-0.234
isk-0.232
Token garden
Token ID11376
Feature activation-1.53e+00

Pos logit contributions
issy+0.333
ppy+0.292
uffy+0.273
azz+0.262
pillar+0.261
Neg logit contributions
 reaching-0.283
 giving-0.274
 helping-0.261
 understanding-0.261
 counting-0.258
Token again
Token ID757
Feature activation+3.43e+00

Pos logit contributions
issy+0.042
ppy+0.038
uffy+0.036
ase+0.036
isk+0.035
Neg logit contributions
 reaching-0.020
 understanding-0.019
 giving-0.019
 insisting-0.019
 pointing-0.018
Token to
Token ID284
Feature activation+9.64e+00

Pos logit contributions
 reaching+0.057
 giving+0.057
 insisting+0.054
 encouraging+0.054
 understanding+0.054
Neg logit contributions
issy-0.080
ppy-0.076
uffy-0.068
azz-0.065
pillar-0.064
Token see
Token ID766
Feature activation+1.46e+01

Pos logit contributions
 reaching+0.159
 understanding+0.148
 giving+0.148
 insisting+0.145
 encouraging+0.142
Neg logit contributions
issy-0.239
ppy-0.213
isk-0.200
uffy-0.194
ens-0.191
Token Sunny
Token ID32241
Feature activation-2.66e+01

Pos logit contributions
 reaching+0.151
 giving+0.137
 counting+0.128
 testing+0.123
 showing+0.122
Neg logit contributions
issy-0.449
ppy-0.416
uffy-0.397
ens-0.391
pillar-0.386
Token.
Token ID13
Feature activation-2.54e+00

Pos logit contributions
issy+0.701
ppy+0.659
uffy+0.606
azz+0.582
pillar+0.580
Neg logit contributions
 reaching-0.387
 giving-0.367
 understanding-0.363
 helping-0.346
 insisting-0.346
Token But
Token ID887
Feature activation+2.18e+00

Pos logit contributions
issy+0.066
ppy+0.059
uffy+0.055
azz+0.053
pillar+0.053
Neg logit contributions
 reaching-0.042
 giving-0.041
 helping-0.038
 understanding-0.038
 counting-0.038
Token they
Token ID484
Feature activation-1.38e+01

Pos logit contributions
 reaching+0.029
 understanding+0.027
 giving+0.027
 encouraging+0.026
 pushing+0.026
Neg logit contributions
issy-0.057
ppy-0.054
uffy-0.052
ens-0.048
que-0.046
Token saw
Token ID2497
Feature activation+1.41e+01

Pos logit contributions
issy+0.327
ppy+0.284
uffy+0.265
azz+0.260
pillar+0.256
Neg logit contributions
 reaching-0.265
 giving-0.257
 helping-0.244
 understanding-0.243
 counting-0.241
Token something
Token ID1223
Feature activation-3.37e+00

Pos logit contributions
 reaching+0.150
 giving+0.140
 counting+0.130
 understanding+0.126
 helping+0.125
Neg logit contributions
issy-0.430
ppy-0.396
uffy-0.378
pillar-0.368
ens-0.366
Token You
Token ID921
Feature activation-5.86e+00

Pos logit contributions
issy+0.052
ppy+0.046
uffy+0.043
azz+0.042
pillar+0.042
Neg logit contributions
 reaching-0.033
 giving-0.032
 helping-0.030
 understanding-0.030
 counting-0.029
Token sound
Token ID2128
Feature activation+5.65e+00

Pos logit contributions
issy+0.128
ppy+0.112
uffy+0.105
azz+0.100
pillar+0.098
Neg logit contributions
 reaching-0.119
 giving-0.117
 helping-0.112
 understanding-0.111
 counting-0.111
Token very
Token ID845
Feature activation+7.50e+00

Pos logit contributions
 reaching+0.080
 giving+0.076
 helping+0.071
 understanding+0.071
 counting+0.071
Neg logit contributions
issy-0.160
ppy-0.145
uffy-0.136
azz-0.133
pillar-0.131
Token pretty
Token ID2495
Feature activation-4.26e-01

Pos logit contributions
 reaching+0.104
 giving+0.097
 counting+0.091
 insisting+0.090
 pointing+0.090
Neg logit contributions
issy-0.214
ppy-0.196
uffy-0.181
azz-0.179
pillar-0.177
Token,"
Token ID553
Feature activation+1.57e+00

Pos logit contributions
issy+0.012
ppy+0.011
uffy+0.011
azz+0.010
pillar+0.010
Neg logit contributions
 reaching-0.006
 giving-0.005
 helping-0.005
 understanding-0.005
 counting-0.005
Token Sunny
Token ID32241
Feature activation-2.66e+01

Pos logit contributions
 reaching+0.029
 giving+0.028
 helping+0.027
 understanding+0.027
 counting+0.027
Neg logit contributions
issy-0.038
ppy-0.033
uffy-0.031
azz-0.030
pillar-0.029
Token said
Token ID531
Feature activation+4.54e+00

Pos logit contributions
issy+0.711
ppy+0.636
uffy+0.597
azz+0.581
pillar+0.574
Neg logit contributions
 reaching-0.415
 giving-0.402
 helping-0.377
 understanding-0.375
 counting-0.373
Token.
Token ID13
Feature activation-2.56e+00

Pos logit contributions
 reaching+0.080
 counting+0.074
 spending+0.074
 giving+0.074
 packing+0.074
Neg logit contributions
issy-0.107
ppy-0.101
uffy-0.087
azz-0.086
ens-0.085
Token
Token ID198
Feature activation-4.74e+00

Pos logit contributions
issy+0.069
ppy+0.062
uffy+0.058
azz+0.057
pillar+0.057
Neg logit contributions
 reaching-0.039
 giving-0.037
 counting-0.035
 helping-0.035
 understanding-0.035
Token
Token ID198
Feature activation-4.69e+00

Pos logit contributions
issy+0.162
ppy+0.148
x+0.138
uffy+0.136
acker+0.136
Neg logit contributions
 reaching-0.036
 giving-0.035
 understanding-0.034
 helping-0.033
 to-0.030
Token"
Token ID1
Feature activation-5.00e+00

Pos logit contributions
issy+0.123
uffy+0.100
azz+0.099
ppy+0.099
acker+0.096
Neg logit contributions
 understanding-0.076
 reaching-0.075
 giving-0.073
 helping-0.072
 encouraging-0.071
Token,
Token ID11
Feature activation+3.68e+00

Pos logit contributions
issy+0.456
ppy+0.409
uffy+0.386
azz+0.380
acker+0.374
Neg logit contributions
 reaching-0.221
 giving-0.216
 helping-0.203
 counting-0.198
 showing-0.196
Token just
Token ID655
Feature activation+4.30e+00

Pos logit contributions
 reaching+0.054
 giving+0.050
 counting+0.049
 understanding+0.049
 pointing+0.048
Neg logit contributions
issy-0.098
ppy-0.089
uffy-0.081
pillar-0.080
azz-0.079
Token like
Token ID588
Feature activation+1.52e+01

Pos logit contributions
 reaching+0.064
 giving+0.060
 understanding+0.060
 cheering+0.060
 packing+0.059
Neg logit contributions
issy-0.099
ppy-0.096
ens-0.086
ocado-0.083
isk-0.082
Token her
Token ID607
Feature activation-1.68e+01

Pos logit contributions
 reaching+0.188
 giving+0.172
 understanding+0.169
 pointing+0.167
 counting+0.165
Neg logit contributions
issy-0.436
ppy-0.396
uffy-0.371
ens-0.364
pillar-0.364
Token.
Token ID13
Feature activation-2.61e+00

Pos logit contributions
issy+0.434
ppy+0.402
uffy+0.387
ocado+0.358
pillar+0.358
Neg logit contributions
 reaching-0.251
 giving-0.235
 understanding-0.225
 pointing-0.224
 insisting-0.219
Token Betsy
Token ID45775
Feature activation-2.66e+01

Pos logit contributions
issy+0.068
ppy+0.060
uffy+0.057
azz+0.055
pillar+0.054
Neg logit contributions
 reaching-0.043
 giving-0.041
 helping-0.039
 counting-0.039
 understanding-0.039
Token was
Token ID373
Feature activation+7.49e+00

Pos logit contributions
issy+0.704
ppy+0.615
uffy+0.590
azz+0.587
acker+0.567
Neg logit contributions
 reaching-0.419
 giving-0.414
 helping-0.389
 understanding-0.387
 showing-0.382
Token sad
Token ID6507
Feature activation+6.00e+00

Pos logit contributions
 reaching+0.100
 diving+0.098
 shore+0.097
 packing+0.094
 insisting+0.093
Neg logit contributions
issy-0.160
ppy-0.156
ocado-0.143
c-0.141
isk-0.137
Token that
Token ID326
Feature activation-2.68e+00

Pos logit contributions
 reaching+0.096
 giving+0.091
 pointing+0.088
 counting+0.088
 showing+0.087
Neg logit contributions
issy-0.152
ppy-0.141
uffy-0.133
azz-0.128
ens-0.128
Token she
Token ID673
Feature activation-1.74e+01

Pos logit contributions
issy+0.071
ppy+0.063
uffy+0.059
ens+0.059
pillar+0.058
Neg logit contributions
 reaching-0.040
 giving-0.037
 counting-0.036
 understanding-0.036
 pointing-0.035
Token had
Token ID550
Feature activation+7.02e+00

Pos logit contributions
issy+0.447
ppy+0.385
uffy+0.371
azz+0.370
ase+0.358
Neg logit contributions
 reaching-0.287
 giving-0.284
 helping-0.272
 understanding-0.267
 showing-0.264
Token to
Token ID284
Feature activation+9.75e+00

Pos logit contributions
 reaching+0.136
 mesmer+0.118
 succeeding+0.113
 counting+0.113
 offering+0.111
Neg logit contributions
issy-0.219
ens-0.205
azz-0.197
ppy-0.196
edy-0.195
Token name
Token ID1438
Feature activation+4.63e+00

Pos logit contributions
 reaching+0.139
 giving+0.130
 pointing+0.123
 insisting+0.121
 understanding+0.120
Neg logit contributions
issy-0.274
ppy-0.246
uffy-0.228
azz-0.221
isk-0.219
Token the
Token ID262
Feature activation-1.45e+01

Pos logit contributions
 reaching+0.037
 giving+0.035
 helping+0.031
 understanding+0.030
 counting+0.030
Neg logit contributions
issy-0.160
ppy-0.148
uffy-0.141
azz-0.138
pillar-0.136
Token sun
Token ID4252
Feature activation-6.63e+00

Pos logit contributions
issy+0.359
ppy+0.314
uffy+0.299
pillar+0.293
azz+0.290
Neg logit contributions
 reaching-0.250
 giving-0.236
 helping-0.224
 understanding-0.223
 counting-0.220
Tokenflower
Token ID25547
Feature activation-1.41e+01

Pos logit contributions
issy+0.177
ppy+0.159
uffy+0.149
azz+0.145
pillar+0.143
Neg logit contributions
 reaching-0.103
 giving-0.100
 helping-0.094
 understanding-0.093
 counting-0.093
Token Sunny
Token ID32241
Feature activation-2.66e+01

Pos logit contributions
issy+0.406
ppy+0.365
uffy+0.345
azz+0.337
pillar+0.331
Neg logit contributions
 reaching-0.195
 giving-0.188
 helping-0.176
 counting-0.174
 understanding-0.171
Token.
Token ID13
Feature activation-2.42e+00

Pos logit contributions
issy+0.755
ppy+0.655
uffy+0.624
azz+0.622
acker+0.604
Neg logit contributions
 reaching-0.384
 giving-0.374
 helping-0.352
 counting-0.346
 pointing-0.343
Token
Token ID198
Feature activation-4.47e+00

Pos logit contributions
issy+0.060
ppy+0.053
uffy+0.050
azz+0.049
pillar+0.047
Neg logit contributions
 reaching-0.043
 giving-0.041
 helping-0.039
 counting-0.039
 understanding-0.039
Token
Token ID198
Feature activation-4.45e+00

Pos logit contributions
issy+0.129
ppy+0.117
uffy+0.110
azz+0.107
pillar+0.106
Neg logit contributions
 reaching-0.061
 giving-0.058
 helping-0.054
 understanding-0.054
 counting-0.054
TokenThe
Token ID464
Feature activation-1.45e+01

Pos logit contributions
issy+0.118
ppy+0.104
uffy+0.099
azz+0.095
acker+0.094
Neg logit contributions
 reaching-0.072
 giving-0.070
 understanding-0.068
 helping-0.067
 listening-0.065
Token next
Token ID1306
Feature activation+5.51e+00

Pos logit contributions
issy+0.385
ppy+0.342
uffy+0.326
pillar+0.315
ens+0.311
Neg logit contributions
 reaching-0.220
 giving-0.214
 helping-0.199
 counting-0.197
 pointing-0.196
Token,
Token ID11
Feature activation+3.81e+00

Pos logit contributions
 reaching+0.074
 insisting+0.067
 giving+0.067
 mesmer+0.067
 packing+0.066
Neg logit contributions
issy-0.107
ppy-0.096
uffy-0.091
azz-0.090
ens-0.089
Token "
Token ID366
Feature activation-1.98e+00

Pos logit contributions
 reaching+0.056
 succeeding+0.051
 counting+0.050
 packing+0.049
 insisting+0.049
Neg logit contributions
issy-0.093
ppy-0.086
azz-0.082
uffy-0.081
pillar-0.077
TokenThank
Token ID10449
Feature activation+1.64e+01

Pos logit contributions
issy+0.059
ppy+0.054
uffy+0.053
isk+0.049
ches+0.049
Neg logit contributions
 reaching-0.025
 giving-0.024
 understanding-0.024
 listening-0.023
 helping-0.022
Token you
Token ID345
Feature activation-1.08e+01

Pos logit contributions
 reaching+0.217
 giving+0.207
 helping+0.206
 encouraging+0.196
 understanding+0.195
Neg logit contributions
issy-0.478
ppy-0.429
uffy-0.408
azz-0.401
acker-0.399
Token,
Token ID11
Feature activation+3.88e+00

Pos logit contributions
issy+0.330
ppy+0.299
uffy+0.283
pillar+0.279
azz+0.278
Neg logit contributions
 reaching-0.128
 giving-0.126
 helping-0.119
 understanding-0.115
 counting-0.114
Token thank
Token ID5875
Feature activation+1.73e+01

Pos logit contributions
 reaching+0.047
 giving+0.045
 helping+0.041
 understanding+0.041
 counting+0.041
Neg logit contributions
issy-0.118
ppy-0.107
uffy-0.101
azz-0.099
pillar-0.098
Token you
Token ID345
Feature activation-1.09e+01

Pos logit contributions
 reaching+0.304
 giving+0.294
 understanding+0.278
 helping+0.277
 counting+0.276
Neg logit contributions
issy-0.426
ppy-0.380
uffy-0.354
azz-0.341
pillar-0.337
Token.
Token ID13
Feature activation-1.56e+00

Pos logit contributions
issy+0.314
ppy+0.285
uffy+0.268
azz+0.262
pillar+0.257
Neg logit contributions
 reaching-0.148
 giving-0.141
 understanding-0.131
 helping-0.131
 counting-0.130
Token This
Token ID770
Feature activation-3.21e+00

Pos logit contributions
issy+0.041
ppy+0.037
uffy+0.035
azz+0.034
pillar+0.033
Neg logit contributions
 reaching-0.025
 giving-0.024
 helping-0.023
 understanding-0.022
 counting-0.022
Token is
Token ID318
Feature activation+7.67e+00

Pos logit contributions
issy+0.067
ppy+0.057
uffy+0.053
azz+0.051
pillar+0.050
Neg logit contributions
 reaching-0.070
 giving-0.067
 understanding-0.065
 counting-0.064
 helping-0.064
Token my
Token ID616
Feature activation-1.30e+01

Pos logit contributions
 reaching+0.065
 giving+0.062
 helping+0.056
 understanding+0.055
 counting+0.054
Neg logit contributions
issy-0.261
ppy-0.240
uffy-0.228
azz-0.224
pillar-0.223
Token the
Token ID262
Feature activation-1.33e+01

Pos logit contributions
 reaching+0.289
 diving+0.280
 mesmer+0.278
 examining+0.274
 succeeding+0.270
Neg logit contributions
issy-0.505
ppy-0.451
que-0.417
isk-0.415
ens-0.411
Token market
Token ID1910
Feature activation+2.04e+00

Pos logit contributions
issy+0.292
ppy+0.251
uffy+0.233
pillar+0.228
azz+0.225
Neg logit contributions
 reaching-0.269
 giving-0.261
 helping-0.249
 counting-0.248
 understanding-0.244
Token and
Token ID290
Feature activation+2.99e+00

Pos logit contributions
 reaching+0.029
 insisting+0.028
 giving+0.027
 diving+0.027
 spending+0.027
Neg logit contributions
issy-0.050
ppy-0.042
isk-0.041
affe-0.040
uffy-0.040
Token saw
Token ID2497
Feature activation+1.45e+01

Pos logit contributions
 reaching+0.045
 giving+0.043
 helping+0.042
 insisting+0.042
 diving+0.041
Neg logit contributions
issy-0.070
ppy-0.063
ocado-0.056
ens-0.056
isk-0.056
Token it
Token ID340
Feature activation-1.19e+01

Pos logit contributions
 reaching+0.165
 giving+0.150
 insisting+0.135
 listening+0.134
 counting+0.133
Neg logit contributions
issy-0.426
ppy-0.374
pillar-0.360
ens-0.359
uffy-0.358
Token getting
Token ID1972
Feature activation+2.07e+01

Pos logit contributions
issy+0.276
ppy+0.243
uffy+0.225
isk+0.217
pillar+0.216
Neg logit contributions
 reaching-0.206
 giving-0.201
 understanding-0.198
 helping-0.192
 listening-0.189
Token busy
Token ID8179
Feature activation+7.17e+00

Pos logit contributions
 reaching+0.285
 believing+0.283
 giving+0.282
 showing+0.278
 testing+0.276
Neg logit contributions
issy-0.529
ppy-0.466
que-0.448
ens-0.438
ety-0.437
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
 reaching+0.114
 understanding+0.114
 encouraging+0.109
 giving+0.108
 mesmer+0.107
Neg logit contributions
issy-0.150
ppy-0.135
uffy-0.126
ens-0.125
hell-0.124
Token People
Token ID4380
Feature activation-8.15e+00

Pos logit contributions
issy+0.030
affe+0.023
ase+0.023
pillar+0.022
azz+0.022
Neg logit contributions
 reaching-0.038
 diving-0.038
 giving-0.038
 joining-0.037
 believing-0.037
Token were
Token ID547
Feature activation+7.71e+00

Pos logit contributions
issy+0.182
ppy+0.157
azz+0.140
ens+0.139
pillar+0.138
Neg logit contributions
 reaching-0.152
 giving-0.146
 counting-0.141
 helping-0.140
 understanding-0.139
Token bringing
Token ID6079
Feature activation+2.16e+01

Pos logit contributions
 Listen+0.093
 reaching+0.076
 diving+0.072
 succeeding+0.071
 believing+0.070
Neg logit contributions
issy-0.169
ppy-0.158
ocado-0.158
uffy-0.152
house-0.150
Token car
Token ID1097
Feature activation-1.13e+01

Pos logit contributions
issy+0.192
ppy+0.170
azz+0.153
acker+0.151
uffy+0.150
Neg logit contributions
 reaching-0.170
 giving-0.168
 helping-0.166
 counting-0.162
 showing-0.162
Token outside
Token ID2354
Feature activation+6.06e+00

Pos logit contributions
issy+0.348
ppy+0.317
uffy+0.300
azz+0.295
pillar+0.291
Neg logit contributions
 reaching-0.134
 giving-0.128
 helping-0.120
 counting-0.118
 showing-0.116
Token.
Token ID13
Feature activation-2.14e+00

Pos logit contributions
 reaching+0.092
 understanding+0.092
 giving+0.089
 nodding+0.087
 succeeding+0.086
Neg logit contributions
issy-0.144
ppy-0.130
uffy-0.122
ase-0.122
ens-0.118
Token He
Token ID679
Feature activation-1.50e+01

Pos logit contributions
issy+0.029
ny+0.027
affe+0.027
ens+0.026
uffy+0.025
Neg logit contributions
 believing-0.052
 giving-0.050
 understanding-0.050
 teaching-0.050
 encouraging-0.049
Token was
Token ID373
Feature activation+7.19e+00

Pos logit contributions
issy+0.380
ppy+0.335
uffy+0.315
azz+0.308
pillar+0.307
Neg logit contributions
 reaching-0.259
 giving-0.253
 pointing-0.244
 showing-0.243
 helping-0.242
Token pushing
Token ID7796
Feature activation+1.92e+01

Pos logit contributions
 appreciation+0.096
 Listen+0.096
 assistance+0.091
 understanding+0.091
 advice+0.090
Neg logit contributions
issy-0.151
ppy-0.142
eh-0.128
 pads-0.127
aky-0.125
Token it
Token ID340
Feature activation-1.30e+01

Pos logit contributions
 reaching+0.129
 giving+0.118
 helping+0.101
 understanding+0.100
 counting+0.099
Neg logit contributions
issy-0.692
ppy-0.638
uffy-0.610
azz-0.597
pillar-0.592
Token around
Token ID1088
Feature activation+7.42e+00

Pos logit contributions
issy+0.349
ppy+0.323
ches+0.299
azz+0.297
uffy+0.296
Neg logit contributions
 giving-0.168
 understanding-0.166
 reaching-0.161
 spending-0.159
 packing-0.156
Token and
Token ID290
Feature activation+2.28e+00

Pos logit contributions
 understanding+0.106
 giving+0.105
 reaching+0.104
 nodding+0.099
 insisting+0.098
Neg logit contributions
issy-0.175
ppy-0.166
pillar-0.150
ens-0.149
x-0.146
Token crashing
Token ID21899
Feature activation+1.69e+01

Pos logit contributions
 understanding+0.027
 giving+0.026
 reaching+0.026
 warning+0.025
 packing+0.025
Neg logit contributions
issy-0.061
ppy-0.057
ens-0.053
x-0.052
azz-0.052
Token it
Token ID340
Feature activation-1.29e+01

Pos logit contributions
 reaching+0.190
 giving+0.182
 understanding+0.170
 helping+0.168
 counting+0.164
Neg logit contributions
issy-0.517
ppy-0.476
uffy-0.447
azz-0.438
pillar-0.432
Token and
Token ID290
Feature activation+2.42e+00

Pos logit contributions
 reaching+0.140
 giving+0.133
 counting+0.133
 packing+0.129
 testing+0.129
Neg logit contributions
issy-0.171
ppy-0.159
azz-0.144
uffy-0.139
pillar-0.139
Token said
Token ID531
Feature activation+4.70e+00

Pos logit contributions
 reaching+0.047
 giving+0.044
 counting+0.044
 helping+0.043
 packing+0.043
Neg logit contributions
issy-0.054
ppy-0.047
azz-0.043
uffy-0.043
ens-0.043
Token,
Token ID11
Feature activation+3.49e+00

Pos logit contributions
 reaching+0.079
 giving+0.075
 listening+0.071
 understanding+0.071
 packing+0.071
Neg logit contributions
issy-0.114
ppy-0.105
uffy-0.100
pillar-0.094
azz-0.093
Token "
Token ID366
Feature activation-2.22e+00

Pos logit contributions
 reaching+0.053
 diving+0.053
 packing+0.051
 anticipation+0.051
 succeeding+0.051
Neg logit contributions
issy-0.076
ppy-0.070
pillar-0.063
azz-0.062
uffy-0.062
TokenI
Token ID40
Feature activation-2.61e+00

Pos logit contributions
issy+0.061
ppy+0.057
uffy+0.054
isk+0.051
acker+0.050
Neg logit contributions
 reaching-0.030
 giving-0.030
 understanding-0.029
 listening-0.029
 spending-0.029
Token love
Token ID1842
Feature activation+1.64e+01

Pos logit contributions
issy+0.068
ppy+0.061
uffy+0.057
azz+0.056
pillar+0.055
Neg logit contributions
 reaching-0.043
 giving-0.041
 helping-0.039
 understanding-0.038
 counting-0.038
Token you
Token ID345
Feature activation-1.10e+01

Pos logit contributions
 reaching+0.293
 mesmer+0.267
 pointing+0.267
 understanding+0.266
 giving+0.262
Neg logit contributions
issy-0.365
ppy-0.323
ens-0.302
pillar-0.300
uffy-0.299
Token both
Token ID1111
Feature activation-4.80e+00

Pos logit contributions
issy+0.298
ppy+0.278
uffy+0.254
azz+0.245
acker+0.241
Neg logit contributions
 reaching-0.163
 giving-0.149
 pointing-0.142
 understanding-0.140
 insisting-0.139
Token the
Token ID262
Feature activation-1.44e+01

Pos logit contributions
issy+0.131
ppy+0.119
uffy+0.111
pillar+0.108
azz+0.108
Neg logit contributions
 reaching-0.068
 giving-0.062
 pointing-0.060
 understanding-0.059
 insisting-0.059
Token same
Token ID976
Feature activation+1.53e+00

Pos logit contributions
issy+0.396
ppy+0.360
uffy+0.335
azz+0.329
pillar+0.320
Neg logit contributions
 reaching-0.215
 giving-0.214
 helping-0.202
 understanding-0.199
 counting-0.198
Token.
Token ID13
Feature activation-2.20e+00

Pos logit contributions
 reaching+0.024
 giving+0.023
 helping+0.022
 understanding+0.022
 counting+0.022
Neg logit contributions
issy-0.040
ppy-0.036
uffy-0.034
azz-0.033
pillar-0.033
Token remembered
Token ID12086
Feature activation+1.26e+01

Pos logit contributions
 reaching+0.172
 mesmer+0.159
 counting+0.152
 giving+0.152
 joining+0.151
Neg logit contributions
issy-0.188
ppy-0.182
ens-0.155
isk-0.146
uffy-0.144
Token to
Token ID284
Feature activation+1.04e+01

Pos logit contributions
 reaching+0.123
 mesmer+0.111
 pointing+0.105
 cheering+0.103
 encouraging+0.101
Neg logit contributions
issy-0.379
ppy-0.347
ens-0.334
uffy-0.329
isk-0.326
Token be
Token ID307
Feature activation+8.28e+00

Pos logit contributions
 reaching+0.155
 mesmer+0.154
 giving+0.143
 pointing+0.142
 insisting+0.138
Neg logit contributions
issy-0.257
ppy-0.241
isk-0.224
que-0.216
ens-0.216
Token careful
Token ID8161
Feature activation+1.15e+01

Pos logit contributions
 reaching+0.118
 packing+0.111
 giving+0.108
 diving+0.107
 pointing+0.106
Neg logit contributions
issy-0.200
ppy-0.193
ocado-0.176
aky-0.174
isk-0.173
Token and
Token ID290
Feature activation+3.10e+00

Pos logit contributions
 reaching+0.146
 giving+0.139
 counting+0.132
 pointing+0.130
 diving+0.128
Neg logit contributions
issy-0.321
ppy-0.296
uffy-0.270
azz-0.264
pillar-0.263
Token help
Token ID1037
Feature activation+1.81e+01

Pos logit contributions
 reaching+0.041
 mesmer+0.039
 joining+0.037
 packing+0.037
 giving+0.037
Neg logit contributions
issy-0.076
ppy-0.072
ens-0.067
uffy-0.065
isk-0.065
Token others
Token ID1854
Feature activation-8.96e+00

Pos logit contributions
 reaching+0.229
 giving+0.222
 helping+0.209
 understanding+0.203
 counting+0.200
Neg logit contributions
issy-0.537
ppy-0.489
uffy-0.468
azz-0.454
acker-0.453
Token be
Token ID307
Feature activation+8.38e+00

Pos logit contributions
issy+0.251
ppy+0.231
uffy+0.213
azz+0.206
pillar+0.206
Neg logit contributions
 reaching-0.125
 giving-0.118
 counting-0.111
 understanding-0.110
 helping-0.109
Token safe
Token ID3338
Feature activation+5.20e+00

Pos logit contributions
 reaching+0.105
 giving+0.096
 counting+0.096
 packing+0.093
 diving+0.093
Neg logit contributions
issy-0.231
ppy-0.218
ocado-0.197
uffy-0.196
isk-0.194
Token too
Token ID1165
Feature activation+3.75e+00

Pos logit contributions
 reaching+0.085
 giving+0.078
 pointing+0.078
 counting+0.077
 understanding+0.076
Neg logit contributions
issy-0.128
ppy-0.119
uffy-0.108
ens-0.104
azz-0.102
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 reaching+0.063
 giving+0.060
 counting+0.060
 showing+0.059
 joining+0.059
Neg logit contributions
issy-0.087
ppy-0.081
uffy-0.074
ens-0.073
pillar-0.070
Tokenmy
Token ID1820
Feature activation-1.51e+01

Pos logit contributions
issy+0.172
ppy+0.135
uffy+0.117
azz+0.112
pillar+0.107
Neg logit contributions
 reaching-0.383
 giving-0.378
 helping-0.367
 counting-0.366
 listening-0.361
Token was
Token ID373
Feature activation+7.71e+00

Pos logit contributions
issy+0.381
ppy+0.341
uffy+0.318
pillar+0.309
azz+0.305
Neg logit contributions
 reaching-0.253
 giving-0.240
 understanding-0.233
 helping-0.227
 counting-0.227
Token mad
Token ID8805
Feature activation+7.29e+00

Pos logit contributions
 reaching+0.095
 succeeding+0.087
 Listen+0.086
 prove+0.084
 understanding+0.082
Neg logit contributions
issy-0.188
uffy-0.172
ppy-0.161
ase-0.158
house-0.157
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
 reaching+0.109
 giving+0.103
 counting+0.099
 helping+0.098
 understanding+0.098
Neg logit contributions
issy-0.192
ppy-0.175
uffy-0.166
azz-0.159
pillar-0.155
Token She
Token ID1375
Feature activation-1.51e+01

Pos logit contributions
issy+0.034
ppy+0.031
azz+0.028
uffy+0.027
acker+0.027
Neg logit contributions
 reaching-0.021
 giving-0.020
 pointing-0.020
 counting-0.019
 helping-0.019
Token was
Token ID373
Feature activation+7.76e+00

Pos logit contributions
issy+0.347
ppy+0.305
uffy+0.282
azz+0.273
pillar+0.269
Neg logit contributions
 reaching-0.293
 giving-0.285
 helping-0.272
 understanding-0.271
 counting-0.270
Token mad
Token ID8805
Feature activation+7.31e+00

Pos logit contributions
 reaching+0.088
 delight+0.087
 Listen+0.084
 succeeding+0.081
 mesmer+0.081
Neg logit contributions
issy-0.193
ppy-0.170
uffy-0.170
isk-0.169
ens-0.161
Token at
Token ID379
Feature activation+1.56e+01

Pos logit contributions
 reaching+0.098
 giving+0.094
 helping+0.088
 counting+0.088
 understanding+0.087
Neg logit contributions
issy-0.210
ppy-0.191
uffy-0.180
azz-0.176
pillar-0.173
Token Tom
Token ID4186
Feature activation-2.18e+01

Pos logit contributions
 reaching+0.206
 giving+0.193
 understanding+0.182
 pointing+0.180
 encouraging+0.179
Neg logit contributions
issy-0.445
ppy-0.402
uffy-0.377
isk-0.366
ens-0.364
Token for
Token ID329
Feature activation+1.31e+01

Pos logit contributions
issy+0.610
ppy+0.549
uffy+0.517
azz+0.505
pillar+0.498
Neg logit contributions
 reaching-0.313
 giving-0.303
 helping-0.283
 understanding-0.280
 counting-0.279
Token not
Token ID407
Feature activation+7.57e+00

Pos logit contributions
 mesmer+0.128
 reaching+0.119
 intrigued+0.110
 fascinated+0.106
 encouraging+0.103
Neg logit contributions
issy-0.364
ppy-0.338
isk-0.332
ens-0.321
que-0.310
Token pond
Token ID16723
Feature activation-3.74e+00

Pos logit contributions
issy+0.348
ppy+0.311
uffy+0.294
pillar+0.286
acker+0.281
Neg logit contributions
 reaching-0.230
 giving-0.220
 understanding-0.211
 helping-0.209
 pointing-0.207
Token.
Token ID13
Feature activation-2.13e+00

Pos logit contributions
issy+0.101
ppy+0.092
uffy+0.086
azz+0.083
pillar+0.082
Neg logit contributions
 reaching-0.055
 giving-0.052
 understanding-0.050
 pointing-0.049
 helping-0.049
Token Molly
Token ID30236
Feature activation-2.35e+01

Pos logit contributions
issy+0.050
ppy+0.045
uffy+0.041
pillar+0.041
azz+0.040
Neg logit contributions
 reaching-0.040
 giving-0.039
 helping-0.037
 listening-0.037
 understanding-0.037
Token had
Token ID550
Feature activation+7.41e+00

Pos logit contributions
issy+0.555
ppy+0.504
uffy+0.453
pillar+0.442
azz+0.437
Neg logit contributions
 reaching-0.433
 giving-0.413
 helping-0.398
 counting-0.396
 understanding-0.390
Token had
Token ID550
Feature activation+7.44e+00

Pos logit contributions
 reaching+0.114
 giving+0.108
 helping+0.103
 understanding+0.103
 pointing+0.103
Neg logit contributions
issy-0.189
ppy-0.172
azz-0.152
pillar-0.151
uffy-0.150
Token so
Token ID523
Feature activation+7.41e+00

Pos logit contributions
 reaching+0.093
 giving+0.086
 understanding+0.083
 encouraging+0.082
 helping+0.081
Neg logit contributions
issy-0.213
ppy-0.193
uffy-0.176
azz-0.174
pillar-0.174
Token much
Token ID881
Feature activation+4.21e+00

Pos logit contributions
 reaching+0.105
 giving+0.103
 showing+0.095
 helping+0.094
 pointing+0.092
Neg logit contributions
issy-0.206
ppy-0.184
uffy-0.178
pillar-0.172
azz-0.170
Token fun
Token ID1257
Feature activation+7.17e+00

Pos logit contributions
 reaching+0.073
 pointing+0.071
 showing+0.071
 giving+0.071
 insisting+0.070
Neg logit contributions
issy-0.084
hell-0.071
ens-0.070
pillar-0.069
que-0.069
Token that
Token ID326
Feature activation-1.99e+00

Pos logit contributions
 understanding+0.086
 reaching+0.085
 giving+0.083
 insisting+0.082
 Listen+0.080
Neg logit contributions
issy-0.177
ppy-0.161
uffy-0.157
pillar-0.156
hell-0.149
Token she
Token ID673
Feature activation-1.69e+01

Pos logit contributions
issy+0.046
ppy+0.041
uffy+0.038
pillar+0.037
ens+0.037
Neg logit contributions
 reaching-0.035
 giving-0.034
 understanding-0.033
 counting-0.032
 pointing-0.032
Token didn
Token ID1422
Feature activation+1.33e+01

Pos logit contributions
issy+0.389
ppy+0.347
uffy+0.315
azz+0.305
pillar+0.304
Neg logit contributions
 reaching-0.326
 giving-0.315
 counting-0.301
 helping-0.301
 understanding-0.301
Token mom
Token ID1995
Feature activation-1.55e+01

Pos logit contributions
issy+0.432
uffy+0.373
ppy+0.372
ens+0.344
isk+0.336
Neg logit contributions
 reaching-0.268
 giving-0.244
 listening-0.235
 spending-0.233
 succeeding-0.231
Token cook
Token ID4255
Feature activation+9.96e+00

Pos logit contributions
issy+0.409
ppy+0.368
uffy+0.339
ens+0.325
isk+0.325
Neg logit contributions
 reaching-0.242
 giving-0.234
 understanding-0.225
 counting-0.217
 pointing-0.216
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
 reaching+0.135
 understanding+0.133
 giving+0.130
 mesmer+0.129
 believing+0.124
Neg logit contributions
issy-0.258
ppy-0.235
uffy-0.223
ens-0.218
ase-0.217
Token She
Token ID1375
Feature activation-1.54e+01

Pos logit contributions
issy+0.035
ppy+0.032
uffy+0.029
pillar+0.029
azz+0.028
Neg logit contributions
 reaching-0.022
 giving-0.021
 helping-0.020
 counting-0.020
 pointing-0.020
Token wanted
Token ID2227
Feature activation+1.84e+01

Pos logit contributions
issy+0.368
ppy+0.325
uffy+0.304
azz+0.300
pillar+0.293
Neg logit contributions
 reaching-0.282
 giving-0.276
 helping-0.265
 counting-0.260
 understanding-0.259
Token to
Token ID284
Feature activation+9.71e+00

Pos logit contributions
 reaching+0.194
 succeeding+0.188
 believing+0.184
 insisting+0.182
 encouraging+0.176
Neg logit contributions
issy-0.535
ppy-0.471
que-0.453
ens-0.453
ase-0.439
Token be
Token ID307
Feature activation+7.50e+00

Pos logit contributions
 reaching+0.163
 believing+0.162
 insisting+0.162
 succeeding+0.158
 giving+0.154
Neg logit contributions
issy-0.228
ppy-0.195
acky-0.171
que-0.171
ens-0.169
Token a
Token ID257
Feature activation-6.33e+00

Pos logit contributions
 reaching+0.103
 giving+0.094
 insisting+0.094
 pointing+0.093
 diving+0.091
Neg logit contributions
issy-0.194
ppy-0.180
ocado-0.166
isk-0.159
ens-0.158
Token good
Token ID922
Feature activation+6.32e+00

Pos logit contributions
issy+0.158
ppy+0.139
uffy+0.136
ocado+0.128
pillar+0.128
Neg logit contributions
 reaching-0.106
 giving-0.098
 insisting-0.096
 understanding-0.093
 pointing-0.093
Token chef
Token ID21221
Feature activation-1.05e+01

Pos logit contributions
 reaching+0.107
 giving+0.104
 helping+0.099
 counting+0.098
 understanding+0.098
Neg logit contributions
issy-0.161
ppy-0.144
uffy-0.133
azz-0.132
pillar-0.129
Token when
Token ID618
Feature activation+5.07e+00

Pos logit contributions
issy+0.316
ppy+0.283
uffy+0.271
azz+0.264
acker+0.262
Neg logit contributions
 reaching-0.131
 giving-0.128
 helping-0.121
 counting-0.118
 showing-0.117
Token bottle
Token ID9294
Feature activation-1.46e+00

Pos logit contributions
issy+0.309
ppy+0.271
uffy+0.251
azz+0.243
pillar+0.240
Neg logit contributions
 reaching-0.263
 giving-0.256
 helping-0.244
 understanding-0.243
 counting-0.242
Token!
Token ID0
Feature activation-7.20e-01

Pos logit contributions
issy+0.043
ppy+0.039
uffy+0.037
azz+0.036
acker+0.035
Neg logit contributions
 reaching-0.019
 giving-0.018
 helping-0.017
 counting-0.016
 showing-0.016
Token He
Token ID679
Feature activation-1.42e+01

Pos logit contributions
issy+0.019
ppy+0.017
uffy+0.016
azz+0.016
pillar+0.015
Neg logit contributions
 reaching-0.011
 giving-0.011
 helping-0.010
 understanding-0.010
 counting-0.010
Token was
Token ID373
Feature activation+8.05e+00

Pos logit contributions
issy+0.316
ppy+0.282
pillar+0.251
azz+0.248
uffy+0.248
Neg logit contributions
 reaching-0.281
 giving-0.271
 helping-0.262
 counting-0.259
 understanding-0.257
Token so
Token ID523
Feature activation+7.60e+00

Pos logit contributions
 insisting+0.123
no+0.123
 spending+0.121
 Listen+0.120
 convincing+0.119
Neg logit contributions
issy-0.130
ppy-0.128
house-0.107
ocado-0.107
aky-0.106
Token excited
Token ID6568
Feature activation+1.31e+01

Pos logit contributions
 spending+0.116
 insisting+0.114
 counting+0.113
 packing+0.112
 reaching+0.112
Neg logit contributions
issy-0.163
ocado-0.153
ppy-0.148
hell-0.139
acky-0.139
Token.
Token ID13
Feature activation-1.83e+00

Pos logit contributions
 reaching+0.164
 giving+0.158
 understanding+0.147
 helping+0.146
 counting+0.143
Neg logit contributions
issy-0.384
ppy-0.348
uffy-0.328
azz-0.322
pillar-0.320
Token The
Token ID383
Feature activation-1.31e+01

Pos logit contributions
issy+0.044
ppy+0.038
uffy+0.036
azz+0.035
pillar+0.035
Neg logit contributions
 reaching-0.034
 giving-0.033
 understanding-0.031
 helping-0.031
 counting-0.031
Token spark
Token ID9009
Feature activation+4.00e+00

Pos logit contributions
issy+0.319
ppy+0.282
uffy+0.262
azz+0.254
pillar+0.251
Neg logit contributions
 reaching-0.241
 giving-0.233
 helping-0.221
 understanding-0.220
 counting-0.220
Tokenle
Token ID293
Feature activation-2.61e+00

Pos logit contributions
 reaching+0.078
 giving+0.077
 showing+0.073
 pointing+0.072
 helping+0.071
Neg logit contributions
issy-0.088
ppy-0.080
uffy-0.076
acker-0.075
isk-0.074
Token granted
Token ID7520
Feature activation+1.48e+01

Pos logit contributions
issy+0.066
ppy+0.059
uffy+0.055
azz+0.053
pillar+0.053
Neg logit contributions
 reaching-0.045
 giving-0.043
 helping-0.041
 understanding-0.041
 counting-0.041
Token never
Token ID1239
Feature activation+9.31e+00

Pos logit contributions
issy+0.328
ppy+0.290
pillar+0.261
azz+0.253
uffy+0.251
Neg logit contributions
 reaching-0.307
 giving-0.294
 helping-0.286
 counting-0.284
 listening-0.279
Token knew
Token ID2993
Feature activation+1.39e+01

Pos logit contributions
 reaching+0.197
 giving+0.191
 helping+0.184
 counting+0.182
 understanding+0.180
Neg logit contributions
issy-0.198
ppy-0.174
azz-0.153
uffy-0.153
pillar-0.150
Token that
Token ID326
Feature activation-1.96e+00

Pos logit contributions
 reaching+0.163
 giving+0.156
 showing+0.142
 pointing+0.141
 helping+0.140
Neg logit contributions
issy-0.405
ppy-0.370
uffy-0.348
azz-0.340
ens-0.339
Token notes
Token ID4710
Feature activation+2.64e+00

Pos logit contributions
issy+0.055
ppy+0.050
pillar+0.047
ens+0.047
uffy+0.046
Neg logit contributions
 reaching-0.024
 giving-0.022
 packing-0.021
 understanding-0.021
 pointing-0.021
Token could
Token ID714
Feature activation+1.31e+01

Pos logit contributions
 reaching+0.047
 giving+0.045
 helping+0.043
 understanding+0.043
 counting+0.042
Neg logit contributions
issy-0.065
ppy-0.057
uffy-0.053
azz-0.052
pillar-0.051
Token glow
Token ID19634
Feature activation+4.98e+00

Pos logit contributions
 reaching+0.183
 giving+0.175
 packing+0.172
 pointing+0.170
 testing+0.169
Neg logit contributions
issy-0.353
ppy-0.317
que-0.295
uffy-0.287
pillar-0.287
Token!
Token ID0
Feature activation-7.08e-01

Pos logit contributions
 reaching+0.060
 giving+0.057
 helping+0.053
 counting+0.052
 understanding+0.052
Neg logit contributions
issy-0.152
ppy-0.138
uffy-0.131
azz-0.127
pillar-0.126
Token She
Token ID1375
Feature activation-1.57e+01

Pos logit contributions
issy+0.020
ppy+0.018
uffy+0.016
azz+0.016
acker+0.016
Neg logit contributions
 reaching-0.010
 giving-0.010
 understanding-0.009
 helping-0.009
 counting-0.009
Token smiled
Token ID13541
Feature activation+8.51e+00

Pos logit contributions
issy+0.360
ppy+0.322
uffy+0.288
pillar+0.288
azz+0.284
Neg logit contributions
 reaching-0.295
 giving-0.283
 helping-0.274
 counting-0.269
 pointing-0.266
Token and
Token ID290
Feature activation+3.00e+00

Pos logit contributions
 reaching+0.139
 giving+0.130
 counting+0.128
 testing+0.126
 insisting+0.125
Neg logit contributions
issy-0.206
ppy-0.190
azz-0.174
uffy-0.173
pillar-0.168
Token decided
Token ID3066
Feature activation+1.07e+01

Pos logit contributions
 reaching+0.055
 counting+0.051
 giving+0.050
 packing+0.050
 warning+0.050
Neg logit contributions
issy-0.064
ppy-0.058
ens-0.051
pillar-0.051
uffy-0.051
Token now
Token ID783
Feature activation+4.29e+00

Pos logit contributions
issy+0.007
ppy+0.007
uffy+0.006
isk+0.006
acker+0.006
Neg logit contributions
 reaching-0.005
 pointing-0.004
 giving-0.004
 encouraging-0.004
 understanding-0.004
Token and
Token ID290
Feature activation+2.45e+00

Pos logit contributions
 reaching+0.072
 giving+0.066
 understanding+0.066
 pointing+0.066
 counting+0.066
Neg logit contributions
issy-0.101
ppy-0.094
uffy-0.086
ens-0.081
acker-0.080
Token have
Token ID423
Feature activation+8.62e+00

Pos logit contributions
 reaching+0.035
 giving+0.032
 pointing+0.031
 mesmer+0.031
 encouraging+0.031
Neg logit contributions
issy-0.060
ppy-0.053
uffy-0.050
ens-0.049
isk-0.048
Token a
Token ID257
Feature activation-6.18e+00

Pos logit contributions
 reaching+0.106
 giving+0.096
 pointing+0.091
 encouraging+0.088
 showing+0.088
Neg logit contributions
issy-0.248
ppy-0.223
uffy-0.210
ens-0.208
que-0.206
Token hot
Token ID3024
Feature activation+1.79e+00

Pos logit contributions
issy+0.159
ppy+0.140
uffy+0.138
acker+0.130
azz+0.128
Neg logit contributions
 reaching-0.100
 giving-0.093
 showing-0.087
 pointing-0.086
 spending-0.086
Token chocolate
Token ID11311
Feature activation-3.81e+00

Pos logit contributions
 reaching+0.032
 giving+0.031
 understanding+0.030
 pointing+0.029
 helping+0.029
Neg logit contributions
issy-0.043
ppy-0.038
uffy-0.036
pillar-0.034
ens-0.034
Token!"
Token ID2474
Feature activation+8.63e-01

Pos logit contributions
issy+0.103
ppy+0.093
uffy+0.087
azz+0.085
pillar+0.084
Neg logit contributions
 reaching-0.058
 giving-0.056
 helping-0.053
 counting-0.052
 understanding-0.052
Token
Token ID198
Feature activation-4.29e+00

Pos logit contributions
 reaching+0.013
 giving+0.013
 counting+0.012
 helping+0.012
 understanding+0.012
Neg logit contributions
issy-0.023
ppy-0.021
uffy-0.020
pillar-0.019
azz-0.019
Token
Token ID198
Feature activation-4.26e+00

Pos logit contributions
issy+0.116
ppy+0.105
uffy+0.099
pillar+0.096
azz+0.095
Neg logit contributions
 reaching-0.067
 giving-0.065
 counting-0.061
 helping-0.060
 pointing-0.059
TokenM
Token ID44
Feature activation-9.87e+00

Pos logit contributions
issy+0.095
ppy+0.083
uffy+0.076
azz+0.074
pillar+0.073
Neg logit contributions
 reaching-0.087
 giving-0.085
 helping-0.081
 counting-0.080
 understanding-0.080
Tokenolly
Token ID5098
Feature activation-1.48e+01

Pos logit contributions
issy+0.152
ppy+0.122
azz+0.105
uffy+0.104
x+0.100
Neg logit contributions
 reaching-0.266
 giving-0.259
 understanding-0.254
 helping-0.251
 counting-0.250
Token hurt
Token ID5938
Feature activation+9.37e+00

Pos logit contributions
 mesmer+0.170
 encouraging+0.168
 insisting+0.168
 reaching+0.168
 testing+0.167
Neg logit contributions
ppy-0.238
issy-0.237
ens-0.215
que-0.209
ety-0.201
Token her
Token ID607
Feature activation-1.57e+01

Pos logit contributions
 reaching+0.123
 giving+0.120
 understanding+0.114
 helping+0.113
 counting+0.111
Neg logit contributions
issy-0.263
ppy-0.243
uffy-0.224
pillar-0.220
azz-0.219
Token mom
Token ID1995
Feature activation-1.49e+01

Pos logit contributions
issy+0.403
ppy+0.374
uffy+0.351
azz+0.326
ocado+0.325
Neg logit contributions
 reaching-0.249
 giving-0.231
 counting-0.221
 understanding-0.219
 insisting-0.219
Token's
Token ID338
Feature activation-4.70e+00

Pos logit contributions
issy+0.416
ppy+0.369
uffy+0.352
azz+0.342
pillar+0.340
Neg logit contributions
 reaching-0.222
 giving-0.215
 helping-0.202
 understanding-0.198
 counting-0.197
Token feelings
Token ID7666
Feature activation-2.92e+00

Pos logit contributions
issy+0.106
ppy+0.098
uffy+0.093
ens+0.087
ocado+0.086
Neg logit contributions
 reaching-0.084
 giving-0.079
 counting-0.076
 pointing-0.076
 spending-0.075
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
issy+0.068
ppy+0.064
azz+0.056
uffy+0.056
hell+0.055
Neg logit contributions
 reaching-0.049
 counting-0.046
 testing-0.046
 examining-0.046
 giving-0.045
Token
Token ID198
Feature activation-3.18e+00

Pos logit contributions
issy+0.042
ppy+0.040
acker+0.036
pillar+0.035
azz+0.035
Neg logit contributions
 reaching-0.017
 giving-0.017
 pointing-0.016
 to-0.016
 asking-0.016
Token
Token ID198
Feature activation-3.12e+00

Pos logit contributions
issy+0.095
ppy+0.087
uffy+0.081
azz+0.079
pillar+0.079
Neg logit contributions
 reaching-0.039
 giving-0.037
 helping-0.035
 understanding-0.035
 counting-0.034
TokenLater
Token ID18602
Feature activation+2.62e+00

Pos logit contributions
issy+0.088
ppy+0.081
uffy+0.076
azz+0.074
acker+0.073
Neg logit contributions
 reaching-0.043
 giving-0.042
 understanding-0.041
 showing-0.039
 helping-0.039
Token that
Token ID326
Feature activation-1.16e+00

Pos logit contributions
 reaching+0.035
 giving+0.034
 understanding+0.033
 succeeding+0.032
 counting+0.032
Neg logit contributions
issy-0.073
uffy-0.063
ppy-0.063
pillar-0.061
ens-0.061
Token day
Token ID1110
Feature activation+2.02e+00

Pos logit contributions
issy+0.024
ppy+0.020
uffy+0.019
pillar+0.019
ens+0.019
Neg logit contributions
 reaching-0.024
 understanding-0.023
 giving-0.023
 counting-0.023
 succeeding-0.022
Token always
Token ID1464
Feature activation+1.06e+01

Pos logit contributions
 reaching+0.183
 giving+0.172
 pointing+0.167
 counting+0.165
 understanding+0.162
Neg logit contributions
issy-0.402
ppy-0.364
uffy-0.334
que-0.326
azz-0.325
Token follow
Token ID1061
Feature activation+1.72e+01

Pos logit contributions
 reaching+0.144
 giving+0.133
 pointing+0.132
 packing+0.131
 counting+0.128
Neg logit contributions
issy-0.282
ppy-0.260
uffy-0.232
isk-0.231
que-0.230
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
 reaching+0.183
 giving+0.176
 helping+0.161
 understanding+0.159
 counting+0.157
Neg logit contributions
issy-0.548
ppy-0.499
uffy-0.475
azz-0.466
pillar-0.459
Token<|endoftext|>
Token ID50256
Feature activation-3.24e+00

Pos logit contributions
issy+0.033
ppy+0.031
uffy+0.029
x+0.028
acker+0.028
Neg logit contributions
 reaching-0.011
 giving-0.011
 understanding-0.011
 helping-0.010
 counting-0.010
TokenOnce
Token ID7454
Feature activation+6.80e+00

Pos logit contributions
issy+0.101
ppy+0.094
uffy+0.085
x+0.084
ems+0.084
Neg logit contributions
 reaching-0.035
 giving-0.033
 pointing-0.031
 counting-0.031
 helping-0.030
Token upon
Token ID2402
Feature activation-1.58e+00

Pos logit contributions
 reaching+0.107
 to+0.104
 giving+0.100
 pointing+0.098
 helping+0.098
Neg logit contributions
issy-0.172
ppy-0.156
pillar-0.147
isk-0.144
ocado-0.143
Token a
Token ID257
Feature activation-5.68e+00

Pos logit contributions
issy+0.058
ppy+0.053
uffy+0.052
azz+0.050
pillar+0.050
Neg logit contributions
 reaching-0.009
 giving-0.008
 understanding-0.007
 helping-0.007
 listening-0.007
Token time
Token ID640
Feature activation+4.81e+00

Pos logit contributions
issy+0.174
ppy+0.147
x+0.146
azz+0.145
rie+0.142
Neg logit contributions
 reaching-0.070
 understanding-0.064
 listening-0.062
 counting-0.062
 helping-0.062
Token there
Token ID612
Feature activation-3.30e-01

Pos logit contributions
 reaching+0.050
 giving+0.045
 counting+0.044
 helping+0.044
 pointing+0.043
Neg logit contributions
issy-0.161
ppy-0.144
pillar-0.135
azz-0.134
acker-0.133
Token was
Token ID373
Feature activation+8.43e+00

Pos logit contributions
issy+0.010
ppy+0.009
ems+0.009
azz+0.009
affe+0.008
Neg logit contributions
 reaching-0.004
 pointing-0.004
 giving-0.004
 counting-0.004
 helping-0.004
Token a
Token ID257
Feature activation-5.86e+00

Pos logit contributions
 reaching+0.042
 giving+0.034
 helping+0.033
 spending+0.030
 listening+0.030
Neg logit contributions
issy-0.320
ppy-0.296
uffy-0.284
azz-0.283
x-0.277
Token reliable
Token ID9314
Feature activation+2.28e+00

Pos logit contributions
issy+0.094
ppy+0.087
uffy+0.081
pillar+0.079
azz+0.078
Neg logit contributions
 reaching-0.050
 giving-0.047
 pointing-0.045
 counting-0.044
 understanding-0.044
Token,
Token ID11
Feature activation+3.25e+00

Pos logit contributions
 reaching+0.030
 giving+0.029
 helping+0.027
 understanding+0.027
 counting+0.026
Neg logit contributions
issy-0.066
ppy-0.060
uffy-0.056
azz-0.055
pillar-0.055
Token like
Token ID588
Feature activation+1.50e+01

Pos logit contributions
 reaching+0.041
 giving+0.039
 counting+0.037
 pointing+0.036
 helping+0.036
Neg logit contributions
issy-0.092
ppy-0.084
uffy-0.078
pillar-0.077
azz-0.077
Token an
Token ID281
Feature activation-3.96e+00

Pos logit contributions
 reaching+0.169
 giving+0.158
 understanding+0.147
 pointing+0.145
 counting+0.143
Neg logit contributions
issy-0.456
ppy-0.416
uffy-0.389
pillar-0.382
azz-0.382
Token exit
Token ID8420
Feature activation+5.14e+00

Pos logit contributions
issy+0.104
ppy+0.092
pillar+0.084
uffy+0.084
affe+0.083
Neg logit contributions
 reaching-0.062
 giving-0.059
 pointing-0.057
 helping-0.056
 showing-0.056
Token!
Token ID0
Feature activation-1.50e+00

Pos logit contributions
 reaching+0.066
 giving+0.064
 helping+0.059
 counting+0.059
 showing+0.058
Neg logit contributions
issy-0.154
ppy-0.140
uffy-0.132
azz-0.128
acker-0.127
Token<|endoftext|>
Token ID50256
Feature activation-4.81e+00

Pos logit contributions
issy+0.047
ppy+0.044
uffy+0.041
x+0.039
azz+0.039
Neg logit contributions
 reaching-0.016
 giving-0.015
 understanding-0.014
 helping-0.014
 counting-0.014
TokenOnce
Token ID7454
Feature activation+4.95e+00

Pos logit contributions
issy+0.134
ppy+0.121
uffy+0.110
azz+0.108
x+0.106
Neg logit contributions
 reaching-0.071
 giving-0.067
 pointing-0.065
 counting-0.064
 helping-0.063
Token upon
Token ID2402
Feature activation-2.63e+00

Pos logit contributions
 reaching+0.087
 giving+0.082
 to+0.080
 helping+0.080
 counting+0.080
Neg logit contributions
issy-0.120
ppy-0.109
pillar-0.100
isk-0.097
ocado-0.096
Token a
Token ID257
Feature activation-6.43e+00

Pos logit contributions
issy+0.095
ppy+0.088
uffy+0.084
azz+0.082
pillar+0.081
Neg logit contributions
 reaching-0.017
 giving-0.015
 helping-0.014
 understanding-0.013
 listening-0.013
Token time
Token ID640
Feature activation+3.01e+00

Pos logit contributions
issy+0.193
ppy+0.165
x+0.159
azz+0.159
ase+0.157
Neg logit contributions
 reaching-0.084
 understanding-0.079
 listening-0.078
 helping-0.077
 counting-0.076
Token their
Token ID511
Feature activation-1.47e+01

Pos logit contributions
 reaching+0.092
 giving+0.086
 helping+0.077
 understanding+0.075
 counting+0.072
Neg logit contributions
issy-0.526
ppy-0.487
azz-0.460
uffy-0.459
pillar-0.453
Token own
Token ID898
Feature activation+2.94e+00

Pos logit contributions
issy+0.380
ppy+0.338
uffy+0.321
azz+0.308
pillar+0.304
Neg logit contributions
 reaching-0.244
 giving-0.235
 helping-0.220
 understanding-0.219
 counting-0.218
Token cars
Token ID5006
Feature activation-6.87e+00

Pos logit contributions
 reaching+0.053
 giving+0.051
 helping+0.048
 understanding+0.048
 counting+0.048
Neg logit contributions
issy-0.071
ppy-0.063
uffy-0.059
azz-0.057
pillar-0.056
Token and
Token ID290
Feature activation+2.28e+00

Pos logit contributions
issy+0.225
ppy+0.198
uffy+0.192
pillar+0.189
azz+0.187
Neg logit contributions
 reaching-0.071
 giving-0.069
 helping-0.066
 counting-0.063
 showing-0.061
Token trucks
Token ID13960
Feature activation-1.04e+01

Pos logit contributions
 reaching+0.042
 giving+0.040
 understanding+0.038
 helping+0.038
 counting+0.038
Neg logit contributions
issy-0.054
ppy-0.048
uffy-0.044
azz-0.043
pillar-0.042
Token.
Token ID13
Feature activation-1.90e+00

Pos logit contributions
issy+0.315
ppy+0.284
uffy+0.269
azz+0.263
pillar+0.261
Neg logit contributions
 reaching-0.128
 giving-0.122
 helping-0.114
 counting-0.112
 understanding-0.112
Token They
Token ID1119
Feature activation-1.06e+01

Pos logit contributions
issy+0.048
ppy+0.042
uffy+0.039
azz+0.039
pillar+0.038
Neg logit contributions
 reaching-0.034
 giving-0.032
 helping-0.031
 counting-0.031
 understanding-0.030
Token are
Token ID389
Feature activation+6.76e+00

Pos logit contributions
issy+0.272
ppy+0.240
uffy+0.233
pillar+0.226
azz+0.224
Neg logit contributions
 reaching-0.169
 giving-0.168
 understanding-0.163
 helping-0.162
 encouraging-0.158
Token smaller
Token ID4833
Feature activation+4.08e+00

Pos logit contributions
 reaching+0.094
 giving+0.079
 insisting+0.078
 pointing+0.078
 counting+0.078
Neg logit contributions
issy-0.178
ppy-0.166
uffy-0.157
ocado-0.154
acky-0.149
Token and
Token ID290
Feature activation+2.24e+00

Pos logit contributions
 reaching+0.055
 giving+0.053
 helping+0.051
 counting+0.050
 understanding+0.050
Neg logit contributions
issy-0.119
ppy-0.107
uffy-0.100
pillar-0.098
azz-0.098
Token safer
Token ID14178
Feature activation+4.69e+00

Pos logit contributions
 reaching+0.028
 giving+0.025
 pointing+0.024
 packing+0.024
 counting+0.024
Neg logit contributions
issy-0.063
ppy-0.059
azz-0.053
ens-0.053
uffy-0.053
Token was
Token ID373
Feature activation+7.58e+00

Pos logit contributions
issy+0.398
ppy+0.357
uffy+0.338
azz+0.331
acker+0.331
Neg logit contributions
 giving-0.196
 reaching-0.196
 showing-0.183
 counting-0.180
 pointing-0.179
Token a
Token ID257
Feature activation-6.41e+00

Pos logit contributions
 reaching+0.081
 giving+0.074
 understanding+0.073
 warning+0.071
 succeeding+0.067
Neg logit contributions
issy-0.213
ppy-0.192
uffy-0.190
acky-0.177
illo-0.175
Token small
Token ID1402
Feature activation-4.03e+00

Pos logit contributions
issy+0.193
ppy+0.173
uffy+0.162
azz+0.161
isk+0.158
Neg logit contributions
 reaching-0.079
 giving-0.078
 understanding-0.074
 helping-0.073
 pointing-0.072
Token girl
Token ID2576
Feature activation-2.11e+01

Pos logit contributions
issy+0.089
ppy+0.076
azz+0.069
uffy+0.068
acker+0.068
Neg logit contributions
 reaching-0.083
 giving-0.082
 helping-0.079
 counting-0.079
 understanding-0.078
Token and
Token ID290
Feature activation+2.74e+00

Pos logit contributions
issy+0.522
ppy+0.471
uffy+0.436
pillar+0.423
azz+0.419
Neg logit contributions
 reaching-0.363
 giving-0.341
 understanding-0.331
 counting-0.321
 helping-0.320
Token she
Token ID673
Feature activation-1.72e+01

Pos logit contributions
 reaching+0.045
 giving+0.043
 counting+0.041
 helping+0.040
 understanding+0.039
Neg logit contributions
issy-0.070
ppy-0.062
uffy-0.057
pillar-0.056
azz-0.056
Token was
Token ID373
Feature activation+7.49e+00

Pos logit contributions
issy+0.443
ppy+0.396
uffy+0.366
azz+0.363
acker+0.356
Neg logit contributions
 giving-0.285
 reaching-0.283
 helping-0.270
 showing-0.267
 counting-0.266
Token very
Token ID845
Feature activation+7.29e+00

Pos logit contributions
 reaching+0.100
 understanding+0.088
 giving+0.088
 warning+0.084
 insisting+0.083
Neg logit contributions
issy-0.206
ppy-0.186
uffy-0.180
isk-0.165
acky-0.163
Token eager
Token ID11069
Feature activation+1.49e+01

Pos logit contributions
 reaching+0.101
 giving+0.089
 insisting+0.088
 pushing+0.084
 joining+0.083
Neg logit contributions
issy-0.197
ppy-0.182
uffy-0.177
acky-0.166
ocado-0.164
Token.
Token ID13
Feature activation-1.84e+00

Pos logit contributions
 reaching+0.130
 mesmer+0.108
 succeeding+0.106
 pointing+0.104
 insisting+0.104
Neg logit contributions
issy-0.464
ppy-0.420
ens-0.402
pillar-0.396
azz-0.393
Token She
Token ID1375
Feature activation-1.58e+01

Pos logit contributions
issy+0.041
ppy+0.036
uffy+0.033
azz+0.032
pillar+0.032
Neg logit contributions
 reaching-0.037
 giving-0.036
 helping-0.035
 understanding-0.034
 counting-0.034
Token together
Token ID1978
Feature activation+5.41e+00

Pos logit contributions
 reaching+0.016
 giving+0.014
 understanding+0.014
 counting+0.014
 pointing+0.013
Neg logit contributions
issy-0.033
ppy-0.031
uffy-0.028
azz-0.028
pillar-0.028
Token.
Token ID13
Feature activation-2.91e+00

Pos logit contributions
 reaching+0.085
 giving+0.082
 understanding+0.081
 insisting+0.078
 testing+0.078
Neg logit contributions
issy-0.130
ppy-0.122
uffy-0.109
pillar-0.108
azz-0.107
Token The
Token ID383
Feature activation-1.40e+01

Pos logit contributions
issy+0.088
ppy+0.083
uffy+0.076
x+0.074
que+0.074
Neg logit contributions
 giving-0.031
 reaching-0.031
 spending-0.030
 helping-0.028
 counting-0.028
Token two
Token ID734
Feature activation-8.11e+00

Pos logit contributions
issy+0.394
ppy+0.359
azz+0.343
uffy+0.333
acker+0.329
Neg logit contributions
 giving-0.203
 reaching-0.202
 understanding-0.201
 encouraging-0.193
 helping-0.190
Token of
Token ID286
Feature activation+8.64e+00

Pos logit contributions
issy+0.227
ppy+0.209
azz+0.195
acker+0.194
uffy+0.193
Neg logit contributions
 reaching-0.118
 giving-0.114
 understanding-0.113
 helping-0.111
 encouraging-0.109
Token them
Token ID606
Feature activation-1.49e+01

Pos logit contributions
 reaching+0.107
 giving+0.102
 helping+0.094
 counting+0.093
 understanding+0.093
Neg logit contributions
issy-0.259
ppy-0.235
uffy-0.220
azz-0.217
pillar-0.215
Token were
Token ID547
Feature activation+7.17e+00

Pos logit contributions
issy+0.368
ppy+0.320
uffy+0.307
azz+0.302
acker+0.293
Neg logit contributions
 reaching-0.257
 giving-0.253
 understanding-0.240
 helping-0.239
 listening-0.238
Token the
Token ID262
Feature activation-1.44e+01

Pos logit contributions
 reaching+0.075
 giving+0.069
 understanding+0.065
 pointing+0.063
 helping+0.063
Neg logit contributions
issy-0.223
ppy-0.205
uffy-0.195
azz-0.186
pillar-0.185
Token best
Token ID1266
Feature activation-5.97e-01

Pos logit contributions
issy+0.438
ppy+0.398
azz+0.376
ase+0.371
uffy+0.371
Neg logit contributions
 reaching-0.174
 giving-0.169
 helping-0.162
 understanding-0.161
 encouraging-0.159
Token of
Token ID286
Feature activation+8.65e+00

Pos logit contributions
issy+0.018
ppy+0.016
uffy+0.015
azz+0.015
pillar+0.015
Neg logit contributions
 reaching-0.008
 giving-0.008
 helping-0.007
 understanding-0.007
 counting-0.007
Token friends
Token ID2460
Feature activation-8.04e+00

Pos logit contributions
 reaching+0.130
 giving+0.121
 pointing+0.118
 counting+0.112
 mesmer+0.111
Neg logit contributions
issy-0.215
ppy-0.197
uffy-0.191
acky-0.181
ens-0.178
Token wear
Token ID5806
Feature activation+1.22e+01

Pos logit contributions
 reaching+0.138
 giving+0.128
 counting+0.124
 pointing+0.123
 helping+0.121
Neg logit contributions
issy-0.276
ppy-0.249
isk-0.227
que-0.225
uffy-0.225
Token a
Token ID257
Feature activation-6.18e+00

Pos logit contributions
 reaching+0.122
 giving+0.117
 helping+0.107
 understanding+0.105
 counting+0.104
Neg logit contributions
issy-0.395
ppy-0.361
uffy-0.343
azz-0.336
pillar-0.333
Token special
Token ID2041
Feature activation+1.23e+00

Pos logit contributions
issy+0.149
ppy+0.132
uffy+0.126
azz+0.121
pillar+0.120
Neg logit contributions
 reaching-0.111
 giving-0.105
 helping-0.099
 counting-0.099
 understanding-0.099
Token veil
Token ID30615
Feature activation-1.10e+01

Pos logit contributions
 reaching+0.026
 giving+0.024
 mesmer+0.023
 insisting+0.023
 understanding+0.023
Neg logit contributions
issy-0.025
ppy-0.021
uffy-0.021
pillar-0.020
ase-0.019
Token.
Token ID13
Feature activation-1.96e+00

Pos logit contributions
issy+0.316
ppy+0.287
uffy+0.267
azz+0.263
pillar+0.258
Neg logit contributions
 reaching-0.147
 giving-0.140
 understanding-0.133
 pointing-0.129
 counting-0.129
Token He
Token ID679
Feature activation-1.49e+01

Pos logit contributions
issy+0.049
ppy+0.043
azz+0.041
uffy+0.041
pillar+0.039
Neg logit contributions
 giving-0.034
 reaching-0.034
 helping-0.032
 understanding-0.032
 listening-0.031
Token went
Token ID1816
Feature activation+1.36e+01

Pos logit contributions
issy+0.414
ppy+0.370
uffy+0.351
azz+0.344
pillar+0.339
Neg logit contributions
 reaching-0.216
 giving-0.210
 helping-0.196
 understanding-0.196
 counting-0.193
Token to
Token ID284
Feature activation+9.87e+00

Pos logit contributions
 reaching+0.138
 understanding+0.138
 succeeding+0.132
 mesmer+0.128
 giving+0.123
Neg logit contributions
issy-0.400
ppy-0.368
ches-0.351
uffy-0.349
azz-0.348
Token the
Token ID262
Feature activation-1.41e+01

Pos logit contributions
 reaching+0.143
 nodding+0.134
 mesmer+0.132
 encouraging+0.128
 succeeding+0.126
Neg logit contributions
issy-0.243
isk-0.207
ppy-0.207
ens-0.200
azz-0.195
Token market
Token ID1910
Feature activation+1.55e+00

Pos logit contributions
issy+0.303
ppy+0.261
uffy+0.245
pillar+0.245
ase+0.233
Neg logit contributions
 reaching-0.283
 giving-0.270
 understanding-0.262
 counting-0.258
 helping-0.256
Token with
Token ID351
Feature activation+1.47e+01

Pos logit contributions
 reaching+0.023
 succeeding+0.022
 joining+0.022
 insisting+0.022
 giving+0.022
Neg logit contributions
issy-0.037
ppy-0.033
uffy-0.032
isk-0.031
ase-0.031
Token excited
Token ID6568
Feature activation+1.23e+01

Pos logit contributions
 spending+0.099
 packing+0.098
 counting+0.095
 reaching+0.094
 giving+0.094
Neg logit contributions
issy-0.155
ppy-0.143
azz-0.140
ocado-0.140
uffy-0.140
Token.
Token ID13
Feature activation-2.78e+00

Pos logit contributions
 reaching+0.137
 giving+0.131
 helping+0.120
 understanding+0.119
 counting+0.118
Neg logit contributions
issy-0.386
ppy-0.352
uffy-0.333
azz-0.326
pillar-0.323
Token M
Token ID337
Feature activation-1.51e+01

Pos logit contributions
issy+0.058
ppy+0.049
uffy+0.046
azz+0.046
pillar+0.045
Neg logit contributions
 reaching-0.059
 giving-0.057
 helping-0.056
 understanding-0.056
 listening-0.055
Tokenandy
Token ID10757
Feature activation-1.86e+01

Pos logit contributions
issy+0.273
ppy+0.228
uffy+0.206
azz+0.203
acker+0.196
Neg logit contributions
 reaching-0.362
 giving-0.349
 counting-0.339
 helping-0.336
 understanding-0.334
Token opened
Token ID4721
Feature activation+1.36e+01

Pos logit contributions
issy+0.438
ppy+0.398
uffy+0.357
pillar+0.354
azz+0.350
Neg logit contributions
 reaching-0.336
 giving-0.326
 helping-0.317
 spending-0.311
 understanding-0.308
Token the
Token ID262
Feature activation-1.41e+01

Pos logit contributions
 giving+0.185
 reaching+0.177
 helping+0.177
 succeeding+0.175
 planning+0.171
Neg logit contributions
issy-0.343
ppy-0.301
azz-0.292
pillar-0.287
x-0.283
Token note
Token ID3465
Feature activation-1.18e+00

Pos logit contributions
issy+0.350
ppy+0.311
uffy+0.289
azz+0.281
pillar+0.277
Neg logit contributions
 reaching-0.252
 giving-0.245
 helping-0.232
 understanding-0.231
 counting-0.230
Token and
Token ID290
Feature activation+2.25e+00

Pos logit contributions
issy+0.029
ppy+0.026
azz+0.024
pillar+0.024
uffy+0.024
Neg logit contributions
 giving-0.020
 reaching-0.019
 helping-0.019
 spending-0.019
 understanding-0.018
Token found
Token ID1043
Feature activation+1.09e+01

Pos logit contributions
 reaching+0.041
 giving+0.040
 helping+0.039
 counting+0.038
 understanding+0.038
Neg logit contributions
issy-0.053
ppy-0.047
azz-0.042
uffy-0.042
pillar-0.041
Token five
Token ID1936
Feature activation-1.92e+00

Pos logit contributions
 reaching+0.114
 giving+0.106
 insisting+0.102
 listening+0.098
 packing+0.098
Neg logit contributions
issy-0.343
ppy-0.307
pillar-0.290
azz-0.286
uffy-0.285
Token smooth
Token ID7209
Feature activation+4.24e+00

Pos logit contributions
issy+0.048
ppy+0.043
uffy+0.040
pillar+0.040
azz+0.039
Neg logit contributions
 reaching-0.032
 giving-0.031
 helping-0.030
 understanding-0.029
 listening-0.029
Token higher
Token ID2440
Feature activation+7.23e+00

Pos logit contributions
 giving+0.035
 reaching+0.034
 packing+0.032
 helping+0.032
 understanding+0.032
Neg logit contributions
issy-0.079
ppy-0.072
uffy-0.067
pillar-0.066
azz-0.065
Token.
Token ID13
Feature activation-2.46e+00

Pos logit contributions
 reaching+0.102
 giving+0.102
 understanding+0.096
 helping+0.095
 teaching+0.093
Neg logit contributions
issy-0.196
ppy-0.179
uffy-0.167
azz-0.161
pillar-0.161
Token But
Token ID887
Feature activation+2.72e+00

Pos logit contributions
issy+0.066
ppy+0.058
uffy+0.056
azz+0.054
pillar+0.053
Neg logit contributions
 reaching-0.038
 giving-0.037
 helping-0.035
 understanding-0.034
 listening-0.034
Token soon
Token ID2582
Feature activation+2.64e+00

Pos logit contributions
 reaching+0.033
 giving+0.033
 helping+0.031
 understanding+0.030
 encouraging+0.030
Neg logit contributions
issy-0.079
ppy-0.072
uffy-0.069
ens-0.065
azz-0.064
Token,
Token ID11
Feature activation+3.88e+00

Pos logit contributions
 reaching+0.033
 giving+0.032
 helping+0.030
 understanding+0.029
 counting+0.029
Neg logit contributions
issy-0.078
ppy-0.071
uffy-0.067
azz-0.065
pillar-0.065
Token it
Token ID340
Feature activation-1.27e+01

Pos logit contributions
 reaching+0.056
 giving+0.054
 understanding+0.052
 helping+0.051
 encouraging+0.051
Neg logit contributions
issy-0.105
ppy-0.093
uffy-0.090
ens-0.085
pillar-0.084
Token became
Token ID2627
Feature activation+1.09e+01

Pos logit contributions
issy+0.344
ppy+0.311
azz+0.289
acker+0.286
uffy+0.285
Neg logit contributions
 reaching-0.198
 giving-0.177
 counting-0.171
 pointing-0.167
 pushing-0.164
Token too
Token ID1165
Feature activation+3.22e+00

Pos logit contributions
 reaching+0.126
 giving+0.125
 understanding+0.114
 showing+0.113
 pointing+0.113
Neg logit contributions
issy-0.326
ppy-0.295
uffy-0.278
que-0.271
azz-0.269
Token hard
Token ID1327
Feature activation+6.24e+00

Pos logit contributions
 reaching+0.042
 giving+0.042
 showing+0.040
 helping+0.039
 understanding+0.039
Neg logit contributions
issy-0.092
ppy-0.082
uffy-0.079
azz-0.076
pillar-0.076
Token.
Token ID13
Feature activation-2.55e+00

Pos logit contributions
 reaching+0.084
 giving+0.082
 understanding+0.081
 pointing+0.078
 showing+0.076
Neg logit contributions
issy-0.173
ppy-0.154
uffy-0.144
azz-0.143
ens-0.141
Token Her
Token ID2332
Feature activation-1.40e+01

Pos logit contributions
issy+0.066
ppy+0.058
uffy+0.056
azz+0.053
pillar+0.053
Neg logit contributions
 reaching-0.042
 giving-0.040
 helping-0.038
 understanding-0.038
 listening-0.038