TOP ACTIVATIONS
MAX = 11.986

. But the two little ones were too dizz y to
make sure that the big ones are in one side ,
One day , the little ones had an idea . They
the competition . The little ones were so happy and now
a few of your favourite ones and let 's go ."
appreci ative and supportive of those around us . <|endoftext|> Once
and knew that the gray ones were not ready to eat
. " Care ful , those berries aren 't safe to
toys and showed Joe which ones he could fill his sack
Sure ! I recommend the ones with the brown shells .
thankful to remember the bitter ones too . He kept a
and the l itt lest ones can go up and down
frog had found the perfect mate . <|endoftext|> Once upon a
a few of the smallest ones to put on her shelf
one side , the small ones in the other side and
L ily , you know those cookies are important . You
, and one of his favorites was catching . Every day
It 's one of my favorites ." They talked
different from all the other ones . His mom
blue socks because the red ones were dirty . Tim my

MIN ACTIVATIONS
MIN = -14.515

cookies and he couldn ’ t wait to get
so scared she didn ’ t know what to
now and it didn ’ t seem to be
quick or we won ’ t be able to
happy that it wasn ’ t getting bigger and
realized that it wasn ’ t fair for her
so frightened he couldn ’ t speak . He
fun but he wasn ’ t sure what .
it when things weren ’ t organized .
high up he couldn ’ t touch it .
was happy he didn ’ t have to be
then hopped onto Alice ’ s shoulder . They
They said it wasn ’ t enough .
river but Lily didn ’ t want to .
relieved that she didn ’ t have to be
woke up it wasn ’ t Tuesday anymore !
big that it didn ’ t fit in the
so good he couldn ’ t resist . So
and thought he didn ’ t need to include
, but he couldn ’ t . The dog

INTERVAL 5.361 - 11.986
CONTAINS 1.846%

lots of other boats . Some were bigger , some were
some of the most beautiful things in the world . <|endoftext|>
crazy rainbow with him . Everyone that saw it couldn 't
, and Max showed everyone how clever he was with his
door was closed . <|endoftext|> One day , Mrs Jones took

INTERVAL -1.265 - 5.361
CONTAINS 67.906%

to help him . The giant used his strong legs and
, outside . He wanted to show her the great string
My favorite food is apples ! â it
stirred and stirred until the pasta was y ummy and ready
you can plant and grow a tree ." L

INTERVAL -7.890 - -1.265
CONTAINS 30.104%

. Ben pushes harder and harder . But the tower does
everything they need . <|endoftext|> Ben and Lily were playing in
finished , the dad closed the guitar and put it away
share his toys with them . He put his
was so excited , she began to jump as everyone else

INTERVAL -14.515 - -7.890
CONTAINS 0.145%

daddy and rolling maze . <|endoftext|> Once upon a time ,
the tree , Please , can
Welcome ! We ’ ve been expecting you
smart kid , Mark !" <|endoftext|> Once upon a time ,
Now , let ’ s be a little
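
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. The sketch below is an illustration only and is not taken from the tool that produced this dump; the helper name `equal_width_edges` is hypothetical. It recomputes the bin edges from the reported MIN and MAX so they can be compared with the printed boundaries.

```python
def equal_width_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as reported in the dump (rounded to three decimals there,
# so recomputed edges can differ from the printed ones in the last digit).
edges = equal_width_edges(-14.515, 11.986, 4)

# Boundaries as printed in the INTERVAL headers, lowest to highest.
reported = [-14.515, -7.890, -1.265, 5.361, 11.986]

for computed, printed in zip(edges, reported):
    print(f"computed {computed:+.4f}   printed {printed:+.3f}")
```

Under this assumption each bin is about 6.625 wide, which agrees with the printed boundaries to within rounding.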
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
 up+0.003
acked+0.003
 hurts+0.003
aned+0.002
chair+0.002
Neg logit contributions
 mate-0.005
 souven-0.004
 charity-0.004
 importantly-0.004
 rulers-0.004
Token But
Token ID887
Feature activation+4.18e+00

Pos logit contributions
 up+0.002
acked+0.002
ulum+0.002
 The+0.002
>+0.002
Neg logit contributions
 mate-0.005
 souven-0.005
 rulers-0.005
 cents-0.004
 charity-0.004
Token the
Token ID262
Feature activation-7.65e-01

Pos logit contributions
 mate+0.156
 charity+0.139
 souven+0.134
 importantly+0.129
ities+0.129
Neg logit contributions
 up-0.158
level-0.126
 hurts-0.120
ulum-0.119
uck-0.119
Token two
Token ID734
Feature activation+5.99e+00

Pos logit contributions
 up+0.021
acked+0.019
level+0.019
 hurts+0.018
erved+0.018
Neg logit contributions
 mate-0.036
 souven-0.031
 charity-0.030
 importantly-0.029
 rulers-0.028
Token little
Token ID1310
Feature activation+1.64e+00

Pos logit contributions
 mate+0.246
 souven+0.216
 charity+0.213
 possible+0.200
uries+0.199
Neg logit contributions
 up-0.196
acked-0.171
 rains-0.168
uddle-0.167
 hurts-0.166
Token ones
Token ID3392
Feature activation+1.20e+01

Pos logit contributions
 mate+0.070
 charity+0.062
 souven+0.061
uries+0.059
 importantly+0.058
Neg logit contributions
 up-0.052
 hurts-0.041
level-0.041
acked-0.041
 rains-0.041
Token were
Token ID547
Feature activation+2.37e+00

Pos logit contributions
 mate+0.531
 souven+0.453
 charity+0.452
 importantly+0.416
 rulers+0.413
Neg logit contributions
 up-0.366
acked-0.337
level-0.324
 hurts-0.316
aned-0.308
Token too
Token ID1165
Feature activation+1.21e+00

Pos logit contributions
 mate+0.108
 charity+0.095
 souven+0.094
 importantly+0.085
ities+0.085
Neg logit contributions
 up-0.076
 hurts-0.060
acked-0.059
level-0.058
aned-0.057
Token dizz
Token ID29527
Feature activation-1.99e+00

Pos logit contributions
 mate+0.056
 charity+0.048
 souven+0.048
ities+0.044
 importantly+0.044
Neg logit contributions
 up-0.037
acked-0.031
 hurts-0.030
level-0.030
erved-0.029
Tokeny
Token ID88
Feature activation-3.97e+00

Pos logit contributions
 up+0.062
level+0.061
acked+0.059
ulum+0.058
faced+0.058
Neg logit contributions
 mate-0.079
 importantly-0.061
ities-0.060
 souven-0.060
 than-0.060
Token to
Token ID284
Feature activation-5.87e-01

Pos logit contributions
 up+0.111
 hurts+0.085
acked+0.084
erved+0.079
level+0.077
Neg logit contributions
 mate-0.199
 souven-0.172
 charity-0.168
 importantly-0.162
 rulers-0.159
Token make
Token ID787
Feature activation+1.30e+00

Pos logit contributions
 up+0.038
 hurts+0.031
acked+0.030
aned+0.030
level+0.030
Neg logit contributions
 mate-0.045
 souven-0.038
 charity-0.036
 queens-0.035
 importantly-0.035
Token sure
Token ID1654
Feature activation+3.05e+00

Pos logit contributions
 mate+0.056
 souven+0.047
 charity+0.046
 importantly+0.043
 rulers+0.042
Neg logit contributions
 up-0.043
acked-0.038
level-0.037
 hurts-0.036
ulum-0.036
Token that
Token ID326
Feature activation+2.79e+00

Pos logit contributions
 mate+0.131
 souven+0.111
ities+0.108
 importantly+0.105
34+0.104
Neg logit contributions
 up-0.100
 hurts-0.078
level-0.071
uck-0.071
 rains-0.069
Token the
Token ID262
Feature activation-2.18e-01

Pos logit contributions
 mate+0.120
 souven+0.100
 charity+0.094
 importantly+0.094
34+0.093
Neg logit contributions
 up-0.098
 hurts-0.082
 rains-0.076
acked-0.071
ulum-0.070
Token big
Token ID1263
Feature activation+1.20e+00

Pos logit contributions
 up+0.008
 hurts+0.006
 rains+0.006
acked+0.006
>+0.006
Neg logit contributions
 mate-0.009
 souven-0.007
 importantly-0.007
34-0.007
 charity-0.007
Token ones
Token ID3392
Feature activation+1.19e+01

Pos logit contributions
 mate+0.050
 souven+0.041
 charity+0.040
 importantly+0.040
34+0.038
Neg logit contributions
 up-0.043
 hurts-0.036
acked-0.035
 rains-0.035
ulum-0.034
Token are
Token ID389
Feature activation+3.06e+00

Pos logit contributions
 mate+0.536
 souven+0.459
 charity+0.446
 importantly+0.423
ities+0.413
Neg logit contributions
 up-0.394
 hurts-0.332
acked-0.321
level-0.308
 rains-0.299
Token in
Token ID287
Feature activation+1.28e+00

Pos logit contributions
 mate+0.133
 charity+0.114
 souven+0.113
ities+0.107
 importantly+0.105
Neg logit contributions
 up-0.108
 hurts-0.074
 down-0.071
>-0.070
acked-0.070
Token one
Token ID530
Feature activation+7.62e+00

Pos logit contributions
 mate+0.070
 importantly+0.061
34+0.061
ities+0.061
uries+0.061
Neg logit contributions
 up-0.026
>-0.015
 hurts-0.014
acked-0.013
ulously-0.012
Token side
Token ID1735
Feature activation+3.56e+00

Pos logit contributions
 mate+0.340
 souven+0.280
 charity+0.275
ities+0.269
 importantly+0.263
Neg logit contributions
 up-0.254
acked-0.209
 hurts-0.207
uck-0.193
 rains-0.192
Token,
Token ID11
Feature activation+3.16e+00

Pos logit contributions
 mate+0.162
 souven+0.140
 charity+0.138
 importantly+0.130
 rulers+0.126
Neg logit contributions
 up-0.106
acked-0.094
 hurts-0.087
level-0.084
erved-0.082
TokenOne
Token ID3198
Feature activation+9.12e+00

Pos logit contributions
 mate+0.167
 charity+0.141
 souven+0.141
 importantly+0.131
 rulers+0.128
Neg logit contributions
 up-0.119
acked-0.110
level-0.107
 hurts-0.104
erved-0.103
Token day
Token ID1110
Feature activation-1.00e+00

Pos logit contributions
 mate+0.390
 charity+0.330
 souven+0.326
 rulers+0.302
ities+0.302
Neg logit contributions
 up-0.294
acked-0.268
level-0.252
 hurts-0.249
faced-0.243
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
 up+0.027
acked+0.022
level+0.021
uck+0.019
faced+0.019
Neg logit contributions
 mate-0.048
 charity-0.043
 souven-0.041
 importantly-0.040
ities-0.040
Token the
Token ID262
Feature activation-4.15e-01

Pos logit contributions
 mate+0.171
 charity+0.149
 importantly+0.141
 souven+0.141
ities+0.141
Neg logit contributions
 up-0.109
acked-0.077
level-0.076
faced-0.075
forth-0.072
Token little
Token ID1310
Feature activation+1.84e+00

Pos logit contributions
 up+0.015
level+0.013
acked+0.013
 hurts+0.013
faced+0.012
Neg logit contributions
 mate-0.016
 souven-0.013
 charity-0.013
 importantly-0.012
ities-0.012
Token ones
Token ID3392
Feature activation+1.19e+01

Pos logit contributions
 mate+0.078
 souven+0.066
 charity+0.066
 importantly+0.062
 rulers+0.060
Neg logit contributions
 up-0.059
acked-0.054
level-0.052
 hurts-0.050
ulum-0.050
Token had
Token ID550
Feature activation-1.09e+00

Pos logit contributions
 mate+0.531
 souven+0.456
 charity+0.455
 importantly+0.420
 rulers+0.419
Neg logit contributions
 up-0.356
acked-0.331
level-0.313
 hurts-0.300
aned-0.297
Token an
Token ID281
Feature activation+4.95e+00

Pos logit contributions
 up+0.034
acked+0.031
level+0.030
erved+0.029
 hurts+0.029
Neg logit contributions
 mate-0.047
 souven-0.040
 charity-0.040
 importantly-0.038
 rulers-0.037
Token idea
Token ID2126
Feature activation+1.34e+00

Pos logit contributions
 mate+0.172
 souven+0.138
 charity+0.136
 importantly+0.124
 rulers+0.122
Neg logit contributions
 up-0.197
acked-0.187
level-0.181
 hurts-0.176
aned-0.176
Token.
Token ID13
Feature activation+5.55e-01

Pos logit contributions
 mate+0.069
 charity+0.060
 queens+0.060
 rulers+0.059
 souven+0.059
Neg logit contributions
 up-0.034
 hurts-0.023
acked-0.023
faced-0.023
level-0.023
Token They
Token ID1119
Feature activation+2.51e+00

Pos logit contributions
 mate+0.028
 souven+0.025
 charity+0.024
 rulers+0.023
 importantly+0.023
Neg logit contributions
 up-0.013
acked-0.012
level-0.011
 hurts-0.010
ulum-0.010
Token the
Token ID262
Feature activation-2.17e+00

Pos logit contributions
 mate+0.084
 souven+0.072
 importantly+0.072
 charity+0.071
ities+0.068
Neg logit contributions
 up-0.048
 hurts-0.036
acked-0.036
level-0.035
erved-0.035
Token competition
Token ID5449
Feature activation+2.49e+00

Pos logit contributions
 up+0.077
 hurts+0.063
level+0.062
erved+0.061
acked+0.060
Neg logit contributions
 mate-0.086
 importantly-0.071
 souven-0.069
 charity-0.068
ities-0.068
Token.
Token ID13
Feature activation+4.52e-01

Pos logit contributions
 mate+0.122
 souven+0.107
 importantly+0.105
 charity+0.104
ities+0.101
Neg logit contributions
 up-0.071
acked-0.055
hed-0.050
ulum-0.049
 hurts-0.049
Token The
Token ID383
Feature activation+1.20e+00

Pos logit contributions
 mate+0.024
 souven+0.021
 charity+0.020
 rulers+0.020
 importantly+0.019
Neg logit contributions
 up-0.010
acked-0.009
level-0.008
ulum-0.008
 hurts-0.008
Token little
Token ID1310
Feature activation+1.71e+00

Pos logit contributions
 mate+0.046
 souven+0.038
 charity+0.038
 importantly+0.035
ities+0.034
Neg logit contributions
 up-0.043
acked-0.039
level-0.038
 hurts-0.037
faced-0.037
Token ones
Token ID3392
Feature activation+1.18e+01

Pos logit contributions
 mate+0.068
 charity+0.059
 souven+0.057
 importantly+0.054
ities+0.052
Neg logit contributions
 up-0.060
acked-0.053
level-0.052
 hurts-0.050
ulum-0.049
Token were
Token ID547
Feature activation+2.01e+00

Pos logit contributions
 mate+0.532
 charity+0.457
 souven+0.451
 importantly+0.416
 rulers+0.415
Neg logit contributions
 up-0.352
acked-0.329
level-0.309
 hurts-0.296
aned-0.292
Token so
Token ID523
Feature activation-4.83e-01

Pos logit contributions
 mate+0.092
 charity+0.083
ities+0.081
 souven+0.078
 queens+0.073
Neg logit contributions
 up-0.063
acked-0.047
level-0.045
faced-0.044
aned-0.043
Token happy
Token ID3772
Feature activation-2.65e+00

Pos logit contributions
 up+0.016
acked+0.015
level+0.014
 hurts+0.014
faced+0.013
Neg logit contributions
 mate-0.020
 charity-0.017
 souven-0.017
ities-0.016
 rulers-0.015
Token and
Token ID290
Feature activation-7.15e-01

Pos logit contributions
 up+0.074
acked+0.054
 hurts+0.052
uck+0.051
ulum+0.050
Neg logit contributions
 mate-0.132
 charity-0.117
 souven-0.113
 rulers-0.107
ities-0.106
Token now
Token ID783
Feature activation-1.87e+00

Pos logit contributions
 up+0.026
acked+0.022
level+0.021
 hurts+0.021
erved+0.020
Neg logit contributions
 mate-0.030
 charity-0.025
 souven-0.024
ities-0.024
 rulers-0.023
Token a
Token ID257
Feature activation+3.39e+00

Pos logit contributions
 mate+0.262
 souven+0.221
 charity+0.219
 rulers+0.207
 importantly+0.204
Neg logit contributions
 up-0.129
acked-0.128
level-0.122
ulum-0.121
aned-0.117
Token few
Token ID1178
Feature activation+7.36e+00

Pos logit contributions
 mate+0.130
ities+0.110
 importantly+0.106
 souven+0.105
 charity+0.102
Neg logit contributions
 up-0.116
 hurts-0.101
acked-0.100
faced-0.100
level-0.098
Token of
Token ID286
Feature activation+4.61e+00

Pos logit contributions
 mate+0.280
 charity+0.223
 souven+0.218
ities+0.214
34+0.204
Neg logit contributions
 up-0.266
acked-0.244
 hurts-0.237
chair-0.226
level-0.226
Token your
Token ID534
Feature activation+3.40e+00

Pos logit contributions
 mate+0.213
ities+0.191
34+0.185
uries+0.178
 charity+0.177
Neg logit contributions
 up-0.125
acked-0.097
level-0.090
>-0.089
 hits-0.086
Token favourite
Token ID12507
Feature activation+7.42e+00

Pos logit contributions
 mate+0.133
ities+0.106
 souven+0.105
 rulers+0.105
 charity+0.104
Neg logit contributions
 up-0.120
 hurts-0.102
erved-0.098
uddle-0.098
acked-0.098
Token ones
Token ID3392
Feature activation+1.18e+01

Pos logit contributions
 mate+0.301
 souven+0.242
 charity+0.238
 importantly+0.238
34+0.231
Neg logit contributions
 up-0.261
acked-0.231
 hurts-0.222
level-0.215
ulum-0.209
Token and
Token ID290
Feature activation+6.43e-01

Pos logit contributions
 mate+0.596
 souven+0.533
 charity+0.514
 importantly+0.487
 rulers+0.476
Neg logit contributions
 up-0.317
 hurts-0.252
acked-0.236
uck-0.219
hed-0.216
Token let
Token ID1309
Feature activation-1.21e-01

Pos logit contributions
 mate+0.026
 souven+0.022
ities+0.022
 charity+0.022
34+0.022
Neg logit contributions
 up-0.022
 hurts-0.017
erved-0.017
chair-0.017
acked-0.016
Token's
Token ID338
Feature activation+2.41e+00

Pos logit contributions
 up+0.003
acked+0.003
level+0.003
 hurts+0.003
uck+0.003
Neg logit contributions
 mate-0.006
 souven-0.005
 charity-0.005
 importantly-0.005
 rulers-0.005
Token go
Token ID467
Feature activation-9.82e-01

Pos logit contributions
 mate+0.107
 souven+0.092
 charity+0.090
 importantly+0.087
 rulers+0.084
Neg logit contributions
 up-0.076
acked-0.065
level-0.062
 hurts-0.061
erved-0.060
Token."
Token ID526
Feature activation-2.37e+00

Pos logit contributions
 up+0.030
acked+0.027
level+0.026
 hurts+0.025
erved+0.025
Neg logit contributions
 mate-0.044
 souven-0.037
 charity-0.037
 importantly-0.035
 rulers-0.034
Token appreci
Token ID5763
Feature activation+3.87e+00

Pos logit contributions
 mate+0.135
 charity+0.112
ities+0.109
 souven+0.108
34+0.107
Neg logit contributions
 up-0.124
 hurts-0.098
 The-0.091
 cries-0.091
 rains-0.090
Tokenative
Token ID876
Feature activation+3.17e+00

Pos logit contributions
 mate+0.215
 souven+0.187
 charity+0.185
 age+0.177
uries+0.170
Neg logit contributions
 up-0.085
acked-0.073
uck-0.066
level-0.061
irled-0.060
Token and
Token ID290
Feature activation+7.46e-01

Pos logit contributions
 mate+0.136
 souven+0.119
 charity+0.118
 rulers+0.107
 importantly+0.106
Neg logit contributions
 up-0.098
acked-0.084
 hurts-0.081
uck-0.076
 rains-0.074
Token supportive
Token ID16443
Feature activation+2.84e+00

Pos logit contributions
 mate+0.029
34+0.023
ities+0.023
 charity+0.023
 souven+0.022
Neg logit contributions
 up-0.029
 hurts-0.024
level-0.023
forth-0.021
 cries-0.021
Token of
Token ID286
Feature activation+5.75e+00

Pos logit contributions
 mate+0.126
 charity+0.113
 souven+0.112
nam+0.106
uries+0.102
Neg logit contributions
 up-0.085
 hurts-0.071
acked-0.070
uck-0.069
 cries-0.064
Token those
Token ID883
Feature activation+1.16e+01

Pos logit contributions
 mate+0.276
 souven+0.226
uries+0.215
34+0.214
 buyer+0.213
Neg logit contributions
 up-0.180
ulously-0.123
 hurts-0.116
level-0.112
 cries-0.109
Token around
Token ID1088
Feature activation+8.95e-01

Pos logit contributions
 mate+0.484
 souven+0.389
 charity+0.380
 buyer+0.355
 customer+0.351
Neg logit contributions
 up-0.444
 hurts-0.362
led-0.337
level-0.335
acked-0.335
Token us
Token ID514
Feature activation+4.13e+00

Pos logit contributions
 mate+0.042
 souven+0.036
 charity+0.034
 importantly+0.033
 buyer+0.032
Neg logit contributions
 up-0.028
acked-0.023
 hurts-0.022
level-0.021
 rains-0.020
Token.
Token ID13
Feature activation+3.58e-01

Pos logit contributions
 mate+0.183
 souven+0.157
 charity+0.154
 importantly+0.146
 rulers+0.142
Neg logit contributions
 up-0.132
acked-0.113
 hurts-0.109
level-0.108
aned-0.102
Token<|endoftext|>
Token ID50256
Feature activation-5.81e+00

Pos logit contributions
 mate+0.019
 souven+0.018
 rulers+0.017
 charity+0.017
ities+0.016
Neg logit contributions
 up-0.007
acked-0.006
 From-0.005
 The-0.005
uck-0.005
TokenOnce
Token ID7454
Feature activation+3.33e+00

Pos logit contributions
erved+0.172
aned+0.170
 up+0.165
level+0.162
faced+0.159
Neg logit contributions
 mate-0.244
 souven-0.206
 charity-0.205
 importantly-0.194
 ones-0.186
Token and
Token ID290
Feature activation-9.55e-01

Pos logit contributions
 mate+0.124
 importantly+0.095
 souven+0.093
 traveler+0.093
 crew+0.091
Neg logit contributions
 up-0.069
 was-0.040
 down-0.038
level-0.036
 hurts-0.035
Token knew
Token ID2993
Feature activation+7.22e-01

Pos logit contributions
 up+0.027
level+0.022
acked+0.022
aned+0.020
 hurts+0.020
Neg logit contributions
 mate-0.047
 souven-0.040
 charity-0.039
 rulers-0.037
34-0.037
Token that
Token ID326
Feature activation+1.80e+00

Pos logit contributions
 mate+0.035
 souven+0.028
uries+0.028
 charity+0.028
ities+0.027
Neg logit contributions
 up-0.023
 hurts-0.015
uck-0.015
level-0.015
forth-0.014
Token the
Token ID262
Feature activation-3.28e-01

Pos logit contributions
 mate+0.094
uries+0.072
34+0.071
 importantly+0.069
 traveler+0.068
Neg logit contributions
 up-0.058
 was-0.039
 hurts-0.038
's-0.037
 rains-0.034
Token gray
Token ID12768
Feature activation+1.38e+00

Pos logit contributions
 up+0.010
 hurts+0.008
level+0.007
 rains+0.007
 nearest+0.007
Neg logit contributions
 mate-0.016
 importantly-0.013
ities-0.012
 souven-0.012
 charity-0.012
Token ones
Token ID3392
Feature activation+1.16e+01

Pos logit contributions
 mate+0.060
 charity+0.051
 souven+0.050
 importantly+0.048
 rulers+0.045
Neg logit contributions
 up-0.045
acked-0.040
 hurts-0.039
level-0.038
ulum-0.037
Token were
Token ID547
Feature activation+1.04e+00

Pos logit contributions
 mate+0.567
 souven+0.465
 charity+0.458
ities+0.434
 rulers+0.433
Neg logit contributions
 up-0.370
 hurts-0.296
acked-0.290
level-0.283
chair-0.272
Token not
Token ID407
Feature activation+2.21e+00

Pos logit contributions
 mate+0.049
ities+0.040
 souven+0.040
 charity+0.040
 importantly+0.039
Neg logit contributions
 up-0.037
 over-0.021
 down-0.021
 hurts-0.020
 was-0.019
Token ready
Token ID3492
Feature activation-1.25e+00

Pos logit contributions
 mate+0.093
 charity+0.078
 souven+0.077
 importantly+0.076
34+0.073
Neg logit contributions
 up-0.079
 down-0.056
 hurts-0.050
level-0.047
acked-0.047
Token to
Token ID284
Feature activation+2.67e-02

Pos logit contributions
 up+0.036
 hurts+0.023
 down+0.021
 was+0.019
 rains+0.019
Neg logit contributions
 mate-0.065
 importantly-0.056
 souven-0.054
 rulers-0.052
 charity-0.052
Token eat
Token ID4483
Feature activation+1.55e+00

Pos logit contributions
 mate+0.001
 importantly+0.001
 souven+0.001
34+0.001
 queens+0.001
Neg logit contributions
 up-0.001
 hurts-0.001
level-0.001
forth-0.001
acked-0.001
Token.
Token ID13
Feature activation+8.01e-01

Pos logit contributions
 mate+0.082
 souven+0.071
 charity+0.070
 importantly+0.067
 rulers+0.065
Neg logit contributions
 up-0.055
 hurts-0.043
level-0.042
 cries-0.042
acked-0.042
Token "
Token ID366
Feature activation+4.50e+00

Pos logit contributions
 mate+0.033
 charity+0.028
 souven+0.028
 importantly+0.026
 rulers+0.025
Neg logit contributions
 up-0.025
acked-0.024
level-0.023
erved-0.023
 hurts-0.023
TokenCare
Token ID17784
Feature activation+7.45e+00

Pos logit contributions
 mate+0.209
 souven+0.175
 charity+0.170
 rulers+0.166
 importantly+0.164
Neg logit contributions
acked-0.127
 up-0.127
level-0.120
faced-0.114
 hurts-0.110
Tokenful
Token ID913
Feature activation+1.95e+00

Pos logit contributions
 mate+0.234
 souven+0.172
 charity+0.167
 rulers+0.150
 importantly+0.150
Neg logit contributions
acked-0.315
 up-0.307
level-0.302
ulum-0.300
erved-0.298
Token,
Token ID11
Feature activation+4.13e+00

Pos logit contributions
 mate+0.083
 souven+0.068
 charity+0.068
 rulers+0.063
 importantly+0.063
Neg logit contributions
 up-0.060
acked-0.059
level-0.058
ulum-0.055
erved-0.055
Token those
Token ID883
Feature activation+1.16e+01

Pos logit contributions
 mate+0.180
 souven+0.153
 rulers+0.149
 charity+0.146
ities+0.144
Neg logit contributions
 up-0.131
 hurts-0.109
 rains-0.098
level-0.098
chair-0.098
Token berries
Token ID36322
Feature activation+6.68e+00

Pos logit contributions
 mate+0.506
 souven+0.440
 charity+0.419
 importantly+0.387
 buyer+0.383
Neg logit contributions
 up-0.402
 hurts-0.336
 rains-0.301
acked-0.299
level-0.298
Token aren
Token ID3588
Feature activation+1.92e+00

Pos logit contributions
 mate+0.269
 souven+0.225
 charity+0.220
 importantly+0.206
 rulers+0.200
Neg logit contributions
 up-0.238
acked-0.215
 hurts-0.209
level-0.207
aned-0.201
Token't
Token ID470
Feature activation+2.50e+00

Pos logit contributions
 mate+0.093
 souven+0.080
 charity+0.080
 importantly+0.075
 rulers+0.074
Neg logit contributions
 up-0.050
acked-0.046
level-0.044
 hurts-0.042
aned-0.042
Token safe
Token ID3338
Feature activation-2.27e+00

Pos logit contributions
 mate+0.102
 charity+0.094
ities+0.094
 souven+0.091
 importantly+0.091
Neg logit contributions
 up-0.090
 hurts-0.066
 down-0.065
faced-0.058
>-0.057
Token to
Token ID284
Feature activation+7.63e-01

Pos logit contributions
 up+0.065
 hurts+0.049
acked+0.048
level+0.045
uck+0.045
Neg logit contributions
 mate-0.112
 souven-0.098
 charity-0.097
 importantly-0.094
 rulers-0.090
Token toys
Token ID14958
Feature activation+4.74e+00

Pos logit contributions
 mate+0.129
 souven+0.100
 charity+0.098
 rulers+0.089
 importantly+0.088
Neg logit contributions
 up-0.161
acked-0.149
level-0.146
 hurts-0.144
erved-0.141
Token and
Token ID290
Feature activation-2.94e-01

Pos logit contributions
 mate+0.243
 souven+0.211
 charity+0.209
 importantly+0.203
ities+0.197
Neg logit contributions
 up-0.122
acked-0.092
 hurts-0.084
level-0.083
hed-0.082
Token showed
Token ID3751
Feature activation+5.24e+00

Pos logit contributions
 up+0.010
acked+0.008
level+0.008
 hurts+0.008
erved+0.008
Neg logit contributions
 mate-0.012
 souven-0.010
 charity-0.010
ities-0.009
 rulers-0.009
Token Joe
Token ID5689
Feature activation+9.99e-01

Pos logit contributions
 mate+0.256
 charity+0.227
 souven+0.219
 importantly+0.212
 rulers+0.206
Neg logit contributions
 up-0.153
acked-0.125
level-0.119
faced-0.110
erved-0.110
Token which
Token ID543
Feature activation+4.51e+00

Pos logit contributions
 mate+0.034
 souven+0.028
 charity+0.027
 rulers+0.023
 importantly+0.022
Neg logit contributions
acked-0.040
ulum-0.038
level-0.037
aned-0.037
erved-0.037
Token ones
Token ID3392
Feature activation+1.15e+01

Pos logit contributions
 mate+0.201
 souven+0.182
ities+0.180
 charity+0.178
 importantly+0.176
Neg logit contributions
 up-0.143
 hurts-0.116
 rains-0.097
chair-0.097
acked-0.097
Token he
Token ID339
Feature activation+9.14e-01

Pos logit contributions
 mate+0.564
 charity+0.477
 souven+0.462
ities+0.440
 rulers+0.439
Neg logit contributions
 up-0.359
acked-0.282
 hurts-0.274
level-0.261
led-0.255
Token could
Token ID714
Feature activation-2.97e+00

Pos logit contributions
 mate+0.046
 souven+0.041
 charity+0.040
 rulers+0.037
ities+0.037
Neg logit contributions
 up-0.023
acked-0.020
faced-0.018
level-0.018
 hurts-0.018
Token fill
Token ID6070
Feature activation-1.10e+00

Pos logit contributions
 up+0.101
acked+0.090
 hurts+0.086
level+0.086
erved+0.085
Neg logit contributions
 mate-0.123
 souven-0.104
 charity-0.100
 importantly-0.098
 rulers-0.093
Token his
Token ID465
Feature activation+1.79e-01

Pos logit contributions
 up+0.020
acked+0.013
level+0.011
ulum+0.010
 hurts+0.010
Neg logit contributions
 mate-0.068
 souven-0.059
 charity-0.059
 importantly-0.059
 rulers-0.056
Token sack
Token ID23704
Feature activation-3.70e-01

Pos logit contributions
 mate+0.007
 charity+0.006
 souven+0.006
 rulers+0.006
 importantly+0.006
Neg logit contributions
 up-0.006
acked-0.006
level-0.005
aned-0.005
ulum-0.005
TokenSure
Token ID19457
Feature activation+4.99e+00

Pos logit contributions
 mate+0.179
 rulers+0.142
ities+0.142
34+0.142
 charity+0.141
Neg logit contributions
acked-0.106
 up-0.105
level-0.102
faced-0.102
uck-0.100
Token!
Token ID0
Feature activation+1.42e+00

Pos logit contributions
 mate+0.249
 charity+0.226
 souven+0.217
ities+0.210
 rulers+0.209
Neg logit contributions
 up-0.120
 hurts-0.106
uck-0.094
level-0.091
ulum-0.090
Token I
Token ID314
Feature activation+3.22e+00

Pos logit contributions
 mate+0.069
 souven+0.061
 charity+0.060
 rulers+0.058
 importantly+0.056
Neg logit contributions
 up-0.034
acked-0.032
uck-0.030
level-0.029
 hurts-0.029
Token recommend
Token ID4313
Feature activation+1.98e+00

Pos logit contributions
 mate+0.144
 souven+0.122
 importantly+0.122
 charity+0.120
ities+0.117
Neg logit contributions
 up-0.095
acked-0.088
level-0.083
 hurts-0.080
faced-0.079
Token the
Token ID262
Feature activation-2.51e-01

Pos logit contributions
 mate+0.088
 charity+0.072
 souven+0.072
 importantly+0.070
ities+0.068
Neg logit contributions
 up-0.063
 hurts-0.054
level-0.052
uck-0.049
acked-0.047
Token ones
Token ID3392
Feature activation+1.15e+01

Pos logit contributions
 up+0.008
 hurts+0.007
acked+0.006
 nearest+0.006
 rains+0.006
Neg logit contributions
 mate-0.011
 importantly-0.009
 souven-0.009
 charity-0.009
ities-0.008
Token with
Token ID351
Feature activation+7.11e+00

Pos logit contributions
 mate+0.517
 souven+0.454
 importantly+0.453
 charity+0.443
ities+0.433
Neg logit contributions
 up-0.361
acked-0.264
uck-0.264
 hurts-0.247
 down-0.245
Token the
Token ID262
Feature activation-6.32e-02

Pos logit contributions
 mate+0.348
 importantly+0.307
 charity+0.299
 souven+0.296
34+0.289
Neg logit contributions
 up-0.193
 hurts-0.151
acked-0.142
level-0.140
uck-0.128
Token brown
Token ID7586
Feature activation+1.37e+00

Pos logit contributions
 up+0.002
 hurts+0.002
 hits+0.001
 cries+0.001
 rains+0.001
Neg logit contributions
 mate-0.003
 importantly-0.003
 charity-0.002
 souven-0.002
34-0.002
Token shells
Token ID19679
Feature activation+5.56e+00

Pos logit contributions
 mate+0.059
 charity+0.053
 importantly+0.051
 souven+0.050
 possible+0.046
Neg logit contributions
 up-0.044
acked-0.038
 hurts-0.038
level-0.035
erved-0.034
Token.
Token ID13
Feature activation+1.21e+00

Pos logit contributions
 mate+0.260
 charity+0.232
 souven+0.231
 importantly+0.225
ities+0.212
Neg logit contributions
 up-0.159
acked-0.134
 hurts-0.125
uck-0.116
level-0.113
Token thankful
Token ID25535
Feature activation-8.84e-01

Pos logit contributions
 up+0.030
acked+0.028
level+0.026
 hurts+0.026
aned+0.025
Neg logit contributions
 mate-0.035
 charity-0.029
 souven-0.029
 importantly-0.027
 rulers-0.027
Token to
Token ID284
Feature activation+1.77e+00

Pos logit contributions
 up+0.027
 hurts+0.019
uck+0.018
 down+0.018
acked+0.017
Neg logit contributions
 mate-0.041
 charity-0.038
uries-0.036
ities-0.035
nam-0.035
Token remember
Token ID3505
Feature activation+4.03e+00

Pos logit contributions
 mate+0.090
 souven+0.080
 charity+0.079
ities+0.079
 importantly+0.077
Neg logit contributions
 up-0.042
 The-0.032
 hurts-0.031
acked-0.031
forth-0.029
Token the
Token ID262
Feature activation-4.86e-01

Pos logit contributions
 mate+0.214
 charity+0.184
uries+0.180
 souven+0.180
ities+0.179
Neg logit contributions
 up-0.098
>-0.057
level-0.056
acked-0.055
 hurts-0.055
Token bitter
Token ID12922
Feature activation+1.88e+00

Pos logit contributions
 up+0.017
 hurts+0.015
level+0.014
>+0.014
 rains+0.014
Neg logit contributions
 mate-0.020
 charity-0.015
 souven-0.015
ities-0.015
34-0.015
Token ones
Token ID3392
Feature activation+1.15e+01

Pos logit contributions
 mate+0.088
 charity+0.073
 souven+0.071
 rulers+0.067
34+0.067
Neg logit contributions
 up-0.061
 hurts-0.053
acked-0.051
level-0.047
uck-0.046
Token too
Token ID1165
Feature activation+2.59e+00

Pos logit contributions
 mate+0.578
 charity+0.486
 souven+0.480
ities+0.463
 rulers+0.452
Neg logit contributions
 up-0.353
acked-0.237
 hurts-0.236
chair-0.236
level-0.228
Token.
Token ID13
Feature activation+2.40e-01

Pos logit contributions
 mate+0.147
 charity+0.126
 souven+0.125
 buyer+0.119
 rulers+0.118
Neg logit contributions
 up-0.060
 hurts-0.039
acked-0.036
 cries-0.034
>-0.032
Token He
Token ID679
Feature activation+1.36e+00

Pos logit contributions
 mate+0.013
 souven+0.012
 charity+0.012
 rulers+0.011
 importantly+0.011
Neg logit contributions
 up-0.005
acked-0.004
level-0.004
 hurts-0.003
ulum-0.003
Token kept
Token ID4030
Feature activation-1.32e+00

Pos logit contributions
 mate+0.064
 souven+0.055
 charity+0.054
 importantly+0.051
 rulers+0.050
Neg logit contributions
 up-0.037
acked-0.035
level-0.033
ulum-0.032
 hurts-0.032
Token a
Token ID257
Feature activation+2.91e-01

Pos logit contributions
 up+0.042
acked+0.032
level+0.032
uck+0.030
 down+0.028
Neg logit contributions
 mate-0.065
 charity-0.056
 souven-0.053
ities-0.051
 importantly-0.051
Token and
Token ID290
Feature activation-8.08e-01

Pos logit contributions
 mate+0.013
 souven+0.011
 charity+0.011
 buyer+0.011
 rulers+0.011
Neg logit contributions
 up-0.005
 hurts-0.004
chair-0.003
 cries-0.003
 was-0.003
Token the
Token ID262
Feature activation-1.15e+00

Pos logit contributions
 up+0.026
 hurts+0.022
acked+0.021
level+0.021
erved+0.020
Neg logit contributions
 mate-0.036
 charity-0.031
 souven-0.030
 rulers-0.028
ities-0.028
Token l
Token ID300
Feature activation+5.62e+00

Pos logit contributions
 up+0.036
 hurts+0.032
level+0.030
acked+0.030
ulum+0.029
Neg logit contributions
 mate-0.050
 souven-0.041
 charity-0.041
 importantly-0.039
 rulers-0.039
Tokenitt
Token ID715
Feature activation+5.13e+00

Pos logit contributions
 mate+0.236
 souven+0.204
 charity+0.200
 importantly+0.180
 rulers+0.179
Neg logit contributions
 up-0.180
acked-0.176
aned-0.161
 hurts-0.160
level-0.158
Tokenlest
Token ID32712
Feature activation+6.93e+00

Pos logit contributions
 mate+0.250
 charity+0.217
 souven+0.215
 importantly+0.200
 rulers+0.197
Neg logit contributions
 up-0.134
acked-0.124
level-0.115
 hurts-0.114
aned-0.112
Token ones
Token ID3392
Feature activation+1.15e+01

Pos logit contributions
 mate+0.295
 charity+0.253
uries+0.240
34+0.239
 souven+0.238
Neg logit contributions
 up-0.231
 hurts-0.205
 cries-0.183
uck-0.173
acked-0.172
Token can
Token ID460
Feature activation-1.14e+00

Pos logit contributions
 mate+0.531
 souven+0.447
 charity+0.443
 rulers+0.415
 importantly+0.406
Neg logit contributions
 up-0.349
acked-0.308
 hurts-0.300
level-0.295
faced-0.274
Token go
Token ID467
Feature activation-2.02e+00

Pos logit contributions
 up+0.037
 hurts+0.032
acked+0.032
level+0.030
uck+0.030
Neg logit contributions
 mate-0.049
 souven-0.043
 importantly-0.042
 charity-0.041
ities-0.039
Token up
Token ID510
Feature activation-3.86e+00

Pos logit contributions
 up+0.054
acked+0.046
level+0.043
 hurts+0.042
erved+0.041
Neg logit contributions
 mate-0.101
 souven-0.086
 charity-0.085
 importantly-0.083
 rulers-0.081
Token and
Token ID290
Feature activation-1.78e+00

Pos logit contributions
 up+0.115
acked+0.105
level+0.100
 hurts+0.097
ulum+0.096
Neg logit contributions
 mate-0.175
 souven-0.147
 charity-0.146
 importantly-0.138
 rulers-0.136
Token down
Token ID866
Feature activation-2.23e+00

Pos logit contributions
 up+0.052
acked+0.044
level+0.043
 hurts+0.043
ulum+0.040
Neg logit contributions
 mate-0.085
 souven-0.072
 charity-0.071
 importantly-0.069
 rulers-0.067
Token frog
Token ID21264
Feature activation+2.11e+00

Pos logit contributions
 up+0.024
acked+0.021
 hurts+0.020
level+0.020
 rains+0.019
Neg logit contributions
 mate-0.029
 charity-0.024
 souven-0.024
ities-0.023
 importantly-0.023
Token had
Token ID550
Feature activation-1.68e+00

Pos logit contributions
 mate+0.103
 charity+0.091
 souven+0.090
ities+0.085
 rulers+0.085
Neg logit contributions
 up-0.054
acked-0.046
level-0.042
 hurts-0.041
faced-0.040
Token found
Token ID1043
Feature activation+1.34e+00

Pos logit contributions
 up+0.048
acked+0.042
level+0.039
 hurts+0.037
erved+0.037
Neg logit contributions
 mate-0.076
 charity-0.069
 souven-0.067
 importantly-0.063
 rulers-0.063
Token the
Token ID262
Feature activation-1.71e+00

Pos logit contributions
 mate+0.065
 charity+0.055
 souven+0.053
 importantly+0.052
ities+0.052
Neg logit contributions
 up-0.041
acked-0.032
level-0.029
hed-0.028
 hurts-0.028
Token perfect
Token ID2818
Feature activation+3.58e+00

Pos logit contributions
 up+0.069
 hurts+0.054
 rains+0.053
acked+0.051
level+0.051
Neg logit contributions
 mate-0.060
 charity-0.051
 importantly-0.051
ities-0.050
 souven-0.048
Token mate
Token ID16133
Feature activation+1.14e+01

Pos logit contributions
 mate+0.144
 charity+0.129
 importantly+0.123
34+0.122
ities+0.119
Neg logit contributions
 up-0.126
acked-0.104
 hurts-0.099
uck-0.094
level-0.094
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
 mate+0.536
 charity+0.519
 souven+0.484
 importantly+0.464
ities+0.459
Neg logit contributions
 up-0.311
acked-0.251
uck-0.204
 hurts-0.201
level-0.200
Token<|endoftext|>
Token ID50256
Feature activation-5.65e+00

Pos logit contributions
 mate+0.005
 souven+0.005
 charity+0.005
 rulers+0.004
ities+0.004
Neg logit contributions
 up-0.002
acked-0.002
level-0.002
ulum-0.002
 hurts-0.002
TokenOnce
Token ID7454
Feature activation+3.46e+00

Pos logit contributions
erved+0.169
aned+0.169
 up+0.164
level+0.163
faced+0.158
Neg logit contributions
 mate-0.235
 souven-0.196
 charity-0.193
 importantly-0.186
 ones-0.177
Token upon
Token ID2402
Feature activation-2.03e+00

Pos logit contributions
 mate+0.140
 souven+0.114
 charity+0.111
 importantly+0.107
 rulers+0.102
Neg logit contributions
 up-0.113
acked-0.112
level-0.108
erved-0.106
ulum-0.106
Token a
Token ID257
Feature activation+2.03e+00

Pos logit contributions
acked+0.092
aned+0.091
level+0.085
uddle+0.084
umbled+0.083
Neg logit contributions
 mate-0.060
 souven-0.047
 charity-0.046
 importantly-0.041
 rulers-0.039
Token a
Token ID257
Feature activation+2.18e+00

Pos logit contributions
 mate+0.088
 charity+0.074
 queens+0.070
 souven+0.070
 importantly+0.067
Neg logit contributions
 up-0.069
level-0.055
erved-0.052
 hurts-0.051
acked-0.051
Token few
Token ID1178
Feature activation+6.44e+00

Pos logit contributions
 mate+0.090
ities+0.082
ances+0.081
 importantly+0.078
34+0.075
Neg logit contributions
 up-0.069
 hurts-0.058
 snowball-0.055
uddle-0.054
faced-0.052
Token of
Token ID286
Feature activation+3.69e+00

Pos logit contributions
 mate+0.283
 charity+0.217
 possible+0.209
 souven+0.204
 buyer+0.204
Neg logit contributions
 up-0.220
acked-0.185
 hurts-0.184
 cries-0.179
chair-0.176
Token the
Token ID262
Feature activation-1.50e+00

Pos logit contributions
 mate+0.187
34+0.158
ities+0.157
 rulers+0.149
 importantly+0.149
Neg logit contributions
 up-0.091
level-0.066
covered-0.061
 cries-0.057
ulously-0.057
Token smallest
Token ID18197
Feature activation+6.02e+00

Pos logit contributions
 up+0.047
 hurts+0.038
 rains+0.035
chair+0.035
 cries+0.034
Neg logit contributions
 mate-0.072
 souven-0.057
 importantly-0.056
 queens-0.055
 crew-0.055
Token ones
Token ID3392
Feature activation+1.14e+01

Pos logit contributions
 mate+0.260
 charity+0.211
 souven+0.210
 importantly+0.200
34+0.199
Neg logit contributions
 up-0.205
 hurts-0.165
acked-0.165
chair-0.155
 cries-0.152
Token to
Token ID284
Feature activation+1.84e+00

Pos logit contributions
 mate+0.615
 charity+0.525
 souven+0.522
 rulers+0.490
ities+0.488
Neg logit contributions
 up-0.311
 hurts-0.200
 down-0.193
acked-0.185
level-0.184
Token put
Token ID1234
Feature activation-4.13e-01

Pos logit contributions
 mate+0.086
 queens+0.072
 importantly+0.072
 souven+0.071
34+0.071
Neg logit contributions
 up-0.056
ulum-0.042
level-0.041
 hurts-0.040
forth-0.039
Token on
Token ID319
Feature activation-2.89e-01

Pos logit contributions
 up+0.014
acked+0.010
 down+0.009
level+0.009
hed+0.008
Neg logit contributions
 mate-0.020
 importantly-0.018
 charity-0.017
 souven-0.016
 age-0.016
Token her
Token ID607
Feature activation+6.89e-01

Pos logit contributions
 up+0.009
acked+0.006
 hurts+0.006
level+0.006
ulum+0.006
Neg logit contributions
 mate-0.015
 importantly-0.012
 souven-0.012
 charity-0.011
ities-0.011
Token shelf
Token ID18316
Feature activation-3.97e-02

Pos logit contributions
 mate+0.029
 souven+0.024
 importantly+0.023
 charity+0.023
 rulers+0.022
Neg logit contributions
 up-0.024
 hurts-0.020
acked-0.019
level-0.019
erved-0.019
Token one
Token ID530
Feature activation+7.62e+00

Pos logit contributions
 mate+0.070
 importantly+0.061
34+0.061
ities+0.061
uries+0.061
Neg logit contributions
 up-0.026
>-0.015
 hurts-0.014
acked-0.013
ulously-0.012
Token side
Token ID1735
Feature activation+3.56e+00

Pos logit contributions
 mate+0.340
 souven+0.280
 charity+0.275
ities+0.269
 importantly+0.263
Neg logit contributions
 up-0.254
acked-0.209
 hurts-0.207
uck-0.193
 rains-0.192
Token,
Token ID11
Feature activation+3.16e+00

Pos logit contributions
 mate+0.162
 souven+0.140
 charity+0.138
 importantly+0.130
 rulers+0.126
Neg logit contributions
 up-0.106
acked-0.094
 hurts-0.087
level-0.084
erved-0.082
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
 mate+0.136
 souven+0.116
 charity+0.114
 importantly+0.110
ities+0.108
Neg logit contributions
 up-0.102
 hurts-0.075
acked-0.074
ulum-0.073
level-0.073
Token small
Token ID1402
Feature activation+2.10e+00

Pos logit contributions
 up+0.038
 hurts+0.031
>+0.030
 rains+0.030
level+0.029
Neg logit contributions
 mate-0.045
 souven-0.036
 importantly-0.035
34-0.035
 charity-0.034
Token ones
Token ID3392
Feature activation+1.13e+01

Pos logit contributions
 mate+0.085
 souven+0.072
 charity+0.070
 importantly+0.069
34+0.064
Neg logit contributions
 up-0.073
acked-0.060
 hurts-0.060
ulum-0.060
 rains-0.058
Token in
Token ID287
Feature activation+1.66e+00

Pos logit contributions
 mate+0.490
 souven+0.423
 charity+0.422
 importantly+0.390
ities+0.386
Neg logit contributions
 up-0.367
 hurts-0.310
acked-0.308
level-0.288
 rains-0.277
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
 mate+0.085
 importantly+0.079
uries+0.078
ities+0.077
34+0.076
Neg logit contributions
 up-0.032
 The-0.020
 air-0.019
 hurts-0.019
ulously-0.019
Token other
Token ID584
Feature activation+3.49e+00

Pos logit contributions
 up+0.035
acked+0.028
 hurts+0.028
level+0.027
ulum+0.026
Neg logit contributions
 mate-0.055
 souven-0.046
 importantly-0.045
 charity-0.044
ities-0.041
Token side
Token ID1735
Feature activation+3.95e+00

Pos logit contributions
 mate+0.168
 souven+0.147
 importantly+0.142
 charity+0.135
 possible+0.128
Neg logit contributions
 up-0.100
's-0.075
 hurts-0.074
acked-0.073
 air-0.073
Token and
Token ID290
Feature activation-5.56e-01

Pos logit contributions
 mate+0.172
 souven+0.148
 charity+0.146
 importantly+0.136
 rulers+0.132
Neg logit contributions
 up-0.123
acked-0.113
 hurts-0.104
level-0.103
erved-0.100
TokenL
Token ID43
Feature activation+5.08e+00

Pos logit contributions
 mate+0.174
 souven+0.144
 charity+0.141
 rulers+0.134
 importantly+0.133
Neg logit contributions
 up-0.115
acked-0.113
level-0.108
 hurts-0.104
faced-0.103
Tokenily
Token ID813
Feature activation+7.05e-01

Pos logit contributions
 mate+0.169
 charity+0.135
 souven+0.127
 importantly+0.120
 rulers+0.116
Neg logit contributions
 up-0.209
acked-0.195
level-0.195
ulum-0.189
erved-0.188
Token,
Token ID11
Feature activation+4.11e+00

Pos logit contributions
 mate+0.026
 souven+0.020
 charity+0.020
 age+0.020
 importantly+0.020
Neg logit contributions
acked-0.024
 up-0.024
level-0.024
erved-0.023
ulum-0.022
Token you
Token ID345
Feature activation+4.15e+00

Pos logit contributions
 mate+0.159
 souven+0.135
 charity+0.132
 importantly+0.131
 buyer+0.128
Neg logit contributions
 up-0.148
 hurts-0.131
level-0.122
uck-0.118
 rains-0.115
Token know
Token ID760
Feature activation+4.94e+00

Pos logit contributions
 mate+0.162
 souven+0.126
 charity+0.125
 rulers+0.118
 importantly+0.117
Neg logit contributions
 up-0.149
acked-0.145
level-0.139
ulum-0.137
erved-0.133
Token those
Token ID883
Feature activation+1.13e+01

Pos logit contributions
 mate+0.189
 souven+0.155
 charity+0.153
 importantly+0.142
 rulers+0.140
Neg logit contributions
 up-0.176
acked-0.171
level-0.163
aned-0.158
erved-0.157
Token cookies
Token ID14746
Feature activation+4.33e+00

Pos logit contributions
 mate+0.567
 souven+0.482
 charity+0.466
 importantly+0.438
 buyer+0.434
Neg logit contributions
 up-0.332
 hurts-0.281
acked-0.255
level-0.247
chair-0.236
Token are
Token ID389
Feature activation+2.27e+00

Pos logit contributions
 mate+0.187
 souven+0.159
 charity+0.154
 importantly+0.147
 rulers+0.143
Neg logit contributions
 up-0.140
acked-0.126
 hurts-0.124
level-0.121
aned-0.117
Token important
Token ID1593
Feature activation+1.46e+00

Pos logit contributions
 mate+0.099
 souven+0.082
 charity+0.081
 importantly+0.077
 rulers+0.074
Neg logit contributions
 up-0.081
 hurts-0.066
acked-0.061
aned-0.061
>-0.060
Token.
Token ID13
Feature activation+6.31e-01

Pos logit contributions
 mate+0.077
 souven+0.066
 charity+0.065
 importantly+0.062
 rulers+0.061
Neg logit contributions
 up-0.036
 hurts-0.028
acked-0.027
uck-0.025
aned-0.025
Token You
Token ID921
Feature activation+3.42e+00

Pos logit contributions
 mate+0.031
 souven+0.028
 charity+0.026
 rulers+0.026
 possible+0.025
Neg logit contributions
 up-0.015
acked-0.014
 hurts-0.013
ulum-0.012
uck-0.012
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
 mate+0.038
 souven+0.035
 importantly+0.034
 charity+0.034
 cents+0.033
Neg logit contributions
 up-0.017
acked-0.010
chair-0.010
hed-0.009
 around-0.008
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
 mate+0.049
 souven+0.039
 charity+0.039
 importantly+0.037
 rulers+0.035
Neg logit contributions
 up-0.057
acked-0.051
 hurts-0.050
ulum-0.050
level-0.049
Token one
Token ID530
Feature activation+7.41e+00

Pos logit contributions
 up+0.036
 hurts+0.029
level+0.029
acked+0.028
chair+0.028
Neg logit contributions
 mate-0.043
 souven-0.036
 charity-0.036
34-0.035
ities-0.035
Token of
Token ID286
Feature activation+4.32e+00

Pos logit contributions
 mate+0.338
 charity+0.289
 souven+0.281
ities+0.273
 rulers+0.267
Neg logit contributions
 up-0.224
acked-0.197
 hurts-0.183
level-0.181
chair-0.177
Token his
Token ID465
Feature activation+2.48e+00

Pos logit contributions
 mate+0.245
 souven+0.217
 charity+0.212
 cents+0.211
 rulers+0.208
Neg logit contributions
 up-0.084
acked-0.058
ulum-0.049
level-0.048
ulously-0.048
Token favorites
Token ID18852
Feature activation+1.13e+01

Pos logit contributions
 mate+0.091
 souven+0.075
 charity+0.073
 possible+0.069
 cents+0.068
Neg logit contributions
 up-0.095
 hurts-0.080
acked-0.079
 cries-0.079
erved-0.076
Token was
Token ID373
Feature activation-7.83e-03

Pos logit contributions
 mate+0.545
 souven+0.468
 charity+0.460
 importantly+0.432
 rulers+0.427
Neg logit contributions
 up-0.303
acked-0.278
level-0.257
 hurts-0.253
ulum-0.247
Token catching
Token ID16508
Feature activation+2.09e+00

Pos logit contributions
 up+0.000
acked+0.000
level+0.000
faced+0.000
 hurts+0.000
Neg logit contributions
 mate-0.000
 souven-0.000
 charity-0.000
ities-0.000
 importantly-0.000
Token.
Token ID13
Feature activation-5.99e-01

Pos logit contributions
 mate+0.083
 souven+0.069
 charity+0.068
 importantly+0.062
 rulers+0.062
Neg logit contributions
 up-0.071
acked-0.068
level-0.066
aned-0.065
erved-0.064
Token Every
Token ID3887
Feature activation+3.82e+00

Pos logit contributions
 up+0.017
acked+0.016
level+0.015
 hurts+0.015
ulum+0.015
Neg logit contributions
 mate-0.028
 souven-0.024
 charity-0.023
 importantly-0.022
 rulers-0.022
Token day
Token ID1110
Feature activation-1.71e+00

Pos logit contributions
 mate+0.139
 souven+0.112
 charity+0.106
 importantly+0.100
 rulers+0.097
Neg logit contributions
 up-0.142
acked-0.139
aned-0.137
level-0.137
ulum-0.130
Token It
Token ID632
Feature activation-1.52e+00

Pos logit contributions
 Now+0.008
 up+0.008
 The+0.008
acked+0.007
 From+0.007
Neg logit contributions
 mate-0.019
 souven-0.017
 charity-0.016
 rulers-0.016
 buyer-0.016
Token's
Token ID338
Feature activation+2.32e+00

Pos logit contributions
 up+0.026
acked+0.021
 hurts+0.021
level+0.018
 rains+0.017
Neg logit contributions
 mate-0.089
 souven-0.079
 charity-0.079
 importantly-0.075
ities-0.074
Token one
Token ID530
Feature activation+7.73e+00

Pos logit contributions
 mate+0.111
 charity+0.096
 souven+0.095
ities+0.092
 cents+0.091
Neg logit contributions
 up-0.074
>-0.049
 hurts-0.048
acked-0.043
level-0.043
Token of
Token ID286
Feature activation+4.35e+00

Pos logit contributions
 mate+0.331
 charity+0.280
ities+0.272
 souven+0.270
 rulers+0.254
Neg logit contributions
 up-0.256
 hurts-0.216
acked-0.214
chair-0.208
faced-0.202
Token my
Token ID616
Feature activation+4.15e+00

Pos logit contributions
 mate+0.232
ities+0.216
uries+0.214
34+0.209
 souven+0.209
Neg logit contributions
 up-0.075
<|endoftext|>-0.047
>-0.046
acked-0.043
ulously-0.041
Token favorites
Token ID18852
Feature activation+1.13e+01

Pos logit contributions
 mate+0.150
 importantly+0.117
 souven+0.115
 charity+0.113
 queens+0.110
Neg logit contributions
 up-0.154
 hurts-0.133
 cries-0.131
level-0.126
>-0.126
Token."
Token ID526
Feature activation-3.64e+00

Pos logit contributions
 mate+0.524
 souven+0.447
 charity+0.440
 importantly+0.416
 rulers+0.405
Neg logit contributions
 up-0.331
acked-0.284
 hurts-0.279
level-0.263
ulum-0.256
Token\n
Token ID198
Feature activation+2.20e+00

Pos logit contributions
 up+0.091
acked+0.083
level+0.075
 hurts+0.073
ulum+0.072
Neg logit contributions
 mate-0.181
 souven-0.155
 charity-0.154
 rulers-0.147
ities-0.144
Token\n
Token ID198
Feature activation+2.48e+00

Pos logit contributions
 mate+0.077
 charity+0.065
 souven+0.064
 importantly+0.060
 queens+0.052
Neg logit contributions
erved-0.083
level-0.081
aned-0.080
 up-0.079
acked-0.078
TokenThey
Token ID2990
Feature activation+1.05e+00

Pos logit contributions
 mate+0.097
 charity+0.080
 souven+0.080
 importantly+0.074
 rulers+0.072
Neg logit contributions
 up-0.089
acked-0.083
level-0.082
erved-0.080
aned-0.080
Token talked
Token ID6619
Feature activation+9.23e-01

Pos logit contributions
 mate+0.052
 charity+0.045
 souven+0.045
 importantly+0.043
 rulers+0.042
Neg logit contributions
 up-0.026
acked-0.024
level-0.023
erved-0.021
aned-0.021
Token different
Token ID1180
Feature activation+3.10e+00

Pos logit contributions
 up+0.027
acked+0.024
 hurts+0.023
level+0.022
erved+0.022
Neg logit contributions
 mate-0.035
 charity-0.030
 souven-0.030
 rulers-0.028
ities-0.027
Token from
Token ID422
Feature activation+3.79e+00

Pos logit contributions
 mate+0.152
 charity+0.130
 souven+0.126
 buyer+0.117
uries+0.115
Neg logit contributions
 up-0.097
acked-0.074
 hurts-0.073
level-0.067
ulum-0.065
Token all
Token ID477
Feature activation+4.96e+00

Pos logit contributions
 mate+0.197
 charity+0.170
 souven+0.169
 importantly+0.160
 rulers+0.157
Neg logit contributions
 up-0.103
ulum-0.072
acked-0.070
level-0.069
 hurts-0.069
Token the
Token ID262
Feature activation-6.53e-01

Pos logit contributions
 mate+0.241
 souven+0.208
 charity+0.207
 rulers+0.192
 importantly+0.192
Neg logit contributions
 up-0.134
acked-0.120
level-0.111
 hurts-0.110
erved-0.105
Token other
Token ID584
Feature activation+3.74e+00

Pos logit contributions
 up+0.020
 hurts+0.017
acked+0.015
 cries+0.015
 air+0.015
Neg logit contributions
 mate-0.031
 charity-0.024
 souven-0.024
34-0.023
uries-0.023
Token ones
Token ID3392
Feature activation+1.13e+01

Pos logit contributions
 mate+0.183
 souven+0.155
 charity+0.147
 buyer+0.143
 traveler+0.142
Neg logit contributions
 up-0.110
's-0.087
acked-0.086
 hurts-0.086
 air-0.084
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
 mate+0.566
 souven+0.486
 charity+0.486
 rulers+0.441
ities+0.434
Neg logit contributions
 up-0.344
 hurts-0.255
acked-0.239
chair-0.232
level-0.232
Token\n
Token ID198
Feature activation+3.51e+00

Pos logit contributions
 mate+0.008
 souven+0.007
 charity+0.007
 rulers+0.007
 importantly+0.007
Neg logit contributions
 up-0.004
acked-0.004
level-0.003
ulum-0.003
 hurts-0.003
Token\n
Token ID198
Feature activation+3.63e+00

Pos logit contributions
 mate+0.114
 charity+0.100
 souven+0.092
 importantly+0.087
 buyer+0.079
Neg logit contributions
 up-0.131
erved-0.130
level-0.129
acked-0.127
 hurts-0.127
TokenHis
Token ID6653
Feature activation+4.76e+00

Pos logit contributions
 mate+0.139
 charity+0.121
 souven+0.114
 importantly+0.108
 rulers+0.103
Neg logit contributions
 up-0.130
level-0.119
acked-0.119
 hurts-0.117
aned-0.112
Token mom
Token ID1995
Feature activation+1.77e+00

Pos logit contributions
 mate+0.175
 souven+0.141
 charity+0.140
 importantly+0.128
 rulers+0.126
Neg logit contributions
 up-0.185
acked-0.165
 hurts-0.165
level-0.162
erved-0.159
Token blue
Token ID4171
Feature activation-7.42e-01

Pos logit contributions
 mate+0.069
 souven+0.057
 charity+0.057
 importantly+0.054
 rulers+0.054
Neg logit contributions
 up-0.056
 hurts-0.048
erved-0.047
acked-0.046
level-0.045
Token socks
Token ID24359
Feature activation+2.87e+00

Pos logit contributions
 up+0.024
acked+0.022
level+0.021
 hurts+0.020
erved+0.020
Neg logit contributions
 mate-0.032
 souven-0.027
 charity-0.026
 importantly-0.025
 rulers-0.024
Token because
Token ID780
Feature activation+6.39e+00

Pos logit contributions
 mate+0.129
 souven+0.109
 charity+0.106
 importantly+0.104
 rulers+0.101
Neg logit contributions
 up-0.093
acked-0.076
 hurts-0.074
erved-0.072
level-0.071
Token the
Token ID262
Feature activation-6.21e-01

Pos logit contributions
 mate+0.271
 souven+0.231
 charity+0.228
ities+0.214
 rulers+0.213
Neg logit contributions
 up-0.210
acked-0.177
 hurts-0.175
level-0.170
chair-0.170
Token red
Token ID2266
Feature activation-2.37e-01

Pos logit contributions
 up+0.018
 hurts+0.015
 rains+0.014
erved+0.013
chair+0.013
Neg logit contributions
 mate-0.028
 souven-0.024
 importantly-0.024
 rulers-0.023
 charity-0.023
Token ones
Token ID3392
Feature activation+1.13e+01

Pos logit contributions
 up+0.009
acked+0.008
level+0.008
 hurts+0.008
erved+0.008
Neg logit contributions
 mate-0.009
 souven-0.007
 charity-0.007
 importantly-0.007
 rulers-0.007
Token were
Token ID547
Feature activation+9.59e-01

Pos logit contributions
 mate+0.546
 souven+0.465
 charity+0.458
 rulers+0.434
 importantly+0.425
Neg logit contributions
 up-0.334
 hurts-0.287
acked-0.267
level-0.260
chair-0.252
Token dirty
Token ID11841
Feature activation-3.90e+00

Pos logit contributions
 mate+0.044
 charity+0.039
 souven+0.038
 rulers+0.036
ities+0.036
Neg logit contributions
 up-0.030
 hurts-0.022
erved-0.020
acked-0.020
 rains-0.019
Token.
Token ID13
Feature activation-1.60e-01

Pos logit contributions
 up+0.121
 hurts+0.096
acked+0.095
erved+0.092
ulum+0.090
Neg logit contributions
 mate-0.184
 souven-0.158
 charity-0.153
 importantly-0.147
 rulers-0.144
Token Tim
Token ID5045
Feature activation+1.21e-01

Pos logit contributions
level+0.006
acked+0.006
 up+0.006
erved+0.006
aned+0.006
Neg logit contributions
 mate-0.005
 charity-0.004
 importantly-0.004
 souven-0.004
 age-0.004
Tokenmy
Token ID1820
Feature activation+8.83e-01

Pos logit contributions
 mate+0.003
 matter+0.002
ities+0.002
 remember+0.002
 age+0.002
Neg logit contributions
level-0.005
 explodes-0.005
 SPL-0.005
atches-0.005
acked-0.005
Token cookies
Token ID14746
Feature activation-1.11e+00

Pos logit contributions
 up+0.143
acked+0.124
 hurts+0.121
level+0.120
ulum+0.113
Neg logit contributions
 mate-0.193
 charity-0.161
 souven-0.160
 importantly-0.152
34-0.146
Token and
Token ID290
Feature activation-3.74e+00

Pos logit contributions
 up+0.032
acked+0.027
 hurts+0.025
hed+0.025
level+0.024
Neg logit contributions
 mate-0.055
 souven-0.046
 charity-0.045
 importantly-0.045
 rulers-0.043
Token he
Token ID339
Feature activation-2.80e+00

Pos logit contributions
 up+0.116
 hurts+0.102
acked+0.099
level+0.097
erved+0.092
Neg logit contributions
 mate-0.166
 charity-0.135
 souven-0.134
 importantly-0.131
34-0.129
Token couldn
Token ID3521
Feature activation-8.89e+00

Pos logit contributions
 up+0.081
acked+0.075
level+0.071
 hurts+0.070
erved+0.068
Neg logit contributions
 mate-0.127
 souven-0.108
 charity-0.107
 importantly-0.101
 rulers-0.099
Tokenâ
Token ID22940
Feature activation-4.81e+00

Pos logit contributions
 up+0.271
acked+0.268
level+0.267
erved+0.253
aned+0.250
Neg logit contributions
 mate-0.367
 charity-0.304
 souven-0.300
 rulers-0.280
 possible-0.277
Token
Token ID26391
Feature activation-1.08e+01

Pos logit contributions
 up+0.130
acked+0.117
level+0.114
 hurts+0.112
ulum+0.106
Neg logit contributions
 mate-0.225
 charity-0.194
 souven-0.188
uries-0.175
 possible-0.173
Token
Token ID8151
Feature activation-2.30e+00

Pos logit contributions
 up+0.331
acked+0.310
level+0.299
 hurts+0.287
aned+0.284
Neg logit contributions
 mate-0.469
 souven-0.394
 charity-0.391
 importantly-0.365
 rulers-0.361
Tokent
Token ID83
Feature activation+1.03e+00

Pos logit contributions
 up+0.081
acked+0.076
level+0.076
ulum+0.074
 hurts+0.074
Neg logit contributions
 mate-0.087
 souven-0.071
 charity-0.070
 importantly-0.065
 rulers-0.063
Token wait
Token ID4043
Feature activation-1.56e+00

Pos logit contributions
 mate+0.038
 souven+0.032
 charity+0.031
 importantly+0.030
 rulers+0.029
Neg logit contributions
 up-0.038
 hurts-0.034
acked-0.033
aned-0.033
chair-0.033
Token to
Token ID284
Feature activation-1.65e+00

Pos logit contributions
 up+0.048
acked+0.040
 hurts+0.039
level+0.039
erved+0.039
Neg logit contributions
 mate-0.070
 charity-0.059
 souven-0.059
 importantly-0.056
 rulers-0.055
Token get
Token ID651
Feature activation+2.22e-03

Pos logit contributions
 up+0.056
 hurts+0.047
level+0.047
acked+0.046
ulum+0.045
Neg logit contributions
 mate-0.068
 souven-0.058
 importantly-0.057
 charity-0.056
34-0.053
Token so
Token ID523
Feature activation-5.04e+00

Pos logit contributions
 up+0.269
acked+0.259
level+0.249
 hurts+0.240
aned+0.239
Neg logit contributions
 mate-0.330
 souven-0.274
 charity-0.269
 importantly-0.252
 rulers-0.249
Token scared
Token ID12008
Feature activation-4.20e+00

Pos logit contributions
 up+0.168
acked+0.159
level+0.153
 hurts+0.148
aned+0.147
Neg logit contributions
 mate-0.206
 souven-0.172
 charity-0.170
 importantly-0.158
 rulers-0.156
Token she
Token ID673
Feature activation-3.62e+00

Pos logit contributions
 up+0.129
acked+0.101
 hurts+0.100
chair+0.097
uck+0.094
Neg logit contributions
 mate-0.192
 charity-0.172
 souven-0.166
 rulers-0.157
ities-0.154
Token didn
Token ID1422
Feature activation-6.88e+00

Pos logit contributions
acked+0.109
 up+0.104
level+0.101
ulum+0.099
erved+0.095
Neg logit contributions
 mate-0.158
 souven-0.131
 charity-0.126
 importantly-0.120
 rulers-0.118
Tokenâ
Token ID22940
Feature activation-5.16e+00

Pos logit contributions
acked+0.230
level+0.220
 up+0.214
ulum+0.212
erved+0.205
Neg logit contributions
 mate-0.272
 souven-0.224
 charity-0.220
 possible-0.210
 rulers-0.201
Token
Token ID26391
Feature activation-1.05e+01

Pos logit contributions
 up+0.122
uddle+0.108
acked+0.108
level+0.107
 hurts+0.100
Neg logit contributions
 mate-0.239
 charity-0.213
uries-0.203
 souven-0.199
 age-0.198
Token
Token ID8151
Feature activation-1.93e+00

Pos logit contributions
 up+0.328
acked+0.308
level+0.294
 hurts+0.286
aned+0.284
Neg logit contributions
 mate-0.456
 souven-0.386
 charity-0.379
 importantly-0.355
 rulers-0.352
Tokent
Token ID83
Feature activation+6.52e-01

Pos logit contributions
 up+0.070
acked+0.066
level+0.065
ulum+0.064
erved+0.063
Neg logit contributions
 mate-0.071
 souven-0.058
 charity-0.057
 importantly-0.052
 rulers-0.051
Token know
Token ID760
Feature activation+2.72e+00

Pos logit contributions
 mate+0.025
 souven+0.021
 charity+0.021
 importantly+0.020
 rulers+0.020
Neg logit contributions
 up-0.024
 hurts-0.021
acked-0.021
aned-0.021
level-0.020
Token what
Token ID644
Feature activation+5.38e+00

Pos logit contributions
 mate+0.119
 souven+0.095
 buyer+0.093
 rulers+0.091
 charity+0.090
Neg logit contributions
 up-0.094
 hurts-0.075
chair-0.074
level-0.068
 cries-0.067
Token to
Token ID284
Feature activation+9.35e-01

Pos logit contributions
 mate+0.275
 souven+0.234
 charity+0.228
 rulers+0.223
ities+0.218
Neg logit contributions
 up-0.133
 hurts-0.115
level-0.108
acked-0.106
chair-0.105
Token now
Token ID783
Feature activation-5.04e+00

Pos logit contributions
 up+0.101
acked+0.089
 hurts+0.087
level+0.085
ulum+0.083
Neg logit contributions
 mate-0.109
 charity-0.091
 souven-0.089
 importantly-0.081
ities-0.081
Token and
Token ID290
Feature activation-3.80e+00

Pos logit contributions
 up+0.169
acked+0.153
level+0.147
 hurts+0.147
ulum+0.143
Neg logit contributions
 mate-0.209
 souven-0.174
 charity-0.174
 importantly-0.161
 rulers-0.159
Token it
Token ID340
Feature activation-5.60e+00

Pos logit contributions
acked+0.124
 up+0.120
level+0.116
aned+0.114
ulum+0.112
Neg logit contributions
 mate-0.154
 souven-0.129
 charity-0.126
 importantly-0.117
 rulers-0.116
Token didn
Token ID1422
Feature activation-6.94e+00

Pos logit contributions
acked+0.126
 up+0.125
level+0.125
ulum+0.120
erved+0.116
Neg logit contributions
 mate-0.272
 souven-0.230
 charity-0.220
 rulers-0.218
 importantly-0.211
Tokenâ
Token ID22940
Feature activation-5.21e+00

Pos logit contributions
acked+0.246
level+0.241
 up+0.227
awed+0.226
faced+0.225
Neg logit contributions
 mate-0.256
 souven-0.210
 charity-0.208
 possible-0.199
 rulers-0.186
Token
Token ID26391
Feature activation-1.05e+01

Pos logit contributions
 up+0.136
uddle+0.125
level+0.120
acked+0.107
 hurts+0.107
Neg logit contributions
 mate-0.224
 charity-0.199
uries-0.192
ities-0.186
 souven-0.184
Token
Token ID8151
Feature activation-1.88e+00

Pos logit contributions
 up+0.331
acked+0.309
level+0.303
 hurts+0.286
aned+0.284
Neg logit contributions
 mate-0.445
 charity-0.369
 souven-0.368
 importantly-0.343
 age-0.335
Tokent
Token ID83
Feature activation+8.18e-01

Pos logit contributions
 up+0.071
level+0.067
acked+0.066
ulum+0.066
erved+0.065
Neg logit contributions
 mate-0.065
 souven-0.052
 charity-0.051
 importantly-0.047
 age-0.046
Token seem
Token ID1283
Feature activation+2.63e+00

Pos logit contributions
 mate+0.030
 souven+0.024
 charity+0.024
 importantly+0.022
 rulers+0.022
Neg logit contributions
 up-0.031
acked-0.030
level-0.029
 hurts-0.028
aned-0.028
Token to
Token ID284
Feature activation-9.83e-01

Pos logit contributions
 mate+0.107
 souven+0.090
 charity+0.088
 importantly+0.082
 rulers+0.081
Neg logit contributions
 up-0.086
acked-0.084
level-0.080
aned-0.077
ulum-0.077
Token be
Token ID307
Feature activation+1.38e+00

Pos logit contributions
 up+0.032
 hurts+0.027
ulum+0.026
level+0.026
forth+0.025
Neg logit contributions
 mate-0.042
 souven-0.036
 importantly-0.035
 charity-0.035
 rulers-0.034
Token quick
Token ID2068
Feature activation-1.87e+00

Pos logit contributions
 mate+0.028
 charity+0.024
 souven+0.024
 importantly+0.023
ities+0.022
Neg logit contributions
 up-0.014
acked-0.011
 hurts-0.011
erved-0.011
aned-0.010
Token or
Token ID393
Feature activation+2.60e-01

Pos logit contributions
 up+0.052
acked+0.050
level+0.049
aned+0.046
ulum+0.046
Neg logit contributions
 mate-0.085
 souven-0.072
 charity-0.071
 importantly-0.066
 rulers-0.066
Token we
Token ID356
Feature activation+1.56e+00

Pos logit contributions
 mate+0.010
 souven+0.008
 charity+0.008
 importantly+0.008
 rulers+0.008
Neg logit contributions
 up-0.010
 hurts-0.008
level-0.008
erved-0.008
ulum-0.008
Token won
Token ID1839
Feature activation-3.59e+00

Pos logit contributions
 mate+0.063
 souven+0.054
 charity+0.053
 importantly+0.051
 rulers+0.048
Neg logit contributions
 up-0.052
acked-0.048
level-0.047
 hurts-0.046
erved-0.045
Tokenâ
Token ID22940
Feature activation-5.11e+00

Pos logit contributions
acked+0.110
 up+0.107
level+0.103
erved+0.098
aned+0.097
Neg logit contributions
 mate-0.152
 souven-0.128
 charity-0.127
 rulers-0.118
 importantly-0.116
Token
Token ID26391
Feature activation-1.04e+01

Pos logit contributions
 up+0.165
acked+0.155
level+0.149
 hurts+0.145
ulum+0.143
Neg logit contributions
 mate-0.214
 charity-0.178
 souven-0.177
 importantly-0.161
 rulers-0.161
Token
Token ID8151
Feature activation-2.24e+00

Pos logit contributions
 up+0.373
acked+0.354
level+0.339
 hurts+0.332
erved+0.331
Neg logit contributions
 mate-0.401
 souven-0.332
 charity-0.324
 importantly-0.302
 rulers-0.299
Tokent
Token ID83
Feature activation+5.80e-01

Pos logit contributions
 up+0.078
acked+0.074
level+0.073
ulum+0.071
erved+0.070
Neg logit contributions
 mate-0.087
 souven-0.071
 charity-0.070
 importantly-0.065
 rulers-0.064
Token be
Token ID307
Feature activation+3.42e-01

Pos logit contributions
 mate+0.025
 souven+0.022
 charity+0.022
 importantly+0.022
 rulers+0.020
Neg logit contributions
 up-0.018
 hurts-0.017
level-0.015
aned-0.015
faced-0.014
Token able
Token ID1498
Feature activation-1.29e+00

Pos logit contributions
 mate+0.016
 charity+0.015
 souven+0.014
 importantly+0.013
ities+0.013
Neg logit contributions
 up-0.010
acked-0.008
 hurts-0.008
level-0.007
aned-0.007
Token to
Token ID284
Feature activation-6.90e-01

Pos logit contributions
 up+0.031
acked+0.027
level+0.026
 hurts+0.025
erved+0.024
Neg logit contributions
 mate-0.065
 charity-0.057
 souven-0.056
 importantly-0.053
 rulers-0.053
Token happy
Token ID3772
Feature activation-6.30e+00

Pos logit contributions
 up+0.176
acked+0.153
level+0.143
 hurts+0.143
faced+0.138
Neg logit contributions
 mate-0.235
 charity-0.202
 souven-0.198
 rulers-0.183
 importantly-0.183
Token that
Token ID326
Feature activation-1.39e+00

Pos logit contributions
 up+0.168
 hurts+0.122
acked+0.117
uck+0.112
ulum+0.105
Neg logit contributions
 mate-0.318
 charity-0.281
 souven-0.270
34-0.261
 rulers-0.259
Token it
Token ID340
Feature activation-4.92e+00

Pos logit contributions
 up+0.039
 hurts+0.026
 was+0.024
's+0.023
 down+0.022
Neg logit contributions
 mate-0.071
34-0.062
ities-0.061
ances-0.061
 charity-0.061
Token wasn
Token ID2492
Feature activation-3.42e+00

Pos logit contributions
 up+0.116
acked+0.104
level+0.096
 hurts+0.096
aned+0.090
Neg logit contributions
 mate-0.254
 souven-0.221
 charity-0.221
 importantly-0.207
 rulers-0.205
Tokenâ
Token ID22940
Feature activation-5.26e+00

Pos logit contributions
 up+0.085
acked+0.075
level+0.071
 hurts+0.070
aned+0.067
Neg logit contributions
 mate-0.171
 souven-0.149
 charity-0.148
 importantly-0.141
 rulers-0.139
Token
Token ID26391
Feature activation-1.04e+01

Pos logit contributions
 up+0.127
level+0.112
acked+0.109
 hurts+0.105
erved+0.104
Neg logit contributions
 mate-0.250
 charity-0.219
 souven-0.213
 possible-0.202
 importantly-0.202
Token
Token ID8151
Feature activation-2.05e+00

Pos logit contributions
 up+0.324
acked+0.308
level+0.292
 hurts+0.283
ulum+0.282
Neg logit contributions
 mate-0.446
 souven-0.379
 charity-0.372
 importantly-0.348
 rulers-0.346
Tokent
Token ID83
Feature activation+7.01e-01

Pos logit contributions
 up+0.077
acked+0.072
level+0.071
ulum+0.070
erved+0.070
Neg logit contributions
 mate-0.073
 souven-0.059
 charity-0.058
 importantly-0.054
 age-0.052
Token getting
Token ID1972
Feature activation+1.03e+00

Pos logit contributions
 mate+0.030
 souven+0.026
 charity+0.026
 rulers+0.024
 importantly+0.023
Neg logit contributions
 up-0.023
acked-0.020
 hurts-0.019
level-0.019
aned-0.019
Token bigger
Token ID5749
Feature activation-2.14e+00

Pos logit contributions
 mate+0.047
 charity+0.039
 souven+0.039
 importantly+0.036
 rulers+0.036
Neg logit contributions
 up-0.031
acked-0.028
level-0.027
 hurts-0.026
aned-0.025
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
 up+0.061
 hurts+0.048
acked+0.046
 cries+0.043
 hits+0.041
Neg logit contributions
 mate-0.106
 charity-0.090
 souven-0.086
 buyer-0.081
 importantly-0.081
Token realized
Token ID6939
Feature activation-4.69e+00

Pos logit contributions
 up+0.143
acked+0.139
level+0.134
ulum+0.131
aned+0.129
Neg logit contributions
 mate-0.222
 souven-0.186
 charity-0.181
 importantly-0.171
 rulers-0.171
Token that
Token ID326
Feature activation-1.70e+00

Pos logit contributions
 up+0.153
acked+0.135
level+0.131
 hurts+0.130
ulum+0.128
Neg logit contributions
 mate-0.201
 souven-0.167
 charity-0.165
 rulers-0.154
 importantly-0.153
Token it
Token ID340
Feature activation-4.94e+00

Pos logit contributions
 up+0.058
 hurts+0.048
 rains+0.042
ulum+0.042
 cries+0.042
Neg logit contributions
 mate-0.077
 souven-0.060
 charity-0.059
34-0.058
 buyer-0.058
Token wasn
Token ID2492
Feature activation-3.71e+00

Pos logit contributions
acked+0.124
level+0.117
ulum+0.113
erved+0.113
 up+0.109
Neg logit contributions
 mate-0.222
 souven-0.189
 charity-0.186
 importantly-0.184
 none-0.182
Tokenâ
Token ID22940
Feature activation-5.34e+00

Pos logit contributions
 up+0.105
acked+0.103
level+0.098
ulum+0.094
erved+0.093
Neg logit contributions
 mate-0.169
 souven-0.143
 charity-0.141
 importantly-0.132
 rulers-0.131
Token
Token ID26391
Feature activation-1.04e+01

Pos logit contributions
 up+0.133
level+0.119
acked+0.117
ulum+0.111
erved+0.110
Neg logit contributions
 mate-0.246
 charity-0.216
 souven-0.210
 importantly-0.199
ities-0.199
Token
Token ID8151
Feature activation-1.84e+00

Pos logit contributions
 up+0.356
acked+0.337
level+0.325
aned+0.312
 hurts+0.312
Neg logit contributions
 mate-0.415
 souven-0.342
 charity-0.341
 importantly-0.316
 rulers-0.310
Tokent
Token ID83
Feature activation+4.67e-01

Pos logit contributions
 up+0.072
ulum+0.068
level+0.068
acked+0.067
erved+0.067
Neg logit contributions
 mate-0.058
 charity-0.046
 souven-0.046
 age-0.043
 importantly-0.042
Token fair
Token ID3148
Feature activation+1.72e-01

Pos logit contributions
 mate+0.019
 souven+0.016
 charity+0.016
 importantly+0.015
 rulers+0.015
Neg logit contributions
 up-0.016
acked-0.015
level-0.014
 hurts-0.014
aned-0.014
Token for
Token ID329
Feature activation+3.05e+00

Pos logit contributions
 mate+0.008
 souven+0.007
 charity+0.007
 importantly+0.006
 rulers+0.006
Neg logit contributions
 up-0.005
acked-0.004
 hurts-0.004
level-0.004
erved-0.004
Token her
Token ID607
Feature activation+1.08e+00

Pos logit contributions
 mate+0.146
 souven+0.118
 buyer+0.117
 rulers+0.115
 charity+0.113
Neg logit contributions
 up-0.085
 hurts-0.064
 cries-0.063
level-0.061
ulum-0.059
Token so
Token ID523
Feature activation-1.72e+00

Pos logit contributions
acked+0.053
 up+0.052
level+0.051
ulum+0.050
erved+0.048
Neg logit contributions
 mate-0.073
 souven-0.060
 charity-0.059
 importantly-0.057
 rulers-0.055
Token frightened
Token ID24776
Feature activation-2.38e+00

Pos logit contributions
 up+0.056
acked+0.055
ulum+0.055
level+0.054
erved+0.053
Neg logit contributions
 mate-0.067
 souven-0.056
 importantly-0.054
 charity-0.054
 rulers-0.050
Token he
Token ID339
Feature activation-2.44e+00

Pos logit contributions
 up+0.070
 hurts+0.054
acked+0.054
chair+0.052
uck+0.051
Neg logit contributions
 mate-0.119
 charity-0.103
 souven-0.101
 rulers-0.096
 buyer-0.092
Token couldn
Token ID3521
Feature activation-7.28e+00

Pos logit contributions
acked+0.071
 up+0.069
level+0.068
ulum+0.065
uck+0.062
Neg logit contributions
 mate-0.108
 souven-0.089
 charity-0.084
 importantly-0.082
 age-0.080
Tokenâ
Token ID22940
Feature activation-5.05e+00

Pos logit contributions
level+0.300
acked+0.291
faced+0.278
led+0.278
erved+0.278
Neg logit contributions
 mate-0.192
 possible-0.154
 charity-0.149
 souven-0.141
 anyone-0.140
Token
Token ID26391
Feature activation-1.04e+01

Pos logit contributions
 up+0.127
acked+0.116
level+0.113
ulum+0.103
 hurts+0.103
Neg logit contributions
 mate-0.238
 charity-0.209
 souven-0.202
ities-0.189
 customer-0.188
Token
Token ID8151
Feature activation-2.16e+00

Pos logit contributions
 up+0.348
acked+0.328
level+0.313
 hurts+0.308
erved+0.307
Neg logit contributions
 mate-0.425
 souven-0.355
 charity-0.346
 importantly-0.322
 rulers-0.321
Tokent
Token ID83
Feature activation+9.27e-01

Pos logit contributions
 up+0.076
acked+0.073
level+0.071
ulum+0.069
erved+0.069
Neg logit contributions
 mate-0.082
 souven-0.067
 charity-0.066
 importantly-0.061
 rulers-0.060
Token speak
Token ID2740
Feature activation+1.97e+00

Pos logit contributions
 mate+0.038
 importantly+0.034
 souven+0.034
 charity+0.033
 rulers+0.032
Neg logit contributions
 up-0.031
 hurts-0.026
pee-0.025
faced-0.024
bo-0.024
Token.
Token ID13
Feature activation-8.21e-01

Pos logit contributions
 mate+0.089
 charity+0.075
 souven+0.074
 importantly+0.071
 rulers+0.069
Neg logit contributions
 up-0.062
acked-0.053
 hurts-0.052
level-0.051
erved-0.049
Token He
Token ID679
Feature activation-2.51e-01

Pos logit contributions
 up+0.022
acked+0.020
level+0.019
 hurts+0.019
aned+0.018
Neg logit contributions
 mate-0.039
 souven-0.034
 charity-0.033
 importantly-0.031
 rulers-0.031
Token fun
Token ID1257
Feature activation-1.15e+00

Pos logit contributions
 mate+0.113
ities+0.098
 queens+0.092
34+0.091
 charity+0.091
Neg logit contributions
 up-0.089
 hurts-0.060
 down-0.057
uddle-0.052
faced-0.050
Token but
Token ID475
Feature activation+9.60e-01

Pos logit contributions
 up+0.033
 hurts+0.020
 down+0.020
chair+0.018
ulum+0.017
Neg logit contributions
 mate-0.062
 charity-0.054
 souven-0.053
 buyer-0.053
 cents-0.052
Token he
Token ID339
Feature activation-1.45e+00

Pos logit contributions
 mate+0.041
 souven+0.034
 charity+0.034
 rulers+0.031
 importantly+0.031
Neg logit contributions
 up-0.032
acked-0.029
level-0.028
 hurts-0.027
erved-0.027
Token wasn
Token ID2492
Feature activation-2.22e+00

Pos logit contributions
 up+0.037
acked+0.035
level+0.033
 hurts+0.032
ulum+0.032
Neg logit contributions
 mate-0.070
 souven-0.060
 charity-0.059
 importantly-0.056
 rulers-0.056
Tokenâ
Token ID22940
Feature activation-5.19e+00

Pos logit contributions
acked+0.065
 up+0.062
level+0.061
erved+0.058
ulum+0.057
Neg logit contributions
 mate-0.097
 charity-0.082
 souven-0.080
 possible-0.077
 importantly-0.076
Token
Token ID26391
Feature activation-1.04e+01

Pos logit contributions
 up+0.106
acked+0.090
level+0.089
 hurts+0.084
aned+0.083
Neg logit contributions
 mate-0.275
 charity-0.242
 souven-0.238
 importantly-0.225
 age-0.221
Token
Token ID8151
Feature activation-2.50e+00

Pos logit contributions
 up+0.331
acked+0.309
erved+0.293
ulum+0.293
 hurts+0.288
Neg logit contributions
 mate-0.438
 souven-0.385
 charity-0.367
 rulers-0.338
 importantly-0.333
Tokent
Token ID83
Feature activation+5.61e-01

Pos logit contributions
 up+0.085
acked+0.080
level+0.077
 hurts+0.075
aned+0.074
Neg logit contributions
 mate-0.101
 souven-0.084
 charity-0.083
 importantly-0.077
 rulers-0.076
Token sure
Token ID1654
Feature activation+6.77e-01

Pos logit contributions
 mate+0.022
 charity+0.019
 souven+0.019
 rulers+0.017
 importantly+0.017
Neg logit contributions
 up-0.021
acked-0.018
 hurts-0.018
aned-0.017
level-0.017
Token what
Token ID644
Feature activation+4.46e+00

Pos logit contributions
 mate+0.026
 souven+0.021
 charity+0.020
 buyer+0.019
 queens+0.019
Neg logit contributions
 up-0.025
 hurts-0.022
chair-0.021
erved-0.021
aned-0.020
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
 mate+0.223
 souven+0.186
 charity+0.183
 rulers+0.179
ities+0.176
Neg logit contributions
 up-0.116
 hurts-0.097
acked-0.092
level-0.090
chair-0.090
Token it
Token ID340
Feature activation-5.92e+00

Pos logit contributions
 up+0.067
acked+0.057
level+0.057
 hurts+0.056
ulum+0.055
Neg logit contributions
 mate-0.097
 souven-0.082
 charity-0.080
 importantly-0.076
 rulers-0.075
Token when
Token ID618
Feature activation+3.36e-01

Pos logit contributions
 up+0.202
acked+0.190
level+0.182
 hurts+0.178
aned+0.176
Neg logit contributions
 mate-0.240
 souven-0.199
 charity-0.196
 importantly-0.182
 rulers-0.180
Token things
Token ID1243
Feature activation+4.28e+00

Pos logit contributions
 mate+0.014
 souven+0.011
 charity+0.011
ities+0.010
 rulers+0.010
Neg logit contributions
 up-0.012
 hurts-0.010
acked-0.010
level-0.010
ulum-0.010
Token weren
Token ID6304
Feature activation-1.67e+00

Pos logit contributions
 mate+0.185
 souven+0.155
 charity+0.153
 importantly+0.143
 rulers+0.142
Neg logit contributions
 up-0.137
acked-0.126
level-0.121
 hurts-0.119
aned-0.116
Tokenâ
Token ID22940
Feature activation-4.96e+00

Pos logit contributions
acked+0.047
 up+0.047
level+0.045
ulum+0.043
erved+0.043
Neg logit contributions
 mate-0.074
 souven-0.062
 charity-0.062
 rulers-0.057
 importantly-0.057
Token
Token ID26391
Feature activation-1.03e+01

Pos logit contributions
 up+0.103
acked+0.093
level+0.091
 hurts+0.085
erved+0.083
Neg logit contributions
 mate-0.258
 charity-0.226
 souven-0.222
 importantly-0.209
ities-0.207
Token
Token ID8151
Feature activation-1.99e+00

Pos logit contributions
 up+0.316
acked+0.301
level+0.281
ulum+0.280
 hurts+0.277
Neg logit contributions
 mate-0.448
 souven-0.388
 charity-0.372
 rulers-0.351
 importantly-0.349
Tokent
Token ID83
Feature activation+1.30e+00

Pos logit contributions
 up+0.063
acked+0.060
level+0.058
ulum+0.056
erved+0.056
Neg logit contributions
 mate-0.083
 souven-0.069
 charity-0.068
 importantly-0.064
 rulers-0.063
Token organized
Token ID8389
Feature activation-2.30e+00

Pos logit contributions
 mate+0.054
 souven+0.046
 charity+0.045
 rulers+0.042
 importantly+0.042
Neg logit contributions
 up-0.046
acked-0.039
 hurts-0.038
aned-0.038
level-0.037
Token.
Token ID13
Feature activation-2.48e-01

Pos logit contributions
 up+0.072
acked+0.060
 hurts+0.060
level+0.056
aned+0.056
Neg logit contributions
 mate-0.107
 souven-0.090
 charity-0.087
 rulers-0.084
 importantly-0.084
Token
Token ID220
Feature activation-9.99e-01

Pos logit contributions
 up+0.006
acked+0.005
level+0.005
ulum+0.005
 hurts+0.005
Neg logit contributions
 mate-0.013
 souven-0.011
 charity-0.011
 rulers-0.010
 importantly-0.010
Token high
Token ID1029
Feature activation-6.10e+00

Pos logit contributions
 up+0.069
acked+0.066
level+0.064
aned+0.061
ulum+0.061
Neg logit contributions
 mate-0.086
 souven-0.072
 charity-0.071
 importantly-0.066
 rulers-0.065
Token up
Token ID510
Feature activation-4.95e+00

Pos logit contributions
acked+0.128
level+0.122
 up+0.114
aned+0.113
faced+0.108
Neg logit contributions
 mate-0.311
 souven-0.272
 charity-0.260
 importantly-0.254
 rulers-0.253
Token he
Token ID339
Feature activation-2.75e+00

Pos logit contributions
 up+0.146
acked+0.137
level+0.130
 hurts+0.126
aned+0.125
Neg logit contributions
 mate-0.223
 souven-0.189
 charity-0.187
 importantly-0.175
 rulers-0.173
Token couldn
Token ID3521
Feature activation-7.47e+00

Pos logit contributions
 up+0.092
acked+0.091
level+0.086
ulum+0.086
 hurts+0.083
Neg logit contributions
 mate-0.107
 souven-0.088
 charity-0.082
 importantly-0.080
 rulers-0.079
Tokenâ
Token ID22940
Feature activation-4.72e+00

Pos logit contributions
level+0.291
acked+0.280
erved+0.273
faced+0.272
led+0.268
Neg logit contributions
 mate-0.220
 possible-0.178
 none-0.165
 charity-0.162
 souven-0.159
Token
Token ID26391
Feature activation-1.03e+01

Pos logit contributions
 up+0.120
acked+0.110
level+0.106
 hurts+0.101
aned+0.099
Neg logit contributions
 mate-0.228
 charity-0.197
 souven-0.192
 importantly-0.179
 rulers-0.178
Token
Token ID8151
Feature activation-2.28e+00

Pos logit contributions
 up+0.346
acked+0.329
ulum+0.315
erved+0.312
level+0.308
Neg logit contributions
 mate-0.416
 souven-0.355
 charity-0.337
 importantly-0.317
 rulers-0.315
Tokent
Token ID83
Feature activation+9.74e-01

Pos logit contributions
 up+0.080
acked+0.076
level+0.074
ulum+0.071
 hurts+0.071
Neg logit contributions
 mate-0.088
 souven-0.072
 charity-0.072
 importantly-0.066
 rulers-0.065
Token touch
Token ID3638
Feature activation+1.09e+00

Pos logit contributions
 mate+0.041
 importantly+0.039
 charity+0.037
 souven+0.036
 queens+0.034
Neg logit contributions
 up-0.031
level-0.025
 hurts-0.025
erved-0.024
faced-0.024
Token it
Token ID340
Feature activation-5.78e+00

Pos logit contributions
 mate+0.053
 charity+0.045
 souven+0.045
 importantly+0.042
 rulers+0.041
Neg logit contributions
 up-0.031
acked-0.027
level-0.026
 hurts-0.025
ulum-0.025
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 up+0.185
 hurts+0.137
acked+0.131
 down+0.124
erved+0.123
Neg logit contributions
 mate-0.286
 charity-0.244
 souven-0.241
 importantly-0.230
 queens-0.223
Token was
Token ID373
Feature activation-1.20e+00

Pos logit contributions
 up+0.000
acked+0.000
level+0.000
 hurts+0.000
aned+0.000
Neg logit contributions
 mate-0.000
 souven-0.000
 charity-0.000
 importantly-0.000
 rulers-0.000
Token happy
Token ID3772
Feature activation-4.09e+00

Pos logit contributions
 up+0.040
acked+0.032
level+0.032
 hurts+0.031
faced+0.031
Neg logit contributions
 mate-0.052
 charity-0.045
 souven-0.045
ities-0.041
 rulers-0.041
Token he
Token ID339
Feature activation-1.95e+00

Pos logit contributions
 up+0.104
 hurts+0.072
uck+0.067
chair+0.066
 cries+0.065
Neg logit contributions
 mate-0.219
 charity-0.190
 rulers-0.184
 souven-0.181
 buyer-0.178
Token didn
Token ID1422
Feature activation-5.29e+00

Pos logit contributions
 up+0.061
acked+0.059
level+0.057
ulum+0.056
erved+0.055
Neg logit contributions
 mate-0.080
 souven-0.067
 charity-0.065
 importantly-0.062
 rulers-0.060
Tokenâ
Token ID22940
Feature activation-5.08e+00

Pos logit contributions
acked+0.160
level+0.157
 up+0.156
erved+0.148
ulum+0.146
Neg logit contributions
 mate-0.215
 charity-0.185
 souven-0.183
 rulers-0.168
 possible-0.168
Token
Token ID26391
Feature activation-1.02e+01

Pos logit contributions
 up+0.105
level+0.093
acked+0.092
ulum+0.086
 hurts+0.084
Neg logit contributions
 mate-0.257
 charity-0.227
 souven-0.223
 importantly-0.210
uries-0.208
Token
Token ID8151
Feature activation-2.07e+00

Pos logit contributions
 up+0.342
acked+0.322
erved+0.303
ulum+0.302
level+0.302
Neg logit contributions
 mate-0.421
 souven-0.353
 charity-0.340
 rulers-0.319
 importantly-0.318
Tokent
Token ID83
Feature activation+1.13e+00

Pos logit contributions
 up+0.071
acked+0.068
level+0.065
 hurts+0.063
ulum+0.063
Neg logit contributions
 mate-0.082
 souven-0.068
 charity-0.067
 importantly-0.062
 rulers-0.061
Token have
Token ID423
Feature activation-2.54e+00

Pos logit contributions
 mate+0.048
 souven+0.041
 charity+0.039
34+0.039
 importantly+0.038
Neg logit contributions
 up-0.037
 hurts-0.034
 rains-0.030
faced-0.029
level-0.028
Token to
Token ID284
Feature activation-4.02e-01

Pos logit contributions
 up+0.065
 hurts+0.052
level+0.048
acked+0.045
uck+0.045
Neg logit contributions
 mate-0.133
 charity-0.113
 souven-0.112
 importantly-0.106
 rulers-0.105
Token be
Token ID307
Feature activation+1.15e+00

Pos logit contributions
 up+0.014
 hurts+0.012
ulum+0.011
level+0.011
uck+0.011
Neg logit contributions
 mate-0.017
 souven-0.014
 importantly-0.014
 charity-0.014
34-0.013
Token then
Token ID788
Feature activation-3.20e+00

Pos logit contributions
 up+0.053
acked+0.046
level+0.043
 hurts+0.043
erved+0.040
Neg logit contributions
 mate-0.104
 charity-0.092
 souven-0.087
ities-0.084
 rulers-0.084
Token hopped
Token ID45230
Feature activation-2.87e+00

Pos logit contributions
 up+0.085
acked+0.073
level+0.069
 hurts+0.067
erved+0.064
Neg logit contributions
 mate-0.157
 charity-0.140
 souven-0.134
ities-0.129
 importantly-0.129
Token onto
Token ID4291
Feature activation-3.01e+00

Pos logit contributions
ulum+0.073
aned+0.069
 hurts+0.069
level+0.068
acked+0.065
Neg logit contributions
 mate-0.137
 souven-0.125
 charity-0.111
 buyer-0.105
 customer-0.103
Token Alice
Token ID14862
Feature activation-5.33e+00

Pos logit contributions
 up+0.077
acked+0.064
level+0.061
 hurts+0.059
ulum+0.057
Neg logit contributions
 mate-0.154
 charity-0.131
 souven-0.130
 importantly-0.127
 rulers-0.124
Tokenâ
Token ID22940
Feature activation-4.48e+00

Pos logit contributions
acked+0.122
ulum+0.114
level+0.114
aned+0.112
 up+0.111
Neg logit contributions
 mate-0.265
 souven-0.232
 charity-0.221
 age-0.209
 rulers-0.209
Token
Token ID26391
Feature activation-1.01e+01

Pos logit contributions
 up+0.100
acked+0.087
level+0.086
 hurts+0.083
uddle+0.079
Neg logit contributions
 mate-0.226
 charity-0.195
 souven-0.190
uries-0.184
 age-0.179
Token
Token ID8151
Feature activation-1.60e+00

Pos logit contributions
 up+0.327
acked+0.310
level+0.301
aned+0.287
 hurts+0.284
Neg logit contributions
 mate-0.423
 souven-0.348
 charity-0.346
 importantly-0.320
 age-0.315
Tokens
Token ID82
Feature activation+1.29e-01

Pos logit contributions
 up+0.050
acked+0.046
level+0.043
aned+0.042
 hurts+0.042
Neg logit contributions
 mate-0.070
 souven-0.060
 charity-0.059
 importantly-0.055
 rulers-0.054
Token shoulder
Token ID8163
Feature activation-2.37e+00

Pos logit contributions
 mate+0.006
 charity+0.005
 souven+0.005
 importantly+0.005
 rulers+0.005
Neg logit contributions
 up-0.003
acked-0.003
 hurts-0.003
level-0.003
erved-0.003
Token.
Token ID13
Feature activation-5.52e-01

Pos logit contributions
 up+0.075
acked+0.065
level+0.060
 hurts+0.060
erved+0.059
Neg logit contributions
 mate-0.105
 charity-0.090
 souven-0.088
 importantly-0.085
 rulers-0.083
Token They
Token ID1119
Feature activation+5.44e-01

Pos logit contributions
 up+0.013
acked+0.012
level+0.011
 hurts+0.011
ulum+0.010
Neg logit contributions
 mate-0.029
 souven-0.025
 charity-0.024
 rulers-0.023
 importantly-0.023
Token They
Token ID1119
Feature activation+9.19e-01

Pos logit contributions
 up+0.026
acked+0.025
level+0.024
 hurts+0.023
aned+0.023
Neg logit contributions
 mate-0.035
 souven-0.030
 charity-0.029
 importantly-0.027
 rulers-0.027
Token said
Token ID531
Feature activation-1.77e-01

Pos logit contributions
 mate+0.044
 charity+0.038
 souven+0.038
 importantly+0.035
 rulers+0.035
Neg logit contributions
 up-0.026
acked-0.023
level-0.022
 hurts-0.022
aned-0.021
Token it
Token ID340
Feature activation-3.27e+00

Pos logit contributions
 up+0.006
acked+0.005
level+0.005
 hurts+0.005
erved+0.005
Neg logit contributions
 mate-0.007
 souven-0.006
 charity-0.006
 importantly-0.006
 rulers-0.006
Token wasn
Token ID2492
Feature activation-2.22e+00

Pos logit contributions
 up+0.061
acked+0.058
level+0.054
ulum+0.050
aned+0.049
Neg logit contributions
 mate-0.178
 souven-0.156
 charity-0.154
 importantly-0.146
 rulers-0.146
Tokenâ
Token ID22940
Feature activation-5.12e+00

Pos logit contributions
 up+0.058
acked+0.054
level+0.051
 hurts+0.050
aned+0.049
Neg logit contributions
 mate-0.107
 souven-0.092
 charity-0.091
 importantly-0.086
 rulers-0.085
Token
Token ID26391
Feature activation-1.01e+01

Pos logit contributions
 up+0.112
acked+0.100
level+0.098
 hurts+0.091
aned+0.090
Neg logit contributions
 mate-0.268
 charity-0.231
 souven-0.231
 importantly-0.217
 rulers-0.213
Token
Token ID8151
Feature activation-2.30e+00

Pos logit contributions
 up+0.296
acked+0.284
erved+0.279
ulum+0.269
chair+0.267
Neg logit contributions
 mate-0.426
 souven-0.376
 charity-0.351
 rulers-0.345
 importantly-0.340
Tokent
Token ID83
Feature activation+8.06e-01

Pos logit contributions
 up+0.074
acked+0.070
level+0.065
 hurts+0.064
aned+0.064
Neg logit contributions
 mate-0.098
 souven-0.083
 charity-0.082
 importantly-0.076
 rulers-0.076
Token enough
Token ID1576
Feature activation-4.02e-01

Pos logit contributions
 mate+0.033
 charity+0.029
 souven+0.028
 importantly+0.027
 rulers+0.026
Neg logit contributions
 up-0.029
 hurts-0.023
acked-0.023
level-0.022
erved-0.022
Token.
Token ID13
Feature activation-7.31e-01

Pos logit contributions
 up+0.010
 hurts+0.008
 cries+0.007
erved+0.007
 hits+0.006
Neg logit contributions
 mate-0.021
 charity-0.018
 rulers-0.017
 queens-0.017
 souven-0.017
Token
Token ID198
Feature activation+2.46e+00

Pos logit contributions
 up+0.021
acked+0.020
level+0.019
 hurts+0.019
aned+0.018
Neg logit contributions
 mate-0.033
 souven-0.028
 charity-0.028
 importantly-0.026
 rulers-0.025
Token river
Token ID7850
Feature activation-2.15e+00

Pos logit contributions
aned+0.233
acked+0.220
level+0.216
umbled+0.209
irled+0.203
Neg logit contributions
 mate-0.273
 charity-0.230
 souven-0.224
 ones-0.210
 rulers-0.205
Token but
Token ID475
Feature activation+9.01e-01

Pos logit contributions
 up+0.072
acked+0.070
level+0.068
aned+0.065
ulum+0.065
Neg logit contributions
 mate-0.086
 souven-0.070
 charity-0.069
 rulers-0.064
 importantly-0.063
Token Lily
Token ID20037
Feature activation-2.76e+00

Pos logit contributions
 mate+0.037
 souven+0.032
 charity+0.032
 importantly+0.029
 rulers+0.029
Neg logit contributions
 up-0.030
acked-0.027
level-0.026
 hurts-0.026
ulum-0.025
Token didn
Token ID1422
Feature activation-6.04e+00

Pos logit contributions
acked+0.092
level+0.088
 up+0.086
ulum+0.081
atches+0.081
Neg logit contributions
 mate-0.095
 age-0.085
 remember-0.081
 than-0.077
ities-0.073
Tokenâ
Token ID22940
Feature activation-4.77e+00

Pos logit contributions
acked+0.248
level+0.243
led+0.230
awed+0.228
erved+0.228
Neg logit contributions
 mate-0.177
 possible-0.150
 charity-0.138
 souven-0.135
 anyone-0.127
Token
Token ID26391
Feature activation-1.01e+01

Pos logit contributions
 up+0.115
level+0.097
acked+0.094
uddle+0.089
ulum+0.087
Neg logit contributions
 mate-0.221
 charity-0.200
uries-0.187
 age-0.186
 souven-0.186
Token
Token ID8151
Feature activation-1.76e+00

Pos logit contributions
 up+0.335
acked+0.317
level+0.305
 hurts+0.295
aned+0.293
Neg logit contributions
 mate-0.414
 souven-0.344
 charity-0.340
 importantly-0.317
 rulers-0.312
Tokent
Token ID83
Feature activation+7.82e-01

Pos logit contributions
 up+0.064
acked+0.061
level+0.060
ulum+0.058
erved+0.058
Neg logit contributions
 mate-0.064
 souven-0.052
 charity-0.052
 importantly-0.048
 rulers-0.047
Token want
Token ID765
Feature activation+8.32e-01

Pos logit contributions
 mate+0.028
 souven+0.023
 charity+0.023
 rulers+0.021
 importantly+0.021
Neg logit contributions
 up-0.031
 hurts-0.028
acked-0.028
aned-0.027
level-0.027
Token to
Token ID284
Feature activation-1.06e+00

Pos logit contributions
 mate+0.042
 souven+0.036
 charity+0.035
 importantly+0.034
 rulers+0.033
Neg logit contributions
 up-0.021
acked-0.019
level-0.018
 hurts-0.018
erved-0.017
Token.
Token ID13
Feature activation-5.75e-01

Pos logit contributions
 up+0.036
 hurts+0.031
acked+0.030
level+0.030
ulum+0.030
Neg logit contributions
 mate-0.044
 souven-0.038
 charity-0.036
 importantly-0.036
 rulers-0.034
Token relieved
Token ID22598
Feature activation-4.05e+00

Pos logit contributions
 up+0.055
acked+0.049
 hurts+0.047
level+0.047
faced+0.045
Neg logit contributions
 mate-0.068
 souven-0.056
 charity-0.056
 rulers-0.052
ities-0.050
Token that
Token ID326
Feature activation+6.04e-02

Pos logit contributions
 up+0.109
 hurts+0.081
acked+0.081
uck+0.078
erved+0.075
Neg logit contributions
 mate-0.205
 charity-0.180
 souven-0.177
 rulers-0.167
 buyer-0.166
Token she
Token ID673
Feature activation-1.35e+00

Pos logit contributions
 mate+0.003
 souven+0.002
 rulers+0.002
 charity+0.002
ities+0.002
Neg logit contributions
 up-0.002
 hurts-0.002
 rains-0.001
 down-0.001
chair-0.001
Token didn
Token ID1422
Feature activation-5.61e+00

Pos logit contributions
ulum+0.042
acked+0.042
 up+0.041
level+0.041
aned+0.039
Neg logit contributions
 mate-0.053
 souven-0.043
 importantly-0.041
 charity-0.041
 age-0.040
Tokenâ
Token ID22940
Feature activation-4.96e+00

Pos logit contributions
acked+0.194
level+0.190
 up+0.182
erved+0.180
ulum+0.179
Neg logit contributions
 mate-0.204
 charity-0.171
 souven-0.169
 possible-0.159
 rulers-0.154
Token
Token ID26391
Feature activation-1.01e+01

Pos logit contributions
 up+0.111
level+0.097
acked+0.094
ulum+0.093
uddle+0.089
Neg logit contributions
 mate-0.234
 charity-0.209
 souven-0.204
uries-0.192
ities-0.192
Token
Token ID8151
Feature activation-2.06e+00

Pos logit contributions
 up+0.356
acked+0.337
level+0.322
 hurts+0.317
erved+0.314
Neg logit contributions
 mate-0.396
 souven-0.328
 charity-0.320
 importantly-0.297
 rulers-0.295
Tokent
Token ID83
Feature activation+1.05e+00

Pos logit contributions
 up+0.074
acked+0.071
level+0.070
ulum+0.069
erved+0.068
Neg logit contributions
 mate-0.076
 souven-0.062
 charity-0.061
 importantly-0.056
 rulers-0.055
Token have
Token ID423
Feature activation-2.63e+00

Pos logit contributions
 mate+0.044
 souven+0.038
 charity+0.038
 importantly+0.037
 rulers+0.036
Neg logit contributions
 up-0.034
 hurts-0.033
 rains-0.029
bo-0.028
acked-0.027
Token to
Token ID284
Feature activation-6.73e-01

Pos logit contributions
 up+0.061
 hurts+0.053
level+0.044
erved+0.043
 cries+0.042
Neg logit contributions
 mate-0.141
 charity-0.125
 souven-0.121
 importantly-0.118
 rulers-0.117
Token be
Token ID307
Feature activation+9.71e-01

Pos logit contributions
 up+0.021
 hurts+0.018
 cries+0.016
erved+0.016
forth+0.015
Neg logit contributions
 mate-0.030
 importantly-0.026
 souven-0.026
 queens-0.025
 charity-0.025
Token woke
Token ID19092
Feature activation-3.29e+00

Pos logit contributions
 up+0.005
acked+0.004
level+0.004
aned+0.004
ulum+0.004
Neg logit contributions
 mate-0.008
 souven-0.007
 charity-0.007
 importantly-0.006
 rulers-0.006
Token up
Token ID510
Feature activation-4.77e+00

Pos logit contributions
acked+0.029
level+0.022
aned+0.016
 hurts+0.014
faced+0.013
Neg logit contributions
 mate-0.206
 souven-0.189
 charity-0.185
 possible-0.181
 importantly-0.179
Token it
Token ID340
Feature activation-5.85e+00

Pos logit contributions
 up+0.144
acked+0.125
level+0.119
 hurts+0.117
erved+0.113
Neg logit contributions
 mate-0.221
 charity-0.186
 souven-0.183
 importantly-0.173
 rulers-0.172
Token wasn
Token ID2492
Feature activation-2.09e+00

Pos logit contributions
 up+0.184
acked+0.171
level+0.163
 hurts+0.159
aned+0.157
Neg logit contributions
 mate-0.253
 souven-0.213
 charity-0.211
 importantly-0.197
 rulers-0.194
Tokenâ
Token ID22940
Feature activation-5.15e+00

Pos logit contributions
acked+0.066
 up+0.065
level+0.064
ulum+0.062
erved+0.061
Neg logit contributions
 mate-0.086
 souven-0.070
 charity-0.069
 possible-0.066
 rulers-0.063
Token
Token ID26391
Feature activation-1.01e+01

Pos logit contributions
 up+0.104
acked+0.092
level+0.090
 hurts+0.083
aned+0.082
Neg logit contributions
 mate-0.274
 charity-0.240
 souven-0.238
 importantly-0.224
ities-0.219
Token
Token ID8151
Feature activation-2.15e+00

Pos logit contributions
 up+0.338
acked+0.322
level+0.305
 hurts+0.299
ulum+0.299
Neg logit contributions
 mate-0.408
 souven-0.343
 charity-0.336
 importantly-0.312
 rulers-0.310
Tokent
Token ID83
Feature activation+1.21e+00

Pos logit contributions
 up+0.073
acked+0.069
level+0.067
 hurts+0.065
aned+0.064
Neg logit contributions
 mate-0.086
 souven-0.071
 charity-0.070
 importantly-0.065
 rulers-0.065
Token Tuesday
Token ID3431
Feature activation-4.41e+00

Pos logit contributions
 mate+0.053
 charity+0.046
 souven+0.045
 importantly+0.042
 rulers+0.042
Neg logit contributions
 up-0.038
acked-0.033
 hurts-0.032
level-0.032
aned-0.031
Token anymore
Token ID7471
Feature activation-5.41e-01

Pos logit contributions
 up+0.131
acked+0.104
 hurts+0.097
uck+0.096
hed+0.095
Neg logit contributions
 mate-0.217
 souven-0.184
 charity-0.184
 rulers-0.176
 importantly-0.173
Token!
Token ID0
Feature activation-3.58e-01

Pos logit contributions
 up+0.011
chair+0.007
 hurts+0.006
ned+0.006
 cries+0.006
Neg logit contributions
 mate-0.030
 charity-0.026
 souven-0.026
 buyer-0.026
 rulers-0.025
Token big
Token ID1263
Feature activation-4.62e+00

Pos logit contributions
 up+0.137
acked+0.130
level+0.125
 hurts+0.121
aned+0.120
Neg logit contributions
 mate-0.153
 souven-0.126
 charity-0.124
 importantly-0.115
 rulers-0.114
Token that
Token ID326
Feature activation-1.04e+00

Pos logit contributions
 up+0.126
acked+0.111
 hurts+0.106
ulum+0.104
level+0.104
Neg logit contributions
 mate-0.221
 charity-0.188
 souven-0.188
 rulers-0.175
 importantly-0.175
Token it
Token ID340
Feature activation-5.18e+00

Pos logit contributions
 up+0.033
 hurts+0.028
chair+0.025
acked+0.025
ulum+0.025
Neg logit contributions
 mate-0.046
 charity-0.039
 souven-0.038
ities-0.038
34-0.037
Token didn
Token ID1422
Feature activation-6.47e+00

Pos logit contributions
 up+0.146
acked+0.145
level+0.139
ulum+0.134
aned+0.131
Neg logit contributions
 mate-0.229
 souven-0.193
 charity-0.185
 rulers-0.177
 importantly-0.177
Tokenâ
Token ID22940
Feature activation-4.76e+00

Pos logit contributions
acked+0.225
level+0.217
 up+0.208
ulum+0.204
erved+0.203
Neg logit contributions
 mate-0.244
 souven-0.203
 charity-0.200
 possible-0.191
 rulers-0.180
Token
Token ID26391
Feature activation-1.00e+01

Pos logit contributions
 up+0.128
uddle+0.112
level+0.112
acked+0.107
 hurts+0.103
Neg logit contributions
 mate-0.206
 charity-0.180
 souven-0.171
uries-0.169
ities-0.166
Token
Token ID8151
Feature activation-1.57e+00

Pos logit contributions
 up+0.338
acked+0.317
level+0.310
aned+0.294
 hurts+0.293
Neg logit contributions
 mate-0.405
 charity-0.332
 souven-0.331
 importantly-0.305
 age-0.301
Tokent
Token ID83
Feature activation+7.40e-01

Pos logit contributions
 up+0.061
level+0.057
acked+0.057
ulum+0.056
erved+0.056
Neg logit contributions
 mate-0.052
 souven-0.042
 charity-0.042
 importantly-0.038
 age-0.038
Token fit
Token ID4197
Feature activation-1.02e+00

Pos logit contributions
 mate+0.026
 souven+0.022
 charity+0.021
 importantly+0.020
 rulers+0.020
Neg logit contributions
 up-0.029
acked-0.026
 hurts-0.026
aned-0.026
level-0.026
Token in
Token ID287
Feature activation+4.38e-01

Pos logit contributions
 up+0.031
acked+0.027
 hurts+0.026
level+0.025
aned+0.025
Neg logit contributions
 mate-0.047
 souven-0.040
 charity-0.039
 importantly-0.037
 rulers-0.037
Token the
Token ID262
Feature activation-1.76e+00

Pos logit contributions
 mate+0.022
 souven+0.018
 rulers+0.018
 buyer+0.018
 charity+0.018
Neg logit contributions
 up-0.011
 hurts-0.009
acked-0.008
ulum-0.008
 cries-0.008
Token so
Token ID523
Feature activation-1.93e+00

Pos logit contributions
 up+0.075
acked+0.064
 hurts+0.061
level+0.060
faced+0.058
Neg logit contributions
 mate-0.099
 charity-0.080
 souven-0.077
 rulers-0.074
 age-0.074
Token good
Token ID922
Feature activation-1.70e+00

Pos logit contributions
 up+0.075
acked+0.068
 hurts+0.065
level+0.063
uck+0.062
Neg logit contributions
 mate-0.076
 charity-0.062
 souven-0.058
ities-0.056
uries-0.055
Token he
Token ID339
Feature activation-1.95e+00

Pos logit contributions
 up+0.049
acked+0.041
 hurts+0.039
erved+0.036
level+0.036
Neg logit contributions
 mate-0.085
 charity-0.074
 souven-0.070
 rulers-0.066
 importantly-0.065
Token couldn
Token ID3521
Feature activation-7.25e+00

Pos logit contributions
 up+0.066
acked+0.065
ulum+0.064
level+0.063
erved+0.062
Neg logit contributions
 mate-0.072
 souven-0.060
 importantly-0.056
 charity-0.055
 rulers-0.053
Tokenâ
Token ID22940
Feature activation-4.51e+00

Pos logit contributions
level+0.251
erved+0.244
 up+0.241
acked+0.239
ulum+0.238
Neg logit contributions
 mate-0.245
 charity-0.199
 souven-0.195
 possible-0.193
 none-0.186
Token
Token ID26391
Feature activation-9.98e+00

Pos logit contributions
 up+0.154
acked+0.143
level+0.140
 hurts+0.135
aned+0.134
Neg logit contributions
 mate-0.178
 charity-0.148
 souven-0.144
ities-0.130
 rulers-0.130
Token
Token ID8151
Feature activation-1.94e+00

Pos logit contributions
 up+0.347
acked+0.331
level+0.315
erved+0.309
ulum+0.308
Neg logit contributions
 mate-0.395
 souven-0.329
 charity-0.320
 importantly-0.299
 rulers-0.295
Tokent
Token ID83
Feature activation+1.32e+00

Pos logit contributions
 up+0.068
acked+0.065
level+0.064
ulum+0.062
erved+0.062
Neg logit contributions
 mate-0.074
 souven-0.060
 charity-0.060
 importantly-0.055
 rulers-0.054
Token resist
Token ID4180
Feature activation-1.89e+00

Pos logit contributions
 mate+0.056
34+0.050
 importantly+0.050
ities+0.048
 charity+0.047
Neg logit contributions
 up-0.034
faced-0.031
 hurts-0.030
 cries-0.028
level-0.028
Token.
Token ID13
Feature activation-9.81e-01

Pos logit contributions
 up+0.060
 hurts+0.046
acked+0.045
level+0.045
ulum+0.043
Neg logit contributions
 mate-0.090
 charity-0.073
 souven-0.072
 importantly-0.069
 rulers-0.069
Token So
Token ID1406
Feature activation+2.05e+00

Pos logit contributions
 up+0.023
acked+0.022
level+0.021
 hurts+0.020
aned+0.019
Neg logit contributions
 mate-0.050
 souven-0.043
 charity-0.043
 importantly-0.040
 rulers-0.040
Token and
Token ID290
Feature activation-2.17e+00

Pos logit contributions
 up+0.055
acked+0.045
 hurts+0.044
erved+0.042
level+0.042
Neg logit contributions
 mate-0.102
 charity-0.089
 souven-0.088
 importantly-0.082
 rulers-0.081
Token thought
Token ID1807
Feature activation-8.33e-01

Pos logit contributions
 up+0.075
acked+0.065
 hurts+0.065
level+0.063
erved+0.062
Neg logit contributions
 mate-0.091
 charity-0.076
 souven-0.075
 rulers-0.070
ities-0.070
Token he
Token ID339
Feature activation-7.96e-01

Pos logit contributions
 up+0.024
 hurts+0.019
chair+0.019
acked+0.018
ulum+0.018
Neg logit contributions
 mate-0.039
 souven-0.034
 charity-0.033
 buyer-0.033
 rulers-0.032
Token didn
Token ID1422
Feature activation-5.12e+00

Pos logit contributions
 up+0.021
acked+0.019
level+0.018
 hurts+0.017
erved+0.017
Neg logit contributions
 mate-0.039
 souven-0.034
 charity-0.034
 importantly-0.032
 rulers-0.031
Tokenâ
Token ID22940
Feature activation-4.82e+00

Pos logit contributions
acked+0.154
 up+0.151
level+0.150
erved+0.141
ulum+0.139
Neg logit contributions
 mate-0.213
 charity-0.181
 souven-0.179
 possible-0.166
 rulers-0.166
Token
Token ID26391
Feature activation-9.98e+00

Pos logit contributions
 up+0.104
level+0.090
acked+0.086
ulum+0.083
aned+0.082
Neg logit contributions
 mate-0.235
 charity-0.209
 souven-0.202
uries-0.194
ities-0.192
Token
Token ID8151
Feature activation-1.92e+00

Pos logit contributions
 up+0.365
acked+0.345
erved+0.326
ulum+0.326
level+0.325
Neg logit contributions
 mate-0.381
 souven-0.317
 charity-0.304
 importantly-0.283
 rulers-0.281
Tokent
Token ID83
Feature activation+1.17e+00

Pos logit contributions
 up+0.064
acked+0.061
level+0.058
 hurts+0.056
aned+0.056
Neg logit contributions
 mate-0.078
 souven-0.065
 charity-0.064
 importantly-0.060
 rulers-0.059
Token need
Token ID761
Feature activation+1.68e+00

Pos logit contributions
 mate+0.052
 souven+0.047
 importantly+0.046
 rulers+0.044
 charity+0.044
Neg logit contributions
 up-0.034
 hurts-0.032
 rains-0.028
faced-0.026
acked-0.026
Token to
Token ID284
Feature activation-1.56e-01

Pos logit contributions
 mate+0.091
 charity+0.077
 buyer+0.076
 importantly+0.076
 souven+0.076
Neg logit contributions
 up-0.038
 hurts-0.030
 cries-0.025
level-0.025
erved-0.023
Token include
Token ID2291
Feature activation+3.83e+00

Pos logit contributions
 up+0.005
 hurts+0.004
 cries+0.004
ulum+0.004
forth+0.003
Neg logit contributions
 mate-0.007
 importantly-0.006
 souven-0.006
 queens-0.006
34-0.006
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
 up+0.066
acked+0.056
 hurts+0.053
level+0.052
aned+0.051
Neg logit contributions
 mate-0.107
 souven-0.090
 charity-0.090
 importantly-0.086
 rulers-0.084
Token but
Token ID475
Feature activation+4.53e-01

Pos logit contributions
 up+0.063
acked+0.058
level+0.056
 hurts+0.055
ulum+0.054
Neg logit contributions
 mate-0.066
 souven-0.054
 charity-0.053
 importantly-0.049
 rulers-0.049
Token he
Token ID339
Feature activation-1.25e+00

Pos logit contributions
 mate+0.018
 charity+0.016
 souven+0.016
 importantly+0.015
ities+0.015
Neg logit contributions
 up-0.016
 hurts-0.013
level-0.013
forth-0.012
acked-0.012
Token couldn
Token ID3521
Feature activation-7.91e+00

Pos logit contributions
 up+0.033
acked+0.031
level+0.030
ulum+0.029
aned+0.028
Neg logit contributions
 mate-0.060
 souven-0.051
 charity-0.050
 importantly-0.047
 rulers-0.047
Tokenâ
Token ID22940
Feature activation-4.33e+00

Pos logit contributions
acked+0.313
level+0.310
erved+0.293
faced+0.290
ulum+0.290
Neg logit contributions
 mate-0.252
 possible-0.190
 charity-0.186
 none-0.183
 souven-0.181
Token
Token ID26391
Feature activation-9.97e+00

Pos logit contributions
 up+0.111
acked+0.100
level+0.097
 hurts+0.093
uddle+0.093
Neg logit contributions
 mate-0.205
 charity-0.179
 souven-0.169
uries-0.164
 possible-0.162
Token
Token ID8151
Feature activation-1.80e+00

Pos logit contributions
 up+0.300
acked+0.282
level+0.269
 hurts+0.260
aned+0.259
Neg logit contributions
 mate-0.441
 souven-0.374
 charity-0.368
 importantly-0.345
 rulers-0.341
Tokent
Token ID83
Feature activation+1.43e+00

Pos logit contributions
 up+0.064
acked+0.061
level+0.060
ulum+0.058
 hurts+0.058
Neg logit contributions
 mate-0.068
 souven-0.055
 charity-0.055
 importantly-0.051
 rulers-0.049
Token.
Token ID13
Feature activation-4.52e-01

Pos logit contributions
 mate+0.059
 souven+0.052
 charity+0.051
 importantly+0.050
 rulers+0.047
Neg logit contributions
 up-0.047
 hurts-0.040
aned-0.039
acked-0.039
chair-0.038
Token The
Token ID383
Feature activation+7.17e-01

Pos logit contributions
 up+0.010
acked+0.009
level+0.008
ulum+0.008
 hurts+0.008
Neg logit contributions
 mate-0.024
 souven-0.021
 charity-0.020
 rulers-0.019
 importantly-0.019
Token dog
Token ID3290
Feature activation-5.22e-01

Pos logit contributions
 mate+0.031
 souven+0.026
 charity+0.026
 importantly+0.024
 rulers+0.024
Neg logit contributions
 up-0.022
acked-0.020
level-0.020
 hurts-0.019
ulum-0.019
Token lots
Token ID6041
Feature activation+5.81e+00

Pos logit contributions
 mate+0.030
 charity+0.025
 souven+0.025
 importantly+0.023
 rulers+0.023
Neg logit contributions
 up-0.022
acked-0.020
level-0.019
faced-0.018
 hurts-0.018
Token of
Token ID286
Feature activation+3.45e+00

Pos logit contributions
 mate+0.263
 charity+0.222
 souven+0.211
ities+0.206
uries+0.202
Neg logit contributions
 up-0.184
acked-0.163
level-0.148
chair-0.145
 hurts-0.142
Token other
Token ID584
Feature activation+1.97e+00

Pos logit contributions
 mate+0.138
 charity+0.111
 souven+0.107
ities+0.103
uries+0.103
Neg logit contributions
 up-0.125
acked-0.108
level-0.106
 hurts-0.102
ulum-0.099
Token boats
Token ID16659
Feature activation+2.94e+00

Pos logit contributions
 mate+0.079
 charity+0.073
uries+0.070
ities+0.069
 customer+0.066
Neg logit contributions
 up-0.069
am-0.050
acked-0.048
 rains-0.048
 air-0.047
Token.
Token ID13
Feature activation+2.89e-01

Pos logit contributions
 mate+0.132
 charity+0.113
 souven+0.113
 importantly+0.105
 rulers+0.104
Neg logit contributions
 up-0.091
acked-0.079
level-0.075
 hurts-0.071
ulum-0.071
Token Some
Token ID2773
Feature activation+8.34e+00

Pos logit contributions
 mate+0.016
 souven+0.014
 charity+0.014
 importantly+0.013
 rulers+0.013
Neg logit contributions
 up-0.006
acked-0.005
level-0.005
 hurts-0.005
aned-0.005
Token were
Token ID547
Feature activation+2.87e+00

Pos logit contributions
 mate+0.367
 charity+0.336
ities+0.310
uries+0.308
 souven+0.303
Neg logit contributions
 up-0.240
acked-0.217
led-0.212
level-0.199
am-0.196
Token bigger
Token ID5749
Feature activation-3.27e-01

Pos logit contributions
 mate+0.123
 charity+0.115
ities+0.108
 souven+0.107
 importantly+0.104
Neg logit contributions
 up-0.093
acked-0.069
level-0.066
faced-0.061
erved-0.060
Token,
Token ID11
Feature activation+2.99e+00

Pos logit contributions
 up+0.008
 hurts+0.005
am+0.005
acked+0.005
 hits+0.005
Neg logit contributions
 mate-0.017
 charity-0.016
 customer-0.015
uries-0.014
 souven-0.014
Token some
Token ID617
Feature activation+7.15e+00

Pos logit contributions
 mate+0.143
 charity+0.122
 souven+0.118
uries+0.115
 buyer+0.114
Neg logit contributions
 up-0.091
 rains-0.061
ulum-0.059
 hurts-0.059
acked-0.059
Token were
Token ID547
Feature activation+2.31e+00

Pos logit contributions
 mate+0.328
 charity+0.300
uries+0.291
ities+0.284
34+0.273
Neg logit contributions
 up-0.185
led-0.163
acked-0.156
 was-0.149
 started-0.149
Token some
Token ID617
Feature activation+5.80e+00

Pos logit contributions
 up+0.012
level+0.007
acked+0.007
 hurts+0.007
covered+0.007
Neg logit contributions
 mate-0.020
 charity-0.018
 cents-0.017
 souven-0.017
uries-0.016
Token of
Token ID286
Feature activation+3.09e+00

Pos logit contributions
 mate+0.254
 charity+0.204
uries+0.195
ities+0.192
34+0.182
Neg logit contributions
 up-0.207
level-0.166
 hurts-0.162
acked-0.162
 rains-0.151
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
 mate+0.180
uries+0.159
 charity+0.156
ities+0.155
 cents+0.153
Neg logit contributions
 up-0.059
level-0.032
<|endoftext|>-0.029
ulously-0.028
>-0.026
Token most
Token ID749
Feature activation+6.46e+00

Pos logit contributions
 up+0.063
acked+0.049
level+0.049
 hurts+0.047
>+0.047
Neg logit contributions
 mate-0.080
 charity-0.067
ities-0.066
 souven-0.063
 importantly-0.063
Token beautiful
Token ID4950
Feature activation+6.12e-01

Pos logit contributions
 mate+0.270
 charity+0.222
 souven+0.201
ities+0.200
uries+0.197
Neg logit contributions
 up-0.237
acked-0.202
level-0.195
 hurts-0.189
faced-0.183
Token things
Token ID1243
Feature activation+6.86e+00

Pos logit contributions
 mate+0.026
 charity+0.022
 souven+0.021
 rulers+0.020
 importantly+0.020
Neg logit contributions
 up-0.020
acked-0.019
level-0.017
 hurts-0.017
ulum-0.017
Token in
Token ID287
Feature activation+1.32e+00

Pos logit contributions
 mate+0.356
 charity+0.303
ities+0.286
 customer+0.285
 souven+0.281
Neg logit contributions
 up-0.213
level-0.144
 down-0.138
 hurts-0.136
acked-0.135
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
 mate+0.079
uries+0.073
ities+0.072
 charity+0.072
 cents+0.072
Neg logit contributions
 up-0.023
level-0.010
>-0.010
acked-0.009
 down-0.009
Token world
Token ID995
Feature activation+2.46e+00

Pos logit contributions
 up+0.040
acked+0.034
level+0.032
ulum+0.030
 hurts+0.030
Neg logit contributions
 mate-0.067
 souven-0.057
 charity-0.056
ities-0.055
 importantly-0.054
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
 mate+0.120
 charity+0.101
 souven+0.101
 rulers+0.096
ities+0.096
Neg logit contributions
 up-0.073
acked-0.061
level-0.056
hed-0.055
chair-0.055
Token<|endoftext|>
Token ID50256
Feature activation-6.29e+00

Pos logit contributions
 mate+0.001
 souven+0.001
 charity+0.001
 rulers+0.001
ities+0.001
Neg logit contributions
 up-0.000
acked-0.000
level-0.000
uck-0.000
forth-0.000
Token crazy
Token ID7165
Feature activation+2.48e+00

Pos logit contributions
 mate+0.136
 charity+0.113
 souven+0.113
 importantly+0.105
 rulers+0.104
Neg logit contributions
 up-0.099
acked-0.088
level-0.086
 hurts-0.085
ulum-0.082
Token rainbow
Token ID27223
Feature activation+4.13e+00

Pos logit contributions
 mate+0.106
 souven+0.089
 charity+0.089
 importantly+0.083
 rulers+0.081
Neg logit contributions
 up-0.078
acked-0.072
level-0.070
 hurts-0.068
erved-0.067
Token with
Token ID351
Feature activation+6.70e+00

Pos logit contributions
 mate+0.184
 charity+0.157
 souven+0.156
 importantly+0.146
 rulers+0.142
Neg logit contributions
 up-0.125
acked-0.114
level-0.108
 hurts-0.105
erved-0.104
Token him
Token ID683
Feature activation+1.77e+00

Pos logit contributions
 mate+0.319
 charity+0.284
 souven+0.263
 buyer+0.257
ities+0.257
Neg logit contributions
 up-0.193
acked-0.164
 hurts-0.149
level-0.148
hed-0.140
Token.
Token ID13
Feature activation+7.16e-01

Pos logit contributions
 mate+0.081
 charity+0.069
 souven+0.067
 importantly+0.063
 rulers+0.062
Neg logit contributions
 up-0.055
acked-0.046
 hurts-0.045
level-0.045
erved-0.042
Token Everyone
Token ID11075
Feature activation+6.96e+00

Pos logit contributions
 mate+0.039
 souven+0.034
 charity+0.033
 rulers+0.032
 importantly+0.032
Neg logit contributions
 up-0.014
acked-0.014
ulum-0.011
level-0.011
forth-0.011
Token that
Token ID326
Feature activation+2.47e+00

Pos logit contributions
 mate+0.332
 souven+0.286
 charity+0.273
 importantly+0.270
 rulers+0.263
Neg logit contributions
 up-0.174
acked-0.166
ulum-0.162
level-0.161
erved-0.160
Token saw
Token ID2497
Feature activation+1.69e+00

Pos logit contributions
 mate+0.108
 souven+0.092
 charity+0.092
 importantly+0.085
 rulers+0.084
Neg logit contributions
 up-0.078
acked-0.073
level-0.069
 hurts-0.067
ulum-0.064
Token it
Token ID340
Feature activation-3.27e+00

Pos logit contributions
 mate+0.081
 charity+0.069
 souven+0.069
 importantly+0.063
ities+0.063
Neg logit contributions
 up-0.052
acked-0.042
level-0.040
 hurts-0.038
hed-0.037
Token couldn
Token ID3521
Feature activation-6.07e+00

Pos logit contributions
 up+0.078
acked+0.068
level+0.064
 hurts+0.063
aned+0.060
Neg logit contributions
 mate-0.168
 souven-0.146
 charity-0.146
 importantly-0.137
 rulers-0.135
Token't
Token ID470
Feature activation+2.01e+00

Pos logit contributions
level+0.203
acked+0.202
 up+0.195
erved+0.190
faced+0.190
Neg logit contributions
 mate-0.223
 charity-0.182
 souven-0.179
 possible-0.173
 rulers-0.169
Token,
Token ID11
Feature activation+2.08e+00

Pos logit contributions
 up+0.011
acked+0.009
erved+0.008
 hurts+0.008
hed+0.008
Neg logit contributions
 mate-0.018
 souven-0.016
 charity-0.016
 importantly-0.015
 cents-0.015
Token and
Token ID290
Feature activation-9.80e-01

Pos logit contributions
 mate+0.094
 charity+0.080
 souven+0.079
 importantly+0.076
ities+0.076
Neg logit contributions
 up-0.066
acked-0.051
 hurts-0.051
erved-0.051
level-0.050
Token Max
Token ID5436
Feature activation-1.75e+00

Pos logit contributions
 up+0.030
 hurts+0.023
level+0.023
forth+0.023
acked+0.023
Neg logit contributions
 mate-0.044
 charity-0.038
 souven-0.037
ities-0.036
34-0.036
Token showed
Token ID3751
Feature activation+4.91e+00

Pos logit contributions
 up+0.045
acked+0.041
level+0.038
 hurts+0.037
aned+0.036
Neg logit contributions
 mate-0.086
 souven-0.074
 charity-0.074
 importantly-0.070
 rulers-0.069
Token everyone
Token ID2506
Feature activation+4.81e+00

Pos logit contributions
 mate+0.215
 souven+0.182
 charity+0.180
 importantly+0.169
 rulers+0.169
Neg logit contributions
 up-0.161
acked-0.141
level-0.134
 hurts-0.133
erved-0.131
Token how
Token ID703
Feature activation+5.90e+00

Pos logit contributions
 mate+0.201
 souven+0.169
 charity+0.166
 importantly+0.155
 rulers+0.153
Neg logit contributions
 up-0.154
acked-0.148
level-0.142
aned-0.138
 hurts-0.137
Token clever
Token ID14169
Feature activation+1.54e+00

Pos logit contributions
 mate+0.211
 souven+0.173
 charity+0.165
 importantly+0.161
 possible+0.154
Neg logit contributions
acked-0.215
 up-0.212
aned-0.211
ulum-0.208
level-0.203
Token he
Token ID339
Feature activation-5.74e-02

Pos logit contributions
 mate+0.064
 souven+0.055
 charity+0.053
 rulers+0.051
uries+0.050
Neg logit contributions
 up-0.053
 hurts-0.046
acked-0.045
chair-0.044
ulum-0.042
Token was
Token ID373
Feature activation-1.11e+00

Pos logit contributions
 up+0.001
acked+0.001
level+0.001
 hurts+0.001
erved+0.001
Neg logit contributions
 mate-0.003
 souven-0.003
 charity-0.003
 importantly-0.002
 rulers-0.002
Token with
Token ID351
Feature activation+5.63e+00

Pos logit contributions
 up+0.034
acked+0.028
 hurts+0.027
level+0.026
erved+0.026
Neg logit contributions
 mate-0.051
 souven-0.044
 charity-0.043
 importantly-0.040
 rulers-0.040
Token his
Token ID465
Feature activation+1.75e+00

Pos logit contributions
 mate+0.284
 souven+0.243
 charity+0.239
 rulers+0.231
 buyer+0.231
Neg logit contributions
 up-0.156
 hurts-0.117
acked-0.116
 cries-0.107
erved-0.106
Token door
Token ID3420
Feature activation+1.11e+00

Pos logit contributions
acked+0.080
 up+0.078
level+0.075
aned+0.073
ulum+0.073
Neg logit contributions
 mate-0.100
 souven-0.085
 charity-0.084
 importantly-0.076
 rulers-0.075
Token was
Token ID373
Feature activation-1.63e+00

Pos logit contributions
 mate+0.056
 souven+0.048
 charity+0.048
 importantly+0.045
 rulers+0.045
Neg logit contributions
 up-0.027
acked-0.025
level-0.023
 hurts-0.023
erved-0.022
Token closed
Token ID4838
Feature activation-3.76e+00

Pos logit contributions
 up+0.051
level+0.037
 hurts+0.036
erved+0.035
acked+0.035
Neg logit contributions
 mate-0.077
 charity-0.066
 souven-0.064
ities-0.062
 rulers-0.062
Token.
Token ID13
Feature activation-8.08e-01

Pos logit contributions
 up+0.117
acked+0.110
level+0.105
 hurts+0.101
aned+0.100
Neg logit contributions
 mate-0.163
 souven-0.137
 charity-0.136
 importantly-0.127
 rulers-0.125
Token<|endoftext|>
Token ID50256
Feature activation-7.06e+00

Pos logit contributions
 up+0.017
acked+0.016
level+0.015
 hurts+0.014
ulum+0.014
Neg logit contributions
 mate-0.043
 souven-0.037
 charity-0.037
 rulers-0.035
 importantly-0.035
TokenOne
Token ID3198
Feature activation+5.76e+00

Pos logit contributions
aned+0.244
erved+0.243
 up+0.239
level+0.238
acked+0.230
Neg logit contributions
 mate-0.259
 souven-0.214
 charity-0.212
 importantly-0.196
 customer-0.184
Token day
Token ID1110
Feature activation-4.23e+00

Pos logit contributions
 mate+0.163
 souven+0.122
 importantly+0.113
 charity+0.110
 possible+0.106
Neg logit contributions
aned-0.269
acked-0.257
ulum-0.248
level-0.247
 up-0.243
Token,
Token ID11
Feature activation+7.46e-01

Pos logit contributions
 up+0.132
acked+0.125
level+0.120
 hurts+0.115
aned+0.115
Neg logit contributions
 mate-0.183
 souven-0.154
 charity-0.151
 importantly-0.142
 rulers-0.140
Token Mrs
Token ID9074
Feature activation+8.88e-01

Pos logit contributions
 mate+0.027
 souven+0.022
 charity+0.021
 importantly+0.020
 age+0.019
Neg logit contributions
aned-0.027
acked-0.027
 up-0.026
erved-0.026
 hurts-0.026
Token Jones
Token ID5437
Feature activation-4.62e-02

Pos logit contributions
 mate+0.040
 souven+0.035
 charity+0.033
 importantly+0.032
 rulers+0.032
Neg logit contributions
 up-0.027
acked-0.024
ulum-0.023
level-0.022
 hurts-0.022
Token took
Token ID1718
Feature activation-1.74e+00

Pos logit contributions
 up+0.001
acked+0.001
level+0.001
aned+0.001
 hurts+0.001
Neg logit contributions
 mate-0.002
 souven-0.002
 charity-0.002
 importantly-0.002
 rulers-0.002
Token to
Token ID284
Feature activation-3.97e-01

Pos logit contributions
acked+0.101
 up+0.100
level+0.094
aned+0.090
ulum+0.090
Neg logit contributions
 mate-0.170
 souven-0.145
 charity-0.143
 importantly-0.132
 rulers-0.130
Token help
Token ID1037
Feature activation+2.55e+00

Pos logit contributions
 up+0.014
acked+0.013
level+0.012
 hurts+0.012
erved+0.012
Neg logit contributions
 mate-0.015
 souven-0.013
 charity-0.013
 importantly-0.012
 rulers-0.012
Token him
Token ID683
Feature activation-2.55e-01

Pos logit contributions
 mate+0.140
 importantly+0.121
 souven+0.118
 rulers+0.114
 charity+0.113
Neg logit contributions
 up-0.067
level-0.045
acked-0.044
 hurts-0.043
 cries-0.041
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
acked+0.009
 up+0.009
level+0.009
aned+0.009
ulum+0.008
Neg logit contributions
 mate-0.010
 charity-0.008
 souven-0.008
 rulers-0.007
 importantly-0.007
Token The
Token ID383
Feature activation-4.68e-01

Pos logit contributions
 up+0.030
level+0.029
acked+0.027
aned+0.026
ulum+0.026
Neg logit contributions
 mate-0.057
 charity-0.048
 souven-0.048
 importantly-0.046
 buyer-0.044
Token giant
Token ID6175
Feature activation+2.94e+00

Pos logit contributions
acked+0.016
 up+0.016
aned+0.015
level+0.015
erved+0.014
Neg logit contributions
 mate-0.019
 souven-0.016
 charity-0.015
 rulers-0.014
 importantly-0.014
Token used
Token ID973
Feature activation+1.98e+00

Pos logit contributions
 mate+0.137
 souven+0.116
 charity+0.114
 importantly+0.108
 rulers+0.107
Neg logit contributions
 up-0.082
acked-0.078
level-0.074
ulum-0.071
 hurts-0.071
Token his
Token ID465
Feature activation+2.22e+00

Pos logit contributions
 mate+0.096
 charity+0.084
 souven+0.082
 importantly+0.078
 rulers+0.077
Neg logit contributions
 up-0.054
acked-0.047
level-0.045
 hurts-0.043
erved-0.042
Token strong
Token ID1913
Feature activation-2.28e-01

Pos logit contributions
 mate+0.095
 souven+0.079
 charity+0.078
 importantly+0.073
 rulers+0.072
Neg logit contributions
 up-0.071
acked-0.067
level-0.064
 hurts-0.062
aned-0.062
Token legs
Token ID7405
Feature activation+1.94e+00

Pos logit contributions
 up+0.008
acked+0.007
level+0.007
aned+0.007
faced+0.007
Neg logit contributions
 mate-0.009
 souven-0.008
 charity-0.007
 rulers-0.007
 importantly-0.007
Token and
Token ID290
Feature activation-1.49e+00

Pos logit contributions
 mate+0.095
 souven+0.082
 charity+0.081
 importantly+0.077
 rulers+0.076
Neg logit contributions
 up-0.049
acked-0.046
level-0.043
 hurts-0.041
aned-0.041
Token,
Token ID11
Feature activation+2.90e+00

Pos logit contributions
 mate+0.021
 souven+0.018
 charity+0.017
 importantly+0.016
 rulers+0.016
Neg logit contributions
 up-0.017
acked-0.016
level-0.016
erved-0.015
aned-0.015
Token outside
Token ID2354
Feature activation-5.78e-02

Pos logit contributions
 mate+0.126
 souven+0.106
 charity+0.105
 importantly+0.098
 rulers+0.097
Neg logit contributions
 up-0.091
acked-0.085
level-0.081
 hurts-0.078
ulum-0.078
Token.
Token ID13
Feature activation+4.10e-02

Pos logit contributions
acked+0.002
 up+0.002
level+0.002
aned+0.002
erved+0.002
Neg logit contributions
 mate-0.002
 souven-0.002
 charity-0.002
 importantly-0.002
 age-0.002
Token He
Token ID679
Feature activation+1.28e+00

Pos logit contributions
 mate+0.002
 souven+0.001
 charity+0.001
 importantly+0.001
 rulers+0.001
Neg logit contributions
 up-0.001
acked-0.001
level-0.001
aned-0.001
erved-0.001
Token wanted
Token ID2227
Feature activation+2.93e-01

Pos logit contributions
 mate+0.059
 souven+0.050
 charity+0.049
 importantly+0.047
 rulers+0.046
Neg logit contributions
 up-0.035
acked-0.034
level-0.032
 hurts-0.031
ulum-0.031
Token to
Token ID284
Feature activation+3.14e-01

Pos logit contributions
 mate+0.018
 importantly+0.016
ities+0.016
 charity+0.015
 souven+0.015
Neg logit contributions
 up-0.005
level-0.003
 hurts-0.003
erved-0.002
faced-0.002
Token show
Token ID905
Feature activation+4.36e+00

Pos logit contributions
 mate+0.013
 souven+0.011
 charity+0.011
 importantly+0.010
 rulers+0.010
Neg logit contributions
 up-0.010
acked-0.009
level-0.009
 hurts-0.009
erved-0.009
Token her
Token ID607
Feature activation+2.59e+00

Pos logit contributions
 mate+0.219
 importantly+0.203
ities+0.203
 charity+0.200
 souven+0.192
Neg logit contributions
 up-0.129
 hurts-0.081
 down-0.080
faced-0.079
erved-0.078
Token the
Token ID262
Feature activation-4.68e-01

Pos logit contributions
 mate+0.104
 souven+0.086
 charity+0.084
 importantly+0.077
 rulers+0.077
Neg logit contributions
acked-0.087
 up-0.086
level-0.083
aned-0.082
ulum-0.081
Token great
Token ID1049
Feature activation+2.10e+00

Pos logit contributions
 up+0.015
 hurts+0.013
acked+0.013
level+0.013
ulum+0.012
Neg logit contributions
 mate-0.020
 charity-0.017
 souven-0.017
 importantly-0.016
 rulers-0.016
Token string
Token ID4731
Feature activation+1.80e+00

Pos logit contributions
 mate+0.084
 souven+0.070
 charity+0.068
 importantly+0.063
 rulers+0.063
Neg logit contributions
 up-0.072
acked-0.069
level-0.067
aned-0.064
 hurts-0.064
Token
Token ID129
Feature activation-8.45e-01

Pos logit contributions
 up+0.194
acked+0.180
level+0.175
aned+0.168
 hurts+0.166
Neg logit contributions
 mate-0.267
 charity-0.224
 souven-0.223
 importantly-0.209
 rulers-0.205
Token
Token ID241
Feature activation+3.73e+00

Pos logit contributions
level+0.026
 up+0.025
 hurts+0.025
acked+0.025
chair+0.024
Neg logit contributions
 mate-0.032
uries-0.027
 charity-0.027
 than-0.025
 age-0.025
TokenMy
Token ID3666
Feature activation+4.37e+00

Pos logit contributions
 mate+0.156
 souven+0.129
 charity+0.128
 importantly+0.119
 rulers+0.119
Neg logit contributions
 up-0.121
acked-0.115
level-0.110
 hurts-0.107
ulum-0.106
Token favorite
Token ID4004
Feature activation+6.75e+00

Pos logit contributions
 mate+0.180
 charity+0.152
 souven+0.149
 importantly+0.147
 rulers+0.140
Neg logit contributions
 up-0.145
acked-0.125
 hurts-0.124
level-0.118
erved-0.117
Token food
Token ID2057
Feature activation+6.45e+00

Pos logit contributions
 mate+0.288
 charity+0.238
 importantly+0.236
 souven+0.236
34+0.224
Neg logit contributions
 up-0.218
 hurts-0.193
acked-0.192
faced-0.174
level-0.173
Token is
Token ID318
Feature activation+1.18e+00

Pos logit contributions
 mate+0.348
 souven+0.305
 charity+0.299
 importantly+0.288
 rulers+0.283
Neg logit contributions
 up-0.146
 hurts-0.122
acked-0.117
level-0.098
uck-0.098
Token apples
Token ID22514
Feature activation+2.06e+00

Pos logit contributions
 mate+0.053
 souven+0.045
 charity+0.045
 importantly+0.042
ities+0.042
Neg logit contributions
 up-0.037
 hurts-0.030
acked-0.029
faced-0.026
 rains-0.026
Token!
Token ID0
Feature activation+6.74e-02

Pos logit contributions
 mate+0.101
 souven+0.088
 charity+0.086
 importantly+0.083
 rulers+0.079
Neg logit contributions
 up-0.059
acked-0.047
 hurts-0.047
level-0.043
uck-0.042
Tokenâ
Token ID22940
Feature activation-7.90e-01

Pos logit contributions
 mate+0.003
 souven+0.003
 rulers+0.003
 charity+0.003
ities+0.003
Neg logit contributions
 up-0.002
acked-0.002
uck-0.001
chair-0.001
 Now-0.001
Token
Token ID26391
Feature activation-6.58e+00

Pos logit contributions
 up+0.018
level+0.016
acked+0.016
 hurts+0.015
erved+0.015
Neg logit contributions
 mate-0.038
 charity-0.033
 souven-0.032
uries-0.031
 importantly-0.030
Token it
Token ID340
Feature activation-1.86e+00

Pos logit contributions
 up+0.199
acked+0.187
level+0.179
 hurts+0.172
aned+0.172
Neg logit contributions
 mate-0.289
 souven-0.244
 charity-0.242
 importantly-0.226
 rulers-0.223
Token stirred
Token ID33091
Feature activation-1.43e+00

Pos logit contributions
 mate+0.068
 souven+0.056
 charity+0.055
 rulers+0.051
 importantly+0.051
Neg logit contributions
 up-0.056
acked-0.053
level-0.051
 hurts-0.050
ulum-0.050
Token and
Token ID290
Feature activation-2.45e+00

Pos logit contributions
acked+0.043
aned+0.042
level+0.042
 hurts+0.042
 up+0.042
Neg logit contributions
 mate-0.058
 souven-0.050
 charity-0.047
 rulers-0.046
 importantly-0.044
Token stirred
Token ID33091
Feature activation-1.55e+00

Pos logit contributions
 up+0.066
acked+0.061
level+0.058
 hurts+0.056
aned+0.055
Neg logit contributions
 mate-0.117
 souven-0.100
 charity-0.100
 importantly-0.093
 rulers-0.092
Token until
Token ID1566
Feature activation+8.16e-01

Pos logit contributions
 up+0.052
acked+0.050
level+0.048
 hurts+0.047
aned+0.047
Neg logit contributions
 mate-0.062
 souven-0.052
 charity-0.051
 importantly-0.047
 rulers-0.047
Token the
Token ID262
Feature activation-2.44e+00

Pos logit contributions
 mate+0.033
 souven+0.028
 charity+0.027
 importantly+0.025
 rulers+0.025
Neg logit contributions
 up-0.028
acked-0.026
level-0.025
 hurts-0.024
ulum-0.024
Token pasta
Token ID26296
Feature activation+1.17e+00

Pos logit contributions
 up+0.082
acked+0.080
level+0.078
aned+0.077
erved+0.075
Neg logit contributions
 mate-0.097
 souven-0.081
 charity-0.079
 rulers-0.074
 importantly-0.073
Token was
Token ID373
Feature activation-1.66e+00

Pos logit contributions
 mate+0.065
 souven+0.057
 charity+0.057
 importantly+0.054
 rulers+0.053
Neg logit contributions
 up-0.022
acked-0.020
level-0.018
 hurts-0.017
aned-0.017
Token y
Token ID331
Feature activation+5.77e-02

Pos logit contributions
 up+0.054
acked+0.047
level+0.045
aned+0.044
 hurts+0.043
Neg logit contributions
 mate-0.073
 charity-0.063
 souven-0.061
 importantly-0.057
 rulers-0.056
Tokenummy
Token ID13513
Feature activation-1.86e+00

Pos logit contributions
 mate+0.003
 souven+0.002
 charity+0.002
 importantly+0.002
 rulers+0.002
Neg logit contributions
 up-0.002
acked-0.002
level-0.002
erved-0.002
 hurts-0.002
Token and
Token ID290
Feature activation-1.96e+00

Pos logit contributions
 up+0.053
acked+0.041
 hurts+0.038
uck+0.037
ulum+0.036
Neg logit contributions
 mate-0.095
 charity-0.082
 souven-0.079
 importantly-0.075
uries-0.073
Token ready
Token ID3492
Feature activation-1.34e+00

Pos logit contributions
 up+0.064
acked+0.053
level+0.053
 hurts+0.053
ulum+0.050
Neg logit contributions
 mate-0.089
 charity-0.075
 souven-0.072
ities-0.067
 age-0.067
Token you
Token ID345
Feature activation+3.50e+00

Pos logit contributions
 mate+0.073
 souven+0.064
 charity+0.063
 importantly+0.060
 rulers+0.059
Neg logit contributions
 up-0.032
acked-0.028
 hurts-0.027
level-0.025
erved-0.023
Token can
Token ID460
Feature activation-1.88e+00

Pos logit contributions
 mate+0.137
 souven+0.118
 importantly+0.117
 charity+0.116
 buyer+0.104
Neg logit contributions
 up-0.123
acked-0.110
 hurts-0.110
level-0.107
 rains-0.103
Token plant
Token ID4618
Feature activation-3.81e-02

Pos logit contributions
 up+0.063
acked+0.057
 hurts+0.055
level+0.054
aned+0.052
Neg logit contributions
 mate-0.078
 souven-0.066
 charity-0.065
 importantly-0.063
 rulers-0.059
Token and
Token ID290
Feature activation-1.85e+00

Pos logit contributions
 up+0.001
acked+0.001
 hurts+0.001
level+0.001
uck+0.001
Neg logit contributions
 mate-0.002
 souven-0.001
 charity-0.001
 importantly-0.001
 rulers-0.001
Token grow
Token ID1663
Feature activation+4.34e+00

Pos logit contributions
 up+0.060
 hurts+0.053
level+0.051
acked+0.050
ulum+0.048
Neg logit contributions
 mate-0.079
 charity-0.065
 souven-0.065
 importantly-0.064
34-0.060
Token a
Token ID257
Feature activation+3.27e-01

Pos logit contributions
 mate+0.173
 souven+0.147
 charity+0.140
 rulers+0.135
 importantly+0.131
Neg logit contributions
acked-0.139
 up-0.136
level-0.135
ulum-0.135
aned-0.133
Token tree
Token ID5509
Feature activation-2.76e+00

Pos logit contributions
 mate+0.015
 importantly+0.013
 charity+0.013
ities+0.013
uries+0.012
Neg logit contributions
 up-0.009
 hurts-0.008
uddle-0.007
acked-0.007
>-0.006
Token."
Token ID526
Feature activation-3.65e+00

Pos logit contributions
 up+0.087
acked+0.080
level+0.076
 hurts+0.075
ulum+0.073
Neg logit contributions
 mate-0.119
 souven-0.101
 charity-0.100
 importantly-0.094
 rulers-0.091
Token
Token ID198
Feature activation+2.03e+00

Pos logit contributions
 up+0.114
acked+0.112
level+0.107
aned+0.105
erved+0.104
Neg logit contributions
 mate-0.154
 charity-0.129
 souven-0.129
 importantly-0.121
 rulers-0.116
Token
Token ID198
Feature activation+2.38e+00

Pos logit contributions
 mate+0.062
 charity+0.051
 souven+0.047
 importantly+0.047
 than+0.042
Neg logit contributions
level-0.082
erved-0.082
acked-0.081
 up-0.080
aned-0.079
TokenL
Token ID43
Feature activation+3.49e+00

Pos logit contributions
 mate+0.083
 charity+0.067
 souven+0.063
 importantly+0.058
 rulers+0.058
Neg logit contributions
 up-0.098
acked-0.092
level-0.092
aned-0.089
erved-0.089
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
 up+0.092
acked+0.078
 hurts+0.074
erved+0.072
aned+0.069
Neg logit contributions
 mate-0.155
 charity-0.135
 souven-0.135
 importantly-0.127
 rulers-0.124
Token Ben
Token ID3932
Feature activation-1.60e+00

Pos logit contributions
 up+0.031
acked+0.029
level+0.027
 hurts+0.027
ulum+0.026
Neg logit contributions
 mate-0.062
 souven-0.054
 charity-0.053
 rulers-0.050
 importantly-0.050
Token pushes
Token ID20070
Feature activation+6.77e-01

Pos logit contributions
faced+0.046
level+0.046
acked+0.044
covered+0.043
ulum+0.043
Neg logit contributions
 mate-0.062
 rulers-0.055
 than-0.054
 age-0.052
 ones-0.051
Token harder
Token ID7069
Feature activation-2.03e+00

Pos logit contributions
 mate+0.028
 souven+0.024
 charity+0.023
 rulers+0.021
 importantly+0.021
Neg logit contributions
acked-0.021
level-0.020
 up-0.020
aned-0.019
faced-0.019
Token and
Token ID290
Feature activation-1.97e+00

Pos logit contributions
 up+0.049
 hurts+0.039
acked+0.039
aned+0.036
ulum+0.036
Neg logit contributions
 mate-0.108
 charity-0.095
 souven-0.092
 importantly-0.088
 buyer-0.086
Token harder
Token ID7069
Feature activation-1.58e+00

Pos logit contributions
acked+0.068
level+0.062
ulum+0.061
aned+0.061
 up+0.060
Neg logit contributions
 mate-0.075
 souven-0.066
 charity-0.060
 rulers-0.058
 possible-0.057
Token.
Token ID13
Feature activation-9.90e-01

Pos logit contributions
 up+0.042
 hurts+0.032
 cries+0.031
aned+0.030
 hits+0.030
Neg logit contributions
 mate-0.081
 charity-0.073
 souven-0.068
 importantly-0.066
 buyer-0.066
Token But
Token ID887
Feature activation+3.79e+00

Pos logit contributions
 up+0.026
level+0.025
acked+0.025
aned+0.023
erved+0.023
Neg logit contributions
 mate-0.047
 souven-0.039
 charity-0.039
 importantly-0.037
 rulers-0.036
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
 mate+0.181
 charity+0.157
 souven+0.155
 importantly+0.146
 rulers+0.143
Neg logit contributions
 up-0.107
acked-0.090
level-0.088
 hurts-0.087
ulum-0.084
Token tower
Token ID10580
Feature activation+2.07e+00

Pos logit contributions
acked+0.030
 up+0.029
level+0.027
aned+0.027
faced+0.026
Neg logit contributions
 mate-0.053
 souven-0.045
 charity-0.044
 rulers-0.043
 importantly-0.041
Token does
Token ID857
Feature activation-5.03e+00

Pos logit contributions
 mate+0.102
 souven+0.089
 rulers+0.088
 charity+0.083
 than+0.082
Neg logit contributions
level-0.044
acked-0.044
faced-0.042
 up-0.042
ulum-0.041
Token everything
Token ID2279
Feature activation+4.09e+00

Pos logit contributions
 up+0.021
 hurts+0.020
erved+0.019
level+0.019
acked+0.018
Neg logit contributions
 mate-0.021
 souven-0.018
 charity-0.018
 importantly-0.017
ities-0.016
Token they
Token ID484
Feature activation-3.00e-02

Pos logit contributions
 mate+0.169
 charity+0.139
 souven+0.138
ities+0.131
 importantly+0.130
Neg logit contributions
 up-0.146
 hurts-0.122
acked-0.120
level-0.118
erved-0.114
Token need
Token ID761
Feature activation+2.98e+00

Pos logit contributions
 up+0.001
 hurts+0.001
acked+0.001
level+0.001
erved+0.001
Neg logit contributions
 mate-0.001
 souven-0.001
 importantly-0.001
 charity-0.001
 rulers-0.001
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
 mate+0.151
 importantly+0.129
 souven+0.129
 rulers+0.127
 charity+0.127
Neg logit contributions
 up-0.074
 hurts-0.055
acked-0.050
>-0.048
hed-0.047
Token<|endoftext|>
Token ID50256
Feature activation-6.23e+00

Pos logit contributions
 up+0.023
acked+0.022
 hurts+0.021
 The+0.020
 Now+0.018
Neg logit contributions
 mate-0.065
 souven-0.057
 rulers-0.056
 charity-0.054
 queens-0.054
TokenBen
Token ID11696
Feature activation-2.36e+00

Pos logit contributions
aned+0.189
erved+0.185
 up+0.176
faced+0.174
level+0.174
Neg logit contributions
 mate-0.264
 charity-0.225
 souven-0.223
 importantly-0.210
 rulers-0.205
Token and
Token ID290
Feature activation-1.50e+00

Pos logit contributions
aned+0.070
level+0.065
 up+0.063
faced+0.063
ulum+0.063
Neg logit contributions
 mate-0.103
 than-0.083
 charity-0.082
 souven-0.079
 ones-0.078
Token Lily
Token ID20037
Feature activation-2.71e+00

Pos logit contributions
aned+0.057
acked+0.056
 up+0.055
ulum+0.052
level+0.052
Neg logit contributions
 mate-0.057
 souven-0.045
 charity-0.044
 customer-0.039
 rulers-0.038
Token were
Token ID547
Feature activation-1.75e-01

Pos logit contributions
 up+0.090
acked+0.087
aned+0.085
level+0.083
 hurts+0.081
Neg logit contributions
 mate-0.108
 charity-0.088
 souven-0.087
 rulers-0.081
 age-0.079
Token playing
Token ID2712
Feature activation-2.87e+00

Pos logit contributions
acked+0.008
level+0.008
ulum+0.007
aned+0.007
faced+0.007
Neg logit contributions
 mate-0.005
 charity-0.004
 importantly-0.004
 souven-0.004
 rulers-0.004
Token in
Token ID287
Feature activation-1.01e+00

Pos logit contributions
 up+0.089
acked+0.088
aned+0.086
level+0.086
 hurts+0.084
Neg logit contributions
 mate-0.121
 souven-0.101
 charity-0.100
 rulers-0.093
 importantly-0.090
Token finished
Token ID5201
Feature activation-4.26e+00

Pos logit contributions
 up+0.033
acked+0.027
level+0.025
aned+0.024
 hurts+0.023
Neg logit contributions
 mate-0.078
 charity-0.069
 souven-0.068
 importantly-0.064
 rulers-0.063
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
 up+0.120
acked+0.096
 hurts+0.092
ulum+0.091
level+0.091
Neg logit contributions
 mate-0.211
 charity-0.182
 souven-0.179
 importantly-0.167
 rulers-0.166
Token the
Token ID262
Feature activation-2.46e+00

Pos logit contributions
 mate+0.067
 charity+0.057
 souven+0.056
 importantly+0.053
 rulers+0.052
Neg logit contributions
 up-0.034
acked-0.028
level-0.028
 hurts-0.027
ulum-0.026
Token dad
Token ID9955
Feature activation+7.11e-01

Pos logit contributions
 up+0.101
acked+0.100
level+0.095
aned+0.094
 hurts+0.092
Neg logit contributions
 mate-0.081
 souven-0.064
 charity-0.063
 importantly-0.058
 rulers-0.057
Token closed
Token ID4838
Feature activation-2.61e+00

Pos logit contributions
 mate+0.033
 souven+0.028
 charity+0.027
 importantly+0.026
 rulers+0.026
Neg logit contributions
 up-0.020
acked-0.019
level-0.018
erved-0.017
ulum-0.017
Token the
Token ID262
Feature activation-2.22e+00

Pos logit contributions
acked+0.080
level+0.079
 up+0.077
 hurts+0.076
ulum+0.075
Neg logit contributions
 mate-0.106
 souven-0.092
 charity-0.083
 importantly-0.079
 rulers-0.079
Token guitar
Token ID10047
Feature activation-1.64e+00

Pos logit contributions
acked+0.082
level+0.078
irled+0.077
aned+0.076
forth+0.076
Neg logit contributions
 mate-0.078
 charity-0.065
 souven-0.063
 customer-0.059
 age-0.059
Token and
Token ID290
Feature activation-2.14e+00

Pos logit contributions
 up+0.042
acked+0.039
level+0.037
 hurts+0.035
aned+0.035
Neg logit contributions
 mate-0.080
 souven-0.068
 charity-0.068
 importantly-0.064
 rulers-0.063
Token put
Token ID1234
Feature activation-1.14e+00

Pos logit contributions
 up+0.058
acked+0.047
level+0.047
 hurts+0.045
ulum+0.043
Neg logit contributions
 mate-0.105
 charity-0.092
 souven-0.089
 importantly-0.087
ities-0.086
Token it
Token ID340
Feature activation-4.94e+00

Pos logit contributions
 up+0.032
acked+0.027
level+0.025
 hurts+0.023
hed+0.023
Neg logit contributions
 mate-0.058
 charity-0.051
 souven-0.048
 importantly-0.048
 rulers-0.047
Token away
Token ID1497
Feature activation-3.57e+00

Pos logit contributions
 up+0.126
acked+0.119
level+0.113
 hurts+0.108
ulum+0.108
Neg logit contributions
 mate-0.239
 souven-0.206
 charity-0.202
 importantly-0.191
 rulers-0.190
Token share
Token ID2648
Feature activation-6.32e-01

Pos logit contributions
 up+0.063
 hurts+0.056
erved+0.049
 cries+0.048
level+0.048
Neg logit contributions
 mate-0.091
34-0.080
ities-0.079
 souven-0.078
 importantly-0.077
Token his
Token ID465
Feature activation+2.49e-01

Pos logit contributions
 up+0.017
 hurts+0.010
acked+0.010
faced+0.009
 down+0.009
Neg logit contributions
 mate-0.034
 importantly-0.031
34-0.029
 rulers-0.029
ances-0.028
Token toys
Token ID14958
Feature activation+3.30e+00

Pos logit contributions
 mate+0.009
 souven+0.008
 charity+0.007
 importantly+0.007
 rulers+0.007
Neg logit contributions
 up-0.009
acked-0.009
level-0.008
 hurts-0.008
erved-0.008
Token with
Token ID351
Feature activation+4.84e+00

Pos logit contributions
 mate+0.155
 importantly+0.133
 souven+0.131
 charity+0.127
 rulers+0.125
Neg logit contributions
 up-0.098
acked-0.076
 hurts-0.075
faced-0.071
level-0.068
Token them
Token ID606
Feature activation+1.49e+00

Pos logit contributions
 mate+0.257
 souven+0.228
 importantly+0.219
 rulers+0.212
 possible+0.210
Neg logit contributions
 up-0.115
 hurts-0.085
acked-0.076
 cries-0.075
ulum-0.072
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
 mate+0.065
 souven+0.055
 charity+0.055
 importantly+0.053
 rulers+0.051
Neg logit contributions
 up-0.048
acked-0.041
 hurts-0.039
level-0.038
erved-0.037
Token
Token ID198
Feature activation+1.98e+00

Pos logit contributions
 up+0.033
acked+0.029
 hurts+0.027
erved+0.023
ulum+0.023
Neg logit contributions
 mate-0.094
 souven-0.084
 rulers-0.081
 charity-0.081
ities-0.079
Token
Token ID198
Feature activation+2.13e+00

Pos logit contributions
 mate+0.067
 charity+0.057
 souven+0.054
 importantly+0.049
 than+0.044
Neg logit contributions
 up-0.074
level-0.074
erved-0.073
acked-0.073
ulum-0.073
TokenHe
Token ID1544
Feature activation+3.63e-01

Pos logit contributions
 mate+0.090
 souven+0.075
 charity+0.075
 importantly+0.069
 rulers+0.068
Neg logit contributions
 up-0.070
acked-0.066
level-0.064
 hurts-0.061
ulum-0.061
Token put
Token ID1234
Feature activation-9.82e-01

Pos logit contributions
 mate+0.015
 souven+0.013
 charity+0.012
 rulers+0.012
 buyer+0.011
Neg logit contributions
acked-0.011
 up-0.011
level-0.011
ulum-0.011
aned-0.010
Token his
Token ID465
Feature activation-3.52e-01

Pos logit contributions
 up+0.028
acked+0.023
level+0.021
 hurts+0.020
erved+0.020
Neg logit contributions
 mate-0.049
 charity-0.043
 importantly-0.042
 souven-0.042
 age-0.040
Token was
Token ID373
Feature activation-8.50e-01

Pos logit contributions
 mate+0.011
 souven+0.009
 charity+0.009
 rulers+0.009
 age+0.009
Neg logit contributions
 up-0.006
acked-0.006
level-0.006
ulum-0.006
erved-0.006
Token so
Token ID523
Feature activation-1.37e+00

Pos logit contributions
acked+0.028
 up+0.028
level+0.027
ulum+0.027
 hurts+0.027
Neg logit contributions
 mate-0.034
 souven-0.027
 charity-0.026
 importantly-0.026
 rulers-0.025
Token excited
Token ID6568
Feature activation-2.87e+00

Pos logit contributions
 up+0.048
acked+0.048
ulum+0.048
level+0.047
erved+0.047
Neg logit contributions
 mate-0.050
 souven-0.042
 importantly-0.039
 charity-0.039
 rulers-0.037
Token,
Token ID11
Feature activation+2.01e+00

Pos logit contributions
 up+0.083
acked+0.069
 hurts+0.065
level+0.063
uck+0.063
Neg logit contributions
 mate-0.140
 charity-0.121
 souven-0.120
 rulers-0.112
 importantly-0.110
Token she
Token ID673
Feature activation-1.88e-02

Pos logit contributions
 mate+0.085
 charity+0.069
 souven+0.065
ities+0.065
 rulers+0.064
Neg logit contributions
 up-0.073
acked-0.059
 hurts-0.058
level-0.058
ulum-0.056
Token began
Token ID2540
Feature activation-1.49e+00

Pos logit contributions
 up+0.001
acked+0.001
level+0.001
ulum+0.000
erved+0.000
Neg logit contributions
 mate-0.001
 souven-0.001
 charity-0.001
 importantly-0.001
 rulers-0.001
Token to
Token ID284
Feature activation-7.03e-01

Pos logit contributions
 up+0.039
acked+0.037
level+0.035
 hurts+0.033
aned+0.033
Neg logit contributions
 mate-0.072
 souven-0.062
 charity-0.061
 importantly-0.057
 rulers-0.057
Token jump
Token ID4391
Feature activation-1.51e+00

Pos logit contributions
acked+0.024
 up+0.024
level+0.023
aned+0.022
erved+0.022
Neg logit contributions
 mate-0.028
 souven-0.022
 charity-0.022
 rulers-0.020
 age-0.020
Token as
Token ID355
Feature activation+2.59e+00

Pos logit contributions
level+0.041
acked+0.041
ulum+0.041
erved+0.040
aned+0.040
Neg logit contributions
 mate-0.068
 souven-0.059
 charity-0.052
 rulers-0.052
 possible-0.051
Token everyone
Token ID2506
Feature activation+3.60e+00

Pos logit contributions
 mate+0.104
 souven+0.093
 possible+0.086
 rulers+0.084
 charity+0.084
Neg logit contributions
acked-0.079
ulum-0.077
level-0.074
aned-0.074
 hurts-0.074
Token else
Token ID2073
Feature activation+4.41e+00

Pos logit contributions
 mate+0.175
 souven+0.151
 charity+0.148
 importantly+0.141
 rulers+0.139
Neg logit contributions
 up-0.092
acked-0.086
level-0.081
 hurts-0.078
aned-0.077
Token daddy
Token ID35822
Feature activation-1.36e+00

Pos logit contributions
 up+0.004
 hurts+0.004
ulum+0.004
chair+0.003
level+0.003
Neg logit contributions
 mate-0.005
 souven-0.004
 charity-0.004
34-0.004
ities-0.004
Token and
Token ID290
Feature activation-2.50e+00

Pos logit contributions
 up+0.043
acked+0.040
level+0.038
 hurts+0.037
ulum+0.036
Neg logit contributions
 mate-0.059
 souven-0.049
 charity-0.049
 importantly-0.046
 rulers-0.045
Token rolling
Token ID10708
Feature activation-2.03e+00

Pos logit contributions
 up+0.088
acked+0.077
 hurts+0.076
level+0.075
erved+0.073
Neg logit contributions
 mate-0.099
 souven-0.083
 charity-0.081
 importantly-0.075
 rulers-0.075
Token maze
Token ID31237
Feature activation-2.58e+00

Pos logit contributions
aned+0.062
level+0.059
acked+0.059
ulum+0.058
erved+0.057
Neg logit contributions
 mate-0.084
 souven-0.074
 ones-0.068
 charity-0.067
 than-0.064
Token.
Token ID13
Feature activation-1.86e+00

Pos logit contributions
 up+0.079
acked+0.070
 hurts+0.064
erved+0.063
level+0.063
Neg logit contributions
 mate-0.117
 charity-0.099
 souven-0.099
 importantly-0.095
ities-0.092
Token<|endoftext|>
Token ID50256
Feature activation-8.30e+00

Pos logit contributions
 up+0.054
acked+0.051
level+0.050
aned+0.048
 hurts+0.047
Neg logit contributions
 mate-0.084
 charity-0.071
 souven-0.071
 importantly-0.066
 rulers-0.064
TokenOnce
Token ID7454
Feature activation+8.01e-01

Pos logit contributions
erved+0.318
aned+0.318
level+0.304
faced+0.299
 up+0.296
Neg logit contributions
 mate-0.273
 souven-0.223
 charity-0.222
 importantly-0.203
 ones-0.192
Token upon
Token ID2402
Feature activation-3.99e+00

Pos logit contributions
 mate+0.031
 souven+0.025
 charity+0.024
 importantly+0.023
 age+0.022
Neg logit contributions
acked-0.028
 up-0.027
erved-0.027
level-0.027
ulum-0.027
Token a
Token ID257
Feature activation-8.71e-02

Pos logit contributions
acked+0.173
aned+0.173
level+0.162
 up+0.162
uddle+0.160
Neg logit contributions
 mate-0.128
 souven-0.097
 charity-0.095
 rulers-0.087
 importantly-0.086
Token time
Token ID640
Feature activation-2.32e+00

Pos logit contributions
 up+0.004
acked+0.004
level+0.003
 hurts+0.003
aned+0.003
Neg logit contributions
 mate-0.003
 souven-0.002
 charity-0.002
 importantly-0.002
 rulers-0.002
Token,
Token ID11
Feature activation+4.11e-01

Pos logit contributions
 up+0.065
acked+0.057
level+0.054
 hurts+0.054
uck+0.052
Neg logit contributions
 mate-0.107
 souven-0.094
 charity-0.094
 importantly-0.087
 rulers-0.086
Token the
Token ID262
Feature activation-2.97e+00

Pos logit contributions
 up+0.012
level+0.010
 hurts+0.010
acked+0.009
forth+0.009
Neg logit contributions
 mate-0.011
 importantly-0.010
34-0.009
 souven-0.009
ities-0.009
Token tree
Token ID5509
Feature activation-2.04e+00

Pos logit contributions
 up+0.099
 hurts+0.084
level+0.083
acked+0.082
ulum+0.080
Neg logit contributions
 mate-0.125
 souven-0.103
 importantly-0.099
 charity-0.099
ities-0.094
Token,
Token ID11
Feature activation+9.67e-01

Pos logit contributions
 up+0.064
acked+0.060
level+0.057
 hurts+0.056
aned+0.055
Neg logit contributions
 mate-0.088
 souven-0.074
 charity-0.073
 importantly-0.069
 rulers-0.067
Token
Token ID6184
Feature activation+1.02e+00

Pos logit contributions
 mate+0.035
 souven+0.029
 charity+0.028
 importantly+0.026
 rulers+0.026
Neg logit contributions
 up-0.039
acked-0.032
level-0.032
 hurts-0.031
aned-0.031
Token
Token ID95
Feature activation-7.27e-01

Pos logit contributions
 mate+0.041
 charity+0.033
 souven+0.033
 rulers+0.030
 importantly+0.030
Neg logit contributions
 up-0.035
acked-0.034
level-0.032
 hurts-0.032
aned-0.031
Token
Token ID26391
Feature activation-8.04e+00

Pos logit contributions
 up+0.022
level+0.020
acked+0.020
 hurts+0.019
aned+0.018
Neg logit contributions
 mate-0.032
 charity-0.027
 souven-0.025
uries-0.025
 customer-0.024
Token
Token ID129
Feature activation-2.49e+00

Pos logit contributions
 up+0.240
acked+0.223
level+0.216
 hurts+0.206
aned+0.205
Neg logit contributions
 mate-0.358
 charity-0.298
 souven-0.296
 importantly-0.279
 rulers-0.274
Token
Token ID241
Feature activation+2.26e+00

Pos logit contributions
 up+0.077
level+0.076
 hurts+0.073
uddle+0.071
acked+0.069
Neg logit contributions
 mate-0.096
uries-0.082
 charity-0.081
 age-0.074
34-0.074
TokenPlease
Token ID5492
Feature activation+1.52e+00

Pos logit contributions
 mate+0.097
 souven+0.081
 charity+0.080
 rulers+0.075
 importantly+0.075
Neg logit contributions
 up-0.071
acked-0.067
level-0.064
 hurts-0.062
ulum-0.061
Token,
Token ID11
Feature activation+2.19e+00

Pos logit contributions
 mate+0.065
 souven+0.055
 importantly+0.055
 charity+0.053
 rulers+0.051
Neg logit contributions
 up-0.050
 hurts-0.042
acked-0.041
level-0.041
uck-0.040
Token can
Token ID460
Feature activation-2.00e+00

Pos logit contributions
 mate+0.081
 souven+0.068
ities+0.066
 importantly+0.066
 charity+0.066
Neg logit contributions
 up-0.080
 hurts-0.067
faced-0.065
level-0.065
acked-0.065
Token
Token ID241
Feature activation+2.33e+00

Pos logit contributions
 hurts+0.082
acked+0.082
level+0.081
chair+0.080
 snowball+0.078
Neg logit contributions
 mate-0.083
uries-0.066
 charity-0.063
ances-0.058
 than-0.057
TokenWelcome
Token ID14618
Feature activation+2.09e+00

Pos logit contributions
 mate+0.102
 souven+0.085
 charity+0.084
 rulers+0.079
 importantly+0.078
Neg logit contributions
 up-0.072
acked-0.068
level-0.065
 hurts-0.063
ulum-0.062
Token!
Token ID0
Feature activation-5.47e-01

Pos logit contributions
 mate+0.092
 souven+0.078
 charity+0.076
 importantly+0.071
 rulers+0.071
Neg logit contributions
 up-0.063
acked-0.060
level-0.057
aned-0.056
 hurts-0.055
Token We
Token ID775
Feature activation+2.04e+00

Pos logit contributions
 up+0.016
acked+0.015
level+0.014
uck+0.013
faced+0.013
Neg logit contributions
 mate-0.024
 souven-0.021
 charity-0.020
 rulers-0.020
ities-0.019
Tokenâ
Token ID22940
Feature activation-2.65e+00

Pos logit contributions
 mate+0.082
 souven+0.071
 charity+0.070
 importantly+0.066
 rulers+0.063
Neg logit contributions
 up-0.068
acked-0.065
level-0.062
 hurts-0.059
erved-0.058
Token
Token ID26391
Feature activation-7.93e+00

Pos logit contributions
 up+0.066
acked+0.059
level+0.057
 hurts+0.055
aned+0.054
Neg logit contributions
 mate-0.131
 charity-0.112
 souven-0.112
 importantly-0.105
 rulers-0.103
Token
Token ID8151
Feature activation+2.14e-01

Pos logit contributions
 up+0.255
acked+0.240
level+0.230
 hurts+0.223
ulum+0.222
Neg logit contributions
 mate-0.333
 souven-0.281
 charity-0.276
 importantly-0.257
 rulers-0.255
Tokenve
Token ID303
Feature activation+4.36e+00

Pos logit contributions
 mate+0.009
 souven+0.007
 charity+0.007
 importantly+0.007
 rulers+0.007
Neg logit contributions
 up-0.007
ulum-0.006
level-0.006
acked-0.006
erved-0.006
Token been
Token ID587
Feature activation+6.51e-01

Pos logit contributions
 mate+0.183
 souven+0.149
 charity+0.145
 rulers+0.138
 importantly+0.136
Neg logit contributions
 up-0.142
acked-0.134
level-0.129
ulum-0.129
 hurts-0.126
Token expecting
Token ID12451
Feature activation+1.04e+00

Pos logit contributions
 mate+0.028
 charity+0.025
 souven+0.025
ities+0.025
 importantly+0.023
Neg logit contributions
 up-0.021
acked-0.016
level-0.016
uck-0.016
faced-0.015
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 mate+0.051
ities+0.044
 importantly+0.043
 rulers+0.043
 souven+0.042
Neg logit contributions
 up-0.030
 hurts-0.021
 cries-0.018
 down-0.018
uck-0.017
Token smart
Token ID4451
Feature activation-7.38e-01

Pos logit contributions
 up+0.010
 hurts+0.008
faced+0.008
level+0.008
 snowball+0.008
Neg logit contributions
 mate-0.012
ities-0.010
ances-0.010
 charity-0.009
 importantly-0.009
Token kid
Token ID5141
Feature activation-7.86e-01

Pos logit contributions
 up+0.025
 hurts+0.020
acked+0.020
faced+0.020
erved+0.019
Neg logit contributions
 mate-0.028
 charity-0.026
 souven-0.025
nam-0.025
 importantly-0.024
Token,
Token ID11
Feature activation+1.38e+00

Pos logit contributions
 up+0.024
acked+0.021
 hurts+0.020
faced+0.019
level+0.019
Neg logit contributions
 mate-0.034
 souven-0.030
 charity-0.030
 importantly-0.028
ities-0.028
Token Mark
Token ID2940
Feature activation-7.72e-01

Pos logit contributions
 mate+0.057
 souven+0.049
 charity+0.049
ities+0.047
 cents+0.046
Neg logit contributions
 up-0.045
 hurts-0.036
chair-0.035
ulum-0.033
 snowball-0.033
Token!"
Token ID2474
Feature activation-5.43e+00

Pos logit contributions
 up+0.025
acked+0.024
level+0.023
aned+0.022
ulum+0.022
Neg logit contributions
 mate-0.032
 souven-0.026
 charity-0.026
 importantly-0.024
 rulers-0.024
Token<|endoftext|>
Token ID50256
Feature activation-7.91e+00

Pos logit contributions
 up+0.169
acked+0.163
level+0.162
erved+0.155
 hurts+0.153
Neg logit contributions
 mate-0.225
 souven-0.189
 charity-0.187
 importantly-0.178
 rulers-0.172
TokenOnce
Token ID7454
Feature activation+1.65e+00

Pos logit contributions
erved+0.236
aned+0.232
 up+0.227
level+0.225
faced+0.217
Neg logit contributions
 mate-0.328
 souven-0.276
 charity-0.275
 importantly-0.264
 ones-0.250
Token upon
Token ID2402
Feature activation-3.92e+00

Pos logit contributions
 mate+0.068
 souven+0.055
 charity+0.054
 importantly+0.052
 rulers+0.049
Neg logit contributions
 up-0.053
acked-0.053
level-0.051
erved-0.050
aned-0.049
Token a
Token ID257
Feature activation+1.63e-01

Pos logit contributions
aned+0.171
acked+0.170
level+0.160
uddle+0.156
 up+0.156
Neg logit contributions
 mate-0.122
 charity-0.096
 souven-0.094
 importantly-0.082
 rulers-0.080
Token time
Token ID640
Feature activation-2.30e+00

Pos logit contributions
 mate+0.006
 souven+0.005
 charity+0.004
 importantly+0.004
 rulers+0.004
Neg logit contributions
 up-0.007
acked-0.006
level-0.006
 hurts-0.006
erved-0.006
Token,
Token ID11
Feature activation+3.40e-01

Pos logit contributions
 up+0.060
acked+0.049
 hurts+0.046
level+0.046
uck+0.045
Neg logit contributions
 mate-0.112
 souven-0.101
 charity-0.101
ities-0.094
 rulers-0.094
Token
Token ID241
Feature activation+2.76e+00

Pos logit contributions
level+0.056
acked+0.055
 hurts+0.054
 up+0.054
chair+0.052
Neg logit contributions
 mate-0.069
 charity-0.055
uries-0.055
 than-0.054
 souven-0.052
TokenNow
Token ID3844
Feature activation-2.34e-01

Pos logit contributions
 mate+0.121
 charity+0.100
 souven+0.096
 rulers+0.095
ities+0.091
Neg logit contributions
 up-0.084
acked-0.081
level-0.074
faced-0.073
forth-0.072
Token,
Token ID11
Feature activation+2.74e+00

Pos logit contributions
 up+0.007
 hurts+0.006
uck+0.006
acked+0.005
forth+0.005
Neg logit contributions
 mate-0.011
 charity-0.009
 rulers-0.009
ities-0.009
 souven-0.009
Token let
Token ID1309
Feature activation-1.03e+00

Pos logit contributions
 mate+0.109
 charity+0.090
ities+0.090
 rulers+0.088
 importantly+0.086
Neg logit contributions
 up-0.096
uck-0.074
 rains-0.073
 hurts-0.072
level-0.072
Tokenâ
Token ID22940
Feature activation-3.26e+00

Pos logit contributions
 up+0.018
acked+0.017
level+0.016
aned+0.015
ulum+0.015
Neg logit contributions
 mate-0.058
 souven-0.050
 charity-0.050
 importantly-0.047
 rulers-0.047
Token
Token ID26391
Feature activation-8.56e+00

Pos logit contributions
 up+0.087
acked+0.079
level+0.077
 hurts+0.074
ulum+0.072
Neg logit contributions
 mate-0.149
 charity-0.130
 souven-0.127
 remember-0.118
uries-0.116
Token
Token ID8151
Feature activation-5.45e-01

Pos logit contributions
 up+0.283
acked+0.268
level+0.258
aned+0.250
 hurts+0.248
Neg logit contributions
 mate-0.352
 souven-0.293
 charity-0.292
 importantly-0.270
 rulers-0.266
Tokens
Token ID82
Feature activation+1.72e+00

Pos logit contributions
 up+0.014
ulum+0.013
level+0.013
acked+0.013
erved+0.013
Neg logit contributions
 mate-0.026
 souven-0.022
 charity-0.021
 importantly-0.020
 rulers-0.020
Token be
Token ID307
Feature activation+2.08e+00

Pos logit contributions
 mate+0.073
 souven+0.063
 charity+0.062
 importantly+0.060
 queens+0.056
Neg logit contributions
 up-0.056
acked-0.048
 hurts-0.047
level-0.047
erved-0.045
Token a
Token ID257
Feature activation+2.06e-01

Pos logit contributions
 mate+0.080
 charity+0.067
 souven+0.066
 importantly+0.061
ities+0.059
Neg logit contributions
 up-0.077
acked-0.070
level-0.067
erved-0.065
 hurts-0.065
Token little
Token ID1310
Feature activation-2.45e+00

Pos logit contributions
 mate+0.008
ities+0.007
 importantly+0.006
 charity+0.006
ances+0.006
Neg logit contributions
 up-0.007
faced-0.006
level-0.006
acked-0.006
 hurts-0.006