TOP ACTIVATIONS
MAX = 23.107

and she was three - years - old . She wanted
't cry , boys . Acc idents happen . I have
. She was three - years - old and she loved
and listening . <|endoftext|> M omm a and Billy were walking
work , bravery and cle vern ess ! From now on
She was only three - years - old , but she
He was only 3 - years - old . He had
It 's okay , my de ars . I know you
. She was three - years - old and loved to
She was only three - years - old , but she
. She was three - years - old . Every day
He was only three - years - old . He thought
to get too wild . Id a was so happy she
it and said , " E w , Mom my ,
. Bella was three - years - old and very excited
She was only three - years - old and she loved
sad . He says " oh no , my car !"
. He was three - years old and he loved skipping
, soaking up the morning de w that gl owed on
's courage . The two de ers became good friends after

MIN ACTIVATIONS
MIN = -16.889

Once upon a time , there was a graceful hero named
<|endoftext|> Once upon a time there was a restless little girl
Once upon a time , there was a powerful lion named
a very angry man up there . We need to be
. â <|endoftext|> Once there was a girl named Jane
â <|endoftext|> Once there was a boy who liked
Once upon a time , there was a march . It
â s go there ! â They ran
<|endoftext|> Once upon a time there was a girl called Emma
in a small town , there was a big mill .
the park . <|endoftext|> Once there was a little boy .
the pumpkin together , until there was nothing left . They
Once upon a time , there was a mom my and
<|endoftext|> Once upon a time there was a friendly bear named
Once upon a time , there was a ch ubby little
<|endoftext|> Once upon a time there was a young man ,
<|endoftext|> Once upon a time there was a knight . He
Once upon a time , there was a bright blue berry
<|endoftext|> Once upon a time there was a girl named Maria
<|endoftext|> Once upon a time there was a plane in the

INTERVAL 13.108 - 23.107
CONTAINS 0.320%

friends . They say " Good bye !" They go to
Okay . â
is . He is a good dog . He will not
happy . The birds are happy . The man is happy
say yes . They are happy . They like to play

INTERVAL 3.109 - 13.108
CONTAINS 33.271%

The rain was so heavy , it replaced the steam .
laughed with delight . She eventually succeeded in flying it around
was a little girl named Lily . She loved to play
mom my what it was and she told him it was
. The train was afraid and said , " Oh no

INTERVAL -6.890 - 3.109
CONTAINS 55.136%

named Sara . She had two arms that were small and
. He was curious so he went closer to have a
. They wanted to climb the tree and see what was
. He climbed up and slid down . It was fun
bed ! Fully aware of its invitation , Polly me owed

INTERVAL -16.889 - -6.890
CONTAINS 11.273%

. The image of her family was something that
have a picnic together . Max wanted to make
spill more water and colors on the table , the floor
happily until the sun went down . The End . <|endoftext|>
next time ." <|endoftext|> Once upon a time , there was
Token and
Token ID290
Feature activation+8.58e+00

Pos logit contributions
 mad+0.334
lee+0.333
years+0.322
 glad+0.316
illy+0.314
Neg logit contributions
Ready-0.242
 hunt-0.240
-0.238
 ride-0.225
sc-0.223
Token she
Token ID673
Feature activation+5.16e+00

Pos logit contributions
lee+0.185
 mad+0.180
 glad+0.180
 years+0.176
heart+0.170
Neg logit contributions
Ready-0.233
 hunt-0.215
Circ-0.214
 stairs-0.212
Find-0.208
Token was
Token ID373
Feature activation+7.12e+00

Pos logit contributions
lee+0.117
 mad+0.116
 glad+0.113
 years+0.113
heart+0.110
Neg logit contributions
Ready-0.131
 hunt-0.121
Circ-0.118
 stairs-0.118
-0.116
Token three
Token ID1115
Feature activation+8.40e+00

Pos logit contributions
lee+0.123
 years+0.117
heart+0.117
 mad+0.116
illy+0.116
Neg logit contributions
Ready-0.210
 gra-0.198
-0.196
 hunt-0.196
 stairs-0.196
Token-
Token ID12
Feature activation+4.20e+00

Pos logit contributions
Acc+0.091
illy+0.089
 ok+0.085
 okay+0.082
 glad+0.078
Neg logit contributions
 stairs-0.284
sc-0.282
 aisle-0.278
 hunt-0.277
 hills-0.276
Tokenyears
Token ID19002
Feature activation+2.24e+01

Pos logit contributions
lee+0.028
 mad+0.027
 glad+0.025
 years+0.023
 ok+0.022
Neg logit contributions
Ready-0.173
Circ-0.164
 stairs-0.162
sc-0.162
Find-0.161
Token-
Token ID12
Feature activation+3.43e+00

Pos logit contributions
 glad+0.391
 mad+0.390
 ok+0.370
lee+0.366
 okay+0.360
Neg logit contributions
 stairs-0.632
 hunt-0.626
Circ-0.623
Ready-0.623
sc-0.611
Tokenold
Token ID727
Feature activation+6.22e+00

Pos logit contributions
 mad+0.058
lee+0.057
 glad+0.055
 years+0.054
heart+0.051
Neg logit contributions
Ready-0.108
Circ-0.101
 hunt-0.100
 stairs-0.099
sc-0.097
Token.
Token ID13
Feature activation-6.59e-01

Pos logit contributions
lee+0.154
 mad+0.152
 glad+0.149
aming+0.144
illy+0.144
Neg logit contributions
Ready-0.140
 hunt-0.132
 stairs-0.128
sc-0.127
-0.126
Token She
Token ID1375
Feature activation+5.29e+00

Pos logit contributions
Ready+0.014
 hunt+0.013
Circ+0.012
 shortcut+0.012
 stairs+0.012
Neg logit contributions
lee-0.018
 glad-0.018
 years-0.017
 mad-0.017
heart-0.017
Token wanted
Token ID2227
Feature activation+4.05e+00

Pos logit contributions
 mad+0.123
lee+0.122
 glad+0.120
 years+0.118
heart+0.116
Neg logit contributions
Ready-0.131
 hunt-0.122
Circ-0.119
 stairs-0.119
Find-0.116
Token't
Token ID470
Feature activation+1.08e+01

Pos logit contributions
 mad+0.409
lee+0.407
 glad+0.404
heart+0.399
 years+0.398
Neg logit contributions
Ready-0.219
 hunt-0.196
Circ-0.195
 stairs-0.194
 gra-0.178
Token cry
Token ID3960
Feature activation+1.24e+01

Pos logit contributions
lee+0.239
 years+0.227
years+0.225
 glad+0.225
 mad+0.222
Neg logit contributions
Ready-0.283
 hunt-0.272
 ride-0.259
 shortcut-0.256
Find-0.256
Token,
Token ID11
Feature activation+1.06e+01

Pos logit contributions
years+0.307
aming+0.306
lee+0.302
 years+0.298
 pleased+0.285
Neg logit contributions
Ready-0.260
 hunt-0.255
ro-0.254
-0.252
 ride-0.248
Token boys
Token ID6510
Feature activation+5.43e+00

Pos logit contributions
 mad+0.258
lee+0.257
 glad+0.255
 years+0.249
heart+0.244
Neg logit contributions
Ready-0.254
 hunt-0.232
Circ-0.231
 stairs-0.229
sc-0.224
Token.
Token ID13
Feature activation+2.36e+00

Pos logit contributions
lee+0.139
years+0.136
 mad+0.132
 glad+0.132
 years+0.131
Neg logit contributions
Ready-0.119
 hunt-0.115
-0.109
 stairs-0.108
sc-0.108
Token Acc
Token ID6366
Feature activation+2.23e+01

Pos logit contributions
lee+0.066
 mad+0.065
 glad+0.064
 years+0.064
heart+0.062
Neg logit contributions
Ready-0.048
 hunt-0.043
Circ-0.043
 stairs-0.042
Find-0.041
Tokenidents
Token ID3231
Feature activation+1.28e+01

Pos logit contributions
 years+0.610
years+0.607
aming+0.572
aky+0.567
 glad+0.565
Neg logit contributions
Ready-0.467
 hunt-0.427
ro-0.412
-0.409
con-0.405
Token happen
Token ID1645
Feature activation+3.22e+00

Pos logit contributions
 mad+0.350
 glad+0.337
lee+0.336
 years+0.331
heart+0.327
Neg logit contributions
Ready-0.279
Circ-0.257
 hunt-0.250
Find-0.243
 gra-0.242
Token.
Token ID13
Feature activation+8.98e-01

Pos logit contributions
years+0.112
lee+0.105
 glad+0.100
illy+0.098
aming+0.098
Neg logit contributions
-0.045
Ready-0.044
 hunt-0.043
sc-0.041
 stairs-0.039
Token I
Token ID314
Feature activation-6.42e-01

Pos logit contributions
 mad+0.027
lee+0.027
 years+0.027
illy+0.026
years+0.026
Neg logit contributions
Ready-0.017
-0.014
Circ-0.014
 hunt-0.014
 stairs-0.014
Token have
Token ID423
Feature activation+7.42e-02

Pos logit contributions
Ready+0.013
 hunt+0.012
Circ+0.011
 stairs+0.011
Find+0.011
Neg logit contributions
lee-0.018
 mad-0.018
 years-0.018
 glad-0.017
heart-0.017
Token.
Token ID13
Feature activation+2.40e+00

Pos logit contributions
lee+0.194
 mad+0.192
 glad+0.187
 years+0.187
heart+0.181
Neg logit contributions
Ready-0.240
 hunt-0.226
 stairs-0.220
-0.219
Circ-0.218
Token She
Token ID1375
Feature activation+8.06e+00

Pos logit contributions
lee+0.059
 glad+0.058
 mad+0.055
 years+0.055
heart+0.054
Neg logit contributions
Ready-0.056
 hunt-0.056
Circ-0.054
 shortcut-0.053
 stairs-0.052
Token was
Token ID373
Feature activation+7.14e+00

Pos logit contributions
lee+0.177
 mad+0.174
 glad+0.171
 years+0.171
heart+0.163
Neg logit contributions
Ready-0.213
 hunt-0.196
Circ-0.196
 stairs-0.193
Find-0.189
Token three
Token ID1115
Feature activation+8.56e+00

Pos logit contributions
lee+0.127
heart+0.124
illy+0.121
 years+0.117
ushing+0.115
Neg logit contributions
Ready-0.205
 stairs-0.197
 gra-0.194
 hunt-0.189
Find-0.188
Token-
Token ID12
Feature activation+4.13e+00

Pos logit contributions
Acc+0.115
illy+0.110
 ok+0.101
inch+0.100
 okay+0.097
Neg logit contributions
 stairs-0.276
 hills-0.265
 aisle-0.263
sc-0.252
 steps-0.250
Tokenyears
Token ID19002
Feature activation+2.22e+01

Pos logit contributions
lee+0.035
 mad+0.030
 glad+0.029
 okay+0.029
 ok+0.028
Neg logit contributions
Ready-0.163
Circ-0.154
 stairs-0.154
Find-0.152
sc-0.152
Token-
Token ID12
Feature activation+3.19e+00

Pos logit contributions
 glad+0.419
 mad+0.417
 ok+0.414
 okay+0.403
 alright+0.389
Neg logit contributions
 stairs-0.599
Circ-0.576
 hunt-0.575
con-0.575
ro-0.566
Tokenold
Token ID727
Feature activation+6.06e+00

Pos logit contributions
 mad+0.056
lee+0.055
 glad+0.052
 years+0.051
 ok+0.051
Neg logit contributions
Ready-0.098
Circ-0.092
 stairs-0.091
 hunt-0.090
Find-0.088
Token and
Token ID290
Feature activation+6.12e+00

Pos logit contributions
lee+0.137
 mad+0.136
 glad+0.133
aming+0.129
illy+0.128
Neg logit contributions
Ready-0.146
 hunt-0.139
 stairs-0.137
-0.134
sc-0.133
Token she
Token ID673
Feature activation+2.51e+00

Pos logit contributions
lee+0.118
 mad+0.118
 years+0.113
heart+0.111
 glad+0.110
Neg logit contributions
Ready-0.174
 hunt-0.163
 stairs-0.161
Circ-0.158
Find-0.156
Token loved
Token ID6151
Feature activation+7.10e+00

Pos logit contributions
lee+0.059
 mad+0.058
 glad+0.056
 years+0.056
heart+0.055
Neg logit contributions
Ready-0.062
 hunt-0.058
 stairs-0.056
Circ-0.056
Find-0.055
Token and
Token ID290
Feature activation+9.77e+00

Pos logit contributions
 mad+0.250
 glad+0.247
lee+0.246
 years+0.236
heart+0.233
Neg logit contributions
Ready-0.271
Circ-0.254
 hunt-0.250
 stairs-0.242
Find-0.242
Token listening
Token ID8680
Feature activation+8.33e+00

Pos logit contributions
 mad+0.214
 glad+0.211
lee+0.210
 years+0.203
heart+0.198
Neg logit contributions
Ready-0.259
Circ-0.241
 hunt-0.239
 stairs-0.234
Find-0.231
Token.
Token ID13
Feature activation+2.82e+00

Pos logit contributions
lee+0.205
 mad+0.203
 years+0.200
 glad+0.200
heart+0.192
Neg logit contributions
Ready-0.200
 hunt-0.180
Circ-0.180
Find-0.178
 stairs-0.175
Token<|endoftext|>
Token ID50256
Feature activation-4.83e+00

Pos logit contributions
lee+0.074
 glad+0.072
 mad+0.069
 years+0.069
heart+0.068
Neg logit contributions
Ready-0.060
 hunt-0.060
 shortcut-0.057
Circ-0.056
 stairs-0.056
TokenM
Token ID44
Feature activation+1.15e+01

Pos logit contributions
 hunt+0.109
Find+0.108
Ready+0.107
 shortcut+0.105
tr+0.099
Neg logit contributions
 years-0.108
 glad-0.104
lee-0.101
 brothers-0.101
illy-0.099
Tokenomm
Token ID2002
Feature activation+2.14e+01

Pos logit contributions
 mad+0.188
lee+0.181
 glad+0.180
years+0.175
 ok+0.174
Neg logit contributions
-0.341
Ready-0.338
 stairs-0.326
 hunt-0.324
 gra-0.319
Tokena
Token ID64
Feature activation+9.84e+00

Pos logit contributions
lee+0.292
 ok+0.279
 glad+0.268
 years+0.264
 okay+0.260
Neg logit contributions
-0.674
 gra-0.613
 aisle-0.606
 hunt-0.602
 stairs-0.591
Token and
Token ID290
Feature activation+7.14e+00

Pos logit contributions
lee+0.168
 mad+0.165
 years+0.161
 glad+0.161
heart+0.152
Neg logit contributions
Ready-0.308
 hunt-0.287
Circ-0.285
 stairs-0.284
Find-0.279
Token Billy
Token ID15890
Feature activation+7.10e+00

Pos logit contributions
lee+0.128
 mad+0.128
 glad+0.125
 years+0.121
heart+0.117
Neg logit contributions
Ready-0.218
 hunt-0.203
 stairs-0.201
Circ-0.201
sc-0.197
Token were
Token ID547
Feature activation+5.10e+00

Pos logit contributions
lee+0.174
 mad+0.173
 glad+0.166
 years+0.165
illy+0.160
Neg logit contributions
Ready-0.170
Circ-0.157
 stairs-0.156
Find-0.152
sc-0.150
Token walking
Token ID6155
Feature activation-3.03e+00

Pos logit contributions
heart+0.134
ious+0.131
ity+0.130
lee+0.122
ushing+0.121
Neg logit contributions
-0.106
 stairs-0.096
 aisle-0.093
 hunt-0.088
ground-0.088
Token work
Token ID670
Feature activation+3.26e+00

Pos logit contributions
lee+0.329
 mad+0.326
 years+0.317
heart+0.314
 glad+0.298
Neg logit contributions
Ready-0.339
Circ-0.304
Find-0.301
ground-0.290
-0.290
Token,
Token ID11
Feature activation+8.61e+00

Pos logit contributions
 years+0.066
 mad+0.066
lee+0.065
 glad+0.062
heart+0.061
Neg logit contributions
Ready-0.093
Circ-0.088
 hunt-0.085
Find-0.085
 gra-0.084
Token bravery
Token ID37808
Feature activation+6.05e+00

Pos logit contributions
lee+0.173
 mad+0.171
 glad+0.168
 years+0.167
heart+0.160
Neg logit contributions
Ready-0.242
 hunt-0.225
Circ-0.223
 stairs-0.221
Find-0.216
Token and
Token ID290
Feature activation+7.78e+00

Pos logit contributions
lee+0.114
 years+0.113
 mad+0.112
heart+0.110
 glad+0.109
Neg logit contributions
Ready-0.180
Circ-0.174
Find-0.166
 hunt-0.161
 gra-0.161
Token cle
Token ID1190
Feature activation+1.23e+01

Pos logit contributions
 mad+0.149
 glad+0.147
 years+0.144
lee+0.144
 careful+0.140
Neg logit contributions
Ready-0.226
Circ-0.217
 hunt-0.212
-0.210
ground-0.209
Tokenvern
Token ID933
Feature activation+2.09e+01

Pos logit contributions
lee+0.257
 mad+0.255
 glad+0.252
 years+0.250
heart+0.241
Neg logit contributions
Ready-0.337
 hunt-0.313
Circ-0.311
 stairs-0.306
Find-0.300
Tokeness
Token ID408
Feature activation+6.86e+00

Pos logit contributions
lee+0.524
 mad+0.523
 glad+0.510
 years+0.507
years+0.484
Neg logit contributions
Ready-0.482
 hunt-0.443
 stairs-0.440
Circ-0.439
sc-0.433
Token!
Token ID0
Feature activation+1.90e-01

Pos logit contributions
lee+0.142
heart+0.138
 years+0.137
 mad+0.134
 glad+0.134
Neg logit contributions
Ready-0.197
Circ-0.185
Find-0.177
 stairs-0.175
 hunt-0.171
Token From
Token ID3574
Feature activation+1.01e+01

Pos logit contributions
lee+0.005
 glad+0.005
heart+0.005
 years+0.005
illy+0.005
Neg logit contributions
 hunt-0.004
Ready-0.004
 shortcut-0.004
 stairs-0.004
Circ-0.004
Token now
Token ID783
Feature activation+7.54e+00

Pos logit contributions
lee+0.180
 mad+0.177
heart+0.171
 glad+0.168
 ok+0.163
Neg logit contributions
Ready-0.302
 hunt-0.281
-0.281
 stairs-0.275
ground-0.273
Token on
Token ID319
Feature activation-9.08e+00

Pos logit contributions
 mad+0.186
 years+0.184
 glad+0.184
lee+0.179
 careful+0.177
Neg logit contributions
Ready-0.174
Circ-0.166
 hunt-0.162
 stairs-0.160
 gra-0.156
Token She
Token ID1375
Feature activation+5.23e+00

Pos logit contributions
Ready+0.044
 hunt+0.040
Circ+0.040
 stairs+0.039
ground+0.039
Neg logit contributions
lee-0.046
 mad-0.045
 glad-0.044
 years-0.044
heart-0.043
Token was
Token ID373
Feature activation+5.02e+00

Pos logit contributions
heart+0.114
lee+0.112
 mad+0.109
years+0.105
 ok+0.105
Neg logit contributions
Ready-0.141
 hunt-0.133
-0.131
 ride-0.130
 stairs-0.127
Token only
Token ID691
Feature activation+9.33e+00

Pos logit contributions
lee+0.125
heart+0.122
illy+0.113
 years+0.112
ious+0.112
Neg logit contributions
Ready-0.116
 gra-0.113
 ride-0.112
 stairs-0.110
Find-0.110
Token three
Token ID1115
Feature activation+6.48e+00

Pos logit contributions
lee+0.234
 years+0.226
 mad+0.222
heart+0.217
 glad+0.217
Neg logit contributions
Ready-0.219
Circ-0.197
 stairs-0.197
 hunt-0.195
Find-0.192
Token-
Token ID12
Feature activation+3.15e+00

Pos logit contributions
 ok+0.091
illy+0.084
Acc+0.079
 alright+0.079
 okay+0.076
Neg logit contributions
 stairs-0.204
 aisle-0.198
 hills-0.194
-0.191
 hunt-0.191
Tokenyears
Token ID19002
Feature activation+2.04e+01

Pos logit contributions
lee+0.023
 mad+0.020
 glad+0.020
 years+0.019
 ok+0.019
Neg logit contributions
Ready-0.130
Find-0.122
Circ-0.122
sc-0.121
 stairs-0.120
Token-
Token ID12
Feature activation+3.12e+00

Pos logit contributions
 ok+0.405
 mad+0.396
 glad+0.394
 alright+0.387
 okay+0.376
Neg logit contributions
 stairs-0.530
 hunt-0.519
Circ-0.515
 ride-0.508
ro-0.504
Tokenold
Token ID727
Feature activation+5.61e+00

Pos logit contributions
lee+0.060
 ok+0.059
 mad+0.059
 alright+0.056
 glad+0.056
Neg logit contributions
Ready-0.087
Circ-0.082
 stairs-0.080
con-0.079
 hunt-0.079
Token,
Token ID11
Feature activation+5.52e+00

Pos logit contributions
aming+0.158
lee+0.154
 pleased+0.154
 ok+0.152
ivering+0.151
Neg logit contributions
-0.097
 hunt-0.095
Ready-0.095
 hills-0.091
 stairs-0.091
Token but
Token ID475
Feature activation+4.61e+00

Pos logit contributions
lee+0.140
 mad+0.133
heart+0.130
years+0.127
 glad+0.126
Neg logit contributions
Ready-0.129
 hunt-0.123
 stairs-0.119
 ride-0.116
Circ-0.114
Token she
Token ID673
Feature activation+3.08e+00

Pos logit contributions
lee+0.143
ivering+0.138
heart+0.137
 mad+0.131
ious+0.131
Neg logit contributions
Ready-0.082
 stairs-0.074
-0.071
 ride-0.068
 hunt-0.067
Token He
Token ID679
Feature activation+4.04e+00

Pos logit contributions
Ready+0.029
 hunt+0.027
Circ+0.026
 stairs+0.026
Find+0.026
Neg logit contributions
lee-0.032
 mad-0.031
 glad-0.031
 years-0.031
heart-0.030
Token was
Token ID373
Feature activation+4.36e+00

Pos logit contributions
lee+0.087
heart+0.086
 mad+0.085
 glad+0.082
illy+0.080
Neg logit contributions
Ready-0.109
 hunt-0.104
-0.102
 ride-0.100
Circ-0.100
Token only
Token ID691
Feature activation+8.72e+00

Pos logit contributions
lee+0.105
heart+0.103
illy+0.097
 years+0.095
ushing+0.095
Neg logit contributions
Ready-0.105
 gra-0.101
 ride-0.100
 stairs-0.099
Find-0.098
Token 3
Token ID513
Feature activation+6.32e+00

Pos logit contributions
lee+0.194
 mad+0.190
 years+0.187
 glad+0.186
heart+0.180
Neg logit contributions
Ready-0.229
 hunt-0.211
Circ-0.209
 stairs-0.208
Find-0.203
Token-
Token ID12
Feature activation+2.31e+00

Pos logit contributions
 ok+0.087
Acc+0.082
 alright+0.078
illy+0.077
 okay+0.076
Neg logit contributions
 stairs-0.199
 hunt-0.195
 bars-0.194
 aisle-0.193
 hills-0.193
Tokenyears
Token ID19002
Feature activation+2.03e+01

Pos logit contributions
lee+0.022
 mad+0.021
 ok+0.020
 glad+0.019
heart+0.019
Neg logit contributions
Ready-0.087
Circ-0.083
Find-0.083
 stairs-0.081
sc-0.081
Token-
Token ID12
Feature activation+2.06e+00

Pos logit contributions
 mad+0.364
 glad+0.361
 ok+0.354
lee+0.341
 okay+0.333
Neg logit contributions
 stairs-0.567
 hunt-0.561
Circ-0.561
Ready-0.540
 ride-0.538
Tokenold
Token ID727
Feature activation+4.51e+00

Pos logit contributions
 mad+0.040
lee+0.039
 ok+0.037
 glad+0.037
heart+0.037
Neg logit contributions
Ready-0.059
Circ-0.056
 hunt-0.055
 stairs-0.054
 ride-0.053
Token.
Token ID13
Feature activation-2.31e+00

Pos logit contributions
aming+0.126
lee+0.123
 mad+0.121
ivering+0.121
 glad+0.120
Neg logit contributions
 hunt-0.082
-0.077
Ready-0.077
 ride-0.076
 stairs-0.075
Token He
Token ID679
Feature activation+3.34e+00

Pos logit contributions
Ready+0.049
 hunt+0.045
sc+0.044
Circ+0.043
Find+0.043
Neg logit contributions
lee-0.063
 glad-0.062
 years-0.061
 mad-0.060
illy-0.057
Token had
Token ID550
Feature activation-2.01e+00

Pos logit contributions
lee+0.080
 mad+0.078
heart+0.077
 glad+0.076
 pleased+0.074
Neg logit contributions
Ready-0.083
 hunt-0.078
Circ-0.075
-0.075
 stairs-0.074
TokenIt
Token ID1026
Feature activation-1.01e+00

Pos logit contributions
 hunt+0.070
 shortcut+0.067
 stairs+0.066
Ready+0.065
 gra+0.064
Neg logit contributions
 glad-0.072
 years-0.070
 mad-0.070
aming-0.068
lee-0.067
Token's
Token ID338
Feature activation+5.77e+00

Pos logit contributions
Ready+0.025
 hunt+0.023
Circ+0.022
 stairs+0.022
+0.022
Neg logit contributions
lee-0.025
 years-0.024
 mad-0.024
 glad-0.024
years-0.023
Token okay
Token ID8788
Feature activation+1.04e+01

Pos logit contributions
lee+0.113
years+0.108
ious+0.097
ivering+0.093
itive+0.092
Neg logit contributions
Ready-0.166
-0.163
tr-0.150
 ride-0.149
 Ready-0.146
Token,
Token ID11
Feature activation+6.67e+00

Pos logit contributions
lee+0.299
aming+0.296
years+0.295
 pleased+0.289
 impressed+0.286
Neg logit contributions
-0.190
 hunt-0.182
 stairs-0.179
Ready-0.172
 visit-0.171
Token my
Token ID616
Feature activation+6.42e+00

Pos logit contributions
years+0.156
lee+0.154
aming+0.150
 years+0.144
 mad+0.142
Neg logit contributions
Ready-0.159
 stairs-0.156
 hunt-0.148
 ride-0.148
 aisle-0.142
Token de
Token ID390
Feature activation+2.03e+01

Pos logit contributions
lee+0.129
years+0.125
aming+0.116
 mad+0.116
 years+0.114
Neg logit contributions
Ready-0.178
 ride-0.174
 stairs-0.169
-0.169
 hunt-0.168
Tokenars
Token ID945
Feature activation-3.12e+00

Pos logit contributions
 mad+0.562
 glad+0.554
 years+0.554
lee+0.550
heart+0.527
Neg logit contributions
Ready-0.415
 hunt-0.376
Circ-0.375
 stairs-0.374
Find-0.357
Token.
Token ID13
Feature activation-1.44e-01

Pos logit contributions
Ready+0.082
Circ+0.076
 hunt+0.075
 stairs+0.074
Find+0.073
Neg logit contributions
 mad-0.069
lee-0.069
 glad-0.067
 years-0.067
heart-0.064
Token I
Token ID314
Feature activation-1.02e+00

Pos logit contributions
Ready+0.003
 hunt+0.003
Circ+0.003
 stairs+0.003
 shortcut+0.003
Neg logit contributions
 glad-0.004
lee-0.004
 mad-0.004
 years-0.004
heart-0.003
Token know
Token ID760
Feature activation+7.04e+00

Pos logit contributions
Ready+0.022
 hunt+0.020
Circ+0.020
+0.019
Find+0.019
Neg logit contributions
lee-0.028
 years-0.027
 mad-0.027
 glad-0.027
heart-0.026
Token you
Token ID345
Feature activation+4.11e+00

Pos logit contributions
lee+0.150
 mad+0.147
 glad+0.144
 years+0.142
heart+0.138
Neg logit contributions
Ready-0.190
 hunt-0.176
 stairs-0.174
Circ-0.172
-0.169
Token.
Token ID13
Feature activation-2.09e+00

Pos logit contributions
lee+0.155
 mad+0.154
 glad+0.151
 years+0.148
heart+0.146
Neg logit contributions
Ready-0.143
 hunt-0.135
 stairs-0.130
Circ-0.130
-0.128
Token She
Token ID1375
Feature activation+4.89e+00

Pos logit contributions
Ready+0.045
 stairs+0.042
 hunt+0.042
+0.041
Circ+0.041
Neg logit contributions
 mad-0.055
lee-0.054
 glad-0.054
 years-0.053
years-0.052
Token was
Token ID373
Feature activation+4.71e+00

Pos logit contributions
lee+0.098
heart+0.095
 mad+0.094
years+0.090
 glad+0.090
Neg logit contributions
Ready-0.140
 hunt-0.133
-0.131
 stairs-0.128
 ride-0.128
Token three
Token ID1115
Feature activation+6.43e+00

Pos logit contributions
lee+0.110
heart+0.104
illy+0.098
ious+0.096
 years+0.096
Neg logit contributions
Ready-0.117
 stairs-0.114
 gra-0.113
 ride-0.112
Find-0.112
Token-
Token ID12
Feature activation+2.92e+00

Pos logit contributions
 ok+0.107
 alright+0.095
Acc+0.094
illy+0.094
 okay+0.090
Neg logit contributions
 stairs-0.187
 aisle-0.185
 hunt-0.179
 hills-0.178
 hill-0.178
Tokenyears
Token ID19002
Feature activation+2.02e+01

Pos logit contributions
lee+0.025
 ok+0.021
 mad+0.021
 years+0.021
heart+0.020
Neg logit contributions
Ready-0.114
Circ-0.109
Find-0.109
 stairs-0.108
sc-0.107
Token-
Token ID12
Feature activation+2.90e+00

Pos logit contributions
 ok+0.415
 mad+0.399
 alright+0.395
 glad+0.393
 okay+0.379
Neg logit contributions
 stairs-0.514
 hunt-0.504
Circ-0.499
 ride-0.496
br-0.484
Tokenold
Token ID727
Feature activation+5.29e+00

Pos logit contributions
lee+0.056
 ok+0.056
 mad+0.054
 alright+0.052
heart+0.052
Neg logit contributions
Ready-0.080
Circ-0.077
 stairs-0.075
set-0.073
Find-0.073
Token and
Token ID290
Feature activation+6.04e+00

Pos logit contributions
aming+0.144
 ok+0.143
irling+0.143
 pleased+0.142
 mad+0.140
Neg logit contributions
-0.094
 hunt-0.092
 hills-0.090
 stairs-0.089
Circ-0.086
Token loved
Token ID6151
Feature activation+7.49e+00

Pos logit contributions
lee+0.137
heart+0.130
 mad+0.123
illy+0.121
 mistakes+0.119
Neg logit contributions
Ready-0.152
-0.143
 stairs-0.142
 ride-0.141
 hunt-0.140
Token to
Token ID284
Feature activation-8.03e+00

Pos logit contributions
 mad+0.215
lee+0.214
 years+0.213
 glad+0.212
years+0.201
Neg logit contributions
Ready-0.155
sc-0.135
ground-0.132
 hunt-0.130
 shortcut-0.127
Token She
Token ID1375
Feature activation+4.86e+00

Pos logit contributions
Ready+0.041
 stairs+0.039
+0.037
Circ+0.037
 hunt+0.036
Neg logit contributions
 mad-0.055
 years-0.053
 glad-0.053
lee-0.053
heart-0.051
Token was
Token ID373
Feature activation+4.65e+00

Pos logit contributions
heart+0.112
lee+0.108
 mad+0.107
 ok+0.103
years+0.102
Neg logit contributions
Ready-0.125
 hunt-0.121
 ride-0.117
-0.117
 stairs-0.115
Token only
Token ID691
Feature activation+8.97e+00

Pos logit contributions
lee+0.103
heart+0.101
illy+0.094
ious+0.092
ushing+0.091
Neg logit contributions
Ready-0.121
 stairs-0.115
 gra-0.115
Find-0.115
 ride-0.114
Token three
Token ID1115
Feature activation+6.15e+00

Pos logit contributions
lee+0.205
 mad+0.200
 years+0.198
 glad+0.195
heart+0.191
Neg logit contributions
Ready-0.230
 hunt-0.211
Circ-0.210
 stairs-0.209
Find-0.203
Token-
Token ID12
Feature activation+2.68e+00

Pos logit contributions
 ok+0.078
Acc+0.073
illy+0.072
 glad+0.067
aming+0.067
Neg logit contributions
 stairs-0.204
 aisle-0.198
 hills-0.192
 hunt-0.192
-0.186
Tokenyears
Token ID19002
Feature activation+2.00e+01

Pos logit contributions
lee+0.018
 mad+0.018
 glad+0.017
 ok+0.015
 years+0.015
Neg logit contributions
Ready-0.112
Circ-0.106
Find-0.105
 stairs-0.104
sc-0.103
Token-
Token ID12
Feature activation+2.67e+00

Pos logit contributions
 ok+0.392
 mad+0.389
 glad+0.385
 alright+0.373
aming+0.364
Neg logit contributions
 stairs-0.531
 hunt-0.523
Circ-0.517
con-0.501
 ride-0.501
Tokenold
Token ID727
Feature activation+5.28e+00

Pos logit contributions
 mad+0.050
lee+0.048
 ok+0.048
 glad+0.046
heart+0.046
Neg logit contributions
Ready-0.077
Circ-0.074
 stairs-0.072
 hunt-0.071
con-0.070
Token,
Token ID11
Feature activation+5.29e+00

Pos logit contributions
aming+0.141
 pleased+0.138
irling+0.137
 mad+0.135
ivering+0.135
Neg logit contributions
-0.099
 stairs-0.096
 hunt-0.096
Ready-0.094
 hills-0.094
Token but
Token ID475
Feature activation+4.36e+00

Pos logit contributions
lee+0.148
 mad+0.139
heart+0.137
years+0.136
illy+0.131
Neg logit contributions
Ready-0.108
 hunt-0.107
 stairs-0.104
 ride-0.103
Circ-0.095
Token she
Token ID673
Feature activation+2.73e+00

Pos logit contributions
lee+0.146
ivering+0.143
heart+0.136
ious+0.136
ushing+0.135
Neg logit contributions
 stairs-0.063
Ready-0.063
-0.059
 aisle-0.057
 ride-0.052
Token.
Token ID13
Feature activation-2.22e+00

Pos logit contributions
Ready+0.154
Circ+0.141
Find+0.136
 gra+0.131
 shortcut+0.129
Neg logit contributions
 years-0.099
lee-0.091
 mad-0.091
 glad-0.089
illy-0.086
Token She
Token ID1375
Feature activation+4.93e+00

Pos logit contributions
Ready+0.060
 hunt+0.055
Circ+0.054
 stairs+0.054
sc+0.053
Neg logit contributions
lee-0.049
 glad-0.047
 mad-0.046
 years-0.046
heart-0.045
Token was
Token ID373
Feature activation+4.66e+00

Pos logit contributions
 mad+0.106
lee+0.105
heart+0.102
 glad+0.102
 years+0.100
Neg logit contributions
Ready-0.131
 hunt-0.124
Circ-0.120
 stairs-0.120
-0.118
Token three
Token ID1115
Feature activation+6.07e+00

Pos logit contributions
heart+0.092
lee+0.091
illy+0.087
 years+0.085
ushing+0.084
Neg logit contributions
Ready-0.128
 gra-0.124
 stairs-0.122
 ride-0.121
Find-0.120
Token-
Token ID12
Feature activation+2.49e+00

Pos logit contributions
Acc+0.091
 ok+0.088
 okay+0.080
 alright+0.078
 mad+0.074
Neg logit contributions
 hills-0.186
 hunt-0.185
 stairs-0.183
 aisle-0.183
 ride-0.179
Tokenyears
Token ID19002
Feature activation+2.00e+01

Pos logit contributions
 mad+0.020
lee+0.020
 glad+0.018
 years+0.018
 ok+0.018
Neg logit contributions
Ready-0.100
Circ-0.095
Find-0.093
sc-0.093
 stairs-0.093
Token-
Token ID12
Feature activation+2.49e+00

Pos logit contributions
 mad+0.387
 ok+0.385
 glad+0.372
 alright+0.366
 okay+0.360
Neg logit contributions
 hunt-0.529
 stairs-0.526
Circ-0.523
 ride-0.510
 hills-0.501
Tokenold
Token ID727
Feature activation+5.15e+00

Pos logit contributions
 mad+0.047
 ok+0.045
lee+0.044
 glad+0.043
 okay+0.043
Neg logit contributions
Ready-0.073
Circ-0.070
 stairs-0.067
 hunt-0.067
sc-0.066
Token.
Token ID13
Feature activation-1.94e+00

Pos logit contributions
aming+0.130
 mad+0.130
irling+0.127
 pleased+0.125
ivering+0.125
Neg logit contributions
-0.101
 hunt-0.098
 hills-0.097
Ready-0.095
 stairs-0.095
Token Every
Token ID3887
Feature activation+2.17e-01

Pos logit contributions
Ready+0.040
 hunt+0.036
Circ+0.035
 shortcut+0.035
 stairs+0.035
Neg logit contributions
lee-0.055
 glad-0.054
 years-0.052
heart-0.051
 mad-0.051
Token day
Token ID1110
Feature activation-4.80e+00

Pos logit contributions
 years+0.005
 mad+0.005
 glad+0.005
lee+0.005
heart+0.005
Neg logit contributions
Ready-0.006
 hunt-0.005
Find-0.005
Circ-0.005
 stairs-0.005
Token He
Token ID679
Feature activation+4.62e+00

Pos logit contributions
Ready+0.045
 hunt+0.042
sc+0.041
 shortcut+0.040
ground+0.040
Neg logit contributions
lee-0.036
 mad-0.034
 years-0.033
 glad-0.033
 enthusiastic-0.032
Token was
Token ID373
Feature activation+4.73e+00

Pos logit contributions
lee+0.096
 mad+0.094
 glad+0.093
 years+0.092
heart+0.089
Neg logit contributions
Ready-0.128
 hunt-0.119
Circ-0.117
 stairs-0.117
Find-0.114
Token only
Token ID691
Feature activation+9.08e+00

Pos logit contributions
heart+0.082
lee+0.081
illy+0.079
 years+0.074
ious+0.074
Neg logit contributions
Ready-0.144
 stairs-0.139
 hunt-0.138
 shortcut-0.136
Find-0.134
Token three
Token ID1115
Feature activation+6.19e+00

Pos logit contributions
lee+0.190
 mad+0.187
 glad+0.184
 years+0.183
heart+0.177
Neg logit contributions
Ready-0.250
 hunt-0.232
Circ-0.229
 stairs-0.228
Find-0.223
Token-
Token ID12
Feature activation+2.55e+00

Pos logit contributions
illy+0.098
Acc+0.092
 ok+0.091
 glad+0.088
aming+0.086
Neg logit contributions
 stairs-0.192
 hills-0.189
 hunt-0.182
 hill-0.180
 aisle-0.178
Tokenyears
Token ID19002
Feature activation+1.99e+01

Pos logit contributions
lee+0.034
 mad+0.032
 glad+0.032
 years+0.032
heart+0.030
Neg logit contributions
Ready-0.091
Circ-0.085
 hunt-0.085
 stairs-0.084
Find-0.083
Token-
Token ID12
Feature activation+2.54e+00

Pos logit contributions
 glad+0.389
 ok+0.381
aming+0.373
 mad+0.372
years+0.367
Neg logit contributions
 stairs-0.522
 hunt-0.520
ro-0.502
con-0.497
Circ-0.495
Tokenold
Token ID727
Feature activation+5.24e+00

Pos logit contributions
lee+0.048
 mad+0.047
 glad+0.046
 years+0.046
 ok+0.045
Neg logit contributions
Ready-0.075
Circ-0.070
 hunt-0.069
 stairs-0.068
sc-0.067
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
lee+0.136
aming+0.135
 glad+0.135
illy+0.134
 pleased+0.132
Neg logit contributions
Ready-0.106
 hunt-0.105
 stairs-0.100
-0.100
 hills-0.097
Token He
Token ID679
Feature activation+4.75e+00

Pos logit contributions
Ready+0.041
 hunt+0.039
sc+0.038
 shortcut+0.038
 gra+0.037
Neg logit contributions
lee-0.034
 years-0.031
 mad-0.031
 glad-0.031
 obedient-0.030
Token thought
Token ID1807
Feature activation+9.34e+00

Pos logit contributions
lee+0.104
 mad+0.102
 glad+0.101
 years+0.100
heart+0.098
Neg logit contributions
Ready-0.125
 hunt-0.117
Circ-0.114
 stairs-0.114
Find-0.111
Token to
Token ID284
Feature activation-7.51e+00

Pos logit contributions
 mad+0.212
 glad+0.201
 careful+0.200
lee+0.199
 okay+0.197
Neg logit contributions
Ready-0.159
ground-0.145
sc-0.141
Circ-0.141
-0.139
Token get
Token ID651
Feature activation+5.33e-01

Pos logit contributions
Ready+0.156
Circ+0.138
sc+0.136
+0.136
ground+0.134
Neg logit contributions
 mad-0.223
 years-0.210
lee-0.206
 glad-0.203
heart-0.203
Token too
Token ID1165
Feature activation+5.64e+00

Pos logit contributions
lee+0.013
 years+0.012
 glad+0.012
 mad+0.012
heart+0.012
Neg logit contributions
Ready-0.013
 hunt-0.012
 stairs-0.012
Circ-0.012
Find-0.011
Token wild
Token ID4295
Feature activation+5.98e+00

Pos logit contributions
lee+0.130
 glad+0.123
 mad+0.123
heart+0.123
 years+0.122
Neg logit contributions
Ready-0.142
 stairs-0.134
 hunt-0.133
Circ-0.131
 shortcut-0.127
Token.
Token ID13
Feature activation-6.68e-01

Pos logit contributions
 mad+0.137
lee+0.136
 years+0.136
heart+0.131
 glad+0.128
Neg logit contributions
Ready-0.155
Find-0.139
Circ-0.139
 hunt-0.139
ground-0.137
Token Id
Token ID5121
Feature activation+1.99e+01

Pos logit contributions
 hunt+0.015
Ready+0.015
Circ+0.014
 shortcut+0.014
 gra+0.014
Neg logit contributions
lee-0.018
 glad-0.017
 mad-0.017
 years-0.016
years-0.016
Tokena
Token ID64
Feature activation+8.00e+00

Pos logit contributions
 glad+0.391
 mad+0.381
 ok+0.380
 years+0.373
lee+0.373
Neg logit contributions
Ready-0.529
Find-0.504
sc-0.500
ground-0.491
-0.489
Token was
Token ID373
Feature activation+5.59e+00

Pos logit contributions
 glad+0.191
lee+0.188
 mad+0.186
heart+0.182
illy+0.181
Neg logit contributions
Ready-0.184
 hunt-0.175
sc-0.168
 stairs-0.166
 ride-0.166
Token so
Token ID523
Feature activation+8.96e+00

Pos logit contributions
heart+0.097
lee+0.095
illy+0.095
ious+0.085
ushing+0.085
Neg logit contributions
Ready-0.162
 ride-0.158
Find-0.155
 stairs-0.154
 hunt-0.153
Token happy
Token ID3772
Feature activation+1.38e+01

Pos logit contributions
lee+0.127
heart+0.123
illy+0.120
 years+0.110
ious+0.107
Neg logit contributions
Ready-0.292
Find-0.280
 ride-0.279
 gra-0.275
-0.272
Token she
Token ID673
Feature activation+3.50e+00

Pos logit contributions
lee+0.306
 mad+0.302
 glad+0.297
 years+0.296
heart+0.286
Neg logit contributions
Ready-0.362
 hunt-0.334
Circ-0.330
 stairs-0.327
Find-0.320
Token it
Token ID340
Feature activation-9.05e-01

Pos logit contributions
Ready+0.112
sc+0.106
Circ+0.102
Find+0.101
ground+0.101
Neg logit contributions
 mad-0.087
lee-0.086
 glad-0.082
 years-0.082
heart-0.079
Token and
Token ID290
Feature activation+8.59e+00

Pos logit contributions
Ready+0.025
Circ+0.023
Find+0.023
 hunt+0.023
 stairs+0.023
Neg logit contributions
 mad-0.019
lee-0.018
 glad-0.018
 years-0.018
heart-0.017
Token said
Token ID531
Feature activation+9.51e+00

Pos logit contributions
lee+0.164
 mad+0.161
 glad+0.157
 pleased+0.147
 impressed+0.144
Neg logit contributions
Ready-0.257
Circ-0.240
 stairs-0.237
 hunt-0.234
ground-0.234
Token,
Token ID11
Feature activation+7.82e+00

Pos logit contributions
 years+0.198
lee+0.197
 mad+0.194
 glad+0.188
heart+0.188
Neg logit contributions
Ready-0.253
 hunt-0.235
 stairs-0.230
Circ-0.230
-0.229
Token "
Token ID366
Feature activation-2.64e+00

Pos logit contributions
 glad+0.222
lee+0.218
 mad+0.216
 pleased+0.209
 ok+0.207
Neg logit contributions
Ready-0.161
 hunt-0.150
Circ-0.148
 stairs-0.145
sc-0.142
TokenE
Token ID36
Feature activation+1.98e+01

Pos logit contributions
 stairs+0.051
 hunt+0.050
 shortcut+0.047
 aisle+0.045
ground+0.045
Neg logit contributions
heart-0.073
 mad-0.073
aming-0.071
 glad-0.070
 years-0.070
Tokenw
Token ID86
Feature activation+6.97e+00

Pos logit contributions
lee+0.448
 glad+0.431
 mad+0.431
 years+0.415
years+0.408
Neg logit contributions
Ready-0.523
 hunt-0.488
Circ-0.477
-0.473
 stairs-0.470
Token,
Token ID11
Feature activation+6.07e+00

Pos logit contributions
years+0.153
 years+0.151
 glad+0.150
 mad+0.146
lee+0.141
Neg logit contributions
Ready-0.163
 hunt-0.162
-0.160
 ride-0.155
br-0.151
Token Mom
Token ID11254
Feature activation+8.27e+00

Pos logit contributions
 mad+0.132
lee+0.132
 glad+0.128
 years+0.125
heart+0.122
Neg logit contributions
Ready-0.164
 hunt-0.151
Circ-0.150
 stairs-0.148
sc-0.147
Tokenmy
Token ID1820
Feature activation+7.86e+00

Pos logit contributions
lee+0.162
 years+0.159
 glad+0.158
years+0.158
 mad+0.156
Neg logit contributions
Ready-0.225
 hunt-0.217
-0.210
 stairs-0.208
 ride-0.206
Token,
Token ID11
Feature activation+5.69e+00

Pos logit contributions
lee+0.162
years+0.159
 years+0.158
 glad+0.156
 mad+0.155
Neg logit contributions
Ready-0.209
 hunt-0.202
-0.193
 stairs-0.190
 ride-0.190
Token.
Token ID13
Feature activation-2.14e+00

Pos logit contributions
 mad+0.176
lee+0.173
 glad+0.171
 years+0.166
heart+0.165
Neg logit contributions
Ready-0.150
 hunt-0.143
 stairs-0.138
Circ-0.136
-0.136
Token Bella
Token ID39304
Feature activation+6.60e+00

Pos logit contributions
Ready+0.053
 hunt+0.049
Circ+0.048
 stairs+0.047
ground+0.047
Neg logit contributions
lee-0.051
 glad-0.049
 mad-0.048
 years-0.048
heart-0.047
Token was
Token ID373
Feature activation+3.93e+00

Pos logit contributions
 mad+0.153
lee+0.149
heart+0.148
 glad+0.148
years+0.148
Neg logit contributions
Ready-0.158
 hunt-0.154
-0.152
 stairs-0.148
 ride-0.145
Token three
Token ID1115
Feature activation+5.63e+00

Pos logit contributions
lee+0.109
heart+0.107
ious+0.098
illy+0.098
ams+0.097
Neg logit contributions
 stairs-0.076
Ready-0.076
-0.076
 gra-0.075
 ride-0.073
Token-
Token ID12
Feature activation+2.06e+00

Pos logit contributions
 ok+0.094
Acc+0.088
 alright+0.083
 okay+0.080
illy+0.079
Neg logit contributions
 stairs-0.165
 aisle-0.162
 hills-0.156
 bench-0.155
 hill-0.154
Tokenyears
Token ID19002
Feature activation+1.98e+01

Pos logit contributions
lee+0.018
 ok+0.018
 mad+0.017
 okay+0.016
ious+0.016
Neg logit contributions
Ready-0.078
 stairs-0.075
Find-0.074
Circ-0.074
sc-0.073
Token-
Token ID12
Feature activation+1.89e+00

Pos logit contributions
 ok+0.388
 mad+0.377
 glad+0.375
 alright+0.367
 okay+0.362
Neg logit contributions
 stairs-0.523
 hunt-0.507
Circ-0.505
 ride-0.498
con-0.494
Tokenold
Token ID727
Feature activation+4.43e+00

Pos logit contributions
 mad+0.036
lee+0.036
 ok+0.036
heart+0.034
 okay+0.034
Neg logit contributions
Ready-0.052
Circ-0.050
 stairs-0.050
 hunt-0.049
 ride-0.048
Token and
Token ID290
Feature activation+4.70e+00

Pos logit contributions
aming+0.120
 mad+0.117
 pleased+0.117
lee+0.115
irling+0.115
Neg logit contributions
-0.080
 hunt-0.080
 stairs-0.076
 hills-0.075
ro-0.075
Token very
Token ID845
Feature activation+1.10e+01

Pos logit contributions
lee+0.100
 mad+0.099
heart+0.098
 mistakes+0.093
ivering+0.093
Neg logit contributions
Ready-0.119
-0.114
 hunt-0.114
 stairs-0.113
 ride-0.113
Token excited
Token ID6568
Feature activation+9.80e+00

Pos logit contributions
heart+0.178
illy+0.176
lee+0.176
aming+0.173
 years+0.168
Neg logit contributions
 stairs-0.323
Ready-0.323
 ride-0.317
-0.313
 hunt-0.310
Token She
Token ID1375
Feature activation+4.43e+00

Pos logit contributions
Ready+0.039
Circ+0.036
 hunt+0.036
 stairs+0.036
+0.036
Neg logit contributions
 mad-0.047
lee-0.046
 years-0.045
 glad-0.045
 pleased-0.043
Token was
Token ID373
Feature activation+3.85e+00

Pos logit contributions
lee+0.092
heart+0.091
 mad+0.090
 ok+0.086
years+0.084
Neg logit contributions
Ready-0.122
 hunt-0.117
-0.115
 stairs-0.113
Circ-0.113
Token only
Token ID691
Feature activation+8.18e+00

Pos logit contributions
lee+0.090
heart+0.089
ious+0.081
illy+0.081
ushing+0.081
Neg logit contributions
Ready-0.093
 stairs-0.092
 gra-0.091
Find-0.090
 ride-0.089
Token three
Token ID1115
Feature activation+5.38e+00

Pos logit contributions
lee+0.189
 mad+0.183
 years+0.181
 glad+0.179
heart+0.176
Neg logit contributions
Ready-0.208
Circ-0.190
 hunt-0.190
 stairs-0.189
Find-0.184
Token-
Token ID12
Feature activation+1.87e+00

Pos logit contributions
 ok+0.070
Acc+0.067
illy+0.061
 alright+0.060
 okay+0.059
Neg logit contributions
 stairs-0.180
 aisle-0.174
 hills-0.170
-0.166
 hunt-0.166
Tokenyears
Token ID19002
Feature activation+1.98e+01

Pos logit contributions
lee+0.014
 mad+0.013
 ok+0.012
 glad+0.012
heart+0.011
Neg logit contributions
Ready-0.076
Circ-0.072
Find-0.072
 stairs-0.071
sc-0.070
Token-
Token ID12
Feature activation+1.69e+00

Pos logit contributions
 mad+0.365
 ok+0.363
 glad+0.363
 alright+0.342
 okay+0.339
Neg logit contributions
 stairs-0.544
Circ-0.536
 hunt-0.534
 ride-0.513
ro-0.512
Tokenold
Token ID727
Feature activation+4.29e+00

Pos logit contributions
 mad+0.036
lee+0.035
 ok+0.034
 glad+0.034
heart+0.033
Neg logit contributions
Ready-0.045
Circ-0.043
 stairs-0.042
 hunt-0.042
Find-0.041
Token and
Token ID290
Feature activation+4.60e+00

Pos logit contributions
aming+0.122
 pleased+0.118
 mad+0.118
lee+0.118
 glad+0.117
Neg logit contributions
 hunt-0.073
-0.072
 stairs-0.070
Ready-0.069
ro-0.068
Token she
Token ID673
Feature activation+1.11e+00

Pos logit contributions
lee+0.093
 mad+0.091
heart+0.088
ivering+0.084
illy+0.083
Neg logit contributions
Ready-0.123
 hunt-0.120
 stairs-0.118
 ride-0.116
-0.115
Token loved
Token ID6151
Feature activation+5.62e+00

Pos logit contributions
lee+0.023
heart+0.023
 mad+0.022
illy+0.021
 glad+0.021
Neg logit contributions
Ready-0.031
 hunt-0.029
-0.029
Circ-0.028
Find-0.028
Token sad
Token ID6507
Feature activation+1.55e+01

Pos logit contributions
lee+0.137
 years+0.135
 mad+0.135
 glad+0.131
heart+0.129
Neg logit contributions
Ready-0.152
 hunt-0.138
Circ-0.137
Find-0.135
 stairs-0.134
Token.
Token ID13
Feature activation+3.26e+00

Pos logit contributions
lee+0.338
 glad+0.328
years+0.328
 years+0.324
 mad+0.323
Neg logit contributions
Ready-0.391
 hunt-0.374
 stairs-0.368
-0.363
Circ-0.358
Token He
Token ID679
Feature activation+7.44e+00

Pos logit contributions
lee+0.091
 glad+0.089
 mad+0.088
 years+0.087
heart+0.085
Neg logit contributions
Ready-0.067
 hunt-0.063
Circ-0.061
 stairs-0.059
 shortcut-0.059
Token says
Token ID1139
Feature activation+8.93e+00

Pos logit contributions
lee+0.160
 glad+0.157
illy+0.153
heart+0.153
 mad+0.153
Neg logit contributions
Ready-0.196
 hunt-0.184
 stairs-0.176
-0.176
sc-0.172
Token "
Token ID366
Feature activation-1.67e+00

Pos logit contributions
lee+0.195
 mad+0.191
 years+0.189
 glad+0.188
heart+0.183
Neg logit contributions
Ready-0.234
 hunt-0.217
Circ-0.214
 stairs-0.213
-0.209
Tokenoh
Token ID1219
Feature activation+1.97e+01

Pos logit contributions
 hunt+0.034
Ready+0.032
 stairs+0.032
 shortcut+0.031
ground+0.030
Neg logit contributions
 mad-0.047
heart-0.044
 years-0.043
 glad-0.042
aming-0.042
Token no
Token ID645
Feature activation+9.03e+00

Pos logit contributions
lee+0.376
 mad+0.365
 years+0.362
 glad+0.358
heart+0.345
Neg logit contributions
Ready-0.580
 hunt-0.540
Circ-0.529
 stairs-0.527
Find-0.520
Token,
Token ID11
Feature activation+7.04e+00

Pos logit contributions
 mad+0.185
 years+0.178
lee+0.177
 glad+0.175
heart+0.172
Neg logit contributions
Ready-0.257
Circ-0.241
 hunt-0.239
ground-0.235
 stairs-0.234
Token my
Token ID616
Feature activation+6.62e+00

Pos logit contributions
lee+0.170
 mad+0.165
 years+0.164
 glad+0.163
heart+0.159
Neg logit contributions
Ready-0.170
 hunt-0.157
 stairs-0.154
Circ-0.153
Find-0.149
Token car
Token ID1097
Feature activation-4.71e+00

Pos logit contributions
 mad+0.140
lee+0.139
 glad+0.137
 years+0.136
heart+0.130
Neg logit contributions
Ready-0.180
 hunt-0.166
Circ-0.166
 stairs-0.162
Find-0.160
Token!"
Token ID2474
Feature activation-5.92e-01

Pos logit contributions
Ready+0.110
Circ+0.106
 gra+0.098
Find+0.098
ground+0.096
Neg logit contributions
 mad-0.123
 years-0.116
 glad-0.113
lee-0.112
 careful-0.109
Token.
Token ID13
Feature activation-2.47e+00

Pos logit contributions
lee+0.195
 mad+0.188
heart+0.187
years+0.185
 pleased+0.184
Neg logit contributions
 hunt-0.115
-0.107
 ride-0.103
Ready-0.100
 gra-0.099
Token He
Token ID679
Feature activation+4.00e+00

Pos logit contributions
Ready+0.060
 hunt+0.055
Circ+0.054
 stairs+0.054
Find+0.052
Neg logit contributions
lee-0.060
 mad-0.059
 glad-0.058
 years-0.058
heart-0.056
Token was
Token ID373
Feature activation+4.41e+00

Pos logit contributions
heart+0.090
lee+0.088
 mad+0.087
 glad+0.083
 ok+0.083
Neg logit contributions
Ready-0.105
 hunt-0.101
-0.098
 ride-0.098
Circ-0.095
Token three
Token ID1115
Feature activation+5.93e+00

Pos logit contributions
lee+0.099
heart+0.097
illy+0.090
ushing+0.088
 years+0.088
Neg logit contributions
Ready-0.115
 ride-0.111
 gra-0.111
 hunt-0.109
Find-0.108
Token-
Token ID12
Feature activation+2.36e+00

Pos logit contributions
 ok+0.093
Acc+0.086
illy+0.084
 alright+0.083
 okay+0.081
Neg logit contributions
 stairs-0.173
 hunt-0.171
 aisle-0.171
 hill-0.169
 ride-0.167
Tokenyears
Token ID19002
Feature activation+1.97e+01

Pos logit contributions
lee+0.027
 mad+0.024
 ok+0.024
 years+0.024
 glad+0.023
Neg logit contributions
Ready-0.087
Circ-0.083
Find-0.082
sc-0.081
 stairs-0.079
Token old
Token ID1468
Feature activation+5.48e+00

Pos logit contributions
 ok+0.403
 mad+0.394
 glad+0.386
 alright+0.385
 okay+0.372
Neg logit contributions
 stairs-0.497
 hunt-0.490
Circ-0.489
 ride-0.482
con-0.471
Token and
Token ID290
Feature activation+5.74e+00

Pos logit contributions
 ok+0.145
aming+0.143
lee+0.141
 pleased+0.141
illy+0.139
Neg logit contributions
-0.113
 stairs-0.112
 hill-0.111
 hills-0.110
 hunt-0.110
Token he
Token ID339
Feature activation+1.72e+00

Pos logit contributions
lee+0.130
heart+0.126
illy+0.120
 mistakes+0.119
 mad+0.117
Neg logit contributions
Ready-0.142
 ride-0.140
 hunt-0.136
-0.131
 Whe-0.131
Token loved
Token ID6151
Feature activation+7.20e+00

Pos logit contributions
lee+0.039
heart+0.039
 mad+0.036
illy+0.036
ushing+0.034
Neg logit contributions
Ready-0.044
 hunt-0.043
 ride-0.042
-0.041
Circ-0.040
Token skipping
Token ID31017
Feature activation-2.79e+00

Pos logit contributions
lee+0.209
 years+0.208
 mad+0.208
 glad+0.206
years+0.196
Neg logit contributions
Ready-0.146
sc-0.127
ground-0.122
 hunt-0.121
Find-0.120
Token,
Token ID11
Feature activation+7.05e+00

Pos logit contributions
Ready+0.066
 hunt+0.061
 stairs+0.060
Circ+0.060
sc+0.058
Neg logit contributions
lee-0.055
 mad-0.055
 glad-0.054
 years-0.053
heart-0.052
Token soaking
Token ID41714
Feature activation+9.17e-01

Pos logit contributions
 mad+0.148
lee+0.147
 glad+0.145
 years+0.143
heart+0.137
Neg logit contributions
Ready-0.192
Circ-0.178
 hunt-0.178
 stairs-0.174
Find-0.172
Token up
Token ID510
Feature activation-1.84e+00

Pos logit contributions
years+0.025
lee+0.024
 glad+0.023
 years+0.023
illy+0.023
Neg logit contributions
 stairs-0.017
-0.017
 hills-0.017
sc-0.016
Ready-0.016
Token the
Token ID262
Feature activation-2.93e+00

Pos logit contributions
Ready+0.050
Circ+0.046
 hunt+0.046
Find+0.045
 stairs+0.045
Neg logit contributions
lee-0.039
 mad-0.039
 years-0.038
 glad-0.038
heart-0.036
Token morning
Token ID3329
Feature activation-3.13e+00

Pos logit contributions
Ready+0.060
 hunt+0.054
Circ+0.054
 stairs+0.053
Find+0.051
Neg logit contributions
lee-0.081
 mad-0.080
 glad-0.079
 years-0.079
heart-0.077
Token de
Token ID390
Feature activation+1.96e+01

Pos logit contributions
Ready+0.089
Circ+0.083
Find+0.082
 hunt+0.081
 shortcut+0.079
Neg logit contributions
lee-0.064
 years-0.063
 mad-0.061
 glad-0.060
heart-0.060
Tokenw
Token ID86
Feature activation+6.48e+00

Pos logit contributions
 mad+0.351
 years+0.341
 glad+0.338
aming+0.325
lee+0.319
Neg logit contributions
Ready-0.591
 stairs-0.576
Circ-0.576
 hunt-0.571
 shortcut-0.555
Token that
Token ID326
Feature activation-3.51e+00

Pos logit contributions
 mad+0.126
 glad+0.125
 years+0.122
lee+0.122
years+0.119
Neg logit contributions
Ready-0.174
 hunt-0.166
 stairs-0.164
-0.164
 gra-0.161
Token gl
Token ID1278
Feature activation+7.31e+00

Pos logit contributions
Ready+0.092
Circ+0.088
Find+0.085
 hunt+0.084
 stairs+0.082
Neg logit contributions
 mad-0.078
 glad-0.076
 years-0.075
heart-0.075
lee-0.075
Tokenowed
Token ID6972
Feature activation+1.30e+00

Pos logit contributions
 mad+0.142
lee+0.141
 years+0.140
 glad+0.140
years+0.129
Neg logit contributions
Ready-0.206
 hunt-0.192
Circ-0.189
sc-0.189
 stairs-0.188
Token on
Token ID319
Feature activation-9.55e+00

Pos logit contributions
lee+0.029
 mad+0.029
 glad+0.029
 years+0.028
years+0.027
Neg logit contributions
Ready-0.032
sc-0.030
 hunt-0.030
 stairs-0.030
Circ-0.029
Token's
Token ID338
Feature activation+4.98e+00

Pos logit contributions
Ready+0.071
 hunt+0.070
 stairs+0.066
sc+0.065
ground+0.065
Neg logit contributions
lee-0.066
 mad-0.063
 glad-0.062
illy-0.062
years-0.061
Token courage
Token ID11917
Feature activation+2.53e+00

Pos logit contributions
lee+0.095
 mad+0.095
 glad+0.093
 years+0.092
heart+0.087
Neg logit contributions
Ready-0.146
 hunt-0.135
Circ-0.135
 stairs-0.134
sc-0.131
Token.
Token ID13
Feature activation-8.60e-01

Pos logit contributions
lee+0.054
 mad+0.053
 glad+0.052
 years+0.052
heart+0.050
Neg logit contributions
Ready-0.068
 hunt-0.063
Circ-0.062
 stairs-0.062
Find-0.060
Token The
Token ID383
Feature activation-5.10e-01

Pos logit contributions
Ready+0.021
 hunt+0.020
 shortcut+0.019
 stairs+0.019
Circ+0.019
Neg logit contributions
lee-0.021
 glad-0.020
 years-0.019
 mad-0.019
years-0.018
Token two
Token ID734
Feature activation+2.43e+00

Pos logit contributions
Ready+0.011
 hunt+0.010
 stairs+0.010
Circ+0.010
+0.009
Neg logit contributions
lee-0.014
 mad-0.014
 years-0.013
 glad-0.013
heart-0.013
Token de
Token ID390
Feature activation+1.96e+01

Pos logit contributions
lee+0.057
 mad+0.056
illy+0.055
 glad+0.054
 years+0.054
Neg logit contributions
Ready-0.055
 hunt-0.055
 stairs-0.053
 shortcut-0.052
 ride-0.052
Tokeners
Token ID364
Feature activation-2.60e+00

Pos logit contributions
lee+0.509
 mad+0.490
 glad+0.480
 years+0.478
heart+0.478
Neg logit contributions
Ready-0.453
 hunt-0.409
Circ-0.400
sc-0.395
 stairs-0.393
Token became
Token ID2627
Feature activation+5.49e+00

Pos logit contributions
Ready+0.070
 hunt+0.066
Circ+0.065
 stairs+0.064
Find+0.063
Neg logit contributions
lee-0.054
 mad-0.054
 glad-0.053
 years-0.052
heart-0.051
Token good
Token ID922
Feature activation+1.30e+01

Pos logit contributions
lee+0.101
heart+0.097
illy+0.096
 years+0.093
ivering+0.090
Neg logit contributions
Ready-0.166
Circ-0.154
 hunt-0.152
Find-0.151
-0.150
Token friends
Token ID2460
Feature activation-1.35e+00

Pos logit contributions
lee+0.179
heart+0.170
 glad+0.166
illy+0.163
years+0.160
Neg logit contributions
Ready-0.426
 hunt-0.420
Circ-0.415
 ride-0.409
 stairs-0.404
Token after
Token ID706
Feature activation-3.24e+00

Pos logit contributions
 hunt+0.032
Ready+0.032
 ride+0.031
sc+0.031
ground+0.031
Neg logit contributions
lee-0.030
 mad-0.030
heart-0.029
illy-0.028
years-0.028
TokenOnce
Token ID7454
Feature activation-3.24e+00

Pos logit contributions
 hunt+0.201
tr+0.199
 shortcut+0.196
Find+0.187
Ready+0.181
Neg logit contributions
illy-0.344
 years-0.343
lee-0.335
aming-0.319
 brothers-0.310
Token upon
Token ID2402
Feature activation-8.78e+00

Pos logit contributions
Ready+0.073
 hunt+0.066
Circ+0.065
 stairs+0.065
Find+0.063
Neg logit contributions
lee-0.084
 mad-0.083
 glad-0.082
 years-0.081
heart-0.079
Token a
Token ID257
Feature activation-3.46e+00

Pos logit contributions
Ready+0.200
 hunt+0.183
Circ+0.180
 stairs+0.179
+0.176
Neg logit contributions
lee-0.227
 mad-0.220
 glad-0.215
 years-0.215
heart-0.212
Token time
Token ID640
Feature activation-7.31e+00

Pos logit contributions
Ready+0.060
+0.055
 hunt+0.054
Circ+0.052
sc+0.052
Neg logit contributions
lee-0.109
years-0.104
 mad-0.101
heart-0.100
 glad-0.098
Token,
Token ID11
Feature activation+2.17e+00

Pos logit contributions
Ready+0.225
 hunt+0.211
Circ+0.209
 stairs+0.207
Find+0.203
Neg logit contributions
lee-0.128
 mad-0.126
 glad-0.123
 years-0.122
heart-0.117
Token there
Token ID612
Feature activation-1.69e+01

Pos logit contributions
lee+0.062
 years+0.060
 mad+0.060
heart+0.058
 glad+0.058
Neg logit contributions
Ready-0.043
-0.040
 stairs-0.038
 hunt-0.038
Circ-0.037
Token was
Token ID373
Feature activation+1.66e+00

Pos logit contributions
Ready+0.399
 hunt+0.364
Circ+0.360
 stairs+0.354
sc+0.348
Neg logit contributions
lee-0.419
 mad-0.417
 glad-0.412
 years-0.411
heart-0.397
Token a
Token ID257
Feature activation-3.30e+00

Pos logit contributions
lee+0.055
heart+0.053
ushing+0.052
 mad+0.052
ivering+0.051
Neg logit contributions
Ready-0.027
-0.026
Circ-0.025
 stairs-0.024
Under-0.022
Token graceful
Token ID44363
Feature activation+4.92e+00

Pos logit contributions
Ready+0.078
 ride+0.072
 hunt+0.072
 hill+0.072
 stairs+0.070
Neg logit contributions
lee-0.083
heart-0.080
years-0.079
asured-0.077
 years-0.077
Token hero
Token ID4293
Feature activation-1.50e+00

Pos logit contributions
 mad+0.137
lee+0.137
 glad+0.135
 years+0.134
years+0.133
Neg logit contributions
Ready-0.098
 hunt-0.090
 stairs-0.088
Circ-0.087
 ride-0.083
Token named
Token ID3706
Feature activation-1.24e-02

Pos logit contributions
Ready+0.044
Circ+0.040
 stairs+0.039
Find+0.039
 hunt+0.039
Neg logit contributions
 years-0.030
lee-0.030
 mad-0.029
 glad-0.029
heart-0.028
Token<|endoftext|>
Token ID50256
Feature activation-1.12e+01

Pos logit contributions
lee+0.227
 glad+0.224
 years+0.222
 mad+0.218
 pleased+0.207
Neg logit contributions
Ready-0.176
Circ-0.161
 hunt-0.160
 stairs-0.159
sc-0.156
TokenOnce
Token ID7454
Feature activation-2.74e+00

Pos logit contributions
 hunt+0.202
 shortcut+0.191
tr+0.190
Find+0.182
Ready+0.175
Neg logit contributions
 years-0.321
illy-0.319
lee-0.309
aming-0.305
 glad-0.291
Token upon
Token ID2402
Feature activation-8.52e+00

Pos logit contributions
Ready+0.061
 hunt+0.056
Circ+0.055
 stairs+0.054
Find+0.053
Neg logit contributions
lee-0.071
 mad-0.070
 glad-0.069
 years-0.069
heart-0.067
Token a
Token ID257
Feature activation-3.25e+00

Pos logit contributions
Ready+0.200
 hunt+0.183
Circ+0.181
 stairs+0.179
+0.175
Neg logit contributions
lee-0.213
 mad-0.208
 glad-0.204
 years-0.203
heart-0.199
Token time
Token ID640
Feature activation-7.02e+00

Pos logit contributions
Ready+0.057
+0.052
 hunt+0.051
 ride+0.050
Circ+0.049
Neg logit contributions
lee-0.101
years-0.097
 mad-0.094
heart-0.093
 glad-0.091
Token there
Token ID612
Feature activation-1.67e+01

Pos logit contributions
Ready+0.214
 hunt+0.200
Circ+0.198
 stairs+0.197
+0.193
Neg logit contributions
lee-0.125
 mad-0.123
 glad-0.120
 years-0.119
heart-0.114
Token was
Token ID373
Feature activation+1.73e+00

Pos logit contributions
Ready+0.401
 hunt+0.366
Circ+0.363
 stairs+0.356
sc+0.351
Neg logit contributions
 mad-0.410
lee-0.410
 glad-0.402
 years-0.402
heart-0.389
Token a
Token ID257
Feature activation-3.21e+00

Pos logit contributions
lee+0.062
heart+0.060
ushing+0.059
ious+0.058
 mad+0.057
Neg logit contributions
-0.024
Ready-0.023
Circ-0.020
 stairs-0.020
 ride-0.019
Token restless
Token ID42537
Feature activation+5.74e+00

Pos logit contributions
Ready+0.076
 ride+0.071
 hunt+0.071
 hill+0.071
 stairs+0.069
Neg logit contributions
lee-0.080
heart-0.076
years-0.076
 years-0.074
aming-0.073
Token little
Token ID1310
Feature activation+2.28e+00

Pos logit contributions
lee+0.152
 mad+0.151
 glad+0.149
 years+0.148
years+0.145
Neg logit contributions
Ready-0.122
 hunt-0.113
 stairs-0.111
Circ-0.110
-0.106
Token girl
Token ID2576
Feature activation+1.34e+00

Pos logit contributions
lee+0.068
years+0.064
aming+0.064
 years+0.063
 glad+0.063
Neg logit contributions
Ready-0.043
 hill-0.042
 stairs-0.042
 hunt-0.041
Circ-0.040
TokenOnce
Token ID7454
Feature activation-2.77e+00

Pos logit contributions
 hunt+0.212
tr+0.210
Find+0.203
 shortcut+0.197
Ready+0.189
Neg logit contributions
 years-0.319
illy-0.298
lee-0.294
 brothers-0.289
aming-0.287
Token upon
Token ID2402
Feature activation-8.49e+00

Pos logit contributions
Ready+0.066
 hunt+0.060
Circ+0.059
Find+0.057
 stairs+0.057
Neg logit contributions
lee-0.069
 mad-0.068
 glad-0.068
 years-0.066
heart-0.065
Token a
Token ID257
Feature activation-3.24e+00

Pos logit contributions
Ready+0.199
 hunt+0.182
Circ+0.180
 stairs+0.178
+0.174
Neg logit contributions
lee-0.213
 mad-0.208
 glad-0.204
 years-0.203
heart-0.200
Token time
Token ID640
Feature activation-7.01e+00

Pos logit contributions
Ready+0.057
+0.052
 hunt+0.051
 ride+0.049
sc+0.049
Neg logit contributions
lee-0.102
years-0.098
 mad-0.094
heart-0.093
 glad-0.091
Token,
Token ID11
Feature activation+2.44e+00

Pos logit contributions
Ready+0.211
 hunt+0.198
Circ+0.195
 stairs+0.195
+0.191
Neg logit contributions
lee-0.127
 mad-0.125
 years-0.121
 glad-0.121
heart-0.116
Token there
Token ID612
Feature activation-1.67e+01

Pos logit contributions
lee+0.069
 years+0.066
 mad+0.066
heart+0.064
 glad+0.064
Neg logit contributions
Ready-0.050
-0.045
 hunt-0.044
 stairs-0.044
Circ-0.044
Token was
Token ID373
Feature activation+1.67e+00

Pos logit contributions
Ready+0.420
 hunt+0.383
Circ+0.379
 stairs+0.371
sc+0.371
Neg logit contributions
 mad-0.389
lee-0.387
 glad-0.387
 years-0.385
heart-0.367
Token a
Token ID257
Feature activation-3.16e+00

Pos logit contributions
lee+0.059
heart+0.057
ushing+0.055
ivering+0.054
 mad+0.054
Neg logit contributions
Ready-0.025
-0.024
Circ-0.023
 stairs-0.022
 ride-0.021
Token powerful
Token ID3665
Feature activation+3.11e+00

Pos logit contributions
Ready+0.073
 ride+0.069
 hill+0.068
 hunt+0.068
Circ+0.066
Neg logit contributions
lee-0.081
heart-0.078
years-0.077
asured-0.075
 years-0.075
Token lion
Token ID18744
Feature activation-6.01e+00

Pos logit contributions
lee+0.089
 mad+0.087
 glad+0.086
 years+0.086
years+0.086
Neg logit contributions
Ready-0.060
 hunt-0.056
 stairs-0.055
Circ-0.053
 ride-0.052
Token named
Token ID3706
Feature activation+2.08e-01

Pos logit contributions
Ready+0.184
Circ+0.166
 stairs+0.165
Find+0.163
 shortcut+0.161
Neg logit contributions
 years-0.115
lee-0.110
 mad-0.110
 glad-0.108
heart-0.103
Token a
Token ID257
Feature activation-3.35e+00

Pos logit contributions
lee+0.046
 mad+0.046
 glad+0.045
 years+0.045
heart+0.043
Neg logit contributions
Ready-0.065
Circ-0.061
 hunt-0.061
 stairs-0.059
Find-0.059
Token very
Token ID845
Feature activation+9.21e+00

Pos logit contributions
Ready+0.086
 hunt+0.081
Circ+0.081
 stairs+0.078
Find+0.078
Neg logit contributions
 mad-0.074
 glad-0.072
lee-0.072
 years-0.072
heart-0.068
Token angry
Token ID7954
Feature activation+6.46e+00

Pos logit contributions
lee+0.166
 years+0.156
 mad+0.156
 glad+0.154
heart+0.151
Neg logit contributions
Ready-0.279
 hunt-0.259
 stairs-0.259
Circ-0.252
 shortcut-0.250
Token man
Token ID582
Feature activation-3.26e+00

Pos logit contributions
lee+0.156
 mad+0.156
 years+0.151
heart+0.150
 glad+0.150
Neg logit contributions
Ready-0.158
Circ-0.143
 hunt-0.143
 stairs-0.141
Find-0.139
Token up
Token ID510
Feature activation-6.21e+00

Pos logit contributions
Ready+0.098
Circ+0.093
 stairs+0.092
Find+0.090
 hunt+0.088
Neg logit contributions
 mad-0.061
 years-0.059
lee-0.059
 glad-0.059
heart-0.057
Token there
Token ID612
Feature activation-1.67e+01

Pos logit contributions
Ready+0.180
Circ+0.169
 hunt+0.164
Find+0.163
 gra+0.157
Neg logit contributions
 mad-0.130
 years-0.127
lee-0.124
 glad-0.123
heart-0.119
Token.
Token ID13
Feature activation-4.42e+00

Pos logit contributions
Ready+0.492
Circ+0.479
Find+0.466
 hunt+0.447
 stairs+0.445
Neg logit contributions
 years-0.315
heart-0.313
 mad-0.310
 glad-0.306
lee-0.303
Token We
Token ID775
Feature activation-2.53e+00

Pos logit contributions
 hunt+0.104
Ready+0.097
Circ+0.095
 shortcut+0.093
 stairs+0.092
Neg logit contributions
lee-0.109
 glad-0.108
 years-0.102
 ok-0.101
 mad-0.100
Token need
Token ID761
Feature activation-2.65e+00

Pos logit contributions
Ready+0.059
Circ+0.055
 stairs+0.054
 hunt+0.054
sc+0.052
Neg logit contributions
 mad-0.063
lee-0.063
 glad-0.061
 years-0.061
heart-0.059
Token to
Token ID284
Feature activation-1.12e+01

Pos logit contributions
Ready+0.055
Circ+0.049
 hunt+0.048
Find+0.048
sc+0.047
Neg logit contributions
 years-0.073
 mad-0.071
lee-0.071
 glad-0.071
 careful-0.070
Token be
Token ID307
Feature activation+2.64e+00

Pos logit contributions
Ready+0.228
Circ+0.207
 stairs+0.200
sc+0.199
+0.197
Neg logit contributions
 mad-0.317
lee-0.311
 years-0.309
 glad-0.307
 ok-0.297
Token.
Token ID13
Feature activation-4.32e+00

Pos logit contributions
Ready+0.053
 hunt+0.050
Circ+0.049
 stairs+0.049
sc+0.047
Neg logit contributions
lee-0.037
 mad-0.036
 glad-0.035
 years-0.035
heart-0.034
Tokenâ
Token ID22940
Feature activation+1.07e+01

Pos logit contributions
 hunt+0.100
Ready+0.096
 stairs+0.093
Circ+0.092
 shortcut+0.090
Neg logit contributions
 glad-0.109
lee-0.107
 years-0.102
heart-0.101
 ok-0.101
Token
Token ID26391
Feature activation+8.36e+00

Pos logit contributions
 mad+0.252
lee+0.251
 years+0.249
 glad+0.247
heart+0.243
Neg logit contributions
Ready-0.256
 stairs-0.248
Circ-0.247
 hunt-0.245
Find-0.236
Token<|endoftext|>
Token ID50256
Feature activation-1.17e+01

Pos logit contributions
 glad+0.239
lee+0.239
 years+0.234
 mad+0.228
 pleased+0.220
Neg logit contributions
Ready-0.165
Circ-0.153
 hunt-0.151
sc-0.151
 stairs-0.149
TokenOnce
Token ID7454
Feature activation-3.02e+00

Pos logit contributions
 hunt+0.215
tr+0.212
 shortcut+0.200
Find+0.196
Circ+0.191
Neg logit contributions
 years-0.329
illy-0.317
lee-0.314
aming-0.302
 glad-0.296
Token there
Token ID612
Feature activation-1.67e+01

Pos logit contributions
Ready+0.067
 hunt+0.061
Circ+0.060
 stairs+0.059
Find+0.057
Neg logit contributions
lee-0.079
 mad-0.078
 glad-0.077
 years-0.077
heart-0.074
Token was
Token ID373
Feature activation+1.71e+00

Pos logit contributions
Ready+0.373
 hunt+0.341
Circ+0.336
 stairs+0.332
Find+0.324
Neg logit contributions
lee-0.433
 mad-0.431
 glad-0.425
 years-0.422
heart-0.410
Token a
Token ID257
Feature activation-3.11e+00

Pos logit contributions
lee+0.063
ivering+0.060
heart+0.059
ious+0.059
ushing+0.059
Neg logit contributions
-0.022
Ready-0.021
 Up-0.019
Circ-0.018
 ride-0.018
Token girl
Token ID2576
Feature activation+1.43e+00

Pos logit contributions
Ready+0.079
 hunt+0.072
 ride+0.072
Circ+0.071
 stairs+0.071
Neg logit contributions
lee-0.073
years-0.069
heart-0.069
 years-0.068
 mad-0.068
Token named
Token ID3706
Feature activation+5.28e-02

Pos logit contributions
lee+0.032
 mad+0.032
 pleased+0.031
 glad+0.030
 ok+0.030
Neg logit contributions
 hunt-0.034
Ready-0.034
-0.033
 ride-0.033
 stairs-0.032
Token Jane
Token ID12091
Feature activation+5.63e+00

Pos logit contributions
lee+0.001
 mad+0.001
 glad+0.001
 years+0.001
heart+0.001
Neg logit contributions
Ready-0.001
 hunt-0.001
Circ-0.001
 stairs-0.001
Find-0.001
Tokenâ
Token ID22940
Feature activation+1.07e+01

Pos logit contributions
Ready+0.147
 hunt+0.136
Circ+0.134
 stairs+0.132
+0.131
Neg logit contributions
lee-0.115
 mad-0.113
 glad-0.112
 years-0.111
years-0.107
Token
Token ID26391
Feature activation+8.52e+00

Pos logit contributions
 mad+0.248
lee+0.245
 years+0.244
 glad+0.241
heart+0.238
Neg logit contributions
Ready-0.261
 stairs-0.257
Circ-0.248
 hunt-0.245
Find-0.239
Token
Token ID8151
Feature activation-3.60e+00

Pos logit contributions
 glad+0.236
lee+0.234
 years+0.227
 mad+0.225
 pleased+0.217
Neg logit contributions
Ready-0.175
 stairs-0.162
 hunt-0.157
Circ-0.157
sc-0.156
Token<|endoftext|>
Token ID50256
Feature activation-1.13e+01

Pos logit contributions
Ready+0.081
 hunt+0.075
Circ+0.073
 stairs+0.072
Find+0.071
Neg logit contributions
lee-0.094
 mad-0.093
 glad-0.092
 years-0.090
heart-0.089
TokenOnce
Token ID7454
Feature activation-2.68e+00

Pos logit contributions
 hunt+0.191
tr+0.187
 shortcut+0.186
Find+0.169
Ready+0.166
Neg logit contributions
 years-0.337
illy-0.324
lee-0.322
aming-0.309
heart-0.294
Token there
Token ID612
Feature activation-1.67e+01

Pos logit contributions
Ready+0.054
 stairs+0.049
+0.049
 hunt+0.049
Circ+0.048
Neg logit contributions
 mad-0.073
lee-0.073
 years-0.072
 glad-0.071
years-0.070
Token was
Token ID373
Feature activation+1.74e+00

Pos logit contributions
Ready+0.341
 hunt+0.307
 stairs+0.301
Circ+0.299
+0.297
Neg logit contributions
lee-0.465
 mad-0.458
 glad-0.451
 years-0.450
years-0.438
Token a
Token ID257
Feature activation-3.19e+00

Pos logit contributions
ivering+0.059
lee+0.059
ushing+0.057
ious+0.057
 mad+0.057
Neg logit contributions
-0.026
Ready-0.024
 Up-0.022
Circ-0.022
 ride-0.019
Token boy
Token ID2933
Feature activation+3.43e-01

Pos logit contributions
Ready+0.083
Circ+0.075
 hunt+0.075
 ride+0.074
 stairs+0.073
Neg logit contributions
lee-0.073
 years-0.069
 mad-0.069
heart-0.069
years-0.068
Token who
Token ID508
Feature activation-3.29e+00

Pos logit contributions
lee+0.008
 pleased+0.008
 mad+0.008
 ok+0.008
 glad+0.008
Neg logit contributions
 hunt-0.008
-0.007
 ride-0.007
Ready-0.007
ro-0.007
Token liked
Token ID8288
Feature activation+1.71e+00

Pos logit contributions
Ready+0.101
 hunt+0.094
Circ+0.092
 stairs+0.091
+0.091
Neg logit contributions
lee-0.060
 mad-0.058
 glad-0.056
 years-0.056
heart-0.055
TokenOnce
Token ID7454
Feature activation-2.75e+00

Pos logit contributions
 hunt+0.225
 shortcut+0.210
Ready+0.207
tr+0.204
Find+0.203
Neg logit contributions
 years-0.290
 glad-0.276
aming-0.274
 brothers-0.267
illy-0.262
Token upon
Token ID2402
Feature activation-8.29e+00

Pos logit contributions
Ready+0.060
 hunt+0.054
Circ+0.053
 stairs+0.053
+0.052
Neg logit contributions
lee-0.072
 mad-0.072
 years-0.071
 glad-0.070
heart-0.068
Token a
Token ID257
Feature activation-2.96e+00

Pos logit contributions
Ready+0.191
 hunt+0.175
Circ+0.172
 stairs+0.171
+0.167
Neg logit contributions
lee-0.212
 mad-0.205
 glad-0.202
 years-0.201
heart-0.199
Token time
Token ID640
Feature activation-6.83e+00

Pos logit contributions
Ready+0.049
 hunt+0.047
+0.045
 ride+0.044
 hike+0.042
Neg logit contributions
lee-0.096
years-0.092
heart-0.089
 mad-0.088
ious-0.085
Token,
Token ID11
Feature activation+2.72e+00

Pos logit contributions
Ready+0.211
 hunt+0.197
Circ+0.195
 stairs+0.194
Find+0.190
Neg logit contributions
lee-0.119
 mad-0.117
 glad-0.115
 years-0.114
heart-0.109
Token there
Token ID612
Feature activation-1.67e+01

Pos logit contributions
lee+0.074
 mad+0.071
 years+0.071
 glad+0.070
heart+0.069
Neg logit contributions
Ready-0.059
 hunt-0.053
 stairs-0.052
Circ-0.052
-0.052
Token was
Token ID373
Feature activation+1.95e+00

Pos logit contributions
Ready+0.416
 hunt+0.378
Circ+0.376
 stairs+0.367
sc+0.367
Neg logit contributions
 mad-0.390
lee-0.388
 glad-0.387
 years-0.384
heart-0.367
Token a
Token ID257
Feature activation-2.91e+00

Pos logit contributions
lee+0.069
heart+0.068
ushing+0.065
ivering+0.064
ious+0.064
Neg logit contributions
Ready-0.029
-0.027
Circ-0.025
 stairs-0.024
 ride-0.023
Token march
Token ID9960
Feature activation-3.47e+00

Pos logit contributions
Ready+0.068
 ride+0.064
 hunt+0.064
 hill+0.063
 stairs+0.062
Neg logit contributions
lee-0.074
heart-0.071
years-0.070
 years-0.068
illy-0.067
Token.
Token ID13
Feature activation-4.30e+00

Pos logit contributions
Ready+0.104
Circ+0.095
Find+0.095
 gra+0.093
 hunt+0.090
Neg logit contributions
 years-0.066
 mad-0.063
heart-0.062
 glad-0.061
lee-0.061
Token It
Token ID632
Feature activation-5.58e+00

Pos logit contributions
Ready+0.094
 hunt+0.093
sc+0.091
 shortcut+0.088
con+0.087
Neg logit contributions
lee-0.101
 glad-0.101
 years-0.101
 indeed-0.096
 okay-0.096
Tokenâ
Token ID22940
Feature activation+1.06e+01

Pos logit contributions
Ready+0.150
 hunt+0.139
Circ+0.137
 stairs+0.136
sc+0.133
Neg logit contributions
lee-0.114
 mad-0.112
 glad-0.111
 years-0.110
heart-0.105
Token
Token ID26391
Feature activation+8.19e+00

Pos logit contributions
 mad+0.248
lee+0.245
 years+0.244
 glad+0.240
heart+0.239
Neg logit contributions
Ready-0.254
 stairs-0.250
Circ-0.243
 hunt-0.238
Find-0.234
Token
Token ID8151
Feature activation-3.62e+00

Pos logit contributions
 glad+0.217
lee+0.214
 years+0.212
 mad+0.207
 careful+0.201
Neg logit contributions
Ready-0.173
 stairs-0.162
Circ-0.156
 hunt-0.155
sc-0.155
Tokens
Token ID82
Feature activation+2.21e+00

Pos logit contributions
Ready+0.077
 hunt+0.071
Circ+0.069
 stairs+0.068
Find+0.067
Neg logit contributions
lee-0.098
 mad-0.098
 glad-0.097
 years-0.095
heart-0.094
Token go
Token ID467
Feature activation-4.07e+00

Pos logit contributions
 mad+0.064
lee+0.064
 glad+0.062
 years+0.062
heart+0.061
Neg logit contributions
Ready-0.042
Circ-0.040
 stairs-0.038
sc-0.037
 hunt-0.037
Token there
Token ID612
Feature activation-1.67e+01

Pos logit contributions
Ready+0.104
Circ+0.097
 hunt+0.093
 stairs+0.093
Find+0.093
Neg logit contributions
lee-0.093
 mad-0.092
 glad-0.091
 years-0.089
heart-0.086
Token!
Token ID0
Feature activation-4.21e+00

Pos logit contributions
Ready+0.492
Circ+0.456
 hunt+0.456
 stairs+0.448
Find+0.447
Neg logit contributions
lee-0.308
 mad-0.308
 glad-0.305
 years-0.304
heart-0.292
Tokenâ
Token ID22940
Feature activation+1.05e+01

Pos logit contributions
 hunt+0.104
 stairs+0.099
Ready+0.096
Circ+0.094
 gra+0.092
Neg logit contributions
 glad-0.097
 ok-0.094
 okay-0.093
lee-0.092
heart-0.090
Token
Token ID26391
Feature activation+8.06e+00

Pos logit contributions
 mad+0.241
lee+0.241
 years+0.239
 glad+0.235
heart+0.233
Neg logit contributions
Ready-0.254
 stairs-0.249
Circ-0.248
 hunt-0.243
Find-0.240
Token They
Token ID1119
Feature activation-1.86e-01

Pos logit contributions
 glad+0.217
lee+0.215
 years+0.214
 mad+0.207
 careful+0.202
Neg logit contributions
Ready-0.166
Circ-0.156
 stairs-0.154
 hunt-0.151
sc-0.149
Token ran
Token ID4966
Feature activation-6.68e-01

Pos logit contributions
Ready+0.005
 hunt+0.004
Circ+0.004
 stairs+0.004
Find+0.004
Neg logit contributions
lee-0.004
 mad-0.004
 glad-0.004
 years-0.004
heart-0.004
Token<|endoftext|>
Token ID50256
Feature activation-1.10e+01

Pos logit contributions
lee+0.239
 glad+0.238
 years+0.233
 mad+0.226
 pleased+0.219
Neg logit contributions
Ready-0.172
Circ-0.157
 stairs-0.155
 hunt-0.151
sc-0.150
TokenOnce
Token ID7454
Feature activation-2.56e+00

Pos logit contributions
 hunt+0.195
tr+0.185
 shortcut+0.181
Circ+0.173
Find+0.172
Neg logit contributions
 years-0.323
illy-0.311
aming-0.307
lee-0.305
 glad-0.291
Token upon
Token ID2402
Feature activation-8.27e+00

Pos logit contributions
Ready+0.056
 hunt+0.051
Circ+0.050
 stairs+0.050
Find+0.048
Neg logit contributions
lee-0.067
 mad-0.067
 glad-0.065
 years-0.065
heart-0.063
Token a
Token ID257
Feature activation-3.12e+00

Pos logit contributions
Ready+0.193
 hunt+0.176
Circ+0.174
 stairs+0.173
+0.169
Neg logit contributions
lee-0.208
 mad-0.203
 glad-0.199
 years-0.199
heart-0.195
Token time
Token ID640
Feature activation-6.81e+00

Pos logit contributions
Ready+0.054
+0.049
 hunt+0.046
 ride+0.046
Circ+0.045
Neg logit contributions
lee-0.100
years-0.096
 mad-0.092
heart-0.092
ious-0.089
Token there
Token ID612
Feature activation-1.67e+01

Pos logit contributions
Ready+0.206
 hunt+0.192
Circ+0.190
 stairs+0.190
+0.186
Neg logit contributions
lee-0.123
 mad-0.121
 years-0.117
 glad-0.117
heart-0.113
Token was
Token ID373
Feature activation+1.78e+00

Pos logit contributions
Ready+0.390
 hunt+0.356
Circ+0.352
 stairs+0.347
sc+0.341
Neg logit contributions
 mad-0.416
lee-0.416
 glad-0.408
 years-0.408
heart-0.395
Token a
Token ID257
Feature activation-3.15e+00

Pos logit contributions
lee+0.064
heart+0.062
ushing+0.060
ious+0.059
 mad+0.058
Neg logit contributions
-0.025
Ready-0.024
Circ-0.021
 stairs-0.021
 ride-0.020
Token girl
Token ID2576
Feature activation+1.51e+00

Pos logit contributions
Ready+0.075
 ride+0.069
 hill+0.069
 hunt+0.069
 stairs+0.067
Neg logit contributions
lee-0.079
years-0.076
heart-0.075
asured-0.073
aming-0.073
Token called
Token ID1444
Feature activation+8.37e-01

Pos logit contributions
lee+0.036
 mad+0.036
 pleased+0.035
 ok+0.034
heart+0.034
Neg logit contributions
 hunt-0.034
Ready-0.033
 ride-0.033
-0.033
 gra-0.032
Token Emma
Token ID18966
Feature activation+6.21e+00

Pos logit contributions
lee+0.016
 mad+0.015
illy+0.015
 glad+0.015
 years+0.014
Neg logit contributions
Ready-0.025
 hunt-0.023
Find-0.023
Circ-0.023
sc-0.023
Token in
Token ID287
Feature activation-1.20e+01

Pos logit contributions
lee+0.068
 mad+0.067
 glad+0.066
 years+0.066
heart+0.064
Neg logit contributions
Ready-0.061
 hunt-0.055
Circ-0.055
 stairs-0.054
Find-0.053
Token a
Token ID257
Feature activation-2.91e+00

Pos logit contributions
Ready+0.330
 hunt+0.304
Circ+0.299
 stairs+0.299
sc+0.294
Neg logit contributions
 glad-0.250
 years-0.249
 mad-0.248
lee-0.238
 careful-0.232
Token small
Token ID1402
Feature activation+1.40e+00

Pos logit contributions
Ready+0.064
 hunt+0.058
+0.057
Circ+0.056
 stairs+0.056
Neg logit contributions
lee-0.080
heart-0.077
 mad-0.075
years-0.075
illy-0.075
Token town
Token ID3240
Feature activation-9.03e+00

Pos logit contributions
lee+0.043
 mad+0.042
 glad+0.042
 years+0.041
years+0.041
Neg logit contributions
Ready-0.024
 hunt-0.022
 stairs-0.022
Circ-0.022
-0.021
Token,
Token ID11
Feature activation+2.70e+00

Pos logit contributions
Ready+0.280
 hunt+0.261
Circ+0.259
 stairs+0.255
Find+0.252
Neg logit contributions
lee-0.159
 mad-0.157
 glad-0.154
 years-0.154
heart-0.145
Token there
Token ID612
Feature activation-1.67e+01

Pos logit contributions
lee+0.075
 mad+0.072
 years+0.071
heart+0.070
 glad+0.070
Neg logit contributions
Ready-0.058
 stairs-0.052
 hunt-0.052
-0.051
Circ-0.051
Token was
Token ID373
Feature activation+1.88e+00

Pos logit contributions
Ready+0.368
 hunt+0.334
Circ+0.330
 stairs+0.326
sc+0.317
Neg logit contributions
lee-0.438
 mad-0.434
 glad-0.429
 years-0.427
heart-0.414
Token a
Token ID257
Feature activation-2.85e+00

Pos logit contributions
lee+0.053
 mad+0.051
heart+0.050
 glad+0.049
 years+0.049
Neg logit contributions
Ready-0.041
 stairs-0.037
Circ-0.036
 hunt-0.036
-0.036
Token big
Token ID1263
Feature activation+3.79e-01

Pos logit contributions
Ready+0.073
 hunt+0.067
 stairs+0.067
Circ+0.065
 ride+0.064
Neg logit contributions
lee-0.068
heart-0.064
years-0.063
 years-0.063
illy-0.062
Token mill
Token ID3939
Feature activation-6.84e+00

Pos logit contributions
lee+0.012
years+0.011
 mad+0.011
heart+0.011
 years+0.011
Neg logit contributions
Ready-0.007
 hunt-0.006
 stairs-0.006
 hill-0.006
 ride-0.006
Token.
Token ID13
Feature activation-4.28e+00

Pos logit contributions
Ready+0.202
Circ+0.181
Find+0.178
+0.172
 hunt+0.170
Neg logit contributions
 years-0.143
 mad-0.135
 glad-0.133
lee-0.133
heart-0.126
Token the
Token ID262
Feature activation-7.00e+00

Pos logit contributions
Ready+0.234
 hunt+0.209
Circ+0.209
sc+0.209
Find+0.205
Neg logit contributions
 mad-0.247
 glad-0.247
lee-0.247
 years-0.243
illy-0.230
Token park
Token ID3952
Feature activation-8.57e+00

Pos logit contributions
Ready+0.123
sc+0.116
Circ+0.116
 hunt+0.113
Find+0.110
Neg logit contributions
 mad-0.206
 years-0.204
 glad-0.203
lee-0.199
illy-0.193
Token.
Token ID13
Feature activation-3.89e+00

Pos logit contributions
Ready+0.255
Circ+0.233
Find+0.228
 hunt+0.225
 stairs+0.224
Neg logit contributions
lee-0.168
 years-0.165
 mad-0.162
 glad-0.162
heart-0.152
Token<|endoftext|>
Token ID50256
Feature activation-1.12e+01

Pos logit contributions
Ready+0.079
 hunt+0.077
 shortcut+0.072
Circ+0.070
 stairs+0.069
Neg logit contributions
 glad-0.112
lee-0.110
 years-0.106
 mad-0.104
 pleased-0.102
TokenOnce
Token ID7454
Feature activation-2.49e+00

Pos logit contributions
 hunt+0.269
Ready+0.261
 shortcut+0.257
Find+0.255
Circ+0.243
Neg logit contributions
 years-0.252
 glad-0.238
lee-0.234
aming-0.230
illy-0.227
Token there
Token ID612
Feature activation-1.66e+01

Pos logit contributions
Ready+0.050
 stairs+0.046
 hunt+0.046
+0.045
Circ+0.045
Neg logit contributions
lee-0.068
 mad-0.067
 years-0.067
 glad-0.065
heart-0.065
Token was
Token ID373
Feature activation+1.85e+00

Pos logit contributions
Ready+0.355
 hunt+0.322
Circ+0.316
 stairs+0.315
Find+0.305
Neg logit contributions
lee-0.448
 mad-0.443
 glad-0.436
 years-0.435
heart-0.423
Token a
Token ID257
Feature activation-2.92e+00

Pos logit contributions
lee+0.070
heart+0.065
ious+0.065
ushing+0.064
ivering+0.064
Neg logit contributions
-0.023
Ready-0.021
 Up-0.020
 ride-0.019
Circ-0.019
Token little
Token ID1310
Feature activation+2.31e+00

Pos logit contributions
Ready+0.071
 ride+0.066
 hunt+0.065
Circ+0.065
 hill+0.063
Neg logit contributions
lee-0.073
years-0.069
heart-0.069
 years-0.067
 mad-0.066
Token boy
Token ID2933
Feature activation+2.37e-01

Pos logit contributions
lee+0.070
years+0.067
 ok+0.065
 years+0.065
aming+0.064
Neg logit contributions
 hill-0.042
Ready-0.041
 stairs-0.040
 ride-0.039
 aisle-0.038
Token.
Token ID13
Feature activation-4.66e+00

Pos logit contributions
lee+0.006
 pleased+0.006
 mad+0.006
 ok+0.005
heart+0.005
Neg logit contributions
 hunt-0.005
 ride-0.005
-0.005
Ready-0.005
 gra-0.005
Token the
Token ID262
Feature activation-7.35e+00

Pos logit contributions
Ready+0.082
 hunt+0.076
Circ+0.075
 stairs+0.075
sc+0.073
Neg logit contributions
lee-0.070
 mad-0.069
 glad-0.068
 years-0.067
heart-0.066
Token pumpkin
Token ID30089
Feature activation-9.23e+00

Pos logit contributions
Ready+0.189
 hunt+0.177
 stairs+0.175
Circ+0.174
+0.169
Neg logit contributions
lee-0.166
 mad-0.159
 years-0.158
 glad-0.158
years-0.154
Token together
Token ID1978
Feature activation-1.21e+00

Pos logit contributions
Ready+0.263
 hunt+0.242
Circ+0.241
 stairs+0.238
Find+0.235
Neg logit contributions
lee-0.184
 mad-0.182
 glad-0.178
 years-0.177
heart-0.170
Token,
Token ID11
Feature activation+2.73e+00

Pos logit contributions
Ready+0.036
Circ+0.033
Find+0.032
 stairs+0.032
 hunt+0.032
Neg logit contributions
 years-0.023
lee-0.023
 mad-0.023
 glad-0.023
heart-0.021
Token until
Token ID1566
Feature activation-8.58e-01

Pos logit contributions
lee+0.052
 mad+0.051
 years+0.050
 glad+0.049
heart+0.048
Neg logit contributions
Ready-0.080
 hunt-0.074
 stairs-0.073
Circ-0.073
sc-0.071
Token there
Token ID612
Feature activation-1.66e+01

Pos logit contributions
Ready+0.021
 hunt+0.019
Circ+0.019
 stairs+0.019
sc+0.018
Neg logit contributions
lee-0.021
 mad-0.020
 glad-0.020
 years-0.020
heart-0.019
Token was
Token ID373
Feature activation+1.88e+00

Pos logit contributions
Ready+0.413
 hunt+0.390
Circ+0.380
 stairs+0.371
Find+0.369
Neg logit contributions
lee-0.388
 mad-0.385
 years-0.379
heart-0.377
 glad-0.373
Token nothing
Token ID2147
Feature activation+1.49e+00

Pos logit contributions
lee+0.045
 mad+0.044
 glad+0.044
 years+0.043
heart+0.042
Neg logit contributions
Ready-0.046
 stairs-0.042
Circ-0.042
 hunt-0.042
-0.040
Token left
Token ID1364
Feature activation-6.03e+00

Pos logit contributions
 mad+0.035
 glad+0.034
lee+0.034
 years+0.033
 pleased+0.032
Neg logit contributions
Ready-0.038
Circ-0.035
 hunt-0.034
Find-0.034
 stairs-0.034
Token.
Token ID13
Feature activation-3.96e+00

Pos logit contributions
Ready+0.172
Circ+0.161
Find+0.157
 hunt+0.155
 stairs+0.151
Neg logit contributions
 mad-0.125
lee-0.123
 years-0.123
 glad-0.119
heart-0.118
Token They
Token ID1119
Feature activation+3.10e-01

Pos logit contributions
 hunt+0.089
Ready+0.082
 shortcut+0.080
 stairs+0.078
Circ+0.077
Neg logit contributions
lee-0.107
 glad-0.106
 years-0.099
illy-0.097
 mad-0.096
TokenOnce
Token ID7454
Feature activation-2.67e+00

Pos logit contributions
 hunt+0.243
Find+0.225
Ready+0.225
 shortcut+0.224
 gra+0.216
Neg logit contributions
 years-0.274
illy-0.259
lee-0.258
 glad-0.258
aming-0.249
Token upon
Token ID2402
Feature activation-8.44e+00

Pos logit contributions
Ready+0.055
 stairs+0.050
 hunt+0.050
Circ+0.049
+0.049
Neg logit contributions
lee-0.072
 mad-0.071
 years-0.071
 glad-0.070
heart-0.068
Token a
Token ID257
Feature activation-3.03e+00

Pos logit contributions
Ready+0.184
 hunt+0.169
Circ+0.166
 stairs+0.165
+0.163
Neg logit contributions
lee-0.229
 mad-0.218
 glad-0.213
heart-0.213
 years-0.212
Token time
Token ID640
Feature activation-6.95e+00

Pos logit contributions
Ready+0.052
 hunt+0.049
+0.047
 ride+0.046
Circ+0.045
Neg logit contributions
lee-0.097
years-0.092
 mad-0.090
heart-0.089
 glad-0.087
Token,
Token ID11
Feature activation+2.65e+00

Pos logit contributions
Ready+0.213
 hunt+0.199
Circ+0.197
 stairs+0.196
Find+0.192
Neg logit contributions
lee-0.122
 mad-0.120
 glad-0.118
 years-0.117
heart-0.112
Token there
Token ID612
Feature activation-1.66e+01

Pos logit contributions
lee+0.073
 mad+0.071
 years+0.070
 glad+0.069
heart+0.069
Neg logit contributions
Ready-0.056
 hunt-0.050
 stairs-0.050
-0.050
Circ-0.049
Token was
Token ID373
Feature activation+2.04e+00

Pos logit contributions
Ready+0.390
 hunt+0.354
Circ+0.349
 stairs+0.344
sc+0.339
Neg logit contributions
lee-0.414
 mad-0.413
 glad-0.407
 years-0.406
heart-0.392
Token a
Token ID257
Feature activation-2.92e+00

Pos logit contributions
lee+0.071
heart+0.069
ushing+0.066
ivering+0.066
 mad+0.066
Neg logit contributions
Ready-0.031
-0.029
Circ-0.029
 stairs-0.027
 ride-0.025
Token mom
Token ID1995
Feature activation+2.30e+00

Pos logit contributions
Ready+0.067
 ride+0.063
 hunt+0.062
 hill+0.062
 stairs+0.061
Neg logit contributions
lee-0.075
heart-0.072
years-0.072
 years-0.069
asured-0.069
Tokenmy
Token ID1820
Feature activation+4.92e+00

Pos logit contributions
lee+0.050
years+0.047
 ok+0.046
 pleased+0.046
 mad+0.045
Neg logit contributions
 ride-0.058
 hunt-0.058
 aisle-0.057
-0.057
 stairs-0.055
Token and
Token ID290
Feature activation+3.28e+00

Pos logit contributions
lee+0.120
 mad+0.118
 ok+0.117
years+0.116
 pleased+0.115
Neg logit contributions
 hunt-0.112
 ride-0.109
 aisle-0.105
Ready-0.105
-0.104
Token<|endoftext|>
Token ID50256
Feature activation-1.11e+01

Pos logit contributions
Ready+0.072
 hunt+0.070
 shortcut+0.065
 stairs+0.064
Circ+0.064
Neg logit contributions
lee-0.111
 glad-0.109
 mad-0.105
 years-0.104
illy-0.104
TokenOnce
Token ID7454
Feature activation-2.54e+00

Pos logit contributions
 hunt+0.230
 shortcut+0.216
Ready+0.210
Find+0.209
tr+0.206
Neg logit contributions
 years-0.279
lee-0.270
illy-0.270
 glad-0.268
aming-0.267
Token upon
Token ID2402
Feature activation-8.32e+00

Pos logit contributions
Ready+0.050
 stairs+0.046
+0.046
 hunt+0.046
Circ+0.045
Neg logit contributions
lee-0.070
 mad-0.069
 years-0.069
 glad-0.067
heart-0.067
Token a
Token ID257
Feature activation-3.07e+00

Pos logit contributions
Ready+0.178
 hunt+0.163
Circ+0.160
 stairs+0.158
+0.158
Neg logit contributions
lee-0.230
 mad-0.217
heart-0.215
 glad-0.213
 years-0.213
Token time
Token ID640
Feature activation-6.85e+00

Pos logit contributions
Ready+0.051
 hunt+0.049
+0.047
 ride+0.046
Circ+0.045
Neg logit contributions
lee-0.098
years-0.094
heart-0.091
 mad-0.091
ious-0.088
Token there
Token ID612
Feature activation-1.66e+01

Pos logit contributions
Ready+0.208
 hunt+0.195
Circ+0.193
 stairs+0.192
Find+0.188
Neg logit contributions
lee-0.122
 mad-0.120
 glad-0.117
 years-0.117
heart-0.112
Token was
Token ID373
Feature activation+1.88e+00

Pos logit contributions
Ready+0.384
 hunt+0.350
Circ+0.346
 stairs+0.341
sc+0.334
Neg logit contributions
lee-0.419
 mad-0.418
 glad-0.411
 years-0.409
heart-0.397
Token a
Token ID257
Feature activation-3.09e+00

Pos logit contributions
lee+0.069
heart+0.067
ushing+0.064
ious+0.063
ivering+0.063
Neg logit contributions
-0.024
Ready-0.024
Circ-0.021
 Up-0.020
 ride-0.020
Token friendly
Token ID8030
Feature activation+5.34e+00

Pos logit contributions
Ready+0.075
 ride+0.069
 hunt+0.069
 hill+0.068
 stairs+0.067
Neg logit contributions
lee-0.076
heart-0.073
years-0.073
 years-0.071
aming-0.069
Token bear
Token ID6842
Feature activation-3.95e+00

Pos logit contributions
lee+0.148
 mad+0.147
 glad+0.145
 years+0.144
heart+0.140
Neg logit contributions
Ready-0.109
 hunt-0.099
Circ-0.097
 stairs-0.096
Find-0.093
Token named
Token ID3706
Feature activation+1.81e-01

Pos logit contributions
Ready+0.124
Circ+0.114
 hunt+0.114
 stairs+0.114
Find+0.111
Neg logit contributions
lee-0.068
 mad-0.067
 years-0.067
 glad-0.066
heart-0.062
TokenOnce
Token ID7454
Feature activation-2.56e+00

Pos logit contributions
 hunt+0.221
tr+0.209
 shortcut+0.206
Find+0.201
Ready+0.194
Neg logit contributions
 years-0.300
illy-0.284
aming-0.275
lee-0.274
 glad-0.271
Token upon
Token ID2402
Feature activation-8.39e+00

Pos logit contributions
Ready+0.057
 hunt+0.052
Circ+0.051
 stairs+0.051
Find+0.050
Neg logit contributions
lee-0.067
 mad-0.066
 glad-0.065
 years-0.064
heart-0.063
Token a
Token ID257
Feature activation-3.03e+00

Pos logit contributions
Ready+0.194
 hunt+0.177
Circ+0.175
 stairs+0.173
+0.170
Neg logit contributions
lee-0.214
 mad-0.207
 glad-0.204
 years-0.203
heart-0.200
Token time
Token ID640
Feature activation-6.90e+00

Pos logit contributions
Ready+0.052
+0.047
 hunt+0.047
 ride+0.046
sc+0.045
Neg logit contributions
lee-0.096
years-0.093
heart-0.089
 mad-0.089
ious-0.086
Token,
Token ID11
Feature activation+2.59e+00

Pos logit contributions
Ready+0.215
 hunt+0.201
Circ+0.199
 stairs+0.198
Find+0.194
Neg logit contributions
lee-0.118
 mad-0.117
 glad-0.115
 years-0.113
heart-0.108
Token there
Token ID612
Feature activation-1.66e+01

Pos logit contributions
lee+0.074
 years+0.071
 mad+0.071
heart+0.069
 glad+0.069
Neg logit contributions
Ready-0.052
-0.048
 stairs-0.046
 hunt-0.046
Circ-0.045
Token was
Token ID373
Feature activation+1.86e+00

Pos logit contributions
Ready+0.401
 hunt+0.365
Circ+0.361
 stairs+0.355
sc+0.351
Neg logit contributions
 mad-0.404
lee-0.401
 glad-0.398
 years-0.396
heart-0.381
Token a
Token ID257
Feature activation-2.94e+00

Pos logit contributions
lee+0.068
heart+0.066
ushing+0.063
ious+0.062
ivering+0.062
Neg logit contributions
Ready-0.025
-0.025
Circ-0.022
 stairs-0.022
Under-0.020
Token ch
Token ID442
Feature activation+3.25e+00

Pos logit contributions
Ready+0.068
 ride+0.063
 hill+0.063
 hunt+0.063
 stairs+0.061
Neg logit contributions
lee-0.076
heart-0.073
years-0.072
asured-0.070
 years-0.070
Tokenubby
Token ID38393
Feature activation+6.45e+00

Pos logit contributions
 glad+0.030
lee+0.030
 mad+0.030
 years+0.028
 ok+0.028
Neg logit contributions
Ready-0.120
 stairs-0.115
Circ-0.115
 hunt-0.115
ro-0.115
Token little
Token ID1310
Feature activation+2.48e+00

Pos logit contributions
lee+0.172
 mad+0.172
years+0.169
 years+0.168
 glad+0.167
Neg logit contributions
Ready-0.132
 hunt-0.126
 stairs-0.123
Circ-0.121
 ride-0.119
Token<|endoftext|>
Token ID50256
Feature activation-1.11e+01

Pos logit contributions
 hunt+0.069
Ready+0.069
 stairs+0.066
 shortcut+0.064
Circ+0.063
Neg logit contributions
 glad-0.103
lee-0.102
 years-0.097
 mad-0.095
illy-0.095
TokenOnce
Token ID7454
Feature activation-2.62e+00

Pos logit contributions
 hunt+0.221
 shortcut+0.207
Find+0.202
tr+0.198
Ready+0.197
Neg logit contributions
 years-0.286
lee-0.278
illy-0.276
aming-0.276
 glad-0.268
Token upon
Token ID2402
Feature activation-8.23e+00

Pos logit contributions
Ready+0.054
 hunt+0.050
 stairs+0.049
Circ+0.049
+0.048
Neg logit contributions
lee-0.070
 mad-0.070
 years-0.069
 glad-0.068
heart-0.067
Token a
Token ID257
Feature activation-3.03e+00

Pos logit contributions
Ready+0.182
 hunt+0.167
Circ+0.163
 stairs+0.161
+0.160
Neg logit contributions
lee-0.222
 mad-0.210
heart-0.206
 glad-0.206
 years-0.206
Token time
Token ID640
Feature activation-6.82e+00

Pos logit contributions
Ready+0.050
 hunt+0.047
+0.045
 ride+0.045
sc+0.044
Neg logit contributions
lee-0.099
years-0.094
heart-0.091
 mad-0.090
ious-0.088
Token there
Token ID612
Feature activation-1.66e+01

Pos logit contributions
Ready+0.211
 hunt+0.197
Circ+0.195
 stairs+0.194
Find+0.190
Neg logit contributions
lee-0.119
 mad-0.117
 glad-0.115
 years-0.114
heart-0.109
Token was
Token ID373
Feature activation+1.98e+00

Pos logit contributions
Ready+0.391
 hunt+0.356
Circ+0.352
 stairs+0.346
sc+0.341
Neg logit contributions
lee-0.412
 mad-0.412
 glad-0.404
 years-0.403
heart-0.391
Token a
Token ID257
Feature activation-3.06e+00

Pos logit contributions
lee+0.070
heart+0.068
ushing+0.066
 mad+0.065
ivering+0.064
Neg logit contributions
Ready-0.028
-0.028
Circ-0.026
 ride-0.024
 stairs-0.023
Token young
Token ID1862
Feature activation+4.34e+00

Pos logit contributions
Ready+0.074
 hunt+0.069
 ride+0.068
 hill+0.067
 stairs+0.067
Neg logit contributions
lee-0.076
heart-0.072
years-0.072
 years-0.070
aming-0.069
Token man
Token ID582
Feature activation-2.97e+00

Pos logit contributions
lee+0.123
years+0.120
 mad+0.119
 years+0.118
 glad+0.117
Neg logit contributions
Ready-0.083
 stairs-0.079
 hunt-0.078
 hill-0.076
Circ-0.076
Token,
Token ID11
Feature activation+2.69e+00

Pos logit contributions
Ready+0.082
 hunt+0.079
Circ+0.076
 stairs+0.076
+0.075
Neg logit contributions
lee-0.060
 mad-0.059
 glad-0.057
 years-0.056
heart-0.056
Token<|endoftext|>
Token ID50256
Feature activation-1.15e+01

Pos logit contributions
 glad+0.234
lee+0.233
 years+0.229
 mad+0.223
 pleased+0.216
Neg logit contributions
Ready-0.169
Circ-0.156
 hunt-0.155
 stairs-0.152
sc-0.151
TokenOnce
Token ID7454
Feature activation-2.76e+00

Pos logit contributions
 hunt+0.202
tr+0.198
 shortcut+0.187
Find+0.186
Circ+0.178
Neg logit contributions
 years-0.331
illy-0.324
lee-0.321
aming-0.308
 brothers-0.300
Token upon
Token ID2402
Feature activation-8.45e+00

Pos logit contributions
Ready+0.061
 hunt+0.055
Circ+0.054
 stairs+0.054
Find+0.052
Neg logit contributions
lee-0.072
 mad-0.072
 glad-0.071
 years-0.070
heart-0.068
Token a
Token ID257
Feature activation-3.05e+00

Pos logit contributions
Ready+0.193
 hunt+0.176
Circ+0.174
 stairs+0.172
+0.170
Neg logit contributions
lee-0.218
 mad-0.211
 glad-0.207
 years-0.206
heart-0.203
Token time
Token ID640
Feature activation-6.97e+00

Pos logit contributions
Ready+0.051
+0.047
 hunt+0.046
 ride+0.045
sc+0.044
Neg logit contributions
lee-0.098
years-0.095
 mad-0.090
heart-0.090
ious-0.088
Token there
Token ID612
Feature activation-1.66e+01

Pos logit contributions
Ready+0.213
 hunt+0.199
Circ+0.197
 stairs+0.196
Find+0.192
Neg logit contributions
lee-0.124
 mad-0.122
 glad-0.119
 years-0.118
heart-0.113
Token was
Token ID373
Feature activation+1.88e+00

Pos logit contributions
Ready+0.389
 hunt+0.354
Circ+0.351
 stairs+0.346
sc+0.339
Neg logit contributions
lee-0.413
 mad-0.413
 glad-0.405
 years-0.405
heart-0.392
Token a
Token ID257
Feature activation-3.00e+00

Pos logit contributions
lee+0.068
heart+0.065
ushing+0.063
ious+0.062
ivering+0.062
Neg logit contributions
-0.026
Ready-0.026
Circ-0.023
 Up-0.022
 stairs-0.021
Token knight
Token ID22062
Feature activation-4.18e+00

Pos logit contributions
Ready+0.073
 hunt+0.067
 ride+0.067
 hill+0.067
 stairs+0.065
Neg logit contributions
lee-0.073
years-0.070
heart-0.070
 years-0.068
aming-0.067
Token.
Token ID13
Feature activation-4.66e+00

Pos logit contributions
Ready+0.129
Circ+0.118
 stairs+0.117
Find+0.115
 hunt+0.114
Neg logit contributions
 years-0.077
lee-0.075
 mad-0.075
 glad-0.073
heart-0.070
Token He
Token ID679
Feature activation+1.89e+00

Pos logit contributions
sc+0.127
Ready+0.125
 shortcut+0.119
 hunt+0.118
ground+0.116
Neg logit contributions
 years-0.088
lee-0.087
 glad-0.087
 enthusiastic-0.085
 mad-0.084
TokenOnce
Token ID7454
Feature activation-2.59e+00

Pos logit contributions
 hunt+0.226
Find+0.212
Ready+0.211
 shortcut+0.210
tr+0.206
Neg logit contributions
 years-0.282
lee-0.270
illy-0.270
 glad-0.269
aming-0.262
Token upon
Token ID2402
Feature activation-8.22e+00

Pos logit contributions
Ready+0.054
 hunt+0.049
 stairs+0.049
Circ+0.048
+0.048
Neg logit contributions
lee-0.069
 mad-0.069
 years-0.068
 glad-0.067
heart-0.066
Token a
Token ID257
Feature activation-2.87e+00

Pos logit contributions
Ready+0.176
 hunt+0.162
Circ+0.159
+0.157
 stairs+0.156
Neg logit contributions
lee-0.227
 mad-0.215
heart-0.211
 glad-0.210
 years-0.209
Token time
Token ID640
Feature activation-6.75e+00

Pos logit contributions
Ready+0.049
 hunt+0.046
+0.044
sc+0.043
 ride+0.043
Neg logit contributions
lee-0.092
years-0.087
 mad-0.085
heart-0.084
 glad-0.082
Token,
Token ID11
Feature activation+2.68e+00

Pos logit contributions
Ready+0.204
 hunt+0.191
Circ+0.188
 stairs+0.188
+0.184
Neg logit contributions
lee-0.122
 mad-0.119
 glad-0.116
 years-0.116
heart-0.111
Token there
Token ID612
Feature activation-1.66e+01

Pos logit contributions
lee+0.075
 mad+0.072
 years+0.072
heart+0.071
 glad+0.070
Neg logit contributions
Ready-0.055
-0.050
 stairs-0.049
 hunt-0.049
Circ-0.049
Token was
Token ID373
Feature activation+2.14e+00

Pos logit contributions
Ready+0.397
 hunt+0.360
Circ+0.356
 stairs+0.350
sc+0.346
Neg logit contributions
lee-0.404
 mad-0.403
 glad-0.398
 years-0.397
heart-0.383
Token a
Token ID257
Feature activation-2.80e+00

Pos logit contributions
lee+0.076
heart+0.074
ushing+0.072
ivering+0.072
ious+0.070
Neg logit contributions
Ready-0.031
-0.030
Circ-0.028
 stairs-0.026
 Up-0.025
Token bright
Token ID6016
Feature activation+3.80e+00

Pos logit contributions
Ready+0.064
 ride+0.060
 hunt+0.059
 hill+0.059
 stairs+0.058
Neg logit contributions
lee-0.072
heart-0.070
years-0.069
asured-0.067
aming-0.067
Token blue
Token ID4171
Feature activation+5.39e-01

Pos logit contributions
years+0.115
lee+0.106
 ok+0.105
 mad+0.104
aming+0.103
Neg logit contributions
 hill-0.071
 ride-0.071
 stairs-0.069
 road-0.065
 aisle-0.065
Tokenberry
Token ID8396
Feature activation-4.57e+00

Pos logit contributions
lee+0.015
 mad+0.015
 years+0.015
 glad+0.014
heart+0.014
Neg logit contributions
Ready-0.011
 hunt-0.010
Circ-0.010
 stairs-0.010
Find-0.010
Token<|endoftext|>
Token ID50256
Feature activation-1.11e+01

Pos logit contributions
lee+0.064
 mad+0.063
 years+0.062
 glad+0.061
years+0.060
Neg logit contributions
Ready-0.084
 hunt-0.077
 stairs-0.076
Circ-0.076
-0.074
TokenOnce
Token ID7454
Feature activation-2.56e+00

Pos logit contributions
 hunt+0.195
tr+0.186
 shortcut+0.184
Circ+0.172
Find+0.168
Neg logit contributions
 years-0.330
illy-0.319
lee-0.315
aming-0.313
 glad-0.295
Token upon
Token ID2402
Feature activation-8.22e+00

Pos logit contributions
Ready+0.056
 hunt+0.051
Circ+0.050
 stairs+0.050
Find+0.049
Neg logit contributions
lee-0.067
 mad-0.066
 glad-0.065
 years-0.065
heart-0.063
Token a
Token ID257
Feature activation-3.09e+00

Pos logit contributions
Ready+0.185
 hunt+0.168
Circ+0.166
 stairs+0.164
+0.163
Neg logit contributions
lee-0.216
 mad-0.207
 glad-0.203
 years-0.203
heart-0.202
Token time
Token ID640
Feature activation-6.86e+00

Pos logit contributions
Ready+0.054
+0.048
 hunt+0.046
sc+0.046
 ride+0.046
Neg logit contributions
lee-0.098
years-0.094
 mad-0.091
heart-0.091
 glad-0.088
Token there
Token ID612
Feature activation-1.66e+01

Pos logit contributions
Ready+0.204
 hunt+0.191
 stairs+0.189
Circ+0.188
+0.186
Neg logit contributions
lee-0.126
 mad-0.124
 years-0.121
 glad-0.120
heart-0.116
Token was
Token ID373
Feature activation+1.89e+00

Pos logit contributions
Ready+0.406
 hunt+0.372
Circ+0.369
 stairs+0.361
sc+0.359
Neg logit contributions
 mad-0.397
lee-0.394
 years-0.389
 glad-0.389
heart-0.375
Token a
Token ID257
Feature activation-3.10e+00

Pos logit contributions
lee+0.066
heart+0.064
ushing+0.062
ious+0.062
ivering+0.061
Neg logit contributions
-0.027
Ready-0.027
Circ-0.024
 stairs-0.024
 ride-0.023
Token girl
Token ID2576
Feature activation+1.58e+00

Pos logit contributions
Ready+0.074
 ride+0.069
 hill+0.068
 hunt+0.068
 stairs+0.066
Neg logit contributions
lee-0.077
heart-0.073
years-0.073
asured-0.071
 years-0.071
Token named
Token ID3706
Feature activation+3.50e-01

Pos logit contributions
lee+0.036
 mad+0.036
 pleased+0.035
heart+0.034
 ok+0.034
Neg logit contributions
 hunt-0.037
Ready-0.036
-0.035
 ride-0.035
 stairs-0.035
Token Maria
Token ID14200
Feature activation+3.79e+00

Pos logit contributions
lee+0.007
 mad+0.007
 glad+0.007
 years+0.007
illy+0.007
Neg logit contributions
Ready-0.010
 hunt-0.009
Circ-0.009
 stairs-0.009
Find-0.009
Token<|endoftext|>
Token ID50256
Feature activation-1.08e+01

Pos logit contributions
 hunt+0.059
Ready+0.059
 shortcut+0.055
 stairs+0.053
Circ+0.053
Neg logit contributions
lee-0.089
 glad-0.089
 years-0.085
illy-0.082
heart-0.082
TokenOnce
Token ID7454
Feature activation-2.30e+00

Pos logit contributions
 hunt+0.244
Find+0.235
Ready+0.233
 shortcut+0.232
Circ+0.217
Neg logit contributions
 years-0.256
 glad-0.240
aming-0.230
illy-0.229
lee-0.229
Token upon
Token ID2402
Feature activation-8.09e+00

Pos logit contributions
Ready+0.048
 hunt+0.043
 stairs+0.043
Circ+0.043
+0.043
Neg logit contributions
lee-0.062
 mad-0.061
 years-0.061
 glad-0.060
heart-0.059
Token a
Token ID257
Feature activation-2.83e+00

Pos logit contributions
Ready+0.178
 hunt+0.164
Circ+0.160
 stairs+0.158
+0.157
Neg logit contributions
lee-0.219
 mad-0.207
heart-0.203
 glad-0.203
 years-0.202
Token time
Token ID640
Feature activation-6.60e+00

Pos logit contributions
Ready+0.046
 hunt+0.044
+0.042
 ride+0.042
Circ+0.041
Neg logit contributions
lee-0.093
years-0.088
heart-0.085
 mad-0.084
 glad-0.082
Token there
Token ID612
Feature activation-1.65e+01

Pos logit contributions
Ready+0.193
 hunt+0.182
 stairs+0.179
Circ+0.179
+0.177
Neg logit contributions
lee-0.124
 mad-0.121
 years-0.118
 glad-0.117
heart-0.114
Token was
Token ID373
Feature activation+2.15e+00

Pos logit contributions
Ready+0.398
 hunt+0.362
Circ+0.359
 stairs+0.352
sc+0.348
Neg logit contributions
 mad-0.402
lee-0.401
 glad-0.395
 years-0.394
heart-0.383
Token a
Token ID257
Feature activation-2.89e+00

Pos logit contributions
lee+0.077
heart+0.075
ushing+0.073
 mad+0.071
ivering+0.071
Neg logit contributions
Ready-0.030
-0.030
Circ-0.028
 stairs-0.025
 ride-0.025
Token plane
Token ID6614
Feature activation-9.25e+00

Pos logit contributions
Ready+0.068
 hunt+0.063
 ride+0.063
 hill+0.063
 stairs+0.061
Neg logit contributions
lee-0.074
heart-0.071
years-0.070
 years-0.068
asured-0.067
Token in
Token ID287
Feature activation-1.18e+01

Pos logit contributions
Ready+0.310
Circ+0.282
Find+0.271
 shortcut+0.263
 stairs+0.262
Neg logit contributions
 years-0.173
 glad-0.156
 mad-0.154
ivering-0.146
lee-0.146
Token the
Token ID262
Feature activation-7.13e+00

Pos logit contributions
Ready+0.323
sc+0.306
 hunt+0.293
Circ+0.288
 shortcut+0.287
Neg logit contributions
 years-0.253
 mad-0.250
 glad-0.248
lee-0.235
 careful-0.229
Token friends
Token ID2460
Feature activation-1.01e+00

Pos logit contributions
lee+0.110
 years+0.103
 mad+0.102
 glad+0.102
heart+0.102
Neg logit contributions
Ready-0.188
 hunt-0.177
 stairs-0.174
Circ-0.174
Find-0.170
Token.
Token ID13
Feature activation+6.86e-02

Pos logit contributions
Ready+0.026
 hunt+0.024
Circ+0.024
 stairs+0.024
Find+0.023
Neg logit contributions
 mad-0.022
lee-0.022
 years-0.022
 glad-0.022
heart-0.021
Token They
Token ID1119
Feature activation+4.20e+00

Pos logit contributions
 glad+0.002
 mad+0.002
 years+0.002
lee+0.002
 okay+0.002
Neg logit contributions
 hunt-0.002
Ready-0.001
 gra-0.001
 stairs-0.001
 shortcut-0.001
Token say
Token ID910
Feature activation+8.67e+00

Pos logit contributions
lee+0.102
 mad+0.099
 glad+0.098
 years+0.098
heart+0.096
Neg logit contributions
Ready-0.102
 hunt-0.094
 stairs-0.091
Circ-0.090
Find-0.089
Token "
Token ID366
Feature activation-2.63e+00

Pos logit contributions
 glad+0.217
 mad+0.215
lee+0.212
 okay+0.211
 sorry+0.208
Neg logit contributions
Ready-0.210
 hunt-0.197
 stairs-0.189
Circ-0.189
sc-0.188
TokenGood
Token ID10248
Feature activation+1.34e+01

Pos logit contributions
 hunt+0.059
 stairs+0.058
 gra+0.056
Ready+0.055
 shortcut+0.055
Neg logit contributions
 mad-0.067
 years-0.062
heart-0.061
 ok-0.060
aming-0.060
Tokenbye
Token ID16390
Feature activation+8.60e+00

Pos logit contributions
lee+0.352
years+0.343
 glad+0.328
 years+0.324
illy+0.320
Neg logit contributions
Ready-0.318
 hunt-0.297
 ride-0.285
-0.284
Circ-0.283
Token!"
Token ID2474
Feature activation-4.20e-02

Pos logit contributions
lee+0.144
 years+0.143
 mad+0.138
heart+0.137
 glad+0.133
Neg logit contributions
Ready-0.268
 stairs-0.252
 gra-0.252
 hunt-0.249
Find-0.248
Token They
Token ID1119
Feature activation+4.54e+00

Pos logit contributions
Ready+0.001
 hunt+0.001
Circ+0.001
Find+0.001
 gra+0.001
Neg logit contributions
lee-0.001
 mad-0.001
 glad-0.001
 years-0.001
 pleased-0.001
Token go
Token ID467
Feature activation+6.41e-01

Pos logit contributions
 mad+0.128
lee+0.125
 glad+0.124
 years+0.122
 ok+0.118
Neg logit contributions
Ready-0.091
Circ-0.085
 stairs-0.083
 hunt-0.080
Find-0.080
Token to
Token ID284
Feature activation-6.76e+00

Pos logit contributions
 mad+0.013
 glad+0.013
 careful+0.012
 years+0.012
lee+0.012
Neg logit contributions
Ready-0.018
Circ-0.017
 gra-0.017
Find-0.017
br-0.016
Token
Token ID6184
Feature activation+9.61e+00

Pos logit contributions
lee+0.182
 mad+0.180
 years+0.177
 glad+0.176
heart+0.172
Neg logit contributions
Ready-0.122
 hunt-0.108
Circ-0.106
 stairs-0.105
-0.103
Token
Token ID95
Feature activation+8.60e+00

Pos logit contributions
lee+0.263
 mad+0.258
 glad+0.257
 years+0.256
illy+0.252
Neg logit contributions
Ready-0.199
 stairs-0.186
 hunt-0.186
Circ-0.185
sc-0.177
Token
Token ID26391
Feature activation+1.25e+01

Pos logit contributions
lee+0.203
 years+0.201
illy+0.199
 glad+0.196
heart+0.195
Neg logit contributions
 stairs-0.200
Ready-0.199
 hunt-0.196
Circ-0.195
 shortcut-0.189
Token
Token ID129
Feature activation+9.60e+00

Pos logit contributions
lee+0.383
 glad+0.377
 years+0.366
 mad+0.356
 pleased+0.350
Neg logit contributions
Ready-0.234
 stairs-0.210
Circ-0.209
 hunt-0.207
sc-0.205
Token
Token ID241
Feature activation+1.08e+00

Pos logit contributions
lee+0.230
 mad+0.225
 glad+0.222
 years+0.222
heart+0.212
Neg logit contributions
Ready-0.236
 hunt-0.218
Circ-0.216
 stairs-0.215
Find-0.209
TokenOkay
Token ID16454
Feature activation+1.36e+01

Pos logit contributions
 glad+0.031
 mad+0.031
 years+0.030
lee+0.030
heart+0.029
Neg logit contributions
 hunt-0.020
Ready-0.020
Circ-0.018
 stairs-0.018
 shortcut-0.018
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
 mad+0.227
 glad+0.226
 years+0.222
lee+0.221
heart+0.219
Neg logit contributions
Ready-0.426
Circ-0.411
 hunt-0.401
Find-0.397
 stairs-0.393
Tokenâ
Token ID22940
Feature activation+1.29e+01

Pos logit contributions
 hunt+0.029
Ready+0.028
 stairs+0.028
Circ+0.027
 shortcut+0.026
Neg logit contributions
 glad-0.030
lee-0.030
 ok-0.030
heart-0.029
 okay-0.029
Token
Token ID26391
Feature activation+1.07e+01

Pos logit contributions
 years+0.292
 mad+0.291
lee+0.290
 glad+0.287
heart+0.287
Neg logit contributions
Ready-0.316
 stairs-0.311
Circ-0.308
 hunt-0.305
Find-0.296
Token
Token ID198
Feature activation-6.81e+00

Pos logit contributions
lee+0.293
 glad+0.292
 years+0.285
 mad+0.269
 careful+0.267
Neg logit contributions
Ready-0.225
Circ-0.209
 stairs-0.209
 hunt-0.208
sc-0.203
Token
Token ID198
Feature activation-6.97e+00

Pos logit contributions
 hunt+0.122
Ready+0.117
 stairs+0.108
Circ+0.108
 shortcut+0.104
Neg logit contributions
 glad-0.207
 years-0.203
lee-0.203
illy-0.198
 mad-0.197
Token is
Token ID318
Feature activation+5.83e+00

Pos logit contributions
Ready+0.053
Circ+0.048
 hunt+0.048
 stairs+0.047
Find+0.046
Neg logit contributions
 mad-0.069
lee-0.068
 glad-0.067
 years-0.067
heart-0.066
Token.
Token ID13
Feature activation+8.80e-01

Pos logit contributions
lee+0.138
years+0.135
 years+0.130
illy+0.129
 glad+0.129
Neg logit contributions
Ready-0.144
 hunt-0.140
 stairs-0.132
-0.130
sc-0.129
Token He
Token ID679
Feature activation+5.63e+00

Pos logit contributions
 glad+0.020
lee+0.019
 okay+0.019
 ok+0.019
 years+0.018
Neg logit contributions
 hunt-0.022
 stairs-0.021
Ready-0.020
 shortcut-0.020
 aisle-0.020
Token is
Token ID318
Feature activation+5.13e+00

Pos logit contributions
lee+0.151
 mad+0.151
 glad+0.149
 years+0.149
heart+0.143
Neg logit contributions
Ready-0.119
Circ-0.107
 hunt-0.107
 stairs-0.106
Find-0.103
Token a
Token ID257
Feature activation+6.79e-01

Pos logit contributions
lee+0.110
heart+0.107
illy+0.102
 years+0.102
 mad+0.101
Neg logit contributions
Ready-0.140
 hunt-0.132
 stairs-0.127
 gra-0.124
Circ-0.124
Token good
Token ID922
Feature activation+1.33e+01

Pos logit contributions
lee+0.015
 mad+0.014
 years+0.014
 glad+0.014
heart+0.014
Neg logit contributions
Ready-0.018
 hunt-0.017
Circ-0.017
 stairs-0.017
Find-0.016
Token dog
Token ID3290
Feature activation-1.93e+00

Pos logit contributions
lee+0.349
 mad+0.342
 glad+0.338
 years+0.338
heart+0.326
Neg logit contributions
Ready-0.299
 hunt-0.273
Circ-0.267
 stairs-0.266
Find-0.257
Token.
Token ID13
Feature activation-9.59e-02

Pos logit contributions
Ready+0.055
Circ+0.050
 stairs+0.049
Find+0.049
 hunt+0.049
Neg logit contributions
 mad-0.039
 years-0.039
 glad-0.038
lee-0.038
heart-0.036
Token He
Token ID679
Feature activation+4.77e+00

Pos logit contributions
Ready+0.002
 hunt+0.002
 stairs+0.002
Circ+0.002
 aisle+0.002
Neg logit contributions
 glad-0.002
lee-0.002
 years-0.002
 mad-0.002
 ok-0.002
Token will
Token ID481
Feature activation+2.30e+00

Pos logit contributions
 mad+0.117
 years+0.115
 glad+0.115
lee+0.113
 careful+0.109
Neg logit contributions
Ready-0.113
Circ-0.105
 stairs-0.104
Find-0.099
ground-0.098
Token not
Token ID407
Feature activation+7.16e+00

Pos logit contributions
 mad+0.060
 glad+0.058
 years+0.057
 careful+0.056
 okay+0.055
Neg logit contributions
Ready-0.053
Circ-0.050
 stairs-0.049
-0.048
sc-0.047
Token happy
Token ID3772
Feature activation+1.37e+01

Pos logit contributions
heart+0.083
lee+0.081
illy+0.078
 years+0.072
years+0.072
Neg logit contributions
Ready-0.159
 hunt-0.151
 stairs-0.148
-0.147
 gra-0.147
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
lee+0.309
 mad+0.303
 glad+0.296
 years+0.292
heart+0.291
Neg logit contributions
Ready-0.349
 hunt-0.328
 stairs-0.322
Circ-0.320
-0.315
Token The
Token ID383
Feature activation+5.49e-01

Pos logit contributions
 glad+0.007
 years+0.007
lee+0.007
 mad+0.007
 okay+0.007
Neg logit contributions
 hunt-0.005
Ready-0.005
 shortcut-0.005
 stairs-0.005
sc-0.005
Token birds
Token ID10087
Feature activation-4.33e+00

Pos logit contributions
 mad+0.013
 years+0.012
 glad+0.012
lee+0.012
 careful+0.011
Neg logit contributions
Ready-0.014
Circ-0.013
sc-0.013
 hunt-0.013
ground-0.013
Token are
Token ID389
Feature activation+5.25e+00

Pos logit contributions
Ready+0.098
Circ+0.090
 stairs+0.087
 hunt+0.084
sc+0.084
Neg logit contributions
 mad-0.116
 glad-0.115
 years-0.112
lee-0.111
 pleased-0.106
Token happy
Token ID3772
Feature activation+1.38e+01

Pos logit contributions
heart+0.075
lee+0.073
illy+0.072
years+0.070
 years+0.068
Neg logit contributions
Ready-0.168
 hunt-0.166
 stairs-0.161
 gra-0.160
 ride-0.160
Token.
Token ID13
Feature activation+3.60e-01

Pos logit contributions
lee+0.296
 mad+0.285
heart+0.279
years+0.278
illy+0.278
Neg logit contributions
Ready-0.365
 hunt-0.351
 stairs-0.345
-0.342
Circ-0.339
Token The
Token ID383
Feature activation+6.99e-01

Pos logit contributions
 glad+0.010
lee+0.010
 mad+0.010
 okay+0.009
 years+0.009
Neg logit contributions
 hunt-0.007
Ready-0.007
 shortcut-0.007
 stairs-0.007
sc-0.007
Token man
Token ID582
Feature activation+1.09e+00

Pos logit contributions
 mad+0.017
 years+0.017
 glad+0.016
lee+0.016
 careful+0.015
Neg logit contributions
Ready-0.017
Circ-0.016
sc-0.016
 hunt-0.016
ground-0.015
Token is
Token ID318
Feature activation+5.14e+00

Pos logit contributions
 mad+0.027
lee+0.026
 glad+0.026
 years+0.026
 pleased+0.025
Neg logit contributions
Ready-0.026
Circ-0.024
 stairs-0.024
 hunt-0.023
Find-0.023
Token happy
Token ID3772
Feature activation+1.41e+01

Pos logit contributions
lee+0.068
heart+0.067
illy+0.062
 years+0.061
years+0.060
Neg logit contributions
Ready-0.177
 hunt-0.170
-0.167
 gra-0.165
 ride-0.165
Token say
Token ID910
Feature activation+9.88e+00

Pos logit contributions
 mad+0.197
lee+0.196
 glad+0.193
 years+0.189
heart+0.182
Neg logit contributions
Ready-0.242
Circ-0.223
 hunt-0.220
 stairs-0.220
Find-0.216
Token yes
Token ID3763
Feature activation+1.42e+01

Pos logit contributions
lee+0.219
 mad+0.214
 years+0.212
 glad+0.209
heart+0.205
Neg logit contributions
Ready-0.257
 hunt-0.236
Circ-0.233
 stairs-0.232
-0.228
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
lee+0.392
years+0.387
 glad+0.375
 mad+0.371
aming+0.361
Neg logit contributions
Ready-0.285
 hunt-0.272
 ride-0.261
 stairs-0.256
sc-0.252
Token They
Token ID1119
Feature activation+4.69e+00

Pos logit contributions
 glad+0.037
lee+0.037
 mad+0.035
 years+0.034
 pleased+0.034
Neg logit contributions
Ready-0.029
 hunt-0.028
 shortcut-0.027
Circ-0.027
 stairs-0.026
Token are
Token ID389
Feature activation+5.80e+00

Pos logit contributions
 mad+0.112
lee+0.111
 glad+0.110
 years+0.107
 careful+0.102
Neg logit contributions
Ready-0.114
Circ-0.106
 stairs-0.105
 hunt-0.102
ground-0.101
Token happy
Token ID3772
Feature activation+1.35e+01

Pos logit contributions
heart+0.100
illy+0.093
lee+0.090
ious+0.086
ivering+0.083
Neg logit contributions
Ready-0.174
 ride-0.170
 gra-0.170
 stairs-0.167
-0.164
Token.
Token ID13
Feature activation+5.81e-01

Pos logit contributions
lee+0.358
years+0.349
 mad+0.346
aming+0.341
illy+0.339
Neg logit contributions
Ready-0.267
-0.263
 hunt-0.259
 stairs-0.258
Circ-0.250
Token They
Token ID1119
Feature activation+4.04e+00

Pos logit contributions
 glad+0.017
lee+0.017
 years+0.016
 mad+0.016
 pleased+0.015
Neg logit contributions
Ready-0.011
 hunt-0.011
 shortcut-0.010
 stairs-0.010
Circ-0.010
Token like
Token ID588
Feature activation+5.46e+00

Pos logit contributions
 mad+0.094
lee+0.094
 glad+0.093
 years+0.091
 pleased+0.086
Neg logit contributions
Ready-0.100
Circ-0.093
 stairs-0.092
 hunt-0.091
sc-0.089
Token to
Token ID284
Feature activation-8.07e+00

Pos logit contributions
 mad+0.150
 glad+0.149
lee+0.142
 years+0.142
 careful+0.142
Neg logit contributions
Ready-0.123
sc-0.110
ground-0.107
 hunt-0.106
Find-0.104
Token play
Token ID711
Feature activation-3.98e+00

Pos logit contributions
Ready+0.160
Circ+0.144
sc+0.141
 stairs+0.140
+0.135
Neg logit contributions
 mad-0.239
 years-0.232
lee-0.231
 glad-0.231
 careful-0.220
Token The
Token ID383
Feature activation-2.39e+00

Pos logit contributions
Ready+0.068
 hunt+0.066
 shortcut+0.065
rob+0.060
sc+0.060
Neg logit contributions
lee-0.069
 glad-0.064
 years-0.063
 mad-0.062
 enthusiastic-0.062
Token rain
Token ID6290
Feature activation-7.94e-01

Pos logit contributions
Ready+0.055
 stairs+0.051
 hunt+0.051
+0.049
Circ+0.049
Neg logit contributions
lee-0.061
 mad-0.059
years-0.058
 years-0.057
 glad-0.057
Token was
Token ID373
Feature activation+3.23e+00

Pos logit contributions
Ready+0.020
 hunt+0.018
Circ+0.018
Find+0.017
 gra+0.017
Neg logit contributions
lee-0.019
 mad-0.019
 years-0.018
 glad-0.018
heart-0.017
Token so
Token ID523
Feature activation+6.56e+00

Pos logit contributions
lee+0.061
heart+0.058
illy+0.058
years+0.056
 years+0.056
Neg logit contributions
Ready-0.091
 stairs-0.091
 hunt-0.086
 shortcut-0.086
Find-0.085
Token heavy
Token ID4334
Feature activation+5.27e+00

Pos logit contributions
lee+0.136
illy+0.135
heart+0.134
years+0.131
 years+0.130
Neg logit contributions
 stairs-0.172
Ready-0.171
-0.161
 gra-0.157
 hills-0.156
Token,
Token ID11
Feature activation+4.15e+00

Pos logit contributions
lee+0.098
 mad+0.097
 years+0.095
 glad+0.095
heart+0.090
Neg logit contributions
Ready-0.157
Circ-0.145
 hunt-0.144
Find-0.142
 stairs-0.141
Token it
Token ID340
Feature activation-5.61e+00

Pos logit contributions
lee+0.103
years+0.098
 mad+0.097
 years+0.096
 glad+0.096
Neg logit contributions
Ready-0.097
 stairs-0.091
 hunt-0.090
Circ-0.086
sc-0.086
Token replaced
Token ID6928
Feature activation-1.84e-01

Pos logit contributions
Ready+0.143
 hunt+0.131
Circ+0.130
 stairs+0.127
Find+0.125
Neg logit contributions
lee-0.129
 mad-0.129
 years-0.125
 glad-0.125
heart-0.119
Token the
Token ID262
Feature activation-5.49e+00

Pos logit contributions
Ready+0.004
 hunt+0.004
 stairs+0.004
Circ+0.004
+0.004
Neg logit contributions
lee-0.005
 mad-0.004
 glad-0.004
 years-0.004
years-0.004
Token steam
Token ID13324
Feature activation+2.86e-01

Pos logit contributions
Ready+0.117
 hunt+0.107
 stairs+0.105
Circ+0.104
+0.101
Neg logit contributions
lee-0.150
 mad-0.146
 glad-0.144
 years-0.144
heart-0.141
Token.
Token ID13
Feature activation-2.73e+00

Pos logit contributions
lee+0.005
 years+0.005
 mad+0.005
 glad+0.005
heart+0.005
Neg logit contributions
Ready-0.009
Circ-0.008
Find-0.008
 hunt-0.008
East-0.008
Token laughed
Token ID13818
Feature activation+6.29e+00

Pos logit contributions
lee+0.020
 mad+0.019
 glad+0.019
 years+0.019
heart+0.018
Neg logit contributions
Ready-0.024
 hunt-0.022
Circ-0.022
 stairs-0.022
Find-0.021
Token with
Token ID351
Feature activation-3.57e+00

Pos logit contributions
lee+0.144
 mad+0.143
 years+0.142
years+0.139
 glad+0.138
Neg logit contributions
Ready-0.152
 hunt-0.140
 stairs-0.138
Circ-0.137
-0.136
Token delight
Token ID10974
Feature activation+6.57e+00

Pos logit contributions
Ready+0.114
 hunt+0.106
Circ+0.106
 stairs+0.105
 gra+0.103
Neg logit contributions
lee-0.057
 mad-0.055
heart-0.053
 years-0.052
illy-0.052
Token.
Token ID13
Feature activation-2.43e+00

Pos logit contributions
 mad+0.162
 glad+0.159
lee+0.159
 years+0.153
years+0.153
Neg logit contributions
Ready-0.143
 hunt-0.137
 stairs-0.137
ground-0.130
Circ-0.130
Token She
Token ID1375
Feature activation+3.76e+00

Pos logit contributions
 hunt+0.069
 shortcut+0.064
Ready+0.063
sc+0.062
Circ+0.060
Neg logit contributions
lee-0.055
 glad-0.053
years-0.047
illy-0.047
 pleased-0.046
Token eventually
Token ID4191
Feature activation+3.94e+00

Pos logit contributions
 mad+0.083
lee+0.083
 glad+0.081
 years+0.080
heart+0.079
Neg logit contributions
Ready-0.097
 hunt-0.090
Circ-0.087
 stairs-0.087
Find-0.086
Token succeeded
Token ID14131
Feature activation+1.08e+00

Pos logit contributions
lee+0.086
 mad+0.085
 glad+0.083
 years+0.082
 pleased+0.079
Neg logit contributions
Ready-0.106
Circ-0.098
 hunt-0.097
 stairs-0.096
 gra-0.094
Token in
Token ID287
Feature activation-1.06e+01

Pos logit contributions
lee+0.022
 mad+0.022
 glad+0.021
 years+0.021
heart+0.020
Neg logit contributions
Ready-0.030
 hunt-0.028
Circ-0.028
 stairs-0.028
Find-0.027
Token flying
Token ID7348
Feature activation-4.56e+00

Pos logit contributions
Ready+0.313
Circ+0.294
sc+0.287
 hunt+0.278
Find+0.271
Neg logit contributions
 mad-0.217
 years-0.216
 glad-0.216
lee-0.210
 pleased-0.196
Token it
Token ID340
Feature activation-5.62e+00

Pos logit contributions
Ready+0.136
Circ+0.129
 hunt+0.125
Find+0.125
 gra+0.121
Neg logit contributions
 mad-0.086
 years-0.085
lee-0.084
heart-0.082
 glad-0.082
Token around
Token ID1088
Feature activation-1.02e+01

Pos logit contributions
Ready+0.185
Circ+0.175
Find+0.169
 gra+0.167
 hunt+0.166
Neg logit contributions
 years-0.089
 mad-0.088
lee-0.088
 glad-0.084
heart-0.083
Token was
Token ID373
Feature activation+3.38e+00

Pos logit contributions
Ready+0.373
 hunt+0.337
Circ+0.335
 stairs+0.329
sc+0.327
Neg logit contributions
 mad-0.360
lee-0.359
 glad-0.356
 years-0.355
heart-0.340
Token a
Token ID257
Feature activation-1.49e+00

Pos logit contributions
lee+0.116
heart+0.113
ushing+0.108
ivering+0.107
 mad+0.107
Neg logit contributions
Ready-0.053
-0.051
Circ-0.049
 stairs-0.046
 ride-0.044
Token little
Token ID1310
Feature activation+4.00e+00

Pos logit contributions
Ready+0.036
 hunt+0.034
 ride+0.033
 stairs+0.033
 hill+0.033
Neg logit contributions
lee-0.037
heart-0.036
years-0.035
 years-0.034
illy-0.034
Token girl
Token ID2576
Feature activation+2.88e+00

Pos logit contributions
lee+0.123
years+0.116
 ok+0.116
 years+0.114
aming+0.113
Neg logit contributions
 hill-0.071
Ready-0.071
 stairs-0.070
Circ-0.067
 ride-0.067
Token named
Token ID3706
Feature activation+1.37e+00

Pos logit contributions
lee+0.069
 mad+0.068
 pleased+0.068
 ok+0.068
heart+0.067
Neg logit contributions
 hunt-0.065
 ride-0.062
-0.062
Ready-0.061
 stairs-0.060
Token Lily
Token ID20037
Feature activation+6.30e+00

Pos logit contributions
lee+0.030
 mad+0.030
 glad+0.030
 years+0.029
heart+0.028
Neg logit contributions
Ready-0.036
 hunt-0.033
Circ-0.033
 stairs-0.033
Find-0.032
Token.
Token ID13
Feature activation-3.08e+00

Pos logit contributions
lee+0.167
 mad+0.165
 glad+0.161
heart+0.159
 pleased+0.158
Neg logit contributions
Ready-0.132
 hunt-0.129
-0.122
Circ-0.122
 stairs-0.121
Token She
Token ID1375
Feature activation+3.73e+00

Pos logit contributions
Ready+0.068
 hunt+0.062
Circ+0.061
 stairs+0.061
Find+0.059
Neg logit contributions
lee-0.081
 mad-0.081
 glad-0.079
 years-0.079
heart-0.077
Token loved
Token ID6151
Feature activation+6.22e+00

Pos logit contributions
heart+0.086
lee+0.084
 mad+0.080
 ok+0.080
illy+0.078
Neg logit contributions
Ready-0.095
 hunt-0.092
-0.091
 ride-0.091
 stairs-0.088
Token to
Token ID284
Feature activation-9.28e+00

Pos logit contributions
lee+0.183
 mad+0.182
 glad+0.179
 years+0.178
years+0.171
Neg logit contributions
Ready-0.123
sc-0.108
ground-0.105
 hunt-0.104
 shortcut-0.102
Token play
Token ID711
Feature activation-5.07e+00

Pos logit contributions
Ready+0.180
sc+0.157
Circ+0.150
+0.148
 stairs+0.148
Neg logit contributions
lee-0.277
 mad-0.272
 glad-0.268
 years-0.266
illy-0.258
Token mom
Token ID1995
Feature activation+3.02e+00

Pos logit contributions
heart+0.057
lee+0.055
aming+0.054
ivering+0.054
illy+0.053
Neg logit contributions
 stairs-0.070
Ready-0.069
 hunt-0.066
Circ-0.065
 shortcut-0.065
Tokenmy
Token ID1820
Feature activation+5.40e+00

Pos logit contributions
lee+0.051
 mad+0.050
 years+0.048
 glad+0.047
aming+0.045
Neg logit contributions
Ready-0.093
 hunt-0.089
 stairs-0.087
-0.087
Circ-0.086
Token what
Token ID644
Feature activation-6.82e+00

Pos logit contributions
lee+0.106
 mad+0.105
 years+0.103
 glad+0.102
heart+0.098
Neg logit contributions
Ready-0.153
 hunt-0.144
 stairs-0.141
Circ-0.141
-0.138
Token it
Token ID340
Feature activation-5.96e+00

Pos logit contributions
Ready+0.179
 hunt+0.160
sc+0.159
Circ+0.159
 stairs+0.156
Neg logit contributions
 years-0.153
 mad-0.153
lee-0.151
 glad-0.151
illy-0.142
Token was
Token ID373
Feature activation+3.09e+00

Pos logit contributions
Ready+0.173
Circ+0.161
ground+0.154
Find+0.153
iffs+0.153
Neg logit contributions
 mad-0.119
lee-0.116
 years-0.113
 ok-0.113
 glad-0.112
Token and
Token ID290
Feature activation+4.40e+00

Pos logit contributions
lee+0.063
 mad+0.062
 glad+0.060
 years+0.060
years+0.059
Neg logit contributions
Ready-0.084
 hunt-0.082
 stairs-0.079
Circ-0.077
-0.077
Token she
Token ID673
Feature activation+8.69e-01

Pos logit contributions
lee+0.102
 years+0.102
 mad+0.101
 glad+0.099
aming+0.096
Neg logit contributions
Ready-0.107
 hunt-0.100
 stairs-0.098
Circ-0.097
-0.096
Token told
Token ID1297
Feature activation+3.80e+00

Pos logit contributions
 years+0.021
ivering+0.021
years+0.020
 glad+0.020
aming+0.020
Neg logit contributions
 hunt-0.018
 ride-0.017
Ready-0.017
-0.017
Find-0.016
Token him
Token ID683
Feature activation+1.63e+00

Pos logit contributions
 glad+0.086
lee+0.085
 years+0.084
 mad+0.084
years+0.082
Neg logit contributions
Ready-0.095
 hunt-0.090
 stairs-0.087
Circ-0.086
 gra-0.086
Token it
Token ID340
Feature activation-5.82e+00

Pos logit contributions
aming+0.046
ivering+0.046
 mad+0.045
years+0.045
 years+0.044
Neg logit contributions
 hunt-0.029
 stairs-0.029
Ready-0.028
 ride-0.027
 shortcut-0.027
Token was
Token ID373
Feature activation+3.21e+00

Pos logit contributions
Ready+0.123
 hunt+0.113
 stairs+0.110
Circ+0.109
sc+0.106
Neg logit contributions
lee-0.158
 mad-0.155
 glad-0.155
 years-0.153
heart-0.149
Token.
Token ID13
Feature activation-2.68e+00

Pos logit contributions
Ready+0.105
Circ+0.095
 hunt+0.093
 stairs+0.093
Find+0.092
Neg logit contributions
 mad-0.088
lee-0.087
 years-0.086
 glad-0.085
heart-0.082
Token The
Token ID383
Feature activation-2.25e+00

Pos logit contributions
Ready+0.064
 hunt+0.062
 shortcut+0.061
br+0.059
sc+0.058
Neg logit contributions
lee-0.066
 glad-0.061
 mad-0.056
 sorry-0.055
 pleased-0.055
Token train
Token ID4512
Feature activation-4.60e+00

Pos logit contributions
Ready+0.054
Circ+0.049
 hunt+0.049
 stairs+0.048
sc+0.047
Neg logit contributions
lee-0.055
 mad-0.055
 glad-0.055
 years-0.054
heart-0.051
Token was
Token ID373
Feature activation+3.13e+00

Pos logit contributions
Ready+0.120
Circ+0.111
 hunt+0.108
 stairs+0.106
 gra+0.106
Neg logit contributions
lee-0.105
 mad-0.104
 years-0.101
 glad-0.101
 pleased-0.096
Token afraid
Token ID7787
Feature activation+7.25e+00

Pos logit contributions
lee+0.059
heart+0.058
illy+0.057
 years+0.054
years+0.054
Neg logit contributions
Ready-0.089
 hunt-0.086
 stairs-0.086
 ride-0.083
 shortcut-0.083
Token and
Token ID290
Feature activation+4.52e+00

Pos logit contributions
lee+0.175
 mad+0.173
 glad+0.170
years+0.169
 years+0.168
Neg logit contributions
Ready-0.168
 hunt-0.158
 stairs-0.158
Circ-0.154
-0.151
Token said
Token ID531
Feature activation+5.84e+00

Pos logit contributions
lee+0.097
 mad+0.095
 years+0.094
 glad+0.094
heart+0.091
Neg logit contributions
Ready-0.119
 hunt-0.112
 stairs-0.109
Circ-0.108
Find-0.106
Token,
Token ID11
Feature activation+4.00e+00

Pos logit contributions
lee+0.121
 years+0.118
 mad+0.117
illy+0.113
 glad+0.113
Neg logit contributions
Ready-0.158
 hunt-0.149
Circ-0.146
 stairs-0.145
-0.143
Token "
Token ID366
Feature activation-5.48e+00

Pos logit contributions
lee+0.118
 glad+0.118
 mad+0.116
 years+0.113
 pleased+0.111
Neg logit contributions
Ready-0.076
 hunt-0.067
Circ-0.067
 stairs-0.066
Find-0.063
TokenOh
Token ID5812
Feature activation+9.02e+00

Pos logit contributions
 hunt+0.121
 stairs+0.121
 shortcut+0.110
 aisle+0.109
ground+0.108
Neg logit contributions
heart-0.142
 mad-0.134
 ok-0.134
 years-0.133
 careful-0.131
Token no
Token ID645
Feature activation+6.04e+00

Pos logit contributions
lee+0.194
aming+0.185
 years+0.178
years+0.177
illy+0.176
Neg logit contributions
Ready-0.231
 hunt-0.227
-0.223
 shortcut-0.221
br-0.220
Token named
Token ID3706
Feature activation+9.39e-01

Pos logit contributions
 mad+0.047
lee+0.046
 pleased+0.045
 glad+0.045
heart+0.045
Neg logit contributions
 hunt-0.052
Ready-0.051
-0.049
 gra-0.048
 ride-0.048
Token Sara
Token ID24799
Feature activation+8.17e+00

Pos logit contributions
lee+0.020
 mad+0.020
 glad+0.019
 years+0.019
heart+0.018
Neg logit contributions
Ready-0.026
 hunt-0.024
Circ-0.024
 stairs-0.023
Find-0.023
Token.
Token ID13
Feature activation-4.11e+00

Pos logit contributions
lee+0.236
 mad+0.235
heart+0.228
years+0.228
 pleased+0.226
Neg logit contributions
 hunt-0.146
-0.141
Ready-0.140
 ride-0.133
 gra-0.132
Token She
Token ID1375
Feature activation+2.76e+00

Pos logit contributions
Ready+0.099
 hunt+0.094
Circ+0.090
 shortcut+0.089
ground+0.088
Neg logit contributions
lee-0.099
 glad-0.099
 mad-0.098
 years-0.096
heart-0.092
Token had
Token ID550
Feature activation-3.34e+00

Pos logit contributions
heart+0.059
 mad+0.057
lee+0.057
 ok+0.055
 glad+0.054
Neg logit contributions
Ready-0.075
 hunt-0.072
-0.071
 ride-0.069
Circ-0.068
Token two
Token ID734
Feature activation-2.22e-01

Pos logit contributions
Ready+0.073
 stairs+0.065
 hunt+0.065
Circ+0.065
Find+0.063
Neg logit contributions
lee-0.090
heart-0.088
 mad-0.088
 years-0.085
 glad-0.085
Token arms
Token ID5101
Feature activation-4.12e+00

Pos logit contributions
 stairs+0.005
 hunt+0.005
 hills+0.005
 ride+0.005
 gra+0.005
Neg logit contributions
Acc-0.004
 pleased-0.004
heart-0.004
asured-0.004
illy-0.004
Token that
Token ID326
Feature activation-5.44e+00

Pos logit contributions
Ready+0.109
 hunt+0.105
 stairs+0.104
Circ+0.101
+0.100
Neg logit contributions
 mad-0.085
lee-0.083
 glad-0.082
 years-0.081
heart-0.080
Token were
Token ID547
Feature activation+1.93e+00

Pos logit contributions
Ready+0.154
 hunt+0.142
ground+0.140
Circ+0.140
 shortcut+0.139
Neg logit contributions
lee-0.111
 glad-0.110
 mad-0.107
 careful-0.104
 years-0.104
Token small
Token ID1402
Feature activation+2.14e+00

Pos logit contributions
lee+0.040
heart+0.040
 mad+0.039
 years+0.039
illy+0.038
Neg logit contributions
Ready-0.049
 stairs-0.047
 hunt-0.047
 gra-0.046
Find-0.046
Token and
Token ID290
Feature activation+3.74e+00

Pos logit contributions
lee+0.046
 glad+0.045
 mad+0.045
illy+0.044
 pleased+0.043
Neg logit contributions
Ready-0.054
 hunt-0.053
 stairs-0.053
sc-0.051
Circ-0.050
Token.
Token ID13
Feature activation-1.85e+00

Pos logit contributions
Ready+0.144
Circ+0.136
Find+0.130
 gra+0.130
 hunt+0.129
Neg logit contributions
lee-0.102
 mad-0.102
 years-0.101
 glad-0.098
heart-0.094
Token He
Token ID679
Feature activation+4.37e+00

Pos logit contributions
Ready+0.045
 hunt+0.044
sc+0.042
 gra+0.041
br+0.041
Neg logit contributions
lee-0.042
 glad-0.041
 mad-0.039
 okay-0.038
omo-0.037
Token was
Token ID373
Feature activation+4.64e+00

Pos logit contributions
 glad+0.108
heart+0.107
 mad+0.106
illy+0.105
lee+0.104
Neg logit contributions
Ready-0.101
 stairs-0.094
 hunt-0.093
Find-0.092
Circ-0.092
Token curious
Token ID11040
Feature activation+9.03e+00

Pos logit contributions
heart+0.076
illy+0.075
lee+0.074
 years+0.069
ious+0.064
Neg logit contributions
Ready-0.148
 stairs-0.142
 hunt-0.139
Find-0.137
Circ-0.136
Token so
Token ID523
Feature activation+7.99e+00

Pos logit contributions
years+0.232
aming+0.226
 glad+0.222
ivering+0.221
ppa+0.220
Neg logit contributions
 stairs-0.192
 hunt-0.191
 hills-0.183
 ride-0.182
-0.175
Token he
Token ID339
Feature activation+2.01e+00

Pos logit contributions
aming+0.253
lee+0.252
illy+0.250
heart+0.250
 mad+0.247
Neg logit contributions
Ready-0.119
 stairs-0.112
 hunt-0.109
-0.108
 ride-0.107
Token went
Token ID1816
Feature activation-2.42e-01

Pos logit contributions
 glad+0.051
 mad+0.049
 years+0.049
heart+0.049
illy+0.048
Neg logit contributions
Ready-0.047
 hunt-0.043
Circ-0.042
Find-0.042
 stairs-0.041
Token closer
Token ID5699
Feature activation+3.31e+00

Pos logit contributions
Ready+0.005
Circ+0.005
Find+0.005
 gra+0.005
 hunt+0.005
Neg logit contributions
 mad-0.006
lee-0.006
 glad-0.006
 years-0.006
 careful-0.006
Token to
Token ID284
Feature activation-8.00e+00

Pos logit contributions
lee+0.087
 mad+0.086
 years+0.084
 glad+0.084
heart+0.082
Neg logit contributions
Ready-0.074
Circ-0.066
 hunt-0.066
Find-0.064
 stairs-0.064
Token have
Token ID423
Feature activation-1.56e+00

Pos logit contributions
Ready+0.202
sc+0.201
br+0.197
tr+0.196
Circ+0.190
Neg logit contributions
 mad-0.184
lee-0.178
 glad-0.176
 years-0.162
 understanding-0.161
Token a
Token ID257
Feature activation+6.53e-02

Pos logit contributions
Ready+0.043
sc+0.039
Circ+0.039
ro+0.038
 stairs+0.038
Neg logit contributions
lee-0.033
 mad-0.033
 years-0.032
 glad-0.032
 careful-0.031
Token.
Token ID13
Feature activation-2.67e+00

Pos logit contributions
Ready+0.199
Circ+0.184
 hunt+0.179
Find+0.177
 stairs+0.175
Neg logit contributions
 mad-0.159
lee-0.158
 years-0.157
 glad-0.155
heart-0.146
Token They
Token ID1119
Feature activation+1.34e+00

Pos logit contributions
Ready+0.065
 hunt+0.062
 shortcut+0.059
br+0.058
con+0.056
Neg logit contributions
lee-0.066
 glad-0.063
 mad-0.061
 years-0.059
 pleased-0.058
Token wanted
Token ID2227
Feature activation+2.38e+00

Pos logit contributions
 mad+0.032
lee+0.032
 glad+0.032
ivering+0.032
heart+0.032
Neg logit contributions
Ready-0.030
 stairs-0.029
 hunt-0.028
Find-0.027
 ride-0.027
Token to
Token ID284
Feature activation-9.92e+00

Pos logit contributions
 mad+0.071
lee+0.070
 glad+0.069
 years+0.069
 careful+0.067
Neg logit contributions
Ready-0.045
sc-0.039
 hunt-0.038
Circ-0.037
Find-0.037
Token climb
Token ID12080
Feature activation-6.02e+00

Pos logit contributions
sc+0.238
Ready+0.234
rob+0.214
ground+0.211
+0.206
Neg logit contributions
 mad-0.248
 careful-0.233
 happy-0.231
 enthusiastic-0.231
 obedient-0.229
Token the
Token ID262
Feature activation-5.95e+00

Pos logit contributions
Ready+0.172
Find+0.157
Circ+0.157
 hunt+0.155
 gra+0.152
Neg logit contributions
 mad-0.121
lee-0.115
 years-0.113
 glad-0.112
i-0.104
Token tree
Token ID5509
Feature activation-1.10e+01

Pos logit contributions
Ready+0.126
Circ+0.122
ground+0.120
sc+0.118
ro+0.114
Neg logit contributions
 mad-0.149
 years-0.145
 glad-0.142
 adorable-0.140
lee-0.137
Token and
Token ID290
Feature activation+4.53e+00

Pos logit contributions
Ready+0.370
Circ+0.338
Find+0.334
 gra+0.323
 hunt+0.320
Neg logit contributions
 mad-0.176
 years-0.168
lee-0.165
 glad-0.163
heart-0.160
Token see
Token ID766
Feature activation-2.22e+00

Pos logit contributions
 mad+0.110
 glad+0.106
 careful+0.101
lee+0.101
 happy+0.100
Neg logit contributions
sc-0.103
Ready-0.102
br-0.101
Circ-0.100
ground-0.097
Token what
Token ID644
Feature activation-6.83e+00

Pos logit contributions
Ready+0.072
Circ+0.067
 hunt+0.066
sc+0.066
ro+0.065
Neg logit contributions
 mad-0.037
 glad-0.037
lee-0.036
 years-0.035
heart-0.033
Token was
Token ID373
Feature activation+2.91e+00

Pos logit contributions
Ready+0.194
ro+0.180
 hunt+0.179
ground+0.173
sc+0.171
Neg logit contributions
 mad-0.135
 years-0.134
lee-0.129
 glad-0.129
 careful-0.122
Token.
Token ID13
Feature activation-1.83e+00

Pos logit contributions
Ready+0.142
Circ+0.133
 hunt+0.131
Find+0.130
 gra+0.129
Neg logit contributions
 mad-0.099
lee-0.094
 years-0.094
 glad-0.093
heart-0.088
Token He
Token ID679
Feature activation+4.10e+00

Pos logit contributions
Ready+0.049
 hunt+0.048
 shortcut+0.047
sc+0.046
br+0.045
Neg logit contributions
lee-0.039
 glad-0.038
 mad-0.036
 enthusiastic-0.035
 careful-0.035
Token climbed
Token ID19952
Feature activation-4.00e+00

Pos logit contributions
 mad+0.098
lee+0.098
 glad+0.096
 years+0.095
 careful+0.093
Neg logit contributions
Ready-0.099
Circ-0.091
 hunt-0.091
ground-0.089
 gra-0.088
Token up
Token ID510
Feature activation-3.24e+00

Pos logit contributions
Ready+0.111
 hunt+0.108
Circ+0.105
 gra+0.104
Find+0.104
Neg logit contributions
 mad-0.080
 glad-0.077
lee-0.076
 years-0.073
 careful-0.070
Token and
Token ID290
Feature activation+5.94e+00

Pos logit contributions
Ready+0.088
 hunt+0.084
 gra+0.083
Find+0.081
Circ+0.080
Neg logit contributions
 mad-0.071
 years-0.065
 glad-0.064
lee-0.062
 careful-0.060
Token slid
Token ID27803
Feature activation+1.10e+00

Pos logit contributions
 mad+0.137
 glad+0.132
lee+0.130
 careful+0.126
 pleased+0.121
Neg logit contributions
Circ-0.142
 hunt-0.137
Ready-0.137
ground-0.137
 gra-0.135
Token down
Token ID866
Feature activation-9.72e+00

Pos logit contributions
 mad+0.028
lee+0.027
 glad+0.027
 years+0.026
 careful+0.025
Neg logit contributions
Ready-0.025
 hunt-0.024
Circ-0.024
 gra-0.023
Find-0.023
Token.
Token ID13
Feature activation-1.56e+00

Pos logit contributions
Ready+0.258
 hunt+0.253
 gra+0.245
Circ+0.244
Find+0.237
Neg logit contributions
 mad-0.215
 years-0.201
 glad-0.200
lee-0.189
 careful-0.187
Token It
Token ID632
Feature activation-2.91e+00

Pos logit contributions
 hunt+0.043
Ready+0.041
 shortcut+0.040
sc+0.040
tr+0.039
Neg logit contributions
 glad-0.033
lee-0.032
 sorry-0.029
 mad-0.029
 enthusiastic-0.028
Token was
Token ID373
Feature activation+4.66e+00

Pos logit contributions
Ready+0.076
 hunt+0.072
ground+0.072
Circ+0.071
 gra+0.069
Neg logit contributions
 mad-0.064
lee-0.060
 glad-0.059
 careful-0.058
 years-0.058
Token fun
Token ID1257
Feature activation+8.54e+00

Pos logit contributions
lee+0.097
heart+0.091
illy+0.090
 years+0.089
years+0.087
Neg logit contributions
Ready-0.133
 stairs-0.124
 hunt-0.121
 shortcut-0.118
Circ-0.118
Token bed
Token ID3996
Feature activation-4.56e+00

Pos logit contributions
lee+0.225
 years+0.223
 mad+0.219
heart+0.215
 glad+0.214
Neg logit contributions
Ready-0.173
Circ-0.153
sc-0.150
Find-0.146
-0.145
Token!
Token ID0
Feature activation-1.22e+00

Pos logit contributions
Ready+0.138
Circ+0.128
Find+0.125
 hunt+0.122
 shortcut+0.120
Neg logit contributions
 years-0.086
lee-0.086
 mad-0.082
heart-0.081
 glad-0.080
Token Fully
Token ID40234
Feature activation+3.28e+00

Pos logit contributions
 hunt+0.031
 shortcut+0.029
Ready+0.029
sc+0.028
 stairs+0.027
Neg logit contributions
lee-0.028
 glad-0.027
illy-0.027
 ok-0.026
 years-0.026
Token aware
Token ID3910
Feature activation+5.03e+00

Pos logit contributions
 mad+0.057
lee+0.056
 glad+0.050
 ok+0.049
Acc+0.048
Neg logit contributions
ground-0.091
-0.089
Ready-0.089
 stairs-0.088
Find-0.087
Token of
Token ID286
Feature activation-6.83e+00

Pos logit contributions
lee+0.111
 years+0.109
 mad+0.109
 glad+0.107
heart+0.107
Neg logit contributions
Ready-0.138
 hunt-0.126
Circ-0.126
Find-0.121
 stairs-0.119
Token its
Token ID663
Feature activation+1.94e+00

Pos logit contributions
Ready+0.177
Circ+0.163
 hunt+0.162
 stairs+0.158
Find+0.157
Neg logit contributions
lee-0.155
 mad-0.153
 years-0.153
 glad-0.151
heart-0.145
Token invitation
Token ID17023
Feature activation-4.88e+00

Pos logit contributions
lee+0.040
 mad+0.039
 glad+0.039
 years+0.037
years+0.037
Neg logit contributions
Ready-0.053
 hunt-0.050
 stairs-0.049
Circ-0.049
 gra-0.047
Token,
Token ID11
Feature activation+5.32e+00

Pos logit contributions
Ready+0.158
Circ+0.152
Find+0.146
 gra+0.141
+0.141
Neg logit contributions
 years-0.080
 mad-0.077
 ok-0.077
 glad-0.076
lee-0.075
Token Polly
Token ID36898
Feature activation+5.53e+00

Pos logit contributions
lee+0.116
 mad+0.114
 glad+0.113
 years+0.112
heart+0.108
Neg logit contributions
Ready-0.142
 hunt-0.131
Circ-0.131
 stairs-0.128
Find-0.126
Token me
Token ID502
Feature activation+6.82e-02

Pos logit contributions
lee+0.123
 mad+0.123
 glad+0.121
 years+0.119
heart+0.116
Neg logit contributions
Ready-0.142
 hunt-0.133
Circ-0.129
 stairs-0.129
sc-0.125
Tokenowed
Token ID6972
Feature activation+3.23e-01

Pos logit contributions
lee+0.001
 mad+0.001
 glad+0.001
 years+0.001
heart+0.001
Neg logit contributions
Ready-0.002
 hunt-0.002
Circ-0.002
 stairs-0.002
Find-0.002
Token.
Token ID13
Feature activation-3.01e+00

Pos logit contributions
Ready+0.014
 hunt+0.013
Circ+0.013
 stairs+0.013
Find+0.013
Neg logit contributions
lee-0.012
 mad-0.012
 glad-0.012
 years-0.012
heart-0.012
Token
Token ID198
Feature activation-8.36e+00

Pos logit contributions
 hunt+0.069
Ready+0.067
 shortcut+0.063
Circ+0.062
sc+0.061
Neg logit contributions
 glad-0.081
lee-0.080
 years-0.074
 pleased-0.073
heart-0.072
Token
Token ID198
Feature activation-8.37e+00

Pos logit contributions
Ready+0.206
 hunt+0.194
Circ+0.184
 stairs+0.183
sc+0.181
Neg logit contributions
 glad-0.206
lee-0.204
 years-0.196
 mad-0.196
 pleased-0.190
TokenThe
Token ID464
Feature activation-2.43e+00

Pos logit contributions
Ready+0.199
 hunt+0.193
 stairs+0.183
Circ+0.182
 shortcut+0.176
Neg logit contributions
 glad-0.199
 years-0.199
 mad-0.197
lee-0.197
 pleased-0.189
Token image
Token ID2939
Feature activation-7.95e+00

Pos logit contributions
Ready+0.059
 stairs+0.058
 ride+0.053
 hunt+0.053
 hill+0.053
Neg logit contributions
lee-0.056
ivering-0.054
heart-0.054
 ok-0.053
ushing-0.053
Token of
Token ID286
Feature activation-8.44e+00

Pos logit contributions
Ready+0.174
 hunt+0.161
 stairs+0.161
Circ+0.156
Find+0.155
Neg logit contributions
lee-0.203
 mad-0.200
 glad-0.197
 years-0.194
years-0.193
Token her
Token ID607
Feature activation+2.71e+00

Pos logit contributions
Ready+0.216
 hunt+0.196
sc+0.192
Circ+0.192
Find+0.187
Neg logit contributions
lee-0.201
 mad-0.200
 years-0.200
 glad-0.198
heart-0.188
Token family
Token ID1641
Feature activation-1.88e+00

Pos logit contributions
 mad+0.048
heart+0.047
lee+0.047
years+0.045
 glad+0.045
Neg logit contributions
Ready-0.077
 hunt-0.076
 stairs-0.075
 ride-0.073
Circ-0.073
Token was
Token ID373
Feature activation+2.67e+00

Pos logit contributions
Ready+0.042
 hunt+0.040
 stairs+0.040
 ride+0.039
ground+0.038
Neg logit contributions
 mad-0.046
lee-0.046
 glad-0.044
 years-0.044
heart-0.044
Token something
Token ID1223
Feature activation-3.17e+00

Pos logit contributions
lee+0.042
heart+0.041
illy+0.040
 mad+0.038
 years+0.038
Neg logit contributions
Ready-0.081
 stairs-0.078
 ride-0.077
 hunt-0.077
ground-0.077
Token that
Token ID326
Feature activation-4.85e+00

Pos logit contributions
Ready+0.091
 hunt+0.084
Circ+0.084
 stairs+0.083
Find+0.082
Neg logit contributions
lee-0.062
 mad-0.061
 glad-0.060
 years-0.060
heart-0.057
Token have
Token ID423
Feature activation-3.88e+00

Pos logit contributions
Ready+0.242
sc+0.237
Circ+0.218
 aisle+0.214
 stairs+0.214
Neg logit contributions
 mad-0.265
lee-0.251
 years-0.245
 ok-0.244
 careful-0.243
Token a
Token ID257
Feature activation-2.32e+00

Pos logit contributions
Ready+0.120
 hunt+0.111
 stairs+0.110
Circ+0.110
sc+0.110
Neg logit contributions
lee-0.068
 mad-0.068
 years-0.067
 glad-0.066
 careful-0.061
Token picnic
Token ID35715
Feature activation-7.82e+00

Pos logit contributions
Ready+0.045
sc+0.040
Circ+0.040
ground+0.040
 stairs+0.039
Neg logit contributions
 mad-0.067
 glad-0.066
 careful-0.063
lee-0.062
 okay-0.062
Token together
Token ID1978
Feature activation-1.03e-01

Pos logit contributions
Ready+0.228
Find+0.206
Circ+0.205
 gra+0.199
 stairs+0.197
Neg logit contributions
 years-0.158
 mad-0.151
 glad-0.146
 careful-0.144
lee-0.143
Token.
Token ID13
Feature activation-3.70e+00

Pos logit contributions
Ready+0.003
Circ+0.003
Find+0.003
 stairs+0.003
 gra+0.003
Neg logit contributions
 years-0.002
lee-0.002
 mad-0.002
 glad-0.002
 ok-0.002
Token
Token ID198
Feature activation-9.47e+00

Pos logit contributions
Ready+0.079
 stairs+0.070
Find+0.070
Circ+0.070
 shortcut+0.068
Neg logit contributions
 glad-0.098
lee-0.097
 okay-0.093
 years-0.092
 ok-0.092
Token
Token ID198
Feature activation-9.40e+00

Pos logit contributions
Ready+0.217
 hunt+0.188
ro+0.182
 stairs+0.180
Find+0.180
Neg logit contributions
 glad-0.252
 years-0.241
lee-0.232
 mad-0.230
 okay-0.226
TokenMax
Token ID11518
Feature activation+3.30e+00

Pos logit contributions
Ready+0.177
 stairs+0.161
 hunt+0.160
Circ+0.159
 gra+0.151
Neg logit contributions
 years-0.271
 glad-0.264
 mad-0.264
lee-0.263
 careful-0.256
Token wanted
Token ID2227
Feature activation+1.91e+00

Pos logit contributions
 mad+0.075
 glad+0.073
lee+0.073
years+0.073
heart+0.071
Neg logit contributions
 hunt-0.080
Ready-0.080
-0.075
Circ-0.074
 ride-0.074
Token to
Token ID284
Feature activation-1.02e+01

Pos logit contributions
 mad+0.053
 years+0.053
 glad+0.052
lee+0.052
 careful+0.051
Neg logit contributions
Ready-0.040
sc-0.034
Find-0.034
ground-0.033
 gra-0.033
Token make
Token ID787
Feature activation-3.99e+00

Pos logit contributions
Ready+0.216
sc+0.188
Circ+0.187
 stairs+0.185
+0.180
Neg logit contributions
 mad-0.282
lee-0.276
 years-0.275
 glad-0.271
 ok-0.265
Token spill
Token ID19431
Feature activation-4.17e+00

Pos logit contributions
 mad+0.146
 glad+0.125
lee+0.123
 sorry+0.121
 angry+0.120
Neg logit contributions
Ready-0.135
Circ-0.130
ground-0.128
East-0.127
-0.124
Token more
Token ID517
Feature activation+3.31e+00

Pos logit contributions
Ready+0.111
 hunt+0.104
Circ+0.102
Find+0.098
ground+0.097
Neg logit contributions
 mad-0.097
lee-0.091
 glad-0.088
 years-0.087
 careful-0.083
Token water
Token ID1660
Feature activation-3.12e+00

Pos logit contributions
lee+0.083
 mad+0.082
 years+0.080
 glad+0.079
 careful+0.075
Neg logit contributions
Ready-0.080
Circ-0.073
 hunt-0.073
Find-0.070
 stairs-0.069
Token and
Token ID290
Feature activation+5.51e+00

Pos logit contributions
Ready+0.097
Circ+0.089
Find+0.087
 hunt+0.086
East+0.086
Neg logit contributions
 mad-0.063
lee-0.058
 years-0.054
 glad-0.054
 careful-0.051
Token colors
Token ID7577
Feature activation-2.06e+00

Pos logit contributions
 mad+0.135
lee+0.125
 years+0.124
 glad+0.122
 careful+0.118
Neg logit contributions
Ready-0.138
Circ-0.129
 hunt-0.126
ground-0.124
 shortcut-0.122
Token on
Token ID319
Feature activation-1.02e+01

Pos logit contributions
Ready+0.063
Circ+0.059
 hunt+0.059
Find+0.057
ground+0.056
Neg logit contributions
 mad-0.039
lee-0.038
 years-0.035
 glad-0.035
 okay-0.034
Token the
Token ID262
Feature activation-5.16e+00

Pos logit contributions
Ready+0.264
 hunt+0.240
Circ+0.238
 shortcut+0.231
sc+0.231
Neg logit contributions
lee-0.234
 mad-0.232
 years-0.226
 glad-0.220
heart-0.214
Token table
Token ID3084
Feature activation-8.69e+00

Pos logit contributions
Ready+0.113
Circ+0.111
Find+0.104
East+0.101
 hunt+0.100
Neg logit contributions
lee-0.137
 mad-0.136
 years-0.129
 glad-0.128
 careful-0.125
Token,
Token ID11
Feature activation+4.74e+00

Pos logit contributions
Ready+0.246
Circ+0.228
 hunt+0.221
Find+0.218
 gra+0.215
Neg logit contributions
 mad-0.184
lee-0.178
 years-0.176
 glad-0.171
 careful-0.163
Token the
Token ID262
Feature activation-5.02e+00

Pos logit contributions
 mad+0.103
lee+0.101
 glad+0.100
 years+0.098
heart+0.093
Neg logit contributions
Ready-0.127
Circ-0.118
 hunt-0.118
 stairs-0.115
Find-0.113
Token floor
Token ID4314
Feature activation-9.06e+00

Pos logit contributions
Ready+0.111
Circ+0.109
 hunt+0.103
Find+0.100
ground+0.099
Neg logit contributions
 mad-0.133
lee-0.127
 glad-0.126
 years-0.123
 careful-0.120
Token happily
Token ID18177
Feature activation+4.82e+00

Pos logit contributions
Ready+0.174
 stairs+0.160
Circ+0.160
 hunt+0.157
Find+0.157
Neg logit contributions
 mad-0.099
 glad-0.098
lee-0.098
 years-0.098
 pleased-0.091
Token until
Token ID1566
Feature activation+6.82e-01

Pos logit contributions
lee+0.092
 mad+0.091
 glad+0.091
 years+0.091
 pleased+0.084
Neg logit contributions
Ready-0.144
Circ-0.132
 stairs-0.130
 hunt-0.129
Find-0.128
Token the
Token ID262
Feature activation-5.30e+00

Pos logit contributions
 mad+0.016
 glad+0.016
lee+0.015
 years+0.015
 pleased+0.015
Neg logit contributions
Ready-0.017
sc-0.016
Circ-0.016
 hunt-0.016
Find-0.015
Token sun
Token ID4252
Feature activation-4.07e+00

Pos logit contributions
Ready+0.122
 hunt+0.112
 stairs+0.110
Circ+0.109
Find+0.105
Neg logit contributions
lee-0.136
 mad-0.133
 years-0.131
 glad-0.131
heart-0.128
Token went
Token ID1816
Feature activation-1.91e+00

Pos logit contributions
Ready+0.107
Circ+0.099
 hunt+0.097
Find+0.097
 shortcut+0.097
Neg logit contributions
lee-0.089
 mad-0.082
heart-0.082
aming-0.081
i-0.080
Token down
Token ID866
Feature activation-1.13e+01

Pos logit contributions
Ready+0.046
 hunt+0.044
Circ+0.043
 stairs+0.043
Find+0.042
Neg logit contributions
 mad-0.046
lee-0.044
 glad-0.043
 ok-0.042
heart-0.042
Token.
Token ID13
Feature activation-2.49e+00

Pos logit contributions
Ready+0.339
Find+0.315
 hunt+0.313
Circ+0.312
 shortcut+0.301
Neg logit contributions
 mad-0.206
lee-0.206
 glad-0.204
 years-0.204
 pleased-0.188
Token The
Token ID383
Feature activation-1.88e+00

Pos logit contributions
 hunt+0.054
Ready+0.053
 stairs+0.051
 shortcut+0.050
br+0.049
Neg logit contributions
 glad-0.069
lee-0.065
 pleased-0.060
 years-0.059
 okay-0.059
Token End
Token ID5268
Feature activation+1.05e+00

Pos logit contributions
Ready+0.046
 stairs+0.042
 hunt+0.042
+0.041
 hill+0.041
Neg logit contributions
lee-0.046
heart-0.045
illy-0.044
years-0.044
 mad-0.043
Token.
Token ID13
Feature activation-2.61e+00

Pos logit contributions
 years+0.025
lee+0.023
 glad+0.022
 mad+0.022
heart+0.021
Neg logit contributions
Ready-0.029
 hunt-0.026
Find-0.026
 stairs-0.025
 gra-0.025
Token<|endoftext|>
Token ID50256
Feature activation-1.02e+01

Pos logit contributions
Ready+0.054
 hunt+0.052
 stairs+0.051
 shortcut+0.049
sc+0.049
Neg logit contributions
 glad-0.077
lee-0.070
 years-0.068
 okay-0.067
 pleased-0.066
Token next
Token ID1306
Feature activation-1.34e+00

Pos logit contributions
 years+0.001
lee+0.001
 mad+0.001
 glad+0.001
heart+0.001
Neg logit contributions
Ready-0.002
Circ-0.002
Find-0.002
 hunt-0.002
 stairs-0.002
Token time
Token ID640
Feature activation-5.37e+00

Pos logit contributions
Ready+0.032
 hunt+0.029
Circ+0.029
 stairs+0.029
Find+0.028
Neg logit contributions
lee-0.033
 mad-0.033
 years-0.032
 glad-0.032
heart-0.031
Token."
Token ID526
Feature activation-2.51e+00

Pos logit contributions
Ready+0.135
 hunt+0.124
Circ+0.123
 stairs+0.122
Find+0.119
Neg logit contributions
lee-0.124
 mad-0.123
 glad-0.121
 years-0.120
heart-0.116
Token<|endoftext|>
Token ID50256
Feature activation-9.75e+00

Pos logit contributions
Ready+0.039
 hunt+0.035
 stairs+0.034
Circ+0.034
 shortcut+0.033
Neg logit contributions
lee-0.084
 glad-0.082
 mad-0.081
 years-0.079
heart-0.078
TokenOnce
Token ID7454
Feature activation-1.36e+00

Pos logit contributions
Ready+0.203
 hunt+0.202
 shortcut+0.201
Find+0.188
 hike+0.188
Neg logit contributions
 years-0.251
 brothers-0.231
aming-0.230
 mad-0.228
lee-0.227
Token upon
Token ID2402
Feature activation-7.11e+00

Pos logit contributions
Ready+0.029
 hunt+0.026
 stairs+0.026
Circ+0.026
+0.025
Neg logit contributions
lee-0.036
 mad-0.036
 years-0.036
 glad-0.035
heart-0.034
Token a
Token ID257
Feature activation-1.99e+00

Pos logit contributions
Ready+0.154
 hunt+0.143
Circ+0.139
 stairs+0.138
+0.138
Neg logit contributions
lee-0.194
 mad-0.183
heart-0.182
 glad-0.181
 years-0.180
Token time
Token ID640
Feature activation-5.78e+00

Pos logit contributions
Ready+0.033
 hunt+0.033
+0.031
 ride+0.030
sc+0.029
Neg logit contributions
lee-0.063
years-0.061
heart-0.058
 mad-0.058
 glad-0.057
Token,
Token ID11
Feature activation+3.53e+00

Pos logit contributions
Ready+0.174
 hunt+0.163
Circ+0.161
 stairs+0.160
+0.158
Neg logit contributions
lee-0.105
 mad-0.103
 years-0.100
 glad-0.100
heart-0.096
Token there
Token ID612
Feature activation-1.57e+01

Pos logit contributions
lee+0.100
 mad+0.096
 years+0.096
heart+0.095
 glad+0.093
Neg logit contributions
Ready-0.072
-0.065
 hunt-0.065
 stairs-0.064
Circ-0.063
Token was
Token ID373
Feature activation+2.79e+00

Pos logit contributions
Ready+0.373
 hunt+0.338
Circ+0.335
 stairs+0.330
sc+0.324
Neg logit contributions
lee-0.383
 mad-0.383
 years-0.379
 glad-0.379
heart-0.362