TOP ACTIVATIONS
MAX = 0.380

as he ate his pops icle , feeling cooler and more
. He wanted the pops icle that was lost in the
. She took a bisc uit from her pocket and threw
brought it back to Ze ke and they both had a
the cooler , her pops icle stayed nice and cool -
O w ! Ow ! Ow !" Sam cried . "
" O w ! Ow ! Ow !" Tom and
" O w ! Ow ! Ow !" Sam cried
O w ! Ow ! Ow !" Tom and Sue cried
his knee and a pops icle to make him feel better
a delicious ice cream pops icle ! She was so happy
O w ! Ow ! Ow !" Tom cries .
. Every morning , Ze ke cooked some delicious food for
O w ! Ow ! Ow !" Sam cried . He
found a special pink pops icle inside the freezer ! She
O w ! Ow ! Ow !" Tom screamed . He
O w ! Ow ! Ow !" they cried and let
" O w ! Ow ! Ow !" Tom cries
the bowls and the pops icle sticks on the table .
" O w ! Ow ! Ow !" Tom screamed

MIN ACTIVATIONS
MIN = -0.307

. The End . <|endoftext|> Once upon a
. The End . <|endoftext|> Once upon a
accept each other . The End . <|endoftext|> Once upon a
joy and laughter . The End . <|endoftext|> Once upon a
happy . The End . <|endoftext|> Once upon a
were very happy . The End . <|endoftext|> Once upon a
. The End <|endoftext|> Once upon a time
again . The End . <|endoftext|> Once upon a
. The End . <|endoftext|> Once upon a
. The End <|endoftext|> One day , two
decision . The End . <|endoftext|> Once upon a
. The End . <|endoftext|> Once upon a
happily ever after . The End . <|endoftext|> Once upon a
. The End . <|endoftext|> Once there was
. The End . <|endoftext|> Jimmy was walking
. The End . <|endoftext|> Once upon a
The End . <|endoftext|> Once upon a
joy ! The End ! <|endoftext|> Once upon a
game . The End . <|endoftext|> Todd was so
. The End . <|endoftext|> Once upon a

INTERVAL 0.209 - 0.380
CONTAINS 0.648%

and cl apped while P ipp in sang .
her teeth . L ily tried to be independent
and have fun . <|endoftext|> L ily and Ben were friends
€ ™ s wrong , Mike y ? â € his
ran away . Tim my and his friends were

INTERVAL 0.037 - 0.209
CONTAINS 37.086%

it also brought her special treats like candy . She knew
small town , there was a bar ber . The bar
. They jumped in the p udd les and laughed .
Yes , he can ride with me ." Tim
in the air . It was a small box .

INTERVAL -0.135 - 0.037
CONTAINS 56.170%

newspaper . Suddenly , a spider crawled out of the newspaper
its strong sails to sail across the ocean and it was
Blue too . She is so pretty . Can we take
to draw a new animal . He drew a yellow bird
all agreed that Jen was so gifted and they gave her

INTERVAL -0.307 - -0.135
CONTAINS 6.096%

It was very quiet and he could hear the wind blowing
He told his mom that he liked to deliver therm ometers
He felt so sad and he stopped crying . He knew
boy saw the tube and he wanted to try it .
and its friends would come every day to drink water from
Token as
Token ID355
Feature activation+2.35e-02

Pos logit contributions
uzz+0.024
rett+0.023
uit+0.023
xc+0.023
zzy+0.022
Neg logit contributions
ience-0.018
 gloomy-0.018
 happiest-0.017
 End-0.017
 sizes-0.017
Token he
Token ID339
Feature activation-1.31e-01

Pos logit contributions
rett+0.042
uzz+0.042
zzy+0.041
xc+0.040
bur+0.040
Neg logit contributions
 gloomy-0.028
 End-0.027
 happiest-0.027
ience-0.026
 goals-0.026
Token ate
Token ID15063
Feature activation+3.61e-02

Pos logit contributions
 gloomy+0.172
ience+0.172
 happiest+0.170
 End+0.164
 goals+0.154
Neg logit contributions
uzz-0.222
xc-0.219
rett-0.218
Wh-0.210
uit-0.209
Token his
Token ID465
Feature activation+6.23e-02

Pos logit contributions
rett+0.056
xc+0.056
uzz+0.055
Wh+0.055
zzy+0.055
Neg logit contributions
 gloomy-0.054
 happiest-0.051
 End-0.050
 goals-0.049
ience-0.048
Token pops
Token ID26384
Feature activation+1.13e-01

Pos logit contributions
rett+0.109
uzz+0.108
xc+0.106
zzy+0.106
Wh+0.106
Neg logit contributions
 gloomy-0.081
 happiest-0.077
 End-0.075
ience-0.073
 goals-0.071
Tokenicle
Token ID1548
Feature activation+3.67e-01

Pos logit contributions
zzy+0.127
Wh+0.125
rett+0.124
xc+0.123
uzz+0.118
Neg logit contributions
 gloomy-0.211
s-0.202
 sizes-0.198
 happiest-0.198
 End-0.195
Token,
Token ID11
Feature activation-7.23e-02

Pos logit contributions
rett+0.633
uit+0.628
bur+0.621
uzz+0.619
xc+0.615
Neg logit contributions
 gloomy-0.461
 End-0.461
 happiest-0.453
ience-0.442
 educational-0.420
Token feeling
Token ID4203
Feature activation+5.36e-03

Pos logit contributions
 gloomy+0.088
 happiest+0.085
 End+0.081
ience+0.079
 sizes+0.077
Neg logit contributions
rett-0.132
xc-0.130
uzz-0.130
zzy-0.129
uit-0.127
Token cooler
Token ID19346
Feature activation+9.94e-02

Pos logit contributions
zzy+0.009
 unw+0.008
Flo+0.008
 Sara+0.008
uzz+0.008
Neg logit contributions
 problem-0.007
 jets-0.007
 These-0.007
ience-0.007
 network-0.007
Token and
Token ID290
Feature activation+5.93e-02

Pos logit contributions
uit+0.160
rett+0.156
uzz+0.155
bur+0.154
 unw+0.152
Neg logit contributions
 End-0.139
 goals-0.136
 gloomy-0.133
ience-0.133
leness-0.132
Token more
Token ID517
Feature activation-1.35e-01

Pos logit contributions
 unw+0.091
bur+0.087
rett+0.087
uzz+0.087
uit+0.085
Neg logit contributions
ience-0.082
 gloomy-0.081
 These-0.079
 End-0.078
 happiest-0.078
Token.
Token ID13
Feature activation-2.91e-02

Pos logit contributions
 happiest+0.007
 End+0.006
ience+0.006
 gloomy+0.006
 sizes+0.006
Neg logit contributions
rett-0.010
uzz-0.010
zzy-0.010
 unw-0.009
Wh-0.009
Token He
Token ID679
Feature activation+2.48e-02

Pos logit contributions
 gloomy+0.042
 happiest+0.041
 End+0.040
ience+0.039
 sizes+0.038
Neg logit contributions
rett-0.045
uzz-0.045
Wh-0.045
uit-0.044
xc-0.044
Token wanted
Token ID2227
Feature activation-4.79e-02

Pos logit contributions
rett+0.047
uit+0.046
uzz+0.046
xc+0.046
bur+0.045
Neg logit contributions
 gloomy-0.029
 happiest-0.028
ience-0.027
 End-0.027
 valuable-0.025
Token the
Token ID262
Feature activation-6.96e-02

Pos logit contributions
 gloomy+0.059
 happiest+0.054
 days+0.051
 educational+0.050
 valuable+0.050
Neg logit contributions
xc-0.092
zzy-0.091
rett-0.091
uzz-0.090
igi-0.088
Token pops
Token ID26384
Feature activation+1.10e-01

Pos logit contributions
 gloomy+0.093
 happiest+0.090
 End+0.089
ience+0.089
 goals+0.086
Neg logit contributions
rett-0.114
uzz-0.113
uit-0.113
xc-0.113
bur-0.112
Tokenicle
Token ID1548
Feature activation+3.65e-01

Pos logit contributions
zzy+0.086
rett+0.086
 unw+0.081
Wh+0.078
xc+0.078
Neg logit contributions
 gloomy-0.246
 happiest-0.236
s-0.233
lessness-0.231
 End-0.229
Token that
Token ID326
Feature activation-1.13e-01

Pos logit contributions
rett+0.618
uzz+0.613
uit+0.611
xc+0.608
bur+0.602
Neg logit contributions
 gloomy-0.468
 End-0.457
 happiest-0.453
ience-0.446
 goals-0.426
Token was
Token ID373
Feature activation+5.05e-02

Pos logit contributions
 gloomy+0.139
 happiest+0.134
 End+0.130
ience+0.127
 goals+0.124
Neg logit contributions
rett-0.203
uzz-0.200
xc-0.199
zzy-0.197
uit-0.197
Token lost
Token ID2626
Feature activation+2.25e-02

Pos logit contributions
rett+0.092
uzz+0.090
xc+0.090
zzy+0.089
uit+0.089
Neg logit contributions
 gloomy-0.063
 happiest-0.060
 End-0.058
ience-0.055
 goals-0.055
Token in
Token ID287
Feature activation+1.41e-01

Pos logit contributions
uzz+0.037
rett+0.037
uit+0.037
xc+0.037
zzy+0.036
Neg logit contributions
 gloomy-0.029
 happiest-0.029
 End-0.028
ience-0.028
 goals-0.027
Token the
Token ID262
Feature activation-6.82e-02

Pos logit contributions
rett+0.202
uzz+0.200
uit+0.197
xc+0.196
bur+0.195
Neg logit contributions
 gloomy-0.218
 happiest-0.216
ience-0.210
 End-0.209
 goals-0.200
Token.
Token ID13
Feature activation-3.83e-02

Pos logit contributions
 gloomy+0.137
 happiest+0.134
 End+0.128
ience+0.128
 tallest+0.127
Neg logit contributions
rett-0.177
uzz-0.177
uit-0.176
 unw-0.170
xc-0.169
Token She
Token ID1375
Feature activation-1.98e-02

Pos logit contributions
 gloomy+0.048
 happiest+0.046
 End+0.045
ience+0.043
 goals+0.043
Neg logit contributions
rett-0.068
xc-0.067
uzz-0.066
zzy-0.066
uit-0.066
Token took
Token ID1718
Feature activation+1.47e-01

Pos logit contributions
 gloomy+0.027
 happiest+0.026
 End+0.025
 sizes+0.024
lessness+0.024
Neg logit contributions
rett-0.032
uzz-0.032
zzy-0.032
xc-0.032
 unw-0.032
Token a
Token ID257
Feature activation+5.93e-02

Pos logit contributions
uzz+0.252
rett+0.250
xc+0.248
zzy+0.247
bur+0.244
Neg logit contributions
 gloomy-0.194
 days-0.182
 happiest-0.180
ience-0.177
 End-0.176
Token bisc
Token ID36228
Feature activation+2.10e-01

Pos logit contributions
uzz+0.114
xc+0.112
rett+0.112
Wh+0.111
bur+0.110
Neg logit contributions
 gloomy-0.070
 happiest-0.063
 valuable-0.061
ience-0.060
 End-0.059
Tokenuit
Token ID5013
Feature activation+3.64e-01

Pos logit contributions
zzy+0.151
ck+0.148
 unw+0.147
rett+0.147
xc+0.142
Neg logit contributions
s-0.463
�-0.455
lessness-0.442
 sky-0.437
 End-0.435
Token from
Token ID422
Feature activation+1.50e-01

Pos logit contributions
rett+0.626
uzz+0.622
xc+0.621
zzy+0.615
Wh+0.609
Neg logit contributions
 gloomy-0.460
 happiest-0.444
ience-0.429
 End-0.424
lessness-0.406
Token her
Token ID607
Feature activation+5.87e-03

Pos logit contributions
xc+0.269
rett+0.263
uzz+0.257
zzy+0.254
uit+0.250
Neg logit contributions
 gloomy-0.206
 End-0.188
 happiest-0.184
 days-0.176
 goals-0.172
Token pocket
Token ID10000
Feature activation+2.00e-03

Pos logit contributions
uit+0.009
rett+0.009
bur+0.009
 unw+0.009
uzz+0.009
Neg logit contributions
 gloomy-0.009
 happiest-0.008
 End-0.008
ience-0.008
lessness-0.008
Token and
Token ID290
Feature activation+5.62e-02

Pos logit contributions
uit+0.003
rett+0.003
uzz+0.003
bur+0.003
 unw+0.003
Neg logit contributions
 gloomy-0.003
 happiest-0.003
 End-0.003
ience-0.003
 goals-0.003
Token threw
Token ID9617
Feature activation+1.15e-01

Pos logit contributions
rett+0.088
xc+0.088
uzz+0.088
zzy+0.083
uit+0.082
Neg logit contributions
 gloomy-0.083
 happiest-0.080
ience-0.078
 End-0.076
 sizes-0.075
Token brought
Token ID3181
Feature activation+1.33e-01

Pos logit contributions
rett+0.039
xc+0.038
uzz+0.038
Wh+0.038
uit+0.038
Neg logit contributions
ience-0.027
s-0.025
 gloomy-0.024
 loves-0.024
 happiest-0.024
Token it
Token ID340
Feature activation-6.75e-02

Pos logit contributions
rett+0.223
uzz+0.221
xc+0.219
zzy+0.218
uit+0.216
Neg logit contributions
 gloomy-0.178
 happiest-0.171
 End-0.167
ience-0.163
 goals-0.160
Token back
Token ID736
Feature activation+4.39e-02

Pos logit contributions
 gloomy+0.088
 happiest+0.083
 End+0.079
 valuable+0.076
 goals+0.076
Neg logit contributions
rett-0.117
xc-0.116
zzy-0.115
Wh-0.114
uzz-0.113
Token to
Token ID284
Feature activation+1.28e-01

Pos logit contributions
Wh+0.076
rett+0.076
xc+0.076
zzy+0.075
uit+0.073
Neg logit contributions
 gloomy-0.056
 happiest-0.052
 End-0.050
ience-0.049
 goals-0.049
Token Ze
Token ID9033
Feature activation+9.26e-02

Pos logit contributions
xc+0.180
rett+0.178
uzz+0.176
zzy+0.176
bur+0.171
Neg logit contributions
 gloomy-0.213
 happiest-0.204
 End-0.199
 goals-0.190
ience-0.189
Tokenke
Token ID365
Feature activation+3.64e-01

Pos logit contributions
uzz+0.124
bur+0.117
xc+0.117
Wh+0.117
zip+0.115
Neg logit contributions
ience-0.163
 gloomy-0.148
 happiest-0.147
 End-0.145
 problem-0.141
Token and
Token ID290
Feature activation+6.20e-02

Pos logit contributions
rett+0.645
uit+0.627
bur+0.624
uzz+0.622
zzy+0.622
Neg logit contributions
 gloomy-0.456
 happiest-0.444
 End-0.434
 educational-0.416
 sizes-0.414
Token they
Token ID484
Feature activation-1.90e-01

Pos logit contributions
xc+0.086
rett+0.085
uzz+0.084
zzy+0.083
bur+0.083
Neg logit contributions
 gloomy-0.101
 happiest-0.098
 End-0.095
ience-0.095
 valuable-0.093
Token both
Token ID1111
Feature activation-7.51e-02

Pos logit contributions
 gloomy+0.239
 happiest+0.230
 End+0.223
ience+0.216
 goals+0.213
Neg logit contributions
rett-0.335
uzz-0.334
xc-0.329
zzy-0.329
uit-0.327
Token had
Token ID550
Feature activation-9.35e-02

Pos logit contributions
 gloomy+0.110
 happiest+0.107
 End+0.106
 educational+0.101
ience+0.100
Neg logit contributions
uzz-0.117
zzy-0.115
 unw-0.114
rett-0.114
bur-0.110
Token a
Token ID257
Feature activation+6.17e-02

Pos logit contributions
 gloomy+0.144
 happiest+0.139
 End+0.139
ience+0.135
lessness+0.133
Neg logit contributions
rett-0.136
uzz-0.136
 unw-0.134
bur-0.133
uit-0.133
Token the
Token ID262
Feature activation-3.91e-02

Pos logit contributions
leness+0.207
 happiest+0.202
 sizes+0.201
 End+0.200
lessness+0.199
Neg logit contributions
uzz-0.174
 unw-0.172
bur-0.170
rett-0.168
 Sara-0.167
Token cooler
Token ID19346
Feature activation+1.04e-01

Pos logit contributions
 gloomy+0.045
 happiest+0.043
ience+0.042
 End+0.041
lessness+0.040
Neg logit contributions
rett-0.071
uzz-0.070
xc-0.069
uit-0.069
bur-0.069
Token,
Token ID11
Feature activation-7.59e-02

Pos logit contributions
uit+0.165
uzz+0.165
rett+0.164
bur+0.163
zzy+0.160
Neg logit contributions
 gloomy-0.144
 End-0.141
 happiest-0.140
ience-0.138
 goals-0.137
Token her
Token ID607
Feature activation+1.02e-02

Pos logit contributions
 gloomy+0.101
 happiest+0.098
 End+0.095
ience+0.092
 goals+0.091
Neg logit contributions
rett-0.128
uzz-0.126
xc-0.125
zzy-0.124
uit-0.124
Token pops
Token ID26384
Feature activation+1.10e-01

Pos logit contributions
rett+0.017
uzz+0.017
xc+0.016
uit+0.016
zzy+0.016
Neg logit contributions
 gloomy-0.014
 happiest-0.013
 End-0.013
ience-0.013
 sizes-0.013
Tokenicle
Token ID1548
Feature activation+3.64e-01

Pos logit contributions
zzy+0.105
rett+0.102
uzz+0.099
Wh+0.098
xc+0.097
Neg logit contributions
 gloomy-0.229
s-0.217
 sizes-0.214
 happiest-0.212
 End-0.212
Token stayed
Token ID9658
Feature activation+4.21e-02

Pos logit contributions
rett+0.575
uit+0.567
xc+0.564
bur+0.563
uzz+0.562
Neg logit contributions
 gloomy-0.509
 happiest-0.502
 End-0.500
ience-0.485
 goals-0.466
Token nice
Token ID3621
Feature activation-9.36e-02

Pos logit contributions
rett+0.072
uzz+0.071
xc+0.070
zzy+0.070
uit+0.070
Neg logit contributions
 gloomy-0.054
 happiest-0.053
 End-0.052
ience-0.051
 goals-0.049
Token and
Token ID290
Feature activation+5.71e-02

Pos logit contributions
 gloomy+0.122
 happiest+0.120
 End+0.119
ience+0.117
 educational+0.109
Neg logit contributions
rett-0.159
uit-0.157
uzz-0.156
bur-0.154
 unw-0.153
Token cool
Token ID3608
Feature activation-2.04e-02

Pos logit contributions
 unw+0.092
zip+0.090
bur+0.090
Flo+0.088
Wh+0.087
Neg logit contributions
 End-0.073
 goals-0.072
ience-0.071
 jets-0.071
 These-0.071
Token -
Token ID532
Feature activation-1.34e-02

Pos logit contributions
 End+0.030
 happiest+0.029
ience+0.028
 gloomy+0.027
 educational+0.027
Neg logit contributions
rett-0.031
 unw-0.031
uit-0.031
uzz-0.031
bur-0.030
TokenO
Token ID46
Feature activation+2.82e-01

Pos logit contributions
 End+0.173
 happiest+0.169
 gloomy+0.164
 sizes+0.159
 goals+0.154
Neg logit contributions
rett-0.193
uzz-0.188
isk-0.178
igi-0.177
xc-0.175
Tokenw
Token ID86
Feature activation+1.17e-01

Pos logit contributions
rett+0.422
zzy+0.417
bur+0.416
xc+0.411
uzz+0.409
Neg logit contributions
 gloomy-0.447
 happiest-0.442
 goals-0.411
 End-0.408
 educational-0.400
Token!
Token ID0
Feature activation+1.33e-02

Pos logit contributions
uzz+0.250
zzy+0.249
rett+0.247
Wh+0.246
xc+0.243
Neg logit contributions
 gloomy-0.102
 happiest-0.093
ience-0.092
 End-0.089
 problem-0.085
Token Ow
Token ID11960
Feature activation+3.63e-01

Pos logit contributions
zzy+0.023
rett+0.023
bur+0.022
igi+0.022
uit+0.022
Neg logit contributions
 These-0.017
 gloomy-0.017
 happiest-0.015
 goals-0.015
ience-0.015
Token!
Token ID0
Feature activation+1.37e-02

Pos logit contributions
rett+0.545
xc+0.525
bur+0.523
uit+0.515
zzy+0.500
Neg logit contributions
 happiest-0.563
 End-0.549
 gloomy-0.545
 educational-0.539
 valuable-0.522
Token Ow
Token ID11960
Feature activation+3.63e-01

Pos logit contributions
zzy+0.018
rett+0.018
igi+0.018
bur+0.017
uit+0.017
Neg logit contributions
 These-0.024
 gloomy-0.023
 goals-0.022
ience-0.022
 happiest-0.022
Token!"
Token ID2474
Feature activation+4.99e-02

Pos logit contributions
rett+0.540
xc+0.520
bur+0.519
uit+0.509
zzy+0.497
Neg logit contributions
 happiest-0.568
 End-0.553
 gloomy-0.551
 educational-0.542
 valuable-0.527
Token Sam
Token ID3409
Feature activation+9.65e-02

Pos logit contributions
bur+0.075
xc+0.075
uit+0.075
rett+0.074
isk+0.071
Neg logit contributions
 gloomy-0.082
s-0.072
 miserable-0.072
 End-0.071
 bored-0.071
Token cried
Token ID16896
Feature activation+4.36e-02

Pos logit contributions
uzz+0.177
rett+0.175
xc+0.175
igi+0.173
bur+0.173
Neg logit contributions
s-0.114
 gloomy-0.113
ience-0.102
 sizes-0.100
 happiest-0.099
Token.
Token ID13
Feature activation-3.45e-02

Pos logit contributions
rett+0.104
bur+0.103
xc+0.102
igi+0.101
zip+0.101
Neg logit contributions
 gloomy-0.034
lessness-0.025
 happiest-0.025
 miserable-0.024
 bored-0.024
Token "
Token ID366
Feature activation-1.20e-01

Pos logit contributions
 gloomy+0.031
s+0.027
 sky+0.026
lessness+0.025
 End+0.025
Neg logit contributions
xc-0.077
bur-0.076
rett-0.076
zzy-0.073
igi-0.073
Token
Token ID198
Feature activation+9.55e-02

Pos logit contributions
xc+0.202
uit+0.189
rett+0.188
bur+0.187
 unw+0.183
Neg logit contributions
 End-0.102
 much-0.093
 gloomy-0.089
 goals-0.085
 valuable-0.085
Token"
Token ID1
Feature activation-1.11e-01

Pos logit contributions
xc+0.154
rett+0.153
bur+0.150
uit+0.148
uzz+0.142
Neg logit contributions
 End-0.138
 gloomy-0.128
�-0.121
 goals-0.121
 educational-0.120
TokenO
Token ID46
Feature activation+2.78e-01

Pos logit contributions
 End+0.161
 happiest+0.148
�+0.147
 gloomy+0.142
 sizes+0.130
Neg logit contributions
rett-0.185
uzz-0.178
rap-0.176
xc-0.176
onna-0.171
Tokenw
Token ID86
Feature activation+1.10e-01

Pos logit contributions
rett+0.345
uzz+0.340
xc+0.336
zzy+0.333
uit+0.331
Neg logit contributions
 gloomy-0.500
 happiest-0.487
 End-0.475
ience-0.467
 goals-0.462
Token!
Token ID0
Feature activation+1.53e-02

Pos logit contributions
uzz+0.268
zzy+0.266
Wh+0.266
rett+0.261
 unw+0.257
Neg logit contributions
 gloomy-0.061
ience-0.055
 happiest-0.052
 problem-0.052
 End-0.051
Token Ow
Token ID11960
Feature activation+3.63e-01

Pos logit contributions
zzy+0.024
igi+0.023
rett+0.023
bur+0.022
uit+0.022
Neg logit contributions
 These-0.023
 gloomy-0.023
s-0.021
ience-0.021
�-0.021
Token!
Token ID0
Feature activation+1.50e-02

Pos logit contributions
rett+0.597
xc+0.582
bur+0.577
uit+0.572
zzy+0.569
Neg logit contributions
 happiest-0.497
 gloomy-0.495
 End-0.479
 educational-0.466
 sizes-0.459
Token Ow
Token ID11960
Feature activation+3.63e-01

Pos logit contributions
zzy+0.016
rett+0.016
igi+0.016
uit+0.015
bur+0.015
Neg logit contributions
 These-0.030
 gloomy-0.029
ience-0.028
 needs-0.028
s-0.028
Token!"
Token ID2474
Feature activation+5.20e-02

Pos logit contributions
rett+0.575
xc+0.560
bur+0.556
uit+0.548
zzy+0.546
Neg logit contributions
 happiest-0.518
 gloomy-0.514
 End-0.501
 educational-0.487
 goals-0.479
Token Tom
Token ID4186
Feature activation+1.51e-01

Pos logit contributions
uit+0.085
xc+0.085
rett+0.084
bur+0.084
isk+0.081
Neg logit contributions
 gloomy-0.080
 miserable-0.070
 bored-0.070
 much-0.069
 lonely-0.069
Token and
Token ID290
Feature activation+5.69e-02

Pos logit contributions
uit+0.296
uzz+0.295
Wh+0.294
rett+0.292
bur+0.289
Neg logit contributions
s-0.166
 gloomy-0.158
ience-0.149
 goals-0.146
 bored-0.144
Token
Token ID198
Feature activation+8.80e-02

Pos logit contributions
xc+0.212
bur+0.196
 snatch+0.189
rett+0.188
uit+0.188
Neg logit contributions
s-0.070
 much-0.064
 End-0.062
 These-0.056
 Playing-0.055
Token"
Token ID1
Feature activation-1.17e-01

Pos logit contributions
xc+0.142
rett+0.141
bur+0.139
uit+0.133
uzz+0.133
Neg logit contributions
 End-0.125
 gloomy-0.118
�-0.113
 educational-0.112
ience-0.110
TokenO
Token ID46
Feature activation+2.82e-01

Pos logit contributions
 End+0.173
 happiest+0.169
 gloomy+0.164
 sizes+0.159
 goals+0.154
Neg logit contributions
rett-0.193
uzz-0.188
isk-0.178
igi-0.177
xc-0.175
Tokenw
Token ID86
Feature activation+1.17e-01

Pos logit contributions
rett+0.422
zzy+0.417
bur+0.416
xc+0.411
uzz+0.409
Neg logit contributions
 gloomy-0.447
 happiest-0.442
 goals-0.411
 End-0.408
 educational-0.400
Token!
Token ID0
Feature activation+1.33e-02

Pos logit contributions
uzz+0.250
zzy+0.249
rett+0.247
Wh+0.246
xc+0.243
Neg logit contributions
 gloomy-0.102
 happiest-0.093
ience-0.092
 End-0.089
 problem-0.085
Token Ow
Token ID11960
Feature activation+3.63e-01

Pos logit contributions
zzy+0.023
rett+0.023
bur+0.022
igi+0.022
uit+0.022
Neg logit contributions
 These-0.017
 gloomy-0.017
 happiest-0.015
 goals-0.015
ience-0.015
Token!
Token ID0
Feature activation+1.37e-02

Pos logit contributions
rett+0.545
xc+0.525
bur+0.523
uit+0.515
zzy+0.500
Neg logit contributions
 happiest-0.563
 End-0.549
 gloomy-0.545
 educational-0.539
 valuable-0.522
Token Ow
Token ID11960
Feature activation+3.63e-01

Pos logit contributions
zzy+0.018
rett+0.018
igi+0.018
bur+0.017
uit+0.017
Neg logit contributions
 These-0.024
 gloomy-0.023
 goals-0.022
ience-0.022
 happiest-0.022
Token!"
Token ID2474
Feature activation+4.99e-02

Pos logit contributions
rett+0.540
xc+0.520
bur+0.519
uit+0.509
zzy+0.497
Neg logit contributions
 happiest-0.568
 End-0.553
 gloomy-0.551
 educational-0.542
 valuable-0.527
Token Sam
Token ID3409
Feature activation+9.65e-02

Pos logit contributions
bur+0.075
xc+0.075
uit+0.075
rett+0.074
isk+0.071
Neg logit contributions
 gloomy-0.082
s-0.072
 miserable-0.072
 End-0.071
 bored-0.071
Token cried
Token ID16896
Feature activation+4.36e-02

Pos logit contributions
uzz+0.177
rett+0.175
xc+0.175
igi+0.173
bur+0.173
Neg logit contributions
s-0.114
 gloomy-0.113
ience-0.102
 sizes-0.100
 happiest-0.099
TokenO
Token ID46
Feature activation+2.78e-01

Pos logit contributions
 End+0.161
 happiest+0.148
�+0.147
 gloomy+0.142
 sizes+0.130
Neg logit contributions
rett-0.185
uzz-0.178
rap-0.176
xc-0.176
onna-0.171
Tokenw
Token ID86
Feature activation+1.10e-01

Pos logit contributions
rett+0.345
uzz+0.340
xc+0.336
zzy+0.333
uit+0.331
Neg logit contributions
 gloomy-0.500
 happiest-0.487
 End-0.475
ience-0.467
 goals-0.462
Token!
Token ID0
Feature activation+1.53e-02

Pos logit contributions
uzz+0.268
zzy+0.266
Wh+0.266
rett+0.261
 unw+0.257
Neg logit contributions
 gloomy-0.061
ience-0.055
 happiest-0.052
 problem-0.052
 End-0.051
Token Ow
Token ID11960
Feature activation+3.63e-01

Pos logit contributions
zzy+0.024
igi+0.023
rett+0.023
bur+0.022
uit+0.022
Neg logit contributions
 These-0.023
 gloomy-0.023
s-0.021
ience-0.021
�-0.021
Token!
Token ID0
Feature activation+1.50e-02

Pos logit contributions
rett+0.597
xc+0.582
bur+0.577
uit+0.572
zzy+0.569
Neg logit contributions
 happiest-0.497
 gloomy-0.495
 End-0.479
 educational-0.466
 sizes-0.459
Token Ow
Token ID11960
Feature activation+3.63e-01

Pos logit contributions
zzy+0.016
rett+0.016
igi+0.016
uit+0.015
bur+0.015
Neg logit contributions
 These-0.030
 gloomy-0.029
ience-0.028
 needs-0.028
s-0.028
Token!"
Token ID2474
Feature activation+5.20e-02

Pos logit contributions
rett+0.575
xc+0.560
bur+0.556
uit+0.548
zzy+0.546
Neg logit contributions
 happiest-0.518
 gloomy-0.514
 End-0.501
 educational-0.487
 goals-0.479
Token Tom
Token ID4186
Feature activation+1.51e-01

Pos logit contributions
uit+0.085
xc+0.085
rett+0.084
bur+0.084
isk+0.081
Neg logit contributions
 gloomy-0.080
 miserable-0.070
 bored-0.070
 much-0.069
 lonely-0.069
Token and
Token ID290
Feature activation+5.69e-02

Pos logit contributions
uit+0.296
uzz+0.295
Wh+0.294
rett+0.292
bur+0.289
Neg logit contributions
s-0.166
 gloomy-0.158
ience-0.149
 goals-0.146
 bored-0.144
Token Sue
Token ID26089
Feature activation+2.45e-01

Pos logit contributions
rett+0.095
uit+0.093
xc+0.090
rap+0.089
bur+0.089
Neg logit contributions
 gloomy-0.091
 lonely-0.082
 miserable-0.079
 boring-0.079
 ordinary-0.078
Token cried
Token ID16896
Feature activation+4.15e-02

Pos logit contributions
rett+0.390
uzz+0.386
uit+0.382
Wh+0.380
xc+0.378
Neg logit contributions
 gloomy-0.358
 happiest-0.338
ience-0.324
s-0.324
 End-0.323
Token his
Token ID465
Feature activation+5.23e-02

Pos logit contributions
 gloomy+0.096
 happiest+0.092
 End+0.091
ience+0.086
 goals+0.086
Neg logit contributions
rett-0.088
uzz-0.087
xc-0.087
zzy-0.086
Wh-0.085
Token knee
Token ID10329
Feature activation+1.34e-01

Pos logit contributions
rett+0.075
uzz+0.073
xc+0.073
uit+0.072
bur+0.072
Neg logit contributions
 gloomy-0.081
 happiest-0.079
 End-0.077
ience-0.076
 sizes-0.075
Token and
Token ID290
Feature activation+6.06e-02

Pos logit contributions
uit+0.216
bur+0.210
rett+0.207
uzz+0.205
 unw+0.203
Neg logit contributions
 End-0.191
 gloomy-0.188
 happiest-0.182
ience-0.180
 goals-0.177
Token a
Token ID257
Feature activation+6.13e-02

Pos logit contributions
rett+0.093
uzz+0.092
xc+0.091
zzy+0.091
uit+0.090
Neg logit contributions
 gloomy-0.090
 happiest-0.087
 End-0.085
ience-0.083
 goals-0.081
Token pops
Token ID26384
Feature activation+1.03e-01

Pos logit contributions
rett+0.094
uzz+0.093
xc+0.092
zzy+0.091
Wh+0.090
Neg logit contributions
 gloomy-0.092
 happiest-0.089
 End-0.086
ience-0.084
 valuable-0.082
Tokenicle
Token ID1548
Feature activation+3.61e-01

Pos logit contributions
zzy+0.084
Wh+0.081
xc+0.080
rett+0.076
 unw+0.076
Neg logit contributions
 gloomy-0.225
lessness-0.217
 End-0.212
 happiest-0.209
s-0.208
Token to
Token ID284
Feature activation+1.26e-01

Pos logit contributions
rett+0.650
uzz+0.643
uit+0.642
xc+0.634
bur+0.632
Neg logit contributions
 gloomy-0.433
 happiest-0.422
 End-0.419
ience-0.404
 goals-0.389
Token make
Token ID787
Feature activation+1.83e-03

Pos logit contributions
xc+0.210
rett+0.209
uzz+0.209
zzy+0.195
igi+0.193
Neg logit contributions
 gloomy-0.184
 happiest-0.175
 End-0.164
ience-0.157
 educational-0.155
Token him
Token ID683
Feature activation+8.84e-02

Pos logit contributions
rett+0.003
uzz+0.003
xc+0.003
zzy+0.003
uit+0.002
Neg logit contributions
 gloomy-0.003
 happiest-0.003
 End-0.003
ience-0.003
 goals-0.003
Token feel
Token ID1254
Feature activation+4.52e-02

Pos logit contributions
rett+0.145
uzz+0.144
xc+0.142
zzy+0.141
bur+0.140
Neg logit contributions
 gloomy-0.118
 happiest-0.115
 End-0.114
ience-0.112
 sizes-0.108
Token better
Token ID1365
Feature activation-7.08e-02

Pos logit contributions
 unw+0.065
zzy+0.062
Flo+0.060
 Sara+0.058
zip+0.058
Neg logit contributions
 These-0.066
ience-0.064
 sizes-0.064
 jets-0.063
 nation-0.062
Token a
Token ID257
Feature activation+5.86e-02

Pos logit contributions
xc+0.202
rett+0.202
uzz+0.197
bur+0.196
zzy+0.196
Neg logit contributions
 gloomy-0.190
 happiest-0.173
 End-0.170
 goals-0.169
 valuable-0.167
Token delicious
Token ID12625
Feature activation-6.85e-02

Pos logit contributions
 unw+0.100
rett+0.094
uit+0.094
zzy+0.091
Wh+0.091
Neg logit contributions
lessness-0.071
ience-0.071
 sizes-0.071
 End-0.070
 happiest-0.070
Token ice
Token ID4771
Feature activation+8.07e-02

Pos logit contributions
 gloomy+0.104
 happiest+0.102
 End+0.100
ience+0.099
 goals+0.097
Neg logit contributions
rett-0.101
uzz-0.099
xc-0.098
uit-0.098
bur-0.097
Token cream
Token ID8566
Feature activation+1.19e-01

Pos logit contributions
rett+0.090
uzz+0.088
bur+0.087
uit+0.087
zzy+0.086
Neg logit contributions
 gloomy-0.152
 happiest-0.151
 End-0.148
ience-0.146
 goals-0.144
Token pops
Token ID26384
Feature activation+1.02e-01

Pos logit contributions
uit+0.198
rett+0.197
bur+0.194
 unw+0.192
uzz+0.190
Neg logit contributions
 gloomy-0.158
 happiest-0.155
 End-0.152
ience-0.150
 goals-0.148
Tokenicle
Token ID1548
Feature activation+3.61e-01

Pos logit contributions
Wh+0.199
zzy+0.197
rett+0.195
xc+0.193
uzz+0.192
Neg logit contributions
 gloomy-0.109
s-0.096
 sizes-0.094
 happiest-0.093
lessness-0.092
Token!
Token ID0
Feature activation+1.36e-02

Pos logit contributions
rett+0.617
uit+0.611
bur+0.605
 unw+0.599
uzz+0.592
Neg logit contributions
 End-0.470
 happiest-0.464
 gloomy-0.458
ience-0.441
 educational-0.422
Token She
Token ID1375
Feature activation-2.89e-02

Pos logit contributions
xc+0.023
bur+0.023
rett+0.023
zzy+0.022
uzz+0.022
Neg logit contributions
 gloomy-0.019
 End-0.018
 These-0.017
 happiest-0.017
 valuable-0.017
Token was
Token ID373
Feature activation+5.11e-02

Pos logit contributions
 gloomy+0.035
ience+0.034
 happiest+0.034
 End+0.033
 sizes+0.032
Neg logit contributions
rett-0.051
uit-0.051
uzz-0.051
xc-0.051
Wh-0.051
Token so
Token ID523
Feature activation-4.15e-02

Pos logit contributions
rett+0.112
xc+0.111
uzz+0.110
uit+0.109
zzy+0.109
Neg logit contributions
 gloomy-0.044
 happiest-0.041
 End-0.039
ience-0.036
 goals-0.036
Token happy
Token ID3772
Feature activation-2.16e-01

Pos logit contributions
ience+0.040
 happiest+0.037
 sizes+0.037
 gloomy+0.037
 jets+0.036
Neg logit contributions
uzz-0.084
 unw-0.084
zzy-0.083
rett-0.081
xc-0.079
TokenO
Token ID46
Feature activation+2.71e-01

Pos logit contributions
 End+0.169
 gloomy+0.158
�+0.149
 days+0.136
 sky+0.134
Neg logit contributions
rett-0.206
uzz-0.205
rap-0.196
ern-0.190
isk-0.188
Tokenw
Token ID86
Feature activation+1.05e-01

Pos logit contributions
rett+0.291
xc+0.283
uzz+0.283
zzy+0.281
bur+0.280
Neg logit contributions
 gloomy-0.533
 happiest-0.527
 End-0.506
 goals-0.499
 sizes-0.498
Token!
Token ID0
Feature activation+1.57e-02

Pos logit contributions
uzz+0.260
zzy+0.259
Wh+0.256
rett+0.253
 unw+0.249
Neg logit contributions
 gloomy-0.060
 happiest-0.050
 End-0.050
 problem-0.050
ience-0.050
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
zzy+0.023
igi+0.021
uit+0.020
bur+0.020
rett+0.020
Neg logit contributions
 gloomy-0.027
 These-0.026
 miserable-0.025
s-0.025
 loves-0.024
Token!
Token ID0
Feature activation+1.58e-02

Pos logit contributions
rett+0.591
xc+0.577
bur+0.572
uit+0.566
zzy+0.555
Neg logit contributions
 happiest-0.496
 gloomy-0.490
 End-0.476
 educational-0.469
 sizes-0.462
Token Ow
Token ID11960
Feature activation+3.61e-01

Pos logit contributions
zzy+0.018
igi+0.018
rett+0.017
uit+0.016
bur+0.016
Neg logit contributions
 gloomy-0.032
 These-0.032
 needs-0.030
 loves-0.029
 miserable-0.029
Token!"
Token ID2474
Feature activation+5.13e-02

Pos logit contributions
rett+0.572
xc+0.558
bur+0.555
uit+0.547
zzy+0.538
Neg logit contributions
 happiest-0.518
 gloomy-0.510
 End-0.497
 educational-0.489
 sizes-0.479
Token Tom
Token ID4186
Feature activation+1.48e-01

Pos logit contributions
xc+0.094
uit+0.092
bur+0.092
rett+0.091
isk+0.088
Neg logit contributions
 gloomy-0.073
 miserable-0.063
 bored-0.061
 that-0.061
 much-0.061
Token cries
Token ID24691
Feature activation+1.79e-03

Pos logit contributions
uit+0.311
uzz+0.311
rett+0.309
igi+0.302
Wh+0.302
Neg logit contributions
s-0.141
 gloomy-0.138
 goals-0.130
ience-0.122
 bored-0.122
Token.
Token ID13
Feature activation-3.25e-02

Pos logit contributions
rett+0.004
bur+0.004
xc+0.004
uit+0.004
uzz+0.004
Neg logit contributions
 gloomy-0.001
lessness-0.001
s-0.001
 End-0.001
 happiest-0.001
Token
Token ID198
Feature activation+9.74e-02

Pos logit contributions
 gloomy+0.030
 sky+0.024
 End+0.024
lessness+0.024
s+0.024
Neg logit contributions
xc-0.074
rett-0.072
bur-0.072
zzy-0.069
igi-0.069
Token.
Token ID13
Feature activation-3.07e-02

Pos logit contributions
rett+0.059
uit+0.058
uzz+0.058
bur+0.056
xc+0.056
Neg logit contributions
 gloomy-0.034
 happiest-0.032
 End-0.032
ience-0.031
 educational-0.030
Token Every
Token ID3887
Feature activation+1.74e-02

Pos logit contributions
 gloomy+0.037
 happiest+0.036
 End+0.035
ience+0.034
 goals+0.033
Neg logit contributions
rett-0.055
xc-0.055
zzy-0.054
bur-0.054
uit-0.054
Token morning
Token ID3329
Feature activation-1.55e-01

Pos logit contributions
rett+0.036
zzy+0.036
uzz+0.035
 unw+0.035
xc+0.034
Neg logit contributions
 sizes-0.014
 happiest-0.013
leness-0.012
ience-0.012
�-0.012
Token,
Token ID11
Feature activation-6.55e-02

Pos logit contributions
 gloomy+0.156
 happiest+0.154
 End+0.149
 goals+0.145
 sizes+0.143
Neg logit contributions
rett-0.305
zzy-0.300
uit-0.299
uzz-0.295
xc-0.294
Token Ze
Token ID9033
Feature activation+8.56e-02

Pos logit contributions
 gloomy+0.071
 happiest+0.067
 End+0.063
ience+0.061
 sizes+0.059
Neg logit contributions
uzz-0.128
rett-0.128
xc-0.127
zzy-0.125
uit-0.125
Tokenke
Token ID365
Feature activation+3.61e-01

Pos logit contributions
uzz+0.084
xc+0.080
bur+0.079
Wh+0.079
rett+0.078
Neg logit contributions
 gloomy-0.173
ience-0.173
 happiest-0.171
 End-0.169
 valuable-0.161
Token cooked
Token ID15847
Feature activation+4.17e-02

Pos logit contributions
rett+0.587
zzy+0.582
xc+0.576
uzz+0.565
bur+0.564
Neg logit contributions
 gloomy-0.496
 happiest-0.477
 End-0.466
 sizes-0.463
 educational-0.455
Token some
Token ID617
Feature activation+3.55e-02

Pos logit contributions
rett+0.064
uzz+0.063
xc+0.063
Wh+0.063
zzy+0.062
Neg logit contributions
 gloomy-0.063
 happiest-0.060
 End-0.059
 goals-0.057
ience-0.057
Token delicious
Token ID12625
Feature activation-6.72e-02

Pos logit contributions
 unw+0.059
rett+0.055
uit+0.055
zzy+0.055
uzz+0.054
Neg logit contributions
 End-0.045
 happiest-0.044
 gloomy-0.044
ience-0.043
lessness-0.041
Token food
Token ID2057
Feature activation-2.53e-02

Pos logit contributions
 happiest+0.106
lessness+0.103
 End+0.102
 gloomy+0.102
ience+0.099
Neg logit contributions
 unw-0.098
bur-0.088
rett-0.088
uit-0.086
zzy-0.085
Token for
Token ID329
Feature activation-6.12e-02

Pos logit contributions
 End+0.038
 gloomy+0.038
ience+0.037
 happiest+0.037
 goals+0.036
Neg logit contributions
uit-0.039
 unw-0.037
bur-0.036
uzz-0.036
rett-0.036
TokenO
Token ID46
Feature activation+2.73e-01

Pos logit contributions
 End+0.162
 gloomy+0.145
 happiest+0.139
�+0.132
 sky+0.127
Neg logit contributions
rett-0.218
uzz-0.213
ippy-0.202
rap-0.202
isk-0.199
Tokenw
Token ID86
Feature activation+1.07e-01

Pos logit contributions
rett+0.321
uzz+0.310
xc+0.310
zzy+0.310
bur+0.307
Neg logit contributions
 gloomy-0.514
 happiest-0.504
 End-0.486
 goals-0.478
 sizes-0.475
Token!
Token ID0
Feature activation+1.21e-02

Pos logit contributions
uzz+0.248
zzy+0.247
Wh+0.246
rett+0.238
xc+0.236
Neg logit contributions
 gloomy-0.077
ience-0.068
 happiest-0.067
 problem-0.067
 End-0.065
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
zzy+0.021
igi+0.020
bur+0.020
uit+0.019
rett+0.019
Neg logit contributions
 These-0.018
 gloomy-0.017
 Those-0.015
ience-0.015
s-0.015
Token!
Token ID0
Feature activation+1.21e-02

Pos logit contributions
rett+0.583
xc+0.562
bur+0.556
uit+0.551
zzy+0.545
Neg logit contributions
 happiest-0.510
 gloomy-0.504
 End-0.493
 educational-0.486
 sizes-0.474
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
zzy+0.017
igi+0.016
rett+0.016
bur+0.015
uit+0.015
Neg logit contributions
 These-0.022
 gloomy-0.021
 Those-0.020
ience-0.020
 needs-0.019
Token!"
Token ID2474
Feature activation+5.00e-02

Pos logit contributions
rett+0.572
xc+0.550
bur+0.546
uit+0.539
zzy+0.535
Neg logit contributions
 happiest-0.523
 gloomy-0.516
 End-0.506
 educational-0.497
 sizes-0.483
Token Sam
Token ID3409
Feature activation+9.42e-02

Pos logit contributions
uit+0.086
bur+0.086
xc+0.086
rett+0.085
isk+0.082
Neg logit contributions
 gloomy-0.074
 miserable-0.064
 bored-0.063
s-0.063
 that-0.062
Token cried
Token ID16896
Feature activation+3.83e-02

Pos logit contributions
uzz+0.166
igi+0.163
rett+0.162
uit+0.161
bur+0.161
Neg logit contributions
s-0.122
 gloomy-0.120
ience-0.109
 happiest-0.106
 bored-0.106
Token.
Token ID13
Feature activation-3.58e-02

Pos logit contributions
rett+0.091
bur+0.090
xc+0.089
igi+0.088
uit+0.088
Neg logit contributions
 gloomy-0.030
lessness-0.023
 happiest-0.022
 miserable-0.021
 End-0.021
Token He
Token ID679
Feature activation+2.11e-02

Pos logit contributions
 gloomy+0.034
s+0.029
 End+0.028
lessness+0.028
 sky+0.028
Neg logit contributions
xc-0.078
bur-0.078
rett-0.077
zzy-0.075
igi-0.075
Token found
Token ID1043
Feature activation+7.44e-02

Pos logit contributions
 gloomy+0.043
 happiest+0.041
 End+0.040
ience+0.040
 goals+0.038
Neg logit contributions
rett-0.054
uzz-0.054
xc-0.053
uit-0.053
Wh-0.053
Token a
Token ID257
Feature activation+5.55e-02

Pos logit contributions
rett+0.118
uzz+0.116
xc+0.116
zzy+0.115
Wh+0.114
Neg logit contributions
 gloomy-0.107
 happiest-0.104
 End-0.100
ience-0.098
 goals-0.096
Token special
Token ID2041
Feature activation-1.64e-01

Pos logit contributions
rett+0.106
uit+0.105
uzz+0.104
 unw+0.104
zzy+0.103
Neg logit contributions
 gloomy-0.057
 End-0.056
 happiest-0.056
ience-0.055
 goals-0.053
Token pink
Token ID11398
Feature activation-5.08e-02

Pos logit contributions
 gloomy+0.223
 happiest+0.212
 End+0.204
ience+0.197
 valuable+0.195
Neg logit contributions
rett-0.276
uzz-0.274
xc-0.273
Wh-0.270
zzy-0.270
Token pops
Token ID26384
Feature activation+1.01e-01

Pos logit contributions
 gloomy+0.072
 happiest+0.070
 End+0.068
ience+0.067
 goals+0.065
Neg logit contributions
rett-0.081
uzz-0.080
xc-0.079
zzy-0.078
uit-0.078
Tokenicle
Token ID1548
Feature activation+3.60e-01

Pos logit contributions
xc+0.113
zzy+0.112
rett+0.111
Wh+0.110
uzz+0.108
Neg logit contributions
 gloomy-0.190
 happiest-0.180
 End-0.178
 sizes-0.175
lessness-0.171
Token inside
Token ID2641
Feature activation+8.07e-02

Pos logit contributions
uit+0.611
rett+0.592
 unw+0.579
bur+0.577
uzz+0.568
Neg logit contributions
 End-0.493
 gloomy-0.481
 happiest-0.477
ience-0.467
 goals-0.455
Token the
Token ID262
Feature activation-7.26e-02

Pos logit contributions
uit+0.122
 unw+0.120
rett+0.115
bur+0.114
uzz+0.114
Neg logit contributions
ience-0.124
 happiest-0.119
 gloomy-0.116
leness-0.113
 End-0.113
Token freezer
Token ID30967
Feature activation+1.21e-01

Pos logit contributions
ience+0.134
 gloomy+0.133
leness+0.132
 happiest+0.132
 sizes+0.130
Neg logit contributions
bur-0.079
rett-0.079
uit-0.079
 unw-0.078
xc-0.076
Token!
Token ID0
Feature activation+9.73e-03

Pos logit contributions
uit+0.201
 unw+0.195
rett+0.193
uzz+0.190
bur+0.189
Neg logit contributions
 gloomy-0.170
 happiest-0.168
ience-0.167
 goals-0.167
leness-0.167
Token She
Token ID1375
Feature activation-3.30e-02

Pos logit contributions
xc+0.018
bur+0.018
rett+0.018
zzy+0.018
uit+0.018
Neg logit contributions
 gloomy-0.011
 happiest-0.010
 End-0.010
 These-0.010
 valuable-0.010
TokenO
Token ID46
Feature activation+2.73e-01

Pos logit contributions
 End+0.174
 gloomy+0.163
 sky+0.161
�+0.152
 days+0.150
Neg logit contributions
rett-0.219
uzz-0.212
onna-0.198
rap-0.197
xc-0.192
Tokenw
Token ID86
Feature activation+1.10e-01

Pos logit contributions
rett+0.292
bur+0.286
zzy+0.285
xc+0.278
uzz+0.276
Neg logit contributions
 happiest-0.544
 gloomy-0.544
 End-0.514
 goals-0.511
 sizes-0.506
Token!
Token ID0
Feature activation+8.55e-03

Pos logit contributions
uzz+0.254
zzy+0.252
Wh+0.250
rett+0.246
xc+0.245
Neg logit contributions
 gloomy-0.075
ience-0.065
 End-0.064
 problem-0.064
 happiest-0.064
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
zzy+0.008
rett+0.007
igi+0.007
uit+0.007
bur+0.007
Neg logit contributions
 gloomy-0.018
 These-0.018
 happiest-0.017
 End-0.017
s-0.017
Token!
Token ID0
Feature activation+8.96e-03

Pos logit contributions
rett+0.599
xc+0.575
bur+0.574
uit+0.573
zzy+0.563
Neg logit contributions
 happiest-0.497
 gloomy-0.491
 End-0.478
 educational-0.467
 sizes-0.454
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
zzy+0.009
rett+0.008
igi+0.008
uit+0.008
bur+0.008
Neg logit contributions
 These-0.019
 gloomy-0.019
 happiest-0.017
 End-0.017
 needs-0.017
Token!"
Token ID2474
Feature activation+4.76e-02

Pos logit contributions
rett+0.589
xc+0.565
bur+0.563
uit+0.562
zzy+0.554
Neg logit contributions
 happiest-0.509
 gloomy-0.501
 End-0.490
 educational-0.476
 valuable-0.463
Token Tom
Token ID4186
Feature activation+1.43e-01

Pos logit contributions
xc+0.090
bur+0.089
uit+0.089
rett+0.088
isk+0.085
Neg logit contributions
 gloomy-0.065
 miserable-0.056
 bored-0.055
 that-0.054
 lonely-0.054
Token screamed
Token ID25421
Feature activation+3.76e-03

Pos logit contributions
xc+0.262
uzz+0.262
uit+0.262
rett+0.262
bur+0.255
Neg logit contributions
s-0.186
 gloomy-0.165
 very-0.152
 needed-0.150
 bored-0.150
Token.
Token ID13
Feature activation-3.90e-02

Pos logit contributions
bur+0.009
rett+0.009
xc+0.009
igi+0.009
zzy+0.009
Neg logit contributions
 gloomy-0.003
 miserable-0.002
 End-0.002
 bored-0.002
lessness-0.002
Token He
Token ID679
Feature activation+1.83e-02

Pos logit contributions
 gloomy+0.026
 sky+0.021
 much+0.019
lessness+0.018
 that+0.018
Neg logit contributions
xc-0.102
bur-0.100
rett-0.098
igi-0.096
zzy-0.095
TokenO
Token ID46
Feature activation+2.73e-01

Pos logit contributions
 End+0.168
 gloomy+0.156
 happiest+0.148
 sky+0.140
�+0.139
Neg logit contributions
rett-0.194
uzz-0.186
rap-0.178
isk-0.172
ippy-0.171
Tokenw
Token ID86
Feature activation+1.08e-01

Pos logit contributions
rett+0.256
zzy+0.255
bur+0.250
Jen+0.244
xc+0.243
Neg logit contributions
 happiest-0.587
 gloomy-0.581
 goals-0.551
 educational-0.543
 sizes-0.542
Token!
Token ID0
Feature activation+1.34e-02

Pos logit contributions
uzz+0.250
zzy+0.249
Wh+0.247
rett+0.244
xc+0.242
Neg logit contributions
 gloomy-0.076
 happiest-0.068
ience-0.068
 End-0.065
 problem-0.062
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
zzy+0.022
bur+0.022
igi+0.022
rett+0.021
uit+0.021
Neg logit contributions
 gloomy-0.019
 These-0.019
s-0.018
 miserable-0.017
ience-0.017
Token!
Token ID0
Feature activation+1.32e-02

Pos logit contributions
rett+0.565
xc+0.542
bur+0.534
uit+0.533
zzy+0.518
Neg logit contributions
 happiest-0.534
 End-0.517
 gloomy-0.516
 educational-0.513
 valuable-0.496
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
igi+0.015
bur+0.015
zzy+0.015
rett+0.015
uit+0.014
Neg logit contributions
 These-0.026
 gloomy-0.025
 Those-0.023
ience-0.023
 needs-0.023
Token!"
Token ID2474
Feature activation+5.18e-02

Pos logit contributions
rett+0.556
xc+0.533
bur+0.528
uit+0.524
zzy+0.511
Neg logit contributions
 happiest-0.544
 End-0.526
 gloomy-0.525
 educational-0.519
 valuable-0.505
Token they
Token ID484
Feature activation-1.86e-01

Pos logit contributions
xc+0.089
bur+0.088
rett+0.087
uit+0.087
Wh+0.082
Neg logit contributions
 gloomy-0.078
 End-0.067
 miserable-0.067
lessness-0.065
 bored-0.065
Token cried
Token ID16896
Feature activation+3.91e-02

Pos logit contributions
 gloomy+0.297
 happiest+0.273
lessness+0.264
s+0.262
 sizes+0.259
Neg logit contributions
rett-0.276
uzz-0.273
bur-0.272
xc-0.270
uit-0.270
Token and
Token ID290
Feature activation+5.59e-02

Pos logit contributions
rett+0.084
xc+0.083
bur+0.082
uzz+0.082
zzy+0.082
Neg logit contributions
 gloomy-0.037
 happiest-0.032
 End-0.031
 goals-0.030
lessness-0.030
Token let
Token ID1309
Feature activation+1.81e-01

Pos logit contributions
rett+0.083
xc+0.081
uit+0.080
uzz+0.080
bur+0.079
Neg logit contributions
 gloomy-0.092
 happiest-0.086
lessness-0.082
ience-0.082
 bored-0.081
Token
Token ID198
Feature activation+9.29e-02

Pos logit contributions
xc+0.212
uit+0.199
bur+0.196
rett+0.195
 unw+0.188
Neg logit contributions
 End-0.089
 much-0.088
 gloomy-0.083
s-0.080
 valuable-0.077
Token"
Token ID1
Feature activation-1.16e-01

Pos logit contributions
rett+0.163
xc+0.163
uit+0.160
bur+0.159
 unw+0.154
Neg logit contributions
 End-0.122
 gloomy-0.117
ience-0.107
�-0.105
lessness-0.105
TokenO
Token ID46
Feature activation+2.71e-01

Pos logit contributions
 End+0.169
 gloomy+0.158
�+0.149
 days+0.136
 sky+0.134
Neg logit contributions
rett-0.206
uzz-0.205
rap-0.196
ern-0.190
isk-0.188
Tokenw
Token ID86
Feature activation+1.05e-01

Pos logit contributions
rett+0.291
xc+0.283
uzz+0.283
zzy+0.281
bur+0.280
Neg logit contributions
 gloomy-0.533
 happiest-0.527
 End-0.506
 goals-0.499
 sizes-0.498
Token!
Token ID0
Feature activation+1.57e-02

Pos logit contributions
uzz+0.260
zzy+0.259
Wh+0.256
rett+0.253
 unw+0.249
Neg logit contributions
 gloomy-0.060
 happiest-0.050
 End-0.050
 problem-0.050
ience-0.050
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
zzy+0.023
igi+0.021
uit+0.020
bur+0.020
rett+0.020
Neg logit contributions
 gloomy-0.027
 These-0.026
 miserable-0.025
s-0.025
 loves-0.024
Token!
Token ID0
Feature activation+1.58e-02

Pos logit contributions
rett+0.591
xc+0.577
bur+0.572
uit+0.566
zzy+0.555
Neg logit contributions
 happiest-0.496
 gloomy-0.490
 End-0.476
 educational-0.469
 sizes-0.462
Token Ow
Token ID11960
Feature activation+3.61e-01

Pos logit contributions
zzy+0.018
igi+0.018
rett+0.017
uit+0.016
bur+0.016
Neg logit contributions
 gloomy-0.032
 These-0.032
 needs-0.030
 loves-0.029
 miserable-0.029
Token!"
Token ID2474
Feature activation+5.13e-02

Pos logit contributions
rett+0.572
xc+0.558
bur+0.555
uit+0.547
zzy+0.538
Neg logit contributions
 happiest-0.518
 gloomy-0.510
 End-0.497
 educational-0.489
 sizes-0.479
Token Tom
Token ID4186
Feature activation+1.48e-01

Pos logit contributions
xc+0.094
uit+0.092
bur+0.092
rett+0.091
isk+0.088
Neg logit contributions
 gloomy-0.073
 miserable-0.063
 bored-0.061
 that-0.061
 much-0.061
Token cries
Token ID24691
Feature activation+1.79e-03

Pos logit contributions
uit+0.311
uzz+0.311
rett+0.309
igi+0.302
Wh+0.302
Neg logit contributions
s-0.141
 gloomy-0.138
 goals-0.130
ience-0.122
 bored-0.122
Token the
Token ID262
Feature activation-7.52e-02

Pos logit contributions
rett+0.031
xc+0.030
uzz+0.030
zzy+0.030
Wh+0.030
Neg logit contributions
 gloomy-0.025
 happiest-0.024
 End-0.024
ience-0.023
 goals-0.023
Token bowls
Token ID34796
Feature activation-5.88e-02

Pos logit contributions
 gloomy+0.101
 happiest+0.099
ience+0.099
 End+0.096
lessness+0.095
Neg logit contributions
rett-0.120
bur-0.119
uzz-0.118
uit-0.118
xc-0.118
Token and
Token ID290
Feature activation+5.24e-02

Pos logit contributions
ience+0.090
 gloomy+0.089
 happiest+0.089
 End+0.086
leness+0.085
Neg logit contributions
uzz-0.086
uit-0.085
bur-0.084
zzy-0.084
rett-0.083
Token the
Token ID262
Feature activation-7.41e-02

Pos logit contributions
rett+0.089
uzz+0.087
xc+0.086
uit+0.085
zzy+0.085
Neg logit contributions
 gloomy-0.073
 happiest-0.069
 End-0.068
ience-0.065
 sizes-0.064
Token pops
Token ID26384
Feature activation+1.04e-01

Pos logit contributions
 gloomy+0.095
 happiest+0.093
ience+0.092
 End+0.090
lessness+0.087
Neg logit contributions
rett-0.123
uzz-0.121
xc-0.121
bur-0.120
uit-0.120
Tokenicle
Token ID1548
Feature activation+3.60e-01

Pos logit contributions
Wh+0.087
zzy+0.082
rett+0.082
xc+0.081
uzz+0.077
Neg logit contributions
 gloomy-0.226
 End-0.215
 sizes-0.214
 happiest-0.214
s-0.214
Token sticks
Token ID16461
Feature activation+1.15e-02

Pos logit contributions
rett+0.559
uzz+0.556
uit+0.555
bur+0.551
 unw+0.547
Neg logit contributions
 End-0.507
 happiest-0.506
 gloomy-0.501
ience-0.497
leness-0.478
Token on
Token ID319
Feature activation+6.53e-02

Pos logit contributions
uzz+0.019
rett+0.018
uit+0.018
zzy+0.018
bur+0.018
Neg logit contributions
 gloomy-0.016
 happiest-0.016
 End-0.015
ience-0.015
 sizes-0.014
Token the
Token ID262
Feature activation-7.34e-02

Pos logit contributions
rett+0.135
xc+0.134
uzz+0.130
zzy+0.129
igi+0.127
Neg logit contributions
 gloomy-0.071
 goals-0.062
 End-0.062
 sky-0.057
 days-0.057
Token table
Token ID3084
Feature activation-9.53e-02

Pos logit contributions
 gloomy+0.092
ience+0.091
 happiest+0.090
 End+0.089
leness+0.088
Neg logit contributions
rett-0.122
bur-0.122
 unw-0.122
uit-0.122
uzz-0.119
Token.
Token ID13
Feature activation-3.45e-02

Pos logit contributions
 happiest+0.120
 gloomy+0.118
ience+0.116
 End+0.114
leness+0.108
Neg logit contributions
uzz-0.167
uit-0.165
rett-0.164
 unw-0.164
zzy-0.161
Token
Token ID198
Feature activation+8.50e-02

Pos logit contributions
xc+0.211
uit+0.194
bur+0.193
rett+0.192
icle+0.187
Neg logit contributions
 much-0.072
s-0.068
 End-0.059
 gloomy-0.056
 valuable-0.056
Token"
Token ID1
Feature activation-1.23e-01

Pos logit contributions
xc+0.152
rett+0.152
bur+0.147
uit+0.146
uzz+0.146
Neg logit contributions
 gloomy-0.107
 End-0.107
ience-0.096
lessness-0.096
 valuable-0.095
TokenO
Token ID46
Feature activation+2.73e-01

Pos logit contributions
 End+0.174
 gloomy+0.163
 sky+0.161
�+0.152
 days+0.150
Neg logit contributions
rett-0.219
uzz-0.212
onna-0.198
rap-0.197
xc-0.192
Tokenw
Token ID86
Feature activation+1.10e-01

Pos logit contributions
rett+0.292
bur+0.286
zzy+0.285
xc+0.278
uzz+0.276
Neg logit contributions
 happiest-0.544
 gloomy-0.544
 End-0.514
 goals-0.511
 sizes-0.506
Token!
Token ID0
Feature activation+8.55e-03

Pos logit contributions
uzz+0.254
zzy+0.252
Wh+0.250
rett+0.246
xc+0.245
Neg logit contributions
 gloomy-0.075
ience-0.065
 End-0.064
 problem-0.064
 happiest-0.064
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
zzy+0.008
rett+0.007
igi+0.007
uit+0.007
bur+0.007
Neg logit contributions
 gloomy-0.018
 These-0.018
 happiest-0.017
 End-0.017
s-0.017
Token!
Token ID0
Feature activation+8.96e-03

Pos logit contributions
rett+0.599
xc+0.575
bur+0.574
uit+0.573
zzy+0.563
Neg logit contributions
 happiest-0.497
 gloomy-0.491
 End-0.478
 educational-0.467
 sizes-0.454
Token Ow
Token ID11960
Feature activation+3.60e-01

Pos logit contributions
zzy+0.009
rett+0.008
igi+0.008
uit+0.008
bur+0.008
Neg logit contributions
 These-0.019
 gloomy-0.019
 happiest-0.017
 End-0.017
 needs-0.017
Token!"
Token ID2474
Feature activation+4.76e-02

Pos logit contributions
rett+0.589
xc+0.565
bur+0.563
uit+0.562
zzy+0.554
Neg logit contributions
 happiest-0.509
 gloomy-0.501
 End-0.490
 educational-0.476
 valuable-0.463
Token Tom
Token ID4186
Feature activation+1.43e-01

Pos logit contributions
xc+0.090
bur+0.089
uit+0.089
rett+0.088
isk+0.085
Neg logit contributions
 gloomy-0.065
 miserable-0.056
 bored-0.055
 that-0.054
 lonely-0.054
Token screamed
Token ID25421
Feature activation+3.76e-03

Pos logit contributions
xc+0.262
uzz+0.262
uit+0.262
rett+0.262
bur+0.255
Neg logit contributions
s-0.186
 gloomy-0.165
 very-0.152
 needed-0.150
 bored-0.150
Token.
Token ID13
Feature activation-3.95e-02

Pos logit contributions
zzy+0.234
rett+0.231
uzz+0.230
bur+0.229
 unw+0.227
Neg logit contributions
 End-0.213
ience-0.213
 happiest-0.211
 gloomy-0.211
lessness-0.205
Token
Token ID220
Feature activation-7.77e-02

Pos logit contributions
 gloomy+0.061
 happiest+0.059
ience+0.058
 problem+0.057
 goals+0.057
Neg logit contributions
uzz-0.059
rett-0.056
Wh-0.056
zzy-0.055
Flo-0.054
Token
Token ID198
Feature activation+7.97e-02

Pos logit contributions
 Playing+0.072
 These+0.071
s+0.069
 gloomy+0.069
 happiest+0.068
Neg logit contributions
xc-0.158
rett-0.150
zzy-0.149
bur-0.147
 snatch-0.147
Token
Token ID198
Feature activation+7.94e-02

Pos logit contributions
xc+0.164
rett+0.152
bur+0.152
uit+0.152
icle+0.145
Neg logit contributions
 End-0.089
 gloomy-0.082
 valuable-0.082
 much-0.081
s-0.080
TokenThe
Token ID464
Feature activation-1.60e-02

Pos logit contributions
rett+0.127
xc+0.124
uzz+0.124
bur+0.122
uit+0.122
Neg logit contributions
 gloomy-0.115
 End-0.115
 happiest-0.111
 valuable-0.107
ience-0.107
Token End
Token ID5268
Feature activation-3.07e-01

Pos logit contributions
 gloomy+0.018
 happiest+0.018
ience+0.018
 sizes+0.017
 goals+0.017
Neg logit contributions
Wh-0.028
uzz-0.028
rett-0.028
zzy-0.028
uit-0.028
Token.
Token ID13
Feature activation-3.98e-02

Pos logit contributions
 gloomy+0.365
 happiest+0.356
 End+0.341
ience+0.332
 goals+0.322
Neg logit contributions
rett-0.561
uzz-0.556
zzy-0.552
xc-0.547
uit-0.546
Token<|endoftext|>
Token ID50256
Feature activation-1.37e-02

Pos logit contributions
 End+0.048
 gloomy+0.047
 happiest+0.044
lessness+0.044
 These+0.043
Neg logit contributions
xc-0.075
rett-0.073
zzy-0.072
uzz-0.070
zip-0.070
TokenOnce
Token ID7454
Feature activation+1.09e-01

Pos logit contributions
 sky+0.014
 gloomy+0.014
 End+0.013
s+0.013
 day+0.013
Neg logit contributions
xc-0.028
uzz-0.027
rett-0.027
ck-0.027
Wh-0.026
Token upon
Token ID2402
Feature activation+1.26e-01

Pos logit contributions
rett+0.115
uzz+0.113
xc+0.108
bur+0.106
zzy+0.106
Neg logit contributions
 happiest-0.213
 gloomy-0.212
ience-0.208
 sizes-0.206
 End-0.205
Token a
Token ID257
Feature activation+5.40e-02

Pos logit contributions
xc+0.233
rett+0.233
uzz+0.227
uit+0.226
bur+0.225
Neg logit contributions
 gloomy-0.172
 End-0.147
 much-0.147
 happiest-0.144
 ordinary-0.140
Token.
Token ID13
Feature activation-3.72e-02

Pos logit contributions
 unw+0.202
rett+0.188
uit+0.188
zzy+0.185
uzz+0.180
Neg logit contributions
 End-0.258
 happiest-0.257
ience-0.247
 These-0.242
 educational-0.239
Token
Token ID220
Feature activation-7.48e-02

Pos logit contributions
 gloomy+0.047
 happiest+0.045
 End+0.043
ience+0.042
 goals+0.041
Neg logit contributions
rett-0.066
uzz-0.066
uit-0.064
Wh-0.064
xc-0.064
Token
Token ID198
Feature activation+8.02e-02

Pos logit contributions
 These+0.080
 gloomy+0.080
 Playing+0.078
 happiest+0.078
s+0.078
Neg logit contributions
xc-0.138
bur-0.133
zzy-0.131
rett-0.131
uzz-0.129
Token
Token ID198
Feature activation+8.00e-02

Pos logit contributions
xc+0.156
bur+0.150
rett+0.146
uzz+0.146
uit+0.145
Neg logit contributions
 End-0.090
 valuable-0.086
 gloomy-0.085
s-0.083
�-0.081
TokenThe
Token ID464
Feature activation-1.50e-02

Pos logit contributions
rett+0.112
uzz+0.110
xc+0.109
zzy+0.108
uit+0.108
Neg logit contributions
 gloomy-0.131
 happiest-0.127
 End-0.124
ience-0.122
 goals-0.120
Token End
Token ID5268
Feature activation-3.07e-01

Pos logit contributions
 gloomy+0.021
 End+0.020
 happiest+0.020
ience+0.019
 valuable+0.018
Neg logit contributions
uzz-0.026
rett-0.025
xc-0.025
zzy-0.025
uit-0.025
Token.
Token ID13
Feature activation-3.78e-02

Pos logit contributions
 happiest+0.385
 gloomy+0.381
 End+0.361
ience+0.350
 goals+0.339
Neg logit contributions
zzy-0.543
rett-0.543
uzz-0.537
uit-0.525
Wh-0.521
Token<|endoftext|>
Token ID50256
Feature activation-1.40e-02

Pos logit contributions
 End+0.044
 gloomy+0.044
 These+0.042
lessness+0.042
 happiest+0.042
Neg logit contributions
xc-0.070
zzy-0.068
rett-0.068
uzz-0.066
 unw-0.065
TokenOnce
Token ID7454
Feature activation+1.10e-01

Pos logit contributions
 gloomy+0.016
 sky+0.015
 End+0.015
s+0.015
 day+0.014
Neg logit contributions
xc-0.027
uzz-0.026
rett-0.026
Wh-0.026
zzy-0.025
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.116
uzz+0.114
xc+0.110
zzy+0.108
bur+0.107
Neg logit contributions
 happiest-0.211
 gloomy-0.211
ience-0.206
 sizes-0.204
 End-0.204
Token a
Token ID257
Feature activation+5.59e-02

Pos logit contributions
xc+0.235
rett+0.235
uzz+0.230
bur+0.228
igi+0.228
Neg logit contributions
 gloomy-0.172
 End-0.146
 much-0.146
 happiest-0.145
 ordinary-0.139
Token accept
Token ID2453
Feature activation+8.20e-02

Pos logit contributions
rett+0.002
uzz+0.002
xc+0.002
uit+0.002
zzy+0.002
Neg logit contributions
 gloomy-0.002
 happiest-0.002
 End-0.002
ience-0.002
lessness-0.002
Token each
Token ID1123
Feature activation-7.69e-02

Pos logit contributions
uzz+0.094
bur+0.091
 Sara+0.091
 unw+0.089
rett+0.089
Neg logit contributions
 End-0.139
 gloomy-0.138
 tallest-0.133
 These-0.132
lessness-0.132
Token other
Token ID584
Feature activation-1.62e-01

Pos logit contributions
 gloomy+0.066
 happiest+0.062
 End+0.060
ience+0.057
 goals+0.056
Neg logit contributions
rett-0.168
uzz-0.167
xc-0.165
zzy-0.164
uit-0.164
Token.
Token ID13
Feature activation-3.80e-02

Pos logit contributions
umm+0.271
 These+0.263
 tallest+0.262
 End+0.261
 gloomy+0.261
Neg logit contributions
uzz-0.206
rett-0.197
uit-0.187
 aid-0.186
 unw-0.181
Token The
Token ID383
Feature activation-1.97e-02

Pos logit contributions
 gloomy+0.063
 happiest+0.061
ience+0.059
 End+0.059
 sizes+0.057
Neg logit contributions
uzz-0.053
rett-0.052
Wh-0.050
uit-0.050
xc-0.050
Token End
Token ID5268
Feature activation-3.07e-01

Pos logit contributions
lessness+0.019
ience+0.019
leness+0.019
 gloomy+0.018
 sizes+0.018
Neg logit contributions
Wh-0.037
 unw-0.036
uit-0.035
bur-0.035
uzz-0.035
Token.
Token ID13
Feature activation-3.87e-02

Pos logit contributions
 gloomy+0.365
 happiest+0.363
 End+0.342
ience+0.332
 sizes+0.321
Neg logit contributions
rett-0.558
zzy-0.554
uzz-0.554
uit-0.542
xc-0.540
Token<|endoftext|>
Token ID50256
Feature activation-1.43e-02

Pos logit contributions
 gloomy+0.056
 End+0.054
 happiest+0.053
lessness+0.050
 valuable+0.050
Neg logit contributions
xc-0.062
rett-0.062
zzy-0.060
uzz-0.060
uit-0.059
TokenOnce
Token ID7454
Feature activation+1.09e-01

Pos logit contributions
 gloomy+0.015
 sky+0.014
 End+0.014
s+0.014
 day+0.014
Neg logit contributions
xc-0.029
Wh-0.028
uzz-0.028
rett-0.028
igi-0.027
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.119
uzz+0.118
xc+0.112
bur+0.110
zzy+0.110
Neg logit contributions
 happiest-0.207
 gloomy-0.207
ience-0.203
 sizes-0.200
 End-0.199
Token a
Token ID257
Feature activation+5.45e-02

Pos logit contributions
xc+0.231
rett+0.231
uit+0.225
bur+0.224
uzz+0.224
Neg logit contributions
 gloomy-0.172
 End-0.149
 much-0.147
 happiest-0.146
 goals-0.142
Token joy
Token ID8716
Feature activation-5.07e-02

Pos logit contributions
uzz+0.059
rett+0.058
 unw+0.057
xc+0.057
zzy+0.056
Neg logit contributions
 End-0.066
 gloomy-0.065
 happiest-0.063
ience-0.062
lessness-0.060
Token and
Token ID290
Feature activation+5.09e-02

Pos logit contributions
 End+0.091
 gloomy+0.087
 happiest+0.085
 goals+0.084
 tallest+0.083
Neg logit contributions
uzz-0.069
rett-0.067
uit-0.064
bur-0.060
zzy-0.059
Token laughter
Token ID20263
Feature activation-5.48e-02

Pos logit contributions
 unw+0.075
uzz+0.074
bur+0.072
rett+0.072
uit+0.070
Neg logit contributions
 gloomy-0.073
ience-0.072
 End-0.071
 happiest-0.069
 These-0.068
Token.
Token ID13
Feature activation-3.64e-02

Pos logit contributions
 End+0.094
ience+0.086
leness+0.085
 insignificant+0.084
 happiest+0.084
Neg logit contributions
uzz-0.074
uit-0.072
 unw-0.072
rett-0.070
zzy-0.067
Token The
Token ID383
Feature activation-2.00e-02

Pos logit contributions
 gloomy+0.061
 happiest+0.060
ience+0.058
 End+0.058
 sizes+0.057
Neg logit contributions
uzz-0.048
rett-0.048
Wh-0.046
zzy-0.046
uit-0.045
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 gloomy+0.015
ience+0.014
 happiest+0.013
lessness+0.013
 sizes+0.013
Neg logit contributions
bur-0.044
Wh-0.044
uzz-0.044
rett-0.043
zzy-0.043
Token.
Token ID13
Feature activation-3.77e-02

Pos logit contributions
 gloomy+0.385
 happiest+0.366
 End+0.358
ience+0.349
 goals+0.343
Neg logit contributions
rett-0.540
uzz-0.535
xc-0.533
uit-0.526
bur-0.523
Token<|endoftext|>
Token ID50256
Feature activation-1.24e-02

Pos logit contributions
 gloomy+0.054
 End+0.051
 happiest+0.051
 valuable+0.048
lessness+0.048
Neg logit contributions
xc-0.061
rett-0.060
uzz-0.059
zzy-0.059
uit-0.058
TokenOnce
Token ID7454
Feature activation+1.10e-01

Pos logit contributions
 sky+0.012
 gloomy+0.011
s+0.011
 day+0.011
 End+0.011
Neg logit contributions
xc-0.027
 snatch-0.025
ck-0.025
Wh-0.025
uzz-0.025
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.125
uzz+0.123
xc+0.119
zzy+0.117
bur+0.117
Neg logit contributions
 gloomy-0.205
 happiest-0.204
ience-0.198
 End-0.197
 sizes-0.196
Token a
Token ID257
Feature activation+5.49e-02

Pos logit contributions
rett+0.232
xc+0.232
uzz+0.226
uit+0.225
bur+0.222
Neg logit contributions
 gloomy-0.175
 End-0.152
 happiest-0.149
 much-0.149
 ordinary-0.142
Token happy
Token ID3772
Feature activation-2.28e-01

Pos logit contributions
ience+0.056
leness+0.054
 problem+0.052
 goals+0.051
 jets+0.051
Neg logit contributions
uzz-0.094
zzy-0.088
 unw-0.087
zip-0.084
Ab-0.084
Token.
Token ID13
Feature activation-4.03e-02

Pos logit contributions
 End+0.381
ience+0.375
 happiest+0.370
 gloomy+0.364
leness+0.360
Neg logit contributions
uzz-0.317
rett-0.302
 Sara-0.285
 unw-0.285
uit-0.279
Token
Token ID198
Feature activation+7.82e-02

Pos logit contributions
 gloomy+0.065
ience+0.063
 problem+0.063
 happiest+0.063
leness+0.060
Neg logit contributions
uzz-0.059
rett-0.054
 Sara-0.053
Wh-0.052
uit-0.051
Token
Token ID198
Feature activation+7.80e-02

Pos logit contributions
xc+0.130
rett+0.127
bur+0.123
uit+0.123
uzz+0.120
Neg logit contributions
 End-0.111
 gloomy-0.110
 valuable-0.103
 happiest-0.102
 educational-0.099
TokenThe
Token ID464
Feature activation-1.60e-02

Pos logit contributions
rett+0.123
xc+0.121
uzz+0.120
bur+0.118
uit+0.118
Neg logit contributions
 gloomy-0.115
 End-0.114
 happiest-0.111
 valuable-0.106
ience-0.105
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 sizes+0.019
lessness+0.018
ables+0.018
leness+0.018
s+0.018
Neg logit contributions
bur-0.027
 unw-0.026
xc-0.026
Wh-0.025
uit-0.024
Token.
Token ID13
Feature activation-4.04e-02

Pos logit contributions
 happiest+0.392
 gloomy+0.384
 End+0.365
ience+0.358
 goals+0.341
Neg logit contributions
zzy-0.539
rett-0.533
uzz-0.531
uit-0.517
Wh-0.511
Token<|endoftext|>
Token ID50256
Feature activation-1.39e-02

Pos logit contributions
 End+0.051
 gloomy+0.051
 happiest+0.048
 These+0.047
lessness+0.047
Neg logit contributions
xc-0.072
zzy-0.070
rett-0.070
uzz-0.068
 unw-0.067
TokenOnce
Token ID7454
Feature activation+1.09e-01

Pos logit contributions
 gloomy+0.015
 sky+0.015
 End+0.014
 day+0.014
s+0.014
Neg logit contributions
xc-0.028
uzz-0.027
rett-0.027
ck-0.026
Wh-0.026
Token upon
Token ID2402
Feature activation+1.25e-01

Pos logit contributions
rett+0.118
uzz+0.117
xc+0.112
zzy+0.110
bur+0.109
Neg logit contributions
 happiest-0.207
 gloomy-0.207
ience-0.203
 End-0.200
 sizes-0.200
Token a
Token ID257
Feature activation+5.35e-02

Pos logit contributions
xc+0.232
rett+0.232
uzz+0.226
uit+0.225
bur+0.224
Neg logit contributions
 gloomy-0.169
 much-0.145
 End-0.145
 happiest-0.142
 ordinary-0.137
Token were
Token ID547
Feature activation-4.73e-02

Pos logit contributions
 gloomy+0.264
 educational+0.259
 goals+0.255
 These+0.255
 problem+0.254
Neg logit contributions
bur-0.332
zzy-0.331
 unw-0.327
uzz-0.314
rett-0.306
Token very
Token ID845
Feature activation-1.90e-01

Pos logit contributions
ience+0.054
leness+0.052
 These+0.047
 sizes+0.046
 goals+0.045
Neg logit contributions
bur-0.090
uzz-0.089
zzy-0.086
 unw-0.085
rett-0.085
Token happy
Token ID3772
Feature activation-2.26e-01

Pos logit contributions
ience+0.232
 jets+0.222
 goals+0.221
 sizes+0.220
leness+0.218
Neg logit contributions
uzz-0.331
zzy-0.323
 unw-0.322
 Sara-0.321
 Tit-0.314
Token.
Token ID13
Feature activation-4.08e-02

Pos logit contributions
ience+0.380
leness+0.367
 happiest+0.365
 End+0.353
 insignificant+0.350
Neg logit contributions
uzz-0.326
rett-0.297
 Sara-0.290
 unw-0.287
onna-0.276
Token The
Token ID383
Feature activation-1.92e-02

Pos logit contributions
 happiest+0.067
 gloomy+0.067
ience+0.066
 problem+0.065
leness+0.064
Neg logit contributions
uzz-0.056
Wh-0.052
rett-0.051
bur-0.050
 Ow-0.050
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
ience+0.024
leness+0.024
 sizes+0.024
lessness+0.023
 goals+0.022
Neg logit contributions
 unw-0.030
rost-0.030
bur-0.030
 x-0.029
 aid-0.028
Token.
Token ID13
Feature activation-4.07e-02

Pos logit contributions
 gloomy+0.344
 happiest+0.328
 End+0.319
ience+0.309
 goals+0.302
Neg logit contributions
rett-0.580
uzz-0.574
xc-0.570
zzy-0.565
uit-0.565
Token<|endoftext|>
Token ID50256
Feature activation-1.12e-02

Pos logit contributions
 gloomy+0.065
 happiest+0.062
 End+0.062
ience+0.059
 goals+0.059
Neg logit contributions
rett-0.058
xc-0.057
uzz-0.057
zzy-0.057
uit-0.056
TokenOnce
Token ID7454
Feature activation+1.11e-01

Pos logit contributions
 gloomy+0.011
 sky+0.011
s+0.011
 End+0.011
 day+0.010
Neg logit contributions
xc-0.024
rett-0.023
Wh-0.022
uzz-0.022
igi-0.022
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.137
uzz+0.136
xc+0.131
zzy+0.130
bur+0.129
Neg logit contributions
 gloomy-0.194
 happiest-0.193
ience-0.189
 End-0.186
 sizes-0.186
Token a
Token ID257
Feature activation+5.55e-02

Pos logit contributions
rett+0.231
xc+0.228
uit+0.223
uzz+0.223
bur+0.218
Neg logit contributions
 gloomy-0.178
 End-0.154
 happiest-0.152
 much-0.148
 goals-0.144
Token.
Token ID13
Feature activation-3.87e-02

Pos logit contributions
 End+0.187
 gloomy+0.176
ience+0.174
 happiest+0.169
 goals+0.169
Neg logit contributions
uit-0.195
 unw-0.192
rett-0.192
uzz-0.188
bur-0.188
Token
Token ID220
Feature activation-7.53e-02

Pos logit contributions
 gloomy+0.056
 happiest+0.054
ience+0.053
 End+0.053
 goals+0.052
Neg logit contributions
uzz-0.061
rett-0.061
uit-0.059
Wh-0.058
zzy-0.058
Token
Token ID198
Feature activation+8.04e-02

Pos logit contributions
 These+0.078
 Playing+0.075
 happiest+0.074
 gloomy+0.072
�+0.071
Neg logit contributions
xc-0.143
zzy-0.137
rett-0.135
 snatch-0.134
ck-0.133
Token
Token ID198
Feature activation+8.02e-02

Pos logit contributions
xc+0.146
rett+0.139
bur+0.136
uzz+0.134
uit+0.134
Neg logit contributions
 End-0.102
 gloomy-0.095
 valuable-0.091
 living-0.090
 happiest-0.089
TokenThe
Token ID464
Feature activation-1.59e-02

Pos logit contributions
rett+0.132
uzz+0.131
xc+0.130
bur+0.127
uit+0.125
Neg logit contributions
 End-0.113
 gloomy-0.107
 happiest-0.104
 valuable-0.102
 educational-0.097
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
ience+0.013
 gloomy+0.012
 sizes+0.012
leness+0.012
 goals+0.012
Neg logit contributions
Wh-0.034
bur-0.033
uit-0.033
xc-0.033
rett-0.033
Token<|endoftext|>
Token ID50256
Feature activation-1.44e-02

Pos logit contributions
 happiest+0.392
 gloomy+0.384
 End+0.365
ience+0.356
 goals+0.343
Neg logit contributions
zzy-0.540
rett-0.537
uzz-0.533
uit-0.519
Wh-0.514
TokenOnce
Token ID7454
Feature activation+1.10e-01

Pos logit contributions
 gloomy+0.017
 sky+0.016
 End+0.016
s+0.015
 days+0.015
Neg logit contributions
xc-0.027
uzz-0.027
rett-0.027
Wh-0.026
igi-0.025
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.119
uzz+0.118
xc+0.114
zzy+0.112
bur+0.111
Neg logit contributions
 gloomy-0.211
 happiest-0.210
ience-0.205
 End-0.203
 sizes-0.203
Token a
Token ID257
Feature activation+5.53e-02

Pos logit contributions
rett+0.232
xc+0.230
uzz+0.226
uit+0.224
bur+0.223
Neg logit contributions
 gloomy-0.176
 happiest-0.151
 End-0.151
 much-0.149
 educational-0.144
Token time
Token ID640
Feature activation+3.61e-02

Pos logit contributions
rett+0.105
xc+0.105
uit+0.104
uzz+0.103
bur+0.102
Neg logit contributions
 gloomy-0.071
 End-0.063
 happiest-0.060
 day-0.058
 valuable-0.057
Token again
Token ID757
Feature activation+1.55e-01

Pos logit contributions
rett+0.074
uzz+0.073
xc+0.073
zzy+0.072
uit+0.072
Neg logit contributions
 gloomy-0.075
 happiest-0.071
 End-0.070
ience-0.070
 sizes-0.068
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
rett+0.250
zzy+0.247
 unw+0.244
uzz+0.241
uit+0.237
Neg logit contributions
ience-0.208
 gloomy-0.208
 End-0.207
 happiest-0.205
 sizes-0.201
Token
Token ID198
Feature activation+8.10e-02

Pos logit contributions
 gloomy+0.051
 happiest+0.050
ience+0.048
 End+0.047
leness+0.047
Neg logit contributions
rett-0.067
uzz-0.067
Wh-0.066
zzy-0.065
xc-0.064
Token
Token ID198
Feature activation+8.06e-02

Pos logit contributions
xc+0.143
bur+0.137
rett+0.136
uzz+0.134
uit+0.132
Neg logit contributions
 End-0.106
 gloomy-0.103
 valuable-0.099
 educational-0.095
 happiest-0.094
TokenThe
Token ID464
Feature activation-1.71e-02

Pos logit contributions
rett+0.116
uzz+0.114
xc+0.113
bur+0.111
uit+0.111
Neg logit contributions
 gloomy-0.128
 happiest-0.124
 End-0.123
ience-0.119
 goals-0.117
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
ience+0.022
 sizes+0.021
leness+0.021
 gloomy+0.020
 happiest+0.020
Neg logit contributions
Jen-0.028
xc-0.028
Wh-0.028
zzy-0.028
rett-0.028
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
 happiest+0.379
 gloomy+0.372
 End+0.353
ience+0.343
 sizes+0.330
Neg logit contributions
zzy-0.551
rett-0.548
uzz-0.544
uit-0.529
Wh-0.527
Token<|endoftext|>
Token ID50256
Feature activation-1.24e-02

Pos logit contributions
 End+0.045
 gloomy+0.044
 These+0.043
lessness+0.041
 much+0.041
Neg logit contributions
xc-0.075
rett-0.073
bur-0.072
ba-0.071
zzy-0.071
TokenOnce
Token ID7454
Feature activation+1.12e-01

Pos logit contributions
 sky+0.012
 gloomy+0.012
 day+0.011
s+0.011
 End+0.011
Neg logit contributions
xc-0.026
uzz-0.025
ck-0.025
rett-0.025
igi-0.025
Token upon
Token ID2402
Feature activation+1.28e-01

Pos logit contributions
rett+0.126
uzz+0.124
xc+0.120
zzy+0.118
uit+0.118
Neg logit contributions
 gloomy-0.210
 happiest-0.208
ience-0.202
 End-0.202
 sizes-0.199
Token a
Token ID257
Feature activation+5.56e-02

Pos logit contributions
xc+0.235
rett+0.235
uzz+0.229
bur+0.228
igi+0.228
Neg logit contributions
 gloomy-0.174
 End-0.150
 much-0.148
 happiest-0.147
 ordinary-0.142
Token.
Token ID13
Feature activation-3.79e-02

Pos logit contributions
uit+0.021
rett+0.020
uzz+0.020
zzy+0.020
bur+0.020
Neg logit contributions
 gloomy-0.015
 End-0.015
 happiest-0.014
ience-0.014
 goals-0.014
Token
Token ID220
Feature activation-7.65e-02

Pos logit contributions
 gloomy+0.058
 happiest+0.055
 End+0.054
ience+0.054
 goals+0.052
Neg logit contributions
uzz-0.057
rett-0.056
Wh-0.055
uit-0.055
xc-0.055
Token
Token ID198
Feature activation+7.95e-02

Pos logit contributions
 gloomy+0.089
 happiest+0.088
 End+0.087
s+0.086
 goals+0.085
Neg logit contributions
xc-0.135
zzy-0.132
rett-0.130
 unw-0.129
bur-0.129
Token
Token ID198
Feature activation+7.94e-02

Pos logit contributions
xc+0.140
rett+0.138
uzz+0.134
 unw+0.131
bur+0.131
Neg logit contributions
 End-0.103
 gloomy-0.100
 valuable-0.096
 happiest-0.096
s-0.094
TokenThe
Token ID464
Feature activation-1.68e-02

Pos logit contributions
rett+0.109
uzz+0.107
xc+0.106
zzy+0.105
uit+0.104
Neg logit contributions
 gloomy-0.132
 happiest-0.128
 End-0.126
ience-0.123
 goals-0.121
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 sizes+0.023
lessness+0.021
s+0.020
ience+0.020
attered+0.020
Neg logit contributions
Wh-0.026
zzy-0.026
 aid-0.026
Jen-0.025
 x-0.024
Token.
Token ID13
Feature activation-3.76e-02

Pos logit contributions
 gloomy+0.356
 happiest+0.349
 End+0.331
ience+0.322
 sizes+0.313
Neg logit contributions
rett-0.568
uzz-0.563
zzy-0.562
uit-0.553
xc-0.552
Token<|endoftext|>
Token ID50256
Feature activation-1.29e-02

Pos logit contributions
 End+0.047
 gloomy+0.046
 happiest+0.046
lessness+0.044
 valuable+0.043
Neg logit contributions
xc-0.066
rett-0.065
zzy-0.064
 unw-0.064
uzz-0.063
TokenOnce
Token ID7454
Feature activation+1.10e-01

Pos logit contributions
 gloomy+0.014
 End+0.014
 sky+0.013
s+0.012
 These+0.012
Neg logit contributions
xc-0.025
uzz-0.025
rett-0.024
Wh-0.024
ck-0.024
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.119
uzz+0.118
xc+0.112
zzy+0.111
bur+0.110
Neg logit contributions
 gloomy-0.211
 happiest-0.210
ience-0.205
 End-0.203
 sizes-0.202
Token a
Token ID257
Feature activation+5.49e-02

Pos logit contributions
xc+0.233
rett+0.231
bur+0.226
uit+0.225
uzz+0.224
Neg logit contributions
 gloomy-0.172
 End-0.150
 much-0.149
 happiest-0.148
 valuable-0.141
Token.
Token ID13
Feature activation-4.05e-02

Pos logit contributions
 End+0.051
 happiest+0.050
ience+0.050
 gloomy+0.048
 educational+0.048
Neg logit contributions
rett-0.056
uit-0.056
 unw-0.055
uzz-0.054
bur-0.053
Token
Token ID220
Feature activation-7.72e-02

Pos logit contributions
 gloomy+0.074
 happiest+0.072
ience+0.071
 problem+0.070
 sizes+0.068
Neg logit contributions
uzz-0.050
rett-0.049
uit-0.046
Wh-0.046
bur-0.045
Token
Token ID198
Feature activation+8.01e-02

Pos logit contributions
 gloomy+0.088
 happiest+0.088
 End+0.086
 These+0.085
 Playing+0.084
Neg logit contributions
xc-0.136
zzy-0.132
rett-0.132
bur-0.131
 unw-0.129
Token
Token ID198
Feature activation+8.00e-02

Pos logit contributions
xc+0.145
rett+0.138
bur+0.138
uit+0.136
uzz+0.134
Neg logit contributions
 End-0.102
 gloomy-0.098
 valuable-0.093
lessness-0.091
 happiest-0.091
TokenThe
Token ID464
Feature activation-1.26e-02

Pos logit contributions
rett+0.130
uzz+0.130
xc+0.128
bur+0.126
uit+0.125
Neg logit contributions
 End-0.113
 gloomy-0.112
 happiest-0.109
 valuable-0.104
ience-0.101
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 sizes+0.013
 gloomy+0.012
ience+0.011
 happiest+0.011
lessness+0.011
Neg logit contributions
Wh-0.024
zzy-0.024
 unw-0.023
uit-0.023
rett-0.023
Token<|endoftext|>
Token ID50256
Feature activation-1.46e-02

Pos logit contributions
 gloomy+0.362
 happiest+0.359
 End+0.340
ience+0.331
 sizes+0.319
Neg logit contributions
rett-0.559
uzz-0.556
zzy-0.555
uit-0.543
xc-0.541
TokenOne
Token ID3198
Feature activation+1.51e-02

Pos logit contributions
 gloomy+0.017
 sky+0.016
 End+0.016
 day+0.015
s+0.015
Neg logit contributions
xc-0.028
uzz-0.028
rett-0.027
Wh-0.027
igi-0.026
Token day
Token ID1110
Feature activation-2.22e-01

Pos logit contributions
zzy+0.033
Jen+0.031
uzz+0.031
bur+0.031
rett+0.031
Neg logit contributions
ience-0.014
leness-0.013
 sizes-0.013
lessness-0.011
 goals-0.011
Token,
Token ID11
Feature activation-8.66e-02

Pos logit contributions
 happiest+0.180
 gloomy+0.177
 End+0.165
 sizes+0.164
ience+0.161
Neg logit contributions
rett-0.487
uzz-0.481
xc-0.478
zzy-0.477
uit-0.473
Token two
Token ID734
Feature activation+4.52e-02

Pos logit contributions
 gloomy+0.134
 End+0.118
 happiest+0.115
 valuable+0.109
 day+0.109
Neg logit contributions
rett-0.142
xc-0.141
bur-0.139
uit-0.137
uzz-0.134
Token decision
Token ID2551
Feature activation-3.94e-02

Pos logit contributions
rett+0.181
uzz+0.179
xc+0.178
zzy+0.177
Wh+0.177
Neg logit contributions
 gloomy-0.106
 happiest-0.101
 End-0.098
ience-0.093
 goals-0.093
Token.
Token ID13
Feature activation-3.76e-02

Pos logit contributions
 End+0.064
 gloomy+0.063
 happiest+0.060
ience+0.060
leness+0.059
Neg logit contributions
rett-0.053
uit-0.053
uzz-0.053
 unw-0.051
bur-0.049
Token
Token ID198
Feature activation+8.02e-02

Pos logit contributions
 gloomy+0.053
 happiest+0.051
 End+0.049
ience+0.048
 goals+0.047
Neg logit contributions
rett-0.061
uzz-0.061
xc-0.060
zzy-0.059
uit-0.059
Token
Token ID198
Feature activation+8.01e-02

Pos logit contributions
xc+0.136
rett+0.132
uit+0.130
bur+0.130
uzz+0.128
Neg logit contributions
 End-0.112
 gloomy-0.109
 valuable-0.104
 happiest-0.101
 goals-0.100
TokenThe
Token ID464
Feature activation-1.54e-02

Pos logit contributions
rett+0.128
uzz+0.127
xc+0.126
bur+0.124
uit+0.123
Neg logit contributions
 End-0.115
 gloomy-0.115
 happiest-0.111
ience-0.108
 valuable-0.107
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 sizes+0.020
leness+0.019
ience+0.019
lessness+0.018
 happiest+0.018
Neg logit contributions
Wh-0.025
bur-0.024
zzy-0.024
Jen-0.023
rett-0.023
Token.
Token ID13
Feature activation-3.75e-02

Pos logit contributions
 gloomy+0.349
 happiest+0.330
 End+0.323
ience+0.314
 goals+0.307
Neg logit contributions
rett-0.576
uzz-0.569
xc-0.568
uit-0.562
bur-0.559
Token<|endoftext|>
Token ID50256
Feature activation-1.31e-02

Pos logit contributions
 End+0.046
 gloomy+0.045
 happiest+0.043
lessness+0.042
 These+0.042
Neg logit contributions
xc-0.068
rett-0.067
zzy-0.066
uzz-0.065
bur-0.064
TokenOnce
Token ID7454
Feature activation+1.11e-01

Pos logit contributions
 gloomy+0.014
 sky+0.014
 End+0.013
s+0.013
 day+0.013
Neg logit contributions
xc-0.026
uzz-0.026
rett-0.025
ck-0.025
Wh-0.025
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.122
uzz+0.120
xc+0.115
zzy+0.113
bur+0.113
Neg logit contributions
 gloomy-0.210
 happiest-0.209
ience-0.204
 End-0.202
 sizes-0.201
Token a
Token ID257
Feature activation+5.43e-02

Pos logit contributions
xc+0.242
rett+0.239
uzz+0.234
igi+0.233
bur+0.233
Neg logit contributions
 gloomy-0.168
 much-0.144
 End-0.142
 happiest-0.138
 ordinary-0.136
Token.
Token ID13
Feature activation-3.98e-02

Pos logit contributions
rett+0.118
uzz+0.116
zzy+0.115
xc+0.114
Wh+0.114
Neg logit contributions
 gloomy-0.074
 happiest-0.066
 sizes-0.062
lessness-0.062
 End-0.061
Token
Token ID220
Feature activation-7.79e-02

Pos logit contributions
 gloomy+0.066
 happiest+0.065
ience+0.064
 End+0.062
 goals+0.062
Neg logit contributions
rett-0.053
uzz-0.053
Wh-0.052
uit-0.052
zzy-0.051
Token
Token ID198
Feature activation+7.93e-02

Pos logit contributions
 gloomy+0.091
 happiest+0.087
lessness+0.086
 End+0.086
 These+0.086
Neg logit contributions
xc-0.139
rett-0.136
bur-0.134
zzy-0.133
uzz-0.131
Token
Token ID198
Feature activation+7.86e-02

Pos logit contributions
xc+0.140
rett+0.138
uzz+0.134
bur+0.134
uit+0.131
Neg logit contributions
 gloomy-0.102
 End-0.100
 valuable-0.094
 educational-0.093
 happiest-0.093
TokenThe
Token ID464
Feature activation-1.58e-02

Pos logit contributions
rett+0.102
uzz+0.100
xc+0.099
bur+0.097
uit+0.095
Neg logit contributions
 gloomy-0.138
 End-0.135
 happiest-0.133
 valuable-0.127
ience-0.127
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 sizes+0.015
leness+0.015
ience+0.014
 goals+0.014
s+0.014
Neg logit contributions
Wh-0.030
Jen-0.030
bur-0.029
xc-0.029
 unw-0.029
Token.
Token ID13
Feature activation-3.98e-02

Pos logit contributions
 happiest+0.412
 gloomy+0.405
 End+0.384
ience+0.380
 sizes+0.368
Neg logit contributions
zzy-0.516
rett-0.511
uzz-0.506
uit-0.499
Wh-0.491
Token<|endoftext|>
Token ID50256
Feature activation-1.37e-02

Pos logit contributions
 gloomy+0.055
 End+0.055
 happiest+0.053
lessness+0.051
 valuable+0.050
Neg logit contributions
rett-0.065
xc-0.065
zzy-0.064
uzz-0.063
bur-0.062
TokenOnce
Token ID7454
Feature activation+1.11e-01

Pos logit contributions
 gloomy+0.015
 End+0.015
 sky+0.014
 These+0.013
s+0.013
Neg logit contributions
xc-0.026
uzz-0.026
rett-0.026
Wh-0.025
bur-0.025
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.113
uzz+0.112
xc+0.105
zzy+0.103
bur+0.103
Neg logit contributions
 happiest-0.219
 gloomy-0.217
ience-0.215
 sizes-0.214
 educational-0.211
Token a
Token ID257
Feature activation+5.34e-02

Pos logit contributions
xc+0.234
rett+0.233
bur+0.228
uzz+0.227
uit+0.226
Neg logit contributions
 gloomy-0.174
 End-0.149
 much-0.149
 happiest-0.148
 educational-0.142
Token happily
Token ID18177
Feature activation+1.46e-01

Pos logit contributions
 gloomy+0.209
 happiest+0.203
 End+0.200
 goals+0.195
ience+0.194
Neg logit contributions
rett-0.129
xc-0.125
uzz-0.125
uit-0.123
Wh-0.123
Token ever
Token ID1683
Feature activation+4.59e-02

Pos logit contributions
uzz+0.120
rett+0.110
zzy+0.106
 unw+0.105
bur+0.104
Neg logit contributions
 gloomy-0.311
 sizes-0.307
 happiest-0.303
 End-0.302
 insignificant-0.301
Token after
Token ID706
Feature activation-6.44e-02

Pos logit contributions
rett+0.068
zzy+0.067
 unw+0.066
ke+0.065
bur+0.064
Neg logit contributions
 goals-0.063
 happiest-0.063
 jets-0.063
ience-0.062
 tallest-0.062
Token.
Token ID13
Feature activation-4.15e-02

Pos logit contributions
 happiest+0.098
 gloomy+0.092
ience+0.089
 End+0.088
 sizes+0.088
Neg logit contributions
uzz-0.098
zzy-0.098
rett-0.096
 unw-0.094
uit-0.092
Token The
Token ID383
Feature activation-1.96e-02

Pos logit contributions
 gloomy+0.074
 happiest+0.071
ience+0.070
 End+0.068
 sizes+0.068
Neg logit contributions
uzz-0.053
rett-0.052
zzy-0.050
Wh-0.050
uit-0.049
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 sizes+0.025
lessness+0.022
s+0.022
 educational+0.021
 These+0.021
Neg logit contributions
Wh-0.032
zzy-0.031
bur-0.030
rett-0.030
 stab-0.029
Token.
Token ID13
Feature activation-4.17e-02

Pos logit contributions
 gloomy+0.377
 happiest+0.375
 End+0.352
ience+0.342
 sizes+0.332
Neg logit contributions
zzy-0.545
rett-0.545
uzz-0.541
uit-0.528
xc-0.525
Token<|endoftext|>
Token ID50256
Feature activation-1.22e-02

Pos logit contributions
 gloomy+0.061
 End+0.059
 happiest+0.059
lessness+0.056
 goals+0.055
Neg logit contributions
xc-0.065
rett-0.064
zzy-0.063
uzz-0.063
uit-0.062
TokenOnce
Token ID7454
Feature activation+1.11e-01

Pos logit contributions
 gloomy+0.014
 End+0.013
 sky+0.013
s+0.012
 goals+0.012
Neg logit contributions
xc-0.023
Wh-0.023
uzz-0.023
rett-0.023
bur-0.022
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.119
uzz+0.118
xc+0.112
zzy+0.111
bur+0.109
Neg logit contributions
 gloomy-0.213
 happiest-0.213
ience-0.209
 End-0.206
 sizes-0.206
Token a
Token ID257
Feature activation+5.52e-02

Pos logit contributions
xc+0.236
rett+0.235
bur+0.229
uit+0.229
uzz+0.228
Neg logit contributions
 gloomy-0.170
 End-0.147
 much-0.147
 happiest-0.146
 educational-0.139
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
 happiest+0.137
ience+0.136
 gloomy+0.134
 End+0.133
lessness+0.128
Neg logit contributions
uzz-0.151
 unw-0.148
uit-0.148
rett-0.147
zzy-0.146
Token
Token ID220
Feature activation-7.69e-02

Pos logit contributions
 gloomy+0.053
 happiest+0.050
 End+0.048
ience+0.048
 sizes+0.047
Neg logit contributions
rett-0.066
uzz-0.066
zzy-0.064
uit-0.064
xc-0.064
Token
Token ID198
Feature activation+7.83e-02

Pos logit contributions
 gloomy+0.100
 happiest+0.097
 End+0.095
 goals+0.094
 These+0.091
Neg logit contributions
xc-0.129
rett-0.125
bur-0.123
uit-0.122
zzy-0.122
Token
Token ID198
Feature activation+7.79e-02

Pos logit contributions
xc+0.147
bur+0.139
rett+0.139
uit+0.138
 unw+0.133
Neg logit contributions
 End-0.100
 gloomy-0.090
 valuable-0.088
 happiest-0.086
lessness-0.084
TokenThe
Token ID464
Feature activation-1.66e-02

Pos logit contributions
rett+0.124
xc+0.123
uzz+0.122
bur+0.120
uit+0.120
Neg logit contributions
 End-0.113
 gloomy-0.111
 happiest-0.109
ience-0.103
 valuable-0.103
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
s+0.017
ience+0.017
lessness+0.016
 sizes+0.016
ables+0.016
Neg logit contributions
Wh-0.030
 unw-0.029
uzz-0.028
Flo-0.028
bur-0.028
Token.
Token ID13
Feature activation-3.93e-02

Pos logit contributions
 gloomy+0.352
 happiest+0.338
 End+0.327
ience+0.318
 goals+0.309
Neg logit contributions
rett-0.572
uzz-0.566
xc-0.561
zzy-0.559
uit-0.557
Token<|endoftext|>
Token ID50256
Feature activation-1.52e-02

Pos logit contributions
 End+0.048
 gloomy+0.046
 happiest+0.045
 These+0.044
lessness+0.044
Neg logit contributions
xc-0.072
rett-0.070
zzy-0.069
 unw-0.068
bur-0.067
TokenOnce
Token ID7454
Feature activation+1.11e-01

Pos logit contributions
 gloomy+0.017
 End+0.017
 sky+0.016
s+0.015
 happiest+0.015
Neg logit contributions
xc-0.029
uzz-0.028
rett-0.028
Wh-0.028
bur-0.028
Token there
Token ID612
Feature activation+6.66e-02

Pos logit contributions
rett+0.118
uzz+0.117
xc+0.112
zzy+0.110
bur+0.108
Neg logit contributions
 gloomy-0.213
 happiest-0.212
ience-0.207
 End-0.206
 sizes-0.205
Token was
Token ID373
Feature activation+4.47e-02

Pos logit contributions
rett+0.120
Wh+0.119
xc+0.118
uzz+0.118
zzy+0.118
Neg logit contributions
 gloomy-0.082
 happiest-0.079
 End-0.077
 sizes-0.070
 goals-0.070
Token.
Token ID13
Feature activation-3.91e-02

Pos logit contributions
zzy+0.234
rett+0.226
bur+0.222
 unw+0.221
uzz+0.218
Neg logit contributions
 End-0.222
 happiest-0.218
 gloomy-0.213
 sizes-0.208
 These-0.207
Token
Token ID220
Feature activation-7.70e-02

Pos logit contributions
 gloomy+0.055
 happiest+0.053
 End+0.053
 goals+0.050
 valuable+0.050
Neg logit contributions
xc-0.063
rett-0.062
zzy-0.062
bur-0.061
uit-0.061
Token
Token ID198
Feature activation+7.88e-02

Pos logit contributions
 Playing+0.080
 These+0.080
s+0.079
 goals+0.078
 gloomy+0.078
Neg logit contributions
xc-0.145
zzy-0.142
bur-0.138
 unw-0.138
rett-0.138
Token
Token ID198
Feature activation+7.87e-02

Pos logit contributions
xc+0.149
rett+0.142
bur+0.139
uit+0.136
uzz+0.134
Neg logit contributions
 End-0.097
 valuable-0.091
 gloomy-0.088
 proud-0.087
s-0.087
TokenThe
Token ID464
Feature activation-1.84e-02

Pos logit contributions
rett+0.130
uzz+0.128
xc+0.127
bur+0.124
uit+0.123
Neg logit contributions
 End-0.108
 gloomy-0.107
 happiest-0.105
 valuable-0.102
ience-0.102
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 gloomy+0.019
ience+0.019
 sizes+0.018
 happiest+0.018
lessness+0.018
Neg logit contributions
uzz-0.035
Wh-0.034
xc-0.034
rett-0.034
zzy-0.034
Token.
Token ID13
Feature activation-3.98e-02

Pos logit contributions
 gloomy+0.357
 happiest+0.351
 End+0.333
ience+0.323
 goals+0.314
Neg logit contributions
rett-0.565
uzz-0.561
zzy-0.560
xc-0.549
uit-0.548
Token<|endoftext|>
Token ID50256
Feature activation-1.46e-02

Pos logit contributions
 End+0.042
 gloomy+0.040
 These+0.040
lessness+0.039
 happiest+0.039
Neg logit contributions
xc-0.080
rett-0.077
zzy-0.076
 unw-0.075
 snatch-0.074
TokenJimmy
Token ID40335
Feature activation+1.90e-01

Pos logit contributions
 gloomy+0.016
 sky+0.015
 End+0.015
s+0.015
 day+0.014
Neg logit contributions
xc-0.029
uzz-0.028
rett-0.028
Wh-0.028
ck-0.028
Token was
Token ID373
Feature activation+4.29e-02

Pos logit contributions
rett+0.304
uzz+0.303
zzy+0.303
bur+0.301
xc+0.298
Neg logit contributions
 gloomy-0.262
 happiest-0.254
 End-0.247
 goals-0.241
 sizes-0.238
Token walking
Token ID6155
Feature activation-4.07e-02

Pos logit contributions
rett+0.092
Wh+0.090
xc+0.089
uit+0.089
uzz+0.089
Neg logit contributions
 gloomy-0.044
 happiest-0.039
 End-0.038
 bored-0.033
 goals-0.033
Token.
Token ID13
Feature activation-4.21e-02

Pos logit contributions
uzz+0.233
uit+0.233
rett+0.224
bur+0.219
 unw+0.218
Neg logit contributions
 gloomy-0.208
 End-0.198
 happiest-0.187
ience-0.182
 educational-0.181
Token
Token ID220
Feature activation-7.77e-02

Pos logit contributions
 gloomy+0.053
 happiest+0.052
 End+0.051
 goals+0.049
ience+0.048
Neg logit contributions
xc-0.074
rett-0.073
zzy-0.072
uzz-0.071
bur-0.071
Token
Token ID198
Feature activation+7.85e-02

Pos logit contributions
 These+0.085
 happiest+0.085
s+0.085
 goals+0.084
 gloomy+0.083
Neg logit contributions
xc-0.140
bur-0.134
rett-0.133
zzy-0.133
 unw-0.132
Token
Token ID198
Feature activation+7.82e-02

Pos logit contributions
xc+0.141
bur+0.134
rett+0.133
uit+0.132
uzz+0.132
Neg logit contributions
 End-0.101
 gloomy-0.095
 happiest-0.093
 valuable-0.092
 goals-0.092
TokenThe
Token ID464
Feature activation-1.59e-02

Pos logit contributions
uzz+0.122
rett+0.121
xc+0.120
bur+0.118
uit+0.117
Neg logit contributions
 End-0.113
 gloomy-0.113
 happiest-0.112
ience-0.108
 goals-0.106
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
lessness+0.018
 sizes+0.017
 gloomy+0.017
leness+0.017
 educational+0.017
Neg logit contributions
Wh-0.029
bur-0.028
uit-0.028
Flo-0.027
zzy-0.027
Token.
Token ID13
Feature activation-4.16e-02

Pos logit contributions
 happiest+0.373
 gloomy+0.372
 End+0.348
ience+0.338
 sizes+0.326
Neg logit contributions
zzy-0.552
rett-0.551
uzz-0.546
uit-0.532
Wh-0.527
Token<|endoftext|>
Token ID50256
Feature activation-1.30e-02

Pos logit contributions
 End+0.043
 These+0.041
 happiest+0.039
 gloomy+0.038
s+0.038
Neg logit contributions
xc-0.085
zzy-0.081
rett-0.079
 unw-0.079
 snatch-0.079
TokenOnce
Token ID7454
Feature activation+1.12e-01

Pos logit contributions
 gloomy+0.016
 End+0.015
 sky+0.014
 happiest+0.014
s+0.014
Neg logit contributions
xc-0.024
uzz-0.024
rett-0.023
Wh-0.023
bur-0.023
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.117
uzz+0.115
xc+0.110
zzy+0.108
bur+0.107
Neg logit contributions
 happiest-0.216
 gloomy-0.216
ience-0.212
 sizes-0.210
 End-0.209
Token a
Token ID257
Feature activation+5.55e-02

Pos logit contributions
xc+0.229
rett+0.228
uzz+0.223
bur+0.222
uit+0.221
Neg logit contributions
 gloomy-0.174
 End-0.152
 happiest-0.151
 much-0.149
 educational-0.143
Token
Token ID220
Feature activation-7.62e-02

Pos logit contributions
 happiest+0.066
ience+0.066
 gloomy+0.065
 sizes+0.064
leness+0.062
Neg logit contributions
rett-0.050
uzz-0.050
zzy-0.047
 unw-0.047
Flo-0.046
Token
Token ID198
Feature activation+8.29e-02

Pos logit contributions
 gloomy+0.088
s+0.084
 End+0.084
lessness+0.082
 happiest+0.081
Neg logit contributions
xc-0.141
rett-0.135
zzy-0.134
bur-0.134
 unw-0.131
Token
Token ID220
Feature activation-7.62e-02

Pos logit contributions
xc+0.151
uit+0.143
rett+0.142
bur+0.142
uzz+0.141
Neg logit contributions
 gloomy-0.103
 End-0.102
s-0.097
 valuable-0.094
 happiest-0.092
Token
Token ID198
Feature activation+8.24e-02

Pos logit contributions
 gloomy+0.087
 These+0.081
 Playing+0.081
 End+0.081
s+0.080
Neg logit contributions
xc-0.143
bur-0.137
rett-0.136
uit-0.132
zzy-0.132
TokenThe
Token ID464
Feature activation-1.35e-02

Pos logit contributions
rett+0.106
zzy+0.104
uzz+0.104
xc+0.103
Wh+0.103
Neg logit contributions
 gloomy-0.143
 happiest-0.139
 End-0.135
ience-0.134
 goals-0.132
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 gloomy+0.017
ience+0.017
 happiest+0.017
 sizes+0.017
leness+0.016
Neg logit contributions
rett-0.022
bur-0.022
zzy-0.022
uit-0.022
xc-0.022
Token.
Token ID13
Feature activation-3.88e-02

Pos logit contributions
 gloomy+0.368
 happiest+0.367
 End+0.349
ience+0.339
 goals+0.328
Neg logit contributions
rett-0.549
zzy-0.546
uzz-0.545
uit-0.534
xc-0.533
Token<|endoftext|>
Token ID50256
Feature activation-1.22e-02

Pos logit contributions
 gloomy+0.059
 happiest+0.056
 End+0.055
ience+0.053
 goals+0.052
Neg logit contributions
rett-0.059
uzz-0.058
xc-0.058
zzy-0.057
uit-0.057
TokenOnce
Token ID7454
Feature activation+1.11e-01

Pos logit contributions
 sky+0.012
 gloomy+0.012
s+0.011
 day+0.011
 End+0.011
Neg logit contributions
xc-0.027
rett-0.025
uzz-0.025
ck-0.025
 snatch-0.025
Token upon
Token ID2402
Feature activation+1.28e-01

Pos logit contributions
rett+0.130
uzz+0.128
xc+0.126
zzy+0.125
uit+0.124
Neg logit contributions
 gloomy-0.203
 happiest-0.198
 End-0.194
ience-0.191
 goals-0.188
Token a
Token ID257
Feature activation+5.62e-02

Pos logit contributions
rett+0.234
xc+0.232
uzz+0.227
uit+0.226
bur+0.223
Neg logit contributions
 gloomy-0.177
 End-0.152
 happiest-0.151
 much-0.148
 ordinary-0.144
Token joy
Token ID8716
Feature activation-4.88e-02

Pos logit contributions
 unw+0.084
uzz+0.077
rett+0.076
uit+0.076
zzy+0.076
Neg logit contributions
ience-0.063
 End-0.063
 These-0.062
 gloomy-0.061
 goals-0.060
Token!
Token ID0
Feature activation+2.89e-03

Pos logit contributions
 End+0.073
 goals+0.069
ience+0.069
 educational+0.068
 happiest+0.067
Neg logit contributions
rett-0.079
uit-0.079
uzz-0.077
 unw-0.073
zzy-0.073
Token
Token ID198
Feature activation+8.05e-02

Pos logit contributions
rett+0.005
uzz+0.005
xc+0.004
zzy+0.004
uit+0.004
Neg logit contributions
 gloomy-0.004
 happiest-0.004
 End-0.004
ience-0.004
 goals-0.004
Token
Token ID198
Feature activation+8.02e-02

Pos logit contributions
xc+0.137
rett+0.132
bur+0.131
uit+0.130
uzz+0.127
Neg logit contributions
 End-0.111
 gloomy-0.108
 valuable-0.102
 happiest-0.100
lessness-0.097
TokenThe
Token ID464
Feature activation-1.60e-02

Pos logit contributions
rett+0.130
uzz+0.129
xc+0.128
bur+0.127
uit+0.125
Neg logit contributions
 End-0.113
 gloomy-0.113
 happiest-0.109
 valuable-0.105
ience-0.105
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 gloomy+0.019
 happiest+0.018
ience+0.017
 sizes+0.017
 goals+0.017
Neg logit contributions
rett-0.029
Wh-0.028
bur-0.028
uzz-0.028
uit-0.028
Token!
Token ID0
Feature activation+2.81e-03

Pos logit contributions
 gloomy+0.359
 happiest+0.356
 End+0.338
ience+0.329
 goals+0.318
Neg logit contributions
rett-0.562
zzy-0.557
uzz-0.557
uit-0.545
xc-0.543
Token<|endoftext|>
Token ID50256
Feature activation-1.16e-02

Pos logit contributions
xc+0.006
zzy+0.006
 snatch+0.006
rett+0.005
ck+0.005
Neg logit contributions
 End-0.003
 gloomy-0.003
lessness-0.003
 These-0.003
 sky-0.003
TokenOnce
Token ID7454
Feature activation+1.11e-01

Pos logit contributions
 sky+0.012
 gloomy+0.012
 End+0.011
s+0.011
 day+0.011
Neg logit contributions
xc-0.024
uzz-0.023
ck-0.023
rett-0.022
Wh-0.022
Token upon
Token ID2402
Feature activation+1.28e-01

Pos logit contributions
rett+0.124
uzz+0.122
xc+0.119
zzy+0.117
uit+0.117
Neg logit contributions
 gloomy-0.208
 happiest-0.205
 End-0.199
ience-0.199
 sizes-0.196
Token a
Token ID257
Feature activation+5.53e-02

Pos logit contributions
xc+0.239
rett+0.237
uzz+0.233
igi+0.232
bur+0.232
Neg logit contributions
 gloomy-0.171
 much-0.146
 End-0.145
 happiest-0.143
 ordinary-0.139
Token game
Token ID983
Feature activation-8.78e-02

Pos logit contributions
ience+0.105
leness+0.104
lessness+0.101
 gloomy+0.097
 fortunate+0.096
Neg logit contributions
rett-0.123
xc-0.119
uit-0.118
Wh-0.118
 unw-0.117
Token.
Token ID13
Feature activation-4.17e-02

Pos logit contributions
 gloomy+0.126
 End+0.124
ience+0.124
leness+0.122
 happiest+0.120
Neg logit contributions
uit-0.136
uzz-0.133
rett-0.133
 unw-0.130
zzy-0.128
Token
Token ID198
Feature activation+7.92e-02

Pos logit contributions
 gloomy+0.063
 happiest+0.061
 End+0.059
ience+0.058
 goals+0.056
Neg logit contributions
rett-0.064
uzz-0.064
xc-0.062
zzy-0.062
uit-0.062
Token
Token ID198
Feature activation+7.91e-02

Pos logit contributions
xc+0.156
uit+0.145
rett+0.145
bur+0.144
 unw+0.140
Neg logit contributions
 End-0.095
 gloomy-0.091
 valuable-0.087
s-0.085
 day-0.085
TokenThe
Token ID464
Feature activation-1.71e-02

Pos logit contributions
rett+0.120
xc+0.118
uit+0.115
bur+0.115
uzz+0.114
Neg logit contributions
 End-0.126
 gloomy-0.121
 valuable-0.115
 happiest-0.114
 educational-0.111
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
ience+0.024
 sizes+0.024
lessness+0.024
leness+0.024
 gloomy+0.023
Neg logit contributions
Wh-0.025
rett-0.024
xc-0.024
Jen-0.024
uzz-0.024
Token.
Token ID13
Feature activation-4.13e-02

Pos logit contributions
 gloomy+0.339
 happiest+0.326
 End+0.315
ience+0.306
 goals+0.297
Neg logit contributions
rett-0.582
uzz-0.577
xc-0.571
zzy-0.570
uit-0.567
Token<|endoftext|>
Token ID50256
Feature activation-1.66e-02

Pos logit contributions
 End+0.050
 gloomy+0.049
 These+0.049
lessness+0.046
 happiest+0.046
Neg logit contributions
xc-0.076
rett-0.074
zzy-0.073
 unw-0.072
zip-0.071
TokenTodd
Token ID42817
Feature activation+1.13e-01

Pos logit contributions
 gloomy+0.017
 sky+0.017
 End+0.016
 day+0.016
s+0.016
Neg logit contributions
xc-0.035
rett-0.033
Wh-0.033
uzz-0.033
ck-0.032
Token was
Token ID373
Feature activation+4.26e-02

Pos logit contributions
bur+0.176
zzy+0.174
rett+0.172
uzz+0.172
xc+0.171
Neg logit contributions
 gloomy-0.160
 happiest-0.157
 End-0.150
 goals-0.148
 educational-0.148
Token so
Token ID523
Feature activation-4.98e-02

Pos logit contributions
rett+0.087
Wh+0.086
uit+0.085
xc+0.085
uzz+0.085
Neg logit contributions
 gloomy-0.048
 happiest-0.043
 End-0.042
 goals-0.037
 bored-0.037
Token.
Token ID13
Feature activation-4.12e-02

Pos logit contributions
zzy+0.249
rett+0.248
bur+0.244
uit+0.242
uzz+0.241
Neg logit contributions
 gloomy-0.201
 End-0.200
 happiest-0.199
ience-0.192
lessness-0.188
Token
Token ID220
Feature activation-7.72e-02

Pos logit contributions
 gloomy+0.050
 happiest+0.048
 End+0.047
ience+0.045
 sizes+0.044
Neg logit contributions
rett-0.075
uzz-0.075
xc-0.073
zzy-0.073
uit-0.073
Token
Token ID198
Feature activation+7.96e-02

Pos logit contributions
 Playing+0.073
 These+0.072
s+0.072
 goals+0.071
 End+0.070
Neg logit contributions
xc-0.156
zzy-0.150
bur-0.147
rett-0.146
igi-0.145
Token
Token ID198
Feature activation+7.95e-02

Pos logit contributions
xc+0.178
bur+0.164
icle+0.159
uzz+0.159
rett+0.158
Neg logit contributions
s-0.073
 End-0.072
 valuable-0.069
 much-0.068
 proud-0.068
TokenThe
Token ID464
Feature activation-1.36e-02

Pos logit contributions
uzz+0.134
rett+0.133
xc+0.131
bur+0.130
uit+0.127
Neg logit contributions
 End-0.108
 gloomy-0.105
 happiest-0.104
 valuable-0.102
ience-0.100
Token End
Token ID5268
Feature activation-3.06e-01

Pos logit contributions
 sizes+0.018
lessness+0.018
ience+0.017
leness+0.017
 These+0.017
Neg logit contributions
zzy-0.021
bur-0.021
Wh-0.020
Jen-0.020
rost-0.020
Token.
Token ID13
Feature activation-3.94e-02

Pos logit contributions
 gloomy+0.356
 happiest+0.349
 End+0.333
ience+0.323
 sizes+0.312
Neg logit contributions
rett-0.566
uzz-0.561
zzy-0.559
xc-0.550
uit-0.549
Token<|endoftext|>
Token ID50256
Feature activation-1.16e-02

Pos logit contributions
 End+0.043
 gloomy+0.041
lessness+0.041
 These+0.040
 happiest+0.039
Neg logit contributions
xc-0.076
zzy-0.075
rett-0.074
ck-0.073
uzz-0.073
TokenOnce
Token ID7454
Feature activation+1.12e-01

Pos logit contributions
 sky+0.011
 gloomy+0.011
s+0.011
 End+0.011
 day+0.011
Neg logit contributions
xc-0.025
uzz-0.023
ck-0.023
rett-0.023
Wh-0.023
Token upon
Token ID2402
Feature activation+1.27e-01

Pos logit contributions
rett+0.127
uzz+0.125
xc+0.122
zzy+0.120
uit+0.119
Neg logit contributions
 gloomy-0.210
 happiest-0.207
 End-0.202
ience-0.201
 sizes-0.198
Token a
Token ID257
Feature activation+5.60e-02

Pos logit contributions
xc+0.233
rett+0.232
uzz+0.227
uit+0.224
bur+0.223
Neg logit contributions
 gloomy-0.173
 End-0.150
 much-0.148
 happiest-0.146
 ordinary-0.142
Token and
Token ID290
Feature activation+5.75e-02

Pos logit contributions
 End+0.042
 happiest+0.040
ience+0.039
leness+0.039
 gloomy+0.039
Neg logit contributions
uzz-0.041
zzy-0.041
rett-0.040
uit-0.040
 unw-0.038
Token cl
Token ID537
Feature activation+3.44e-03

Pos logit contributions
 unw+0.071
uzz+0.070
zip+0.068
rett+0.068
Wh+0.068
Neg logit contributions
 End-0.094
 happiest-0.091
 These-0.090
 problem-0.090
 gloomy-0.089
Tokenapped
Token ID6320
Feature activation+6.53e-02

Pos logit contributions
uzz+0.005
zip+0.004
rett+0.004
xc+0.004
zzy+0.004
Neg logit contributions
 gloomy-0.006
 problem-0.006
 happiest-0.006
 valuable-0.006
 End-0.006
Token while
Token ID981
Feature activation-3.15e-02

Pos logit contributions
uzz+0.101
rett+0.094
uit+0.092
 unw+0.090
zzy+0.090
Neg logit contributions
ience-0.096
 happiest-0.096
 End-0.095
 gloomy-0.091
leness-0.090
Token P
Token ID350
Feature activation+1.98e-01

Pos logit contributions
 happiest+0.042
 gloomy+0.042
ience+0.041
 End+0.041
 valuable+0.039
Neg logit contributions
uzz-0.051
rett-0.051
uit-0.050
xc-0.049
Wh-0.049
Tokenipp
Token ID3974
Feature activation+2.19e-01

Pos logit contributions
uzz+0.276
Wh+0.275
zzy+0.272
xc+0.271
rett+0.271
Neg logit contributions
 gloomy-0.300
 happiest-0.298
 goals-0.294
ience-0.294
 End-0.287
Tokenin
Token ID259
Feature activation+5.93e-02

Pos logit contributions
rett+0.292
uzz+0.287
xc+0.285
uit+0.283
Wh+0.282
Neg logit contributions
 gloomy-0.369
 happiest-0.359
 End-0.352
ience-0.347
 goals-0.339
Token sang
Token ID25889
Feature activation-2.29e-02

Pos logit contributions
rett+0.087
zzy+0.087
uzz+0.085
 unw+0.084
Wh+0.083
Neg logit contributions
 gloomy-0.091
 happiest-0.087
 End-0.085
lessness-0.084
 educational-0.083
Token.
Token ID13
Feature activation-3.31e-02

Pos logit contributions
 End+0.033
ience+0.033
 happiest+0.032
 gloomy+0.031
leness+0.031
Neg logit contributions
uit-0.035
uzz-0.035
rett-0.035
zzy-0.034
Wh-0.033
Token
Token ID198
Feature activation+9.18e-02

Pos logit contributions
 gloomy+0.044
 happiest+0.042
 End+0.042
 goals+0.040
ience+0.040
Neg logit contributions
rett-0.056
xc-0.056
zzy-0.055
uzz-0.055
bur-0.054
Token
Token ID198
Feature activation+9.11e-02

Pos logit contributions
xc+0.186
bur+0.174
uit+0.173
 unw+0.172
rett+0.168
Neg logit contributions
 End-0.104
 gloomy-0.096
s-0.095
 valuable-0.092
 much-0.090
Token her
Token ID607
Feature activation+1.48e-02

Pos logit contributions
rett+0.037
uzz+0.037
uit+0.036
xc+0.036
bur+0.035
Neg logit contributions
 gloomy-0.046
 happiest-0.046
 End-0.045
ience-0.044
 educational-0.042
Token teeth
Token ID9941
Feature activation+1.00e-01

Pos logit contributions
rett+0.022
xc+0.022
uzz+0.022
 unw+0.021
zzy+0.021
Neg logit contributions
 gloomy-0.022
 happiest-0.021
 End-0.021
ience-0.021
lessness-0.020
Token.
Token ID13
Feature activation-3.90e-02

Pos logit contributions
rett+0.162
uit+0.162
 unw+0.160
zzy+0.158
uzz+0.158
Neg logit contributions
 End-0.140
 happiest-0.137
ience-0.132
 gloomy-0.132
leness-0.126
Token
Token ID198
Feature activation+8.45e-02

Pos logit contributions
 gloomy+0.044
 End+0.042
 happiest+0.042
 goals+0.040
ience+0.040
Neg logit contributions
xc-0.073
rett-0.072
bur-0.071
zzy-0.071
uit-0.070
Token
Token ID198
Feature activation+8.46e-02

Pos logit contributions
xc+0.173
bur+0.165
uit+0.162
rett+0.160
 unw+0.159
Neg logit contributions
 End-0.093
 gloomy-0.083
 much-0.082
s-0.080
 valuable-0.077
TokenL
Token ID43
Feature activation+2.34e-01

Pos logit contributions
xc+0.126
bur+0.126
rett+0.126
uit+0.123
uzz+0.120
Neg logit contributions
 End-0.131
 gloomy-0.127
 happiest-0.117
 goals-0.117
ience-0.116
Tokenily
Token ID813
Feature activation+1.92e-01

Pos logit contributions
bur+0.355
xc+0.343
Wh+0.341
zip+0.334
 snatch+0.327
Neg logit contributions
 End-0.380
lessness-0.359
 gloomy-0.333
 days-0.329
ience-0.327
Token tried
Token ID3088
Feature activation+4.78e-02

Pos logit contributions
rett+0.310
zzy+0.304
uzz+0.304
xc+0.304
 unw+0.303
Neg logit contributions
 gloomy-0.272
 happiest-0.259
 End-0.252
lessness-0.245
 educational-0.241
Token to
Token ID284
Feature activation+1.26e-01

Pos logit contributions
xc+0.084
zzy+0.083
Wh+0.082
bur+0.081
uzz+0.081
Neg logit contributions
 gloomy-0.064
 happiest-0.058
 valuable-0.056
 End-0.056
 problem-0.055
Token be
Token ID307
Feature activation+4.34e-02

Pos logit contributions
xc+0.234
uzz+0.233
igi+0.229
rett+0.216
isk+0.215
Neg logit contributions
 gloomy-0.149
 happiest-0.146
 end-0.137
 proud-0.135
 that-0.135
Token independent
Token ID4795
Feature activation-1.05e-01

Pos logit contributions
rett+0.076
zzy+0.076
uzz+0.076
uit+0.075
xc+0.075
Neg logit contributions
 gloomy-0.053
 happiest-0.051
 End-0.051
ience-0.050
 goals-0.048
Token and
Token ID290
Feature activation+4.93e-02

Pos logit contributions
uit+0.063
uzz+0.062
 unw+0.061
bur+0.060
rett+0.059
Neg logit contributions
 happiest-0.068
 gloomy-0.068
 End-0.067
leness-0.067
 educational-0.066
Token have
Token ID423
Feature activation-3.87e-02

Pos logit contributions
 unw+0.058
Wh+0.057
uzz+0.057
bur+0.057
uit+0.054
Neg logit contributions
ience-0.087
 gloomy-0.085
 happiest-0.084
lessness-0.084
 End-0.083
Token fun
Token ID1257
Feature activation-1.75e-01

Pos logit contributions
ience+0.033
 gloomy+0.033
 End+0.032
 happiest+0.032
lessness+0.032
Neg logit contributions
 unw-0.080
Wh-0.080
uzz-0.079
bur-0.079
uit-0.078
Token.
Token ID13
Feature activation-3.48e-02

Pos logit contributions
 happiest+0.267
ience+0.253
 gloomy+0.252
 insignificant+0.242
 End+0.241
Neg logit contributions
uzz-0.264
 unw-0.250
uit-0.247
xc-0.243
Flo-0.242
Token<|endoftext|>
Token ID50256
Feature activation-9.28e-03

Pos logit contributions
 gloomy+0.043
 happiest+0.042
ience+0.041
 End+0.040
 sizes+0.038
Neg logit contributions
uzz-0.062
rett-0.061
Wh-0.060
zzy-0.060
uit-0.060
TokenL
Token ID43
Feature activation+2.27e-01

Pos logit contributions
 gloomy+0.010
 End+0.010
 goals+0.009
 sky+0.009
s+0.009
Neg logit contributions
xc-0.018
rett-0.018
uzz-0.018
Wh-0.018
zzy-0.017
Tokenily
Token ID813
Feature activation+1.85e-01

Pos logit contributions
Wh+0.345
zip+0.310
xc+0.307
Flo+0.300
rett+0.299
Neg logit contributions
s-0.357
 End-0.353
 days-0.353
 gloomy-0.350
 problem-0.341
Token and
Token ID290
Feature activation+5.35e-02

Pos logit contributions
rett+0.336
zzy+0.334
bur+0.334
uzz+0.334
 unw+0.332
Neg logit contributions
 gloomy-0.212
 happiest-0.205
 End-0.196
lessness-0.194
ience-0.189
Token Ben
Token ID3932
Feature activation+4.65e-02

Pos logit contributions
rett+0.085
uit+0.082
uzz+0.080
xc+0.080
zzy+0.079
Neg logit contributions
 gloomy-0.083
 happiest-0.079
 End-0.076
 goals-0.072
 educational-0.072
Token were
Token ID547
Feature activation-4.20e-02

Pos logit contributions
uzz+0.086
zzy+0.086
 unw+0.086
rett+0.086
bur+0.085
Neg logit contributions
 gloomy-0.052
 happiest-0.049
 End-0.048
ience-0.048
lessness-0.046
Token friends
Token ID2460
Feature activation-3.14e-02

Pos logit contributions
 sizes+0.039
ience+0.037
 gloomy+0.037
leness+0.036
 happiest+0.036
Neg logit contributions
uzz-0.086
xc-0.085
rett-0.084
bur-0.084
zzy-0.083
Token€
Token ID26391
Feature activation+1.25e-01

Pos logit contributions
rett+0.044
uzz+0.044
xc+0.043
uit+0.041
zzy+0.041
Neg logit contributions
 gloomy-0.067
 happiest-0.065
 End-0.065
ience-0.063
 sizes-0.061
Tokenâ„¢
Token ID8151
Feature activation+8.17e-02

Pos logit contributions
rett+0.184
zzy+0.181
uzz+0.180
uit+0.176
Wh+0.175
Neg logit contributions
 gloomy-0.193
 happiest-0.191
 End-0.186
ience-0.181
 goals-0.179
Tokens
Token ID82
Feature activation-1.41e-01

Pos logit contributions
rett+0.188
xc+0.184
uit+0.183
bur+0.183
uzz+0.182
Neg logit contributions
 happiest-0.057
 gloomy-0.057
 sizes-0.054
 End-0.054
 goals-0.053
Token wrong
Token ID2642
Feature activation-8.77e-02

Pos logit contributions
 sizes+0.172
 End+0.164
 lifetime+0.162
 happiest+0.162
ience+0.161
Neg logit contributions
xc-0.244
uzz-0.242
rett-0.241
zzy-0.239
bur-0.238
Token,
Token ID11
Feature activation-8.56e-02

Pos logit contributions
 gloomy+0.122
 happiest+0.113
 problem+0.108
 End+0.108
 goals+0.105
Neg logit contributions
Wh-0.147
bur-0.145
rett-0.144
xc-0.144
uzz-0.143
Token Mike
Token ID4995
Feature activation+2.11e-01

Pos logit contributions
 gloomy+0.157
 happiest+0.151
 End+0.145
 goals+0.144
 sizes+0.143
Neg logit contributions
xc-0.104
rett-0.104
Wh-0.102
uit-0.101
bur-0.101
Tokeny
Token ID88
Feature activation+5.19e-02

Pos logit contributions
Wh+0.328
rett+0.327
uzz+0.325
xc+0.323
uit+0.320
Neg logit contributions
 gloomy-0.307
 happiest-0.292
ience-0.284
 End-0.284
 sizes-0.273
Token?
Token ID30
Feature activation+3.74e-02

Pos logit contributions
rett+0.059
zzy+0.058
xc+0.056
uzz+0.056
bur+0.056
Neg logit contributions
 gloomy-0.093
 happiest-0.092
 End-0.092
 sizes-0.091
 goals-0.089
Tokenâ
Token ID22940
Feature activation+3.60e-02

Pos logit contributions
rett+0.071
xc+0.071
zzy+0.071
bur+0.070
uzz+0.069
Neg logit contributions
 gloomy-0.041
 happiest-0.039
 End-0.037
ience-0.037
 goals-0.036
Token€
Token ID26391
Feature activation+1.24e-01

Pos logit contributions
rett+0.042
uzz+0.041
xc+0.041
uit+0.040
zzy+0.040
Neg logit contributions
 gloomy-0.067
 happiest-0.065
 End-0.064
ience-0.063
 goals-0.062
Token his
Token ID465
Feature activation+4.90e-02

Pos logit contributions
rett+0.179
zzy+0.174
bur+0.173
Wh+0.169
uit+0.169
Neg logit contributions
 gloomy-0.198
 End-0.197
 happiest-0.194
ience-0.185
 goals-0.182
Token ran
Token ID4966
Feature activation+3.36e-02

Pos logit contributions
 unw+0.023
rett+0.023
Wh+0.023
uit+0.023
zzy+0.023
Neg logit contributions
 End-0.027
 gloomy-0.027
 happiest-0.027
 educational-0.026
 goals-0.026
Token away
Token ID1497
Feature activation+8.63e-02

Pos logit contributions
rett+0.059
xc+0.059
bur+0.058
igi+0.057
uzz+0.056
Neg logit contributions
 gloomy-0.046
 valuable-0.037
 happiest-0.037
 End-0.037
 bored-0.037
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
rett+0.161
uzz+0.161
zzy+0.157
uit+0.156
xc+0.155
Neg logit contributions
 gloomy-0.098
 happiest-0.097
 End-0.095
ience-0.093
 goals-0.089
Token
Token ID198
Feature activation+9.00e-02

Pos logit contributions
 gloomy+0.050
 happiest+0.047
 End+0.046
 valuable+0.045
ience+0.044
Neg logit contributions
rett-0.055
xc-0.054
uzz-0.054
zzy-0.054
uit-0.054
Token
Token ID198
Feature activation+8.95e-02

Pos logit contributions
xc+0.204
bur+0.189
uit+0.188
rett+0.187
uzz+0.183
Neg logit contributions
s-0.086
 much-0.080
 gloomy-0.076
 valuable-0.074
 that-0.069
TokenTim
Token ID14967
Feature activation+2.39e-01

Pos logit contributions
rett+0.107
xc+0.102
uzz+0.102
bur+0.098
uit+0.097
Neg logit contributions
 gloomy-0.173
 End-0.166
 happiest-0.162
 valuable-0.160
 educational-0.157
Tokenmy
Token ID1820
Feature activation+1.21e-01

Pos logit contributions
uzz+0.377
rett+0.362
zzy+0.362
uit+0.353
xc+0.349
Neg logit contributions
 gloomy-0.354
 happiest-0.343
ience-0.337
 End-0.336
 problem-0.323
Token and
Token ID290
Feature activation+5.11e-02

Pos logit contributions
rett+0.225
uzz+0.221
xc+0.221
zzy+0.220
Wh+0.218
Neg logit contributions
 gloomy-0.140
 happiest-0.135
 End-0.131
ience-0.125
 sizes-0.124
Token his
Token ID465
Feature activation+4.88e-02

Pos logit contributions
uit+0.068
rett+0.068
zzy+0.068
xc+0.065
uzz+0.065
Neg logit contributions
 gloomy-0.094
 happiest-0.086
 End-0.084
 miserable-0.082
 days-0.082
Token friends
Token ID2460
Feature activation-2.98e-02

Pos logit contributions
rett+0.092
xc+0.090
uzz+0.090
zzy+0.090
uit+0.089
Neg logit contributions
 gloomy-0.058
 happiest-0.055
 End-0.053
 goals-0.051
ience-0.049
Token were
Token ID547
Feature activation-4.60e-02

Pos logit contributions
 gloomy+0.041
 happiest+0.040
 End+0.040
 goals+0.038
 sizes+0.038
Neg logit contributions
rett-0.048
uzz-0.048
zzy-0.048
uit-0.048
xc-0.047
Token it
Token ID340
Feature activation-7.88e-02

Pos logit contributions
uzz+0.161
 unw+0.157
rett+0.155
 Sara+0.148
xc+0.147
Neg logit contributions
ience-0.120
 gloomy-0.117
leness-0.114
 sizes-0.114
 happiest-0.111
Token also
Token ID635
Feature activation-4.01e-02

Pos logit contributions
 gloomy+0.130
leness+0.123
 End+0.122
 goals+0.120
lessness+0.118
Neg logit contributions
 unw-0.103
xc-0.100
rett-0.097
zzy-0.096
bur-0.094
Token brought
Token ID3181
Feature activation+1.20e-01

Pos logit contributions
leness+0.058
 horizon+0.057
plets+0.057
 End+0.056
 lifetime+0.056
Neg logit contributions
 Sue-0.053
 unw-0.053
isk-0.051
 Sara-0.050
bur-0.048
Token her
Token ID607
Feature activation+4.17e-04

Pos logit contributions
rett+0.186
uzz+0.183
xc+0.182
zzy+0.180
Wh+0.180
Neg logit contributions
 gloomy-0.180
 happiest-0.173
 End-0.167
ience-0.163
 goals-0.163
Token special
Token ID2041
Feature activation-1.73e-01

Pos logit contributions
rett+0.001
uzz+0.001
uit+0.001
bur+0.001
xc+0.001
Neg logit contributions
 gloomy-0.000
 End-0.000
 happiest-0.000
ience-0.000
 sizes-0.000
Token treats
Token ID18432
Feature activation+8.34e-02

Pos logit contributions
 gloomy+0.252
 happiest+0.240
 End+0.238
 valuable+0.227
ience+0.227
Neg logit contributions
uzz-0.273
xc-0.273
rett-0.273
Wh-0.270
zzy-0.268
Token like
Token ID588
Feature activation-5.77e-03

Pos logit contributions
uit+0.132
uzz+0.131
rett+0.130
zzy+0.128
 unw+0.126
Neg logit contributions
 gloomy-0.121
 End-0.119
 happiest-0.115
ience-0.114
leness-0.112
Token candy
Token ID18550
Feature activation+5.14e-02

Pos logit contributions
 gloomy+0.008
 happiest+0.008
 End+0.008
ience+0.007
lessness+0.007
Neg logit contributions
uit-0.009
rett-0.009
uzz-0.009
 unw-0.009
Wh-0.009
Token.
Token ID13
Feature activation-3.90e-02

Pos logit contributions
uit+0.076
 unw+0.076
rett+0.071
bur+0.070
zzy+0.070
Neg logit contributions
 End-0.082
 gloomy-0.076
 happiest-0.075
leness-0.073
 insignificant-0.070
Token She
Token ID1375
Feature activation-3.81e-02

Pos logit contributions
 gloomy+0.045
 happiest+0.043
 End+0.042
ience+0.041
 goals+0.039
Neg logit contributions
rett-0.073
uzz-0.073
xc-0.071
uit-0.071
zzy-0.071
Token knew
Token ID2993
Feature activation-1.34e-01

Pos logit contributions
 gloomy+0.048
 happiest+0.046
 End+0.045
ience+0.044
 goals+0.043
Neg logit contributions
rett-0.067
uzz-0.066
xc-0.066
zzy-0.065
uit-0.065
Token small
Token ID1402
Feature activation-1.27e-01

Pos logit contributions
 unw+0.100
Wh+0.099
Flo+0.098
zzy+0.098
Jen+0.096
Neg logit contributions
lessness-0.073
ience-0.071
 sizes-0.070
leness-0.068
 happiest-0.066
Token town
Token ID3240
Feature activation-8.42e-02

Pos logit contributions
ience+0.177
lessness+0.174
leness+0.171
 happiest+0.170
 End+0.163
Neg logit contributions
 unw-0.203
uzz-0.198
rett-0.195
bur-0.194
xc-0.192
Token,
Token ID11
Feature activation-8.18e-02

Pos logit contributions
ience+0.138
 happiest+0.127
leness+0.124
 goals+0.120
 educational+0.117
Neg logit contributions
uzz-0.136
Jen-0.128
 Sue-0.128
 Jen-0.126
 Sara-0.123
Token there
Token ID612
Feature activation+6.91e-02

Pos logit contributions
 gloomy+0.126
 happiest+0.124
 End+0.120
ience+0.119
 goals+0.116
Neg logit contributions
rett-0.119
uzz-0.119
xc-0.117
zzy-0.116
Wh-0.115
Token was
Token ID373
Feature activation+4.80e-02

Pos logit contributions
rett+0.121
uzz+0.121
xc+0.119
zzy+0.118
uit+0.118
Neg logit contributions
 gloomy-0.087
 happiest-0.084
 End-0.081
ience-0.081
 goals-0.078
Token a
Token ID257
Feature activation+5.75e-02

Pos logit contributions
rett+0.087
xc+0.085
bur+0.082
uit+0.081
uzz+0.080
Neg logit contributions
 gloomy-0.072
 End-0.064
 happiest-0.062
 days-0.060
 ordinary-0.059
Token bar
Token ID2318
Feature activation+1.32e-02

Pos logit contributions
Wh+0.088
 unw+0.086
zzy+0.085
Jen+0.084
uzz+0.084
Neg logit contributions
 sizes-0.080
ience-0.079
leness-0.078
lessness-0.078
 goals-0.076
Tokenber
Token ID527
Feature activation-1.22e-02

Pos logit contributions
uzz+0.016
zzy+0.014
ba+0.013
rett+0.013
Jen+0.013
Neg logit contributions
 happiest-0.025
ience-0.025
leness-0.024
 End-0.024
 sizes-0.024
Token.
Token ID13
Feature activation-3.47e-02

Pos logit contributions
 happiest+0.020
 gloomy+0.019
ience+0.019
 End+0.019
leness+0.019
Neg logit contributions
uzz-0.017
uit-0.016
zzy-0.016
Wh-0.016
bur-0.016
Token The
Token ID383
Feature activation-1.67e-02

Pos logit contributions
 gloomy+0.044
 happiest+0.043
 End+0.042
ience+0.041
 goals+0.040
Neg logit contributions
rett-0.060
uzz-0.060
zzy-0.059
Wh-0.058
uit-0.058
Token bar
Token ID2318
Feature activation+1.34e-02

Pos logit contributions
ience+0.027
 sizes+0.026
 gloomy+0.026
 happiest+0.025
leness+0.025
Neg logit contributions
rett-0.023
zzy-0.022
uzz-0.022
Wh-0.022
bur-0.022
Token.
Token ID13
Feature activation-4.10e-02

Pos logit contributions
leness+0.050
 gloomy+0.049
ience+0.049
 End+0.049
 happiest+0.049
Neg logit contributions
 unw-0.043
uit-0.043
uzz-0.042
rett-0.040
zzy-0.039
Token They
Token ID1119
Feature activation-7.70e-02

Pos logit contributions
 gloomy+0.042
ience+0.042
 happiest+0.042
 End+0.040
leness+0.038
Neg logit contributions
uzz-0.081
rett-0.079
Wh-0.079
zzy-0.078
uit-0.078
Token jumped
Token ID11687
Feature activation+4.97e-02

Pos logit contributions
 gloomy+0.090
 happiest+0.088
 End+0.080
 sizes+0.077
ience+0.076
Neg logit contributions
rett-0.146
xc-0.146
uzz-0.141
uit-0.139
Wh-0.137
Token in
Token ID287
Feature activation+1.29e-01

Pos logit contributions
rett+0.087
uzz+0.085
xc+0.085
bur+0.084
zzy+0.084
Neg logit contributions
 gloomy-0.066
 happiest-0.061
 End-0.059
 goals-0.058
ience-0.057
Token the
Token ID262
Feature activation-7.56e-02

Pos logit contributions
rett+0.209
uzz+0.206
xc+0.204
zzy+0.203
uit+0.202
Neg logit contributions
 gloomy-0.183
 happiest-0.176
 End-0.172
ience-0.168
 goals-0.165
Token p
Token ID279
Feature activation+1.26e-01

Pos logit contributions
ience+0.119
leness+0.114
 sizes+0.112
 educational+0.112
 valuable+0.111
Neg logit contributions
Wh-0.107
Jen-0.106
xc-0.104
Flo-0.104
bur-0.103
Tokenudd
Token ID4185
Feature activation+1.09e-03

Pos logit contributions
uzz+0.230
rett+0.221
Jen+0.220
zzy+0.219
uit+0.219
Neg logit contributions
 happiest-0.158
 End-0.158
 gloomy-0.156
ience-0.144
 valuable-0.143
Tokenles
Token ID829
Feature activation+5.11e-02

Pos logit contributions
rett+0.002
uit+0.002
bur+0.002
xc+0.002
zzy+0.002
Neg logit contributions
 happiest-0.001
 End-0.001
 educational-0.001
 gloomy-0.001
 lifetime-0.001
Token and
Token ID290
Feature activation+4.74e-02

Pos logit contributions
uit+0.064
uzz+0.061
 unw+0.061
rett+0.056
 aid+0.055
Neg logit contributions
ience-0.090
 End-0.090
 happiest-0.088
 insignificant-0.087
leness-0.087
Token laughed
Token ID13818
Feature activation-1.73e-02

Pos logit contributions
uzz+0.065
rett+0.065
zzy+0.064
Wh+0.063
xc+0.063
Neg logit contributions
 gloomy-0.077
 happiest-0.075
 End-0.074
ience-0.073
 goals-0.072
Token.
Token ID13
Feature activation-4.20e-02

Pos logit contributions
ience+0.027
leness+0.026
 happiest+0.025
 End+0.025
 insignificant+0.024
Neg logit contributions
uzz-0.026
uit-0.025
zzy-0.025
 unw-0.024
rett-0.024
TokenYes
Token ID5297
Feature activation+1.36e-01

Pos logit contributions
 gloomy+0.192
 happiest+0.188
 End+0.184
 educational+0.181
 valuable+0.180
Neg logit contributions
rett-0.174
uzz-0.167
xc-0.167
uit-0.165
bur-0.159
Token,
Token ID11
Feature activation-8.07e-02

Pos logit contributions
uit+0.289
rett+0.289
uzz+0.286
xc+0.286
bur+0.282
Neg logit contributions
 gloomy-0.118
 happiest-0.115
 End-0.108
 sizes-0.103
 goals-0.103
Token he
Token ID339
Feature activation-1.38e-01

Pos logit contributions
 gloomy+0.100
 happiest+0.096
ience+0.089
lessness+0.089
 valuable+0.088
Neg logit contributions
rett-0.152
zzy-0.149
uzz-0.147
xc-0.145
bur-0.140
Token can
Token ID460
Feature activation-7.55e-02

Pos logit contributions
 gloomy+0.174
 End+0.159
lessness+0.155
 happiest+0.153
 educational+0.150
Neg logit contributions
bur-0.247
rett-0.246
uit-0.244
 unw-0.244
zip-0.241
Token ride
Token ID6594
Feature activation+1.03e-01

Pos logit contributions
 gloomy+0.124
 happiest+0.120
 End+0.118
ience+0.116
 goals+0.114
Neg logit contributions
rett-0.104
uzz-0.103
xc-0.102
zzy-0.101
uit-0.101
Token with
Token ID351
Feature activation+1.27e-01

Pos logit contributions
rett+0.160
uzz+0.159
xc+0.156
Wh+0.155
uit+0.155
Neg logit contributions
 gloomy-0.151
 happiest-0.147
 End-0.144
ience-0.140
 goals-0.136
Token me
Token ID502
Feature activation+7.26e-02

Pos logit contributions
rett+0.181
uzz+0.179
xc+0.177
zzy+0.176
uit+0.174
Neg logit contributions
 gloomy-0.204
 happiest-0.197
 End-0.192
ience-0.189
 goals-0.186
Token."
Token ID526
Feature activation+9.22e-03

Pos logit contributions
rett+0.123
uzz+0.123
uit+0.122
bur+0.119
zzy+0.117
Neg logit contributions
 gloomy-0.099
 happiest-0.095
 End-0.094
ience-0.089
 educational-0.088
Token
Token ID198
Feature activation+9.10e-02

Pos logit contributions
rett+0.014
zzy+0.014
xc+0.014
bur+0.013
uzz+0.013
Neg logit contributions
 gloomy-0.015
 These-0.014
s-0.014
lessness-0.013
 educational-0.013
Token
Token ID198
Feature activation+9.04e-02

Pos logit contributions
xc+0.213
rett+0.208
uit+0.198
uzz+0.197
bur+0.196
Neg logit contributions
s-0.085
 much-0.069
 End-0.062
 gloomy-0.062
 that-0.061
TokenTim
Token ID14967
Feature activation+2.43e-01

Pos logit contributions
rett+0.084
uzz+0.075
xc+0.073
zzy+0.069
uit+0.067
Neg logit contributions
 gloomy-0.200
 End-0.196
 happiest-0.190
 valuable-0.189
 sizes-0.188
Token in
Token ID287
Feature activation+1.29e-01

Pos logit contributions
Wh+0.213
xc+0.212
rett+0.209
uit+0.204
bur+0.203
Neg logit contributions
 gloomy-0.092
 End-0.072
 valuable-0.072
 happiest-0.071
 goals-0.070
Token the
Token ID262
Feature activation-7.68e-02

Pos logit contributions
xc+0.224
rett+0.219
zzy+0.214
igi+0.212
uzz+0.206
Neg logit contributions
 gloomy-0.190
 goals-0.171
 End-0.170
 ordinary-0.167
 sizes-0.164
Token air
Token ID1633
Feature activation-1.31e-01

Pos logit contributions
ience+0.063
 sizes+0.062
 gloomy+0.059
 happiest+0.059
 End+0.059
Neg logit contributions
Wh-0.159
 unw-0.159
bur-0.158
uit-0.156
rett-0.156
Token.
Token ID13
Feature activation-3.74e-02

Pos logit contributions
 End+0.171
ience+0.170
 happiest+0.169
 gloomy+0.166
 educational+0.158
Neg logit contributions
uit-0.219
uzz-0.219
rett-0.218
 unw-0.212
zzy-0.209
Token It
Token ID632
Feature activation-1.42e-02

Pos logit contributions
 gloomy+0.036
 happiest+0.032
 These+0.031
 End+0.031
 valuable+0.030
Neg logit contributions
xc-0.079
bur-0.077
rett-0.077
zzy-0.075
uzz-0.075
Token was
Token ID373
Feature activation+4.39e-02

Pos logit contributions
 gloomy+0.016
 happiest+0.016
 sizes+0.014
ience+0.014
 End+0.014
Neg logit contributions
rett-0.028
uzz-0.028
xc-0.027
uit-0.027
bur-0.026
Token a
Token ID257
Feature activation+5.38e-02

Pos logit contributions
xc+0.077
rett+0.077
uzz+0.074
zzy+0.074
Wh+0.073
Neg logit contributions
 gloomy-0.062
 happiest-0.056
 End-0.054
 valuable-0.054
 sizes-0.053
Token small
Token ID1402
Feature activation-1.31e-01

Pos logit contributions
rett+0.097
 unw+0.097
Wh+0.095
uzz+0.095
uit+0.095
Neg logit contributions
ience-0.058
 End-0.058
 happiest-0.057
 sizes-0.056
lessness-0.056
Token box
Token ID3091
Feature activation-4.19e-02

Pos logit contributions
 End+0.180
 happiest+0.178
lessness+0.175
ience+0.173
leness+0.172
Neg logit contributions
 unw-0.205
uzz-0.196
rett-0.194
uit-0.193
 bru-0.190
Token.
Token ID13
Feature activation-3.79e-02

Pos logit contributions
 End+0.059
ience+0.059
 goals+0.059
 happiest+0.058
leness+0.057
Neg logit contributions
uit-0.068
 unw-0.067
uzz-0.066
rett-0.065
zzy-0.062
Token
Token ID220
Feature activation-7.61e-02

Pos logit contributions
 gloomy+0.040
 happiest+0.036
 End+0.035
lessness+0.035
 These+0.034
Neg logit contributions
xc-0.075
rett-0.074
bur-0.074
zzy-0.074
uzz-0.073
Token newspaper
Token ID7533
Feature activation-9.59e-02

Pos logit contributions
leness+0.104
ience+0.102
 happiest+0.102
 End+0.102
 sizes+0.096
Neg logit contributions
 unw-0.103
uit-0.098
rett-0.095
uzz-0.093
zip-0.093
Token.
Token ID13
Feature activation-2.50e-02

Pos logit contributions
 happiest+0.134
 End+0.132
 gloomy+0.129
ience+0.128
leness+0.126
Neg logit contributions
uit-0.162
 unw-0.158
rett-0.156
uzz-0.152
zzy-0.148
Token Suddenly
Token ID24975
Feature activation+5.37e-02

Pos logit contributions
 gloomy+0.032
 happiest+0.030
 End+0.029
 sizes+0.028
ience+0.027
Neg logit contributions
rett-0.044
xc-0.044
zzy-0.044
bur-0.043
uzz-0.043
Token,
Token ID11
Feature activation-7.55e-02

Pos logit contributions
uit+0.110
rett+0.110
uzz+0.109
xc+0.109
Wh+0.107
Neg logit contributions
 happiest-0.049
 gloomy-0.048
 End-0.047
ience-0.045
 goals-0.045
Token a
Token ID257
Feature activation+5.89e-02

Pos logit contributions
 gloomy+0.091
 happiest+0.082
 End+0.078
lessness+0.073
ience+0.072
Neg logit contributions
rett-0.145
uzz-0.143
xc-0.141
zzy-0.140
bur-0.137
Token spider
Token ID19230
Feature activation-9.42e-05

Pos logit contributions
 unw+0.096
Wh+0.095
uit+0.094
Jen+0.092
rett+0.092
Neg logit contributions
 sizes-0.075
 goals-0.075
ience-0.074
 happiest-0.073
 End-0.072
Token crawled
Token ID45668
Feature activation+9.05e-02

Pos logit contributions
 happiest+0.000
 gloomy+0.000
 End+0.000
 goals+0.000
 sizes+0.000
Neg logit contributions
rett-0.000
xc-0.000
 unw-0.000
Wh-0.000
uit-0.000
Token out
Token ID503
Feature activation+9.82e-02

Pos logit contributions
rett+0.162
zzy+0.156
uzz+0.156
xc+0.155
bur+0.155
Neg logit contributions
 gloomy-0.121
lessness-0.101
 End-0.100
 happiest-0.099
ience-0.099
Token of
Token ID286
Feature activation+2.60e-02

Pos logit contributions
rett+0.194
xc+0.191
zzy+0.190
bur+0.189
Wh+0.189
Neg logit contributions
 gloomy-0.107
lessness-0.095
 happiest-0.093
 sizes-0.091
 End-0.089
Token the
Token ID262
Feature activation-6.70e-02

Pos logit contributions
zzy+0.051
rett+0.050
xc+0.050
uzz+0.049
ba+0.048
Neg logit contributions
 gloomy-0.033
 days-0.029
 End-0.027
 ordinary-0.026
 day-0.026
Token newspaper
Token ID7533
Feature activation-9.76e-02

Pos logit contributions
 happiest+0.097
leness+0.096
 End+0.094
 sizes+0.093
ience+0.093
Neg logit contributions
 unw-0.104
 aid-0.099
Wh-0.099
uit-0.098
zip-0.095
Token its
Token ID663
Feature activation-3.66e-02

Pos logit contributions
uzz+0.006
rett+0.006
uit+0.006
zzy+0.006
xc+0.006
Neg logit contributions
 gloomy-0.006
 happiest-0.006
 End-0.006
ience-0.006
lessness-0.006
Token strong
Token ID1913
Feature activation-1.16e-01

Pos logit contributions
ience+0.047
 End+0.047
lessness+0.045
 gloomy+0.045
leness+0.045
Neg logit contributions
uzz-0.056
rett-0.056
 unw-0.056
bur-0.055
zzy-0.055
Token sails
Token ID48370
Feature activation+4.51e-02

Pos logit contributions
 happiest+0.159
ience+0.158
lessness+0.156
 gloomy+0.155
 End+0.155
Neg logit contributions
uzz-0.171
 unw-0.169
rett-0.169
uit-0.167
 aid-0.163
Token to
Token ID284
Feature activation+1.22e-01

Pos logit contributions
uit+0.065
rett+0.062
uzz+0.061
bur+0.061
 unw+0.059
Neg logit contributions
 End-0.073
 gloomy-0.070
ience-0.069
 educational-0.067
 happiest-0.067
Token sail
Token ID14936
Feature activation+2.73e-03

Pos logit contributions
rett+0.162
uzz+0.161
uit+0.158
zzy+0.158
xc+0.158
Neg logit contributions
 gloomy-0.203
 happiest-0.198
 End-0.194
ience-0.192
 goals-0.187
Token across
Token ID1973
Feature activation-5.36e-02

Pos logit contributions
uzz+0.004
rett+0.004
uit+0.004
zzy+0.004
bur+0.004
Neg logit contributions
 gloomy-0.004
 happiest-0.004
 End-0.004
ience-0.004
 educational-0.003
Token the
Token ID262
Feature activation-6.88e-02

Pos logit contributions
 happiest+0.083
 gloomy+0.083
ience+0.082
 End+0.081
 sizes+0.078
Neg logit contributions
rett-0.074
uzz-0.074
zzy-0.072
xc-0.072
 unw-0.072
Token ocean
Token ID9151
Feature activation-1.08e-01

Pos logit contributions
 These+0.086
ience+0.084
leness+0.082
lessness+0.080
 Playing+0.078
Neg logit contributions
bur-0.112
zzy-0.107
 unw-0.106
rett-0.105
xc-0.104
Token and
Token ID290
Feature activation+5.62e-02

Pos logit contributions
 End+0.179
ience+0.168
leness+0.163
 happiest+0.162
 These+0.162
Neg logit contributions
 unw-0.147
uit-0.140
rett-0.136
uzz-0.131
onna-0.130
Token it
Token ID340
Feature activation-7.22e-02

Pos logit contributions
uzz+0.079
rett+0.078
 unw+0.077
bur+0.075
uit+0.075
Neg logit contributions
 gloomy-0.083
 End-0.082
ience-0.081
 happiest-0.078
 These-0.078
Token was
Token ID373
Feature activation+4.95e-02

Pos logit contributions
 End+0.111
 These+0.110
 gloomy+0.109
 educational+0.107
leness+0.105
Neg logit contributions
 unw-0.097
rett-0.096
zzy-0.095
xc-0.094
bur-0.094
Token Blue
Token ID4518
Feature activation+1.33e-01

Pos logit contributions
rett+0.001
uzz+0.001
xc+0.001
bur+0.001
Wh+0.001
Neg logit contributions
 gloomy-0.001
 happiest-0.001
 End-0.001
ience-0.001
lessness-0.001
Token too
Token ID1165
Feature activation-1.78e-02

Pos logit contributions
bur+0.183
ba+0.181
Flo+0.178
uit+0.178
rett+0.177
Neg logit contributions
 educational-0.208
lessness-0.199
 End-0.197
 happiest-0.196
 lifetime-0.195
Token.
Token ID13
Feature activation-4.03e-02

Pos logit contributions
 happiest+0.029
ience+0.026
 End+0.025
 lifetime+0.024
lessness+0.024
Neg logit contributions
uzz-0.027
 unw-0.026
zzy-0.025
S-0.025
rett-0.025
Token She
Token ID1375
Feature activation-3.34e-02

Pos logit contributions
 gloomy+0.035
 happiest+0.034
 goals+0.032
 End+0.032
ience+0.031
Neg logit contributions
zzy-0.088
rett-0.088
xc-0.085
bur-0.085
uit-0.084
Token is
Token ID318
Feature activation-4.19e-02

Pos logit contributions
 gloomy+0.033
 End+0.030
 happiest+0.030
lessness+0.029
 educational+0.028
Neg logit contributions
rett-0.067
uzz-0.067
 unw-0.066
xc-0.066
bur-0.066
Token so
Token ID523
Feature activation-4.77e-02

Pos logit contributions
 gloomy+0.052
 happiest+0.049
 End+0.045
 valuable+0.045
 goals+0.044
Neg logit contributions
rett-0.082
xc-0.081
zzy-0.080
uit-0.079
uzz-0.078
Token pretty
Token ID2495
Feature activation-7.75e-02

Pos logit contributions
 gloomy+0.040
 happiest+0.039
 End+0.038
ience+0.036
 sizes+0.034
Neg logit contributions
uzz-0.102
rett-0.102
xc-0.100
zzy-0.100
Wh-0.100
Token.
Token ID13
Feature activation-3.96e-02

Pos logit contributions
 End+0.153
 lifetime+0.151
leness+0.150
 happiest+0.150
 gloomy+0.137
Neg logit contributions
 Sara-0.085
uzz-0.077
S-0.073
bur-0.072
Flo-0.072
Token Can
Token ID1680
Feature activation-1.80e-02

Pos logit contributions
 gloomy+0.034
 happiest+0.033
 End+0.031
 goals+0.031
 sizes+0.030
Neg logit contributions
rett-0.087
zzy-0.086
xc-0.085
bur-0.084
uzz-0.083
Token we
Token ID356
Feature activation-9.74e-02

Pos logit contributions
 gloomy+0.016
 happiest+0.015
 End+0.015
ience+0.014
 goals+0.013
Neg logit contributions
rett-0.039
zzy-0.038
uzz-0.038
xc-0.038
uit-0.038
Token take
Token ID1011
Feature activation+1.23e-01

Pos logit contributions
 gloomy+0.118
 happiest+0.116
 End+0.105
 goals+0.099
s+0.098
Neg logit contributions
rett-0.185
xc-0.182
zzy-0.177
uzz-0.175
igi-0.172
Token to
Token ID284
Feature activation+1.21e-01

Pos logit contributions
 End+0.239
 happiest+0.238
ience+0.237
 gloomy+0.234
 goals+0.222
Neg logit contributions
rett-0.212
 unw-0.206
uit-0.205
uzz-0.204
xc-0.198
Token draw
Token ID3197
Feature activation+2.39e-02

Pos logit contributions
rett+0.193
xc+0.191
uzz+0.191
zzy+0.188
uit+0.187
Neg logit contributions
 gloomy-0.176
 happiest-0.170
 End-0.166
 goals-0.159
ience-0.157
Token a
Token ID257
Feature activation+5.77e-02

Pos logit contributions
rett+0.036
uit+0.036
uzz+0.036
zzy+0.035
xc+0.035
Neg logit contributions
 gloomy-0.035
 happiest-0.034
 End-0.034
ience-0.033
lessness-0.031
Token new
Token ID649
Feature activation-7.74e-02

Pos logit contributions
 unw+0.102
rett+0.102
Wh+0.099
uzz+0.098
uit+0.098
Neg logit contributions
lessness-0.065
ience-0.065
 End-0.065
 happiest-0.063
 gloomy-0.062
Token animal
Token ID5044
Feature activation+3.30e-02

Pos logit contributions
 gloomy+0.098
 happiest+0.095
ience+0.095
 End+0.093
lessness+0.092
Neg logit contributions
rett-0.133
uzz-0.133
uit-0.129
xc-0.129
 unw-0.127
Token.
Token ID13
Feature activation-3.07e-02

Pos logit contributions
 unw+0.050
uit+0.049
uzz+0.049
rett+0.048
zzy+0.047
Neg logit contributions
 End-0.047
 gloomy-0.047
 happiest-0.046
ience-0.045
lessness-0.044
Token He
Token ID679
Feature activation+1.67e-02

Pos logit contributions
 gloomy+0.037
 happiest+0.035
 End+0.034
ience+0.033
 goals+0.033
Neg logit contributions
rett-0.056
xc-0.056
uzz-0.055
zzy-0.055
bur-0.055
Token drew
Token ID9859
Feature activation+7.46e-02

Pos logit contributions
rett+0.027
uzz+0.027
xc+0.027
zzy+0.027
uit+0.027
Neg logit contributions
 gloomy-0.023
 happiest-0.022
 End-0.022
ience-0.021
 goals-0.021
Token a
Token ID257
Feature activation+5.70e-02

Pos logit contributions
uit+0.110
uzz+0.109
rett+0.108
Wh+0.104
xc+0.103
Neg logit contributions
 gloomy-0.112
ience-0.109
 happiest-0.109
 End-0.108
leness-0.103
Token yellow
Token ID7872
Feature activation-6.70e-02

Pos logit contributions
 unw+0.103
Wh+0.102
rett+0.102
uit+0.100
uzz+0.100
Neg logit contributions
ience-0.063
lessness-0.061
 happiest-0.059
 gloomy-0.059
 End-0.059
Token bird
Token ID6512
Feature activation+9.74e-02

Pos logit contributions
 gloomy+0.092
 happiest+0.090
 End+0.087
ience+0.086
 goals+0.084
Neg logit contributions
rett-0.109
uzz-0.108
uit-0.107
xc-0.106
Wh-0.105
Token all
Token ID477
Feature activation-6.27e-02

Pos logit contributions
 gloomy+0.092
 happiest+0.085
 End+0.085
 goals+0.082
 sizes+0.082
Neg logit contributions
rett-0.129
uzz-0.129
zzy-0.128
bur-0.127
uit-0.126
Token agreed
Token ID4987
Feature activation-2.42e-02

Pos logit contributions
 gloomy+0.087
 happiest+0.083
 End+0.083
 goals+0.081
 problem+0.081
Neg logit contributions
uzz-0.103
zzy-0.102
rett-0.098
 unw-0.097
xc-0.096
Token that
Token ID326
Feature activation-1.17e-01

Pos logit contributions
 gloomy+0.029
 happiest+0.028
 End+0.028
 tallest+0.028
 sizes+0.027
Neg logit contributions
uzz-0.043
rett-0.043
 unw-0.042
uit-0.042
zzy-0.042
Token Jen
Token ID13364
Feature activation+2.19e-01

Pos logit contributions
ience+0.146
 gloomy+0.146
 End+0.142
 happiest+0.139
leness+0.139
Neg logit contributions
uzz-0.196
rett-0.190
xc-0.187
 unw-0.183
uit-0.182
Token was
Token ID373
Feature activation+4.90e-02

Pos logit contributions
zzy+0.383
rett+0.376
bur+0.376
xc+0.368
 unw+0.367
Neg logit contributions
 gloomy-0.273
 End-0.263
 happiest-0.249
 educational-0.243
 goals-0.242
Token so
Token ID523
Feature activation-4.02e-02

Pos logit contributions
zzy+0.071
uzz+0.071
bur+0.070
uit+0.069
rett+0.068
Neg logit contributions
ience-0.072
 sizes-0.071
 gloomy-0.071
 goals-0.070
leness-0.069
Token gifted
Token ID22527
Feature activation-2.68e-02

Pos logit contributions
 sizes+0.049
 goals+0.048
 moons+0.048
 jets+0.047
 End+0.046
Neg logit contributions
uzz-0.072
zzy-0.070
Ab-0.065
 unw-0.065
ck-0.063
Token and
Token ID290
Feature activation+5.73e-02

Pos logit contributions
 End+0.036
 gloomy+0.036
 happiest+0.036
 tallest+0.033
 goals+0.033
Neg logit contributions
uit-0.044
uzz-0.044
rett-0.043
zzy-0.042
xc-0.042
Token they
Token ID484
Feature activation-1.96e-01

Pos logit contributions
 unw+0.086
uzz+0.085
 Jen+0.082
 Sara+0.082
 Sue+0.082
Neg logit contributions
 End-0.077
 goals-0.074
 gloomy-0.071
 problem-0.070
 These-0.070
Token gave
Token ID2921
Feature activation+5.82e-02

Pos logit contributions
 gloomy+0.265
 End+0.258
 These+0.247
 educational+0.247
 goals+0.243
Neg logit contributions
zzy-0.322
 unw-0.320
uzz-0.319
bur-0.313
rett-0.313
Token her
Token ID607
Feature activation+8.91e-03

Pos logit contributions
rett+0.093
Wh+0.092
xc+0.091
zzy+0.091
uit+0.090
Neg logit contributions
 gloomy-0.081
 happiest-0.080
 End-0.075
 goals-0.075
 days-0.074
Token It
Token ID632
Feature activation-8.33e-03

Pos logit contributions
 gloomy+0.030
 happiest+0.027
 End+0.026
 These+0.025
 sky+0.024
Neg logit contributions
xc-0.062
rett-0.060
zzy-0.060
bur-0.059
uzz-0.059
Token was
Token ID373
Feature activation+5.14e-02

Pos logit contributions
 gloomy+0.012
ience+0.012
 happiest+0.012
 End+0.012
 goals+0.012
Neg logit contributions
zzy-0.012
rett-0.012
xc-0.012
Wh-0.012
uzz-0.012
Token very
Token ID845
Feature activation-1.83e-01

Pos logit contributions
rett+0.110
xc+0.109
uzz+0.108
uit+0.106
zzy+0.106
Neg logit contributions
 gloomy-0.050
 happiest-0.045
 End-0.043
 sizes-0.040
 goals-0.040
Token quiet
Token ID5897
Feature activation-1.44e-01

Pos logit contributions
ience+0.234
leness+0.223
 happiest+0.217
 End+0.215
 sizes+0.206
Neg logit contributions
zzy-0.334
uzz-0.322
Jen-0.315
 unw-0.311
rett-0.303
Token and
Token ID290
Feature activation+5.80e-02

Pos logit contributions
leness+0.291
ience+0.288
 End+0.288
 happiest+0.273
 lifetime+0.262
Neg logit contributions
uit-0.152
uzz-0.144
rett-0.142
 unw-0.140
zzy-0.132
Token he
Token ID339
Feature activation-1.38e-01

Pos logit contributions
 Jen+0.081
bur+0.080
 Sara+0.079
 Sue+0.078
Wh+0.078
Neg logit contributions
 These-0.088
leness-0.088
ience-0.085
 goals-0.081
pieces-0.080
Token could
Token ID714
Feature activation-8.44e-02

Pos logit contributions
 gloomy+0.170
 happiest+0.165
 End+0.163
ience+0.158
 sizes+0.156
Neg logit contributions
zzy-0.241
rett-0.241
uzz-0.239
xc-0.235
Wh-0.233
Token hear
Token ID3285
Feature activation+5.31e-02

Pos logit contributions
 gloomy+0.127
 happiest+0.123
 End+0.113
 goals+0.109
 educational+0.108
Neg logit contributions
xc-0.137
rett-0.134
uit-0.133
uzz-0.132
bur-0.131
Token the
Token ID262
Feature activation-6.71e-02

Pos logit contributions
uzz+0.068
rett+0.068
 unw+0.068
Wh+0.066
uit+0.065
Neg logit contributions
ience-0.086
 End-0.086
leness-0.082
 happiest-0.082
�-0.080
Token wind
Token ID2344
Feature activation-1.17e-01

Pos logit contributions
ience+0.082
leness+0.077
 End+0.076
 happiest+0.074
 educational+0.072
Neg logit contributions
rett-0.113
Wh-0.112
uzz-0.109
uit-0.109
bur-0.109
Token blowing
Token ID19280
Feature activation-1.94e-02

Pos logit contributions
ience+0.185
 End+0.182
 educational+0.178
 happiest+0.178
 valuable+0.177
Neg logit contributions
uzz-0.167
rett-0.165
uit-0.164
bur-0.162
ba-0.161
Token He
Token ID679
Feature activation+1.28e-02

Pos logit contributions
 gloomy+0.038
 goals+0.038
 These+0.035
 End+0.035
 educational+0.034
Neg logit contributions
xc-0.088
rett-0.085
bur-0.084
zzy-0.082
uit-0.082
Token told
Token ID1297
Feature activation+4.89e-02

Pos logit contributions
xc+0.026
rett+0.026
uit+0.025
uzz+0.025
icle+0.025
Neg logit contributions
 happiest-0.013
ience-0.013
 gloomy-0.012
 needed-0.012
 loves-0.012
Token his
Token ID465
Feature activation+4.25e-02

Pos logit contributions
zzy+0.085
rett+0.083
xc+0.083
zip+0.081
igi+0.079
Neg logit contributions
 gloomy-0.070
 ordinary-0.065
 days-0.065
 that-0.064
 End-0.063
Token mom
Token ID1995
Feature activation+4.58e-03

Pos logit contributions
uzz+0.049
rett+0.045
 unw+0.042
xc+0.040
zzy+0.040
Neg logit contributions
 happiest-0.080
leness-0.079
ience-0.079
 gloomy-0.079
 End-0.077
Token that
Token ID326
Feature activation-1.20e-01

Pos logit contributions
uzz+0.007
uit+0.007
rett+0.007
bur+0.007
 Sara+0.007
Neg logit contributions
 gloomy-0.007
leness-0.007
 happiest-0.007
 End-0.007
ience-0.006
Token he
Token ID339
Feature activation-1.44e-01

Pos logit contributions
 gloomy+0.118
 happiest+0.113
 End+0.112
ience+0.111
 goals+0.103
Neg logit contributions
uzz-0.239
rett-0.237
uit-0.235
bur-0.233
xc-0.232
Token liked
Token ID8288
Feature activation-8.48e-02

Pos logit contributions
 gloomy+0.213
 End+0.197
leness+0.196
 happiest+0.195
lessness+0.190
Neg logit contributions
zzy-0.221
uzz-0.217
 unw-0.213
bur-0.212
rett-0.212
Token to
Token ID284
Feature activation+1.16e-01

Pos logit contributions
 gloomy+0.130
 happiest+0.127
 End+0.125
ience+0.123
 goals+0.119
Neg logit contributions
rett-0.124
uzz-0.124
xc-0.121
uit-0.121
bur-0.120
Token deliver
Token ID5203
Feature activation+1.26e-02

Pos logit contributions
rett+0.172
xc+0.170
uzz+0.169
uit+0.166
zzy+0.166
Neg logit contributions
 gloomy-0.179
 happiest-0.172
 End-0.167
ience-0.164
 goals-0.162
Token therm
Token ID21969
Feature activation+4.99e-02

Pos logit contributions
uzz+0.022
uit+0.022
rett+0.022
 unw+0.021
bur+0.021
Neg logit contributions
 gloomy-0.015
 happiest-0.015
 End-0.015
ience-0.015
lessness-0.014
Tokenometers
Token ID40077
Feature activation-1.16e-01

Pos logit contributions
uzz+0.101
rett+0.099
uit+0.096
bur+0.094
zip+0.090
Neg logit contributions
 gloomy-0.060
 happiest-0.058
 End-0.056
ience-0.052
 goals-0.051
Token He
Token ID679
Feature activation+9.63e-03

Pos logit contributions
 gloomy+0.064
 happiest+0.062
 End+0.060
ience+0.060
 sizes+0.059
Neg logit contributions
uzz-0.055
rett-0.055
xc-0.052
Wh-0.051
bur-0.051
Token felt
Token ID2936
Feature activation-2.90e-02

Pos logit contributions
rett+0.019
uit+0.019
uzz+0.019
xc+0.018
igi+0.018
Neg logit contributions
ience-0.011
 gloomy-0.010
 happiest-0.010
 End-0.010
 needed-0.010
Token so
Token ID523
Feature activation-4.93e-02

Pos logit contributions
 sizes+0.033
ience+0.030
 lifetime+0.030
�+0.029
 These+0.028
Neg logit contributions
zzy-0.051
uzz-0.050
 unw-0.050
Wh-0.050
 snatch-0.049
Token sad
Token ID6507
Feature activation-1.62e-01

Pos logit contributions
 sizes+0.063
 jets+0.052
 happiest+0.052
 tallest+0.050
 lifetime+0.050
Neg logit contributions
uzz-0.084
zzy-0.084
 unw-0.082
Flo-0.081
ck-0.081
Token and
Token ID290
Feature activation+4.74e-02

Pos logit contributions
 happiest+0.253
 End+0.249
ience+0.244
 gloomy+0.242
 educational+0.238
Neg logit contributions
uit-0.227
rett-0.226
uzz-0.225
 Sara-0.206
bur-0.204
Token he
Token ID339
Feature activation-1.47e-01

Pos logit contributions
 snatch+0.071
 sob+0.071
xc+0.071
 unw+0.070
bur+0.070
Neg logit contributions
 sizes-0.063
 These-0.062
 goals-0.058
 happiest-0.058
 lifetime-0.058
Token stopped
Token ID5025
Feature activation+1.06e-01

Pos logit contributions
 gloomy+0.174
 happiest+0.167
ience+0.163
 End+0.161
 goals+0.153
Neg logit contributions
rett-0.273
uzz-0.270
uit-0.268
xc-0.266
zzy-0.262
Token crying
Token ID13774
Feature activation+3.38e-03

Pos logit contributions
rett+0.186
zzy+0.185
Wh+0.182
uit+0.182
xc+0.182
Neg logit contributions
 gloomy-0.140
 happiest-0.130
 End-0.128
 goals-0.123
ience-0.120
Token.
Token ID13
Feature activation-3.98e-02

Pos logit contributions
uit+0.005
uzz+0.004
rett+0.004
zzy+0.004
 Sara+0.004
Neg logit contributions
 happiest-0.005
 End-0.005
ience-0.005
 sizes-0.005
leness-0.005
Token He
Token ID679
Feature activation+9.40e-03

Pos logit contributions
 gloomy+0.067
 happiest+0.066
ience+0.065
 sizes+0.065
 End+0.063
Neg logit contributions
uzz-0.052
rett-0.051
xc-0.048
bur-0.048
Wh-0.048
Token knew
Token ID2993
Feature activation-1.35e-01

Pos logit contributions
rett+0.019
uzz+0.019
uit+0.018
xc+0.018
zzy+0.018
Neg logit contributions
 gloomy-0.010
ience-0.010
 happiest-0.010
 End-0.009
 valuable-0.009
Token boy
Token ID2933
Feature activation+8.34e-02

Pos logit contributions
uzz+0.081
xc+0.080
uit+0.077
Jen+0.077
 Sara+0.077
Neg logit contributions
leness-0.098
ience-0.097
 End-0.096
 goals-0.095
 happiest-0.092
Token saw
Token ID2497
Feature activation+9.69e-03

Pos logit contributions
rett+0.112
uzz+0.109
xc+0.109
uit+0.108
zzy+0.107
Neg logit contributions
 gloomy-0.142
 happiest-0.136
 End-0.134
ience-0.132
 sizes-0.129
Token the
Token ID262
Feature activation-6.82e-02

Pos logit contributions
rett+0.015
uzz+0.015
zzy+0.015
xc+0.015
uit+0.014
Neg logit contributions
 gloomy-0.015
 happiest-0.014
 End-0.014
ience-0.013
 goals-0.013
Token tube
Token ID12403
Feature activation-1.84e-02

Pos logit contributions
ience+0.096
leness+0.088
 happiest+0.086
 End+0.085
 goals+0.084
Neg logit contributions
uit-0.109
rett-0.108
Wh-0.107
xc-0.105
Jen-0.104
Token and
Token ID290
Feature activation+5.68e-02

Pos logit contributions
 happiest+0.031
 End+0.031
ience+0.031
leness+0.030
 fortunate+0.030
Neg logit contributions
 unw-0.024
uit-0.023
uzz-0.022
bur-0.021
rett-0.021
Token he
Token ID339
Feature activation-1.38e-01

Pos logit contributions
 unw+0.083
zip+0.077
uzz+0.076
Wh+0.076
rett+0.075
Neg logit contributions
 happiest-0.087
 End-0.086
ience-0.084
 gloomy-0.082
 goals-0.081
Token wanted
Token ID2227
Feature activation-5.26e-02

Pos logit contributions
 gloomy+0.144
 happiest+0.136
 End+0.132
ience+0.128
 goals+0.124
Neg logit contributions
rett-0.276
uzz-0.273
xc-0.271
uit-0.270
zzy-0.269
Token to
Token ID284
Feature activation+1.19e-01

Pos logit contributions
 gloomy+0.083
 happiest+0.082
 End+0.081
ience+0.080
 sizes+0.078
Neg logit contributions
rett-0.074
uzz-0.074
uit-0.072
xc-0.072
bur-0.072
Token try
Token ID1949
Feature activation+4.91e-02

Pos logit contributions
xc+0.248
igi+0.247
rett+0.246
uzz+0.243
bur+0.237
Neg logit contributions
 gloomy-0.122
 happiest-0.107
 educational-0.104
 nation-0.103
 day-0.103
Token it
Token ID340
Feature activation-7.30e-02

Pos logit contributions
xc+0.086
uzz+0.086
rett+0.086
zzy+0.084
Wh+0.084
Neg logit contributions
 gloomy-0.062
 happiest-0.059
 End-0.057
ience-0.056
 goals-0.056
Token.
Token ID13
Feature activation-3.01e-02

Pos logit contributions
 gloomy+0.070
 happiest+0.060
 goals+0.058
 valuable+0.057
 problem+0.055
Neg logit contributions
uzz-0.155
rett-0.154
xc-0.154
zzy-0.151
Wh-0.151
Token and
Token ID290
Feature activation+5.50e-02

Pos logit contributions
bur+0.141
zzy+0.137
rett+0.131
uzz+0.129
 unw+0.129
Neg logit contributions
 gloomy-0.192
 educational-0.185
 End-0.181
 happiest-0.179
 goals-0.177
Token its
Token ID663
Feature activation-3.50e-02

Pos logit contributions
rett+0.089
uzz+0.088
xc+0.087
zzy+0.087
uit+0.087
Neg logit contributions
 gloomy-0.079
 happiest-0.076
 End-0.074
ience-0.072
 goals-0.071
Token friends
Token ID2460
Feature activation-2.61e-02

Pos logit contributions
 gloomy+0.041
ience+0.041
 happiest+0.040
 End+0.040
 goals+0.037
Neg logit contributions
uzz-0.063
rett-0.061
uit-0.061
bur-0.060
Wh-0.060
Token would
Token ID561
Feature activation-5.33e-02

Pos logit contributions
 educational+0.041
 gloomy+0.039
 End+0.039
 insignificant+0.038
 goals+0.038
Neg logit contributions
uzz-0.037
 unw-0.037
bur-0.037
zzy-0.037
zip-0.035
Token come
Token ID1282
Feature activation+1.32e-01

Pos logit contributions
 gloomy+0.076
 happiest+0.073
 End+0.072
ience+0.070
 goals+0.069
Neg logit contributions
rett-0.085
uzz-0.084
xc-0.083
zzy-0.082
uit-0.082
Token every
Token ID790
Feature activation-1.35e-01

Pos logit contributions
uzz+0.188
rett+0.180
bur+0.179
uit+0.179
xc+0.178
Neg logit contributions
ience-0.208
 End-0.207
 gloomy-0.205
 happiest-0.203
leness-0.200
Token day
Token ID1110
Feature activation-2.16e-01

Pos logit contributions
 sizes+0.087
 happiest+0.085
ience+0.085
 goals+0.083
leness+0.081
Neg logit contributions
zzy-0.303
xc-0.302
bur-0.300
rett-0.299
 unw-0.299
Token to
Token ID284
Feature activation+1.20e-01

Pos logit contributions
 End+0.467
ience+0.431
leness+0.423
 happiest+0.404
 goals+0.393
Neg logit contributions
 unw-0.236
uzz-0.191
to-0.184
ill-0.182
bur-0.180
Token drink
Token ID4144
Feature activation+2.58e-02

Pos logit contributions
rett+0.183
uzz+0.181
xc+0.179
zzy+0.177
uit+0.176
Neg logit contributions
 gloomy-0.180
 happiest-0.174
 End-0.169
ience-0.165
 goals-0.162
Token water
Token ID1660
Feature activation-4.81e-02

Pos logit contributions
 unw+0.037
uzz+0.037
uit+0.037
zzy+0.036
bur+0.036
Neg logit contributions
 End-0.039
ience-0.038
 happiest-0.038
 gloomy-0.037
leness-0.036
Token from
Token ID422
Feature activation+1.42e-01

Pos logit contributions
 End+0.073
 happiest+0.071
 gloomy+0.070
ience+0.070
 goals+0.066
Neg logit contributions
uit-0.071
uzz-0.071
bur-0.070
 unw-0.069
rett-0.069