TOP ACTIVATIONS
MAX = 0.112

before . Suddenly , a loud , harsh noise startled Mike
. But suddenly , a booming voice said , " Stop
a kingdom ruled by a wicked wizard . Suddenly , the
away . They left the y ummy refrigerator and all the
Soon , the y ummy dinner was ready .
that day on , the y ummy place was protected and
very sad to leave the y ummy pie , but he
Suddenly , a loud noise came from the sky
Suddenly , a loud cheer sounded from the room
noise coming from behind a nearby bush . He was really
sight . Suddenly , a loud voice came out of nowhere
as they entered , a loud noise made them jump !
figure appeared . In a booming voice , the figure said
to be seen . The wicked witch had enchanted her ,
long . Soon , the wicked man became greedy again and
blow . Suddenly , a loud roar filled the air .
Suddenly , a loud laugh echoed around the chamber
Suddenly , a big loud voice came from behind her
Suddenly , a loud noise made the ants scatter
happened . Suddenly , a loud tune came out of nowhere

MIN ACTIVATIONS
MIN = -0.102

. They wear helmets and pads to be safe .
sk ates and helmets and pads . They use a puck
. They also have ink pads of different colors . They
their gear : helmets , pads and gloves . They knew
in the book . <|endoftext|> Once upon a time , there
sharing is good . <|endoftext|> Once , there was a girl
best of friends . <|endoftext|> Once upon a time , there
laughed all day . <|endoftext|> Once upon a time , there
teasing is bad . <|endoftext|> Once upon a time , there
red je ep . <|endoftext|> Once upon a time , there
the book together . <|endoftext|> Once upon a time , there
was raining outside . <|endoftext|> Once upon a time , there
wet and laughing . <|endoftext|> Once upon a time , there
became good friends . <|endoftext|> Once upon a time , there
ran and laughed . <|endoftext|> Once upon a time , there
its lemon ade . <|endoftext|> Once there was a lady .
old and rotten . <|endoftext|> Once upon a time , there
became best friends . <|endoftext|> Once upon a time , a
time playing together . <|endoftext|> Once upon a time , there
on the sack . <|endoftext|> Once upon a time , in

INTERVAL 0.058 - 0.112
CONTAINS 0.874%

no longer scared of the dark because she knew that the
bite . He found a juicy apple on the ground and
with the bird . Soon a lot of animals joined in
. Jack put on a big a pron and pretended he
The boy was old enough to talk , but he was

INTERVAL 0.005 - 0.058
CONTAINS 41.346%

that day - Having someone by your side can help you
it towards him . Suddenly , he heard a voice coming
in her bath tub . She liked to play with her
. She sped around the porch , jumping high and running
hugged each other . Then they went inside their house ,

INTERVAL -0.049 - 0.005
CONTAINS 56.110%

time . One day , Tim my 's friend Billy came
was happy to see that his lesson had helped so many
Mia 's mom told her that they were going to prepare
him to come and play with her . Fl
had a big truck and Mia had a doll . They

INTERVAL -0.102 - -0.049
CONTAINS 1.670%

also good friends . <|endoftext|> L ily likes to care for
to dig a hole and put the bulb inside .
<|endoftext|> Once upon a time , there was a little girl
. <|endoftext|> A little girl named Emma played with her warm
. They liked to gather shells and stones and make patterns
Token before
Token ID878
Feature activation+2.66e-02

Pos logit contributions
my+0.055
ophone+0.039
umper+0.039
ingu+0.038
tie+0.034
Neg logit contributions
 staircase-0.082
 site-0.078
uders-0.078
 spe-0.077
 tad-0.077
Token.
Token ID13
Feature activation-8.90e-03

Pos logit contributions
 staircase+0.061
uders+0.058
 spe+0.058
 tad+0.058
 click+0.057
Neg logit contributions
my-0.034
 best-0.023
 especially-0.022
 everywhere-0.021
ophone-0.021
Token Suddenly
Token ID24975
Feature activation+2.29e-02

Pos logit contributions
my+0.014
ophone+0.010
umper+0.010
 best+0.010
 especially+0.010
Neg logit contributions
 staircase-0.018
 spe-0.017
 site-0.017
uders-0.017
 tad-0.017
Token,
Token ID11
Feature activation+1.76e-02

Pos logit contributions
 staircase+0.052
uders+0.051
 spe+0.050
 tad+0.049
 swarm+0.049
Neg logit contributions
my-0.029
 best-0.020
ophone-0.018
 especially-0.018
 would-0.017
Token a
Token ID257
Feature activation+7.48e-02

Pos logit contributions
 staircase+0.044
uders+0.043
 spe+0.043
 site+0.042
 swarm+0.042
Neg logit contributions
my-0.018
ophone-0.011
 best-0.010
 especially-0.010
umper-0.010
Token loud
Token ID7812
Feature activation+1.12e-01

Pos logit contributions
uders+0.171
 staircase+0.153
eway+0.151
 Jess+0.151
 collapse+0.150
Neg logit contributions
my-0.105
 best-0.078
 would-0.071
 tired-0.070
 happiest-0.070
Token,
Token ID11
Feature activation+2.75e-02

Pos logit contributions
uders+0.194
umm+0.193
 staircase+0.186
yt+0.186
ipt+0.183
Neg logit contributions
my-0.204
 around-0.155
 all-0.149
 on-0.149
 always-0.146
Token harsh
Token ID11859
Feature activation+9.63e-02

Pos logit contributions
uders+0.056
 staircase+0.053
 site+0.052
umm+0.051
izing+0.051
Neg logit contributions
my-0.044
 best-0.033
 on-0.030
 tired-0.030
 especially-0.029
Token noise
Token ID7838
Feature activation+5.47e-02

Pos logit contributions
uders+0.163
 staircase+0.160
izing+0.158
rily+0.158
quet+0.157
Neg logit contributions
my-0.180
 back-0.131
 around-0.130
 all-0.126
 like-0.125
Token startled
Token ID36249
Feature activation+6.01e-02

Pos logit contributions
 staircase+0.122
uders+0.115
 tad+0.114
 spe+0.114
umm+0.113
Neg logit contributions
my-0.072
 around-0.053
 on-0.052
 everywhere-0.051
 would-0.050
Token Mike
Token ID4995
Feature activation-7.64e-03

Pos logit contributions
 staircase+0.160
 spe+0.153
 swarm+0.151
uders+0.151
 site+0.149
Neg logit contributions
my-0.063
umper-0.032
 especially-0.030
 best-0.030
 forward-0.028
Token.
Token ID13
Feature activation+7.15e-03

Pos logit contributions
 staircase+0.077
 site+0.073
 spe+0.073
uders+0.072
 tad+0.072
Neg logit contributions
my-0.041
ophone-0.027
umper-0.026
 best-0.024
 especially-0.024
Token But
Token ID887
Feature activation+1.34e-02

Pos logit contributions
 staircase+0.015
uders+0.015
yt+0.015
 swarm+0.015
 site+0.015
Neg logit contributions
my-0.010
 favorite-0.007
ophone-0.007
ingu-0.007
 especially-0.007
Token suddenly
Token ID6451
Feature activation+2.74e-02

Pos logit contributions
 staircase+0.028
 spe+0.026
 site+0.026
 Jess+0.026
Run+0.026
Neg logit contributions
my-0.020
umper-0.016
ophone-0.016
ingu-0.014
tie-0.013
Token,
Token ID11
Feature activation+9.30e-03

Pos logit contributions
 staircase+0.057
 site+0.054
 spe+0.054
uders+0.054
 tad+0.053
Neg logit contributions
my-0.042
ophone-0.030
umper-0.029
 best-0.028
 especially-0.028
Token a
Token ID257
Feature activation+6.97e-02

Pos logit contributions
 staircase+0.023
uders+0.022
 site+0.022
 spe+0.022
 tad+0.022
Neg logit contributions
my-0.010
ophone-0.006
umper-0.006
 best-0.006
 especially-0.006
Token booming
Token ID32017
Feature activation+1.11e-01

Pos logit contributions
uders+0.160
 site+0.143
 Jess+0.143
inct+0.142
 staircase+0.141
Neg logit contributions
my-0.098
 best-0.074
 tired-0.066
 happiest-0.066
 would-0.065
Token voice
Token ID3809
Feature activation+5.20e-02

Pos logit contributions
uders+0.166
 Sorry+0.149
 staircase+0.148
yt+0.148
umm+0.147
Neg logit contributions
my-0.245
 around-0.195
 back-0.190
 like-0.187
 always-0.182
Token said
Token ID531
Feature activation+2.22e-02

Pos logit contributions
 staircase+0.118
umm+0.118
uders+0.114
 site+0.108
 tad+0.107
Neg logit contributions
my-0.072
 around-0.050
 on-0.050
 would-0.047
ophone-0.047
Token,
Token ID11
Feature activation-6.20e-03

Pos logit contributions
 staircase+0.049
uders+0.046
 site+0.046
 spe+0.046
 tad+0.045
Neg logit contributions
my-0.031
ophone-0.022
 best-0.021
umper-0.021
 especially-0.020
Token "
Token ID366
Feature activation-3.46e-03

Pos logit contributions
my+0.010
ophone+0.008
umper+0.007
 best+0.007
 especially+0.007
Neg logit contributions
 staircase-0.012
uders-0.011
 site-0.011
 spe-0.011
 tad-0.011
TokenStop
Token ID19485
Feature activation+1.12e-02

Pos logit contributions
my+0.006
umper+0.005
ophone+0.005
 especially+0.005
 best+0.004
Neg logit contributions
 staircase-0.006
 site-0.006
uders-0.006
 spe-0.006
 tad-0.006
Token a
Token ID257
Feature activation+4.08e-02

Pos logit contributions
 staircase+0.012
uders+0.011
 site+0.011
 tad+0.011
 spe+0.011
Neg logit contributions
my-0.009
 best-0.006
 especially-0.006
umper-0.006
ophone-0.006
Token kingdom
Token ID13239
Feature activation+1.15e-02

Pos logit contributions
uders+0.074
 staircase+0.071
Run+0.068
 Jess+0.068
inct+0.067
Neg logit contributions
my-0.073
 best-0.056
 especially-0.054
 happiest-0.054
 forward-0.053
Token ruled
Token ID8879
Feature activation-1.33e-02

Pos logit contributions
 staircase+0.023
 site+0.022
 spe+0.022
uders+0.022
 tad+0.021
Neg logit contributions
my-0.018
ophone-0.013
umper-0.013
ingu-0.013
 best-0.012
Token by
Token ID416
Feature activation+1.51e-02

Pos logit contributions
my+0.017
ophone+0.012
umper+0.011
 best+0.011
 especially+0.011
Neg logit contributions
 staircase-0.030
 site-0.029
uders-0.029
 spe-0.029
 tad-0.028
Token a
Token ID257
Feature activation+3.39e-02

Pos logit contributions
 staircase+0.035
 spe+0.034
uders+0.033
Run+0.033
 click+0.033
Neg logit contributions
my-0.019
 best-0.012
 especially-0.012
ophone-0.011
umper-0.011
Token wicked
Token ID20589
Feature activation+1.11e-01

Pos logit contributions
uders+0.059
 staircase+0.058
Run+0.056
 extingu+0.055
 Jess+0.055
Neg logit contributions
my-0.061
 best-0.047
 especially-0.046
 happiest-0.046
 forward-0.045
Token wizard
Token ID18731
Feature activation+2.30e-02

Pos logit contributions
 staircase+0.175
Run+0.175
inct+0.169
uders+0.165
lev+0.164
Neg logit contributions
my-0.205
 best-0.165
 happiest-0.163
 especially-0.160
 back-0.154
Token.
Token ID13
Feature activation+6.80e-03

Pos logit contributions
 staircase+0.050
 spe+0.047
 site+0.047
uders+0.047
 tad+0.046
Neg logit contributions
my-0.033
ophone-0.023
umper-0.023
 best-0.021
 especially-0.021
Token Suddenly
Token ID24975
Feature activation+2.31e-02

Pos logit contributions
 staircase+0.015
 spe+0.015
uders+0.015
 site+0.014
 tad+0.014
Neg logit contributions
my-0.009
umper-0.006
ophone-0.006
ingu-0.006
 especially-0.006
Token,
Token ID11
Feature activation+1.91e-02

Pos logit contributions
 staircase+0.053
uders+0.051
 spe+0.050
 site+0.050
 tad+0.050
Neg logit contributions
my-0.030
ophone-0.019
 best-0.019
umper-0.019
 especially-0.018
Token the
Token ID262
Feature activation+1.58e-02

Pos logit contributions
 staircase+0.049
uders+0.048
 spe+0.047
 site+0.047
 tad+0.046
Neg logit contributions
my-0.019
 best-0.010
ophone-0.010
 especially-0.010
umper-0.010
Token away
Token ID1497
Feature activation+4.58e-02

Pos logit contributions
 staircase+0.046
 site+0.044
 spe+0.043
 tad+0.043
uders+0.043
Neg logit contributions
my-0.022
ophone-0.014
umper-0.014
 especially-0.013
ingu-0.013
Token.
Token ID13
Feature activation+1.53e-02

Pos logit contributions
uders+0.121
umm+0.119
 staircase+0.117
Uh+0.117
opter+0.116
Neg logit contributions
my-0.042
 on-0.032
 around-0.032
 everywhere-0.030
 all-0.030
Token They
Token ID1119
Feature activation+5.92e-03

Pos logit contributions
 staircase+0.030
 site+0.028
 spe+0.028
opter+0.027
Run+0.027
Neg logit contributions
my-0.026
 wherever-0.019
umper-0.018
ophone-0.018
 everywhere-0.018
Token left
Token ID1364
Feature activation-6.90e-03

Pos logit contributions
 staircase+0.013
 spe+0.013
 Jess+0.012
 tad+0.012
 site+0.012
Neg logit contributions
my-0.008
ophone-0.006
umper-0.006
 best-0.005
ingu-0.005
Token the
Token ID262
Feature activation+1.53e-02

Pos logit contributions
my+0.008
 best+0.005
 especially+0.005
 everywhere+0.005
ophone+0.005
Neg logit contributions
 staircase-0.017
uders-0.016
Run-0.016
 site-0.016
inct-0.015
Token y
Token ID331
Feature activation+1.11e-01

Pos logit contributions
 staircase+0.019
 site+0.018
 spe+0.016
uders+0.016
 tad+0.016
Neg logit contributions
my-0.035
ophone-0.031
umper-0.030
 pads-0.029
tie-0.029
Tokenummy
Token ID13513
Feature activation+5.55e-02

Pos logit contributions
 staircase+0.209
 spe+0.207
Run+0.206
 slit+0.202
uders+0.199
Neg logit contributions
my-0.208
ingu-0.138
umper-0.137
 best-0.134
 back-0.133
Token refrigerator
Token ID30500
Feature activation+6.14e-02

Pos logit contributions
 staircase+0.103
uders+0.098
 spe+0.097
 site+0.096
 tad+0.095
Neg logit contributions
my-0.099
 best-0.071
ophone-0.070
umper-0.069
 especially-0.069
Token and
Token ID290
Feature activation-1.83e-02

Pos logit contributions
 staircase+0.141
uders+0.137
 spe+0.135
 tad+0.135
 site+0.134
Neg logit contributions
my-0.081
ophone-0.052
 best-0.049
 especially-0.048
umper-0.048
Token all
Token ID477
Feature activation-3.88e-02

Pos logit contributions
my+0.025
 especially+0.016
 best+0.016
 would+0.015
ophone+0.015
Neg logit contributions
 staircase-0.042
uders-0.040
 site-0.040
inct-0.039
 spe-0.039
Token the
Token ID262
Feature activation-9.75e-03

Pos logit contributions
my+0.056
ophone+0.037
 especially+0.035
 best+0.035
umper+0.034
Neg logit contributions
 staircase-0.087
uders-0.084
 site-0.082
Run-0.081
 spe-0.081
Token
Token ID198
Feature activation-4.56e-03

Pos logit contributions
my+0.064
ophone+0.047
umper+0.046
 happiest+0.044
tie+0.044
Neg logit contributions
 staircase-0.097
 site-0.094
 Jess-0.093
uders-0.093
 tad-0.090
Token
Token ID198
Feature activation+1.49e-02

Pos logit contributions
my+0.006
ophone+0.004
umper+0.004
 best+0.004
 happiest+0.004
Neg logit contributions
 staircase-0.010
 site-0.010
 Jess-0.010
 spe-0.010
uders-0.010
TokenSoon
Token ID28093
Feature activation+3.26e-02

Pos logit contributions
 staircase+0.029
 spe+0.029
uders+0.029
 site+0.028
 tad+0.028
Neg logit contributions
my-0.025
 especially-0.018
umper-0.018
ophone-0.018
Whenever-0.017
Token,
Token ID11
Feature activation+2.59e-02

Pos logit contributions
 staircase+0.075
uders+0.073
 site+0.072
 tad+0.071
Run+0.071
Neg logit contributions
my-0.042
 best-0.029
 especially-0.026
ingu-0.025
 on-0.025
Token the
Token ID262
Feature activation+4.30e-03

Pos logit contributions
 staircase+0.062
uders+0.061
 site+0.059
 spe+0.059
 tad+0.059
Neg logit contributions
my-0.032
 best-0.019
 especially-0.019
ophone-0.018
umper-0.017
Token y
Token ID331
Feature activation+1.10e-01

Pos logit contributions
 staircase+0.007
uders+0.006
 site+0.006
 tad+0.006
 spe+0.006
Neg logit contributions
my-0.009
 best-0.007
ophone-0.007
 especially-0.007
 happiest-0.007
Tokenummy
Token ID13513
Feature activation+5.36e-02

Pos logit contributions
 staircase+0.239
eway+0.237
Run+0.234
AAAA+0.227
 slit+0.226
Neg logit contributions
my-0.162
 back-0.107
 best-0.107
ingu-0.106
ip-0.098
Token dinner
Token ID8073
Feature activation+5.46e-02

Pos logit contributions
 staircase+0.097
uders+0.097
 site+0.092
 tad+0.092
Run+0.091
Neg logit contributions
my-0.100
 best-0.078
 especially-0.070
 favorite-0.069
 happiest-0.068
Token was
Token ID373
Feature activation+2.31e-02

Pos logit contributions
uders+0.136
Run+0.134
 staircase+0.133
 tad+0.131
 Jess+0.131
Neg logit contributions
my-0.057
 would-0.045
 had-0.037
 on-0.037
 best-0.036
Token ready
Token ID3492
Feature activation+2.46e-02

Pos logit contributions
 staircase+0.050
 spe+0.048
 site+0.048
uders+0.046
 collapse+0.046
Neg logit contributions
my-0.032
ophone-0.024
umper-0.024
ingu-0.021
 pads-0.021
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
 staircase+0.063
uders+0.062
 tad+0.061
Run+0.060
 swarm+0.060
Neg logit contributions
my-0.028
 would-0.017
 especially-0.016
 on-0.016
 best-0.016
Token that
Token ID326
Feature activation-9.85e-03

Pos logit contributions
my+0.019
umper+0.014
ophone+0.014
ingu+0.014
 pads+0.013
Neg logit contributions
 staircase-0.024
Run-0.024
 site-0.024
 Jess-0.023
izing-0.023
Token day
Token ID1110
Feature activation-1.40e-03

Pos logit contributions
my+0.012
umper+0.011
ophone+0.010
ingu+0.010
tie+0.009
Neg logit contributions
 staircase-0.022
 Jess-0.021
 spe-0.021
 site-0.021
 flame-0.020
Token on
Token ID319
Feature activation+1.96e-02

Pos logit contributions
my+0.002
ophone+0.001
umper+0.001
ingu+0.001
 especially+0.001
Neg logit contributions
 staircase-0.004
 click-0.003
Run-0.003
 Jess-0.003
 spe-0.003
Token,
Token ID11
Feature activation+1.28e-02

Pos logit contributions
 staircase+0.043
uders+0.041
 spe+0.040
 tad+0.040
 site+0.040
Neg logit contributions
my-0.028
 best-0.019
ophone-0.019
umper-0.018
 especially-0.018
Token the
Token ID262
Feature activation+6.79e-03

Pos logit contributions
 staircase+0.029
 spe+0.027
 site+0.027
uders+0.027
 tad+0.027
Neg logit contributions
my-0.017
ophone-0.012
umper-0.011
 best-0.011
 especially-0.011
Token y
Token ID331
Feature activation+1.10e-01

Pos logit contributions
 staircase+0.013
 site+0.012
 tad+0.012
 swarm+0.012
 spe+0.012
Neg logit contributions
my-0.012
ophone-0.010
umper-0.010
 pads-0.009
ingu-0.009
Tokenummy
Token ID13513
Feature activation+4.98e-02

Pos logit contributions
Run+0.219
 staircase+0.211
 spe+0.211
AAAA+0.210
 click+0.203
Neg logit contributions
my-0.178
 best-0.128
ingu-0.124
umper-0.119
 back-0.118
Token place
Token ID1295
Feature activation+1.17e-02

Pos logit contributions
 staircase+0.082
 spe+0.078
uders+0.077
 site+0.076
 tad+0.074
Neg logit contributions
my-0.098
ophone-0.075
 best-0.074
umper-0.074
 especially-0.072
Token was
Token ID373
Feature activation+2.68e-02

Pos logit contributions
 staircase+0.025
 site+0.023
 spe+0.023
 Jess+0.023
uders+0.023
Neg logit contributions
my-0.017
umper-0.013
ophone-0.012
ingu-0.012
 pads-0.011
Token protected
Token ID6861
Feature activation+1.34e-02

Pos logit contributions
 spe+0.055
 staircase+0.052
 site+0.052
 collapse+0.052
 Jess+0.052
Neg logit contributions
my-0.036
umper-0.032
ophone-0.031
ingu-0.030
ip-0.027
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
 staircase+0.030
uders+0.028
 spe+0.028
 site+0.028
 tad+0.028
Neg logit contributions
my-0.019
ophone-0.012
umper-0.012
 best-0.012
 especially-0.012
Token very
Token ID845
Feature activation+2.66e-02

Pos logit contributions
 staircase+0.002
 spe+0.002
 Jess+0.002
 swarm+0.002
 tad+0.002
Neg logit contributions
my-0.001
umper-0.001
ophone-0.001
ingu-0.001
ip-0.001
Token sad
Token ID6507
Feature activation+1.02e-02

Pos logit contributions
 staircase+0.068
uders+0.065
 site+0.065
 spe+0.064
Run+0.063
Neg logit contributions
my-0.029
 best-0.017
 especially-0.016
ophone-0.016
 happiest-0.015
Token to
Token ID284
Feature activation+4.84e-02

Pos logit contributions
 staircase+0.023
 spe+0.022
 site+0.022
uders+0.022
 tad+0.022
Neg logit contributions
my-0.013
ophone-0.009
umper-0.009
 best-0.008
 especially-0.008
Token leave
Token ID2666
Feature activation-1.79e-02

Pos logit contributions
 staircase+0.087
uders+0.082
 site+0.081
 spe+0.081
 Jess+0.080
Neg logit contributions
my-0.086
ophone-0.068
umper-0.065
ingu-0.063
 best-0.062
Token the
Token ID262
Feature activation+4.24e-03

Pos logit contributions
my+0.028
ophone+0.024
umper+0.022
ingu+0.021
 pads+0.020
Neg logit contributions
 staircase-0.035
 site-0.035
 spe-0.033
 Jess-0.033
 click-0.033
Token y
Token ID331
Feature activation+1.10e-01

Pos logit contributions
 staircase+0.007
 site+0.006
 spe+0.006
uders+0.006
 tad+0.006
Neg logit contributions
my-0.009
ophone-0.007
umper-0.007
ingu-0.007
 especially-0.007
Tokenummy
Token ID13513
Feature activation+4.96e-02

Pos logit contributions
 staircase+0.207
Run+0.203
uders+0.192
AAAA+0.192
fire+0.188
Neg logit contributions
my-0.209
 back-0.139
 best-0.137
umper-0.132
ingu-0.128
Token pie
Token ID2508
Feature activation+3.39e-02

Pos logit contributions
 staircase+0.080
uders+0.075
 spe+0.074
 site+0.073
Run+0.072
Neg logit contributions
my-0.102
 best-0.077
ophone-0.074
 especially-0.074
umper-0.074
Token,
Token ID11
Feature activation-6.32e-03

Pos logit contributions
 staircase+0.073
 site+0.069
 spe+0.068
uders+0.067
fire+0.066
Neg logit contributions
my-0.048
ophone-0.036
umper-0.036
ingu-0.033
 especially-0.031
Token but
Token ID475
Feature activation-6.85e-04

Pos logit contributions
my+0.010
umper+0.009
ophone+0.009
ingu+0.008
ip+0.008
Neg logit contributions
 staircase-0.012
 spe-0.012
 click-0.011
 tad-0.011
 Jess-0.011
Token he
Token ID339
Feature activation-7.71e-04

Pos logit contributions
my+0.001
umper+0.001
ophone+0.001
ingu+0.001
 pads+0.001
Neg logit contributions
 staircase-0.001
 Jess-0.001
 spe-0.001
 tad-0.001
 site-0.001
Token
Token ID220
Feature activation-3.78e-02

Pos logit contributions
yt+0.006
 staircase+0.006
 spe+0.006
izing+0.006
 tad+0.005
Neg logit contributions
my-0.004
ophone-0.003
 best-0.003
 especially-0.002
 about-0.002
Token
Token ID198
Feature activation-5.04e-03

Pos logit contributions
my+0.048
ophone+0.032
umper+0.031
 best+0.029
 especially+0.029
Neg logit contributions
 staircase-0.088
 site-0.083
uders-0.083
 spe-0.083
 tad-0.082
TokenSuddenly
Token ID38582
Feature activation+2.96e-02

Pos logit contributions
my+0.007
ophone+0.005
 especially+0.004
 would+0.004
umper+0.004
Neg logit contributions
 staircase-0.011
umm-0.011
 spe-0.011
uders-0.011
 aroma-0.011
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
umm+0.078
uders+0.075
 tad+0.075
 aroma+0.073
 staircase+0.073
Neg logit contributions
my-0.029
 best-0.019
 on-0.018
 would-0.015
 all-0.015
Token a
Token ID257
Feature activation+7.79e-02

Pos logit contributions
umm+0.051
uders+0.050
yt+0.050
 spe+0.049
 staircase+0.049
Neg logit contributions
my-0.022
 best-0.015
 on-0.015
 all-0.014
 everywhere-0.013
Token loud
Token ID7812
Feature activation+1.09e-01

Pos logit contributions
uders+0.187
iously+0.167
rily+0.167
eway+0.167
 Jess+0.167
Neg logit contributions
my-0.095
 best-0.077
 tired-0.069
 very-0.067
 happiest-0.065
Token noise
Token ID7838
Feature activation+7.01e-02

Pos logit contributions
umm+0.186
uders+0.170
 staircase+0.166
edly+0.165
 former+0.164
Neg logit contributions
my-0.208
 on-0.166
 around-0.165
 all-0.162
 back-0.159
Token came
Token ID1625
Feature activation+3.68e-02

Pos logit contributions
 staircase+0.159
uders+0.154
umm+0.151
 tad+0.148
 spe+0.148
Neg logit contributions
my-0.091
 around-0.068
 on-0.068
 everywhere-0.064
 would-0.061
Token from
Token ID422
Feature activation+3.60e-02

Pos logit contributions
 staircase+0.091
umm+0.091
 site+0.090
 aroma+0.089
 spe+0.087
Neg logit contributions
my-0.050
 on-0.033
 back-0.031
 everywhere-0.030
 over-0.030
Token the
Token ID262
Feature activation+1.56e-02

Pos logit contributions
umm+0.089
 staircase+0.088
uders+0.088
 tad+0.087
yt+0.086
Neg logit contributions
my-0.039
 around-0.024
 everywhere-0.023
 on-0.023
 wherever-0.021
Token sky
Token ID6766
Feature activation+1.77e-02

Pos logit contributions
 staircase+0.024
 Jess+0.023
 spe+0.023
uders+0.023
yt+0.022
Neg logit contributions
my-0.032
 best-0.025
 happiest-0.024
 especially-0.024
 everywhere-0.024
Token
Token ID198
Feature activation-1.77e-03

Pos logit contributions
 staircase+0.014
 spe+0.014
uders+0.014
 site+0.014
fire+0.013
Neg logit contributions
my-0.011
 especially-0.007
ophone-0.007
umper-0.007
 best-0.007
Token
Token ID198
Feature activation+1.55e-02

Pos logit contributions
my+0.002
ophone+0.002
umper+0.002
 best+0.002
 happiest+0.001
Neg logit contributions
 staircase-0.004
 site-0.004
 Jess-0.004
 tad-0.004
uders-0.004
TokenSuddenly
Token ID38582
Feature activation+2.76e-02

Pos logit contributions
 spe+0.029
uders+0.029
 staircase+0.029
 site+0.027
umm+0.027
Neg logit contributions
my-0.028
 especially-0.021
umper-0.020
Whenever-0.020
ophone-0.019
Token,
Token ID11
Feature activation+1.92e-02

Pos logit contributions
uders+0.067
 staircase+0.066
 site+0.065
umm+0.064
 spe+0.064
Neg logit contributions
my-0.033
 best-0.022
 on-0.019
 especially-0.018
 around-0.018
Token a
Token ID257
Feature activation+7.89e-02

Pos logit contributions
uders+0.039
 staircase+0.038
 site+0.037
 spe+0.037
fire+0.035
Neg logit contributions
my-0.032
 best-0.023
 especially-0.022
 on-0.021
 all-0.021
Token loud
Token ID7812
Feature activation+1.09e-01

Pos logit contributions
uders+0.170
izing+0.149
inct+0.147
eway+0.146
 Jess+0.146
Neg logit contributions
my-0.120
 best-0.098
 tired-0.088
 very-0.085
 happiest-0.085
Token cheer
Token ID14042
Feature activation+3.83e-02

Pos logit contributions
umm+0.163
uders+0.160
yt+0.147
 staircase+0.145
izing+0.144
Neg logit contributions
my-0.231
 on-0.182
 around-0.182
 back-0.175
 about-0.174
Token sounded
Token ID14846
Feature activation+2.88e-02

Pos logit contributions
 staircase+0.082
 spe+0.081
 site+0.080
 tad+0.079
uders+0.079
Neg logit contributions
my-0.057
ophone-0.038
 would-0.038
 on-0.037
 forward-0.036
Token from
Token ID422
Feature activation+3.11e-02

Pos logit contributions
 staircase+0.062
 spe+0.059
 site+0.059
uders+0.059
 tad+0.058
Neg logit contributions
my-0.042
ophone-0.030
umper-0.029
 best-0.027
 especially-0.027
Token the
Token ID262
Feature activation+1.06e-02

Pos logit contributions
 staircase+0.078
uders+0.077
opter+0.077
umm+0.076
yt+0.075
Neg logit contributions
my-0.033
 around-0.020
 everywhere-0.019
 on-0.019
 all-0.017
Token room
Token ID2119
Feature activation+2.34e-02

Pos logit contributions
 staircase+0.017
 spe+0.016
 Jess+0.016
uders+0.016
 site+0.015
Neg logit contributions
my-0.021
 best-0.017
ophone-0.016
 especially-0.016
 happiest-0.016
Token noise
Token ID7838
Feature activation+5.96e-02

Pos logit contributions
uders+0.120
umm+0.103
keeping+0.092
ipt+0.091
ASH+0.090
Neg logit contributions
my-0.109
 very-0.097
 better-0.088
 would-0.088
 back-0.085
Token coming
Token ID2406
Feature activation+5.07e-02

Pos logit contributions
 staircase+0.141
uders+0.141
 spe+0.136
 tad+0.134
 site+0.133
Neg logit contributions
my-0.075
 around-0.048
 especially-0.047
 on-0.044
ophone-0.044
Token from
Token ID422
Feature activation+2.84e-02

Pos logit contributions
uders+0.125
umm+0.123
 staircase+0.121
 site+0.119
 spe+0.119
Neg logit contributions
my-0.066
 back-0.053
 over-0.049
 on-0.048
 around-0.047
Token behind
Token ID2157
Feature activation+3.39e-02

Pos logit contributions
 staircase+0.066
uders+0.064
 spe+0.063
 tad+0.063
 site+0.062
Neg logit contributions
my-0.036
ophone-0.021
umper-0.021
 especially-0.021
 around-0.020
Token a
Token ID257
Feature activation+5.47e-02

Pos logit contributions
 staircase+0.086
 spe+0.082
uders+0.081
 site+0.081
 tad+0.081
Neg logit contributions
my-0.037
ophone-0.022
umper-0.021
 best-0.020
 especially-0.020
Token nearby
Token ID6716
Feature activation+1.09e-01

Pos logit contributions
uders+0.098
inct+0.090
 staircase+0.089
umm+0.089
Run+0.089
Neg logit contributions
my-0.103
 would-0.073
 everywhere-0.072
 especially-0.072
 best-0.070
Token bush
Token ID24484
Feature activation+5.62e-02

Pos logit contributions
uders+0.187
izing+0.176
urg+0.175
inct+0.173
yt+0.173
Neg logit contributions
my-0.210
 back-0.154
 would-0.141
 around-0.141
 wrong-0.138
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
 staircase+0.131
 spe+0.130
uders+0.130
inct+0.127
 tad+0.125
Neg logit contributions
my-0.074
 everywhere-0.042
 especially-0.041
 around-0.041
 would-0.041
Token He
Token ID679
Feature activation+2.60e-02

Pos logit contributions
 spe+0.032
 staircase+0.032
uders+0.031
yt+0.031
 swarm+0.030
Neg logit contributions
my-0.020
ophone-0.012
umper-0.012
ingu-0.012
 especially-0.012
Token was
Token ID373
Feature activation+8.38e-03

Pos logit contributions
umm+0.059
 staircase+0.057
uders+0.057
 site+0.056
izing+0.054
Neg logit contributions
my-0.038
 would-0.027
 loved-0.026
 learned-0.026
 especially-0.026
Token really
Token ID1107
Feature activation+1.48e-02

Pos logit contributions
 staircase+0.018
 spe+0.017
 site+0.017
uders+0.017
Run+0.017
Neg logit contributions
my-0.012
ophone-0.008
 especially-0.008
 best-0.008
umper-0.008
Token sight
Token ID6504
Feature activation+9.65e-03

Pos logit contributions
my+0.007
 everywhere+0.004
 best+0.004
 wherever+0.004
 especially+0.004
Neg logit contributions
 staircase-0.014
uders-0.014
Run-0.014
 tad-0.014
 site-0.014
Token.
Token ID13
Feature activation+6.89e-03

Pos logit contributions
 staircase+0.022
uders+0.022
Run+0.021
 site+0.021
 tad+0.021
Neg logit contributions
my-0.013
 best-0.008
ophone-0.008
 forward-0.008
 especially-0.008
Token Suddenly
Token ID24975
Feature activation+2.59e-02

Pos logit contributions
 staircase+0.013
 site+0.013
 tad+0.013
 spe+0.013
uders+0.013
Neg logit contributions
my-0.012
ophone-0.008
umper-0.008
 best-0.008
 especially-0.008
Token,
Token ID11
Feature activation+2.69e-02

Pos logit contributions
 staircase+0.060
 tad+0.059
uders+0.059
 site+0.059
umm+0.058
Neg logit contributions
my-0.032
 best-0.022
 wherever-0.019
 everywhere-0.019
 on-0.019
Token a
Token ID257
Feature activation+7.70e-02

Pos logit contributions
uders+0.060
 staircase+0.060
 tad+0.060
 site+0.059
umm+0.058
Neg logit contributions
my-0.034
 best-0.024
 on-0.023
 everywhere-0.023
 wherever-0.022
Token loud
Token ID7812
Feature activation+1.08e-01

Pos logit contributions
uders+0.161
 Jess+0.137
urg+0.136
inct+0.136
 site+0.135
Neg logit contributions
my-0.124
 best-0.098
 would-0.090
 happiest-0.089
 very-0.089
Token voice
Token ID3809
Feature activation+6.22e-02

Pos logit contributions
uders+0.167
umm+0.164
 tad+0.153
inct+0.151
ipt+0.149
Neg logit contributions
my-0.226
 around-0.173
 on-0.172
 back-0.172
 about-0.169
Token came
Token ID1625
Feature activation+3.85e-02

Pos logit contributions
umm+0.150
uders+0.144
 staircase+0.143
 tad+0.135
 site+0.132
Neg logit contributions
my-0.083
 around-0.060
 on-0.059
 would-0.055
 about-0.054
Token out
Token ID503
Feature activation+6.65e-02

Pos logit contributions
umm+0.098
 site+0.095
 staircase+0.093
uders+0.092
 aroma+0.091
Neg logit contributions
my-0.058
 back-0.039
 on-0.038
 over-0.036
 everywhere-0.035
Token of
Token ID286
Feature activation+1.50e-02

Pos logit contributions
uders+0.187
 site+0.174
umm+0.173
 staircase+0.169
opter+0.168
Neg logit contributions
my-0.072
 on-0.058
 everywhere-0.049
 around-0.048
 over-0.046
Token nowhere
Token ID12062
Feature activation+2.85e-02

Pos logit contributions
uders+0.040
 staircase+0.039
 tad+0.039
opter+0.038
 site+0.038
Neg logit contributions
my-0.015
 everywhere-0.010
 on-0.008
 all-0.008
 wherever-0.007
Token as
Token ID355
Feature activation+3.30e-02

Pos logit contributions
 staircase+0.090
uders+0.086
 site+0.085
 spe+0.083
 swarm+0.083
Neg logit contributions
my-0.054
umper-0.036
 especially-0.036
 best-0.036
ophone-0.035
Token they
Token ID484
Feature activation+1.20e-02

Pos logit contributions
 staircase+0.082
uders+0.078
 spe+0.077
 site+0.077
 tad+0.076
Neg logit contributions
my-0.038
 best-0.023
ophone-0.022
umper-0.022
 especially-0.022
Token entered
Token ID5982
Feature activation+4.37e-02

Pos logit contributions
 staircase+0.025
uders+0.024
 site+0.024
 spe+0.023
 flame+0.023
Neg logit contributions
my-0.019
ophone-0.013
umper-0.013
 especially-0.013
 would-0.012
Token,
Token ID11
Feature activation+2.50e-02

Pos logit contributions
uders+0.107
 staircase+0.107
 spe+0.104
 swarm+0.103
Uh+0.101
Neg logit contributions
my-0.045
 best-0.028
 around-0.027
 on-0.026
 forward-0.026
Token a
Token ID257
Feature activation+7.35e-02

Pos logit contributions
 staircase+0.055
uders+0.052
 spe+0.052
 site+0.051
yt+0.050
Neg logit contributions
my-0.035
 best-0.024
ophone-0.023
 especially-0.023
umper-0.023
Token loud
Token ID7812
Feature activation+1.08e-01

Pos logit contributions
uders+0.159
 Jess+0.141
 staircase+0.139
inct+0.137
eway+0.136
Neg logit contributions
my-0.111
 best-0.083
 especially-0.080
 happiest-0.080
 tired-0.079
Token noise
Token ID7838
Feature activation+6.98e-02

Pos logit contributions
uders+0.195
umm+0.191
inct+0.178
ipt+0.176
edly+0.176
Neg logit contributions
my-0.192
 around-0.153
 on-0.145
 all-0.145
 about-0.143
Token made
Token ID925
Feature activation+1.68e-02

Pos logit contributions
 staircase+0.156
uders+0.154
umm+0.145
 tad+0.145
 spe+0.143
Neg logit contributions
my-0.089
 around-0.073
 on-0.071
 like-0.067
 everywhere-0.066
Token them
Token ID606
Feature activation-5.75e-03

Pos logit contributions
 staircase+0.040
yt+0.038
 spe+0.038
uders+0.037
 gap+0.036
Neg logit contributions
my-0.022
 all-0.015
 best-0.015
 forward-0.014
 him-0.014
Token jump
Token ID4391
Feature activation-2.23e-04

Pos logit contributions
my+0.009
 best+0.007
ophone+0.007
 forward+0.006
 especially+0.006
Neg logit contributions
 staircase-0.012
 spe-0.011
 site-0.011
uders-0.011
umm-0.011
Token!
Token ID0
Feature activation+1.65e-02

Pos logit contributions
my+0.000
 around+0.000
 everywhere+0.000
 forward+0.000
 on+0.000
Neg logit contributions
 staircase-0.001
uders-0.001
 spe-0.001
 site-0.001
 tad-0.001
Token figure
Token ID3785
Feature activation+1.12e-02

Pos logit contributions
my+0.001
 on+0.001
 around+0.001
 everywhere+0.001
ophone+0.001
Neg logit contributions
 staircase-0.001
 spe-0.001
 site-0.001
uders-0.001
 tad-0.001
Token appeared
Token ID4120
Feature activation+1.52e-02

Pos logit contributions
 staircase+0.023
 tad+0.022
 spe+0.021
uders+0.021
umm+0.021
Neg logit contributions
my-0.018
ophone-0.013
 best-0.012
 on-0.012
 around-0.012
Token.
Token ID13
Feature activation+2.29e-02

Pos logit contributions
 staircase+0.034
 site+0.033
 spe+0.033
uders+0.033
 tad+0.032
Neg logit contributions
my-0.021
ophone-0.014
 best-0.014
 forward-0.013
umper-0.013
Token In
Token ID554
Feature activation+1.14e-02

Pos logit contributions
uders+0.059
yt+0.058
 staircase+0.057
izing+0.057
 tad+0.055
Neg logit contributions
my-0.024
 about-0.014
 around-0.014
 loved-0.013
 back-0.012
Token a
Token ID257
Feature activation+6.25e-02

Pos logit contributions
 staircase+0.029
 tad+0.029
uders+0.028
yt+0.028
Run+0.028
Neg logit contributions
my-0.012
 everywhere-0.006
 best-0.006
 on-0.006
 around-0.006
Token booming
Token ID32017
Feature activation+1.08e-01

Pos logit contributions
uders+0.146
umm+0.134
urg+0.133
 extingu+0.133
iously+0.130
Neg logit contributions
my-0.088
 everywhere-0.059
 learned-0.058
 tired-0.057
 would-0.056
Token voice
Token ID3809
Feature activation+4.45e-02

Pos logit contributions
uders+0.142
iders+0.133
comb+0.132
apy+0.129
yt+0.127
Neg logit contributions
my-0.261
 around-0.213
 back-0.206
 over-0.190
 about-0.189
Token,
Token ID11
Feature activation+6.82e-04

Pos logit contributions
 staircase+0.114
umm+0.113
uders+0.111
 spe+0.105
 gap+0.105
Neg logit contributions
my-0.052
 around-0.033
 on-0.032
 about-0.029
 would-0.027
Token the
Token ID262
Feature activation+3.72e-03

Pos logit contributions
 staircase+0.002
uders+0.002
 site+0.002
umm+0.002
 spe+0.002
Neg logit contributions
my-0.001
 best-0.000
 on-0.000
 especially-0.000
ophone-0.000
Token figure
Token ID3785
Feature activation+2.38e-02

Pos logit contributions
 staircase+0.006
uders+0.006
 spe+0.006
 site+0.006
 tad+0.006
Neg logit contributions
my-0.007
 best-0.005
 happiest-0.005
 everywhere-0.005
umper-0.005
Token said
Token ID531
Feature activation+1.67e-02

Pos logit contributions
 staircase+0.053
umm+0.051
uders+0.050
 tad+0.050
 spe+0.049
Neg logit contributions
my-0.034
ophone-0.023
 would-0.022
 especially-0.021
 best-0.021
Token to
Token ID284
Feature activation+4.66e-02

Pos logit contributions
 staircase+0.041
 spe+0.040
 site+0.039
uders+0.039
 tad+0.039
Neg logit contributions
my-0.021
ophone-0.014
umper-0.014
ingu-0.013
 best-0.012
Token be
Token ID307
Feature activation-1.93e-02

Pos logit contributions
 staircase+0.092
 spe+0.086
Run+0.085
uders+0.084
 tad+0.084
Neg logit contributions
my-0.080
 especially-0.058
 best-0.057
umper-0.056
 happiest-0.055
Token seen
Token ID1775
Feature activation-2.18e-02

Pos logit contributions
my+0.027
ophone+0.019
umper+0.019
ingu+0.018
 best+0.017
Neg logit contributions
 staircase-0.042
 spe-0.040
 site-0.040
uders-0.040
 tad-0.039
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
my+0.037
ophone+0.029
umper+0.028
ingu+0.027
 best+0.026
Neg logit contributions
 staircase-0.041
 site-0.038
 spe-0.038
uders-0.038
 tad-0.037
Token The
Token ID383
Feature activation+1.24e-03

Pos logit contributions
 staircase+0.027
 spe+0.026
uders+0.026
 site+0.026
 tad+0.025
Neg logit contributions
my-0.027
ophone-0.020
umper-0.020
 especially-0.019
 best-0.019
Token wicked
Token ID20589
Feature activation+1.08e-01

Pos logit contributions
 staircase+0.002
 spe+0.002
 site+0.002
uders+0.002
 tad+0.002
Neg logit contributions
my-0.003
ophone-0.002
umper-0.002
 best-0.002
 especially-0.002
Token witch
Token ID16365
Feature activation+4.12e-02

Pos logit contributions
 staircase+0.163
Run+0.154
uders+0.154
 spe+0.152
inct+0.151
Neg logit contributions
my-0.224
 best-0.172
umper-0.167
 happiest-0.166
 especially-0.166
Token had
Token ID550
Feature activation-1.49e-02

Pos logit contributions
 staircase+0.090
uders+0.086
 site+0.085
 spe+0.084
 tad+0.084
Neg logit contributions
my-0.060
ophone-0.041
 especially-0.039
 best-0.038
 would-0.038
Token enchanted
Token ID42282
Feature activation+2.92e-02

Pos logit contributions
my+0.021
ophone+0.015
umper+0.015
ingu+0.014
 best+0.014
Neg logit contributions
 staircase-0.032
 spe-0.031
 site-0.031
uders-0.031
 tad-0.030
Token her
Token ID607
Feature activation-1.90e-02

Pos logit contributions
 staircase+0.058
uders+0.054
 site+0.054
 spe+0.054
Run+0.054
Neg logit contributions
my-0.048
umper-0.034
ophone-0.033
 best-0.033
 especially-0.033
Token,
Token ID11
Feature activation+1.90e-03

Pos logit contributions
my+0.036
ophone+0.028
umper+0.027
ingu+0.026
 especially+0.026
Neg logit contributions
 staircase-0.033
 spe-0.031
 site-0.031
uders-0.031
 tad-0.030
Token long
Token ID890
Feature activation+2.26e-02

Pos logit contributions
 staircase+0.041
uders+0.038
 spe+0.038
 site+0.038
 tad+0.038
Neg logit contributions
my-0.030
ophone-0.022
umper-0.021
 best-0.021
 especially-0.020
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
 staircase+0.046
 spe+0.043
 site+0.043
uders+0.043
 tad+0.043
Neg logit contributions
my-0.035
ophone-0.026
umper-0.025
 best-0.024
 especially-0.024
Token Soon
Token ID15894
Feature activation+2.03e-02

Pos logit contributions
 staircase+0.031
 spe+0.030
yt+0.029
 swarm+0.029
uders+0.029
Neg logit contributions
my-0.022
ophone-0.015
umper-0.015
ingu-0.015
 best-0.014
Token,
Token ID11
Feature activation+1.95e-02

Pos logit contributions
 staircase+0.039
 spe+0.037
 site+0.037
uders+0.036
 tad+0.036
Neg logit contributions
my-0.034
ophone-0.027
umper-0.026
tie-0.023
 especially-0.023
Token the
Token ID262
Feature activation+8.98e-03

Pos logit contributions
 staircase+0.045
 site+0.043
 spe+0.042
 tad+0.042
uders+0.042
Neg logit contributions
my-0.026
ophone-0.018
umper-0.017
ingu-0.016
 pads-0.015
Token wicked
Token ID20589
Feature activation+1.08e-01

Pos logit contributions
 staircase+0.013
 tad+0.013
 site+0.012
 click+0.012
 gap+0.012
Neg logit contributions
my-0.019
ophone-0.017
umper-0.016
tie-0.015
ingu-0.015
Token man
Token ID582
Feature activation+3.11e-02

Pos logit contributions
 staircase+0.147
 site+0.135
 spe+0.134
uders+0.133
 tad+0.130
Neg logit contributions
my-0.243
ophone-0.199
umper-0.194
 especially-0.187
 best-0.187
Token became
Token ID2627
Feature activation-6.30e-03

Pos logit contributions
 staircase+0.068
 spe+0.064
 site+0.064
uders+0.064
 tad+0.063
Neg logit contributions
my-0.044
ophone-0.031
umper-0.030
 especially-0.028
 best-0.028
Token greedy
Token ID31828
Feature activation+2.27e-03

Pos logit contributions
my+0.009
ophone+0.007
umper+0.006
ingu+0.006
 pads+0.006
Neg logit contributions
 staircase-0.013
 spe-0.013
 tad-0.013
 swarm-0.013
 site-0.013
Token again
Token ID757
Feature activation+2.20e-02

Pos logit contributions
 staircase+0.005
 site+0.005
uders+0.005
 spe+0.004
 tad+0.004
Neg logit contributions
my-0.003
ophone-0.002
umper-0.002
ingu-0.002
 especially-0.002
Token and
Token ID290
Feature activation-1.10e-02

Pos logit contributions
 staircase+0.047
 spe+0.044
 site+0.044
uders+0.044
 tad+0.043
Neg logit contributions
my-0.033
ophone-0.023
umper-0.023
 best-0.022
 especially-0.021
Token blow
Token ID6611
Feature activation+3.23e-03

Pos logit contributions
uders+0.112
umm+0.108
 tad+0.106
 staircase+0.105
yt+0.102
Neg logit contributions
my-0.115
 especially-0.088
 would-0.083
 still-0.082
 like-0.081
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
uders+0.008
 staircase+0.008
 site+0.007
 tad+0.007
yt+0.007
Neg logit contributions
my-0.004
 around-0.003
 best-0.003
 forward-0.003
 everywhere-0.003
Token Suddenly
Token ID24975
Feature activation+2.58e-02

Pos logit contributions
 staircase+0.029
 spe+0.027
uders+0.027
yt+0.026
 swarm+0.026
Neg logit contributions
my-0.024
ophone-0.017
 best-0.017
 especially-0.017
umper-0.016
Token,
Token ID11
Feature activation+2.61e-02

Pos logit contributions
 staircase+0.062
uders+0.061
umm+0.060
 spe+0.058
 site+0.058
Neg logit contributions
my-0.029
 best-0.020
 on-0.017
 would-0.016
 all-0.016
Token a
Token ID257
Feature activation+7.70e-02

Pos logit contributions
 staircase+0.063
uders+0.063
 spe+0.061
umm+0.060
yt+0.059
Neg logit contributions
my-0.029
 best-0.019
 especially-0.017
 all-0.017
 on-0.017
Token loud
Token ID7812
Feature activation+1.08e-01

Pos logit contributions
uders+0.153
eway+0.132
 Jess+0.130
inct+0.129
urg+0.128
Neg logit contributions
my-0.129
 best-0.105
 tired-0.100
 very-0.099
 happiest-0.097
Token roar
Token ID34532
Feature activation+5.62e-02

Pos logit contributions
uders+0.173
umm+0.172
 tad+0.161
yt+0.159
ipt+0.158
Neg logit contributions
my-0.215
 around-0.172
 back-0.168
 on-0.167
 all-0.163
Token filled
Token ID5901
Feature activation+3.08e-02

Pos logit contributions
 staircase+0.109
uders+0.108
 tad+0.107
umm+0.106
 spe+0.104
Neg logit contributions
my-0.089
 on-0.069
 around-0.068
 would-0.068
 everywhere-0.066
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
 staircase+0.093
yt+0.092
 spe+0.089
umm+0.089
uders+0.088
Neg logit contributions
my-0.018
 all-0.010
 around-0.010
 over-0.009
 on-0.009
Token air
Token ID1633
Feature activation+1.98e-02

Pos logit contributions
 staircase+0.020
 spe+0.020
uders+0.020
 tad+0.020
 Jess+0.020
Neg logit contributions
my-0.022
 best-0.016
ophone-0.016
 happiest-0.016
umper-0.016
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
 staircase+0.046
uders+0.046
 tad+0.045
Run+0.044
 spe+0.044
Neg logit contributions
my-0.025
 around-0.016
 especially-0.015
 everywhere-0.015
 on-0.015
Token
Token ID198
Feature activation-9.05e-03

Pos logit contributions
my+0.013
ophone+0.009
umper+0.009
 best+0.009
 especially+0.009
Neg logit contributions
 staircase-0.019
 site-0.018
uders-0.018
 spe-0.018
 Jess-0.018
Token
Token ID198
Feature activation+9.28e-03

Pos logit contributions
my+0.013
ophone+0.009
umper+0.009
 happiest+0.009
 best+0.009
Neg logit contributions
 staircase-0.019
 site-0.019
 Jess-0.018
 click-0.018
uders-0.018
TokenSuddenly
Token ID38582
Feature activation+2.79e-02

Pos logit contributions
 staircase+0.017
 spe+0.017
uders+0.017
 tad+0.016
 site+0.016
Neg logit contributions
my-0.016
ophone-0.012
umper-0.012
 especially-0.011
ingu-0.011
Token,
Token ID11
Feature activation+2.09e-02

Pos logit contributions
 staircase+0.063
uders+0.060
 site+0.060
 spe+0.060
 tad+0.059
Neg logit contributions
my-0.038
 best-0.025
ophone-0.025
umper-0.023
 especially-0.023
Token a
Token ID257
Feature activation+7.88e-02

Pos logit contributions
 staircase+0.048
uders+0.046
 spe+0.045
 site+0.045
 tad+0.045
Neg logit contributions
my-0.028
ophone-0.019
umper-0.018
 best-0.017
 especially-0.017
Token loud
Token ID7812
Feature activation+1.08e-01

Pos logit contributions
uders+0.186
rily+0.159
urg+0.158
eway+0.157
iously+0.157
Neg logit contributions
my-0.114
 best-0.085
 very-0.083
 tired-0.082
 would-0.079
Token laugh
Token ID6487
Feature activation+3.72e-02

Pos logit contributions
umm+0.171
uders+0.169
inct+0.155
 tad+0.151
yt+0.150
Neg logit contributions
my-0.227
 around-0.175
 on-0.172
 back-0.170
 about-0.170
Token echoed
Token ID22211
Feature activation+5.44e-02

Pos logit contributions
 staircase+0.080
 spe+0.078
uders+0.078
 tad+0.076
 site+0.076
Neg logit contributions
my-0.056
 on-0.038
 everywhere-0.038
 around-0.038
ophone-0.037
Token around
Token ID1088
Feature activation+2.07e-02

Pos logit contributions
 spe+0.142
uders+0.141
 staircase+0.140
 Sorry+0.137
lev+0.137
Neg logit contributions
my-0.050
 around-0.046
 everywhere-0.042
 back-0.039
 on-0.038
Token the
Token ID262
Feature activation+7.66e-03

Pos logit contributions
 staircase+0.055
 spe+0.054
umm+0.052
uders+0.052
inct+0.052
Neg logit contributions
my-0.020
 everywhere-0.014
 around-0.013
 on-0.012
 all-0.011
Token chamber
Token ID11847
Feature activation+6.68e-02

Pos logit contributions
 spe+0.011
 staircase+0.010
 tad+0.010
Run+0.010
 swarm+0.010
Neg logit contributions
my-0.017
 best-0.013
 everywhere-0.013
 around-0.013
ophone-0.013
Token
Token ID198
Feature activation+1.73e-02

Pos logit contributions
my+0.001
ophone+0.001
umper+0.001
tie+0.001
 happiest+0.001
Neg logit contributions
 staircase-0.002
 Jess-0.002
 site-0.002
 tad-0.002
 swarm-0.002
TokenSuddenly
Token ID38582
Feature activation+2.90e-02

Pos logit contributions
 staircase+0.028
 spe+0.026
uders+0.026
 site+0.026
 tad+0.025
Neg logit contributions
my-0.035
ophone-0.027
umper-0.027
 especially-0.027
 best-0.026
Token,
Token ID11
Feature activation+2.34e-02

Pos logit contributions
 staircase+0.069
 site+0.068
uders+0.067
 spe+0.066
umm+0.064
Neg logit contributions
my-0.035
 best-0.024
 especially-0.021
 on-0.021
 around-0.020
Token a
Token ID257
Feature activation+7.60e-02

Pos logit contributions
 staircase+0.049
 site+0.047
uders+0.047
 spe+0.047
inct+0.044
Neg logit contributions
my-0.035
 best-0.026
 especially-0.025
 everywhere-0.024
 forward-0.024
Token big
Token ID1263
Feature activation+9.56e-02

Pos logit contributions
uders+0.156
inct+0.137
eway+0.135
 site+0.134
 staircase+0.134
Neg logit contributions
my-0.121
 best-0.098
 tired-0.091
 happiest-0.090
 very-0.090
Token loud
Token ID7812
Feature activation+1.07e-01

Pos logit contributions
inct+0.175
uders+0.174
eway+0.165
 staircase+0.162
 site+0.161
Neg logit contributions
my-0.173
 back-0.125
 best-0.124
 on-0.120
 especially-0.118
Token voice
Token ID3809
Feature activation+5.79e-02

Pos logit contributions
uders+0.173
umm+0.169
inct+0.167
ipt+0.163
 staircase+0.160
Neg logit contributions
my-0.205
 around-0.162
 on-0.159
 all-0.157
 back-0.157
Token came
Token ID1625
Feature activation+3.69e-02

Pos logit contributions
umm+0.139
 staircase+0.137
uders+0.133
 tad+0.126
 gap+0.125
Neg logit contributions
my-0.076
 around-0.056
 on-0.054
 about-0.049
 would-0.048
Token from
Token ID422
Feature activation+3.23e-02

Pos logit contributions
umm+0.092
 site+0.091
 staircase+0.091
uders+0.087
opter+0.087
Neg logit contributions
my-0.055
 back-0.036
 on-0.035
 over-0.035
 everywhere-0.033
Token behind
Token ID2157
Feature activation+3.76e-02

Pos logit contributions
 staircase+0.082
uders+0.081
opter+0.079
umm+0.079
yt+0.079
Neg logit contributions
my-0.035
 around-0.021
 everywhere-0.020
 on-0.020
 back-0.018
Token her
Token ID607
Feature activation-4.54e-03

Pos logit contributions
 staircase+0.096
uders+0.091
 site+0.091
 spe+0.089
yt+0.088
Neg logit contributions
my-0.043
 everywhere-0.025
umper-0.024
 on-0.023
 around-0.023
Token
Token ID198
Feature activation-1.08e-02

Pos logit contributions
my+0.024
ophone+0.018
umper+0.018
 best+0.017
ingu+0.017
Neg logit contributions
 staircase-0.029
 site-0.027
uders-0.026
 Jess-0.026
 spe-0.026
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
my+0.016
umper+0.012
ophone+0.012
 happiest+0.012
 best+0.011
Neg logit contributions
 staircase-0.022
 site-0.021
 Jess-0.021
Run-0.020
 click-0.020
TokenSuddenly
Token ID38582
Feature activation+2.53e-02

Pos logit contributions
 staircase+0.020
 site+0.018
 spe+0.018
uders+0.018
 tad+0.018
Neg logit contributions
my-0.023
ophone-0.018
umper-0.018
 best-0.017
 especially-0.017
Token,
Token ID11
Feature activation+1.84e-02

Pos logit contributions
 staircase+0.055
uders+0.054
 spe+0.053
 site+0.053
 tad+0.053
Neg logit contributions
my-0.036
ophone-0.024
 best-0.024
 especially-0.022
umper-0.022
Token a
Token ID257
Feature activation+7.47e-02

Pos logit contributions
 staircase+0.041
uders+0.040
 spe+0.039
 site+0.039
 tad+0.039
Neg logit contributions
my-0.026
ophone-0.017
 best-0.017
 especially-0.016
umper-0.016
Token loud
Token ID7812
Feature activation+1.07e-01

Pos logit contributions
uders+0.171
eway+0.153
rily+0.151
inct+0.150
urg+0.149
Neg logit contributions
my-0.106
 best-0.083
 tired-0.080
 very-0.076
 happiest-0.074
Token noise
Token ID7838
Feature activation+6.66e-02

Pos logit contributions
umm+0.174
uders+0.165
inct+0.159
 staircase+0.158
yt+0.158
Neg logit contributions
my-0.215
 around-0.167
 on-0.166
 back-0.165
 all-0.158
Token made
Token ID925
Feature activation+1.44e-02

Pos logit contributions
 staircase+0.154
uders+0.151
 tad+0.146
umm+0.145
 gap+0.142
Neg logit contributions
my-0.085
 around-0.063
 on-0.061
 everywhere-0.058
 would-0.057
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
 staircase+0.032
 spe+0.032
uders+0.032
 tad+0.031
 site+0.031
Neg logit contributions
my-0.022
 everywhere-0.014
 forward-0.014
 best-0.013
 back-0.013
Token ants
Token ID27842
Feature activation+3.12e-02

Pos logit contributions
 spe+0.028
 staircase+0.026
 Jess+0.025
yt+0.025
 click+0.025
Neg logit contributions
my-0.025
 best-0.021
 happiest-0.019
 especially-0.018
 bored-0.018
Token scatter
Token ID41058
Feature activation-1.05e-02

Pos logit contributions
uders+0.053
 staircase+0.053
 spe+0.053
umm+0.051
 site+0.051
Neg logit contributions
my-0.060
 around-0.044
 especially-0.044
ophone-0.044
 everywhere-0.043
Token happened
Token ID3022
Feature activation+2.57e-02

Pos logit contributions
 staircase+0.088
 spe+0.085
uders+0.084
Run+0.082
 site+0.082
Neg logit contributions
my-0.064
 best-0.043
ophone-0.043
 especially-0.042
umper-0.040
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
 staircase+0.052
 site+0.050
uders+0.049
 spe+0.049
 tad+0.049
Neg logit contributions
my-0.039
ophone-0.029
umper-0.029
ingu-0.027
 best-0.026
Token Suddenly
Token ID24975
Feature activation+2.34e-02

Pos logit contributions
 staircase+0.030
 spe+0.028
uders+0.028
 site+0.028
 tad+0.027
Neg logit contributions
my-0.022
ophone-0.016
umper-0.015
 especially-0.015
ingu-0.014
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
 staircase+0.053
uders+0.051
 spe+0.051
 site+0.050
 tad+0.050
Neg logit contributions
my-0.031
 best-0.020
ophone-0.020
umper-0.019
 especially-0.019
Token a
Token ID257
Feature activation+7.14e-02

Pos logit contributions
 staircase+0.034
uders+0.033
 spe+0.033
 site+0.032
 tad+0.032
Neg logit contributions
my-0.018
ophone-0.011
 best-0.011
umper-0.011
 especially-0.011
Token loud
Token ID7812
Feature activation+1.07e-01

Pos logit contributions
uders+0.145
 staircase+0.129
 Jess+0.129
inct+0.128
 site+0.128
Neg logit contributions
my-0.114
 best-0.091
 happiest-0.082
 tired-0.081
 would-0.079
Token tune
Token ID14009
Feature activation+2.34e-02

Pos logit contributions
uders+0.161
umm+0.157
 staircase+0.152
yt+0.151
inct+0.149
Neg logit contributions
my-0.224
 on-0.174
 around-0.172
 back-0.168
 like-0.164
Token came
Token ID1625
Feature activation+3.03e-02

Pos logit contributions
 staircase+0.049
umm+0.048
 site+0.047
 tad+0.047
 spe+0.047
Neg logit contributions
my-0.035
ophone-0.026
 would-0.025
 on-0.025
 around-0.024
Token out
Token ID503
Feature activation+5.83e-02

Pos logit contributions
 staircase+0.079
 site+0.076
 spe+0.075
uders+0.075
umm+0.074
Neg logit contributions
my-0.038
 forward-0.021
 back-0.021
 on-0.021
 everywhere-0.020
Token of
Token ID286
Feature activation+1.21e-02

Pos logit contributions
uders+0.161
 staircase+0.154
opter+0.151
umm+0.151
lev+0.151
Neg logit contributions
my-0.059
 on-0.043
 around-0.037
 over-0.037
 everywhere-0.035
Token nowhere
Token ID12062
Feature activation+2.67e-02

Pos logit contributions
 staircase+0.029
uders+0.029
 tad+0.028
yt+0.028
inct+0.028
Neg logit contributions
my-0.014
 everywhere-0.009
 on-0.008
 all-0.008
umper-0.007
Token.
Token ID13
Feature activation-9.21e-03

Pos logit contributions
my+0.015
umper+0.011
ophone+0.011
ingu+0.011
 especially+0.010
Neg logit contributions
 staircase-0.019
 spe-0.018
 site-0.018
uders-0.018
 click-0.018
Token They
Token ID1119
Feature activation-6.18e-03

Pos logit contributions
my+0.016
ophone+0.013
umper+0.012
 best+0.012
 especially+0.012
Neg logit contributions
 staircase-0.017
 site-0.015
 tad-0.015
uders-0.015
 spe-0.015
Token wear
Token ID5806
Feature activation-6.66e-02

Pos logit contributions
my+0.010
umper+0.009
ophone+0.009
ingu+0.008
ip+0.008
Neg logit contributions
 staircase-0.010
 click-0.010
 spe-0.010
Run-0.009
 gasp-0.009
Token helmets
Token ID28359
Feature activation-5.00e-02

Pos logit contributions
my+0.143
umper+0.134
ingu+0.116
ophone+0.114
ll+0.112
Neg logit contributions
 spe-0.093
 staircase-0.092
 click-0.090
 site-0.082
Run-0.080
Token and
Token ID290
Feature activation-3.50e-02

Pos logit contributions
my+0.076
umper+0.076
ingu+0.066
wy+0.065
 happiest+0.061
Neg logit contributions
 staircase-0.086
 click-0.083
 site-0.083
atively-0.081
 spe-0.080
Token pads
Token ID21226
Feature activation-1.02e-01

Pos logit contributions
my+0.063
umper+0.050
ophone+0.049
ingu+0.048
 best+0.045
Neg logit contributions
 staircase-0.062
 spe-0.059
 click-0.057
Run-0.057
 site-0.056
Token to
Token ID284
Feature activation+3.34e-02

Pos logit contributions
my+0.183
umper+0.181
ingu+0.163
 happiest+0.163
ophone+0.155
Neg logit contributions
 click-0.149
 staircase-0.146
 spe-0.144
atively-0.140
 site-0.137
Token be
Token ID307
Feature activation-3.93e-02

Pos logit contributions
 staircase+0.060
 site+0.056
 spe+0.056
 click+0.055
uders+0.055
Neg logit contributions
my-0.059
ophone-0.046
umper-0.046
ingu-0.044
 best-0.042
Token safe
Token ID3338
Feature activation+1.15e-02

Pos logit contributions
my+0.064
umper+0.051
ingu+0.050
ophone+0.049
ip+0.044
Neg logit contributions
 staircase-0.072
 spe-0.070
 site-0.068
 click-0.067
uders-0.067
Token.
Token ID13
Feature activation-1.64e-02

Pos logit contributions
 staircase+0.026
 spe+0.024
 tad+0.024
 site+0.024
uders+0.024
Neg logit contributions
my-0.016
ophone-0.011
 best-0.011
 especially-0.010
umper-0.010
Token
Token ID198
Feature activation-2.25e-02

Pos logit contributions
my+0.027
ophone+0.022
umper+0.022
tie+0.021
 happiest+0.020
Neg logit contributions
 staircase-0.030
 Jess-0.029
 site-0.029
 tad-0.028
uders-0.028
Token sk
Token ID1341
Feature activation-1.27e-02

Pos logit contributions
my+0.106
umper+0.087
ophone+0.081
ingu+0.078
 happiest+0.077
Neg logit contributions
 staircase-0.090
 spe-0.090
 flame-0.084
uders-0.084
 site-0.084
Tokenates
Token ID689
Feature activation-6.06e-02

Pos logit contributions
my+0.024
 happiest+0.019
 would+0.018
 best+0.018
 especially+0.018
Neg logit contributions
uders-0.022
 site-0.020
 staircase-0.020
 tad-0.020
urg-0.019
Token and
Token ID290
Feature activation-3.83e-02

Pos logit contributions
my+0.097
umper+0.083
 happiest+0.075
ingu+0.074
ophone+0.073
Neg logit contributions
 staircase-0.107
 spe-0.106
uders-0.105
 site-0.104
fire-0.103
Token helmets
Token ID28359
Feature activation-4.09e-02

Pos logit contributions
my+0.067
ophone+0.050
umper+0.049
 best+0.047
 especially+0.047
Neg logit contributions
 staircase-0.072
 spe-0.067
 site-0.067
uders-0.067
 tad-0.066
Token and
Token ID290
Feature activation-3.40e-02

Pos logit contributions
my+0.062
umper+0.051
ingu+0.046
ophone+0.046
 happiest+0.045
Neg logit contributions
 staircase-0.078
 click-0.075
uders-0.075
 spe-0.075
 site-0.074
Token pads
Token ID21226
Feature activation-9.73e-02

Pos logit contributions
my+0.061
ophone+0.047
umper+0.046
ingu+0.043
 best+0.043
Neg logit contributions
 staircase-0.062
 spe-0.058
 site-0.057
uders-0.057
 tad-0.057
Token.
Token ID13
Feature activation-6.17e-03

Pos logit contributions
my+0.168
umper+0.153
 happiest+0.140
ingu+0.140
ophone+0.140
Neg logit contributions
 click-0.151
 staircase-0.151
 spe-0.149
fire-0.146
 site-0.146
Token They
Token ID1119
Feature activation-4.12e-03

Pos logit contributions
my+0.012
ophone+0.010
umper+0.009
 best+0.009
 especially+0.009
Neg logit contributions
 staircase-0.010
 spe-0.009
uders-0.009
 site-0.009
 tad-0.009
Token use
Token ID779
Feature activation-2.66e-02

Pos logit contributions
my+0.006
umper+0.005
ophone+0.005
ingu+0.005
ip+0.004
Neg logit contributions
 spe-0.008
 staircase-0.008
 click-0.008
 collapse-0.008
 crumble-0.007
Token a
Token ID257
Feature activation+2.38e-02

Pos logit contributions
my+0.047
umper+0.037
ophone+0.036
ingu+0.034
 happiest+0.034
Neg logit contributions
 staircase-0.047
 spe-0.046
 site-0.044
 flame-0.044
uders-0.044
Token puck
Token ID20695
Feature activation-2.30e-02

Pos logit contributions
 staircase+0.041
uders+0.038
 site+0.038
 spe+0.038
 tad+0.037
Neg logit contributions
my-0.045
ophone-0.035
umper-0.034
 best-0.034
 especially-0.033
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
my+0.079
umper+0.060
ophone+0.060
 happiest+0.057
ingu+0.056
Neg logit contributions
 staircase-0.093
 site-0.089
 spe-0.089
uders-0.088
 tad-0.085
Token They
Token ID1119
Feature activation-3.47e-03

Pos logit contributions
my+0.026
ophone+0.019
umper+0.018
 especially+0.018
 best+0.018
Neg logit contributions
 staircase-0.031
 spe-0.029
uders-0.029
 site-0.029
 tad-0.029
Token also
Token ID635
Feature activation-1.56e-02

Pos logit contributions
my+0.005
umper+0.004
ophone+0.004
ingu+0.004
ip+0.004
Neg logit contributions
 spe-0.007
 staircase-0.007
 click-0.006
 tad-0.006
uders-0.006
Token have
Token ID423
Feature activation-2.47e-02

Pos logit contributions
my+0.023
umper+0.019
ophone+0.018
ingu+0.017
ip+0.017
Neg logit contributions
 spe-0.031
 staircase-0.030
 site-0.029
 tad-0.029
 click-0.029
Token ink
Token ID16882
Feature activation-4.79e-02

Pos logit contributions
my+0.039
umper+0.031
ophone+0.030
 forward+0.027
ingu+0.027
Neg logit contributions
 spe-0.050
 staircase-0.048
 site-0.047
 tad-0.046
uders-0.045
Token pads
Token ID21226
Feature activation-9.71e-02

Pos logit contributions
my+0.076
ophone+0.056
umper+0.056
 especially+0.052
 happiest+0.051
Neg logit contributions
 staircase-0.095
 spe-0.090
 site-0.090
uders-0.089
 tad-0.088
Token of
Token ID286
Feature activation-1.09e-02

Pos logit contributions
my+0.157
umper+0.143
ophone+0.131
 happiest+0.127
ingu+0.125
Neg logit contributions
 staircase-0.162
fire-0.158
 site-0.158
 spe-0.157
 click-0.153
Token different
Token ID1180
Feature activation-1.03e-02

Pos logit contributions
my+0.018
ophone+0.014
umper+0.014
 forward+0.012
 happiest+0.012
Neg logit contributions
 staircase-0.020
 spe-0.020
 tad-0.020
 site-0.020
uders-0.019
Token colors
Token ID7577
Feature activation-5.09e-02

Pos logit contributions
my+0.020
ophone+0.016
umper+0.016
 happiest+0.015
ingu+0.015
Neg logit contributions
 staircase-0.016
 spe-0.016
uders-0.015
 site-0.015
 tad-0.015
Token.
Token ID13
Feature activation-1.77e-02

Pos logit contributions
my+0.080
umper+0.064
ophone+0.062
 happiest+0.058
ingu+0.058
Neg logit contributions
 staircase-0.097
 site-0.092
 spe-0.092
uders-0.091
 flame-0.089
Token They
Token ID1119
Feature activation-3.73e-03

Pos logit contributions
my+0.029
ophone+0.021
umper+0.021
 best+0.020
 especially+0.020
Neg logit contributions
 staircase-0.035
 spe-0.033
 site-0.033
uders-0.033
 tad-0.032
Token their
Token ID511
Feature activation-5.35e-02

Pos logit contributions
my+0.150
umper+0.128
ophone+0.120
ingu+0.120
 happiest+0.110
Neg logit contributions
 spe-0.109
 staircase-0.108
 site-0.100
 Jess-0.100
 click-0.100
Token gear
Token ID7733
Feature activation-5.23e-02

Pos logit contributions
my+0.111
umper+0.107
ingu+0.095
ophone+0.093
pping+0.089
Neg logit contributions
 staircase-0.074
 spe-0.071
 voice-0.067
 site-0.066
 slit-0.065
Token:
Token ID25
Feature activation-2.61e-02

Pos logit contributions
my+0.082
umper+0.070
ingu+0.058
 happiest+0.056
ophone+0.056
Neg logit contributions
 staircase-0.105
 site-0.100
yt-0.094
uders-0.092
 spe-0.092
Token helmets
Token ID28359
Feature activation-5.35e-02

Pos logit contributions
my+0.044
umper+0.038
ophone+0.036
ingu+0.036
wy+0.033
Neg logit contributions
 staircase-0.047
 spe-0.045
Run-0.044
 tad-0.043
 Jess-0.042
Token,
Token ID11
Feature activation-3.54e-02

Pos logit contributions
my+0.078
umper+0.063
ingu+0.058
ophone+0.057
ip+0.053
Neg logit contributions
 staircase-0.107
 site-0.102
uders-0.101
 spe-0.101
 click-0.100
Token pads
Token ID21226
Feature activation-9.62e-02

Pos logit contributions
my+0.060
umper+0.054
ingu+0.053
ophone+0.051
 forward+0.048
Neg logit contributions
 spe-0.060
 staircase-0.058
 click-0.056
Run-0.055
 tad-0.054
Token and
Token ID290
Feature activation-3.70e-02

Pos logit contributions
my+0.159
umper+0.140
ophone+0.136
ingu+0.128
 happiest+0.123
Neg logit contributions
 staircase-0.161
 site-0.155
 click-0.153
 spe-0.150
uders-0.147
Token gloves
Token ID18051
Feature activation-6.27e-02

Pos logit contributions
my+0.063
umper+0.053
ophone+0.051
ingu+0.051
ip+0.046
Neg logit contributions
 staircase-0.066
 spe-0.064
 click-0.063
Run-0.060
 tad-0.060
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
my+0.101
umper+0.082
ophone+0.078
ingu+0.077
 happiest+0.072
Neg logit contributions
 staircase-0.115
 site-0.110
 spe-0.109
 click-0.106
uders-0.106
Token They
Token ID1119
Feature activation-5.49e-03

Pos logit contributions
my+0.030
ophone+0.025
umper+0.024
tie+0.024
 happiest+0.023
Neg logit contributions
 staircase-0.029
 tad-0.029
 Jess-0.029
 gap-0.028
 site-0.028
Token knew
Token ID2993
Feature activation-9.39e-04

Pos logit contributions
my+0.008
umper+0.007
ophone+0.007
ingu+0.007
 forward+0.006
Neg logit contributions
 staircase-0.010
 spe-0.010
Run-0.009
 slit-0.009
yt-0.009
Token in
Token ID287
Feature activation-5.18e-03

Pos logit contributions
my+0.027
umper+0.019
ophone+0.019
ingu+0.018
 especially+0.017
Neg logit contributions
 staircase-0.040
 site-0.037
 spe-0.037
uders-0.037
 click-0.036
Token the
Token ID262
Feature activation-2.65e-03

Pos logit contributions
my+0.008
umper+0.005
ophone+0.005
 best+0.005
 especially+0.005
Neg logit contributions
 staircase-0.011
Run-0.010
 tad-0.010
 site-0.010
uders-0.010
Token book
Token ID1492
Feature activation+1.07e-02

Pos logit contributions
my+0.005
umper+0.004
ophone+0.004
ingu+0.004
 especially+0.004
Neg logit contributions
 staircase-0.005
 site-0.004
 spe-0.004
 gap-0.004
 click-0.004
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 staircase+0.023
 spe+0.021
 site+0.021
uders+0.021
 tad+0.021
Neg logit contributions
my-0.016
ophone-0.012
umper-0.011
 best-0.011
 especially-0.011
Token<|endoftext|>
Token ID50256
Feature activation-4.31e-02

Pos logit contributions
my+0.019
ophone+0.014
umper+0.013
 best+0.013
 especially+0.013
Neg logit contributions
 staircase-0.022
 spe-0.020
 site-0.020
uders-0.020
 tad-0.020
TokenOnce
Token ID7454
Feature activation-9.54e-02

Pos logit contributions
my+0.073
ophone+0.066
umper+0.064
 happiest+0.063
ingu+0.060
Neg logit contributions
 site-0.072
 staircase-0.070
 Jess-0.067
fire-0.067
 gap-0.066
Token upon
Token ID2402
Feature activation-4.00e-02

Pos logit contributions
umper+0.148
tie+0.147
ophone+0.142
 pads+0.141
my+0.140
Neg logit contributions
 Jess-0.153
yt-0.152
 staircase-0.150
 site-0.139
 swarm-0.138
Token a
Token ID257
Feature activation-1.65e-02

Pos logit contributions
my+0.062
umper+0.054
ophone+0.053
ip+0.049
ingu+0.049
Neg logit contributions
 staircase-0.071
Run-0.071
 flame-0.069
 spe-0.069
 tad-0.069
Token time
Token ID640
Feature activation-4.77e-02

Pos logit contributions
my+0.024
umper+0.019
ophone+0.019
ingu+0.017
tie+0.017
Neg logit contributions
 staircase-0.034
 spe-0.032
 site-0.032
uders-0.031
Run-0.031
Token,
Token ID11
Feature activation-6.18e-02

Pos logit contributions
my+0.085
umper+0.071
ophone+0.070
tie+0.066
ingu+0.065
Neg logit contributions
 staircase-0.083
 Jess-0.077
 site-0.077
 spe-0.077
uders-0.077
Token there
Token ID612
Feature activation-4.35e-02

Pos logit contributions
my+0.081
umper+0.067
ophone+0.063
ingu+0.061
 pads+0.059
Neg logit contributions
 staircase-0.132
 spe-0.128
Run-0.127
 flame-0.127
 site-0.125
Token sharing
Token ID7373
Feature activation-3.02e-02

Pos logit contributions
my+0.020
umper+0.016
ophone+0.015
ingu+0.014
tie+0.014
Neg logit contributions
 staircase-0.028
 site-0.027
 spe-0.026
uders-0.026
 tad-0.026
Token is
Token ID318
Feature activation-1.41e-02

Pos logit contributions
my+0.054
umper+0.049
ophone+0.046
ingu+0.046
ip+0.046
Neg logit contributions
 site-0.045
 staircase-0.045
uders-0.044
 tad-0.043
 click-0.043
Token good
Token ID922
Feature activation-5.62e-03

Pos logit contributions
my+0.025
umper+0.022
ophone+0.022
ingu+0.020
 pads+0.019
Neg logit contributions
 tad-0.023
 collapse-0.022
 crumble-0.022
 click-0.022
izing-0.022
Token.
Token ID13
Feature activation-5.59e-04

Pos logit contributions
my+0.008
ophone+0.006
umper+0.006
ingu+0.005
 especially+0.005
Neg logit contributions
 staircase-0.012
uders-0.011
 site-0.011
 spe-0.011
 tad-0.011
Token<|endoftext|>
Token ID50256
Feature activation-2.52e-02

Pos logit contributions
my+0.001
ophone+0.001
umper+0.001
 best+0.001
 especially+0.001
Neg logit contributions
 staircase-0.001
 spe-0.001
 site-0.001
uders-0.001
 tad-0.001
TokenOnce
Token ID7454
Feature activation-9.41e-02

Pos logit contributions
my+0.042
ophone+0.035
 happiest+0.033
umper+0.033
 especially+0.032
Neg logit contributions
 staircase-0.045
 site-0.044
 tad-0.042
Run-0.042
 spe-0.042
Token,
Token ID11
Feature activation-7.30e-02

Pos logit contributions
tie+0.158
 pads+0.157
umper+0.150
my+0.148
wy+0.148
Neg logit contributions
yt-0.139
 staircase-0.139
 Jess-0.136
 swarm-0.129
 site-0.125
Token there
Token ID612
Feature activation-4.50e-02

Pos logit contributions
my+0.102
umper+0.099
 pads+0.092
tie+0.088
ophone+0.086
Neg logit contributions
 staircase-0.138
 flame-0.133
 Jess-0.132
 spe-0.129
uders-0.126
Token was
Token ID373
Feature activation-1.58e-02

Pos logit contributions
my+0.087
umper+0.083
ophone+0.082
tie+0.079
 pads+0.077
Neg logit contributions
 staircase-0.062
 site-0.060
 Jess-0.058
 tad-0.057
uders-0.056
Token a
Token ID257
Feature activation+2.30e-02

Pos logit contributions
my+0.023
ophone+0.019
umper+0.018
ingu+0.016
 happiest+0.016
Neg logit contributions
 staircase-0.031
 spe-0.030
 tad-0.030
Run-0.030
 Jess-0.030
Token girl
Token ID2576
Feature activation+1.03e-02

Pos logit contributions
 staircase+0.038
uders+0.036
 site+0.036
 spe+0.035
 tad+0.035
Neg logit contributions
my-0.045
ophone-0.035
umper-0.034
 best-0.033
 especially-0.033
Token best
Token ID1266
Feature activation+6.57e-03

Pos logit contributions
my+0.025
ophone+0.022
 pads+0.018
umper+0.018
ip+0.018
Neg logit contributions
 tad-0.063
 staircase-0.062
 flame-0.062
 spe-0.061
 click-0.060
Token of
Token ID286
Feature activation+2.11e-03

Pos logit contributions
 staircase+0.015
uders+0.014
 site+0.014
 spe+0.014
 tad+0.013
Neg logit contributions
my-0.009
ophone-0.006
umper-0.006
 best-0.006
 especially-0.006
Token friends
Token ID2460
Feature activation-3.24e-02

Pos logit contributions
 tad+0.004
 staircase+0.004
 spe+0.004
uders+0.004
 site+0.004
Neg logit contributions
my-0.003
ophone-0.003
umper-0.002
 pads-0.002
tie-0.002
Token.
Token ID13
Feature activation-9.42e-03

Pos logit contributions
my+0.058
umper+0.047
ophone+0.047
ingu+0.046
wy+0.042
Neg logit contributions
 spe-0.053
 staircase-0.053
 Jess-0.051
 site-0.051
 tad-0.050
Token<|endoftext|>
Token ID50256
Feature activation-3.79e-02

Pos logit contributions
my+0.016
ophone+0.015
umper+0.014
wy+0.014
tie+0.014
Neg logit contributions
 Jess-0.015
 spe-0.014
 site-0.014
 tad-0.014
 staircase-0.014
TokenOnce
Token ID7454
Feature activation-9.35e-02

Pos logit contributions
my+0.059
ophone+0.049
umper+0.048
 happiest+0.046
ingu+0.045
Neg logit contributions
 staircase-0.071
 site-0.070
 Jess-0.067
fire-0.066
 spe-0.066
Token upon
Token ID2402
Feature activation-3.41e-02

Pos logit contributions
umper+0.141
tie+0.140
 pads+0.136
my+0.135
wy+0.135
Neg logit contributions
 Jess-0.155
yt-0.155
 staircase-0.154
 swarm-0.141
 site-0.139
Token a
Token ID257
Feature activation-1.14e-02

Pos logit contributions
my+0.052
umper+0.044
ophone+0.043
ip+0.040
ingu+0.040
Neg logit contributions
 staircase-0.063
Run-0.062
 spe-0.061
 flame-0.060
 tad-0.060
Token time
Token ID640
Feature activation-4.28e-02

Pos logit contributions
my+0.017
umper+0.014
ophone+0.013
tie+0.012
ingu+0.012
Neg logit contributions
 staircase-0.023
 spe-0.022
 site-0.021
uders-0.021
Run-0.021
Token,
Token ID11
Feature activation-5.90e-02

Pos logit contributions
my+0.074
umper+0.061
ophone+0.059
tie+0.056
ingu+0.056
Neg logit contributions
 staircase-0.078
 Jess-0.072
uders-0.072
 site-0.072
 spe-0.072
Token there
Token ID612
Feature activation-3.80e-02

Pos logit contributions
my+0.081
umper+0.072
ingu+0.066
ophone+0.065
 pads+0.063
Neg logit contributions
 staircase-0.120
 spe-0.115
Run-0.115
 Jess-0.114
 flame-0.114
Token laughed
Token ID13818
Feature activation-1.49e-02

Pos logit contributions
my+0.034
umper+0.023
ophone+0.022
ingu+0.020
 best+0.019
Neg logit contributions
 staircase-0.070
 spe-0.066
 site-0.066
uders-0.065
 tad-0.065
Token all
Token ID477
Feature activation-6.16e-02

Pos logit contributions
my+0.023
umper+0.018
ophone+0.018
ingu+0.016
 pads+0.015
Neg logit contributions
 staircase-0.030
 site-0.028
 click-0.028
 spe-0.028
 tad-0.027
Token day
Token ID1110
Feature activation-2.51e-02

Pos logit contributions
umper+0.078
my+0.077
ingu+0.074
ip+0.066
ophone+0.066
Neg logit contributions
 staircase-0.124
 site-0.117
 Jess-0.117
 spe-0.115
 flame-0.114
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
my+0.044
ophone+0.034
umper+0.033
 best+0.032
 especially+0.032
Neg logit contributions
 staircase-0.046
 spe-0.043
uders-0.043
 site-0.043
 tad-0.043
Token<|endoftext|>
Token ID50256
Feature activation-3.75e-02

Pos logit contributions
my+0.029
umper+0.024
ophone+0.024
ingu+0.022
 especially+0.022
Neg logit contributions
 staircase-0.023
 Jess-0.022
 site-0.022
uders-0.022
 tad-0.022
TokenOnce
Token ID7454
Feature activation-9.32e-02

Pos logit contributions
my+0.059
ophone+0.050
umper+0.048
 happiest+0.046
ingu+0.045
Neg logit contributions
 staircase-0.069
 site-0.068
 Jess-0.066
 tad-0.065
Run-0.065
Token upon
Token ID2402
Feature activation-3.77e-02

Pos logit contributions
umper+0.138
tie+0.137
my+0.134
 pads+0.130
ophone+0.129
Neg logit contributions
 Jess-0.162
 staircase-0.159
yt-0.155
 swarm-0.147
 site-0.144
Token a
Token ID257
Feature activation-1.55e-02

Pos logit contributions
my+0.055
umper+0.045
ophone+0.044
ingu+0.041
ip+0.040
Neg logit contributions
 staircase-0.074
Run-0.071
 tad-0.070
 spe-0.070
 flame-0.070
Token time
Token ID640
Feature activation-4.45e-02

Pos logit contributions
my+0.022
umper+0.017
ophone+0.017
ingu+0.015
tie+0.015
Neg logit contributions
 staircase-0.033
 spe-0.031
 site-0.031
uders-0.031
Run-0.030
Token,
Token ID11
Feature activation-5.92e-02

Pos logit contributions
my+0.073
ophone+0.056
umper+0.056
ingu+0.052
tie+0.051
Neg logit contributions
 staircase-0.087
 site-0.081
uders-0.081
 spe-0.081
 tad-0.080
Token there
Token ID612
Feature activation-4.25e-02

Pos logit contributions
my+0.075
umper+0.058
ophone+0.055
ingu+0.053
 pads+0.052
Neg logit contributions
 staircase-0.132
 spe-0.127
Run-0.126
 Jess-0.125
 flame-0.125
Token teasing
Token ID34417
Feature activation-1.84e-02

Pos logit contributions
my+0.056
umper+0.046
ophone+0.045
ingu+0.042
tie+0.040
Neg logit contributions
 spe-0.064
 staircase-0.064
 tad-0.063
 click-0.062
 site-0.061
Token is
Token ID318
Feature activation-7.64e-03

Pos logit contributions
my+0.028
ophone+0.020
umper+0.020
 especially+0.019
ingu+0.018
Neg logit contributions
 staircase-0.038
uders-0.036
 site-0.036
 spe-0.036
 tad-0.036
Token bad
Token ID2089
Feature activation+6.08e-03

Pos logit contributions
my+0.012
umper+0.011
ophone+0.010
ingu+0.010
tie+0.009
Neg logit contributions
 tad-0.014
 collapse-0.013
 spe-0.013
 crumble-0.013
 slit-0.013
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
 staircase+0.013
uders+0.012
 site+0.012
 spe+0.012
 tad+0.012
Neg logit contributions
my-0.009
ophone-0.007
umper-0.007
ingu-0.006
 especially-0.006
Token<|endoftext|>
Token ID50256
Feature activation-4.56e-02

Pos logit contributions
my+0.016
umper+0.013
ophone+0.013
tie+0.012
 happiest+0.012
Neg logit contributions
 Jess-0.019
 staircase-0.019
 site-0.019
 tad-0.019
uders-0.019
TokenOnce
Token ID7454
Feature activation-9.31e-02

Pos logit contributions
my+0.077
ophone+0.069
umper+0.067
 happiest+0.066
ingu+0.063
Neg logit contributions
 site-0.076
 staircase-0.074
 Jess-0.071
fire-0.071
Run-0.070
Token upon
Token ID2402
Feature activation-3.73e-02

Pos logit contributions
umper+0.144
tie+0.144
 pads+0.139
ophone+0.137
my+0.133
Neg logit contributions
 Jess-0.149
yt-0.147
 staircase-0.145
 swarm-0.134
 site-0.133
Token a
Token ID257
Feature activation-1.48e-02

Pos logit contributions
my+0.057
umper+0.048
ophone+0.047
ip+0.044
ingu+0.043
Neg logit contributions
 staircase-0.069
Run-0.067
 flame-0.066
 spe-0.066
 tad-0.065
Token time
Token ID640
Feature activation-4.79e-02

Pos logit contributions
my+0.022
umper+0.017
ophone+0.017
ingu+0.015
tie+0.015
Neg logit contributions
 staircase-0.030
 spe-0.029
 site-0.029
uders-0.028
Run-0.028
Token,
Token ID11
Feature activation-5.98e-02

Pos logit contributions
my+0.086
umper+0.072
ophone+0.071
tie+0.067
ingu+0.067
Neg logit contributions
 staircase-0.082
 Jess-0.077
uders-0.077
 spe-0.077
 site-0.076
Token there
Token ID612
Feature activation-4.32e-02

Pos logit contributions
my+0.079
umper+0.065
ophone+0.060
ingu+0.059
 pads+0.058
Neg logit contributions
 staircase-0.128
 spe-0.124
Run-0.123
 flame-0.123
 Jess-0.122
Token red
Token ID2266
Feature activation+3.83e-02

Pos logit contributions
 staircase+0.129
uders+0.122
 site+0.119
 tad+0.119
 spe+0.119
Neg logit contributions
my-0.135
ophone-0.100
umper-0.098
 best-0.097
 especially-0.095
Token je
Token ID11223
Feature activation+2.09e-02

Pos logit contributions
 staircase+0.076
uders+0.074
 Jess+0.073
 swarm+0.072
Run+0.072
Neg logit contributions
my-0.065
ophone-0.043
umper-0.043
 best-0.042
 especially-0.040
Tokenep
Token ID538
Feature activation+2.02e-02

Pos logit contributions
 staircase+0.034
 site+0.031
 spe+0.030
uders+0.030
 tad+0.030
Neg logit contributions
my-0.042
ophone-0.033
umper-0.033
 especially-0.032
 best-0.032
Token.
Token ID13
Feature activation-5.92e-03

Pos logit contributions
 staircase+0.044
 spe+0.042
uders+0.042
 tad+0.041
 site+0.041
Neg logit contributions
my-0.029
ophone-0.020
umper-0.019
 best-0.019
 especially-0.019
Token<|endoftext|>
Token ID50256
Feature activation-3.05e-02

Pos logit contributions
my+0.010
umper+0.007
ophone+0.007
ingu+0.007
 best+0.007
Neg logit contributions
 staircase-0.011
 site-0.011
uders-0.011
 spe-0.011
 Jess-0.011
TokenOnce
Token ID7454
Feature activation-9.26e-02

Pos logit contributions
my+0.046
ophone+0.037
umper+0.036
 happiest+0.035
ingu+0.034
Neg logit contributions
 staircase-0.060
 site-0.058
 Jess-0.056
 spe-0.056
 tad-0.056
Token upon
Token ID2402
Feature activation-3.48e-02

Pos logit contributions
umper+0.134
my+0.134
tie+0.129
ophone+0.125
 pads+0.122
Neg logit contributions
 Jess-0.165
 staircase-0.165
yt-0.160
 site-0.150
 swarm-0.147
Token a
Token ID257
Feature activation-1.31e-02

Pos logit contributions
my+0.051
umper+0.040
ophone+0.040
ingu+0.037
ip+0.036
Neg logit contributions
 staircase-0.070
Run-0.067
 spe-0.066
 tad-0.066
 Jess-0.065
Token time
Token ID640
Feature activation-4.33e-02

Pos logit contributions
my+0.018
umper+0.014
ophone+0.014
ingu+0.012
tie+0.012
Neg logit contributions
 staircase-0.028
 spe-0.027
 site-0.027
uders-0.026
Run-0.026
Token,
Token ID11
Feature activation-5.78e-02

Pos logit contributions
my+0.072
umper+0.055
ophone+0.055
ingu+0.052
tie+0.051
Neg logit contributions
 staircase-0.083
 site-0.077
uders-0.077
 spe-0.077
 Jess-0.076
Token there
Token ID612
Feature activation-3.92e-02

Pos logit contributions
my+0.075
umper+0.063
ophone+0.059
ingu+0.058
 pads+0.056
Neg logit contributions
 staircase-0.124
 spe-0.118
 flame-0.118
Run-0.118
 Jess-0.117
Token the
Token ID262
Feature activation-2.46e-02

Pos logit contributions
my+0.040
umper+0.031
ophone+0.030
ingu+0.027
 happiest+0.026
Neg logit contributions
 staircase-0.056
 site-0.054
 spe-0.054
 tad-0.052
 Jess-0.052
Token book
Token ID1492
Feature activation-3.55e-03

Pos logit contributions
my+0.047
umper+0.039
ophone+0.038
ingu+0.038
 especially+0.037
Neg logit contributions
 staircase-0.040
 site-0.037
 swarm-0.037
 click-0.037
 spe-0.037
Token together
Token ID1978
Feature activation-1.62e-02

Pos logit contributions
my+0.006
ophone+0.005
 best+0.004
umper+0.004
 especially+0.004
Neg logit contributions
 staircase-0.007
 spe-0.006
uders-0.006
 site-0.006
 tad-0.006
Token.
Token ID13
Feature activation-8.31e-03

Pos logit contributions
my+0.026
umper+0.019
ophone+0.019
ingu+0.017
 best+0.017
Neg logit contributions
 staircase-0.033
 site-0.031
 spe-0.031
uders-0.031
 tad-0.030
Token<|endoftext|>
Token ID50256
Feature activation-3.63e-02

Pos logit contributions
my+0.012
ophone+0.009
umper+0.009
 especially+0.009
 best+0.009
Neg logit contributions
 staircase-0.017
 site-0.016
 tad-0.016
 Jess-0.016
uders-0.016
TokenOnce
Token ID7454
Feature activation-9.26e-02

Pos logit contributions
my+0.059
ophone+0.051
 happiest+0.049
umper+0.048
 pads+0.046
Neg logit contributions
 staircase-0.064
 site-0.063
 Jess-0.061
 tad-0.060
fire-0.060
Token upon
Token ID2402
Feature activation-3.74e-02

Pos logit contributions
tie+0.143
umper+0.140
 pads+0.138
ophone+0.136
my+0.133
Neg logit contributions
 Jess-0.152
yt-0.149
 staircase-0.146
 site-0.136
 swarm-0.133
Token a
Token ID257
Feature activation-1.21e-02

Pos logit contributions
my+0.057
umper+0.048
ophone+0.047
ip+0.044
 happiest+0.043
Neg logit contributions
 staircase-0.070
Run-0.068
 flame-0.067
 spe-0.066
 tad-0.066
Token time
Token ID640
Feature activation-4.47e-02

Pos logit contributions
my+0.017
umper+0.013
ophone+0.013
ingu+0.012
tie+0.012
Neg logit contributions
 staircase-0.025
 site-0.024
 spe-0.024
uders-0.024
Run-0.024
Token,
Token ID11
Feature activation-5.83e-02

Pos logit contributions
my+0.076
umper+0.061
ophone+0.061
tie+0.057
ingu+0.056
Neg logit contributions
 staircase-0.083
 site-0.077
uders-0.077
 spe-0.077
 Jess-0.076
Token there
Token ID612
Feature activation-4.13e-02

Pos logit contributions
my+0.076
umper+0.063
ophone+0.060
 pads+0.058
ingu+0.057
Neg logit contributions
 staircase-0.125
Run-0.120
 flame-0.120
 spe-0.119
 Jess-0.119
Token was
Token ID373
Feature activation+8.01e-03

Pos logit contributions
my+0.011
ophone+0.008
umper+0.008
 best+0.007
 especially+0.007
Neg logit contributions
 staircase-0.015
 site-0.014
uders-0.014
 spe-0.014
 tad-0.014
Token raining
Token ID43079
Feature activation+2.17e-03

Pos logit contributions
 staircase+0.016
 spe+0.016
 swarm+0.015
uders+0.015
 site+0.015
Neg logit contributions
my-0.012
ophone-0.010
umper-0.009
ingu-0.009
tie-0.008
Token outside
Token ID2354
Feature activation+3.41e-02

Pos logit contributions
 staircase+0.005
 tad+0.005
uders+0.005
 spe+0.005
 site+0.005
Neg logit contributions
my-0.003
ophone-0.002
 especially-0.002
 best-0.002
umper-0.002
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
 staircase+0.079
uders+0.077
 tad+0.077
Run+0.077
 spe+0.076
Neg logit contributions
my-0.045
 best-0.029
 especially-0.029
ophone-0.026
 everywhere-0.026
Token<|endoftext|>
Token ID50256
Feature activation-3.78e-02

Pos logit contributions
my+0.016
ophone+0.012
umper+0.012
 best+0.011
tie+0.011
Neg logit contributions
 staircase-0.020
 Jess-0.020
 tad-0.019
 spe-0.019
uders-0.019
TokenOnce
Token ID7454
Feature activation-9.26e-02

Pos logit contributions
my+0.059
ophone+0.051
 happiest+0.048
umper+0.048
ingu+0.046
Neg logit contributions
 staircase-0.070
 site-0.067
 Jess-0.067
 spe-0.065
 tad-0.064
Token upon
Token ID2402
Feature activation-3.56e-02

Pos logit contributions
tie+0.137
umper+0.132
ophone+0.132
 pads+0.131
my+0.130
Neg logit contributions
 Jess-0.160
yt-0.160
 staircase-0.156
 swarm-0.147
 site-0.137
Token a
Token ID257
Feature activation-1.40e-02

Pos logit contributions
my+0.053
ophone+0.044
umper+0.044
ip+0.041
ingu+0.040
Neg logit contributions
 staircase-0.068
Run-0.065
 flame-0.064
 spe-0.064
 tad-0.064
Token time
Token ID640
Feature activation-4.22e-02

Pos logit contributions
my+0.019
ophone+0.015
umper+0.014
ingu+0.013
tie+0.013
Neg logit contributions
 staircase-0.030
 spe-0.029
 site-0.028
uders-0.028
 Jess-0.028
Token,
Token ID11
Feature activation-5.73e-02

Pos logit contributions
my+0.070
ophone+0.055
umper+0.054
ingu+0.050
tie+0.050
Neg logit contributions
 staircase-0.081
 spe-0.075
 site-0.075
uders-0.075
 Jess-0.075
Token there
Token ID612
Feature activation-3.72e-02

Pos logit contributions
my+0.073
umper+0.057
ophone+0.055
ingu+0.052
 pads+0.050
Neg logit contributions
 staircase-0.128
 spe-0.123
 Jess-0.122
Run-0.121
 flame-0.120
Token wet
Token ID9583
Feature activation+1.15e-02

Pos logit contributions
my+0.028
ophone+0.020
 best+0.019
umper+0.019
 especially+0.019
Neg logit contributions
 staircase-0.039
uders-0.037
 site-0.037
 spe-0.037
 tad-0.036
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
 staircase+0.026
 site+0.025
uders+0.025
 Jess+0.025
 spe+0.025
Neg logit contributions
my-0.016
 best-0.010
 around-0.010
 especially-0.010
ophone-0.010
Token laughing
Token ID14376
Feature activation+2.87e-03

Pos logit contributions
my+0.037
umper+0.029
ophone+0.029
ingu+0.026
tie+0.026
Neg logit contributions
 staircase-0.045
 spe-0.043
 tad-0.042
 site-0.042
 swarm-0.041
Token.
Token ID13
Feature activation-9.26e-03

Pos logit contributions
 staircase+0.006
uders+0.006
 site+0.006
 spe+0.006
 tad+0.006
Neg logit contributions
my-0.004
ophone-0.003
 best-0.003
 especially-0.003
umper-0.003
Token<|endoftext|>
Token ID50256
Feature activation-3.55e-02

Pos logit contributions
my+0.014
ophone+0.012
umper+0.011
ingu+0.010
tie+0.010
Neg logit contributions
 Jess-0.018
 staircase-0.017
 tad-0.017
 site-0.017
Run-0.017
TokenOnce
Token ID7454
Feature activation-9.25e-02

Pos logit contributions
my+0.054
ophone+0.045
umper+0.043
 happiest+0.042
ingu+0.041
Neg logit contributions
 staircase-0.068
 site-0.066
 Jess-0.064
 tad-0.064
Run-0.063
Token upon
Token ID2402
Feature activation-3.72e-02

Pos logit contributions
umper+0.135
tie+0.135
my+0.134
 pads+0.131
ophone+0.128
Neg logit contributions
 Jess-0.163
 staircase-0.161
yt-0.157
 swarm-0.150
 site-0.144
Token a
Token ID257
Feature activation-1.47e-02

Pos logit contributions
my+0.057
umper+0.047
ophone+0.047
ingu+0.043
ip+0.043
Neg logit contributions
 staircase-0.070
Run-0.068
 tad-0.067
 spe-0.067
 flame-0.067
Token time
Token ID640
Feature activation-4.58e-02

Pos logit contributions
my+0.022
umper+0.017
ophone+0.017
ingu+0.015
tie+0.015
Neg logit contributions
 staircase-0.029
 spe-0.028
 site-0.028
Run-0.027
 swarm-0.027
Token,
Token ID11
Feature activation-5.77e-02

Pos logit contributions
my+0.078
umper+0.062
ophone+0.061
tie+0.057
ingu+0.057
Neg logit contributions
 staircase-0.086
uders-0.079
 site-0.079
 Jess-0.079
 spe-0.079
Token there
Token ID612
Feature activation-3.87e-02

Pos logit contributions
my+0.077
umper+0.064
ingu+0.060
ophone+0.059
 pads+0.057
Neg logit contributions
 staircase-0.123
 Jess-0.117
Run-0.117
 spe-0.116
 flame-0.116
Token became
Token ID2627
Feature activation-2.08e-02

Pos logit contributions
my+0.013
ophone+0.011
umper+0.011
ingu+0.010
 best+0.009
Neg logit contributions
 staircase-0.020
 spe-0.020
 tad-0.020
 slit-0.020
Run-0.019
Token good
Token ID922
Feature activation-1.13e-02

Pos logit contributions
my+0.029
ophone+0.022
umper+0.020
ingu+0.018
tie+0.018
Neg logit contributions
 staircase-0.045
 tad-0.045
 spe-0.045
 swarm-0.043
 Jess-0.043
Token friends
Token ID2460
Feature activation-4.86e-02

Pos logit contributions
my+0.012
 best+0.007
umper+0.007
 especially+0.006
ophone+0.006
Neg logit contributions
 staircase-0.029
 site-0.028
 spe-0.028
uders-0.028
 Jess-0.027
Token.
Token ID13
Feature activation-6.38e-03

Pos logit contributions
my+0.081
ophone+0.067
umper+0.066
ingu+0.061
tie+0.056
Neg logit contributions
 staircase-0.089
 spe-0.086
 Jess-0.083
 tad-0.083
 site-0.083
Token<|endoftext|>
Token ID50256
Feature activation-3.40e-02

Pos logit contributions
my+0.014
ophone+0.012
umper+0.012
tie+0.012
wy+0.012
Neg logit contributions
 Jess-0.008
 staircase-0.007
uders-0.007
 tad-0.007
 spe-0.007
TokenOnce
Token ID7454
Feature activation-9.25e-02

Pos logit contributions
my+0.051
ophone+0.041
umper+0.039
 happiest+0.038
ingu+0.037
Neg logit contributions
 staircase-0.068
 site-0.065
 Jess-0.063
 spe-0.063
 tad-0.063
Token upon
Token ID2402
Feature activation-3.31e-02

Pos logit contributions
tie+0.138
umper+0.137
my+0.136
 pads+0.136
ophone+0.130
Neg logit contributions
 staircase-0.156
 Jess-0.155
yt-0.154
 swarm-0.145
 site-0.141
Token a
Token ID257
Feature activation-1.55e-02

Pos logit contributions
my+0.050
umper+0.041
ophone+0.040
ip+0.037
ingu+0.037
Neg logit contributions
 staircase-0.063
Run-0.061
 spe-0.061
 tad-0.060
 flame-0.060
Token time
Token ID640
Feature activation-4.25e-02

Pos logit contributions
my+0.022
umper+0.017
ophone+0.017
tie+0.015
ingu+0.015
Neg logit contributions
 staircase-0.032
 spe-0.030
 site-0.030
uders-0.030
 swarm-0.029
Token,
Token ID11
Feature activation-5.81e-02

Pos logit contributions
my+0.072
umper+0.057
ophone+0.057
tie+0.053
ingu+0.053
Neg logit contributions
 staircase-0.079
 site-0.073
 spe-0.073
uders-0.073
 Jess-0.073
Token there
Token ID612
Feature activation-3.77e-02

Pos logit contributions
my+0.078
umper+0.067
ophone+0.062
ingu+0.062
 pads+0.061
Neg logit contributions
 staircase-0.120
 spe-0.115
 flame-0.115
 Jess-0.114
Run-0.113
Token ran
Token ID4966
Feature activation+9.64e-03

Pos logit contributions
my+0.011
ophone+0.008
umper+0.008
ingu+0.007
 best+0.007
Neg logit contributions
 staircase-0.016
 spe-0.015
uders-0.015
 tad-0.015
 site-0.015
Token and
Token ID290
Feature activation-2.45e-02

Pos logit contributions
 staircase+0.023
 site+0.022
 spe+0.022
uders+0.022
 tad+0.022
Neg logit contributions
my-0.012
ophone-0.008
umper-0.007
ingu-0.007
 especially-0.007
Token laughed
Token ID13818
Feature activation-1.33e-02

Pos logit contributions
my+0.035
umper+0.030
ingu+0.028
ophone+0.028
tie+0.025
Neg logit contributions
 staircase-0.049
 spe-0.048
 tad-0.046
 slit-0.046
 Jess-0.046
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
my+0.021
umper+0.017
ophone+0.016
ingu+0.015
 pads+0.014
Neg logit contributions
 staircase-0.026
 spe-0.024
 click-0.024
 Jess-0.024
 site-0.024
Token<|endoftext|>
Token ID50256
Feature activation-4.10e-02

Pos logit contributions
my+0.019
umper+0.018
ophone+0.018
tie+0.018
wy+0.017
Neg logit contributions
 Jess-0.019
 staircase-0.018
 spe-0.017
 swarm-0.017
 collapse-0.017
TokenOnce
Token ID7454
Feature activation-9.24e-02

Pos logit contributions
my+0.067
ophone+0.060
 happiest+0.058
umper+0.057
tie+0.054
Neg logit contributions
 staircase-0.070
 site-0.070
 Jess-0.067
 spe-0.065
 gap-0.065
Token upon
Token ID2402
Feature activation-3.93e-02

Pos logit contributions
tie+0.144
umper+0.142
wy+0.137
 pads+0.136
ophone+0.134
Neg logit contributions
yt-0.158
 staircase-0.156
 Jess-0.152
 swarm-0.139
 site-0.138
Token a
Token ID257
Feature activation-1.95e-02

Pos logit contributions
my+0.060
umper+0.052
ophone+0.051
ip+0.048
ingu+0.046
Neg logit contributions
 staircase-0.071
Run-0.069
 flame-0.068
 spe-0.067
 tad-0.067
Token time
Token ID640
Feature activation-4.77e-02

Pos logit contributions
my+0.029
umper+0.023
ophone+0.023
tie+0.020
ingu+0.020
Neg logit contributions
 staircase-0.039
 spe-0.037
 site-0.037
yt-0.036
uders-0.036
Token,
Token ID11
Feature activation-5.67e-02

Pos logit contributions
my+0.084
umper+0.070
ophone+0.070
tie+0.066
ingu+0.064
Neg logit contributions
 staircase-0.085
 Jess-0.079
 spe-0.077
 site-0.077
uders-0.077
Token there
Token ID612
Feature activation-4.20e-02

Pos logit contributions
my+0.079
umper+0.070
ophone+0.066
ingu+0.063
 pads+0.063
Neg logit contributions
 staircase-0.115
 spe-0.110
 Jess-0.110
Run-0.110
 flame-0.109
Token its
Token ID663
Feature activation-4.06e-02

Pos logit contributions
my+0.036
ophone+0.027
umper+0.026
ingu+0.024
tie+0.024
Neg logit contributions
 staircase-0.048
 spe-0.047
 site-0.046
 tad-0.046
uders-0.045
Token lemon
Token ID18873
Feature activation-1.82e-02

Pos logit contributions
my+0.072
ophone+0.060
umper+0.060
ingu+0.053
 forward+0.053
Neg logit contributions
 staircase-0.073
 tad-0.072
 spe-0.071
 swarm-0.070
 site-0.069
Tokenade
Token ID671
Feature activation-3.42e-02

Pos logit contributions
my+0.032
ophone+0.027
umper+0.026
ingu+0.024
 forward+0.023
Neg logit contributions
 staircase-0.031
 tad-0.030
 spe-0.030
 site-0.029
 swarm-0.029
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
my+0.053
ophone+0.042
umper+0.041
ingu+0.037
tie+0.036
Neg logit contributions
 staircase-0.068
 spe-0.064
 site-0.063
 tad-0.063
uders-0.062
Token<|endoftext|>
Token ID50256
Feature activation-4.67e-02

Pos logit contributions
my+0.031
ophone+0.026
umper+0.025
wy+0.024
tie+0.024
Neg logit contributions
 Jess-0.031
 staircase-0.030
 tad-0.029
 spe-0.028
uders-0.028
TokenOnce
Token ID7454
Feature activation-9.22e-02

Pos logit contributions
my+0.076
ophone+0.068
 happiest+0.067
umper+0.064
ingu+0.062
Neg logit contributions
 staircase-0.079
 site-0.078
 Jess-0.076
Run-0.074
 tad-0.074
Token there
Token ID612
Feature activation-5.16e-02

Pos logit contributions
tie+0.141
umper+0.140
 pads+0.140
wy+0.137
my+0.134
Neg logit contributions
 Jess-0.153
 staircase-0.152
yt-0.152
 swarm-0.139
 site-0.135
Token was
Token ID373
Feature activation-1.88e-02

Pos logit contributions
my+0.098
umper+0.096
ophone+0.095
 pads+0.089
tie+0.089
Neg logit contributions
 staircase-0.069
 Jess-0.068
 spe-0.067
 site-0.064
 tad-0.063
Token a
Token ID257
Feature activation+1.71e-02

Pos logit contributions
my+0.029
ophone+0.026
umper+0.024
ingu+0.022
 happiest+0.022
Neg logit contributions
Run-0.034
 Jess-0.034
 spe-0.034
 staircase-0.034
 flame-0.033
Token lady
Token ID10846
Feature activation+1.45e-02

Pos logit contributions
 staircase+0.030
uders+0.028
 site+0.028
 spe+0.027
 tad+0.027
Neg logit contributions
my-0.032
ophone-0.024
umper-0.024
 best-0.024
 especially-0.024
Token.
Token ID13
Feature activation-2.39e-02

Pos logit contributions
 spe+0.025
 staircase+0.024
 Jess+0.024
 site+0.023
yt+0.023
Neg logit contributions
my-0.022
umper-0.020
ingu-0.020
 happiest-0.019
ophone-0.019
Token old
Token ID1468
Feature activation-1.82e-02

Pos logit contributions
 spe+0.006
 flame+0.006
 swarm+0.006
 collapse+0.006
 crumble+0.006
Neg logit contributions
my-0.004
umper-0.004
ophone-0.004
ingu-0.003
tie-0.003
Token and
Token ID290
Feature activation-3.69e-02

Pos logit contributions
my+0.026
umper+0.019
ophone+0.019
ingu+0.018
tie+0.016
Neg logit contributions
 staircase-0.039
 spe-0.037
 site-0.037
uders-0.037
 tad-0.036
Token rotten
Token ID36371
Feature activation+9.79e-03

Pos logit contributions
my+0.060
umper+0.048
ophone+0.047
ingu+0.044
tie+0.042
Neg logit contributions
 spe-0.071
 staircase-0.069
 tad-0.067
 swarm-0.066
 flame-0.066
Token.
Token ID13
Feature activation-7.47e-04

Pos logit contributions
 staircase+0.020
 spe+0.019
 site+0.019
uders+0.019
 tad+0.019
Neg logit contributions
my-0.015
ophone-0.011
umper-0.010
 best-0.010
 especially-0.010
Token<|endoftext|>
Token ID50256
Feature activation-3.13e-02

Pos logit contributions
my+0.001
ophone+0.001
umper+0.001
 especially+0.001
ingu+0.001
Neg logit contributions
 staircase-0.002
 spe-0.002
 site-0.002
uders-0.002
 tad-0.002
TokenOnce
Token ID7454
Feature activation-9.22e-02

Pos logit contributions
my+0.050
ophone+0.041
 happiest+0.039
umper+0.038
ingu+0.037
Neg logit contributions
 staircase-0.058
 site-0.057
 Jess-0.055
 tad-0.055
 spe-0.054
Token upon
Token ID2402
Feature activation-3.33e-02

Pos logit contributions
 pads+0.138
tie+0.138
umper+0.137
my+0.137
ophone+0.132
Neg logit contributions
 Jess-0.158
 staircase-0.158
yt-0.151
 swarm-0.144
 site-0.143
Token a
Token ID257
Feature activation-1.25e-02

Pos logit contributions
my+0.051
ophone+0.042
umper+0.041
ip+0.038
ingu+0.038
Neg logit contributions
 staircase-0.064
Run-0.061
 spe-0.061
 tad-0.061
 flame-0.060
Token time
Token ID640
Feature activation-4.61e-02

Pos logit contributions
my+0.018
ophone+0.014
umper+0.014
tie+0.013
ingu+0.012
Neg logit contributions
 staircase-0.026
 spe-0.025
 site-0.024
uders-0.024
 swarm-0.024
Token,
Token ID11
Feature activation-5.91e-02

Pos logit contributions
my+0.079
ophone+0.064
umper+0.064
tie+0.060
ingu+0.059
Neg logit contributions
 staircase-0.084
 spe-0.078
 Jess-0.078
 site-0.078
uders-0.078
Token there
Token ID612
Feature activation-3.75e-02

Pos logit contributions
my+0.080
umper+0.069
ophone+0.063
ingu+0.063
 pads+0.062
Neg logit contributions
 staircase-0.123
 spe-0.118
 flame-0.118
 Jess-0.116
Run-0.115
Token became
Token ID2627
Feature activation-1.69e-02

Pos logit contributions
my+0.004
umper+0.003
ophone+0.003
ingu+0.003
tie+0.003
Neg logit contributions
 staircase-0.006
 spe-0.006
uders-0.006
 slit-0.006
 Jess-0.006
Token best
Token ID1266
Feature activation-2.77e-02

Pos logit contributions
my+0.026
ophone+0.023
umper+0.020
ingu+0.019
tie+0.019
Neg logit contributions
 tad-0.033
 spe-0.032
 staircase-0.030
 swarm-0.030
 flame-0.030
Token friends
Token ID2460
Feature activation-4.89e-02

Pos logit contributions
my+0.031
ophone+0.019
umper+0.018
 best+0.018
 especially+0.017
Neg logit contributions
 staircase-0.070
 site-0.067
uders-0.066
 spe-0.066
 tad-0.065
Token.
Token ID13
Feature activation-2.42e-03

Pos logit contributions
my+0.092
ophone+0.078
umper+0.077
ingu+0.075
wy+0.069
Neg logit contributions
 staircase-0.075
 spe-0.075
 Jess-0.072
 site-0.071
uders-0.070
Token<|endoftext|>
Token ID50256
Feature activation-3.04e-02

Pos logit contributions
my+0.005
ophone+0.004
umper+0.004
tie+0.004
 happiest+0.004
Neg logit contributions
 Jess-0.003
 spe-0.003
 staircase-0.003
uders-0.003
 tad-0.003
TokenOnce
Token ID7454
Feature activation-9.21e-02

Pos logit contributions
my+0.047
ophone+0.038
umper+0.036
 happiest+0.036
ingu+0.035
Neg logit contributions
 staircase-0.058
 site-0.057
 spe-0.055
fire-0.055
 tad-0.054
Token upon
Token ID2402
Feature activation-3.22e-02

Pos logit contributions
my+0.135
tie+0.134
umper+0.134
 pads+0.132
wy+0.125
Neg logit contributions
 Jess-0.157
 staircase-0.154
yt-0.153
 site-0.142
 swarm-0.141
Token a
Token ID257
Feature activation-8.32e-03

Pos logit contributions
my+0.050
umper+0.042
ophone+0.041
ip+0.039
ingu+0.038
Neg logit contributions
 staircase-0.059
Run-0.058
 spe-0.057
 flame-0.056
 tad-0.056
Token time
Token ID640
Feature activation-4.10e-02

Pos logit contributions
my+0.012
umper+0.010
ophone+0.010
tie+0.009
ingu+0.009
Neg logit contributions
 staircase-0.016
 spe-0.016
 site-0.015
uders-0.015
yt-0.015
Token,
Token ID11
Feature activation-5.93e-02

Pos logit contributions
my+0.071
umper+0.058
ophone+0.057
tie+0.054
ingu+0.053
Neg logit contributions
 staircase-0.074
 spe-0.069
 site-0.069
 Jess-0.069
uders-0.068
Token a
Token ID257
Feature activation+1.93e-02

Pos logit contributions
my+0.080
umper+0.069
ingu+0.064
ophone+0.062
 pads+0.060
Neg logit contributions
 staircase-0.124
 spe-0.119
 flame-0.117
Run-0.117
 Jess-0.117
Token time
Token ID640
Feature activation+6.12e-03

Pos logit contributions
 staircase+0.051
 site+0.048
 spe+0.048
uders+0.047
 tad+0.047
Neg logit contributions
my-0.029
umper-0.020
ophone-0.020
ingu-0.018
 especially-0.017
Token playing
Token ID2712
Feature activation-2.30e-02

Pos logit contributions
 staircase+0.013
 spe+0.012
 site+0.012
uders+0.012
 tad+0.012
Neg logit contributions
my-0.010
ophone-0.007
umper-0.007
ingu-0.007
 happiest-0.006
Token together
Token ID1978
Feature activation-5.40e-03

Pos logit contributions
my+0.029
 best+0.021
 around+0.021
 on+0.021
 especially+0.020
Neg logit contributions
 staircase-0.053
 spe-0.052
yt-0.051
uders-0.051
 swarm-0.050
Token.
Token ID13
Feature activation-9.62e-03

Pos logit contributions
my+0.008
ophone+0.006
umper+0.006
 best+0.005
ingu+0.005
Neg logit contributions
 staircase-0.011
 site-0.011
 spe-0.011
uders-0.011
 tad-0.010
Token<|endoftext|>
Token ID50256
Feature activation-3.54e-02

Pos logit contributions
my+0.016
umper+0.014
ophone+0.014
 happiest+0.013
tie+0.013
Neg logit contributions
 Jess-0.016
 staircase-0.016
 site-0.016
 tad-0.016
Run-0.016
TokenOnce
Token ID7454
Feature activation-9.21e-02

Pos logit contributions
my+0.054
ophone+0.045
umper+0.043
 happiest+0.042
ingu+0.041
Neg logit contributions
 staircase-0.068
 site-0.066
 Jess-0.064
 tad-0.063
 spe-0.063
Token upon
Token ID2402
Feature activation-3.15e-02

Pos logit contributions
umper+0.133
tie+0.132
my+0.130
 pads+0.128
ophone+0.127
Neg logit contributions
 Jess-0.161
 staircase-0.159
yt-0.157
 swarm-0.150
 flame-0.145
Token a
Token ID257
Feature activation-1.32e-02

Pos logit contributions
my+0.046
ophone+0.037
umper+0.036
ingu+0.033
ip+0.033
Neg logit contributions
 staircase-0.063
Run-0.060
 spe-0.060
 tad-0.060
 flame-0.059
Token time
Token ID640
Feature activation-4.33e-02

Pos logit contributions
my+0.018
ophone+0.014
umper+0.013
ingu+0.012
tie+0.012
Neg logit contributions
 staircase-0.029
 spe-0.027
 site-0.027
uders-0.027
Run-0.026
Token,
Token ID11
Feature activation-5.84e-02

Pos logit contributions
my+0.071
ophone+0.054
umper+0.053
ingu+0.050
 especially+0.049
Neg logit contributions
 staircase-0.085
 site-0.079
 spe-0.079
uders-0.079
 tad-0.078
Token there
Token ID612
Feature activation-3.79e-02

Pos logit contributions
my+0.075
umper+0.061
ophone+0.057
ingu+0.056
 pads+0.054
Neg logit contributions
 staircase-0.128
 spe-0.122
 flame-0.121
Run-0.121
 Jess-0.120
Token on
Token ID319
Feature activation+1.22e-02

Pos logit contributions
my+0.028
ophone+0.020
umper+0.020
ingu+0.018
 especially+0.018
Neg logit contributions
 staircase-0.043
 spe-0.041
uders-0.040
 site-0.040
 tad-0.039
Token the
Token ID262
Feature activation-5.67e-03

Pos logit contributions
 staircase+0.029
 spe+0.027
uders+0.027
 swarm+0.027
Run+0.026
Neg logit contributions
my-0.016
 best-0.011
 especially-0.010
 on-0.010
 all-0.010
Token sack
Token ID23704
Feature activation+3.15e-03

Pos logit contributions
my+0.012
ophone+0.010
 best+0.010
umper+0.010
 especially+0.009
Neg logit contributions
 staircase-0.008
uders-0.008
 site-0.007
 tad-0.007
Run-0.007
Token.
Token ID13
Feature activation-8.29e-03

Pos logit contributions
 staircase+0.007
uders+0.007
 spe+0.007
 tad+0.007
Run+0.007
Neg logit contributions
my-0.004
ophone-0.003
 best-0.003
 especially-0.003
umper-0.003
Token<|endoftext|>
Token ID50256
Feature activation-3.50e-02

Pos logit contributions
my+0.015
ophone+0.012
umper+0.012
 especially+0.011
tie+0.011
Neg logit contributions
 staircase-0.014
 Jess-0.014
uders-0.013
 site-0.013
 tad-0.013
TokenOnce
Token ID7454
Feature activation-9.21e-02

Pos logit contributions
my+0.054
ophone+0.045
umper+0.044
 happiest+0.042
ingu+0.041
Neg logit contributions
 staircase-0.066
 site-0.065
 Jess-0.063
 tad-0.062
 spe-0.062
Token upon
Token ID2402
Feature activation-3.31e-02

Pos logit contributions
umper+0.135
tie+0.134
my+0.131
ophone+0.130
 pads+0.128
Neg logit contributions
 Jess-0.156
yt-0.154
 staircase-0.152
 swarm-0.143
 site-0.140
Token a
Token ID257
Feature activation-1.21e-02

Pos logit contributions
my+0.050
umper+0.041
ophone+0.041
ip+0.037
ingu+0.037
Neg logit contributions
 staircase-0.063
Run-0.061
 spe-0.060
 tad-0.060
 flame-0.060
Token time
Token ID640
Feature activation-4.18e-02

Pos logit contributions
my+0.018
umper+0.014
ophone+0.014
tie+0.012
ingu+0.012
Neg logit contributions
 staircase-0.025
 spe-0.024
 site-0.024
uders-0.023
Run-0.023
Token,
Token ID11
Feature activation-5.91e-02

Pos logit contributions
my+0.070
umper+0.055
ophone+0.055
ingu+0.051
tie+0.051
Neg logit contributions
 staircase-0.079
 site-0.074
uders-0.073
 spe-0.073
 Jess-0.073
Token in
Token ID287
Feature activation-1.31e-02

Pos logit contributions
my+0.079
umper+0.069
ingu+0.063
ophone+0.063
 pads+0.061
Neg logit contributions
 staircase-0.122
Run-0.117
 flame-0.117
 spe-0.116
 Jess-0.116
Token no
Token ID645
Feature activation+1.88e-03

Pos logit contributions
my+0.009
umper+0.007
ophone+0.007
ingu+0.006
 pads+0.006
Neg logit contributions
 staircase-0.015
 spe-0.014
 Jess-0.014
 tad-0.014
 site-0.014
Token longer
Token ID2392
Feature activation+1.59e-02

Pos logit contributions
 staircase+0.004
 site+0.003
uders+0.003
 Jess+0.003
fire+0.003
Neg logit contributions
my-0.003
 best-0.002
 especially-0.002
umper-0.002
ophone-0.002
Token scared
Token ID12008
Feature activation+4.51e-02

Pos logit contributions
 staircase+0.033
 site+0.031
uders+0.031
 spe+0.031
 tad+0.030
Neg logit contributions
my-0.025
ophone-0.017
umper-0.017
 best-0.017
 especially-0.017
Token of
Token ID286
Feature activation+6.60e-03

Pos logit contributions
 staircase+0.100
 site+0.094
uders+0.094
 spe+0.094
 tad+0.093
Neg logit contributions
my-0.064
ophone-0.044
umper-0.043
 best-0.041
 especially-0.041
Token the
Token ID262
Feature activation+7.77e-03

Pos logit contributions
 staircase+0.014
 site+0.013
uders+0.013
 spe+0.013
 tad+0.013
Neg logit contributions
my-0.010
ophone-0.007
umper-0.007
 best-0.006
 especially-0.006
Token dark
Token ID3223
Feature activation+6.40e-02

Pos logit contributions
 staircase+0.012
uders+0.011
 site+0.011
 spe+0.011
 tad+0.011
Neg logit contributions
my-0.016
ophone-0.012
 best-0.012
umper-0.012
 especially-0.012
Token because
Token ID780
Feature activation+7.47e-03

Pos logit contributions
 staircase+0.134
 tad+0.134
uders+0.132
Run+0.131
inct+0.131
Neg logit contributions
my-0.097
 especially-0.062
 around-0.059
 best-0.059
 would-0.057
Token she
Token ID673
Feature activation-1.16e-02

Pos logit contributions
 staircase+0.015
 spe+0.014
 Jess+0.014
 site+0.014
 flame+0.013
Neg logit contributions
my-0.012
umper-0.010
ophone-0.010
ingu-0.009
 pads-0.009
Token knew
Token ID2993
Feature activation+1.96e-03

Pos logit contributions
my+0.016
ophone+0.012
umper+0.012
ingu+0.011
 best+0.010
Neg logit contributions
 staircase-0.025
 spe-0.024
uders-0.024
 tad-0.024
 site-0.024
Token that
Token ID326
Feature activation-1.73e-02

Pos logit contributions
 staircase+0.004
 Jess+0.004
 spe+0.004
 site+0.004
uders+0.004
Neg logit contributions
my-0.003
umper-0.002
ophone-0.002
ingu-0.002
tie-0.002
Token the
Token ID262
Feature activation-2.65e-03

Pos logit contributions
my+0.026
ophone+0.019
umper+0.019
ingu+0.018
tie+0.017
Neg logit contributions
 staircase-0.037
 spe-0.035
 site-0.034
 Jess-0.034
uders-0.034
Token bite
Token ID13197
Feature activation+1.33e-02

Pos logit contributions
 staircase+0.092
uders+0.085
yt+0.084
 site+0.084
inct+0.084
Neg logit contributions
my-0.074
 especially-0.053
umper-0.050
 best-0.050
ophone-0.049
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
 tad+0.025
Run+0.023
pour+0.022
 spe+0.022
 site+0.022
Neg logit contributions
my-0.021
ophone-0.018
ingu-0.017
 happiest-0.016
umper-0.015
Token He
Token ID679
Feature activation+1.79e-02

Pos logit contributions
 staircase+0.030
 site+0.028
uders+0.028
 tad+0.028
 spe+0.028
Neg logit contributions
my-0.030
ophone-0.023
umper-0.022
 especially-0.022
ingu-0.021
Token found
Token ID1043
Feature activation-9.45e-03

Pos logit contributions
 staircase+0.038
uders+0.036
 site+0.036
 spe+0.035
 click+0.035
Neg logit contributions
my-0.028
ophone-0.019
umper-0.019
 especially-0.018
 best-0.018
Token a
Token ID257
Feature activation+3.89e-02

Pos logit contributions
my+0.015
ophone+0.012
ingu+0.011
umper+0.011
 especially+0.011
Neg logit contributions
 tad-0.018
 spe-0.018
 Jess-0.018
 staircase-0.017
Run-0.017
Token juicy
Token ID35075
Feature activation+6.14e-02

Pos logit contributions
uders+0.078
 staircase+0.077
 site+0.074
Run+0.074
 Jess+0.073
Neg logit contributions
my-0.063
 best-0.048
 would-0.044
 especially-0.043
 favorite-0.042
Token apple
Token ID17180
Feature activation+2.26e-02

Pos logit contributions
 staircase+0.120
uders+0.115
 site+0.113
Run+0.113
 Jess+0.112
Neg logit contributions
my-0.105
 best-0.074
 especially-0.071
tie-0.070
umper-0.069
Token on
Token ID319
Feature activation+3.45e-02

Pos logit contributions
 staircase+0.050
 spe+0.047
 site+0.047
uders+0.047
 tad+0.047
Neg logit contributions
my-0.031
ophone-0.022
umper-0.021
 best-0.020
ingu-0.020
Token the
Token ID262
Feature activation+4.80e-03

Pos logit contributions
 staircase+0.084
uders+0.079
 site+0.079
 spe+0.078
 tad+0.078
Neg logit contributions
my-0.042
umper-0.026
ophone-0.025
 especially-0.024
 best-0.024
Token ground
Token ID2323
Feature activation+1.57e-02

Pos logit contributions
 staircase+0.008
 spe+0.007
 site+0.007
uders+0.007
 Jess+0.007
Neg logit contributions
my-0.010
 best-0.007
ophone-0.007
umper-0.007
 especially-0.007
Token and
Token ID290
Feature activation-3.19e-02

Pos logit contributions
 staircase+0.033
 site+0.032
 spe+0.031
uders+0.031
 tad+0.031
Neg logit contributions
my-0.022
ophone-0.017
umper-0.017
ingu-0.016
 best-0.015
Token with
Token ID351
Feature activation-4.11e-02

Pos logit contributions
my+0.008
ophone+0.005
 best+0.005
 especially+0.005
 forward+0.005
Neg logit contributions
 staircase-0.014
uders-0.013
 site-0.013
 spe-0.013
 tad-0.013
Token the
Token ID262
Feature activation-1.70e-02

Pos logit contributions
my+0.072
ophone+0.054
umper+0.053
 best+0.052
 especially+0.051
Neg logit contributions
 staircase-0.077
 site-0.072
uders-0.072
 spe-0.072
 tad-0.070
Token bird
Token ID6512
Feature activation+2.00e-03

Pos logit contributions
my+0.031
 best+0.023
ophone+0.023
umper+0.022
 happiest+0.022
Neg logit contributions
 staircase-0.031
uders-0.029
 site-0.029
 spe-0.028
 Jess-0.028
Token.
Token ID13
Feature activation-2.09e-03

Pos logit contributions
 staircase+0.004
 spe+0.004
 Jess+0.004
 tad+0.004
uders+0.004
Neg logit contributions
my-0.003
umper-0.002
ophone-0.002
ingu-0.002
 especially-0.002
Token Soon
Token ID15894
Feature activation+2.65e-02

Pos logit contributions
my+0.003
ophone+0.002
umper+0.002
 best+0.002
 especially+0.002
Neg logit contributions
 staircase-0.004
 site-0.004
 spe-0.004
uders-0.004
 tad-0.004
Token a
Token ID257
Feature activation+6.62e-02

Pos logit contributions
 staircase+0.059
uders+0.056
 site+0.055
 spe+0.055
 tad+0.054
Neg logit contributions
my-0.037
 best-0.025
ophone-0.024
umper-0.024
 especially-0.023
Token lot
Token ID1256
Feature activation+4.46e-02

Pos logit contributions
uders+0.135
 staircase+0.128
 site+0.124
 Jess+0.123
inct+0.122
Neg logit contributions
my-0.103
 best-0.082
 happiest-0.075
 would-0.074
 especially-0.072
Token of
Token ID286
Feature activation+2.34e-02

Pos logit contributions
 staircase+0.104
uders+0.101
 site+0.098
inct+0.097
 spe+0.097
Neg logit contributions
my-0.058
ophone-0.037
 best-0.036
umper-0.035
 especially-0.035
Token animals
Token ID4695
Feature activation-7.36e-03

Pos logit contributions
 staircase+0.040
 spe+0.037
 site+0.037
 tad+0.037
uders+0.036
Neg logit contributions
my-0.045
ophone-0.036
umper-0.035
ingu-0.033
 especially-0.033
Token joined
Token ID5399
Feature activation-1.08e-02

Pos logit contributions
my+0.011
ophone+0.008
umper+0.008
ingu+0.008
 best+0.008
Neg logit contributions
 staircase-0.015
 spe-0.014
uders-0.014
 site-0.014
 tad-0.014
Token in
Token ID287
Feature activation+1.20e-02

Pos logit contributions
my+0.013
umper+0.008
ophone+0.008
 especially+0.007
 best+0.007
Neg logit contributions
 staircase-0.027
 site-0.025
 spe-0.025
uders-0.025
 collapse-0.024
Token.
Token ID13
Feature activation-8.97e-03

Pos logit contributions
 staircase+0.024
uders+0.023
 spe+0.022
 site+0.022
 tad+0.022
Neg logit contributions
my-0.015
 especially-0.010
 best-0.010
ophone-0.010
umper-0.009
Token Jack
Token ID3619
Feature activation-1.45e-03

Pos logit contributions
my+0.017
 especially+0.012
umper+0.012
 best+0.011
ophone+0.011
Neg logit contributions
 staircase-0.017
 spe-0.016
uders-0.016
 site-0.015
 swarm-0.015
Token put
Token ID1234
Feature activation-4.26e-02

Pos logit contributions
my+0.002
ophone+0.002
umper+0.002
 best+0.002
 especially+0.001
Neg logit contributions
 staircase-0.003
 spe-0.003
 site-0.003
uders-0.003
 tad-0.003
Token on
Token ID319
Feature activation+2.43e-02

Pos logit contributions
my+0.049
ophone+0.037
umper+0.036
 happiest+0.034
ingu+0.032
Neg logit contributions
 staircase-0.094
 tad-0.092
 spe-0.091
Run-0.090
 flame-0.090
Token a
Token ID257
Feature activation+2.52e-02

Pos logit contributions
 staircase+0.054
 spe+0.051
uders+0.051
 site+0.051
 tad+0.050
Neg logit contributions
my-0.034
ophone-0.024
umper-0.023
 best-0.022
 especially-0.022
Token big
Token ID1263
Feature activation+6.44e-02

Pos logit contributions
 staircase+0.046
 spe+0.042
 site+0.041
 tad+0.041
 flame+0.041
Neg logit contributions
my-0.046
umper-0.037
ophone-0.035
ingu-0.033
 pads-0.032
Token a
Token ID257
Feature activation+3.20e-02

Pos logit contributions
 staircase+0.116
 spe+0.107
 site+0.106
uders+0.104
 tad+0.104
Neg logit contributions
my-0.117
ophone-0.091
umper-0.090
ingu-0.084
 especially-0.084
Tokenpron
Token ID31186
Feature activation+1.55e-02

Pos logit contributions
 staircase+0.060
 spe+0.056
 site+0.056
 tad+0.055
uders+0.054
Neg logit contributions
my-0.056
umper-0.042
ophone-0.042
ingu-0.039
 especially-0.039
Token and
Token ID290
Feature activation-2.95e-02

Pos logit contributions
 staircase+0.035
 site+0.034
 spe+0.033
uders+0.033
 tad+0.033
Neg logit contributions
my-0.020
umper-0.014
ophone-0.014
ingu-0.013
 best-0.013
Token pretended
Token ID36766
Feature activation-2.82e-02

Pos logit contributions
my+0.042
ophone+0.030
 especially+0.028
 best+0.028
 pads+0.027
Neg logit contributions
 staircase-0.064
uders-0.063
 site-0.062
 spe-0.061
 swarm-0.060
Token he
Token ID339
Feature activation-3.20e-02

Pos logit contributions
my+0.038
umper+0.026
ophone+0.026
ingu+0.024
 best+0.023
Neg logit contributions
 staircase-0.064
 site-0.061
 spe-0.060
uders-0.060
 Jess-0.060
TokenThe
Token ID464
Feature activation-9.48e-03

Pos logit contributions
 staircase+0.004
 site+0.004
Run+0.004
 Jess+0.004
 flame+0.004
Neg logit contributions
my-0.003
ophone-0.003
umper-0.003
ingu-0.003
 best-0.003
Token boy
Token ID2933
Feature activation+3.19e-02

Pos logit contributions
my+0.023
umper+0.021
ophone+0.021
 pads+0.020
pping+0.020
Neg logit contributions
 staircase-0.010
 tad-0.009
Run-0.008
 flame-0.008
 site-0.008
Token was
Token ID373
Feature activation-1.23e-02

Pos logit contributions
 spe+0.065
 staircase+0.063
 Jess+0.063
 slit+0.061
 flame+0.061
Neg logit contributions
my-0.040
umper-0.035
ophone-0.033
 forward-0.031
tie-0.031
Token old
Token ID1468
Feature activation-9.63e-03

Pos logit contributions
my+0.015
ophone+0.012
umper+0.012
 pads+0.011
tie+0.011
Neg logit contributions
 spe-0.027
 staircase-0.026
 Jess-0.026
 flame-0.026
 site-0.025
Token enough
Token ID1576
Feature activation+8.02e-03

Pos logit contributions
my+0.012
ophone+0.007
umper+0.007
 best+0.007
 especially+0.007
Neg logit contributions
 staircase-0.023
uders-0.022
 site-0.022
 spe-0.022
 tad-0.022
Token to
Token ID284
Feature activation+6.07e-02

Pos logit contributions
 staircase+0.019
 spe+0.018
 site+0.018
 tad+0.018
uders+0.018
Neg logit contributions
my-0.010
ophone-0.007
umper-0.007
ingu-0.006
 happiest-0.006
Token talk
Token ID1561
Feature activation-1.14e-02

Pos logit contributions
 staircase+0.121
uders+0.114
 site+0.113
 spe+0.112
 tad+0.111
Neg logit contributions
my-0.102
umper-0.072
 especially-0.072
ophone-0.072
 best-0.069
Token,
Token ID11
Feature activation-1.26e-02

Pos logit contributions
my+0.014
umper+0.010
ophone+0.009
ingu+0.008
 pads+0.008
Neg logit contributions
 staircase-0.027
 site-0.025
 spe-0.025
uders-0.025
 Jess-0.025
Token but
Token ID475
Feature activation-9.00e-03

Pos logit contributions
my+0.018
ophone+0.013
umper+0.013
ingu+0.012
 best+0.012
Neg logit contributions
 staircase-0.027
 spe-0.026
 site-0.025
 Jess-0.025
 tad-0.025
Token he
Token ID339
Feature activation-8.60e-03

Pos logit contributions
my+0.013
umper+0.012
ophone+0.012
ingu+0.011
wy+0.010
Neg logit contributions
 staircase-0.017
 Jess-0.017
 spe-0.017
 site-0.017
Run-0.016
Token was
Token ID373
Feature activation-5.10e-03

Pos logit contributions
my+0.012
umper+0.010
ophone+0.010
ingu+0.009
 forward+0.009
Neg logit contributions
 staircase-0.017
 spe-0.017
 slit-0.016
 site-0.016
 Jess-0.016
Token that
Token ID326
Feature activation-1.22e-02

Pos logit contributions
 staircase+0.028
 site+0.027
 Jess+0.026
uders+0.026
 spe+0.026
Neg logit contributions
my-0.019
umper-0.014
ophone-0.014
ingu-0.013
 happiest-0.012
Token day
Token ID1110
Feature activation-1.93e-02

Pos logit contributions
my+0.015
ingu+0.014
umper+0.014
ophone+0.013
 pads+0.012
Neg logit contributions
 staircase-0.025
 flame-0.025
 site-0.024
Run-0.024
 Jess-0.024
Token -
Token ID532
Feature activation+7.80e-03

Pos logit contributions
my+0.035
ophone+0.027
umper+0.027
ingu+0.027
 happiest+0.025
Neg logit contributions
 staircase-0.034
 site-0.032
uders-0.031
 Jess-0.031
 spe-0.031
Token Having
Token ID11136
Feature activation-4.69e-02

Pos logit contributions
 staircase+0.016
 spe+0.016
 tad+0.016
 site+0.015
 Jess+0.015
Neg logit contributions
my-0.011
ophone-0.009
umper-0.008
ingu-0.008
tie-0.007
Token someone
Token ID2130
Feature activation-1.74e-02

Pos logit contributions
my+0.067
ophone+0.050
umper+0.048
ingu+0.045
tie+0.043
Neg logit contributions
 staircase-0.099
 spe-0.095
 tad-0.094
 site-0.094
uders-0.094
Token by
Token ID416
Feature activation+1.01e-02

Pos logit contributions
my+0.026
umper+0.022
ingu+0.022
ophone+0.020
ip+0.019
Neg logit contributions
 spe-0.031
 tad-0.030
 staircase-0.030
 Jess-0.029
uders-0.029
Token your
Token ID534
Feature activation-1.77e-02

Pos logit contributions
 staircase+0.019
 spe+0.018
 site+0.018
uders+0.018
 tad+0.018
Neg logit contributions
my-0.017
ophone-0.013
umper-0.013
 best-0.012
 especially-0.012
Token side
Token ID1735
Feature activation-1.59e-02

Pos logit contributions
my+0.032
ophone+0.026
umper+0.025
ingu+0.024
tie+0.023
Neg logit contributions
 staircase-0.031
 spe-0.030
 site-0.030
uders-0.029
 tad-0.029
Token can
Token ID460
Feature activation+1.43e-02

Pos logit contributions
my+0.024
ophone+0.019
umper+0.019
ingu+0.018
tie+0.017
Neg logit contributions
 staircase-0.032
 site-0.030
 spe-0.029
uders-0.029
 tad-0.029
Token help
Token ID1037
Feature activation+1.87e-02

Pos logit contributions
 staircase+0.027
uders+0.025
 spe+0.025
 site+0.025
 tad+0.024
Neg logit contributions
my-0.026
ophone-0.019
umper-0.019
 especially-0.018
 best-0.018
Token you
Token ID345
Feature activation-1.82e-02

Pos logit contributions
 staircase+0.037
 site+0.035
uders+0.035
 spe+0.035
 tad+0.035
Neg logit contributions
my-0.029
ophone-0.022
umper-0.021
ingu-0.020
 best-0.020
Token it
Token ID340
Feature activation-1.98e-02

Pos logit contributions
my+0.029
 happiest+0.028
ingu+0.028
ophone+0.028
 highlight+0.026
Neg logit contributions
 click-0.023
 staircase-0.023
Run-0.022
 gap-0.021
 flame-0.021
Token towards
Token ID3371
Feature activation+3.14e-02

Pos logit contributions
my+0.028
ophone+0.020
umper+0.020
ingu+0.019
 especially+0.018
Neg logit contributions
 staircase-0.043
 site-0.040
 spe-0.040
uders-0.040
 tad-0.040
Token him
Token ID683
Feature activation-1.83e-02

Pos logit contributions
 staircase+0.055
 site+0.055
 Jess+0.051
 click+0.050
uders+0.050
Neg logit contributions
my-0.052
ophone-0.046
 happiest-0.042
ip-0.042
umper-0.042
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
my+0.030
umper+0.023
ophone+0.023
ingu+0.021
 happiest+0.021
Neg logit contributions
 staircase-0.035
 site-0.033
uders-0.033
 spe-0.033
 click-0.032
Token Suddenly
Token ID24975
Feature activation+1.85e-02

Pos logit contributions
my+0.019
ophone+0.014
umper+0.014
 best+0.013
 especially+0.013
Neg logit contributions
 staircase-0.025
 site-0.024
 tad-0.023
uders-0.023
 Jess-0.023
Token,
Token ID11
Feature activation+1.66e-02

Pos logit contributions
 staircase+0.036
 site+0.034
 spe+0.034
uders+0.034
 tad+0.033
Neg logit contributions
my-0.031
ophone-0.023
umper-0.023
 especially-0.021
ingu-0.021
Token he
Token ID339
Feature activation+1.06e-02

Pos logit contributions
 staircase+0.037
Run+0.035
 site+0.035
 tad+0.035
 spe+0.034
Neg logit contributions
my-0.023
ophone-0.017
umper-0.016
 pads-0.015
ingu-0.014
Token heard
Token ID2982
Feature activation+1.42e-02

Pos logit contributions
 staircase+0.020
uders+0.019
 spe+0.018
 tad+0.018
 swarm+0.018
Neg logit contributions
my-0.018
ophone-0.014
umper-0.013
 best-0.013
ingu-0.013
Token a
Token ID257
Feature activation+5.04e-02

Pos logit contributions
 staircase+0.032
 site+0.031
uders+0.030
 spe+0.030
 tad+0.030
Neg logit contributions
my-0.019
ophone-0.013
umper-0.013
 best-0.012
ingu-0.012
Token voice
Token ID3809
Feature activation+4.34e-02

Pos logit contributions
uders+0.112
umm+0.095
keeping+0.087
 avail+0.085
edly+0.084
Neg logit contributions
my-0.088
 very-0.078
 would-0.069
 better-0.068
 back-0.067
Token coming
Token ID2406
Feature activation+4.01e-02

Pos logit contributions
 staircase+0.096
uders+0.094
 spe+0.091
umm+0.090
 tad+0.089
Neg logit contributions
my-0.064
ophone-0.043
 around-0.041
 especially-0.040
umper-0.039
Token in
Token ID287
Feature activation-3.91e-05

Pos logit contributions
my+0.008
ophone+0.006
 best+0.006
umper+0.006
 especially+0.006
Neg logit contributions
 staircase-0.013
 spe-0.012
uders-0.012
 site-0.012
 tad-0.012
Token her
Token ID607
Feature activation-3.01e-02

Pos logit contributions
my+0.000
ophone+0.000
umper+0.000
 especially+0.000
 best+0.000
Neg logit contributions
 staircase-0.000
uders-0.000
 site-0.000
 spe-0.000
 tad-0.000
Token bath
Token ID7837
Feature activation-9.81e-03

Pos logit contributions
my+0.055
ophone+0.042
umper+0.042
 especially+0.040
 best+0.039
Neg logit contributions
 staircase-0.053
 spe-0.050
 site-0.050
uders-0.050
 tad-0.049
Tokentub
Token ID37995
Feature activation-1.20e-02

Pos logit contributions
my+0.016
ophone+0.012
umper+0.012
 best+0.011
 especially+0.011
Neg logit contributions
 staircase-0.019
 site-0.018
uders-0.018
 spe-0.018
 tad-0.018
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
my+0.019
ophone+0.014
umper+0.014
ingu+0.013
 best+0.013
Neg logit contributions
 staircase-0.024
 site-0.022
 spe-0.022
uders-0.022
 tad-0.022
Token She
Token ID1375
Feature activation+8.89e-03

Pos logit contributions
my+0.027
ophone+0.022
umper+0.021
 best+0.020
ingu+0.020
Neg logit contributions
 staircase-0.021
 spe-0.020
uders-0.019
 site-0.019
 Jess-0.019
Token liked
Token ID8288
Feature activation-3.33e-02

Pos logit contributions
 staircase+0.018
 spe+0.018
 slit+0.017
 tad+0.017
 flame+0.017
Neg logit contributions
my-0.012
umper-0.010
tie-0.010
 forward-0.009
ophone-0.009
Token to
Token ID284
Feature activation+3.07e-02

Pos logit contributions
my+0.052
ophone+0.043
umper+0.041
tie+0.038
 happiest+0.037
Neg logit contributions
 spe-0.065
 staircase-0.064
 flame-0.063
 Jess-0.061
 tad-0.061
Token play
Token ID711
Feature activation-3.67e-02

Pos logit contributions
 staircase+0.056
uders+0.053
 site+0.052
 spe+0.052
 tad+0.051
Neg logit contributions
my-0.055
ophone-0.042
umper-0.041
 especially-0.040
 best-0.040
Token with
Token ID351
Feature activation-4.63e-02

Pos logit contributions
my+0.062
ophone+0.047
umper+0.046
 pads+0.043
ingu+0.043
Neg logit contributions
 staircase-0.070
 site-0.067
 spe-0.065
 tad-0.065
 flame-0.064
Token her
Token ID607
Feature activation-5.11e-02

Pos logit contributions
my+0.079
ophone+0.065
umper+0.061
 happiest+0.060
tie+0.059
Neg logit contributions
 flame-0.082
 staircase-0.082
 spe-0.079
 Jess-0.078
 tad-0.078
Token.
Token ID13
Feature activation-1.65e-02

Pos logit contributions
 staircase+0.006
 spe+0.005
uders+0.005
 site+0.005
 tad+0.005
Neg logit contributions
my-0.004
ophone-0.003
umper-0.003
ingu-0.002
 best-0.002
Token She
Token ID1375
Feature activation+1.34e-02

Pos logit contributions
my+0.026
ophone+0.021
umper+0.021
ingu+0.019
 best+0.019
Neg logit contributions
 staircase-0.032
 Jess-0.031
 site-0.030
 spe-0.029
uders-0.029
Token sped
Token ID40424
Feature activation-1.93e-02

Pos logit contributions
 staircase+0.026
 spe+0.025
 slit+0.024
 site+0.024
 tad+0.024
Neg logit contributions
my-0.022
umper-0.017
ophone-0.017
ingu-0.016
 forward-0.016
Token around
Token ID1088
Feature activation+3.51e-03

Pos logit contributions
my+0.025
ophone+0.020
umper+0.020
ingu+0.019
 happiest+0.018
Neg logit contributions
 staircase-0.042
 Jess-0.040
Run-0.039
 site-0.038
 tad-0.038
Token the
Token ID262
Feature activation-6.62e-03

Pos logit contributions
 staircase+0.008
uders+0.008
 site+0.008
 spe+0.008
 tad+0.008
Neg logit contributions
my-0.004
ophone-0.003
umper-0.003
 especially-0.003
 best-0.003
Token porch
Token ID33179
Feature activation+3.81e-02

Pos logit contributions
my+0.014
ophone+0.013
umper+0.012
tie+0.012
ingu+0.012
Neg logit contributions
 staircase-0.009
 tad-0.008
 site-0.008
 swarm-0.008
 gap-0.008
Token,
Token ID11
Feature activation-7.15e-03

Pos logit contributions
 staircase+0.081
 site+0.077
 spe+0.076
uders+0.076
 tad+0.075
Neg logit contributions
my-0.055
ophone-0.040
umper-0.040
ingu-0.037
 best-0.036
Token jumping
Token ID14284
Feature activation-8.22e-03

Pos logit contributions
my+0.011
umper+0.010
ingu+0.010
ophone+0.010
wy+0.009
Neg logit contributions
 staircase-0.013
 spe-0.012
 site-0.012
 tad-0.012
Run-0.012
Token high
Token ID1029
Feature activation+1.29e-02

Pos logit contributions
my+0.010
ophone+0.006
 best+0.006
 especially+0.006
umper+0.005
Neg logit contributions
 staircase-0.020
uders-0.019
 spe-0.019
 site-0.019
 swarm-0.019
Token and
Token ID290
Feature activation-2.56e-02

Pos logit contributions
 staircase+0.033
 swarm+0.032
uders+0.032
 tad+0.031
 click+0.031
Neg logit contributions
my-0.014
 around-0.008
 best-0.008
 everywhere-0.008
 on-0.007
Token running
Token ID2491
Feature activation+6.72e-03

Pos logit contributions
my+0.042
umper+0.037
ingu+0.035
ophone+0.034
ip+0.031
Neg logit contributions
 staircase-0.048
 spe-0.045
 Jess-0.044
 click-0.044
 site-0.043
Token hugged
Token ID41130
Feature activation-1.52e-02

Pos logit contributions
my+0.042
ophone+0.029
 best+0.029
 especially+0.027
umper+0.027
Neg logit contributions
 staircase-0.063
uders-0.060
 site-0.060
 spe-0.058
 swarm-0.057
Token each
Token ID1123
Feature activation-3.93e-02

Pos logit contributions
my+0.021
 best+0.013
 especially+0.013
 forward+0.013
ophone+0.013
Neg logit contributions
 staircase-0.035
uders-0.034
 site-0.033
 swarm-0.033
 tad-0.033
Token other
Token ID584
Feature activation-2.60e-02

Pos logit contributions
my+0.049
umper+0.041
ophone+0.040
ingu+0.039
 happiest+0.037
Neg logit contributions
 staircase-0.081
 Jess-0.080
 flame-0.080
 tad-0.079
Run-0.079
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
my+0.043
umper+0.037
ophone+0.034
ingu+0.032
 especially+0.031
Neg logit contributions
 staircase-0.048
 site-0.044
 spe-0.044
 Jess-0.044
 click-0.044
Token Then
Token ID3244
Feature activation+2.92e-02

Pos logit contributions
 staircase+0.026
uders+0.024
 spe+0.024
 site+0.024
 tad+0.023
Neg logit contributions
my-0.029
ophone-0.023
umper-0.022
 best-0.021
 especially-0.021
Token they
Token ID484
Feature activation+8.19e-03

Pos logit contributions
 staircase+0.063
uders+0.060
 site+0.060
 spe+0.059
 tad+0.059
Neg logit contributions
my-0.044
 best-0.030
ophone-0.029
 especially-0.029
umper-0.028
Token went
Token ID1816
Feature activation+1.53e-02

Pos logit contributions
 staircase+0.016
 spe+0.015
 Jess+0.014
Run+0.014
 tad+0.014
Neg logit contributions
my-0.013
ophone-0.010
umper-0.010
 best-0.009
 happiest-0.009
Token inside
Token ID2641
Feature activation+2.07e-02

Pos logit contributions
 staircase+0.036
 site+0.034
 Jess+0.033
 tad+0.033
 spe+0.033
Neg logit contributions
my-0.019
ophone-0.013
umper-0.013
 happiest-0.012
ingu-0.012
Token their
Token ID511
Feature activation-4.66e-03

Pos logit contributions
 staircase+0.051
uders+0.050
 spe+0.049
 site+0.048
Run+0.048
Neg logit contributions
my-0.025
 especially-0.014
ophone-0.014
 best-0.014
umper-0.013
Token house
Token ID2156
Feature activation+4.15e-03

Pos logit contributions
my+0.009
ophone+0.008
umper+0.007
 especially+0.007
ingu+0.007
Neg logit contributions
 staircase-0.007
 site-0.007
 spe-0.007
uders-0.007
 tad-0.007
Token,
Token ID11
Feature activation-9.56e-04

Pos logit contributions
 staircase+0.010
uders+0.009
 spe+0.009
 tad+0.009
Run+0.009
Neg logit contributions
my-0.005
ophone-0.003
 best-0.003
 especially-0.003
umper-0.003
Token time
Token ID640
Feature activation-1.00e-02

Pos logit contributions
my+0.063
ophone+0.056
umper+0.052
tie+0.048
ip+0.046
Neg logit contributions
 staircase-0.069
yt-0.068
 tad-0.068
 spe-0.068
 site-0.067
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
my+0.016
ophone+0.012
umper+0.012
ingu+0.010
tie+0.010
Neg logit contributions
 staircase-0.020
 site-0.019
 spe-0.019
uders-0.019
 tad-0.019
Token One
Token ID1881
Feature activation-6.49e-03

Pos logit contributions
my+0.023
ophone+0.020
umper+0.020
 forward+0.018
 happiest+0.018
Neg logit contributions
 staircase-0.022
 site-0.021
 Jess-0.020
 flame-0.020
 click-0.020
Token day
Token ID1110
Feature activation+2.22e-02

Pos logit contributions
my+0.008
ophone+0.008
umper+0.008
ingu+0.007
tie+0.007
Neg logit contributions
 staircase-0.013
 flame-0.013
 Jess-0.013
 spe-0.012
 swarm-0.012
Token,
Token ID11
Feature activation-2.22e-03

Pos logit contributions
 staircase+0.039
 spe+0.036
 site+0.036
Run+0.036
 Jess+0.035
Neg logit contributions
my-0.040
ophone-0.033
umper-0.032
 happiest-0.030
ingu-0.030
Token Tim
Token ID5045
Feature activation-1.63e-02

Pos logit contributions
my+0.004
ophone+0.003
umper+0.003
 pads+0.003
ingu+0.003
Neg logit contributions
 staircase-0.004
Run-0.004
 site-0.004
 spe-0.004
 Jess-0.003
Tokenmy
Token ID1820
Feature activation+3.51e-03

Pos logit contributions
 forward+0.009
tie+0.008
ophone+0.008
 happiest+0.007
 wherever+0.006
Neg logit contributions
 staircase-0.040
yt-0.040
ASH-0.039
uders-0.039
 slit-0.038
Token's
Token ID338
Feature activation-1.73e-02

Pos logit contributions
 staircase+0.007
 site+0.007
uders+0.007
 spe+0.007
yt+0.007
Neg logit contributions
my-0.004
ophone-0.004
umper-0.003
tie-0.003
 forward-0.003
Token friend
Token ID1545
Feature activation+7.38e-03

Pos logit contributions
my+0.030
ophone+0.025
umper+0.024
 pads+0.022
tie+0.022
Neg logit contributions
 staircase-0.031
 spe-0.029
 flame-0.029
 site-0.029
 tad-0.029
Token Billy
Token ID15890
Feature activation-3.96e-02

Pos logit contributions
 Jess+0.012
 staircase+0.012
Run+0.012
 site+0.011
 swarm+0.011
Neg logit contributions
my-0.012
ophone-0.011
 pads-0.010
tie-0.010
umper-0.010
Token came
Token ID1625
Feature activation+2.92e-04

Pos logit contributions
my+0.067
ophone+0.065
 happiest+0.063
tie+0.062
 favorite+0.061
Neg logit contributions
 slit-0.056
Run-0.054
 Jess-0.054
yt-0.053
 swarm-0.053
Token was
Token ID373
Feature activation+3.23e-03

Pos logit contributions
 staircase+0.050
uders+0.047
 site+0.047
 spe+0.046
 tad+0.046
Neg logit contributions
my-0.032
ophone-0.022
 especially-0.021
umper-0.021
 best-0.021
Token happy
Token ID3772
Feature activation+1.96e-02

Pos logit contributions
 staircase+0.007
 spe+0.007
 Jess+0.007
 site+0.007
 tad+0.007
Neg logit contributions
my-0.004
umper-0.003
ophone-0.003
ingu-0.003
 pads-0.003
Token to
Token ID284
Feature activation+2.45e-02

Pos logit contributions
 staircase+0.047
 spe+0.044
 site+0.044
uders+0.044
 tad+0.044
Neg logit contributions
my-0.024
ophone-0.016
umper-0.016
 best-0.014
ingu-0.014
Token see
Token ID766
Feature activation-2.45e-02

Pos logit contributions
 staircase+0.052
uders+0.049
 spe+0.049
 site+0.049
Run+0.048
Neg logit contributions
my-0.037
ophone-0.026
umper-0.026
 especially-0.025
 best-0.025
Token that
Token ID326
Feature activation-1.36e-02

Pos logit contributions
my+0.035
ophone+0.027
umper+0.025
 pads+0.025
ingu+0.024
Neg logit contributions
 staircase-0.049
 Jess-0.049
 site-0.049
 spe-0.048
 tad-0.047
Token his
Token ID465
Feature activation-1.27e-02

Pos logit contributions
my+0.018
ophone+0.013
umper+0.013
ingu+0.011
 especially+0.011
Neg logit contributions
 staircase-0.031
 spe-0.029
 site-0.029
 tad-0.029
uders-0.029
Token lesson
Token ID11483
Feature activation+2.00e-02

Pos logit contributions
my+0.021
ophone+0.018
umper+0.017
 pads+0.015
tie+0.015
Neg logit contributions
 staircase-0.023
 spe-0.022
 tad-0.022
 site-0.022
 flame-0.021
Token had
Token ID550
Feature activation-3.19e-02

Pos logit contributions
 staircase+0.043
 site+0.041
uders+0.040
 spe+0.040
 Jess+0.040
Neg logit contributions
my-0.029
umper-0.021
ophone-0.020
ingu-0.019
 especially-0.019
Token helped
Token ID4193
Feature activation+1.76e-02

Pos logit contributions
my+0.043
ophone+0.030
umper+0.029
ingu+0.027
 especially+0.027
Neg logit contributions
 staircase-0.071
uders-0.068
 site-0.068
 spe-0.067
 tad-0.067
Token so
Token ID523
Feature activation+1.48e-02

Pos logit contributions
 staircase+0.041
 spe+0.039
uders+0.039
 site+0.039
 tad+0.038
Neg logit contributions
my-0.023
ophone-0.015
umper-0.014
 best-0.014
 especially-0.014
Token many
Token ID867
Feature activation-8.10e-03

Pos logit contributions
 staircase+0.035
 spe+0.033
 site+0.033
uders+0.033
 tad+0.032
Neg logit contributions
my-0.019
ophone-0.012
umper-0.012
 best-0.011
 especially-0.011
Token Mia
Token ID32189
Feature activation+1.10e-02

Pos logit contributions
 staircase+0.006
 site+0.006
uders+0.006
inct+0.006
 collapse+0.006
Neg logit contributions
my-0.004
 best-0.002
 especially-0.002
 favorite-0.002
ophone-0.002
Token's
Token ID338
Feature activation-8.10e-03

Pos logit contributions
 staircase+0.025
 spe+0.024
 site+0.024
uders+0.024
 tad+0.024
Neg logit contributions
my-0.015
ophone-0.010
umper-0.009
 especially-0.009
 best-0.009
Token mom
Token ID1995
Feature activation+3.10e-03

Pos logit contributions
my+0.013
umper+0.010
ophone+0.010
 pads+0.009
tie+0.009
Neg logit contributions
 staircase-0.016
 site-0.015
uders-0.015
 spe-0.015
 flame-0.014
Token told
Token ID1297
Feature activation+1.67e-02

Pos logit contributions
 staircase+0.006
 site+0.006
uders+0.006
 spe+0.006
 Jess+0.006
Neg logit contributions
my-0.004
ophone-0.003
umper-0.003
 best-0.003
tie-0.003
Token her
Token ID607
Feature activation-7.71e-03

Pos logit contributions
 staircase+0.045
 spe+0.043
 site+0.043
uders+0.043
inct+0.043
Neg logit contributions
my-0.018
 especially-0.008
 best-0.008
 everywhere-0.008
umper-0.008
Token that
Token ID326
Feature activation-3.01e-02

Pos logit contributions
my+0.011
umper+0.008
ophone+0.008
 pads+0.007
ingu+0.007
Neg logit contributions
 staircase-0.017
 Jess-0.016
 site-0.016
 spe-0.016
 click-0.016
Token they
Token ID484
Feature activation-2.47e-02

Pos logit contributions
my+0.048
umper+0.038
ophone+0.037
ingu+0.035
tie+0.034
Neg logit contributions
 staircase-0.059
 spe-0.056
 Jess-0.056
 site-0.055
 flame-0.055
Token were
Token ID547
Feature activation-1.29e-02

Pos logit contributions
my+0.037
umper+0.037
ophone+0.035
ip+0.034
 pads+0.033
Neg logit contributions
 spe-0.040
 staircase-0.039
 crumble-0.038
 click-0.037
 collapse-0.037
Token going
Token ID1016
Feature activation+8.06e-03

Pos logit contributions
my+0.016
ophone+0.012
umper+0.012
tie+0.011
 pads+0.011
Neg logit contributions
 staircase-0.029
 spe-0.028
 site-0.028
uders-0.028
 flame-0.027
Token to
Token ID284
Feature activation+4.41e-02

Pos logit contributions
 staircase+0.017
 site+0.016
 spe+0.015
fire+0.015
uders+0.015
Neg logit contributions
my-0.011
umper-0.009
ophone-0.008
ingu-0.008
 happiest-0.008
Token prepare
Token ID8335
Feature activation-1.29e-02

Pos logit contributions
 staircase+0.101
umm+0.099
uders+0.099
izing+0.099
yt+0.098
Neg logit contributions
my-0.060
 especially-0.041
 best-0.037
 always-0.035
 wherever-0.035
Token him
Token ID683
Feature activation-3.01e-02

Pos logit contributions
my+0.012
ophone+0.010
umper+0.009
 happiest+0.009
 pads+0.009
Neg logit contributions
 Jess-0.016
 staircase-0.015
 spe-0.015
 site-0.014
 flame-0.014
Token to
Token ID284
Feature activation+3.11e-02

Pos logit contributions
my+0.047
umper+0.042
wy+0.041
ophone+0.038
tie+0.037
Neg logit contributions
 Jess-0.057
 tad-0.050
 staircase-0.048
 swarm-0.047
 flame-0.047
Token come
Token ID1282
Feature activation-5.13e-03

Pos logit contributions
 staircase+0.063
 site+0.058
 spe+0.058
uders+0.058
Run+0.057
Neg logit contributions
my-0.051
ophone-0.037
umper-0.037
 especially-0.036
 best-0.035
Token and
Token ID290
Feature activation-2.76e-02

Pos logit contributions
my+0.008
ingu+0.007
 happiest+0.006
ophone+0.006
umper+0.006
Neg logit contributions
 Jess-0.009
 staircase-0.009
 tad-0.009
 spe-0.009
fire-0.009
Token play
Token ID711
Feature activation-2.38e-02

Pos logit contributions
my+0.044
ophone+0.031
umper+0.030
 especially+0.029
 best+0.029
Neg logit contributions
 staircase-0.057
uders-0.053
 site-0.053
 spe-0.053
 tad-0.052
Token with
Token ID351
Feature activation-3.16e-02

Pos logit contributions
my+0.040
ophone+0.030
umper+0.029
ingu+0.028
 best+0.028
Neg logit contributions
 staircase-0.046
 site-0.043
 spe-0.043
 tad-0.043
uders-0.042
Token her
Token ID607
Feature activation-4.73e-02

Pos logit contributions
my+0.042
ophone+0.032
umper+0.029
 happiest+0.029
 pads+0.027
Neg logit contributions
 Jess-0.068
 spe-0.068
 staircase-0.067
 site-0.065
 flame-0.065
Token.
Token ID13
Feature activation-1.75e-02

Pos logit contributions
my+0.086
umper+0.072
ophone+0.068
ingu+0.067
wy+0.062
Neg logit contributions
 staircase-0.080
 spe-0.076
uders-0.075
 tad-0.075
 Jess-0.074
Token
Token ID198
Feature activation-1.52e-02

Pos logit contributions
my+0.021
ophone+0.012
ingu+0.012
 especially+0.011
umper+0.011
Neg logit contributions
 staircase-0.043
uders-0.042
 site-0.041
 spe-0.041
 tad-0.041
Token
Token ID198
Feature activation+5.25e-03

Pos logit contributions
my+0.021
ophone+0.016
umper+0.016
 happiest+0.015
 best+0.014
Neg logit contributions
 staircase-0.033
 site-0.031
 Jess-0.031
 spe-0.031
Run-0.030
TokenFl
Token ID7414
Feature activation+8.57e-03

Pos logit contributions
 staircase+0.010
uders+0.010
 tad+0.010
 site+0.010
umm+0.010
Neg logit contributions
my-0.009
 especially-0.007
 would-0.006
ophone-0.006
 best-0.006
Token had
Token ID550
Feature activation-3.50e-02

Pos logit contributions
my+0.025
 forward+0.025
umper+0.024
 highlight+0.024
ophone+0.023
Neg logit contributions
 staircase-0.042
 site-0.040
 spe-0.040
 slit-0.039
uders-0.038
Token a
Token ID257
Feature activation+1.50e-02

Pos logit contributions
my+0.059
umper+0.051
ophone+0.047
ingu+0.043
ip+0.042
Neg logit contributions
 staircase-0.063
 spe-0.062
 site-0.060
 tad-0.057
 Jess-0.057
Token big
Token ID1263
Feature activation+4.36e-02

Pos logit contributions
 staircase+0.029
 spe+0.027
 site+0.027
 tad+0.027
uders+0.026
Neg logit contributions
my-0.025
umper-0.019
ophone-0.019
 especially-0.017
ingu-0.017
Token truck
Token ID7779
Feature activation+6.37e-03

Pos logit contributions
 staircase+0.080
 spe+0.074
 site+0.074
uders+0.074
 tad+0.072
Neg logit contributions
my-0.078
ophone-0.060
umper-0.059
 especially-0.056
 best-0.056
Token and
Token ID290
Feature activation-3.69e-02

Pos logit contributions
 staircase+0.013
 site+0.012
 spe+0.012
uders+0.012
 click+0.012
Neg logit contributions
my-0.009
umper-0.007
ophone-0.007
ingu-0.007
 happiest-0.007
Token Mia
Token ID32189
Feature activation-1.04e-02

Pos logit contributions
my+0.072
ophone+0.055
 especially+0.054
 best+0.054
umper+0.053
Neg logit contributions
 staircase-0.062
uders-0.058
 site-0.058
 spe-0.057
 swarm-0.056
Token had
Token ID550
Feature activation-3.78e-02

Pos logit contributions
my+0.013
umper+0.013
ingu+0.011
 forward+0.010
ophone+0.010
Neg logit contributions
 spe-0.021
 staircase-0.021
 slit-0.020
 site-0.020
 Jess-0.020
Token a
Token ID257
Feature activation+8.92e-03

Pos logit contributions
my+0.054
umper+0.041
ophone+0.038
ingu+0.036
 happiest+0.034
Neg logit contributions
 staircase-0.082
 spe-0.079
 site-0.078
uders-0.076
 Jess-0.075
Token doll
Token ID3654
Feature activation-3.11e-02

Pos logit contributions
 staircase+0.017
uders+0.017
 site+0.017
 tad+0.016
 spe+0.016
Neg logit contributions
my-0.015
ophone-0.011
 best-0.010
 especially-0.010
umper-0.010
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
my+0.049
umper+0.038
ophone+0.037
ingu+0.035
 happiest+0.034
Neg logit contributions
 staircase-0.061
 site-0.057
 spe-0.057
uders-0.057
yt-0.055
Token They
Token ID1119
Feature activation-4.36e-03

Pos logit contributions
my+0.024
 especially+0.017
ophone+0.016
 best+0.016
umper+0.016
Neg logit contributions
 staircase-0.022
 spe-0.021
uders-0.021
 site-0.020
yt-0.020
Token also
Token ID635
Feature activation-9.61e-03

Pos logit contributions
 staircase+0.006
 spe+0.006
 flame+0.006
 Jess+0.006
uders+0.006
Neg logit contributions
my-0.004
umper-0.003
ophone-0.003
ingu-0.003
ip-0.003
Token good
Token ID922
Feature activation-1.14e-02

Pos logit contributions
my+0.014
umper+0.010
ophone+0.010
ingu+0.009
tie+0.009
Neg logit contributions
 staircase-0.021
 spe-0.020
 tad-0.020
uders-0.020
 site-0.020
Token friends
Token ID2460
Feature activation-4.80e-02

Pos logit contributions
my+0.015
ophone+0.010
umper+0.010
 best+0.010
 especially+0.010
Neg logit contributions
 staircase-0.026
 spe-0.024
 site-0.024
uders-0.024
 tad-0.024
Token.
Token ID13
Feature activation-4.46e-03

Pos logit contributions
my+0.090
umper+0.074
ophone+0.072
ingu+0.070
 happiest+0.064
Neg logit contributions
 staircase-0.078
 spe-0.077
fire-0.075
 site-0.075
uders-0.073
Token<|endoftext|>
Token ID50256
Feature activation-2.51e-02

Pos logit contributions
my+0.008
ophone+0.006
umper+0.006
ingu+0.006
 best+0.006
Neg logit contributions
 staircase-0.008
 spe-0.007
uders-0.007
 site-0.007
 tad-0.007
TokenL
Token ID43
Feature activation-5.01e-02

Pos logit contributions
my+0.043
ophone+0.033
umper+0.032
 especially+0.030
 best+0.030
Neg logit contributions
 staircase-0.047
 site-0.044
 spe-0.044
uders-0.044
 tad-0.044
Tokenily
Token ID813
Feature activation-2.55e-02

Pos logit contributions
my+0.077
 happiest+0.063
ophone+0.063
tie+0.062
 pads+0.058
Neg logit contributions
 staircase-0.096
 site-0.092
uders-0.090
 Jess-0.088
yt-0.088
Token likes
Token ID7832
Feature activation-3.55e-02

Pos logit contributions
my+0.042
ophone+0.033
umper+0.032
ingu+0.030
 best+0.030
Neg logit contributions
 staircase-0.048
uders-0.046
 spe-0.045
 site-0.045
 tad-0.044
Token to
Token ID284
Feature activation+1.48e-02

Pos logit contributions
my+0.045
ophone+0.032
umper+0.029
 happiest+0.027
 best+0.027
Neg logit contributions
 staircase-0.082
 spe-0.080
 site-0.078
 tad-0.078
uders-0.077
Token care
Token ID1337
Feature activation-5.02e-02

Pos logit contributions
 staircase+0.032
uders+0.031
 site+0.031
 spe+0.030
 tad+0.030
Neg logit contributions
my-0.021
ophone-0.015
umper-0.014
 especially-0.014
 best-0.014
Token for
Token ID329
Feature activation-4.44e-03

Pos logit contributions
my+0.086
umper+0.072
ophone+0.068
 happiest+0.067
 pads+0.067
Neg logit contributions
 staircase-0.089
 site-0.086
 flame-0.082
 Jess-0.082
izing-0.082
Token to
Token ID284
Feature activation+5.63e-02

Pos logit contributions
 staircase+0.011
 site+0.011
fire+0.010
 collapse+0.010
 spe+0.010
Neg logit contributions
my-0.008
umper-0.006
ingu-0.006
 happiest-0.006
ophone-0.006
Token dig
Token ID3100
Feature activation-4.70e-03

Pos logit contributions
 staircase+0.127
umm+0.121
yt+0.121
Run+0.121
uders+0.120
Neg logit contributions
my-0.079
 all-0.061
 always-0.058
 especially-0.057
 still-0.056
Token a
Token ID257
Feature activation+2.25e-02

Pos logit contributions
my+0.006
ophone+0.004
umper+0.004
 best+0.004
 especially+0.004
Neg logit contributions
 staircase-0.011
uders-0.010
 spe-0.010
 site-0.010
 tad-0.010
Token hole
Token ID7604
Feature activation+2.20e-02

Pos logit contributions
uders+0.042
Run+0.040
iously+0.038
atively+0.038
 extingu+0.037
Neg logit contributions
my-0.038
 everywhere-0.034
 best-0.034
 very-0.033
 back-0.033
Token and
Token ID290
Feature activation-2.72e-02

Pos logit contributions
 staircase+0.051
 spe+0.048
 site+0.048
uders+0.048
 tad+0.048
Neg logit contributions
my-0.028
ophone-0.019
umper-0.018
 best-0.017
 especially-0.017
Token put
Token ID1234
Feature activation-5.29e-02

Pos logit contributions
my+0.042
 would+0.031
 especially+0.030
 learned+0.029
 all+0.029
Neg logit contributions
uders-0.056
umm-0.056
 staircase-0.055
izing-0.055
 site-0.054
Token the
Token ID262
Feature activation-1.36e-02

Pos logit contributions
my+0.096
ophone+0.083
umper+0.080
ingu+0.077
 happiest+0.076
Neg logit contributions
 spe-0.086
 staircase-0.086
 tad-0.085
 flame-0.084
 Jess-0.082
Token bulb
Token ID28287
Feature activation-1.18e-02

Pos logit contributions
my+0.030
 best+0.024
 everywhere+0.023
ophone+0.023
 around+0.023
Neg logit contributions
 staircase-0.019
 site-0.018
uders-0.018
Run-0.017
yt-0.017
Token inside
Token ID2641
Feature activation-5.78e-03

Pos logit contributions
my+0.016
umper+0.013
ingu+0.012
ophone+0.012
 especially+0.011
Neg logit contributions
 staircase-0.025
 spe-0.024
 tad-0.023
fire-0.023
 flame-0.023
Token.
Token ID13
Feature activation-1.82e-02

Pos logit contributions
my+0.010
umper+0.009
ophone+0.009
ingu+0.009
 happiest+0.008
Neg logit contributions
 staircase-0.009
 site-0.009
 spe-0.009
 flame-0.009
 Jess-0.009
Token
Token ID220
Feature activation-6.17e-02

Pos logit contributions
my+0.025
ophone+0.019
umper+0.018
 best+0.017
 especially+0.017
Neg logit contributions
 staircase-0.040
 site-0.038
 Jess-0.037
 tad-0.037
 spe-0.037
Token<|endoftext|>
Token ID50256
Feature activation-3.10e-02

Pos logit contributions
 staircase+0.012
 site+0.012
 click+0.012
yt+0.012
inct+0.011
Neg logit contributions
my-0.011
ophone-0.010
wy-0.010
umper-0.010
tie-0.010
TokenOnce
Token ID7454
Feature activation-8.49e-02

Pos logit contributions
my+0.048
ophone+0.042
 happiest+0.040
umper+0.038
ingu+0.037
Neg logit contributions
 staircase-0.057
 site-0.055
 Jess-0.054
Run-0.053
 tad-0.053
Token upon
Token ID2402
Feature activation-3.80e-02

Pos logit contributions
tie+0.133
 pads+0.129
umper+0.128
wy+0.124
ophone+0.123
Neg logit contributions
yt-0.141
 staircase-0.137
 Jess-0.134
 site-0.124
opter-0.120
Token a
Token ID257
Feature activation-1.49e-02

Pos logit contributions
my+0.057
ophone+0.048
umper+0.048
ip+0.045
 happiest+0.044
Neg logit contributions
 staircase-0.071
Run-0.069
 tad-0.067
 flame-0.067
 click-0.067
Token time
Token ID640
Feature activation-4.43e-02

Pos logit contributions
my+0.021
ophone+0.016
umper+0.016
tie+0.014
ingu+0.014
Neg logit contributions
 staircase-0.032
 spe-0.030
 site-0.030
uders-0.029
Run-0.029
Token,
Token ID11
Feature activation-5.23e-02

Pos logit contributions
my+0.075
umper+0.060
ophone+0.060
tie+0.056
ingu+0.055
Neg logit contributions
 staircase-0.083
 site-0.077
 Jess-0.077
uders-0.076
 spe-0.076
Token there
Token ID612
Feature activation-4.15e-02

Pos logit contributions
my+0.069
umper+0.059
ophone+0.055
ingu+0.053
 pads+0.052
Neg logit contributions
 staircase-0.113
 Jess-0.107
Run-0.106
 spe-0.106
 flame-0.105
Token was
Token ID373
Feature activation-1.10e-02

Pos logit contributions
my+0.067
ophone+0.050
umper+0.049
 best+0.046
 especially+0.046
Neg logit contributions
 staircase-0.082
 site-0.077
 spe-0.077
uders-0.077
 tad-0.076
Token a
Token ID257
Feature activation+2.65e-02

Pos logit contributions
my+0.017
ophone+0.015
umper+0.014
ingu+0.013
 happiest+0.013
Neg logit contributions
 staircase-0.020
 Jess-0.020
Run-0.020
 spe-0.020
 tad-0.020
Token little
Token ID1310
Feature activation+3.54e-02

Pos logit contributions
 staircase+0.047
uders+0.046
 site+0.045
 collapse+0.043
inct+0.043
Neg logit contributions
my-0.049
 best-0.036
 especially-0.036
ophone-0.036
umper-0.035
Token girl
Token ID2576
Feature activation+8.05e-03

Pos logit contributions
uders+0.062
 staircase+0.062
 site+0.061
inct+0.060
 collapse+0.060
Neg logit contributions
my-0.069
 best-0.051
umper-0.049
 especially-0.048
ip-0.048
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
my+0.091
ophone+0.074
umper+0.073
ingu+0.070
ip+0.066
Neg logit contributions
 staircase-0.095
 spe-0.088
 site-0.087
 flame-0.086
uders-0.086
Token<|endoftext|>
Token ID50256
Feature activation-2.31e-02

Pos logit contributions
 staircase+0.018
 spe+0.017
uders+0.016
 site+0.016
 tad+0.016
Neg logit contributions
my-0.020
ophone-0.015
umper-0.015
 best-0.015
ingu-0.014
TokenA
Token ID32
Feature activation-1.63e-03

Pos logit contributions
my+0.038
ophone+0.032
 happiest+0.031
umper+0.030
 especially+0.029
Neg logit contributions
 staircase-0.041
 site-0.039
 spe-0.039
 Jess-0.039
 tad-0.039
Token little
Token ID1310
Feature activation+3.92e-02

Pos logit contributions
my+0.002
ophone+0.001
umper+0.001
 especially+0.001
 best+0.001
Neg logit contributions
 staircase-0.004
uders-0.004
 spe-0.004
 site-0.004
 tad-0.004
Token girl
Token ID2576
Feature activation+2.16e-02

Pos logit contributions
 staircase+0.066
uders+0.062
 site+0.062
 collapse+0.060
 tad+0.060
Neg logit contributions
my-0.079
umper-0.058
ophone-0.058
 best-0.058
 especially-0.055
Token named
Token ID3706
Feature activation-4.93e-02

Pos logit contributions
 staircase+0.050
 site+0.048
uders+0.047
 tad+0.047
 click+0.047
Neg logit contributions
my-0.029
ophone-0.019
 especially-0.018
 would-0.017
 best-0.017
Token Emma
Token ID18966
Feature activation-2.47e-02

Pos logit contributions
my+0.085
ophone+0.064
umper+0.063
 best+0.060
 especially+0.060
Neg logit contributions
 staircase-0.093
uders-0.087
 spe-0.087
 site-0.087
 tad-0.085
Token played
Token ID2826
Feature activation-3.11e-02

Pos logit contributions
my+0.035
ophone+0.021
 would+0.021
 especially+0.021
umper+0.020
Neg logit contributions
 staircase-0.057
 site-0.055
 tad-0.054
uders-0.054
 spe-0.054
Token with
Token ID351
Feature activation-5.70e-02

Pos logit contributions
my+0.039
 especially+0.027
 best+0.027
 around+0.026
 on+0.025
Neg logit contributions
 staircase-0.072
 spe-0.071
uders-0.070
 collapse-0.070
 swarm-0.069
Token her
Token ID607
Feature activation-5.24e-02

Pos logit contributions
my+0.079
ophone+0.056
umper+0.054
 best+0.050
 especially+0.049
Neg logit contributions
 staircase-0.127
 spe-0.121
uders-0.120
 site-0.120
 tad-0.118
Token warm
Token ID5814
Feature activation+3.24e-03

Pos logit contributions
my+0.089
umper+0.070
ophone+0.069
ingu+0.064
 especially+0.062
Neg logit contributions
 staircase-0.101
 spe-0.094
 tad-0.092
 flame-0.091
 site-0.091
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
my+0.018
ophone+0.013
umper+0.012
 best+0.012
 especially+0.012
Neg logit contributions
 staircase-0.026
 spe-0.024
uders-0.024
 site-0.024
 tad-0.024
Token They
Token ID1119
Feature activation-4.43e-03

Pos logit contributions
my+0.019
umper+0.016
ophone+0.016
 best+0.014
ingu+0.014
Neg logit contributions
 staircase-0.019
 site-0.017
 Jess-0.017
uders-0.017
 gap-0.017
Token liked
Token ID8288
Feature activation-2.13e-02

Pos logit contributions
my+0.006
umper+0.006
ophone+0.006
 forward+0.005
ingu+0.005
Neg logit contributions
 spe-0.008
 staircase-0.008
 tad-0.008
 slit-0.008
 Jess-0.007
Token to
Token ID284
Feature activation+2.88e-02

Pos logit contributions
my+0.031
ophone+0.024
umper+0.024
ingu+0.021
wy+0.020
Neg logit contributions
 staircase-0.046
 spe-0.044
 site-0.044
 tad-0.043
 Jess-0.042
Token gather
Token ID6431
Feature activation-1.86e-02

Pos logit contributions
 staircase+0.051
 site+0.047
 spe+0.047
uders+0.046
 tad+0.046
Neg logit contributions
my-0.053
ophone-0.042
umper-0.042
ingu-0.039
 best-0.039
Token shells
Token ID19679
Feature activation-6.93e-02

Pos logit contributions
my+0.036
ophone+0.030
umper+0.029
ingu+0.027
 happiest+0.026
Neg logit contributions
 tad-0.031
 staircase-0.030
 spe-0.029
 site-0.029
 gap-0.028
Token and
Token ID290
Feature activation-3.47e-02

Pos logit contributions
my+0.112
ophone+0.093
umper+0.091
 happiest+0.085
ingu+0.082
Neg logit contributions
 staircase-0.124
 tad-0.122
yt-0.121
 spe-0.121
 site-0.118
Token stones
Token ID14966
Feature activation-5.89e-02

Pos logit contributions
my+0.065
ophone+0.052
umper+0.051
 best+0.047
ingu+0.047
Neg logit contributions
 staircase-0.060
 spe-0.057
 tad-0.056
 site-0.055
uders-0.054
Token and
Token ID290
Feature activation-3.51e-02

Pos logit contributions
my+0.095
umper+0.077
ophone+0.076
 happiest+0.072
ingu+0.071
Neg logit contributions
 staircase-0.108
 spe-0.107
 tad-0.105
yt-0.104
 site-0.104
Token make
Token ID787
Feature activation-1.56e-02

Pos logit contributions
my+0.065
ophone+0.051
umper+0.051
ingu+0.047
 best+0.047
Neg logit contributions
 staircase-0.061
 spe-0.058
 tad-0.057
 site-0.056
uders-0.056
Token patterns
Token ID7572
Feature activation-5.74e-02

Pos logit contributions
my+0.025
 best+0.019
ophone+0.018
 favorite+0.018
 especially+0.018
Neg logit contributions
 staircase-0.032
uders-0.030
Run-0.029
 spe-0.029
 site-0.029