TOP ACTIVATIONS
MAX = 0.222

wanted to touch the ost ric hes . They reached out
and looked at the ost ric hes . They saw one
and run to the ost ric hes . "
too close . The ost ric hes might pe ck you
that said " O st ric hes ". Sara and Tom
cried . " The ost ric hes are mean ! They
should not tease the ost ric hes . They are not
you count how many ost ric hes there are ?"
shouted . " The ost ric hes are so funny !"
. One of the ost ric hes looked different from the
But the ost ric hes did not like that
. It sees other ost ric hes . It makes new
did not know what ost ric hes were . They looked
mom and to the ost ric hes . They learned their
Look , mom , ost ric hes !" "
. They were called ost ric hes . Lily and Ben
Yes , they are ost ric hes ," mom said .
but be careful . Ost ric hes can be gr umpy
" O st ric hes do not like people
They do not pack ost ric hes anymore . They pack

MIN ACTIVATIONS
MIN = -0.159

<|endoftext|> Once upon a time , there was a graceful dove
, a st eth oscope , and a bag of tools
<|endoftext|> Once upon a time , there was a king who
<|endoftext|> Once upon a time , there was a little girl
. She could make birds , stars , and flowers with
<|endoftext|> Once upon a time , there was a happy bunny
<|endoftext|> Once upon a time , there was a j olly
<|endoftext|> Once upon a time , there was a little boy
<|endoftext|> Once upon a time , there was a little bunny
<|endoftext|> Once upon a time , in a small village ,
<|endoftext|> Once upon a time , a humble driver drove his
<|endoftext|> Once upon a time , there was a little girl
<|endoftext|> Once upon a time , there was a little girl
<|endoftext|> Once upon a time , there was a little girl
<|endoftext|> Once upon a time , there was a little boy
<|endoftext|> Once upon a time , there was a little girl
<|endoftext|> Once upon a time , there was a little girl
<|endoftext|> Once upon a time , there was a big ,
<|endoftext|> Once upon a time , there was a little bunny
<|endoftext|> Once upon a time , there was a little boy

INTERVAL 0.127 - 0.222
CONTAINS 0.043%

to study . They may seem fun at first , but
was a lazy valley . Everything seemed peaceful and quiet .
, looking around and examining everything he saw . <|endoftext|> Once
stars tw ink ling . Everything was peaceful and calm .
at the park , and everything was so clean . <|endoftext|>

INTERVAL 0.032 - 0.127
CONTAINS 28.377%

Mia . They laughed and had fun . They forgot about
the ball and at the same time everyone was happy .
to the ground . S ara runs to Ben
drift off to sleep . One day , his
" You 're welcome . You are very honest children .

INTERVAL -0.064 - 0.032
CONTAINS 60.384%

was so happy and jumped for joy ! <|endoftext|> Once upon
afford a burger . And he learned that being kind is
what was inside the building so she went in . Inside
the dragon talked , and the dragon was so kind .
friend . He waved goodbye to the tutor bird and happily

INTERVAL -0.159 - -0.064
CONTAINS 11.196%

, why not ?" Santa Claus said . " But you
worked . Grand father went on to explain
a gigg ly stuffed bear , a soft blanket and a
better after that . <|endoftext|> John and his dad were out
of it . She ran and ran until she found a
Token wanted
Token ID2227
Feature activation+3.67e-02

Pos logit contributions
 fastest+0.061
 whole+0.061
tted+0.059
 certainly+0.059
 quickest+0.058
Neg logit contributions
 uncle-0.060
 Tim-0.060
 Lee-0.058
ries-0.058
 Uncle-0.058
Token to
Token ID284
Feature activation-4.57e-02

Pos logit contributions
 whole+0.058
 best+0.057
 most+0.057
 biggest+0.057
 will+0.056
Neg logit contributions
lee-0.062
kids-0.058
parent-0.057
ries-0.057
ens-0.057
Token touch
Token ID3638
Feature activation+3.97e-03

Pos logit contributions
 Tim+0.079
 uncle+0.079
 Uncle+0.074
 Lee+0.074
 Sue+0.074
Neg logit contributions
 fastest-0.077
 whole-0.076
tted-0.074
 quickest-0.073
 biggest-0.072
Token the
Token ID262
Feature activation+3.46e-02

Pos logit contributions
 fastest+0.008
 whole+0.008
 biggest+0.008
tted+0.008
 quickest+0.008
Neg logit contributions
 uncle-0.005
 Tim-0.005
 Uncle-0.005
 Lee-0.005
 Sue-0.005
Token ost
Token ID23619
Feature activation+1.65e-03

Pos logit contributions
tted+0.082
 surely+0.078
ative+0.078
hest+0.078
 certainly+0.077
Neg logit contributions
 uncle-0.041
 Tim-0.038
 Gig-0.037
 Sue-0.035
ries-0.034
Tokenric
Token ID1173
Feature activation+2.22e-01

Pos logit contributions
tted+0.002
 whole+0.002
 fastest+0.002
hest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.004
 uncle-0.004
ries-0.004
 Sue-0.004
 Lee-0.004
Tokenhes
Token ID956
Feature activation+9.04e-02

Pos logit contributions
 fastest+0.280
 quickest+0.273
 winner+0.255
 biggest+0.252
 coolest+0.246
Neg logit contributions
 Tim-0.494
 Sue-0.487
 Tom-0.487
ries-0.465
 uncle-0.460
Token.
Token ID13
Feature activation-6.63e-03

Pos logit contributions
 fastest+0.190
 whole+0.186
tted+0.184
 quickest+0.182
 biggest+0.181
Neg logit contributions
 Tim-0.123
 uncle-0.119
 Sue-0.113
 Tom-0.111
 Lee-0.110
Token They
Token ID1119
Feature activation+4.05e-02

Pos logit contributions
 Tim+0.012
 uncle+0.012
 Sue+0.011
 Uncle+0.011
 Tom+0.011
Neg logit contributions
 fastest-0.011
 whole-0.011
tted-0.011
hest-0.011
 quickest-0.011
Token reached
Token ID4251
Feature activation+1.14e-02

Pos logit contributions
 fastest+0.069
 whole+0.069
tted+0.068
 certainly+0.067
 quickest+0.066
Neg logit contributions
 uncle-0.066
 Tim-0.065
 Uncle-0.063
 Lee-0.063
ries-0.063
Token out
Token ID503
Feature activation+1.31e-02

Pos logit contributions
 fastest+0.016
 whole+0.016
 biggest+0.015
 quickest+0.015
tted+0.014
Neg logit contributions
 uncle-0.023
 Tim-0.023
 Gig-0.022
 Uncle-0.021
 Lee-0.021
Token and
Token ID290
Feature activation-1.25e-01

Pos logit contributions
kids+0.032
ens+0.029
riet+0.029
child+0.028
 purs+0.028
Neg logit contributions
 whole-0.037
 best-0.035
 fastest-0.035
 most-0.034
 will-0.033
Token looked
Token ID3114
Feature activation+4.56e-02

Pos logit contributions
 uncle+0.228
kids+0.220
ries+0.220
lee+0.218
 Tim+0.216
Neg logit contributions
 will-0.177
'll-0.172
 certainly-0.168
 whole-0.168
 surely-0.166
Token at
Token ID379
Feature activation-3.07e-02

Pos logit contributions
 whole+0.077
 fastest+0.074
 biggest+0.074
 most+0.074
 best+0.073
Neg logit contributions
 Lee-0.075
 uncle-0.075
 Tim-0.074
kids-0.073
 Kim-0.073
Token the
Token ID262
Feature activation+3.29e-02

Pos logit contributions
 uncle+0.037
 Tim+0.036
lee+0.036
 Lee+0.036
 handing+0.035
Neg logit contributions
 whole-0.063
 fastest-0.062
 biggest-0.061
 most-0.060
 best-0.059
Token ost
Token ID23619
Feature activation-3.04e-03

Pos logit contributions
tted+0.080
ative+0.077
hest+0.077
 surely+0.075
aying+0.073
Neg logit contributions
 uncle-0.038
 Gig-0.034
 Sue-0.033
ries-0.032
 Uncle-0.031
Tokenric
Token ID1173
Feature activation+2.20e-01

Pos logit contributions
 uncle+0.007
ries+0.007
 Tim+0.006
lee+0.006
 Lee+0.006
Neg logit contributions
tted-0.004
 whole-0.004
 fastest-0.004
hest-0.004
 biggest-0.003
Tokenhes
Token ID956
Feature activation+8.73e-02

Pos logit contributions
 fastest+0.326
 quickest+0.324
 coolest+0.303
 winner+0.298
 virtue+0.296
Neg logit contributions
 Tom-0.446
 Sue-0.437
 Tim-0.432
ries-0.415
 Mia-0.402
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
 fastest+0.193
 quickest+0.185
 biggest+0.184
 whole+0.183
hest+0.181
Neg logit contributions
 Tim-0.114
 uncle-0.106
 Tom-0.106
 Sue-0.104
 Lee-0.100
Token They
Token ID1119
Feature activation+3.71e-02

Pos logit contributions
 uncle+0.029
ries+0.028
pa+0.028
ens+0.027
 Tim+0.027
Neg logit contributions
 whole-0.020
tted-0.019
 biggest-0.019
 fastest-0.019
 most-0.019
Token saw
Token ID2497
Feature activation-7.51e-03

Pos logit contributions
 fastest+0.068
 whole+0.068
tted+0.066
 certainly+0.065
 quickest+0.065
Neg logit contributions
 uncle-0.057
 Tim-0.056
 Uncle-0.053
 Lee-0.053
 Sue-0.053
Token one
Token ID530
Feature activation+6.94e-02

Pos logit contributions
 uncle+0.013
 Tim+0.013
 Lee+0.012
 Uncle+0.012
ens+0.012
Neg logit contributions
 fastest-0.012
 whole-0.012
 biggest-0.012
 most-0.011
tted-0.011
Token and
Token ID290
Feature activation-1.17e-01

Pos logit contributions
 fastest+0.071
 whole+0.070
tted+0.069
 quickest+0.068
 biggest+0.068
Neg logit contributions
 Tim-0.028
 uncle-0.027
 Sue-0.026
 Tom-0.025
 Lee-0.024
Token run
Token ID1057
Feature activation-4.04e-02

Pos logit contributions
 uncle+0.205
 Tim+0.205
 Uncle+0.193
 Sue+0.192
lee+0.190
Neg logit contributions
 fastest-0.191
 whole-0.186
tted-0.180
 quickest-0.179
 biggest-0.178
Token to
Token ID284
Feature activation-6.33e-02

Pos logit contributions
 uncle+0.059
 Tim+0.059
 Uncle+0.055
 Sue+0.054
 Lee+0.054
Neg logit contributions
 fastest-0.079
 whole-0.078
tted-0.077
 quickest-0.075
 biggest-0.074
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
 uncle+0.084
lee+0.083
ries+0.082
ens+0.081
 Tim+0.079
Neg logit contributions
 whole-0.127
 fastest-0.124
 biggest-0.121
 quickest-0.118
 certainly-0.118
Token ost
Token ID23619
Feature activation-2.41e-03

Pos logit contributions
 fastest+0.039
 whole+0.038
tted+0.038
hest+0.037
 quickest+0.037
Neg logit contributions
 uncle-0.036
 Tim-0.036
 Sue-0.033
 Uncle-0.033
 Lee-0.033
Tokenric
Token ID1173
Feature activation+2.18e-01

Pos logit contributions
 uncle+0.005
 Tim+0.005
ries+0.005
 Lee+0.005
 Sue+0.005
Neg logit contributions
tted-0.003
 whole-0.003
 fastest-0.003
hest-0.003
 biggest-0.003
Tokenhes
Token ID956
Feature activation+8.62e-02

Pos logit contributions
 fastest+0.305
 quickest+0.304
 coolest+0.284
 virtue+0.281
 winner+0.277
Neg logit contributions
 Tom-0.461
 Sue-0.455
 Tim-0.449
ries-0.420
 uncle-0.415
Token.
Token ID13
Feature activation-9.43e-03

Pos logit contributions
 fastest+0.185
 quickest+0.178
 whole+0.177
hest+0.176
tted+0.176
Neg logit contributions
 Tim-0.118
 uncle-0.111
 Tom-0.110
 Sue-0.110
 Lee-0.105
Token
Token ID198
Feature activation+7.31e-02

Pos logit contributions
 uncle+0.017
 Tim+0.017
 Uncle+0.016
ries+0.016
 Lee+0.016
Neg logit contributions
 fastest-0.015
 whole-0.015
tted-0.015
 biggest-0.014
 quickest-0.014
Token
Token ID198
Feature activation+6.67e-02

Pos logit contributions
 fastest+0.118
 whole+0.113
 quickest+0.112
 biggest+0.110
tted+0.110
Neg logit contributions
 Tim-0.137
 uncle-0.134
 Sue-0.129
 Tom-0.127
 Uncle-0.126
Token"
Token ID1
Feature activation+2.10e-02

Pos logit contributions
 fastest+0.106
 biggest+0.104
 whole+0.102
 quickest+0.100
 certainly+0.100
Neg logit contributions
 Tim-0.125
 uncle-0.123
 Sue-0.119
 Tom-0.118
 Uncle-0.115
Token too
Token ID1165
Feature activation+5.08e-02

Pos logit contributions
 Tim+0.047
 uncle+0.047
 Uncle+0.044
 Sue+0.044
 Lee+0.044
Neg logit contributions
 fastest-0.051
 whole-0.050
tted-0.049
 quickest-0.048
 biggest-0.048
Token close
Token ID1969
Feature activation-2.12e-02

Pos logit contributions
 fastest+0.071
 whole+0.069
tted+0.068
 quickest+0.066
hest+0.066
Neg logit contributions
 Tim-0.103
 uncle-0.103
 Sue-0.098
 Uncle-0.097
 Lee-0.096
Token.
Token ID13
Feature activation+1.17e-03

Pos logit contributions
 uncle+0.028
 Tim+0.028
 Uncle+0.026
 Lee+0.026
 Sue+0.026
Neg logit contributions
 fastest-0.043
 whole-0.043
 quickest-0.041
 biggest-0.041
 most-0.040
Token The
Token ID383
Feature activation+3.86e-02

Pos logit contributions
 fastest+0.002
 whole+0.002
tted+0.002
 quickest+0.002
hest+0.002
Neg logit contributions
 Tim-0.002
 uncle-0.002
 Sue-0.002
 Uncle-0.002
 Lee-0.002
Token ost
Token ID23619
Feature activation+1.99e-03

Pos logit contributions
tted+0.082
hest+0.082
ative+0.078
 certainly+0.077
arest+0.077
Neg logit contributions
 uncle-0.057
 Sue-0.051
 Tim-0.051
 Gig-0.051
 Uncle-0.050
Tokenric
Token ID1173
Feature activation+2.18e-01

Pos logit contributions
tted+0.003
 whole+0.003
hest+0.003
 biggest+0.003
 fastest+0.003
Neg logit contributions
ries-0.004
 uncle-0.004
 Tim-0.004
lee-0.004
 Lee-0.004
Tokenhes
Token ID956
Feature activation+8.89e-02

Pos logit contributions
 quickest+0.325
 fastest+0.321
 coolest+0.309
 virtue+0.306
 biggest+0.299
Neg logit contributions
 Tom-0.429
 Sue-0.422
 Tim-0.405
ries-0.390
 Mia-0.385
Token might
Token ID1244
Feature activation+9.52e-02

Pos logit contributions
 fastest+0.134
 biggest+0.131
 quickest+0.130
arest+0.128
tted+0.128
Neg logit contributions
 Tim-0.170
 Sue-0.161
 Tom-0.160
ens-0.157
 uncle-0.154
Token pe
Token ID613
Feature activation-1.68e-02

Pos logit contributions
ative+0.191
tted+0.182
 biggest+0.177
antly+0.176
 highlight+0.171
Neg logit contributions
 Tim-0.141
 Sue-0.141
 uncle-0.136
 Kitty-0.135
 Tom-0.133
Tokenck
Token ID694
Feature activation+1.31e-02

Pos logit contributions
ries+0.021
 uncle+0.018
lee+0.017
 Kim+0.016
 stro+0.016
Neg logit contributions
tted-0.038
 whole-0.037
ative-0.036
 virtue-0.036
 entire-0.035
Token you
Token ID345
Feature activation+5.60e-02

Pos logit contributions
 fastest+0.026
tted+0.025
 whole+0.025
 quickest+0.025
 certainly+0.025
Neg logit contributions
 Tim-0.019
 uncle-0.018
 Sue-0.017
 Tom-0.017
ens-0.017
Token that
Token ID326
Feature activation-1.29e-02

Pos logit contributions
 fastest+0.016
 whole+0.016
tted+0.016
 biggest+0.015
hest+0.015
Neg logit contributions
 Tim-0.014
 uncle-0.014
 Uncle-0.013
 Lee-0.013
ens-0.013
Token said
Token ID531
Feature activation-2.57e-02

Pos logit contributions
 Tim+0.012
 uncle+0.011
 Sue+0.010
 Tom+0.010
 Uncle+0.009
Neg logit contributions
 fastest-0.034
 whole-0.033
tted-0.033
hest-0.033
 quickest-0.032
Token "
Token ID366
Feature activation+7.07e-02

Pos logit contributions
 Tim+0.034
 uncle+0.032
 Sue+0.031
 Tom+0.031
 Lee+0.030
Neg logit contributions
 fastest-0.057
 whole-0.056
 quickest-0.055
hest-0.055
tted-0.055
TokenO
Token ID46
Feature activation-3.68e-02

Pos logit contributions
 biggest+0.115
 fastest+0.114
 certainly+0.113
ery+0.109
 best+0.109
Neg logit contributions
 Tim-0.127
 uncle-0.122
 Gig-0.119
 Sue-0.118
 Lee-0.118
Tokenst
Token ID301
Feature activation+5.91e-02

Pos logit contributions
 Tim+0.059
 uncle+0.058
ries+0.057
ens+0.056
 Kitty+0.056
Neg logit contributions
 fastest-0.066
 whole-0.066
 biggest-0.065
tted-0.064
 quickest-0.064
Tokenric
Token ID1173
Feature activation+2.16e-01

Pos logit contributions
 whole+0.097
 highlight+0.096
 fastest+0.096
 quickest+0.093
 coolest+0.093
Neg logit contributions
 Tim-0.104
 Sue-0.100
 uncle-0.099
 Lee-0.098
 Tom-0.097
Tokenhes
Token ID956
Feature activation+9.70e-02

Pos logit contributions
 quickest+0.305
 fastest+0.304
 coolest+0.294
 virtue+0.285
 surely+0.279
Neg logit contributions
 Tom-0.430
 Sue-0.415
 Tim-0.410
ries-0.386
 Amy-0.381
Token".
Token ID1911
Feature activation-1.29e-02

Pos logit contributions
 fastest+0.155
 quickest+0.151
tted+0.150
 whole+0.145
 entire+0.144
Neg logit contributions
 Tim-0.185
 Tom-0.177
 Sue-0.176
 uncle-0.170
 Sally-0.168
Token Sara
Token ID24799
Feature activation-7.28e-02

Pos logit contributions
 uncle+0.010
lee+0.009
ens+0.009
pa+0.009
 Tim+0.009
Neg logit contributions
 whole-0.033
tted-0.032
 biggest-0.032
 fastest-0.032
 most-0.032
Token and
Token ID290
Feature activation-1.19e-01

Pos logit contributions
 Tim+0.098
 uncle+0.097
 Sue+0.092
ries+0.091
 Gig+0.091
Neg logit contributions
 whole-0.146
 fastest-0.146
tted-0.143
 certainly-0.143
 quickest-0.139
Token Tom
Token ID4186
Feature activation-8.04e-02

Pos logit contributions
lee+0.116
 stro+0.108
olph+0.106
 handing+0.102
 babys+0.100
Neg logit contributions
 best-0.265
 whole-0.258
 most-0.253
 certainly-0.252
 biggest-0.250
Token cried
Token ID16896
Feature activation-2.92e-02

Pos logit contributions
 uncle+0.059
 Tim+0.058
 Uncle+0.052
 Sue+0.051
 Lee+0.051
Neg logit contributions
 fastest-0.152
 whole-0.151
tted-0.149
 certainly-0.146
 quickest-0.146
Token.
Token ID13
Feature activation-2.56e-02

Pos logit contributions
 uncle+0.048
 Tim+0.047
 Uncle+0.045
ries+0.045
 Lee+0.045
Neg logit contributions
 whole-0.050
 fastest-0.049
tted-0.048
 biggest-0.047
hest-0.046
Token "
Token ID366
Feature activation+5.21e-02

Pos logit contributions
 Tim+0.044
 uncle+0.044
 Sue+0.042
 Uncle+0.041
 Lee+0.041
Neg logit contributions
 fastest-0.044
 whole-0.043
tted-0.042
 quickest-0.042
hest-0.042
TokenThe
Token ID464
Feature activation+4.80e-02

Pos logit contributions
 fastest+0.067
 quickest+0.063
 winner+0.062
 biggest+0.062
 certainly+0.061
Neg logit contributions
 Tim-0.116
 Tom-0.108
 uncle-0.107
ens-0.106
 Sue-0.105
Token ost
Token ID23619
Feature activation+8.43e-03

Pos logit contributions
tted+0.099
hest+0.098
 fastest+0.095
 certainly+0.095
 quickest+0.093
Neg logit contributions
 uncle-0.069
 Tim-0.065
 Uncle-0.061
 Lee-0.060
 Sue-0.059
Tokenric
Token ID1173
Feature activation+2.15e-01

Pos logit contributions
 whole+0.011
tted+0.011
 fastest+0.010
 biggest+0.010
hest+0.010
Neg logit contributions
ries-0.018
 Tim-0.018
 uncle-0.018
 Lee-0.017
 Sue-0.017
Tokenhes
Token ID956
Feature activation+9.30e-02

Pos logit contributions
 quickest+0.368
 fastest+0.360
 surely+0.343
 highlight+0.337
 winner+0.332
Neg logit contributions
 Tom-0.399
 Sue-0.377
 Tim-0.375
ries-0.358
 Mia-0.351
Token are
Token ID389
Feature activation+7.97e-02

Pos logit contributions
 fastest+0.133
 quickest+0.130
arest+0.125
tted+0.125
 biggest+0.125
Neg logit contributions
 Tim-0.194
 Sue-0.184
 Tom-0.181
ens-0.178
 Lily-0.177
Token mean
Token ID1612
Feature activation-3.46e-02

Pos logit contributions
tted+0.149
 fastest+0.142
hest+0.141
 quickest+0.139
ative+0.138
Neg logit contributions
 Tim-0.134
 uncle-0.126
 Sue-0.125
 Tom-0.123
 Kitty-0.120
Token!
Token ID0
Feature activation-1.27e-03

Pos logit contributions
 Tim+0.054
 uncle+0.054
 Sue+0.050
 Uncle+0.050
 Lee+0.050
Neg logit contributions
 fastest-0.063
 whole-0.063
tted-0.061
hest-0.060
 biggest-0.060
Token They
Token ID1119
Feature activation+4.63e-02

Pos logit contributions
 Tim+0.002
 uncle+0.002
 Sue+0.002
 Lee+0.002
 Uncle+0.002
Neg logit contributions
 fastest-0.002
 whole-0.002
tted-0.002
 quickest-0.002
hest-0.002
Token should
Token ID815
Feature activation+1.02e-02

Pos logit contributions
 fastest+0.154
 quickest+0.148
 biggest+0.144
arest+0.143
 whole+0.143
Neg logit contributions
 Tim-0.204
 Tommy-0.179
 Tom-0.178
 uncle-0.176
 Lily-0.174
Token not
Token ID407
Feature activation+2.07e-02

Pos logit contributions
 fastest+0.016
 whole+0.015
tted+0.015
 biggest+0.015
 quickest+0.015
Neg logit contributions
 Tim-0.020
 uncle-0.019
 Sue-0.018
 Tom-0.018
 Uncle-0.018
Token tease
Token ID34691
Feature activation-2.09e-02

Pos logit contributions
 fastest+0.041
 biggest+0.040
 whole+0.040
tted+0.039
 quickest+0.039
Neg logit contributions
 Tim-0.032
 uncle-0.030
 Sue-0.029
 Tom-0.028
 Sally-0.028
Token the
Token ID262
Feature activation+2.81e-02

Pos logit contributions
 uncle+0.030
 Tim+0.029
lee+0.028
 Uncle+0.028
 Lee+0.027
Neg logit contributions
 fastest-0.041
 whole-0.039
 biggest-0.038
 most-0.038
 quickest-0.038
Token ost
Token ID23619
Feature activation+6.03e-03

Pos logit contributions
tted+0.073
hest+0.071
ative+0.069
 surely+0.069
arest+0.068
Neg logit contributions
 uncle-0.028
 Sue-0.024
 Gig-0.024
 Tim-0.024
 Kitty-0.023
Tokenric
Token ID1173
Feature activation+2.15e-01

Pos logit contributions
tted+0.007
 whole+0.007
 fastest+0.007
hest+0.007
 biggest+0.007
Neg logit contributions
 Tim-0.013
 uncle-0.013
ries-0.013
 Lee-0.013
 Sue-0.013
Tokenhes
Token ID956
Feature activation+9.52e-02

Pos logit contributions
 quickest+0.290
 fastest+0.284
 surely+0.267
 biggest+0.266
 coolest+0.265
Neg logit contributions
 Tom-0.451
 Tim-0.446
 Sue-0.442
ries-0.428
 uncle-0.409
Token.
Token ID13
Feature activation-2.64e-03

Pos logit contributions
 fastest+0.194
tted+0.192
 whole+0.190
 quickest+0.188
 certainly+0.187
Neg logit contributions
 Tim-0.135
 uncle-0.129
 Sue-0.124
 Tom-0.123
 Lee-0.120
Token They
Token ID1119
Feature activation+4.65e-02

Pos logit contributions
 Tim+0.005
 uncle+0.004
 Sue+0.004
 Tom+0.004
 Lee+0.004
Neg logit contributions
 whole-0.005
 fastest-0.005
tted-0.005
hest-0.005
 quickest-0.005
Token are
Token ID389
Feature activation+7.61e-02

Pos logit contributions
ative+0.064
 whole+0.063
tted+0.063
 fastest+0.062
 entire+0.062
Neg logit contributions
 Tim-0.097
ens-0.091
 uncle-0.089
 Tom-0.087
 Kitty-0.085
Token not
Token ID407
Feature activation+2.23e-02

Pos logit contributions
 fastest+0.121
tted+0.120
 whole+0.119
hest+0.116
 quickest+0.115
Neg logit contributions
 Tim-0.140
 uncle-0.139
 Sue-0.131
 Uncle-0.129
 Tom-0.129
Token you
Token ID345
Feature activation+3.52e-02

Pos logit contributions
 uncle+0.015
 Tim+0.014
 Uncle+0.014
 Lee+0.014
lee+0.014
Neg logit contributions
 fastest-0.018
 whole-0.017
 biggest-0.017
 most-0.017
 quickest-0.017
Token count
Token ID954
Feature activation-1.61e-02

Pos logit contributions
 fastest+0.067
tted+0.067
 whole+0.065
 quickest+0.065
 biggest+0.064
Neg logit contributions
 Tim-0.055
 uncle-0.053
 Sue-0.051
 Tom-0.049
 Sally-0.048
Token how
Token ID703
Feature activation+5.89e-02

Pos logit contributions
 Tim+0.022
 uncle+0.022
 Sue+0.020
ries+0.020
 Lee+0.020
Neg logit contributions
tted-0.034
 fastest-0.033
 whole-0.033
 quickest-0.032
 certainly-0.032
Token many
Token ID867
Feature activation+3.68e-02

Pos logit contributions
 fastest+0.111
tted+0.111
 quickest+0.108
hest+0.107
 whole+0.107
Neg logit contributions
 Tim-0.092
 uncle-0.091
 Sue-0.087
 Uncle-0.085
 Tom-0.084
Token ost
Token ID23619
Feature activation-2.93e-04

Pos logit contributions
tted+0.079
 whole+0.076
 fastest+0.075
 certainly+0.074
hest+0.074
Neg logit contributions
 uncle-0.049
 Tim-0.049
 Sue-0.046
ries-0.046
 Kitty-0.044
Tokenric
Token ID1173
Feature activation+2.14e-01

Pos logit contributions
ries+0.001
lee+0.001
 Tim+0.001
 uncle+0.001
 Lee+0.001
Neg logit contributions
tted-0.000
 whole-0.000
hest-0.000
 biggest-0.000
 virtue-0.000
Tokenhes
Token ID956
Feature activation+9.61e-02

Pos logit contributions
 quickest+0.299
 virtue+0.299
 fastest+0.298
tted+0.289
 biggest+0.283
Neg logit contributions
 Sue-0.423
 Tom-0.420
 Tim-0.420
ries-0.410
 bananas-0.385
Token there
Token ID612
Feature activation+9.95e-02

Pos logit contributions
tted+0.190
 fastest+0.179
 whole+0.177
 quickest+0.177
 entire+0.176
Neg logit contributions
 Tim-0.149
 Sue-0.143
 Tom-0.140
 uncle-0.136
 Lily-0.135
Token are
Token ID389
Feature activation+8.49e-02

Pos logit contributions
tted+0.238
 virtue+0.222
arest+0.222
ative+0.220
 certainly+0.216
Neg logit contributions
 Tim-0.116
 uncle-0.103
 Tom-0.102
 Sue-0.102
 Lily-0.095
Token?"
Token ID1701
Feature activation-2.62e-02

Pos logit contributions
tted+0.163
 whole+0.151
 fastest+0.151
hest+0.151
arest+0.150
Neg logit contributions
 Tim-0.134
 uncle-0.131
 Sue-0.128
 Tom-0.122
 Kitty-0.122
Token
Token ID198
Feature activation+7.02e-02

Pos logit contributions
 uncle+0.039
kids+0.036
pa+0.036
ries+0.036
 purs+0.035
Neg logit contributions
 whole-0.048
It-0.047
 certainly-0.047
 most-0.047
 am-0.046
Token shouted
Token ID17293
Feature activation-4.91e-02

Pos logit contributions
 Gig+0.144
ries+0.144
parent+0.140
 uncle+0.138
 Betsy+0.135
Neg logit contributions
 fastest-0.149
 certainly-0.147
 most-0.146
tted-0.146
 whole-0.146
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
ries+0.096
kids+0.094
 uncle+0.094
parent+0.092
 Gig+0.092
Neg logit contributions
 whole-0.067
tted-0.063
 best-0.062
 across-0.059
 biggest-0.059
Token "
Token ID366
Feature activation+5.06e-02

Pos logit contributions
 uncle+0.034
 Tim+0.034
ries+0.033
ens+0.032
 Gig+0.032
Neg logit contributions
 whole-0.033
 fastest-0.033
tted-0.032
 biggest-0.031
 certainly-0.031
TokenThe
Token ID464
Feature activation+4.24e-02

Pos logit contributions
 fastest+0.066
 biggest+0.062
 certainly+0.062
 virtue+0.062
 best+0.062
Neg logit contributions
 Tim-0.110
 Tom-0.106
 uncle-0.106
 Sue-0.104
 Lee-0.102
Token ost
Token ID23619
Feature activation+2.26e-03

Pos logit contributions
tted+0.098
 fastest+0.096
hest+0.095
 quickest+0.093
 whole+0.093
Neg logit contributions
 uncle-0.052
 Tim-0.049
 Uncle-0.046
 Lee-0.045
 Sue-0.044
Tokenric
Token ID1173
Feature activation+2.12e-01

Pos logit contributions
tted+0.003
 whole+0.003
 fastest+0.003
 biggest+0.003
hest+0.003
Neg logit contributions
 Tim-0.005
ries-0.005
 uncle-0.005
 Lee-0.005
 Sue-0.005
Tokenhes
Token ID956
Feature activation+8.97e-02

Pos logit contributions
 quickest+0.317
 fastest+0.315
 highlight+0.299
 virtue+0.297
 coolest+0.296
Neg logit contributions
 Tom-0.431
 Sue-0.424
 Tim-0.412
ries-0.390
 Mia-0.386
Token are
Token ID389
Feature activation+8.15e-02

Pos logit contributions
 fastest+0.128
 quickest+0.124
tted+0.122
 biggest+0.121
arest+0.121
Neg logit contributions
 Tim-0.182
 Sue-0.176
 Tom-0.174
 Lily-0.167
ens-0.162
Token so
Token ID523
Feature activation-1.21e-02

Pos logit contributions
tted+0.139
 fastest+0.135
hest+0.133
 whole+0.133
 quickest+0.130
Neg logit contributions
 Tim-0.142
 uncle-0.140
 Sue-0.136
 Tom-0.132
 Kitty-0.131
Token funny
Token ID8258
Feature activation-8.54e-04

Pos logit contributions
 Tim+0.024
 Sue+0.024
 uncle+0.023
 Tom+0.023
 Kitty+0.022
Neg logit contributions
 fastest-0.018
tted-0.017
 quickest-0.017
hest-0.017
 certainly-0.017
Token!"
Token ID2474
Feature activation-3.93e-02

Pos logit contributions
 Tim+0.001
 uncle+0.001
 Sue+0.001
 Uncle+0.001
 Lee+0.001
Neg logit contributions
 fastest-0.001
 whole-0.001
tted-0.001
 quickest-0.001
 biggest-0.001
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
 fastest+0.148
 whole+0.145
tted+0.143
 quickest+0.139
hest+0.138
Neg logit contributions
 Tim-0.191
 uncle-0.189
 Sue-0.181
 Uncle-0.177
 Tom-0.176
Token One
Token ID1881
Feature activation+7.98e-02

Pos logit contributions
 uncle+0.024
 Tim+0.023
pa+0.022
ries+0.022
ens+0.022
Neg logit contributions
 whole-0.019
tted-0.019
 biggest-0.019
 fastest-0.019
 certainly-0.018
Token of
Token ID286
Feature activation-3.33e-03

Pos logit contributions
tted+0.162
hest+0.162
 fastest+0.158
arest+0.155
 certainly+0.155
Neg logit contributions
 Sue-0.120
 Tim-0.116
 uncle-0.111
 Kitty-0.109
 Sally-0.107
Token the
Token ID262
Feature activation+3.34e-02

Pos logit contributions
 uncle+0.004
 Tim+0.004
 Lee+0.004
 Uncle+0.003
 Sue+0.003
Neg logit contributions
 fastest-0.007
 whole-0.007
tted-0.007
 biggest-0.007
 quickest-0.007
Token ost
Token ID23619
Feature activation+1.61e-03

Pos logit contributions
tted+0.078
ative+0.075
hest+0.074
 surely+0.073
 certainly+0.072
Neg logit contributions
 uncle-0.042
 Tim-0.037
 Sue-0.037
 Gig-0.036
ries-0.035
Tokenric
Token ID1173
Feature activation+2.12e-01

Pos logit contributions
tted+0.002
 whole+0.002
hest+0.002
 biggest+0.002
arest+0.002
Neg logit contributions
ries-0.003
lee-0.003
 uncle-0.003
 Tim-0.003
pa-0.003
Tokenhes
Token ID956
Feature activation+8.42e-02

Pos logit contributions
 fastest+0.298
 quickest+0.291
 virtue+0.267
 winner+0.267
 coolest+0.265
Neg logit contributions
 Sue-0.444
 Tom-0.443
 Tim-0.436
ries-0.419
 Amy-0.412
Token looked
Token ID3114
Feature activation+6.12e-02

Pos logit contributions
 fastest+0.149
 quickest+0.141
 biggest+0.140
 whole+0.139
hest+0.138
Neg logit contributions
 Tim-0.148
 Sue-0.139
 uncle-0.137
 Tom-0.135
 Sally-0.133
Token different
Token ID1180
Feature activation+2.10e-02

Pos logit contributions
 fastest+0.112
 whole+0.111
 biggest+0.107
tted+0.106
 quickest+0.105
Neg logit contributions
 uncle-0.096
 Tim-0.096
 Lee-0.092
 Uncle-0.090
 Sue-0.089
Token from
Token ID422
Feature activation-4.75e-02

Pos logit contributions
tted+0.033
 fastest+0.033
 whole+0.032
hest+0.031
 certainly+0.031
Neg logit contributions
 Tim-0.039
 uncle-0.039
 Sue-0.037
ries-0.037
 Uncle-0.036
Token the
Token ID262
Feature activation+3.53e-02

Pos logit contributions
 Tim+0.081
 uncle+0.081
 Uncle+0.076
 Lee+0.076
 Sue+0.076
Neg logit contributions
 fastest-0.080
 whole-0.079
tted-0.076
 biggest-0.076
 quickest-0.076
Token
Token ID198
Feature activation+7.59e-02

Pos logit contributions
 Tim+0.016
 uncle+0.016
 Sue+0.015
 Uncle+0.015
 Tom+0.015
Neg logit contributions
 fastest-0.016
 whole-0.015
hest-0.015
tted-0.015
 quickest-0.015
Token
Token ID198
Feature activation+7.12e-02

Pos logit contributions
 fastest+0.107
 whole+0.106
tted+0.104
hest+0.100
 biggest+0.100
Neg logit contributions
 Tim-0.150
 uncle-0.150
 Sue-0.142
 Lee-0.142
 Uncle-0.141
TokenBut
Token ID1537
Feature activation-1.27e-02

Pos logit contributions
 fastest+0.111
 whole+0.107
 biggest+0.104
 quickest+0.104
tted+0.104
Neg logit contributions
 uncle-0.135
 Tim-0.135
 Sue-0.128
 Uncle-0.127
 Lee-0.126
Token the
Token ID262
Feature activation+3.01e-02

Pos logit contributions
lee+0.018
kids+0.017
ens+0.017
parent+0.016
child+0.016
Neg logit contributions
 surely-0.025
 whole-0.024
 most-0.024
 best-0.024
 fastest-0.024
Token ost
Token ID23619
Feature activation-2.46e-04

Pos logit contributions
tted+0.074
hest+0.073
ative+0.070
 certainly+0.070
 fastest+0.069
Neg logit contributions
 uncle-0.035
 Sue-0.030
 Tim-0.030
 Uncle-0.029
 Gig-0.028
Tokenric
Token ID1173
Feature activation+2.10e-01

Pos logit contributions
 uncle+0.001
ries+0.001
 Tim+0.001
lee+0.001
 Sue+0.001
Neg logit contributions
tted-0.000
 whole-0.000
 fastest-0.000
 biggest-0.000
hest-0.000
Tokenhes
Token ID956
Feature activation+8.26e-02

Pos logit contributions
 fastest+0.323
 quickest+0.319
 winner+0.299
 coolest+0.293
 highlight+0.291
Neg logit contributions
 Tom-0.413
 Sue-0.411
 Tim-0.409
ries-0.384
 Amy-0.377
Token did
Token ID750
Feature activation+7.20e-02

Pos logit contributions
 fastest+0.152
 quickest+0.144
 biggest+0.142
hest+0.141
arest+0.139
Neg logit contributions
 Tim-0.141
 Sue-0.133
 Tom-0.132
 uncle-0.131
ens-0.127
Token not
Token ID407
Feature activation+2.19e-02

Pos logit contributions
 fastest+0.119
tted+0.115
 whole+0.115
 quickest+0.113
hest+0.112
Neg logit contributions
 Tim-0.130
 uncle-0.128
 Sue-0.123
 Tom-0.121
 Uncle-0.120
Token like
Token ID588
Feature activation+1.04e-02

Pos logit contributions
 fastest+0.037
 whole+0.036
tted+0.036
 quickest+0.035
 biggest+0.035
Neg logit contributions
 Tim-0.038
 uncle-0.038
 Sue-0.036
 Uncle-0.035
 Lee-0.035
Token that
Token ID326
Feature activation-2.61e-03

Pos logit contributions
 whole+0.020
 fastest+0.020
 biggest+0.020
 most+0.019
 certainly+0.019
Neg logit contributions
 uncle-0.014
 Tim-0.013
 Lee-0.013
ens-0.013
lee-0.013
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
 Tim+0.070
 uncle+0.070
 Sue+0.065
 Uncle+0.064
 Lee+0.064
Neg logit contributions
 fastest-0.088
 whole-0.086
tted-0.085
 quickest-0.083
 biggest-0.083
Token It
Token ID632
Feature activation+1.07e-01

Pos logit contributions
 uncle+0.018
 Tim+0.018
ries+0.017
 Lee+0.017
 Uncle+0.017
Neg logit contributions
 fastest-0.018
 whole-0.018
tted-0.017
 biggest-0.017
 certainly-0.017
Token sees
Token ID7224
Feature activation-1.03e-02

Pos logit contributions
 fastest+0.177
 whole+0.175
tted+0.174
hest+0.172
arest+0.168
Neg logit contributions
 Tim-0.185
 uncle-0.184
 Sue-0.175
 Tom-0.171
 Lee-0.171
Token other
Token ID584
Feature activation-1.30e-02

Pos logit contributions
 uncle+0.016
 Tim+0.016
 Sue+0.015
 Lee+0.015
 Uncle+0.015
Neg logit contributions
 fastest-0.019
 whole-0.018
tted-0.018
 biggest-0.018
 quickest-0.017
Token ost
Token ID23619
Feature activation-4.00e-03

Pos logit contributions
 Tim+0.019
 uncle+0.019
 Uncle+0.018
 Sue+0.018
 Lee+0.018
Neg logit contributions
 fastest-0.025
 whole-0.025
tted-0.024
 biggest-0.024
 quickest-0.024
Tokenric
Token ID1173
Feature activation+2.09e-01

Pos logit contributions
ries+0.008
 uncle+0.008
 Tim+0.008
lee+0.008
 Lee+0.008
Neg logit contributions
tted-0.005
 whole-0.005
 fastest-0.005
 biggest-0.005
hest-0.005
Tokenhes
Token ID956
Feature activation+7.63e-02

Pos logit contributions
 fastest+0.269
 quickest+0.264
 virtue+0.249
 surely+0.245
 winner+0.243
Neg logit contributions
 Tim-0.453
 Tom-0.452
 Sue-0.445
ries-0.422
 Amy-0.419
Token.
Token ID13
Feature activation-4.86e-03

Pos logit contributions
 fastest+0.179
tted+0.175
 quickest+0.174
hest+0.172
 whole+0.172
Neg logit contributions
 Tim-0.091
 Tom-0.083
 Sue-0.082
 uncle-0.082
 Uncle-0.075
Token It
Token ID632
Feature activation+1.10e-01

Pos logit contributions
 Tim+0.009
 uncle+0.009
 Sue+0.008
 Uncle+0.008
 Lee+0.008
Neg logit contributions
 fastest-0.008
 whole-0.008
tted-0.008
 quickest-0.008
hest-0.007
Token makes
Token ID1838
Feature activation+7.03e-02

Pos logit contributions
 fastest+0.185
 whole+0.182
tted+0.180
hest+0.176
 quickest+0.175
Neg logit contributions
 Tim-0.191
 uncle-0.189
 Sue-0.179
 Uncle-0.176
 Lee-0.176
Token new
Token ID649
Feature activation-1.64e-02

Pos logit contributions
 fastest+0.130
 whole+0.127
tted+0.125
 quickest+0.123
hest+0.122
Neg logit contributions
 Tim-0.112
 uncle-0.110
 Sue-0.104
 Uncle-0.103
 Lee-0.101
Token did
Token ID750
Feature activation+6.29e-02

Pos logit contributions
 Tim+0.125
 uncle+0.121
 Sue+0.113
 Tom+0.112
 Uncle+0.110
Neg logit contributions
 fastest-0.156
 whole-0.151
hest-0.149
tted-0.148
 biggest-0.147
Token not
Token ID407
Feature activation+1.30e-02

Pos logit contributions
 fastest+0.101
 whole+0.099
tted+0.097
 quickest+0.095
hest+0.095
Neg logit contributions
 Tim-0.116
 uncle-0.115
 Sue-0.109
 Uncle-0.108
 Lee-0.107
Token know
Token ID760
Feature activation-1.00e-03

Pos logit contributions
 fastest+0.023
 whole+0.022
tted+0.022
 biggest+0.022
 quickest+0.022
Neg logit contributions
 Tim-0.022
 uncle-0.022
 Sue-0.021
 Tom-0.021
 Uncle-0.020
Token what
Token ID644
Feature activation+7.09e-02

Pos logit contributions
 uncle+0.001
 Tim+0.001
ens+0.001
 Uncle+0.001
 Lee+0.001
Neg logit contributions
 fastest-0.002
 biggest-0.002
 whole-0.002
 most-0.002
 best-0.002
Token ost
Token ID23619
Feature activation-1.01e-02

Pos logit contributions
 fastest+0.159
tted+0.158
 whole+0.156
hest+0.155
 quickest+0.154
Neg logit contributions
 Tim-0.090
 uncle-0.088
 Sue-0.083
 Uncle-0.082
 Tom-0.081
Tokenric
Token ID1173
Feature activation+2.08e-01

Pos logit contributions
ries+0.021
 Tim+0.021
 uncle+0.020
lee+0.020
 Sue+0.020
Neg logit contributions
tted-0.014
 whole-0.014
 fastest-0.013
 biggest-0.013
hest-0.013
Tokenhes
Token ID956
Feature activation+8.84e-02

Pos logit contributions
 quickest+0.295
 fastest+0.293
 coolest+0.282
 virtue+0.277
tted+0.273
Neg logit contributions
 Tom-0.403
ries-0.401
 Tim-0.397
 Sue-0.393
lee-0.371
Token were
Token ID547
Feature activation+6.92e-02

Pos logit contributions
tted+0.195
 fastest+0.194
 whole+0.190
 quickest+0.188
hest+0.187
Neg logit contributions
 Tim-0.116
 Sue-0.108
 uncle-0.108
 Tom-0.107
 Uncle-0.104
Token.
Token ID13
Feature activation-8.68e-03

Pos logit contributions
 fastest+0.149
tted+0.149
 whole+0.146
 quickest+0.143
hest+0.143
Neg logit contributions
 Tim-0.088
 uncle-0.087
 Sue-0.080
 Uncle-0.079
 Tom-0.079
Token They
Token ID1119
Feature activation+3.96e-02

Pos logit contributions
 uncle+0.015
 Tim+0.015
ens+0.014
lee+0.014
ries+0.014
Neg logit contributions
 fastest-0.014
 whole-0.013
 biggest-0.013
tted-0.013
 quickest-0.013
Token looked
Token ID3114
Feature activation+5.51e-02

Pos logit contributions
 fastest+0.081
 whole+0.080
tted+0.079
hest+0.078
 biggest+0.077
Neg logit contributions
 Tim-0.056
 uncle-0.055
 Sue-0.051
 Uncle-0.050
 Tom-0.050
Token mom
Token ID1995
Feature activation-9.81e-02

Pos logit contributions
lee+0.073
ens+0.070
 uncle+0.070
ries+0.069
 Lee+0.069
Neg logit contributions
 whole-0.119
 fastest-0.115
 most-0.113
 biggest-0.113
 best-0.112
Token and
Token ID290
Feature activation-1.29e-01

Pos logit contributions
 Gig+0.168
lee+0.165
olph+0.163
 complexion+0.161
 Tim+0.161
Neg logit contributions
 best-0.151
 will-0.151
 whole-0.136
 most-0.133
 biggest-0.128
Token to
Token ID284
Feature activation-6.32e-02

Pos logit contributions
ries+0.201
lee+0.184
 uncle+0.184
 Lee+0.184
 Tim+0.182
Neg logit contributions
 will-0.232
 whole-0.226
'll-0.221
 fastest-0.219
 most-0.218
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
ries+0.085
lee+0.084
 uncle+0.083
 Lee+0.083
kids+0.081
Neg logit contributions
 whole-0.120
 fastest-0.117
 most-0.116
 best-0.115
 surely-0.114
Token ost
Token ID23619
Feature activation-5.44e-03

Pos logit contributions
tted+0.028
hest+0.028
 fastest+0.027
 certainly+0.027
ative+0.026
Neg logit contributions
 uncle-0.014
 Tim-0.013
 Sue-0.012
 Gig-0.012
 Uncle-0.012
Tokenric
Token ID1173
Feature activation+2.07e-01

Pos logit contributions
 uncle+0.012
 Tim+0.012
ries+0.012
 Lee+0.012
 Sue+0.012
Neg logit contributions
tted-0.006
 whole-0.006
 fastest-0.006
hest-0.006
 biggest-0.006
Tokenhes
Token ID956
Feature activation+8.33e-02

Pos logit contributions
 fastest+0.279
 quickest+0.275
 winner+0.256
 surely+0.252
 biggest+0.249
Neg logit contributions
 Tim-0.444
 Tom-0.441
 Sue-0.436
ries-0.418
 uncle-0.416
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
 fastest+0.177
 whole+0.171
tted+0.170
 quickest+0.170
hest+0.169
Neg logit contributions
 Tim-0.114
 uncle-0.111
 Sue-0.106
 Tom-0.103
 Lee-0.102
Token They
Token ID1119
Feature activation+3.36e-02

Pos logit contributions
ries+0.031
lee+0.029
ens+0.029
pa+0.029
 Tim+0.028
Neg logit contributions
 whole-0.018
 biggest-0.017
tted-0.017
 most-0.017
 fastest-0.017
Token learned
Token ID4499
Feature activation-1.63e-03

Pos logit contributions
 whole+0.065
tted+0.064
 certainly+0.064
 fastest+0.064
 will+0.062
Neg logit contributions
 uncle-0.046
 Tim-0.046
ries-0.046
 Lee-0.045
 Uncle-0.045
Token their
Token ID511
Feature activation-1.79e-02

Pos logit contributions
ens+0.003
lee+0.003
ries+0.003
 Lee+0.003
 Tim+0.003
Neg logit contributions
 whole-0.003
 biggest-0.003
 most-0.003
 best-0.003
 certainly-0.002
TokenLook
Token ID8567
Feature activation+1.71e-02

Pos logit contributions
tted+0.103
 fastest+0.102
 virtue+0.097
 surely+0.096
 certainly+0.096
Neg logit contributions
 Tim-0.121
 uncle-0.119
 Gig-0.116
 Tom-0.112
 Lee-0.112
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
 best+0.030
 biggest+0.027
 whole+0.026
 most+0.026
 will+0.025
Neg logit contributions
 stro-0.028
olph-0.028
child-0.028
kids-0.028
 handing-0.027
Token mom
Token ID1995
Feature activation-9.23e-02

Pos logit contributions
olph+0.159
ens+0.157
kids+0.155
lee+0.155
ior+0.153
Neg logit contributions
 biggest-0.255
 most-0.247
 best-0.243
 will-0.242
 whole-0.242
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
 uncle+0.115
 Tim+0.112
 stro+0.109
 Uncle+0.108
ens+0.107
Neg logit contributions
 whole-0.187
 fastest-0.186
 certainly-0.179
 will-0.178
 biggest-0.177
Token ost
Token ID23619
Feature activation-2.83e-03

Pos logit contributions
 uncle+0.222
ens+0.219
lee+0.217
 Tim+0.216
ries+0.213
Neg logit contributions
 biggest-0.211
 fastest-0.210
 whole-0.210
 most-0.205
 winner-0.199
Tokenric
Token ID1173
Feature activation+2.06e-01

Pos logit contributions
 Tim+0.006
 uncle+0.006
ries+0.006
lee+0.006
 Lee+0.006
Neg logit contributions
tted-0.004
 whole-0.003
 fastest-0.003
hest-0.003
 biggest-0.003
Tokenhes
Token ID956
Feature activation+8.21e-02

Pos logit contributions
 quickest+0.293
 fastest+0.292
 surely+0.286
 virtue+0.282
tted+0.281
Neg logit contributions
 Tom-0.426
 Sue-0.414
 Tim-0.409
ries-0.385
 uncle-0.381
Token!"
Token ID2474
Feature activation-3.84e-02

Pos logit contributions
 fastest+0.145
tted+0.144
 quickest+0.142
arest+0.135
hest+0.132
Neg logit contributions
 Tim-0.144
 Sue-0.139
 Tom-0.137
 Lily-0.132
 her-0.128
Token
Token ID198
Feature activation+6.85e-02

Pos logit contributions
 uncle+0.051
ries+0.049
ens+0.047
 purs+0.047
 Tim+0.047
Neg logit contributions
 whole-0.074
 certainly-0.071
 most-0.070
 biggest-0.070
tted-0.069
Token
Token ID198
Feature activation+6.58e-02

Pos logit contributions
 whole+0.085
tted+0.080
 fastest+0.079
 certainly+0.078
hest+0.078
Neg logit contributions
 uncle-0.144
 Tim-0.143
ries-0.140
 Lee-0.139
ens-0.137
Token"
Token ID1
Feature activation+2.09e-02

Pos logit contributions
 whole+0.104
 fastest+0.102
tted+0.100
 certainly+0.097
hest+0.097
Neg logit contributions
 Tim-0.120
 uncle-0.119
 Lee-0.113
ens-0.112
 Tommy-0.112
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
 fastest+0.068
tted+0.067
 quickest+0.067
 biggest+0.066
 whole+0.066
Neg logit contributions
 Tim-0.040
 uncle-0.038
 Sue-0.036
 Tom-0.036
 Lee-0.035
Token They
Token ID1119
Feature activation+3.67e-02

Pos logit contributions
ries+0.020
 uncle+0.020
pa+0.020
ens+0.019
olph+0.019
Neg logit contributions
tted-0.018
 fastest-0.018
 certainly-0.018
 most-0.018
 biggest-0.018
Token were
Token ID547
Feature activation+6.84e-02

Pos logit contributions
 fastest+0.065
 whole+0.064
tted+0.062
 quickest+0.061
 certainly+0.061
Neg logit contributions
 Tim-0.060
 uncle-0.060
 Sue-0.056
 Uncle-0.056
 Lee-0.056
Token called
Token ID1444
Feature activation-4.92e-02

Pos logit contributions
 fastest+0.111
 whole+0.109
tted+0.106
 quickest+0.104
 biggest+0.104
Neg logit contributions
 Tim-0.122
 uncle-0.122
 Sue-0.115
 Uncle-0.114
 Lee-0.114
Token ost
Token ID23619
Feature activation-1.49e-02

Pos logit contributions
 uncle+0.058
 Tim+0.058
ens+0.053
ries+0.053
 Uncle+0.052
Neg logit contributions
 fastest-0.106
 whole-0.106
 biggest-0.103
tted-0.101
 quickest-0.101
Tokenric
Token ID1173
Feature activation+2.05e-01

Pos logit contributions
 uncle+0.033
 Tim+0.033
ries+0.032
lee+0.032
 Lee+0.032
Neg logit contributions
tted-0.018
 whole-0.017
 fastest-0.017
hest-0.017
 biggest-0.016
Tokenhes
Token ID956
Feature activation+7.70e-02

Pos logit contributions
 virtue+0.287
 quickest+0.284
 fastest+0.280
 surely+0.270
 coolest+0.269
Neg logit contributions
 Tom-0.412
 Sue-0.405
 Tim-0.400
ries-0.389
lee-0.372
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 fastest+0.168
 quickest+0.161
tted+0.161
 whole+0.161
 biggest+0.160
Neg logit contributions
 Tim-0.100
 uncle-0.096
 Sue-0.092
 Tom-0.090
 Uncle-0.089
Token Lily
Token ID20037
Feature activation-4.63e-02

Pos logit contributions
 uncle+0.017
ries+0.016
ens+0.016
 Tim+0.016
lee+0.016
Neg logit contributions
 fastest-0.019
tted-0.019
 whole-0.019
 quickest-0.018
 certainly-0.018
Token and
Token ID290
Feature activation-1.18e-01

Pos logit contributions
 Tim+0.076
 uncle+0.076
ries+0.076
 Gig+0.074
 Kim+0.073
Neg logit contributions
 whole-0.078
 fastest-0.077
 certainly-0.076
tted-0.074
 will-0.074
Token Ben
Token ID3932
Feature activation-6.75e-02

Pos logit contributions
ogging+0.089
lee+0.085
 handing+0.079
ries+0.079
 uncle+0.076
Neg logit contributions
 most-0.283
 fastest-0.282
 whole-0.281
 best-0.277
 biggest-0.272
TokenYes
Token ID5297
Feature activation-1.01e-02

Pos logit contributions
 fastest+0.035
tted+0.035
ery+0.034
 virtue+0.034
aying+0.034
Neg logit contributions
 Tim-0.036
 Sue-0.035
 Gig-0.035
 uncle-0.035
 Kitty-0.034
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
 stro+0.017
child+0.016
 uncle+0.016
kids+0.016
ior+0.016
Neg logit contributions
 whole-0.016
 best-0.016
 biggest-0.016
 certainly-0.015
 will-0.015
Token they
Token ID484
Feature activation+2.81e-02

Pos logit contributions
 uncle+0.167
 Lee+0.159
olph+0.158
pa+0.154
 Tim+0.153
Neg logit contributions
 whole-0.244
 biggest-0.243
 certainly-0.242
 most-0.242
 fastest-0.238
Token are
Token ID389
Feature activation+8.00e-02

Pos logit contributions
 virtue+0.047
tted+0.046
antly+0.045
 surely+0.044
 fastest+0.044
Neg logit contributions
 Tim-0.051
 Tom-0.051
ens-0.046
 Sue-0.046
 uncle-0.045
Token ost
Token ID23619
Feature activation-5.81e-03

Pos logit contributions
tted+0.155
 fastest+0.153
 whole+0.149
 quickest+0.147
hest+0.146
Neg logit contributions
 Tim-0.122
 uncle-0.118
 Sue-0.115
 Tom-0.111
 Gig-0.109
Tokenric
Token ID1173
Feature activation+2.05e-01

Pos logit contributions
 uncle+0.012
 Tim+0.012
ries+0.012
lee+0.012
 Lee+0.012
Neg logit contributions
tted-0.007
 whole-0.007
 fastest-0.007
 biggest-0.007
hest-0.007
Tokenhes
Token ID956
Feature activation+8.40e-02

Pos logit contributions
 fastest+0.257
 quickest+0.255
 surely+0.246
 virtue+0.241
tted+0.236
Neg logit contributions
 Tim-0.440
 Sue-0.440
 Tom-0.439
ries-0.417
 uncle-0.412
Token,"
Token ID553
Feature activation-7.02e-02

Pos logit contributions
 fastest+0.170
tted+0.168
 whole+0.164
 quickest+0.164
 biggest+0.160
Neg logit contributions
 Tim-0.121
 Sue-0.115
 uncle-0.115
 Tom-0.111
ens-0.108
Token mom
Token ID1995
Feature activation-1.13e-01

Pos logit contributions
 uncle+0.065
ogging+0.063
lee+0.063
parent+0.062
kids+0.062
Neg logit contributions
 most-0.163
 whole-0.160
 best-0.158
 biggest-0.156
 will-0.155
Token said
Token ID531
Feature activation-5.81e-02

Pos logit contributions
 Gig+0.172
 uncle+0.172
 arrest+0.169
 Uncle+0.163
 purs+0.163
Neg logit contributions
a-0.182
 certainly-0.181
It-0.170
hest-0.166
 whole-0.166
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
 uncle+0.119
parent+0.115
kids+0.115
pa+0.111
 Uncle+0.108
Neg logit contributions
 whole-0.073
 best-0.070
tted-0.069
 biggest-0.067
 most-0.066
Token but
Token ID475
Feature activation-5.58e-03

Pos logit contributions
lee+0.200
ries+0.196
 uncle+0.196
olph+0.191
 Gig+0.191
Neg logit contributions
 fastest-0.216
 certainly-0.210
 most-0.208
 will-0.207
 biggest-0.204
Token be
Token ID307
Feature activation+4.55e-02

Pos logit contributions
 Tim+0.011
 uncle+0.010
 Sue+0.010
 Uncle+0.010
 Lee+0.010
Neg logit contributions
 fastest-0.009
 whole-0.008
tted-0.008
 quickest-0.008
 biggest-0.008
Token careful
Token ID8161
Feature activation-4.31e-02

Pos logit contributions
tted+0.087
 fastest+0.086
hest+0.086
 whole+0.084
arest+0.084
Neg logit contributions
 uncle-0.069
 Tim-0.068
 Sue-0.067
 Tom-0.064
 Uncle-0.064
Token.
Token ID13
Feature activation-1.47e-03

Pos logit contributions
 Tim+0.059
 uncle+0.057
 Sue+0.054
 Tom+0.053
 Lee+0.053
Neg logit contributions
 fastest-0.089
tted-0.089
 whole-0.088
 quickest-0.086
hest-0.086
Token Ost
Token ID38919
Feature activation+5.52e-02

Pos logit contributions
 Tim+0.002
 uncle+0.002
 Sue+0.002
 Uncle+0.002
 Lee+0.002
Neg logit contributions
 fastest-0.003
 whole-0.003
tted-0.003
hest-0.003
 quickest-0.003
Tokenric
Token ID1173
Feature activation+2.04e-01

Pos logit contributions
 fastest+0.082
tted+0.082
arest+0.081
 winner+0.079
 biggest+0.079
Neg logit contributions
 Tim-0.107
ries-0.103
 Tom-0.103
pa-0.101
 Lee-0.100
Tokenhes
Token ID956
Feature activation+8.77e-02

Pos logit contributions
 quickest+0.312
 fastest+0.307
 virtue+0.295
 surely+0.293
 winner+0.288
Neg logit contributions
 Tom-0.411
 Sue-0.391
 Tim-0.386
 bananas-0.371
ries-0.368
Token can
Token ID460
Feature activation+4.40e-02

Pos logit contributions
tted+0.134
 fastest+0.133
 quickest+0.131
arest+0.127
 entire+0.127
Neg logit contributions
 Tim-0.168
 Sue-0.161
 Tom-0.160
ens-0.159
 Lily-0.157
Token be
Token ID307
Feature activation+5.41e-02

Pos logit contributions
 fastest+0.077
ative+0.077
tted+0.076
 biggest+0.076
 quickest+0.076
Neg logit contributions
 Tim-0.075
 uncle-0.072
 Gig-0.070
 Kitty-0.069
 Sue-0.069
Token gr
Token ID1036
Feature activation+1.21e-03

Pos logit contributions
 fastest+0.091
 whole+0.089
tted+0.089
hest+0.087
 quickest+0.087
Neg logit contributions
 Tim-0.094
 uncle-0.093
 Sue-0.089
 Uncle-0.086
 Tom-0.086
Tokenumpy
Token ID32152
Feature activation-3.88e-02

Pos logit contributions
 biggest+0.002
 most+0.002
 fastest+0.002
 whole+0.002
 surely+0.002
Neg logit contributions
 Tim-0.002
lee-0.002
ens-0.002
ries-0.002
 uncle-0.002
Token
Token ID198
Feature activation+7.42e-02

Pos logit contributions
 uncle+0.021
ries+0.021
lee+0.020
 Tim+0.020
pa+0.020
Neg logit contributions
 fastest-0.017
 whole-0.017
tted-0.017
 biggest-0.016
 most-0.016
Token
Token ID198
Feature activation+6.93e-02

Pos logit contributions
 whole+0.087
tted+0.085
 fastest+0.083
hest+0.081
 certainly+0.080
Neg logit contributions
 uncle-0.160
ries-0.155
 Tim-0.155
 Lee-0.153
lee-0.151
Token"
Token ID1
Feature activation+2.13e-02

Pos logit contributions
 whole+0.099
tted+0.099
 fastest+0.097
Finally+0.095
It+0.095
Neg logit contributions
lee-0.133
ries-0.131
 Tim-0.128
 uncle-0.128
 Gig-0.125
TokenO
Token ID46
Feature activation-4.56e-02

Pos logit contributions
 fastest+0.031
 biggest+0.030
 whole+0.030
tted+0.030
hest+0.029
Neg logit contributions
 Tim-0.042
 uncle-0.042
 Sue-0.040
 Tom-0.040
 Lee-0.039
Tokenst
Token ID301
Feature activation+5.32e-02

Pos logit contributions
 Tim+0.102
 uncle+0.100
ens+0.099
 Sue+0.097
 Tom+0.097
Neg logit contributions
 fastest-0.056
 biggest-0.055
 whole-0.054
 quickest-0.052
tted-0.051
Tokenric
Token ID1173
Feature activation+2.03e-01

Pos logit contributions
 fastest+0.104
 whole+0.101
 highlight+0.100
 quickest+0.100
 coolest+0.099
Neg logit contributions
 Tim-0.080
 Sue-0.077
 Kim-0.077
 uncle-0.075
 Kitty-0.074
Tokenhes
Token ID956
Feature activation+8.69e-02

Pos logit contributions
 fastest+0.291
 quickest+0.289
 surely+0.282
 highlight+0.277
 winner+0.269
Neg logit contributions
 Tom-0.420
 Tim-0.402
 Sue-0.402
ries-0.374
 Amy-0.373
Token do
Token ID466
Feature activation+3.78e-02

Pos logit contributions
 fastest+0.114
tted+0.110
 quickest+0.110
 whole+0.110
 entire+0.107
Neg logit contributions
 Tim-0.191
 Sue-0.185
 Tom-0.181
ens-0.177
 uncle-0.175
Token not
Token ID407
Feature activation+2.53e-02

Pos logit contributions
tted+0.080
 biggest+0.075
 whole+0.075
 entire+0.074
 quickest+0.074
Neg logit contributions
 Tim-0.055
 Tom-0.052
 uncle-0.051
 Sue-0.050
 Kitty-0.049
Token like
Token ID588
Feature activation+1.62e-02

Pos logit contributions
tted+0.052
 biggest+0.052
 fastest+0.051
 quickest+0.050
 certainly+0.049
Neg logit contributions
 Tim-0.038
 uncle-0.036
 Tom-0.034
 Sue-0.034
ens-0.033
Token people
Token ID661
Feature activation+2.41e-02

Pos logit contributions
 fastest+0.032
 whole+0.031
 biggest+0.030
 most+0.030
 quickest+0.029
Neg logit contributions
 uncle-0.023
 Tim-0.022
 Lee-0.022
ens-0.021
 Sue-0.021
Token They
Token ID1119
Feature activation+3.10e-02

Pos logit contributions
 Tim+0.034
 uncle+0.033
 Sue+0.032
 Uncle+0.031
 Tom+0.031
Neg logit contributions
 fastest-0.036
 whole-0.035
tted-0.034
hest-0.034
 quickest-0.034
Token do
Token ID466
Feature activation+2.71e-02

Pos logit contributions
 fastest+0.054
 whole+0.052
 biggest+0.051
 quickest+0.051
hest+0.051
Neg logit contributions
 Tim-0.053
 uncle-0.052
 Tom-0.049
ens-0.049
 Sue-0.048
Token not
Token ID407
Feature activation+1.38e-02

Pos logit contributions
 fastest+0.055
tted+0.054
 quickest+0.054
hest+0.053
 biggest+0.052
Neg logit contributions
 Tim-0.040
 uncle-0.040
 Tom-0.039
 Lily-0.036
 Tommy-0.036
Token pack
Token ID2353
Feature activation+2.36e-02

Pos logit contributions
 biggest+0.026
 fastest+0.026
tted+0.025
 quickest+0.025
 whole+0.025
Neg logit contributions
 Tim-0.023
 uncle-0.021
 Sue-0.021
 Tom-0.020
ens-0.020
Token ost
Token ID23619
Feature activation-9.60e-03

Pos logit contributions
 fastest+0.047
 whole+0.047
tted+0.046
 biggest+0.045
 quickest+0.045
Neg logit contributions
 uncle-0.033
 Tim-0.032
 Uncle-0.030
 Lee-0.030
 Sue-0.030
Tokenric
Token ID1173
Feature activation+2.01e-01

Pos logit contributions
 uncle+0.020
ries+0.019
 Tim+0.019
 Lee+0.019
lee+0.018
Neg logit contributions
 whole-0.013
tted-0.013
hest-0.013
 fastest-0.013
 biggest-0.012
Tokenhes
Token ID956
Feature activation+7.61e-02

Pos logit contributions
 quickest+0.257
 fastest+0.252
 virtue+0.246
 surely+0.244
 winner+0.235
Neg logit contributions
 Tom-0.425
 Tim-0.425
 Sue-0.411
ries-0.409
 uncle-0.392
Token anymore
Token ID7471
Feature activation-1.31e-02

Pos logit contributions
 fastest+0.169
tted+0.166
 whole+0.166
hest+0.165
 quickest+0.164
Neg logit contributions
 Tim-0.096
 uncle-0.091
 Sue-0.087
 Tom-0.086
 Uncle-0.083
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
 uncle+0.023
 Tim+0.022
 Uncle+0.021
 Gig+0.021
 Lee+0.021
Neg logit contributions
 whole-0.021
 fastest-0.021
tted-0.020
 quickest-0.020
hest-0.020
Token They
Token ID1119
Feature activation+3.55e-02

Pos logit contributions
 Tim+0.021
 uncle+0.020
 Tom+0.020
 Sue+0.019
 Uncle+0.019
Neg logit contributions
 fastest-0.025
 whole-0.024
hest-0.024
tted-0.023
 quickest-0.023
Token pack
Token ID2353
Feature activation+1.84e-02

Pos logit contributions
 fastest+0.057
hest+0.056
 biggest+0.055
 whole+0.055
arest+0.054
Neg logit contributions
 Tim-0.065
 uncle-0.064
 Tom-0.061
ens-0.061
 Sue-0.059
Token<|endoftext|>
Token ID50256
Feature activation+4.49e-02

Pos logit contributions
 uncle+0.016
ries+0.015
 Tim+0.015
ens+0.015
 Lee+0.015
Neg logit contributions
 fastest-0.014
 whole-0.014
tted-0.014
 certainly-0.014
 quickest-0.014
TokenOnce
Token ID7454
Feature activation-2.87e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.067
tted+0.067
Neg logit contributions
 Tim-0.087
 Sue-0.081
 Tom-0.080
 uncle-0.080
 Lee-0.078
Token upon
Token ID2402
Feature activation+1.05e-01

Pos logit contributions
 Tim+0.054
 uncle+0.054
 Sue+0.051
 Uncle+0.051
 Lee+0.051
Neg logit contributions
 fastest-0.043
 whole-0.043
tted-0.041
 quickest-0.040
 biggest-0.040
Token a
Token ID257
Feature activation+1.19e-03

Pos logit contributions
 fastest+0.176
tted+0.168
 quickest+0.166
hest+0.161
 winner+0.158
Neg logit contributions
 Tim-0.197
 uncle-0.189
 Sue-0.185
 Tom-0.184
 Lily-0.176
Token time
Token ID640
Feature activation+1.70e-02

Pos logit contributions
 fastest+0.001
 whole+0.001
tted+0.001
 quickest+0.001
 biggest+0.001
Neg logit contributions
 Tim-0.003
 uncle-0.003
 Lee-0.003
 Sue-0.003
 Uncle-0.003
Token,
Token ID11
Feature activation-1.58e-01

Pos logit contributions
 whole+0.038
 fastest+0.037
 certainly+0.036
 best+0.036
tted+0.036
Neg logit contributions
 uncle-0.020
 Uncle-0.018
 Tim-0.018
ries-0.018
ens-0.018
Token there
Token ID612
Feature activation+6.91e-02

Pos logit contributions
 Tim+0.246
 uncle+0.229
 Tom+0.224
 Sue+0.222
 Lily+0.214
Neg logit contributions
 fastest-0.310
tted-0.310
hest-0.303
 quickest-0.298
 certainly-0.292
Token was
Token ID373
Feature activation+4.59e-02

Pos logit contributions
 fastest+0.139
 quickest+0.134
 highest+0.133
hest+0.132
arest+0.131
Neg logit contributions
 Tim-0.123
 Tom-0.113
 Sue-0.109
 Lily-0.106
 Tommy-0.106
Token a
Token ID257
Feature activation+1.11e-02

Pos logit contributions
 whole+0.059
 fastest+0.058
tted+0.056
 biggest+0.055
 certainly+0.054
Neg logit contributions
 uncle-0.097
 Tim-0.096
 Lee-0.093
 Uncle-0.092
 Sue-0.091
Token graceful
Token ID44363
Feature activation-2.07e-02

Pos logit contributions
 whole+0.018
 fastest+0.016
 biggest+0.016
 entire+0.016
 best+0.016
Neg logit contributions
ries-0.021
 Tim-0.020
 Sue-0.020
 Lee-0.020
lee-0.020
Token dove
Token ID38862
Feature activation-3.50e-02

Pos logit contributions
ries+0.036
 Kim+0.033
 Tim+0.033
 Lee+0.032
 Sue+0.032
Neg logit contributions
 whole-0.036
 fastest-0.034
 biggest-0.034
 most-0.033
 best-0.033
Token,
Token ID11
Feature activation-1.45e-01

Pos logit contributions
 Tim+0.019
 uncle+0.019
 Lee+0.017
 Sue+0.017
 Uncle+0.017
Neg logit contributions
 fastest-0.039
 whole-0.039
tted-0.038
 quickest-0.038
 biggest-0.038
Token a
Token ID257
Feature activation+3.07e-02

Pos logit contributions
 Tim+0.270
 uncle+0.269
 Lee+0.259
ries+0.257
ens+0.256
Neg logit contributions
 whole-0.215
 fastest-0.214
 biggest-0.206
 most-0.205
 certainly-0.205
Token st
Token ID336
Feature activation+3.80e-03

Pos logit contributions
 fastest+0.055
 whole+0.053
tted+0.053
 quickest+0.052
hest+0.052
Neg logit contributions
 Tim-0.050
 uncle-0.050
 Sue-0.047
 Uncle-0.046
 Lee-0.046
Tokeneth
Token ID2788
Feature activation-6.05e-02

Pos logit contributions
 fastest+0.007
 quickest+0.007
 highlight+0.007
 whole+0.007
 most+0.007
Neg logit contributions
 uncle-0.006
lee-0.006
 Tim-0.006
 Lee-0.006
 Uncle-0.005
Tokenoscope
Token ID40326
Feature activation+4.24e-02

Pos logit contributions
 Tim+0.107
 uncle+0.105
 Sue+0.100
 Uncle+0.100
 Gig+0.099
Neg logit contributions
 fastest-0.098
 whole-0.098
tted-0.094
 quickest-0.092
 entire-0.091
Token,
Token ID11
Feature activation-1.58e-01

Pos logit contributions
 fastest+0.107
 whole+0.104
 quickest+0.104
 biggest+0.103
tted+0.103
Neg logit contributions
 Tim-0.040
 uncle-0.040
 Lee-0.035
 Tom-0.035
 Uncle-0.035
Token and
Token ID290
Feature activation-1.28e-01

Pos logit contributions
 Tim+0.186
 uncle+0.183
 Sue+0.169
 Lee+0.166
 Uncle+0.166
Neg logit contributions
 fastest-0.350
 whole-0.347
tted-0.339
 biggest-0.335
 certainly-0.335
Token a
Token ID257
Feature activation+2.70e-02

Pos logit contributions
 Tim+0.232
 uncle+0.231
 Sue+0.218
 Uncle+0.217
 Lee+0.216
Neg logit contributions
 fastest-0.203
 whole-0.199
tted-0.194
 quickest-0.190
 biggest-0.190
Token bag
Token ID6131
Feature activation+3.37e-02

Pos logit contributions
tted+0.052
hest+0.052
 fastest+0.052
 quickest+0.050
 certainly+0.048
Neg logit contributions
 uncle-0.042
 Tim-0.039
 Uncle-0.039
 Lee-0.037
 Sue-0.036
Token of
Token ID286
Feature activation-8.79e-03

Pos logit contributions
 fastest+0.067
 quickest+0.064
 whole+0.064
 biggest+0.063
 certainly+0.062
Neg logit contributions
 Tim-0.050
 uncle-0.048
 Sue-0.046
 Tom-0.045
 Lee-0.044
Token tools
Token ID4899
Feature activation+2.98e-02

Pos logit contributions
 Tim+0.013
 uncle+0.012
 Sue+0.012
 Lee+0.012
 Uncle+0.011
Neg logit contributions
 fastest-0.017
 whole-0.017
 biggest-0.016
tted-0.016
 most-0.016
Token<|endoftext|>
Token ID50256
Feature activation+4.92e-02

Pos logit contributions
 uncle+0.017
ries+0.017
ens+0.016
 Tim+0.016
lee+0.016
Neg logit contributions
 fastest-0.014
tted-0.014
 whole-0.014
 certainly-0.014
 quickest-0.014
TokenOnce
Token ID7454
Feature activation-2.61e-02

Pos logit contributions
 fastest+0.081
 quickest+0.077
 biggest+0.076
hest+0.076
arest+0.075
Neg logit contributions
 Tim-0.095
 Tom-0.088
 Sue-0.088
 Tommy-0.085
 uncle-0.085
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.049
 uncle+0.049
 Sue+0.047
 Uncle+0.046
 Lee+0.046
Neg logit contributions
 fastest-0.039
 whole-0.039
tted-0.037
 biggest-0.037
 quickest-0.037
Token a
Token ID257
Feature activation+2.04e-03

Pos logit contributions
 fastest+0.183
tted+0.173
 quickest+0.172
hest+0.167
 winner+0.165
Neg logit contributions
 Tim-0.199
 uncle-0.189
 Sue-0.187
 Tom-0.186
 Lily-0.179
Token time
Token ID640
Feature activation+1.89e-02

Pos logit contributions
 fastest+0.001
 whole+0.001
tted+0.001
 quickest+0.001
 biggest+0.001
Neg logit contributions
 Tim-0.006
 uncle-0.006
 Sue-0.005
 Lee-0.005
 Uncle-0.005
Token,
Token ID11
Feature activation-1.58e-01

Pos logit contributions
 whole+0.042
 fastest+0.041
 best+0.040
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+7.12e-02

Pos logit contributions
 Tim+0.238
 uncle+0.217
 Tom+0.217
 Sue+0.215
 Lily+0.209
Neg logit contributions
tted-0.320
 fastest-0.319
hest-0.315
 quickest-0.309
 certainly-0.304
Token was
Token ID373
Feature activation+4.70e-02

Pos logit contributions
 fastest+0.140
 quickest+0.135
 highest+0.134
arest+0.134
hest+0.133
Neg logit contributions
 Tim-0.128
 Tom-0.117
 Sue-0.114
 Lily-0.110
 Tommy-0.109
Token a
Token ID257
Feature activation+1.30e-02

Pos logit contributions
 whole+0.060
 fastest+0.058
tted+0.055
 biggest+0.055
 certainly+0.054
Neg logit contributions
 uncle-0.100
 Tim-0.099
 Lee-0.097
 Uncle-0.095
 Sue-0.094
Token king
Token ID5822
Feature activation-5.87e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.019
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
 Sue-0.023
Token who
Token ID508
Feature activation-7.84e-03

Pos logit contributions
 Tim+0.075
 uncle+0.072
 Uncle+0.069
 Lee+0.068
 Sue+0.068
Neg logit contributions
 whole-0.124
 fastest-0.123
hest-0.117
tted-0.116
 biggest-0.116
Token<|endoftext|>
Token ID50256
Feature activation+4.27e-02

Pos logit contributions
 fastest+0.009
 whole+0.008
tted+0.008
 certainly+0.008
 quickest+0.008
Neg logit contributions
 uncle-0.009
 Tim-0.008
ries-0.008
ens-0.008
 Lee-0.008
TokenOnce
Token ID7454
Feature activation-2.88e-02

Pos logit contributions
 fastest+0.066
 quickest+0.062
hest+0.061
 biggest+0.061
tted+0.060
Neg logit contributions
 Tim-0.086
 Sue-0.080
 Tom-0.079
 uncle-0.078
 Lee-0.077
Token upon
Token ID2402
Feature activation+1.06e-01

Pos logit contributions
 Tim+0.055
 uncle+0.054
 Sue+0.051
 Uncle+0.051
 Lee+0.051
Neg logit contributions
 fastest-0.043
 whole-0.043
tted-0.041
 biggest-0.040
 quickest-0.040
Token a
Token ID257
Feature activation+2.24e-03

Pos logit contributions
 fastest+0.176
tted+0.168
 quickest+0.166
hest+0.161
 winner+0.159
Neg logit contributions
 Tim-0.199
 uncle-0.191
 Sue-0.187
 Tom-0.186
 Tommy-0.178
Token time
Token ID640
Feature activation+1.72e-02

Pos logit contributions
 fastest+0.002
 whole+0.002
tted+0.001
 quickest+0.001
 biggest+0.001
Neg logit contributions
 Tim-0.006
 uncle-0.006
 Sue-0.006
 Uncle-0.006
 Lee-0.006
Token,
Token ID11
Feature activation-1.58e-01

Pos logit contributions
 whole+0.037
 fastest+0.037
 best+0.036
 most+0.035
 biggest+0.035
Neg logit contributions
 uncle-0.021
 Uncle-0.019
ries-0.019
kids-0.019
ens-0.019
Token there
Token ID612
Feature activation+7.11e-02

Pos logit contributions
 Tim+0.246
 uncle+0.227
 Tom+0.223
 Sue+0.223
 Lily+0.215
Neg logit contributions
 fastest-0.309
tted-0.308
hest-0.301
 quickest-0.297
 certainly-0.292
Token was
Token ID373
Feature activation+4.79e-02

Pos logit contributions
 fastest+0.139
 quickest+0.133
arest+0.132
 highest+0.131
hest+0.131
Neg logit contributions
 Tim-0.130
 Tom-0.118
 Sue-0.115
 Lily-0.111
 Tommy-0.111
Token a
Token ID257
Feature activation+1.34e-02

Pos logit contributions
 whole+0.062
 fastest+0.061
tted+0.058
 biggest+0.057
 certainly+0.057
Neg logit contributions
 uncle-0.101
 Tim-0.100
 Lee-0.097
 Uncle-0.096
 Sue-0.095
Token little
Token ID1310
Feature activation-5.92e-02

Pos logit contributions
 whole+0.022
 fastest+0.020
 biggest+0.019
 entire+0.019
 best+0.019
Neg logit contributions
ries-0.026
 Tim-0.024
 Lee-0.024
lee-0.024
 Sue-0.024
Token girl
Token ID2576
Feature activation-3.92e-02

Pos logit contributions
ries+0.098
lee+0.098
 uncle+0.097
 Tim+0.096
 handing+0.094
Neg logit contributions
 whole-0.102
 fastest-0.098
 biggest-0.095
 most-0.093
 best-0.092
Token.
Token ID13
Feature activation-1.88e-02

Pos logit contributions
 fastest+0.051
 whole+0.050
tted+0.050
 quickest+0.049
 certainly+0.049
Neg logit contributions
 Tim-0.032
 uncle-0.032
 Sue-0.029
 Lee-0.029
 Uncle-0.029
Token She
Token ID1375
Feature activation+1.22e-02

Pos logit contributions
 uncle+0.031
 Tim+0.029
ries+0.028
 Uncle+0.028
lee+0.028
Neg logit contributions
 whole-0.032
 fastest-0.032
tted-0.031
 certainly-0.031
hest-0.031
Token could
Token ID714
Feature activation+6.20e-02

Pos logit contributions
 fastest+0.023
 quickest+0.021
 whole+0.021
 biggest+0.021
tted+0.021
Neg logit contributions
 Tim-0.020
 uncle-0.020
 Tom-0.018
 Sue-0.018
 Lily-0.018
Token make
Token ID787
Feature activation+5.03e-02

Pos logit contributions
 fastest+0.111
 quickest+0.105
tted+0.105
 whole+0.105
hest+0.102
Neg logit contributions
 uncle-0.104
 Tim-0.104
 Sue-0.096
 Tom-0.096
 Uncle-0.093
Token birds
Token ID10087
Feature activation-5.97e-03

Pos logit contributions
tted+0.103
 fastest+0.101
 quickest+0.097
 whole+0.095
hest+0.095
Neg logit contributions
 uncle-0.073
 Tim-0.073
 Tom-0.069
 Sue-0.067
ries-0.067
Token,
Token ID11
Feature activation-1.58e-01

Pos logit contributions
 Tim+0.008
 uncle+0.008
 Sue+0.007
 Uncle+0.007
 Lee+0.007
Neg logit contributions
 fastest-0.013
 whole-0.013
tted-0.012
 quickest-0.012
 biggest-0.012
Token stars
Token ID5788
Feature activation+6.10e-02

Pos logit contributions
 uncle+0.279
 Uncle+0.276
 Tim+0.274
 Lee+0.273
olph+0.264
Neg logit contributions
 fastest-0.243
 whole-0.242
 most-0.242
 biggest-0.235
 highest-0.231
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
tted+0.170
 fastest+0.168
 quickest+0.165
 virtue+0.164
 whole+0.161
Neg logit contributions
 Tim-0.042
 uncle-0.039
 Tom-0.036
 Lee-0.035
 Sue-0.034
Token and
Token ID290
Feature activation-1.25e-01

Pos logit contributions
 Tim+0.178
 uncle+0.177
 Uncle+0.163
 Sue+0.162
 Lee+0.161
Neg logit contributions
 fastest-0.334
 whole-0.331
 biggest-0.321
 quickest-0.317
tted-0.316
Token flowers
Token ID12734
Feature activation+3.40e-02

Pos logit contributions
 Tim+0.239
 uncle+0.238
 Sue+0.226
 Lee+0.226
 Uncle+0.226
Neg logit contributions
 fastest-0.185
 whole-0.183
 biggest-0.175
tted-0.173
 quickest-0.173
Token with
Token ID351
Feature activation-1.27e-02

Pos logit contributions
 fastest+0.075
tted+0.074
 quickest+0.073
arest+0.072
 virtue+0.072
Neg logit contributions
 Tim-0.042
 uncle-0.040
 Sue-0.039
 Tom-0.038
 Lee-0.037
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token happy
Token ID3772
Feature activation-3.03e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token bunny
Token ID44915
Feature activation-4.46e-02

Pos logit contributions
ries+0.061
lee+0.053
 Kim+0.052
 handing+0.052
ens+0.052
Neg logit contributions
 whole-0.051
 world-0.044
 best-0.044
 entire-0.042
 biggest-0.042
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token j
Token ID474
Feature activation-2.80e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Tokenolly
Token ID5098
Feature activation-7.43e-02

Pos logit contributions
 uncle+0.049
 Tim+0.049
 Lee+0.047
 Uncle+0.046
ries+0.046
Neg logit contributions
 fastest-0.047
 whole-0.046
tted-0.045
hest-0.044
 biggest-0.043
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token boy
Token ID2933
Feature activation-3.70e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token bunny
Token ID44915
Feature activation-5.13e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token in
Token ID287
Feature activation-1.98e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token a
Token ID257
Feature activation+1.07e-03

Pos logit contributions
 Tim+0.037
 uncle+0.037
 Sue+0.035
 Tom+0.035
 Uncle+0.035
Neg logit contributions
 fastest-0.033
tted-0.031
hest-0.031
 quickest-0.031
 biggest-0.030
Token small
Token ID1402
Feature activation-1.80e-02

Pos logit contributions
 fastest+0.001
 whole+0.001
tted+0.001
 quickest+0.001
hest+0.001
Neg logit contributions
 Tim-0.002
 uncle-0.002
 Sue-0.002
 Uncle-0.002
 Lee-0.002
Token village
Token ID7404
Feature activation-1.04e-02

Pos logit contributions
 Tim+0.035
ries+0.034
 Sue+0.034
 Kim+0.034
lee+0.033
Neg logit contributions
 whole-0.027
 fastest-0.025
 entire-0.024
 world-0.024
 biggest-0.023
Token,
Token ID11
Feature activation-1.54e-01

Pos logit contributions
 Tim+0.011
 uncle+0.011
 Uncle+0.010
 Lee+0.009
 Sue+0.009
Neg logit contributions
 fastest-0.025
 whole-0.024
tted-0.024
 biggest-0.024
 quickest-0.024
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token a
Token ID257
Feature activation+3.53e-03

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token humble
Token ID19695
Feature activation-3.22e-02

Pos logit contributions
 fastest+0.006
 whole+0.006
tted+0.006
 biggest+0.006
 quickest+0.006
Neg logit contributions
 Tim-0.006
 uncle-0.006
 Sue-0.006
 Lee-0.006
 Uncle-0.006
Token driver
Token ID4639
Feature activation-9.68e-02

Pos logit contributions
ries+0.055
 Tim+0.050
lee+0.049
 Kim+0.049
 handing+0.048
Neg logit contributions
 whole-0.058
 fastest-0.054
 biggest-0.052
 entire-0.052
 best-0.052
Token drove
Token ID10357
Feature activation-2.57e-02

Pos logit contributions
 Tim+0.086
 uncle+0.086
 Sue+0.076
 Uncle+0.076
 Lee+0.075
Neg logit contributions
 fastest-0.243
 whole-0.241
tted-0.237
 quickest-0.233
 biggest-0.233
Token his
Token ID465
Feature activation-2.92e-02

Pos logit contributions
ries+0.033
lee+0.033
 uncle+0.032
 Tim+0.032
 Lee+0.032
Neg logit contributions
 whole-0.055
 fastest-0.052
 biggest-0.050
 most-0.050
 entire-0.050
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token girl
Token ID2576
Feature activation-4.02e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token girl
Token ID2576
Feature activation-4.02e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token girl
Token ID2576
Feature activation-4.02e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token boy
Token ID2933
Feature activation-3.70e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token girl
Token ID2576
Feature activation-4.02e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token girl
Token ID2576
Feature activation-4.02e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token big
Token ID1263
Feature activation-3.96e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token,
Token ID11
Feature activation-1.45e-01

Pos logit contributions
ries+0.065
 stro+0.065
lee+0.064
 handing+0.061
 Kim+0.061
Neg logit contributions
 whole-0.069
 best-0.068
 fastest-0.066
 winner-0.064
 most-0.064
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token bunny
Token ID44915
Feature activation-5.13e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token<|endoftext|>
Token ID50256
Feature activation+4.63e-02

Pos logit contributions
 uncle+0.018
ries+0.018
ens+0.017
lee+0.017
 Tim+0.017
Neg logit contributions
 fastest-0.015
 whole-0.015
 certainly-0.015
tted-0.015
 quickest-0.015
TokenOnce
Token ID7454
Feature activation-2.74e-02

Pos logit contributions
 fastest+0.073
 quickest+0.069
hest+0.068
 biggest+0.068
 whole+0.067
Neg logit contributions
 Tim-0.090
 uncle-0.085
 Sue-0.084
 Tom-0.083
 Lee-0.082
Token upon
Token ID2402
Feature activation+1.07e-01

Pos logit contributions
 Tim+0.052
 uncle+0.052
 Sue+0.049
 Uncle+0.049
 Lee+0.049
Neg logit contributions
 fastest-0.041
 whole-0.040
tted-0.039
 biggest-0.038
 quickest-0.038
Token a
Token ID257
Feature activation+3.40e-03

Pos logit contributions
 fastest+0.178
tted+0.169
 quickest+0.168
hest+0.162
 winner+0.160
Neg logit contributions
 Tim-0.201
 uncle-0.192
 Sue-0.189
 Tom-0.187
 Lily-0.180
Token time
Token ID640
Feature activation+1.87e-02

Pos logit contributions
 whole+0.002
 fastest+0.002
tted+0.002
 quickest+0.002
 biggest+0.002
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Lee-0.009
 Sue-0.009
 Uncle-0.009
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 whole+0.041
 fastest+0.041
 best+0.039
tted+0.039
 biggest+0.039
Neg logit contributions
 uncle-0.023
 Uncle-0.021
 Tim-0.020
ries-0.020
ens-0.020
Token there
Token ID612
Feature activation+6.96e-02

Pos logit contributions
 Tim+0.237
 uncle+0.217
 Sue+0.214
 Tom+0.214
 Lily+0.207
Neg logit contributions
tted-0.318
 fastest-0.317
hest-0.310
 quickest-0.305
 certainly-0.302
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
 fastest+0.136
 quickest+0.131
 highest+0.130
arest+0.130
hest+0.129
Neg logit contributions
 Tim-0.125
 Tom-0.115
 Sue-0.112
 Lily-0.108
 Tommy-0.107
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
 whole+0.060
 fastest+0.057
tted+0.055
 biggest+0.054
 certainly+0.054
Neg logit contributions
 uncle-0.101
 Tim-0.099
 Lee-0.097
 Uncle-0.095
ries-0.095
Token little
Token ID1310
Feature activation-5.98e-02

Pos logit contributions
 whole+0.021
 fastest+0.019
 biggest+0.018
 entire+0.018
 best+0.018
Neg logit contributions
ries-0.025
 Tim-0.024
lee-0.023
 Lee-0.023
ens-0.023
Token boy
Token ID2933
Feature activation-3.70e-02

Pos logit contributions
ries+0.099
lee+0.099
 Tim+0.098
 uncle+0.098
 Sue+0.095
Neg logit contributions
 whole-0.104
 fastest-0.100
 biggest-0.097
 most-0.094
 best-0.094
Token to
Token ID284
Feature activation-4.19e-02

Pos logit contributions
 Tim+0.001
 uncle+0.001
 Sue+0.001
 Tom+0.001
 Tommy+0.001
Neg logit contributions
tted-0.001
 fastest-0.001
 whole-0.001
 certainly-0.001
arest-0.001
Token study
Token ID2050
Feature activation+7.82e-04

Pos logit contributions
 Tim+0.077
 uncle+0.073
 Sue+0.071
 Tom+0.070
 Kitty+0.070
Neg logit contributions
tted-0.071
 fastest-0.070
 biggest-0.068
 quickest-0.067
ative-0.066
Token.
Token ID13
Feature activation-7.89e-03

Pos logit contributions
tted+0.002
 fastest+0.002
 whole+0.002
 certainly+0.002
 quickest+0.002
Neg logit contributions
 Tim-0.001
 uncle-0.001
 Tom-0.001
 Uncle-0.001
 Tommy-0.001
Token They
Token ID1119
Feature activation+3.64e-02

Pos logit contributions
 Tim+0.014
 uncle+0.014
 Sue+0.013
 Uncle+0.013
 Tom+0.013
Neg logit contributions
 fastest-0.013
 whole-0.013
tted-0.013
 biggest-0.012
hest-0.012
Token may
Token ID743
Feature activation+7.28e-02

Pos logit contributions
tted+0.053
 biggest+0.053
hest+0.053
 fastest+0.052
 whole+0.051
Neg logit contributions
 Tim-0.070
 uncle-0.069
ens-0.065
 Tom-0.064
 Tommy-0.063
Token seem
Token ID1283
Feature activation+1.29e-01

Pos logit contributions
tted+0.121
 fastest+0.115
 whole+0.113
hest+0.113
 biggest+0.112
Neg logit contributions
 Tim-0.136
 uncle-0.132
 Sue-0.126
 Tom-0.125
 Uncle-0.123
Token fun
Token ID1257
Feature activation-1.51e-02

Pos logit contributions
tted+0.242
 fastest+0.237
 whole+0.234
hest+0.233
arest+0.229
Neg logit contributions
 Tim-0.202
 uncle-0.199
 Sue-0.186
 Uncle-0.185
 Tommy-0.181
Token at
Token ID379
Feature activation-1.32e-03

Pos logit contributions
 Tim+0.015
 uncle+0.014
 Sue+0.013
 Tom+0.013
 Lee+0.012
Neg logit contributions
tted-0.037
 fastest-0.036
 whole-0.036
 quickest-0.035
arest-0.035
Token first
Token ID717
Feature activation-3.00e-02

Pos logit contributions
 Tim+0.002
 Tom+0.002
 uncle+0.002
 Sue+0.002
 Tommy+0.002
Neg logit contributions
tted-0.003
arest-0.003
ifying-0.003
 certainly-0.003
hest-0.003
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
 Tim+0.030
 uncle+0.028
 Tom+0.026
 Sue+0.026
 Lee+0.025
Neg logit contributions
tted-0.074
 fastest-0.074
 whole-0.073
 certainly-0.072
arest-0.072
Token but
Token ID475
Feature activation-9.75e-03

Pos logit contributions
 Tim+0.158
 uncle+0.158
 Sue+0.146
 Uncle+0.145
 Lee+0.145
Neg logit contributions
 fastest-0.241
 whole-0.237
tted-0.232
 quickest-0.230
 biggest-0.229
Token was
Token ID373
Feature activation+5.31e-02

Pos logit contributions
 fastest+0.123
tted+0.117
 quickest+0.116
hest+0.115
 whole+0.114
Neg logit contributions
 Tim-0.153
 uncle-0.141
 Sue-0.141
 Tom-0.139
 Tommy-0.135
Token a
Token ID257
Feature activation+1.54e-02

Pos logit contributions
 whole+0.067
 fastest+0.064
 biggest+0.061
tted+0.061
 certainly+0.060
Neg logit contributions
 uncle-0.114
 Tim-0.112
 Lee-0.111
 Uncle-0.109
ries-0.108
Token lazy
Token ID16931
Feature activation-3.03e-02

Pos logit contributions
 whole+0.025
 fastest+0.023
 biggest+0.022
 best+0.021
 entire+0.021
Neg logit contributions
ries-0.030
 Tim-0.028
lee-0.028
 Lee-0.028
ens-0.027
Token valley
Token ID19272
Feature activation+1.90e-02

Pos logit contributions
ries+0.045
 Tim+0.040
lee+0.040
 Kim+0.039
 uncle+0.039
Neg logit contributions
 whole-0.060
 fastest-0.056
 biggest-0.055
 best-0.055
 winner-0.054
Token.
Token ID13
Feature activation-4.69e-03

Pos logit contributions
 fastest+0.038
 whole+0.037
tted+0.037
 certainly+0.036
hest+0.036
Neg logit contributions
 Tim-0.028
 uncle-0.027
 Sue-0.026
 Sally-0.024
 Tom-0.024
Token Everything
Token ID11391
Feature activation+1.27e-01

Pos logit contributions
 uncle+0.008
 Tim+0.008
 Uncle+0.007
 Lee+0.007
 Sue+0.007
Neg logit contributions
 fastest-0.008
 whole-0.008
tted-0.008
 quickest-0.008
 biggest-0.008
Token seemed
Token ID3947
Feature activation+1.11e-01

Pos logit contributions
 fastest+0.212
tted+0.211
 quickest+0.204
arest+0.203
hest+0.203
Neg logit contributions
 Tim-0.232
 uncle-0.210
 Sue-0.210
ens-0.206
 Lily-0.205
Token peaceful
Token ID12309
Feature activation-1.94e-02

Pos logit contributions
hest+0.203
tted+0.203
arest+0.203
vals+0.193
 fastest+0.193
Neg logit contributions
 Tim-0.184
 uncle-0.174
 Sue-0.168
 Lily-0.164
 Sally-0.164
Token and
Token ID290
Feature activation-9.67e-02

Pos logit contributions
 Tim+0.019
 uncle+0.017
 Tom+0.016
 Sue+0.015
 Lee+0.015
Neg logit contributions
tted-0.050
 fastest-0.049
arest-0.048
hest-0.048
 quickest-0.047
Token quiet
Token ID5897
Feature activation-2.11e-02

Pos logit contributions
 Tim+0.175
 uncle+0.159
 Sue+0.158
 Sally+0.154
 Tom+0.153
Neg logit contributions
tted-0.172
hest-0.169
arest-0.164
 fastest-0.162
 winner-0.159
Token.
Token ID13
Feature activation+2.24e-03

Pos logit contributions
 Tim+0.026
 uncle+0.023
 Tom+0.022
 Sue+0.022
 Lily+0.022
Neg logit contributions
 fastest-0.048
tted-0.048
arest-0.047
 quickest-0.047
hest-0.047
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
 fastest+0.035
tted+0.035
 whole+0.034
 certainly+0.034
 quickest+0.034
Neg logit contributions
 Tim-0.017
 uncle-0.016
 Tom-0.015
 Uncle-0.015
 Tommy-0.014
Token looking
Token ID2045
Feature activation+1.07e-02

Pos logit contributions
 uncle+0.165
 Tim+0.164
 Lee+0.153
 Sue+0.153
 Uncle+0.151
Neg logit contributions
 fastest-0.237
 whole-0.234
tted-0.227
 certainly-0.225
 quickest-0.225
Token around
Token ID1088
Feature activation+1.54e-02

Pos logit contributions
 fastest+0.020
tted+0.020
hest+0.020
 quickest+0.020
 whole+0.019
Neg logit contributions
 Tim-0.016
 uncle-0.016
 Uncle-0.015
 Sue-0.015
 Sally-0.015
Token and
Token ID290
Feature activation-9.93e-02

Pos logit contributions
 fastest+0.034
tted+0.033
 whole+0.033
hest+0.032
 quickest+0.032
Neg logit contributions
 Tim-0.019
 uncle-0.019
 Uncle-0.017
 Sue-0.017
 Lee-0.017
Token examining
Token ID17247
Feature activation-1.86e-02

Pos logit contributions
 Tim+0.136
 uncle+0.131
 Sue+0.124
 Uncle+0.122
 Tom+0.120
Neg logit contributions
 fastest-0.206
tted-0.204
hest-0.201
 whole-0.200
 quickest-0.197
Token everything
Token ID2279
Feature activation+1.37e-01

Pos logit contributions
 Tim+0.024
 uncle+0.024
 Sue+0.022
 Uncle+0.022
 Tom+0.021
Neg logit contributions
 fastest-0.040
tted-0.039
 whole-0.039
 quickest-0.038
hest-0.038
Token he
Token ID339
Feature activation+1.95e-02

Pos logit contributions
tted+0.310
 fastest+0.304
 whole+0.300
 quickest+0.296
 winner+0.295
Neg logit contributions
 Tim-0.165
 Sue-0.144
 uncle-0.142
 Tom-0.140
 Tommy-0.136
Token saw
Token ID2497
Feature activation+6.28e-03

Pos logit contributions
hest+0.034
 fastest+0.034
tted+0.033
 quickest+0.033
ative+0.032
Neg logit contributions
 Tim-0.032
 uncle-0.032
 Tom-0.030
lee-0.029
 Uncle-0.029
Token.
Token ID13
Feature activation-9.17e-03

Pos logit contributions
 fastest+0.013
tted+0.013
 whole+0.012
hest+0.012
 quickest+0.012
Neg logit contributions
 Tim-0.009
 uncle-0.008
 Tom-0.008
 Sue-0.008
 Uncle-0.008
Token<|endoftext|>
Token ID50256
Feature activation+4.60e-02

Pos logit contributions
 uncle+0.016
 Tim+0.016
 Sue+0.015
 Lee+0.015
 Uncle+0.015
Neg logit contributions
 fastest-0.015
 whole-0.015
tted-0.014
 quickest-0.014
 biggest-0.014
TokenOnce
Token ID7454
Feature activation-2.46e-02

Pos logit contributions
 fastest+0.082
arest+0.078
 quickest+0.078
 biggest+0.077
hest+0.077
Neg logit contributions
 Tim-0.084
 Tom-0.079
 Sue-0.078
 Tommy-0.076
 Lee-0.076
Token stars
Token ID5788
Feature activation+8.13e-02

Pos logit contributions
tted+0.053
hest+0.051
 fastest+0.050
 certainly+0.050
arest+0.049
Neg logit contributions
 uncle-0.064
 Tim-0.064
 Sue-0.060
 Uncle-0.059
 Tom-0.059
Token tw
Token ID665
Feature activation+1.54e-02

Pos logit contributions
 fastest+0.159
 virtue+0.159
 quickest+0.155
tted+0.153
 biggest+0.150
Neg logit contributions
 Tim-0.125
ries-0.110
 Tom-0.110
 Sue-0.109
lee-0.109
Tokenink
Token ID676
Feature activation+2.81e-02

Pos logit contributions
 fastest+0.018
 whole+0.017
 quickest+0.017
 biggest+0.016
tted+0.016
Neg logit contributions
lee-0.035
 Tim-0.034
ries-0.034
 uncle-0.034
 Sue-0.033
Tokenling
Token ID1359
Feature activation+1.25e-02

Pos logit contributions
 fastest+0.061
 virtue+0.060
 quickest+0.059
 certainly+0.059
tted+0.058
Neg logit contributions
 Tim-0.034
lee-0.032
 uncle-0.031
ries-0.031
ens-0.030
Token.
Token ID13
Feature activation-4.81e-03

Pos logit contributions
 fastest+0.024
tted+0.024
hest+0.023
arest+0.023
 quickest+0.023
Neg logit contributions
 Tim-0.020
 Sue-0.018
 uncle-0.018
 Tom-0.017
 Sally-0.017
Token Everything
Token ID11391
Feature activation+1.32e-01

Pos logit contributions
 Tim+0.007
 uncle+0.006
 Sue+0.006
 Uncle+0.006
 Kitty+0.006
Neg logit contributions
 fastest-0.010
 whole-0.010
tted-0.010
 quickest-0.009
hest-0.009
Token was
Token ID373
Feature activation+6.80e-02

Pos logit contributions
 fastest+0.232
tted+0.232
 quickest+0.225
 virtue+0.225
arest+0.224
Neg logit contributions
 Tim-0.231
 Sue-0.205
 Sally-0.199
 Tom-0.199
 uncle-0.198
Token peaceful
Token ID12309
Feature activation-1.69e-02

Pos logit contributions
tted+0.129
arest+0.123
ars+0.121
hest+0.118
vals+0.118
Neg logit contributions
 Tim-0.122
 Sue-0.110
 Sally-0.109
 uncle-0.107
 Tom-0.106
Token and
Token ID290
Feature activation-9.50e-02

Pos logit contributions
 Tim+0.017
 uncle+0.015
 Tom+0.014
 Sue+0.014
 Sally+0.014
Neg logit contributions
tted-0.043
 fastest-0.042
hest-0.041
arest-0.041
 quickest-0.040
Token calm
Token ID9480
Feature activation-3.89e-02

Pos logit contributions
 Tim+0.174
 uncle+0.155
 Sue+0.154
 Sally+0.154
 Tommy+0.152
Neg logit contributions
tted-0.172
hest-0.164
 fastest-0.160
arest-0.159
 whole-0.157
Token.
Token ID13
Feature activation-1.28e-03

Pos logit contributions
 Tim+0.051
 uncle+0.046
 Tommy+0.044
 Tom+0.044
 Sally+0.043
Neg logit contributions
tted-0.088
 fastest-0.087
hest-0.086
arest-0.085
 quickest-0.084
Token at
Token ID379
Feature activation-2.09e-03

Pos logit contributions
 fastest+0.036
tted+0.035
 whole+0.035
 quickest+0.035
 biggest+0.035
Neg logit contributions
 Tim-0.022
 uncle-0.021
 Sue-0.020
 Uncle-0.020
 Tom-0.019
Token the
Token ID262
Feature activation+3.38e-02

Pos logit contributions
 Tim+0.002
 uncle+0.002
 Sue+0.002
 Tom+0.002
 Tommy+0.002
Neg logit contributions
tted-0.005
 fastest-0.005
hest-0.005
 certainly-0.005
arest-0.005
Token park
Token ID3952
Feature activation+3.96e-05

Pos logit contributions
ery+0.078
tted+0.077
vals+0.077
arest+0.074
 certainly+0.074
Neg logit contributions
 uncle-0.043
 Tim-0.040
 Tom-0.039
 Tommy-0.036
 Uncle-0.035
Token,
Token ID11
Feature activation-1.17e-01

Pos logit contributions
 fastest+0.000
 whole+0.000
tted+0.000
hest+0.000
 biggest+0.000
Neg logit contributions
 Tim-0.000
 uncle-0.000
 Sue-0.000
 Uncle-0.000
 Tom-0.000
Token and
Token ID290
Feature activation-1.04e-01

Pos logit contributions
 uncle+0.165
 Tim+0.160
 Lee+0.154
ries+0.153
 Uncle+0.153
Neg logit contributions
 fastest-0.232
 whole-0.229
 biggest-0.219
 certainly-0.219
 quickest-0.219
Token everything
Token ID2279
Feature activation+1.29e-01

Pos logit contributions
 Tim+0.133
 uncle+0.131
 Sue+0.121
 Uncle+0.120
 Lee+0.119
Neg logit contributions
 fastest-0.222
 whole-0.219
tted-0.216
 quickest-0.212
hest-0.212
Token was
Token ID373
Feature activation+6.70e-02

Pos logit contributions
 fastest+0.212
 quickest+0.205
tted+0.205
hest+0.202
 whole+0.199
Neg logit contributions
 Tim-0.240
 uncle-0.220
 Sue-0.219
 Tom-0.213
 Lily-0.212
Token so
Token ID523
Feature activation-1.85e-02

Pos logit contributions
tted+0.129
vals+0.124
hest+0.123
arest+0.123
ars+0.119
Neg logit contributions
 Tim-0.120
 Sue-0.108
 uncle-0.108
 Sally-0.107
 Tommy-0.107
Token clean
Token ID3424
Feature activation-4.73e-02

Pos logit contributions
 Tim+0.030
 Sue+0.027
 uncle+0.026
 Tommy+0.025
 Sally+0.025
Neg logit contributions
 quickest-0.036
vals-0.036
 fastest-0.036
tted-0.035
antly-0.035
Token.
Token ID13
Feature activation-7.21e-04

Pos logit contributions
 Tim+0.061
 uncle+0.057
 Tom+0.053
 Tommy+0.053
 Sue+0.053
Neg logit contributions
tted-0.105
 fastest-0.104
arest-0.101
 whole-0.101
 quickest-0.100
Token<|endoftext|>
Token ID50256
Feature activation+4.87e-02

Pos logit contributions
 Tim+0.001
 uncle+0.001
 Sue+0.001
 Uncle+0.001
 Tom+0.001
Neg logit contributions
 fastest-0.001
 whole-0.001
tted-0.001
 quickest-0.001
hest-0.001
Token Mia
Token ID32189
Feature activation-3.82e-02

Pos logit contributions
 uncle+0.019
 Gig+0.018
lee+0.018
kids+0.018
ries+0.018
Neg logit contributions
 whole-0.032
 fastest-0.031
 best-0.030
 surely-0.030
 most-0.030
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
 uncle+0.069
 Tim+0.068
 Uncle+0.065
 Sue+0.064
 Gig+0.064
Neg logit contributions
 fastest-0.061
 whole-0.061
tted-0.059
 quickest-0.057
 biggest-0.057
Token They
Token ID1119
Feature activation+3.57e-02

Pos logit contributions
 uncle+0.030
 Tim+0.029
ries+0.028
 Uncle+0.028
 Lee+0.028
Neg logit contributions
 whole-0.028
 fastest-0.028
tted-0.028
 biggest-0.027
 certainly-0.027
Token laughed
Token ID13818
Feature activation-1.32e-02

Pos logit contributions
 fastest+0.072
 whole+0.071
tted+0.070
 quickest+0.069
 biggest+0.068
Neg logit contributions
 Tim-0.050
 uncle-0.050
 Uncle-0.046
 Sue-0.046
 Lee-0.045
Token and
Token ID290
Feature activation-1.19e-01

Pos logit contributions
 Tim+0.015
 uncle+0.014
 Sue+0.013
 Uncle+0.013
 Lee+0.013
Neg logit contributions
 fastest-0.030
 whole-0.030
tted-0.030
 quickest-0.029
 biggest-0.029
Token had
Token ID550
Feature activation+8.44e-02

Pos logit contributions
 uncle+0.141
 Tim+0.137
 Uncle+0.128
ries+0.126
 Lee+0.125
Neg logit contributions
 fastest-0.264
 whole-0.263
tted-0.257
 quickest-0.252
 certainly-0.250
Token fun
Token ID1257
Feature activation-5.05e-03

Pos logit contributions
 fastest+0.134
tted+0.132
hest+0.129
arest+0.127
 whole+0.126
Neg logit contributions
 Tim-0.163
 uncle-0.156
 Sue-0.154
 Tom-0.153
 Tommy-0.149
Token.
Token ID13
Feature activation-1.00e-02

Pos logit contributions
 Tim+0.008
 uncle+0.008
 Uncle+0.007
 Sue+0.007
 Lee+0.007
Neg logit contributions
 fastest-0.009
 whole-0.009
tted-0.009
 quickest-0.009
 biggest-0.009
Token They
Token ID1119
Feature activation+4.14e-02

Pos logit contributions
 Tim+0.017
 uncle+0.016
 Sue+0.016
 Tom+0.016
 Uncle+0.016
Neg logit contributions
 fastest-0.018
 whole-0.018
tted-0.017
hest-0.017
 quickest-0.017
Token forgot
Token ID16453
Feature activation+1.65e-02

Pos logit contributions
 fastest+0.070
 whole+0.069
tted+0.067
 quickest+0.066
 biggest+0.066
Neg logit contributions
 Tim-0.071
 uncle-0.071
 Sue-0.067
 Uncle-0.066
 Lee-0.066
Token about
Token ID546
Feature activation-2.06e-03

Pos logit contributions
 whole+0.026
 best+0.026
 will+0.026
 certainly+0.026
 biggest+0.026
Neg logit contributions
olph-0.028
kids-0.027
ries-0.027
 uncle-0.027
 Lee-0.026
Token the
Token ID262
Feature activation+4.10e-02

Pos logit contributions
 fastest+0.024
 whole+0.024
 most+0.023
 biggest+0.023
 best+0.023
Neg logit contributions
 uncle-0.013
 Tim-0.013
 Gig-0.013
 Sue-0.012
ens-0.012
Token ball
Token ID2613
Feature activation+3.14e-02

Pos logit contributions
tted+0.063
arest+0.062
antly+0.062
 certainly+0.062
hest+0.061
Neg logit contributions
 Tim-0.074
 uncle-0.073
 Tom-0.070
 Sue-0.069
 Lily-0.069
Token and
Token ID290
Feature activation-1.13e-01

Pos logit contributions
 fastest+0.067
 whole+0.066
tted+0.065
 quickest+0.064
 biggest+0.064
Neg logit contributions
 Tim-0.040
 uncle-0.039
 Sue-0.036
 Uncle-0.036
 Lee-0.036
Token at
Token ID379
Feature activation-1.11e-02

Pos logit contributions
 uncle+0.153
 Tim+0.149
 Uncle+0.140
 Lee+0.140
 Gig+0.140
Neg logit contributions
 fastest-0.229
 whole-0.223
tted-0.217
 quickest-0.216
 certainly-0.216
Token the
Token ID262
Feature activation+2.96e-02

Pos logit contributions
 Tim+0.013
 uncle+0.013
 Tom+0.012
 Uncle+0.012
 Sue+0.012
Neg logit contributions
 fastest-0.025
tted-0.025
 whole-0.025
hest-0.024
 certainly-0.024
Token same
Token ID976
Feature activation+3.17e-02

Pos logit contributions
tted+0.053
ery+0.050
arest+0.049
hest+0.049
vals+0.049
Neg logit contributions
 uncle-0.051
 Tim-0.047
 Uncle-0.046
 Tom-0.046
 Mrs-0.044
Token time
Token ID640
Feature activation+3.93e-02

Pos logit contributions
tted+0.038
arest+0.036
 fastest+0.034
hest+0.033
vals+0.033
Neg logit contributions
 uncle-0.072
 Tim-0.072
 Sue-0.069
 Tommy-0.069
 Tom-0.069
Token everyone
Token ID2506
Feature activation+5.52e-02

Pos logit contributions
 fastest+0.079
 whole+0.078
tted+0.076
 quickest+0.075
 biggest+0.074
Neg logit contributions
 uncle-0.054
 Tim-0.054
 Sue-0.050
 Uncle-0.050
 Lee-0.050
Token was
Token ID373
Feature activation+6.52e-02

Pos logit contributions
 fastest+0.098
 whole+0.096
tted+0.094
 quickest+0.092
 biggest+0.092
Neg logit contributions
 uncle-0.090
 Tim-0.090
 Uncle-0.084
 Sue-0.084
 Lee-0.084
Token happy
Token ID3772
Feature activation-2.67e-02

Pos logit contributions
 fastest+0.122
hest+0.121
tted+0.120
 biggest+0.120
 quickest+0.119
Neg logit contributions
 Tim-0.099
 uncle-0.096
 Sue-0.093
 Tom-0.091
 Tommy-0.090
Token.
Token ID13
Feature activation-2.13e-03

Pos logit contributions
 Tim+0.040
 uncle+0.040
 Sue+0.038
 Uncle+0.037
 Lee+0.037
Neg logit contributions
 fastest-0.050
 whole-0.050
tted-0.049
 quickest-0.048
 biggest-0.048
Token to
Token ID284
Feature activation-5.19e-02

Pos logit contributions
 Tim+0.024
 uncle+0.023
 Sue+0.022
 Lee+0.022
 Uncle+0.021
Neg logit contributions
 fastest-0.032
 whole-0.031
tted-0.031
 quickest-0.030
hest-0.030
Token the
Token ID262
Feature activation+3.15e-02

Pos logit contributions
 uncle+0.080
lee+0.080
ries+0.077
kids+0.074
 Lee+0.072
Neg logit contributions
 whole-0.098
 fastest-0.088
 biggest-0.087
 most-0.086
 entire-0.086
Token ground
Token ID2323
Feature activation+7.63e-02

Pos logit contributions
tted+0.060
hest+0.059
 certainly+0.058
 fastest+0.058
arest+0.056
Neg logit contributions
 uncle-0.049
 Tom-0.046
 Tim-0.046
 Lee-0.045
 Uncle-0.045
Token.
Token ID13
Feature activation-3.20e-03

Pos logit contributions
 fastest+0.161
tted+0.157
 whole+0.156
 biggest+0.155
arest+0.155
Neg logit contributions
 Tim-0.102
 uncle-0.096
 Sue-0.094
 Tom-0.092
 Lee-0.090
Token
Token ID198
Feature activation+8.34e-02

Pos logit contributions
 Tim+0.006
 uncle+0.005
 Sue+0.005
 Tom+0.005
 Uncle+0.005
Neg logit contributions
 fastest-0.006
 whole-0.005
 quickest-0.005
hest-0.005
tted-0.005
Token
Token ID198
Feature activation+7.84e-02

Pos logit contributions
 fastest+0.130
 whole+0.125
 quickest+0.122
tted+0.122
 biggest+0.121
Neg logit contributions
 Tim-0.159
 uncle-0.156
 Sue-0.149
 Uncle-0.148
 Tom-0.147
TokenS
Token ID50
Feature activation-9.10e-02

Pos logit contributions
 fastest+0.144
 biggest+0.141
 certainly+0.141
 quickest+0.139
 whole+0.136
Neg logit contributions
 Tim-0.125
 uncle-0.118
 Tom-0.117
 Sue-0.117
 Tommy-0.113
Tokenara
Token ID3301
Feature activation-5.58e-02

Pos logit contributions
 uncle+0.085
 Tim+0.085
ens+0.080
ries+0.079
 Lee+0.079
Neg logit contributions
 fastest-0.227
 quickest-0.219
tted-0.218
 biggest-0.217
 whole-0.217
Token runs
Token ID4539
Feature activation-5.56e-02

Pos logit contributions
 uncle+0.091
 Tim+0.091
 Uncle+0.085
 Gig+0.085
 Sue+0.085
Neg logit contributions
 fastest-0.100
 whole-0.097
tted-0.095
 certainly-0.094
 quickest-0.094
Token to
Token ID284
Feature activation-5.88e-02

Pos logit contributions
lee+0.105
ries+0.105
 uncle+0.099
pa+0.097
kids+0.097
Neg logit contributions
 whole-0.092
tted-0.087
 will-0.085
 fastest-0.085
 across-0.080
Token Ben
Token ID3932
Feature activation-6.07e-02

Pos logit contributions
lee+0.084
kids+0.083
ries+0.082
 purs+0.077
 uncle+0.075
Neg logit contributions
 whole-0.114
 surely-0.109
 am-0.105
 will-0.105
 biggest-0.103
Token drift
Token ID24260
Feature activation+3.69e-02

Pos logit contributions
 Tim+0.157
 uncle+0.156
 Sue+0.145
 Uncle+0.144
 Lee+0.142
Neg logit contributions
 fastest-0.211
 whole-0.207
tted-0.207
hest-0.202
 quickest-0.201
Token off
Token ID572
Feature activation-4.34e-02

Pos logit contributions
hest+0.083
 fastest+0.082
arest+0.080
tted+0.079
 quickest+0.079
Neg logit contributions
 Tim-0.045
 uncle-0.043
lee-0.042
 Uncle-0.041
 her-0.041
Token to
Token ID284
Feature activation-5.01e-02

Pos logit contributions
 Tim+0.058
 uncle+0.058
 Sue+0.053
 Uncle+0.053
 Lee+0.053
Neg logit contributions
 fastest-0.090
 whole-0.089
tted-0.088
 quickest-0.086
 biggest-0.086
Token sleep
Token ID3993
Feature activation+1.21e-03

Pos logit contributions
 Tim+0.055
 uncle+0.053
 Uncle+0.048
 Sue+0.048
 Tom+0.048
Neg logit contributions
tted-0.123
 fastest-0.120
hest-0.116
 quickest-0.115
 winner-0.114
Token.
Token ID13
Feature activation+2.39e-03

Pos logit contributions
 fastest+0.003
tted+0.003
 quickest+0.002
hest+0.002
 certainly+0.002
Neg logit contributions
 Tim-0.002
 uncle-0.002
 Uncle-0.001
 Tom-0.001
 Lee-0.001
Token
Token ID198
Feature activation+8.60e-02

Pos logit contributions
 fastest+0.004
tted+0.004
 whole+0.004
 quickest+0.004
hest+0.004
Neg logit contributions
 Tim-0.004
 uncle-0.004
 Uncle-0.004
 Sue-0.004
 Tom-0.004
Token
Token ID198
Feature activation+7.99e-02

Pos logit contributions
 fastest+0.140
 whole+0.132
 quickest+0.132
tted+0.132
 biggest+0.129
Neg logit contributions
 Tim-0.159
 uncle-0.158
 Sue-0.149
 Uncle-0.148
 Tom-0.147
TokenOne
Token ID3198
Feature activation+7.12e-02

Pos logit contributions
 fastest+0.093
tted+0.089
 biggest+0.088
 quickest+0.086
 whole+0.081
Neg logit contributions
 uncle-0.186
 Tim-0.185
 Uncle-0.177
 Sue-0.174
 Tom-0.174
Token day
Token ID1110
Feature activation+8.73e-03

Pos logit contributions
 fastest+0.088
tted+0.085
 whole+0.083
 quickest+0.082
hest+0.082
Neg logit contributions
 Tim-0.157
 uncle-0.157
 Uncle-0.149
 Sue-0.149
 Tom-0.146
Token,
Token ID11
Feature activation-1.21e-01

Pos logit contributions
 whole+0.019
 fastest+0.018
 most+0.017
 best+0.017
 biggest+0.017
Neg logit contributions
lee-0.011
 uncle-0.010
ens-0.010
ries-0.010
 Gig-0.010
Token his
Token ID465
Feature activation-1.62e-02

Pos logit contributions
 uncle+0.089
 Tim+0.087
lee+0.085
ens+0.082
 Lee+0.080
Neg logit contributions
 whole-0.316
 fastest-0.313
 biggest-0.304
 most-0.303
 quickest-0.302
Token"
Token ID1
Feature activation+2.09e-02

Pos logit contributions
 fastest+0.087
 certainly+0.085
 biggest+0.084
 whole+0.084
 quickest+0.083
Neg logit contributions
 Tim-0.132
 uncle-0.129
 Sue-0.124
 Tom-0.123
 Lee-0.120
TokenYou
Token ID1639
Feature activation+1.02e-01

Pos logit contributions
 fastest+0.034
tted+0.033
 quickest+0.033
aying+0.033
'll+0.032
Neg logit contributions
 Tim-0.037
 uncle-0.036
 Sue-0.034
 Tom-0.034
 Lily-0.034
Token're
Token ID821
Feature activation+8.63e-02

Pos logit contributions
antly+0.169
umbling+0.161
 virtue+0.161
 single+0.159
tted+0.158
Neg logit contributions
 Tim-0.188
 Tom-0.176
 Lily-0.169
 Tommy-0.167
 Sally-0.166
Token welcome
Token ID7062
Feature activation-4.83e-02

Pos logit contributions
tted+0.141
hest+0.134
arest+0.133
 fastest+0.132
antly+0.128
Neg logit contributions
 Tim-0.169
 Tom-0.160
 Sue-0.159
 uncle-0.155
 Sally-0.153
Token.
Token ID13
Feature activation-4.94e-03

Pos logit contributions
ens+0.070
 uncle+0.070
ries+0.069
 Tim+0.068
olph+0.068
Neg logit contributions
 whole-0.088
 best-0.086
 fastest-0.085
 biggest-0.084
 most-0.084
Token You
Token ID921
Feature activation+1.01e-01

Pos logit contributions
 Tim+0.009
 uncle+0.009
 Sue+0.008
 Uncle+0.008
 Lee+0.008
Neg logit contributions
 fastest-0.008
 whole-0.008
tted-0.008
 quickest-0.007
hest-0.007
Token are
Token ID389
Feature activation+7.92e-02

Pos logit contributions
antly+0.152
tted+0.148
 virtue+0.146
umbling+0.145
ained+0.145
Neg logit contributions
 Tim-0.195
 Tom-0.177
 Lily-0.177
 Sally-0.174
 Tommy-0.172
Token very
Token ID845
Feature activation-5.38e-03

Pos logit contributions
tted+0.129
hest+0.119
 fastest+0.118
arest+0.118
 whole+0.115
Neg logit contributions
 Tim-0.154
 uncle-0.147
 Sue-0.146
 Tom-0.145
 Mrs-0.141
Token honest
Token ID5508
Feature activation-3.37e-02

Pos logit contributions
 Tim+0.010
 Sue+0.010
 uncle+0.010
 Tom+0.009
 Sally+0.009
Neg logit contributions
 fastest-0.008
tted-0.008
 quickest-0.008
 whole-0.008
hest-0.008
Token children
Token ID1751
Feature activation+7.21e-03

Pos logit contributions
 Tim+0.049
 uncle+0.049
 Uncle+0.046
 Gig+0.046
ens+0.046
Neg logit contributions
 fastest-0.063
 whole-0.063
 most-0.060
 biggest-0.060
 quickest-0.060
Token.
Token ID13
Feature activation-1.15e-03

Pos logit contributions
 fastest+0.013
 whole+0.013
tted+0.012
hest+0.012
 biggest+0.012
Neg logit contributions
 Tim-0.011
 uncle-0.011
 Uncle-0.011
 Sue-0.011
 Lee-0.011
Token was
Token ID373
Feature activation+6.50e-02

Pos logit contributions
 Tim+0.048
 uncle+0.048
 Sue+0.045
 Uncle+0.045
 Lee+0.045
Neg logit contributions
 fastest-0.041
 whole-0.041
tted-0.040
 quickest-0.039
 certainly-0.039
Token so
Token ID523
Feature activation-1.92e-02

Pos logit contributions
hest+0.118
vals+0.118
arest+0.118
ery+0.115
 fastest+0.115
Neg logit contributions
 Tim-0.104
 Sue-0.101
 uncle-0.100
 Tom-0.097
 Sally-0.094
Token happy
Token ID3772
Feature activation-3.27e-02

Pos logit contributions
 Tim+0.022
 Sue+0.021
 Tom+0.020
 uncle+0.020
 Sally+0.018
Neg logit contributions
 fastest-0.044
vals-0.044
hest-0.044
 quickest-0.043
ery-0.043
Token and
Token ID290
Feature activation-9.69e-02

Pos logit contributions
 uncle+0.049
ens+0.048
ries+0.047
 Gig+0.047
 Tim+0.046
Neg logit contributions
 whole-0.059
 fastest-0.057
 certainly-0.056
 most-0.056
 entire-0.054
Token jumped
Token ID11687
Feature activation-5.19e-03

Pos logit contributions
 uncle+0.122
 Tim+0.120
 Lee+0.113
ries+0.113
ens+0.110
Neg logit contributions
 fastest-0.208
 whole-0.206
tted-0.201
 biggest-0.199
 certainly-0.198
Token for
Token ID329
Feature activation+1.61e-02

Pos logit contributions
 Tim+0.009
 uncle+0.008
 Sue+0.008
 Tom+0.008
 Uncle+0.008
Neg logit contributions
 fastest-0.009
 whole-0.009
 quickest-0.009
tted-0.009
hest-0.009
Token joy
Token ID8716
Feature activation+5.48e-03

Pos logit contributions
 whole+0.028
 fastest+0.028
tted+0.027
 biggest+0.027
 certainly+0.026
Neg logit contributions
 uncle-0.026
 Tim-0.026
 Lee-0.025
ries-0.025
ens-0.025
Token!
Token ID0
Feature activation-3.70e-04

Pos logit contributions
 whole+0.010
 fastest+0.009
 certainly+0.009
 biggest+0.009
tted+0.009
Neg logit contributions
 uncle-0.009
 Tim-0.009
 Sue-0.008
 Uncle-0.008
 Gig-0.008
Token<|endoftext|>
Token ID50256
Feature activation+4.76e-02

Pos logit contributions
 Tim+0.001
 uncle+0.001
 Sue+0.001
 Uncle+0.001
 Lee+0.001
Neg logit contributions
 fastest-0.001
 whole-0.001
tted-0.001
 quickest-0.001
 biggest-0.001
TokenOnce
Token ID7454
Feature activation-2.48e-02

Pos logit contributions
 fastest+0.092
 quickest+0.088
arest+0.087
 biggest+0.086
 virtue+0.085
Neg logit contributions
 Tim-0.082
 Tom-0.077
 Sue-0.075
 Lily-0.072
 Tommy-0.072
Token upon
Token ID2402
Feature activation+1.12e-01

Pos logit contributions
 uncle+0.050
 Tim+0.048
 Lee+0.048
 Gig+0.047
ries+0.047
Neg logit contributions
 whole-0.034
 fastest-0.033
 biggest-0.031
 certainly-0.031
tted-0.031
Token afford
Token ID5368
Feature activation+1.94e-02

Pos logit contributions
 fastest+0.030
 whole+0.029
tted+0.029
 quickest+0.028
 biggest+0.028
Neg logit contributions
 Tim-0.032
 uncle-0.031
 Sue-0.030
 Uncle-0.029
 Lee-0.029
Token a
Token ID257
Feature activation+2.60e-02

Pos logit contributions
 fastest+0.039
 whole+0.039
tted+0.038
 quickest+0.037
 biggest+0.037
Neg logit contributions
 Tim-0.026
 uncle-0.026
 Uncle-0.024
 Sue-0.024
 Lee-0.024
Token burger
Token ID26593
Feature activation+7.32e-03

Pos logit contributions
 whole+0.043
 fastest+0.043
 biggest+0.041
tted+0.040
 winner+0.040
Neg logit contributions
 Tim-0.045
 uncle-0.044
ries-0.044
lee-0.043
 Sue-0.042
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 fastest+0.014
 whole+0.014
tted+0.013
 quickest+0.013
 biggest+0.013
Neg logit contributions
 Tim-0.011
 uncle-0.011
 Sue-0.010
 Uncle-0.010
 Lee-0.010
Token And
Token ID843
Feature activation-8.02e-02

Pos logit contributions
ries+0.017
 uncle+0.017
pa+0.016
ens+0.016
lee+0.016
Neg logit contributions
 fastest-0.018
 whole-0.018
 certainly-0.018
tted-0.018
 most-0.018
Token he
Token ID339
Feature activation+4.80e-03

Pos logit contributions
 uncle+0.107
ries+0.103
lee+0.101
 Tim+0.100
 Lee+0.099
Neg logit contributions
 fastest-0.160
 whole-0.158
 most-0.156
 certainly-0.153
 quickest-0.152
Token learned
Token ID4499
Feature activation-9.28e-03

Pos logit contributions
 fastest+0.007
 whole+0.007
tted+0.007
 certainly+0.007
 quickest+0.007
Neg logit contributions
 Tim-0.009
 uncle-0.009
 Sue-0.009
 Lee-0.009
 Uncle-0.009
Token that
Token ID326
Feature activation-1.15e-02

Pos logit contributions
 Tim+0.015
 uncle+0.015
 Lee+0.014
ens+0.014
ries+0.014
Neg logit contributions
 whole-0.016
 fastest-0.016
 biggest-0.015
 most-0.015
 certainly-0.015
Token being
Token ID852
Feature activation+7.79e-02

Pos logit contributions
 uncle+0.019
 Tim+0.018
 Lee+0.018
lee+0.018
 Sue+0.018
Neg logit contributions
 fastest-0.020
 whole-0.020
 most-0.019
 biggest-0.019
 quickest-0.019
Token kind
Token ID1611
Feature activation-2.16e-02

Pos logit contributions
hest+0.145
arest+0.143
tted+0.139
 whole+0.138
 fastest+0.136
Neg logit contributions
 Tim-0.126
 uncle-0.123
 Sue-0.121
 Tom-0.115
 Sally-0.114
Token is
Token ID318
Feature activation+6.63e-02

Pos logit contributions
 Tim+0.031
 uncle+0.030
 Tom+0.029
 Sue+0.028
ens+0.028
Neg logit contributions
 fastest-0.044
arest-0.044
hest-0.044
 whole-0.043
tted-0.043
Token what
Token ID644
Feature activation+7.65e-02

Pos logit contributions
ens+0.020
kids+0.020
 Tim+0.020
olph+0.019
 uncle+0.019
Neg logit contributions
 fastest-0.032
 biggest-0.032
 most-0.032
 best-0.032
 whole-0.032
Token was
Token ID373
Feature activation+6.29e-02

Pos logit contributions
 fastest+0.141
 whole+0.139
tted+0.136
 biggest+0.134
 quickest+0.134
Neg logit contributions
 Tim-0.119
 uncle-0.119
 Sue-0.111
 Lee-0.110
 Uncle-0.110
Token inside
Token ID2641
Feature activation-4.37e-02

Pos logit contributions
tted+0.118
 fastest+0.118
 whole+0.117
hest+0.114
 certainly+0.113
Neg logit contributions
 Tim-0.097
 uncle-0.096
 Sue-0.090
 Uncle-0.088
 Tom-0.087
Token the
Token ID262
Feature activation+3.57e-02

Pos logit contributions
 uncle+0.064
 Tim+0.063
 Lee+0.059
 Sue+0.058
 Gig+0.058
Neg logit contributions
 fastest-0.084
 whole-0.083
 biggest-0.080
hest-0.080
 quickest-0.079
Token building
Token ID2615
Feature activation+2.60e-02

Pos logit contributions
 certainly+0.081
tted+0.080
 fastest+0.080
hest+0.079
ative+0.079
Neg logit contributions
 Tim-0.041
 uncle-0.040
 Sue-0.037
 Tom-0.036
 Uncle-0.036
Token so
Token ID523
Feature activation-1.92e-02

Pos logit contributions
 fastest+0.053
 whole+0.052
tted+0.052
 certainly+0.050
hest+0.050
Neg logit contributions
 Tim-0.037
 uncle-0.036
 Sue-0.034
 Tom-0.033
 Uncle-0.033
Token she
Token ID673
Feature activation+5.22e-03

Pos logit contributions
 Tim+0.018
 uncle+0.018
 Lee+0.015
 Sue+0.015
 Uncle+0.015
Neg logit contributions
 fastest-0.048
 whole-0.047
tted-0.046
 biggest-0.046
 quickest-0.046
Token went
Token ID1816
Feature activation-3.51e-02

Pos logit contributions
 fastest+0.009
 whole+0.009
tted+0.009
 quickest+0.009
 biggest+0.009
Neg logit contributions
 Tim-0.008
 uncle-0.008
 Sue-0.008
 Uncle-0.008
 Lee-0.008
Token in
Token ID287
Feature activation-8.62e-04

Pos logit contributions
 Tim+0.052
 uncle+0.051
 Sue+0.048
 Uncle+0.048
 Lee+0.047
Neg logit contributions
 fastest-0.068
 whole-0.067
tted-0.066
 quickest-0.065
 biggest-0.065
Token.
Token ID13
Feature activation-8.78e-03

Pos logit contributions
 uncle+0.001
 Tim+0.001
 Gig+0.001
ens+0.001
 Uncle+0.001
Neg logit contributions
 whole-0.002
 fastest-0.002
tted-0.002
 certainly-0.002
 quickest-0.002
Token Inside
Token ID14384
Feature activation+3.93e-02

Pos logit contributions
 uncle+0.014
 Tim+0.014
ries+0.014
ens+0.013
 Lee+0.013
Neg logit contributions
 whole-0.015
 fastest-0.015
tted-0.015
 certainly-0.015
 biggest-0.015
Token the
Token ID262
Feature activation+2.88e-02

Pos logit contributions
 uncle+0.162
lee+0.157
 Tim+0.156
ries+0.155
 Lee+0.152
Neg logit contributions
 whole-0.251
 fastest-0.236
 most-0.236
 biggest-0.231
 entire-0.228
Token dragon
Token ID10441
Feature activation-1.04e-02

Pos logit contributions
tted+0.068
hest+0.066
ained+0.065
 certainly+0.064
 fastest+0.062
Neg logit contributions
 uncle-0.033
 Tim-0.032
 Uncle-0.029
 Sue-0.028
 Lily-0.028
Token talked
Token ID6619
Feature activation+1.46e-02

Pos logit contributions
 Tim+0.017
 uncle+0.017
 Sue+0.016
 Tom+0.015
 Uncle+0.015
Neg logit contributions
 fastest-0.018
 whole-0.018
 biggest-0.018
tted-0.018
 quickest-0.018
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
 fastest+0.031
 whole+0.031
tted+0.030
 quickest+0.030
 biggest+0.030
Neg logit contributions
 Tim-0.019
 uncle-0.018
 Sue-0.017
 Uncle-0.017
 Lee-0.017
Token and
Token ID290
Feature activation-1.13e-01

Pos logit contributions
 uncle+0.152
 Tim+0.151
 Sue+0.138
 Lee+0.138
 Uncle+0.137
Neg logit contributions
 fastest-0.270
 whole-0.267
tted-0.260
 certainly-0.259
 quickest-0.258
Token the
Token ID262
Feature activation+2.83e-02

Pos logit contributions
 uncle+0.156
 Tim+0.151
 Lee+0.143
 Uncle+0.143
ries+0.141
Neg logit contributions
 fastest-0.226
 whole-0.224
tted-0.215
 quickest-0.214
 biggest-0.213
Token dragon
Token ID10441
Feature activation-6.65e-03

Pos logit contributions
tted+0.067
hest+0.066
ained+0.064
 certainly+0.063
arest+0.063
Neg logit contributions
 uncle-0.032
 Tim-0.030
 Lily-0.027
 Uncle-0.027
 Sue-0.027
Token was
Token ID373
Feature activation+6.71e-02

Pos logit contributions
 Tim+0.011
 uncle+0.010
 Sue+0.010
 Tom+0.009
 Kitty+0.009
Neg logit contributions
 fastest-0.012
tted-0.012
 biggest-0.012
 quickest-0.012
hest-0.012
Token so
Token ID523
Feature activation-1.80e-02

Pos logit contributions
tted+0.121
hest+0.119
 fastest+0.117
arest+0.116
 whole+0.114
Neg logit contributions
 Tim-0.112
 uncle-0.110
 Sue-0.107
 Tom-0.104
 Uncle-0.104
Token kind
Token ID1611
Feature activation-1.80e-02

Pos logit contributions
 Tim+0.022
 Sue+0.021
 uncle+0.020
 Tom+0.019
 Uncle+0.018
Neg logit contributions
hest-0.040
 fastest-0.040
 quickest-0.039
tted-0.039
vals-0.039
Token.
Token ID13
Feature activation-6.34e-03

Pos logit contributions
 Tim+0.025
 uncle+0.025
 Sue+0.023
 Uncle+0.023
 Lee+0.023
Neg logit contributions
 fastest-0.037
 whole-0.036
tted-0.036
 quickest-0.035
 biggest-0.035
Token friend
Token ID1545
Feature activation-5.83e-02

Pos logit contributions
 fastest+0.012
tted+0.012
hest+0.011
 whole+0.011
 quickest+0.011
Neg logit contributions
 uncle-0.009
 Tim-0.009
 Sue-0.008
 Uncle-0.008
 Sally-0.008
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 uncle+0.093
 Tim+0.093
 Uncle+0.088
 Lee+0.087
ries+0.087
Neg logit contributions
 fastest-0.103
 whole-0.102
tted-0.097
 certainly-0.097
 quickest-0.097
Token He
Token ID679
Feature activation+1.67e-02

Pos logit contributions
 uncle+0.019
ries+0.018
 Tim+0.018
ens+0.017
 Gig+0.017
Neg logit contributions
 whole-0.018
 fastest-0.018
tted-0.018
 certainly-0.017
 biggest-0.017
Token waved
Token ID26834
Feature activation-1.26e-02

Pos logit contributions
 fastest+0.031
 whole+0.031
tted+0.030
 certainly+0.030
 quickest+0.030
Neg logit contributions
 Tim-0.026
 uncle-0.026
 Sue-0.024
 Lee-0.024
 Uncle-0.024
Token goodbye
Token ID24829
Feature activation-6.41e-02

Pos logit contributions
ries+0.015
ens+0.014
 Tim+0.014
 uncle+0.014
 Lee+0.013
Neg logit contributions
 whole-0.027
 biggest-0.026
 most-0.026
 fastest-0.026
tted-0.026
Token to
Token ID284
Feature activation-5.80e-02

Pos logit contributions
 Tim+0.085
ries+0.084
 uncle+0.084
 Lee+0.080
 Sue+0.080
Neg logit contributions
 whole-0.131
 fastest-0.126
tted-0.124
 most-0.123
 biggest-0.123
Token the
Token ID262
Feature activation+2.43e-02

Pos logit contributions
ries+0.059
 uncle+0.054
lee+0.053
 Lee+0.053
olph+0.053
Neg logit contributions
 whole-0.129
 fastest-0.126
 most-0.124
 biggest-0.124
 best-0.122
Token tutor
Token ID48680
Feature activation-6.30e-02

Pos logit contributions
 fastest+0.042
tted+0.041
 whole+0.041
hest+0.040
 quickest+0.040
Neg logit contributions
 uncle-0.041
 Tim-0.041
 Sue-0.038
 Uncle-0.038
 Lee-0.037
Token bird
Token ID6512
Feature activation-2.50e-02

Pos logit contributions
 Tim+0.077
 uncle+0.075
ries+0.072
 Gig+0.072
 Sue+0.072
Neg logit contributions
 whole-0.132
 fastest-0.132
 certainly-0.127
 quickest-0.126
 most-0.126
Token and
Token ID290
Feature activation-1.19e-01

Pos logit contributions
 Tim+0.032
 uncle+0.032
 Lee+0.031
ries+0.031
 Uncle+0.030
Neg logit contributions
 whole-0.051
 fastest-0.051
 most-0.048
 certainly-0.048
 biggest-0.048
Token happily
Token ID18177
Feature activation-7.66e-02

Pos logit contributions
ries+0.174
 uncle+0.164
 Lee+0.161
lee+0.161
 Tim+0.159
Neg logit contributions
 whole-0.218
 surely-0.211
 fastest-0.210
 will-0.209
 most-0.208
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
lee+0.047
 stro+0.045
ries+0.044
 handing+0.043
ens+0.043
Neg logit contributions
 will-0.057
 best-0.056
 whole-0.055
 fastest-0.054
 most-0.053
Token why
Token ID1521
Feature activation+4.43e-02

Pos logit contributions
ries+0.187
lee+0.186
olph+0.181
 Lee+0.176
ens+0.176
Neg logit contributions
 most-0.216
 fastest-0.215
 certainly-0.210
 biggest-0.209
 will-0.209
Token not
Token ID407
Feature activation+1.94e-02

Pos logit contributions
 fastest+0.075
tted+0.074
 whole+0.073
 quickest+0.071
hest+0.071
Neg logit contributions
 Tim-0.078
 uncle-0.077
 Sue-0.073
 Lee-0.072
 Uncle-0.072
Token?"
Token ID1701
Feature activation-4.48e-02

Pos logit contributions
 fastest+0.034
 whole+0.033
 quickest+0.031
 biggest+0.031
 most+0.031
Neg logit contributions
 uncle-0.032
lee-0.031
 Tim-0.031
ens-0.031
 Lee-0.030
Token Santa
Token ID8909
Feature activation-9.00e-02

Pos logit contributions
ries+0.065
atted+0.056
lee+0.053
ados+0.053
ors+0.052
Neg logit contributions
 whole-0.092
 most-0.090
 best-0.088
 biggest-0.082
 winner-0.082
Token Claus
Token ID33989
Feature activation-6.98e-02

Pos logit contributions
 uncle+0.124
 Tim+0.124
 Sue+0.118
ries+0.116
 Gig+0.116
Neg logit contributions
 fastest-0.171
 whole-0.171
 certainly-0.164
tted-0.163
 quickest-0.160
Token said
Token ID531
Feature activation-6.30e-02

Pos logit contributions
 uncle+0.086
 Tim+0.083
 Kitty+0.080
 Sue+0.080
 Gig+0.079
Neg logit contributions
 whole-0.144
 fastest-0.140
 certainly-0.137
tted-0.135
hest-0.134
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
 uncle+0.110
 Tim+0.108
 Gig+0.104
 Uncle+0.103
 Sue+0.102
Neg logit contributions
 whole-0.102
 fastest-0.101
tted-0.100
 biggest-0.097
 quickest-0.095
Token "
Token ID366
Feature activation+5.01e-02

Pos logit contributions
 Tim+0.030
 uncle+0.030
 Sue+0.028
 Uncle+0.028
 Lee+0.028
Neg logit contributions
 fastest-0.036
 whole-0.036
tted-0.035
 biggest-0.034
 quickest-0.034
TokenBut
Token ID1537
Feature activation-1.32e-02

Pos logit contributions
 biggest+0.073
 certainly+0.071
 fastest+0.071
 quickest+0.068
 impossible+0.067
Neg logit contributions
 uncle-0.093
 Tim-0.091
 Lee-0.088
 Tom-0.088
 Sue-0.086
Token you
Token ID345
Feature activation+4.59e-02

Pos logit contributions
 Tim+0.022
 uncle+0.021
 Tom+0.020
 Sue+0.020
 Lee+0.020
Neg logit contributions
tted-0.025
 whole-0.024
hest-0.024
 fastest-0.024
 entire-0.023
Token worked
Token ID3111
Feature activation+2.28e-02

Pos logit contributions
hest+0.003
tted+0.003
 fastest+0.003
arest+0.002
 whole+0.002
Neg logit contributions
 Tim-0.003
 uncle-0.003
 Uncle-0.003
 Sue-0.003
 Tom-0.003
Token.
Token ID13
Feature activation-6.53e-04

Pos logit contributions
 fastest+0.051
tted+0.051
arest+0.050
 biggest+0.050
 quickest+0.050
Neg logit contributions
 Tim-0.028
 uncle-0.025
 Tommy-0.024
 Sue-0.024
 Tom-0.024
Token
Token ID220
Feature activation+1.17e-01

Pos logit contributions
 Tim+0.001
 uncle+0.001
ries+0.001
 Sue+0.001
ens+0.001
Neg logit contributions
 whole-0.001
 fastest-0.001
 certainly-0.001
 biggest-0.001
tted-0.001
Token
Token ID198
Feature activation+8.66e-02

Pos logit contributions
 fastest+0.209
 quickest+0.200
 whole+0.198
 winner+0.196
 entire+0.192
Neg logit contributions
 Tim-0.209
 uncle-0.197
 Sue-0.195
 Tom-0.193
 Kitty-0.187
Token
Token ID198
Feature activation+8.22e-02

Pos logit contributions
 fastest+0.125
 whole+0.123
tted+0.120
hest+0.116
 quickest+0.116
Neg logit contributions
 Tim-0.170
 uncle-0.169
 Sue-0.161
 Uncle-0.160
 Lee-0.160
TokenGrand
Token ID23581
Feature activation-9.66e-02

Pos logit contributions
 fastest+0.120
 whole+0.115
tted+0.113
 biggest+0.113
 quickest+0.112
Neg logit contributions
 uncle-0.162
 Tim-0.160
 Sue-0.153
 Uncle-0.152
 Tom-0.151
Tokenfather
Token ID11358
Feature activation-4.60e-02

Pos logit contributions
 Tim+0.078
 Gig+0.071
 Sue+0.071
 Kitty+0.067
 Sally+0.065
Neg logit contributions
 whole-0.248
 certainly-0.247
 fastest-0.242
 entire-0.241
 best-0.238
Token went
Token ID1816
Feature activation-4.03e-02

Pos logit contributions
 uncle+0.066
 Tim+0.066
 Gig+0.065
 Lee+0.064
ens+0.064
Neg logit contributions
 whole-0.085
 fastest-0.083
 certainly-0.083
tted-0.082
 winner-0.081
Token on
Token ID319
Feature activation+1.38e-03

Pos logit contributions
 Tim+0.073
lee+0.072
ries+0.071
ens+0.070
 uncle+0.070
Neg logit contributions
 whole-0.065
tted-0.060
 fastest-0.060
 surely-0.058
 entire-0.057
Token to
Token ID284
Feature activation-6.01e-02

Pos logit contributions
 whole+0.003
 most+0.002
 surely+0.002
 entire+0.002
 fastest+0.002
Neg logit contributions
lee-0.002
ries-0.002
ens-0.002
 uncle-0.002
 Tim-0.002
Token explain
Token ID4727
Feature activation-2.08e-02

Pos logit contributions
 uncle+0.091
lee+0.089
ries+0.089
 Tim+0.088
ens+0.087
Neg logit contributions
 whole-0.111
 fastest-0.105
 biggest-0.102
tted-0.102
 entire-0.102
Token a
Token ID257
Feature activation+3.39e-02

Pos logit contributions
 fastest+0.109
tted+0.107
 whole+0.106
hest+0.104
 quickest+0.102
Neg logit contributions
 Tim-0.137
 uncle-0.134
 Sue-0.130
 Uncle-0.127
 Tom-0.127
Token gigg
Token ID30442
Feature activation-1.25e-02

Pos logit contributions
 fastest+0.051
tted+0.050
hest+0.049
 quickest+0.048
 whole+0.048
Neg logit contributions
 uncle-0.065
 Tim-0.064
 Uncle-0.061
 Sue-0.061
 Lee-0.060
Tokenly
Token ID306
Feature activation-3.22e-02

Pos logit contributions
 Tim+0.018
 uncle+0.017
ens+0.016
ries+0.016
 Sue+0.016
Neg logit contributions
tted-0.024
 fastest-0.024
 whole-0.024
 quickest-0.023
 certainly-0.023
Token stuffed
Token ID22259
Feature activation-2.63e-02

Pos logit contributions
 Tim+0.049
 uncle+0.049
 Sue+0.045
 Uncle+0.045
 Lee+0.045
Neg logit contributions
 fastest-0.061
 whole-0.060
tted-0.059
 quickest-0.058
hest-0.058
Token bear
Token ID6842
Feature activation-2.59e-02

Pos logit contributions
ries+0.042
 uncle+0.041
 Uncle+0.041
 Tim+0.040
 Tommy+0.040
Neg logit contributions
 whole-0.048
 entire-0.045
ative-0.043
 most-0.041
 biggest-0.041
Token,
Token ID11
Feature activation-1.32e-01

Pos logit contributions
 uncle+0.043
 Tim+0.042
 Uncle+0.040
 Sue+0.040
ries+0.040
Neg logit contributions
 whole-0.044
 fastest-0.044
tted-0.042
 certainly-0.041
 quickest-0.041
Token a
Token ID257
Feature activation+3.41e-02

Pos logit contributions
 uncle+0.185
 Tim+0.180
ries+0.179
ens+0.178
 Lee+0.172
Neg logit contributions
 fastest-0.251
 whole-0.250
 certainly-0.247
 most-0.242
 biggest-0.241
Token soft
Token ID2705
Feature activation+2.11e-02

Pos logit contributions
 fastest+0.063
tted+0.062
hest+0.061
 quickest+0.060
 whole+0.060
Neg logit contributions
 uncle-0.053
 Tim-0.053
 Uncle-0.049
 Sue-0.049
 Lee-0.049
Token blanket
Token ID18447
Feature activation+4.21e-02

Pos logit contributions
 whole+0.044
 fastest+0.042
 entire+0.041
 biggest+0.041
 certainly+0.041
Neg logit contributions
 uncle-0.028
 Tim-0.028
ries-0.027
 Uncle-0.026
 Lee-0.026
Token and
Token ID290
Feature activation-1.14e-01

Pos logit contributions
 fastest+0.088
 whole+0.087
tted+0.086
 quickest+0.084
 biggest+0.084
Neg logit contributions
 Tim-0.056
 uncle-0.055
 Sue-0.051
 Uncle-0.050
 Lee-0.050
Token a
Token ID257
Feature activation+3.14e-02

Pos logit contributions
 Tim+0.208
 uncle+0.207
 Sue+0.196
 Uncle+0.195
 Lee+0.194
Neg logit contributions
 fastest-0.179
 whole-0.175
tted-0.171
 quickest-0.167
 biggest-0.167
Token better
Token ID1365
Feature activation-4.77e-02

Pos logit contributions
 fastest+0.042
 quickest+0.041
ifying+0.041
tted+0.040
vals+0.040
Neg logit contributions
 Tim-0.053
 Lily-0.051
 Sue-0.049
 Sally-0.049
 uncle-0.049
Token after
Token ID706
Feature activation-6.24e-03

Pos logit contributions
 uncle+0.046
 Tim+0.044
 Lee+0.039
 Uncle+0.039
 Tom+0.038
Neg logit contributions
 fastest-0.119
tted-0.119
hest-0.117
 biggest-0.116
 quickest-0.115
Token that
Token ID326
Feature activation-1.05e-02

Pos logit contributions
ens+0.008
 uncle+0.007
ior+0.007
 Gig+0.007
 Lee+0.007
Neg logit contributions
 whole-0.013
 certainly-0.012
 best-0.012
 truly-0.012
 most-0.012
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
 Tim+0.015
 uncle+0.015
 Sue+0.014
 Uncle+0.013
 Lee+0.013
Neg logit contributions
 fastest-0.021
 whole-0.021
tted-0.020
 quickest-0.020
 biggest-0.020
Token<|endoftext|>
Token ID50256
Feature activation+4.96e-02

Pos logit contributions
 Tim+0.016
 uncle+0.015
 Sue+0.014
 Uncle+0.014
 Tom+0.014
Neg logit contributions
 fastest-0.021
 whole-0.020
tted-0.020
 quickest-0.020
hest-0.020
TokenJohn
Token ID7554
Feature activation-7.50e-02

Pos logit contributions
 fastest+0.095
arest+0.091
 quickest+0.091
 biggest+0.089
 virtue+0.089
Neg logit contributions
 Tim-0.087
 Tom-0.081
 Sue-0.079
 Lily-0.077
 Tommy-0.077
Token and
Token ID290
Feature activation-1.25e-01

Pos logit contributions
 uncle+0.128
 Tim+0.124
 Sue+0.123
 Lee+0.123
ries+0.122
Neg logit contributions
 whole-0.123
 fastest-0.119
 will-0.118
 certainly-0.118
'll-0.117
Token his
Token ID465
Feature activation-2.87e-02

Pos logit contributions
ries+0.158
lee+0.140
aj+0.137
 handing+0.135
 Lee+0.134
Neg logit contributions
 whole-0.267
 most-0.255
 will-0.248
 biggest-0.248
 best-0.248
Token dad
Token ID9955
Feature activation-8.23e-02

Pos logit contributions
ries+0.024
 Tim+0.024
 Sue+0.023
 Gig+0.022
ens+0.022
Neg logit contributions
 whole-0.074
 biggest-0.072
 fastest-0.072
 best-0.070
 most-0.070
Token were
Token ID547
Feature activation+6.15e-02

Pos logit contributions
 uncle+0.141
 Sue+0.136
 Tim+0.136
 Lee+0.135
ries+0.134
Neg logit contributions
 whole-0.134
 fastest-0.131
 certainly-0.127
tted-0.126
 will-0.124
Token out
Token ID503
Feature activation+9.01e-03

Pos logit contributions
 fastest+0.145
 whole+0.142
tted+0.141
hest+0.139
 quickest+0.139
Neg logit contributions
 Tim-0.066
 uncle-0.065
 Sue-0.059
 Uncle-0.058
 Lee-0.058
Token of
Token ID286
Feature activation+3.03e-03

Pos logit contributions
 uncle+0.093
 Tim+0.091
 Uncle+0.088
 Lee+0.087
 Gig+0.086
Neg logit contributions
 whole-0.105
 fastest-0.104
tted-0.101
 biggest-0.099
 quickest-0.098
Token it
Token ID340
Feature activation+9.55e-02

Pos logit contributions
 whole+0.006
 fastest+0.006
 most+0.006
 biggest+0.006
 best+0.005
Neg logit contributions
ens-0.004
 uncle-0.004
lee-0.004
 Lee-0.004
 Tim-0.004
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
 fastest+0.186
 whole+0.182
tted+0.180
 quickest+0.176
hest+0.176
Neg logit contributions
 Tim-0.141
 uncle-0.139
 Sue-0.130
 Uncle-0.128
 Lee-0.127
Token She
Token ID1375
Feature activation+1.66e-02

Pos logit contributions
 uncle+0.022
ries+0.022
ens+0.021
pa+0.021
lee+0.020
Neg logit contributions
 certainly-0.017
tted-0.017
 fastest-0.017
 whole-0.017
 quickest-0.016
Token ran
Token ID4966
Feature activation-5.52e-02

Pos logit contributions
 certainly+0.027
 whole+0.027
 fastest+0.027
tted+0.026
 will+0.026
Neg logit contributions
 uncle-0.028
 Tim-0.028
 Lee-0.027
ries-0.027
 Gig-0.026
Token and
Token ID290
Feature activation-1.21e-01

Pos logit contributions
ries+0.111
 uncle+0.105
pa+0.103
lee+0.102
 Uncle+0.100
Neg logit contributions
 whole-0.086
tted-0.077
 fastest-0.076
 across-0.075
 surely-0.074
Token ran
Token ID4966
Feature activation-5.69e-02

Pos logit contributions
 uncle+0.199
pa+0.190
ries+0.186
 Uncle+0.183
child+0.179
Neg logit contributions
 whole-0.211
tted-0.207
 fastest-0.207
'll-0.203
 will-0.199
Token until
Token ID1566
Feature activation-4.18e-02

Pos logit contributions
 uncle+0.098
ries+0.096
 Uncle+0.092
 Tim+0.091
 Lee+0.090
Neg logit contributions
 whole-0.101
 fastest-0.097
tted-0.096
hest-0.091
 biggest-0.090
Token she
Token ID673
Feature activation+3.41e-03

Pos logit contributions
 uncle+0.047
ries+0.042
 Lee+0.041
 Tim+0.041
kids+0.040
Neg logit contributions
 whole-0.097
 surely-0.094
 fastest-0.094
tted-0.093
 most-0.090
Token found
Token ID1043
Feature activation+1.07e-02

Pos logit contributions
 fastest+0.006
 whole+0.006
tted+0.005
 certainly+0.005
 quickest+0.005
Neg logit contributions
 uncle-0.006
 Tim-0.006
 Uncle-0.006
 Lee-0.006
 Sue-0.006
Token a
Token ID257
Feature activation+3.67e-02

Pos logit contributions
 whole+0.020
 fastest+0.020
 biggest+0.020
tted+0.020
 certainly+0.019
Neg logit contributions
 uncle-0.016
 Tim-0.015
 Lee-0.015
 Uncle-0.014
ens-0.014