TOP ACTIVATIONS
MAX = 7.002

, which it was very proud of !
day his peaceful journey was disrupted by an ignorant animal .
ira and R avi were surprised and happy . They said
The mom my was very proud because she knew she was
day , an odd noise disrupted the peaceful ness of the
my 's mom was very strict and said he couldn 't
outside . It was a small yellow kitten !
the store . Dad was proud , because he wanted to
". His teacher was very proud of him .
, and the harbor was poorer without him . <|endoftext|> One
. The mom was very strict and wanted her children to
. Her mom was very proud of her for being so
myself !" His parents were surprised , their eyes widened .
Val erie 's mum was surprised when she saw Valerie holding
They also said they were proud of them for telling the
village people were all very proud of the bell , because
beautiful horn which she was proud of . Every morning ,
ily 's mom my was proud of her for being so
my 's mom my was surprised when she heard a loud
. The man was very proud of his trophy and showed

MIN ACTIVATIONS
MIN = -10.730

sharing is good . <|endoftext|> Once , there was a girl
happy and tired . <|endoftext|> Once upon a time , there
Mrs . Smith . <|endoftext|> Once upon a time , there
and clean again . <|endoftext|> Once upon a time , in
happy and safe . <|endoftext|> Once upon a time , there
and respect others . <|endoftext|> Once upon a time , there
happy and content . <|endoftext|> Once upon a time , there
and each other . <|endoftext|> Once upon a time , a
try new things . <|endoftext|> Once upon a time , there
with the kids . <|endoftext|> Once upon a time , there
happy and tired . <|endoftext|> Once upon a time , there
on their faces . <|endoftext|> Once upon a time there was
and safe again . <|endoftext|> Once upon a time , there
have good friends . <|endoftext|> Once upon a time , there
to their family . <|endoftext|> Once upon a time , there
, my boy ! <|endoftext|> Once upon a time , a
the sparkling stone . <|endoftext|> Once upon a time there was
they both smiled . <|endoftext|> Once upon a time , there
kind and helpful . <|endoftext|> Once upon a time , there
passed the bridge . <|endoftext|> Once upon a time , there

INTERVAL 2.569 - 7.002
CONTAINS 6.895%

tree . She was so happy and the tree was happy
It was very , very big . He wanted to show
walked , Marc ie noticed a hidden pole . She walked
she looked around , she noticed a big furry monster with
a time , there was a little girl named Sarah who

INTERVAL -1.864 - 2.569
CONTAINS 80.095%

The creature had left behind a feather
games with her and had lots of fun . They learned
and her friends had lots of fun playing with the mysterious
" Where did you go Nate ? I was looking for
You are not a good friend . You are selfish and

INTERVAL -6.297 - -1.864
CONTAINS 12.640%

mom my ran back to their house , but they got
out . One day , the group decided to
went to the shop with his mum . He wanted a
. She would go to her huge book case everyday ,
and say , I love you

INTERVAL -10.730 - -6.297
CONTAINS 0.370%

shared with everyone . <|endoftext|> Once upon a time there was
the giant ! â The mouse replied
enjoy the show ! <|endoftext|> Once upon a time there was
is cold outside !" <|endoftext|> Once upon a time , there
That â s how the three of
Token,
Token ID11
Feature activation+5.32e-01

Pos logit contributions
ail+0.024
at+0.023
affle+0.021
ip+0.021
 seeker+0.020
Neg logit contributions
 kinds-0.021
 than-0.020
 sorts-0.020
 sizes-0.019
 type-0.019
Token which
Token ID543
Feature activation-8.42e-01

Pos logit contributions
 kinds+0.010
 sorts+0.010
 than+0.010
 sizes+0.009
lier+0.009
Neg logit contributions
at-0.010
 seeker-0.010
ail-0.010
 skating-0.010
affle-0.009
Token it
Token ID340
Feature activation+7.19e-02

Pos logit contributions
at+0.014
ail+0.014
 seeker+0.013
affle+0.012
 skating+0.012
Neg logit contributions
 than-0.019
 unison-0.018
 kinds-0.018
ever-0.018
lier-0.018
Token was
Token ID373
Feature activation+9.54e-01

Pos logit contributions
 than+0.002
 kinds+0.002
 sizes+0.001
 sorts+0.001
 type+0.001
Neg logit contributions
at-0.001
ail-0.001
affle-0.001
 seeker-0.001
ogo-0.001
Token very
Token ID845
Feature activation+3.37e+00

Pos logit contributions
 than+0.021
 kinds+0.021
 sizes+0.020
 sorts+0.020
 type+0.019
Neg logit contributions
at-0.019
ail-0.018
affle-0.017
 seeker-0.017
ogo-0.016
Token proud
Token ID6613
Feature activation+7.00e+00

Pos logit contributions
 than+0.067
 kinds+0.067
 sizes+0.064
 sorts+0.061
 type+0.059
Neg logit contributions
at-0.073
ail-0.072
 seeker-0.066
affle-0.066
ogo-0.064
Token of
Token ID286
Feature activation+2.76e+00

Pos logit contributions
 unison+0.154
 target+0.147
Mine+0.143
 sizes+0.139
 advertise+0.136
Neg logit contributions
 on-0.113
ail-0.110
 skating-0.106
 morning-0.105
 surfing-0.105
Token!
Token ID0
Feature activation+2.35e-01

Pos logit contributions
 kinds+0.044
 than+0.043
 sizes+0.042
 sorts+0.040
 target+0.037
Neg logit contributions
at-0.068
ail-0.068
affle-0.063
ogo-0.062
 seeker-0.061
Token
Token ID220
Feature activation-3.72e+00

Pos logit contributions
 than+0.005
 kinds+0.005
 sizes+0.005
 sorts+0.005
 type+0.005
Neg logit contributions
ail-0.004
at-0.004
affle-0.004
 seeker-0.004
ip-0.004
Token
Token ID198
Feature activation+6.01e-01

Pos logit contributions
ail+0.069
at+0.068
affle+0.060
ogo+0.060
 seeker+0.059
Neg logit contributions
 kinds-0.084
 than-0.084
 sizes-0.082
 sorts-0.078
 type-0.076
Token
Token ID198
Feature activation+1.70e+00

Pos logit contributions
 than+0.014
 kinds+0.014
 sizes+0.013
 sorts+0.013
 type+0.012
Neg logit contributions
ail-0.011
at-0.011
affle-0.010
 seeker-0.009
ogo-0.009
Token day
Token ID1110
Feature activation-2.53e+00

Pos logit contributions
 than+0.036
 kinds+0.036
 sizes+0.035
 sorts+0.034
 type+0.033
Neg logit contributions
at-0.016
ail-0.016
affle-0.014
 seeker-0.013
ogo-0.012
Token his
Token ID465
Feature activation-1.57e+00

Pos logit contributions
at+0.065
ail+0.065
ogo+0.061
affle+0.059
add+0.056
Neg logit contributions
 than-0.046
 kinds-0.042
 type-0.040
 sizes-0.037
 shapes-0.036
Token peaceful
Token ID12309
Feature activation+2.47e+00

Pos logit contributions
at+0.036
ail+0.033
abella+0.031
affle+0.031
ogo+0.031
Neg logit contributions
 than-0.030
 kinds-0.030
 sizes-0.029
 type-0.027
 sorts-0.027
Token journey
Token ID7002
Feature activation-1.26e+00

Pos logit contributions
 than+0.053
 kinds+0.052
 sizes+0.050
 sorts+0.047
 type+0.047
Neg logit contributions
at-0.052
ail-0.050
affle-0.046
ogo-0.046
 seeker-0.045
Token was
Token ID373
Feature activation+1.70e-02

Pos logit contributions
at+0.027
ail+0.026
affle+0.026
 seeker+0.024
ogo+0.024
Neg logit contributions
 than-0.026
 kinds-0.025
 sizes-0.024
 sorts-0.023
 type-0.022
Token disrupted
Token ID30067
Feature activation+6.95e+00

Pos logit contributions
 than+0.000
 kinds+0.000
 sizes+0.000
 sorts+0.000
 type+0.000
Neg logit contributions
at-0.000
ail-0.000
affle-0.000
atmeal-0.000
ogo-0.000
Token by
Token ID416
Feature activation+4.32e-01

Pos logit contributions
 sizes+0.155
 unison+0.144
 kinds+0.143
 textures+0.135
lets+0.135
Neg logit contributions
at-0.127
 skating-0.118
ail-0.117
 seeker-0.114
 onwards-0.109
Token an
Token ID281
Feature activation+2.67e+00

Pos logit contributions
 than+0.008
 kinds+0.008
 sizes+0.008
 sorts+0.008
 type+0.007
Neg logit contributions
at-0.010
ail-0.010
affle-0.009
 seeker-0.009
ogo-0.009
Token ignorant
Token ID20109
Feature activation+4.71e+00

Pos logit contributions
 kinds+0.053
 than+0.053
 sizes+0.051
 sorts+0.048
 type+0.046
Neg logit contributions
at-0.057
ail-0.056
affle-0.052
 seeker-0.052
ogo-0.050
Token animal
Token ID5044
Feature activation+2.19e+00

Pos logit contributions
 kinds+0.087
 than+0.087
 sizes+0.083
 sorts+0.079
 type+0.075
Neg logit contributions
at-0.107
ail-0.106
affle-0.099
 seeker-0.098
ogo-0.095
Token.
Token ID13
Feature activation+3.12e-01

Pos logit contributions
 kinds+0.037
 than+0.036
 sizes+0.036
 sorts+0.034
 type+0.032
Neg logit contributions
at-0.053
ail-0.052
 seeker-0.048
affle-0.048
 skating-0.047
Tokenira
Token ID8704
Feature activation+9.26e-01

Pos logit contributions
at+0.053
ail+0.053
 seeker+0.049
affle+0.047
abella+0.047
Neg logit contributions
 kinds-0.050
 than-0.050
 sizes-0.049
 sorts-0.046
 type-0.045
Token and
Token ID290
Feature activation+6.07e-01

Pos logit contributions
 kinds+0.018
 than+0.018
 sizes+0.018
 sorts+0.017
 type+0.016
Neg logit contributions
at-0.020
ail-0.019
affle-0.018
 seeker-0.018
ogo-0.017
Token R
Token ID371
Feature activation-1.15e+00

Pos logit contributions
 kinds+0.012
 than+0.012
 sizes+0.012
 sorts+0.011
 type+0.011
Neg logit contributions
at-0.013
ail-0.013
affle-0.012
 seeker-0.012
ogo-0.011
Tokenavi
Token ID15820
Feature activation+6.77e-01

Pos logit contributions
ail+0.026
at+0.026
 seeker+0.026
affle+0.025
 skating+0.024
Neg logit contributions
 than-0.020
 kinds-0.020
 sizes-0.019
 sorts-0.018
 unison-0.017
Token were
Token ID547
Feature activation-4.16e-01

Pos logit contributions
 than+0.012
 kinds+0.012
 sizes+0.011
 sorts+0.010
 type+0.010
Neg logit contributions
ail-0.016
at-0.016
 seeker-0.015
affle-0.015
ogo-0.015
Token surprised
Token ID6655
Feature activation+6.90e+00

Pos logit contributions
at+0.008
ail+0.007
affle+0.007
 seeker+0.007
ogo+0.006
Neg logit contributions
 than-0.010
 kinds-0.010
 sizes-0.009
 sorts-0.009
 type-0.009
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 kinds+0.126
 sizes+0.125
 sorts+0.121
 target+0.120
 type+0.118
Neg logit contributions
ail-0.146
at-0.144
ogo-0.131
 skating-0.130
affle-0.129
Token happy
Token ID3772
Feature activation+4.16e+00

Pos logit contributions
 than+0.021
 kinds+0.021
 sizes+0.020
 sorts+0.019
 type+0.019
Neg logit contributions
at-0.024
ail-0.024
affle-0.022
 seeker-0.022
ogo-0.021
Token.
Token ID13
Feature activation-3.30e-02

Pos logit contributions
 kinds+0.072
 sizes+0.068
 than+0.068
 sorts+0.067
 type+0.063
Neg logit contributions
ail-0.096
at-0.095
affle-0.087
 seeker-0.087
 skating-0.085
Token They
Token ID1119
Feature activation+3.36e-01

Pos logit contributions
at+0.001
ail+0.001
affle+0.001
 seeker+0.001
ogo+0.001
Neg logit contributions
 kinds-0.001
 than-0.001
 sizes-0.001
 sorts-0.000
 type-0.000
Token said
Token ID531
Feature activation-1.86e+00

Pos logit contributions
 than+0.007
 kinds+0.007
 sizes+0.007
 sorts+0.006
 type+0.006
Neg logit contributions
at-0.007
ail-0.007
 seeker-0.006
abella-0.006
affle-0.006
Token The
Token ID383
Feature activation+1.71e+00

Pos logit contributions
 kinds+0.010
 than+0.010
 sizes+0.010
 sorts+0.009
 type+0.009
Neg logit contributions
at-0.010
ail-0.010
affle-0.009
 seeker-0.009
ogo-0.009
Token mom
Token ID1995
Feature activation+2.77e+00

Pos logit contributions
 than+0.032
 kinds+0.032
 type+0.030
 sizes+0.029
 textures+0.028
Neg logit contributions
at-0.040
ail-0.039
ogo-0.036
abella-0.034
affle-0.033
Tokenmy
Token ID1820
Feature activation+2.18e+00

Pos logit contributions
 kinds+0.066
 than+0.064
 sizes+0.062
 sorts+0.061
 type+0.058
Neg logit contributions
at-0.048
ail-0.048
 seeker-0.046
affle-0.044
ogo-0.043
Token was
Token ID373
Feature activation+3.03e-01

Pos logit contributions
 kinds+0.044
 than+0.044
 sizes+0.042
 sorts+0.040
 type+0.039
Neg logit contributions
at-0.046
ail-0.045
affle-0.042
 seeker-0.042
ogo-0.040
Token very
Token ID845
Feature activation+2.65e+00

Pos logit contributions
 than+0.007
 kinds+0.007
 type+0.006
 sizes+0.006
 sorts+0.006
Neg logit contributions
at-0.006
ail-0.006
affle-0.006
ogo-0.005
 seeker-0.005
Token proud
Token ID6613
Feature activation+6.86e+00

Pos logit contributions
 than+0.050
 kinds+0.049
 type+0.045
 sorts+0.044
 sizes+0.044
Neg logit contributions
at-0.062
ail-0.060
ogo-0.056
 seeker-0.056
affle-0.056
Token because
Token ID780
Feature activation+1.23e+00

Pos logit contributions
 unison+0.139
 target+0.137
 sizes+0.135
Mine+0.128
 kinds+0.125
Neg logit contributions
ail-0.124
 skating-0.119
ip-0.119
 on-0.117
 tired-0.116
Token she
Token ID673
Feature activation+1.86e-01

Pos logit contributions
 than+0.025
 kinds+0.025
 sizes+0.024
 sorts+0.023
 type+0.022
Neg logit contributions
at-0.025
ail-0.025
 seeker-0.023
affle-0.023
ogo-0.022
Token knew
Token ID2993
Feature activation+2.25e+00

Pos logit contributions
 than+0.003
 kinds+0.003
 sizes+0.003
 sorts+0.003
 type+0.003
Neg logit contributions
at-0.004
ail-0.004
 seeker-0.004
affle-0.004
ogo-0.004
Token she
Token ID673
Feature activation+6.20e-01

Pos logit contributions
 than+0.042
 kinds+0.042
 sizes+0.040
 sorts+0.038
 type+0.037
Neg logit contributions
at-0.052
ail-0.050
affle-0.047
 seeker-0.047
ogo-0.045
Token was
Token ID373
Feature activation-1.21e+00

Pos logit contributions
 kinds+0.013
 than+0.013
 sizes+0.013
 sorts+0.012
 type+0.012
Neg logit contributions
at-0.013
ail-0.012
affle-0.011
 seeker-0.011
ogo-0.011
Token day
Token ID1110
Feature activation-2.72e+00

Pos logit contributions
 kinds+0.024
 than+0.023
 sorts+0.022
 sizes+0.022
 type+0.022
Neg logit contributions
ail-0.014
at-0.014
ogo-0.013
affle-0.012
 seeker-0.012
Token,
Token ID11
Feature activation-1.34e+00

Pos logit contributions
at+0.062
ail+0.058
affle+0.056
ogo+0.056
atmeal+0.053
Neg logit contributions
 than-0.055
 kinds-0.053
 sizes-0.048
 sorts-0.048
 type-0.048
Token an
Token ID281
Feature activation+3.70e+00

Pos logit contributions
ail+0.027
at+0.027
 seeker+0.025
affle+0.024
 skating+0.024
Neg logit contributions
 than-0.027
 kinds-0.027
 sizes-0.026
 sorts-0.025
 type-0.024
Token odd
Token ID5629
Feature activation+3.11e+00

Pos logit contributions
 kinds+0.068
 than+0.068
 sizes+0.064
 sorts+0.062
 type+0.059
Neg logit contributions
at-0.085
ail-0.085
affle-0.078
 seeker-0.078
ogo-0.076
Token noise
Token ID7838
Feature activation+2.94e+00

Pos logit contributions
 than+0.059
 kinds+0.058
 type+0.054
 sorts+0.053
 sizes+0.052
Neg logit contributions
at-0.071
ail-0.071
ogo-0.068
ip-0.065
affle-0.064
Token disrupted
Token ID30067
Feature activation+6.85e+00

Pos logit contributions
 kinds+0.068
 than+0.067
 sizes+0.067
 sorts+0.063
 type+0.061
Neg logit contributions
at-0.053
ail-0.051
affle-0.047
 seeker-0.045
 skating-0.044
Token the
Token ID262
Feature activation+1.60e+00

Pos logit contributions
 sizes+0.158
 kinds+0.150
 than+0.146
 target+0.142
 sorts+0.140
Neg logit contributions
at-0.117
ail-0.113
 seeker-0.106
 skating-0.102
abella-0.101
Token peaceful
Token ID12309
Feature activation+2.90e+00

Pos logit contributions
 kinds+0.031
 than+0.030
 sizes+0.029
 sorts+0.028
 type+0.027
Neg logit contributions
ail-0.036
at-0.036
affle-0.033
ogo-0.033
 seeker-0.033
Tokenness
Token ID1108
Feature activation+9.19e-01

Pos logit contributions
 kinds+0.064
 sizes+0.063
 than+0.063
 sorts+0.061
lier+0.059
Neg logit contributions
at-0.050
ail-0.049
 seeker-0.045
affle-0.044
 skating-0.042
Token of
Token ID286
Feature activation+3.61e-01

Pos logit contributions
 kinds+0.015
 sizes+0.015
 unison+0.015
 than+0.014
 type+0.014
Neg logit contributions
at-0.021
ail-0.021
 skating-0.019
 onwards-0.019
 onward-0.018
Token the
Token ID262
Feature activation-6.19e-02

Pos logit contributions
 than+0.007
 kinds+0.007
 sizes+0.007
 sorts+0.007
lier+0.007
Neg logit contributions
at-0.007
ail-0.007
affle-0.007
 seeker-0.007
ogo-0.006
Tokenmy
Token ID1820
Feature activation+1.56e+00

Pos logit contributions
 kinds+0.019
 than+0.019
 sizes+0.018
 sorts+0.018
 type+0.017
Neg logit contributions
at-0.017
ail-0.017
affle-0.016
 seeker-0.016
bled-0.015
Token's
Token ID338
Feature activation-1.12e+00

Pos logit contributions
 than+0.029
 kinds+0.028
 sizes+0.026
 type+0.025
 sorts+0.025
Neg logit contributions
at-0.036
ail-0.034
affle-0.033
 seeker-0.033
ogo-0.032
Token mom
Token ID1995
Feature activation+3.23e+00

Pos logit contributions
at+0.025
ogo+0.021
ail+0.021
affle+0.021
abella+0.021
Neg logit contributions
 than-0.023
 kinds-0.022
 type-0.022
 shapes-0.021
 sizes-0.021
Token was
Token ID373
Feature activation+4.52e-01

Pos logit contributions
 kinds+0.059
 than+0.058
 sizes+0.055
 sorts+0.053
 type+0.051
Neg logit contributions
at-0.074
ail-0.073
affle-0.069
 seeker-0.069
ogo-0.067
Token very
Token ID845
Feature activation+2.88e+00

Pos logit contributions
 than+0.009
 kinds+0.008
 type+0.008
 who+0.008
 sorts+0.008
Neg logit contributions
at-0.010
ail-0.010
ogo-0.010
affle-0.009
ipop-0.009
Token strict
Token ID7646
Feature activation+6.84e+00

Pos logit contributions
 than+0.058
 kinds+0.057
 type+0.054
 sorts+0.052
 sizes+0.051
Neg logit contributions
at-0.063
ail-0.059
ogo-0.057
 seeker-0.056
affle-0.055
Token and
Token ID290
Feature activation+1.01e+00

Pos logit contributions
 sizes+0.129
 kinds+0.129
 unison+0.126
 sorts+0.122
ever+0.121
Neg logit contributions
at-0.140
ail-0.137
 skating-0.130
affle-0.128
 seeker-0.126
Token said
Token ID531
Feature activation-2.21e-01

Pos logit contributions
 than+0.020
 kinds+0.020
 sizes+0.019
 sorts+0.019
 type+0.018
Neg logit contributions
at-0.021
ail-0.021
affle-0.019
 seeker-0.019
abella-0.019
Token he
Token ID339
Feature activation+7.53e-01

Pos logit contributions
at+0.005
ail+0.005
 seeker+0.005
ogo+0.005
affle+0.005
Neg logit contributions
 than-0.004
 kinds-0.004
 sizes-0.004
 type-0.003
 shapes-0.003
Token couldn
Token ID3521
Feature activation-1.32e+00

Pos logit contributions
 kinds+0.013
 than+0.012
 sizes+0.012
 type+0.012
 sorts+0.011
Neg logit contributions
at-0.018
ail-0.017
affle-0.017
 seeker-0.017
ogo-0.017
Token't
Token ID470
Feature activation+1.29e+00

Pos logit contributions
at+0.022
ail+0.021
 seeker+0.021
affle+0.020
ogo+0.019
Neg logit contributions
 kinds-0.030
 than-0.029
 sorts-0.028
 sizes-0.028
 type-0.028
Token outside
Token ID2354
Feature activation+6.67e-01

Pos logit contributions
 kinds+0.064
 sizes+0.062
 than+0.061
 sorts+0.059
 type+0.056
Neg logit contributions
at-0.054
ail-0.053
 seeker-0.048
affle-0.048
 skating-0.047
Token.
Token ID13
Feature activation+6.31e-01

Pos logit contributions
 kinds+0.012
 sizes+0.012
 than+0.012
 sorts+0.011
 type+0.011
Neg logit contributions
at-0.015
ail-0.014
 seeker-0.013
affle-0.013
 skating-0.013
Token It
Token ID632
Feature activation+1.02e+00

Pos logit contributions
 sizes+0.014
 than+0.014
 kinds+0.013
lier+0.012
 target+0.012
Neg logit contributions
ail-0.011
at-0.011
 onwards-0.010
abella-0.010
 onward-0.010
Token was
Token ID373
Feature activation+1.35e+00

Pos logit contributions
 kinds+0.023
 than+0.023
 sizes+0.022
 sorts+0.021
 type+0.020
Neg logit contributions
at-0.019
ail-0.019
affle-0.017
 seeker-0.017
ogo-0.017
Token a
Token ID257
Feature activation+4.61e+00

Pos logit contributions
 kinds+0.029
 than+0.029
 sizes+0.028
 sorts+0.027
 type+0.026
Neg logit contributions
at-0.026
ail-0.025
 seeker-0.024
affle-0.023
ogo-0.022
Token small
Token ID1402
Feature activation+6.79e+00

Pos logit contributions
 than+0.086
 kinds+0.085
 sizes+0.081
 sorts+0.078
 type+0.077
Neg logit contributions
at-0.106
ail-0.103
affle-0.095
ogo-0.093
 seeker-0.093
Token yellow
Token ID7872
Feature activation+4.60e+00

Pos logit contributions
 kinds+0.137
 sizes+0.134
 than+0.134
 sorts+0.126
lier+0.118
Neg logit contributions
ail-0.134
 seeker-0.132
at-0.131
affle-0.125
 skating-0.119
Token kitten
Token ID42143
Feature activation+3.39e+00

Pos logit contributions
 kinds+0.086
 than+0.085
 sizes+0.083
 sorts+0.079
 type+0.074
Neg logit contributions
at-0.104
ail-0.102
affle-0.096
 seeker-0.095
ogo-0.091
Token!
Token ID0
Feature activation+1.05e+00

Pos logit contributions
 kinds+0.069
 sizes+0.066
 than+0.063
 sorts+0.063
 textures+0.059
Neg logit contributions
ail-0.071
at-0.070
 seeker-0.065
affle-0.063
ip-0.063
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
 sizes+0.022
 than+0.022
 kinds+0.022
 type+0.020
 target+0.020
Neg logit contributions
at-0.019
ail-0.019
abella-0.018
 onwards-0.018
 seeker-0.017
Token
Token ID198
Feature activation+2.08e+00

Pos logit contributions
 sizes+0.027
 kinds+0.026
 than+0.025
 textures+0.024
 sorts+0.024
Neg logit contributions
ail-0.016
at-0.016
abella-0.015
ogo-0.014
 onwards-0.014
Token the
Token ID262
Feature activation+7.16e-01

Pos logit contributions
 than+0.023
 sizes+0.023
 kinds+0.022
 sorts+0.021
lier+0.020
Neg logit contributions
at-0.020
ail-0.020
 seeker-0.018
affle-0.018
 skating-0.017
Token store
Token ID3650
Feature activation-6.68e-01

Pos logit contributions
 kinds+0.015
 than+0.015
 sorts+0.014
 sizes+0.014
 type+0.014
Neg logit contributions
at-0.015
ail-0.014
affle-0.013
ogo-0.013
 seeker-0.013
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
ail+0.013
at+0.012
 skating+0.012
 seeker+0.011
ip+0.011
Neg logit contributions
 sizes-0.014
 unison-0.013
 kinds-0.013
 matter-0.013
 target-0.013
Token Dad
Token ID17415
Feature activation+2.50e+00

Pos logit contributions
 than+0.019
 kinds+0.018
 sizes+0.017
 sorts+0.016
 type+0.015
Neg logit contributions
at-0.030
ail-0.030
 seeker-0.029
affle-0.028
ip-0.027
Token was
Token ID373
Feature activation-3.31e-02

Pos logit contributions
 than+0.055
 sizes+0.054
 kinds+0.054
 unison+0.051
 sorts+0.051
Neg logit contributions
ail-0.047
at-0.047
affle-0.041
ip-0.041
 skating-0.040
Token proud
Token ID6613
Feature activation+6.79e+00

Pos logit contributions
at+0.001
ail+0.001
ogo+0.001
affle+0.001
ip+0.001
Neg logit contributions
 than-0.001
 kinds-0.001
 sizes-0.001
 sorts-0.001
 type-0.001
Token,
Token ID11
Feature activation+1.81e+00

Pos logit contributions
 target+0.138
 unison+0.136
ymes+0.131
 sizes+0.129
ever+0.124
Neg logit contributions
 skating-0.127
 surfing-0.119
ail-0.119
 tired-0.115
 camping-0.114
Token because
Token ID780
Feature activation+1.58e+00

Pos logit contributions
 kinds+0.036
 sizes+0.035
 unison+0.035
 sorts+0.034
 than+0.034
Neg logit contributions
 skating-0.034
at-0.033
 seeker-0.032
ail-0.032
affle-0.031
Token he
Token ID339
Feature activation+5.16e-01

Pos logit contributions
 than+0.037
 kinds+0.036
 sizes+0.036
 sorts+0.034
 type+0.033
Neg logit contributions
at-0.027
ail-0.027
 seeker-0.025
affle-0.024
 skating-0.023
Token wanted
Token ID2227
Feature activation+1.87e+00

Pos logit contributions
 kinds+0.010
 than+0.010
 sizes+0.010
 sorts+0.009
 type+0.009
Neg logit contributions
at-0.011
ail-0.011
affle-0.010
 seeker-0.010
ogo-0.010
Token to
Token ID284
Feature activation+7.99e-01

Pos logit contributions
 kinds+0.036
 than+0.035
 sizes+0.034
 sorts+0.032
 type+0.031
Neg logit contributions
at-0.043
ail-0.042
affle-0.039
 seeker-0.039
ogo-0.038
Token".
Token ID1911
Feature activation+1.08e+00

Pos logit contributions
at+0.007
ail+0.007
affle+0.006
 seeker+0.006
ogo+0.006
Neg logit contributions
 kinds-0.005
 than-0.005
 sizes-0.005
 sorts-0.004
 type-0.004
Token His
Token ID2399
Feature activation-1.71e+00

Pos logit contributions
 than+0.024
 kinds+0.023
 sizes+0.023
 type+0.021
 sorts+0.021
Neg logit contributions
ail-0.021
at-0.021
affle-0.018
 seeker-0.018
ogo-0.018
Token teacher
Token ID4701
Feature activation+3.15e+00

Pos logit contributions
at+0.043
ail+0.038
atmeal+0.038
affle+0.037
pee+0.037
Neg logit contributions
 kinds-0.030
 than-0.030
 sizes-0.028
 type-0.028
 sorts-0.027
Token was
Token ID373
Feature activation+2.05e-01

Pos logit contributions
 than+0.064
 kinds+0.064
 sizes+0.062
 sorts+0.059
 type+0.056
Neg logit contributions
at-0.065
ail-0.065
affle-0.058
 seeker-0.057
abella-0.056
Token very
Token ID845
Feature activation+3.09e+00

Pos logit contributions
 than+0.005
 kinds+0.005
 sizes+0.005
 sorts+0.004
 type+0.004
Neg logit contributions
at-0.004
ail-0.004
affle-0.003
 seeker-0.003
ogo-0.003
Token proud
Token ID6613
Feature activation+6.79e+00

Pos logit contributions
 kinds+0.059
 than+0.059
 sizes+0.057
 sorts+0.054
 type+0.051
Neg logit contributions
at-0.068
ail-0.067
affle-0.062
 seeker-0.062
ogo-0.060
Token of
Token ID286
Feature activation+2.69e+00

Pos logit contributions
 unison+0.133
 sizes+0.131
 target+0.129
Mine+0.127
 kinds+0.123
Neg logit contributions
ail-0.122
 skating-0.114
 on-0.113
 surfing-0.109
 onwards-0.109
Token him
Token ID683
Feature activation-4.11e-01

Pos logit contributions
 kinds+0.057
 than+0.057
 sizes+0.055
 sorts+0.053
 type+0.051
Neg logit contributions
at-0.054
ail-0.053
affle-0.049
 seeker-0.048
ogo-0.047
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
at+0.010
ail+0.010
affle+0.009
 seeker+0.009
ogo+0.009
Neg logit contributions
 kinds-0.007
 than-0.007
 sizes-0.007
 sorts-0.006
 type-0.006
Token
Token ID198
Feature activation+5.85e-01

Pos logit contributions
 than+0.011
 kinds+0.011
 sizes+0.011
 type+0.010
 sorts+0.010
Neg logit contributions
ail-0.009
at-0.009
affle-0.008
 seeker-0.008
ogo-0.008
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
 kinds+0.011
 than+0.011
 sizes+0.010
 sorts+0.010
 type+0.009
Neg logit contributions
at-0.013
ail-0.013
 seeker-0.012
affle-0.012
atmeal-0.012
Token,
Token ID11
Feature activation-3.23e-01

Pos logit contributions
at+0.006
ail+0.005
affle+0.005
 seeker+0.005
ogo+0.005
Neg logit contributions
 than-0.004
 kinds-0.004
 sizes-0.004
 sorts-0.004
 type-0.004
Token and
Token ID290
Feature activation+7.80e-02

Pos logit contributions
at+0.007
ail+0.007
affle+0.006
 seeker+0.006
ogo+0.006
Neg logit contributions
 kinds-0.006
 than-0.006
 sizes-0.006
 sorts-0.006
 type-0.006
Token the
Token ID262
Feature activation+1.47e-01

Pos logit contributions
 than+0.002
 kinds+0.002
 sizes+0.002
 sorts+0.002
 type+0.002
Neg logit contributions
at-0.001
ail-0.001
affle-0.001
 seeker-0.001
ogo-0.001
Token harbor
Token ID25451
Feature activation-2.09e-01

Pos logit contributions
 kinds+0.003
 than+0.003
 sizes+0.003
 type+0.003
 sorts+0.003
Neg logit contributions
at-0.003
ail-0.003
affle-0.003
ogo-0.003
 seeker-0.003
Token was
Token ID373
Feature activation+7.34e-02

Pos logit contributions
at+0.004
ail+0.004
affle+0.004
 seeker+0.004
ogo+0.003
Neg logit contributions
 kinds-0.005
 than-0.005
 sizes-0.004
 sorts-0.004
 type-0.004
Token poorer
Token ID26647
Feature activation+6.78e+00

Pos logit contributions
 than+0.002
 kinds+0.002
 sizes+0.001
 sorts+0.001
 type+0.001
Neg logit contributions
at-0.002
ail-0.002
affle-0.001
 seeker-0.001
ogo-0.001
Token without
Token ID1231
Feature activation+7.29e-01

Pos logit contributions
 unison+0.079
 advertise+0.075
 matter+0.075
Mine+0.073
 sorts+0.072
Neg logit contributions
 seeker-0.180
ail-0.175
 back-0.171
 skating-0.170
 home-0.169
Token him
Token ID683
Feature activation-1.73e+00

Pos logit contributions
 kinds+0.016
 than+0.016
 sizes+0.015
 sorts+0.014
 type+0.014
Neg logit contributions
at-0.015
ail-0.014
affle-0.013
 seeker-0.013
ogo-0.013
Token.
Token ID13
Feature activation-7.12e-03

Pos logit contributions
at+0.043
ail+0.043
affle+0.040
 seeker+0.039
ogo+0.038
Neg logit contributions
 kinds-0.028
 than-0.027
 sizes-0.027
 sorts-0.026
 type-0.024
Token<|endoftext|>
Token ID50256
Feature activation-1.77e+00

Pos logit contributions
at+0.000
ail+0.000
affle+0.000
 seeker+0.000
ogo+0.000
Neg logit contributions
 than-0.000
 kinds-0.000
 sizes-0.000
 sorts-0.000
 type-0.000
TokenOne
Token ID3198
Feature activation-6.71e-01

Pos logit contributions
at+0.034
ail+0.033
affle+0.032
atmeal+0.030
unct+0.030
Neg logit contributions
 kinds-0.039
 than-0.039
 sorts-0.037
 sizes-0.035
lier-0.034
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
 than+0.045
 kinds+0.045
 sizes+0.044
 sorts+0.041
 type+0.040
Neg logit contributions
at-0.049
ail-0.048
 seeker-0.044
affle-0.044
ip-0.042
Token The
Token ID383
Feature activation+1.99e+00

Pos logit contributions
 than+0.017
 kinds+0.017
ever+0.016
izing+0.015
 sizes+0.015
Neg logit contributions
 seeker-0.022
at-0.021
 morning-0.020
ail-0.020
abella-0.019
Token mom
Token ID1995
Feature activation+3.43e+00

Pos logit contributions
 kinds+0.043
 than+0.041
 sizes+0.041
 type+0.040
 sorts+0.040
Neg logit contributions
at-0.041
ail-0.038
ogo-0.036
affle-0.036
ip-0.034
Token was
Token ID373
Feature activation+4.61e-01

Pos logit contributions
 than+0.080
 sizes+0.078
 kinds+0.078
 unison+0.077
 target+0.075
Neg logit contributions
at-0.061
ail-0.059
affle-0.051
ip-0.050
atmeal-0.048
Token very
Token ID845
Feature activation+3.18e+00

Pos logit contributions
 than+0.011
 kinds+0.011
 sizes+0.010
 sorts+0.010
 type+0.010
Neg logit contributions
at-0.009
ail-0.009
affle-0.008
ogo-0.008
 seeker-0.007
Token strict
Token ID7646
Feature activation+6.77e+00

Pos logit contributions
 than+0.062
 kinds+0.062
 sizes+0.058
 sorts+0.057
 type+0.057
Neg logit contributions
at-0.070
ail-0.068
affle-0.063
ogo-0.063
 seeker-0.063
Token and
Token ID290
Feature activation+1.96e+00

Pos logit contributions
 kinds+0.130
 sizes+0.130
 unison+0.125
 target+0.124
 sorts+0.124
Neg logit contributions
at-0.131
ail-0.129
 seeker-0.123
 skating-0.122
affle-0.119
Token wanted
Token ID2227
Feature activation+2.12e+00

Pos logit contributions
 than+0.041
 kinds+0.040
 sizes+0.039
 target+0.038
 unison+0.037
Neg logit contributions
ail-0.037
at-0.037
 seeker-0.034
affle-0.034
abella-0.033
Token her
Token ID607
Feature activation+1.61e+00

Pos logit contributions
 kinds+0.040
 than+0.040
 sizes+0.039
 sorts+0.037
 type+0.036
Neg logit contributions
at-0.048
ail-0.047
affle-0.044
 seeker-0.043
ogo-0.042
Token children
Token ID1751
Feature activation+1.61e+00

Pos logit contributions
 than+0.034
 kinds+0.034
 sizes+0.033
 sorts+0.032
 type+0.030
Neg logit contributions
at-0.032
ail-0.032
 seeker-0.029
affle-0.029
ip-0.027
Token to
Token ID284
Feature activation+9.03e-01

Pos logit contributions
 kinds+0.032
 than+0.032
 sizes+0.031
 sorts+0.030
 type+0.028
Neg logit contributions
at-0.034
ail-0.034
affle-0.031
 seeker-0.031
ogo-0.030
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
ail+0.015
at+0.015
 back+0.014
ip+0.014
 seeker+0.014
Neg logit contributions
 kinds-0.020
 sizes-0.019
 sorts-0.019
ever-0.019
 unison-0.018
Token Her
Token ID2332
Feature activation-1.88e+00

Pos logit contributions
at+0.003
ail+0.003
affle+0.003
 seeker+0.003
ogo+0.003
Neg logit contributions
 kinds-0.002
 than-0.002
 sizes-0.002
 sorts-0.002
 type-0.002
Token mom
Token ID1995
Feature activation+2.68e+00

Pos logit contributions
at+0.039
affle+0.033
ail+0.033
ogo+0.032
abella+0.032
Neg logit contributions
 than-0.042
 kinds-0.041
 sizes-0.039
 type-0.039
 shapes-0.038
Token was
Token ID373
Feature activation+4.32e-02

Pos logit contributions
 than+0.056
 sizes+0.055
 kinds+0.055
 sorts+0.050
 textures+0.048
Neg logit contributions
ail-0.057
at-0.056
affle-0.050
ip-0.049
abella-0.048
Token very
Token ID845
Feature activation+2.91e+00

Pos logit contributions
 than+0.001
 kinds+0.001
 sizes+0.001
 sorts+0.001
 type+0.001
Neg logit contributions
at-0.001
ail-0.001
affle-0.001
ogo-0.001
 seeker-0.001
Token proud
Token ID6613
Feature activation+6.75e+00

Pos logit contributions
 kinds+0.064
 than+0.063
 sizes+0.063
 sorts+0.059
 type+0.055
Neg logit contributions
ail-0.057
at-0.056
affle-0.051
 seeker-0.051
 skating-0.048
Token of
Token ID286
Feature activation+2.70e+00

Pos logit contributions
 unison+0.150
 sizes+0.148
 target+0.147
Mine+0.140
ever+0.135
Neg logit contributions
ail-0.115
 on-0.108
m-0.104
ip-0.104
 skating-0.102
Token her
Token ID607
Feature activation-2.97e-01

Pos logit contributions
 than+0.049
 kinds+0.049
 sizes+0.046
 sorts+0.044
 type+0.042
Neg logit contributions
at-0.064
ail-0.062
affle-0.059
 seeker-0.058
ogo-0.057
Token for
Token ID329
Feature activation+5.38e-01

Pos logit contributions
ail+0.006
at+0.006
affle+0.005
 seeker+0.005
ip+0.005
Neg logit contributions
 kinds-0.006
 sizes-0.006
 than-0.006
 sorts-0.006
 type-0.006
Token being
Token ID852
Feature activation-3.40e-01

Pos logit contributions
 than+0.011
 kinds+0.011
 sizes+0.010
 sorts+0.010
 type+0.010
Neg logit contributions
at-0.012
ail-0.011
affle-0.011
 seeker-0.010
ogo-0.010
Token so
Token ID523
Feature activation+6.57e-01

Pos logit contributions
at+0.007
ail+0.006
 seeker+0.006
affle+0.006
 skating+0.006
Neg logit contributions
 sizes-0.007
 kinds-0.007
 than-0.007
 sorts-0.007
 unison-0.007
Token myself
Token ID3589
Feature activation+2.68e+00

Pos logit contributions
 kinds+0.064
 sizes+0.062
 than+0.062
 sorts+0.060
 textures+0.057
Neg logit contributions
at-0.060
ail-0.056
affle-0.052
 seeker-0.050
 skating-0.050
Token!"
Token ID2474
Feature activation+2.07e+00

Pos logit contributions
 kinds+0.045
 sizes+0.043
 than+0.043
 sorts+0.042
 type+0.040
Neg logit contributions
at-0.064
ail-0.063
affle-0.057
 seeker-0.057
ogo-0.055
Token His
Token ID2399
Feature activation-1.73e+00

Pos logit contributions
 kinds+0.047
 sizes+0.047
 than+0.046
 sorts+0.043
 type+0.042
Neg logit contributions
ail-0.037
at-0.036
abella-0.033
affle-0.033
 seeker-0.033
Token parents
Token ID3397
Feature activation+1.22e+00

Pos logit contributions
at+0.029
ail+0.028
affle+0.025
 seeker+0.025
ogo+0.024
Neg logit contributions
 kinds-0.043
 than-0.043
 sizes-0.042
 sorts-0.040
 type-0.039
Token were
Token ID547
Feature activation-3.56e-01

Pos logit contributions
 than+0.025
 kinds+0.024
 sizes+0.023
 sorts+0.022
 type+0.021
Neg logit contributions
at-0.026
ail-0.026
affle-0.024
 seeker-0.023
abella-0.023
Token surprised
Token ID6655
Feature activation+6.72e+00

Pos logit contributions
at+0.006
ail+0.006
affle+0.006
 seeker+0.006
ogo+0.005
Neg logit contributions
 kinds-0.008
 than-0.008
 sizes-0.008
 sorts-0.008
 type-0.008
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 sizes+0.159
 kinds+0.154
 target+0.152
 sorts+0.151
 type+0.150
Neg logit contributions
ail-0.100
 back-0.098
m-0.088
 forward-0.085
 on-0.084
Token their
Token ID511
Feature activation-1.86e-01

Pos logit contributions
 kinds+0.029
 sizes+0.027
 sorts+0.026
 than+0.026
 textures+0.025
Neg logit contributions
ail-0.025
at-0.024
 skating-0.024
 tired-0.023
 refreshed-0.023
Token eyes
Token ID2951
Feature activation-6.53e-01

Pos logit contributions
at+0.004
ail+0.004
affle+0.004
 seeker+0.004
ogo+0.004
Neg logit contributions
 kinds-0.004
 than-0.004
 sizes-0.003
 sorts-0.003
 type-0.003
Token widened
Token ID36405
Feature activation+4.20e+00

Pos logit contributions
at+0.013
ail+0.013
affle+0.012
 seeker+0.012
ogo+0.011
Neg logit contributions
 kinds-0.014
 than-0.013
 sizes-0.013
 sorts-0.013
 type-0.012
Token.
Token ID13
Feature activation+5.69e-01

Pos logit contributions
 kinds+0.102
 sizes+0.100
 sorts+0.099
 deserve+0.097
ever+0.088
Neg logit contributions
ail-0.060
 skating-0.059
at-0.058
bled-0.055
 seeker-0.052
TokenVal
Token ID7762
Feature activation+1.17e+00

Pos logit contributions
 kinds+0.029
 than+0.029
 sizes+0.028
 sorts+0.028
 type+0.026
Neg logit contributions
ail-0.026
at-0.025
 seeker-0.023
affle-0.022
ogo-0.021
Tokenerie
Token ID18287
Feature activation+2.65e+00

Pos logit contributions
 kinds+0.022
 than+0.022
 sizes+0.021
 sorts+0.020
 type+0.019
Neg logit contributions
at-0.026
ail-0.026
affle-0.023
 seeker-0.023
ogo-0.023
Token's
Token ID338
Feature activation-2.36e-01

Pos logit contributions
 kinds+0.053
 than+0.053
 sizes+0.051
 sorts+0.049
 type+0.047
Neg logit contributions
at-0.056
ail-0.055
affle-0.050
 seeker-0.049
ip-0.048
Token mum
Token ID25682
Feature activation+2.20e+00

Pos logit contributions
at+0.005
ail+0.005
affle+0.004
 seeker+0.004
ogo+0.004
Neg logit contributions
 kinds-0.005
 than-0.005
 sizes-0.005
 sorts-0.005
 type-0.004
Token was
Token ID373
Feature activation+5.73e-01

Pos logit contributions
 kinds+0.040
 than+0.040
 sizes+0.038
 sorts+0.037
 type+0.035
Neg logit contributions
at-0.050
ail-0.050
 seeker-0.046
affle-0.046
ogo-0.045
Token surprised
Token ID6655
Feature activation+6.71e+00

Pos logit contributions
 than+0.013
 kinds+0.013
 sizes+0.012
 sorts+0.012
 type+0.011
Neg logit contributions
at-0.011
ail-0.011
affle-0.010
 seeker-0.010
ogo-0.010
Token when
Token ID618
Feature activation+2.87e-01

Pos logit contributions
 sizes+0.146
 kinds+0.143
 unison+0.140
 target+0.139
 type+0.136
Neg logit contributions
ail-0.119
at-0.108
 back-0.102
 onwards-0.099
 skating-0.098
Token she
Token ID673
Feature activation-9.13e-01

Pos logit contributions
 than+0.006
 kinds+0.006
 sizes+0.005
 sorts+0.005
 type+0.005
Neg logit contributions
at-0.006
ail-0.006
affle-0.006
 seeker-0.006
ogo-0.006
Token saw
Token ID2497
Feature activation+1.13e+00

Pos logit contributions
at+0.018
ail+0.018
affle+0.017
 seeker+0.016
ogo+0.016
Neg logit contributions
 kinds-0.019
 than-0.019
 sizes-0.019
 sorts-0.018
 type-0.017
Token Valerie
Token ID45603
Feature activation+1.29e+00

Pos logit contributions
 than+0.023
 kinds+0.023
 sizes+0.023
 sorts+0.021
 type+0.021
Neg logit contributions
at-0.023
ail-0.023
affle-0.020
 seeker-0.020
abella-0.020
Token holding
Token ID4769
Feature activation-9.66e-01

Pos logit contributions
 kinds+0.027
 than+0.027
 sizes+0.026
 sorts+0.025
 type+0.024
Neg logit contributions
at-0.026
ail-0.026
affle-0.024
 seeker-0.024
ogo-0.023
Token They
Token ID1119
Feature activation+1.28e+00

Pos logit contributions
ail+0.044
at+0.043
affle+0.039
 seeker+0.039
ogo+0.037
Neg logit contributions
 kinds-0.065
 sorts-0.062
 than-0.061
 sizes-0.061
 type-0.057
Token also
Token ID635
Feature activation+1.20e+00

Pos logit contributions
 kinds+0.025
 than+0.025
 sizes+0.024
 sorts+0.023
 type+0.022
Neg logit contributions
at-0.027
ail-0.027
 seeker-0.025
affle-0.025
ogo-0.024
Token said
Token ID531
Feature activation-1.56e+00

Pos logit contributions
 kinds+0.023
 than+0.023
 sizes+0.022
 sorts+0.021
 type+0.020
Neg logit contributions
at-0.026
ail-0.025
 seeker-0.024
affle-0.023
 skating-0.022
Token they
Token ID484
Feature activation+1.99e-01

Pos logit contributions
at+0.033
ail+0.032
affle+0.030
 seeker+0.029
ogo+0.029
Neg logit contributions
 than-0.032
 kinds-0.032
 sizes-0.031
 sorts-0.029
 type-0.029
Token were
Token ID547
Feature activation-1.17e+00

Pos logit contributions
 than+0.004
 kinds+0.004
 sizes+0.004
 sorts+0.004
 type+0.003
Neg logit contributions
at-0.004
ail-0.004
affle-0.004
 seeker-0.004
ogo-0.004
Token proud
Token ID6613
Feature activation+6.70e+00

Pos logit contributions
at+0.022
ail+0.022
affle+0.019
 seeker+0.019
ogo+0.019
Neg logit contributions
 than-0.027
 kinds-0.027
 sizes-0.026
 sorts-0.025
 type-0.024
Token of
Token ID286
Feature activation+1.49e+00

Pos logit contributions
 target+0.138
 sizes+0.131
 kinds+0.123
 textures+0.123
 unison+0.121
Neg logit contributions
ail-0.114
 surfing-0.110
 skating-0.109
 on-0.109
 morning-0.107
Token them
Token ID606
Feature activation-4.75e-01

Pos logit contributions
 kinds+0.029
 than+0.028
 sizes+0.027
 sorts+0.026
 type+0.025
Neg logit contributions
at-0.033
ail-0.032
affle-0.030
 seeker-0.030
ogo-0.029
Token for
Token ID329
Feature activation+3.26e-02

Pos logit contributions
at+0.012
ail+0.012
affle+0.011
 seeker+0.011
ogo+0.010
Neg logit contributions
 kinds-0.008
 than-0.008
 sizes-0.007
 sorts-0.007
 type-0.007
Token telling
Token ID5149
Feature activation+1.45e+00

Pos logit contributions
 kinds+0.001
 than+0.001
 sizes+0.001
 sorts+0.001
lier+0.001
Neg logit contributions
at-0.001
ail-0.001
affle-0.001
 seeker-0.001
 skating-0.001
Token the
Token ID262
Feature activation+1.80e+00

Pos logit contributions
 than+0.031
 kinds+0.031
 sizes+0.030
 sorts+0.029
 type+0.027
Neg logit contributions
ail-0.028
at-0.028
affle-0.025
 seeker-0.025
abella-0.025
Token village
Token ID7404
Feature activation+1.83e+00

Pos logit contributions
 than+0.032
 kinds+0.031
 sizes+0.029
 type+0.028
 sorts+0.027
Neg logit contributions
ail-0.040
at-0.039
ogo-0.037
affle-0.036
ip-0.036
Token people
Token ID661
Feature activation+1.25e+00

Pos logit contributions
 kinds+0.038
 than+0.038
 sizes+0.036
 sorts+0.035
 type+0.033
Neg logit contributions
at-0.038
ail-0.037
affle-0.034
 seeker-0.034
ogo-0.033
Token were
Token ID547
Feature activation-9.15e-01

Pos logit contributions
 kinds+0.024
 than+0.024
 sizes+0.023
 sorts+0.022
 type+0.021
Neg logit contributions
at-0.027
ail-0.027
affle-0.025
 seeker-0.025
ogo-0.024
Token all
Token ID477
Feature activation+2.13e+00

Pos logit contributions
at+0.017
ail+0.017
affle+0.016
 seeker+0.015
ogo+0.015
Neg logit contributions
 than-0.021
 kinds-0.021
 sizes-0.020
 sorts-0.019
 type-0.019
Token very
Token ID845
Feature activation+2.81e+00

Pos logit contributions
 kinds+0.051
 than+0.050
 sizes+0.049
 sorts+0.047
 type+0.045
Neg logit contributions
at-0.038
ail-0.037
affle-0.034
 seeker-0.034
ogo-0.033
Token proud
Token ID6613
Feature activation+6.70e+00

Pos logit contributions
 kinds+0.060
 than+0.060
 sizes+0.058
 sorts+0.056
 type+0.053
Neg logit contributions
at-0.056
ail-0.055
affle-0.051
 seeker-0.050
ogo-0.049
Token of
Token ID286
Feature activation+2.28e+00

Pos logit contributions
 unison+0.133
 target+0.131
Mine+0.127
 type+0.123
 sizes+0.122
Neg logit contributions
 skating-0.115
 on-0.112
at-0.112
ail-0.111
 surfing-0.111
Token the
Token ID262
Feature activation+1.64e+00

Pos logit contributions
 kinds+0.049
 sizes+0.048
 than+0.047
 sorts+0.046
 target+0.045
Neg logit contributions
ail-0.042
at-0.041
affle-0.038
abella-0.038
ogo-0.037
Token bell
Token ID8966
Feature activation-1.77e+00

Pos logit contributions
 kinds+0.032
 than+0.032
 sizes+0.030
 sorts+0.029
 type+0.028
Neg logit contributions
at-0.036
ail-0.036
affle-0.033
 seeker-0.033
ogo-0.032
Token,
Token ID11
Feature activation+5.49e-01

Pos logit contributions
at+0.041
ail+0.041
ip+0.036
affle+0.036
 skating+0.036
Neg logit contributions
 kinds-0.031
 than-0.030
 sizes-0.029
 sorts-0.029
 type-0.027
Token because
Token ID780
Feature activation+1.50e+00

Pos logit contributions
 kinds+0.010
 than+0.010
 sizes+0.010
 sorts+0.010
lier+0.009
Neg logit contributions
at-0.011
ail-0.011
 seeker-0.010
 skating-0.010
affle-0.010
Token beautiful
Token ID4950
Feature activation+4.34e+00

Pos logit contributions
 unison+0.089
 sizes+0.087
 kinds+0.084
 target+0.083
 than+0.081
Neg logit contributions
at-0.074
affle-0.069
ail-0.069
ip-0.065
 skating-0.061
Token horn
Token ID12718
Feature activation-1.07e-01

Pos logit contributions
 sizes+0.082
 kinds+0.081
 than+0.081
 sorts+0.077
 unison+0.077
Neg logit contributions
at-0.094
ail-0.090
affle-0.085
 seeker-0.081
ip-0.080
Token which
Token ID543
Feature activation-9.52e-01

Pos logit contributions
at+0.002
ail+0.002
affle+0.002
 seeker+0.002
ip+0.002
Neg logit contributions
 kinds-0.002
 than-0.002
 sizes-0.002
 sorts-0.002
 type-0.002
Token she
Token ID673
Feature activation+3.47e-01

Pos logit contributions
at+0.018
ail+0.017
 seeker+0.016
affle+0.015
atmeal+0.015
Neg logit contributions
 than-0.021
 sizes-0.020
 kinds-0.020
 unison-0.020
ever-0.019
Token was
Token ID373
Feature activation+2.71e-01

Pos logit contributions
 kinds+0.007
 than+0.007
 sizes+0.006
 sorts+0.006
 type+0.006
Neg logit contributions
at-0.008
ail-0.008
affle-0.007
 seeker-0.007
ogo-0.007
Token proud
Token ID6613
Feature activation+6.70e+00

Pos logit contributions
 than+0.006
 kinds+0.006
 sizes+0.006
 sorts+0.006
 type+0.005
Neg logit contributions
at-0.005
ail-0.005
affle-0.005
 seeker-0.005
ogo-0.005
Token of
Token ID286
Feature activation+2.63e+00

Pos logit contributions
 unison+0.143
 target+0.133
 sizes+0.130
ever+0.126
Mine+0.125
Neg logit contributions
ail-0.114
 on-0.113
 morning-0.110
at-0.109
 back-0.106
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
 kinds+0.050
 than+0.050
 sizes+0.048
 sorts+0.046
 type+0.044
Neg logit contributions
at-0.059
ail-0.058
affle-0.054
 seeker-0.053
ogo-0.052
Token Every
Token ID3887
Feature activation-6.17e-02

Pos logit contributions
at+0.006
ail+0.006
 seeker+0.005
affle+0.005
ogo+0.005
Neg logit contributions
 than-0.006
 kinds-0.006
 sizes-0.006
 sorts-0.006
 type-0.005
Token morning
Token ID3329
Feature activation-1.70e+00

Pos logit contributions
at+0.001
ail+0.001
affle+0.001
 seeker+0.001
ogo+0.001
Neg logit contributions
 kinds-0.002
 than-0.002
 sizes-0.002
 sorts-0.002
 type-0.002
Token,
Token ID11
Feature activation-7.89e-01

Pos logit contributions
at+0.038
ail+0.037
ogo+0.036
affle+0.036
 seeker+0.033
Neg logit contributions
 than-0.035
 kinds-0.033
 sizes-0.031
 sorts-0.030
 type-0.030
Tokenily
Token ID813
Feature activation+1.70e+00

Pos logit contributions
 than+0.030
 sizes+0.030
 kinds+0.030
 sorts+0.028
 type+0.027
Neg logit contributions
at-0.021
ail-0.020
affle-0.019
 seeker-0.018
ogo-0.017
Token's
Token ID338
Feature activation-1.84e+00

Pos logit contributions
 than+0.039
 kinds+0.038
 sizes+0.037
 sorts+0.035
 type+0.033
Neg logit contributions
ail-0.034
at-0.032
ip-0.029
affle-0.029
pee-0.029
Token mom
Token ID1995
Feature activation+3.04e+00

Pos logit contributions
at+0.028
ail+0.028
affle+0.024
 seeker+0.024
ogo+0.023
Neg logit contributions
 kinds-0.048
 than-0.048
 sizes-0.047
 sorts-0.045
 type-0.044
Tokenmy
Token ID1820
Feature activation+2.57e+00

Pos logit contributions
 than+0.086
 kinds+0.085
 sizes+0.083
 sorts+0.080
 type+0.078
Neg logit contributions
ail-0.041
at-0.041
affle-0.033
 seeker-0.032
ogo-0.032
Token was
Token ID373
Feature activation-3.71e-02

Pos logit contributions
 than+0.060
 kinds+0.060
 sizes+0.058
 sorts+0.056
 type+0.054
Neg logit contributions
at-0.047
ail-0.046
affle-0.042
 seeker-0.041
ogo-0.040
Token proud
Token ID6613
Feature activation+6.69e+00

Pos logit contributions
at+0.001
ail+0.001
ogo+0.001
affle+0.001
ipop+0.001
Neg logit contributions
 than-0.001
 kinds-0.001
 type-0.001
 who-0.001
 sorts-0.001
Token of
Token ID286
Feature activation+2.91e+00

Pos logit contributions
 sizes+0.142
 target+0.141
 unison+0.140
Mine+0.130
 kinds+0.130
Neg logit contributions
ail-0.122
ip-0.110
 tired-0.109
 skating-0.108
 on-0.108
Token her
Token ID607
Feature activation-6.35e-02

Pos logit contributions
 than+0.052
 kinds+0.051
 sizes+0.048
 sorts+0.046
 type+0.045
Neg logit contributions
at-0.071
ail-0.068
affle-0.064
 seeker-0.064
ogo-0.062
Token for
Token ID329
Feature activation+5.29e-01

Pos logit contributions
ail+0.001
at+0.001
affle+0.001
 seeker+0.001
ip+0.001
Neg logit contributions
 kinds-0.001
 than-0.001
 sizes-0.001
 sorts-0.001
 type-0.001
Token being
Token ID852
Feature activation+4.29e-02

Pos logit contributions
 than+0.011
 kinds+0.010
 sizes+0.010
 sorts+0.010
 type+0.009
Neg logit contributions
at-0.012
ail-0.011
affle-0.010
 seeker-0.010
ogo-0.010
Token so
Token ID523
Feature activation+7.01e-01

Pos logit contributions
 kinds+0.001
 than+0.001
 sizes+0.001
 sorts+0.001
 unison+0.001
Neg logit contributions
at-0.001
ail-0.001
 seeker-0.001
affle-0.001
 skating-0.001
Tokenmy
Token ID1820
Feature activation+1.83e+00

Pos logit contributions
 kinds+0.031
 than+0.031
 sizes+0.030
 sorts+0.029
 type+0.028
Neg logit contributions
at-0.024
ail-0.023
 seeker-0.022
affle-0.022
ogo-0.021
Token's
Token ID338
Feature activation-1.45e+00

Pos logit contributions
 kinds+0.038
 than+0.037
 sizes+0.036
 sorts+0.035
 type+0.033
Neg logit contributions
at-0.038
ail-0.038
affle-0.035
 seeker-0.034
ogo-0.034
Token mom
Token ID1995
Feature activation+3.36e+00

Pos logit contributions
at+0.027
ail+0.024
affle+0.023
 seeker+0.022
ogo+0.022
Neg logit contributions
 than-0.034
 kinds-0.034
 sizes-0.033
 type-0.031
 sorts-0.031
Tokenmy
Token ID1820
Feature activation+2.73e+00

Pos logit contributions
 than+0.080
 kinds+0.080
 sizes+0.078
 sorts+0.074
 type+0.072
Neg logit contributions
ail-0.060
at-0.060
affle-0.053
 seeker-0.051
ogo-0.050
Token was
Token ID373
Feature activation-6.23e-02

Pos logit contributions
 kinds+0.061
 than+0.061
 sizes+0.058
 sorts+0.057
 type+0.055
Neg logit contributions
at-0.051
ail-0.049
affle-0.047
 seeker-0.047
ogo-0.045
Token surprised
Token ID6655
Feature activation+6.68e+00

Pos logit contributions
at+0.001
ail+0.001
ogo+0.001
affle+0.001
ipop+0.001
Neg logit contributions
 than-0.001
 kinds-0.001
 type-0.001
 sorts-0.001
 who-0.001
Token when
Token ID618
Feature activation+7.62e-01

Pos logit contributions
 sizes+0.160
 unison+0.155
 kinds+0.153
 target+0.149
 type+0.146
Neg logit contributions
ail-0.111
 back-0.096
at-0.096
m-0.088
 skating-0.088
Token she
Token ID673
Feature activation-1.12e+00

Pos logit contributions
 than+0.015
 kinds+0.015
 sizes+0.014
 sorts+0.013
 type+0.013
Neg logit contributions
at-0.018
ail-0.016
affle-0.016
 seeker-0.015
ogo-0.015
Token heard
Token ID2982
Feature activation+2.79e+00

Pos logit contributions
at+0.026
 seeker+0.025
ail+0.024
affle+0.024
ogo+0.024
Neg logit contributions
 kinds-0.020
 than-0.019
 sorts-0.018
 sizes-0.018
 type-0.018
Token a
Token ID257
Feature activation+2.41e+00

Pos logit contributions
 than+0.062
 kinds+0.061
 sizes+0.060
 sorts+0.057
 type+0.055
Neg logit contributions
at-0.053
ail-0.052
affle-0.047
 seeker-0.046
ogo-0.045
Token loud
Token ID7812
Feature activation+2.48e+00

Pos logit contributions
 kinds+0.042
 than+0.042
 sizes+0.040
 sorts+0.038
 type+0.037
Neg logit contributions
at-0.057
ail-0.056
affle-0.053
 seeker-0.052
ogo-0.051
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
 kinds+0.040
 sizes+0.040
 sorts+0.038
 than+0.037
 textures+0.037
Neg logit contributions
at-0.018
 seeker-0.017
ail-0.017
 skating-0.016
affle-0.015
Token The
Token ID383
Feature activation+1.85e+00

Pos logit contributions
 kinds+0.028
 than+0.028
 sizes+0.027
 sorts+0.026
 type+0.024
Neg logit contributions
at-0.031
ail-0.030
affle-0.028
 seeker-0.028
ogo-0.027
Token man
Token ID582
Feature activation+2.49e+00

Pos logit contributions
 kinds+0.044
 than+0.044
 sizes+0.043
 sorts+0.042
lier+0.040
Neg logit contributions
at-0.030
ail-0.030
 seeker-0.029
affle-0.028
 skating-0.026
Token was
Token ID373
Feature activation+1.50e-01

Pos logit contributions
 kinds+0.050
 than+0.050
 sizes+0.048
 sorts+0.046
 type+0.044
Neg logit contributions
at-0.052
ail-0.052
affle-0.048
 seeker-0.047
ogo-0.046
Token very
Token ID845
Feature activation+3.06e+00

Pos logit contributions
 than+0.003
 kinds+0.003
 sizes+0.003
 sorts+0.003
 type+0.003
Neg logit contributions
at-0.003
ail-0.003
affle-0.003
atmeal-0.003
ogo-0.003
Token proud
Token ID6613
Feature activation+6.67e+00

Pos logit contributions
 kinds+0.064
 than+0.064
 sizes+0.061
 sorts+0.059
 type+0.056
Neg logit contributions
at-0.063
ail-0.062
affle-0.057
 seeker-0.057
ogo-0.055
Token of
Token ID286
Feature activation+2.28e+00

Pos logit contributions
 unison+0.140
Mine+0.133
 sizes+0.130
 target+0.126
 type+0.123
Neg logit contributions
 skating-0.114
ail-0.113
 surfing-0.113
 on-0.109
loo-0.108
Token his
Token ID465
Feature activation-1.69e+00

Pos logit contributions
 than+0.044
 kinds+0.043
 sizes+0.041
 sorts+0.039
 type+0.038
Neg logit contributions
at-0.053
ail-0.052
 seeker-0.048
affle-0.048
ip-0.046
Token trophy
Token ID16383
Feature activation+2.55e-01

Pos logit contributions
ail+0.033
at+0.032
 seeker+0.030
affle+0.030
 skating+0.029
Neg logit contributions
 kinds-0.037
 than-0.036
 sizes-0.035
 sorts-0.034
lier-0.032
Token and
Token ID290
Feature activation+5.19e-02

Pos logit contributions
 kinds+0.005
 sizes+0.005
 unison+0.005
 sorts+0.005
 than+0.005
Neg logit contributions
ail-0.005
at-0.005
 seeker-0.005
affle-0.005
ip-0.005
Token showed
Token ID3751
Feature activation+2.15e+00

Pos logit contributions
 than+0.001
 kinds+0.001
 unison+0.001
 sizes+0.001
lier+0.001
Neg logit contributions
ail-0.001
at-0.001
 seeker-0.001
 skating-0.001
affle-0.001
Token sharing
Token ID7373
Feature activation+1.36e-03

Pos logit contributions
at+0.002
ail+0.002
 seeker+0.002
affle+0.002
 skating+0.002
Neg logit contributions
 kinds-0.003
 than-0.003
 sizes-0.003
 sorts-0.003
lier-0.002
Token is
Token ID318
Feature activation+2.54e-01

Pos logit contributions
 unison+0.000
 kinds+0.000
 sorts+0.000
ever+0.000
 sizes+0.000
Neg logit contributions
ail-0.000
at-0.000
 on-0.000
 back-0.000
 done-0.000
Token good
Token ID922
Feature activation+1.12e+00

Pos logit contributions
 kinds+0.006
 than+0.006
 sizes+0.005
 sorts+0.005
 type+0.005
Neg logit contributions
ail-0.004
at-0.004
 seeker-0.004
affle-0.004
 skating-0.004
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
 kinds+0.027
 sorts+0.027
 sizes+0.026
 unison+0.025
 type+0.024
Neg logit contributions
 seeker-0.017
 on-0.016
 back-0.015
ail-0.015
 best-0.015
Token<|endoftext|>
Token ID50256
Feature activation-2.33e+00

Pos logit contributions
 kinds+0.003
 than+0.003
 sizes+0.003
 sorts+0.003
 type+0.003
Neg logit contributions
ail-0.003
 seeker-0.003
at-0.003
abella-0.003
ip-0.003
TokenOnce
Token ID7454
Feature activation-1.07e+01

Pos logit contributions
ail+0.048
at+0.047
affle+0.044
ogo+0.043
 seeker+0.042
Neg logit contributions
 kinds-0.049
 than-0.048
 sorts-0.045
 sizes-0.043
lier-0.042
Token,
Token ID11
Feature activation-7.11e+00

Pos logit contributions
at+0.205
atmeal+0.188
affle+0.184
pee+0.179
 seeker+0.176
Neg logit contributions
 than-0.241
 kinds-0.241
 sorts-0.223
 type-0.219
 sizes-0.212
Token there
Token ID612
Feature activation-4.59e+00

Pos logit contributions
at+0.164
affle+0.153
atmeal+0.152
ail+0.142
ogo+0.141
Neg logit contributions
 than-0.139
 who-0.132
 type-0.124
 kinds-0.124
 whose-0.117
Token was
Token ID373
Feature activation-2.39e+00

Pos logit contributions
at+0.099
ail+0.096
affle+0.090
 seeker+0.088
ogo+0.087
Neg logit contributions
 kinds-0.093
 than-0.092
 sizes-0.088
 sorts-0.085
 type-0.082
Token a
Token ID257
Feature activation+1.31e+00

Pos logit contributions
at+0.046
ail+0.044
 seeker+0.042
affle+0.042
 skating+0.039
Neg logit contributions
 kinds-0.053
 than-0.052
 sizes-0.052
 sorts-0.049
 type-0.047
Token girl
Token ID2576
Feature activation+2.04e+00

Pos logit contributions
 than+0.023
 kinds+0.023
 sizes+0.022
 sorts+0.021
 type+0.020
Neg logit contributions
at-0.031
ail-0.031
affle-0.029
 seeker-0.028
ogo-0.028
Token happy
Token ID3772
Feature activation+2.74e+00

Pos logit contributions
at+0.037
ail+0.035
affle+0.034
ogo+0.032
 seeker+0.031
Neg logit contributions
 than-0.035
 kinds-0.033
 sizes-0.032
 sorts-0.030
 type-0.029
Token and
Token ID290
Feature activation-1.43e+00

Pos logit contributions
 kinds+0.052
 than+0.050
 sizes+0.050
 sorts+0.048
 type+0.046
Neg logit contributions
at-0.061
ail-0.061
affle-0.056
 seeker-0.056
ogo-0.054
Token tired
Token ID10032
Feature activation+1.91e+00

Pos logit contributions
at+0.027
ail+0.026
affle+0.025
ogo+0.023
 seeker+0.023
Neg logit contributions
 than-0.033
 kinds-0.033
 sizes-0.032
 sorts-0.031
 type-0.029
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
 kinds+0.034
 sizes+0.032
 than+0.032
 sorts+0.031
 type+0.029
Neg logit contributions
at-0.044
ail-0.044
 seeker-0.041
affle-0.040
 skating-0.039
Token<|endoftext|>
Token ID50256
Feature activation-2.45e+00

Pos logit contributions
ail+0.005
at+0.005
 seeker+0.005
affle+0.005
ogo+0.005
Neg logit contributions
 kinds-0.004
 than-0.004
 sizes-0.004
 sorts-0.004
 type-0.003
TokenOnce
Token ID7454
Feature activation-1.07e+01

Pos logit contributions
ail+0.052
at+0.052
affle+0.049
belt+0.047
atmeal+0.047
Neg logit contributions
 kinds-0.048
 than-0.048
 sorts-0.045
 sizes-0.042
lier-0.041
Token upon
Token ID2402
Feature activation-1.13e-01

Pos logit contributions
at+0.210
atmeal+0.195
affle+0.193
pee+0.184
 seeker+0.183
Neg logit contributions
 kinds-0.236
 than-0.235
 sorts-0.218
 type-0.211
 sizes-0.203
Token a
Token ID257
Feature activation-3.39e+00

Pos logit contributions
at+0.002
 seeker+0.002
ail+0.002
affle+0.002
pee+0.002
Neg logit contributions
 kinds-0.003
 sizes-0.003
 than-0.003
 sorts-0.003
 type-0.002
Token time
Token ID640
Feature activation-6.62e+00

Pos logit contributions
at+0.052
ail+0.050
affle+0.046
 seeker+0.045
ogo+0.042
Neg logit contributions
 kinds-0.090
 sizes-0.088
 than-0.087
 sorts-0.084
 textures-0.080
Token,
Token ID11
Feature activation-5.91e+00

Pos logit contributions
at+0.151
affle+0.142
abella+0.135
ail+0.134
atmeal+0.133
Neg logit contributions
 than-0.132
 kinds-0.118
 sizes-0.110
 type-0.110
 sorts-0.107
Token there
Token ID612
Feature activation-3.92e+00

Pos logit contributions
at+0.129
affle+0.121
atmeal+0.119
ail+0.117
ogo+0.112
Neg logit contributions
 than-0.125
 kinds-0.112
 type-0.110
 who-0.109
 sizes-0.107
Token Mrs
Token ID9074
Feature activation+2.22e+00

Pos logit contributions
 kinds+0.022
 than+0.022
 sizes+0.021
 sorts+0.021
 type+0.019
Neg logit contributions
ail-0.020
at-0.020
affle-0.018
 seeker-0.018
ogo-0.018
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
 kinds+0.042
 sorts+0.040
 sizes+0.038
 than+0.038
 type+0.037
Neg logit contributions
ail-0.049
at-0.048
 seeker-0.045
affle-0.044
ogo-0.044
Token Smith
Token ID4176
Feature activation+1.58e+00

Pos logit contributions
at+0.024
ail+0.023
affle+0.021
 seeker+0.021
ogo+0.020
Neg logit contributions
 than-0.030
 kinds-0.030
 sizes-0.029
 sorts-0.028
 type-0.027
Token.
Token ID13
Feature activation-4.47e-01

Pos logit contributions
 than+0.029
 kinds+0.029
 sizes+0.028
 sorts+0.027
 unison+0.026
Neg logit contributions
ail-0.035
at-0.034
ip-0.031
affle-0.031
ipop-0.030
Token<|endoftext|>
Token ID50256
Feature activation-2.39e+00

Pos logit contributions
ail+0.008
at+0.007
abella+0.007
 seeker+0.007
ip+0.007
Neg logit contributions
 than-0.011
 kinds-0.010
 sizes-0.010
 type-0.010
 sorts-0.009
TokenOnce
Token ID7454
Feature activation-1.07e+01

Pos logit contributions
at+0.056
ail+0.056
affle+0.052
 seeker+0.051
ogo+0.050
Neg logit contributions
 kinds-0.043
 than-0.042
 sorts-0.039
 sizes-0.039
 type-0.036
Token upon
Token ID2402
Feature activation+1.68e-01

Pos logit contributions
at+0.183
affle+0.164
atmeal+0.159
pee+0.158
 seeker+0.157
Neg logit contributions
 kinds-0.267
 than-0.260
 sorts-0.251
 type-0.240
 sizes-0.238
Token a
Token ID257
Feature activation-3.00e+00

Pos logit contributions
 sizes+0.004
 kinds+0.004
 than+0.004
 sorts+0.004
 type+0.004
Neg logit contributions
at-0.002
 seeker-0.002
ail-0.002
affle-0.002
pee-0.002
Token time
Token ID640
Feature activation-6.47e+00

Pos logit contributions
at+0.040
ail+0.037
affle+0.035
 seeker+0.033
ip+0.032
Neg logit contributions
 kinds-0.087
 sizes-0.085
 than-0.082
 sorts-0.081
 textures-0.078
Token,
Token ID11
Feature activation-5.70e+00

Pos logit contributions
at+0.141
affle+0.131
ail+0.127
atmeal+0.123
abella+0.122
Neg logit contributions
 than-0.135
 kinds-0.126
 sizes-0.118
 sorts-0.116
 type-0.115
Token there
Token ID612
Feature activation-3.36e+00

Pos logit contributions
at+0.121
affle+0.113
ail+0.113
atmeal+0.111
ogo+0.108
Neg logit contributions
 than-0.121
 kinds-0.112
 type-0.108
 sizes-0.105
 who-0.105
Token and
Token ID290
Feature activation+2.30e-01

Pos logit contributions
 kinds+0.049
 sizes+0.048
 sorts+0.047
 than+0.046
 type+0.045
Neg logit contributions
ail-0.054
at-0.052
affle-0.048
 seeker-0.048
ipop-0.047
Token clean
Token ID3424
Feature activation+5.13e-01

Pos logit contributions
 than+0.005
 kinds+0.005
 sizes+0.005
 sorts+0.004
 type+0.004
Neg logit contributions
ail-0.004
at-0.004
 seeker-0.004
affle-0.004
 skating-0.004
Token again
Token ID757
Feature activation+2.40e-02

Pos logit contributions
 kinds+0.009
 than+0.009
 sizes+0.009
 sorts+0.009
 type+0.008
Neg logit contributions
at-0.012
ail-0.012
affle-0.011
 seeker-0.011
ogo-0.010
Token.
Token ID13
Feature activation-7.16e-01

Pos logit contributions
 kinds+0.000
 sizes+0.000
 sorts+0.000
 than+0.000
 type+0.000
Neg logit contributions
ail-0.001
at-0.001
 seeker-0.000
 skating-0.000
affle-0.000
Token<|endoftext|>
Token ID50256
Feature activation-2.65e+00

Pos logit contributions
at+0.014
ail+0.014
 seeker+0.013
affle+0.013
ip+0.013
Neg logit contributions
 kinds-0.015
 than-0.015
 sizes-0.014
 sorts-0.014
 type-0.013
TokenOnce
Token ID7454
Feature activation-1.07e+01

Pos logit contributions
at+0.067
ail+0.067
affle+0.065
atmeal+0.063
belt+0.063
Neg logit contributions
 than-0.040
 kinds-0.040
 sorts-0.037
 sizes-0.033
lier-0.033
Token upon
Token ID2402
Feature activation-1.02e-01

Pos logit contributions
at+0.228
atmeal+0.216
affle+0.212
pee+0.202
ridge+0.201
Neg logit contributions
 than-0.211
 kinds-0.210
 sorts-0.193
 type-0.187
 who-0.186
Token a
Token ID257
Feature activation-3.34e+00

Pos logit contributions
at+0.002
ail+0.002
 seeker+0.002
affle+0.002
 skating+0.002
Neg logit contributions
 kinds-0.002
 than-0.002
 sizes-0.002
 sorts-0.002
 type-0.002
Token time
Token ID640
Feature activation-7.04e+00

Pos logit contributions
at+0.052
ail+0.051
affle+0.047
 seeker+0.045
ogo+0.043
Neg logit contributions
 kinds-0.088
 sizes-0.086
 than-0.085
 sorts-0.082
 type-0.078
Token,
Token ID11
Feature activation-6.06e+00

Pos logit contributions
at+0.166
affle+0.157
abella+0.149
atmeal+0.149
ail+0.146
Neg logit contributions
 than-0.136
 kinds-0.119
 sizes-0.111
 type-0.110
ever-0.110
Token in
Token ID287
Feature activation-2.30e+00

Pos logit contributions
at+0.138
affle+0.134
atmeal+0.132
ail+0.123
ogo+0.121
Neg logit contributions
 than-0.120
 who-0.109
 type-0.103
 kinds-0.102
 sizes-0.098
Token happy
Token ID3772
Feature activation+2.41e+00

Pos logit contributions
at+0.026
ail+0.025
affle+0.024
 seeker+0.023
ogo+0.023
Neg logit contributions
 than-0.023
 kinds-0.023
 sizes-0.022
 sorts-0.021
 type-0.020
Token and
Token ID290
Feature activation-1.38e+00

Pos logit contributions
 kinds+0.049
 than+0.048
 sizes+0.048
 sorts+0.046
 type+0.044
Neg logit contributions
ail-0.049
at-0.049
 seeker-0.045
affle-0.045
ogo-0.043
Token safe
Token ID3338
Feature activation+1.34e+00

Pos logit contributions
at+0.027
ail+0.026
affle+0.025
 seeker+0.024
ogo+0.024
Neg logit contributions
 than-0.031
 kinds-0.031
 sizes-0.030
 sorts-0.028
 type-0.027
Token.
Token ID13
Feature activation-2.60e-01

Pos logit contributions
 kinds+0.022
 than+0.022
 sizes+0.021
 sorts+0.020
 type+0.019
Neg logit contributions
at-0.033
ail-0.033
 seeker-0.030
affle-0.030
ogo-0.029
Token<|endoftext|>
Token ID50256
Feature activation-2.34e+00

Pos logit contributions
ail+0.006
at+0.006
 seeker+0.006
affle+0.006
ogo+0.005
Neg logit contributions
 kinds-0.005
 than-0.005
 sizes-0.004
 sorts-0.004
 type-0.004
TokenOnce
Token ID7454
Feature activation-1.07e+01

Pos logit contributions
ail+0.049
at+0.049
affle+0.047
unct+0.045
belt+0.045
Neg logit contributions
 kinds-0.046
 than-0.046
 sorts-0.043
 sizes-0.040
lier-0.039
Token upon
Token ID2402
Feature activation-5.08e-02

Pos logit contributions
at+0.214
atmeal+0.200
affle+0.198
pee+0.187
 seeker+0.185
Neg logit contributions
 kinds-0.230
 than-0.229
 sorts-0.214
 type-0.206
 who-0.200
Token a
Token ID257
Feature activation-3.22e+00

Pos logit contributions
at+0.001
ail+0.001
 seeker+0.001
affle+0.001
 skating+0.001
Neg logit contributions
 kinds-0.001
 sizes-0.001
 than-0.001
 sorts-0.001
 type-0.001
Token time
Token ID640
Feature activation-6.64e+00

Pos logit contributions
at+0.048
ail+0.046
affle+0.042
 seeker+0.041
ogo+0.039
Neg logit contributions
 kinds-0.088
 sizes-0.085
 than-0.083
 sorts-0.082
 textures-0.077
Token,
Token ID11
Feature activation-5.78e+00

Pos logit contributions
at+0.156
affle+0.147
atmeal+0.139
abella+0.139
ail+0.137
Neg logit contributions
 than-0.130
 kinds-0.114
 type-0.107
 sizes-0.106
ever-0.104
Token there
Token ID612
Feature activation-3.72e+00

Pos logit contributions
at+0.132
atmeal+0.126
affle+0.126
ail+0.120
ogo+0.116
Neg logit contributions
 than-0.115
 who-0.103
 type-0.101
 kinds-0.101
 sizes-0.096
Token and
Token ID290
Feature activation-6.09e-01

Pos logit contributions
 sorts+0.119
 sizes+0.118
 unison+0.118
 kinds+0.116
 type+0.112
Neg logit contributions
 back-0.072
ail-0.070
 home-0.069
 seeker-0.066
 on-0.060
Token respect
Token ID2461
Feature activation+1.73e+00

Pos logit contributions
ail+0.012
at+0.011
 seeker+0.011
 tired+0.011
affle+0.011
Neg logit contributions
 kinds-0.012
 than-0.011
 sizes-0.011
lier-0.011
 unison-0.011
Token others
Token ID1854
Feature activation+2.37e-01

Pos logit contributions
 kinds+0.033
 sorts+0.032
 sizes+0.032
ever+0.030
 unison+0.030
Neg logit contributions
at-0.033
ail-0.032
affle-0.030
 skating-0.029
ogo-0.029
Token.
Token ID13
Feature activation+2.61e-01

Pos logit contributions
 kinds+0.004
 sorts+0.004
 sizes+0.004
 unison+0.004
 type+0.004
Neg logit contributions
at-0.005
ail-0.005
 onwards-0.004
 skating-0.004
ip-0.004
Token<|endoftext|>
Token ID50256
Feature activation-2.38e+00

Pos logit contributions
 kinds+0.005
 than+0.005
 sizes+0.005
 sorts+0.005
 type+0.005
Neg logit contributions
ail-0.005
at-0.005
 seeker-0.005
affle-0.005
ip-0.005
TokenOnce
Token ID7454
Feature activation-1.07e+01

Pos logit contributions
ail+0.051
at+0.049
affle+0.046
ogo+0.045
 seeker+0.045
Neg logit contributions
 kinds-0.048
 than-0.048
 sorts-0.045
 sizes-0.044
lier-0.042
Token upon
Token ID2402
Feature activation+1.65e-01

Pos logit contributions
at+0.217
atmeal+0.201
affle+0.199
 seeker+0.189
pee+0.189
Neg logit contributions
 than-0.227
 kinds-0.226
 sorts-0.206
 type-0.204
 sizes-0.196
Token a
Token ID257
Feature activation-3.23e+00

Pos logit contributions
 kinds+0.004
 sizes+0.004
 than+0.004
 sorts+0.004
 type+0.004
Neg logit contributions
at-0.003
 seeker-0.003
ail-0.003
affle-0.002
 skating-0.002
Token time
Token ID640
Feature activation-6.74e+00

Pos logit contributions
at+0.046
ail+0.044
affle+0.041
 seeker+0.039
ogo+0.037
Neg logit contributions
 kinds-0.090
 sizes-0.088
 than-0.085
 sorts-0.084
 textures-0.080
Token,
Token ID11
Feature activation-5.92e+00

Pos logit contributions
at+0.147
affle+0.137
ail+0.132
abella+0.129
atmeal+0.128
Neg logit contributions
 than-0.142
 kinds-0.130
 sizes-0.124
 type-0.120
 sorts-0.119
Token there
Token ID612
Feature activation-3.75e+00

Pos logit contributions
at+0.131
affle+0.125
atmeal+0.123
ail+0.119
ogo+0.114
Neg logit contributions
 than-0.122
 who-0.109
 kinds-0.108
 type-0.107
 sizes-0.104
Token happy
Token ID3772
Feature activation+2.98e+00

Pos logit contributions
at+0.032
ail+0.031
affle+0.029
 seeker+0.029
ogo+0.028
Neg logit contributions
 than-0.030
 kinds-0.030
 sizes-0.029
 sorts-0.027
 type-0.026
Token and
Token ID290
Feature activation-2.59e-01

Pos logit contributions
 kinds+0.060
 sizes+0.059
 than+0.058
 sorts+0.056
 type+0.054
Neg logit contributions
at-0.060
ail-0.060
 seeker-0.055
affle-0.054
 skating-0.053
Token content
Token ID2695
Feature activation+9.28e-01

Pos logit contributions
at+0.005
ail+0.005
affle+0.004
 seeker+0.004
ogo+0.004
Neg logit contributions
 kinds-0.006
 than-0.006
 sizes-0.006
 sorts-0.006
 type-0.005
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 than+0.014
 kinds+0.014
 sizes+0.013
 sorts+0.012
 type+0.012
Neg logit contributions
at-0.024
ail-0.024
affle-0.023
 seeker-0.022
ogo-0.022
Token<|endoftext|>
Token ID50256
Feature activation-2.45e+00

Pos logit contributions
at+0.022
ail+0.021
affle+0.020
 seeker+0.020
ogo+0.019
Neg logit contributions
 kinds-0.020
 than-0.020
 sizes-0.019
 sorts-0.019
 type-0.018
TokenOnce
Token ID7454
Feature activation-1.06e+01

Pos logit contributions
at+0.049
ail+0.049
affle+0.046
atmeal+0.045
unct+0.045
Neg logit contributions
 kinds-0.051
 than-0.050
 sorts-0.048
 sizes-0.044
lier-0.044
Token upon
Token ID2402
Feature activation-3.02e-02

Pos logit contributions
at+0.230
atmeal+0.218
affle+0.215
pee+0.203
ridge+0.201
Neg logit contributions
 kinds-0.211
 than-0.210
 sorts-0.194
 type-0.186
 who-0.185
Token a
Token ID257
Feature activation-3.04e+00

Pos logit contributions
at+0.001
ail+0.001
 seeker+0.001
affle+0.000
 skating+0.000
Neg logit contributions
 kinds-0.001
 than-0.001
 sizes-0.001
 sorts-0.001
 type-0.001
Token time
Token ID640
Feature activation-6.69e+00

Pos logit contributions
at+0.047
ail+0.045
affle+0.041
 seeker+0.040
ogo+0.038
Neg logit contributions
 kinds-0.081
 sizes-0.079
 than-0.078
 sorts-0.076
 textures-0.072
Token,
Token ID11
Feature activation-5.80e+00

Pos logit contributions
at+0.160
affle+0.151
atmeal+0.143
abella+0.140
ail+0.139
Neg logit contributions
 than-0.130
 kinds-0.113
 sizes-0.105
 type-0.105
ever-0.104
Token there
Token ID612
Feature activation-3.60e+00

Pos logit contributions
at+0.135
affle+0.129
atmeal+0.129
ail+0.120
ogo+0.116
Neg logit contributions
 than-0.114
 who-0.104
 type-0.098
 kinds-0.097
 sizes-0.093
Token and
Token ID290
Feature activation-8.35e-01

Pos logit contributions
 kinds+0.000
 than+0.000
 sizes+0.000
 sorts+0.000
 type+0.000
Neg logit contributions
ail-0.000
at-0.000
affle-0.000
 seeker-0.000
 skating-0.000
Token each
Token ID1123
Feature activation-1.79e+00

Pos logit contributions
ail+0.016
at+0.016
 seeker+0.015
affle+0.015
 skating+0.014
Neg logit contributions
 kinds-0.017
 than-0.017
 sizes-0.016
 sorts-0.015
lier-0.015
Token other
Token ID584
Feature activation-6.25e-01

Pos logit contributions
at+0.033
ail+0.032
affle+0.029
 seeker+0.029
ogo+0.028
Neg logit contributions
 than-0.041
 kinds-0.041
 sizes-0.040
 sorts-0.038
 type-0.037
Token.
Token ID13
Feature activation-7.02e-01

Pos logit contributions
ail+0.014
at+0.013
affle+0.012
 seeker+0.012
 skating+0.012
Neg logit contributions
 than-0.012
 kinds-0.012
 sizes-0.011
 sorts-0.011
 type-0.010
Token<|endoftext|>
Token ID50256
Feature activation-2.49e+00

Pos logit contributions
ail+0.015
at+0.015
affle+0.013
 seeker+0.013
ogo+0.013
Neg logit contributions
 kinds-0.014
 than-0.014
 sizes-0.013
 sorts-0.013
 type-0.013
TokenOnce
Token ID7454
Feature activation-1.06e+01

Pos logit contributions
ail+0.056
at+0.056
affle+0.053
atmeal+0.051
ogo+0.051
Neg logit contributions
 kinds-0.047
 than-0.046
 sorts-0.043
 sizes-0.040
lier-0.039
Token upon
Token ID2402
Feature activation-3.32e-02

Pos logit contributions
at+0.216
atmeal+0.203
affle+0.200
pee+0.191
 seeker+0.191
Neg logit contributions
 kinds-0.224
 than-0.223
 sorts-0.206
 type-0.198
 who-0.193
Token a
Token ID257
Feature activation-3.10e+00

Pos logit contributions
at+0.001
ail+0.001
 seeker+0.001
affle+0.000
 skating+0.000
Neg logit contributions
 kinds-0.001
 sizes-0.001
 than-0.001
 sorts-0.001
 type-0.001
Token time
Token ID640
Feature activation-6.55e+00

Pos logit contributions
at+0.045
ail+0.043
affle+0.040
 seeker+0.038
ogo+0.036
Neg logit contributions
 kinds-0.085
 sizes-0.083
 than-0.081
 sorts-0.080
 textures-0.075
Token,
Token ID11
Feature activation-5.74e+00

Pos logit contributions
at+0.154
affle+0.146
atmeal+0.139
abella+0.138
ail+0.136
Neg logit contributions
 than-0.127
 kinds-0.111
 sizes-0.104
 type-0.104
ever-0.102
Token a
Token ID257
Feature activation+2.10e-01

Pos logit contributions
at+0.130
atmeal+0.125
affle+0.124
ail+0.119
ogo+0.115
Neg logit contributions
 than-0.115
 who-0.103
 kinds-0.100
 type-0.099
 sizes-0.096
Token try
Token ID1949
Feature activation-8.43e-01

Pos logit contributions
ail+0.004
at+0.004
 seeker+0.004
 skating+0.003
affle+0.003
Neg logit contributions
 kinds-0.004
 than-0.004
 sizes-0.004
 sorts-0.004
 unison-0.004
Token new
Token ID649
Feature activation+2.68e+00

Pos logit contributions
at+0.017
ail+0.017
affle+0.016
 seeker+0.015
ogo+0.015
Neg logit contributions
 than-0.018
 kinds-0.018
 sizes-0.017
 sorts-0.016
 type-0.016
Token things
Token ID1243
Feature activation-8.83e-01

Pos logit contributions
 kinds+0.063
 sizes+0.062
 sorts+0.060
 than+0.060
 unison+0.060
Neg logit contributions
ail-0.040
at-0.040
 seeker-0.037
affle-0.037
 skating-0.036
Token.
Token ID13
Feature activation-8.00e-01

Pos logit contributions
ail+0.018
at+0.018
ip+0.016
 skating+0.016
 seeker+0.016
Neg logit contributions
 kinds-0.018
 sorts-0.017
 sizes-0.017
 unison-0.016
 than-0.016
Token<|endoftext|>
Token ID50256
Feature activation-2.53e+00

Pos logit contributions
ail+0.013
at+0.013
affle+0.012
ip+0.012
 seeker+0.012
Neg logit contributions
 kinds-0.019
 than-0.019
 sizes-0.019
 type-0.018
 textures-0.017
TokenOnce
Token ID7454
Feature activation-1.06e+01

Pos logit contributions
at+0.060
ail+0.059
affle+0.056
 seeker+0.055
ogo+0.054
Neg logit contributions
 kinds-0.045
 than-0.045
 sorts-0.041
 sizes-0.041
 type-0.038
Token upon
Token ID2402
Feature activation-7.60e-02

Pos logit contributions
at+0.187
affle+0.167
atmeal+0.165
 seeker+0.162
pee+0.160
Neg logit contributions
 kinds-0.257
 than-0.253
 sorts-0.240
 type-0.231
 sizes-0.230
Token a
Token ID257
Feature activation-3.29e+00

Pos logit contributions
at+0.001
 seeker+0.001
ail+0.001
affle+0.001
 skating+0.001
Neg logit contributions
 kinds-0.002
 sizes-0.002
 than-0.002
 sorts-0.002
 type-0.002
Token time
Token ID640
Feature activation-6.55e+00

Pos logit contributions
at+0.048
ail+0.046
affle+0.043
 seeker+0.041
ip+0.039
Neg logit contributions
 kinds-0.090
 sizes-0.089
 than-0.087
 sorts-0.084
 textures-0.080
Token,
Token ID11
Feature activation-5.85e+00

Pos logit contributions
at+0.143
affle+0.132
ail+0.129
abella+0.126
atmeal+0.124
Neg logit contributions
 than-0.136
 kinds-0.127
 sizes-0.118
 type-0.117
 sorts-0.116
Token there
Token ID612
Feature activation-3.60e+00

Pos logit contributions
at+0.130
affle+0.122
atmeal+0.122
ail+0.119
ogo+0.115
Neg logit contributions
 than-0.119
 who-0.106
 kinds-0.106
 type-0.105
 sizes-0.099
Token with
Token ID351
Feature activation-3.07e-03

Pos logit contributions
at+0.036
ail+0.036
affle+0.033
 seeker+0.033
ogo+0.033
Neg logit contributions
 than-0.023
 kinds-0.023
 sizes-0.021
 sorts-0.021
 type-0.020
Token the
Token ID262
Feature activation-4.64e-01

Pos logit contributions
at+0.000
ail+0.000
affle+0.000
 seeker+0.000
ogo+0.000
Neg logit contributions
 kinds-0.000
 than-0.000
 sizes-0.000
 sorts-0.000
 type-0.000
Token kids
Token ID3988
Feature activation-1.40e-01

Pos logit contributions
at+0.010
ail+0.010
affle+0.009
 seeker+0.009
ogo+0.009
Neg logit contributions
 kinds-0.009
 than-0.009
 sizes-0.009
 sorts-0.008
 type-0.008
Token.
Token ID13
Feature activation-9.86e-01

Pos logit contributions
at+0.003
ail+0.003
affle+0.003
 seeker+0.003
ogo+0.003
Neg logit contributions
 kinds-0.003
 than-0.003
 sizes-0.003
 sorts-0.002
 type-0.002
Token<|endoftext|>
Token ID50256
Feature activation-2.37e+00

Pos logit contributions
at+0.021
ail+0.021
affle+0.020
 seeker+0.019
ogo+0.019
Neg logit contributions
 kinds-0.019
 than-0.019
 sizes-0.018
 sorts-0.018
 type-0.017
TokenOnce
Token ID7454
Feature activation-1.06e+01

Pos logit contributions
at+0.059
ail+0.058
affle+0.055
 seeker+0.054
atmeal+0.053
Neg logit contributions
 kinds-0.039
 than-0.039
 sorts-0.035
 sizes-0.034
lier-0.031
Token upon
Token ID2402
Feature activation+1.01e-01

Pos logit contributions
at+0.194
affle+0.176
atmeal+0.176
 seeker+0.170
pee+0.166
Neg logit contributions
 kinds-0.249
 than-0.245
 sorts-0.233
 type-0.223
 sizes-0.219
Token a
Token ID257
Feature activation-3.27e+00

Pos logit contributions
 kinds+0.002
 sizes+0.002
 than+0.002
 sorts+0.002
 type+0.002
Neg logit contributions
at-0.002
ail-0.002
 seeker-0.002
affle-0.001
 skating-0.001
Token time
Token ID640
Feature activation-6.38e+00

Pos logit contributions
at+0.050
ail+0.048
affle+0.044
 seeker+0.043
ogo+0.041
Neg logit contributions
 kinds-0.087
 sizes-0.085
 than-0.084
 sorts-0.081
 textures-0.077
Token,
Token ID11
Feature activation-5.73e+00

Pos logit contributions
at+0.140
affle+0.131
ail+0.125
atmeal+0.123
abella+0.122
Neg logit contributions
 than-0.133
 kinds-0.122
 sizes-0.114
 type-0.112
 sorts-0.112
Token there
Token ID612
Feature activation-3.56e+00

Pos logit contributions
at+0.127
affle+0.120
atmeal+0.119
ail+0.117
ogo+0.113
Neg logit contributions
 than-0.117
 kinds-0.104
 who-0.103
 type-0.103
 sizes-0.098
Token happy
Token ID3772
Feature activation+3.24e+00

Pos logit contributions
at+0.019
ail+0.019
affle+0.017
 seeker+0.017
ogo+0.016
Neg logit contributions
 than-0.020
 kinds-0.020
 sizes-0.019
 sorts-0.018
 type-0.018
Token and
Token ID290
Feature activation-6.42e-01

Pos logit contributions
 kinds+0.065
 sizes+0.063
 than+0.062
 sorts+0.060
 type+0.058
Neg logit contributions
ail-0.069
at-0.068
 seeker-0.062
affle-0.062
 skating-0.060
Token tired
Token ID10032
Feature activation+2.42e+00

Pos logit contributions
at+0.012
ail+0.012
affle+0.011
 seeker+0.011
ogo+0.011
Neg logit contributions
 than-0.014
 kinds-0.014
 sizes-0.014
 sorts-0.013
 type-0.013
Token.
Token ID13
Feature activation-6.88e-01

Pos logit contributions
 kinds+0.046
 sizes+0.043
 sorts+0.042
 than+0.042
 type+0.040
Neg logit contributions
at-0.053
ail-0.052
 seeker-0.049
affle-0.047
 skating-0.047
Token<|endoftext|>
Token ID50256
Feature activation-2.34e+00

Pos logit contributions
ail+0.014
at+0.014
affle+0.013
 seeker+0.013
abella+0.013
Neg logit contributions
 than-0.014
 kinds-0.014
 sizes-0.013
 sorts-0.013
 type-0.012
TokenOnce
Token ID7454
Feature activation-1.06e+01

Pos logit contributions
at+0.059
ail+0.058
affle+0.057
atmeal+0.056
unct+0.055
Neg logit contributions
 kinds-0.036
 than-0.036
 sorts-0.033
 sizes-0.030
lier-0.029
Token upon
Token ID2402
Feature activation-3.26e-02

Pos logit contributions
at+0.207
atmeal+0.192
affle+0.191
 seeker+0.180
pee+0.180
Neg logit contributions
 kinds-0.233
 than-0.230
 sorts-0.217
 type-0.209
 sizes-0.202
Token a
Token ID257
Feature activation-2.97e+00

Pos logit contributions
at+0.001
ail+0.001
 seeker+0.001
affle+0.000
 skating+0.000
Neg logit contributions
 kinds-0.001
 sizes-0.001
 than-0.001
 sorts-0.001
 type-0.001
Token time
Token ID640
Feature activation-6.40e+00

Pos logit contributions
at+0.045
ail+0.043
affle+0.040
 seeker+0.038
ogo+0.036
Neg logit contributions
 kinds-0.080
 sizes-0.078
 than-0.077
 sorts-0.075
 textures-0.071
Token,
Token ID11
Feature activation-5.71e+00

Pos logit contributions
at+0.150
affle+0.142
atmeal+0.135
abella+0.133
ail+0.132
Neg logit contributions
 than-0.124
 kinds-0.110
 type-0.102
 sizes-0.102
ever-0.101
Token there
Token ID612
Feature activation-3.67e+00

Pos logit contributions
at+0.128
atmeal+0.121
affle+0.121
ail+0.115
ogo+0.111
Neg logit contributions
 than-0.117
 who-0.104
 kinds-0.103
 type-0.103
 sizes-0.098
Token on
Token ID319
Feature activation-3.13e-01

Pos logit contributions
 kinds+0.017
 sizes+0.016
 than+0.016
 sorts+0.015
 type+0.015
Neg logit contributions
at-0.014
ail-0.014
 seeker-0.013
affle-0.012
 skating-0.012
Token their
Token ID511
Feature activation-4.08e+00

Pos logit contributions
ail+0.006
at+0.005
 onwards+0.005
affle+0.005
 onward+0.005
Neg logit contributions
 kinds-0.007
 sizes-0.007
 sorts-0.006
 than-0.006
ever-0.006
Token faces
Token ID6698
Feature activation-1.34e+00

Pos logit contributions
ail+0.101
at+0.100
affle+0.094
 seeker+0.091
ogo+0.089
Neg logit contributions
 kinds-0.067
 than-0.066
 sizes-0.063
 sorts-0.061
 type-0.057
Token.
Token ID13
Feature activation-4.71e-01

Pos logit contributions
ail+0.030
at+0.030
 seeker+0.027
affle+0.026
ip+0.026
Neg logit contributions
 kinds-0.025
 sizes-0.023
 than-0.023
 sorts-0.023
 type-0.022
Token<|endoftext|>
Token ID50256
Feature activation-2.40e+00

Pos logit contributions
ail+0.010
at+0.010
affle+0.009
 seeker+0.009
ogo+0.009
Neg logit contributions
 kinds-0.009
 than-0.009
 sizes-0.009
 sorts-0.009
 type-0.008
TokenOnce
Token ID7454
Feature activation-1.06e+01

Pos logit contributions
ail+0.054
at+0.054
affle+0.051
atmeal+0.049
belt+0.049
Neg logit contributions
 kinds-0.045
 than-0.045
 sorts-0.042
 sizes-0.039
lier-0.037
Token upon
Token ID2402
Feature activation+7.02e-02

Pos logit contributions
at+0.220
atmeal+0.209
affle+0.204
pee+0.196
ridge+0.194
Neg logit contributions
 kinds-0.219
 than-0.217
 sorts-0.202
 type-0.196
 who-0.188
Token a
Token ID257
Feature activation-3.18e+00

Pos logit contributions
 kinds+0.002
 sizes+0.002
 than+0.002
 sorts+0.002
 type+0.001
Neg logit contributions
at-0.001
 seeker-0.001
ail-0.001
affle-0.001
 skating-0.001
Token time
Token ID640
Feature activation-6.71e+00

Pos logit contributions
at+0.047
ail+0.044
affle+0.041
 seeker+0.040
ip+0.037
Neg logit contributions
 kinds-0.087
 sizes-0.085
 than-0.083
 sorts-0.082
 textures-0.077
Token there
Token ID612
Feature activation-4.38e+00

Pos logit contributions
at+0.158
affle+0.150
atmeal+0.143
abella+0.142
ail+0.139
Neg logit contributions
 than-0.130
 kinds-0.113
 type-0.106
 sizes-0.105
ever-0.103
Token was
Token ID373
Feature activation-1.85e+00

Pos logit contributions
at+0.102
ail+0.098
affle+0.096
 seeker+0.092
ogo+0.090
Neg logit contributions
 kinds-0.081
 than-0.080
 sizes-0.075
 sorts-0.074
 type-0.071
Token and
Token ID290
Feature activation-3.44e-01

Pos logit contributions
 kinds+0.073
 sizes+0.072
 unison+0.069
 sorts+0.068
 type+0.067
Neg logit contributions
ail-0.056
 skating-0.050
at-0.050
 back-0.049
 seeker-0.048
Token safe
Token ID3338
Feature activation+1.77e+00

Pos logit contributions
ail+0.007
at+0.007
 seeker+0.006
affle+0.006
ogo+0.006
Neg logit contributions
 kinds-0.007
 than-0.007
 sizes-0.007
 sorts-0.006
 type-0.006
Token again
Token ID757
Feature activation-1.93e-01

Pos logit contributions
 kinds+0.032
 than+0.031
 sizes+0.031
 sorts+0.030
 type+0.028
Neg logit contributions
at-0.040
ail-0.040
 seeker-0.037
affle-0.036
 skating-0.035
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
ail+0.004
at+0.004
 seeker+0.004
ip+0.004
 skating+0.004
Neg logit contributions
 kinds-0.003
 sizes-0.003
 sorts-0.003
 than-0.003
 type-0.003
Token<|endoftext|>
Token ID50256
Feature activation-2.20e+00

Pos logit contributions
ail+0.005
at+0.005
 seeker+0.005
affle+0.005
abella+0.005
Neg logit contributions
 kinds-0.006
 than-0.006
 sizes-0.006
 type-0.005
 sorts-0.005
TokenOnce
Token ID7454
Feature activation-1.06e+01

Pos logit contributions
at+0.055
ail+0.055
affle+0.052
ogo+0.050
 seeker+0.050
Neg logit contributions
 than-0.035
 kinds-0.035
 sorts-0.032
 sizes-0.031
lier-0.030
Token upon
Token ID2402
Feature activation+7.38e-02

Pos logit contributions
at+0.206
atmeal+0.191
affle+0.188
pee+0.180
 seeker+0.179
Neg logit contributions
 kinds-0.236
 than-0.234
 sorts-0.217
 type-0.211
 sizes-0.205
Token a
Token ID257
Feature activation-3.42e+00

Pos logit contributions
 kinds+0.002
 sizes+0.002
 than+0.002
 sorts+0.002
 type+0.002
Neg logit contributions
at-0.001
ail-0.001
 seeker-0.001
affle-0.001
pee-0.001
Token time
Token ID640
Feature activation-6.44e+00

Pos logit contributions
at+0.054
ail+0.052
affle+0.047
 seeker+0.046
ogo+0.044
Neg logit contributions
 kinds-0.090
 sizes-0.088
 than-0.087
 sorts-0.084
 type-0.079
Token,
Token ID11
Feature activation-5.71e+00

Pos logit contributions
at+0.144
affle+0.134
ail+0.130
abella+0.128
atmeal+0.127
Neg logit contributions
 than-0.132
 kinds-0.120
 sizes-0.112
 type-0.110
 sorts-0.109
Token there
Token ID612
Feature activation-3.68e+00

Pos logit contributions
at+0.127
affle+0.120
atmeal+0.119
ail+0.117
ogo+0.112
Neg logit contributions
 than-0.117
 kinds-0.105
 type-0.103
 who-0.103
 sizes-0.099
Token have
Token ID423
Feature activation-9.43e-01

Pos logit contributions
ail+0.013
 done+0.012
at+0.012
 seeker+0.012
 tired+0.012
Neg logit contributions
 kinds-0.013
lier-0.012
 sizes-0.012
 than-0.012
 sorts-0.012
Token good
Token ID922
Feature activation-1.75e-01

Pos logit contributions
ail+0.015
 done+0.014
 back+0.014
 refreshed+0.013
at+0.013
Neg logit contributions
 than-0.021
 kinds-0.020
 sizes-0.020
 sorts-0.019
izing-0.019
Token friends
Token ID2460
Feature activation-3.69e-02

Pos logit contributions
ail+0.003
at+0.003
 seeker+0.003
affle+0.003
 skating+0.003
Neg logit contributions
 kinds-0.004
 sizes-0.004
 sorts-0.003
 than-0.003
 type-0.003
Token.
Token ID13
Feature activation-3.92e-01

Pos logit contributions
at+0.001
ail+0.001
 seeker+0.001
affle+0.001
ip+0.001
Neg logit contributions
 kinds-0.001
 sizes-0.001
 sorts-0.001
 than-0.001
 textures-0.001
Token<|endoftext|>
Token ID50256
Feature activation-2.07e+00

Pos logit contributions
ail+0.006
at+0.006
 seeker+0.006
abella+0.006
ip+0.006
Neg logit contributions
 kinds-0.009
 than-0.009
 sizes-0.009
 type-0.009
 textures-0.009
TokenOnce
Token ID7454
Feature activation-1.05e+01

Pos logit contributions
ail+0.041
at+0.040
affle+0.038
 seeker+0.036
ogo+0.036
Neg logit contributions
 kinds-0.045
 than-0.044
 sorts-0.042
 sizes-0.040
lier-0.039
Token upon
Token ID2402
Feature activation+7.92e-02

Pos logit contributions
at+0.198
atmeal+0.182
affle+0.178
pee+0.174
 seeker+0.174
Neg logit contributions
 kinds-0.245
 than-0.238
 sorts-0.226
 type-0.220
 sizes-0.212
Token a
Token ID257
Feature activation-3.32e+00

Pos logit contributions
 kinds+0.002
 sizes+0.002
 than+0.002
 sorts+0.002
 type+0.002
Neg logit contributions
at-0.001
ail-0.001
 seeker-0.001
affle-0.001
 skating-0.001
Token time
Token ID640
Feature activation-6.45e+00

Pos logit contributions
at+0.049
ail+0.047
affle+0.043
 seeker+0.042
ip+0.039
Neg logit contributions
 kinds-0.090
 sizes-0.089
 than-0.087
 sorts-0.084
 textures-0.080
Token,
Token ID11
Feature activation-5.85e+00

Pos logit contributions
at+0.141
affle+0.131
ail+0.126
abella+0.123
atmeal+0.123
Neg logit contributions
 than-0.135
 kinds-0.125
 sizes-0.118
 type-0.115
 sorts-0.115
Token there
Token ID612
Feature activation-3.54e+00

Pos logit contributions
at+0.130
affle+0.123
atmeal+0.123
ail+0.119
ogo+0.115
Neg logit contributions
 than-0.119
 who-0.107
 kinds-0.107
 type-0.105
 sizes-0.101
Token to
Token ID284
Feature activation-4.32e-01

Pos logit contributions
at+0.005
ail+0.004
affle+0.004
 seeker+0.004
ogo+0.004
Neg logit contributions
 than-0.004
 kinds-0.004
 sizes-0.004
 sorts-0.003
 type-0.003
Token their
Token ID511
Feature activation-2.34e+00

Pos logit contributions
ail+0.010
at+0.010
 seeker+0.009
affle+0.009
ogo+0.009
Neg logit contributions
 kinds-0.007
 than-0.007
 sizes-0.007
 sorts-0.007
lier-0.006
Token family
Token ID1641
Feature activation-5.31e-02

Pos logit contributions
at+0.044
ail+0.043
affle+0.039
 seeker+0.039
ogo+0.037
Neg logit contributions
 than-0.053
 kinds-0.053
 sizes-0.051
 sorts-0.049
 type-0.048
Token.
Token ID13
Feature activation+1.18e-01

Pos logit contributions
ail+0.001
at+0.001
 skating+0.001
 seeker+0.001
ip+0.001
Neg logit contributions
 kinds-0.001
 sizes-0.001
 than-0.001
 sorts-0.001
 type-0.001
Token<|endoftext|>
Token ID50256
Feature activation-2.31e+00

Pos logit contributions
 kinds+0.002
 than+0.002
 sizes+0.002
 sorts+0.002
 type+0.002
Neg logit contributions
ail-0.003
at-0.003
 seeker-0.002
abella-0.002
ogo-0.002
TokenOnce
Token ID7454
Feature activation-1.05e+01

Pos logit contributions
ail+0.043
at+0.043
affle+0.039
ogo+0.038
 seeker+0.038
Neg logit contributions
 kinds-0.052
 than-0.052
 sorts-0.049
 sizes-0.048
lier-0.046
Token upon
Token ID2402
Feature activation+6.13e-02

Pos logit contributions
at+0.191
affle+0.171
atmeal+0.170
pee+0.165
 seeker+0.164
Neg logit contributions
 kinds-0.253
 than-0.249
 sorts-0.234
 type-0.227
 sizes-0.225
Token a
Token ID257
Feature activation-3.33e+00

Pos logit contributions
 kinds+0.001
 sizes+0.001
 than+0.001
 sorts+0.001
 type+0.001
Neg logit contributions
at-0.001
 seeker-0.001
ail-0.001
affle-0.001
 skating-0.001
Token time
Token ID640
Feature activation-6.56e+00

Pos logit contributions
at+0.047
ail+0.045
affle+0.041
 seeker+0.040
ip+0.038
Neg logit contributions
 kinds-0.094
 sizes-0.092
 than-0.089
 sorts-0.088
 textures-0.083
Token,
Token ID11
Feature activation-5.68e+00

Pos logit contributions
at+0.146
affle+0.136
ail+0.131
abella+0.129
atmeal+0.127
Neg logit contributions
 than-0.135
 kinds-0.123
 sizes-0.115
 type-0.114
 sorts-0.111
Token there
Token ID612
Feature activation-3.64e+00

Pos logit contributions
at+0.125
affle+0.117
atmeal+0.116
ail+0.115
ogo+0.110
Neg logit contributions
 than-0.118
 kinds-0.107
 type-0.104
 sizes-0.101
 who-0.101
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
at+0.012
ail+0.012
affle+0.011
 seeker+0.011
 skating+0.010
Neg logit contributions
 kinds-0.010
 than-0.010
 sizes-0.009
 sorts-0.009
 type-0.009
Token my
Token ID616
Feature activation-1.33e+00

Pos logit contributions
at+0.024
ail+0.023
affle+0.021
 seeker+0.021
ogo+0.021
Neg logit contributions
 kinds-0.022
 than-0.022
 sizes-0.021
 sorts-0.020
 type-0.019
Token boy
Token ID2933
Feature activation+1.25e+00

Pos logit contributions
at+0.028
ail+0.027
affle+0.025
 seeker+0.025
ogo+0.024
Neg logit contributions
 than-0.027
 kinds-0.027
 sizes-0.026
 sorts-0.025
 type-0.024
Token!
Token ID0
Feature activation+3.91e-01

Pos logit contributions
 kinds+0.026
 than+0.026
 sizes+0.025
 sorts+0.024
 type+0.023
Neg logit contributions
at-0.026
ail-0.026
affle-0.024
 seeker-0.023
ogo-0.023
Token<|endoftext|>
Token ID50256
Feature activation-2.71e+00

Pos logit contributions
 kinds+0.007
 than+0.007
 sizes+0.006
 sorts+0.006
 type+0.006
Neg logit contributions
at-0.010
ail-0.009
affle-0.009
 seeker-0.009
ogo-0.008
TokenOnce
Token ID7454
Feature activation-1.05e+01

Pos logit contributions
ail+0.053
at+0.053
affle+0.050
belt+0.049
atmeal+0.048
Neg logit contributions
 kinds-0.057
 than-0.056
 sorts-0.054
 sizes-0.050
lier-0.050
Token upon
Token ID2402
Feature activation+3.52e-01

Pos logit contributions
at+0.224
affle+0.212
atmeal+0.210
pee+0.199
 seeker+0.193
Neg logit contributions
 kinds-0.208
 than-0.207
 sorts-0.194
 who-0.182
 type-0.181
Token a
Token ID257
Feature activation-2.94e+00

Pos logit contributions
 kinds+0.008
 than+0.008
 sizes+0.008
 sorts+0.008
 type+0.007
Neg logit contributions
at-0.006
ail-0.006
 seeker-0.006
affle-0.005
pee-0.005
Token time
Token ID640
Feature activation-6.55e+00

Pos logit contributions
at+0.044
ail+0.042
affle+0.039
 seeker+0.038
ogo+0.036
Neg logit contributions
 kinds-0.080
 sizes-0.078
 than-0.076
 sorts-0.074
 textures-0.071
Token,
Token ID11
Feature activation-5.43e+00

Pos logit contributions
at+0.161
affle+0.153
atmeal+0.145
abella+0.142
ail+0.139
Neg logit contributions
 than-0.123
 kinds-0.105
 type-0.098
ever-0.098
 sizes-0.098
Token a
Token ID257
Feature activation+3.88e-01

Pos logit contributions
at+0.126
affle+0.121
atmeal+0.120
ail+0.114
ogo+0.110
Neg logit contributions
 than-0.106
 who-0.099
 kinds-0.092
 type-0.092
 sizes-0.088
Token the
Token ID262
Feature activation-3.93e-01

Pos logit contributions
at+0.035
ail+0.034
 seeker+0.031
affle+0.031
abella+0.031
Neg logit contributions
 than-0.039
 kinds-0.038
 sizes-0.037
 sorts-0.036
 type-0.035
Token sparkling
Token ID39072
Feature activation+3.24e-01

Pos logit contributions
at+0.006
ail+0.006
affle+0.005
ogo+0.005
abella+0.005
Neg logit contributions
 kinds-0.010
 than-0.010
 sizes-0.010
 sorts-0.010
 type-0.010
Token stone
Token ID7815
Feature activation-1.56e+00

Pos logit contributions
 kinds+0.007
 than+0.007
 sizes+0.007
 sorts+0.006
 type+0.006
Neg logit contributions
at-0.006
ail-0.006
affle-0.006
 seeker-0.006
ogo-0.006
Token.
Token ID13
Feature activation-9.22e-01

Pos logit contributions
ail+0.032
at+0.032
 seeker+0.028
affle+0.028
ip+0.027
Neg logit contributions
 kinds-0.032
 sizes-0.031
 than-0.030
 sorts-0.030
 type-0.028
Token<|endoftext|>
Token ID50256
Feature activation-2.04e+00

Pos logit contributions
ail+0.019
at+0.019
 seeker+0.018
affle+0.017
ogo+0.017
Neg logit contributions
 kinds-0.019
 than-0.018
 sizes-0.017
 sorts-0.017
 type-0.017
TokenOnce
Token ID7454
Feature activation-1.05e+01

Pos logit contributions
ail+0.031
at+0.031
affle+0.029
ogo+0.027
 seeker+0.027
Neg logit contributions
 kinds-0.053
 than-0.052
 sorts-0.050
 sizes-0.048
lier-0.047
Token upon
Token ID2402
Feature activation-5.40e-02

Pos logit contributions
at+0.175
affle+0.158
atmeal+0.152
 seeker+0.148
pee+0.147
Neg logit contributions
 than-0.266
 kinds-0.266
 sorts-0.250
 sizes-0.242
 type-0.239
Token a
Token ID257
Feature activation-3.19e+00

Pos logit contributions
at+0.001
 seeker+0.001
ail+0.001
affle+0.001
 skating+0.001
Neg logit contributions
 kinds-0.001
 sizes-0.001
 than-0.001
 sorts-0.001
 type-0.001
Token time
Token ID640
Feature activation-6.35e+00

Pos logit contributions
at+0.046
ail+0.044
affle+0.040
 seeker+0.039
ogo+0.036
Neg logit contributions
 kinds-0.088
 sizes-0.086
 than-0.084
 sorts-0.082
 textures-0.078
Token there
Token ID612
Feature activation-3.93e+00

Pos logit contributions
at+0.148
affle+0.139
atmeal+0.133
ail+0.130
abella+0.130
Neg logit contributions
 than-0.124
 kinds-0.111
 type-0.104
 sizes-0.102
 sorts-0.102
Token was
Token ID373
Feature activation-1.21e+00

Pos logit contributions
at+0.088
ail+0.085
affle+0.081
 seeker+0.079
ogo+0.077
Neg logit contributions
 kinds-0.076
 than-0.075
 sizes-0.071
 sorts-0.070
 type-0.067
Token they
Token ID484
Feature activation-2.67e-01

Pos logit contributions
ail+0.001
at+0.001
affle+0.001
 seeker+0.001
ip+0.001
Neg logit contributions
 kinds-0.001
 than-0.001
 sizes-0.001
 sorts-0.001
 type-0.001
Token both
Token ID1111
Feature activation-3.97e-01

Pos logit contributions
at+0.005
ail+0.005
affle+0.005
 seeker+0.005
ogo+0.005
Neg logit contributions
 kinds-0.006
 than-0.006
 sizes-0.005
 sorts-0.005
 type-0.005
Token smiled
Token ID13541
Feature activation+1.86e+00

Pos logit contributions
at+0.008
ail+0.008
affle+0.007
 seeker+0.007
ip+0.007
Neg logit contributions
 than-0.009
 kinds-0.008
 sizes-0.008
 sorts-0.008
 type-0.008
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
 kinds+0.039
 sizes+0.037
 sorts+0.036
 than+0.035
 type+0.035
Neg logit contributions
ail-0.036
at-0.034
 seeker-0.032
 onwards-0.032
 back-0.032
Token<|endoftext|>
Token ID50256
Feature activation-2.34e+00

Pos logit contributions
ail+0.003
at+0.003
 seeker+0.003
affle+0.003
abella+0.003
Neg logit contributions
 kinds-0.003
 than-0.003
 sizes-0.003
 sorts-0.002
 type-0.002
TokenOnce
Token ID7454
Feature activation-1.05e+01

Pos logit contributions
at+0.043
ail+0.042
affle+0.040
 seeker+0.038
atmeal+0.038
Neg logit contributions
 kinds-0.054
 than-0.053
 sorts-0.050
 sizes-0.049
lier-0.047
Token upon
Token ID2402
Feature activation+3.05e-02

Pos logit contributions
at+0.202
affle+0.187
atmeal+0.187
 seeker+0.176
pee+0.173
Neg logit contributions
 kinds-0.239
 than-0.235
 sorts-0.222
 type-0.214
 sizes-0.210
Token a
Token ID257
Feature activation-3.15e+00

Pos logit contributions
 kinds+0.001
 sizes+0.001
 than+0.001
 sorts+0.001
 type+0.001
Neg logit contributions
at-0.000
 seeker-0.000
ail-0.000
affle-0.000
 skating-0.000
Token time
Token ID640
Feature activation-6.67e+00

Pos logit contributions
at+0.042
ail+0.040
affle+0.037
 seeker+0.035
ip+0.033
Neg logit contributions
 kinds-0.091
 sizes-0.088
 than-0.085
 sorts-0.085
 textures-0.081
Token,
Token ID11
Feature activation-5.83e+00

Pos logit contributions
at+0.156
affle+0.146
atmeal+0.138
ail+0.136
abella+0.134
Neg logit contributions
 than-0.133
 kinds-0.118
 sizes-0.112
 type-0.110
 sorts-0.108
Token there
Token ID612
Feature activation-3.70e+00

Pos logit contributions
at+0.133
affle+0.127
atmeal+0.126
ail+0.119
ogo+0.116
Neg logit contributions
 than-0.118
 who-0.106
 type-0.102
 kinds-0.102
 sizes-0.098
Token kind
Token ID1611
Feature activation+5.74e+00

Pos logit contributions
 sizes+0.017
 kinds+0.017
 than+0.017
 sorts+0.016
 type+0.015
Neg logit contributions
at-0.011
ail-0.010
 seeker-0.010
affle-0.010
 skating-0.010
Token and
Token ID290
Feature activation+4.34e-01

Pos logit contributions
 sizes+0.106
 unison+0.103
 than+0.101
 sorts+0.099
 kinds+0.099
Neg logit contributions
ail-0.112
at-0.104
ipop-0.099
affle-0.098
 back-0.098
Token helpful
Token ID7613
Feature activation+3.59e+00

Pos logit contributions
 than+0.008
 kinds+0.008
 sizes+0.008
 sorts+0.008
lier+0.008
Neg logit contributions
ail-0.009
at-0.008
 seeker-0.008
affle-0.008
 onwards-0.008
Token.
Token ID13
Feature activation+4.77e-03

Pos logit contributions
 than+0.067
 sizes+0.066
 kinds+0.065
 sorts+0.064
 unison+0.060
Neg logit contributions
ail-0.069
at-0.062
 morning-0.061
 seeker-0.060
 home-0.060
Token<|endoftext|>
Token ID50256
Feature activation-2.61e+00

Pos logit contributions
 kinds+0.000
 than+0.000
 sizes+0.000
 sorts+0.000
 type+0.000
Neg logit contributions
ail-0.000
at-0.000
 seeker-0.000
affle-0.000
ip-0.000
TokenOnce
Token ID7454
Feature activation-1.05e+01

Pos logit contributions
ail+0.061
at+0.061
affle+0.058
belt+0.057
atmeal+0.056
Neg logit contributions
 kinds-0.045
 than-0.045
 sorts-0.042
 sizes-0.038
lier-0.037
Token upon
Token ID2402
Feature activation+9.03e-02

Pos logit contributions
at+0.221
atmeal+0.208
affle+0.205
pee+0.196
 seeker+0.194
Neg logit contributions
 than-0.211
 kinds-0.211
 sorts-0.194
 type-0.188
 who-0.187
Token a
Token ID257
Feature activation-2.94e+00

Pos logit contributions
 kinds+0.002
 sizes+0.002
 than+0.002
 sorts+0.002
 type+0.002
Neg logit contributions
at-0.002
ail-0.002
 seeker-0.001
affle-0.001
 skating-0.001
Token time
Token ID640
Feature activation-6.62e+00

Pos logit contributions
at+0.044
ail+0.042
affle+0.039
 seeker+0.038
ogo+0.036
Neg logit contributions
 kinds-0.080
 sizes-0.078
 than-0.076
 sorts-0.075
 textures-0.070
Token,
Token ID11
Feature activation-5.66e+00

Pos logit contributions
at+0.155
affle+0.147
atmeal+0.139
abella+0.139
ail+0.137
Neg logit contributions
 than-0.129
 kinds-0.112
 sizes-0.105
 type-0.104
ever-0.104
Token there
Token ID612
Feature activation-3.56e+00

Pos logit contributions
at+0.127
affle+0.123
atmeal+0.122
ail+0.116
ogo+0.112
Neg logit contributions
 than-0.113
 who-0.102
 kinds-0.098
 type-0.098
 sizes-0.094
Token passed
Token ID3804
Feature activation+4.31e-01

Pos logit contributions
at+0.031
ail+0.031
 seeker+0.027
affle+0.027
 skating+0.026
Neg logit contributions
 kinds-0.035
 sizes-0.035
 than-0.034
 sorts-0.033
 type-0.031
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
 kinds+0.009
 than+0.008
 sizes+0.008
 sorts+0.008
 type+0.008
Neg logit contributions
at-0.009
ail-0.009
 seeker-0.008
affle-0.008
ogo-0.008
Token bridge
Token ID7696
Feature activation-1.39e+00

Pos logit contributions
at+0.024
ail+0.024
affle+0.022
 seeker+0.021
ogo+0.021
Neg logit contributions
 than-0.025
 kinds-0.025
 sizes-0.024
 sorts-0.023
 type-0.022
Token.
Token ID13
Feature activation-9.18e-01

Pos logit contributions
at+0.030
ail+0.030
 seeker+0.026
 skating+0.026
affle+0.026
Neg logit contributions
 kinds-0.027
 sizes-0.026
 than-0.025
 sorts-0.024
 type-0.024
Token<|endoftext|>
Token ID50256
Feature activation-2.31e+00

Pos logit contributions
at+0.019
ail+0.019
affle+0.017
 seeker+0.017
ogo+0.016
Neg logit contributions
 kinds-0.019
 than-0.019
 sizes-0.018
 sorts-0.017
 type-0.017
TokenOnce
Token ID7454
Feature activation-1.05e+01

Pos logit contributions
at+0.053
ail+0.053
affle+0.051
atmeal+0.049
unct+0.049
Neg logit contributions
 kinds-0.041
 than-0.041
 sorts-0.038
 sizes-0.035
lier-0.035
Token upon
Token ID2402
Feature activation+8.77e-02

Pos logit contributions
at+0.222
atmeal+0.210
affle+0.207
 seeker+0.196
pee+0.195
Neg logit contributions
 kinds-0.213
 than-0.210
 sorts-0.197
 type-0.189
 who-0.188
Token a
Token ID257
Feature activation-3.25e+00

Pos logit contributions
 kinds+0.002
 sizes+0.002
 than+0.002
 sorts+0.002
 type+0.002
Neg logit contributions
at-0.002
ail-0.001
 seeker-0.001
affle-0.001
 skating-0.001
Token time
Token ID640
Feature activation-6.89e+00

Pos logit contributions
at+0.051
ail+0.050
affle+0.046
 seeker+0.044
ogo+0.042
Neg logit contributions
 kinds-0.085
 sizes-0.083
 than-0.083
 sorts-0.079
 type-0.075
Token,
Token ID11
Feature activation-5.88e+00

Pos logit contributions
at+0.164
affle+0.155
atmeal+0.147
abella+0.145
ail+0.143
Neg logit contributions
 than-0.133
 kinds-0.116
 sizes-0.110
 type-0.109
ever-0.106
Token there
Token ID612
Feature activation-3.68e+00

Pos logit contributions
at+0.138
affle+0.135
atmeal+0.134
ail+0.124
ogo+0.122
Neg logit contributions
 than-0.111
 who-0.104
 type-0.095
 kinds-0.093
 whose-0.091
Token tree
Token ID5509
Feature activation-2.76e+00

Pos logit contributions
at+0.049
ail+0.048
affle+0.045
abella+0.045
ogo+0.044
Neg logit contributions
 than-0.036
 kinds-0.035
 sizes-0.033
 sorts-0.032
 type-0.031
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
ail+0.058
at+0.054
 skating+0.049
 back+0.048
 onwards+0.047
Neg logit contributions
 kinds-0.054
 sizes-0.054
 than-0.052
 type-0.049
 textures-0.049
Token She
Token ID1375
Feature activation+4.63e-02

Pos logit contributions
ail+0.007
at+0.007
 onwards+0.006
 seeker+0.006
abella+0.006
Neg logit contributions
 than-0.010
 sizes-0.009
 kinds-0.009
 type-0.009
lier-0.009
Token was
Token ID373
Feature activation-1.56e+00

Pos logit contributions
 than+0.001
 kinds+0.001
 sizes+0.001
 sorts+0.001
 type+0.001
Neg logit contributions
ail-0.001
at-0.001
affle-0.001
abella-0.001
ip-0.001
Token so
Token ID523
Feature activation+1.14e+00

Pos logit contributions
at+0.038
ail+0.037
affle+0.036
ogo+0.034
ip+0.033
Neg logit contributions
 than-0.028
 kinds-0.028
 sorts-0.026
 sizes-0.026
 type-0.024
Token happy
Token ID3772
Feature activation+3.24e+00

Pos logit contributions
 than+0.024
 kinds+0.024
 sizes+0.022
 sorts+0.022
 type+0.021
Neg logit contributions
at-0.024
ail-0.024
affle-0.022
 seeker-0.022
ogo-0.022
Token and
Token ID290
Feature activation-3.68e-01

Pos logit contributions
 sizes+0.073
 kinds+0.073
 unison+0.070
 type+0.069
 sorts+0.068
Neg logit contributions
ail-0.058
at-0.053
 skating-0.052
 seeker-0.049
ip-0.048
Token the
Token ID262
Feature activation-5.84e-03

Pos logit contributions
ail+0.008
at+0.007
 seeker+0.007
affle+0.007
 skating+0.007
Neg logit contributions
 kinds-0.007
 than-0.007
 sizes-0.007
 sorts-0.007
 type-0.007
Token tree
Token ID5509
Feature activation-6.47e-01

Pos logit contributions
at+0.000
ail+0.000
affle+0.000
ogo+0.000
ip+0.000
Neg logit contributions
 kinds-0.000
 than-0.000
 sizes-0.000
 sorts-0.000
 type-0.000
Token was
Token ID373
Feature activation-6.09e-01

Pos logit contributions
ail+0.012
at+0.011
affle+0.010
 seeker+0.009
ip+0.009
Neg logit contributions
 kinds-0.015
 than-0.015
 sizes-0.015
 sorts-0.014
 type-0.014
Token happy
Token ID3772
Feature activation+3.87e+00

Pos logit contributions
at+0.013
ail+0.012
affle+0.012
 seeker+0.011
ogo+0.011
Neg logit contributions
 than-0.013
 kinds-0.013
 sizes-0.012
 sorts-0.012
 type-0.011
Token It
Token ID632
Feature activation+6.44e-01

Pos logit contributions
 kinds+0.004
 than+0.004
 sizes+0.004
 sorts+0.003
 type+0.003
Neg logit contributions
at-0.004
ail-0.004
affle-0.003
 seeker-0.003
ogo-0.003
Token was
Token ID373
Feature activation+1.09e+00

Pos logit contributions
 kinds+0.013
 than+0.013
 sizes+0.013
 sorts+0.012
 type+0.012
Neg logit contributions
at-0.014
ail-0.014
 seeker-0.013
affle-0.013
ogo-0.012
Token very
Token ID845
Feature activation+3.31e+00

Pos logit contributions
 than+0.023
 kinds+0.022
 sizes+0.022
 type+0.021
 sorts+0.020
Neg logit contributions
at-0.024
ail-0.024
affle-0.021
atmeal-0.021
ogo-0.021
Token,
Token ID11
Feature activation+4.18e-01

Pos logit contributions
 than+0.066
 kinds+0.064
 sizes+0.061
 sorts+0.059
 type+0.059
Neg logit contributions
at-0.075
ail-0.073
affle-0.067
 seeker-0.066
ogo-0.065
Token very
Token ID845
Feature activation+3.49e+00

Pos logit contributions
 kinds+0.008
 sizes+0.007
 sorts+0.007
 than+0.007
lier+0.007
Neg logit contributions
at-0.009
ail-0.008
 seeker-0.008
 skating-0.008
affle-0.008
Token big
Token ID1263
Feature activation+4.91e+00

Pos logit contributions
 than+0.072
 kinds+0.072
 sizes+0.069
 sorts+0.066
 type+0.065
Neg logit contributions
at-0.075
ail-0.072
affle-0.067
 seeker-0.066
ogo-0.064
Token.
Token ID13
Feature activation+1.09e-01

Pos logit contributions
 unison+0.107
 kinds+0.106
 sorts+0.104
 type+0.103
 sizes+0.100
Neg logit contributions
 seeker-0.077
 skating-0.074
at-0.074
ail-0.074
ip-0.073
Token He
Token ID679
Feature activation+9.27e-01

Pos logit contributions
 kinds+0.002
 than+0.002
 sizes+0.002
 sorts+0.002
 type+0.002
Neg logit contributions
at-0.002
ail-0.002
affle-0.002
 seeker-0.002
ogo-0.002
Token wanted
Token ID2227
Feature activation+1.85e+00

Pos logit contributions
 than+0.018
 kinds+0.017
 sizes+0.017
 sorts+0.016
 unison+0.016
Neg logit contributions
at-0.020
ail-0.020
affle-0.019
 skating-0.018
abella-0.018
Token to
Token ID284
Feature activation+9.06e-01

Pos logit contributions
 than+0.032
 kinds+0.032
 sizes+0.030
 sorts+0.029
 type+0.028
Neg logit contributions
at-0.046
ail-0.045
affle-0.042
 seeker-0.042
ogo-0.041
Token show
Token ID905
Feature activation+1.64e+00

Pos logit contributions
 sizes+0.022
 than+0.021
lier+0.021
 kinds+0.021
ymes+0.020
Neg logit contributions
at-0.015
 seeker-0.015
ail-0.014
affle-0.014
atmeal-0.013
Token walked
Token ID6807
Feature activation-3.92e+00

Pos logit contributions
at+0.040
ail+0.040
affle+0.038
 seeker+0.037
ogo+0.036
Neg logit contributions
 kinds-0.038
 sorts-0.036
 than-0.035
 sizes-0.034
 type-0.033
Token,
Token ID11
Feature activation-1.79e+00

Pos logit contributions
at+0.105
affle+0.104
atmeal+0.103
uch+0.102
abella+0.098
Neg logit contributions
 than-0.071
 kinds-0.062
 sorts-0.058
 each-0.052
 sizes-0.051
Token Marc
Token ID13067
Feature activation-1.24e+00

Pos logit contributions
at+0.045
affle+0.043
ail+0.042
ogo+0.040
atmeal+0.039
Neg logit contributions
 than-0.031
 kinds-0.030
 sizes-0.029
 sorts-0.028
 type-0.026
Tokenie
Token ID494
Feature activation+1.34e+00

Pos logit contributions
at+0.025
 seeker+0.024
ail+0.024
affle+0.023
 skating+0.022
Neg logit contributions
 kinds-0.025
 than-0.025
 sizes-0.024
 sorts-0.023
 type-0.022
Token noticed
Token ID6810
Feature activation+3.02e+00

Pos logit contributions
 than+0.021
 kinds+0.021
 sizes+0.019
 sorts+0.019
 who+0.018
Neg logit contributions
affle-0.034
at-0.033
 seeker-0.032
ogo-0.031
ail-0.030
Token a
Token ID257
Feature activation+3.10e+00

Pos logit contributions
 kinds+0.058
 than+0.057
 sizes+0.054
 sorts+0.053
 type+0.051
Neg logit contributions
at-0.069
ail-0.067
affle-0.064
 seeker-0.062
ogo-0.060
Token hidden
Token ID7104
Feature activation+2.32e+00

Pos logit contributions
 than+0.057
 kinds+0.056
 sizes+0.054
 type+0.054
 sorts+0.052
Neg logit contributions
at-0.073
ail-0.070
affle-0.064
ogo-0.064
 seeker-0.063
Token pole
Token ID16825
Feature activation+5.35e-01

Pos logit contributions
 than+0.049
 kinds+0.049
 sizes+0.047
 sorts+0.046
 type+0.044
Neg logit contributions
at-0.047
ail-0.046
affle-0.042
 seeker-0.041
ogo-0.041
Token.
Token ID13
Feature activation+2.99e-01

Pos logit contributions
 kinds+0.012
 sizes+0.012
 than+0.012
 sorts+0.012
 textures+0.011
Neg logit contributions
at-0.010
ail-0.009
 skating-0.008
 seeker-0.008
affle-0.008
Token She
Token ID1375
Feature activation+1.37e+00

Pos logit contributions
 kinds+0.005
 than+0.005
 sizes+0.005
 sorts+0.005
 type+0.005
Neg logit contributions
at-0.007
ail-0.007
affle-0.007
 seeker-0.006
ogo-0.006
Token walked
Token ID6807
Feature activation-1.71e+00

Pos logit contributions
 kinds+0.025
 than+0.025
 sizes+0.024
 sorts+0.023
 type+0.022
Neg logit contributions
at-0.032
ail-0.032
affle-0.029
 seeker-0.029
ogo-0.028
Token she
Token ID673
Feature activation-2.71e+00

Pos logit contributions
 than+0.016
 kinds+0.015
 sizes+0.015
 sorts+0.014
 type+0.014
Neg logit contributions
at-0.018
ail-0.017
affle-0.017
 seeker-0.016
ogo-0.016
Token looked
Token ID3114
Feature activation-4.72e+00

Pos logit contributions
at+0.053
ail+0.052
 seeker+0.049
affle+0.049
ogo+0.047
Neg logit contributions
 kinds-0.059
 than-0.058
 sizes-0.056
 sorts-0.054
 type-0.052
Token around
Token ID1088
Feature activation-4.66e-01

Pos logit contributions
at+0.112
uch+0.110
affle+0.110
atmeal+0.109
 seeker+0.104
Neg logit contributions
 than-0.092
 kinds-0.081
 sorts-0.077
 who-0.075
 each-0.074
Token,
Token ID11
Feature activation-1.34e+00

Pos logit contributions
at+0.011
affle+0.010
ail+0.010
ogo+0.009
atmeal+0.009
Neg logit contributions
 than-0.010
 kinds-0.009
 sorts-0.009
 sizes-0.008
 type-0.008
Token she
Token ID673
Feature activation+4.26e-01

Pos logit contributions
at+0.038
affle+0.037
ogo+0.036
ail+0.035
aj+0.032
Neg logit contributions
 than-0.020
 kinds-0.019
 sizes-0.017
 sorts-0.017
 type-0.016
Token noticed
Token ID6810
Feature activation+2.68e+00

Pos logit contributions
 than+0.007
 kinds+0.007
 sorts+0.007
 sizes+0.007
 type+0.007
Neg logit contributions
at-0.010
affle-0.010
ail-0.009
 seeker-0.009
ogo-0.009
Token a
Token ID257
Feature activation+2.84e+00

Pos logit contributions
 kinds+0.055
 than+0.055
 sizes+0.052
 sorts+0.050
 type+0.048
Neg logit contributions
at-0.056
ail-0.055
affle-0.051
 seeker-0.050
ogo-0.049
Token big
Token ID1263
Feature activation+4.58e+00

Pos logit contributions
 than+0.053
 kinds+0.053
 sizes+0.051
 sorts+0.048
 type+0.047
Neg logit contributions
at-0.065
ail-0.063
affle-0.059
 seeker-0.058
ogo-0.057
Token furry
Token ID40857
Feature activation+4.27e+00

Pos logit contributions
 kinds+0.101
 than+0.101
 sizes+0.098
 sorts+0.094
 type+0.089
Neg logit contributions
at-0.086
ail-0.085
 seeker-0.079
affle-0.078
ogo-0.074
Token monster
Token ID9234
Feature activation+2.18e+00

Pos logit contributions
 kinds+0.079
 than+0.079
 sizes+0.076
 sorts+0.072
 type+0.068
Neg logit contributions
at-0.097
ail-0.096
affle-0.089
 seeker-0.089
ogo-0.086
Token with
Token ID351
Feature activation+4.82e-01

Pos logit contributions
 kinds+0.047
 than+0.045
 sizes+0.045
 sorts+0.043
 type+0.041
Neg logit contributions
at-0.043
ail-0.042
affle-0.039
 seeker-0.038
 skating-0.037
Token a
Token ID257
Feature activation-2.94e+00

Pos logit contributions
 than+0.009
 sizes+0.009
 kinds+0.009
 sorts+0.008
 type+0.008
Neg logit contributions
at-0.006
ail-0.006
 seeker-0.005
affle-0.005
 skating-0.005
Token time
Token ID640
Feature activation-5.60e+00

Pos logit contributions
at+0.041
ail+0.038
affle+0.036
 seeker+0.034
 skating+0.031
Neg logit contributions
 kinds-0.082
 sizes-0.082
 than-0.079
 sorts-0.076
 textures-0.074
Token,
Token ID11
Feature activation-4.75e+00

Pos logit contributions
at+0.129
affle+0.121
ail+0.115
atmeal+0.114
ogo+0.111
Neg logit contributions
 than-0.112
 kinds-0.102
 sorts-0.094
 type-0.093
 sizes-0.093
Token there
Token ID612
Feature activation-2.91e+00

Pos logit contributions
at+0.086
ail+0.082
affle+0.078
ogo+0.075
atmeal+0.073
Neg logit contributions
 than-0.115
 kinds-0.113
 sizes-0.107
 sorts-0.106
 type-0.104
Token was
Token ID373
Feature activation-9.49e-01

Pos logit contributions
at+0.063
ail+0.061
affle+0.057
 seeker+0.057
ogo+0.055
Neg logit contributions
 kinds-0.058
 than-0.057
 sizes-0.055
 sorts-0.054
 type-0.052
Token a
Token ID257
Feature activation+2.81e+00

Pos logit contributions
at+0.017
ail+0.017
 seeker+0.016
affle+0.016
 skating+0.015
Neg logit contributions
 than-0.022
 kinds-0.021
 sizes-0.021
 sorts-0.020
 type-0.019
Token little
Token ID1310
Feature activation+5.05e+00

Pos logit contributions
 than+0.046
 kinds+0.045
 sizes+0.042
 sorts+0.041
 type+0.040
Neg logit contributions
at-0.071
ail-0.069
affle-0.064
ogo-0.064
 seeker-0.063
Token girl
Token ID2576
Feature activation+2.70e+00

Pos logit contributions
 kinds+0.092
 than+0.092
 sizes+0.090
 sorts+0.084
 type+0.079
Neg logit contributions
at-0.113
ail-0.113
 seeker-0.108
affle-0.106
ip-0.102
Token named
Token ID3706
Feature activation-2.23e+00

Pos logit contributions
 sizes+0.080
 kinds+0.078
 textures+0.073
 target+0.073
 sorts+0.072
Neg logit contributions
ail-0.033
at-0.031
 seeker-0.028
ip-0.026
ig-0.025
Token Sarah
Token ID10490
Feature activation+1.84e+00

Pos logit contributions
at+0.052
ail+0.049
affle+0.048
ogo+0.046
 seeker+0.046
Neg logit contributions
 kinds-0.042
 than-0.041
 sizes-0.039
 sorts-0.038
 type-0.037
Token who
Token ID508
Feature activation-1.82e+00

Pos logit contributions
 sizes+0.045
 kinds+0.044
 unison+0.042
 target+0.042
 sorts+0.041
Neg logit contributions
ail-0.028
at-0.028
 back-0.025
pee-0.023
 on-0.023
Token The
Token ID383
Feature activation+5.00e-01

Pos logit contributions
 kinds+0.006
 than+0.006
 sizes+0.006
 sorts+0.006
 type+0.006
Neg logit contributions
at-0.009
ail-0.008
affle-0.008
 seeker-0.008
ogo-0.008
Token creature
Token ID7185
Feature activation+1.88e+00

Pos logit contributions
 kinds+0.008
 than+0.007
 sizes+0.007
 sorts+0.007
 type+0.007
Neg logit contributions
at-0.013
ail-0.012
atmeal-0.012
ip-0.012
ogo-0.012
Token had
Token ID550
Feature activation-6.55e-03

Pos logit contributions
 kinds+0.043
 than+0.042
 sizes+0.042
 sorts+0.040
 type+0.038
Neg logit contributions
ail-0.035
at-0.034
affle-0.030
 skating-0.029
ogo-0.029
Token left
Token ID1364
Feature activation-7.43e-01

Pos logit contributions
at+0.000
ail+0.000
 seeker+0.000
ogo+0.000
abella+0.000
Neg logit contributions
 than-0.000
 kinds-0.000
 sizes-0.000
 sorts-0.000
 type-0.000
Token behind
Token ID2157
Feature activation+1.88e+00

Pos logit contributions
at+0.016
ail+0.015
affle+0.014
 seeker+0.014
ogo+0.014
Neg logit contributions
 than-0.015
 kinds-0.015
 sizes-0.015
 sorts-0.014
 type-0.013
Token a
Token ID257
Feature activation-8.74e-01

Pos logit contributions
 than+0.035
 kinds+0.035
 sizes+0.033
 sorts+0.032
 type+0.030
Neg logit contributions
at-0.043
ail-0.042
affle-0.040
 seeker-0.039
ogo-0.038
Token feather
Token ID17972
Feature activation-2.97e+00

Pos logit contributions
at+0.018
ail+0.018
affle+0.016
 seeker+0.016
ogo+0.016
Neg logit contributions
 kinds-0.018
 than-0.018
 sizes-0.017
 sorts-0.017
 type-0.016
Token
Token ID6184
Feature activation+1.69e-01

Pos logit contributions
at+0.073
ail+0.072
 seeker+0.067
affle+0.067
ogo+0.065
Neg logit contributions
 kinds-0.051
 than-0.049
 sizes-0.048
 sorts-0.046
 type-0.043
Token
Token ID95
Feature activation+6.83e-01

Pos logit contributions
 than+0.004
 kinds+0.003
 sizes+0.003
 sorts+0.003
 type+0.003
Neg logit contributions
at-0.003
ail-0.003
affle-0.003
 seeker-0.003
ogo-0.003
Token
Token ID26391
Feature activation-3.79e+00

Pos logit contributions
 kinds+0.014
 than+0.014
 sizes+0.014
 sorts+0.013
 type+0.013
Neg logit contributions
at-0.014
ail-0.014
affle-0.013
 seeker-0.013
ogo-0.012
Token
Token ID447
Feature activation-2.68e+00

Pos logit contributions
at+0.073
ail+0.072
affle+0.066
 seeker+0.066
ogo+0.063
Neg logit contributions
 kinds-0.083
 than-0.083
 sizes-0.080
 sorts-0.077
 type-0.074
Token games
Token ID1830
Feature activation+7.51e-01

Pos logit contributions
 than+0.019
 sizes+0.019
 kinds+0.019
 unison+0.017
 sorts+0.017
Neg logit contributions
at-0.022
ail-0.022
affle-0.021
 seeker-0.020
 skating-0.020
Token with
Token ID351
Feature activation+1.74e-02

Pos logit contributions
 kinds+0.016
 sizes+0.016
 than+0.015
 sorts+0.015
 textures+0.014
Neg logit contributions
ail-0.014
at-0.014
 seeker-0.013
 skating-0.012
 onwards-0.012
Token her
Token ID607
Feature activation-3.62e+00

Pos logit contributions
 than+0.000
 sizes+0.000
 kinds+0.000
ever+0.000
 sorts+0.000
Neg logit contributions
ail-0.000
at-0.000
 onwards-0.000
affle-0.000
 seeker-0.000
Token and
Token ID290
Feature activation-2.12e+00

Pos logit contributions
ail+0.064
at+0.056
affle+0.055
 seeker+0.055
ip+0.054
Neg logit contributions
 kinds-0.082
 sizes-0.080
 than-0.079
 sorts-0.078
lier-0.073
Token had
Token ID550
Feature activation-2.03e+00

Pos logit contributions
ail+0.040
at+0.039
 seeker+0.037
affle+0.035
abella+0.035
Neg logit contributions
 than-0.045
 kinds-0.044
 sizes-0.042
 sorts-0.041
lier-0.040
Token lots
Token ID6041
Feature activation+1.94e+00

Pos logit contributions
at+0.028
ail+0.028
abella+0.027
 skating+0.026
 done+0.025
Neg logit contributions
 than-0.052
 kinds-0.048
 sizes-0.047
ever-0.046
izing-0.045
Token of
Token ID286
Feature activation+1.16e+00

Pos logit contributions
 sizes+0.032
 target+0.031
 type+0.030
 kinds+0.030
 than+0.030
Neg logit contributions
at-0.043
ail-0.042
affle-0.041
ip-0.040
 skating-0.039
Token fun
Token ID1257
Feature activation+3.75e-01

Pos logit contributions
 than+0.023
 sizes+0.022
 kinds+0.022
 target+0.021
 sorts+0.021
Neg logit contributions
affle-0.022
at-0.022
ail-0.021
 seeker-0.020
 skating-0.020
Token.
Token ID13
Feature activation-8.21e-01

Pos logit contributions
 target+0.007
 sizes+0.007
 kinds+0.007
 type+0.007
 unison+0.007
Neg logit contributions
ail-0.007
 skating-0.007
at-0.007
affle-0.007
 onwards-0.006
Token They
Token ID1119
Feature activation-5.14e-01

Pos logit contributions
ail+0.017
at+0.017
affle+0.015
 seeker+0.015
ip+0.015
Neg logit contributions
 than-0.017
 kinds-0.017
 sizes-0.016
 type-0.015
 sorts-0.015
Token learned
Token ID4499
Feature activation+2.07e+00

Pos logit contributions
at+0.010
ail+0.010
affle+0.009
 seeker+0.009
ogo+0.009
Neg logit contributions
 than-0.011
 kinds-0.011
 sizes-0.011
 sorts-0.010
 type-0.010
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
 than+0.023
 kinds+0.023
 sizes+0.022
 sorts+0.021
 type+0.020
Neg logit contributions
ail-0.020
at-0.020
affle-0.018
 seeker-0.017
abella-0.017
Token her
Token ID607
Feature activation-3.75e+00

Pos logit contributions
ail+0.033
at+0.031
 seeker+0.031
affle+0.028
ipop+0.028
Neg logit contributions
 than-0.039
 sizes-0.038
 kinds-0.037
lier-0.035
ever-0.034
Token friends
Token ID2460
Feature activation+3.80e-01

Pos logit contributions
at+0.085
ail+0.079
 seeker+0.075
abella+0.074
affle+0.074
Neg logit contributions
 kinds-0.073
 than-0.071
 sizes-0.068
 sorts-0.066
 type-0.065
Token had
Token ID550
Feature activation-1.44e+00

Pos logit contributions
 kinds+0.007
 than+0.007
 sizes+0.007
 sorts+0.007
 type+0.007
Neg logit contributions
at-0.008
ail-0.008
affle-0.008
 seeker-0.007
ogo-0.007
Token lots
Token ID6041
Feature activation+2.76e+00

Pos logit contributions
ail+0.023
at+0.022
abella+0.021
 done+0.020
 skating+0.020
Neg logit contributions
 than-0.035
 sizes-0.033
 kinds-0.033
 textures-0.031
 sorts-0.031
Token of
Token ID286
Feature activation+1.60e+00

Pos logit contributions
 sizes+0.049
 target+0.046
 than+0.045
 kinds+0.044
 type+0.044
Neg logit contributions
at-0.059
ail-0.059
 refreshed-0.055
 skating-0.055
affle-0.055
Token fun
Token ID1257
Feature activation+1.05e+00

Pos logit contributions
 than+0.031
 sizes+0.030
 kinds+0.029
 target+0.027
 sorts+0.027
Neg logit contributions
at-0.034
ail-0.034
affle-0.033
 seeker-0.031
 skating-0.030
Token playing
Token ID2712
Feature activation-7.24e-01

Pos logit contributions
 sizes+0.025
 kinds+0.025
 target+0.025
 than+0.025
lier+0.024
Neg logit contributions
ail-0.016
at-0.016
 skating-0.015
affle-0.013
ip-0.013
Token with
Token ID351
Feature activation+1.41e-01

Pos logit contributions
at+0.014
ail+0.014
 seeker+0.013
affle+0.013
 skating+0.013
Neg logit contributions
 sizes-0.015
 kinds-0.015
 than-0.015
 sorts-0.014
 type-0.013
Token the
Token ID262
Feature activation-9.20e-01

Pos logit contributions
 than+0.003
 sizes+0.003
 target+0.003
 kinds+0.003
ever+0.003
Neg logit contributions
ail-0.002
at-0.002
ip-0.002
 skating-0.002
 seeker-0.002
Token mysterious
Token ID11428
Feature activation+6.10e-01

Pos logit contributions
at+0.018
ail+0.018
affle+0.016
 seeker+0.016
ogo+0.016
Neg logit contributions
 kinds-0.020
 than-0.020
 sizes-0.019
 sorts-0.019
 type-0.018
Token "
Token ID366
Feature activation+5.35e+00

Pos logit contributions
at+0.028
affle+0.026
 seeker+0.026
ail+0.025
ogo+0.025
Neg logit contributions
 than-0.011
 sizes-0.010
 who-0.009
 type-0.009
 kinds-0.009
TokenWhere
Token ID8496
Feature activation-1.32e+00

Pos logit contributions
 kinds+0.098
 than+0.098
 sizes+0.093
 sorts+0.089
 type+0.085
Neg logit contributions
at-0.124
ail-0.122
affle-0.114
 seeker-0.113
ogo-0.109
Token did
Token ID750
Feature activation+9.67e-01

Pos logit contributions
at+0.028
ail+0.028
affle+0.026
 seeker+0.025
ogo+0.025
Neg logit contributions
 kinds-0.027
 than-0.027
 sizes-0.026
 sorts-0.025
 type-0.024
Token you
Token ID345
Feature activation-6.58e-01

Pos logit contributions
 kinds+0.016
 than+0.016
 sizes+0.015
 sorts+0.014
 type+0.013
Neg logit contributions
at-0.024
ail-0.024
affle-0.022
 seeker-0.022
ogo-0.022
Token go
Token ID467
Feature activation-2.44e+00

Pos logit contributions
at+0.014
ail+0.013
 seeker+0.012
affle+0.012
ogo+0.012
Neg logit contributions
 kinds-0.014
 than-0.014
 sizes-0.014
 sorts-0.013
 type-0.012
Token Nate
Token ID23486
Feature activation-1.13e+00

Pos logit contributions
at+0.053
ail+0.053
ogo+0.052
abella+0.052
affle+0.051
Neg logit contributions
 than-0.042
 kinds-0.040
 sorts-0.038
 type-0.038
 who-0.036
Token?
Token ID30
Feature activation-7.84e-02

Pos logit contributions
ail+0.023
at+0.022
affle+0.021
 seeker+0.021
ogo+0.021
Neg logit contributions
 than-0.024
 kinds-0.023
 sizes-0.021
 sorts-0.021
 type-0.021
Token I
Token ID314
Feature activation-2.54e-01

Pos logit contributions
at+0.002
ail+0.002
affle+0.002
ogo+0.002
 seeker+0.002
Neg logit contributions
 than-0.001
 kinds-0.001
 sizes-0.001
 sorts-0.001
 type-0.001
Token was
Token ID373
Feature activation-2.01e+00

Pos logit contributions
at+0.005
ail+0.005
affle+0.005
 seeker+0.005
ogo+0.004
Neg logit contributions
 than-0.005
 kinds-0.005
 sizes-0.005
 sorts-0.005
 type-0.005
Token looking
Token ID2045
Feature activation+5.18e-01

Pos logit contributions
at+0.037
ail+0.037
affle+0.034
ogo+0.033
 seeker+0.032
Neg logit contributions
 than-0.047
 kinds-0.045
 sizes-0.043
 sorts-0.043
 type-0.042
Token for
Token ID329
Feature activation+2.62e+00

Pos logit contributions
 than+0.012
 kinds+0.012
 sorts+0.011
 sizes+0.011
 type+0.010
Neg logit contributions
at-0.010
ail-0.010
ogo-0.009
affle-0.009
atmeal-0.009
Token You
Token ID921
Feature activation-6.85e-01

Pos logit contributions
 kinds+0.037
 than+0.035
 sorts+0.034
 sizes+0.034
 type+0.031
Neg logit contributions
 seeker-0.042
at-0.041
ail-0.041
affle-0.039
abella-0.038
Token are
Token ID389
Feature activation-1.10e+00

Pos logit contributions
at+0.012
 seeker+0.012
ail+0.012
affle+0.011
 bumped+0.011
Neg logit contributions
 kinds-0.016
 than-0.015
 sizes-0.015
 sorts-0.015
 textures-0.014
Token not
Token ID407
Feature activation+3.95e-01

Pos logit contributions
at+0.018
ail+0.018
 seeker+0.017
affle+0.017
 skating+0.016
Neg logit contributions
 kinds-0.027
 than-0.026
 sizes-0.026
 sorts-0.025
 type-0.024
Token a
Token ID257
Feature activation+2.33e+00

Pos logit contributions
 sizes+0.009
 kinds+0.009
 unison+0.009
 sorts+0.008
 than+0.008
Neg logit contributions
 seeker-0.006
at-0.006
affle-0.006
 skating-0.006
 done-0.006
Token good
Token ID922
Feature activation+4.56e-01

Pos logit contributions
 kinds+0.059
 sizes+0.056
 sorts+0.055
 than+0.054
 unison+0.052
Neg logit contributions
 seeker-0.037
ail-0.036
affle-0.035
at-0.034
 skating-0.031
Token friend
Token ID1545
Feature activation+2.45e+00

Pos logit contributions
 kinds+0.010
 sizes+0.010
 than+0.009
 sorts+0.009
 unison+0.009
Neg logit contributions
 seeker-0.009
ail-0.008
affle-0.008
at-0.008
 skating-0.007
Token.
Token ID13
Feature activation+1.81e+00

Pos logit contributions
 kinds+0.055
 sorts+0.054
 sizes+0.052
 textures+0.052
 unison+0.050
Neg logit contributions
 seeker-0.040
at-0.039
 back-0.038
 on-0.038
 anywhere-0.037
Token You
Token ID921
Feature activation-6.79e-01

Pos logit contributions
 kinds+0.039
 sorts+0.036
 than+0.036
 sizes+0.036
 type+0.034
Neg logit contributions
 seeker-0.035
at-0.033
ail-0.032
affle-0.031
abella-0.031
Token are
Token ID389
Feature activation-1.16e+00

Pos logit contributions
at+0.012
 seeker+0.012
ail+0.012
affle+0.011
uch+0.011
Neg logit contributions
 kinds-0.015
 than-0.015
 sizes-0.015
 sorts-0.014
 unison-0.014
Token selfish
Token ID20363
Feature activation+3.14e+00

Pos logit contributions
at+0.018
 seeker+0.018
ail+0.018
affle+0.017
 skating+0.016
Neg logit contributions
 kinds-0.029
 sizes-0.028
 sorts-0.027
 than-0.027
 textures-0.026
Token and
Token ID290
Feature activation+4.64e-01

Pos logit contributions
 sorts+0.068
 kinds+0.066
 unison+0.065
 type+0.064
 sizes+0.062
Neg logit contributions
 seeker-0.053
 back-0.050
ail-0.050
 skating-0.049
 on-0.048
Token mom
Token ID1995
Feature activation+1.20e+00

Pos logit contributions
at+0.072
ogo+0.061
affle+0.061
or+0.060
abella+0.059
Neg logit contributions
 kinds-0.053
 than-0.050
 sizes-0.050
 shapes-0.049
 type-0.049
Tokenmy
Token ID1820
Feature activation+7.48e-01

Pos logit contributions
 kinds+0.028
 than+0.027
 sizes+0.027
 sorts+0.026
 type+0.024
Neg logit contributions
at-0.022
ail-0.021
affle-0.021
 seeker-0.020
ogo-0.019
Token ran
Token ID4966
Feature activation-1.05e+00

Pos logit contributions
 kinds+0.015
 than+0.014
 sizes+0.014
 sorts+0.013
 type+0.013
Neg logit contributions
at-0.016
ail-0.016
affle-0.015
 seeker-0.015
ogo-0.014
Token back
Token ID736
Feature activation-2.20e+00

Pos logit contributions
at+0.022
ail+0.021
affle+0.021
atmeal+0.020
ogo+0.019
Neg logit contributions
 than-0.023
 kinds-0.021
 sorts-0.021
 sizes-0.020
 type-0.019
Token to
Token ID284
Feature activation-1.71e-01

Pos logit contributions
at+0.046
affle+0.044
atmeal+0.042
ail+0.039
uch+0.039
Neg logit contributions
 than-0.050
 kinds-0.048
 sorts-0.046
 sizes-0.043
 type-0.041
Token their
Token ID511
Feature activation-3.49e+00

Pos logit contributions
ail+0.004
at+0.003
 seeker+0.003
affle+0.003
 skating+0.003
Neg logit contributions
 than-0.004
 kinds-0.004
 sizes-0.003
 sorts-0.003
 type-0.003
Token house
Token ID2156
Feature activation-1.88e+00

Pos logit contributions
at+0.073
ail+0.066
affle+0.065
 seeker+0.062
ogo+0.061
Neg logit contributions
 kinds-0.075
 than-0.074
 sizes-0.072
 sorts-0.069
 type-0.067
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
ail+0.041
at+0.040
 seeker+0.037
affle+0.036
 skating+0.036
Neg logit contributions
 kinds-0.036
 than-0.035
 sizes-0.035
 sorts-0.033
 type-0.032
Token but
Token ID475
Feature activation-2.92e-01

Pos logit contributions
at+0.028
ail+0.027
affle+0.025
 seeker+0.025
ogo+0.024
Neg logit contributions
 than-0.026
 kinds-0.025
 sizes-0.024
 sorts-0.023
 type-0.022
Token they
Token ID484
Feature activation+4.42e-02

Pos logit contributions
at+0.009
affle+0.008
ogo+0.008
atmeal+0.008
ail+0.008
Neg logit contributions
 than-0.004
 kinds-0.004
 each-0.004
 type-0.003
 sizes-0.003
Token got
Token ID1392
Feature activation-1.35e+00

Pos logit contributions
 than+0.001
 kinds+0.001
 sizes+0.001
 type+0.001
 sorts+0.001
Neg logit contributions
at-0.001
affle-0.001
ail-0.001
ogo-0.001
 seeker-0.001
Token out
Token ID503
Feature activation-4.19e-01

Pos logit contributions
 kinds+0.014
 than+0.013
 sizes+0.013
 sorts+0.013
 type+0.012
Neg logit contributions
ail-0.017
at-0.016
 skating-0.015
affle-0.014
 seeker-0.014
Token.
Token ID13
Feature activation-4.82e-01

Pos logit contributions
ail+0.009
at+0.009
 onwards+0.008
ip+0.008
 onward+0.008
Neg logit contributions
 kinds-0.007
 than-0.007
 sizes-0.007
 sorts-0.007
 target-0.006
Token
Token ID198
Feature activation+2.93e-02

Pos logit contributions
at+0.009
ail+0.009
affle+0.008
 seeker+0.008
ogo+0.008
Neg logit contributions
 than-0.011
 kinds-0.011
 sizes-0.010
 sorts-0.010
 type-0.010
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
 kinds+0.001
 than+0.001
 sizes+0.001
 sorts+0.001
 type+0.001
Neg logit contributions
at-0.001
ail-0.001
 seeker-0.001
affle-0.001
ogo-0.001
TokenOne
Token ID3198
Feature activation+1.07e+00

Pos logit contributions
 than+0.025
 kinds+0.024
 sizes+0.024
 sorts+0.022
 textures+0.021
Neg logit contributions
at-0.022
ail-0.021
affle-0.020
abella-0.019
ogo-0.019
Token day
Token ID1110
Feature activation-2.54e+00

Pos logit contributions
 kinds+0.029
 than+0.029
 sizes+0.028
 sorts+0.028
 type+0.027
Neg logit contributions
at-0.015
ail-0.015
affle-0.014
 seeker-0.013
ogo-0.013
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
at+0.048
ail+0.046
affle+0.043
ogo+0.042
 seeker+0.041
Neg logit contributions
 than-0.058
 kinds-0.058
 sizes-0.055
 sorts-0.054
 type-0.052
Token the
Token ID262
Feature activation+5.96e-01

Pos logit contributions
ail+0.024
at+0.023
 seeker+0.022
abella+0.021
affle+0.021
Neg logit contributions
 kinds-0.027
 than-0.026
 sizes-0.026
 sorts-0.025
 type-0.023
Token group
Token ID1448
Feature activation+3.70e+00

Pos logit contributions
 kinds+0.011
 than+0.011
 sizes+0.010
 sorts+0.010
 type+0.010
Neg logit contributions
at-0.014
ail-0.014
affle-0.013
 seeker-0.012
ogo-0.012
Token decided
Token ID3066
Feature activation+9.15e-01

Pos logit contributions
 sizes+0.066
 than+0.066
 kinds+0.064
 textures+0.059
 sorts+0.058
Neg logit contributions
ail-0.086
at-0.085
 seeker-0.076
abella-0.075
 onwards-0.075
Token to
Token ID284
Feature activation+1.60e+00

Pos logit contributions
 kinds+0.019
 than+0.018
 sizes+0.018
 sorts+0.017
 type+0.016
Neg logit contributions
at-0.019
ail-0.019
 seeker-0.017
affle-0.017
ogo-0.017
Token went
Token ID1816
Feature activation-3.73e+00

Pos logit contributions
 than+0.017
 kinds+0.017
 sizes+0.016
 sorts+0.016
 type+0.015
Neg logit contributions
at-0.018
ail-0.018
affle-0.017
 seeker-0.016
ogo-0.016
Token to
Token ID284
Feature activation+1.99e-01

Pos logit contributions
at+0.100
uch+0.097
atmeal+0.093
affle+0.093
ail+0.091
Neg logit contributions
 than-0.059
 sorts-0.048
 kinds-0.047
 each-0.042
 target-0.042
Token the
Token ID262
Feature activation-5.62e-01

Pos logit contributions
 kinds+0.004
 than+0.004
 sizes+0.004
 sorts+0.004
 type+0.003
Neg logit contributions
at-0.004
ail-0.004
affle-0.004
 seeker-0.004
ogo-0.004
Token shop
Token ID6128
Feature activation-1.10e+00

Pos logit contributions
at+0.012
ail+0.011
affle+0.011
ogo+0.010
ip+0.010
Neg logit contributions
 than-0.012
 kinds-0.011
 sizes-0.011
 type-0.011
 sorts-0.011
Token with
Token ID351
Feature activation+2.15e-01

Pos logit contributions
ail+0.024
at+0.022
 seeker+0.021
 skating+0.021
belt+0.020
Neg logit contributions
 sizes-0.022
 kinds-0.021
 than-0.020
 sorts-0.020
 unison-0.019
Token his
Token ID465
Feature activation-2.85e+00

Pos logit contributions
 kinds+0.005
 than+0.004
 sizes+0.004
 sorts+0.004
 type+0.004
Neg logit contributions
at-0.005
ail-0.005
affle-0.004
ogo-0.004
 seeker-0.004
Token mum
Token ID25682
Feature activation+1.19e+00

Pos logit contributions
at+0.053
ail+0.049
affle+0.046
ogo+0.045
abella+0.045
Neg logit contributions
 than-0.066
 kinds-0.066
 sizes-0.063
 type-0.061
 sorts-0.061
Token.
Token ID13
Feature activation+1.75e-01

Pos logit contributions
 kinds+0.020
 than+0.019
 sizes+0.019
 sorts+0.018
 type+0.017
Neg logit contributions
at-0.029
ail-0.029
affle-0.027
 seeker-0.026
ip-0.026
Token He
Token ID679
Feature activation+6.51e-01

Pos logit contributions
 kinds+0.003
 than+0.003
 sizes+0.003
 sorts+0.003
 type+0.003
Neg logit contributions
ail-0.004
at-0.004
affle-0.003
 seeker-0.003
ogo-0.003
Token wanted
Token ID2227
Feature activation+1.64e+00

Pos logit contributions
 kinds+0.012
 than+0.012
 sizes+0.012
 sorts+0.011
 type+0.011
Neg logit contributions
at-0.015
ail-0.014
affle-0.014
 seeker-0.013
ogo-0.013
Token a
Token ID257
Feature activation+2.12e+00

Pos logit contributions
 kinds+0.029
 than+0.028
 sizes+0.027
 sorts+0.026
 type+0.025
Neg logit contributions
at-0.041
ail-0.040
affle-0.037
 seeker-0.037
ogo-0.036
Token.
Token ID13
Feature activation-8.88e-01

Pos logit contributions
 kinds+0.014
 sizes+0.014
 type+0.013
 textures+0.013
 sorts+0.013
Neg logit contributions
ail-0.013
at-0.013
 onwards-0.013
 seeker-0.012
ip-0.012
Token She
Token ID1375
Feature activation-2.25e-01

Pos logit contributions
ail+0.018
at+0.017
ip+0.016
abella+0.015
affle+0.015
Neg logit contributions
 than-0.019
 kinds-0.019
 sizes-0.018
 type-0.018
 sorts-0.017
Token would
Token ID561
Feature activation-4.74e-01

Pos logit contributions
ail+0.004
at+0.004
ip+0.004
affle+0.004
 seeker+0.004
Neg logit contributions
 than-0.005
 kinds-0.005
 sizes-0.005
 sorts-0.005
 type-0.005
Token go
Token ID467
Feature activation-2.73e+00

Pos logit contributions
at+0.009
ail+0.009
 seeker+0.008
affle+0.008
ogo+0.008
Neg logit contributions
 kinds-0.010
 than-0.010
 sizes-0.010
 sorts-0.009
lier-0.009
Token to
Token ID284
Feature activation+2.76e-01

Pos logit contributions
affle+0.067
at+0.067
uch+0.065
abella+0.064
bled+0.063
Neg logit contributions
 than-0.050
 sorts-0.043
 kinds-0.042
 each-0.042
 who-0.040
Token her
Token ID607
Feature activation-2.93e+00

Pos logit contributions
 than+0.006
 sizes+0.006
 kinds+0.006
lier+0.006
 sorts+0.006
Neg logit contributions
ail-0.005
at-0.005
 seeker-0.005
affle-0.004
 skating-0.004
Token huge
Token ID3236
Feature activation+4.55e-01

Pos logit contributions
ail+0.051
at+0.047
 seeker+0.045
affle+0.043
 skating+0.042
Neg logit contributions
 kinds-0.070
 sizes-0.070
 than-0.069
 sorts-0.065
 textures-0.062
Token book
Token ID1492
Feature activation-2.06e+00

Pos logit contributions
 than+0.010
 kinds+0.010
 sizes+0.010
 sorts+0.010
 type+0.009
Neg logit contributions
ail-0.008
at-0.008
 seeker-0.008
affle-0.007
 skating-0.007
Tokencase
Token ID7442
Feature activation-8.83e-01

Pos logit contributions
at+0.044
ail+0.044
 seeker+0.040
affle+0.039
ogo+0.039
Neg logit contributions
 kinds-0.040
 sizes-0.039
 than-0.039
 sorts-0.037
 type-0.035
Token everyday
Token ID10908
Feature activation-1.02e+00

Pos logit contributions
at+0.018
ail+0.018
 seeker+0.016
affle+0.015
ogo+0.015
Neg logit contributions
 kinds-0.018
 sizes-0.018
 than-0.018
 sorts-0.017
 type-0.016
Token,
Token ID11
Feature activation-9.55e-01

Pos logit contributions
ail+0.019
at+0.019
 seeker+0.017
 onwards+0.017
 onward+0.016
Neg logit contributions
 sizes-0.022
 kinds-0.021
 sorts-0.020
 textures-0.019
 unison-0.019
Token and
Token ID290
Feature activation-6.47e-01

Pos logit contributions
at+0.001
ail+0.001
affle+0.001
 seeker+0.001
ogo+0.001
Neg logit contributions
 than-0.001
 kinds-0.001
 sizes-0.001
 sorts-0.001
 type-0.001
Token say
Token ID910
Feature activation-2.30e+00

Pos logit contributions
ail+0.012
at+0.012
 seeker+0.012
affle+0.011
atmeal+0.011
Neg logit contributions
 than-0.013
 kinds-0.013
 sizes-0.013
lier-0.012
 sorts-0.012
Token,
Token ID11
Feature activation-5.47e-01

Pos logit contributions
at+0.055
ail+0.051
affle+0.050
 seeker+0.048
ogo+0.048
Neg logit contributions
 than-0.044
 kinds-0.042
 sizes-0.040
 type-0.039
 shapes-0.039
Token
Token ID6184
Feature activation+8.17e-01

Pos logit contributions
at+0.013
ail+0.013
affle+0.012
 seeker+0.012
ogo+0.012
Neg logit contributions
 than-0.010
 kinds-0.010
 sizes-0.009
 sorts-0.009
 type-0.009
Token
Token ID95
Feature activation+9.16e-01

Pos logit contributions
 than+0.019
 kinds+0.018
 sizes+0.018
 sorts+0.017
 type+0.017
Neg logit contributions
at-0.015
ail-0.014
affle-0.013
ogo-0.013
 seeker-0.013
Token
Token ID26391
Feature activation-3.64e+00

Pos logit contributions
 kinds+0.020
 than+0.020
 sizes+0.020
 sorts+0.018
 textures+0.018
Neg logit contributions
at-0.018
ail-0.017
 seeker-0.016
affle-0.016
abella-0.015
Token
Token ID129
Feature activation-5.99e-01

Pos logit contributions
at+0.065
ail+0.064
affle+0.059
 seeker+0.058
ogo+0.056
Neg logit contributions
 than-0.085
 kinds-0.084
 sizes-0.081
 sorts-0.079
 type-0.076
Token
Token ID241
Feature activation+3.30e+00

Pos logit contributions
ail+0.013
at+0.013
 seeker+0.011
affle+0.011
ipop+0.011
Neg logit contributions
 than-0.012
 kinds-0.012
 sizes-0.011
 sorts-0.011
 type-0.011
TokenI
Token ID40
Feature activation-2.38e-01

Pos logit contributions
 kinds+0.058
 sizes+0.055
 than+0.054
 sorts+0.052
 type+0.049
Neg logit contributions
at-0.075
 seeker-0.072
ail-0.071
abella-0.069
affle-0.069
Token love
Token ID1842
Feature activation+1.92e+00

Pos logit contributions
ail+0.005
at+0.005
affle+0.005
ogo+0.005
 seeker+0.005
Neg logit contributions
 than-0.004
 kinds-0.004
 sizes-0.004
 sorts-0.004
 type-0.004
Token you
Token ID345
Feature activation-8.74e-01

Pos logit contributions
 than+0.035
 kinds+0.035
 sizes+0.034
 sorts+0.032
 type+0.031
Neg logit contributions
at-0.045
ail-0.044
affle-0.041
 seeker-0.041
ogo-0.040
Token shared
Token ID4888
Feature activation+1.22e+00

Pos logit contributions
 kinds+0.013
 than+0.013
 sizes+0.013
 sorts+0.012
ever+0.012
Neg logit contributions
at-0.012
ail-0.011
affle-0.010
 seeker-0.010
ip-0.010
Token with
Token ID351
Feature activation+6.47e-01

Pos logit contributions
 kinds+0.023
 sizes+0.023
 than+0.021
 sorts+0.021
ever+0.021
Neg logit contributions
at-0.025
ail-0.024
affle-0.021
 onwards-0.021
 seeker-0.021
Token everyone
Token ID2506
Feature activation+8.17e-01

Pos logit contributions
 than+0.012
 kinds+0.012
 sizes+0.012
 sorts+0.011
 type+0.011
Neg logit contributions
ail-0.014
at-0.014
affle-0.013
abella-0.012
ip-0.012
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
 sizes+0.014
 kinds+0.014
 target+0.013
 than+0.013
 type+0.013
Neg logit contributions
ail-0.018
at-0.018
ip-0.016
 onwards-0.016
 skating-0.016
Token<|endoftext|>
Token ID50256
Feature activation-1.35e+00

Pos logit contributions
ail+0.006
at+0.006
affle+0.005
 seeker+0.005
abella+0.005
Neg logit contributions
 than-0.008
 kinds-0.008
 sizes-0.007
 type-0.007
 sorts-0.007
TokenOnce
Token ID7454
Feature activation-9.79e+00

Pos logit contributions
at+0.026
ail+0.026
affle+0.025
 seeker+0.024
atmeal+0.024
Neg logit contributions
 kinds-0.029
 than-0.029
 sorts-0.028
 sizes-0.027
lier-0.025
Token upon
Token ID2402
Feature activation+2.06e-01

Pos logit contributions
at+0.172
affle+0.157
atmeal+0.153
 seeker+0.149
pee+0.144
Neg logit contributions
 kinds-0.242
 than-0.236
 sorts-0.227
 sizes-0.215
 type-0.214
Token a
Token ID257
Feature activation-2.80e+00

Pos logit contributions
 sizes+0.005
 kinds+0.005
 than+0.005
 sorts+0.005
 type+0.004
Neg logit contributions
at-0.003
ail-0.003
 seeker-0.003
affle-0.003
 skating-0.003
Token time
Token ID640
Feature activation-5.83e+00

Pos logit contributions
at+0.041
ail+0.039
affle+0.036
 seeker+0.035
ip+0.033
Neg logit contributions
 kinds-0.076
 sizes-0.075
 than-0.073
 sorts-0.071
 textures-0.068
Token there
Token ID612
Feature activation-3.68e+00

Pos logit contributions
at+0.132
affle+0.123
ail+0.118
atmeal+0.117
abella+0.113
Neg logit contributions
 than-0.119
 kinds-0.110
 sizes-0.103
 sorts-0.101
 type-0.101
Token was
Token ID373
Feature activation-1.27e+00

Pos logit contributions
at+0.079
ail+0.077
affle+0.072
 seeker+0.071
ogo+0.070
Neg logit contributions
 kinds-0.074
 than-0.073
 sizes-0.070
 sorts-0.068
 type-0.066
Token the
Token ID262
Feature activation+7.98e-01

Pos logit contributions
 sizes+0.043
 kinds+0.042
 than+0.042
 sorts+0.039
lier+0.037
Neg logit contributions
at-0.040
ail-0.040
affle-0.037
 seeker-0.036
abella-0.035
Token giant
Token ID6175
Feature activation+1.64e+00

Pos logit contributions
 kinds+0.018
 than+0.017
 sizes+0.017
 sorts+0.017
 type+0.015
Neg logit contributions
at-0.015
ail-0.015
 seeker-0.014
affle-0.013
 skating-0.013
Token!
Token ID0
Feature activation+7.10e-01

Pos logit contributions
 kinds+0.030
 sizes+0.030
 than+0.030
 sorts+0.028
 textures+0.027
Neg logit contributions
at-0.036
ail-0.034
affle-0.032
 seeker-0.031
ip-0.030
Tokenâ
Token ID22940
Feature activation+3.60e-01

Pos logit contributions
 kinds+0.015
 than+0.015
 sizes+0.015
 sorts+0.014
 type+0.013
Neg logit contributions
at-0.014
ail-0.014
 seeker-0.013
affle-0.013
abella-0.013
Token
Token ID26391
Feature activation-4.34e+00

Pos logit contributions
 kinds+0.009
 than+0.009
 sizes+0.008
 sorts+0.008
 type+0.008
Neg logit contributions
ail-0.006
at-0.006
 seeker-0.005
affle-0.005
ogo-0.005
Token
Token ID8151
Feature activation-6.80e+00

Pos logit contributions
at+0.086
ail+0.086
 seeker+0.077
affle+0.075
ip+0.074
Neg logit contributions
 kinds-0.094
 than-0.091
 sizes-0.091
 sorts-0.087
 type-0.083
Token
Token ID198
Feature activation+2.33e-01

Pos logit contributions
at+0.118
ail+0.113
affle+0.113
 seeker+0.111
atmeal+0.102
Neg logit contributions
 kinds-0.158
 than-0.157
 sizes-0.149
 sorts-0.147
lier-0.140
Token
Token ID198
Feature activation+1.47e+00

Pos logit contributions
 kinds+0.004
 than+0.004
 sorts+0.004
 sizes+0.004
 type+0.004
Neg logit contributions
at-0.005
ail-0.005
 seeker-0.005
affle-0.005
atmeal-0.005
TokenThe
Token ID464
Feature activation+1.06e+00

Pos logit contributions
 than+0.026
 kinds+0.026
 sizes+0.025
 sorts+0.024
 type+0.022
Neg logit contributions
at-0.035
ail-0.034
affle-0.032
 seeker-0.032
ogo-0.031
Token mouse
Token ID10211
Feature activation+2.95e+00

Pos logit contributions
 kinds+0.018
 than+0.018
 sizes+0.017
 type+0.017
 sorts+0.016
Neg logit contributions
at-0.026
ail-0.025
ogo-0.024
ip-0.024
affle-0.023
Token replied
Token ID8712
Feature activation-1.29e+00

Pos logit contributions
 kinds+0.061
 than+0.061
 sizes+0.059
 sorts+0.057
 type+0.054
Neg logit contributions
at-0.060
ail-0.059
affle-0.054
 seeker-0.053
ogo-0.052
Token enjoy
Token ID2883
Feature activation+6.62e-01

Pos logit contributions
 kinds+0.005
 than+0.005
 sizes+0.005
 sorts+0.005
 type+0.004
Neg logit contributions
at-0.006
ail-0.006
affle-0.006
 seeker-0.006
ip-0.006
Token the
Token ID262
Feature activation-2.79e-01

Pos logit contributions
 than+0.012
 kinds+0.011
 sizes+0.011
 sorts+0.010
 type+0.010
Neg logit contributions
at-0.017
ail-0.016
affle-0.015
ogo-0.015
 seeker-0.015
Token show
Token ID905
Feature activation+5.38e-01

Pos logit contributions
ail+0.006
at+0.006
ogo+0.005
ip+0.005
abella+0.005
Neg logit contributions
 kinds-0.006
 than-0.006
 sorts-0.006
 sizes-0.006
 type-0.005
Token!
Token ID0
Feature activation-9.41e-01

Pos logit contributions
 kinds+0.010
 sizes+0.010
 than+0.009
 sorts+0.009
 type+0.009
Neg logit contributions
at-0.012
ail-0.012
 seeker-0.011
 onwards-0.010
abella-0.010
Token<|endoftext|>
Token ID50256
Feature activation-1.74e+00

Pos logit contributions
ail+0.017
at+0.016
 seeker+0.014
abella+0.014
 onwards+0.014
Neg logit contributions
 than-0.022
 kinds-0.022
 sizes-0.022
 type-0.020
 sorts-0.020
TokenOnce
Token ID7454
Feature activation-9.35e+00

Pos logit contributions
at+0.042
ail+0.042
affle+0.040
 seeker+0.038
ogo+0.038
Neg logit contributions
 kinds-0.030
 than-0.029
 sorts-0.027
 sizes-0.027
 type-0.025
Token upon
Token ID2402
Feature activation+7.32e-01

Pos logit contributions
at+0.154
affle+0.140
 seeker+0.131
atmeal+0.130
pee+0.130
Neg logit contributions
 kinds-0.239
 than-0.235
 sorts-0.224
 sizes-0.217
 type-0.214
Token a
Token ID257
Feature activation-1.88e+00

Pos logit contributions
 sizes+0.018
 than+0.018
 kinds+0.018
 sorts+0.017
 type+0.016
Neg logit contributions
at-0.012
ail-0.011
 seeker-0.011
affle-0.010
pee-0.010
Token time
Token ID640
Feature activation-5.30e+00

Pos logit contributions
at+0.025
ail+0.023
affle+0.022
 seeker+0.021
ip+0.019
Neg logit contributions
 kinds-0.054
 sizes-0.053
 than-0.051
 sorts-0.050
 textures-0.048
Token there
Token ID612
Feature activation-2.97e+00

Pos logit contributions
at+0.125
affle+0.116
ail+0.113
atmeal+0.111
abella+0.109
Neg logit contributions
 than-0.103
 kinds-0.095
 type-0.087
 sizes-0.087
 sorts-0.086
Token was
Token ID373
Feature activation-6.70e-01

Pos logit contributions
at+0.063
ail+0.061
affle+0.057
 seeker+0.056
ogo+0.055
Neg logit contributions
 kinds-0.061
 than-0.060
 sizes-0.058
 sorts-0.056
 type-0.054
Token is
Token ID318
Feature activation-1.65e+00

Pos logit contributions
ail+0.053
at+0.053
 seeker+0.051
affle+0.051
ogo+0.049
Neg logit contributions
 kinds-0.056
 than-0.054
 sizes-0.053
 sorts-0.051
 type-0.048
Token cold
Token ID4692
Feature activation+9.37e-01

Pos logit contributions
ail+0.034
at+0.034
affle+0.031
ip+0.030
ogo+0.030
Neg logit contributions
 than-0.036
 sizes-0.034
 kinds-0.034
 sorts-0.032
 type-0.032
Token outside
Token ID2354
Feature activation-7.35e-01

Pos logit contributions
 than+0.018
 kinds+0.018
 sizes+0.017
 sorts+0.016
 type+0.015
Neg logit contributions
at-0.021
ail-0.021
affle-0.019
 seeker-0.019
ogo-0.019
Token!"
Token ID2474
Feature activation+1.60e+00

Pos logit contributions
at+0.018
ail+0.018
 seeker+0.017
affle+0.017
ogo+0.016
Neg logit contributions
 kinds-0.012
 than-0.012
 sizes-0.011
 sorts-0.011
 type-0.010
Token<|endoftext|>
Token ID50256
Feature activation-1.08e+00

Pos logit contributions
 kinds+0.039
 sizes+0.039
 than+0.039
 sorts+0.036
 type+0.036
Neg logit contributions
ail-0.027
at-0.025
affle-0.023
ip-0.023
abella-0.023
TokenOnce
Token ID7454
Feature activation-8.85e+00

Pos logit contributions
ail+0.018
at+0.018
affle+0.016
 seeker+0.016
ogo+0.015
Neg logit contributions
 kinds-0.027
 than-0.026
 sorts-0.025
 sizes-0.025
 type-0.024
Token upon
Token ID2402
Feature activation+6.20e-02

Pos logit contributions
at+0.141
affle+0.124
atmeal+0.121
 seeker+0.120
ail+0.113
Neg logit contributions
 kinds-0.234
 than-0.231
 sorts-0.218
 sizes-0.217
 type-0.210
Token a
Token ID257
Feature activation-3.02e+00

Pos logit contributions
 kinds+0.001
 sizes+0.001
 than+0.001
 sorts+0.001
 type+0.001
Neg logit contributions
at-0.001
ail-0.001
 seeker-0.001
affle-0.001
 skating-0.001
Token time
Token ID640
Feature activation-5.95e+00

Pos logit contributions
at+0.044
ail+0.042
affle+0.039
 seeker+0.037
ogo+0.035
Neg logit contributions
 kinds-0.083
 sizes-0.081
 than-0.080
 sorts-0.078
 textures-0.074
Token,
Token ID11
Feature activation-4.91e+00

Pos logit contributions
at+0.132
affle+0.124
ail+0.117
atmeal+0.114
 seeker+0.113
Neg logit contributions
 than-0.124
 kinds-0.115
 sizes-0.109
 sorts-0.105
 type-0.104
Token there
Token ID612
Feature activation-3.52e+00

Pos logit contributions
at+0.096
ail+0.089
affle+0.087
atmeal+0.084
ogo+0.081
Neg logit contributions
 than-0.115
 kinds-0.109
 sizes-0.106
 type-0.103
 sorts-0.102
Token
Token ID198
Feature activation-6.40e-02

Pos logit contributions
at+0.098
ail+0.096
affle+0.089
 seeker+0.089
ogo+0.085
Neg logit contributions
 than-0.093
 kinds-0.093
 sizes-0.089
 sorts-0.085
 type-0.081
Token
Token ID198
Feature activation+8.36e-01

Pos logit contributions
at+0.002
ail+0.001
 seeker+0.001
affle+0.001
atmeal+0.001
Neg logit contributions
 kinds-0.001
 than-0.001
 sizes-0.001
 sorts-0.001
 type-0.001
TokenThat
Token ID2504
Feature activation-2.26e+00

Pos logit contributions
 kinds+0.014
 than+0.014
 sizes+0.013
 sorts+0.012
 type+0.012
Neg logit contributions
at-0.021
ail-0.021
affle-0.019
 seeker-0.019
ogo-0.019
Tokenâ
Token ID22940
Feature activation+4.06e-01

Pos logit contributions
at+0.030
ail+0.028
 seeker+0.023
abella+0.023
affle+0.023
Neg logit contributions
 than-0.064
 sizes-0.061
 kinds-0.060
ever-0.059
lier-0.057
Token
Token ID26391
Feature activation-4.66e+00

Pos logit contributions
 than+0.009
 kinds+0.009
 sizes+0.009
 sorts+0.009
 type+0.008
Neg logit contributions
at-0.008
ail-0.008
affle-0.007
 seeker-0.007
ogo-0.007
Token
Token ID8151
Feature activation-6.64e+00

Pos logit contributions
ail+0.084
at+0.083
 seeker+0.072
affle+0.071
ogo+0.070
Neg logit contributions
 kinds-0.111
 than-0.108
 sizes-0.106
 sorts-0.103
 type-0.099
Tokens
Token ID82
Feature activation-7.70e-02

Pos logit contributions
at+0.123
affle+0.117
ail+0.116
 seeker+0.115
ogo+0.107
Neg logit contributions
 kinds-0.148
 than-0.146
 sizes-0.140
 sorts-0.137
lier-0.129
Token how
Token ID703
Feature activation+3.42e-02

Pos logit contributions
at+0.002
ail+0.002
affle+0.002
 seeker+0.002
ogo+0.001
Neg logit contributions
 kinds-0.002
 than-0.002
 sizes-0.001
 sorts-0.001
 type-0.001
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
 than+0.001
 kinds+0.001
 sizes+0.001
 sorts+0.001
 type+0.001
Neg logit contributions
at-0.001
ail-0.001
affle-0.001
 seeker-0.001
ogo-0.001
Token three
Token ID1115
Feature activation+1.52e+00

Pos logit contributions
 kinds+0.022
 than+0.022
 sizes+0.021
 sorts+0.020
 type+0.019
Neg logit contributions
at-0.029
ail-0.029
affle-0.027
 seeker-0.027
ogo-0.026
Token of
Token ID286
Feature activation-5.06e-01

Pos logit contributions
 than+0.031
 sizes+0.030
 kinds+0.029
 sorts+0.028
 target+0.028
Neg logit contributions
at-0.032
ail-0.032
affle-0.028
ip-0.027
ig-0.027