TOP ACTIVATIONS
MAX = 0.162

for saying sorry ," the mushroom said . " That is
!" But the turkey was persistent . It did
" Look , the beetle is awake ," Tom says
stop it !" the scare crow said . " You are
does not understand why the bull is so mean . She
the camera back on the tripod . " Go
back . They wished the bull was peaceful again . <|endoftext|>
, please !" cried the crab . " I 'm in
them ?" The bull looks at the farmer .
" I saw the enormous beetle climbing up a tree !
. Suddenly , the scare crow moved and waved its arms
better ." The beetle sees the water . It
friends . <|endoftext|> The ancient turkey could not fly . He
the children approached , the beetle revealed a secret . It
see what was inside the barrel . Suddenly , his older
Suddenly , the all igator snapped its jaws and bit
went closer to the scare crow and Mom my held the
" I am the pl umber . I am fixing the
time there was a calm bull . He liked to walk
" Hello Mr Man ! Will you swing me

MIN ACTIVATIONS
MIN = -0.124

" Max , you 're such a creative boy !"
so glad I could deliver such joy to you , my
You â re such a clever boy , John
friends thanked her for being such a good leader . Lily
, the little girl added such wonderful noises that it was
showed Bob many things , such as rabbits and flowers .
said , " You 're such a good helper . I
Thank you Jack for finding such a nice shelter ."
was so pleased to have such a long day of fun
so surprised and happy that such a strange creature could be
thanked his Uncle for recommending such a great toy . They
said , " You 're such a gifted , little boy
heart . You 're doing such a great job ," said
ro ggy , you are such a good friend . I
't have ended up in such bad situations .
so excited to have found such a big carrot . They
and hugged . It was such a peaceful moment . Then
we did ! You 're such a smart kid !"
, Lily ! You are such a good builder !" Lily
felt so lucky to have such a wonderful mom who made

INTERVAL 0.091 - 0.162
CONTAINS 4.818%

. The noisy brother and the quiet brother celebrated
can share another one ." Sara and Ben look at each
, Lily saw a little boy crying . She recognized him
so much ?" The old tiger said , " I blink
got worse . Sadly , Spot passed away .

INTERVAL 0.019 - 0.091
CONTAINS 21.127%

the pilot looked down , he could see children looking up
spots on its wings . It moved its wings slowly .
out of some cardboard . He was busy cutting and drawing
and play with me ?" She shook them gently and tried
. He was bleeding and he looked sad .

INTERVAL -0.052 - 0.019
CONTAINS 66.407%

was infectious ! Whenever anyone saw her , they couldn 't
little bear . His name was Bear . He was so
vegetables in the garden . The caul iflower
fun . They became friends and learned to share . They
. After a while , the cow spotted an old cup

INTERVAL -0.124 - -0.052
CONTAINS 7.647%

was scared and tried to get out of the pond .
is cel ery . It is green and long and crunch
are my sister and I like you ." Lily smiled and
little girl named Lily came to the garden . She saw
was an ambulance . Kitty wanted to see what was inside
Token for
Token ID329
Feature activation-2.62e-02

Pos logit contributions
owed+0.071
bee+0.069
op+0.066
ops+0.065
m+0.063
Neg logit contributions
 our-0.070
 fun-0.069
 awesome-0.068
 my-0.065
 sight-0.064
Token saying
Token ID2282
Feature activation-2.70e-02

Pos logit contributions
 sight+0.048
 common+0.046
 vast+0.046
 pleasure+0.046
 thrilling+0.045
Neg logit contributions
m-0.049
es-0.044
owed-0.044
bee-0.044
mallow-0.040
Token sorry
Token ID7926
Feature activation+1.47e-03

Pos logit contributions
 sight+0.037
 common+0.036
 awesome+0.035
 comparison+0.034
 thrilling+0.034
Neg logit contributions
owed-0.060
m-0.060
es-0.057
bee-0.057
op-0.056
Token,"
Token ID553
Feature activation+2.66e-02

Pos logit contributions
owed+0.002
m+0.002
op+0.002
bee+0.002
ops+0.002
Neg logit contributions
 fun-0.003
 sight-0.003
 our-0.003
 awesome-0.003
 purpose-0.003
Token the
Token ID262
Feature activation+6.27e-03

Pos logit contributions
owed+0.070
m+0.067
bee+0.065
op+0.065
es+0.064
Neg logit contributions
 sight-0.032
 fun-0.032
 awesome-0.031
 our-0.030
 common-0.029
Token mushroom
Token ID28520
Feature activation+1.62e-01

Pos logit contributions
owed+0.012
m+0.012
op+0.011
bee+0.011
ird+0.011
Neg logit contributions
 sight-0.012
 awesome-0.011
 fun-0.011
 purpose-0.011
 our-0.011
Token said
Token ID531
Feature activation+8.11e-03

Pos logit contributions
op+0.312
ird+0.292
m+0.286
itive+0.286
mallow+0.284
Neg logit contributions
 fun-0.292
 new-0.274
 old-0.270
 beautiful-0.257
 sight-0.257
Token.
Token ID13
Feature activation+2.18e-02

Pos logit contributions
bee+0.014
owed+0.013
m+0.013
ops+0.013
op+0.013
Neg logit contributions
 sight-0.016
 fun-0.015
 our-0.015
 awesome-0.015
 new-0.015
Token "
Token ID366
Feature activation+4.29e-03

Pos logit contributions
owed+0.048
m+0.047
bee+0.045
op+0.044
ops+0.044
Neg logit contributions
 sight-0.033
 awesome-0.032
 fun-0.032
 common-0.031
 our-0.031
TokenThat
Token ID2504
Feature activation-1.57e-04

Pos logit contributions
owed+0.009
m+0.008
bee+0.008
op+0.008
es+0.007
Neg logit contributions
 awesome-0.007
 sight-0.007
 fun-0.007
 common-0.007
 our-0.007
Token is
Token ID318
Feature activation-4.54e-02

Pos logit contributions
 fun+0.000
 sight+0.000
 awesome+0.000
 our+0.000
 same+0.000
Neg logit contributions
owed-0.000
op-0.000
m-0.000
ird-0.000
bee-0.000
Token!"
Token ID2474
Feature activation+3.02e-02

Pos logit contributions
owed+0.019
m+0.019
bee+0.018
op+0.018
es+0.017
Neg logit contributions
 sight-0.023
 awesome-0.023
 fun-0.022
 common-0.022
 purpose-0.022
Token
Token ID198
Feature activation+1.78e-02

Pos logit contributions
owed+0.054
m+0.053
bee+0.050
op+0.049
es+0.049
Neg logit contributions
 sight-0.058
 awesome-0.058
 common-0.056
 fun-0.056
 purpose-0.055
Token
Token ID198
Feature activation+2.33e-02

Pos logit contributions
owed+0.031
m+0.031
bee+0.029
op+0.028
es+0.028
Neg logit contributions
 sight-0.035
 awesome-0.034
 fun-0.033
 common-0.033
 purpose-0.033
TokenBut
Token ID1537
Feature activation+1.07e-02

Pos logit contributions
m+0.043
es+0.041
owed+0.041
op+0.040
bee+0.040
Neg logit contributions
 sight-0.044
 pleasure-0.043
 awesome-0.042
 purpose-0.040
 common-0.040
Token the
Token ID262
Feature activation-4.08e-03

Pos logit contributions
m+0.025
es+0.023
owed+0.023
bee+0.021
mallow+0.021
Neg logit contributions
 common-0.016
 purpose-0.016
 rightful-0.015
 sight-0.015
 pleasure-0.014
Token turkey
Token ID26876
Feature activation+1.60e-01

Pos logit contributions
 awesome+0.007
 sight+0.007
 common+0.007
 purpose+0.007
 fun+0.007
Neg logit contributions
m-0.007
bee-0.007
owed-0.007
es-0.007
op-0.007
Token was
Token ID373
Feature activation-4.83e-02

Pos logit contributions
owed+0.282
m+0.270
op+0.267
pp+0.254
ird+0.254
Neg logit contributions
 sight-0.303
 fun-0.301
 awesome-0.289
 our-0.281
 common-0.274
Token persistent
Token ID16218
Feature activation+2.90e-03

Pos logit contributions
 purpose+0.084
 pleasure+0.081
 sight+0.076
 amaz+0.074
 harmony+0.074
Neg logit contributions
m-0.100
owed-0.093
bee-0.090
es-0.089
ppy-0.086
Token.
Token ID13
Feature activation+2.00e-02

Pos logit contributions
owed+0.005
m+0.004
bee+0.004
op+0.004
ops+0.004
Neg logit contributions
 fun-0.006
 sight-0.006
 awesome-0.006
 purpose-0.006
 common-0.006
Token It
Token ID632
Feature activation+4.41e-02

Pos logit contributions
m+0.042
owed+0.042
es+0.039
op+0.039
bee+0.038
Neg logit contributions
 sight-0.031
 awesome-0.031
 common-0.030
 purpose-0.030
 comparison-0.029
Token did
Token ID750
Feature activation-5.29e-02

Pos logit contributions
owed+0.074
m+0.073
bee+0.069
op+0.068
es+0.067
Neg logit contributions
 sight-0.089
 awesome-0.088
 fun-0.086
 common-0.086
 purpose-0.084
Token
Token ID198
Feature activation+1.34e-02

Pos logit contributions
owed+0.005
m+0.005
bee+0.005
op+0.005
es+0.004
Neg logit contributions
 sight-0.005
 awesome-0.005
 fun-0.005
 common-0.005
 purpose-0.005
Token"
Token ID1
Feature activation+8.74e-03

Pos logit contributions
owed+0.024
m+0.023
bee+0.022
es+0.021
op+0.021
Neg logit contributions
 awesome-0.026
 sight-0.026
 fun-0.026
 common-0.026
 purpose-0.025
TokenLook
Token ID8567
Feature activation-1.38e-02

Pos logit contributions
owed+0.018
m+0.018
bee+0.017
op+0.017
es+0.017
Neg logit contributions
 sight-0.014
 awesome-0.014
 fun-0.014
 common-0.013
 purpose-0.013
Token,
Token ID11
Feature activation+5.18e-04

Pos logit contributions
 our+0.027
 awesome+0.026
 fun+0.026
 sight+0.026
 my+0.025
Neg logit contributions
m-0.025
owed-0.024
bee-0.024
op-0.022
ops-0.022
Token the
Token ID262
Feature activation-1.04e-02

Pos logit contributions
m+0.001
owed+0.001
es+0.001
bee+0.001
op+0.001
Neg logit contributions
 sight-0.001
 rightful-0.001
 thrilling-0.001
 accomplishment-0.001
 purpose-0.001
Token beetle
Token ID45489
Feature activation+1.60e-01

Pos logit contributions
 awesome+0.022
 common+0.021
 sight+0.021
 our+0.021
 purpose+0.021
Neg logit contributions
m-0.016
bee-0.015
owed-0.014
es-0.014
ops-0.014
Token is
Token ID318
Feature activation-4.33e-02

Pos logit contributions
owed+0.343
m+0.339
bee+0.332
op+0.330
ird+0.312
Neg logit contributions
 awesome-0.246
 fun-0.244
 sight-0.240
 our-0.239
 common-0.223
Token awake
Token ID21693
Feature activation+1.64e-02

Pos logit contributions
 rightful+0.056
 purpose+0.056
 pleasure+0.055
 common+0.055
 accomplishment+0.055
Neg logit contributions
owed-0.090
m-0.089
es-0.088
bee-0.087
ppy-0.085
Token,"
Token ID553
Feature activation+2.77e-02

Pos logit contributions
owed+0.026
m+0.025
bee+0.024
op+0.024
es+0.023
Neg logit contributions
 sight-0.035
 awesome-0.034
 fun-0.034
 common-0.033
 purpose-0.033
Token Tom
Token ID4186
Feature activation+1.00e-01

Pos logit contributions
m+0.062
owed+0.062
bee+0.060
es+0.060
op+0.058
Neg logit contributions
 awesome-0.038
 sight-0.037
 thrilling-0.035
 purpose-0.035
 comparison-0.035
Token says
Token ID1139
Feature activation+7.27e-03

Pos logit contributions
owed+0.224
op+0.219
ird+0.215
ops+0.214
pp+0.212
Neg logit contributions
 fun-0.153
 trouble-0.139
 our-0.136
 my-0.134
 awesome-0.134
Token stop
Token ID2245
Feature activation-2.02e-02

Pos logit contributions
 awesome+0.014
 sight+0.014
 fun+0.013
 our+0.013
 common+0.013
Neg logit contributions
owed-0.021
m-0.020
bee-0.019
es-0.019
op-0.019
Token it
Token ID340
Feature activation+7.42e-02

Pos logit contributions
 awesome+0.041
 sight+0.041
 common+0.040
 fun+0.040
 purpose+0.039
Neg logit contributions
owed-0.033
m-0.033
bee-0.031
op-0.030
es-0.030
Token!"
Token ID2474
Feature activation+3.17e-02

Pos logit contributions
owed+0.151
bee+0.143
m+0.140
op+0.135
es+0.133
Neg logit contributions
 our-0.126
 awesome-0.126
 fun-0.125
 sight-0.119
 my-0.119
Token the
Token ID262
Feature activation+6.12e-03

Pos logit contributions
m+0.072
owed+0.070
es+0.068
op+0.067
bee+0.066
Neg logit contributions
 awesome-0.042
 comparison-0.041
 purpose-0.040
 rightful-0.040
 sight-0.040
Token scare
Token ID19437
Feature activation+3.46e-02

Pos logit contributions
owed+0.013
m+0.013
bee+0.013
op+0.012
es+0.012
Neg logit contributions
 sight-0.010
 awesome-0.009
 fun-0.009
 common-0.009
 purpose-0.009
Tokencrow
Token ID47114
Feature activation+1.59e-01

Pos logit contributions
owed+0.063
m+0.058
bee+0.057
op+0.055
ird+0.054
Neg logit contributions
 sight-0.066
 common-0.063
 our-0.062
 fun-0.061
 awesome-0.061
Token said
Token ID531
Feature activation+6.79e-03

Pos logit contributions
owed+0.275
m+0.272
bee+0.271
erry+0.269
pp+0.267
Neg logit contributions
 fun-0.283
 loud-0.269
 sight-0.265
 common-0.251
 awesome-0.251
Token.
Token ID13
Feature activation+2.66e-02

Pos logit contributions
m+0.011
bee+0.011
owed+0.011
ops+0.011
op+0.010
Neg logit contributions
 sight-0.013
 fun-0.013
 awesome-0.013
 our-0.013
 purpose-0.012
Token "
Token ID366
Feature activation+6.91e-03

Pos logit contributions
owed+0.054
m+0.053
bee+0.051
op+0.050
es+0.050
Neg logit contributions
 sight-0.044
 awesome-0.044
 fun-0.043
 common-0.043
 purpose-0.042
TokenYou
Token ID1639
Feature activation+1.37e-02

Pos logit contributions
owed+0.014
m+0.014
bee+0.013
es+0.013
op+0.013
Neg logit contributions
 sight-0.011
 awesome-0.011
 fun-0.011
 common-0.011
 purpose-0.011
Token are
Token ID389
Feature activation-3.55e-02

Pos logit contributions
owed+0.025
m+0.024
bee+0.024
es+0.022
op+0.022
Neg logit contributions
 awesome-0.026
 fun-0.025
 sight-0.025
 our-0.025
 common-0.025
Token does
Token ID857
Feature activation-5.46e-02

Pos logit contributions
owed+0.051
m+0.050
bee+0.048
es+0.047
op+0.045
Neg logit contributions
 sight-0.059
 purpose-0.057
 awesome-0.056
 common-0.055
 pleasure-0.055
Token not
Token ID407
Feature activation-4.57e-02

Pos logit contributions
 sight+0.093
 pleasure+0.092
 vast+0.088
 harmony+0.088
 rightful+0.086
Neg logit contributions
owed-0.099
m-0.095
bee-0.095
es-0.091
op-0.084
Token understand
Token ID1833
Feature activation-3.14e-02

Pos logit contributions
 fun+0.069
 awesome+0.068
 common+0.066
 sight+0.063
 exciting+0.062
Neg logit contributions
owed-0.103
m-0.101
op-0.100
ops-0.098
es-0.098
Token why
Token ID1521
Feature activation-3.91e-02

Pos logit contributions
 sight+0.057
 awesome+0.057
 fun+0.056
 common+0.055
 purpose+0.054
Neg logit contributions
owed-0.060
m-0.059
bee-0.056
op-0.056
es-0.055
Token the
Token ID262
Feature activation-6.22e-03

Pos logit contributions
 sight+0.056
 awesome+0.056
 fun+0.053
 common+0.053
 purpose+0.052
Neg logit contributions
owed-0.089
m-0.088
bee-0.084
op-0.083
es-0.083
Token bull
Token ID6473
Feature activation+1.59e-01

Pos logit contributions
 sight+0.010
 awesome+0.010
 fun+0.010
 common+0.009
 purpose+0.009
Neg logit contributions
owed-0.014
m-0.013
es-0.013
op-0.013
bee-0.013
Token is
Token ID318
Feature activation-4.19e-02

Pos logit contributions
owed+0.357
op+0.344
bee+0.338
ird+0.337
m+0.333
Neg logit contributions
 fun-0.221
 our-0.208
 awesome-0.205
 sight-0.205
 same-0.203
Token so
Token ID523
Feature activation-4.31e-02

Pos logit contributions
 sight+0.058
 pleasure+0.058
 purpose+0.057
 harmony+0.057
 accomplishment+0.056
Neg logit contributions
m-0.092
bee-0.089
owed-0.088
es-0.084
ppy-0.079
Token mean
Token ID1612
Feature activation+2.58e-03

Pos logit contributions
 sight+0.054
 pleasure+0.054
 awesome+0.052
 harmony+0.051
 rightful+0.051
Neg logit contributions
m-0.104
owed-0.102
bee-0.098
op-0.096
es-0.094
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
owed+0.004
bee+0.004
m+0.004
op+0.004
ops+0.004
Neg logit contributions
 fun-0.005
 sight-0.005
 awesome-0.005
 our-0.005
 purpose-0.005
Token She
Token ID1375
Feature activation+3.51e-02

Pos logit contributions
owed+0.027
m+0.026
bee+0.025
op+0.024
es+0.024
Neg logit contributions
 sight-0.031
 awesome-0.031
 fun-0.030
 common-0.030
 purpose-0.030
Token the
Token ID262
Feature activation-1.91e-02

Pos logit contributions
 thrilling+0.074
 rightful+0.073
 pleasure+0.073
 purpose+0.071
 awesome+0.071
Neg logit contributions
m-0.111
es-0.109
owed-0.102
mine-0.099
mallow-0.097
Token camera
Token ID4676
Feature activation+8.26e-02

Pos logit contributions
 awesome+0.036
 sight+0.035
 purpose+0.034
 comparison+0.034
 fun+0.034
Neg logit contributions
m-0.034
owed-0.032
bee-0.032
op-0.031
ops-0.030
Token back
Token ID736
Feature activation+2.78e-02

Pos logit contributions
owed+0.153
m+0.150
es+0.141
op+0.138
bee+0.138
Neg logit contributions
 sight-0.154
 awesome-0.154
 common-0.150
 purpose-0.148
 pleasure-0.146
Token on
Token ID319
Feature activation-3.22e-02

Pos logit contributions
owed+0.046
m+0.045
bee+0.043
op+0.042
es+0.041
Neg logit contributions
 sight-0.056
 awesome-0.056
 fun-0.055
 common-0.054
 our-0.054
Token the
Token ID262
Feature activation-2.20e-02

Pos logit contributions
 awesome+0.047
 pleasure+0.043
 thrilling+0.043
 fantastic+0.040
 enjoyable+0.040
Neg logit contributions
es-0.070
m-0.069
bee-0.066
op-0.065
mine-0.065
Token tripod
Token ID47439
Feature activation+1.59e-01

Pos logit contributions
 awesome+0.036
 sight+0.036
 common+0.035
 fun+0.034
 comparison+0.034
Neg logit contributions
m-0.044
owed-0.043
bee-0.042
op-0.040
ops-0.040
Token.
Token ID13
Feature activation+8.46e-03

Pos logit contributions
m+0.270
owed+0.267
bee+0.263
itty+0.245
op+0.243
Neg logit contributions
 sight-0.297
 purpose-0.294
 our-0.289
 fun-0.285
 awesome-0.278
Token
Token ID198
Feature activation+9.75e-03

Pos logit contributions
owed+0.012
m+0.012
es+0.012
op+0.011
bee+0.011
Neg logit contributions
 awesome-0.018
 sight-0.018
 comparison-0.017
 rightful-0.017
 ours-0.017
Token
Token ID198
Feature activation+1.97e-02

Pos logit contributions
m+0.016
owed+0.016
op+0.015
es+0.015
bee+0.015
Neg logit contributions
 sight-0.020
 awesome-0.020
 common-0.019
 fun-0.019
 purpose-0.019
Token"
Token ID1
Feature activation+1.52e-02

Pos logit contributions
m+0.031
owed+0.030
es+0.028
op+0.028
bee+0.028
Neg logit contributions
 sight-0.043
 awesome-0.042
 fun-0.041
 purpose-0.041
 common-0.041
TokenGo
Token ID5247
Feature activation+1.03e-02

Pos logit contributions
owed+0.032
m+0.031
bee+0.030
op+0.030
es+0.029
Neg logit contributions
 sight-0.025
 awesome-0.024
 fun-0.024
 common-0.023
 purpose-0.023
Token back
Token ID736
Feature activation+1.91e-02

Pos logit contributions
owed+0.153
m+0.151
bee+0.141
es+0.140
op+0.140
Neg logit contributions
 sight-0.156
 awesome-0.155
 common-0.150
 fun-0.148
 purpose-0.147
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
owed+0.027
m+0.026
bee+0.026
op+0.025
es+0.024
Neg logit contributions
 sight-0.044
 awesome-0.043
 fun-0.043
 common-0.042
 purpose-0.042
Token They
Token ID1119
Feature activation+3.47e-02

Pos logit contributions
m+0.032
es+0.030
op+0.030
owed+0.030
oke+0.027
Neg logit contributions
 awesome-0.046
 sight-0.044
 accomplishment-0.044
 comparison-0.043
 thrilling-0.043
Token wished
Token ID16555
Feature activation-3.04e-02

Pos logit contributions
owed+0.063
m+0.061
op+0.060
ops+0.058
es+0.058
Neg logit contributions
 fun-0.065
 sight-0.062
 awesome-0.062
 common-0.060
 exciting-0.060
Token the
Token ID262
Feature activation-1.91e-02

Pos logit contributions
 awesome+0.063
 purpose+0.062
 sight+0.061
 common+0.061
 rightful+0.060
Neg logit contributions
m-0.051
owed-0.048
es-0.046
bee-0.046
op-0.044
Token bull
Token ID6473
Feature activation+1.57e-01

Pos logit contributions
 awesome+0.033
 sight+0.033
 common+0.033
 purpose+0.032
 fun+0.032
Neg logit contributions
m-0.036
owed-0.036
bee-0.035
op-0.033
es-0.033
Token was
Token ID373
Feature activation-4.10e-02

Pos logit contributions
owed+0.246
m+0.237
op+0.233
bee+0.229
ops+0.222
Neg logit contributions
 fun-0.330
 awesome-0.328
 sight-0.323
 our-0.319
 common-0.315
Token peaceful
Token ID12309
Feature activation+7.00e-03

Pos logit contributions
 amaz+0.065
 purpose+0.060
 sight+0.058
 pleasure+0.058
 vast+0.058
Neg logit contributions
m-0.081
bee-0.078
owed-0.078
es-0.076
ppy-0.072
Token again
Token ID757
Feature activation+1.36e-02

Pos logit contributions
owed+0.010
m+0.010
bee+0.009
op+0.009
es+0.009
Neg logit contributions
 sight-0.016
 awesome-0.016
 fun-0.015
 common-0.015
 purpose-0.015
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
owed+0.020
m+0.019
bee+0.019
op+0.018
es+0.017
Neg logit contributions
 sight-0.031
 fun-0.030
 awesome-0.030
 common-0.030
 purpose-0.029
Token<|endoftext|>
Token ID50256
Feature activation+4.72e-02

Pos logit contributions
m+0.024
owed+0.022
es+0.022
op+0.022
bee+0.019
Neg logit contributions
 awesome-0.040
 accomplishment-0.039
 sight-0.039
 ours-0.038
 comparison-0.038
Token,
Token ID11
Feature activation-5.78e-03

Pos logit contributions
bee+0.076
owed+0.074
ops+0.073
es+0.072
m+0.072
Neg logit contributions
 sight-0.065
 our-0.064
 fun-0.062
 my-0.061
 awesome-0.060
Token please
Token ID3387
Feature activation+3.56e-03

Pos logit contributions
 purpose+0.010
 sight+0.010
 common+0.010
 orderly+0.010
 rightful+0.010
Neg logit contributions
m-0.011
owed-0.010
bee-0.010
op-0.010
ppy-0.010
Token!"
Token ID2474
Feature activation+3.40e-02

Pos logit contributions
owed+0.006
m+0.006
bee+0.006
es+0.006
op+0.006
Neg logit contributions
 sight-0.007
 fun-0.007
 awesome-0.007
 common-0.007
 our-0.007
Token cried
Token ID16896
Feature activation-1.27e-02

Pos logit contributions
owed+0.060
m+0.058
bee+0.055
es+0.053
op+0.053
Neg logit contributions
 sight-0.066
 fun-0.066
 common-0.064
 awesome-0.064
 our-0.063
Token the
Token ID262
Feature activation-5.30e-03

Pos logit contributions
 sight+0.019
 awesome+0.019
 fun+0.018
 common+0.018
 purpose+0.018
Neg logit contributions
owed-0.028
m-0.028
bee-0.027
op-0.026
es-0.026
Token crab
Token ID32202
Feature activation+1.56e-01

Pos logit contributions
 awesome+0.010
 common+0.010
 sight+0.010
 fun+0.010
 purpose+0.010
Neg logit contributions
m-0.010
bee-0.009
owed-0.009
op-0.009
ops-0.008
Token.
Token ID13
Feature activation+3.08e-02

Pos logit contributions
owers+0.322
ops+0.314
ird+0.303
oke+0.299
mallow+0.298
Neg logit contributions
 loud-0.250
 fun-0.249
 new-0.240
 big-0.235
 easy-0.232
Token "
Token ID366
Feature activation+5.12e-03

Pos logit contributions
owed+0.063
m+0.061
bee+0.059
op+0.058
es+0.058
Neg logit contributions
 sight-0.051
 awesome-0.050
 fun-0.049
 common-0.048
 purpose-0.048
TokenI
Token ID40
Feature activation+8.50e-03

Pos logit contributions
owed+0.010
m+0.010
bee+0.010
op+0.010
es+0.009
Neg logit contributions
 sight-0.009
 awesome-0.009
 fun-0.008
 common-0.008
 purpose-0.008
Token'm
Token ID1101
Feature activation-5.14e-02

Pos logit contributions
owed+0.017
bee+0.017
m+0.016
ops+0.016
es+0.016
Neg logit contributions
 fun-0.014
 common-0.014
 sight-0.014
 awesome-0.014
 our-0.013
Token in
Token ID287
Feature activation-1.96e-02

Pos logit contributions
 common+0.089
 purpose+0.087
 sight+0.087
 rightful+0.085
 awesome+0.084
Neg logit contributions
m-0.101
owed-0.098
op-0.094
es-0.093
bee-0.092
Token them
Token ID606
Feature activation+6.11e-02

Pos logit contributions
 sight+0.050
 awesome+0.049
 common+0.046
 fun+0.046
 purpose+0.046
Neg logit contributions
m-0.118
owed-0.118
bee-0.114
es-0.113
op-0.111
Token?"
Token ID1701
Feature activation+3.25e-02

Pos logit contributions
owed+0.118
bee+0.110
m+0.107
ops+0.106
op+0.106
Neg logit contributions
 our-0.113
 fun-0.112
 awesome-0.110
 sight-0.106
 common-0.106
Token
Token ID198
Feature activation+2.56e-02

Pos logit contributions
m+0.068
owed+0.067
bee+0.064
op+0.063
es+0.063
Neg logit contributions
 awesome-0.052
 sight-0.051
 purpose-0.049
 common-0.049
 fun-0.048
Token
Token ID198
Feature activation+2.79e-02

Pos logit contributions
owed+0.046
m+0.045
bee+0.043
op+0.042
es+0.042
Neg logit contributions
 sight-0.049
 awesome-0.049
 fun-0.047
 common-0.047
 purpose-0.046
TokenThe
Token ID464
Feature activation-2.60e-03

Pos logit contributions
m+0.050
owed+0.050
bee+0.047
es+0.047
op+0.047
Neg logit contributions
 sight-0.053
 awesome-0.053
 fun-0.051
 common-0.051
 purpose-0.050
Token bull
Token ID6473
Feature activation+1.56e-01

Pos logit contributions
 sight+0.004
 awesome+0.004
 fun+0.004
 our+0.004
 common+0.004
Neg logit contributions
owed-0.006
m-0.006
es-0.005
op-0.005
bee-0.005
Token looks
Token ID3073
Feature activation-4.56e-02

Pos logit contributions
owed+0.341
op+0.340
ird+0.334
m+0.329
ops+0.319
Neg logit contributions
 fun-0.225
 sight-0.225
 awesome-0.215
 our-0.208
 common-0.207
Token at
Token ID379
Feature activation-3.51e-02

Pos logit contributions
 sight+0.091
 purpose+0.090
 awesome+0.090
 common+0.088
 comparison+0.086
Neg logit contributions
owed-0.077
m-0.076
bee-0.071
es-0.070
op-0.068
Token the
Token ID262
Feature activation-8.66e-03

Pos logit contributions
 awesome+0.051
 pleasure+0.049
 thrilling+0.048
 orderly+0.047
 comparison+0.046
Neg logit contributions
m-0.078
es-0.073
bee-0.073
ie-0.072
owed-0.072
Token farmer
Token ID18739
Feature activation+1.17e-01

Pos logit contributions
 sight+0.015
 awesome+0.014
 fun+0.014
 purpose+0.013
 common+0.013
Neg logit contributions
owed-0.018
m-0.017
es-0.017
op-0.017
bee-0.017
Token.
Token ID13
Feature activation+2.23e-02

Pos logit contributions
owed+0.194
bee+0.188
m+0.186
op+0.183
ird+0.181
Neg logit contributions
 sight-0.224
 fun-0.215
 our-0.215
 awesome-0.212
 purpose-0.206
Token "
Token ID366
Feature activation+5.63e-03

Pos logit contributions
 sight+0.003
 awesome+0.003
 fun+0.003
 our+0.003
 common+0.003
Neg logit contributions
owed-0.005
m-0.004
bee-0.004
op-0.004
ops-0.004
TokenI
Token ID40
Feature activation+3.59e-03

Pos logit contributions
owed+0.013
op+0.012
es+0.012
ops+0.012
ird+0.012
Neg logit contributions
 sight-0.008
 comparison-0.007
 common-0.007
 awesome-0.007
 purpose-0.007
Token saw
Token ID2497
Feature activation-4.95e-02

Pos logit contributions
owed+0.007
m+0.007
bee+0.007
op+0.007
es+0.007
Neg logit contributions
 sight-0.006
 fun-0.006
 common-0.006
 awesome-0.006
 our-0.006
Token the
Token ID262
Feature activation-5.65e-03

Pos logit contributions
 common+0.058
 rightful+0.055
 awesome+0.053
 accomplishment+0.052
 orderly+0.051
Neg logit contributions
m-0.123
owed-0.121
es-0.120
bee-0.117
op-0.113
Token enormous
Token ID9812
Feature activation+1.85e-02

Pos logit contributions
 common+0.012
 awesome+0.012
 our+0.011
 sight+0.011
 purpose+0.011
Neg logit contributions
m-0.009
bee-0.008
owed-0.008
ops-0.008
op-0.008
Token beetle
Token ID45489
Feature activation+1.56e-01

Pos logit contributions
owed+0.031
m+0.030
bee+0.029
op+0.028
es+0.028
Neg logit contributions
 sight-0.038
 awesome-0.037
 fun-0.036
 common-0.036
 purpose-0.036
Token climbing
Token ID14281
Feature activation-3.35e-03

Pos logit contributions
owed+0.306
m+0.295
op+0.294
bee+0.293
ops+0.285
Neg logit contributions
 awesome-0.262
 sight-0.261
 our-0.259
 fun-0.255
 my-0.246
Token up
Token ID510
Feature activation-1.73e-02

Pos logit contributions
 awesome+0.006
 common+0.006
 fun+0.006
 pleasure+0.006
 sight+0.006
Neg logit contributions
m-0.006
owed-0.006
es-0.006
bee-0.006
op-0.006
Token a
Token ID257
Feature activation-2.78e-02

Pos logit contributions
 awesome+0.032
 rightful+0.032
 pleasure+0.031
 common+0.031
 purpose+0.030
Neg logit contributions
m-0.030
owed-0.029
es-0.029
op-0.026
bee-0.026
Token tree
Token ID5509
Feature activation+1.00e-01

Pos logit contributions
 sight+0.048
 awesome+0.048
 fun+0.047
 common+0.047
 purpose+0.046
Neg logit contributions
owed-0.054
m-0.053
bee-0.051
op-0.050
es-0.049
Token!
Token ID0
Feature activation+2.56e-02

Pos logit contributions
owed+0.176
m+0.169
bee+0.167
op+0.160
es+0.158
Neg logit contributions
 sight-0.195
 awesome-0.191
 fun-0.188
 our-0.185
 purpose-0.183
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
owed+0.115
m+0.112
bee+0.106
op+0.105
es+0.105
Neg logit contributions
 awesome-0.090
 sight-0.090
 common-0.088
 fun-0.086
 purpose-0.085
Token Suddenly
Token ID24975
Feature activation+4.29e-03

Pos logit contributions
m+0.032
owed+0.032
op+0.030
bee+0.030
es+0.029
Neg logit contributions
 awesome-0.025
 common-0.024
 sight-0.024
 fun-0.023
 purpose-0.023
Token,
Token ID11
Feature activation-2.87e-03

Pos logit contributions
owed+0.007
m+0.007
bee+0.007
op+0.006
ops+0.006
Neg logit contributions
 sight-0.009
 fun-0.009
 our-0.009
 awesome-0.008
 purpose-0.008
Token the
Token ID262
Feature activation+5.15e-04

Pos logit contributions
 awesome+0.003
 sight+0.003
 common+0.003
 purpose+0.003
 fun+0.003
Neg logit contributions
m-0.008
owed-0.008
bee-0.007
op-0.007
es-0.007
Token scare
Token ID19437
Feature activation+3.76e-02

Pos logit contributions
m+0.001
owed+0.001
bee+0.001
op+0.001
es+0.001
Neg logit contributions
 awesome-0.001
 sight-0.001
 common-0.001
 fun-0.001
 pleasure-0.001
Tokencrow
Token ID47114
Feature activation+1.55e-01

Pos logit contributions
owed+0.061
m+0.058
bee+0.056
es+0.054
op+0.054
Neg logit contributions
 sight-0.079
 awesome-0.075
 purpose-0.074
 fun-0.074
 common-0.074
Token moved
Token ID3888
Feature activation-1.94e-02

Pos logit contributions
owed+0.277
m+0.274
bee+0.274
ird+0.262
ops+0.257
Neg logit contributions
 sight-0.267
 fun-0.260
 beautiful-0.249
 awesome-0.248
 loud-0.243
Token and
Token ID290
Feature activation-2.73e-02

Pos logit contributions
 awesome+0.033
 sight+0.033
 pleasure+0.032
 common+0.032
 purpose+0.032
Neg logit contributions
owed-0.038
m-0.038
es-0.036
bee-0.035
op-0.035
Token waved
Token ID26834
Feature activation-4.13e-02

Pos logit contributions
 awesome+0.036
 sight+0.036
 common+0.034
 fun+0.034
 purpose+0.034
Neg logit contributions
owed-0.065
m-0.064
bee-0.062
es-0.061
op-0.061
Token its
Token ID663
Feature activation-6.53e-03

Pos logit contributions
 awesome+0.064
 common+0.062
 purpose+0.061
 rightful+0.060
 sight+0.059
Neg logit contributions
owed-0.087
es-0.084
m-0.083
bee-0.082
op-0.077
Token arms
Token ID5101
Feature activation+5.79e-02

Pos logit contributions
 awesome+0.009
 sight+0.008
 common+0.008
 fun+0.008
 purpose+0.008
Neg logit contributions
owed-0.016
m-0.015
bee-0.015
op-0.015
es-0.014
Token better
Token ID1365
Feature activation-1.37e-02

Pos logit contributions
 purpose+0.114
 comparison+0.113
 sight+0.113
 outer+0.110
 common+0.108
Neg logit contributions
m-0.121
owed-0.105
es-0.099
bee-0.098
ppy-0.094
Token."
Token ID526
Feature activation+2.16e-02

Pos logit contributions
 sight+0.026
 awesome+0.025
 fun+0.025
 common+0.025
 purpose+0.024
Neg logit contributions
owed-0.025
m-0.024
bee-0.023
op-0.023
es-0.023
Token
Token ID198
Feature activation+1.43e-02

Pos logit contributions
owed+0.044
m+0.042
bee+0.041
op+0.039
es+0.039
Neg logit contributions
 sight-0.037
 fun-0.036
 awesome-0.036
 common-0.036
 our-0.035
Token
Token ID198
Feature activation+1.87e-02

Pos logit contributions
owed+0.027
m+0.026
bee+0.026
op+0.025
es+0.024
Neg logit contributions
 sight-0.026
 awesome-0.026
 fun-0.025
 common-0.025
 our-0.024
TokenThe
Token ID464
Feature activation-6.61e-03

Pos logit contributions
owed+0.032
m+0.031
bee+0.030
op+0.029
es+0.029
Neg logit contributions
 sight-0.038
 awesome-0.037
 fun-0.037
 common-0.036
 purpose-0.036
Token beetle
Token ID45489
Feature activation+1.55e-01

Pos logit contributions
 awesome+0.015
 sight+0.015
 purpose+0.015
 common+0.015
 pleasure+0.015
Neg logit contributions
m-0.009
bee-0.009
owed-0.008
ops-0.008
es-0.008
Token sees
Token ID7224
Feature activation-3.14e-02

Pos logit contributions
owed+0.321
m+0.312
op+0.308
bee+0.297
ird+0.295
Neg logit contributions
 fun-0.252
 sight-0.249
 awesome-0.247
 our-0.236
 common-0.235
Token the
Token ID262
Feature activation-7.51e-03

Pos logit contributions
 rightful+0.044
 orderly+0.043
 purpose+0.042
 pleasure+0.042
 thrilling+0.042
Neg logit contributions
m-0.075
es-0.072
owed-0.069
bee-0.069
oke-0.065
Token water
Token ID1660
Feature activation+9.45e-02

Pos logit contributions
 awesome+0.013
 common+0.013
 purpose+0.013
 our+0.012
 sight+0.012
Neg logit contributions
m-0.015
bee-0.014
owed-0.013
ops-0.013
op-0.012
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
owed+0.144
m+0.140
bee+0.132
op+0.129
es+0.128
Neg logit contributions
 sight-0.206
 awesome-0.204
 fun-0.200
 common-0.199
 purpose-0.197
Token It
Token ID632
Feature activation+3.64e-02

Pos logit contributions
m+0.028
owed+0.028
op+0.027
es+0.027
bee+0.026
Neg logit contributions
 awesome-0.022
 sight-0.021
 common-0.021
 fun-0.021
 purpose-0.021
Token friends
Token ID2460
Feature activation+3.96e-02

Pos logit contributions
 sight+0.035
 fun+0.035
 awesome+0.033
 pleasure+0.031
 our+0.031
Neg logit contributions
owed-0.062
bee-0.059
m-0.059
ops-0.057
op-0.056
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
owed+0.072
bee+0.068
m+0.067
op+0.065
ops+0.063
Neg logit contributions
 fun-0.074
 sight-0.074
 awesome-0.074
 our-0.073
 pleasure-0.070
Token<|endoftext|>
Token ID50256
Feature activation+4.46e-02

Pos logit contributions
owed+0.033
bee+0.031
m+0.031
op+0.029
es+0.029
Neg logit contributions
 sight-0.023
 awesome-0.022
 fun-0.022
 our-0.022
 pleasure-0.022
TokenThe
Token ID464
Feature activation+6.83e-03

Pos logit contributions
owed+0.107
bee+0.103
op+0.096
itive+0.094
ppy+0.093
Neg logit contributions
 sight-0.066
 fun-0.062
 common-0.059
 awesome-0.058
 our-0.056
Token ancient
Token ID6156
Feature activation+4.58e-02

Pos logit contributions
owed+0.016
m+0.016
bee+0.016
op+0.015
es+0.015
Neg logit contributions
 sight-0.009
 awesome-0.009
 common-0.008
 fun-0.008
 purpose-0.008
Token turkey
Token ID26876
Feature activation+1.55e-01

Pos logit contributions
owed+0.105
m+0.098
es+0.095
bee+0.094
op+0.092
Neg logit contributions
 sight-0.074
 fun-0.069
 common-0.067
 purpose-0.066
 our-0.066
Token could
Token ID714
Feature activation-3.59e-02

Pos logit contributions
owed+0.276
m+0.262
op+0.255
bee+0.253
ird+0.243
Neg logit contributions
 fun-0.292
 sight-0.291
 our-0.278
 awesome-0.273
 exciting-0.268
Token not
Token ID407
Feature activation-4.19e-02

Pos logit contributions
 sight+0.060
 awesome+0.058
 common+0.057
 purpose+0.056
 comparison+0.056
Neg logit contributions
owed-0.073
bee-0.070
m-0.069
op-0.066
es-0.065
Token fly
Token ID6129
Feature activation-2.21e-03

Pos logit contributions
 fun+0.052
 common+0.052
 purpose+0.051
 awesome+0.051
 sight+0.050
Neg logit contributions
m-0.100
owed-0.100
es-0.099
op-0.094
oke-0.094
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
 awesome+0.005
 sight+0.004
 pleasure+0.004
 accomplishment+0.004
 rightful+0.004
Neg logit contributions
owed-0.004
m-0.003
es-0.003
bee-0.003
op-0.003
Token He
Token ID679
Feature activation+3.54e-02

Pos logit contributions
owed+0.041
m+0.041
bee+0.039
op+0.039
es+0.038
Neg logit contributions
 sight-0.025
 awesome-0.025
 common-0.024
 fun-0.024
 purpose-0.023
Token the
Token ID262
Feature activation-5.60e-03

Pos logit contributions
 rightful+0.046
 awesome+0.045
 purpose+0.045
 ours+0.044
 comparison+0.043
Neg logit contributions
m-0.037
es-0.036
owed-0.033
bee-0.033
op-0.032
Token children
Token ID1751
Feature activation+1.15e-01

Pos logit contributions
 awesome+0.011
 sight+0.011
 common+0.011
 purpose+0.011
 fun+0.011
Neg logit contributions
m-0.009
owed-0.009
bee-0.009
op-0.009
es-0.008
Token approached
Token ID10448
Feature activation-3.19e-02

Pos logit contributions
owed+0.206
bee+0.200
m+0.192
pp+0.189
op+0.189
Neg logit contributions
 fun-0.202
 sight-0.197
 exciting-0.187
 success-0.182
 joy-0.180
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
 awesome+0.038
 sight+0.038
 fun+0.036
 common+0.036
 purpose+0.035
Neg logit contributions
owed-0.080
m-0.079
bee-0.076
es-0.075
op-0.075
Token the
Token ID262
Feature activation-8.13e-05

Pos logit contributions
 awesome+0.015
 sight+0.015
 purpose+0.015
 common+0.015
 fun+0.015
Neg logit contributions
owed-0.022
m-0.022
es-0.021
bee-0.021
op-0.021
Token beetle
Token ID45489
Feature activation+1.55e-01

Pos logit contributions
 common+0.000
 awesome+0.000
 our+0.000
 purpose+0.000
 sight+0.000
Neg logit contributions
m-0.000
bee-0.000
owed-0.000
ops-0.000
op-0.000
Token revealed
Token ID4602
Feature activation-4.45e-02

Pos logit contributions
m+0.274
owed+0.271
op+0.261
ird+0.253
bee+0.249
Neg logit contributions
 fun-0.289
 sight-0.285
 beautiful-0.267
 exciting-0.267
 awesome-0.265
Token a
Token ID257
Feature activation-3.56e-02

Pos logit contributions
 common+0.054
 awesome+0.054
 accomplishment+0.053
 rightful+0.053
 comparison+0.052
Neg logit contributions
es-0.110
owed-0.107
m-0.101
bee-0.093
op-0.093
Token secret
Token ID3200
Feature activation+2.48e-02

Pos logit contributions
 awesome+0.034
 common+0.031
 our+0.031
 comparison+0.031
 sight+0.031
Neg logit contributions
m-0.094
owed-0.090
bee-0.090
op-0.089
es-0.085
Token.
Token ID13
Feature activation+2.24e-02

Pos logit contributions
owed+0.044
m+0.043
bee+0.041
op+0.040
es+0.040
Neg logit contributions
 sight-0.047
 awesome-0.047
 fun-0.046
 common-0.046
 purpose-0.045
Token It
Token ID632
Feature activation+3.70e-02

Pos logit contributions
owed+0.047
m+0.047
es+0.044
bee+0.044
op+0.044
Neg logit contributions
 awesome-0.035
 sight-0.035
 common-0.033
 fun-0.033
 purpose-0.033
Token see
Token ID766
Feature activation-4.42e-02

Pos logit contributions
 sight+0.047
 awesome+0.047
 common+0.044
 fun+0.044
 purpose+0.043
Neg logit contributions
owed-0.069
m-0.067
bee-0.065
op-0.064
es-0.063
Token what
Token ID644
Feature activation-2.39e-02

Pos logit contributions
 common+0.069
 sight+0.067
 awesome+0.066
 purpose+0.066
 comparison+0.064
Neg logit contributions
m-0.093
owed-0.093
es-0.088
op-0.085
bee-0.085
Token was
Token ID373
Feature activation-2.10e-02

Pos logit contributions
 sight+0.056
 awesome+0.056
 common+0.055
 purpose+0.054
 comparison+0.053
Neg logit contributions
owed-0.031
m-0.031
es-0.028
bee-0.028
op-0.028
Token inside
Token ID2641
Feature activation+1.10e-02

Pos logit contributions
 sight+0.033
 awesome+0.033
 common+0.032
 purpose+0.032
 fun+0.031
Neg logit contributions
owed-0.044
m-0.044
bee-0.042
es-0.041
op-0.040
Token the
Token ID262
Feature activation+2.37e-04

Pos logit contributions
m+0.021
owed+0.020
es+0.019
op+0.019
bee+0.019
Neg logit contributions
 sight-0.020
 awesome-0.020
 common-0.020
 fun-0.019
 purpose-0.019
Token barrel
Token ID9036
Feature activation+1.55e-01

Pos logit contributions
owed+0.001
m+0.000
bee+0.000
op+0.000
es+0.000
Neg logit contributions
 sight-0.000
 awesome-0.000
 fun-0.000
 common-0.000
 purpose-0.000
Token.
Token ID13
Feature activation+2.00e-02

Pos logit contributions
owed+0.271
bee+0.267
m+0.253
op+0.249
es+0.240
Neg logit contributions
 sight-0.303
 fun-0.294
 awesome-0.288
 exciting-0.276
 purpose-0.272
Token Suddenly
Token ID24975
Feature activation+6.38e-03

Pos logit contributions
owed+0.044
m+0.044
bee+0.042
es+0.041
op+0.041
Neg logit contributions
 sight-0.030
 awesome-0.029
 fun-0.029
 purpose-0.028
 common-0.028
Token,
Token ID11
Feature activation-1.27e-03

Pos logit contributions
owed+0.010
m+0.010
bee+0.009
op+0.009
es+0.009
Neg logit contributions
 sight-0.014
 awesome-0.013
 fun-0.013
 common-0.013
 purpose-0.013
Token his
Token ID465
Feature activation-1.23e-02

Pos logit contributions
 sight+0.002
 awesome+0.002
 fun+0.002
 common+0.002
 purpose+0.002
Neg logit contributions
owed-0.003
m-0.003
bee-0.003
op-0.003
es-0.003
Token older
Token ID4697
Feature activation+2.83e-02

Pos logit contributions
 awesome+0.020
 common+0.020
 our+0.020
 comparison+0.019
 ours+0.019
Neg logit contributions
owed-0.023
op-0.023
bee-0.023
m-0.023
es-0.021
Token
Token ID198
Feature activation+2.35e-02

Pos logit contributions
m+0.027
owed+0.027
op+0.025
es+0.025
bee+0.025
Neg logit contributions
 sight-0.024
 awesome-0.024
 common-0.023
 purpose-0.023
 fun-0.023
TokenSuddenly
Token ID38582
Feature activation+1.41e-03

Pos logit contributions
m+0.036
owed+0.035
op+0.034
es+0.034
bee+0.033
Neg logit contributions
 sight-0.053
 awesome-0.051
 purpose-0.050
 pleasure-0.049
 fun-0.049
Token,
Token ID11
Feature activation+6.12e-04

Pos logit contributions
owed+0.002
m+0.002
bee+0.002
es+0.002
op+0.002
Neg logit contributions
 awesome-0.003
 sight-0.003
 fun-0.003
 common-0.003
 purpose-0.003
Token the
Token ID262
Feature activation+5.30e-04

Pos logit contributions
m+0.001
es+0.001
owed+0.001
bee+0.001
ops+0.001
Neg logit contributions
 orderly-0.001
 awesome-0.001
 rightful-0.001
 purpose-0.001
 thrilling-0.001
Token all
Token ID477
Feature activation+2.96e-02

Pos logit contributions
m+0.001
owed+0.001
bee+0.001
es+0.001
op+0.001
Neg logit contributions
 awesome-0.001
 sight-0.001
 common-0.001
 fun-0.001
 purpose-0.001
Tokenigator
Token ID23823
Feature activation+1.55e-01

Pos logit contributions
owed+0.036
m+0.035
bee+0.033
es+0.031
op+0.031
Neg logit contributions
 sight-0.073
 awesome-0.073
 purpose-0.072
 fun-0.072
 common-0.072
Token snapped
Token ID20821
Feature activation-4.10e-02

Pos logit contributions
owed+0.294
m+0.276
bee+0.266
op+0.265
ird+0.261
Neg logit contributions
 sight-0.290
 fun-0.272
 awesome-0.269
 our-0.263
 common-0.259
Token its
Token ID663
Feature activation-6.37e-03

Pos logit contributions
 awesome+0.091
 purpose+0.088
 pleasure+0.087
 rightful+0.087
 common+0.087
Neg logit contributions
owed-0.063
es-0.059
m-0.059
bee-0.055
op-0.052
Token jaws
Token ID39940
Feature activation+4.06e-02

Pos logit contributions
 awesome+0.010
 sight+0.009
 comparison+0.009
 fun+0.009
 thrilling+0.009
Neg logit contributions
owed-0.014
m-0.013
bee-0.013
es-0.013
op-0.012
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
owed+0.056
m+0.055
bee+0.051
es+0.051
op+0.050
Neg logit contributions
 sight-0.094
 awesome-0.093
 common-0.091
 fun-0.091
 purpose-0.090
Token bit
Token ID1643
Feature activation-3.92e-02

Pos logit contributions
 awesome+0.034
 pleasure+0.033
 purpose+0.032
 sight+0.032
 thrilling+0.032
Neg logit contributions
es-0.021
owed-0.021
bee-0.020
m-0.019
ops-0.018
Token went
Token ID1816
Feature activation-2.32e-02

Pos logit contributions
m+0.048
bee+0.047
owed+0.047
ops+0.046
oke+0.046
Neg logit contributions
 fun-0.046
 sight-0.044
 exciting-0.044
 awesome-0.043
 thrilling-0.042
Token closer
Token ID5699
Feature activation+1.00e-04

Pos logit contributions
 awesome+0.035
 sight+0.034
 purpose+0.034
 common+0.033
 comparison+0.033
Neg logit contributions
m-0.052
owed-0.050
op-0.047
bee-0.047
es-0.047
Token to
Token ID284
Feature activation-4.21e-02

Pos logit contributions
owed+0.000
m+0.000
bee+0.000
op+0.000
es+0.000
Neg logit contributions
 sight-0.000
 fun-0.000
 awesome-0.000
 common-0.000
 purpose-0.000
Token the
Token ID262
Feature activation-5.44e-03

Pos logit contributions
 awesome+0.051
 purpose+0.048
 sight+0.047
 common+0.047
 fun+0.047
Neg logit contributions
m-0.102
owed-0.101
bee-0.096
op-0.094
es-0.092
Token scare
Token ID19437
Feature activation+1.18e-02

Pos logit contributions
 awesome+0.009
 sight+0.009
 common+0.009
 pleasure+0.009
 fun+0.009
Neg logit contributions
m-0.011
owed-0.011
bee-0.010
op-0.010
es-0.010
Tokencrow
Token ID47114
Feature activation+1.55e-01

Pos logit contributions
owed+0.018
m+0.018
bee+0.017
op+0.016
es+0.016
Neg logit contributions
 sight-0.026
 awesome-0.025
 fun-0.025
 common-0.025
 purpose-0.024
Token and
Token ID290
Feature activation-1.84e-02

Pos logit contributions
bee+0.282
m+0.268
owed+0.263
ops+0.263
ird+0.256
Neg logit contributions
 fun-0.261
 sight-0.261
 our-0.251
 exciting-0.247
 awesome-0.243
Token Mom
Token ID11254
Feature activation+6.28e-02

Pos logit contributions
 sight+0.032
 awesome+0.031
 fun+0.031
 our+0.030
 common+0.030
Neg logit contributions
owed-0.036
m-0.035
bee-0.035
op-0.033
es-0.033
Tokenmy
Token ID1820
Feature activation+5.90e-02

Pos logit contributions
bee+0.110
ops+0.108
op+0.104
bees+0.104
keys+0.103
Neg logit contributions
 fun-0.104
 new-0.095
 same-0.094
 sight-0.093
 exciting-0.092
Token held
Token ID2714
Feature activation-5.07e-02

Pos logit contributions
bee+0.102
owed+0.099
m+0.098
ops+0.097
itive+0.095
Neg logit contributions
 fun-0.102
 sight-0.097
 exciting-0.095
 beautiful-0.093
 new-0.092
Token the
Token ID262
Feature activation-2.30e-02

Pos logit contributions
 sight+0.057
 awesome+0.057
 fun+0.054
 common+0.054
 purpose+0.053
Neg logit contributions
owed-0.130
m-0.128
bee-0.123
es-0.122
op-0.122
Token "
Token ID366
Feature activation+8.47e-03

Pos logit contributions
owed+0.010
m+0.010
bee+0.010
op+0.010
ops+0.010
Neg logit contributions
 sight-0.007
 awesome-0.007
 fun-0.007
 common-0.006
 our-0.006
TokenI
Token ID40
Feature activation+6.96e-03

Pos logit contributions
owed+0.019
bee+0.018
m+0.017
ops+0.017
op+0.017
Neg logit contributions
 common-0.012
 awesome-0.012
 our-0.012
 sight-0.012
 fun-0.012
Token am
Token ID716
Feature activation-4.64e-02

Pos logit contributions
owed+0.013
bee+0.013
m+0.012
ops+0.012
es+0.012
Neg logit contributions
 common-0.013
 fun-0.012
 sight-0.012
 awesome-0.012
 our-0.012
Token the
Token ID262
Feature activation-9.94e-03

Pos logit contributions
 sight+0.056
 awesome+0.056
 thrilling+0.054
 rightful+0.054
 comparison+0.053
Neg logit contributions
m-0.108
owed-0.108
op-0.107
es-0.105
bee-0.104
Token pl
Token ID458
Feature activation+6.67e-02

Pos logit contributions
 awesome+0.016
 sight+0.015
 comparison+0.015
 thrilling+0.015
 accomplishment+0.015
Neg logit contributions
m-0.021
op-0.019
bee-0.019
owed-0.018
ird-0.018
Tokenumber
Token ID4494
Feature activation+1.55e-01

Pos logit contributions
bee+0.088
m+0.084
ie+0.079
mun+0.079
owed+0.077
Neg logit contributions
 example-0.131
 awesome-0.130
 common-0.127
 purpose-0.126
 backup-0.126
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
owed+0.270
bee+0.259
m+0.258
ops+0.253
op+0.243
Neg logit contributions
 our-0.296
 awesome-0.291
 sight-0.284
 fun-0.283
 common-0.281
Token I
Token ID314
Feature activation+9.85e-03

Pos logit contributions
owed+0.036
m+0.036
bee+0.034
op+0.034
es+0.033
Neg logit contributions
 sight-0.027
 awesome-0.027
 fun-0.026
 common-0.026
 purpose-0.025
Token am
Token ID716
Feature activation-4.98e-02

Pos logit contributions
owed+0.019
m+0.018
bee+0.018
es+0.017
op+0.017
Neg logit contributions
 common-0.018
 awesome-0.017
 fun-0.017
 sight-0.017
 our-0.017
Token fixing
Token ID18682
Feature activation-2.51e-02

Pos logit contributions
 sight+0.072
 purpose+0.070
 thrilling+0.069
 awesome+0.069
 comparison+0.069
Neg logit contributions
m-0.104
owed-0.101
op-0.099
es-0.099
ird-0.096
Token the
Token ID262
Feature activation-1.85e-02

Pos logit contributions
 sight+0.025
 comparison+0.024
 pleasure+0.024
 awesome+0.024
 common+0.024
Neg logit contributions
es-0.064
m-0.064
owed-0.063
op-0.061
bee-0.061
Token time
Token ID640
Feature activation+2.94e-03

Pos logit contributions
 sight+0.014
 awesome+0.014
 fun+0.014
 common+0.013
 purpose+0.013
Neg logit contributions
owed-0.051
m-0.050
bee-0.048
es-0.048
op-0.048
Token there
Token ID612
Feature activation+2.23e-02

Pos logit contributions
owed+0.005
bee+0.005
m+0.005
ops+0.005
es+0.005
Neg logit contributions
 sight-0.006
 fun-0.006
 awesome-0.005
 our-0.005
 common-0.005
Token was
Token ID373
Feature activation-5.26e-02

Pos logit contributions
owed+0.035
bee+0.033
m+0.033
op+0.032
es+0.031
Neg logit contributions
 sight-0.047
 fun-0.046
 awesome-0.046
 common-0.045
 purpose-0.045
Token a
Token ID257
Feature activation-1.17e-02

Pos logit contributions
 sight+0.049
 rightful+0.048
 common+0.047
 awesome+0.046
 purpose+0.046
Neg logit contributions
m-0.141
owed-0.138
bee-0.136
op-0.131
es-0.131
Token calm
Token ID9480
Feature activation-4.83e-03

Pos logit contributions
 fun+0.013
 sight+0.013
 awesome+0.013
 common+0.012
 our+0.011
Neg logit contributions
owed-0.032
es-0.030
m-0.030
ops-0.029
bee-0.029
Token bull
Token ID6473
Feature activation+1.55e-01

Pos logit contributions
 sight+0.009
 fun+0.009
 awesome+0.008
 purpose+0.008
 common+0.008
Neg logit contributions
owed-0.009
m-0.009
es-0.009
op-0.008
ops-0.008
Token.
Token ID13
Feature activation+2.11e-02

Pos logit contributions
owed+0.314
ops+0.309
ird+0.303
m+0.302
oke+0.292
Neg logit contributions
 sight-0.237
 fun-0.228
 our-0.225
 its-0.220
 their-0.217
Token He
Token ID679
Feature activation+3.82e-02

Pos logit contributions
owed+0.049
m+0.048
bee+0.047
op+0.046
es+0.046
Neg logit contributions
 sight-0.029
 awesome-0.029
 fun-0.028
 common-0.028
 purpose-0.027
Token liked
Token ID8288
Feature activation-7.40e-02

Pos logit contributions
owed+0.075
m+0.072
op+0.067
ird+0.066
es+0.065
Neg logit contributions
 fun-0.070
 awesome-0.067
 sight-0.066
 common-0.065
 exciting-0.063
Token to
Token ID284
Feature activation-4.26e-02

Pos logit contributions
 common+0.129
 rightful+0.128
 awesome+0.122
 comparison+0.118
 purpose+0.118
Neg logit contributions
m-0.140
bee-0.139
op-0.134
es-0.129
owed-0.128
Token walk
Token ID2513
Feature activation-8.18e-03

Pos logit contributions
 sight+0.065
 awesome+0.064
 fun+0.062
 common+0.062
 purpose+0.061
Neg logit contributions
owed-0.093
m-0.092
bee-0.087
op-0.086
es-0.086
Token
Token ID198
Feature activation+1.54e-03

Pos logit contributions
 awesome+0.007
 sight+0.007
 fun+0.007
 purpose+0.007
 common+0.007
Neg logit contributions
owed-0.008
bee-0.008
m-0.008
op-0.007
ops-0.007
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
owed+0.003
m+0.003
bee+0.003
es+0.003
op+0.003
Neg logit contributions
 sight-0.003
 awesome-0.003
 common-0.003
 fun-0.003
 purpose-0.003
Token"
Token ID1
Feature activation+1.43e-02

Pos logit contributions
owed+0.019
m+0.018
bee+0.018
es+0.017
op+0.017
Neg logit contributions
 awesome-0.020
 sight-0.020
 fun-0.020
 common-0.019
 purpose-0.019
TokenHello
Token ID15496
Feature activation+1.31e-02

Pos logit contributions
owed+0.034
op+0.032
ops+0.031
es+0.031
m+0.031
Neg logit contributions
 awesome-0.020
 sight-0.020
 fun-0.020
 common-0.019
 thrilling-0.019
Token Mr
Token ID1770
Feature activation+4.98e-02

Pos logit contributions
owed+0.027
bee+0.025
ops+0.025
m+0.024
op+0.024
Neg logit contributions
 sight-0.023
 awesome-0.022
 fun-0.022
 our-0.022
 my-0.021
Token Man
Token ID1869
Feature activation+1.55e-01

Pos logit contributions
owed+0.095
bee+0.089
op+0.088
m+0.085
ops+0.084
Neg logit contributions
 common-0.089
 sight-0.089
 our-0.086
 fun-0.085
 rightful-0.085
Token!
Token ID0
Feature activation+3.97e-02

Pos logit contributions
bee+0.280
ops+0.273
op+0.272
owed+0.270
m+0.266
Neg logit contributions
 our-0.263
 my-0.254
 awesome-0.244
 yours-0.240
 your-0.240
Token Will
Token ID2561
Feature activation+5.31e-02

Pos logit contributions
owed+0.090
bee+0.085
ops+0.085
es+0.080
m+0.079
Neg logit contributions
 sight-0.058
 pleasure-0.055
 my-0.054
 our-0.053
 awesome-0.053
Token you
Token ID345
Feature activation+2.03e-02

Pos logit contributions
owed+0.102
m+0.101
bee+0.099
op+0.096
es+0.095
Neg logit contributions
 awesome-0.093
 fun-0.092
 sight-0.091
 common-0.090
 our-0.090
Token swing
Token ID9628
Feature activation+3.34e-02

Pos logit contributions
owed+0.039
bee+0.039
m+0.038
op+0.037
es+0.037
Neg logit contributions
 awesome-0.035
 fun-0.034
 sight-0.033
 common-0.033
 pleasure-0.032
Token me
Token ID502
Feature activation+4.69e-02

Pos logit contributions
owed+0.065
m+0.064
es+0.060
bee+0.060
op+0.059
Neg logit contributions
 sight-0.059
 awesome-0.057
 common-0.056
 comparison-0.056
 purpose-0.056
Token "
Token ID366
Feature activation-1.81e-02

Pos logit contributions
 sight+0.003
 awesome+0.003
 fun+0.003
 common+0.003
 purpose+0.003
Neg logit contributions
owed-0.004
m-0.004
bee-0.004
op-0.004
es-0.004
TokenMax
Token ID11518
Feature activation+7.77e-02

Pos logit contributions
 awesome+0.025
 sight+0.025
 fun+0.025
 common+0.025
 purpose+0.024
Neg logit contributions
owed-0.042
m-0.039
es-0.039
bee-0.038
op-0.038
Token,
Token ID11
Feature activation-1.99e-02

Pos logit contributions
owed+0.126
m+0.124
bee+0.117
op+0.114
es+0.112
Neg logit contributions
 sight-0.158
 awesome-0.157
 fun-0.155
 common-0.153
 our-0.153
Token you
Token ID345
Feature activation-1.30e-03

Pos logit contributions
 sight+0.033
 awesome+0.032
 common+0.032
 purpose+0.031
 fun+0.031
Neg logit contributions
owed-0.040
m-0.040
bee-0.039
es-0.037
op-0.037
Token're
Token ID821
Feature activation-6.13e-02

Pos logit contributions
 sight+0.003
 awesome+0.003
 fun+0.002
 common+0.002
 purpose+0.002
Neg logit contributions
owed-0.002
m-0.002
bee-0.002
op-0.002
es-0.002
Token such
Token ID884
Feature activation-9.99e-02

Pos logit contributions
 common+0.098
 sight+0.095
 rightful+0.094
 purpose+0.093
 outer+0.091
Neg logit contributions
m-0.128
bee-0.112
es-0.110
owed-0.109
mallow-0.106
Token a
Token ID257
Feature activation-5.69e-02

Pos logit contributions
 destination+0.136
 comparison+0.131
 sight+0.128
rous+0.123
 harmony+0.121
Neg logit contributions
bee-0.191
m-0.183
ird-0.177
owed-0.175
es-0.175
Token creative
Token ID7325
Feature activation-3.10e-02

Pos logit contributions
 comparison+0.058
 sight+0.055
 ours+0.054
 outer+0.054
 common+0.053
Neg logit contributions
m-0.135
bee-0.130
op-0.129
ird-0.123
itty-0.121
Token boy
Token ID2933
Feature activation+7.92e-02

Pos logit contributions
 awesome+0.052
 sight+0.050
 common+0.049
 fun+0.048
 comparison+0.048
Neg logit contributions
m-0.060
owed-0.060
bee-0.060
op-0.057
es-0.056
Token!"
Token ID2474
Feature activation+1.14e-02

Pos logit contributions
owed+0.146
bee+0.134
m+0.133
ops+0.129
op+0.127
Neg logit contributions
 our-0.154
 fun-0.151
 awesome-0.148
 sight-0.143
 my-0.142
Token
Token ID198
Feature activation-5.57e-03

Pos logit contributions
owed+0.026
m+0.026
bee+0.025
op+0.024
ops+0.024
Neg logit contributions
 sight-0.016
 fun-0.015
 our-0.015
 awesome-0.015
 common-0.015
Token so
Token ID523
Feature activation-7.37e-02

Pos logit contributions
owed+0.018
m+0.017
bee+0.017
op+0.016
ops+0.016
Neg logit contributions
 awesome-0.015
 fun-0.014
 sight-0.014
 common-0.014
 our-0.014
Token glad
Token ID9675
Feature activation-5.88e-02

Pos logit contributions
 sight+0.112
 purpose+0.111
 common+0.109
 awesome+0.108
 harmony+0.107
Neg logit contributions
owed-0.153
m-0.152
bee-0.143
op-0.139
es-0.134
Token I
Token ID314
Feature activation-1.69e-02

Pos logit contributions
 awesome+0.098
 sight+0.098
 common+0.095
 fun+0.094
 purpose+0.094
Neg logit contributions
owed-0.118
m-0.116
bee-0.111
es-0.108
op-0.108
Token could
Token ID714
Feature activation-6.66e-02

Pos logit contributions
 fun+0.034
 awesome+0.033
 sight+0.033
 common+0.033
 pleasure+0.032
Neg logit contributions
owed-0.027
op-0.027
bee-0.026
es-0.026
m-0.025
Token deliver
Token ID5203
Feature activation-6.52e-02

Pos logit contributions
 sight+0.105
 awesome+0.103
 common+0.099
 fun+0.098
 purpose+0.097
Neg logit contributions
owed-0.141
m-0.138
bee-0.134
op-0.131
es-0.131
Token such
Token ID884
Feature activation-9.98e-02

Pos logit contributions
 comparison+0.083
 sight+0.079
 awesome+0.079
 common+0.077
 rightful+0.076
Neg logit contributions
owed-0.148
es-0.148
m-0.147
op-0.137
ops-0.136
Token joy
Token ID8716
Feature activation-8.19e-03

Pos logit contributions
 comparison+0.114
 sidelines+0.104
 common+0.101
 sight+0.101
 awesome+0.100
Neg logit contributions
owed-0.224
es-0.217
ird-0.208
bee-0.208
m-0.205
Token to
Token ID284
Feature activation-5.56e-02

Pos logit contributions
 awesome+0.016
 sight+0.016
 fun+0.016
 our+0.016
 common+0.016
Neg logit contributions
owed-0.014
m-0.014
bee-0.013
op-0.013
ops-0.012
Token you
Token ID345
Feature activation+1.00e-02

Pos logit contributions
 sight+0.076
 awesome+0.074
 common+0.070
 fun+0.069
 comparison+0.069
Neg logit contributions
owed-0.125
m-0.124
bee-0.119
es-0.119
op-0.114
Token,
Token ID11
Feature activation-2.84e-02

Pos logit contributions
owed+0.016
m+0.015
bee+0.014
op+0.014
es+0.014
Neg logit contributions
 awesome-0.021
 sight-0.021
 fun-0.021
 common-0.021
 purpose-0.021
Token my
Token ID616
Feature activation-3.85e-02

Pos logit contributions
 sight+0.045
 awesome+0.044
 common+0.043
 purpose+0.043
 comparison+0.042
Neg logit contributions
m-0.058
owed-0.057
bee-0.056
es-0.053
op-0.053
TokenYou
Token ID1639
Feature activation-3.82e-03

Pos logit contributions
owed+0.019
m+0.018
bee+0.018
op+0.017
es+0.017
Neg logit contributions
 sight-0.017
 awesome-0.017
 fun-0.016
 common-0.016
 purpose-0.016
Tokenâ
Token ID22940
Feature activation+2.32e-02

Pos logit contributions
 awesome+0.007
 sight+0.007
 fun+0.007
 common+0.007
 our+0.007
Neg logit contributions
owed-0.007
m-0.007
bee-0.006
op-0.006
es-0.006
Token
Token ID26391
Feature activation+5.23e-02

Pos logit contributions
es+0.048
bee+0.048
owed+0.047
m+0.047
op+0.047
Neg logit contributions
 awesome-0.035
 common-0.034
 fun-0.034
 sight-0.034
 purpose-0.033
Token
Token ID8151
Feature activation+6.27e-02

Pos logit contributions
owed+0.093
bee+0.092
m+0.092
es+0.080
op+0.079
Neg logit contributions
 sight-0.097
 awesome-0.097
 fun-0.096
 common-0.094
 purpose-0.093
Tokenre
Token ID260
Feature activation-2.78e-02

Pos logit contributions
bee+0.083
owed+0.072
bees+0.068
itive+0.067
int+0.066
Neg logit contributions
 awesome-0.141
 fun-0.139
 our-0.138
 sight-0.135
 purpose-0.133
Token such
Token ID884
Feature activation-9.98e-02

Pos logit contributions
 sight+0.039
 purpose+0.036
 common+0.036
 comparison+0.035
 awesome+0.034
Neg logit contributions
owed-0.065
m-0.064
es-0.060
op-0.059
bee-0.058
Token a
Token ID257
Feature activation-5.17e-02

Pos logit contributions
 destination+0.141
 sight+0.136
 common+0.135
 comparison+0.133
rous+0.129
Neg logit contributions
ird-0.187
m-0.187
bee-0.184
es-0.183
owed-0.176
Token clever
Token ID14169
Feature activation-2.15e-02

Pos logit contributions
 common+0.048
 sight+0.047
 comparison+0.046
 ours+0.046
 purpose+0.044
Neg logit contributions
m-0.130
op-0.125
bee-0.124
ird-0.122
owed-0.118
Token boy
Token ID2933
Feature activation+9.92e-02

Pos logit contributions
 sight+0.036
 awesome+0.036
 fun+0.035
 common+0.035
 purpose+0.034
Neg logit contributions
owed-0.043
m-0.043
bee-0.041
op-0.040
es-0.040
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
owed+0.179
bee+0.172
m+0.169
ops+0.160
op+0.158
Neg logit contributions
 awesome-0.185
 our-0.185
 fun-0.183
 sight-0.179
 my-0.174
Token John
Token ID1757
Feature activation+8.12e-02

Pos logit contributions
 sight+0.024
 common+0.023
 awesome+0.023
 purpose+0.023
 comparison+0.022
Neg logit contributions
m-0.029
owed-0.028
bee-0.027
op-0.027
es-0.026
Token friends
Token ID2460
Feature activation+2.64e-02

Pos logit contributions
owed+0.253
owers+0.236
pp+0.234
ird+0.232
m+0.230
Neg logit contributions
 fun-0.210
 our-0.197
 sight-0.195
 big-0.194
 loud-0.193
Token thanked
Token ID26280
Feature activation-6.92e-02

Pos logit contributions
op+0.046
owed+0.044
m+0.043
ops+0.042
bee+0.041
Neg logit contributions
 fun-0.052
 awesome-0.048
 our-0.048
 sight-0.048
 exciting-0.047
Token her
Token ID607
Feature activation-7.74e-03

Pos logit contributions
 sight+0.071
 awesome+0.070
 fun+0.066
 common+0.066
 purpose+0.064
Neg logit contributions
owed-0.184
m-0.182
bee-0.176
es-0.174
op-0.172
Token for
Token ID329
Feature activation-5.44e-02

Pos logit contributions
 sight+0.013
 awesome+0.013
 fun+0.013
 common+0.013
 purpose+0.013
Neg logit contributions
owed-0.015
m-0.015
bee-0.014
op-0.014
es-0.014
Token being
Token ID852
Feature activation-5.37e-02

Pos logit contributions
 common+0.073
 purpose+0.070
 sight+0.070
 comparison+0.069
 awesome+0.069
Neg logit contributions
m-0.124
es-0.120
owed-0.120
bee-0.115
ird-0.112
Token such
Token ID884
Feature activation-9.97e-02

Pos logit contributions
 sight+0.074
 comparison+0.074
 purpose+0.072
 common+0.072
 accomplishment+0.071
Neg logit contributions
m-0.122
es-0.116
owed-0.116
ird-0.107
itty-0.105
Token a
Token ID257
Feature activation-5.59e-02

Pos logit contributions
 sight+0.118
 comparison+0.117
 destination+0.112
 thrilling+0.106
 sidelines+0.104
Neg logit contributions
owed-0.218
m-0.216
es-0.213
bee-0.211
ird-0.199
Token good
Token ID922
Feature activation-4.67e-02

Pos logit contributions
 comparison+0.062
 awesome+0.060
 ours+0.060
 sight+0.060
 common+0.057
Neg logit contributions
m-0.134
op-0.125
bee-0.124
ird-0.123
owed-0.122
Token leader
Token ID3554
Feature activation+3.96e-02

Pos logit contributions
 awesome+0.095
 sight+0.092
 common+0.090
 fun+0.090
 purpose+0.089
Neg logit contributions
m-0.077
owed-0.076
bee-0.073
op-0.070
es-0.069
Token.
Token ID13
Feature activation+4.93e-03

Pos logit contributions
owed+0.065
m+0.060
bee+0.060
op+0.058
ops+0.057
Neg logit contributions
 sight-0.081
 awesome-0.081
 fun-0.081
 our-0.079
 common-0.078
Token Lily
Token ID20037
Feature activation+6.16e-02

Pos logit contributions
owed+0.010
m+0.010
bee+0.010
op+0.010
es+0.010
Neg logit contributions
 sight-0.008
 awesome-0.008
 fun-0.007
 common-0.007
 our-0.007
Token,
Token ID11
Feature activation-3.58e-02

Pos logit contributions
 sight+0.001
 awesome+0.001
 fun+0.001
 common+0.001
 our+0.001
Neg logit contributions
owed-0.001
m-0.001
bee-0.001
es-0.001
op-0.001
Token the
Token ID262
Feature activation-3.89e-02

Pos logit contributions
 sight+0.058
 awesome+0.057
 fun+0.055
 common+0.055
 purpose+0.054
Neg logit contributions
owed-0.074
m-0.073
bee-0.070
op-0.069
es-0.069
Token little
Token ID1310
Feature activation-1.70e-02

Pos logit contributions
 awesome+0.063
 sight+0.062
 common+0.061
 fun+0.059
 purpose+0.059
Neg logit contributions
m-0.078
owed-0.077
bee-0.075
op-0.073
es-0.071
Token girl
Token ID2576
Feature activation+8.22e-02

Pos logit contributions
 sight+0.036
 awesome+0.035
 fun+0.035
 common+0.034
 purpose+0.034
Neg logit contributions
owed-0.028
m-0.026
es-0.025
bee-0.025
op-0.024
Token added
Token ID2087
Feature activation-6.29e-02

Pos logit contributions
owed+0.135
m+0.131
bee+0.127
op+0.124
ops+0.119
Neg logit contributions
 fun-0.162
 sight-0.161
 awesome-0.158
 our-0.157
 common-0.153
Token such
Token ID884
Feature activation-9.96e-02

Pos logit contributions
 comparison+0.074
 awesome+0.074
 common+0.073
 rightful+0.073
 sight+0.071
Neg logit contributions
m-0.155
owed-0.151
es-0.148
bee-0.143
op-0.142
Token wonderful
Token ID7932
Feature activation-5.53e-02

Pos logit contributions
 comparison+0.112
 destination+0.109
 sidelines+0.108
 common+0.107
 harmony+0.106
Neg logit contributions
bee-0.206
m-0.201
owed-0.199
ird-0.194
op-0.188
Token noises
Token ID26782
Feature activation+3.81e-02

Pos logit contributions
 awesome+0.077
 common+0.072
 sight+0.070
 comparison+0.070
 ours+0.070
Neg logit contributions
m-0.123
owed-0.123
bee-0.119
es-0.114
op-0.114
Token that
Token ID326
Feature activation-3.63e-02

Pos logit contributions
owed+0.064
m+0.061
bee+0.059
op+0.057
es+0.056
Neg logit contributions
 sight-0.077
 awesome-0.077
 fun-0.076
 common-0.074
 our-0.074
Token it
Token ID340
Feature activation+2.03e-02

Pos logit contributions
 awesome+0.064
 sight+0.064
 common+0.061
 purpose+0.061
 fun+0.060
Neg logit contributions
owed-0.069
m-0.069
bee-0.065
es-0.064
op-0.063
Token was
Token ID373
Feature activation-7.77e-02

Pos logit contributions
owed+0.032
m+0.031
bee+0.030
op+0.030
es+0.029
Neg logit contributions
 sight-0.042
 awesome-0.042
 fun-0.042
 common-0.041
 our-0.040
Token showed
Token ID3751
Feature activation-7.83e-02

Pos logit contributions
owed+0.098
m+0.096
bee+0.095
op+0.093
ops+0.090
Neg logit contributions
 sight-0.114
 fun-0.112
 awesome-0.109
 our-0.107
 same-0.105
Token Bob
Token ID5811
Feature activation+7.30e-02

Pos logit contributions
 common+0.125
 rightful+0.123
 comparison+0.118
 awesome+0.117
 harmony+0.116
Neg logit contributions
owed-0.162
es-0.152
ie-0.149
m-0.148
bee-0.147
Token many
Token ID867
Feature activation-5.06e-02

Pos logit contributions
owed+0.188
m+0.185
bee+0.179
op+0.177
es+0.175
Neg logit contributions
 sight-0.082
 awesome-0.081
 fun-0.078
 common-0.076
 purpose-0.075
Token things
Token ID1243
Feature activation+2.11e-02

Pos logit contributions
 awesome+0.055
 sight+0.055
 rightful+0.054
 common+0.054
 purpose+0.053
Neg logit contributions
m-0.121
owed-0.120
bee-0.119
es-0.118
op-0.114
Token,
Token ID11
Feature activation-2.99e-02

Pos logit contributions
owed+0.038
m+0.037
es+0.035
bee+0.034
op+0.034
Neg logit contributions
 sight-0.041
 awesome-0.040
 purpose-0.039
 common-0.039
 fun-0.038
Token such
Token ID884
Feature activation-9.89e-02

Pos logit contributions
 purpose+0.059
 sight+0.058
 common+0.056
 rightful+0.056
 awesome+0.056
Neg logit contributions
owed-0.050
m-0.049
bee-0.047
es-0.047
bees-0.042
Token as
Token ID355
Feature activation-3.93e-02

Pos logit contributions
 destination+0.121
 sight+0.115
 sidelines+0.112
rous+0.112
 comparison+0.108
Neg logit contributions
bee-0.207
owed-0.201
es-0.186
bees-0.184
ppy-0.184
Token rabbits
Token ID33043
Feature activation+8.58e-02

Pos logit contributions
 sight+0.043
 purpose+0.041
 awesome+0.040
 rightful+0.040
 common+0.039
Neg logit contributions
owed-0.098
m-0.096
es-0.095
bee-0.094
op-0.091
Token and
Token ID290
Feature activation-3.37e-02

Pos logit contributions
owed+0.148
m+0.144
bee+0.137
op+0.134
es+0.132
Neg logit contributions
 sight-0.168
 awesome-0.167
 fun-0.165
 common-0.162
 our-0.159
Token flowers
Token ID12734
Feature activation+4.64e-02

Pos logit contributions
 sight+0.053
 awesome+0.053
 purpose+0.050
 common+0.050
 comparison+0.049
Neg logit contributions
bee-0.066
owed-0.066
es-0.065
m-0.065
bees-0.060
Token.
Token ID13
Feature activation+3.92e-03

Pos logit contributions
owed+0.071
m+0.069
bee+0.066
op+0.064
es+0.062
Neg logit contributions
 sight-0.099
 awesome-0.099
 fun-0.098
 common-0.096
 our-0.095
Token said
Token ID531
Feature activation-3.59e-02

Pos logit contributions
 sight+0.086
 awesome+0.086
 common+0.084
 purpose+0.084
 fun+0.083
Neg logit contributions
owed-0.039
m-0.038
bee-0.034
es-0.034
op-0.032
Token,
Token ID11
Feature activation-4.33e-03

Pos logit contributions
 sight+0.070
 awesome+0.070
 fun+0.069
 common+0.067
 purpose+0.067
Neg logit contributions
owed-0.062
m-0.061
bee-0.058
op-0.057
es-0.056
Token "
Token ID366
Feature activation-4.16e-03

Pos logit contributions
 sight+0.006
 awesome+0.006
 fun+0.006
 our+0.006
 common+0.006
Neg logit contributions
owed-0.010
m-0.010
bee-0.009
ops-0.009
op-0.009
TokenYou
Token ID1639
Feature activation-9.70e-03

Pos logit contributions
 sight+0.007
 awesome+0.007
 fun+0.007
 common+0.007
 purpose+0.007
Neg logit contributions
owed-0.008
m-0.008
bee-0.008
op-0.008
es-0.008
Token're
Token ID821
Feature activation-6.08e-02

Pos logit contributions
 awesome+0.022
 fun+0.022
 sight+0.022
 common+0.021
 our+0.021
Neg logit contributions
owed-0.014
m-0.013
bee-0.013
op-0.012
es-0.012
Token such
Token ID884
Feature activation-9.82e-02

Pos logit contributions
 common+0.084
 sight+0.084
 purpose+0.079
 comparison+0.078
 thrilling+0.075
Neg logit contributions
m-0.144
owed-0.125
mallow-0.124
ird-0.123
es-0.122
Token a
Token ID257
Feature activation-5.45e-02

Pos logit contributions
 destination+0.140
 sight+0.133
 comparison+0.130
 sidelines+0.120
 thrilling+0.117
Neg logit contributions
m-0.193
ird-0.190
bee-0.190
owed-0.184
es-0.183
Token good
Token ID922
Feature activation-4.10e-02

Pos logit contributions
 comparison+0.057
 sight+0.057
 common+0.055
 ours+0.054
 purpose+0.052
Neg logit contributions
m-0.132
op-0.125
bee-0.121
ird-0.121
itty-0.119
Token helper
Token ID31904
Feature activation+7.58e-02

Pos logit contributions
 sight+0.075
 fun+0.072
 awesome+0.072
 purpose+0.071
 common+0.070
Neg logit contributions
owed-0.080
m-0.077
es-0.074
bee-0.073
op-0.072
Token.
Token ID13
Feature activation+6.10e-03

Pos logit contributions
owed+0.147
bee+0.132
m+0.130
op+0.129
ops+0.127
Neg logit contributions
 our-0.137
 fun-0.132
 awesome-0.131
 sight-0.129
 my-0.126
Token I
Token ID314
Feature activation-5.40e-04

Pos logit contributions
owed+0.013
m+0.013
bee+0.013
es+0.012
op+0.012
Neg logit contributions
 awesome-0.009
 sight-0.009
 fun-0.009
 common-0.009
 our-0.009
TokenThank
Token ID10449
Feature activation-4.64e-02

Pos logit contributions
 sight+0.022
 awesome+0.021
 common+0.021
 fun+0.021
 purpose+0.020
Neg logit contributions
owed-0.029
m-0.028
es-0.027
bee-0.027
op-0.027
Token you
Token ID345
Feature activation+1.24e-02

Pos logit contributions
 awesome+0.057
 our+0.057
 fun+0.056
 sight+0.054
 common+0.052
Neg logit contributions
owed-0.118
m-0.112
bee-0.112
op-0.109
ops-0.106
Token Jack
Token ID3619
Feature activation+7.83e-02

Pos logit contributions
owed+0.020
m+0.020
bee+0.019
op+0.018
es+0.018
Neg logit contributions
 sight-0.026
 awesome-0.026
 fun-0.025
 common-0.025
 purpose-0.025
Token for
Token ID329
Feature activation-4.65e-02

Pos logit contributions
owed+0.136
m+0.130
op+0.127
bee+0.126
ops+0.120
Neg logit contributions
 awesome-0.152
 sight-0.152
 fun-0.150
 our-0.149
 purpose-0.145
Token finding
Token ID4917
Feature activation-6.63e-02

Pos logit contributions
 sight+0.080
 common+0.079
 vast+0.077
 thrilling+0.077
 awesome+0.076
Neg logit contributions
m-0.085
es-0.083
bee-0.082
owed-0.080
ird-0.075
Token such
Token ID884
Feature activation-9.82e-02

Pos logit contributions
 comparison+0.084
 rightful+0.077
 common+0.077
 accomplishment+0.074
 thrilling+0.072
Neg logit contributions
es-0.156
ie-0.143
m-0.141
owed-0.138
ne-0.130
Token a
Token ID257
Feature activation-5.39e-02

Pos logit contributions
 sidelines+0.138
 comparison+0.132
rous+0.122
 destination+0.121
 recess+0.119
Neg logit contributions
es-0.170
owed-0.168
m-0.165
bee-0.165
ird-0.165
Token nice
Token ID3621
Feature activation-4.07e-02

Pos logit contributions
 comparison+0.052
 sight+0.045
 awesome+0.045
 common+0.044
 ours+0.043
Neg logit contributions
m-0.133
op-0.126
ird-0.122
bee-0.120
itty-0.118
Token shelter
Token ID11772
Feature activation+5.78e-02

Pos logit contributions
 awesome+0.055
 comparison+0.049
 common+0.048
 sight+0.048
 our+0.048
Neg logit contributions
m-0.093
owed-0.092
bee-0.090
op-0.088
ops-0.084
Token."
Token ID526
Feature activation+1.12e-02

Pos logit contributions
owed+0.103
m+0.100
bee+0.096
op+0.094
es+0.093
Neg logit contributions
 sight-0.111
 awesome-0.110
 fun-0.108
 common-0.107
 purpose-0.105
Token
Token ID220
Feature activation-2.50e-02

Pos logit contributions
owed+0.026
bee+0.026
m+0.025
op+0.024
ops+0.023
Neg logit contributions
 sight-0.015
 fun-0.015
 our-0.015
 pleasure-0.014
 awesome-0.014
Token was
Token ID373
Feature activation-7.88e-02

Pos logit contributions
owed+0.024
m+0.023
bee+0.021
es+0.021
op+0.021
Neg logit contributions
 sight-0.035
 awesome-0.035
 purpose-0.034
 common-0.034
 fun-0.034
Token so
Token ID523
Feature activation-7.33e-02

Pos logit contributions
 common+0.137
 purpose+0.136
 sight+0.125
 rightful+0.124
 ours+0.120
Neg logit contributions
m-0.156
owed-0.155
es-0.145
bee-0.142
op-0.134
Token pleased
Token ID10607
Feature activation-4.36e-02

Pos logit contributions
 purpose+0.127
 sight+0.122
 common+0.120
 rightful+0.118
 our+0.114
Neg logit contributions
m-0.145
owed-0.142
op-0.132
es-0.130
bee-0.126
Token to
Token ID284
Feature activation-6.31e-02

Pos logit contributions
 sight+0.070
 awesome+0.069
 fun+0.067
 common+0.066
 purpose+0.065
Neg logit contributions
owed-0.092
m-0.090
bee-0.087
op-0.085
es-0.085
Token have
Token ID423
Feature activation-6.81e-02

Pos logit contributions
 sight+0.091
 awesome+0.088
 fun+0.086
 common+0.085
 purpose+0.085
Neg logit contributions
owed-0.139
m-0.137
bee-0.134
es-0.129
op-0.126
Token such
Token ID884
Feature activation-9.81e-02

Pos logit contributions
 common+0.123
 awesome+0.121
 purpose+0.121
 sight+0.119
 ours+0.119
Neg logit contributions
m-0.124
owed-0.120
es-0.120
bee-0.112
ird-0.108
Token a
Token ID257
Feature activation-5.38e-02

Pos logit contributions
 common+0.124
 sidelines+0.124
 destination+0.123
 purpose+0.121
 comparison+0.121
Neg logit contributions
owed-0.185
bee-0.179
m-0.177
es-0.177
ird-0.176
Token long
Token ID890
Feature activation-3.24e-02

Pos logit contributions
 comparison+0.051
 ours+0.051
 our+0.050
 awesome+0.049
 common+0.048
Neg logit contributions
m-0.132
op-0.127
bee-0.126
owed-0.122
ird-0.121
Token day
Token ID1110
Feature activation-7.53e-03

Pos logit contributions
 awesome+0.048
 our+0.045
 ours+0.044
 sight+0.044
 common+0.043
Neg logit contributions
bee-0.070
owed-0.069
m-0.068
op-0.064
es-0.063
Token of
Token ID286
Feature activation-7.24e-02

Pos logit contributions
 sight+0.016
 awesome+0.016
 common+0.015
 fun+0.015
 purpose+0.015
Neg logit contributions
owed-0.012
m-0.012
bee-0.011
es-0.011
op-0.011
Token fun
Token ID1257
Feature activation-3.11e-02

Pos logit contributions
 common+0.115
 purpose+0.114
 awesome+0.108
 rightful+0.107
 our+0.107
Neg logit contributions
m-0.139
owed-0.135
bee-0.135
es-0.130
ird-0.118
Token so
Token ID523
Feature activation-6.20e-02

Pos logit contributions
 common+0.124
 purpose+0.120
 sight+0.115
 rightful+0.113
 comparison+0.111
Neg logit contributions
m-0.136
owed-0.127
bee-0.123
es-0.119
op-0.114
Token surprised
Token ID6655
Feature activation-4.60e-02

Pos logit contributions
 sight+0.101
 purpose+0.101
 common+0.094
 harmony+0.094
 rightful+0.093
Neg logit contributions
m-0.131
owed-0.122
op-0.120
bee-0.114
es-0.110
Token and
Token ID290
Feature activation-3.14e-02

Pos logit contributions
 awesome+0.083
 sight+0.082
 common+0.080
 fun+0.079
 purpose+0.079
Neg logit contributions
owed-0.087
m-0.086
bee-0.082
op-0.080
es-0.080
Token happy
Token ID3772
Feature activation-3.62e-02

Pos logit contributions
 purpose+0.067
 common+0.066
 sight+0.064
 awesome+0.063
 comparison+0.062
Neg logit contributions
owed-0.049
m-0.049
bee-0.043
es-0.041
op-0.040
Token that
Token ID326
Feature activation-3.04e-02

Pos logit contributions
 awesome+0.071
 sight+0.071
 common+0.069
 fun+0.068
 purpose+0.068
Neg logit contributions
owed-0.063
m-0.062
bee-0.058
es-0.057
op-0.057
Token such
Token ID884
Feature activation-9.79e-02

Pos logit contributions
 awesome+0.057
 sight+0.057
 common+0.056
 purpose+0.055
 fun+0.054
Neg logit contributions
m-0.055
owed-0.053
bee-0.050
es-0.050
op-0.049
Token a
Token ID257
Feature activation-4.94e-02

Pos logit contributions
 destination+0.113
rous+0.109
 sight+0.103
 common+0.101
 comparison+0.100
Neg logit contributions
bee-0.217
owed-0.202
itive-0.196
m-0.193
es-0.188
Token strange
Token ID6283
Feature activation-3.29e-02

Pos logit contributions
 ours+0.050
 comparison+0.048
 common+0.046
 our+0.045
 awesome+0.045
Neg logit contributions
m-0.127
bee-0.119
op-0.118
ird-0.109
owed-0.109
Token creature
Token ID7185
Feature activation+8.78e-02

Pos logit contributions
 awesome+0.051
 sight+0.049
 common+0.048
 fun+0.047
 our+0.047
Neg logit contributions
owed-0.070
m-0.070
bee-0.068
op-0.065
es-0.064
Token could
Token ID714
Feature activation-4.15e-02

Pos logit contributions
owed+0.153
m+0.148
bee+0.142
op+0.139
es+0.136
Neg logit contributions
 sight-0.172
 awesome-0.171
 fun-0.169
 common-0.163
 our-0.162
Token be
Token ID307
Feature activation-4.81e-02

Pos logit contributions
 sight+0.066
 awesome+0.065
 fun+0.062
 common+0.061
 pleasure+0.060
Neg logit contributions
owed-0.087
m-0.086
bee-0.084
es-0.081
op-0.080
Token thanked
Token ID26280
Feature activation-6.66e-02

Pos logit contributions
owed+0.005
m+0.005
bee+0.004
es+0.004
op+0.004
Neg logit contributions
 sight-0.007
 awesome-0.007
 fun-0.006
 common-0.006
 purpose-0.006
Token his
Token ID465
Feature activation-2.48e-02

Pos logit contributions
 awesome+0.071
 sight+0.070
 common+0.069
 purpose+0.066
 fun+0.066
Neg logit contributions
owed-0.171
m-0.165
es-0.164
bee-0.163
op-0.153
Token Uncle
Token ID23169
Feature activation+7.22e-02

Pos logit contributions
 comparison+0.037
 awesome+0.036
 common+0.036
 sight+0.035
 purpose+0.034
Neg logit contributions
owed-0.052
m-0.049
bee-0.049
es-0.047
op-0.046
Token for
Token ID329
Feature activation-5.46e-02

Pos logit contributions
owed+0.136
op+0.134
m+0.130
bee+0.130
ops+0.125
Neg logit contributions
 sight-0.124
 fun-0.121
 awesome-0.121
 our-0.119
 pleasure-0.115
Token recommending
Token ID34639
Feature activation-5.61e-02

Pos logit contributions
 common+0.071
 sight+0.069
 awesome+0.069
 comparison+0.066
 purpose+0.066
Neg logit contributions
m-0.125
owed-0.122
es-0.122
bee-0.119
ird-0.113
Token such
Token ID884
Feature activation-9.76e-02

Pos logit contributions
 common+0.065
 sight+0.064
 comparison+0.063
 awesome+0.063
 rightful+0.059
Neg logit contributions
m-0.135
owed-0.135
es-0.133
bee-0.129
op-0.127
Token a
Token ID257
Feature activation-4.96e-02

Pos logit contributions
 comparison+0.124
 sight+0.113
 common+0.112
 sidelines+0.111
 recess+0.108
Neg logit contributions
owed-0.198
m-0.193
es-0.192
bee-0.191
ird-0.185
Token great
Token ID1049
Feature activation-6.00e-02

Pos logit contributions
 comparison+0.045
 awesome+0.044
 sight+0.043
 common+0.043
 ours+0.042
Neg logit contributions
m-0.131
op-0.124
bee-0.122
owed-0.122
ird-0.120
Token toy
Token ID13373
Feature activation+3.65e-02

Pos logit contributions
 awesome+0.080
 ours+0.076
 comparison+0.075
 our+0.074
 common+0.074
Neg logit contributions
m-0.134
owed-0.129
bee-0.127
op-0.127
ird-0.120
Token.
Token ID13
Feature activation+7.22e-03

Pos logit contributions
owed+0.063
m+0.059
bee+0.057
op+0.054
es+0.054
Neg logit contributions
 awesome-0.074
 sight-0.074
 fun-0.074
 our-0.071
 purpose-0.070
Token They
Token ID1119
Feature activation+5.94e-03

Pos logit contributions
owed+0.015
m+0.015
bee+0.014
es+0.014
op+0.014
Neg logit contributions
 sight-0.012
 awesome-0.011
 fun-0.011
 common-0.011
 purpose-0.011
Token said
Token ID531
Feature activation-3.28e-02

Pos logit contributions
owed+0.033
m+0.032
bee+0.030
op+0.029
es+0.029
Neg logit contributions
 sight-0.046
 awesome-0.046
 fun-0.045
 common-0.045
 purpose-0.044
Token,
Token ID11
Feature activation-1.61e-04

Pos logit contributions
 sight+0.066
 awesome+0.064
 common+0.064
 fun+0.062
 purpose+0.062
Neg logit contributions
owed-0.057
m-0.055
es-0.053
bee-0.052
op-0.052
Token "
Token ID366
Feature activation-3.31e-03

Pos logit contributions
 sight+0.000
 awesome+0.000
 fun+0.000
 our+0.000
 common+0.000
Neg logit contributions
owed-0.000
m-0.000
bee-0.000
ops-0.000
op-0.000
TokenYou
Token ID1639
Feature activation-7.14e-03

Pos logit contributions
 sight+0.005
 awesome+0.005
 fun+0.005
 common+0.005
 purpose+0.005
Neg logit contributions
owed-0.007
m-0.007
bee-0.006
op-0.006
es-0.006
Token're
Token ID821
Feature activation-5.41e-02

Pos logit contributions
 sight+0.016
 awesome+0.016
 fun+0.016
 common+0.016
 purpose+0.016
Neg logit contributions
owed-0.010
m-0.010
bee-0.009
op-0.009
es-0.009
Token such
Token ID884
Feature activation-9.74e-02

Pos logit contributions
 purpose+0.077
 common+0.077
 sight+0.077
 comparison+0.072
 rightful+0.069
Neg logit contributions
m-0.121
owed-0.107
es-0.104
mallow-0.103
ird-0.103
Token a
Token ID257
Feature activation-5.44e-02

Pos logit contributions
 destination+0.140
 comparison+0.136
 sight+0.136
 sidelines+0.129
rous+0.128
Neg logit contributions
m-0.171
bee-0.169
es-0.169
ird-0.168
owed-0.159
Token gifted
Token ID22527
Feature activation-3.07e-02

Pos logit contributions
 comparison+0.057
 sight+0.056
 ours+0.055
 purpose+0.055
 common+0.055
Neg logit contributions
m-0.128
op-0.119
bee-0.119
ird-0.117
itty-0.115
Token,
Token ID11
Feature activation-2.60e-02

Pos logit contributions
 awesome+0.054
 sight+0.053
 common+0.052
 fun+0.051
 purpose+0.051
Neg logit contributions
owed-0.057
m-0.057
bee-0.055
op-0.054
es-0.052
Token little
Token ID1310
Feature activation-1.01e-02

Pos logit contributions
 sight+0.038
 common+0.037
 purpose+0.036
 awesome+0.036
 rightful+0.035
Neg logit contributions
m-0.056
owed-0.056
bee-0.054
op-0.053
es-0.052
Token boy
Token ID2933
Feature activation+9.52e-02

Pos logit contributions
 awesome+0.018
 sight+0.018
 common+0.018
 purpose+0.018
 fun+0.018
Neg logit contributions
m-0.018
owed-0.018
bee-0.017
op-0.016
es-0.016
Tokenheart
Token ID11499
Feature activation+4.77e-02

Pos logit contributions
 sight+0.040
 fun+0.039
 purpose+0.039
 awesome+0.038
 our+0.038
Neg logit contributions
owed-0.020
es-0.018
op-0.018
m-0.017
ops-0.017
Token.
Token ID13
Feature activation+9.98e-03

Pos logit contributions
m+0.073
owed+0.072
bee+0.069
op+0.067
ops+0.067
Neg logit contributions
 sight-0.102
 fun-0.101
 our-0.099
 awesome-0.099
 purpose-0.098
Token You
Token ID921
Feature activation-5.10e-03

Pos logit contributions
owed+0.021
m+0.021
bee+0.020
es+0.020
op+0.020
Neg logit contributions
 sight-0.015
 awesome-0.015
 fun-0.015
 common-0.015
 our-0.014
Token're
Token ID821
Feature activation-5.98e-02

Pos logit contributions
 awesome+0.010
 sight+0.010
 fun+0.010
 common+0.010
 our+0.010
Neg logit contributions
owed-0.009
m-0.009
bee-0.008
op-0.008
es-0.008
Token doing
Token ID1804
Feature activation-7.73e-02

Pos logit contributions
 common+0.095
 sight+0.091
 purpose+0.091
 comparison+0.088
 thrilling+0.087
Neg logit contributions
m-0.125
owed-0.109
bee-0.105
es-0.104
mallow-0.103
Token such
Token ID884
Feature activation-9.74e-02

Pos logit contributions
 comparison+0.093
 rightful+0.093
 sight+0.089
 common+0.087
 harmony+0.082
Neg logit contributions
es-0.167
m-0.157
owed-0.156
bee-0.155
op-0.153
Token a
Token ID257
Feature activation-5.45e-02

Pos logit contributions
 destination+0.123
 sight+0.117
 sidelines+0.117
 comparison+0.116
 thrilling+0.111
Neg logit contributions
bee-0.189
owed-0.185
m-0.183
es-0.177
ird-0.176
Token great
Token ID1049
Feature activation-5.13e-02

Pos logit contributions
 comparison+0.049
 theirs+0.048
 ours+0.048
 common+0.047
 sight+0.046
Neg logit contributions
m-0.133
op-0.127
bee-0.126
ird-0.122
itty-0.117
Token job
Token ID1693
Feature activation-2.12e-03

Pos logit contributions
 awesome+0.071
 sight+0.066
 common+0.066
 our+0.065
 comparison+0.065
Neg logit contributions
m-0.113
owed-0.112
bee-0.111
op-0.109
es-0.103
Token,"
Token ID553
Feature activation+1.36e-02

Pos logit contributions
 sight+0.004
 awesome+0.004
 fun+0.004
 common+0.004
 purpose+0.004
Neg logit contributions
owed-0.003
m-0.003
bee-0.003
op-0.003
es-0.003
Token said
Token ID531
Feature activation-2.15e-02

Pos logit contributions
owed+0.027
m+0.026
bee+0.025
op+0.025
ops+0.025
Neg logit contributions
 sight-0.024
 fun-0.023
 awesome-0.023
 our-0.023
 common-0.022
Tokenro
Token ID305
Feature activation+6.97e-02

Pos logit contributions
owed+0.017
m+0.017
bee+0.016
op+0.016
es+0.016
Neg logit contributions
 sight-0.020
 awesome-0.020
 fun-0.020
 common-0.020
 purpose-0.019
Tokenggy
Token ID19970
Feature activation+7.42e-02

Pos logit contributions
owed+0.112
m+0.109
bee+0.105
es+0.101
op+0.101
Neg logit contributions
 awesome-0.146
 sight-0.146
 fun-0.142
 common-0.140
 purpose-0.138
Token,
Token ID11
Feature activation-1.56e-02

Pos logit contributions
owed+0.119
m+0.118
bee+0.112
op+0.107
es+0.105
Neg logit contributions
 sight-0.153
 fun-0.152
 awesome-0.151
 our-0.149
 common-0.148
Token you
Token ID345
Feature activation+6.63e-03

Pos logit contributions
 common+0.029
 sight+0.029
 purpose+0.028
 awesome+0.027
 comparison+0.027
Neg logit contributions
m-0.029
owed-0.028
bee-0.027
op-0.026
es-0.025
Token are
Token ID389
Feature activation-5.67e-02

Pos logit contributions
owed+0.011
m+0.011
bee+0.011
op+0.010
es+0.010
Neg logit contributions
 sight-0.013
 awesome-0.013
 fun-0.013
 common-0.013
 purpose-0.013
Token such
Token ID884
Feature activation-9.73e-02

Pos logit contributions
 common+0.107
 purpose+0.100
 sight+0.093
 accomplishment+0.092
 rightful+0.091
Neg logit contributions
m-0.109
ird-0.089
es-0.088
ppy-0.087
mallow-0.086
Token a
Token ID257
Feature activation-5.46e-02

Pos logit contributions
 destination+0.145
 comparison+0.140
 sight+0.135
 common+0.134
rous+0.134
Neg logit contributions
m-0.172
ird-0.168
bee-0.157
es-0.151
ppy-0.147
Token good
Token ID922
Feature activation-4.19e-02

Pos logit contributions
 common+0.057
 comparison+0.057
 ours+0.056
 theirs+0.055
 sight+0.054
Neg logit contributions
m-0.129
bee-0.121
op-0.120
ird-0.117
itty-0.114
Token friend
Token ID1545
Feature activation+4.86e-02

Pos logit contributions
 awesome+0.073
 sight+0.073
 common+0.070
 fun+0.070
 purpose+0.069
Neg logit contributions
owed-0.081
m-0.081
bee-0.078
op-0.076
es-0.074
Token.
Token ID13
Feature activation+6.85e-03

Pos logit contributions
owed+0.098
bee+0.089
op+0.089
ops+0.088
m+0.088
Neg logit contributions
 fun-0.086
 our-0.085
 awesome-0.085
 sight-0.083
 my-0.080
Token I
Token ID314
Feature activation+2.26e-03

Pos logit contributions
owed+0.015
m+0.015
bee+0.014
es+0.014
op+0.014
Neg logit contributions
 awesome-0.010
 sight-0.010
 fun-0.010
 common-0.010
 pleasure-0.010
Token't
Token ID470
Feature activation-7.61e-02

Pos logit contributions
 sight+0.121
 awesome+0.121
 fun+0.117
 purpose+0.116
 our+0.115
Neg logit contributions
owed-0.146
m-0.142
bee-0.138
es-0.133
op-0.132
Token have
Token ID423
Feature activation-8.43e-02

Pos logit contributions
 awesome+0.102
 fun+0.102
 sight+0.100
 purpose+0.096
 our+0.095
Neg logit contributions
owed-0.182
m-0.177
es-0.174
op-0.172
bee-0.171
Token ended
Token ID4444
Feature activation-7.14e-02

Pos logit contributions
 vast+0.122
 awesome+0.121
 common+0.119
 thrilling+0.117
 harmony+0.117
Neg logit contributions
m-0.176
re-0.166
owed-0.163
ird-0.156
es-0.155
Token up
Token ID510
Feature activation-4.71e-02

Pos logit contributions
 awesome+0.116
 thrilling+0.116
 common+0.111
 pleasure+0.111
 vast+0.109
Neg logit contributions
owed-0.134
es-0.132
m-0.131
op-0.123
bee-0.121
Token in
Token ID287
Feature activation-5.90e-02

Pos logit contributions
 awesome+0.092
 common+0.088
 sight+0.087
 thrilling+0.086
 pleasure+0.086
Neg logit contributions
m-0.080
owed-0.074
es-0.073
op-0.069
bee-0.069
Token such
Token ID884
Feature activation-9.72e-02

Pos logit contributions
 awesome+0.082
 thrilling+0.074
 rightful+0.074
 realistic+0.072
 ours+0.072
Neg logit contributions
m-0.130
es-0.122
op-0.115
ird-0.114
mallow-0.112
Token bad
Token ID2089
Feature activation-3.38e-02

Pos logit contributions
 common+0.109
 comparison+0.106
 sight+0.102
 awesome+0.099
rous+0.099
Neg logit contributions
owed-0.212
m-0.205
bee-0.205
ird-0.202
es-0.201
Token situations
Token ID7445
Feature activation+3.10e-02

Pos logit contributions
 awesome+0.042
 sight+0.042
 common+0.040
 fun+0.039
 purpose+0.039
Neg logit contributions
owed-0.081
m-0.081
bee-0.078
op-0.076
es-0.076
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
owed+0.047
m+0.045
bee+0.044
op+0.042
es+0.042
Neg logit contributions
 awesome-0.067
 sight-0.067
 fun-0.066
 common-0.065
 purpose-0.064
Token
Token ID220
Feature activation-2.30e-02

Pos logit contributions
owed+0.022
m+0.021
bee+0.021
es+0.020
op+0.020
Neg logit contributions
 sight-0.016
 awesome-0.016
 fun-0.016
 purpose-0.015
 common-0.015
Token
Token ID198
Feature activation-1.00e-02

Pos logit contributions
 sight+0.037
 awesome+0.037
 fun+0.037
 common+0.036
 purpose+0.036
Neg logit contributions
owed-0.046
bee-0.046
m-0.045
es-0.042
op-0.041
Token so
Token ID523
Feature activation-7.37e-02

Pos logit contributions
 sight+0.021
 awesome+0.020
 common+0.019
 purpose+0.019
 comparison+0.018
Neg logit contributions
m-0.062
owed-0.060
es-0.058
op-0.057
bee-0.057
Token excited
Token ID6568
Feature activation-5.24e-02

Pos logit contributions
 sight+0.112
 purpose+0.111
 common+0.109
 awesome+0.108
 harmony+0.107
Neg logit contributions
owed-0.153
m-0.152
bee-0.143
op-0.139
es-0.134
Token to
Token ID284
Feature activation-6.88e-02

Pos logit contributions
 awesome+0.089
 sight+0.088
 common+0.086
 fun+0.084
 purpose+0.084
Neg logit contributions
owed-0.105
m-0.103
bee-0.098
es-0.096
op-0.095
Token have
Token ID423
Feature activation-8.03e-02

Pos logit contributions
 sight+0.111
 awesome+0.106
 purpose+0.104
 rightful+0.103
 common+0.103
Neg logit contributions
owed-0.137
bee-0.131
m-0.128
op-0.128
ird-0.128
Token found
Token ID1043
Feature activation-6.08e-02

Pos logit contributions
 common+0.102
 accomplishment+0.102
 rightful+0.101
 comparison+0.099
 pleasure+0.096
Neg logit contributions
m-0.177
es-0.172
ird-0.166
owed-0.164
bee-0.163
Token such
Token ID884
Feature activation-9.72e-02

Pos logit contributions
 common+0.074
 comparison+0.073
 rightful+0.073
 awesome+0.069
 purpose+0.068
Neg logit contributions
es-0.145
owed-0.141
m-0.140
op-0.133
bee-0.131
Token a
Token ID257
Feature activation-5.21e-02

Pos logit contributions
 comparison+0.120
 sidelines+0.119
 common+0.115
 harmony+0.112
 sight+0.110
Neg logit contributions
owed-0.191
m-0.188
bee-0.187
es-0.184
ird-0.182
Token big
Token ID1263
Feature activation-5.02e-02

Pos logit contributions
 comparison+0.050
 ours+0.048
 awesome+0.047
 common+0.046
 our+0.045
Neg logit contributions
m-0.128
op-0.125
bee-0.123
ird-0.117
itty-0.117
Token carrot
Token ID40562
Feature activation+8.51e-02

Pos logit contributions
 awesome+0.090
 common+0.087
 comparison+0.084
 ours+0.083
 our+0.083
Neg logit contributions
m-0.089
bee-0.083
op-0.081
owed-0.079
itty-0.076
Token.
Token ID13
Feature activation+7.10e-03

Pos logit contributions
owed+0.155
m+0.147
bee+0.138
op+0.134
es+0.134
Neg logit contributions
 sight-0.164
 fun-0.159
 awesome-0.159
 purpose-0.156
 our-0.154
Token They
Token ID1119
Feature activation+1.16e-02

Pos logit contributions
owed+0.015
m+0.014
bee+0.014
es+0.013
op+0.013
Neg logit contributions
 sight-0.012
 awesome-0.011
 fun-0.011
 common-0.011
 purpose-0.011
Token and
Token ID290
Feature activation-4.26e-02

Pos logit contributions
 common+0.161
 awesome+0.159
 purpose+0.157
 sight+0.154
 comparison+0.153
Neg logit contributions
m-0.083
owed-0.081
bee-0.073
es-0.070
op-0.062
Token hugged
Token ID41130
Feature activation-7.41e-02

Pos logit contributions
 sight+0.105
 awesome+0.105
 common+0.102
 purpose+0.102
 fun+0.101
Neg logit contributions
owed-0.053
m-0.051
bee-0.047
es-0.046
op-0.045
Token.
Token ID13
Feature activation+3.76e-03

Pos logit contributions
 sight+0.098
 awesome+0.098
 fun+0.094
 common+0.093
 purpose+0.091
Neg logit contributions
owed-0.176
m-0.174
bee-0.167
op-0.164
es-0.164
Token It
Token ID632
Feature activation+8.25e-03

Pos logit contributions
owed+0.008
m+0.008
bee+0.007
op+0.007
es+0.007
Neg logit contributions
 sight-0.006
 awesome-0.006
 fun-0.006
 common-0.006
 purpose-0.006
Token was
Token ID373
Feature activation-7.77e-02

Pos logit contributions
owed+0.012
m+0.012
op+0.011
bee+0.011
es+0.011
Neg logit contributions
 awesome-0.018
 sight-0.018
 fun-0.018
 common-0.017
 our-0.017
Token such
Token ID884
Feature activation-9.70e-02

Pos logit contributions
 backup+0.110
 common+0.108
 purpose+0.103
 recess+0.103
 sidelines+0.103
Neg logit contributions
m-0.159
owed-0.145
ie-0.142
bee-0.136
my-0.133
Token a
Token ID257
Feature activation-4.95e-02

Pos logit contributions
 destination+0.125
 sidelines+0.121
 comparison+0.116
 backup+0.114
 recess+0.111
Neg logit contributions
m-0.191
bee-0.189
owed-0.180
ird-0.174
itive-0.170
Token peaceful
Token ID12309
Feature activation-3.69e-02

Pos logit contributions
 ours+0.047
 theirs+0.045
 comparison+0.045
 common+0.043
 outer+0.040
Neg logit contributions
m-0.124
bee-0.110
op-0.110
ird-0.108
itty-0.106
Token moment
Token ID2589
Feature activation-8.74e-03

Pos logit contributions
 awesome+0.058
 ours+0.054
 common+0.053
 comparison+0.051
 thrilling+0.050
Neg logit contributions
m-0.077
bee-0.075
owed-0.074
op-0.070
ie-0.068
Token.
Token ID13
Feature activation+6.10e-03

Pos logit contributions
 sight+0.018
 fun+0.018
 awesome+0.018
 our+0.017
 common+0.017
Neg logit contributions
owed-0.015
m-0.014
bee-0.014
op-0.013
es-0.013
Token Then
Token ID3244
Feature activation-5.65e-03

Pos logit contributions
owed+0.012
m+0.012
bee+0.012
op+0.011
es+0.011
Neg logit contributions
 sight-0.010
 awesome-0.010
 fun-0.010
 common-0.010
 purpose-0.010
Token we
Token ID356
Feature activation+1.71e-03

Pos logit contributions
 sight+0.012
 common+0.012
 awesome+0.012
 purpose+0.012
 rightful+0.011
Neg logit contributions
m-0.014
owed-0.014
bee-0.014
op-0.013
es-0.013
Token did
Token ID750
Feature activation-6.30e-02

Pos logit contributions
owed+0.003
m+0.003
bee+0.003
es+0.003
op+0.003
Neg logit contributions
 awesome-0.003
 fun-0.003
 sight-0.003
 common-0.003
 our-0.003
Token!
Token ID0
Feature activation+8.04e-03

Pos logit contributions
 sight+0.151
 comparison+0.149
 rightful+0.147
 thrilling+0.145
 awesome+0.145
Neg logit contributions
owed-0.073
es-0.072
m-0.070
op-0.068
bee-0.067
Token You
Token ID921
Feature activation-2.64e-03

Pos logit contributions
owed+0.017
m+0.016
bee+0.016
op+0.015
es+0.015
Neg logit contributions
 awesome-0.013
 sight-0.013
 fun-0.013
 common-0.013
 our-0.012
Token're
Token ID821
Feature activation-5.67e-02

Pos logit contributions
 fun+0.005
 awesome+0.005
 common+0.005
 sight+0.005
 our+0.005
Neg logit contributions
owed-0.005
m-0.004
bee-0.004
op-0.004
es-0.004
Token such
Token ID884
Feature activation-9.67e-02

Pos logit contributions
 sight+0.076
 common+0.073
 comparison+0.073
 purpose+0.073
 rightful+0.069
Neg logit contributions
m-0.129
es-0.117
owed-0.116
bee-0.115
ird-0.113
Token a
Token ID257
Feature activation-5.68e-02

Pos logit contributions
 destination+0.132
 comparison+0.127
 sight+0.126
rous+0.115
 sidelines+0.115
Neg logit contributions
bee-0.185
m-0.182
ird-0.181
es-0.176
owed-0.169
Token smart
Token ID4451
Feature activation-1.49e-02

Pos logit contributions
 comparison+0.058
 sight+0.057
 outer+0.055
 ours+0.054
 common+0.054
Neg logit contributions
m-0.136
op-0.129
bee-0.128
ird-0.124
itty-0.120
Token kid
Token ID5141
Feature activation+9.52e-02

Pos logit contributions
 sight+0.028
 awesome+0.028
 fun+0.027
 common+0.027
 purpose+0.026
Neg logit contributions
owed-0.027
m-0.027
bee-0.026
op-0.025
es-0.025
Token!"
Token ID2474
Feature activation+1.14e-02

Pos logit contributions
owed+0.192
bee+0.179
m+0.175
ops+0.171
op+0.171
Neg logit contributions
 our-0.168
 fun-0.165
 awesome-0.163
 my-0.156
 your-0.151
Token
Token ID220
Feature activation-1.58e-02

Pos logit contributions
owed+0.026
bee+0.025
m+0.025
op+0.024
itive+0.023
Neg logit contributions
 sight-0.016
 fun-0.016
 our-0.016
 awesome-0.015
 common-0.015
Token,
Token ID11
Feature activation-2.53e-02

Pos logit contributions
owed+0.006
bee+0.006
m+0.006
op+0.006
es+0.006
Neg logit contributions
 fun-0.008
 awesome-0.008
 our-0.008
 sight-0.008
 great-0.007
Token Lily
Token ID20037
Feature activation+6.63e-02

Pos logit contributions
 sight+0.040
 harmony+0.040
 rightful+0.039
 comparison+0.038
 common+0.038
Neg logit contributions
m-0.050
owed-0.048
bee-0.046
ird-0.046
ie-0.045
Token!
Token ID0
Feature activation+8.53e-03

Pos logit contributions
owed+0.121
m+0.115
bee+0.113
op+0.111
es+0.108
Neg logit contributions
 fun-0.128
 sight-0.127
 awesome-0.124
 our-0.123
 purpose-0.118
Token You
Token ID921
Feature activation-4.83e-03

Pos logit contributions
owed+0.018
m+0.018
bee+0.017
es+0.017
op+0.017
Neg logit contributions
 awesome-0.013
 sight-0.013
 fun-0.013
 our-0.013
 common-0.013
Token are
Token ID389
Feature activation-5.70e-02

Pos logit contributions
 sight+0.011
 awesome+0.011
 fun+0.011
 common+0.011
 purpose+0.010
Neg logit contributions
owed-0.007
m-0.007
bee-0.006
op-0.006
es-0.006
Token such
Token ID884
Feature activation-9.64e-02

Pos logit contributions
 common+0.097
 sight+0.095
 purpose+0.092
 accomplishment+0.090
 rightful+0.089
Neg logit contributions
m-0.109
es-0.099
bee-0.096
ird-0.092
mallow-0.092
Token a
Token ID257
Feature activation-5.39e-02

Pos logit contributions
 destination+0.141
 comparison+0.127
 sight+0.127
 directions+0.120
rous+0.116
Neg logit contributions
ird-0.185
bee-0.183
m-0.180
es-0.173
op-0.169
Token good
Token ID922
Feature activation-4.05e-02

Pos logit contributions
 comparison+0.049
 common+0.048
 sight+0.048
 ours+0.046
 theirs+0.045
Neg logit contributions
m-0.135
op-0.130
bee-0.127
ird-0.123
itty-0.122
Token builder
Token ID27098
Feature activation+7.28e-02

Pos logit contributions
 sight+0.071
 awesome+0.070
 fun+0.068
 common+0.067
 purpose+0.067
Neg logit contributions
owed-0.080
m-0.079
bee-0.075
es-0.074
op-0.074
Token!"
Token ID2474
Feature activation+1.32e-02

Pos logit contributions
owed+0.135
m+0.126
bee+0.123
op+0.122
ops+0.115
Neg logit contributions
 awesome-0.137
 fun-0.137
 sight-0.135
 our-0.134
 common-0.129
Token Lily
Token ID20037
Feature activation+7.29e-02

Pos logit contributions
owed+0.033
m+0.032
bee+0.032
op+0.030
es+0.030
Neg logit contributions
 fun-0.016
 sight-0.016
 our-0.016
 my-0.015
 purpose-0.015
Token felt
Token ID2936
Feature activation-1.03e-01

Pos logit contributions
m+0.074
owed+0.072
op+0.067
bee+0.067
itty+0.065
Neg logit contributions
 fun-0.108
 awesome-0.103
 sight-0.101
 our-0.100
 exciting-0.099
Token so
Token ID523
Feature activation-7.54e-02

Pos logit contributions
 purpose+0.167
 common+0.166
 sight+0.160
 destination+0.158
 rightful+0.157
Neg logit contributions
m-0.201
owed-0.189
bee-0.182
es-0.180
bees-0.171
Token lucky
Token ID9670
Feature activation-4.08e-02

Pos logit contributions
 sight+0.103
 purpose+0.103
 harmony+0.097
 outer+0.096
 common+0.096
Neg logit contributions
m-0.168
owed-0.166
op-0.154
bee-0.150
bees-0.143
Token to
Token ID284
Feature activation-7.05e-02

Pos logit contributions
 awesome+0.063
 sight+0.063
 common+0.061
 fun+0.060
 purpose+0.060
Neg logit contributions
owed-0.087
m-0.086
bee-0.082
es-0.080
op-0.080
Token have
Token ID423
Feature activation-6.90e-02

Pos logit contributions
 sight+0.110
 awesome+0.109
 fun+0.105
 common+0.104
 purpose+0.103
Neg logit contributions
owed-0.151
m-0.148
bee-0.143
es-0.138
op-0.138
Token such
Token ID884
Feature activation-9.63e-02

Pos logit contributions
 sight+0.115
 awesome+0.113
 common+0.113
 purpose+0.112
 accomplishment+0.111
Neg logit contributions
m-0.133
es-0.131
owed-0.131
bee-0.123
ird-0.118
Token a
Token ID257
Feature activation-5.49e-02

Pos logit contributions
 comparison+0.110
 sight+0.107
 sidelines+0.104
 orderly+0.098
 common+0.097
Neg logit contributions
owed-0.213
es-0.210
bee-0.205
m-0.204
ird-0.198
Token wonderful
Token ID7932
Feature activation-6.06e-02

Pos logit contributions
 comparison+0.055
 ours+0.054
 awesome+0.051
 common+0.049
 our+0.048
Neg logit contributions
m-0.136
bee-0.129
op-0.128
itty-0.124
ird-0.124
Token mom
Token ID1995
Feature activation+5.39e-02

Pos logit contributions
 awesome+0.083
 ours+0.078
 comparison+0.076
 our+0.074
 common+0.074
Neg logit contributions
bee-0.132
m-0.132
owed-0.131
op-0.123
es-0.119
Token who
Token ID508
Feature activation-3.93e-03

Pos logit contributions
owed+0.089
bee+0.084
op+0.083
ops+0.081
es+0.075
Neg logit contributions
 fun-0.114
 awesome-0.110
 our-0.110
 sight-0.107
 exciting-0.105
Token made
Token ID925
Feature activation-5.93e-02

Pos logit contributions
 sight+0.008
 awesome+0.008
 fun+0.008
 common+0.008
 purpose+0.008
Neg logit contributions
owed-0.006
m-0.006
bee-0.006
op-0.006
es-0.006
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
owed+0.144
m+0.138
bee+0.136
op+0.128
es+0.126
Neg logit contributions
 sight-0.217
 awesome-0.214
 fun-0.213
 purpose-0.207
 common-0.206
Token
Token ID198
Feature activation+1.85e-03

Pos logit contributions
owed+0.035
m+0.034
bee+0.032
op+0.032
es+0.032
Neg logit contributions
 sight-0.030
 awesome-0.030
 common-0.029
 fun-0.029
 purpose-0.028
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
owed+0.004
m+0.004
bee+0.003
es+0.003
op+0.003
Neg logit contributions
 awesome-0.003
 sight-0.003
 fun-0.003
 common-0.003
 purpose-0.003
TokenThe
Token ID464
Feature activation-1.83e-02

Pos logit contributions
m+0.023
owed+0.022
op+0.022
bee+0.022
es+0.022
Neg logit contributions
 sight-0.017
 awesome-0.016
 common-0.015
 fun-0.015
 purpose-0.015
Token noisy
Token ID31210
Feature activation+2.23e-02

Pos logit contributions
 common+0.037
 ours+0.037
 sight+0.036
 awesome+0.035
 purpose+0.035
Neg logit contributions
m-0.031
bee-0.027
op-0.025
es-0.025
owed-0.024
Token brother
Token ID3956
Feature activation+1.12e-01

Pos logit contributions
owed+0.041
m+0.041
bee+0.039
op+0.038
es+0.038
Neg logit contributions
 awesome-0.041
 sight-0.041
 common-0.040
 fun-0.039
 purpose-0.039
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
owed+0.201
m+0.186
op+0.180
bee+0.179
owing+0.172
Neg logit contributions
 fun-0.202
 sight-0.195
 our-0.195
 awesome-0.192
 success-0.192
Token the
Token ID262
Feature activation-1.86e-02

Pos logit contributions
 awesome+0.030
 sight+0.029
 common+0.028
 ours+0.028
 thrilling+0.028
Neg logit contributions
bee-0.023
m-0.023
es-0.022
owed-0.022
op-0.021
Token quiet
Token ID5897
Feature activation+1.03e-02

Pos logit contributions
 common+0.038
 ours+0.036
 awesome+0.036
 our+0.035
 comparison+0.035
Neg logit contributions
m-0.031
bee-0.028
es-0.026
op-0.026
itive-0.025
Token brother
Token ID3956
Feature activation+1.09e-01

Pos logit contributions
owed+0.021
m+0.021
bee+0.020
op+0.020
es+0.020
Neg logit contributions
 awesome-0.017
 common-0.016
 sight-0.016
 fun-0.016
 our-0.016
Token celebrated
Token ID13943
Feature activation-2.51e-02

Pos logit contributions
owed+0.202
m+0.188
op+0.185
bee+0.182
itty+0.181
Neg logit contributions
 fun-0.189
 our-0.178
 success-0.177
 sight-0.175
 awesome-0.175
Token can
Token ID460
Feature activation-4.99e-02

Pos logit contributions
owed+0.011
m+0.011
bee+0.011
op+0.010
es+0.010
Neg logit contributions
 fun-0.011
 awesome-0.011
 our-0.011
 common-0.010
 sight-0.010
Token share
Token ID2648
Feature activation-3.71e-02

Pos logit contributions
 sight+0.077
 awesome+0.076
 fun+0.074
 common+0.073
 purpose+0.072
Neg logit contributions
owed-0.108
m-0.106
bee-0.102
op-0.100
es-0.100
Token another
Token ID1194
Feature activation-1.07e-02

Pos logit contributions
 sight+0.052
 awesome+0.051
 thrilling+0.050
 pleasure+0.050
 accomplishment+0.049
Neg logit contributions
es-0.084
m-0.083
owed-0.080
bee-0.078
ops-0.075
Token one
Token ID530
Feature activation+3.10e-02

Pos logit contributions
 sight+0.017
 awesome+0.017
 fun+0.016
 common+0.016
 purpose+0.016
Neg logit contributions
owed-0.023
m-0.022
bee-0.022
op-0.021
es-0.021
Token."
Token ID526
Feature activation+2.54e-02

Pos logit contributions
owed+0.061
m+0.056
bee+0.055
op+0.053
es+0.053
Neg logit contributions
 fun-0.057
 awesome-0.057
 sight-0.056
 our-0.055
 common-0.054
Token Sara
Token ID24799
Feature activation+9.53e-02

Pos logit contributions
owed+0.054
m+0.053
bee+0.051
op+0.050
es+0.050
Neg logit contributions
 awesome-0.040
 sight-0.040
 fun-0.038
 common-0.038
 purpose-0.038
Token and
Token ID290
Feature activation+5.80e-03

Pos logit contributions
owed+0.187
bee+0.175
ird+0.171
op+0.169
m+0.169
Neg logit contributions
 fun-0.170
 sight-0.162
 our-0.162
 awesome-0.160
 new-0.154
Token Ben
Token ID3932
Feature activation+7.96e-02

Pos logit contributions
m+0.014
owed+0.014
bee+0.013
es+0.013
op+0.013
Neg logit contributions
 sight-0.007
 awesome-0.007
 common-0.007
 purpose-0.007
 fun-0.007
Token look
Token ID804
Feature activation-3.56e-02

Pos logit contributions
owed+0.170
op+0.163
m+0.163
bee+0.162
es+0.159
Neg logit contributions
 fun-0.127
 awesome-0.124
 sight-0.120
 our-0.118
 common-0.117
Token at
Token ID379
Feature activation-3.97e-02

Pos logit contributions
 sight+0.071
 awesome+0.070
 common+0.069
 purpose+0.068
 fun+0.068
Neg logit contributions
owed-0.061
m-0.060
bee-0.056
es-0.056
op-0.055
Token each
Token ID1123
Feature activation+3.80e-03

Pos logit contributions
 awesome+0.061
 thrilling+0.060
 brand+0.058
 fantastic+0.056
 orderly+0.056
Neg logit contributions
m-0.077
bee-0.076
owed-0.074
ie-0.073
es-0.073
Token,
Token ID11
Feature activation-4.62e-03

Pos logit contributions
owed+0.121
m+0.118
bee+0.114
op+0.111
es+0.108
Neg logit contributions
 sight-0.157
 awesome-0.155
 fun-0.154
 common-0.151
 purpose-0.148
Token Lily
Token ID20037
Feature activation+7.55e-02

Pos logit contributions
 awesome+0.010
 sight+0.009
 common+0.009
 purpose+0.009
 comparison+0.009
Neg logit contributions
m-0.007
owed-0.007
es-0.007
bee-0.007
op-0.007
Token saw
Token ID2497
Feature activation-4.18e-02

Pos logit contributions
owed+0.135
m+0.131
bee+0.127
op+0.124
ops+0.119
Neg logit contributions
 sight-0.139
 fun-0.137
 awesome-0.135
 common-0.130
 our-0.129
Token a
Token ID257
Feature activation-2.21e-02

Pos logit contributions
 common+0.059
 rightful+0.055
 awesome+0.053
 purpose+0.053
 comparison+0.052
Neg logit contributions
m-0.100
es-0.098
owed-0.094
bee-0.091
op-0.087
Token little
Token ID1310
Feature activation+1.37e-03

Pos logit contributions
 awesome+0.028
 sight+0.026
 common+0.026
 comparison+0.026
 our+0.026
Neg logit contributions
m-0.052
bee-0.050
op-0.049
owed-0.047
es-0.046
Token boy
Token ID2933
Feature activation+1.12e-01

Pos logit contributions
m+0.002
owed+0.002
bee+0.002
op+0.002
es+0.002
Neg logit contributions
 awesome-0.003
 common-0.003
 sight-0.003
 purpose-0.003
 fun-0.003
Token crying
Token ID13774
Feature activation-4.56e-03

Pos logit contributions
owed+0.204
bee+0.198
ops+0.192
m+0.183
op+0.183
Neg logit contributions
 sight-0.204
 fun-0.204
 awesome-0.197
 our-0.191
 exciting-0.190
Token.
Token ID13
Feature activation+2.81e-02

Pos logit contributions
 sight+0.009
 fun+0.009
 awesome+0.009
 exciting+0.008
 purpose+0.008
Neg logit contributions
owed-0.008
op-0.008
ops-0.007
m-0.007
oke-0.007
Token She
Token ID1375
Feature activation+3.62e-02

Pos logit contributions
owed+0.051
m+0.051
bee+0.048
op+0.047
es+0.047
Neg logit contributions
 sight-0.053
 awesome-0.052
 fun-0.051
 common-0.051
 purpose-0.050
Token recognized
Token ID8018
Feature activation-4.54e-02

Pos logit contributions
owed+0.064
m+0.062
op+0.062
ops+0.060
bee+0.058
Neg logit contributions
 fun-0.069
 sight-0.069
 awesome-0.067
 common-0.065
 our-0.065
Token him
Token ID683
Feature activation+5.89e-02

Pos logit contributions
 rightful+0.065
 common+0.064
 awesome+0.062
 purpose+0.062
 sight+0.062
Neg logit contributions
m-0.103
owed-0.099
es-0.099
bee-0.097
op-0.092
Token so
Token ID523
Feature activation-2.67e-02

Pos logit contributions
owed+0.071
m+0.070
bee+0.066
op+0.065
es+0.064
Neg logit contributions
 sight-0.078
 awesome-0.077
 common-0.075
 fun-0.075
 purpose-0.074
Token much
Token ID881
Feature activation-2.45e-02

Pos logit contributions
 sight+0.027
 purpose+0.027
 common+0.027
 rightful+0.026
 awesome+0.026
Neg logit contributions
owed-0.068
m-0.067
es-0.063
bee-0.063
op-0.063
Token?"
Token ID1701
Feature activation+2.17e-02

Pos logit contributions
 awesome+0.056
 sight+0.055
 common+0.055
 comparison+0.055
 rightful+0.054
Neg logit contributions
owed-0.032
m-0.032
bee-0.029
es-0.027
op-0.027
Token The
Token ID383
Feature activation-1.98e-02

Pos logit contributions
owed+0.070
bee+0.067
m+0.067
ppy+0.065
pp+0.064
Neg logit contributions
 sight-0.015
 fun-0.014
 purpose-0.011
 our-0.011
 common-0.010
Token old
Token ID1468
Feature activation+3.72e-02

Pos logit contributions
 common+0.021
 awesome+0.020
 sight+0.020
 purpose+0.020
 comparison+0.019
Neg logit contributions
m-0.051
bee-0.049
owed-0.048
op-0.047
itty-0.047
Token tiger
Token ID26241
Feature activation+1.41e-01

Pos logit contributions
owed+0.085
m+0.077
ird+0.077
ppy+0.076
op+0.076
Neg logit contributions
 sight-0.063
 fun-0.057
 purpose-0.055
 awesome-0.055
 new-0.054
Token said
Token ID531
Feature activation-1.08e-02

Pos logit contributions
ird+0.287
op+0.277
ops+0.275
pp+0.270
m+0.266
Neg logit contributions
 fun-0.233
 sight-0.218
 beautiful-0.215
 loud-0.206
 exciting-0.205
Token,
Token ID11
Feature activation+1.02e-02

Pos logit contributions
 sight+0.021
 fun+0.021
 awesome+0.021
 our+0.020
 purpose+0.020
Neg logit contributions
owed-0.018
m-0.018
bee-0.017
ops-0.017
op-0.017
Token "
Token ID366
Feature activation+4.26e-03

Pos logit contributions
owed+0.026
m+0.024
op+0.024
ird+0.024
ops+0.024
Neg logit contributions
 sight-0.013
 fun-0.012
 awesome-0.012
 our-0.011
 simple-0.011
TokenI
Token ID40
Feature activation+1.05e-03

Pos logit contributions
owed+0.008
m+0.008
bee+0.008
es+0.008
op+0.008
Neg logit contributions
 sight-0.007
 awesome-0.007
 fun-0.007
 common-0.007
 purpose-0.007
Token blink
Token ID21019
Feature activation-2.57e-02

Pos logit contributions
owed+0.002
m+0.002
bee+0.002
es+0.002
op+0.002
Neg logit contributions
 sight-0.002
 awesome-0.002
 common-0.002
 fun-0.002
 purpose-0.002
Token got
Token ID1392
Feature activation-4.62e-02

Pos logit contributions
 sight+0.008
 awesome+0.008
 fun+0.008
 common+0.008
 purpose+0.008
Neg logit contributions
owed-0.006
m-0.006
bee-0.006
op-0.006
es-0.006
Token worse
Token ID4785
Feature activation-8.89e-03

Pos logit contributions
 awesome+0.071
 common+0.071
 sight+0.070
 harmony+0.069
 pleasure+0.069
Neg logit contributions
m-0.102
owed-0.097
es-0.088
ie-0.087
op-0.086
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
 sight+0.021
 awesome+0.021
 common+0.020
 fun+0.020
 purpose+0.020
Neg logit contributions
owed-0.012
m-0.011
bee-0.011
es-0.010
op-0.010
Token Sadly
Token ID27213
Feature activation+5.79e-03

Pos logit contributions
owed+0.027
m+0.027
op+0.026
es+0.026
bee+0.025
Neg logit contributions
 awesome-0.021
 sight-0.020
 common-0.020
 fun-0.019
 comparison-0.019
Token,
Token ID11
Feature activation-8.54e-03

Pos logit contributions
owed+0.010
m+0.010
bee+0.009
op+0.009
ird+0.009
Neg logit contributions
 sight-0.011
 our-0.011
 fun-0.011
 purpose-0.011
 awesome-0.011
Token Spot
Token ID15899
Feature activation+1.01e-01

Pos logit contributions
 awesome+0.013
 common+0.012
 pleasure+0.012
 orderly+0.012
 sight+0.012
Neg logit contributions
m-0.018
es-0.017
owed-0.016
bee-0.016
ops-0.016
Token passed
Token ID3804
Feature activation-4.02e-02

Pos logit contributions
owed+0.191
m+0.181
bee+0.178
op+0.170
ird+0.165
Neg logit contributions
 sight-0.176
 fun-0.173
 awesome-0.170
 purpose-0.167
 our-0.167
Token away
Token ID1497
Feature activation+3.43e-03

Pos logit contributions
 awesome+0.066
 comparison+0.062
 common+0.062
 ours+0.062
 pleasure+0.061
Neg logit contributions
m-0.086
owed-0.082
es-0.081
op-0.075
bee-0.073
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
owed+0.005
m+0.005
bee+0.005
es+0.005
op+0.005
Neg logit contributions
 awesome-0.007
 sight-0.007
 fun-0.007
 common-0.007
 purpose-0.007
Token
Token ID220
Feature activation-5.06e-03

Pos logit contributions
m+0.030
owed+0.030
es+0.028
op+0.028
ops+0.027
Neg logit contributions
 awesome-0.030
 common-0.028
 sight-0.028
 fun-0.028
 thrilling-0.027
Token
Token ID198
Feature activation+4.47e-03

Pos logit contributions
 sight+0.009
 awesome+0.009
 fun+0.008
 purpose+0.008
 common+0.008
Neg logit contributions
owed-0.010
bee-0.010
m-0.010
op-0.009
es-0.009
Token the
Token ID262
Feature activation-1.69e-02

Pos logit contributions
 awesome+0.003
 common+0.002
 ours+0.002
 rightful+0.002
 comparison+0.002
Neg logit contributions
m-0.004
es-0.003
op-0.003
ops-0.003
owed-0.003
Token pilot
Token ID8022
Feature activation+1.16e-01

Pos logit contributions
 awesome+0.020
 sight+0.020
 common+0.019
 fun+0.018
 purpose+0.018
Neg logit contributions
owed-0.043
m-0.043
bee-0.041
op-0.040
es-0.040
Token looked
Token ID3114
Feature activation-4.34e-02

Pos logit contributions
owed+0.197
m+0.194
op+0.191
bee+0.190
es+0.177
Neg logit contributions
 sight-0.213
 fun-0.205
 awesome-0.200
 our-0.197
 common-0.194
Token down
Token ID866
Feature activation-9.78e-03

Pos logit contributions
 purpose+0.091
 common+0.089
 harmony+0.084
 awesome+0.084
 comparison+0.084
Neg logit contributions
m-0.074
owed-0.072
es-0.065
op-0.062
bee-0.061
Token,
Token ID11
Feature activation-9.96e-03

Pos logit contributions
 awesome+0.022
 purpose+0.021
 common+0.021
 sight+0.020
 fun+0.020
Neg logit contributions
m-0.015
owed-0.014
es-0.013
op-0.013
bee-0.012
Token he
Token ID339
Feature activation+3.63e-02

Pos logit contributions
 awesome+0.016
 sight+0.016
 common+0.016
 purpose+0.016
 fun+0.016
Neg logit contributions
owed-0.020
m-0.020
es-0.019
bee-0.019
op-0.019
Token could
Token ID714
Feature activation-4.29e-02

Pos logit contributions
owed+0.064
m+0.062
bee+0.060
op+0.059
es+0.057
Neg logit contributions
 sight-0.070
 awesome-0.069
 fun-0.068
 common-0.066
 our-0.065
Token see
Token ID766
Feature activation-4.04e-02

Pos logit contributions
 sight+0.068
 awesome+0.065
 purpose+0.065
 common+0.063
 ours+0.062
Neg logit contributions
owed-0.092
m-0.085
bee-0.083
op-0.082
es-0.081
Token children
Token ID1751
Feature activation+7.67e-02

Pos logit contributions
 common+0.052
 awesome+0.051
 sight+0.050
 purpose+0.050
 comparison+0.048
Neg logit contributions
es-0.095
owed-0.094
m-0.094
bee-0.089
op-0.086
Token looking
Token ID2045
Feature activation-4.21e-02

Pos logit contributions
owed+0.165
m+0.157
bee+0.157
op+0.153
ops+0.151
Neg logit contributions
 sight-0.114
 awesome-0.114
 fun-0.113
 our-0.108
 comparison-0.105
Token up
Token ID510
Feature activation-2.52e-02

Pos logit contributions
 purpose+0.081
 common+0.079
 awesome+0.077
 sight+0.077
 comparison+0.074
Neg logit contributions
m-0.076
owed-0.074
es-0.068
bee-0.066
op-0.064
Token spots
Token ID10222
Feature activation+4.61e-02

Pos logit contributions
 accomplishment+0.107
 rightful+0.100
 common+0.093
rous+0.091
 orderly+0.089
Neg logit contributions
bee-0.141
es-0.137
m-0.135
owed-0.130
re-0.127
Token on
Token ID319
Feature activation-5.10e-02

Pos logit contributions
m+0.071
owed+0.069
es+0.064
bee+0.063
op+0.061
Neg logit contributions
 awesome-0.101
 purpose-0.101
 sight-0.100
 rightful-0.100
 common-0.099
Token its
Token ID663
Feature activation-1.28e-02

Pos logit contributions
 rightful+0.070
 orderly+0.067
 pleasure+0.067
 educational+0.065
 rewarding+0.065
Neg logit contributions
bee-0.120
es-0.109
owed-0.107
ops-0.105
m-0.104
Token wings
Token ID12098
Feature activation+5.63e-02

Pos logit contributions
 awesome+0.020
 sight+0.020
 common+0.020
 fun+0.019
 comparison+0.019
Neg logit contributions
owed-0.027
m-0.026
bee-0.025
op-0.024
es-0.024
Token.
Token ID13
Feature activation+1.53e-02

Pos logit contributions
owed+0.073
m+0.072
bee+0.066
es+0.064
op+0.064
Neg logit contributions
 awesome-0.135
 sight-0.135
 common-0.132
 fun-0.131
 purpose-0.131
Token It
Token ID632
Feature activation+3.60e-02

Pos logit contributions
owed+0.037
m+0.036
bee+0.035
op+0.034
es+0.034
Neg logit contributions
 sight-0.020
 awesome-0.020
 fun-0.019
 common-0.019
 purpose-0.019
Token moved
Token ID3888
Feature activation-2.92e-02

Pos logit contributions
owed+0.061
m+0.060
op+0.058
ops+0.055
bee+0.055
Neg logit contributions
 fun-0.072
 sight-0.072
 awesome-0.071
 common-0.069
 exciting-0.067
Token its
Token ID663
Feature activation-5.07e-03

Pos logit contributions
 rightful+0.050
 purpose+0.049
 pleasure+0.049
 awesome+0.048
 accomplishment+0.047
Neg logit contributions
m-0.058
owed-0.057
es-0.056
bee-0.054
y-0.052
Token wings
Token ID12098
Feature activation+5.73e-02

Pos logit contributions
 awesome+0.007
 sight+0.007
 common+0.006
 fun+0.006
 purpose+0.006
Neg logit contributions
owed-0.012
m-0.012
bee-0.011
es-0.011
op-0.011
Token slowly
Token ID6364
Feature activation+1.37e-02

Pos logit contributions
owed+0.081
m+0.077
es+0.072
bee+0.071
op+0.068
Neg logit contributions
 awesome-0.133
 common-0.131
 rightful-0.131
 sight-0.131
 purpose-0.130
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
owed+0.023
m+0.023
bee+0.022
op+0.021
es+0.021
Neg logit contributions
 sight-0.028
 fun-0.027
 awesome-0.026
 purpose-0.026
 common-0.026
Token out
Token ID503
Feature activation-4.64e-03

Pos logit contributions
owed+0.202
m+0.198
bee+0.190
op+0.187
es+0.186
Neg logit contributions
 sight-0.173
 awesome-0.171
 common-0.165
 fun-0.165
 purpose-0.163
Token of
Token ID286
Feature activation-5.90e-02

Pos logit contributions
 sight+0.011
 common+0.011
 awesome+0.011
 purpose+0.011
 pleasure+0.011
Neg logit contributions
owed-0.006
m-0.006
es-0.006
op-0.005
bee-0.005
Token some
Token ID617
Feature activation-1.62e-02

Pos logit contributions
 rightful+0.098
 awesome+0.096
 rewarding+0.096
 ours+0.090
 educational+0.090
Neg logit contributions
m-0.077
mallow-0.072
ie-0.071
my-0.070
es-0.069
Token cardboard
Token ID26819
Feature activation+6.81e-02

Pos logit contributions
 common+0.022
 awesome+0.021
 sight+0.021
 pleasure+0.021
 comparison+0.021
Neg logit contributions
m-0.034
op-0.034
es-0.032
bee-0.032
owed-0.031
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
owed+0.107
m+0.105
bee+0.099
op+0.096
es+0.095
Neg logit contributions
 sight-0.145
 awesome-0.144
 fun-0.141
 common-0.140
 purpose-0.139
Token He
Token ID679
Feature activation+3.11e-02

Pos logit contributions
owed+0.025
op+0.025
m+0.024
es+0.022
ops+0.022
Neg logit contributions
 awesome-0.040
 common-0.040
 comparison-0.037
 sight-0.037
 ours-0.037
Token was
Token ID373
Feature activation-5.99e-02

Pos logit contributions
m+0.061
owed+0.060
bee+0.057
ops+0.055
es+0.054
Neg logit contributions
 fun-0.053
 awesome-0.051
 sight-0.051
 common-0.050
 exciting-0.050
Token busy
Token ID8179
Feature activation-2.64e-02

Pos logit contributions
 common+0.119
 purpose+0.115
 sight+0.110
 rightful+0.108
 harmony+0.108
Neg logit contributions
m-0.108
owed-0.102
bee-0.095
es-0.094
op-0.094
Token cutting
Token ID7720
Feature activation-2.35e-02

Pos logit contributions
 awesome+0.048
 common+0.047
 ours+0.046
 pleasure+0.046
 rightful+0.045
Neg logit contributions
m-0.048
op-0.047
owed-0.047
es-0.047
bee-0.045
Token and
Token ID290
Feature activation-3.64e-02

Pos logit contributions
 rightful+0.031
 comparison+0.030
 awesome+0.030
 pleasure+0.029
 common+0.029
Neg logit contributions
es-0.056
op-0.053
owed-0.053
m-0.052
ops-0.051
Token drawing
Token ID8263
Feature activation-2.09e-02

Pos logit contributions
 awesome+0.062
 common+0.061
 sight+0.061
 pleasure+0.059
 purpose+0.059
Neg logit contributions
m-0.074
owed-0.071
es-0.068
bee-0.068
op-0.068
Token and
Token ID290
Feature activation-3.33e-02

Pos logit contributions
 sight+0.045
 awesome+0.043
 rightful+0.043
 pleasure+0.042
 accomplishment+0.042
Neg logit contributions
m-0.109
es-0.106
owed-0.106
bee-0.104
op-0.101
Token play
Token ID711
Feature activation-3.49e-02

Pos logit contributions
 awesome+0.056
 sight+0.056
 common+0.053
 thrilling+0.052
 comparison+0.052
Neg logit contributions
m-0.065
owed-0.065
bee-0.061
es-0.059
op-0.058
Token with
Token ID351
Feature activation-4.28e-02

Pos logit contributions
 sight+0.067
 awesome+0.065
 common+0.064
 comparison+0.063
 purpose+0.062
Neg logit contributions
owed-0.062
m-0.061
es-0.059
bee-0.058
op-0.056
Token me
Token ID502
Feature activation+5.10e-02

Pos logit contributions
 sight+0.050
 awesome+0.050
 accomplishment+0.048
 thrilling+0.048
 common+0.048
Neg logit contributions
m-0.103
es-0.103
owed-0.102
bee-0.100
op-0.098
Token?"
Token ID1701
Feature activation+1.77e-02

Pos logit contributions
owed+0.088
m+0.087
bee+0.086
op+0.082
ops+0.081
Neg logit contributions
 awesome-0.096
 fun-0.096
 sight-0.095
 our-0.094
 common-0.093
Token She
Token ID1375
Feature activation+3.03e-02

Pos logit contributions
owed+0.044
bee+0.041
m+0.040
es+0.039
ppy+0.039
Neg logit contributions
 sight-0.024
 fun-0.023
 purpose-0.022
 common-0.022
 our-0.021
Token shook
Token ID14682
Feature activation-5.71e-02

Pos logit contributions
owed+0.051
m+0.050
op+0.049
ops+0.047
bee+0.046
Neg logit contributions
 fun-0.060
 sight-0.059
 awesome-0.058
 common-0.056
 simple-0.055
Token them
Token ID606
Feature activation+5.07e-02

Pos logit contributions
 awesome+0.086
 sight+0.083
 common+0.081
 fun+0.079
 purpose+0.079
Neg logit contributions
owed-0.129
m-0.126
bee-0.121
es-0.119
op-0.114
Token gently
Token ID15165
Feature activation+7.61e-05

Pos logit contributions
owed+0.097
oke+0.096
bee+0.095
ops+0.094
ird+0.094
Neg logit contributions
 fun-0.083
 sight-0.082
 loud-0.079
 same-0.078
 new-0.077
Token and
Token ID290
Feature activation-2.48e-02

Pos logit contributions
owed+0.000
m+0.000
bee+0.000
op+0.000
ops+0.000
Neg logit contributions
 sight-0.000
 fun-0.000
 purpose-0.000
 awesome-0.000
 common-0.000
Token tried
Token ID3088
Feature activation-4.73e-02

Pos logit contributions
 sight+0.060
 awesome+0.060
 fun+0.059
 common+0.058
 purpose+0.058
Neg logit contributions
owed-0.032
m-0.031
bee-0.029
op-0.028
es-0.028
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
owed+0.091
m+0.088
bee+0.086
op+0.080
es+0.079
Neg logit contributions
 sight-0.122
 fun-0.120
 purpose-0.119
 awesome-0.118
 common-0.115
Token He
Token ID679
Feature activation+2.55e-02

Pos logit contributions
owed+0.027
m+0.027
bee+0.025
op+0.025
es+0.025
Neg logit contributions
 sight-0.024
 awesome-0.024
 fun-0.024
 common-0.023
 purpose-0.023
Token was
Token ID373
Feature activation-6.49e-02

Pos logit contributions
owed+0.041
m+0.041
bee+0.038
op+0.037
ops+0.036
Neg logit contributions
 fun-0.052
 sight-0.052
 awesome-0.052
 common-0.050
 our-0.049
Token bleeding
Token ID16832
Feature activation-1.62e-02

Pos logit contributions
 common+0.113
 sight+0.109
 harmony+0.106
 purpose+0.106
 comparison+0.103
Neg logit contributions
m-0.132
owed-0.126
bee-0.120
es-0.119
op-0.118
Token and
Token ID290
Feature activation-2.84e-02

Pos logit contributions
 sight+0.031
 fun+0.030
 awesome+0.030
 purpose+0.029
 our+0.029
Neg logit contributions
owed-0.028
m-0.027
bee-0.026
oke-0.026
op-0.025
Token he
Token ID339
Feature activation+3.18e-02

Pos logit contributions
 awesome+0.052
 sight+0.052
 common+0.050
 purpose+0.049
 fun+0.049
Neg logit contributions
owed-0.054
m-0.053
bee-0.050
es-0.050
op-0.049
Token looked
Token ID3114
Feature activation-5.79e-02

Pos logit contributions
owed+0.054
m+0.053
bee+0.050
op+0.050
ops+0.049
Neg logit contributions
 fun-0.061
 sight-0.059
 awesome-0.058
 exciting-0.056
 beautiful-0.056
Token sad
Token ID6507
Feature activation-3.92e-02

Pos logit contributions
 common+0.095
 purpose+0.093
 sight+0.093
 awesome+0.091
 comparison+0.089
Neg logit contributions
m-0.119
owed-0.117
bee-0.108
es-0.107
op-0.105
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
 sight+0.087
 fun+0.087
 awesome+0.086
 purpose+0.083
 common+0.083
Neg logit contributions
owed-0.058
m-0.056
bee-0.054
op-0.053
ops-0.051
Token
Token ID198
Feature activation-2.16e-03

Pos logit contributions
owed+0.031
m+0.030
bee+0.029
op+0.028
es+0.028
Neg logit contributions
 sight-0.028
 awesome-0.028
 fun-0.027
 common-0.027
 purpose-0.027
Token
Token ID198
Feature activation+9.02e-03

Pos logit contributions
 sight+0.004
 awesome+0.004
 fun+0.004
 common+0.004
 purpose+0.004
Neg logit contributions
owed-0.004
m-0.004
bee-0.004
es-0.004
op-0.004
Token was
Token ID373
Feature activation-4.50e-02

Pos logit contributions
owed+0.049
m+0.048
bee+0.046
op+0.045
es+0.044
Neg logit contributions
 sight-0.069
 awesome-0.068
 fun-0.068
 common-0.066
 purpose-0.065
Token infectious
Token ID25246
Feature activation-7.37e-03

Pos logit contributions
 common+0.062
 purpose+0.062
 ours+0.061
 comparison+0.061
 sight+0.059
Neg logit contributions
m-0.095
owed-0.093
es-0.087
bee-0.083
ie-0.082
Token!
Token ID0
Feature activation+2.09e-02

Pos logit contributions
 common+0.019
 comparison+0.018
 awesome+0.018
 purpose+0.018
 ours+0.018
Neg logit contributions
es-0.009
m-0.008
owed-0.008
bee-0.007
op-0.007
Token Whenever
Token ID21326
Feature activation-7.97e-03

Pos logit contributions
owed+0.045
bee+0.044
m+0.043
es+0.041
ops+0.040
Neg logit contributions
 sight-0.034
 fun-0.032
 purpose-0.031
 pleasure-0.031
 awesome-0.031
Token anyone
Token ID2687
Feature activation+7.44e-02

Pos logit contributions
 awesome+0.014
 sight+0.014
 common+0.013
 fun+0.013
 purpose+0.013
Neg logit contributions
owed-0.015
m-0.015
bee-0.014
es-0.014
op-0.014
Token saw
Token ID2497
Feature activation-3.73e-02

Pos logit contributions
owed+0.128
ops+0.124
bee+0.123
m+0.122
op+0.121
Neg logit contributions
 fun-0.145
 sight-0.140
 common-0.137
 exciting-0.137
 awesome-0.136
Token her
Token ID607
Feature activation+1.61e-02

Pos logit contributions
 rightful+0.050
 awesome+0.048
 common+0.048
 sight+0.046
 thrilling+0.045
Neg logit contributions
owed-0.085
es-0.085
m-0.084
bee-0.081
ie-0.076
Token,
Token ID11
Feature activation-3.71e-03

Pos logit contributions
owed+0.030
m+0.029
bee+0.028
op+0.027
es+0.027
Neg logit contributions
 awesome-0.030
 sight-0.030
 fun-0.029
 common-0.029
 purpose-0.028
Token they
Token ID484
Feature activation+2.82e-02

Pos logit contributions
 awesome+0.008
 common+0.007
 purpose+0.007
 sight+0.007
 rightful+0.007
Neg logit contributions
m-0.006
es-0.005
owed-0.005
bee-0.005
op-0.005
Token couldn
Token ID3521
Feature activation-3.44e-02

Pos logit contributions
owed+0.051
op+0.051
m+0.049
ops+0.049
itty+0.047
Neg logit contributions
 fun-0.051
 sight-0.049
 awesome-0.048
 exciting-0.048
 beautiful-0.047
Token't
Token ID470
Feature activation-4.00e-02

Pos logit contributions
 awesome+0.050
 sight+0.049
 our+0.048
 purpose+0.048
 fun+0.048
Neg logit contributions
owed-0.079
bee-0.075
m-0.074
es-0.069
ppy-0.069
Token little
Token ID1310
Feature activation+3.94e-03

Pos logit contributions
 fun+0.019
 sight+0.019
 awesome+0.018
 common+0.017
 our+0.017
Neg logit contributions
owed-0.046
es-0.043
m-0.043
ops-0.042
bee-0.041
Token bear
Token ID6842
Feature activation+1.34e-01

Pos logit contributions
owed+0.008
es+0.007
ops+0.007
m+0.007
bee+0.006
Neg logit contributions
 sight-0.008
 fun-0.008
 our-0.007
 awesome-0.007
 modern-0.007
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
ops+0.268
owed+0.267
m+0.266
oke+0.261
ird+0.255
Neg logit contributions
 sight-0.192
 fun-0.190
 far-0.187
 big-0.184
 their-0.183
Token His
Token ID2399
Feature activation-1.74e-02

Pos logit contributions
owed+0.048
m+0.047
es+0.045
bee+0.045
op+0.044
Neg logit contributions
 sight-0.020
 fun-0.019
 purpose-0.018
 awesome-0.018
 common-0.017
Token name
Token ID1438
Feature activation+4.96e-02

Pos logit contributions
 sight+0.020
 awesome+0.020
 common+0.019
 fun+0.019
 purpose+0.019
Neg logit contributions
owed-0.044
m-0.044
bee-0.042
op-0.041
es-0.041
Token was
Token ID373
Feature activation-5.12e-02

Pos logit contributions
owed+0.102
ppy+0.094
pp+0.094
owing+0.094
op+0.093
Neg logit contributions
 fun-0.083
 sight-0.082
 awesome-0.078
 beautiful-0.077
 great-0.077
Token Bear
Token ID14732
Feature activation+1.07e-01

Pos logit contributions
 sight+0.084
 awesome+0.082
 common+0.081
 purpose+0.080
 fun+0.078
Neg logit contributions
m-0.104
owed-0.104
bee-0.100
es-0.097
op-0.097
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
owed+0.197
ird+0.189
ops+0.184
m+0.179
oke+0.178
Neg logit contributions
 our-0.188
 fun-0.186
 sight-0.184
 purpose-0.182
 their-0.177
Token He
Token ID679
Feature activation+2.82e-02

Pos logit contributions
owed+0.045
m+0.043
bee+0.042
ops+0.040
op+0.040
Neg logit contributions
 sight-0.033
 fun-0.033
 awesome-0.032
 our-0.032
 purpose-0.032
Token was
Token ID373
Feature activation-6.50e-02

Pos logit contributions
owed+0.056
m+0.053
op+0.052
ird+0.051
oke+0.050
Neg logit contributions
 fun-0.052
 awesome-0.048
 exciting-0.047
 common-0.046
 sight-0.046
Token so
Token ID523
Feature activation-6.05e-02

Pos logit contributions
 common+0.114
 purpose+0.113
 sight+0.110
 accomplishment+0.108
 comparison+0.108
Neg logit contributions
m-0.129
bee-0.113
owed-0.112
op-0.108
es-0.106
Token vegetables
Token ID13701
Feature activation+1.10e-01

Pos logit contributions
owed+0.119
op+0.108
bee+0.104
oke+0.103
itty+0.103
Neg logit contributions
 sight-0.074
 fun-0.069
 our-0.067
 original-0.065
 same-0.065
Token in
Token ID287
Feature activation-2.88e-02

Pos logit contributions
owed+0.194
m+0.182
bee+0.181
op+0.179
ops+0.174
Neg logit contributions
 fun-0.211
 sight-0.206
 awesome-0.205
 our-0.201
 exciting-0.196
Token the
Token ID262
Feature activation-1.53e-02

Pos logit contributions
 awesome+0.040
 ours+0.037
 accomplishment+0.037
 brand+0.037
 orderly+0.037
Neg logit contributions
m-0.061
es-0.061
ird-0.054
ne-0.053
bee-0.052
Token garden
Token ID11376
Feature activation+7.28e-02

Pos logit contributions
 awesome+0.024
 common+0.024
 accomplishment+0.024
 ours+0.023
 thrilling+0.022
Neg logit contributions
m-0.033
op-0.029
bee-0.028
owed-0.028
es-0.028
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
owed+0.107
m+0.102
bee+0.101
op+0.097
es+0.093
Neg logit contributions
 sight-0.162
 fun-0.158
 awesome-0.157
 common-0.153
 our-0.153
Token
Token ID220
Feature activation-1.24e-02

Pos logit contributions
owed+0.041
bee+0.040
m+0.040
es+0.037
op+0.037
Neg logit contributions
 sight-0.029
 fun-0.028
 awesome-0.027
 purpose-0.027
 common-0.027
Token
Token ID198
Feature activation+2.23e-03

Pos logit contributions
 fun+0.018
 awesome+0.018
 sight+0.018
 common+0.017
 our+0.017
Neg logit contributions
bee-0.027
owed-0.027
m-0.026
ops-0.025
itive-0.024
Token
Token ID198
Feature activation+1.52e-02

Pos logit contributions
owed+0.005
bee+0.005
m+0.005
ops+0.004
es+0.004
Neg logit contributions
 fun-0.003
 sight-0.003
 common-0.003
 awesome-0.003
 purpose-0.003
TokenThe
Token ID464
Feature activation-1.23e-02

Pos logit contributions
owed+0.031
m+0.031
bee+0.029
op+0.029
es+0.029
Neg logit contributions
 sight-0.025
 awesome-0.025
 fun-0.025
 common-0.024
 purpose-0.024
Token caul
Token ID46087
Feature activation+9.21e-02

Pos logit contributions
 sight+0.019
 awesome+0.019
 fun+0.019
 purpose+0.018
 common+0.018
Neg logit contributions
owed-0.028
m-0.026
es-0.025
op-0.025
bee-0.025
Tokeniflower
Token ID42642
Feature activation+1.26e-01

Pos logit contributions
owed+0.187
ird+0.169
m+0.167
bee+0.166
itty+0.160
Neg logit contributions
 sight-0.156
 purpose-0.150
 awesome-0.147
 fun-0.146
 common-0.146
Token fun
Token ID1257
Feature activation-3.24e-02

Pos logit contributions
 outer+0.098
 common+0.094
 comparison+0.093
 accomplishment+0.088
 avail+0.088
Neg logit contributions
m-0.140
re-0.134
es-0.131
's-0.131
 were-0.123
Token.
Token ID13
Feature activation+3.93e-04

Pos logit contributions
 sight+0.071
 fun+0.071
 awesome+0.069
 purpose+0.067
 our+0.067
Neg logit contributions
owed-0.052
m-0.048
bee-0.046
op-0.045
ops-0.043
Token They
Token ID1119
Feature activation+1.73e-02

Pos logit contributions
m+0.001
es+0.000
owed+0.000
op+0.000
oke+0.000
Neg logit contributions
 awesome-0.001
 comparison-0.001
 ours-0.001
 sight-0.001
 accomplishment-0.001
Token became
Token ID2627
Feature activation-7.86e-02

Pos logit contributions
owed+0.028
m+0.027
op+0.025
bee+0.025
es+0.025
Neg logit contributions
 awesome-0.036
 fun-0.036
 sight-0.036
 common-0.035
 our-0.034
Token friends
Token ID2460
Feature activation+1.92e-02

Pos logit contributions
 sight+0.127
 purpose+0.126
 outer+0.124
 comparison+0.123
 common+0.120
Neg logit contributions
m-0.151
es-0.144
bee-0.137
op-0.129
mun-0.128
Token and
Token ID290
Feature activation-2.59e-02

Pos logit contributions
owed+0.033
bee+0.031
m+0.030
op+0.030
ops+0.029
Neg logit contributions
 sight-0.039
 fun-0.039
 awesome-0.038
 our-0.037
 harmony-0.036
Token learned
Token ID4499
Feature activation-6.72e-02

Pos logit contributions
 sight+0.051
 awesome+0.050
 common+0.049
 purpose+0.049
 fun+0.048
Neg logit contributions
owed-0.046
m-0.045
bee-0.043
es-0.042
op-0.041
Token to
Token ID284
Feature activation-4.26e-02

Pos logit contributions
 comparison+0.109
 sight+0.102
 pleasure+0.102
 awesome+0.097
 harmony+0.097
Neg logit contributions
owed-0.132
m-0.130
es-0.127
bee-0.115
op-0.115
Token share
Token ID2648
Feature activation-4.08e-02

Pos logit contributions
 sight+0.073
 awesome+0.071
 pleasure+0.066
 thrilling+0.066
 comparison+0.066
Neg logit contributions
owed-0.083
m-0.082
bee-0.081
ops-0.074
op-0.074
Token.
Token ID13
Feature activation+4.73e-03

Pos logit contributions
 awesome+0.076
 sight+0.076
 pleasure+0.073
 comparison+0.072
 thrilling+0.072
Neg logit contributions
m-0.073
owed-0.072
es-0.070
bee-0.067
op-0.065
Token They
Token ID1119
Feature activation+1.45e-02

Pos logit contributions
m+0.007
owed+0.006
es+0.006
op+0.006
itty+0.006
Neg logit contributions
 awesome-0.010
 sight-0.010
 accomplishment-0.010
 thrilling-0.010
 purpose-0.010
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
owed+0.101
m+0.095
bee+0.093
op+0.091
es+0.090
Neg logit contributions
 sight-0.132
 fun-0.130
 awesome-0.127
 common-0.125
 our-0.124
Token After
Token ID2293
Feature activation-4.55e-02

Pos logit contributions
owed+0.036
m+0.035
bee+0.034
es+0.033
op+0.033
Neg logit contributions
 sight-0.023
 awesome-0.023
 fun-0.022
 common-0.022
 purpose-0.022
Token a
Token ID257
Feature activation-3.17e-02

Pos logit contributions
 awesome+0.053
 common+0.052
 sight+0.051
 rightful+0.051
 purpose+0.050
Neg logit contributions
m-0.116
owed-0.113
bee-0.109
es-0.109
op-0.108
Token while
Token ID981
Feature activation-2.19e-03

Pos logit contributions
 sight+0.054
 awesome+0.053
 fun+0.052
 common+0.051
 purpose+0.051
Neg logit contributions
owed-0.064
m-0.063
bee-0.060
op-0.059
es-0.059
Token,
Token ID11
Feature activation-1.54e-02

Pos logit contributions
 sight+0.005
 fun+0.004
 awesome+0.004
 our+0.004
 common+0.004
Neg logit contributions
owed-0.004
m-0.003
bee-0.003
es-0.003
op-0.003
Token the
Token ID262
Feature activation-1.47e-02

Pos logit contributions
 awesome+0.016
 common+0.015
 sight+0.014
 purpose+0.014
 fun+0.014
Neg logit contributions
owed-0.041
m-0.041
bee-0.039
es-0.039
op-0.039
Token cow
Token ID9875
Feature activation+1.27e-01

Pos logit contributions
 sight+0.027
 awesome+0.027
 common+0.027
 fun+0.027
 purpose+0.026
Neg logit contributions
owed-0.027
m-0.027
bee-0.025
op-0.025
es-0.025
Token spotted
Token ID13489
Feature activation-4.39e-02

Pos logit contributions
owed+0.238
m+0.225
bee+0.225
op+0.219
ops+0.216
Neg logit contributions
 sight-0.218
 fun-0.209
 exciting-0.194
 beautiful-0.193
 rightful-0.192
Token an
Token ID281
Feature activation-5.09e-02

Pos logit contributions
 common+0.053
 awesome+0.050
 rightful+0.048
 harmony+0.046
 orderly+0.045
Neg logit contributions
m-0.112
owed-0.110
es-0.107
bee-0.105
op-0.100
Token old
Token ID1468
Feature activation+2.20e-02

Pos logit contributions
 sight+0.050
 common+0.049
 awesome+0.049
 fun+0.046
 purpose+0.046
Neg logit contributions
owed-0.136
m-0.136
bee-0.132
op-0.129
es-0.126
Token cup
Token ID6508
Feature activation+8.50e-02

Pos logit contributions
owed+0.043
m+0.042
bee+0.040
es+0.040
op+0.039
Neg logit contributions
 sight-0.039
 awesome-0.038
 fun-0.037
 purpose-0.037
 common-0.037
Token was
Token ID373
Feature activation-5.16e-02

Pos logit contributions
owed+0.184
m+0.171
ird+0.170
op+0.165
bee+0.160
Neg logit contributions
 fun-0.192
 sight-0.185
 our-0.179
 awesome-0.177
 common-0.174
Token scared
Token ID12008
Feature activation-2.37e-02

Pos logit contributions
 purpose+0.100
 pleasure+0.100
 accomplishment+0.098
 rightful+0.096
 sight+0.095
Neg logit contributions
m-0.093
owed-0.081
bee-0.081
es-0.079
ppy-0.073
Token and
Token ID290
Feature activation-5.51e-03

Pos logit contributions
 fun+0.049
 sight+0.049
 awesome+0.048
 our+0.046
 exciting+0.045
Neg logit contributions
owed-0.041
bee-0.037
ops-0.037
m-0.037
op-0.037
Token tried
Token ID3088
Feature activation-4.27e-02

Pos logit contributions
 purpose+0.011
 sight+0.011
 awesome+0.011
 pleasure+0.011
 common+0.011
Neg logit contributions
owed-0.009
m-0.009
bee-0.009
es-0.008
op-0.008
Token to
Token ID284
Feature activation-3.38e-02

Pos logit contributions
 awesome+0.063
 sight+0.061
 pleasure+0.058
 common+0.057
 accomplishment+0.057
Neg logit contributions
m-0.098
owed-0.095
bee-0.090
es-0.089
op-0.088
Token get
Token ID651
Feature activation-5.73e-02

Pos logit contributions
 sight+0.057
 awesome+0.055
 pleasure+0.053
 purpose+0.053
 fun+0.052
Neg logit contributions
owed-0.068
m-0.066
bee-0.066
es-0.061
op-0.060
Token out
Token ID503
Feature activation-3.00e-03

Pos logit contributions
 purpose+0.091
 sight+0.089
 awesome+0.089
 accomplishment+0.088
 thrilling+0.087
Neg logit contributions
m-0.120
owed-0.116
es-0.109
bee-0.103
ie-0.102
Token of
Token ID286
Feature activation-6.22e-02

Pos logit contributions
 sight+0.007
 awesome+0.007
 purpose+0.007
 common+0.007
 fun+0.007
Neg logit contributions
owed-0.004
m-0.004
es-0.004
bee-0.004
op-0.003
Token the
Token ID262
Feature activation-9.19e-03

Pos logit contributions
 fantastic+0.108
 awesome+0.108
 thrilling+0.105
 rewarding+0.104
 vast+0.104
Neg logit contributions
m-0.103
mallow-0.094
ie-0.091
bee-0.089
itty-0.089
Token pond
Token ID16723
Feature activation+1.01e-01

Pos logit contributions
 awesome+0.014
 pleasure+0.014
 comparison+0.014
 outer+0.013
 thrilling+0.013
Neg logit contributions
m-0.020
bee-0.018
op-0.017
ops-0.017
itty-0.016
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
owed+0.141
m+0.137
bee+0.130
op+0.126
es+0.123
Neg logit contributions
 sight-0.234
 awesome-0.231
 fun-0.229
 common-0.226
 purpose-0.224
Token is
Token ID318
Feature activation-4.36e-02

Pos logit contributions
owed+0.023
m+0.022
bee+0.022
op+0.021
es+0.021
Neg logit contributions
 sight-0.018
 awesome-0.017
 fun-0.017
 common-0.017
 purpose-0.017
Token cel
Token ID18725
Feature activation+7.29e-02

Pos logit contributions
 comparison+0.059
 accomplishment+0.057
 sight+0.057
 harmony+0.057
 thrilling+0.056
Neg logit contributions
m-0.092
bee-0.090
es-0.088
owed-0.086
ppy-0.085
Tokenery
Token ID1924
Feature activation+1.07e-01

Pos logit contributions
owed+0.122
m+0.111
ird+0.108
itty+0.102
bee+0.101
Neg logit contributions
 awesome-0.149
 fun-0.147
 purpose-0.142
 common-0.138
 sight-0.138
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
owed+0.207
m+0.197
bee+0.186
op+0.180
itty+0.179
Neg logit contributions
 fun-0.196
 awesome-0.196
 sight-0.190
 purpose-0.188
 our-0.186
Token It
Token ID632
Feature activation+2.40e-02

Pos logit contributions
owed+0.028
m+0.027
bee+0.026
op+0.026
es+0.026
Neg logit contributions
 sight-0.017
 awesome-0.017
 fun-0.016
 common-0.016
 purpose-0.016
Token is
Token ID318
Feature activation-5.29e-02

Pos logit contributions
owed+0.052
m+0.049
bee+0.048
op+0.048
ird+0.047
Neg logit contributions
 awesome-0.038
 fun-0.038
 our-0.038
 sight-0.037
 common-0.035
Token green
Token ID4077
Feature activation+1.76e-02

Pos logit contributions
 comparison+0.072
 harmony+0.072
 sight+0.071
 accomplishment+0.070
rous+0.069
Neg logit contributions
m-0.111
bee-0.107
es-0.101
owed-0.099
ppy-0.098
Token and
Token ID290
Feature activation-2.83e-02

Pos logit contributions
owed+0.031
m+0.029
op+0.028
bee+0.028
ird+0.027
Neg logit contributions
 fun-0.036
 sight-0.035
 awesome-0.035
 our-0.034
 purpose-0.033
Token long
Token ID890
Feature activation+2.52e-03

Pos logit contributions
 sight+0.037
 purpose+0.034
 awesome+0.033
 pleasure+0.033
 comparison+0.033
Neg logit contributions
m-0.067
bee-0.064
owed-0.063
es-0.063
op-0.060
Token and
Token ID290
Feature activation-1.95e-02

Pos logit contributions
owed+0.004
m+0.004
op+0.004
bee+0.004
itty+0.004
Neg logit contributions
 fun-0.005
 sight-0.005
 awesome-0.005
 purpose-0.005
 our-0.005
Token crunch
Token ID23527
Feature activation+2.79e-02

Pos logit contributions
 sight+0.028
 purpose+0.026
 comparison+0.026
 awesome+0.026
 common+0.025
Neg logit contributions
m-0.042
owed-0.042
es-0.041
bee-0.041
op-0.039
Token are
Token ID389
Feature activation-3.91e-02

Pos logit contributions
owed+0.014
bee+0.013
m+0.013
op+0.013
es+0.012
Neg logit contributions
 fun-0.017
 awesome-0.016
 our-0.016
 common-0.015
 sight-0.015
Token my
Token ID616
Feature activation-2.97e-02

Pos logit contributions
 sight+0.049
 purpose+0.045
 comparison+0.045
 common+0.044
 accomplishment+0.044
Neg logit contributions
m-0.092
es-0.086
ppy-0.082
owed-0.082
que-0.080
Token sister
Token ID6621
Feature activation+7.16e-02

Pos logit contributions
 sight+0.052
 awesome+0.051
 common+0.050
 purpose+0.049
 comparison+0.049
Neg logit contributions
m-0.057
owed-0.057
bee-0.054
op-0.053
es-0.053
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
owed+0.117
bee+0.111
m+0.109
op+0.108
ops+0.103
Neg logit contributions
 fun-0.149
 awesome-0.148
 sight-0.145
 our-0.143
 purpose-0.139
Token I
Token ID314
Feature activation+1.61e-02

Pos logit contributions
 sight+0.027
 common+0.026
 comparison+0.026
 purpose+0.026
 thrilling+0.026
Neg logit contributions
m-0.024
owed-0.023
bee-0.023
es-0.023
ppy-0.021
Token like
Token ID588
Feature activation-5.71e-02

Pos logit contributions
owed+0.041
bee+0.040
op+0.038
m+0.038
es+0.038
Neg logit contributions
 fun-0.020
 common-0.019
 awesome-0.018
 sight-0.017
 our-0.017
Token you
Token ID345
Feature activation+4.04e-02

Pos logit contributions
 sight+0.099
 vast+0.097
 amaz+0.096
 thrilling+0.096
 common+0.096
Neg logit contributions
m-0.104
es-0.100
bee-0.097
ie-0.092
owed-0.091
Token."
Token ID526
Feature activation+2.20e-02

Pos logit contributions
owed+0.077
op+0.073
bee+0.073
itty+0.069
m+0.069
Neg logit contributions
 fun-0.075
 our-0.072
 awesome-0.070
 sight-0.068
 common-0.068
Token Lily
Token ID20037
Feature activation+8.90e-02

Pos logit contributions
owed+0.047
m+0.045
bee+0.044
op+0.043
ops+0.042
Neg logit contributions
 sight-0.036
 fun-0.035
 common-0.035
 awesome-0.035
 purpose-0.034
Token smiled
Token ID13541
Feature activation-3.12e-02

Pos logit contributions
owed+0.152
op+0.149
m+0.144
bee+0.144
itty+0.137
Neg logit contributions
 fun-0.172
 sight-0.164
 our-0.162
 awesome-0.161
 beautiful-0.156
Token and
Token ID290
Feature activation-1.30e-02

Pos logit contributions
 sight+0.071
 awesome+0.070
 fun+0.069
 common+0.069
 purpose+0.068
Neg logit contributions
owed-0.045
m-0.044
bee-0.041
op-0.040
es-0.040
Token little
Token ID1310
Feature activation-8.67e-03

Pos logit contributions
 awesome+0.051
 sight+0.050
 common+0.049
 comparison+0.049
 purpose+0.048
Neg logit contributions
m-0.084
owed-0.081
bee-0.081
op-0.080
es-0.075
Token girl
Token ID2576
Feature activation+7.86e-02

Pos logit contributions
 sight+0.019
 awesome+0.019
 fun+0.019
 common+0.018
 purpose+0.018
Neg logit contributions
owed-0.013
m-0.013
bee-0.012
es-0.012
op-0.012
Token named
Token ID3706
Feature activation-1.88e-02

Pos logit contributions
owed+0.137
m+0.135
bee+0.134
ops+0.134
op+0.126
Neg logit contributions
 sight-0.144
 fun-0.141
 our-0.138
 common-0.137
 exciting-0.131
Token Lily
Token ID20037
Feature activation+7.55e-02

Pos logit contributions
 sight+0.025
 awesome+0.025
 fun+0.024
 common+0.024
 purpose+0.024
Neg logit contributions
owed-0.044
m-0.043
bee-0.042
op-0.041
es-0.041
Token came
Token ID1625
Feature activation-4.77e-02

Pos logit contributions
ops+0.160
bee+0.160
owed+0.158
op+0.155
oke+0.149
Neg logit contributions
 fun-0.096
 sight-0.094
 their-0.092
 far-0.092
 big-0.091
Token to
Token ID284
Feature activation-5.77e-02

Pos logit contributions
 sight+0.059
 awesome+0.059
 fun+0.056
 common+0.056
 purpose+0.055
Neg logit contributions
owed-0.118
m-0.117
bee-0.111
es-0.110
op-0.110
Token the
Token ID262
Feature activation-1.76e-02

Pos logit contributions
 awesome+0.077
 orderly+0.069
 thrilling+0.067
 comparison+0.067
 accomplishment+0.067
Neg logit contributions
m-0.128
owed-0.125
bee-0.123
itty-0.117
ne-0.117
Token garden
Token ID11376
Feature activation+6.23e-02

Pos logit contributions
 awesome+0.030
 accomplishment+0.030
 comparison+0.029
 common+0.029
 thrilling+0.029
Neg logit contributions
m-0.033
op-0.029
itty-0.028
mallow-0.028
bee-0.028
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
owed+0.098
m+0.094
bee+0.093
op+0.088
es+0.087
Neg logit contributions
 sight-0.134
 fun-0.129
 awesome-0.129
 purpose-0.126
 common-0.125
Token She
Token ID1375
Feature activation+1.46e-02

Pos logit contributions
owed+0.017
m+0.017
bee+0.016
op+0.015
es+0.015
Neg logit contributions
 awesome-0.020
 sight-0.020
 common-0.020
 fun-0.020
 purpose-0.019
Token saw
Token ID2497
Feature activation-6.41e-02

Pos logit contributions
owed+0.028
m+0.027
bee+0.026
es+0.025
op+0.025
Neg logit contributions
 fun-0.026
 sight-0.026
 awesome-0.026
 common-0.025
 our-0.024
Token was
Token ID373
Feature activation-7.49e-02

Pos logit contributions
owed+0.035
m+0.032
bee+0.032
op+0.032
es+0.031
Neg logit contributions
 fun-0.041
 sight-0.040
 awesome-0.040
 exciting-0.039
 common-0.038
Token an
Token ID281
Feature activation-5.54e-02

Pos logit contributions
 rightful+0.117
 harmony+0.117
rous+0.117
 purpose+0.116
 pleasure+0.116
Neg logit contributions
m-0.146
bee-0.129
owed-0.122
op-0.121
o-0.120
Token ambulance
Token ID22536
Feature activation+9.66e-02

Pos logit contributions
 sight+0.060
 awesome+0.060
 fun+0.057
 common+0.056
 purpose+0.055
Neg logit contributions
owed-0.145
m-0.143
bee-0.138
op-0.136
es-0.136
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
owed+0.176
m+0.169
bee+0.168
es+0.160
op+0.156
Neg logit contributions
 sight-0.182
 awesome-0.180
 fun-0.179
 our-0.171
 purpose-0.170
Token Kitty
Token ID21168
Feature activation+8.55e-02

Pos logit contributions
bee+0.038
owed+0.037
m+0.036
ppy+0.036
itive+0.035
Neg logit contributions
 fun-0.022
 sight-0.022
 awesome-0.020
 our-0.020
 my-0.019
Token wanted
Token ID2227
Feature activation-5.92e-02

Pos logit contributions
owed+0.171
m+0.166
ird+0.161
bee+0.159
pp+0.158
Neg logit contributions
 fun-0.135
 exciting-0.124
 beautiful-0.121
 sight-0.118
 our-0.117
Token to
Token ID284
Feature activation-3.81e-02

Pos logit contributions
 awesome+0.089
 common+0.087
 sight+0.087
 comparison+0.086
 accomplishment+0.086
Neg logit contributions
m-0.128
owed-0.124
op-0.121
es-0.120
bee-0.115
Token see
Token ID766
Feature activation-5.10e-02

Pos logit contributions
 sight+0.058
 awesome+0.057
 common+0.055
 fun+0.054
 purpose+0.054
Neg logit contributions
owed-0.082
m-0.080
op-0.077
bee-0.077
es-0.074
Token what
Token ID644
Feature activation-2.81e-02

Pos logit contributions
 awesome+0.054
 sight+0.054
 fun+0.052
 our+0.050
 common+0.049
Neg logit contributions
owed-0.135
m-0.133
bee-0.130
op-0.127
ops-0.126
Token was
Token ID373
Feature activation-3.14e-02

Pos logit contributions
 fun+0.053
 sight+0.053
 awesome+0.053
 our+0.051
 common+0.050
Neg logit contributions
owed-0.052
m-0.050
bee-0.049
op-0.047
ops-0.046
Token inside
Token ID2641
Feature activation+3.58e-03

Pos logit contributions
 awesome+0.053
 sight+0.053
 fun+0.052
 our+0.050
 common+0.050
Neg logit contributions
owed-0.065
m-0.063
bee-0.061
op-0.060
es-0.060