TOP ACTIVATIONS
MAX = 8.280

flavors they liked and what topp ings would go best .
then took out his hand ker chief and carefully picked up
down and used his hand ker chief to pick up the
let go of the hand ker chief and started climbing back
mom said . " R az ors are not toys ,
clown s juggling and ac rob ats doing flips . But
He took out a hand ker chief from his pocket and
sund ae with lots of topp ings . They
faces and shapes with the topp ings . "
it . A lic ia smiled while she looked
â M az zy was so excited about
thought they looked like ac rob ats . She enjoyed the
noticed that some of the topp ings were missing . She
saw clown s and ac rob ats and lions and elephants
she went , the hand ker chief unw inding until she
being too curious . N ont he less , it showed
L illy and O zz ie played together every day
swe aters and hats and scar ves from the wool .
He smiled as the rain drops tick led his face .
elephants , lions and ac rob ats . They are amazed

MIN ACTIVATIONS
MIN = -8.520

price tag . It said $ 1 . 99 per pound
The le opard toy costs $ 10 ."
the price - it was $ 5 . Sally was so
he said it was only $ 10 .
" Maybe it will come back if we hang here and
out . It will come back soon ." He
that her leg was coming back . She was reapp earing
busy . It will come back soon . Then we can
. Maybe it will come back to visit us ."
ie guessed that it must belong to one the big people
sea but then it came back . They cheered together ,
, there was a loud * be ep * from the
. The sunshine will come back ." L ily
. Maybe it will come back and sing for us again
night , the noise came back . It was louder this
, it wouldn 't come back . Mary nodded and made
it so it could come back . He said " rocket
click . The screen came back on , but the movie
away . The sun came back , and everyone was happy
mom said , " Think back to your memory of where

INTERVAL 4.080 - 8.280
CONTAINS 0.344%

brave . You did not let the big boy take your
about cleaning up . They did not think about how paper
What â s going on ? â
remembered her mom my 's words . She said , "
The girl and her brother were so excited and started

INTERVAL -0.120 - 4.080
CONTAINS 55.020%

fun to climb the hill , so he did .
teacher and told her what Tom did . The teacher was
. <|endoftext|> Once upon a time , in a magical land
garden when she heard a sound . She looked around and
led and said , " Let 's hug ". Jane laughed

INTERVAL -4.320 - -0.120
CONTAINS 43.496%

there was a boy who loved carrots . He would wash
was very eager to learn about the world . One day
fun day ! <|endoftext|> Once upon a time , there was
to climb the hill . It 's too deep and you
ducks was curious about the new duck . " Hello ,

INTERVAL -8.520 - -4.320
CONTAINS 1.140%

, Look at that branch !
and slipped . She fell down and scrap ed her knee
bad . He said , " I 'm sorry , Lily
reach it . " Maybe we can climb a
The bird replied , " But there 's so much
Token flavors
Token ID17361
Feature activation+5.75e-01

Pos logit contributions
 thank+0.007
 show+0.007
 performance+0.006
 medal+0.006
 story+0.006
Neg logit contributions
 yourselves-0.013
ety-0.012
ried-0.011
neys-0.011
Or-0.011
Token they
Token ID484
Feature activation+5.54e-01

Pos logit contributions
 thank+0.011
 show+0.010
 performance+0.009
 medal+0.008
on+0.008
Neg logit contributions
 yourselves-0.022
ety-0.021
ried-0.020
neys-0.019
Or-0.019
Token liked
Token ID8288
Feature activation-9.50e-01

Pos logit contributions
 thank+0.012
 performance+0.012
 show+0.011
 medal+0.011
 story+0.011
Neg logit contributions
 yourselves-0.018
ety-0.017
ried-0.017
Or-0.016
 hyd-0.016
Token and
Token ID290
Feature activation+9.16e-02

Pos logit contributions
 yourselves+0.035
ety+0.032
ried+0.031
neys+0.030
Or+0.030
Neg logit contributions
 thank-0.019
 show-0.018
 performance-0.017
 medal-0.016
 story-0.015
Token what
Token ID644
Feature activation+2.50e+00

Pos logit contributions
 thank+0.002
 performance+0.002
 show+0.002
 medal+0.002
 story+0.002
Neg logit contributions
 yourselves-0.003
ety-0.003
ried-0.003
Or-0.003
 late-0.003
Token topp
Token ID23126
Feature activation+8.28e+00

Pos logit contributions
 thank+0.057
 performance+0.055
 show+0.053
 leap+0.051
 grin+0.050
Neg logit contributions
 yourselves-0.081
ety-0.075
Or-0.071
neys-0.071
 late-0.070
Tokenings
Token ID654
Feature activation+2.20e+00

Pos logit contributions
 thank+0.234
 performance+0.220
 show+0.220
 leap+0.201
 medal+0.200
Neg logit contributions
 yourselves-0.241
ety-0.214
ried-0.204
neys-0.195
Or-0.193
Token would
Token ID561
Feature activation+1.22e+00

Pos logit contributions
 thank+0.043
 performance+0.041
 show+0.040
 medal+0.036
 leap+0.036
Neg logit contributions
 yourselves-0.080
ety-0.074
ried-0.070
 late-0.069
Or-0.069
Token go
Token ID467
Feature activation-2.83e+00

Pos logit contributions
 performance+0.022
 thank+0.022
 show+0.020
odic+0.020
 medal+0.019
Neg logit contributions
 yourselves-0.045
ety-0.041
Or-0.039
ried-0.039
 hyd-0.038
Token best
Token ID1266
Feature activation-1.40e+00

Pos logit contributions
 yourselves+0.101
ety+0.094
ried+0.090
neys+0.088
Or+0.087
Neg logit contributions
 thank-0.060
 show-0.056
 performance-0.053
 medal-0.048
 story-0.048
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
 yourselves+0.050
ety+0.045
ried+0.044
neys+0.043
 late+0.042
Neg logit contributions
 thank-0.029
 performance-0.027
 show-0.027
 medal-0.024
 grin-0.024
Token then
Token ID788
Feature activation-5.54e-02

Pos logit contributions
 yourselves+0.072
ety+0.064
neys+0.061
ried+0.061
oky+0.060
Neg logit contributions
 thank-0.040
 show-0.037
 performance-0.035
 medal-0.032
on-0.032
Token took
Token ID1718
Feature activation-2.03e-01

Pos logit contributions
 yourselves+0.002
ety+0.002
ried+0.002
neys+0.002
oky+0.002
Neg logit contributions
 thank-0.001
 show-0.001
 performance-0.001
 medal-0.001
on-0.001
Token out
Token ID503
Feature activation-1.75e+00

Pos logit contributions
 yourselves+0.007
ety+0.006
ried+0.006
 late+0.006
How+0.006
Neg logit contributions
 thank-0.004
 performance-0.004
 show-0.004
 grin-0.004
 story-0.004
Token his
Token ID465
Feature activation-1.36e+00

Pos logit contributions
 yourselves+0.052
 late+0.048
ety+0.048
ried+0.045
neys+0.043
Neg logit contributions
 thank-0.044
 performance-0.043
 show-0.041
 grin-0.039
 hardly-0.039
Token hand
Token ID1021
Feature activation+4.49e-01

Pos logit contributions
 yourselves+0.042
ety+0.039
ried+0.036
Or+0.035
neys+0.035
Neg logit contributions
 thank-0.035
 performance-0.032
 show-0.032
 medal-0.029
 grin-0.029
Tokenker
Token ID6122
Feature activation+7.44e+00

Pos logit contributions
 thank+0.011
 performance+0.011
 show+0.011
 medal+0.010
 story+0.010
Neg logit contributions
 yourselves-0.014
ety-0.013
ried-0.013
 late-0.012
 hyd-0.012
Tokenchief
Token ID17351
Feature activation+1.37e+00

Pos logit contributions
 thank+0.155
 performance+0.143
 show+0.142
 story+0.128
 medal+0.128
Neg logit contributions
 yourselves-0.266
ety-0.250
ried-0.238
 hyd-0.228
Or-0.226
Token and
Token ID290
Feature activation+4.12e-01

Pos logit contributions
 thank+0.037
 performance+0.035
 show+0.034
 medal+0.031
 grin+0.031
Neg logit contributions
 yourselves-0.039
ety-0.038
ried-0.036
 late-0.035
neys-0.034
Token carefully
Token ID7773
Feature activation+9.91e-01

Pos logit contributions
 thank+0.009
 show+0.009
 performance+0.009
 story+0.008
 medal+0.008
Neg logit contributions
 yourselves-0.014
ety-0.013
ried-0.012
Or-0.012
neys-0.012
Token picked
Token ID6497
Feature activation-1.90e-02

Pos logit contributions
 thank+0.026
 performance+0.025
 show+0.025
 story+0.023
 medal+0.023
Neg logit contributions
 yourselves-0.029
ety-0.027
ried-0.026
Or-0.025
 hyd-0.025
Token up
Token ID510
Feature activation-2.56e+00

Pos logit contributions
 yourselves+0.001
ety+0.001
ried+0.001
 late+0.001
How+0.001
Neg logit contributions
 thank-0.000
 performance-0.000
 show-0.000
 story-0.000
 grin-0.000
Token down
Token ID866
Feature activation-4.21e+00

Pos logit contributions
 yourselves+0.072
neys+0.068
ety+0.067
Or+0.065
oky+0.065
Neg logit contributions
 thank-0.049
 show-0.048
on-0.044
 goodbye-0.044
 replied-0.043
Token and
Token ID290
Feature activation+3.00e-01

Pos logit contributions
 yourselves+0.148
ety+0.138
ried+0.129
neys+0.128
Or+0.124
Neg logit contributions
 thank-0.096
 show-0.090
 performance-0.084
 medal-0.078
 story-0.077
Token used
Token ID973
Feature activation+8.31e-01

Pos logit contributions
 thank+0.006
 show+0.006
 replied+0.005
on+0.005
 performance+0.005
Neg logit contributions
 yourselves-0.012
ety-0.010
neys-0.010
oky-0.010
ried-0.010
Token his
Token ID465
Feature activation-1.26e+00

Pos logit contributions
 thank+0.018
 performance+0.018
 show+0.017
 story+0.016
 medal+0.016
Neg logit contributions
 yourselves-0.028
ety-0.026
ried-0.025
 late-0.024
neys-0.023
Token hand
Token ID1021
Feature activation+6.26e-01

Pos logit contributions
 yourselves+0.038
ety+0.035
ried+0.034
Or+0.033
neys+0.032
Neg logit contributions
 thank-0.031
 performance-0.029
 show-0.029
ner-0.027
 medal-0.026
Tokenker
Token ID6122
Feature activation+7.41e+00

Pos logit contributions
 thank+0.014
 show+0.013
 performance+0.013
 medal+0.012
 story+0.011
Neg logit contributions
 yourselves-0.022
ety-0.020
ried-0.019
neys-0.019
Or-0.019
Tokenchief
Token ID17351
Feature activation+1.84e+00

Pos logit contributions
 thank+0.155
 show+0.143
 performance+0.142
 medal+0.128
 story+0.127
Neg logit contributions
 yourselves-0.265
ety-0.249
ried-0.237
neys-0.227
Or-0.227
Token to
Token ID284
Feature activation+1.06e+00

Pos logit contributions
 thank+0.044
 show+0.041
 performance+0.040
 medal+0.037
 story+0.036
Neg logit contributions
 yourselves-0.060
ety-0.057
ried-0.054
neys-0.052
 late-0.052
Token pick
Token ID2298
Feature activation-2.26e-01

Pos logit contributions
 thank+0.023
 performance+0.021
 show+0.020
 medal+0.020
 story+0.020
Neg logit contributions
 yourselves-0.037
ety-0.034
ried-0.033
Or-0.031
 hyd-0.031
Token up
Token ID510
Feature activation-2.69e+00

Pos logit contributions
 yourselves+0.008
ety+0.007
ried+0.007
 late+0.007
How+0.006
Neg logit contributions
 thank-0.005
 performance-0.005
 show-0.005
 hardly-0.004
 grin-0.004
Token the
Token ID262
Feature activation-1.47e-01

Pos logit contributions
 yourselves+0.097
ety+0.090
ried+0.086
 late+0.083
neys+0.083
Neg logit contributions
 thank-0.055
 performance-0.051
 show-0.050
 medal-0.045
 story-0.045
Token let
Token ID1309
Feature activation+4.25e+00

Pos logit contributions
 yourselves+0.016
ety+0.015
ried+0.015
Or+0.014
 hyd+0.014
Neg logit contributions
 thank-0.013
 performance-0.013
 show-0.012
 story-0.012
 medal-0.012
Token go
Token ID467
Feature activation-1.58e+00

Pos logit contributions
 performance+0.089
 thank+0.087
 medal+0.081
odic+0.081
 grin+0.081
Neg logit contributions
 yourselves-0.140
ety-0.130
ried-0.128
 late-0.122
Or-0.119
Token of
Token ID286
Feature activation-7.31e-02

Pos logit contributions
 yourselves+0.053
ety+0.050
ried+0.050
aking+0.046
 late+0.046
Neg logit contributions
 thank-0.034
 performance-0.032
 medal-0.029
 jugg-0.029
odic-0.029
Token the
Token ID262
Feature activation+7.92e-01

Pos logit contributions
 yourselves+0.002
ety+0.002
 late+0.002
ried+0.002
How+0.002
Neg logit contributions
 thank-0.002
 performance-0.002
 story-0.002
odic-0.002
 grin-0.002
Token hand
Token ID1021
Feature activation+3.67e-01

Pos logit contributions
 thank+0.017
 show+0.016
 performance+0.015
on+0.014
 grin+0.014
Neg logit contributions
 yourselves-0.028
ety-0.025
ried-0.024
 late-0.023
neys-0.023
Tokenker
Token ID6122
Feature activation+7.29e+00

Pos logit contributions
 thank+0.009
 performance+0.008
 show+0.008
 medal+0.007
 story+0.007
Neg logit contributions
 yourselves-0.012
ety-0.011
ried-0.011
 late-0.010
 hyd-0.010
Tokenchief
Token ID17351
Feature activation+1.67e+00

Pos logit contributions
 thank+0.165
 performance+0.153
 show+0.151
 story+0.143
 contain+0.137
Neg logit contributions
 yourselves-0.245
ety-0.229
ried-0.220
 hyd-0.208
neys-0.204
Token and
Token ID290
Feature activation+5.04e-01

Pos logit contributions
 thank+0.040
 performance+0.036
 show+0.036
 medal+0.033
 story+0.032
Neg logit contributions
 yourselves-0.054
ety-0.051
ried-0.049
 late-0.048
neys-0.046
Token started
Token ID2067
Feature activation-1.25e+00

Pos logit contributions
 thank+0.010
 performance+0.010
 show+0.009
 story+0.009
 medal+0.009
Neg logit contributions
 yourselves-0.018
ety-0.016
ried-0.016
 hyd-0.015
Or-0.015
Token climbing
Token ID14281
Feature activation+9.95e-02

Pos logit contributions
 yourselves+0.039
ety+0.036
ried+0.035
 late+0.034
 hyd+0.034
Neg logit contributions
 thank-0.031
 performance-0.028
 show-0.028
 medal-0.026
 story-0.026
Token back
Token ID736
Feature activation-5.89e+00

Pos logit contributions
 thank+0.002
 show+0.002
 performance+0.002
 story+0.002
 medal+0.002
Neg logit contributions
 yourselves-0.003
ety-0.003
ried-0.003
 late-0.003
neys-0.003
Token mom
Token ID1995
Feature activation+1.12e+00

Pos logit contributions
 yourselves+0.058
ety+0.053
ried+0.052
neys+0.050
 hyd+0.050
Neg logit contributions
 thank-0.034
 show-0.032
 performance-0.030
 story-0.028
 medal-0.028
Token said
Token ID531
Feature activation+2.35e+00

Pos logit contributions
 thank+0.023
 performance+0.023
 show+0.021
 medal+0.021
 leap+0.020
Neg logit contributions
 yourselves-0.038
ety-0.038
ried-0.035
 late-0.034
Or-0.034
Token.
Token ID13
Feature activation-1.60e+00

Pos logit contributions
 thank+0.051
 performance+0.048
 show+0.048
 medal+0.045
 leap+0.043
Neg logit contributions
 yourselves-0.080
ety-0.075
ried-0.073
 late-0.071
 hyd-0.069
Token "
Token ID366
Feature activation-4.70e+00

Pos logit contributions
 yourselves+0.067
ried+0.063
neys+0.061
ety+0.061
 hyd+0.060
Neg logit contributions
 thank-0.029
 show-0.023
 -0.020
 replied-0.020
 story-0.020
TokenR
Token ID49
Feature activation-4.47e-02

Pos logit contributions
ried+0.143
 yourselves+0.130
ety+0.128
aj+0.126
neys+0.125
Neg logit contributions
 thank-0.106
 show-0.105
ner-0.100
odic-0.100
on-0.098
Tokenaz
Token ID1031
Feature activation+7.16e+00

Pos logit contributions
Or+0.001
How+0.001
ried+0.001
 yourselves+0.001
 late+0.001
Neg logit contributions
 story-0.001
on-0.001
 thank-0.001
 performance-0.001
 leap-0.001
Tokenors
Token ID669
Feature activation+1.99e+00

Pos logit contributions
 thank+0.170
 show+0.159
 performance+0.154
on+0.145
 medal+0.143
Neg logit contributions
 yourselves-0.232
ety-0.213
ried-0.205
Or-0.203
neys-0.201
Token are
Token ID389
Feature activation+3.72e-01

Pos logit contributions
 thank+0.033
 performance+0.030
 show+0.030
 medal+0.026
 story+0.025
Neg logit contributions
 yourselves-0.080
ety-0.075
ried-0.072
neys-0.069
 hyd-0.069
Token not
Token ID407
Feature activation-9.37e-01

Pos logit contributions
 thank+0.010
 show+0.009
 start+0.007
 teach+0.007
 performance+0.007
Neg logit contributions
ipt-0.012
 yourselves-0.012
ety-0.012
ried-0.012
oky-0.011
Token toys
Token ID14958
Feature activation+8.38e-01

Pos logit contributions
 yourselves+0.030
ety+0.027
ried+0.027
neys+0.026
Or+0.026
Neg logit contributions
 thank-0.024
 show-0.022
 performance-0.021
 medal-0.020
 story-0.019
Token,
Token ID11
Feature activation-5.69e-01

Pos logit contributions
 thank+0.015
 show+0.014
 performance+0.013
 medal+0.012
 story+0.012
Neg logit contributions
 yourselves-0.033
ety-0.031
ried-0.030
neys-0.029
Or-0.028
Token clown
Token ID25573
Feature activation-2.34e-04

Pos logit contributions
 thank+0.004
 show+0.004
 performance+0.004
 grin+0.004
 leap+0.004
Neg logit contributions
 yourselves-0.006
ety-0.005
Or-0.005
 late-0.005
neys-0.005
Tokens
Token ID82
Feature activation+3.72e-01

Pos logit contributions
ety+0.000
 yourselves+0.000
ried+0.000
aking+0.000
neys+0.000
Neg logit contributions
 thank-0.000
 show-0.000
 performance-0.000
 medal-0.000
 grin-0.000
Token juggling
Token ID48097
Feature activation-6.24e-02

Pos logit contributions
 thank+0.007
 performance+0.007
 show+0.007
 medal+0.006
 leap+0.006
Neg logit contributions
 yourselves-0.013
ety-0.012
ried-0.012
neys-0.012
Or-0.011
Token and
Token ID290
Feature activation+3.47e-01

Pos logit contributions
 yourselves+0.002
ety+0.002
Or+0.002
ried+0.002
neys+0.002
Neg logit contributions
 thank-0.002
 performance-0.001
 show-0.001
ner-0.001
 hardly-0.001
Token ac
Token ID936
Feature activation-8.50e-01

Pos logit contributions
 thank+0.008
 performance+0.007
 story+0.007
 medal+0.007
 show+0.007
Neg logit contributions
 yourselves-0.011
ety-0.010
ried-0.010
Or-0.010
How-0.009
Tokenrob
Token ID22609
Feature activation+7.14e+00

Pos logit contributions
 yourselves+0.034
ety+0.033
ried+0.031
neys+0.031
 late+0.030
Neg logit contributions
 thank-0.015
 performance-0.013
 show-0.013
 story-0.011
 grin-0.011
Tokenats
Token ID1381
Feature activation+4.10e+00

Pos logit contributions
 thank+0.195
 show+0.179
 performance+0.175
 medal+0.163
 leap+0.161
Neg logit contributions
 yourselves-0.217
ety-0.200
ried-0.187
neys-0.183
aking-0.176
Token doing
Token ID1804
Feature activation+3.23e+00

Pos logit contributions
 thank+0.111
 performance+0.106
 story+0.100
 show+0.098
 medal+0.097
Neg logit contributions
 yourselves-0.102
ried-0.100
ety-0.099
 late-0.092
Or-0.088
Token flips
Token ID45971
Feature activation+6.56e-01

Pos logit contributions
 thank+0.073
 performance+0.065
 show+0.064
ner+0.061
on+0.061
Neg logit contributions
 yourselves-0.109
ety-0.094
Or-0.093
How-0.092
ried-0.091
Token.
Token ID13
Feature activation+6.02e-01

Pos logit contributions
 thank+0.017
 performance+0.015
 show+0.015
 grin+0.014
 medal+0.014
Neg logit contributions
 yourselves-0.020
ety-0.019
 late-0.018
ried-0.018
neys-0.017
Token But
Token ID887
Feature activation+1.27e+00

Pos logit contributions
 thank+0.013
 show+0.012
 performance+0.012
on+0.011
 story+0.011
Neg logit contributions
 yourselves-0.021
ety-0.020
ried-0.019
neys-0.018
 hyd-0.018
Token He
Token ID679
Feature activation-1.79e+00

Pos logit contributions
 thank+0.007
 show+0.006
 Everybody+0.006
 Everyone+0.006
 replied+0.005
Neg logit contributions
 yourselves-0.011
ety-0.010
ried-0.010
 hyd-0.009
neys-0.009
Token took
Token ID1718
Feature activation+2.68e-01

Pos logit contributions
 yourselves+0.061
ety+0.058
ried+0.056
Or+0.054
neys+0.053
Neg logit contributions
 thank-0.039
 performance-0.036
 show-0.036
 medal-0.032
 story-0.032
Token out
Token ID503
Feature activation-1.35e+00

Pos logit contributions
 thank+0.006
 performance+0.005
 show+0.005
 medal+0.005
 story+0.005
Neg logit contributions
 yourselves-0.009
ety-0.008
ried-0.008
 late-0.008
neys-0.008
Token a
Token ID257
Feature activation+2.65e+00

Pos logit contributions
 yourselves+0.046
ety+0.042
 late+0.041
ried+0.040
neys+0.039
Neg logit contributions
 thank-0.030
 performance-0.028
 show-0.027
 medal-0.025
 grin-0.025
Token hand
Token ID1021
Feature activation+2.24e-01

Pos logit contributions
 thank+0.069
 performance+0.066
 Everybody+0.064
 show+0.063
 hardly+0.063
Neg logit contributions
ety-0.070
 yourselves-0.069
 late-0.068
 hyd-0.062
Or-0.062
Tokenker
Token ID6122
Feature activation+7.12e+00

Pos logit contributions
 thank+0.006
 performance+0.006
 show+0.006
 medal+0.005
 story+0.005
Neg logit contributions
 yourselves-0.006
ety-0.006
ried-0.006
 late-0.006
 hyd-0.006
Tokenchief
Token ID17351
Feature activation+1.38e+00

Pos logit contributions
 thank+0.158
 performance+0.152
 show+0.147
 story+0.137
 contain+0.136
Neg logit contributions
 yourselves-0.237
ety-0.226
ried-0.213
 hyd-0.204
Or-0.200
Token from
Token ID422
Feature activation-5.84e-01

Pos logit contributions
 thank+0.038
 performance+0.035
 show+0.035
 medal+0.032
 story+0.032
Neg logit contributions
 yourselves-0.038
ety-0.037
ried-0.035
 late-0.035
 hyd-0.033
Token his
Token ID465
Feature activation-1.21e+00

Pos logit contributions
 yourselves+0.022
ety+0.020
ried+0.019
neys+0.019
Or+0.019
Neg logit contributions
 thank-0.012
 show-0.011
 performance-0.010
 medal-0.009
 story-0.009
Token pocket
Token ID10000
Feature activation-1.57e-01

Pos logit contributions
 yourselves+0.038
ety+0.035
ried+0.033
neys+0.032
Or+0.032
Neg logit contributions
 thank-0.031
 show-0.029
 performance-0.029
 medal-0.026
 story-0.026
Token and
Token ID290
Feature activation+1.36e-01

Pos logit contributions
 yourselves+0.005
ety+0.005
ried+0.005
 late+0.004
neys+0.004
Neg logit contributions
 thank-0.004
 performance-0.003
 show-0.003
 medal-0.003
 story-0.003
Token sund
Token ID37437
Feature activation-2.37e+00

Pos logit contributions
 thank+0.046
 performance+0.045
 Everybody+0.043
ner+0.042
 show+0.042
Neg logit contributions
 yourselves-0.056
ety-0.056
Or-0.052
 late-0.052
 hyd-0.052
Tokenae
Token ID3609
Feature activation+1.31e-01

Pos logit contributions
 yourselves+0.068
ety+0.063
ried+0.059
neys+0.058
 hyd+0.058
Neg logit contributions
 thank-0.067
 show-0.061
 performance-0.060
 leap-0.057
 grin-0.056
Token with
Token ID351
Feature activation+3.93e-01

Pos logit contributions
 thank+0.003
 performance+0.003
 Everybody+0.003
 show+0.003
ner+0.003
Neg logit contributions
 yourselves-0.004
ety-0.004
ried-0.003
 hyd-0.003
 late-0.003
Token lots
Token ID6041
Feature activation+2.43e-01

Pos logit contributions
 performance+0.010
 thank+0.009
 story+0.009
ner+0.009
 show+0.009
Neg logit contributions
 yourselves-0.011
ety-0.010
 late-0.010
Or-0.009
How-0.009
Token of
Token ID286
Feature activation+4.02e-01

Pos logit contributions
 performance+0.004
 thank+0.003
 battle+0.003
 show+0.003
 leap+0.003
Neg logit contributions
 yourselves-0.009
 late-0.008
oky-0.008
ety-0.008
neys-0.008
Token topp
Token ID23126
Feature activation+7.05e+00

Pos logit contributions
 performance+0.011
 thank+0.011
 leap+0.011
 battle+0.011
ner+0.010
Neg logit contributions
 yourselves-0.011
ety-0.009
 late-0.009
oky-0.008
How-0.008
Tokenings
Token ID654
Feature activation+4.82e-01

Pos logit contributions
 thank+0.227
 show+0.216
 performance+0.215
 medal+0.198
 leap+0.197
Neg logit contributions
 yourselves-0.178
ety-0.156
ried-0.148
neys-0.140
oky-0.137
Token.
Token ID13
Feature activation+8.44e-01

Pos logit contributions
 thank+0.011
 performance+0.010
 show+0.010
 leap+0.009
 medal+0.009
Neg logit contributions
 yourselves-0.016
ety-0.015
ried-0.014
 late-0.014
oky-0.014
Token
Token ID198
Feature activation+4.98e-01

Pos logit contributions
 thank+0.018
 show+0.017
 performance+0.016
on+0.015
 hardly+0.014
Neg logit contributions
 yourselves-0.030
ety-0.028
ried-0.027
neys-0.027
 hyd-0.026
Token
Token ID198
Feature activation+5.17e-02

Pos logit contributions
 thank+0.013
 show+0.013
 performance+0.012
 medal+0.012
 story+0.011
Neg logit contributions
 yourselves-0.015
ety-0.014
ried-0.013
neys-0.012
 late-0.012
TokenThey
Token ID2990
Feature activation+1.81e+00

Pos logit contributions
 thank+0.001
 show+0.001
 performance+0.001
on+0.001
 story+0.001
Neg logit contributions
 yourselves-0.002
ety-0.001
ried-0.001
neys-0.001
 hyd-0.001
Token faces
Token ID6698
Feature activation+3.61e+00

Pos logit contributions
 thank+0.067
 performance+0.061
 show+0.061
ner+0.058
 medal+0.058
Neg logit contributions
 yourselves-0.075
ety-0.066
neys-0.063
 hyd-0.061
oky-0.061
Token and
Token ID290
Feature activation+2.95e-01

Pos logit contributions
 thank+0.101
 performance+0.097
 show+0.093
 leap+0.092
 medal+0.090
Neg logit contributions
 yourselves-0.094
ety-0.086
Or-0.083
 late-0.083
 hyd-0.082
Token shapes
Token ID15268
Feature activation+4.22e+00

Pos logit contributions
 performance+0.007
 medal+0.006
 thank+0.006
ner+0.006
 story+0.006
Neg logit contributions
 yourselves-0.009
Or-0.008
ety-0.008
 hyd-0.008
ried-0.008
Token with
Token ID351
Feature activation+6.64e-01

Pos logit contributions
 thank+0.097
 performance+0.092
 show+0.089
 leap+0.084
 medal+0.083
Neg logit contributions
 yourselves-0.135
ety-0.124
ried-0.120
Or-0.119
neys-0.118
Token the
Token ID262
Feature activation+1.08e+00

Pos logit contributions
 performance+0.017
 thank+0.016
 show+0.016
 medal+0.015
ner+0.015
Neg logit contributions
 yourselves-0.020
ety-0.018
neys-0.017
 late-0.017
Or-0.017
Token topp
Token ID23126
Feature activation+7.04e+00

Pos logit contributions
 thank+0.027
 performance+0.025
 show+0.025
 leap+0.023
 medal+0.023
Neg logit contributions
 yourselves-0.034
ety-0.030
neys-0.029
 hyd-0.028
Or-0.028
Tokenings
Token ID654
Feature activation+2.24e-01

Pos logit contributions
 thank+0.218
 show+0.204
 performance+0.204
 medal+0.188
 leap+0.188
Neg logit contributions
 yourselves-0.184
ety-0.164
ried-0.154
neys-0.148
 hyd-0.148
Token.
Token ID13
Feature activation+3.29e-02

Pos logit contributions
 thank+0.005
 performance+0.005
 show+0.005
 leap+0.004
 medal+0.004
Neg logit contributions
 yourselves-0.008
ety-0.007
ried-0.007
 late-0.007
neys-0.007
Token
Token ID198
Feature activation-8.88e-02

Pos logit contributions
 thank+0.001
 show+0.001
 Everyone+0.001
 Everybody+0.001
 story+0.001
Neg logit contributions
 yourselves-0.001
ried-0.001
ety-0.001
har-0.001
neys-0.001
Token
Token ID198
Feature activation-4.28e-01

Pos logit contributions
 yourselves+0.003
ety+0.003
ried+0.003
 hyd+0.003
neys+0.003
Neg logit contributions
 thank-0.002
 show-0.002
 performance-0.002
 story-0.002
on-0.002
Token"
Token ID1
Feature activation-4.98e+00

Pos logit contributions
 yourselves+0.012
ried+0.011
ety+0.011
neys+0.010
Or+0.010
Neg logit contributions
 thank-0.012
 show-0.012
 performance-0.011
 story-0.011
 start-0.011
Token it
Token ID340
Feature activation-9.34e-01

Pos logit contributions
 yourselves+0.013
ety+0.012
ried+0.011
Or+0.011
 late+0.011
Neg logit contributions
 thank-0.005
 show-0.005
 performance-0.005
 medal-0.004
 story-0.004
Token.
Token ID13
Feature activation+3.34e-01

Pos logit contributions
 yourselves+0.033
ety+0.031
ried+0.030
 late+0.029
neys+0.029
Neg logit contributions
 thank-0.019
 performance-0.017
 show-0.017
 medal-0.016
 grin-0.015
Token
Token ID198
Feature activation+3.02e-01

Pos logit contributions
 thank+0.008
 show+0.008
 performance+0.008
 medal+0.007
 story+0.007
Neg logit contributions
 yourselves-0.011
ety-0.010
ried-0.009
neys-0.009
Or-0.009
Token
Token ID198
Feature activation-1.02e-01

Pos logit contributions
 thank+0.009
 performance+0.008
 show+0.008
 medal+0.008
 trophy+0.008
Neg logit contributions
 yourselves-0.008
ety-0.007
 late-0.007
ried-0.007
Or-0.007
TokenA
Token ID32
Feature activation+1.02e+00

Pos logit contributions
 yourselves+0.002
ety+0.002
How+0.002
Or+0.002
ried+0.002
Neg logit contributions
 thank-0.003
 show-0.003
 performance-0.003
 medal-0.003
 grin-0.003
Tokenlic
Token ID677
Feature activation+7.02e+00

Pos logit contributions
 thank+0.021
on+0.021
mine+0.020
 ease+0.020
hing+0.019
Neg logit contributions
ety-0.030
 late-0.029
How-0.029
 yourselves-0.028
Or-0.028
Tokenia
Token ID544
Feature activation+1.21e+00

Pos logit contributions
 thank+0.252
 show+0.234
ner+0.231
 performance+0.231
 medal+0.229
Neg logit contributions
 yourselves-0.126
ried-0.113
ety-0.111
Or-0.101
neys-0.100
Token smiled
Token ID13541
Feature activation+2.26e-01

Pos logit contributions
 thank+0.027
 show+0.025
 performance+0.024
 medal+0.022
 story+0.022
Neg logit contributions
 yourselves-0.042
ety-0.039
ried-0.037
neys-0.036
Or-0.036
Token while
Token ID981
Feature activation-1.79e+00

Pos logit contributions
 thank+0.005
 show+0.005
 performance+0.005
 medal+0.004
 story+0.004
Neg logit contributions
 yourselves-0.008
ety-0.007
ried-0.007
neys-0.006
Or-0.006
Token she
Token ID673
Feature activation-1.54e+00

Pos logit contributions
 yourselves+0.056
ety+0.052
 late+0.051
Or+0.050
ried+0.049
Neg logit contributions
 thank-0.039
 performance-0.037
 show-0.037
 trophy-0.037
 medal-0.036
Token looked
Token ID3114
Feature activation+1.53e+00

Pos logit contributions
 yourselves+0.050
ety+0.048
ried+0.046
Or+0.044
neys+0.043
Neg logit contributions
 thank-0.035
 performance-0.033
 show-0.032
 medal-0.030
 story-0.030
Tokenâ
Token ID22940
Feature activation-2.17e-01

Pos logit contributions
 thank+0.015
 show+0.014
 performance+0.013
 Everybody+0.013
ner+0.012
Neg logit contributions
 yourselves-0.020
ety-0.018
ried-0.018
neys-0.017
Or-0.017
Token
Token ID26391
Feature activation+2.55e+00

Pos logit contributions
 yourselves+0.007
ety+0.007
ried+0.007
Or+0.007
neys+0.006
Neg logit contributions
 thank-0.005
 show-0.005
 performance-0.004
 story-0.004
on-0.004
Token
Token ID198
Feature activation+3.65e-01

Pos logit contributions
 thank+0.061
 show+0.056
 performance+0.054
on+0.053
 story+0.052
Neg logit contributions
 yourselves-0.083
ety-0.078
ried-0.076
Or-0.072
oky-0.071
Token
Token ID198
Feature activation-9.13e-02

Pos logit contributions
 thank+0.009
 show+0.009
 performance+0.008
 medal+0.008
 story+0.008
Neg logit contributions
 yourselves-0.012
ety-0.011
ried-0.010
neys-0.010
Or-0.010
TokenM
Token ID44
Feature activation+6.07e-01

Pos logit contributions
 yourselves+0.002
ety+0.002
ried+0.002
neys+0.002
oky+0.002
Neg logit contributions
 thank-0.003
 show-0.003
 performance-0.002
 story-0.002
on-0.002
Tokenaz
Token ID1031
Feature activation+6.98e+00

Pos logit contributions
 thank+0.015
 show+0.013
 performance+0.013
 story+0.013
 medal+0.012
Neg logit contributions
 yourselves-0.020
ety-0.019
ried-0.018
Or-0.018
 hyd-0.017
Tokenzy
Token ID7357
Feature activation+1.29e+00

Pos logit contributions
 thank+0.163
 show+0.151
 performance+0.149
 medal+0.137
 story+0.134
Neg logit contributions
 yourselves-0.236
ety-0.220
ried-0.209
neys-0.200
Or-0.198
Token was
Token ID373
Feature activation+1.34e+00

Pos logit contributions
 thank+0.028
 show+0.026
 performance+0.025
 medal+0.023
 story+0.023
Neg logit contributions
 yourselves-0.045
ety-0.042
ried-0.040
neys-0.039
Or-0.039
Token so
Token ID523
Feature activation+3.61e-01

Pos logit contributions
 thank+0.030
 show+0.028
 performance+0.027
 medal+0.025
 story+0.025
Neg logit contributions
 yourselves-0.046
ety-0.043
ried-0.041
neys-0.040
Or-0.039
Token excited
Token ID6568
Feature activation-9.41e-01

Pos logit contributions
 thank+0.009
 show+0.008
 performance+0.008
 grin+0.007
 story+0.007
Neg logit contributions
 yourselves-0.012
ety-0.011
ried-0.010
neys-0.010
Or-0.010
Token about
Token ID546
Feature activation-7.19e-01

Pos logit contributions
 yourselves+0.033
ety+0.031
 late+0.029
ried+0.028
oky+0.028
Neg logit contributions
 thank-0.020
 performance-0.018
 show-0.018
 medal-0.017
 grin-0.017
Token thought
Token ID1807
Feature activation+1.76e+00

Pos logit contributions
 thank+0.029
 performance+0.026
 show+0.026
 medal+0.024
 story+0.024
Neg logit contributions
 yourselves-0.039
ety-0.037
ried-0.035
Or-0.034
neys-0.034
Token they
Token ID484
Feature activation-4.11e-01

Pos logit contributions
 grin+0.035
 thank+0.035
odic+0.033
 performance+0.032
 show+0.032
Neg logit contributions
 yourselves-0.056
 late-0.049
ety-0.048
Or-0.047
ried-0.046
Token looked
Token ID3114
Feature activation+7.59e-01

Pos logit contributions
 yourselves+0.014
ety+0.014
ried+0.013
neys+0.013
Or+0.013
Neg logit contributions
 thank-0.009
 performance-0.008
 show-0.008
 medal-0.007
 story-0.007
Token like
Token ID588
Feature activation-8.22e-01

Pos logit contributions
 thank+0.018
 show+0.017
 performance+0.017
 medal+0.015
 story+0.015
Neg logit contributions
 yourselves-0.025
ety-0.023
ried-0.022
neys-0.021
Or-0.021
Token ac
Token ID936
Feature activation-9.53e-01

Pos logit contributions
 yourselves+0.027
ety+0.024
Or+0.023
 late+0.023
How+0.023
Neg logit contributions
 thank-0.019
 show-0.017
 performance-0.017
 grin-0.017
 leap-0.016
Tokenrob
Token ID22609
Feature activation+6.97e+00

Pos logit contributions
 yourselves+0.040
ety+0.038
ried+0.036
neys+0.036
 late+0.035
Neg logit contributions
 thank-0.015
 show-0.013
 performance-0.013
 grin-0.011
 story-0.010
Tokenats
Token ID1381
Feature activation+3.70e+00

Pos logit contributions
 thank+0.182
 show+0.166
 performance+0.161
 medal+0.150
 grin+0.148
Neg logit contributions
 yourselves-0.219
ety-0.204
ried-0.188
neys-0.185
oky-0.181
Token.
Token ID13
Feature activation+7.75e-01

Pos logit contributions
 thank+0.094
 performance+0.089
 show+0.086
 grin+0.082
 medal+0.081
Neg logit contributions
 yourselves-0.108
ety-0.103
ried-0.094
 late-0.093
neys-0.091
Token She
Token ID1375
Feature activation-1.31e+00

Pos logit contributions
 thank+0.016
 show+0.015
 Everyone+0.015
 Everybody+0.014
 performance+0.014
Neg logit contributions
 yourselves-0.028
 hyd-0.025
ried-0.025
ety-0.025
har-0.025
Token enjoyed
Token ID8359
Feature activation+8.16e-01

Pos logit contributions
 yourselves+0.042
ety+0.039
ried+0.037
Or+0.036
neys+0.036
Neg logit contributions
 thank-0.032
 show-0.030
 performance-0.030
 medal-0.027
 story-0.027
Token the
Token ID262
Feature activation+1.94e+00

Pos logit contributions
 thank+0.019
 show+0.017
 performance+0.017
 grin+0.017
 contain+0.017
Neg logit contributions
 yourselves-0.027
ety-0.024
Or-0.023
How-0.023
 late-0.022
Token noticed
Token ID6810
Feature activation-2.11e-01

Pos logit contributions
 thank+0.038
 performance+0.035
 show+0.035
 medal+0.032
 story+0.031
Neg logit contributions
 yourselves-0.057
ety-0.054
ried-0.051
Or-0.049
neys-0.049
Token that
Token ID326
Feature activation-4.36e-01

Pos logit contributions
 yourselves+0.008
ety+0.007
ried+0.007
 late+0.006
Or+0.006
Neg logit contributions
 thank-0.005
 performance-0.004
 show-0.004
 grin-0.004
 medal-0.004
Token some
Token ID617
Feature activation+1.35e+00

Pos logit contributions
 yourselves+0.015
ety+0.014
 late+0.013
ried+0.013
Or+0.013
Neg logit contributions
 thank-0.009
 performance-0.008
 show-0.008
 medal-0.008
 leap-0.008
Token of
Token ID286
Feature activation+7.49e-01

Pos logit contributions
 thank+0.023
 performance+0.022
 show+0.021
 leap+0.019
 grin+0.019
Neg logit contributions
 yourselves-0.052
ety-0.048
neys-0.045
 late-0.045
ried-0.045
Token the
Token ID262
Feature activation+8.17e-01

Pos logit contributions
 thank+0.016
 performance+0.015
 show+0.014
 grin+0.014
 medal+0.014
Neg logit contributions
 yourselves-0.027
ety-0.024
 late-0.022
ried-0.022
How-0.022
Token topp
Token ID23126
Feature activation+6.96e+00

Pos logit contributions
 thank+0.020
 performance+0.018
 show+0.018
on+0.017
 leap+0.017
Neg logit contributions
 yourselves-0.026
ety-0.023
ried-0.022
neys-0.022
Or-0.022
Tokenings
Token ID654
Feature activation+9.97e-01

Pos logit contributions
 thank+0.222
 show+0.208
 performance+0.206
 medal+0.189
ner+0.189
Neg logit contributions
 yourselves-0.178
ety-0.155
ried-0.149
neys-0.142
 hyd-0.142
Token were
Token ID547
Feature activation+5.98e-01

Pos logit contributions
 thank+0.020
 performance+0.018
 show+0.018
 medal+0.017
 story+0.016
Neg logit contributions
 yourselves-0.036
ety-0.034
ried-0.032
neys-0.031
 late-0.031
Token missing
Token ID4814
Feature activation-5.19e-01

Pos logit contributions
 thank+0.016
 show+0.015
 performance+0.014
 medal+0.013
 story+0.013
Neg logit contributions
 yourselves-0.018
ety-0.017
ried-0.016
neys-0.016
oky-0.015
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
 yourselves+0.019
ety+0.018
ried+0.018
neys+0.017
Or+0.017
Neg logit contributions
 thank-0.010
 show-0.009
 performance-0.009
 medal-0.008
 story-0.008
Token She
Token ID1375
Feature activation-1.43e+00

Pos logit contributions
 thank+0.005
 Everyone+0.005
 Everybody+0.005
 show+0.005
 story+0.005
Neg logit contributions
 yourselves-0.010
ried-0.009
 hyd-0.009
ety-0.009
aking-0.009
Token saw
Token ID2497
Feature activation-5.99e-01

Pos logit contributions
 yourselves+0.071
ety+0.066
ried+0.063
oky+0.062
neys+0.061
Neg logit contributions
 thank-0.039
 show-0.037
 performance-0.033
 medal-0.031
 story-0.031
Token clown
Token ID25573
Feature activation-2.39e-01

Pos logit contributions
 yourselves+0.020
ety+0.018
ried+0.017
neys+0.017
Or+0.017
Neg logit contributions
 thank-0.014
 show-0.013
 performance-0.013
 medal-0.012
 story-0.012
Tokens
Token ID82
Feature activation-2.72e-02

Pos logit contributions
 yourselves+0.008
ety+0.007
ried+0.007
neys+0.007
oky+0.007
Neg logit contributions
 thank-0.006
 performance-0.005
 show-0.005
 medal-0.005
 grin-0.005
Token and
Token ID290
Feature activation+5.46e-01

Pos logit contributions
 yourselves+0.001
ety+0.001
ried+0.001
neys+0.001
Or+0.001
Neg logit contributions
 thank-0.001
 show-0.001
 performance-0.001
 medal-0.001
 story-0.001
Token ac
Token ID936
Feature activation-1.09e+00

Pos logit contributions
 thank+0.013
 performance+0.012
 show+0.011
on+0.011
 story+0.011
Neg logit contributions
 yourselves-0.018
ety-0.016
Or-0.015
ried-0.015
neys-0.015
Tokenrob
Token ID22609
Feature activation+6.93e+00

Pos logit contributions
 yourselves+0.044
ety+0.041
ried+0.040
Or+0.039
neys+0.038
Neg logit contributions
 thank-0.017
 show-0.016
 performance-0.015
on-0.014
 medal-0.014
Tokenats
Token ID1381
Feature activation+3.75e+00

Pos logit contributions
 thank+0.165
 show+0.154
 performance+0.151
 medal+0.139
 story+0.138
Neg logit contributions
 yourselves-0.228
ety-0.210
ried-0.201
Or-0.194
neys-0.194
Token and
Token ID290
Feature activation+6.21e-01

Pos logit contributions
 thank+0.100
 performance+0.097
 show+0.091
 medal+0.089
 leap+0.086
Neg logit contributions
 yourselves-0.108
ety-0.099
ried-0.096
 late-0.092
 hyd-0.089
Token lions
Token ID31420
Feature activation+1.77e+00

Pos logit contributions
 thank+0.015
 performance+0.015
 show+0.014
on+0.014
ner+0.014
Neg logit contributions
 yourselves-0.019
ety-0.017
Or-0.016
ried-0.016
neys-0.016
Token and
Token ID290
Feature activation+3.46e-01

Pos logit contributions
 thank+0.038
 performance+0.035
 show+0.035
 medal+0.032
 grin+0.031
Neg logit contributions
 yourselves-0.062
ety-0.057
ried-0.055
neys-0.053
Or-0.053
Token elephants
Token ID25368
Feature activation+2.32e+00

Pos logit contributions
 thank+0.009
 performance+0.008
 show+0.008
on+0.007
 story+0.007
Neg logit contributions
 yourselves-0.011
ety-0.010
Or-0.009
ried-0.009
neys-0.009
Token she
Token ID673
Feature activation-3.23e+00

Pos logit contributions
 thank+0.018
 show+0.016
 performance+0.016
 medal+0.014
 leap+0.014
Neg logit contributions
 yourselves-0.042
ety-0.039
ried-0.038
neys-0.037
oky-0.037
Token went
Token ID1816
Feature activation-1.84e+00

Pos logit contributions
 yourselves+0.114
ety+0.105
ried+0.098
neys+0.097
oky+0.097
Neg logit contributions
 thank-0.072
 show-0.067
 performance-0.064
 medal-0.059
on-0.058
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
 yourselves+0.057
ried+0.051
 late+0.050
ety+0.050
 hyd+0.046
Neg logit contributions
 thank-0.046
 performance-0.044
 show-0.041
 jugg-0.040
 story-0.040
Token the
Token ID262
Feature activation+1.96e-01

Pos logit contributions
 thank+0.025
 show+0.024
 performance+0.022
 leap+0.020
 grin+0.020
Neg logit contributions
 yourselves-0.040
ety-0.037
neys-0.035
ried-0.035
oky-0.035
Token hand
Token ID1021
Feature activation+4.67e-01

Pos logit contributions
 thank+0.005
 show+0.004
 performance+0.004
 medal+0.004
 story+0.004
Neg logit contributions
 yourselves-0.007
ety-0.006
ried-0.006
neys-0.006
Or-0.006
Tokenker
Token ID6122
Feature activation+6.89e+00

Pos logit contributions
 thank+0.011
 show+0.010
 performance+0.010
 medal+0.009
 story+0.009
Neg logit contributions
 yourselves-0.016
ety-0.015
ried-0.014
neys-0.013
Or-0.013
Tokenchief
Token ID17351
Feature activation+1.58e+00

Pos logit contributions
 thank+0.144
 show+0.133
 performance+0.132
 story+0.119
 medal+0.118
Neg logit contributions
 yourselves-0.246
ety-0.230
ried-0.221
Or-0.211
neys-0.211
Token unw
Token ID7379
Feature activation+8.66e-01

Pos logit contributions
 thank+0.038
 show+0.035
 performance+0.035
 story+0.032
 medal+0.032
Neg logit contributions
 yourselves-0.051
ety-0.048
ried-0.047
Or-0.044
 late-0.044
Tokeninding
Token ID6020
Feature activation+1.36e+00

Pos logit contributions
 thank+0.027
 show+0.026
 performance+0.026
 medal+0.025
 leap+0.024
Neg logit contributions
 yourselves-0.022
ety-0.020
neys-0.018
oky-0.018
Or-0.018
Token until
Token ID1566
Feature activation+1.24e+00

Pos logit contributions
 thank+0.028
 performance+0.026
 show+0.025
 medal+0.023
 story+0.023
Neg logit contributions
 yourselves-0.049
ety-0.046
ried-0.044
neys-0.042
 late-0.042
Token she
Token ID673
Feature activation-1.87e+00

Pos logit contributions
 thank+0.020
 show+0.018
 performance+0.018
 medal+0.015
 story+0.015
Neg logit contributions
 yourselves-0.051
ety-0.048
ried-0.046
neys-0.045
Or-0.044
Token being
Token ID852
Feature activation-2.78e+00

Pos logit contributions
 thank+0.019
 show+0.017
 performance+0.016
 medal+0.015
 story+0.014
Neg logit contributions
 yourselves-0.036
ety-0.034
ried-0.033
neys-0.032
Or-0.032
Token too
Token ID1165
Feature activation-1.92e+00

Pos logit contributions
 yourselves+0.092
ety+0.087
ried+0.081
ipt+0.079
Or+0.079
Neg logit contributions
 thank-0.066
 show-0.062
 performance-0.060
 medal-0.054
 story-0.053
Token curious
Token ID11040
Feature activation+6.26e-01

Pos logit contributions
 yourselves+0.062
ety+0.060
ried+0.056
ipt+0.055
Or+0.055
Neg logit contributions
 thank-0.047
 show-0.043
 performance-0.040
 story-0.038
 medal-0.037
Token.
Token ID13
Feature activation-4.59e-01

Pos logit contributions
 thank+0.013
 show+0.012
 performance+0.012
 medal+0.011
 story+0.011
Neg logit contributions
 yourselves-0.022
ety-0.020
ried-0.020
neys-0.019
 late-0.019
Token N
Token ID399
Feature activation-1.28e+00

Pos logit contributions
 yourselves+0.016
ety+0.015
ried+0.014
 hyd+0.014
neys+0.014
Neg logit contributions
 thank-0.010
 show-0.009
 performance-0.009
on-0.008
 story-0.008
Tokenont
Token ID756
Feature activation+6.88e+00

Pos logit contributions
 yourselves+0.040
ety+0.037
ried+0.036
Or+0.035
neys+0.034
Neg logit contributions
 thank-0.033
 performance-0.030
 show-0.030
 story-0.028
 medal-0.028
Tokenhe
Token ID258
Feature activation+3.07e+00

Pos logit contributions
 thank+0.181
 show+0.170
 performance+0.167
 medal+0.155
 story+0.152
Neg logit contributions
 yourselves-0.211
ety-0.194
ried-0.184
neys-0.177
Or-0.175
Tokenless
Token ID1203
Feature activation+4.05e-01

Pos logit contributions
 thank+0.069
 performance+0.063
 show+0.062
 leap+0.056
 medal+0.056
Neg logit contributions
 yourselves-0.104
ety-0.096
ried-0.092
Or-0.089
neys-0.088
Token,
Token ID11
Feature activation+2.12e+00

Pos logit contributions
 thank+0.009
 show+0.008
 performance+0.008
 medal+0.007
 leap+0.007
Neg logit contributions
 yourselves-0.014
ety-0.013
ried-0.012
Or-0.012
 late-0.012
Token it
Token ID340
Feature activation-9.48e-01

Pos logit contributions
 thank+0.046
 show+0.043
 performance+0.042
 medal+0.038
 story+0.038
Neg logit contributions
 yourselves-0.074
ety-0.069
ried-0.066
neys-0.064
Or-0.063
Token showed
Token ID3751
Feature activation+2.49e-01

Pos logit contributions
 yourselves+0.033
ety+0.031
ried+0.029
neys+0.029
Or+0.028
Neg logit contributions
 thank-0.021
 show-0.019
 performance-0.019
 medal-0.017
 story-0.017
Token
Token ID198
Feature activation-3.70e-01

Pos logit contributions
 yourselves+0.003
ety+0.002
ried+0.002
neys+0.002
Or+0.002
Neg logit contributions
 thank-0.002
 show-0.002
 performance-0.002
 medal-0.002
 story-0.002
TokenL
Token ID43
Feature activation-2.27e+00

Pos logit contributions
 yourselves+0.010
ety+0.010
ried+0.009
neys+0.009
oky+0.009
Neg logit contributions
 thank-0.011
 show-0.010
 performance-0.010
 medal-0.009
on-0.009
Tokenilly
Token ID6548
Feature activation+2.85e+00

Pos logit contributions
 yourselves+0.072
ety+0.066
ried+0.062
Or+0.062
neys+0.062
Neg logit contributions
 thank-0.057
 show-0.053
 performance-0.052
on-0.049
 story-0.048
Token and
Token ID290
Feature activation+1.75e+00

Pos logit contributions
 thank+0.065
 show+0.060
 performance+0.059
 medal+0.054
 story+0.054
Neg logit contributions
 yourselves-0.097
ety-0.090
ried-0.086
neys-0.083
Or-0.083
Token O
Token ID440
Feature activation+2.21e+00

Pos logit contributions
 thank+0.037
 performance+0.034
 show+0.034
 medal+0.032
 story+0.031
Neg logit contributions
 yourselves-0.061
ety-0.058
ried-0.056
Or-0.052
 late-0.052
Tokenzz
Token ID3019
Feature activation+6.85e+00

Pos logit contributions
 thank+0.065
 show+0.063
 performance+0.063
on+0.063
 story+0.061
Neg logit contributions
 yourselves-0.059
 late-0.049
Or-0.049
ried-0.048
ety-0.048
Tokenie
Token ID494
Feature activation+2.55e+00

Pos logit contributions
 thank+0.145
 show+0.133
 performance+0.128
 medal+0.119
 story+0.116
Neg logit contributions
 yourselves-0.243
ety-0.238
ried-0.229
neys-0.213
Or-0.210
Token played
Token ID2826
Feature activation+1.54e+00

Pos logit contributions
 thank+0.068
 performance+0.063
 show+0.062
 medal+0.059
 story+0.059
Neg logit contributions
ety-0.072
ried-0.070
 yourselves-0.070
Or-0.063
 hyd-0.062
Token together
Token ID1978
Feature activation-8.66e-02

Pos logit contributions
 thank+0.046
 performance+0.043
 show+0.042
 grin+0.041
 medal+0.040
Neg logit contributions
 yourselves-0.040
ried-0.033
ety-0.033
Or-0.033
 late-0.032
Token every
Token ID790
Feature activation-1.22e+00

Pos logit contributions
 yourselves+0.002
ried+0.002
Or+0.002
ety+0.002
 late+0.002
Neg logit contributions
 thank-0.002
odic-0.002
 grin-0.002
 Everybody-0.002
 performance-0.002
Token day
Token ID1110
Feature activation-3.53e-01

Pos logit contributions
 yourselves+0.042
ety+0.041
ried+0.039
 late+0.039
Or+0.038
Neg logit contributions
 thank-0.024
 performance-0.020
 show-0.020
 grin-0.020
 medal-0.019
Token swe
Token ID3490
Feature activation-2.86e-02

Pos logit contributions
 thank+0.079
 show+0.072
 performance+0.072
 medal+0.066
 grin+0.066
Neg logit contributions
 yourselves-0.121
ety-0.110
ried-0.106
neys-0.103
 late-0.103
Tokenaters
Token ID8605
Feature activation+1.70e+00

Pos logit contributions
 yourselves+0.001
 late+0.001
Or+0.001
ried+0.001
 wasteful+0.001
Neg logit contributions
 thank-0.001
 show-0.001
 performance-0.001
on-0.001
 grin-0.001
Token and
Token ID290
Feature activation+7.34e-01

Pos logit contributions
 thank+0.038
 performance+0.036
 show+0.035
 grin+0.033
 story+0.032
Neg logit contributions
 yourselves-0.056
ety-0.052
ried-0.050
 hyd-0.049
neys-0.049
Token hats
Token ID23910
Feature activation+3.67e+00

Pos logit contributions
 performance+0.017
 thank+0.017
 show+0.016
 story+0.015
on+0.015
Neg logit contributions
 yourselves-0.023
ety-0.021
Or-0.020
ried-0.020
neys-0.019
Token and
Token ID290
Feature activation+7.73e-01

Pos logit contributions
 thank+0.087
 performance+0.084
 show+0.081
 story+0.076
 leap+0.076
Neg logit contributions
 yourselves-0.114
ety-0.108
ried-0.104
 late-0.101
 hyd-0.100
Token scar
Token ID10153
Feature activation+6.83e+00

Pos logit contributions
 performance+0.018
 thank+0.018
 show+0.017
 story+0.017
on+0.016
Neg logit contributions
 yourselves-0.024
ety-0.022
Or-0.021
ried-0.021
neys-0.020
Tokenves
Token ID1158
Feature activation+2.40e+00

Pos logit contributions
 performance+0.140
 thank+0.138
 show+0.131
 grin+0.125
 leap+0.124
Neg logit contributions
 yourselves-0.238
ried-0.212
ety-0.204
 late-0.202
neys-0.200
Token from
Token ID422
Feature activation+1.02e-01

Pos logit contributions
 thank+0.054
 performance+0.052
 show+0.051
 story+0.047
 medal+0.046
Neg logit contributions
 yourselves-0.078
ety-0.074
ried-0.072
Or-0.069
neys-0.068
Token the
Token ID262
Feature activation+6.07e-01

Pos logit contributions
 thank+0.002
 show+0.002
 performance+0.002
 medal+0.001
 story+0.001
Neg logit contributions
 yourselves-0.004
ety-0.004
ried-0.004
neys-0.003
oky-0.003
Token wool
Token ID25749
Feature activation+3.07e+00

Pos logit contributions
 thank+0.015
 performance+0.014
 show+0.014
on+0.014
 grin+0.013
Neg logit contributions
 yourselves-0.019
ety-0.017
ried-0.016
Or-0.016
 hyd-0.016
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
 thank+0.071
 performance+0.065
 show+0.063
 grin+0.060
 medal+0.058
Neg logit contributions
 yourselves-0.103
ety-0.096
ried-0.092
 hyd-0.090
neys-0.089
Token He
Token ID679
Feature activation-1.37e+00

Pos logit contributions
 thank+0.011
 show+0.010
 performance+0.009
 medal+0.009
 story+0.009
Neg logit contributions
 yourselves-0.014
ety-0.013
ried-0.012
 hyd-0.012
neys-0.012
Token smiled
Token ID13541
Feature activation+7.34e-01

Pos logit contributions
 yourselves+0.049
ety+0.045
ried+0.043
neys+0.042
oky+0.041
Neg logit contributions
 thank-0.030
 show-0.028
 performance-0.027
 medal-0.024
 story-0.024
Token as
Token ID355
Feature activation-1.52e+00

Pos logit contributions
 thank+0.016
 show+0.015
 performance+0.014
 medal+0.013
 grin+0.013
Neg logit contributions
 yourselves-0.026
ety-0.024
ried-0.023
neys-0.023
Or-0.022
Token the
Token ID262
Feature activation+6.55e-01

Pos logit contributions
 yourselves+0.055
 late+0.051
ety+0.048
ried+0.047
Or+0.047
Neg logit contributions
 performance-0.028
 show-0.025
 story-0.025
 thank-0.025
 grin-0.023
Token rain
Token ID6290
Feature activation+1.42e-01

Pos logit contributions
 thank+0.016
 show+0.015
 performance+0.015
 medal+0.013
 story+0.013
Neg logit contributions
 yourselves-0.021
ety-0.020
ried-0.019
neys-0.018
Or-0.018
Tokendrops
Token ID49253
Feature activation+6.82e+00

Pos logit contributions
 thank+0.003
 show+0.003
 performance+0.003
 medal+0.003
 story+0.003
Neg logit contributions
 yourselves-0.005
ety-0.005
ried-0.004
neys-0.004
Or-0.004
Token tick
Token ID4378
Feature activation+1.30e+00

Pos logit contributions
 thank+0.159
 performance+0.150
 show+0.143
 story+0.143
 medal+0.137
Neg logit contributions
 yourselves-0.214
ety-0.203
ried-0.201
 hyd-0.191
Or-0.190
Tokenled
Token ID992
Feature activation+2.14e-01

Pos logit contributions
 thank+0.034
 show+0.032
 performance+0.031
 medal+0.029
 leap+0.028
Neg logit contributions
 yourselves-0.040
ety-0.037
ried-0.035
neys-0.034
Or-0.034
Token his
Token ID465
Feature activation-7.67e-01

Pos logit contributions
 thank+0.004
 show+0.004
 performance+0.004
 medal+0.003
 story+0.003
Neg logit contributions
 yourselves-0.008
ety-0.007
ried-0.007
Or-0.007
neys-0.007
Token face
Token ID1986
Feature activation+2.11e+00

Pos logit contributions
 yourselves+0.029
ety+0.027
ried+0.026
neys+0.025
 late+0.025
Neg logit contributions
 thank-0.015
 show-0.014
 performance-0.014
 medal-0.012
 story-0.012
Token.
Token ID13
Feature activation+7.20e-01

Pos logit contributions
 thank+0.047
 performance+0.043
 show+0.043
 medal+0.040
 story+0.039
Neg logit contributions
 yourselves-0.069
ety-0.066
ried-0.064
 hyd-0.061
Or-0.061
Token elephants
Token ID25368
Feature activation+9.25e-01

Pos logit contributions
 thank+0.038
 performance+0.036
 show+0.035
 medal+0.033
 story+0.032
Neg logit contributions
 yourselves-0.044
ety-0.040
ried-0.037
Or-0.037
neys-0.036
Token,
Token ID11
Feature activation+1.24e+00

Pos logit contributions
 thank+0.016
 performance+0.015
 show+0.014
 medal+0.013
 hardly+0.013
Neg logit contributions
 yourselves-0.035
ety-0.032
neys-0.031
oky-0.031
Or-0.030
Token lions
Token ID31420
Feature activation+5.92e-01

Pos logit contributions
 thank+0.030
 show+0.028
 performance+0.028
 medal+0.025
 story+0.025
Neg logit contributions
 yourselves-0.041
ety-0.037
ried-0.036
neys-0.034
Or-0.034
Token and
Token ID290
Feature activation-2.01e-02

Pos logit contributions
 thank+0.008
 performance+0.007
 show+0.007
 medal+0.006
 grin+0.006
Neg logit contributions
 yourselves-0.025
ety-0.023
ried-0.023
neys-0.022
Or-0.022
Token ac
Token ID936
Feature activation-1.31e+00

Pos logit contributions
 yourselves+0.001
ety+0.001
Or+0.001
ried+0.001
neys+0.001
Neg logit contributions
 thank-0.001
 performance-0.000
 show-0.000
 grin-0.000
on-0.000
Tokenrob
Token ID22609
Feature activation+6.82e+00

Pos logit contributions
 yourselves+0.054
ety+0.051
ried+0.049
neys+0.048
Or+0.048
Neg logit contributions
 thank-0.021
 show-0.019
 performance-0.018
 medal-0.016
 story-0.015
Tokenats
Token ID1381
Feature activation+3.44e+00

Pos logit contributions
 thank+0.173
 show+0.161
 performance+0.159
 medal+0.147
 story+0.144
Neg logit contributions
 yourselves-0.216
ety-0.199
ried-0.189
neys-0.183
Or-0.180
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
 thank+0.088
 performance+0.086
 medal+0.080
 grin+0.079
 show+0.077
Neg logit contributions
 yourselves-0.104
ety-0.091
ried-0.089
neys-0.084
 hyd-0.083
Token They
Token ID1119
Feature activation+1.55e+00

Pos logit contributions
 yourselves+0.007
ety+0.006
ried+0.006
 hyd+0.006
aking+0.006
Neg logit contributions
 thank-0.007
 show-0.006
 Everyone-0.006
 Everybody-0.006
 performance-0.006
Token are
Token ID389
Feature activation+1.31e+00

Pos logit contributions
 performance+0.030
on+0.028
odic+0.028
 thank+0.027
 medal+0.027
Neg logit contributions
 yourselves-0.053
ety-0.048
Or-0.047
ried-0.047
neys-0.045
Token amazed
Token ID24562
Feature activation+7.14e-01

Pos logit contributions
 thank+0.033
 performance+0.030
 show+0.030
 medal+0.028
on+0.028
Neg logit contributions
 yourselves-0.041
ety-0.038
ried-0.036
neys-0.035
 late-0.035
Token price
Token ID2756
Feature activation+1.67e-01

Pos logit contributions
 thank+0.013
 performance+0.012
 show+0.012
 medal+0.011
 leap+0.010
Neg logit contributions
 yourselves-0.017
ety-0.016
ried-0.015
Or-0.015
 late-0.015
Token tag
Token ID7621
Feature activation+1.04e+00

Pos logit contributions
 thank+0.004
 show+0.003
 performance+0.003
 medal+0.003
 story+0.003
Neg logit contributions
 yourselves-0.006
ety-0.005
ried-0.005
Or-0.005
neys-0.005
Token.
Token ID13
Feature activation-6.60e-01

Pos logit contributions
 thank+0.023
 show+0.021
 performance+0.021
 medal+0.019
 story+0.019
Neg logit contributions
 yourselves-0.036
ety-0.034
ried-0.032
Or-0.031
neys-0.031
Token It
Token ID632
Feature activation-5.96e-01

Pos logit contributions
 yourselves+0.025
ried+0.024
 hyd+0.024
ety+0.023
asured+0.023
Neg logit contributions
 thank-0.013
 show-0.011
 Everyone-0.010
on-0.010
 Everybody-0.010
Token said
Token ID531
Feature activation+1.75e+00

Pos logit contributions
 yourselves+0.025
ety+0.022
oky+0.021
 late+0.021
ried+0.021
Neg logit contributions
 thank-0.010
 show-0.010
 performance-0.008
 replied-0.008
 medal-0.008
Token $
Token ID720
Feature activation-8.52e+00

Pos logit contributions
 thank+0.031
 show+0.029
 performance+0.029
 medal+0.025
 story+0.025
Neg logit contributions
 yourselves-0.067
ety-0.062
ried-0.060
Or-0.059
neys-0.058
Token1
Token ID16
Feature activation+1.63e+00

Pos logit contributions
 yourselves+0.223
ried+0.195
Or+0.193
 hyd+0.191
ety+0.190
Neg logit contributions
 thank-0.249
 show-0.231
on-0.224
 leap-0.218
 performance-0.217
Token.
Token ID13
Feature activation-1.60e+00

Pos logit contributions
 thank+0.023
 show+0.019
 performance+0.018
on+0.016
 medal+0.016
Neg logit contributions
 yourselves-0.071
ety-0.066
ried-0.064
neys-0.063
 late-0.063
Token99
Token ID2079
Feature activation+6.06e-02

Pos logit contributions
 yourselves+0.061
ried+0.058
ety+0.056
neys+0.056
 property+0.056
Neg logit contributions
 thank-0.033
 show-0.027
ner-0.025
on-0.025
 Everyone-0.025
Token per
Token ID583
Feature activation-2.52e+00

Pos logit contributions
 thank+0.001
 show+0.001
 performance+0.001
 medal+0.001
 story+0.001
Neg logit contributions
 yourselves-0.002
ety-0.002
ried-0.002
neys-0.002
Or-0.002
Token pound
Token ID14896
Feature activation+9.72e-01

Pos logit contributions
 yourselves+0.086
ety+0.079
ried+0.076
neys+0.073
Or+0.073
Neg logit contributions
 thank-0.058
 show-0.053
 performance-0.052
 medal-0.048
 story-0.047
TokenThe
Token ID464
Feature activation+1.72e-01

Pos logit contributions
 yourselves+0.141
ried+0.122
ety+0.118
 hyd+0.117
neys+0.117
Neg logit contributions
 thank-0.127
 show-0.123
 ride-0.118
ner-0.117
 hardly-0.115
Token le
Token ID443
Feature activation-1.41e+00

Pos logit contributions
 thank+0.004
 show+0.004
 performance+0.004
 story+0.004
 medal+0.004
Neg logit contributions
 yourselves-0.006
ety-0.005
ried-0.005
neys-0.005
oky-0.005
Tokenopard
Token ID15478
Feature activation+7.85e-01

Pos logit contributions
 yourselves+0.028
ety+0.024
ried+0.024
 late+0.023
 hyd+0.021
Neg logit contributions
 thank-0.051
 show-0.049
 medal-0.047
on-0.047
 prize-0.047
Token toy
Token ID13373
Feature activation-9.47e-01

Pos logit contributions
 thank+0.019
 show+0.018
 performance+0.017
 medal+0.016
 story+0.016
Neg logit contributions
 yourselves-0.026
ety-0.024
ried-0.023
oky-0.022
 late-0.022
Token costs
Token ID3484
Feature activation-2.29e+00

Pos logit contributions
 yourselves+0.030
ety+0.029
ried+0.027
 late+0.026
neys+0.026
Neg logit contributions
 thank-0.023
 show-0.022
 performance-0.021
 medal-0.020
on-0.019
Token $
Token ID720
Feature activation-8.48e+00

Pos logit contributions
 yourselves+0.078
ety+0.075
ried+0.070
oky+0.070
 hyd+0.070
Neg logit contributions
 thank-0.052
 show-0.048
 performance-0.044
 medal-0.043
on-0.042
Token10
Token ID940
Feature activation-5.57e-01

Pos logit contributions
 yourselves+0.218
 hyd+0.195
Or+0.189
neys+0.181
ried+0.181
Neg logit contributions
 thank-0.246
 show-0.233
 performance-0.227
on-0.227
 medal-0.226
Token."
Token ID526
Feature activation+9.38e-01

Pos logit contributions
 yourselves+0.018
ety+0.016
ried+0.015
neys+0.015
Or+0.015
Neg logit contributions
 thank-0.014
 show-0.013
 performance-0.013
 medal-0.012
 story-0.012
Token
Token ID220
Feature activation+2.06e-02

Pos logit contributions
 thank+0.022
 show+0.020
 replied+0.018
on+0.018
 performance+0.018
Neg logit contributions
 yourselves-0.033
ety-0.029
ried-0.029
Or-0.028
neys-0.027
Token
Token ID198
Feature activation-2.01e-01

Pos logit contributions
 thank+0.000
 show+0.000
on+0.000
 Everyone+0.000
 Everybody+0.000
Neg logit contributions
 yourselves-0.001
ety-0.001
 hyd-0.001
ried-0.001
Or-0.001
Token
Token ID198
Feature activation-6.22e-01

Pos logit contributions
 yourselves+0.006
ety+0.006
ried+0.006
neys+0.005
Or+0.005
Neg logit contributions
 thank-0.005
 show-0.005
 performance-0.005
 story-0.004
 medal-0.004
Token the
Token ID262
Feature activation-3.69e-01

Pos logit contributions
 yourselves+0.067
ety+0.062
ried+0.059
neys+0.057
 late+0.057
Neg logit contributions
 thank-0.041
 show-0.038
 performance-0.037
 medal-0.034
 grin-0.033
Token price
Token ID2756
Feature activation+6.21e-01

Pos logit contributions
 yourselves+0.012
ety+0.011
ried+0.010
Or+0.010
 late+0.010
Neg logit contributions
 thank-0.009
 performance-0.008
 show-0.008
on-0.007
 leap-0.007
Token -
Token ID532
Feature activation+1.01e-01

Pos logit contributions
 thank+0.012
 performance+0.011
 show+0.011
 grin+0.010
 leap+0.010
Neg logit contributions
 yourselves-0.023
ety-0.021
ried-0.020
neys-0.020
Or-0.020
Token it
Token ID340
Feature activation-1.74e+00

Pos logit contributions
 thank+0.002
 show+0.002
 performance+0.002
 medal+0.002
 story+0.002
Neg logit contributions
 yourselves-0.003
ety-0.003
ried-0.003
Or-0.003
neys-0.003
Token was
Token ID373
Feature activation-1.31e+00

Pos logit contributions
 yourselves+0.065
ety+0.060
ried+0.057
neys+0.056
oky+0.056
Neg logit contributions
 thank-0.035
 show-0.032
 performance-0.031
 medal-0.028
 story-0.028
Token $
Token ID720
Feature activation-8.45e+00

Pos logit contributions
 yourselves+0.044
ety+0.041
ried+0.039
neys+0.037
Or+0.037
Neg logit contributions
 thank-0.031
 show-0.028
 performance-0.028
 medal-0.026
 story-0.025
Token5
Token ID20
Feature activation-2.04e+00

Pos logit contributions
 yourselves+0.217
Or+0.191
 hyd+0.190
neys+0.183
ety+0.183
Neg logit contributions
 thank-0.245
 show-0.232
on-0.231
 performance-0.229
ner-0.228
Token.
Token ID13
Feature activation-3.40e-02

Pos logit contributions
 yourselves+0.074
ety+0.068
ried+0.066
neys+0.064
 late+0.063
Neg logit contributions
 thank-0.042
 show-0.038
 performance-0.038
 grin-0.034
 medal-0.034
Token Sally
Token ID25737
Feature activation+1.40e+00

Pos logit contributions
 yourselves+0.001
ety+0.001
ried+0.001
neys+0.001
 hyd+0.001
Neg logit contributions
 thank-0.001
 show-0.001
 performance-0.001
 grin-0.001
on-0.001
Token was
Token ID373
Feature activation+1.46e+00

Pos logit contributions
 thank+0.030
 show+0.027
 performance+0.027
 medal+0.024
 story+0.024
Neg logit contributions
 yourselves-0.050
ety-0.047
ried-0.044
neys-0.043
Or-0.043
Token so
Token ID523
Feature activation+6.57e-01

Pos logit contributions
 thank+0.032
 show+0.030
 performance+0.030
 medal+0.027
 story+0.027
Neg logit contributions
 yourselves-0.050
ety-0.046
ried-0.045
neys-0.043
 late-0.043
Token he
Token ID339
Feature activation-1.56e+00

Pos logit contributions
 thank+0.009
 performance+0.008
 show+0.008
 medal+0.007
 story+0.007
Neg logit contributions
 yourselves-0.015
ety-0.014
ried-0.013
Or-0.013
 late-0.013
Token said
Token ID531
Feature activation+2.59e+00

Pos logit contributions
 yourselves+0.057
ety+0.054
ried+0.052
Or+0.050
neys+0.050
Neg logit contributions
 thank-0.031
 performance-0.028
 show-0.028
 medal-0.025
 story-0.025
Token it
Token ID340
Feature activation-2.16e+00

Pos logit contributions
 performance+0.058
 show+0.056
 leap+0.053
 thank+0.051
 battle+0.051
Neg logit contributions
 yourselves-0.082
Or-0.074
ried-0.071
ety-0.071
 late-0.070
Token was
Token ID373
Feature activation-1.51e+00

Pos logit contributions
 yourselves+0.077
ety+0.072
ried+0.069
neys+0.067
Or+0.067
Neg logit contributions
 thank-0.045
 show-0.041
 performance-0.041
 medal-0.037
 story-0.036
Token only
Token ID691
Feature activation-3.02e+00

Pos logit contributions
 yourselves+0.054
ety+0.050
ried+0.048
neys+0.047
Or+0.046
Neg logit contributions
 thank-0.032
 show-0.030
 performance-0.029
 medal-0.026
 story-0.026
Token $
Token ID720
Feature activation-8.38e+00

Pos logit contributions
 yourselves+0.107
ety+0.100
ried+0.095
neys+0.093
Or+0.092
Neg logit contributions
 thank-0.065
 show-0.060
 performance-0.059
 medal-0.054
 story-0.053
Token10
Token ID940
Feature activation-1.09e-01

Pos logit contributions
 yourselves+0.228
Or+0.203
 hyd+0.196
neys+0.192
ried+0.186
Neg logit contributions
 thank-0.228
 show-0.217
on-0.216
ner-0.213
 performance-0.212
Token.
Token ID13
Feature activation+5.17e-02

Pos logit contributions
 yourselves+0.004
ety+0.003
ried+0.003
Or+0.003
neys+0.003
Neg logit contributions
 thank-0.002
 performance-0.002
 show-0.002
 grin-0.002
 story-0.002
Token
Token ID220
Feature activation+3.63e-01

Pos logit contributions
 thank+0.001
 show+0.001
 performance+0.001
 medal+0.001
 story+0.001
Neg logit contributions
 yourselves-0.002
ety-0.002
ried-0.002
neys-0.002
Or-0.002
Token
Token ID198
Feature activation+1.10e-01

Pos logit contributions
 thank+0.007
on+0.006
 show+0.006
 Everyone+0.006
 Everybody+0.006
Neg logit contributions
 yourselves-0.014
 hyd-0.012
ety-0.012
ried-0.012
Or-0.012
Token
Token ID198
Feature activation-2.84e-01

Pos logit contributions
 thank+0.003
 performance+0.003
 show+0.003
 medal+0.003
 leap+0.003
Neg logit contributions
 yourselves-0.003
ety-0.003
 late-0.003
ried-0.002
How-0.002
Token"
Token ID1
Feature activation-4.63e+00

Pos logit contributions
ety+0.029
 yourselves+0.028
ried+0.028
rying+0.027
oky+0.027
Neg logit contributions
 thank-0.021
 show-0.019
 Everyone-0.016
 of-0.016
 ride-0.016
TokenMaybe
Token ID13300
Feature activation-1.28e+00

Pos logit contributions
 yourselves+0.149
neys+0.145
ety+0.142
ney+0.136
oky+0.135
Neg logit contributions
 thank-0.115
 show-0.107
 hardly-0.099
 yes-0.099
 medal-0.098
Token it
Token ID340
Feature activation-1.84e+00

Pos logit contributions
 yourselves+0.041
ety+0.039
ried+0.037
neys+0.036
oky+0.035
Neg logit contributions
 thank-0.032
 show-0.030
 performance-0.028
 medal-0.027
 story-0.026
Token will
Token ID481
Feature activation+5.22e-01

Pos logit contributions
 yourselves+0.068
ety+0.061
 late+0.058
oky+0.057
neys+0.057
Neg logit contributions
 thank-0.039
 show-0.038
 performance-0.033
 story-0.033
 leap-0.031
Token come
Token ID1282
Feature activation-1.41e+00

Pos logit contributions
 thank+0.012
 show+0.011
 performance+0.011
 medal+0.010
on+0.010
Neg logit contributions
 yourselves-0.018
ety-0.017
ried-0.016
Or-0.015
neys-0.015
Token back
Token ID736
Feature activation-7.39e+00

Pos logit contributions
 yourselves+0.049
neys+0.043
oky+0.043
ety+0.042
Or+0.041
Neg logit contributions
 thank-0.033
 show-0.031
 performance-0.027
 story-0.027
 start-0.026
Token if
Token ID611
Feature activation-9.37e-01

Pos logit contributions
 yourselves+0.280
ety+0.258
ipt+0.251
neys+0.249
ried+0.248
Neg logit contributions
 thank-0.139
 show-0.132
 performance-0.112
 story-0.105
 ride-0.101
Token we
Token ID356
Feature activation+2.19e+00

Pos logit contributions
 yourselves+0.030
ety+0.028
ried+0.027
neys+0.026
oky+0.026
Neg logit contributions
 thank-0.024
 show-0.022
 performance-0.021
 medal-0.020
 story-0.019
Token hang
Token ID8181
Feature activation+7.55e-01

Pos logit contributions
 thank+0.041
 show+0.038
 performance+0.037
 medal+0.033
 story+0.033
Neg logit contributions
 yourselves-0.083
ety-0.078
ried-0.074
neys-0.072
Or-0.072
Token here
Token ID994
Feature activation-1.08e+00

Pos logit contributions
 thank+0.017
 show+0.015
 performance+0.014
 medal+0.013
on+0.013
Neg logit contributions
 yourselves-0.026
ety-0.025
ried-0.024
Or-0.024
neys-0.024
Token and
Token ID290
Feature activation-4.89e-01

Pos logit contributions
 yourselves+0.040
ety+0.038
ried+0.036
neys+0.036
oky+0.035
Neg logit contributions
 thank-0.021
 show-0.019
 performance-0.018
 medal-0.017
 story-0.016
Token out
Token ID503
Feature activation-2.04e+00

Pos logit contributions
 yourselves+0.045
ety+0.042
ipt+0.040
ried+0.039
oky+0.039
Neg logit contributions
 thank-0.034
 show-0.032
 performance-0.028
 start-0.027
 medal-0.026
Token.
Token ID13
Feature activation-8.35e-01

Pos logit contributions
 yourselves+0.077
ety+0.075
ried+0.069
ipt+0.068
Or+0.068
Neg logit contributions
 thank-0.039
 show-0.036
 performance-0.032
 story-0.029
on-0.028
Token It
Token ID632
Feature activation-1.29e+00

Pos logit contributions
ety+0.028
 yourselves+0.026
ried+0.026
Or+0.025
rying+0.024
Neg logit contributions
 thank-0.021
 Everyone-0.018
 Everybody-0.017
 show-0.017
 teach-0.016
Token will
Token ID481
Feature activation+8.19e-01

Pos logit contributions
 yourselves+0.048
ety+0.044
oky+0.041
ipt+0.041
ried+0.041
Neg logit contributions
 thank-0.028
 show-0.028
 performance-0.023
 story-0.022
 start-0.022
Token come
Token ID1282
Feature activation-1.38e+00

Pos logit contributions
 thank+0.016
 show+0.015
 performance+0.014
 medal+0.012
 story+0.012
Neg logit contributions
 yourselves-0.031
ety-0.030
ried-0.028
neys-0.027
oky-0.027
Token back
Token ID736
Feature activation-7.23e+00

Pos logit contributions
 yourselves+0.052
ety+0.046
oky+0.045
neys+0.045
 wasteful+0.045
Neg logit contributions
 thank-0.027
 show-0.026
 start-0.022
 performance-0.022
 story-0.021
Token soon
Token ID2582
Feature activation-2.73e+00

Pos logit contributions
 yourselves+0.278
ety+0.259
ipt+0.259
oky+0.251
 wasteful+0.248
Neg logit contributions
 show-0.122
 thank-0.121
 ride-0.094
 goodbye-0.094
,-0.091
Token."
Token ID526
Feature activation+5.26e-01

Pos logit contributions
 yourselves+0.101
ety+0.096
ried+0.091
neys+0.089
ipt+0.088
Neg logit contributions
 thank-0.056
 show-0.052
 performance-0.047
 story-0.043
 medal-0.042
Token
Token ID198
Feature activation-9.66e-01

Pos logit contributions
 thank+0.010
 show+0.009
 story+0.008
on+0.008
 Everyone+0.008
Neg logit contributions
 yourselves-0.020
ried-0.020
ety-0.019
neys-0.018
 hyd-0.018
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
 yourselves+0.034
ety+0.033
ried+0.033
 hyd+0.031
neys+0.031
Neg logit contributions
 thank-0.020
 show-0.019
 story-0.017
on-0.017
 performance-0.016
TokenHe
Token ID1544
Feature activation-2.32e+00

Pos logit contributions
ried+0.035
 yourselves+0.034
ety+0.033
asured+0.032
neys+0.032
Neg logit contributions
 thank-0.028
on-0.027
 show-0.027
 story-0.025
odic-0.024
Token that
Token ID326
Feature activation-8.88e-01

Pos logit contributions
 yourselves+0.024
ety+0.021
 late+0.020
Or+0.020
ried+0.020
Neg logit contributions
 thank-0.016
 performance-0.015
 show-0.015
 story-0.014
 medal-0.014
Token her
Token ID607
Feature activation-1.95e+00

Pos logit contributions
 yourselves+0.033
ety+0.031
ried+0.029
 late+0.029
Or+0.028
Neg logit contributions
 thank-0.018
 performance-0.016
 show-0.016
 medal-0.014
 grin-0.014
Token leg
Token ID1232
Feature activation+1.39e-01

Pos logit contributions
 yourselves+0.060
ety+0.055
ried+0.052
neys+0.051
Or+0.050
Neg logit contributions
 thank-0.050
 show-0.045
 performance-0.045
on-0.042
ner-0.042
Token was
Token ID373
Feature activation-4.27e-01

Pos logit contributions
 thank+0.003
 show+0.003
 performance+0.003
 medal+0.003
 story+0.003
Neg logit contributions
 yourselves-0.005
ety-0.004
ried-0.004
neys-0.004
Or-0.004
Token coming
Token ID2406
Feature activation-1.86e+00

Pos logit contributions
 yourselves+0.012
ety+0.012
ried+0.011
neys+0.011
oky+0.010
Neg logit contributions
 thank-0.012
 show-0.011
 performance-0.011
 medal-0.010
 story-0.010
Token back
Token ID736
Feature activation-7.23e+00

Pos logit contributions
 yourselves+0.064
ety+0.059
ried+0.056
neys+0.055
Or+0.055
Neg logit contributions
 thank-0.042
 show-0.039
 performance-0.038
 medal-0.035
 story-0.034
Token.
Token ID13
Feature activation-4.82e-01

Pos logit contributions
 yourselves+0.279
ety+0.262
Or+0.250
ried+0.248
neys+0.246
Neg logit contributions
 thank-0.127
 show-0.123
 performance-0.112
 story-0.098
 medal-0.097
Token She
Token ID1375
Feature activation-1.45e+00

Pos logit contributions
 yourselves+0.017
ried+0.016
ety+0.016
 hyd+0.015
neys+0.015
Neg logit contributions
 thank-0.011
 show-0.010
 Everyone-0.009
 Everybody-0.009
 replied-0.009
Token was
Token ID373
Feature activation+1.29e+00

Pos logit contributions
 yourselves+0.052
ety+0.048
ried+0.046
neys+0.045
Or+0.044
Neg logit contributions
 thank-0.031
 show-0.029
 performance-0.028
 medal-0.025
 story-0.025
Token reapp
Token ID24578
Feature activation-2.23e+00

Pos logit contributions
 thank+0.034
 show+0.033
 performance+0.031
 grin+0.029
 hardly+0.029
Neg logit contributions
 yourselves-0.040
ety-0.038
ried-0.035
Or-0.035
neys-0.034
Tokenearing
Token ID6648
Feature activation+2.90e-01

Pos logit contributions
 yourselves+0.053
ety+0.047
ried+0.044
Or+0.043
neys+0.043
Neg logit contributions
 thank-0.073
 show-0.070
 performance-0.069
 medal-0.065
 story-0.064
Token busy
Token ID8179
Feature activation-6.56e-01

Pos logit contributions
 yourselves+0.056
ety+0.053
ipt+0.051
ried+0.050
neys+0.049
Neg logit contributions
 thank-0.038
 show-0.036
 performance-0.031
 start-0.030
 story-0.030
Token.
Token ID13
Feature activation-1.93e+00

Pos logit contributions
ety+0.024
 yourselves+0.024
ried+0.023
ipt+0.023
Or+0.023
Neg logit contributions
 thank-0.012
 show-0.012
 performance-0.011
 story-0.010
 ride-0.009
Token It
Token ID632
Feature activation-1.29e+00

Pos logit contributions
ety+0.065
ried+0.063
 yourselves+0.060
rying+0.057
Or+0.057
Neg logit contributions
 thank-0.047
 show-0.040
 Everyone-0.040
 Everybody-0.038
on-0.035
Token will
Token ID481
Feature activation+7.53e-01

Pos logit contributions
 yourselves+0.046
ety+0.044
ipt+0.040
ried+0.040
oky+0.039
Neg logit contributions
 thank-0.029
 show-0.029
 performance-0.024
 story-0.024
 start-0.024
Token come
Token ID1282
Feature activation-1.28e+00

Pos logit contributions
 thank+0.016
 show+0.015
 performance+0.014
 medal+0.013
 story+0.013
Neg logit contributions
 yourselves-0.027
ety-0.025
ried-0.024
neys-0.024
Or-0.023
Token back
Token ID736
Feature activation-7.23e+00

Pos logit contributions
 yourselves+0.046
ety+0.042
oky+0.040
Or+0.040
neys+0.040
Neg logit contributions
 thank-0.027
 show-0.026
 performance-0.023
 start-0.022
 ride-0.022
Token soon
Token ID2582
Feature activation-2.81e+00

Pos logit contributions
 yourselves+0.257
ety+0.244
oky+0.231
ipt+0.228
ried+0.228
Neg logit contributions
 thank-0.145
 show-0.144
 ride-0.120
 performance-0.119
 goodbye-0.114
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
 yourselves+0.097
ety+0.092
ried+0.087
neys+0.085
oky+0.084
Neg logit contributions
 thank-0.065
 show-0.060
 performance-0.056
 story-0.052
 medal-0.050
Token Then
Token ID3244
Feature activation-7.02e-01

Pos logit contributions
ety+0.053
ried+0.050
 yourselves+0.050
rying+0.046
aking+0.045
Neg logit contributions
 thank-0.048
 show-0.041
 Everyone-0.039
 Everybody-0.038
 performance-0.036
Token we
Token ID356
Feature activation+2.04e+00

Pos logit contributions
 yourselves+0.021
ety+0.019
ried+0.018
neys+0.018
oky+0.018
Neg logit contributions
 thank-0.019
 show-0.018
 performance-0.017
 medal-0.016
 story-0.016
Token can
Token ID460
Feature activation+2.86e+00

Pos logit contributions
 thank+0.047
 performance+0.046
 show+0.044
 grin+0.041
 medal+0.041
Neg logit contributions
 yourselves-0.065
ety-0.061
ried-0.059
 hyd-0.057
neys-0.056
Token.
Token ID13
Feature activation-4.86e-01

Pos logit contributions
 thank+0.040
 show+0.037
 performance+0.036
 medal+0.034
 story+0.033
Neg logit contributions
 yourselves-0.054
ety-0.050
ried-0.047
neys-0.046
Or-0.045
Token Maybe
Token ID6674
Feature activation-1.01e+00

Pos logit contributions
 yourselves+0.012
ety+0.012
ried+0.011
aking+0.010
rying+0.010
Neg logit contributions
 thank-0.017
 show-0.015
 teach-0.014
 Everyone-0.013
 Everybody-0.013
Token it
Token ID340
Feature activation-1.42e+00

Pos logit contributions
 yourselves+0.034
ety+0.032
ried+0.030
neys+0.029
oky+0.029
Neg logit contributions
 thank-0.024
 show-0.022
 performance-0.021
 medal-0.020
 story-0.019
Token will
Token ID481
Feature activation+7.66e-01

Pos logit contributions
 yourselves+0.053
ety+0.047
oky+0.045
ried+0.045
ipt+0.045
Neg logit contributions
 thank-0.031
 show-0.030
 story-0.026
 performance-0.026
 medal-0.024
Token come
Token ID1282
Feature activation-1.23e+00

Pos logit contributions
 thank+0.016
 performance+0.015
 show+0.015
 medal+0.013
on+0.013
Neg logit contributions
 yourselves-0.027
ety-0.025
ried-0.024
Or-0.024
neys-0.023
Token back
Token ID736
Feature activation-7.22e+00

Pos logit contributions
 yourselves+0.042
neys+0.037
oky+0.036
ety+0.036
 wasteful+0.035
Neg logit contributions
 thank-0.029
 show-0.028
 performance-0.025
 story-0.024
 start-0.023
Token to
Token ID284
Feature activation+7.95e-01

Pos logit contributions
 yourselves+0.246
ety+0.223
oky+0.218
ried+0.217
ipt+0.216
Neg logit contributions
 thank-0.165
 show-0.160
 performance-0.136
 story-0.135
 goodbye-0.131
Token visit
Token ID3187
Feature activation-1.11e+00

Pos logit contributions
 thank+0.018
 show+0.017
 performance+0.015
 story+0.014
 teach+0.013
Neg logit contributions
 yourselves-0.028
ety-0.027
ried-0.027
neys-0.026
oky-0.025
Token us
Token ID514
Feature activation-2.83e+00

Pos logit contributions
 yourselves+0.037
ety+0.035
ried+0.034
neys+0.034
Or+0.034
Neg logit contributions
 thank-0.025
 show-0.024
 performance-0.022
 story-0.020
 medal-0.020
Token."
Token ID526
Feature activation+6.58e-01

Pos logit contributions
 yourselves+0.097
ety+0.090
oky+0.087
ried+0.086
Or+0.085
Neg logit contributions
 thank-0.063
 show-0.061
 performance-0.056
 story-0.054
 medal-0.051
Token
Token ID198
Feature activation-5.83e-01

Pos logit contributions
 thank+0.013
 show+0.011
 story+0.009
 Everyone+0.009
on+0.009
Neg logit contributions
 yourselves-0.026
ety-0.024
ried-0.024
neys-0.023
 property-0.022
Tokenie
Token ID494
Feature activation+5.40e-01

Pos logit contributions
 thank+0.002
 show+0.001
 performance+0.001
 medal+0.001
 grin+0.001
Neg logit contributions
 yourselves-0.003
ety-0.003
ried-0.003
neys-0.002
 late-0.002
Token guessed
Token ID25183
Feature activation+6.57e-01

Pos logit contributions
 thank+0.011
 show+0.010
 performance+0.010
 replied+0.009
 leap+0.009
Neg logit contributions
 yourselves-0.021
ety-0.018
neys-0.018
oky-0.018
 hyd-0.017
Token that
Token ID326
Feature activation-1.24e+00

Pos logit contributions
 thank+0.014
 performance+0.014
 show+0.013
 grin+0.013
 medal+0.012
Neg logit contributions
 yourselves-0.022
ety-0.020
ried-0.020
 late-0.019
Or-0.019
Token it
Token ID340
Feature activation-2.98e+00

Pos logit contributions
 yourselves+0.045
ety+0.041
ried+0.039
neys+0.038
 late+0.038
Neg logit contributions
 thank-0.025
 performance-0.024
 show-0.023
 grin-0.021
 medal-0.021
Token must
Token ID1276
Feature activation+5.52e-02

Pos logit contributions
 yourselves+0.120
ety+0.108
ried+0.103
oky+0.102
 late+0.102
Neg logit contributions
 thank-0.053
 show-0.050
 performance-0.046
 story-0.043
on-0.042
Token belong
Token ID5594
Feature activation-7.18e+00

Pos logit contributions
 thank+0.001
 performance+0.001
 show+0.001
 medal+0.001
 grin+0.001
Neg logit contributions
 yourselves-0.002
ety-0.002
ried-0.002
neys-0.001
 hyd-0.001
Token to
Token ID284
Feature activation-4.41e-01

Pos logit contributions
 yourselves+0.305
oky+0.290
 wasteful+0.278
ety+0.275
outing+0.274
Neg logit contributions
 show-0.086
 thank-0.078
 of-0.062
,-0.061
 all-0.059
Token one
Token ID530
Feature activation-3.31e+00

Pos logit contributions
 yourselves+0.017
ety+0.016
ried+0.016
oky+0.015
neys+0.015
Neg logit contributions
 thank-0.008
 show-0.008
 performance-0.007
 story-0.006
 teach-0.006
Token the
Token ID262
Feature activation-6.91e-01

Pos logit contributions
 yourselves+0.116
ety+0.109
ried+0.103
neys+0.100
 late+0.100
Neg logit contributions
 thank-0.071
 performance-0.065
 show-0.064
 medal-0.059
 grin-0.058
Token big
Token ID1263
Feature activation+1.26e+00

Pos logit contributions
 yourselves+0.027
ried+0.025
ety+0.025
neys+0.024
iffs+0.024
Neg logit contributions
 thank-0.012
 show-0.012
 story-0.010
 performance-0.010
 medal-0.010
Token people
Token ID661
Feature activation-3.24e-01

Pos logit contributions
 thank+0.027
 show+0.025
 performance+0.024
 medal+0.022
 story+0.022
Neg logit contributions
 yourselves-0.045
ety-0.042
ried-0.040
neys-0.039
Or-0.039
Token sea
Token ID5417
Feature activation-1.51e+00

Pos logit contributions
 yourselves+0.002
ety+0.002
ried+0.002
 hyd+0.002
Or+0.002
Neg logit contributions
 thank-0.001
 performance-0.001
 show-0.001
 medal-0.001
 story-0.001
Token but
Token ID475
Feature activation+1.57e+00

Pos logit contributions
 yourselves+0.056
ety+0.053
ried+0.050
neys+0.049
Or+0.049
Neg logit contributions
 thank-0.030
 show-0.027
 performance-0.027
 medal-0.024
 story-0.023
Token then
Token ID788
Feature activation-1.45e-01

Pos logit contributions
 thank+0.031
 show+0.029
 performance+0.028
 medal+0.026
 story+0.025
Neg logit contributions
 yourselves-0.058
ety-0.054
ried-0.052
neys-0.050
Or-0.050
Token it
Token ID340
Feature activation-2.10e+00

Pos logit contributions
 yourselves+0.005
ety+0.004
 late+0.004
ried+0.004
Or+0.004
Neg logit contributions
 thank-0.003
 performance-0.003
 show-0.003
 medal-0.003
on-0.003
Token came
Token ID1625
Feature activation-2.54e+00

Pos logit contributions
 yourselves+0.076
ety+0.071
ried+0.069
Or+0.066
 late+0.066
Neg logit contributions
 thank-0.043
 show-0.039
 performance-0.038
 medal-0.035
on-0.034
Token back
Token ID736
Feature activation-7.17e+00

Pos logit contributions
 yourselves+0.087
ety+0.081
ried+0.076
neys+0.076
Or+0.075
Neg logit contributions
 thank-0.057
 show-0.053
 performance-0.052
 story-0.047
 medal-0.047
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
 yourselves+0.268
ety+0.250
neys+0.236
Or+0.235
ried+0.234
Neg logit contributions
 thank-0.141
 show-0.134
 performance-0.125
 grin-0.110
 medal-0.109
Token They
Token ID1119
Feature activation+1.36e+00

Pos logit contributions
 yourselves+0.004
ety+0.004
ried+0.004
neys+0.004
 hyd+0.004
Neg logit contributions
 thank-0.003
 show-0.002
 performance-0.002
 story-0.002
 grin-0.002
Token cheered
Token ID34767
Feature activation+1.66e+00

Pos logit contributions
 performance+0.031
 thank+0.031
 story+0.029
 show+0.028
 medal+0.028
Neg logit contributions
ried-0.040
 yourselves-0.040
ety-0.040
 hyd-0.037
Or-0.036
Token together
Token ID1978
Feature activation-2.67e-01

Pos logit contributions
 thank+0.041
 show+0.038
 performance+0.038
 medal+0.036
on+0.036
Neg logit contributions
 yourselves-0.052
ety-0.047
ried-0.046
 late-0.044
 hyd-0.044
Token,
Token ID11
Feature activation+1.55e+00

Pos logit contributions
 yourselves+0.007
ried+0.007
 late+0.007
ety+0.007
 hyd+0.007
Neg logit contributions
 thank-0.006
odic-0.006
on-0.006
 performance-0.006
ner-0.006
Token,
Token ID11
Feature activation+1.45e+00

Pos logit contributions
 yourselves+0.011
ety+0.010
 late+0.010
ried+0.010
neys+0.010
Neg logit contributions
 thank-0.004
 performance-0.004
 show-0.004
 medal-0.003
 grin-0.003
Token there
Token ID612
Feature activation-3.14e-03

Pos logit contributions
 thank+0.029
 performance+0.027
 show+0.027
 story+0.024
 medal+0.024
Neg logit contributions
 yourselves-0.053
ety-0.047
 late-0.047
ried-0.046
 hyd-0.045
Token was
Token ID373
Feature activation-1.57e+00

Pos logit contributions
 yourselves+0.000
ried+0.000
neys+0.000
Or+0.000
ety+0.000
Neg logit contributions
 thank-0.000
 performance-0.000
 show-0.000
 medal-0.000
ner-0.000
Token a
Token ID257
Feature activation+8.48e-01

Pos logit contributions
 yourselves+0.055
ety+0.052
ried+0.050
neys+0.048
Or+0.047
Neg logit contributions
 thank-0.034
 show-0.031
 performance-0.031
 medal-0.028
 story-0.027
Token loud
Token ID7812
Feature activation+1.43e+00

Pos logit contributions
 thank+0.019
 performance+0.019
 show+0.018
on+0.018
ner+0.017
Neg logit contributions
 yourselves-0.025
 late-0.023
ety-0.023
Or-0.023
 hyd-0.023
Token *
Token ID1635
Feature activation-7.17e+00

Pos logit contributions
 thank+0.030
 performance+0.029
 medal+0.028
 contain+0.028
 show+0.027
Neg logit contributions
 yourselves-0.045
 hyd-0.040
 late-0.039
oky-0.038
ety-0.038
Tokenbe
Token ID1350
Feature activation+1.20e+00

Pos logit contributions
 yourselves+0.219
ety+0.198
ried+0.183
neys+0.182
Or+0.179
Neg logit contributions
 thank-0.185
 show-0.177
 performance-0.173
on-0.168
ner-0.162
Tokenep
Token ID538
Feature activation-1.74e+00

Pos logit contributions
 thank+0.034
 show+0.032
 performance+0.031
 medal+0.030
 leap+0.029
Neg logit contributions
 yourselves-0.034
ety-0.031
ried-0.029
neys-0.028
 late-0.028
Token*
Token ID9
Feature activation-2.28e+00

Pos logit contributions
 yourselves+0.060
ried+0.055
ety+0.055
 hyd+0.053
 late+0.052
Neg logit contributions
 thank-0.036
 show-0.033
 performance-0.033
 medal-0.032
 trophy-0.030
Token from
Token ID422
Feature activation-1.27e+00

Pos logit contributions
 yourselves+0.076
ety+0.069
 late+0.067
ried+0.066
 hyd+0.066
Neg logit contributions
 thank-0.052
 show-0.049
 performance-0.049
 medal-0.045
 leap-0.044
Token the
Token ID262
Feature activation-6.29e-02

Pos logit contributions
 yourselves+0.047
ety+0.046
ried+0.043
neys+0.042
Or+0.042
Neg logit contributions
 thank-0.025
 show-0.024
 performance-0.022
 medal-0.020
 story-0.019
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
 yourselves+0.119
ety+0.113
ried+0.108
neys+0.105
Or+0.104
Neg logit contributions
 thank-0.067
 show-0.062
 performance-0.059
 story-0.053
 medal-0.051
Token The
Token ID383
Feature activation+5.01e-01

Pos logit contributions
ety+0.033
 yourselves+0.032
ried+0.031
rying+0.029
Or+0.028
Neg logit contributions
 thank-0.031
 show-0.027
 Everyone-0.025
 Everybody-0.025
 performance-0.024
Token sunshine
Token ID34488
Feature activation+4.20e-01

Pos logit contributions
 thank+0.011
 show+0.010
 performance+0.010
 story+0.009
 medal+0.009
Neg logit contributions
 yourselves-0.018
ety-0.017
ried-0.016
neys-0.015
Or-0.015
Token will
Token ID481
Feature activation+1.02e+00

Pos logit contributions
 thank+0.010
 show+0.010
 performance+0.009
 story+0.008
 start+0.008
Neg logit contributions
 yourselves-0.014
ety-0.013
ried-0.013
oky-0.012
 late-0.012
Token come
Token ID1282
Feature activation-1.09e+00

Pos logit contributions
 thank+0.018
 show+0.017
 performance+0.016
 medal+0.015
 story+0.014
Neg logit contributions
 yourselves-0.039
ety-0.037
ried-0.035
neys-0.034
Or-0.034
Token back
Token ID736
Feature activation-7.15e+00

Pos logit contributions
 yourselves+0.040
ety+0.035
neys+0.035
oky+0.034
Or+0.034
Neg logit contributions
 thank-0.023
 show-0.022
 performance-0.019
 start-0.019
 story-0.019
Token."
Token ID526
Feature activation+3.35e-01

Pos logit contributions
 yourselves+0.270
ety+0.248
ipt+0.246
 wasteful+0.240
oky+0.238
Neg logit contributions
 thank-0.129
 show-0.129
 ride-0.103
 goodbye-0.100
 performance-0.097
Token
Token ID198
Feature activation-8.51e-01

Pos logit contributions
 thank+0.008
 show+0.007
 performance+0.007
 story+0.007
on+0.006
Neg logit contributions
 yourselves-0.011
ried-0.011
ety-0.010
neys-0.010
Or-0.010
Token
Token ID198
Feature activation-1.24e+00

Pos logit contributions
 yourselves+0.027
ety+0.025
ried+0.024
neys+0.023
Or+0.023
Neg logit contributions
 thank-0.021
 show-0.020
 performance-0.019
 story-0.018
on-0.018
TokenL
Token ID43
Feature activation-2.87e+00

Pos logit contributions
ried+0.031
 yourselves+0.030
neys+0.030
ety+0.029
oky+0.028
Neg logit contributions
 thank-0.038
 show-0.037
 write-0.032
 story-0.032
 bow-0.032
Tokenily
Token ID813
Feature activation+7.32e-01

Pos logit contributions
 yourselves+0.085
 hyd+0.081
neys+0.080
ety+0.079
ried+0.078
Neg logit contributions
 thank-0.071
 show-0.066
 performance-0.064
on-0.064
 story-0.063
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
 yourselves+0.021
ety+0.020
ried+0.019
neys+0.018
Or+0.018
Neg logit contributions
 thank-0.013
 show-0.012
 performance-0.012
 medal-0.011
 story-0.010
Token Maybe
Token ID6674
Feature activation-8.61e-01

Pos logit contributions
 yourselves+0.011
ety+0.011
ried+0.011
neys+0.010
rying+0.010
Neg logit contributions
 thank-0.008
 show-0.007
on-0.006
 performance-0.006
 Everybody-0.006
Token it
Token ID340
Feature activation-1.34e+00

Pos logit contributions
 yourselves+0.037
ety+0.035
ried+0.035
neys+0.034
oky+0.033
Neg logit contributions
 thank-0.013
 show-0.011
 performance-0.009
on-0.008
 medal-0.008
Token will
Token ID481
Feature activation+8.08e-01

Pos logit contributions
 yourselves+0.055
oky+0.048
ety+0.047
ipt+0.047
 late+0.047
Neg logit contributions
 thank-0.024
 show-0.024
 story-0.019
 may-0.018
 teach-0.018
Token come
Token ID1282
Feature activation-1.06e+00

Pos logit contributions
 thank+0.016
 performance+0.015
 show+0.014
 medal+0.013
on+0.013
Neg logit contributions
 yourselves-0.030
ety-0.028
ried-0.026
Or-0.026
neys-0.026
Token back
Token ID736
Feature activation-7.15e+00

Pos logit contributions
 yourselves+0.036
neys+0.032
oky+0.032
ety+0.032
ipt+0.031
Neg logit contributions
 thank-0.025
 show-0.024
 performance-0.021
 story-0.021
 start-0.020
Token and
Token ID290
Feature activation-4.39e-01

Pos logit contributions
 yourselves+0.257
ipt+0.238
oky+0.237
ety+0.233
neys+0.232
Neg logit contributions
 thank-0.143
 show-0.139
 goodbye-0.112
 story-0.111
 performance-0.110
Token sing
Token ID1702
Feature activation+1.56e+00

Pos logit contributions
 yourselves+0.018
ety+0.017
ried+0.016
oky+0.016
neys+0.016
Neg logit contributions
 thank-0.008
 show-0.007
 performance-0.006
 medal-0.006
 story-0.006
Token for
Token ID329
Feature activation+6.25e-02

Pos logit contributions
 thank+0.039
 show+0.037
 performance+0.036
 medal+0.033
 story+0.033
Neg logit contributions
 yourselves-0.050
ety-0.047
ried-0.044
neys-0.043
oky-0.042
Token us
Token ID514
Feature activation-1.78e+00

Pos logit contributions
 thank+0.002
 show+0.002
 performance+0.001
 medal+0.001
 story+0.001
Neg logit contributions
 yourselves-0.002
ety-0.002
ried-0.002
neys-0.002
oky-0.002
Token again
Token ID757
Feature activation-2.95e+00

Pos logit contributions
 yourselves+0.057
ety+0.052
ried+0.050
oky+0.049
Or+0.048
Neg logit contributions
 thank-0.045
 show-0.042
 performance-0.041
 story-0.038
 medal-0.038
Token night
Token ID1755
Feature activation-6.05e-01

Pos logit contributions
 yourselves+0.149
ety+0.138
ried+0.137
neys+0.131
Or+0.130
Neg logit contributions
 thank-0.044
 show-0.040
 performance-0.037
 story-0.034
 medal-0.030
Token,
Token ID11
Feature activation-2.79e-01

Pos logit contributions
 yourselves+0.024
ety+0.023
 late+0.022
oky+0.022
 hyd+0.021
Neg logit contributions
 thank-0.009
 performance-0.008
 show-0.008
 medal-0.008
 leap-0.007
Token the
Token ID262
Feature activation-7.72e-01

Pos logit contributions
 yourselves+0.006
 late+0.005
ety+0.005
How+0.005
Or+0.004
Neg logit contributions
 performance-0.010
 thank-0.009
odic-0.009
 leap-0.009
 grin-0.009
Token noise
Token ID7838
Feature activation+8.91e-01

Pos logit contributions
 yourselves+0.027
ety+0.025
ried+0.024
neys+0.023
Or+0.023
Neg logit contributions
 thank-0.017
 show-0.016
 performance-0.016
 medal-0.014
 story-0.014
Token came
Token ID1625
Feature activation-3.19e+00

Pos logit contributions
 thank+0.018
 performance+0.017
 show+0.016
 medal+0.015
 grin+0.015
Neg logit contributions
 yourselves-0.031
ety-0.030
ried-0.028
 late-0.028
 hyd-0.027
Token back
Token ID736
Feature activation-7.15e+00

Pos logit contributions
 yourselves+0.112
ety+0.103
ried+0.098
neys+0.097
Or+0.096
Neg logit contributions
 thank-0.070
 show-0.065
 performance-0.064
 story-0.058
 medal-0.057
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
 yourselves+0.275
ety+0.251
Or+0.245
ried+0.241
neys+0.239
Neg logit contributions
 show-0.141
 thank-0.136
 performance-0.125
 story-0.109
 ride-0.108
Token It
Token ID632
Feature activation-1.31e+00

Pos logit contributions
 yourselves+0.016
ety+0.015
ried+0.014
neys+0.013
Or+0.013
Neg logit contributions
 thank-0.013
 show-0.012
 performance-0.011
 story-0.010
 grin-0.010
Token was
Token ID373
Feature activation-1.58e+00

Pos logit contributions
 yourselves+0.049
ety+0.045
ried+0.043
neys+0.042
Or+0.042
Neg logit contributions
 thank-0.027
 show-0.025
 performance-0.024
 medal-0.021
 story-0.021
Token louder
Token ID27089
Feature activation-6.72e-01

Pos logit contributions
 yourselves+0.053
ety+0.049
ried+0.047
neys+0.045
Or+0.045
Neg logit contributions
 thank-0.038
 show-0.036
 performance-0.035
 story-0.032
 medal-0.031
Token this
Token ID428
Feature activation-1.92e+00

Pos logit contributions
 yourselves+0.023
ety+0.022
ried+0.021
neys+0.020
Or+0.020
Neg logit contributions
 thank-0.015
 show-0.014
 performance-0.014
 story-0.013
 leap-0.012
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
 yourselves+0.073
ety+0.068
neys+0.066
ried+0.065
oky+0.065
Neg logit contributions
 thank-0.027
 show-0.024
 performance-0.021
 medal-0.019
on-0.019
Token it
Token ID340
Feature activation-1.91e+00

Pos logit contributions
 thank+0.027
 performance+0.026
 show+0.025
 medal+0.024
 grin+0.023
Neg logit contributions
 yourselves-0.055
ety-0.049
ried-0.047
Or-0.047
neys-0.047
Token wouldn
Token ID3636
Feature activation+1.14e+00

Pos logit contributions
 yourselves+0.083
ety+0.074
oky+0.072
neys+0.070
ried+0.070
Neg logit contributions
 thank-0.030
 show-0.028
 performance-0.024
 story-0.022
 start-0.022
Token't
Token ID470
Feature activation-1.01e+00

Pos logit contributions
 thank+0.020
 performance+0.019
 show+0.017
 medal+0.017
 grin+0.017
Neg logit contributions
 yourselves-0.044
ety-0.041
 late-0.039
ried-0.039
 hyd-0.038
Token come
Token ID1282
Feature activation-1.46e+00

Pos logit contributions
 yourselves+0.034
ety+0.033
ried+0.031
 late+0.030
 hyd+0.029
Neg logit contributions
 thank-0.021
 performance-0.021
 show-0.019
on-0.019
 medal-0.019
Token back
Token ID736
Feature activation-7.12e+00

Pos logit contributions
 yourselves+0.054
ipt+0.048
neys+0.048
oky+0.048
Or+0.048
Neg logit contributions
 thank-0.028
 show-0.026
 performance-0.023
 start-0.022
 story-0.022
Token.
Token ID13
Feature activation+2.18e-01

Pos logit contributions
 yourselves+0.259
ety+0.238
ipt+0.232
oky+0.231
neys+0.230
Neg logit contributions
 thank-0.138
 show-0.134
 performance-0.119
 story-0.108
 start-0.107
Token Mary
Token ID5335
Feature activation+1.05e+00

Pos logit contributions
 thank+0.004
 show+0.004
 performance+0.004
 story+0.004
 grin+0.003
Neg logit contributions
 yourselves-0.008
ety-0.008
ried-0.007
 hyd-0.007
 late-0.007
Token nodded
Token ID14464
Feature activation+2.60e+00

Pos logit contributions
 thank+0.021
 show+0.020
 performance+0.019
 medal+0.017
 story+0.017
Neg logit contributions
 yourselves-0.039
ety-0.036
ried-0.034
neys-0.034
oky-0.034
Token and
Token ID290
Feature activation+3.48e-01

Pos logit contributions
 thank+0.058
 show+0.054
 performance+0.053
 medal+0.049
 story+0.048
Neg logit contributions
 yourselves-0.089
ety-0.083
ried-0.079
Or-0.076
 late-0.076
Token made
Token ID925
Feature activation+2.99e+00

Pos logit contributions
 thank+0.008
 performance+0.008
 show+0.007
 story+0.007
 medal+0.007
Neg logit contributions
 yourselves-0.011
ety-0.011
ried-0.010
Or-0.010
 hyd-0.010
Token it
Token ID340
Feature activation-2.56e+00

Pos logit contributions
 performance+0.033
odic+0.033
ner+0.032
 grin+0.032
 thank+0.031
Neg logit contributions
 yourselves-0.047
 late-0.039
 hyd-0.038
ried-0.037
neys-0.036
Token so
Token ID523
Feature activation-5.65e-01

Pos logit contributions
 yourselves+0.076
ety+0.072
 late+0.072
ried+0.071
 hyd+0.068
Neg logit contributions
 thank-0.058
 performance-0.054
 grin-0.054
 medal-0.052
ner-0.052
Token it
Token ID340
Feature activation-1.94e+00

Pos logit contributions
 yourselves+0.022
ety+0.021
ried+0.020
 late+0.020
neys+0.020
Neg logit contributions
 thank-0.009
 performance-0.009
 show-0.009
 story-0.008
 grin-0.008
Token could
Token ID714
Feature activation+2.97e+00

Pos logit contributions
 yourselves+0.076
ety+0.070
ried+0.066
neys+0.065
oky+0.065
Neg logit contributions
 thank-0.036
 show-0.033
 performance-0.032
 medal-0.028
 story-0.028
Token come
Token ID1282
Feature activation-1.32e+00

Pos logit contributions
 thank+0.056
 performance+0.054
 grin+0.050
ner+0.049
 medal+0.049
Neg logit contributions
 yourselves-0.104
ety-0.095
 hyd-0.093
ried-0.093
Or-0.092
Token back
Token ID736
Feature activation-7.12e+00

Pos logit contributions
 yourselves+0.045
ety+0.042
ried+0.039
neys+0.039
Or+0.039
Neg logit contributions
 thank-0.030
 show-0.029
 performance-0.027
 medal-0.025
 story-0.025
Token.
Token ID13
Feature activation+4.43e-01

Pos logit contributions
 yourselves+0.251
ety+0.233
ried+0.222
Or+0.216
neys+0.216
Neg logit contributions
 thank-0.154
 show-0.145
 performance-0.140
 medal-0.127
 story-0.125
Token He
Token ID679
Feature activation-1.89e+00

Pos logit contributions
 thank+0.009
 show+0.008
 performance+0.008
 story+0.007
on+0.007
Neg logit contributions
 yourselves-0.016
ety-0.015
ried-0.015
Or-0.014
neys-0.014
Token said
Token ID531
Feature activation+2.53e+00

Pos logit contributions
 yourselves+0.063
ety+0.059
ried+0.056
Or+0.054
neys+0.054
Neg logit contributions
 thank-0.044
 show-0.041
 performance-0.040
 medal-0.037
 story-0.037
Token "
Token ID366
Feature activation-4.75e+00

Pos logit contributions
 performance+0.042
 show+0.042
 thank+0.037
 leap+0.036
odic+0.035
Neg logit contributions
 yourselves-0.093
Or-0.083
ried-0.083
 late-0.082
ety-0.082
Tokenrocket
Token ID30431
Feature activation-2.03e+00

Pos logit contributions
 yourselves+0.135
ety+0.113
neys+0.112
ried+0.111
 wasteful+0.107
Neg logit contributions
 thank-0.132
 show-0.130
 performance-0.127
on-0.121
 ride-0.120
Token click
Token ID3904
Feature activation+3.03e+00

Pos logit contributions
 thank+0.044
 performance+0.043
on+0.041
 medal+0.041
ner+0.040
Neg logit contributions
 yourselves-0.063
Or-0.058
 hyd-0.055
 late-0.055
How-0.055
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 thank+0.077
 performance+0.070
 leap+0.069
 Everybody+0.066
 medal+0.066
Neg logit contributions
 yourselves-0.092
 hyd-0.080
 late-0.079
ried-0.077
Or-0.075
Token The
Token ID383
Feature activation+6.00e-01

Pos logit contributions
 yourselves+0.036
ety+0.034
ried+0.032
neys+0.030
 hyd+0.030
Neg logit contributions
 thank-0.024
 show-0.021
 performance-0.020
 story-0.019
on-0.019
Token screen
Token ID3159
Feature activation+2.25e+00

Pos logit contributions
 thank+0.012
 leap+0.011
 hardly+0.011
 contain+0.011
 ease+0.011
Neg logit contributions
 yourselves-0.020
oky-0.017
 hyd-0.017
 late-0.017
Or-0.017
Token came
Token ID1625
Feature activation-2.74e+00

Pos logit contributions
 thank+0.051
 performance+0.047
 show+0.045
 medal+0.043
 leap+0.042
Neg logit contributions
 yourselves-0.072
ety-0.065
ried-0.065
Or-0.064
 hyd-0.063
Token back
Token ID736
Feature activation-7.12e+00

Pos logit contributions
 yourselves+0.101
ety+0.094
ried+0.090
neys+0.088
Or+0.087
Neg logit contributions
 thank-0.055
 show-0.050
 performance-0.049
 medal-0.044
 story-0.044
Token on
Token ID319
Feature activation+5.90e-01

Pos logit contributions
 yourselves+0.251
ety+0.234
ried+0.221
Or+0.215
neys+0.214
Neg logit contributions
 thank-0.157
 show-0.152
 performance-0.143
 story-0.129
 medal-0.128
Token,
Token ID11
Feature activation+7.77e-01

Pos logit contributions
 thank+0.014
 performance+0.013
 show+0.013
 medal+0.012
 leap+0.012
Neg logit contributions
 yourselves-0.019
 late-0.017
ety-0.017
ried-0.016
 hyd-0.016
Token but
Token ID475
Feature activation+7.73e-01

Pos logit contributions
 thank+0.017
 show+0.017
 start+0.014
 performance+0.014
 grin+0.014
Neg logit contributions
 yourselves-0.028
ety-0.027
neys-0.026
ried-0.025
oky-0.025
Token the
Token ID262
Feature activation+3.50e-01

Pos logit contributions
 thank+0.016
 show+0.015
 performance+0.014
 medal+0.013
 story+0.013
Neg logit contributions
 yourselves-0.028
ety-0.027
ried-0.025
neys-0.025
Or-0.024
Token movie
Token ID3807
Feature activation+1.92e-01

Pos logit contributions
 thank+0.008
 performance+0.007
 hardly+0.007
 leap+0.007
 show+0.007
Neg logit contributions
 yourselves-0.012
 hyd-0.010
 late-0.010
Or-0.010
ety-0.010
Token away
Token ID1497
Feature activation-3.31e+00

Pos logit contributions
 yourselves+0.080
ety+0.073
ried+0.069
 late+0.068
oky+0.066
Neg logit contributions
 thank-0.077
 performance-0.071
 show-0.070
 medal-0.066
 grin-0.066
Token.
Token ID13
Feature activation+5.88e-01

Pos logit contributions
 yourselves+0.127
ety+0.119
ried+0.113
neys+0.110
Or+0.109
Neg logit contributions
 thank-0.062
 show-0.058
 performance-0.056
 medal-0.049
 story-0.049
Token The
Token ID383
Feature activation+7.45e-01

Pos logit contributions
 thank+0.014
 show+0.013
 performance+0.012
 Everybody+0.011
 Everyone+0.011
Neg logit contributions
 yourselves-0.020
ety-0.019
ried-0.018
 hyd-0.018
Or-0.017
Token sun
Token ID4252
Feature activation+1.32e+00

Pos logit contributions
 thank+0.019
 performance+0.017
on+0.017
 hardly+0.016
ner+0.016
Neg logit contributions
 yourselves-0.022
ety-0.020
 late-0.020
How-0.019
 hyd-0.018
Token came
Token ID1625
Feature activation-2.62e+00

Pos logit contributions
 thank+0.028
 show+0.025
 performance+0.025
 medal+0.023
 story+0.022
Neg logit contributions
 yourselves-0.047
ety-0.044
ried-0.042
neys-0.041
Or-0.041
Token back
Token ID736
Feature activation-7.10e+00

Pos logit contributions
 yourselves+0.098
ety+0.090
ried+0.086
neys+0.085
Or+0.084
Neg logit contributions
 thank-0.052
 show-0.048
 performance-0.046
 story-0.041
 medal-0.041
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
 yourselves+0.272
ety+0.248
ried+0.239
Or+0.238
neys+0.236
Neg logit contributions
 thank-0.135
 show-0.131
 performance-0.121
 story-0.109
 medal-0.107
Token and
Token ID290
Feature activation-3.51e-01

Pos logit contributions
 thank+0.025
 show+0.023
 performance+0.021
 story+0.019
 medal+0.019
Neg logit contributions
 yourselves-0.041
ety-0.039
ried-0.037
neys-0.036
oky-0.036
Token everyone
Token ID2506
Feature activation+3.90e-02

Pos logit contributions
 yourselves+0.014
ety+0.013
ried+0.012
neys+0.012
Or+0.012
Neg logit contributions
 thank-0.006
 show-0.006
 performance-0.005
 medal-0.005
 story-0.005
Token was
Token ID373
Feature activation+1.40e+00

Pos logit contributions
 thank+0.001
 performance+0.001
 show+0.001
 medal+0.001
 leap+0.001
Neg logit contributions
 yourselves-0.001
ety-0.001
ried-0.001
Or-0.001
 hyd-0.001
Token happy
Token ID3772
Feature activation-1.14e+00

Pos logit contributions
 thank+0.035
 show+0.033
 performance+0.032
 story+0.030
 medal+0.029
Neg logit contributions
 yourselves-0.045
ety-0.041
ried-0.039
neys-0.038
Or-0.038
Token mom
Token ID1995
Feature activation+7.39e-01

Pos logit contributions
 yourselves+0.043
ety+0.041
ried+0.038
 late+0.038
 hyd+0.037
Neg logit contributions
 thank-0.027
 show-0.024
 performance-0.024
 medal-0.022
 leap-0.022
Token said
Token ID531
Feature activation+2.19e+00

Pos logit contributions
 thank+0.016
 performance+0.015
 show+0.014
 medal+0.013
 leap+0.013
Neg logit contributions
 yourselves-0.026
ety-0.025
ried-0.023
Or-0.022
neys-0.022
Token,
Token ID11
Feature activation+2.03e-01

Pos logit contributions
 performance+0.046
 show+0.045
 thank+0.042
 medal+0.041
 leap+0.041
Neg logit contributions
 yourselves-0.073
ety-0.067
Or-0.064
 late-0.063
ried-0.061
Token "
Token ID366
Feature activation-5.29e+00

Pos logit contributions
 thank+0.005
 show+0.004
 performance+0.004
 story+0.004
on+0.004
Neg logit contributions
 yourselves-0.007
ety-0.007
ried-0.006
neys-0.006
oky-0.006
TokenThink
Token ID22073
Feature activation+6.57e-01

Pos logit contributions
 yourselves+0.134
ried+0.128
neys+0.127
asured+0.115
aj+0.113
Neg logit contributions
 show-0.144
 thank-0.142
ner-0.131
 replied-0.131
 write-0.131
Token back
Token ID736
Feature activation-7.09e+00

Pos logit contributions
 thank+0.011
 show+0.010
 performance+0.010
 medal+0.009
 grin+0.008
Neg logit contributions
 yourselves-0.026
ety-0.025
ried-0.024
neys-0.023
 late-0.023
Token to
Token ID284
Feature activation+1.94e-01

Pos logit contributions
 yourselves+0.278
ety+0.270
oky+0.261
ried+0.259
ipt+0.258
Neg logit contributions
 thank-0.124
 show-0.117
 performance-0.093
 story-0.091
 goodbye-0.085
Token your
Token ID534
Feature activation-8.68e-01

Pos logit contributions
 thank+0.005
 show+0.004
 performance+0.004
 story+0.004
 medal+0.004
Neg logit contributions
 yourselves-0.006
ety-0.006
ried-0.006
neys-0.006
oky-0.006
Token memory
Token ID4088
Feature activation-5.51e-01

Pos logit contributions
 yourselves+0.030
ety+0.028
ried+0.028
Or+0.027
neys+0.027
Neg logit contributions
 show-0.020
 thank-0.020
 performance-0.018
 story-0.018
 medal-0.016
Token of
Token ID286
Feature activation-4.22e-01

Pos logit contributions
 yourselves+0.019
ety+0.018
ried+0.017
Or+0.016
neys+0.016
Neg logit contributions
 thank-0.013
 show-0.012
 performance-0.011
 medal-0.010
 story-0.010
Token where
Token ID810
Feature activation-1.17e-01

Pos logit contributions
 yourselves+0.013
ety+0.013
ried+0.012
neys+0.012
oky+0.012
Neg logit contributions
 thank-0.011
 show-0.010
 performance-0.010
 medal-0.009
 story-0.009
Token brave
Token ID14802
Feature activation-2.65e-01

Pos logit contributions
 thank+0.018
 show+0.017
 performance+0.015
 story+0.014
 medal+0.014
Neg logit contributions
 yourselves-0.027
ety-0.027
ried-0.026
oky-0.025
ipt-0.025
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
 yourselves+0.010
ety+0.009
ried+0.009
neys+0.008
Or+0.008
Neg logit contributions
 thank-0.006
 show-0.005
 performance-0.005
 story-0.004
 medal-0.004
Token You
Token ID921
Feature activation+1.29e+00

Pos logit contributions
ried+0.043
ety+0.042
 yourselves+0.042
har+0.038
rying+0.038
Neg logit contributions
 thank-0.040
 show-0.034
 Everyone-0.031
 Everybody-0.031
 performance-0.031
Token did
Token ID750
Feature activation+3.97e+00

Pos logit contributions
 performance+0.022
 thank+0.022
 show+0.020
 medal+0.019
 grin+0.019
Neg logit contributions
 yourselves-0.048
ety-0.045
ried-0.043
neys-0.042
 hyd-0.042
Token not
Token ID407
Feature activation-1.01e+00

Pos logit contributions
 thank+0.107
 performance+0.104
 show+0.100
 grin+0.099
on+0.097
Neg logit contributions
 yourselves-0.119
ety-0.101
 late-0.098
How-0.094
neys-0.092
Token let
Token ID1309
Feature activation+4.32e+00

Pos logit contributions
 yourselves+0.037
ety+0.034
ried+0.033
neys+0.032
Or+0.032
Neg logit contributions
 thank-0.021
 show-0.019
 performance-0.019
 medal-0.017
 story-0.017
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
 thank+0.112
 performance+0.105
 show+0.105
 grin+0.097
 medal+0.097
Neg logit contributions
 yourselves-0.134
ety-0.121
ried-0.113
neys-0.109
 late-0.109
Token big
Token ID1263
Feature activation+2.29e+00

Pos logit contributions
 thank+0.031
 performance+0.028
 show+0.028
on+0.027
 hardly+0.026
Neg logit contributions
 yourselves-0.049
ety-0.044
ried-0.042
 hyd-0.042
Or-0.041
Token boy
Token ID2933
Feature activation-1.23e+00

Pos logit contributions
 thank+0.054
 performance+0.049
 show+0.049
 hardly+0.047
 medal+0.046
Neg logit contributions
 yourselves-0.074
ety-0.068
ried-0.063
 hyd-0.063
neys-0.062
Token take
Token ID1011
Feature activation-1.58e-02

Pos logit contributions
 yourselves+0.048
ety+0.045
ried+0.043
neys+0.041
 late+0.041
Neg logit contributions
 thank-0.022
 performance-0.020
 show-0.019
 medal-0.018
 grin-0.017
Token your
Token ID534
Feature activation-6.66e-01

Pos logit contributions
 yourselves+0.001
ety+0.000
ried+0.000
neys+0.000
Or+0.000
Neg logit contributions
 thank-0.000
 show-0.000
 performance-0.000
 medal-0.000
 story-0.000
Token about
Token ID546
Feature activation-8.20e-01

Pos logit contributions
 thank+0.030
 performance+0.028
 show+0.027
 leap+0.026
 grin+0.026
Neg logit contributions
 yourselves-0.050
ety-0.045
 late-0.043
neys-0.043
ried-0.043
Token cleaning
Token ID12724
Feature activation+3.24e-02

Pos logit contributions
 yourselves+0.029
ety+0.028
ried+0.028
aking+0.027
oky+0.026
Neg logit contributions
 thank-0.017
 show-0.016
 performance-0.015
 story-0.014
 medal-0.013
Token up
Token ID510
Feature activation-2.67e+00

Pos logit contributions
 thank+0.001
 show+0.001
 performance+0.001
 story+0.000
 medal+0.000
Neg logit contributions
 yourselves-0.001
ety-0.001
ried-0.001
Or-0.001
oky-0.001
Token.
Token ID13
Feature activation-7.20e-01

Pos logit contributions
 yourselves+0.101
ety+0.098
ried+0.094
ipt+0.090
Or+0.089
Neg logit contributions
 thank-0.049
 show-0.046
 performance-0.042
 story-0.039
 medal-0.038
Token They
Token ID1119
Feature activation+1.07e+00

Pos logit contributions
ried+0.026
rying+0.024
ety+0.024
 yourselves+0.024
aking+0.023
Neg logit contributions
 Everyone-0.015
 thank-0.015
 Everybody-0.014
 story-0.014
 -0.013
Token did
Token ID750
Feature activation+4.11e+00

Pos logit contributions
 thank+0.026
 performance+0.026
 show+0.024
 medal+0.023
 story+0.023
Neg logit contributions
 yourselves-0.033
ety-0.031
ried-0.030
Or-0.029
 hyd-0.028
Token not
Token ID407
Feature activation-1.11e+00

Pos logit contributions
 thank+0.135
 show+0.128
 performance+0.127
 medal+0.120
 story+0.118
Neg logit contributions
 yourselves-0.100
ety-0.088
ried-0.082
neys-0.079
Or-0.079
Token think
Token ID892
Feature activation+1.79e+00

Pos logit contributions
 yourselves+0.039
ety+0.037
ried+0.034
 late+0.034
 hyd+0.034
Neg logit contributions
 thank-0.022
 performance-0.021
 show-0.020
 medal-0.020
on-0.019
Token about
Token ID546
Feature activation-1.27e+00

Pos logit contributions
 thank+0.037
 performance+0.036
 show+0.035
 grin+0.033
 leap+0.033
Neg logit contributions
 yourselves-0.061
ety-0.054
 late-0.054
neys-0.053
Or-0.052
Token how
Token ID703
Feature activation-8.61e-02

Pos logit contributions
 yourselves+0.043
ety+0.041
ried+0.041
oky+0.038
aking+0.038
Neg logit contributions
 thank-0.029
 show-0.028
 performance-0.026
 story-0.024
 medal-0.024
Token paper
Token ID3348
Feature activation+3.46e+00

Pos logit contributions
 yourselves+0.003
ety+0.003
ried+0.003
Or+0.003
ipt+0.003
Neg logit contributions
 thank-0.002
 show-0.001
 performance-0.001
 story-0.001
 start-0.001
Token
Token ID129
Feature activation+1.75e+00

Pos logit contributions
 thank+0.063
 show+0.060
 performance+0.059
on+0.056
 hardly+0.056
Neg logit contributions
 yourselves-0.079
ety-0.071
ried-0.070
Or-0.066
oky-0.064
Token
Token ID241
Feature activation-4.85e+00

Pos logit contributions
 show+0.050
 thank+0.050
 performance+0.048
 hardly+0.046
 start+0.044
Neg logit contributions
 yourselves-0.047
ety-0.045
ried-0.041
Or-0.041
oky-0.040
TokenWhat
Token ID2061
Feature activation+1.37e+00

Pos logit contributions
 yourselves+0.112
ety+0.101
ried+0.096
neys+0.091
 hyd+0.087
Neg logit contributions
 thank-0.158
 show-0.152
 performance-0.147
 hardly-0.143
on-0.139
Tokenâ
Token ID22940
Feature activation-1.36e+00

Pos logit contributions
 thank+0.032
 performance+0.031
 show+0.030
 grin+0.028
ner+0.028
Neg logit contributions
 yourselves-0.044
neys-0.038
ety-0.038
 late-0.038
How-0.037
Token
Token ID26391
Feature activation+1.75e+00

Pos logit contributions
 yourselves+0.048
ety+0.045
ried+0.043
Or+0.042
neys+0.042
Neg logit contributions
 thank-0.030
 show-0.028
 performance-0.027
 story-0.024
on-0.024
Token
Token ID8151
Feature activation+4.37e+00

Pos logit contributions
 thank+0.044
 performance+0.042
 show+0.042
 story+0.041
on+0.040
Neg logit contributions
 yourselves-0.055
ety-0.050
ried-0.050
Or-0.047
 hyd-0.045
Tokens
Token ID82
Feature activation-6.62e-02

Pos logit contributions
 thank+0.116
 performance+0.110
 show+0.109
 medal+0.103
 grin+0.097
Neg logit contributions
 yourselves-0.130
ety-0.126
ried-0.115
 late-0.113
oky-0.112
Token going
Token ID1016
Feature activation-2.09e+00

Pos logit contributions
 yourselves+0.002
ety+0.002
ried+0.002
neys+0.002
 late+0.002
Neg logit contributions
 thank-0.002
 show-0.002
 performance-0.002
 medal-0.002
 grin-0.001
Token on
Token ID319
Feature activation+1.90e+00

Pos logit contributions
 yourselves+0.077
ety+0.071
ried+0.069
neys+0.067
 late+0.066
Neg logit contributions
 thank-0.042
 show-0.038
 performance-0.038
 medal-0.034
 grin-0.033
Token?
Token ID30
Feature activation+1.99e+00

Pos logit contributions
 thank+0.050
 show+0.047
 performance+0.046
 medal+0.043
 story+0.042
Neg logit contributions
 yourselves-0.058
ety-0.054
ried-0.051
neys-0.049
Or-0.049
Tokenâ
Token ID22940
Feature activation-4.95e-01

Pos logit contributions
 thank+0.046
 show+0.042
 performance+0.040
 Everybody+0.040
ner+0.040
Neg logit contributions
 yourselves-0.065
ety-0.061
ried-0.059
neys-0.057
Or-0.057
Token remembered
Token ID12086
Feature activation-1.21e+00

Pos logit contributions
 yourselves+0.093
ety+0.082
ried+0.079
neys+0.078
Or+0.076
Neg logit contributions
 thank-0.073
 show-0.068
 performance-0.066
 medal-0.061
 story-0.061
Token her
Token ID607
Feature activation-2.25e+00

Pos logit contributions
 yourselves+0.044
ety+0.040
ried+0.038
 late+0.037
Or+0.037
Neg logit contributions
 thank-0.025
 performance-0.024
 show-0.024
 medal-0.021
 grin-0.021
Token mom
Token ID1995
Feature activation+3.87e-02

Pos logit contributions
 yourselves+0.070
ety+0.066
ried+0.061
 late+0.060
Or+0.059
Neg logit contributions
 thank-0.055
 show-0.051
 performance-0.051
on-0.047
ner-0.047
Tokenmy
Token ID1820
Feature activation+6.24e-01

Pos logit contributions
 thank+0.001
 performance+0.001
 medal+0.001
 show+0.001
 leap+0.001
Neg logit contributions
ety-0.001
 yourselves-0.001
ried-0.001
 late-0.001
neys-0.001
Token's
Token ID338
Feature activation+1.81e+00

Pos logit contributions
 thank+0.015
 performance+0.014
 show+0.013
 medal+0.013
 leap+0.013
Neg logit contributions
 yourselves-0.020
ety-0.019
ried-0.018
 late-0.017
neys-0.017
Token words
Token ID2456
Feature activation+4.51e+00

Pos logit contributions
 thank+0.037
 show+0.034
 performance+0.034
ner+0.033
on+0.032
Neg logit contributions
 yourselves-0.061
ety-0.058
ried-0.054
Or-0.054
neys-0.054
Token.
Token ID13
Feature activation+4.29e-01

Pos logit contributions
 thank+0.111
 performance+0.109
 show+0.104
 leap+0.100
 medal+0.098
Neg logit contributions
 yourselves-0.135
ety-0.130
ried-0.120
 late-0.120
Or-0.118
Token She
Token ID1375
Feature activation-1.70e+00

Pos logit contributions
 thank+0.009
 Everyone+0.008
 show+0.008
 replied+0.007
 Everybody+0.007
Neg logit contributions
 yourselves-0.016
ried-0.015
 hyd-0.014
neys-0.014
ety-0.014
Token said
Token ID531
Feature activation+2.40e+00

Pos logit contributions
 yourselves+0.058
ety+0.054
ried+0.052
neys+0.050
Or+0.050
Neg logit contributions
 thank-0.038
 show-0.035
 performance-0.035
 medal-0.032
 story-0.031
Token,
Token ID11
Feature activation+2.65e-01

Pos logit contributions
 show+0.034
 performance+0.034
 thank+0.033
 medal+0.030
 leap+0.029
Neg logit contributions
 yourselves-0.097
ety-0.090
ried-0.086
Or-0.085
 late-0.085
Token "
Token ID366
Feature activation-4.67e+00

Pos logit contributions
 thank+0.006
 show+0.006
 performance+0.005
 story+0.005
 medal+0.005
Neg logit contributions
 yourselves-0.009
ety-0.009
ried-0.008
neys-0.008
oky-0.008
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
 thank+0.008
 performance+0.007
 show+0.007
 medal+0.007
 leap+0.006
Neg logit contributions
 yourselves-0.008
ety-0.007
ried-0.007
 late-0.007
Or-0.007
TokenThe
Token ID464
Feature activation+4.46e-01

Pos logit contributions
 yourselves+0.004
ety+0.004
ried+0.004
neys+0.004
oky+0.004
Neg logit contributions
 thank-0.004
 show-0.004
 performance-0.004
 hardly-0.004
on-0.004
Token girl
Token ID2576
Feature activation+7.12e-01

Pos logit contributions
 thank+0.012
 performance+0.011
 show+0.011
 medal+0.010
on+0.010
Neg logit contributions
 yourselves-0.013
ety-0.013
ried-0.012
 late-0.012
Or-0.011
Token and
Token ID290
Feature activation+1.82e+00

Pos logit contributions
 thank+0.015
 show+0.014
 performance+0.013
on+0.012
 replied+0.012
Neg logit contributions
 yourselves-0.027
ety-0.024
ried-0.023
oky-0.023
neys-0.022
Token her
Token ID607
Feature activation+6.54e-01

Pos logit contributions
 thank+0.051
 performance+0.048
 medal+0.046
 contain+0.046
odic+0.046
Neg logit contributions
 yourselves-0.048
ety-0.048
 late-0.043
ried-0.041
How-0.041
Token brother
Token ID3956
Feature activation+4.15e+00

Pos logit contributions
 thank+0.020
 performance+0.018
 show+0.018
 medal+0.017
on+0.017
Neg logit contributions
 yourselves-0.016
ety-0.016
ried-0.014
neys-0.014
 late-0.014
Token were
Token ID547
Feature activation+2.05e+00

Pos logit contributions
 thank+0.086
 performance+0.086
 show+0.080
 medal+0.080
 story+0.074
Neg logit contributions
ety-0.140
 yourselves-0.135
ried-0.131
Or-0.124
neys-0.124
Token so
Token ID523
Feature activation+2.53e-01

Pos logit contributions
 thank+0.045
 show+0.042
 performance+0.041
 medal+0.038
 story+0.037
Neg logit contributions
 yourselves-0.072
ety-0.066
ried-0.063
neys-0.062
Or-0.061
Token excited
Token ID6568
Feature activation-1.15e+00

Pos logit contributions
 thank+0.006
 show+0.005
 performance+0.005
 hardly+0.004
 g+0.004
Neg logit contributions
 yourselves-0.009
ety-0.009
ried-0.008
neys-0.008
How-0.008
Token and
Token ID290
Feature activation+2.80e-01

Pos logit contributions
 yourselves+0.037
ety+0.035
 late+0.032
ried+0.032
oky+0.031
Neg logit contributions
 thank-0.029
 performance-0.026
 show-0.026
 medal-0.025
 grin-0.025
Token started
Token ID2067
Feature activation-1.62e+00

Pos logit contributions
 thank+0.007
 performance+0.007
 story+0.006
 medal+0.006
 show+0.006
Neg logit contributions
 yourselves-0.009
ety-0.008
ried-0.008
Or-0.007
 late-0.007
Token fun
Token ID1257
Feature activation-2.00e+00

Pos logit contributions
 yourselves+0.128
ety+0.118
oky+0.113
ried+0.113
ipt+0.112
Neg logit contributions
 thank-0.076
 show-0.071
 performance-0.063
 story-0.061
 medal-0.061
Token to
Token ID284
Feature activation+6.41e-01

Pos logit contributions
 yourselves+0.065
ety+0.061
ried+0.058
neys+0.056
 late+0.056
Neg logit contributions
 thank-0.049
 performance-0.044
 show-0.044
 grin-0.041
 medal-0.041
Token climb
Token ID12080
Feature activation-7.37e-01

Pos logit contributions
 thank+0.014
 performance+0.014
odic+0.014
on+0.013
 story+0.013
Neg logit contributions
 yourselves-0.020
ried-0.018
ety-0.018
Or-0.017
neys-0.016
Token the
Token ID262
Feature activation-2.50e-01

Pos logit contributions
 yourselves+0.028
ety+0.026
ried+0.025
neys+0.024
Or+0.024
Neg logit contributions
 thank-0.014
 show-0.013
 performance-0.013
 medal-0.011
 story-0.011
Token hill
Token ID12788
Feature activation-1.60e+00

Pos logit contributions
 yourselves+0.007
ety+0.006
neys+0.006
 late+0.006
ried+0.006
Neg logit contributions
 thank-0.007
 performance-0.007
 show-0.006
 grin-0.006
on-0.006
Token,
Token ID11
Feature activation+1.03e+00

Pos logit contributions
 yourselves+0.060
ety+0.056
ried+0.053
neys+0.052
Or+0.052
Neg logit contributions
 thank-0.031
 show-0.028
 performance-0.028
 medal-0.025
 story-0.024
Token so
Token ID523
Feature activation-9.56e-01

Pos logit contributions
 thank+0.024
 performance+0.022
 show+0.022
 medal+0.020
 story+0.020
Neg logit contributions
 yourselves-0.034
ety-0.032
ried-0.030
Or-0.029
neys-0.029
Token he
Token ID339
Feature activation-2.41e+00

Pos logit contributions
 yourselves+0.040
ety+0.038
ried+0.036
 late+0.036
Or+0.035
Neg logit contributions
 thank-0.014
 performance-0.013
 show-0.013
 grin-0.011
 story-0.011
Token did
Token ID750
Feature activation+4.42e+00

Pos logit contributions
 yourselves+0.086
ety+0.077
oky+0.073
neys+0.073
ried+0.072
Neg logit contributions
 thank-0.054
 show-0.051
 performance-0.047
 medal-0.045
 leap-0.043
Token.
Token ID13
Feature activation+6.30e-01

Pos logit contributions
 thank+0.107
 performance+0.101
 show+0.098
 grin+0.091
 story+0.091
Neg logit contributions
 yourselves-0.144
ety-0.128
ried-0.125
Or-0.121
neys-0.120
Token
Token ID198
Feature activation+2.54e-01

Pos logit contributions
 thank+0.013
 show+0.012
on+0.011
 story+0.011
 replied+0.010
Neg logit contributions
 yourselves-0.024
ried-0.021
ety-0.021
Or-0.021
 hyd-0.021
Token teacher
Token ID4701
Feature activation-2.24e-01

Pos logit contributions
 yourselves+0.003
ety+0.003
Or+0.002
ried+0.002
How+0.002
Neg logit contributions
 thank-0.002
 performance-0.001
 show-0.001
 leap-0.001
 grin-0.001
Token and
Token ID290
Feature activation-3.17e-01

Pos logit contributions
 yourselves+0.007
ety+0.007
ried+0.006
neys+0.006
Or+0.006
Neg logit contributions
 thank-0.006
 performance-0.005
 show-0.005
 medal-0.005
 hardly-0.005
Token told
Token ID1297
Feature activation+2.40e+00

Pos logit contributions
 yourselves+0.011
ety+0.010
ried+0.010
Or+0.010
 late+0.009
Neg logit contributions
 thank-0.007
 performance-0.006
 show-0.006
 story-0.006
 medal-0.006
Token her
Token ID607
Feature activation-3.79e+00

Pos logit contributions
odic+0.057
 performance+0.056
iest+0.054
 medal+0.053
 hardly+0.053
Neg logit contributions
 yourselves-0.074
ety-0.060
 late-0.059
 wounds-0.058
 wasteful-0.057
Token what
Token ID644
Feature activation+1.69e+00

Pos logit contributions
 yourselves+0.092
ety+0.083
neys+0.081
 late+0.078
ried+0.076
Neg logit contributions
odic-0.107
 performance-0.105
 leap-0.105
robat-0.102
 contain-0.100
Token Tom
Token ID4186
Feature activation+1.25e-02

Pos logit contributions
 thank+0.040
 performance+0.039
 leap+0.038
 show+0.037
 grin+0.036
Neg logit contributions
 yourselves-0.052
ety-0.049
 late-0.047
neys-0.046
Or-0.046
Token did
Token ID750
Feature activation+3.78e+00

Pos logit contributions
 thank+0.000
 show+0.000
 performance+0.000
 medal+0.000
 story+0.000
Neg logit contributions
 yourselves-0.000
ety-0.000
ried-0.000
neys-0.000
Or-0.000
Token.
Token ID13
Feature activation-2.97e-01

Pos logit contributions
 thank+0.087
 performance+0.081
 show+0.077
 leap+0.076
 medal+0.074
Neg logit contributions
 yourselves-0.129
ety-0.114
 late-0.110
ried-0.109
Or-0.108
Token The
Token ID383
Feature activation+1.07e+00

Pos logit contributions
 yourselves+0.010
ety+0.009
ried+0.009
 hyd+0.008
neys+0.008
Neg logit contributions
 thank-0.008
 show-0.007
 performance-0.007
 story-0.006
 grin-0.006
Token teacher
Token ID4701
Feature activation+1.36e+00

Pos logit contributions
 thank+0.021
 leap+0.021
odic+0.020
 performance+0.020
hing+0.020
Neg logit contributions
 yourselves-0.036
ety-0.032
oky-0.031
 late-0.031
Or-0.030
Token was
Token ID373
Feature activation+6.17e-01

Pos logit contributions
 thank+0.028
 performance+0.026
 show+0.025
 leap+0.024
 medal+0.024
Neg logit contributions
 yourselves-0.046
ety-0.044
ried-0.042
Or-0.042
neys-0.041
Token.
Token ID13
Feature activation+9.71e-01

Pos logit contributions
 thank+0.012
 show+0.011
 performance+0.011
 medal+0.010
 story+0.010
Neg logit contributions
 yourselves-0.020
ety-0.019
ried-0.018
 late-0.017
Or-0.017
Token<|endoftext|>
Token ID50256
Feature activation-2.21e+00

Pos logit contributions
 thank+0.022
 show+0.020
 performance+0.019
on+0.018
 story+0.018
Neg logit contributions
 yourselves-0.034
ety-0.032
ried-0.031
Or-0.029
 hyd-0.029
TokenOnce
Token ID7454
Feature activation+1.69e+00

Pos logit contributions
 yourselves+0.049
ety+0.044
ried+0.040
neys+0.039
 late+0.038
Neg logit contributions
 thank-0.077
 show-0.072
 performance-0.072
 medal-0.068
 grin-0.067
Token upon
Token ID2402
Feature activation-2.77e+00

Pos logit contributions
 thank+0.058
 performance+0.056
 show+0.055
 medal+0.053
 leap+0.052
Neg logit contributions
 yourselves-0.035
ety-0.035
ried-0.030
 late-0.029
neys-0.028
Token a
Token ID257
Feature activation+3.02e+00

Pos logit contributions
 yourselves+0.078
 late+0.069
ety+0.067
How+0.065
Or+0.061
Neg logit contributions
 thank-0.079
 performance-0.076
 show-0.073
 grin-0.069
 leap-0.068
Token time
Token ID640
Feature activation+2.44e-01

Pos logit contributions
 thank+0.034
 show+0.030
 performance+0.027
 story+0.026
 grin+0.021
Neg logit contributions
 yourselves-0.143
ried-0.131
ety-0.130
neys-0.126
ipt-0.123
Token,
Token ID11
Feature activation+1.99e+00

Pos logit contributions
 thank+0.003
 show+0.002
 performance+0.002
 story+0.002
 medal+0.002
Neg logit contributions
 yourselves-0.011
ety-0.011
ried-0.010
neys-0.010
Or-0.010
Token in
Token ID287
Feature activation+5.94e-01

Pos logit contributions
 thank+0.053
 show+0.050
on+0.049
 story+0.047
 performance+0.046
Neg logit contributions
 yourselves-0.060
neys-0.053
ety-0.053
ried-0.051
oky-0.050
Token a
Token ID257
Feature activation+3.18e+00

Pos logit contributions
 thank+0.014
 show+0.013
 performance+0.013
 medal+0.011
 grin+0.011
Neg logit contributions
 yourselves-0.020
ety-0.018
ried-0.017
 late-0.017
Or-0.017
Token magical
Token ID10883
Feature activation+1.41e+00

Pos logit contributions
 show+0.068
 thank+0.065
 story+0.063
 performance+0.061
 g+0.057
Neg logit contributions
 yourselves-0.115
ried-0.113
ety-0.107
neys-0.107
oky-0.097
Token land
Token ID1956
Feature activation-1.53e+00

Pos logit contributions
 thank+0.035
 show+0.033
 performance+0.033
 medal+0.030
 story+0.030
Neg logit contributions
 yourselves-0.045
ety-0.042
ried-0.040
neys-0.038
Or-0.038
Token garden
Token ID11376
Feature activation+1.49e-01

Pos logit contributions
 thank+0.019
 show+0.018
 performance+0.017
 medal+0.016
 story+0.016
Neg logit contributions
 yourselves-0.024
ety-0.022
ried-0.021
neys-0.020
Or-0.020
Token when
Token ID618
Feature activation+5.97e-02

Pos logit contributions
 thank+0.004
 show+0.004
 performance+0.004
 medal+0.003
 story+0.003
Neg logit contributions
 yourselves-0.005
ety-0.004
ried-0.004
neys-0.004
Or-0.004
Token she
Token ID673
Feature activation-2.15e+00

Pos logit contributions
 thank+0.001
 show+0.001
 performance+0.001
 medal+0.001
 story+0.001
Neg logit contributions
 yourselves-0.002
ety-0.002
 late-0.002
ried-0.002
Or-0.002
Token heard
Token ID2982
Feature activation+2.85e+00

Pos logit contributions
 yourselves+0.069
ety+0.063
ried+0.060
neys+0.058
Or+0.057
Neg logit contributions
 thank-0.054
 show-0.051
 performance-0.050
 medal-0.046
on-0.045
Token a
Token ID257
Feature activation+1.67e+00

Pos logit contributions
 thank+0.079
 performance+0.076
 show+0.075
 medal+0.073
 grin+0.073
Neg logit contributions
 yourselves-0.079
 late-0.066
ety-0.061
How-0.059
neys-0.059
Token sound
Token ID2128
Feature activation+1.72e+00

Pos logit contributions
 thank+0.040
on+0.040
ner+0.039
 ease+0.039
 performance+0.038
Neg logit contributions
 yourselves-0.045
Or-0.040
How-0.040
 hyd-0.040
 late-0.039
Token.
Token ID13
Feature activation+6.89e-01

Pos logit contributions
 thank+0.048
 medal+0.046
 performance+0.045
 prize+0.045
 ease+0.044
Neg logit contributions
 late-0.040
 yourselves-0.039
 hyd-0.037
How-0.036
oky-0.036
Token She
Token ID1375
Feature activation-1.73e+00

Pos logit contributions
 thank+0.017
 show+0.016
 performance+0.015
 story+0.014
 grin+0.014
Neg logit contributions
 yourselves-0.022
ety-0.021
ried-0.020
neys-0.019
Or-0.019
Token looked
Token ID3114
Feature activation+1.34e+00

Pos logit contributions
 yourselves+0.060
ety+0.056
ried+0.053
neys+0.052
Or+0.051
Neg logit contributions
 thank-0.038
 show-0.035
 performance-0.035
 medal-0.031
 story-0.031
Token around
Token ID1088
Feature activation-4.43e-01

Pos logit contributions
 thank+0.038
 performance+0.035
 show+0.034
on+0.033
 medal+0.033
Neg logit contributions
 yourselves-0.037
ety-0.034
ried-0.033
 late-0.032
oky-0.031
Token and
Token ID290
Feature activation+9.18e-01

Pos logit contributions
 yourselves+0.014
ety+0.012
ried+0.012
 late+0.012
Or+0.012
Neg logit contributions
 thank-0.011
 performance-0.010
 show-0.010
 medal-0.010
on-0.009
Tokenled
Token ID992
Feature activation+1.86e-02

Pos logit contributions
 thank+0.042
 show+0.040
 performance+0.040
 grin+0.037
 medal+0.037
Neg logit contributions
 yourselves-0.042
ety-0.037
neys-0.036
Or-0.035
 hyd-0.034
Token and
Token ID290
Feature activation-1.02e-01

Pos logit contributions
 thank+0.000
 show+0.000
 performance+0.000
 medal+0.000
 grin+0.000
Neg logit contributions
 yourselves-0.001
ety-0.001
ried-0.001
neys-0.001
oky-0.001
Token said
Token ID531
Feature activation+2.69e+00

Pos logit contributions
 yourselves+0.004
ety+0.003
ried+0.003
Or+0.003
 late+0.003
Neg logit contributions
 thank-0.002
 performance-0.002
 show-0.002
 story-0.002
 medal-0.002
Token,
Token ID11
Feature activation+2.95e-01

Pos logit contributions
 show+0.039
 performance+0.039
 thank+0.037
 medal+0.034
 story+0.033
Neg logit contributions
 yourselves-0.106
ried-0.098
ety-0.096
Or-0.095
 late-0.092
Token "
Token ID366
Feature activation-4.72e+00

Pos logit contributions
 thank+0.007
 show+0.007
 performance+0.007
 medal+0.006
 story+0.006
Neg logit contributions
 yourselves-0.010
ety-0.009
ried-0.008
Or-0.008
neys-0.008
TokenLet
Token ID5756
Feature activation+2.97e+00

Pos logit contributions
 yourselves+0.122
ety+0.106
neys+0.102
ried+0.102
 hyd+0.098
Neg logit contributions
 thank-0.148
 show-0.140
 performance-0.137
 medal-0.129
on-0.128
Token's
Token ID338
Feature activation-1.72e-02

Pos logit contributions
 thank+0.073
 performance+0.070
 medal+0.067
 show+0.066
 grin+0.065
Neg logit contributions
 yourselves-0.093
ety-0.086
ried-0.079
How-0.077
 late-0.076
Token hug
Token ID16225
Feature activation+3.37e-01

Pos logit contributions
 yourselves+0.001
ety+0.001
 hyd+0.001
ried+0.001
 yourself+0.000
Neg logit contributions
odic-0.000
ner-0.000
 thank-0.000
on-0.000
dream-0.000
Token".
Token ID1911
Feature activation+6.69e-01

Pos logit contributions
 thank+0.009
 show+0.008
 performance+0.008
 medal+0.007
 story+0.007
Neg logit contributions
 yourselves-0.011
ety-0.010
ried-0.009
neys-0.009
Or-0.009
Token Jane
Token ID12091
Feature activation+1.08e+00

Pos logit contributions
 thank+0.015
 show+0.013
 performance+0.012
 Everybody+0.012
 grin+0.012
Neg logit contributions
 yourselves-0.024
ety-0.023
ried-0.021
 hyd-0.021
neys-0.021
Token laughed
Token ID13818
Feature activation+2.33e+00

Pos logit contributions
 thank+0.023
 performance+0.021
 show+0.021
 medal+0.019
 story+0.019
Neg logit contributions
 yourselves-0.036
ety-0.035
ried-0.034
Or-0.033
 hyd-0.032
Token there
Token ID612
Feature activation+1.15e+00

Pos logit contributions
 thank+0.050
 show+0.046
on+0.046
 story+0.044
 performance+0.042
Neg logit contributions
 yourselves-0.061
neys-0.054
ety-0.052
ried-0.051
 hyd-0.051
Token was
Token ID373
Feature activation-1.26e+00

Pos logit contributions
 thank+0.027
 performance+0.025
 show+0.025
 medal+0.023
 grin+0.022
Neg logit contributions
 yourselves-0.038
ety-0.035
ried-0.033
Or-0.032
neys-0.032
Token a
Token ID257
Feature activation+2.93e-01

Pos logit contributions
 yourselves+0.043
ety+0.040
ried+0.038
 late+0.037
Or+0.037
Neg logit contributions
 thank-0.029
 show-0.027
 performance-0.026
 medal-0.024
 story-0.023
Token boy
Token ID2933
Feature activation-5.96e-01

Pos logit contributions
 thank+0.006
 show+0.006
 performance+0.006
 medal+0.005
 story+0.005
Neg logit contributions
 yourselves-0.010
ety-0.009
ried-0.009
Or-0.009
neys-0.009
Token who
Token ID508
Feature activation-1.17e+00

Pos logit contributions
 yourselves+0.020
ety+0.019
ried+0.018
neys+0.017
Or+0.017
Neg logit contributions
 thank-0.014
 show-0.013
 performance-0.013
 medal-0.011
 story-0.011
Token loved
Token ID6151
Feature activation-1.78e+00

Pos logit contributions
 yourselves+0.041
ety+0.037
ried+0.036
neys+0.034
oky+0.034
Neg logit contributions
 thank-0.027
 show-0.025
 performance-0.024
 story-0.022
 medal-0.022
Token carrots
Token ID34397
Feature activation+2.20e+00

Pos logit contributions
 yourselves+0.059
ety+0.054
ried+0.051
neys+0.050
 late+0.050
Neg logit contributions
 thank-0.042
 show-0.039
 performance-0.038
 grin-0.035
 medal-0.035
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
 thank+0.053
 performance+0.050
 show+0.047
 grin+0.047
 leap+0.045
Neg logit contributions
 yourselves-0.070
ety-0.066
 hyd-0.062
neys-0.062
ried-0.061
Token He
Token ID679
Feature activation-1.58e+00

Pos logit contributions
 thank+0.033
 show+0.030
 performance+0.027
 story+0.026
 ride+0.026
Neg logit contributions
 yourselves-0.049
ety-0.044
ried-0.044
har-0.041
Or-0.041
Token would
Token ID561
Feature activation+1.97e+00

Pos logit contributions
 yourselves+0.062
ety+0.056
ried+0.054
oky+0.054
neys+0.053
Neg logit contributions
 thank-0.030
 show-0.028
 performance-0.025
on-0.024
 story-0.024
Token wash
Token ID13502
Feature activation+2.09e+00

Pos logit contributions
 performance+0.035
 thank+0.034
odic+0.033
ner+0.033
 medal+0.031
Neg logit contributions
 yourselves-0.069
ety-0.066
Or-0.065
 hyd-0.063
ried-0.063
Token was
Token ID373
Feature activation+5.11e-01

Pos logit contributions
 yourselves+0.049
ety+0.044
ried+0.043
neys+0.042
oky+0.041
Neg logit contributions
 thank-0.030
 show-0.028
 performance-0.026
 story-0.025
 medal-0.024
Token very
Token ID845
Feature activation-2.08e+00

Pos logit contributions
 thank+0.011
 show+0.011
 performance+0.010
 story+0.009
 medal+0.009
Neg logit contributions
 yourselves-0.018
ety-0.017
ried-0.016
neys-0.015
Or-0.015
Token eager
Token ID11069
Feature activation-1.81e+00

Pos logit contributions
 yourselves+0.061
ety+0.055
ried+0.054
neys+0.052
Or+0.052
Neg logit contributions
 thank-0.061
 show-0.059
 performance-0.055
 story-0.051
 g-0.050
Token to
Token ID284
Feature activation+7.36e-01

Pos logit contributions
 yourselves+0.049
ety+0.046
 late+0.043
oky+0.041
 hyd+0.041
Neg logit contributions
 thank-0.047
odic-0.047
ner-0.046
on-0.045
 performance-0.045
Token learn
Token ID2193
Feature activation+8.81e-01

Pos logit contributions
robat+0.015
odic+0.015
ner+0.015
iest+0.015
rob+0.014
Neg logit contributions
 yourselves-0.023
 hyd-0.018
 yourself-0.018
How-0.018
 late-0.018
Token about
Token ID546
Feature activation-9.99e-01

Pos logit contributions
 thank+0.021
 performance+0.020
 grin+0.020
 leap+0.019
 hardly+0.019
Neg logit contributions
 yourselves-0.029
 late-0.024
neys-0.024
ety-0.024
How-0.023
Token the
Token ID262
Feature activation-1.01e-01

Pos logit contributions
 yourselves+0.033
ety+0.028
neys+0.026
How+0.026
 late+0.026
Neg logit contributions
 thank-0.024
 performance-0.022
 grin-0.021
 show-0.021
 medal-0.021
Token world
Token ID995
Feature activation-1.61e+00

Pos logit contributions
 yourselves+0.003
ety+0.003
 late+0.003
neys+0.003
oky+0.003
Neg logit contributions
 thank-0.002
 grin-0.002
on-0.002
 performance-0.002
 hardly-0.002
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 yourselves+0.053
ety+0.050
ried+0.048
neys+0.048
 late+0.047
Neg logit contributions
 thank-0.036
 performance-0.033
 show-0.032
 grin-0.031
 medal-0.030
Token One
Token ID1881
Feature activation-3.72e+00

Pos logit contributions
 thank+0.026
 show+0.024
 performance+0.023
 story+0.021
 grin+0.021
Neg logit contributions
 yourselves-0.033
ried-0.030
ety-0.030
 hyd-0.028
neys-0.028
Token day
Token ID1110
Feature activation-6.12e-01

Pos logit contributions
 yourselves+0.135
ety+0.132
 late+0.124
oky+0.122
ried+0.122
Neg logit contributions
 thank-0.070
 performance-0.062
 show-0.062
 medal-0.057
ner-0.056
Token fun
Token ID1257
Feature activation+7.17e-02

Pos logit contributions
 thank+0.019
 performance+0.017
on+0.017
 show+0.017
ner+0.017
Neg logit contributions
 yourselves-0.027
ety-0.025
Or-0.024
 late-0.024
 hyd-0.024
Token day
Token ID1110
Feature activation-1.12e+00

Pos logit contributions
 thank+0.002
ner+0.002
 Everybody+0.001
 contain+0.001
odic+0.001
Neg logit contributions
 late-0.002
 yourselves-0.002
 hyd-0.002
ety-0.002
Or-0.002
Token!
Token ID0
Feature activation+5.28e-01

Pos logit contributions
 yourselves+0.039
ety+0.036
 late+0.035
 hyd+0.034
oky+0.034
Neg logit contributions
 thank-0.024
 medal-0.021
 performance-0.021
 show-0.020
 story-0.020
Token<|endoftext|>
Token ID50256
Feature activation-2.37e+00

Pos logit contributions
 thank+0.014
 show+0.013
 performance+0.013
 medal+0.012
 story+0.012
Neg logit contributions
 yourselves-0.016
ety-0.015
ried-0.014
neys-0.013
 late-0.013
TokenOnce
Token ID7454
Feature activation+1.23e+00

Pos logit contributions
 yourselves+0.071
ety+0.065
ried+0.061
neys+0.059
 late+0.059
Neg logit contributions
 thank-0.064
 show-0.060
 performance-0.060
 medal-0.056
 leap-0.054
Token upon
Token ID2402
Feature activation-2.92e+00

Pos logit contributions
 thank+0.040
 performance+0.039
 show+0.039
 medal+0.037
 leap+0.037
Neg logit contributions
 yourselves-0.027
ety-0.026
 late-0.023
ried-0.023
oky-0.021
Token a
Token ID257
Feature activation+3.05e+00

Pos logit contributions
 yourselves+0.092
ety+0.083
 late+0.080
How+0.077
ried+0.075
Neg logit contributions
 thank-0.075
 performance-0.070
 show-0.068
 grin-0.063
 medal-0.063
Token time
Token ID640
Feature activation+2.36e-01

Pos logit contributions
 thank+0.035
 show+0.030
 performance+0.027
 story+0.026
 grin+0.021
Neg logit contributions
 yourselves-0.144
ried-0.133
ety-0.130
neys-0.128
 hyd-0.125
Token,
Token ID11
Feature activation+1.52e+00

Pos logit contributions
 thank+0.002
 show+0.002
 performance+0.002
 story+0.001
on+0.001
Neg logit contributions
 yourselves-0.011
ety-0.011
ried-0.010
neys-0.010
Or-0.010
Token there
Token ID612
Feature activation+1.02e+00

Pos logit contributions
 thank+0.037
 show+0.035
on+0.033
 story+0.032
 replied+0.030
Neg logit contributions
 yourselves-0.050
neys-0.046
ety-0.043
 hyd-0.042
har-0.042
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 thank+0.022
 show+0.021
 performance+0.020
 medal+0.019
 grin+0.018
Neg logit contributions
 yourselves-0.036
ety-0.033
ried-0.031
neys-0.031
Or-0.031
Token to
Token ID284
Feature activation+6.43e-01

Pos logit contributions
 thank+0.035
 performance+0.034
 show+0.032
 grin+0.031
 medal+0.031
Neg logit contributions
 yourselves-0.051
ety-0.046
ried-0.044
neys-0.044
Or-0.043
Token climb
Token ID12080
Feature activation-7.18e-01

Pos logit contributions
 thank+0.012
 performance+0.012
 grin+0.011
 goodbye+0.011
ner+0.011
Neg logit contributions
 yourselves-0.023
ety-0.021
 hyd-0.019
ried-0.019
neys-0.019
Token the
Token ID262
Feature activation-4.74e-01

Pos logit contributions
 yourselves+0.027
ety+0.026
ried+0.025
neys+0.025
Or+0.024
Neg logit contributions
 thank-0.013
 show-0.012
 performance-0.011
 medal-0.010
on-0.010
Token hill
Token ID12788
Feature activation-1.11e+00

Pos logit contributions
 yourselves+0.016
ety+0.014
ried+0.013
neys+0.013
Or+0.013
Neg logit contributions
 thank-0.011
 show-0.010
 performance-0.010
 grin-0.009
 hardly-0.009
Token.
Token ID13
Feature activation-9.55e-01

Pos logit contributions
 yourselves+0.041
ety+0.038
ried+0.036
Or+0.035
neys+0.035
Neg logit contributions
 thank-0.022
 show-0.021
 performance-0.019
 medal-0.018
on-0.017
Token It
Token ID632
Feature activation-8.40e-01

Pos logit contributions
 yourselves+0.031
ried+0.029
ety+0.029
neys+0.027
 hyd+0.026
Neg logit contributions
 thank-0.024
 show-0.021
 Everybody-0.020
 Everyone-0.020
on-0.019
Token's
Token ID338
Feature activation-9.55e-01

Pos logit contributions
 yourselves+0.030
ety+0.027
ried+0.025
oky+0.025
 late+0.025
Neg logit contributions
 thank-0.019
 show-0.018
 performance-0.016
 leap-0.016
 story-0.016
Token too
Token ID1165
Feature activation-2.48e+00

Pos logit contributions
 yourselves+0.032
ety+0.030
ipt+0.030
oky+0.029
ucks+0.029
Neg logit contributions
 thank-0.023
 show-0.021
 start-0.019
 story-0.018
 performance-0.018
Token deep
Token ID2769
Feature activation-1.65e+00

Pos logit contributions
 yourselves+0.084
ried+0.077
ety+0.075
Or+0.073
ipt+0.073
Neg logit contributions
 thank-0.059
 show-0.055
 story-0.051
 performance-0.048
 leap-0.048
Token and
Token ID290
Feature activation-1.43e+00

Pos logit contributions
 yourselves+0.064
ety+0.062
neys+0.059
Or+0.057
ipt+0.057
Neg logit contributions
 thank-0.028
 show-0.026
 performance-0.023
 goodbye-0.021
 medal-0.021
Token you
Token ID345
Feature activation+1.62e+00

Pos logit contributions
 yourselves+0.045
ety+0.043
ipt+0.040
ried+0.039
oky+0.039
Neg logit contributions
 thank-0.038
 show-0.036
 leap-0.032
 performance-0.031
 start-0.031
Token ducks
Token ID39694
Feature activation+1.47e+00

Pos logit contributions
 thank+0.016
 show+0.015
 performance+0.014
 story+0.013
 medal+0.013
Neg logit contributions
 yourselves-0.027
ety-0.026
ried-0.025
neys-0.025
Or-0.025
Token was
Token ID373
Feature activation+9.64e-01

Pos logit contributions
 thank+0.031
 performance+0.029
 show+0.028
 medal+0.026
 story+0.025
Neg logit contributions
 yourselves-0.052
ety-0.049
ried-0.047
neys-0.045
Or-0.044
Token curious
Token ID11040
Feature activation+1.31e-01

Pos logit contributions
 thank+0.023
 show+0.022
 performance+0.021
 story+0.019
 medal+0.019
Neg logit contributions
 yourselves-0.032
ety-0.030
ried-0.029
neys-0.028
Or-0.028
Token about
Token ID546
Feature activation-1.02e+00

Pos logit contributions
 thank+0.003
 show+0.003
 performance+0.003
 medal+0.003
 hardly+0.003
Neg logit contributions
 yourselves-0.004
ety-0.004
ried-0.004
 late-0.004
neys-0.004
Token the
Token ID262
Feature activation+9.70e-02

Pos logit contributions
 yourselves+0.038
ety+0.036
ried+0.035
neys+0.034
oky+0.033
Neg logit contributions
 thank-0.021
 show-0.019
 performance-0.018
 story-0.016
 medal-0.016
Token new
Token ID649
Feature activation-1.73e+00

Pos logit contributions
 thank+0.003
 show+0.003
 performance+0.002
 story+0.002
 medal+0.002
Neg logit contributions
 yourselves-0.003
ety-0.003
ried-0.003
neys-0.003
Or-0.003
Token duck
Token ID22045
Feature activation-1.35e+00

Pos logit contributions
 yourselves+0.053
ety+0.048
ried+0.046
neys+0.045
Or+0.044
Neg logit contributions
 thank-0.046
 show-0.045
 performance-0.043
 story-0.040
 medal-0.040
Token.
Token ID13
Feature activation+6.08e-01

Pos logit contributions
 yourselves+0.046
ety+0.044
ried+0.041
 hyd+0.040
neys+0.040
Neg logit contributions
 thank-0.030
 performance-0.027
 show-0.027
 medal-0.025
 grin-0.025
Token "
Token ID366
Feature activation-4.29e+00

Pos logit contributions
 thank+0.016
 show+0.015
 Everyone+0.014
 Everybody+0.014
 replied+0.014
Neg logit contributions
 yourselves-0.019
ety-0.018
har-0.017
ried-0.017
aking-0.016
TokenHello
Token ID15496
Feature activation-1.14e+00

Pos logit contributions
 yourselves+0.126
aj+0.117
neys+0.114
ried+0.108
 wasteful+0.105
Neg logit contributions
 show-0.105
 jugg-0.103
 thank-0.103
 ride-0.103
mine-0.101
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
 yourselves+0.053
ried+0.049
Or+0.049
ety+0.049
neys+0.048
Neg logit contributions
 thank-0.014
 show-0.011
 performance-0.009
 story-0.009
 medal-0.008
Token,
Token ID11
Feature activation+5.12e-01

Pos logit contributions
 performance+0.055
 show+0.053
 thank+0.046
odic+0.046
 contain+0.045
Neg logit contributions
 yourselves-0.103
ety-0.093
ried-0.093
Or-0.091
 late-0.089
Token
Token ID6184
Feature activation-9.31e-02

Pos logit contributions
 thank+0.013
 performance+0.012
 show+0.012
 medal+0.011
 story+0.011
Neg logit contributions
 yourselves-0.016
ety-0.014
ried-0.014
Or-0.013
 late-0.013
Token
Token ID95
Feature activation-9.50e-01

Pos logit contributions
 yourselves+0.003
ety+0.003
ried+0.003
neys+0.002
 late+0.002
Neg logit contributions
 thank-0.003
 show-0.002
 performance-0.002
 medal-0.002
 leap-0.002
Token
Token ID26391
Feature activation+1.39e+00

Pos logit contributions
 yourselves+0.029
ety+0.028
ried+0.027
Or+0.025
oky+0.025
Neg logit contributions
 thank-0.024
 show-0.024
 performance-0.022
on-0.021
 ride-0.021
Token
Token ID129
Feature activation+1.28e+00

Pos logit contributions
 thank+0.038
 show+0.036
 performance+0.034
on+0.033
 story+0.033
Neg logit contributions
 yourselves-0.041
ety-0.037
ried-0.036
Or-0.034
oky-0.033
Token
Token ID241
Feature activation-5.43e+00

Pos logit contributions
 thank+0.036
 show+0.034
 performance+0.033
 hardly+0.031
 medal+0.031
Neg logit contributions
 yourselves-0.037
ety-0.034
ried-0.032
neys-0.031
Or-0.031
TokenLook
Token ID8567
Feature activation+1.77e+00

Pos logit contributions
 yourselves+0.122
ety+0.109
ried+0.100
Or+0.096
neys+0.095
Neg logit contributions
 thank-0.187
 show-0.177
 performance-0.176
 medal-0.165
 story-0.164
Token at
Token ID379
Feature activation-1.86e+00

Pos logit contributions
 thank+0.043
 performance+0.042
 show+0.040
 grin+0.038
 story+0.038
Neg logit contributions
 yourselves-0.056
ety-0.052
ried-0.047
 hyd-0.045
 late-0.045
Token that
Token ID326
Feature activation+2.65e-02

Pos logit contributions
 yourselves+0.061
ety+0.059
ried+0.057
neys+0.056
oky+0.056
Neg logit contributions
 thank-0.044
 show-0.041
 performance-0.038
 medal-0.037
 story-0.035
Token branch
Token ID8478
Feature activation-2.35e-01

Pos logit contributions
 thank+0.001
 show+0.001
 performance+0.001
 medal+0.001
 grin+0.001
Neg logit contributions
 yourselves-0.001
ety-0.001
ried-0.001
neys-0.001
 late-0.001
Token!
Token ID0
Feature activation+1.15e+00

Pos logit contributions
 yourselves+0.009
ety+0.009
ried+0.008
neys+0.008
Or+0.008
Neg logit contributions
 thank-0.004
 show-0.004
 performance-0.004
 medal-0.003
 story-0.003
Token and
Token ID290
Feature activation+7.14e-02

Pos logit contributions
 yourselves+0.026
ety+0.024
ried+0.023
Or+0.022
neys+0.022
Neg logit contributions
 thank-0.019
 show-0.017
 performance-0.017
 medal-0.016
 story-0.015
Token slipped
Token ID18859
Feature activation+1.00e+00

Pos logit contributions
 thank+0.002
 show+0.001
 performance+0.001
 medal+0.001
 leap+0.001
Neg logit contributions
 yourselves-0.003
ety-0.002
ried-0.002
neys-0.002
oky-0.002
Token.
Token ID13
Feature activation-7.20e-01

Pos logit contributions
 thank+0.024
 performance+0.023
 show+0.022
 medal+0.021
 story+0.020
Neg logit contributions
 yourselves-0.033
ety-0.030
ried-0.029
 late-0.029
Or-0.028
Token She
Token ID1375
Feature activation-2.15e+00

Pos logit contributions
 yourselves+0.024
ety+0.022
ried+0.022
 hyd+0.021
neys+0.020
Neg logit contributions
 thank-0.018
 show-0.016
 performance-0.015
on-0.015
 grin-0.014
Token fell
Token ID3214
Feature activation-1.11e+00

Pos logit contributions
 yourselves+0.067
ety+0.061
ried+0.059
Or+0.057
neys+0.055
Neg logit contributions
 thank-0.056
 performance-0.053
 show-0.051
 story-0.048
 medal-0.047
Token down
Token ID866
Feature activation-4.34e+00

Pos logit contributions
 yourselves+0.038
ety+0.035
ried+0.033
neys+0.033
Or+0.033
Neg logit contributions
 thank-0.025
 show-0.024
 performance-0.022
 medal-0.021
on-0.020
Token and
Token ID290
Feature activation-7.19e-01

Pos logit contributions
 yourselves+0.138
ety+0.125
ried+0.120
 late+0.119
Or+0.117
Neg logit contributions
 thank-0.108
 performance-0.102
 show-0.100
 medal-0.093
 story-0.091
Token scrap
Token ID15881
Feature activation+1.46e+00

Pos logit contributions
 yourselves+0.022
ety+0.020
ried+0.019
Or+0.019
 late+0.019
Neg logit contributions
 thank-0.019
 performance-0.018
 show-0.017
 story-0.016
 medal-0.016
Tokened
Token ID276
Feature activation-1.46e+00

Pos logit contributions
 thank+0.043
 show+0.041
 performance+0.040
 medal+0.037
 story+0.037
Neg logit contributions
 yourselves-0.040
ety-0.036
ried-0.034
neys-0.033
Or-0.032
Token her
Token ID607
Feature activation-2.23e+00

Pos logit contributions
 yourselves+0.056
ety+0.053
ried+0.051
neys+0.050
Or+0.049
Neg logit contributions
 thank-0.027
 show-0.024
 performance-0.023
 medal-0.021
 story-0.020
Token knee
Token ID10329
Feature activation-1.68e-01

Pos logit contributions
 yourselves+0.057
ety+0.050
ried+0.048
Or+0.045
neys+0.045
Neg logit contributions
 thank-0.071
 show-0.066
 performance-0.066
 story-0.061
 medal-0.061
Token bad
Token ID2089
Feature activation-1.34e+00

Pos logit contributions
 performance+0.028
 thank+0.027
 show+0.027
 leap+0.025
 medal+0.025
Neg logit contributions
 yourselves-0.026
ety-0.023
 late-0.022
Or-0.021
ried-0.021
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
 yourselves+0.044
ety+0.041
 late+0.039
ried+0.038
oky+0.038
Neg logit contributions
 thank-0.031
 performance-0.030
 show-0.029
 leap-0.027
 medal-0.027
Token He
Token ID679
Feature activation-2.91e+00

Pos logit contributions
 yourselves+0.055
ety+0.053
ried+0.053
rying+0.050
 hyd+0.049
Neg logit contributions
 thank-0.029
 show-0.024
 Everyone-0.022
 Everybody-0.020
 story-0.020
Token said
Token ID531
Feature activation+1.88e+00

Pos logit contributions
 yourselves+0.103
ety+0.096
ried+0.092
neys+0.089
Or+0.089
Neg logit contributions
 thank-0.063
 show-0.058
 performance-0.057
 medal-0.052
 story-0.051
Token,
Token ID11
Feature activation-6.59e-01

Pos logit contributions
 thank+0.040
 performance+0.040
 show+0.039
 leap+0.036
 medal+0.035
Neg logit contributions
 yourselves-0.064
ety-0.059
 late-0.056
ried-0.056
Or-0.055
Token "
Token ID366
Feature activation-5.76e+00

Pos logit contributions
 yourselves+0.025
ety+0.023
ried+0.022
neys+0.021
oky+0.021
Neg logit contributions
 thank-0.014
 show-0.013
 performance-0.012
 story-0.011
on-0.011
TokenI
Token ID40
Feature activation+6.69e-01

Pos logit contributions
ried+0.164
rying+0.150
 yourselves+0.145
aj+0.140
neys+0.139
Neg logit contributions
 thank-0.154
 show-0.154
on-0.144
 jugg-0.140
 medal-0.137
Token'm
Token ID1101
Feature activation+1.65e+00

Pos logit contributions
 thank+0.018
 performance+0.017
 show+0.017
 medal+0.016
 leap+0.016
Neg logit contributions
 yourselves-0.019
ety-0.019
ried-0.017
Or-0.017
 late-0.017
Token sorry
Token ID7926
Feature activation-1.79e+00

Pos logit contributions
 thank+0.041
 show+0.038
 performance+0.036
on+0.033
 story+0.033
Neg logit contributions
 yourselves-0.053
ety-0.051
ried-0.049
neys-0.047
Or-0.047
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
 yourselves+0.076
ety+0.071
ried+0.067
 late+0.066
neys+0.066
Neg logit contributions
 thank-0.025
 performance-0.023
 show-0.023
 medal-0.020
 grin-0.019
Token Lily
Token ID20037
Feature activation-1.14e+00

Pos logit contributions
 yourselves+0.044
ety+0.040
ried+0.036
Or+0.035
 late+0.035
Neg logit contributions
 thank-0.056
 performance-0.053
 show-0.053
 medal-0.049
 leap-0.049
Token reach
Token ID3151
Feature activation-3.22e-01

Pos logit contributions
 yourselves+0.019
ety+0.018
ried+0.017
Or+0.016
 hyd+0.016
Neg logit contributions
 thank-0.013
 performance-0.013
 show-0.012
 medal-0.012
 story-0.012
Token it
Token ID340
Feature activation-2.66e+00

Pos logit contributions
 yourselves+0.014
ety+0.013
ried+0.013
neys+0.013
Or+0.013
Neg logit contributions
 thank-0.004
 show-0.004
 performance-0.003
 medal-0.003
 story-0.003
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 yourselves+0.091
ety+0.086
ried+0.082
Or+0.079
 late+0.079
Neg logit contributions
 thank-0.059
 performance-0.055
 show-0.053
 medal-0.049
 grin-0.049
Token
Token ID198
Feature activation-9.31e-01

Pos logit contributions
 yourselves+0.046
ried+0.045
ety+0.045
asured+0.044
rying+0.043
Neg logit contributions
 thank-0.030
 Everyone-0.026
 show-0.025
 -0.024
 Everybody-0.024
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
 yourselves+0.036
ried+0.033
ety+0.033
 hyd+0.032
esty+0.031
Neg logit contributions
 thank-0.018
 show-0.016
 story-0.015
on-0.015
 hardly-0.014
Token"
Token ID1
Feature activation-5.21e+00

Pos logit contributions
 yourselves+0.034
ried+0.034
asured+0.034
ety+0.033
neys+0.032
Neg logit contributions
 thank-0.021
 show-0.020
on-0.019
 story-0.018
ner-0.018
TokenMaybe
Token ID13300
Feature activation-1.25e+00

Pos logit contributions
asured+0.155
rying+0.155
aj+0.152
 yourselves+0.151
neys+0.151
Neg logit contributions
 thank-0.111
 start-0.109
 show-0.109
 jugg-0.107
 bow-0.101
Token we
Token ID356
Feature activation+2.26e+00

Pos logit contributions
 yourselves+0.047
ety+0.045
ried+0.043
neys+0.043
oky+0.043
Neg logit contributions
 thank-0.026
 show-0.023
 performance-0.020
 medal-0.019
 story-0.018
Token can
Token ID460
Feature activation+2.99e+00

Pos logit contributions
 thank+0.048
 show+0.044
 performance+0.044
 medal+0.040
 story+0.039
Neg logit contributions
 yourselves-0.080
ety-0.075
ried-0.072
neys-0.069
Or-0.069
Token climb
Token ID12080
Feature activation-4.16e-01

Pos logit contributions
 performance+0.063
 thank+0.061
 medal+0.057
 grin+0.056
 goodbye+0.055
Neg logit contributions
 yourselves-0.101
ety-0.094
ried-0.088
 hyd-0.086
Or-0.085
Token a
Token ID257
Feature activation+2.04e+00

Pos logit contributions
 yourselves+0.017
ety+0.016
ried+0.016
neys+0.015
Or+0.015
Neg logit contributions
 thank-0.007
 show-0.006
 performance-0.005
 medal-0.005
on-0.005
Token
Token ID198
Feature activation-8.23e-01

Pos logit contributions
 yourselves+0.019
ety+0.017
ried+0.017
neys+0.016
 hyd+0.016
Neg logit contributions
 thank-0.015
 show-0.014
 performance-0.013
 story-0.012
on-0.012
TokenThe
Token ID464
Feature activation+3.24e-01

Pos logit contributions
 yourselves+0.027
ety+0.025
ried+0.025
oky+0.024
neys+0.023
Neg logit contributions
 thank-0.020
 show-0.018
 performance-0.017
on-0.017
 story-0.017
Token bird
Token ID6512
Feature activation+7.12e-01

Pos logit contributions
 thank+0.007
 show+0.007
 performance+0.006
 story+0.006
 medal+0.006
Neg logit contributions
 yourselves-0.012
ety-0.011
ried-0.011
neys-0.010
Or-0.010
Token replied
Token ID8712
Feature activation+1.54e+00

Pos logit contributions
 thank+0.014
 show+0.013
 performance+0.013
 medal+0.011
 story+0.011
Neg logit contributions
 yourselves-0.026
ety-0.025
ried-0.024
neys-0.023
Or-0.023
Token,
Token ID11
Feature activation+4.34e-01

Pos logit contributions
 performance+0.023
 show+0.023
 thank+0.022
 medal+0.020
 story+0.019
Neg logit contributions
 yourselves-0.060
ried-0.055
ety-0.055
Or-0.054
 late-0.053
Token "
Token ID366
Feature activation-5.05e+00

Pos logit contributions
 thank+0.011
 show+0.010
 performance+0.010
 medal+0.009
 story+0.009
Neg logit contributions
 yourselves-0.014
ety-0.013
ried-0.012
Or-0.012
 late-0.012
TokenBut
Token ID1537
Feature activation+3.62e-01

Pos logit contributions
 yourselves+0.135
ried+0.120
ety+0.113
neys+0.112
 wasteful+0.109
Neg logit contributions
 thank-0.152
 show-0.144
odic-0.139
 performance-0.138
 jugg-0.134
Token there
Token ID612
Feature activation+6.34e-01

Pos logit contributions
 thank+0.009
 show+0.009
 performance+0.008
 medal+0.008
 story+0.008
Neg logit contributions
 yourselves-0.011
ety-0.011
ried-0.010
oky-0.010
neys-0.010
Token's
Token ID338
Feature activation-2.31e+00

Pos logit contributions
 thank+0.014
 show+0.013
 performance+0.013
 medal+0.012
 grin+0.012
Neg logit contributions
 yourselves-0.022
ety-0.020
ried-0.019
neys-0.019
Or-0.018
Token so
Token ID523
Feature activation-1.44e+00

Pos logit contributions
 yourselves+0.085
ety+0.082
oky+0.080
ried+0.079
ipt+0.079
Neg logit contributions
 thank-0.047
 show-0.043
 performance-0.037
 start-0.036
 story-0.036
Token much
Token ID881
Feature activation-5.11e+00

Pos logit contributions
 yourselves+0.051
ety+0.051
ried+0.050
Or+0.048
esty+0.048
Neg logit contributions
 thank-0.030
 show-0.026
 performance-0.024
 story-0.023
 leap-0.022