TOP ACTIVATIONS
MAX = 0.155

that day forward , whenever the bear felt sad he would
with me ?" The p anda nods . He
She didn 't understand why the boy was so mean .
have some ?" The p anda was so surprised
she always remembered how kind the bear was to share his
even f atter . Whenever the girl said hello and asked
bird realized that even though the egg seemed tough , what
that day on , whenever the m op wasn 't available
your picture ?" The p anda said , "
and started to zip down the busy road . She felt
your words ?" The elephants made some more noises
was lonely . The little girl said " Don
became friends . Now whenever the bear gets hungry , the
no choice but to leave the park and go home without
that day on , whenever the little bird flew in the
proud of it . Whenever the train left the station everyone
of things . The cats had so much fun
its beauty . The tiger moved from the forest
your camera ." The little boy took out his
fun together . Finally , the little boy said , "

MIN ACTIVATIONS
MIN = -0.085

a persistent lady came up to her and told her she
grandma 's house for safe keeping . When
it okay if I rest here too ?' Fred smiled and
let 's rock back and forth to make you feel better
tree would sway back and forth , while the small bush
. Just rock back and forth and you 'll make a
. Just rock back and forth and feel the music in
, a man came up to him and asked , "
! He flew back and forth in the sky , spir
an older man came up to Sophie . He asked her
in and twisted back and forth . Suddenly , the toe
mole . " To get to the other side
tag . It says ' To Lily and Tom , from
a tag that said " To Ben , from Dad ".
" It says : ' To whoever finds this note ,
. He rocked back and forth , pushing and pulling as
The note said : " To open the box , you
Then her mom came up to her with a big smile
rocked the bucket back and forth , making a game out
, " Let 's search inside the desk !"

INTERVAL 0.095 - 0.155
CONTAINS 4.448%

little girl looked sadly at the gem as all the memories
as he went fast . The sun was warm and bright
very wise . The wise bear knew he 'd
. But his mom said the ghost was too expensive and
friends . The end . <|endoftext|> Once upon

INTERVAL 0.035 - 0.095
CONTAINS 10.414%

play with her friends in the park . One day ,
- all . There are many things that I cannot do
," said Lily , feeling a bit disappointed .
a time , there was a boy named Tim my .
crib for her . ” The day

INTERVAL -0.025 - 0.035
CONTAINS 60.439%

The End . <|endoftext|> John was excited . He had been
never told a lie again either - everything it said was
, but they also loved sleeping . They kissed
play with it . " Let 's decor
. Her mom was proud of her for being

INTERVAL -0.085 - -0.025
CONTAINS 24.699%

good , but they can turn out bad . We just
face . It was cold against her skin and made her
to make him feel better by ordering some new red socks
so delicious . She really wanted to eat it , but
playing outside every day and recognizing different things . It was
Token that
Token ID326
Feature activation+3.69e-02

Pos logit contributions
 biggest+0.056
ini+0.053
 elderly+0.052
 sun+0.052
 jungle+0.051
Neg logit contributions
 it-0.017
pped-0.014
pping-0.013
 let-0.011
 him-0.010
Token day
Token ID1110
Feature activation-2.56e-02

Pos logit contributions
 biggest+0.037
umm+0.034
ini+0.033
err+0.030
 former+0.027
Neg logit contributions
 it-0.105
pped-0.091
 let-0.090
 him-0.089
pping-0.088
Token forward
Token ID2651
Feature activation-3.28e-02

Pos logit contributions
 it+0.036
pped+0.030
 how+0.029
 whoever+0.029
 him+0.029
Neg logit contributions
 biggest-0.053
 best-0.052
 sun-0.050
ubby-0.048
ama-0.048
Token,
Token ID11
Feature activation-2.75e-02

Pos logit contributions
pping+0.054
pped+0.054
 it+0.044
sels+0.042
hot+0.042
Neg logit contributions
 biggest-0.058
 girl-0.057
 sun-0.056
 best-0.056
 jungle-0.056
Token whenever
Token ID8797
Feature activation-2.96e-02

Pos logit contributions
 it+0.025
pped+0.021
pping+0.018
 let+0.016
 him+0.016
Neg logit contributions
 biggest-0.076
umm-0.073
ini-0.072
 jungle-0.070
err-0.070
Token the
Token ID262
Feature activation+1.52e-01

Pos logit contributions
pped+0.030
 it+0.029
pping+0.027
 let+0.021
ged+0.021
Neg logit contributions
 biggest-0.077
umm-0.072
 jungle-0.071
 sun-0.071
 best-0.070
Token bear
Token ID6842
Feature activation+2.38e-03

Pos logit contributions
opl+0.238
umm+0.229
anny+0.219
omm+0.218
err+0.216
Neg logit contributions
 it-0.314
 let-0.293
 don-0.247
 in-0.243
 tried-0.241
Token felt
Token ID2936
Feature activation+9.93e-03

Pos logit contributions
 biggest+0.005
ini+0.005
umm+0.005
 jungle+0.005
err+0.005
Neg logit contributions
 it-0.004
pped-0.003
pping-0.003
 him-0.003
 let-0.003
Token sad
Token ID6507
Feature activation-1.31e-02

Pos logit contributions
 biggest+0.014
umm+0.013
ini+0.013
err+0.012
 jungle+0.011
Neg logit contributions
 it-0.024
pped-0.021
 him-0.020
 let-0.020
pping-0.020
Token he
Token ID339
Feature activation-3.27e-03

Pos logit contributions
pped+0.017
 it+0.017
pping+0.016
 let+0.013
 whoever+0.012
Neg logit contributions
 biggest-0.030
umm-0.028
 best-0.028
 sun-0.027
ini-0.027
Token would
Token ID561
Feature activation-3.02e-02

Pos logit contributions
 it+0.005
pped+0.004
pping+0.004
 whoever+0.004
 him+0.004
Neg logit contributions
 biggest-0.007
 best-0.006
 jungle-0.006
 world-0.006
ini-0.006
Token with
Token ID351
Feature activation-2.90e-02

Pos logit contributions
 it+0.019
pped+0.018
pping+0.016
ged+0.015
 risking+0.015
Neg logit contributions
 biggest-0.037
ini-0.034
 sun-0.034
 jungle-0.033
 best-0.033
Token me
Token ID502
Feature activation-7.13e-03

Pos logit contributions
 it+0.059
pped+0.057
pping+0.056
 let+0.053
ged+0.051
Neg logit contributions
 biggest-0.043
ini-0.037
 sun-0.037
 jungle-0.036
umm-0.035
Token?"
Token ID1701
Feature activation-1.18e-02

Pos logit contributions
 it+0.012
pped+0.012
ged+0.010
 whoever+0.010
pping+0.010
Neg logit contributions
 biggest-0.013
 best-0.012
ini-0.012
 jungle-0.012
 world-0.012
Token
Token ID198
Feature activation+2.20e-02

Pos logit contributions
 it+0.022
pped+0.018
 him+0.017
 let+0.017
pping+0.017
Neg logit contributions
 biggest-0.023
ini-0.022
umm-0.022
err-0.021
 jungle-0.020
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
 biggest+0.043
umm+0.041
ini+0.041
err+0.039
 jungle+0.038
Neg logit contributions
 it-0.040
pped-0.034
 him-0.032
 let-0.032
pping-0.031
TokenThe
Token ID464
Feature activation+1.51e-01

Pos logit contributions
 biggest+0.021
ini+0.019
 jungle+0.019
 sun+0.019
 elderly+0.019
Neg logit contributions
 it-0.021
pped-0.019
pping-0.018
 him-0.017
 wouldn-0.017
Token p
Token ID279
Feature activation+1.09e-01

Pos logit contributions
umm+0.383
 former+0.363
igsaw+0.363
auldron+0.354
ini+0.348
Neg logit contributions
 it-0.220
 let-0.172
 him-0.137
 them-0.135
 that-0.131
Tokenanda
Token ID5282
Feature activation+8.65e-03

Pos logit contributions
umm+0.188
 youngest+0.182
 biggest+0.180
 eldest+0.171
err+0.163
Neg logit contributions
 it-0.218
 that-0.167
 him-0.164
 how-0.160
 instead-0.160
Token nods
Token ID34833
Feature activation-3.73e-02

Pos logit contributions
 biggest+0.016
ini+0.015
umm+0.015
err+0.014
 jungle+0.014
Neg logit contributions
 it-0.017
pped-0.014
 him-0.013
pping-0.013
 let-0.013
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
 it+0.046
pped+0.045
pping+0.043
 whoever+0.039
 risking+0.037
Neg logit contributions
 biggest-0.084
err-0.081
 sun-0.080
 best-0.077
ini-0.077
Token He
Token ID679
Feature activation-2.19e-02

Pos logit contributions
 it+0.022
pped+0.021
pping+0.020
 him+0.018
 let+0.018
Neg logit contributions
 biggest-0.023
umm-0.021
err-0.020
ini-0.020
 sun-0.020
Token She
Token ID1375
Feature activation-8.82e-03

Pos logit contributions
 it+0.014
 him+0.011
pped+0.011
 let+0.010
pping+0.010
Neg logit contributions
 biggest-0.017
ini-0.016
umm-0.016
err-0.015
ubby-0.015
Token didn
Token ID1422
Feature activation-1.65e-02

Pos logit contributions
 it+0.013
pped+0.011
pping+0.010
 him+0.010
 let+0.009
Neg logit contributions
 biggest-0.019
ini-0.018
err-0.018
 jungle-0.018
 eldest-0.018
Token't
Token ID470
Feature activation-1.16e-03

Pos logit contributions
 it+0.021
pped+0.017
 let+0.015
 him+0.015
pping+0.015
Neg logit contributions
 biggest-0.040
ini-0.038
umm-0.038
err-0.037
 jungle-0.036
Token understand
Token ID1833
Feature activation-2.79e-02

Pos logit contributions
 it+0.002
pped+0.002
pping+0.001
 him+0.001
ged+0.001
Neg logit contributions
 biggest-0.003
ini-0.002
err-0.002
 sun-0.002
 jungle-0.002
Token why
Token ID1521
Feature activation-3.57e-02

Pos logit contributions
pped+0.040
sels+0.035
pping+0.033
ged+0.033
ched+0.031
Neg logit contributions
 biggest-0.057
 best-0.055
 new-0.055
 sun-0.054
 old-0.054
Token the
Token ID262
Feature activation+1.50e-01

Pos logit contributions
pped+0.044
pping+0.039
 it+0.037
sels+0.033
assed+0.032
Neg logit contributions
 biggest-0.077
 elderly-0.076
 sun-0.076
 jungle-0.074
 best-0.074
Token boy
Token ID2933
Feature activation-5.60e-03

Pos logit contributions
umm+0.301
igsaw+0.264
 former+0.264
omm+0.258
ini+0.240
Neg logit contributions
 it-0.340
 let-0.314
 the-0.257
 him-0.257
 that-0.247
Token was
Token ID373
Feature activation+8.74e-03

Pos logit contributions
pped+0.010
 it+0.009
pping+0.009
sels+0.008
 risking+0.008
Neg logit contributions
 biggest-0.009
 best-0.009
 jungle-0.009
ini-0.008
 youngest-0.008
Token so
Token ID523
Feature activation+2.85e-02

Pos logit contributions
 biggest+0.015
 elderly+0.014
 best+0.014
 older+0.014
 jungle+0.013
Neg logit contributions
 it-0.016
pped-0.016
pping-0.014
 him-0.013
ged-0.013
Token mean
Token ID1612
Feature activation-1.83e-02

Pos logit contributions
umm+0.059
 biggest+0.053
ini+0.053
omm+0.050
 former+0.049
Neg logit contributions
 it-0.057
 let-0.049
 him-0.046
pped-0.044
 that-0.042
Token.
Token ID13
Feature activation-2.31e-03

Pos logit contributions
pped+0.032
pping+0.029
sels+0.027
ck+0.026
 breathe+0.025
Neg logit contributions
 best-0.032
 biggest-0.031
 sun-0.031
 world-0.030
 old-0.030
Token have
Token ID423
Feature activation-3.73e-02

Pos logit contributions
 it+0.052
pped+0.046
pping+0.041
ged+0.040
 him+0.039
Neg logit contributions
 biggest-0.076
ini-0.072
 sun-0.069
umm-0.069
 jungle-0.067
Token some
Token ID617
Feature activation+3.87e-02

Pos logit contributions
 it+0.054
pped+0.050
pping+0.047
 let+0.043
 him+0.042
Neg logit contributions
 biggest-0.081
 sun-0.073
umm-0.073
ini-0.072
 jungle-0.070
Token?"
Token ID1701
Feature activation-1.16e-02

Pos logit contributions
 biggest+0.066
ini+0.065
umm+0.064
err+0.062
 jungle+0.058
Neg logit contributions
 it-0.080
pped-0.067
 him-0.064
 let-0.064
pping-0.062
Token
Token ID198
Feature activation+7.60e-03

Pos logit contributions
 it+0.023
pped+0.023
pping+0.021
 let+0.019
 him+0.019
Neg logit contributions
 biggest-0.018
ini-0.017
 sun-0.017
 jungle-0.017
umm-0.017
Token
Token ID198
Feature activation+2.14e-03

Pos logit contributions
 biggest+0.015
umm+0.014
ini+0.014
err+0.013
 jungle+0.013
Neg logit contributions
 it-0.014
pped-0.012
 him-0.011
 let-0.011
pping-0.011
TokenThe
Token ID464
Feature activation+1.49e-01

Pos logit contributions
 biggest+0.004
ini+0.004
 jungle+0.003
 sun+0.003
umm+0.003
Neg logit contributions
 it-0.004
pped-0.004
pping-0.004
 him-0.003
ged-0.003
Token p
Token ID279
Feature activation+1.09e-01

Pos logit contributions
umm+0.331
 former+0.308
igsaw+0.302
ini+0.292
auldron+0.283
Neg logit contributions
 it-0.263
 let-0.236
 how-0.192
 the-0.184
 that-0.182
Tokenanda
Token ID5282
Feature activation+1.11e-02

Pos logit contributions
 biggest+0.186
umm+0.180
 youngest+0.171
 eldest+0.160
err+0.158
Neg logit contributions
 it-0.211
pped-0.169
 him-0.167
 that-0.163
ged-0.162
Token was
Token ID373
Feature activation+1.74e-03

Pos logit contributions
 biggest+0.024
ini+0.023
umm+0.022
err+0.022
 jungle+0.021
Neg logit contributions
 it-0.017
pped-0.015
pping-0.014
 him-0.013
 let-0.013
Token so
Token ID523
Feature activation+1.29e-02

Pos logit contributions
umm+0.003
ini+0.003
 biggest+0.003
err+0.003
 former+0.003
Neg logit contributions
 it-0.003
pped-0.003
 let-0.003
 him-0.003
pping-0.002
Token surprised
Token ID6655
Feature activation-2.39e-02

Pos logit contributions
umm+0.023
 biggest+0.023
ini+0.022
err+0.021
 jungle+0.020
Neg logit contributions
 it-0.027
 let-0.023
 him-0.021
 instead-0.020
pped-0.020
Token she
Token ID673
Feature activation-1.53e-02

Pos logit contributions
pped+0.037
 it+0.035
pping+0.033
ged+0.029
 him+0.028
Neg logit contributions
 biggest-0.058
umm-0.053
 best-0.052
 sun-0.052
ini-0.051
Token always
Token ID1464
Feature activation-1.59e-02

Pos logit contributions
 it+0.024
pped+0.023
 him+0.020
pping+0.020
ged+0.020
Neg logit contributions
 biggest-0.029
ini-0.028
 jungle-0.028
 best-0.027
umm-0.027
Token remembered
Token ID12086
Feature activation-3.92e-02

Pos logit contributions
 it+0.023
pped+0.022
pping+0.020
 him+0.019
 whoever+0.017
Neg logit contributions
 biggest-0.033
 best-0.031
 jungle-0.030
 sun-0.030
umm-0.030
Token how
Token ID703
Feature activation-1.26e-02

Pos logit contributions
pped+0.044
 it+0.039
pping+0.039
ged+0.035
assed+0.031
Neg logit contributions
 biggest-0.096
 best-0.089
 sun-0.088
 elderly-0.087
 jungle-0.087
Token kind
Token ID1611
Feature activation-1.86e-02

Pos logit contributions
 it+0.017
pped+0.017
pping+0.015
 let+0.013
 him+0.013
Neg logit contributions
 biggest-0.029
ini-0.027
umm-0.026
 elderly-0.025
err-0.025
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
pped+0.025
 it+0.024
pping+0.023
ged+0.020
assed+0.019
Neg logit contributions
 biggest-0.041
 sun-0.038
umm-0.038
ini-0.038
 best-0.037
Token bear
Token ID6842
Feature activation-1.63e-02

Pos logit contributions
umm+0.237
igsaw+0.227
ini+0.227
err+0.223
omm+0.222
Neg logit contributions
 it-0.303
 let-0.269
 him-0.244
 don-0.239
 share-0.232
Token was
Token ID373
Feature activation-2.14e-02

Pos logit contributions
 it+0.023
pped+0.022
pping+0.019
 him+0.018
ged+0.018
Neg logit contributions
 biggest-0.035
ini-0.033
 jungle-0.032
 best-0.032
err-0.031
Token to
Token ID284
Feature activation-3.37e-02

Pos logit contributions
pped+0.029
 it+0.028
pping+0.025
ged+0.024
 him+0.022
Neg logit contributions
 biggest-0.048
 best-0.045
 elderly-0.044
 sun-0.043
 jungle-0.043
Token share
Token ID2648
Feature activation-4.58e-02

Pos logit contributions
 it+0.046
pped+0.045
pping+0.038
ged+0.038
 him+0.033
Neg logit contributions
 biggest-0.074
 jungle-0.068
 sun-0.068
 former-0.068
umm-0.067
Token his
Token ID465
Feature activation+1.04e-01

Pos logit contributions
pped+0.051
 it+0.049
pping+0.046
ged+0.039
 let+0.039
Neg logit contributions
 biggest-0.110
ini-0.102
 sun-0.101
 jungle-0.101
 world-0.100
Token even
Token ID772
Feature activation+2.90e-02

Pos logit contributions
 biggest+0.006
ini+0.005
umm+0.005
err+0.005
 jungle+0.005
Neg logit contributions
 it-0.007
pped-0.006
 him-0.006
 let-0.006
pping-0.006
Token f
Token ID277
Feature activation+4.14e-02

Pos logit contributions
 biggest+0.059
umm+0.059
ini+0.058
err+0.056
omm+0.055
Neg logit contributions
 it-0.050
 let-0.040
pped-0.039
 him-0.037
 instead-0.037
Tokenatter
Token ID1436
Feature activation-7.96e-03

Pos logit contributions
 biggest+0.065
umm+0.057
 jungle+0.057
ini+0.056
err+0.055
Neg logit contributions
 it-0.088
pped-0.084
pping-0.079
 let-0.074
 him-0.074
Token.
Token ID13
Feature activation-1.85e-02

Pos logit contributions
 it+0.010
pped+0.008
pping+0.007
 let+0.007
 whoever+0.007
Neg logit contributions
 biggest-0.020
ini-0.019
umm-0.019
err-0.018
 sun-0.018
Token Whenever
Token ID21326
Feature activation-2.68e-02

Pos logit contributions
 it+0.035
pped+0.031
pping+0.028
 him+0.028
 let+0.028
Neg logit contributions
 biggest-0.034
umm-0.033
ini-0.032
err-0.031
 jungle-0.030
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
pped+0.030
pping+0.028
ged+0.022
 it+0.022
assed+0.022
Neg logit contributions
 biggest-0.068
 sun-0.065
 best-0.063
 jungle-0.063
 sky-0.063
Token girl
Token ID2576
Feature activation-1.11e-03

Pos logit contributions
umm+0.288
err+0.270
igsaw+0.270
omm+0.268
anny+0.261
Neg logit contributions
 it-0.302
 let-0.261
 that-0.219
 there-0.207
 the-0.204
Token said
Token ID531
Feature activation-2.72e-02

Pos logit contributions
 it+0.002
pped+0.001
pping+0.001
 him+0.001
 whoever+0.001
Neg logit contributions
 biggest-0.002
ini-0.002
 jungle-0.002
umm-0.002
 eldest-0.002
Token hello
Token ID23748
Feature activation-1.59e-02

Pos logit contributions
pped+0.034
pping+0.031
ge+0.027
assed+0.027
ged+0.026
Neg logit contributions
 biggest-0.068
 sun-0.064
 best-0.062
 new-0.060
 old-0.059
Token and
Token ID290
Feature activation-3.00e-02

Pos logit contributions
pped+0.021
 it+0.020
pping+0.019
 risking+0.016
 whoever+0.016
Neg logit contributions
 biggest-0.037
 sun-0.034
umm-0.034
ini-0.033
 best-0.033
Token asked
Token ID1965
Feature activation-4.65e-02

Pos logit contributions
 it+0.034
pped+0.034
pping+0.031
 him+0.026
assed+0.024
Neg logit contributions
 biggest-0.072
 sun-0.067
 jungle-0.067
ini-0.067
umm-0.066
Token bird
Token ID6512
Feature activation+3.30e-04

Pos logit contributions
ini+0.212
umm+0.210
anny+0.205
 former+0.196
err+0.195
Neg logit contributions
 it-0.312
 let-0.248
 that-0.245
 how-0.235
 him-0.234
Token realized
Token ID6939
Feature activation-3.48e-02

Pos logit contributions
 biggest+0.001
ini+0.001
 jungle+0.001
umm+0.001
err+0.001
Neg logit contributions
 it-0.000
pped-0.000
 him-0.000
pping-0.000
 let-0.000
Token that
Token ID326
Feature activation-2.30e-02

Pos logit contributions
pped+0.039
pping+0.036
ged+0.030
sels+0.030
 breathe+0.029
Neg logit contributions
 biggest-0.080
 best-0.079
 elderly-0.076
 new-0.075
 sun-0.075
Token even
Token ID772
Feature activation+4.20e-03

Pos logit contributions
pped+0.025
 it+0.021
pping+0.020
ged+0.018
 let+0.016
Neg logit contributions
 biggest-0.059
 best-0.055
 jungle-0.054
 world-0.054
umm-0.054
Token though
Token ID996
Feature activation-2.72e-02

Pos logit contributions
 biggest+0.011
ini+0.011
umm+0.011
err+0.010
 jungle+0.010
Neg logit contributions
 it-0.005
pped-0.004
pping-0.003
 him-0.003
 let-0.003
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
pped+0.027
pping+0.024
 it+0.024
ged+0.020
 let+0.019
Neg logit contributions
 biggest-0.071
 best-0.066
 jungle-0.065
 sun-0.065
 world-0.064
Token egg
Token ID5935
Feature activation+9.02e-03

Pos logit contributions
ini+0.210
anny+0.199
err+0.190
umm+0.188
omm+0.179
Neg logit contributions
 it-0.353
 let-0.281
 him-0.273
 that-0.265
 each-0.260
Token seemed
Token ID3947
Feature activation+8.54e-04

Pos logit contributions
 biggest+0.020
ini+0.019
umm+0.019
err+0.018
 jungle+0.018
Neg logit contributions
 it-0.013
pped-0.011
pping-0.010
 him-0.010
 let-0.010
Token tough
Token ID5802
Feature activation-2.03e-02

Pos logit contributions
 biggest+0.002
ini+0.002
umm+0.002
err+0.002
 jungle+0.002
Neg logit contributions
 it-0.001
pped-0.001
pping-0.001
 him-0.001
 let-0.001
Token,
Token ID11
Feature activation-4.04e-02

Pos logit contributions
pped+0.029
 it+0.027
pping+0.026
 let+0.024
ck+0.023
Neg logit contributions
 biggest-0.042
 best-0.041
 world-0.038
 sun-0.037
umm-0.037
Token what
Token ID644
Feature activation-1.58e-02

Pos logit contributions
pped+0.023
pping+0.016
 it+0.013
ged+0.012
assed+0.009
Neg logit contributions
 biggest-0.124
 best-0.117
 sun-0.116
 world-0.115
 jungle-0.113
Token that
Token ID326
Feature activation+4.27e-02

Pos logit contributions
 biggest+0.064
ini+0.061
 sun+0.060
 elderly+0.060
umm+0.059
Neg logit contributions
 it-0.019
pped-0.015
pping-0.014
 let-0.012
 him-0.012
Token day
Token ID1110
Feature activation-9.94e-03

Pos logit contributions
 biggest+0.047
umm+0.046
ini+0.044
err+0.040
 former+0.039
Neg logit contributions
 it-0.117
 let-0.099
pped-0.099
 him-0.098
pping-0.097
Token on
Token ID319
Feature activation-2.12e-04

Pos logit contributions
 it+0.012
pped+0.009
 him+0.009
 whoever+0.009
 let+0.009
Neg logit contributions
 biggest-0.023
 best-0.023
 sun-0.022
ubby-0.022
ama-0.021
Token,
Token ID11
Feature activation-2.17e-02

Pos logit contributions
 it+0.000
pped+0.000
pping+0.000
 him+0.000
 let+0.000
Neg logit contributions
 biggest-0.000
 best-0.000
 sun-0.000
umm-0.000
ini-0.000
Token whenever
Token ID8797
Feature activation-2.26e-02

Pos logit contributions
 it+0.034
pped+0.028
 him+0.026
 let+0.026
pping+0.025
Neg logit contributions
 biggest-0.048
ini-0.045
umm-0.045
err-0.044
 jungle-0.043
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
pped+0.033
 it+0.032
pping+0.032
ged+0.027
 him+0.027
Neg logit contributions
 biggest-0.047
 sun-0.043
 best-0.043
 jungle-0.042
umm-0.042
Token m
Token ID285
Feature activation+1.29e-01

Pos logit contributions
 former+0.290
err+0.279
umm+0.278
igsaw+0.277
ini+0.253
Neg logit contributions
 it-0.279
 let-0.220
 him-0.194
 the-0.193
 instead-0.187
Tokenop
Token ID404
Feature activation+2.88e-02

Pos logit contributions
 biggest+0.322
 former+0.302
 jungle+0.298
umm+0.295
err+0.295
Neg logit contributions
 it-0.167
pped-0.129
pping-0.117
 him-0.113
ged-0.104
Token wasn
Token ID2492
Feature activation-1.25e-02

Pos logit contributions
 biggest+0.061
ini+0.059
umm+0.057
err+0.055
 jungle+0.055
Neg logit contributions
 it-0.046
pped-0.039
pping-0.035
 let-0.035
 him-0.034
Token't
Token ID470
Feature activation+6.22e-03

Pos logit contributions
 it+0.018
pped+0.015
 let+0.013
pping+0.013
 him+0.013
Neg logit contributions
 biggest-0.028
ini-0.026
umm-0.025
err-0.025
omm-0.025
Token available
Token ID1695
Feature activation-1.59e-02

Pos logit contributions
 biggest+0.010
ini+0.009
 elderly+0.009
 best+0.009
umm+0.009
Neg logit contributions
 it-0.013
pped-0.012
pping-0.011
 him-0.011
 let-0.011
Token your
Token ID534
Feature activation+8.83e-02

Pos logit contributions
 it+0.050
pped+0.041
pping+0.037
 let+0.037
 him+0.036
Neg logit contributions
 biggest-0.085
ini-0.080
umm-0.079
err-0.077
 jungle-0.076
Token picture
Token ID4286
Feature activation-1.76e-02

Pos logit contributions
umm+0.151
ini+0.147
err+0.140
 biggest+0.139
 former+0.135
Neg logit contributions
 it-0.189
 let-0.151
 him-0.149
pped-0.148
 how-0.137
Token?"
Token ID1701
Feature activation-9.80e-03

Pos logit contributions
 it+0.023
pped+0.019
pping+0.017
 let+0.016
 him+0.016
Neg logit contributions
 biggest-0.043
ini-0.041
umm-0.041
err-0.039
 jungle-0.039
Token
Token ID198
Feature activation+7.48e-03

Pos logit contributions
 it+0.018
pped+0.017
pping+0.016
 let+0.015
 him+0.015
Neg logit contributions
 biggest-0.018
ini-0.017
umm-0.017
err-0.016
 jungle-0.016
Token
Token ID198
Feature activation+3.48e-03

Pos logit contributions
 biggest+0.015
umm+0.015
ini+0.015
err+0.014
 jungle+0.014
Neg logit contributions
 it-0.013
pped-0.011
 let-0.010
 him-0.010
pping-0.010
TokenThe
Token ID464
Feature activation+1.48e-01

Pos logit contributions
 biggest+0.007
ini+0.006
umm+0.006
err+0.006
 jungle+0.006
Neg logit contributions
 it-0.006
pped-0.006
pping-0.005
 him-0.005
 let-0.005
Token p
Token ID279
Feature activation+1.14e-01

Pos logit contributions
umm+0.307
igsaw+0.289
 former+0.278
err+0.248
auldron+0.247
Neg logit contributions
 it-0.298
 let-0.268
 that-0.218
 the-0.215
 put-0.211
Tokenanda
Token ID5282
Feature activation+1.78e-02

Pos logit contributions
umm+0.195
 biggest+0.194
 youngest+0.186
 eldest+0.176
err+0.169
Neg logit contributions
 it-0.219
 him-0.173
 that-0.167
pped-0.166
 perched-0.165
Token said
Token ID531
Feature activation-3.12e-02

Pos logit contributions
 biggest+0.039
ini+0.037
umm+0.037
err+0.036
 jungle+0.035
Neg logit contributions
 it-0.027
pped-0.023
pping-0.021
 him-0.020
 let-0.020
Token,
Token ID11
Feature activation-3.36e-02

Pos logit contributions
pped+0.055
pping+0.052
assed+0.047
ge+0.047
 risking+0.046
Neg logit contributions
 biggest-0.058
 best-0.055
 new-0.050
 sun-0.049
 little-0.047
Token "
Token ID366
Feature activation-1.98e-02

Pos logit contributions
pped+0.070
 it+0.065
pping+0.063
 risking+0.059
ge+0.058
Neg logit contributions
 biggest-0.051
 sun-0.047
 sky-0.043
omm-0.042
 best-0.042
Token and
Token ID290
Feature activation-3.51e-02

Pos logit contributions
 it+0.071
pped+0.071
 whoever+0.063
 risking+0.063
pping+0.061
Neg logit contributions
 biggest-0.089
 sun-0.084
 best-0.083
 sky-0.081
 elderly-0.079
Token started
Token ID2067
Feature activation-2.91e-02

Pos logit contributions
pped+0.047
pping+0.046
 it+0.045
 him+0.039
 risking+0.039
Neg logit contributions
 biggest-0.075
 sun-0.070
 best-0.066
 jungle-0.066
 elderly-0.066
Token to
Token ID284
Feature activation-1.90e-02

Pos logit contributions
pped+0.045
pping+0.037
 it+0.036
ms+0.035
 whoever+0.035
Neg logit contributions
 biggest-0.057
 new-0.054
 jungle-0.053
 sun-0.053
 older-0.052
Token zip
Token ID19974
Feature activation-3.40e-02

Pos logit contributions
 it+0.036
pped+0.036
 risking+0.031
pping+0.031
ged+0.030
Neg logit contributions
 biggest-0.032
 jungle-0.028
 tallest-0.028
 sun-0.028
 best-0.027
Token down
Token ID866
Feature activation-3.23e-02

Pos logit contributions
pped+0.050
pping+0.046
 it+0.045
 let+0.042
 whoever+0.041
Neg logit contributions
 biggest-0.074
 best-0.065
 sun-0.064
 tallest-0.062
err-0.062
Token the
Token ID262
Feature activation+1.48e-01

Pos logit contributions
pped+0.034
 it+0.032
pping+0.031
 let+0.027
 risking+0.023
Neg logit contributions
 biggest-0.081
 jungle-0.075
 sun-0.074
 best-0.074
 sky-0.073
Token busy
Token ID8179
Feature activation+1.06e-01

Pos logit contributions
umm+0.221
 former+0.208
ini+0.196
omm+0.195
err+0.193
Neg logit contributions
 it-0.346
 him-0.288
 let-0.287
pping-0.286
pped-0.278
Token road
Token ID2975
Feature activation-3.24e-02

Pos logit contributions
 biggest+0.131
umm+0.128
err+0.124
ini+0.123
 former+0.120
Neg logit contributions
 it-0.272
 him-0.227
 let-0.224
pped-0.215
pping-0.212
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
pped+0.046
 it+0.042
pping+0.037
 risking+0.034
 let+0.034
Neg logit contributions
 biggest-0.072
 best-0.066
 jungle-0.065
 world-0.064
 sky-0.063
Token She
Token ID1375
Feature activation-1.28e-02

Pos logit contributions
 it+0.033
pped+0.027
 him+0.026
 let+0.026
pping+0.025
Neg logit contributions
 biggest-0.035
ini-0.034
umm-0.033
err-0.032
 jungle-0.031
Token felt
Token ID2936
Feature activation-4.64e-03

Pos logit contributions
 it+0.019
pped+0.017
pping+0.016
 him+0.015
 let+0.014
Neg logit contributions
 biggest-0.028
ini-0.026
 jungle-0.025
 best-0.024
err-0.024
Token your
Token ID534
Feature activation+7.37e-02

Pos logit contributions
 it+0.024
pped+0.022
pping+0.019
 let+0.018
 him+0.017
Neg logit contributions
 biggest-0.050
ini-0.046
 jungle-0.045
 sun-0.045
err-0.044
Token words
Token ID2456
Feature activation-4.34e-02

Pos logit contributions
umm+0.112
ini+0.104
 biggest+0.096
err+0.092
omm+0.088
Neg logit contributions
 it-0.178
 let-0.145
 him-0.145
 how-0.138
 instead-0.133
Token?"
Token ID1701
Feature activation-3.19e-03

Pos logit contributions
pped+0.070
 it+0.068
pping+0.062
ged+0.061
 risking+0.059
Neg logit contributions
 biggest-0.086
 best-0.077
 sun-0.077
 world-0.074
ini-0.074
Token
Token ID198
Feature activation+1.64e-02

Pos logit contributions
 it+0.006
pped+0.006
pping+0.005
 let+0.005
 him+0.005
Neg logit contributions
 biggest-0.006
umm-0.005
ini-0.005
 sun-0.005
err-0.005
Token
Token ID198
Feature activation+1.32e-02

Pos logit contributions
 biggest+0.032
umm+0.030
ini+0.030
err+0.029
 jungle+0.028
Neg logit contributions
 it-0.030
pped-0.025
 him-0.024
 let-0.023
pping-0.023
TokenThe
Token ID464
Feature activation+1.48e-01

Pos logit contributions
 biggest+0.022
ini+0.021
umm+0.020
err+0.019
 jungle+0.019
Neg logit contributions
 it-0.027
pped-0.024
pping-0.023
 him-0.022
 let-0.022
Token elephants
Token ID25368
Feature activation-2.88e-03

Pos logit contributions
umm+0.321
ini+0.313
 former+0.312
igsaw+0.303
aj+0.280
Neg logit contributions
 it-0.247
 let-0.227
 them-0.204
 the-0.201
 each-0.194
Token made
Token ID925
Feature activation-2.09e-02

Pos logit contributions
 it+0.004
pped+0.004
pping+0.004
 him+0.003
 whoever+0.003
Neg logit contributions
 biggest-0.006
 jungle-0.005
 sun-0.005
 sky-0.005
ini-0.005
Token some
Token ID617
Feature activation+3.89e-02

Pos logit contributions
 it+0.021
pped+0.013
 him+0.013
 let+0.011
pping+0.009
Neg logit contributions
 biggest-0.060
ini-0.059
umm-0.059
err-0.056
 jungle-0.056
Token more
Token ID517
Feature activation+5.28e-02

Pos logit contributions
ini+0.063
umm+0.062
 biggest+0.057
omm+0.057
err+0.057
Neg logit contributions
 it-0.087
 let-0.073
 him-0.072
 instead-0.066
pped-0.066
Token noises
Token ID26782
Feature activation-3.95e-02

Pos logit contributions
umm+0.082
 biggest+0.081
ini+0.078
omm+0.073
err+0.073
Neg logit contributions
 it-0.122
 him-0.099
 let-0.098
pped-0.096
 instead-0.092
Token was
Token ID373
Feature activation-1.39e-02

Pos logit contributions
pped+0.057
pping+0.053
ged+0.050
 it+0.049
assed+0.047
Neg logit contributions
 biggest-0.047
 best-0.044
 world-0.041
err-0.041
 sky-0.040
Token lonely
Token ID21757
Feature activation-2.08e-02

Pos logit contributions
 it+0.023
pped+0.022
pping+0.020
ged+0.019
 him+0.019
Neg logit contributions
 biggest-0.027
 best-0.025
 sun-0.024
err-0.024
ini-0.024
Token.
Token ID13
Feature activation-5.56e-03

Pos logit contributions
pped+0.035
pping+0.030
 it+0.030
 whoever+0.029
ged+0.028
Neg logit contributions
 biggest-0.042
 best-0.041
 world-0.040
 sun-0.038
 old-0.037
Token
Token ID198
Feature activation+7.27e-03

Pos logit contributions
 it+0.010
pped+0.009
pping+0.009
 him+0.008
 let+0.008
Neg logit contributions
 biggest-0.010
umm-0.010
ini-0.010
err-0.009
 jungle-0.009
Token
Token ID198
Feature activation+2.94e-04

Pos logit contributions
 biggest+0.013
ini+0.013
umm+0.012
err+0.012
 jungle+0.012
Neg logit contributions
 it-0.014
pped-0.012
pping-0.011
 let-0.011
 him-0.011
TokenThe
Token ID464
Feature activation+1.47e-01

Pos logit contributions
 biggest+0.001
ini+0.000
err+0.000
 jungle+0.000
umm+0.000
Neg logit contributions
 it-0.001
pped-0.000
 him-0.000
pping-0.000
 let-0.000
Token little
Token ID1310
Feature activation+1.22e-01

Pos logit contributions
umm+0.260
 former+0.235
omo+0.213
opl+0.209
ini+0.199
Neg logit contributions
 it-0.316
 let-0.294
 that-0.249
 put-0.245
 this-0.245
Token girl
Token ID2576
Feature activation-4.50e-03

Pos logit contributions
 biggest+0.120
umm+0.120
 former+0.118
err+0.110
ini+0.108
Neg logit contributions
 it-0.342
 let-0.298
 instead-0.279
 him-0.274
 if-0.272
Token said
Token ID531
Feature activation-3.27e-02

Pos logit contributions
 it+0.007
pped+0.007
pping+0.006
 him+0.006
 risking+0.005
Neg logit contributions
 biggest-0.009
ini-0.008
 jungle-0.008
err-0.008
 eldest-0.008
Token "
Token ID366
Feature activation-1.64e-02

Pos logit contributions
pped+0.054
pping+0.053
ge+0.051
assed+0.047
sels+0.043
Neg logit contributions
 biggest-0.062
 best-0.061
 sun-0.058
 new-0.058
 old-0.056
TokenDon
Token ID3987
Feature activation-2.31e-02

Pos logit contributions
 it+0.029
pped+0.025
pping+0.023
 him+0.023
 let+0.022
Neg logit contributions
 biggest-0.032
ini-0.031
umm-0.031
err-0.030
 jungle-0.028
Token became
Token ID2627
Feature activation+2.24e-02

Pos logit contributions
 it+0.023
pped+0.021
pping+0.019
 risking+0.017
 whoever+0.017
Neg logit contributions
 biggest-0.032
 jungle-0.030
 sun-0.029
 best-0.029
ini-0.029
Token friends
Token ID2460
Feature activation-1.15e-02

Pos logit contributions
ini+0.029
umm+0.029
 biggest+0.028
err+0.026
omm+0.025
Neg logit contributions
 it-0.058
 let-0.048
 him-0.048
pped-0.047
 instead-0.046
Token.
Token ID13
Feature activation-6.53e-03

Pos logit contributions
pped+0.017
 it+0.016
pping+0.014
 whoever+0.014
assed+0.013
Neg logit contributions
 biggest-0.023
 best-0.023
 world-0.022
 jungle-0.022
 sun-0.022
Token Now
Token ID2735
Feature activation-4.30e-02

Pos logit contributions
 it+0.012
pped+0.010
 him+0.010
 let+0.010
pping+0.009
Neg logit contributions
 biggest-0.013
ini-0.012
umm-0.012
err-0.011
 jungle-0.011
Token whenever
Token ID8797
Feature activation-3.56e-02

Pos logit contributions
pped+0.069
pping+0.060
ged+0.052
ge+0.051
ms+0.051
Neg logit contributions
 best-0.090
 biggest-0.090
 sun-0.089
 jungle-0.084
 little-0.081
Token the
Token ID262
Feature activation+1.47e-01

Pos logit contributions
pped+0.039
pping+0.034
 it+0.031
ged+0.028
assed+0.025
Neg logit contributions
 biggest-0.092
 sun-0.086
 best-0.086
 jungle-0.085
umm-0.083
Token bear
Token ID6842
Feature activation-6.05e-04

Pos logit contributions
anny+0.309
ini+0.304
rig+0.291
err+0.289
andel+0.287
Neg logit contributions
 it-0.230
 let-0.205
 him-0.181
 don-0.166
 instead-0.165
Token gets
Token ID3011
Feature activation+2.34e-03

Pos logit contributions
 it+0.001
pped+0.001
pping+0.001
 him+0.001
 let+0.001
Neg logit contributions
 biggest-0.001
 jungle-0.001
ini-0.001
umm-0.001
err-0.001
Token hungry
Token ID14720
Feature activation-8.57e-03

Pos logit contributions
 biggest+0.005
umm+0.005
ini+0.005
err+0.005
 jungle+0.005
Neg logit contributions
 it-0.004
pped-0.003
 him-0.003
 let-0.003
pping-0.003
Token,
Token ID11
Feature activation-3.47e-02

Pos logit contributions
 it+0.011
pped+0.009
pping+0.008
 let+0.007
 him+0.007
Neg logit contributions
 biggest-0.021
umm-0.020
ini-0.019
 jungle-0.019
 sun-0.019
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
pped+0.038
pping+0.033
 it+0.031
ged+0.027
assed+0.025
Neg logit contributions
 biggest-0.091
 sun-0.083
 best-0.083
 jungle-0.083
umm-0.081
Token no
Token ID645
Feature activation+7.79e-02

Pos logit contributions
 it+0.024
pped+0.024
pping+0.022
ged+0.021
ms+0.020
Neg logit contributions
 biggest-0.033
 best-0.031
 former-0.030
 elderly-0.030
 older-0.030
Token choice
Token ID3572
Feature activation-2.98e-02

Pos logit contributions
umm+0.129
ini+0.125
err+0.119
 biggest+0.119
 jungle+0.115
Neg logit contributions
 it-0.168
 him-0.140
 let-0.140
 instead-0.134
pped-0.134
Token but
Token ID475
Feature activation-2.82e-02

Pos logit contributions
pped+0.046
pping+0.040
 it+0.037
 breathe+0.037
sels+0.036
Neg logit contributions
 best-0.054
 biggest-0.053
 new-0.051
 old-0.049
 older-0.047
Token to
Token ID284
Feature activation-3.84e-02

Pos logit contributions
 it+0.036
pped+0.034
pping+0.031
 him+0.026
 whoever+0.026
Neg logit contributions
 biggest-0.066
 best-0.060
umm-0.060
ini-0.059
err-0.059
Token leave
Token ID2666
Feature activation-4.85e-02

Pos logit contributions
 it+0.053
pped+0.053
ged+0.049
pping+0.044
ges+0.044
Neg logit contributions
 biggest-0.076
 best-0.074
 former-0.073
 youngest-0.071
 elderly-0.071
Token the
Token ID262
Feature activation+1.47e-01

Pos logit contributions
pped+0.048
 it+0.047
pping+0.045
 let+0.039
 risking+0.035
Neg logit contributions
 biggest-0.119
 best-0.109
 sun-0.108
ini-0.108
 world-0.107
Token park
Token ID3952
Feature activation-2.80e-02

Pos logit contributions
umm+0.238
err+0.223
ubby+0.216
omm+0.214
ini+0.209
Neg logit contributions
 it-0.307
 let-0.271
pped-0.257
 him-0.245
pping-0.244
Token and
Token ID290
Feature activation-3.61e-02

Pos logit contributions
 it+0.036
pped+0.034
pping+0.029
 let+0.027
 him+0.026
Neg logit contributions
 biggest-0.065
ini-0.060
 best-0.060
 jungle-0.059
umm-0.059
Token go
Token ID467
Feature activation-2.67e-02

Pos logit contributions
pped+0.047
 it+0.044
pping+0.043
ged+0.036
 risking+0.033
Neg logit contributions
 biggest-0.082
 former-0.074
 best-0.074
 sun-0.073
 jungle-0.073
Token home
Token ID1363
Feature activation-3.11e-02

Pos logit contributions
 it+0.051
pped+0.049
pping+0.045
 whoever+0.043
 let+0.042
Neg logit contributions
 biggest-0.044
ini-0.038
 best-0.038
 jungle-0.038
 sun-0.038
Token without
Token ID1231
Feature activation-1.99e-02

Pos logit contributions
pped+0.043
 it+0.041
pping+0.037
 whoever+0.032
 him+0.032
Neg logit contributions
 biggest-0.068
 elderly-0.064
 best-0.064
 former-0.064
ini-0.063
Token that
Token ID326
Feature activation+3.69e-02

Pos logit contributions
 biggest+0.040
ini+0.038
umm+0.038
err+0.037
 jungle+0.037
Neg logit contributions
 it-0.010
pped-0.007
pping-0.006
 let-0.005
 him-0.005
Token day
Token ID1110
Feature activation-2.61e-02

Pos logit contributions
 biggest+0.038
umm+0.036
ini+0.035
err+0.032
 former+0.030
Neg logit contributions
 it-0.104
pped-0.088
 let-0.088
 him-0.087
pping-0.086
Token on
Token ID319
Feature activation-1.78e-02

Pos logit contributions
 it+0.034
pped+0.026
 whoever+0.026
 him+0.026
 let+0.025
Neg logit contributions
 biggest-0.059
 best-0.058
 sun-0.055
ubby-0.054
ama-0.053
Token,
Token ID11
Feature activation-3.08e-02

Pos logit contributions
pped+0.027
pping+0.025
 it+0.025
 let+0.023
assed+0.022
Neg logit contributions
 biggest-0.035
 best-0.035
 jungle-0.033
 sun-0.033
ama-0.032
Token whenever
Token ID8797
Feature activation-2.88e-02

Pos logit contributions
 it+0.029
pped+0.027
pping+0.022
 let+0.020
 him+0.018
Neg logit contributions
 biggest-0.084
umm-0.079
ini-0.078
 jungle-0.078
 sun-0.076
Token the
Token ID262
Feature activation+1.47e-01

Pos logit contributions
pped+0.029
 it+0.027
pping+0.025
ged+0.020
 let+0.020
Neg logit contributions
 biggest-0.076
 jungle-0.071
umm-0.070
 sun-0.070
 best-0.069
Token little
Token ID1310
Feature activation+1.11e-01

Pos logit contributions
ini+0.236
 former+0.225
umm+0.222
omm+0.215
err+0.210
Neg logit contributions
 it-0.345
 let-0.269
 that-0.265
 the-0.251
 in-0.249
Token bird
Token ID6512
Feature activation+4.24e-04

Pos logit contributions
ini+0.171
omm+0.161
 former+0.160
 biggest+0.159
err+0.159
Neg logit contributions
 it-0.266
 let-0.210
 him-0.205
 instead-0.198
pped-0.190
Token flew
Token ID13112
Feature activation-1.95e-02

Pos logit contributions
 biggest+0.001
ini+0.001
umm+0.001
 jungle+0.001
err+0.001
Neg logit contributions
 it-0.001
pped-0.001
pping-0.000
 him-0.000
 let-0.000
Token in
Token ID287
Feature activation-8.53e-03

Pos logit contributions
 it+0.024
pped+0.019
pping+0.017
 let+0.017
 him+0.016
Neg logit contributions
 biggest-0.050
umm-0.047
ini-0.046
err-0.045
 jungle-0.045
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
 it+0.010
pped+0.008
pping+0.007
 let+0.007
 him+0.007
Neg logit contributions
 biggest-0.022
umm-0.020
ini-0.020
err-0.020
 jungle-0.020
Token proud
Token ID6613
Feature activation-2.41e-03

Pos logit contributions
 biggest+0.023
ini+0.021
umm+0.021
err+0.019
 jungle+0.018
Neg logit contributions
 it-0.047
pped-0.043
 him-0.040
pping-0.040
 let-0.040
Token of
Token ID286
Feature activation-2.46e-02

Pos logit contributions
 it+0.003
pped+0.003
pping+0.003
 him+0.003
 let+0.003
Neg logit contributions
 biggest-0.005
umm-0.005
ini-0.005
 sun-0.005
 jungle-0.005
Token it
Token ID340
Feature activation-2.36e-02

Pos logit contributions
pped+0.021
pping+0.017
 it+0.016
ged+0.014
 argued+0.014
Neg logit contributions
 biggest-0.065
 sun-0.063
 jungle-0.061
 best-0.060
 world-0.060
Token.
Token ID13
Feature activation+2.53e-03

Pos logit contributions
pped+0.040
 it+0.033
ged+0.032
pping+0.032
assed+0.032
Neg logit contributions
 biggest-0.047
 best-0.046
 sun-0.042
 most-0.040
 big-0.039
Token Whenever
Token ID21326
Feature activation-1.79e-02

Pos logit contributions
 biggest+0.005
ini+0.005
umm+0.005
err+0.005
 jungle+0.004
Neg logit contributions
 it-0.005
pped-0.004
 him-0.004
 let-0.004
pping-0.003
Token the
Token ID262
Feature activation+1.47e-01

Pos logit contributions
pped+0.019
pping+0.019
 it+0.016
assed+0.015
ge+0.015
Neg logit contributions
 biggest-0.045
 sun-0.043
 jungle-0.042
 best-0.042
 sky-0.042
Token train
Token ID4512
Feature activation+1.96e-02

Pos logit contributions
umm+0.328
err+0.303
 former+0.291
igsaw+0.289
omm+0.280
Neg logit contributions
 it-0.277
 let-0.230
pped-0.191
 that-0.190
 the-0.190
Token left
Token ID1364
Feature activation-1.35e-02

Pos logit contributions
 biggest+0.045
ini+0.043
umm+0.042
 jungle+0.041
err+0.041
Neg logit contributions
 it-0.028
pped-0.024
pping-0.021
 him-0.021
 let-0.020
Token the
Token ID262
Feature activation+1.31e-01

Pos logit contributions
 it+0.016
pped+0.015
pping+0.014
 let+0.013
 him+0.012
Neg logit contributions
 biggest-0.033
 sun-0.030
ini-0.030
 best-0.030
 jungle-0.030
Token station
Token ID4429
Feature activation-1.07e-02

Pos logit contributions
umm+0.248
err+0.228
omm+0.221
igsaw+0.209
 former+0.196
Neg logit contributions
 it-0.267
 let-0.226
 him-0.202
pping-0.202
pped-0.201
Token everyone
Token ID2506
Feature activation-5.37e-03

Pos logit contributions
 it+0.014
pped+0.013
pping+0.012
 him+0.010
 let+0.010
Neg logit contributions
 biggest-0.025
ini-0.023
 jungle-0.023
 best-0.023
umm-0.023
Token of
Token ID286
Feature activation+4.95e-02

Pos logit contributions
 biggest+0.021
umm+0.019
ini+0.019
 sun+0.018
 best+0.018
Neg logit contributions
 it-0.029
pped-0.027
pping-0.025
 him-0.024
 let-0.024
Token things
Token ID1243
Feature activation-3.53e-02

Pos logit contributions
 biggest+0.067
umm+0.062
ini+0.062
err+0.058
 jungle+0.055
Neg logit contributions
 it-0.120
pped-0.105
 him-0.101
 let-0.100
pping-0.100
Token.
Token ID13
Feature activation-1.54e-03

Pos logit contributions
pped+0.052
pping+0.046
 it+0.040
assed+0.038
gency+0.038
Neg logit contributions
 biggest-0.070
 best-0.068
 new-0.065
 old-0.063
 sun-0.063
Token\n
Token ID198
Feature activation+1.89e-02

Pos logit contributions
 it+0.003
 him+0.002
pped+0.002
 let+0.002
 whoever+0.002
Neg logit contributions
 biggest-0.003
ini-0.003
umm-0.003
err-0.003
 jungle-0.003
Token\n
Token ID198
Feature activation+1.18e-02

Pos logit contributions
 biggest+0.039
umm+0.037
ini+0.037
err+0.035
 jungle+0.034
Neg logit contributions
 it-0.034
pped-0.028
 him-0.026
 let-0.026
pping-0.025
TokenThe
Token ID464
Feature activation+1.47e-01

Pos logit contributions
 biggest+0.022
ini+0.021
umm+0.020
err+0.020
 jungle+0.020
Neg logit contributions
 it-0.022
pped-0.019
 him-0.018
pping-0.017
 let-0.017
Token cats
Token ID11875
Feature activation-1.47e-02

Pos logit contributions
umm+0.266
ini+0.254
 ey+0.248
err+0.244
 biggest+0.228
Neg logit contributions
 it-0.281
 let-0.242
 instead-0.224
 each-0.220
 that-0.215
Token had
Token ID550
Feature activation-1.29e-02

Pos logit contributions
 it+0.021
pped+0.020
pping+0.017
 risking+0.017
 him+0.016
Neg logit contributions
 biggest-0.029
 jungle-0.028
 neighbourhood-0.028
 youngest-0.028
 sky-0.028
Token so
Token ID523
Feature activation+6.08e-03

Pos logit contributions
pped+0.021
 it+0.020
pping+0.020
ged+0.018
ges+0.017
Neg logit contributions
 biggest-0.025
 best-0.022
 jungle-0.022
 elderly-0.022
 sun-0.022
Token much
Token ID881
Feature activation+2.27e-02

Pos logit contributions
umm+0.011
 biggest+0.011
ini+0.010
omm+0.010
err+0.010
Neg logit contributions
 it-0.013
pped-0.010
 him-0.010
 let-0.010
 instead-0.010
Token fun
Token ID1257
Feature activation-7.18e-03

Pos logit contributions
 biggest+0.029
ini+0.027
umm+0.027
err+0.025
omm+0.023
Neg logit contributions
 it-0.057
pped-0.050
 him-0.048
 let-0.048
pping-0.047
Token its
Token ID663
Feature activation+9.54e-02

Pos logit contributions
 it+0.038
pped+0.035
pping+0.030
 let+0.027
 him+0.024
Neg logit contributions
 biggest-0.100
umm-0.093
ini-0.093
 sun-0.091
 jungle-0.091
Token beauty
Token ID8737
Feature activation-1.04e-02

Pos logit contributions
umm+0.125
ini+0.118
 biggest+0.115
omm+0.114
err+0.111
Neg logit contributions
 it-0.236
 him-0.196
 let-0.194
 instead-0.185
 how-0.183
Token.
Token ID13
Feature activation+4.28e-04

Pos logit contributions
pped+0.014
 it+0.013
pping+0.012
 whoever+0.011
 him+0.011
Neg logit contributions
 biggest-0.024
 sun-0.022
umm-0.022
 sky-0.022
ini-0.022
Token\n
Token ID198
Feature activation+2.52e-02

Pos logit contributions
 biggest+0.001
ini+0.001
umm+0.001
err+0.001
 jungle+0.001
Neg logit contributions
 it-0.001
pped-0.001
 him-0.001
 let-0.001
pping-0.001
Token\n
Token ID198
Feature activation+1.89e-02

Pos logit contributions
 biggest+0.054
umm+0.053
ini+0.050
err+0.048
 eldest+0.047
Neg logit contributions
 it-0.044
pped-0.035
 him-0.033
 let-0.033
pping-0.031
TokenThe
Token ID464
Feature activation+1.46e-01

Pos logit contributions
 biggest+0.037
umm+0.035
ini+0.035
err+0.033
 jungle+0.032
Neg logit contributions
 it-0.035
pped-0.029
 let-0.027
 him-0.027
pping-0.027
Token tiger
Token ID26241
Feature activation-1.90e-03

Pos logit contributions
umm+0.215
ini+0.203
omm+0.202
err+0.197
 biggest+0.191
Neg logit contributions
 it-0.324
 let-0.285
 that-0.250
 instead-0.250
 don-0.248
Token moved
Token ID3888
Feature activation-3.64e-02

Pos logit contributions
 it+0.003
pped+0.003
pping+0.002
 him+0.002
 let+0.002
Neg logit contributions
 biggest-0.004
ini-0.004
 jungle-0.004
umm-0.004
err-0.004
Token from
Token ID422
Feature activation-1.16e-02

Pos logit contributions
 it+0.047
pped+0.044
pping+0.041
 let+0.037
 whoever+0.035
Neg logit contributions
 biggest-0.085
ini-0.077
err-0.077
umm-0.076
 sun-0.076
Token the
Token ID262
Feature activation+1.23e-01

Pos logit contributions
 it+0.016
 him+0.011
pped+0.011
 let+0.010
 instead+0.009
Neg logit contributions
 biggest-0.029
ini-0.029
umm-0.029
err-0.027
omm-0.027
Token forest
Token ID8222
Feature activation-2.94e-02

Pos logit contributions
ini+0.166
omm+0.159
err+0.156
umm+0.155
ubby+0.141
Neg logit contributions
 it-0.293
 let-0.245
 him-0.244
 instead-0.227
 this-0.225
Token your
Token ID534
Feature activation+8.80e-02

Pos logit contributions
 it+0.046
pped+0.044
pping+0.041
 let+0.039
ged+0.037
Neg logit contributions
 biggest-0.064
 sun-0.057
ini-0.055
 jungle-0.055
umm-0.055
Token camera
Token ID4676
Feature activation-2.04e-02

Pos logit contributions
umm+0.157
ini+0.140
err+0.138
omm+0.133
 Arctic+0.130
Neg logit contributions
 it-0.199
 let-0.156
 him-0.146
 how-0.140
 this-0.140
Token."
Token ID526
Feature activation-2.37e-03

Pos logit contributions
 it+0.026
pped+0.024
pping+0.020
ged+0.019
 risking+0.019
Neg logit contributions
 biggest-0.050
umm-0.046
ini-0.045
 best-0.045
 sun-0.045
Token\n
Token ID198
Feature activation+1.72e-02

Pos logit contributions
 it+0.005
pped+0.004
pping+0.004
 let+0.004
 him+0.004
Neg logit contributions
 biggest-0.004
umm-0.004
ini-0.004
err-0.004
 jungle-0.004
Token\n
Token ID198
Feature activation+9.00e-03

Pos logit contributions
 biggest+0.034
umm+0.033
ini+0.033
err+0.031
 jungle+0.030
Neg logit contributions
 it-0.031
pped-0.026
 him-0.024
 let-0.024
pping-0.024
TokenThe
Token ID464
Feature activation+1.46e-01

Pos logit contributions
 biggest+0.016
ini+0.015
umm+0.015
 jungle+0.015
err+0.014
Neg logit contributions
 it-0.017
pped-0.015
pping-0.014
 him-0.014
 let-0.013
Token little
Token ID1310
Feature activation+1.16e-01

Pos logit contributions
umm+0.286
igsaw+0.265
 former+0.260
ini+0.235
anny+0.232
Neg logit contributions
 it-0.309
 let-0.282
 that-0.228
 how-0.224
 put-0.223
Token boy
Token ID2933
Feature activation-4.84e-03

Pos logit contributions
umm+0.151
 former+0.151
err+0.142
omm+0.140
ini+0.140
Neg logit contributions
 it-0.303
 let-0.258
 him-0.247
 instead-0.235
 how-0.232
Token took
Token ID1718
Feature activation-3.44e-02

Pos logit contributions
 it+0.008
pped+0.008
pping+0.007
 risking+0.007
 him+0.006
Neg logit contributions
 biggest-0.009
ini-0.008
 jungle-0.008
 youngest-0.008
err-0.008
Token out
Token ID503
Feature activation-7.63e-03

Pos logit contributions
 it+0.032
pped+0.019
 him+0.018
 let+0.016
pping+0.013
Neg logit contributions
 biggest-0.101
ini-0.100
umm-0.100
err-0.096
 former-0.094
Token his
Token ID465
Feature activation+9.42e-02

Pos logit contributions
 it+0.010
 him+0.007
pped+0.007
 let+0.006
 how+0.006
Neg logit contributions
umm-0.021
ini-0.020
 biggest-0.020
err-0.018
 former-0.018
Token fun
Token ID1257
Feature activation-4.37e-03

Pos logit contributions
ini+0.061
umm+0.057
 biggest+0.055
omm+0.055
 eldest+0.054
Neg logit contributions
 it-0.070
 let-0.055
 him-0.054
 that-0.053
 instead-0.052
Token together
Token ID1978
Feature activation-4.18e-02

Pos logit contributions
pped+0.006
 it+0.005
pping+0.005
 let+0.005
 him+0.005
Neg logit contributions
 jungle-0.009
 biggest-0.009
 sun-0.009
 best-0.009
ini-0.009
Token.
Token ID13
Feature activation-6.78e-05

Pos logit contributions
pped+0.083
ocated+0.076
cher+0.072
uable+0.071
mine+0.070
Neg logit contributions
 sun-0.061
 best-0.061
 jungle-0.055
 new-0.053
 little-0.051
Token Finally
Token ID9461
Feature activation-2.53e-02

Pos logit contributions
 it+0.000
 him+0.000
pped+0.000
 let+0.000
pping+0.000
Neg logit contributions
 biggest-0.000
ini-0.000
umm-0.000
err-0.000
 former-0.000
Token,
Token ID11
Feature activation-3.29e-02

Pos logit contributions
pping+0.043
pped+0.042
ge+0.038
ms+0.037
assed+0.037
Neg logit contributions
 sun-0.046
 best-0.045
 biggest-0.043
 jungle-0.039
 new-0.038
Token the
Token ID262
Feature activation+1.46e-01

Pos logit contributions
pped+0.029
 it+0.028
pping+0.026
 him+0.020
 let+0.019
Neg logit contributions
 biggest-0.089
 sun-0.084
 jungle-0.082
umm-0.082
 best-0.081
Token little
Token ID1310
Feature activation+1.12e-01

Pos logit contributions
igsaw+0.294
umm+0.293
ini+0.279
ady+0.276
anny+0.273
Neg logit contributions
 it-0.263
 let-0.242
 did-0.191
 that-0.190
 landed-0.188
Token boy
Token ID2933
Feature activation-1.27e-02

Pos logit contributions
err+0.144
 former+0.142
umm+0.142
ini+0.140
igsaw+0.139
Neg logit contributions
 it-0.286
 let-0.245
 him-0.226
 instead-0.225
 how-0.214
Token said
Token ID531
Feature activation-2.88e-02

Pos logit contributions
 it+0.019
pped+0.019
pping+0.018
ged+0.016
 risking+0.016
Neg logit contributions
 jungle-0.024
 biggest-0.024
 sun-0.023
 best-0.023
 neighbor-0.023
Token,
Token ID11
Feature activation-2.84e-02

Pos logit contributions
pped+0.053
pping+0.050
ge+0.043
assed+0.042
 risking+0.041
Neg logit contributions
 biggest-0.055
 best-0.053
 sun-0.050
 new-0.048
 older-0.046
Token "
Token ID366
Feature activation-1.51e-02

Pos logit contributions
pped+0.066
pping+0.063
ge+0.059
ms+0.058
assed+0.057
Neg logit contributions
 sun-0.037
 biggest-0.034
 elderly-0.030
 sky-0.030
 best-0.029
Token a
Token ID257
Feature activation+8.29e-02

Pos logit contributions
pping+0.044
pped+0.044
 it+0.044
 let+0.037
 him+0.036
Neg logit contributions
 biggest-0.084
 sun-0.077
 best-0.075
 sky-0.074
umm-0.074
Token persistent
Token ID16218
Feature activation+7.82e-02

Pos logit contributions
err+0.114
umm+0.112
ini+0.106
 biggest+0.105
 former+0.105
Neg logit contributions
 it-0.211
 let-0.175
 that-0.160
 him-0.152
 the-0.150
Token lady
Token ID10846
Feature activation-6.04e-03

Pos logit contributions
 biggest+0.098
umm+0.089
ini+0.088
err+0.083
 former+0.080
Neg logit contributions
 it-0.206
 let-0.172
pped-0.169
 him-0.166
pping-0.160
Token came
Token ID1625
Feature activation-2.67e-02

Pos logit contributions
 it+0.009
pping+0.008
pped+0.008
 instead+0.007
 whoever+0.007
Neg logit contributions
ini-0.011
 jungle-0.011
 biggest-0.011
 sky-0.010
 eldest-0.010
Token up
Token ID510
Feature activation-2.76e-02

Pos logit contributions
 it+0.036
pped+0.033
pping+0.030
 whoever+0.028
 let+0.027
Neg logit contributions
 biggest-0.058
err-0.057
ini-0.056
 world-0.055
 elderly-0.055
Token to
Token ID284
Feature activation-8.52e-02

Pos logit contributions
 it+0.031
pped+0.028
pping+0.026
 let+0.022
 whoever+0.021
Neg logit contributions
 biggest-0.066
err-0.065
 elderly-0.064
ini-0.064
umm-0.063
Token her
Token ID607
Feature activation+5.19e-02

Pos logit contributions
pping+0.131
pped+0.130
ched+0.111
ms+0.104
 risking+0.104
Neg logit contributions
 little-0.150
 girl-0.150
 sun-0.146
 new-0.146
 elderly-0.143
Token and
Token ID290
Feature activation-4.28e-02

Pos logit contributions
 biggest+0.129
ini+0.123
umm+0.123
err+0.119
 jungle+0.116
Neg logit contributions
 it-0.067
pped-0.053
 him-0.047
 let-0.047
pping-0.047
Token told
Token ID1297
Feature activation-6.24e-02

Pos logit contributions
pped+0.072
pping+0.072
 it+0.064
 him+0.057
sels+0.056
Neg logit contributions
 biggest-0.074
 elderly-0.073
 girl-0.071
ini-0.068
 eldest-0.068
Token her
Token ID607
Feature activation+3.17e-03

Pos logit contributions
pped+0.094
pping+0.091
 it+0.085
 risking+0.078
ged+0.078
Neg logit contributions
 biggest-0.118
 sun-0.112
 little-0.109
 old-0.107
 new-0.107
Token she
Token ID673
Feature activation-1.70e-02

Pos logit contributions
 biggest+0.008
 best+0.007
 sun+0.007
 neighbor+0.007
err+0.007
Neg logit contributions
pped-0.004
pping-0.004
 it-0.003
ged-0.003
 him-0.003
Token grandma
Token ID49890
Feature activation+5.70e-03

Pos logit contributions
umm+0.119
err+0.102
ini+0.100
 biggest+0.096
umpy+0.093
Neg logit contributions
 it-0.195
 let-0.164
 him-0.157
 instead-0.157
 that-0.153
Token's
Token ID338
Feature activation+9.55e-02

Pos logit contributions
 biggest+0.011
ini+0.011
 best+0.011
 older+0.010
 elderly+0.010
Neg logit contributions
pped-0.009
 it-0.009
 risking-0.007
pping-0.007
ged-0.007
Token house
Token ID2156
Feature activation-3.38e-02

Pos logit contributions
umm+0.125
 biggest+0.113
err+0.109
ini+0.106
omm+0.101
Neg logit contributions
 it-0.239
 him-0.202
 let-0.199
 instead-0.192
 that-0.187
Token for
Token ID329
Feature activation-2.62e-02

Pos logit contributions
 it+0.046
pped+0.045
pping+0.038
 risking+0.036
 whoever+0.035
Neg logit contributions
 biggest-0.074
 best-0.068
ini-0.068
 elderly-0.068
 world-0.067
Token safe
Token ID3338
Feature activation-1.80e-02

Pos logit contributions
 it+0.035
pped+0.034
pping+0.034
 let+0.030
ched+0.028
Neg logit contributions
 biggest-0.053
 elderly-0.050
 sun-0.050
 former-0.049
 jungle-0.048
Tokenkeeping
Token ID19934
Feature activation-8.34e-02

Pos logit contributions
pped+0.026
 it+0.025
pping+0.024
 whoever+0.022
 let+0.022
Neg logit contributions
 biggest-0.035
 world-0.035
ini-0.034
 jungle-0.034
 elderly-0.033
Token.
Token ID13
Feature activation+1.12e-03

Pos logit contributions
pped+0.133
 it+0.128
 whoever+0.123
pping+0.111
ck+0.111
Neg logit contributions
 best-0.134
 biggest-0.134
 world-0.128
 sky-0.120
ini-0.120
Token
Token ID220
Feature activation+2.16e-02

Pos logit contributions
 biggest+0.002
umm+0.002
ini+0.002
err+0.002
ubby+0.002
Neg logit contributions
 it-0.002
 him-0.001
pped-0.001
 let-0.001
 instead-0.001
Token\n
Token ID198
Feature activation+9.84e-03

Pos logit contributions
 biggest+0.043
umm+0.041
ini+0.041
err+0.039
 jungle+0.038
Neg logit contributions
 it-0.038
pped-0.032
 him-0.030
 let-0.030
pping-0.029
Token\n
Token ID198
Feature activation+1.15e-02

Pos logit contributions
 biggest+0.019
ini+0.018
umm+0.018
err+0.017
 jungle+0.017
Neg logit contributions
 it-0.018
pped-0.015
 him-0.014
 let-0.014
pping-0.014
TokenWhen
Token ID2215
Feature activation-2.12e-02

Pos logit contributions
 biggest+0.020
ini+0.020
umm+0.018
err+0.018
 jungle+0.018
Neg logit contributions
 it-0.023
pped-0.019
pping-0.018
 him-0.018
 let-0.018
Token it
Token ID340
Feature activation-2.29e-02

Pos logit contributions
 it+0.023
pped+0.019
pping+0.017
 let+0.016
 him+0.015
Neg logit contributions
 biggest-0.049
umm-0.045
ini-0.045
err-0.045
 jungle-0.044
Token okay
Token ID8788
Feature activation-4.04e-02

Pos logit contributions
 it+0.035
pped+0.032
pping+0.028
 him+0.027
 let+0.027
Neg logit contributions
 biggest-0.050
ini-0.046
umm-0.046
err-0.044
 jungle-0.044
Token if
Token ID611
Feature activation-3.52e-02

Pos logit contributions
pped+0.057
 it+0.056
pping+0.049
ged+0.046
 him+0.043
Neg logit contributions
 biggest-0.089
ini-0.083
umm-0.081
 best-0.080
 world-0.080
Token I
Token ID314
Feature activation-1.69e-02

Pos logit contributions
pped+0.039
 it+0.037
pping+0.036
ged+0.029
 let+0.027
Neg logit contributions
 biggest-0.088
umm-0.080
 sun-0.079
 best-0.079
 jungle-0.078
Token rest
Token ID1334
Feature activation-4.38e-02

Pos logit contributions
 it+0.026
pped+0.025
pping+0.023
ged+0.021
 risking+0.020
Neg logit contributions
 biggest-0.033
ini-0.031
 world-0.030
 sun-0.030
 best-0.030
Token here
Token ID994
Feature activation-8.31e-02

Pos logit contributions
pped+0.058
 it+0.054
 whoever+0.051
pping+0.050
 let+0.047
Neg logit contributions
 biggest-0.095
 best-0.088
 world-0.088
 sun-0.085
ini-0.084
Token too
Token ID1165
Feature activation-2.18e-02

Pos logit contributions
pped+0.114
 it+0.108
ged+0.097
pping+0.096
 risking+0.095
Neg logit contributions
 biggest-0.180
 world-0.163
 best-0.160
 sun-0.159
 former-0.154
Token?'
Token ID8348
Feature activation-1.15e-02

Pos logit contributions
 it+0.031
pped+0.026
pping+0.023
 him+0.023
 let+0.023
Neg logit contributions
 biggest-0.051
ini-0.048
umm-0.048
err-0.046
 jungle-0.045
Token Fred
Token ID8559
Feature activation-1.19e-02

Pos logit contributions
 it+0.021
pped+0.019
pping+0.018
 let+0.016
 him+0.016
Neg logit contributions
 biggest-0.022
ini-0.020
umm-0.020
err-0.019
 jungle-0.019
Token smiled
Token ID13541
Feature activation-1.12e-02

Pos logit contributions
 it+0.018
pped+0.017
pping+0.015
 him+0.014
 risking+0.013
Neg logit contributions
 biggest-0.024
err-0.023
ini-0.022
 jungle-0.022
 eldest-0.022
Token and
Token ID290
Feature activation-3.37e-02

Pos logit contributions
 it+0.014
pped+0.012
pping+0.011
 let+0.010
 whoever+0.009
Neg logit contributions
 biggest-0.027
ini-0.026
umm-0.026
err-0.025
 sun-0.025
Token let
Token ID1309
Feature activation-2.88e-02

Pos logit contributions
pped+0.046
pping+0.040
ged+0.038
 it+0.035
 him+0.031
Neg logit contributions
 biggest-0.118
 sun-0.110
 best-0.110
 most-0.104
umm-0.104
Token's
Token ID338
Feature activation-6.01e-03

Pos logit contributions
 it+0.039
pped+0.035
pping+0.032
 him+0.030
ged+0.029
Neg logit contributions
 biggest-0.066
ini-0.062
umm-0.059
err-0.058
 best-0.058
Token rock
Token ID3881
Feature activation-4.01e-02

Pos logit contributions
 it+0.008
pped+0.007
ged+0.007
pping+0.006
 him+0.006
Neg logit contributions
 biggest-0.014
umm-0.013
 sun-0.013
ini-0.012
 jungle-0.012
Token back
Token ID736
Feature activation-4.24e-02

Pos logit contributions
 it+0.050
pped+0.048
pping+0.043
 let+0.039
 risking+0.038
Neg logit contributions
 biggest-0.091
ini-0.085
 jungle-0.081
 sun-0.081
 best-0.080
Token and
Token ID290
Feature activation-3.60e-02

Pos logit contributions
 it+0.050
pped+0.047
pping+0.040
 him+0.039
 let+0.036
Neg logit contributions
 biggest-0.104
ini-0.095
 jungle-0.093
 sun-0.093
err-0.093
Token forth
Token ID6071
Feature activation-8.29e-02

Pos logit contributions
pped+0.041
 it+0.037
pping+0.037
ged+0.031
 him+0.028
Neg logit contributions
 biggest-0.087
 sun-0.081
umm-0.081
 best-0.078
ini-0.077
Token to
Token ID284
Feature activation-3.59e-02

Pos logit contributions
pped+0.128
 risking+0.108
 him+0.108
 it+0.105
 argued+0.105
Neg logit contributions
 biggest-0.164
 best-0.158
 sun-0.148
 new-0.139
 most-0.135
Token make
Token ID787
Feature activation-3.06e-02

Pos logit contributions
 it+0.044
pped+0.041
pping+0.035
ged+0.031
 him+0.031
Neg logit contributions
 biggest-0.086
umm-0.079
 jungle-0.079
ini-0.078
 sun-0.078
Token you
Token ID345
Feature activation-9.43e-03

Pos logit contributions
 it+0.042
pped+0.034
 him+0.030
 let+0.030
pping+0.030
Neg logit contributions
 biggest-0.073
ini-0.070
umm-0.070
err-0.067
 jungle-0.066
Token feel
Token ID1254
Feature activation+6.34e-03

Pos logit contributions
 it+0.018
pped+0.017
pping+0.015
 him+0.014
ged+0.014
Neg logit contributions
 biggest-0.018
ini-0.016
umm-0.015
 best-0.015
 tallest-0.015
Token better
Token ID1365
Feature activation-8.44e-03

Pos logit contributions
 biggest+0.011
 sun+0.010
 best+0.009
umm+0.009
ini+0.009
Neg logit contributions
 it-0.012
pped-0.011
pping-0.011
ged-0.010
assed-0.010
Token tree
Token ID5509
Feature activation-2.84e-02

Pos logit contributions
 biggest+0.164
ini+0.160
umm+0.159
 former+0.147
err+0.146
Neg logit contributions
 it-0.212
 let-0.170
pped-0.169
 him-0.161
pping-0.158
Token would
Token ID561
Feature activation-3.03e-02

Pos logit contributions
 it+0.043
pped+0.039
 him+0.035
pping+0.035
 whoever+0.031
Neg logit contributions
 biggest-0.061
ini-0.057
 jungle-0.056
err-0.055
umm-0.055
Token sway
Token ID20009
Feature activation-3.74e-02

Pos logit contributions
 it+0.051
pped+0.049
pping+0.043
ged+0.042
 him+0.041
Neg logit contributions
 biggest-0.059
 sun-0.053
 jungle-0.052
 tallest-0.051
 world-0.051
Token back
Token ID736
Feature activation-2.69e-02

Pos logit contributions
 it+0.048
pped+0.046
pping+0.041
 let+0.038
 whoever+0.038
Neg logit contributions
 biggest-0.088
ini-0.079
 sun-0.079
 best-0.078
umm-0.077
Token and
Token ID290
Feature activation-3.34e-02

Pos logit contributions
 it+0.034
pped+0.030
pping+0.026
 him+0.025
 let+0.025
Neg logit contributions
 biggest-0.066
umm-0.061
ini-0.061
err-0.060
 sun-0.059
Token forth
Token ID6071
Feature activation-8.20e-02

Pos logit contributions
pped+0.051
 it+0.049
pping+0.044
 him+0.039
ged+0.038
Neg logit contributions
 biggest-0.071
 sun-0.065
umm-0.063
 jungle-0.063
 tallest-0.062
Token,
Token ID11
Feature activation-2.98e-02

Pos logit contributions
pped+0.136
 risking+0.115
pping+0.110
assed+0.108
 whoever+0.107
Neg logit contributions
 biggest-0.155
 best-0.152
 sun-0.146
 new-0.137
 big-0.136
Token while
Token ID981
Feature activation-3.06e-02

Pos logit contributions
pped+0.045
 it+0.043
pping+0.037
 him+0.035
ged+0.035
Neg logit contributions
 biggest-0.063
 sun-0.058
 jungle-0.056
 sky-0.056
 tallest-0.056
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
pped+0.029
 it+0.028
pping+0.026
 let+0.022
assed+0.021
Neg logit contributions
 biggest-0.077
 sun-0.075
 jungle-0.073
 sky-0.072
 best-0.071
Token small
Token ID1402
Feature activation+1.05e-01

Pos logit contributions
ini+0.220
umm+0.200
omm+0.198
err+0.187
 former+0.186
Neg logit contributions
 it-0.288
 let-0.242
 the-0.214
 that-0.208
 instead-0.207
Token bush
Token ID24484
Feature activation-2.79e-02

Pos logit contributions
ini+0.177
umm+0.169
 biggest+0.166
err+0.159
 former+0.159
Neg logit contributions
 it-0.235
 let-0.188
 instead-0.177
pped-0.174
 him-0.172
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
pped+0.013
 it+0.012
ged+0.011
pping+0.010
 whoever+0.010
Neg logit contributions
 biggest-0.024
 best-0.022
 sun-0.022
 world-0.021
umm-0.021
Token Just
Token ID2329
Feature activation-3.10e-02

Pos logit contributions
 it+0.031
pped+0.024
 let+0.022
 him+0.021
pping+0.021
Neg logit contributions
 biggest-0.058
ini-0.056
umm-0.056
err-0.054
 jungle-0.053
Token rock
Token ID3881
Feature activation-3.47e-02

Pos logit contributions
 it+0.049
pped+0.047
ged+0.041
 him+0.041
pping+0.040
Neg logit contributions
 biggest-0.063
 sun-0.057
 best-0.055
 tallest-0.054
 jungle-0.053
Token back
Token ID736
Feature activation-3.43e-02

Pos logit contributions
pped+0.046
 it+0.042
 whoever+0.037
pping+0.037
 let+0.035
Neg logit contributions
 biggest-0.077
 best-0.069
ini-0.067
 jungle-0.066
 tallest-0.066
Token and
Token ID290
Feature activation-3.80e-02

Pos logit contributions
 it+0.040
pped+0.037
pping+0.031
 him+0.031
 let+0.029
Neg logit contributions
 biggest-0.085
ini-0.079
 jungle-0.077
err-0.076
 sun-0.076
Token forth
Token ID6071
Feature activation-8.13e-02

Pos logit contributions
pped+0.049
 it+0.046
pping+0.042
ged+0.039
 him+0.038
Neg logit contributions
 biggest-0.088
 sun-0.079
ini-0.077
umm-0.077
 best-0.076
Token and
Token ID290
Feature activation-3.65e-02

Pos logit contributions
pped+0.113
 it+0.103
 argued+0.100
 him+0.098
 fought+0.092
Neg logit contributions
 biggest-0.177
 best-0.164
 sun-0.155
 tallest-0.151
 world-0.147
Token you
Token ID345
Feature activation-1.50e-02

Pos logit contributions
pped+0.047
 it+0.047
pping+0.040
 him+0.037
ged+0.037
Neg logit contributions
 biggest-0.084
 sun-0.076
 jungle-0.074
umm-0.074
 tallest-0.073
Token'll
Token ID1183
Feature activation-3.56e-02

Pos logit contributions
 it+0.022
pped+0.020
ged+0.018
 him+0.017
pping+0.017
Neg logit contributions
 biggest-0.033
ini-0.029
 best-0.029
 jungle-0.029
 sun-0.029
Token make
Token ID787
Feature activation-3.14e-02

Pos logit contributions
 it+0.052
pped+0.050
ged+0.047
 him+0.040
 whoever+0.039
Neg logit contributions
 biggest-0.076
 sun-0.069
ini-0.068
 former-0.067
 jungle-0.067
Token a
Token ID257
Feature activation+6.66e-02

Pos logit contributions
 it+0.030
pped+0.025
pping+0.021
 let+0.020
 him+0.019
Neg logit contributions
 biggest-0.087
ini-0.081
umm-0.081
err-0.079
 sun-0.078
Token.
Token ID13
Feature activation-1.77e-02

Pos logit contributions
pped+0.035
 it+0.034
pping+0.029
 risking+0.028
 him+0.027
Neg logit contributions
 biggest-0.071
 best-0.065
 sun-0.064
 world-0.063
ini-0.061
Token Just
Token ID2329
Feature activation-3.57e-02

Pos logit contributions
 it+0.029
pped+0.024
 let+0.023
 him+0.022
pping+0.022
Neg logit contributions
 biggest-0.038
ini-0.036
umm-0.036
err-0.034
 jungle-0.034
Token rock
Token ID3881
Feature activation-3.50e-02

Pos logit contributions
 it+0.050
pped+0.046
 him+0.039
pping+0.038
ged+0.037
Neg logit contributions
 biggest-0.080
 sun-0.073
 best-0.072
ini-0.071
umm-0.070
Token back
Token ID736
Feature activation-3.83e-02

Pos logit contributions
 it+0.047
pped+0.046
 let+0.038
pping+0.038
 whoever+0.036
Neg logit contributions
 biggest-0.077
 best-0.070
ini-0.069
 jungle-0.066
 sun-0.066
Token and
Token ID290
Feature activation-3.77e-02

Pos logit contributions
 it+0.046
pped+0.043
pping+0.035
 him+0.035
 let+0.033
Neg logit contributions
 biggest-0.095
ini-0.086
umm-0.085
 best-0.085
 jungle-0.085
Token forth
Token ID6071
Feature activation-8.12e-02

Pos logit contributions
pped+0.047
 it+0.045
pping+0.039
ged+0.036
 him+0.036
Neg logit contributions
 biggest-0.087
 sun-0.080
 best-0.078
umm-0.078
ini-0.077
Token and
Token ID290
Feature activation-3.68e-02

Pos logit contributions
pped+0.108
 it+0.101
 argued+0.092
 him+0.089
 fought+0.088
Neg logit contributions
 biggest-0.183
 best-0.171
 sun-0.157
 tallest-0.154
 world-0.152
Token feel
Token ID1254
Feature activation-6.57e-03

Pos logit contributions
pped+0.045
 it+0.044
pping+0.038
 him+0.035
ged+0.034
Neg logit contributions
 biggest-0.087
 sun-0.079
umm-0.078
 best-0.077
ini-0.076
Token the
Token ID262
Feature activation+9.79e-02

Pos logit contributions
 it+0.012
pped+0.012
pping+0.011
ged+0.010
 let+0.010
Neg logit contributions
 biggest-0.012
 sun-0.010
 best-0.010
ini-0.010
umm-0.010
Token music
Token ID2647
Feature activation-1.92e-02

Pos logit contributions
ini+0.128
umm+0.125
omm+0.121
err+0.118
 biggest+0.111
Neg logit contributions
 it-0.241
 let-0.207
pped-0.202
pping-0.196
 him-0.194
Token in
Token ID287
Feature activation-1.88e-02

Pos logit contributions
pped+0.027
 it+0.025
 him+0.023
ged+0.023
pping+0.022
Neg logit contributions
 biggest-0.043
 sun-0.039
 best-0.039
 world-0.039
umm-0.038
Token,
Token ID11
Feature activation-2.52e-02

Pos logit contributions
 it+0.020
pped+0.016
pping+0.014
 let+0.013
 him+0.013
Neg logit contributions
 biggest-0.050
umm-0.047
ini-0.047
err-0.046
 jungle-0.045
Token a
Token ID257
Feature activation+7.84e-02

Pos logit contributions
 it+0.030
pped+0.023
 him+0.021
 let+0.021
pping+0.020
Neg logit contributions
 biggest-0.065
ini-0.062
umm-0.062
err-0.060
 jungle-0.059
Token man
Token ID582
Feature activation-8.83e-03

Pos logit contributions
umm+0.111
err+0.107
 biggest+0.103
 former+0.099
ini+0.098
Neg logit contributions
 it-0.201
 let-0.168
 that-0.149
 him-0.144
 don-0.142
Token came
Token ID1625
Feature activation-1.73e-02

Pos logit contributions
 it+0.014
pping+0.013
pped+0.012
 properly+0.011
 instead+0.011
Neg logit contributions
ini-0.016
 jungle-0.015
 biggest-0.014
 forest-0.014
 eldest-0.014
Token up
Token ID510
Feature activation-2.15e-02

Pos logit contributions
 it+0.023
pped+0.018
pping+0.016
 let+0.016
 whoever+0.015
Neg logit contributions
 biggest-0.041
ini-0.040
err-0.039
umm-0.038
 jungle-0.038
Token to
Token ID284
Feature activation-8.10e-02

Pos logit contributions
 it+0.023
pped+0.020
pping+0.018
 let+0.015
 him+0.015
Neg logit contributions
 biggest-0.053
err-0.052
ini-0.052
 elderly-0.051
umm-0.051
Token him
Token ID683
Feature activation-4.23e-03

Pos logit contributions
pping+0.079
pped+0.073
 it+0.069
 risking+0.060
ched+0.057
Neg logit contributions
ini-0.188
 biggest-0.187
 jungle-0.185
 sun-0.183
 elderly-0.181
Token and
Token ID290
Feature activation-3.67e-02

Pos logit contributions
pped+0.007
 it+0.007
 whoever+0.006
pping+0.006
 risking+0.006
Neg logit contributions
 elderly-0.007
err-0.007
 old-0.007
ini-0.007
 biggest-0.006
Token asked
Token ID1965
Feature activation-5.15e-02

Pos logit contributions
pping+0.060
pped+0.060
 it+0.055
 him+0.047
sels+0.046
Neg logit contributions
 biggest-0.066
 elderly-0.064
ini-0.062
 girl-0.061
 jungle-0.061
Token,
Token ID11
Feature activation-3.53e-02

Pos logit contributions
pped+0.054
pping+0.052
 it+0.044
 risking+0.039
ms+0.038
Neg logit contributions
 biggest-0.122
 jungle-0.115
 best-0.115
 new-0.114
 sun-0.113
Token "
Token ID366
Feature activation-2.17e-02

Pos logit contributions
pped+0.069
 it+0.065
pping+0.064
 him+0.057
 argued+0.056
Neg logit contributions
 biggest-0.055
 elderly-0.051
 sun-0.050
ini-0.048
 older-0.048
Token!
Token ID0
Feature activation-1.49e-02

Pos logit contributions
pped+0.039
pping+0.037
sels+0.029
mine+0.029
ged+0.028
Neg logit contributions
 sun-0.051
 jungle-0.050
 biggest-0.050
 world-0.048
 sky-0.047
Token He
Token ID679
Feature activation-1.03e-02

Pos logit contributions
 it+0.032
pped+0.028
 him+0.026
 let+0.026
pping+0.026
Neg logit contributions
 biggest-0.024
ini-0.023
umm-0.023
err-0.021
 jungle-0.021
Token flew
Token ID13112
Feature activation-2.92e-02

Pos logit contributions
 it+0.014
pped+0.013
pping+0.013
 him+0.012
 whoever+0.011
Neg logit contributions
 biggest-0.021
 sun-0.020
ini-0.020
 jungle-0.020
 elderly-0.019
Token back
Token ID736
Feature activation-2.23e-02

Pos logit contributions
 it+0.042
pped+0.042
pping+0.039
 whoever+0.037
 let+0.035
Neg logit contributions
 biggest-0.063
 sun-0.057
err-0.055
 best-0.054
 sky-0.054
Token and
Token ID290
Feature activation-2.60e-02

Pos logit contributions
 it+0.025
pped+0.020
pping+0.017
 him+0.017
 let+0.017
Neg logit contributions
 biggest-0.058
ini-0.055
umm-0.055
err-0.054
 jungle-0.053
Token forth
Token ID6071
Feature activation-8.07e-02

Pos logit contributions
 it+0.036
pped+0.035
pping+0.032
 him+0.029
 whoever+0.028
Neg logit contributions
 biggest-0.056
 sun-0.052
ini-0.051
 jungle-0.050
err-0.049
Token in
Token ID287
Feature activation-9.90e-03

Pos logit contributions
pped+0.125
 risking+0.119
 whoever+0.118
 it+0.112
ms+0.109
Neg logit contributions
 biggest-0.147
 sun-0.141
 best-0.136
 new-0.132
 big-0.131
Token the
Token ID262
Feature activation+9.01e-02

Pos logit contributions
 it+0.011
pped+0.009
pping+0.008
 let+0.007
 him+0.007
Neg logit contributions
 biggest-0.026
umm-0.025
ini-0.025
err-0.024
 jungle-0.024
Token sky
Token ID6766
Feature activation-3.93e-02

Pos logit contributions
omm+0.105
umm+0.101
err+0.100
ini+0.095
ubby+0.086
Neg logit contributions
 it-0.244
 let-0.204
pped-0.202
 him-0.197
 that-0.189
Token,
Token ID11
Feature activation-2.41e-02

Pos logit contributions
pped+0.053
 it+0.050
pping+0.045
 whoever+0.042
 risking+0.042
Neg logit contributions
 biggest-0.091
 sun-0.084
 jungle-0.082
 sky-0.081
 best-0.081
Token spir
Token ID9158
Feature activation+2.70e-02

Pos logit contributions
 it+0.038
pped+0.038
pping+0.033
 him+0.033
 whoever+0.032
Neg logit contributions
 biggest-0.048
 sun-0.044
 jungle-0.042
 sky-0.041
ini-0.041
Token an
Token ID281
Feature activation+8.86e-02

Pos logit contributions
 it+0.040
pping+0.037
pped+0.036
 him+0.032
 let+0.031
Neg logit contributions
 biggest-0.067
 sun-0.061
umm-0.060
 best-0.060
err-0.060
Token older
Token ID4697
Feature activation+7.23e-02

Pos logit contributions
umm+0.115
 biggest+0.108
 former+0.102
ini+0.100
err+0.093
Neg logit contributions
 it-0.226
 him-0.198
 let-0.188
pped-0.188
 how-0.187
Token man
Token ID582
Feature activation-1.56e-02

Pos logit contributions
 biggest+0.081
umm+0.072
ini+0.070
err+0.066
 former+0.064
Neg logit contributions
 it-0.200
 let-0.173
 him-0.167
pped-0.167
 instead-0.162
Token came
Token ID1625
Feature activation-2.61e-02

Pos logit contributions
 it+0.024
pping+0.020
pped+0.019
 instead+0.019
 whoever+0.018
Neg logit contributions
 jungle-0.029
ini-0.029
 biggest-0.028
 sky-0.027
 eldest-0.027
Token up
Token ID510
Feature activation-2.76e-02

Pos logit contributions
 it+0.035
pped+0.030
pping+0.027
 let+0.026
 whoever+0.025
Neg logit contributions
 biggest-0.058
err-0.057
ini-0.056
 jungle-0.054
 elderly-0.054
Token to
Token ID284
Feature activation-8.06e-02

Pos logit contributions
pped+0.033
 it+0.033
pping+0.031
 whoever+0.026
 let+0.024
Neg logit contributions
 elderly-0.060
err-0.060
 biggest-0.059
 world-0.058
ini-0.058
Token Sophie
Token ID35331
Feature activation-4.72e-03

Pos logit contributions
pping+0.099
pped+0.097
 it+0.093
 wouldn+0.079
ched+0.079
Neg logit contributions
 elderly-0.165
 biggest-0.163
 sun-0.163
 girl-0.160
err-0.157
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
pped+0.008
 it+0.007
ged+0.006
 risking+0.006
pping+0.006
Neg logit contributions
 world-0.008
 elderly-0.008
 jungle-0.008
 older-0.008
 biggest-0.008
Token He
Token ID679
Feature activation-2.03e-02

Pos logit contributions
 it+0.032
pped+0.029
pping+0.027
 him+0.027
 let+0.026
Neg logit contributions
 biggest-0.022
umm-0.020
ini-0.020
err-0.019
 jungle-0.019
Token asked
Token ID1965
Feature activation-5.74e-02

Pos logit contributions
 it+0.032
pped+0.030
pping+0.029
 him+0.026
 instead+0.025
Neg logit contributions
 biggest-0.039
ini-0.038
 jungle-0.037
err-0.036
 elderly-0.035
Token her
Token ID607
Feature activation+8.72e-03

Pos logit contributions
pped+0.064
 it+0.063
pping+0.062
 breathe+0.051
 risking+0.049
Neg logit contributions
 biggest-0.131
 best-0.121
 sun-0.120
err-0.120
 elderly-0.119
Token in
Token ID287
Feature activation-2.22e-02

Pos logit contributions
 it+0.041
pped+0.036
pping+0.035
 let+0.033
 whoever+0.033
Neg logit contributions
 biggest-0.075
err-0.069
 sun-0.068
umm-0.067
ini-0.066
Token and
Token ID290
Feature activation-3.71e-02

Pos logit contributions
pped+0.026
 it+0.024
pping+0.023
 let+0.020
 risking+0.019
Neg logit contributions
 biggest-0.052
 sun-0.050
 world-0.049
 jungle-0.048
err-0.048
Token twisted
Token ID19074
Feature activation-4.04e-02

Pos logit contributions
pped+0.045
pping+0.042
 it+0.039
 risking+0.036
 him+0.035
Neg logit contributions
 biggest-0.082
 sun-0.076
 jungle-0.073
 best-0.072
ini-0.072
Token back
Token ID736
Feature activation-1.87e-02

Pos logit contributions
pped+0.036
pping+0.034
 risking+0.030
 whoever+0.030
 it+0.030
Neg logit contributions
 biggest-0.103
 sun-0.098
 best-0.094
 sky-0.092
 new-0.089
Token and
Token ID290
Feature activation-3.26e-02

Pos logit contributions
 it+0.022
pped+0.020
pping+0.018
 let+0.017
 him+0.016
Neg logit contributions
 biggest-0.047
ini-0.043
umm-0.043
err-0.042
 jungle-0.042
Token forth
Token ID6071
Feature activation-8.04e-02

Pos logit contributions
pped+0.039
 it+0.039
pping+0.035
 him+0.031
 risking+0.029
Neg logit contributions
 biggest-0.076
umm-0.068
 sun-0.068
ini-0.067
 tallest-0.067
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
pped+0.121
 risking+0.108
 whoever+0.106
 it+0.104
pping+0.100
Neg logit contributions
 biggest-0.157
 best-0.153
 sun-0.138
 new-0.135
 most-0.132
Token Suddenly
Token ID24975
Feature activation-3.81e-02

Pos logit contributions
 it+0.021
pped+0.018
pping+0.017
 him+0.016
 let+0.016
Neg logit contributions
 biggest-0.022
umm-0.021
ini-0.021
err-0.020
 jungle-0.020
Token,
Token ID11
Feature activation-3.58e-02

Pos logit contributions
pped+0.053
pping+0.053
 it+0.043
assed+0.042
ge+0.040
Neg logit contributions
 biggest-0.080
 sun-0.077
 best-0.075
 sky-0.072
 jungle-0.070
Token the
Token ID262
Feature activation+1.25e-01

Pos logit contributions
pped+0.027
pping+0.026
 it+0.025
 let+0.018
 him+0.017
Neg logit contributions
 biggest-0.099
 sun-0.092
umm-0.091
 best-0.090
ini-0.089
Token toe
Token ID21189
Feature activation-8.22e-03

Pos logit contributions
umm+0.204
ini+0.201
err+0.192
omm+0.188
 biggest+0.183
Neg logit contributions
 it-0.274
 let-0.224
pped-0.216
 him-0.207
 instead-0.200
Token mole
Token ID9411
Feature activation-1.29e-02

Pos logit contributions
anny+0.235
umm+0.225
ini+0.216
omm+0.215
lights+0.209
Neg logit contributions
 it-0.286
 let-0.236
 him-0.225
 that-0.207
 instead-0.205
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
pped+0.016
 it+0.016
pping+0.013
 risking+0.012
 him+0.012
Neg logit contributions
 biggest-0.030
ini-0.027
 eldest-0.027
 older-0.027
 best-0.027
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
 it+0.024
pped+0.021
pping+0.019
 him+0.019
 let+0.019
Neg logit contributions
 biggest-0.028
umm-0.026
ini-0.026
err-0.025
 jungle-0.025
Token
Token ID198
Feature activation+5.82e-03

Pos logit contributions
 biggest+0.028
umm+0.027
ini+0.026
 tallest+0.025
err+0.025
Neg logit contributions
 it-0.021
pped-0.016
 him-0.015
 let-0.015
 whoever-0.014
Token"
Token ID1
Feature activation-5.74e-03

Pos logit contributions
 biggest+0.013
ini+0.013
umm+0.013
err+0.012
 jungle+0.012
Neg logit contributions
 it-0.009
pped-0.007
 him-0.006
 let-0.006
pping-0.006
TokenTo
Token ID2514
Feature activation-8.03e-02

Pos logit contributions
 it+0.009
pped+0.007
 let+0.007
 him+0.006
pping+0.006
Neg logit contributions
 biggest-0.013
umm-0.013
ini-0.012
omm-0.012
ubby-0.011
Token get
Token ID651
Feature activation-2.56e-02

Pos logit contributions
 it+0.106
pped+0.104
ged+0.093
pping+0.087
 risking+0.079
Neg logit contributions
 biggest-0.171
 sun-0.157
 former-0.156
 jungle-0.154
ini-0.154
Token to
Token ID284
Feature activation-4.36e-02

Pos logit contributions
 it+0.031
pped+0.027
pping+0.025
 let+0.023
 him+0.022
Neg logit contributions
 biggest-0.062
ini-0.056
 sun-0.056
umm-0.056
 best-0.055
Token the
Token ID262
Feature activation+1.10e-01

Pos logit contributions
pped+0.047
 it+0.045
pping+0.042
ged+0.041
 instead+0.032
Neg logit contributions
 biggest-0.105
 jungle-0.098
 sun-0.098
 elderly-0.097
 former-0.096
Token other
Token ID584
Feature activation+7.86e-02

Pos logit contributions
ini+0.152
umm+0.139
err+0.137
omm+0.137
 biggest+0.136
Neg logit contributions
 it-0.264
 let-0.224
 him-0.217
pped-0.214
 instead-0.212
Token side
Token ID1735
Feature activation-1.95e-02

Pos logit contributions
 biggest+0.117
ini+0.116
umm+0.111
err+0.109
omm+0.108
Neg logit contributions
 it-0.176
 let-0.149
pped-0.146
 him-0.144
 instead-0.142
Token tag
Token ID7621
Feature activation-2.20e-02

Pos logit contributions
umm+0.078
ini+0.078
 biggest+0.076
err+0.074
 eldest+0.070
Neg logit contributions
 it-0.138
 let-0.112
pped-0.107
 him-0.105
 instead-0.102
Token.
Token ID13
Feature activation-1.67e-02

Pos logit contributions
 it+0.028
pped+0.026
pping+0.024
 him+0.022
 whoever+0.022
Neg logit contributions
 biggest-0.050
 jungle-0.046
 sun-0.046
 best-0.045
 world-0.044
Token It
Token ID632
Feature activation-1.68e-02

Pos logit contributions
 it+0.027
pped+0.022
 him+0.020
 let+0.020
pping+0.020
Neg logit contributions
 biggest-0.036
ini-0.035
umm-0.034
err-0.033
 jungle-0.032
Token says
Token ID1139
Feature activation-1.53e-02

Pos logit contributions
 it+0.028
pped+0.025
pping+0.023
 him+0.022
ged+0.021
Neg logit contributions
 biggest-0.035
ini-0.031
err-0.031
 jungle-0.031
umm-0.030
Token '
Token ID705
Feature activation-1.21e-03

Pos logit contributions
pped+0.021
 it+0.020
pping+0.020
ged+0.018
 him+0.018
Neg logit contributions
 biggest-0.034
 sun-0.031
 best-0.030
 jungle-0.029
omm-0.029
TokenTo
Token ID2514
Feature activation-8.03e-02

Pos logit contributions
 it+0.002
pped+0.002
pping+0.002
 him+0.002
 let+0.002
Neg logit contributions
 biggest-0.002
ini-0.002
umm-0.002
err-0.002
 jungle-0.002
Token Lily
Token ID20037
Feature activation-1.13e-02

Pos logit contributions
pped+0.114
 it+0.112
ged+0.104
pping+0.101
 risking+0.097
Neg logit contributions
 biggest-0.157
 sun-0.149
 elderly-0.141
omm-0.141
 former-0.140
Token and
Token ID290
Feature activation-2.99e-02

Pos logit contributions
 it+0.014
pped+0.013
 risking+0.011
ged+0.011
 him+0.011
Neg logit contributions
 biggest-0.024
 world-0.024
ini-0.024
 elderly-0.023
ama-0.023
Token Tom
Token ID4186
Feature activation-1.87e-02

Pos logit contributions
 it+0.042
pped+0.038
pping+0.036
 him+0.031
 let+0.031
Neg logit contributions
 biggest-0.067
umm-0.061
ini-0.060
 sun-0.060
 jungle-0.060
Token,
Token ID11
Feature activation-3.74e-02

Pos logit contributions
 it+0.025
pped+0.023
 risking+0.019
ged+0.019
pping+0.018
Neg logit contributions
 biggest-0.039
 world-0.038
 best-0.037
 elderly-0.037
 jungle-0.036
Token from
Token ID422
Feature activation-6.32e-03

Pos logit contributions
pped+0.044
 it+0.040
pping+0.037
ged+0.036
 him+0.030
Neg logit contributions
 biggest-0.086
 sun-0.079
 best-0.079
 world-0.079
 jungle-0.077
Token a
Token ID257
Feature activation+4.58e-02

Pos logit contributions
 it+0.042
pped+0.039
pping+0.038
 him+0.034
ged+0.034
Neg logit contributions
 biggest-0.044
 sun-0.039
 jungle-0.037
 best-0.037
 sky-0.037
Token tag
Token ID7621
Feature activation-1.99e-02

Pos logit contributions
umm+0.068
ini+0.068
 biggest+0.067
err+0.064
 eldest+0.060
Neg logit contributions
 it-0.107
 let-0.086
pped-0.084
 him-0.081
 instead-0.080
Token that
Token ID326
Feature activation-3.11e-02

Pos logit contributions
pped+0.026
 it+0.025
pping+0.024
 whoever+0.024
 him+0.023
Neg logit contributions
 biggest-0.041
 sky-0.041
 sun-0.040
 jungle-0.039
 best-0.038
Token said
Token ID531
Feature activation-2.13e-02

Pos logit contributions
 it+0.046
pped+0.041
pping+0.037
 him+0.034
 let+0.034
Neg logit contributions
 biggest-0.070
umm-0.065
ini-0.065
err-0.062
 jungle-0.062
Token "
Token ID366
Feature activation-5.88e-03

Pos logit contributions
pped+0.041
pping+0.039
 argued+0.038
 it+0.036
ge+0.036
Neg logit contributions
 biggest-0.035
 sun-0.032
 best-0.032
omm-0.030
 new-0.029
TokenTo
Token ID2514
Feature activation-8.00e-02

Pos logit contributions
 it+0.011
pped+0.009
pping+0.009
 him+0.009
 let+0.009
Neg logit contributions
 biggest-0.011
ini-0.010
umm-0.010
err-0.010
 jungle-0.009
Token Ben
Token ID3932
Feature activation-4.35e-04

Pos logit contributions
 it+0.111
pped+0.110
ged+0.099
pping+0.095
 risking+0.090
Neg logit contributions
 biggest-0.160
 sun-0.151
omm-0.148
 elderly-0.146
 jungle-0.145
Token,
Token ID11
Feature activation-3.20e-02

Pos logit contributions
 it+0.001
pped+0.001
ged+0.000
 risking+0.000
 him+0.000
Neg logit contributions
 biggest-0.001
ini-0.001
 world-0.001
ubby-0.001
 jungle-0.001
Token from
Token ID422
Feature activation-1.55e-02

Pos logit contributions
 it+0.049
pped+0.044
pping+0.041
 him+0.038
 let+0.036
Neg logit contributions
 biggest-0.069
ini-0.064
umm-0.063
err-0.062
 jungle-0.061
Token Dad
Token ID17415
Feature activation-2.05e-02

Pos logit contributions
 it+0.017
pped+0.016
pping+0.014
 let+0.013
 him+0.012
Neg logit contributions
 biggest-0.037
 elderly-0.035
 sun-0.035
 jungle-0.034
umm-0.034
Token".
Token ID1911
Feature activation-6.85e-03

Pos logit contributions
 it+0.028
pped+0.028
ged+0.024
 risking+0.024
pping+0.022
Neg logit contributions
 biggest-0.042
 world-0.040
ini-0.040
 elderly-0.039
ama-0.039
Token"
Token ID1
Feature activation-1.89e-02

Pos logit contributions
 biggest+0.019
ini+0.019
umm+0.018
err+0.018
 jungle+0.018
Neg logit contributions
 it-0.013
pped-0.011
 him-0.010
pping-0.010
 let-0.010
TokenIt
Token ID1026
Feature activation-9.54e-03

Pos logit contributions
 it+0.037
pped+0.032
 him+0.030
 let+0.030
pping+0.030
Neg logit contributions
 biggest-0.034
ini-0.032
umm-0.032
err-0.031
 jungle-0.030
Token says
Token ID1139
Feature activation-2.20e-02

Pos logit contributions
 it+0.015
pped+0.013
pping+0.012
 him+0.012
 let+0.011
Neg logit contributions
 biggest-0.021
ini-0.019
err-0.019
umm-0.019
 jungle-0.019
Token:
Token ID25
Feature activation-3.36e-02

Pos logit contributions
pped+0.032
 it+0.030
pping+0.029
ged+0.027
ge+0.026
Neg logit contributions
 biggest-0.047
 sun-0.044
 best-0.043
omm-0.042
umm-0.041
Token '
Token ID705
Feature activation-7.11e-03

Pos logit contributions
 it+0.065
pped+0.061
pping+0.057
 him+0.055
 let+0.052
Neg logit contributions
 biggest-0.058
umm-0.052
ini-0.051
err-0.051
 jungle-0.051
TokenTo
Token ID2514
Feature activation-8.00e-02

Pos logit contributions
 it+0.013
pping+0.011
pped+0.011
 him+0.011
 let+0.010
Neg logit contributions
ini-0.012
 biggest-0.012
umm-0.011
err-0.011
 world-0.011
Token whoever
Token ID16958
Feature activation-4.52e-02

Pos logit contributions
 it+0.118
pped+0.114
ged+0.106
pping+0.102
 risking+0.097
Neg logit contributions
 biggest-0.150
 sun-0.144
omm-0.139
 elderly-0.139
 former-0.138
Token finds
Token ID7228
Feature activation-5.35e-02

Pos logit contributions
pped+0.064
 it+0.062
pping+0.062
ged+0.055
ge+0.052
Neg logit contributions
 biggest-0.088
 jungle-0.081
 best-0.080
 sun-0.079
 world-0.079
Token this
Token ID428
Feature activation+4.25e-02

Pos logit contributions
pped+0.045
 it+0.043
pping+0.041
 let+0.033
ged+0.032
Neg logit contributions
 biggest-0.144
 sun-0.131
 jungle-0.129
 best-0.129
ini-0.128
Token note
Token ID3465
Feature activation-3.92e-02

Pos logit contributions
 biggest+0.065
ini+0.059
umm+0.059
err+0.056
 jungle+0.054
Neg logit contributions
 it-0.093
pped-0.086
pping-0.080
 him-0.080
 let-0.078
Token,
Token ID11
Feature activation-4.39e-02

Pos logit contributions
pped+0.055
 it+0.054
pping+0.049
 him+0.046
ged+0.044
Neg logit contributions
 biggest-0.086
 best-0.078
 sun-0.075
ini-0.075
 world-0.074
Token.
Token ID13
Feature activation-7.36e-03

Pos logit contributions
pped+0.058
 it+0.056
pping+0.051
 risking+0.049
 whoever+0.044
Neg logit contributions
 biggest-0.102
 best-0.089
 old-0.088
 older-0.088
 tallest-0.088
Token He
Token ID679
Feature activation-1.15e-02

Pos logit contributions
 it+0.016
pped+0.014
pping+0.013
 him+0.013
 let+0.013
Neg logit contributions
 biggest-0.012
umm-0.011
ini-0.011
err-0.011
 jungle-0.010
Token rocked
Token ID36872
Feature activation-3.53e-02

Pos logit contributions
 it+0.016
pped+0.014
pping+0.013
 him+0.013
 let+0.012
Neg logit contributions
 biggest-0.026
ini-0.024
err-0.023
umm-0.023
 jungle-0.023
Token back
Token ID736
Feature activation-2.01e-02

Pos logit contributions
 it+0.045
pped+0.040
pping+0.038
 let+0.035
 whoever+0.034
Neg logit contributions
 biggest-0.084
 sun-0.075
ini-0.075
err-0.074
 best-0.074
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
 it+0.026
pped+0.023
pping+0.021
 let+0.020
 him+0.019
Neg logit contributions
 biggest-0.048
ini-0.044
umm-0.044
 jungle-0.043
err-0.043
Token forth
Token ID6071
Feature activation-7.96e-02

Pos logit contributions
 it+0.035
pped+0.034
pping+0.032
 him+0.028
 risking+0.028
Neg logit contributions
 biggest-0.046
 sun-0.041
 jungle-0.040
 tallest-0.040
umm-0.040
Token,
Token ID11
Feature activation-2.31e-02

Pos logit contributions
pped+0.125
 risking+0.124
ms+0.116
 whoever+0.114
 it+0.112
Neg logit contributions
 biggest-0.138
 best-0.131
 sun-0.127
 big-0.122
 new-0.119
Token pushing
Token ID7796
Feature activation-4.73e-02

Pos logit contributions
 it+0.037
pped+0.036
pping+0.031
 him+0.030
 whoever+0.029
Neg logit contributions
 biggest-0.045
 jungle-0.041
 sun-0.041
 tallest-0.039
 sky-0.039
Token and
Token ID290
Feature activation-2.36e-02

Pos logit contributions
 it+0.058
pped+0.051
pping+0.046
 let+0.043
 him+0.040
Neg logit contributions
 biggest-0.115
umm-0.106
ini-0.105
err-0.104
 sun-0.104
Token pulling
Token ID10427
Feature activation-4.43e-02

Pos logit contributions
 it+0.037
pped+0.035
pping+0.031
 him+0.030
 let+0.029
Neg logit contributions
 biggest-0.047
 jungle-0.043
 sun-0.042
umm-0.042
 tallest-0.042
Token as
Token ID355
Feature activation-1.46e-02

Pos logit contributions
 it+0.056
pped+0.055
pping+0.049
 let+0.045
 risking+0.045
Neg logit contributions
 biggest-0.101
 best-0.092
 sun-0.092
umm-0.089
err-0.088
Token The
Token ID383
Feature activation+1.10e-01

Pos logit contributions
 it+0.021
pped+0.018
 him+0.016
 let+0.016
pping+0.016
Neg logit contributions
 biggest-0.026
ini-0.025
umm-0.024
err-0.023
 jungle-0.023
Token note
Token ID3465
Feature activation-1.63e-02

Pos logit contributions
umm+0.202
ini+0.198
 biggest+0.196
err+0.189
 former+0.185
Neg logit contributions
 it-0.227
 let-0.172
pped-0.172
 him-0.162
 that-0.157
Token said
Token ID531
Feature activation-3.00e-02

Pos logit contributions
 it+0.025
pping+0.025
pped+0.025
 him+0.022
assed+0.020
Neg logit contributions
 biggest-0.032
ini-0.030
 best-0.029
 sun-0.029
 elderly-0.029
Token:
Token ID25
Feature activation-4.26e-02

Pos logit contributions
pping+0.049
pped+0.048
ge+0.040
ged+0.040
assed+0.040
Neg logit contributions
 biggest-0.058
omm-0.056
 sun-0.054
 best-0.052
 new-0.052
Token "
Token ID366
Feature activation-2.41e-02

Pos logit contributions
pped+0.083
pping+0.078
 it+0.076
ged+0.069
 risking+0.068
Neg logit contributions
 biggest-0.068
 sun-0.064
 jungle-0.061
 sky-0.060
omm-0.059
TokenTo
Token ID2514
Feature activation-7.94e-02

Pos logit contributions
 it+0.047
pped+0.040
pping+0.039
 him+0.038
 let+0.037
Neg logit contributions
 biggest-0.043
ini-0.041
umm-0.040
err-0.039
 jungle-0.037
Token open
Token ID1280
Feature activation-4.71e-02

Pos logit contributions
 it+0.112
pped+0.111
ged+0.103
pping+0.099
 risking+0.095
Neg logit contributions
 biggest-0.156
 sun-0.148
 elderly-0.147
omm-0.145
 former-0.144
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
pped+0.037
pping+0.030
 breathe+0.028
ged+0.028
ched+0.026
Neg logit contributions
 biggest-0.128
 old-0.119
 world-0.118
 elderly-0.118
 sun-0.117
Token box
Token ID3091
Feature activation-4.30e-02

Pos logit contributions
 biggest+0.156
ini+0.153
umm+0.149
err+0.141
 eldest+0.132
Neg logit contributions
 it-0.281
pped-0.241
 let-0.233
 him-0.229
pping-0.224
Token,
Token ID11
Feature activation-5.34e-02

Pos logit contributions
pped+0.059
 it+0.058
ged+0.052
pping+0.051
 risking+0.050
Neg logit contributions
 biggest-0.097
 world-0.086
ini-0.086
 best-0.085
 sky-0.085
Token you
Token ID345
Feature activation-8.88e-03

Pos logit contributions
pped+0.065
 it+0.063
pping+0.062
ged+0.054
 him+0.051
Neg logit contributions
 biggest-0.127
 sun-0.116
 elderly-0.113
ini-0.113
umm-0.112
Token Then
Token ID3244
Feature activation-2.26e-02

Pos logit contributions
 it+0.005
 him+0.003
 let+0.003
 instead+0.003
 whoever+0.003
Neg logit contributions
ini-0.007
umm-0.007
 biggest-0.007
ubby-0.007
 former-0.006
Token her
Token ID607
Feature activation+8.49e-02

Pos logit contributions
 it+0.028
pped+0.026
pping+0.024
 let+0.021
 him+0.021
Neg logit contributions
 biggest-0.054
 sun-0.049
umm-0.049
 best-0.049
 jungle-0.048
Token mom
Token ID1995
Feature activation-1.14e-03

Pos logit contributions
umm+0.165
ini+0.164
err+0.145
 Arctic+0.140
umpy+0.140
Neg logit contributions
 it-0.160
 let-0.137
 to-0.128
 that-0.123
 the-0.120
Token came
Token ID1625
Feature activation-2.61e-02

Pos logit contributions
 it+0.002
pped+0.002
pping+0.002
 risking+0.002
 him+0.002
Neg logit contributions
ini-0.002
 biggest-0.002
 eldest-0.001
 older-0.001
 elderly-0.001
Token up
Token ID510
Feature activation-3.01e-02

Pos logit contributions
 it+0.035
pped+0.030
pping+0.027
 let+0.026
 whoever+0.025
Neg logit contributions
 biggest-0.059
ini-0.057
err-0.056
 elderly-0.055
 world-0.054
Token to
Token ID284
Feature activation-7.92e-02

Pos logit contributions
 it+0.037
pped+0.034
pping+0.031
 him+0.027
 whoever+0.027
Neg logit contributions
 biggest-0.067
 elderly-0.067
err-0.065
ini-0.065
 sun-0.064
Token her
Token ID607
Feature activation+3.85e-02

Pos logit contributions
pping+0.109
pped+0.109
 it+0.093
ched+0.093
es+0.090
Neg logit contributions
 sun-0.142
 girl-0.138
 biggest-0.137
 little-0.137
 elderly-0.136
Token with
Token ID351
Feature activation-8.41e-03

Pos logit contributions
 biggest+0.094
ini+0.090
umm+0.089
err+0.087
 jungle+0.086
Neg logit contributions
 it-0.050
pped-0.041
pping-0.036
 him-0.036
 let-0.035
Token a
Token ID257
Feature activation+5.58e-02

Pos logit contributions
 it+0.013
pped+0.011
pping+0.011
 let+0.010
 whoever+0.010
Neg logit contributions
 biggest-0.018
ini-0.016
 sun-0.016
umm-0.016
 best-0.016
Token big
Token ID1263
Feature activation+6.18e-02

Pos logit contributions
umm+0.078
ini+0.074
 biggest+0.073
err+0.069
 former+0.065
Neg logit contributions
 it-0.136
 let-0.114
pped-0.111
 him-0.108
pping-0.105
Token smile
Token ID8212
Feature activation-2.03e-02

Pos logit contributions
 biggest+0.113
ini+0.109
umm+0.108
err+0.103
 former+0.099
Neg logit contributions
 it-0.121
pped-0.100
 let-0.096
 him-0.096
pping-0.093
Token rocked
Token ID36872
Feature activation-4.62e-02

Pos logit contributions
 it+0.025
pped+0.022
pping+0.021
 him+0.020
 whoever+0.018
Neg logit contributions
 biggest-0.038
ini-0.036
 jungle-0.034
err-0.034
 youngest-0.034
Token the
Token ID262
Feature activation+9.71e-02

Pos logit contributions
pped+0.029
 it+0.027
pping+0.027
 risking+0.023
 let+0.021
Neg logit contributions
 biggest-0.131
 sun-0.124
 best-0.120
ini-0.117
 sky-0.117
Token bucket
Token ID19236
Feature activation-4.86e-02

Pos logit contributions
 biggest+0.161
ini+0.148
umm+0.147
err+0.138
 jungle+0.135
Neg logit contributions
 it-0.205
pped-0.182
 him-0.171
pping-0.170
 let-0.168
Token back
Token ID736
Feature activation-2.22e-02

Pos logit contributions
 it+0.067
pped+0.059
pping+0.052
 whoever+0.052
 him+0.051
Neg logit contributions
 biggest-0.112
umm-0.101
ini-0.101
 sun-0.100
 best-0.098
Token and
Token ID290
Feature activation-2.53e-02

Pos logit contributions
 it+0.028
pped+0.025
pping+0.021
 him+0.021
 let+0.021
Neg logit contributions
 biggest-0.054
ini-0.050
umm-0.050
err-0.048
 sun-0.048
Token forth
Token ID6071
Feature activation-7.90e-02

Pos logit contributions
pped+0.037
pping+0.035
 it+0.035
 risking+0.030
assed+0.029
Neg logit contributions
 biggest-0.050
 sun-0.047
 best-0.045
umm-0.044
ini-0.044
Token,
Token ID11
Feature activation-2.97e-02

Pos logit contributions
pped+0.121
 risking+0.113
 whoever+0.110
 it+0.106
pping+0.104
Neg logit contributions
 biggest-0.149
 best-0.144
 sun-0.137
 new-0.130
 sky-0.127
Token making
Token ID1642
Feature activation-2.64e-02

Pos logit contributions
pped+0.044
 it+0.043
pping+0.037
 him+0.036
 whoever+0.035
Neg logit contributions
 biggest-0.060
 sun-0.055
 best-0.053
 sky-0.052
 jungle-0.052
Token a
Token ID257
Feature activation+5.71e-02

Pos logit contributions
 it+0.037
pped+0.033
pping+0.030
 let+0.029
 him+0.027
Neg logit contributions
 biggest-0.061
ini-0.056
umm-0.056
err-0.054
 sun-0.053
Token game
Token ID983
Feature activation-2.82e-02

Pos logit contributions
ini+0.074
umm+0.073
omm+0.071
err+0.071
 biggest+0.069
Neg logit contributions
 it-0.144
 let-0.119
 him-0.112
 instead-0.110
pped-0.108
Token out
Token ID503
Feature activation-1.53e-02

Pos logit contributions
pped+0.042
 it+0.040
pping+0.036
ged+0.034
 risking+0.033
Neg logit contributions
 biggest-0.060
 best-0.054
 sun-0.053
umm-0.052
 jungle-0.052
Token,
Token ID11
Feature activation-3.14e-02

Pos logit contributions
pped+0.047
pping+0.045
 it+0.037
ge+0.036
 risking+0.036
Neg logit contributions
 biggest-0.061
 best-0.055
 sun-0.053
 new-0.051
 older-0.050
Token "
Token ID366
Feature activation-1.28e-02

Pos logit contributions
pped+0.064
 it+0.060
pping+0.060
 risking+0.055
 him+0.055
Neg logit contributions
 biggest-0.044
 sun-0.043
omm-0.040
 sky-0.039
 elderly-0.038
TokenLet
Token ID5756
Feature activation-3.60e-02

Pos logit contributions
 it+0.024
pped+0.021
 let+0.020
 him+0.019
pping+0.019
Neg logit contributions
 biggest-0.024
umm-0.023
ini-0.023
err-0.022
 jungle-0.021
Token's
Token ID338
Feature activation-1.83e-02

Pos logit contributions
 it+0.052
pped+0.050
pping+0.046
ged+0.043
 risking+0.041
Neg logit contributions
 biggest-0.078
ini-0.071
 sun-0.068
 best-0.068
err-0.068
Token search
Token ID2989
Feature activation-5.89e-02

Pos logit contributions
 it+0.028
pped+0.026
ged+0.024
pping+0.023
 risking+0.022
Neg logit contributions
 biggest-0.039
 sun-0.035
ini-0.034
 jungle-0.034
umm-0.034
Token inside
Token ID2641
Feature activation-7.89e-02

Pos logit contributions
pped+0.087
pping+0.083
 it+0.080
 risking+0.072
ged+0.071
Neg logit contributions
 biggest-0.126
 sun-0.114
 best-0.108
 new-0.105
 old-0.103
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
pped+0.099
ged+0.092
 argued+0.087
 fought+0.084
ched+0.082
Neg logit contributions
 biggest-0.183
 elderly-0.165
 world-0.165
 sun-0.163
 new-0.162
Token desk
Token ID6915
Feature activation-4.38e-02

Pos logit contributions
umm+0.139
ini+0.137
err+0.132
 biggest+0.129
ubby+0.124
Neg logit contributions
 it-0.272
 let-0.223
 him-0.220
pped-0.220
 instead-0.212
Token!"
Token ID2474
Feature activation+1.10e-02

Pos logit contributions
 it+0.056
pped+0.056
 him+0.046
pping+0.046
 risking+0.046
Neg logit contributions
 biggest-0.103
ini-0.093
umm-0.092
 world-0.091
 tallest-0.091
Token
Token ID198
Feature activation+3.09e-02

Pos logit contributions
 biggest+0.021
ini+0.020
umm+0.020
err+0.019
 jungle+0.019
Neg logit contributions
 it-0.020
pped-0.017
pping-0.016
 him-0.016
 let-0.016
Token
Token ID198
Feature activation+2.48e-02

Pos logit contributions
 biggest+0.058
ini+0.055
umm+0.054
err+0.052
 jungle+0.051
Neg logit contributions
 it-0.057
pped-0.049
pping-0.046
 him-0.046
 let-0.046
Token little
Token ID1310
Feature activation+9.15e-02

Pos logit contributions
omm+0.146
umm+0.141
err+0.132
opl+0.130
anny+0.130
Neg logit contributions
 it-0.298
 let-0.268
pped-0.231
 that-0.229
 again-0.226
Token girl
Token ID2576
Feature activation-3.62e-03

Pos logit contributions
 biggest+0.073
ini+0.070
umm+0.069
omm+0.067
err+0.064
Neg logit contributions
 it-0.276
 let-0.239
 instead-0.226
pped-0.226
 him-0.223
Token looked
Token ID3114
Feature activation-2.59e-02

Pos logit contributions
 it+0.006
pped+0.005
 him+0.005
pping+0.005
 whoever+0.005
Neg logit contributions
 biggest-0.007
ini-0.007
 sun-0.006
 jungle-0.006
 youngest-0.006
Token sadly
Token ID21098
Feature activation-1.84e-02

Pos logit contributions
 it+0.039
pped+0.034
pping+0.030
 let+0.030
 him+0.030
Neg logit contributions
 biggest-0.058
ini-0.054
umm-0.054
err-0.052
 jungle-0.051
Token at
Token ID379
Feature activation-1.95e-02

Pos logit contributions
 it+0.025
pped+0.024
pping+0.021
 whoever+0.020
 let+0.019
Neg logit contributions
 biggest-0.041
 sun-0.038
 best-0.038
 former-0.037
 surroundings-0.037
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
 it+0.019
pped+0.012
 him+0.012
 let+0.010
pping+0.010
Neg logit contributions
 biggest-0.055
ini-0.054
umm-0.054
err-0.052
 jungle-0.051
Token gem
Token ID16840
Feature activation-1.46e-02

Pos logit contributions
omm+0.185
umm+0.177
ini+0.173
anny+0.173
omo+0.171
Neg logit contributions
 it-0.258
 let-0.209
 him-0.195
pped-0.193
 that-0.191
Token as
Token ID355
Feature activation-1.32e-02

Pos logit contributions
pped+0.020
 it+0.020
 him+0.017
pping+0.017
 whoever+0.016
Neg logit contributions
 biggest-0.033
 best-0.030
 sky-0.029
 world-0.029
 sun-0.029
Token all
Token ID477
Feature activation+2.67e-03

Pos logit contributions
 it+0.012
pped+0.012
pping+0.010
 him+0.009
 let+0.009
Neg logit contributions
 biggest-0.035
ini-0.033
 sun-0.032
 best-0.032
umm-0.032
Token the
Token ID262
Feature activation+8.33e-02

Pos logit contributions
 biggest+0.006
 sun+0.006
 sky+0.005
ini+0.005
umm+0.005
Neg logit contributions
pped-0.004
 it-0.003
pping-0.003
assed-0.003
 him-0.003
Token memories
Token ID9846
Feature activation-2.91e-02

Pos logit contributions
ini+0.114
omm+0.113
 biggest+0.110
umm+0.109
err+0.103
Neg logit contributions
 it-0.206
 let-0.176
 him-0.165
pped-0.155
 share-0.150
Token as
Token ID355
Feature activation-2.16e-02

Pos logit contributions
pped+0.032
 it+0.030
pping+0.027
 risking+0.026
assed+0.025
Neg logit contributions
 biggest-0.053
 sun-0.051
 best-0.048
ini-0.047
 jungle-0.047
Token he
Token ID339
Feature activation+6.05e-03

Pos logit contributions
 it+0.026
pped+0.025
pping+0.023
 let+0.020
 him+0.019
Neg logit contributions
 biggest-0.053
ini-0.047
 sun-0.047
 best-0.047
 jungle-0.046
Token went
Token ID1816
Feature activation-1.74e-02

Pos logit contributions
 biggest+0.013
ini+0.013
 jungle+0.012
 eldest+0.012
err+0.012
Neg logit contributions
 it-0.009
pped-0.007
pping-0.007
 him-0.007
 let-0.006
Token fast
Token ID3049
Feature activation+5.12e-03

Pos logit contributions
 it+0.025
pped+0.024
pping+0.021
 let+0.020
 whoever+0.020
Neg logit contributions
 biggest-0.038
 jungle-0.035
 sun-0.034
 best-0.034
 world-0.033
Token.
Token ID13
Feature activation+5.04e-03

Pos logit contributions
 biggest+0.010
 jungle+0.010
 world+0.010
 sun+0.010
 sky+0.010
Neg logit contributions
pped-0.007
 it-0.007
 whoever-0.006
pping-0.006
 let-0.006
Token The
Token ID383
Feature activation+1.13e-01

Pos logit contributions
 biggest+0.010
ini+0.010
umm+0.010
err+0.009
 former+0.009
Neg logit contributions
 it-0.009
pped-0.007
 him-0.007
 let-0.007
pping-0.007
Token sun
Token ID4252
Feature activation-7.65e-03

Pos logit contributions
umm+0.141
err+0.135
ini+0.134
 former+0.124
omm+0.121
Neg logit contributions
 it-0.300
 let-0.256
pped-0.250
 him-0.245
pping-0.235
Token was
Token ID373
Feature activation-5.58e-03

Pos logit contributions
 it+0.012
pped+0.011
pping+0.010
 him+0.010
 whoever+0.010
Neg logit contributions
 biggest-0.015
ini-0.015
 sun-0.014
 sky-0.014
 jungle-0.014
Token warm
Token ID5814
Feature activation-7.65e-03

Pos logit contributions
 it+0.013
pped+0.011
 him+0.011
 let+0.011
pping+0.011
Neg logit contributions
 biggest-0.008
ini-0.008
umm-0.008
err-0.007
 jungle-0.007
Token and
Token ID290
Feature activation-2.19e-02

Pos logit contributions
 it+0.010
pped+0.008
pping+0.008
 let+0.007
 him+0.007
Neg logit contributions
 biggest-0.018
umm-0.018
ini-0.017
 sun-0.017
 jungle-0.017
Token bright
Token ID6016
Feature activation-1.18e-02

Pos logit contributions
 it+0.028
pped+0.026
pping+0.024
 him+0.021
 let+0.021
Neg logit contributions
 biggest-0.052
ini-0.047
umm-0.047
 sun-0.047
 jungle-0.046
Token very
Token ID845
Feature activation+2.20e-02

Pos logit contributions
 it+0.012
pped+0.012
pping+0.012
ged+0.011
assed+0.011
Neg logit contributions
 biggest-0.011
 sun-0.009
 best-0.009
umm-0.009
 elderly-0.008
Token wise
Token ID10787
Feature activation-7.91e-03

Pos logit contributions
 biggest+0.030
ini+0.029
umm+0.028
err+0.027
 jungle+0.026
Neg logit contributions
 it-0.053
pped-0.046
 let-0.044
 him-0.044
pping-0.043
Token.
Token ID13
Feature activation+3.84e-03

Pos logit contributions
pped+0.013
 it+0.011
 whoever+0.011
pping+0.010
ck+0.010
Neg logit contributions
 biggest-0.016
umm-0.015
 best-0.015
 world-0.015
 elderly-0.014
Token
Token ID198
Feature activation+1.82e-02

Pos logit contributions
 biggest+0.008
ini+0.008
umm+0.008
err+0.008
ubby+0.008
Neg logit contributions
 it-0.006
 him-0.005
 let-0.005
pped-0.004
 whoever-0.004
Token
Token ID198
Feature activation+1.34e-02

Pos logit contributions
 biggest+0.037
umm+0.036
ini+0.035
err+0.033
 jungle+0.033
Neg logit contributions
 it-0.032
pped-0.027
 him-0.025
 let-0.025
pping-0.024
TokenThe
Token ID464
Feature activation+1.19e-01

Pos logit contributions
 biggest+0.025
ini+0.023
umm+0.023
err+0.022
 jungle+0.022
Neg logit contributions
 it-0.026
pped-0.022
 him-0.021
 let-0.021
pping-0.021
Token wise
Token ID10787
Feature activation+1.17e-01

Pos logit contributions
ini+0.189
err+0.178
umm+0.177
anny+0.164
umpy+0.164
Neg logit contributions
 it-0.266
 let-0.227
 him-0.215
 that-0.204
 later-0.203
Token bear
Token ID6842
Feature activation-3.37e-03

Pos logit contributions
 biggest+0.097
ini+0.086
umm+0.086
err+0.077
 jungle+0.065
Neg logit contributions
 it-0.348
pped-0.306
 let-0.301
 him-0.301
pping-0.296
Token knew
Token ID2993
Feature activation-3.51e-02

Pos logit contributions
 it+0.005
pped+0.004
pping+0.004
 him+0.004
 whoever+0.004
Neg logit contributions
 biggest-0.007
ini-0.007
 jungle-0.007
umm-0.007
err-0.007
Token he
Token ID339
Feature activation-1.40e-02

Pos logit contributions
pped+0.038
pping+0.034
 it+0.031
ged+0.031
sels+0.028
Neg logit contributions
 biggest-0.084
 best-0.082
 sun-0.077
 elderly-0.077
 new-0.076
Token'd
Token ID1549
Feature activation-2.19e-02

Pos logit contributions
 it+0.022
pped+0.022
pping+0.021
 whoever+0.019
ged+0.019
Neg logit contributions
 biggest-0.025
ini-0.024
 best-0.023
err-0.023
 jungle-0.022
Token.
Token ID13
Feature activation+2.16e-03

Pos logit contributions
pped+0.028
 it+0.026
pping+0.025
sels+0.023
 let+0.023
Neg logit contributions
 biggest-0.049
 best-0.047
 sun-0.044
 world-0.044
 jungle-0.044
Token But
Token ID887
Feature activation-1.59e-02

Pos logit contributions
ini+0.005
umm+0.005
 biggest+0.005
ubby+0.005
andel+0.005
Neg logit contributions
 it-0.003
 him-0.003
 let-0.002
 instead-0.002
 whoever-0.002
Token his
Token ID465
Feature activation+8.18e-02

Pos logit contributions
 it+0.019
pped+0.017
pping+0.015
 let+0.014
 him+0.014
Neg logit contributions
 biggest-0.039
ini-0.037
umm-0.036
err-0.036
 jungle-0.035
Token mom
Token ID1995
Feature activation-6.23e-03

Pos logit contributions
umm+0.175
ini+0.172
 biggest+0.153
err+0.153
eh+0.148
Neg logit contributions
 it-0.155
 let-0.119
 again-0.113
 that-0.108
 instead-0.104
Token said
Token ID531
Feature activation-3.58e-02

Pos logit contributions
 it+0.009
pped+0.008
pping+0.007
 him+0.007
 whoever+0.007
Neg logit contributions
 biggest-0.013
ini-0.013
 best-0.012
umm-0.012
err-0.012
Token the
Token ID262
Feature activation+1.15e-01

Pos logit contributions
pped+0.042
pping+0.041
ge+0.038
assed+0.036
sels+0.035
Neg logit contributions
 best-0.075
 sun-0.073
 biggest-0.071
 little-0.069
omm-0.069
Token ghost
Token ID10905
Feature activation-1.39e-02

Pos logit contributions
umm+0.186
ini+0.184
 biggest+0.172
ubby+0.156
err+0.156
Neg logit contributions
 it-0.264
 let-0.216
pped-0.204
 him-0.201
 instead-0.192
Token was
Token ID373
Feature activation+2.48e-03

Pos logit contributions
pped+0.022
 it+0.021
pping+0.020
 him+0.017
 whoever+0.017
Neg logit contributions
 biggest-0.028
 best-0.026
ini-0.025
err-0.025
 jungle-0.024
Token too
Token ID1165
Feature activation+1.48e-02

Pos logit contributions
 biggest+0.004
 best+0.004
 elderly+0.004
 sun+0.004
 older+0.004
Neg logit contributions
 it-0.004
pped-0.004
ged-0.004
pping-0.004
 him-0.004
Token expensive
Token ID5789
Feature activation-2.71e-02

Pos logit contributions
 biggest+0.021
umm+0.021
ini+0.020
err+0.018
 jungle+0.017
Neg logit contributions
 it-0.036
pped-0.031
 let-0.030
 him-0.029
pping-0.029
Token and
Token ID290
Feature activation-3.36e-02

Pos logit contributions
pped+0.048
pping+0.042
ck+0.041
 whoever+0.039
 it+0.039
Neg logit contributions
 best-0.049
 biggest-0.047
 old-0.046
 new-0.044
 world-0.042
Token friends
Token ID2460
Feature activation-2.02e-02

Pos logit contributions
umm+0.075
ini+0.073
 biggest+0.071
omm+0.069
err+0.067
Neg logit contributions
 it-0.108
 him-0.085
 let-0.083
pped-0.082
 instead-0.082
Token.
Token ID13
Feature activation+1.52e-03

Pos logit contributions
pped+0.032
 it+0.025
pping+0.025
ged+0.023
assed+0.023
Neg logit contributions
 biggest-0.041
 world-0.040
 best-0.039
 jungle-0.037
 sun-0.037
Token
Token ID220
Feature activation+2.03e-02

Pos logit contributions
 biggest+0.003
ini+0.003
umm+0.003
err+0.003
ubby+0.003
Neg logit contributions
 it-0.003
 him-0.002
pped-0.002
 let-0.002
 instead-0.002
Token
Token ID198
Feature activation+2.17e-02

Pos logit contributions
 biggest+0.040
umm+0.038
ini+0.038
err+0.036
 jungle+0.035
Neg logit contributions
 it-0.037
pped-0.031
 him-0.029
 let-0.029
pping-0.028
Token
Token ID198
Feature activation+1.95e-02

Pos logit contributions
 biggest+0.048
umm+0.047
ini+0.045
err+0.043
 eldest+0.042
Neg logit contributions
 it-0.036
pped-0.029
 him-0.028
 let-0.027
pping-0.026
TokenThe
Token ID464
Feature activation+1.25e-01

Pos logit contributions
 biggest+0.036
umm+0.034
ini+0.034
err+0.032
 jungle+0.031
Neg logit contributions
 it-0.038
pped-0.033
 let-0.031
 him-0.031
pping-0.030
Token end
Token ID886
Feature activation+5.11e-03

Pos logit contributions
ini+0.196
umm+0.190
err+0.177
 biggest+0.175
omm+0.172
Neg logit contributions
 it-0.275
 let-0.232
 him-0.223
pped-0.211
 share-0.208
Token.
Token ID13
Feature activation+6.50e-03

Pos logit contributions
 biggest+0.012
ini+0.011
err+0.011
 jungle+0.011
umm+0.011
Neg logit contributions
 it-0.006
pped-0.006
pping-0.005
 let-0.005
 him-0.005
Token<|endoftext|>
Token ID50256
Feature activation+2.55e-02

Pos logit contributions
 biggest+0.014
ini+0.014
umm+0.013
err+0.013
 eldest+0.012
Neg logit contributions
 it-0.011
pped-0.008
 him-0.008
 let-0.008
pping-0.008
TokenOnce
Token ID7454
Feature activation+1.46e-02

Pos logit contributions
 biggest+0.036
ini+0.033
umm+0.032
err+0.031
 jungle+0.031
Neg logit contributions
 it-0.058
pped-0.053
pping-0.051
 let-0.050
 him-0.048
Token upon
Token ID2402
Feature activation+7.62e-03

Pos logit contributions
 biggest+0.021
ini+0.019
umm+0.019
err+0.018
 jungle+0.017
Neg logit contributions
 it-0.034
pped-0.030
pping-0.028
 let-0.028
 him-0.028
Token play
Token ID711
Feature activation-1.48e-03

Pos logit contributions
 it+0.029
pped+0.027
pping+0.024
ged+0.022
 him+0.022
Neg logit contributions
 biggest-0.038
ini-0.035
 sun-0.035
 jungle-0.034
umm-0.034
Token with
Token ID351
Feature activation-1.16e-02

Pos logit contributions
 it+0.002
pped+0.002
pping+0.002
 whoever+0.002
ge+0.002
Neg logit contributions
 biggest-0.003
 sun-0.003
 jungle-0.003
ini-0.003
 best-0.003
Token her
Token ID607
Feature activation+6.60e-02

Pos logit contributions
 it+0.010
 him+0.006
pped+0.005
 let+0.004
 how+0.004
Neg logit contributions
umm-0.035
 biggest-0.035
ini-0.035
err-0.033
 former-0.033
Token friends
Token ID2460
Feature activation-1.52e-02

Pos logit contributions
umm+0.121
ini+0.105
omm+0.104
 biggest+0.103
err+0.101
Neg logit contributions
 it-0.143
 instead-0.109
 let-0.108
 him-0.108
 each-0.105
Token in
Token ID287
Feature activation+2.49e-04

Pos logit contributions
pped+0.024
 it+0.024
fed+0.021
 risking+0.020
assed+0.020
Neg logit contributions
 best-0.030
 biggest-0.029
 world-0.027
 sun-0.027
 jungle-0.026
Token the
Token ID262
Feature activation+8.82e-02

Pos logit contributions
 biggest+0.001
ini+0.001
umm+0.001
 jungle+0.001
err+0.001
Neg logit contributions
 it-0.000
pped-0.000
pping-0.000
 let-0.000
 him-0.000
Token park
Token ID3952
Feature activation-2.75e-02

Pos logit contributions
umm+0.149
err+0.144
omm+0.140
ini+0.138
ubby+0.131
Neg logit contributions
 it-0.190
 him-0.159
 the-0.147
 let-0.146
 this-0.143
Token.
Token ID13
Feature activation+3.11e-03

Pos logit contributions
 it+0.036
pped+0.034
pping+0.030
 him+0.026
 whoever+0.026
Neg logit contributions
 biggest-0.064
 jungle-0.060
 best-0.059
ini-0.059
 sun-0.059
Token One
Token ID1881
Feature activation+4.94e-02

Pos logit contributions
ini+0.006
 biggest+0.006
umm+0.006
err+0.006
 former+0.006
Neg logit contributions
 it-0.006
 him-0.004
 let-0.004
pped-0.004
 instead-0.004
Token day
Token ID1110
Feature activation-2.30e-02

Pos logit contributions
umm+0.083
 former+0.078
 biggest+0.073
omm+0.073
ini+0.069
Neg logit contributions
 it-0.119
 let-0.102
 instead-0.093
 him-0.092
 that-0.091
Token,
Token ID11
Feature activation-2.90e-02

Pos logit contributions
 it+0.030
pped+0.027
pping+0.026
 him+0.023
 let+0.023
Neg logit contributions
 biggest-0.053
umm-0.049
 sun-0.048
 best-0.048
err-0.048
Token-
Token ID12
Feature activation+4.07e-02

Pos logit contributions
 biggest+0.042
ini+0.038
 best+0.036
umm+0.036
err+0.035
Neg logit contributions
 it-0.053
pped-0.047
 him-0.045
pping-0.044
 let-0.044
Tokenall
Token ID439
Feature activation+1.20e-02

Pos logit contributions
 biggest+0.075
ini+0.071
umm+0.070
err+0.067
 jungle+0.065
Neg logit contributions
 it-0.078
pped-0.067
 him-0.063
 let-0.062
pping-0.062
Token.
Token ID13
Feature activation-3.35e-03

Pos logit contributions
 biggest+0.029
ini+0.027
 best+0.026
umm+0.026
 world+0.026
Neg logit contributions
 it-0.015
pped-0.015
pping-0.012
 him-0.012
 let-0.012
Token There
Token ID1318
Feature activation-4.95e-02

Pos logit contributions
 it+0.006
pped+0.005
pping+0.005
 him+0.005
 let+0.005
Neg logit contributions
 biggest-0.006
umm-0.006
ini-0.006
err-0.006
 jungle-0.006
Token are
Token ID389
Feature activation+3.10e-03

Pos logit contributions
pped+0.067
 it+0.063
pping+0.061
 let+0.051
ged+0.051
Neg logit contributions
 biggest-0.115
 best-0.105
ini-0.103
 world-0.101
umm-0.100
Token many
Token ID867
Feature activation+5.63e-02

Pos logit contributions
umm+0.005
ini+0.005
 biggest+0.005
err+0.005
omm+0.004
Neg logit contributions
 it-0.007
 him-0.005
pped-0.005
 let-0.005
 instead-0.005
Token things
Token ID1243
Feature activation-3.46e-02

Pos logit contributions
umm+0.074
 biggest+0.074
ini+0.072
err+0.070
omm+0.069
Neg logit contributions
 it-0.139
 him-0.115
pped-0.114
 let-0.113
 instead-0.109
Token that
Token ID326
Feature activation-1.72e-02

Pos logit contributions
pped+0.048
pping+0.043
 it+0.039
ged+0.037
assed+0.035
Neg logit contributions
 biggest-0.076
 best-0.071
 world-0.068
 sun-0.067
 sky-0.067
Token I
Token ID314
Feature activation-1.32e-02

Pos logit contributions
 it+0.021
pped+0.020
pping+0.018
 let+0.016
 him+0.015
Neg logit contributions
 biggest-0.041
umm-0.039
ini-0.039
 jungle-0.037
 sun-0.037
Token cannot
Token ID2314
Feature activation-3.52e-02

Pos logit contributions
 it+0.019
pped+0.018
pping+0.016
 him+0.015
ged+0.014
Neg logit contributions
 biggest-0.028
ini-0.027
 jungle-0.025
 former-0.025
 world-0.025
Token do
Token ID466
Feature activation-2.87e-02

Pos logit contributions
pped+0.056
 it+0.054
ged+0.054
pping+0.047
 risking+0.046
Neg logit contributions
 biggest-0.065
 world-0.062
 jungle-0.061
 former-0.060
 sun-0.060
Token,"
Token ID553
Feature activation-1.78e-02

Pos logit contributions
 it+0.033
pped+0.032
pping+0.029
ged+0.028
assed+0.028
Neg logit contributions
 biggest-0.057
 sun-0.053
 world-0.052
ini-0.052
umm-0.051
Token said
Token ID531
Feature activation-2.95e-02

Pos logit contributions
 it+0.030
pped+0.028
pping+0.026
 let+0.023
 him+0.023
Neg logit contributions
 biggest-0.035
ini-0.033
umm-0.032
err-0.031
 jungle-0.031
Token Lily
Token ID20037
Feature activation-3.40e-03

Pos logit contributions
 it+0.048
pped+0.046
pping+0.044
 let+0.039
 him+0.037
Neg logit contributions
 biggest-0.057
ini-0.051
umm-0.051
 sun-0.051
 elderly-0.051
Token,
Token ID11
Feature activation-2.49e-02

Pos logit contributions
 it+0.005
pped+0.004
pping+0.004
 risking+0.004
 whoever+0.003
Neg logit contributions
 biggest-0.008
 eldest-0.007
ini-0.007
 youngest-0.007
 best-0.007
Token feeling
Token ID4203
Feature activation-4.81e-03

Pos logit contributions
 it+0.042
pped+0.037
pping+0.033
 him+0.033
 whoever+0.031
Neg logit contributions
 biggest-0.046
 elderly-0.044
 eldest-0.043
 youngest-0.043
 sun-0.043
Token a
Token ID257
Feature activation+4.71e-02

Pos logit contributions
 it+0.011
pped+0.010
pping+0.009
 let+0.009
 him+0.009
Neg logit contributions
 biggest-0.007
ini-0.006
umm-0.006
err-0.006
 jungle-0.006
Token bit
Token ID1643
Feature activation+9.99e-03

Pos logit contributions
 biggest+0.056
umm+0.055
ini+0.054
err+0.049
 jungle+0.048
Neg logit contributions
 it-0.124
 let-0.109
pped-0.106
pping-0.100
 him-0.100
Token disappointed
Token ID11679
Feature activation-2.01e-02

Pos logit contributions
 biggest+0.018
ini+0.017
umm+0.017
err+0.016
 jungle+0.016
Neg logit contributions
 it-0.020
pped-0.017
 let-0.016
 him-0.016
pping-0.016
Token.
Token ID13
Feature activation+7.06e-04

Pos logit contributions
pped+0.027
 it+0.027
pping+0.024
 whoever+0.021
assed+0.021
Neg logit contributions
 biggest-0.044
umm-0.042
 sun-0.042
 best-0.042
err-0.039
Token
Token ID198
Feature activation+1.75e-02

Pos logit contributions
 biggest+0.001
ini+0.001
umm+0.001
err+0.001
 jungle+0.001
Neg logit contributions
 it-0.001
pped-0.001
 him-0.001
 let-0.001
pping-0.001
Token
Token ID198
Feature activation+1.28e-02

Pos logit contributions
 biggest+0.038
umm+0.037
ini+0.036
err+0.034
 tallest+0.033
Neg logit contributions
 it-0.030
pped-0.024
 him-0.023
 let-0.022
pping-0.021
Token a
Token ID257
Feature activation+7.03e-02

Pos logit contributions
 biggest+0.025
ini+0.024
umm+0.023
err+0.023
 jungle+0.023
Neg logit contributions
 it-0.018
pped-0.015
pping-0.014
 let-0.014
 him-0.014
Token time
Token ID640
Feature activation+6.54e-03

Pos logit contributions
 biggest+0.129
umm+0.126
ini+0.122
 eldest+0.112
 oldest+0.111
Neg logit contributions
 it-0.141
 let-0.125
pped-0.118
 him-0.115
pping-0.114
Token,
Token ID11
Feature activation+6.12e-03

Pos logit contributions
 biggest+0.017
ini+0.016
umm+0.016
err+0.016
 jungle+0.016
Neg logit contributions
 it-0.008
pped-0.006
 him-0.005
 let-0.005
pping-0.005
Token there
Token ID612
Feature activation+5.57e-03

Pos logit contributions
 biggest+0.017
ini+0.016
umm+0.015
err+0.015
 jungle+0.015
Neg logit contributions
 it-0.006
pped-0.005
pping-0.004
 him-0.004
 let-0.004
Token was
Token ID373
Feature activation+8.04e-03

Pos logit contributions
 biggest+0.010
ini+0.009
 best+0.009
 jungle+0.009
 world+0.009
Neg logit contributions
pped-0.010
pping-0.009
 it-0.009
 risking-0.008
 let-0.008
Token a
Token ID257
Feature activation+6.61e-02

Pos logit contributions
umm+0.022
ini+0.022
 biggest+0.021
err+0.020
 former+0.020
Neg logit contributions
 it-0.010
 him-0.007
pped-0.007
 let-0.006
 instead-0.005
Token boy
Token ID2933
Feature activation+1.80e-02

Pos logit contributions
umm+0.098
err+0.087
 former+0.087
 biggest+0.083
omm+0.079
Neg logit contributions
 it-0.169
 let-0.152
 who-0.127
 don-0.126
 that-0.126
Token named
Token ID3706
Feature activation+3.13e-03

Pos logit contributions
 biggest+0.032
ini+0.032
 jungle+0.030
umm+0.029
 sun+0.028
Neg logit contributions
 it-0.034
pped-0.030
pping-0.028
 him-0.027
 let-0.026
Token Tim
Token ID5045
Feature activation+4.55e-02

Pos logit contributions
 biggest+0.008
ini+0.008
umm+0.008
err+0.007
 jungle+0.007
Neg logit contributions
 it-0.004
pped-0.003
 him-0.003
 let-0.003
pping-0.003
Tokenmy
Token ID1820
Feature activation+2.33e-02

Pos logit contributions
 biggest+0.056
ini+0.054
 jungle+0.050
err+0.049
 world+0.046
Neg logit contributions
 it-0.110
pped-0.100
pping-0.094
 whoever-0.092
 risking-0.089
Token.
Token ID13
Feature activation+3.34e-03

Pos logit contributions
 biggest+0.055
ini+0.055
 jungle+0.053
err+0.051
 sun+0.051
Neg logit contributions
 it-0.028
pped-0.025
pping-0.022
 whoever-0.019
 him-0.019
Token crib
Token ID48083
Feature activation-4.65e-02

Pos logit contributions
 biggest+0.073
umm+0.065
 sun+0.061
ini+0.060
 world+0.060
Neg logit contributions
 it-0.118
pped-0.114
pping-0.107
 let-0.103
 him-0.102
Token for
Token ID329
Feature activation-3.71e-02

Pos logit contributions
pped+0.065
 it+0.059
pping+0.055
 him+0.050
ged+0.048
Neg logit contributions
 biggest-0.107
umm-0.098
 best-0.096
 sun-0.095
ini-0.094
Token her
Token ID607
Feature activation+4.07e-02

Pos logit contributions
 it+0.052
pped+0.046
pping+0.041
 let+0.040
 him+0.038
Neg logit contributions
 biggest-0.084
umm-0.078
ini-0.078
 sun-0.075
 jungle-0.075
Token.
Token ID13
Feature activation-7.00e-03

Pos logit contributions
 biggest+0.084
umm+0.077
ini+0.076
err+0.073
 best+0.073
Neg logit contributions
 it-0.069
pped-0.063
pping-0.057
 him-0.054
 let-0.053
Tokenâ
Token ID22940
Feature activation+6.86e-02

Pos logit contributions
 it+0.013
 let+0.010
pped+0.010
 him+0.010
 instead+0.009
Neg logit contributions
 biggest-0.014
ini-0.014
umm-0.013
err-0.013
 jungle-0.013
Token€
Token ID26391
Feature activation+4.40e-02

Pos logit contributions
ini+0.101
 biggest+0.101
umm+0.100
err+0.094
 sun+0.087
Neg logit contributions
 it-0.154
pped-0.132
 him-0.127
 let-0.120
 whoever-0.119
Token
Token ID220
Feature activation+1.87e-02

Pos logit contributions
 biggest+0.074
ini+0.070
 jungle+0.069
 countryside+0.068
 neighbourhood+0.068
Neg logit contributions
 it-0.079
pped-0.078
 him-0.070
pping-0.069
 let-0.066
Token
Token ID198
Feature activation+1.32e-02

Pos logit contributions
 biggest+0.035
ini+0.033
umm+0.033
err+0.032
 jungle+0.031
Neg logit contributions
 it-0.035
pped-0.030
pping-0.027
 let-0.027
 him-0.027
Token
Token ID198
Feature activation+1.20e-02

Pos logit contributions
 biggest+0.026
ini+0.024
umm+0.024
err+0.023
 jungle+0.022
Neg logit contributions
 it-0.024
pped-0.021
 him-0.019
 let-0.019
pping-0.019
TokenThe
Token ID464
Feature activation+1.14e-01

Pos logit contributions
 biggest+0.022
ini+0.021
umm+0.021
err+0.020
 jungle+0.019
Neg logit contributions
 it-0.023
pped-0.020
 him-0.018
pping-0.018
 let-0.018
Token day
Token ID1110
Feature activation-2.40e-02

Pos logit contributions
ini+0.170
umm+0.152
err+0.149
 biggest+0.145
 jungle+0.135
Neg logit contributions
 it-0.274
 let-0.228
 him-0.219
 instead-0.218
pped-0.216
TokenThe
Token ID464
Feature activation+1.19e-01

Pos logit contributions
 biggest+0.042
ini+0.040
err+0.038
umm+0.038
 jungle+0.038
Neg logit contributions
 it-0.046
pped-0.040
pping-0.038
 let-0.037
 him-0.037
Token End
Token ID5268
Feature activation+2.56e-02

Pos logit contributions
umm+0.191
omm+0.174
ini+0.167
err+0.160
 biggest+0.159
Neg logit contributions
 it-0.278
 let-0.236
 the-0.215
 instead-0.212
 that-0.212
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
 biggest+0.061
ini+0.058
umm+0.058
err+0.057
 jungle+0.056
Neg logit contributions
 it-0.033
pped-0.029
pping-0.026
 him-0.025
 let-0.024
Token<|endoftext|>
Token ID50256
Feature activation+2.74e-02

Pos logit contributions
 biggest+0.021
umm+0.019
ini+0.019
err+0.019
 jungle+0.018
Neg logit contributions
 it-0.020
pped-0.018
pping-0.016
 him-0.016
 let-0.016
TokenJohn
Token ID7554
Feature activation-4.82e-03

Pos logit contributions
 biggest+0.037
ini+0.034
umm+0.033
err+0.032
 jungle+0.032
Neg logit contributions
 it-0.064
pped-0.058
pping-0.056
 let-0.055
 him-0.054
Token was
Token ID373
Feature activation+5.79e-03

Pos logit contributions
 it+0.008
pped+0.007
pping+0.007
 risking+0.006
 whoever+0.006
Neg logit contributions
 jungle-0.009
 biggest-0.009
ini-0.009
 neighbourhood-0.008
 countryside-0.008
Token excited
Token ID6568
Feature activation-2.25e-02

Pos logit contributions
umm+0.014
ini+0.014
 biggest+0.014
err+0.013
 former+0.012
Neg logit contributions
 it-0.008
pped-0.006
 him-0.006
pping-0.006
 let-0.006
Token.
Token ID13
Feature activation+3.50e-03

Pos logit contributions
pped+0.027
 it+0.026
pping+0.023
 let+0.019
 risking+0.019
Neg logit contributions
 biggest-0.055
umm-0.052
 sun-0.051
ini-0.051
 world-0.050
Token He
Token ID679
Feature activation-2.53e-03

Pos logit contributions
 biggest+0.006
ini+0.005
umm+0.005
err+0.005
 jungle+0.005
Neg logit contributions
 it-0.008
pped-0.006
 him-0.006
 let-0.006
pping-0.006
Token had
Token ID550
Feature activation-9.60e-03

Pos logit contributions
 it+0.004
pped+0.004
pping+0.004
 risking+0.003
 him+0.003
Neg logit contributions
 biggest-0.005
ini-0.005
 jungle-0.005
 sun-0.004
 world-0.004
Token been
Token ID587
Feature activation+3.73e-04

Pos logit contributions
 it+0.015
pped+0.013
pping+0.012
 him+0.012
 let+0.012
Neg logit contributions
 biggest-0.021
ini-0.019
umm-0.019
 jungle-0.018
 sun-0.018
Token never
Token ID1239
Feature activation-1.84e-02

Pos logit contributions
 it+0.020
pped+0.018
pping+0.016
 him+0.016
 let+0.014
Neg logit contributions
 biggest-0.030
ini-0.028
err-0.028
umm-0.028
 jungle-0.027
Token told
Token ID1297
Feature activation-4.15e-02

Pos logit contributions
 it+0.030
pped+0.028
pping+0.026
 him+0.023
 whoever+0.022
Neg logit contributions
 biggest-0.036
 jungle-0.033
 youngest-0.033
ini-0.033
 best-0.033
Token a
Token ID257
Feature activation+5.40e-02

Pos logit contributions
pped+0.040
 it+0.038
pping+0.033
ged+0.032
 let+0.029
Neg logit contributions
 biggest-0.106
 sun-0.099
 best-0.098
umm-0.095
 elderly-0.095
Token lie
Token ID6486
Feature activation-7.33e-03

Pos logit contributions
 biggest+0.059
ini+0.058
umm+0.058
err+0.052
omm+0.051
Neg logit contributions
 it-0.148
 let-0.127
 him-0.118
 whoever-0.116
 instead-0.115
Token again
Token ID757
Feature activation-2.93e-02

Pos logit contributions
pped+0.009
 it+0.009
pping+0.008
 whoever+0.007
assed+0.007
Neg logit contributions
 biggest-0.017
 best-0.015
ini-0.015
 world-0.015
umm-0.015
Token either
Token ID2035
Feature activation+5.78e-03

Pos logit contributions
pped+0.044
 it+0.040
pping+0.036
ged+0.033
 him+0.032
Neg logit contributions
 biggest-0.059
 world-0.058
 best-0.057
 jungle-0.056
 elderly-0.055
Token-
Token ID12
Feature activation+8.73e-03

Pos logit contributions
 biggest+0.013
ini+0.013
umm+0.013
err+0.013
 world+0.012
Neg logit contributions
 it-0.008
pped-0.007
pping-0.006
 him-0.006
 whoever-0.005
Token everything
Token ID2279
Feature activation-3.16e-02

Pos logit contributions
 biggest+0.016
ini+0.016
umm+0.015
err+0.014
 jungle+0.014
Neg logit contributions
 it-0.016
pped-0.013
pping-0.013
 let-0.013
 him-0.013
Token it
Token ID340
Feature activation-2.87e-02

Pos logit contributions
pped+0.052
pping+0.043
assed+0.040
ged+0.040
sels+0.036
Neg logit contributions
 biggest-0.066
 best-0.062
 world-0.061
 new-0.060
 old-0.059
Token said
Token ID531
Feature activation-1.49e-02

Pos logit contributions
 it+0.043
pped+0.042
pping+0.038
 whoever+0.034
ged+0.034
Neg logit contributions
 biggest-0.059
ini-0.053
 best-0.053
err-0.053
 world-0.053
Token was
Token ID373
Feature activation-1.43e-02

Pos logit contributions
pped+0.022
 argued+0.019
ck+0.019
assed+0.019
ged+0.018
Neg logit contributions
 biggest-0.031
 best-0.030
 sun-0.028
 world-0.027
 new-0.027
Token,
Token ID11
Feature activation-5.18e-03

Pos logit contributions
pped+0.022
 it+0.019
pping+0.019
 let+0.017
 whoever+0.017
Neg logit contributions
 biggest-0.026
 world-0.025
 jungle-0.025
 best-0.025
 sun-0.024
Token but
Token ID475
Feature activation-6.21e-03

Pos logit contributions
 it+0.008
pped+0.007
pping+0.006
 him+0.006
 let+0.006
Neg logit contributions
 biggest-0.011
ini-0.011
umm-0.010
 jungle-0.010
err-0.010
Token they
Token ID484
Feature activation-6.15e-03

Pos logit contributions
pped+0.006
 it+0.006
pping+0.006
 him+0.004
ged+0.004
Neg logit contributions
 biggest-0.016
 best-0.015
 sun-0.015
ini-0.015
 elderly-0.015
Token also
Token ID635
Feature activation-1.01e-02

Pos logit contributions
 it+0.010
pped+0.009
pping+0.009
 whoever+0.008
 him+0.008
Neg logit contributions
 biggest-0.012
 youngest-0.011
ini-0.011
 best-0.011
 eldest-0.011
Token loved
Token ID6151
Feature activation-1.81e-02

Pos logit contributions
 it+0.017
pped+0.016
pping+0.015
 him+0.014
 whoever+0.013
Neg logit contributions
 biggest-0.019
ini-0.018
 jungle-0.017
 elderly-0.017
err-0.017
Token sleeping
Token ID11029
Feature activation-1.64e-02

Pos logit contributions
pped+0.024
 it+0.023
pping+0.021
 let+0.018
ged+0.018
Neg logit contributions
 biggest-0.041
 sun-0.039
 jungle-0.039
 elderly-0.038
 best-0.038
Token.
Token ID13
Feature activation+7.68e-03

Pos logit contributions
pped+0.021
 it+0.021
pping+0.019
 let+0.017
 whoever+0.017
Neg logit contributions
 biggest-0.038
 best-0.035
 sun-0.035
 world-0.035
umm-0.034
Token\n
Token ID198
Feature activation+2.78e-02

Pos logit contributions
ini+0.018
 biggest+0.018
umm+0.017
err+0.016
 tallest+0.016
Neg logit contributions
 it-0.012
 let-0.009
 him-0.009
 instead-0.008
pped-0.008
Token\n
Token ID198
Feature activation+1.63e-02

Pos logit contributions
 biggest+0.058
umm+0.058
ini+0.055
err+0.052
 tallest+0.051
Neg logit contributions
 it-0.050
pped-0.039
 him-0.039
 let-0.037
pping-0.035
TokenThey
Token ID2990
Feature activation-1.89e-02

Pos logit contributions
 biggest+0.034
ini+0.032
umm+0.031
err+0.031
 jungle+0.030
Neg logit contributions
 it-0.027
pped-0.023
pping-0.021
 him-0.021
 let-0.020
Token kissed
Token ID27946
Feature activation-4.82e-02

Pos logit contributions
 it+0.030
pped+0.025
pping+0.024
 risking+0.022
 him+0.022
Neg logit contributions
 biggest-0.038
 youngest-0.036
 eldest-0.036
 sun-0.035
 best-0.035
Token play
Token ID711
Feature activation-3.14e-02

Pos logit contributions
 it+0.049
pped+0.048
pping+0.045
ged+0.040
 risking+0.039
Neg logit contributions
 biggest-0.079
ini-0.072
 sun-0.071
umm-0.070
 jungle-0.069
Token with
Token ID351
Feature activation-4.56e-02

Pos logit contributions
pped+0.042
pping+0.042
 risking+0.038
ge+0.038
assed+0.036
Neg logit contributions
 biggest-0.068
 sun-0.066
 best-0.061
 new-0.059
 world-0.059
Token it
Token ID340
Feature activation-1.23e-02

Pos logit contributions
pping+0.032
pped+0.028
assed+0.021
ocated+0.020
 risking+0.018
Neg logit contributions
 biggest-0.135
 sun-0.131
 new-0.126
 best-0.120
 sky-0.118
Token.
Token ID13
Feature activation-4.05e-03

Pos logit contributions
pped+0.021
pping+0.018
ged+0.017
 it+0.017
assed+0.017
Neg logit contributions
 biggest-0.026
 best-0.026
 new-0.023
 sun-0.023
 tallest-0.022
Token
Token ID220
Feature activation+1.71e-02

Pos logit contributions
 it+0.007
pped+0.006
pping+0.006
 him+0.006
 let+0.006
Neg logit contributions
 biggest-0.008
ini-0.007
umm-0.007
err-0.007
 jungle-0.007
Token\n
Token ID198
Feature activation-3.77e-03

Pos logit contributions
 biggest+0.033
umm+0.032
ini+0.032
err+0.030
 jungle+0.029
Neg logit contributions
 it-0.031
pped-0.026
 him-0.024
 let-0.024
pping-0.024
Token\n
Token ID198
Feature activation+1.92e-03

Pos logit contributions
 it+0.007
pped+0.005
 him+0.005
 let+0.005
pping+0.005
Neg logit contributions
 biggest-0.008
umm-0.007
ini-0.007
err-0.007
 jungle-0.007
Token"
Token ID1
Feature activation-7.38e-03

Pos logit contributions
 biggest+0.004
umm+0.004
ini+0.004
err+0.003
 jungle+0.003
Neg logit contributions
 it-0.003
pped-0.003
 him-0.003
 let-0.003
pping-0.003
TokenLet
Token ID5756
Feature activation-3.07e-02

Pos logit contributions
 it+0.014
pped+0.012
 let+0.011
 him+0.011
pping+0.011
Neg logit contributions
 biggest-0.014
ini-0.013
umm-0.013
err-0.013
 jungle-0.012
Token's
Token ID338
Feature activation-1.13e-02

Pos logit contributions
 it+0.042
pped+0.036
pping+0.033
 him+0.031
 let+0.031
Neg logit contributions
 biggest-0.072
ini-0.067
umm-0.065
err-0.064
 jungle-0.063
Token decor
Token ID11705
Feature activation+1.43e-02

Pos logit contributions
 it+0.018
pped+0.017
ged+0.016
pping+0.015
 risking+0.015
Neg logit contributions
 biggest-0.023
 sun-0.021
ini-0.020
 jungle-0.020
 youngest-0.020
Token.
Token ID13
Feature activation+9.59e-04

Pos logit contributions
pped+0.044
pping+0.043
 whoever+0.043
 it+0.043
 breathe+0.042
Neg logit contributions
 biggest-0.082
 best-0.076
ini-0.075
 sun-0.075
err-0.073
Token\n
Token ID198
Feature activation+1.13e-02

Pos logit contributions
 biggest+0.002
ini+0.002
umm+0.002
ubby+0.002
err+0.002
Neg logit contributions
 it-0.002
 let-0.001
 him-0.001
pped-0.001
 instead-0.001
Token\n
Token ID198
Feature activation+6.04e-03

Pos logit contributions
 biggest+0.022
umm+0.021
ini+0.021
err+0.020
 jungle+0.020
Neg logit contributions
 it-0.020
pped-0.017
 him-0.016
 let-0.016
pping-0.015
TokenHer
Token ID9360
Feature activation+7.25e-02

Pos logit contributions
 biggest+0.011
umm+0.010
ini+0.010
err+0.010
 jungle+0.009
Neg logit contributions
 it-0.012
pped-0.011
 let-0.010
pping-0.010
 him-0.010
Token mom
Token ID1995
Feature activation-2.13e-03

Pos logit contributions
umm+0.168
ini+0.150
err+0.147
 biggest+0.147
 countryside+0.147
Neg logit contributions
 it-0.127
 let-0.099
 that-0.092
 how-0.087
 the-0.086
Token was
Token ID373
Feature activation+1.65e-02

Pos logit contributions
 it+0.003
pping+0.003
pped+0.003
 him+0.002
 risking+0.002
Neg logit contributions
ini-0.004
 biggest-0.004
 jungle-0.004
err-0.004
 eldest-0.004
Token proud
Token ID6613
Feature activation+4.59e-03

Pos logit contributions
 biggest+0.026
ini+0.023
 elderly+0.022
 best+0.022
umm+0.022
Neg logit contributions
 it-0.035
pped-0.031
 him-0.030
pping-0.029
 let-0.029
Token of
Token ID286
Feature activation-2.50e-02

Pos logit contributions
 biggest+0.008
ini+0.007
umm+0.007
 sun+0.007
 elderly+0.007
Neg logit contributions
 it-0.009
pped-0.008
pping-0.008
 him-0.007
 let-0.007
Token her
Token ID607
Feature activation+5.08e-02

Pos logit contributions
 it+0.032
pped+0.030
pping+0.030
 let+0.029
sels+0.028
Neg logit contributions
 biggest-0.047
ubby-0.045
err-0.045
ini-0.045
 sun-0.045
Token for
Token ID329
Feature activation-3.06e-02

Pos logit contributions
 biggest+0.119
ini+0.113
umm+0.112
err+0.109
 jungle+0.106
Neg logit contributions
 it-0.072
pped-0.061
pping-0.053
 him-0.053
 let-0.052
Token being
Token ID852
Feature activation+2.81e-02

Pos logit contributions
ched+0.042
 it+0.042
pped+0.041
sels+0.040
pping+0.037
Neg logit contributions
 elderly-0.056
 best-0.053
 biggest-0.052
 older-0.049
 new-0.049
Token good
Token ID922
Feature activation+8.05e-03

Pos logit contributions
 biggest+0.014
ini+0.012
err+0.012
umm+0.012
 jungle+0.012
Neg logit contributions
 it-0.019
pped-0.017
 him-0.016
pping-0.016
 let-0.015
Token,
Token ID11
Feature activation-2.16e-02

Pos logit contributions
 biggest+0.018
 best+0.017
 sun+0.017
umm+0.016
 older+0.016
Neg logit contributions
 it-0.011
pped-0.011
pping-0.009
 whoever-0.009
 him-0.008
Token but
Token ID475
Feature activation-2.39e-02

Pos logit contributions
 it+0.028
pped+0.023
pping+0.020
 him+0.020
 let+0.020
Neg logit contributions
 biggest-0.053
ini-0.050
umm-0.050
err-0.048
 jungle-0.048
Token they
Token ID484
Feature activation-1.28e-02

Pos logit contributions
 it+0.023
pped+0.021
pping+0.018
 him+0.015
 let+0.015
Neg logit contributions
 biggest-0.063
umm-0.060
ini-0.059
 sun-0.058
 best-0.058
Token can
Token ID460
Feature activation-2.94e-02

Pos logit contributions
pped+0.018
 it+0.018
ged+0.015
pping+0.014
 him+0.014
Neg logit contributions
 biggest-0.028
 best-0.026
ini-0.025
 sun-0.025
 tallest-0.024
Token turn
Token ID1210
Feature activation-3.14e-02

Pos logit contributions
 it+0.043
pped+0.038
ged+0.033
pping+0.033
 him+0.031
Neg logit contributions
 biggest-0.064
ini-0.060
umm-0.058
 sun-0.057
 jungle-0.057
Token out
Token ID503
Feature activation-4.01e-02

Pos logit contributions
 it+0.037
pped+0.036
pping+0.034
 let+0.032
 whoever+0.030
Neg logit contributions
 biggest-0.072
 sun-0.066
ini-0.064
 best-0.064
 older-0.062
Token bad
Token ID2089
Feature activation+1.56e-02

Pos logit contributions
pped+0.048
 it+0.046
pping+0.043
ged+0.038
assed+0.037
Neg logit contributions
 biggest-0.094
 sun-0.085
 best-0.084
ini-0.082
 world-0.082
Token.
Token ID13
Feature activation-4.11e-03

Pos logit contributions
 biggest+0.024
 sun+0.021
umm+0.021
 world+0.020
 best+0.020
Neg logit contributions
 it-0.033
pped-0.032
pping-0.031
 let-0.028
 whoever-0.028
Token We
Token ID775
Feature activation+5.32e-04

Pos logit contributions
 it+0.007
pped+0.006
 let+0.006
 him+0.005
pping+0.005
Neg logit contributions
 biggest-0.009
ini-0.008
umm-0.008
err-0.008
 jungle-0.008
Token just
Token ID655
Feature activation-2.05e-02

Pos logit contributions
 biggest+0.001
ini+0.001
 best+0.001
 sun+0.001
 world+0.001
Neg logit contributions
 it-0.001
pped-0.001
pping-0.001
ged-0.001
 him-0.001
Token face
Token ID1986
Feature activation-3.89e-02

Pos logit contributions
 biggest+0.082
umm+0.078
ini+0.077
err+0.073
 jungle+0.070
Neg logit contributions
 it-0.117
pped-0.102
 him-0.097
 let-0.097
pping-0.096
Token.
Token ID13
Feature activation-9.28e-03

Pos logit contributions
 it+0.049
pped+0.045
pping+0.041
 whoever+0.041
 risking+0.039
Neg logit contributions
 biggest-0.088
 sun-0.082
 best-0.079
 sky-0.079
umm-0.078
Token It
Token ID632
Feature activation-1.73e-02

Pos logit contributions
 it+0.017
pped+0.014
 him+0.013
 let+0.013
pping+0.013
Neg logit contributions
 biggest-0.018
ini-0.017
umm-0.017
err-0.016
 jungle-0.016
Token was
Token ID373
Feature activation-1.37e-02

Pos logit contributions
 it+0.027
pped+0.024
pping+0.022
 him+0.021
 let+0.020
Neg logit contributions
 biggest-0.037
ini-0.035
umm-0.034
err-0.034
 jungle-0.033
Token cold
Token ID4692
Feature activation-3.24e-03

Pos logit contributions
 it+0.024
pped+0.020
pping+0.019
 let+0.018
 him+0.018
Neg logit contributions
 biggest-0.028
umm-0.026
ini-0.026
err-0.025
 jungle-0.024
Token against
Token ID1028
Feature activation-3.19e-02

Pos logit contributions
 it+0.004
pped+0.004
 let+0.003
pping+0.003
 him+0.003
Neg logit contributions
 biggest-0.008
umm-0.007
ini-0.007
 sun-0.007
 jungle-0.007
Token her
Token ID607
Feature activation+4.77e-02

Pos logit contributions
 it+0.033
pped+0.026
pping+0.023
 let+0.022
 him+0.020
Neg logit contributions
 biggest-0.085
umm-0.081
ini-0.081
err-0.079
 jungle-0.078
Token skin
Token ID4168
Feature activation-1.86e-02

Pos logit contributions
 biggest+0.089
umm+0.082
ini+0.082
err+0.077
 jungle+0.077
Neg logit contributions
 it-0.091
pped-0.080
pping-0.073
 let-0.072
 him-0.072
Token and
Token ID290
Feature activation-3.24e-02

Pos logit contributions
 it+0.027
pped+0.026
 whoever+0.025
 risking+0.022
pping+0.022
Neg logit contributions
 biggest-0.038
 sun-0.035
 sky-0.035
 best-0.035
umm-0.034
Token made
Token ID925
Feature activation-2.81e-02

Pos logit contributions
pped+0.041
pping+0.040
 it+0.038
 whoever+0.032
 him+0.032
Neg logit contributions
 biggest-0.072
 sun-0.067
 jungle-0.064
 sky-0.064
 best-0.064
Token her
Token ID607
Feature activation+3.19e-02

Pos logit contributions
 it+0.032
pped+0.024
 him+0.021
 let+0.021
pping+0.021
Neg logit contributions
 biggest-0.074
ini-0.071
umm-0.071
err-0.068
 jungle-0.067
Token to
Token ID284
Feature activation-3.27e-02

Pos logit contributions
pped+0.055
 it+0.051
pping+0.047
 let+0.041
 whoever+0.041
Neg logit contributions
 biggest-0.112
 older-0.105
 best-0.105
 sun-0.105
 new-0.105
Token make
Token ID787
Feature activation-3.15e-02

Pos logit contributions
 it+0.055
pped+0.049
ged+0.045
pping+0.045
 risking+0.044
Neg logit contributions
 biggest-0.061
ini-0.057
 elderly-0.056
 eldest-0.055
 youngest-0.054
Token him
Token ID683
Feature activation-1.06e-02

Pos logit contributions
 it+0.016
pped+0.013
pping+0.010
 let+0.008
 instead+0.005
Neg logit contributions
 biggest-0.098
ini-0.092
umm-0.092
 sun-0.090
 best-0.089
Token feel
Token ID1254
Feature activation+1.08e-02

Pos logit contributions
 it+0.019
pped+0.019
pping+0.017
 whoever+0.016
 risking+0.016
Neg logit contributions
 biggest-0.020
 best-0.017
ini-0.017
 older-0.016
 elderly-0.016
Token better
Token ID1365
Feature activation-8.22e-03

Pos logit contributions
 biggest+0.018
ini+0.015
 sun+0.015
umm+0.015
 best+0.015
Neg logit contributions
 it-0.022
pped-0.020
pping-0.019
 let-0.019
 him-0.018
Token by
Token ID416
Feature activation-3.62e-02

Pos logit contributions
pped+0.013
 it+0.012
pping+0.010
 whoever+0.010
 him+0.009
Neg logit contributions
 best-0.016
 biggest-0.016
umm-0.015
 elderly-0.015
 sun-0.015
Token ordering
Token ID16216
Feature activation-3.24e-02

Pos logit contributions
 it+0.054
pped+0.047
pping+0.042
 let+0.041
 him+0.040
Neg logit contributions
 biggest-0.078
ini-0.074
umm-0.073
 sun-0.071
err-0.070
Token some
Token ID617
Feature activation+4.25e-02

Pos logit contributions
 it+0.040
pped+0.038
pping+0.035
 let+0.032
 whoever+0.028
Neg logit contributions
 biggest-0.076
 sun-0.070
umm-0.069
 best-0.068
ini-0.068
Token new
Token ID649
Feature activation+7.83e-02

Pos logit contributions
 biggest+0.067
ini+0.062
umm+0.062
err+0.059
 jungle+0.057
Neg logit contributions
 it-0.093
pped-0.082
 him-0.077
 let-0.077
pping-0.077
Token red
Token ID2266
Feature activation+6.62e-02

Pos logit contributions
umm+0.146
 biggest+0.145
ini+0.143
err+0.135
omm+0.131
Neg logit contributions
 it-0.151
pped-0.120
 him-0.120
 let-0.118
pping-0.113
Token socks
Token ID24359
Feature activation-3.83e-02

Pos logit contributions
 biggest+0.105
ini+0.094
umm+0.093
 sun+0.090
err+0.089
Neg logit contributions
 it-0.141
pped-0.129
pping-0.120
 let-0.118
 whoever-0.114
Token so
Token ID523
Feature activation-3.49e-03

Pos logit contributions
 it+0.058
pped+0.053
pping+0.050
 him+0.049
 let+0.048
Neg logit contributions
 biggest-0.046
umm-0.042
ini-0.041
err-0.039
 sun-0.038
Token delicious
Token ID12625
Feature activation-1.73e-02

Pos logit contributions
 it+0.009
pped+0.008
 him+0.008
 let+0.008
pping+0.008
Neg logit contributions
 biggest-0.004
ini-0.004
umm-0.004
err-0.003
 jungle-0.003
Token.
Token ID13
Feature activation-9.18e-03

Pos logit contributions
pped+0.028
pping+0.027
 risking+0.024
assed+0.023
 breathe+0.023
Neg logit contributions
umm-0.036
 sun-0.035
 biggest-0.034
 best-0.032
ummy-0.031
Token She
Token ID1375
Feature activation-1.03e-02

Pos logit contributions
 it+0.015
 let+0.011
 him+0.011
pped+0.011
 how+0.010
Neg logit contributions
ini-0.020
 biggest-0.020
umm-0.019
err-0.019
ubby-0.019
Token really
Token ID1107
Feature activation+1.52e-04

Pos logit contributions
 it+0.015
pped+0.013
pping+0.012
 him+0.012
 risking+0.011
Neg logit contributions
 biggest-0.022
ini-0.021
umm-0.020
err-0.020
 jungle-0.020
Token wanted
Token ID2227
Feature activation-4.56e-02

Pos logit contributions
 biggest+0.000
ini+0.000
umm+0.000
 sun+0.000
err+0.000
Neg logit contributions
 it-0.000
pped-0.000
pping-0.000
 him-0.000
 risking-0.000
Token to
Token ID284
Feature activation-3.10e-02

Pos logit contributions
pped+0.045
pping+0.042
ged+0.038
 it+0.036
assed+0.033
Neg logit contributions
 biggest-0.119
 sun-0.109
 best-0.108
 new-0.106
 elderly-0.103
Token eat
Token ID4483
Feature activation-5.38e-02

Pos logit contributions
 it+0.041
pped+0.038
pping+0.035
ged+0.035
 risking+0.031
Neg logit contributions
 biggest-0.069
umm-0.065
 sun-0.065
 jungle-0.063
ini-0.063
Token it
Token ID340
Feature activation-2.42e-02

Pos logit contributions
pped+0.039
pping+0.033
ged+0.029
 fought+0.028
 risking+0.025
Neg logit contributions
 biggest-0.152
 sun-0.143
 best-0.142
 new-0.141
 most-0.133
Token,
Token ID11
Feature activation-2.60e-02

Pos logit contributions
pped+0.035
 it+0.032
 risking+0.031
ged+0.030
pping+0.030
Neg logit contributions
 biggest-0.055
 best-0.051
umm-0.048
 sun-0.047
 tallest-0.047
Token but
Token ID475
Feature activation-1.85e-02

Pos logit contributions
pped+0.041
 it+0.036
pping+0.036
ged+0.034
 him+0.031
Neg logit contributions
 biggest-0.055
 sun-0.050
 best-0.048
 elderly-0.047
umm-0.047
Token playing
Token ID2712
Feature activation-1.85e-02

Pos logit contributions
pped+0.021
pping+0.018
ged+0.017
 it+0.015
ched+0.015
Neg logit contributions
 biggest-0.048
 sun-0.047
 jungle-0.047
 elderly-0.045
 new-0.044
Token outside
Token ID2354
Feature activation-3.22e-02

Pos logit contributions
pped+0.024
 it+0.022
pping+0.022
ge+0.020
 let+0.019
Neg logit contributions
 biggest-0.041
 jungle-0.041
 sun-0.039
 world-0.038
 best-0.037
Token every
Token ID790
Feature activation+4.36e-02

Pos logit contributions
pped+0.041
 it+0.041
pping+0.035
 let+0.033
 whoever+0.032
Neg logit contributions
 biggest-0.074
 jungle-0.069
 best-0.068
umm-0.068
 world-0.068
Token day
Token ID1110
Feature activation-2.50e-02

Pos logit contributions
 biggest+0.049
ini+0.047
umm+0.046
omm+0.043
 former+0.043
Neg logit contributions
 it-0.120
 let-0.101
pped-0.101
 him-0.097
 instead-0.096
Token and
Token ID290
Feature activation-2.74e-02

Pos logit contributions
pped+0.035
 it+0.032
pping+0.029
 let+0.027
 risking+0.026
Neg logit contributions
 biggest-0.054
 sun-0.052
umm-0.051
 jungle-0.051
 best-0.050
Token recognizing
Token ID22650
Feature activation-2.68e-02

Pos logit contributions
pped+0.036
 it+0.032
pping+0.031
ged+0.025
assed+0.025
Neg logit contributions
 biggest-0.062
 jungle-0.059
 sun-0.058
 best-0.055
ini-0.055
Token different
Token ID1180
Feature activation+3.76e-02

Pos logit contributions
 it+0.024
pped+0.024
pping+0.020
 let+0.018
ged+0.016
Neg logit contributions
 biggest-0.073
umm-0.067
ini-0.067
 jungle-0.066
 sun-0.066
Token things
Token ID1243
Feature activation-2.74e-02

Pos logit contributions
 biggest+0.051
umm+0.046
ini+0.046
err+0.043
 jungle+0.042
Neg logit contributions
 it-0.090
pped-0.082
pping-0.077
 let-0.077
 him-0.076
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
pped+0.050
pping+0.041
assed+0.040
 argued+0.039
ged+0.039
Neg logit contributions
 biggest-0.053
 best-0.052
 new-0.048
 most-0.045
 sun-0.044
Token It
Token ID632
Feature activation-8.19e-03

Pos logit contributions
 biggest+0.021
ini+0.021
umm+0.020
err+0.019
 eldest+0.019
Neg logit contributions
 it-0.021
 him-0.017
pped-0.016
 let-0.016
pping-0.015
Token was
Token ID373
Feature activation-6.17e-03

Pos logit contributions
 it+0.012
pped+0.009
 let+0.009
 him+0.008
pping+0.008
Neg logit contributions
 biggest-0.019
umm-0.019
ini-0.018
err-0.018
 jungle-0.017