TOP ACTIVATIONS
MAX = 7.028

wanted to bounce on the tr amp oline , but Mia
Lily . She loved eating w affles for breakfast every day
" Let 's give each other a haircut !" Tom
noise . " Crunch , crunch , crunch .
'm sorry " and hugged each other . They
Tim loved to eat warm w affles for breakfast . Every
. She loved to eat w affles for breakfast every day
thought they looked like ac rob ats . She enjoyed the
like to jump on the tr amp oline . It is
Lily loved to ride roller co asters and always remembered how
to sing . He sang each morning to the birds in
. She loved to eat w affles for breakfast every day
lesson and never pretended with tr ays again . <|endoftext|> Billy
hurry . They grabbed their tr unks and set off .
sad . She shrugged her shoulders and her eyes filled with
hurry . They grabbed their tr unks and set off .
. She loved to eat w affles for breakfast . One
. She loved to eat w affles for breakfast . Her
. They liked to spray each other and make funny noises
decided to jump on the tr amp oline , but Lily

MIN ACTIVATIONS
MIN = -7.780

bad happened , she was able to make new friends and
before and the toy was able to yield !
was broken and he was able to fix it . He
poked it and it was able to turn the situation around
sun disappeared and they were able to make it safely back
and no - one was able to help her .
, and eventually he was able to buy the gear he
ower until the bear was able to cross it .
it was day a minute ago ." They heard
, that she was not able to play in the park
practiced everyday until he was able to print all of the
and the little girl was able to get out .
and soon , it was able to fly again .
and then they were finally able to leave . From that
the box on her night stand . She hopes to mail
and she will not be able to come back .
and soon Tim my was able to support the cup all
, and the tourist was able to finish his tour happily
then people wouldn 't be able to get the help they
longer weak and she was able to go out and explore

INTERVAL 3.326 - 7.028
CONTAINS 0.930%

and sniff all the smells in the park . One day
when he stopped he felt g iddy , but also something
smiled and said , " Yes , of course we can
there was a duck . He was silly and always looked
graceful !" exclaimed Daisy . S ally smiled

INTERVAL -0.376 - 3.326
CONTAINS 61.846%

sun flower and she was very excited . She yelled to
the garden ! <|endoftext|> Once upon a time , there was
decided it was their new favorite game . They would play
and Mia liked to paint . They had many colors and
what it was , so he became very confused .

INTERVAL -4.078 - -0.376
CONTAINS 37.023%

rabbit through the woods . The rabbit was scared
was so excited that she started to jump and dance around
That sounds scary . I don 't want my rock to
for the leaf . But the leaf is not water .
away . They did not come back to that garden ever

INTERVAL -7.780 - -4.078
CONTAINS 0.202%

happier . By the end of the day , the sour
a line from one side of the room to the other
new comb and it never got hot . The man had
But again he was not able to touch the clouds with
uzz y enjoyed the rest of their walk . <|endoftext|> The
Token wanted
Token ID2227
Feature activation+1.81e+00

Pos logit contributions
 galaxies+0.063
iles+0.060
 obstacles+0.057
 spear+0.057
 warrior+0.056
Neg logit contributions
 Lily-0.036
 chip-0.033
 Sara-0.032
 Fl-0.031
 Wh-0.030
Token to
Token ID284
Feature activation+2.06e+00

Pos logit contributions
 galaxies+0.092
iles+0.088
 spear+0.086
 obstacles+0.085
 warrior+0.083
Neg logit contributions
 Lily-0.030
 chip-0.028
 paramed-0.025
 Sara-0.025
 Fl-0.024
Token bounce
Token ID20110
Feature activation+1.41e+00

Pos logit contributions
 galaxies+0.065
 spear+0.063
iles+0.061
 warrior+0.059
 obstacles+0.058
Neg logit contributions
 Lily-0.066
 chip-0.064
 paramed-0.064
 Sara-0.061
 Fl-0.061
Token on
Token ID319
Feature activation+3.35e+00

Pos logit contributions
 galaxies+0.076
iles+0.074
 warrior+0.070
eur+0.069
 villages+0.069
Neg logit contributions
 Lily-0.029
 Sara-0.022
 Fl-0.020
 v-0.018
 from-0.018
Token the
Token ID262
Feature activation+3.01e+00

Pos logit contributions
 galaxies+0.177
iles+0.169
 spear+0.164
 warrior+0.162
 obstacles+0.161
Neg logit contributions
 Lily-0.054
 chip-0.046
 Sara-0.045
 Fl-0.041
 Wh-0.040
Token tr
Token ID491
Feature activation+7.03e+00

Pos logit contributions
 galaxies+0.115
iles+0.108
 spear+0.102
 obstacles+0.100
 warrior+0.100
Neg logit contributions
 Lily-0.092
 chip-0.087
 Sara-0.084
 Fl-0.080
 Wh-0.079
Tokenamp
Token ID696
Feature activation+4.73e+00

Pos logit contributions
 galaxies+0.277
 obstacles+0.252
 warrior+0.249
 spear+0.247
 questions+0.237
Neg logit contributions
 Lily-0.197
 chip-0.192
stand-0.185
 Sara-0.179
 Wh-0.179
Tokenoline
Token ID14453
Feature activation+1.92e+00

Pos logit contributions
 galaxies+0.109
 spear+0.093
iles+0.091
 obstacles+0.088
 warrior+0.087
Neg logit contributions
 Lily-0.222
 Sara-0.208
 chip-0.206
 Fl-0.199
 Wh-0.198
Token,
Token ID11
Feature activation+1.77e+00

Pos logit contributions
 galaxies+0.084
iles+0.079
 spear+0.077
 obstacles+0.075
 warrior+0.075
Neg logit contributions
 Lily-0.045
 chip-0.042
 paramed-0.041
 Sara-0.041
 Wh-0.039
Token but
Token ID475
Feature activation+1.91e+00

Pos logit contributions
 galaxies+0.089
iles+0.086
 spear+0.080
 obstacles+0.080
 warrior+0.080
Neg logit contributions
 Lily-0.037
 Sara-0.032
 chip-0.029
 Fl-0.029
 Wh-0.027
Token Mia
Token ID32189
Feature activation-8.47e-01

Pos logit contributions
 galaxies+0.062
 spear+0.059
iles+0.056
 obstacles+0.054
 armor+0.054
Neg logit contributions
 chip-0.063
 paramed-0.063
burn-0.061
stand-0.053
 Wh-0.053
Token Lily
Token ID20037
Feature activation-1.08e+00

Pos logit contributions
 paramed+0.032
burn+0.030
 chip+0.030
stand+0.022
paste+0.022
Neg logit contributions
 galaxies-0.053
 questions-0.052
 wonders-0.051
ines-0.050
 cities-0.049
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
 Lily+0.025
 chip+0.025
 Sara+0.023
 paramed+0.023
 Fl+0.022
Neg logit contributions
 galaxies-0.047
iles-0.044
 spear-0.044
 obstacles-0.042
 warrior-0.042
Token She
Token ID1375
Feature activation+2.37e+00

Pos logit contributions
 chip+0.006
 paramed+0.006
burn+0.005
 Lily+0.005
 Sara+0.005
Neg logit contributions
 galaxies-0.008
iles-0.008
 spear-0.007
 obstacles-0.007
 warrior-0.007
Token loved
Token ID6151
Feature activation+2.11e+00

Pos logit contributions
 galaxies+0.069
iles+0.062
 spear+0.060
 warrior+0.057
 obstacles+0.057
Neg logit contributions
 Lily-0.091
 chip-0.089
 Sara-0.086
 paramed-0.085
burn-0.083
Token eating
Token ID6600
Feature activation+1.90e+00

Pos logit contributions
 galaxies+0.101
iles+0.094
 spear+0.093
 obstacles+0.091
 armor+0.088
Neg logit contributions
 Lily-0.040
 chip-0.036
 Sara-0.036
 okay-0.035
 paramed-0.034
Token w
Token ID266
Feature activation+6.75e+00

Pos logit contributions
 galaxies+0.076
iles+0.073
 spear+0.073
 armor+0.068
 obstacles+0.067
Neg logit contributions
 Lily-0.046
 paramed-0.044
 chip-0.044
burn-0.043
 okay-0.042
Tokenaffles
Token ID48501
Feature activation+4.84e+00

Pos logit contributions
 galaxies+0.199
 warrior+0.171
 spear+0.169
 obstacles+0.160
 cities+0.150
Neg logit contributions
 Lily-0.271
 chip-0.265
 Sara-0.258
 v-0.254
 coffee-0.253
Token for
Token ID329
Feature activation+2.39e+00

Pos logit contributions
 galaxies+0.197
iles+0.195
 spear+0.188
 obstacles+0.176
 warrior+0.174
Neg logit contributions
 Lily-0.117
 paramed-0.110
 chip-0.106
 Sara-0.106
burn-0.102
Token breakfast
Token ID12607
Feature activation+1.82e+00

Pos logit contributions
 galaxies+0.099
iles+0.096
 spear+0.091
 obstacles+0.089
 armor+0.087
Neg logit contributions
 Lily-0.058
 chip-0.056
 paramed-0.055
burn-0.052
 Sara-0.052
Token every
Token ID790
Feature activation+2.22e+00

Pos logit contributions
iles+0.058
 galaxies+0.057
 spear+0.054
 facts+0.052
 reefs+0.052
Neg logit contributions
 paramed-0.057
 Lily-0.053
 Sara-0.050
 Wh-0.050
burn-0.050
Token day
Token ID1110
Feature activation+2.88e-01

Pos logit contributions
 galaxies+0.078
 spear+0.073
iles+0.073
 warrior+0.068
 obstacles+0.067
Neg logit contributions
 Lily-0.064
 chip-0.064
 Wh-0.063
 paramed-0.063
 okay-0.061
Token
Token ID198
Feature activation-1.43e-01

Pos logit contributions
 Lily+0.006
 chip+0.005
 Sara+0.005
 paramed+0.005
 Fl+0.005
Neg logit contributions
 galaxies-0.010
iles-0.010
 spear-0.009
 obstacles-0.009
 warrior-0.009
Token"
Token ID1
Feature activation-1.02e+00

Pos logit contributions
 chip+0.004
 Lily+0.004
burn+0.004
 Fl+0.004
 Sara+0.004
Neg logit contributions
 galaxies-0.005
iles-0.005
 obstacles-0.004
 spear-0.004
Nice-0.004
TokenLet
Token ID5756
Feature activation+9.10e-01

Pos logit contributions
 Lily+0.035
 chip+0.033
 Fl+0.033
 Sara+0.032
 Wh+0.032
Neg logit contributions
 galaxies-0.031
iles-0.029
 obstacles-0.028
 spear-0.027
 warrior-0.027
Token's
Token ID338
Feature activation-8.68e-01

Pos logit contributions
iles+0.030
 spear+0.029
 galaxies+0.029
 obstacles+0.029
 obstacle+0.028
Neg logit contributions
 chip-0.024
 paramed-0.024
less-0.023
 Sn-0.023
 Wh-0.022
Token give
Token ID1577
Feature activation+3.68e-01

Pos logit contributions
 Lily+0.023
 chip+0.022
 paramed+0.022
 Sara+0.022
 Fl+0.021
Neg logit contributions
 galaxies-0.034
 spear-0.032
iles-0.032
 warrior-0.031
 obstacles-0.030
Token each
Token ID1123
Feature activation+6.62e+00

Pos logit contributions
 galaxies+0.015
iles+0.015
 spear+0.014
 obstacles+0.014
 warrior+0.013
Neg logit contributions
 paramed-0.008
 Lily-0.008
 chip-0.008
burn-0.008
 Sara-0.007
Token other
Token ID584
Feature activation+3.00e+00

Pos logit contributions
 galaxies+0.270
iles+0.260
 spear+0.254
 obstacles+0.247
 warrior+0.242
Neg logit contributions
 Lily-0.169
 chip-0.162
 paramed-0.157
 Sara-0.154
 Wh-0.150
Token a
Token ID257
Feature activation+1.39e+00

Pos logit contributions
 galaxies+0.133
iles+0.126
 spear+0.124
 obstacles+0.121
 warrior+0.117
Neg logit contributions
 Lily-0.067
 chip-0.064
 paramed-0.062
 Sara-0.060
 Wh-0.060
Token haircut
Token ID45548
Feature activation+1.82e+00

Pos logit contributions
 galaxies+0.059
iles+0.056
 spear+0.054
 obstacles+0.053
 warrior+0.052
Neg logit contributions
 Lily-0.036
 chip-0.033
 Sara-0.032
 paramed-0.031
 Fl-0.030
Token!"
Token ID2474
Feature activation+8.23e-01

Pos logit contributions
 galaxies+0.069
iles+0.066
 spear+0.065
 warrior+0.061
 obstacles+0.061
Neg logit contributions
 Lily-0.049
 paramed-0.047
 Wh-0.046
 Mitt-0.046
 Fl-0.045
Token Tom
Token ID4186
Feature activation+3.48e-01

Pos logit contributions
 galaxies+0.029
iles+0.029
 obstacles+0.029
 questions+0.028
 wonders+0.027
Neg logit contributions
burn-0.023
less-0.019
 chip-0.017
tr-0.017
 paramed-0.017
Token noise
Token ID7838
Feature activation-1.60e-01

Pos logit contributions
 galaxies+0.020
iles+0.019
 spear+0.018
 obstacles+0.018
 warrior+0.018
Neg logit contributions
 Lily-0.015
 Sara-0.014
 chip-0.013
 Fl-0.013
 paramed-0.013
Token.
Token ID13
Feature activation-5.48e-01

Pos logit contributions
 Lily+0.004
 chip+0.004
 Sara+0.004
 paramed+0.004
 Wh+0.004
Neg logit contributions
 galaxies-0.006
iles-0.006
 spear-0.006
 obstacles-0.006
 warrior-0.006
Token
Token ID198
Feature activation-3.61e-01

Pos logit contributions
 chip+0.016
burn+0.016
 Lily+0.016
 paramed+0.015
 v+0.015
Neg logit contributions
 galaxies-0.019
iles-0.018
 spear-0.017
 obstacles-0.017
 warrior-0.016
Token
Token ID198
Feature activation-3.41e-01

Pos logit contributions
 Lily+0.007
 Sara+0.006
 chip+0.005
 Fl+0.005
 Wh+0.005
Neg logit contributions
 galaxies-0.019
iles-0.018
 warrior-0.018
 spear-0.018
 obstacles-0.017
Token"
Token ID1
Feature activation-1.94e+00

Pos logit contributions
 Lily+0.008
 chip+0.007
 Sara+0.007
 paramed+0.007
 Fl+0.007
Neg logit contributions
 galaxies-0.015
iles-0.014
 spear-0.014
 obstacles-0.014
 warrior-0.014
TokenCrunch
Token ID49384
Feature activation+6.47e+00

Pos logit contributions
 Lily+0.066
 chip+0.062
 Sara+0.062
 okay+0.061
 Fl+0.060
Neg logit contributions
 galaxies-0.058
iles-0.057
 obstacles-0.054
 spear-0.053
 warrior-0.051
Token,
Token ID11
Feature activation+2.75e+00

Pos logit contributions
 galaxies+0.292
iles+0.275
 spear+0.268
 obstacles+0.263
 warrior+0.261
Neg logit contributions
 Lily-0.150
 chip-0.138
 Sara-0.133
 paramed-0.126
 Fl-0.125
Token crunch
Token ID23527
Feature activation+2.19e+00

Pos logit contributions
 galaxies+0.107
iles+0.106
 spear+0.102
 obstacles+0.101
 facts+0.099
Neg logit contributions
burn-0.067
 paramed-0.059
ird-0.058
 Sn-0.057
nes-0.056
Token,
Token ID11
Feature activation+2.11e+00

Pos logit contributions
 galaxies+0.088
iles+0.086
 spear+0.081
 obstacles+0.079
 warrior+0.078
Neg logit contributions
 Lily-0.059
 chip-0.055
 paramed-0.053
 Sara-0.053
 Wh-0.052
Token crunch
Token ID23527
Feature activation+1.54e+00

Pos logit contributions
iles+0.084
 galaxies+0.082
 spear+0.078
 facts+0.076
 obstacles+0.076
Neg logit contributions
burn-0.048
 paramed-0.047
 Lily-0.044
 Sn-0.043
nes-0.043
Token.
Token ID13
Feature activation+9.09e-02

Pos logit contributions
iles+0.059
 galaxies+0.058
 spear+0.055
 obstacles+0.052
 warrior+0.051
Neg logit contributions
 Lily-0.043
 paramed-0.041
 chip-0.040
 Wh-0.039
 Sara-0.039
Token'm
Token ID1101
Feature activation+6.31e-01

Pos logit contributions
 galaxies+0.066
iles+0.062
 spear+0.061
 warrior+0.058
 obstacles+0.058
Neg logit contributions
 Lily-0.059
 chip-0.058
 Sara-0.055
 paramed-0.054
burn-0.053
Token sorry
Token ID7926
Feature activation-8.33e-01

Pos logit contributions
iles+0.017
 galaxies+0.017
 spear+0.016
 warrior+0.016
 goals+0.015
Neg logit contributions
burn-0.021
 chip-0.021
stand-0.020
 paramed-0.019
keeping-0.019
Token"
Token ID1
Feature activation-6.93e-01

Pos logit contributions
 Lily+0.020
 chip+0.020
 paramed+0.020
 Sara+0.018
burn+0.018
Neg logit contributions
 galaxies-0.034
iles-0.033
 spear-0.032
 obstacles-0.031
 warrior-0.031
Token and
Token ID290
Feature activation+2.82e-02

Pos logit contributions
 paramed+0.018
 chip+0.017
 Fl+0.017
 v+0.016
burn+0.016
Neg logit contributions
 galaxies-0.026
iles-0.025
 obstacles-0.024
 spear-0.023
 warrior-0.023
Token hugged
Token ID41130
Feature activation+8.27e-01

Pos logit contributions
 spear+0.001
 galaxies+0.001
 warrior+0.001
 nods+0.001
 remembers+0.001
Neg logit contributions
burn-0.001
ird-0.001
stand-0.001
 paramed-0.001
 chip-0.001
Token each
Token ID1123
Feature activation+6.44e+00

Pos logit contributions
 galaxies+0.039
iles+0.038
 obstacles+0.037
 spear+0.037
 goals+0.036
Neg logit contributions
 paramed-0.014
 chip-0.013
burn-0.012
ird-0.012
stand-0.012
Token other
Token ID584
Feature activation+2.74e+00

Pos logit contributions
 galaxies+0.239
iles+0.231
 spear+0.225
 obstacles+0.219
 warrior+0.211
Neg logit contributions
 Lily-0.175
 chip-0.175
 paramed-0.171
burn-0.163
 Sara-0.162
Token.
Token ID13
Feature activation-2.83e-02

Pos logit contributions
 goals+0.087
 galaxies+0.084
 spear+0.082
 obstacles+0.081
iles+0.078
Neg logit contributions
 paramed-0.087
 chip-0.081
burn-0.080
 Wh-0.078
 Lily-0.076
Token
Token ID198
Feature activation-5.27e-01

Pos logit contributions
 chip+0.001
 Lily+0.001
 paramed+0.001
burn+0.001
 v+0.001
Neg logit contributions
 galaxies-0.001
iles-0.001
 spear-0.001
 obstacles-0.001
 warrior-0.001
Token
Token ID198
Feature activation-8.47e-01

Pos logit contributions
 Lily+0.011
 chip+0.010
 Sara+0.009
 Fl+0.009
 Wh+0.008
Neg logit contributions
 galaxies-0.027
iles-0.025
 warrior-0.025
 spear-0.025
 obstacles-0.024
TokenThey
Token ID2990
Feature activation+4.68e-01

Pos logit contributions
 Lily+0.024
 chip+0.022
 Sara+0.021
 paramed+0.020
 Fl+0.020
Neg logit contributions
 galaxies-0.035
iles-0.033
 spear-0.032
 obstacles-0.031
 warrior-0.031
Token Tim
Token ID5045
Feature activation+7.93e-01

Pos logit contributions
 Lily+0.003
 chip+0.002
 Sara+0.002
 Wh+0.002
 Fl+0.002
Neg logit contributions
 galaxies-0.015
iles-0.014
 obstacles-0.014
 warrior-0.014
 obstacle-0.014
Token loved
Token ID6151
Feature activation+2.09e+00

Pos logit contributions
 galaxies+0.032
iles+0.031
 obstacles+0.030
 warrior+0.029
 obstacle+0.029
Neg logit contributions
 Lily-0.024
 Sara-0.021
 chip-0.020
 Fl-0.020
 Wh-0.020
Token to
Token ID284
Feature activation+1.23e+00

Pos logit contributions
 galaxies+0.102
 spear+0.096
iles+0.096
 obstacles+0.093
 armor+0.091
Neg logit contributions
 Lily-0.038
 chip-0.034
 Sara-0.034
 paramed-0.032
 okay-0.031
Token eat
Token ID4483
Feature activation+1.15e+00

Pos logit contributions
 galaxies+0.052
iles+0.049
 obstacles+0.047
 spear+0.046
 warrior+0.046
Neg logit contributions
 Lily-0.034
 chip-0.031
 Sara-0.030
 paramed-0.028
 Fl-0.028
Token warm
Token ID5814
Feature activation+1.99e+00

Pos logit contributions
 galaxies+0.049
iles+0.046
 spear+0.045
 obstacles+0.044
 warrior+0.044
Neg logit contributions
 Lily-0.029
 chip-0.027
 Sara-0.026
 paramed-0.025
 Fl-0.025
Token w
Token ID266
Feature activation+6.43e+00

Pos logit contributions
 galaxies+0.082
iles+0.078
 spear+0.077
 obstacles+0.073
 armor+0.073
Neg logit contributions
 Lily-0.050
 chip-0.047
 paramed-0.046
 Sara-0.045
 okay-0.044
Tokenaffles
Token ID48501
Feature activation+4.34e+00

Pos logit contributions
 galaxies+0.168
 warrior+0.138
 spear+0.136
 obstacles+0.130
 cities+0.123
Neg logit contributions
 Lily-0.282
 chip-0.273
 Sara-0.269
 v-0.263
 coffee-0.262
Token for
Token ID329
Feature activation+1.73e+00

Pos logit contributions
 galaxies+0.167
iles+0.166
 spear+0.163
 obstacles+0.149
 warrior+0.148
Neg logit contributions
 Lily-0.112
 paramed-0.108
 Sara-0.102
 chip-0.102
 Wh-0.099
Token breakfast
Token ID12607
Feature activation+1.20e+00

Pos logit contributions
 galaxies+0.072
iles+0.070
 spear+0.067
 obstacles+0.066
 armor+0.065
Neg logit contributions
 Lily-0.041
 chip-0.040
 paramed-0.040
burn-0.038
 Sara-0.038
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
iles+0.037
 galaxies+0.036
 spear+0.036
 reefs+0.033
 facts+0.033
Neg logit contributions
 paramed-0.039
 Lily-0.036
 Sara-0.034
 Wh-0.034
burn-0.034
Token Every
Token ID3887
Feature activation+2.51e+00

Pos logit contributions
 galaxies+0.005
iles+0.004
 obstacles+0.004
 warrior+0.004
 obstacle+0.004
Neg logit contributions
 Lily-0.005
 Sara-0.005
 chip-0.004
 Fl-0.004
 Wh-0.004
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
 Lily+0.023
 chip+0.021
 Sara+0.021
 paramed+0.020
 Fl+0.019
Neg logit contributions
 galaxies-0.045
iles-0.043
 spear-0.042
 obstacles-0.041
 warrior-0.040
Token She
Token ID1375
Feature activation+2.41e+00

Pos logit contributions
 paramed+0.003
 chip+0.003
burn+0.003
 v+0.003
 Sara+0.003
Neg logit contributions
 galaxies-0.004
iles-0.004
 spear-0.004
 obstacles-0.004
 warrior-0.004
Token loved
Token ID6151
Feature activation+2.26e+00

Pos logit contributions
 galaxies+0.072
iles+0.065
 spear+0.063
 warrior+0.060
 obstacles+0.060
Neg logit contributions
 Lily-0.091
 chip-0.088
 Sara-0.086
 paramed-0.085
 Fl-0.083
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
 galaxies+0.107
iles+0.100
 spear+0.100
 obstacles+0.096
 armor+0.094
Neg logit contributions
 Lily-0.043
 chip-0.039
 Sara-0.038
 okay-0.038
 paramed-0.037
Token eat
Token ID4483
Feature activation+1.25e+00

Pos logit contributions
 galaxies+0.055
iles+0.052
 obstacles+0.050
 spear+0.049
 warrior+0.049
Neg logit contributions
 Lily-0.034
 chip-0.031
 Sara-0.030
 Fl-0.029
 paramed-0.029
Token w
Token ID266
Feature activation+6.41e+00

Pos logit contributions
 galaxies+0.052
iles+0.050
 spear+0.049
 obstacles+0.047
 warrior+0.046
Neg logit contributions
 Lily-0.030
 chip-0.028
 paramed-0.028
 Sara-0.027
burn-0.027
Tokenaffles
Token ID48501
Feature activation+4.50e+00

Pos logit contributions
 galaxies+0.188
 warrior+0.158
 spear+0.153
 obstacles+0.146
 cities+0.142
Neg logit contributions
 Lily-0.260
 chip-0.251
 Sara-0.248
 v-0.247
 coffee-0.244
Token for
Token ID329
Feature activation+1.90e+00

Pos logit contributions
iles+0.177
 galaxies+0.175
 spear+0.169
 obstacles+0.157
 feats+0.155
Neg logit contributions
 Lily-0.112
 paramed-0.108
 chip-0.102
 Sara-0.102
burn-0.100
Token breakfast
Token ID12607
Feature activation+1.48e+00

Pos logit contributions
 galaxies+0.079
iles+0.078
 spear+0.073
 obstacles+0.073
 feats+0.071
Neg logit contributions
 Lily-0.044
 chip-0.043
 paramed-0.043
burn-0.041
 Sara-0.040
Token every
Token ID790
Feature activation+2.15e+00

Pos logit contributions
iles+0.047
 galaxies+0.044
 spear+0.042
 facts+0.041
 reefs+0.041
Neg logit contributions
 paramed-0.049
 Lily-0.044
 Wh-0.042
burn-0.042
 Sara-0.042
Token day
Token ID1110
Feature activation+1.12e-01

Pos logit contributions
 galaxies+0.079
iles+0.075
 spear+0.074
 warrior+0.070
 obstacles+0.069
Neg logit contributions
 Lily-0.060
 chip-0.060
 paramed-0.058
 Wh-0.058
 Sara-0.056
Token thought
Token ID1807
Feature activation+3.33e+00

Pos logit contributions
 Lily+0.061
 chip+0.056
 Sara+0.054
 Fl+0.052
 Wh+0.051
Neg logit contributions
 galaxies-0.087
iles-0.084
 spear-0.080
 obstacles-0.078
 warrior-0.077
Token they
Token ID484
Feature activation+1.16e+00

Pos logit contributions
 galaxies+0.163
iles+0.156
 spear+0.152
 obstacles+0.148
 warrior+0.148
Neg logit contributions
 Lily-0.066
 chip-0.057
 Sara-0.057
 Fl-0.052
 paramed-0.051
Token looked
Token ID3114
Feature activation+2.77e+00

Pos logit contributions
iles+0.053
 galaxies+0.051
 warrior+0.050
 spear+0.050
 obstacle+0.049
Neg logit contributions
 Lily-0.027
 clean-0.023
 let-0.023
 okay-0.022
 baby-0.022
Token like
Token ID588
Feature activation+1.62e+00

Pos logit contributions
iles+0.136
 galaxies+0.132
 spear+0.125
 warrior+0.122
smith+0.121
Neg logit contributions
 Lily-0.061
 Sara-0.048
 okay-0.047
 v-0.047
 Fl-0.044
Token ac
Token ID936
Feature activation+2.39e+00

Pos logit contributions
 galaxies+0.076
iles+0.069
 spear+0.068
 obstacles+0.068
 warrior+0.066
Neg logit contributions
 chip-0.034
 paramed-0.032
 Lily-0.032
burn-0.030
 Sara-0.030
Tokenrob
Token ID22609
Feature activation+6.41e+00

Pos logit contributions
 galaxies+0.061
iles+0.053
 spear+0.051
 obstacles+0.051
 warrior+0.049
Neg logit contributions
 Lily-0.104
 chip-0.101
 Sara-0.096
 Wh-0.096
 Fl-0.095
Tokenats
Token ID1381
Feature activation+1.90e+00

Pos logit contributions
 galaxies+0.206
 spear+0.180
 warrior+0.177
 obstacles+0.171
 boulder+0.166
Neg logit contributions
 Lily-0.236
 Sara-0.220
 chip-0.215
burn-0.212
ird-0.211
Token.
Token ID13
Feature activation+2.78e-01

Pos logit contributions
 galaxies+0.073
iles+0.068
 spear+0.067
 obstacles+0.064
 warrior+0.064
Neg logit contributions
 Lily-0.051
 paramed-0.050
 chip-0.049
 Sara-0.048
 Wh-0.045
Token She
Token ID1375
Feature activation+1.70e+00

Pos logit contributions
 galaxies+0.011
iles+0.010
 spear+0.010
 obstacles+0.010
 warrior+0.010
Neg logit contributions
 Lily-0.008
 chip-0.007
 Sara-0.007
 paramed-0.007
 Fl-0.007
Token enjoyed
Token ID8359
Feature activation+1.23e+00

Pos logit contributions
iles+0.078
 galaxies+0.077
 spear+0.072
 obstacle+0.072
 obstacles+0.071
Neg logit contributions
 Lily-0.041
 Sara-0.034
 let-0.033
 Fl-0.032
 v-0.032
Token the
Token ID262
Feature activation+1.65e+00

Pos logit contributions
 galaxies+0.053
iles+0.050
 spear+0.048
 obstacles+0.047
 warrior+0.047
Neg logit contributions
 Lily-0.031
 chip-0.029
 Sara-0.028
 paramed-0.027
 Fl-0.026
Token like
Token ID588
Feature activation+1.18e+00

Pos logit contributions
 galaxies+0.082
iles+0.080
 obstacles+0.078
 obstacle+0.078
 feats+0.077
Neg logit contributions
 Lily-0.049
 Sara-0.042
 clean-0.039
 let-0.038
 L-0.037
Token to
Token ID284
Feature activation+9.20e-01

Pos logit contributions
 galaxies+0.062
iles+0.059
 spear+0.058
 obstacles+0.057
 warrior+0.056
Neg logit contributions
 Lily-0.018
 chip-0.016
 Sara-0.015
 paramed-0.015
 Fl-0.014
Token jump
Token ID4391
Feature activation+8.68e-01

Pos logit contributions
 galaxies+0.043
iles+0.040
 obstacles+0.038
 spear+0.038
 warrior+0.037
Neg logit contributions
 Lily-0.025
 Sara-0.021
 chip-0.020
 clean-0.019
 v-0.018
Token on
Token ID319
Feature activation+2.52e+00

Pos logit contributions
 galaxies+0.042
iles+0.040
 warrior+0.038
 spear+0.038
 obstacles+0.038
Neg logit contributions
 Lily-0.020
 Sara-0.016
 chip-0.016
 Fl-0.016
 v-0.015
Token the
Token ID262
Feature activation+2.02e+00

Pos logit contributions
 galaxies+0.136
iles+0.133
 warrior+0.129
 spear+0.127
 obstacles+0.123
Neg logit contributions
 Lily-0.041
 Sara-0.033
 chip-0.032
 Fl-0.031
 v-0.029
Token tr
Token ID491
Feature activation+6.40e+00

Pos logit contributions
 galaxies+0.075
iles+0.070
 spear+0.067
 warrior+0.067
 obstacles+0.066
Neg logit contributions
 Lily-0.064
 chip-0.060
 Sara-0.058
 Fl-0.056
 v-0.055
Tokenamp
Token ID696
Feature activation+4.26e+00

Pos logit contributions
 galaxies+0.246
 warrior+0.239
 spear+0.235
 obstacles+0.232
 armor+0.221
Neg logit contributions
 Lily-0.178
 chip-0.168
stand-0.163
 Sara-0.162
 Mitt-0.159
Tokenoline
Token ID14453
Feature activation+1.14e+00

Pos logit contributions
 galaxies+0.101
 spear+0.087
 warrior+0.083
iles+0.083
 obstacles+0.082
Neg logit contributions
 Lily-0.198
 Sara-0.185
 chip-0.182
 Fl-0.176
 Wh-0.174
Token.
Token ID13
Feature activation+1.95e-01

Pos logit contributions
 galaxies+0.052
iles+0.049
 spear+0.048
 obstacles+0.047
 warrior+0.047
Neg logit contributions
 Lily-0.026
 chip-0.024
 Sara-0.023
 Fl-0.021
 paramed-0.021
Token It
Token ID632
Feature activation+1.30e+00

Pos logit contributions
 galaxies+0.010
iles+0.009
 spear+0.009
 warrior+0.009
 obstacles+0.009
Neg logit contributions
 Lily-0.005
 Sara-0.004
 Fl-0.004
 chip-0.004
 Anna-0.003
Token is
Token ID318
Feature activation+2.91e-01

Pos logit contributions
 galaxies+0.060
 armour+0.056
 warrior+0.056
iles+0.056
 armor+0.056
Neg logit contributions
 Lily-0.035
 Fl-0.031
 Sara-0.029
 v-0.028
 chip-0.027
Token Lily
Token ID20037
Feature activation-2.59e+00

Pos logit contributions
 galaxies+0.020
iles+0.018
 obstacles+0.018
 warrior+0.018
 villages+0.017
Neg logit contributions
 paramed-0.008
burn-0.007
 chip-0.007
stand-0.007
 v-0.006
Token loved
Token ID6151
Feature activation+1.13e+00

Pos logit contributions
 Lily+0.074
 chip+0.068
 Sara+0.067
 paramed+0.064
 Fl+0.064
Neg logit contributions
 galaxies-0.103
iles-0.097
 spear-0.094
 obstacles-0.092
 warrior-0.091
Token to
Token ID284
Feature activation+6.83e-01

Pos logit contributions
 galaxies+0.050
iles+0.047
 spear+0.046
 obstacles+0.045
 warrior+0.044
Neg logit contributions
 Lily-0.027
 chip-0.024
 Sara-0.024
 paramed-0.023
 Fl-0.022
Token ride
Token ID6594
Feature activation+8.74e-01

Pos logit contributions
 galaxies+0.028
iles+0.027
 obstacles+0.025
 spear+0.024
 warrior+0.024
Neg logit contributions
 Lily-0.019
 chip-0.018
 Sara-0.017
 Wh-0.017
 Fl-0.017
Token roller
Token ID24471
Feature activation+2.37e+00

Pos logit contributions
 galaxies+0.045
iles+0.043
 spear+0.042
 obstacles+0.041
 warrior+0.041
Neg logit contributions
 Lily-0.015
 chip-0.013
 Sara-0.012
 Fl-0.011
 Wh-0.011
Tokenco
Token ID1073
Feature activation+6.38e+00

Pos logit contributions
 galaxies+0.068
iles+0.062
 spear+0.059
 obstacles+0.057
 warrior+0.056
Neg logit contributions
 Lily-0.094
 chip-0.089
 Sara-0.088
 paramed-0.085
 Fl-0.085
Tokenasters
Token ID7060
Feature activation+9.64e-01

Pos logit contributions
 galaxies+0.205
 spear+0.182
 warrior+0.177
iles+0.177
 obstacles+0.171
Neg logit contributions
 Lily-0.233
 chip-0.229
 Sara-0.220
 Wh-0.216
 Fl-0.211
Token and
Token ID290
Feature activation+7.03e-01

Pos logit contributions
 galaxies+0.037
iles+0.034
 spear+0.034
 obstacles+0.033
 warrior+0.032
Neg logit contributions
 Lily-0.027
 paramed-0.026
 chip-0.025
 Sara-0.025
 okay-0.023
Token always
Token ID1464
Feature activation-5.32e-01

Pos logit contributions
 galaxies+0.027
iles+0.025
 spear+0.024
 obstacles+0.023
 warrior+0.023
Neg logit contributions
 Lily-0.021
 chip-0.020
 Sara-0.019
 paramed-0.019
 Fl-0.018
Token remembered
Token ID12086
Feature activation-1.60e+00

Pos logit contributions
 Lily+0.015
 Sara+0.013
 chip+0.013
 Wh+0.012
 Fl+0.012
Neg logit contributions
 galaxies-0.022
iles-0.021
 spear-0.020
 obstacles-0.020
 warrior-0.019
Token how
Token ID703
Feature activation+5.70e-01

Pos logit contributions
 Lily+0.034
 chip+0.033
 paramed+0.031
 Sara+0.030
 Wh+0.029
Neg logit contributions
 galaxies-0.073
iles-0.067
 obstacles-0.066
 spear-0.066
 islands-0.063
Token to
Token ID284
Feature activation+1.31e+00

Pos logit contributions
 galaxies+0.106
iles+0.101
 spear+0.099
 obstacles+0.097
 warrior+0.096
Neg logit contributions
 Lily-0.035
 chip-0.031
 Sara-0.030
 paramed-0.028
 Fl-0.027
Token sing
Token ID1702
Feature activation+1.12e+00

Pos logit contributions
 galaxies+0.057
iles+0.055
 obstacles+0.054
 obstacle+0.051
 spear+0.051
Neg logit contributions
 Lily-0.033
 chip-0.029
 Sara-0.029
 v-0.028
 Fl-0.027
Token.
Token ID13
Feature activation+9.43e-01

Pos logit contributions
 galaxies+0.050
iles+0.049
 obstacles+0.046
 spear+0.046
 warrior+0.045
Neg logit contributions
 Lily-0.027
 chip-0.023
 Sara-0.023
 Fl-0.022
 v-0.022
Token He
Token ID679
Feature activation+4.33e+00

Pos logit contributions
 galaxies+0.031
iles+0.029
 obstacles+0.028
 spear+0.028
 obstacle+0.027
Neg logit contributions
 Lily-0.037
 Sara-0.034
 Fl-0.032
 chip-0.032
 Wh-0.031
Token sang
Token ID25889
Feature activation+7.38e-01

Pos logit contributions
 galaxies+0.171
iles+0.163
 spear+0.160
 obstacles+0.157
 warrior+0.153
Neg logit contributions
 Lily-0.128
 Sara-0.114
 chip-0.113
 Fl-0.111
 Wh-0.108
Token each
Token ID1123
Feature activation+6.38e+00

Pos logit contributions
 galaxies+0.034
iles+0.034
 spear+0.032
 obstacles+0.032
 warrior+0.031
Neg logit contributions
 Lily-0.016
 Sara-0.014
 chip-0.013
 Fl-0.013
 v-0.012
Token morning
Token ID3329
Feature activation+6.01e-01

Pos logit contributions
 galaxies+0.260
iles+0.243
 spear+0.237
 warrior+0.231
 obstacles+0.229
Neg logit contributions
 Lily-0.171
 chip-0.165
 paramed-0.160
 Sara-0.158
 Wh-0.153
Token to
Token ID284
Feature activation+7.49e-01

Pos logit contributions
 galaxies+0.023
 spear+0.020
iles+0.020
 feats+0.020
 planets+0.020
Neg logit contributions
 paramed-0.017
 Lily-0.017
 Sara-0.017
 Wh-0.017
 chip-0.016
Token the
Token ID262
Feature activation+1.30e+00

Pos logit contributions
 galaxies+0.034
iles+0.033
 spear+0.032
 obstacles+0.031
 warrior+0.031
Neg logit contributions
 Lily-0.017
 chip-0.015
 Sara-0.015
 Fl-0.014
 paramed-0.014
Token birds
Token ID10087
Feature activation+9.63e-01

Pos logit contributions
 galaxies+0.046
iles+0.042
 spear+0.042
 warrior+0.041
 obstacles+0.039
Neg logit contributions
 Lily-0.043
 chip-0.040
 Sara-0.040
 paramed-0.040
 Wh-0.039
Token in
Token ID287
Feature activation+2.16e+00

Pos logit contributions
 galaxies+0.043
iles+0.040
 spear+0.039
 obstacles+0.039
 warrior+0.038
Neg logit contributions
 Lily-0.023
 chip-0.021
 Sara-0.020
 Fl-0.020
 Wh-0.019
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
 Lily+0.030
 chip+0.029
 Sara+0.028
 paramed+0.028
 Fl+0.026
Neg logit contributions
 galaxies-0.055
iles-0.051
 spear-0.050
 obstacles-0.048
 warrior-0.048
Token She
Token ID1375
Feature activation+2.27e+00

Pos logit contributions
 chip+0.009
 paramed+0.009
burn+0.008
 Sara+0.008
 Lily+0.007
Neg logit contributions
 galaxies-0.012
iles-0.010
 spear-0.010
 obstacles-0.010
 warrior-0.010
Token loved
Token ID6151
Feature activation+2.04e+00

Pos logit contributions
 galaxies+0.066
iles+0.060
 spear+0.058
 warrior+0.055
 obstacles+0.054
Neg logit contributions
 Lily-0.086
 chip-0.084
 Sara-0.082
 paramed-0.081
burn-0.079
Token to
Token ID284
Feature activation+1.23e+00

Pos logit contributions
 galaxies+0.095
iles+0.089
 spear+0.088
 obstacles+0.085
 armor+0.083
Neg logit contributions
 Lily-0.039
 chip-0.036
 okay-0.036
 Sara-0.035
 paramed-0.034
Token eat
Token ID4483
Feature activation+1.20e+00

Pos logit contributions
 galaxies+0.052
iles+0.049
 obstacles+0.047
 spear+0.047
 warrior+0.046
Neg logit contributions
 Lily-0.034
 chip-0.031
 Sara-0.030
 Fl-0.028
 paramed-0.028
Token w
Token ID266
Feature activation+6.37e+00

Pos logit contributions
 galaxies+0.051
iles+0.049
 spear+0.048
 obstacles+0.046
 warrior+0.045
Neg logit contributions
 Lily-0.028
 chip-0.026
 paramed-0.026
 Sara-0.025
burn-0.025
Tokenaffles
Token ID48501
Feature activation+4.47e+00

Pos logit contributions
 galaxies+0.192
 warrior+0.164
 spear+0.157
 obstacles+0.151
 cities+0.147
Neg logit contributions
 Lily-0.252
 chip-0.244
 v-0.240
 Sara-0.240
 coffee-0.239
Token for
Token ID329
Feature activation+1.86e+00

Pos logit contributions
iles+0.179
 galaxies+0.179
 spear+0.172
 obstacles+0.161
 warrior+0.159
Neg logit contributions
 Lily-0.110
 paramed-0.103
 chip-0.099
 Sara-0.099
burn-0.095
Token breakfast
Token ID12607
Feature activation+1.54e+00

Pos logit contributions
 galaxies+0.078
iles+0.077
 spear+0.072
 obstacles+0.071
 armor+0.069
Neg logit contributions
 Lily-0.043
 chip-0.042
 paramed-0.041
burn-0.039
 Sara-0.039
Token every
Token ID790
Feature activation+2.12e+00

Pos logit contributions
iles+0.049
 galaxies+0.047
 spear+0.045
 facts+0.043
 reefs+0.043
Neg logit contributions
 paramed-0.049
 Lily-0.045
 Sara-0.043
burn-0.043
 Wh-0.043
Token day
Token ID1110
Feature activation+4.56e-02

Pos logit contributions
 galaxies+0.075
iles+0.071
 spear+0.071
 warrior+0.066
 obstacles+0.065
Neg logit contributions
 chip-0.061
 Lily-0.060
 paramed-0.059
 Wh-0.059
 okay-0.057
Token lesson
Token ID11483
Feature activation+6.77e-01

Pos logit contributions
 Lily+0.005
 chip+0.004
 baby+0.004
 okay+0.004
 Sara+0.004
Neg logit contributions
 galaxies-0.010
iles-0.009
 valleys-0.009
 islands-0.009
Nice-0.009
Token and
Token ID290
Feature activation-9.58e-01

Pos logit contributions
 galaxies+0.029
iles+0.027
 spear+0.027
 warrior+0.026
 obstacles+0.026
Neg logit contributions
 Lily-0.018
 chip-0.016
 Sara-0.016
 Wh-0.015
 Fl-0.015
Token never
Token ID1239
Feature activation-2.13e+00

Pos logit contributions
 chip+0.026
 Lily+0.026
 paramed+0.025
 Sara+0.024
burn+0.024
Neg logit contributions
 galaxies-0.038
 spear-0.034
iles-0.034
 obstacles-0.033
 warrior-0.033
Token pretended
Token ID36766
Feature activation-3.99e+00

Pos logit contributions
 chip+0.060
 Lily+0.058
 paramed+0.058
 Sara+0.056
 Sn+0.055
Neg logit contributions
 galaxies-0.081
iles-0.076
 spear-0.075
 obstacles-0.074
 warrior-0.073
Token with
Token ID351
Feature activation+6.70e-02

Pos logit contributions
 Lily+0.068
 chip+0.060
 Sara+0.058
 paramed+0.054
 Fl+0.053
Neg logit contributions
 galaxies-0.204
iles-0.194
 spear-0.190
 obstacles-0.186
 warrior-0.185
Token tr
Token ID491
Feature activation+6.34e+00

Pos logit contributions
 galaxies+0.003
iles+0.003
 spear+0.003
 obstacles+0.003
 warrior+0.003
Neg logit contributions
 Lily-0.002
 chip-0.002
 Sara-0.002
 Fl-0.001
 Wh-0.001
Tokenays
Token ID592
Feature activation-7.17e-01

Pos logit contributions
 galaxies+0.248
 spear+0.239
 warrior+0.233
 armor+0.216
 facts+0.215
Neg logit contributions
stand-0.154
ula-0.153
 Lily-0.153
ird-0.151
 Mitt-0.150
Token again
Token ID757
Feature activation+1.85e+00

Pos logit contributions
 Lily+0.016
 chip+0.014
 Sara+0.014
 Fl+0.013
 paramed+0.013
Neg logit contributions
 galaxies-0.033
iles-0.032
 spear-0.031
 obstacles-0.030
 warrior-0.030
Token.
Token ID13
Feature activation-7.40e-01

Pos logit contributions
 galaxies+0.083
iles+0.078
 spear+0.076
 obstacles+0.075
 warrior+0.074
Neg logit contributions
 Lily-0.043
 chip-0.040
 Sara-0.039
 paramed-0.038
 Fl-0.036
Token<|endoftext|>
Token ID50256
Feature activation+2.47e+00

Pos logit contributions
 Lily+0.018
 chip+0.016
 Sara+0.016
 paramed+0.015
 Fl+0.015
Neg logit contributions
 galaxies-0.033
iles-0.031
 spear-0.030
 obstacles-0.029
 warrior-0.029
TokenBilly
Token ID49640
Feature activation+1.04e+00

Pos logit contributions
 galaxies+0.108
iles+0.102
 spear+0.099
 obstacles+0.097
 warrior+0.097
Neg logit contributions
 Lily-0.059
 chip-0.055
 Sara-0.053
 paramed-0.051
 Fl-0.050
Token hurry
Token ID23290
Feature activation-1.04e+00

Pos logit contributions
 galaxies+0.024
 spear+0.023
iles+0.022
 obstacles+0.022
 warrior+0.022
Neg logit contributions
 chip-0.022
 Lily-0.021
 paramed-0.021
less-0.020
burn-0.020
Token.
Token ID13
Feature activation+4.39e-01

Pos logit contributions
 Lily+0.023
 chip+0.022
 paramed+0.020
 Sara+0.020
 Fl+0.019
Neg logit contributions
 galaxies-0.047
iles-0.044
 spear-0.043
 obstacles-0.043
 warrior-0.042
Token They
Token ID1119
Feature activation+1.98e+00

Pos logit contributions
 galaxies+0.022
iles+0.020
 warrior+0.019
 obstacles+0.019
 spear+0.019
Neg logit contributions
 Lily-0.010
 Sara-0.009
 chip-0.008
 Wh-0.008
 Fl-0.008
Token grabbed
Token ID13176
Feature activation+2.46e+00

Pos logit contributions
 galaxies+0.081
iles+0.076
 spear+0.073
 obstacles+0.072
 warrior+0.071
Neg logit contributions
 Lily-0.056
 chip-0.051
 Sara-0.050
 paramed-0.048
 Fl-0.048
Token their
Token ID511
Feature activation-6.89e-01

Pos logit contributions
 galaxies+0.114
iles+0.108
 spear+0.106
 obstacles+0.104
 warrior+0.102
Neg logit contributions
 Lily-0.053
 chip-0.049
 Sara-0.046
 paramed-0.045
 Fl-0.044
Token tr
Token ID491
Feature activation+6.33e+00

Pos logit contributions
 Lily+0.020
 chip+0.019
 Sara+0.018
 Fl+0.018
 paramed+0.017
Neg logit contributions
 galaxies-0.027
iles-0.025
 spear-0.024
 warrior-0.024
 obstacles-0.024
Tokenunks
Token ID14125
Feature activation+2.85e+00

Pos logit contributions
 galaxies+0.183
 warrior+0.164
 obstacles+0.157
 spear+0.156
 facts+0.151
Neg logit contributions
 chip-0.232
 Lily-0.229
burn-0.227
 Wh-0.224
 Sn-0.221
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
 galaxies+0.099
 spear+0.093
iles+0.090
 obstacles+0.088
 warrior+0.085
Neg logit contributions
 Lily-0.091
 chip-0.086
 paramed-0.085
 Sara-0.085
 Wh-0.081
Token set
Token ID900
Feature activation-1.10e+00

Pos logit contributions
 galaxies+0.038
 spear+0.037
 obstacles+0.034
 armor+0.033
iles+0.033
Neg logit contributions
 chip-0.038
 paramed-0.037
 Lily-0.036
 Sara-0.035
burn-0.034
Token off
Token ID572
Feature activation-3.59e-01

Pos logit contributions
 Lily+0.021
 chip+0.021
 paramed+0.021
 Sara+0.018
 Wh+0.018
Neg logit contributions
 galaxies-0.050
 spear-0.047
 obstacles-0.047
iles-0.046
 islands-0.044
Token.
Token ID13
Feature activation+5.21e-01

Pos logit contributions
 Lily+0.008
 chip+0.008
 paramed+0.008
 Sara+0.007
burn+0.007
Neg logit contributions
 galaxies-0.016
iles-0.015
 spear-0.015
 obstacles-0.014
 islands-0.014
Token sad
Token ID6507
Feature activation-8.09e-01

Pos logit contributions
iles+0.025
 galaxies+0.025
 spear+0.024
 goals+0.024
 questions+0.024
Neg logit contributions
burn-0.028
stand-0.027
 Wh-0.027
 Sn-0.026
 Lily-0.026
Token.
Token ID13
Feature activation-5.02e-02

Pos logit contributions
 chip+0.022
 Lily+0.022
 paramed+0.022
 Wh+0.021
 Sara+0.020
Neg logit contributions
 galaxies-0.030
iles-0.028
 spear-0.027
 obstacles-0.027
 warrior-0.026
Token She
Token ID1375
Feature activation+1.67e+00

Pos logit contributions
 Lily+0.001
 chip+0.001
 Sara+0.001
 paramed+0.001
 Fl+0.001
Neg logit contributions
 galaxies-0.002
iles-0.002
 spear-0.002
 obstacles-0.002
 warrior-0.002
Token shrugged
Token ID28271
Feature activation+3.97e+00

Pos logit contributions
 galaxies+0.068
iles+0.064
 spear+0.062
 obstacles+0.060
 warrior+0.060
Neg logit contributions
 Lily-0.047
 chip-0.043
 Sara-0.042
 Fl-0.040
 paramed-0.040
Token her
Token ID607
Feature activation+1.73e+00

Pos logit contributions
 galaxies+0.182
iles+0.172
 spear+0.168
 obstacles+0.164
 warrior+0.163
Neg logit contributions
 Lily-0.090
 chip-0.082
 Sara-0.079
 paramed-0.075
 Fl-0.075
Token shoulders
Token ID12450
Feature activation+6.32e+00

Pos logit contributions
 galaxies+0.066
iles+0.063
 obstacles+0.057
 warrior+0.056
 islands+0.056
Neg logit contributions
 Lily-0.055
 chip-0.052
 Sara-0.049
 Fl-0.049
 baby-0.048
Token and
Token ID290
Feature activation+1.25e+00

Pos logit contributions
 galaxies+0.211
 spear+0.186
iles+0.183
 obstacles+0.181
 warrior+0.174
Neg logit contributions
 Lily-0.210
 Sara-0.202
 paramed-0.197
 chip-0.195
 Wh-0.192
Token her
Token ID607
Feature activation-4.50e-01

Pos logit contributions
 galaxies+0.043
 spear+0.039
 obstacles+0.038
 wonders+0.037
 warrior+0.037
Neg logit contributions
 chip-0.038
 paramed-0.036
burn-0.035
less-0.034
stand-0.034
Token eyes
Token ID2951
Feature activation+7.19e-01

Pos logit contributions
 Lily+0.006
 chip+0.006
 Sara+0.005
 paramed+0.005
 Fl+0.005
Neg logit contributions
 galaxies-0.025
iles-0.023
 spear-0.023
 obstacles-0.022
 warrior-0.022
Token filled
Token ID5901
Feature activation-1.79e+00

Pos logit contributions
 galaxies+0.024
 spear+0.021
 obstacles+0.021
 tirelessly+0.020
 armor+0.020
Neg logit contributions
 Lily-0.023
 Sara-0.022
 chip-0.022
 paramed-0.021
 Wh-0.020
Token with
Token ID351
Feature activation+1.12e+00

Pos logit contributions
 paramed+0.048
ird+0.047
burn+0.044
ume+0.043
 okay+0.041
Neg logit contributions
 galaxies-0.065
 curiously-0.060
iles-0.059
 spear-0.059
 obstacles-0.058
Token hurry
Token ID23290
Feature activation-1.24e+00

Pos logit contributions
 galaxies+0.019
 spear+0.018
iles+0.018
 obstacles+0.018
 warrior+0.018
Neg logit contributions
 Lily-0.017
 chip-0.017
 paramed-0.016
 Sara-0.015
burn-0.015
Token.
Token ID13
Feature activation+2.66e-01

Pos logit contributions
 Lily+0.028
 chip+0.026
 Sara+0.025
 paramed+0.025
 Fl+0.024
Neg logit contributions
 galaxies-0.056
iles-0.053
 spear-0.051
 obstacles-0.051
 warrior-0.050
Token They
Token ID1119
Feature activation+1.80e+00

Pos logit contributions
 galaxies+0.014
iles+0.013
 warrior+0.013
 obstacle+0.013
 obstacles+0.012
Neg logit contributions
 Lily-0.005
 Sara-0.005
 chip-0.004
 Wh-0.004
 Fl-0.004
Token grabbed
Token ID13176
Feature activation+2.55e+00

Pos logit contributions
 galaxies+0.075
iles+0.071
 spear+0.068
 obstacles+0.067
 warrior+0.067
Neg logit contributions
 Lily-0.050
 Sara-0.044
 chip-0.044
 Fl-0.042
 Wh-0.042
Token their
Token ID511
Feature activation-9.43e-01

Pos logit contributions
 galaxies+0.120
iles+0.114
 spear+0.111
 obstacles+0.109
 warrior+0.108
Neg logit contributions
 Lily-0.054
 chip-0.049
 Sara-0.048
 paramed-0.045
 Fl-0.045
Token tr
Token ID491
Feature activation+6.32e+00

Pos logit contributions
 Lily+0.027
 chip+0.026
 Sara+0.025
 Fl+0.024
 paramed+0.024
Neg logit contributions
 galaxies-0.037
iles-0.035
 spear-0.033
 warrior-0.033
 obstacles-0.033
Tokenunks
Token ID14125
Feature activation+2.86e+00

Pos logit contributions
 galaxies+0.183
 warrior+0.163
 obstacles+0.156
 spear+0.154
iles+0.153
Neg logit contributions
 chip-0.237
burn-0.231
 Lily-0.230
 Wh-0.226
 Sn-0.224
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
 galaxies+0.101
 spear+0.094
iles+0.092
 obstacles+0.089
 warrior+0.087
Neg logit contributions
 Lily-0.091
 chip-0.085
 paramed-0.085
 Sara-0.085
 Wh-0.081
Token set
Token ID900
Feature activation-1.31e+00

Pos logit contributions
 galaxies+0.036
 spear+0.034
 obstacles+0.032
iles+0.032
 armor+0.031
Neg logit contributions
 chip-0.034
 paramed-0.033
 Lily-0.033
 Sara-0.032
stand-0.031
Token off
Token ID572
Feature activation-5.03e-01

Pos logit contributions
 Lily+0.025
 chip+0.025
 paramed+0.024
 Sara+0.021
 Wh+0.021
Neg logit contributions
 galaxies-0.061
 spear-0.057
 obstacles-0.057
iles-0.056
 islands-0.054
Token.
Token ID13
Feature activation+3.36e-01

Pos logit contributions
 Lily+0.011
 chip+0.011
 paramed+0.010
 Sara+0.010
burn+0.009
Neg logit contributions
 galaxies-0.023
iles-0.021
 spear-0.021
 obstacles-0.021
 warrior-0.020
Token.
Token ID13
Feature activation+6.94e-02

Pos logit contributions
 Lily+0.014
 chip+0.013
 Sara+0.012
 Fl+0.012
 Wh+0.011
Neg logit contributions
 galaxies-0.032
iles-0.031
 spear-0.030
 obstacles-0.029
 warrior-0.029
Token She
Token ID1375
Feature activation+2.47e+00

Pos logit contributions
 galaxies+0.003
iles+0.002
 spear+0.002
 obstacles+0.002
 warrior+0.002
Neg logit contributions
 paramed-0.002
 chip-0.002
burn-0.002
 Lily-0.002
 v-0.002
Token loved
Token ID6151
Feature activation+2.32e+00

Pos logit contributions
 galaxies+0.075
iles+0.068
 spear+0.066
 warrior+0.063
 obstacles+0.063
Neg logit contributions
 Lily-0.093
 chip-0.090
 Sara-0.087
 paramed-0.086
 Wh-0.083
Token to
Token ID284
Feature activation+1.52e+00

Pos logit contributions
 galaxies+0.113
iles+0.106
 spear+0.105
 obstacles+0.103
 warrior+0.100
Neg logit contributions
 Lily-0.043
 chip-0.039
 Sara-0.038
 paramed-0.036
 okay-0.035
Token eat
Token ID4483
Feature activation+1.24e+00

Pos logit contributions
 galaxies+0.067
iles+0.064
 obstacles+0.062
 spear+0.060
 warrior+0.059
Neg logit contributions
 Lily-0.039
 chip-0.036
 Sara-0.035
 Fl-0.033
 paramed-0.032
Token w
Token ID266
Feature activation+6.32e+00

Pos logit contributions
 galaxies+0.053
iles+0.051
 spear+0.049
 obstacles+0.048
 warrior+0.047
Neg logit contributions
 Lily-0.031
 chip-0.028
 Sara-0.027
 paramed-0.027
 Wh-0.026
Tokenaffles
Token ID48501
Feature activation+4.74e+00

Pos logit contributions
 galaxies+0.185
 warrior+0.154
 spear+0.150
 obstacles+0.146
 cities+0.139
Neg logit contributions
 Lily-0.257
 chip-0.248
 Sara-0.246
 v-0.242
 coffee-0.241
Token for
Token ID329
Feature activation+2.07e+00

Pos logit contributions
 galaxies+0.196
iles+0.195
 spear+0.187
 obstacles+0.176
 warrior+0.174
Neg logit contributions
 Lily-0.114
 paramed-0.106
 chip-0.104
 Sara-0.102
 Wh-0.098
Token breakfast
Token ID12607
Feature activation+1.79e+00

Pos logit contributions
 galaxies+0.088
iles+0.085
 spear+0.081
 obstacles+0.080
 armor+0.077
Neg logit contributions
 Lily-0.049
 chip-0.047
 paramed-0.046
 Sara-0.044
burn-0.043
Token.
Token ID13
Feature activation+8.88e-01

Pos logit contributions
iles+0.060
 galaxies+0.058
 spear+0.055
 reefs+0.053
 facts+0.053
Neg logit contributions
 paramed-0.055
 Lily-0.051
burn-0.048
 Wh-0.048
 Sara-0.047
Token One
Token ID1881
Feature activation+3.79e+00

Pos logit contributions
 galaxies+0.020
iles+0.018
 spear+0.017
 obstacles+0.016
 warrior+0.016
Neg logit contributions
 Lily-0.039
 chip-0.038
 paramed-0.037
 Sara-0.037
burn-0.036
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
 Lily+0.011
 chip+0.010
 Sara+0.009
 Fl+0.009
 paramed+0.009
Neg logit contributions
 galaxies-0.024
iles-0.022
 spear-0.022
 obstacles-0.021
 warrior-0.021
Token She
Token ID1375
Feature activation+2.45e+00

Pos logit contributions
 galaxies+0.008
iles+0.007
 spear+0.007
 obstacles+0.007
 warrior+0.007
Neg logit contributions
 paramed-0.005
 chip-0.005
 Lily-0.005
burn-0.005
 Sara-0.005
Token loved
Token ID6151
Feature activation+2.26e+00

Pos logit contributions
 galaxies+0.074
iles+0.066
 spear+0.065
 warrior+0.062
 obstacles+0.061
Neg logit contributions
 Lily-0.093
 chip-0.090
 Sara-0.088
 paramed-0.086
 Wh-0.084
Token to
Token ID284
Feature activation+1.55e+00

Pos logit contributions
 galaxies+0.112
iles+0.106
 spear+0.104
 obstacles+0.102
 warrior+0.100
Neg logit contributions
 Lily-0.040
 chip-0.036
 Sara-0.035
 paramed-0.034
 Fl-0.032
Token eat
Token ID4483
Feature activation+1.24e+00

Pos logit contributions
 galaxies+0.072
iles+0.070
 obstacles+0.067
 warrior+0.064
 obstacle+0.063
Neg logit contributions
 Lily-0.038
 chip-0.034
 Sara-0.032
 Fl-0.030
 v-0.030
Token w
Token ID266
Feature activation+6.32e+00

Pos logit contributions
 galaxies+0.054
iles+0.051
 spear+0.049
 obstacles+0.048
 warrior+0.048
Neg logit contributions
 Lily-0.031
 chip-0.029
 Sara-0.028
 paramed-0.027
 Fl-0.026
Tokenaffles
Token ID48501
Feature activation+4.78e+00

Pos logit contributions
 galaxies+0.185
 warrior+0.154
 spear+0.149
 obstacles+0.147
 cities+0.140
Neg logit contributions
 Lily-0.257
 chip-0.248
 Sara-0.245
 v-0.240
 coffee-0.240
Token for
Token ID329
Feature activation+2.10e+00

Pos logit contributions
 galaxies+0.200
iles+0.198
 spear+0.190
 obstacles+0.180
 warrior+0.178
Neg logit contributions
 Lily-0.114
 paramed-0.105
 chip-0.104
 Sara-0.102
 Wh-0.097
Token breakfast
Token ID12607
Feature activation+1.84e+00

Pos logit contributions
 galaxies+0.091
iles+0.088
 spear+0.084
 obstacles+0.082
 warrior+0.080
Neg logit contributions
 Lily-0.049
 chip-0.047
 paramed-0.045
 Sara-0.044
burn-0.043
Token.
Token ID13
Feature activation+9.65e-01

Pos logit contributions
iles+0.064
 galaxies+0.062
 spear+0.059
 reefs+0.057
 facts+0.056
Neg logit contributions
 paramed-0.055
 Lily-0.052
 Wh-0.049
burn-0.048
 Sara-0.048
Token Her
Token ID2332
Feature activation-5.02e-01

Pos logit contributions
 galaxies+0.022
iles+0.019
 spear+0.018
 obstacles+0.017
 warrior+0.017
Neg logit contributions
 Lily-0.043
 chip-0.042
 paramed-0.041
 Sara-0.041
burn-0.040
Token.
Token ID13
Feature activation-1.62e-01

Pos logit contributions
 galaxies+0.014
iles+0.014
 spear+0.014
 obstacles+0.013
 warrior+0.013
Neg logit contributions
 Lily-0.008
 chip-0.008
 paramed-0.008
 Sara-0.008
 Wh-0.007
Token They
Token ID1119
Feature activation+1.79e+00

Pos logit contributions
 chip+0.004
 paramed+0.003
burn+0.003
 Lily+0.003
 Fl+0.003
Neg logit contributions
 galaxies-0.007
iles-0.006
 spear-0.006
 obstacles-0.006
 warrior-0.006
Token liked
Token ID8288
Feature activation+1.97e+00

Pos logit contributions
 galaxies+0.082
iles+0.079
 obstacle+0.077
 feats+0.077
 obstacles+0.076
Neg logit contributions
 Lily-0.046
 Sara-0.039
 let-0.037
 clean-0.036
 L-0.035
Token to
Token ID284
Feature activation+1.33e+00

Pos logit contributions
 galaxies+0.102
iles+0.096
 spear+0.095
 obstacles+0.093
 warrior+0.091
Neg logit contributions
 Lily-0.032
 chip-0.029
 Sara-0.027
 paramed-0.026
 Fl-0.025
Token spray
Token ID11662
Feature activation+1.63e+00

Pos logit contributions
 galaxies+0.058
iles+0.054
 obstacles+0.051
 warrior+0.050
 spear+0.049
Neg logit contributions
 Lily-0.038
 Sara-0.034
 chip-0.032
 v-0.031
 Fl-0.031
Token each
Token ID1123
Feature activation+6.31e+00

Pos logit contributions
 galaxies+0.071
iles+0.067
 spear+0.065
 obstacles+0.064
 warrior+0.063
Neg logit contributions
 Lily-0.040
 chip-0.037
 Sara-0.036
 paramed-0.034
 Fl-0.034
Token other
Token ID584
Feature activation+2.36e+00

Pos logit contributions
 galaxies+0.277
iles+0.261
 spear+0.253
 obstacles+0.248
 warrior+0.246
Neg logit contributions
 Lily-0.155
 chip-0.143
 Sara-0.139
 paramed-0.132
 Fl-0.131
Token and
Token ID290
Feature activation+6.56e-01

Pos logit contributions
 galaxies+0.087
 spear+0.082
 obstacles+0.079
iles+0.078
 goals+0.078
Neg logit contributions
 paramed-0.070
 Lily-0.067
 Sara-0.065
 chip-0.063
 Wh-0.063
Token make
Token ID787
Feature activation-5.22e-01

Pos logit contributions
 galaxies+0.027
iles+0.025
 spear+0.025
 obstacles+0.024
 warrior+0.024
Neg logit contributions
 Lily-0.017
 chip-0.017
 Sara-0.016
 paramed-0.016
 Fl-0.015
Token funny
Token ID8258
Feature activation+1.48e+00

Pos logit contributions
 Lily+0.015
 chip+0.015
 paramed+0.015
burn+0.014
 Sara+0.014
Neg logit contributions
 galaxies-0.019
iles-0.018
 obstacles-0.018
 spear-0.017
 obstacle-0.016
Token noises
Token ID26782
Feature activation+1.07e+00

Pos logit contributions
 galaxies+0.054
iles+0.051
 spear+0.049
 warrior+0.049
 obstacles+0.047
Neg logit contributions
 Lily-0.047
 chip-0.044
 Sara-0.043
 Fl-0.041
 v-0.041
Token decided
Token ID3066
Feature activation-3.54e-01

Pos logit contributions
 galaxies+0.079
iles+0.075
 obstacles+0.072
 spear+0.072
 warrior+0.071
Neg logit contributions
 Lily-0.052
 Sara-0.044
 Wh-0.043
 chip-0.042
 Fl-0.042
Token to
Token ID284
Feature activation-1.27e-02

Pos logit contributions
 chip+0.008
 paramed+0.008
burn+0.007
ird+0.007
 Sn+0.007
Neg logit contributions
 galaxies-0.015
 spear-0.014
 obstacles-0.014
 warrior-0.013
 cities-0.013
Token jump
Token ID4391
Feature activation+7.60e-01

Pos logit contributions
 chip+0.000
 Lily+0.000
 paramed+0.000
 Sara+0.000
burn+0.000
Neg logit contributions
 galaxies-0.000
 spear-0.000
iles-0.000
 warrior-0.000
 obstacles-0.000
Token on
Token ID319
Feature activation+2.37e+00

Pos logit contributions
 galaxies+0.037
iles+0.036
 spear+0.034
 warrior+0.034
 obstacles+0.034
Neg logit contributions
 Lily-0.017
 Sara-0.014
 Fl-0.013
 chip-0.013
 Wh-0.012
Token the
Token ID262
Feature activation+2.11e+00

Pos logit contributions
 galaxies+0.125
iles+0.119
 spear+0.116
 warrior+0.114
 obstacles+0.114
Neg logit contributions
 Lily-0.038
 chip-0.032
 Sara-0.031
 Fl-0.029
 Wh-0.028
Token tr
Token ID491
Feature activation+6.30e+00

Pos logit contributions
 galaxies+0.076
iles+0.071
 obstacles+0.067
 warrior+0.067
 spear+0.067
Neg logit contributions
 Lily-0.069
 chip-0.064
 Sara-0.063
 Fl-0.062
 v-0.060
Tokenamp
Token ID696
Feature activation+4.19e+00

Pos logit contributions
 galaxies+0.230
 obstacles+0.219
 warrior+0.217
 spear+0.216
 questions+0.208
Neg logit contributions
 Lily-0.180
stand-0.171
 chip-0.167
 Sara-0.163
 Fl-0.161
Tokenoline
Token ID14453
Feature activation+1.20e+00

Pos logit contributions
 galaxies+0.088
iles+0.075
 spear+0.073
 obstacles+0.069
 warrior+0.068
Neg logit contributions
 Lily-0.202
 chip-0.191
 Sara-0.189
 Fl-0.184
 Wh-0.183
Token,
Token ID11
Feature activation+1.08e+00

Pos logit contributions
 galaxies+0.050
iles+0.047
 spear+0.046
 obstacles+0.045
 warrior+0.045
Neg logit contributions
 Lily-0.030
 chip-0.028
 paramed-0.028
 Sara-0.028
 Wh-0.026
Token but
Token ID475
Feature activation+1.28e+00

Pos logit contributions
 galaxies+0.049
iles+0.047
 spear+0.045
 obstacles+0.044
 warrior+0.044
Neg logit contributions
 Lily-0.025
 chip-0.022
 Sara-0.022
 Fl-0.021
 paramed-0.020
Token Lily
Token ID20037
Feature activation-2.56e+00

Pos logit contributions
 galaxies+0.058
 obstacles+0.055
 planets+0.052
 spear+0.051
 tirelessly+0.050
Neg logit contributions
 paramed-0.025
 chip-0.024
burn-0.023
stand-0.020
ume-0.019
Token bad
Token ID2089
Feature activation-1.88e+00

Pos logit contributions
 Lily+0.028
 chip+0.026
 Sara+0.025
 Fl+0.024
 Wh+0.023
Neg logit contributions
 galaxies-0.041
iles-0.039
 spear-0.037
 warrior-0.037
 obstacles-0.036
Token happened
Token ID3022
Feature activation-3.14e+00

Pos logit contributions
 Lily+0.063
 chip+0.061
 Sara+0.060
 paramed+0.058
 Fl+0.057
Neg logit contributions
 galaxies-0.064
iles-0.059
 spear-0.058
 obstacles-0.056
 warrior-0.055
Token,
Token ID11
Feature activation-6.80e-01

Pos logit contributions
 Lily+0.064
 Sara+0.053
 chip+0.052
 Fl+0.050
 Wh+0.048
Neg logit contributions
 galaxies-0.156
iles-0.150
 spear-0.143
 obstacles-0.141
 warrior-0.141
Token she
Token ID673
Feature activation-2.20e+00

Pos logit contributions
 Lily+0.016
 chip+0.016
 Sara+0.015
 paramed+0.015
burn+0.015
Neg logit contributions
 galaxies-0.029
 spear-0.027
iles-0.027
 obstacles-0.026
 warrior-0.025
Token was
Token ID373
Feature activation-1.31e+00

Pos logit contributions
 Lily+0.049
 chip+0.041
 Fl+0.039
 let+0.039
 v+0.038
Neg logit contributions
 galaxies-0.102
iles-0.102
 obstacles-0.096
 obstacle-0.093
eways-0.093
Token able
Token ID1498
Feature activation-6.53e+00

Pos logit contributions
 chip+0.038
burn+0.035
stand+0.034
 Lily+0.034
 paramed+0.033
Neg logit contributions
 galaxies-0.048
 spear-0.044
 warrior-0.043
 armor-0.042
iles-0.041
Token to
Token ID284
Feature activation-1.39e+00

Pos logit contributions
 paramed+0.159
 chip+0.155
 Lily+0.151
ird+0.149
 Sara+0.146
Neg logit contributions
 galaxies-0.265
 spear-0.258
 obstacles-0.250
iles-0.243
 warrior-0.230
Token make
Token ID787
Feature activation-1.36e+00

Pos logit contributions
 Lily+0.035
 chip+0.032
 Sara+0.032
 paramed+0.030
 Fl+0.030
Neg logit contributions
 galaxies-0.059
iles-0.056
 spear-0.055
 obstacles-0.053
 warrior-0.053
Token new
Token ID649
Feature activation+1.07e+00

Pos logit contributions
 paramed+0.032
 Lily+0.031
 chip+0.031
burn+0.030
stand+0.029
Neg logit contributions
 galaxies-0.056
 obstacles-0.052
 spear-0.051
iles-0.051
 obstacle-0.048
Token friends
Token ID2460
Feature activation+1.47e+00

Pos logit contributions
 galaxies+0.048
iles+0.045
 spear+0.043
 obstacles+0.043
 warrior+0.042
Neg logit contributions
 Lily-0.026
 chip-0.024
 Sara-0.023
 Fl-0.021
 Mitt-0.021
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
 galaxies+0.064
iles+0.059
 spear+0.058
 obstacles+0.057
 warrior+0.057
Neg logit contributions
 Lily-0.036
 chip-0.034
 Sara-0.033
 paramed-0.032
 Wh-0.031
Token before
Token ID878
Feature activation-3.06e+00

Pos logit contributions
 chip+0.005
 paramed+0.005
burn+0.005
 Sara+0.005
Pe+0.004
Neg logit contributions
 galaxies-0.008
 armor-0.007
 spear-0.007
 obstacles-0.007
 tirelessly-0.007
Token and
Token ID290
Feature activation-6.90e-01

Pos logit contributions
 paramed+0.072
 chip+0.072
 Lily+0.071
 Sara+0.069
burn+0.065
Neg logit contributions
 galaxies-0.129
 obstacles-0.118
iles-0.116
 spear-0.116
 warrior-0.111
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
burn+0.018
 paramed+0.018
 chip+0.017
Pe+0.016
S+0.016
Neg logit contributions
 galaxies-0.027
 obstacles-0.024
 spear-0.024
 warrior-0.023
 obstacle-0.023
Token toy
Token ID13373
Feature activation-2.47e+00

Pos logit contributions
 chip+0.042
burn+0.042
 paramed+0.041
 Lily+0.041
 Sara+0.040
Neg logit contributions
 galaxies-0.060
 spear-0.056
 obstacles-0.055
 warrior-0.054
 armor-0.052
Token was
Token ID373
Feature activation-1.03e+00

Pos logit contributions
 chip+0.074
 Lily+0.071
 paramed+0.071
 Sara+0.070
burn+0.070
Neg logit contributions
 galaxies-0.091
 spear-0.081
iles-0.079
 warrior-0.079
 obstacles-0.078
Token able
Token ID1498
Feature activation-6.51e+00

Pos logit contributions
 chip+0.028
burn+0.027
 paramed+0.027
 Sara+0.025
 Woo+0.025
Neg logit contributions
 galaxies-0.037
 armor-0.035
 tirelessly-0.035
 armour-0.034
 obstacles-0.034
Token to
Token ID284
Feature activation-1.22e+00

Pos logit contributions
 paramed+0.182
 chip+0.167
burn+0.166
ird+0.160
 Sn+0.159
Neg logit contributions
 galaxies-0.249
 obstacles-0.244
 spear-0.232
 obstacle-0.216
iles-0.215
Token yield
Token ID7800
Feature activation-9.78e-01

Pos logit contributions
 Lily+0.033
 chip+0.033
 paramed+0.032
 Sara+0.032
burn+0.030
Neg logit contributions
 galaxies-0.047
 spear-0.044
iles-0.043
 warrior-0.042
 obstacles-0.042
Token!
Token ID0
Feature activation+2.52e-01

Pos logit contributions
 paramed+0.024
 chip+0.023
 Lily+0.022
burn+0.022
 Sara+0.021
Neg logit contributions
 galaxies-0.041
 obstacles-0.039
iles-0.038
 spear-0.037
 goals-0.036
Token
Token ID198
Feature activation-1.06e+00

Pos logit contributions
 galaxies+0.009
 spear+0.008
 obstacles+0.007
 warrior+0.007
 planets+0.007
Neg logit contributions
burn-0.007
 paramed-0.007
 chip-0.007
 Sara-0.007
 v-0.007
Token
Token ID198
Feature activation-9.98e-01

Pos logit contributions
 Lily+0.028
 chip+0.024
 Sara+0.023
 Fl+0.023
 Mitt+0.022
Neg logit contributions
 galaxies-0.047
iles-0.046
 spear-0.043
 obstacles-0.043
 warrior-0.043
Token was
Token ID373
Feature activation-1.04e+00

Pos logit contributions
 chip+0.055
 Lily+0.054
 Sara+0.052
burn+0.051
 paramed+0.051
Neg logit contributions
 galaxies-0.057
iles-0.054
 spear-0.052
 obstacles-0.051
 warrior-0.050
Token broken
Token ID5445
Feature activation-2.87e+00

Pos logit contributions
 chip+0.034
 Lily+0.034
 Sara+0.033
 Mitt+0.032
L+0.032
Neg logit contributions
 spear-0.034
 galaxies-0.033
 armor-0.031
iles-0.030
 obstacles-0.030
Token and
Token ID290
Feature activation-1.21e+00

Pos logit contributions
 Lily+0.073
 chip+0.071
 paramed+0.069
 Sara+0.068
 Fl+0.064
Neg logit contributions
 galaxies-0.118
iles-0.111
 spear-0.110
 obstacles-0.107
 warrior-0.105
Token he
Token ID339
Feature activation-4.41e-01

Pos logit contributions
 chip+0.034
 Lily+0.033
 paramed+0.033
 Sara+0.032
burn+0.031
Neg logit contributions
 galaxies-0.047
 spear-0.043
iles-0.043
 obstacles-0.042
 warrior-0.041
Token was
Token ID373
Feature activation-1.09e+00

Pos logit contributions
 Lily+0.011
 chip+0.010
 Sara+0.010
 Fl+0.010
 Wh+0.010
Neg logit contributions
 galaxies-0.019
iles-0.018
 obstacles-0.017
 spear-0.017
 warrior-0.017
Token able
Token ID1498
Feature activation-6.51e+00

Pos logit contributions
 chip+0.034
 Lily+0.032
L+0.032
 Sara+0.031
 Lara+0.030
Neg logit contributions
 galaxies-0.035
 spear-0.034
 armor-0.033
 obstacles-0.033
 armour-0.032
Token to
Token ID284
Feature activation-1.52e+00

Pos logit contributions
 chip+0.166
 paramed+0.164
 Lily+0.162
 Sn+0.151
ird+0.151
Neg logit contributions
 galaxies-0.255
 spear-0.252
 obstacles-0.244
iles-0.236
 armor-0.226
Token fix
Token ID4259
Feature activation-1.19e+00

Pos logit contributions
 Lily+0.044
 Sara+0.041
 chip+0.040
 paramed+0.039
 Fl+0.038
Neg logit contributions
 galaxies-0.056
 spear-0.055
iles-0.053
 warrior-0.052
 obstacles-0.050
Token it
Token ID340
Feature activation-8.77e-01

Pos logit contributions
 Lily+0.026
 paramed+0.026
 chip+0.026
burn+0.024
 Sara+0.024
Neg logit contributions
 galaxies-0.052
iles-0.049
 spear-0.049
 obstacles-0.048
 warrior-0.046
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
 paramed+0.028
 chip+0.025
 Sara+0.025
 Lily+0.025
burn+0.024
Neg logit contributions
 galaxies-0.030
 spear-0.029
 obstacles-0.028
iles-0.027
 obstacle-0.027
Token He
Token ID679
Feature activation+2.14e+00

Pos logit contributions
burn+0.010
 chip+0.010
 Lily+0.010
 Sara+0.010
 paramed+0.010
Neg logit contributions
 galaxies-0.007
 spear-0.006
iles-0.006
 warrior-0.006
 obstacles-0.006
Token poked
Token ID47320
Feature activation+1.68e+00

Pos logit contributions
 chip+0.065
 paramed+0.059
 Lily+0.059
 Wh+0.055
 Sara+0.054
Neg logit contributions
 galaxies-0.107
iles-0.097
 obstacles-0.097
 spear-0.096
 armor-0.092
Token it
Token ID340
Feature activation+2.57e-01

Pos logit contributions
 galaxies+0.077
iles+0.073
 spear+0.071
 obstacles+0.070
 warrior+0.069
Neg logit contributions
 Lily-0.038
 chip-0.034
 Sara-0.033
 Fl-0.031
 Wh-0.031
Token and
Token ID290
Feature activation-2.78e-01

Pos logit contributions
 galaxies+0.011
iles+0.011
 spear+0.010
 obstacles+0.010
 warrior+0.010
Neg logit contributions
 Lily-0.006
 chip-0.006
 Sara-0.005
 Fl-0.005
 Wh-0.005
Token it
Token ID340
Feature activation-1.74e+00

Pos logit contributions
 Lily+0.007
 chip+0.007
 Sara+0.007
 paramed+0.007
 Wh+0.006
Neg logit contributions
 galaxies-0.012
iles-0.011
 spear-0.011
 obstacles-0.010
 warrior-0.010
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 Lily+0.051
 chip+0.048
 Sara+0.047
 paramed+0.045
 Wh+0.044
Neg logit contributions
 galaxies-0.068
iles-0.063
 spear-0.062
 obstacles-0.060
 warrior-0.060
Token able
Token ID1498
Feature activation-6.48e+00

Pos logit contributions
 chip+0.025
 Lily+0.024
 paramed+0.024
 Sara+0.022
 Wh+0.022
Neg logit contributions
 galaxies-0.043
iles-0.039
 spear-0.039
 obstacles-0.039
 armor-0.037
Token to
Token ID284
Feature activation-1.33e+00

Pos logit contributions
 Lily+0.132
 chip+0.132
 paramed+0.132
 Sara+0.120
 Wh+0.119
Neg logit contributions
 galaxies-0.302
 spear-0.283
 obstacles-0.282
iles-0.280
 warrior-0.263
Token turn
Token ID1210
Feature activation-1.48e+00

Pos logit contributions
 Lily+0.034
 chip+0.031
 Sara+0.030
 Fl+0.029
 Wh+0.028
Neg logit contributions
 galaxies-0.057
iles-0.054
 obstacles-0.051
 spear-0.051
 warrior-0.049
Token the
Token ID262
Feature activation+8.18e-01

Pos logit contributions
 Lily+0.033
 chip+0.032
 paramed+0.030
 Sara+0.029
 Wh+0.028
Neg logit contributions
 galaxies-0.068
iles-0.063
 spear-0.062
 obstacles-0.061
 warrior-0.060
Token situation
Token ID3074
Feature activation-4.15e-02

Pos logit contributions
 galaxies+0.031
iles+0.029
 spear+0.028
 obstacles+0.028
 warrior+0.027
Neg logit contributions
 Lily-0.025
 chip-0.023
 Sara-0.023
 paramed-0.023
 Wh-0.022
Token around
Token ID1088
Feature activation+9.94e-01

Pos logit contributions
 paramed+0.001
 Lily+0.001
 chip+0.001
 Sara+0.001
 Wh+0.001
Neg logit contributions
 galaxies-0.002
 obstacles-0.002
 spear-0.002
iles-0.002
 warrior-0.002
Token sun
Token ID4252
Feature activation-2.40e+00

Pos logit contributions
 Lily+0.041
 chip+0.040
 Sara+0.037
 baby+0.037
 Fl+0.036
Neg logit contributions
 galaxies-0.040
iles-0.036
 obstacles-0.034
 spear-0.034
 warrior-0.033
Token disappeared
Token ID12120
Feature activation-3.19e+00

Pos logit contributions
 Lily+0.075
 chip+0.070
 paramed+0.070
 Sara+0.070
 Wh+0.068
Neg logit contributions
 galaxies-0.087
iles-0.081
 spear-0.078
 obstacles-0.075
 islands-0.075
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
 Lily+0.079
 chip+0.076
 paramed+0.073
 Sara+0.072
 Wh+0.069
Neg logit contributions
 galaxies-0.139
iles-0.129
 spear-0.126
 obstacles-0.124
 warrior-0.121
Token they
Token ID484
Feature activation-2.61e+00

Pos logit contributions
 Lily+0.021
 chip+0.021
 paramed+0.020
 Sara+0.019
 Wh+0.019
Neg logit contributions
 galaxies-0.051
iles-0.047
 spear-0.046
 obstacles-0.045
 warrior-0.044
Token were
Token ID547
Feature activation-2.08e+00

Pos logit contributions
 Lily+0.079
 chip+0.074
 Sara+0.072
 paramed+0.070
 Fl+0.069
Neg logit contributions
 galaxies-0.100
iles-0.093
 spear-0.090
 obstacles-0.088
 warrior-0.087
Token able
Token ID1498
Feature activation-6.48e+00

Pos logit contributions
 chip+0.063
 paramed+0.063
 Lily+0.060
 Wh+0.056
L+0.055
Neg logit contributions
 galaxies-0.080
 spear-0.069
 obstacles-0.068
 planets-0.068
iles-0.068
Token to
Token ID284
Feature activation-1.51e+00

Pos logit contributions
 paramed+0.148
 chip+0.146
 Lily+0.144
 Wh+0.134
 Sn+0.132
Neg logit contributions
 galaxies-0.286
 obstacles-0.266
iles-0.264
 spear-0.263
 islands-0.251
Token make
Token ID787
Feature activation-1.42e+00

Pos logit contributions
 Lily+0.042
 chip+0.038
 Sara+0.038
 Fl+0.036
 Wh+0.036
Neg logit contributions
 galaxies-0.061
iles-0.058
 spear-0.055
 obstacles-0.055
 warrior-0.053
Token it
Token ID340
Feature activation-8.53e-01

Pos logit contributions
 Lily+0.031
 paramed+0.030
 chip+0.030
 Wh+0.028
 Sara+0.028
Neg logit contributions
 galaxies-0.064
iles-0.060
 obstacles-0.058
 spear-0.058
 islands-0.056
Token safely
Token ID11512
Feature activation-2.77e-02

Pos logit contributions
 Lily+0.022
 chip+0.021
 paramed+0.021
 Sara+0.020
 Wh+0.019
Neg logit contributions
 galaxies-0.036
iles-0.033
 spear-0.033
 obstacles-0.032
 warrior-0.031
Token back
Token ID736
Feature activation-5.15e-01

Pos logit contributions
 Lily+0.001
 paramed+0.001
 chip+0.001
 Sara+0.001
 Mitt+0.001
Neg logit contributions
 galaxies-0.001
 spear-0.001
iles-0.001
 islands-0.001
 obstacles-0.001
Token and
Token ID290
Feature activation-1.40e+00

Pos logit contributions
 Lily+0.069
 paramed+0.069
 chip+0.069
 Sara+0.064
 Wh+0.062
Neg logit contributions
 galaxies-0.127
iles-0.118
 obstacles-0.115
 spear-0.115
 warrior-0.110
Token no
Token ID645
Feature activation-3.05e+00

Pos logit contributions
 chip+0.035
 paramed+0.033
burn+0.032
 Lily+0.030
 Sara+0.029
Neg logit contributions
 galaxies-0.061
 obstacles-0.054
iles-0.054
 spear-0.053
 planets-0.051
Token-
Token ID12
Feature activation-3.56e+00

Pos logit contributions
 Lily+0.086
 chip+0.080
 Sara+0.078
 paramed+0.075
 Fl+0.074
Neg logit contributions
 galaxies-0.123
iles-0.115
 spear-0.112
 obstacles-0.109
 warrior-0.108
Tokenone
Token ID505
Feature activation-3.14e+00

Pos logit contributions
 Lily+0.113
 chip+0.112
 Sara+0.106
 Fl+0.103
 paramed+0.101
Neg logit contributions
 galaxies-0.129
iles-0.119
 obstacles-0.113
 spear-0.113
 warrior-0.109
Token was
Token ID373
Feature activation-2.23e+00

Pos logit contributions
 Lily+0.085
 chip+0.080
 Sara+0.077
 paramed+0.076
 Wh+0.074
Neg logit contributions
 galaxies-0.128
iles-0.121
 spear-0.117
 obstacles-0.115
 warrior-0.113
Token able
Token ID1498
Feature activation-6.48e+00

Pos logit contributions
 chip+0.067
L+0.060
 Mitt+0.060
 paramed+0.059
 Wh+0.058
Neg logit contributions
 galaxies-0.076
 spear-0.072
 obstacles-0.072
 armor-0.070
 warrior-0.070
Token to
Token ID284
Feature activation-1.49e+00

Pos logit contributions
 chip+0.169
 paramed+0.167
 Lily+0.150
burn+0.150
 Sn+0.147
Neg logit contributions
 galaxies-0.256
 obstacles-0.247
 spear-0.243
iles-0.241
 questions-0.228
Token help
Token ID1037
Feature activation-2.82e-01

Pos logit contributions
 chip+0.043
 Lily+0.043
 paramed+0.042
 Sara+0.041
burn+0.039
Neg logit contributions
 galaxies-0.053
 spear-0.050
iles-0.049
 warrior-0.048
 obstacles-0.047
Token her
Token ID607
Feature activation-2.40e+00

Pos logit contributions
 paramed+0.007
 chip+0.006
burn+0.006
 Lily+0.006
 Wh+0.006
Neg logit contributions
 galaxies-0.012
iles-0.011
 obstacles-0.011
 spear-0.011
 cliffs-0.010
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 Lily+0.063
 paramed+0.062
 chip+0.061
 Wh+0.058
 Sara+0.058
Neg logit contributions
 galaxies-0.098
 spear-0.090
iles-0.087
 obstacles-0.086
 warrior-0.085
Token
Token ID220
Feature activation+1.53e+00

Pos logit contributions
 chip+0.029
 paramed+0.029
 Lily+0.029
burn+0.028
 v+0.026
Neg logit contributions
 galaxies-0.044
iles-0.040
 spear-0.039
 obstacles-0.038
 warrior-0.038
Token,
Token ID11
Feature activation+7.04e-01

Pos logit contributions
 Lily+0.012
 chip+0.011
 Sara+0.011
 paramed+0.010
 Fl+0.010
Neg logit contributions
 galaxies-0.024
iles-0.022
 spear-0.022
 obstacles-0.021
 warrior-0.021
Token and
Token ID290
Feature activation-1.31e+00

Pos logit contributions
 galaxies+0.032
iles+0.030
 spear+0.029
 obstacles+0.029
 warrior+0.028
Neg logit contributions
 Lily-0.016
 chip-0.015
 Sara-0.015
 paramed-0.014
 Fl-0.014
Token eventually
Token ID4191
Feature activation-2.95e+00

Pos logit contributions
 Lily+0.039
 chip+0.038
 paramed+0.037
burn+0.036
 Sara+0.036
Neg logit contributions
 galaxies-0.049
iles-0.045
 spear-0.044
 obstacles-0.043
 warrior-0.043
Token he
Token ID339
Feature activation-2.00e+00

Pos logit contributions
 chip+0.103
burn+0.101
 paramed+0.100
 Lily+0.094
stand+0.094
Neg logit contributions
 galaxies-0.095
 spear-0.089
 warrior-0.084
iles-0.084
 obstacles-0.082
Token was
Token ID373
Feature activation-2.11e+00

Pos logit contributions
 Lily+0.053
 chip+0.049
 Sara+0.048
 paramed+0.045
 Fl+0.045
Neg logit contributions
 galaxies-0.084
iles-0.079
 spear-0.077
 obstacles-0.075
 warrior-0.074
Token able
Token ID1498
Feature activation-6.45e+00

Pos logit contributions
 chip+0.061
 Wh+0.055
 Lily+0.054
 Mitt+0.053
 paramed+0.053
Neg logit contributions
 galaxies-0.073
 spear-0.070
 warrior-0.069
 armor-0.069
 armour-0.068
Token to
Token ID284
Feature activation-1.27e+00

Pos logit contributions
 chip+0.142
 paramed+0.140
 Lily+0.140
burn+0.127
 Wh+0.126
Neg logit contributions
 galaxies-0.281
 spear-0.271
 obstacles-0.266
iles-0.262
 warrior-0.251
Token buy
Token ID2822
Feature activation-1.10e+00

Pos logit contributions
 Lily+0.033
 chip+0.031
 Sara+0.030
 paramed+0.029
 Fl+0.029
Neg logit contributions
 galaxies-0.051
 spear-0.049
iles-0.048
 warrior-0.047
 obstacles-0.047
Token the
Token ID262
Feature activation+1.44e+00

Pos logit contributions
 Lily+0.026
 chip+0.025
 Sara+0.023
 paramed+0.023
 Fl+0.022
Neg logit contributions
 galaxies-0.048
iles-0.045
 spear-0.044
 obstacles-0.044
 warrior-0.043
Token gear
Token ID7733
Feature activation+1.89e+00

Pos logit contributions
 galaxies+0.051
iles+0.051
alk+0.043
 feats+0.043
 cliffs+0.042
Neg logit contributions
 Lily-0.048
 chip-0.045
 v-0.044
 Wh-0.042
 baby-0.041
Token he
Token ID339
Feature activation+3.42e-01

Pos logit contributions
 galaxies+0.075
 spear+0.073
 obstacles+0.068
 warrior+0.068
iles+0.067
Neg logit contributions
 Lily-0.047
 paramed-0.045
 chip-0.045
 Sara-0.044
 Wh-0.043
Tokenower
Token ID789
Feature activation-9.00e-01

Pos logit contributions
 Lily+0.090
 chip+0.083
 Sara+0.080
 Fl+0.076
 paramed+0.076
Neg logit contributions
 galaxies-0.162
iles-0.153
 spear-0.148
 obstacles-0.145
 warrior-0.144
Token until
Token ID1566
Feature activation-2.71e+00

Pos logit contributions
 Lily+0.022
 chip+0.021
 Sara+0.020
 paramed+0.020
 Fl+0.019
Neg logit contributions
 galaxies-0.038
iles-0.036
 spear-0.035
 obstacles-0.034
 warrior-0.034
Token the
Token ID262
Feature activation-2.15e+00

Pos logit contributions
 paramed+0.065
 chip+0.065
 Lily+0.062
burn+0.060
stand+0.056
Neg logit contributions
 galaxies-0.113
 spear-0.105
 obstacles-0.103
iles-0.103
 warrior-0.101
Token bear
Token ID6842
Feature activation-1.17e+00

Pos logit contributions
 Lily+0.065
 Sara+0.061
 chip+0.060
 Fl+0.058
 paramed+0.057
Neg logit contributions
 galaxies-0.080
 spear-0.075
iles-0.075
 obstacles-0.072
 warrior-0.072
Token was
Token ID373
Feature activation-1.87e+00

Pos logit contributions
 Lily+0.033
 chip+0.031
 Sara+0.031
 paramed+0.030
 Wh+0.029
Neg logit contributions
 galaxies-0.045
iles-0.043
 spear-0.042
 warrior-0.041
 obstacles-0.040
Token able
Token ID1498
Feature activation-6.44e+00

Pos logit contributions
 chip+0.051
 Lily+0.048
 paramed+0.047
 Wh+0.046
 Mitt+0.044
Neg logit contributions
 galaxies-0.071
 spear-0.069
 armor-0.066
 armour-0.066
iles-0.065
Token to
Token ID284
Feature activation-1.20e+00

Pos logit contributions
 chip+0.129
 paramed+0.129
 Lily+0.128
 Wh+0.115
 Sara+0.113
Neg logit contributions
 galaxies-0.298
 spear-0.285
iles-0.282
 obstacles-0.278
 warrior-0.266
Token cross
Token ID3272
Feature activation+1.11e+00

Pos logit contributions
 Lily+0.029
 chip+0.027
 Sara+0.026
 Fl+0.025
 paramed+0.024
Neg logit contributions
 galaxies-0.054
iles-0.050
 obstacles-0.048
 spear-0.047
 warrior-0.046
Token it
Token ID340
Feature activation-1.25e+00

Pos logit contributions
 galaxies+0.059
iles+0.056
 spear+0.055
 warrior+0.054
 obstacles+0.053
Neg logit contributions
 Lily-0.018
 Sara-0.015
 chip-0.014
 Fl-0.013
 Wh-0.013
Token.
Token ID13
Feature activation-5.66e-01

Pos logit contributions
 Lily+0.028
 chip+0.026
 Sara+0.025
 paramed+0.024
 Fl+0.024
Neg logit contributions
 galaxies-0.057
iles-0.054
 spear-0.052
 obstacles-0.051
 warrior-0.051
Token
Token ID220
Feature activation+2.00e+00

Pos logit contributions
 Lily+0.015
 chip+0.015
burn+0.014
 paramed+0.014
 Sara+0.014
Neg logit contributions
 galaxies-0.022
 spear-0.020
iles-0.020
 warrior-0.020
 obstacles-0.019
Token it
Token ID340
Feature activation-1.80e+00

Pos logit contributions
 galaxies+0.057
iles+0.053
 spear+0.052
 obstacles+0.051
 planets+0.050
Neg logit contributions
 paramed-0.030
 chip-0.029
 Lily-0.028
burn-0.027
stand-0.027
Token was
Token ID373
Feature activation-1.16e+00

Pos logit contributions
 Lily+0.046
 chip+0.044
 paramed+0.043
 Sara+0.042
 Wh+0.040
Neg logit contributions
 galaxies-0.076
iles-0.072
 spear-0.069
 obstacles-0.068
 warrior-0.066
Token day
Token ID1110
Feature activation-1.39e+00

Pos logit contributions
 chip+0.031
 paramed+0.031
 Lily+0.029
burn+0.028
ird+0.027
Neg logit contributions
 galaxies-0.046
iles-0.044
 obstacles-0.043
 spear-0.042
 planets-0.041
Token a
Token ID257
Feature activation-2.18e+00

Pos logit contributions
 chip+0.039
 paramed+0.037
 Wh+0.037
burn+0.036
 Mitt+0.035
Neg logit contributions
 galaxies-0.050
 spear-0.047
iles-0.046
 questions-0.045
 obstacles-0.044
Token minute
Token ID5664
Feature activation-1.23e+00

Pos logit contributions
 Lily+0.055
 chip+0.051
 Sara+0.051
 Wh+0.049
 paramed+0.048
Neg logit contributions
 galaxies-0.090
 spear-0.085
iles-0.084
 warrior-0.082
 obstacles-0.081
Token ago
Token ID2084
Feature activation-6.44e+00

Pos logit contributions
 Lily+0.032
 chip+0.031
 paramed+0.031
 Wh+0.030
 Mitt+0.030
Neg logit contributions
 galaxies-0.050
iles-0.048
 spear-0.044
 obstacles-0.044
 warrior-0.043
Token."
Token ID526
Feature activation-1.14e+00

Pos logit contributions
 paramed+0.193
 chip+0.186
 Lily+0.186
 Wh+0.182
 Mitt+0.182
Neg logit contributions
 galaxies-0.216
iles-0.212
 spear-0.209
 obstacles-0.192
 armor-0.192
Token
Token ID198
Feature activation-6.67e-01

Pos logit contributions
 chip+0.022
 Lily+0.021
burn+0.020
 paramed+0.020
 Sara+0.019
Neg logit contributions
 galaxies-0.053
iles-0.050
 spear-0.050
 obstacles-0.048
 warrior-0.048
Token
Token ID198
Feature activation-7.12e-01

Pos logit contributions
 Lily+0.016
 chip+0.014
 Sara+0.014
 paramed+0.013
 Fl+0.013
Neg logit contributions
 galaxies-0.030
iles-0.028
 spear-0.027
 obstacles-0.027
 warrior-0.027
TokenThey
Token ID2990
Feature activation+1.74e-01

Pos logit contributions
 Lily+0.020
 chip+0.019
 paramed+0.018
 Sara+0.018
burn+0.018
Neg logit contributions
 galaxies-0.027
 spear-0.026
iles-0.026
 obstacles-0.025
 warrior-0.025
Token heard
Token ID2982
Feature activation+2.84e-01

Pos logit contributions
 galaxies+0.006
iles+0.006
 spear+0.006
 warrior+0.005
 obstacles+0.005
Neg logit contributions
 chip-0.006
 Lily-0.005
 paramed-0.005
 Wh-0.005
 Sara-0.005
Token,
Token ID11
Feature activation+1.52e-02

Pos logit contributions
 paramed+0.021
 chip+0.020
burn+0.019
 Sara+0.018
 Lily+0.017
Neg logit contributions
 galaxies-0.026
 spear-0.024
 obstacles-0.024
iles-0.023
 tirelessly-0.023
Token that
Token ID326
Feature activation-2.28e+00

Pos logit contributions
 galaxies+0.001
 spear+0.001
 obstacles+0.001
iles+0.000
 warrior+0.000
Neg logit contributions
 chip-0.000
 paramed-0.000
burn-0.000
 Lily-0.000
 Wh-0.000
Token she
Token ID673
Feature activation-1.59e+00

Pos logit contributions
 paramed+0.054
 chip+0.051
 Lily+0.048
burn+0.048
 Wh+0.048
Neg logit contributions
 galaxies-0.100
iles-0.091
 obstacles-0.090
 spear-0.089
 planets-0.086
Token was
Token ID373
Feature activation-7.75e-01

Pos logit contributions
 Lily+0.045
 chip+0.041
 Sara+0.040
 Fl+0.038
 Wh+0.038
Neg logit contributions
 galaxies-0.065
iles-0.061
 spear-0.059
 obstacles-0.058
 warrior-0.057
Token not
Token ID407
Feature activation-3.32e+00

Pos logit contributions
 chip+0.024
stand+0.022
burn+0.022
L+0.020
 paramed+0.020
Neg logit contributions
 galaxies-0.024
 obstacles-0.023
 questions-0.022
iles-0.021
 spear-0.021
Token able
Token ID1498
Feature activation-6.42e+00

Pos logit contributions
 chip+0.090
stand+0.087
 paramed+0.084
burn+0.080
L+0.076
Neg logit contributions
 obstacles-0.113
 galaxies-0.111
 spear-0.108
iles-0.108
 warrior-0.106
Token to
Token ID284
Feature activation-1.38e+00

Pos logit contributions
 paramed+0.160
 chip+0.155
burn+0.144
 Sn+0.143
 Lily+0.142
Neg logit contributions
 galaxies-0.254
 obstacles-0.248
iles-0.246
 spear-0.242
 questions-0.232
Token play
Token ID711
Feature activation-1.93e-02

Pos logit contributions
 Lily+0.040
 paramed+0.039
 chip+0.038
 Sara+0.037
 Fl+0.035
Neg logit contributions
iles-0.045
 galaxies-0.045
 spear-0.044
 obstacles-0.043
 warrior-0.042
Token in
Token ID287
Feature activation+1.87e+00

Pos logit contributions
 chip+0.000
 paramed+0.000
burn+0.000
 Lily+0.000
 Sara+0.000
Neg logit contributions
 galaxies-0.001
 spear-0.001
iles-0.001
 obstacles-0.001
 warrior-0.001
Token the
Token ID262
Feature activation+9.45e-02

Pos logit contributions
 galaxies+0.087
 obstacles+0.080
iles+0.080
 spear+0.079
 cliffs+0.077
Neg logit contributions
 paramed-0.037
 chip-0.036
burn-0.034
 Lily-0.033
 Sara-0.031
Token park
Token ID3952
Feature activation-1.78e-01

Pos logit contributions
 galaxies+0.003
iles+0.003
 spear+0.003
 obstacles+0.003
 warrior+0.003
Neg logit contributions
 Lily-0.003
 chip-0.003
 Sara-0.003
 paramed-0.003
 Wh-0.003
Token practiced
Token ID19893
Feature activation-1.61e-01

Pos logit contributions
 Lily+0.010
 chip+0.009
 Sara+0.009
 Fl+0.008
 Wh+0.008
Neg logit contributions
 galaxies-0.017
iles-0.017
 spear-0.016
 obstacles-0.016
 warrior-0.015
Token everyday
Token ID10908
Feature activation+1.01e+00

Pos logit contributions
 Lily+0.004
 chip+0.003
 v+0.003
 Sara+0.003
 Fl+0.003
Neg logit contributions
iles-0.007
 galaxies-0.007
 spear-0.007
 warrior-0.007
 obstacle-0.007
Token until
Token ID1566
Feature activation-2.09e+00

Pos logit contributions
 galaxies+0.048
iles+0.046
 spear+0.044
 obstacles+0.043
 warrior+0.043
Neg logit contributions
 Lily-0.021
 chip-0.019
 Sara-0.018
 Fl-0.017
 Wh-0.016
Token he
Token ID339
Feature activation-1.34e+00

Pos logit contributions
 Lily+0.061
 chip+0.059
 Sara+0.057
 paramed+0.057
 Fl+0.055
Neg logit contributions
 galaxies-0.080
iles-0.073
 spear-0.072
 obstacles-0.071
 warrior-0.070
Token was
Token ID373
Feature activation-1.93e+00

Pos logit contributions
 Lily+0.033
 chip+0.029
 Sara+0.028
 Fl+0.027
 Wh+0.026
Neg logit contributions
 galaxies-0.060
iles-0.056
 spear-0.056
 obstacles-0.054
 warrior-0.053
Token able
Token ID1498
Feature activation-6.41e+00

Pos logit contributions
 chip+0.052
 Wh+0.047
 Lily+0.047
 paramed+0.046
 Fl+0.046
Neg logit contributions
 galaxies-0.078
 warrior-0.071
 spear-0.070
iles-0.070
 armor-0.070
Token to
Token ID284
Feature activation-1.29e+00

Pos logit contributions
 Lily+0.128
 paramed+0.127
 chip+0.127
 Sara+0.116
 Wh+0.115
Neg logit contributions
 galaxies-0.299
 spear-0.282
 obstacles-0.279
iles-0.278
 warrior-0.265
Token print
Token ID3601
Feature activation-1.09e+00

Pos logit contributions
 Lily+0.032
 chip+0.029
 Sara+0.028
 paramed+0.027
 Wh+0.026
Neg logit contributions
 galaxies-0.057
iles-0.054
 spear-0.052
 obstacles-0.051
 warrior-0.050
Token all
Token ID477
Feature activation+3.09e+00

Pos logit contributions
 Lily+0.025
 Sara+0.021
 chip+0.020
 Fl+0.019
 v+0.018
Neg logit contributions
 galaxies-0.050
iles-0.049
 spear-0.047
 warrior-0.046
 obstacles-0.045
Token of
Token ID286
Feature activation-2.27e+00

Pos logit contributions
 galaxies+0.135
iles+0.125
 spear+0.122
 obstacles+0.121
 warrior+0.117
Neg logit contributions
 Lily-0.073
 chip-0.073
 paramed-0.072
 Sara-0.069
 Wh-0.068
Token the
Token ID262
Feature activation+7.04e-01

Pos logit contributions
 Lily+0.062
 Sara+0.055
 chip+0.054
 Fl+0.052
 Wh+0.050
Neg logit contributions
 galaxies-0.096
iles-0.092
 spear-0.087
 warrior-0.086
 obstacles-0.085
Token and
Token ID290
Feature activation-3.62e-01

Pos logit contributions
 chip+0.054
 Lily+0.053
 paramed+0.051
 Sara+0.049
burn+0.049
Neg logit contributions
 galaxies-0.087
 spear-0.078
 obstacles-0.078
iles-0.077
 warrior-0.074
Token the
Token ID262
Feature activation-5.28e-01

Pos logit contributions
 chip+0.010
burn+0.009
 paramed+0.009
stand+0.008
 coffee+0.008
Neg logit contributions
 galaxies-0.015
 obstacles-0.013
 spear-0.013
 cities-0.012
 planets-0.012
Token little
Token ID1310
Feature activation-2.96e+00

Pos logit contributions
 Lily+0.016
 chip+0.014
 Sara+0.014
 Fl+0.014
 paramed+0.013
Neg logit contributions
 galaxies-0.020
iles-0.019
 spear-0.018
 obstacles-0.018
 warrior-0.018
Token girl
Token ID2576
Feature activation-2.43e+00

Pos logit contributions
 Lily+0.053
 chip+0.052
 paramed+0.049
 Sara+0.048
 Wh+0.046
Neg logit contributions
 galaxies-0.147
iles-0.136
 spear-0.135
 warrior-0.133
 obstacles-0.132
Token was
Token ID373
Feature activation-1.23e+00

Pos logit contributions
 Lily+0.076
 chip+0.073
 Sara+0.070
 paramed+0.069
burn+0.069
Neg logit contributions
 galaxies-0.088
iles-0.079
 spear-0.079
 obstacles-0.077
 warrior-0.076
Token able
Token ID1498
Feature activation-6.41e+00

Pos logit contributions
 chip+0.037
 Lily+0.034
burn+0.033
 Wh+0.032
 paramed+0.031
Neg logit contributions
 galaxies-0.045
 obstacles-0.040
 armor-0.040
 armour-0.040
 spear-0.040
Token to
Token ID284
Feature activation-1.50e+00

Pos logit contributions
 chip+0.164
 paramed+0.159
 Lily+0.150
burn+0.145
 Wh+0.143
Neg logit contributions
 galaxies-0.261
 obstacles-0.250
 spear-0.244
iles-0.228
 tirelessly-0.225
Token get
Token ID651
Feature activation-2.22e+00

Pos logit contributions
 Lily+0.045
 chip+0.043
 Sara+0.041
 paramed+0.041
 Fl+0.039
Neg logit contributions
 galaxies-0.056
 spear-0.052
iles-0.052
 warrior-0.050
 obstacles-0.050
Token out
Token ID503
Feature activation-6.26e-01

Pos logit contributions
 chip+0.059
 paramed+0.053
burn+0.050
stand+0.050
 Lily+0.049
Neg logit contributions
 galaxies-0.086
 obstacles-0.081
 spear-0.080
 armor-0.077
iles-0.076
Token.
Token ID13
Feature activation-7.92e-01

Pos logit contributions
 chip+0.015
 Lily+0.015
 paramed+0.014
 Sara+0.013
 Wh+0.013
Neg logit contributions
 galaxies-0.027
iles-0.025
 spear-0.024
 obstacles-0.024
 warrior-0.024
Token
Token ID220
Feature activation+1.74e+00

Pos logit contributions
 Lily+0.021
 chip+0.021
 paramed+0.020
burn+0.019
 Sara+0.019
Neg logit contributions
 galaxies-0.032
iles-0.029
 spear-0.028
 obstacles-0.028
 warrior-0.028
Token and
Token ID290
Feature activation-1.22e-02

Pos logit contributions
 chip+0.017
 paramed+0.017
burn+0.017
 Lily+0.016
gan+0.015
Neg logit contributions
 galaxies-0.024
 obstacles-0.022
 spear-0.022
 armor-0.021
iles-0.021
Token soon
Token ID2582
Feature activation-3.09e+00

Pos logit contributions
 paramed+0.000
 chip+0.000
ula+0.000
burn+0.000
S+0.000
Neg logit contributions
 galaxies-0.000
 spear-0.000
 obstacles-0.000
 hops-0.000
 armor-0.000
Token,
Token ID11
Feature activation-4.93e-01

Pos logit contributions
 paramed+0.090
 chip+0.086
burn+0.081
tr+0.073
stand+0.072
Neg logit contributions
 galaxies-0.123
 obstacles-0.108
 hops-0.106
 goals-0.105
 islands-0.104
Token it
Token ID340
Feature activation-2.66e+00

Pos logit contributions
 paramed+0.012
 chip+0.011
burn+0.011
ird+0.010
nes+0.010
Neg logit contributions
 galaxies-0.021
 obstacles-0.019
 spear-0.019
 cities-0.018
 warrior-0.018
Token was
Token ID373
Feature activation-1.41e+00

Pos logit contributions
 Lily+0.090
 Sara+0.089
 chip+0.087
 paramed+0.083
 coffee+0.082
Neg logit contributions
 galaxies-0.089
iles-0.077
 spear-0.076
 obstacles-0.076
 warrior-0.074
Token able
Token ID1498
Feature activation-6.41e+00

Pos logit contributions
 chip+0.036
 paramed+0.036
 Lily+0.034
 Sara+0.032
L+0.031
Neg logit contributions
 galaxies-0.059
 spear-0.053
 armor-0.052
 obstacles-0.052
 armour-0.052
Token to
Token ID284
Feature activation-1.10e+00

Pos logit contributions
 paramed+0.186
ird+0.177
 chip+0.167
stand+0.161
 Sn+0.160
Neg logit contributions
 galaxies-0.243
 obstacles-0.235
 spear-0.227
iles-0.215
 cliffs-0.215
Token fly
Token ID6129
Feature activation-1.99e+00

Pos logit contributions
 Lily+0.031
 chip+0.029
 Sara+0.028
 paramed+0.028
 Fl+0.026
Neg logit contributions
 galaxies-0.043
iles-0.040
 spear-0.040
 warrior-0.038
 obstacles-0.038
Token again
Token ID757
Feature activation+8.31e-01

Pos logit contributions
 Lily+0.045
 chip+0.043
 paramed+0.040
 Sara+0.040
 Wh+0.037
Neg logit contributions
 galaxies-0.091
iles-0.085
 spear-0.083
 obstacles-0.082
 warrior-0.081
Token.
Token ID13
Feature activation-4.91e-01

Pos logit contributions
 galaxies+0.034
 spear+0.031
iles+0.031
 obstacles+0.031
 warrior+0.030
Neg logit contributions
 paramed-0.022
 Lily-0.021
 chip-0.021
 Sara-0.020
 Wh-0.019
Token
Token ID198
Feature activation-8.99e-01

Pos logit contributions
burn+0.019
 paramed+0.016
less+0.016
 oh+0.015
ter+0.015
Neg logit contributions
 galaxies-0.012
Nice-0.011
 Everywhere-0.011
 goals-0.010
 spear-0.010
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
 Lily+0.002
 chip+0.002
 paramed+0.002
 Sara+0.001
burn+0.001
Neg logit contributions
 galaxies-0.002
 spear-0.002
 obstacles-0.002
 warrior-0.002
iles-0.002
Token then
Token ID788
Feature activation-1.72e+00

Pos logit contributions
 chip+0.042
 paramed+0.041
 Lily+0.040
burn+0.039
 Sara+0.038
Neg logit contributions
 galaxies-0.076
 obstacles-0.068
 spear-0.067
iles-0.065
 islands-0.065
Token they
Token ID484
Feature activation-2.94e+00

Pos logit contributions
 chip+0.040
 Lily+0.040
 paramed+0.038
 Sara+0.037
burn+0.035
Neg logit contributions
 galaxies-0.076
 spear-0.068
 obstacles-0.068
iles-0.066
 warrior-0.065
Token were
Token ID547
Feature activation-2.07e+00

Pos logit contributions
 Lily+0.093
 chip+0.092
 Sara+0.087
 paramed+0.085
 Wh+0.082
Neg logit contributions
 galaxies-0.109
iles-0.098
 spear-0.097
 warrior-0.093
 obstacles-0.092
Token finally
Token ID3443
Feature activation-3.97e+00

Pos logit contributions
 chip+0.059
 Lily+0.057
 Wh+0.053
L+0.052
 paramed+0.052
Neg logit contributions
 galaxies-0.078
 obstacles-0.069
 armour-0.069
 spear-0.069
 armor-0.068
Token able
Token ID1498
Feature activation-6.41e+00

Pos logit contributions
 chip+0.125
 Lily+0.112
 Sara+0.107
 paramed+0.105
 Wh+0.104
Neg logit contributions
 galaxies-0.155
 spear-0.141
 armor-0.139
 islands-0.137
 armour-0.137
Token to
Token ID284
Feature activation-1.30e+00

Pos logit contributions
 paramed+0.162
 chip+0.161
 Lily+0.149
ird+0.143
 Wh+0.142
Neg logit contributions
 galaxies-0.265
 obstacles-0.248
 spear-0.244
iles-0.239
 islands-0.235
Token leave
Token ID2666
Feature activation-2.41e+00

Pos logit contributions
 Lily+0.038
 chip+0.036
 Sara+0.035
 paramed+0.034
 Fl+0.033
Neg logit contributions
 galaxies-0.051
iles-0.047
 spear-0.046
 obstacles-0.045
 warrior-0.045
Token.
Token ID13
Feature activation-7.39e-01

Pos logit contributions
 chip+0.050
 Lily+0.048
 paramed+0.048
burn+0.044
 Sara+0.043
Neg logit contributions
 galaxies-0.111
iles-0.103
 spear-0.102
 obstacles-0.102
 islands-0.098
Token From
Token ID3574
Feature activation-8.04e-01

Pos logit contributions
 chip+0.020
 Lily+0.020
 paramed+0.019
burn+0.019
 v+0.019
Neg logit contributions
 galaxies-0.029
iles-0.026
 spear-0.026
 obstacles-0.025
 warrior-0.025
Token that
Token ID326
Feature activation-2.06e+00

Pos logit contributions
 Lily+0.018
 chip+0.016
 Sara+0.016
 Fl+0.015
 paramed+0.015
Neg logit contributions
 galaxies-0.038
iles-0.036
 spear-0.035
 obstacles-0.034
 warrior-0.034
Token the
Token ID262
Feature activation+3.53e-01

Pos logit contributions
 galaxies+0.018
iles+0.017
 spear+0.016
 obstacles+0.016
 warrior+0.016
Neg logit contributions
 Lily-0.014
 chip-0.013
 Sara-0.013
 Fl-0.012
 Wh-0.012
Token box
Token ID3091
Feature activation-8.80e-01

Pos logit contributions
 galaxies+0.014
 spear+0.013
 planets+0.013
 armor+0.012
 obstacles+0.012
Neg logit contributions
 paramed-0.009
 Lily-0.008
 Su-0.008
burn-0.008
 chip-0.008
Token on
Token ID319
Feature activation-4.02e-03

Pos logit contributions
 chip+0.021
 paramed+0.020
 Lily+0.020
 Mitt+0.019
 Wh+0.019
Neg logit contributions
 galaxies-0.038
iles-0.034
 spear-0.034
 obstacles-0.033
 planets-0.033
Token her
Token ID607
Feature activation-3.13e+00

Pos logit contributions
 paramed+0.000
burn+0.000
 chip+0.000
 okay+0.000
less+0.000
Neg logit contributions
 galaxies-0.000
 planets-0.000
 obstacles-0.000
 armor-0.000
 spear-0.000
Token night
Token ID1755
Feature activation-2.34e+00

Pos logit contributions
 paramed+0.093
burn+0.083
 Su+0.081
 Ph+0.081
 Gem+0.080
Neg logit contributions
 galaxies-0.114
 spear-0.112
 armor-0.111
 armour-0.108
 planets-0.105
Tokenstand
Token ID1481
Feature activation-6.40e+00

Pos logit contributions
 oh+0.065
 paramed+0.064
 Twe+0.061
 okay+0.061
 Em+0.061
Neg logit contributions
 galaxies-0.087
 reefs-0.079
 planets-0.079
 forests-0.077
 valleys-0.074
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 paramed+0.224
 chip+0.217
 oh+0.214
 Em+0.208
 Ph+0.207
Neg logit contributions
 galaxies-0.178
 goals-0.168
ines-0.167
iles-0.160
 spear-0.154
Token She
Token ID1375
Feature activation+8.04e-01

Pos logit contributions
 chip+0.031
 paramed+0.031
burn+0.030
nes+0.028
 v+0.028
Neg logit contributions
 galaxies-0.036
 spear-0.030
 planets-0.029
 goals-0.029
 obstacles-0.028
Token hopes
Token ID7176
Feature activation+5.11e-02

Pos logit contributions
 galaxies+0.030
iles+0.029
 obstacles+0.028
eways+0.028
 spear+0.028
Neg logit contributions
 Lily-0.026
 Sara-0.024
 Fl-0.024
 chip-0.023
 milk-0.022
Token to
Token ID284
Feature activation+9.26e-02

Pos logit contributions
 galaxies+0.002
 questions+0.002
 planets+0.002
 islands+0.002
 mountains+0.002
Neg logit contributions
 paramed-0.001
 chip-0.001
burn-0.001
ird-0.001
nes-0.001
Token mail
Token ID6920
Feature activation-1.50e+00

Pos logit contributions
 galaxies+0.004
iles+0.003
 spear+0.003
 warrior+0.003
 obstacles+0.003
Neg logit contributions
 Lily-0.003
 chip-0.003
 paramed-0.003
 Sara-0.002
 Wh-0.002
Token and
Token ID290
Feature activation-6.72e-01

Pos logit contributions
 paramed+0.019
 Lily+0.018
 chip+0.018
 Sara+0.017
 Mitt+0.017
Neg logit contributions
 galaxies-0.019
 spear-0.018
iles-0.017
 obstacles-0.017
 armor-0.016
Token she
Token ID673
Feature activation-7.08e-01

Pos logit contributions
 chip+0.018
 Lily+0.018
 paramed+0.017
burn+0.017
 Sara+0.016
Neg logit contributions
 galaxies-0.026
 spear-0.025
 obstacles-0.024
iles-0.024
 warrior-0.023
Token will
Token ID481
Feature activation-1.99e+00

Pos logit contributions
 Lily+0.021
 Sara+0.019
 Fl+0.019
 v+0.018
 Wh+0.018
Neg logit contributions
 galaxies-0.028
iles-0.026
 spear-0.025
 warrior-0.025
 obstacles-0.024
Token not
Token ID407
Feature activation-3.15e+00

Pos logit contributions
 chip+0.063
 paramed+0.060
burn+0.058
 Lily+0.058
 coffee+0.057
Neg logit contributions
 galaxies-0.065
 spear-0.062
 warrior-0.060
 obstacles-0.059
 armor-0.058
Token be
Token ID307
Feature activation-1.47e+00

Pos logit contributions
 chip+0.099
 Lily+0.098
 paramed+0.097
 Sara+0.092
 Fl+0.089
Neg logit contributions
 galaxies-0.106
 spear-0.102
iles-0.100
 warrior-0.099
 obstacles-0.097
Token able
Token ID1498
Feature activation-6.40e+00

Pos logit contributions
 chip+0.050
burn+0.043
 paramed+0.043
 Daughter+0.041
 milk+0.040
Neg logit contributions
 galaxies-0.047
 armor-0.047
 obstacles-0.045
iles-0.044
 armour-0.044
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
 paramed+0.187
 chip+0.174
burn+0.167
 Daughter+0.160
ird+0.158
Neg logit contributions
 obstacles-0.223
 galaxies-0.223
iles-0.218
 spear-0.212
 questions-0.212
Token come
Token ID1282
Feature activation-2.42e+00

Pos logit contributions
 chip+0.032
 Lily+0.032
 paramed+0.032
 Sara+0.031
burn+0.030
Neg logit contributions
 galaxies-0.034
 spear-0.032
 obstacles-0.032
iles-0.032
 warrior-0.031
Token back
Token ID736
Feature activation+3.13e-01

Pos logit contributions
 chip+0.056
 Lily+0.054
 paramed+0.053
 Sara+0.049
 Fl+0.047
Neg logit contributions
 galaxies-0.106
iles-0.098
 spear-0.097
 obstacles-0.095
 warrior-0.093
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
 galaxies+0.012
iles+0.011
 obstacles+0.011
 spear+0.011
 islands+0.011
Neg logit contributions
 paramed-0.008
 chip-0.008
 Lily-0.007
burn-0.007
stand-0.007
Token
Token ID198
Feature activation-1.39e+00

Pos logit contributions
 Lily+0.032
 chip+0.030
 Sara+0.028
 paramed+0.028
burn+0.027
Neg logit contributions
 galaxies-0.053
iles-0.050
 spear-0.049
 obstacles-0.048
 warrior-0.047
Token and
Token ID290
Feature activation-1.44e+00

Pos logit contributions
 galaxies+0.003
 spear+0.002
iles+0.002
 obstacles+0.002
 warrior+0.002
Neg logit contributions
 Lily-0.002
 chip-0.002
 paramed-0.002
 Sara-0.002
 Wh-0.002
Token soon
Token ID2582
Feature activation-3.48e+00

Pos logit contributions
 chip+0.036
 paramed+0.035
burn+0.035
stand+0.035
 Lily+0.034
Neg logit contributions
 galaxies-0.055
 spear-0.052
 obstacles-0.049
iles-0.048
 warrior-0.048
Token Tim
Token ID5045
Feature activation-2.69e+00

Pos logit contributions
 chip+0.096
 paramed+0.095
burn+0.093
 Sn+0.088
 Wh+0.087
Neg logit contributions
 galaxies-0.132
 spear-0.118
iles-0.118
 obstacles-0.116
 islands-0.114
Tokenmy
Token ID1820
Feature activation-1.44e+00

Pos logit contributions
 Lily+0.084
 chip+0.080
 paramed+0.079
 Sara+0.079
 Wh+0.074
Neg logit contributions
 galaxies-0.092
iles-0.090
 spear-0.085
 warrior-0.081
 obstacles-0.081
Token was
Token ID373
Feature activation-1.52e+00

Pos logit contributions
 Lily+0.041
 chip+0.039
 Sara+0.039
 paramed+0.038
 Wh+0.037
Neg logit contributions
 galaxies-0.054
iles-0.050
 spear-0.049
 obstacles-0.048
 warrior-0.048
Token able
Token ID1498
Feature activation-6.33e+00

Pos logit contributions
 chip+0.045
 Wh+0.042
 Lily+0.041
L+0.039
 Sara+0.039
Neg logit contributions
 galaxies-0.050
 armor-0.044
 tirelessly-0.043
 obstacles-0.043
 warrior-0.043
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
 paramed+0.153
 chip+0.150
 Lily+0.149
ird+0.144
 Wh+0.141
Neg logit contributions
 galaxies-0.253
 spear-0.246
 obstacles-0.242
iles-0.232
 goals-0.222
Token support
Token ID1104
Feature activation+9.82e-02

Pos logit contributions
 Lily+0.027
 chip+0.025
 Sara+0.025
 paramed+0.025
 Wh+0.024
Neg logit contributions
 galaxies-0.039
iles-0.037
 spear-0.037
 warrior-0.036
 obstacles-0.035
Token the
Token ID262
Feature activation+3.66e-01

Pos logit contributions
 galaxies+0.003
iles+0.003
 spear+0.003
 obstacles+0.003
 warrior+0.003
Neg logit contributions
 Lily-0.003
 paramed-0.003
 chip-0.003
burn-0.003
 Sara-0.003
Token cup
Token ID6508
Feature activation-2.42e+00

Pos logit contributions
 obstacles+0.015
 galaxies+0.015
iles+0.015
 obstacle+0.015
 questions+0.014
Neg logit contributions
burn-0.007
nes-0.006
 Wh-0.006
 Twe-0.006
ird-0.006
Token all
Token ID477
Feature activation+1.84e+00

Pos logit contributions
 paramed+0.064
 Wh+0.062
burn+0.062
Pe+0.061
 Lily+0.061
Neg logit contributions
iles-0.082
 galaxies-0.082
 spear-0.079
 boulder-0.075
 obstacles-0.074
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
 Lily+0.092
 chip+0.088
 Sara+0.085
 paramed+0.084
 Fl+0.080
Neg logit contributions
 galaxies-0.151
iles-0.140
 spear-0.139
 obstacles-0.136
 warrior-0.134
Token and
Token ID290
Feature activation-1.92e+00

Pos logit contributions
 Lily+0.003
 chip+0.003
 paramed+0.003
 Sara+0.003
burn+0.003
Neg logit contributions
 galaxies-0.004
 spear-0.004
 obstacles-0.004
iles-0.004
 warrior-0.004
Token the
Token ID262
Feature activation-1.53e+00

Pos logit contributions
 chip+0.048
 paramed+0.047
burn+0.047
 Lily+0.045
 Sara+0.044
Neg logit contributions
 galaxies-0.082
 spear-0.073
 obstacles-0.072
 warrior-0.072
iles-0.070
Token tourist
Token ID18473
Feature activation-2.67e+00

Pos logit contributions
 Lily+0.047
 Sara+0.041
 chip+0.040
 v+0.039
 Fl+0.039
Neg logit contributions
 galaxies-0.059
iles-0.057
 spear-0.053
 obstacles-0.052
 feats-0.051
Token was
Token ID373
Feature activation-1.10e+00

Pos logit contributions
 Lily+0.077
 chip+0.074
 Sara+0.072
 paramed+0.069
 Fl+0.068
Neg logit contributions
 galaxies-0.104
iles-0.095
 spear-0.094
 warrior-0.092
 obstacles-0.092
Token able
Token ID1498
Feature activation-6.32e+00

Pos logit contributions
 chip+0.035
 Lily+0.031
stand+0.031
 coffee+0.031
 Fl+0.030
Neg logit contributions
 galaxies-0.035
 armor-0.032
 spear-0.032
 armour-0.032
 tirelessly-0.031
Token to
Token ID284
Feature activation-1.18e+00

Pos logit contributions
 paramed+0.142
 chip+0.141
 Lily+0.137
burn+0.130
 Sara+0.128
Neg logit contributions
 galaxies-0.274
 spear-0.264
 obstacles-0.260
iles-0.256
 warrior-0.244
Token finish
Token ID5461
Feature activation-2.89e+00

Pos logit contributions
 Lily+0.032
 chip+0.030
 Sara+0.029
 paramed+0.028
 Fl+0.028
Neg logit contributions
 galaxies-0.048
iles-0.045
 spear-0.044
 obstacles-0.043
 warrior-0.043
Token his
Token ID465
Feature activation-2.49e+00

Pos logit contributions
 Lily+0.063
 chip+0.057
 Sara+0.056
 Fl+0.052
 Wh+0.051
Neg logit contributions
 galaxies-0.135
iles-0.129
 spear-0.124
 warrior-0.122
 obstacles-0.122
Token tour
Token ID4205
Feature activation-2.10e+00

Pos logit contributions
 Lily+0.085
 chip+0.080
 Sara+0.078
 paramed+0.077
 v+0.076
Neg logit contributions
 galaxies-0.085
iles-0.077
 spear-0.074
 warrior-0.073
 obstacles-0.072
Token happily
Token ID18177
Feature activation+6.17e-02

Pos logit contributions
 Lily+0.054
 chip+0.053
 Sara+0.052
 paramed+0.051
 Fl+0.049
Neg logit contributions
 galaxies-0.086
 spear-0.078
iles-0.078
 obstacles-0.077
 warrior-0.076
Token then
Token ID788
Feature activation-1.45e+00

Pos logit contributions
 galaxies+0.004
 obstacles+0.004
 spear+0.004
iles+0.004
 warrior+0.004
Neg logit contributions
 paramed-0.002
 chip-0.002
burn-0.002
 Lily-0.002
 Fl-0.002
Token people
Token ID661
Feature activation-2.58e+00

Pos logit contributions
 chip+0.031
 paramed+0.030
 Lily+0.029
burn+0.029
 Fl+0.027
Neg logit contributions
 galaxies-0.065
iles-0.061
 spear-0.060
 obstacles-0.060
 warrior-0.059
Token wouldn
Token ID3636
Feature activation-1.15e+00

Pos logit contributions
 Lily+0.074
 chip+0.070
 Sara+0.068
 paramed+0.065
 Fl+0.065
Neg logit contributions
 galaxies-0.101
iles-0.095
 spear-0.092
 obstacles-0.090
 warrior-0.089
Token't
Token ID470
Feature activation-2.97e+00

Pos logit contributions
 Lily+0.025
 chip+0.023
 Sara+0.022
 paramed+0.021
 Fl+0.020
Neg logit contributions
 galaxies-0.052
iles-0.049
 spear-0.048
 obstacles-0.047
 warrior-0.047
Token be
Token ID307
Feature activation-1.48e+00

Pos logit contributions
 chip+0.079
 Lily+0.078
 Sara+0.075
 Fl+0.075
 paramed+0.075
Neg logit contributions
 galaxies-0.110
 spear-0.107
iles-0.105
 obstacles-0.103
 warrior-0.102
Token able
Token ID1498
Feature activation-6.30e+00

Pos logit contributions
 chip+0.044
 Fl+0.039
 Sn+0.039
stand+0.038
Fl+0.037
Neg logit contributions
 galaxies-0.051
 obstacles-0.051
iles-0.050
 spear-0.048
 armor-0.048
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
 paramed+0.153
 chip+0.149
burn+0.138
 Lily+0.137
 Sn+0.134
Neg logit contributions
 galaxies-0.258
 obstacles-0.251
 spear-0.247
iles-0.243
 obstacle-0.230
Token get
Token ID651
Feature activation-2.12e+00

Pos logit contributions
 Lily+0.029
 chip+0.027
 Sara+0.027
 paramed+0.026
 Fl+0.026
Neg logit contributions
 galaxies-0.045
iles-0.043
 spear-0.042
 obstacles-0.041
 warrior-0.041
Token the
Token ID262
Feature activation+4.48e-01

Pos logit contributions
 chip+0.057
 paramed+0.053
stand+0.050
burn+0.049
L+0.048
Neg logit contributions
 galaxies-0.079
 obstacles-0.075
 spear-0.073
iles-0.073
 armor-0.070
Token help
Token ID1037
Feature activation+2.42e-01

Pos logit contributions
 galaxies+0.015
 obstacles+0.015
 obstacle+0.014
 questions+0.014
iles+0.013
Neg logit contributions
 Lily-0.013
 chip-0.013
 Sara-0.012
burn-0.012
S-0.012
Token they
Token ID484
Feature activation-1.68e+00

Pos logit contributions
 galaxies+0.009
 obstacles+0.008
 obstacle+0.008
 spear+0.008
iles+0.008
Neg logit contributions
 paramed-0.006
 Sara-0.006
 chip-0.006
 Lily-0.006
 Wh-0.006
Token longer
Token ID2392
Feature activation-2.65e+00

Pos logit contributions
 chip+0.103
 Lily+0.102
 paramed+0.099
 Wh+0.097
 Sara+0.095
Neg logit contributions
 galaxies-0.103
iles-0.095
 obstacles-0.092
 warrior-0.091
 spear-0.089
Token weak
Token ID4939
Feature activation-2.04e+00

Pos logit contributions
 chip+0.090
 Lily+0.085
 Wh+0.084
burn+0.082
 paramed+0.081
Neg logit contributions
 galaxies-0.092
 obstacles-0.080
 warrior-0.080
 spear-0.076
 obstacle-0.074
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
 Lily+0.055
 chip+0.053
 paramed+0.053
 Sara+0.051
 Wh+0.050
Neg logit contributions
 galaxies-0.083
 obstacles-0.075
iles-0.074
 warrior-0.074
 spear-0.074
Token she
Token ID673
Feature activation-2.07e+00

Pos logit contributions
 chip+0.026
burn+0.026
 paramed+0.025
 Lily+0.023
stand+0.023
Neg logit contributions
 galaxies-0.045
 obstacles-0.039
 spear-0.038
 warrior-0.038
iles-0.037
Token was
Token ID373
Feature activation-1.36e+00

Pos logit contributions
 Lily+0.060
 chip+0.056
 Sara+0.055
 paramed+0.053
 Fl+0.052
Neg logit contributions
 galaxies-0.081
iles-0.076
 spear-0.074
 obstacles-0.072
 warrior-0.072
Token able
Token ID1498
Feature activation-6.29e+00

Pos logit contributions
 chip+0.043
 Wh+0.039
 paramed+0.038
 Lily+0.037
L+0.037
Neg logit contributions
 galaxies-0.045
 warrior-0.040
 obstacles-0.038
 goals-0.038
 spear-0.038
Token to
Token ID284
Feature activation-1.35e+00

Pos logit contributions
 paramed+0.169
 chip+0.166
 Lily+0.154
burn+0.152
 Sn+0.151
Neg logit contributions
 galaxies-0.245
 obstacles-0.232
 spear-0.227
iles-0.216
 questions-0.215
Token go
Token ID467
Feature activation-1.72e+00

Pos logit contributions
 Lily+0.037
 chip+0.035
 Sara+0.034
 paramed+0.032
 Fl+0.032
Neg logit contributions
 galaxies-0.055
iles-0.052
 spear-0.051
 warrior-0.049
 obstacles-0.049
Token out
Token ID503
Feature activation-1.28e+00

Pos logit contributions
 Lily+0.033
 chip+0.030
 Sara+0.029
 paramed+0.028
 Fl+0.027
Neg logit contributions
 galaxies-0.084
iles-0.080
 spear-0.078
 obstacles-0.076
 warrior-0.076
Token and
Token ID290
Feature activation-6.78e-01

Pos logit contributions
 Lily+0.029
 chip+0.029
 paramed+0.027
 Sara+0.026
 Wh+0.026
Neg logit contributions
 galaxies-0.058
iles-0.053
 obstacles-0.052
 spear-0.051
 warrior-0.050
Token explore
Token ID7301
Feature activation+5.62e-01

Pos logit contributions
 chip+0.021
 paramed+0.021
 Lily+0.020
 Sara+0.020
burn+0.019
Neg logit contributions
 galaxies-0.024
 spear-0.022
 warrior-0.021
 obstacles-0.021
iles-0.020
Token and
Token ID290
Feature activation-1.41e+00

Pos logit contributions
 galaxies+0.009
iles+0.008
 spear+0.008
 obstacles+0.008
 warrior+0.008
Neg logit contributions
 Lily-0.005
 chip-0.005
 Sara-0.004
 paramed-0.004
 Fl-0.004
Token sniff
Token ID26300
Feature activation+1.73e+00

Pos logit contributions
 chip+0.032
 Lily+0.031
 paramed+0.030
burn+0.029
 Sara+0.028
Neg logit contributions
 galaxies-0.060
 spear-0.056
iles-0.056
 obstacles-0.054
 warrior-0.054
Token all
Token ID477
Feature activation+3.09e+00

Pos logit contributions
 galaxies+0.058
iles+0.055
 spear+0.052
 obstacles+0.051
 warrior+0.049
Neg logit contributions
 Lily-0.058
 chip-0.057
 paramed-0.055
 Sara-0.054
 Wh-0.053
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 galaxies+0.141
iles+0.135
 spear+0.129
 obstacles+0.128
 obstacle+0.124
Neg logit contributions
 chip-0.064
 paramed-0.063
 Lily-0.062
 Wh-0.059
burn-0.057
Token smells
Token ID25760
Feature activation-8.82e-02

Pos logit contributions
 galaxies+0.040
iles+0.037
 spear+0.036
 warrior+0.035
 obstacles+0.035
Neg logit contributions
 Lily-0.029
 chip-0.027
 Sara-0.026
 Fl-0.025
 v-0.024
Token in
Token ID287
Feature activation+3.39e+00

Pos logit contributions
 Lily+0.002
 Wh+0.002
 paramed+0.002
 chip+0.002
burn+0.002
Neg logit contributions
 galaxies-0.004
iles-0.003
 spear-0.003
 obstacles-0.003
 obstacle-0.003
Token the
Token ID262
Feature activation+1.47e+00

Pos logit contributions
 galaxies+0.161
iles+0.151
 spear+0.150
 obstacles+0.149
 warrior+0.143
Neg logit contributions
 Lily-0.062
 chip-0.062
 paramed-0.060
burn-0.057
 Wh-0.055
Token park
Token ID3952
Feature activation+4.04e-01

Pos logit contributions
 galaxies+0.060
iles+0.057
 spear+0.054
 obstacles+0.054
 warrior+0.053
Neg logit contributions
 Lily-0.040
 chip-0.037
 Sara-0.036
 paramed-0.035
 Fl-0.034
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
 galaxies+0.015
iles+0.014
 spear+0.014
 obstacles+0.014
 islands+0.013
Neg logit contributions
 paramed-0.011
 Lily-0.011
 chip-0.010
 Wh-0.010
burn-0.010
Token One
Token ID1881
Feature activation+3.23e+00

Pos logit contributions
 Lily+0.004
 chip+0.004
 paramed+0.004
burn+0.004
 Sara+0.004
Neg logit contributions
 galaxies-0.006
iles-0.006
 spear-0.005
 obstacles-0.005
 warrior-0.005
Token day
Token ID1110
Feature activation-3.97e-01

Pos logit contributions
 galaxies+0.129
iles+0.124
 spear+0.121
 obstacles+0.116
 warrior+0.114
Neg logit contributions
 Lily-0.085
 chip-0.082
 paramed-0.080
 Wh-0.078
 Sara-0.077
Token when
Token ID618
Feature activation-3.34e+00

Pos logit contributions
 galaxies+0.004
iles+0.004
 spear+0.004
 obstacles+0.004
 warrior+0.004
Neg logit contributions
 Lily-0.003
 chip-0.003
 Sara-0.003
 paramed-0.003
 Fl-0.003
Token he
Token ID339
Feature activation-4.58e-01

Pos logit contributions
 Lily+0.096
 Sara+0.085
 Fl+0.081
 Wh+0.077
 chip+0.076
Neg logit contributions
iles-0.143
 galaxies-0.141
 spear-0.127
 warrior-0.127
 facts-0.125
Token stopped
Token ID5025
Feature activation-1.43e+00

Pos logit contributions
 chip+0.014
 Lily+0.014
 paramed+0.013
 Sara+0.013
 Wh+0.013
Neg logit contributions
 galaxies-0.016
iles-0.015
 spear-0.015
 obstacles-0.014
 warrior-0.014
Token he
Token ID339
Feature activation-3.18e-02

Pos logit contributions
 Lily+0.025
 Sara+0.019
 chip+0.018
 Fl+0.018
 Mitt+0.018
Neg logit contributions
 galaxies-0.076
iles-0.074
 spear-0.070
 warrior-0.070
 facts-0.069
Token felt
Token ID2936
Feature activation+5.10e-01

Pos logit contributions
 Lily+0.001
 Sara+0.001
 Fl+0.001
 chip+0.001
 Wh+0.001
Neg logit contributions
 galaxies-0.001
iles-0.001
 spear-0.001
 warrior-0.001
 obstacles-0.001
Token g
Token ID308
Feature activation+3.54e+00

Pos logit contributions
 galaxies+0.024
iles+0.023
 spear+0.022
 obstacles+0.022
 warrior+0.021
Neg logit contributions
 Lily-0.011
 chip-0.010
 Sara-0.010
 paramed-0.009
 Fl-0.009
Tokeniddy
Token ID34208
Feature activation-1.67e+00

Pos logit contributions
 galaxies+0.115
iles+0.107
 spear+0.102
 obstacles+0.099
 warrior+0.098
Neg logit contributions
 Lily-0.127
 chip-0.121
 Sara-0.118
 paramed-0.115
 Fl-0.114
Token,
Token ID11
Feature activation+8.33e-01

Pos logit contributions
 Lily+0.046
 Sara+0.044
 chip+0.044
 Mitt+0.042
 paramed+0.042
Neg logit contributions
 galaxies-0.064
 spear-0.060
 obstacles-0.057
 warrior-0.057
 obstacle-0.056
Token but
Token ID475
Feature activation+6.93e-01

Pos logit contributions
 galaxies+0.037
iles+0.035
 spear+0.034
 obstacles+0.033
 warrior+0.033
Neg logit contributions
 Lily-0.020
 chip-0.019
 Sara-0.018
 paramed-0.017
 Fl-0.017
Token also
Token ID635
Feature activation-1.41e+00

Pos logit contributions
 galaxies+0.025
iles+0.023
 spear+0.023
 obstacles+0.022
 warrior+0.022
Neg logit contributions
 Lily-0.021
 chip-0.021
 paramed-0.020
 Sara-0.020
burn-0.019
Token something
Token ID1223
Feature activation-1.43e+00

Pos logit contributions
 Lily+0.035
 chip+0.031
 Sara+0.031
 paramed+0.029
 Fl+0.029
Neg logit contributions
 galaxies-0.062
iles-0.059
 spear-0.057
 obstacles-0.055
 warrior-0.055
Token smiled
Token ID13541
Feature activation+2.10e+00

Pos logit contributions
 galaxies+0.041
iles+0.038
 spear+0.037
 warrior+0.036
 obstacles+0.036
Neg logit contributions
 Lily-0.028
 chip-0.027
 Sara-0.026
 paramed-0.025
 Wh-0.025
Token and
Token ID290
Feature activation+1.77e+00

Pos logit contributions
 galaxies+0.093
iles+0.090
 obstacles+0.085
 spear+0.084
 warrior+0.082
Neg logit contributions
 Lily-0.054
 Sara-0.047
 chip-0.047
 okay-0.044
 Fl-0.044
Token said
Token ID531
Feature activation+9.01e-01

Pos logit contributions
 galaxies+0.065
 spear+0.065
iles+0.062
 warrior+0.060
 obstacles+0.057
Neg logit contributions
stand-0.046
burn-0.042
 paramed-0.042
 okay-0.041
 chip-0.041
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
 galaxies+0.033
iles+0.032
 spear+0.032
 warrior+0.030
 obstacles+0.029
Neg logit contributions
 chip-0.024
burn-0.024
 Lily-0.024
 paramed-0.023
 Wh-0.022
Token "
Token ID366
Feature activation+1.70e+00

Pos logit contributions
 galaxies+0.061
 spear+0.057
iles+0.056
 warrior+0.054
 obstacles+0.053
Neg logit contributions
 Lily-0.046
burn-0.045
 chip-0.044
 paramed-0.043
 Sara-0.043
TokenYes
Token ID5297
Feature activation+4.35e+00

Pos logit contributions
 galaxies+0.077
iles+0.073
 spear+0.071
 obstacles+0.069
 warrior+0.068
Neg logit contributions
 Lily-0.040
 chip-0.037
 Sara-0.035
 paramed-0.034
 Fl-0.033
Token,
Token ID11
Feature activation+2.04e+00

Pos logit contributions
iles+0.120
 spear+0.114
 galaxies+0.109
 warrior+0.107
 boulder+0.106
Neg logit contributions
burn-0.150
 paramed-0.149
 chip-0.143
stand-0.139
 perfume-0.134
Token of
Token ID286
Feature activation-2.01e+00

Pos logit contributions
 spear+0.082
 galaxies+0.082
iles+0.080
 obstacles+0.077
 warrior+0.076
Neg logit contributions
burn-0.049
 paramed-0.047
stand-0.043
 Sn-0.043
 chip-0.042
Token course
Token ID1781
Feature activation+2.56e+00

Pos logit contributions
 Lily+0.093
 chip+0.089
 Sara+0.088
 paramed+0.086
 Fl+0.085
Neg logit contributions
 galaxies-0.045
iles-0.040
 spear-0.038
 obstacles-0.036
 warrior-0.036
Token we
Token ID356
Feature activation-8.12e-01

Pos logit contributions
 galaxies+0.088
iles+0.088
 spear+0.084
 obstacles+0.080
 warrior+0.080
Neg logit contributions
 paramed-0.081
burn-0.075
 chip-0.072
stand-0.071
 Wh-0.065
Token can
Token ID460
Feature activation+4.89e-02

Pos logit contributions
 Lily+0.024
 okay+0.021
 Sara+0.021
 let+0.021
 chip+0.021
Neg logit contributions
 galaxies-0.032
iles-0.029
 obstacles-0.029
 spear-0.027
 feats-0.027
Token there
Token ID612
Feature activation-1.85e+00

Pos logit contributions
 Lily+0.034
 chip+0.033
 Sara+0.032
 paramed+0.031
 Fl+0.031
Neg logit contributions
 galaxies-0.030
iles-0.028
 spear-0.027
 obstacles-0.026
 warrior-0.026
Token was
Token ID373
Feature activation+3.63e-01

Pos logit contributions
 chip+0.054
 Lily+0.054
 paramed+0.053
burn+0.051
 Sara+0.050
Neg logit contributions
 galaxies-0.066
iles-0.062
 spear-0.060
 obstacles-0.058
 warrior-0.057
Token a
Token ID257
Feature activation-1.12e+00

Pos logit contributions
 galaxies+0.018
iles+0.017
 spear+0.017
 obstacles+0.016
 warrior+0.016
Neg logit contributions
 Lily-0.006
 chip-0.006
 Sara-0.006
 paramed-0.005
 Wh-0.005
Token duck
Token ID22045
Feature activation-3.41e-01

Pos logit contributions
 Lily+0.016
 chip+0.014
 Sara+0.013
 paramed+0.012
 Wh+0.012
Neg logit contributions
 galaxies-0.059
iles-0.056
 spear-0.056
 warrior-0.055
 obstacles-0.054
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
 Lily+0.008
 chip+0.008
 Sara+0.007
 paramed+0.007
 Fl+0.007
Neg logit contributions
 galaxies-0.015
iles-0.014
 spear-0.014
 obstacles-0.014
 warrior-0.013
Token He
Token ID679
Feature activation+3.69e+00

Pos logit contributions
 galaxies+0.012
iles+0.011
 spear+0.011
 warrior+0.010
 obstacles+0.010
Neg logit contributions
 Lily-0.008
 chip-0.008
 paramed-0.008
burn-0.008
 okay-0.007
Token was
Token ID373
Feature activation+1.17e+00

Pos logit contributions
 galaxies+0.154
iles+0.145
 obstacles+0.142
 obstacle+0.140
 warrior+0.139
Neg logit contributions
 Lily-0.112
 Fl-0.099
 Sara-0.097
 v-0.094
 Wh-0.092
Token silly
Token ID14397
Feature activation+5.70e-01

Pos logit contributions
 spear+0.035
 galaxies+0.035
iles+0.034
 armor+0.031
 armour+0.031
Neg logit contributions
ird-0.037
stand-0.037
 paramed-0.036
nes-0.035
 chip-0.034
Token and
Token ID290
Feature activation+1.51e-01

Pos logit contributions
 galaxies+0.026
iles+0.024
 spear+0.023
 obstacles+0.023
 warrior+0.023
Neg logit contributions
 Lily-0.014
 chip-0.013
 Sara-0.012
 Fl-0.012
 Wh-0.011
Token always
Token ID1464
Feature activation-1.13e-01

Pos logit contributions
 galaxies+0.005
iles+0.005
 spear+0.005
 obstacles+0.004
 warrior+0.004
Neg logit contributions
 Lily-0.005
 chip-0.005
 Sara-0.005
 paramed-0.005
 Fl-0.005
Token looked
Token ID3114
Feature activation+2.34e+00

Pos logit contributions
 Lily+0.003
 chip+0.003
 paramed+0.003
 Sara+0.003
 Wh+0.003
Neg logit contributions
 galaxies-0.004
iles-0.004
 spear-0.004
 obstacles-0.004
 warrior-0.004
Token graceful
Token ID44363
Feature activation+1.85e+00

Pos logit contributions
 galaxies+0.129
iles+0.123
 spear+0.119
 obstacles+0.116
 warrior+0.115
Neg logit contributions
 Lily-0.070
 chip-0.064
 Sara-0.061
 Fl-0.058
 paramed-0.057
Token!"
Token ID2474
Feature activation+7.56e-01

Pos logit contributions
 galaxies+0.081
iles+0.077
 spear+0.075
 obstacles+0.073
 warrior+0.073
Neg logit contributions
 Lily-0.044
 chip-0.041
 Sara-0.040
 paramed-0.039
 Wh-0.038
Token exclaimed
Token ID33552
Feature activation-2.88e-01

Pos logit contributions
 galaxies+0.029
 spear+0.027
 warrior+0.026
iles+0.026
 obstacles+0.026
Neg logit contributions
burn-0.021
 chip-0.019
 paramed-0.019
less-0.018
 v-0.018
Token Daisy
Token ID40355
Feature activation-1.85e-01

Pos logit contributions
burn+0.009
stand+0.008
 paramed+0.008
 chip+0.008
 v+0.008
Neg logit contributions
 galaxies-0.009
iles-0.009
 warrior-0.009
 spear-0.009
 obstacles-0.009
Token.
Token ID13
Feature activation+9.54e-01

Pos logit contributions
 paramed+0.006
burn+0.005
 Lily+0.005
 Wh+0.005
 Woo+0.005
Neg logit contributions
 galaxies-0.006
 spear-0.006
 warrior-0.006
 hops-0.005
 curiously-0.005
Token
Token ID220
Feature activation+3.37e+00

Pos logit contributions
 galaxies+0.040
iles+0.037
 spear+0.037
 warrior+0.036
 obstacles+0.036
Neg logit contributions
 Lily-0.023
 chip-0.022
 paramed-0.021
 Sara-0.021
burn-0.021
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
 galaxies+0.121
iles+0.116
 obstacles+0.110
 spear+0.109
 goals+0.104
Neg logit contributions
 paramed-0.092
 chip-0.091
burn-0.090
 Lily-0.090
 v-0.088
Token
Token ID198
Feature activation-1.82e-01

Pos logit contributions
 Lily+0.002
 chip+0.002
 Sara+0.002
 paramed+0.002
 Wh+0.002
Neg logit contributions
 galaxies-0.004
iles-0.004
 spear-0.004
 obstacles-0.004
 warrior-0.004
TokenS
Token ID50
Feature activation-1.58e+00

Pos logit contributions
 Lily+0.005
 chip+0.005
 Sara+0.005
 Fl+0.004
 paramed+0.004
Neg logit contributions
 galaxies-0.007
iles-0.007
 spear-0.007
 obstacles-0.007
 warrior-0.007
Tokenally
Token ID453
Feature activation-1.27e+00

Pos logit contributions
 Lily+0.045
 chip+0.042
 Sara+0.041
 Fl+0.040
 paramed+0.040
Neg logit contributions
 galaxies-0.062
iles-0.060
 spear-0.056
 obstacles-0.056
 warrior-0.055
Token smiled
Token ID13541
Feature activation+1.79e+00

Pos logit contributions
 Lily+0.036
 chip+0.034
 Sara+0.032
 Fl+0.032
 paramed+0.030
Neg logit contributions
 galaxies-0.050
iles-0.048
 spear-0.045
 obstacles-0.045
 warrior-0.044
Token sun
Token ID4252
Feature activation-5.11e-01

Pos logit contributions
 galaxies+0.060
iles+0.056
 obstacles+0.052
 spear+0.051
 warrior+0.050
Neg logit contributions
 Lily-0.046
 chip-0.044
 Sara-0.042
 v-0.040
 Fl-0.040
Tokenflower
Token ID25547
Feature activation+2.82e-01

Pos logit contributions
 Lily+0.018
 Sara+0.017
 okay+0.016
nes+0.016
 paramed+0.016
Neg logit contributions
 spear-0.015
 galaxies-0.014
iles-0.013
 planets-0.013
 armour-0.012
Token and
Token ID290
Feature activation+5.45e-01

Pos logit contributions
 spear+0.010
 galaxies+0.009
 boulder+0.009
 warrior+0.009
 armour+0.008
Neg logit contributions
 Lily-0.008
 Sara-0.008
 paramed-0.008
 Lara-0.008
 okay-0.007
Token she
Token ID673
Feature activation-5.29e-02

Pos logit contributions
 galaxies+0.022
iles+0.020
 spear+0.020
 obstacles+0.019
 warrior+0.019
Neg logit contributions
 chip-0.014
 Lily-0.014
 paramed-0.014
 Sara-0.013
 Wh-0.013
Token was
Token ID373
Feature activation+2.39e-01

Pos logit contributions
 Lily+0.002
 chip+0.002
 Sara+0.002
 paramed+0.002
 Wh+0.001
Neg logit contributions
 galaxies-0.002
iles-0.002
 spear-0.002
 obstacles-0.002
 warrior-0.002
Token very
Token ID845
Feature activation+1.39e+00

Pos logit contributions
 galaxies+0.006
 particularly+0.005
 spear+0.005
 armor+0.005
 questions+0.005
Neg logit contributions
stand-0.008
L-0.008
 chip-0.008
ird-0.008
less-0.008
Token excited
Token ID6568
Feature activation-8.25e-01

Pos logit contributions
 galaxies+0.040
 spear+0.039
iles+0.038
 wonders+0.036
 hops+0.036
Neg logit contributions
stand-0.047
burn-0.045
 Wh-0.044
 chip-0.043
 paramed-0.043
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
 paramed+0.019
 chip+0.019
 Lily+0.018
 Sara+0.017
 Wh+0.017
Neg logit contributions
 galaxies-0.035
iles-0.033
 spear-0.032
 obstacles-0.031
 feats-0.031
Token She
Token ID1375
Feature activation+2.02e+00

Pos logit contributions
 galaxies+0.007
 spear+0.006
 obstacles+0.006
Nice+0.006
 warrior+0.006
Neg logit contributions
 paramed-0.007
burn-0.007
 chip-0.007
 okay-0.006
 v-0.006
Token yelled
Token ID22760
Feature activation+1.71e+00

Pos logit contributions
 galaxies+0.091
iles+0.086
 spear+0.083
 obstacles+0.082
 warrior+0.081
Neg logit contributions
 Lily-0.051
 Sara-0.044
 chip-0.044
 Fl-0.042
 Wh-0.041
Token to
Token ID284
Feature activation+1.54e+00

Pos logit contributions
 galaxies+0.071
 spear+0.066
 warrior+0.064
iles+0.064
 obstacles+0.063
Neg logit contributions
 chip-0.039
 Lily-0.038
 paramed-0.038
burn-0.038
 Sara-0.036
Token the
Token ID262
Feature activation-2.28e-01

Pos logit contributions
 galaxies+0.076
iles+0.071
 spear+0.070
 obstacles+0.069
 warrior+0.068
Neg logit contributions
 Lily-0.033
 chip-0.031
 paramed-0.030
 Sara-0.029
 Wh-0.028
Token garden
Token ID11376
Feature activation-2.09e+00

Pos logit contributions
 Lily+0.007
 chip+0.006
 Sara+0.006
 paramed+0.006
 Wh+0.006
Neg logit contributions
 galaxies-0.009
 spear-0.008
iles-0.008
 obstacles-0.008
 warrior-0.008
Token!
Token ID0
Feature activation+4.13e-01

Pos logit contributions
 Lily+0.055
 paramed+0.052
 chip+0.052
 Sara+0.051
 Wh+0.050
Neg logit contributions
 galaxies-0.084
iles-0.077
 spear-0.077
 warrior-0.075
 obstacles-0.074
Token<|endoftext|>
Token ID50256
Feature activation+2.62e+00

Pos logit contributions
 galaxies+0.019
iles+0.018
 spear+0.017
 obstacles+0.017
 warrior+0.017
Neg logit contributions
 Lily-0.010
 chip-0.009
 Sara-0.009
 Fl-0.008
 Wh-0.008
TokenOnce
Token ID7454
Feature activation-7.52e-01

Pos logit contributions
 galaxies+0.092
iles+0.085
 spear+0.082
 obstacles+0.081
 warrior+0.080
Neg logit contributions
 Lily-0.089
 Sara-0.082
 chip-0.082
 Fl-0.079
 Wh-0.078
Token upon
Token ID2402
Feature activation-1.74e-01

Pos logit contributions
 Lily+0.027
 chip+0.025
 Sara+0.025
 paramed+0.024
 Fl+0.024
Neg logit contributions
 galaxies-0.024
iles-0.023
 spear-0.022
 obstacles-0.021
 warrior-0.021
Token a
Token ID257
Feature activation-5.76e-01

Pos logit contributions
 Lily+0.003
 chip+0.002
 Sara+0.002
 paramed+0.002
 Fl+0.002
Neg logit contributions
 galaxies-0.009
iles-0.009
 spear-0.008
 obstacles-0.008
 warrior-0.008
Token time
Token ID640
Feature activation-2.08e+00

Pos logit contributions
 Lily+0.013
 Sara+0.011
 chip+0.011
 Fl+0.010
 okay+0.010
Neg logit contributions
 galaxies-0.028
iles-0.026
 spear-0.025
 obstacles-0.025
 warrior-0.025
Token,
Token ID11
Feature activation+1.77e-01

Pos logit contributions
 Lily+0.049
 chip+0.046
 Sara+0.044
 paramed+0.043
 Wh+0.041
Neg logit contributions
 galaxies-0.091
iles-0.087
 spear-0.084
 obstacles-0.082
 warrior-0.082
Token there
Token ID612
Feature activation-2.04e+00

Pos logit contributions
 galaxies+0.009
iles+0.008
 spear+0.008
 obstacles+0.008
 warrior+0.008
Neg logit contributions
 chip-0.003
 paramed-0.003
 Lily-0.003
burn-0.003
 Sara-0.002
Token was
Token ID373
Feature activation+3.64e-01

Pos logit contributions
 Lily+0.056
 chip+0.051
 Sara+0.050
 Fl+0.048
 paramed+0.047
Neg logit contributions
 galaxies-0.085
iles-0.079
 spear-0.077
 obstacles-0.075
 warrior-0.075
Token decided
Token ID3066
Feature activation-1.18e+00

Pos logit contributions
 Lily+0.043
 chip+0.038
 Sara+0.038
 Fl+0.036
 Wh+0.036
Neg logit contributions
 galaxies-0.067
iles-0.063
 spear-0.061
 obstacles-0.061
 warrior-0.059
Token it
Token ID340
Feature activation-9.93e-01

Pos logit contributions
 chip+0.024
 paramed+0.023
 Lily+0.023
 Fl+0.021
 Sn+0.021
Neg logit contributions
 galaxies-0.053
 spear-0.050
 obstacles-0.050
iles-0.048
 warrior-0.047
Token was
Token ID373
Feature activation-3.06e-01

Pos logit contributions
 Lily+0.027
 chip+0.025
 Sara+0.024
 Fl+0.023
 Wh+0.023
Neg logit contributions
 galaxies-0.042
iles-0.039
 spear-0.038
 warrior-0.037
 obstacles-0.037
Token their
Token ID511
Feature activation-3.30e+00

Pos logit contributions
 chip+0.007
 Lily+0.007
 paramed+0.007
 Sara+0.007
 Fl+0.007
Neg logit contributions
 galaxies-0.013
iles-0.012
 spear-0.012
 obstacles-0.012
 warrior-0.011
Token new
Token ID649
Feature activation-7.98e-01

Pos logit contributions
 Lily+0.083
 chip+0.076
 Sara+0.073
 Fl+0.071
 baby+0.067
Neg logit contributions
 galaxies-0.146
iles-0.139
 obstacles-0.129
 warrior-0.128
 spear-0.128
Token favorite
Token ID4004
Feature activation+9.23e-01

Pos logit contributions
 Lily+0.017
 chip+0.016
 Sara+0.014
 baby+0.014
 Fl+0.013
Neg logit contributions
 galaxies-0.039
iles-0.038
 asteroids-0.034
 obstacles-0.034
 valleys-0.033
Token game
Token ID983
Feature activation-6.32e-01

Pos logit contributions
 galaxies+0.044
iles+0.042
 warrior+0.038
 asteroids+0.038
 obstacles+0.038
Neg logit contributions
 Lily-0.022
 chip-0.019
 Sara-0.018
 Fl-0.017
 baby-0.017
Token.
Token ID13
Feature activation+3.92e-01

Pos logit contributions
 Lily+0.015
 chip+0.014
 paramed+0.013
 Sara+0.013
 Wh+0.012
Neg logit contributions
 galaxies-0.028
iles-0.026
 spear-0.026
 obstacles-0.025
 warrior-0.025
Token They
Token ID1119
Feature activation+1.36e+00

Pos logit contributions
 galaxies+0.014
iles+0.013
 spear+0.012
 obstacles+0.012
 warrior+0.012
Neg logit contributions
 Lily-0.012
 chip-0.012
burn-0.011
 paramed-0.011
 v-0.011
Token would
Token ID561
Feature activation+6.35e-01

Pos logit contributions
 galaxies+0.058
iles+0.055
 obstacles+0.052
 spear+0.052
 warrior+0.051
Neg logit contributions
 Lily-0.036
 Sara-0.032
 chip-0.032
 Fl-0.030
 Wh-0.030
Token play
Token ID711
Feature activation+8.89e-01

Pos logit contributions
 galaxies+0.027
iles+0.026
 obstacles+0.025
 spear+0.024
 warrior+0.024
Neg logit contributions
 Lily-0.017
 chip-0.015
 Sara-0.015
 Fl-0.014
 Wh-0.014
Token and
Token ID290
Feature activation-1.59e+00

Pos logit contributions
 galaxies+0.028
iles+0.026
 obstacles+0.025
 spear+0.025
 feats+0.024
Neg logit contributions
 Lily-0.017
 Sara-0.015
 Fl-0.014
 chip-0.014
 Wh-0.013
Token Mia
Token ID32189
Feature activation-1.41e+00

Pos logit contributions
 chip+0.037
 Lily+0.036
 paramed+0.034
 Sara+0.032
 Fl+0.032
Neg logit contributions
 galaxies-0.068
 spear-0.065
iles-0.063
 warrior-0.062
 obstacles-0.060
Token liked
Token ID8288
Feature activation+7.98e-01

Pos logit contributions
 Lily+0.037
 Sara+0.033
 Fl+0.030
 chip+0.030
 Wh+0.028
Neg logit contributions
 galaxies-0.063
iles-0.059
 obstacles-0.058
 spear-0.058
 warrior-0.056
Token to
Token ID284
Feature activation+3.66e-01

Pos logit contributions
 galaxies+0.038
 spear+0.036
iles+0.036
 obstacles+0.035
 warrior+0.034
Neg logit contributions
 Lily-0.014
 chip-0.013
 paramed-0.013
 Sara-0.013
 Wh-0.012
Token paint
Token ID7521
Feature activation+6.16e-01

Pos logit contributions
 galaxies+0.016
iles+0.015
 obstacles+0.015
 spear+0.014
 warrior+0.014
Neg logit contributions
 Lily-0.010
 Sara-0.009
 chip-0.009
 Fl-0.008
 Wh-0.008
Token.
Token ID13
Feature activation-3.88e-02

Pos logit contributions
 galaxies+0.028
iles+0.026
 spear+0.025
 obstacles+0.025
 warrior+0.024
Neg logit contributions
 Lily-0.014
 chip-0.013
 paramed-0.013
 Sara-0.012
 Fl-0.012
Token They
Token ID1119
Feature activation+1.91e+00

Pos logit contributions
 Lily+0.001
 Sara+0.001
 L+0.001
 chip+0.001
 Fl+0.001
Neg logit contributions
 galaxies-0.002
 warrior-0.002
 spear-0.002
 obstacles-0.002
iles-0.002
Token had
Token ID550
Feature activation-1.41e-01

Pos logit contributions
 galaxies+0.075
iles+0.070
 spear+0.066
 obstacles+0.066
 warrior+0.064
Neg logit contributions
 Lily-0.052
 chip-0.051
 Sara-0.049
 paramed-0.048
 Fl-0.048
Token many
Token ID867
Feature activation+4.38e-01

Pos logit contributions
 chip+0.004
 Lily+0.003
 paramed+0.003
 Fl+0.003
burn+0.003
Neg logit contributions
 galaxies-0.006
iles-0.005
 obstacles-0.005
 planets-0.005
 spear-0.005
Token colors
Token ID7577
Feature activation+8.56e-01

Pos logit contributions
 galaxies+0.018
 spear+0.017
iles+0.017
 warrior+0.017
 obstacles+0.016
Neg logit contributions
 Lily-0.013
 chip-0.011
 Sara-0.011
 v-0.011
 paramed-0.010
Token and
Token ID290
Feature activation+9.85e-01

Pos logit contributions
 galaxies+0.036
iles+0.034
 spear+0.033
 obstacles+0.032
 warrior+0.032
Neg logit contributions
 Lily-0.022
 chip-0.021
 Sara-0.020
 paramed-0.020
 Fl-0.019
Token what
Token ID644
Feature activation+6.26e-01

Pos logit contributions
 galaxies+0.040
iles+0.038
 spear+0.037
 obstacles+0.036
 warrior+0.035
Neg logit contributions
 Lily-0.031
 chip-0.030
 Sara-0.028
 paramed-0.028
 Wh-0.027
Token it
Token ID340
Feature activation-1.13e+00

Pos logit contributions
 galaxies+0.033
iles+0.031
 warrior+0.030
 spear+0.030
 obstacles+0.029
Neg logit contributions
 Lily-0.011
 Sara-0.009
 chip-0.009
 Fl-0.009
 Wh-0.008
Token was
Token ID373
Feature activation-3.88e-01

Pos logit contributions
 Lily+0.026
 Fl+0.022
 chip+0.022
 Sara+0.021
 Wh+0.021
Neg logit contributions
 galaxies-0.053
iles-0.049
 spear-0.048
 warrior-0.048
 armor-0.048
Token,
Token ID11
Feature activation+5.00e-01

Pos logit contributions
 Lily+0.011
 chip+0.010
 Sara+0.010
 paramed+0.010
 Fl+0.010
Neg logit contributions
 galaxies-0.015
iles-0.014
 spear-0.014
 obstacles-0.014
 warrior-0.013
Token so
Token ID523
Feature activation-2.25e-01

Pos logit contributions
 galaxies+0.025
iles+0.024
 spear+0.023
 warrior+0.022
 obstacles+0.022
Neg logit contributions
 Lily-0.010
 Sara-0.008
 chip-0.008
 Fl-0.008
 Wh-0.008
Token he
Token ID339
Feature activation-5.52e-02

Pos logit contributions
 Lily+0.008
 chip+0.008
 paramed+0.008
burn+0.008
 Sara+0.007
Neg logit contributions
 galaxies-0.007
 spear-0.007
iles-0.006
 obstacles-0.006
 warrior-0.006
Token became
Token ID2627
Feature activation-2.24e+00

Pos logit contributions
 Lily+0.002
 chip+0.001
 Sara+0.001
 Fl+0.001
 paramed+0.001
Neg logit contributions
 galaxies-0.002
iles-0.002
 spear-0.002
 obstacles-0.002
 warrior-0.002
Token very
Token ID845
Feature activation+1.19e+00

Pos logit contributions
 Lily+0.054
 Sara+0.048
 chip+0.047
 Fl+0.045
 Wh+0.044
Neg logit contributions
 galaxies-0.102
iles-0.098
 spear-0.093
 warrior-0.091
 obstacles-0.090
Token confused
Token ID10416
Feature activation+1.70e-01

Pos logit contributions
 galaxies+0.053
iles+0.050
 spear+0.048
 obstacles+0.047
 warrior+0.047
Neg logit contributions
 Lily-0.029
 chip-0.027
 Sara-0.026
 paramed-0.025
 Fl-0.024
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
 galaxies+0.009
iles+0.008
 spear+0.008
 warrior+0.008
 obstacles+0.008
Neg logit contributions
 Lily-0.003
 Sara-0.003
 chip-0.003
 Fl-0.002
 Wh-0.002
Token
Token ID198
Feature activation-6.67e-01

Pos logit contributions
 galaxies+0.005
iles+0.004
 spear+0.004
 obstacles+0.004
 warrior+0.004
Neg logit contributions
 Lily-0.005
 chip-0.004
 Sara-0.004
 Fl-0.004
 Wh-0.004
Token rabbit
Token ID22746
Feature activation+1.16e+00

Pos logit contributions
 galaxies+0.043
iles+0.040
 spear+0.040
 obstacles+0.038
 warrior+0.037
Neg logit contributions
 Lily-0.052
 chip-0.049
 Sara-0.049
 paramed-0.047
 Fl-0.047
Token through
Token ID832
Feature activation+1.12e+00

Pos logit contributions
 galaxies+0.053
iles+0.050
 spear+0.048
 obstacles+0.047
 warrior+0.047
Neg logit contributions
 Lily-0.027
 chip-0.024
 Sara-0.024
 Fl-0.023
 Wh-0.022
Token the
Token ID262
Feature activation+2.24e-01

Pos logit contributions
 galaxies+0.060
iles+0.057
 spear+0.056
 obstacles+0.055
 warrior+0.055
Neg logit contributions
 Lily-0.017
 chip-0.015
 Sara-0.014
 paramed-0.013
 Fl-0.012
Token woods
Token ID16479
Feature activation+6.37e-01

Pos logit contributions
 galaxies+0.008
iles+0.008
 warrior+0.007
 obstacles+0.007
 spear+0.007
Neg logit contributions
 Lily-0.007
 chip-0.007
 Mitt-0.006
 Wh-0.006
 Sara-0.006
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
 galaxies+0.028
iles+0.026
 spear+0.026
 obstacles+0.025
 warrior+0.025
Neg logit contributions
 Lily-0.016
 chip-0.014
 Sara-0.014
 paramed-0.013
 Fl-0.013
Token
Token ID198
Feature activation-7.19e-01

Pos logit contributions
 galaxies+0.001
iles+0.001
 spear+0.001
 obstacles+0.001
 warrior+0.001
Neg logit contributions
 Lily-0.001
 chip-0.001
 Sara-0.001
 paramed-0.001
burn-0.000
Token
Token ID198
Feature activation-6.13e-01

Pos logit contributions
 Lily+0.014
 Sara+0.011
 chip+0.011
 Fl+0.011
 Wh+0.010
Neg logit contributions
 galaxies-0.037
iles-0.035
 spear-0.034
 warrior-0.034
 obstacles-0.034
TokenThe
Token ID464
Feature activation-3.54e-02

Pos logit contributions
 Lily+0.016
 chip+0.015
 Sara+0.014
 Wh+0.014
 paramed+0.014
Neg logit contributions
 galaxies-0.026
iles-0.025
 spear-0.024
 obstacles-0.023
 warrior-0.023
Token rabbit
Token ID22746
Feature activation+1.08e-01

Pos logit contributions
 Lily+0.001
 chip+0.001
 Mitt+0.001
 baby+0.001
 paramed+0.001
Neg logit contributions
 galaxies-0.001
iles-0.001
 feats-0.001
 obstacles-0.001
 facts-0.001
Token was
Token ID373
Feature activation+4.89e-01

Pos logit contributions
 galaxies+0.004
iles+0.004
 spear+0.004
 obstacles+0.004
 warrior+0.004
Neg logit contributions
 Lily-0.003
 chip-0.003
 Sara-0.003
 Fl-0.003
 Wh-0.003
Token scared
Token ID12008
Feature activation+1.85e-01

Pos logit contributions
 galaxies+0.018
 obstacles+0.017
 spear+0.017
iles+0.016
 warrior+0.016
Neg logit contributions
 chip-0.014
 Lily-0.013
 paramed-0.013
burn-0.013
 Sara-0.013
Token was
Token ID373
Feature activation+1.91e-01

Pos logit contributions
 Lily+0.009
 chip+0.008
 Sara+0.008
 paramed+0.008
 Fl+0.008
Neg logit contributions
 galaxies-0.013
iles-0.012
 spear-0.012
 obstacles-0.011
 warrior-0.011
Token so
Token ID523
Feature activation+4.60e-01

Pos logit contributions
 galaxies+0.007
 obstacles+0.007
 warrior+0.006
iles+0.006
 spear+0.006
Neg logit contributions
 chip-0.005
 Lily-0.005
burn-0.005
 paramed-0.005
 Wh-0.005
Token excited
Token ID6568
Feature activation-7.54e-01

Pos logit contributions
 galaxies+0.017
iles+0.015
 obstacles+0.015
 warrior+0.015
 spear+0.015
Neg logit contributions
 chip-0.013
 Lily-0.013
burn-0.013
 Wh-0.013
 paramed-0.012
Token that
Token ID326
Feature activation-6.03e-01

Pos logit contributions
 paramed+0.017
 chip+0.017
 Lily+0.017
burn+0.016
 Wh+0.016
Neg logit contributions
 galaxies-0.032
iles-0.031
 obstacles-0.029
 spear-0.029
 warrior-0.028
Token she
Token ID673
Feature activation-1.45e-04

Pos logit contributions
 chip+0.012
 paramed+0.012
burn+0.011
 Lily+0.011
 Wh+0.010
Neg logit contributions
 galaxies-0.029
 spear-0.026
 obstacles-0.026
 warrior-0.025
iles-0.025
Token started
Token ID2067
Feature activation-2.12e+00

Pos logit contributions
 Lily+0.000
 chip+0.000
 Sara+0.000
 paramed+0.000
 Fl+0.000
Neg logit contributions
 galaxies-0.000
iles-0.000
 spear-0.000
 obstacles-0.000
 warrior-0.000
Token to
Token ID284
Feature activation-8.94e-01

Pos logit contributions
 Lily+0.047
 chip+0.044
 Sara+0.042
 paramed+0.041
 Fl+0.040
Neg logit contributions
 galaxies-0.095
 obstacles-0.088
 spear-0.087
iles-0.087
 warrior-0.084
Token jump
Token ID4391
Feature activation+1.95e-01

Pos logit contributions
 Lily+0.027
 chip+0.025
 Sara+0.025
 Fl+0.024
 paramed+0.024
Neg logit contributions
 galaxies-0.034
iles-0.032
 spear-0.031
 obstacles-0.030
 warrior-0.030
Token and
Token ID290
Feature activation+9.50e-01

Pos logit contributions
 galaxies+0.008
iles+0.008
 spear+0.007
 obstacles+0.007
 warrior+0.007
Neg logit contributions
 Lily-0.005
 chip-0.005
 Sara-0.005
 Fl-0.004
 paramed-0.004
Token dance
Token ID9280
Feature activation-7.34e-01

Pos logit contributions
 galaxies+0.035
iles+0.032
 spear+0.032
 obstacles+0.031
 warrior+0.030
Neg logit contributions
 Lily-0.029
 chip-0.028
 Sara-0.027
 paramed-0.027
 Wh-0.026
Token around
Token ID1088
Feature activation+2.49e+00

Pos logit contributions
 Lily+0.019
 chip+0.018
 paramed+0.018
 Sara+0.017
burn+0.017
Neg logit contributions
 galaxies-0.031
iles-0.028
 spear-0.028
 obstacles-0.027
 warrior-0.027
Token That
Token ID1320
Feature activation+1.66e+00

Pos logit contributions
 galaxies+0.027
iles+0.025
 spear+0.024
 warrior+0.023
 obstacles+0.023
Neg logit contributions
 Lily-0.022
 chip-0.020
 Sara-0.020
 paramed-0.019
burn-0.019
Token sounds
Token ID5238
Feature activation+5.51e-01

Pos logit contributions
 galaxies+0.068
iles+0.063
Nice+0.061
 warrior+0.061
 obstacles+0.061
Neg logit contributions
 Lily-0.045
 chip-0.043
 v-0.040
 baby-0.039
 Fl-0.039
Token scary
Token ID14343
Feature activation+1.25e+00

Pos logit contributions
 galaxies+0.026
iles+0.025
 warrior+0.024
 spear+0.024
 obstacles+0.023
Neg logit contributions
 Lily-0.013
 Sara-0.010
 chip-0.010
 okay-0.010
 Fl-0.010
Token.
Token ID13
Feature activation-3.67e-01

Pos logit contributions
 galaxies+0.060
iles+0.056
 spear+0.055
 warrior+0.055
 obstacles+0.054
Neg logit contributions
 Lily-0.027
 chip-0.023
 Sara-0.023
 Fl-0.021
 v-0.021
Token I
Token ID314
Feature activation+3.69e-01

Pos logit contributions
 Lily+0.011
 chip+0.011
burn+0.011
 paramed+0.010
 Sara+0.010
Neg logit contributions
 galaxies-0.013
iles-0.012
 spear-0.011
 obstacles-0.011
 warrior-0.011
Token don
Token ID836
Feature activation-9.09e-01

Pos logit contributions
 galaxies+0.015
iles+0.013
 obstacles+0.013
 spear+0.013
 warrior+0.013
Neg logit contributions
 Lily-0.011
 Sara-0.010
 chip-0.010
 Fl-0.010
 okay-0.009
Token't
Token ID470
Feature activation-1.27e+00

Pos logit contributions
 Lily+0.014
 chip+0.013
 Sara+0.012
 Wh+0.011
 Fl+0.011
Neg logit contributions
 galaxies-0.049
iles-0.046
 spear-0.045
 obstacles-0.044
 warrior-0.044
Token want
Token ID765
Feature activation-9.62e-01

Pos logit contributions
 Lily+0.039
 chip+0.036
 Sara+0.036
 Fl+0.034
 paramed+0.034
Neg logit contributions
 galaxies-0.048
iles-0.045
 spear-0.043
 obstacles-0.043
 warrior-0.042
Token my
Token ID616
Feature activation-1.49e+00

Pos logit contributions
 Lily+0.015
 chip+0.014
 Sara+0.013
 paramed+0.012
 Fl+0.012
Neg logit contributions
 galaxies-0.050
iles-0.048
 spear-0.047
 obstacles-0.046
 warrior-0.046
Token rock
Token ID3881
Feature activation+1.43e-01

Pos logit contributions
 Lily+0.050
 Sara+0.047
 baby+0.046
 chip+0.046
 v+0.045
Neg logit contributions
 galaxies-0.052
iles-0.047
Nice-0.045
 obstacles-0.045
 warrior-0.043
Token to
Token ID284
Feature activation+3.33e-02

Pos logit contributions
 galaxies+0.006
iles+0.006
 spear+0.006
 obstacles+0.005
 warrior+0.005
Neg logit contributions
 Lily-0.004
 chip-0.003
 paramed-0.003
 Sara-0.003
 Fl-0.003
Token for
Token ID329
Feature activation+1.12e+00

Pos logit contributions
 galaxies+0.049
iles+0.046
 spear+0.045
 warrior+0.045
 obstacles+0.044
Neg logit contributions
 Lily-0.014
 Sara-0.012
 chip-0.011
 Fl-0.011
 Wh-0.010
Token the
Token ID262
Feature activation+7.32e-01

Pos logit contributions
 galaxies+0.055
iles+0.053
 spear+0.052
 obstacles+0.051
 warrior+0.050
Neg logit contributions
 Lily-0.021
 chip-0.019
 Sara-0.018
 paramed-0.018
 Wh-0.017
Token leaf
Token ID12835
Feature activation-4.84e-01

Pos logit contributions
 galaxies+0.027
iles+0.025
 spear+0.024
 obstacles+0.024
 warrior+0.023
Neg logit contributions
 Lily-0.023
 chip-0.022
 Sara-0.021
 paramed-0.021
 Wh-0.020
Token.
Token ID13
Feature activation-4.24e-01

Pos logit contributions
 paramed+0.016
 Em+0.014
 Lily+0.014
Pe+0.014
 Wh+0.014
Neg logit contributions
iles-0.015
 spear-0.015
 galaxies-0.015
 cliffs-0.014
 reefs-0.014
Token But
Token ID887
Feature activation+1.82e+00

Pos logit contributions
burn+0.014
 chip+0.014
 paramed+0.014
 Lily+0.014
 v+0.013
Neg logit contributions
 galaxies-0.013
iles-0.012
 spear-0.012
 obstacles-0.012
 reefs-0.011
Token the
Token ID262
Feature activation-4.06e-01

Pos logit contributions
 galaxies+0.074
 spear+0.070
iles+0.070
 obstacles+0.068
 cliffs+0.066
Neg logit contributions
 paramed-0.045
burn-0.043
 chip-0.043
stand-0.040
ird-0.037
Token leaf
Token ID12835
Feature activation-1.92e+00

Pos logit contributions
 Lily+0.012
 chip+0.011
 Sara+0.011
 Fl+0.011
 Mitt+0.010
Neg logit contributions
 galaxies-0.016
iles-0.015
 spear-0.014
 obstacles-0.014
 warrior-0.014
Token is
Token ID318
Feature activation-1.27e+00

Pos logit contributions
 Lily+0.069
 chip+0.064
 Sara+0.064
 Fl+0.061
 Wh+0.060
Neg logit contributions
 galaxies-0.065
iles-0.058
 spear-0.056
 warrior-0.055
 obstacles-0.055
Token not
Token ID407
Feature activation-3.09e+00

Pos logit contributions
 paramed+0.034
 chip+0.032
 Lily+0.032
ird+0.031
 Wh+0.030
Neg logit contributions
 galaxies-0.048
iles-0.048
 spear-0.048
 obstacles-0.046
 goals-0.043
Token water
Token ID1660
Feature activation-2.56e+00

Pos logit contributions
 paramed+0.090
 chip+0.078
stand+0.076
 Wh+0.075
 Mitt+0.073
Neg logit contributions
iles-0.121
 spear-0.120
 galaxies-0.118
 obstacles-0.112
 islands-0.107
Token.
Token ID13
Feature activation-7.31e-01

Pos logit contributions
 paramed+0.080
 Lily+0.073
 Wh+0.070
ird+0.070
 Daughter+0.069
Neg logit contributions
iles-0.089
 spear-0.079
 cliffs-0.077
 obstacles-0.076
 obstacle-0.076
Token away
Token ID1497
Feature activation-3.63e-01

Pos logit contributions
 Lily+0.010
 chip+0.010
 paramed+0.009
 Sara+0.009
burn+0.008
Neg logit contributions
 galaxies-0.021
 spear-0.020
iles-0.020
 obstacles-0.020
 warrior-0.019
Token.
Token ID13
Feature activation-9.21e-02

Pos logit contributions
burn+0.009
 paramed+0.008
 Lily+0.008
 chip+0.008
ird+0.008
Neg logit contributions
 galaxies-0.015
 spear-0.014
 obstacles-0.014
iles-0.013
 warrior-0.013
Token They
Token ID1119
Feature activation+5.95e-01

Pos logit contributions
 Lily+0.003
 chip+0.002
 Sara+0.002
 Fl+0.002
 paramed+0.002
Neg logit contributions
 galaxies-0.004
iles-0.004
 spear-0.003
 obstacles-0.003
 warrior-0.003
Token did
Token ID750
Feature activation-4.73e-01

Pos logit contributions
 galaxies+0.022
iles+0.020
 spear+0.020
 obstacles+0.019
 warrior+0.019
Neg logit contributions
 chip-0.018
 Lily-0.018
 paramed-0.017
 Sara-0.016
burn-0.016
Token not
Token ID407
Feature activation-1.55e+00

Pos logit contributions
 Lily+0.007
 chip+0.005
 Sara+0.005
 Fl+0.005
 Wh+0.005
Neg logit contributions
 galaxies-0.026
iles-0.025
 spear-0.024
 warrior-0.024
 obstacles-0.024
Token come
Token ID1282
Feature activation-2.64e+00

Pos logit contributions
 chip+0.054
 Lily+0.053
 paramed+0.050
 Fl+0.050
 Sara+0.050
Neg logit contributions
 galaxies-0.049
iles-0.045
 warrior-0.044
 spear-0.044
 obstacles-0.043
Token back
Token ID736
Feature activation+2.95e-01

Pos logit contributions
 Lily+0.053
 chip+0.049
 Sara+0.046
 paramed+0.045
 Fl+0.043
Neg logit contributions
 galaxies-0.127
iles-0.120
 spear-0.118
 obstacles-0.115
 warrior-0.115
Token to
Token ID284
Feature activation-1.39e+00

Pos logit contributions
 galaxies+0.014
iles+0.013
 spear+0.013
 obstacles+0.012
 warrior+0.012
Neg logit contributions
 Lily-0.006
 chip-0.006
 paramed-0.006
 Sara-0.006
 Fl-0.005
Token that
Token ID326
Feature activation-1.70e+00

Pos logit contributions
 Lily+0.027
 chip+0.026
 paramed+0.025
 Sara+0.023
burn+0.023
Neg logit contributions
 galaxies-0.066
iles-0.063
 spear-0.061
 obstacles-0.061
 warrior-0.060
Token garden
Token ID11376
Feature activation-2.21e+00

Pos logit contributions
 Lily+0.044
 chip+0.040
 Sara+0.039
 Fl+0.037
 paramed+0.036
Neg logit contributions
 galaxies-0.073
iles-0.069
 spear-0.067
 obstacles-0.066
 warrior-0.065
Token ever
Token ID1683
Feature activation-1.12e+00

Pos logit contributions
 Lily+0.056
 chip+0.053
 paramed+0.051
 Sara+0.051
 Wh+0.049
Neg logit contributions
 galaxies-0.093
iles-0.087
 spear-0.084
 obstacles-0.084
 warrior-0.081
Token happier
Token ID23030
Feature activation-2.43e+00

Pos logit contributions
 Lily+0.015
 chip+0.015
 paramed+0.014
 Sara+0.014
 Wh+0.014
Neg logit contributions
 galaxies-0.020
 spear-0.018
iles-0.018
 obstacles-0.017
 warrior-0.017
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 Lily+0.056
 chip+0.052
 Sara+0.050
 paramed+0.048
 Wh+0.047
Neg logit contributions
 galaxies-0.110
iles-0.104
 spear-0.101
 obstacles-0.099
 warrior-0.098
Token By
Token ID2750
Feature activation-2.60e+00

Pos logit contributions
 Lily+0.000
 Sara+0.000
 chip+0.000
 Fl+0.000
 Mitt+0.000
Neg logit contributions
 galaxies-0.000
iles-0.000
 spear-0.000
 obstacles-0.000
 obstacle-0.000
Token the
Token ID262
Feature activation-1.82e+00

Pos logit contributions
 Lily+0.044
 chip+0.039
 Sara+0.037
 Fl+0.034
 Wh+0.033
Neg logit contributions
 galaxies-0.134
iles-0.128
 spear-0.125
 obstacles-0.122
 warrior-0.122
Token end
Token ID886
Feature activation+1.26e-01

Pos logit contributions
 Lily+0.069
 chip+0.065
 Sara+0.065
 paramed+0.063
 Wh+0.062
Neg logit contributions
 galaxies-0.056
iles-0.052
 spear-0.050
 obstacles-0.048
 warrior-0.048
Token of
Token ID286
Feature activation-4.85e+00

Pos logit contributions
 galaxies+0.006
iles+0.006
 spear+0.006
 obstacles+0.006
 warrior+0.005
Neg logit contributions
 Lily-0.002
 chip-0.002
 Sara-0.002
burn-0.002
 paramed-0.002
Token the
Token ID262
Feature activation-1.43e+00

Pos logit contributions
 Lily+0.112
 chip+0.105
 paramed+0.102
 Sara+0.100
 Wh+0.096
Neg logit contributions
 galaxies-0.215
iles-0.205
 spear-0.200
 obstacles-0.194
 warrior-0.193
Token day
Token ID1110
Feature activation-2.04e+00

Pos logit contributions
 Lily+0.037
 chip+0.035
 Sara+0.033
 paramed+0.032
 Fl+0.032
Neg logit contributions
 galaxies-0.061
iles-0.057
 spear-0.056
 obstacles-0.054
 warrior-0.054
Token,
Token ID11
Feature activation+1.81e-01

Pos logit contributions
 chip+0.059
 Lily+0.059
burn+0.058
 paramed+0.058
 Sara+0.055
Neg logit contributions
 galaxies-0.077
iles-0.070
 warrior-0.068
 spear-0.068
 obstacles-0.067
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
 galaxies+0.008
 spear+0.007
iles+0.007
 obstacles+0.007
 warrior+0.007
Neg logit contributions
 Lily-0.004
 chip-0.004
 paramed-0.004
 Sara-0.004
burn-0.004
Token sour
Token ID11348
Feature activation-1.59e+00

Pos logit contributions
 Lily+0.040
 chip+0.037
 Sara+0.036
 paramed+0.035
 Fl+0.034
Neg logit contributions
 galaxies-0.057
iles-0.055
 spear-0.052
 obstacles-0.051
 warrior-0.050
Token a
Token ID257
Feature activation+4.40e-01

Pos logit contributions
 Lily+0.035
 chip+0.032
 Sara+0.031
 paramed+0.029
 Fl+0.028
Neg logit contributions
 galaxies-0.091
iles-0.087
 spear-0.085
 obstacles-0.083
 warrior-0.082
Token line
Token ID1627
Feature activation-1.17e+00

Pos logit contributions
 galaxies+0.019
 spear+0.018
iles+0.017
 warrior+0.017
 obstacles+0.017
Neg logit contributions
 Lily-0.011
 Sara-0.010
 chip-0.010
 paramed-0.010
 Wh-0.010
Token from
Token ID422
Feature activation-3.26e-01

Pos logit contributions
 Lily+0.027
 chip+0.026
 paramed+0.025
 Fl+0.025
 Wh+0.025
Neg logit contributions
 galaxies-0.051
iles-0.048
 spear-0.047
 obstacles-0.047
 warrior-0.046
Token one
Token ID530
Feature activation+1.08e+00

Pos logit contributions
 Lily+0.006
 Sara+0.005
 chip+0.005
 Fl+0.005
 Wh+0.004
Neg logit contributions
 galaxies-0.017
iles-0.016
 spear-0.015
 warrior-0.015
 obstacles-0.015
Token side
Token ID1735
Feature activation-1.30e+00

Pos logit contributions
 galaxies+0.042
iles+0.040
 spear+0.039
 obstacles+0.038
 warrior+0.038
Neg logit contributions
 Lily-0.031
 paramed-0.030
 Wh-0.029
 Sara-0.029
 chip-0.029
Token of
Token ID286
Feature activation-4.30e+00

Pos logit contributions
 Lily+0.029
 chip+0.027
 paramed+0.027
 Sara+0.026
 Wh+0.025
Neg logit contributions
 galaxies-0.058
iles-0.056
 spear-0.054
 obstacles-0.053
 warrior-0.052
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
 Lily+0.066
 chip+0.057
 Sara+0.055
 Fl+0.049
 paramed+0.049
Neg logit contributions
 galaxies-0.229
iles-0.219
 spear-0.213
 obstacles-0.209
 warrior-0.209
Token room
Token ID2119
Feature activation-3.13e+00

Pos logit contributions
 Lily+0.035
 chip+0.033
 Sara+0.033
 paramed+0.032
burn+0.031
Neg logit contributions
 galaxies-0.046
iles-0.043
 spear-0.042
 obstacles-0.041
 warrior-0.041
Token to
Token ID284
Feature activation-9.56e-01

Pos logit contributions
 Lily+0.080
 chip+0.078
 paramed+0.076
 Wh+0.074
 Sara+0.074
Neg logit contributions
 galaxies-0.130
iles-0.119
 spear-0.116
 obstacles-0.115
 warrior-0.114
Token the
Token ID262
Feature activation-7.95e-01

Pos logit contributions
 Lily+0.025
 chip+0.024
 Sara+0.023
 paramed+0.023
 Fl+0.022
Neg logit contributions
 galaxies-0.040
iles-0.037
 spear-0.036
 obstacles-0.035
 warrior-0.035
Token other
Token ID584
Feature activation-1.07e+00

Pos logit contributions
 Lily+0.023
 paramed+0.021
 Sara+0.021
 chip+0.021
burn+0.021
Neg logit contributions
 galaxies-0.031
 spear-0.028
iles-0.028
 warrior-0.028
 obstacles-0.028
Token new
Token ID649
Feature activation+8.64e-01

Pos logit contributions
 galaxies+0.046
 spear+0.044
 obstacles+0.044
 warrior+0.042
iles+0.041
Neg logit contributions
 Lily-0.027
 Sara-0.024
 Wh-0.023
 Fl-0.023
 chip-0.022
Token comb
Token ID1974
Feature activation+1.12e+00

Pos logit contributions
 galaxies+0.040
 spear+0.038
 obstacles+0.038
iles+0.037
 warrior+0.037
Neg logit contributions
 Lily-0.017
 Sara-0.015
 chip-0.015
 Wh-0.015
 Fl-0.014
Token and
Token ID290
Feature activation+1.02e-01

Pos logit contributions
 galaxies+0.044
 spear+0.041
 obstacles+0.039
iles+0.039
 warrior+0.038
Neg logit contributions
 Lily-0.030
 paramed-0.029
 Sara-0.029
 Wh-0.027
 chip-0.027
Token it
Token ID340
Feature activation-1.03e+00

Pos logit contributions
 galaxies+0.004
 spear+0.004
iles+0.004
 obstacles+0.004
 warrior+0.004
Neg logit contributions
 Lily-0.002
 chip-0.002
 paramed-0.002
 Sara-0.002
 Wh-0.002
Token never
Token ID1239
Feature activation-2.52e+00

Pos logit contributions
 Lily+0.034
 chip+0.032
 Sara+0.032
 paramed+0.031
 Wh+0.030
Neg logit contributions
 galaxies-0.036
iles-0.033
 spear-0.032
 obstacles-0.031
 warrior-0.031
Token got
Token ID1392
Feature activation-4.27e+00

Pos logit contributions
 chip+0.072
 paramed+0.072
 Lily+0.068
 Sara+0.067
 Mitt+0.066
Neg logit contributions
 galaxies-0.095
iles-0.084
 obstacles-0.084
 spear-0.082
 warrior-0.082
Token hot
Token ID3024
Feature activation-2.60e-01

Pos logit contributions
 paramed+0.111
 chip+0.108
stand+0.099
ird+0.099
 Lily+0.096
Neg logit contributions
 galaxies-0.163
 obstacles-0.155
iles-0.150
 spear-0.147
 armor-0.141
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
 Lily+0.007
 chip+0.007
 paramed+0.007
 Wh+0.007
 Sara+0.006
Neg logit contributions
 galaxies-0.010
iles-0.009
 obstacles-0.009
 spear-0.009
 warrior-0.009
Token The
Token ID383
Feature activation+4.26e-01

Pos logit contributions
 Lily+0.004
 chip+0.004
 paramed+0.003
burn+0.003
 v+0.003
Neg logit contributions
 galaxies-0.005
iles-0.004
 spear-0.004
 warrior-0.004
 obstacles-0.004
Token man
Token ID582
Feature activation+6.27e-01

Pos logit contributions
 galaxies+0.019
iles+0.018
 spear+0.018
 obstacles+0.017
 warrior+0.017
Neg logit contributions
 Lily-0.010
 chip-0.009
 Sara-0.009
 paramed-0.009
 Fl-0.008
Token had
Token ID550
Feature activation-1.27e+00

Pos logit contributions
 galaxies+0.025
iles+0.024
 spear+0.023
 obstacles+0.023
 warrior+0.022
Neg logit contributions
 Lily-0.017
 chip-0.016
 Sara-0.016
 paramed-0.015
 Fl-0.015
Token But
Token ID887
Feature activation+1.85e+00

Pos logit contributions
 galaxies+0.041
iles+0.039
 spear+0.037
 obstacles+0.036
 warrior+0.036
Neg logit contributions
 Lily-0.023
 chip-0.021
 Sara-0.020
 Fl-0.019
 Wh-0.019
Token again
Token ID757
Feature activation+7.44e-01

Pos logit contributions
 galaxies+0.073
 spear+0.067
iles+0.066
 obstacles+0.066
 warrior+0.065
Neg logit contributions
 Lily-0.050
 chip-0.050
 paramed-0.049
 Sara-0.047
burn-0.047
Token he
Token ID339
Feature activation-9.69e-01

Pos logit contributions
 galaxies+0.033
iles+0.031
 spear+0.030
 obstacles+0.030
 warrior+0.030
Neg logit contributions
 Lily-0.018
 chip-0.016
 Sara-0.016
 paramed-0.015
 Fl-0.015
Token was
Token ID373
Feature activation-1.02e+00

Pos logit contributions
 Lily+0.026
 Sara+0.023
 chip+0.023
 Fl+0.022
 Wh+0.022
Neg logit contributions
 galaxies-0.041
iles-0.038
 spear-0.037
 obstacles-0.036
 warrior-0.036
Token not
Token ID407
Feature activation-3.28e+00

Pos logit contributions
 Lily+0.026
 chip+0.025
 paramed+0.024
 Sara+0.023
 Wh+0.023
Neg logit contributions
 galaxies-0.042
iles-0.040
 spear-0.039
 obstacles-0.039
 warrior-0.038
Token able
Token ID1498
Feature activation-5.67e+00

Pos logit contributions
 chip+0.083
 paramed+0.083
 Lily+0.082
 Wh+0.077
burn+0.074
Neg logit contributions
 galaxies-0.133
iles-0.124
 obstacles-0.124
 spear-0.123
 warrior-0.123
Token to
Token ID284
Feature activation-8.34e-01

Pos logit contributions
 Lily+0.094
 chip+0.087
 paramed+0.080
 Sara+0.079
 Wh+0.074
Neg logit contributions
 galaxies-0.290
iles-0.277
 spear-0.272
 obstacles-0.268
 warrior-0.263
Token touch
Token ID3638
Feature activation+2.04e+00

Pos logit contributions
 Lily+0.023
 chip+0.022
 Sara+0.022
 Fl+0.020
 Wh+0.020
Neg logit contributions
 galaxies-0.034
iles-0.032
 spear-0.030
 obstacles-0.030
 warrior-0.029
Token the
Token ID262
Feature activation+1.41e+00

Pos logit contributions
 galaxies+0.110
iles+0.106
 spear+0.102
 warrior+0.102
 obstacles+0.099
Neg logit contributions
 Lily-0.031
 Sara-0.026
 Fl-0.023
 chip-0.022
 Wh-0.020
Token clouds
Token ID15114
Feature activation+6.41e-01

Pos logit contributions
 galaxies+0.049
iles+0.047
 spear+0.044
 warrior+0.043
 obstacles+0.042
Neg logit contributions
 Lily-0.046
 chip-0.044
 Sara-0.042
 baby-0.041
 v-0.041
Token with
Token ID351
Feature activation-2.47e-01

Pos logit contributions
 galaxies+0.025
iles+0.024
 spear+0.024
 obstacles+0.023
 warrior+0.023
Neg logit contributions
 Lily-0.017
 paramed-0.016
 chip-0.016
 Sara-0.016
 Wh-0.016
Tokenuzz
Token ID4715
Feature activation+1.71e-01

Pos logit contributions
 galaxies+0.020
 spear+0.020
 obstacles+0.019
 warrior+0.019
 armor+0.019
Neg logit contributions
 Lily-0.009
 Sara-0.008
 L-0.007
gan-0.007
 Fl-0.007
Tokeny
Token ID88
Feature activation-1.67e+00

Pos logit contributions
 galaxies+0.007
iles+0.006
 spear+0.006
 obstacles+0.006
 warrior+0.006
Neg logit contributions
 Lily-0.005
 chip-0.004
 Sara-0.004
 Fl-0.004
 Wh-0.004
Token enjoyed
Token ID8359
Feature activation-5.95e-03

Pos logit contributions
 Lily+0.051
 chip+0.048
 Sara+0.047
 paramed+0.046
 Wh+0.044
Neg logit contributions
 galaxies-0.063
iles-0.058
 spear-0.056
 obstacles-0.055
 warrior-0.055
Token the
Token ID262
Feature activation+8.37e-01

Pos logit contributions
 Lily+0.000
 chip+0.000
 paramed+0.000
 Sara+0.000
burn+0.000
Neg logit contributions
 galaxies-0.000
iles-0.000
 spear-0.000
 obstacles-0.000
 warrior-0.000
Token rest
Token ID1334
Feature activation-9.63e-01

Pos logit contributions
 galaxies+0.033
iles+0.031
 obstacles+0.030
 spear+0.030
 warrior+0.029
Neg logit contributions
 Lily-0.024
 chip-0.022
 Sara-0.021
 Fl-0.020
 v-0.020
Token of
Token ID286
Feature activation-4.12e+00

Pos logit contributions
 paramed+0.024
 chip+0.022
 Lily+0.022
 Wh+0.022
burn+0.021
Neg logit contributions
 galaxies-0.041
iles-0.040
 spear-0.037
 obstacles-0.036
 warrior-0.035
Token their
Token ID511
Feature activation-3.74e+00

Pos logit contributions
 Lily+0.070
 paramed+0.068
 chip+0.066
burn+0.059
 okay+0.059
Neg logit contributions
 galaxies-0.203
iles-0.192
 spear-0.188
 obstacles-0.187
 warrior-0.182
Token walk
Token ID2513
Feature activation-3.84e-01

Pos logit contributions
 Lily+0.115
 chip+0.105
 Sara+0.102
 v+0.099
 Fl+0.099
Neg logit contributions
 galaxies-0.136
iles-0.126
 spear-0.122
 warrior-0.121
 obstacles-0.120
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
 Lily+0.009
 chip+0.009
 Sara+0.009
 paramed+0.009
 Fl+0.008
Neg logit contributions
 galaxies-0.017
iles-0.015
 spear-0.015
 obstacles-0.015
 warrior-0.015
Token<|endoftext|>
Token ID50256
Feature activation+2.32e+00

Pos logit contributions
 chip+0.012
 Lily+0.012
 paramed+0.011
burn+0.011
 v+0.011
Neg logit contributions
 galaxies-0.015
iles-0.014
 obstacles-0.013
 spear-0.013
 warrior-0.013
TokenThe
Token ID464
Feature activation+2.79e-01

Pos logit contributions
 galaxies+0.082
iles+0.076
 spear+0.073
 obstacles+0.072
 warrior+0.071
Neg logit contributions
 Lily-0.080
 Sara-0.073
 chip-0.072
 Fl-0.071
 Wh-0.069