TOP ACTIVATIONS
MAX = 0.188

loo and pretended to be Esk im os . After a
she could . A few minutes later , the creature pulled
would sh iver every few minutes . Just then , he
toys and for a few minutes she did , but then
. So , every few minutes she would shout : '
rain for a few more minutes before finally saying goodbye and
animal . After a few minutes , Amber had a weight
bake . After a few minutes , the cookies were ready
noises . After a few minutes , the doctor came back
, but after a few minutes he managed to do it
home . After a few minutes of arguing , Sam knew
After a few minutes , Tim my found the
. But after a few minutes , something changed . Suddenly
done . After a few minutes , the rain stopped and
dry . After a few minutes , it was ready and
film . After a few minutes , Anna realized that the
looking around for a few minutes , he heard a voice
practise . After a few minutes , he was able to
only watch TV for 30 minutes . Lily asked Mitt ens
After searching for a few minutes , they found an old

MIN ACTIVATIONS
MIN = -0.198

. Sara likes the water melon . She says it is
. They eat the water melon and enjoy the spring .
my d arl ings . Enjoy your lunch . I made
welcome , little girl . Enjoy your balloon ." the man
Sara bites into the water melon . It is juicy and
says it is a water melon . She says it is
wants to try the water melon . She asks her mom
Lily replied , " To ast with jam ! It was
market and find a water melon . Her mom pays the
man and takes the water melon home . She cuts it
made his mother even p rou der . <|endoftext|> John and
her body like a gymn ast . Lily tried her best
one is for free . Enjoy the show !"
the other wild er be ast . " Let 's explore
a big , juicy water melon in the field .
. S eb ast ian disagreed with his friend
and the wild er be ast smiled . "
feels even happier and p rou der . He displays his
a piece of juicy water melon . When she took a
" Here you go . Enjoy your grapes . They are

INTERVAL 0.091 - 0.188
CONTAINS 0.416%

raven , and it was sitting on the fence . It
" I 'm OK , Lily . It 's
land of muff ins . There are many muff ins like
being adventurous is not about being dangerous . It is about
obedient . She knew that being obedient would make her mom

INTERVAL -0.005 - 0.091
CONTAINS 57.900%

<|endoftext|> Ben liked to balance on the wall . It was
will keep looking until they do . She says they will
Lily . Thank you !" Tom said , putting it on
the slide too . They waited and waited , but the
wanted one day ." L ily smiled and

INTERVAL -0.101 - -0.005
CONTAINS 41.584%

up the nuts . They saw them hide them in their
was a big , red balloon floating in the sky !
was confused , but the park keeper was shout i ing
his toy gun and decided to play with his friends instead
tired and wanted to go back to his cozy home in

INTERVAL -0.198 - -0.101
CONTAINS 0.100%

Yes , of course . Enjoy it !" The
. She hugs him and thanks Sara and Ben . She
. It was the biggest squash that anyone had ever seen
was very excited and qu acked and qu acked with g
asked . M um said , " It 's
Tokenloo
Token ID29680
Feature activation+2.31e-02

Pos logit contributions
 ghosts+0.177
 cob+0.166
 snowy+0.163
 staying+0.162
 breathing+0.158
Neg logit contributions
-0.224
oured-0.202
ete-0.194
oni-0.185
iane-0.184
Token and
Token ID290
Feature activation+1.49e-02

Pos logit contributions
 ghosts+0.037
 cob+0.035
 staying+0.034
 snowy+0.033
 breathing+0.033
Neg logit contributions
-0.045
oured-0.040
ete-0.039
oni-0.037
iane-0.037
Token pretended
Token ID36766
Feature activation+2.82e-02

Pos logit contributions
 ghosts+0.026
 cob+0.024
 exc+0.023
 staying+0.023
 snowy+0.023
Neg logit contributions
-0.026
oured-0.024
ete-0.022
oni-0.022
als-0.022
Token to
Token ID284
Feature activation-3.73e-03

Pos logit contributions
 ghosts+0.046
 cob+0.042
 staying+0.041
 snowy+0.040
 breathing+0.040
Neg logit contributions
-0.056
oured-0.049
ete-0.047
oni-0.046
or-0.045
Token be
Token ID307
Feature activation+7.08e-02

Pos logit contributions
+0.007
oured+0.006
 course+0.006
ete+0.006
ar+0.006
Neg logit contributions
 ghosts-0.006
 staying-0.006
 cob-0.005
 breathing-0.005
 snowy-0.005
Token Esk
Token ID47534
Feature activation+1.88e-01

Pos logit contributions
 ghosts+0.102
 staying+0.093
 cob+0.092
 snowy+0.089
 exc+0.088
Neg logit contributions
-0.145
oured-0.130
ete-0.126
oni-0.124
ar-0.122
Tokenim
Token ID320
Feature activation-7.48e-02

Pos logit contributions
 ghosts+0.357
 staying+0.332
 cob+0.332
 snowy+0.324
 breathing+0.321
Neg logit contributions
-0.314
oured-0.269
ete-0.259
oni-0.248
or-0.245
Tokenos
Token ID418
Feature activation-4.75e-02

Pos logit contributions
+0.147
ete+0.125
oured+0.124
iane+0.123
ory+0.117
Neg logit contributions
 ghosts-0.127
 staying-0.121
 breathing-0.119
 snowy-0.112
 cob-0.111
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
+0.105
oured+0.092
iane+0.092
ete+0.092
um+0.086
Neg logit contributions
 ghosts-0.070
 staying-0.063
 snowy-0.061
 cob-0.061
 breathing-0.061
Token After
Token ID2293
Feature activation-3.79e-02

Pos logit contributions
 ghosts+0.015
 raining+0.014
 staying+0.014
 Kim+0.014
 exciting+0.014
Neg logit contributions
-0.024
oured-0.023
or-0.023
ar-0.023
les-0.022
Token a
Token ID257
Feature activation-1.03e-02

Pos logit contributions
+0.081
oured+0.075
ete+0.071
or+0.070
iane+0.070
Neg logit contributions
 ghosts-0.056
 staying-0.051
 cob-0.051
 breathing-0.049
 snowy-0.049
Token she
Token ID673
Feature activation+5.17e-03

Pos logit contributions
+0.074
oured+0.066
ete+0.063
oni+0.063
ar+0.061
Neg logit contributions
 ghosts-0.045
 staying-0.041
 cob-0.040
 snowy-0.039
 exciting-0.038
Token could
Token ID714
Feature activation+9.39e-03

Pos logit contributions
 ghosts+0.007
 cob+0.007
 snowy+0.006
 exciting+0.006
 exc+0.006
Neg logit contributions
-0.011
iane-0.010
oured-0.010
oni-0.009
ete-0.009
Token.
Token ID13
Feature activation+8.55e-03

Pos logit contributions
 ghosts+0.012
 staying+0.012
 exc+0.011
 breathing+0.011
 than+0.011
Neg logit contributions
-0.021
oured-0.019
ted-0.018
oni-0.018
iane-0.018
Token A
Token ID317
Feature activation-1.75e-02

Pos logit contributions
 ghosts+0.014
 staying+0.013
 cob+0.013
 snowy+0.013
 breathing+0.013
Neg logit contributions
-0.015
oured-0.014
oni-0.014
ete-0.013
 course-0.013
Token few
Token ID1178
Feature activation+1.12e-02

Pos logit contributions
+0.039
oured+0.036
ete+0.035
oni+0.035
ted+0.034
Neg logit contributions
 ghosts-0.023
 cob-0.022
 snowy-0.021
 exciting-0.021
 staying-0.020
Token minutes
Token ID2431
Feature activation+1.75e-01

Pos logit contributions
 ghosts+0.014
 cob+0.012
 staying+0.012
 exciting+0.011
 snowy+0.011
Neg logit contributions
-0.028
oured-0.025
oni-0.024
ete-0.024
iane-0.024
Token later
Token ID1568
Feature activation+3.01e-02

Pos logit contributions
 ghosts+0.242
 staying+0.232
 cob+0.212
 snowy+0.207
 raining+0.206
Neg logit contributions
-0.376
oni-0.329
oured-0.323
cient-0.321
ete-0.315
Token,
Token ID11
Feature activation-5.75e-03

Pos logit contributions
 ghosts+0.045
 staying+0.042
 oh+0.038
 than+0.038
 cob+0.038
Neg logit contributions
-0.060
oured-0.055
oni-0.054
cient-0.053
ete-0.053
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
+0.013
oured+0.011
ete+0.011
oni+0.011
ar+0.011
Neg logit contributions
 ghosts-0.008
 staying-0.007
 cob-0.007
 snowy-0.007
 exciting-0.007
Token creature
Token ID7185
Feature activation-2.34e-02

Pos logit contributions
+0.026
oured+0.024
ete+0.023
or+0.022
oni+0.022
Neg logit contributions
 ghosts-0.021
 cob-0.019
 snowy-0.018
 exciting-0.017
 staying-0.017
Token pulled
Token ID5954
Feature activation+1.89e-04

Pos logit contributions
+0.048
oured+0.041
ete+0.039
iane+0.039
oni+0.039
Neg logit contributions
 ghosts-0.037
 cob-0.034
 staying-0.034
 snowy-0.033
 exciting-0.033
Token would
Token ID561
Feature activation-6.56e-02

Pos logit contributions
 ghosts+0.019
 outage+0.017
 cob+0.017
 than+0.016
 staying+0.016
Neg logit contributions
-0.027
ted-0.024
oured-0.024
ete-0.023
 course-0.023
Token sh
Token ID427
Feature activation+8.71e-02

Pos logit contributions
+0.136
oured+0.119
ete+0.116
oni+0.112
or+0.111
Neg logit contributions
 ghosts-0.099
 cob-0.089
 staying-0.088
 snowy-0.086
 breathing-0.085
Tokeniver
Token ID1428
Feature activation+3.45e-02

Pos logit contributions
 ghosts+0.126
 cob+0.112
 staying+0.109
 snowy+0.106
 breathing+0.104
Neg logit contributions
-0.193
oured-0.170
ete-0.164
or-0.161
ar-0.159
Token every
Token ID790
Feature activation-3.50e-02

Pos logit contributions
 ghosts+0.059
 cob+0.056
 staying+0.055
 exc+0.054
 breathing+0.053
Neg logit contributions
-0.059
oured-0.051
ete-0.050
or-0.050
oni-0.050
Token few
Token ID1178
Feature activation+9.88e-03

Pos logit contributions
+0.066
oured+0.057
ete+0.056
 course+0.055
or+0.054
Neg logit contributions
 ghosts-0.058
 staying-0.053
 cob-0.052
 breathing-0.050
 exc-0.050
Token minutes
Token ID2431
Feature activation+1.75e-01

Pos logit contributions
 ghosts+0.012
 staying+0.012
 outage+0.012
 exc+0.012
 breathing+0.011
Neg logit contributions
-0.020
les-0.019
 course-0.018
ted-0.018
ete-0.018
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
 ghosts+0.292
 cob+0.268
 staying+0.265
 snowy+0.259
 breathing+0.256
Neg logit contributions
-0.336
oured-0.292
ete-0.283
oni-0.274
or-0.273
Token Just
Token ID2329
Feature activation-1.39e-02

Pos logit contributions
 ghosts+0.026
 cob+0.024
 staying+0.024
 snowy+0.023
 breathing+0.023
Neg logit contributions
-0.019
oured-0.015
ete-0.015
oni-0.014
or-0.014
Token then
Token ID788
Feature activation+2.69e-02

Pos logit contributions
+0.023
oured+0.021
 course+0.020
ete+0.020
ted+0.019
Neg logit contributions
 ghosts-0.025
 outage-0.022
 cob-0.021
 staying-0.021
 cigarettes-0.021
Token,
Token ID11
Feature activation+1.31e-03

Pos logit contributions
 ghosts+0.058
 cob+0.055
 staying+0.054
 snowy+0.053
 exc+0.053
Neg logit contributions
-0.036
oured-0.030
ete-0.029
ar-0.029
or-0.028
Token he
Token ID339
Feature activation+2.74e-03

Pos logit contributions
 ghosts+0.002
 cob+0.002
 staying+0.002
 snowy+0.002
 breathing+0.002
Neg logit contributions
-0.003
oured-0.002
ete-0.002
oni-0.002
iane-0.002
Token toys
Token ID14958
Feature activation+2.12e-02

Pos logit contributions
 ghosts+0.065
 snowy+0.056
 staying+0.056
 cob+0.055
 exc+0.053
Neg logit contributions
-0.113
oured-0.104
or-0.099
ian-0.098
les-0.097
Token and
Token ID290
Feature activation+1.59e-02

Pos logit contributions
 ghosts+0.031
 staying+0.029
 cob+0.029
 snowy+0.028
 breathing+0.028
Neg logit contributions
-0.044
oured-0.039
ete-0.038
oni-0.037
ar-0.037
Token for
Token ID329
Feature activation-3.62e-02

Pos logit contributions
 ghosts+0.026
 cob+0.024
 staying+0.024
 snowy+0.024
 breathing+0.023
Neg logit contributions
-0.030
oured-0.027
ete-0.026
or-0.025
oni-0.025
Token a
Token ID257
Feature activation-1.20e-02

Pos logit contributions
+0.072
oured+0.064
ete+0.062
oni+0.060
iane+0.060
Neg logit contributions
 ghosts-0.059
 staying-0.054
 cob-0.053
 breathing-0.051
 snowy-0.051
Token few
Token ID1178
Feature activation+1.16e-02

Pos logit contributions
+0.023
oured+0.020
ete+0.020
oni+0.020
als+0.019
Neg logit contributions
 ghosts-0.020
 cob-0.019
 staying-0.018
 snowy-0.018
 breathing-0.018
Token minutes
Token ID2431
Feature activation+1.75e-01

Pos logit contributions
 ghosts+0.012
 staying+0.010
 cob+0.010
 snowy+0.010
 breathing+0.009
Neg logit contributions
-0.030
oured-0.027
 course-0.026
ete-0.026
or-0.026
Token she
Token ID673
Feature activation+2.97e-03

Pos logit contributions
 ghosts+0.281
 staying+0.260
 cob+0.256
 breathing+0.249
 snowy+0.248
Neg logit contributions
-0.343
oured-0.299
ete-0.293
oni-0.285
iane-0.280
Token did
Token ID750
Feature activation-8.54e-03

Pos logit contributions
 ghosts+0.004
 cob+0.004
 staying+0.004
 exc+0.004
 snowy+0.004
Neg logit contributions
-0.006
ete-0.005
 course-0.005
oured-0.005
oni-0.005
Token,
Token ID11
Feature activation+1.63e-03

Pos logit contributions
+0.018
oured+0.017
ete+0.016
oni+0.016
als+0.016
Neg logit contributions
 ghosts-0.012
 staying-0.011
 cob-0.011
 breathing-0.011
 exciting-0.011
Token but
Token ID475
Feature activation+3.42e-02

Pos logit contributions
 ghosts+0.002
 cob+0.001
 staying+0.001
 snowy+0.001
 breathing+0.001
Neg logit contributions
-0.004
oured-0.004
ete-0.004
oni-0.004
or-0.004
Token then
Token ID788
Feature activation+2.47e-02

Pos logit contributions
 ghosts+0.051
 cob+0.046
 snowy+0.045
 breathing+0.045
 staying+0.045
Neg logit contributions
-0.071
oured-0.062
ete-0.060
or-0.059
ar-0.059
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
+0.072
oured+0.063
ete+0.062
oni+0.060
iane+0.059
Neg logit contributions
 ghosts-0.052
 staying-0.049
 cob-0.047
 than-0.046
 exciting-0.046
Token So
Token ID1406
Feature activation+7.63e-03

Pos logit contributions
 ghosts+0.019
 staying+0.018
 cob+0.017
 exc+0.017
 breathing+0.017
Neg logit contributions
-0.023
oni-0.022
oured-0.021
 course-0.021
igh-0.021
Token,
Token ID11
Feature activation+1.04e-03

Pos logit contributions
 ghosts+0.011
 staying+0.010
 cob+0.010
 snowy+0.010
 exciting+0.009
Neg logit contributions
-0.016
oured-0.014
oni-0.014
ete-0.014
 course-0.013
Token every
Token ID790
Feature activation-3.44e-02

Pos logit contributions
 ghosts+0.001
 staying+0.001
 cob+0.001
 breathing+0.001
 snowy+0.001
Neg logit contributions
-0.002
oured-0.002
oni-0.002
ete-0.002
 course-0.002
Token few
Token ID1178
Feature activation+1.11e-02

Pos logit contributions
+0.073
oured+0.064
ete+0.062
or+0.060
oni+0.060
Neg logit contributions
 ghosts-0.051
 cob-0.046
 staying-0.046
 snowy-0.045
 breathing-0.044
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.016
 cob+0.015
 staying+0.015
 snowy+0.014
 breathing+0.014
Neg logit contributions
-0.024
oured-0.021
ete-0.020
oni-0.020
or-0.020
Token she
Token ID673
Feature activation+6.13e-03

Pos logit contributions
 ghosts+0.251
 staying+0.235
 cob+0.228
 snowy+0.217
 raining+0.215
Neg logit contributions
-0.360
oured-0.320
ete-0.318
oni-0.315
iane-0.305
Token would
Token ID561
Feature activation-6.62e-02

Pos logit contributions
 ghosts+0.009
 cob+0.009
 staying+0.009
 exc+0.008
 snowy+0.008
Neg logit contributions
-0.012
 trigger-0.010
ete-0.010
iane-0.010
oni-0.010
Token shout
Token ID9191
Feature activation-1.73e-03

Pos logit contributions
+0.139
oni+0.128
oured+0.127
ted+0.125
iane+0.123
Neg logit contributions
 exc-0.090
 staying-0.087
 exciting-0.084
 than-0.080
 ghosts-0.080
Token:
Token ID25
Feature activation+4.88e-02

Pos logit contributions
+0.004
oured+0.003
ete+0.003
 course+0.003
oni+0.003
Neg logit contributions
 ghosts-0.003
 cob-0.002
 staying-0.002
 breathing-0.002
 exciting-0.002
Token '
Token ID705
Feature activation+5.83e-02

Pos logit contributions
 staying+0.057
 ghosts+0.056
 exc+0.052
 breathing+0.052
 cob+0.052
Neg logit contributions
-0.111
oured-0.103
ete-0.100
iane-0.099
les-0.097
Token rain
Token ID6290
Feature activation+6.19e-02

Pos logit contributions
+0.012
oured+0.011
ete+0.011
oni+0.010
or+0.010
Neg logit contributions
 ghosts-0.009
 cob-0.008
 staying-0.008
 snowy-0.008
 breathing-0.007
Token for
Token ID329
Feature activation-3.63e-02

Pos logit contributions
 ghosts+0.099
 cob+0.090
 staying+0.090
 snowy+0.088
 breathing+0.086
Neg logit contributions
-0.123
oured-0.109
ete-0.106
oni-0.102
or-0.101
Token a
Token ID257
Feature activation-1.18e-02

Pos logit contributions
+0.071
oured+0.062
ete+0.061
oni+0.059
 course+0.058
Neg logit contributions
 ghosts-0.058
 cob-0.053
 staying-0.053
 snowy-0.052
 breathing-0.051
Token few
Token ID1178
Feature activation+1.06e-02

Pos logit contributions
+0.025
oured+0.022
ete+0.021
oni+0.021
iane+0.021
Neg logit contributions
 ghosts-0.018
 cob-0.016
 staying-0.016
 snowy-0.016
 breathing-0.015
Token more
Token ID517
Feature activation+5.62e-02

Pos logit contributions
 ghosts+0.010
 staying+0.009
 cob+0.009
 breathing+0.009
 snowy+0.008
Neg logit contributions
-0.027
 course-0.024
oured-0.024
ete-0.024
ar-0.024
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.074
 cob+0.066
 staying+0.065
 cigarettes+0.063
 snowy+0.062
Neg logit contributions
-0.125
 course-0.107
oured-0.106
les-0.106
ar-0.105
Token before
Token ID878
Feature activation-1.75e-02

Pos logit contributions
 ghosts+0.267
 staying+0.248
 cob+0.241
 breathing+0.237
 snowy+0.236
Neg logit contributions
-0.355
oured-0.312
ete-0.305
oni-0.295
iane-0.294
Token finally
Token ID3443
Feature activation-1.48e-02

Pos logit contributions
+0.036
oured+0.032
ete+0.031
or+0.030
oni+0.030
Neg logit contributions
 ghosts-0.027
 staying-0.024
 cob-0.024
 snowy-0.023
 breathing-0.023
Token saying
Token ID2282
Feature activation-1.19e-02

Pos logit contributions
+0.027
oured+0.024
ete+0.023
oni+0.022
iane+0.022
Neg logit contributions
 ghosts-0.026
 staying-0.024
 cob-0.024
 snowy-0.023
 breathing-0.023
Token goodbye
Token ID24829
Feature activation-2.60e-02

Pos logit contributions
+0.019
oured+0.016
iane+0.015
ete+0.015
oni+0.015
Neg logit contributions
 ghosts-0.024
 cob-0.022
 staying-0.022
 exciting-0.022
 snowy-0.022
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
+0.056
oured+0.049
ete+0.047
 course+0.046
iane+0.046
Neg logit contributions
 ghosts-0.038
 staying-0.035
 cob-0.034
 breathing-0.034
 snowy-0.033
Token animal
Token ID5044
Feature activation+1.25e-03

Pos logit contributions
 ghosts+0.071
 cob+0.067
 staying+0.059
 snowy+0.059
 exciting+0.057
Neg logit contributions
-0.118
ete-0.103
oured-0.103
oni-0.098
iane-0.098
Token.
Token ID13
Feature activation+1.45e-02

Pos logit contributions
 ghosts+0.002
 cob+0.002
 staying+0.002
 than+0.002
 breathing+0.002
Neg logit contributions
-0.003
oured-0.002
ete-0.002
iane-0.002
oni-0.002
Token After
Token ID2293
Feature activation-3.58e-02

Pos logit contributions
 ghosts+0.019
 raining+0.018
 staying+0.018
 Kim+0.017
 exciting+0.017
Neg logit contributions
-0.027
oured-0.026
igh-0.026
oni-0.026
 course-0.026
Token a
Token ID257
Feature activation-9.42e-03

Pos logit contributions
+0.075
oured+0.073
or+0.070
les+0.070
ete+0.069
Neg logit contributions
 ghosts-0.051
 staying-0.048
 cob-0.045
 breathing-0.044
 exciting-0.044
Token few
Token ID1178
Feature activation+1.15e-02

Pos logit contributions
+0.018
oured+0.016
oni+0.015
als+0.015
ted+0.015
Neg logit contributions
 ghosts-0.015
 cob-0.015
 exciting-0.015
 snowy-0.015
 sunny-0.015
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.011
 cob+0.009
 staying+0.009
 snowy+0.009
 breathing+0.008
Neg logit contributions
-0.031
oured-0.028
ete-0.027
oni-0.027
iane-0.027
Token,
Token ID11
Feature activation+1.41e-03

Pos logit contributions
 staying+0.252
 ghosts+0.247
 cob+0.232
 raining+0.231
 snowy+0.225
Neg logit contributions
-0.339
cient-0.302
oured-0.300
lor-0.295
 trigger-0.294
Token Amber
Token ID21896
Feature activation-6.52e-02

Pos logit contributions
 ghosts+0.002
 exciting+0.001
 sunny+0.001
 staying+0.001
 oh+0.001
Neg logit contributions
-0.004
oured-0.003
igh-0.003
eg-0.003
oni-0.003
Token had
Token ID550
Feature activation+1.76e-02

Pos logit contributions
+0.153
 trigger+0.132
oured+0.128
 course+0.127
cient+0.125
Neg logit contributions
 cob-0.078
 staying-0.074
 ghosts-0.074
 than-0.074
 snowy-0.073
Token a
Token ID257
Feature activation-9.51e-03

Pos logit contributions
 exciting+0.024
 ghosts+0.024
 staying+0.024
 cob+0.024
 snowy+0.023
Neg logit contributions
-0.038
or-0.033
cient-0.033
 trigger-0.033
oured-0.032
Token weight
Token ID3463
Feature activation-2.56e-03

Pos logit contributions
+0.020
ted+0.020
or+0.020
als+0.020
oured+0.019
Neg logit contributions
 cob-0.013
 exciting-0.012
 snowy-0.012
 ghosts-0.012
 sunny-0.011
Token bake
Token ID28450
Feature activation-2.55e-02

Pos logit contributions
+0.012
oured+0.011
ete+0.010
or+0.010
ar+0.010
Neg logit contributions
 ghosts-0.010
 cob-0.010
 staying-0.009
 exc-0.009
 snowy-0.009
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
+0.054
oured+0.048
ete+0.047
iane+0.045
or+0.045
Neg logit contributions
 ghosts-0.037
 cob-0.034
 staying-0.034
 snowy-0.033
 exciting-0.032
Token After
Token ID2293
Feature activation-3.85e-02

Pos logit contributions
 ghosts+0.024
 cob+0.023
 staying+0.022
 snowy+0.022
 exc+0.021
Neg logit contributions
-0.031
oured-0.028
ete-0.027
or-0.027
oni-0.027
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
+0.083
oured+0.082
or+0.079
ete+0.079
les+0.077
Neg logit contributions
 ghosts-0.051
 staying-0.048
 cob-0.047
 snowy-0.044
 breathing-0.043
Token few
Token ID1178
Feature activation+8.48e-03

Pos logit contributions
+0.023
oured+0.020
ete+0.020
or+0.019
oni+0.019
Neg logit contributions
 ghosts-0.021
 cob-0.020
 staying-0.019
 snowy-0.019
 breathing-0.019
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.007
 cob+0.005
 staying+0.005
 snowy+0.005
 breathing+0.005
Neg logit contributions
-0.024
oured-0.022
ete-0.021
oni-0.021
 course-0.021
Token,
Token ID11
Feature activation+2.49e-03

Pos logit contributions
 ghosts+0.322
 staying+0.303
 cob+0.302
 snowy+0.292
 breathing+0.289
Neg logit contributions
-0.296
oured-0.255
ete-0.248
oni-0.239
iane-0.235
Token the
Token ID262
Feature activation-8.67e-03

Pos logit contributions
 ghosts+0.003
 cob+0.002
 staying+0.002
 snowy+0.002
 exciting+0.002
Neg logit contributions
-0.006
oured-0.006
ete-0.006
or-0.005
oni-0.005
Token cookies
Token ID14746
Feature activation-2.19e-02

Pos logit contributions
+0.017
oured+0.016
ete+0.016
or+0.015
iane+0.015
Neg logit contributions
 ghosts-0.013
 cob-0.013
 staying-0.012
 snowy-0.012
 breathing-0.012
Token were
Token ID547
Feature activation+2.23e-02

Pos logit contributions
+0.039
ete+0.038
oured+0.037
or+0.035
lor+0.034
Neg logit contributions
 cob-0.034
 ghosts-0.034
 staying-0.033
 snowy-0.031
 exc-0.031
Token ready
Token ID3492
Feature activation-2.52e-02

Pos logit contributions
 ghosts+0.038
 cob+0.034
 staying+0.034
 snowy+0.033
 breathing+0.033
Neg logit contributions
-0.042
oured-0.037
oni-0.035
ete-0.035
um-0.034
Token noises
Token ID26782
Feature activation+7.29e-02

Pos logit contributions
 ghosts+0.011
 staying+0.010
 snowy+0.010
 exc+0.010
 cob+0.010
Neg logit contributions
-0.023
oured-0.022
oni-0.021
ete-0.021
les-0.021
Token.
Token ID13
Feature activation+1.53e-02

Pos logit contributions
 ghosts+0.116
 cob+0.106
 staying+0.106
 snowy+0.103
 breathing+0.101
Neg logit contributions
-0.144
oured-0.127
ete-0.122
oni-0.119
or-0.118
Token After
Token ID2293
Feature activation-3.60e-02

Pos logit contributions
 ghosts+0.028
 cob+0.026
 staying+0.026
 snowy+0.026
 breathing+0.025
Neg logit contributions
-0.027
oured-0.023
ete-0.022
oni-0.021
or-0.021
Token a
Token ID257
Feature activation-1.41e-02

Pos logit contributions
+0.073
oured+0.065
ete+0.063
or+0.061
oni+0.061
Neg logit contributions
 ghosts-0.057
 cob-0.051
 staying-0.051
 breathing-0.050
 snowy-0.050
Token few
Token ID1178
Feature activation+9.23e-03

Pos logit contributions
+0.026
oured+0.022
ete+0.022
oni+0.021
or+0.021
Neg logit contributions
 ghosts-0.025
 cob-0.023
 staying-0.023
 snowy-0.022
 breathing-0.022
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.009
 staying+0.009
 exc+0.008
 snowy+0.008
 cob+0.008
Neg logit contributions
-0.023
 course-0.021
oured-0.020
ar-0.020
les-0.020
Token,
Token ID11
Feature activation+3.57e-04

Pos logit contributions
 ghosts+0.338
 cob+0.314
 staying+0.313
 snowy+0.306
 breathing+0.304
Neg logit contributions
-0.287
oured-0.244
ete-0.236
oni-0.227
or-0.224
Token the
Token ID262
Feature activation-9.82e-03

Pos logit contributions
 ghosts+0.001
 cob+0.001
 staying+0.000
 snowy+0.000
 breathing+0.000
Neg logit contributions
-0.001
oured-0.001
ete-0.001
oni-0.001
or-0.001
Token doctor
Token ID6253
Feature activation+1.40e-03

Pos logit contributions
+0.020
oured+0.017
ete+0.017
oni+0.016
or+0.016
Neg logit contributions
 ghosts-0.015
 staying-0.014
 cob-0.014
 snowy-0.013
 breathing-0.013
Token came
Token ID1625
Feature activation+1.68e-02

Pos logit contributions
 ghosts+0.002
 cob+0.002
 staying+0.002
 breathing+0.002
 snowy+0.002
Neg logit contributions
-0.003
ete-0.002
oured-0.002
 course-0.002
iane-0.002
Token back
Token ID736
Feature activation-7.80e-03

Pos logit contributions
 ghosts+0.031
 cob+0.028
 staying+0.028
 snowy+0.028
 breathing+0.027
Neg logit contributions
-0.030
oured-0.026
ete-0.025
or-0.024
oni-0.024
Token,
Token ID11
Feature activation+9.27e-04

Pos logit contributions
 ghosts+0.038
 cob+0.037
 staying+0.036
 breathing+0.035
 than+0.034
Neg logit contributions
-0.050
oured-0.045
ete-0.044
iane-0.042
oni-0.041
Token but
Token ID475
Feature activation+3.55e-02

Pos logit contributions
 ghosts+0.001
 snowy+0.001
 cob+0.001
 staying+0.001
 outage+0.001
Neg logit contributions
-0.002
oured-0.002
 course-0.002
ian-0.002
ete-0.002
Token after
Token ID706
Feature activation-1.17e-02

Pos logit contributions
 ghosts+0.072
 cob+0.064
 staying+0.064
 outage+0.063
 breathing+0.063
Neg logit contributions
-0.055
ete-0.045
oured-0.045
on-0.043
or-0.043
Token a
Token ID257
Feature activation-1.04e-02

Pos logit contributions
+0.023
oured+0.020
ete+0.020
or+0.019
oni+0.019
Neg logit contributions
 ghosts-0.019
 cob-0.018
 staying-0.017
 snowy-0.017
 breathing-0.017
Token few
Token ID1178
Feature activation+9.91e-03

Pos logit contributions
+0.018
oured+0.016
ete+0.015
oni+0.014
or+0.014
Neg logit contributions
 ghosts-0.019
 cob-0.018
 staying-0.018
 snowy-0.017
 breathing-0.017
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.015
 staying+0.013
 snowy+0.013
 cob+0.013
 exc+0.013
Neg logit contributions
-0.021
 course-0.018
oured-0.018
ar-0.017
les-0.017
Token he
Token ID339
Feature activation-2.07e-03

Pos logit contributions
 ghosts+0.288
 staying+0.278
 cob+0.270
 breathing+0.265
 snowy+0.263
Neg logit contributions
-0.313
oured-0.279
ete-0.270
iane-0.265
cient-0.262
Token managed
Token ID5257
Feature activation-6.59e-02

Pos logit contributions
+0.004
ete+0.004
oured+0.004
iane+0.004
cient+0.003
Neg logit contributions
 cob-0.003
 ghosts-0.003
 exc-0.003
 staying-0.003
 snowy-0.003
Token to
Token ID284
Feature activation-6.57e-03

Pos logit contributions
+0.166
oured+0.146
cient+0.145
als+0.145
ete+0.141
Neg logit contributions
 staying-0.070
 breathing-0.066
 ghosts-0.065
 raining-0.064
 cob-0.063
Token do
Token ID466
Feature activation+4.49e-02

Pos logit contributions
+0.014
ete+0.013
oured+0.013
oni+0.013
ted+0.013
Neg logit contributions
 ghosts-0.008
 cob-0.008
 exc-0.008
 staying-0.008
 breathing-0.007
Token it
Token ID340
Feature activation+5.50e-03

Pos logit contributions
 ghosts+0.065
 staying+0.060
 cob+0.059
 breathing+0.058
 exciting+0.056
Neg logit contributions
-0.096
oured-0.087
ete-0.084
iane-0.083
als-0.082
Token home
Token ID1363
Feature activation-6.27e-02

Pos logit contributions
+0.105
oured+0.092
ete+0.088
oni+0.086
ian+0.085
Neg logit contributions
 ghosts-0.077
 staying-0.071
 cob-0.069
 snowy-0.068
 exciting-0.068
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
+0.132
oured+0.116
ete+0.113
oni+0.110
iane+0.109
Neg logit contributions
 ghosts-0.093
 staying-0.086
 cob-0.084
 snowy-0.082
 breathing-0.082
Token After
Token ID2293
Feature activation-3.67e-02

Pos logit contributions
 ghosts+0.026
 staying+0.024
 cob+0.024
 snowy+0.024
 exciting+0.023
Neg logit contributions
-0.026
oured-0.023
les-0.021
oni-0.021
als-0.021
Token a
Token ID257
Feature activation-8.12e-03

Pos logit contributions
+0.079
oured+0.074
ete+0.069
or+0.069
iane+0.069
Neg logit contributions
 ghosts-0.055
 staying-0.050
 cob-0.048
 breathing-0.047
 exciting-0.046
Token few
Token ID1178
Feature activation+1.03e-02

Pos logit contributions
+0.015
oured+0.014
ete+0.013
oni+0.013
als+0.013
Neg logit contributions
 ghosts-0.014
 cob-0.013
 exciting-0.013
 snowy-0.013
 staying-0.013
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.011
 cob+0.010
 staying+0.010
 snowy+0.009
 breathing+0.009
Neg logit contributions
-0.026
oured-0.023
ete-0.023
 course-0.022
ar-0.022
Token of
Token ID286
Feature activation+4.09e-02

Pos logit contributions
 ghosts+0.301
 staying+0.287
 cob+0.276
 breathing+0.271
 snowy+0.268
Neg logit contributions
-0.315
oured-0.272
oni-0.262
ete-0.260
iane-0.256
Token arguing
Token ID11810
Feature activation-1.85e-02

Pos logit contributions
 ghosts+0.054
 cob+0.049
 snowy+0.048
 staying+0.047
 exc+0.047
Neg logit contributions
-0.089
 course-0.079
oured-0.077
ete-0.076
oni-0.074
Token,
Token ID11
Feature activation+2.03e-03

Pos logit contributions
+0.031
oured+0.027
ete+0.026
oni+0.025
or+0.024
Neg logit contributions
 ghosts-0.035
 staying-0.033
 cob-0.033
 snowy-0.032
 breathing-0.032
Token Sam
Token ID3409
Feature activation-2.43e-02

Pos logit contributions
 ghosts+0.003
 staying+0.002
 snowy+0.002
 exciting+0.002
 cob+0.002
Neg logit contributions
-0.005
oured-0.004
ete-0.004
oni-0.004
iane-0.004
Token knew
Token ID2993
Feature activation-2.90e-02

Pos logit contributions
+0.054
iane+0.043
oured+0.043
ete+0.042
 course+0.042
Neg logit contributions
 ghosts-0.035
 staying-0.033
 cob-0.033
 snowy-0.032
 breathing-0.032
Token
Token ID198
Feature activation+1.46e-02

Pos logit contributions
 ghosts+0.045
 cob+0.042
 staying+0.041
 snowy+0.041
 breathing+0.040
Neg logit contributions
-0.040
oured-0.035
ete-0.033
oni-0.032
or-0.032
Token
Token ID198
Feature activation+1.42e-02

Pos logit contributions
 ghosts+0.029
 cob+0.027
 staying+0.026
 snowy+0.026
 breathing+0.025
Neg logit contributions
-0.024
ete-0.020
oured-0.020
or-0.019
oni-0.019
TokenAfter
Token ID3260
Feature activation-2.97e-02

Pos logit contributions
 ghosts+0.028
 cob+0.026
 staying+0.025
 breathing+0.025
 snowy+0.024
Neg logit contributions
-0.024
ete-0.020
oured-0.019
or-0.019
 course-0.019
Token a
Token ID257
Feature activation-9.53e-03

Pos logit contributions
+0.062
oured+0.057
ete+0.056
or+0.055
oni+0.054
Neg logit contributions
 ghosts-0.044
 staying-0.040
 cob-0.039
 breathing-0.038
 snowy-0.038
Token few
Token ID1178
Feature activation+9.13e-03

Pos logit contributions
+0.018
oured+0.015
ete+0.015
ar+0.014
oni+0.014
Neg logit contributions
 ghosts-0.017
 staying-0.015
 cob-0.015
 snowy-0.015
 breathing-0.015
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.011
 staying+0.010
 exc+0.010
 cob+0.010
 snowy+0.010
Neg logit contributions
-0.021
 course-0.019
oured-0.018
ar-0.018
ted-0.018
Token,
Token ID11
Feature activation+5.92e-03

Pos logit contributions
 ghosts+0.340
 cob+0.316
 staying+0.315
 snowy+0.308
 breathing+0.305
Neg logit contributions
-0.284
oured-0.242
ete-0.233
oni-0.224
or-0.222
Token Tim
Token ID5045
Feature activation+3.41e-02

Pos logit contributions
 ghosts+0.006
 cob+0.005
 staying+0.005
 snowy+0.005
 exciting+0.004
Neg logit contributions
-0.015
oured-0.014
ete-0.014
or-0.014
oni-0.014
Tokenmy
Token ID1820
Feature activation-3.02e-02

Pos logit contributions
 ghosts+0.043
 breathing+0.040
 cob+0.040
 staying+0.039
 snowy+0.039
Neg logit contributions
-0.082
oured-0.068
ar-0.067
ete-0.067
als-0.067
Token found
Token ID1043
Feature activation-1.09e-02

Pos logit contributions
+0.066
 course+0.056
oured+0.055
oni+0.053
ete+0.053
Neg logit contributions
 ghosts-0.043
 cob-0.042
 breathing-0.040
 staying-0.039
 snowy-0.039
Token the
Token ID262
Feature activation-6.63e-03

Pos logit contributions
+0.025
oured+0.022
ete+0.021
or+0.021
oni+0.021
Neg logit contributions
 ghosts-0.015
 cob-0.014
 exciting-0.013
 staying-0.013
 breathing-0.013
Token.
Token ID13
Feature activation+2.02e-02

Pos logit contributions
 ghosts+0.049
 cob+0.045
 staying+0.044
 than+0.044
 breathing+0.044
Neg logit contributions
-0.083
oured-0.072
ete-0.071
iane-0.070
oni-0.067
Token But
Token ID887
Feature activation+1.56e-02

Pos logit contributions
 raining+0.026
 ghosts+0.025
 Kim+0.024
 staying+0.024
 snowy+0.024
Neg logit contributions
-0.039
oured-0.038
u-0.038
or-0.037
ar-0.037
Token after
Token ID706
Feature activation-7.73e-03

Pos logit contributions
 ghosts+0.025
 staying+0.023
 cob+0.023
 snowy+0.022
 breathing+0.022
Neg logit contributions
-0.032
oured-0.028
ete-0.026
oni-0.026
or-0.026
Token a
Token ID257
Feature activation-3.00e-03

Pos logit contributions
+0.016
oured+0.014
ete+0.014
or+0.013
oni+0.013
Neg logit contributions
 ghosts-0.012
 cob-0.011
 staying-0.011
 snowy-0.011
 breathing-0.011
Token few
Token ID1178
Feature activation+1.07e-02

Pos logit contributions
+0.006
oured+0.005
oni+0.005
ete+0.005
iane+0.005
Neg logit contributions
 ghosts-0.005
 cob-0.005
 exciting-0.004
 snowy-0.004
 staying-0.004
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.013
 cob+0.011
 staying+0.011
 snowy+0.011
 breathing+0.011
Neg logit contributions
-0.025
oured-0.023
ete-0.022
 course-0.022
ar-0.022
Token,
Token ID11
Feature activation+5.99e-03

Pos logit contributions
 ghosts+0.277
 staying+0.266
 cob+0.258
 snowy+0.249
 breathing+0.247
Neg logit contributions
-0.336
oured-0.289
oni-0.280
cient-0.277
iane-0.277
Token something
Token ID1223
Feature activation+2.73e-02

Pos logit contributions
 ghosts+0.008
 exciting+0.007
 oh+0.007
 sunny+0.007
 interesting+0.007
Neg logit contributions
-0.014
oured-0.014
ives-0.012
aff-0.012
u-0.012
Token changed
Token ID3421
Feature activation+4.70e-02

Pos logit contributions
 ghosts+0.037
 staying+0.032
 cob+0.032
 snowy+0.031
 breathing+0.031
Neg logit contributions
-0.061
oured-0.055
ete-0.054
oni-0.052
or-0.052
Token.
Token ID13
Feature activation+1.97e-02

Pos logit contributions
 ghosts+0.077
 cob+0.072
 staying+0.071
 snowy+0.070
 breathing+0.070
Neg logit contributions
-0.094
oured-0.080
ete-0.077
oni-0.075
or-0.075
Token Suddenly
Token ID24975
Feature activation+1.82e-02

Pos logit contributions
 ghosts+0.025
 raining+0.025
 breathing+0.023
 Kim+0.023
 staying+0.022
Neg logit contributions
les-0.037
u-0.037
or-0.037
oured-0.037
-0.036
Token done
Token ID1760
Feature activation-1.48e-02

Pos logit contributions
 ghosts+0.011
 cob+0.010
 textures+0.009
 staying+0.009
 exc+0.009
Neg logit contributions
-0.016
oured-0.014
 course-0.013
um-0.013
ete-0.013
Token.
Token ID13
Feature activation+2.37e-02

Pos logit contributions
+0.029
oured+0.025
ete+0.024
oni+0.024
or+0.024
Neg logit contributions
 ghosts-0.024
 cob-0.022
 staying-0.022
 snowy-0.021
 breathing-0.021
Token After
Token ID2293
Feature activation-3.79e-02

Pos logit contributions
 ghosts+0.043
 staying+0.040
 cob+0.040
 snowy+0.040
 breathing+0.039
Neg logit contributions
-0.040
oured-0.036
oni-0.033
or-0.033
ete-0.033
Token a
Token ID257
Feature activation-5.54e-03

Pos logit contributions
+0.076
oured+0.069
ete+0.066
oni+0.065
or+0.065
Neg logit contributions
 ghosts-0.061
 staying-0.055
 cob-0.055
 snowy-0.053
 breathing-0.053
Token few
Token ID1178
Feature activation+7.29e-03

Pos logit contributions
+0.010
oured+0.009
ete+0.008
oni+0.008
or+0.008
Neg logit contributions
 ghosts-0.010
 cob-0.009
 staying-0.009
 snowy-0.009
 breathing-0.009
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.007
 staying+0.006
 cob+0.006
 breathing+0.006
 snowy+0.006
Neg logit contributions
-0.019
 course-0.017
oured-0.017
ar-0.016
ete-0.016
Token,
Token ID11
Feature activation+6.47e-03

Pos logit contributions
 ghosts+0.310
 staying+0.292
 cob+0.286
 snowy+0.279
 breathing+0.279
Neg logit contributions
-0.308
oured-0.264
ete-0.254
oni-0.253
iane-0.248
Token the
Token ID262
Feature activation-3.30e-04

Pos logit contributions
 ghosts+0.010
 staying+0.009
 cob+0.009
 snowy+0.008
 breathing+0.008
Neg logit contributions
-0.014
oured-0.013
ete-0.012
oni-0.012
or-0.012
Token rain
Token ID6290
Feature activation+5.98e-02

Pos logit contributions
+0.001
oured+0.001
 course+0.001
ete+0.001
als+0.001
Neg logit contributions
 ghosts-0.000
 staying-0.000
 cob-0.000
 breathing-0.000
 snowy-0.000
Token stopped
Token ID5025
Feature activation+5.47e-02

Pos logit contributions
 ghosts+0.117
 cob+0.109
 staying+0.108
 snowy+0.106
 breathing+0.105
Neg logit contributions
-0.097
oured-0.082
ete-0.081
oni-0.077
or-0.076
Token and
Token ID290
Feature activation+1.96e-02

Pos logit contributions
 ghosts+0.092
 cob+0.083
 staying+0.081
 exciting+0.080
 snowy+0.079
Neg logit contributions
-0.104
oured-0.088
ete-0.085
oni-0.084
or-0.081
Token dry
Token ID5894
Feature activation-2.93e-03

Pos logit contributions
+0.015
oured+0.014
ete+0.013
oni+0.013
or+0.013
Neg logit contributions
 ghosts-0.013
 cob-0.012
 staying-0.012
 snowy-0.011
 breathing-0.011
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
+0.006
oured+0.005
ete+0.005
oni+0.005
iane+0.005
Neg logit contributions
 ghosts-0.005
 cob-0.004
 staying-0.004
 snowy-0.004
 breathing-0.004
Token After
Token ID2293
Feature activation-3.88e-02

Pos logit contributions
 ghosts+0.018
 cob+0.016
 staying+0.016
 snowy+0.016
 breathing+0.016
Neg logit contributions
-0.019
oured-0.017
oni-0.016
ete-0.016
 course-0.016
Token a
Token ID257
Feature activation-1.35e-02

Pos logit contributions
+0.078
oured+0.071
ete+0.069
oni+0.068
or+0.067
Neg logit contributions
 ghosts-0.060
 staying-0.055
 cob-0.055
 breathing-0.053
 snowy-0.053
Token few
Token ID1178
Feature activation+1.14e-02

Pos logit contributions
+0.025
oured+0.022
oni+0.021
ete+0.021
als+0.021
Neg logit contributions
 ghosts-0.023
 cob-0.022
 snowy-0.021
 exciting-0.021
 staying-0.021
Token minutes
Token ID2431
Feature activation+1.74e-01

Pos logit contributions
 ghosts+0.010
 staying+0.008
 cob+0.008
 snowy+0.008
 breathing+0.008
Neg logit contributions
-0.031
oured-0.028
ete-0.028
 course-0.027
ar-0.027
Token,
Token ID11
Feature activation-4.74e-03

Pos logit contributions
 ghosts+0.308
 staying+0.290
 cob+0.286
 snowy+0.276
 breathing+0.276
Neg logit contributions
-0.308
oured-0.265
ete-0.257
oni-0.256
iane-0.247
Token it
Token ID340
Feature activation+6.65e-03

Pos logit contributions
+0.010
oured+0.009
ete+0.009
oni+0.009
or+0.008
Neg logit contributions
 ghosts-0.007
 cob-0.006
 staying-0.006
 snowy-0.006
 breathing-0.006
Token was
Token ID373
Feature activation-2.82e-03

Pos logit contributions
 ghosts+0.011
 cob+0.011
 staying+0.010
 snowy+0.010
 breathing+0.010
Neg logit contributions
-0.013
oured-0.011
ete-0.011
oni-0.010
or-0.010
Token ready
Token ID3492
Feature activation-2.09e-02

Pos logit contributions
+0.005
oured+0.005
ete+0.005
oni+0.004
um+0.004
Neg logit contributions
 ghosts-0.005
 cob-0.004
 staying-0.004
 breathing-0.004
 snowy-0.004
Token and
Token ID290
Feature activation+9.73e-03

Pos logit contributions
+0.042
oured+0.037
ete+0.036
oni+0.035
or+0.034
Neg logit contributions
 ghosts-0.033
 staying-0.031
 cob-0.030
 snowy-0.029
 breathing-0.029
Token film
Token ID2646
Feature activation-2.24e-02

Pos logit contributions
+0.004
oured+0.003
ete+0.003
or+0.003
iane+0.003
Neg logit contributions
 ghosts-0.002
 cob-0.002
 snowy-0.002
 exciting-0.002
 staying-0.002
Token.
Token ID13
Feature activation+1.50e-02

Pos logit contributions
+0.049
oured+0.044
ete+0.043
iane+0.041
or+0.041
Neg logit contributions
 ghosts-0.030
 staying-0.028
 cob-0.028
 exciting-0.027
 breathing-0.027
Token After
Token ID2293
Feature activation-3.68e-02

Pos logit contributions
 ghosts+0.022
 cob+0.021
 staying+0.021
 breathing+0.020
 snowy+0.020
Neg logit contributions
-0.028
oured-0.026
 course-0.025
or-0.025
oni-0.025
Token a
Token ID257
Feature activation-7.73e-03

Pos logit contributions
+0.078
oured+0.074
or+0.073
ete+0.072
les+0.071
Neg logit contributions
 ghosts-0.053
 staying-0.049
 cob-0.047
 breathing-0.045
 exciting-0.044
Token few
Token ID1178
Feature activation+9.97e-03

Pos logit contributions
+0.015
oured+0.013
igh+0.013
als+0.013
ete+0.013
Neg logit contributions
 ghosts-0.012
 cob-0.012
 exciting-0.012
 sunny-0.012
 snowy-0.012
Token minutes
Token ID2431
Feature activation+1.73e-01

Pos logit contributions
 ghosts+0.010
 cob+0.008
 staying+0.008
 snowy+0.008
 breathing+0.008
Neg logit contributions
-0.026
oured-0.024
ete-0.023
oni-0.023
or-0.023
Token,
Token ID11
Feature activation+3.08e-03

Pos logit contributions
 ghosts+0.294
 staying+0.282
 cob+0.277
 breathing+0.264
 snowy+0.262
Neg logit contributions
-0.315
oured-0.272
ete-0.266
oni-0.265
cient-0.258
Token Anna
Token ID11735
Feature activation+4.51e-02

Pos logit contributions
 ghosts+0.004
 cob+0.003
 staying+0.003
 snowy+0.003
 breathing+0.003
Neg logit contributions
-0.007
oured-0.007
ete-0.007
oni-0.006
or-0.006
Token realized
Token ID6939
Feature activation-1.40e-02

Pos logit contributions
 cob+0.062
 snowy+0.058
 staying+0.058
 ghosts+0.057
 breathing+0.057
Neg logit contributions
-0.097
 trigger-0.081
 course-0.081
iane-0.080
cient-0.080
Token that
Token ID326
Feature activation-1.21e-02

Pos logit contributions
+0.029
oured+0.026
ete+0.025
oni+0.024
ar+0.024
Neg logit contributions
 ghosts-0.022
 staying-0.020
 cob-0.020
 exciting-0.019
 breathing-0.019
Token the
Token ID262
Feature activation-3.27e-03

Pos logit contributions
+0.021
 course+0.018
ete+0.017
on+0.017
oni+0.017
Neg logit contributions
 ghosts-0.020
 outage-0.018
 snowy-0.018
 breathing-0.018
 cob-0.018
Token looking
Token ID2045
Feature activation+3.29e-02

Pos logit contributions
+0.075
oured+0.066
ete+0.064
oni+0.062
or+0.062
Neg logit contributions
 ghosts-0.055
 cob-0.050
 staying-0.050
 snowy-0.049
 breathing-0.048
Token around
Token ID1088
Feature activation+4.21e-03

Pos logit contributions
 ghosts+0.052
 cob+0.047
 staying+0.047
 snowy+0.046
 breathing+0.045
Neg logit contributions
-0.066
oured-0.058
ete-0.056
oni-0.055
or-0.054
Token for
Token ID329
Feature activation-3.62e-02

Pos logit contributions
 ghosts+0.008
 cob+0.008
 staying+0.008
 snowy+0.008
 breathing+0.007
Neg logit contributions
-0.007
oured-0.006
ete-0.006
oni-0.005
or-0.005
Token a
Token ID257
Feature activation-8.29e-03

Pos logit contributions
+0.061
 course+0.054
oured+0.053
ete+0.052
ar+0.051
Neg logit contributions
 ghosts-0.063
 outage-0.060
 cob-0.059
 staying-0.058
 exc-0.057
Token few
Token ID1178
Feature activation+1.05e-02

Pos logit contributions
+0.017
oured+0.015
ete+0.014
oni+0.014
or+0.014
Neg logit contributions
 ghosts-0.013
 staying-0.012
 cob-0.012
 breathing-0.011
 snowy-0.011
Token minutes
Token ID2431
Feature activation+1.73e-01

Pos logit contributions
 ghosts+0.013
 staying+0.012
 cob+0.011
 breathing+0.011
 outage+0.011
Neg logit contributions
-0.023
 course-0.022
ar-0.021
les-0.021
oured-0.020
Token,
Token ID11
Feature activation-1.10e-03

Pos logit contributions
 ghosts+0.351
 cob+0.327
 staying+0.323
 snowy+0.318
 breathing+0.315
Neg logit contributions
-0.273
oured-0.230
ete-0.221
or-0.211
oni-0.211
Token he
Token ID339
Feature activation+2.48e-03

Pos logit contributions
+0.002
oured+0.002
ete+0.002
oni+0.002
or+0.002
Neg logit contributions
 ghosts-0.002
 cob-0.002
 staying-0.002
 snowy-0.002
 breathing-0.002
Token heard
Token ID2982
Feature activation-1.63e-02

Pos logit contributions
 ghosts+0.003
 snowy+0.003
 cob+0.003
 staying+0.003
 breathing+0.003
Neg logit contributions
-0.006
oured-0.005
 course-0.005
oni-0.005
iane-0.005
Token a
Token ID257
Feature activation-8.47e-03

Pos logit contributions
+0.033
oured+0.029
ete+0.028
oni+0.027
or+0.027
Neg logit contributions
 ghosts-0.026
 cob-0.024
 staying-0.023
 snowy-0.023
 breathing-0.023
Token voice
Token ID3809
Feature activation+5.27e-02

Pos logit contributions
+0.019
oured+0.017
ete+0.017
oni+0.016
or+0.016
Neg logit contributions
 ghosts-0.011
 staying-0.010
 cob-0.010
 snowy-0.010
 breathing-0.009
Token practise
Token ID44811
Feature activation-4.82e-03

Pos logit contributions
 ghosts+0.004
 exc+0.004
 staying+0.004
 cob+0.004
 exciting+0.004
Neg logit contributions
-0.007
oured-0.006
ted-0.006
oni-0.006
als-0.006
Token.
Token ID13
Feature activation+2.31e-02

Pos logit contributions
+0.011
oured+0.010
ete+0.009
iane+0.009
als+0.009
Neg logit contributions
 ghosts-0.007
 staying-0.007
 breathing-0.006
 cob-0.006
 exciting-0.006
Token After
Token ID2293
Feature activation-3.50e-02

Pos logit contributions
 ghosts+0.033
 staying+0.032
 exciting+0.031
 snowy+0.030
 exc+0.030
Neg logit contributions
-0.042
oured-0.040
les-0.039
oni-0.038
als-0.038
Token a
Token ID257
Feature activation-2.10e-03

Pos logit contributions
+0.076
oured+0.074
iane+0.070
als+0.069
ete+0.069
Neg logit contributions
 ghosts-0.050
 staying-0.045
 breathing-0.043
 cob-0.042
 exciting-0.042
Token few
Token ID1178
Feature activation+1.08e-02

Pos logit contributions
+0.004
oured+0.004
ete+0.003
oni+0.003
als+0.003
Neg logit contributions
 ghosts-0.003
 cob-0.003
 exciting-0.003
 snowy-0.003
 sunny-0.003
Token minutes
Token ID2431
Feature activation+1.73e-01

Pos logit contributions
 ghosts+0.013
 cob+0.012
 staying+0.011
 snowy+0.011
 breathing+0.011
Neg logit contributions
-0.026
oured-0.023
ete-0.023
oni-0.022
or-0.022
Token,
Token ID11
Feature activation+8.98e-03

Pos logit contributions
 ghosts+0.260
 staying+0.253
 cob+0.239
 breathing+0.235
 raining+0.234
Neg logit contributions
-0.337
oured-0.298
cient-0.295
lor-0.291
oni-0.287
Token he
Token ID339
Feature activation+6.18e-03

Pos logit contributions
 ghosts+0.013
 staying+0.011
 cob+0.011
 snowy+0.011
 oh+0.011
Neg logit contributions
-0.019
oured-0.018
ete-0.017
les-0.017
ives-0.017
Token was
Token ID373
Feature activation+8.05e-03

Pos logit contributions
 ghosts+0.008
 cob+0.008
 staying+0.008
 exciting+0.008
 snowy+0.008
Neg logit contributions
-0.014
oured-0.011
cient-0.011
iane-0.011
ete-0.011
Token able
Token ID1498
Feature activation-3.08e-02

Pos logit contributions
 ghosts+0.012
 staying+0.012
 breathing+0.011
 cob+0.011
 exciting+0.011
Neg logit contributions
-0.017
ete-0.015
als-0.015
oured-0.014
or-0.014
Token to
Token ID284
Feature activation+1.75e-03

Pos logit contributions
+0.065
oured+0.057
ete+0.055
or+0.054
 course+0.054
Neg logit contributions
 ghosts-0.046
 staying-0.043
 cob-0.042
 breathing-0.042
 snowy-0.041
Token only
Token ID691
Feature activation+3.47e-03

Pos logit contributions
 ghosts+0.013
 staying+0.012
 exc+0.012
 cob+0.012
 snowy+0.012
Neg logit contributions
-0.018
oured-0.017
ete-0.017
iane-0.016
oni-0.016
Token watch
Token ID2342
Feature activation-1.80e-02

Pos logit contributions
 ghosts+0.006
 cob+0.005
 staying+0.005
 snowy+0.005
 breathing+0.005
Neg logit contributions
-0.007
oured-0.006
ete-0.006
oni-0.006
or-0.006
Token TV
Token ID3195
Feature activation-1.22e-02

Pos logit contributions
+0.038
oured+0.033
ete+0.032
oni+0.032
or+0.031
Neg logit contributions
 ghosts-0.026
 cob-0.024
 staying-0.024
 snowy-0.023
 breathing-0.023
Token for
Token ID329
Feature activation-3.79e-02

Pos logit contributions
+0.025
oured+0.022
ete+0.021
oni+0.021
or+0.021
Neg logit contributions
 ghosts-0.018
 cob-0.017
 staying-0.017
 snowy-0.016
 breathing-0.016
Token 30
Token ID1542
Feature activation+1.51e-02

Pos logit contributions
+0.074
oured+0.063
ete+0.062
oni+0.060
 course+0.059
Neg logit contributions
 ghosts-0.061
 cob-0.057
 staying-0.056
 snowy-0.055
 breathing-0.054
Token minutes
Token ID2431
Feature activation+1.73e-01

Pos logit contributions
 ghosts+0.016
 staying+0.014
 cob+0.014
 snowy+0.013
 breathing+0.013
Neg logit contributions
-0.036
les-0.033
oured-0.032
oni-0.032
 course-0.032
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
 ghosts+0.255
 staying+0.236
 cob+0.229
 breathing+0.225
 snowy+0.222
Neg logit contributions
-0.364
oured-0.321
ete-0.317
ar-0.304
iane-0.303
Token Lily
Token ID20037
Feature activation+2.16e-02

Pos logit contributions
 ghosts+0.027
 cob+0.024
 staying+0.024
 snowy+0.024
 breathing+0.023
Neg logit contributions
-0.031
oured-0.027
ete-0.026
oni-0.026
or-0.025
Token asked
Token ID1965
Feature activation+5.95e-02

Pos logit contributions
 ghosts+0.035
 cob+0.033
 staying+0.032
 snowy+0.031
 breathing+0.031
Neg logit contributions
-0.042
oured-0.037
ete-0.036
oni-0.034
or-0.034
Token Mitt
Token ID16627
Feature activation-5.58e-02

Pos logit contributions
 ghosts+0.095
 snowy+0.087
 cob+0.086
 staying+0.085
 outage+0.084
Neg logit contributions
-0.117
oured-0.101
ete-0.100
or-0.098
oni-0.098
Tokenens
Token ID641
Feature activation-1.81e-02

Pos logit contributions
+0.110
oured+0.098
ete+0.094
or+0.090
oni+0.088
Neg logit contributions
 ghosts-0.090
 cob-0.082
 staying-0.081
 snowy-0.079
 breathing-0.078
Token After
Token ID2293
Feature activation-3.69e-02

Pos logit contributions
 ghosts+0.021
 staying+0.019
 cob+0.019
 breathing+0.018
 raining+0.018
Neg logit contributions
-0.023
oni-0.022
oured-0.021
or-0.021
 course-0.020
Token searching
Token ID10342
Feature activation+4.94e-02

Pos logit contributions
+0.079
oured+0.071
oni+0.069
ete+0.068
or+0.067
Neg logit contributions
 ghosts-0.055
 staying-0.050
 cob-0.049
 exciting-0.047
 snowy-0.047
Token for
Token ID329
Feature activation-3.59e-02

Pos logit contributions
 ghosts+0.079
 cob+0.072
 staying+0.072
 snowy+0.070
 breathing+0.069
Neg logit contributions
-0.098
oured-0.086
ete-0.083
oni-0.081
or-0.080
Token a
Token ID257
Feature activation-9.02e-03

Pos logit contributions
+0.072
oured+0.063
ete+0.061
or+0.059
oni+0.059
Neg logit contributions
 ghosts-0.057
 cob-0.052
 staying-0.051
 snowy-0.050
 breathing-0.050
Token few
Token ID1178
Feature activation+1.04e-02

Pos logit contributions
+0.020
oured+0.018
oni+0.017
ete+0.017
iane+0.017
Neg logit contributions
 ghosts-0.013
 cob-0.012
 snowy-0.011
 staying-0.011
 exciting-0.011
Token minutes
Token ID2431
Feature activation+1.73e-01

Pos logit contributions
 ghosts+0.013
 staying+0.012
 cob+0.012
 snowy+0.012
 exc+0.012
Neg logit contributions
-0.023
 course-0.021
oured-0.021
ete-0.020
ar-0.020
Token,
Token ID11
Feature activation+1.29e-03

Pos logit contributions
 ghosts+0.304
 staying+0.291
 cob+0.277
 breathing+0.274
 snowy+0.269
Neg logit contributions
-0.309
oured-0.266
oni-0.258
ete-0.254
iane-0.248
Token they
Token ID484
Feature activation-1.77e-02

Pos logit contributions
 ghosts+0.002
 cob+0.002
 staying+0.002
 exciting+0.001
 snowy+0.001
Neg logit contributions
-0.003
oured-0.003
oni-0.003
ete-0.002
or-0.002
Token found
Token ID1043
Feature activation-1.01e-02

Pos logit contributions
+0.042
oni+0.034
oured+0.033
 course+0.033
 trigger+0.032
Neg logit contributions
 ghosts-0.023
 staying-0.021
 exciting-0.020
 cob-0.020
 snowy-0.020
Token an
Token ID281
Feature activation+1.12e-02

Pos logit contributions
+0.023
oured+0.020
oni+0.019
ete+0.019
or+0.019
Neg logit contributions
 ghosts-0.014
 cob-0.013
 exciting-0.012
 staying-0.012
 interesting-0.012
Token old
Token ID1468
Feature activation+3.43e-02

Pos logit contributions
 ghosts+0.015
 breathing+0.013
 staying+0.013
 snowy+0.012
 cigarettes+0.012
Neg logit contributions
oured-0.022
-0.022
ete-0.022
um-0.021
 course-0.021
Token.
Token ID13
Feature activation-1.55e-02

Pos logit contributions
 ghosts+0.079
 cob+0.073
 exciting+0.071
 staying+0.071
 exc+0.071
Neg logit contributions
-0.079
oured-0.066
les-0.066
oni-0.065
als-0.065
Token Sara
Token ID24799
Feature activation-4.97e-03

Pos logit contributions
+0.030
oured+0.027
ete+0.026
oni+0.025
or+0.025
Neg logit contributions
 ghosts-0.025
 staying-0.023
 cob-0.023
 snowy-0.022
 breathing-0.022
Token likes
Token ID7832
Feature activation-5.15e-02

Pos logit contributions
+0.011
oured+0.009
ete+0.009
or+0.009
iane+0.009
Neg logit contributions
 ghosts-0.007
 cob-0.007
 staying-0.007
 snowy-0.007
 breathing-0.007
Token the
Token ID262
Feature activation-2.41e-02

Pos logit contributions
+0.089
oni+0.076
 course+0.075
oured+0.075
ted+0.074
Neg logit contributions
 ghosts-0.085
 staying-0.077
 cob-0.077
 exc-0.077
 breathing-0.076
Token water
Token ID1660
Feature activation-5.71e-02

Pos logit contributions
+0.054
oured+0.049
ete+0.048
or+0.046
ar+0.046
Neg logit contributions
 ghosts-0.033
 cob-0.030
 staying-0.029
 snowy-0.029
 breathing-0.028
Tokenmelon
Token ID45690
Feature activation-1.90e-01

Pos logit contributions
+0.115
oured+0.101
ete+0.098
oni+0.095
or+0.095
Neg logit contributions
 ghosts-0.090
 cob-0.082
 staying-0.081
 snowy-0.079
 breathing-0.078
Token.
Token ID13
Feature activation+3.28e-03

Pos logit contributions
+0.383
oured+0.339
ete+0.328
or+0.315
iane+0.313
Neg logit contributions
 ghosts-0.298
 staying-0.273
 cob-0.272
 snowy-0.265
 breathing-0.261
Token She
Token ID1375
Feature activation-3.10e-02

Pos logit contributions
 ghosts+0.005
 cob+0.005
 staying+0.005
 snowy+0.005
 breathing+0.005
Neg logit contributions
-0.006
oured-0.006
ete-0.006
or-0.005
oni-0.005
Token says
Token ID1139
Feature activation-1.57e-02

Pos logit contributions
+0.079
oured+0.069
lor+0.069
or+0.069
ete+0.068
Neg logit contributions
 exc-0.035
 cob-0.033
 snowy-0.032
 ghosts-0.031
 exciting-0.030
Token it
Token ID340
Feature activation-5.70e-03

Pos logit contributions
+0.032
oured+0.030
ete+0.027
iane+0.027
or+0.026
Neg logit contributions
 ghosts-0.024
 staying-0.023
 cob-0.023
 exc-0.023
 exciting-0.022
Token is
Token ID318
Feature activation-1.82e-02

Pos logit contributions
+0.012
oured+0.011
ete+0.011
iane+0.011
or+0.011
Neg logit contributions
 ghosts-0.008
 cob-0.008
 snowy-0.007
 staying-0.007
 exc-0.007
Token.
Token ID13
Feature activation+2.78e-03

Pos logit contributions
+0.099
oured+0.087
ete+0.084
oni+0.082
or+0.081
Neg logit contributions
 ghosts-0.073
 staying-0.066
 cob-0.066
 snowy-0.064
 breathing-0.063
Token They
Token ID1119
Feature activation-3.69e-02

Pos logit contributions
 ghosts+0.006
 cob+0.006
 staying+0.005
 breathing+0.005
 snowy+0.005
Neg logit contributions
-0.004
als-0.003
ete-0.003
um-0.003
oured-0.003
Token eat
Token ID4483
Feature activation-1.60e-02

Pos logit contributions
+0.084
lor+0.082
ete+0.073
ar+0.073
or+0.072
Neg logit contributions
 exc-0.050
 cob-0.045
 exciting-0.044
 interesting-0.042
 snowy-0.042
Token the
Token ID262
Feature activation-1.22e-02

Pos logit contributions
+0.035
oured+0.031
ete+0.030
or+0.029
ian+0.029
Neg logit contributions
 ghosts-0.022
 cob-0.021
 staying-0.021
 snowy-0.020
 exc-0.020
Token water
Token ID1660
Feature activation-4.88e-02

Pos logit contributions
+0.025
oured+0.024
ete+0.023
lor+0.023
or+0.022
Neg logit contributions
 exciting-0.017
 snowy-0.017
 interesting-0.017
 exc-0.016
 cob-0.016
Tokenmelon
Token ID45690
Feature activation-1.83e-01

Pos logit contributions
+0.108
oured+0.098
ete+0.097
iane+0.094
ian+0.093
Neg logit contributions
 ghosts-0.062
 staying-0.061
 snowy-0.059
 cob-0.059
 breathing-0.058
Token and
Token ID290
Feature activation+7.32e-03

Pos logit contributions
+0.400
oured+0.361
ete+0.354
iane+0.338
ian+0.336
Neg logit contributions
 ghosts-0.244
 staying-0.232
 snowy-0.226
 cob-0.225
 than-0.222
Token enjoy
Token ID2883
Feature activation-6.71e-03

Pos logit contributions
 ghosts+0.012
 staying+0.011
 cob+0.011
 snowy+0.011
 breathing+0.011
Neg logit contributions
-0.014
oured-0.012
ete-0.012
or-0.012
oni-0.012
Token the
Token ID262
Feature activation-1.19e-02

Pos logit contributions
+0.013
oured+0.011
ete+0.011
oni+0.011
 course+0.010
Neg logit contributions
 ghosts-0.011
 cob-0.010
 staying-0.010
 snowy-0.010
 breathing-0.010
Token spring
Token ID6076
Feature activation-1.19e-02

Pos logit contributions
+0.026
oured+0.024
ete+0.023
or+0.022
ian+0.022
Neg logit contributions
 ghosts-0.016
 snowy-0.016
 cob-0.015
 staying-0.015
 exciting-0.015
Token.
Token ID13
Feature activation+5.08e-03

Pos logit contributions
+0.026
oured+0.024
ete+0.024
iane+0.023
ian+0.023
Neg logit contributions
 ghosts-0.015
 snowy-0.014
 staying-0.014
 cob-0.014
 than-0.014
Token my
Token ID616
Feature activation-3.03e-02

Pos logit contributions
+0.028
oured+0.025
ete+0.024
oni+0.024
or+0.024
Neg logit contributions
 ghosts-0.020
 cob-0.018
 staying-0.018
 snowy-0.017
 breathing-0.017
Token d
Token ID288
Feature activation-6.45e-02

Pos logit contributions
+0.065
oured+0.058
ete+0.056
iane+0.055
or+0.055
Neg logit contributions
 ghosts-0.044
 cob-0.042
 staying-0.039
 snowy-0.039
 exc-0.038
Tokenarl
Token ID7063
Feature activation-1.05e-01

Pos logit contributions
+0.136
 course+0.115
onia+0.109
neath+0.108
cient+0.106
Neg logit contributions
 ghosts-0.089
 than-0.085
 raining-0.080
 cob-0.078
 staying-0.077
Tokenings
Token ID654
Feature activation-1.01e-01

Pos logit contributions
+0.164
oured+0.160
ete+0.143
 course+0.138
on+0.135
Neg logit contributions
 than-0.211
 staying-0.169
 suitable-0.164
 exc-0.158
 ghosts-0.158
Token.
Token ID13
Feature activation-2.00e-03

Pos logit contributions
+0.227
oured+0.204
ete+0.195
ret+0.184
iane+0.184
Neg logit contributions
 ghosts-0.137
 than-0.132
 staying-0.129
 cob-0.115
 exciting-0.114
Token Enjoy
Token ID18179
Feature activation-1.78e-01

Pos logit contributions
+0.004
oured+0.004
oni+0.004
ete+0.004
or+0.004
Neg logit contributions
 ghosts-0.003
 staying-0.003
 cob-0.003
 exc-0.002
 snowy-0.002
Token your
Token ID534
Feature activation-1.53e-02

Pos logit contributions
+0.454
oured+0.396
ete+0.373
iane+0.366
 trigger+0.356
Neg logit contributions
 ghosts-0.204
 staying-0.202
 than-0.199
 cob-0.192
 snowy-0.181
Token lunch
Token ID9965
Feature activation-2.14e-02

Pos logit contributions
+0.031
oured+0.028
ete+0.027
iane+0.027
or+0.026
Neg logit contributions
 ghosts-0.023
 cob-0.022
 staying-0.021
 snowy-0.021
 exciting-0.021
Token.
Token ID13
Feature activation+6.92e-03

Pos logit contributions
+0.044
oured+0.040
ete+0.038
iane+0.037
ted+0.037
Neg logit contributions
 ghosts-0.031
 staying-0.030
 than-0.029
 cob-0.028
 snowy-0.028
Token I
Token ID314
Feature activation+1.09e-02

Pos logit contributions
 ghosts+0.011
 staying+0.010
 cob+0.010
 snowy+0.010
 exc+0.010
Neg logit contributions
-0.013
oured-0.012
ete-0.011
or-0.011
oni-0.011
Token made
Token ID925
Feature activation-1.95e-02

Pos logit contributions
 exc+0.014
 staying+0.014
 snowy+0.014
 exciting+0.014
 ghosts+0.013
Neg logit contributions
-0.024
oured-0.022
als-0.021
ete-0.021
iane-0.021
Token welcome
Token ID7062
Feature activation-6.59e-02

Pos logit contributions
+0.152
oured+0.135
ete+0.133
 course+0.128
oni+0.128
Neg logit contributions
 ghosts-0.098
 cob-0.087
 snowy-0.083
 staying-0.083
 breathing-0.081
Token,
Token ID11
Feature activation-1.45e-02

Pos logit contributions
+0.141
oured+0.124
ete+0.118
oni+0.116
or+0.115
Neg logit contributions
 ghosts-0.098
 staying-0.091
 cob-0.086
 breathing-0.085
 than-0.084
Token little
Token ID1310
Feature activation-6.37e-03

Pos logit contributions
+0.033
oured+0.029
ete+0.028
or+0.028
oni+0.028
Neg logit contributions
 ghosts-0.019
 staying-0.018
 cob-0.017
 snowy-0.017
 breathing-0.017
Token girl
Token ID2576
Feature activation+2.00e-02

Pos logit contributions
+0.016
oured+0.014
ete+0.014
or+0.013
oni+0.013
Neg logit contributions
 ghosts-0.008
 cob-0.007
 staying-0.007
 snowy-0.007
 exciting-0.006
Token.
Token ID13
Feature activation+7.77e-03

Pos logit contributions
 staying+0.023
 ghosts+0.023
 cob+0.022
 exc+0.022
 breathing+0.022
Neg logit contributions
-0.048
 trigger-0.041
oured-0.041
oni-0.041
ete-0.041
Token Enjoy
Token ID18179
Feature activation-1.78e-01

Pos logit contributions
 ghosts+0.012
 staying+0.011
 cob+0.011
 snowy+0.011
 exc+0.011
Neg logit contributions
-0.015
oured-0.014
oni-0.013
ete-0.013
or-0.013
Token your
Token ID534
Feature activation-1.73e-02

Pos logit contributions
+0.432
oured+0.385
ete+0.368
iane+0.360
oni+0.355
Neg logit contributions
 ghosts-0.215
 staying-0.213
 cob-0.197
 than-0.193
 breathing-0.189
Token balloon
Token ID21190
Feature activation-3.61e-02

Pos logit contributions
+0.036
oured+0.032
iane+0.031
ete+0.031
or+0.031
Neg logit contributions
 ghosts-0.026
 cob-0.024
 staying-0.024
 snowy-0.024
 exciting-0.024
Token."
Token ID526
Feature activation-1.91e-02

Pos logit contributions
+0.078
oured+0.070
ete+0.068
iane+0.067
ian+0.065
Neg logit contributions
 ghosts-0.048
 than-0.047
 staying-0.047
 cob-0.044
 snowy-0.044
Token the
Token ID262
Feature activation-1.22e-02

Pos logit contributions
+0.038
oured+0.036
oni+0.034
ar+0.034
or+0.033
Neg logit contributions
 ghosts-0.027
 staying-0.026
 exc-0.025
 cob-0.025
 snowy-0.024
Token man
Token ID582
Feature activation-3.82e-02

Pos logit contributions
+0.030
oured+0.028
ete+0.027
or+0.026
ar+0.026
Neg logit contributions
 ghosts-0.014
 cob-0.014
 snowy-0.012
 exc-0.012
 exciting-0.011
Token Sara
Token ID24799
Feature activation+1.97e-02

Pos logit contributions
 ghosts+0.005
 staying+0.004
 cob+0.004
 snowy+0.004
 breathing+0.004
Neg logit contributions
-0.007
oured-0.007
ete-0.006
oni-0.006
or-0.006
Token bites
Token ID26081
Feature activation-2.08e-02

Pos logit contributions
 exc+0.023
 cob+0.023
 snowy+0.020
 staying+0.020
 than+0.019
Neg logit contributions
-0.046
lor-0.042
 trigger-0.041
ete-0.041
or-0.041
Token into
Token ID656
Feature activation-3.53e-03

Pos logit contributions
+0.049
oured+0.045
ete+0.044
cient+0.043
lor+0.043
Neg logit contributions
 cob-0.023
 ghosts-0.023
 than-0.021
 breathing-0.021
 staying-0.021
Token the
Token ID262
Feature activation-1.80e-02

Pos logit contributions
+0.007
oured+0.007
ete+0.006
or+0.006
ar+0.006
Neg logit contributions
 ghosts-0.005
 cob-0.005
 staying-0.005
 snowy-0.005
 breathing-0.005
Token water
Token ID1660
Feature activation-4.79e-02

Pos logit contributions
+0.039
oured+0.037
lor+0.037
or+0.036
ete+0.036
Neg logit contributions
 sunny-0.022
 snowy-0.022
 cob-0.022
 interesting-0.022
 exciting-0.021
Tokenmelon
Token ID45690
Feature activation-1.77e-01

Pos logit contributions
+0.117
oured+0.111
ete+0.110
ival+0.105
iane+0.105
Neg logit contributions
 than-0.049
 breathing-0.046
 cob-0.045
 staying-0.045
 snowy-0.043
Token.
Token ID13
Feature activation+2.81e-03

Pos logit contributions
+0.419
oured+0.399
ete+0.389
ival+0.368
iane+0.367
Neg logit contributions
 than-0.193
 cob-0.184
 staying-0.180
 exc-0.179
 snowy-0.178
Token It
Token ID632
Feature activation+1.42e-02

Pos logit contributions
 ghosts+0.005
 cob+0.004
 staying+0.004
 snowy+0.004
 breathing+0.004
Neg logit contributions
-0.005
oured-0.005
ete-0.005
or-0.004
oni-0.004
Token is
Token ID318
Feature activation-1.90e-02

Pos logit contributions
 cob+0.019
 snowy+0.019
 exc+0.018
 ghosts+0.018
 exciting+0.017
Neg logit contributions
-0.031
oured-0.029
ete-0.029
iane-0.028
or-0.027
Token juicy
Token ID35075
Feature activation-3.01e-02

Pos logit contributions
+0.038
oni+0.034
oured+0.033
ete+0.032
um+0.032
Neg logit contributions
 ghosts-0.030
 staying-0.026
 cob-0.025
 outage-0.025
 breathing-0.025
Token and
Token ID290
Feature activation+3.59e-03

Pos logit contributions
+0.062
oured+0.056
ete+0.055
or+0.053
iane+0.052
Neg logit contributions
 ghosts-0.044
 cob-0.041
 staying-0.041
 snowy-0.040
 than-0.039
Token says
Token ID1139
Feature activation-6.73e-03

Pos logit contributions
 ghosts+0.038
 exciting+0.038
 interesting+0.037
 cob+0.037
 exc+0.037
Neg logit contributions
-0.087
ifying-0.074
ete-0.073
lor-0.072
oured-0.071
Token it
Token ID340
Feature activation-5.78e-04

Pos logit contributions
+0.011
oured+0.009
ete+0.009
oni+0.009
 course+0.009
Neg logit contributions
 ghosts-0.013
 cob-0.012
 snowy-0.012
 staying-0.012
 breathing-0.012
Token is
Token ID318
Feature activation-1.66e-02

Pos logit contributions
+0.001
oured+0.001
ete+0.001
or+0.001
oni+0.001
Neg logit contributions
 ghosts-0.001
 cob-0.001
 staying-0.001
 snowy-0.001
 breathing-0.001
Token a
Token ID257
Feature activation-1.66e-02

Pos logit contributions
+0.028
oni+0.025
oured+0.024
iane+0.023
ete+0.023
Neg logit contributions
 ghosts-0.029
 outage-0.026
 staying-0.026
 snowy-0.026
 cob-0.026
Token water
Token ID1660
Feature activation-4.78e-02

Pos logit contributions
+0.033
oured+0.028
ete+0.027
oni+0.027
 course+0.027
Neg logit contributions
 ghosts-0.027
 staying-0.024
 cob-0.024
 snowy-0.024
 breathing-0.024
Tokenmelon
Token ID45690
Feature activation-1.77e-01

Pos logit contributions
+0.087
oured+0.077
ete+0.074
or+0.070
oni+0.069
Neg logit contributions
 ghosts-0.084
 cob-0.079
 staying-0.078
 snowy-0.075
 breathing-0.075
Token.
Token ID13
Feature activation+5.84e-03

Pos logit contributions
+0.369
oured+0.330
ete+0.320
or+0.307
ar+0.303
Neg logit contributions
 ghosts-0.267
 cob-0.244
 staying-0.243
 snowy-0.235
 breathing-0.232
Token She
Token ID1375
Feature activation-2.61e-02

Pos logit contributions
 ghosts+0.010
 cob+0.009
 staying+0.009
 breathing+0.009
 snowy+0.009
Neg logit contributions
-0.012
oured-0.010
ete-0.010
um-0.009
oni-0.009
Token says
Token ID1139
Feature activation-7.22e-03

Pos logit contributions
+0.057
oured+0.048
ete+0.048
or+0.048
ar+0.047
Neg logit contributions
 ghosts-0.037
 cob-0.035
 exc-0.034
 staying-0.033
 exciting-0.033
Token it
Token ID340
Feature activation-9.44e-04

Pos logit contributions
+0.015
oured+0.013
ete+0.013
or+0.012
oni+0.012
Neg logit contributions
 ghosts-0.011
 cob-0.010
 staying-0.010
 snowy-0.010
 breathing-0.010
Token is
Token ID318
Feature activation-1.72e-02

Pos logit contributions
+0.002
oured+0.002
ete+0.002
or+0.002
 course+0.002
Neg logit contributions
 ghosts-0.001
 cob-0.001
 staying-0.001
 cigarettes-0.001
 snowy-0.001
Token wants
Token ID3382
Feature activation+2.30e-02

Pos logit contributions
+0.024
oured+0.020
ete+0.020
or+0.019
lor+0.019
Neg logit contributions
 cob-0.013
 ghosts-0.013
 exc-0.013
 staying-0.012
 snowy-0.012
Token to
Token ID284
Feature activation-1.33e-02

Pos logit contributions
 ghosts+0.037
 cob+0.034
 staying+0.034
 snowy+0.033
 breathing+0.032
Neg logit contributions
-0.046
oured-0.041
ete-0.039
or-0.038
ar-0.038
Token try
Token ID1949
Feature activation-4.95e-02

Pos logit contributions
+0.028
lor+0.028
oured+0.027
ian+0.027
or+0.026
Neg logit contributions
 exc-0.018
 exciting-0.016
 interesting-0.016
 cob-0.016
 sunny-0.016
Token the
Token ID262
Feature activation-1.52e-02

Pos logit contributions
+0.093
oured+0.080
ete+0.077
oni+0.076
or+0.075
Neg logit contributions
 ghosts-0.086
 cob-0.078
 staying-0.077
 snowy-0.076
 breathing-0.075
Token water
Token ID1660
Feature activation-4.75e-02

Pos logit contributions
+0.031
oured+0.028
ete+0.027
or+0.027
ian+0.026
Neg logit contributions
 ghosts-0.022
 cob-0.022
 exciting-0.021
 snowy-0.021
 exc-0.021
Tokenmelon
Token ID45690
Feature activation-1.77e-01

Pos logit contributions
+0.106
oured+0.098
ete+0.094
ian+0.092
lor+0.090
Neg logit contributions
 ghosts-0.062
 staying-0.059
 cob-0.058
 exc-0.056
 than-0.056
Token.
Token ID13
Feature activation+4.72e-03

Pos logit contributions
+0.365
oured+0.327
ete+0.315
or+0.302
ar+0.299
Neg logit contributions
 ghosts-0.268
 staying-0.247
 cob-0.246
 snowy-0.239
 exc-0.234
Token She
Token ID1375
Feature activation-2.69e-02

Pos logit contributions
 ghosts+0.009
 staying+0.008
 snowy+0.008
 cob+0.008
 breathing+0.008
Neg logit contributions
-0.008
oured-0.007
ete-0.007
als-0.007
um-0.007
Token asks
Token ID7893
Feature activation+3.98e-02

Pos logit contributions
+0.062
lor+0.053
ete+0.052
or+0.052
oured+0.052
Neg logit contributions
 ghosts-0.034
 cob-0.033
 exc-0.033
 snowy-0.032
 exciting-0.031
Token her
Token ID607
Feature activation+4.00e-02

Pos logit contributions
 ghosts+0.062
 cob+0.057
 snowy+0.057
 staying+0.056
 breathing+0.054
Neg logit contributions
-0.079
oured-0.069
ete-0.068
oni-0.068
or-0.068
Token mom
Token ID1995
Feature activation+3.48e-02

Pos logit contributions
 ghosts+0.060
 staying+0.055
 snowy+0.054
 cob+0.053
 breathing+0.053
Neg logit contributions
-0.080
oured-0.071
oni-0.071
ete-0.070
iane-0.068
Token Lily
Token ID20037
Feature activation+1.62e-02

Pos logit contributions
+0.015
oured+0.014
ete+0.014
sel+0.013
oni+0.013
Neg logit contributions
 ghosts-0.010
 staying-0.009
 breathing-0.009
 cob-0.009
 exc-0.009
Token replied
Token ID8712
Feature activation-2.64e-02

Pos logit contributions
 ghosts+0.022
 cob+0.022
 staying+0.021
 exc+0.020
 snowy+0.020
Neg logit contributions
-0.034
oured-0.030
or-0.029
ete-0.029
 trigger-0.029
Token,
Token ID11
Feature activation+2.75e-03

Pos logit contributions
+0.049
oured+0.043
ete+0.042
iane+0.041
oni+0.040
Neg logit contributions
 ghosts-0.044
 staying-0.042
 breathing-0.041
 cob-0.041
 exc-0.039
Token "
Token ID366
Feature activation+4.74e-02

Pos logit contributions
 ghosts+0.003
 staying+0.003
 breathing+0.003
 cob+0.003
 exc+0.003
Neg logit contributions
-0.006
oured-0.006
ete-0.006
or-0.006
iane-0.006
TokenTo
Token ID2514
Feature activation-5.76e-02

Pos logit contributions
 ghosts+0.078
 cob+0.071
 staying+0.071
 snowy+0.070
 breathing+0.068
Neg logit contributions
-0.092
oured-0.082
ete-0.079
oni-0.076
or-0.075
Tokenast
Token ID459
Feature activation-1.77e-01

Pos logit contributions
+0.121
oured+0.111
ted+0.103
ete+0.102
les+0.101
Neg logit contributions
 ghosts-0.086
 staying-0.084
 cob-0.081
 breathing-0.076
 exc-0.075
Token with
Token ID351
Feature activation-5.85e-03

Pos logit contributions
+0.385
oured+0.285
 course+0.281
ete+0.274
ar+0.274
Neg logit contributions
 ghosts-0.254
 staying-0.243
 cob-0.236
 about-0.230
 Kim-0.226
Token jam
Token ID16853
Feature activation-4.52e-02

Pos logit contributions
+0.013
oured+0.012
ete+0.012
or+0.012
ar+0.011
Neg logit contributions
 ghosts-0.008
 cob-0.008
 staying-0.007
 cigarettes-0.007
 snowy-0.007
Token!
Token ID0
Feature activation-5.20e-02

Pos logit contributions
+0.088
oured+0.081
ete+0.077
iane+0.074
or+0.072
Neg logit contributions
 ghosts-0.072
 staying-0.069
 cob-0.067
 breathing-0.064
 snowy-0.063
Token It
Token ID632
Feature activation+1.73e-02

Pos logit contributions
+0.100
oured+0.095
ete+0.091
or+0.091
les+0.090
Neg logit contributions
 ghosts-0.079
 cob-0.072
 staying-0.071
 exc-0.071
 breathing-0.069
Token was
Token ID373
Feature activation+1.47e-03

Pos logit contributions
 ghosts+0.031
 cob+0.029
 staying+0.028
 snowy+0.028
 breathing+0.027
Neg logit contributions
-0.032
oured-0.028
ete-0.027
or-0.026
iane-0.025
Token market
Token ID1910
Feature activation-6.22e-02

Pos logit contributions
+0.032
oured+0.028
ete+0.028
or+0.027
ian+0.026
Neg logit contributions
 ghosts-0.026
 cob-0.024
 staying-0.024
 snowy-0.024
 breathing-0.023
Token and
Token ID290
Feature activation+5.90e-03

Pos logit contributions
+0.124
oured+0.109
ete+0.106
oni+0.103
or+0.102
Neg logit contributions
 ghosts-0.099
 cob-0.091
 staying-0.089
 snowy-0.088
 breathing-0.087
Token find
Token ID1064
Feature activation+2.50e-02

Pos logit contributions
 ghosts+0.009
 staying+0.009
 cob+0.009
 snowy+0.008
 exc+0.008
Neg logit contributions
-0.012
ete-0.010
oured-0.010
oni-0.010
lor-0.010
Token a
Token ID257
Feature activation-1.86e-02

Pos logit contributions
 ghosts+0.040
 cob+0.037
 staying+0.036
 snowy+0.036
 exciting+0.035
Neg logit contributions
-0.050
oured-0.043
ete-0.042
or-0.040
ar-0.040
Token water
Token ID1660
Feature activation-4.76e-02

Pos logit contributions
+0.040
oured+0.036
ete+0.035
or+0.034
oni+0.034
Neg logit contributions
 ghosts-0.026
 cob-0.024
 staying-0.024
 snowy-0.023
 breathing-0.023
Tokenmelon
Token ID45690
Feature activation-1.77e-01

Pos logit contributions
+0.098
oured+0.089
ete+0.087
ian+0.082
or+0.081
Neg logit contributions
 ghosts-0.068
 staying-0.065
 cob-0.065
 than-0.062
 snowy-0.062
Token.
Token ID13
Feature activation+4.17e-03

Pos logit contributions
+0.410
oured+0.374
ete+0.361
ian+0.345
or+0.342
Neg logit contributions
 ghosts-0.215
 staying-0.204
 than-0.199
 cob-0.194
 snowy-0.193
Token Her
Token ID2332
Feature activation+4.38e-02

Pos logit contributions
 ghosts+0.006
 staying+0.005
 breathing+0.005
 cob+0.005
 snowy+0.005
Neg logit contributions
-0.010
oured-0.008
als-0.008
ete-0.008
ted-0.008
Token mom
Token ID1995
Feature activation+3.44e-02

Pos logit contributions
 ghosts+0.052
 cob+0.047
 staying+0.046
 snowy+0.044
 exc+0.044
Neg logit contributions
-0.105
oured-0.096
ete-0.093
or-0.091
 course-0.090
Token pays
Token ID13831
Feature activation-7.32e-02

Pos logit contributions
 cob+0.041
 exc+0.041
 staying+0.039
 exciting+0.038
 ghosts+0.038
Neg logit contributions
-0.078
lor-0.067
ete-0.067
or-0.066
 trigger-0.066
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
+0.164
oured+0.134
als+0.127
ete+0.127
ar+0.124
Neg logit contributions
 ghosts-0.109
 staying-0.104
 cob-0.100
 exciting-0.099
 breathing-0.098
Token man
Token ID582
Feature activation-3.61e-02

Pos logit contributions
+0.033
oured+0.031
ete+0.030
or+0.029
ted+0.028
Neg logit contributions
 ghosts-0.022
 cob-0.022
 exciting-0.022
 exc-0.021
 staying-0.021
Token and
Token ID290
Feature activation+5.73e-03

Pos logit contributions
+0.083
oured+0.070
ete+0.070
iane+0.068
um+0.066
Neg logit contributions
 ghosts-0.047
 staying-0.046
 cob-0.043
 than-0.043
 breathing-0.043
Token takes
Token ID2753
Feature activation-4.66e-02

Pos logit contributions
 ghosts+0.009
 staying+0.008
 cob+0.008
 snowy+0.008
 exc+0.008
Neg logit contributions
-0.012
oured-0.010
ete-0.010
or-0.010
ar-0.010
Token the
Token ID262
Feature activation-1.64e-02

Pos logit contributions
+0.114
oured+0.096
ete+0.095
opus+0.093
iane+0.092
Neg logit contributions
 ghosts-0.056
 cob-0.054
 exciting-0.049
 cigarettes-0.049
 snowy-0.048
Token water
Token ID1660
Feature activation-4.75e-02

Pos logit contributions
+0.034
oured+0.031
ete+0.030
lor+0.030
igh+0.030
Neg logit contributions
 interesting-0.022
 exciting-0.022
 exc-0.021
 cob-0.021
 snowy-0.021
Tokenmelon
Token ID45690
Feature activation-1.77e-01

Pos logit contributions
+0.111
oured+0.101
lor+0.099
ival+0.098
ete+0.098
Neg logit contributions
 than-0.054
 staying-0.051
 exc-0.050
 raining-0.050
 snowy-0.049
Token home
Token ID1363
Feature activation-6.91e-02

Pos logit contributions
+0.451
oured+0.425
ete+0.404
ival+0.394
lor+0.393
Neg logit contributions
 than-0.168
 staying-0.151
 exc-0.148
 snowy-0.148
 raining-0.141
Token.
Token ID13
Feature activation+4.09e-03

Pos logit contributions
+0.152
oured+0.134
ete+0.130
 course+0.126
iane+0.125
Neg logit contributions
 ghosts-0.096
 staying-0.089
 cob-0.088
 breathing-0.086
 snowy-0.086
Token She
Token ID1375
Feature activation-2.62e-02

Pos logit contributions
 ghosts+0.006
 staying+0.005
 cob+0.005
 breathing+0.005
 snowy+0.005
Neg logit contributions
-0.009
oured-0.008
ete-0.008
oni-0.007
als-0.007
Token cuts
Token ID6630
Feature activation-1.45e-02

Pos logit contributions
+0.059
or+0.050
ete+0.050
oured+0.050
ar+0.049
Neg logit contributions
 cob-0.034
 ghosts-0.034
 exc-0.033
 snowy-0.032
 staying-0.032
Token it
Token ID340
Feature activation-9.44e-04

Pos logit contributions
+0.037
ete+0.032
oured+0.032
cient+0.032
lor+0.031
Neg logit contributions
 cob-0.015
 ghosts-0.014
 cigarettes-0.014
 breathing-0.013
 than-0.013
Token made
Token ID925
Feature activation-1.20e-02

Pos logit contributions
+0.031
ete+0.027
oured+0.027
 course+0.026
oni+0.026
Neg logit contributions
 ghosts-0.032
 cob-0.029
 staying-0.028
 breathing-0.028
 snowy-0.027
Token his
Token ID465
Feature activation+2.16e-02

Pos logit contributions
+0.023
oured+0.019
als+0.018
iane+0.018
ete+0.017
Neg logit contributions
 ghosts-0.021
 staying-0.020
 exciting-0.020
 cob-0.020
 snowy-0.019
Token mother
Token ID2802
Feature activation+8.76e-03

Pos logit contributions
 ghosts+0.033
 cob+0.030
 staying+0.030
 snowy+0.029
 breathing+0.029
Neg logit contributions
-0.044
oured-0.039
ete-0.038
or-0.036
oni-0.036
Token even
Token ID772
Feature activation+1.94e-02

Pos logit contributions
 ghosts+0.011
 staying+0.010
 cob+0.010
 breathing+0.010
 exciting+0.010
Neg logit contributions
-0.021
oured-0.018
ete-0.018
oni-0.018
iane-0.018
Token p
Token ID279
Feature activation+2.69e-02

Pos logit contributions
 ghosts+0.031
 cob+0.027
 staying+0.027
 snowy+0.027
 breathing+0.026
Neg logit contributions
-0.038
oured-0.034
ete-0.033
oni-0.032
 course-0.032
Tokenrou
Token ID472
Feature activation-1.77e-01

Pos logit contributions
 staying+0.035
 ghosts+0.033
 snowy+0.032
 exciting+0.032
 cigarettes+0.032
Neg logit contributions
-0.056
 course-0.050
 trigger-0.050
ian-0.049
iane-0.048
Tokender
Token ID1082
Feature activation+7.15e-02

Pos logit contributions
+0.317
oured+0.297
ete+0.289
iane+0.286
igh+0.273
Neg logit contributions
 ghosts-0.279
 staying-0.271
 than-0.269
 cob-0.265
 breathing-0.258
Token.
Token ID13
Feature activation+9.95e-03

Pos logit contributions
 ghosts+0.096
 cob+0.086
 staying+0.086
 snowy+0.083
 breathing+0.082
Neg logit contributions
-0.161
oured-0.144
ete-0.140
oni-0.136
or-0.135
Token<|endoftext|>
Token ID50256
Feature activation-8.66e-03

Pos logit contributions
 ghosts+0.014
 staying+0.014
 Kim+0.014
 snowy+0.014
 exc+0.013
Neg logit contributions
-0.019
oured-0.017
ar-0.017
als-0.017
les-0.017
TokenJohn
Token ID7554
Feature activation+4.27e-02

Pos logit contributions
+0.014
oured+0.014
ete+0.013
oni+0.013
cient+0.012
Neg logit contributions
 ghosts-0.015
 staying-0.014
 snowy-0.014
 cob-0.013
 breathing-0.013
Token and
Token ID290
Feature activation+9.91e-03

Pos logit contributions
 ghosts+0.049
 snowy+0.049
 staying+0.049
 cob+0.048
 exc+0.046
Neg logit contributions
-0.103
oured-0.086
 trigger-0.086
ete-0.085
iane-0.084
Token her
Token ID607
Feature activation+4.59e-02

Pos logit contributions
 ghosts+0.053
 cob+0.048
 staying+0.047
 snowy+0.046
 breathing+0.046
Neg logit contributions
-0.076
oured-0.066
ete-0.065
oni-0.062
 course-0.062
Token body
Token ID1767
Feature activation+1.53e-02

Pos logit contributions
 ghosts+0.060
 staying+0.054
 cob+0.054
 snowy+0.052
 breathing+0.051
Neg logit contributions
-0.104
oured-0.093
ete-0.090
or-0.088
oni-0.088
Token like
Token ID588
Feature activation-1.44e-02

Pos logit contributions
 ghosts+0.021
 cob+0.020
 staying+0.019
 breathing+0.019
 snowy+0.019
Neg logit contributions
-0.034
ete-0.030
oured-0.030
iane-0.028
 course-0.028
Token a
Token ID257
Feature activation-1.29e-02

Pos logit contributions
+0.031
oured+0.029
ete+0.028
or+0.027
ar+0.027
Neg logit contributions
 ghosts-0.021
 cob-0.019
 staying-0.019
 breathing-0.018
 exciting-0.018
Token gymn
Token ID38581
Feature activation+1.85e-02

Pos logit contributions
+0.027
oured+0.025
ete+0.024
oni+0.023
or+0.023
Neg logit contributions
 ghosts-0.019
 cob-0.019
 snowy-0.017
 breathing-0.017
 staying-0.017
Tokenast
Token ID459
Feature activation-1.76e-01

Pos logit contributions
 ghosts+0.035
 cob+0.033
 staying+0.032
 snowy+0.032
 breathing+0.031
Neg logit contributions
-0.031
oured-0.027
ete-0.026
oni-0.025
or-0.024
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
+0.372
ete+0.334
oured+0.319
iane+0.310
ingu+0.299
Neg logit contributions
 ghosts-0.275
 cob-0.254
 breathing-0.239
 staying-0.238
 snowy-0.231
Token Lily
Token ID20037
Feature activation+2.10e-02

Pos logit contributions
 ghosts+0.017
 staying+0.015
 cob+0.015
 snowy+0.015
 breathing+0.015
Neg logit contributions
-0.025
oured-0.022
ete-0.021
oni-0.020
als-0.020
Token tried
Token ID3088
Feature activation+2.65e-03

Pos logit contributions
 ghosts+0.034
 cob+0.032
 staying+0.031
 snowy+0.030
 exc+0.030
Neg logit contributions
-0.042
oured-0.036
ete-0.035
oni-0.034
iane-0.034
Token her
Token ID607
Feature activation+4.53e-02

Pos logit contributions
 ghosts+0.004
 cob+0.004
 staying+0.004
 snowy+0.004
 exc+0.004
Neg logit contributions
-0.005
oured-0.004
ete-0.004
or-0.004
oni-0.004
Token best
Token ID1266
Feature activation+3.00e-02

Pos logit contributions
 ghosts+0.072
 staying+0.065
 cob+0.065
 snowy+0.064
 breathing+0.062
Neg logit contributions
-0.091
oured-0.079
ete-0.076
or-0.075
oni-0.075
Token one
Token ID530
Feature activation-2.23e-02

Pos logit contributions
+0.138
oured+0.127
ete+0.118
ted+0.114
or+0.112
Neg logit contributions
 ghosts-0.099
 cob-0.097
 staying-0.092
 exciting-0.091
 snowy-0.089
Token is
Token ID318
Feature activation-1.97e-02

Pos logit contributions
+0.043
oured+0.037
ete+0.036
 course+0.035
oni+0.035
Neg logit contributions
 ghosts-0.037
 staying-0.033
 cob-0.033
 snowy-0.032
 breathing-0.032
Token for
Token ID329
Feature activation-4.18e-02

Pos logit contributions
+0.043
oured+0.038
ete+0.037
or+0.036
oni+0.036
Neg logit contributions
 ghosts-0.028
 cob-0.025
 staying-0.025
 snowy-0.024
 breathing-0.024
Token free
Token ID1479
Feature activation-5.36e-02

Pos logit contributions
+0.091
oured+0.082
ete+0.078
or+0.077
iane+0.076
Neg logit contributions
 ghosts-0.061
 staying-0.055
 cob-0.054
 breathing-0.052
 snowy-0.051
Token.
Token ID13
Feature activation+4.09e-03

Pos logit contributions
+0.114
oured+0.101
ete+0.097
iane+0.093
oni+0.093
Neg logit contributions
 ghosts-0.081
 staying-0.075
 cob-0.074
 than-0.071
 breathing-0.070
Token Enjoy
Token ID18179
Feature activation-1.76e-01

Pos logit contributions
 ghosts+0.006
 staying+0.006
 cob+0.006
 exc+0.006
 snowy+0.006
Neg logit contributions
-0.008
oured-0.007
ete-0.007
oni-0.007
or-0.007
Token the
Token ID262
Feature activation-8.17e-03

Pos logit contributions
+0.428
oured+0.380
ete+0.358
iane+0.349
als+0.349
Neg logit contributions
 staying-0.214
 ghosts-0.208
 than-0.199
 cob-0.191
 breathing-0.187
Token show
Token ID905
Feature activation-6.29e-02

Pos logit contributions
+0.017
oured+0.015
ete+0.015
iane+0.014
or+0.014
Neg logit contributions
 ghosts-0.012
 cob-0.012
 snowy-0.012
 staying-0.011
 exciting-0.011
Token!"
Token ID2474
Feature activation-4.46e-02

Pos logit contributions
+0.142
oured+0.124
ete+0.121
iane+0.117
 trigger+0.117
Neg logit contributions
 than-0.083
 ghosts-0.082
 staying-0.081
 cob-0.073
 breathing-0.071
Token
Token ID198
Feature activation+4.47e-03

Pos logit contributions
+0.100
oured+0.094
or+0.087
sel+0.087
les+0.086
Neg logit contributions
 ghosts-0.055
 staying-0.051
 exc-0.050
 Kim-0.048
 breathing-0.048
Token
Token ID198
Feature activation+4.26e-03

Pos logit contributions
 ghosts+0.007
 staying+0.007
 cob+0.007
 snowy+0.007
 exc+0.006
Neg logit contributions
-0.009
oured-0.008
ete-0.007
als-0.007
oni-0.007
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
+0.002
oured+0.002
ete+0.002
oni+0.001
or+0.001
Neg logit contributions
 ghosts-0.001
 cob-0.001
 staying-0.001
 breathing-0.001
 snowy-0.001
Token other
Token ID584
Feature activation+5.06e-03

Pos logit contributions
+0.037
oured+0.034
ete+0.033
or+0.031
oni+0.031
Neg logit contributions
 ghosts-0.021
 cob-0.020
 staying-0.019
 breathing-0.019
 snowy-0.018
Token wild
Token ID4295
Feature activation+1.61e-02

Pos logit contributions
 ghosts+0.008
 cob+0.007
 staying+0.007
 snowy+0.007
 breathing+0.007
Neg logit contributions
-0.011
oured-0.009
ete-0.009
oni-0.009
iane-0.009
Tokener
Token ID263
Feature activation+3.31e-03

Pos logit contributions
 ghosts+0.026
 cob+0.024
 staying+0.024
 snowy+0.023
 breathing+0.023
Neg logit contributions
-0.032
oured-0.028
ete-0.027
oni-0.026
or-0.026
Tokenbe
Token ID1350
Feature activation+6.95e-02

Pos logit contributions
 ghosts+0.004
 staying+0.004
 breathing+0.004
 cob+0.004
 exc+0.004
Neg logit contributions
-0.008
oured-0.007
ete-0.007
iane-0.006
 course-0.006
Tokenast
Token ID459
Feature activation-1.75e-01

Pos logit contributions
 ghosts+0.096
 staying+0.094
 exc+0.093
 exciting+0.091
 breathing+0.091
Neg logit contributions
-0.159
oured-0.132
als-0.127
iane-0.125
ete-0.124
Token.
Token ID13
Feature activation+3.56e-03

Pos logit contributions
+0.323
ete+0.272
iane+0.259
oured+0.259
ory+0.250
Neg logit contributions
 ghosts-0.308
 breathing-0.295
 staying-0.294
 exc-0.280
 cob-0.276
Token "
Token ID366
Feature activation+4.86e-02

Pos logit contributions
 ghosts+0.005
 staying+0.004
 breathing+0.004
 cob+0.004
 exc+0.004
Neg logit contributions
-0.008
oured-0.007
or-0.007
ete-0.007
les-0.007
TokenLet
Token ID5756
Feature activation-8.78e-03

Pos logit contributions
 ghosts+0.065
 staying+0.060
 exc+0.057
 breathing+0.057
 than+0.056
Neg logit contributions
-0.106
oured-0.099
ete-0.095
oni-0.092
ar-0.091
Token's
Token ID338
Feature activation+4.53e-02

Pos logit contributions
+0.021
oured+0.018
oni+0.016
iane+0.016
ete+0.015
Neg logit contributions
 ghosts-0.012
 exciting-0.012
 cob-0.012
 staying-0.011
 interesting-0.011
Token explore
Token ID7301
Feature activation+5.02e-02

Pos logit contributions
 exc+0.055
 ghosts+0.051
 staying+0.050
 exciting+0.050
 cob+0.048
Neg logit contributions
-0.101
oured-0.099
oni-0.095
ted-0.094
les-0.092
Token a
Token ID257
Feature activation-1.45e-02

Pos logit contributions
+0.011
oured+0.010
ete+0.009
or+0.009
oni+0.009
Neg logit contributions
 ghosts-0.008
 cob-0.008
 staying-0.007
 snowy-0.007
 breathing-0.007
Token big
Token ID1263
Feature activation-2.27e-02

Pos logit contributions
+0.032
oured+0.028
ete+0.027
oni+0.026
or+0.026
Neg logit contributions
 ghosts-0.021
 cob-0.019
 staying-0.018
 snowy-0.018
 breathing-0.018
Token,
Token ID11
Feature activation-6.12e-03

Pos logit contributions
+0.042
oured+0.036
ete+0.035
 course+0.034
oni+0.034
Neg logit contributions
 ghosts-0.039
 staying-0.036
 cob-0.036
 snowy-0.035
 breathing-0.035
Token juicy
Token ID35075
Feature activation-3.11e-02

Pos logit contributions
+0.012
 course+0.011
oured+0.011
oni+0.011
als+0.011
Neg logit contributions
 ghosts-0.007
 staying-0.007
 outage-0.006
 textures-0.006
 breathing-0.006
Token water
Token ID1660
Feature activation-4.81e-02

Pos logit contributions
+0.065
oured+0.057
ete+0.056
or+0.053
oni+0.053
Neg logit contributions
 ghosts-0.048
 cob-0.046
 staying-0.043
 snowy-0.043
 breathing-0.042
Tokenmelon
Token ID45690
Feature activation-1.75e-01

Pos logit contributions
+0.104
oured+0.096
ete+0.095
or+0.088
um+0.088
Neg logit contributions
 ghosts-0.068
 staying-0.064
 cob-0.062
 breathing-0.060
 snowy-0.059
Token in
Token ID287
Feature activation+1.87e-02

Pos logit contributions
+0.408
oured+0.376
ete+0.366
or+0.348
ar+0.343
Neg logit contributions
 ghosts-0.229
 staying-0.205
 snowy-0.199
 cob-0.195
 than-0.193
Token the
Token ID262
Feature activation-1.37e-02

Pos logit contributions
 ghosts+0.024
 cob+0.021
 snowy+0.020
 staying+0.020
 breathing+0.020
Neg logit contributions
-0.045
oured-0.043
or-0.039
ete-0.039
ar-0.038
Token field
Token ID2214
Feature activation-1.22e-02

Pos logit contributions
+0.032
oured+0.029
ete+0.028
or+0.028
oni+0.027
Neg logit contributions
 ghosts-0.019
 cob-0.018
 snowy-0.017
 sunny-0.016
 breathing-0.016
Token.
Token ID13
Feature activation+7.77e-03

Pos logit contributions
+0.030
oured+0.027
ete+0.026
ian+0.026
iane+0.025
Neg logit contributions
 staying-0.014
 ghosts-0.013
 than-0.012
 raining-0.012
 breathing-0.012
Token
Token ID198
Feature activation+4.25e-03

Pos logit contributions
 ghosts+0.010
 raining+0.010
 staying+0.010
 snowy+0.010
 exciting+0.010
Neg logit contributions
-0.015
oured-0.014
or-0.014
igh-0.014
sel-0.013
Token.
Token ID13
Feature activation+8.48e-03

Pos logit contributions
 ghosts+0.074
 staying+0.068
 breathing+0.066
 cob+0.064
 exc+0.064
Neg logit contributions
-0.120
oured-0.106
ete-0.101
 course-0.100
iane-0.099
Token
Token ID198
Feature activation+5.75e-03

Pos logit contributions
 staying+0.015
 exc+0.014
 ghosts+0.014
 snowy+0.014
 breathing+0.014
Neg logit contributions
-0.013
oured-0.012
 course-0.011
als-0.011
les-0.011
Token
Token ID198
Feature activation+5.80e-03

Pos logit contributions
 exc+0.007
 staying+0.007
 ghosts+0.007
 exciting+0.007
 snowy+0.007
Neg logit contributions
-0.012
oured-0.011
als-0.011
oni-0.010
iane-0.010
TokenS
Token ID50
Feature activation-3.71e-02

Pos logit contributions
 staying+0.009
 exc+0.009
 snowy+0.009
 than+0.009
 ghosts+0.009
Neg logit contributions
-0.011
oured-0.009
als-0.009
iane-0.008
ian-0.008
Tokeneb
Token ID1765
Feature activation-1.59e-02

Pos logit contributions
+0.045
oured+0.034
als+0.034
 trigger+0.031
cient+0.030
Neg logit contributions
 snowy-0.078
 staying-0.078
 than-0.078
 ghosts-0.077
 exciting-0.076
Tokenast
Token ID459
Feature activation-1.75e-01

Pos logit contributions
+0.034
oured+0.028
 course+0.027
als+0.027
ted+0.027
Neg logit contributions
 cob-0.022
 exc-0.022
 cigarettes-0.022
 ghosts-0.022
 breathing-0.021
Tokenian
Token ID666
Feature activation-1.89e-02

Pos logit contributions
+0.233
ives+0.176
 course+0.164
iane+0.154
oured+0.152
Neg logit contributions
 ghosts-0.376
 cob-0.370
 staying-0.367
 snowy-0.365
 breathing-0.362
Token disagreed
Token ID27884
Feature activation-3.87e-02

Pos logit contributions
+0.039
ives+0.031
 trigger+0.031
oured+0.031
ifying+0.031
Neg logit contributions
 exciting-0.025
 cob-0.025
 staying-0.025
 ghosts-0.025
 breathing-0.024
Token with
Token ID351
Feature activation-6.09e-03

Pos logit contributions
+0.083
oured+0.073
ete+0.072
iane+0.069
or+0.069
Neg logit contributions
 ghosts-0.056
 staying-0.052
 cob-0.050
 breathing-0.049
 snowy-0.049
Token his
Token ID465
Feature activation+2.04e-02

Pos logit contributions
+0.013
oured+0.012
or+0.011
ete+0.011
als+0.011
Neg logit contributions
 ghosts-0.009
 cob-0.008
 staying-0.008
 exciting-0.008
 breathing-0.008
Token friend
Token ID1545
Feature activation-7.97e-02

Pos logit contributions
 ghosts+0.028
 cob+0.026
 staying+0.025
 snowy+0.024
 breathing+0.024
Neg logit contributions
-0.045
oured-0.040
ete-0.039
or-0.038
iane-0.038
Token and
Token ID290
Feature activation+7.48e-03

Pos logit contributions
 ghosts+0.017
 staying+0.016
 cob+0.016
 breathing+0.015
 snowy+0.015
Neg logit contributions
-0.025
oured-0.021
ete-0.021
oni-0.021
iane-0.020
Token the
Token ID262
Feature activation-1.44e-02

Pos logit contributions
 ghosts+0.013
 cob+0.012
 staying+0.012
 snowy+0.012
 breathing+0.012
Neg logit contributions
-0.013
oured-0.012
ete-0.011
oni-0.011
or-0.011
Token wild
Token ID4295
Feature activation+1.60e-02

Pos logit contributions
+0.031
oured+0.027
ete+0.026
oni+0.026
or+0.026
Neg logit contributions
 ghosts-0.021
 cob-0.019
 staying-0.019
 snowy-0.019
 breathing-0.018
Tokener
Token ID263
Feature activation+3.79e-03

Pos logit contributions
 ghosts+0.027
 cob+0.025
 staying+0.025
 snowy+0.024
 breathing+0.024
Neg logit contributions
-0.031
oured-0.027
ete-0.026
oni-0.025
or-0.025
Tokenbe
Token ID1350
Feature activation+6.97e-02

Pos logit contributions
 staying+0.005
 ghosts+0.005
 exc+0.005
 cob+0.005
 breathing+0.005
Neg logit contributions
-0.008
oured-0.007
ete-0.006
 course-0.006
oni-0.006
Tokenast
Token ID459
Feature activation-1.75e-01

Pos logit contributions
 ghosts+0.102
 staying+0.097
 breathing+0.094
 than+0.093
 exc+0.093
Neg logit contributions
-0.153
oured-0.129
ete-0.123
als-0.122
iane-0.121
Token smiled
Token ID13541
Feature activation-2.75e-02

Pos logit contributions
+0.289
ete+0.237
iane+0.230
 course+0.226
oured+0.221
Neg logit contributions
 ghosts-0.332
 staying-0.312
 cob-0.306
 breathing-0.306
 snowy-0.302
Token.
Token ID13
Feature activation+5.35e-03

Pos logit contributions
+0.058
oured+0.051
ete+0.050
oni+0.048
or+0.048
Neg logit contributions
 ghosts-0.041
 staying-0.037
 cob-0.037
 snowy-0.036
 breathing-0.036
Token
Token ID198
Feature activation+4.26e-03

Pos logit contributions
 ghosts+0.008
 staying+0.008
 cob+0.008
 breathing+0.008
 snowy+0.008
Neg logit contributions
-0.010
oured-0.009
or-0.009
 course-0.009
ete-0.009
Token
Token ID198
Feature activation+4.10e-03

Pos logit contributions
 ghosts+0.006
 staying+0.006
 cob+0.006
 exc+0.006
 breathing+0.006
Neg logit contributions
-0.009
oured-0.008
als-0.007
oni-0.007
ete-0.007
Token"
Token ID1
Feature activation-2.00e-02

Pos logit contributions
 ghosts+0.006
 staying+0.006
 exc+0.006
 snowy+0.006
 cob+0.006
Neg logit contributions
-0.008
oured-0.008
als-0.007
oni-0.007
ian-0.007
Token feels
Token ID5300
Feature activation-1.71e-02

Pos logit contributions
+0.077
or+0.064
oured+0.063
lor+0.062
ar+0.062
Neg logit contributions
 ghosts-0.027
 exc-0.025
 staying-0.024
 exciting-0.024
 than-0.024
Token even
Token ID772
Feature activation+1.90e-02

Pos logit contributions
+0.044
oured+0.040
or+0.038
ete+0.038
iane+0.038
Neg logit contributions
 ghosts-0.018
 exciting-0.016
 staying-0.016
 cob-0.016
 exc-0.016
Token happier
Token ID23030
Feature activation+1.11e-02

Pos logit contributions
 ghosts+0.031
 cob+0.028
 staying+0.028
 snowy+0.027
 breathing+0.027
Neg logit contributions
-0.036
oured-0.032
ete-0.031
oni-0.030
or-0.030
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
 ghosts+0.017
 staying+0.015
 cob+0.015
 breathing+0.015
 snowy+0.015
Neg logit contributions
-0.024
oured-0.021
ete-0.020
or-0.019
iane-0.019
Token p
Token ID279
Feature activation+2.30e-02

Pos logit contributions
 ghosts+0.018
 cob+0.017
 staying+0.016
 snowy+0.016
 breathing+0.016
Neg logit contributions
-0.021
oured-0.019
ete-0.019
oni-0.018
ar-0.018
Tokenrou
Token ID472
Feature activation-1.75e-01

Pos logit contributions
 staying+0.033
 breathing+0.030
 exc+0.029
 ghosts+0.028
 snowy+0.028
Neg logit contributions
-0.046
 course-0.043
 trigger-0.039
 top-0.037
ian-0.037
Tokender
Token ID1082
Feature activation+6.97e-02

Pos logit contributions
+0.366
oured+0.355
ete+0.343
iane+0.339
igh+0.333
Neg logit contributions
 than-0.232
 ghosts-0.212
 staying-0.205
 exc-0.203
 breathing-0.197
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
 ghosts+0.104
 staying+0.094
 cob+0.093
 snowy+0.091
 breathing+0.091
Neg logit contributions
-0.148
oured-0.131
ete-0.127
iane-0.121
or-0.121
Token He
Token ID679
Feature activation-2.50e-02

Pos logit contributions
 ghosts+0.023
 cob+0.021
 staying+0.021
 snowy+0.021
 breathing+0.020
Neg logit contributions
-0.029
oured-0.026
ete-0.024
or-0.024
 course-0.024
Token displays
Token ID11298
Feature activation-3.26e-02

Pos logit contributions
+0.062
iane+0.051
oured+0.051
or+0.050
ete+0.049
Neg logit contributions
 exc-0.031
 ghosts-0.031
 exciting-0.029
 snowy-0.028
 cob-0.028
Token his
Token ID465
Feature activation+2.15e-02

Pos logit contributions
+0.086
oured+0.071
ete+0.067
ar+0.066
lor+0.066
Neg logit contributions
 ghosts-0.043
 staying-0.037
 exciting-0.036
 breathing-0.036
 cob-0.036
Token a
Token ID257
Feature activation-1.57e-02

Pos logit contributions
+0.095
oured+0.085
ete+0.083
ar+0.079
or+0.079
Neg logit contributions
 ghosts-0.072
 staying-0.068
 cob-0.067
 breathing-0.065
 snowy-0.065
Token piece
Token ID3704
Feature activation-5.45e-03

Pos logit contributions
+0.031
oured+0.028
ete+0.027
iane+0.027
oni+0.026
Neg logit contributions
 ghosts-0.024
 cob-0.023
 snowy-0.023
 staying-0.022
 exciting-0.022
Token of
Token ID286
Feature activation+3.50e-02

Pos logit contributions
+0.012
oured+0.011
ete+0.011
ian+0.010
cient+0.010
Neg logit contributions
 ghosts-0.007
 cob-0.007
 staying-0.007
 than-0.007
 breathing-0.006
Token juicy
Token ID35075
Feature activation-3.01e-02

Pos logit contributions
 ghosts+0.057
 cob+0.053
 staying+0.053
 snowy+0.051
 exciting+0.051
Neg logit contributions
-0.069
oured-0.061
ete-0.059
or-0.057
ar-0.056
Token water
Token ID1660
Feature activation-4.52e-02

Pos logit contributions
+0.061
ete+0.055
oured+0.055
iane+0.052
cient+0.051
Neg logit contributions
 ghosts-0.043
 than-0.042
 cob-0.042
 staying-0.041
 textures-0.040
Tokenmelon
Token ID45690
Feature activation-1.74e-01

Pos logit contributions
+0.103
oured+0.095
ete+0.094
iane+0.089
ian+0.087
Neg logit contributions
 ghosts-0.053
 staying-0.052
 than-0.051
 cob-0.048
 breathing-0.047
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
+0.401
oured+0.370
ete+0.357
ian+0.336
iane+0.336
Neg logit contributions
 ghosts-0.210
 staying-0.200
 than-0.198
 cob-0.190
 exciting-0.186
Token When
Token ID1649
Feature activation-3.70e-02

Pos logit contributions
 ghosts+0.017
 cob+0.015
 staying+0.015
 snowy+0.015
 breathing+0.015
Neg logit contributions
-0.020
oured-0.017
ete-0.017
oni-0.016
or-0.016
Token she
Token ID673
Feature activation+7.09e-04

Pos logit contributions
+0.083
oured+0.077
ete+0.074
oni+0.072
or+0.072
Neg logit contributions
 ghosts-0.049
 cob-0.045
 staying-0.044
 snowy-0.043
 breathing-0.041
Token took
Token ID1718
Feature activation-4.82e-02

Pos logit contributions
 cob+0.001
 staying+0.001
 exc+0.001
 snowy+0.001
 ghosts+0.001
Neg logit contributions
-0.001
 trigger-0.001
cient-0.001
iane-0.001
lor-0.001
Token a
Token ID257
Feature activation-1.62e-02

Pos logit contributions
+0.116
ete+0.107
oured+0.104
iane+0.103
or+0.103
Neg logit contributions
 ghosts-0.055
 cob-0.053
 exciting-0.050
 interesting-0.048
 staying-0.045
Token"
Token ID1
Feature activation-2.70e-02

Pos logit contributions
+0.013
oured+0.012
ete+0.010
oni+0.010
ian+0.010
Neg logit contributions
 ghosts-0.011
 staying-0.010
 cob-0.010
 snowy-0.010
 exc-0.010
TokenHere
Token ID4342
Feature activation-8.10e-02

Pos logit contributions
+0.049
oured+0.045
ete+0.043
or+0.042
oni+0.040
Neg logit contributions
 ghosts-0.047
 cob-0.043
 staying-0.042
 snowy-0.042
 exc-0.041
Token you
Token ID345
Feature activation-2.67e-02

Pos logit contributions
+0.145
oured+0.127
ete+0.121
or+0.117
oni+0.116
Neg logit contributions
 ghosts-0.147
 cob-0.136
 staying-0.135
 snowy-0.131
 breathing-0.130
Token go
Token ID467
Feature activation-7.97e-03

Pos logit contributions
+0.072
oured+0.063
lor+0.062
side+0.058
iane+0.057
Neg logit contributions
 exc-0.029
 ghosts-0.028
 interesting-0.026
 cob-0.026
 exciting-0.025
Token.
Token ID13
Feature activation+1.77e-03

Pos logit contributions
+0.019
oured+0.017
ete+0.016
oni+0.016
ingu+0.015
Neg logit contributions
 ghosts-0.010
 staying-0.009
 than-0.009
 cob-0.009
 exciting-0.009
Token Enjoy
Token ID18179
Feature activation-1.74e-01

Pos logit contributions
 ghosts+0.003
 staying+0.002
 exc+0.002
 cob+0.002
 breathing+0.002
Neg logit contributions
-0.004
oured-0.003
oni-0.003
les-0.003
ete-0.003
Token your
Token ID534
Feature activation-1.25e-02

Pos logit contributions
+0.449
oured+0.396
ete+0.370
iane+0.357
 trigger+0.355
Neg logit contributions
 ghosts-0.200
 staying-0.191
 cob-0.187
 than-0.180
 snowy-0.173
Token grapes
Token ID36656
Feature activation-9.52e-02

Pos logit contributions
+0.026
oured+0.024
ete+0.023
iane+0.022
or+0.022
Neg logit contributions
 ghosts-0.018
 cob-0.017
 staying-0.017
 snowy-0.017
 breathing-0.016
Token.
Token ID13
Feature activation+2.95e-03

Pos logit contributions
+0.200
oured+0.177
ete+0.172
iane+0.164
or+0.164
Neg logit contributions
 ghosts-0.141
 staying-0.129
 cob-0.128
 snowy-0.124
 breathing-0.122
Token They
Token ID1119
Feature activation-3.45e-02

Pos logit contributions
 ghosts+0.004
 exc+0.004
 staying+0.004
 than+0.004
 cob+0.004
Neg logit contributions
-0.006
oured-0.005
les-0.005
ete-0.005
or-0.005
Token are
Token ID389
Feature activation-1.69e-02

Pos logit contributions
+0.073
ete+0.070
oured+0.069
iane+0.065
or+0.065
Neg logit contributions
 exc-0.047
 ghosts-0.045
 staying-0.045
 cob-0.045
 exciting-0.044
Token raven
Token ID37735
Feature activation+7.55e-03

Pos logit contributions
+0.036
oured+0.032
ete+0.031
oni+0.030
or+0.030
Neg logit contributions
 ghosts-0.025
 cob-0.022
 staying-0.022
 snowy-0.021
 breathing-0.021
Token,
Token ID11
Feature activation-5.76e-03

Pos logit contributions
 ghosts+0.012
 cob+0.011
 staying+0.011
 breathing+0.011
 snowy+0.010
Neg logit contributions
-0.016
oured-0.014
ete-0.014
iane-0.013
oni-0.013
Token and
Token ID290
Feature activation+7.33e-03

Pos logit contributions
+0.013
oured+0.011
ete+0.011
oni+0.011
 course+0.011
Neg logit contributions
 ghosts-0.008
 cob-0.007
 staying-0.007
 snowy-0.007
 exciting-0.006
Token it
Token ID340
Feature activation-3.20e-04

Pos logit contributions
 ghosts+0.011
 cob+0.010
 staying+0.009
 snowy+0.009
 breathing+0.009
Neg logit contributions
-0.016
oured-0.014
ete-0.014
or-0.013
oni-0.013
Token was
Token ID373
Feature activation-6.84e-03

Pos logit contributions
+0.001
oured+0.001
ete+0.001
or+0.001
iane+0.001
Neg logit contributions
 ghosts-0.001
 cob-0.000
 staying-0.000
 snowy-0.000
 breathing-0.000
Token sitting
Token ID5586
Feature activation+1.44e-01

Pos logit contributions
+0.013
oni+0.012
oured+0.012
ted+0.011
ete+0.011
Neg logit contributions
 ghosts-0.010
 textures-0.009
 outage-0.009
 cob-0.009
 staying-0.008
Token on
Token ID319
Feature activation+2.43e-02

Pos logit contributions
 ghosts+0.188
 staying+0.166
 snowy+0.166
 cob+0.163
 breathing+0.161
Neg logit contributions
-0.336
oured-0.303
ete-0.299
ian-0.283
oni-0.282
Token the
Token ID262
Feature activation-1.41e-02

Pos logit contributions
 ghosts+0.029
 cob+0.026
 snowy+0.024
 staying+0.023
 breathing+0.023
Neg logit contributions
-0.058
oured-0.057
ete-0.055
or-0.055
iane-0.052
Token fence
Token ID13990
Feature activation+2.64e-02

Pos logit contributions
+0.033
oured+0.029
ete+0.028
or+0.027
ar+0.027
Neg logit contributions
 ghosts-0.019
 cob-0.018
 snowy-0.017
 staying-0.017
 breathing-0.017
Token.
Token ID13
Feature activation+8.29e-03

Pos logit contributions
 ghosts+0.034
 staying+0.032
 cob+0.031
 snowy+0.031
 breathing+0.031
Neg logit contributions
-0.060
oured-0.054
ete-0.054
iane-0.052
ian-0.051
Token It
Token ID632
Feature activation+1.41e-02

Pos logit contributions
 ghosts+0.010
 cob+0.009
 snowy+0.009
 staying+0.009
 raining+0.009
Neg logit contributions
-0.018
oured-0.016
or-0.016
igh-0.016
 course-0.016
Token
Token ID198
Feature activation-5.59e-03

Pos logit contributions
+0.002
oured+0.002
ete+0.002
oni+0.002
iane+0.002
Neg logit contributions
 ghosts-0.003
 staying-0.002
 cob-0.002
 snowy-0.002
 breathing-0.002
Token
Token ID198
Feature activation-5.69e-03

Pos logit contributions
+0.010
oured+0.009
ete+0.008
oni+0.008
or+0.008
Neg logit contributions
 ghosts-0.010
 cob-0.009
 staying-0.009
 snowy-0.009
 breathing-0.009
Token"
Token ID1
Feature activation-2.96e-02

Pos logit contributions
+0.010
oured+0.009
ete+0.008
oni+0.008
als+0.008
Neg logit contributions
 ghosts-0.011
 exc-0.010
 cob-0.010
 snowy-0.010
 staying-0.010
TokenI
Token ID40
Feature activation-5.27e-02

Pos logit contributions
+0.049
 course+0.039
ete+0.037
oni+0.037
oured+0.037
Neg logit contributions
 ghosts-0.060
 cob-0.057
 staying-0.056
 snowy-0.054
 breathing-0.054
Token'm
Token ID1101
Feature activation-2.93e-02

Pos logit contributions
+0.093
oured+0.081
ete+0.078
oni+0.075
or+0.074
Neg logit contributions
 ghosts-0.097
 cob-0.090
 staying-0.089
 snowy-0.087
 breathing-0.086
Token OK
Token ID7477
Feature activation+1.11e-01

Pos logit contributions
+0.051
ete+0.041
 course+0.041
oured+0.041
on+0.040
Neg logit contributions
 ghosts-0.052
 textures-0.052
 cob-0.049
 snowy-0.049
 than-0.047
Token,
Token ID11
Feature activation-1.27e-02

Pos logit contributions
 cob+0.239
 ghosts+0.239
 snowy+0.237
 textures+0.230
 staying+0.227
Neg logit contributions
-0.133
um-0.104
ar-0.101
 course-0.099
ete-0.097
Token Lily
Token ID20037
Feature activation+9.12e-03

Pos logit contributions
+0.029
oured+0.025
ete+0.025
 course+0.024
oni+0.024
Neg logit contributions
 ghosts-0.017
 staying-0.015
 cob-0.015
 snowy-0.015
 breathing-0.015
Token.
Token ID13
Feature activation-1.95e-03

Pos logit contributions
 ghosts+0.017
 cob+0.015
 staying+0.015
 snowy+0.015
 breathing+0.015
Neg logit contributions
-0.015
oured-0.013
oni-0.013
ete-0.013
 course-0.013
Token It
Token ID632
Feature activation+1.14e-02

Pos logit contributions
+0.004
als+0.003
oured+0.003
ar+0.003
ete+0.003
Neg logit contributions
 ghosts-0.003
 staying-0.003
 cob-0.003
 snowy-0.003
 exciting-0.003
Token's
Token ID338
Feature activation+4.17e-02

Pos logit contributions
 ghosts+0.020
 cob+0.019
 staying+0.019
 snowy+0.018
 breathing+0.018
Neg logit contributions
-0.021
oured-0.018
ete-0.018
or-0.017
oni-0.017
Token land
Token ID1956
Feature activation+1.95e-02

Pos logit contributions
+0.031
oured+0.027
ete+0.026
oni+0.025
or+0.025
Neg logit contributions
 ghosts-0.028
 cob-0.026
 staying-0.026
 snowy-0.025
 exciting-0.025
Token of
Token ID286
Feature activation+3.10e-02

Pos logit contributions
 ghosts+0.026
 staying+0.024
 snowy+0.023
 than+0.023
 cob+0.023
Neg logit contributions
-0.044
oured-0.040
ete-0.038
ian-0.037
oni-0.037
Token muff
Token ID27563
Feature activation+2.26e-02

Pos logit contributions
 ghosts+0.051
 breathing+0.046
 cob+0.046
 staying+0.046
 snowy+0.046
Neg logit contributions
-0.056
 course-0.048
oured-0.047
oni-0.046
ete-0.046
Tokenins
Token ID1040
Feature activation-3.60e-02

Pos logit contributions
 ghosts+0.040
 staying+0.037
 cob+0.036
 snowy+0.036
 breathing+0.035
Neg logit contributions
-0.041
oured-0.036
ete-0.035
 trigger-0.033
iane-0.033
Token.
Token ID13
Feature activation+3.97e-05

Pos logit contributions
+0.082
oured+0.072
ete+0.070
 course+0.067
iane+0.067
Neg logit contributions
 ghosts-0.049
 staying-0.046
 than-0.043
 breathing-0.043
 cob-0.042
Token There
Token ID1318
Feature activation+1.03e-01

Pos logit contributions
 ghosts+0.000
 cob+0.000
 staying+0.000
 snowy+0.000
 breathing+0.000
Neg logit contributions
-0.000
oured-0.000
ete-0.000
als-0.000
oni-0.000
Token are
Token ID389
Feature activation-1.75e-02

Pos logit contributions
 ghosts+0.183
 cob+0.171
 staying+0.170
 snowy+0.163
 exciting+0.161
Neg logit contributions
-0.190
oured-0.168
ete-0.157
oni-0.152
iane-0.150
Token many
Token ID867
Feature activation+2.90e-02

Pos logit contributions
+0.032
oured+0.028
ete+0.028
oni+0.028
 course+0.027
Neg logit contributions
 ghosts-0.029
 cob-0.026
 staying-0.026
 breathing-0.026
 snowy-0.025
Token muff
Token ID27563
Feature activation+2.28e-02

Pos logit contributions
 ghosts+0.042
 staying+0.038
 breathing+0.038
 outage+0.037
 cob+0.037
Neg logit contributions
-0.056
 course-0.051
oured-0.050
les-0.050
als-0.049
Tokenins
Token ID1040
Feature activation-3.60e-02

Pos logit contributions
 ghosts+0.035
 staying+0.032
 cob+0.031
 snowy+0.031
 breathing+0.030
Neg logit contributions
-0.047
oured-0.042
ete-0.040
iane-0.038
 trigger-0.038
Token like
Token ID588
Feature activation-1.71e-02

Pos logit contributions
+0.083
oured+0.073
ete+0.071
 course+0.067
ian+0.067
Neg logit contributions
 ghosts-0.050
 staying-0.046
 than-0.044
 breathing-0.043
 cob-0.043
Token being
Token ID852
Feature activation+1.08e-01

Pos logit contributions
+0.039
 course+0.036
oni+0.035
on+0.033
 top+0.032
Neg logit contributions
 ghosts-0.033
 cob-0.032
 snowy-0.031
 cigarettes-0.031
 Esk-0.031
Token adventurous
Token ID39770
Feature activation+1.25e-02

Pos logit contributions
 ghosts+0.139
 cob+0.131
 snowy+0.127
 staying+0.124
 breathing+0.119
Neg logit contributions
-0.232
oured-0.212
oni-0.208
ete-0.207
 course-0.203
Token is
Token ID318
Feature activation-1.60e-02

Pos logit contributions
 ghosts+0.023
 cob+0.021
 staying+0.021
 snowy+0.021
 breathing+0.021
Neg logit contributions
-0.021
oured-0.018
ete-0.017
 course-0.017
oni-0.017
Token not
Token ID407
Feature activation+3.26e-02

Pos logit contributions
+0.034
oured+0.030
oni+0.030
ete+0.030
 course+0.029
Neg logit contributions
 ghosts-0.022
 cob-0.020
 snowy-0.019
 staying-0.019
 breathing-0.019
Token about
Token ID546
Feature activation+9.91e-02

Pos logit contributions
 ghosts+0.045
 snowy+0.042
 cob+0.042
 staying+0.040
 cigarettes+0.040
Neg logit contributions
-0.062
oni-0.057
 course-0.056
oured-0.056
ete-0.054
Token being
Token ID852
Feature activation+1.09e-01

Pos logit contributions
 ghosts+0.114
 cob+0.101
 snowy+0.098
 staying+0.098
 breathing+0.093
Neg logit contributions
-0.240
oured-0.213
ete-0.209
oni-0.206
 course-0.203
Token dangerous
Token ID4923
Feature activation+9.75e-03

Pos logit contributions
 ghosts+0.132
 staying+0.115
 cob+0.114
 breathing+0.110
 snowy+0.108
Neg logit contributions
-0.265
oured-0.235
ete-0.230
or-0.223
iane-0.222
Token.
Token ID13
Feature activation-2.66e-04

Pos logit contributions
 ghosts+0.017
 cob+0.016
 snowy+0.016
 staying+0.016
 exc+0.015
Neg logit contributions
-0.016
oured-0.014
les-0.014
 course-0.014
oni-0.014
Token It
Token ID632
Feature activation+1.51e-02

Pos logit contributions
+0.001
oured+0.000
ete+0.000
oni+0.000
or+0.000
Neg logit contributions
 ghosts-0.000
 cob-0.000
 staying-0.000
 snowy-0.000
 breathing-0.000
Token is
Token ID318
Feature activation-1.61e-02

Pos logit contributions
 ghosts+0.024
 cob+0.022
 staying+0.022
 exc+0.022
 snowy+0.021
Neg logit contributions
-0.031
oured-0.028
ete-0.027
or-0.026
iane-0.026
Token about
Token ID546
Feature activation+9.86e-02

Pos logit contributions
+0.031
oni+0.028
 course+0.027
oured+0.027
ete+0.027
Neg logit contributions
 ghosts-0.024
 cob-0.022
 snowy-0.021
 staying-0.021
 breathing-0.021
Token obedient
Token ID41652
Feature activation-2.25e-02

Pos logit contributions
 ghosts+0.088
 cob+0.081
 snowy+0.079
 outage+0.077
 staying+0.075
Neg logit contributions
-0.140
oured-0.126
 course-0.121
ar-0.119
iane-0.119
Token.
Token ID13
Feature activation+5.91e-03

Pos logit contributions
+0.041
oured+0.036
ete+0.034
oni+0.034
or+0.034
Neg logit contributions
 ghosts-0.038
 cob-0.036
 snowy-0.035
 staying-0.035
 exc-0.034
Token She
Token ID1375
Feature activation-1.99e-02

Pos logit contributions
 ghosts+0.009
 cob+0.009
 staying+0.009
 snowy+0.008
 breathing+0.008
Neg logit contributions
-0.012
oured-0.010
ete-0.010
oni-0.010
ar-0.010
Token knew
Token ID2993
Feature activation-3.41e-02

Pos logit contributions
+0.040
oured+0.034
ete+0.034
iane+0.033
als+0.032
Neg logit contributions
 ghosts-0.032
 cob-0.029
 staying-0.029
 exc-0.029
 snowy-0.029
Token that
Token ID326
Feature activation-1.91e-02

Pos logit contributions
+0.055
 course+0.048
oured+0.047
ete+0.046
or+0.045
Neg logit contributions
 ghosts-0.061
 snowy-0.060
 cob-0.058
 breathing-0.058
 exc-0.057
Token being
Token ID852
Feature activation+1.08e-01

Pos logit contributions
+0.029
 course+0.029
on+0.027
or+0.025
 top+0.024
Neg logit contributions
 Esk-0.030
 snowy-0.030
 ghosts-0.029
 outage-0.029
 cigarettes-0.029
Token obedient
Token ID41652
Feature activation-2.24e-02

Pos logit contributions
 ghosts+0.136
 cob+0.126
 snowy+0.123
 staying+0.120
 breathing+0.117
Neg logit contributions
-0.233
oured-0.213
oni-0.204
ete-0.201
 course-0.200
Token would
Token ID561
Feature activation-7.14e-02

Pos logit contributions
+0.044
oured+0.039
ete+0.038
oni+0.036
iane+0.036
Neg logit contributions
 ghosts-0.037
 cob-0.033
 staying-0.033
 breathing-0.032
 snowy-0.032
Token make
Token ID787
Feature activation+9.44e-04

Pos logit contributions
+0.156
oured+0.145
ete+0.139
als+0.134
or+0.133
Neg logit contributions
 ghosts-0.101
 staying-0.094
 cob-0.090
 exc-0.090
 exciting-0.088
Token her
Token ID607
Feature activation+4.45e-02

Pos logit contributions
 ghosts+0.001
 cob+0.001
 staying+0.001
 snowy+0.001
 breathing+0.001
Neg logit contributions
-0.002
oured-0.002
ete-0.002
or-0.002
oni-0.002
Token mom
Token ID1995
Feature activation+3.61e-02

Pos logit contributions
 ghosts+0.069
 staying+0.064
 cob+0.063
 snowy+0.062
 breathing+0.061
Neg logit contributions
-0.086
oured-0.073
ete-0.071
oni-0.070
or-0.070
Token<|endoftext|>
Token ID50256
Feature activation-1.23e-02

Pos logit contributions
+0.000
ete+0.000
oni+0.000
iane+0.000
oured+0.000
Neg logit contributions
 ghosts-0.000
 cob-0.000
 staying-0.000
 cigarettes-0.000
 oh-0.000
TokenBen
Token ID11696
Feature activation+5.71e-03

Pos logit contributions
+0.018
oured+0.017
ete+0.015
oni+0.014
or+0.014
Neg logit contributions
 ghosts-0.025
 staying-0.023
 cob-0.023
 snowy-0.023
 breathing-0.023
Token liked
Token ID8288
Feature activation-1.58e-02

Pos logit contributions
 ghosts+0.009
 cob+0.009
 staying+0.008
 snowy+0.008
 breathing+0.008
Neg logit contributions
-0.012
oured-0.010
ete-0.010
or-0.009
oni-0.009
Token to
Token ID284
Feature activation-1.17e-02

Pos logit contributions
+0.024
oni+0.020
oured+0.020
ete+0.020
ted+0.019
Neg logit contributions
 ghosts-0.031
 cob-0.029
 staying-0.028
 snowy-0.027
 breathing-0.027
Token balance
Token ID5236
Feature activation-3.26e-02

Pos logit contributions
+0.025
oured+0.024
ete+0.022
ar+0.022
lor+0.022
Neg logit contributions
 exc-0.016
 ghosts-0.016
 cob-0.015
 exciting-0.015
 staying-0.015
Token on
Token ID319
Feature activation+2.53e-02

Pos logit contributions
+0.066
oured+0.058
ete+0.056
oni+0.055
or+0.054
Neg logit contributions
 ghosts-0.051
 cob-0.047
 staying-0.046
 snowy-0.045
 breathing-0.045
Token the
Token ID262
Feature activation-9.16e-03

Pos logit contributions
 ghosts+0.030
 cob+0.029
 snowy+0.028
 exciting+0.027
 staying+0.027
Neg logit contributions
-0.060
oured-0.056
iane-0.053
ete-0.053
cient-0.052
Token wall
Token ID3355
Feature activation+5.67e-02

Pos logit contributions
+0.021
oured+0.019
ete+0.018
ar+0.018
or+0.018
Neg logit contributions
 ghosts-0.012
 cob-0.012
 snowy-0.011
 staying-0.011
 exciting-0.011
Token.
Token ID13
Feature activation+7.53e-03

Pos logit contributions
 ghosts+0.084
 cob+0.077
 staying+0.076
 snowy+0.074
 breathing+0.074
Neg logit contributions
-0.120
oured-0.106
ete-0.104
iane-0.099
oni-0.099
Token It
Token ID632
Feature activation+1.54e-02

Pos logit contributions
 ghosts+0.014
 cob+0.013
 staying+0.013
 snowy+0.012
 breathing+0.012
Neg logit contributions
-0.013
oured-0.011
ete-0.011
oni-0.011
or-0.011
Token was
Token ID373
Feature activation-3.79e-03

Pos logit contributions
 ghosts+0.033
 staying+0.030
 cigarettes+0.029
 breathing+0.029
 emergencies+0.029
Neg logit contributions
-0.021
ar-0.017
 course-0.017
les-0.017
on-0.017
Token will
Token ID481
Feature activation-6.17e-02

Pos logit contributions
+0.071
oured+0.065
ete+0.061
oni+0.060
um+0.060
Neg logit contributions
 ghosts-0.053
 exc-0.049
 cob-0.048
 staying-0.048
 exciting-0.047
Token keep
Token ID1394
Feature activation+1.75e-02

Pos logit contributions
+0.137
oured+0.126
ete+0.120
or+0.118
oni+0.117
Neg logit contributions
 ghosts-0.085
 staying-0.078
 cob-0.076
 exc-0.076
 snowy-0.074
Token looking
Token ID2045
Feature activation+2.88e-02

Pos logit contributions
 ghosts+0.033
 cob+0.031
 snowy+0.030
 exc+0.030
 staying+0.030
Neg logit contributions
-0.029
oured-0.024
ete-0.024
oni-0.023
 course-0.023
Token until
Token ID1566
Feature activation-2.07e-02

Pos logit contributions
 ghosts+0.044
 cob+0.041
 staying+0.040
 breathing+0.039
 snowy+0.038
Neg logit contributions
-0.055
oured-0.050
ete-0.049
or-0.048
ar-0.047
Token they
Token ID484
Feature activation-3.43e-02

Pos logit contributions
+0.040
oured+0.033
ete+0.033
oni+0.032
 course+0.032
Neg logit contributions
 ghosts-0.034
 cob-0.032
 staying-0.031
 breathing-0.030
 snowy-0.030
Token do
Token ID466
Feature activation+4.04e-02

Pos logit contributions
+0.075
oured+0.070
um+0.070
oni+0.068
ete+0.067
Neg logit contributions
 exc-0.044
 ghosts-0.043
 staying-0.041
 cob-0.039
 exciting-0.039
Token.
Token ID13
Feature activation+6.66e-04

Pos logit contributions
 ghosts+0.055
 staying+0.050
 cob+0.049
 exciting+0.048
 breathing+0.048
Neg logit contributions
-0.094
oured-0.083
ete-0.080
iane-0.079
als-0.078
Token She
Token ID1375
Feature activation-2.72e-02

Pos logit contributions
 ghosts+0.001
 cob+0.001
 staying+0.001
 snowy+0.001
 breathing+0.001
Neg logit contributions
-0.001
ete-0.001
oured-0.001
iane-0.001
oni-0.001
Token says
Token ID1139
Feature activation-6.96e-03

Pos logit contributions
+0.057
oured+0.049
ete+0.048
or+0.047
iane+0.046
Neg logit contributions
 ghosts-0.042
 cob-0.039
 staying-0.038
 snowy-0.038
 exc-0.037
Token they
Token ID484
Feature activation-3.46e-02

Pos logit contributions
+0.013
oured+0.011
ete+0.011
 course+0.011
oni+0.011
Neg logit contributions
 ghosts-0.011
 cob-0.011
 staying-0.011
 snowy-0.010
 breathing-0.010
Token will
Token ID481
Feature activation-6.15e-02

Pos logit contributions
+0.063
oured+0.055
ete+0.054
oni+0.052
iane+0.052
Neg logit contributions
 ghosts-0.062
 staying-0.056
 cob-0.056
 exc-0.056
 exciting-0.055
Token Lily
Token ID20037
Feature activation+9.23e-03

Pos logit contributions
+0.000
oured+0.000
ete+0.000
or+0.000
oni+0.000
Neg logit contributions
 ghosts-0.000
 cob-0.000
 staying-0.000
 snowy-0.000
 breathing-0.000
Token.
Token ID13
Feature activation+8.83e-03

Pos logit contributions
 ghosts+0.014
 cob+0.013
 staying+0.013
 snowy+0.012
 breathing+0.012
Neg logit contributions
-0.020
oured-0.018
ete-0.017
or-0.016
iane-0.016
Token Thank
Token ID6952
Feature activation-1.15e-02

Pos logit contributions
 ghosts+0.013
 cob+0.012
 staying+0.012
 exc+0.011
 snowy+0.011
Neg logit contributions
-0.018
oured-0.016
ete-0.016
oni-0.015
or-0.015
Token you
Token ID345
Feature activation-2.79e-02

Pos logit contributions
+0.025
oured+0.024
oni+0.022
ian+0.022
ete+0.022
Neg logit contributions
 ghosts-0.016
 staying-0.015
 oh-0.014
 exc-0.014
 than-0.014
Token!"
Token ID2474
Feature activation-5.06e-02

Pos logit contributions
+0.075
oured+0.066
hed+0.063
ian+0.063
ted+0.063
Neg logit contributions
 ghosts-0.030
 staying-0.025
 cob-0.025
 exciting-0.024
 snowy-0.023
Token Tom
Token ID4186
Feature activation+3.24e-03

Pos logit contributions
+0.108
oured+0.097
ete+0.091
or+0.090
ar+0.089
Neg logit contributions
 ghosts-0.074
 staying-0.069
 breathing-0.068
 exc-0.067
 cob-0.067
Token said
Token ID531
Feature activation-6.72e-03

Pos logit contributions
 exc+0.003
 cob+0.003
 about+0.002
 ghosts+0.002
 staying+0.002
Neg logit contributions
-0.010
ifying-0.008
lor-0.008
or-0.008
 trigger-0.008
Token,
Token ID11
Feature activation-5.41e-03

Pos logit contributions
+0.016
oured+0.014
 course+0.014
iane+0.014
ete+0.014
Neg logit contributions
 breathing-0.008
 ghosts-0.008
 staying-0.008
 exc-0.007
 than-0.007
Token putting
Token ID5137
Feature activation+4.76e-02

Pos logit contributions
+0.013
or+0.012
ete+0.012
oured+0.012
ar+0.012
Neg logit contributions
 staying-0.006
 breathing-0.006
 ghosts-0.006
 exc-0.006
 cob-0.005
Token it
Token ID340
Feature activation+1.25e-03

Pos logit contributions
 ghosts+0.053
 cob+0.050
 staying+0.047
 cigarettes+0.047
 oh+0.046
Neg logit contributions
-0.116
ete-0.102
oured-0.101
lor-0.100
cient-0.098
Token on
Token ID319
Feature activation+2.47e-02

Pos logit contributions
 ghosts+0.002
 cob+0.002
 staying+0.002
 snowy+0.002
 breathing+0.002
Neg logit contributions
-0.003
oured-0.002
ete-0.002
oni-0.002
or-0.002
Token the
Token ID262
Feature activation-1.12e-02

Pos logit contributions
 ghosts+0.039
 cob+0.035
 staying+0.035
 snowy+0.034
 breathing+0.034
Neg logit contributions
-0.053
oured-0.045
ete-0.045
 course-0.044
oni-0.044
Token slide
Token ID10649
Feature activation+2.90e-03

Pos logit contributions
+0.024
oured+0.021
ete+0.021
or+0.020
oni+0.020
Neg logit contributions
 ghosts-0.016
 cob-0.015
 staying-0.015
 snowy-0.014
 breathing-0.014
Token too
Token ID1165
Feature activation+3.58e-02

Pos logit contributions
 ghosts+0.004
 staying+0.004
 cob+0.004
 snowy+0.004
 exciting+0.004
Neg logit contributions
-0.006
oured-0.005
ete-0.005
iane-0.005
oni-0.005
Token.
Token ID13
Feature activation+6.16e-03

Pos logit contributions
 ghosts+0.069
 cob+0.065
 staying+0.064
 snowy+0.062
 breathing+0.062
Neg logit contributions
-0.060
ete-0.048
ar-0.047
 course-0.047
oni-0.046
Token They
Token ID1119
Feature activation-3.74e-02

Pos logit contributions
 ghosts+0.012
 cob+0.011
 staying+0.011
 breathing+0.011
 snowy+0.011
Neg logit contributions
-0.011
ete-0.009
ar-0.008
oni-0.008
um-0.008
Token waited
Token ID13488
Feature activation+3.66e-02

Pos logit contributions
+0.085
oured+0.074
ete+0.073
or+0.070
oni+0.070
Neg logit contributions
 ghosts-0.051
 cob-0.046
 exc-0.045
 snowy-0.044
 exciting-0.044
Token and
Token ID290
Feature activation+7.73e-03

Pos logit contributions
 ghosts+0.059
 cob+0.053
 snowy+0.051
 staying+0.051
 exc+0.050
Neg logit contributions
-0.071
oured-0.063
ete-0.061
oni-0.060
or-0.060
Token waited
Token ID13488
Feature activation+3.71e-02

Pos logit contributions
 ghosts+0.014
 cob+0.013
 staying+0.013
 snowy+0.012
 breathing+0.012
Neg logit contributions
-0.014
oured-0.012
ete-0.012
oni-0.011
or-0.011
Token,
Token ID11
Feature activation-2.52e-03

Pos logit contributions
 ghosts+0.061
 cob+0.056
 staying+0.054
 snowy+0.054
 exc+0.052
Neg logit contributions
-0.072
oured-0.063
ete-0.061
oni-0.060
or-0.059
Token but
Token ID475
Feature activation+2.55e-02

Pos logit contributions
+0.007
oured+0.006
ete+0.006
oni+0.006
 course+0.006
Neg logit contributions
 ghosts-0.003
 cob-0.002
 snowy-0.002
 staying-0.002
 exciting-0.002
Token the
Token ID262
Feature activation-9.46e-03

Pos logit contributions
 ghosts+0.044
 cob+0.041
 breathing+0.040
 exc+0.040
 snowy+0.040
Neg logit contributions
-0.045
ete-0.039
oured-0.038
oni-0.037
ar-0.037
Token wanted
Token ID2227
Feature activation+4.10e-02

Pos logit contributions
+0.043
ete+0.038
oured+0.038
iane+0.037
um+0.036
Neg logit contributions
 ghosts-0.028
 staying-0.028
 exc-0.027
 exciting-0.027
 than-0.026
Token one
Token ID530
Feature activation-1.18e-02

Pos logit contributions
 ghosts+0.056
 staying+0.053
 than+0.050
 cob+0.049
 breathing+0.049
Neg logit contributions
-0.092
oured-0.082
ete-0.079
iane-0.077
ian-0.075
Token day
Token ID1110
Feature activation-4.99e-02

Pos logit contributions
+0.024
oured+0.021
ete+0.021
oni+0.020
or+0.020
Neg logit contributions
 ghosts-0.018
 cob-0.017
 staying-0.016
 snowy-0.016
 breathing-0.016
Token."
Token ID526
Feature activation-1.47e-02

Pos logit contributions
+0.104
oured+0.091
ete+0.089
iane+0.085
oni+0.085
Neg logit contributions
 ghosts-0.075
 staying-0.069
 cob-0.067
 snowy-0.066
 breathing-0.066
Token
Token ID220
Feature activation-7.33e-02

Pos logit contributions
+0.031
ete+0.028
oured+0.028
oni+0.027
or+0.027
Neg logit contributions
 ghosts-0.020
 cob-0.019
 staying-0.018
 breathing-0.018
 snowy-0.017
Token
Token ID198
Feature activation+9.22e-03

Pos logit contributions
+0.140
oured+0.127
ete+0.121
 course+0.119
ar+0.119
Neg logit contributions
 ghosts-0.113
 staying-0.107
 cob-0.105
 exc-0.103
 breathing-0.103
Token
Token ID198
Feature activation+9.36e-03

Pos logit contributions
 ghosts+0.015
 cob+0.014
 staying+0.014
 snowy+0.014
 breathing+0.014
Neg logit contributions
-0.018
oured-0.016
ete-0.015
oni-0.014
als-0.014
TokenL
Token ID43
Feature activation+2.78e-03

Pos logit contributions
 ghosts+0.013
 cob+0.012
 staying+0.012
 snowy+0.011
 exc+0.011
Neg logit contributions
-0.021
oured-0.018
ete-0.018
oni-0.017
als-0.017
Tokenily
Token ID813
Feature activation+6.91e-02

Pos logit contributions
 ghosts+0.004
 cob+0.003
 staying+0.003
 snowy+0.003
 breathing+0.003
Neg logit contributions
-0.006
oured-0.006
ete-0.006
oni-0.005
or-0.005
Token smiled
Token ID13541
Feature activation-2.61e-02

Pos logit contributions
 cob+0.076
 exc+0.076
 staying+0.073
 exciting+0.072
 sob+0.072
Neg logit contributions
-0.155
 trigger-0.141
ifying-0.138
or-0.135
igh-0.135
Token and
Token ID290
Feature activation+1.21e-02

Pos logit contributions
+0.054
oured+0.047
ete+0.046
or+0.044
oni+0.044
Neg logit contributions
 ghosts-0.040
 cob-0.037
 staying-0.036
 snowy-0.036
 breathing-0.035
Token up
Token ID510
Feature activation-5.80e-02

Pos logit contributions
+0.115
oured+0.096
ete+0.094
ian+0.093
iane+0.093
Neg logit contributions
 ghosts-0.054
 cob-0.052
 interesting-0.051
 exciting-0.050
 than-0.050
Token the
Token ID262
Feature activation-1.66e-02

Pos logit contributions
+0.136
oured+0.121
ete+0.117
iane+0.114
or+0.114
Neg logit contributions
 ghosts-0.078
 cob-0.070
 staying-0.068
 snowy-0.068
 exciting-0.067
Token nuts
Token ID14380
Feature activation+2.53e-02

Pos logit contributions
+0.034
oured+0.030
ete+0.029
or+0.028
iane+0.028
Neg logit contributions
 ghosts-0.026
 cob-0.024
 staying-0.024
 snowy-0.024
 breathing-0.023
Token.
Token ID13
Feature activation-2.69e-03

Pos logit contributions
 ghosts+0.036
 staying+0.033
 cob+0.033
 snowy+0.032
 breathing+0.032
Neg logit contributions
-0.054
oured-0.048
ete-0.047
iane-0.045
ar-0.045
Token They
Token ID1119
Feature activation-3.98e-02

Pos logit contributions
+0.005
ete+0.004
oni+0.004
oured+0.003
als+0.003
Neg logit contributions
 ghosts-0.005
 staying-0.005
 cob-0.005
 breathing-0.005
 snowy-0.005
Token saw
Token ID2497
Feature activation-5.96e-03

Pos logit contributions
+0.083
ete+0.072
oured+0.072
or+0.071
um+0.069
Neg logit contributions
 ghosts-0.061
 cob-0.057
 exc-0.056
 staying-0.055
 snowy-0.054
Token them
Token ID606
Feature activation-2.21e-02

Pos logit contributions
+0.013
oured+0.012
ete+0.011
or+0.011
iane+0.011
Neg logit contributions
 ghosts-0.009
 cob-0.008
 staying-0.008
 snowy-0.008
 breathing-0.007
Token hide
Token ID7808
Feature activation+4.05e-02

Pos logit contributions
+0.048
oured+0.043
ete+0.042
or+0.040
oni+0.040
Neg logit contributions
 ghosts-0.032
 staying-0.029
 cob-0.029
 snowy-0.028
 breathing-0.027
Token them
Token ID606
Feature activation-2.21e-02

Pos logit contributions
 ghosts+0.058
 cob+0.053
 staying+0.051
 snowy+0.050
 breathing+0.050
Neg logit contributions
-0.091
oured-0.079
ete-0.077
iane-0.074
oni-0.074
Token in
Token ID287
Feature activation+1.98e-02

Pos logit contributions
+0.043
oured+0.038
ete+0.036
oni+0.036
or+0.035
Neg logit contributions
 ghosts-0.036
 cob-0.033
 staying-0.032
 snowy-0.031
 breathing-0.031
Token their
Token ID511
Feature activation+2.35e-02

Pos logit contributions
 ghosts+0.035
 staying+0.033
 cob+0.032
 snowy+0.031
 exc+0.031
Neg logit contributions
-0.035
 course-0.029
ete-0.028
oni-0.028
oured-0.028
Token was
Token ID373
Feature activation-1.22e-03

Pos logit contributions
 ghosts+0.038
 staying+0.036
 breathing+0.035
 cob+0.034
 snowy+0.034
Neg logit contributions
-0.026
ted-0.021
als-0.020
ar-0.020
oni-0.020
Token a
Token ID257
Feature activation-1.07e-02

Pos logit contributions
+0.002
oni+0.002
 top+0.001
oured+0.001
iane+0.001
Neg logit contributions
 ghosts-0.002
 staying-0.002
 exc-0.002
 breathing-0.002
 outage-0.002
Token big
Token ID1263
Feature activation-2.02e-02

Pos logit contributions
+0.022
oured+0.018
oni+0.018
ete+0.018
 course+0.018
Neg logit contributions
 ghosts-0.016
 staying-0.015
 breathing-0.014
 cob-0.014
 exc-0.014
Token,
Token ID11
Feature activation-3.11e-03

Pos logit contributions
+0.036
 course+0.030
oured+0.029
ar+0.028
oni+0.028
Neg logit contributions
 ghosts-0.036
 staying-0.033
 breathing-0.033
 exc-0.032
 cob-0.032
Token red
Token ID2266
Feature activation+1.00e-02

Pos logit contributions
+0.005
on+0.005
 top+0.005
 course+0.005
oured+0.005
Neg logit contributions
 ghosts-0.004
 staying-0.004
 outage-0.004
 breathing-0.004
asured-0.004
Token balloon
Token ID21190
Feature activation-2.62e-02

Pos logit contributions
 ghosts+0.014
 cob+0.014
 snowy+0.013
 staying+0.013
 rainy+0.012
Neg logit contributions
-0.021
ete-0.020
oured-0.020
iane-0.018
or-0.018
Token floating
Token ID12462
Feature activation+2.32e-02

Pos logit contributions
+0.047
ete+0.042
oured+0.042
or+0.039
iane+0.039
Neg logit contributions
 ghosts-0.046
 cob-0.044
 staying-0.043
 snowy-0.042
 breathing-0.042
Token in
Token ID287
Feature activation+2.29e-02

Pos logit contributions
 ghosts+0.026
 cob+0.023
 staying+0.023
 snowy+0.023
 than+0.022
Neg logit contributions
-0.058
oured-0.052
ete-0.052
oni-0.049
ian-0.049
Token the
Token ID262
Feature activation-9.08e-03

Pos logit contributions
 ghosts+0.040
 staying+0.037
 cob+0.036
 breathing+0.036
 snowy+0.035
Neg logit contributions
-0.041
oured-0.035
 course-0.035
ete-0.034
oni-0.034
Token sky
Token ID6766
Feature activation-9.87e-03

Pos logit contributions
+0.015
oured+0.013
ete+0.013
oni+0.012
or+0.012
Neg logit contributions
 ghosts-0.018
 cob-0.017
 snowy-0.016
 staying-0.016
 breathing-0.016
Token!
Token ID0
Feature activation-5.49e-02

Pos logit contributions
+0.020
oured+0.018
ete+0.017
or+0.016
iane+0.016
Neg logit contributions
 ghosts-0.015
 staying-0.014
 cob-0.014
 snowy-0.014
 breathing-0.014
Token was
Token ID373
Feature activation-8.99e-04

Pos logit contributions
+0.013
iane+0.011
ete+0.011
oured+0.011
oni+0.011
Neg logit contributions
 ghosts-0.008
 staying-0.008
 cob-0.007
 snowy-0.007
 breathing-0.007
Token confused
Token ID10416
Feature activation+9.55e-03

Pos logit contributions
+0.002
oured+0.002
ete+0.002
oni+0.002
ar+0.002
Neg logit contributions
 ghosts-0.001
 cob-0.001
 snowy-0.001
 staying-0.001
 textures-0.001
Token,
Token ID11
Feature activation-3.71e-03

Pos logit contributions
 ghosts+0.017
 cob+0.016
 snowy+0.016
 staying+0.016
 exc+0.016
Neg logit contributions
-0.016
oured-0.014
ete-0.014
or-0.013
oni-0.013
Token but
Token ID475
Feature activation+3.39e-02

Pos logit contributions
+0.010
oured+0.009
ete+0.009
 course+0.008
oni+0.008
Neg logit contributions
 ghosts-0.003
 cob-0.003
 staying-0.003
 snowy-0.003
 exc-0.003
Token the
Token ID262
Feature activation-8.24e-03

Pos logit contributions
 ghosts+0.063
 cob+0.058
 staying+0.057
 snowy+0.056
 breathing+0.056
Neg logit contributions
-0.059
oured-0.051
ete-0.049
oni-0.047
or-0.047
Token park
Token ID3952
Feature activation-5.17e-02

Pos logit contributions
+0.018
oured+0.016
ete+0.016
or+0.015
oni+0.015
Neg logit contributions
 ghosts-0.012
 cob-0.011
 staying-0.011
 exciting-0.011
 snowy-0.011
Token keeper
Token ID28356
Feature activation-3.93e-02

Pos logit contributions
+0.103
oured+0.091
ete+0.088
or+0.084
oni+0.084
Neg logit contributions
 ghosts-0.083
 cob-0.076
 staying-0.075
 snowy-0.073
 breathing-0.072
Token was
Token ID373
Feature activation-1.65e-03

Pos logit contributions
+0.081
oured+0.068
ete+0.067
iane+0.066
oni+0.065
Neg logit contributions
 ghosts-0.062
 staying-0.057
 cob-0.056
 exciting-0.054
 snowy-0.053
Token shout
Token ID9191
Feature activation-3.67e-03

Pos logit contributions
+0.004
oured+0.003
ete+0.003
oni+0.003
 course+0.003
Neg logit contributions
 ghosts-0.002
 cob-0.002
 snowy-0.002
 textures-0.002
 staying-0.002
Tokeni
Token ID72
Feature activation-5.31e-02

Pos logit contributions
+0.008
oured+0.007
ete+0.006
oni+0.006
 course+0.006
Neg logit contributions
 ghosts-0.006
 staying-0.005
 cob-0.005
 breathing-0.005
 snowy-0.005
Tokening
Token ID278
Feature activation+2.90e-03

Pos logit contributions
+0.128
 course+0.107
ete+0.106
oured+0.105
iane+0.104
Neg logit contributions
 ghosts-0.067
 breathing-0.064
 staying-0.064
 cob-0.058
 snowy-0.057
Token his
Token ID465
Feature activation+2.28e-02

Pos logit contributions
+0.176
oured+0.155
ete+0.151
oni+0.145
or+0.144
Neg logit contributions
 ghosts-0.128
 cob-0.116
 staying-0.116
 snowy-0.112
 breathing-0.112
Token toy
Token ID13373
Feature activation-2.14e-02

Pos logit contributions
 ghosts+0.040
 cob+0.037
 staying+0.036
 snowy+0.035
 breathing+0.035
Neg logit contributions
-0.042
oured-0.037
ete-0.035
oni-0.034
or-0.034
Token gun
Token ID2485
Feature activation-7.25e-02

Pos logit contributions
+0.041
ete+0.039
oured+0.039
iane+0.036
oni+0.036
Neg logit contributions
 ghosts-0.035
 cob-0.032
 staying-0.031
 breathing-0.030
 than-0.030
Token and
Token ID290
Feature activation+8.55e-03

Pos logit contributions
+0.171
ete+0.154
oured+0.151
iane+0.149
oni+0.142
Neg logit contributions
 ghosts-0.089
 than-0.082
 staying-0.080
 breathing-0.077
 cob-0.077
Token decided
Token ID3066
Feature activation-1.31e-02

Pos logit contributions
 ghosts+0.015
 cob+0.014
 staying+0.014
 snowy+0.013
 breathing+0.013
Neg logit contributions
-0.017
oured-0.014
ete-0.014
or-0.013
oni-0.013
Token to
Token ID284
Feature activation-8.73e-03

Pos logit contributions
+0.026
oured+0.023
ete+0.022
oni+0.022
or+0.021
Neg logit contributions
 ghosts-0.021
 staying-0.019
 cob-0.019
 snowy-0.018
 breathing-0.018
Token play
Token ID711
Feature activation-1.47e-02

Pos logit contributions
+0.021
oured+0.020
ted+0.018
sel+0.018
als+0.018
Neg logit contributions
 exc-0.011
 ghosts-0.010
 staying-0.009
 exciting-0.009
 cob-0.009
Token with
Token ID351
Feature activation-5.43e-03

Pos logit contributions
+0.029
oured+0.025
 course+0.025
ar+0.024
ete+0.024
Neg logit contributions
 ghosts-0.023
 cob-0.022
 snowy-0.021
 staying-0.021
 breathing-0.020
Token his
Token ID465
Feature activation+2.24e-02

Pos logit contributions
+0.012
oured+0.011
ete+0.010
iane+0.010
oni+0.010
Neg logit contributions
 ghosts-0.008
 cob-0.007
 staying-0.007
 snowy-0.007
 exciting-0.007
Token friends
Token ID2460
Feature activation-3.70e-02

Pos logit contributions
 ghosts+0.031
 staying+0.028
 cob+0.028
 snowy+0.027
 breathing+0.027
Neg logit contributions
-0.049
oured-0.043
ete-0.042
oni-0.041
 course-0.041
Token instead
Token ID2427
Feature activation-2.95e-02

Pos logit contributions
+0.080
oured+0.071
ete+0.070
iane+0.068
oni+0.067
Neg logit contributions
 ghosts-0.053
 staying-0.049
 cob-0.047
 breathing-0.045
 snowy-0.045
Token tired
Token ID10032
Feature activation-2.08e-02

Pos logit contributions
 ghosts+0.000
 cob+0.000
 staying+0.000
 cigarettes+0.000
 snowy+0.000
Neg logit contributions
-0.000
oured-0.000
ete-0.000
ar-0.000
oni-0.000
Token and
Token ID290
Feature activation+1.32e-02

Pos logit contributions
+0.040
oured+0.034
ete+0.034
oni+0.033
or+0.033
Neg logit contributions
 ghosts-0.034
 cob-0.031
 staying-0.031
 exc-0.031
 snowy-0.030
Token wanted
Token ID2227
Feature activation+4.13e-02

Pos logit contributions
 ghosts+0.023
 cob+0.020
 staying+0.020
 breathing+0.019
 outage+0.019
Neg logit contributions
-0.024
oured-0.021
ete-0.021
ted-0.020
ar-0.019
Token to
Token ID284
Feature activation-5.30e-03

Pos logit contributions
 ghosts+0.078
 cob+0.073
 exc+0.073
 snowy+0.071
 staying+0.071
Neg logit contributions
-0.066
ete-0.057
oured-0.056
oni-0.053
or-0.052
Token go
Token ID467
Feature activation-4.46e-03

Pos logit contributions
+0.011
oured+0.011
oni+0.010
ted+0.010
iane+0.010
Neg logit contributions
 ghosts-0.008
 exc-0.007
 staying-0.007
 cob-0.007
 snowy-0.007
Token back
Token ID736
Feature activation-7.85e-03

Pos logit contributions
+0.008
oured+0.007
ete+0.007
oni+0.007
or+0.007
Neg logit contributions
 ghosts-0.008
 staying-0.007
 cob-0.007
 snowy-0.007
 breathing-0.007
Token to
Token ID284
Feature activation-5.48e-03

Pos logit contributions
+0.014
oured+0.012
ete+0.012
or+0.011
ar+0.011
Neg logit contributions
 ghosts-0.014
 cob-0.013
 exc-0.013
 staying-0.013
 cigarettes-0.012
Token his
Token ID465
Feature activation+2.49e-02

Pos logit contributions
+0.010
oured+0.009
ete+0.009
 course+0.008
or+0.008
Neg logit contributions
 ghosts-0.009
 cob-0.008
 staying-0.008
 snowy-0.008
 exc-0.008
Token cozy
Token ID37438
Feature activation+9.80e-02

Pos logit contributions
 ghosts+0.038
 staying+0.034
 cob+0.033
 breathing+0.033
 snowy+0.032
Neg logit contributions
-0.052
oured-0.045
ete-0.044
oni-0.042
 course-0.042
Token home
Token ID1363
Feature activation-6.77e-02

Pos logit contributions
 ghosts+0.161
 staying+0.149
 exc+0.149
 oh+0.143
 cigarettes+0.142
Neg logit contributions
-0.182
oured-0.156
ete-0.153
als-0.151
um-0.150
Token in
Token ID287
Feature activation+2.53e-02

Pos logit contributions
+0.126
oured+0.109
ete+0.106
or+0.102
oni+0.101
Neg logit contributions
 ghosts-0.116
 cob-0.106
 staying-0.104
 snowy-0.103
 breathing-0.101
TokenYes
Token ID5297
Feature activation-6.56e-02

Pos logit contributions
 ghosts+0.084
 cob+0.076
 staying+0.076
 snowy+0.074
 breathing+0.073
Neg logit contributions
-0.100
oured-0.087
ete-0.085
oni-0.082
or-0.081
Token,
Token ID11
Feature activation-6.43e-03

Pos logit contributions
+0.127
oured+0.110
oni+0.105
iane+0.104
ete+0.103
Neg logit contributions
 ghosts-0.112
 cob-0.102
 staying-0.101
 oh-0.096
 snowy-0.096
Token of
Token ID286
Feature activation+3.11e-02

Pos logit contributions
+0.011
oured+0.010
ete+0.010
 course+0.010
als+0.010
Neg logit contributions
 ghosts-0.011
 cob-0.010
 breathing-0.010
 snowy-0.010
 staying-0.010
Token course
Token ID1781
Feature activation-1.43e-01

Pos logit contributions
 ghosts+0.094
 staying+0.090
 cob+0.090
 exciting+0.088
 snowy+0.088
Neg logit contributions
-0.019
oured-0.012
ete-0.010
oni-0.009
iane-0.008
Token.
Token ID13
Feature activation+4.24e-03

Pos logit contributions
+0.258
oured+0.223
ete+0.215
oni+0.208
or+0.206
Neg logit contributions
 ghosts-0.257
 cob-0.237
 staying-0.235
 snowy-0.230
 breathing-0.228
Token Enjoy
Token ID18179
Feature activation-1.69e-01

Pos logit contributions
 ghosts+0.006
 staying+0.006
 cob+0.005
 exc+0.005
 snowy+0.005
Neg logit contributions
-0.009
oured-0.008
ete-0.008
oni-0.008
or-0.008
Token it
Token ID340
Feature activation+2.48e-03

Pos logit contributions
+0.388
oured+0.342
ete+0.332
oni+0.321
iane+0.319
Neg logit contributions
 ghosts-0.227
 staying-0.210
 cob-0.206
 snowy-0.197
 breathing-0.195
Token!"
Token ID2474
Feature activation-4.45e-02

Pos logit contributions
 ghosts+0.003
 staying+0.003
 than+0.003
 cob+0.003
 exciting+0.003
Neg logit contributions
-0.006
oured-0.005
ete-0.005
iane-0.005
 trigger-0.005
Token
Token ID198
Feature activation+4.34e-03

Pos logit contributions
+0.087
ete+0.077
oured+0.075
oni+0.073
or+0.073
Neg logit contributions
 ghosts-0.066
 cob-0.063
 breathing-0.062
 staying-0.062
 snowy-0.061
Token
Token ID198
Feature activation+4.42e-03

Pos logit contributions
 ghosts+0.007
 cob+0.007
 staying+0.007
 snowy+0.007
 breathing+0.007
Neg logit contributions
-0.008
oured-0.007
ete-0.007
oni-0.007
or-0.007
TokenThe
Token ID464
Feature activation-1.98e-02

Pos logit contributions
 ghosts+0.007
 cob+0.007
 snowy+0.007
 staying+0.007
 exciting+0.007
Neg logit contributions
-0.009
oured-0.008
ete-0.007
oni-0.007
iane-0.007
Token.
Token ID13
Feature activation-2.97e-03

Pos logit contributions
 ghosts+0.013
 staying+0.012
 cob+0.012
 breathing+0.012
 snowy+0.012
Neg logit contributions
-0.019
oured-0.017
ete-0.016
iane-0.016
or-0.016
Token She
Token ID1375
Feature activation-2.76e-02

Pos logit contributions
+0.005
oured+0.004
ete+0.004
ted+0.004
iane+0.003
Neg logit contributions
 ghosts-0.006
 staying-0.006
 cob-0.006
 snowy-0.005
 cigarettes-0.005
Token hugs
Token ID40657
Feature activation-8.66e-02

Pos logit contributions
+0.057
ete+0.048
or+0.047
oured+0.047
iane+0.046
Neg logit contributions
 ghosts-0.046
 cob-0.043
 staying-0.043
 exc-0.041
 snowy-0.041
Token him
Token ID683
Feature activation+2.71e-03

Pos logit contributions
+0.206
oured+0.190
ete+0.176
iane+0.170
ar+0.169
Neg logit contributions
 ghosts-0.130
 staying-0.098
 breathing-0.097
 Kim-0.096
 cob-0.095
Token and
Token ID290
Feature activation-1.24e-03

Pos logit contributions
 ghosts+0.003
 breathing+0.003
 cob+0.003
 snowy+0.003
 staying+0.003
Neg logit contributions
-0.007
ete-0.006
oured-0.006
iane-0.006
or-0.006
Token thanks
Token ID5176
Feature activation-1.02e-01

Pos logit contributions
+0.002
oured+0.002
ete+0.002
or+0.002
ar+0.002
Neg logit contributions
 ghosts-0.002
 cob-0.002
 staying-0.002
 snowy-0.002
 breathing-0.002
Token Sara
Token ID24799
Feature activation+1.83e-02

Pos logit contributions
+0.206
oured+0.178
iane+0.161
ete+0.160
ar+0.159
Neg logit contributions
 ghosts-0.185
 staying-0.158
 cob-0.157
 breathing-0.154
 exciting-0.150
Token and
Token ID290
Feature activation-1.54e-03

Pos logit contributions
 ghosts+0.022
 staying+0.020
 breathing+0.020
 than+0.019
 cob+0.018
Neg logit contributions
-0.049
oured-0.040
ete-0.040
iane-0.038
ar-0.038
Token Ben
Token ID3932
Feature activation-2.37e-02

Pos logit contributions
+0.004
oured+0.003
or+0.003
ar+0.003
ete+0.003
Neg logit contributions
 ghosts-0.002
 staying-0.002
 cob-0.002
 snowy-0.002
 breathing-0.002
Token.
Token ID13
Feature activation-4.22e-03

Pos logit contributions
+0.052
oured+0.045
ete+0.044
iane+0.042
oni+0.042
Neg logit contributions
 ghosts-0.035
 staying-0.032
 cob-0.031
 breathing-0.030
 snowy-0.030
Token She
Token ID1375
Feature activation-2.82e-02

Pos logit contributions
+0.007
oured+0.006
ete+0.005
ted+0.005
iane+0.005
Neg logit contributions
 ghosts-0.008
 cob-0.008
 staying-0.008
 snowy-0.007
 cigarettes-0.007
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
+0.235
oured+0.218
ete+0.213
iane+0.201
or+0.201
Neg logit contributions
 ghosts-0.154
 cob-0.145
 staying-0.140
 snowy-0.136
 breathing-0.135
Token It
Token ID632
Feature activation+1.99e-02

Pos logit contributions
 ghosts+0.026
 cob+0.024
 staying+0.024
 snowy+0.023
 breathing+0.023
Neg logit contributions
-0.023
oured-0.020
ete-0.019
or-0.019
oni-0.019
Token was
Token ID373
Feature activation+1.93e-03

Pos logit contributions
 ghosts+0.031
 cob+0.029
 staying+0.028
 snowy+0.028
 breathing+0.027
Neg logit contributions
-0.040
oured-0.036
ete-0.035
or-0.034
oni-0.033
Token the
Token ID262
Feature activation-5.78e-03

Pos logit contributions
 ghosts+0.003
 staying+0.003
 outage+0.003
 cob+0.003
 snowy+0.003
Neg logit contributions
-0.004
oni-0.003
oured-0.003
ete-0.003
um-0.003
Token biggest
Token ID4094
Feature activation+1.29e-02

Pos logit contributions
+0.013
oured+0.012
ete+0.011
or+0.011
ar+0.011
Neg logit contributions
 ghosts-0.008
 cob-0.007
 staying-0.007
 snowy-0.007
 exc-0.007
Token squash
Token ID34613
Feature activation-1.09e-01

Pos logit contributions
 ghosts+0.025
 cob+0.023
 staying+0.023
 snowy+0.022
 breathing+0.022
Neg logit contributions
-0.021
oured-0.019
ete-0.018
or-0.017
oni-0.017
Token that
Token ID326
Feature activation-1.30e-02

Pos logit contributions
+0.227
oured+0.200
ete+0.195
oni+0.189
or+0.188
Neg logit contributions
 ghosts-0.163
 cob-0.147
 staying-0.146
 snowy-0.143
 breathing-0.141
Token anyone
Token ID2687
Feature activation+6.44e-02

Pos logit contributions
+0.022
on+0.019
 course+0.019
ete+0.018
als+0.018
Neg logit contributions
 ghosts-0.023
 breathing-0.021
 staying-0.021
 cob-0.021
 snowy-0.020
Token had
Token ID550
Feature activation+1.65e-02

Pos logit contributions
 ghosts+0.098
 snowy+0.086
 staying+0.086
 cob+0.085
 breathing+0.084
Neg logit contributions
-0.119
or-0.110
oured-0.109
als-0.109
les-0.108
Token ever
Token ID1683
Feature activation-7.50e-03

Pos logit contributions
 ghosts+0.022
 staying+0.021
 than+0.021
 breathing+0.020
 cob+0.020
Neg logit contributions
-0.038
oured-0.033
ete-0.033
ian-0.032
iane-0.031
Token seen
Token ID1775
Feature activation+4.38e-02

Pos logit contributions
+0.017
oured+0.015
ete+0.015
oni+0.014
or+0.014
Neg logit contributions
 ghosts-0.010
 cob-0.009
 staying-0.009
 snowy-0.009
 breathing-0.008
Token was
Token ID373
Feature activation+2.36e-03

Pos logit contributions
+0.017
iane+0.013
oured+0.013
ete+0.013
or+0.013
Neg logit contributions
 cob-0.010
 ghosts-0.010
 staying-0.010
 exciting-0.010
 exc-0.010
Token very
Token ID845
Feature activation-4.31e-03

Pos logit contributions
 ghosts+0.003
 cob+0.003
 staying+0.003
 snowy+0.003
 breathing+0.003
Neg logit contributions
-0.005
oured-0.004
ete-0.004
oni-0.004
or-0.004
Token excited
Token ID6568
Feature activation+7.24e-03

Pos logit contributions
+0.011
oured+0.010
ete+0.010
oni+0.009
or+0.009
Neg logit contributions
 ghosts-0.005
 cob-0.004
 staying-0.004
 snowy-0.004
 breathing-0.004
Token and
Token ID290
Feature activation+1.75e-02

Pos logit contributions
 ghosts+0.011
 staying+0.010
 cob+0.010
 breathing+0.010
 snowy+0.010
Neg logit contributions
-0.015
oured-0.013
ete-0.013
or-0.012
oni-0.012
Token qu
Token ID627
Feature activation-7.69e-03

Pos logit contributions
 ghosts+0.026
 staying+0.026
 exciting+0.025
 exc+0.025
 snowy+0.024
Neg logit contributions
-0.037
oured-0.031
ete-0.030
oni-0.030
iane-0.030
Tokenacked
Token ID6021
Feature activation-1.12e-01

Pos logit contributions
+0.011
 course+0.009
ian+0.008
or+0.008
 trigger+0.008
Neg logit contributions
 raining-0.015
 cob-0.014
 textures-0.014
 snowy-0.014
 breathing-0.014
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
+0.296
 course+0.250
iane+0.250
igh+0.248
oured+0.247
Neg logit contributions
 ghosts-0.112
 exciting-0.110
 than-0.105
 interesting-0.104
 breathing-0.101
Token qu
Token ID627
Feature activation-7.59e-03

Pos logit contributions
 ghosts+0.028
 staying+0.027
 cob+0.026
 snowy+0.026
 breathing+0.026
Neg logit contributions
-0.033
oured-0.028
ete-0.028
or-0.027
oni-0.027
Tokenacked
Token ID6021
Feature activation-1.12e-01

Pos logit contributions
+0.010
 course+0.008
ian+0.007
or+0.007
iane+0.007
Neg logit contributions
 raining-0.015
 cob-0.015
 textures-0.015
 breathing-0.015
 snowy-0.015
Token with
Token ID351
Feature activation-2.10e-03

Pos logit contributions
+0.285
iane+0.243
 course+0.241
oured+0.240
igh+0.238
Neg logit contributions
 ghosts-0.122
 exciting-0.117
 than-0.113
 interesting-0.111
 cob-0.109
Token g
Token ID308
Feature activation+6.43e-03

Pos logit contributions
+0.005
or+0.004
oured+0.004
oni+0.004
ar+0.004
Neg logit contributions
 exciting-0.003
 ghosts-0.003
 interesting-0.003
 cob-0.003
 staying-0.002
Token asked
Token ID1965
Feature activation+5.38e-02

Pos logit contributions
+0.005
ete+0.004
oured+0.004
oni+0.004
ian+0.004
Neg logit contributions
 ghosts-0.006
 staying-0.005
 cob-0.005
 snowy-0.005
 breathing-0.005
Token.
Token ID13
Feature activation+4.14e-03

Pos logit contributions
 ghosts+0.097
 cob+0.090
 snowy+0.089
 staying+0.089
 breathing+0.085
Neg logit contributions
-0.094
ete-0.081
oured-0.081
oni-0.077
or-0.077
Token
Token ID198
Feature activation+3.09e-03

Pos logit contributions
 ghosts+0.007
 cob+0.007
 staying+0.007
 breathing+0.006
 snowy+0.006
Neg logit contributions
-0.008
oured-0.007
ete-0.006
or-0.006
oni-0.006
Token
Token ID198
Feature activation+3.00e-03

Pos logit contributions
 ghosts+0.005
 staying+0.004
 cob+0.004
 exc+0.004
 breathing+0.004
Neg logit contributions
-0.006
oured-0.006
als-0.005
ete-0.005
oni-0.005
TokenM
Token ID44
Feature activation-1.30e-02

Pos logit contributions
 ghosts+0.005
 cob+0.005
 exc+0.005
 staying+0.005
 snowy+0.005
Neg logit contributions
-0.005
oured-0.005
ian-0.004
oni-0.004
les-0.004
Tokenum
Token ID388
Feature activation-1.17e-01

Pos logit contributions
+0.013
oured+0.011
hed+0.009
als+0.009
ted+0.009
Neg logit contributions
 ghosts-0.032
 cob-0.031
 exc-0.031
 exciting-0.031
 snowy-0.031
Token said
Token ID531
Feature activation-5.07e-04

Pos logit contributions
+0.238
oured+0.197
or+0.194
ete+0.192
lor+0.189
Neg logit contributions
 ghosts-0.175
 cob-0.171
 staying-0.164
 snowy-0.159
 breathing-0.158
Token,
Token ID11
Feature activation-7.61e-03

Pos logit contributions
+0.001
oured+0.001
ete+0.001
oni+0.001
iane+0.001
Neg logit contributions
 ghosts-0.001
 cob-0.001
 staying-0.001
 breathing-0.001
 snowy-0.001
Token "
Token ID366
Feature activation+5.11e-02

Pos logit contributions
+0.017
oured+0.015
ete+0.015
or+0.015
 course+0.014
Neg logit contributions
 ghosts-0.010
 staying-0.009
 cob-0.009
 breathing-0.009
 snowy-0.009
TokenIt
Token ID1026
Feature activation+1.40e-02

Pos logit contributions
 ghosts+0.081
 cob+0.074
 staying+0.073
 snowy+0.072
 breathing+0.070
Neg logit contributions
-0.103
oured-0.093
ete-0.088
or-0.085
oni-0.085
Token's
Token ID338
Feature activation+4.54e-02

Pos logit contributions
 ghosts+0.025
 cob+0.023
 staying+0.022
 snowy+0.022
 breathing+0.022
Neg logit contributions
-0.027
oured-0.023
ete-0.022
or-0.021
oni-0.021