TOP ACTIVATIONS
MAX = 0.440

's build a fort with blankets and pill ows ."
building and get hugs and blankets . They felt sad for
They put pill ows and blankets on it . They had
he brought lots of soft blankets and pill ows . He
br ushes and their cozy blankets . When
dolls and make tents and blankets . Ben had a small
make a tent out of blankets and pill ows . They
She folded all of her blankets , her pill ows ,
. Tape is not for blankets . It is for paper
Lily wanted to add more blankets to her bed because she
exit . He moved some blankets and chairs to create a
had to hide under the blankets . There was a lot
the shelter - sticks , blankets , leaves and more !
the dog . He brought blankets and food so the dog
something as fun as pulling blankets on her modern bed .
, making a fort with blankets and pill ows .
ugg led deep in her blankets . But
more . He sold old blankets and a few tools he
started to lay out their blankets . Daisy was very persistent
ows as rockets and our blankets as space suits ." Anna

MIN ACTIVATIONS
MIN = -0.607

it ?" T oni looked down and saw a
everyone was happy . T oni was happy to have his
ground ! T oni was so scared . She
in the park , T oni saw something shiny near a
Finally , T oni was ready . She lifted
happy , she gave T oni a big hug .
higher . He showed T oni how to lift up her
lots of cheese and pepper oni . While they ate ,
Just then , T oni 's mom came outside and
on the outside and T oni thought it was so pretty
, and even though T oni was sad , she knew
a cheese pizza with pepper oni on top ." Mom my
One day , T oni was playing with his watch
doll ". T oni said , " I don
One day , when T oni and Jack were playing in
was a boy called T oni . He was very happy
ones with cheese and pepper oni . One day , she
One day T oni found a big , round
, " Hello , Pepper oni . You are my new
- Mon a and T oni . Mon a was three

INTERVAL 0.178 - 0.440
CONTAINS 1.644%

The two friends ran and ran , until at
noise ?" Ben runs to help her . He
and gave the boy her belt . She felt sad and
! After cl apping her hands , the little girl felt
thirds , like this . Now you try it ." Kathy

INTERVAL -0.084 - 0.178
CONTAINS 88.350%

there was a little girl named Lily who loved to dream
butter was soft and smooth , just like Lily 's favorite
and always wanted all the food for itself .
too ." The end . M oral
a click sound when he turned it . He

INTERVAL -0.345 - -0.084
CONTAINS 9.944%

and went home with the pole . When
. " That 's an enormous cup board ,"
happy and they had the best time ever . <|endoftext|> M
ila said . Ben nodded . He was hungry
Foo medicine to make him feel better . She showed lots

INTERVAL -0.607 - -0.345
CONTAINS 0.063%

crack . She saw the pumpkin split in half . She
back home . The rad ish answered : " Look there
she found the little dirty pumpkin . <|endoftext|> Once upon a
and proud of her new plant . <|endoftext|> Once upon a
to make the biggest sand castle ever . After
Token's
Token ID338
Feature activation-2.13e-01

Pos logit contributions
 blankets+0.060
 doors+0.058
 ghosts+0.055
 Chloe+0.055
 coats+0.054
Neg logit contributions
oni-0.084
melon-0.082
mod-0.070
abi-0.068
od-0.068
Token build
Token ID1382
Feature activation-1.56e-01

Pos logit contributions
oni+0.459
melon+0.451
od+0.371
mod+0.363
 pumpkin+0.359
Neg logit contributions
 blankets-0.319
 ghosts-0.306
 doors-0.295
 belts-0.284
 hinges-0.281
Token a
Token ID257
Feature activation+5.07e-02

Pos logit contributions
oni+0.324
melon+0.309
od+0.262
 pumpkin+0.250
 squash+0.246
Neg logit contributions
 blankets-0.239
 ghosts-0.233
 hinges-0.222
 belts-0.217
 doors-0.217
Token fort
Token ID6285
Feature activation+1.95e-01

Pos logit contributions
 blankets+0.084
 ghosts+0.080
 doors+0.078
 hinges+0.075
 belts+0.075
Neg logit contributions
oni-0.102
melon-0.100
od-0.081
mod-0.079
abi-0.077
Token with
Token ID351
Feature activation+1.13e-02

Pos logit contributions
 blankets+0.268
 ghosts+0.250
 doors+0.247
 belts+0.231
 hinges+0.230
Neg logit contributions
oni-0.445
melon-0.434
mod-0.369
od-0.367
abi-0.358
Token blankets
Token ID34794
Feature activation+4.40e-01

Pos logit contributions
 ghosts+0.017
 blankets+0.017
 hinges+0.017
 belts+0.017
 footsteps+0.017
Neg logit contributions
oni-0.022
melon-0.022
 pumpkin-0.018
 squash-0.017
od-0.017
Token and
Token ID290
Feature activation+2.42e-02

Pos logit contributions
 blankets+0.551
 doors+0.493
 ghosts+0.476
 belts+0.457
 hinges+0.443
Neg logit contributions
oni-1.051
melon-1.027
mod-0.946
od-0.908
abi-0.894
Token pill
Token ID9582
Feature activation+2.56e-01

Pos logit contributions
 Kenny+0.034
 years+0.032
Michael+0.032
 security+0.031
 Hur+0.031
Neg logit contributions
oni-0.046
melon-0.045
 pumpkin-0.041
 tomato-0.041
 squash-0.041
Tokenows
Token ID1666
Feature activation+3.17e-02

Pos logit contributions
 ghosts+0.409
 blankets+0.408
 doors+0.386
 footsteps+0.378
 terror+0.377
Neg logit contributions
oni-0.535
melon-0.502
od-0.412
abi-0.377
 pumpkin-0.372
Token."
Token ID526
Feature activation+1.43e-02

Pos logit contributions
 blankets+0.045
 ghosts+0.042
 doors+0.042
 belts+0.039
 hinges+0.039
Neg logit contributions
oni-0.071
melon-0.069
od-0.058
mod-0.058
abi-0.056
Token
Token ID220
Feature activation+2.58e-02

Pos logit contributions
 blankets+0.022
 ghosts+0.021
 doors+0.020
 hinges+0.019
 belts+0.019
Neg logit contributions
oni-0.030
melon-0.030
od-0.025
mod-0.024
abi-0.023
Token building
Token ID2615
Feature activation-1.66e-02

Pos logit contributions
melon+0.013
oni+0.013
 pumpkin+0.011
 squash+0.010
 tomato+0.010
Neg logit contributions
 ghosts-0.008
 blankets-0.008
 Kenny-0.008
 belts-0.007
 hinges-0.007
Token and
Token ID290
Feature activation+2.53e-02

Pos logit contributions
oni+0.036
melon+0.035
od+0.029
mod+0.029
abi+0.028
Neg logit contributions
 blankets-0.025
 ghosts-0.023
 doors-0.023
 hinges-0.022
 belts-0.021
Token get
Token ID651
Feature activation+4.51e-02

Pos logit contributions
 blankets+0.040
 ghosts+0.038
 hinges+0.037
 belts+0.036
 doors+0.036
Neg logit contributions
melon-0.051
oni-0.050
od-0.041
 pumpkin-0.040
 squash-0.038
Token hugs
Token ID40657
Feature activation+3.00e-01

Pos logit contributions
 blankets+0.072
 ghosts+0.070
 hinges+0.067
 doors+0.066
 belts+0.065
Neg logit contributions
oni-0.091
melon-0.090
od-0.071
 pumpkin-0.070
mod-0.070
Token and
Token ID290
Feature activation+2.42e-02

Pos logit contributions
 blankets+0.465
 ghosts+0.442
 doors+0.432
 hinges+0.421
 belts+0.408
Neg logit contributions
oni-0.637
melon-0.620
od-0.513
mod-0.491
 pumpkin-0.483
Token blankets
Token ID34794
Feature activation+4.38e-01

Pos logit contributions
 ghosts+0.033
 doors+0.032
 blankets+0.032
jar+0.032
 belts+0.032
Neg logit contributions
oni-0.050
melon-0.049
 squash-0.041
 pumpkin-0.041
 tomato-0.039
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
 blankets+0.614
 ghosts+0.556
 doors+0.550
 belts+0.525
 hinges+0.512
Neg logit contributions
oni-1.003
melon-0.973
mod-0.841
od-0.835
abi-0.811
Token They
Token ID1119
Feature activation+1.53e-02

Pos logit contributions
 blankets+0.029
 ghosts+0.027
 doors+0.026
 hinges+0.026
 belts+0.025
Neg logit contributions
oni-0.038
melon-0.037
od-0.030
 pumpkin-0.029
abi-0.029
Token felt
Token ID2936
Feature activation-3.96e-02

Pos logit contributions
 blankets+0.025
 ghosts+0.023
 doors+0.023
 hinges+0.022
 belts+0.022
Neg logit contributions
oni-0.032
melon-0.031
od-0.025
mod-0.024
 pumpkin-0.024
Token sad
Token ID6507
Feature activation-1.01e-01

Pos logit contributions
oni+0.083
melon+0.081
od+0.066
mod+0.065
abi+0.064
Neg logit contributions
 blankets-0.063
 ghosts-0.060
 doors-0.059
 hinges-0.056
 belts-0.056
Token for
Token ID329
Feature activation+8.76e-02

Pos logit contributions
oni+0.175
melon+0.169
od+0.132
 pumpkin+0.124
 squash+0.123
Neg logit contributions
 blankets-0.180
 doors-0.170
 ages-0.170
 hinges-0.169
 belts-0.169
Token They
Token ID1119
Feature activation+2.43e-02

Pos logit contributions
 blankets+0.036
 ghosts+0.035
 doors+0.034
 hinges+0.033
 err+0.033
Neg logit contributions
oni-0.045
melon-0.044
od-0.036
mod-0.034
abi-0.034
Token put
Token ID1234
Feature activation+3.32e-03

Pos logit contributions
 blankets+0.037
 ghosts+0.036
 doors+0.034
 hinges+0.034
 footsteps+0.033
Neg logit contributions
oni-0.052
melon-0.051
od-0.042
mod-0.040
 pumpkin-0.040
Token pill
Token ID9582
Feature activation+2.53e-01

Pos logit contributions
 blankets+0.005
 ghosts+0.005
 hinges+0.005
 doors+0.005
 ages+0.004
Neg logit contributions
oni-0.007
melon-0.007
od-0.006
 pumpkin-0.006
mod-0.005
Tokenows
Token ID1666
Feature activation+3.35e-02

Pos logit contributions
 blankets+0.390
 ghosts+0.389
 doors+0.369
 hinges+0.357
 ages+0.356
Neg logit contributions
oni-0.536
melon-0.511
od-0.422
mod-0.395
abi-0.392
Token and
Token ID290
Feature activation+2.55e-02

Pos logit contributions
 blankets+0.044
 doors+0.040
 ghosts+0.039
 belts+0.038
 hinges+0.036
Neg logit contributions
oni-0.080
melon-0.078
od-0.067
mod-0.067
abi-0.065
Token blankets
Token ID34794
Feature activation+4.36e-01

Pos logit contributions
 Kenny+0.031
 Taxi+0.028
 passage+0.028
 terror+0.028
 Hur+0.028
Neg logit contributions
oni-0.058
melon-0.056
 pumpkin-0.048
 tomato-0.048
 plant-0.046
Token on
Token ID319
Feature activation+5.83e-02

Pos logit contributions
 blankets+0.622
 ghosts+0.586
 doors+0.571
 hinges+0.541
 belts+0.540
Neg logit contributions
oni-0.981
melon-0.959
od-0.803
mod-0.789
abi-0.771
Token it
Token ID340
Feature activation+1.18e-02

Pos logit contributions
 blankets+0.093
 ghosts+0.090
 doors+0.087
 hinges+0.084
 belts+0.083
Neg logit contributions
oni-0.120
melon-0.117
od-0.096
mod-0.094
abi-0.092
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
 blankets+0.018
 ghosts+0.018
 doors+0.017
 hinges+0.016
 ages+0.016
Neg logit contributions
oni-0.025
melon-0.024
od-0.020
mod-0.019
 pumpkin-0.019
Token They
Token ID1119
Feature activation+1.85e-02

Pos logit contributions
 blankets+0.028
 ghosts+0.027
 doors+0.027
 hinges+0.026
 err+0.026
Neg logit contributions
oni-0.032
melon-0.031
od-0.025
abi-0.024
mod-0.024
Token had
Token ID550
Feature activation+5.57e-02

Pos logit contributions
 blankets+0.029
 ghosts+0.028
 doors+0.027
 hinges+0.027
 err+0.026
Neg logit contributions
oni-0.038
melon-0.037
od-0.030
mod-0.029
 pumpkin-0.029
Token he
Token ID339
Feature activation+1.18e-02

Pos logit contributions
 blankets+0.028
 ghosts+0.028
 hinges+0.027
 footsteps+0.026
 years+0.026
Neg logit contributions
oni-0.034
melon-0.034
 pumpkin-0.027
od-0.026
 squash-0.026
Token brought
Token ID3181
Feature activation-4.42e-02

Pos logit contributions
 blankets+0.019
 ghosts+0.019
 hinges+0.018
 footsteps+0.018
 doors+0.017
Neg logit contributions
oni-0.024
melon-0.023
od-0.019
 pumpkin-0.018
abi-0.018
Token lots
Token ID6041
Feature activation+7.09e-02

Pos logit contributions
oni+0.082
melon+0.077
od+0.062
 pumpkin+0.062
 squash+0.059
Neg logit contributions
 ghosts-0.079
 blankets-0.076
 hinges-0.076
 slam-0.072
jar-0.072
Token of
Token ID286
Feature activation+1.09e-01

Pos logit contributions
 ghosts+0.103
 blankets+0.100
 hinges+0.099
 doors+0.098
 belts+0.094
Neg logit contributions
oni-0.153
melon-0.150
od-0.123
 pumpkin-0.121
 squash-0.118
Token soft
Token ID2705
Feature activation+4.17e-02

Pos logit contributions
 ghosts+0.158
 hinges+0.158
 blankets+0.152
jar+0.149
 doors+0.148
Neg logit contributions
oni-0.229
melon-0.217
 pumpkin-0.182
 squash-0.180
od-0.179
Token blankets
Token ID34794
Feature activation+4.35e-01

Pos logit contributions
jar+0.057
 ghosts+0.056
 hinges+0.055
 slam+0.053
 passage+0.053
Neg logit contributions
oni-0.088
melon-0.086
 pumpkin-0.074
 squash-0.074
 tomato-0.070
Token and
Token ID290
Feature activation+1.89e-02

Pos logit contributions
 blankets+0.577
 doors+0.506
 ghosts+0.497
 belts+0.484
 hinges+0.467
Neg logit contributions
oni-1.022
melon-0.986
mod-0.892
od-0.865
abi-0.855
Token pill
Token ID9582
Feature activation+2.49e-01

Pos logit contributions
 terror+0.026
 ghosts+0.026
 passage+0.025
 Hur+0.025
 footsteps+0.025
Neg logit contributions
oni-0.038
melon-0.037
 tomato-0.032
 pumpkin-0.032
 squash-0.031
Tokenows
Token ID1666
Feature activation+2.79e-02

Pos logit contributions
 blankets+0.377
 ghosts+0.366
 doors+0.353
 hinges+0.335
 ages+0.332
Neg logit contributions
oni-0.536
melon-0.520
od-0.429
mod-0.409
abi-0.405
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
 blankets+0.037
 doors+0.033
 ghosts+0.032
 belts+0.031
 hinges+0.030
Neg logit contributions
oni-0.066
melon-0.063
mod-0.057
od-0.055
abi-0.054
Token He
Token ID679
Feature activation+4.85e-02

Pos logit contributions
 blankets+0.022
 ghosts+0.021
 doors+0.020
 hinges+0.020
 belts+0.020
Neg logit contributions
oni-0.027
melon-0.027
od-0.022
abi-0.021
 pumpkin-0.021
Tokenbr
Token ID1671
Feature activation+9.46e-03

Pos logit contributions
oni+0.045
melon+0.043
od+0.035
 pumpkin+0.034
abi+0.033
Neg logit contributions
 blankets-0.032
 ghosts-0.031
 doors-0.030
 ages-0.029
 hinges-0.029
Tokenushes
Token ID17237
Feature activation+5.42e-02

Pos logit contributions
 blankets+0.015
 ghosts+0.015
 doors+0.014
 hinges+0.013
 belts+0.013
Neg logit contributions
oni-0.020
melon-0.019
od-0.016
mod-0.015
abi-0.015
Token and
Token ID290
Feature activation+1.53e-02

Pos logit contributions
 blankets+0.078
 ghosts+0.074
 doors+0.072
 belts+0.068
 hinges+0.068
Neg logit contributions
oni-0.121
melon-0.118
od-0.099
mod-0.098
abi-0.095
Token their
Token ID511
Feature activation-4.21e-02

Pos logit contributions
 ghosts+0.022
 blankets+0.022
 Kenny+0.021
 err+0.021
 hinges+0.021
Neg logit contributions
oni-0.032
melon-0.031
 pumpkin-0.025
od-0.024
 tomato-0.024
Token cozy
Token ID37438
Feature activation+1.62e-01

Pos logit contributions
oni+0.094
melon+0.091
 pumpkin+0.077
od+0.074
 tomato+0.073
Neg logit contributions
 ghosts-0.056
 blankets-0.055
 Kenny-0.052
 ages-0.052
 doors-0.051
Token blankets
Token ID34794
Feature activation+4.33e-01

Pos logit contributions
 ghosts+0.224
 Kenny+0.220
 ages+0.211
 terror+0.207
 brakes+0.206
Neg logit contributions
oni-0.342
melon-0.342
 pumpkin-0.296
 tomato-0.280
 squash-0.272
Token.
Token ID13
Feature activation+5.37e-05

Pos logit contributions
 blankets+0.636
 ghosts+0.601
 doors+0.586
 hinges+0.557
 belts+0.555
Neg logit contributions
oni-0.951
melon-0.929
od-0.774
mod-0.761
abi-0.743
Token
Token ID220
Feature activation+1.83e-02

Pos logit contributions
 blankets+0.000
 ghosts+0.000
 doors+0.000
 err+0.000
 hinges+0.000
Neg logit contributions
melon-0.000
oni-0.000
od-0.000
 Crab-0.000
abi-0.000
Token
Token ID198
Feature activation-9.31e-03

Pos logit contributions
 blankets+0.024
 ghosts+0.021
 doors+0.021
 belts+0.020
 ages+0.020
Neg logit contributions
melon-0.043
oni-0.042
mod-0.036
od-0.036
 pumpkin-0.035
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
oni+0.019
melon+0.018
od+0.015
mod+0.015
abi+0.014
Neg logit contributions
 blankets-0.015
 ghosts-0.015
 doors-0.014
 hinges-0.014
 belts-0.014
TokenWhen
Token ID2215
Feature activation+1.87e-01

Pos logit contributions
oni+0.024
melon+0.023
od+0.019
mod+0.019
abi+0.018
Neg logit contributions
 blankets-0.019
 ghosts-0.019
 doors-0.018
 hinges-0.018
 eyes-0.017
Token dolls
Token ID36062
Feature activation+1.06e-01

Pos logit contributions
 ghosts+0.033
 blankets+0.033
 ages+0.032
 terror+0.032
 hinges+0.031
Neg logit contributions
oni-0.049
melon-0.047
od-0.039
 pumpkin-0.038
 squash-0.037
Token and
Token ID290
Feature activation+1.70e-02

Pos logit contributions
 blankets+0.142
 doors+0.130
 ghosts+0.127
 belts+0.122
 hinges+0.118
Neg logit contributions
oni-0.252
melon-0.247
mod-0.213
od-0.207
abi-0.205
Token make
Token ID787
Feature activation-9.02e-02

Pos logit contributions
 ghosts+0.025
 ages+0.024
 Kenny+0.024
 years+0.023
jar+0.023
Neg logit contributions
oni-0.033
melon-0.033
 pumpkin-0.028
 squash-0.027
 tomato-0.027
Token tents
Token ID29804
Feature activation+3.21e-01

Pos logit contributions
oni+0.193
melon+0.187
od+0.157
mod+0.152
abi+0.149
Neg logit contributions
 blankets-0.137
 ghosts-0.131
 doors-0.127
 hinges-0.122
 belts-0.121
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
 blankets+0.484
 ghosts+0.468
 doors+0.449
 hinges+0.433
 ages+0.428
Neg logit contributions
oni-0.686
melon-0.670
od-0.553
mod-0.532
 pumpkin-0.525
Token blankets
Token ID34794
Feature activation+4.31e-01

Pos logit contributions
 Kenny+0.022
 Hur+0.021
 ages+0.021
 Taxi+0.021
 years+0.021
Neg logit contributions
melon-0.029
oni-0.029
 pumpkin-0.025
 squash-0.025
 tomato-0.024
Token.
Token ID13
Feature activation+7.35e-03

Pos logit contributions
 blankets+0.600
 ghosts+0.545
 doors+0.543
 belts+0.512
 hinges+0.504
Neg logit contributions
oni-0.993
melon-0.968
mod-0.834
od-0.823
abi-0.807
Token Ben
Token ID3932
Feature activation-1.15e-01

Pos logit contributions
 blankets+0.011
 ghosts+0.010
 doors+0.010
 hinges+0.009
 belts+0.009
Neg logit contributions
oni-0.017
melon-0.016
od-0.013
mod-0.013
abi-0.013
Token had
Token ID550
Feature activation+4.79e-02

Pos logit contributions
oni+0.247
melon+0.242
od+0.200
mod+0.197
abi+0.192
Neg logit contributions
 blankets-0.176
 ghosts-0.166
 doors-0.163
 belts-0.155
 hinges-0.154
Token a
Token ID257
Feature activation+4.33e-02

Pos logit contributions
 blankets+0.073
 ghosts+0.071
 doors+0.068
 hinges+0.067
 footsteps+0.065
Neg logit contributions
oni-0.100
melon-0.094
od-0.080
mod-0.078
abi-0.078
Token small
Token ID1402
Feature activation-2.59e-02

Pos logit contributions
 blankets+0.063
 ghosts+0.062
 doors+0.060
 hinges+0.058
 belts+0.057
Neg logit contributions
oni-0.092
melon-0.091
 pumpkin-0.074
od-0.073
mod-0.072
Token make
Token ID787
Feature activation-8.92e-02

Pos logit contributions
 blankets+0.001
 ghosts+0.001
 doors+0.001
 hinges+0.001
 ages+0.001
Neg logit contributions
oni-0.001
melon-0.001
od-0.001
mod-0.001
abi-0.001
Token a
Token ID257
Feature activation+4.80e-02

Pos logit contributions
melon+0.237
oni+0.227
mod+0.196
abi+0.187
iche+0.187
Neg logit contributions
 blankets-0.104
 doors-0.095
 ghosts-0.091
 footsteps-0.086
 eyes-0.084
Token tent
Token ID11105
Feature activation+3.17e-01

Pos logit contributions
 blankets+0.081
 ghosts+0.076
 doors+0.075
 hinges+0.072
 belts+0.071
Neg logit contributions
oni-0.096
melon-0.093
od-0.076
mod-0.075
abi-0.073
Token out
Token ID503
Feature activation-3.13e-03

Pos logit contributions
 blankets+0.430
 ghosts+0.399
 doors+0.392
 belts+0.369
 hinges+0.369
Neg logit contributions
oni-0.735
melon-0.715
od-0.609
mod-0.605
abi-0.590
Token of
Token ID286
Feature activation+1.05e-01

Pos logit contributions
melon+0.006
oni+0.006
od+0.005
 pumpkin+0.005
 squash+0.005
Neg logit contributions
 blankets-0.005
 hinges-0.005
 ghosts-0.005
 brakes-0.004
 belts-0.004
Token blankets
Token ID34794
Feature activation+4.31e-01

Pos logit contributions
 hinges+0.164
 slam+0.151
 belts+0.151
 Kenny+0.150
 Deal+0.150
Neg logit contributions
oni-0.187
melon-0.184
 pumpkin-0.169
 squash-0.166
 tomato-0.166
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
 blankets+0.556
 doors+0.486
 ghosts+0.480
 belts+0.465
 hinges+0.441
Neg logit contributions
oni-1.025
melon-0.989
mod-0.901
od-0.875
abi-0.860
Token pill
Token ID9582
Feature activation+2.47e-01

Pos logit contributions
 Kenny+0.021
 years+0.021
 terror+0.020
Michael+0.020
 Taxi+0.020
Neg logit contributions
oni-0.028
melon-0.028
 pumpkin-0.025
 squash-0.025
 tomato-0.025
Tokenows
Token ID1666
Feature activation+2.48e-02

Pos logit contributions
 blankets+0.376
 ghosts+0.363
 doors+0.350
 hinges+0.334
 ages+0.331
Neg logit contributions
oni-0.534
melon-0.517
od-0.427
mod-0.409
abi-0.404
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
 blankets+0.035
 ghosts+0.032
 doors+0.032
 belts+0.031
 hinges+0.030
Neg logit contributions
oni-0.056
melon-0.055
od-0.046
mod-0.046
abi-0.045
Token They
Token ID1119
Feature activation+9.90e-03

Pos logit contributions
 blankets+0.017
 ghosts+0.017
 doors+0.016
 hinges+0.016
 belts+0.015
Neg logit contributions
oni-0.024
melon-0.023
od-0.019
mod-0.018
abi-0.018
Token She
Token ID1375
Feature activation+1.62e-02

Pos logit contributions
 blankets+0.010
 ghosts+0.009
 doors+0.009
 belts+0.009
 hinges+0.009
Neg logit contributions
oni-0.012
melon-0.012
od-0.010
mod-0.010
 pumpkin-0.009
Token folded
Token ID24650
Feature activation-6.51e-02

Pos logit contributions
 blankets+0.025
 ghosts+0.023
 doors+0.023
 hinges+0.022
 belts+0.022
Neg logit contributions
oni-0.035
melon-0.034
od-0.028
mod-0.028
abi-0.027
Token all
Token ID477
Feature activation+1.09e-02

Pos logit contributions
oni+0.145
melon+0.144
mod+0.119
od+0.118
iche+0.115
Neg logit contributions
 blankets-0.096
 doors-0.090
 ghosts-0.087
 belts-0.082
 ages-0.081
Token of
Token ID286
Feature activation+1.01e-01

Pos logit contributions
 blankets+0.015
 doors+0.013
 ghosts+0.012
 ages+0.012
 beds+0.011
Neg logit contributions
oni-0.027
melon-0.026
od-0.022
mod-0.021
abi-0.021
Token her
Token ID607
Feature activation+2.76e-02

Pos logit contributions
 blankets+0.150
 ghosts+0.138
 doors+0.136
 ages+0.127
 hinges+0.126
Neg logit contributions
oni-0.226
melon-0.222
od-0.184
mod-0.183
abi-0.177
Token blankets
Token ID34794
Feature activation+4.31e-01

Pos logit contributions
 ghosts+0.039
 blankets+0.037
 hinges+0.036
 doors+0.035
 Kenny+0.035
Neg logit contributions
oni-0.059
melon-0.058
 pumpkin-0.049
 squash-0.048
od-0.048
Token,
Token ID11
Feature activation+2.70e-03

Pos logit contributions
 blankets+0.563
 doors+0.496
 ghosts+0.461
 belts+0.456
 hinges+0.450
Neg logit contributions
oni-1.046
melon-1.010
mod-0.901
od-0.886
abi-0.875
Token her
Token ID607
Feature activation+2.40e-02

Pos logit contributions
 ghosts+0.003
 blankets+0.003
 doors+0.003
 hinges+0.003
 ages+0.003
Neg logit contributions
oni-0.006
melon-0.006
 pumpkin-0.005
od-0.005
mod-0.005
Token pill
Token ID9582
Feature activation+2.46e-01

Pos logit contributions
 ghosts+0.031
 hinges+0.029
 ages+0.029
 Kenny+0.029
 doors+0.029
Neg logit contributions
oni-0.051
melon-0.050
 pumpkin-0.043
 squash-0.042
 tomato-0.041
Tokenows
Token ID1666
Feature activation+2.13e-02

Pos logit contributions
 blankets+0.365
 ghosts+0.352
 doors+0.340
 hinges+0.323
 belts+0.320
Neg logit contributions
oni-0.534
melon-0.521
od-0.431
mod-0.418
abi-0.411
Token,
Token ID11
Feature activation+1.03e-03

Pos logit contributions
 blankets+0.029
 doors+0.026
 ghosts+0.025
 belts+0.024
 hinges+0.024
Neg logit contributions
oni-0.051
melon-0.048
mod-0.043
od-0.042
abi-0.041
Token.
Token ID13
Feature activation+4.53e-03

Pos logit contributions
 blankets+0.014
 ghosts+0.014
 doors+0.013
 belts+0.012
 eyes+0.012
Neg logit contributions
oni-0.024
melon-0.023
mod-0.020
od-0.019
 pumpkin-0.019
Token Tape
Token ID40849
Feature activation-7.06e-03

Pos logit contributions
 blankets+0.006
 ghosts+0.006
 doors+0.006
 hinges+0.005
 belts+0.005
Neg logit contributions
oni-0.010
melon-0.010
od-0.009
mod-0.008
abi-0.008
Token is
Token ID318
Feature activation+4.67e-02

Pos logit contributions
oni+0.018
melon+0.017
mod+0.015
od+0.015
abi+0.015
Neg logit contributions
 blankets-0.008
 doors-0.008
 ghosts-0.008
 belts-0.007
 eyes-0.007
Token not
Token ID407
Feature activation-8.98e-02

Pos logit contributions
 blankets+0.073
 ghosts+0.068
 doors+0.067
 hinges+0.063
 belts+0.063
Neg logit contributions
oni-0.099
melon-0.098
od-0.083
mod-0.081
iche-0.078
Token for
Token ID329
Feature activation+9.23e-02

Pos logit contributions
melon+0.204
oni+0.203
od+0.169
mod+0.168
iche+0.162
Neg logit contributions
 blankets-0.127
 ghosts-0.121
 doors-0.114
 hinges-0.110
 belts-0.108
Token blankets
Token ID34794
Feature activation+4.30e-01

Pos logit contributions
 blankets+0.142
 ghosts+0.131
 doors+0.130
 belts+0.129
 hinges+0.129
Neg logit contributions
oni-0.195
melon-0.186
od-0.155
abi-0.149
 squash-0.149
Token.
Token ID13
Feature activation+8.46e-03

Pos logit contributions
 blankets+0.536
 ghosts+0.491
 doors+0.481
 belts+0.458
 hinges+0.447
Neg logit contributions
oni-1.015
melon-0.988
mod-0.907
od-0.876
abi-0.856
Token It
Token ID632
Feature activation-3.29e-02

Pos logit contributions
 blankets+0.012
 ghosts+0.011
 doors+0.011
 hinges+0.011
 belts+0.011
Neg logit contributions
oni-0.019
melon-0.018
od-0.016
mod-0.015
abi-0.015
Token is
Token ID318
Feature activation+4.93e-02

Pos logit contributions
oni+0.082
melon+0.081
mod+0.070
od+0.070
abi+0.068
Neg logit contributions
 blankets-0.039
 doors-0.035
 ghosts-0.035
 belts-0.032
 beds-0.030
Token for
Token ID329
Feature activation+8.97e-02

Pos logit contributions
 blankets+0.077
 ghosts+0.074
 doors+0.072
 hinges+0.069
 belts+0.069
Neg logit contributions
oni-0.104
melon-0.101
od-0.080
abi-0.080
mod-0.079
Token paper
Token ID3348
Feature activation-6.75e-02

Pos logit contributions
 belts+0.133
 blankets+0.133
 passage+0.131
 hinges+0.130
 doors+0.129
Neg logit contributions
oni-0.181
melon-0.170
 squash-0.137
od-0.136
abi-0.136
Token Lily
Token ID20037
Feature activation+3.82e-02

Pos logit contributions
 blankets+0.008
 ghosts+0.008
 doors+0.008
 hinges+0.007
 belts+0.007
Neg logit contributions
oni-0.010
melon-0.010
od-0.008
mod-0.008
 pumpkin-0.008
Token wanted
Token ID2227
Feature activation+1.22e-02

Pos logit contributions
 blankets+0.066
 ghosts+0.063
 doors+0.062
 hinges+0.060
 belts+0.060
Neg logit contributions
oni-0.074
melon-0.071
od-0.058
mod-0.056
 pumpkin-0.055
Token to
Token ID284
Feature activation+2.57e-03

Pos logit contributions
 blankets+0.018
 ghosts+0.017
 doors+0.017
 hinges+0.016
 belts+0.016
Neg logit contributions
oni-0.027
melon-0.026
od-0.022
mod-0.021
abi-0.021
Token add
Token ID751
Feature activation-1.96e-01

Pos logit contributions
 blankets+0.004
 ghosts+0.004
 doors+0.004
 belts+0.004
 hinges+0.004
Neg logit contributions
oni-0.005
melon-0.005
od-0.004
mod-0.004
 pumpkin-0.004
Token more
Token ID517
Feature activation+3.50e-02

Pos logit contributions
oni+0.423
melon+0.413
od+0.342
mod+0.336
abi+0.328
Neg logit contributions
 blankets-0.296
 ghosts-0.281
 doors-0.274
 hinges-0.260
 belts-0.259
Token blankets
Token ID34794
Feature activation+4.30e-01

Pos logit contributions
 ghosts+0.052
 blankets+0.052
 doors+0.049
 hinges+0.049
 footsteps+0.048
Neg logit contributions
oni-0.072
melon-0.071
od-0.059
 pumpkin-0.058
 squash-0.056
Token to
Token ID284
Feature activation+2.64e-03

Pos logit contributions
 blankets+0.517
 doors+0.451
 ghosts+0.437
 belts+0.431
 hinges+0.409
Neg logit contributions
oni-1.039
melon-1.001
mod-0.934
iche-0.900
od-0.895
Token her
Token ID607
Feature activation+2.23e-02

Pos logit contributions
 blankets+0.004
 ghosts+0.004
 doors+0.004
 belts+0.004
 hinges+0.004
Neg logit contributions
oni-0.005
melon-0.005
od-0.004
mod-0.004
 pumpkin-0.004
Token bed
Token ID3996
Feature activation+3.27e-01

Pos logit contributions
 ghosts+0.030
 ages+0.029
 hinges+0.029
 blankets+0.029
 footsteps+0.028
Neg logit contributions
oni-0.047
melon-0.047
 pumpkin-0.040
 squash-0.038
 tomato-0.037
Token because
Token ID780
Feature activation+4.49e-02

Pos logit contributions
 blankets+0.474
 ghosts+0.443
 doors+0.436
 belts+0.415
 hinges+0.412
Neg logit contributions
oni-0.727
melon-0.714
od-0.597
mod-0.595
abi-0.575
Token she
Token ID673
Feature activation+1.37e-02

Pos logit contributions
 hinges+0.081
 terror+0.080
 brakes+0.080
 footsteps+0.080
 blankets+0.080
Neg logit contributions
melon-0.073
oni-0.073
 pumpkin-0.059
 tomato-0.056
 squash-0.056
Token exit
Token ID8420
Feature activation+2.31e-01

Pos logit contributions
oni+0.052
melon+0.051
od+0.041
mod+0.041
abi+0.040
Neg logit contributions
 blankets-0.039
 ghosts-0.037
 doors-0.037
 hinges-0.035
 belts-0.034
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
 blankets+0.374
 ghosts+0.363
 ages+0.343
 hinges+0.339
 terror+0.337
Neg logit contributions
melon-0.440
oni-0.430
od-0.351
 pumpkin-0.351
 squash-0.332
Token He
Token ID679
Feature activation+4.96e-02

Pos logit contributions
 blankets+0.020
 ghosts+0.019
 doors+0.018
 belts+0.018
 hinges+0.017
Neg logit contributions
oni-0.027
melon-0.026
od-0.021
abi-0.020
mod-0.020
Token moved
Token ID3888
Feature activation+7.63e-03

Pos logit contributions
 ghosts+0.081
 blankets+0.081
 footsteps+0.075
 hinges+0.074
 doors+0.073
Neg logit contributions
oni-0.097
melon-0.095
od-0.079
 pumpkin-0.076
mod-0.074
Token some
Token ID617
Feature activation-1.01e-02

Pos logit contributions
 ghosts+0.014
 blankets+0.013
 hinges+0.013
 beds+0.013
jar+0.013
Neg logit contributions
oni-0.013
melon-0.013
od-0.010
 squash-0.010
mod-0.010
Token blankets
Token ID34794
Feature activation+4.29e-01

Pos logit contributions
melon+0.018
oni+0.018
 squash+0.016
 pumpkin+0.016
 tomato+0.015
Neg logit contributions
 Amen-0.016
 ghosts-0.015
jar-0.014
 Chloe-0.014
 Kenny-0.014
Token and
Token ID290
Feature activation+1.42e-02

Pos logit contributions
 blankets+0.521
 doors+0.451
 ghosts+0.417
 belts+0.409
 hinges+0.399
Neg logit contributions
oni-1.063
melon-1.015
mod-0.941
iche-0.922
abi-0.918
Token chairs
Token ID18791
Feature activation+2.95e-01

Pos logit contributions
 ghosts+0.022
 years+0.021
 Deal+0.021
jar+0.021
 Amen+0.020
Neg logit contributions
oni-0.026
melon-0.025
 pumpkin-0.021
 tomato-0.021
 squash-0.021
Token to
Token ID284
Feature activation+4.03e-03

Pos logit contributions
 blankets+0.404
 doors+0.367
 ghosts+0.356
 belts+0.342
 hinges+0.341
Neg logit contributions
oni-0.687
melon-0.663
od-0.573
mod-0.571
abi-0.563
Token create
Token ID2251
Feature activation-2.13e-01

Pos logit contributions
 blankets+0.006
 ghosts+0.006
 doors+0.006
 belts+0.006
 years+0.006
Neg logit contributions
oni-0.008
melon-0.008
 pumpkin-0.006
od-0.006
 squash-0.006
Token a
Token ID257
Feature activation+4.72e-02

Pos logit contributions
oni+0.416
melon+0.383
od+0.344
 pumpkin+0.309
 squash+0.308
Neg logit contributions
 hinges-0.310
 ghosts-0.308
 blankets-0.306
jar-0.302
 belts-0.298
Token had
Token ID550
Feature activation+4.88e-02

Pos logit contributions
 blankets+0.000
 ghosts+0.000
 doors+0.000
 hinges+0.000
 belts+0.000
Neg logit contributions
oni-0.001
melon-0.001
od-0.000
mod-0.000
 pumpkin-0.000
Token to
Token ID284
Feature activation+8.04e-03

Pos logit contributions
 blankets+0.079
 ghosts+0.076
 doors+0.074
 hinges+0.072
 belts+0.071
Neg logit contributions
oni-0.098
melon-0.096
od-0.078
mod-0.076
abi-0.075
Token hide
Token ID7808
Feature activation+1.38e-01

Pos logit contributions
 blankets+0.011
 ghosts+0.011
 doors+0.010
 belts+0.010
 hinges+0.010
Neg logit contributions
oni-0.018
melon-0.018
od-0.015
mod-0.015
abi-0.014
Token under
Token ID739
Feature activation+1.75e-01

Pos logit contributions
 blankets+0.218
 ghosts+0.206
 hinges+0.205
 doors+0.203
 footsteps+0.198
Neg logit contributions
oni-0.265
melon-0.260
od-0.215
 squash-0.210
 pumpkin-0.206
Token the
Token ID262
Feature activation-9.44e-03

Pos logit contributions
 hinges+0.304
 blankets+0.301
 belts+0.300
 doors+0.294
 ghosts+0.291
Neg logit contributions
melon-0.302
oni-0.299
 squash-0.237
od-0.237
 pumpkin-0.236
Token blankets
Token ID34794
Feature activation+4.29e-01

Pos logit contributions
melon+0.019
oni+0.019
 pumpkin+0.016
 squash+0.016
 tomato+0.015
Neg logit contributions
 ghosts-0.014
 belts-0.014
 blankets-0.014
 ages-0.014
 hinges-0.013
Token.
Token ID13
Feature activation+8.06e-03

Pos logit contributions
 blankets+0.571
 ghosts+0.516
 doors+0.515
 belts+0.477
 ages+0.475
Neg logit contributions
oni-1.008
melon-0.966
mod-0.867
od-0.839
abi-0.829
Token There
Token ID1318
Feature activation+1.39e-01

Pos logit contributions
 blankets+0.014
 ghosts+0.013
 doors+0.013
 belts+0.012
 hinges+0.012
Neg logit contributions
oni-0.016
melon-0.016
od-0.013
abi-0.012
mod-0.012
Token was
Token ID373
Feature activation+4.10e-02

Pos logit contributions
 blankets+0.179
 ghosts+0.168
 doors+0.165
 Chloe+0.148
 ages+0.148
Neg logit contributions
oni-0.343
melon-0.328
abi-0.285
mod-0.282
iche-0.278
Token a
Token ID257
Feature activation+4.75e-02

Pos logit contributions
 blankets+0.065
 ghosts+0.061
 doors+0.061
 hinges+0.059
 belts+0.059
Neg logit contributions
oni-0.083
melon-0.083
od-0.066
abi-0.064
mod-0.063
Token lot
Token ID1256
Feature activation+2.05e-01

Pos logit contributions
 blankets+0.076
 ghosts+0.074
 belts+0.072
 doors+0.071
 hinges+0.070
Neg logit contributions
melon-0.093
oni-0.092
 pumpkin-0.076
 squash-0.074
 tomato-0.071
Token the
Token ID262
Feature activation-1.24e-02

Pos logit contributions
oni+0.353
melon+0.344
od+0.287
mod+0.280
abi+0.274
Neg logit contributions
 blankets-0.241
 ghosts-0.230
 doors-0.223
 hinges-0.213
 belts-0.212
Token shelter
Token ID11772
Feature activation+2.95e-01

Pos logit contributions
melon+0.025
oni+0.025
 pumpkin+0.022
 squash+0.021
 redd+0.020
Neg logit contributions
 blankets-0.016
 Kenny-0.015
 belts-0.015
 hinges-0.015
 ages-0.015
Token -
Token ID532
Feature activation+4.35e-02

Pos logit contributions
 blankets+0.412
 doors+0.378
 ghosts+0.371
 belts+0.347
 hinges+0.346
Neg logit contributions
oni-0.687
melon-0.658
mod-0.576
od-0.564
abi-0.557
Token sticks
Token ID16461
Feature activation-3.69e-02

Pos logit contributions
 blankets+0.068
 ghosts+0.066
 hinges+0.065
 doors+0.064
 belts+0.062
Neg logit contributions
melon-0.083
oni-0.083
 pumpkin-0.068
 squash-0.067
od-0.066
Token,
Token ID11
Feature activation+2.32e-03

Pos logit contributions
oni+0.087
melon+0.083
mod+0.074
od+0.073
abi+0.072
Neg logit contributions
 blankets-0.050
 doors-0.045
 ghosts-0.044
 ages-0.041
 belts-0.041
Token blankets
Token ID34794
Feature activation+4.28e-01

Pos logit contributions
 blankets+0.003
 ghosts+0.003
 doors+0.003
 hinges+0.003
 footsteps+0.003
Neg logit contributions
oni-0.005
melon-0.005
od-0.004
 pumpkin-0.004
 squash-0.004
Token,
Token ID11
Feature activation+5.81e-04

Pos logit contributions
 blankets+0.576
 doors+0.511
 ghosts+0.508
 belts+0.479
 ages+0.475
Neg logit contributions
oni-1.003
melon-0.965
mod-0.867
od-0.841
abi-0.828
Token leaves
Token ID5667
Feature activation+6.45e-02

Pos logit contributions
 ghosts+0.001
 blankets+0.001
 doors+0.001
 footsteps+0.001
 Kenny+0.001
Neg logit contributions
oni-0.001
melon-0.001
od-0.001
 pumpkin-0.001
mod-0.001
Token and
Token ID290
Feature activation+8.87e-03

Pos logit contributions
 blankets+0.086
 doors+0.075
 ghosts+0.073
 belts+0.070
 hinges+0.070
Neg logit contributions
oni-0.154
melon-0.147
mod-0.133
od-0.130
abi-0.128
Token more
Token ID517
Feature activation+3.06e-02

Pos logit contributions
 Kenny+0.013
jar+0.013
 ghosts+0.013
 Chloe+0.013
 Hur+0.012
Neg logit contributions
melon-0.016
oni-0.016
 pumpkin-0.014
 squash-0.013
 tomato-0.013
Token!
Token ID0
Feature activation+7.05e-02

Pos logit contributions
 blankets+0.047
 ghosts+0.047
 doors+0.046
 footsteps+0.045
 belts+0.045
Neg logit contributions
oni-0.059
melon-0.057
od-0.048
 squash-0.046
 pumpkin-0.045
Token the
Token ID262
Feature activation-1.91e-02

Pos logit contributions
 blankets+0.075
 ghosts+0.070
 doors+0.069
 ages+0.064
 footsteps+0.063
Neg logit contributions
oni-0.112
melon-0.111
mod-0.095
od-0.092
iche-0.090
Token dog
Token ID3290
Feature activation+6.06e-02

Pos logit contributions
oni+0.039
melon+0.038
 squash+0.030
 pumpkin+0.030
od+0.030
Neg logit contributions
 blankets-0.030
 ghosts-0.028
 doors-0.027
 hinges-0.027
 belts-0.027
Token.
Token ID13
Feature activation+9.62e-04

Pos logit contributions
 blankets+0.087
 ghosts+0.080
 doors+0.078
 hinges+0.073
 ages+0.072
Neg logit contributions
oni-0.137
melon-0.132
mod-0.115
od-0.112
abi-0.110
Token He
Token ID679
Feature activation+4.30e-02

Pos logit contributions
 blankets+0.002
 ghosts+0.002
 doors+0.001
 hinges+0.001
 belts+0.001
Neg logit contributions
oni-0.002
melon-0.002
od-0.002
abi-0.002
mod-0.001
Token brought
Token ID3181
Feature activation-5.05e-02

Pos logit contributions
 blankets+0.066
 ghosts+0.063
 doors+0.061
 hinges+0.059
 belts+0.058
Neg logit contributions
oni-0.093
melon-0.090
od-0.074
mod-0.072
abi-0.071
Token blankets
Token ID34794
Feature activation+4.28e-01

Pos logit contributions
oni+0.108
melon+0.106
od+0.087
mod+0.085
abi+0.084
Neg logit contributions
 blankets-0.077
 ghosts-0.074
 doors-0.072
 hinges-0.069
 belts-0.068
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
 blankets+0.556
 doors+0.476
 ghosts+0.462
 belts+0.442
 hinges+0.438
Neg logit contributions
oni-1.026
melon-0.978
mod-0.903
od-0.883
abi-0.863
Token food
Token ID2057
Feature activation+2.33e-02

Pos logit contributions
 ghosts+0.017
jar+0.017
 Kenny+0.017
 Taxi+0.017
 brakes+0.017
Neg logit contributions
oni-0.020
melon-0.020
 tomato-0.017
 pumpkin-0.017
 squash-0.017
Token so
Token ID523
Feature activation+4.41e-02

Pos logit contributions
 blankets+0.034
 ghosts+0.033
 doors+0.031
 hinges+0.030
 belts+0.030
Neg logit contributions
oni-0.052
melon-0.051
od-0.041
mod-0.040
 pumpkin-0.040
Token the
Token ID262
Feature activation-2.23e-02

Pos logit contributions
 blankets+0.070
 ghosts+0.067
 doors+0.065
 hinges+0.063
 belts+0.062
Neg logit contributions
oni-0.092
melon-0.090
od-0.073
mod-0.071
abi-0.070
Token dog
Token ID3290
Feature activation+5.95e-02

Pos logit contributions
oni+0.045
melon+0.044
 pumpkin+0.036
 squash+0.035
abi+0.034
Neg logit contributions
 blankets-0.035
 hinges-0.034
 ghosts-0.033
 doors-0.033
 belts-0.032
Token something
Token ID1223
Feature activation+2.32e-02

Pos logit contributions
oni+0.170
melon+0.165
od+0.139
mod+0.133
abi+0.133
Neg logit contributions
 blankets-0.117
 ghosts-0.110
 doors-0.107
 hinges-0.105
 belts-0.103
Token as
Token ID355
Feature activation+1.32e-01

Pos logit contributions
 blankets+0.035
 ghosts+0.034
 doors+0.033
 ages+0.030
 belts+0.030
Neg logit contributions
oni-0.051
melon-0.049
mod-0.041
od-0.041
abi-0.039
Token fun
Token ID1257
Feature activation+8.99e-02

Pos logit contributions
 blankets+0.214
 ghosts+0.201
 doors+0.201
 hinges+0.196
 belts+0.192
Neg logit contributions
oni-0.268
melon-0.258
od-0.215
mod-0.208
abi-0.206
Token as
Token ID355
Feature activation+1.32e-01

Pos logit contributions
 blankets+0.142
 doors+0.131
 ghosts+0.131
 hinges+0.129
 belts+0.128
Neg logit contributions
oni-0.181
melon-0.181
od-0.153
 squash-0.141
 pumpkin-0.140
Token pulling
Token ID10427
Feature activation+5.04e-02

Pos logit contributions
 blankets+0.204
 doors+0.191
 hinges+0.188
 ghosts+0.188
 belts+0.186
Neg logit contributions
oni-0.273
melon-0.261
od-0.225
 pumpkin-0.213
abi-0.213
Token blankets
Token ID34794
Feature activation+4.25e-01

Pos logit contributions
 blankets+0.074
 ghosts+0.070
 doors+0.068
 hinges+0.066
 belts+0.065
Neg logit contributions
oni-0.110
melon-0.107
od-0.090
mod-0.087
abi-0.086
Token on
Token ID319
Feature activation+4.86e-02

Pos logit contributions
 blankets+0.554
 doors+0.504
 ghosts+0.504
 belts+0.463
 hinges+0.459
Neg logit contributions
oni-0.999
melon-0.968
mod-0.861
od-0.836
abi-0.829
Token her
Token ID607
Feature activation+2.45e-02

Pos logit contributions
 blankets+0.075
 ghosts+0.072
 doors+0.070
 hinges+0.067
 belts+0.067
Neg logit contributions
oni-0.102
melon-0.100
od-0.082
mod-0.080
abi-0.079
Token modern
Token ID3660
Feature activation-4.36e-02

Pos logit contributions
 ghosts+0.029
 ages+0.028
 Kenny+0.028
 blankets+0.027
 security+0.026
Neg logit contributions
oni-0.054
melon-0.052
 pumpkin-0.046
 squash-0.045
 tomato-0.045
Token bed
Token ID3996
Feature activation+3.22e-01

Pos logit contributions
melon+0.087
oni+0.086
 pumpkin+0.079
 tomato+0.079
 squash+0.078
Neg logit contributions
 Kenny-0.055
 ghosts-0.052
 security-0.050
Michael-0.049
 ages-0.048
Token.
Token ID13
Feature activation+4.15e-03

Pos logit contributions
 blankets+0.497
 ghosts+0.477
 doors+0.458
 hinges+0.441
 belts+0.438
Neg logit contributions
oni-0.681
melon-0.663
od-0.548
mod-0.524
abi-0.517
Token,
Token ID11
Feature activation+1.36e-03

Pos logit contributions
 blankets+0.483
 ghosts+0.458
 doors+0.446
 hinges+0.425
 belts+0.422
Neg logit contributions
oni-0.706
melon-0.689
od-0.572
mod-0.560
abi-0.548
Token making
Token ID1642
Feature activation-6.18e-02

Pos logit contributions
 blankets+0.002
 ghosts+0.002
 doors+0.002
 hinges+0.002
 belts+0.002
Neg logit contributions
oni-0.003
melon-0.003
od-0.002
mod-0.002
 pumpkin-0.002
Token a
Token ID257
Feature activation+4.42e-02

Pos logit contributions
oni+0.144
melon+0.141
od+0.119
mod+0.116
abi+0.114
Neg logit contributions
 blankets-0.083
 ghosts-0.078
 doors-0.076
 hinges-0.072
 belts-0.072
Token fort
Token ID6285
Feature activation+1.77e-01

Pos logit contributions
 blankets+0.071
 ghosts+0.070
 ages+0.067
 hinges+0.066
 belts+0.066
Neg logit contributions
oni-0.088
melon-0.086
 pumpkin-0.069
od-0.068
 squash-0.067
Token with
Token ID351
Feature activation-3.48e-03

Pos logit contributions
 blankets+0.263
 ghosts+0.249
 doors+0.243
 hinges+0.231
 belts+0.230
Neg logit contributions
oni-0.387
melon-0.379
od-0.314
mod-0.308
abi-0.301
Token blankets
Token ID34794
Feature activation+4.24e-01

Pos logit contributions
oni+0.007
melon+0.007
od+0.005
 pumpkin+0.005
 squash+0.005
Neg logit contributions
 ghosts-0.005
 blankets-0.005
 hinges-0.005
 doors-0.005
 belts-0.005
Token and
Token ID290
Feature activation+1.39e-02

Pos logit contributions
 blankets+0.522
 doors+0.455
 ghosts+0.443
 belts+0.430
 beds+0.403
Neg logit contributions
oni-1.040
melon-1.009
mod-0.943
od-0.904
abi-0.894
Token pill
Token ID9582
Feature activation+2.42e-01

Pos logit contributions
 Kenny+0.019
Michael+0.017
 Hur+0.017
jar+0.016
 Taxi+0.016
Neg logit contributions
oni-0.029
melon-0.027
 tomato-0.026
 pumpkin-0.025
 squash-0.024
Tokenows
Token ID1666
Feature activation+2.00e-02

Pos logit contributions
 blankets+0.377
 ghosts+0.373
 doors+0.356
 ages+0.344
 hinges+0.341
Neg logit contributions
oni-0.510
melon-0.485
od-0.398
mod-0.369
abi-0.368
Token.
Token ID13
Feature activation+6.98e-03

Pos logit contributions
 blankets+0.027
 doors+0.024
 ghosts+0.024
 belts+0.023
 hinges+0.021
Neg logit contributions
oni-0.047
melon-0.046
mod-0.040
od-0.040
abi-0.039
Token
Token ID198
Feature activation+1.41e-03

Pos logit contributions
 blankets+0.010
 ghosts+0.010
 doors+0.009
 hinges+0.009
 belts+0.009
Neg logit contributions
oni-0.016
melon-0.015
od-0.013
mod-0.012
abi-0.012
Tokenugg
Token ID6837
Feature activation+9.38e-03

Pos logit contributions
 blankets+0.340
 ghosts+0.335
 doors+0.319
 hinges+0.309
 footsteps+0.303
Neg logit contributions
oni-0.317
melon-0.314
abi-0.240
od-0.236
mod-0.232
Tokenled
Token ID992
Feature activation-4.40e-02

Pos logit contributions
 blankets+0.018
 ghosts+0.018
 hinges+0.018
 doors+0.017
 ages+0.017
Neg logit contributions
oni-0.016
melon-0.015
od-0.012
abi-0.011
mod-0.011
Token deep
Token ID2769
Feature activation+1.22e-01

Pos logit contributions
melon+0.077
oni+0.076
od+0.059
abi+0.057
 pumpkin+0.057
Neg logit contributions
 ghosts-0.076
 blankets-0.075
 hinges-0.073
 doors-0.072
 belts-0.071
Token in
Token ID287
Feature activation+2.65e-02

Pos logit contributions
 Kenny+0.199
 ghosts+0.198
 footsteps+0.197
 blankets+0.192
 ages+0.190
Neg logit contributions
melon-0.206
oni-0.203
 pumpkin-0.157
abi-0.155
od-0.154
Token her
Token ID607
Feature activation+2.07e-02

Pos logit contributions
 belts+0.049
 brakes+0.046
Quick+0.046
 hinges+0.045
 buses+0.044
Neg logit contributions
melon-0.036
oni-0.034
 tomato-0.030
 grape-0.030
 pumpkin-0.029
Token blankets
Token ID34794
Feature activation+4.24e-01

Pos logit contributions
 Kenny+0.025
Run+0.025
Quick+0.024
 ages+0.024
 ghosts+0.024
Neg logit contributions
melon-0.040
oni-0.038
 pumpkin-0.037
 tomato-0.035
 bean-0.034
Token.
Token ID13
Feature activation+5.21e-03

Pos logit contributions
 blankets+0.617
 ghosts+0.584
 doors+0.568
 hinges+0.540
 belts+0.537
Neg logit contributions
oni-0.939
melon-0.917
od-0.765
mod-0.751
abi-0.734
Token
Token ID220
Feature activation+1.50e-02

Pos logit contributions
 blankets+0.009
 ghosts+0.009
 doors+0.009
 belts+0.009
 hinges+0.008
Neg logit contributions
oni-0.010
melon-0.009
od-0.008
abi-0.007
mod-0.007
Token
Token ID198
Feature activation-5.63e-03

Pos logit contributions
 blankets+0.022
 ghosts+0.021
 doors+0.020
 hinges+0.019
 belts+0.019
Neg logit contributions
oni-0.033
melon-0.032
od-0.027
mod-0.026
abi-0.026
Token
Token ID198
Feature activation-8.03e-03

Pos logit contributions
oni+0.013
melon+0.013
od+0.010
mod+0.010
 pumpkin+0.010
Neg logit contributions
 blankets-0.008
 ghosts-0.007
 doors-0.007
 ages-0.007
 hinges-0.007
TokenBut
Token ID1537
Feature activation+4.17e-02

Pos logit contributions
oni+0.017
melon+0.016
od+0.014
mod+0.013
abi+0.013
Neg logit contributions
 blankets-0.013
 ghosts-0.012
 doors-0.012
 hinges-0.011
 belts-0.011
Token more
Token ID517
Feature activation+3.10e-02

Pos logit contributions
melon+0.220
oni+0.220
mod+0.184
od+0.181
abi+0.177
Neg logit contributions
 blankets-0.129
 doors-0.120
 ghosts-0.120
 ages-0.109
 belts-0.109
Token.
Token ID13
Feature activation+9.90e-03

Pos logit contributions
 blankets+0.043
 ghosts+0.041
 doors+0.040
 hinges+0.037
 belts+0.037
Neg logit contributions
oni-0.071
melon-0.069
od-0.058
mod-0.057
abi-0.056
Token He
Token ID679
Feature activation+4.33e-02

Pos logit contributions
 blankets+0.015
 ghosts+0.014
 doors+0.014
 belts+0.013
 hinges+0.013
Neg logit contributions
oni-0.022
melon-0.021
od-0.018
mod-0.017
abi-0.017
Token sold
Token ID2702
Feature activation-1.65e-01

Pos logit contributions
 blankets+0.068
 doors+0.064
 ghosts+0.063
 belts+0.061
 ages+0.060
Neg logit contributions
oni-0.090
melon-0.088
od-0.074
mod-0.072
iche-0.070
Token old
Token ID1468
Feature activation-2.38e-02

Pos logit contributions
oni+0.365
melon+0.357
od+0.296
mod+0.290
abi+0.284
Neg logit contributions
 blankets-0.242
 ghosts-0.230
 doors-0.222
 hinges-0.213
 belts-0.211
Token blankets
Token ID34794
Feature activation+4.24e-01

Pos logit contributions
oni+0.057
melon+0.056
od+0.047
 pumpkin+0.047
mod+0.046
Neg logit contributions
 blankets-0.029
 ghosts-0.028
 doors-0.026
 hinges-0.025
 ages-0.024
Token and
Token ID290
Feature activation+1.07e-02

Pos logit contributions
 blankets+0.523
 doors+0.473
 belts+0.439
 ghosts+0.437
 hinges+0.418
Neg logit contributions
oni-1.016
melon-0.974
mod-0.911
od-0.886
iche-0.884
Token a
Token ID257
Feature activation+4.13e-02

Pos logit contributions
 ghosts+0.016
 terror+0.015
 blankets+0.015
 passage+0.015
 footsteps+0.015
Neg logit contributions
oni-0.022
melon-0.021
 pumpkin-0.018
 squash-0.017
 tomato-0.017
Token few
Token ID1178
Feature activation+1.30e-01

Pos logit contributions
 ghosts+0.066
 blankets+0.065
 doors+0.061
 ages+0.060
 belts+0.060
Neg logit contributions
oni-0.082
melon-0.082
 pumpkin-0.066
od-0.064
 squash-0.064
Token tools
Token ID4899
Feature activation+1.27e-01

Pos logit contributions
 ghosts+0.173
 Kenny+0.167
jar+0.163
 blankets+0.162
 hinges+0.160
Neg logit contributions
oni-0.272
melon-0.271
 pumpkin-0.227
 squash-0.225
od-0.217
Token he
Token ID339
Feature activation+5.14e-03

Pos logit contributions
 blankets+0.180
 doors+0.167
 ghosts+0.167
 belts+0.158
 hinges+0.155
Neg logit contributions
oni-0.284
melon-0.277
od-0.235
mod-0.233
abi-0.228
Token started
Token ID2067
Feature activation+8.22e-02

Pos logit contributions
oni+0.096
melon+0.093
od+0.077
mod+0.077
abi+0.074
Neg logit contributions
 blankets-0.070
 ghosts-0.066
 doors-0.065
 hinges-0.061
 belts-0.061
Token to
Token ID284
Feature activation-6.46e-03

Pos logit contributions
 blankets+0.119
 ghosts+0.109
 doors+0.108
 belts+0.101
 hinges+0.100
Neg logit contributions
oni-0.186
melon-0.180
od-0.153
mod-0.152
abi-0.147
Token lay
Token ID3830
Feature activation+6.64e-02

Pos logit contributions
oni+0.014
melon+0.013
od+0.011
mod+0.011
abi+0.011
Neg logit contributions
 blankets-0.010
 ghosts-0.010
 doors-0.009
 hinges-0.009
 ages-0.009
Token out
Token ID503
Feature activation-7.93e-03

Pos logit contributions
 blankets+0.094
 ghosts+0.090
 doors+0.086
 hinges+0.084
 ages+0.082
Neg logit contributions
oni-0.149
melon-0.147
od-0.120
mod-0.118
abi-0.117
Token their
Token ID511
Feature activation-5.15e-02

Pos logit contributions
melon+0.017
oni+0.017
od+0.013
 pumpkin+0.013
 squash+0.013
Neg logit contributions
 ghosts-0.012
 blankets-0.011
 hinges-0.011
 belts-0.011
 doors-0.011
Token blankets
Token ID34794
Feature activation+4.24e-01

Pos logit contributions
melon+0.118
oni+0.110
 pumpkin+0.097
 squash+0.094
 tomato+0.090
Neg logit contributions
 ghosts-0.072
 ages-0.065
 Kenny-0.064
 footsteps-0.061
 hinges-0.061
Token.
Token ID13
Feature activation+1.62e-03

Pos logit contributions
 blankets+0.570
 doors+0.501
 ghosts+0.479
 belts+0.467
 hinges+0.461
Neg logit contributions
oni-1.007
melon-0.970
mod-0.872
od-0.857
abi-0.834
Token Daisy
Token ID40355
Feature activation-7.38e-02

Pos logit contributions
 blankets+0.002
 ghosts+0.002
 doors+0.002
 belts+0.002
 hinges+0.002
Neg logit contributions
oni-0.003
melon-0.003
od-0.003
abi-0.003
iche-0.003
Token was
Token ID373
Feature activation+3.08e-02

Pos logit contributions
oni+0.157
melon+0.153
od+0.127
mod+0.122
abi+0.121
Neg logit contributions
 blankets-0.112
 ghosts-0.108
 doors-0.104
 hinges-0.101
 belts-0.099
Token very
Token ID845
Feature activation+5.73e-02

Pos logit contributions
 blankets+0.043
 ghosts+0.041
 doors+0.040
 hinges+0.037
 ages+0.037
Neg logit contributions
oni-0.071
melon-0.069
od-0.058
mod-0.057
 pumpkin-0.055
Token persistent
Token ID16218
Feature activation-3.60e-02

Pos logit contributions
 blankets+0.093
 ghosts+0.087
 doors+0.086
 belts+0.081
 hinges+0.081
Neg logit contributions
oni-0.119
melon-0.115
od-0.096
mod-0.095
abi-0.090
Tokenows
Token ID1666
Feature activation+1.13e-02

Pos logit contributions
 blankets+0.382
 ghosts+0.379
 doors+0.354
 ages+0.349
 footsteps+0.348
Neg logit contributions
oni-0.512
melon-0.491
od-0.403
 pumpkin-0.376
mod-0.373
Token as
Token ID355
Feature activation+1.27e-01

Pos logit contributions
 blankets+0.016
 ghosts+0.015
 doors+0.015
 belts+0.014
 hinges+0.014
Neg logit contributions
oni-0.025
melon-0.025
od-0.021
mod-0.021
abi-0.020
Token rockets
Token ID20293
Feature activation+1.60e-01

Pos logit contributions
 Kenny+0.211
 terror+0.203
 hinges+0.202
 blankets+0.200
 corridor+0.199
Neg logit contributions
oni-0.237
melon-0.226
 pumpkin-0.185
 squash-0.180
od-0.177
Token and
Token ID290
Feature activation+1.08e-02

Pos logit contributions
 blankets+0.234
 ghosts+0.222
 doors+0.216
 belts+0.205
 hinges+0.204
Neg logit contributions
oni-0.354
melon-0.346
od-0.288
mod-0.285
abi-0.278
Token our
Token ID674
Feature activation+6.06e-02

Pos logit contributions
 Kenny+0.016
Michael+0.014
jar+0.014
 ages+0.014
 Chloe+0.014
Neg logit contributions
oni-0.020
melon-0.020
 pumpkin-0.018
 tomato-0.017
 squash-0.017
Token blankets
Token ID34794
Feature activation+4.23e-01

Pos logit contributions
 Kenny+0.079
jar+0.074
 ages+0.073
 ghosts+0.072
 Chloe+0.071
Neg logit contributions
oni-0.129
melon-0.129
 pumpkin-0.112
 tomato-0.104
 squash-0.104
Token as
Token ID355
Feature activation+1.26e-01

Pos logit contributions
 blankets+0.637
 ghosts+0.614
 doors+0.592
 hinges+0.567
 ages+0.560
Neg logit contributions
oni-0.909
melon-0.889
od-0.728
mod-0.707
 pumpkin-0.696
Token space
Token ID2272
Feature activation+2.08e-01

Pos logit contributions
 Kenny+0.206
 hinges+0.200
 blankets+0.199
 terror+0.199
 footsteps+0.196
Neg logit contributions
oni-0.238
melon-0.228
 pumpkin-0.183
 squash-0.177
od-0.176
Token suits
Token ID14803
Feature activation+1.99e-01

Pos logit contributions
 blankets+0.333
 ghosts+0.326
 Kenny+0.324
 hinges+0.316
 footsteps+0.316
Neg logit contributions
oni-0.408
melon-0.391
od-0.315
 pumpkin-0.298
abi-0.285
Token."
Token ID526
Feature activation-3.87e-04

Pos logit contributions
 blankets+0.295
 ghosts+0.280
 doors+0.273
 hinges+0.260
 belts+0.258
Neg logit contributions
oni-0.435
melon-0.425
od-0.353
mod-0.346
abi-0.339
Token Anna
Token ID11735
Feature activation+7.04e-02

Pos logit contributions
oni+0.001
melon+0.001
od+0.001
mod+0.001
 pumpkin+0.001
Neg logit contributions
 blankets-0.001
 ghosts-0.001
 doors-0.001
 hinges-0.001
 belts-0.000
Token it
Token ID340
Feature activation-1.12e-02

Pos logit contributions
 blankets+0.043
 ghosts+0.041
 doors+0.040
 hinges+0.038
 belts+0.038
Neg logit contributions
oni-0.057
melon-0.056
od-0.046
mod-0.045
abi-0.044
Token?"
Token ID1701
Feature activation-7.02e-02

Pos logit contributions
melon+0.025
oni+0.025
mod+0.023
od+0.022
abi+0.022
Neg logit contributions
 blankets-0.013
 ghosts-0.013
 doors-0.012
 bed-0.011
 eyes-0.011
Token
Token ID220
Feature activation+1.25e-03

Pos logit contributions
oni+0.147
melon+0.143
od+0.118
mod+0.116
abi+0.113
Neg logit contributions
 blankets-0.112
 ghosts-0.107
 doors-0.103
 hinges-0.099
 belts-0.099
Token
Token ID198
Feature activation-1.79e-02

Pos logit contributions
 blankets+0.001
 Chloe+0.001
 Kenny+0.001
 doors+0.001
 ghosts+0.001
Neg logit contributions
melon-0.003
oni-0.003
 pumpkin-0.003
mod-0.003
 squash-0.003
TokenT
Token ID51
Feature activation-9.06e-03

Pos logit contributions
melon+0.043
oni+0.042
mod+0.035
od+0.034
 pumpkin+0.034
Neg logit contributions
 blankets-0.023
 doors-0.021
 ghosts-0.021
 Chloe-0.020
 ages-0.020
Tokenoni
Token ID14651
Feature activation-6.04e-01

Pos logit contributions
melon+0.007
 redd+0.005
 pumpkin+0.004
mod+0.004
 poppy+0.004
Neg logit contributions
 eyes-0.022
 years-0.022
 blankets-0.021
 ages-0.021
 Now-0.021
Token looked
Token ID3114
Feature activation+2.05e-02

Pos logit contributions
oni+1.394
melon+1.386
mod+1.188
abi+1.161
iche+1.159
Neg logit contributions
 doors-0.761
 blankets-0.757
 ghosts-0.701
 eyes-0.699
 years-0.678
Token down
Token ID866
Feature activation+4.04e-02

Pos logit contributions
 blankets+0.035
 ghosts+0.033
 doors+0.032
 hinges+0.031
 belts+0.031
Neg logit contributions
oni-0.041
melon-0.040
od-0.032
mod-0.031
abi-0.031
Token and
Token ID290
Feature activation-1.57e-03

Pos logit contributions
 blankets+0.056
 ghosts+0.053
 doors+0.053
 belts+0.049
 hinges+0.049
Neg logit contributions
oni-0.092
melon-0.090
od-0.076
mod-0.075
abi-0.073
Token saw
Token ID2497
Feature activation+2.85e-03

Pos logit contributions
oni+0.004
melon+0.003
od+0.003
mod+0.003
abi+0.003
Neg logit contributions
 blankets-0.002
 doors-0.002
 ghosts-0.002
 belts-0.002
 ages-0.002
Token a
Token ID257
Feature activation+3.16e-02

Pos logit contributions
 blankets+0.004
 ghosts+0.003
 doors+0.003
 eyes+0.003
 footsteps+0.003
Neg logit contributions
oni-0.007
melon-0.007
iche-0.006
mod-0.006
abi-0.006
Token everyone
Token ID2506
Feature activation-2.15e-02

Pos logit contributions
oni+0.003
melon+0.002
od+0.002
mod+0.002
abi+0.002
Neg logit contributions
 blankets-0.002
 ghosts-0.002
 doors-0.002
 hinges-0.002
 footsteps-0.002
Token was
Token ID373
Feature activation+2.58e-02

Pos logit contributions
oni+0.051
melon+0.049
mod+0.042
od+0.042
abi+0.041
Neg logit contributions
 blankets-0.027
 ghosts-0.026
 doors-0.025
 ages-0.024
 years-0.023
Token happy
Token ID3772
Feature activation-6.98e-02

Pos logit contributions
 blankets+0.031
 ghosts+0.030
 years+0.028
 ages+0.028
 doors+0.028
Neg logit contributions
oni-0.063
melon-0.060
mod-0.052
od-0.052
 squash-0.050
Token.
Token ID13
Feature activation-6.09e-03

Pos logit contributions
oni+0.136
melon+0.134
od+0.110
 pumpkin+0.104
abi+0.101
Neg logit contributions
 blankets-0.120
 doors-0.113
 hinges-0.112
 ghosts-0.112
 belts-0.109
Token T
Token ID309
Feature activation-4.05e-02

Pos logit contributions
oni+0.012
melon+0.012
od+0.010
mod+0.010
abi+0.009
Neg logit contributions
 blankets-0.010
 ghosts-0.010
 doors-0.009
 hinges-0.009
 belts-0.009
Tokenoni
Token ID14651
Feature activation-6.03e-01

Pos logit contributions
melon+0.032
 redd+0.023
mod+0.018
 pumpkin+0.016
 squash+0.015
Neg logit contributions
 years-0.100
 eyes-0.098
 belts-0.092
 books-0.092
 doors-0.091
Token was
Token ID373
Feature activation+2.59e-02

Pos logit contributions
oni+1.368
melon+1.340
od+1.127
mod+1.122
abi+1.082
Neg logit contributions
 blankets-0.837
 ghosts-0.781
 doors-0.768
 belts-0.724
 ages-0.719
Token happy
Token ID3772
Feature activation-6.94e-02

Pos logit contributions
 blankets+0.030
 ghosts+0.029
 ages+0.028
 years+0.028
 doors+0.027
Neg logit contributions
oni-0.065
melon-0.061
od-0.053
mod-0.052
 squash-0.051
Token to
Token ID284
Feature activation-1.36e-02

Pos logit contributions
oni+0.140
melon+0.138
od+0.113
 pumpkin+0.108
abi+0.107
Neg logit contributions
 blankets-0.114
 ghosts-0.108
 doors-0.108
 hinges-0.105
 belts-0.103
Token have
Token ID423
Feature activation-2.17e-02

Pos logit contributions
oni+0.031
melon+0.030
od+0.025
mod+0.025
abi+0.024
Neg logit contributions
 blankets-0.019
 ghosts-0.018
 doors-0.017
 hinges-0.017
 ages-0.016
Token his
Token ID465
Feature activation+1.46e-02

Pos logit contributions
melon+0.049
oni+0.048
mod+0.039
od+0.039
 pumpkin+0.039
Neg logit contributions
 blankets-0.031
 ghosts-0.030
 ages-0.029
 doors-0.028
 years-0.027
Token ground
Token ID2323
Feature activation-3.06e-02

Pos logit contributions
oni+0.080
melon+0.078
od+0.066
mod+0.065
abi+0.062
Neg logit contributions
 blankets-0.055
 ghosts-0.052
 doors-0.051
 hinges-0.048
 belts-0.047
Token!
Token ID0
Feature activation+6.32e-02

Pos logit contributions
oni+0.067
melon+0.065
od+0.054
mod+0.053
abi+0.052
Neg logit contributions
 blankets-0.045
 ghosts-0.043
 doors-0.042
 hinges-0.040
 belts-0.040
Token
Token ID198
Feature activation-1.73e-02

Pos logit contributions
 blankets+0.103
 ghosts+0.098
 doors+0.094
 hinges+0.091
 belts+0.091
Neg logit contributions
oni-0.131
melon-0.126
od-0.106
mod-0.103
abi-0.100
Token
Token ID198
Feature activation-1.77e-02

Pos logit contributions
oni+0.034
melon+0.031
od+0.027
mod+0.026
abi+0.025
Neg logit contributions
 blankets-0.030
 ghosts-0.029
 doors-0.028
 hinges-0.027
 belts-0.027
TokenT
Token ID51
Feature activation-7.92e-03

Pos logit contributions
oni+0.032
melon+0.029
od+0.027
mod+0.026
abi+0.024
Neg logit contributions
 blankets-0.033
 ghosts-0.031
 doors-0.029
 hinges-0.029
 belts-0.029
Tokenoni
Token ID14651
Feature activation-6.03e-01

Pos logit contributions
melon+0.008
 pumpkin+0.005
 squash+0.005
mod+0.005
 redd+0.005
Neg logit contributions
 years-0.018
 eyes-0.017
 blankets-0.017
 hinges-0.017
 doors-0.017
Token was
Token ID373
Feature activation+2.44e-02

Pos logit contributions
oni+1.349
melon+1.308
od+1.115
 pumpkin+1.067
mod+1.060
Neg logit contributions
 blankets-0.858
 ghosts-0.814
 doors-0.783
 belts-0.763
 hinges-0.762
Token so
Token ID523
Feature activation+3.55e-02

Pos logit contributions
 blankets+0.040
 ghosts+0.038
 doors+0.037
 belts+0.037
 hinges+0.036
Neg logit contributions
oni-0.049
melon-0.048
od-0.039
mod-0.038
abi-0.037
Token scared
Token ID12008
Feature activation+1.28e-02

Pos logit contributions
 blankets+0.063
 ghosts+0.060
 doors+0.059
 hinges+0.056
 belts+0.056
Neg logit contributions
oni-0.069
melon-0.066
od-0.054
mod-0.052
abi-0.051
Token.
Token ID13
Feature activation-2.52e-03

Pos logit contributions
 blankets+0.025
 hinges+0.024
 belts+0.024
 doors+0.023
 ghosts+0.023
Neg logit contributions
oni-0.021
melon-0.020
od-0.017
 pumpkin-0.016
 squash-0.015
Token She
Token ID1375
Feature activation-7.86e-04

Pos logit contributions
oni+0.005
melon+0.005
od+0.004
mod+0.004
abi+0.004
Neg logit contributions
 blankets-0.004
 ghosts-0.004
 belts-0.004
 ages-0.004
 doors-0.004
Token in
Token ID287
Feature activation+1.41e-02

Pos logit contributions
 blankets+0.039
 ghosts+0.037
 doors+0.035
 ages+0.034
 belts+0.033
Neg logit contributions
oni-0.069
melon-0.067
od-0.057
mod-0.056
abi-0.056
Token the
Token ID262
Feature activation-2.89e-02

Pos logit contributions
 blankets+0.023
 ghosts+0.022
 doors+0.022
 hinges+0.021
 belts+0.021
Neg logit contributions
oni-0.028
melon-0.028
od-0.023
mod-0.022
 pumpkin-0.022
Token park
Token ID3952
Feature activation-4.59e-03

Pos logit contributions
oni+0.077
melon+0.073
od+0.070
mod+0.069
abi+0.067
Neg logit contributions
 blankets-0.031
 doors-0.027
 beds-0.025
 bed-0.025
 tent-0.024
Token,
Token ID11
Feature activation-1.45e-02

Pos logit contributions
oni+0.011
melon+0.011
mod+0.009
abi+0.009
od+0.009
Neg logit contributions
 blankets-0.006
 ghosts-0.006
 doors-0.006
 ages-0.005
 hinges-0.005
Token T
Token ID309
Feature activation-4.17e-02

Pos logit contributions
oni+0.029
melon+0.028
od+0.023
mod+0.023
abi+0.022
Neg logit contributions
 blankets-0.024
 ghosts-0.023
 doors-0.023
 ages-0.022
 belts-0.022
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.033
 redd+0.027
mod+0.026
chard+0.017
 squash+0.016
Neg logit contributions
 eyes-0.104
 years-0.097
 books-0.096
 blankets-0.096
 coats-0.095
Token saw
Token ID2497
Feature activation+2.38e-03

Pos logit contributions
oni+1.472
melon+1.472
mod+1.296
abi+1.267
iche+1.230
Neg logit contributions
 blankets-0.701
 ghosts-0.668
 doors-0.645
 ages-0.597
 years-0.581
Token something
Token ID1223
Feature activation+8.72e-03

Pos logit contributions
 blankets+0.003
 ghosts+0.002
 doors+0.002
 footsteps+0.002
 eyes+0.002
Neg logit contributions
melon-0.006
oni-0.006
od-0.005
mod-0.005
iche-0.005
Token shiny
Token ID22441
Feature activation-5.33e-02

Pos logit contributions
 blankets+0.012
 ghosts+0.012
 doors+0.011
 footsteps+0.010
 belts+0.010
Neg logit contributions
oni-0.019
melon-0.019
od-0.016
mod-0.016
abi-0.016
Token near
Token ID1474
Feature activation-5.08e-02

Pos logit contributions
oni+0.122
melon+0.120
od+0.101
mod+0.099
abi+0.097
Neg logit contributions
 blankets-0.073
 ghosts-0.069
 doors-0.068
 belts-0.064
 hinges-0.064
Token a
Token ID257
Feature activation+3.20e-02

Pos logit contributions
oni+0.112
melon+0.110
od+0.093
mod+0.092
abi+0.088
Neg logit contributions
 blankets-0.075
 ghosts-0.070
 doors-0.069
 ages-0.064
 footsteps-0.064
Token
Token ID198
Feature activation-1.80e-02

Pos logit contributions
melon+0.016
oni+0.016
mod+0.013
od+0.013
 pumpkin+0.013
Neg logit contributions
 blankets-0.012
 ghosts-0.011
 doors-0.011
 ages-0.010
 eyes-0.010
Token
Token ID198
Feature activation-1.84e-02

Pos logit contributions
oni+0.040
melon+0.040
od+0.033
mod+0.032
 pumpkin+0.032
Neg logit contributions
 blankets-0.026
 ghosts-0.024
 doors-0.024
 ages-0.023
 belts-0.023
TokenFinally
Token ID11158
Feature activation+9.43e-02

Pos logit contributions
oni+0.043
melon+0.042
od+0.036
mod+0.035
abi+0.034
Neg logit contributions
 blankets-0.025
 ghosts-0.023
 doors-0.022
 hinges-0.021
 belts-0.021
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
 blankets+0.183
 hinges+0.178
 ghosts+0.177
 belts+0.172
 doors+0.171
Neg logit contributions
oni-0.162
melon-0.154
od-0.125
 Crab-0.117
 pumpkin-0.113
Token T
Token ID309
Feature activation-4.39e-02

Pos logit contributions
oni+0.035
melon+0.035
od+0.029
mod+0.029
abi+0.028
Neg logit contributions
 blankets-0.024
 ghosts-0.022
 doors-0.022
 ages-0.021
 belts-0.021
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.032
 redd+0.020
mod+0.019
 squash+0.015
iche+0.013
Neg logit contributions
 eyes-0.110
 years-0.104
 belts-0.104
 books-0.102
 hugs-0.100
Token was
Token ID373
Feature activation+2.45e-02

Pos logit contributions
oni+1.398
melon+1.393
mod+1.198
od+1.159
abi+1.155
Neg logit contributions
 blankets-0.784
 ghosts-0.723
 doors-0.723
 ages-0.684
 belts-0.667
Token ready
Token ID3492
Feature activation-3.66e-03

Pos logit contributions
 blankets+0.030
 minutes+0.029
 doors+0.028
 ghosts+0.028
 ages+0.028
Neg logit contributions
oni-0.059
melon-0.056
od-0.049
mod-0.047
 pumpkin-0.047
Token.
Token ID13
Feature activation-7.26e-03

Pos logit contributions
oni+0.008
melon+0.007
od+0.006
mod+0.006
abi+0.006
Neg logit contributions
 blankets-0.006
 ghosts-0.006
 doors-0.005
 hinges-0.005
 belts-0.005
Token She
Token ID1375
Feature activation-1.51e-03

Pos logit contributions
oni+0.017
melon+0.016
od+0.014
mod+0.013
abi+0.013
Neg logit contributions
 blankets-0.010
 ghosts-0.010
 doors-0.009
 hinges-0.009
 belts-0.009
Token lifted
Token ID13663
Feature activation-8.21e-03

Pos logit contributions
oni+0.003
melon+0.003
od+0.003
mod+0.003
abi+0.003
Neg logit contributions
 blankets-0.002
 ghosts-0.002
 doors-0.002
 hinges-0.002
 belts-0.002
Token happy
Token ID3772
Feature activation-6.66e-02

Pos logit contributions
 blankets+0.060
 ghosts+0.058
 doors+0.057
 belts+0.054
 ages+0.054
Neg logit contributions
oni-0.079
melon-0.077
od-0.061
abi-0.060
mod-0.059
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
oni+0.137
melon+0.133
od+0.112
 pumpkin+0.105
mod+0.104
Neg logit contributions
 blankets-0.108
 ghosts-0.102
 doors-0.100
 hinges-0.098
 belts-0.096
Token she
Token ID673
Feature activation-1.34e-04

Pos logit contributions
oni+0.023
melon+0.023
od+0.019
mod+0.018
abi+0.018
Neg logit contributions
 blankets-0.016
 ghosts-0.016
 doors-0.015
 hinges-0.015
 belts-0.014
Token gave
Token ID2921
Feature activation-5.86e-02

Pos logit contributions
oni+0.000
melon+0.000
od+0.000
mod+0.000
 pumpkin+0.000
Neg logit contributions
 blankets-0.000
 ghosts-0.000
 doors-0.000
 hinges-0.000
 footsteps-0.000
Token T
Token ID309
Feature activation-3.81e-02

Pos logit contributions
melon+0.142
oni+0.133
iche+0.119
mod+0.116
 squash+0.114
Neg logit contributions
 blankets-0.070
 doors-0.066
 eyes-0.064
 ghosts-0.060
 ages-0.060
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.042
mod+0.029
 redd+0.028
 squash+0.028
 pumpkin+0.023
Neg logit contributions
 years-0.087
 belts-0.085
 eyes-0.085
 hugs-0.082
 doors-0.082
Token a
Token ID257
Feature activation+3.30e-02

Pos logit contributions
oni+1.327
melon+1.290
od+1.080
mod+1.043
 pumpkin+1.035
Neg logit contributions
 blankets-0.871
 ghosts-0.840
 doors-0.814
 hinges-0.781
 belts-0.768
Token big
Token ID1263
Feature activation-2.78e-02

Pos logit contributions
 blankets+0.055
 ghosts+0.055
 doors+0.053
 hinges+0.051
 belts+0.050
Neg logit contributions
oni-0.064
melon-0.062
mod-0.050
od-0.050
 pumpkin-0.049
Token hug
Token ID16225
Feature activation+1.74e-01

Pos logit contributions
melon+0.065
oni+0.064
 pumpkin+0.054
 tomato+0.052
 squash+0.052
Neg logit contributions
 ghosts-0.036
 blankets-0.034
 doors-0.033
 hinges-0.032
 belts-0.032
Token.
Token ID13
Feature activation-6.10e-04

Pos logit contributions
 blankets+0.263
 ghosts+0.250
 doors+0.244
 hinges+0.233
 belts+0.229
Neg logit contributions
oni-0.379
melon-0.369
od-0.306
mod-0.298
abi-0.292
Token
Token ID198
Feature activation-1.44e-02

Pos logit contributions
melon+0.001
oni+0.001
mod+0.001
od+0.001
 squash+0.001
Neg logit contributions
 blankets-0.001
 doors-0.001
 ghosts-0.001
 eyes-0.001
 ages-0.001
Token higher
Token ID2440
Feature activation-1.53e-01

Pos logit contributions
oni+0.253
melon+0.245
od+0.209
mod+0.204
abi+0.202
Neg logit contributions
 blankets-0.153
 ghosts-0.144
 doors-0.143
 err-0.131
 minutes-0.131
Token.
Token ID13
Feature activation-7.87e-03

Pos logit contributions
oni+0.341
melon+0.334
od+0.278
mod+0.275
abi+0.267
Neg logit contributions
 blankets-0.222
 ghosts-0.210
 doors-0.205
 hinges-0.194
 belts-0.194
Token He
Token ID679
Feature activation+2.67e-02

Pos logit contributions
oni+0.017
melon+0.017
od+0.014
mod+0.014
 pumpkin+0.014
Neg logit contributions
 blankets-0.011
 ghosts-0.011
 doors-0.010
 hinges-0.010
 belts-0.010
Token showed
Token ID3751
Feature activation-8.96e-02

Pos logit contributions
 blankets+0.034
 ghosts+0.032
 doors+0.032
 ages+0.030
 belts+0.030
Neg logit contributions
oni-0.063
melon-0.062
mod-0.052
od-0.052
abi-0.052
Token T
Token ID309
Feature activation-4.34e-02

Pos logit contributions
melon+0.212
oni+0.210
mod+0.175
iche+0.171
od+0.170
Neg logit contributions
 blankets-0.117
 ghosts-0.111
 doors-0.109
 eyes-0.100
 belts-0.099
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.044
mod+0.031
 redd+0.030
 squash+0.024
iche+0.022
Neg logit contributions
 eyes-0.096
 years-0.096
 belts-0.092
 books-0.091
 hugs-0.089
Token how
Token ID703
Feature activation+6.94e-02

Pos logit contributions
oni+1.491
melon+1.476
mod+1.270
abi+1.232
iche+1.231
Neg logit contributions
 blankets-0.712
 ghosts-0.669
 doors-0.647
 ages-0.599
 belts-0.599
Token to
Token ID284
Feature activation-1.47e-02

Pos logit contributions
 blankets+0.086
 ghosts+0.084
 doors+0.080
 ages+0.076
 err+0.075
Neg logit contributions
oni-0.168
melon-0.165
od-0.136
abi-0.135
mod-0.134
Token lift
Token ID10303
Feature activation-4.21e-02

Pos logit contributions
oni+0.034
melon+0.032
od+0.029
mod+0.028
iche+0.027
Neg logit contributions
 blankets-0.019
 fear-0.018
 brakes-0.018
 breath-0.017
 laps-0.017
Token up
Token ID510
Feature activation-1.62e-01

Pos logit contributions
melon+0.096
oni+0.095
mod+0.081
iche+0.080
od+0.079
Neg logit contributions
 blankets-0.058
 doors-0.055
 ghosts-0.053
 belts-0.049
 eyes-0.049
Token her
Token ID607
Feature activation+6.52e-03

Pos logit contributions
oni+0.328
melon+0.320
od+0.264
 pumpkin+0.250
mod+0.249
Neg logit contributions
 blankets-0.260
 ghosts-0.251
 hinges-0.240
 doors-0.239
 ages-0.231
Token lots
Token ID6041
Feature activation+4.74e-02

Pos logit contributions
oni+0.035
melon+0.034
mod+0.029
od+0.029
abi+0.028
Neg logit contributions
 blankets-0.026
 doors-0.023
 ghosts-0.023
 eyes-0.022
 ages-0.021
Token of
Token ID286
Feature activation+8.41e-02

Pos logit contributions
 blankets+0.058
 ghosts+0.055
 doors+0.052
 hinges+0.050
 belts+0.049
Neg logit contributions
oni-0.117
melon-0.114
od-0.097
mod-0.095
abi-0.093
Token cheese
Token ID9891
Feature activation-1.80e-01

Pos logit contributions
 blankets+0.144
 ghosts+0.136
 doors+0.134
 belts+0.127
 hinges+0.127
Neg logit contributions
oni-0.166
melon-0.162
od-0.132
mod-0.130
abi-0.126
Token and
Token ID290
Feature activation-3.15e-03

Pos logit contributions
oni+0.402
melon+0.397
mod+0.367
od+0.345
abi+0.341
Neg logit contributions
 blankets-0.252
 doors-0.222
 ghosts-0.220
 belts-0.207
 beds-0.203
Token pepper
Token ID13385
Feature activation-3.62e-01

Pos logit contributions
oni+0.006
melon+0.005
od+0.004
 pumpkin+0.004
mod+0.004
Neg logit contributions
 blankets-0.006
 ghosts-0.006
 doors-0.006
 hinges-0.005
 belts-0.005
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
mod+0.572
melon+0.531
oni+0.490
 redd+0.485
iche+0.475
Neg logit contributions
 blankets-0.718
 doors-0.648
 eyes-0.637
 belts-0.621
 books-0.610
Token.
Token ID13
Feature activation-8.10e-03

Pos logit contributions
melon+1.254
oni+1.224
mod+1.193
abi+1.060
 redd+1.056
Neg logit contributions
 blankets-0.924
 doors-0.809
 ghosts-0.795
 belts-0.769
 books-0.765
Token While
Token ID2893
Feature activation+1.74e-01

Pos logit contributions
oni+0.017
melon+0.017
od+0.014
mod+0.013
abi+0.013
Neg logit contributions
 blankets-0.012
 ghosts-0.012
 doors-0.012
 belts-0.011
 hinges-0.011
Token they
Token ID484
Feature activation-5.38e-02

Pos logit contributions
 blankets+0.321
 hinges+0.315
 ghosts+0.312
 doors+0.311
 belts+0.308
Neg logit contributions
oni-0.307
melon-0.307
od-0.239
 pumpkin-0.237
 squash-0.225
Token ate
Token ID15063
Feature activation-1.12e-01

Pos logit contributions
oni+0.106
melon+0.104
od+0.083
mod+0.079
 pumpkin+0.079
Neg logit contributions
 blankets-0.091
 ghosts-0.091
 doors-0.087
 hinges-0.086
 belts-0.084
Token,
Token ID11
Feature activation-1.45e-02

Pos logit contributions
oni+0.276
melon+0.258
od+0.253
mod+0.252
iche+0.235
Neg logit contributions
 blankets-0.135
 doors-0.116
 books-0.107
 eyes-0.104
 beds-0.103
Token
Token ID198
Feature activation-1.71e-02

Pos logit contributions
oni+0.037
melon+0.036
od+0.030
mod+0.029
abi+0.028
Neg logit contributions
 blankets-0.027
 ghosts-0.025
 doors-0.025
 hinges-0.023
 belts-0.023
TokenJust
Token ID5703
Feature activation+1.32e-01

Pos logit contributions
oni+0.032
melon+0.030
od+0.026
mod+0.025
abi+0.024
Neg logit contributions
 blankets-0.031
 ghosts-0.030
 doors-0.029
 hinges-0.028
 belts-0.027
Token then
Token ID788
Feature activation+1.29e-01

Pos logit contributions
 blankets+0.193
 ghosts+0.180
 doors+0.175
 bedrooms+0.173
 belts+0.173
Neg logit contributions
oni-0.280
melon-0.279
abi-0.226
 pumpkin-0.223
 squash-0.222
Token,
Token ID11
Feature activation-1.41e-02

Pos logit contributions
 blankets+0.227
 doors+0.210
 ghosts+0.207
 hinges+0.206
 belts+0.206
Neg logit contributions
oni-0.247
melon-0.238
od-0.203
mod-0.191
 pumpkin-0.188
Token T
Token ID309
Feature activation-4.15e-02

Pos logit contributions
oni+0.023
melon+0.021
od+0.019
mod+0.017
 pumpkin+0.017
Neg logit contributions
 blankets-0.027
 hinges-0.026
 years-0.025
 brakes-0.025
 laps-0.025
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.030
mod+0.017
 redd+0.015
 squash+0.012
 pumpkin+0.012
Neg logit contributions
 eyes-0.105
 years-0.102
 books-0.102
 doors-0.098
 belts-0.097
Token's
Token ID338
Feature activation-2.21e-01

Pos logit contributions
oni+1.362
melon+1.336
od+1.112
mod+1.109
abi+1.080
Neg logit contributions
 blankets-0.849
 ghosts-0.808
 doors-0.786
 hinges-0.736
 belts-0.730
Token mom
Token ID1995
Feature activation+6.93e-02

Pos logit contributions
oni+0.486
melon+0.476
od+0.396
mod+0.385
 pumpkin+0.382
Neg logit contributions
 blankets-0.325
 ghosts-0.306
 doors-0.298
 hinges-0.285
 belts-0.283
Token came
Token ID1625
Feature activation+2.46e-02

Pos logit contributions
 blankets+0.112
 ghosts+0.107
 doors+0.103
 hinges+0.101
 belts+0.100
Neg logit contributions
oni-0.144
melon-0.140
od-0.115
mod-0.110
 pumpkin-0.109
Token outside
Token ID2354
Feature activation+6.46e-02

Pos logit contributions
 belts+0.045
 blankets+0.044
 ghosts+0.043
 hinges+0.042
 brakes+0.041
Neg logit contributions
oni-0.040
melon-0.040
od-0.033
 pumpkin-0.030
abi-0.030
Token and
Token ID290
Feature activation+1.24e-03

Pos logit contributions
 blankets+0.111
 ghosts+0.107
 belts+0.104
 hinges+0.104
 doors+0.104
Neg logit contributions
melon-0.122
oni-0.121
od-0.097
 pumpkin-0.092
abi-0.090
Token on
Token ID319
Feature activation+3.98e-02

Pos logit contributions
oni+0.388
melon+0.384
mod+0.332
abi+0.326
iche+0.319
Neg logit contributions
 blankets-0.225
 ghosts-0.203
 doors-0.202
 ages-0.185
 belts-0.183
Token the
Token ID262
Feature activation-3.30e-02

Pos logit contributions
 blankets+0.053
 doors+0.050
 ghosts+0.046
 eyes+0.044
 ages+0.043
Neg logit contributions
melon-0.094
oni-0.092
mod-0.082
iche-0.079
od-0.077
Token outside
Token ID2354
Feature activation+6.66e-02

Pos logit contributions
oni+0.076
melon+0.074
mod+0.063
od+0.063
abi+0.060
Neg logit contributions
 blankets-0.048
 doors-0.045
 ghosts-0.043
 belts-0.040
 hinges-0.040
Token and
Token ID290
Feature activation+3.14e-03

Pos logit contributions
 blankets+0.099
 ghosts+0.094
 doors+0.090
 hinges+0.088
 belts+0.087
Neg logit contributions
oni-0.145
melon-0.143
od-0.119
 pumpkin-0.114
mod-0.112
Token T
Token ID309
Feature activation-4.15e-02

Pos logit contributions
 blankets+0.005
 ghosts+0.005
 doors+0.005
 hinges+0.005
 belts+0.004
Neg logit contributions
oni-0.007
melon-0.006
od-0.005
mod-0.005
 pumpkin-0.005
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.031
mod+0.021
 redd+0.020
 pumpkin+0.016
 squash+0.015
Neg logit contributions
 eyes-0.108
 years-0.099
 books-0.098
 doors-0.094
 hugs-0.093
Token thought
Token ID1807
Feature activation-2.76e-02

Pos logit contributions
oni+1.456
melon+1.456
mod+1.241
abi+1.197
iche+1.196
Neg logit contributions
 blankets-0.764
 ghosts-0.708
 doors-0.705
 belts-0.646
 ages-0.642
Token it
Token ID340
Feature activation-8.88e-03

Pos logit contributions
oni+0.059
melon+0.058
od+0.048
mod+0.047
abi+0.046
Neg logit contributions
 blankets-0.042
 ghosts-0.040
 doors-0.039
 hinges-0.037
 belts-0.037
Token was
Token ID373
Feature activation+2.59e-02

Pos logit contributions
oni+0.023
melon+0.022
abi+0.019
iche+0.019
 pumpkin+0.018
Neg logit contributions
 blankets-0.010
 ghosts-0.010
 doors-0.010
 Chloe-0.009
 belts-0.008
Token so
Token ID523
Feature activation+3.75e-02

Pos logit contributions
 blankets+0.036
 ghosts+0.035
 doors+0.033
 ages+0.031
 hinges+0.030
Neg logit contributions
oni-0.060
melon-0.059
mod-0.049
od-0.049
iche-0.048
Token pretty
Token ID2495
Feature activation-6.23e-02

Pos logit contributions
 blankets+0.056
 ghosts+0.052
 doors+0.052
 ages+0.049
 hinges+0.048
Neg logit contributions
oni-0.082
melon-0.080
mod-0.066
od-0.066
abi-0.065
Token,
Token ID11
Feature activation-6.00e-03

Pos logit contributions
oni+0.085
melon+0.081
od+0.069
mod+0.065
abi+0.064
Neg logit contributions
 blankets-0.068
 ghosts-0.064
 doors-0.063
 belts-0.061
 ages-0.060
Token and
Token ID290
Feature activation+5.20e-03

Pos logit contributions
oni+0.012
melon+0.012
od+0.010
 pumpkin+0.009
mod+0.009
Neg logit contributions
 blankets-0.010
 ghosts-0.009
 hinges-0.009
 doors-0.009
 belts-0.008
Token even
Token ID772
Feature activation-9.99e-02

Pos logit contributions
 blankets+0.008
 ghosts+0.008
 doors+0.008
 hinges+0.008
 belts+0.007
Neg logit contributions
oni-0.011
melon-0.011
od-0.008
 pumpkin-0.008
mod-0.008
Token though
Token ID996
Feature activation+3.57e-02

Pos logit contributions
oni+0.208
melon+0.203
od+0.167
mod+0.164
abi+0.160
Neg logit contributions
 blankets-0.159
 ghosts-0.152
 doors-0.148
 hinges-0.141
 belts-0.141
Token T
Token ID309
Feature activation-3.19e-02

Pos logit contributions
 blankets+0.057
 ghosts+0.055
 doors+0.053
 hinges+0.053
 belts+0.052
Neg logit contributions
oni-0.073
melon-0.071
od-0.058
 pumpkin-0.056
mod-0.056
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.023
 redd+0.019
mod+0.018
 ju+0.012
 pumpkin+0.012
Neg logit contributions
 eyes-0.082
 books-0.074
 years-0.074
 doors-0.073
 belts-0.073
Token was
Token ID373
Feature activation+3.41e-02

Pos logit contributions
oni+1.448
melon+1.431
mod+1.232
od+1.227
iche+1.194
Neg logit contributions
 blankets-0.723
 doors-0.680
 ghosts-0.676
 belts-0.638
 ages-0.620
Token sad
Token ID6507
Feature activation-1.11e-01

Pos logit contributions
 blankets+0.049
 ghosts+0.047
 doors+0.045
 eyes+0.042
 minutes+0.042
Neg logit contributions
oni-0.076
melon-0.074
od-0.065
mod-0.064
iche-0.060
Token,
Token ID11
Feature activation-6.96e-03

Pos logit contributions
oni+0.230
melon+0.225
od+0.185
mod+0.175
abi+0.174
Neg logit contributions
 blankets-0.177
 ghosts-0.166
 doors-0.164
 hinges-0.158
 belts-0.158
Token she
Token ID673
Feature activation+2.77e-03

Pos logit contributions
oni+0.014
melon+0.014
od+0.011
mod+0.011
 pumpkin+0.011
Neg logit contributions
 blankets-0.011
 ghosts-0.011
 doors-0.011
 hinges-0.011
 belts-0.010
Token knew
Token ID2993
Feature activation-3.10e-04

Pos logit contributions
 blankets+0.004
 ghosts+0.004
 doors+0.004
 hinges+0.004
 belts+0.004
Neg logit contributions
oni-0.006
melon-0.006
od-0.005
mod-0.005
abi-0.005
Token a
Token ID257
Feature activation+2.87e-02

Pos logit contributions
melon+0.058
oni+0.057
od+0.051
mod+0.051
iche+0.047
Neg logit contributions
 blankets-0.044
 doors-0.041
 eyes-0.038
 ghosts-0.038
 books-0.037
Token cheese
Token ID9891
Feature activation-1.79e-01

Pos logit contributions
 blankets+0.048
 doors+0.045
 books+0.044
 eyes+0.044
 minutes+0.043
Neg logit contributions
oni-0.052
melon-0.051
od-0.048
mod-0.045
abi-0.041
Token pizza
Token ID14256
Feature activation-2.57e-01

Pos logit contributions
oni+0.383
melon+0.377
mod+0.360
od+0.344
abi+0.333
Neg logit contributions
 blankets-0.247
 ghosts-0.222
 doors-0.214
 books-0.211
 belts-0.203
Token with
Token ID351
Feature activation-1.59e-02

Pos logit contributions
oni+0.580
melon+0.571
mod+0.498
od+0.493
abi+0.476
Neg logit contributions
 blankets-0.350
 ghosts-0.324
 doors-0.317
 belts-0.303
 ages-0.297
Token pepper
Token ID13385
Feature activation-3.61e-01

Pos logit contributions
melon+0.029
oni+0.029
mod+0.026
od+0.025
abi+0.023
Neg logit contributions
 blankets-0.028
 doors-0.026
 eyes-0.024
 ghosts-0.024
 ages-0.024
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
mod+0.389
melon+0.325
 redd+0.296
str+0.292
od+0.280
Neg logit contributions
 blankets-0.852
 doors-0.816
 books-0.800
 eyes-0.790
 belts-0.766
Token on
Token ID319
Feature activation+3.78e-02

Pos logit contributions
mod+1.106
melon+1.082
oni+0.953
str+0.929
 redd+0.928
Neg logit contributions
 blankets-0.953
 books-0.917
 doors-0.899
 minutes-0.864
 years-0.864
Token top
Token ID1353
Feature activation-6.86e-02

Pos logit contributions
 blankets+0.048
 doors+0.047
 eyes+0.042
 books+0.041
 bed+0.039
Neg logit contributions
melon-0.083
mod-0.082
oni-0.081
od-0.076
str-0.072
Token."
Token ID526
Feature activation-1.41e-02

Pos logit contributions
melon+0.161
oni+0.161
mod+0.152
od+0.143
abi+0.139
Neg logit contributions
 blankets-0.078
 doors-0.076
 ghosts-0.067
 minutes-0.066
 belts-0.064
Token Mom
Token ID11254
Feature activation+8.06e-02

Pos logit contributions
oni+0.027
melon+0.026
od+0.021
mod+0.020
 pumpkin+0.020
Neg logit contributions
 ghosts-0.025
 blankets-0.024
 doors-0.023
 belts-0.023
 ages-0.022
Tokenmy
Token ID1820
Feature activation-1.09e-01

Pos logit contributions
 blankets+0.116
 doors+0.102
 ghosts+0.099
 Chloe+0.099
 eyes+0.096
Neg logit contributions
oni-0.173
melon-0.172
od-0.147
mod-0.146
abi-0.142
Token
Token ID198
Feature activation-2.09e-02

Pos logit contributions
oni+0.046
melon+0.046
od+0.037
mod+0.037
abi+0.036
Neg logit contributions
 blankets-0.029
 ghosts-0.027
 doors-0.026
 ages-0.025
 belts-0.025
TokenOne
Token ID3198
Feature activation+1.22e-02

Pos logit contributions
oni+0.045
melon+0.043
od+0.036
mod+0.036
abi+0.035
Neg logit contributions
 blankets-0.032
 ghosts-0.031
 doors-0.030
 hinges-0.029
 belts-0.028
Token day
Token ID1110
Feature activation-3.58e-02

Pos logit contributions
 blankets+0.018
 ghosts+0.017
 doors+0.016
 ages+0.016
 belts+0.016
Neg logit contributions
oni-0.027
melon-0.026
od-0.022
mod-0.021
abi-0.021
Token,
Token ID11
Feature activation-1.64e-02

Pos logit contributions
oni+0.067
melon+0.064
od+0.053
mod+0.052
 pumpkin+0.049
Neg logit contributions
 blankets-0.064
 ghosts-0.061
 doors-0.059
 hinges-0.058
 terror-0.057
Token T
Token ID309
Feature activation-4.34e-02

Pos logit contributions
oni+0.031
melon+0.029
od+0.024
mod+0.024
abi+0.023
Neg logit contributions
 blankets-0.029
 ghosts-0.028
 doors-0.027
 hinges-0.027
 terror-0.026
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.031
 redd+0.017
mod+0.017
 squash+0.012
 pumpkin+0.011
Neg logit contributions
 eyes-0.117
 books-0.109
 years-0.109
 doors-0.105
 blankets-0.105
Token was
Token ID373
Feature activation+2.24e-02

Pos logit contributions
oni+1.409
melon+1.386
mod+1.163
od+1.162
abi+1.136
Neg logit contributions
 blankets-0.791
 ghosts-0.754
 doors-0.735
 belts-0.681
 ages-0.680
Token playing
Token ID2712
Feature activation+2.55e-02

Pos logit contributions
 blankets+0.028
 ghosts+0.026
 doors+0.025
 hinges+0.023
 ages+0.023
Neg logit contributions
oni-0.055
melon-0.054
od-0.046
mod-0.045
abi-0.044
Token with
Token ID351
Feature activation-1.71e-02

Pos logit contributions
 blankets+0.035
 ghosts+0.033
 doors+0.031
 ages+0.030
 belts+0.029
Neg logit contributions
oni-0.060
melon-0.058
od-0.049
mod-0.048
abi-0.047
Token his
Token ID465
Feature activation+1.09e-02

Pos logit contributions
oni+0.041
melon+0.041
od+0.035
mod+0.034
abi+0.033
Neg logit contributions
 blankets-0.023
 ghosts-0.021
 doors-0.020
 belts-0.019
 ages-0.018
Token watch
Token ID2342
Feature activation+4.49e-02

Pos logit contributions
 blankets+0.016
 ghosts+0.015
 doors+0.014
 hinges+0.014
 belts+0.013
Neg logit contributions
oni-0.024
melon-0.024
od-0.020
mod-0.020
 pumpkin-0.019
Token doll
Token ID3654
Feature activation-3.10e-02

Pos logit contributions
 blankets+0.220
 ghosts+0.212
 doors+0.204
 ages+0.199
 hinges+0.197
Neg logit contributions
oni-0.263
melon-0.259
od-0.208
mod-0.202
 pumpkin-0.201
Token".
Token ID1911
Feature activation-2.68e-02

Pos logit contributions
oni+0.067
melon+0.066
od+0.055
mod+0.054
abi+0.052
Neg logit contributions
 blankets-0.046
 ghosts-0.044
 doors-0.043
 hinges-0.041
 belts-0.041
Token
Token ID198
Feature activation-1.03e-02

Pos logit contributions
melon+0.065
oni+0.063
mod+0.054
iche+0.053
 squash+0.052
Neg logit contributions
 blankets-0.030
 eyes-0.029
 doors-0.029
 footsteps-0.028
 ghosts-0.027
Token
Token ID198
Feature activation-1.08e-02

Pos logit contributions
melon+0.027
oni+0.026
 redd+0.023
mod+0.022
 pumpkin+0.022
Neg logit contributions
 blankets-0.010
 doors-0.009
 ghosts-0.009
 Chloe-0.009
 ages-0.009
TokenT
Token ID51
Feature activation-6.80e-03

Pos logit contributions
melon+0.025
oni+0.024
mod+0.020
iche+0.020
 redd+0.019
Neg logit contributions
 blankets-0.014
 ghosts-0.014
 doors-0.013
 years-0.013
 belts-0.013
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.012
oni+0.010
 squash+0.010
mod+0.009
 pumpkin+0.009
Neg logit contributions
 blankets-0.012
 ghosts-0.011
 doors-0.011
 belts-0.011
 ages-0.011
Token said
Token ID531
Feature activation-1.51e-02

Pos logit contributions
oni+1.368
melon+1.353
mod+1.124
od+1.123
abi+1.105
Neg logit contributions
 blankets-0.827
 doors-0.784
 ghosts-0.772
 belts-0.729
 hinges-0.719
Token,
Token ID11
Feature activation-9.05e-03

Pos logit contributions
oni+0.036
melon+0.035
mod+0.030
od+0.029
abi+0.029
Neg logit contributions
 blankets-0.019
 ghosts-0.019
 doors-0.018
 footsteps-0.017
 hinges-0.017
Token "
Token ID366
Feature activation-3.26e-02

Pos logit contributions
oni+0.021
melon+0.021
mod+0.018
od+0.018
iche+0.017
Neg logit contributions
 blankets-0.012
 doors-0.011
 ghosts-0.011
 eyes-0.010
 footsteps-0.010
TokenI
Token ID40
Feature activation-3.80e-02

Pos logit contributions
oni+0.083
melon+0.081
od+0.067
 pumpkin+0.065
iche+0.064
Neg logit contributions
 blankets-0.037
 ghosts-0.036
 doors-0.035
 belts-0.034
 ages-0.033
Token don
Token ID836
Feature activation+5.87e-02

Pos logit contributions
oni+0.092
melon+0.091
od+0.075
mod+0.074
 pumpkin+0.073
Neg logit contributions
 blankets-0.047
 ghosts-0.045
 doors-0.043
 ages-0.041
 hinges-0.041
TokenOne
Token ID3198
Feature activation+1.69e-02

Pos logit contributions
melon+0.041
oni+0.040
 pumpkin+0.034
mod+0.034
abi+0.033
Neg logit contributions
 blankets-0.018
 doors-0.016
 ghosts-0.016
 Chloe-0.016
 ages-0.015
Token day
Token ID1110
Feature activation-3.26e-02

Pos logit contributions
 blankets+0.021
 ghosts+0.020
 ages+0.019
 years+0.018
 minutes+0.017
Neg logit contributions
oni-0.041
melon-0.040
od-0.035
mod-0.034
abi-0.033
Token,
Token ID11
Feature activation-1.37e-02

Pos logit contributions
oni+0.070
melon+0.068
od+0.057
mod+0.055
abi+0.054
Neg logit contributions
 blankets-0.050
 ghosts-0.048
 doors-0.046
 hinges-0.044
 belts-0.044
Token when
Token ID618
Feature activation+6.73e-03

Pos logit contributions
oni+0.027
melon+0.026
od+0.021
mod+0.021
abi+0.020
Neg logit contributions
 blankets-0.024
 ghosts-0.023
 doors-0.022
 belts-0.021
 ages-0.021
Token T
Token ID309
Feature activation-4.12e-02

Pos logit contributions
 blankets+0.011
 ghosts+0.010
 doors+0.010
 belts+0.010
 hinges+0.010
Neg logit contributions
oni-0.014
melon-0.014
od-0.011
mod-0.011
abi-0.011
Tokenoni
Token ID14651
Feature activation-6.02e-01

Pos logit contributions
melon+0.054
mod+0.048
 redd+0.048
 pumpkin+0.039
 ju+0.038
Neg logit contributions
 eyes-0.082
 blankets-0.075
 years-0.075
 ages-0.072
 doors-0.072
Token and
Token ID290
Feature activation-6.98e-05

Pos logit contributions
oni+1.511
melon+1.490
mod+1.338
abi+1.318
od+1.312
Neg logit contributions
 blankets-0.619
 doors-0.571
 ghosts-0.564
 ages-0.531
 years-0.521
Token Jack
Token ID3619
Feature activation-2.11e-01

Pos logit contributions
melon+0.000
oni+0.000
mod+0.000
iche+0.000
umber+0.000
Neg logit contributions
 blankets-0.000
 eyes-0.000
 ghosts-0.000
 coats-0.000
 books-0.000
Token were
Token ID547
Feature activation-1.44e-02

Pos logit contributions
oni+0.524
melon+0.514
mod+0.439
abi+0.438
od+0.435
Neg logit contributions
 blankets-0.242
 ghosts-0.220
 doors-0.217
 years-0.209
 ages-0.208
Token playing
Token ID2712
Feature activation+2.92e-02

Pos logit contributions
oni+0.039
melon+0.037
mod+0.032
od+0.032
abi+0.032
Neg logit contributions
 blankets-0.015
 ghosts-0.015
 doors-0.014
 err-0.012
 ages-0.012
Token in
Token ID287
Feature activation+1.41e-02

Pos logit contributions
 blankets+0.039
 ghosts+0.037
 doors+0.035
 ages+0.034
 belts+0.033
Neg logit contributions
oni-0.069
melon-0.067
od-0.057
mod-0.056
abi-0.056
Token was
Token ID373
Feature activation+2.01e-02

Pos logit contributions
 blankets+0.046
 ghosts+0.043
 doors+0.040
 Chloe+0.039
 ages+0.039
Neg logit contributions
oni-0.130
melon-0.126
mod-0.108
abi-0.108
od-0.107
Token a
Token ID257
Feature activation+2.59e-02

Pos logit contributions
 blankets+0.024
 ghosts+0.022
 ages+0.020
 doors+0.020
 footsteps+0.019
Neg logit contributions
oni-0.051
melon-0.051
od-0.043
abi-0.042
mod-0.042
Token boy
Token ID2933
Feature activation-2.23e-02

Pos logit contributions
 blankets+0.033
 ghosts+0.031
 doors+0.029
 belts+0.027
 hinges+0.027
Neg logit contributions
oni-0.063
melon-0.061
od-0.052
mod-0.052
abi-0.049
Token called
Token ID1444
Feature activation+1.90e-01

Pos logit contributions
oni+0.058
melon+0.056
od+0.048
mod+0.048
abi+0.047
Neg logit contributions
 blankets-0.025
 ghosts-0.023
 doors-0.022
 hinges-0.020
 belts-0.020
Token T
Token ID309
Feature activation-4.38e-02

Pos logit contributions
 blankets+0.310
 ghosts+0.295
 doors+0.290
 belts+0.277
 hinges+0.276
Neg logit contributions
oni-0.390
melon-0.381
od-0.311
mod-0.302
abi-0.295
Tokenoni
Token ID14651
Feature activation-6.01e-01

Pos logit contributions
melon+0.084
oni+0.075
mod+0.067
 redd+0.064
 squash+0.063
Neg logit contributions
 blankets-0.075
 ghosts-0.072
 doors-0.070
 belts-0.069
 eyes-0.068
Token.
Token ID13
Feature activation-8.22e-03

Pos logit contributions
oni+1.415
melon+1.409
mod+1.182
od+1.163
iche+1.147
Neg logit contributions
 blankets-0.772
 ghosts-0.737
 doors-0.688
 years-0.672
 ages-0.656
Token He
Token ID679
Feature activation+2.38e-02

Pos logit contributions
oni+0.017
melon+0.017
od+0.014
mod+0.014
abi+0.013
Neg logit contributions
 blankets-0.013
 ghosts-0.012
 doors-0.012
 hinges-0.011
 belts-0.011
Token was
Token ID373
Feature activation+2.12e-02

Pos logit contributions
 blankets+0.031
 ghosts+0.030
 doors+0.028
 ages+0.027
 belts+0.027
Neg logit contributions
oni-0.055
melon-0.055
od-0.046
mod-0.045
abi-0.044
Token very
Token ID845
Feature activation+4.59e-02

Pos logit contributions
 blankets+0.027
 ghosts+0.026
 doors+0.025
 ages+0.023
 hinges+0.023
Neg logit contributions
oni-0.052
melon-0.050
od-0.042
mod-0.041
abi-0.041
Token happy
Token ID3772
Feature activation-7.45e-02

Pos logit contributions
 blankets+0.064
 ghosts+0.062
 doors+0.059
 belts+0.056
 err+0.056
Neg logit contributions
oni-0.102
melon-0.098
od-0.084
mod-0.081
abi-0.078
Token ones
Token ID3392
Feature activation-1.36e-01

Pos logit contributions
melon+0.362
od+0.342
oni+0.333
 fault+0.325
iche+0.314
Neg logit contributions
 blankets-0.187
 books-0.182
 ghosts-0.150
 packages-0.144
 days-0.142
Token with
Token ID351
Feature activation-9.00e-03

Pos logit contributions
melon+0.375
oni+0.360
iche+0.333
od+0.318
ackers+0.317
Neg logit contributions
 blankets-0.121
 years-0.112
 books-0.107
 ghosts-0.100
 doors-0.096
Token cheese
Token ID9891
Feature activation-1.76e-01

Pos logit contributions
melon+0.018
oni+0.017
mod+0.016
od+0.015
str+0.015
Neg logit contributions
 blankets-0.015
 doors-0.014
 eyes-0.013
 books-0.013
 lights-0.013
Token and
Token ID290
Feature activation+2.68e-03

Pos logit contributions
mod+0.419
melon+0.411
oni+0.398
 redd+0.378
od+0.375
Neg logit contributions
 blankets-0.206
 doors-0.175
 books-0.164
 eyes-0.163
 ghosts-0.160
Token pepper
Token ID13385
Feature activation-3.60e-01

Pos logit contributions
 blankets+0.006
 doors+0.005
 ghosts+0.005
 eyes+0.005
 belts+0.005
Neg logit contributions
melon-0.004
oni-0.004
od-0.004
mod-0.003
iche-0.003
Tokenoni
Token ID14651
Feature activation-6.01e-01

Pos logit contributions
mod+0.471
melon+0.416
 redd+0.401
rust+0.370
str+0.369
Neg logit contributions
 blankets-0.779
 eyes-0.741
 doors-0.727
 books-0.707
 belts-0.688
Token.
Token ID13
Feature activation-3.39e-03

Pos logit contributions
mod+1.353
melon+1.316
oni+1.203
 redd+1.200
str+1.146
Neg logit contributions
 blankets-0.742
 books-0.680
 doors-0.653
 eyes-0.642
 years-0.637
Token One
Token ID1881
Feature activation-1.06e-02

Pos logit contributions
oni+0.009
melon+0.008
od+0.007
mod+0.007
abi+0.007
Neg logit contributions
 blankets-0.004
 doors-0.004
 eyes-0.003
 beds-0.003
 Chloe-0.003
Token day
Token ID1110
Feature activation-2.96e-02

Pos logit contributions
oni+0.028
melon+0.027
od+0.027
mod+0.024
 fault+0.024
Neg logit contributions
 blankets-0.009
 books-0.008
 years-0.008
 days-0.007
 minutes-0.007
Token,
Token ID11
Feature activation-8.39e-03

Pos logit contributions
oni+0.056
melon+0.055
od+0.045
mod+0.043
 pumpkin+0.043
Neg logit contributions
 blankets-0.052
 ghosts-0.049
 doors-0.049
 belts-0.048
 hinges-0.047
Token she
Token ID673
Feature activation-8.98e-04

Pos logit contributions
oni+0.017
melon+0.017
mod+0.014
od+0.014
 pumpkin+0.013
Neg logit contributions
 blankets-0.013
 ghosts-0.013
 doors-0.013
 hinges-0.012
 belts-0.012
Token
Token ID198
Feature activation-1.65e-02

Pos logit contributions
oni+0.003
melon+0.003
od+0.002
mod+0.002
abi+0.002
Neg logit contributions
 blankets-0.002
 doors-0.002
 ghosts-0.002
 hinges-0.002
 belts-0.002
Token
Token ID198
Feature activation-1.73e-02

Pos logit contributions
oni+0.036
melon+0.036
od+0.029
mod+0.029
abi+0.028
Neg logit contributions
 blankets-0.024
 ghosts-0.023
 doors-0.022
 belts-0.021
 hinges-0.021
TokenOne
Token ID3198
Feature activation+1.49e-02

Pos logit contributions
oni+0.036
melon+0.034
od+0.029
mod+0.029
abi+0.027
Neg logit contributions
 blankets-0.028
 ghosts-0.026
 doors-0.026
 hinges-0.025
 belts-0.024
Token day
Token ID1110
Feature activation-3.17e-02

Pos logit contributions
 blankets+0.024
 ghosts+0.023
 doors+0.022
 ages+0.021
 hinges+0.021
Neg logit contributions
oni-0.031
melon-0.031
od-0.025
mod-0.025
abi-0.024
Token T
Token ID309
Feature activation-4.11e-02

Pos logit contributions
oni+0.059
melon+0.056
od+0.047
mod+0.045
 pumpkin+0.043
Neg logit contributions
 blankets-0.057
 ghosts-0.054
 doors-0.053
 hinges-0.052
 belts-0.051
Tokenoni
Token ID14651
Feature activation-6.01e-01

Pos logit contributions
melon+0.029
mod+0.023
 redd+0.019
str+0.014
 squash+0.014
Neg logit contributions
 eyes-0.104
 years-0.099
 books-0.097
 doors-0.095
 blankets-0.093
Token found
Token ID1043
Feature activation-9.58e-02

Pos logit contributions
oni+1.363
melon+1.333
od+1.116
mod+1.095
abi+1.071
Neg logit contributions
 blankets-0.848
 ghosts-0.802
 doors-0.780
 hinges-0.739
 belts-0.736
Token a
Token ID257
Feature activation+3.41e-02

Pos logit contributions
oni+0.252
melon+0.251
od+0.219
iche+0.217
mod+0.212
Neg logit contributions
 blankets-0.104
 ghosts-0.096
 doors-0.088
 footsteps-0.082
 ages-0.082
Token big
Token ID1263
Feature activation-2.71e-02

Pos logit contributions
 blankets+0.054
 ghosts+0.051
 doors+0.049
 hinges+0.047
 belts+0.046
Neg logit contributions
oni-0.072
melon-0.070
od-0.059
mod-0.058
abi-0.056
Token,
Token ID11
Feature activation-1.15e-02

Pos logit contributions
oni+0.055
melon+0.053
od+0.045
mod+0.044
abi+0.042
Neg logit contributions
 blankets-0.045
 ghosts-0.042
 doors-0.042
 hinges-0.039
 belts-0.039
Token round
Token ID2835
Feature activation-1.29e-01

Pos logit contributions
oni+0.023
melon+0.022
od+0.018
mod+0.018
abi+0.017
Neg logit contributions
 blankets-0.019
 ghosts-0.019
 doors-0.018
 hinges-0.017
 belts-0.017
Token,
Token ID11
Feature activation-9.99e-03

Pos logit contributions
oni+0.035
melon+0.034
mod+0.031
iche+0.030
abi+0.030
Neg logit contributions
 blankets-0.015
 ghosts-0.014
 eyes-0.014
 doors-0.014
 belts-0.013
Token "
Token ID366
Feature activation-3.30e-02

Pos logit contributions
oni+0.024
melon+0.023
mod+0.020
od+0.020
iche+0.020
Neg logit contributions
 blankets-0.012
 doors-0.011
 ghosts-0.011
 belts-0.011
 eyes-0.010
TokenHello
Token ID15496
Feature activation+6.70e-02

Pos logit contributions
melon+0.083
oni+0.082
od+0.071
iche+0.065
 pumpkin+0.065
Neg logit contributions
 blankets-0.036
 belts-0.035
 doors-0.033
 moments-0.032
 ghosts-0.032
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
 blankets+0.095
 ghosts+0.090
 doors+0.087
 belts+0.082
 hinges+0.081
Neg logit contributions
oni-0.150
melon-0.149
od-0.124
mod-0.124
abi-0.118
Token Pepper
Token ID24346
Feature activation-7.21e-02

Pos logit contributions
melon+0.023
oni+0.023
mod+0.021
od+0.020
iche+0.019
Neg logit contributions
 blankets-0.015
 ghosts-0.014
 eyes-0.014
 doors-0.014
 scar-0.013
Tokenoni
Token ID14651
Feature activation-6.01e-01

Pos logit contributions
melon+0.192
mod+0.184
oni+0.182
iche+0.172
od+0.168
Neg logit contributions
 blankets-0.064
 ghosts-0.057
 years-0.050
 Chloe-0.047
 books-0.045
Token.
Token ID13
Feature activation-2.51e-03

Pos logit contributions
melon+1.498
oni+1.470
mod+1.424
iche+1.320
 redd+1.289
Neg logit contributions
 blankets-0.619
 ghosts-0.583
 years-0.530
 books-0.518
 ages-0.511
Token You
Token ID921
Feature activation+5.89e-02

Pos logit contributions
oni+0.006
melon+0.006
od+0.005
mod+0.005
abi+0.005
Neg logit contributions
 blankets-0.003
 ghosts-0.003
 doors-0.003
 hinges-0.003
 belts-0.003
Token are
Token ID389
Feature activation-1.73e-02

Pos logit contributions
 blankets+0.070
 ghosts+0.066
 doors+0.062
 ages+0.059
 books+0.057
Neg logit contributions
oni-0.144
melon-0.143
mod-0.122
abi-0.122
od-0.120
Token my
Token ID616
Feature activation+1.33e-01

Pos logit contributions
oni+0.043
melon+0.043
mod+0.037
od+0.036
iche+0.036
Neg logit contributions
 blankets-0.020
 ghosts-0.020
 doors-0.018
 years-0.017
 ages-0.017
Token new
Token ID649
Feature activation-1.93e-02

Pos logit contributions
 blankets+0.222
 ghosts+0.215
 eyes+0.206
 doors+0.203
 belts+0.196
Neg logit contributions
melon-0.260
oni-0.260
mod-0.211
od-0.207
iche-0.206
Token -
Token ID532
Feature activation+2.84e-02

Pos logit contributions
 blankets+0.013
 ghosts+0.012
 doors+0.011
 hinges+0.011
 belts+0.011
Neg logit contributions
oni-0.026
melon-0.025
od-0.022
mod-0.021
abi-0.021
Token Mon
Token ID2892
Feature activation-2.05e-01

Pos logit contributions
 blankets+0.043
 ghosts+0.041
 doors+0.041
 hinges+0.039
 belts+0.039
Neg logit contributions
oni-0.060
melon-0.059
od-0.049
mod-0.047
 pumpkin-0.047
Tokena
Token ID64
Feature activation-1.82e-01

Pos logit contributions
melon+0.473
mod+0.395
 redd+0.377
oni+0.371
 squash+0.342
Neg logit contributions
 years-0.291
 blankets-0.263
 minutes-0.260
 ghosts-0.259
 belts-0.239
Token and
Token ID290
Feature activation-4.72e-04

Pos logit contributions
melon+0.451
oni+0.446
mod+0.388
od+0.370
 redd+0.369
Neg logit contributions
 blankets-0.216
 ghosts-0.199
 years-0.194
 doors-0.190
 ages-0.182
Token T
Token ID309
Feature activation-4.43e-02

Pos logit contributions
oni+0.001
melon+0.001
mod+0.001
iche+0.001
abi+0.001
Neg logit contributions
 ghosts-0.001
 blankets-0.001
 Chloe-0.001
 Kenny-0.001
 coats-0.001
Tokenoni
Token ID14651
Feature activation-6.01e-01

Pos logit contributions
melon+0.084
oni+0.072
mod+0.070
 redd+0.068
 pumpkin+0.063
Neg logit contributions
 blankets-0.076
 eyes-0.073
 ghosts-0.073
 footsteps-0.069
 doors-0.068
Token.
Token ID13
Feature activation-5.04e-03

Pos logit contributions
melon+1.489
oni+1.470
mod+1.315
 redd+1.264
abi+1.248
Neg logit contributions
 blankets-0.682
 ghosts-0.639
 years-0.600
 ages-0.576
 doors-0.576
Token Mon
Token ID2892
Feature activation-2.05e-01

Pos logit contributions
melon+0.012
oni+0.011
mod+0.011
 squash+0.011
str+0.010
Neg logit contributions
 blankets-0.006
 eyes-0.005
 beds-0.005
 doors-0.005
 Chloe-0.005
Tokena
Token ID64
Feature activation-1.82e-01

Pos logit contributions
melon+0.469
mod+0.381
 redd+0.380
 fault+0.354
oni+0.331
Neg logit contributions
 years-0.290
 minutes-0.230
 blankets-0.225
 ghosts-0.222
 belts-0.219
Token was
Token ID373
Feature activation+2.38e-02

Pos logit contributions
melon+0.458
oni+0.452
mod+0.393
abi+0.380
 redd+0.377
Neg logit contributions
 blankets-0.192
 years-0.192
 doors-0.183
 ghosts-0.177
 ages-0.173
Token three
Token ID1115
Feature activation-1.37e-02

Pos logit contributions
 ghosts+0.025
 blankets+0.022
 Chloe+0.021
 doors+0.019
 ages+0.019
Neg logit contributions
oni-0.066
melon-0.061
od-0.053
ade-0.052
aul-0.052
Token
Token ID198
Feature activation-1.87e-02

Pos logit contributions
 blankets+0.002
 ghosts+0.002
 doors+0.002
 Chloe+0.002
 Kenny+0.002
Neg logit contributions
melon-0.004
oni-0.004
 pumpkin-0.003
mod-0.003
od-0.003
Token
Token ID198
Feature activation-1.87e-02

Pos logit contributions
oni+0.041
melon+0.041
od+0.033
mod+0.033
abi+0.032
Neg logit contributions
 blankets-0.027
 ghosts-0.026
 doors-0.025
 belts-0.024
 hinges-0.024
TokenThe
Token ID464
Feature activation-1.02e-01

Pos logit contributions
oni+0.040
melon+0.039
od+0.032
mod+0.032
abi+0.031
Neg logit contributions
 blankets-0.029
 ghosts-0.027
 doors-0.027
 belts-0.026
 hinges-0.025
Token two
Token ID734
Feature activation-3.30e-02

Pos logit contributions
melon+0.187
oni+0.179
 pumpkin+0.150
 squash+0.140
mod+0.131
Neg logit contributions
 blankets-0.177
 ghosts-0.174
 ages-0.171
 years-0.165
 hinges-0.162
Token friends
Token ID2460
Feature activation+5.51e-02

Pos logit contributions
oni+0.074
melon+0.071
od+0.060
 pumpkin+0.058
mod+0.057
Neg logit contributions
 blankets-0.046
 ghosts-0.044
 hinges-0.043
 ages-0.042
 doors-0.041
Token ran
Token ID4966
Feature activation+2.12e-01

Pos logit contributions
 blankets+0.070
 ghosts+0.067
 doors+0.063
 hinges+0.062
 belts+0.059
Neg logit contributions
oni-0.130
melon-0.129
od-0.110
mod-0.106
 pumpkin-0.105
Token and
Token ID290
Feature activation-1.44e-03

Pos logit contributions
 blankets+0.398
 ghosts+0.393
jar+0.371
 hinges+0.370
 doors+0.365
Neg logit contributions
melon-0.352
oni-0.350
od-0.264
 squash-0.261
mod-0.251
Token ran
Token ID4966
Feature activation+2.12e-01

Pos logit contributions
melon+0.003
oni+0.003
od+0.003
mod+0.003
 squash+0.003
Neg logit contributions
 ghosts-0.002
 blankets-0.002
 hinges-0.002
 doors-0.002
 footsteps-0.002
Token,
Token ID11
Feature activation-1.57e-02

Pos logit contributions
 ghosts+0.379
 blankets+0.379
jar+0.357
 hinges+0.350
 doors+0.349
Neg logit contributions
melon-0.361
oni-0.358
od-0.278
 squash-0.268
 pumpkin-0.251
Token until
Token ID1566
Feature activation+4.79e-02

Pos logit contributions
oni+0.037
melon+0.037
od+0.031
mod+0.030
abi+0.030
Neg logit contributions
 blankets-0.020
 ghosts-0.019
 doors-0.018
 hinges-0.017
 belts-0.017
Token at
Token ID379
Feature activation-4.64e-02

Pos logit contributions
 blankets+0.084
 ghosts+0.081
 doors+0.078
 hinges+0.077
 belts+0.075
Neg logit contributions
oni-0.092
melon-0.089
od-0.071
mod-0.069
abi-0.068
Token noise
Token ID7838
Feature activation+1.43e-01

Pos logit contributions
melon+0.076
oni+0.075
mod+0.068
od+0.064
abi+0.064
Neg logit contributions
 ghosts-0.041
 blankets-0.039
 doors-0.038
 door-0.036
 bed-0.036
Token?"
Token ID1701
Feature activation-6.14e-02

Pos logit contributions
 blankets+0.212
 ghosts+0.209
 doors+0.200
 ages+0.182
 eyes+0.181
Neg logit contributions
oni-0.303
melon-0.294
mod-0.257
od-0.252
abi-0.245
Token
Token ID198
Feature activation-1.26e-02

Pos logit contributions
oni+0.131
melon+0.128
od+0.106
mod+0.103
abi+0.101
Neg logit contributions
 blankets-0.096
 ghosts-0.091
 doors-0.088
 hinges-0.084
 belts-0.084
Token
Token ID198
Feature activation-1.41e-02

Pos logit contributions
oni+0.025
melon+0.024
od+0.020
mod+0.020
abi+0.019
Neg logit contributions
 blankets-0.021
 ghosts-0.020
 doors-0.019
 hinges-0.019
 belts-0.019
TokenBen
Token ID11696
Feature activation-1.60e-01

Pos logit contributions
oni+0.029
melon+0.028
od+0.023
mod+0.023
abi+0.022
Neg logit contributions
 blankets-0.023
 ghosts-0.022
 doors-0.021
 hinges-0.020
 belts-0.020
Token runs
Token ID4539
Feature activation+1.95e-01

Pos logit contributions
oni+0.386
melon+0.384
mod+0.328
od+0.326
abi+0.315
Neg logit contributions
 blankets-0.198
 doors-0.185
 ghosts-0.183
 eyes-0.168
 belts-0.163
Token to
Token ID284
Feature activation-1.22e-02

Pos logit contributions
 blankets+0.384
 ghosts+0.370
 doors+0.349
 belts+0.347
 eyes+0.345
Neg logit contributions
oni-0.325
melon-0.322
od-0.225
 squash-0.225
mod-0.220
Token help
Token ID1037
Feature activation+1.11e-01

Pos logit contributions
oni+0.025
melon+0.025
od+0.020
mod+0.020
abi+0.019
Neg logit contributions
 blankets-0.019
 ghosts-0.019
 doors-0.018
 belts-0.018
 hinges-0.018
Token her
Token ID607
Feature activation+1.07e-02

Pos logit contributions
 blankets+0.156
 ghosts+0.147
 doors+0.143
 hinges+0.135
 belts+0.134
Neg logit contributions
oni-0.255
melon-0.250
od-0.210
mod-0.206
abi-0.201
Token.
Token ID13
Feature activation+1.62e-03

Pos logit contributions
 blankets+0.013
 doors+0.012
 ghosts+0.012
 footsteps+0.012
 hinges+0.011
Neg logit contributions
oni-0.027
melon-0.027
mod-0.023
abi-0.022
od-0.022
Token He
Token ID679
Feature activation+3.32e-02

Pos logit contributions
 blankets+0.002
 ghosts+0.002
 doors+0.002
 belts+0.002
 ages+0.002
Neg logit contributions
oni-0.004
melon-0.004
od-0.003
mod-0.003
abi-0.003
Token and
Token ID290
Feature activation+3.21e-03

Pos logit contributions
oni+0.639
melon+0.633
mod+0.538
od+0.535
iche+0.530
Neg logit contributions
 blankets-0.370
 doors-0.343
 ghosts-0.333
 belts-0.312
 eyes-0.311
Token gave
Token ID2921
Feature activation-5.98e-02

Pos logit contributions
 blankets+0.005
 ghosts+0.005
 doors+0.005
 hinges+0.005
 belts+0.005
Neg logit contributions
oni-0.007
melon-0.007
od-0.005
mod-0.005
 pumpkin-0.005
Token the
Token ID262
Feature activation-2.41e-02

Pos logit contributions
oni+0.133
melon+0.130
od+0.109
mod+0.106
abi+0.104
Neg logit contributions
 blankets-0.089
 ghosts-0.083
 doors-0.081
 belts-0.076
 hinges-0.076
Token boy
Token ID2933
Feature activation-1.41e-02

Pos logit contributions
oni+0.054
melon+0.050
mod+0.046
 redd+0.044
 pumpkin+0.043
Neg logit contributions
 blankets-0.030
 ghosts-0.029
 hinges-0.028
 terror-0.027
 ages-0.027
Token her
Token ID607
Feature activation+1.33e-02

Pos logit contributions
oni+0.034
melon+0.033
od+0.028
mod+0.028
iche+0.028
Neg logit contributions
 blankets-0.018
 ghosts-0.017
 doors-0.016
 belts-0.016
 books-0.015
Token belt
Token ID10999
Feature activation+2.07e-01

Pos logit contributions
 ghosts+0.016
 blankets+0.016
 hinges+0.015
 terror+0.015
 ages+0.015
Neg logit contributions
oni-0.029
melon-0.027
mod-0.025
od-0.024
abi-0.023
Token.
Token ID13
Feature activation-2.75e-03

Pos logit contributions
 blankets+0.298
 ghosts+0.278
 doors+0.275
 belts+0.260
 hinges+0.260
Neg logit contributions
oni-0.463
melon-0.452
od-0.380
mod-0.374
abi-0.367
Token She
Token ID1375
Feature activation+7.86e-03

Pos logit contributions
oni+0.006
melon+0.006
od+0.005
mod+0.005
 pumpkin+0.005
Neg logit contributions
 blankets-0.004
 ghosts-0.003
 doors-0.003
 hinges-0.003
 ages-0.003
Token felt
Token ID2936
Feature activation-5.68e-02

Pos logit contributions
 blankets+0.011
 doors+0.010
 ghosts+0.010
 belts+0.010
 ages+0.009
Neg logit contributions
oni-0.018
melon-0.018
od-0.015
mod-0.015
iche-0.015
Token sad
Token ID6507
Feature activation-1.16e-01

Pos logit contributions
oni+0.126
melon+0.121
od+0.103
mod+0.098
abi+0.096
Neg logit contributions
 blankets-0.085
 ghosts-0.080
 doors-0.078
 ages-0.074
 years-0.074
Token and
Token ID290
Feature activation+1.93e-03

Pos logit contributions
oni+0.236
melon+0.230
od+0.189
abi+0.178
 pumpkin+0.177
Neg logit contributions
 blankets-0.187
 ghosts-0.176
 doors-0.172
 belts-0.168
 hinges-0.167
Token!
Token ID0
Feature activation+6.01e-02

Pos logit contributions
oni+0.111
melon+0.102
od+0.094
 squash+0.078
 pumpkin+0.078
Neg logit contributions
 hinges-0.108
 blankets-0.107
jar-0.106
 doors-0.105
 brakes-0.104
Token After
Token ID2293
Feature activation+1.02e-01

Pos logit contributions
 blankets+0.099
 ghosts+0.095
 doors+0.091
 hinges+0.088
 belts+0.087
Neg logit contributions
oni-0.124
melon-0.120
od-0.099
mod-0.095
abi-0.094
Token cl
Token ID537
Feature activation+9.32e-03

Pos logit contributions
 hinges+0.186
 blankets+0.179
 passage+0.174
 doors+0.173
 torch+0.172
Neg logit contributions
oni-0.186
melon-0.174
od-0.150
 pumpkin-0.132
abi-0.131
Tokenapping
Token ID5912
Feature activation-1.76e-01

Pos logit contributions
 blankets+0.016
 doors+0.015
 ghosts+0.015
 eyes+0.015
 footsteps+0.015
Neg logit contributions
oni-0.018
melon-0.017
mod-0.014
od-0.014
abi-0.013
Token her
Token ID607
Feature activation+6.52e-03

Pos logit contributions
oni+0.382
melon+0.373
od+0.310
mod+0.304
abi+0.297
Neg logit contributions
 blankets-0.262
 ghosts-0.249
 doors-0.242
 hinges-0.231
 belts-0.229
Token hands
Token ID2832
Feature activation+1.99e-01

Pos logit contributions
 blankets+0.007
 ghosts+0.007
 hinges+0.007
 doors+0.007
 ages+0.007
Neg logit contributions
oni-0.015
melon-0.015
od-0.013
 pumpkin-0.013
 squash-0.013
Token,
Token ID11
Feature activation-1.78e-02

Pos logit contributions
 blankets+0.261
 doors+0.235
 ghosts+0.234
 belts+0.219
 ages+0.212
Neg logit contributions
oni-0.476
melon-0.452
mod-0.397
abi-0.394
od-0.388
Token the
Token ID262
Feature activation-3.09e-02

Pos logit contributions
oni+0.036
melon+0.035
od+0.029
mod+0.028
 pumpkin+0.028
Neg logit contributions
 blankets-0.029
 ghosts-0.028
 doors-0.027
 hinges-0.027
 belts-0.026
Token little
Token ID1310
Feature activation+5.26e-02

Pos logit contributions
oni+0.053
melon+0.051
 pumpkin+0.042
abi+0.042
 grape+0.041
Neg logit contributions
 blankets-0.053
 Kenny-0.051
 hinges-0.050
 ages-0.050
 ghosts-0.050
Token girl
Token ID2576
Feature activation+4.66e-02

Pos logit contributions
 blankets+0.077
 ghosts+0.073
 doors+0.071
 hinges+0.070
 ages+0.070
Neg logit contributions
oni-0.112
melon-0.111
mod-0.091
od-0.090
 pumpkin-0.089
Token felt
Token ID2936
Feature activation-5.97e-02

Pos logit contributions
 blankets+0.062
 doors+0.058
 ghosts+0.057
 belts+0.054
 ages+0.053
Neg logit contributions
oni-0.108
melon-0.107
mod-0.090
abi-0.087
od-0.087
Token thirds
Token ID41188
Feature activation+2.52e-02

Pos logit contributions
 blankets+0.016
 ghosts+0.016
 doors+0.015
 belts+0.015
 hinges+0.015
Neg logit contributions
oni-0.020
melon-0.019
od-0.016
mod-0.016
 pumpkin-0.016
Token,
Token ID11
Feature activation-1.80e-02

Pos logit contributions
 blankets+0.040
 ghosts+0.039
 doors+0.037
 hinges+0.036
 footsteps+0.036
Neg logit contributions
oni-0.052
melon-0.050
od-0.041
 pumpkin-0.040
abi-0.040
Token like
Token ID588
Feature activation-1.28e-01

Pos logit contributions
oni+0.032
melon+0.031
od+0.028
 pumpkin+0.026
 squash+0.026
Neg logit contributions
 blankets-0.029
 hinges-0.028
 footsteps-0.027
 passage-0.027
 ghosts-0.027
Token this
Token ID428
Feature activation-2.87e-02

Pos logit contributions
oni+0.241
melon+0.238
 squash+0.197
 pumpkin+0.196
od+0.189
Neg logit contributions
 blankets-0.198
 hinges-0.196
 ghosts-0.194
 belts-0.190
 doors-0.190
Token.
Token ID13
Feature activation-7.71e-03

Pos logit contributions
oni+0.063
melon+0.061
od+0.052
mod+0.051
abi+0.048
Neg logit contributions
 blankets-0.044
 ghosts-0.040
 doors-0.040
 belts-0.038
 ages-0.037
Token Now
Token ID2735
Feature activation+2.06e-01

Pos logit contributions
oni+0.015
melon+0.015
od+0.013
abi+0.012
mod+0.012
Neg logit contributions
 blankets-0.012
 ghosts-0.011
 doors-0.011
 err-0.011
 beds-0.011
Token you
Token ID345
Feature activation+1.20e-01

Pos logit contributions
 blankets+0.333
 ghosts+0.317
 doors+0.305
 hinges+0.303
 ages+0.301
Neg logit contributions
oni-0.397
melon-0.390
od-0.326
 pumpkin-0.318
 squash-0.310
Token try
Token ID1949
Feature activation-2.62e-02

Pos logit contributions
 blankets+0.154
 ghosts+0.144
 doors+0.140
 ages+0.132
 hinges+0.131
Neg logit contributions
oni-0.288
melon-0.283
od-0.237
mod-0.236
abi-0.230
Token it
Token ID340
Feature activation-1.49e-02

Pos logit contributions
melon+0.062
oni+0.061
mod+0.051
od+0.049
iche+0.047
Neg logit contributions
 blankets-0.036
 doors-0.033
 ghosts-0.033
 ages-0.030
 belts-0.030
Token."
Token ID526
Feature activation-1.60e-02

Pos logit contributions
melon+0.034
oni+0.034
mod+0.029
od+0.028
 pumpkin+0.028
Neg logit contributions
 blankets-0.020
 doors-0.019
 ghosts-0.018
 belts-0.017
 hinges-0.017
Token Kathy
Token ID33726
Feature activation+2.03e-01

Pos logit contributions
oni+0.023
mod+0.021
 squash+0.020
od+0.020
melon+0.020
Neg logit contributions
 passage-0.025
 blankets-0.023
 chests-0.023
 years-0.023
 ghosts-0.023
Token there
Token ID612
Feature activation+5.06e-02

Pos logit contributions
oni+0.041
melon+0.039
od+0.034
mod+0.034
abi+0.033
Neg logit contributions
 blankets-0.017
 ghosts-0.016
 doors-0.015
 ages-0.014
 hinges-0.014
Token was
Token ID373
Feature activation+2.21e-02

Pos logit contributions
 blankets+0.041
 ghosts+0.037
 Chloe+0.037
 Kenny+0.036
 ages+0.035
Neg logit contributions
oni-0.142
melon-0.139
mod-0.120
abi-0.120
iche-0.120
Token a
Token ID257
Feature activation+2.80e-02

Pos logit contributions
 blankets+0.026
 ghosts+0.023
 ages+0.022
 doors+0.021
 footsteps+0.020
Neg logit contributions
oni-0.057
melon-0.057
od-0.047
mod-0.046
abi-0.046
Token little
Token ID1310
Feature activation+5.14e-02

Pos logit contributions
 blankets+0.032
 ghosts+0.031
 doors+0.029
 belts+0.027
 years+0.026
Neg logit contributions
oni-0.070
melon-0.069
od-0.059
mod-0.059
abi-0.055
Token girl
Token ID2576
Feature activation+4.60e-02

Pos logit contributions
 blankets+0.072
 ghosts+0.069
 doors+0.065
 belts+0.062
 hinges+0.061
Neg logit contributions
oni-0.118
melon-0.114
od-0.097
mod-0.096
abi-0.093
Token named
Token ID3706
Feature activation+1.06e-01

Pos logit contributions
 blankets+0.059
 ghosts+0.055
 doors+0.055
 belts+0.052
 hinges+0.052
Neg logit contributions
oni-0.108
melon-0.107
od-0.091
mod-0.089
 pumpkin-0.088
Token Lily
Token ID20037
Feature activation+1.35e-02

Pos logit contributions
 blankets+0.176
 ghosts+0.170
 belts+0.166
 doors+0.166
 brakes+0.162
Neg logit contributions
oni-0.205
melon-0.204
od-0.169
mod-0.159
 squash-0.158
Token who
Token ID508
Feature activation+4.59e-02

Pos logit contributions
 blankets+0.017
 ghosts+0.016
 doors+0.015
 years+0.015
 hinges+0.015
Neg logit contributions
oni-0.033
melon-0.032
od-0.026
mod-0.026
abi-0.026
Token loved
Token ID6151
Feature activation-1.55e-02

Pos logit contributions
 blankets+0.073
 ghosts+0.070
 doors+0.069
 hinges+0.065
 belts+0.065
Neg logit contributions
oni-0.096
melon-0.094
od-0.077
mod-0.076
abi-0.074
Token to
Token ID284
Feature activation-1.54e-02

Pos logit contributions
oni+0.031
melon+0.030
 pumpkin+0.024
od+0.024
mod+0.024
Neg logit contributions
 blankets-0.025
 ghosts-0.024
 doors-0.024
 hinges-0.024
 ages-0.023
Token dream
Token ID4320
Feature activation+1.11e-01

Pos logit contributions
oni+0.037
melon+0.036
od+0.030
mod+0.030
abi+0.029
Neg logit contributions
 blankets-0.020
 ghosts-0.019
 doors-0.018
 hinges-0.017
 belts-0.017
Token butter
Token ID9215
Feature activation-2.67e-01

Pos logit contributions
oni+0.115
melon+0.112
od+0.091
mod+0.088
abi+0.084
Neg logit contributions
 blankets-0.130
 doors-0.122
 ghosts-0.121
 hinges-0.115
 belts-0.115
Token was
Token ID373
Feature activation+4.48e-02

Pos logit contributions
oni+0.682
melon+0.655
mod+0.629
od+0.612
iche+0.602
Neg logit contributions
 blankets-0.292
 doors-0.236
 eyes-0.201
 beds-0.196
 belts-0.194
Token soft
Token ID2705
Feature activation+3.62e-02

Pos logit contributions
 blankets+0.058
 doors+0.049
 eyes+0.047
 minutes+0.046
 years+0.046
Neg logit contributions
oni-0.107
melon-0.104
od-0.096
mod-0.091
iche-0.088
Token and
Token ID290
Feature activation+1.59e-02

Pos logit contributions
 blankets+0.041
 doors+0.032
 ghosts+0.031
 eyes+0.031
 years+0.030
Neg logit contributions
oni-0.096
melon-0.092
mod-0.086
od-0.085
iche-0.082
Token smooth
Token ID7209
Feature activation-1.23e-01

Pos logit contributions
 blankets+0.027
 doors+0.024
 ghosts+0.024
 belts+0.022
 hinges+0.022
Neg logit contributions
oni-0.033
melon-0.032
od-0.027
mod-0.026
iche-0.025
Token,
Token ID11
Feature activation+3.10e-03

Pos logit contributions
oni+0.312
melon+0.304
mod+0.291
od+0.277
iche+0.276
Neg logit contributions
 blankets-0.139
 doors-0.114
 eyes-0.109
 ghosts-0.102
 years-0.102
Token just
Token ID655
Feature activation-7.58e-02

Pos logit contributions
 blankets+0.003
 doors+0.003
 ghosts+0.003
 minutes+0.003
 eyes+0.003
Neg logit contributions
oni-0.008
melon-0.008
od-0.007
mod-0.007
iche-0.007
Token like
Token ID588
Feature activation-1.15e-01

Pos logit contributions
oni+0.166
melon+0.158
od+0.135
mod+0.133
iche+0.127
Neg logit contributions
 blankets-0.116
 ghosts-0.107
 minutes-0.103
 doors-0.103
 ages-0.102
Token Lily
Token ID20037
Feature activation+3.19e-02

Pos logit contributions
oni+0.243
melon+0.237
od+0.195
mod+0.190
abi+0.187
Neg logit contributions
 blankets-0.178
 ghosts-0.170
 doors-0.166
 hinges-0.160
 belts-0.159
Token's
Token ID338
Feature activation-2.07e-01

Pos logit contributions
 blankets+0.044
 years+0.042
 ghosts+0.040
 Chloe+0.038
 doors+0.037
Neg logit contributions
melon-0.073
oni-0.072
mod-0.061
emo-0.058
iche-0.057
Token favorite
Token ID4004
Feature activation-1.84e-01

Pos logit contributions
oni+0.453
melon+0.446
od+0.382
mod+0.371
abi+0.353
Neg logit contributions
 blankets-0.324
 eyes-0.294
 doors-0.286
 ghosts-0.276
 hinges-0.269
Token and
Token ID290
Feature activation-2.91e-03

Pos logit contributions
oni+0.362
melon+0.341
od+0.282
 squash+0.257
 pumpkin+0.256
Neg logit contributions
 blankets-0.345
 doors-0.325
 hinges-0.320
 ghosts-0.318
 belts-0.313
Token always
Token ID1464
Feature activation-1.54e-01

Pos logit contributions
oni+0.006
melon+0.006
od+0.005
mod+0.005
abi+0.005
Neg logit contributions
 blankets-0.004
 ghosts-0.004
 doors-0.004
 belts-0.004
 ages-0.004
Token wanted
Token ID2227
Feature activation-8.00e-03

Pos logit contributions
oni+0.321
melon+0.316
od+0.259
mod+0.253
abi+0.246
Neg logit contributions
 blankets-0.246
 ghosts-0.231
 doors-0.228
 hinges-0.219
 belts-0.217
Token all
Token ID477
Feature activation-5.84e-03

Pos logit contributions
oni+0.017
melon+0.017
od+0.014
mod+0.014
abi+0.014
Neg logit contributions
 blankets-0.012
 ghosts-0.011
 doors-0.011
 hinges-0.011
 belts-0.010
Token the
Token ID262
Feature activation-2.77e-02

Pos logit contributions
oni+0.014
melon+0.013
abi+0.012
mod+0.012
od+0.012
Neg logit contributions
 blankets-0.008
 ghosts-0.007
 doors-0.007
 ages-0.007
 books-0.006
Token food
Token ID2057
Feature activation+1.17e-02

Pos logit contributions
oni+0.061
melon+0.059
od+0.049
mod+0.048
abi+0.047
Neg logit contributions
 blankets-0.042
 ghosts-0.040
 doors-0.039
 hinges-0.037
 ages-0.036
Token for
Token ID329
Feature activation+6.83e-02

Pos logit contributions
 blankets+0.017
 ghosts+0.016
 doors+0.015
 belts+0.014
 hinges+0.014
Neg logit contributions
oni-0.026
melon-0.026
od-0.022
mod-0.022
abi-0.021
Token itself
Token ID2346
Feature activation-6.81e-02

Pos logit contributions
 blankets+0.106
 ghosts+0.100
 doors+0.099
 hinges+0.095
 belts+0.095
Neg logit contributions
oni-0.145
melon-0.139
od-0.117
mod-0.113
abi-0.112
Token.
Token ID13
Feature activation-6.48e-03

Pos logit contributions
oni+0.161
melon+0.157
mod+0.136
od+0.131
abi+0.131
Neg logit contributions
 blankets-0.091
 ghosts-0.086
 doors-0.080
 ages-0.076
 belts-0.076
Token
Token ID198
Feature activation-2.38e-02

Pos logit contributions
oni+0.014
melon+0.014
od+0.012
mod+0.011
abi+0.011
Neg logit contributions
 blankets-0.010
 ghosts-0.009
 doors-0.009
 hinges-0.008
 ages-0.008
Token
Token ID198
Feature activation-2.41e-02

Pos logit contributions
oni+0.053
melon+0.052
od+0.043
mod+0.042
abi+0.041
Neg logit contributions
 blankets-0.035
 ghosts-0.033
 doors-0.032
 hinges-0.030
 ages-0.030
Token too
Token ID1165
Feature activation+1.18e-01

Pos logit contributions
 blankets+0.164
 ghosts+0.156
 doors+0.151
 hinges+0.144
 belts+0.142
Neg logit contributions
oni-0.294
melon-0.287
od-0.241
mod-0.236
abi-0.233
Token."
Token ID526
Feature activation-1.01e-02

Pos logit contributions
 blankets+0.203
 ghosts+0.195
 doors+0.190
 belts+0.185
 hinges+0.182
Neg logit contributions
oni-0.228
melon-0.217
od-0.179
abi-0.168
 pumpkin-0.166
Token
Token ID198
Feature activation-1.26e-02

Pos logit contributions
oni+0.019
melon+0.018
od+0.014
abi+0.014
 pumpkin+0.014
Neg logit contributions
 ghosts-0.018
 blankets-0.018
 doors-0.017
 hinges-0.017
 belts-0.017
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
oni+0.027
melon+0.026
od+0.022
mod+0.021
abi+0.021
Neg logit contributions
 blankets-0.019
 ghosts-0.019
 doors-0.018
 hinges-0.018
 belts-0.017
TokenThe
Token ID464
Feature activation-1.00e-01

Pos logit contributions
oni+0.027
melon+0.026
od+0.021
abi+0.021
mod+0.020
Neg logit contributions
 ghosts-0.024
 blankets-0.023
 doors-0.023
 hinges-0.022
 footsteps-0.021
Token end
Token ID886
Feature activation-2.87e-03

Pos logit contributions
oni+0.206
melon+0.201
 pumpkin+0.159
 squash+0.154
od+0.154
Neg logit contributions
 blankets-0.162
 ghosts-0.158
 doors-0.153
 hinges-0.150
 belts-0.148
Token.
Token ID13
Feature activation-3.72e-04

Pos logit contributions
oni+0.005
melon+0.004
od+0.003
 squash+0.003
 pumpkin+0.003
Neg logit contributions
 blankets-0.006
 doors-0.005
 brakes-0.005
 ghosts-0.005
 luggage-0.005
Token
Token ID198
Feature activation-1.58e-02

Pos logit contributions
oni+0.001
melon+0.001
od+0.001
abi+0.001
mod+0.001
Neg logit contributions
 blankets-0.001
 doors-0.001
 ghosts-0.001
 belts-0.001
 hinges-0.001
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
oni+0.040
melon+0.039
od+0.033
mod+0.033
abi+0.032
Neg logit contributions
 blankets-0.019
 ghosts-0.017
 doors-0.017
 hinges-0.016
 belts-0.016
TokenM
Token ID44
Feature activation-6.13e-02

Pos logit contributions
oni+0.034
melon+0.033
od+0.027
mod+0.026
abi+0.026
Neg logit contributions
 blankets-0.029
 ghosts-0.028
 doors-0.027
 hinges-0.026
 belts-0.026
Tokenoral
Token ID6864
Feature activation-3.27e-02

Pos logit contributions
oni+0.106
melon+0.098
od+0.076
abi+0.074
 pumpkin+0.072
Neg logit contributions
 blankets-0.125
 ghosts-0.122
 doors-0.117
 hinges-0.113
 ages-0.113
Token a
Token ID257
Feature activation+3.21e-02

Pos logit contributions
oni+0.213
melon+0.201
od+0.172
 pumpkin+0.157
mod+0.156
Neg logit contributions
 blankets-0.142
 ghosts-0.138
 hinges-0.134
 years-0.131
 belts-0.130
Token click
Token ID3904
Feature activation+1.99e-01

Pos logit contributions
 blankets+0.048
 ghosts+0.048
 ages+0.045
 belts+0.044
 err+0.043
Neg logit contributions
oni-0.068
melon-0.064
od-0.055
 pumpkin-0.054
 squash-0.052
Token sound
Token ID2128
Feature activation+4.79e-02

Pos logit contributions
 blankets+0.289
 ghosts+0.273
 doors+0.268
 hinges+0.253
 belts+0.252
Neg logit contributions
oni-0.441
melon-0.432
od-0.358
mod-0.354
abi-0.345
Token when
Token ID618
Feature activation+6.31e-03

Pos logit contributions
 blankets+0.069
 ghosts+0.066
 doors+0.064
 hinges+0.060
 belts+0.060
Neg logit contributions
oni-0.107
melon-0.105
od-0.086
mod-0.086
abi-0.084
Token he
Token ID339
Feature activation-3.58e-03

Pos logit contributions
 blankets+0.011
 ghosts+0.010
 hinges+0.010
 err+0.010
 belts+0.010
Neg logit contributions
oni-0.012
melon-0.012
od-0.009
 pumpkin-0.009
 squash-0.009
Token turned
Token ID2900
Feature activation+6.63e-02

Pos logit contributions
oni+0.007
melon+0.007
od+0.006
mod+0.006
abi+0.006
Neg logit contributions
 blankets-0.006
 ghosts-0.005
 doors-0.005
 hinges-0.005
 err-0.005
Token it
Token ID340
Feature activation-1.19e-02

Pos logit contributions
 blankets+0.101
 ghosts+0.097
 doors+0.092
 hinges+0.091
 belts+0.089
Neg logit contributions
oni-0.141
melon-0.138
od-0.115
mod-0.111
 pumpkin-0.109
Token.
Token ID13
Feature activation+2.05e-04

Pos logit contributions
oni+0.025
melon+0.024
od+0.020
mod+0.019
 pumpkin+0.018
Neg logit contributions
 blankets-0.019
 ghosts-0.018
 ages-0.017
 doors-0.017
 hinges-0.017
Token
Token ID198
Feature activation-6.24e-03

Pos logit contributions
 blankets+0.000
 ghosts+0.000
 doors+0.000
 hinges+0.000
 belts+0.000
Neg logit contributions
oni-0.000
melon-0.000
od-0.000
mod-0.000
abi-0.000
Token
Token ID198
Feature activation-7.36e-03

Pos logit contributions
oni+0.014
melon+0.014
od+0.011
mod+0.011
 pumpkin+0.011
Neg logit contributions
 blankets-0.009
 ghosts-0.008
 doors-0.008
 ages-0.008
 belts-0.008
TokenHe
Token ID1544
Feature activation+6.22e-02

Pos logit contributions
oni+0.015
melon+0.015
od+0.012
mod+0.012
abi+0.012
Neg logit contributions
 blankets-0.012
 ghosts-0.012
 doors-0.011
 hinges-0.011
 belts-0.011
Token and
Token ID290
Feature activation+1.32e-02

Pos logit contributions
 blankets+0.023
 ghosts+0.021
 doors+0.021
 hinges+0.020
 belts+0.020
Neg logit contributions
oni-0.038
melon-0.037
od-0.031
mod-0.030
abi-0.030
Token went
Token ID1816
Feature activation+9.65e-02

Pos logit contributions
 ghosts+0.022
 blankets+0.022
 doors+0.020
 hinges+0.020
 bedrooms+0.020
Neg logit contributions
melon-0.026
oni-0.025
 pumpkin-0.020
 squash-0.019
 Crab-0.019
Token home
Token ID1363
Feature activation+1.78e-02

Pos logit contributions
 ghosts+0.171
 blankets+0.163
 terror+0.153
 hinges+0.153
 ages+0.150
Neg logit contributions
melon-0.182
oni-0.180
 pumpkin-0.135
od-0.133
abi-0.132
Token with
Token ID351
Feature activation-5.10e-03

Pos logit contributions
 blankets+0.028
 ghosts+0.028
 doors+0.027
 hinges+0.025
 ages+0.025
Neg logit contributions
oni-0.036
melon-0.035
od-0.029
mod-0.027
abi-0.027
Token the
Token ID262
Feature activation-1.08e-02

Pos logit contributions
oni+0.010
melon+0.010
od+0.008
abi+0.008
 pumpkin+0.008
Neg logit contributions
 blankets-0.008
 ghosts-0.008
 hinges-0.008
 doors-0.008
 belts-0.008
Token pole
Token ID16825
Feature activation-1.48e-01

Pos logit contributions
oni+0.022
melon+0.022
 pumpkin+0.018
mod+0.017
 squash+0.017
Neg logit contributions
 blankets-0.017
 ghosts-0.016
 doors-0.016
 hinges-0.015
 belts-0.015
Token.
Token ID13
Feature activation+2.20e-03

Pos logit contributions
oni+0.326
melon+0.318
od+0.266
mod+0.263
abi+0.256
Neg logit contributions
 blankets-0.216
 ghosts-0.204
 doors-0.199
 belts-0.189
 hinges-0.189
Token
Token ID220
Feature activation+1.59e-02

Pos logit contributions
 blankets+0.004
 ghosts+0.004
 doors+0.004
 belts+0.003
 hinges+0.003
Neg logit contributions
oni-0.004
melon-0.004
od-0.003
abi-0.003
mod-0.003
Token
Token ID198
Feature activation-9.75e-03

Pos logit contributions
 blankets+0.021
 ghosts+0.019
 doors+0.019
 belts+0.018
 hinges+0.018
Neg logit contributions
melon-0.037
oni-0.037
mod-0.031
od-0.030
 pumpkin-0.030
Token
Token ID198
Feature activation-1.23e-02

Pos logit contributions
oni+0.023
melon+0.023
od+0.019
mod+0.019
 pumpkin+0.018
Neg logit contributions
 blankets-0.013
 ghosts-0.012
 doors-0.011
 ages-0.011
 Kenny-0.011
TokenWhen
Token ID2215
Feature activation+1.84e-01

Pos logit contributions
oni+0.026
melon+0.025
od+0.021
mod+0.021
abi+0.020
Neg logit contributions
 blankets-0.019
 ghosts-0.018
 doors-0.018
 hinges-0.017
 belts-0.017
Token.
Token ID13
Feature activation-4.94e-03

Pos logit contributions
 blankets+0.037
 ghosts+0.035
 doors+0.034
 hinges+0.032
 belts+0.032
Neg logit contributions
oni-0.062
melon-0.060
mod-0.050
od-0.049
abi-0.048
Token
Token ID198
Feature activation-1.52e-02

Pos logit contributions
oni+0.010
melon+0.010
od+0.008
mod+0.008
abi+0.008
Neg logit contributions
 blankets-0.008
 ghosts-0.008
 doors-0.007
 belts-0.007
 err-0.007
Token
Token ID198
Feature activation-1.62e-02

Pos logit contributions
oni+0.029
melon+0.028
od+0.025
mod+0.023
abi+0.022
Neg logit contributions
 blankets-0.026
 ghosts-0.025
 doors-0.024
 belts-0.023
 hinges-0.023
Token"
Token ID1
Feature activation+3.32e-04

Pos logit contributions
oni+0.031
melon+0.030
od+0.026
mod+0.024
 pumpkin+0.023
Neg logit contributions
 blankets-0.028
 ghosts-0.027
 doors-0.026
 hinges-0.025
 belts-0.025
TokenThat
Token ID2504
Feature activation-9.47e-02

Pos logit contributions
 blankets+0.000
 ghosts+0.000
 doors+0.000
 ages+0.000
 belts+0.000
Neg logit contributions
oni-0.001
melon-0.001
od-0.001
abi-0.001
 pumpkin-0.001
Token's
Token ID338
Feature activation-2.21e-01

Pos logit contributions
oni+0.187
melon+0.181
mod+0.155
od+0.153
abi+0.149
Neg logit contributions
 blankets-0.163
 doors-0.152
 ghosts-0.152
 eyes-0.142
 luggage-0.140
Token an
Token ID281
Feature activation-4.32e-02

Pos logit contributions
oni+0.485
melon+0.475
od+0.391
mod+0.384
abi+0.379
Neg logit contributions
 blankets-0.329
 ghosts-0.312
 doors-0.306
 belts-0.292
 hinges-0.291
Token enormous
Token ID9812
Feature activation-1.05e-01

Pos logit contributions
oni+0.088
melon+0.085
od+0.070
mod+0.070
abi+0.067
Neg logit contributions
 blankets-0.069
 ghosts-0.068
 doors-0.067
 hinges-0.063
 years-0.062
Token cup
Token ID6508
Feature activation-2.82e-02

Pos logit contributions
oni+0.206
melon+0.206
 pumpkin+0.164
od+0.162
 squash+0.158
Neg logit contributions
 blankets-0.177
 ghosts-0.170
 doors-0.164
 hinges-0.161
 belts-0.160
Tokenboard
Token ID3526
Feature activation-3.18e-02

Pos logit contributions
oni+0.062
melon+0.060
od+0.050
mod+0.049
abi+0.048
Neg logit contributions
 blankets-0.042
 ghosts-0.040
 doors-0.039
 hinges-0.037
 belts-0.037
Token,"
Token ID553
Feature activation+2.02e-02

Pos logit contributions
oni+0.066
melon+0.065
od+0.053
 pumpkin+0.051
mod+0.051
Neg logit contributions
 blankets-0.049
 ghosts-0.048
 doors-0.045
 hinges-0.044
 footsteps-0.044
Token happy
Token ID3772
Feature activation-5.35e-02

Pos logit contributions
 blankets+0.099
 ghosts+0.093
 doors+0.092
 years+0.087
 ages+0.087
Neg logit contributions
oni-0.140
melon-0.133
mod-0.114
od-0.112
abi-0.108
Token and
Token ID290
Feature activation+1.17e-02

Pos logit contributions
oni+0.100
melon+0.099
od+0.082
 pumpkin+0.078
 squash+0.075
Neg logit contributions
 blankets-0.093
 hinges-0.089
 doors-0.088
 ghosts-0.087
 belts-0.086
Token they
Token ID484
Feature activation-4.01e-02

Pos logit contributions
 blankets+0.019
 ghosts+0.018
 doors+0.018
 hinges+0.017
 belts+0.017
Neg logit contributions
oni-0.024
melon-0.023
od-0.019
mod-0.019
 pumpkin-0.018
Token had
Token ID550
Feature activation+4.51e-02

Pos logit contributions
oni+0.083
melon+0.082
od+0.067
mod+0.063
 pumpkin+0.063
Neg logit contributions
 blankets-0.064
 ghosts-0.063
 doors-0.061
 hinges-0.060
 footsteps-0.058
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
 blankets+0.060
 ghosts+0.052
 doors+0.052
 ages+0.051
 belts+0.048
Neg logit contributions
oni-0.109
melon-0.108
mod-0.091
od-0.091
iche-0.088
Token best
Token ID1266
Feature activation-2.06e-01

Pos logit contributions
oni+0.027
melon+0.025
od+0.021
mod+0.020
abi+0.019
Neg logit contributions
 blankets-0.033
 ghosts-0.031
 doors-0.031
 belts-0.029
 ages-0.029
Token time
Token ID640
Feature activation+4.87e-03

Pos logit contributions
oni+0.380
melon+0.378
 pumpkin+0.310
 squash+0.308
od+0.297
Neg logit contributions
 doors-0.344
 ghosts-0.341
 blankets-0.340
 hinges-0.337
 belts-0.322
Token ever
Token ID1683
Feature activation+3.87e-02

Pos logit contributions
 blankets+0.008
 hinges+0.008
 ghosts+0.008
 doors+0.008
 footsteps+0.007
Neg logit contributions
oni-0.009
melon-0.009
od-0.008
 pumpkin-0.007
 squash-0.007
Token.
Token ID13
Feature activation+7.09e-03

Pos logit contributions
 blankets+0.060
 ghosts+0.057
 doors+0.055
 hinges+0.054
 belts+0.053
Neg logit contributions
oni-0.083
melon-0.081
od-0.068
mod-0.064
 pumpkin-0.063
Token<|endoftext|>
Token ID50256
Feature activation+1.13e-01

Pos logit contributions
 blankets+0.010
 ghosts+0.009
 doors+0.009
 ages+0.009
 hinges+0.009
Neg logit contributions
oni-0.016
melon-0.016
od-0.013
mod-0.013
abi-0.013
TokenM
Token ID44
Feature activation-5.41e-02

Pos logit contributions
 blankets+0.156
 ghosts+0.147
 doors+0.143
 ages+0.136
 belts+0.132
Neg logit contributions
oni-0.257
melon-0.253
od-0.210
mod-0.208
abi-0.204
Tokenila
Token ID10102
Feature activation-9.44e-02

Pos logit contributions
 ghosts+0.144
 doors+0.143
 Chloe+0.136
 years+0.134
 ages+0.131
Neg logit contributions
melon-0.271
oni-0.259
 redd-0.226
 pumpkin-0.218
mod-0.218
Token said
Token ID531
Feature activation-1.63e-02

Pos logit contributions
melon+0.235
oni+0.228
mod+0.190
abi+0.188
iche+0.187
Neg logit contributions
 doors-0.114
 blankets-0.112
 ghosts-0.107
 eyes-0.102
 years-0.099
Token.
Token ID13
Feature activation+1.68e-03

Pos logit contributions
melon+0.044
oni+0.044
iche+0.038
 redd+0.038
mod+0.038
Neg logit contributions
 blankets-0.015
 eyes-0.014
 ghosts-0.013
 doors-0.013
 footsteps-0.013
Token
Token ID198
Feature activation-1.33e-02

Pos logit contributions
 blankets+0.002
 ghosts+0.002
 doors+0.002
 Chloe+0.002
 footsteps+0.002
Neg logit contributions
melon-0.004
oni-0.004
od-0.004
iche-0.004
abi-0.004
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
melon+0.031
oni+0.031
od+0.025
mod+0.025
 pumpkin+0.024
Neg logit contributions
 blankets-0.018
 ghosts-0.017
 doors-0.016
 hinges-0.015
 ages-0.015
TokenBen
Token ID11696
Feature activation-1.68e-01

Pos logit contributions
melon+0.032
oni+0.032
od+0.025
mod+0.025
abi+0.025
Neg logit contributions
 blankets-0.022
 ghosts-0.021
 doors-0.021
 belts-0.020
 years-0.019
Token nodded
Token ID14464
Feature activation-3.24e-02

Pos logit contributions
melon+0.410
oni+0.407
mod+0.364
od+0.356
iche+0.356
Neg logit contributions
 blankets-0.184
 doors-0.178
 eyes-0.169
 belts-0.159
 terror-0.156
Token.
Token ID13
Feature activation+9.29e-04

Pos logit contributions
oni+0.082
melon+0.081
mod+0.068
od+0.068
abi+0.066
Neg logit contributions
 blankets-0.037
 doors-0.034
 ghosts-0.034
 eyes-0.032
 footsteps-0.032
Token He
Token ID679
Feature activation+3.30e-02

Pos logit contributions
 blankets+0.001
 ghosts+0.001
 doors+0.001
 hinges+0.001
 belts+0.001
Neg logit contributions
oni-0.002
melon-0.002
od-0.002
mod-0.002
abi-0.002
Token was
Token ID373
Feature activation+2.92e-02

Pos logit contributions
 blankets+0.044
 doors+0.041
 ghosts+0.039
 belts+0.039
 years+0.037
Neg logit contributions
melon-0.075
oni-0.074
od-0.063
mod-0.063
iche-0.062
Token hungry
Token ID14720
Feature activation-1.19e-01

Pos logit contributions
 blankets+0.035
 ghosts+0.035
 doors+0.034
 years+0.033
 minutes+0.032
Neg logit contributions
oni-0.070
melon-0.069
od-0.060
mod-0.057
 pumpkin-0.057
Token Foo
Token ID36080
Feature activation-4.07e-03

Pos logit contributions
oni+0.152
melon+0.151
od+0.126
mod+0.124
iche+0.122
Neg logit contributions
 blankets-0.082
 ghosts-0.078
 doors-0.075
 belts-0.071
 ages-0.070
Token medicine
Token ID9007
Feature activation+4.42e-02

Pos logit contributions
oni+0.009
melon+0.009
od+0.007
mod+0.007
abi+0.007
Neg logit contributions
 blankets-0.006
 ghosts-0.006
 doors-0.005
 belts-0.005
 hinges-0.005
Token to
Token ID284
Feature activation-1.12e-02

Pos logit contributions
 blankets+0.062
 ghosts+0.057
 doors+0.056
 belts+0.053
 hinges+0.052
Neg logit contributions
oni-0.100
melon-0.098
od-0.084
mod-0.084
abi-0.081
Token make
Token ID787
Feature activation-1.03e-01

Pos logit contributions
oni+0.025
melon+0.024
od+0.020
mod+0.020
abi+0.020
Neg logit contributions
 blankets-0.016
 ghosts-0.015
 doors-0.015
 belts-0.014
 hinges-0.014
Token him
Token ID683
Feature activation+9.60e-03

Pos logit contributions
oni+0.226
melon+0.221
od+0.184
mod+0.180
abi+0.176
Neg logit contributions
 blankets-0.153
 ghosts-0.145
 doors-0.141
 hinges-0.134
 belts-0.133
Token feel
Token ID1254
Feature activation-1.27e-01

Pos logit contributions
 blankets+0.017
 ghosts+0.016
 doors+0.016
 hinges+0.015
 footsteps+0.015
Neg logit contributions
oni-0.019
melon-0.018
od-0.015
mod-0.014
abi-0.014
Token better
Token ID1365
Feature activation-1.26e-01

Pos logit contributions
oni+0.233
melon+0.225
od+0.184
mod+0.176
abi+0.169
Neg logit contributions
 blankets-0.239
 ghosts-0.227
 doors-0.221
 hinges-0.212
 ages-0.211
Token.
Token ID13
Feature activation-3.29e-03

Pos logit contributions
oni+0.274
melon+0.268
od+0.222
mod+0.218
abi+0.213
Neg logit contributions
 blankets-0.188
 ghosts-0.179
 doors-0.174
 hinges-0.166
 belts-0.165
Token She
Token ID1375
Feature activation+1.13e-03

Pos logit contributions
oni+0.007
melon+0.006
od+0.005
mod+0.005
abi+0.005
Neg logit contributions
 blankets-0.005
 ghosts-0.005
 doors-0.005
 hinges-0.005
 ages-0.005
Token showed
Token ID3751
Feature activation-8.96e-02

Pos logit contributions
 blankets+0.002
 ghosts+0.002
 doors+0.001
 belts+0.001
 hinges+0.001
Neg logit contributions
melon-0.002
oni-0.002
od-0.002
mod-0.002
iche-0.002
Token lots
Token ID6041
Feature activation+5.26e-02

Pos logit contributions
oni+0.207
melon+0.197
od+0.172
mod+0.166
abi+0.163
Neg logit contributions
 blankets-0.122
 ghosts-0.113
 doors-0.112
 hinges-0.110
 belts-0.103
Token crack
Token ID8469
Feature activation-7.73e-03

Pos logit contributions
 blankets+0.183
 ghosts+0.170
 doors+0.165
 hinges+0.156
 belts+0.156
Neg logit contributions
oni-0.398
melon-0.390
od-0.334
mod-0.325
 pumpkin-0.320
Token.
Token ID13
Feature activation-4.51e-03

Pos logit contributions
oni+0.018
melon+0.018
mod+0.015
od+0.015
abi+0.015
Neg logit contributions
 blankets-0.010
 ghosts-0.010
 doors-0.009
 belts-0.009
 years-0.008
Token She
Token ID1375
Feature activation+4.27e-03

Pos logit contributions
oni+0.010
melon+0.010
od+0.008
mod+0.008
abi+0.008
Neg logit contributions
 blankets-0.007
 ghosts-0.006
 doors-0.006
 hinges-0.006
 belts-0.006
Token saw
Token ID2497
Feature activation+3.09e-03

Pos logit contributions
 err+0.008
 occasions+0.008
 blankets+0.008
 ages+0.008
 belts+0.008
Neg logit contributions
oni-0.007
melon-0.006
 tomato-0.005
 pumpkin-0.005
od-0.005
Token the
Token ID262
Feature activation-3.09e-02

Pos logit contributions
 blankets+0.004
 ghosts+0.004
 doors+0.004
 footsteps+0.004
 eyes+0.004
Neg logit contributions
oni-0.007
melon-0.007
od-0.006
mod-0.006
abi-0.006
Token pumpkin
Token ID30089
Feature activation-4.54e-01

Pos logit contributions
oni+0.049
melon+0.047
od+0.047
ackers+0.044
ipp+0.043
Neg logit contributions
 doors-0.060
 blankets-0.057
 door-0.056
 eyes-0.055
 fear-0.054
Token split
Token ID6626
Feature activation-2.13e-01

Pos logit contributions
oni+1.074
melon+1.038
mod+0.906
od+0.891
abi+0.873
Neg logit contributions
 blankets-0.613
 doors-0.577
 ghosts-0.559
 hinges-0.508
 eyes-0.507
Token in
Token ID287
Feature activation+1.01e-02

Pos logit contributions
oni+0.521
melon+0.519
mod+0.452
iche+0.441
od+0.440
Neg logit contributions
 blankets-0.259
 doors-0.247
 ghosts-0.232
 eyes-0.220
 hinges-0.210
Token half
Token ID2063
Feature activation-1.17e-01

Pos logit contributions
 blankets+0.017
 ghosts+0.016
 doors+0.015
 hinges+0.015
 belts+0.015
Neg logit contributions
oni-0.021
melon-0.020
od-0.017
mod-0.016
abi-0.016
Token.
Token ID13
Feature activation-5.34e-03

Pos logit contributions
melon+0.279
oni+0.278
mod+0.239
od+0.233
abi+0.226
Neg logit contributions
 blankets-0.153
 doors-0.141
 ghosts-0.137
 eyes-0.129
 years-0.126
Token She
Token ID1375
Feature activation+3.81e-03

Pos logit contributions
oni+0.013
melon+0.012
od+0.010
abi+0.010
mod+0.010
Neg logit contributions
 blankets-0.007
 doors-0.007
 ghosts-0.007
 footsteps-0.006
 eyes-0.006
Token back
Token ID736
Feature activation-4.82e-03

Pos logit contributions
oni+0.020
melon+0.019
od+0.017
mod+0.017
abi+0.016
Neg logit contributions
 blankets-0.011
 ghosts-0.010
 doors-0.010
 hinges-0.010
 ages-0.009
Token home
Token ID1363
Feature activation+1.08e-02

Pos logit contributions
oni+0.010
melon+0.010
od+0.008
 pumpkin+0.008
mod+0.008
Neg logit contributions
 blankets-0.008
 ghosts-0.007
 doors-0.007
 belts-0.007
 hinges-0.007
Token.
Token ID13
Feature activation-1.68e-05

Pos logit contributions
 blankets+0.016
 ghosts+0.015
 doors+0.015
 hinges+0.014
 belts+0.014
Neg logit contributions
oni-0.024
melon-0.023
od-0.020
mod-0.019
abi-0.019
Token The
Token ID383
Feature activation-8.49e-02

Pos logit contributions
oni+0.000
melon+0.000
od+0.000
mod+0.000
abi+0.000
Neg logit contributions
 ghosts-0.000
 blankets-0.000
 doors-0.000
 hinges-0.000
 belts-0.000
Token rad
Token ID2511
Feature activation-3.09e-01

Pos logit contributions
oni+0.138
od+0.122
melon+0.116
mod+0.105
iche+0.104
Neg logit contributions
 blankets-0.172
 doors-0.165
 fear-0.161
 eyes-0.158
 years-0.153
Tokenish
Token ID680
Feature activation-3.73e-01

Pos logit contributions
oni+0.394
melon+0.346
od+0.282
mod+0.266
iche+0.257
Neg logit contributions
 blankets-0.715
 doors-0.690
 eyes-0.671
 ghosts-0.663
 torch-0.648
Token answered
Token ID9373
Feature activation-3.00e-02

Pos logit contributions
oni+0.834
melon+0.808
od+0.682
mod+0.677
abi+0.654
Neg logit contributions
 blankets-0.542
 ghosts-0.503
 doors-0.499
 belts-0.469
 hinges-0.464
Token:
Token ID25
Feature activation-4.86e-02

Pos logit contributions
oni+0.070
melon+0.069
mod+0.061
iche+0.059
od+0.059
Neg logit contributions
 blankets-0.038
 doors-0.034
 ghosts-0.034
 eyes-0.034
 footsteps-0.032
Token "
Token ID366
Feature activation-3.26e-02

Pos logit contributions
oni+0.116
melon+0.112
mod+0.097
od+0.096
 pumpkin+0.092
Neg logit contributions
 blankets-0.058
 ghosts-0.053
 doors-0.051
 eyes-0.050
 ages-0.050
TokenLook
Token ID8567
Feature activation-4.38e-02

Pos logit contributions
oni+0.071
melon+0.069
od+0.057
mod+0.055
 pumpkin+0.054
Neg logit contributions
 blankets-0.049
 ghosts-0.047
 doors-0.045
 belts-0.044
 ages-0.043
Token there
Token ID612
Feature activation+5.45e-02

Pos logit contributions
oni+0.100
melon+0.098
mod+0.081
od+0.081
abi+0.078
Neg logit contributions
 blankets-0.061
 ghosts-0.058
 doors-0.056
 belts-0.053
 hinges-0.053
Token she
Token ID673
Feature activation+1.43e-03

Pos logit contributions
oni+0.078
melon+0.076
od+0.063
mod+0.062
abi+0.061
Neg logit contributions
 blankets-0.047
 ghosts-0.044
 doors-0.043
 ages-0.040
 hinges-0.039
Token found
Token ID1043
Feature activation-9.17e-02

Pos logit contributions
 blankets+0.002
 ghosts+0.002
 doors+0.002
 hinges+0.002
 footsteps+0.002
Neg logit contributions
oni-0.003
melon-0.003
od-0.002
mod-0.002
abi-0.002
Token the
Token ID262
Feature activation-2.55e-02

Pos logit contributions
melon+0.237
oni+0.234
mod+0.204
iche+0.199
od+0.198
Neg logit contributions
 blankets-0.107
 doors-0.090
 eyes-0.085
 ghosts-0.085
 ages-0.084
Token little
Token ID1310
Feature activation+5.71e-02

Pos logit contributions
oni+0.040
melon+0.035
od+0.034
mod+0.032
umble+0.030
Neg logit contributions
 doors-0.053
 blankets-0.051
 door-0.049
 eyes-0.049
 fear-0.047
Token dirty
Token ID11841
Feature activation-6.97e-02

Pos logit contributions
 doors+0.119
 blankets+0.118
 eyes+0.112
 door+0.107
 lights+0.107
Neg logit contributions
oni-0.098
melon-0.078
od-0.077
mod-0.069
iche-0.068
Token pumpkin
Token ID30089
Feature activation-4.44e-01

Pos logit contributions
oni+0.079
melon+0.062
mod+0.062
od+0.062
ackers+0.049
Neg logit contributions
 doors-0.173
 blankets-0.170
 eyes-0.168
 door-0.162
 lights-0.160
Token.
Token ID13
Feature activation-2.29e-03

Pos logit contributions
oni+1.092
melon+1.029
mod+0.976
od+0.910
iche+0.899
Neg logit contributions
 blankets-0.575
 doors-0.481
 ghosts-0.465
 years-0.459
 beds-0.436
Token<|endoftext|>
Token ID50256
Feature activation+1.05e-01

Pos logit contributions
oni+0.005
melon+0.005
od+0.004
mod+0.004
abi+0.004
Neg logit contributions
 blankets-0.003
 ghosts-0.003
 doors-0.003
 belts-0.003
 hinges-0.003
TokenOnce
Token ID7454
Feature activation+9.18e-03

Pos logit contributions
 blankets+0.154
 ghosts+0.146
 doors+0.142
 hinges+0.135
 belts+0.134
Neg logit contributions
oni-0.231
melon-0.226
od-0.188
mod-0.185
abi-0.180
Token upon
Token ID2402
Feature activation+1.14e-01

Pos logit contributions
 blankets+0.008
 ghosts+0.007
 doors+0.007
 Chloe+0.006
 ages+0.006
Neg logit contributions
oni-0.026
melon-0.025
mod-0.022
abi-0.022
od-0.022
Token a
Token ID257
Feature activation+3.08e-02

Pos logit contributions
 blankets+0.187
 doors+0.182
 ghosts+0.177
 hinges+0.176
 belts+0.174
Neg logit contributions
oni-0.222
melon-0.206
od-0.181
mod-0.177
abi-0.169
Token and
Token ID290
Feature activation-2.28e-03

Pos logit contributions
oni+0.152
melon+0.148
od+0.123
mod+0.119
abi+0.117
Neg logit contributions
 blankets-0.111
 ghosts-0.105
 doors-0.103
 hinges-0.099
 belts-0.098
Token proud
Token ID6613
Feature activation-1.12e-01

Pos logit contributions
oni+0.005
melon+0.005
od+0.004
mod+0.004
abi+0.004
Neg logit contributions
 blankets-0.004
 ghosts-0.003
 doors-0.003
 hinges-0.003
 belts-0.003
Token of
Token ID286
Feature activation+8.36e-02

Pos logit contributions
oni+0.222
melon+0.216
od+0.182
 pumpkin+0.169
 squash+0.166
Neg logit contributions
 blankets-0.185
 hinges-0.178
 doors-0.177
 ghosts-0.174
 belts-0.174
Token her
Token ID607
Feature activation+7.10e-03

Pos logit contributions
 blankets+0.127
 ghosts+0.120
 doors+0.117
 hinges+0.111
 belts+0.110
Neg logit contributions
oni-0.183
melon-0.179
od-0.148
mod-0.146
abi-0.142
Token new
Token ID649
Feature activation-1.90e-02

Pos logit contributions
 blankets+0.010
 ghosts+0.009
 doors+0.009
 hinges+0.008
 belts+0.008
Neg logit contributions
oni-0.017
melon-0.016
od-0.014
mod-0.013
abi-0.013
Token plant
Token ID4618
Feature activation-3.80e-01

Pos logit contributions
oni+0.033
melon+0.030
mod+0.024
od+0.024
ackers+0.023
Neg logit contributions
 blankets-0.039
 eyes-0.036
 doors-0.036
 hinges-0.035
 ghosts-0.035
Token.
Token ID13
Feature activation-7.25e-03

Pos logit contributions
oni+0.945
melon+0.922
mod+0.846
abi+0.792
iche+0.781
Neg logit contributions
 blankets-0.467
 ghosts-0.408
 doors-0.377
 years-0.374
 eyes-0.367
Token<|endoftext|>
Token ID50256
Feature activation+1.06e-01

Pos logit contributions
oni+0.016
melon+0.016
od+0.013
mod+0.013
abi+0.013
Neg logit contributions
 blankets-0.011
 ghosts-0.010
 doors-0.010
 hinges-0.009
 belts-0.009
TokenOnce
Token ID7454
Feature activation+1.11e-02

Pos logit contributions
 blankets+0.148
 ghosts+0.137
 doors+0.134
 ages+0.127
 hinges+0.124
Neg logit contributions
oni-0.241
melon-0.238
od-0.198
mod-0.196
 pumpkin-0.192
Token upon
Token ID2402
Feature activation+1.15e-01

Pos logit contributions
 blankets+0.010
 ghosts+0.009
 doors+0.009
 beds+0.008
 ages+0.008
Neg logit contributions
oni-0.030
melon-0.030
mod-0.026
od-0.026
abi-0.025
Token a
Token ID257
Feature activation+2.73e-02

Pos logit contributions
 blankets+0.167
 ghosts+0.159
 doors+0.157
 hinges+0.150
 belts+0.149
Neg logit contributions
oni-0.252
melon-0.244
od-0.206
mod-0.203
abi-0.197
Token to
Token ID284
Feature activation-1.46e-02

Pos logit contributions
 blankets+0.031
 doors+0.029
 ghosts+0.029
 belts+0.027
 hinges+0.027
Neg logit contributions
oni-0.055
melon-0.055
od-0.046
mod-0.045
abi-0.044
Token make
Token ID787
Feature activation-1.11e-01

Pos logit contributions
oni+0.034
melon+0.033
mod+0.027
od+0.027
abi+0.026
Neg logit contributions
 blankets-0.021
 ghosts-0.019
 doors-0.019
 ages-0.018
 hinges-0.018
Token the
Token ID262
Feature activation-3.77e-02

Pos logit contributions
melon+0.303
oni+0.289
mod+0.245
abi+0.238
iche+0.237
Neg logit contributions
 blankets-0.127
 doors-0.118
 ghosts-0.117
 footsteps-0.110
 eyes-0.107
Token biggest
Token ID4094
Feature activation-2.30e-01

Pos logit contributions
oni+0.087
melon+0.081
od+0.073
mod+0.073
abi+0.070
Neg logit contributions
 blankets-0.054
 doors-0.053
 eyes-0.047
 lights-0.046
 ghosts-0.046
Token sand
Token ID6450
Feature activation-7.00e-02

Pos logit contributions
oni+0.466
melon+0.453
od+0.370
mod+0.366
abi+0.356
Neg logit contributions
 blankets-0.383
 ghosts-0.364
 doors-0.358
 hinges-0.340
 belts-0.338
Tokencastle
Token ID18676
Feature activation-3.77e-01

Pos logit contributions
oni+0.127
melon+0.121
mod+0.103
abi+0.099
od+0.095
Neg logit contributions
 blankets-0.130
 doors-0.125
 ghosts-0.123
 belts-0.117
 hinges-0.115
Token ever
Token ID1683
Feature activation+2.79e-02

Pos logit contributions
oni+0.910
melon+0.887
abi+0.748
mod+0.748
od+0.743
Neg logit contributions
 blankets-0.470
 ghosts-0.450
 doors-0.432
 belts-0.408
 ages-0.407
Token.
Token ID13
Feature activation+3.81e-04

Pos logit contributions
 blankets+0.030
 ghosts+0.029
 doors+0.028
 years+0.027
 ages+0.027
Neg logit contributions
oni-0.072
melon-0.072
mod-0.061
abi-0.060
 pumpkin-0.059
Token
Token ID198
Feature activation-1.09e-02

Pos logit contributions
 doors+0.001
 ghosts+0.001
 blankets+0.001
 belts+0.001
 brakes+0.001
Neg logit contributions
oni-0.001
melon-0.001
od-0.000
mod-0.000
 pumpkin-0.000
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
oni+0.026
melon+0.025
od+0.021
mod+0.021
abi+0.020
Neg logit contributions
 blankets-0.014
 ghosts-0.014
 doors-0.014
 hinges-0.013
 belts-0.013
TokenAfter
Token ID3260
Feature activation+9.97e-02

Pos logit contributions
oni+0.024
melon+0.023
od+0.020
mod+0.019
abi+0.019
Neg logit contributions
 blankets-0.016
 ghosts-0.016
 doors-0.016
 eyes-0.015
 hinges-0.015