TOP ACTIVATIONS
MAX = 3.656

would sit by the books he lf and read together every
should not climb the books he lf . It is dangerous
was a beautiful pink c asser ole . The
We can climb the books he lf and see what is
it back on the books he lf so we can find
can make v room - v room sounds and go fast
be careful . The books he lf is tall and we
They go to the books he lf and start to climb
Suddenly , P uss noticed a bigger bug .
accidentally knocked over the books he lf .
" Time to bat he , L ila and Tom
his hand to pull the bud . " No
it go v room - v room . The truck loved
see what was inside the bud . " Let
can 't reach the books he lf . Can you reach
Now we have a books he lf !" <|endoftext|> M ummy
They ran to the books he lf and put the box
puppy jumped on the books he lf and knocked all the
He goes to the books he lf and looks for it
, I finished the books he lf !" Andy said with

MIN ACTIVATIONS
MIN = -3.776

When Daisy put the magn ifying glass down , the cater
flashlight , and a magn ifying glass . As
a family who really enjoyed pic n ics . They loved
3 year old boy as Super Robbie . <|endoftext|> Once upon
girl named Lily who loved pic n ics . She would
so excited ! She loved pic n ics and spending time
're a great pilot and Super Fly looks so happy with
their salad and have happy pic n ics together . <|endoftext|>
the warm sunshine and having pic n ics with her family
, next to her magn ifying glass . She looked at
in the garden and have pic n ics together .
near the cliff and have pic n ics . One day
and nodded . He loved pic n ics and couldn 't
. <|endoftext|> M ama and pap a were busy in the
the little frog and the Professor became good friends , and
see the bird and have pic n ics . <|endoftext|> Once
the watch with the magn ifying glass . He saw the
throughout the forest . <|endoftext|> Old Sam lived all alone in
She takes out a magn ifying glass . She looks at
The sun was setting and Farmer Joe was getting ready for

INTERVAL 1.798 - 3.656
CONTAINS 0.167%

Sue . Sue loved to help her mom in the kitchen
mom quickly cleaned up the mess . Soon , Lisa forgot
. It was a chip m unk ! The chip m
Hi Jay den ! Ready for school today ?"
saw a gray smoke coming from the oven . She was

INTERVAL -0.060 - 1.798
CONTAINS 55.466%

down and look out the window . <|endoftext|> Once upon a
day , they saw something new in the park . It
told them what Tom did . He said Tom was a
Lil ah could run very fast and race against Daddy .
She looked in the kitchen , the living room and even

INTERVAL -1.918 - -0.060
CONTAINS 44.132%

, " Let me show you how fun this game can
to be patient ." So Jane waited patiently while
. Here , have a carrot . It 's delicious and
had a very long feather on his head . All the
red cover and a picture of a fox and a crow

INTERVAL -3.776 - -1.918
CONTAINS 0.235%

I 'm so glad you like the candle , Tim my
hello . ” Hi ,
time there was a girl named Kay la . She was
there was a little girl named Lily . She loved to
was the wind , and Grand ma smiled again .
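
NOTE ON THE INTERVALS
The dump does not say how the interval boundaries were chosen. The figures above are consistent with four equal-width bins spanning MIN to MAX, and the four CONTAINS percentages sum to 100%. The sketch below checks that arithmetic. It is only an illustration under that equal-width assumption: the helper name interval_edges and the variable names are hypothetical and are not taken from the tool that produced this output.

```python
# Sketch: reproduce the interval boundaries, ASSUMING four equal-width bins
# between the reported MIN and MAX activations (the dump does not confirm this).

def interval_edges(lo, hi, bins=4):
    """Return bins + 1 equally spaced edges from lo to hi (hypothetical helper)."""
    width = (hi - lo) / bins
    return [lo + i * width for i in range(bins + 1)]

MIN_ACT = -3.776  # value reported under MIN ACTIVATIONS
MAX_ACT = 3.656   # value reported under TOP ACTIVATIONS

edges = interval_edges(MIN_ACT, MAX_ACT)

# Pair consecutive edges into (low, high) intervals, rounded to the 3 decimals shown.
intervals = [(round(a, 3), round(b, 3)) for a, b in zip(edges, edges[1:])]

# CONTAINS percentages as listed, ordered from the lowest interval to the highest.
shares = [0.235, 44.132, 55.466, 0.167]
total = round(sum(shares), 3)

for (low, high), share in zip(intervals, shares):
    print(f"{low:+.3f} to {high:+.3f}: {share:.3f}%")
print(f"total: {total:.3f}%")
```

Each bin is (3.656 - (-3.776)) / 4 = 1.858 wide. About 99.6% of activations fall in the two middle bins, so the TOP and MIN examples listed earlier come from tails that each hold well under 1% of the data.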
Token would
Token ID561
Feature activation-9.37e-01

Pos logit contributions
 villains+0.016
 hall+0.014
 maze+0.014
 monsters+0.013
 robbers+0.013
Neg logit contributions
oured-0.009
cream-0.009
 sprink-0.009
 placed-0.008
 dip-0.008
Token sit
Token ID1650
Feature activation+6.45e-01

Pos logit contributions
oured+0.034
cream+0.034
 sprink+0.031
 dip+0.031
 bait+0.030
Neg logit contributions
 villains-0.043
 hall-0.038
 robbers-0.035
 monsters-0.035
py-0.035
Token by
Token ID416
Feature activation+7.22e-01

Pos logit contributions
 villains+0.032
 hall+0.027
py+0.026
 robbers+0.026
 villain+0.025
Neg logit contributions
oured-0.023
cream-0.023
 sprink-0.020
 dip-0.019
 bait-0.019
Token the
Token ID262
Feature activation+4.83e-01

Pos logit contributions
 villains+0.031
 hall+0.029
 dragons+0.027
 monsters+0.026
py+0.026
Neg logit contributions
cream-0.029
oured-0.029
 dip-0.026
 bait-0.025
int-0.025
Token books
Token ID3835
Feature activation+3.50e-01

Pos logit contributions
 villains+0.018
py+0.015
oon+0.014
 villain+0.013
dates+0.013
Neg logit contributions
 sprink-0.021
oured-0.021
cream-0.020
 dip-0.020
 vanilla-0.019
Tokenhe
Token ID258
Feature activation+3.66e+00

Pos logit contributions
 villains+0.015
 robbers+0.012
py+0.011
 monsters+0.011
 skeletons+0.011
Neg logit contributions
oured-0.015
cream-0.015
 dip-0.014
 sprink-0.014
 placed-0.013
Tokenlf
Token ID1652
Feature activation+1.20e+00

Pos logit contributions
 villains+0.119
umpy+0.101
 robbers+0.100
Raw+0.097
py+0.092
Neg logit contributions
 as-0.159
cream-0.149
 sprink-0.148
asted-0.145
 then-0.144
Token and
Token ID290
Feature activation-2.27e-01

Pos logit contributions
 villains+0.061
 robbers+0.052
 hall+0.051
 villain+0.050
py+0.050
Neg logit contributions
oured-0.039
cream-0.038
 sprink-0.033
 dip-0.033
 placed-0.033
Token read
Token ID1100
Feature activation-5.82e-02

Pos logit contributions
oured+0.009
cream+0.009
 sprink+0.008
 dip+0.008
 bait+0.008
Neg logit contributions
 villains-0.010
 hall-0.009
 dragons-0.008
 robbers-0.008
 monsters-0.008
Token together
Token ID1978
Feature activation-1.06e-01

Pos logit contributions
oured+0.002
cream+0.002
 dip+0.002
 bait+0.002
int+0.002
Neg logit contributions
 villains-0.002
 hall-0.002
 dragons-0.002
 monsters-0.002
 lawyers-0.002
Token every
Token ID790
Feature activation-5.78e-01

Pos logit contributions
cream+0.004
oured+0.004
 sprink+0.004
 dip+0.003
 bait+0.003
Neg logit contributions
 villains-0.005
 hall-0.004
 robbers-0.004
py-0.004
 monsters-0.004
Token should
Token ID815
Feature activation-3.10e-02

Pos logit contributions
 villains+0.015
 hall+0.013
 monsters+0.012
 dragons+0.012
 robbers+0.012
Neg logit contributions
oured-0.017
cream-0.016
 sprink-0.015
 dip-0.015
 bait-0.015
Token not
Token ID407
Feature activation-6.97e-01

Pos logit contributions
cream+0.001
 dip+0.001
oured+0.001
 sprink+0.001
 placed+0.001
Neg logit contributions
 villains-0.001
 hall-0.001
 skeletons-0.001
 dragons-0.001
 robbers-0.001
Token climb
Token ID12080
Feature activation-2.78e-01

Pos logit contributions
oured+0.030
cream+0.029
 sprink+0.026
 bait+0.026
 dip+0.025
Neg logit contributions
 villains-0.029
 hall-0.025
 robbers-0.023
 dragons-0.023
 monsters-0.023
Token the
Token ID262
Feature activation+2.58e-01

Pos logit contributions
cream+0.010
oured+0.009
 sprink+0.009
 dip+0.008
 bait+0.008
Neg logit contributions
 villains-0.014
 hall-0.012
py-0.011
 robbers-0.011
 villain-0.011
Token books
Token ID3835
Feature activation+3.87e-01

Pos logit contributions
 villains+0.010
py+0.009
oon+0.008
dates+0.008
 robbers+0.008
Neg logit contributions
 sprink-0.011
cream-0.010
 dip-0.010
 vanilla-0.010
oured-0.009
Tokenhe
Token ID258
Feature activation+3.62e+00

Pos logit contributions
 villains+0.019
py+0.016
 robbers+0.016
 skeletons+0.015
 trolls+0.015
Neg logit contributions
cream-0.013
oured-0.013
 as-0.012
 dip-0.012
 sprink-0.012
Tokenlf
Token ID1652
Feature activation+1.23e+00

Pos logit contributions
 villains+0.135
 robbers+0.108
Raw+0.105
py+0.105
 villain+0.103
Neg logit contributions
cream-0.144
 as-0.142
 sprink-0.141
 placed-0.127
 that-0.123
Token.
Token ID13
Feature activation+3.44e-01

Pos logit contributions
 villains+0.061
 robbers+0.052
py+0.051
Raw+0.051
 hall+0.049
Neg logit contributions
cream-0.040
oured-0.038
 as-0.036
 sprink-0.034
 placed-0.033
Token It
Token ID632
Feature activation+4.19e-01

Pos logit contributions
 villains+0.014
 hall+0.012
 dragons+0.012
 maze+0.012
 robbers+0.012
Neg logit contributions
cream-0.014
oured-0.013
 sprink-0.013
 vanilla-0.012
 dip-0.012
Token is
Token ID318
Feature activation+3.82e-01

Pos logit contributions
 villains+0.019
 hall+0.016
 robbers+0.016
 dragons+0.015
 maze+0.015
Neg logit contributions
cream-0.016
oured-0.015
 sprink-0.014
 dip-0.014
 gul-0.013
Token dangerous
Token ID4923
Feature activation+3.35e-01

Pos logit contributions
 villains+0.016
 hall+0.014
 dragons+0.013
 robbers+0.013
 monsters+0.013
Neg logit contributions
oured-0.015
cream-0.015
 sprink-0.014
 dip-0.014
 bait-0.013
Token was
Token ID373
Feature activation+9.19e-02

Pos logit contributions
oured+0.020
cream+0.019
 bait+0.018
inated+0.017
 dip+0.017
Neg logit contributions
 hall-0.015
 villains-0.015
 dragons-0.014
 monsters-0.013
 monster-0.013
Token a
Token ID257
Feature activation-2.58e-01

Pos logit contributions
 dragons+0.003
 monsters+0.003
 hall+0.003
 villains+0.003
 chamber+0.003
Neg logit contributions
oured-0.004
cream-0.004
int-0.004
inated-0.004
acket-0.004
Token beautiful
Token ID4950
Feature activation+3.01e-01

Pos logit contributions
cream+0.012
oured+0.011
ck+0.010
shaped+0.010
inated+0.010
Neg logit contributions
 villains-0.010
 hall-0.010
 monster-0.009
 chamber-0.009
 maze-0.009
Token pink
Token ID11398
Feature activation+2.92e-01

Pos logit contributions
 hall+0.010
 monster+0.009
 chamber+0.009
 villains+0.009
 cabinet+0.009
Neg logit contributions
cream-0.014
oured-0.014
ck-0.013
shaped-0.013
 bait-0.012
Token c
Token ID269
Feature activation+1.59e+00

Pos logit contributions
 hall+0.010
 villains+0.009
 monster+0.009
 cabinet+0.009
 chamber+0.009
Neg logit contributions
oured-0.013
cream-0.013
ck-0.012
shaped-0.012
 bait-0.011
Tokenasser
Token ID24929
Feature activation+3.60e+00

Pos logit contributions
 villains+0.067
 hall+0.059
 dragons+0.054
 robbers+0.054
py+0.054
Neg logit contributions
oured-0.064
cream-0.064
 sprink-0.058
 dip-0.057
 bait-0.056
Tokenole
Token ID2305
Feature activation+8.80e-01

Pos logit contributions
 villains+0.154
 hall+0.137
 dragons+0.127
py+0.125
 monsters+0.125
Neg logit contributions
oured-0.142
cream-0.141
 sprink-0.127
 dip-0.127
 bait-0.123
Token.
Token ID13
Feature activation-5.28e-01

Pos logit contributions
 villains+0.035
 hall+0.032
 dragons+0.029
 robbers+0.029
 monsters+0.029
Neg logit contributions
oured-0.037
cream-0.036
 dip-0.033
 sprink-0.032
 bait-0.032
Token
Token ID198
Feature activation-7.63e-01

Pos logit contributions
oured+0.024
cream+0.023
 bait+0.022
ck+0.022
 dip+0.022
Neg logit contributions
 villains-0.018
 hall-0.017
 lawyers-0.015
 robbers-0.015
 cabinet-0.015
Token
Token ID198
Feature activation-7.37e-01

Pos logit contributions
cream+0.031
oured+0.031
 sprink+0.029
 dip+0.028
 bait+0.028
Neg logit contributions
 villains-0.032
 hall-0.027
 monsters-0.025
 dragons-0.025
 robbers-0.025
TokenThe
Token ID464
Feature activation+5.64e-02

Pos logit contributions
oured+0.029
cream+0.029
 sprink+0.026
 dip+0.026
 bait+0.025
Neg logit contributions
 villains-0.032
 hall-0.028
 dragons-0.026
 robbers-0.026
py-0.026
Token We
Token ID775
Feature activation+5.54e-02

Pos logit contributions
 villains+0.028
 hall+0.024
 robbers+0.022
 dragons+0.022
 monsters+0.022
Neg logit contributions
oured-0.031
cream-0.030
 sprink-0.028
 dip-0.027
 bait-0.027
Token can
Token ID460
Feature activation-4.07e-01

Pos logit contributions
 villains+0.002
 hall+0.002
py+0.002
 dragons+0.002
 robbers+0.002
Neg logit contributions
oured-0.002
cream-0.002
 dip-0.002
 sprink-0.002
 bait-0.002
Token climb
Token ID12080
Feature activation-2.19e-01

Pos logit contributions
oured+0.014
cream+0.014
 dip+0.013
 sprink+0.013
 bait+0.013
Neg logit contributions
 villains-0.019
 hall-0.017
 robbers-0.016
py-0.015
 dragons-0.015
Token the
Token ID262
Feature activation+3.46e-01

Pos logit contributions
oured+0.008
cream+0.008
 sprink+0.007
 dip+0.007
 bait+0.007
Neg logit contributions
 villains-0.010
 hall-0.009
 dragons-0.008
 robbers-0.008
 monsters-0.008
Token books
Token ID3835
Feature activation+1.82e-01

Pos logit contributions
 villains+0.014
py+0.012
oon+0.012
 sisters+0.011
dates+0.011
Neg logit contributions
 sprink-0.014
cream-0.013
 dip-0.013
oured-0.012
 vanilla-0.012
Tokenhe
Token ID258
Feature activation+3.59e+00

Pos logit contributions
 villains+0.009
py+0.007
 robbers+0.007
 skeletons+0.007
 monsters+0.007
Neg logit contributions
oured-0.006
cream-0.006
 sprink-0.006
 dip-0.006
 as-0.006
Tokenlf
Token ID1652
Feature activation+1.12e+00

Pos logit contributions
 villains+0.114
 robbers+0.091
 villain+0.083
umpy+0.081
Raw+0.079
Neg logit contributions
cream-0.162
 sprink-0.158
 as-0.156
 dip-0.146
 placed-0.146
Token and
Token ID290
Feature activation-4.84e-01

Pos logit contributions
 villains+0.052
 robbers+0.044
py+0.043
 hall+0.043
 villain+0.042
Neg logit contributions
cream-0.039
oured-0.039
 sprink-0.036
 dip-0.035
 as-0.034
Token see
Token ID766
Feature activation+4.04e-02

Pos logit contributions
oured+0.019
cream+0.018
 sprink+0.016
 dip+0.016
 bait+0.016
Neg logit contributions
 villains-0.022
 hall-0.020
 dragons-0.018
 robbers-0.018
 monsters-0.018
Token what
Token ID644
Feature activation-8.42e-01

Pos logit contributions
 villains+0.002
 hall+0.002
 dragons+0.001
 monsters+0.001
 robbers+0.001
Neg logit contributions
oured-0.002
cream-0.002
 dip-0.001
 sprink-0.001
 bait-0.001
Token is
Token ID318
Feature activation-7.02e-01

Pos logit contributions
cream+0.028
 sprink+0.026
oured+0.026
 dip+0.026
 bait+0.025
Neg logit contributions
 villains-0.040
 hall-0.035
py-0.035
 robbers-0.034
 villain-0.033
Token it
Token ID340
Feature activation-2.91e-01

Pos logit contributions
 villains+0.013
 hall+0.012
 dragons+0.012
 monsters+0.011
 lawyers+0.011
Neg logit contributions
oured-0.013
cream-0.013
 dip-0.012
 mun-0.012
inated-0.011
Token back
Token ID736
Feature activation+7.33e-01

Pos logit contributions
oured+0.012
cream+0.012
 sprink+0.011
 dip+0.011
 bait+0.010
Neg logit contributions
 villains-0.012
 hall-0.010
 robbers-0.010
 dragons-0.010
py-0.010
Token on
Token ID319
Feature activation-4.00e-01

Pos logit contributions
 villains+0.028
 hall+0.026
 dragons+0.024
 monsters+0.023
 robbers+0.023
Neg logit contributions
oured-0.031
cream-0.030
 dip-0.028
 bait-0.028
 sprink-0.028
Token the
Token ID262
Feature activation+2.63e-01

Pos logit contributions
oured+0.016
cream+0.016
 dip+0.014
 sprink+0.014
 bait+0.014
Neg logit contributions
 villains-0.017
 hall-0.016
 dragons-0.015
 monsters-0.014
 robbers-0.014
Token books
Token ID3835
Feature activation+3.12e-01

Pos logit contributions
 villains+0.010
py+0.008
oon+0.008
 robbers+0.008
 villain+0.008
Neg logit contributions
 sprink-0.011
cream-0.011
oured-0.011
 dip-0.010
 vanilla-0.010
Tokenhe
Token ID258
Feature activation+3.54e+00

Pos logit contributions
 villains+0.013
 robbers+0.010
py+0.010
 hall+0.010
 monsters+0.010
Neg logit contributions
oured-0.013
cream-0.013
 sprink-0.012
 dip-0.012
 placed-0.012
Tokenlf
Token ID1652
Feature activation+1.29e+00

Pos logit contributions
 villains+0.143
umpy+0.121
 robbers+0.120
Raw+0.118
 villain+0.113
Neg logit contributions
cream-0.131
 sprink-0.127
 as-0.125
 placed-0.116
iously-0.109
Token so
Token ID523
Feature activation-1.88e-01

Pos logit contributions
 villains+0.059
 hall+0.050
 robbers+0.048
py+0.048
 villain+0.047
Neg logit contributions
oured-0.048
cream-0.048
 sprink-0.043
 dip-0.042
 placed-0.041
Token we
Token ID356
Feature activation-3.23e-01

Pos logit contributions
oured+0.008
cream+0.008
int+0.007
 gul+0.007
 sprink+0.007
Neg logit contributions
 villains-0.008
 dragons-0.007
 hall-0.007
 monsters-0.007
 villain-0.006
Token can
Token ID460
Feature activation-6.98e-01

Pos logit contributions
oured+0.016
cream+0.015
 sprink+0.014
 dip+0.014
 bait+0.014
Neg logit contributions
 villains-0.011
 hall-0.009
 dragons-0.009
 robbers-0.008
 monsters-0.008
Token find
Token ID1064
Feature activation+5.91e-01

Pos logit contributions
oured+0.024
cream+0.024
 dip+0.023
 sprink+0.022
 placed+0.021
Neg logit contributions
 villains-0.034
 hall-0.029
 robbers-0.028
py-0.028
 monsters-0.027
Token can
Token ID460
Feature activation-2.59e-01

Pos logit contributions
 villains+0.010
 hall+0.009
 dragons+0.008
 monsters+0.008
py+0.008
Neg logit contributions
oured-0.015
cream-0.015
 bait-0.014
 sprink-0.014
 dip-0.013
Token make
Token ID787
Feature activation+1.07e+00

Pos logit contributions
oured+0.012
cream+0.011
 bait+0.010
int+0.009
 sprink+0.009
Neg logit contributions
 villains-0.011
 hall-0.009
 dragons-0.009
 monsters-0.009
 villain-0.009
Token v
Token ID410
Feature activation+1.02e+00

Pos logit contributions
 villains+0.039
 hall+0.033
 dragons+0.033
 monsters+0.032
py+0.030
Neg logit contributions
oured-0.052
cream-0.050
 bait-0.045
 dip-0.045
 sprink-0.044
Tokenroom
Token ID3823
Feature activation-8.60e-02

Pos logit contributions
 villains+0.034
 hall+0.028
 robbers+0.027
 monsters+0.026
 dragons+0.025
Neg logit contributions
cream-0.054
oured-0.052
 sprink-0.048
shaped-0.046
int-0.045
Token-
Token ID12
Feature activation+5.03e-01

Pos logit contributions
oured+0.004
cream+0.004
 sprink+0.003
 dip+0.003
 bait+0.003
Neg logit contributions
 villains-0.003
 hall-0.003
 dragons-0.003
 robbers-0.003
 monsters-0.003
Tokenv
Token ID85
Feature activation+3.47e+00

Pos logit contributions
 villains+0.020
 hall+0.017
 dragons+0.015
py+0.015
 lawyers+0.015
Neg logit contributions
oured-0.022
cream-0.020
 bait-0.020
 dip-0.020
 placed-0.019
Tokenroom
Token ID3823
Feature activation+4.73e-02

Pos logit contributions
 villains+0.132
 robbers+0.112
 hall+0.107
 scientist+0.100
 villain+0.099
Neg logit contributions
cream-0.159
oured-0.148
 sprink-0.142
asted-0.133
ck-0.133
Token sounds
Token ID5238
Feature activation+1.07e+00

Pos logit contributions
 villains+0.002
 hall+0.002
 robbers+0.001
 dragons+0.001
py+0.001
Neg logit contributions
oured-0.002
cream-0.002
 sprink-0.002
 dip-0.002
 bait-0.002
Token and
Token ID290
Feature activation-8.13e-02

Pos logit contributions
 villains+0.041
 hall+0.035
 dragons+0.034
 monsters+0.033
 robbers+0.032
Neg logit contributions
oured-0.049
cream-0.048
 dip-0.042
 sprink-0.042
 bait-0.042
Token go
Token ID467
Feature activation+1.43e-01

Pos logit contributions
cream+0.003
oured+0.003
 sprink+0.003
 dip+0.003
 bait+0.003
Neg logit contributions
 villains-0.003
 hall-0.003
py-0.003
 robbers-0.003
 dragons-0.003
Token fast
Token ID3049
Feature activation+4.24e-01

Pos logit contributions
 villains+0.005
 hall+0.005
 dragons+0.004
 monsters+0.004
 robbers+0.004
Neg logit contributions
oured-0.007
cream-0.007
 bait-0.006
 sprink-0.006
int-0.006
Token be
Token ID307
Feature activation+3.42e-01

Pos logit contributions
 villains+0.007
 hall+0.006
 dragons+0.006
 robbers+0.006
 monsters+0.006
Neg logit contributions
oured-0.006
cream-0.006
 sprink-0.005
 dip-0.005
 bait-0.005
Token careful
Token ID8161
Feature activation+1.29e+00

Pos logit contributions
 villains+0.015
 hall+0.013
py+0.012
 villain+0.012
robat+0.012
Neg logit contributions
oured-0.013
cream-0.012
 bait-0.012
 sprink-0.012
 placed-0.012
Token.
Token ID13
Feature activation+7.82e-01

Pos logit contributions
 villains+0.052
 hall+0.045
 dragons+0.041
 robbers+0.041
py+0.041
Neg logit contributions
oured-0.055
cream-0.054
 sprink-0.050
 dip-0.049
 bait-0.048
Token The
Token ID383
Feature activation+4.42e-01

Pos logit contributions
 villains+0.032
 hall+0.028
 robbers+0.026
 dragons+0.025
 monsters+0.025
Neg logit contributions
oured-0.033
cream-0.032
 sprink-0.030
 dip-0.030
 bait-0.029
Token books
Token ID3835
Feature activation+4.39e-01

Pos logit contributions
 villains+0.018
py+0.017
oon+0.016
dates+0.015
umpy+0.015
Neg logit contributions
 sprink-0.017
 vanilla-0.014
cream-0.014
 dip-0.013
 frost-0.012
Tokenhe
Token ID258
Feature activation+3.46e+00

Pos logit contributions
 villains+0.017
 hall+0.014
 robbers+0.013
 dragons+0.013
py+0.013
Neg logit contributions
oured-0.019
cream-0.019
 sprink-0.018
 dip-0.018
 bait-0.017
Tokenlf
Token ID1652
Feature activation+1.64e+00

Pos logit contributions
 villains+0.098
 robbers+0.075
py+0.069
 villain+0.069
umpy+0.069
Neg logit contributions
cream-0.173
 sprink-0.164
oured-0.155
 dip-0.154
 bait-0.154
Token is
Token ID318
Feature activation+3.34e-01

Pos logit contributions
 villains+0.054
 hall+0.046
 dragons+0.043
 monsters+0.042
 maze+0.040
Neg logit contributions
oured-0.084
cream-0.081
 sprink-0.074
 dip-0.074
 bait-0.074
Token tall
Token ID7331
Feature activation+6.99e-01

Pos logit contributions
 villains+0.014
 hall+0.012
 dragons+0.011
 robbers+0.011
 monsters+0.011
Neg logit contributions
oured-0.014
cream-0.014
 sprink-0.013
 dip-0.013
 bait-0.012
Token and
Token ID290
Feature activation+3.47e-01

Pos logit contributions
 villains+0.034
 hall+0.029
py+0.029
 robbers+0.028
 villain+0.028
Neg logit contributions
cream-0.023
oured-0.022
 sprink-0.022
 bait-0.020
 dip-0.020
Token we
Token ID356
Feature activation+7.08e-01

Pos logit contributions
 villains+0.014
 hall+0.012
 dragons+0.011
 robbers+0.011
 monsters+0.011
Neg logit contributions
oured-0.015
cream-0.015
 sprink-0.013
 dip-0.013
 bait-0.013
TokenThey
Token ID2990
Feature activation-3.65e-01

Pos logit contributions
 villains+0.008
 hall+0.007
py+0.006
 dragons+0.006
 robbers+0.006
Neg logit contributions
cream-0.015
oured-0.015
 sprink-0.014
 dip-0.014
 bait-0.014
Token go
Token ID467
Feature activation-3.97e-01

Pos logit contributions
cream+0.014
oured+0.013
 sprink+0.012
 bait+0.012
 vanilla+0.011
Neg logit contributions
 villains-0.017
 hall-0.015
 dragons-0.014
 robbers-0.014
 maze-0.014
Token to
Token ID284
Feature activation+2.83e-01

Pos logit contributions
oured+0.012
cream+0.011
 dip+0.010
 as+0.009
 placed+0.009
Neg logit contributions
 villains-0.023
py-0.020
 villain-0.019
 dragons-0.019
 narrative-0.019
Token the
Token ID262
Feature activation+5.10e-01

Pos logit contributions
 villains+0.012
 hall+0.011
 robbers+0.010
 dragons+0.010
 monsters+0.010
Neg logit contributions
cream-0.011
oured-0.011
 sprink-0.010
 bait-0.009
 mun-0.009
Token books
Token ID3835
Feature activation+1.12e-01

Pos logit contributions
 villains+0.020
py+0.020
oon+0.019
umpy+0.017
dates+0.016
Neg logit contributions
 sprink-0.020
cream-0.019
 vanilla-0.018
oured-0.018
 dip-0.018
Tokenhe
Token ID258
Feature activation+3.44e+00

Pos logit contributions
 villains+0.005
py+0.004
 robbers+0.004
 monsters+0.004
 skeletons+0.004
Neg logit contributions
oured-0.004
cream-0.004
 dip-0.004
 sprink-0.004
 placed-0.004
Tokenlf
Token ID1652
Feature activation+1.07e+00

Pos logit contributions
 villains+0.111
 robbers+0.094
Raw+0.092
py+0.092
umpy+0.089
Neg logit contributions
 as-0.151
 sprink-0.144
cream-0.141
 placed-0.138
 dip-0.135
Token and
Token ID290
Feature activation+1.71e-01

Pos logit contributions
 villains+0.050
 hall+0.042
py+0.042
 robbers+0.042
 villain+0.041
Neg logit contributions
oured-0.039
cream-0.038
 dip-0.035
 sprink-0.034
 placed-0.034
Token start
Token ID923
Feature activation-4.31e-01

Pos logit contributions
 villains+0.007
 hall+0.007
 robbers+0.006
 dragons+0.006
 maze+0.006
Neg logit contributions
oured-0.007
cream-0.007
 sprink-0.006
shaped-0.006
 bait-0.006
Token to
Token ID284
Feature activation-3.23e-01

Pos logit contributions
oured+0.018
cream+0.017
 dip+0.015
 sprink+0.015
 bait+0.015
Neg logit contributions
 villains-0.018
 hall-0.016
 dragons-0.015
 robbers-0.015
 monsters-0.015
Token climb
Token ID12080
Feature activation-6.84e-01

Pos logit contributions
oured+0.011
cream+0.011
 dip+0.010
 sprink+0.010
 placed+0.009
Neg logit contributions
 villains-0.016
 hall-0.014
py-0.013
 robbers-0.013
 dragons-0.013
Token
Token ID198
Feature activation-1.43e-01

Pos logit contributions
 villains+0.045
 hall+0.039
 robbers+0.037
 monsters+0.036
 villain+0.036
Neg logit contributions
cream-0.038
oured-0.037
 sprink-0.034
 dip-0.033
 bait-0.033
Token
Token ID198
Feature activation-1.31e-01

Pos logit contributions
oured+0.005
cream+0.005
 dip+0.005
 sprink+0.005
 bait+0.005
Neg logit contributions
 villains-0.006
 hall-0.005
 monsters-0.005
 dragons-0.005
 robbers-0.005
TokenSuddenly
Token ID38582
Feature activation-7.64e-01

Pos logit contributions
oured+0.006
cream+0.006
 dip+0.005
 sprink+0.005
 bait+0.005
Neg logit contributions
 villains-0.005
 hall-0.004
 dragons-0.004
 monsters-0.004
py-0.004
Token,
Token ID11
Feature activation+9.94e-01

Pos logit contributions
cream+0.040
oured+0.039
 sprink+0.036
asted+0.035
inated+0.035
Neg logit contributions
 villains-0.023
 hall-0.020
 monsters-0.019
 robbers-0.018
 maze-0.018
Token P
Token ID350
Feature activation-2.64e-01

Pos logit contributions
 villains+0.030
 hall+0.027
 robbers+0.026
 maze+0.025
 rac+0.025
Neg logit contributions
cream-0.050
oured-0.049
inated-0.044
 sprink-0.044
ck-0.043
Tokenuss
Token ID1046
Feature activation+3.44e+00

Pos logit contributions
oured+0.010
cream+0.010
 sprink+0.009
 dip+0.009
 bait+0.008
Neg logit contributions
 villains-0.012
 hall-0.011
 robbers-0.010
 dragons-0.010
py-0.010
Token noticed
Token ID6810
Feature activation-5.02e-01

Pos logit contributions
 villains+0.153
 hall+0.139
 dragons+0.131
py+0.129
 monsters+0.128
Neg logit contributions
oured-0.130
cream-0.129
 sprink-0.115
 bait-0.115
 dip-0.114
Token a
Token ID257
Feature activation+3.63e-02

Pos logit contributions
oured+0.020
cream+0.018
 dip+0.017
inated+0.017
int+0.017
Neg logit contributions
 villains-0.021
 monsters-0.020
 monster-0.020
 hall-0.020
 skeletons-0.020
Token bigger
Token ID5749
Feature activation+3.21e-01

Pos logit contributions
 villains+0.001
 hall+0.001
 dragons+0.001
 robbers+0.001
py+0.001
Neg logit contributions
oured-0.002
cream-0.002
 sprink-0.001
 dip-0.001
 bait-0.001
Token bug
Token ID5434
Feature activation+2.28e-02

Pos logit contributions
 villains+0.011
 hall+0.008
 robbers+0.008
py+0.008
 dragons+0.008
Neg logit contributions
oured-0.015
cream-0.015
 sprink-0.015
 dip-0.015
 bait-0.014
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
 villains+0.001
 hall+0.001
 robbers+0.001
 dragons+0.001
py+0.001
Neg logit contributions
oured-0.001
cream-0.001
 sprink-0.001
 dip-0.001
 bait-0.001
Token accidentally
Token ID14716
Feature activation-1.11e+00

Pos logit contributions
oured+0.005
cream+0.005
 bait+0.004
 sprink+0.004
 dip+0.004
Neg logit contributions
 villains-0.005
 hall-0.004
 robbers-0.004
 dragons-0.004
 monsters-0.004
Token knocked
Token ID13642
Feature activation-2.30e-01

Pos logit contributions
oured+0.043
cream+0.040
 dip+0.038
 bait+0.038
inated+0.038
Neg logit contributions
 villains-0.047
 hall-0.044
 dragons-0.042
 robbers-0.042
 monsters-0.041
Token over
Token ID625
Feature activation-8.38e-01

Pos logit contributions
oured+0.010
cream+0.010
 sprink+0.009
 dip+0.009
 bait+0.009
Neg logit contributions
 villains-0.009
 hall-0.008
 dragons-0.007
 robbers-0.007
py-0.007
Token the
Token ID262
Feature activation+5.06e-01

Pos logit contributions
oured+0.032
cream+0.030
 dip+0.028
 bait+0.027
 sprink+0.027
Neg logit contributions
 villains-0.037
 hall-0.034
 dragons-0.032
 robbers-0.032
 monsters-0.032
Token books
Token ID3835
Feature activation+4.55e-01

Pos logit contributions
 villains+0.020
py+0.017
oon+0.015
dates+0.014
 robbers+0.014
Neg logit contributions
 sprink-0.022
cream-0.021
oured-0.020
 vanilla-0.019
 dip-0.019
Tokenhe
Token ID258
Feature activation+3.43e+00

Pos logit contributions
 villains+0.020
py+0.016
 robbers+0.016
 hall+0.015
 monsters+0.015
Neg logit contributions
cream-0.019
oured-0.019
 sprink-0.017
 dip-0.017
 vanilla-0.016
Tokenlf
Token ID1652
Feature activation+1.32e+00

Pos logit contributions
 villains+0.134
py+0.113
 robbers+0.108
Raw+0.107
umpy+0.106
Neg logit contributions
cream-0.139
 sprink-0.128
 as-0.127
 placed-0.122
oured-0.112
Token.
Token ID13
Feature activation-2.38e-01

Pos logit contributions
 villains+0.060
 hall+0.049
 robbers+0.048
 villain+0.048
py+0.048
Neg logit contributions
cream-0.051
oured-0.050
 sprink-0.045
 dip-0.044
 placed-0.043
Token
Token ID220
Feature activation+8.23e-01

Pos logit contributions
cream+0.009
oured+0.009
 sprink+0.008
 dip+0.008
 bait+0.008
Neg logit contributions
 villains-0.011
 hall-0.009
 dragons-0.009
py-0.009
 monsters-0.009
Token
Token ID198
Feature activation-4.05e-01

Pos logit contributions
 villains+0.036
 hall+0.031
 robbers+0.029
 monsters+0.029
 dragons+0.029
Neg logit contributions
cream-0.033
oured-0.032
 sprink-0.029
 dip-0.028
 bait-0.028
Token
Token ID198
Feature activation-4.08e-01

Pos logit contributions
cream+0.014
oured+0.014
 sprink+0.013
 dip+0.013
 placed+0.012
Neg logit contributions
 villains-0.020
 hall-0.016
 monsters-0.016
 villain-0.016
 dragons-0.016
Token
Token ID198
Feature activation+6.71e-01

Pos logit contributions
 villains+0.027
 hall+0.023
 dragons+0.021
 robbers+0.021
py+0.021
Neg logit contributions
oured-0.029
cream-0.029
 sprink-0.027
 dip-0.027
 bait-0.026
Token"
Token ID1
Feature activation-1.11e-01

Pos logit contributions
 villains+0.021
 hall+0.018
 robbers+0.015
 maze+0.015
 dragons+0.015
Neg logit contributions
oured-0.036
cream-0.035
 sprink-0.032
 dip-0.032
 bait-0.031
TokenTime
Token ID7575
Feature activation-6.31e-01

Pos logit contributions
oured+0.005
cream+0.005
 sprink+0.004
 dip+0.004
 bait+0.004
Neg logit contributions
 villains-0.005
 hall-0.004
 robbers-0.004
 dragons-0.004
 monsters-0.004
Token to
Token ID284
Feature activation+1.44e-01

Pos logit contributions
oured+0.025
cream+0.025
 sprink+0.022
 dip+0.022
 bait+0.022
Neg logit contributions
 villains-0.027
 hall-0.024
 dragons-0.022
 robbers-0.022
 monsters-0.022
Token bat
Token ID7365
Feature activation+5.55e-01

Pos logit contributions
 villains+0.006
 hall+0.006
 dragons+0.005
 robbers+0.005
 monsters+0.005
Neg logit contributions
oured-0.006
cream-0.006
 sprink-0.005
 bait-0.005
 dip-0.005
Tokenhe
Token ID258
Feature activation+3.43e+00

Pos logit contributions
 villains+0.023
 robbers+0.018
 hall+0.018
 dragons+0.018
 monsters+0.017
Neg logit contributions
cream-0.024
oured-0.024
 sprink-0.022
 dip-0.022
 bait-0.021
Token,
Token ID11
Feature activation-3.56e-01

Pos logit contributions
 villains+0.147
 hall+0.129
 dragons+0.122
 monsters+0.119
py+0.119
Neg logit contributions
oured-0.137
cream-0.136
 dip-0.122
 sprink-0.121
 bait-0.118
Token L
Token ID406
Feature activation+2.95e-01

Pos logit contributions
oured+0.018
cream+0.018
shaped+0.017
inated+0.016
 placed+0.016
Neg logit contributions
 villains-0.011
 hall-0.010
 monsters-0.009
 robbers-0.009
 villain-0.009
Tokenila
Token ID10102
Feature activation+1.89e+00

Pos logit contributions
 villains+0.010
 hall+0.008
 robbers+0.008
 dragons+0.008
 monsters+0.007
Neg logit contributions
oured-0.015
cream-0.015
 dip-0.014
 sprink-0.014
 bait-0.013
Token and
Token ID290
Feature activation+2.33e-01

Pos logit contributions
 villains+0.080
 hall+0.071
 dragons+0.065
 robbers+0.065
py+0.064
Neg logit contributions
oured-0.076
cream-0.076
 sprink-0.068
 dip-0.068
 bait-0.067
Token Tom
Token ID4186
Feature activation+4.56e-01

Pos logit contributions
 villains+0.008
 hall+0.007
 robbers+0.007
 dragons+0.006
 lawyers+0.006
Neg logit contributions
oured-0.012
cream-0.011
shaped-0.010
 dip-0.010
 placed-0.010
Token his
Token ID465
Feature activation+6.13e-01

Pos logit contributions
oured+0.023
cream+0.022
 sprink+0.020
 dip+0.020
 bait+0.020
Neg logit contributions
 villains-0.022
 hall-0.019
 dragons-0.018
 robbers-0.017
 monsters-0.017
Token hand
Token ID1021
Feature activation-4.12e-01

Pos logit contributions
 villains+0.022
 hall+0.021
 maze+0.019
 robbers+0.018
 chamber+0.018
Neg logit contributions
cream-0.029
oured-0.027
 sprink-0.025
 bait-0.024
 mun-0.024
Token to
Token ID284
Feature activation+3.12e-01

Pos logit contributions
cream+0.021
oured+0.020
 sprink+0.019
asted+0.019
 bait+0.019
Neg logit contributions
 villains-0.011
 hall-0.011
 dragons-0.009
 alone-0.009
 monsters-0.009
Token pull
Token ID2834
Feature activation+7.82e-01

Pos logit contributions
 hall+0.012
 seek+0.012
 villains+0.011
 robbers+0.011
 monster+0.011
Neg logit contributions
cream-0.013
oured-0.012
acket-0.011
endant-0.010
inated-0.010
Token the
Token ID262
Feature activation+3.26e-01

Pos logit contributions
 villains+0.033
 hall+0.029
 dragons+0.028
 monsters+0.028
 robbers+0.028
Neg logit contributions
oured-0.031
cream-0.031
 sprink-0.027
 dip-0.027
 mun-0.026
Token bud
Token ID3600
Feature activation+3.37e+00

Pos logit contributions
 villains+0.010
 hall+0.010
 cabinet+0.009
 maze+0.009
 stair+0.009
Neg logit contributions
cream-0.016
oured-0.015
 mun-0.013
asted-0.013
 bait-0.013
Token.
Token ID13
Feature activation+4.72e-01

Pos logit contributions
 villains+0.113
 hall+0.111
 dragons+0.097
 maze+0.096
py+0.095
Neg logit contributions
cream-0.154
oured-0.153
 sprink-0.137
 bait-0.135
 mun-0.135
Token\n
Token ID198
Feature activation+2.61e-01

Pos logit contributions
 villains+0.015
 hall+0.013
 robbers+0.011
 dragons+0.011
 villain+0.011
Neg logit contributions
cream-0.024
oured-0.024
 sprink-0.022
 dip-0.021
shaped-0.021
Token\n
Token ID198
Feature activation+2.16e-01

Pos logit contributions
 villains+0.010
 hall+0.009
 dragons+0.008
 robbers+0.008
 monsters+0.008
Neg logit contributions
oured-0.011
cream-0.011
 sprink-0.010
 dip-0.010
 bait-0.010
Token"
Token ID1
Feature activation-1.98e-01

Pos logit contributions
 villains+0.007
 hall+0.006
py+0.005
 monsters+0.005
 dragons+0.005
Neg logit contributions
cream-0.011
oured-0.011
 dip-0.010
 bait-0.010
 sprink-0.010
TokenNo
Token ID2949
Feature activation-3.74e-01

Pos logit contributions
oured+0.010
cream+0.010
 placed+0.009
 mun+0.009
 sprink+0.008
Neg logit contributions
 villains-0.007
 hall-0.005
 seek-0.005
Raw-0.005
 robbers-0.005
Token it
Token ID340
Feature activation+2.00e-01

Pos logit contributions
 villains+0.015
 dragons+0.014
 monsters+0.014
 hall+0.013
 lawyers+0.012
Neg logit contributions
oured-0.020
cream-0.018
 mun-0.017
 dip-0.017
int-0.017
Token go
Token ID467
Feature activation+5.90e-01

Pos logit contributions
 villains+0.006
 monsters+0.006
 alone+0.005
 hall+0.005
 dragons+0.005
Neg logit contributions
oured-0.011
cream-0.010
aman-0.009
int-0.009
onia-0.009
Token v
Token ID410
Feature activation+9.55e-01

Pos logit contributions
 villains+0.019
 hall+0.017
 monsters+0.015
 robbers+0.015
 dragons+0.015
Neg logit contributions
oured-0.030
cream-0.029
 sprink-0.028
int-0.027
 mun-0.027
Tokenroom
Token ID3823
Feature activation+6.74e-01

Pos logit contributions
 villains+0.031
 hall+0.025
 robbers+0.025
 dragons+0.024
 monsters+0.023
Neg logit contributions
cream-0.050
oured-0.048
 sprink-0.043
 bait-0.042
shaped-0.042
Token-
Token ID12
Feature activation+2.51e-01

Pos logit contributions
 villains+0.026
 hall+0.023
 dragons+0.021
 monsters+0.021
 robbers+0.020
Neg logit contributions
oured-0.030
cream-0.029
 sprink-0.027
 dip-0.026
 bait-0.026
Tokenv
Token ID85
Feature activation+3.36e+00

Pos logit contributions
 villains+0.010
 hall+0.008
umpy+0.008
baby+0.008
 lawyers+0.008
Neg logit contributions
oured-0.011
 sprink-0.010
 dip-0.010
 mun-0.009
 placed-0.009
Tokenroom
Token ID3823
Feature activation+4.59e-01

Pos logit contributions
 villains+0.116
 robbers+0.100
 hall+0.096
 maze+0.088
 witches+0.088
Neg logit contributions
cream-0.161
oured-0.153
 bait-0.140
 placed-0.139
 as-0.137
Token.
Token ID13
Feature activation-3.97e-01

Pos logit contributions
 villains+0.018
 hall+0.015
 dragons+0.014
 monsters+0.014
 robbers+0.014
Neg logit contributions
oured-0.020
cream-0.020
 sprink-0.018
 dip-0.018
 bait-0.018
Token The
Token ID383
Feature activation+1.76e-01

Pos logit contributions
cream+0.013
oured+0.013
 sprink+0.012
 dip+0.012
 bait+0.011
Neg logit contributions
 villains-0.020
 hall-0.017
py-0.017
 robbers-0.016
 monsters-0.016
Token truck
Token ID7779
Feature activation-2.76e-01

Pos logit contributions
 villains+0.006
 hall+0.005
 robbers+0.004
 dragons+0.004
 monsters+0.004
Neg logit contributions
oured-0.009
cream-0.009
 sprink-0.008
 dip-0.008
 bait-0.008
Token loved
Token ID6151
Feature activation-3.19e-02

Pos logit contributions
oured+0.014
 dip+0.012
cream+0.012
 bait+0.012
 sprink+0.012
Neg logit contributions
 villains-0.010
 hall-0.009
 dragons-0.008
 monster-0.008
 monsters-0.008
Token see
Token ID766
Feature activation-2.79e-01

Pos logit contributions
cream+0.004
oured+0.004
acket+0.004
 bait+0.003
 sprink+0.003
Neg logit contributions
 villains-0.004
 hall-0.004
 seek-0.004
 maze-0.004
 dragons-0.004
Token what
Token ID644
Feature activation-1.14e+00

Pos logit contributions
oured+0.014
cream+0.013
acket+0.012
 mun+0.012
int+0.012
Neg logit contributions
 dragons-0.009
 witches-0.008
 monsters-0.008
 skeletons-0.008
 villains-0.008
Token was
Token ID373
Feature activation-7.55e-01

Pos logit contributions
oured+0.049
cream+0.044
 dip+0.039
 bait+0.039
int+0.039
Neg logit contributions
 villains-0.049
 hall-0.044
 dragons-0.042
 monsters-0.041
 monster-0.040
Token inside
Token ID2641
Feature activation-4.14e-02

Pos logit contributions
oured+0.035
cream+0.035
 mun+0.031
 dip+0.031
 bait+0.031
Neg logit contributions
 villains-0.026
 hall-0.025
 monsters-0.022
 dragons-0.022
py-0.022
Token the
Token ID262
Feature activation-1.87e-02

Pos logit contributions
oured+0.002
cream+0.002
shaped+0.002
 mun+0.002
int+0.002
Neg logit contributions
 villains-0.001
 hall-0.001
 dragons-0.001
 monsters-0.001
 skeletons-0.001
Token bud
Token ID3600
Feature activation+3.35e+00

Pos logit contributions
cream+0.001
oured+0.001
 bait+0.001
 placed+0.001
 dip+0.001
Neg logit contributions
 villains-0.001
 hall-0.001
 dragons-0.001
 maze-0.001
 monsters-0.001
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
 hall+0.098
 villains+0.096
 dragons+0.082
py+0.082
 maze+0.081
Neg logit contributions
oured-0.168
cream-0.168
 bait-0.149
 sprink-0.148
 mun-0.147
Token\n
Token ID198
Feature activation+6.55e-02

Pos logit contributions
 villains+0.005
 hall+0.004
 robbers+0.004
 dragons+0.004
 monsters+0.004
Neg logit contributions
oured-0.007
cream-0.007
 sprink-0.006
 dip-0.006
 bait-0.006
Token\n
Token ID198
Feature activation+8.44e-02

Pos logit contributions
 villains+0.003
 hall+0.002
 robbers+0.002
 dragons+0.002
 monsters+0.002
Neg logit contributions
oured-0.003
cream-0.003
 dip-0.003
 sprink-0.003
 bait-0.002
Token"
Token ID1
Feature activation-3.30e-01

Pos logit contributions
 villains+0.003
 hall+0.002
 dragons+0.002
py+0.002
 monsters+0.002
Neg logit contributions
oured-0.004
cream-0.004
 dip-0.004
 sprink-0.004
 bait-0.004
TokenLet
Token ID5756
Feature activation+5.31e-02

Pos logit contributions
oured+0.015
cream+0.015
 sprink+0.013
 dip+0.013
 bait+0.013
Neg logit contributions
 villains-0.012
 hall-0.011
 robbers-0.010
 dragons-0.009
 lawyers-0.009
Token can
Token ID460
Feature activation-8.29e-01

Pos logit contributions
cream+0.027
oured+0.027
 sprink+0.025
 dip+0.024
 bait+0.024
Neg logit contributions
 villains-0.024
 hall-0.021
 robbers-0.019
 monsters-0.019
 maze-0.019
Token't
Token ID470
Feature activation+3.26e-01

Pos logit contributions
oured+0.038
cream+0.037
 sprink+0.033
 bait+0.033
shaped+0.033
Neg logit contributions
 villains-0.031
 hall-0.027
 dragons-0.025
 robbers-0.024
 monsters-0.024
Token reach
Token ID3151
Feature activation-5.94e-02

Pos logit contributions
 villains+0.015
 hall+0.013
 robbers+0.012
 dragons+0.012
 monsters+0.012
Neg logit contributions
oured-0.012
cream-0.012
 sprink-0.011
 dip-0.010
 bait-0.010
Token the
Token ID262
Feature activation+5.07e-01

Pos logit contributions
oured+0.003
cream+0.002
 mun+0.002
int+0.002
 dip+0.002
Neg logit contributions
 villains-0.002
 hall-0.002
 dragons-0.002
 monsters-0.002
 skeletons-0.002
Token books
Token ID3835
Feature activation+2.73e-01

Pos logit contributions
 villains+0.020
py+0.017
oon+0.015
 robbers+0.015
 hall+0.014
Neg logit contributions
 sprink-0.022
oured-0.021
cream-0.021
 dip-0.020
 vanilla-0.020
Tokenhe
Token ID258
Feature activation+3.35e+00

Pos logit contributions
 villains+0.010
 hall+0.008
 robbers+0.007
py+0.007
 dragons+0.007
Neg logit contributions
oured-0.013
cream-0.013
 sprink-0.012
 dip-0.012
 bait-0.012
Tokenlf
Token ID1652
Feature activation+1.26e+00

Pos logit contributions
 villains+0.120
py+0.088
 robbers+0.086
 villain+0.086
umpy+0.084
Neg logit contributions
cream-0.153
 sprink-0.146
 as-0.138
oured-0.137
 placed-0.136
Token.
Token ID13
Feature activation+5.03e-01

Pos logit contributions
 villains+0.052
 hall+0.046
 dragons+0.042
 robbers+0.042
 monsters+0.041
Neg logit contributions
oured-0.052
cream-0.051
 sprink-0.047
 dip-0.047
 bait-0.046
Token Can
Token ID1680
Feature activation+4.30e-02

Pos logit contributions
 villains+0.018
 hall+0.015
 robbers+0.014
 lawyers+0.014
 dragons+0.013
Neg logit contributions
oured-0.024
cream-0.023
 dip-0.022
 sprink-0.022
ck-0.022
Token you
Token ID345
Feature activation-4.97e-01

Pos logit contributions
 villains+0.001
 hall+0.001
 robbers+0.001
 dragons+0.001
 monsters+0.001
Neg logit contributions
oured-0.002
cream-0.002
 bait-0.002
 sprink-0.002
 placed-0.002
Token reach
Token ID3151
Feature activation-2.53e-01

Pos logit contributions
cream+0.022
oured+0.022
 sprink+0.020
shaped+0.019
 vanilla+0.019
Neg logit contributions
 villains-0.019
 hall-0.016
 robbers-0.015
 maze-0.015
 dragons-0.015
Token Now
Token ID2735
Feature activation+3.46e-01

Pos logit contributions
oured+0.018
cream+0.018
 dip+0.017
 sprink+0.017
 bait+0.017
Neg logit contributions
 villains-0.012
 hall-0.010
 robbers-0.009
py-0.009
 villain-0.009
Token we
Token ID356
Feature activation+1.65e-01

Pos logit contributions
 villains+0.013
 dragons+0.011
 hall+0.011
 monsters+0.011
 robbers+0.011
Neg logit contributions
oured-0.016
cream-0.016
shaped-0.015
 sprink-0.014
int-0.013
Token have
Token ID423
Feature activation+1.87e-01

Pos logit contributions
 villains+0.006
 hall+0.005
 dragons+0.005
 robbers+0.005
py+0.005
Neg logit contributions
oured-0.008
cream-0.008
 sprink-0.007
 dip-0.007
 bait-0.007
Token a
Token ID257
Feature activation-2.03e-01

Pos logit contributions
 villains+0.007
 dragons+0.007
 monsters+0.006
 robbers+0.006
 knights+0.006
Neg logit contributions
oured-0.008
cream-0.008
immer-0.007
inated-0.007
int-0.007
Token books
Token ID3835
Feature activation-3.00e-01

Pos logit contributions
cream+0.009
oured+0.009
shaped+0.008
ck+0.008
 sprink+0.008
Neg logit contributions
 villains-0.007
 hall-0.007
 maze-0.006
 robbers-0.006
 monster-0.006
Tokenhe
Token ID258
Feature activation+3.34e+00

Pos logit contributions
oured+0.012
cream+0.012
 sprink+0.011
 dip+0.011
 bait+0.011
Neg logit contributions
 villains-0.013
 hall-0.011
 dragons-0.010
 robbers-0.010
py-0.010
Tokenlf
Token ID1652
Feature activation+9.28e-01

Pos logit contributions
 villains+0.142
 hall+0.117
 villain+0.114
 robbers+0.110
py+0.109
Neg logit contributions
cream-0.129
 bait-0.119
oured-0.119
 sprink-0.117
 placed-0.115
Token!"
Token ID2474
Feature activation+9.92e-02

Pos logit contributions
 villains+0.039
 hall+0.034
 dragons+0.031
 robbers+0.031
py+0.031
Neg logit contributions
oured-0.038
cream-0.037
 sprink-0.034
 dip-0.034
 bait-0.033
Token<|endoftext|>
Token ID50256
Feature activation+2.55e-01

Pos logit contributions
 villains+0.003
 hall+0.003
 dragons+0.002
 robbers+0.002
 maze+0.002
Neg logit contributions
oured-0.005
cream-0.005
 dip-0.005
 sprink-0.005
ck-0.004
TokenM
Token ID44
Feature activation-1.77e+00

Pos logit contributions
 villains+0.011
 hall+0.009
 robbers+0.009
 villain+0.009
 monsters+0.009
Neg logit contributions
cream-0.010
oured-0.010
 sprink-0.009
 dip-0.009
 bait-0.009
Tokenummy
Token ID13513
Feature activation-1.66e+00

Pos logit contributions
oured+0.076
cream+0.076
 dip+0.071
 bait+0.069
inated+0.068
Neg logit contributions
 villains-0.064
 hall-0.059
py-0.054
 maze-0.054
 dragons-0.053
TokenThey
Token ID2990
Feature activation-5.17e-01

Pos logit contributions
 villains+0.002
 hall+0.002
py+0.002
 dragons+0.002
 monsters+0.002
Neg logit contributions
cream-0.003
oured-0.003
 bait-0.003
 sprink-0.003
 dip-0.003
Token ran
Token ID4966
Feature activation+3.44e-03

Pos logit contributions
cream+0.019
oured+0.019
 sprink+0.017
 dip+0.016
 bait+0.016
Neg logit contributions
 villains-0.024
 hall-0.021
 dragons-0.020
 robbers-0.020
 monsters-0.020
Token to
Token ID284
Feature activation+2.73e-01

Pos logit contributions
 villains+0.000
 hall+0.000
py+0.000
oon+0.000
 dragons+0.000
Neg logit contributions
oured-0.000
cream-0.000
 dip-0.000
 bait-0.000
 as-0.000
Token the
Token ID262
Feature activation+3.46e-01

Pos logit contributions
 villains+0.011
 hall+0.010
 robbers+0.009
 dragons+0.009
 monsters+0.009
Neg logit contributions
cream-0.012
oured-0.012
 sprink-0.010
int-0.010
shaped-0.010
Token books
Token ID3835
Feature activation+2.16e-01

Pos logit contributions
 villains+0.012
py+0.010
oon+0.009
 villain+0.009
robat+0.009
Neg logit contributions
cream-0.016
 sprink-0.016
oured-0.015
 dip-0.015
 vanilla-0.015
Tokenhe
Token ID258
Feature activation+3.32e+00

Pos logit contributions
 villains+0.011
py+0.009
 robbers+0.009
 skeletons+0.009
 witches+0.009
Neg logit contributions
oured-0.008
 dip-0.007
cream-0.007
 sprink-0.007
 placed-0.007
Tokenlf
Token ID1652
Feature activation+1.10e+00

Pos logit contributions
 villains+0.094
 robbers+0.073
 villain+0.069
 lawyers+0.067
umpy+0.065
Neg logit contributions
cream-0.163
 sprink-0.163
 dip-0.161
 placed-0.158
 as-0.157
Token and
Token ID290
Feature activation+1.44e-02

Pos logit contributions
 villains+0.054
 villain+0.045
py+0.045
 robbers+0.045
 hall+0.045
Neg logit contributions
oured-0.037
cream-0.036
 dip-0.033
 sprink-0.033
 placed-0.033
Token put
Token ID1234
Feature activation-2.98e-03

Pos logit contributions
 villains+0.001
 hall+0.001
 robbers+0.001
 dragons+0.001
 monsters+0.001
Neg logit contributions
oured-0.001
cream-0.001
shaped-0.000
int-0.000
inated-0.000
Token the
Token ID262
Feature activation-3.04e-01

Pos logit contributions
oured+0.000
inated+0.000
rig+0.000
 mun+0.000
cream+0.000
Neg logit contributions
 dragons-0.000
 hall-0.000
 monsters-0.000
 robbers-0.000
 villains-0.000
Token box
Token ID3091
Feature activation+2.31e-02

Pos logit contributions
oured+0.014
cream+0.014
 mun+0.013
 dip+0.013
 sprink+0.012
Neg logit contributions
 villains-0.011
 hall-0.010
 dragons-0.009
 robbers-0.009
 monsters-0.009
Token puppy
Token ID26188
Feature activation+6.97e-01

Pos logit contributions
 villains+0.016
 hall+0.014
py+0.013
 dragons+0.013
 monsters+0.013
Neg logit contributions
oured-0.014
cream-0.013
 dip-0.013
 sprink-0.013
 bait-0.012
Token jumped
Token ID11687
Feature activation-6.76e-01

Pos logit contributions
 villains+0.031
 hall+0.027
 dragons+0.025
 robbers+0.025
py+0.025
Neg logit contributions
oured-0.027
cream-0.026
 sprink-0.024
 dip-0.024
 bait-0.023
Token on
Token ID319
Feature activation-4.36e-01

Pos logit contributions
oured+0.023
cream+0.022
 sprink+0.020
 dip+0.020
 bait+0.020
Neg logit contributions
 villains-0.033
 hall-0.029
py-0.027
 robbers-0.027
 lawyers-0.027
Token the
Token ID262
Feature activation+2.28e-01

Pos logit contributions
oured+0.013
cream+0.013
 sprink+0.012
 dip+0.012
 placed+0.011
Neg logit contributions
 villains-0.023
 hall-0.020
py-0.019
 robbers-0.019
 villain-0.019
Token books
Token ID3835
Feature activation+2.00e-01

Pos logit contributions
 villains+0.009
py+0.008
 lawyers+0.007
dates+0.007
oon+0.007
Neg logit contributions
oured-0.010
 sprink-0.009
cream-0.009
 dip-0.009
 vanilla-0.008
Tokenhe
Token ID258
Feature activation+3.31e+00

Pos logit contributions
 villains+0.008
 robbers+0.007
py+0.007
 hall+0.007
 skeletons+0.006
Neg logit contributions
oured-0.009
cream-0.008
 dip-0.008
 sprink-0.008
 placed-0.007
Tokenlf
Token ID1652
Feature activation+1.07e+00

Pos logit contributions
 villains+0.100
Raw+0.078
 robbers+0.078
py+0.078
umpy+0.075
Neg logit contributions
 as-0.153
cream-0.151
 sprink-0.149
oured-0.145
 dip-0.144
Token and
Token ID290
Feature activation-8.06e-02

Pos logit contributions
 villains+0.048
 hall+0.041
 robbers+0.039
py+0.039
 villain+0.038
Neg logit contributions
oured-0.041
cream-0.040
 dip-0.036
 sprink-0.036
 placed-0.035
Token knocked
Token ID13642
Feature activation-3.55e-01

Pos logit contributions
oured+0.003
cream+0.003
 sprink+0.003
 dip+0.003
 bait+0.003
Neg logit contributions
 villains-0.004
 hall-0.003
 dragons-0.003
 robbers-0.003
 monsters-0.003
Token all
Token ID477
Feature activation+3.79e-01

Pos logit contributions
cream+0.011
oured+0.011
 sprink+0.010
 vanilla+0.010
 placed+0.009
Neg logit contributions
 villains-0.020
 villain-0.016
 hall-0.016
 monsters-0.015
 robbers-0.015
Token the
Token ID262
Feature activation+5.40e-02

Pos logit contributions
 villains+0.018
 hall+0.015
 robbers+0.015
 villain+0.015
 monsters+0.014
Neg logit contributions
cream-0.014
oured-0.013
 dip-0.012
 sprink-0.012
 placed-0.011
Token He
Token ID679
Feature activation-8.59e-02

Pos logit contributions
 villains+0.013
py+0.011
 hall+0.010
 dragons+0.010
 monsters+0.010
Neg logit contributions
cream-0.018
oured-0.018
 vanilla-0.016
 sprink-0.016
 dip-0.016
Token goes
Token ID2925
Feature activation+1.01e-01

Pos logit contributions
oured+0.004
cream+0.004
 bait+0.003
 dip+0.003
 sprink+0.003
Neg logit contributions
 villains-0.003
 hall-0.003
 dragons-0.003
 robbers-0.003
 monsters-0.003
Token to
Token ID284
Feature activation+4.07e-01

Pos logit contributions
 villains+0.005
 hall+0.004
py+0.004
 dragons+0.004
 monsters+0.004
Neg logit contributions
oured-0.004
cream-0.004
 sprink-0.003
 dip-0.003
 bait-0.003
Token the
Token ID262
Feature activation+5.28e-01

Pos logit contributions
 villains+0.019
 hall+0.017
 dragons+0.016
py+0.016
 robbers+0.016
Neg logit contributions
oured-0.015
cream-0.015
 sprink-0.013
 dip-0.013
 bait-0.013
Token books
Token ID3835
Feature activation+1.11e-01

Pos logit contributions
 villains+0.018
py+0.016
oon+0.013
 villain+0.013
umpy+0.012
Neg logit contributions
 sprink-0.025
cream-0.024
oured-0.024
 vanilla-0.023
 dip-0.023
Tokenhe
Token ID258
Feature activation+3.30e+00

Pos logit contributions
 villains+0.005
py+0.004
 robbers+0.004
 monsters+0.004
 skeletons+0.004
Neg logit contributions
oured-0.004
cream-0.004
 dip-0.004
 sprink-0.004
 placed-0.004
Tokenlf
Token ID1652
Feature activation+9.26e-01

Pos logit contributions
 villains+0.100
 robbers+0.081
py+0.080
Raw+0.077
 villain+0.071
Neg logit contributions
 sprink-0.147
cream-0.147
 as-0.147
 placed-0.140
 dip-0.140
Token and
Token ID290
Feature activation+2.30e-02

Pos logit contributions
 villains+0.042
 hall+0.035
 robbers+0.034
py+0.034
 villain+0.033
Neg logit contributions
oured-0.035
cream-0.035
 sprink-0.031
 dip-0.031
 bait-0.030
Token looks
Token ID3073
Feature activation-1.39e+00

Pos logit contributions
 villains+0.001
 hall+0.001
 dragons+0.001
 robbers+0.001
 maze+0.001
Neg logit contributions
oured-0.001
cream-0.001
immer-0.001
 bait-0.001
shaped-0.001
Token for
Token ID329
Feature activation+7.36e-01

Pos logit contributions
cream+0.069
oured+0.066
shaped+0.062
 bait+0.061
 sprink+0.061
Neg logit contributions
 villains-0.041
 hall-0.040
 alone-0.038
 dragons-0.037
 robbers-0.036
Token it
Token ID340
Feature activation-3.82e-01

Pos logit contributions
 villains+0.033
 hall+0.028
 monsters+0.028
 dragons+0.028
 robbers+0.027
Neg logit contributions
oured-0.030
cream-0.029
inated-0.026
 dip-0.025
int-0.025
Token,
Token ID11
Feature activation+2.90e-01

Pos logit contributions
 hall+0.012
 villains+0.011
 monster+0.011
 cabinet+0.011
baby+0.010
Neg logit contributions
cream-0.031
shaped-0.030
oured-0.030
asted-0.029
 bait-0.029
Token I
Token ID314
Feature activation-1.91e-01

Pos logit contributions
 villains+0.011
 hall+0.010
 dragons+0.009
 monster+0.009
 maze+0.009
Neg logit contributions
oured-0.014
cream-0.013
shaped-0.012
inated-0.012
 bait-0.012
Token finished
Token ID5201
Feature activation+2.26e-02

Pos logit contributions
cream+0.008
oured+0.008
 sprink+0.007
 bait+0.007
 dip+0.007
Neg logit contributions
 villains-0.007
 hall-0.007
 robbers-0.006
 villain-0.006
 monsters-0.006
Token the
Token ID262
Feature activation-2.11e-01

Pos logit contributions
 villains+0.001
 hall+0.001
 dragons+0.001
 lawyers+0.001
 monsters+0.001
Neg logit contributions
oured-0.001
cream-0.001
int-0.001
 dip-0.001
 bait-0.001
Token books
Token ID3835
Feature activation-1.29e-01

Pos logit contributions
oured+0.010
 sprink+0.010
cream+0.010
 dip+0.009
 vanilla+0.009
Neg logit contributions
 villains-0.007
py-0.006
 hall-0.006
 robbers-0.005
 villain-0.005
Tokenhe
Token ID258
Feature activation+3.29e+00

Pos logit contributions
oured+0.006
cream+0.006
 dip+0.006
 bait+0.006
 sprink+0.006
Neg logit contributions
 villains-0.004
 hall-0.004
 dragons-0.003
 maze-0.003
 monsters-0.003
Tokenlf
Token ID1652
Feature activation+9.68e-01

Pos logit contributions
 villains+0.107
py+0.078
 villain+0.077
 robbers+0.077
Raw+0.075
Neg logit contributions
cream-0.158
 sprink-0.154
oured-0.144
 placed-0.143
 as-0.141
Token!"
Token ID2474
Feature activation-1.62e-01

Pos logit contributions
 villains+0.040
 hall+0.037
 dragons+0.034
 monsters+0.033
 robbers+0.033
Neg logit contributions
oured-0.038
cream-0.038
 dip-0.035
 bait-0.034
 sprink-0.034
Token Andy
Token ID12382
Feature activation-6.79e-02

Pos logit contributions
oured+0.007
cream+0.007
 dip+0.006
 sprink+0.006
 bait+0.006
Neg logit contributions
 villains-0.006
 hall-0.006
 dragons-0.005
py-0.005
 robbers-0.005
Token said
Token ID531
Feature activation-3.35e-01

Pos logit contributions
cream+0.003
oured+0.003
 bait+0.003
 dip+0.002
 sprink+0.002
Neg logit contributions
 villains-0.003
 hall-0.003
 dragons-0.002
 sisters-0.002
py-0.002
Token with
Token ID351
Feature activation+1.33e+00

Pos logit contributions
cream+0.016
oured+0.015
 sprink+0.014
 bait+0.014
 dip+0.014
Neg logit contributions
 villains-0.012
 hall-0.010
 dragons-0.009
 robbers-0.009
 lawyers-0.009
Token When
Token ID1649
Feature activation+2.05e-01

Pos logit contributions
 villains+0.003
 hall+0.002
 robbers+0.002
 dragons+0.002
 maze+0.002
Neg logit contributions
oured-0.004
cream-0.004
 dip-0.004
 sprink-0.004
 bait-0.004
Token Daisy
Token ID40355
Feature activation-2.38e-01

Pos logit contributions
 villains+0.008
 hall+0.007
 dragons+0.007
 robbers+0.007
 monsters+0.007
Neg logit contributions
oured-0.009
cream-0.009
 sprink-0.008
 dip-0.008
 bait-0.007
Token put
Token ID1234
Feature activation+2.14e-01

Pos logit contributions
cream+0.010
oured+0.010
 bait+0.009
 sprink+0.009
shaped+0.009
Neg logit contributions
 villains-0.009
 hall-0.008
 dragons-0.007
 robbers-0.007
 monster-0.007
Token the
Token ID262
Feature activation-2.51e-01

Pos logit contributions
 villains+0.008
 hall+0.007
 dragons+0.006
 monsters+0.006
 maze+0.006
Neg logit contributions
cream-0.010
oured-0.010
 dip-0.009
 sprink-0.009
 bait-0.009
Token magn
Token ID7842
Feature activation+1.80e-01

Pos logit contributions
oured+0.012
cream+0.012
inated+0.012
ck+0.011
asted+0.011
Neg logit contributions
 hall-0.008
 villains-0.008
 monster-0.007
 maze-0.007
 cabinet-0.007
Tokenifying
Token ID4035
Feature activation-3.78e+00

Pos logit contributions
 villains+0.010
 hall+0.009
 dragons+0.009
oon+0.009
py+0.009
Neg logit contributions
cream-0.004
asted-0.004
 bait-0.004
oured-0.003
 mun-0.003
Token glass
Token ID5405
Feature activation-4.76e-01

Pos logit contributions
oured+0.156
cream+0.153
ck+0.148
int+0.134
 dip+0.132
Neg logit contributions
 hall-0.143
 chamber-0.131
 villains-0.129
 maze-0.126
 cabinet-0.123
Token down
Token ID866
Feature activation-1.21e+00

Pos logit contributions
oured+0.019
cream+0.019
 sprink+0.017
 dip+0.017
 bait+0.017
Neg logit contributions
 villains-0.021
 hall-0.017
 robbers-0.017
 dragons-0.016
py-0.016
Token,
Token ID11
Feature activation-5.16e-02

Pos logit contributions
cream+0.053
oured+0.052
 dip+0.047
 sprink+0.047
 bait+0.046
Neg logit contributions
 villains-0.047
 hall-0.042
 dragons-0.038
 monsters-0.038
 maze-0.038
Token the
Token ID262
Feature activation-3.75e-01

Pos logit contributions
oured+0.002
cream+0.002
 sprink+0.002
 dip+0.002
 bait+0.002
Neg logit contributions
 villains-0.002
 hall-0.002
 dragons-0.001
 robbers-0.001
py-0.001
Token cater
Token ID20825
Feature activation-2.80e-01

Pos logit contributions
oured+0.020
cream+0.020
 dip+0.017
shaped+0.017
inated+0.017
Neg logit contributions
 villains-0.012
 hall-0.011
 monsters-0.009
 monster-0.009
 maze-0.009
Token flashlight
Token ID38371
Feature activation-1.45e-02

Pos logit contributions
cream+0.005
oured+0.005
 sprink+0.005
 dip+0.005
 bait+0.005
Neg logit contributions
 villains-0.004
 hall-0.003
 robbers-0.003
 maze-0.003
 dragons-0.003
Token,
Token ID11
Feature activation-2.57e-01

Pos logit contributions
oured+0.001
cream+0.001
 bait+0.001
 dip+0.001
 sprink+0.001
Neg logit contributions
 villains-0.001
 hall-0.001
 dragons-0.000
 monsters-0.000
 maze-0.000
Token and
Token ID290
Feature activation-3.66e-01

Pos logit contributions
oured+0.013
cream+0.012
inated+0.011
int+0.011
shaped+0.011
Neg logit contributions
 villains-0.009
 hall-0.008
 monsters-0.008
 dragons-0.008
 lawyers-0.007
Token a
Token ID257
Feature activation-9.02e-01

Pos logit contributions
oured+0.014
cream+0.013
inated+0.012
int+0.012
 dip+0.011
Neg logit contributions
 villains-0.017
 hall-0.015
 dragons-0.015
 monsters-0.015
 robbers-0.014
Token magn
Token ID7842
Feature activation-3.09e-01

Pos logit contributions
cream+0.043
oured+0.043
 sprink+0.038
 dip+0.038
 bait+0.038
Neg logit contributions
 villains-0.032
 hall-0.028
 robbers-0.025
 dragons-0.025
 maze-0.024
Tokenifying
Token ID4035
Feature activation-3.56e+00

Pos logit contributions
cream+0.007
oured+0.006
 bait+0.006
asted+0.006
 sprink+0.006
Neg logit contributions
 villains-0.017
 hall-0.016
 dragons-0.016
py-0.016
 monsters-0.015
Token glass
Token ID5405
Feature activation-1.20e+00

Pos logit contributions
oured+0.145
cream+0.143
ck+0.134
int+0.123
 dip+0.122
Neg logit contributions
 hall-0.142
 villains-0.136
 chamber-0.128
 maze-0.127
 monster-0.124
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
oured+0.052
cream+0.052
 sprink+0.048
 dip+0.047
 bait+0.046
Neg logit contributions
 villains-0.047
 hall-0.040
 dragons-0.037
 robbers-0.037
py-0.036
Token\n
Token ID198
Feature activation+1.47e-02

Pos logit contributions
oured+0.006
cream+0.006
 dip+0.005
 sprink+0.005
 bait+0.005
Neg logit contributions
 villains-0.003
 hall-0.003
 dragons-0.003
 robbers-0.003
py-0.002
Token\n
Token ID198
Feature activation+5.50e-02

Pos logit contributions
 villains+0.001
 hall+0.001
 monsters+0.001
 robbers+0.001
 dragons+0.001
Neg logit contributions
oured-0.001
cream-0.001
 sprink-0.001
 dip-0.001
 bait-0.001
TokenAs
Token ID1722
Feature activation+5.18e-01

Pos logit contributions
 villains+0.002
 hall+0.002
 monsters+0.002
 dragons+0.002
py+0.001
Neg logit contributions
oured-0.003
cream-0.003
 sprink-0.002
 dip-0.002
 bait-0.002
Token a
Token ID257
Feature activation+1.79e-01

Pos logit contributions
oured+0.022
cream+0.021
 dip+0.018
 placed+0.017
 sprink+0.017
Neg logit contributions
 villains-0.029
 hall-0.026
 dragons-0.025
 monsters-0.025
py-0.024
Token family
Token ID1641
Feature activation+2.78e-01

Pos logit contributions
 villains+0.008
 hall+0.007
py+0.007
 dragons+0.007
 monsters+0.007
Neg logit contributions
oured-0.006
cream-0.006
 sprink-0.006
 dip-0.006
 bait-0.005
Token who
Token ID508
Feature activation-1.06e-01

Pos logit contributions
 hall+0.010
 villains+0.009
py+0.008
 dragons+0.008
 lawyers+0.008
Neg logit contributions
oured-0.014
cream-0.012
 bait-0.012
 dip-0.011
 sprink-0.011
Token really
Token ID1107
Feature activation-1.13e+00

Pos logit contributions
oured+0.005
cream+0.005
 bait+0.004
 dip+0.004
 sprink+0.004
Neg logit contributions
 villains-0.004
 hall-0.004
 monsters-0.003
 dragons-0.003
 chamber-0.003
Token enjoyed
Token ID8359
Feature activation-5.46e-01

Pos logit contributions
oured+0.047
cream+0.045
 sprink+0.041
 dip+0.041
 bait+0.040
Neg logit contributions
 villains-0.049
 hall-0.043
 robbers-0.038
 villain-0.038
 monsters-0.037
Token pic
Token ID8301
Feature activation-3.52e+00

Pos logit contributions
oured+0.023
cream+0.023
 sprink+0.021
 dip+0.021
 bait+0.021
Neg logit contributions
 villains-0.022
 hall-0.019
 dragons-0.018
 robbers-0.017
 monsters-0.017
Tokenn
Token ID77
Feature activation+8.68e-01

Pos logit contributions
oured+0.127
cream+0.126
 sprink+0.110
 bait+0.108
 dip+0.107
Neg logit contributions
 villains-0.165
 hall-0.147
py-0.139
 dragons-0.134
 robbers-0.132
Tokenics
Token ID873
Feature activation-7.52e-01

Pos logit contributions
 villains+0.035
 hall+0.031
 dragons+0.029
py+0.029
 monsters+0.028
Neg logit contributions
oured-0.036
cream-0.036
 sprink-0.033
 dip-0.032
 bait-0.032
Token.
Token ID13
Feature activation-3.28e-01

Pos logit contributions
oured+0.036
cream+0.035
 dip+0.032
 sprink+0.032
shaped+0.031
Neg logit contributions
 villains-0.027
 hall-0.023
 dragons-0.022
 monsters-0.021
 robbers-0.021
Token They
Token ID1119
Feature activation-4.49e-03

Pos logit contributions
oured+0.018
cream+0.017
ck+0.017
shaped+0.016
 bait+0.016
Neg logit contributions
 villains-0.010
 hall-0.009
 robbers-0.008
 chamber-0.007
 basement-0.007
Token loved
Token ID6151
Feature activation+1.22e-01

Pos logit contributions
oured+0.000
cream+0.000
 sprink+0.000
 dip+0.000
 bait+0.000
Neg logit contributions
 villains-0.000
 hall-0.000
 dragons-0.000
 robbers-0.000
py-0.000
Token 3
Token ID513
Feature activation+9.78e-01

Pos logit contributions
cream+0.000
oured+0.000
shaped+0.000
 sprink+0.000
ck+0.000
Neg logit contributions
 villains-0.000
 hall-0.000
 robbers-0.000
 maze-0.000
 monster-0.000
Token year
Token ID614
Feature activation-3.31e-01

Pos logit contributions
 villains+0.033
 hall+0.028
 robbers+0.025
 monsters+0.025
 dragons+0.024
Neg logit contributions
oured-0.049
cream-0.048
 dip-0.044
 sprink-0.044
 bait-0.043
Token old
Token ID1468
Feature activation-4.02e-01

Pos logit contributions
oured+0.016
cream+0.015
 dip+0.015
asted+0.015
 sprink+0.014
Neg logit contributions
 villains-0.011
 hall-0.009
 monsters-0.009
 dragons-0.009
 maze-0.009
Token boy
Token ID2933
Feature activation-5.39e-03

Pos logit contributions
oured+0.018
cream+0.018
shaped+0.017
inated+0.017
 dip+0.016
Neg logit contributions
 villains-0.015
 hall-0.013
 robbers-0.012
 monsters-0.012
 monster-0.012
Token as
Token ID355
Feature activation-1.53e+00

Pos logit contributions
oured+0.000
cream+0.000
 dip+0.000
 sprink+0.000
inated+0.000
Neg logit contributions
 villains-0.000
 hall-0.000
 dragons-0.000
 robbers-0.000
 monsters-0.000
Token Super
Token ID3115
Feature activation-3.52e+00

Pos logit contributions
oured+0.073
cream+0.066
 sprink+0.062
int+0.062
 dip+0.061
Neg logit contributions
 villains-0.061
 dragons-0.049
 monsters-0.048
 robbers-0.046
 hall-0.046
Token Robbie
Token ID34855
Feature activation-2.71e-01

Pos logit contributions
cream+0.122
oured+0.122
 sprink+0.114
 dip+0.109
shaped+0.107
Neg logit contributions
 villains-0.168
 robbers-0.144
 villain-0.141
 hall-0.138
 monsters-0.135
Token.
Token ID13
Feature activation-3.40e-01

Pos logit contributions
cream+0.012
oured+0.012
 sprink+0.011
 dip+0.011
 vanilla+0.011
Neg logit contributions
 villains-0.010
 hall-0.009
 robbers-0.008
 dragons-0.008
 lawyers-0.008
Token<|endoftext|>
Token ID50256
Feature activation+3.87e-01

Pos logit contributions
cream+0.014
oured+0.014
 sprink+0.013
 dip+0.013
 bait+0.013
Neg logit contributions
 villains-0.014
 hall-0.012
 dragons-0.011
 monsters-0.011
 robbers-0.011
TokenOnce
Token ID7454
Feature activation-3.66e-02

Pos logit contributions
 villains+0.019
 robbers+0.016
 monsters+0.015
py+0.015
 villain+0.015
Neg logit contributions
oured-0.013
cream-0.013
 sprink-0.012
 dip-0.011
 bait-0.011
Token upon
Token ID2402
Feature activation-6.56e-01

Pos logit contributions
oured+0.001
cream+0.001
 dip+0.001
 sprink+0.001
 bait+0.001
Neg logit contributions
 villains-0.002
 hall-0.001
 robbers-0.001
py-0.001
 dragons-0.001
Token girl
Token ID2576
Feature activation+3.09e-01

Pos logit contributions
cream+0.032
oured+0.032
 sprink+0.031
 dip+0.031
 bait+0.030
Neg logit contributions
 villains-0.020
 hall-0.015
py-0.014
 dragons-0.014
 robbers-0.014
Token named
Token ID3706
Feature activation-2.40e+00

Pos logit contributions
 villains+0.015
 hall+0.014
 dragons+0.013
 robbers+0.013
py+0.013
Neg logit contributions
oured-0.010
cream-0.010
 sprink-0.009
 dip-0.009
 bait-0.009
Token Lily
Token ID20037
Feature activation+4.71e-01

Pos logit contributions
oured+0.095
 dip+0.090
 sprink+0.088
cream+0.086
 vanilla+0.084
Neg logit contributions
 villains-0.109
 villain-0.087
robat-0.087
py-0.083
 robbers-0.082
Token who
Token ID508
Feature activation-3.56e-01

Pos logit contributions
 villains+0.020
 hall+0.017
 robbers+0.016
 dragons+0.016
 villain+0.016
Neg logit contributions
cream-0.019
oured-0.019
 sprink-0.017
 dip-0.017
 placed-0.017
Token loved
Token ID6151
Feature activation-4.79e-01

Pos logit contributions
cream+0.009
 sprink+0.007
oured+0.007
 dip+0.007
 mun+0.007
Neg logit contributions
 villains-0.021
 robbers-0.018
 hall-0.018
py-0.018
 dragons-0.018
Token pic
Token ID8301
Feature activation-3.48e+00

Pos logit contributions
cream+0.017
oured+0.016
 sprink+0.015
 dip+0.014
 vanilla+0.014
Neg logit contributions
 villains-0.023
 hall-0.020
py-0.020
 villain-0.019
 maze-0.019
Tokenn
Token ID77
Feature activation+8.90e-01

Pos logit contributions
cream+0.127
oured+0.126
 bait+0.112
 sprink+0.110
 dip+0.108
Neg logit contributions
 villains-0.156
 hall-0.141
py-0.132
 dragons-0.126
 seek-0.125
Tokenics
Token ID873
Feature activation-5.36e-01

Pos logit contributions
 villains+0.039
 hall+0.034
 robbers+0.032
 dragons+0.032
 monsters+0.031
Neg logit contributions
oured-0.034
cream-0.034
 sprink-0.031
 dip-0.031
 bait-0.030
Token.
Token ID13
Feature activation-3.27e-01

Pos logit contributions
oured+0.024
cream+0.023
 sprink+0.021
 dip+0.021
 bait+0.021
Neg logit contributions
 villains-0.021
 hall-0.018
 dragons-0.017
 monsters-0.017
 robbers-0.016
Token She
Token ID1375
Feature activation+4.23e-01

Pos logit contributions
oured+0.014
 dip+0.013
 sprink+0.013
cream+0.013
 vanilla+0.013
Neg logit contributions
 villains-0.014
 robbers-0.011
 monsters-0.011
 villain-0.011
py-0.011
Token would
Token ID561
Feature activation-7.58e-01

Pos logit contributions
 villains+0.015
 hall+0.014
 dragons+0.012
 robbers+0.012
 monsters+0.012
Neg logit contributions
oured-0.020
cream-0.019
 sprink-0.017
 bait-0.017
shaped-0.017
Token so
Token ID523
Feature activation+3.27e-01

Pos logit contributions
 villains+0.008
 hall+0.007
 monsters+0.006
 dragons+0.006
 lawyers+0.006
Neg logit contributions
cream-0.008
oured-0.008
 dip-0.007
 sprink-0.007
 bait-0.007
Token excited
Token ID6568
Feature activation-4.15e-01

Pos logit contributions
 villains+0.014
 hall+0.012
 dragons+0.011
 robbers+0.011
 monsters+0.011
Neg logit contributions
oured-0.013
cream-0.013
 sprink-0.012
 dip-0.012
 bait-0.012
Token!
Token ID0
Feature activation-2.34e-01

Pos logit contributions
oured+0.020
cream+0.019
 bait+0.018
 sprink+0.017
shaped+0.017
Neg logit contributions
 villains-0.014
 hall-0.013
 monsters-0.012
 dragons-0.012
 maze-0.011
Token She
Token ID1375
Feature activation+3.05e-01

Pos logit contributions
oured+0.010
cream+0.010
 dip+0.009
 sprink+0.009
inated+0.009
Neg logit contributions
 villains-0.010
 monsters-0.008
py-0.008
 dragons-0.007
 villain-0.007
Token loved
Token ID6151
Feature activation+3.38e-02

Pos logit contributions
 villains+0.012
 hall+0.011
 robbers+0.010
 dragons+0.010
 monsters+0.010
Neg logit contributions
cream-0.013
oured-0.013
shaped-0.011
 sprink-0.011
 bait-0.011
Token pic
Token ID8301
Feature activation-3.42e+00

Pos logit contributions
 villains+0.002
 hall+0.001
py+0.001
 villain+0.001
 maze+0.001
Neg logit contributions
cream-0.001
oured-0.001
 sprink-0.001
 dip-0.001
 bait-0.001
Tokenn
Token ID77
Feature activation+1.00e+00

Pos logit contributions
cream+0.127
oured+0.127
 bait+0.113
 sprink+0.111
 dip+0.109
Neg logit contributions
 villains-0.148
 hall-0.134
py-0.125
 seek-0.120
oon-0.120
Tokenics
Token ID873
Feature activation-5.50e-01

Pos logit contributions
 villains+0.045
 hall+0.039
 robbers+0.037
 dragons+0.037
 monsters+0.036
Neg logit contributions
oured-0.038
cream-0.037
 sprink-0.034
 dip-0.034
 bait-0.033
Token and
Token ID290
Feature activation-1.06e-01

Pos logit contributions
oured+0.022
cream+0.022
 sprink+0.020
 dip+0.020
 bait+0.019
Neg logit contributions
 villains-0.023
 hall-0.021
 dragons-0.019
 robbers-0.019
 monsters-0.019
Token spending
Token ID4581
Feature activation-5.11e-01

Pos logit contributions
oured+0.004
cream+0.004
shaped+0.004
 sprink+0.004
 bait+0.004
Neg logit contributions
 villains-0.005
 hall-0.004
 robbers-0.004
 dragons-0.004
 monsters-0.004
Token time
Token ID640
Feature activation-5.58e-01

Pos logit contributions
oured+0.021
cream+0.021
 sprink+0.019
 dip+0.019
 bait+0.018
Neg logit contributions
 villains-0.021
 hall-0.019
 dragons-0.017
 robbers-0.017
 monsters-0.017
Token're
Token ID821
Feature activation-5.43e-01

Pos logit contributions
cream+0.012
oured+0.012
 vanilla+0.011
shaped+0.011
 sprink+0.011
Neg logit contributions
 villains-0.010
 hall-0.008
 villain-0.008
 robbers-0.008
 monsters-0.008
Token a
Token ID257
Feature activation-3.79e-01

Pos logit contributions
cream+0.022
oured+0.022
 dip+0.020
 sprink+0.019
 mun+0.019
Neg logit contributions
 villains-0.023
 hall-0.020
 robbers-0.019
 dragons-0.019
 monsters-0.019
Token great
Token ID1049
Feature activation-5.92e-01

Pos logit contributions
oured+0.016
cream+0.016
 sprink+0.015
 dip+0.015
 bait+0.014
Neg logit contributions
 villains-0.015
 hall-0.013
 dragons-0.012
 robbers-0.012
py-0.012
Token pilot
Token ID8022
Feature activation+1.63e-02

Pos logit contributions
cream+0.032
oured+0.032
 sprink+0.029
 dip+0.029
 bait+0.028
Neg logit contributions
 villains-0.018
 hall-0.015
 dragons-0.013
 villain-0.013
 monsters-0.013
Token and
Token ID290
Feature activation-2.65e-01

Pos logit contributions
 villains+0.001
 hall+0.001
 robbers+0.001
 dragons+0.001
 maze+0.001
Neg logit contributions
cream-0.001
oured-0.001
 sprink-0.001
 dip-0.001
 bait-0.001
Token Super
Token ID3115
Feature activation-3.42e+00

Pos logit contributions
oured+0.012
cream+0.012
 sprink+0.011
 dip+0.011
shaped+0.011
Neg logit contributions
 villains-0.010
 hall-0.008
 dragons-0.008
 monsters-0.008
 robbers-0.008
Token Fly
Token ID13575
Feature activation-6.05e-01

Pos logit contributions
cream+0.121
oured+0.121
 sprink+0.109
 dip+0.106
 bait+0.103
Neg logit contributions
 villains-0.162
 hall-0.142
 robbers-0.136
py-0.134
 monsters-0.134
Token looks
Token ID3073
Feature activation-6.09e-01

Pos logit contributions
cream+0.027
oured+0.027
 sprink+0.026
 dip+0.025
 vanilla+0.024
Neg logit contributions
 villains-0.023
 hall-0.020
py-0.018
 dragons-0.018
 monsters-0.018
Token so
Token ID523
Feature activation-2.27e-01

Pos logit contributions
cream+0.028
oured+0.027
 sprink+0.025
 dip+0.024
asted+0.024
Neg logit contributions
 villains-0.022
 hall-0.019
 dragons-0.019
 monsters-0.019
 alone-0.018
Token happy
Token ID3772
Feature activation-4.91e-01

Pos logit contributions
oured+0.011
cream+0.011
 sprink+0.010
 dip+0.010
asted+0.010
Neg logit contributions
 villains-0.008
 hall-0.007
 dragons-0.006
 monsters-0.006
 maze-0.006
Token with
Token ID351
Feature activation-5.59e-01

Pos logit contributions
oured+0.021
cream+0.020
 sprink+0.019
 dip+0.019
 bait+0.018
Neg logit contributions
 villains-0.020
 hall-0.017
 dragons-0.016
 robbers-0.016
py-0.016
Token their
Token ID511
Feature activation-1.03e+00

Pos logit contributions
oured+0.001
cream+0.001
 sprink+0.001
 dip+0.001
 bait+0.001
Neg logit contributions
 villains-0.002
 hall-0.002
 dragons-0.002
 robbers-0.002
 monsters-0.001
Token salad
Token ID20698
Feature activation-9.87e-01

Pos logit contributions
cream+0.036
oured+0.036
 sprink+0.034
 dip+0.033
 bait+0.033
Neg logit contributions
 villains-0.048
 hall-0.042
 villain-0.039
py-0.039
 robbers-0.039
Token and
Token ID290
Feature activation-7.23e-01

Pos logit contributions
cream+0.030
oured+0.028
 sprink+0.027
 bait+0.026
 dip+0.026
Neg logit contributions
 villains-0.052
 hall-0.045
 villain-0.045
 robbers-0.043
py-0.043
Token have
Token ID423
Feature activation-1.27e-01

Pos logit contributions
cream+0.025
oured+0.024
 dip+0.022
 sprink+0.022
 bait+0.022
Neg logit contributions
 villains-0.034
 hall-0.031
py-0.029
 villain-0.029
 robbers-0.028
Token happy
Token ID3772
Feature activation-8.34e-01

Pos logit contributions
oured+0.005
cream+0.005
 sprink+0.004
 dip+0.004
 bait+0.004
Neg logit contributions
 villains-0.006
 hall-0.005
 dragons-0.005
 robbers-0.005
 monsters-0.005
Token pic
Token ID8301
Feature activation-3.41e+00

Pos logit contributions
cream+0.039
oured+0.038
 sprink+0.038
 dip+0.036
 bait+0.036
Neg logit contributions
 villains-0.028
 hall-0.025
py-0.022
 robbers-0.022
 villain-0.022
Tokenn
Token ID77
Feature activation+5.44e-01

Pos logit contributions
oured+0.128
cream+0.128
 sprink+0.116
 vanilla+0.114
 dip+0.114
Neg logit contributions
 villains-0.149
py-0.125
 hall-0.124
 seek-0.118
robat-0.117
Tokenics
Token ID873
Feature activation-6.52e-01

Pos logit contributions
 villains+0.025
 hall+0.022
 robbers+0.021
 maze+0.021
 villain+0.021
Neg logit contributions
cream-0.020
oured-0.020
 sprink-0.018
 dip-0.017
 bait-0.017
Token together
Token ID1978
Feature activation+3.51e-01

Pos logit contributions
cream+0.023
oured+0.023
 sprink+0.021
 dip+0.020
 bait+0.020
Neg logit contributions
 villains-0.030
 hall-0.027
 robbers-0.025
py-0.025
 dragons-0.025
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
 villains+0.014
 hall+0.012
 dragons+0.011
 robbers+0.011
 monsters+0.011
Neg logit contributions
oured-0.015
cream-0.015
 sprink-0.014
 dip-0.014
 bait-0.013
Token<|endoftext|>
Token ID50256
Feature activation+5.32e-01

Pos logit contributions
cream+0.004
oured+0.004
 sprink+0.004
 mun+0.004
 dip+0.004
Neg logit contributions
 villains-0.006
 hall-0.005
 monsters-0.005
py-0.005
 robbers-0.005
Token the
Token ID262
Feature activation+2.65e-01

Pos logit contributions
 sprink+0.015
 dip+0.014
oured+0.014
cream+0.013
 vanilla+0.013
Neg logit contributions
 villains-0.032
py-0.027
 villain-0.027
 robbers-0.026
 hall-0.026
Token warm
Token ID5814
Feature activation+3.54e-01

Pos logit contributions
 villains+0.011
py+0.009
 robbers+0.009
 villain+0.009
 skeletons+0.008
Neg logit contributions
 sprink-0.011
 dip-0.011
cream-0.010
oured-0.010
 vanilla-0.010
Token sunshine
Token ID34488
Feature activation+2.64e-01

Pos logit contributions
 villains+0.016
 villain+0.013
 robbers+0.013
 lawyers+0.013
py+0.013
Neg logit contributions
cream-0.013
oured-0.013
 sprink-0.013
 dip-0.013
 vanilla-0.011
Token and
Token ID290
Feature activation-3.74e-01

Pos logit contributions
 villains+0.012
 hall+0.010
 robbers+0.010
py+0.010
 villain+0.010
Neg logit contributions
oured-0.009
cream-0.009
 sprink-0.009
 dip-0.009
 bait-0.008
Token having
Token ID1719
Feature activation+4.87e-01

Pos logit contributions
oured+0.016
cream+0.016
 sprink+0.014
 dip+0.014
 bait+0.014
Neg logit contributions
 villains-0.015
 hall-0.013
 dragons-0.012
 robbers-0.012
py-0.012
Token pic
Token ID8301
Feature activation-3.26e+00

Pos logit contributions
 villains+0.020
 hall+0.018
 dragons+0.016
 robbers+0.016
 monsters+0.016
Neg logit contributions
oured-0.020
cream-0.020
 sprink-0.018
 dip-0.018
 bait-0.017
Tokenn
Token ID77
Feature activation+1.25e+00

Pos logit contributions
cream+0.118
oured+0.115
 bait+0.101
 sprink+0.101
 dip+0.098
Neg logit contributions
 villains-0.150
 hall-0.133
py-0.126
 dragons-0.121
 robbers-0.121
Tokenics
Token ID873
Feature activation-5.45e-01

Pos logit contributions
 villains+0.058
 hall+0.050
 robbers+0.049
 maze+0.047
 skeletons+0.047
Neg logit contributions
oured-0.046
cream-0.045
 sprink-0.041
 dip-0.040
 placed-0.039
Token with
Token ID351
Feature activation+8.88e-01

Pos logit contributions
oured+0.021
cream+0.021
 sprink+0.019
 dip+0.019
 bait+0.018
Neg logit contributions
 villains-0.024
 hall-0.021
 robbers-0.020
py-0.020
 dragons-0.019
Token her
Token ID607
Feature activation-8.35e-02

Pos logit contributions
 villains+0.026
 hall+0.022
 dragons+0.022
 monsters+0.021
py+0.020
Neg logit contributions
oured-0.047
cream-0.047
 bait-0.042
 dip-0.042
int-0.042
Token family
Token ID1641
Feature activation-1.18e-01

Pos logit contributions
oured+0.005
cream+0.005
 sprink+0.004
 dip+0.004
 bait+0.004
Neg logit contributions
 villains-0.002
 hall-0.002
 robbers-0.002
py-0.002
 dragons-0.002
Token,
Token ID11
Feature activation+5.91e-02

Pos logit contributions
oured+0.005
cream+0.005
 sprink+0.005
 dip+0.005
 bait+0.005
Neg logit contributions
 villains-0.006
 hall-0.005
 robbers-0.005
 dragons-0.005
py-0.005
Token next
Token ID1306
Feature activation-5.65e-02

Pos logit contributions
 hall+0.002
 villains+0.002
 alone+0.002
 robbers+0.002
 dragons+0.002
Neg logit contributions
cream-0.003
oured-0.003
 bait-0.003
shaped-0.003
engu-0.002
Token to
Token ID284
Feature activation+4.11e-01

Pos logit contributions
oured+0.002
cream+0.002
 sprink+0.002
 dip+0.002
 bait+0.002
Neg logit contributions
 villains-0.002
 hall-0.002
 robbers-0.002
 dragons-0.002
py-0.002
Token her
Token ID607
Feature activation+1.13e-01

Pos logit contributions
 villains+0.015
 hall+0.013
 dragons+0.012
 robbers+0.012
 monsters+0.012
Neg logit contributions
oured-0.019
cream-0.019
 sprink-0.017
 dip-0.017
 bait-0.017
Token magn
Token ID7842
Feature activation-2.14e-01

Pos logit contributions
 villains+0.004
 hall+0.003
 dragons+0.003
 monsters+0.003
 robbers+0.003
Neg logit contributions
cream-0.005
oured-0.005
 sprink-0.005
 dip-0.005
 bait-0.005
Tokenifying
Token ID4035
Feature activation-3.25e+00

Pos logit contributions
cream+0.005
 bait+0.003
 dip+0.003
oured+0.003
 sprink+0.003
Neg logit contributions
 villains-0.013
 hall-0.012
 dragons-0.012
py-0.012
oon-0.012
Token glass
Token ID5405
Feature activation-9.09e-01

Pos logit contributions
oured+0.111
cream+0.109
 dip+0.096
 bait+0.094
ck+0.093
Neg logit contributions
 villains-0.155
 hall-0.145
 dragons-0.134
 maze-0.132
 monsters-0.131
Token.
Token ID13
Feature activation+2.90e-01

Pos logit contributions
oured+0.041
cream+0.040
 sprink+0.038
 dip+0.037
 bait+0.036
Neg logit contributions
 villains-0.035
 hall-0.028
 robbers-0.027
py-0.026
 dragons-0.026
Token She
Token ID1375
Feature activation+8.17e-02

Pos logit contributions
 villains+0.009
 hall+0.007
py+0.006
 robbers+0.006
 monsters+0.006
Neg logit contributions
oured-0.015
cream-0.015
 sprink-0.014
 dip-0.014
 bait-0.014
Token looked
Token ID3114
Feature activation-9.47e-01

Pos logit contributions
 villains+0.003
 hall+0.003
 dragons+0.003
 monsters+0.003
 maze+0.003
Neg logit contributions
cream-0.003
oured-0.003
 bait-0.003
shaped-0.003
inated-0.003
Token at
Token ID379
Feature activation+9.23e-02

Pos logit contributions
cream+0.049
oured+0.045
 bait+0.043
asted+0.042
 sprink+0.042
Neg logit contributions
 hall-0.027
 villains-0.026
 dragons-0.026
 alone-0.025
 monsters-0.024
Token in
Token ID287
Feature activation-6.82e-01

Pos logit contributions
 villains+0.014
 principal+0.012
py+0.012
 villain+0.012
umpy+0.012
Neg logit contributions
 as-0.005
 dip-0.005
cream-0.005
 placed-0.004
 sprink-0.004
Token the
Token ID262
Feature activation-1.25e-02

Pos logit contributions
cream+0.021
oured+0.021
 sprink+0.019
 dip+0.019
 bait+0.018
Neg logit contributions
 villains-0.035
 hall-0.031
py-0.030
 robbers-0.029
 villain-0.029
Token garden
Token ID11376
Feature activation-6.45e-01

Pos logit contributions
 dip+0.000
 sprink+0.000
oured+0.000
cream+0.000
inated+0.000
Neg logit contributions
 villains-0.001
 villain-0.000
py-0.000
 robbers-0.000
robat-0.000
Token and
Token ID290
Feature activation-8.34e-01

Pos logit contributions
oured+0.027
cream+0.027
 sprink+0.025
 dip+0.025
 bait+0.024
Neg logit contributions
 villains-0.026
 hall-0.023
 dragons-0.021
 robbers-0.021
py-0.021
Token have
Token ID423
Feature activation+1.59e-01

Pos logit contributions
oured+0.033
cream+0.032
 sprink+0.028
 bait+0.027
 dip+0.027
Neg logit contributions
 villains-0.037
 hall-0.034
 dragons-0.031
 robbers-0.031
 monsters-0.030
Token pic
Token ID8301
Feature activation-3.24e+00

Pos logit contributions
 villains+0.007
 hall+0.006
 dragons+0.006
 robbers+0.006
 monsters+0.006
Neg logit contributions
oured-0.006
cream-0.006
 sprink-0.006
 dip-0.006
 bait-0.005
Tokenn
Token ID77
Feature activation+1.10e+00

Pos logit contributions
cream+0.118
oured+0.116
 sprink+0.104
 bait+0.101
 vanilla+0.099
Neg logit contributions
 villains-0.148
 hall-0.131
py-0.124
 robbers-0.120
 dragons-0.120
Tokenics
Token ID873
Feature activation-7.71e-01

Pos logit contributions
 villains+0.050
 hall+0.044
 robbers+0.042
 dragons+0.041
 monsters+0.041
Neg logit contributions
oured-0.041
cream-0.041
 sprink-0.036
 dip-0.036
 bait-0.035
Token together
Token ID1978
Feature activation+7.15e-01

Pos logit contributions
oured+0.032
cream+0.032
 sprink+0.029
 dip+0.029
 bait+0.028
Neg logit contributions
 villains-0.031
 hall-0.027
 dragons-0.025
 robbers-0.025
 monsters-0.025
Token.
Token ID13
Feature activation-5.06e-01

Pos logit contributions
 villains+0.026
 hall+0.023
 dragons+0.021
 robbers+0.021
 monsters+0.020
Neg logit contributions
oured-0.033
cream-0.032
 sprink-0.030
 dip-0.029
 bait-0.029
Token
Token ID220
Feature activation+7.98e-01

Pos logit contributions
oured+0.027
cream+0.026
ck+0.024
 sprink+0.024
 bait+0.024
Neg logit contributions
 villains-0.014
 hall-0.014
 dragons-0.011
 basement-0.011
 robbers-0.011
Token near
Token ID1474
Feature activation-4.93e-03

Pos logit contributions
oured+0.003
cream+0.003
 sprink+0.003
 dip+0.003
 vanilla+0.003
Neg logit contributions
 villains-0.003
 hall-0.002
 dragons-0.002
 maze-0.002
 robbers-0.002
Token the
Token ID262
Feature activation+2.74e-02

Pos logit contributions
oured+0.000
cream+0.000
int+0.000
 mun+0.000
shaped+0.000
Neg logit contributions
 villains-0.000
 hall-0.000
 dragons-0.000
 monsters-0.000
 lawyers-0.000
Token cliff
Token ID19516
Feature activation+5.85e-01

Pos logit contributions
 villains+0.001
 hall+0.001
 dragons+0.001
 monsters+0.001
 robbers+0.001
Neg logit contributions
oured-0.001
cream-0.001
 sprink-0.001
 dip-0.001
 bait-0.001
Token and
Token ID290
Feature activation-5.04e-01

Pos logit contributions
 villains+0.022
 hall+0.020
 dragons+0.018
 monsters+0.018
 maze+0.018
Neg logit contributions
oured-0.026
cream-0.026
 sprink-0.024
 dip-0.023
 bait-0.023
Token have
Token ID423
Feature activation+2.63e-01

Pos logit contributions
oured+0.021
cream+0.021
 sprink+0.019
int+0.018
 bait+0.018
Neg logit contributions
 villains-0.021
 hall-0.019
 dragons-0.017
 robbers-0.017
 maze-0.017
Token pic
Token ID8301
Feature activation-3.24e+00

Pos logit contributions
 villains+0.011
 hall+0.010
 dragons+0.009
 robbers+0.009
 monsters+0.009
Neg logit contributions
oured-0.011
cream-0.010
 sprink-0.009
 dip-0.009
 bait-0.009
Tokenn
Token ID77
Feature activation+1.07e+00

Pos logit contributions
cream+0.125
oured+0.119
 sprink+0.108
 bait+0.106
 vanilla+0.104
Neg logit contributions
 villains-0.142
 hall-0.125
py-0.120
 robbers-0.114
 dragons-0.114
Tokenics
Token ID873
Feature activation-5.63e-01

Pos logit contributions
 villains+0.047
 hall+0.042
 robbers+0.039
 dragons+0.038
 monsters+0.038
Neg logit contributions
oured-0.041
cream-0.041
 sprink-0.037
 dip-0.036
 bait-0.035
Token.
Token ID13
Feature activation-3.08e-01

Pos logit contributions
oured+0.024
cream+0.023
 sprink+0.021
 dip+0.021
 bait+0.020
Neg logit contributions
 villains-0.023
 hall-0.020
 dragons-0.019
 robbers-0.019
 monsters-0.018
Token One
Token ID1881
Feature activation+1.79e-01

Pos logit contributions
oured+0.017
cream+0.016
ck+0.016
 bait+0.015
 sprink+0.015
Neg logit contributions
 hall-0.009
 villains-0.009
 chamber-0.007
 dragons-0.007
 basement-0.006
Token day
Token ID1110
Feature activation-9.99e-02

Pos logit contributions
 villains+0.003
 hall+0.003
 maze+0.002
 dragons+0.002
 chamber+0.002
Neg logit contributions
oured-0.012
cream-0.011
 sprink-0.011
 placed-0.010
shaped-0.010
Token and
Token ID290
Feature activation+6.79e-01

Pos logit contributions
 villains+0.005
 dragons+0.004
 hall+0.004
 monsters+0.004
 monster+0.004
Neg logit contributions
cream-0.009
oured-0.009
 sprink-0.009
 bait-0.008
 dip-0.008
Token nodded
Token ID14464
Feature activation+3.07e-02

Pos logit contributions
 villains+0.024
 robbers+0.021
 hall+0.021
 seek+0.021
 dragons+0.019
Neg logit contributions
oured-0.029
shaped-0.028
cream-0.028
ifying-0.027
immer-0.027
Token.
Token ID13
Feature activation-5.40e-02

Pos logit contributions
 villains+0.001
 hall+0.001
 dragons+0.001
 monsters+0.001
 monster+0.001
Neg logit contributions
oured-0.002
cream-0.002
 sprink-0.002
shaped-0.002
 bait-0.002
Token He
Token ID679
Feature activation-3.74e-01

Pos logit contributions
oured+0.003
cream+0.003
ck+0.003
 sprink+0.003
 dip+0.003
Neg logit contributions
 villains-0.001
 hall-0.001
 robbers-0.001
 maze-0.001
 dragons-0.001
Token loved
Token ID6151
Feature activation-2.01e-02

Pos logit contributions
oured+0.016
cream+0.016
shaped+0.015
ifying+0.015
 sprink+0.014
Neg logit contributions
 villains-0.013
 hall-0.012
 dragons-0.012
 seek-0.012
 robbers-0.011
Token pic
Token ID8301
Feature activation-3.22e+00

Pos logit contributions
cream+0.001
oured+0.001
 sprink+0.001
 dip+0.001
 bait+0.001
Neg logit contributions
 villains-0.001
 hall-0.001
 robbers-0.001
py-0.001
 dragons-0.001
Tokenn
Token ID77
Feature activation+9.57e-01

Pos logit contributions
cream+0.129
oured+0.127
 sprink+0.115
 bait+0.114
 vanilla+0.112
Neg logit contributions
 villains-0.129
 hall-0.113
py-0.112
oon-0.106
 seek-0.106
Tokenics
Token ID873
Feature activation-5.60e-01

Pos logit contributions
 villains+0.040
 hall+0.036
 dragons+0.033
 robbers+0.033
py+0.033
Neg logit contributions
oured-0.039
cream-0.038
 sprink-0.035
 dip-0.035
 bait-0.034
Token and
Token ID290
Feature activation-1.00e-01

Pos logit contributions
oured+0.025
cream+0.024
 sprink+0.022
 dip+0.022
shaped+0.021
Neg logit contributions
 villains-0.022
 hall-0.019
 dragons-0.017
 monsters-0.017
 robbers-0.017
Token couldn
Token ID3521
Feature activation-2.79e-02

Pos logit contributions
oured+0.004
cream+0.004
shaped+0.004
int+0.004
 sprink+0.004
Neg logit contributions
 villains-0.004
 hall-0.004
 robbers-0.004
 dragons-0.003
 seek-0.003
Token't
Token ID470
Feature activation-2.25e-01

Pos logit contributions
oured+0.001
cream+0.001
 sprink+0.001
 dip+0.001
 bait+0.001
Neg logit contributions
 villains-0.001
 hall-0.001
 dragons-0.001
 robbers-0.001
py-0.001
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
 villains+0.002
 hall+0.002
 dragons+0.001
 monsters+0.001
 robbers+0.001
Neg logit contributions
cream-0.002
oured-0.002
 sprink-0.002
 dip-0.002
 bait-0.002
Token<|endoftext|>
Token ID50256
Feature activation+7.17e-01

Pos logit contributions
 villains+0.007
 hall+0.006
 robbers+0.005
 dragons+0.005
py+0.005
Neg logit contributions
oured-0.008
cream-0.008
 sprink-0.007
 dip-0.007
 bait-0.007
TokenM
Token ID44
Feature activation-1.56e+00

Pos logit contributions
 villains+0.035
 robbers+0.029
 monsters+0.028
 villain+0.028
 hall+0.028
Neg logit contributions
cream-0.025
oured-0.025
 sprink-0.023
 dip-0.021
 bait-0.021
Tokenama
Token ID1689
Feature activation-1.21e+00

Pos logit contributions
cream+0.072
oured+0.072
 bait+0.068
 dip+0.067
inated+0.065
Neg logit contributions
 villains-0.048
 hall-0.047
py-0.043
 story-0.042
baby-0.042
Token and
Token ID290
Feature activation+5.00e-02

Pos logit contributions
cream+0.055
oured+0.055
 bait+0.053
 dip+0.052
shaped+0.051
Neg logit contributions
 hall-0.039
 villains-0.038
 dragons-0.034
py-0.034
 robbers-0.033
Token pap
Token ID20461
Feature activation-3.18e+00

Pos logit contributions
 villains+0.002
 robbers+0.002
 hall+0.002
 lawyers+0.001
 dragons+0.001
Neg logit contributions
oured-0.003
cream-0.002
 sprink-0.002
immer-0.002
endant-0.002
Tokena
Token ID64
Feature activation-1.03e+00

Pos logit contributions
oured+0.129
cream+0.124
 dip+0.123
 sprink+0.122
 bait+0.120
Neg logit contributions
 villains-0.126
py-0.112
 hall-0.108
baby-0.105
 lawyers-0.102
Token were
Token ID547
Feature activation+2.11e-01

Pos logit contributions
cream+0.050
oured+0.047
 bait+0.045
inated+0.045
 dip+0.044
Neg logit contributions
 hall-0.032
 villains-0.032
 dragons-0.028
 monsters-0.026
 robbers-0.026
Token busy
Token ID8179
Feature activation+3.38e-01

Pos logit contributions
 hall+0.006
 villains+0.006
 lawyers+0.005
 alone+0.005
 sisters+0.005
Neg logit contributions
cream-0.011
oured-0.010
 dip-0.010
 bait-0.010
shaped-0.009
Token in
Token ID287
Feature activation-6.75e-01

Pos logit contributions
 villains+0.010
 hall+0.010
 dragons+0.009
 lawyers+0.009
 monsters+0.009
Neg logit contributions
cream-0.017
oured-0.017
shaped-0.015
 bait-0.015
 dip-0.015
Token the
Token ID262
Feature activation-2.28e-01

Pos logit contributions
cream+0.029
oured+0.029
int+0.025
 bait+0.024
 mun+0.024
Neg logit contributions
 villains-0.027
 hall-0.026
 robbers-0.023
 monsters-0.023
 dragons-0.023
Token the
Token ID262
Feature activation+5.42e-01

Pos logit contributions
 villains+0.018
 hall+0.016
 dragons+0.015
 robbers+0.015
 monsters+0.015
Neg logit contributions
oured-0.016
cream-0.016
 sprink-0.014
 dip-0.014
 bait-0.014
Token little
Token ID1310
Feature activation-4.20e-01

Pos logit contributions
 villains+0.020
 hall+0.018
 monster+0.017
 robbers+0.016
 dragons+0.016
Neg logit contributions
cream-0.027
oured-0.026
shaped-0.023
inated-0.023
 dip-0.022
Token frog
Token ID21264
Feature activation+2.64e-01

Pos logit contributions
cream+0.022
oured+0.022
 sprink+0.020
 dip+0.020
shaped+0.020
Neg logit contributions
 villains-0.013
 hall-0.011
 robbers-0.010
 dragons-0.010
 monsters-0.010
Token and
Token ID290
Feature activation+5.94e-01

Pos logit contributions
 villains+0.011
 hall+0.009
 dragons+0.009
py+0.009
 robbers+0.008
Neg logit contributions
oured-0.011
cream-0.011
 sprink-0.010
 dip-0.010
 bait-0.010
Token the
Token ID262
Feature activation+9.19e-02

Pos logit contributions
 villains+0.028
 hall+0.025
py+0.023
 robbers+0.023
 maze+0.023
Neg logit contributions
cream-0.020
oured-0.020
 dip-0.018
 sprink-0.018
 bait-0.018
Token Professor
Token ID8129
Feature activation-3.17e+00

Pos logit contributions
 villains+0.003
 hall+0.003
 monster+0.003
 robbers+0.003
 dragons+0.003
Neg logit contributions
cream-0.004
oured-0.004
 dip-0.004
shaped-0.004
 sprink-0.004
Token became
Token ID2627
Feature activation+1.91e-03

Pos logit contributions
oured+0.140
cream+0.139
 dip+0.129
 sprink+0.127
 bait+0.123
Neg logit contributions
 villains-0.120
 hall-0.103
 robbers-0.098
 dragons-0.097
py-0.096
Token good
Token ID922
Feature activation-9.86e-01

Pos logit contributions
 villains+0.000
 hall+0.000
 dragons+0.000
 robbers+0.000
 monsters+0.000
Neg logit contributions
oured-0.000
cream-0.000
 sprink-0.000
 dip-0.000
 bait-0.000
Token friends
Token ID2460
Feature activation+1.94e-01

Pos logit contributions
oured+0.058
cream+0.057
 sprink+0.054
 dip+0.054
 bait+0.053
Neg logit contributions
 villains-0.024
 hall-0.019
 dragons-0.016
 robbers-0.016
py-0.016
Token,
Token ID11
Feature activation+3.36e-01

Pos logit contributions
 villains+0.008
 hall+0.008
 robbers+0.007
py+0.007
 maze+0.007
Neg logit contributions
oured-0.007
cream-0.007
 sprink-0.007
 dip-0.007
 bait-0.006
Token and
Token ID290
Feature activation+4.69e-01

Pos logit contributions
 villains+0.013
 hall+0.011
 dragons+0.011
 robbers+0.010
 monsters+0.010
Neg logit contributions
oured-0.015
cream-0.015
 dip-0.013
 sprink-0.013
 bait-0.013
Token see
Token ID766
Feature activation-6.50e-02

Pos logit contributions
 villains+0.000
 hall+0.000
 dragons+0.000
 robbers+0.000
py+0.000
Neg logit contributions
oured-0.000
cream-0.000
 sprink-0.000
 dip-0.000
 bait-0.000
Token the
Token ID262
Feature activation-2.44e-01

Pos logit contributions
oured+0.003
cream+0.003
int+0.002
 bait+0.002
 dip+0.002
Neg logit contributions
 villains-0.002
 dragons-0.002
 hall-0.002
 monsters-0.002
 lawyers-0.002
Token bird
Token ID6512
Feature activation+2.65e-01

Pos logit contributions
oured+0.013
cream+0.013
 bait+0.011
 placed+0.011
 dip+0.011
Neg logit contributions
 villains-0.007
 hall-0.007
 dragons-0.006
 monsters-0.006
 monster-0.006
Token and
Token ID290
Feature activation-1.83e-01

Pos logit contributions
 villains+0.009
 hall+0.008
 dragons+0.008
py+0.008
 monsters+0.008
Neg logit contributions
oured-0.013
cream-0.012
 bait-0.011
 sprink-0.011
 vanilla-0.011
Token have
Token ID423
Feature activation+1.25e-01

Pos logit contributions
oured+0.007
cream+0.007
 sprink+0.006
 dip+0.006
 bait+0.006
Neg logit contributions
 villains-0.008
 hall-0.008
 dragons-0.007
 robbers-0.007
 monsters-0.007
Token pic
Token ID8301
Feature activation-3.12e+00

Pos logit contributions
 villains+0.006
 hall+0.005
 dragons+0.005
 monsters+0.005
 robbers+0.005
Neg logit contributions
oured-0.005
cream-0.005
 dip-0.004
inated-0.004
 sprink-0.004
Tokenn
Token ID77
Feature activation+8.55e-01

Pos logit contributions
cream+0.134
oured+0.122
 bait+0.115
 vanilla+0.112
 sprink+0.112
Neg logit contributions
 villains-0.118
py-0.102
 hall-0.099
 seek-0.095
 dragons-0.092
Tokenics
Token ID873
Feature activation-7.20e-01

Pos logit contributions
 villains+0.037
 hall+0.032
 robbers+0.030
 dragons+0.030
 monsters+0.030
Neg logit contributions
oured-0.034
cream-0.033
 sprink-0.030
 dip-0.030
 bait-0.029
Token.
Token ID13
Feature activation-4.40e-01

Pos logit contributions
oured+0.030
cream+0.030
 sprink+0.027
 dip+0.027
 bait+0.026
Neg logit contributions
 villains-0.029
 hall-0.026
 dragons-0.024
 robbers-0.023
 monsters-0.023
Token<|endoftext|>
Token ID50256
Feature activation+5.09e-01

Pos logit contributions
cream+0.018
oured+0.017
 sprink+0.016
 dip+0.016
 bait+0.015
Neg logit contributions
 villains-0.019
 hall-0.016
 dragons-0.015
 robbers-0.015
 monsters-0.015
TokenOnce
Token ID7454
Feature activation-6.91e-02

Pos logit contributions
 villains+0.025
 robbers+0.021
 hall+0.021
py+0.020
 monsters+0.020
Neg logit contributions
cream-0.017
oured-0.017
 sprink-0.016
 dip-0.015
 bait-0.015
Token the
Token ID262
Feature activation-2.09e-01

Pos logit contributions
oured+0.009
cream+0.009
 sprink+0.008
 dip+0.008
 bait+0.008
Neg logit contributions
 villains-0.009
 hall-0.008
 dragons-0.008
 robbers-0.008
py-0.008
Token watch
Token ID2342
Feature activation+9.79e-01

Pos logit contributions
cream+0.010
oured+0.010
shaped+0.009
 bait+0.009
 mun+0.009
Neg logit contributions
 villains-0.007
 hall-0.007
 maze-0.006
 dragons-0.006
 monsters-0.006
Token with
Token ID351
Feature activation+7.22e-01

Pos logit contributions
 villains+0.038
 hall+0.034
 dragons+0.031
 monsters+0.030
py+0.030
Neg logit contributions
cream-0.043
oured-0.043
 sprink-0.038
 dip-0.038
 bait-0.038
Token the
Token ID262
Feature activation-3.79e-01

Pos logit contributions
 villains+0.030
 hall+0.027
 monsters+0.027
 dragons+0.026
 monster+0.026
Neg logit contributions
cream-0.031
oured-0.030
inated-0.026
int-0.025
 sprink-0.025
Token magn
Token ID7842
Feature activation+4.10e-01

Pos logit contributions
cream+0.018
oured+0.017
shaped+0.015
inated+0.015
 mun+0.015
Neg logit contributions
 villains-0.015
 hall-0.013
 maze-0.012
 monsters-0.012
 dragons-0.012
Tokenifying
Token ID4035
Feature activation-3.10e+00

Pos logit contributions
 villains+0.026
 hall+0.024
 dragons+0.023
py+0.023
 monsters+0.023
Neg logit contributions
cream-0.008
oured-0.006
 bait-0.006
 sprink-0.006
 dip-0.006
Token glass
Token ID5405
Feature activation-1.79e-01

Pos logit contributions
oured+0.110
cream+0.108
 dip+0.094
ck+0.094
 bait+0.092
Neg logit contributions
 villains-0.142
 hall-0.136
 dragons-0.124
 chamber-0.123
 maze-0.123
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
oured+0.008
cream+0.007
 sprink+0.007
 dip+0.007
 bait+0.007
Neg logit contributions
 villains-0.007
 hall-0.006
 robbers-0.006
 dragons-0.006
py-0.006
Token He
Token ID679
Feature activation-2.80e-01

Pos logit contributions
 villains+0.004
 hall+0.003
py+0.003
 robbers+0.003
 dragons+0.003
Neg logit contributions
cream-0.006
oured-0.006
 sprink-0.005
 dip-0.005
 bait-0.005
Token saw
Token ID2497
Feature activation-1.89e-01

Pos logit contributions
cream+0.013
oured+0.013
 sprink+0.011
 dip+0.011
shaped+0.011
Neg logit contributions
 villains-0.010
 hall-0.009
 dragons-0.009
 monsters-0.008
 robbers-0.008
Token the
Token ID262
Feature activation-1.92e-01

Pos logit contributions
oured+0.007
cream+0.006
int+0.006
 dip+0.006
 mun+0.006
Neg logit contributions
 villains-0.009
 monsters-0.008
 dragons-0.008
 hall-0.008
 skeletons-0.008
Token throughout
Token ID3690
Feature activation-2.50e-01

Pos logit contributions
 villains+0.015
 hall+0.013
 dragons+0.012
 monsters+0.012
py+0.012
Neg logit contributions
cream-0.017
oured-0.017
 sprink-0.015
 dip-0.015
 bait-0.015
Token the
Token ID262
Feature activation-4.95e-01

Pos logit contributions
 placed+0.005
 sprink+0.005
 dip+0.005
oured+0.005
 bait+0.005
Neg logit contributions
 villains-0.014
 hall-0.013
 robbers-0.012
py-0.012
 villain-0.012
Token forest
Token ID8222
Feature activation+5.03e-01

Pos logit contributions
 dip+0.024
oured+0.024
cream+0.024
 sprink+0.023
 bait+0.022
Neg logit contributions
 villains-0.015
 robbers-0.011
 villain-0.011
py-0.011
 lawyers-0.011
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
 villains+0.021
 hall+0.018
 robbers+0.017
 dragons+0.016
py+0.016
Neg logit contributions
oured-0.021
cream-0.021
 sprink-0.019
 dip-0.019
 bait-0.018
Token<|endoftext|>
Token ID50256
Feature activation+3.34e-01

Pos logit contributions
oured+0.013
cream+0.013
 sprink+0.012
 dip+0.012
 bait+0.012
Neg logit contributions
 villains-0.012
 hall-0.010
 dragons-0.010
 robbers-0.009
py-0.009
TokenOld
Token ID19620
Feature activation-3.10e+00

Pos logit contributions
 villains+0.016
 hall+0.013
 robbers+0.013
 monsters+0.013
 villain+0.013
Neg logit contributions
oured-0.012
cream-0.012
 sprink-0.011
 bait-0.010
 dip-0.010
Token Sam
Token ID3409
Feature activation+1.25e-01

Pos logit contributions
oured+0.140
cream+0.136
 dip+0.128
 sprink+0.126
 bait+0.125
Neg logit contributions
 villains-0.112
 hall-0.102
 robbers-0.095
 dragons-0.091
 maze-0.090
Token lived
Token ID5615
Feature activation-9.76e-02

Pos logit contributions
 villains+0.005
 hall+0.004
 dragons+0.004
 robbers+0.004
 maze+0.004
Neg logit contributions
oured-0.005
cream-0.005
 bait-0.005
 dip-0.005
 sprink-0.005
Token all
Token ID477
Feature activation+5.02e-01

Pos logit contributions
cream+0.004
oured+0.004
 dip+0.003
 sprink+0.003
 bait+0.003
Neg logit contributions
 villains-0.004
 hall-0.004
 robbers-0.004
 dragons-0.004
 lawyers-0.004
Token alone
Token ID3436
Feature activation+4.24e-01

Pos logit contributions
 villains+0.016
 hall+0.014
 dragons+0.012
py+0.012
 robbers+0.012
Neg logit contributions
oured-0.026
cream-0.025
 dip-0.024
 sprink-0.023
 bait-0.023
Token in
Token ID287
Feature activation-8.97e-01

Pos logit contributions
 villains+0.016
 hall+0.014
 dragons+0.013
 monsters+0.013
 alone+0.013
Neg logit contributions
oured-0.019
cream-0.019
 dip-0.018
 sprink-0.017
shaped-0.017
Token She
Token ID1375
Feature activation+1.30e-01

Pos logit contributions
 villains+0.022
 hall+0.018
 monsters+0.017
py+0.017
 robbers+0.016
Neg logit contributions
oured-0.026
cream-0.025
 sprink-0.024
 dip-0.023
 vanilla-0.023
Token takes
Token ID2753
Feature activation+3.18e-01

Pos logit contributions
 villains+0.005
 hall+0.005
 dragons+0.004
 robbers+0.004
 monsters+0.004
Neg logit contributions
cream-0.005
oured-0.005
 dip-0.005
 bait-0.005
 sprink-0.005
Token out
Token ID503
Feature activation-9.29e-01

Pos logit contributions
 villains+0.013
 hall+0.012
 dragons+0.012
 monsters+0.012
 lawyers+0.011
Neg logit contributions
cream-0.012
oured-0.012
inated-0.011
 mun-0.011
 dip-0.011
Token a
Token ID257
Feature activation+1.53e-01

Pos logit contributions
cream+0.032
oured+0.032
 dip+0.027
 bait+0.027
 sprink+0.027
Neg logit contributions
 villains-0.045
 hall-0.042
 dragons-0.039
 monsters-0.039
 robbers-0.038
Token magn
Token ID7842
Feature activation+2.13e-01

Pos logit contributions
 villains+0.006
 hall+0.005
 dragons+0.005
 robbers+0.005
 maze+0.005
Neg logit contributions
cream-0.007
oured-0.007
 sprink-0.006
 dip-0.006
 bait-0.006
Tokenifying
Token ID4035
Feature activation-3.09e+00

Pos logit contributions
 villains+0.012
 dragons+0.011
 hall+0.011
py+0.011
oon+0.011
Neg logit contributions
cream-0.005
oured-0.004
 bait-0.004
 mun-0.004
asted-0.004
Token glass
Token ID5405
Feature activation-6.99e-01

Pos logit contributions
oured+0.120
cream+0.118
ck+0.108
 dip+0.103
int+0.100
Neg logit contributions
 hall-0.126
 villains-0.126
 chamber-0.116
 maze-0.115
 dragons-0.114
Token.
Token ID13
Feature activation+4.40e-01

Pos logit contributions
oured+0.028
cream+0.028
 sprink+0.027
 dip+0.026
 bait+0.025
Neg logit contributions
 villains-0.030
 hall-0.025
 robbers-0.024
py-0.023
 villain-0.023
Token She
Token ID1375
Feature activation+5.67e-02

Pos logit contributions
 villains+0.014
 hall+0.012
 dragons+0.011
 robbers+0.011
 lawyers+0.011
Neg logit contributions
oured-0.022
cream-0.022
 sprink-0.020
 dip-0.020
 bait-0.020
Token looks
Token ID3073
Feature activation-1.02e+00

Pos logit contributions
 villains+0.002
 hall+0.002
 dragons+0.002
 monsters+0.002
 maze+0.002
Neg logit contributions
cream-0.002
oured-0.002
 bait-0.002
 dip-0.002
 sprink-0.002
Token at
Token ID379
Feature activation+2.72e-01

Pos logit contributions
cream+0.052
oured+0.047
 bait+0.043
acket+0.043
 mun+0.043
Neg logit contributions
 villains-0.030
 hall-0.030
 dragons-0.029
 alone-0.029
 monsters-0.028
TokenThe
Token ID464
Feature activation-7.73e-01

Pos logit contributions
 villains+0.021
 hall+0.017
 robbers+0.017
 monsters+0.017
py+0.017
Neg logit contributions
oured-0.018
cream-0.018
 sprink-0.016
 dip-0.015
 bait-0.015
Token sun
Token ID4252
Feature activation-1.40e-02

Pos logit contributions
oured+0.039
cream+0.037
ck+0.034
 placed+0.033
 dip+0.033
Neg logit contributions
 villains-0.026
 hall-0.024
 monster-0.023
 robbers-0.023
 dragons-0.022
Token was
Token ID373
Feature activation-2.70e-01

Pos logit contributions
cream+0.001
oured+0.001
 bait+0.001
shaped+0.001
 dip+0.001
Neg logit contributions
 hall-0.000
 villains-0.000
py-0.000
 monster-0.000
oon-0.000
Token setting
Token ID4634
Feature activation-3.42e-01

Pos logit contributions
oured+0.012
cream+0.012
engu+0.011
ck+0.011
 dip+0.011
Neg logit contributions
 hall-0.009
 alone-0.008
 dragons-0.008
 world-0.008
py-0.008
Token and
Token ID290
Feature activation-2.30e-01

Pos logit contributions
oured+0.019
ck+0.018
cream+0.018
endant+0.018
 mun+0.017
Neg logit contributions
 hall-0.008
 dragons-0.007
 alone-0.007
 chamber-0.007
 world-0.007
Token Farmer
Token ID37755
Feature activation-3.04e+00

Pos logit contributions
oured+0.011
cream+0.010
int+0.009
ck+0.009
shaped+0.009
Neg logit contributions
 hall-0.008
 villains-0.008
 dragons-0.007
 monsters-0.007
 maze-0.007
Token Joe
Token ID5689
Feature activation-3.84e-01

Pos logit contributions
oured+0.111
 dip+0.107
cream+0.103
 sprink+0.102
shaped+0.099
Neg logit contributions
 villains-0.127
 hall-0.117
 robbers-0.117
 monsters-0.113
py-0.112
Token was
Token ID373
Feature activation-2.32e-01

Pos logit contributions
oured+0.018
cream+0.017
 bait+0.017
 dip+0.017
 sprink+0.016
Neg logit contributions
 villains-0.012
 hall-0.012
 dragons-0.010
py-0.010
 alone-0.010
Token getting
Token ID1972
Feature activation-1.93e-01

Pos logit contributions
cream+0.011
oured+0.011
 dip+0.010
ck+0.010
shaped+0.010
Neg logit contributions
 hall-0.007
 villains-0.007
 alone-0.007
 dragons-0.006
 lawyers-0.006
Token ready
Token ID3492
Feature activation-6.22e-01

Pos logit contributions
oured+0.010
inated+0.010
cream+0.009
engu+0.009
 mun+0.009
Neg logit contributions
 alone-0.005
 lawyers-0.004
 hall-0.004
 monsters-0.004
 villains-0.004
Token for
Token ID329
Feature activation+2.36e-01

Pos logit contributions
cream+0.034
oured+0.033
ck+0.031
shaped+0.031
inated+0.030
Neg logit contributions
 hall-0.014
 villains-0.014
 dragons-0.013
 monsters-0.012
 maze-0.012
Token Sue
Token ID26089
Feature activation-3.30e-01

Pos logit contributions
oured+0.097
cream+0.095
 sprink+0.090
 dip+0.090
 vanilla+0.086
Neg logit contributions
 villains-0.073
 hall-0.057
py-0.056
 villain-0.055
 robbers-0.055
Token.
Token ID13
Feature activation+4.90e-03

Pos logit contributions
cream+0.013
oured+0.013
 sprink+0.012
 dip+0.012
 placed+0.011
Neg logit contributions
 villains-0.014
 hall-0.012
 robbers-0.012
 villain-0.011
 monsters-0.011
Token Sue
Token ID26089
Feature activation-2.16e-01

Pos logit contributions
 villains+0.000
 villain+0.000
 scientist+0.000
 monsters+0.000
 robbers+0.000
Neg logit contributions
cream-0.000
 sprink-0.000
 vanilla-0.000
oured-0.000
 dip-0.000
Token loved
Token ID6151
Feature activation+1.38e-01

Pos logit contributions
cream+0.007
oured+0.007
 sprink+0.007
 dip+0.006
 placed+0.006
Neg logit contributions
 villains-0.011
 hall-0.009
 robbers-0.009
 villain-0.009
py-0.009
Token to
Token ID284
Feature activation+6.36e-02

Pos logit contributions
 villains+0.007
 hall+0.006
py+0.006
 villain+0.006
oon+0.006
Neg logit contributions
cream-0.004
 sprink-0.004
oured-0.004
 dip-0.004
 vanilla-0.004
Token help
Token ID1037
Feature activation+1.94e+00

Pos logit contributions
 villains+0.003
 hall+0.002
 dragons+0.002
 robbers+0.002
 maze+0.002
Neg logit contributions
oured-0.003
cream-0.003
 sprink-0.002
 bait-0.002
shaped-0.002
Token her
Token ID607
Feature activation+5.26e-01

Pos logit contributions
 villains+0.064
 hall+0.054
 dragons+0.048
 robbers+0.048
 monsters+0.047
Neg logit contributions
oured-0.097
cream-0.097
 sprink-0.089
 dip-0.089
 bait-0.087
Token mom
Token ID1995
Feature activation-1.02e-01

Pos logit contributions
 villains+0.023
 hall+0.019
py+0.018
 robbers+0.018
 villain+0.018
Neg logit contributions
cream-0.021
oured-0.020
 dip-0.020
 sprink-0.019
 bait-0.018
Token in
Token ID287
Feature activation+1.64e-01

Pos logit contributions
oured+0.004
cream+0.004
 sprink+0.004
 bait+0.004
 dip+0.004
Neg logit contributions
 villains-0.004
 hall-0.004
 dragons-0.003
py-0.003
 robbers-0.003
Token the
Token ID262
Feature activation-2.31e-02

Pos logit contributions
 villains+0.008
 hall+0.007
 dragons+0.007
 robbers+0.007
 monsters+0.007
Neg logit contributions
oured-0.006
cream-0.006
 sprink-0.005
 dip-0.005
 bait-0.005
Token kitchen
Token ID9592
Feature activation-9.86e-01

Pos logit contributions
cream+0.001
 sprink+0.001
 dip+0.001
 vanilla+0.001
oured+0.001
Neg logit contributions
 villains-0.001
py-0.001
oon-0.001
 villain-0.001
robat-0.001
Token mom
Token ID1995
Feature activation-8.35e-01

Pos logit contributions
 villains+0.002
 hall+0.001
 dragons+0.001
 monsters+0.001
 monster+0.001
Neg logit contributions
cream-0.003
oured-0.003
 dip-0.003
 bait-0.003
inated-0.003
Token quickly
Token ID2952
Feature activation+4.86e-01

Pos logit contributions
cream+0.040
 bait+0.039
inated+0.038
oured+0.037
shaped+0.036
Neg logit contributions
 hall-0.026
 sisters-0.023
 dragons-0.021
 villains-0.021
 robbers-0.021
Token cleaned
Token ID20750
Feature activation+3.76e-01

Pos logit contributions
 villains+0.018
 hall+0.017
 robbers+0.016
 dragons+0.015
 monsters+0.015
Neg logit contributions
cream-0.020
oured-0.018
shaped-0.017
inated-0.017
 vanilla-0.017
Token up
Token ID510
Feature activation-1.35e+00

Pos logit contributions
 villains+0.016
 hall+0.016
 dragons+0.015
 monsters+0.014
 lawyers+0.014
Neg logit contributions
oured-0.014
cream-0.014
 bait-0.012
 dip-0.012
 mun-0.012
Token the
Token ID262
Feature activation-2.41e-01

Pos logit contributions
cream+0.057
oured+0.056
 dip+0.049
 bait+0.048
int+0.048
Neg logit contributions
 villains-0.054
 hall-0.051
 dragons-0.047
 monsters-0.047
 lawyers-0.045
Token mess
Token ID2085
Feature activation+2.15e+00

Pos logit contributions
oured+0.013
cream+0.013
 dip+0.012
 bait+0.012
 mun+0.011
Neg logit contributions
 villains-0.007
 hall-0.007
 maze-0.005
 dragons-0.005
 monsters-0.005
Token.
Token ID13
Feature activation+5.39e-01

Pos logit contributions
 villains+0.095
 hall+0.081
 robbers+0.078
py+0.076
 dragons+0.076
Neg logit contributions
oured-0.083
cream-0.082
 sprink-0.076
 dip-0.074
 placed-0.072
Token Soon
Token ID15894
Feature activation-1.64e-01

Pos logit contributions
 villains+0.020
 hall+0.018
 dragons+0.016
 robbers+0.016
 lawyers+0.016
Neg logit contributions
cream-0.024
oured-0.024
 bait-0.022
 sprink-0.022
 dip-0.022
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
oured+0.006
cream+0.005
 sprink+0.005
 dip+0.005
 bait+0.005
Neg logit contributions
 villains-0.008
 hall-0.007
py-0.007
 robbers-0.007
 dragons-0.006
Token Lisa
Token ID15378
Feature activation+8.57e-01

Pos logit contributions
 villains+0.044
 hall+0.040
 dragons+0.037
 monsters+0.036
 robbers+0.036
Neg logit contributions
cream-0.045
oured-0.045
 sprink-0.040
 dip-0.039
 bait-0.039
Token forgot
Token ID16453
Feature activation-2.55e-01

Pos logit contributions
 villains+0.032
 hall+0.030
 dragons+0.027
 monsters+0.026
py+0.026
Neg logit contributions
cream-0.038
oured-0.036
 bait-0.035
 dip-0.033
 sprink-0.032
Token.
Token ID13
Feature activation+1.52e-01

Pos logit contributions
cream+0.026
oured+0.026
 sprink+0.022
shaped+0.022
 dip+0.022
Neg logit contributions
 villains-0.020
 hall-0.018
 monster-0.017
 maze-0.017
 monsters-0.017
Token It
Token ID632
Feature activation-5.56e-01

Pos logit contributions
 villains+0.005
 hall+0.005
 cabinet+0.004
 robbers+0.004
 chamber+0.004
Neg logit contributions
cream-0.007
oured-0.007
ck-0.007
 dip-0.006
 sprink-0.006
Token was
Token ID373
Feature activation-1.93e-02

Pos logit contributions
oured+0.028
cream+0.027
 bait+0.025
 sprink+0.024
 dip+0.024
Neg logit contributions
 villains-0.018
 hall-0.016
 dragons-0.015
py-0.015
 monster-0.015
Token a
Token ID257
Feature activation-1.92e-01

Pos logit contributions
cream+0.001
oured+0.001
int+0.001
 dip+0.001
acket+0.001
Neg logit contributions
 villains-0.001
 dragons-0.001
 monsters-0.001
 hall-0.001
py-0.001
Token chip
Token ID11594
Feature activation-5.43e-01

Pos logit contributions
cream+0.009
oured+0.009
shaped+0.008
 dip+0.008
 sprink+0.008
Neg logit contributions
 villains-0.007
 hall-0.006
 monster-0.006
 robbers-0.006
 maze-0.006
Tokenm
Token ID76
Feature activation+2.31e+00

Pos logit contributions
oured+0.023
cream+0.023
 sprink+0.023
 dip+0.022
 bait+0.021
Neg logit contributions
 villains-0.021
 hall-0.017
 dragons-0.016
 robbers-0.016
py-0.016
Tokenunk
Token ID2954
Feature activation-8.42e-01

Pos logit contributions
 villains+0.062
py+0.052
 hall+0.052
 monsters+0.047
 monster+0.047
Neg logit contributions
cream-0.126
oured-0.125
 dip-0.115
 sprink-0.114
 placed-0.110
Token!
Token ID0
Feature activation-2.00e-01

Pos logit contributions
cream+0.038
oured+0.038
 dip+0.034
 sprink+0.034
asted+0.034
Neg logit contributions
 villains-0.030
 hall-0.027
 monster-0.025
 monsters-0.025
py-0.025
Token The
Token ID383
Feature activation-8.59e-02

Pos logit contributions
cream+0.010
oured+0.009
ck+0.009
 dip+0.009
 sprink+0.009
Neg logit contributions
 villains-0.006
 hall-0.006
 robbers-0.005
 lawyers-0.005
 maze-0.005
Token chip
Token ID11594
Feature activation-2.82e-01

Pos logit contributions
oured+0.004
cream+0.004
 dip+0.003
 sprink+0.003
 placed+0.003
Neg logit contributions
 villains-0.004
 hall-0.003
 robbers-0.003
 monsters-0.003
 monster-0.003
Tokenm
Token ID76
Feature activation+2.09e+00

Pos logit contributions
cream+0.013
oured+0.013
 sprink+0.012
 dip+0.012
 bait+0.011
Neg logit contributions
 villains-0.011
 hall-0.009
 dragons-0.008
 robbers-0.008
py-0.008
TokenHi
Token ID17250
Feature activation-9.07e-01

Pos logit contributions
oured+0.008
cream+0.008
 bait+0.007
 dip+0.007
 sprink+0.007
Neg logit contributions
 villains-0.006
 hall-0.005
 robbers-0.004
py-0.004
 dragons-0.004
Token Jay
Token ID9180
Feature activation+1.99e-01

Pos logit contributions
cream+0.041
oured+0.040
 bait+0.037
 dip+0.036
asted+0.036
Neg logit contributions
 villains-0.034
 hall-0.031
 dragons-0.029
 monsters-0.028
 monster-0.028
Tokenden
Token ID6559
Feature activation+2.97e-01

Pos logit contributions
 villains+0.008
 hall+0.007
py+0.007
 dragons+0.006
 monster+0.006
Neg logit contributions
cream-0.008
oured-0.008
 dip-0.008
 bait-0.008
 sprink-0.008
Token!
Token ID0
Feature activation-1.43e-01

Pos logit contributions
 villains+0.010
 hall+0.009
py+0.008
 dragons+0.008
 monster+0.008
Neg logit contributions
cream-0.014
oured-0.014
shaped-0.013
 bait-0.012
 dip-0.012
Token Ready
Token ID23432
Feature activation+2.80e-02

Pos logit contributions
cream+0.007
oured+0.007
 sprink+0.006
 dip+0.006
 bait+0.006
Neg logit contributions
 villains-0.005
 hall-0.004
 robbers-0.004
 dragons-0.004
 villain-0.004
Token for
Token ID329
Feature activation+1.85e+00

Pos logit contributions
 villains+0.001
 hall+0.001
 dragons+0.001
py+0.001
 monsters+0.001
Neg logit contributions
cream-0.001
oured-0.001
 sprink-0.001
 bait-0.001
shaped-0.001
Token school
Token ID1524
Feature activation+7.40e-01

Pos logit contributions
 villains+0.089
 hall+0.080
 dragons+0.075
 robbers+0.074
 monsters+0.074
Neg logit contributions
oured-0.064
cream-0.063
 sprink-0.056
 dip-0.056
 bait-0.054
Token today
Token ID1909
Feature activation+4.00e-01

Pos logit contributions
 villains+0.030
 hall+0.027
 dragons+0.024
 robbers+0.024
 monsters+0.024
Neg logit contributions
oured-0.031
cream-0.031
 sprink-0.028
 dip-0.028
 bait-0.027
Token?"
Token ID1701
Feature activation-9.03e-02

Pos logit contributions
 villains+0.014
 hall+0.014
 maze+0.012
 dragons+0.012
 monsters+0.012
Neg logit contributions
cream-0.019
oured-0.018
shaped-0.016
 sprink-0.016
 bait-0.016
Token\n
Token ID198
Feature activation+4.11e-01

Pos logit contributions
oured+0.004
cream+0.004
 sprink+0.004
 bait+0.004
 dip+0.004
Neg logit contributions
 villains-0.003
 hall-0.003
 dragons-0.003
 lawyers-0.003
 robbers-0.003
Token\n
Token ID198
Feature activation+2.94e-01

Pos logit contributions
 villains+0.016
 hall+0.014
 robbers+0.013
 dragons+0.013
 monsters+0.013
Neg logit contributions
oured-0.017
cream-0.017
 sprink-0.016
 dip-0.016
 bait-0.015
Token saw
Token ID2497
Feature activation-7.62e-02

Pos logit contributions
 villains+0.009
 hall+0.009
 dragons+0.008
 monsters+0.007
 lawyers+0.007
Neg logit contributions
cream-0.012
oured-0.012
 bait-0.011
 dip-0.011
inated-0.011
Token a
Token ID257
Feature activation+7.22e-02

Pos logit contributions
oured+0.003
cream+0.003
 mun+0.003
int+0.003
 dip+0.003
Neg logit contributions
 hall-0.003
 monsters-0.003
 lawyers-0.003
 dragons-0.003
 villains-0.003
Token gray
Token ID12768
Feature activation-9.29e-02

Pos logit contributions
 villains+0.003
 hall+0.003
 dragons+0.002
 robbers+0.002
 maze+0.002
Neg logit contributions
oured-0.003
cream-0.003
 dip-0.003
 bait-0.003
 sprink-0.003
Token smoke
Token ID7523
Feature activation-4.28e-01

Pos logit contributions
oured+0.004
cream+0.004
 bait+0.004
 dip+0.004
ck+0.004
Neg logit contributions
 villains-0.003
 hall-0.003
 monster-0.003
 dragons-0.003
 maze-0.003
Token coming
Token ID2406
Feature activation+3.07e-02

Pos logit contributions
oured+0.022
cream+0.021
 bait+0.019
 dip+0.018
 mun+0.018
Neg logit contributions
 villains-0.014
 hall-0.013
 monsters-0.011
 dragons-0.011
 monster-0.011
Token from
Token ID422
Feature activation+1.85e+00

Pos logit contributions
 villains+0.001
 hall+0.001
 robbers+0.001
 dragons+0.001
 villain+0.001
Neg logit contributions
cream-0.001
oured-0.001
 sprink-0.001
 dip-0.001
 bait-0.001
Token the
Token ID262
Feature activation+5.07e-01

Pos logit contributions
 villains+0.075
 hall+0.071
 dragons+0.064
 monsters+0.063
 maze+0.063
Neg logit contributions
oured-0.078
cream-0.076
 sprink-0.067
 dip-0.067
 bait-0.066
Token oven
Token ID14361
Feature activation-1.05e+00

Pos logit contributions
 villains+0.018
 hall+0.015
 dragons+0.014
 robbers+0.014
py+0.014
Neg logit contributions
oured-0.024
cream-0.024
 sprink-0.022
 dip-0.022
 bait-0.022
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
oured+0.048
cream+0.047
 dip+0.043
 sprink+0.043
 bait+0.042
Neg logit contributions
 villains-0.039
 hall-0.035
 dragons-0.031
 robbers-0.031
 monsters-0.031
Token She
Token ID1375
Feature activation+9.20e-02

Pos logit contributions
 villains+0.008
 hall+0.007
 dragons+0.006
 robbers+0.006
py+0.006
Neg logit contributions
oured-0.014
cream-0.014
 sprink-0.013
 dip-0.013
 bait-0.013
Token was
Token ID373
Feature activation+3.44e-01

Pos logit contributions
 villains+0.004
 hall+0.003
 dragons+0.003
 monsters+0.003
 robbers+0.003
Neg logit contributions
oured-0.004
cream-0.004
 dip-0.003
 bait-0.003
 sprink-0.003
Token down
Token ID866
Feature activation-9.39e-01

Pos logit contributions
 villains+0.006
 hall+0.005
 dragons+0.005
py+0.005
 monsters+0.005
Neg logit contributions
oured-0.007
cream-0.007
 sprink-0.007
 dip-0.007
 bait-0.007
Token and
Token ID290
Feature activation+1.02e-01

Pos logit contributions
oured+0.040
cream+0.040
 dip+0.037
 sprink+0.037
 bait+0.036
Neg logit contributions
 villains-0.037
 hall-0.033
 dragons-0.030
 monsters-0.029
 robbers-0.029
Token look
Token ID804
Feature activation-9.93e-01

Pos logit contributions
 villains+0.004
 hall+0.004
 robbers+0.003
 seek+0.003
 dragons+0.003
Neg logit contributions
oured-0.004
cream-0.004
shaped-0.004
 bait-0.004
 sprink-0.004
Token out
Token ID503
Feature activation-1.65e+00

Pos logit contributions
cream+0.046
oured+0.044
 sprink+0.042
 bait+0.042
 dip+0.041
Neg logit contributions
 villains-0.035
 hall-0.031
 dragons-0.029
 monsters-0.028
 robbers-0.028
Token the
Token ID262
Feature activation-3.64e-01

Pos logit contributions
cream+0.068
oured+0.067
 sprink+0.060
 bait+0.059
 dip+0.059
Neg logit contributions
 villains-0.069
 hall-0.061
 dragons-0.058
 monsters-0.057
 robbers-0.056
Token window
Token ID4324
Feature activation+2.10e-01

Pos logit contributions
oured+0.018
cream+0.018
 sprink+0.016
 dip+0.016
 bait+0.016
Neg logit contributions
 villains-0.012
 hall-0.011
 dragons-0.010
 robbers-0.009
 monsters-0.009
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
 villains+0.006
 hall+0.006
 dragons+0.005
 monsters+0.005
 maze+0.005
Neg logit contributions
cream-0.011
oured-0.010
 bait-0.010
 sprink-0.009
 dip-0.009
Token<|endoftext|>
Token ID50256
Feature activation+4.27e-01

Pos logit contributions
oured+0.006
cream+0.006
 dip+0.006
 sprink+0.006
 bait+0.006
Neg logit contributions
 villains-0.006
 hall-0.005
 dragons-0.005
 robbers-0.004
py-0.004
TokenOnce
Token ID7454
Feature activation-1.00e-01

Pos logit contributions
 villains+0.022
 robbers+0.019
 villain+0.019
py+0.019
 hall+0.018
Neg logit contributions
cream-0.013
oured-0.012
 sprink-0.012
 dip-0.011
 bait-0.011
Token upon
Token ID2402
Feature activation-5.09e-01

Pos logit contributions
oured+0.003
cream+0.003
 dip+0.003
 sprink+0.003
 bait+0.003
Neg logit contributions
 villains-0.005
 robbers-0.004
 hall-0.004
 villain-0.004
 lawyers-0.004
Token a
Token ID257
Feature activation+2.69e-01

Pos logit contributions
oured+0.014
cream+0.014
 sprink+0.012
 dip+0.012
 bait+0.011
Neg logit contributions
 villains-0.028
 hall-0.026
 dragons-0.025
 monsters-0.024
 robbers-0.024
Token day
Token ID1110
Feature activation-1.45e-01

Pos logit contributions
 villains+0.018
 hall+0.016
 monster+0.014
 monsters+0.014
 maze+0.014
Neg logit contributions
oured-0.041
cream-0.039
inated-0.036
shaped-0.035
 sprink-0.035
Token,
Token ID11
Feature activation+8.06e-01

Pos logit contributions
oured+0.007
cream+0.007
 dip+0.006
 sprink+0.006
shaped+0.006
Neg logit contributions
 villains-0.005
 hall-0.005
 dragons-0.004
 monsters-0.004
 robbers-0.004
Token they
Token ID484
Feature activation+6.99e-01

Pos logit contributions
 villains+0.031
 hall+0.027
 robbers+0.026
 monsters+0.025
 dragons+0.025
Neg logit contributions
oured-0.039
cream-0.037
int-0.033
shaped-0.033
inated-0.032
Token saw
Token ID2497
Feature activation+2.80e-01

Pos logit contributions
 villains+0.027
 hall+0.025
 dragons+0.023
 robbers+0.022
 monsters+0.022
Neg logit contributions
cream-0.032
oured-0.031
 sprink-0.028
shaped-0.027
 vanilla-0.027
Token something
Token ID1223
Feature activation-8.64e-01

Pos logit contributions
 monsters+0.010
 villains+0.010
 monster+0.010
 hall+0.009
 dragons+0.009
Neg logit contributions
oured-0.014
int-0.013
immer-0.012
 mun-0.012
inated-0.011
Token new
Token ID649
Feature activation+6.44e-01

Pos logit contributions
oured+0.044
cream+0.041
int+0.037
 mun+0.037
acket+0.036
Neg logit contributions
 villains-0.030
 hall-0.027
 monster-0.027
 monsters-0.027
 maze-0.026
Token in
Token ID287
Feature activation-6.07e-01

Pos logit contributions
 villains+0.023
 hall+0.021
 monsters+0.019
 maze+0.019
 dragons+0.018
Neg logit contributions
oured-0.031
cream-0.030
 dip-0.028
 sprink-0.027
 bait-0.026
Token the
Token ID262
Feature activation+1.99e-01

Pos logit contributions
oured+0.029
cream+0.027
int+0.026
 mun+0.024
 dip+0.023
Neg logit contributions
 villains-0.022
 hall-0.022
 dragons-0.020
 robbers-0.020
 monsters-0.019
Token park
Token ID3952
Feature activation+3.31e-01

Pos logit contributions
 hall+0.005
 villains+0.004
 maze+0.004
 monster+0.003
 world+0.003
Neg logit contributions
oured-0.012
cream-0.012
inated-0.011
 mun-0.011
ck-0.011
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
 villains+0.011
 hall+0.010
 dragons+0.008
 maze+0.008
 monsters+0.008
Neg logit contributions
oured-0.017
cream-0.016
 dip-0.015
 sprink-0.015
 bait-0.015
Token It
Token ID632
Feature activation-1.67e-01

Pos logit contributions
 villains+0.011
 hall+0.010
 robbers+0.009
 lawyers+0.008
 skeletons+0.008
Neg logit contributions
oured-0.011
cream-0.011
 dip-0.010
ck-0.010
 sprink-0.010
Token told
Token ID1297
Feature activation+6.89e-01

Pos logit contributions
 villains+0.014
 hall+0.012
 robbers+0.012
 monsters+0.012
 dragons+0.012
Neg logit contributions
oured-0.011
cream-0.011
inated-0.010
 dip-0.010
 sprink-0.010
Token them
Token ID606
Feature activation-4.50e-01

Pos logit contributions
 villains+0.029
 hall+0.025
 dragons+0.024
 monsters+0.024
 robbers+0.023
Neg logit contributions
oured-0.029
cream-0.028
 sprink-0.026
 dip-0.025
 bait-0.025
Token what
Token ID644
Feature activation-6.81e-01

Pos logit contributions
oured+0.019
cream+0.017
 sprink+0.016
 dip+0.016
 bait+0.016
Neg logit contributions
 villains-0.020
 hall-0.017
 dragons-0.016
 monsters-0.016
 robbers-0.016
Token Tom
Token ID4186
Feature activation-6.72e-01

Pos logit contributions
cream+0.025
oured+0.024
 sprink+0.022
 dip+0.022
 bait+0.022
Neg logit contributions
 villains-0.031
 hall-0.028
py-0.026
 dragons-0.026
 robbers-0.026
Token did
Token ID750
Feature activation-3.08e-01

Pos logit contributions
oured+0.032
cream+0.031
 dip+0.029
 sprink+0.029
shaped+0.028
Neg logit contributions
 villains-0.023
 hall-0.020
 monsters-0.019
 robbers-0.019
 dragons-0.018
Token.
Token ID13
Feature activation+3.30e-01

Pos logit contributions
oured+0.013
cream+0.013
 sprink+0.012
 dip+0.012
 bait+0.011
Neg logit contributions
 villains-0.012
 hall-0.011
 dragons-0.010
 robbers-0.010
py-0.010
Token He
Token ID679
Feature activation-1.63e-01

Pos logit contributions
 villains+0.014
 hall+0.012
py+0.011
 dragons+0.011
 robbers+0.011
Neg logit contributions
cream-0.013
oured-0.013
 sprink-0.012
 dip-0.012
 bait-0.012
Token said
Token ID531
Feature activation-9.37e-02

Pos logit contributions
oured+0.007
cream+0.006
 dip+0.006
inated+0.006
shaped+0.006
Neg logit contributions
 villains-0.007
 hall-0.006
 dragons-0.006
 monsters-0.006
 robbers-0.006
Token Tom
Token ID4186
Feature activation-1.14e-01

Pos logit contributions
oured+0.004
cream+0.003
 dip+0.003
 sprink+0.003
 bait+0.003
Neg logit contributions
 villains-0.004
 hall-0.004
 dragons-0.004
 robbers-0.004
 monsters-0.004
Token was
Token ID373
Feature activation+2.66e-01

Pos logit contributions
oured+0.006
 dip+0.005
cream+0.005
 sprink+0.005
 bait+0.005
Neg logit contributions
 villains-0.004
 hall-0.003
 monsters-0.003
 dragons-0.003
 robbers-0.003
Token a
Token ID257
Feature activation-1.50e-01

Pos logit contributions
 villains+0.011
 hall+0.009
 robbers+0.009
 dragons+0.009
 monsters+0.009
Neg logit contributions
oured-0.012
cream-0.011
 dip-0.010
 sprink-0.010
 bait-0.010
Token Lil
Token ID16342
Feature activation-6.42e-01

Pos logit contributions
oured+0.023
cream+0.022
ck+0.022
shaped+0.022
 sprink+0.021
Neg logit contributions
 hall-0.014
 villains-0.013
 robbers-0.011
 rac-0.011
 maze-0.011
Tokenah
Token ID993
Feature activation+1.32e+00

Pos logit contributions
oured+0.023
cream+0.023
 dip+0.023
 bait+0.022
 sprink+0.021
Neg logit contributions
 villains-0.026
 hall-0.024
py-0.024
 dragons-0.023
 robbers-0.022
Token could
Token ID714
Feature activation-1.17e+00

Pos logit contributions
 villains+0.047
 hall+0.043
 dragons+0.038
 monsters+0.037
 alone+0.036
Neg logit contributions
cream-0.061
oured-0.059
 bait-0.055
 sprink-0.054
 vanilla-0.054
Token run
Token ID1057
Feature activation+2.35e-01

Pos logit contributions
cream+0.062
oured+0.057
 sprink+0.051
 bait+0.051
shaped+0.050
Neg logit contributions
 villains-0.038
 hall-0.037
 seek-0.033
 dragons-0.031
 alone-0.031
Token very
Token ID845
Feature activation+1.71e-01

Pos logit contributions
 villains+0.010
 hall+0.009
 dragons+0.008
 robbers+0.008
 monsters+0.008
Neg logit contributions
oured-0.010
cream-0.010
 sprink-0.009
 dip-0.009
 bait-0.009
Token fast
Token ID3049
Feature activation+2.05e-01

Pos logit contributions
 villains+0.006
 hall+0.005
 dragons+0.005
 monsters+0.004
 robbers+0.004
Neg logit contributions
oured-0.008
cream-0.008
 sprink-0.008
 dip-0.007
 bait-0.007
Token and
Token ID290
Feature activation+1.72e-01

Pos logit contributions
 villains+0.009
 hall+0.008
 dragons+0.007
 robbers+0.007
py+0.007
Neg logit contributions
oured-0.008
cream-0.008
 sprink-0.007
 dip-0.007
 bait-0.007
Token race
Token ID3234
Feature activation-4.24e-01

Pos logit contributions
 villains+0.007
 hall+0.006
 robbers+0.006
 dragons+0.005
 maze+0.005
Neg logit contributions
cream-0.008
oured-0.008
 sprink-0.007
shaped-0.007
 bait-0.007
Token against
Token ID1028
Feature activation-7.81e-01

Pos logit contributions
oured+0.017
cream+0.017
 sprink+0.015
 dip+0.015
 bait+0.015
Neg logit contributions
 villains-0.018
 hall-0.016
 robbers-0.015
py-0.015
 dragons-0.015
Token Daddy
Token ID26926
Feature activation-3.45e-01

Pos logit contributions
oured+0.034
cream+0.033
int+0.030
inated+0.029
 dip+0.029
Neg logit contributions
 villains-0.029
 hall-0.028
 monsters-0.026
 dragons-0.025
 skeletons-0.025
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
oured+0.015
cream+0.015
 sprink+0.014
 dip+0.014
 bait+0.014
Neg logit contributions
 villains-0.013
 hall-0.012
 robbers-0.010
 dragons-0.010
py-0.010
Token She
Token ID1375
Feature activation+8.11e-02

Pos logit contributions
 villains+0.001
py+0.001
 monsters+0.001
 hall+0.001
 villain+0.001
Neg logit contributions
oured-0.001
cream-0.001
 dip-0.001
 sprink-0.001
 placed-0.001
Token looked
Token ID3114
Feature activation-8.01e-01

Pos logit contributions
 villains+0.003
 hall+0.003
 dragons+0.003
 robbers+0.003
 monsters+0.003
Neg logit contributions
cream-0.003
oured-0.003
 sprink-0.003
 dip-0.003
 bait-0.003
Token in
Token ID287
Feature activation-1.23e+00

Pos logit contributions
cream+0.036
oured+0.035
 sprink+0.032
 dip+0.032
 bait+0.032
Neg logit contributions
 villains-0.030
 hall-0.027
 dragons-0.025
 robbers-0.025
 monsters-0.024
Token the
Token ID262
Feature activation+1.25e-01

Pos logit contributions
oured+0.040
cream+0.038
 sprink+0.037
 dip+0.036
 bait+0.035
Neg logit contributions
 villains-0.062
 hall-0.054
py-0.052
 villain-0.050
 dragons-0.050
Token kitchen
Token ID9592
Feature activation-1.20e+00

Pos logit contributions
 villains+0.005
py+0.004
 villain+0.003
 dragons+0.003
 robbers+0.003
Neg logit contributions
oured-0.006
cream-0.006
 dip-0.006
 sprink-0.006
 vanilla-0.005
Token,
Token ID11
Feature activation+1.03e-01

Pos logit contributions
oured+0.048
cream+0.048
 sprink+0.043
 dip+0.043
 bait+0.042
Neg logit contributions
 villains-0.051
 hall-0.044
 dragons-0.041
 robbers-0.041
py-0.041
Token the
Token ID262
Feature activation+4.71e-01

Pos logit contributions
 villains+0.003
 hall+0.003
 robbers+0.003
 dragons+0.003
 monsters+0.003
Neg logit contributions
cream-0.005
oured-0.005
 bait-0.004
 sprink-0.004
shaped-0.004
Token living
Token ID2877
Feature activation+9.52e-01

Pos logit contributions
 villains+0.016
py+0.013
 hall+0.011
 robbers+0.011
 villain+0.011
Neg logit contributions
 sprink-0.023
oured-0.022
 dip-0.022
cream-0.022
 vanilla-0.021
Token room
Token ID2119
Feature activation+1.88e-01

Pos logit contributions
 villains+0.034
py+0.026
 robbers+0.026
 villain+0.024
 veterin+0.022
Neg logit contributions
oured-0.045
 sprink-0.045
cream-0.044
 dip-0.044
 placed-0.043
Token and
Token ID290
Feature activation-5.17e-02

Pos logit contributions
 villains+0.007
 hall+0.007
 dragons+0.006
 monsters+0.006
 robbers+0.006
Neg logit contributions
cream-0.008
oured-0.008
 sprink-0.007
 dip-0.007
 bait-0.007
Token even
Token ID772
Feature activation-1.51e+00

Pos logit contributions
oured+0.002
cream+0.002
 sprink+0.002
 dip+0.002
 bait+0.002
Neg logit contributions
 villains-0.002
 hall-0.002
 dragons-0.002
 robbers-0.002
 monsters-0.002
Token,
Token ID11
Feature activation+7.77e-01

Pos logit contributions
oured+0.012
cream+0.012
 dip+0.011
 sprink+0.011
shaped+0.010
Neg logit contributions
 villains-0.015
 hall-0.012
 dragons-0.012
 monsters-0.012
 villain-0.012
Token "
Token ID366
Feature activation-1.25e-01

Pos logit contributions
 villains+0.021
 hall+0.017
 robbers+0.015
 dragons+0.015
py+0.015
Neg logit contributions
cream-0.043
oured-0.043
 dip-0.039
 sprink-0.039
 bait-0.039
TokenLet
Token ID5756
Feature activation-9.31e-02

Pos logit contributions
oured+0.006
cream+0.005
 sprink+0.005
 dip+0.005
 bait+0.005
Neg logit contributions
 villains-0.005
 hall-0.004
 robbers-0.004
 dragons-0.004
py-0.004
Token me
Token ID502
Feature activation-5.61e-01

Pos logit contributions
oured+0.004
cream+0.004
 sprink+0.004
 dip+0.004
 bait+0.004
Neg logit contributions
 villains-0.003
 hall-0.003
 dragons-0.003
 monsters-0.003
 robbers-0.003
Token show
Token ID905
Feature activation+1.26e+00

Pos logit contributions
cream+0.026
oured+0.024
 sprink+0.023
shaped+0.023
inated+0.022
Neg logit contributions
 villains-0.020
 hall-0.018
 seek-0.017
 robbers-0.017
 maze-0.017
Token you
Token ID345
Feature activation-7.86e-01

Pos logit contributions
 villains+0.042
 dragons+0.035
 monsters+0.034
 hall+0.033
 monster+0.032
Neg logit contributions
oured-0.064
cream-0.061
 sprink-0.057
 mun-0.056
 dip-0.056
Token how
Token ID703
Feature activation-1.74e-01

Pos logit contributions
oured+0.028
cream+0.027
 sprink+0.024
 mun+0.023
 dip+0.023
Neg logit contributions
 villains-0.037
 hall-0.033
 dragons-0.033
 monsters-0.032
 maze-0.031
Token fun
Token ID1257
Feature activation+1.16e-01

Pos logit contributions
oured+0.008
cream+0.007
 sprink+0.007
 dip+0.006
 bait+0.006
Neg logit contributions
 villains-0.007
 hall-0.006
 dragons-0.006
 monsters-0.006
 maze-0.006
Token this
Token ID428
Feature activation+4.72e-02

Pos logit contributions
 villains+0.005
 hall+0.004
 dragons+0.004
 monsters+0.004
 robbers+0.004
Neg logit contributions
oured-0.005
cream-0.005
 sprink-0.004
 dip-0.004
 bait-0.004
Token game
Token ID983
Feature activation-1.84e-02

Pos logit contributions
 villains+0.002
 hall+0.002
 robbers+0.001
py+0.001
 dragons+0.001
Neg logit contributions
cream-0.002
oured-0.002
 sprink-0.002
 dip-0.002
 bait-0.002
Token can
Token ID460
Feature activation-7.16e-01

Pos logit contributions
oured+0.001
cream+0.001
 sprink+0.001
 dip+0.001
 bait+0.001
Neg logit contributions
 villains-0.001
 hall-0.001
 dragons-0.001
 monsters-0.001
 robbers-0.001
Token to
Token ID284
Feature activation-7.96e-02

Pos logit contributions
 villains+0.009
 dragons+0.009
 hall+0.008
 lawyers+0.008
 monsters+0.008
Neg logit contributions
oured-0.010
cream-0.010
inated-0.009
int-0.009
immer-0.009
Token be
Token ID307
Feature activation+7.26e-02

Pos logit contributions
cream+0.003
oured+0.003
shaped+0.003
inated+0.003
 sprink+0.003
Neg logit contributions
 villains-0.003
 hall-0.003
 robbers-0.003
 seek-0.003
 dragons-0.003
Token patient
Token ID5827
Feature activation+3.93e-01

Pos logit contributions
 villains+0.003
 hall+0.003
py+0.002
 villain+0.002
 dragons+0.002
Neg logit contributions
oured-0.003
cream-0.003
 sprink-0.003
 dip-0.003
 bait-0.003
Token."
Token ID526
Feature activation-3.99e-01

Pos logit contributions
 villains+0.016
 hall+0.014
 dragons+0.013
 robbers+0.013
 monsters+0.012
Neg logit contributions
oured-0.017
cream-0.017
 sprink-0.015
 dip-0.015
 bait-0.015
Token
Token ID198
Feature activation-7.06e-01

Pos logit contributions
oured+0.018
cream+0.018
 sprink+0.016
 dip+0.016
 bait+0.016
Neg logit contributions
 villains-0.015
 hall-0.013
 dragons-0.012
 robbers-0.012
py-0.012
Token
Token ID198
Feature activation-7.07e-01

Pos logit contributions
oured+0.031
cream+0.031
 sprink+0.028
 dip+0.028
 bait+0.027
Neg logit contributions
 villains-0.028
 hall-0.024
 robbers-0.022
 dragons-0.022
 monsters-0.022
TokenSo
Token ID2396
Feature activation-1.08e+00

Pos logit contributions
cream+0.030
oured+0.030
 sprink+0.028
 dip+0.027
 bait+0.027
Neg logit contributions
 villains-0.029
 hall-0.024
 robbers-0.023
py-0.023
 dragons-0.023
Token Jane
Token ID12091
Feature activation+3.79e-01

Pos logit contributions
cream+0.056
oured+0.055
 bait+0.049
 sprink+0.048
shaped+0.048
Neg logit contributions
 villains-0.030
 hall-0.030
 dragons-0.027
 maze-0.027
 chamber-0.025
Token waited
Token ID13488
Feature activation+5.59e-01

Pos logit contributions
 villains+0.014
 hall+0.014
 dragons+0.013
py+0.012
 robbers+0.012
Neg logit contributions
oured-0.015
cream-0.014
 sprink-0.013
 bait-0.013
 dip-0.013
Token patiently
Token ID37220
Feature activation+1.14e-01

Pos logit contributions
 villains+0.020
 hall+0.019
 dragons+0.017
 monsters+0.017
 robbers+0.017
Neg logit contributions
cream-0.025
oured-0.024
 sprink-0.022
 bait-0.022
 dip-0.022
Token while
Token ID981
Feature activation+7.91e-01

Pos logit contributions
 villains+0.004
 hall+0.004
 dragons+0.004
 robbers+0.003
 monsters+0.003
Neg logit contributions
cream-0.005
oured-0.005
 sprink-0.005
 dip-0.004
 bait-0.004
Token.
Token ID13
Feature activation+2.84e-01

Pos logit contributions
cream+0.006
oured+0.006
 dip+0.006
 sprink+0.006
 bait+0.006
Neg logit contributions
 villains-0.007
 hall-0.007
 maze-0.006
 robbers-0.006
py-0.006
Token Here
Token ID3423
Feature activation-3.77e-02

Pos logit contributions
 villains+0.010
 hall+0.009
 dragons+0.008
 robbers+0.008
 monsters+0.008
Neg logit contributions
oured-0.013
cream-0.013
 sprink-0.012
 dip-0.012
 bait-0.012
Token,
Token ID11
Feature activation+3.81e-01

Pos logit contributions
oured+0.002
cream+0.002
shaped+0.002
 sprink+0.002
 dip+0.002
Neg logit contributions
 villains-0.001
 hall-0.001
 monsters-0.001
 dragons-0.001
 robbers-0.001
Token have
Token ID423
Feature activation+6.98e-01

Pos logit contributions
 villains+0.016
 hall+0.014
 robbers+0.014
 dragons+0.013
 monsters+0.013
Neg logit contributions
oured-0.016
cream-0.016
 sprink-0.014
shaped-0.014
 bait-0.013
Token a
Token ID257
Feature activation+1.87e-01

Pos logit contributions
 villains+0.030
 dragons+0.026
 hall+0.025
 monsters+0.025
 robbers+0.025
Neg logit contributions
oured-0.029
cream-0.027
inated-0.024
 dip-0.024
int-0.024
Token carrot
Token ID40562
Feature activation-1.33e-01

Pos logit contributions
 villains+0.007
 hall+0.006
 maze+0.006
 robbers+0.006
 villain+0.006
Neg logit contributions
cream-0.009
oured-0.008
shaped-0.007
 sprink-0.007
 bait-0.007
Token.
Token ID13
Feature activation+5.40e-01

Pos logit contributions
oured+0.006
cream+0.006
 sprink+0.005
 dip+0.005
 bait+0.005
Neg logit contributions
 villains-0.005
 hall-0.005
 robbers-0.004
 dragons-0.004
py-0.004
Token It
Token ID632
Feature activation+3.68e-01

Pos logit contributions
 villains+0.019
 hall+0.015
 robbers+0.015
 villain+0.014
py+0.014
Neg logit contributions
oured-0.026
cream-0.025
 dip-0.023
 bait-0.023
 sprink-0.023
Token's
Token ID338
Feature activation+4.45e-01

Pos logit contributions
 villains+0.011
 hall+0.009
 dragons+0.009
py+0.009
 monsters+0.009
Neg logit contributions
oured-0.019
cream-0.019
 bait-0.017
 sprink-0.017
 dip-0.017
Token delicious
Token ID12625
Feature activation-2.86e-01

Pos logit contributions
 villains+0.013
py+0.013
 monsters+0.013
 dragons+0.013
 alone+0.012
Neg logit contributions
oured-0.023
cream-0.021
shaped-0.019
inated-0.019
 mun-0.019
Token and
Token ID290
Feature activation-3.46e-02

Pos logit contributions
oured+0.012
cream+0.012
 sprink+0.011
 dip+0.011
 bait+0.011
Neg logit contributions
 villains-0.011
 hall-0.010
 robbers-0.009
 dragons-0.009
py-0.009
Token had
Token ID550
Feature activation-3.13e-02

Pos logit contributions
 villains+0.000
 hall+0.000
 robbers+0.000
 maze+0.000
 dragons+0.000
Neg logit contributions
cream-0.000
oured-0.000
 sprink-0.000
 dip-0.000
 placed-0.000
Token a
Token ID257
Feature activation+3.64e-01

Pos logit contributions
oured+0.001
cream+0.001
int+0.001
 dip+0.001
immer+0.001
Neg logit contributions
 villains-0.002
 dragons-0.001
 monsters-0.001
 hall-0.001
py-0.001
Token very
Token ID845
Feature activation+7.91e-01

Pos logit contributions
 villains+0.015
 hall+0.013
 dragons+0.012
 robbers+0.012
 monsters+0.012
Neg logit contributions
oured-0.016
cream-0.015
 sprink-0.014
 dip-0.014
 bait-0.014
Token long
Token ID890
Feature activation+8.89e-01

Pos logit contributions
 villains+0.033
 hall+0.029
 maze+0.026
 dragons+0.026
 monsters+0.026
Neg logit contributions
oured-0.034
cream-0.033
 sprink-0.030
 dip-0.029
 placed-0.029
Token feather
Token ID17972
Feature activation-8.27e-01

Pos logit contributions
 villains+0.034
 hall+0.030
 dragons+0.028
py+0.027
 monsters+0.027
Neg logit contributions
oured-0.040
cream-0.040
 sprink-0.036
 dip-0.035
 bait-0.035
Token on
Token ID319
Feature activation-3.19e-01

Pos logit contributions
oured+0.037
cream+0.036
 sprink+0.034
 dip+0.033
 vanilla+0.033
Neg logit contributions
 villains-0.031
 hall-0.027
 dragons-0.026
py-0.025
 monsters-0.025
Token his
Token ID465
Feature activation+8.63e-02

Pos logit contributions
oured+0.014
cream+0.014
 sprink+0.012
 dip+0.012
 bait+0.012
Neg logit contributions
 villains-0.013
 hall-0.011
 dragons-0.011
py-0.010
 monsters-0.010
Token head
Token ID1182
Feature activation-1.44e-02

Pos logit contributions
 villains+0.004
 hall+0.003
 robbers+0.003
 villain+0.003
 lawyers+0.003
Neg logit contributions
cream-0.003
oured-0.003
 dip-0.003
 sprink-0.003
 bait-0.003
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
oured+0.001
cream+0.001
 sprink+0.001
 dip+0.001
 bait+0.001
Neg logit contributions
 villains-0.000
 hall-0.000
 dragons-0.000
py-0.000
 monsters-0.000
Token All
Token ID1439
Feature activation+7.06e-01

Pos logit contributions
oured+0.005
cream+0.004
 dip+0.004
 bait+0.004
 sprink+0.004
Neg logit contributions
 villains-0.007
 hall-0.006
 lawyers-0.006
 monsters-0.006
 witches-0.006
Token the
Token ID262
Feature activation+5.74e-01

Pos logit contributions
 villains+0.030
 hall+0.026
 robbers+0.024
 dragons+0.024
 villain+0.024
Neg logit contributions
cream-0.028
oured-0.028
 dip-0.026
 sprink-0.025
 bait-0.025
Token red
Token ID2266
Feature activation-2.83e-01

Pos logit contributions
cream+0.001
oured+0.001
 sprink+0.001
 dip+0.001
 bait+0.001
Neg logit contributions
 villains-0.001
 hall-0.001
py-0.001
 robbers-0.001
 monsters-0.001
Token cover
Token ID3002
Feature activation+8.31e-01

Pos logit contributions
oured+0.014
cream+0.013
 dip+0.012
 bait+0.012
 sprink+0.012
Neg logit contributions
 villains-0.010
 hall-0.009
 dragons-0.008
 maze-0.008
 robbers-0.008
Token and
Token ID290
Feature activation-3.02e-01

Pos logit contributions
 villains+0.026
 hall+0.025
 dragons+0.022
py+0.022
 monsters+0.021
Neg logit contributions
oured-0.041
cream-0.040
 dip-0.038
 sprink-0.037
 bait-0.037
Token a
Token ID257
Feature activation-8.04e-01

Pos logit contributions
oured+0.012
cream+0.010
int+0.009
 dip+0.009
acket+0.009
Neg logit contributions
 villains-0.014
 hall-0.013
 dragons-0.012
 monsters-0.012
 robbers-0.012
Token picture
Token ID4286
Feature activation-1.36e+00

Pos logit contributions
oured+0.032
cream+0.032
 sprink+0.030
 dip+0.029
 bait+0.029
Neg logit contributions
 villains-0.033
 hall-0.029
py-0.027
 robbers-0.027
 dragons-0.027
Token of
Token ID286
Feature activation-6.87e-01

Pos logit contributions
oured+0.058
cream+0.058
 sprink+0.053
 dip+0.052
 bait+0.051
Neg logit contributions
 villains-0.054
 hall-0.047
 robbers-0.043
 dragons-0.043
py-0.043
Token a
Token ID257
Feature activation-8.40e-01

Pos logit contributions
oured+0.024
cream+0.023
int+0.020
 dip+0.019
inated+0.019
Neg logit contributions
 villains-0.035
 hall-0.032
 dragons-0.031
 monsters-0.031
 skeletons-0.030
Token fox
Token ID21831
Feature activation-5.69e-01

Pos logit contributions
 sprink+0.036
cream+0.036
oured+0.035
 dip+0.035
 vanilla+0.034
Neg logit contributions
 villains-0.030
py-0.025
 hall-0.025
 robbers-0.023
 monsters-0.022
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
oured+0.027
cream+0.026
 dip+0.024
 sprink+0.024
 bait+0.023
Neg logit contributions
 villains-0.021
 hall-0.018
 dragons-0.016
 robbers-0.016
 monsters-0.016
Token a
Token ID257
Feature activation-1.24e+00

Pos logit contributions
oured+0.040
cream+0.036
immer+0.034
int+0.034
acket+0.033
Neg logit contributions
 villains-0.049
 dragons-0.043
 hall-0.042
 monsters-0.042
 robbers-0.041
Token crow
Token ID37593
Feature activation-1.70e+00

Pos logit contributions
oured+0.056
cream+0.056
 sprink+0.052
 dip+0.052
 bait+0.051
Neg logit contributions
 villains-0.045
 hall-0.038
py-0.035
 dragons-0.035
 robbers-0.035
TokenI
Token ID40
Feature activation-8.35e-01

Pos logit contributions
oured+0.041
cream+0.039
 dip+0.037
 bait+0.036
 sprink+0.036
Neg logit contributions
 villains-0.028
 hall-0.023
 robbers-0.022
py-0.021
 lawyers-0.021
Token'm
Token ID1101
Feature activation-3.22e-01

Pos logit contributions
cream+0.044
oured+0.042
 bait+0.039
shaped+0.038
inated+0.038
Neg logit contributions
 villains-0.024
 hall-0.022
 monster-0.019
umpy-0.018
 villain-0.018
Token so
Token ID523
Feature activation-2.48e-01

Pos logit contributions
cream+0.017
oured+0.017
 dip+0.015
 mun+0.015
 bait+0.015
Neg logit contributions
 villains-0.010
 hall-0.009
 monsters-0.008
 lawyers-0.008
py-0.008
Token glad
Token ID9675
Feature activation-3.72e-01

Pos logit contributions
oured+0.012
cream+0.012
 dip+0.011
 bait+0.011
 sprink+0.010
Neg logit contributions
 villains-0.008
 hall-0.007
 lawyers-0.007
 alone-0.007
 monsters-0.006
Token you
Token ID345
Feature activation-1.26e+00

Pos logit contributions
oured+0.020
cream+0.018
int+0.018
shaped+0.018
 dip+0.018
Neg logit contributions
 villains-0.011
 dragons-0.010
 monsters-0.009
 skeletons-0.009
 robbers-0.009
Token like
Token ID588
Feature activation-2.41e+00

Pos logit contributions
cream+0.054
oured+0.052
 bait+0.050
 vanilla+0.050
 sprink+0.049
Neg logit contributions
 villains-0.046
 hall-0.042
 monsters-0.039
 dragons-0.038
 monster-0.037
Token the
Token ID262
Feature activation-5.34e-01

Pos logit contributions
oured+0.103
cream+0.096
int+0.087
 dip+0.084
 mun+0.083
Neg logit contributions
 villains-0.102
 dragons-0.092
 hall-0.091
 monsters-0.090
 monster-0.087
Token candle
Token ID26839
Feature activation+1.20e-01

Pos logit contributions
oured+0.025
cream+0.024
 mun+0.021
 bait+0.021
shaped+0.020
Neg logit contributions
 villains-0.020
 hall-0.019
 monsters-0.018
 dragons-0.018
 monster-0.017
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 villains+0.004
 hall+0.004
 dragons+0.004
 monsters+0.004
 monster+0.004
Neg logit contributions
oured-0.005
cream-0.005
 bait-0.005
 dip-0.005
shaped-0.005
Token Tim
Token ID5045
Feature activation+2.83e-01

Pos logit contributions
oured+0.050
cream+0.046
shaped+0.044
 dip+0.043
int+0.042
Neg logit contributions
 villains-0.036
 hall-0.032
 monsters-0.030
 dragons-0.030
 robbers-0.030
Tokenmy
Token ID1820
Feature activation+9.73e-01

Pos logit contributions
 villains+0.013
 hall+0.010
 basement+0.010
 robbers+0.009
 villain+0.009
Neg logit contributions
cream-0.012
oured-0.011
 sprink-0.010
 placed-0.010
 vanilla-0.010
Token hello
Token ID23748
Feature activation-5.93e-01

Pos logit contributions
oured+0.007
cream+0.006
 dip+0.006
 mun+0.006
ck+0.006
Neg logit contributions
 villains-0.007
 hall-0.006
 dragons-0.006
 monster-0.006
 lawyers-0.006
Token.
Token ID13
Feature activation-4.70e-02

Pos logit contributions
oured+0.028
cream+0.028
 sprink+0.026
 dip+0.026
 bait+0.026
Neg logit contributions
 villains-0.020
 hall-0.017
 dragons-0.016
 monsters-0.016
 robbers-0.016
Token
Token ID220
Feature activation+1.01e+00

Pos logit contributions
cream+0.002
oured+0.002
 sprink+0.002
 dip+0.002
ck+0.002
Neg logit contributions
 villains-0.002
 hall-0.002
 robbers-0.001
 maze-0.001
 skeletons-0.001
Token
Token ID198
Feature activation-3.93e-01

Pos logit contributions
 villains+0.035
 hall+0.031
 dragons+0.027
 robbers+0.027
py+0.027
Neg logit contributions
oured-0.048
cream-0.047
 dip-0.044
 sprink-0.044
 bait-0.043
Token
Token ID198
Feature activation-3.53e-01

Pos logit contributions
oured+0.017
cream+0.017
 sprink+0.015
 dip+0.015
 bait+0.015
Neg logit contributions
 villains-0.016
 hall-0.014
 dragons-0.013
 monsters-0.013
 robbers-0.012
Tokenâ
Token ID22940
Feature activation-1.96e+00

Pos logit contributions
cream+0.015
oured+0.015
 sprink+0.013
 dip+0.013
 bait+0.013
Neg logit contributions
 villains-0.014
 hall-0.013
 dragons-0.012
 monsters-0.012
py-0.012
Token€
Token ID26391
Feature activation+9.44e-01

Pos logit contributions
oured+0.087
cream+0.086
 sprink+0.084
 bait+0.082
 dip+0.082
Neg logit contributions
 villains-0.070
 hall-0.062
py-0.056
 maze-0.054
 robbers-0.054
Token�
Token ID129
Feature activation-1.79e-01

Pos logit contributions
 villains+0.040
 hall+0.035
 dragons+0.032
 robbers+0.032
py+0.032
Neg logit contributions
oured-0.038
cream-0.038
 sprink-0.035
 dip-0.034
 bait-0.033
Token�
Token ID241
Feature activation-5.98e-01

Pos logit contributions
oured+0.007
cream+0.007
 sprink+0.007
 dip+0.006
 bait+0.006
Neg logit contributions
 villains-0.008
 hall-0.007
py-0.006
 robbers-0.006
 dragons-0.006
TokenHi
Token ID17250
Feature activation-5.53e-01

Pos logit contributions
oured+0.031
cream+0.028
 dip+0.027
 sprink+0.026
 placed+0.026
Neg logit contributions
 villains-0.020
 hall-0.016
 seek-0.014
umpy-0.014
 robbers-0.014
Token,
Token ID11
Feature activation-1.80e-01

Pos logit contributions
cream+0.026
oured+0.025
shaped+0.023
asted+0.023
 dip+0.023
Neg logit contributions
 villains-0.019
 hall-0.017
 maze-0.016
 robbers-0.016
 monster-0.016
Token time
Token ID640
Feature activation-1.11e-01

Pos logit contributions
 villains+0.004
 hall+0.003
 dragons+0.003
 robbers+0.003
 monsters+0.003
Neg logit contributions
cream-0.004
oured-0.004
 sprink-0.004
 dip-0.004
 bait-0.004
Token there
Token ID612
Feature activation+2.81e-01

Pos logit contributions
oured+0.004
cream+0.004
 sprink+0.003
 dip+0.003
 bait+0.003
Neg logit contributions
 villains-0.005
 hall-0.005
 robbers-0.004
 dragons-0.004
 villain-0.004
Token was
Token ID373
Feature activation-6.04e-01

Pos logit contributions
 villains+0.011
 hall+0.010
 dragons+0.009
 monsters+0.009
py+0.009
Neg logit contributions
oured-0.013
cream-0.012
 sprink-0.011
 dip-0.011
 bait-0.011
Token a
Token ID257
Feature activation+2.11e-01

Pos logit contributions
oured+0.021
cream+0.021
 dip+0.017
 placed+0.016
 sprink+0.016
Neg logit contributions
 villains-0.029
 hall-0.027
 dragons-0.026
py-0.025
 monsters-0.025
Token girl
Token ID2576
Feature activation+2.09e-01

Pos logit contributions
 villains+0.010
 hall+0.009
py+0.008
 dragons+0.008
 robbers+0.008
Neg logit contributions
oured-0.007
cream-0.007
 sprink-0.007
 dip-0.007
 bait-0.006
Token named
Token ID3706
Feature activation-2.12e+00

Pos logit contributions
 villains+0.009
 hall+0.008
 dragons+0.007
py+0.007
 monsters+0.007
Neg logit contributions
oured-0.009
cream-0.009
 bait-0.008
 sprink-0.008
 dip-0.007
Token Kay
Token ID17356
Feature activation-4.00e-01

Pos logit contributions
oured+0.101
cream+0.096
 dip+0.094
 sprink+0.094
 vanilla+0.089
Neg logit contributions
 villains-0.078
 villain-0.059
 hall-0.059
 robbers-0.058
 monsters-0.058
Tokenla
Token ID5031
Feature activation+5.37e-01

Pos logit contributions
cream+0.015
oured+0.014
 bait+0.013
 dip+0.013
 sprink+0.013
Neg logit contributions
 villains-0.018
 hall-0.017
py-0.015
 dragons-0.015
 monsters-0.015
Token.
Token ID13
Feature activation+1.61e-01

Pos logit contributions
 villains+0.014
 hall+0.013
 dragons+0.011
py+0.011
 alone+0.010
Neg logit contributions
oured-0.030
cream-0.029
 bait-0.028
 dip-0.027
 vanilla-0.027
Token She
Token ID1375
Feature activation+2.87e-01

Pos logit contributions
 villains+0.006
 robbers+0.004
py+0.004
 monsters+0.004
 villain+0.004
Neg logit contributions
oured-0.008
cream-0.007
 sprink-0.007
 dip-0.007
inated-0.007
Token was
Token ID373
Feature activation+2.48e-01

Pos logit contributions
 villains+0.012
 hall+0.011
 dragons+0.010
 monsters+0.010
py+0.010
Neg logit contributions
oured-0.012
cream-0.012
 bait-0.011
 dip-0.011
 sprink-0.010
Token there
Token ID612
Feature activation+5.12e-01

Pos logit contributions
oured+0.009
cream+0.009
 sprink+0.008
 dip+0.008
 bait+0.008
Neg logit contributions
 villains-0.006
 hall-0.006
 dragons-0.005
 monsters-0.005
 robbers-0.005
Token was
Token ID373
Feature activation-5.33e-01

Pos logit contributions
 villains+0.023
 hall+0.020
 robbers+0.018
 dragons+0.018
py+0.018
Neg logit contributions
oured-0.020
cream-0.020
 sprink-0.018
 dip-0.018
 bait-0.017
Token a
Token ID257
Feature activation+1.71e-01

Pos logit contributions
oured+0.019
cream+0.018
 dip+0.016
 placed+0.015
 bait+0.015
Neg logit contributions
 villains-0.025
 hall-0.023
 dragons-0.022
 monsters-0.022
 lawyers-0.021
Token little
Token ID1310
Feature activation-6.51e-01

Pos logit contributions
 villains+0.008
 hall+0.007
py+0.007
 dragons+0.007
 monsters+0.007
Neg logit contributions
oured-0.006
cream-0.005
 sprink-0.005
 dip-0.005
 bait-0.005
Token girl
Token ID2576
Feature activation+3.08e-01

Pos logit contributions
cream+0.033
oured+0.033
 sprink+0.032
 dip+0.031
 bait+0.030
Neg logit contributions
 villains-0.020
 hall-0.016
py-0.015
 dragons-0.015
 robbers-0.014
Token named
Token ID3706
Feature activation-2.45e+00

Pos logit contributions
 villains+0.015
 hall+0.013
 dragons+0.012
 robbers+0.012
py+0.012
Neg logit contributions
oured-0.011
cream-0.011
 sprink-0.009
 dip-0.009
 bait-0.009
Token Lily
Token ID20037
Feature activation+4.27e-01

Pos logit contributions
oured+0.097
 dip+0.092
 sprink+0.090
cream+0.088
 vanilla+0.086
Neg logit contributions
 villains-0.112
robat-0.090
 villain-0.089
py-0.085
 monsters-0.084
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
 villains+0.018
 hall+0.015
 robbers+0.014
 dragons+0.014
 villain+0.014
Neg logit contributions
cream-0.017
oured-0.017
 sprink-0.016
 dip-0.016
 placed-0.015
Token She
Token ID1375
Feature activation+2.87e-01

Pos logit contributions
 dip+0.010
 sprink+0.010
oured+0.010
 vanilla+0.010
 as+0.009
Neg logit contributions
 villains-0.015
robat-0.013
 villain-0.012
 trolls-0.012
 monsters-0.012
Token loved
Token ID6151
Feature activation-1.00e-01

Pos logit contributions
 villains+0.014
 hall+0.013
 robbers+0.012
 dragons+0.012
py+0.012
Neg logit contributions
cream-0.009
oured-0.009
 sprink-0.008
 dip-0.008
 bait-0.008
Token to
Token ID284
Feature activation+3.28e-02

Pos logit contributions
cream+0.003
oured+0.003
 sprink+0.003
 dip+0.003
 vanilla+0.003
Neg logit contributions
 villains-0.005
 hall-0.004
py-0.004
 villain-0.004
oon-0.004
Token was
Token ID373
Feature activation-1.91e-01

Pos logit contributions
 villains+0.001
 hall+0.001
 dragons+0.001
 monsters+0.001
 monster+0.001
Neg logit contributions
oured-0.001
cream-0.001
 bait-0.001
 dip-0.001
 sprink-0.001
Token the
Token ID262
Feature activation+3.64e-02

Pos logit contributions
oured+0.008
cream+0.008
shaped+0.007
 mun+0.007
int+0.007
Neg logit contributions
 villains-0.007
 monsters-0.007
 dragons-0.007
 hall-0.007
py-0.007
Token wind
Token ID2344
Feature activation+1.81e+00

Pos logit contributions
 villains+0.001
 hall+0.001
 monsters+0.001
 maze+0.001
 dragons+0.001
Neg logit contributions
oured-0.002
cream-0.002
ck-0.001
shaped-0.001
 mun-0.001
Token,
Token ID11
Feature activation+6.36e-01

Pos logit contributions
 villains+0.055
 hall+0.055
py+0.050
 monsters+0.049
 monster+0.049
Neg logit contributions
oured-0.087
cream-0.084
 bait-0.077
 mun-0.076
asted-0.076
Token and
Token ID290
Feature activation+4.06e-01

Pos logit contributions
 villains+0.022
 hall+0.021
 monsters+0.020
 dragons+0.019
py+0.019
Neg logit contributions
oured-0.030
cream-0.028
shaped-0.026
int-0.025
 bait-0.024
Token Grand
Token ID5675
Feature activation-2.06e+00

Pos logit contributions
 villains+0.016
 hall+0.015
 monsters+0.014
 dragons+0.014
 robbers+0.013
Neg logit contributions
oured-0.018
cream-0.016
 dip-0.015
 bait-0.015
int-0.015
Tokenma
Token ID2611
Feature activation-1.04e-01

Pos logit contributions
oured+0.110
 bait+0.107
cream+0.107
 dip+0.106
inated+0.101
Neg logit contributions
 villains-0.054
 hall-0.048
baby-0.042
py-0.042
 lawyers-0.042
Token smiled
Token ID13541
Feature activation-2.33e-01

Pos logit contributions
oured+0.004
 bait+0.004
cream+0.004
shaped+0.004
 dip+0.004
Neg logit contributions
 villains-0.004
 hall-0.003
py-0.003
 dragons-0.003
 sisters-0.003
Token again
Token ID757
Feature activation+3.99e-01

Pos logit contributions
cream+0.011
oured+0.011
 sprink+0.010
 bait+0.010
 dip+0.010
Neg logit contributions
 villains-0.008
 hall-0.007
 dragons-0.006
 monsters-0.006
py-0.006
Token.
Token ID13
Feature activation-3.04e-01

Pos logit contributions
 villains+0.013
 hall+0.011
 dragons+0.010
 monsters+0.010
 robbers+0.010
Neg logit contributions
oured-0.020
cream-0.020
 sprink-0.018
 dip-0.018
 bait-0.018
Token
Token ID198
Feature activation-8.07e-01

Pos logit contributions
oured+0.014
cream+0.013
 dip+0.013
 bait+0.013
 sprink+0.012
Neg logit contributions
 villains-0.011
 hall-0.010
 robbers-0.009
 dragons-0.009
 lawyers-0.009