TOP ACTIVATIONS
MAX = 4.071

When Daisy put the magn ifying glass down , the cater
their salad and have happy pic n ics together . <|endoftext|>
flashlight , and a magn ifying glass . As
He pointed his magn ifying glass at different classmates and
He brought his magn ifying glass and looked closely .
She takes out a magn ifying glass . She looks at
borrowed her mom 's magn ifying glass to take a closer
train . They put a kettle on the stove . They
, next to her magn ifying glass . She looked at
so excited ! She loved pic n ics and spending time
her fairy doll . She named her fairy doll Rose and
her wand and say " ab rac ad ab ra "
, handing him the magn ifying glass . She looked around
her toy dragon . She named him Max and took him
soft and brown . She named it Brown ie and played
the watch with the magn ifying glass . He saw the
a family who really enjoyed pic n ics . They loved
the warm sunshine and having pic n ics with her family
her cute doll . She named her doll Lily and gave
liked to play and have pic n ics . One day

MIN ACTIVATIONS
MIN = -3.445

special powers that can heal injuries . With my powers ,
came to see what the comm otion was about . She
Look , Ben , the bud bloom ed . It is
Mom heard the comm otion and came to see
Wow , Sara , the bud bloom ed . It is
see what was inside the bud . " Let
and make some new or nam ents together ."
, " What does ' destroy ' mean ?"
mummy , a big l oll ipop is up in the
his hand to pull the bud . " No
my 's mom noticed the comm otion and came outside .
happened : a modern l oll ipop machine came to their
Mom heard the comm otion and came to the
found that it wouldn 't bud ge . He pushed and
not bigger than a fing ern ail . They also found
Be careful , these or nam ents are very special .
but the screw wouldn 't bud ge . Suddenly
but the lid wouldn 't bud ge ! The mouse began
but it still wouldn 't bud ge .
Suddenly , P uss noticed a bigger bug .

INTERVAL 2.192 - 4.071
CONTAINS 0.216%

reminded her that she should always prevent others from feeling sad
there was a little girl named Lily . She loved to
day , a little boy named Tim and his dog ,
there was a little girl named Sally . She was only
, there lived a boy named Tim . He had a

INTERVAL 0.313 - 2.192
CONTAINS 25.664%

hand . It went up in the air and joined other
Can we help you with the bags ?" The
the jungle for the wonderful night he had spent there .
very gentle as she strapped them on .
was so glad and started to play once more .

INTERVAL -1.566 - 0.313
CONTAINS 73.624%

Sally had gone to the lake and she was worried that
And each morning the sun kept pouring down its bright
Sam and walked with him . She said , " Don
" Mom my , mom my , look at the
, only 3 years old ! One day ,

INTERVAL -3.445 - -1.566
CONTAINS 0.497%

clean everything every day ." S ally 's
." M ia and Jack were sad , but
She said , " Please don 't take it away .
that was blowing in the wind . He was so excited
and asked if she could help it find its way home
Token When
Token ID1649
Feature activation-6.47e-01

Pos logit contributions
baby+0.006
 ambulance+0.006
 robots+0.006
 lawyers+0.006
 hall+0.005
Neg logit contributions
 mun-0.012
oured-0.012
 gul-0.012
int-0.012
 cu-0.011
Token Daisy
Token ID40355
Feature activation+2.12e-01

Pos logit contributions
baby+0.016
 ambulance+0.014
 robots+0.014
 lawyers+0.014
 dragons+0.014
Neg logit contributions
 mun-0.029
oured-0.028
 gul-0.027
 cu-0.027
int-0.027
Token put
Token ID1234
Feature activation-5.89e-01

Pos logit contributions
 pleasure+0.008
leaf+0.008
 bait+0.008
oured+0.008
 souven+0.008
Neg logit contributions
baby-0.006
 monster-0.006
 hall-0.006
 ambulance-0.005
 alone-0.005
Token the
Token ID262
Feature activation+1.84e-02

Pos logit contributions
baby+0.014
 ambulance+0.013
 robots+0.013
 dragons+0.012
 lawyers+0.012
Neg logit contributions
 mun-0.026
oured-0.026
 gul-0.025
 cu-0.025
int-0.024
Token magn
Token ID7842
Feature activation+2.59e-01

Pos logit contributions
 mun+0.001
oured+0.001
int+0.001
 cu+0.001
 gul+0.001
Neg logit contributions
 ambulance-0.000
 monster-0.000
 hall-0.000
baby-0.000
 kids-0.000
Tokenifying
Token ID4035
Feature activation+4.07e+00

Pos logit contributions
 mun+0.008
 gul+0.007
 cu+0.007
 pleasure+0.007
 bait+0.007
Neg logit contributions
baby-0.010
 dragons-0.010
 robots-0.009
 alone-0.009
 anymore-0.009
Token glass
Token ID5405
Feature activation-2.77e-01

Pos logit contributions
 mun+0.159
oured+0.157
int+0.155
leaf+0.152
 cu+0.151
Neg logit contributions
 hall-0.109
 monster-0.107
baby-0.107
 alone-0.104
 basement-0.102
Token down
Token ID866
Feature activation+8.37e-01

Pos logit contributions
baby+0.007
 ambulance+0.007
 robots+0.007
 lawyers+0.007
 villains+0.007
Neg logit contributions
 mun-0.012
oured-0.011
 gul-0.011
 cu-0.011
 bait-0.011
Token,
Token ID11
Feature activation+8.55e-02

Pos logit contributions
 mun+0.039
oured+0.039
 gul+0.037
int+0.037
 cu+0.037
Neg logit contributions
baby-0.018
 ambulance-0.016
 robots-0.016
 dragons-0.016
 monster-0.016
Token the
Token ID262
Feature activation+2.37e-01

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
 bait+0.004
Neg logit contributions
baby-0.002
 ambulance-0.002
 robots-0.002
 lawyers-0.002
 dragons-0.002
Token cater
Token ID20825
Feature activation+2.79e-02

Pos logit contributions
oured+0.011
 mun+0.011
int+0.011
 gul+0.010
 cu+0.010
Neg logit contributions
baby-0.005
 monster-0.005
 ambulance-0.005
 hall-0.005
 robots-0.005
Token their
Token ID511
Feature activation+7.35e-01

Pos logit contributions
 mun+0.001
oured+0.001
 gul+0.001
 cu+0.001
 bait+0.001
Neg logit contributions
baby-0.001
 ambulance-0.001
 robots-0.001
 hall-0.001
 villains-0.001
Token salad
Token ID20698
Feature activation+6.61e-01

Pos logit contributions
 mun+0.029
oured+0.029
 gul+0.027
 cu+0.027
int+0.027
Neg logit contributions
baby-0.021
 robots-0.020
 dragons-0.020
 ambulance-0.020
 lawyers-0.019
Token and
Token ID290
Feature activation+7.34e-01

Pos logit contributions
 mun+0.023
oured+0.021
 cu+0.021
 bait+0.020
 gul+0.020
Neg logit contributions
baby-0.023
 ambulance-0.022
 robots-0.021
 lawyers-0.021
 monster-0.020
Token have
Token ID423
Feature activation-2.13e-01

Pos logit contributions
 mun+0.025
oured+0.024
 gul+0.023
 cu+0.023
 bait+0.022
Neg logit contributions
baby-0.026
 ambulance-0.024
 robots-0.024
 lawyers-0.023
 monster-0.023
Token happy
Token ID3772
Feature activation+8.64e-01

Pos logit contributions
baby+0.007
 robots+0.007
 dragons+0.007
 ambulance+0.007
 lawyers+0.007
Neg logit contributions
 mun-0.007
oured-0.007
 gul-0.007
 cu-0.007
int-0.007
Token pic
Token ID8301
Feature activation+3.94e+00

Pos logit contributions
 mun+0.040
 gul+0.037
oured+0.037
 cu+0.036
 bait+0.036
Neg logit contributions
baby-0.021
 ambulance-0.019
 robots-0.017
 hall-0.017
 lawyers-0.017
Tokenn
Token ID77
Feature activation+6.59e-01

Pos logit contributions
 cu+0.103
oured+0.101
 gul+0.100
int+0.098
 mun+0.097
Neg logit contributions
baby-0.157
lf-0.145
 villains-0.144
 anymore-0.142
 dragons-0.138
Tokenics
Token ID873
Feature activation+4.94e-01

Pos logit contributions
 mun+0.022
oured+0.021
int+0.020
 planted+0.019
 gul+0.019
Neg logit contributions
baby-0.023
 monster-0.022
 ambulance-0.022
 hall-0.021
 robots-0.021
Token together
Token ID1978
Feature activation-1.51e-01

Pos logit contributions
 mun+0.019
oured+0.018
 cu+0.017
 gul+0.017
 bait+0.017
Neg logit contributions
baby-0.016
 ambulance-0.015
 robots-0.014
 lawyers-0.014
 hall-0.014
Token.
Token ID13
Feature activation+2.35e-01

Pos logit contributions
baby+0.003
 ambulance+0.003
 robots+0.003
 lawyers+0.002
 dragons+0.002
Neg logit contributions
 mun-0.007
oured-0.007
 gul-0.007
 cu-0.007
int-0.007
Token<|endoftext|>
Token ID50256
Feature activation-5.08e-01

Pos logit contributions
 mun+0.008
oured+0.007
 gul+0.007
 pleasure+0.007
 cu+0.007
Neg logit contributions
 ambulance-0.008
baby-0.008
 lawyers-0.008
 monster-0.007
 alone-0.007
Token flashlight
Token ID38371
Feature activation-5.57e-02

Pos logit contributions
 mun+0.005
oured+0.005
int+0.005
 gul+0.005
 cu+0.005
Neg logit contributions
baby-0.003
 ambulance-0.003
 monster-0.003
 robots-0.003
 hall-0.003
Token,
Token ID11
Feature activation+3.80e-01

Pos logit contributions
baby+0.001
 ambulance+0.001
 robots+0.001
 hall+0.001
 monster+0.001
Neg logit contributions
 mun-0.002
oured-0.002
 gul-0.002
int-0.002
 cu-0.002
Token and
Token ID290
Feature activation-1.34e-02

Pos logit contributions
oured+0.019
int+0.018
 mun+0.018
mit+0.018
OW+0.017
Neg logit contributions
 robots-0.008
 dragons-0.008
 cats-0.008
 lawyers-0.008
 ambulance-0.008
Token a
Token ID257
Feature activation+3.87e-01

Pos logit contributions
 robots+0.000
 dragons+0.000
 lawyers+0.000
baby+0.000
 ambulance+0.000
Neg logit contributions
oured-0.001
int-0.000
 mun-0.000
 gul-0.000
 cu-0.000
Token magn
Token ID7842
Feature activation+1.77e-01

Pos logit contributions
 mun+0.017
oured+0.016
int+0.016
 gul+0.016
 cu+0.015
Neg logit contributions
baby-0.010
 ambulance-0.009
 robots-0.009
 monster-0.009
 hall-0.009
Tokenifying
Token ID4035
Feature activation+3.65e+00

Pos logit contributions
 mun+0.005
 gul+0.004
 cu+0.004
 bait+0.004
 pleasure+0.004
Neg logit contributions
baby-0.007
 dragons-0.007
 anymore-0.007
 robots-0.007
 ambulance-0.007
Token glass
Token ID5405
Feature activation-8.16e-02

Pos logit contributions
 mun+0.138
oured+0.136
int+0.133
 cu+0.131
leaf+0.131
Neg logit contributions
baby-0.107
 monster-0.106
 hall-0.105
 ambulance-0.103
 alone-0.101
Token.
Token ID13
Feature activation-6.93e-02

Pos logit contributions
baby+0.002
 ambulance+0.002
 robots+0.002
 lawyers+0.002
 dragons+0.002
Neg logit contributions
 mun-0.004
oured-0.004
 gul-0.004
 cu-0.004
 bait-0.003
Token
Token ID198
Feature activation-2.21e-01

Pos logit contributions
baby+0.001
 robots+0.001
 lawyers+0.001
 ambulance+0.001
 hall+0.001
Neg logit contributions
 mun-0.003
oured-0.003
 gul-0.003
int-0.003
 bait-0.003
Token
Token ID198
Feature activation-2.63e-01

Pos logit contributions
baby+0.007
 ambulance+0.006
 robots+0.006
 lawyers+0.006
 dragons+0.006
Neg logit contributions
 mun-0.008
oured-0.008
 gul-0.008
 cu-0.008
int-0.008
TokenAs
Token ID1722
Feature activation-7.90e-01

Pos logit contributions
baby+0.006
 ambulance+0.005
 robots+0.005
 lawyers+0.005
 dragons+0.005
Neg logit contributions
 mun-0.012
oured-0.012
 gul-0.011
 cu-0.011
int-0.011
Token
Token ID198
Feature activation-5.49e-01

Pos logit contributions
baby+0.019
 ambulance+0.017
 robots+0.017
 lawyers+0.016
 dragons+0.016
Neg logit contributions
 mun-0.023
oured-0.022
 gul-0.022
 cu-0.021
int-0.021
TokenHe
Token ID1544
Feature activation-5.40e-01

Pos logit contributions
baby+0.015
 ambulance+0.013
 robots+0.013
 dragons+0.012
 anymore+0.012
Neg logit contributions
 mun-0.023
oured-0.022
 gul-0.022
 cu-0.022
leaf-0.021
Token pointed
Token ID6235
Feature activation+8.66e-01

Pos logit contributions
lf+0.016
 dragons+0.016
 monster+0.015
 teacher+0.015
 kids+0.014
Neg logit contributions
 pleasure-0.023
oured-0.020
 bait-0.019
leaf-0.019
arest-0.019
Token his
Token ID465
Feature activation-3.65e-01

Pos logit contributions
 mun+0.042
oured+0.039
leaf+0.038
int+0.038
 gul+0.037
Neg logit contributions
baby-0.018
 alone-0.018
 dragons-0.017
 ambulance-0.017
 anymore-0.017
Token magn
Token ID7842
Feature activation+9.25e-01

Pos logit contributions
 monster+0.009
baby+0.009
 robots+0.009
 sister+0.009
 dragons+0.009
Neg logit contributions
 mun-0.016
oured-0.015
 gul-0.015
leaf-0.015
 cu-0.015
Tokenifying
Token ID4035
Feature activation+3.63e+00

Pos logit contributions
 mun+0.029
 pleasure+0.025
 bait+0.024
leaf+0.024
 planted+0.023
Neg logit contributions
baby-0.036
 robots-0.033
 dragons-0.033
 monster-0.033
es-0.032
Token glass
Token ID5405
Feature activation+2.35e-01

Pos logit contributions
 mun+0.132
oured+0.128
 cu+0.121
int+0.121
 gul+0.120
Neg logit contributions
baby-0.118
 ambulance-0.111
 monster-0.109
 robots-0.108
 hall-0.108
Token at
Token ID379
Feature activation-1.60e-01

Pos logit contributions
 mun+0.010
oured+0.009
 gul+0.009
 cu+0.009
int+0.009
Neg logit contributions
baby-0.007
 ambulance-0.006
 robots-0.006
 lawyers-0.006
 dragons-0.006
Token different
Token ID1180
Feature activation+6.21e-01

Pos logit contributions
baby+0.004
 ambulance+0.004
 robots+0.004
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.007
oured-0.007
 gul-0.007
 cu-0.007
int-0.006
Token classmates
Token ID28999
Feature activation-1.08e-01

Pos logit contributions
 mun+0.027
oured+0.027
int+0.025
 gul+0.025
 cu+0.025
Neg logit contributions
baby-0.016
 robots-0.015
 dragons-0.015
 ambulance-0.014
 lawyers-0.014
Token and
Token ID290
Feature activation+3.25e-01

Pos logit contributions
baby+0.003
 ambulance+0.003
 robots+0.002
 lawyers+0.002
 dragons+0.002
Neg logit contributions
 mun-0.005
oured-0.005
 gul-0.004
 cu-0.004
int-0.004
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
 mun+0.013
oured+0.012
 gul+0.012
 cu+0.012
int+0.011
Neg logit contributions
baby-0.009
 ambulance-0.009
 robots-0.009
 lawyers-0.008
 dragons-0.008
TokenHe
Token ID1544
Feature activation+2.30e-02

Pos logit contributions
 mun+0.009
oured+0.008
 gul+0.008
 cu+0.008
int+0.008
Neg logit contributions
baby-0.007
 ambulance-0.006
 robots-0.006
 lawyers-0.006
 dragons-0.006
Token brought
Token ID3181
Feature activation+4.36e-01

Pos logit contributions
 pleasure+0.001
leaf+0.001
oured+0.001
 sprink+0.001
 mun+0.001
Neg logit contributions
 dragons-0.001
 monsters-0.001
 alone-0.001
 monster-0.001
 hall-0.001
Token his
Token ID465
Feature activation-6.22e-01

Pos logit contributions
 mun+0.020
oured+0.019
 cu+0.018
int+0.018
mit+0.018
Neg logit contributions
 dragons-0.011
 monster-0.010
baby-0.010
 alone-0.010
 robots-0.010
Token magn
Token ID7842
Feature activation+6.59e-01

Pos logit contributions
baby+0.015
 ambulance+0.014
 monster+0.014
 robots+0.014
 lawyers+0.014
Neg logit contributions
 mun-0.028
oured-0.028
int-0.026
 gul-0.026
 cu-0.026
Tokenifying
Token ID4035
Feature activation+3.61e+00

Pos logit contributions
 mun+0.020
 pleasure+0.018
leaf+0.018
 Pet+0.017
 bait+0.017
Neg logit contributions
baby-0.023
 alone-0.022
 dragons-0.022
 robots-0.022
 anymore-0.021
Token glass
Token ID5405
Feature activation+2.17e-01

Pos logit contributions
 mun+0.135
oured+0.131
int+0.127
 cu+0.124
leaf+0.123
Neg logit contributions
baby-0.109
 monster-0.106
 ambulance-0.105
 dragons-0.104
 hall-0.104
Token and
Token ID290
Feature activation-1.09e-01

Pos logit contributions
 mun+0.008
oured+0.008
 gul+0.008
 cu+0.008
int+0.008
Neg logit contributions
baby-0.006
 ambulance-0.006
 robots-0.006
 dragons-0.006
 monster-0.006
Token looked
Token ID3114
Feature activation+7.04e-01

Pos logit contributions
 dragons+0.003
 monsters+0.003
 robots+0.003
 monster+0.003
 mice+0.003
Neg logit contributions
oured-0.004
int-0.004
 pleasure-0.004
leaf-0.004
rig-0.004
Token closely
Token ID7173
Feature activation+1.30e+00

Pos logit contributions
 mun+0.033
oured+0.031
leaf+0.031
OW+0.031
int+0.031
Neg logit contributions
 alone-0.016
 dragons-0.014
 anymore-0.013
 robots-0.013
 monster-0.013
Token.
Token ID13
Feature activation-9.59e-02

Pos logit contributions
 mun+0.074
oured+0.072
leaf+0.072
 gul+0.069
 pleasure+0.068
Neg logit contributions
baby-0.013
 anymore-0.013
 alone-0.013
 dragons-0.013
 robots-0.011
Token She
Token ID1375
Feature activation-7.00e-01

Pos logit contributions
baby+0.012
 ambulance+0.011
 robots+0.011
 dragons+0.011
 monster+0.011
Neg logit contributions
 mun-0.021
oured-0.021
 cu-0.020
 gul-0.020
int-0.019
Token takes
Token ID2753
Feature activation-3.45e-01

Pos logit contributions
 dragons+0.019
 robots+0.019
 ambulance+0.019
 lawyers+0.018
baby+0.018
Neg logit contributions
 mun-0.028
oured-0.027
 bait-0.026
 pleasure-0.025
leaf-0.025
Token out
Token ID503
Feature activation+1.32e+00

Pos logit contributions
baby+0.010
 robots+0.009
 ambulance+0.009
 lawyers+0.009
 dragons+0.009
Neg logit contributions
 mun-0.015
oured-0.013
 cu-0.013
int-0.013
 gul-0.013
Token a
Token ID257
Feature activation+2.22e-01

Pos logit contributions
 mun+0.047
oured+0.045
 gul+0.042
 cu+0.042
int+0.042
Neg logit contributions
baby-0.045
 ambulance-0.043
 robots-0.043
 lawyers-0.042
 dragons-0.042
Token magn
Token ID7842
Feature activation+4.43e-01

Pos logit contributions
 mun+0.009
oured+0.009
 gul+0.009
int+0.009
 cu+0.008
Neg logit contributions
baby-0.006
 monster-0.006
 hall-0.006
 ambulance-0.006
 robots-0.006
Tokenifying
Token ID4035
Feature activation+3.60e+00

Pos logit contributions
 mun+0.013
 pleasure+0.011
 bait+0.010
 gul+0.010
 planted+0.010
Neg logit contributions
baby-0.018
 dragons-0.017
 robots-0.016
 anymore-0.016
 monster-0.016
Token glass
Token ID5405
Feature activation+1.31e-01

Pos logit contributions
 mun+0.138
oured+0.137
int+0.134
leaf+0.129
 cu+0.129
Neg logit contributions
 monster-0.104
baby-0.102
 hall-0.101
 dragons-0.099
 ambulance-0.098
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
 mun+0.005
oured+0.005
 gul+0.005
 cu+0.005
 bait+0.005
Neg logit contributions
baby-0.004
 robots-0.003
 lawyers-0.003
 ambulance-0.003
 villains-0.003
Token She
Token ID1375
Feature activation-5.05e-01

Pos logit contributions
baby+0.000
 robots+0.000
 lawyers+0.000
 ambulance+0.000
 hall+0.000
Neg logit contributions
 mun-0.001
oured-0.001
 gul-0.001
int-0.001
 bait-0.001
Token looks
Token ID3073
Feature activation+7.98e-01

Pos logit contributions
 dragons+0.013
 robots+0.013
 hall+0.012
 lawyers+0.012
 monster+0.012
Neg logit contributions
 mun-0.021
oured-0.020
 bait-0.020
 pleasure-0.019
int-0.019
Token at
Token ID379
Feature activation+7.96e-02

Pos logit contributions
 mun+0.040
oured+0.037
leaf+0.036
OW+0.036
 bait+0.036
Neg logit contributions
 alone-0.017
 dragons-0.015
 anymore-0.015
 mess-0.014
 monster-0.014
Token borrowed
Token ID22546
Feature activation+3.37e-02

Pos logit contributions
 dragons+0.006
baby+0.006
 monster+0.006
lf+0.006
 robots+0.006
Neg logit contributions
oured-0.010
 pleasure-0.010
 bait-0.009
leaf-0.009
 mun-0.009
Token her
Token ID607
Feature activation-4.14e-01

Pos logit contributions
int+0.002
 mun+0.002
oured+0.002
rig+0.002
astically+0.002
Neg logit contributions
 mice-0.001
 cats-0.001
 dragons-0.001
 monster-0.001
 robots-0.001
Token mom
Token ID1995
Feature activation+3.55e-01

Pos logit contributions
 monster+0.011
 house+0.010
 alone+0.010
 sister+0.010
 dragons+0.010
Neg logit contributions
 mun-0.018
int-0.018
oured-0.017
leaf-0.017
 gul-0.016
Token's
Token ID338
Feature activation+8.29e-01

Pos logit contributions
 mun+0.020
oured+0.019
int+0.019
leaf+0.019
 pleasure+0.019
Neg logit contributions
baby-0.005
 robots-0.004
 cats-0.004
 alone-0.004
 dragons-0.004
Token magn
Token ID7842
Feature activation+7.56e-01

Pos logit contributions
 mun+0.035
oured+0.035
int+0.035
leaf+0.033
icol+0.031
Neg logit contributions
 house-0.022
 monster-0.021
 alone-0.021
 ambulance-0.020
 dragons-0.020
Tokenifying
Token ID4035
Feature activation+3.58e+00

Pos logit contributions
 mun+0.022
 bait+0.020
 pleasure+0.020
leaf+0.019
 planted+0.019
Neg logit contributions
baby-0.027
 anymore-0.026
 dragons-0.026
 alone-0.026
 robots-0.025
Token glass
Token ID5405
Feature activation+1.06e-01

Pos logit contributions
 mun+0.129
oured+0.127
int+0.121
 cu+0.119
leaf+0.119
Neg logit contributions
baby-0.114
 monster-0.109
 ambulance-0.108
 dragons-0.106
 hall-0.106
Token to
Token ID284
Feature activation-3.74e-01

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
int+0.004
Neg logit contributions
baby-0.003
 ambulance-0.003
 robots-0.003
 dragons-0.003
 monster-0.003
Token take
Token ID1011
Feature activation-9.40e-01

Pos logit contributions
 dragons+0.010
 alone+0.010
 monster+0.010
 monsters+0.009
 cats+0.009
Neg logit contributions
oured-0.015
int-0.014
 mun-0.014
icol-0.014
 gul-0.013
Token a
Token ID257
Feature activation-1.74e-01

Pos logit contributions
 mice+0.013
baby+0.013
 alone+0.013
 dragons+0.013
 cats+0.012
Neg logit contributions
oured-0.050
 mun-0.050
int-0.049
mit-0.048
 cu-0.047
Token closer
Token ID5699
Feature activation+7.14e-01

Pos logit contributions
 monster+0.005
baby+0.005
 hall+0.005
 fort+0.005
 alone+0.005
Neg logit contributions
oured-0.006
 mun-0.006
int-0.006
 gul-0.006
leaf-0.006
Token train
Token ID4512
Feature activation+1.48e+00

Pos logit contributions
baby+0.002
 lawyers+0.002
 robots+0.002
 villains+0.002
 ambulance+0.002
Neg logit contributions
 mun-0.003
oured-0.003
 gul-0.003
 cu-0.003
 dip-0.003
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
 mun+0.063
oured+0.062
 gul+0.060
 cu+0.059
int+0.058
Neg logit contributions
baby-0.039
 ambulance-0.037
 robots-0.036
 dragons-0.035
 lawyers-0.035
Token They
Token ID1119
Feature activation-2.29e-01

Pos logit contributions
 mun+0.007
oured+0.007
int+0.007
 gul+0.007
 cu+0.007
Neg logit contributions
baby-0.003
 lawyers-0.003
 robots-0.003
 ambulance-0.003
 dragons-0.003
Token put
Token ID1234
Feature activation-2.04e-01

Pos logit contributions
 robots+0.007
 ambulance+0.007
 hall+0.007
 dragons+0.007
baby+0.007
Neg logit contributions
oured-0.009
 mun-0.009
int-0.008
 gul-0.008
OW-0.008
Token a
Token ID257
Feature activation+4.48e-01

Pos logit contributions
baby+0.005
 ambulance+0.004
 robots+0.004
 dragons+0.004
 lawyers+0.004
Neg logit contributions
 mun-0.010
oured-0.009
 gul-0.009
int-0.009
 cu-0.009
Token kettle
Token ID40231
Feature activation+3.55e+00

Pos logit contributions
 mun+0.020
oured+0.020
 gul+0.019
int+0.019
 cu+0.018
Neg logit contributions
baby-0.011
 ambulance-0.011
 robots-0.010
 hall-0.010
 monster-0.010
Token on
Token ID319
Feature activation-4.83e-02

Pos logit contributions
 mun+0.186
oured+0.181
 pleasure+0.179
 gul+0.177
OW+0.173
Neg logit contributions
 ambulance-0.062
baby-0.058
 hall-0.053
 twins-0.052
 fort-0.052
Token the
Token ID262
Feature activation+3.92e-01

Pos logit contributions
baby+0.001
 ambulance+0.001
 robots+0.001
 dragons+0.001
 hall+0.001
Neg logit contributions
 mun-0.002
oured-0.002
int-0.002
leaf-0.002
mit-0.002
Token stove
Token ID27635
Feature activation+1.82e+00

Pos logit contributions
 mun+0.018
oured+0.018
int+0.017
 gul+0.017
 cu+0.017
Neg logit contributions
baby-0.009
 ambulance-0.009
 robots-0.008
 hall-0.008
 monster-0.008
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
 mun+0.078
oured+0.076
 gul+0.073
 cu+0.073
 bait+0.071
Neg logit contributions
baby-0.047
 ambulance-0.043
 robots-0.043
 lawyers-0.042
 dragons-0.041
Token They
Token ID1119
Feature activation-1.53e-01

Pos logit contributions
 mun+0.006
oured+0.006
int+0.005
mit+0.005
 gul+0.005
Neg logit contributions
baby-0.002
 lawyers-0.002
 robots-0.002
 ambulance-0.002
 twins-0.002
Token,
Token ID11
Feature activation+2.71e-01

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
int+0.004
Neg logit contributions
baby-0.003
 ambulance-0.003
 robots-0.003
 lawyers-0.003
 dragons-0.003
Token next
Token ID1306
Feature activation+4.03e-01

Pos logit contributions
int+0.012
 mun+0.012
OW+0.012
oured+0.012
 bait+0.012
Neg logit contributions
 alone-0.006
 hall-0.006
 robots-0.005
 lawyers-0.005
 ambulance-0.005
Token to
Token ID284
Feature activation-2.73e-01

Pos logit contributions
 mun+0.014
oured+0.014
 gul+0.013
 cu+0.013
int+0.013
Neg logit contributions
baby-0.014
 ambulance-0.013
 robots-0.013
 lawyers-0.012
 dragons-0.012
Token her
Token ID607
Feature activation-4.92e-01

Pos logit contributions
baby+0.006
 ambulance+0.006
 robots+0.005
 lawyers+0.005
 villains+0.005
Neg logit contributions
 mun-0.013
oured-0.012
 gul-0.012
 cu-0.012
 placed-0.012
Token magn
Token ID7842
Feature activation+2.74e-01

Pos logit contributions
baby+0.010
 ambulance+0.009
 robots+0.009
 lawyers+0.009
 dragons+0.009
Neg logit contributions
 mun-0.024
oured-0.023
 gul-0.022
 cu-0.022
int-0.022
Tokenifying
Token ID4035
Feature activation+3.53e+00

Pos logit contributions
 mun+0.007
 gul+0.006
 bait+0.006
 cu+0.006
leaf+0.005
Neg logit contributions
baby-0.012
 dragons-0.012
 robots-0.011
 ambulance-0.011
 monster-0.011
Token glass
Token ID5405
Feature activation-7.98e-02

Pos logit contributions
 mun+0.122
oured+0.118
int+0.113
 cu+0.112
 gul+0.111
Neg logit contributions
baby-0.121
 ambulance-0.114
 monster-0.112
 robots-0.112
 hall-0.111
Token.
Token ID13
Feature activation-1.98e-01

Pos logit contributions
baby+0.002
 lawyers+0.002
 robots+0.002
 ambulance+0.002
 villains+0.002
Neg logit contributions
 mun-0.003
oured-0.003
 cu-0.003
 gul-0.003
 placed-0.003
Token She
Token ID1375
Feature activation-5.06e-01

Pos logit contributions
baby+0.004
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.010
oured-0.010
 gul-0.009
 cu-0.009
int-0.009
Token looked
Token ID3114
Feature activation+6.86e-01

Pos logit contributions
 dragons+0.015
 hall+0.015
 sister+0.014
 teacher+0.014
 lawyers+0.014
Neg logit contributions
 pleasure-0.018
oured-0.018
 bait-0.018
leaf-0.017
OW-0.017
Token at
Token ID379
Feature activation+2.02e-01

Pos logit contributions
 mun+0.035
 bait+0.033
OW+0.032
oured+0.032
leaf+0.032
Neg logit contributions
 alone-0.014
 dragons-0.012
baby-0.011
 robots-0.011
 monster-0.011
Token so
Token ID523
Feature activation-9.04e-02

Pos logit contributions
baby+0.010
 alone+0.010
 ambulance+0.010
 twins+0.010
 lawyers+0.010
Neg logit contributions
 mun-0.015
oured-0.015
int-0.014
 cu-0.014
leaf-0.014
Token excited
Token ID6568
Feature activation+1.08e+00

Pos logit contributions
baby+0.003
 alone+0.003
 lawyers+0.003
 ambulance+0.003
 twins+0.003
Neg logit contributions
oured-0.003
 mun-0.003
 cu-0.003
 gul-0.003
 bait-0.003
Token!
Token ID0
Feature activation+1.26e-01

Pos logit contributions
oured+0.046
 mun+0.046
 gul+0.043
 cu+0.043
 bait+0.043
Neg logit contributions
baby-0.029
 ambulance-0.026
 robots-0.025
 monster-0.025
 dragons-0.025
Token She
Token ID1375
Feature activation-1.50e-02

Pos logit contributions
 mun+0.005
oured+0.005
 cu+0.005
 gul+0.005
 dip+0.004
Neg logit contributions
baby-0.004
 ambulance-0.003
 dragons-0.003
 villains-0.003
 monster-0.003
Token loved
Token ID6151
Feature activation+1.06e-01

Pos logit contributions
lf+0.000
baby+0.000
 dragons+0.000
 hall+0.000
 kids+0.000
Neg logit contributions
oured-0.001
 pleasure-0.001
int-0.001
leaf-0.001
 bait-0.001
Token pic
Token ID8301
Feature activation+3.47e+00

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
 bait+0.004
Neg logit contributions
baby-0.003
 ambulance-0.003
 robots-0.003
 lawyers-0.003
 villains-0.003
Tokenn
Token ID77
Feature activation+2.66e-01

Pos logit contributions
oured+0.082
 mun+0.082
int+0.079
 cu+0.078
 bait+0.076
Neg logit contributions
baby-0.148
lf-0.144
 hall-0.136
 ambulance-0.135
 dragons-0.131
Tokenics
Token ID873
Feature activation+4.15e-01

Pos logit contributions
 mun+0.010
oured+0.009
 gul+0.009
 cu+0.009
int+0.009
Neg logit contributions
baby-0.008
 robots-0.008
 villains-0.008
 monster-0.008
 ambulance-0.008
Token and
Token ID290
Feature activation+3.57e-01

Pos logit contributions
 mun+0.016
oured+0.016
 gul+0.015
 cu+0.015
 bait+0.014
Neg logit contributions
baby-0.013
 ambulance-0.012
 robots-0.011
 lawyers-0.011
 dragons-0.011
Token spending
Token ID4581
Feature activation+7.69e-01

Pos logit contributions
int+0.013
oured+0.012
 mun+0.011
leaf+0.011
 cu+0.011
Neg logit contributions
baby-0.011
 robots-0.011
 hall-0.011
 dragons-0.011
 kids-0.011
Token time
Token ID640
Feature activation+2.07e-01

Pos logit contributions
 mun+0.025
oured+0.024
 gul+0.023
 cu+0.023
 bait+0.022
Neg logit contributions
baby-0.029
 ambulance-0.027
 robots-0.026
 lawyers-0.026
 dragons-0.026
Token her
Token ID607
Feature activation-3.18e-01

Pos logit contributions
baby+0.014
 ambulance+0.012
 villains+0.012
 robots+0.011
 lawyers+0.011
Neg logit contributions
 mun-0.032
oured-0.031
 gul-0.031
 cu-0.031
 dip-0.030
Token fairy
Token ID25607
Feature activation+6.67e-01

Pos logit contributions
baby+0.007
 ambulance+0.006
 lawyers+0.005
 robots+0.005
 dragons+0.005
Neg logit contributions
 mun-0.015
oured-0.015
 cu-0.015
 gul-0.015
 dip-0.014
Token doll
Token ID3654
Feature activation+2.96e-01

Pos logit contributions
 mun+0.028
oured+0.027
 cu+0.026
 dip+0.026
 gul+0.026
Neg logit contributions
baby-0.017
ulk-0.016
 ambulance-0.015
 villains-0.014
 lawyers-0.014
Token.
Token ID13
Feature activation-1.74e-01

Pos logit contributions
 mun+0.011
oured+0.011
 favourite+0.010
 placed+0.010
 cu+0.010
Neg logit contributions
baby-0.009
ulk-0.008
 villains-0.008
 ambulance-0.007
loo-0.007
Token She
Token ID1375
Feature activation-6.27e-01

Pos logit contributions
baby+0.006
 ambulance+0.006
 monster+0.006
 villains+0.006
 monsters+0.006
Neg logit contributions
oured-0.006
 mun-0.006
 dip-0.005
 pleasure-0.005
 gul-0.005
Token named
Token ID3706
Feature activation+3.46e+00

Pos logit contributions
baby+0.015
 dragons+0.014
 hall+0.014
 robots+0.014
 villains+0.014
Neg logit contributions
oured-0.028
int-0.027
 pleasure-0.026
leaf-0.026
 bait-0.026
Token her
Token ID607
Feature activation-1.16e+00

Pos logit contributions
 mun+0.149
oured+0.144
 gul+0.144
 cu+0.140
 dip+0.139
Neg logit contributions
baby-0.084
 ambulance-0.077
 lawyers-0.073
 villains-0.071
 hall-0.070
Token fairy
Token ID25607
Feature activation+9.49e-01

Pos logit contributions
baby+0.021
 ambulance+0.019
 robots+0.019
 monster+0.018
 lawyers+0.018
Neg logit contributions
 mun-0.060
oured-0.058
 gul-0.057
int-0.056
 cu-0.056
Token doll
Token ID3654
Feature activation+4.85e-01

Pos logit contributions
 mun+0.043
oured+0.042
 gul+0.041
 cu+0.041
int+0.040
Neg logit contributions
baby-0.022
 ambulance-0.020
 robots-0.020
 lawyers-0.019
 dragons-0.019
Token Rose
Token ID8049
Feature activation+5.05e-01

Pos logit contributions
 mun+0.020
oured+0.020
 cu+0.019
 gul+0.019
 placed+0.018
Neg logit contributions
baby-0.013
 ambulance-0.012
 lawyers-0.011
 robots-0.011
 villains-0.011
Token and
Token ID290
Feature activation+4.34e-01

Pos logit contributions
 mun+0.021
oured+0.021
 gul+0.019
 cu+0.019
 placed+0.019
Neg logit contributions
baby-0.014
 ambulance-0.013
 robots-0.012
 lawyers-0.012
 monster-0.012
Token her
Token ID607
Feature activation-4.79e-01

Pos logit contributions
 mun+0.009
oured+0.009
 gul+0.009
int+0.009
 cu+0.009
Neg logit contributions
baby-0.005
 dragons-0.005
 ambulance-0.005
 robots-0.005
 monster-0.005
Token wand
Token ID11569
Feature activation+1.01e+00

Pos logit contributions
 alone+0.013
 dragons+0.013
 robots+0.012
 monster+0.012
baby+0.012
Neg logit contributions
 mun-0.021
oured-0.020
leaf-0.019
 gul-0.019
 bait-0.019
Token and
Token ID290
Feature activation+6.72e-01

Pos logit contributions
 mun+0.047
oured+0.046
 gul+0.044
 cu+0.044
int+0.043
Neg logit contributions
baby-0.022
 dragons-0.021
 alone-0.021
 ambulance-0.021
 robots-0.021
Token say
Token ID910
Feature activation+8.04e-01

Pos logit contributions
oured+0.024
 mun+0.024
int+0.023
 gul+0.022
 cu+0.021
Neg logit contributions
 dragons-0.022
 robots-0.022
 monster-0.021
baby-0.021
 villains-0.021
Token "
Token ID366
Feature activation+8.07e-01

Pos logit contributions
 mun+0.030
oured+0.029
 gul+0.028
 cu+0.028
int+0.027
Neg logit contributions
baby-0.025
 ambulance-0.023
 robots-0.023
 lawyers-0.022
 hall-0.022
Tokenab
Token ID397
Feature activation+3.45e+00

Pos logit contributions
oured+0.031
 cu+0.031
 bait+0.030
 gul+0.029
 mun+0.029
Neg logit contributions
baby-0.026
 robots-0.022
emaker-0.021
 dragons-0.021
 lawyers-0.020
Tokenrac
Token ID11510
Feature activation+1.19e-01

Pos logit contributions
 mun+0.094
leaf+0.084
 gul+0.083
oured+0.082
 bait+0.079
Neg logit contributions
baby-0.147
 sister-0.138
 ambulance-0.137
 monster-0.135
 robots-0.135
Tokenad
Token ID324
Feature activation-1.68e+00

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
int+0.003
Neg logit contributions
baby-0.004
 ambulance-0.004
 robots-0.004
 lawyers-0.004
 dragons-0.004
Tokenab
Token ID397
Feature activation+2.50e+00

Pos logit contributions
baby+0.064
 ambulance+0.060
 dragons+0.058
 alone+0.058
 monster+0.057
Neg logit contributions
 mun-0.052
oured-0.051
 gul-0.050
 cu-0.050
 bait-0.047
Tokenra
Token ID430
Feature activation+1.07e+00

Pos logit contributions
 mun+0.059
leaf+0.056
 ju+0.054
 gul+0.053
oured+0.051
Neg logit contributions
baby-0.111
 sister-0.108
 ambulance-0.104
 monster-0.102
es-0.102
Token"
Token ID1
Feature activation+7.22e-01

Pos logit contributions
 mun+0.047
leaf+0.045
oured+0.045
 gul+0.045
 bait+0.044
Neg logit contributions
baby-0.027
 ambulance-0.024
 monster-0.022
 dragons-0.022
 anymore-0.022
Token,
Token ID11
Feature activation-7.34e-01

Pos logit contributions
baby+0.004
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.007
oured-0.007
 gul-0.006
 cu-0.006
int-0.006
Token handing
Token ID22786
Feature activation+1.50e-01

Pos logit contributions
baby+0.014
 ambulance+0.012
 robots+0.012
 lawyers+0.011
 dragons+0.011
Neg logit contributions
 mun-0.037
oured-0.036
int-0.035
 gul-0.035
 cu-0.035
Token him
Token ID683
Feature activation+3.04e-01

Pos logit contributions
 mun+0.006
oured+0.006
 gul+0.006
 cu+0.006
 bait+0.006
Neg logit contributions
baby-0.004
 ambulance-0.003
 villains-0.003
 robots-0.003
 lawyers-0.003
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
 mun+0.012
oured+0.012
 gul+0.011
 cu+0.011
int+0.011
Neg logit contributions
baby-0.009
 ambulance-0.008
 robots-0.008
 dragons-0.008
 lawyers-0.008
Token magn
Token ID7842
Feature activation-3.11e-02

Pos logit contributions
 ambulance+0.000
 monster+0.000
 robots+0.000
 kids+0.000
 hall+0.000
Neg logit contributions
 mun-0.001
oured-0.001
int-0.001
 cu-0.001
 gul-0.001
Tokenifying
Token ID4035
Feature activation+3.44e+00

Pos logit contributions
baby+0.001
 dragons+0.001
 monster+0.001
 robots+0.001
 anymore+0.001
Neg logit contributions
 mun-0.001
 gul-0.001
 cu-0.001
 pleasure-0.001
oured-0.001
Token glass
Token ID5405
Feature activation-3.89e-01

Pos logit contributions
 mun+0.123
oured+0.120
int+0.114
 cu+0.114
 gul+0.112
Neg logit contributions
baby-0.112
 ambulance-0.106
 monster-0.106
 hall-0.104
 dragons-0.104
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
baby+0.010
 lawyers+0.009
 ambulance+0.009
 robots+0.009
 dragons+0.008
Neg logit contributions
 mun-0.017
oured-0.016
 gul-0.016
 cu-0.016
 bait-0.016
Token She
Token ID1375
Feature activation-4.88e-01

Pos logit contributions
baby+0.004
 ambulance+0.003
 monster+0.003
 dragons+0.003
 monsters+0.003
Neg logit contributions
 mun-0.004
oured-0.004
 cu-0.004
 gul-0.004
 pleasure-0.003
Token looked
Token ID3114
Feature activation+6.83e-01

Pos logit contributions
 dragons+0.013
lf+0.013
 kids+0.013
 monster+0.013
 twins+0.013
Neg logit contributions
 pleasure-0.019
oured-0.018
int-0.018
leaf-0.018
ical-0.017
Token around
Token ID1088
Feature activation+5.31e-01

Pos logit contributions
 mun+0.034
oured+0.032
 bait+0.031
leaf+0.031
int+0.031
Neg logit contributions
 alone-0.015
baby-0.013
 anymore-0.013
 dragons-0.012
 monster-0.012
Token her
Token ID607
Feature activation-3.06e-01

Pos logit contributions
baby+0.009
 ambulance+0.008
 villains+0.007
 robots+0.007
 lawyers+0.007
Neg logit contributions
 mun-0.020
 gul-0.019
oured-0.019
 cu-0.019
 bait-0.019
Token toy
Token ID13373
Feature activation+4.21e-01

Pos logit contributions
baby+0.007
 ambulance+0.006
 lawyers+0.006
 dragons+0.006
ulk+0.006
Neg logit contributions
 mun-0.014
oured-0.014
 cu-0.014
 gul-0.013
 bait-0.013
Token dragon
Token ID10441
Feature activation+6.94e-01

Pos logit contributions
 mun+0.018
 bait+0.017
 favourite+0.017
 gul+0.017
 dip+0.017
Neg logit contributions
baby-0.009
ulk-0.009
 villains-0.008
emaker-0.008
reating-0.008
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
 mun+0.027
 planted+0.025
 cu+0.025
 placed+0.024
 bait+0.024
Neg logit contributions
baby-0.021
ulk-0.020
 ambulance-0.018
reating-0.018
 principal-0.018
Token She
Token ID1375
Feature activation-7.23e-01

Pos logit contributions
baby+0.002
 ambulance+0.002
 robots+0.002
 lawyers+0.002
 hall+0.002
Neg logit contributions
 mun-0.006
oured-0.006
 cu-0.005
 gul-0.005
int-0.005
Token named
Token ID3706
Feature activation+3.43e+00

Pos logit contributions
 dragons+0.019
baby+0.019
 villains+0.019
 hall+0.018
 robots+0.018
Neg logit contributions
oured-0.032
int-0.029
 pleasure-0.029
leaf-0.029
 gul-0.028
Token him
Token ID683
Feature activation-1.92e-02

Pos logit contributions
 mun+0.145
oured+0.139
 gul+0.137
 cu+0.134
 dip+0.131
Neg logit contributions
baby-0.091
 ambulance-0.082
 lawyers-0.079
 robots-0.078
 hall-0.077
Token Max
Token ID5436
Feature activation-1.52e-01

Pos logit contributions
baby+0.001
 ambulance+0.000
 robots+0.000
 lawyers+0.000
 dragons+0.000
Neg logit contributions
 mun-0.001
oured-0.001
 gul-0.001
 cu-0.001
 bait-0.001
Token and
Token ID290
Feature activation-9.04e-02

Pos logit contributions
baby+0.004
 ambulance+0.004
 villains+0.004
 robots+0.004
 lawyers+0.004
Neg logit contributions
 mun-0.006
oured-0.006
 gul-0.006
 placed-0.006
 cu-0.006
Token took
Token ID1718
Feature activation-2.09e-01

Pos logit contributions
baby+0.002
 ambulance+0.002
 hall+0.002
 monster+0.002
 lawyers+0.002
Neg logit contributions
 mun-0.004
oured-0.004
 cu-0.004
 gul-0.004
 placed-0.004
Token him
Token ID683
Feature activation+4.00e-01

Pos logit contributions
baby+0.006
 ambulance+0.005
 robots+0.005
 hall+0.005
 lawyers+0.005
Neg logit contributions
 mun-0.009
oured-0.008
 gul-0.008
 cu-0.008
 bait-0.008
Token soft
Token ID2705
Feature activation+3.78e-01

Pos logit contributions
oured+0.002
 mun+0.002
int+0.002
 gul+0.002
 cu+0.002
Neg logit contributions
baby-0.002
 robots-0.001
 alone-0.001
 dragons-0.001
 monster-0.001
Token and
Token ID290
Feature activation-1.04e-01

Pos logit contributions
 mun+0.015
oured+0.014
 gul+0.014
 placed+0.014
 cu+0.014
Neg logit contributions
baby-0.010
 villains-0.010
 ambulance-0.010
 robots-0.010
 principal-0.010
Token brown
Token ID7586
Feature activation+8.38e-01

Pos logit contributions
baby+0.003
 robots+0.003
 ambulance+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.004
oured-0.004
 gul-0.004
 cu-0.004
int-0.003
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
 mun+0.034
oured+0.033
 gul+0.032
 cu+0.031
 placed+0.031
Neg logit contributions
baby-0.021
 ambulance-0.020
 lawyers-0.019
 robots-0.019
 villains-0.019
Token She
Token ID1375
Feature activation-7.08e-01

Pos logit contributions
baby+0.003
 ambulance+0.003
 monster+0.003
 dragons+0.003
 villains+0.003
Neg logit contributions
 mun-0.004
oured-0.004
 pleasure-0.004
 gul-0.004
 dip-0.004
Token named
Token ID3706
Feature activation+3.43e+00

Pos logit contributions
baby+0.018
 dragons+0.017
 villains+0.017
 robots+0.017
 alone+0.017
Neg logit contributions
oured-0.031
 pleasure-0.028
int-0.028
leaf-0.028
 bait-0.028
Token it
Token ID340
Feature activation+3.33e-01

Pos logit contributions
 mun+0.152
oured+0.150
 cu+0.144
int+0.143
 gul+0.143
Neg logit contributions
baby-0.085
 robots-0.077
 ambulance-0.076
 dragons-0.075
 lawyers-0.073
Token Brown
Token ID4373
Feature activation-2.80e-01

Pos logit contributions
 mun+0.014
oured+0.013
 gul+0.013
 cu+0.013
int+0.013
Neg logit contributions
baby-0.009
 robots-0.009
 ambulance-0.009
 dragons-0.008
 lawyers-0.008
Tokenie
Token ID494
Feature activation+2.01e-01

Pos logit contributions
baby+0.006
 ambulance+0.006
 lawyers+0.006
 villains+0.006
 robots+0.006
Neg logit contributions
 mun-0.013
oured-0.012
 gul-0.012
 cu-0.012
 placed-0.012
Token and
Token ID290
Feature activation-1.23e-01

Pos logit contributions
 mun+0.009
oured+0.009
 gul+0.009
 cu+0.009
int+0.009
Neg logit contributions
baby-0.005
 robots-0.004
 ambulance-0.004
 dragons-0.004
 lawyers-0.004
Token played
Token ID2826
Feature activation-9.19e-02

Pos logit contributions
baby+0.004
 robots+0.004
 ambulance+0.004
 dragons+0.003
 lawyers+0.003
Neg logit contributions
oured-0.005
 mun-0.005
 gul-0.004
int-0.004
 cu-0.004
Token the
Token ID262
Feature activation+9.49e-02

Pos logit contributions
 mun+0.010
oured+0.010
 placed+0.010
 gul+0.010
 cu+0.010
Neg logit contributions
baby-0.010
 lawyers-0.009
 villains-0.009
 ambulance-0.009
 dragons-0.008
Token watch
Token ID2342
Feature activation-6.13e-01

Pos logit contributions
 mun+0.004
oured+0.004
int+0.004
 gul+0.004
 cu+0.004
Neg logit contributions
 monster-0.003
 hall-0.002
 ambulance-0.002
baby-0.002
 dragons-0.002
Token with
Token ID351
Feature activation-6.80e-01

Pos logit contributions
baby+0.015
 ambulance+0.014
 robots+0.014
 lawyers+0.013
 dragons+0.013
Neg logit contributions
 mun-0.027
oured-0.026
 gul-0.025
 cu-0.025
int-0.025
Token the
Token ID262
Feature activation+2.71e-01

Pos logit contributions
baby+0.020
 monster+0.019
 alone+0.019
 dragons+0.019
 robots+0.019
Neg logit contributions
 mun-0.027
oured-0.026
int-0.025
 cu-0.024
 gul-0.024
Token magn
Token ID7842
Feature activation-1.50e-02

Pos logit contributions
oured+0.012
 mun+0.012
int+0.011
 cu+0.010
 gul+0.010
Neg logit contributions
 monster-0.008
 ambulance-0.008
 hall-0.007
 kids-0.007
 twins-0.007
Tokenifying
Token ID4035
Feature activation+3.42e+00

Pos logit contributions
baby+0.001
 dragons+0.001
 robots+0.001
 monster+0.001
 ambulance+0.001
Neg logit contributions
 mun-0.000
 cu-0.000
 gul-0.000
 pleasure-0.000
 bait-0.000
Token glass
Token ID5405
Feature activation-3.96e-01

Pos logit contributions
 mun+0.120
oured+0.118
 cu+0.113
int+0.112
 gul+0.109
Neg logit contributions
baby-0.113
 ambulance-0.107
 monster-0.107
 hall-0.105
 dragons-0.104
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
baby+0.010
 lawyers+0.009
 ambulance+0.009
 robots+0.009
 villains+0.008
Neg logit contributions
 mun-0.017
oured-0.017
 gul-0.016
 cu-0.016
 bait-0.016
Token He
Token ID679
Feature activation-2.61e-01

Pos logit contributions
baby+0.008
 ambulance+0.007
 robots+0.007
 lawyers+0.007
 dragons+0.006
Neg logit contributions
 mun-0.016
oured-0.015
 gul-0.015
 cu-0.015
int-0.014
Token saw
Token ID2497
Feature activation-1.17e-04

Pos logit contributions
 dragons+0.006
 monster+0.006
 kids+0.006
baby+0.006
 alone+0.006
Neg logit contributions
 pleasure-0.011
oured-0.011
leaf-0.011
 mun-0.010
ical-0.010
Token the
Token ID262
Feature activation-5.12e-02

Pos logit contributions
baby+0.000
 ambulance+0.000
 robots+0.000
 dragons+0.000
 lawyers+0.000
Neg logit contributions
 mun-0.000
oured-0.000
 gul-0.000
 cu-0.000
int-0.000
Token a
Token ID257
Feature activation-4.93e-01

Pos logit contributions
oured+0.015
 mun+0.015
 cu+0.014
int+0.014
 gul+0.014
Neg logit contributions
baby-0.014
 twins-0.014
 dragons-0.014
 robots-0.013
 lawyers-0.013
Token family
Token ID1641
Feature activation-5.28e-02

Pos logit contributions
baby+0.018
 robots+0.017
 ambulance+0.017
 lawyers+0.017
 dragons+0.017
Neg logit contributions
 mun-0.016
oured-0.014
 gul-0.014
 cu-0.014
 dip-0.014
Token who
Token ID508
Feature activation+2.97e-01

Pos logit contributions
baby+0.001
 ambulance+0.001
 robots+0.001
 lawyers+0.001
 hall+0.001
Neg logit contributions
 mun-0.002
oured-0.002
int-0.002
 gul-0.002
 cu-0.002
Token really
Token ID1107
Feature activation+1.19e+00

Pos logit contributions
oured+0.012
 mun+0.011
 cu+0.011
int+0.011
 gul+0.010
Neg logit contributions
baby-0.010
 robots-0.009
 alone-0.009
 hall-0.009
 lawyers-0.008
Token enjoyed
Token ID8359
Feature activation+8.40e-01

Pos logit contributions
oured+0.043
 cu+0.041
 mun+0.041
 gul+0.039
 bait+0.038
Neg logit contributions
baby-0.040
 alone-0.036
 villains-0.036
 hall-0.036
 twins-0.035
Token pic
Token ID8301
Feature activation+3.42e+00

Pos logit contributions
 mun+0.035
oured+0.035
 gul+0.033
 cu+0.033
int+0.033
Neg logit contributions
baby-0.022
 ambulance-0.021
 robots-0.020
 lawyers-0.020
 dragons-0.020
Tokenn
Token ID77
Feature activation+2.48e-01

Pos logit contributions
oured+0.082
 mun+0.080
int+0.077
 cu+0.077
 gul+0.076
Neg logit contributions
baby-0.145
lf-0.142
 hall-0.138
 dragons-0.136
 villains-0.136
Tokenics
Token ID873
Feature activation+5.29e-01

Pos logit contributions
 mun+0.010
oured+0.009
 gul+0.009
 cu+0.009
int+0.009
Neg logit contributions
baby-0.007
 ambulance-0.007
 robots-0.007
 monster-0.007
 lawyers-0.006
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
 mun+0.025
oured+0.024
 gul+0.023
 cu+0.023
int+0.023
Neg logit contributions
baby-0.012
 ambulance-0.011
 robots-0.010
 lawyers-0.010
 dragons-0.010
Token They
Token ID1119
Feature activation-4.79e-02

Pos logit contributions
oured+0.012
 mun+0.012
int+0.012
 gul+0.011
 cu+0.011
Neg logit contributions
baby-0.005
 hall-0.004
 robots-0.004
 lawyers-0.004
 ambulance-0.004
Token loved
Token ID6151
Feature activation+1.05e-01

Pos logit contributions
baby+0.002
 dragons+0.001
 robots+0.001
 ambulance+0.001
 lawyers+0.001
Neg logit contributions
oured-0.002
 mun-0.002
int-0.002
 gul-0.002
 cu-0.002
Token the
Token ID262
Feature activation-5.43e-02

Pos logit contributions
 dip+0.022
 sprink+0.022
 mun+0.022
 placed+0.022
 gul+0.022
Neg logit contributions
baby-0.027
 villains-0.025
 ambulance-0.025
iness-0.024
 villain-0.023
Token warm
Token ID5814
Feature activation-4.94e-01

Pos logit contributions
baby+0.002
 villains+0.002
iness+0.002
 robots+0.002
 lawyers+0.002
Neg logit contributions
 dip-0.002
 mun-0.002
 planted-0.002
 favourite-0.002
 gul-0.002
Token sunshine
Token ID34488
Feature activation-1.03e+00

Pos logit contributions
baby+0.018
 villains+0.018
 lawyers+0.018
 principal+0.017
 cheating+0.017
Neg logit contributions
 mun-0.015
oured-0.015
 dip-0.014
 gul-0.014
int-0.014
Token and
Token ID290
Feature activation+5.84e-01

Pos logit contributions
baby+0.031
 ambulance+0.028
 robots+0.027
 villains+0.027
 lawyers+0.027
Neg logit contributions
 mun-0.040
oured-0.039
 gul-0.037
 cu-0.037
 dip-0.036
Token having
Token ID1719
Feature activation-2.85e-01

Pos logit contributions
 mun+0.023
oured+0.023
 gul+0.022
 cu+0.022
int+0.021
Neg logit contributions
baby-0.017
 ambulance-0.016
 robots-0.016
 lawyers-0.015
 dragons-0.015
Token pic
Token ID8301
Feature activation+3.41e+00

Pos logit contributions
baby+0.009
 robots+0.009
 ambulance+0.009
 dragons+0.008
 lawyers+0.008
Neg logit contributions
 mun-0.010
oured-0.010
int-0.010
 gul-0.010
 cu-0.010
Tokenn
Token ID77
Feature activation-1.79e-01

Pos logit contributions
oured+0.082
 mun+0.082
 cu+0.082
 gul+0.081
int+0.080
Neg logit contributions
baby-0.139
lf-0.133
 hall-0.131
 anymore-0.129
 villains-0.129
Tokenics
Token ID873
Feature activation+2.78e-01

Pos logit contributions
baby+0.006
 monster+0.006
 robots+0.006
 ambulance+0.006
 villains+0.006
Neg logit contributions
 mun-0.006
oured-0.006
int-0.005
 placed-0.005
 gul-0.005
Token with
Token ID351
Feature activation-5.44e-01

Pos logit contributions
 mun+0.011
oured+0.011
 cu+0.011
 gul+0.011
 bait+0.010
Neg logit contributions
baby-0.008
 ambulance-0.007
 robots-0.007
 lawyers-0.007
 monster-0.007
Token her
Token ID607
Feature activation-1.29e-01

Pos logit contributions
baby+0.011
 ambulance+0.010
 robots+0.010
 dragons+0.009
 lawyers+0.009
Neg logit contributions
 mun-0.026
oured-0.026
 gul-0.025
 cu-0.025
int-0.025
Token family
Token ID1641
Feature activation+3.32e-01

Pos logit contributions
baby+0.002
 ambulance+0.002
 robots+0.002
 lawyers+0.002
 dragons+0.002
Neg logit contributions
 mun-0.007
oured-0.006
 gul-0.006
 cu-0.006
 bait-0.006
Token her
Token ID607
Feature activation-4.01e-01

Pos logit contributions
baby+0.011
 ambulance+0.009
 villains+0.009
 robots+0.009
 lawyers+0.009
Neg logit contributions
 mun-0.025
oured-0.024
 gul-0.024
 cu-0.024
 bait-0.024
Token cute
Token ID13779
Feature activation+1.13e+00

Pos logit contributions
baby+0.010
ulk+0.009
 ambulance+0.009
reating+0.009
 villains+0.008
Neg logit contributions
 favourite-0.017
 mun-0.017
 cu-0.016
 dip-0.016
 gul-0.016
Token doll
Token ID3654
Feature activation+3.20e-02

Pos logit contributions
 mun+0.046
 dip+0.044
 gul+0.043
oured+0.043
 favourite+0.043
Neg logit contributions
baby-0.028
ulk-0.028
 lawyers-0.025
 principal-0.025
 villains-0.025
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
 mun+0.001
 planted+0.001
oured+0.001
 favourite+0.001
 cu+0.001
Neg logit contributions
baby-0.001
ulk-0.001
 villains-0.001
 ambulance-0.001
reating-0.001
Token She
Token ID1375
Feature activation-8.01e-01

Pos logit contributions
baby+0.003
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 monster+0.003
Neg logit contributions
 mun-0.006
oured-0.005
 gul-0.005
 cu-0.005
 dip-0.005
Token named
Token ID3706
Feature activation+3.41e+00

Pos logit contributions
baby+0.021
 dragons+0.020
 hall+0.020
 robots+0.019
 villains+0.019
Neg logit contributions
oured-0.035
int-0.033
 pleasure-0.033
leaf-0.032
 mun-0.032
Token her
Token ID607
Feature activation-1.04e+00

Pos logit contributions
 mun+0.156
oured+0.151
 gul+0.149
 cu+0.145
 dip+0.144
Neg logit contributions
baby-0.076
 ambulance-0.068
 lawyers-0.065
 robots-0.064
 villains-0.063
Token doll
Token ID3654
Feature activation+3.69e-01

Pos logit contributions
baby+0.022
 ambulance+0.019
 robots+0.019
 lawyers+0.019
 dragons+0.018
Neg logit contributions
 mun-0.050
oured-0.049
 gul-0.047
 cu-0.047
int-0.046
Token Lily
Token ID20037
Feature activation+1.66e-01

Pos logit contributions
 mun+0.013
oured+0.013
 placed+0.012
 planted+0.012
 bought+0.012
Neg logit contributions
baby-0.012
ulk-0.011
 villains-0.011
 virus-0.010
iness-0.010
Token and
Token ID290
Feature activation+3.34e-01

Pos logit contributions
 mun+0.006
oured+0.006
 placed+0.006
 planted+0.006
 cu+0.006
Neg logit contributions
baby-0.005
 ambulance-0.005
 villains-0.005
 monster-0.005
ulk-0.005
Token gave
Token ID2921
Feature activation+5.65e-01

Pos logit contributions
 mun+0.013
oured+0.012
 cu+0.012
 gul+0.012
 placed+0.012
Neg logit contributions
baby-0.010
 ambulance-0.009
 monster-0.008
 basement-0.008
 hall-0.008
Token liked
Token ID8288
Feature activation-8.04e-02

Pos logit contributions
baby+0.006
 robots+0.006
 dragons+0.006
 alone+0.006
 ambulance+0.006
Neg logit contributions
oured-0.007
 mun-0.007
 gul-0.007
int-0.007
 cu-0.007
Token to
Token ID284
Feature activation-7.67e-02

Pos logit contributions
baby+0.003
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.003
oured-0.003
 gul-0.002
 cu-0.002
int-0.002
Token play
Token ID711
Feature activation-3.46e-01

Pos logit contributions
baby+0.002
 ambulance+0.002
 robots+0.002
 dragons+0.002
 lawyers+0.002
Neg logit contributions
 mun-0.003
oured-0.003
 gul-0.003
 cu-0.003
int-0.003
Token and
Token ID290
Feature activation+3.89e-01

Pos logit contributions
baby+0.013
 ambulance+0.012
reating+0.011
 robots+0.011
 lawyers+0.011
Neg logit contributions
 mun-0.011
 cu-0.010
oured-0.010
 bait-0.010
 gul-0.010
Token have
Token ID423
Feature activation-7.70e-01

Pos logit contributions
oured+0.015
 mun+0.015
int+0.014
 gul+0.014
 cu+0.014
Neg logit contributions
baby-0.012
 robots-0.011
 ambulance-0.011
 dragons-0.011
 hall-0.011
Token pic
Token ID8301
Feature activation+3.40e+00

Pos logit contributions
baby+0.026
 ambulance+0.025
 robots+0.024
 lawyers+0.024
 dragons+0.024
Neg logit contributions
 mun-0.027
oured-0.026
 gul-0.025
 cu-0.025
int-0.024
Tokenn
Token ID77
Feature activation-2.08e-02

Pos logit contributions
 mun+0.081
oured+0.079
int+0.077
 gul+0.077
 cu+0.076
Neg logit contributions
baby-0.143
lf-0.136
 villains-0.136
 ambulance-0.136
 dragons-0.136
Tokenics
Token ID873
Feature activation+5.17e-01

Pos logit contributions
baby+0.001
 monster+0.001
 ambulance+0.001
 robots+0.001
 lawyers+0.001
Neg logit contributions
 mun-0.001
oured-0.001
 placed-0.001
 gul-0.001
int-0.001
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
 mun+0.019
oured+0.018
 cu+0.017
 gul+0.017
 bait+0.017
Neg logit contributions
baby-0.017
 ambulance-0.016
 robots-0.015
 lawyers-0.015
 hall-0.015
Token One
Token ID1881
Feature activation+3.97e-01

Pos logit contributions
 mun+0.010
oured+0.010
 cu+0.009
 gul+0.009
 dip+0.009
Neg logit contributions
baby-0.011
 ambulance-0.010
 lawyers-0.010
 robots-0.010
 monster-0.010
Token day
Token ID1110
Feature activation-6.31e-03

Pos logit contributions
oured+0.019
 mun+0.019
 cu+0.018
 gul+0.018
leaf+0.017
Neg logit contributions
baby-0.009
 ambulance-0.008
 monster-0.007
 dragons-0.007
 robots-0.007
Token special
Token ID2041
Feature activation+3.80e-01

Pos logit contributions
 dragons+0.020
 kids+0.020
 alone+0.019
 lawyers+0.019
 monsters+0.019
Neg logit contributions
int-0.026
oured-0.026
rig-0.025
 mun-0.025
icol-0.025
Token powers
Token ID5635
Feature activation+1.17e+00

Pos logit contributions
oured+0.014
 mun+0.014
int+0.013
 gul+0.013
 cu+0.012
Neg logit contributions
 ambulance-0.012
 monster-0.012
 robots-0.012
 monsters-0.011
 dragons-0.011
Token that
Token ID326
Feature activation+7.03e-02

Pos logit contributions
 mun+0.048
oured+0.046
 gul+0.045
 cu+0.044
int+0.044
Neg logit contributions
baby-0.034
 robots-0.031
 ambulance-0.031
 dragons-0.030
 lawyers-0.030
Token can
Token ID460
Feature activation+7.27e-01

Pos logit contributions
oured+0.003
 mun+0.003
 cu+0.003
int+0.003
 gul+0.003
Neg logit contributions
baby-0.002
 dragons-0.002
 monster-0.002
 alone-0.002
 ambulance-0.002
Token heal
Token ID12035
Feature activation-2.01e+00

Pos logit contributions
oured+0.028
 mun+0.028
 cu+0.026
 gul+0.026
int+0.026
Neg logit contributions
baby-0.022
 ambulance-0.022
 dragons-0.021
 monster-0.021
 alone-0.021
Token injuries
Token ID6821
Feature activation-3.44e+00

Pos logit contributions
baby+0.059
 ambulance+0.054
 robots+0.053
 lawyers+0.052
 hall+0.051
Neg logit contributions
 mun-0.079
oured-0.077
 gul-0.074
 cu-0.074
int-0.072
Token.
Token ID13
Feature activation+5.74e-02

Pos logit contributions
baby+0.098
 robots+0.093
 ambulance+0.093
 villains+0.090
 monster+0.089
Neg logit contributions
 mun-0.129
oured-0.128
 gul-0.120
 placed-0.119
int-0.119
Token With
Token ID2080
Feature activation-1.31e-01

Pos logit contributions
 mun+0.003
oured+0.003
 bait+0.002
 dip+0.002
 cu+0.002
Neg logit contributions
baby-0.001
 lawyers-0.001
 robots-0.001
 ambulance-0.001
 twins-0.001
Token my
Token ID616
Feature activation+5.44e-01

Pos logit contributions
 dragons+0.003
 robots+0.003
 alone+0.003
 kids+0.003
 monsters+0.003
Neg logit contributions
oured-0.006
int-0.006
 mun-0.006
 cu-0.005
icol-0.005
Token powers
Token ID5635
Feature activation+1.75e+00

Pos logit contributions
oured+0.022
 mun+0.022
leaf+0.020
astically+0.019
 gul+0.019
Neg logit contributions
 sister-0.015
 monster-0.015
 ambulance-0.015
 alone-0.015
baby-0.014
Token,
Token ID11
Feature activation-1.56e-01

Pos logit contributions
 mun+0.082
oured+0.078
 gul+0.078
 cu+0.077
int+0.075
Neg logit contributions
baby-0.039
 alone-0.036
 ambulance-0.035
 robots-0.035
 dragons-0.034
Token came
Token ID1625
Feature activation+7.31e-01

Pos logit contributions
 bait+0.025
 pleasure+0.024
 mun+0.024
leaf+0.023
icol+0.023
Neg logit contributions
baby-0.025
 robots-0.021
 dragons-0.021
 monster-0.021
 ambulance-0.021
Token to
Token ID284
Feature activation-3.82e-01

Pos logit contributions
 mun+0.030
oured+0.029
 gul+0.028
int+0.028
 cu+0.027
Neg logit contributions
baby-0.021
 ambulance-0.019
 robots-0.019
 dragons-0.018
 lawyers-0.018
Token see
Token ID766
Feature activation+1.22e-01

Pos logit contributions
 monster+0.009
 kids+0.009
 lawyers+0.009
 dragons+0.009
 alone+0.009
Neg logit contributions
int-0.017
oured-0.017
 mun-0.016
rig-0.015
ical-0.015
Token what
Token ID644
Feature activation+2.40e-01

Pos logit contributions
 mun+0.006
oured+0.005
int+0.005
 gul+0.005
 cu+0.005
Neg logit contributions
baby-0.003
 ambulance-0.003
 robots-0.003
 lawyers-0.003
 dragons-0.003
Token the
Token ID262
Feature activation-6.93e-02

Pos logit contributions
 mun+0.009
oured+0.009
 gul+0.009
 cu+0.009
int+0.009
Neg logit contributions
baby-0.007
 ambulance-0.007
 robots-0.006
 lawyers-0.006
 dragons-0.006
Token comm
Token ID725
Feature activation-3.19e+00

Pos logit contributions
 hall+0.002
 twins+0.002
baby+0.002
 house+0.002
 kids+0.002
Neg logit contributions
oured-0.003
 mun-0.003
int-0.003
leaf-0.003
 gul-0.002
Tokenotion
Token ID9650
Feature activation-3.92e-01

Pos logit contributions
baby+0.073
 robots+0.066
 ambulance+0.066
 lawyers+0.061
 dragons+0.061
Neg logit contributions
oured-0.148
 mun-0.145
int-0.138
 gul-0.138
 placed-0.136
Token was
Token ID373
Feature activation+4.88e-01

Pos logit contributions
baby+0.011
 ambulance+0.011
 monster+0.011
 dragons+0.010
 robots+0.010
Neg logit contributions
 mun-0.015
oured-0.015
 cu-0.014
int-0.014
 gul-0.014
Token about
Token ID546
Feature activation+4.19e-01

Pos logit contributions
 mun+0.027
oured+0.026
 gul+0.025
int+0.025
leaf+0.025
Neg logit contributions
baby-0.007
 alone-0.007
 ambulance-0.007
 hall-0.007
 anymore-0.006
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
 mun+0.025
oured+0.025
int+0.025
 gul+0.023
leaf+0.023
Neg logit contributions
baby-0.004
 dragons-0.004
 robots-0.004
 anymore-0.003
 alone-0.003
Token She
Token ID1375
Feature activation-2.16e-01

Pos logit contributions
baby+0.003
 robots+0.003
 lawyers+0.003
 hall+0.002
 ambulance+0.002
Neg logit contributions
 mun-0.005
int-0.005
oured-0.005
 bait-0.005
 cu-0.005
TokenLook
Token ID8567
Feature activation+1.03e+00

Pos logit contributions
oured+0.020
 mun+0.019
 gul+0.018
int+0.018
 bait+0.018
Neg logit contributions
baby-0.014
 lawyers-0.012
 twins-0.012
 hall-0.012
 kids-0.012
Token,
Token ID11
Feature activation+3.31e-01

Pos logit contributions
 mun+0.050
oured+0.049
leaf+0.048
 gul+0.048
 cu+0.046
Neg logit contributions
baby-0.021
 ambulance-0.018
 dragons-0.017
 monster-0.017
 anymore-0.017
Token Ben
Token ID3932
Feature activation-1.49e-02

Pos logit contributions
oured+0.017
 mun+0.017
int+0.017
leaf+0.017
rig+0.017
Neg logit contributions
 kids-0.006
 monster-0.005
 twins-0.005
baby-0.005
 sister-0.005
Token,
Token ID11
Feature activation-4.93e-02

Pos logit contributions
baby+0.000
 ambulance+0.000
 robots+0.000
 lawyers+0.000
 dragons+0.000
Neg logit contributions
 mun-0.001
oured-0.001
 gul-0.001
 cu-0.001
leaf-0.001
Token the
Token ID262
Feature activation-6.29e-01

Pos logit contributions
 kids+0.001
 monster+0.001
 twins+0.001
 dragons+0.001
 sister+0.001
Neg logit contributions
int-0.002
oured-0.002
 mun-0.002
rig-0.002
leaf-0.002
Token bud
Token ID3600
Feature activation-3.12e+00

Pos logit contributions
baby+0.016
 ambulance+0.016
 monster+0.015
 dragons+0.015
 hall+0.015
Neg logit contributions
oured-0.028
 mun-0.027
int-0.025
 gul-0.025
leaf-0.025
Token bloom
Token ID29955
Feature activation+1.25e-02

Pos logit contributions
baby+0.079
 ambulance+0.071
 hall+0.068
 alone+0.068
 dragons+0.068
Neg logit contributions
 mun-0.135
oured-0.133
 gul-0.129
 cu-0.126
 bait-0.125
Tokened
Token ID276
Feature activation+1.82e-02

Pos logit contributions
 mun+0.001
oured+0.001
 gul+0.001
 cu+0.001
int+0.001
Neg logit contributions
baby-0.000
 ambulance-0.000
 robots-0.000
 lawyers-0.000
 dragons-0.000
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
 mun+0.001
oured+0.001
 gul+0.001
int+0.001
 cu+0.001
Neg logit contributions
baby-0.000
 ambulance-0.000
 dragons-0.000
 alone-0.000
 robots-0.000
Token It
Token ID632
Feature activation+7.90e-02

Pos logit contributions
baby+0.003
 She+0.003
 Lee+0.003
 lawyers+0.003
 twins+0.003
Neg logit contributions
 mun-0.007
oured-0.006
 bait-0.006
 gul-0.006
 placed-0.006
Token is
Token ID318
Feature activation-8.31e-02

Pos logit contributions
oured+0.004
 bait+0.004
int+0.004
 mun+0.004
 pleasure+0.004
Neg logit contributions
 hall-0.001
 alone-0.001
 dragons-0.001
 monster-0.001
 mess-0.001
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
baby+0.003
 ambulance+0.002
 robots+0.002
 lawyers+0.002
 dragons+0.002
Neg logit contributions
 mun-0.004
oured-0.004
 gul-0.004
 cu-0.004
int-0.004
Token
Token ID198
Feature activation-1.90e-01

Pos logit contributions
baby+0.005
 ambulance+0.005
 robots+0.005
 lawyers+0.005
 dragons+0.004
Neg logit contributions
 mun-0.005
oured-0.005
 cu-0.005
 gul-0.005
 bait-0.005
TokenMom
Token ID29252
Feature activation+6.83e-01

Pos logit contributions
baby+0.005
 ambulance+0.005
 robots+0.005
 lawyers+0.005
 dragons+0.005
Neg logit contributions
 mun-0.008
 cu-0.008
 gul-0.007
 bait-0.007
oured-0.007
Token heard
Token ID2982
Feature activation+3.92e-01

Pos logit contributions
 mun+0.022
oured+0.021
 bait+0.020
 pleasure+0.020
 cu+0.020
Neg logit contributions
baby-0.026
 ambulance-0.024
 robots-0.023
 monster-0.023
 dragons-0.023
Token the
Token ID262
Feature activation+4.81e-03

Pos logit contributions
 mun+0.015
 gul+0.015
oured+0.015
 cu+0.014
 dip+0.014
Neg logit contributions
baby-0.011
 villains-0.010
 lawyers-0.010
 ambulance-0.010
 hall-0.010
Token comm
Token ID725
Feature activation-3.04e+00

Pos logit contributions
 gul+0.000
 mun+0.000
 cu+0.000
oured+0.000
leaf+0.000
Neg logit contributions
baby-0.000
 dragons-0.000
 lawyers-0.000
reating-0.000
 anymore-0.000
Tokenotion
Token ID9650
Feature activation-2.59e-01

Pos logit contributions
baby+0.074
 robots+0.071
 lawyers+0.067
 dragons+0.066
 fort+0.066
Neg logit contributions
oured-0.129
 mun-0.126
 gul-0.123
 cu-0.122
int-0.120
Token and
Token ID290
Feature activation-1.91e-01

Pos logit contributions
baby+0.010
 villains+0.010
 lawyers+0.009
 robots+0.009
 principal+0.009
Neg logit contributions
 gul-0.007
 mun-0.007
oured-0.007
 cu-0.007
 placed-0.007
Token came
Token ID1625
Feature activation+6.59e-01

Pos logit contributions
baby+0.007
 villains+0.006
 robots+0.006
 ambulance+0.006
 lawyers+0.006
Neg logit contributions
 mun-0.006
 gul-0.006
oured-0.006
 cu-0.006
 placed-0.006
Token to
Token ID284
Feature activation-3.86e-01

Pos logit contributions
 cu+0.016
 gul+0.016
oured+0.015
 sto+0.015
 placed+0.015
Neg logit contributions
baby-0.027
 villains-0.026
lf-0.026
 ambulance-0.026
iness-0.026
Token see
Token ID766
Feature activation+1.38e-01

Pos logit contributions
baby+0.012
 ambulance+0.011
 robots+0.011
 lawyers+0.011
 dragons+0.011
Neg logit contributions
 mun-0.014
oured-0.014
 gul-0.013
 cu-0.013
int-0.013
TokenWow
Token ID22017
Feature activation-1.27e+00

Pos logit contributions
oured+0.019
 mun+0.018
int+0.017
 gul+0.016
 cu+0.016
Neg logit contributions
baby-0.013
 lawyers-0.012
 twins-0.012
 villains-0.012
 kids-0.012
Token,
Token ID11
Feature activation+1.20e+00

Pos logit contributions
baby+0.036
 robots+0.035
 ambulance+0.034
 lawyers+0.034
 villains+0.034
Neg logit contributions
 mun-0.051
oured-0.049
 cu-0.048
 gul-0.047
 dip-0.046
Token Sara
Token ID24799
Feature activation-5.18e-02

Pos logit contributions
oured+0.061
int+0.059
 mun+0.058
rig+0.056
leaf+0.056
Neg logit contributions
 kids-0.025
 monster-0.024
baby-0.022
 twins-0.022
 ambulance-0.022
Token,
Token ID11
Feature activation+6.07e-01

Pos logit contributions
baby+0.001
 ambulance+0.001
 robots+0.001
 lawyers+0.001
 dragons+0.001
Neg logit contributions
 mun-0.002
oured-0.002
 gul-0.002
leaf-0.002
 cu-0.002
Token the
Token ID262
Feature activation-6.22e-01

Pos logit contributions
int+0.031
oured+0.030
 mun+0.028
rig+0.028
ical+0.027
Neg logit contributions
 kids-0.014
 monster-0.013
 twins-0.012
baby-0.012
 monsters-0.011
Token bud
Token ID3600
Feature activation-2.98e+00

Pos logit contributions
baby+0.017
 ambulance+0.016
 monster+0.016
 dragons+0.015
 lawyers+0.015
Neg logit contributions
oured-0.026
 mun-0.026
 gul-0.024
int-0.024
 cu-0.024
Token bloom
Token ID29955
Feature activation+9.50e-02

Pos logit contributions
baby+0.060
 alone+0.052
 hall+0.051
 ambulance+0.051
 twins+0.051
Neg logit contributions
 mun-0.142
oured-0.140
 gul-0.136
 bait-0.136
leaf-0.135
Tokened
Token ID276
Feature activation+6.88e-02

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
 bait+0.004
Neg logit contributions
baby-0.002
 ambulance-0.002
 robots-0.002
 dragons-0.002
 lawyers-0.002
Token.
Token ID13
Feature activation-7.78e-02

Pos logit contributions
 mun+0.003
oured+0.003
 gul+0.003
 cu+0.003
int+0.003
Neg logit contributions
baby-0.001
 ambulance-0.001
 dragons-0.001
 alone-0.001
 robots-0.001
Token It
Token ID632
Feature activation+9.78e-02

Pos logit contributions
baby+0.002
 twins+0.002
 She+0.002
 lawyers+0.001
 Lee+0.001
Neg logit contributions
 mun-0.004
oured-0.004
 gul-0.003
 bait-0.003
int-0.003
Token is
Token ID318
Feature activation-1.60e-02

Pos logit contributions
oured+0.005
 bait+0.005
int+0.005
 mun+0.005
icol+0.005
Neg logit contributions
 alone-0.001
 hall-0.001
 monster-0.001
 dragons-0.001
 twins-0.001
Token see
Token ID766
Feature activation+3.94e-01

Pos logit contributions
 house+0.008
 kids+0.007
 hall+0.007
 teacher+0.007
 ambulance+0.007
Neg logit contributions
oured-0.009
 mun-0.009
icol-0.009
int-0.008
ical-0.008
Token what
Token ID644
Feature activation+3.70e-01

Pos logit contributions
 mun+0.019
oured+0.019
int+0.018
 gul+0.017
icol+0.017
Neg logit contributions
baby-0.008
 dragons-0.008
 ambulance-0.008
 lawyers-0.008
 robots-0.008
Token was
Token ID373
Feature activation+6.11e-01

Pos logit contributions
 mun+0.014
oured+0.014
 gul+0.013
 cu+0.013
int+0.013
Neg logit contributions
baby-0.011
 ambulance-0.010
 robots-0.010
 lawyers-0.010
 dragons-0.010
Token inside
Token ID2641
Feature activation+8.71e-01

Pos logit contributions
 mun+0.030
oured+0.028
int+0.028
 gul+0.027
 pleasure+0.027
Neg logit contributions
 alone-0.014
 hall-0.013
 lawyers-0.012
 dragons-0.012
 ambulance-0.012
Token the
Token ID262
Feature activation+2.42e-01

Pos logit contributions
 mun+0.036
 gul+0.035
oured+0.035
 cu+0.034
 dip+0.034
Neg logit contributions
baby-0.023
 villains-0.022
 robots-0.022
 ambulance-0.021
 lawyers-0.021
Token bud
Token ID3600
Feature activation-2.94e+00

Pos logit contributions
 mun+0.011
oured+0.011
int+0.010
 bait+0.009
 gul+0.009
Neg logit contributions
baby-0.006
 hall-0.006
 monster-0.006
 house-0.006
 ambulance-0.006
Token.
Token ID13
Feature activation-2.62e-01

Pos logit contributions
baby+0.055
 ambulance+0.048
 robots+0.046
 dragons+0.045
 hall+0.045
Neg logit contributions
 mun-0.148
oured-0.144
 gul-0.140
 cu-0.138
int-0.137
Token
Token ID198
Feature activation+3.93e-02

Pos logit contributions
baby+0.006
 ambulance+0.006
 robots+0.005
 lawyers+0.005
 hall+0.005
Neg logit contributions
 mun-0.012
oured-0.012
int-0.011
 gul-0.011
 cu-0.011
Token
Token ID198
Feature activation-6.07e-02

Pos logit contributions
 mun+0.002
oured+0.001
 gul+0.001
 cu+0.001
int+0.001
Neg logit contributions
baby-0.001
 ambulance-0.001
 robots-0.001
 lawyers-0.001
 dragons-0.001
Token"
Token ID1
Feature activation+7.51e-01

Pos logit contributions
baby+0.001
 ambulance+0.001
 robots+0.001
 lawyers+0.001
 dragons+0.001
Neg logit contributions
 mun-0.003
oured-0.003
 gul-0.003
 cu-0.003
int-0.003
TokenLet
Token ID5756
Feature activation-8.06e-02

Pos logit contributions
oured+0.033
 mun+0.033
int+0.030
 gul+0.029
 pleasure+0.029
Neg logit contributions
baby-0.020
 lawyers-0.017
 hall-0.017
 ambulance-0.017
 basement-0.016
Token and
Token ID290
Feature activation+8.35e-01

Pos logit contributions
baby+0.006
 ambulance+0.005
 robots+0.005
 monster+0.005
 hall+0.005
Neg logit contributions
 mun-0.007
oured-0.007
 cu-0.007
 gul-0.007
leaf-0.007
Token make
Token ID787
Feature activation-1.21e+00

Pos logit contributions
 mun+0.031
oured+0.030
 gul+0.028
int+0.028
 cu+0.028
Neg logit contributions
baby-0.027
 robots-0.025
 ambulance-0.025
 lawyers-0.025
 dragons-0.025
Token some
Token ID617
Feature activation-6.11e-01

Pos logit contributions
baby+0.031
 ambulance+0.029
 lawyers+0.029
 robots+0.029
 dragons+0.029
Neg logit contributions
 mun-0.053
oured-0.053
int-0.049
 cu-0.049
 gul-0.049
Token new
Token ID649
Feature activation-4.85e-01

Pos logit contributions
baby+0.014
 ambulance+0.013
 robots+0.013
 lawyers+0.013
 dragons+0.012
Neg logit contributions
 mun-0.029
oured-0.028
 cu-0.027
 gul-0.027
int-0.026
Token or
Token ID393
Feature activation+1.19e+00

Pos logit contributions
baby+0.008
 ambulance+0.007
 robots+0.007
 lawyers+0.007
 dragons+0.007
Neg logit contributions
 mun-0.025
oured-0.024
 gul-0.024
 cu-0.024
int-0.023
Tokennam
Token ID7402
Feature activation-2.93e+00

Pos logit contributions
 mun+0.049
oured+0.048
 gul+0.045
int+0.045
 cu+0.045
Neg logit contributions
baby-0.033
 robots-0.032
 ambulance-0.031
 lawyers-0.031
 dragons-0.031
Tokenents
Token ID658
Feature activation-2.27e+00

Pos logit contributions
baby+0.115
 ambulance+0.115
 monster+0.111
 hall+0.108
 basement+0.107
Neg logit contributions
 mun-0.082
oured-0.081
 gul-0.074
int-0.073
 cu-0.073
Token together
Token ID1978
Feature activation-1.00e+00

Pos logit contributions
baby+0.062
 robots+0.056
 ambulance+0.055
 villains+0.054
 lawyers+0.054
Neg logit contributions
 mun-0.091
oured-0.088
 placed-0.084
 gul-0.084
 dip-0.084
Token."
Token ID526
Feature activation+1.07e-01

Pos logit contributions
baby+0.022
 ambulance+0.019
 robots+0.019
 lawyers+0.018
 dragons+0.018
Neg logit contributions
 mun-0.047
oured-0.046
 gul-0.045
 cu-0.044
int-0.043
Token
Token ID198
Feature activation-2.66e-01

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
 bait+0.004
Neg logit contributions
baby-0.003
 ambulance-0.003
 robots-0.003
 dragons-0.003
 lawyers-0.003
Token
Token ID198
Feature activation-2.99e-01

Pos logit contributions
baby+0.009
 robots+0.009
 ambulance+0.009
 lawyers+0.009
 hall+0.009
Neg logit contributions
 mun-0.008
 cu-0.008
 gul-0.008
oured-0.008
leaf-0.008
Token,
Token ID11
Feature activation-7.72e-01

Pos logit contributions
baby+0.002
 ambulance+0.002
 robots+0.002
 lawyers+0.002
 dragons+0.002
Neg logit contributions
 mun-0.003
oured-0.003
 gul-0.003
 cu-0.003
int-0.003
Token "
Token ID366
Feature activation+2.31e-01

Pos logit contributions
baby+0.015
 ambulance+0.013
 lawyers+0.012
 robots+0.012
 dragons+0.011
Neg logit contributions
 mun-0.040
oured-0.039
 gul-0.037
int-0.037
 bait-0.037
TokenWhat
Token ID2061
Feature activation+1.90e-01

Pos logit contributions
 mun+0.010
oured+0.010
 gul+0.009
 cu+0.009
 bait+0.009
Neg logit contributions
baby-0.006
 ambulance-0.006
 lawyers-0.006
 robots-0.005
 dragons-0.005
Token does
Token ID857
Feature activation+7.01e-01

Pos logit contributions
 mun+0.007
oured+0.007
 gul+0.006
 cu+0.006
 bait+0.006
Neg logit contributions
baby-0.006
 ambulance-0.006
 robots-0.005
 dragons-0.005
 hall-0.005
Token '
Token ID705
Feature activation+6.74e-01

Pos logit contributions
 mun+0.026
oured+0.026
 gul+0.024
 cu+0.024
int+0.024
Neg logit contributions
baby-0.023
 ambulance-0.022
 lawyers-0.021
 robots-0.021
 monster-0.021
Tokendestroy
Token ID41659
Feature activation-2.93e+00

Pos logit contributions
 mun+0.023
oured+0.022
 cu+0.021
icol+0.021
 gul+0.021
Neg logit contributions
baby-0.024
 lawyers-0.023
 ambulance-0.022
 robots-0.022
 alone-0.021
Token'
Token ID6
Feature activation+3.58e-01

Pos logit contributions
baby+0.080
 ambulance+0.072
 lawyers+0.072
 robots+0.070
 dragons+0.067
Neg logit contributions
 mun-0.126
oured-0.120
 gul-0.116
 cu-0.116
 dip-0.111
Token mean
Token ID1612
Feature activation-3.84e-01

Pos logit contributions
 mun+0.015
oured+0.014
 gul+0.014
 cu+0.014
int+0.013
Neg logit contributions
baby-0.010
 ambulance-0.009
 robots-0.009
 lawyers-0.009
 dragons-0.009
Token?"
Token ID1701
Feature activation-5.44e-01

Pos logit contributions
baby+0.011
 ambulance+0.010
 robots+0.010
 lawyers+0.010
 dragons+0.010
Neg logit contributions
 mun-0.015
oured-0.015
 gul-0.014
 cu-0.014
int-0.014
Token
Token ID220
Feature activation-2.17e+00

Pos logit contributions
baby+0.013
 ambulance+0.012
 lawyers+0.011
 hall+0.011
 alone+0.011
Neg logit contributions
 mun-0.026
oured-0.025
 bait-0.025
leaf-0.024
 cu-0.024
Token
Token ID198
Feature activation-6.16e-01

Pos logit contributions
baby+0.063
 ambulance+0.058
 robots+0.057
 lawyers+0.056
 dragons+0.055
Neg logit contributions
 mun-0.087
oured-0.084
 gul-0.081
 cu-0.081
int-0.079
Token mummy
Token ID38567
Feature activation+4.41e-01

Pos logit contributions
 mun+0.007
oured+0.007
int+0.007
 gul+0.007
leaf+0.007
Neg logit contributions
baby-0.005
 monster-0.005
 ambulance-0.005
 hall-0.005
 dragons-0.005
Token,
Token ID11
Feature activation-3.21e-01

Pos logit contributions
 mun+0.018
oured+0.018
 gul+0.017
 cu+0.017
int+0.017
Neg logit contributions
baby-0.012
 ambulance-0.011
 robots-0.011
 dragons-0.011
 lawyers-0.011
Token a
Token ID257
Feature activation-7.13e-02

Pos logit contributions
 monster+0.009
baby+0.009
 kids+0.009
 ambulance+0.009
 dragons+0.009
Neg logit contributions
 mun-0.013
oured-0.013
int-0.013
rig-0.012
ical-0.012
Token big
Token ID1263
Feature activation-4.88e-01

Pos logit contributions
baby+0.002
 monster+0.002
 ambulance+0.002
 hall+0.002
 robots+0.002
Neg logit contributions
oured-0.003
 mun-0.003
int-0.003
 gul-0.003
 cu-0.003
Token l
Token ID300
Feature activation-1.43e+00

Pos logit contributions
baby+0.013
 ambulance+0.012
 robots+0.012
 lawyers+0.011
 dragons+0.011
Neg logit contributions
 mun-0.021
oured-0.020
 gul-0.019
 cu-0.019
int-0.019
Tokenoll
Token ID692
Feature activation-2.93e+00

Pos logit contributions
baby+0.048
 ambulance+0.045
 robots+0.045
 lawyers+0.044
 dragons+0.043
Neg logit contributions
 mun-0.051
oured-0.049
 gul-0.047
 cu-0.047
int-0.046
Tokenipop
Token ID42800
Feature activation-2.84e-01

Pos logit contributions
baby+0.118
 ambulance+0.112
 robots+0.111
 lawyers+0.109
 dragons+0.108
Neg logit contributions
 mun-0.084
oured-0.081
 gul-0.076
 cu-0.075
int-0.074
Token is
Token ID318
Feature activation-2.62e-02

Pos logit contributions
baby+0.008
 ambulance+0.007
 monster+0.007
 dragons+0.007
 robots+0.007
Neg logit contributions
 mun-0.011
oured-0.011
 gul-0.011
 cu-0.011
int-0.011
Token up
Token ID510
Feature activation+1.12e+00

Pos logit contributions
 alone+0.001
 dragons+0.001
 monster+0.001
baby+0.001
 mess+0.001
Neg logit contributions
int-0.001
oured-0.001
 mun-0.001
leaf-0.001
 cu-0.001
Token in
Token ID287
Feature activation+5.48e-01

Pos logit contributions
 mun+0.052
oured+0.051
int+0.049
 gul+0.047
leaf+0.047
Neg logit contributions
 alone-0.027
baby-0.024
 lawyers-0.023
 dragons-0.023
 anymore-0.022
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
 mun+0.025
oured+0.025
int+0.024
 cu+0.023
 gul+0.023
Neg logit contributions
baby-0.013
 ambulance-0.012
 dragons-0.012
 robots-0.012
 lawyers-0.012
Token his
Token ID465
Feature activation-7.29e-01

Pos logit contributions
 mun+0.044
oured+0.043
 gul+0.042
 cu+0.042
int+0.041
Neg logit contributions
baby-0.026
 ambulance-0.023
 robots-0.023
 lawyers-0.022
 dragons-0.022
Token hand
Token ID1021
Feature activation+1.66e-01

Pos logit contributions
baby+0.018
 ambulance+0.017
 hall+0.017
 house+0.017
 sister+0.017
Neg logit contributions
 mun-0.032
oured-0.030
leaf-0.030
 gul-0.029
int-0.029
Token to
Token ID284
Feature activation-4.73e-01

Pos logit contributions
 mun+0.008
 pleasure+0.008
oured+0.007
leaf+0.007
 bait+0.007
Neg logit contributions
baby-0.004
 ambulance-0.003
 alone-0.003
 robots-0.003
 monster-0.003
Token pull
Token ID2834
Feature activation-1.32e+00

Pos logit contributions
 alone+0.014
 monster+0.014
 ambulance+0.014
 911+0.014
 kids+0.014
Neg logit contributions
oured-0.017
 mun-0.017
ical-0.016
 favourite-0.016
astically-0.016
Token the
Token ID262
Feature activation-9.79e-02

Pos logit contributions
baby+0.036
 ambulance+0.033
 robots+0.033
 lawyers+0.032
 dragons+0.032
Neg logit contributions
 mun-0.056
oured-0.054
 gul-0.052
 cu-0.051
int-0.051
Token bud
Token ID3600
Feature activation-2.92e+00

Pos logit contributions
 ambulance+0.003
 monster+0.003
baby+0.003
 alone+0.003
 hall+0.002
Neg logit contributions
 mun-0.004
oured-0.004
int-0.004
icol-0.004
 gul-0.004
Token.
Token ID13
Feature activation-5.44e-01

Pos logit contributions
baby+0.073
 ambulance+0.067
 robots+0.065
 dragons+0.064
 lawyers+0.064
Neg logit contributions
 mun-0.128
oured-0.124
 gul-0.120
 cu-0.117
int-0.116
Token
Token ID198
Feature activation-1.94e-01

Pos logit contributions
baby+0.010
 ambulance+0.009
 monster+0.009
 lawyers+0.009
 alone+0.009
Neg logit contributions
 mun-0.026
oured-0.026
leaf-0.024
int-0.024
 cu-0.024
Token
Token ID198
Feature activation-2.54e-01

Pos logit contributions
baby+0.005
 ambulance+0.005
 robots+0.004
 lawyers+0.004
 dragons+0.004
Neg logit contributions
 mun-0.008
oured-0.008
 gul-0.008
 cu-0.008
int-0.008
Token"
Token ID1
Feature activation+7.04e-01

Pos logit contributions
baby+0.005
 ambulance+0.004
 robots+0.004
 dragons+0.004
 lawyers+0.004
Neg logit contributions
 mun-0.013
oured-0.013
 gul-0.012
 cu-0.012
int-0.012
TokenNo
Token ID2949
Feature activation-1.68e-01

Pos logit contributions
oured+0.034
 mun+0.033
 pleasure+0.031
icol+0.030
int+0.030
Neg logit contributions
baby-0.015
 lawyers-0.014
 ambulance-0.013
 villains-0.013
 monster-0.013
Tokenmy
Token ID1820
Feature activation-5.44e-01

Pos logit contributions
 mun+0.031
oured+0.030
 gul+0.030
 cu+0.029
int+0.029
Neg logit contributions
baby-0.011
 ambulance-0.009
 robots-0.009
 lawyers-0.009
 dragons-0.009
Token's
Token ID338
Feature activation+9.75e-03

Pos logit contributions
baby+0.015
 ambulance+0.014
 monster+0.013
 alone+0.013
 dragons+0.013
Neg logit contributions
 pleasure-0.022
leaf-0.022
 bait-0.022
oured-0.021
 souven-0.021
Token mom
Token ID1995
Feature activation+3.84e-01

Pos logit contributions
 favourite+0.000
oured+0.000
 mun+0.000
 gul+0.000
 pleasure+0.000
Neg logit contributions
baby-0.000
ulk-0.000
reating-0.000
opl-0.000
 ambulance-0.000
Token noticed
Token ID6810
Feature activation+6.26e-01

Pos logit contributions
 mun+0.014
 bait+0.013
oured+0.013
 pleasure+0.013
leaf+0.013
Neg logit contributions
baby-0.013
 ambulance-0.012
 robots-0.012
 monster-0.011
 dragons-0.011
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
 mun+0.022
oured+0.022
 gul+0.021
 cu+0.021
 placed+0.020
Neg logit contributions
baby-0.021
 ambulance-0.019
 robots-0.019
 lawyers-0.019
 dragons-0.019
Token comm
Token ID725
Feature activation-2.92e+00

Pos logit contributions
baby+0.003
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.005
oured-0.005
 gul-0.005
 cu-0.005
 bait-0.005
Tokenotion
Token ID9650
Feature activation+2.35e-02

Pos logit contributions
baby+0.063
 robots+0.060
 dragons+0.058
 ambulance+0.057
 lawyers+0.057
Neg logit contributions
oured-0.134
 mun-0.131
 gul-0.126
int-0.124
 cu-0.123
Token and
Token ID290
Feature activation-4.89e-01

Pos logit contributions
 mun+0.001
oured+0.001
 gul+0.001
 cu+0.001
int+0.001
Neg logit contributions
baby-0.001
 ambulance-0.001
 robots-0.001
 lawyers-0.001
 dragons-0.001
Token came
Token ID1625
Feature activation+6.59e-01

Pos logit contributions
baby+0.019
 ambulance+0.018
 robots+0.018
 lawyers+0.018
 dragons+0.018
Neg logit contributions
 mun-0.014
oured-0.014
 gul-0.013
 cu-0.013
int-0.013
Token outside
Token ID2354
Feature activation+9.98e-01

Pos logit contributions
oured+0.022
 mun+0.022
 gul+0.021
 cu+0.021
 placed+0.021
Neg logit contributions
baby-0.022
 villains-0.021
 ambulance-0.021
 robots-0.020
 lawyers-0.020
Token.
Token ID13
Feature activation-4.50e-01

Pos logit contributions
 gul+0.035
 mun+0.034
 placed+0.033
oured+0.032
 cu+0.032
Neg logit contributions
 villains-0.031
baby-0.030
 ambulance-0.029
 robots-0.029
 lawyers-0.029
Token happened
Token ID3022
Feature activation+4.28e-01

Pos logit contributions
 mun+0.003
oured+0.003
 pleasure+0.003
 gul+0.003
 cu+0.003
Neg logit contributions
baby-0.003
 monster-0.003
 ambulance-0.003
 dragons-0.003
 robots-0.003
Token:
Token ID25
Feature activation+6.17e-01

Pos logit contributions
 mun+0.022
oured+0.022
 gul+0.021
int+0.021
 cu+0.021
Neg logit contributions
baby-0.008
 ambulance-0.006
 lawyers-0.006
 robots-0.006
 alone-0.006
Token a
Token ID257
Feature activation-1.56e-01

Pos logit contributions
 mun+0.023
oured+0.022
int+0.022
 gul+0.021
 cu+0.021
Neg logit contributions
baby-0.020
 robots-0.019
 ambulance-0.019
 lawyers-0.019
 dragons-0.018
Token modern
Token ID3660
Feature activation-1.62e-01

Pos logit contributions
baby+0.004
 ambulance+0.004
 monster+0.004
 robots+0.004
 lawyers+0.004
Neg logit contributions
 mun-0.007
oured-0.006
 gul-0.006
 cu-0.006
int-0.006
Token l
Token ID300
Feature activation-1.23e+00

Pos logit contributions
baby+0.004
 ambulance+0.004
 monster+0.004
 hall+0.004
 robots+0.004
Neg logit contributions
 mun-0.007
oured-0.007
int-0.007
 gul-0.007
 cu-0.007
Tokenoll
Token ID692
Feature activation-2.91e+00

Pos logit contributions
baby+0.040
 ambulance+0.037
 robots+0.037
 lawyers+0.036
 dragons+0.036
Neg logit contributions
 mun-0.045
oured-0.044
 gul-0.042
 cu-0.041
int-0.041
Tokenipop
Token ID42800
Feature activation-3.79e-02

Pos logit contributions
baby+0.114
 ambulance+0.107
 robots+0.105
 lawyers+0.104
 dragons+0.104
Neg logit contributions
 mun-0.086
oured-0.083
 gul-0.079
 cu-0.078
int-0.075
Token machine
Token ID4572
Feature activation+1.43e+00

Pos logit contributions
baby+0.001
 ambulance+0.001
 robots+0.001
 monster+0.001
 dragons+0.001
Neg logit contributions
 mun-0.002
oured-0.002
 gul-0.001
 cu-0.001
int-0.001
Token came
Token ID1625
Feature activation+1.55e-01

Pos logit contributions
 mun+0.062
oured+0.060
int+0.060
 pleasure+0.060
 gul+0.058
Neg logit contributions
baby-0.037
 ambulance-0.035
 monster-0.034
 robots-0.034
 hall-0.034
Token to
Token ID284
Feature activation-5.25e-01

Pos logit contributions
 mun+0.006
oured+0.006
int+0.006
 cu+0.006
 gul+0.006
Neg logit contributions
baby-0.005
 ambulance-0.004
 dragons-0.004
 alone-0.004
 robots-0.004
Token their
Token ID511
Feature activation-4.54e-01

Pos logit contributions
baby+0.013
 ambulance+0.012
 lawyers+0.012
 hall+0.012
 dragons+0.012
Neg logit contributions
oured-0.024
 mun-0.024
int-0.023
 cu-0.022
 gul-0.022
Token
Token ID198
Feature activation+4.30e-01

Pos logit contributions
 mun+0.005
 cu+0.005
oured+0.005
 gul+0.005
 pleasure+0.004
Neg logit contributions
baby-0.006
 ambulance-0.006
 dragons-0.006
 monster-0.005
 robots-0.005
Token
Token ID198
Feature activation+2.49e-01

Pos logit contributions
 mun+0.013
 cu+0.013
oured+0.013
 gul+0.013
 bait+0.012
Neg logit contributions
baby-0.015
 ambulance-0.015
 robots-0.015
 dragons-0.014
 monster-0.014
TokenMom
Token ID29252
Feature activation+8.15e-01

Pos logit contributions
 cu+0.008
 mun+0.008
 bait+0.008
 placed+0.008
 gul+0.008
Neg logit contributions
 monster-0.008
 ambulance-0.008
 dragons-0.008
baby-0.008
 monsters-0.007
Token heard
Token ID2982
Feature activation+3.63e-01

Pos logit contributions
 mun+0.030
oured+0.030
 gul+0.029
 cu+0.029
int+0.028
Neg logit contributions
baby-0.025
 robots-0.023
 lawyers-0.023
 villains-0.023
 ambulance-0.023
Token the
Token ID262
Feature activation+4.48e-02

Pos logit contributions
 gul+0.013
oured+0.013
 mun+0.013
 dip+0.013
 cu+0.012
Neg logit contributions
baby-0.010
 villains-0.010
 hall-0.009
 maze-0.009
 monster-0.009
Token comm
Token ID725
Feature activation-2.89e+00

Pos logit contributions
 gul+0.002
leaf+0.002
 mun+0.002
oured+0.002
 cu+0.002
Neg logit contributions
baby-0.001
 dragons-0.001
 villains-0.001
yards-0.001
 anymore-0.001
Tokenotion
Token ID9650
Feature activation-2.13e-01

Pos logit contributions
baby+0.070
 robots+0.066
 fort+0.064
 dragons+0.064
 ambulance+0.062
Neg logit contributions
oured-0.124
 mun-0.120
 gul-0.117
 placed-0.115
 cu-0.115
Token and
Token ID290
Feature activation+8.02e-02

Pos logit contributions
baby+0.008
 villains+0.008
 robots+0.008
 lawyers+0.007
 principal+0.007
Neg logit contributions
 gul-0.006
 mun-0.006
 placed-0.006
 dip-0.006
oured-0.006
Token came
Token ID1625
Feature activation+7.16e-01

Pos logit contributions
 gul+0.002
 mun+0.002
 placed+0.002
 cu+0.002
oured+0.002
Neg logit contributions
opl-0.003
 villains-0.003
baby-0.003
 anymore-0.003
 basement-0.003
Token to
Token ID284
Feature activation-3.26e-01

Pos logit contributions
 cu+0.014
oured+0.013
 placed+0.013
 gul+0.013
 sto+0.012
Neg logit contributions
emaker-0.032
iness-0.032
 villains-0.032
baby-0.032
opl-0.032
Token the
Token ID262
Feature activation-3.79e-01

Pos logit contributions
baby+0.011
 ambulance+0.010
 robots+0.010
 dragons+0.010
 lawyers+0.010
Neg logit contributions
 mun-0.012
oured-0.011
 gul-0.011
 cu-0.011
 placed-0.010
Token found
Token ID1043
Feature activation+1.93e-01

Pos logit contributions
 mun+0.001
oured+0.001
 bait+0.000
 gul+0.000
 cu+0.000
Neg logit contributions
baby-0.001
 ambulance-0.000
 robots-0.000
 lawyers-0.000
 hall-0.000
Token that
Token ID326
Feature activation+4.69e-01

Pos logit contributions
 mun+0.008
oured+0.008
int+0.007
 cu+0.007
leaf+0.007
Neg logit contributions
baby-0.006
 dragons-0.005
 monster-0.005
 robots-0.005
 ambulance-0.005
Token it
Token ID340
Feature activation+6.43e-01

Pos logit contributions
 mun+0.017
 gul+0.016
 bait+0.016
oured+0.016
 cu+0.016
Neg logit contributions
baby-0.015
 ambulance-0.014
 robots-0.013
 lawyers-0.013
 hall-0.013
Token wouldn
Token ID3636
Feature activation+4.01e-01

Pos logit contributions
 mun+0.026
oured+0.026
leaf+0.025
 pleasure+0.025
 cu+0.024
Neg logit contributions
baby-0.018
 monster-0.017
 ambulance-0.017
 alone-0.017
 dragons-0.017
Token't
Token ID470
Feature activation-7.11e-01

Pos logit contributions
 mun+0.014
oured+0.014
 gul+0.013
 cu+0.013
int+0.013
Neg logit contributions
baby-0.013
 ambulance-0.012
 robots-0.012
 lawyers-0.012
 dragons-0.012
Token bud
Token ID3600
Feature activation-2.89e+00

Pos logit contributions
baby+0.025
 ambulance+0.023
 robots+0.023
 lawyers+0.022
 dragons+0.022
Neg logit contributions
 mun-0.024
oured-0.024
 gul-0.023
 cu-0.022
int-0.022
Tokenge
Token ID469
Feature activation-1.36e+00

Pos logit contributions
baby+0.090
 ambulance+0.083
 robots+0.082
 lawyers+0.080
 dragons+0.080
Neg logit contributions
 mun-0.110
oured-0.106
 gul-0.103
 cu-0.101
int-0.100
Token.
Token ID13
Feature activation-5.61e-02

Pos logit contributions
baby+0.033
 ambulance+0.032
 lawyers+0.030
 hall+0.030
 principal+0.030
Neg logit contributions
 mun-0.056
oured-0.055
 placed-0.053
 bait-0.053
 gul-0.053
Token He
Token ID679
Feature activation-1.24e-01

Pos logit contributions
baby+0.002
 ambulance+0.002
 lawyers+0.002
reating+0.002
 dragons+0.002
Neg logit contributions
 mun-0.002
oured-0.002
 gul-0.002
 cu-0.002
 bait-0.002
Token pushed
Token ID7121
Feature activation-5.13e-01

Pos logit contributions
baby+0.004
 monster+0.004
 dragons+0.003
 ambulance+0.003
 robots+0.003
Neg logit contributions
 pleasure-0.005
oured-0.005
leaf-0.005
 mun-0.005
 dip-0.004
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
baby+0.015
 ambulance+0.014
 robots+0.013
 lawyers+0.013
 dragons+0.013
Neg logit contributions
 mun-0.021
oured-0.020
 gul-0.019
 cu-0.019
int-0.019
Token not
Token ID407
Feature activation-4.82e-01

Pos logit contributions
 mun+0.018
oured+0.018
int+0.018
leaf+0.017
 gul+0.016
Neg logit contributions
 alone-0.012
 dragons-0.011
 monsters-0.010
 robots-0.010
 monster-0.010
Token bigger
Token ID5749
Feature activation+4.84e-02

Pos logit contributions
baby+0.014
 dragons+0.014
 alone+0.014
 robots+0.013
 lawyers+0.013
Neg logit contributions
oured-0.019
 mun-0.019
int-0.018
 gul-0.018
 cu-0.018
Token than
Token ID621
Feature activation+6.80e-01

Pos logit contributions
 mun+0.002
oured+0.002
 cu+0.002
 gul+0.002
 bait+0.002
Neg logit contributions
baby-0.002
 ambulance-0.002
 hall-0.001
 robots-0.001
 lawyers-0.001
Token a
Token ID257
Feature activation-9.84e-02

Pos logit contributions
 mun+0.030
int+0.030
oured+0.030
 cu+0.029
 gul+0.029
Neg logit contributions
 dragons-0.016
 robots-0.016
baby-0.016
 monsters-0.015
 monster-0.015
Token fing
Token ID8038
Feature activation-4.86e-02

Pos logit contributions
baby+0.003
 robots+0.003
 ambulance+0.003
 lawyers+0.002
ulk+0.002
Neg logit contributions
 mun-0.004
oured-0.004
 cu-0.004
 gul-0.004
 dip-0.004
Tokenern
Token ID1142
Feature activation-2.87e+00

Pos logit contributions
baby+0.002
 ambulance+0.002
 lawyers+0.002
 principal+0.002
 dragons+0.002
Neg logit contributions
 mun-0.002
oured-0.001
 gul-0.001
 cu-0.001
 bait-0.001
Tokenail
Token ID603
Feature activation-1.71e+00

Pos logit contributions
baby+0.086
 ambulance+0.080
 robots+0.079
 lawyers+0.077
 dragons+0.077
Neg logit contributions
 mun-0.111
oured-0.108
 gul-0.104
 cu-0.103
int-0.101
Token.
Token ID13
Feature activation-2.58e-01

Pos logit contributions
baby+0.041
 ambulance+0.037
 robots+0.036
 monster+0.035
 lawyers+0.035
Neg logit contributions
 mun-0.077
oured-0.076
 gul-0.073
 cu-0.073
int-0.072
Token They
Token ID1119
Feature activation+1.38e-01

Pos logit contributions
 robots+0.006
 lawyers+0.006
baby+0.006
 dragons+0.006
 hall+0.006
Neg logit contributions
 mun-0.011
oured-0.011
 cu-0.011
int-0.010
 dip-0.010
Token also
Token ID635
Feature activation+1.06e+00

Pos logit contributions
oured+0.005
 mun+0.005
int+0.005
icol+0.005
 pleasure+0.005
Neg logit contributions
 dragons-0.004
baby-0.004
 robots-0.004
 alone-0.004
 monster-0.004
Token found
Token ID1043
Feature activation+2.80e-01

Pos logit contributions
int+0.040
oured+0.040
 mun+0.037
 pleasure+0.037
leaf+0.037
Neg logit contributions
 hall-0.031
 alone-0.031
 dragons-0.031
 robots-0.029
 lawyers-0.029
TokenBe
Token ID3856
Feature activation+1.25e+00

Pos logit contributions
 mun+0.009
oured+0.009
 cu+0.008
 gul+0.008
 bait+0.008
Neg logit contributions
baby-0.007
 lawyers-0.006
 ambulance-0.006
 robots-0.006
 villains-0.006
Token careful
Token ID8161
Feature activation-3.81e-01

Pos logit contributions
oured+0.054
 mun+0.053
 gul+0.048
 dip+0.047
 cu+0.047
Neg logit contributions
baby-0.035
 ambulance-0.034
 lawyers-0.032
 alone-0.032
 twins-0.031
Token,
Token ID11
Feature activation+5.18e-01

Pos logit contributions
baby+0.010
 ambulance+0.009
 robots+0.009
 lawyers+0.009
 dragons+0.008
Neg logit contributions
 mun-0.016
oured-0.016
 gul-0.015
 cu-0.015
int-0.015
Token these
Token ID777
Feature activation-4.85e-01

Pos logit contributions
 mun+0.023
oured+0.023
int+0.022
 gul+0.022
 cu+0.022
Neg logit contributions
baby-0.013
 ambulance-0.012
 robots-0.011
 lawyers-0.011
 dragons-0.011
Token or
Token ID393
Feature activation+1.18e+00

Pos logit contributions
baby+0.012
 ambulance+0.010
 robots+0.010
 lawyers+0.010
 dragons+0.010
Neg logit contributions
 mun-0.022
oured-0.021
 gul-0.021
 cu-0.020
int-0.020
Tokennam
Token ID7402
Feature activation-2.87e+00

Pos logit contributions
 mun+0.048
oured+0.047
int+0.046
 gul+0.044
 cu+0.044
Neg logit contributions
baby-0.033
 robots-0.032
 lawyers-0.032
 ambulance-0.031
 dragons-0.031
Tokenents
Token ID658
Feature activation-1.93e+00

Pos logit contributions
baby+0.108
 ambulance+0.104
 monster+0.100
 robots+0.100
 lawyers+0.098
Neg logit contributions
 mun-0.088
oured-0.086
 gul-0.081
 cu-0.080
int-0.079
Token are
Token ID389
Feature activation-1.79e-01

Pos logit contributions
baby+0.053
 ambulance+0.048
 robots+0.047
 lawyers+0.046
 dragons+0.046
Neg logit contributions
 mun-0.080
oured-0.078
 gul-0.075
 cu-0.074
 bait-0.073
Token very
Token ID845
Feature activation-1.58e-02

Pos logit contributions
 dragons+0.005
baby+0.005
 lawyers+0.005
 robots+0.005
 ambulance+0.005
Neg logit contributions
 mun-0.008
oured-0.008
int-0.007
 gul-0.007
 cu-0.007
Token special
Token ID2041
Feature activation-1.57e-01

Pos logit contributions
baby+0.000
 alone+0.000
 ambulance+0.000
 lawyers+0.000
 robots+0.000
Neg logit contributions
oured-0.001
 mun-0.001
 gul-0.001
 cu-0.001
int-0.001
Token.
Token ID13
Feature activation-3.23e-01

Pos logit contributions
baby+0.004
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.007
oured-0.007
 gul-0.007
 cu-0.007
 bait-0.007
Token but
Token ID475
Feature activation+9.36e-04

Pos logit contributions
 robots+0.007
baby+0.006
 alone+0.006
 ambulance+0.006
 monster+0.006
Neg logit contributions
oured-0.010
int-0.010
 mun-0.009
leaf-0.009
mit-0.009
Token the
Token ID262
Feature activation-7.25e-01

Pos logit contributions
 mun+0.000
oured+0.000
 gul+0.000
 cu+0.000
 bait+0.000
Neg logit contributions
baby-0.000
 ambulance-0.000
 robots-0.000
 lawyers-0.000
 dragons-0.000
Token screw
Token ID9580
Feature activation-1.04e-01

Pos logit contributions
baby+0.024
 ambulance+0.022
 lawyers+0.021
 robots+0.021
 dragons+0.021
Neg logit contributions
 mun-0.026
oured-0.025
 gul-0.024
 cu-0.024
 bait-0.024
Token wouldn
Token ID3636
Feature activation+3.51e-01

Pos logit contributions
baby+0.003
 ambulance+0.003
 robots+0.003
 monster+0.003
 dragons+0.003
Neg logit contributions
 mun-0.004
oured-0.004
 gul-0.004
 cu-0.004
leaf-0.004
Token't
Token ID470
Feature activation-5.07e-01

Pos logit contributions
 mun+0.013
oured+0.012
 gul+0.012
 cu+0.012
int+0.011
Neg logit contributions
baby-0.011
 ambulance-0.011
 robots-0.011
 lawyers-0.010
 dragons-0.010
Token bud
Token ID3600
Feature activation-2.86e+00

Pos logit contributions
baby+0.018
 ambulance+0.017
 robots+0.016
 lawyers+0.016
 dragons+0.016
Neg logit contributions
 mun-0.017
oured-0.017
 gul-0.016
 cu-0.016
int-0.015
Tokenge
Token ID469
Feature activation-1.27e+00

Pos logit contributions
baby+0.093
 ambulance+0.086
 robots+0.085
 lawyers+0.084
 dragons+0.083
Neg logit contributions
 mun-0.104
oured-0.101
 gul-0.097
 cu-0.096
int-0.094
Token.
Token ID13
Feature activation-3.11e-02

Pos logit contributions
baby+0.031
 ambulance+0.031
 hall+0.029
 lawyers+0.029
 principal+0.028
Neg logit contributions
 mun-0.051
oured-0.051
 placed-0.049
 bait-0.049
 gul-0.048
Token
Token ID198
Feature activation+4.19e-01

Pos logit contributions
baby+0.001
 ambulance+0.001
 robots+0.001
 lawyers+0.001
 dragons+0.001
Neg logit contributions
 mun-0.001
oured-0.001
 gul-0.001
 cu-0.001
int-0.001
Token
Token ID198
Feature activation+3.15e-01

Pos logit contributions
 mun+0.015
oured+0.014
 gul+0.014
 cu+0.014
int+0.013
Neg logit contributions
baby-0.014
 ambulance-0.013
 robots-0.013
 lawyers-0.013
 dragons-0.012
TokenSuddenly
Token ID38582
Feature activation+6.88e-01

Pos logit contributions
 mun+0.012
oured+0.012
 gul+0.011
 cu+0.011
int+0.011
Neg logit contributions
baby-0.010
 ambulance-0.009
 robots-0.009
 lawyers-0.009
 dragons-0.009
Token but
Token ID475
Feature activation+5.24e-02

Pos logit contributions
baby+0.010
 ambulance+0.010
 robots+0.009
 lawyers+0.009
 alone+0.009
Neg logit contributions
 mun-0.012
oured-0.012
int-0.011
leaf-0.011
 gul-0.011
Token the
Token ID262
Feature activation-6.69e-01

Pos logit contributions
 mun+0.002
oured+0.002
 gul+0.002
 cu+0.002
int+0.002
Neg logit contributions
baby-0.001
 ambulance-0.001
 robots-0.001
 lawyers-0.001
 dragons-0.001
Token lid
Token ID19789
Feature activation+3.93e-01

Pos logit contributions
baby+0.021
 ambulance+0.019
 robots+0.019
 lawyers+0.019
 dragons+0.019
Neg logit contributions
 mun-0.025
oured-0.025
 gul-0.024
 cu-0.024
 bait-0.023
Token wouldn
Token ID3636
Feature activation+3.56e-01

Pos logit contributions
 mun+0.015
oured+0.015
leaf+0.014
 cu+0.014
 pleasure+0.014
Neg logit contributions
baby-0.012
 ambulance-0.011
 alone-0.011
 monster-0.011
 dragons-0.010
Token't
Token ID470
Feature activation-4.86e-01

Pos logit contributions
 mun+0.012
oured+0.012
 gul+0.012
 cu+0.011
 bait+0.011
Neg logit contributions
baby-0.012
 robots-0.011
 ambulance-0.011
 lawyers-0.011
 dragons-0.011
Token bud
Token ID3600
Feature activation-2.86e+00

Pos logit contributions
baby+0.017
 ambulance+0.016
 robots+0.016
 lawyers+0.016
 dragons+0.016
Neg logit contributions
 mun-0.016
oured-0.016
 gul-0.015
 cu-0.015
int-0.014
Tokenge
Token ID469
Feature activation-1.26e+00

Pos logit contributions
baby+0.081
 ambulance+0.073
 robots+0.072
 lawyers+0.070
 dragons+0.070
Neg logit contributions
 mun-0.117
oured-0.113
 gul-0.110
 cu-0.109
int-0.107
Token!
Token ID0
Feature activation-1.81e-01

Pos logit contributions
 ambulance+0.031
baby+0.031
 lawyers+0.030
 hall+0.030
 principal+0.029
Neg logit contributions
 mun-0.051
oured-0.050
 placed-0.048
 bait-0.048
 gul-0.048
Token The
Token ID383
Feature activation-1.15e-01

Pos logit contributions
 robots+0.004
baby+0.004
 lawyers+0.004
 ambulance+0.004
 dragons+0.004
Neg logit contributions
 mun-0.008
oured-0.008
leaf-0.008
mit-0.008
 cu-0.007
Token mouse
Token ID10211
Feature activation+3.11e-01

Pos logit contributions
baby+0.002
 ambulance+0.002
 robots+0.002
 lawyers+0.002
 dragons+0.002
Neg logit contributions
 mun-0.005
oured-0.005
 gul-0.005
 cu-0.005
 bait-0.005
Token began
Token ID2540
Feature activation+3.88e-01

Pos logit contributions
oured+0.012
 mun+0.011
leaf+0.011
 pleasure+0.011
 bait+0.011
Neg logit contributions
baby-0.010
 monster-0.009
 ambulance-0.009
 dragons-0.009
 alone-0.009
Token but
Token ID475
Feature activation-2.18e-01

Pos logit contributions
 alone+0.005
 ambulance+0.005
 lawyers+0.004
 cats+0.004
 crying+0.004
Neg logit contributions
engu-0.012
int-0.012
mit-0.012
oured-0.012
 mun-0.012
Token it
Token ID340
Feature activation+2.16e-01

Pos logit contributions
baby+0.007
 ambulance+0.006
 robots+0.006
 villains+0.006
 dragons+0.006
Neg logit contributions
 mun-0.008
oured-0.008
 gul-0.008
 cu-0.008
 placed-0.008
Token still
Token ID991
Feature activation+1.37e+00

Pos logit contributions
oured+0.008
leaf+0.008
 pleasure+0.008
 mun+0.008
 bait+0.008
Neg logit contributions
baby-0.006
 monster-0.006
 ambulance-0.006
 dragons-0.006
 monsters-0.006
Token wouldn
Token ID3636
Feature activation+3.12e-01

Pos logit contributions
oured+0.043
 mun+0.042
leaf+0.040
 cu+0.039
 bait+0.039
Neg logit contributions
baby-0.051
 monster-0.048
 ambulance-0.047
 dragons-0.047
 robots-0.047
Token't
Token ID470
Feature activation-4.66e-01

Pos logit contributions
 mun+0.009
oured+0.009
 gul+0.009
 dip+0.008
 cu+0.008
Neg logit contributions
baby-0.011
 robots-0.011
 lawyers-0.011
 villains-0.011
 ambulance-0.011
Token bud
Token ID3600
Feature activation-2.86e+00

Pos logit contributions
baby+0.018
 robots+0.017
 lawyers+0.017
 hall+0.017
 ambulance+0.017
Neg logit contributions
 mun-0.014
 dip-0.013
 gul-0.013
oured-0.013
 placed-0.012
Tokenge
Token ID469
Feature activation-1.27e+00

Pos logit contributions
baby+0.101
 ambulance+0.096
 robots+0.095
 lawyers+0.094
 dragons+0.093
Neg logit contributions
 mun-0.094
oured-0.093
 gul-0.086
 cu-0.086
 placed-0.084
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
 ambulance+0.035
baby+0.034
 principal+0.034
 hall+0.034
ulk+0.033
Neg logit contributions
oured-0.046
 mun-0.046
 placed-0.045
 gul-0.043
 cu-0.043
Token
Token ID220
Feature activation-1.25e+00

Pos logit contributions
baby+0.005
 ambulance+0.005
 robots+0.005
 dragons+0.005
 lawyers+0.005
Neg logit contributions
 mun-0.008
oured-0.007
 gul-0.007
 cu-0.007
int-0.007
Token
Token ID198
Feature activation+5.17e-01

Pos logit contributions
baby+0.042
 ambulance+0.041
 monster+0.040
 robots+0.040
 dragons+0.040
Neg logit contributions
oured-0.041
 mun-0.041
 cu-0.038
int-0.038
 gul-0.038
Token
Token ID198
Feature activation+4.24e-01

Pos logit contributions
 mun+0.018
oured+0.017
 cu+0.017
 gul+0.017
int+0.016
Neg logit contributions
baby-0.017
 ambulance-0.016
 robots-0.016
 monster-0.016
 dragons-0.016
Token
Token ID198
Feature activation+1.13e-01

Pos logit contributions
baby+0.042
 ambulance+0.040
 robots+0.039
 monster+0.039
 lawyers+0.039
Neg logit contributions
 mun-0.038
oured-0.037
 cu-0.035
int-0.035
 gul-0.035
Token
Token ID198
Feature activation+7.62e-02

Pos logit contributions
 mun+0.005
oured+0.005
 gul+0.004
 cu+0.004
int+0.004
Neg logit contributions
baby-0.003
 ambulance-0.003
 robots-0.003
 lawyers-0.003
 dragons-0.003
TokenSuddenly
Token ID38582
Feature activation+7.99e-01

Pos logit contributions
 mun+0.003
oured+0.003
 gul+0.003
 cu+0.003
int+0.003
Neg logit contributions
baby-0.002
 ambulance-0.002
 robots-0.002
 lawyers-0.002
 dragons-0.002
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
leaf+0.042
 mun+0.042
oured+0.042
 gul+0.040
rig+0.040
Neg logit contributions
baby-0.012
 monster-0.011
 robots-0.011
 dragons-0.010
 ambulance-0.010
Token P
Token ID350
Feature activation+2.96e-01

Pos logit contributions
baby+0.024
 monster+0.024
 robots+0.023
 alone+0.023
 kids+0.023
Neg logit contributions
oured-0.047
int-0.046
leaf-0.046
 mun-0.046
rig-0.044
Tokenuss
Token ID1046
Feature activation-2.85e+00

Pos logit contributions
 mun+0.011
oured+0.011
 gul+0.010
 cu+0.010
int+0.010
Neg logit contributions
baby-0.009
 ambulance-0.009
 robots-0.008
 lawyers-0.008
 dragons-0.008
Token noticed
Token ID6810
Feature activation+7.92e-01

Pos logit contributions
baby+0.104
 ambulance+0.098
 dragons+0.096
 robots+0.096
 monster+0.095
Neg logit contributions
 mun-0.090
oured-0.089
 gul-0.084
 cu-0.083
 pleasure-0.082
Token a
Token ID257
Feature activation+1.34e-01

Pos logit contributions
 mun+0.027
oured+0.027
int+0.026
 pleasure+0.025
leaf+0.025
Neg logit contributions
baby-0.027
 monster-0.026
 ambulance-0.026
 robots-0.026
 dragons-0.025
Token bigger
Token ID5749
Feature activation-9.30e-02

Pos logit contributions
 mun+0.005
oured+0.005
 gul+0.005
 cu+0.005
 bait+0.005
Neg logit contributions
baby-0.004
 robots-0.004
 ambulance-0.004
 lawyers-0.004
 dragons-0.004
Token bug
Token ID5434
Feature activation+3.14e-01

Pos logit contributions
baby+0.002
 robots+0.002
 lawyers+0.002
 dragons+0.002
 ambulance+0.002
Neg logit contributions
 mun-0.004
 cu-0.004
 gul-0.004
oured-0.004
 bait-0.004
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
 mun+0.012
 gul+0.012
oured+0.011
 cu+0.011
 placed+0.011
Neg logit contributions
baby-0.010
 lawyers-0.009
 robots-0.009
 ambulance-0.009
 villains-0.008
Token reminded
Token ID14516
Feature activation+2.90e-01

Pos logit contributions
 bait+0.010
 mun+0.010
 pleasure+0.010
leaf+0.010
oured+0.010
Neg logit contributions
baby-0.010
 ambulance-0.009
 robots-0.009
 monster-0.009
 dragons-0.009
Token her
Token ID607
Feature activation-8.40e-01

Pos logit contributions
 mun+0.013
oured+0.012
 gul+0.012
 cu+0.012
 placed+0.012
Neg logit contributions
baby-0.006
 villains-0.006
 ambulance-0.006
 lawyers-0.006
 hall-0.006
Token that
Token ID326
Feature activation+3.49e-01

Pos logit contributions
baby+0.028
 ambulance+0.026
 robots+0.026
 lawyers+0.025
 dragons+0.025
Neg logit contributions
 mun-0.030
oured-0.030
 gul-0.028
 cu-0.028
int-0.027
Token she
Token ID673
Feature activation+3.48e-01

Pos logit contributions
 mun+0.014
oured+0.014
 gul+0.014
 cu+0.013
 bait+0.013
Neg logit contributions
baby-0.010
 ambulance-0.009
 robots-0.008
 lawyers-0.008
 hall-0.008
Token should
Token ID815
Feature activation+7.21e-01

Pos logit contributions
 mun+0.013
oured+0.013
 cu+0.013
 gul+0.013
 bait+0.013
Neg logit contributions
baby-0.010
 robots-0.009
 monster-0.009
 lawyers-0.009
 dragons-0.009
Token always
Token ID1464
Feature activation+2.59e+00

Pos logit contributions
oured+0.025
 mun+0.025
 gul+0.023
int+0.023
leaf+0.022
Neg logit contributions
baby-0.025
 ambulance-0.024
 lawyers-0.023
 monster-0.023
 robots-0.023
Token prevent
Token ID2948
Feature activation-1.32e+00

Pos logit contributions
oured+0.108
 mun+0.106
 gul+0.096
leaf+0.095
int+0.094
Neg logit contributions
 ambulance-0.069
baby-0.069
 alone-0.067
 lawyers-0.066
 sister-0.065
Token others
Token ID1854
Feature activation-9.06e-01

Pos logit contributions
baby+0.033
 ambulance+0.030
 robots+0.029
 lawyers+0.029
 dragons+0.029
Neg logit contributions
 mun-0.058
oured-0.056
 gul-0.055
 cu-0.054
 bait-0.053
Token from
Token ID422
Feature activation-1.47e+00

Pos logit contributions
baby+0.030
 ambulance+0.028
 robots+0.027
 dragons+0.027
 lawyers+0.027
Neg logit contributions
 mun-0.032
oured-0.031
 gul-0.030
 cu-0.029
 bait-0.029
Token feeling
Token ID4203
Feature activation-4.14e-01

Pos logit contributions
baby+0.044
 ambulance+0.039
 hall+0.038
 robots+0.038
 villains+0.038
Neg logit contributions
 mun-0.055
 gul-0.054
oured-0.052
 cu-0.052
 bait-0.051
Token sad
Token ID6507
Feature activation+6.25e-02

Pos logit contributions
baby+0.009
 ambulance+0.008
 robots+0.008
 lawyers+0.008
 dragons+0.007
Neg logit contributions
 mun-0.020
oured-0.019
 gul-0.019
 cu-0.019
int-0.018
Token there
Token ID612
Feature activation-2.87e-01

Pos logit contributions
 mun+0.000
oured+0.000
 gul+0.000
 cu+0.000
 bait+0.000
Neg logit contributions
baby-0.000
 ambulance-0.000
 robots-0.000
 lawyers-0.000
 dragons-0.000
Token was
Token ID373
Feature activation+2.24e-01

Pos logit contributions
baby+0.010
 ambulance+0.010
 robots+0.010
 lawyers+0.010
 villains+0.010
Neg logit contributions
 mun-0.009
oured-0.009
 gul-0.009
 cu-0.008
 placed-0.008
Token a
Token ID257
Feature activation-5.12e-01

Pos logit contributions
oured+0.008
 mun+0.008
 cu+0.008
int+0.008
mit+0.008
Neg logit contributions
baby-0.007
 twins-0.007
 dragons-0.007
 lawyers-0.007
 kids-0.007
Token little
Token ID1310
Feature activation-1.58e-03

Pos logit contributions
baby+0.019
 robots+0.018
ulk+0.018
 ambulance+0.018
 lawyers+0.018
Neg logit contributions
 mun-0.016
 gul-0.014
 cu-0.014
oured-0.014
 dip-0.014
Token girl
Token ID2576
Feature activation-5.95e-01

Pos logit contributions
ulk+0.000
baby+0.000
 villains+0.000
 dragons+0.000
 lawyers+0.000
Neg logit contributions
 mun-0.000
 gul-0.000
oured-0.000
 cu-0.000
 dip-0.000
Token named
Token ID3706
Feature activation+2.53e+00

Pos logit contributions
baby+0.023
 ambulance+0.022
 robots+0.022
 lawyers+0.022
 dragons+0.022
Neg logit contributions
 mun-0.017
oured-0.016
 gul-0.016
 cu-0.016
int-0.015
Token Lily
Token ID20037
Feature activation-3.83e-01

Pos logit contributions
 mun+0.121
oured+0.118
 gul+0.113
 cu+0.113
 dip+0.112
Neg logit contributions
baby-0.050
 ambulance-0.049
 robots-0.048
 villains-0.046
 monster-0.045
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
 ambulance+0.009
baby+0.009
 lawyers+0.009
 villains+0.009
 robots+0.009
Neg logit contributions
 mun-0.016
 cu-0.015
 gul-0.015
 planted-0.015
 placed-0.015
Token She
Token ID1375
Feature activation-5.75e-01

Pos logit contributions
 ambulance+0.004
eds+0.003
 villains+0.003
 lawyers+0.003
 monster+0.003
Neg logit contributions
 mun-0.003
 dip-0.003
 favourite-0.003
oured-0.003
 Gret-0.003
Token loved
Token ID6151
Feature activation-1.75e-02

Pos logit contributions
baby+0.021
 robots+0.019
 ambulance+0.019
 monster+0.019
 dragons+0.019
Neg logit contributions
oured-0.019
 mun-0.019
 gul-0.018
 cu-0.018
 bait-0.017
Token to
Token ID284
Feature activation-3.68e-02

Pos logit contributions
baby+0.001
 ambulance+0.001
 villains+0.001
 robots+0.001
 lawyers+0.001
Neg logit contributions
 mun-0.001
oured-0.001
 gul-0.001
 cu-0.000
leaf-0.000
Token day
Token ID1110
Feature activation+1.44e-01

Pos logit contributions
 mun+0.001
oured+0.001
 gul+0.001
 cu+0.001
int+0.001
Neg logit contributions
baby-0.001
 ambulance-0.001
 robots-0.001
 lawyers-0.001
 dragons-0.001
Token,
Token ID11
Feature activation+8.42e-02

Pos logit contributions
 mun+0.008
oured+0.008
leaf+0.008
 gul+0.008
 cu+0.007
Neg logit contributions
baby-0.002
 alone-0.002
 kids-0.002
 monster-0.002
 dragons-0.002
Token a
Token ID257
Feature activation-2.86e-01

Pos logit contributions
oured+0.004
 mun+0.004
int+0.003
leaf+0.003
 cu+0.003
Neg logit contributions
 alone-0.002
baby-0.002
 monster-0.002
 anymore-0.002
 kids-0.002
Token little
Token ID1310
Feature activation-8.88e-02

Pos logit contributions
baby+0.009
 ambulance+0.008
 robots+0.008
 lawyers+0.008
 monster+0.008
Neg logit contributions
 mun-0.011
oured-0.011
 gul-0.010
 cu-0.010
int-0.010
Token boy
Token ID2933
Feature activation-6.90e-01

Pos logit contributions
baby+0.002
 dragons+0.002
 robots+0.002
 lawyers+0.002
 villains+0.002
Neg logit contributions
 mun-0.004
oured-0.004
 gul-0.004
 cu-0.004
 bait-0.004
Token named
Token ID3706
Feature activation+2.46e+00

Pos logit contributions
baby+0.025
 ambulance+0.023
 robots+0.023
 lawyers+0.022
 monster+0.022
Neg logit contributions
 mun-0.023
oured-0.023
 gul-0.021
 cu-0.021
int-0.021
Token Tim
Token ID5045
Feature activation+1.01e-01

Pos logit contributions
 mun+0.097
oured+0.094
 gul+0.091
 cu+0.089
 bait+0.089
Neg logit contributions
baby-0.072
 ambulance-0.068
 lawyers-0.066
 robots-0.065
 dragons-0.064
Token and
Token ID290
Feature activation-6.66e-01

Pos logit contributions
 mun+0.003
oured+0.003
 gul+0.003
 cu+0.003
 placed+0.003
Neg logit contributions
baby-0.004
 ambulance-0.004
 robots-0.004
 lawyers-0.004
 villains-0.003
Token his
Token ID465
Feature activation-6.19e-01

Pos logit contributions
baby+0.016
 ambulance+0.014
 anymore+0.014
 dragons+0.013
 robots+0.013
Neg logit contributions
 mun-0.030
 bait-0.028
 gul-0.028
 cu-0.027
 planted-0.027
Token dog
Token ID3290
Feature activation+5.63e-02

Pos logit contributions
baby+0.012
 ambulance+0.011
 robots+0.010
 lawyers+0.010
 dragons+0.010
Neg logit contributions
 mun-0.030
oured-0.030
 gul-0.029
 cu-0.028
 bait-0.028
Token,
Token ID11
Feature activation+9.07e-01

Pos logit contributions
 mun+0.002
oured+0.002
 gul+0.002
 planted+0.002
 cu+0.002
Neg logit contributions
baby-0.002
 ambulance-0.002
 robots-0.002
 lawyers-0.002
reating-0.002
Token there
Token ID612
Feature activation-1.69e-01

Pos logit contributions
baby+0.001
 alone+0.001
 monster+0.001
 ambulance+0.001
 robots+0.001
Neg logit contributions
 mun-0.001
oured-0.001
 gul-0.001
 cu-0.001
leaf-0.001
Token was
Token ID373
Feature activation+3.31e-01

Pos logit contributions
baby+0.006
 ambulance+0.006
 robots+0.006
 lawyers+0.006
 villains+0.006
Neg logit contributions
 mun-0.005
oured-0.005
 gul-0.005
 cu-0.005
 placed-0.005
Token a
Token ID257
Feature activation-5.00e-01

Pos logit contributions
oured+0.013
 mun+0.013
int+0.012
 cu+0.012
mit+0.012
Neg logit contributions
baby-0.010
 twins-0.010
 dragons-0.010
 robots-0.010
 cats-0.010
Token little
Token ID1310
Feature activation+5.09e-02

Pos logit contributions
baby+0.018
 robots+0.017
 ambulance+0.017
 lawyers+0.017
 dragons+0.017
Neg logit contributions
 mun-0.016
oured-0.015
 gul-0.015
 cu-0.015
 dip-0.014
Token girl
Token ID2576
Feature activation-5.03e-01

Pos logit contributions
 mun+0.002
oured+0.002
 gul+0.002
 cu+0.002
 dip+0.002
Neg logit contributions
baby-0.001
ulk-0.001
 dragons-0.001
 lawyers-0.001
 robots-0.001
Token named
Token ID3706
Feature activation+2.48e+00

Pos logit contributions
baby+0.019
 ambulance+0.018
 robots+0.018
 lawyers+0.018
 dragons+0.018
Neg logit contributions
 mun-0.015
oured-0.014
 gul-0.014
 cu-0.014
int-0.013
Token Sally
Token ID25737
Feature activation+1.46e-01

Pos logit contributions
 mun+0.121
oured+0.119
 cu+0.115
 gul+0.114
 dip+0.112
Neg logit contributions
baby-0.048
 ambulance-0.046
 robots-0.045
 lawyers-0.043
 villains-0.042
Token.
Token ID13
Feature activation-2.68e-01

Pos logit contributions
 mun+0.006
 cu+0.006
 planted+0.006
 gul+0.006
 placed+0.006
Neg logit contributions
 ambulance-0.004
baby-0.004
 villains-0.003
 lawyers-0.003
 robots-0.003
Token She
Token ID1375
Feature activation-5.18e-01

Pos logit contributions
baby+0.009
 villains+0.009
 robots+0.009
 ambulance+0.008
 lawyers+0.008
Neg logit contributions
 mun-0.010
oured-0.009
 cu-0.009
 gul-0.009
 planted-0.009
Token was
Token ID373
Feature activation-4.53e-01

Pos logit contributions
baby+0.019
 ambulance+0.017
 robots+0.017
 monster+0.017
 hall+0.017
Neg logit contributions
oured-0.018
 mun-0.017
 gul-0.016
 cu-0.016
 bait-0.016
Token only
Token ID691
Feature activation+5.43e-01

Pos logit contributions
 twins+0.012
baby+0.011
 alone+0.011
 lawyers+0.011
 ambulance+0.011
Neg logit contributions
oured-0.021
 mun-0.021
int-0.020
 cu-0.020
OW-0.019
Token,
Token ID11
Feature activation+3.74e-01

Pos logit contributions
baby+0.004
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.007
oured-0.006
 gul-0.006
 cu-0.006
int-0.006
Token there
Token ID612
Feature activation-2.87e-01

Pos logit contributions
oured+0.018
 mun+0.017
 gul+0.016
 cu+0.016
int+0.016
Neg logit contributions
baby-0.009
 robots-0.008
 alone-0.008
 hall-0.008
 monster-0.008
Token lived
Token ID5615
Feature activation-6.19e-01

Pos logit contributions
baby+0.010
 ambulance+0.010
 robots+0.010
 lawyers+0.010
 villains+0.009
Neg logit contributions
 mun-0.009
oured-0.009
 gul-0.009
 cu-0.008
 placed-0.008
Token a
Token ID257
Feature activation-4.43e-01

Pos logit contributions
baby+0.026
 ambulance+0.025
 lawyers+0.024
 robots+0.024
 dragons+0.024
Neg logit contributions
 mun-0.017
oured-0.016
 gul-0.015
 cu-0.015
int-0.015
Token boy
Token ID2933
Feature activation-5.53e-01

Pos logit contributions
baby+0.015
 ambulance+0.014
 robots+0.014
ulk+0.014
 dragons+0.014
Neg logit contributions
 mun-0.015
oured-0.014
 gul-0.014
 cu-0.014
 bait-0.013
Token named
Token ID3706
Feature activation+2.42e+00

Pos logit contributions
baby+0.021
 ambulance+0.019
 robots+0.019
 lawyers+0.019
 dragons+0.019
Neg logit contributions
 mun-0.018
oured-0.017
 gul-0.016
 cu-0.016
int-0.016
Token Tim
Token ID5045
Feature activation-2.97e-02

Pos logit contributions
 mun+0.106
oured+0.104
 gul+0.100
 cu+0.100
int+0.098
Neg logit contributions
baby-0.063
 robots-0.056
 ambulance-0.056
 dragons-0.055
 lawyers-0.054
Token.
Token ID13
Feature activation-1.71e-01

Pos logit contributions
 villains+0.001
 lawyers+0.001
 ambulance+0.001
baby+0.001
ulk+0.001
Neg logit contributions
 mun-0.001
 planted-0.001
 placed-0.001
 cu-0.001
 gul-0.001
Token He
Token ID679
Feature activation-3.22e-01

Pos logit contributions
baby+0.006
 ambulance+0.005
 lawyers+0.005
 dragons+0.005
 monster+0.005
Neg logit contributions
 mun-0.006
oured-0.006
 gul-0.006
 bait-0.006
 cu-0.005
Token had
Token ID550
Feature activation-1.85e-01

Pos logit contributions
baby+0.011
 robots+0.010
 ambulance+0.010
 dragons+0.010
 monster+0.010
Neg logit contributions
oured-0.011
 mun-0.011
 gul-0.010
 cu-0.010
int-0.010
Token a
Token ID257
Feature activation-2.69e-01

Pos logit contributions
 robots+0.006
 twins+0.006
 lawyers+0.006
 dragons+0.006
baby+0.006
Neg logit contributions
oured-0.007
int-0.007
 mun-0.007
 cu-0.006
rig-0.006
Token hand
Token ID1021
Feature activation-2.18e-01

Pos logit contributions
baby+0.012
 ambulance+0.011
 robots+0.011
 lawyers+0.011
 dragons+0.011
Neg logit contributions
 mun-0.013
oured-0.013
 gul-0.012
 cu-0.012
int-0.012
Token.
Token ID13
Feature activation-9.02e-02

Pos logit contributions
baby+0.004
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.011
oured-0.011
 gul-0.010
 cu-0.010
int-0.010
Token It
Token ID632
Feature activation+6.77e-01

Pos logit contributions
baby+0.002
 ambulance+0.002
 robots+0.002
 lawyers+0.002
 dragons+0.002
Neg logit contributions
 mun-0.004
oured-0.004
 gul-0.004
 cu-0.004
int-0.004
Token went
Token ID1816
Feature activation-3.26e-01

Pos logit contributions
oured+0.024
 pleasure+0.023
leaf+0.023
int+0.022
 mun+0.022
Neg logit contributions
 dragons-0.022
 monster-0.022
 alone-0.022
 monsters-0.021
 hall-0.021
Token up
Token ID510
Feature activation+1.09e+00

Pos logit contributions
baby+0.011
 ambulance+0.010
 robots+0.010
 lawyers+0.010
 dragons+0.010
Neg logit contributions
 mun-0.012
oured-0.011
 gul-0.011
 cu-0.011
 bait-0.010
Token in
Token ID287
Feature activation+8.21e-01

Pos logit contributions
 mun+0.045
oured+0.044
int+0.043
 gul+0.042
 cu+0.041
Neg logit contributions
baby-0.030
 dragons-0.028
 robots-0.028
 alone-0.028
 lawyers-0.028
Token the
Token ID262
Feature activation+2.82e-01

Pos logit contributions
 mun+0.030
 gul+0.029
oured+0.029
 cu+0.029
 bait+0.028
Neg logit contributions
baby-0.026
 ambulance-0.025
 robots-0.023
 lawyers-0.023
 villains-0.023
Token air
Token ID1633
Feature activation-7.08e-01

Pos logit contributions
 mun+0.012
oured+0.012
 gul+0.011
 cu+0.011
int+0.011
Neg logit contributions
baby-0.008
 ambulance-0.007
 robots-0.007
 hall-0.007
 dragons-0.007
Token and
Token ID290
Feature activation-1.02e-01

Pos logit contributions
baby+0.015
 ambulance+0.014
 dragons+0.014
 robots+0.013
 alone+0.013
Neg logit contributions
 mun-0.033
oured-0.032
 gul-0.031
int-0.031
 cu-0.031
Token joined
Token ID5399
Feature activation+6.56e-01

Pos logit contributions
 alone+0.003
 dragons+0.003
 hall+0.003
 monster+0.003
baby+0.003
Neg logit contributions
int-0.003
oured-0.003
 pleasure-0.003
 mun-0.003
leaf-0.003
Token other
Token ID584
Feature activation-5.53e-01

Pos logit contributions
 mun+0.030
oured+0.030
 gul+0.028
 cu+0.028
int+0.028
Neg logit contributions
baby-0.015
 robots-0.014
 ambulance-0.014
 dragons-0.013
 lawyers-0.013
Token Can
Token ID1680
Feature activation-3.54e-01

Pos logit contributions
 mun+0.000
oured+0.000
int+0.000
 cu+0.000
 gul+0.000
Neg logit contributions
baby-0.000
 robots-0.000
 lawyers-0.000
 ambulance-0.000
 villains-0.000
Token we
Token ID356
Feature activation-9.17e-01

Pos logit contributions
baby+0.010
 ambulance+0.010
 robots+0.009
 lawyers+0.009
 dragons+0.009
Neg logit contributions
 mun-0.014
oured-0.014
 cu-0.013
 gul-0.013
int-0.013
Token help
Token ID1037
Feature activation-1.97e+00

Pos logit contributions
 ambulance+0.025
baby+0.025
 house+0.023
 hall+0.023
 monster+0.022
Neg logit contributions
oured-0.038
 mun-0.038
 gul-0.036
int-0.035
leaf-0.035
Token you
Token ID345
Feature activation+3.38e-01

Pos logit contributions
baby+0.057
 ambulance+0.053
 robots+0.052
 lawyers+0.051
 dragons+0.051
Neg logit contributions
 mun-0.079
oured-0.077
 gul-0.074
 cu-0.073
int-0.072
Token with
Token ID351
Feature activation-7.60e-01

Pos logit contributions
 mun+0.015
oured+0.015
 gul+0.014
leaf+0.014
 cu+0.014
Neg logit contributions
baby-0.008
 ambulance-0.008
 hall-0.008
 lawyers-0.008
 alone-0.008
Token the
Token ID262
Feature activation+3.20e-01

Pos logit contributions
baby+0.020
 ambulance+0.019
 robots+0.018
 lawyers+0.018
 dragons+0.018
Neg logit contributions
 mun-0.033
oured-0.032
 gul-0.030
 cu-0.030
int-0.030
Token bags
Token ID11668
Feature activation-4.12e-01

Pos logit contributions
 mun+0.015
oured+0.015
int+0.014
 gul+0.014
 cu+0.014
Neg logit contributions
baby-0.007
 ambulance-0.007
 monster-0.007
 robots-0.007
 hall-0.007
Token?"
Token ID1701
Feature activation-2.12e-01

Pos logit contributions
baby+0.011
 ambulance+0.010
 robots+0.010
 lawyers+0.010
 dragons+0.010
Neg logit contributions
 mun-0.017
oured-0.017
 gul-0.016
 cu-0.016
int-0.016
Token
Token ID198
Feature activation-7.64e-01

Pos logit contributions
baby+0.006
 ambulance+0.005
 robots+0.005
 lawyers+0.005
 dragons+0.005
Neg logit contributions
 mun-0.009
oured-0.009
 gul-0.008
int-0.008
 cu-0.008
Token
Token ID198
Feature activation-6.88e-01

Pos logit contributions
baby+0.024
 ambulance+0.023
 robots+0.022
 lawyers+0.022
 dragons+0.022
Neg logit contributions
 mun-0.028
oured-0.027
 gul-0.027
 cu-0.027
 bait-0.026
TokenThe
Token ID464
Feature activation-4.24e-01

Pos logit contributions
baby+0.021
 ambulance+0.019
 robots+0.019
 lawyers+0.019
 dragons+0.019
Neg logit contributions
 mun-0.027
oured-0.026
 gul-0.025
 cu-0.025
int-0.024
Token the
Token ID262
Feature activation-2.45e-03

Pos logit contributions
 mun+0.008
oured+0.008
int+0.007
 gul+0.007
 cu+0.007
Neg logit contributions
baby-0.004
 robots-0.003
 ambulance-0.003
 dragons-0.003
 lawyers-0.003
Token jungle
Token ID20712
Feature activation+4.50e-01

Pos logit contributions
baby+0.000
 ambulance+0.000
 robots+0.000
 monster+0.000
 dragons+0.000
Neg logit contributions
 mun-0.000
oured-0.000
 gul-0.000
 cu-0.000
int-0.000
Token for
Token ID329
Feature activation-5.32e-01

Pos logit contributions
 mun+0.019
oured+0.018
 gul+0.017
 cu+0.017
int+0.017
Neg logit contributions
baby-0.012
 ambulance-0.011
 robots-0.011
 lawyers-0.011
 dragons-0.011
Token the
Token ID262
Feature activation+2.23e-01

Pos logit contributions
baby+0.014
 ambulance+0.013
 robots+0.012
 lawyers+0.012
 hall+0.012
Neg logit contributions
 mun-0.022
 gul-0.021
 cu-0.021
oured-0.021
 placed-0.021
Token wonderful
Token ID7932
Feature activation+7.80e-01

Pos logit contributions
oured+0.011
int+0.010
 mun+0.010
 cu+0.009
 gul+0.009
Neg logit contributions
 monster-0.006
 ambulance-0.006
 dragons-0.005
 kids-0.005
 fort-0.005
Token night
Token ID1755
Feature activation+3.15e-01

Pos logit contributions
oured+0.039
 mun+0.035
leaf+0.034
int+0.034
 cu+0.034
Neg logit contributions
 monster-0.018
 ambulance-0.017
 dragons-0.017
 fort-0.017
 hall-0.017
Token he
Token ID339
Feature activation+1.80e-01

Pos logit contributions
 mun+0.014
oured+0.014
 gul+0.013
 cu+0.013
int+0.013
Neg logit contributions
baby-0.008
 ambulance-0.007
 robots-0.007
 dragons-0.007
 monster-0.007
Token had
Token ID550
Feature activation+6.40e-01

Pos logit contributions
oured+0.007
 mun+0.006
 cu+0.006
 gul+0.006
 bait+0.006
Neg logit contributions
baby-0.006
 dragons-0.005
 alone-0.005
 monster-0.005
 ambulance-0.005
Token spent
Token ID3377
Feature activation-9.29e-01

Pos logit contributions
oured+0.025
 mun+0.024
 gul+0.023
leaf+0.023
 cu+0.023
Neg logit contributions
baby-0.018
 alone-0.018
 dragons-0.018
 lawyers-0.017
 monster-0.017
Token there
Token ID612
Feature activation+2.60e-01

Pos logit contributions
baby+0.029
 ambulance+0.028
 robots+0.027
 lawyers+0.026
 dragons+0.025
Neg logit contributions
 mun-0.035
oured-0.033
 gul-0.032
 cu-0.032
 bait-0.032
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
 mun+0.012
oured+0.012
 gul+0.011
 cu+0.011
int+0.011
Neg logit contributions
baby-0.006
 ambulance-0.005
 robots-0.005
 dragons-0.005
 lawyers-0.005
Token very
Token ID845
Feature activation+9.68e-02

Pos logit contributions
baby+0.015
 ambulance+0.014
 robots+0.013
 twins+0.013
 alone+0.013
Neg logit contributions
 mun-0.020
oured-0.020
int-0.019
 cu-0.019
leaf-0.019
Token gentle
Token ID10296
Feature activation+1.54e-01

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
int+0.004
Neg logit contributions
baby-0.003
 ambulance-0.003
 lawyers-0.002
 robots-0.002
 alone-0.002
Token as
Token ID355
Feature activation+1.23e+00

Pos logit contributions
 mun+0.007
oured+0.007
 gul+0.007
 cu+0.007
int+0.006
Neg logit contributions
baby-0.004
 ambulance-0.003
 robots-0.003
 lawyers-0.003
 dragons-0.003
Token she
Token ID673
Feature activation-3.22e-01

Pos logit contributions
 mun+0.058
oured+0.057
int+0.056
 gul+0.054
 cu+0.054
Neg logit contributions
baby-0.027
 lawyers-0.025
 robots-0.024
 dragons-0.024
 ambulance-0.024
Token strapped
Token ID37625
Feature activation+4.90e-02

Pos logit contributions
baby+0.009
 monster+0.009
 robots+0.009
 kids+0.009
 brothers+0.009
Neg logit contributions
 pleasure-0.011
 Gret-0.011
 Gab-0.011
inated-0.011
leaf-0.010
Token them
Token ID606
Feature activation+1.04e+00

Pos logit contributions
 mun+0.002
oured+0.002
 gul+0.002
 cu+0.002
int+0.002
Neg logit contributions
baby-0.001
 ambulance-0.001
 robots-0.001
 lawyers-0.001
 dragons-0.001
Token on
Token ID319
Feature activation-2.82e-01

Pos logit contributions
 mun+0.046
oured+0.044
 gul+0.043
 cu+0.042
leaf+0.042
Neg logit contributions
baby-0.026
 robots-0.024
 alone-0.024
 ambulance-0.023
 dragons-0.023
Token.
Token ID13
Feature activation+9.05e-02

Pos logit contributions
baby+0.006
 ambulance+0.005
 robots+0.005
 lawyers+0.005
 dragons+0.005
Neg logit contributions
 mun-0.014
oured-0.014
int-0.013
 gul-0.013
 cu-0.013
Token
Token ID220
Feature activation-1.20e+00

Pos logit contributions
 mun+0.004
 bait+0.004
leaf+0.004
mit+0.004
int+0.004
Neg logit contributions
baby-0.002
 ambulance-0.002
 hall-0.002
 robots-0.002
 kids-0.002
Token
Token ID198
Feature activation+1.21e-01

Pos logit contributions
baby+0.036
 ambulance+0.034
 robots+0.033
 dragons+0.032
 lawyers+0.032
Neg logit contributions
 mun-0.046
oured-0.045
 gul-0.043
 cu-0.043
int-0.042
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
 mun+0.005
oured+0.005
 gul+0.005
 cu+0.005
int+0.005
Neg logit contributions
baby-0.003
 ambulance-0.003
 lawyers-0.003
 robots-0.003
 hall-0.003
Token was
Token ID373
Feature activation+1.29e-01

Pos logit contributions
 mun+0.007
oured+0.007
int+0.007
 bait+0.007
 cu+0.007
Neg logit contributions
baby-0.005
 monster-0.005
 robots-0.005
 ambulance-0.005
 hall-0.005
Token so
Token ID523
Feature activation+7.48e-01

Pos logit contributions
int+0.006
 mun+0.006
oured+0.005
 cu+0.005
leaf+0.005
Neg logit contributions
baby-0.003
 alone-0.003
 ambulance-0.003
 twins-0.003
 anymore-0.003
Token glad
Token ID9675
Feature activation+5.45e-02

Pos logit contributions
oured+0.034
 mun+0.034
int+0.034
 cu+0.034
 bait+0.032
Neg logit contributions
baby-0.017
 alone-0.016
 twins-0.014
 kids-0.014
 robots-0.014
Token and
Token ID290
Feature activation+6.15e-01

Pos logit contributions
oured+0.002
 mun+0.002
int+0.002
leaf+0.002
 gul+0.002
Neg logit contributions
baby-0.001
 robots-0.001
 monster-0.001
 dragons-0.001
 ambulance-0.001
Token started
Token ID2067
Feature activation+3.88e-01

Pos logit contributions
 mun+0.024
oured+0.024
int+0.023
 cu+0.022
 gul+0.022
Neg logit contributions
baby-0.018
 robots-0.017
 alone-0.016
 ambulance-0.016
 monster-0.016
Token to
Token ID284
Feature activation+4.84e-01

Pos logit contributions
 mun+0.016
oured+0.016
int+0.016
 bait+0.015
 cu+0.015
Neg logit contributions
baby-0.011
 robots-0.010
 ambulance-0.010
 dragons-0.010
 monster-0.010
Token play
Token ID711
Feature activation-3.34e-01

Pos logit contributions
oured+0.019
 mun+0.019
int+0.018
 bait+0.018
 gul+0.018
Neg logit contributions
baby-0.013
 ambulance-0.013
 alone-0.013
 monster-0.012
 robots-0.012
Token once
Token ID1752
Feature activation-1.62e-01

Pos logit contributions
baby+0.011
 ambulance+0.010
 robots+0.010
 lawyers+0.010
 hall+0.010
Neg logit contributions
 mun-0.011
oured-0.011
 cu-0.011
 placed-0.011
 gul-0.011
Token more
Token ID517
Feature activation+1.32e-01

Pos logit contributions
baby+0.003
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.008
oured-0.008
 gul-0.007
 cu-0.007
 placed-0.007
Token.
Token ID13
Feature activation+4.56e-01

Pos logit contributions
 mun+0.005
 gul+0.005
oured+0.005
 cu+0.005
 placed+0.005
Neg logit contributions
baby-0.004
 ambulance-0.003
 robots-0.003
 hall-0.003
 lawyers-0.003
Token
Token ID198
Feature activation+6.67e-01

Pos logit contributions
 mun+0.019
oured+0.018
int+0.018
 gul+0.018
 cu+0.017
Neg logit contributions
baby-0.013
 robots-0.011
 ambulance-0.011
 lawyers-0.011
 hall-0.011
Token Sally
Token ID25737
Feature activation+8.92e-01

Pos logit contributions
baby+0.022
 ambulance+0.020
 robots+0.019
 lawyers+0.019
 dragons+0.019
Neg logit contributions
 mun-0.043
oured-0.042
 gul-0.041
int-0.040
 cu-0.040
Token had
Token ID550
Feature activation+5.35e-01

Pos logit contributions
 mun+0.031
oured+0.030
 gul+0.029
 cu+0.028
int+0.027
Neg logit contributions
baby-0.030
 ambulance-0.028
 robots-0.028
 lawyers-0.028
 dragons-0.027
Token gone
Token ID3750
Feature activation+6.02e-01

Pos logit contributions
oured+0.020
 mun+0.020
int+0.020
mit+0.018
 gul+0.018
Neg logit contributions
baby-0.016
 dragons-0.016
 lawyers-0.016
 ambulance-0.016
 monsters-0.015
Token to
Token ID284
Feature activation+2.90e-01

Pos logit contributions
 mun+0.025
oured+0.024
 gul+0.023
 cu+0.023
int+0.022
Neg logit contributions
baby-0.017
 ambulance-0.016
 robots-0.016
 lawyers-0.015
 dragons-0.015
Token the
Token ID262
Feature activation-8.31e-02

Pos logit contributions
oured+0.012
 mun+0.012
int+0.012
 gul+0.011
 cu+0.011
Neg logit contributions
baby-0.007
 ambulance-0.007
 monster-0.007
 lawyers-0.007
 dragons-0.007
Token lake
Token ID13546
Feature activation+2.49e-01

Pos logit contributions
 monster+0.002
 ambulance+0.002
baby+0.002
 hall+0.002
 basement+0.002
Neg logit contributions
 mun-0.004
oured-0.004
int-0.003
 cu-0.003
 placed-0.003
Token and
Token ID290
Feature activation+1.18e-01

Pos logit contributions
 mun+0.011
oured+0.011
 gul+0.010
 cu+0.010
int+0.010
Neg logit contributions
baby-0.006
 ambulance-0.006
 robots-0.006
 lawyers-0.005
 dragons-0.005
Token she
Token ID673
Feature activation-4.08e-02

Pos logit contributions
 mun+0.004
oured+0.004
int+0.004
 gul+0.004
 cu+0.004
Neg logit contributions
baby-0.004
 ambulance-0.004
 alone-0.003
 lawyers-0.003
 robots-0.003
Token was
Token ID373
Feature activation-8.73e-02

Pos logit contributions
baby+0.001
 ambulance+0.001
 monster+0.001
 alone+0.001
 dragons+0.001
Neg logit contributions
oured-0.002
 mun-0.001
 pleasure-0.001
int-0.001
 bait-0.001
Token worried
Token ID7960
Feature activation+9.46e-02

Pos logit contributions
baby+0.003
 ambulance+0.002
 alone+0.002
 lawyers+0.002
 robots+0.002
Neg logit contributions
 mun-0.004
oured-0.003
 gul-0.003
int-0.003
 cu-0.003
Token that
Token ID326
Feature activation+1.65e-01

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
 placed+0.003
Neg logit contributions
baby-0.003
 ambulance-0.002
 robots-0.002
 lawyers-0.002
 villains-0.002
Token
Token ID198
Feature activation+4.81e-01

Pos logit contributions
 mun+0.026
oured+0.026
 gul+0.025
 cu+0.024
int+0.024
Neg logit contributions
baby-0.017
 ambulance-0.016
 robots-0.015
 lawyers-0.015
 dragons-0.015
TokenAnd
Token ID1870
Feature activation+1.01e+00

Pos logit contributions
 mun+0.017
oured+0.017
 gul+0.016
 cu+0.016
int+0.016
Neg logit contributions
baby-0.016
 ambulance-0.015
 lawyers-0.014
 robots-0.014
 dragons-0.014
Token each
Token ID1123
Feature activation-1.33e+00

Pos logit contributions
int+0.047
oured+0.046
 mun+0.044
 gul+0.042
 sto+0.041
Neg logit contributions
baby-0.024
 kids-0.021
 alone-0.021
 anymore-0.020
 twins-0.020
Token morning
Token ID3329
Feature activation+9.90e-01

Pos logit contributions
baby+0.035
 monster+0.033
 ambulance+0.032
 dragons+0.032
 hall+0.032
Neg logit contributions
oured-0.057
 mun-0.056
 cu-0.056
icol-0.055
 bait-0.053
Token the
Token ID262
Feature activation-3.04e-01

Pos logit contributions
leaf+0.051
oured+0.051
 mun+0.051
int+0.050
rig+0.049
Neg logit contributions
baby-0.017
 anymore-0.014
 kids-0.014
 twins-0.013
 ambulance-0.012
Token sun
Token ID4252
Feature activation-1.07e+00

Pos logit contributions
 twins+0.006
 hall+0.006
 kids+0.006
 monster+0.006
 dragons+0.006
Neg logit contributions
oured-0.015
 mun-0.015
int-0.014
 cu-0.014
leaf-0.014
Token kept
Token ID4030
Feature activation-5.69e-01

Pos logit contributions
baby+0.035
 ambulance+0.034
 hall+0.033
 monster+0.032
 dragons+0.032
Neg logit contributions
 mun-0.038
oured-0.038
 bait-0.035
leaf-0.035
 cu-0.035
Token pouring
Token ID23147
Feature activation-9.19e-02

Pos logit contributions
baby+0.015
 alone+0.015
 ambulance+0.014
 dragons+0.014
 robots+0.013
Neg logit contributions
oured-0.024
 mun-0.024
leaf-0.023
int-0.023
mit-0.023
Token down
Token ID866
Feature activation+4.04e-02

Pos logit contributions
baby+0.002
 alone+0.002
 ambulance+0.002
 dragons+0.002
 robots+0.002
Neg logit contributions
 mun-0.004
oured-0.004
int-0.004
 cu-0.004
leaf-0.004
Token its
Token ID663
Feature activation-1.45e-01

Pos logit contributions
 mun+0.002
oured+0.002
 gul+0.002
int+0.002
 cu+0.002
Neg logit contributions
baby-0.001
 ambulance-0.001
 dragons-0.001
 alone-0.001
 robots-0.001
Token bright
Token ID6016
Feature activation-2.93e-01

Pos logit contributions
 house+0.004
 alone+0.004
 hall+0.004
 fort+0.004
 dragons+0.004
Neg logit contributions
leaf-0.006
int-0.006
oured-0.006
 mun-0.006
 gul-0.006
Token Sam
Token ID3409
Feature activation+6.05e-02

Pos logit contributions
baby+0.001
 ambulance+0.000
 robots+0.000
 lawyers+0.000
 dragons+0.000
Neg logit contributions
 mun-0.001
oured-0.001
 gul-0.001
 cu-0.001
 bait-0.001
Token and
Token ID290
Feature activation+9.58e-02

Pos logit contributions
 mun+0.003
oured+0.003
 cu+0.002
 gul+0.002
 bait+0.002
Neg logit contributions
baby-0.002
 ambulance-0.001
 robots-0.001
 alone-0.001
 dragons-0.001
Token walked
Token ID6807
Feature activation-3.87e-01

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
int+0.004
 cu+0.004
Neg logit contributions
baby-0.003
 ambulance-0.002
 robots-0.002
 dragons-0.002
 lawyers-0.002
Token with
Token ID351
Feature activation-1.38e+00

Pos logit contributions
baby+0.011
 ambulance+0.010
 robots+0.010
 lawyers+0.010
 dragons+0.010
Neg logit contributions
 mun-0.016
oured-0.015
 gul-0.015
 cu-0.015
int-0.014
Token him
Token ID683
Feature activation+5.70e-01

Pos logit contributions
baby+0.041
 ambulance+0.038
 lawyers+0.036
 hall+0.036
 robots+0.035
Neg logit contributions
 mun-0.054
 gul-0.051
 bait-0.050
oured-0.050
 cu-0.050
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
 mun+0.027
oured+0.026
 gul+0.025
 cu+0.025
 bait+0.025
Neg logit contributions
baby-0.012
 ambulance-0.011
 robots-0.011
 lawyers-0.010
 dragons-0.010
Token She
Token ID1375
Feature activation-5.35e-01

Pos logit contributions
baby+0.009
 ambulance+0.008
 robots+0.008
 lawyers+0.007
 dragons+0.007
Neg logit contributions
 mun-0.016
oured-0.016
 gul-0.015
 cu-0.015
 bait-0.015
Token said
Token ID531
Feature activation+6.81e-01

Pos logit contributions
baby+0.016
 dragons+0.016
 monster+0.016
 hall+0.016
 ambulance+0.016
Neg logit contributions
oured-0.019
 pleasure-0.019
int-0.018
 bait-0.018
leaf-0.018
Token,
Token ID11
Feature activation-2.33e-01

Pos logit contributions
 mun+0.027
oured+0.026
 gul+0.026
 cu+0.025
 bait+0.025
Neg logit contributions
baby-0.019
 ambulance-0.017
 robots-0.017
 hall-0.017
 lawyers-0.017
Token "
Token ID366
Feature activation+7.45e-02

Pos logit contributions
baby+0.005
 ambulance+0.004
 robots+0.004
 lawyers+0.004
 dragons+0.004
Neg logit contributions
 mun-0.011
oured-0.011
 gul-0.011
 cu-0.011
int-0.011
TokenDon
Token ID3987
Feature activation-1.81e-01

Pos logit contributions
 mun+0.003
oured+0.003
 gul+0.003
 cu+0.003
int+0.003
Neg logit contributions
baby-0.002
 ambulance-0.002
 robots-0.002
 lawyers-0.002
 dragons-0.002
Token
Token ID198
Feature activation-1.58e-01

Pos logit contributions
baby+0.005
 ambulance+0.005
 robots+0.005
 monsters+0.005
 monster+0.005
Neg logit contributions
 mun-0.005
 cu-0.004
oured-0.004
 gul-0.004
 sto-0.004
Token"
Token ID1
Feature activation+2.75e-01

Pos logit contributions
baby+0.005
 ambulance+0.005
 robots+0.004
 dragons+0.004
 monster+0.004
Neg logit contributions
 mun-0.006
oured-0.006
 gul-0.006
 cu-0.006
int-0.005
TokenMom
Token ID29252
Feature activation+3.80e-01

Pos logit contributions
 mun+0.013
oured+0.013
int+0.012
 bait+0.012
 gul+0.012
Neg logit contributions
baby-0.006
 ambulance-0.005
 lawyers-0.005
 hall-0.005
 monster-0.005
Tokenmy
Token ID1820
Feature activation-6.71e-01

Pos logit contributions
 mun+0.021
leaf+0.020
oured+0.020
 bait+0.020
 gul+0.020
Neg logit contributions
baby-0.006
 ambulance-0.004
 robots-0.004
 lawyers-0.004
 monster-0.004
Token,
Token ID11
Feature activation-4.40e-01

Pos logit contributions
baby+0.010
 ambulance+0.008
 sister+0.008
 monster+0.007
 twins+0.007
Neg logit contributions
leaf-0.037
 mun-0.037
 bait-0.036
oured-0.035
 cu-0.035
Token mom
Token ID1995
Feature activation-3.21e-01

Pos logit contributions
baby+0.013
 ambulance+0.012
 monster+0.012
 robots+0.011
 lawyers+0.011
Neg logit contributions
 mun-0.018
oured-0.018
int-0.017
 gul-0.017
leaf-0.016
Tokenmy
Token ID1820
Feature activation-8.76e-01

Pos logit contributions
baby+0.006
 ambulance+0.005
 robots+0.005
 lawyers+0.005
 dragons+0.005
Neg logit contributions
 mun-0.016
oured-0.016
 gul-0.016
 cu-0.015
int-0.015
Token,
Token ID11
Feature activation-4.40e-01

Pos logit contributions
baby+0.017
 ambulance+0.015
 monster+0.014
 twins+0.013
 alone+0.013
Neg logit contributions
 mun-0.043
oured-0.043
leaf-0.042
 bait-0.041
int-0.041
Token look
Token ID804
Feature activation+1.85e-01

Pos logit contributions
baby+0.012
 ambulance+0.012
 monster+0.011
 kids+0.011
 robots+0.011
Neg logit contributions
 mun-0.019
oured-0.019
int-0.018
 gul-0.017
 cu-0.017
Token at
Token ID379
Feature activation+1.28e-01

Pos logit contributions
 mun+0.008
oured+0.007
 gul+0.007
 cu+0.007
int+0.007
Neg logit contributions
baby-0.005
 ambulance-0.005
 robots-0.005
 lawyers-0.005
 dragons-0.005
Token the
Token ID262
Feature activation-1.65e-01

Pos logit contributions
 mun+0.005
oured+0.005
 gul+0.005
 cu+0.005
int+0.005
Neg logit contributions
baby-0.004
 ambulance-0.003
 robots-0.003
 lawyers-0.003
 dragons-0.003
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
 mun+0.004
oured+0.004
 gul+0.004
 cu+0.004
int+0.004
Neg logit contributions
baby-0.003
 ambulance-0.002
 robots-0.002
 lawyers-0.002
 dragons-0.002
Token only
Token ID691
Feature activation+1.06e+00

Pos logit contributions
oured+0.005
 mun+0.005
int+0.005
 cu+0.005
 gul+0.005
Neg logit contributions
baby-0.003
 monster-0.002
 robots-0.002
 ambulance-0.002
 kids-0.002
Token 3
Token ID513
Feature activation+1.13e-01

Pos logit contributions
oured+0.044
 mun+0.044
int+0.043
 cu+0.041
 gul+0.040
Neg logit contributions
baby-0.032
 robots-0.030
 ambulance-0.029
 monster-0.029
 lawyers-0.029
Token years
Token ID812
Feature activation-3.74e-01

Pos logit contributions
 mun+0.004
 gul+0.004
 cu+0.004
oured+0.004
 bait+0.004
Neg logit contributions
baby-0.003
 basement-0.003
 ambulance-0.003
 robots-0.003
 dragons-0.003
Token old
Token ID1468
Feature activation+3.14e-01

Pos logit contributions
baby+0.010
 ambulance+0.009
 robots+0.009
 lawyers+0.009
 dragons+0.009
Neg logit contributions
 mun-0.016
oured-0.015
 gul-0.015
 cu-0.015
int-0.014
Token!
Token ID0
Feature activation-1.38e-01

Pos logit contributions
 mun+0.015
oured+0.014
 gul+0.014
 cu+0.014
 bait+0.014
Neg logit contributions
baby-0.007
 ambulance-0.006
 robots-0.006
 dragons-0.006
 lawyers-0.006
Token
Token ID198
Feature activation+1.20e-01

Pos logit contributions
baby+0.004
 ambulance+0.004
 robots+0.004
 lawyers+0.004
 dragons+0.004
Neg logit contributions
 mun-0.005
oured-0.005
 gul-0.005
 cu-0.005
int-0.005
Token
Token ID198
Feature activation+4.14e-02

Pos logit contributions
 mun+0.004
oured+0.004
 cu+0.004
 gul+0.004
int+0.004
Neg logit contributions
baby-0.004
 ambulance-0.004
 robots-0.004
 lawyers-0.003
 dragons-0.003
TokenOne
Token ID3198
Feature activation+1.56e-01

Pos logit contributions
 mun+0.002
oured+0.002
 gul+0.002
 cu+0.001
int+0.001
Neg logit contributions
baby-0.001
 ambulance-0.001
 robots-0.001
 lawyers-0.001
 dragons-0.001
Token day
Token ID1110
Feature activation-9.49e-02

Pos logit contributions
 mun+0.007
oured+0.007
 gul+0.007
 cu+0.007
int+0.006
Neg logit contributions
baby-0.004
 ambulance-0.003
 robots-0.003
 lawyers-0.003
 dragons-0.003
Token,
Token ID11
Feature activation-8.93e-01

Pos logit contributions
baby+0.002
 ambulance+0.001
 monster+0.001
 lawyers+0.001
 robots+0.001
Neg logit contributions
 mun-0.005
oured-0.005
 gul-0.005
 cu-0.005
leaf-0.005
Token clean
Token ID3424
Feature activation-5.63e-01

Pos logit contributions
baby+0.003
 hall+0.003
 alone+0.003
 robots+0.003
 ambulance+0.003
Neg logit contributions
 mun-0.004
oured-0.004
int-0.004
 bait-0.003
 cu-0.003
Token everything
Token ID2279
Feature activation-5.07e-01

Pos logit contributions
 villains+0.020
baby+0.020
 ambulance+0.019
emaker+0.018
 lawyers+0.018
Neg logit contributions
oured-0.016
 gul-0.016
 cu-0.016
 placed-0.016
 sprink-0.016
Token every
Token ID790
Feature activation+2.88e-01

Pos logit contributions
baby+0.016
 ambulance+0.015
 robots+0.014
 villains+0.014
 lawyers+0.014
Neg logit contributions
 mun-0.019
oured-0.019
 gul-0.018
 cu-0.017
 dip-0.017
Token day
Token ID1110
Feature activation-2.85e-01

Pos logit contributions
 mun+0.011
 gul+0.010
oured+0.010
 dip+0.010
 planted+0.010
Neg logit contributions
baby-0.009
 robots-0.008
 lawyers-0.008
 villains-0.008
 ambulance-0.008
Token."
Token ID526
Feature activation-3.39e-02

Pos logit contributions
baby+0.007
 ambulance+0.006
 robots+0.006
 lawyers+0.006
 dragons+0.006
Neg logit contributions
 mun-0.013
oured-0.012
 gul-0.012
 cu-0.012
int-0.012
Token
Token ID220
Feature activation-1.60e+00

Pos logit contributions
baby+0.001
 ambulance+0.001
 robots+0.001
 lawyers+0.001
 dragons+0.001
Neg logit contributions
 mun-0.001
oured-0.001
 gul-0.001
 cu-0.001
int-0.001
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
baby+0.060
 ambulance+0.059
 monster+0.059
 villains+0.059
 robots+0.059
Neg logit contributions
 mun-0.043
oured-0.043
 cu-0.041
int-0.041
 pleasure-0.041
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
baby+0.006
 ambulance+0.005
 robots+0.005
 lawyers+0.005
 dragons+0.005
Neg logit contributions
 mun-0.006
oured-0.006
 cu-0.006
 gul-0.006
int-0.006
TokenS
Token ID50
Feature activation+5.78e-01

Pos logit contributions
baby+0.004
 ambulance+0.004
 lawyers+0.004
 robots+0.004
 dragons+0.004
Neg logit contributions
oured-0.008
 mun-0.008
 gul-0.007
int-0.007
 cu-0.007
Tokenally
Token ID453
Feature activation+1.54e-01

Pos logit contributions
 mun+0.019
oured+0.019
 gul+0.018
 cu+0.018
 bait+0.018
Neg logit contributions
baby-0.020
 ambulance-0.019
 lawyers-0.018
 robots-0.018
 hall-0.018
Token's
Token ID338
Feature activation+1.94e-01

Pos logit contributions
 mun+0.005
oured+0.005
 bait+0.005
 pleasure+0.005
leaf+0.005
Neg logit contributions
baby-0.005
 ambulance-0.005
 robots-0.005
 hall-0.005
 dragons-0.005
Token."
Token ID526
Feature activation-6.62e-02

Pos logit contributions
baby+0.017
 ambulance+0.016
 robots+0.015
 hall+0.015
 lawyers+0.015
Neg logit contributions
 mun-0.020
oured-0.019
 gul-0.019
 cu-0.019
 bait-0.018
Token
Token ID198
Feature activation+1.44e-01

Pos logit contributions
baby+0.002
 hall+0.002
 robots+0.002
 lawyers+0.001
 ambulance+0.001
Neg logit contributions
 mun-0.003
oured-0.003
 bait-0.003
int-0.003
leaf-0.003
Token
Token ID198
Feature activation+1.29e-01

Pos logit contributions
 mun+0.006
oured+0.006
 gul+0.005
 cu+0.005
int+0.005
Neg logit contributions
baby-0.004
 ambulance-0.004
 robots-0.004
 lawyers-0.004
 dragons-0.004
TokenM
Token ID44
Feature activation+8.41e-01

Pos logit contributions
 mun+0.005
oured+0.005
 gul+0.004
 cu+0.004
int+0.004
Neg logit contributions
baby-0.004
 ambulance-0.004
 robots-0.004
 lawyers-0.004
 dragons-0.004
Tokenia
Token ID544
Feature activation-3.77e-01

Pos logit contributions
 mun+0.034
oured+0.033
int+0.033
 gul+0.032
 cu+0.032
Neg logit contributions
baby-0.023
 robots-0.022
 lawyers-0.021
 ambulance-0.021
 alone-0.021
Token and
Token ID290
Feature activation-1.79e+00

Pos logit contributions
 monster+0.010
baby+0.009
 alone+0.009
 ambulance+0.009
 kids+0.009
Neg logit contributions
 pleasure-0.016
leaf-0.015
 bait-0.015
oured-0.015
 souven-0.014
Token Jack
Token ID3619
Feature activation+3.98e-01

Pos logit contributions
baby+0.057
 ambulance+0.052
 robots+0.051
 lawyers+0.051
 dragons+0.050
Neg logit contributions
 mun-0.068
oured-0.065
 gul-0.064
 cu-0.063
 bait-0.061
Token were
Token ID547
Feature activation-4.06e-01

Pos logit contributions
 pleasure+0.015
leaf+0.015
icol+0.015
 bait+0.015
 mun+0.015
Neg logit contributions
baby-0.011
 ambulance-0.010
 alone-0.010
 monster-0.009
 kids-0.009
Token sad
Token ID6507
Feature activation+3.99e-01

Pos logit contributions
baby+0.012
 ambulance+0.011
 robots+0.011
 lawyers+0.011
 dragons+0.010
Neg logit contributions
 mun-0.016
oured-0.016
 gul-0.015
 cu-0.015
int-0.015
Token,
Token ID11
Feature activation-2.58e-01

Pos logit contributions
 mun+0.016
oured+0.015
 gul+0.015
 cu+0.015
 dip+0.014
Neg logit contributions
baby-0.012
 ambulance-0.011
 robots-0.011
 lawyers-0.011
 dragons-0.010
Token but
Token ID475
Feature activation+3.59e-01

Pos logit contributions
baby+0.008
 ambulance+0.008
 robots+0.008
 lawyers+0.008
 dragons+0.008
Neg logit contributions
oured-0.010
 mun-0.009
int-0.009
 gul-0.009
leaf-0.009
Token She
Token ID1375
Feature activation-6.15e-01

Pos logit contributions
baby+0.003
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.005
oured-0.005
 gul-0.005
 cu-0.005
int-0.005
Token said
Token ID531
Feature activation+5.98e-01

Pos logit contributions
 dragons+0.016
baby+0.016
 hall+0.016
 alone+0.015
 monster+0.015
Neg logit contributions
oured-0.025
 pleasure-0.024
int-0.024
leaf-0.023
 bait-0.023
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
 mun+0.026
oured+0.025
 gul+0.024
 cu+0.024
int+0.024
Neg logit contributions
baby-0.015
 ambulance-0.014
 robots-0.014
 lawyers-0.013
 dragons-0.013
Token "
Token ID366
Feature activation+2.08e-03

Pos logit contributions
baby+0.004
 ambulance+0.003
 robots+0.003
 lawyers+0.003
 dragons+0.003
Neg logit contributions
 mun-0.009
oured-0.009
 gul-0.009
 cu-0.009
int-0.009
TokenPlease
Token ID5492
Feature activation-4.44e-01

Pos logit contributions
oured+0.000
 mun+0.000
 gul+0.000
 cu+0.000
 bait+0.000
Neg logit contributions
baby-0.000
 lawyers-0.000
 ambulance-0.000
 robots-0.000
 hall-0.000
Token don
Token ID836
Feature activation-1.69e+00

Pos logit contributions
baby+0.011
 ambulance+0.010
 monster+0.010
 robots+0.010
 lawyers+0.010
Neg logit contributions
 mun-0.020
oured-0.019
leaf-0.019
 gul-0.019
int-0.018
Token't
Token ID470
Feature activation-1.11e+00

Pos logit contributions
 fort+0.082
 villains+0.081
 yards+0.080
yards+0.080
 dragons+0.080
Neg logit contributions
 grant-0.025
 gul-0.023
 sprink-0.022
 mun-0.022
 dip-0.021
Token take
Token ID1011
Feature activation-1.07e+00

Pos logit contributions
baby+0.040
 robots+0.038
 ambulance+0.037
 lawyers+0.037
 dragons+0.037
Neg logit contributions
 mun-0.036
 gul-0.033
 cu-0.033
oured-0.033
 dip-0.033
Token it
Token ID340
Feature activation-1.27e-01

Pos logit contributions
baby+0.031
 ambulance+0.029
 robots+0.029
 dragons+0.028
 lawyers+0.028
Neg logit contributions
 mun-0.044
oured-0.043
 gul-0.040
 cu-0.040
int-0.040
Token away
Token ID1497
Feature activation-4.77e-01

Pos logit contributions
baby+0.004
 robots+0.004
 lawyers+0.004
 villains+0.004
 ambulance+0.003
Neg logit contributions
 mun-0.005
oured-0.005
 bait-0.004
 dip-0.004
 cu-0.004
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
baby+0.015
 ambulance+0.014
 robots+0.014
 lawyers+0.013
 villains+0.013
Neg logit contributions
 mun-0.017
oured-0.016
 cu-0.016
 bait-0.016
 gul-0.016
Token that
Token ID326
Feature activation+1.40e-01

Pos logit contributions
baby+0.000
 alone+0.000
 monster+0.000
 ambulance+0.000
 hall+0.000
Neg logit contributions
 pleasure-0.000
 mun-0.000
oured-0.000
int-0.000
leaf-0.000
Token was
Token ID373
Feature activation-1.43e-01

Pos logit contributions
oured+0.006
int+0.006
 mun+0.006
leaf+0.006
 cu+0.006
Neg logit contributions
baby-0.004
 monster-0.004
 alone-0.003
 hall-0.003
 dragons-0.003
Token blowing
Token ID19280
Feature activation-7.12e-01

Pos logit contributions
 alone+0.004
 dragons+0.003
 robots+0.003
 twins+0.003
 house+0.003
Neg logit contributions
int-0.007
 mun-0.007
oured-0.007
leaf-0.006
OW-0.006
Token in
Token ID287
Feature activation+5.03e-01

Pos logit contributions
baby+0.021
 robots+0.020
 dragons+0.019
 ambulance+0.019
 alone+0.019
Neg logit contributions
 mun-0.029
oured-0.029
int-0.027
 cu-0.027
 gul-0.026
Token the
Token ID262
Feature activation-2.03e-01

Pos logit contributions
oured+0.023
 mun+0.023
int+0.022
 gul+0.021
 cu+0.021
Neg logit contributions
baby-0.012
 dragons-0.011
 robots-0.011
 monster-0.011
 ambulance-0.011
Token wind
Token ID2344
Feature activation-2.05e+00

Pos logit contributions
 ambulance+0.005
 alone+0.004
 monster+0.004
 hall+0.004
baby+0.004
Neg logit contributions
 mun-0.010
oured-0.010
int-0.009
 gul-0.009
 cu-0.009
Token.
Token ID13
Feature activation-1.86e-01

Pos logit contributions
baby+0.035
 ambulance+0.030
 robots+0.029
 alone+0.029
 dragons+0.028
Neg logit contributions
 mun-0.106
oured-0.104
 gul-0.101
 cu-0.101
int-0.099
Token He
Token ID679
Feature activation-1.00e-01

Pos logit contributions
baby+0.004
 robots+0.004
 ambulance+0.003
 hall+0.003
 monster+0.003
Neg logit contributions
 mun-0.008
oured-0.008
int-0.008
leaf-0.008
 dip-0.008
Token was
Token ID373
Feature activation-4.89e-01

Pos logit contributions
 dragons+0.002
lf+0.002
 alone+0.002
 monster+0.002
 robots+0.002
Neg logit contributions
 pleasure-0.005
oured-0.004
icol-0.004
leaf-0.004
ical-0.004
Token so
Token ID523
Feature activation-2.77e-02

Pos logit contributions
 alone+0.012
 twins+0.011
 kids+0.009
 ambulance+0.009
 brothers+0.009
Neg logit contributions
 mun-0.023
int-0.023
icol-0.022
oured-0.022
OW-0.022
Token excited
Token ID6568
Feature activation+9.93e-01

Pos logit contributions
 alone+0.001
 twins+0.000
baby+0.000
 kids+0.000
 lawyers+0.000
Neg logit contributions
oured-0.001
 mun-0.001
icol-0.001
int-0.001
 cu-0.001
Token and
Token ID290
Feature activation-6.33e-02

Pos logit contributions
 mun+0.008
oured+0.008
 gul+0.007
 cu+0.007
int+0.007
Neg logit contributions
baby-0.003
 ambulance-0.003
 robots-0.003
 dragons-0.003
 lawyers-0.003
Token asked
Token ID1965
Feature activation+2.81e-01

Pos logit contributions
baby+0.002
 ambulance+0.002
 robots+0.002
 lawyers+0.002
 dragons+0.002
Neg logit contributions
 mun-0.002
oured-0.002
 gul-0.002
 cu-0.002
int-0.002
Token if
Token ID611
Feature activation-1.30e+00

Pos logit contributions
 mun+0.012
oured+0.012
 gul+0.011
 cu+0.011
int+0.011
Neg logit contributions
baby-0.007
 ambulance-0.007
 robots-0.007
 lawyers-0.006
 dragons-0.006
Token she
Token ID673
Feature activation-3.87e-01

Pos logit contributions
 lawyers+0.021
baby+0.021
 dragons+0.020
 monster+0.020
 ambulance+0.019
Neg logit contributions
oured-0.069
 mun-0.067
int-0.067
 gul-0.064
 cu-0.064
Token could
Token ID714
Feature activation+2.94e-01

Pos logit contributions
baby+0.009
 monster+0.009
 dragons+0.009
 alone+0.008
 villains+0.008
Neg logit contributions
oured-0.017
 mun-0.016
icol-0.016
leaf-0.016
 gul-0.016
Token help
Token ID1037
Feature activation-2.06e+00

Pos logit contributions
oured+0.012
 mun+0.012
 gul+0.011
leaf+0.011
int+0.011
Neg logit contributions
 ambulance-0.008
 alone-0.008
baby-0.008
 monster-0.007
 house-0.007
Token it
Token ID340
Feature activation-2.62e-01

Pos logit contributions
baby+0.047
 ambulance+0.043
 robots+0.042
 lawyers+0.041
 dragons+0.041
Neg logit contributions
 mun-0.096
oured-0.095
 gul-0.090
int-0.090
 cu-0.089
Token find
Token ID1064
Feature activation-8.04e-01

Pos logit contributions
baby+0.006
 ambulance+0.005
 robots+0.005
 lawyers+0.005
 dragons+0.005
Neg logit contributions
 mun-0.012
oured-0.012
 gul-0.011
 cu-0.011
int-0.011
Token its
Token ID663
Feature activation-1.94e-01

Pos logit contributions
 dragons+0.020
 ambulance+0.019
baby+0.019
 alone+0.019
 monster+0.019
Neg logit contributions
 mun-0.035
oured-0.035
mit-0.033
 gul-0.033
int-0.032
Token way
Token ID835
Feature activation-1.25e+00

Pos logit contributions
baby+0.005
 ambulance+0.005
 robots+0.005
 dragons+0.005
 monster+0.005
Neg logit contributions
 mun-0.008
oured-0.008
 gul-0.008
 cu-0.008
int-0.008
Token home
Token ID1363
Feature activation-1.18e+00

Pos logit contributions
baby+0.028
 ambulance+0.025
 robots+0.024
 lawyers+0.023
 dragons+0.023
Neg logit contributions
 mun-0.059
oured-0.057
 gul-0.055
 cu-0.055
int-0.054