TOP ACTIVATIONS
MAX = 8.907

began pushing themselves back and forth on their swings .
and forth , back and forth . She had so much
bouncing the ball back and forth . They laughed and played
kicking the ball back and forth . From that day on
moved his feet back and forth to practice standing on one
began to swing back and forth . The two friends had
eyes and swayed back and forth . After the
started to swing back and forth , higher and higher .
The ground rocked back and forth . It was exciting ,
and pushed Lucy back and forth . It felt like Lucy
swing would go back and forth in a fun motion .
He shook it back and forth and watched as the birds
passed the ball back and forth , laughing and having fun
vine , swinging back and forth . They were very happy
kicked the ball back and forth and had a lot of
shake his head back and forth . " No , Kitty
and toss it back and forth . L ily
for hours , back and forth , feeling the wind on
with leaves shaking back and forth . The kittens were mesmer
. She goes back and forth and makes many lines .

MIN ACTIVATIONS
MIN = -7.096

Sara . Good night . Sleep tight . Don 't let
" It 's easy . Sleep is when you close your
, Anna and Ben . Sleep well . We love you
You need to sleep . Sleep is good for you .
, my sweet ies . Sleep tight . And don 't
Good night , Lily . Sleep well ." Lily replied ,
Good night , Max . Sleep tight ." Max fell asleep
saw a big ball . Sleep y loved balls . He
Do you like my coffee shop ? I sell coffee and
to go to the doll shop . She does not want
One day , Sleep y went for a walk
day and all night . Sleep y had a big ,
. It belongs to the shop . You have to share
man comes out of a shop . He sees Tom with
them to the ice cream shop . They buy them ice
I can go to the shop and buy one ."
dog named Sleep y . Sleep y was a very sleepy
was a little dog named Sleep y . Sleep y was
you need to sleep . Sleep will help you feel better
They walk out of the shop and go back to the

INTERVAL 4.906 - 8.907
CONTAINS 0.046%

his mom argued back and forth until John 's mom finally
. He rocked back and forth , pushing and pulling as
and landed on Spot 's itch . Suddenly , the itch
on the wall and never forget it ." John
comes and sees the z ig zag . She gets a

INTERVAL 0.906 - 4.906
CONTAINS 15.766%

the lions , and the monkeys , and the birds .
my . She saw a toy horse and asked her mom
too damaged and had to be removed .
. They said , " No , Mom , we want
p outed and crossed her arms . She did not like

INTERVAL -3.095 - 0.906
CONTAINS 82.898%

They are waiting for the letters . They will be sad
. They want to see the show . They think it
her to protect herself . So , she grabbed the box
to do some work . Tom was very uncomfortable in the
closer . " Hi , Tom and Anna !"

INTERVAL -7.096 - -3.095
CONTAINS 1.290%

her favourite things around the camp to make it cozy ,
They make a big snow man with a hat and a
words . He told a story about a sleepy bear and
Amy . She was three years old and she loved cr
. When the waiter came , Lily 's mom
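
NOTE ON INTERVALS
The four INTERVAL ranges above look like equal-width bins between MIN = -7.096 and MAX = 8.907. That is an inference from the printed numbers, not something the dashboard states. The sketch below recomputes the bin edges under that assumption and compares them with the reported bounds. The helper name `equal_width_edges` is hypothetical, and the comparison uses a small tolerance because the printed values are rounded to three decimals.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (assumed binning scheme)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed in the dashboard.
edges = equal_width_edges(-7.096, 8.907, 4)

# Interval bounds as printed in the INTERVAL headers, lowest to highest.
reported = [-7.096, -3.095, 0.906, 4.906, 8.907]

# Largest disagreement between recomputed and reported edges.
max_gap = max(abs(e - r) for e, r in zip(edges, reported))

# CONTAINS percentages, listed from the lowest interval to the highest.
shares = [1.290, 82.898, 15.766, 0.046]
total_share = sum(shares)

print("recomputed edges:", [round(e, 3) for e in edges])
print("max gap vs reported:", max_gap)
print("total share (%):", total_share)
```

If the assumption holds, the recomputed edges differ from the reported ones only by rounding, and the CONTAINS percentages sum to about 100, so the four intervals cover every sampled activation.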
Token began
Token ID2540
Feature activation+1.84e+00

Pos logit contributions
 realize+0.175
 prod+0.172
pieces+0.169
aults+0.166
tip+0.162
Neg logit contributions
 grandma-0.169
 restaurant-0.164
 manager-0.164
 refrigerator-0.163
 old-0.163
Token pushing
Token ID7796
Feature activation+2.35e+00

Pos logit contributions
 realize+0.144
 prod+0.143
pieces+0.140
aults+0.138
tip+0.136
Neg logit contributions
 grandma-0.053
 refrigerator-0.050
 restaurant-0.050
 manager-0.050
 shop-0.049
Token themselves
Token ID2405
Feature activation+7.05e-01

Pos logit contributions
 realize+0.172
 prod+0.170
pieces+0.168
tip+0.164
vals+0.164
Neg logit contributions
 grandma-0.082
 old-0.077
 shop-0.075
 Anna-0.074
 Grand-0.072
Token back
Token ID736
Feature activation+1.57e+00

Pos logit contributions
 realize+0.048
 prod+0.047
pieces+0.047
tip+0.046
vals+0.046
Neg logit contributions
 old-0.027
 grandma-0.027
 manager-0.026
 shop-0.026
 restaurant-0.025
Token and
Token ID290
Feature activation+2.32e-01

Pos logit contributions
 realize+0.104
 prod+0.103
pieces+0.101
aults+0.099
tip+0.097
Neg logit contributions
 grandma-0.063
 restaurant-0.061
 manager-0.061
 refrigerator-0.061
 old-0.060
Token forth
Token ID6071
Feature activation+8.58e+00

Pos logit contributions
 realize+0.004
pieces+0.004
 unnecessary+0.004
tip+0.004
vals+0.004
Neg logit contributions
 grandma-0.020
 manager-0.020
 shop-0.020
 old-0.020
 Anna-0.020
Token on
Token ID319
Feature activation+1.30e+00

Pos logit contributions
 realize+0.574
 prod+0.565
pieces+0.554
aults+0.554
tip+0.534
Neg logit contributions
 grandma-0.341
 restaurant-0.336
 refrigerator-0.332
 manager-0.327
 old-0.321
Token their
Token ID511
Feature activation+8.76e-01

Pos logit contributions
 realize+0.084
 prod+0.083
pieces+0.081
aults+0.081
 ease+0.078
Neg logit contributions
 grandma-0.054
 manager-0.054
 restaurant-0.054
 refrigerator-0.053
 old-0.052
Token swings
Token ID26728
Feature activation+9.04e-01

Pos logit contributions
 realize+0.049
 prod+0.048
pieces+0.048
aults+0.046
tip+0.046
Neg logit contributions
 grandma-0.045
 old-0.044
 restaurant-0.044
 manager-0.043
 refrigerator-0.043
Token.
Token ID13
Feature activation-1.70e-01

Pos logit contributions
 realize+0.061
aults+0.059
pieces+0.059
 ease+0.058
 prod+0.058
Neg logit contributions
 restaurant-0.034
omm-0.034
 refrigerator-0.033
 grandma-0.033
 manager-0.033
Token
Token ID198
Feature activation+4.46e-01

Pos logit contributions
 restaurant+0.007
 refrigerator+0.007
 grandma+0.007
 manager+0.007
 old+0.007
Neg logit contributions
 prod-0.010
 realize-0.010
pieces-0.010
 ease-0.010
aults-0.010
Token and
Token ID290
Feature activation-4.28e-01

Pos logit contributions
 realize+0.074
 forth+0.070
pieces+0.070
 prod+0.069
aults+0.068
Neg logit contributions
 manager-0.055
 refrigerator-0.052
 restaurant-0.052
omm-0.052
 Cream-0.051
Token forth
Token ID6071
Feature activation+8.67e+00

Pos logit contributions
 grandma+0.027
 restaurant+0.027
 old+0.026
 manager+0.026
 shop+0.026
Neg logit contributions
 realize-0.018
 prod-0.018
pieces-0.018
aults-0.018
tip-0.017
Token,
Token ID11
Feature activation+1.66e-01

Pos logit contributions
 realize+0.560
aults+0.550
 prod+0.545
pieces+0.541
 ease+0.534
Neg logit contributions
 restaurant-0.351
 refrigerator-0.351
 manager-0.338
 grandma-0.337
 chamber-0.334
Token back
Token ID736
Feature activation+1.27e+00

Pos logit contributions
 realize+0.010
 prod+0.010
pieces+0.010
aults+0.009
 realise+0.009
Neg logit contributions
 grandma-0.008
 refrigerator-0.008
 manager-0.008
 restaurant-0.008
 chamber-0.007
Token and
Token ID290
Feature activation-3.01e-02

Pos logit contributions
 realize+0.082
 prod+0.079
pieces+0.077
aults+0.076
 realise+0.076
Neg logit contributions
 manager-0.053
 grandma-0.053
 refrigerator-0.052
 restaurant-0.052
 shop-0.050
Token forth
Token ID6071
Feature activation+8.58e+00

Pos logit contributions
 grandma+0.002
 restaurant+0.002
 manager+0.002
 refrigerator+0.002
 old+0.002
Neg logit contributions
 realize-0.001
 prod-0.001
pieces-0.001
aults-0.001
tip-0.001
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
 realize+0.553
aults+0.545
 prod+0.536
pieces+0.534
 ease+0.523
Neg logit contributions
 refrigerator-0.358
 restaurant-0.354
 grandma-0.346
 manager-0.344
 chamber-0.338
Token She
Token ID1375
Feature activation+4.39e-01

Pos logit contributions
 manager+0.009
 old+0.008
 restaurant+0.008
 chamber+0.008
man+0.008
Neg logit contributions
 realize-0.011
pieces-0.011
 prod-0.010
 ease-0.010
 realise-0.010
Token had
Token ID550
Feature activation+7.43e-01

Pos logit contributions
 realize+0.027
 prod+0.025
pieces+0.025
 realise+0.025
aults+0.024
Neg logit contributions
 refrigerator-0.022
 manager-0.021
 chamber-0.021
 restaurant-0.021
 grandma-0.020
Token so
Token ID523
Feature activation-7.03e-02

Pos logit contributions
 realize+0.048
 prod+0.047
pieces+0.046
aults+0.046
tip+0.045
Neg logit contributions
 grandma-0.032
 manager-0.031
 restaurant-0.031
 refrigerator-0.031
 old-0.030
Token much
Token ID881
Feature activation+2.17e+00

Pos logit contributions
 grandma+0.003
 old+0.003
 restaurant+0.003
 shop+0.003
 manager+0.003
Neg logit contributions
 prod-0.004
 realize-0.004
pieces-0.004
aults-0.004
tip-0.004
Token bouncing
Token ID30713
Feature activation+2.56e+00

Pos logit contributions
 manager+0.022
 restaurant+0.021
keeper+0.020
 refrigerator+0.020
 Cream+0.020
Neg logit contributions
 realize-0.039
pieces-0.037
 ease-0.037
aults-0.037
 realise-0.036
Token the
Token ID262
Feature activation+2.79e-02

Pos logit contributions
 realize+0.150
 prod+0.149
pieces+0.148
aults+0.145
 glide+0.143
Neg logit contributions
 grandma-0.123
 manager-0.121
 refrigerator-0.119
 restaurant-0.116
keeper-0.115
Token ball
Token ID2613
Feature activation+1.46e+00

Pos logit contributions
 realize+0.001
 prod+0.001
pieces+0.001
aults+0.001
tip+0.001
Neg logit contributions
 grandma-0.001
 manager-0.001
 restaurant-0.001
 refrigerator-0.001
 shop-0.001
Token back
Token ID736
Feature activation+1.41e+00

Pos logit contributions
 realize+0.101
 prod+0.100
pieces+0.099
aults+0.098
tip+0.096
Neg logit contributions
 grandma-0.054
 manager-0.052
 restaurant-0.052
 refrigerator-0.051
 old-0.049
Token and
Token ID290
Feature activation-2.42e-01

Pos logit contributions
 realize+0.093
 prod+0.089
 forth+0.088
aults+0.087
pieces+0.087
Neg logit contributions
 manager-0.057
 grandma-0.055
 restaurant-0.053
omm-0.053
 refrigerator-0.052
Token forth
Token ID6071
Feature activation+8.53e+00

Pos logit contributions
 grandma+0.018
 restaurant+0.018
 old+0.018
 manager+0.018
 shop+0.018
Neg logit contributions
 realize-0.008
 prod-0.008
pieces-0.007
aults-0.007
tip-0.007
Token.
Token ID13
Feature activation-1.68e-01

Pos logit contributions
 realize+0.569
 prod+0.559
aults+0.552
pieces+0.551
 ease+0.533
Neg logit contributions
 grandma-0.337
 restaurant-0.336
 refrigerator-0.336
 manager-0.329
 chamber-0.322
Token They
Token ID1119
Feature activation+3.04e-01

Pos logit contributions
 manager+0.008
 restaurant+0.007
 old+0.007
 chamber+0.007
 refrigerator+0.007
Neg logit contributions
 realize-0.010
pieces-0.010
 prod-0.010
 ease-0.010
 realise-0.009
Token laughed
Token ID13818
Feature activation+1.41e+00

Pos logit contributions
 prod+0.016
pieces+0.016
 unnecessary+0.016
tip+0.016
 realize+0.015
Neg logit contributions
 grandma-0.016
 shop-0.015
 old-0.015
 restaurant-0.015
 is-0.015
Token and
Token ID290
Feature activation+3.58e-02

Pos logit contributions
 realize+0.103
aults+0.099
 prod+0.098
pieces+0.097
 realise+0.095
Neg logit contributions
 refrigerator-0.050
 restaurant-0.050
 manager-0.049
 chamber-0.048
 shop-0.047
Token played
Token ID2826
Feature activation+1.60e+00

Pos logit contributions
pieces+0.002
 realize+0.002
 prod+0.002
 unnecessary+0.002
tip+0.002
Neg logit contributions
 grandma-0.002
 Anna-0.002
 old-0.002
 Grand-0.002
 shop-0.002
Token kicking
Token ID17997
Feature activation+2.05e+00

Pos logit contributions
 realize+0.017
pieces+0.017
 prod+0.017
aults+0.016
 realise+0.016
Neg logit contributions
 refrigerator-0.013
 manager-0.013
 restaurant-0.013
 grandma-0.013
keeper-0.013
Token the
Token ID262
Feature activation+3.88e-01

Pos logit contributions
 realize+0.130
 prod+0.129
pieces+0.127
aults+0.125
tip+0.121
Neg logit contributions
 grandma-0.089
 manager-0.087
 refrigerator-0.086
 restaurant-0.085
keeper-0.083
Token ball
Token ID2613
Feature activation+1.60e+00

Pos logit contributions
 realize+0.021
 prod+0.021
pieces+0.021
aults+0.020
tip+0.020
Neg logit contributions
 grandma-0.020
 manager-0.019
 restaurant-0.019
 refrigerator-0.019
 shop-0.019
Token back
Token ID736
Feature activation+1.39e+00

Pos logit contributions
 realize+0.119
 prod+0.118
pieces+0.117
aults+0.114
tip+0.113
Neg logit contributions
 grandma-0.051
 old-0.050
 manager-0.049
 shop-0.049
 restaurant-0.049
Token and
Token ID290
Feature activation-7.43e-02

Pos logit contributions
 realize+0.097
 prod+0.095
pieces+0.094
aults+0.093
 ease+0.090
Neg logit contributions
 grandma-0.051
 manager-0.050
 restaurant-0.049
 refrigerator-0.048
keeper-0.046
Token forth
Token ID6071
Feature activation+8.45e+00

Pos logit contributions
 grandma+0.007
 is+0.007
 shop+0.006
 old+0.006
 Anna+0.006
Neg logit contributions
pieces-0.001
 realize-0.001
tip-0.001
 prod-0.001
aults-0.001
Token.
Token ID13
Feature activation-9.47e-02

Pos logit contributions
 realize+0.581
aults+0.568
 prod+0.568
pieces+0.562
 ease+0.548
Neg logit contributions
 restaurant-0.319
 refrigerator-0.315
 grandma-0.315
 manager-0.308
 chamber-0.300
Token From
Token ID3574
Feature activation-1.19e-01

Pos logit contributions
 manager+0.004
 old+0.004
 restaurant+0.004
 refrigerator+0.004
 chamber+0.004
Neg logit contributions
pieces-0.006
 realize-0.006
 prod-0.005
 ease-0.005
Kit-0.005
Token that
Token ID326
Feature activation-9.38e-01

Pos logit contributions
 refrigerator+0.006
 manager+0.006
 restaurant+0.006
 chamber+0.005
 grandma+0.005
Neg logit contributions
 realize-0.007
 forth-0.007
 prod-0.007
cking-0.007
aults-0.006
Token day
Token ID1110
Feature activation+8.04e-01

Pos logit contributions
 grandma+0.037
 manager+0.036
 restaurant+0.036
 refrigerator+0.036
 old+0.035
Neg logit contributions
 realize-0.064
 prod-0.063
pieces-0.062
aults-0.061
tip-0.060
Token on
Token ID319
Feature activation+1.14e+00

Pos logit contributions
 prod+0.062
pieces+0.060
 realize+0.059
cking+0.056
 realise+0.056
Neg logit contributions
 is-0.028
 grandma-0.027
 shop-0.027
 about-0.026
 Anna-0.025
Token moved
Token ID3888
Feature activation-6.45e-01

Pos logit contributions
eddy+0.032
 grandma+0.031
 refrigerator+0.031
omm+0.031
 manager+0.031
Neg logit contributions
 realize-0.036
 prod-0.034
 realise-0.033
pieces-0.031
 glide-0.031
Token his
Token ID465
Feature activation+9.83e-02

Pos logit contributions
 grandma+0.032
 refrigerator+0.031
 restaurant+0.031
 chamber+0.031
 manager+0.030
Neg logit contributions
 realize-0.037
 prod-0.037
aults-0.036
pieces-0.035
 ease-0.035
Token feet
Token ID3625
Feature activation+3.70e+00

Pos logit contributions
 prod+0.005
 realize+0.005
aults+0.005
pieces+0.005
 glide+0.005
Neg logit contributions
 manager-0.005
 restaurant-0.005
 grandma-0.005
 Cream-0.005
 refrigerator-0.005
Token back
Token ID736
Feature activation+1.24e+00

Pos logit contributions
 realize+0.255
 prod+0.252
pieces+0.248
aults+0.244
tip+0.240
Neg logit contributions
 grandma-0.140
 restaurant-0.134
 manager-0.134
 refrigerator-0.133
 old-0.133
Token and
Token ID290
Feature activation-1.32e-01

Pos logit contributions
 realize+0.080
 prod+0.079
aults+0.077
pieces+0.076
 realise+0.075
Neg logit contributions
 grandma-0.051
 manager-0.050
 restaurant-0.049
 refrigerator-0.048
 Cream-0.048
Token forth
Token ID6071
Feature activation+8.42e+00

Pos logit contributions
 grandma+0.009
 refrigerator+0.009
 restaurant+0.009
 manager+0.009
 old+0.009
Neg logit contributions
 realize-0.005
 prod-0.005
pieces-0.004
aults-0.004
tip-0.004
Token to
Token ID284
Feature activation+1.98e-01

Pos logit contributions
 realize+0.559
 prod+0.552
aults+0.542
pieces+0.533
 ease+0.524
Neg logit contributions
 grandma-0.336
 restaurant-0.334
 refrigerator-0.334
 manager-0.321
 chamber-0.320
Token practice
Token ID3357
Feature activation+1.86e+00

Pos logit contributions
 realize+0.012
 prod+0.012
pieces+0.012
aults+0.011
 realise+0.011
Neg logit contributions
 grandma-0.009
 manager-0.009
 refrigerator-0.009
 restaurant-0.009
 old-0.009
Token standing
Token ID5055
Feature activation-1.17e+00

Pos logit contributions
 realize+0.121
 prod+0.119
aults+0.116
pieces+0.116
 ease+0.115
Neg logit contributions
 refrigerator-0.078
 grandma-0.077
 manager-0.076
 restaurant-0.075
keeper-0.074
Token on
Token ID319
Feature activation+1.07e+00

Pos logit contributions
 grandma+0.055
 refrigerator+0.054
 restaurant+0.054
 manager+0.054
omm+0.052
Neg logit contributions
 realize-0.070
aults-0.069
 prod-0.068
pieces-0.067
 forth-0.067
Token one
Token ID530
Feature activation-7.12e-01

Pos logit contributions
 ease+0.063
 realize+0.061
 prod+0.059
 tumble+0.059
aults+0.058
Neg logit contributions
 manager-0.051
keeper-0.051
ber-0.049
 waitress-0.049
 waiter-0.049
Token began
Token ID2540
Feature activation+1.44e+00

Pos logit contributions
 refrigerator+0.005
 manager+0.005
 restaurant+0.005
 chamber+0.005
 grandma+0.005
Neg logit contributions
 realize-0.006
 prod-0.006
pieces-0.006
aults-0.006
 forth-0.006
Token to
Token ID284
Feature activation+2.28e-01

Pos logit contributions
 realize+0.106
 prod+0.105
pieces+0.104
aults+0.102
tip+0.100
Neg logit contributions
 grandma-0.047
 restaurant-0.045
 manager-0.045
 refrigerator-0.045
 old-0.045
Token swing
Token ID9628
Feature activation+3.36e+00

Pos logit contributions
pieces+0.010
 prod+0.010
 realize+0.010
aults+0.009
tip+0.009
Neg logit contributions
 grandma-0.014
 restaurant-0.014
 shop-0.014
 old-0.014
 refrigerator-0.014
Token back
Token ID736
Feature activation+1.19e+00

Pos logit contributions
pieces+0.184
 realize+0.183
 ease+0.183
aults+0.181
 forth+0.178
Neg logit contributions
 restaurant-0.165
 refrigerator-0.164
 manager-0.164
omm-0.163
 waiter-0.158
Token and
Token ID290
Feature activation+6.04e-02

Pos logit contributions
 realize+0.070
 prod+0.067
pieces+0.067
aults+0.066
 forth+0.066
Neg logit contributions
 manager-0.056
 restaurant-0.055
 grandma-0.054
 refrigerator-0.054
 Cream-0.053
Token forth
Token ID6071
Feature activation+8.42e+00

Pos logit contributions
pieces+0.001
 realize+0.001
 prod+0.001
tip+0.001
aults+0.001
Neg logit contributions
 grandma-0.005
 old-0.005
 shop-0.005
 Anna-0.005
 restaurant-0.005
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
 realize+0.556
aults+0.544
 prod+0.539
pieces+0.534
 ease+0.527
Neg logit contributions
 restaurant-0.336
 refrigerator-0.332
 grandma-0.331
 manager-0.327
 chamber-0.317
Token The
Token ID383
Feature activation+5.93e-01

Pos logit contributions
 manager+0.005
 old+0.005
 restaurant+0.005
 chamber+0.005
man+0.005
Neg logit contributions
pieces-0.006
 realize-0.006
 prod-0.006
 ease-0.006
 realise-0.006
Token two
Token ID734
Feature activation+1.21e+00

Pos logit contributions
 realize+0.036
pieces+0.036
 prod+0.036
aults+0.035
tip+0.035
Neg logit contributions
 refrigerator-0.025
 grandma-0.025
 manager-0.024
 restaurant-0.024
 about-0.024
Token friends
Token ID2460
Feature activation+2.97e+00

Pos logit contributions
 prod+0.077
 realize+0.077
pieces+0.073
aults+0.073
tip+0.073
Neg logit contributions
 grandma-0.053
 old-0.051
 shop-0.050
 manager-0.050
 restaurant-0.050
Token had
Token ID550
Feature activation+6.77e-01

Pos logit contributions
 realize+0.164
 prod+0.164
pieces+0.160
aults+0.157
tip+0.154
Neg logit contributions
 grandma-0.151
 shop-0.146
 manager-0.146
 restaurant-0.145
 old-0.145
Token eyes
Token ID2951
Feature activation+1.27e-01

Pos logit contributions
 realize+0.019
 prod+0.019
pieces+0.019
aults+0.018
bs+0.018
Neg logit contributions
 manager-0.013
 Cream-0.012
 restaurant-0.012
man-0.012
 Hero-0.012
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
 realize+0.009
aults+0.009
 prod+0.009
pieces+0.009
 ease+0.009
Neg logit contributions
 grandma-0.004
 restaurant-0.004
 manager-0.004
 refrigerator-0.004
 chamber-0.004
Token swayed
Token ID47632
Feature activation+4.88e-02

Pos logit contributions
 realize+0.001
 prod+0.001
pieces+0.001
aults+0.001
tip+0.001
Neg logit contributions
 grandma-0.001
 restaurant-0.001
 manager-0.001
 refrigerator-0.001
 old-0.001
Token back
Token ID736
Feature activation+1.15e+00

Pos logit contributions
 realize+0.003
 prod+0.003
aults+0.003
pieces+0.003
tip+0.003
Neg logit contributions
 restaurant-0.002
 grandma-0.002
 manager-0.002
 refrigerator-0.002
 shop-0.002
Token and
Token ID290
Feature activation+1.77e-01

Pos logit contributions
 realize+0.078
 prod+0.077
pieces+0.076
aults+0.075
tip+0.073
Neg logit contributions
 grandma-0.043
 manager-0.043
 restaurant-0.043
 refrigerator-0.043
 shop-0.041
Token forth
Token ID6071
Feature activation+8.34e+00

Pos logit contributions
 realize+0.004
pieces+0.004
 prod+0.004
aults+0.004
tip+0.003
Neg logit contributions
 grandma-0.015
 old-0.015
 shop-0.014
 Grand-0.014
 manager-0.014
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
 realize+0.574
 prod+0.564
aults+0.560
pieces+0.555
tip+0.536
Neg logit contributions
 restaurant-0.313
 grandma-0.311
 refrigerator-0.309
 manager-0.304
 chamber-0.294
Token
Token ID198
Feature activation+5.50e-01

Pos logit contributions
 manager+0.005
 restaurant+0.005
 old+0.005
 refrigerator+0.005
 chamber+0.005
Neg logit contributions
 realize-0.007
pieces-0.007
 prod-0.007
 ease-0.006
tip-0.006
Token
Token ID198
Feature activation+4.99e-01

Pos logit contributions
 realize+0.038
 prod+0.037
pieces+0.037
aults+0.036
tip+0.035
Neg logit contributions
 manager-0.021
 grandma-0.021
 refrigerator-0.020
 restaurant-0.020
 old-0.020
TokenAfter
Token ID3260
Feature activation+3.36e+00

Pos logit contributions
 prod+0.023
 realize+0.023
 unnecessary+0.021
pieces+0.021
vals+0.021
Neg logit contributions
 grandma-0.031
 shop-0.030
 restaurant-0.030
 Anna-0.029
 manager-0.029
Token the
Token ID262
Feature activation+2.31e-01

Pos logit contributions
 realize+0.233
 prod+0.231
pieces+0.227
aults+0.225
tip+0.220
Neg logit contributions
 grandma-0.130
 restaurant-0.124
 old-0.122
 manager-0.122
 shop-0.122
Token started
Token ID2067
Feature activation+2.54e+00

Pos logit contributions
 manager+0.020
 grandma+0.020
 refrigerator+0.020
eddy+0.020
 Cream+0.019
Neg logit contributions
 realize-0.020
 prod-0.019
 realise-0.018
pieces-0.017
 tumble-0.017
Token to
Token ID284
Feature activation+3.03e-01

Pos logit contributions
 realize+0.196
 prod+0.193
pieces+0.191
aults+0.189
tip+0.186
Neg logit contributions
 grandma-0.076
 restaurant-0.072
 manager-0.072
 refrigerator-0.071
 old-0.071
Token swing
Token ID9628
Feature activation+3.22e+00

Pos logit contributions
pieces+0.014
vals+0.013
tip+0.013
plets+0.013
aults+0.012
Neg logit contributions
 old-0.019
 restaurant-0.019
 chamber-0.019
 shop-0.019
 refrigerator-0.019
Token back
Token ID736
Feature activation+1.11e+00

Pos logit contributions
 realize+0.195
 prod+0.187
 ease+0.186
aults+0.185
hing+0.183
Neg logit contributions
 restaurant-0.145
 grandma-0.144
 refrigerator-0.143
 chamber-0.140
 manager-0.140
Token and
Token ID290
Feature activation+2.60e-02

Pos logit contributions
 realize+0.070
 prod+0.069
pieces+0.067
aults+0.066
 realise+0.065
Neg logit contributions
 grandma-0.048
 manager-0.047
 restaurant-0.047
 refrigerator-0.046
 shop-0.045
Token forth
Token ID6071
Feature activation+8.33e+00

Pos logit contributions
pieces+0.001
vals+0.001
aults+0.001
 realize+0.001
tip+0.001
Neg logit contributions
 grandma-0.002
 old-0.002
 shop-0.002
 Anna-0.002
 restaurant-0.002
Token,
Token ID11
Feature activation+3.63e-01

Pos logit contributions
 realize+0.575
 prod+0.563
aults+0.553
pieces+0.547
 ease+0.536
Neg logit contributions
 grandma-0.314
 restaurant-0.310
 refrigerator-0.307
 manager-0.301
 chamber-0.297
Token higher
Token ID2440
Feature activation+3.18e+00

Pos logit contributions
 realize+0.021
 prod+0.021
pieces+0.021
aults+0.020
tip+0.020
Neg logit contributions
 grandma-0.017
 restaurant-0.017
 manager-0.017
 refrigerator-0.017
 old-0.017
Token and
Token ID290
Feature activation+1.41e-01

Pos logit contributions
 realize+0.240
 prod+0.237
pieces+0.233
aults+0.231
tip+0.226
Neg logit contributions
 grandma-0.101
 manager-0.096
 restaurant-0.096
 refrigerator-0.095
 old-0.093
Token higher
Token ID2440
Feature activation+3.17e+00

Pos logit contributions
 realize+0.004
pieces+0.004
 prod+0.004
aults+0.004
tip+0.004
Neg logit contributions
 old-0.011
 grandma-0.011
 manager-0.010
 shop-0.010
keeper-0.010
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
 realize+0.227
 prod+0.224
pieces+0.222
aults+0.218
tip+0.215
Neg logit contributions
 grandma-0.112
 restaurant-0.107
 old-0.107
 manager-0.107
 refrigerator-0.106
TokenThe
Token ID464
Feature activation+1.06e+00

Pos logit contributions
 realize+0.025
 prod+0.025
 unnecessary+0.024
pieces+0.023
 forth+0.023
Neg logit contributions
 grandma-0.033
 shop-0.032
 restaurant-0.031
 Anna-0.031
 manager-0.031
Token ground
Token ID2323
Feature activation+1.60e+00

Pos logit contributions
 realize+0.066
 prod+0.065
pieces+0.064
aults+0.062
tip+0.062
Neg logit contributions
 grandma-0.045
 restaurant-0.045
 manager-0.044
 refrigerator-0.044
 Cream-0.043
Token rocked
Token ID36872
Feature activation+8.38e-01

Pos logit contributions
 realize+0.098
 prod+0.098
 tumble+0.093
 forth+0.092
 realise+0.091
Neg logit contributions
 grandma-0.073
 restaurant-0.069
 refrigerator-0.069
 Cream-0.068
 novels-0.067
Token back
Token ID736
Feature activation+1.16e+00

Pos logit contributions
 realize+0.048
 prod+0.047
 forth+0.046
aults+0.046
hing+0.045
Neg logit contributions
 restaurant-0.042
 grandma-0.039
 refrigerator-0.039
 manager-0.039
 chamber-0.038
Token and
Token ID290
Feature activation+2.77e-01

Pos logit contributions
 realize+0.076
 prod+0.075
 forth+0.073
pieces+0.073
aults+0.072
Neg logit contributions
 manager-0.045
 restaurant-0.045
 grandma-0.044
 Cream-0.043
 refrigerator-0.043
Token forth
Token ID6071
Feature activation+8.31e+00

Pos logit contributions
 realize+0.010
 prod+0.010
pieces+0.009
aults+0.009
tip+0.009
Neg logit contributions
 grandma-0.019
 restaurant-0.019
 refrigerator-0.019
 manager-0.019
 old-0.019
Token.
Token ID13
Feature activation-7.83e-02

Pos logit contributions
 realize+0.572
 prod+0.558
aults+0.552
pieces+0.544
 ease+0.532
Neg logit contributions
 restaurant-0.314
 grandma-0.313
 refrigerator-0.310
 manager-0.303
 chamber-0.299
Token It
Token ID632
Feature activation+9.99e-01

Pos logit contributions
man+0.004
 restaurant+0.004
 manager+0.004
 novel+0.004
 chamber+0.004
Neg logit contributions
 realize-0.004
 prod-0.004
pieces-0.004
 ease-0.004
 realise-0.004
Token was
Token ID373
Feature activation-7.47e-01

Pos logit contributions
 realize+0.060
 prod+0.059
aults+0.056
pieces+0.056
 realise+0.055
Neg logit contributions
 grandma-0.048
 refrigerator-0.048
 restaurant-0.047
 manager-0.046
 chamber-0.045
Token exciting
Token ID7895
Feature activation-2.13e-01

Pos logit contributions
 manager+0.035
 restaurant+0.035
 grandma+0.035
 refrigerator+0.034
keeper+0.034
Neg logit contributions
 realize-0.045
 prod-0.043
 ease-0.042
aults-0.042
 realise-0.041
Token,
Token ID11
Feature activation+4.77e-01

Pos logit contributions
 manager+0.011
omm+0.011
 waitress+0.011
 Cream+0.011
 grandma+0.011
Neg logit contributions
 realize-0.011
 forth-0.011
 prod-0.011
pieces-0.011
aults-0.010
Token and
Token ID290
Feature activation+1.39e-01

Pos logit contributions
 realize+0.233
aults+0.230
pieces+0.228
 prod+0.228
 tumble+0.222
Neg logit contributions
 restaurant-0.129
 grandma-0.122
 chamber-0.120
 refrigerator-0.120
 manager-0.115
Token pushed
Token ID7121
Feature activation+1.72e+00

Pos logit contributions
 realize+0.008
pieces+0.008
 unnecessary+0.007
 prod+0.007
aults+0.007
Neg logit contributions
 grandma-0.008
 Anna-0.007
 shop-0.007
 old-0.007
 Grand-0.007
Token Lucy
Token ID22162
Feature activation-8.49e-01

Pos logit contributions
 realize+0.120
 prod+0.120
pieces+0.117
vals+0.115
aults+0.115
Neg logit contributions
 grandma-0.067
 old-0.063
 shop-0.062
 Anna-0.061
 Grand-0.059
Token back
Token ID736
Feature activation+1.38e+00

Pos logit contributions
 grandma+0.037
 shop+0.036
 old+0.036
keeper+0.035
 manager+0.035
Neg logit contributions
 realize-0.054
 prod-0.053
pieces-0.052
vals-0.051
tip-0.050
Token and
Token ID290
Feature activation+5.71e-01

Pos logit contributions
 realize+0.104
 prod+0.103
pieces+0.102
aults+0.100
tip+0.099
Neg logit contributions
 grandma-0.043
 restaurant-0.041
 manager-0.041
 refrigerator-0.041
 old-0.040
Token forth
Token ID6071
Feature activation+8.28e+00

Pos logit contributions
 realize+0.022
 prod+0.021
pieces+0.021
aults+0.020
tip+0.019
Neg logit contributions
 grandma-0.040
 manager-0.039
 restaurant-0.039
 refrigerator-0.038
 old-0.038
Token.
Token ID13
Feature activation+1.93e-01

Pos logit contributions
 realize+0.565
 prod+0.553
aults+0.550
pieces+0.545
 ease+0.528
Neg logit contributions
 restaurant-0.317
 refrigerator-0.310
 grandma-0.308
 chamber-0.301
 manager-0.299
Token It
Token ID632
Feature activation+1.16e+00

Pos logit contributions
 realize+0.011
 prod+0.011
pieces+0.011
 ease+0.011
apy+0.010
Neg logit contributions
 restaurant-0.009
 manager-0.009
 old-0.009
man-0.009
 novel-0.009
Token felt
Token ID2936
Feature activation+1.37e+00

Pos logit contributions
 realize+0.093
 prod+0.092
pieces+0.090
aults+0.089
tip+0.088
Neg logit contributions
 grandma-0.032
 restaurant-0.031
 refrigerator-0.031
 manager-0.030
 old-0.029
Token like
Token ID588
Feature activation+4.85e-01

Pos logit contributions
 prod+0.084
pieces+0.084
 realize+0.083
aults+0.081
tip+0.079
Neg logit contributions
 old-0.063
 grandma-0.062
 manager-0.060
 restaurant-0.059
 shop-0.059
Token Lucy
Token ID22162
Feature activation-9.31e-01

Pos logit contributions
 realize+0.028
 ease+0.027
 prod+0.027
pieces+0.027
 tumble+0.026
Neg logit contributions
 refrigerator-0.023
 restaurant-0.023
 manager-0.023
man-0.023
 chamber-0.023
Token swing
Token ID9628
Feature activation+3.35e+00

Pos logit contributions
 realize+0.022
pieces+0.022
 prod+0.022
 sudden+0.021
tip+0.021
Neg logit contributions
 about-0.015
 grandma-0.015
 manager-0.015
 Cream-0.015
 refrigerator-0.014
Token would
Token ID561
Feature activation-1.83e+00

Pos logit contributions
 realize+0.225
pieces+0.214
 prod+0.211
 tumble+0.210
hing+0.209
Neg logit contributions
 refrigerator-0.136
 grandma-0.135
 restaurant-0.135
 chamber-0.129
 manager-0.126
Token go
Token ID467
Feature activation+1.15e+00

Pos logit contributions
 grandma+0.122
 old+0.121
 shop+0.118
 manager+0.117
 restaurant+0.117
Neg logit contributions
pieces-0.072
 prod-0.072
 realize-0.071
aults-0.069
tip-0.068
Token back
Token ID736
Feature activation+1.06e+00

Pos logit contributions
 realize+0.064
 prod+0.062
pieces+0.062
aults+0.060
tip+0.059
Neg logit contributions
 grandma-0.059
 refrigerator-0.058
 restaurant-0.058
 manager-0.057
 chamber-0.056
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
 realize+0.067
aults+0.064
 forth+0.064
pieces+0.064
 prod+0.064
Neg logit contributions
 manager-0.043
omm-0.042
 refrigerator-0.041
 waiter-0.041
 restaurant-0.041
Token forth
Token ID6071
Feature activation+8.27e+00

Pos logit contributions
 grandma+0.001
 shop+0.001
 old+0.001
 restaurant+0.001
 manager+0.001
Neg logit contributions
 realize-0.000
 prod-0.000
pieces-0.000
aults-0.000
tip-0.000
Token in
Token ID287
Feature activation-6.36e-01

Pos logit contributions
 realize+0.529
aults+0.518
 prod+0.507
pieces+0.504
 ease+0.499
Neg logit contributions
 restaurant-0.352
 refrigerator-0.349
 manager-0.339
 chamber-0.332
 grandma-0.330
Token a
Token ID257
Feature activation+2.34e-01

Pos logit contributions
 manager+0.029
 restaurant+0.028
keeper+0.027
 grandma+0.027
 refrigerator+0.027
Neg logit contributions
 realize-0.040
 prod-0.039
pieces-0.038
 ease-0.038
aults-0.038
Token fun
Token ID1257
Feature activation+7.77e-01

Pos logit contributions
 prod+0.014
 realize+0.014
vals+0.014
tip+0.014
pieces+0.014
Neg logit contributions
 shop-0.011
 old-0.011
 restaurant-0.011
 grandma-0.011
 chamber-0.010
Token motion
Token ID6268
Feature activation+1.06e+00

Pos logit contributions
 realize+0.051
 prod+0.050
pieces+0.050
aults+0.049
tip+0.047
Neg logit contributions
 grandma-0.032
 manager-0.031
 refrigerator-0.031
 restaurant-0.031
 old-0.030
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
 realize+0.076
pieces+0.073
 prod+0.073
aults+0.072
 realise+0.071
Neg logit contributions
 grandma-0.038
 refrigerator-0.037
 restaurant-0.036
 Cream-0.035
 manager-0.035
Token He
Token ID679
Feature activation-3.93e-01

Pos logit contributions
pieces+0.103
 realize+0.100
 prod+0.098
 realise+0.094
 ease+0.094
Neg logit contributions
 restaurant-0.079
 manager-0.078
 old-0.077
 refrigerator-0.076
 grandma-0.074
Token shook
Token ID14682
Feature activation-3.63e-03

Pos logit contributions
 grandma+0.020
 refrigerator+0.020
 manager+0.020
eddy+0.020
 Cream+0.019
Neg logit contributions
 realize-0.022
 prod-0.022
 realise-0.021
pieces-0.020
 ease-0.020
Token it
Token ID340
Feature activation+1.31e-01

Pos logit contributions
 manager+0.000
 restaurant+0.000
 grandma+0.000
 refrigerator+0.000
 chamber+0.000
Neg logit contributions
 realize-0.000
 prod-0.000
 ease-0.000
 realise-0.000
pieces-0.000
Token back
Token ID736
Feature activation+1.09e+00

Pos logit contributions
 realize+0.009
 prod+0.009
aults+0.009
pieces+0.009
 realise+0.009
Neg logit contributions
 grandma-0.005
 restaurant-0.005
 manager-0.005
 refrigerator-0.004
 hotel-0.004
Token and
Token ID290
Feature activation-1.67e-03

Pos logit contributions
 realize+0.073
 prod+0.071
aults+0.069
pieces+0.068
 realise+0.068
Neg logit contributions
 restaurant-0.044
 manager-0.043
 grandma-0.041
 refrigerator-0.041
 Cream-0.040
Token forth
Token ID6071
Feature activation+8.25e+00

Pos logit contributions
 grandma+0.000
 refrigerator+0.000
 restaurant+0.000
 manager+0.000
 chamber+0.000
Neg logit contributions
 realize-0.000
 prod-0.000
pieces-0.000
 realise-0.000
aults-0.000
Token and
Token ID290
Feature activation+1.17e-01

Pos logit contributions
 realize+0.564
 prod+0.558
aults+0.545
pieces+0.542
 ease+0.527
Neg logit contributions
 restaurant-0.316
 grandma-0.315
 refrigerator-0.309
 manager-0.300
 chamber-0.298
Token watched
Token ID7342
Feature activation-2.36e-02

Pos logit contributions
 realize+0.006
 prod+0.006
pieces+0.006
aults+0.006
tip+0.006
Neg logit contributions
 grandma-0.006
 restaurant-0.006
 manager-0.006
 refrigerator-0.006
 old-0.006
Token as
Token ID355
Feature activation+9.58e-01

Pos logit contributions
 restaurant+0.001
 refrigerator+0.001
 manager+0.001
 grandma+0.001
 shop+0.001
Neg logit contributions
 realize-0.002
 prod-0.001
pieces-0.001
aults-0.001
 ease-0.001
Token the
Token ID262
Feature activation+2.54e-01

Pos logit contributions
 realize+0.057
 ease+0.054
 prod+0.053
 tumble+0.052
 realise+0.052
Neg logit contributions
 restaurant-0.047
 refrigerator-0.047
 Cream-0.046
 Shop-0.045
omm-0.044
Token birds
Token ID10087
Feature activation+8.89e-01

Pos logit contributions
 prod+0.015
 realize+0.015
pieces+0.015
tip+0.014
aults+0.014
Neg logit contributions
 grandma-0.012
 restaurant-0.011
 manager-0.011
 refrigerator-0.011
 about-0.011
Token passed
Token ID3804
Feature activation-1.87e+00

Pos logit contributions
 prod+0.029
pieces+0.029
 unnecessary+0.029
 realize+0.028
vals+0.027
Neg logit contributions
 grandma-0.027
 shop-0.026
 old-0.025
 restaurant-0.025
 manager-0.025
Token the
Token ID262
Feature activation+2.62e-01

Pos logit contributions
 refrigerator+0.089
 restaurant+0.087
 manager+0.087
 grandma+0.087
 chamber+0.085
Neg logit contributions
 realize-0.109
pieces-0.106
aults-0.105
 prod-0.105
 ease-0.102
Token ball
Token ID2613
Feature activation+1.52e+00

Pos logit contributions
 realize+0.019
 prod+0.019
aults+0.019
pieces+0.019
tip+0.019
Neg logit contributions
 about-0.008
 grandma-0.007
omm-0.007
 refrigerator-0.007
 manager-0.007
Token back
Token ID736
Feature activation+1.15e+00

Pos logit contributions
 realize+0.100
 prod+0.098
pieces+0.097
aults+0.096
 ease+0.094
Neg logit contributions
 grandma-0.060
 restaurant-0.058
 refrigerator-0.057
 manager-0.056
 Cream-0.056
Token and
Token ID290
Feature activation+2.01e-01

Pos logit contributions
 realize+0.071
 prod+0.068
 ease+0.068
 forth+0.067
pieces+0.067
Neg logit contributions
 manager-0.049
 restaurant-0.048
omm-0.048
 refrigerator-0.047
 dentist-0.047
Token forth
Token ID6071
Feature activation+8.21e+00

Pos logit contributions
 realize+0.010
 prod+0.010
pieces+0.010
aults+0.010
tip+0.010
Neg logit contributions
 grandma-0.011
 restaurant-0.011
 manager-0.011
 old-0.011
 shop-0.011
Token,
Token ID11
Feature activation+3.82e-01

Pos logit contributions
 realize+0.557
 prod+0.549
aults+0.545
pieces+0.543
tip+0.522
Neg logit contributions
 refrigerator-0.317
 restaurant-0.315
 grandma-0.309
 manager-0.305
 chamber-0.301
Token laughing
Token ID14376
Feature activation+2.54e+00

Pos logit contributions
 realize+0.024
 prod+0.023
pieces+0.023
aults+0.023
tip+0.022
Neg logit contributions
 grandma-0.017
 restaurant-0.017
 manager-0.017
 old-0.017
 refrigerator-0.017
Token and
Token ID290
Feature activation+1.88e-01

Pos logit contributions
 realize+0.168
aults+0.167
pieces+0.162
 prod+0.159
ault+0.155
Neg logit contributions
 refrigerator-0.104
 manager-0.101
 restaurant-0.100
omm-0.100
 dentist-0.099
Token having
Token ID1719
Feature activation+2.72e+00

Pos logit contributions
 realize+0.010
pieces+0.010
 prod+0.010
aults+0.009
tip+0.009
Neg logit contributions
 grandma-0.010
 old-0.010
 shop-0.010
 is-0.010
 Anna-0.010
Token fun
Token ID1257
Feature activation+8.70e-01

Pos logit contributions
 realize+0.146
pieces+0.142
 prod+0.141
aults+0.139
 ease+0.136
Neg logit contributions
 grandma-0.145
 manager-0.144
 refrigerator-0.143
 restaurant-0.142
keeper-0.140
Token vine
Token ID17793
Feature activation+4.16e+00

Pos logit contributions
 realize+0.000
 prod+0.000
 realise+0.000
aults+0.000
pieces+0.000
Neg logit contributions
 old-0.000
 grandma-0.000
 restaurant-0.000
 refrigerator-0.000
 shop-0.000
Token,
Token ID11
Feature activation+1.97e-01

Pos logit contributions
 realize+0.291
 prod+0.287
pieces+0.286
aults+0.281
 realise+0.271
Neg logit contributions
 grandma-0.152
 manager-0.146
 restaurant-0.143
 refrigerator-0.142
 old-0.142
Token swinging
Token ID25635
Feature activation+4.23e+00

Pos logit contributions
 realize+0.013
 prod+0.013
pieces+0.013
aults+0.012
tip+0.012
Neg logit contributions
 grandma-0.008
 restaurant-0.008
 manager-0.008
 refrigerator-0.008
 old-0.008
Token back
Token ID736
Feature activation+1.12e+00

Pos logit contributions
 realize+0.319
 prod+0.315
pieces+0.311
aults+0.307
tip+0.302
Neg logit contributions
 grandma-0.134
 restaurant-0.128
 manager-0.128
 refrigerator-0.126
 old-0.125
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
 realize+0.081
 prod+0.080
pieces+0.079
aults+0.078
tip+0.076
Neg logit contributions
 grandma-0.039
 manager-0.038
 restaurant-0.037
 refrigerator-0.036
 old-0.036
Token forth
Token ID6071
Feature activation+8.20e+00

Pos logit contributions
 grandma+0.011
 old+0.011
 shop+0.011
 restaurant+0.011
 Anna+0.011
Neg logit contributions
 realize-0.002
pieces-0.002
tip-0.002
aults-0.002
 prod-0.002
Token.
Token ID13
Feature activation-1.70e-01

Pos logit contributions
 realize+0.553
aults+0.544
 prod+0.542
pieces+0.534
 ease+0.519
Neg logit contributions
 restaurant-0.312
 grandma-0.312
 manager-0.308
 refrigerator-0.307
 chamber-0.297
Token They
Token ID1119
Feature activation+2.91e-01

Pos logit contributions
 manager+0.007
 restaurant+0.007
 old+0.007
 refrigerator+0.007
 chamber+0.007
Neg logit contributions
 realize-0.011
 prod-0.011
pieces-0.010
 ease-0.010
 realise-0.010
Token were
Token ID547
Feature activation+3.64e-01

Pos logit contributions
 prod+0.017
pieces+0.017
 realize+0.017
 unnecessary+0.017
tip+0.017
Neg logit contributions
 grandma-0.014
 shop-0.013
 restaurant-0.013
 manager-0.013
 old-0.013
Token very
Token ID845
Feature activation+9.89e-01

Pos logit contributions
 prod+0.023
 realize+0.022
pieces+0.022
tip+0.021
aults+0.021
Neg logit contributions
 grandma-0.017
 old-0.017
 restaurant-0.016
 about-0.016
 manager-0.016
Token happy
Token ID3772
Feature activation+1.55e+00

Pos logit contributions
pieces+0.061
 prod+0.061
 realize+0.059
hing+0.058
vals+0.058
Neg logit contributions
 old-0.049
 grandma-0.046
 restaurant-0.043
 manager-0.043
 Anna-0.043
Token kicked
Token ID12165
Feature activation+2.12e+00

Pos logit contributions
 realize+0.011
 prod+0.011
pieces+0.010
aults+0.010
tip+0.010
Neg logit contributions
 grandma-0.010
 refrigerator-0.010
 manager-0.010
 restaurant-0.010
 old-0.010
Token the
Token ID262
Feature activation+1.92e-02

Pos logit contributions
 realize+0.152
 prod+0.150
pieces+0.148
aults+0.146
tip+0.143
Neg logit contributions
 grandma-0.075
 manager-0.073
 restaurant-0.072
 refrigerator-0.072
 old-0.071
Token ball
Token ID2613
Feature activation+1.49e+00

Pos logit contributions
 realize+0.001
 prod+0.001
pieces+0.001
aults+0.001
tip+0.001
Neg logit contributions
 grandma-0.001
 restaurant-0.001
 old-0.001
 manager-0.001
 refrigerator-0.001
Token back
Token ID736
Feature activation+1.12e+00

Pos logit contributions
 realize+0.118
 prod+0.117
pieces+0.116
aults+0.114
tip+0.112
Neg logit contributions
 grandma-0.041
 old-0.040
 manager-0.039
 shop-0.039
 refrigerator-0.039
Token and
Token ID290
Feature activation-2.20e-01

Pos logit contributions
 realize+0.084
 prod+0.083
pieces+0.081
aults+0.081
 forth+0.080
Neg logit contributions
 grandma-0.034
 manager-0.034
 restaurant-0.034
 refrigerator-0.032
keeper-0.031
Token forth
Token ID6071
Feature activation+8.17e+00

Pos logit contributions
 grandma+0.020
 shop+0.019
 old+0.019
 Anna+0.019
 manager+0.019
Neg logit contributions
 realize-0.004
pieces-0.004
 prod-0.004
tip-0.003
vals-0.003
Token and
Token ID290
Feature activation-1.16e-01

Pos logit contributions
 realize+0.606
 prod+0.597
pieces+0.588
aults+0.585
tip+0.571
Neg logit contributions
 grandma-0.267
 restaurant-0.261
 refrigerator-0.259
 manager-0.256
 chamber-0.249
Token had
Token ID550
Feature activation+5.29e-01

Pos logit contributions
 grandma+0.007
 restaurant+0.007
 manager+0.007
 old+0.007
 shop+0.007
Neg logit contributions
 realize-0.005
pieces-0.005
 prod-0.005
tip-0.005
aults-0.005
Token a
Token ID257
Feature activation+3.05e-01

Pos logit contributions
 realize+0.030
 prod+0.029
pieces+0.029
aults+0.028
tip+0.027
Neg logit contributions
 grandma-0.027
 manager-0.026
 refrigerator-0.026
 restaurant-0.026
 shop-0.026
Token lot
Token ID1256
Feature activation+3.71e+00

Pos logit contributions
 prod+0.018
pieces+0.018
vals+0.018
cking+0.018
tip+0.018
Neg logit contributions
 grandma-0.015
 restaurant-0.015
 old-0.014
 shop-0.014
 chamber-0.014
Token of
Token ID286
Feature activation-4.30e-01

Pos logit contributions
 realize+0.235
 prod+0.231
pieces+0.228
aults+0.224
tip+0.220
Neg logit contributions
 grandma-0.164
 restaurant-0.158
 refrigerator-0.158
 manager-0.157
 old-0.155
Token shake
Token ID13279
Feature activation+4.85e-01

Pos logit contributions
pieces+0.011
 unnecessary+0.011
vals+0.011
plets+0.010
aults+0.010
Neg logit contributions
 shop-0.015
 old-0.015
 restaurant-0.015
 bar-0.015
 grandma-0.015
Token his
Token ID465
Feature activation+8.22e-02

Pos logit contributions
 realize+0.031
 prod+0.030
aults+0.030
pieces+0.029
 tumble+0.029
Neg logit contributions
 manager-0.021
 restaurant-0.020
 grandma-0.020
 refrigerator-0.020
 chamber-0.019
Token head
Token ID1182
Feature activation+2.97e-01

Pos logit contributions
 prod+0.004
 realize+0.004
aults+0.004
pieces+0.004
tip+0.004
Neg logit contributions
 restaurant-0.004
 manager-0.004
 grandma-0.004
 refrigerator-0.004
 shop-0.004
Token back
Token ID736
Feature activation+9.66e-01

Pos logit contributions
 realize+0.023
 prod+0.023
pieces+0.022
aults+0.022
tip+0.022
Neg logit contributions
 grandma-0.009
 restaurant-0.008
 manager-0.008
 refrigerator-0.008
 old-0.008
Token and
Token ID290
Feature activation+1.91e-02

Pos logit contributions
 realize+0.078
 prod+0.077
pieces+0.077
aults+0.075
tip+0.074
Neg logit contributions
 grandma-0.025
 restaurant-0.024
 old-0.024
 manager-0.024
 refrigerator-0.024
Token forth
Token ID6071
Feature activation+8.14e+00

Pos logit contributions
pieces+0.001
 realize+0.001
aults+0.001
tip+0.001
vals+0.001
Neg logit contributions
 grandma-0.001
 old-0.001
 shop-0.001
 manager-0.001
keeper-0.001
Token.
Token ID13
Feature activation-2.45e-01

Pos logit contributions
 realize+0.587
 prod+0.579
aults+0.570
pieces+0.559
 ease+0.551
Neg logit contributions
 grandma-0.280
 restaurant-0.276
 refrigerator-0.271
 manager-0.264
 chamber-0.260
Token "
Token ID366
Feature activation-6.92e-01

Pos logit contributions
 chamber+0.011
 restaurant+0.010
 refrigerator+0.010
 manager+0.010
 old+0.010
Neg logit contributions
 prod-0.015
 realize-0.015
pieces-0.015
Kit-0.015
aults-0.014
TokenNo
Token ID2949
Feature activation+1.67e+00

Pos logit contributions
 shop+0.032
 Anna+0.031
 Cream+0.030
 bar+0.030
 grandma+0.028
Neg logit contributions
 realize-0.042
pieces-0.041
 prod-0.039
oves-0.038
 unnecessary-0.038
Token,
Token ID11
Feature activation+1.85e-01

Pos logit contributions
 realize+0.104
 prod+0.101
pieces+0.098
 realise+0.096
 tumble+0.095
Neg logit contributions
 manager-0.075
keeper-0.074
 refrigerator-0.073
 grandma-0.072
 chamber-0.072
Token Kitty
Token ID21168
Feature activation+2.52e-01

Pos logit contributions
 realize+0.012
 prod+0.012
aults+0.011
pieces+0.011
ault+0.011
Neg logit contributions
 refrigerator-0.008
 manager-0.008
 restaurant-0.008
 chamber-0.008
 old-0.008
Token and
Token ID290
Feature activation-1.54e-01

Pos logit contributions
omm+0.004
 grandma+0.003
 manager+0.003
 refrigerator+0.003
 Cream+0.003
Neg logit contributions
 realize-0.005
pieces-0.005
aults-0.005
 forth-0.005
 prod-0.005
Token toss
Token ID12153
Feature activation+8.47e-01

Pos logit contributions
 grandma+0.010
 restaurant+0.009
 manager+0.009
 old+0.009
 refrigerator+0.009
Neg logit contributions
 realize-0.007
 prod-0.007
pieces-0.007
aults-0.006
tip-0.006
Token it
Token ID340
Feature activation-6.78e-02

Pos logit contributions
 realize+0.053
 prod+0.053
pieces+0.053
aults+0.052
tip+0.050
Neg logit contributions
 manager-0.037
 grandma-0.037
 refrigerator-0.037
 restaurant-0.036
 chamber-0.035
Token back
Token ID736
Feature activation+9.83e-01

Pos logit contributions
 grandma+0.003
 restaurant+0.003
 manager+0.003
 refrigerator+0.003
 old+0.003
Neg logit contributions
 realize-0.005
 prod-0.004
pieces-0.004
aults-0.004
tip-0.004
Token and
Token ID290
Feature activation-1.48e-01

Pos logit contributions
 prod+0.058
aults+0.057
pieces+0.057
 realize+0.057
 forth+0.056
Neg logit contributions
 manager-0.044
 grandma-0.044
 restaurant-0.043
 dentist-0.042
omm-0.041
Token forth
Token ID6071
Feature activation+8.14e+00

Pos logit contributions
 grandma+0.014
 shop+0.013
 old+0.013
 restaurant+0.013
 manager+0.013
Neg logit contributions
 realize-0.002
pieces-0.002
 prod-0.002
tip-0.002
aults-0.002
Token.
Token ID13
Feature activation-3.83e-01

Pos logit contributions
 realize+0.558
aults+0.554
 prod+0.552
pieces+0.547
ging+0.533
Neg logit contributions
 restaurant-0.299
 refrigerator-0.297
 grandma-0.295
 manager-0.289
 chamber-0.285
Token\n
Token ID198
Feature activation+3.49e-01

Pos logit contributions
 manager+0.017
 restaurant+0.017
 old+0.016
 novel+0.016
 chamber+0.016
Neg logit contributions
 realize-0.024
pieces-0.023
 prod-0.023
 realise-0.022
 ease-0.022
Token\n
Token ID198
Feature activation+3.15e-01

Pos logit contributions
 realize+0.025
 prod+0.024
pieces+0.024
aults+0.024
 realise+0.023
Neg logit contributions
 grandma-0.012
 refrigerator-0.012
 manager-0.012
 restaurant-0.012
 old-0.012
TokenL
Token ID43
Feature activation-2.50e-01

Pos logit contributions
 realize+0.017
 prod+0.016
 unnecessary+0.016
pieces+0.016
tip+0.016
Neg logit contributions
 shop-0.018
 grandma-0.017
 restaurant-0.017
 Anna-0.017
 manager-0.016
Tokenily
Token ID813
Feature activation-1.19e+00

Pos logit contributions
 refrigerator+0.012
 grandma+0.012
keeper+0.012
 Cream+0.012
 shop+0.012
Neg logit contributions
 realize-0.015
pieces-0.014
 realise-0.013
tip-0.013
 prod-0.013
Token for
Token ID329
Feature activation-9.81e-01

Pos logit contributions
 realize+0.170
pieces+0.166
 prod+0.164
aults+0.163
 ease+0.159
Neg logit contributions
 restaurant-0.145
 refrigerator-0.144
 manager-0.143
 grandma-0.141
 chamber-0.139
Token hours
Token ID2250
Feature activation+6.70e-01

Pos logit contributions
 grandma+0.035
 old+0.032
 shop+0.032
 Anna+0.031
 restaurant+0.031
Neg logit contributions
 realize-0.072
 prod-0.072
pieces-0.070
aults-0.069
tip-0.068
Token,
Token ID11
Feature activation+1.65e-01

Pos logit contributions
 realize+0.047
pieces+0.046
aults+0.046
 prod+0.046
 ease+0.044
Neg logit contributions
 refrigerator-0.024
 restaurant-0.024
 grandma-0.023
 manager-0.023
 chamber-0.023
Token back
Token ID736
Feature activation+1.09e+00

Pos logit contributions
 prod+0.010
 realize+0.010
pieces+0.010
tip+0.010
aults+0.010
Neg logit contributions
 grandma-0.008
 old-0.007
 shop-0.007
 restaurant-0.007
 about-0.007
Token and
Token ID290
Feature activation-6.63e-02

Pos logit contributions
 realize+0.075
 prod+0.072
pieces+0.071
aults+0.070
 realise+0.070
Neg logit contributions
 manager-0.041
 grandma-0.041
 restaurant-0.040
 refrigerator-0.040
keeper-0.039
Token forth
Token ID6071
Feature activation+8.12e+00

Pos logit contributions
 grandma+0.006
 old+0.005
 restaurant+0.005
 shop+0.005
 manager+0.005
Neg logit contributions
 realize-0.002
 prod-0.001
pieces-0.001
aults-0.001
tip-0.001
Token,
Token ID11
Feature activation+1.77e-01

Pos logit contributions
 realize+0.538
aults+0.524
 prod+0.522
pieces+0.516
 ease+0.501
Neg logit contributions
 restaurant-0.323
 refrigerator-0.319
 manager-0.315
 grandma-0.315
 chamber-0.307
Token feeling
Token ID4203
Feature activation-5.27e-01

Pos logit contributions
 realize+0.012
 prod+0.011
pieces+0.011
aults+0.011
tip+0.011
Neg logit contributions
 grandma-0.007
 restaurant-0.007
 manager-0.007
 refrigerator-0.007
 old-0.007
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
 grandma+0.024
 restaurant+0.023
 manager+0.023
 old+0.023
 refrigerator+0.023
Neg logit contributions
 realize-0.032
 prod-0.032
pieces-0.031
aults-0.031
tip-0.030
Token wind
Token ID2344
Feature activation+2.23e+00

Pos logit contributions
 prod+0.007
pieces+0.007
 realize+0.007
aults+0.007
 realise+0.006
Neg logit contributions
 old-0.006
 grandma-0.006
 shop-0.006
 refrigerator-0.006
 manager-0.006
Token on
Token ID319
Feature activation+9.45e-01

Pos logit contributions
 realize+0.145
 prod+0.143
pieces+0.141
aults+0.138
tip+0.136
Neg logit contributions
 grandma-0.094
 restaurant-0.091
 manager-0.091
 refrigerator-0.090
 old-0.090
Token with
Token ID351
Feature activation-1.22e+00

Pos logit contributions
 grandma+0.031
 manager+0.029
omm+0.028
 Cream+0.028
 refrigerator+0.028
Neg logit contributions
 realize-0.054
aults-0.053
 prod-0.052
cking-0.052
pieces-0.052
Token leaves
Token ID5667
Feature activation+2.70e+00

Pos logit contributions
 manager+0.055
 grandma+0.054
 restaurant+0.054
 refrigerator+0.052
 shop+0.052
Neg logit contributions
 realize-0.074
 prod-0.074
pieces-0.072
 ease-0.071
aults-0.071
Token shaking
Token ID17275
Feature activation+1.29e+00

Pos logit contributions
 realize+0.192
aults+0.188
 prod+0.184
cking+0.183
pieces+0.183
Neg logit contributions
 grandma-0.097
 manager-0.093
 restaurant-0.093
 chamber-0.088
 refrigerator-0.087
Token back
Token ID736
Feature activation+1.06e+00

Pos logit contributions
 prod+0.075
 realize+0.074
aults+0.073
pieces+0.073
 tumble+0.072
Neg logit contributions
 manager-0.061
 grandma-0.060
 restaurant-0.059
omm-0.058
keeper-0.058
Token and
Token ID290
Feature activation-1.92e-01

Pos logit contributions
 realize+0.069
 prod+0.067
pieces+0.066
aults+0.066
 realise+0.065
Neg logit contributions
 manager-0.043
 grandma-0.042
 restaurant-0.042
 refrigerator-0.040
 Cream-0.040
Token forth
Token ID6071
Feature activation+8.12e+00

Pos logit contributions
 grandma+0.011
 old+0.011
 restaurant+0.011
 refrigerator+0.011
 shop+0.011
Neg logit contributions
 realize-0.009
pieces-0.009
 prod-0.009
aults-0.009
tip-0.009
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
 realize+0.582
 prod+0.575
aults+0.574
pieces+0.561
cking+0.553
Neg logit contributions
 grandma-0.272
 restaurant-0.271
 refrigerator-0.264
 manager-0.261
 chamber-0.255
Token The
Token ID383
Feature activation+4.23e-01

Pos logit contributions
 novel+0.013
 manager+0.013
man+0.012
 restaurant+0.012
 old+0.012
Neg logit contributions
pieces-0.015
 realize-0.014
 prod-0.014
 sudden-0.014
 ease-0.013
Token kittens
Token ID45584
Feature activation+2.05e+00

Pos logit contributions
 realize+0.023
 prod+0.023
pieces+0.022
 sudden+0.022
 tumble+0.022
Neg logit contributions
 grandma-0.021
 manager-0.020
 Cream-0.020
 about-0.020
 Grand-0.020
Token were
Token ID547
Feature activation+3.30e-01

Pos logit contributions
 realize+0.120
 prod+0.118
pieces+0.116
aults+0.114
tip+0.112
Neg logit contributions
 grandma-0.100
 restaurant-0.097
 manager-0.097
 refrigerator-0.096
 old-0.096
Token mesmer
Token ID46814
Feature activation+1.27e+00

Pos logit contributions
 realize+0.020
 prod+0.019
pieces+0.019
aults+0.019
 realise+0.018
Neg logit contributions
 grandma-0.015
 restaurant-0.015
 manager-0.015
 refrigerator-0.015
 shop-0.015
Token.
Token ID13
Feature activation-2.29e-01

Pos logit contributions
pieces+0.114
 realize+0.114
aults+0.111
 notice+0.110
 tumble+0.108
Neg logit contributions
 Cream-0.078
omm-0.077
 grandma-0.074
 waitress-0.072
 restaurant-0.071
Token She
Token ID1375
Feature activation+5.21e-02

Pos logit contributions
 restaurant+0.011
 manager+0.011
man+0.011
 refrigerator+0.010
 novel+0.010
Neg logit contributions
 realize-0.014
 prod-0.013
pieces-0.013
 realise-0.013
bs-0.012
Token goes
Token ID2925
Feature activation+8.59e-01

Pos logit contributions
 realize+0.003
 realise+0.003
 prod+0.003
 tumble+0.003
 notice+0.003
Neg logit contributions
eddy-0.003
man-0.003
 Cream-0.002
 refrigerator-0.002
 manager-0.002
Token back
Token ID736
Feature activation+9.39e-01

Pos logit contributions
 realize+0.052
 prod+0.050
pieces+0.049
aults+0.048
 realise+0.048
Neg logit contributions
 grandma-0.040
 refrigerator-0.039
 restaurant-0.038
 Shop-0.037
 chamber-0.037
Token and
Token ID290
Feature activation-2.51e-01

Pos logit contributions
 realize+0.053
 notice+0.051
 realise+0.051
 prod+0.050
 forth+0.049
Neg logit contributions
omm-0.048
 Cream-0.044
 waiter-0.044
 refrigerator-0.043
 truth-0.043
Token forth
Token ID6071
Feature activation+8.10e+00

Pos logit contributions
 grandma+0.012
 refrigerator+0.012
 manager+0.012
 restaurant+0.012
 old+0.012
Neg logit contributions
 realize-0.014
 prod-0.014
pieces-0.014
 forth-0.013
 realise-0.013
Token and
Token ID290
Feature activation-1.67e-01

Pos logit contributions
 realize+0.524
 prod+0.505
aults+0.503
pieces+0.493
ging+0.483
Neg logit contributions
 restaurant-0.338
 refrigerator-0.333
 grandma-0.324
 Cream-0.321
 manager-0.319
Token makes
Token ID1838
Feature activation-3.54e-01

Pos logit contributions
 grandma+0.011
 restaurant+0.010
 manager+0.010
 refrigerator+0.010
 old+0.010
Neg logit contributions
 realize-0.007
 prod-0.007
pieces-0.007
aults-0.007
tip-0.007
Token many
Token ID867
Feature activation-1.76e-01

Pos logit contributions
 refrigerator+0.016
 restaurant+0.016
 manager+0.016
 shop+0.016
 grandma+0.016
Neg logit contributions
 realize-0.022
 prod-0.021
pieces-0.021
aults-0.021
 realise-0.020
Token lines
Token ID3951
Feature activation+3.01e+00

Pos logit contributions
 grandma+0.008
 restaurant+0.007
 manager+0.007
 refrigerator+0.007
 old+0.007
Neg logit contributions
 realize-0.011
 prod-0.011
pieces-0.011
aults-0.011
tip-0.011
Token.
Token ID13
Feature activation-3.79e-01

Pos logit contributions
 realize+0.221
 prod+0.218
pieces+0.215
aults+0.212
tip+0.208
Neg logit contributions
 grandma-0.101
 restaurant-0.096
 manager-0.096
 refrigerator-0.095
 old-0.094
Token Sara
Token ID24799
Feature activation-1.40e+00

Pos logit contributions
man+0.006
 manager+0.005
 chamber+0.005
keeper+0.005
yer+0.005
Neg logit contributions
 realize-0.009
 forget-0.008
 swat-0.008
 realise-0.008
 tumble-0.008
Token.
Token ID13
Feature activation-2.77e-01

Pos logit contributions
 manager+0.075
 restaurant+0.073
 hotel+0.073
keeper+0.069
 refrigerator+0.069
Neg logit contributions
 realize-0.069
aults-0.065
 forth-0.064
bs-0.063
ching-0.061
Token Good
Token ID4599
Feature activation-2.63e-01

Pos logit contributions
 manager+0.015
 restaurant+0.014
keeper+0.014
man+0.014
 old+0.014
Neg logit contributions
 realize-0.015
bs-0.014
tip-0.014
 prod-0.014
 tumble-0.013
Token night
Token ID1755
Feature activation+3.09e-01

Pos logit contributions
keeper+0.015
 manager+0.014
 Hero+0.014
 shop+0.013
 refrigerator+0.013
Neg logit contributions
 realize-0.014
 forth-0.013
 notice-0.012
 unnecessary-0.012
 realise-0.012
Token.
Token ID13
Feature activation-2.60e-01

Pos logit contributions
 realize+0.017
 notice+0.016
aults+0.016
 forth+0.016
 tumble+0.016
Neg logit contributions
 novels-0.015
 refrigerator-0.015
 novel-0.014
 manager-0.014
keeper-0.014
Token Sleep
Token ID17376
Feature activation-7.10e+00

Pos logit contributions
 manager+0.014
 restaurant+0.014
keeper+0.013
 old+0.013
 refrigerator+0.012
Neg logit contributions
 realize-0.014
tip-0.013
 prod-0.013
vals-0.013
 tumble-0.013
Token tight
Token ID5381
Feature activation-9.36e-01

Pos logit contributions
 dentist+0.365
 novels+0.359
smith+0.337
 address+0.332
 refrigerator+0.328
Neg logit contributions
 realize-0.383
aults-0.380
oves-0.362
bs-0.359
 free-0.356
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
 restaurant+0.041
 refrigerator+0.040
keeper+0.040
 manager+0.040
omm+0.039
Neg logit contributions
 realize-0.050
hing-0.050
 tumble-0.049
 glide-0.048
 forth-0.048
Token Don
Token ID2094
Feature activation+4.99e-01

Pos logit contributions
 manager+0.012
 restaurant+0.011
keeper+0.011
 old+0.011
 refrigerator+0.011
Neg logit contributions
 realize-0.013
 prod-0.013
vals-0.012
tip-0.012
 tumble-0.012
Token't
Token ID470
Feature activation-3.49e-01

Pos logit contributions
 realize+0.019
hing+0.018
 tumble+0.018
fters+0.018
pieces+0.018
Neg logit contributions
 refrigerator-0.033
 restaurant-0.033
keeper-0.032
 manager-0.032
 chamber-0.032
Token let
Token ID1309
Feature activation+1.05e+00

Pos logit contributions
 grandma+0.022
 restaurant+0.021
 old+0.021
 shop+0.021
 refrigerator+0.020
Neg logit contributions
pieces-0.016
 prod-0.016
 realize-0.015
tip-0.015
aults-0.015
Token "
Token ID366
Feature activation-7.95e-01

Pos logit contributions
 prod+0.016
 realize+0.015
pieces+0.015
 glide+0.014
tip+0.014
Neg logit contributions
 restaurant-0.008
 chamber-0.008
 hotel-0.007
 refrigerator-0.007
 grandma-0.007
TokenIt
Token ID1026
Feature activation+1.56e+00

Pos logit contributions
 grandma+0.043
 shop+0.042
 restaurant+0.042
 manager+0.042
 old+0.041
Neg logit contributions
 realize-0.042
 prod-0.041
pieces-0.041
aults-0.040
 realise-0.038
Token's
Token ID338
Feature activation+7.16e-01

Pos logit contributions
 realize+0.089
 prod+0.088
pieces+0.083
 realise+0.082
tip+0.080
Neg logit contributions
 grandma-0.078
 restaurant-0.077
 manager-0.077
 refrigerator-0.074
 hotel-0.071
Token easy
Token ID2562
Feature activation+1.65e+00

Pos logit contributions
 realize+0.037
 unnecessary+0.036
 prod+0.036
 ease+0.035
 realise+0.035
Neg logit contributions
 manager-0.040
keeper-0.039
ber-0.037
 restaurant-0.036
 shop-0.036
Token.
Token ID13
Feature activation+1.07e-01

Pos logit contributions
 realize+0.099
 realise+0.093
pieces+0.092
 unnecessary+0.092
 prod+0.092
Neg logit contributions
 manager-0.072
 grandma-0.070
keeper-0.070
omm-0.070
 Cream-0.069
Token Sleep
Token ID17376
Feature activation-7.09e+00

Pos logit contributions
 prod+0.006
 realize+0.006
pieces+0.006
tip+0.006
 realise+0.006
Neg logit contributions
 restaurant-0.005
 manager-0.005
 refrigerator-0.005
 novel-0.005
 chamber-0.005
Token is
Token ID318
Feature activation-2.18e+00

Pos logit contributions
 dentist+0.391
 novels+0.381
 grandma+0.359
 Grand+0.343
 shop+0.343
Neg logit contributions
 realize-0.399
aults-0.359
oves-0.358
 realise-0.355
ching-0.351
Token when
Token ID618
Feature activation+7.98e-01

Pos logit contributions
 manager+0.112
 restaurant+0.107
keeper+0.107
 refrigerator+0.106
 shop+0.105
Neg logit contributions
 realize-0.121
 prod-0.116
pieces-0.114
aults-0.113
 ease-0.112
Token you
Token ID345
Feature activation+2.23e-01

Pos logit contributions
 realize+0.037
 unnecessary+0.036
pieces+0.034
aults+0.034
 prod+0.034
Neg logit contributions
 manager-0.049
man-0.047
keeper-0.047
 refrigerator-0.046
omm-0.045
Token close
Token ID1969
Feature activation-3.54e-01

Pos logit contributions
 realize+0.009
 prod+0.009
aults+0.008
pieces+0.008
 realise+0.008
Neg logit contributions
 grandma-0.015
 manager-0.015
 refrigerator-0.015
 old-0.014
 restaurant-0.014
Token your
Token ID534
Feature activation+1.12e+00

Pos logit contributions
 manager+0.019
 restaurant+0.017
 refrigerator+0.017
man+0.017
keeper+0.017
Neg logit contributions
 realize-0.020
 prod-0.019
 ease-0.019
 unnecessary-0.019
pieces-0.019
Token,
Token ID11
Feature activation-6.94e-02

Pos logit contributions
 realize+0.021
 notice+0.020
aults+0.019
 forth+0.019
 realise+0.019
Neg logit contributions
 novel-0.018
 refrigerator-0.018
 novels-0.018
 chamber-0.017
 manager-0.017
Token Anna
Token ID11735
Feature activation-2.41e+00

Pos logit contributions
keeper+0.002
 manager+0.002
man+0.002
 refrigerator+0.002
 chamber+0.002
Neg logit contributions
 realize-0.005
aults-0.005
 forth-0.005
pieces-0.005
 prod-0.005
Token and
Token ID290
Feature activation-2.93e-01

Pos logit contributions
 restaurant+0.127
 refrigerator+0.125
 manager+0.125
omm+0.124
 hotel+0.123
Neg logit contributions
 realize-0.125
 forth-0.115
bs-0.111
 realise-0.106
ching-0.104
Token Ben
Token ID3932
Feature activation-5.94e-01

Pos logit contributions
 manager+0.011
 refrigerator+0.011
 restaurant+0.010
 grandma+0.010
 chamber+0.010
Neg logit contributions
 realize-0.020
 prod-0.020
aults-0.020
pieces-0.020
 forth-0.019
Token.
Token ID13
Feature activation-4.24e-01

Pos logit contributions
omm+0.031
 restaurant+0.030
 manager+0.030
 novels+0.029
 refrigerator+0.029
Neg logit contributions
 realize-0.034
 forth-0.030
 notice-0.029
 realise-0.029
 forget-0.029
Token Sleep
Token ID17376
Feature activation-7.07e+00

Pos logit contributions
 manager+0.021
 restaurant+0.021
keeper+0.020
 old+0.020
 novel+0.020
Neg logit contributions
 realize-0.024
 prod-0.023
tip-0.023
vals-0.023
 ease-0.022
Token well
Token ID880
Feature activation-3.26e+00

Pos logit contributions
 dentist+0.386
 novels+0.365
 refrigerator+0.361
smith+0.351
 Gor+0.347
Neg logit contributions
 realize-0.384
aults-0.373
bs-0.360
oves-0.357
 unnecessary-0.350
Token.
Token ID13
Feature activation-4.13e-01

Pos logit contributions
omm+0.165
 refrigerator+0.159
 dentist+0.155
 freezer+0.151
 restaurant+0.150
Neg logit contributions
bs-0.195
lest-0.182
aults-0.176
 realize-0.171
 forth-0.165
Token We
Token ID775
Feature activation+1.00e+00

Pos logit contributions
 manager+0.020
 restaurant+0.020
keeper+0.019
 refrigerator+0.019
 old+0.019
Neg logit contributions
 realize-0.024
 prod-0.023
vals-0.023
tip-0.023
 glide-0.022
Token love
Token ID1842
Feature activation+1.18e-01

Pos logit contributions
 prod+0.053
pieces+0.052
 realize+0.051
aults+0.050
cking+0.049
Neg logit contributions
 grandma-0.055
 shop-0.054
 restaurant-0.053
 manager-0.053
 old-0.053
Token you
Token ID345
Feature activation+2.57e-01

Pos logit contributions
 forth+0.004
 unnecessary+0.004
aults+0.004
pieces+0.004
tip+0.003
Neg logit contributions
 manager-0.009
 refrigerator-0.009
man-0.009
 freezer-0.008
keeper-0.008
Token You
Token ID921
Feature activation+7.77e-01

Pos logit contributions
 realize+0.005
pieces+0.005
 prod+0.005
tip+0.004
 unnecessary+0.004
Neg logit contributions
 restaurant-0.003
 manager-0.003
 refrigerator-0.003
 novel-0.003
man-0.003
Token need
Token ID761
Feature activation+5.41e-01

Pos logit contributions
 realize+0.047
 prod+0.045
pieces+0.044
aults+0.044
 realise+0.043
Neg logit contributions
 grandma-0.036
 restaurant-0.036
 manager-0.036
 refrigerator-0.036
 old-0.035
Token to
Token ID284
Feature activation+1.14e-01

Pos logit contributions
 realize+0.038
 prod+0.038
 ease+0.036
 unnecessary+0.036
tip+0.036
Neg logit contributions
 manager-0.019
 restaurant-0.019
keeper-0.018
 refrigerator-0.018
man-0.018
Token sleep
Token ID3993
Feature activation-1.82e+00

Pos logit contributions
pieces+0.006
 prod+0.005
 realize+0.005
tip+0.005
aults+0.005
Neg logit contributions
 grandma-0.007
 old-0.007
 shop-0.006
 restaurant-0.006
 chamber-0.006
Token.
Token ID13
Feature activation+8.30e-02

Pos logit contributions
keeper+0.099
 gorilla+0.087
 manager+0.087
omm+0.086
 refrigerator+0.085
Neg logit contributions
 realize-0.096
 free-0.093
 notice-0.092
aults-0.091
ging-0.089
Token Sleep
Token ID17376
Feature activation-7.03e+00

Pos logit contributions
 realize+0.005
pieces+0.005
 prod+0.005
tip+0.005
 unnecessary+0.005
Neg logit contributions
 restaurant-0.004
 manager-0.004
 refrigerator-0.003
man-0.003
 novel-0.003
Token is
Token ID318
Feature activation-2.20e+00

Pos logit contributions
 dentist+0.362
 novels+0.350
omm+0.348
 refrigerator+0.335
 manager+0.334
Neg logit contributions
 realize-0.417
bs-0.384
oves-0.377
 unnecessary-0.375
 notice-0.370
Token good
Token ID922
Feature activation+7.22e-01

Pos logit contributions
 manager+0.121
keeper+0.120
 restaurant+0.118
 shop+0.117
 refrigerator+0.117
Neg logit contributions
 realize-0.115
 unnecessary-0.107
 prod-0.107
aults-0.106
 realise-0.104
Token for
Token ID329
Feature activation-7.25e-01

Pos logit contributions
 realize+0.045
 forth+0.042
 realise+0.041
 notice+0.041
bs+0.040
Neg logit contributions
keeper-0.033
 manager-0.031
 refrigerator-0.030
 restaurant-0.030
 Cream-0.029
Token you
Token ID345
Feature activation+1.55e-01

Pos logit contributions
keeper+0.040
 manager+0.040
man+0.039
 refrigerator+0.038
 restaurant+0.037
Neg logit contributions
 realize-0.038
 unnecessary-0.038
pieces-0.037
 ease-0.037
aults-0.037
Token.
Token ID13
Feature activation+8.49e-02

Pos logit contributions
 realize+0.010
aults+0.009
bs+0.009
oves+0.009
 realise+0.009
Neg logit contributions
 manager-0.007
omm-0.007
 Cream-0.007
 refrigerator-0.007
 restaurant-0.007
Token,
Token ID11
Feature activation-2.25e-02

Pos logit contributions
 realize+0.020
aults+0.019
 forth+0.019
 notice+0.019
 realise+0.018
Neg logit contributions
 novel-0.018
 novels-0.017
 refrigerator-0.017
 chamber-0.017
 manager-0.017
Token my
Token ID616
Feature activation+1.45e+00

Pos logit contributions
man+0.001
 manager+0.001
keeper+0.001
 restaurant+0.001
 chamber+0.001
Neg logit contributions
 realize-0.001
 prod-0.001
pieces-0.001
 realise-0.001
 forth-0.001
Token sweet
Token ID6029
Feature activation-1.13e+00

Pos logit contributions
 realize+0.095
bs+0.087
aults+0.084
apy+0.083
tip+0.083
Neg logit contributions
man-0.059
 Hero-0.057
 shop-0.057
 manager-0.056
keeper-0.056
Tokenies
Token ID444
Feature activation+3.38e+00

Pos logit contributions
 Hero+0.071
 manager+0.069
keeper+0.068
 chamber+0.067
 Cream+0.066
Neg logit contributions
 realize-0.053
 realise-0.043
 forth-0.042
aults-0.041
 notice-0.041
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
 realize+0.225
hing+0.206
 realise+0.202
aults+0.201
 ease+0.200
Neg logit contributions
 manager-0.136
omm-0.135
 grandma-0.134
keeper-0.132
 refrigerator-0.131
Token Sleep
Token ID17376
Feature activation-7.02e+00

Pos logit contributions
 manager+0.016
 old+0.016
 restaurant+0.016
keeper+0.015
 novel+0.015
Neg logit contributions
 realize-0.017
 prod-0.016
 realise-0.015
tip-0.015
vals-0.015
Token tight
Token ID5381
Feature activation-9.14e-01

Pos logit contributions
 dentist+0.373
 novels+0.370
smith+0.344
 refrigerator+0.340
 address+0.333
Neg logit contributions
 realize-0.390
aults-0.372
oves-0.359
ching-0.356
bs-0.356
Token.
Token ID13
Feature activation-2.83e-01

Pos logit contributions
 restaurant+0.037
 refrigerator+0.037
 manager+0.037
omm+0.037
 grandma+0.036
Neg logit contributions
 realize-0.058
bs-0.056
aults-0.055
lest-0.055
 tumble-0.054
Token And
Token ID843
Feature activation+2.99e+00

Pos logit contributions
 manager+0.015
 restaurant+0.014
 old+0.014
keeper+0.014
 refrigerator+0.013
Neg logit contributions
 realize-0.016
 prod-0.015
vals-0.015
 realise-0.015
tip-0.015
Token don
Token ID836
Feature activation-1.86e+00

Pos logit contributions
 realize+0.152
 prod+0.144
 glide+0.137
aults+0.137
pieces+0.136
Neg logit contributions
 manager-0.175
 refrigerator-0.174
man-0.169
 restaurant-0.168
 chamber-0.165
Token't
Token ID470
Feature activation-2.61e-01

Pos logit contributions
 grandma+0.100
 restaurant+0.099
 manager+0.098
 refrigerator+0.098
 old+0.097
Neg logit contributions
 realize-0.098
 prod-0.094
pieces-0.094
aults-0.093
tip-0.089
TokenGood
Token ID10248
Feature activation+6.31e-01

Pos logit contributions
 grandma+0.044
 restaurant+0.042
 manager+0.042
 shop+0.042
 old+0.041
Neg logit contributions
 realize-0.043
 prod-0.043
pieces-0.042
aults-0.041
tip-0.039
Tokennight
Token ID3847
Feature activation-2.19e+00

Pos logit contributions
 realize+0.039
 forth+0.036
tip+0.036
 prod+0.035
 realise+0.035
Neg logit contributions
 manager-0.029
keeper-0.028
 shop-0.027
 refrigerator-0.027
 restaurant-0.027
Token,
Token ID11
Feature activation+9.92e-02

Pos logit contributions
 novels+0.110
 truth+0.109
 manager+0.107
keeper+0.107
 values+0.106
Neg logit contributions
 realize-0.114
 notice-0.105
 realise-0.100
aults-0.099
 prod-0.098
Token Lily
Token ID20037
Feature activation+4.70e-01

Pos logit contributions
 realize+0.007
pieces+0.006
 realise+0.006
 prod+0.006
 tumble+0.006
Neg logit contributions
man-0.004
 manager-0.004
keeper-0.004
 restaurant-0.004
ber-0.004
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
 realize+0.027
 forth+0.025
aults+0.024
 realise+0.024
ault+0.023
Neg logit contributions
 manager-0.024
 restaurant-0.024
 refrigerator-0.023
keeper-0.023
 novels-0.022
Token Sleep
Token ID17376
Feature activation-7.02e+00

Pos logit contributions
 realize+0.007
 prod+0.007
tip+0.007
pieces+0.007
 tumble+0.007
Neg logit contributions
 restaurant-0.006
 manager-0.006
 novel-0.006
 old-0.006
 refrigerator-0.006
Token well
Token ID880
Feature activation-3.05e+00

Pos logit contributions
 novels+0.393
 dentist+0.377
 shop+0.365
 Hero+0.364
 address+0.364
Neg logit contributions
 realize-0.378
 notice-0.329
arse-0.327
oves-0.327
bs-0.326
Token."
Token ID526
Feature activation-8.56e-01

Pos logit contributions
 manager+0.131
keeper+0.129
 gorilla+0.129
omm+0.129
 refrigerator+0.129
Neg logit contributions
bs-0.184
 realize-0.179
lest-0.172
 forth-0.172
aults-0.169
Token Lily
Token ID20037
Feature activation+4.62e-01

Pos logit contributions
 manager+0.028
 restaurant+0.028
 grandma+0.027
man+0.027
 refrigerator+0.027
Neg logit contributions
 realize-0.063
 prod-0.061
pieces-0.061
 realise-0.059
aults-0.058
Token replied
Token ID8712
Feature activation-1.06e+00

Pos logit contributions
 realize+0.025
 realise+0.023
 prod+0.023
aults+0.022
 forth+0.022
Neg logit contributions
 refrigerator-0.025
 manager-0.025
 restaurant-0.024
 chamber-0.024
keeper-0.023
Token,
Token ID11
Feature activation+1.13e-01

Pos logit contributions
 restaurant+0.042
 manager+0.041
 refrigerator+0.040
 grandma+0.040
 chamber+0.039
Neg logit contributions
 realize-0.071
 prod-0.068
aults-0.068
tip-0.065
 realise-0.065
TokenGood
Token ID10248
Feature activation+7.04e-01

Pos logit contributions
 grandma+0.045
 shop+0.044
 restaurant+0.044
 manager+0.044
man+0.043
Neg logit contributions
 realize-0.038
 prod-0.037
pieces-0.037
aults-0.035
cking-0.034
Tokennight
Token ID3847
Feature activation-2.20e+00

Pos logit contributions
 realize+0.045
 forth+0.043
 prod+0.041
tip+0.041
 tumble+0.040
Neg logit contributions
 manager-0.031
keeper-0.030
 refrigerator-0.029
 shop-0.029
 chamber-0.028
Token,
Token ID11
Feature activation+9.94e-02

Pos logit contributions
 novels+0.110
 truth+0.109
 manager+0.107
 refrigerator+0.105
keeper+0.105
Neg logit contributions
 realize-0.113
 notice-0.102
aults-0.100
bs-0.099
 realise-0.099
Token Max
Token ID5436
Feature activation-5.11e-01

Pos logit contributions
 realize+0.006
 prod+0.005
 tumble+0.005
 forth+0.005
 realise+0.005
Neg logit contributions
 restaurant-0.005
 refrigerator-0.005
keeper-0.005
ber-0.005
 manager-0.005
Token.
Token ID13
Feature activation+1.55e-01

Pos logit contributions
 manager+0.028
 restaurant+0.026
 refrigerator+0.026
keeper+0.026
 novels+0.026
Neg logit contributions
 realize-0.027
aults-0.025
ching-0.025
 forth-0.024
 realise-0.024
Token Sleep
Token ID17376
Feature activation-7.01e+00

Pos logit contributions
 realize+0.009
 prod+0.009
 tumble+0.009
hing+0.008
 ease+0.008
Neg logit contributions
 old-0.007
 restaurant-0.007
 manager-0.007
 refrigerator-0.007
 chamber-0.007
Token tight
Token ID5381
Feature activation-9.05e-01

Pos logit contributions
 novels+0.417
 dentist+0.407
 address+0.389
 refrigerator+0.370
 shop+0.367
Neg logit contributions
 realize-0.364
aults-0.337
arse-0.326
oves-0.324
 free-0.322
Token."
Token ID526
Feature activation-8.81e-01

Pos logit contributions
 grandma+0.041
 manager+0.040
 dentist+0.039
keeper+0.038
 restaurant+0.037
Neg logit contributions
 realize-0.053
 tumble-0.052
 glide-0.052
aults-0.052
lest-0.051
Token Max
Token ID5436
Feature activation-5.15e-01

Pos logit contributions
 restaurant+0.030
 manager+0.030
 grandma+0.030
 refrigerator+0.030
 old+0.030
Neg logit contributions
 realize-0.063
 prod-0.062
pieces-0.061
aults-0.060
tip-0.059
Token fell
Token ID3214
Feature activation+1.11e+00

Pos logit contributions
 refrigerator+0.029
 manager+0.028
 chamber+0.027
 grandma+0.027
 novels+0.026
Neg logit contributions
 realize-0.028
 prod-0.025
aults-0.025
 realise-0.025
 glide-0.025
Token asleep
Token ID16039
Feature activation-2.58e+00

Pos logit contributions
 realize+0.079
 prod+0.076
 tumble+0.076
 forth+0.075
aults+0.074
Neg logit contributions
 restaurant-0.039
 grandma-0.038
 refrigerator-0.037
 manager-0.037
keeper-0.036
Token saw
Token ID2497
Feature activation+3.73e-01

Pos logit contributions
 refrigerator+0.036
 manager+0.035
 novels+0.035
omm+0.035
 grandma+0.034
Neg logit contributions
 realize-0.026
 sudden-0.024
 notice-0.023
 prod-0.022
aults-0.022
Token a
Token ID257
Feature activation+3.48e-01

Pos logit contributions
 realize+0.019
 sudden+0.018
 ease+0.018
 unnecessary+0.017
 prod+0.016
Neg logit contributions
 manager-0.021
keeper-0.021
man-0.020
omm-0.020
ber-0.020
Token big
Token ID1263
Feature activation-1.50e-01

Pos logit contributions
 realize+0.021
 prod+0.021
aults+0.020
 sudden+0.020
 tumble+0.019
Neg logit contributions
 manager-0.015
 grandma-0.015
 about-0.014
 refrigerator-0.014
keeper-0.014
Token ball
Token ID2613
Feature activation+1.15e+00

Pos logit contributions
 manager+0.008
keeper+0.007
ber+0.007
omm+0.007
 grandma+0.007
Neg logit contributions
 realize-0.008
 forth-0.008
 tumble-0.008
 prod-0.008
 swat-0.008
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
 realize+0.079
 prod+0.077
aults+0.076
pieces+0.076
 tumble+0.075
Neg logit contributions
 grandma-0.044
 manager-0.042
 restaurant-0.041
keeper-0.040
 refrigerator-0.040
Token Sleep
Token ID17376
Feature activation-7.01e+00

Pos logit contributions
 realize+0.008
 glide+0.008
 tumble+0.008
 prod+0.008
apy+0.007
Neg logit contributions
ber-0.009
 manager-0.009
man-0.009
 keeper-0.009
 grandma-0.009
Tokeny
Token ID88
Feature activation-1.46e+00

Pos logit contributions
 novels+0.406
 dentist+0.403
 shop+0.381
 grandma+0.376
 address+0.372
Neg logit contributions
 realize-0.339
aults-0.333
oves-0.314
bs-0.311
 notice-0.310
Token loved
Token ID6151
Feature activation-8.14e-01

Pos logit contributions
 manager+0.074
 refrigerator+0.074
 grandma+0.073
omm+0.070
 novels+0.068
Neg logit contributions
 sudden-0.081
 realize-0.080
 prod-0.079
aults-0.078
 glide-0.076
Token balls
Token ID11333
Feature activation+2.17e+00

Pos logit contributions
 manager+0.045
keeper+0.045
omm+0.042
 refrigerator+0.041
man+0.041
Neg logit contributions
 unnecessary-0.042
 realize-0.041
 sudden-0.041
 tumble-0.040
 forth-0.040
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
 realize+0.144
 ease+0.136
aults+0.135
 tumble+0.134
pieces+0.133
Neg logit contributions
 grandma-0.087
 manager-0.085
omm-0.083
 restaurant-0.081
keeper-0.081
Token He
Token ID679
Feature activation-3.10e-01

Pos logit contributions
 realize+0.009
 glide+0.008
 unnecessary+0.008
 ease+0.008
 tumble+0.008
Neg logit contributions
 manager-0.009
man-0.009
ber-0.009
 keeper-0.009
keeper-0.008
Token Do
Token ID2141
Feature activation-3.18e-01

Pos logit contributions
 manager+0.038
 old+0.037
 restaurant+0.037
 refrigerator+0.036
man+0.036
Neg logit contributions
 prod-0.033
 realize-0.032
pieces-0.032
tip-0.031
bs-0.031
Token you
Token ID345
Feature activation+1.46e-01

Pos logit contributions
 manager+0.018
 refrigerator+0.018
 restaurant+0.017
keeper+0.017
 grandma+0.017
Neg logit contributions
 realize-0.016
pieces-0.016
 prod-0.016
aults-0.015
tip-0.015
Token like
Token ID588
Feature activation+1.18e-01

Pos logit contributions
 prod+0.008
pieces+0.007
aults+0.007
cking+0.007
 realize+0.007
Neg logit contributions
 grandma-0.008
 restaurant-0.008
 old-0.008
 about-0.008
 is-0.008
Token my
Token ID616
Feature activation+1.50e+00

Pos logit contributions
pieces+0.006
 realize+0.006
 prod+0.006
bs+0.005
 swat+0.005
Neg logit contributions
 manager-0.007
keeper-0.007
man-0.006
 refrigerator-0.006
 chamber-0.006
Token coffee
Token ID6891
Feature activation-5.20e-01

Pos logit contributions
 realize+0.075
pieces+0.074
 blush+0.071
bs+0.070
 prod+0.069
Neg logit contributions
keeper-0.079
 Hero-0.078
man-0.077
 manager-0.076
omm-0.073
Token shop
Token ID6128
Feature activation-7.00e+00

Pos logit contributions
 Hero+0.021
 de+0.021
keeper+0.020
 manager+0.019
 chamber+0.019
Neg logit contributions
 forth-0.030
 blush-0.029
pieces-0.028
 realize-0.027
wheel-0.027
Token?
Token ID30
Feature activation-5.46e-01

Pos logit contributions
 Hero+0.309
 de+0.302
 chamber+0.298
 lair+0.292
 truth+0.292
Neg logit contributions
 forth-0.391
pieces-0.375
 realize-0.360
aults-0.352
bs-0.351
Token I
Token ID314
Feature activation-1.97e-02

Pos logit contributions
 manager+0.025
 restaurant+0.025
 chamber+0.024
 old+0.024
 refrigerator+0.024
Neg logit contributions
 prod-0.032
 realize-0.032
pieces-0.032
bs-0.031
tip-0.031
Token sell
Token ID3677
Feature activation-1.97e+00

Pos logit contributions
 refrigerator+0.001
 grandma+0.001
 manager+0.001
keeper+0.001
 old+0.001
Neg logit contributions
 realize-0.001
 prod-0.001
 realise-0.001
pieces-0.001
aults-0.001
Token coffee
Token ID6891
Feature activation-5.23e-01

Pos logit contributions
 chamber+0.106
 manager+0.105
 restaurant+0.100
 refrigerator+0.100
arium+0.097
Neg logit contributions
pieces-0.100
 prod-0.096
tip-0.094
 realize-0.094
bs-0.094
Token and
Token ID290
Feature activation-4.12e-01

Pos logit contributions
 chamber+0.022
 lair+0.021
 manager+0.021
 Hero+0.021
 grandma+0.021
Neg logit contributions
bs-0.029
pieces-0.029
aults-0.029
 blush-0.028
vals-0.028
Token to
Token ID284
Feature activation+1.01e-01

Pos logit contributions
 Amen+0.019
 Shop+0.019
 chamber+0.019
man+0.019
 de+0.019
Neg logit contributions
 realize-0.022
 prod-0.020
 ease-0.020
 notice-0.020
 realise-0.020
Token go
Token ID467
Feature activation+9.11e-01

Pos logit contributions
 realize+0.004
 prod+0.004
pieces+0.004
aults+0.004
tip+0.004
Neg logit contributions
 grandma-0.007
 restaurant-0.006
 manager-0.006
 old-0.006
 shop-0.006
Token to
Token ID284
Feature activation+1.01e-01

Pos logit contributions
 realize+0.050
 notice+0.047
 realise+0.045
 forth+0.045
tip+0.044
Neg logit contributions
 refrigerator-0.047
 de-0.046
omm-0.045
 novels-0.044
 chamber-0.044
Token the
Token ID262
Feature activation-2.78e-01

Pos logit contributions
 realize+0.006
 prod+0.006
 realise+0.006
 ease+0.006
 notice+0.006
Neg logit contributions
 refrigerator-0.005
man-0.005
keeper-0.005
 manager-0.005
 restaurant-0.004
Token doll
Token ID3654
Feature activation-6.73e-02

Pos logit contributions
 about+0.010
 de+0.010
abel+0.009
omm+0.009
 God+0.009
Neg logit contributions
 realize-0.018
aults-0.017
 prod-0.017
 sidelines-0.017
pieces-0.017
Token shop
Token ID6128
Feature activation-7.00e+00

Pos logit contributions
 de+0.003
 refrigerator+0.003
omm+0.002
 Cream+0.002
 old+0.002
Neg logit contributions
 realize-0.005
pieces-0.004
aults-0.004
 realise-0.004
 forth-0.004
Token.
Token ID13
Feature activation-3.93e-01

Pos logit contributions
 Sleep+0.393
omm+0.385
 Amen+0.383
 prayers+0.365
 refrigerator+0.360
Neg logit contributions
 realize-0.345
aults-0.340
 notice-0.335
 forth-0.328
ault-0.315
Token She
Token ID1375
Feature activation+1.02e-01

Pos logit contributions
 restaurant+0.017
 manager+0.017
 refrigerator+0.016
 grandma+0.016
 chamber+0.016
Neg logit contributions
 realize-0.025
 prod-0.025
 realise-0.023
pieces-0.023
tip-0.023
Token does
Token ID857
Feature activation-2.25e+00

Pos logit contributions
 realize+0.006
 prod+0.005
bs+0.005
 realise+0.005
 tumble+0.005
Neg logit contributions
man-0.006
 manager-0.006
eddy-0.006
 chamber-0.006
keeper-0.006
Token not
Token ID407
Feature activation-1.36e+00

Pos logit contributions
 grandma+0.069
 restaurant+0.065
 manager+0.065
 old+0.064
 refrigerator+0.064
Neg logit contributions
 realize-0.173
 prod-0.171
pieces-0.169
aults-0.166
tip-0.164
Token want
Token ID765
Feature activation+1.32e+00

Pos logit contributions
 grandma+0.079
 old+0.078
 restaurant+0.077
 about+0.077
 Anna+0.076
Neg logit contributions
pieces-0.069
 sudden-0.067
 prod-0.066
apy-0.063
vals-0.063
Token\n
Token ID198
Feature activation+5.01e-01

Pos logit contributions
 ease+0.001
 realize+0.001
 glide+0.001
 sudden+0.001
 tumble+0.001
Neg logit contributions
 manager-0.002
keeper-0.002
 keeper-0.002
ber-0.002
man-0.002
Token\n
Token ID198
Feature activation+5.10e-01

Pos logit contributions
 realize+0.026
 sudden+0.025
 realise+0.024
 ease+0.024
 notice+0.024
Neg logit contributions
 manager-0.027
 refrigerator-0.026
keeper-0.026
omm-0.025
 Shop-0.025
TokenOne
Token ID3198
Feature activation-3.06e+00

Pos logit contributions
aults+0.019
 sudden+0.018
 realize+0.018
Eventually+0.017
Finally+0.017
Neg logit contributions
 manager-0.036
 grandma-0.036
keeper-0.035
 refrigerator-0.035
 old-0.034
Token day
Token ID1110
Feature activation+3.51e-01

Pos logit contributions
 manager+0.171
keeper+0.171
 refrigerator+0.158
 shop+0.157
omm+0.156
Neg logit contributions
 realize-0.161
bs-0.144
tip-0.144
 tumble-0.142
 realise-0.142
Token,
Token ID11
Feature activation+8.10e-02

Pos logit contributions
 realize+0.023
 prod+0.023
aults+0.022
 sudden+0.022
bs+0.022
Neg logit contributions
 manager-0.014
keeper-0.013
 grandma-0.013
 refrigerator-0.013
 old-0.013
Token Sleep
Token ID17376
Feature activation-6.99e+00

Pos logit contributions
 sudden+0.005
 realize+0.005
aults+0.005
apy+0.005
 prod+0.005
Neg logit contributions
 manager-0.004
keeper-0.004
 keeper-0.003
 refrigerator-0.003
man-0.003
Tokeny
Token ID88
Feature activation-1.44e+00

Pos logit contributions
 novels+0.377
 dentist+0.364
 manager+0.360
 refrigerator+0.359
 shop+0.352
Neg logit contributions
 realize-0.366
aults-0.352
bs-0.344
 notice-0.338
 sudden-0.329
Token went
Token ID1816
Feature activation+8.86e-01

Pos logit contributions
 refrigerator+0.084
omm+0.083
 manager+0.082
 novels+0.081
keeper+0.078
Neg logit contributions
 realize-0.074
 sudden-0.066
 notice-0.065
aults-0.063
bs-0.063
Token for
Token ID329
Feature activation-7.29e-01

Pos logit contributions
 realize+0.059
 prod+0.057
aults+0.055
 glide+0.055
 unnecessary+0.055
Neg logit contributions
 grandma-0.035
 manager-0.034
keeper-0.033
 refrigerator-0.033
 restaurant-0.032
Token a
Token ID257
Feature activation+3.11e-01

Pos logit contributions
keeper+0.040
 manager+0.038
man+0.036
 refrigerator+0.036
ber+0.035
Neg logit contributions
 realize-0.042
 unnecessary-0.041
 ease-0.040
 sudden-0.039
 notice-0.038
Token walk
Token ID2513
Feature activation-1.11e+00

Pos logit contributions
 realize+0.016
pieces+0.016
 prod+0.016
aults+0.015
tip+0.015
Neg logit contributions
 grandma-0.018
 restaurant-0.017
 old-0.017
 shop-0.017
 refrigerator-0.017
Token day
Token ID1110
Feature activation+3.47e-01

Pos logit contributions
keeper+0.069
 manager+0.069
 grandma+0.067
 refrigerator+0.066
 restaurant+0.063
Neg logit contributions
 realize-0.078
hing-0.074
 prod-0.073
 forth-0.072
 tumble-0.072
Token and
Token ID290
Feature activation+3.34e-02

Pos logit contributions
 realize+0.019
aults+0.019
 sudden+0.017
 notice+0.017
bs+0.017
Neg logit contributions
 manager-0.018
keeper-0.018
 refrigerator-0.018
 grandma-0.017
man-0.017
Token all
Token ID477
Feature activation-1.37e+00

Pos logit contributions
 realize+0.002
 prod+0.002
 realise+0.002
 tumble+0.002
 glide+0.002
Neg logit contributions
 manager-0.002
keeper-0.002
 refrigerator-0.002
 grandma-0.002
omm-0.002
Token night
Token ID1755
Feature activation+4.38e-01

Pos logit contributions
keeper+0.063
 manager+0.063
 grandma+0.062
 refrigerator+0.062
 shop+0.058
Neg logit contributions
 realize-0.085
 prod-0.079
hing-0.079
pieces-0.078
 realise-0.078
Token.
Token ID13
Feature activation+1.50e-02

Pos logit contributions
 realize+0.032
aults+0.031
 prod+0.031
pieces+0.030
tip+0.030
Neg logit contributions
 manager-0.015
 grandma-0.014
 refrigerator-0.014
keeper-0.014
 shop-0.014
Token Sleep
Token ID17376
Feature activation-6.99e+00

Pos logit contributions
 ease+0.001
 realize+0.001
First+0.001
Br+0.001
 glide+0.001
Neg logit contributions
 manager-0.001
keeper-0.001
 keeper-0.001
ber-0.001
man-0.001
Tokeny
Token ID88
Feature activation-1.41e+00

Pos logit contributions
 dentist+0.360
 novels+0.349
 refrigerator+0.345
 manager+0.341
 address+0.338
Neg logit contributions
 realize-0.382
aults-0.360
bs-0.348
 notice-0.343
 sudden-0.340
Token had
Token ID550
Feature activation+4.13e-01

Pos logit contributions
 refrigerator+0.077
keeper+0.074
 manager+0.074
omm+0.072
 freezer+0.070
Neg logit contributions
 realize-0.081
bs-0.071
 realise-0.070
aults-0.070
ede-0.069
Token a
Token ID257
Feature activation+3.15e-01

Pos logit contributions
 realize+0.023
 ease+0.023
aults+0.022
 sudden+0.022
 unnecessary+0.021
Neg logit contributions
 manager-0.022
keeper-0.021
ber-0.021
 refrigerator-0.020
man-0.020
Token big
Token ID1263
Feature activation-2.09e-01

Pos logit contributions
 realize+0.017
 sudden+0.016
 prod+0.016
aults+0.015
 tumble+0.015
Neg logit contributions
 manager-0.017
keeper-0.016
 refrigerator-0.016
 restaurant-0.015
 about-0.015
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
keeper+0.012
 manager+0.011
ber+0.010
 Cream+0.010
 Hero+0.010
Neg logit contributions
 realize-0.012
 forth-0.011
 tumble-0.011
 prod-0.010
 sudden-0.010
Token.
Token ID13
Feature activation-4.95e-01

Pos logit contributions
 realize+0.079
aults+0.072
 unst+0.071
 tumble+0.071
tip+0.071
Neg logit contributions
 restaurant-0.060
 refrigerator-0.058
 manager-0.058
keeper-0.057
 God-0.057
Token It
Token ID632
Feature activation+7.79e-01

Pos logit contributions
 restaurant+0.026
 manager+0.025
 refrigerator+0.024
 chamber+0.023
 novel+0.023
Neg logit contributions
 realize-0.027
 prod-0.027
pieces-0.026
tip-0.026
bs-0.025
Token belongs
Token ID14448
Feature activation-4.03e-01

Pos logit contributions
 realize+0.056
 tumble+0.055
 bleed+0.054
 prod+0.054
 glide+0.053
Neg logit contributions
 restaurant-0.025
 refrigerator-0.025
 grandma-0.024
 manager-0.024
 chamber-0.023
Token to
Token ID284
Feature activation+5.21e-02

Pos logit contributions
omm+0.024
 Amen+0.023
 refrigerator+0.023
 restaurant+0.022
 novel+0.021
Neg logit contributions
bs-0.019
 unnecessary-0.019
 unst-0.018
 free-0.018
tip-0.017
Token the
Token ID262
Feature activation-1.95e-01

Pos logit contributions
 realize+0.003
 unnecessary+0.003
 ease+0.003
 prod+0.003
 notice+0.003
Neg logit contributions
keeper-0.003
 manager-0.003
man-0.003
 refrigerator-0.003
 novel-0.002
Token shop
Token ID6128
Feature activation-6.97e+00

Pos logit contributions
 about+0.009
 refrigerator+0.009
keeper+0.009
omm+0.009
arium+0.008
Neg logit contributions
 realize-0.011
aults-0.010
 ease-0.010
pieces-0.010
 notice-0.010
Token.
Token ID13
Feature activation-4.79e-01

Pos logit contributions
 Sleep+0.376
omm+0.363
 Amen+0.359
 truth+0.341
 novels+0.335
Neg logit contributions
 forget-0.320
 realize-0.320
aults-0.306
pieces-0.305
 notice-0.288
Token You
Token ID921
Feature activation+7.00e-01

Pos logit contributions
 restaurant+0.025
 manager+0.025
 refrigerator+0.024
 novel+0.024
man+0.024
Neg logit contributions
 realize-0.025
pieces-0.024
 prod-0.024
tip-0.024
bs-0.024
Token have
Token ID423
Feature activation+1.77e+00

Pos logit contributions
 realize+0.032
 prod+0.029
aults+0.028
pieces+0.028
 realise+0.028
Neg logit contributions
 refrigerator-0.044
 restaurant-0.044
 manager-0.044
 grandma-0.043
 old-0.042
Token to
Token ID284
Feature activation+7.74e-02

Pos logit contributions
 realize+0.107
 unnecessary+0.104
 notice+0.100
 ease+0.099
 prod+0.099
Neg logit contributions
man-0.084
 refrigerator-0.082
 manager-0.081
 chamber-0.080
 restaurant-0.079
Token share
Token ID2648
Feature activation-3.01e-01

Pos logit contributions
 realize+0.003
 prod+0.003
pieces+0.003
aults+0.003
tip+0.003
Neg logit contributions
 grandma-0.005
 restaurant-0.005
 manager-0.005
 refrigerator-0.005
 old-0.005
Token man
Token ID582
Feature activation-3.01e+00

Pos logit contributions
 realize+0.009
 prod+0.009
pieces+0.009
 tumble+0.008
aults+0.008
Neg logit contributions
 grandma-0.005
 restaurant-0.005
 about-0.005
 manager-0.005
 refrigerator-0.005
Token comes
Token ID2058
Feature activation+6.27e-01

Pos logit contributions
 grandma+0.149
 refrigerator+0.143
 old+0.143
 manager+0.143
 restaurant+0.142
Neg logit contributions
 realize-0.174
 prod-0.172
pieces-0.167
aults-0.165
tip-0.161
Token out
Token ID503
Feature activation+1.99e+00

Pos logit contributions
 realize+0.045
 prod+0.045
pieces+0.045
aults+0.043
tip+0.043
Neg logit contributions
 grandma-0.022
 old-0.021
 manager-0.021
 restaurant-0.021
 refrigerator-0.021
Token of
Token ID286
Feature activation-5.16e-01

Pos logit contributions
 realize+0.140
 prod+0.139
pieces+0.138
aults+0.135
tip+0.133
Neg logit contributions
 grandma-0.074
 old-0.072
 manager-0.071
 restaurant-0.071
 refrigerator-0.070
Token a
Token ID257
Feature activation+1.37e-01

Pos logit contributions
ber+0.030
keeper+0.028
 manager+0.028
omm+0.027
 Shop+0.027
Neg logit contributions
 ease-0.027
 prod-0.025
 notice-0.024
 realize-0.024
 forth-0.024
Token shop
Token ID6128
Feature activation-6.97e+00

Pos logit contributions
 realize+0.009
 prod+0.009
 sudden+0.008
pieces+0.008
 notice+0.008
Neg logit contributions
 about-0.005
 grandma-0.005
omm-0.005
 manager-0.005
man-0.005
Token.
Token ID13
Feature activation-3.01e-01

Pos logit contributions
omm+0.332
 Sleep+0.330
 novels+0.303
 truth+0.298
 Hero+0.293
Neg logit contributions
aults-0.375
 realize-0.371
ging-0.365
 forth-0.352
cking-0.351
Token He
Token ID679
Feature activation-6.31e-01

Pos logit contributions
 restaurant+0.013
 hotel+0.013
ber+0.013
 chamber+0.013
 year+0.013
Neg logit contributions
 realize-0.018
 prod-0.017
 realise-0.016
 notice-0.016
bs-0.016
Token sees
Token ID7224
Feature activation+8.49e-01

Pos logit contributions
eddy+0.029
 manager+0.028
 Sleep+0.028
robe+0.028
 grandma+0.027
Neg logit contributions
 prod-0.035
 realize-0.034
 swung-0.032
 tumble-0.031
 AW-0.030
Token Tom
Token ID4186
Feature activation-5.40e-01

Pos logit contributions
 realize+0.055
 prod+0.054
pieces+0.054
 ease+0.052
aults+0.050
Neg logit contributions
 manager-0.034
 chamber-0.034
 restaurant-0.034
 refrigerator-0.034
man-0.033
Token with
Token ID351
Feature activation-1.11e+00

Pos logit contributions
 grandma+0.019
 manager+0.019
 refrigerator+0.019
 restaurant+0.018
 chamber+0.018
Neg logit contributions
 realize-0.038
aults-0.037
 prod-0.037
cking-0.036
pieces-0.036
Token them
Token ID606
Feature activation-6.55e-01

Pos logit contributions
 realize+0.008
 prod+0.008
pieces+0.008
aults+0.008
tip+0.007
Neg logit contributions
 grandma-0.006
 old-0.006
 manager-0.006
 restaurant-0.006
 shop-0.006
Token to
Token ID284
Feature activation-3.03e-02

Pos logit contributions
 grandma+0.023
 manager+0.022
 old+0.022
 restaurant+0.022
 shop+0.022
Neg logit contributions
 realize-0.047
 prod-0.047
pieces-0.046
aults-0.045
tip-0.045
Token the
Token ID262
Feature activation-2.43e-01

Pos logit contributions
 refrigerator+0.002
keeper+0.002
man+0.001
 manager+0.001
 novel+0.001
Neg logit contributions
 realize-0.002
 prod-0.002
 realise-0.002
 notice-0.002
 ease-0.002
Token ice
Token ID4771
Feature activation-1.83e+00

Pos logit contributions
omm+0.010
 about+0.010
man+0.009
eddy+0.009
 year+0.009
Neg logit contributions
 realize-0.016
 prod-0.015
 notice-0.014
aults-0.014
pieces-0.014
Token cream
Token ID8566
Feature activation-4.01e-01

Pos logit contributions
 truth+0.095
omm+0.094
 Hero+0.094
 Sleep+0.094
 God+0.093
Neg logit contributions
 forth-0.091
cking-0.085
 realize-0.085
aults-0.083
bs-0.082
Token shop
Token ID6128
Feature activation-6.97e+00

Pos logit contributions
 de+0.011
 novels+0.011
 old+0.011
 chamber+0.011
 about+0.010
Neg logit contributions
 realize-0.029
 forth-0.028
pieces-0.027
aults-0.027
 prod-0.027
Token.
Token ID13
Feature activation-5.56e-01

Pos logit contributions
omm+0.381
 Sleep+0.373
 novels+0.368
 novel+0.359
 Amen+0.358
Neg logit contributions
aults-0.337
 realize-0.331
 forth-0.313
ging-0.311
oves-0.306
Token They
Token ID1119
Feature activation-7.98e-02

Pos logit contributions
 novel+0.026
 manager+0.026
 restaurant+0.025
man+0.025
 refrigerator+0.025
Neg logit contributions
 realize-0.034
 prod-0.032
pieces-0.031
 realise-0.031
bs-0.030
Token buy
Token ID2822
Feature activation-1.44e+00

Pos logit contributions
 chamber+0.005
 refrigerator+0.005
 grandma+0.005
 manager+0.005
man+0.005
Neg logit contributions
 realize-0.003
 prod-0.003
 realise-0.003
aults-0.003
pieces-0.003
Token them
Token ID606
Feature activation-6.54e-01

Pos logit contributions
 chamber+0.074
 manager+0.069
man+0.068
 restaurant+0.068
 refrigerator+0.068
Neg logit contributions
 realize-0.078
 ease-0.078
pieces-0.078
 prod-0.077
tip-0.075
Token ice
Token ID4771
Feature activation-1.82e+00

Pos logit contributions
 chamber+0.027
 refrigerator+0.025
 restaurant+0.025
 manager+0.024
 novel+0.024
Neg logit contributions
 realize-0.042
 prod-0.039
pieces-0.039
aults-0.039
tip-0.039
Token I
Token ID314
Feature activation+9.95e-03

Pos logit contributions
 manager+0.012
 refrigerator+0.012
 restaurant+0.012
man+0.011
omm+0.011
Neg logit contributions
 realize-0.018
 prod-0.018
pieces-0.017
bs-0.017
tip-0.017
Token can
Token ID460
Feature activation-5.78e-01

Pos logit contributions
 realize+0.001
 prod+0.001
pieces+0.001
aults+0.001
 realise+0.000
Neg logit contributions
 grandma-0.001
 refrigerator-0.001
 manager-0.001
 old-0.001
 restaurant-0.001
Token go
Token ID467
Feature activation+8.75e-01

Pos logit contributions
 grandma+0.037
 restaurant+0.036
 manager+0.035
 shop+0.035
 old+0.035
Neg logit contributions
 realize-0.025
 prod-0.025
pieces-0.025
aults-0.024
tip-0.023
Token to
Token ID284
Feature activation+8.50e-02

Pos logit contributions
 realize+0.057
pieces+0.056
 prod+0.056
aults+0.054
tip+0.053
Neg logit contributions
 grandma-0.036
 refrigerator-0.036
 manager-0.036
 restaurant-0.035
 chamber-0.035
Token the
Token ID262
Feature activation-3.16e-01

Pos logit contributions
 realize+0.005
 prod+0.005
pieces+0.005
 ease+0.005
 realise+0.005
Neg logit contributions
 manager-0.004
keeper-0.004
 refrigerator-0.004
man-0.004
ber-0.004
Token shop
Token ID6128
Feature activation-6.96e+00

Pos logit contributions
 about+0.015
 manager+0.014
omm+0.014
 de+0.014
 Cream+0.013
Neg logit contributions
aults-0.018
 realize-0.018
pieces-0.017
ault-0.017
 prod-0.017
Token and
Token ID290
Feature activation-2.95e-01

Pos logit contributions
omm+0.359
 truth+0.349
 refrigerator+0.334
 novels+0.330
 de+0.322
Neg logit contributions
aults-0.368
 realize-0.349
 notice-0.334
ault-0.333
pieces-0.332
Token buy
Token ID2822
Feature activation-1.46e+00

Pos logit contributions
 grandma+0.020
 manager+0.020
 refrigerator+0.020
 restaurant+0.020
 old+0.019
Neg logit contributions
 realize-0.012
 prod-0.011
pieces-0.011
aults-0.010
 realise-0.010
Token one
Token ID530
Feature activation-8.34e-01

Pos logit contributions
 manager+0.084
 chamber+0.082
man+0.082
ber+0.079
 refrigerator+0.078
Neg logit contributions
pieces-0.073
 ease-0.072
tip-0.069
 realize-0.067
 unnecessary-0.067
Token."
Token ID526
Feature activation-1.13e+00

Pos logit contributions
 manager+0.035
 restaurant+0.032
 grandma+0.030
 Hero+0.030
 refrigerator+0.030
Neg logit contributions
pieces-0.051
tip-0.050
 realize-0.050
 prod-0.050
 forth-0.048
Token
Token ID198
Feature activation+2.77e-01

Pos logit contributions
 grandma+0.039
 refrigerator+0.039
 manager+0.039
 restaurant+0.039
 chamber+0.038
Neg logit contributions
 realize-0.082
 prod-0.080
pieces-0.078
 realise-0.076
aults-0.076
Token dog
Token ID3290
Feature activation-2.09e-01

Pos logit contributions
 manager+0.017
 about+0.017
keeper+0.016
 grandma+0.016
 truth+0.016
Neg logit contributions
 realize-0.034
 prod-0.033
tip-0.032
 forth-0.032
aults-0.032
Token named
Token ID3706
Feature activation-1.22e+00

Pos logit contributions
 manager+0.012
keeper+0.012
 refrigerator+0.012
 truth+0.012
omm+0.012
Neg logit contributions
 realize-0.009
 prod-0.009
bs-0.009
 swat-0.008
cking-0.008
Token Sleep
Token ID17376
Feature activation-6.96e+00

Pos logit contributions
 manager+0.048
keeper+0.046
 restaurant+0.044
 refrigerator+0.043
 shop+0.043
Neg logit contributions
 prod-0.082
 realize-0.081
aults-0.080
 tumble-0.079
 unnecessary-0.078
Tokeny
Token ID88
Feature activation-1.36e+00

Pos logit contributions
 novels+0.370
 dentist+0.368
 manager+0.357
 refrigerator+0.357
 address+0.349
Neg logit contributions
aults-0.352
 realize-0.336
bs-0.333
oves-0.318
 free-0.317
Token.
Token ID13
Feature activation+2.26e-03

Pos logit contributions
 manager+0.065
 refrigerator+0.062
 restaurant+0.060
 freezer+0.058
keeper+0.058
Neg logit contributions
 sudden-0.072
 realize-0.071
aults-0.069
 swat-0.067
 glide-0.066
Token Sleep
Token ID17376
Feature activation-6.96e+00

Pos logit contributions
 realize+0.000
 tumble+0.000
 prod+0.000
 glide+0.000
 ease+0.000
Neg logit contributions
 manager-0.000
keeper-0.000
ber-0.000
man-0.000
 keeper-0.000
Tokeny
Token ID88
Feature activation-1.37e+00

Pos logit contributions
 dentist+0.355
 novels+0.340
 manager+0.337
 refrigerator+0.332
 address+0.328
Neg logit contributions
 realize-0.372
bs-0.352
aults-0.352
 free-0.349
ging-0.332
Token was
Token ID373
Feature activation-9.25e-01

Pos logit contributions
 refrigerator+0.076
 manager+0.076
omm+0.072
keeper+0.071
 dentist+0.069
Neg logit contributions
 realize-0.067
 sudden-0.067
 swung-0.064
 soared-0.064
aults-0.062
Token a
Token ID257
Feature activation+3.33e-01

Pos logit contributions
keeper+0.054
 manager+0.054
ber+0.050
 refrigerator+0.049
 restaurant+0.049
Neg logit contributions
 realize-0.045
 ease-0.044
 sudden-0.043
aults-0.043
 free-0.041
Token very
Token ID845
Feature activation+8.60e-01

Pos logit contributions
 realize+0.018
 prod+0.017
aults+0.016
 sudden+0.016
tip+0.016
Neg logit contributions
 manager-0.017
keeper-0.017
 refrigerator-0.017
 restaurant-0.017
 grandma-0.017
Token sleepy
Token ID42023
Feature activation-7.05e-01

Pos logit contributions
 realize+0.040
bs+0.038
 prod+0.037
tip+0.036
 tumble+0.035
Neg logit contributions
 manager-0.049
keeper-0.049
 novels-0.048
 refrigerator-0.048
 shop-0.047
Token was
Token ID373
Feature activation-8.92e-01

Pos logit contributions
 manager+0.083
keeper+0.082
 waitress+0.078
 novels+0.077
 waiter+0.076
Neg logit contributions
 realize-0.106
 ease-0.101
 tumble-0.100
apy-0.097
 prod-0.097
Token a
Token ID257
Feature activation+3.40e-01

Pos logit contributions
 grandma+0.027
 restaurant+0.026
 manager+0.026
 refrigerator+0.026
 old+0.025
Neg logit contributions
 realize-0.068
 prod-0.068
pieces-0.067
aults-0.066
tip-0.065
Token little
Token ID1310
Feature activation-4.97e-01

Pos logit contributions
 realize+0.022
 prod+0.022
aults+0.021
pieces+0.021
 sudden+0.021
Neg logit contributions
 manager-0.013
keeper-0.013
 refrigerator-0.013
 about-0.013
 grandma-0.013
Token dog
Token ID3290
Feature activation-2.09e-01

Pos logit contributions
 manager+0.017
 about+0.017
keeper+0.016
 grandma+0.016
 truth+0.016
Neg logit contributions
 realize-0.034
 prod-0.033
tip-0.032
 forth-0.032
aults-0.032
Token named
Token ID3706
Feature activation-1.22e+00

Pos logit contributions
 manager+0.012
keeper+0.012
 refrigerator+0.012
 truth+0.012
omm+0.012
Neg logit contributions
 realize-0.009
 prod-0.009
bs-0.009
 swat-0.008
cking-0.008
Token Sleep
Token ID17376
Feature activation-6.96e+00

Pos logit contributions
 manager+0.048
keeper+0.046
 restaurant+0.044
 refrigerator+0.043
 shop+0.043
Neg logit contributions
 prod-0.082
 realize-0.081
aults-0.080
 tumble-0.079
 unnecessary-0.078
Tokeny
Token ID88
Feature activation-1.36e+00

Pos logit contributions
 novels+0.370
 dentist+0.368
 manager+0.357
 refrigerator+0.357
 address+0.349
Neg logit contributions
aults-0.352
 realize-0.336
bs-0.333
oves-0.318
 free-0.317
Token.
Token ID13
Feature activation+2.26e-03

Pos logit contributions
 manager+0.065
 refrigerator+0.062
 restaurant+0.060
 freezer+0.058
keeper+0.058
Neg logit contributions
 sudden-0.072
 realize-0.071
aults-0.069
 swat-0.067
 glide-0.066
Token Sleep
Token ID17376
Feature activation-6.96e+00

Pos logit contributions
 realize+0.000
 tumble+0.000
 prod+0.000
 glide+0.000
 ease+0.000
Neg logit contributions
 manager-0.000
keeper-0.000
ber-0.000
man-0.000
 keeper-0.000
Tokeny
Token ID88
Feature activation-1.37e+00

Pos logit contributions
 dentist+0.355
 novels+0.340
 manager+0.337
 refrigerator+0.332
 address+0.328
Neg logit contributions
 realize-0.372
bs-0.352
aults-0.352
 free-0.349
ging-0.332
Token was
Token ID373
Feature activation-9.25e-01

Pos logit contributions
 refrigerator+0.076
 manager+0.076
omm+0.072
keeper+0.071
 dentist+0.069
Neg logit contributions
 realize-0.067
 sudden-0.067
 swung-0.064
 soared-0.064
aults-0.062
Token you
Token ID345
Feature activation+2.27e-01

Pos logit contributions
 realize+0.015
 prod+0.015
pieces+0.014
aults+0.014
 tumble+0.014
Neg logit contributions
 manager-0.013
 restaurant-0.013
 grandma-0.013
 refrigerator-0.013
 chamber-0.013
Token need
Token ID761
Feature activation+4.78e-01

Pos logit contributions
pieces+0.013
 prod+0.013
 realize+0.012
aults+0.012
tip+0.012
Neg logit contributions
 grandma-0.012
 old-0.011
 shop-0.011
 refrigerator-0.011
 manager-0.011
Token to
Token ID284
Feature activation+1.22e-01

Pos logit contributions
 realize+0.035
 prod+0.034
pieces+0.033
tip+0.033
 ease+0.032
Neg logit contributions
 manager-0.016
 grandma-0.016
 restaurant-0.016
 refrigerator-0.016
 chamber-0.015
Token sleep
Token ID3993
Feature activation-1.80e+00

Pos logit contributions
pieces+0.006
 prod+0.006
 realize+0.005
tip+0.005
aults+0.005
Neg logit contributions
 grandma-0.007
 shop-0.007
 old-0.007
 restaurant-0.007
 refrigerator-0.007
Token.
Token ID13
Feature activation+9.13e-04

Pos logit contributions
keeper+0.093
omm+0.085
 manager+0.085
 gorilla+0.083
 restaurant+0.081
Neg logit contributions
 realize-0.096
 free-0.095
aults-0.095
 notice-0.092
 forth-0.092
Token Sleep
Token ID17376
Feature activation-6.95e+00

Pos logit contributions
 prod+0.000
 realize+0.000
pieces+0.000
tip+0.000
 realise+0.000
Neg logit contributions
 restaurant-0.000
 manager-0.000
 old-0.000
 refrigerator-0.000
 chamber-0.000
Token will
Token ID481
Feature activation-2.83e+00

Pos logit contributions
 novels+0.352
 dentist+0.348
 address+0.328
 grandma+0.327
 shop+0.326
Neg logit contributions
 realize-0.416
bs-0.382
ching-0.370
oves-0.370
 notice-0.365
Token help
Token ID1037
Feature activation-1.47e-02

Pos logit contributions
 grandma+0.170
 restaurant+0.166
 manager+0.166
 refrigerator+0.165
 old+0.164
Neg logit contributions
 realize-0.134
 prod-0.131
pieces-0.127
aults-0.125
tip-0.121
Token you
Token ID345
Feature activation+1.98e-01

Pos logit contributions
 refrigerator+0.001
 manager+0.001
 chamber+0.001
 restaurant+0.001
keeper+0.001
Neg logit contributions
 realize-0.001
 prod-0.001
aults-0.001
pieces-0.001
 realise-0.001
Token feel
Token ID1254
Feature activation+1.21e+00

Pos logit contributions
 realize+0.010
 prod+0.010
pieces+0.009
aults+0.009
tip+0.009
Neg logit contributions
 grandma-0.011
 manager-0.011
 restaurant-0.011
 refrigerator-0.011
 old-0.011
Token better
Token ID1365
Feature activation-6.03e-01

Pos logit contributions
 realize+0.054
 prod+0.048
 realise+0.047
 unnecessary+0.046
 ease+0.046
Neg logit contributions
 manager-0.078
 grandma-0.077
 refrigerator-0.077
 restaurant-0.077
man-0.076
Token They
Token ID1119
Feature activation+1.31e-01

Pos logit contributions
 refrigerator+0.051
 restaurant+0.049
 novel+0.049
 chamber+0.049
 manager+0.049
Neg logit contributions
 realize-0.082
 prod-0.077
pieces-0.077
 realise-0.075
 notice-0.074
Token walk
Token ID2513
Feature activation-1.18e+00

Pos logit contributions
 realize+0.005
aults+0.005
 prod+0.005
pieces+0.005
 ease+0.005
Neg logit contributions
 grandma-0.009
 manager-0.009
 refrigerator-0.009
 restaurant-0.009
 old-0.009
Token out
Token ID503
Feature activation+1.98e+00

Pos logit contributions
omm+0.071
 refrigerator+0.071
 dentist+0.068
 truth+0.067
 grandma+0.067
Neg logit contributions
 realize-0.055
pieces-0.051
aults-0.051
 realise-0.050
bs-0.050
Token of
Token ID286
Feature activation-4.95e-01

Pos logit contributions
 realize+0.116
 prod+0.113
pieces+0.112
 realise+0.108
aults+0.107
Neg logit contributions
 grandma-0.094
 refrigerator-0.094
 restaurant-0.093
 Cream-0.090
 manager-0.090
Token the
Token ID262
Feature activation-3.09e-01

Pos logit contributions
ber+0.028
keeper+0.028
omm+0.028
man+0.027
 manager+0.027
Neg logit contributions
 ease-0.024
 prod-0.023
 realize-0.022
 forth-0.022
 notice-0.022
Token shop
Token ID6128
Feature activation-6.95e+00

Pos logit contributions
 de+0.010
 about+0.010
man+0.009
 manager+0.009
 grandma+0.009
Neg logit contributions
 realize-0.021
pieces-0.021
 prod-0.020
 forth-0.020
 sudden-0.020
Token and
Token ID290
Feature activation-3.83e-01

Pos logit contributions
 Sleep+0.283
omm+0.281
 truth+0.268
 Amen+0.266
 novel+0.265
Neg logit contributions
 realize-0.434
aults-0.410
pieces-0.408
 forth-0.395
bs-0.395
Token go
Token ID467
Feature activation+8.83e-01

Pos logit contributions
 refrigerator+0.025
omm+0.024
man+0.024
 manager+0.024
 truth+0.024
Neg logit contributions
 realize-0.017
 prod-0.015
aults-0.014
pieces-0.014
 realise-0.014
Token back
Token ID736
Feature activation+7.66e-01

Pos logit contributions
 realize+0.055
pieces+0.052
 prod+0.051
aults+0.050
 realise+0.050
Neg logit contributions
 refrigerator-0.038
 grandma-0.037
omm-0.036
 truth-0.036
 novels-0.036
Token to
Token ID284
Feature activation+7.00e-02

Pos logit contributions
bs+0.036
 realize+0.035
 notice+0.035
pieces+0.033
 unst+0.033
Neg logit contributions
omm-0.043
 truth-0.039
 de-0.039
man-0.038
 refrigerator-0.037
Token the
Token ID262
Feature activation-3.13e-01

Pos logit contributions
 realize+0.004
 prod+0.003
 ease+0.003
 realise+0.003
 notice+0.003
Neg logit contributions
keeper-0.004
man-0.004
 manager-0.004
omm-0.004
ber-0.004
Token his
Token ID465
Feature activation+1.41e-01

Pos logit contributions
 realize+0.015
 prod+0.015
 realise+0.014
 forth+0.014
 ease+0.014
Neg logit contributions
 refrigerator-0.012
 restaurant-0.012
 Cream-0.012
 manager-0.011
 truth-0.011
Token mom
Token ID1995
Feature activation-1.03e+00

Pos logit contributions
 realize+0.010
 prod+0.010
pieces+0.010
 sudden+0.010
tip+0.009
Neg logit contributions
 restaurant-0.005
 refrigerator-0.005
 about-0.005
 manager-0.005
 Cream-0.005
Token argued
Token ID7189
Feature activation+2.23e+00

Pos logit contributions
 grandma+0.051
 restaurant+0.050
 manager+0.050
 refrigerator+0.049
 old+0.049
Neg logit contributions
 realize-0.060
 prod-0.059
pieces-0.058
aults-0.057
tip-0.055
Token back
Token ID736
Feature activation+9.08e-01

Pos logit contributions
 realize+0.147
 prod+0.138
 realise+0.137
ging+0.135
aults+0.135
Neg logit contributions
 restaurant-0.093
 refrigerator-0.091
 gorilla-0.087
 hotel-0.087
 grandma-0.086
Token and
Token ID290
Feature activation+2.52e-01

Pos logit contributions
 realize+0.060
 prod+0.057
 realise+0.056
pieces+0.056
aults+0.055
Neg logit contributions
 manager-0.036
 refrigerator-0.036
 waiter-0.036
omm-0.035
 restaurant-0.035
Token forth
Token ID6071
Feature activation+8.05e+00

Pos logit contributions
 realize+0.011
 prod+0.011
pieces+0.011
aults+0.010
tip+0.010
Neg logit contributions
 grandma-0.016
 restaurant-0.016
 refrigerator-0.016
 old-0.016
 manager-0.016
Token until
Token ID1566
Feature activation+2.61e+00

Pos logit contributions
 realize+0.543
 prod+0.533
aults+0.523
pieces+0.520
 ease+0.506
Neg logit contributions
 restaurant-0.313
 refrigerator-0.312
 grandma-0.309
 chamber-0.299
 manager-0.296
Token John
Token ID1757
Feature activation-6.80e-01

Pos logit contributions
 realize+0.149
 prod+0.145
pieces+0.143
 ease+0.142
aults+0.140
Neg logit contributions
 refrigerator-0.125
 old-0.125
 chamber-0.124
 manager-0.124
 restaurant-0.123
Token's
Token ID338
Feature activation+1.02e+00

Pos logit contributions
 refrigerator+0.036
 chamber+0.035
 grandma+0.034
 restaurant+0.034
 novels+0.034
Neg logit contributions
 realize-0.039
aults-0.037
 prod-0.036
 realise-0.035
pieces-0.035
Token mom
Token ID1995
Feature activation-1.09e+00

Pos logit contributions
 realize+0.071
 prod+0.069
pieces+0.069
aults+0.068
tip+0.067
Neg logit contributions
 manager-0.035
 restaurant-0.035
 Cream-0.035
 refrigerator-0.035
 chamber-0.034
Token finally
Token ID3443
Feature activation+1.50e-01

Pos logit contributions
 chamber+0.055
 refrigerator+0.054
 restaurant+0.054
 novels+0.053
 grandma+0.052
Neg logit contributions
 realize-0.062
 prod-0.060
pieces-0.059
aults-0.058
 realise-0.057
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
 restaurant+0.023
 waitress+0.022
 gorilla+0.022
 waiter+0.022
 Cream+0.021
Neg logit contributions
 realize-0.032
 prod-0.031
 forth-0.031
 realise-0.031
pieces-0.030
Token He
Token ID679
Feature activation-2.33e-01

Pos logit contributions
 prod+0.014
 realize+0.014
 realise+0.013
 ease+0.013
 glide+0.013
Neg logit contributions
 manager-0.009
 grandma-0.009
 restaurant-0.009
 chamber-0.009
 novel-0.009
Token rocked
Token ID36872
Feature activation+5.58e-01

Pos logit contributions
 refrigerator+0.013
 manager+0.013
 restaurant+0.013
 Cream+0.013
 grandma+0.013
Neg logit contributions
 prod-0.011
 realize-0.011
 realise-0.010
 forth-0.010
 sudden-0.010
Token back
Token ID736
Feature activation+9.82e-01

Pos logit contributions
 realize+0.036
 prod+0.035
aults+0.034
pieces+0.034
tip+0.033
Neg logit contributions
 restaurant-0.024
 grandma-0.024
 refrigerator-0.024
 manager-0.024
 shop-0.023
Token and
Token ID290
Feature activation+1.64e-01

Pos logit contributions
 realize+0.066
 prod+0.065
 forth+0.064
aults+0.063
 realise+0.062
Neg logit contributions
 manager-0.037
 restaurant-0.036
 truth-0.035
 Cream-0.035
 waiter-0.035
Token forth
Token ID6071
Feature activation+7.30e+00

Pos logit contributions
 realize+0.002
pieces+0.002
 prod+0.002
aults+0.002
tip+0.001
Neg logit contributions
 grandma-0.016
 old-0.015
 shop-0.015
 restaurant-0.015
 manager-0.015
Token,
Token ID11
Feature activation+2.32e-01

Pos logit contributions
 realize+0.517
 prod+0.510
aults+0.495
pieces+0.491
tip+0.482
Neg logit contributions
 grandma-0.263
 restaurant-0.262
 refrigerator-0.260
 manager-0.253
 chamber-0.250
Token pushing
Token ID7796
Feature activation+2.00e+00

Pos logit contributions
 realize+0.014
 prod+0.014
aults+0.013
pieces+0.013
 realise+0.013
Neg logit contributions
 grandma-0.011
 refrigerator-0.011
 manager-0.011
 restaurant-0.011
 chamber-0.010
Token and
Token ID290
Feature activation+1.75e-01

Pos logit contributions
 realize+0.145
pieces+0.144
 prod+0.144
aults+0.139
tip+0.138
Neg logit contributions
 grandma-0.070
 old-0.067
 shop-0.064
 restaurant-0.064
 refrigerator-0.063
Token pulling
Token ID10427
Feature activation+2.18e+00

Pos logit contributions
pieces+0.007
 realize+0.007
aults+0.007
 unnecessary+0.007
 prod+0.007
Neg logit contributions
 shop-0.012
 grandma-0.012
 old-0.012
 manager-0.011
 restaurant-0.011
Token as
Token ID355
Feature activation+9.57e-01

Pos logit contributions
 realize+0.153
pieces+0.152
 prod+0.151
tip+0.146
aults+0.146
Neg logit contributions
 grandma-0.080
 old-0.079
 shop-0.076
 refrigerator-0.075
 manager-0.074
Token and
Token ID290
Feature activation+5.00e-02

Pos logit contributions
 realize+0.160
 prod+0.151
aults+0.151
 realise+0.148
 tumble+0.146
Neg logit contributions
 grandma-0.086
 refrigerator-0.081
 manager-0.081
 Cream-0.081
 restaurant-0.080
Token landed
Token ID11406
Feature activation+1.35e+00

Pos logit contributions
 realize+0.002
 prod+0.002
pieces+0.002
aults+0.002
tip+0.002
Neg logit contributions
 grandma-0.003
 restaurant-0.003
 refrigerator-0.003
 manager-0.003
 old-0.003
Token on
Token ID319
Feature activation+9.07e-01

Pos logit contributions
 realize+0.071
 prod+0.070
pieces+0.067
tip+0.067
ging+0.067
Neg logit contributions
 grandma-0.074
 restaurant-0.072
 manager-0.070
 refrigerator-0.070
 chamber-0.070
Token Spot
Token ID15899
Feature activation-1.36e+00

Pos logit contributions
 ease+0.052
 prod+0.051
 realize+0.050
 tumble+0.049
Kit+0.048
Neg logit contributions
ber-0.045
 manager-0.044
 restaurant-0.042
 chamber-0.042
keeper-0.041
Token's
Token ID338
Feature activation+8.42e-01

Pos logit contributions
 grandma+0.050
 shop+0.050
 refrigerator+0.048
 old+0.048
 restaurant+0.048
Neg logit contributions
 realize-0.095
 prod-0.094
pieces-0.093
tip-0.090
vals-0.089
Token itch
Token ID49700
Feature activation+5.23e+00

Pos logit contributions
 realize+0.053
 prod+0.052
pieces+0.052
aults+0.051
tip+0.050
Neg logit contributions
 grandma-0.038
 old-0.036
 refrigerator-0.036
 restaurant-0.036
 manager-0.036
Token.
Token ID13
Feature activation-9.65e-02

Pos logit contributions
 realize+0.344
 prod+0.339
aults+0.339
 tumble+0.335
pieces+0.332
Neg logit contributions
 grandma-0.208
 restaurant-0.200
 manager-0.194
 refrigerator-0.185
keeper-0.183
Token Suddenly
Token ID24975
Feature activation+1.15e+00

Pos logit contributions
 manager+0.004
 restaurant+0.004
man+0.004
 chamber+0.004
 hotel+0.004
Neg logit contributions
 realize-0.006
 prod-0.006
 tumble-0.006
apy-0.006
 glide-0.005
Token,
Token ID11
Feature activation+3.17e-01

Pos logit contributions
 realize+0.074
 prod+0.072
 tumble+0.069
aults+0.069
bs+0.068
Neg logit contributions
 manager-0.048
 restaurant-0.047
 refrigerator-0.047
 grandma-0.046
 chamber-0.045
Token the
Token ID262
Feature activation+2.76e-01

Pos logit contributions
 realize+0.020
 prod+0.019
 tumble+0.019
 ease+0.019
aults+0.018
Neg logit contributions
 manager-0.014
 refrigerator-0.013
 restaurant-0.013
 grandma-0.013
man-0.013
Token itch
Token ID49700
Feature activation+5.23e+00

Pos logit contributions
pieces+0.017
aults+0.017
 realize+0.016
 prod+0.016
cking+0.016
Neg logit contributions
 old-0.014
 refrigerator-0.013
 shop-0.013
 grandma-0.013
 restaurant-0.013
Token on
Token ID319
Feature activation+8.56e-01

Pos logit contributions
 restaurant+0.016
 manager+0.016
 grandma+0.016
 refrigerator+0.016
keeper+0.016
Neg logit contributions
 realize-0.015
aults-0.014
 prod-0.014
pieces-0.014
tip-0.014
Token the
Token ID262
Feature activation-1.80e-02

Pos logit contributions
aults+0.046
 realize+0.045
 ease+0.045
 notice+0.043
pieces+0.043
Neg logit contributions
 manager-0.041
ber-0.040
keeper-0.039
 restaurant-0.038
 waiter-0.038
Token wall
Token ID3355
Feature activation+1.11e+00

Pos logit contributions
 grandma+0.001
 restaurant+0.001
 manager+0.001
 refrigerator+0.001
 about+0.001
Neg logit contributions
 realize-0.001
 prod-0.001
aults-0.001
pieces-0.001
tip-0.001
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
pieces+0.070
 realize+0.070
aults+0.068
bs+0.068
 prod+0.066
Neg logit contributions
 restaurant-0.043
 grandma-0.042
 manager-0.042
 truth-0.041
 refrigerator-0.040
Token never
Token ID1239
Feature activation-1.36e+00

Pos logit contributions
 realize+0.005
 prod+0.005
pieces+0.005
aults+0.005
tip+0.005
Neg logit contributions
 grandma-0.007
 restaurant-0.007
 manager-0.007
 refrigerator-0.007
 old-0.007
Token forget
Token ID6044
Feature activation+6.14e+00

Pos logit contributions
 grandma+0.086
 old+0.084
 restaurant+0.084
 manager+0.084
 shop+0.083
Neg logit contributions
 realize-0.058
 prod-0.058
pieces-0.058
aults-0.056
tip-0.054
Token it
Token ID340
Feature activation-3.02e-01

Pos logit contributions
 realize+0.344
pieces+0.335
 unnecessary+0.321
aults+0.319
 realise+0.314
Neg logit contributions
 manager-0.285
 chamber-0.274
 restaurant-0.271
man-0.267
keeper-0.261
Token."
Token ID526
Feature activation-7.90e-01

Pos logit contributions
 grandma+0.012
 restaurant+0.012
 manager+0.012
 refrigerator+0.012
 chamber+0.012
Neg logit contributions
 realize-0.019
aults-0.018
pieces-0.018
 realise-0.018
 prod-0.018
Token
Token ID198
Feature activation+7.04e-01

Pos logit contributions
 restaurant+0.034
 old+0.034
 manager+0.034
 chamber+0.033
 hotel+0.032
Neg logit contributions
 realize-0.049
 prod-0.046
 realise-0.046
pieces-0.045
 ease-0.045
Token
Token ID198
Feature activation+7.03e-01

Pos logit contributions
 realize+0.043
Eventually+0.041
 prod+0.040
 realise+0.040
 ease+0.040
Neg logit contributions
 refrigerator-0.030
 restaurant-0.030
 chamber-0.030
 manager-0.030
 old-0.029
TokenJohn
Token ID7554
Feature activation-7.19e-01

Pos logit contributions
 realize+0.041
 prod+0.041
pieces+0.040
aults+0.038
tip+0.038
Neg logit contributions
 grandma-0.035
 shop-0.034
 restaurant-0.034
 manager-0.034
 refrigerator-0.034
Token comes
Token ID2058
Feature activation+7.34e-01

Pos logit contributions
 grandma+0.023
 restaurant+0.022
 manager+0.022
 refrigerator+0.022
keeper+0.022
Neg logit contributions
 realize-0.030
 prod-0.029
pieces-0.028
aults-0.028
 realise-0.028
Token and
Token ID290
Feature activation-4.92e-01

Pos logit contributions
 realize+0.053
 prod+0.052
pieces+0.051
aults+0.051
tip+0.049
Neg logit contributions
 grandma-0.026
 restaurant-0.025
 refrigerator-0.025
 manager-0.025
 shop-0.024
Token sees
Token ID7224
Feature activation+9.90e-01

Pos logit contributions
 grandma+0.021
 restaurant+0.021
 manager+0.021
 refrigerator+0.020
 chamber+0.020
Neg logit contributions
 realize-0.031
 prod-0.031
pieces-0.030
aults-0.029
tip-0.029
Token the
Token ID262
Feature activation-3.25e-01

Pos logit contributions
 realize+0.056
pieces+0.054
 prod+0.052
bs+0.052
aults+0.052
Neg logit contributions
man-0.051
 manager-0.049
keeper-0.049
 chamber-0.048
 novel-0.048
Token z
Token ID1976
Feature activation+1.11e+00

Pos logit contributions
 restaurant+0.015
 grandma+0.015
 manager+0.015
 Cream+0.015
keeper+0.015
Neg logit contributions
 realize-0.019
 prod-0.018
aults-0.018
pieces-0.018
tip-0.017
Tokenig
Token ID328
Feature activation+5.06e+00

Pos logit contributions
 realize+0.048
 prod+0.047
pieces+0.046
aults+0.044
tip+0.043
Neg logit contributions
 grandma-0.071
 restaurant-0.070
 manager-0.070
 refrigerator-0.069
 old-0.069
Tokenzag
Token ID50183
Feature activation+3.02e+00

Pos logit contributions
 realize+0.369
 prod+0.365
pieces+0.359
aults+0.353
tip+0.349
Neg logit contributions
 grandma-0.173
 manager-0.166
 old-0.165
 restaurant-0.164
 refrigerator-0.164
Token.
Token ID13
Feature activation-6.32e-01

Pos logit contributions
 realize+0.221
 prod+0.219
pieces+0.214
tip+0.211
aults+0.210
Neg logit contributions
 grandma-0.100
 old-0.099
 chamber-0.097
 refrigerator-0.096
 shop-0.096
Token She
Token ID1375
Feature activation+1.79e-02

Pos logit contributions
man+0.031
 manager+0.031
 restaurant+0.031
 novel+0.030
keeper+0.029
Neg logit contributions
 realize-0.036
 prod-0.034
bs-0.033
 realise-0.032
 notice-0.032
Token gets
Token ID3011
Feature activation+7.25e-01

Pos logit contributions
 realize+0.001
 prod+0.001
pieces+0.001
 swat+0.001
cking+0.001
Neg logit contributions
 manager-0.001
 restaurant-0.001
 grandma-0.001
 refrigerator-0.001
man-0.001
Token a
Token ID257
Feature activation+3.56e-02

Pos logit contributions
 realize+0.040
 prod+0.039
pieces+0.038
aults+0.038
tip+0.037
Neg logit contributions
 grandma-0.037
 restaurant-0.037
 refrigerator-0.036
 manager-0.036
 chamber-0.036
Token the
Token ID262
Feature activation-2.02e-01

Pos logit contributions
 realize+0.117
pieces+0.116
 unnecessary+0.114
 prod+0.114
cking+0.113
Neg logit contributions
 manager-0.112
 restaurant-0.102
omm-0.101
keeper-0.101
man-0.100
Token lions
Token ID31420
Feature activation-1.74e+00

Pos logit contributions
 manager+0.009
 grandma+0.008
 refrigerator+0.008
 restaurant+0.008
 about+0.008
Neg logit contributions
 realize-0.012
 prod-0.012
aults-0.012
pieces-0.012
tip-0.012
Token,
Token ID11
Feature activation-1.00e-01

Pos logit contributions
 grandma+0.065
 manager+0.064
 refrigerator+0.062
omm+0.061
keeper+0.061
Neg logit contributions
 realize-0.121
pieces-0.117
aults-0.117
 prod-0.114
hing-0.114
Token and
Token ID290
Feature activation-3.16e-01

Pos logit contributions
 manager+0.004
 refrigerator+0.004
 chamber+0.004
 grandma+0.004
 restaurant+0.004
Neg logit contributions
 realize-0.006
 prod-0.006
pieces-0.006
aults-0.006
 realise-0.006
Token the
Token ID262
Feature activation-1.94e-01

Pos logit contributions
 grandma+0.014
 manager+0.014
 refrigerator+0.014
 restaurant+0.014
 old+0.014
Neg logit contributions
 realize-0.019
 prod-0.019
pieces-0.019
aults-0.018
tip-0.018
Token monkeys
Token ID26697
Feature activation+1.52e+00

Pos logit contributions
 manager+0.008
 grandma+0.008
 about+0.008
omm+0.008
keeper+0.008
Neg logit contributions
 realize-0.012
 prod-0.012
pieces-0.011
aults-0.011
 glide-0.011
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
 realize+0.102
 prod+0.099
pieces+0.098
aults+0.097
 realise+0.094
Neg logit contributions
 grandma-0.062
 manager-0.060
 refrigerator-0.060
 restaurant-0.059
 old-0.058
Token and
Token ID290
Feature activation-3.17e-01

Pos logit contributions
 manager+0.004
 refrigerator+0.004
 chamber+0.004
 restaurant+0.004
 grandma+0.004
Neg logit contributions
 realize-0.007
 prod-0.007
pieces-0.007
aults-0.007
 realise-0.007
Token the
Token ID262
Feature activation-1.64e-01

Pos logit contributions
 manager+0.013
 grandma+0.013
 refrigerator+0.013
 restaurant+0.013
 chamber+0.013
Neg logit contributions
 realize-0.020
 prod-0.020
pieces-0.020
aults-0.019
tip-0.019
Token birds
Token ID10087
Feature activation+6.96e-01

Pos logit contributions
 grandma+0.007
 manager+0.007
 about+0.007
omm+0.007
 refrigerator+0.007
Neg logit contributions
 prod-0.010
 realize-0.010
pieces-0.010
aults-0.009
 glide-0.009
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
 realize+0.051
 prod+0.050
pieces+0.049
aults+0.049
tip+0.048
Neg logit contributions
 grandma-0.024
 manager-0.023
 refrigerator-0.023
 restaurant-0.023
 old-0.022
Tokenmy
Token ID1820
Feature activation+8.16e-01

Pos logit contributions
 manager+0.057
 chamber+0.055
 refrigerator+0.054
keeper+0.054
 restaurant+0.053
Neg logit contributions
 realize-0.071
pieces-0.071
aults-0.069
 forth-0.068
 prod-0.067
Token.
Token ID13
Feature activation+6.89e-02

Pos logit contributions
aults+0.049
pieces+0.048
 forth+0.047
 realize+0.047
ault+0.046
Neg logit contributions
 manager-0.033
 chamber-0.033
 gorilla-0.032
 truth-0.032
 hotel-0.032
Token She
Token ID1375
Feature activation+6.49e-01

Pos logit contributions
 realize+0.005
 prod+0.004
pieces+0.004
 realise+0.004
aults+0.004
Neg logit contributions
 manager-0.003
keeper-0.003
 restaurant-0.003
 novel-0.003
ber-0.002
Token saw
Token ID2497
Feature activation+4.25e-01

Pos logit contributions
 sudden+0.028
 swung+0.027
 realize+0.025
 prod+0.025
bs+0.024
Neg logit contributions
 truth-0.041
 novels-0.040
 chamber-0.039
 manager-0.039
 novel-0.038
Token a
Token ID257
Feature activation+3.91e-01

Pos logit contributions
 realize+0.023
 ease+0.023
pieces+0.022
 sudden+0.022
 prod+0.022
Neg logit contributions
 manager-0.020
 chamber-0.020
ber-0.020
keeper-0.020
man-0.019
Token toy
Token ID13373
Feature activation+2.32e+00

Pos logit contributions
 realize+0.023
 prod+0.023
aults+0.022
 sudden+0.022
pieces+0.022
Neg logit contributions
 manager-0.017
 grandma-0.017
 about-0.017
keeper-0.016
 restaurant-0.016
Token horse
Token ID8223
Feature activation-1.29e+00

Pos logit contributions
 realize+0.143
 prod+0.142
pieces+0.140
 forth+0.138
 tumble+0.136
Neg logit contributions
 restaurant-0.081
 manager-0.077
 Sleep-0.077
 grandma-0.076
 God-0.076
Token and
Token ID290
Feature activation+3.46e-02

Pos logit contributions
 manager+0.049
 refrigerator+0.048
 grandma+0.048
 restaurant+0.048
 old+0.046
Neg logit contributions
aults-0.084
 realize-0.084
pieces-0.083
 prod-0.082
 forth-0.080
Token asked
Token ID1965
Feature activation-5.65e-01

Pos logit contributions
 realize+0.002
 prod+0.002
pieces+0.002
aults+0.002
tip+0.002
Neg logit contributions
 manager-0.001
 restaurant-0.001
 refrigerator-0.001
 chamber-0.001
 grandma-0.001
Token her
Token ID607
Feature activation-6.22e-02

Pos logit contributions
 lair+0.038
 manager+0.037
keeper+0.037
 restaurant+0.037
 hotel+0.036
Neg logit contributions
 realize-0.021
bs-0.020
isted-0.018
 ease-0.018
pieces-0.018
Token mom
Token ID1995
Feature activation-1.24e+00

Pos logit contributions
 restaurant+0.002
 de+0.002
 Cream+0.002
 manager+0.002
man+0.002
Neg logit contributions
 realize-0.005
pieces-0.004
 prod-0.004
 realise-0.004
bs-0.004
Token too
Token ID1165
Feature activation-1.70e+00

Pos logit contributions
 manager+0.043
 restaurant+0.043
 Cream+0.042
 shop+0.042
keeper+0.042
Neg logit contributions
 realize-0.049
 prod-0.047
 ease-0.046
 realise-0.045
pieces-0.045
Token damaged
Token ID9694
Feature activation+8.67e-01

Pos logit contributions
 manager+0.095
 chamber+0.091
 grandma+0.091
 shop+0.090
 restaurant+0.090
Neg logit contributions
 realize-0.088
 prod-0.081
bs-0.081
aults-0.081
 realise-0.079
Token and
Token ID290
Feature activation-7.52e-02

Pos logit contributions
 realize+0.049
 forth+0.049
 ease+0.048
ging+0.047
 notice+0.046
Neg logit contributions
 restaurant-0.040
 manager-0.039
 waiter-0.039
 waitress-0.038
 gorilla-0.037
Token had
Token ID550
Feature activation+5.25e-01

Pos logit contributions
 restaurant+0.004
 refrigerator+0.004
 chamber+0.004
 manager+0.004
 Shop+0.004
Neg logit contributions
 realize-0.004
 prod-0.004
pieces-0.004
 forth-0.004
bs-0.004
Token to
Token ID284
Feature activation+1.26e-01

Pos logit contributions
 realize+0.034
 prod+0.034
 ease+0.033
tip+0.032
 realise+0.032
Neg logit contributions
 restaurant-0.022
 manager-0.021
 chamber-0.021
 refrigerator-0.021
man-0.020
Token be
Token ID307
Feature activation+2.94e+00

Pos logit contributions
 realize+0.006
 prod+0.005
pieces+0.005
aults+0.005
tip+0.005
Neg logit contributions
 grandma-0.008
 restaurant-0.008
 manager-0.008
 refrigerator-0.008
 old-0.008
Token removed
Token ID4615
Feature activation+4.99e-01

Pos logit contributions
 realize+0.158
 prod+0.154
pieces+0.151
aults+0.147
tip+0.146
Neg logit contributions
 grandma-0.157
 restaurant-0.155
 refrigerator-0.154
 manager-0.154
 shop-0.151
Token.
Token ID13
Feature activation-9.80e-02

Pos logit contributions
 realize+0.029
 ease+0.028
 unnecessary+0.028
pieces+0.028
aults+0.027
Neg logit contributions
 restaurant-0.023
 manager-0.023
man-0.023
 chamber-0.022
 refrigerator-0.022
Token
Token ID220
Feature activation+4.98e-01

Pos logit contributions
 restaurant+0.005
 chamber+0.005
 novel+0.005
 refrigerator+0.005
man+0.005
Neg logit contributions
 realize-0.006
pieces-0.005
 prod-0.005
 realise-0.005
bs-0.005
Token
Token ID198
Feature activation+6.95e-01

Pos logit contributions
 realize+0.032
pieces+0.031
 prod+0.031
 ease+0.030
bs+0.030
Neg logit contributions
 restaurant-0.021
 refrigerator-0.020
 manager-0.020
 chamber-0.020
 shop-0.019
Token
Token ID198
Feature activation+6.91e-01

Pos logit contributions
 realize+0.050
 prod+0.049
pieces+0.049
aults+0.048
tip+0.047
Neg logit contributions
 grandma-0.024
 refrigerator-0.023
 restaurant-0.023
 manager-0.023
 old-0.023
Token.
Token ID13
Feature activation-3.94e-01

Pos logit contributions
 realize+0.075
pieces+0.072
aults+0.071
 notice+0.070
 forth+0.070
Neg logit contributions
keeper-0.051
omm-0.050
 Cream-0.049
 Shop-0.048
ber-0.048
Token They
Token ID1119
Feature activation+1.26e-01

Pos logit contributions
man+0.020
 novel+0.020
 manager+0.019
 restaurant+0.019
 chamber+0.019
Neg logit contributions
 realize-0.023
 realise-0.021
 prod-0.020
 notice-0.020
pieces-0.020
Token said
Token ID531
Feature activation+2.09e-01

Pos logit contributions
 realize+0.008
 prod+0.007
aults+0.007
pieces+0.007
 realise+0.007
Neg logit contributions
 grandma-0.006
 restaurant-0.006
 refrigerator-0.006
 manager-0.006
 chamber-0.006
Token,
Token ID11
Feature activation+2.62e-02

Pos logit contributions
 realize+0.013
apy+0.013
bs+0.012
 prod+0.012
 realise+0.012
Neg logit contributions
 restaurant-0.009
 chamber-0.009
 refrigerator-0.009
 shop-0.009
man-0.009
Token "
Token ID366
Feature activation-8.06e-01

Pos logit contributions
 prod+0.002
 realize+0.002
pieces+0.002
tip+0.002
bs+0.002
Neg logit contributions
 restaurant-0.001
 chamber-0.001
 grandma-0.001
 manager-0.001
 refrigerator-0.001
TokenNo
Token ID2949
Feature activation+1.57e+00

Pos logit contributions
 grandma+0.039
 shop+0.039
 Cream+0.038
 restaurant+0.035
 Grand+0.035
Neg logit contributions
pieces-0.050
 prod-0.049
 realize-0.049
 ease-0.048
hing-0.045
Token,
Token ID11
Feature activation+4.73e-02

Pos logit contributions
 realize+0.116
 prod+0.113
pieces+0.112
aults+0.110
 realise+0.108
Neg logit contributions
 grandma-0.052
 manager-0.052
 restaurant-0.051
 refrigerator-0.051
keeper-0.050
Token Mom
Token ID11254
Feature activation-2.93e-01

Pos logit contributions
 realize+0.004
 prod+0.003
pieces+0.003
aults+0.003
 tumble+0.003
Neg logit contributions
 manager-0.002
 restaurant-0.002
 chamber-0.001
 refrigerator-0.001
keeper-0.001
Token,
Token ID11
Feature activation+2.06e-02

Pos logit contributions
 chamber+0.014
 restaurant+0.014
 manager+0.014
 novel+0.014
 novels+0.014
Neg logit contributions
 realize-0.016
 forth-0.016
bs-0.015
aults-0.015
 prod-0.015
Token we
Token ID356
Feature activation+1.25e+00

Pos logit contributions
 realize+0.001
 prod+0.001
pieces+0.001
aults+0.001
tip+0.001
Neg logit contributions
 grandma-0.001
 manager-0.001
 restaurant-0.001
 refrigerator-0.001
 old-0.001
Token want
Token ID765
Feature activation+1.35e+00

Pos logit contributions
 prod+0.070
pieces+0.069
tip+0.066
 realize+0.065
cking+0.064
Neg logit contributions
 grandma-0.064
 is-0.062
 restaurant-0.062
 shop-0.062
 refrigerator-0.061
Token p
Token ID279
Feature activation+2.12e+00

Pos logit contributions
 grandma+0.063
 refrigerator+0.062
 restaurant+0.061
 novels+0.060
 chamber+0.060
Neg logit contributions
 realize-0.058
 prod-0.057
 swung-0.053
aults-0.053
 realise-0.052
Tokenouted
Token ID18534
Feature activation-1.37e+00

Pos logit contributions
 prod+0.133
pieces+0.131
 realize+0.131
aults+0.127
cking+0.126
Neg logit contributions
 grandma-0.092
 restaurant-0.092
 refrigerator-0.091
 old-0.090
 chamber-0.090
Token and
Token ID290
Feature activation-2.44e-01

Pos logit contributions
 grandma+0.037
 manager+0.036
 restaurant+0.036
 refrigerator+0.035
 shop+0.034
Neg logit contributions
 realize-0.111
 prod-0.108
pieces-0.106
aults-0.105
 realise-0.104
Token crossed
Token ID12606
Feature activation+1.49e-02

Pos logit contributions
 refrigerator+0.014
 restaurant+0.014
 grandma+0.014
 manager+0.014
 chamber+0.014
Neg logit contributions
 realize-0.012
 prod-0.012
aults-0.011
 realise-0.011
pieces-0.011
Token her
Token ID607
Feature activation-1.11e-01

Pos logit contributions
 realize+0.001
 prod+0.001
 ease+0.001
aults+0.001
 realise+0.001
Neg logit contributions
 refrigerator-0.001
 Cream-0.001
 restaurant-0.001
 grandma-0.001
keeper-0.001
Token arms
Token ID5101
Feature activation+2.40e+00

Pos logit contributions
 Cream+0.006
 restaurant+0.006
 manager+0.006
 refrigerator+0.006
 grandma+0.006
Neg logit contributions
 realize-0.006
 prod-0.006
 tumble-0.005
aults-0.005
pieces-0.005
Token.
Token ID13
Feature activation-5.30e-01

Pos logit contributions
 realize+0.192
 prod+0.190
pieces+0.188
aults+0.186
tip+0.183
Neg logit contributions
 grandma-0.064
 restaurant-0.060
 manager-0.060
 refrigerator-0.060
 old-0.060
Token She
Token ID1375
Feature activation+1.87e-01

Pos logit contributions
 restaurant+0.024
 manager+0.024
man+0.024
 novel+0.023
keeper+0.023
Neg logit contributions
 realize-0.033
 prod-0.031
bs-0.030
 realise-0.030
pieces-0.030
Token did
Token ID750
Feature activation-2.00e+00

Pos logit contributions
 realize+0.010
 prod+0.009
 realise+0.009
 swung+0.009
 sudden+0.008
Neg logit contributions
 refrigerator-0.011
 restaurant-0.011
eddy-0.011
 manager-0.011
 novels-0.011
Token not
Token ID407
Feature activation-1.22e+00

Pos logit contributions
 refrigerator+0.069
 grandma+0.069
 restaurant+0.068
 manager+0.067
 shop+0.067
Neg logit contributions
 realize-0.147
 prod-0.142
pieces-0.139
aults-0.139
 realise-0.137
Token like
Token ID588
Feature activation+1.90e-01

Pos logit contributions
 old+0.074
 grandma+0.073
 Anna+0.072
 about+0.072
 restaurant+0.071
Neg logit contributions
pieces-0.058
 sudden-0.056
vals-0.056
 prod-0.055
Eventually-0.054
Token They
Token ID1119
Feature activation+1.14e-01

Pos logit contributions
 restaurant+0.014
 manager+0.013
 refrigerator+0.013
 novel+0.013
 old+0.013
Neg logit contributions
 realize-0.018
pieces-0.017
 prod-0.017
tip-0.017
vals-0.016
Token are
Token ID389
Feature activation-1.18e-01

Pos logit contributions
 realize+0.006
 prod+0.005
pieces+0.005
aults+0.005
tip+0.005
Neg logit contributions
 grandma-0.007
 restaurant-0.007
 manager-0.006
 refrigerator-0.006
 old-0.006
Token waiting
Token ID4953
Feature activation-2.22e+00

Pos logit contributions
 restaurant+0.005
 manager+0.005
 refrigerator+0.005
keeper+0.005
 shop+0.005
Neg logit contributions
 realize-0.007
 prod-0.007
pieces-0.007
cking-0.007
 realise-0.007
Token for
Token ID329
Feature activation-7.90e-01

Pos logit contributions
omm+0.091
 manager+0.088
 refrigerator+0.088
 restaurant+0.088
 chamber+0.087
Neg logit contributions
 realize-0.142
pieces-0.134
bs-0.133
aults-0.130
 unnecessary-0.130
Token the
Token ID262
Feature activation-9.24e-02

Pos logit contributions
 manager+0.041
keeper+0.041
man+0.040
 refrigerator+0.038
ber+0.038
Neg logit contributions
 realize-0.042
 ease-0.041
 unnecessary-0.041
pieces-0.040
 notice-0.040
Token letters
Token ID7475
Feature activation-2.88e-01

Pos logit contributions
 restaurant+0.004
 manager+0.004
ber+0.004
 Anna+0.004
keeper+0.004
Neg logit contributions
 realize-0.005
 prod-0.005
tip-0.005
pieces-0.005
 unnecessary-0.005
Token.
Token ID13
Feature activation-3.22e-01

Pos logit contributions
 restaurant+0.011
 refrigerator+0.011
omm+0.010
 Cream+0.010
 manager+0.010
Neg logit contributions
 realize-0.020
pieces-0.019
aults-0.019
bs-0.018
 realise-0.018
Token They
Token ID1119
Feature activation+8.33e-02

Pos logit contributions
 restaurant+0.015
 manager+0.015
 old+0.015
 refrigerator+0.015
 novel+0.014
Neg logit contributions
 realize-0.019
pieces-0.018
 prod-0.018
tip-0.017
 realise-0.017
Token will
Token ID481
Feature activation-2.80e+00

Pos logit contributions
 realize+0.004
 prod+0.004
pieces+0.004
aults+0.004
tip+0.004
Neg logit contributions
 grandma-0.005
 restaurant-0.005
 refrigerator-0.005
 manager-0.005
 old-0.005
Token be
Token ID307
Feature activation+2.93e+00

Pos logit contributions
 grandma+0.164
 restaurant+0.158
 manager+0.158
 old+0.158
 refrigerator+0.157
Neg logit contributions
 realize-0.136
 prod-0.134
pieces-0.132
aults-0.129
tip-0.126
Token sad
Token ID6507
Feature activation+1.29e-01

Pos logit contributions
 realize+0.166
 prod+0.162
pieces+0.161
aults+0.158
tip+0.154
Neg logit contributions
 grandma-0.146
 manager-0.144
 restaurant-0.144
 refrigerator-0.144
 shop-0.141
Token.
Token ID13
Feature activation-5.74e-01

Pos logit contributions
 realize+0.033
aults+0.032
bs+0.031
pieces+0.031
 realise+0.031
Neg logit contributions
 refrigerator-0.027
 dentist-0.026
 restaurant-0.026
 freezer-0.026
 shop-0.026
Token They
Token ID1119
Feature activation+3.90e-02

Pos logit contributions
 manager+0.026
 novel+0.026
 restaurant+0.025
 refrigerator+0.025
man+0.025
Neg logit contributions
 realize-0.035
 prod-0.034
pieces-0.033
bs-0.032
 realise-0.032
Token want
Token ID765
Feature activation+1.39e+00

Pos logit contributions
 prod+0.002
pieces+0.002
 realize+0.002
tip+0.002
 unnecessary+0.002
Neg logit contributions
 grandma-0.002
 shop-0.002
 old-0.002
 restaurant-0.002
 manager-0.002
Token to
Token ID284
Feature activation+8.53e-02

Pos logit contributions
 realize+0.089
 prod+0.084
 realise+0.083
pieces+0.082
 notice+0.081
Neg logit contributions
 refrigerator-0.061
 chamber-0.061
man-0.058
 Amen-0.056
omm-0.055
Token see
Token ID766
Feature activation+2.10e+00

Pos logit contributions
 unnecessary+0.004
ging+0.004
cking+0.004
pieces+0.004
 prod+0.004
Neg logit contributions
 grandma-0.005
 shop-0.005
 old-0.005
 Grand-0.005
 is-0.005
Token the
Token ID262
Feature activation-3.20e-01

Pos logit contributions
 realize+0.117
pieces+0.117
 prod+0.107
 ease+0.105
aults+0.104
Neg logit contributions
omm-0.109
 manager-0.108
 refrigerator-0.107
 chamber-0.107
man-0.104
Token show
Token ID905
Feature activation-4.48e-01

Pos logit contributions
 refrigerator+0.013
 about+0.012
 novels+0.012
 chamber+0.012
 grandma+0.012
Neg logit contributions
 realize-0.020
 prod-0.020
aults-0.019
 glide-0.019
 sudden-0.019
Token.
Token ID13
Feature activation-5.66e-01

Pos logit contributions
 refrigerator+0.018
 restaurant+0.017
 chamber+0.017
 Cream+0.017
 old+0.017
Neg logit contributions
pieces-0.030
 realize-0.029
aults-0.028
 prod-0.028
 realise-0.027
Token They
Token ID1119
Feature activation+3.37e-02

Pos logit contributions
 novel+0.026
 manager+0.026
 restaurant+0.026
 refrigerator+0.026
man+0.025
Neg logit contributions
 realize-0.033
 prod-0.032
bs-0.032
pieces-0.032
 notice-0.031
Token think
Token ID892
Feature activation+6.24e-01

Pos logit contributions
pieces+0.001
 prod+0.001
tip+0.001
 unnecessary+0.001
cking+0.001
Neg logit contributions
 grandma-0.002
 shop-0.002
 old-0.002
 restaurant-0.002
 is-0.002
Token it
Token ID340
Feature activation-3.58e-01

Pos logit contributions
 realize+0.035
pieces+0.035
bs+0.034
 prod+0.033
 realise+0.032
Neg logit contributions
 refrigerator-0.033
man-0.032
 chamber-0.032
 restaurant-0.031
keeper-0.031
Token her
Token ID607
Feature activation+5.87e-03

Pos logit contributions
keeper+0.045
 manager+0.045
ber+0.045
 Cream+0.044
man+0.044
Neg logit contributions
 realize-0.036
 ease-0.032
pieces-0.032
 realise-0.032
 prod-0.031
Token to
Token ID284
Feature activation+1.49e-01

Pos logit contributions
 realize+0.000
 realise+0.000
pieces+0.000
bs+0.000
aults+0.000
Neg logit contributions
 Cream-0.000
 restaurant-0.000
 shop-0.000
 refrigerator-0.000
 manager-0.000
Token protect
Token ID1805
Feature activation+5.56e-01

Pos logit contributions
 realize+0.007
 prod+0.007
pieces+0.007
aults+0.007
tip+0.007
Neg logit contributions
 grandma-0.009
 restaurant-0.009
 manager-0.009
 refrigerator-0.009
 old-0.009
Token herself
Token ID5223
Feature activation+2.85e-01

Pos logit contributions
 realize+0.029
pieces+0.028
 prod+0.028
 ease+0.027
aults+0.026
Neg logit contributions
 restaurant-0.030
 manager-0.030
 refrigerator-0.029
keeper-0.029
 chamber-0.029
Token.
Token ID13
Feature activation+7.02e-02

Pos logit contributions
 realize+0.019
aults+0.019
pieces+0.018
 realise+0.018
ault+0.018
Neg logit contributions
 Cream-0.011
 grandma-0.011
 manager-0.011
 waiter-0.010
 restaurant-0.010
Token So
Token ID1406
Feature activation-4.27e-01

Pos logit contributions
 realize+0.004
 prod+0.004
pieces+0.003
bs+0.003
 realise+0.003
Neg logit contributions
 restaurant-0.004
 novel-0.004
man-0.004
 manager-0.004
 hotel-0.003
Token,
Token ID11
Feature activation+1.24e-01

Pos logit contributions
 grandma+0.014
 restaurant+0.013
 manager+0.013
 refrigerator+0.013
 old+0.013
Neg logit contributions
 realize-0.032
 prod-0.032
pieces-0.031
aults-0.031
tip-0.030
Token she
Token ID673
Feature activation+5.37e-01

Pos logit contributions
 realize+0.008
 prod+0.007
pieces+0.007
aults+0.007
tip+0.007
Neg logit contributions
 grandma-0.006
 restaurant-0.006
 manager-0.006
 old-0.006
 refrigerator-0.006
Token grabbed
Token ID13176
Feature activation+1.78e+00

Pos logit contributions
 realize+0.026
 prod+0.026
aults+0.024
pieces+0.024
 realise+0.024
Neg logit contributions
 grandma-0.031
 refrigerator-0.031
 restaurant-0.031
 chamber-0.030
 manager-0.030
Token the
Token ID262
Feature activation+8.24e-02

Pos logit contributions
 realize+0.126
 prod+0.124
pieces+0.122
aults+0.121
tip+0.118
Neg logit contributions
 grandma-0.066
 old-0.063
 restaurant-0.063
 manager-0.063
 refrigerator-0.063
Token box
Token ID3091
Feature activation-3.08e+00

Pos logit contributions
 realize+0.006
 prod+0.006
pieces+0.006
tip+0.006
aults+0.006
Neg logit contributions
 grandma-0.002
 restaurant-0.002
keeper-0.002
 manager-0.002
 Cream-0.002
Token to
Token ID284
Feature activation+2.00e-03

Pos logit contributions
 Hero+0.050
 gorilla+0.049
omm+0.049
 restaurant+0.049
 Rudolph+0.049
Neg logit contributions
bs-0.072
pieces-0.072
wheel-0.072
 realize-0.071
vals-0.068
Token do
Token ID466
Feature activation-1.68e+00

Pos logit contributions
 prod+0.000
 realize+0.000
pieces+0.000
aults+0.000
tip+0.000
Neg logit contributions
 grandma-0.000
 old-0.000
 shop-0.000
 restaurant-0.000
 refrigerator-0.000
Token some
Token ID617
Feature activation-2.45e+00

Pos logit contributions
 refrigerator+0.063
keeper+0.063
 grandma+0.063
 manager+0.061
man+0.061
Neg logit contributions
 realize-0.113
pieces-0.113
 prod-0.110
aults-0.107
 realise-0.106
Token work
Token ID670
Feature activation-3.42e-01

Pos logit contributions
 grandma+0.110
 manager+0.107
 restaurant+0.106
 refrigerator+0.106
 old+0.104
Neg logit contributions
 realize-0.151
 prod-0.149
pieces-0.147
aults-0.144
tip-0.141
Token.
Token ID13
Feature activation-2.21e-01

Pos logit contributions
 gorilla+0.015
omm+0.015
 refrigerator+0.014
 restaurant+0.014
 chamber+0.014
Neg logit contributions
pieces-0.020
aults-0.020
 realize-0.020
 forth-0.020
bs-0.019
Token Tom
Token ID4186
Feature activation-3.25e-01

Pos logit contributions
 manager+0.008
 chamber+0.008
 restaurant+0.008
 grandma+0.008
 refrigerator+0.008
Neg logit contributions
 realize-0.015
 prod-0.015
pieces-0.015
 realise-0.014
aults-0.014
Token was
Token ID373
Feature activation-9.14e-01

Pos logit contributions
 refrigerator+0.013
 manager+0.013
 restaurant+0.013
 chamber+0.013
 grandma+0.013
Neg logit contributions
 realize-0.021
 prod-0.020
 realise-0.020
 sudden-0.019
tip-0.019
Token very
Token ID845
Feature activation+7.35e-01

Pos logit contributions
 grandma+0.040
 restaurant+0.039
 refrigerator+0.039
 manager+0.038
 chamber+0.038
Neg logit contributions
 realize-0.058
 prod-0.057
pieces-0.056
aults-0.055
tip-0.054
Token uncomfortable
Token ID12916
Feature activation+2.42e+00

Pos logit contributions
pieces+0.047
 prod+0.046
aults+0.046
 realize+0.045
vals+0.045
Neg logit contributions
 old-0.034
 grandma-0.033
 manager-0.032
 restaurant-0.031
 shop-0.031
Token in
Token ID287
Feature activation-8.05e-01

Pos logit contributions
 realize+0.129
ging+0.123
 forth+0.120
 slipping+0.119
 realise+0.118
Neg logit contributions
keeper-0.123
 waiter-0.121
 waitress-0.121
 Hero-0.118
 Amen-0.118
Token the
Token ID262
Feature activation-2.39e-02

Pos logit contributions
keeper+0.039
man+0.038
 manager+0.038
 refrigerator+0.036
 Cream+0.036
Neg logit contributions
 realize-0.048
 tumble-0.045
 realise-0.045
 prod-0.045
 ease-0.045
Token closer
Token ID5699
Feature activation+3.52e-01

Pos logit contributions
 realize+0.056
 prod+0.055
pieces+0.054
aults+0.054
tip+0.053
Neg logit contributions
 grandma-0.025
 restaurant-0.024
 manager-0.023
 refrigerator-0.023
 old-0.023
Token.
Token ID13
Feature activation-3.51e-01

Pos logit contributions
 realize+0.020
 realise+0.019
 notice+0.018
 prod+0.018
aults+0.018
Neg logit contributions
omm-0.017
 manager-0.017
 waitress-0.016
 Cream-0.016
 refrigerator-0.016
Token
Token ID198
Feature activation+2.93e-01

Pos logit contributions
man+0.019
 manager+0.019
 novel+0.019
 year+0.018
 refrigerator+0.018
Neg logit contributions
 realize-0.018
bs-0.017
 prod-0.016
 notice-0.016
apy-0.016
Token
Token ID198
Feature activation+2.84e-01

Pos logit contributions
 realize+0.020
 prod+0.019
pieces+0.019
aults+0.019
 realise+0.019
Neg logit contributions
 refrigerator-0.011
 manager-0.011
 restaurant-0.011
 grandma-0.011
 chamber-0.011
Token"
Token ID1
Feature activation+1.07e+00

Pos logit contributions
pieces+0.020
 prod+0.020
 realize+0.019
 ease+0.019
 unnecessary+0.019
Neg logit contributions
 shop-0.011
 is-0.011
 old-0.011
 Anna-0.011
 grandma-0.011
TokenHi
Token ID17250
Feature activation-1.10e+00

Pos logit contributions
 realize+0.059
pieces+0.057
 prod+0.057
aults+0.055
 realise+0.054
Neg logit contributions
 grandma-0.057
 shop-0.057
 restaurant-0.057
 manager-0.057
 Anna-0.055
Token,
Token ID11
Feature activation+2.99e-02

Pos logit contributions
 manager+0.056
keeper+0.053
 refrigerator+0.053
man+0.052
 novels+0.052
Neg logit contributions
aults-0.060
 realize-0.058
pieces-0.055
 prod-0.055
ault-0.054
Token Tom
Token ID4186
Feature activation-5.36e-01

Pos logit contributions
 realize+0.002
 prod+0.002
pieces+0.002
 tumble+0.002
 forget+0.002
Neg logit contributions
 novels-0.001
man-0.001
keeper-0.001
 manager-0.001
 refrigerator-0.001
Token and
Token ID290
Feature activation-2.63e-01

Pos logit contributions
 manager+0.032
 novels+0.030
 restaurant+0.030
omm+0.030
 hotel+0.029
Neg logit contributions
 realize-0.024
 forth-0.024
cking-0.022
aults-0.022
 prod-0.021
Token Anna
Token ID11735
Feature activation-2.39e+00

Pos logit contributions
 manager+0.011
 refrigerator+0.010
man+0.010
omm+0.010
keeper+0.010
Neg logit contributions
 realize-0.017
 prod-0.017
pieces-0.016
bs-0.016
tip-0.016
Token!"
Token ID2474
Feature activation+2.57e-03

Pos logit contributions
 manager+0.118
omm+0.113
 refrigerator+0.111
 restaurant+0.111
 grandma+0.110
Neg logit contributions
 realize-0.141
aults-0.128
 prod-0.126
 realise-0.125
hing-0.124
Token her
Token ID607
Feature activation-3.62e-02

Pos logit contributions
 grandma+0.023
 old+0.022
 refrigerator+0.022
 restaurant+0.022
 manager+0.022
Neg logit contributions
 realize-0.038
 prod-0.038
pieces-0.037
aults-0.037
tip-0.036
Token favourite
Token ID12507
Feature activation-9.04e-01

Pos logit contributions
 manager+0.002
 restaurant+0.002
 grandma+0.002
keeper+0.002
 shop+0.002
Neg logit contributions
 realize-0.002
 prod-0.002
pieces-0.002
aults-0.002
tip-0.002
Token things
Token ID1243
Feature activation+3.51e-01

Pos logit contributions
 manager+0.043
 grandma+0.043
keeper+0.042
 restaurant+0.041
 refrigerator+0.040
Neg logit contributions
 prod-0.052
 realize-0.052
pieces-0.051
tip-0.049
aults-0.049
Token around
Token ID1088
Feature activation-2.67e-01

Pos logit contributions
 realize+0.025
 prod+0.024
pieces+0.024
aults+0.023
tip+0.023
Neg logit contributions
 grandma-0.013
 restaurant-0.012
 manager-0.012
 refrigerator-0.012
 chamber-0.012
Token the
Token ID262
Feature activation+4.79e-03

Pos logit contributions
 manager+0.014
 restaurant+0.014
keeper+0.014
 grandma+0.014
 Cream+0.013
Neg logit contributions
 realize-0.014
 prod-0.014
pieces-0.014
 ease-0.014
 realise-0.013
Token camp
Token ID1413
Feature activation-3.47e+00

Pos logit contributions
 realize+0.000
 prod+0.000
aults+0.000
pieces+0.000
 tumble+0.000
Neg logit contributions
 manager-0.000
 grandma-0.000
keeper-0.000
 restaurant-0.000
 refrigerator-0.000
Token to
Token ID284
Feature activation+1.52e-01

Pos logit contributions
 grandma+0.170
omm+0.164
 novels+0.158
 novel+0.157
 truth+0.156
Neg logit contributions
aults-0.192
pieces-0.190
 realize-0.189
 prod-0.187
vals-0.185
Token make
Token ID787
Feature activation+1.26e+00

Pos logit contributions
aults+0.006
 prod+0.006
pieces+0.006
 realize+0.006
tip+0.006
Neg logit contributions
 grandma-0.010
 old-0.010
 restaurant-0.009
 shop-0.009
 refrigerator-0.009
Token it
Token ID340
Feature activation-4.09e-01

Pos logit contributions
 realize+0.081
pieces+0.080
 prod+0.079
aults+0.078
 realise+0.075
Neg logit contributions
 grandma-0.053
 manager-0.052
 refrigerator-0.052
 restaurant-0.052
 shop-0.051
Token cozy
Token ID37438
Feature activation-1.49e+00

Pos logit contributions
 old+0.020
 grandma+0.020
 restaurant+0.020
 refrigerator+0.019
 chamber+0.019
Neg logit contributions
 prod-0.023
 realize-0.023
pieces-0.023
aults-0.022
tip-0.022
Token,
Token ID11
Feature activation+9.67e-02

Pos logit contributions
omm+0.062
 grandma+0.062
 manager+0.061
 Cream+0.060
ber+0.058
Neg logit contributions
 realize-0.094
aults-0.092
 forth-0.088
 tumble-0.088
 prod-0.087
Token They
Token ID1119
Feature activation-1.28e-01

Pos logit contributions
 manager+0.030
keeper+0.028
 restaurant+0.028
 old+0.028
 grandma+0.028
Neg logit contributions
 realize-0.049
 prod-0.048
pieces-0.047
aults-0.045
 realise-0.045
Token make
Token ID787
Feature activation+1.20e+00

Pos logit contributions
 grandma+0.007
 old+0.007
 shop+0.007
 is+0.007
 restaurant+0.007
Neg logit contributions
pieces-0.007
 prod-0.006
 unnecessary-0.006
tip-0.006
 sudden-0.006
Token a
Token ID257
Feature activation+2.30e-02

Pos logit contributions
 realize+0.069
 prod+0.067
pieces+0.066
aults+0.065
tip+0.063
Neg logit contributions
 grandma-0.060
 manager-0.059
 refrigerator-0.059
 restaurant-0.059
 shop-0.058
Token big
Token ID1263
Feature activation-1.72e-01

Pos logit contributions
pieces+0.002
tip+0.002
 prod+0.002
 realize+0.002
hing+0.002
Neg logit contributions
 grandma-0.001
 old-0.001
 restaurant-0.001
 refrigerator-0.001
 chamber-0.001
Token snow
Token ID6729
Feature activation+2.03e+00

Pos logit contributions
 grandma+0.008
 restaurant+0.008
 old+0.008
 refrigerator+0.008
 shop+0.008
Neg logit contributions
 realize-0.010
 prod-0.010
pieces-0.010
aults-0.010
tip-0.010
Tokenman
Token ID805
Feature activation-3.45e+00

Pos logit contributions
 prod+0.173
 realize+0.171
 forth+0.171
aults+0.171
 glide+0.171
Neg logit contributions
 old-0.035
 manager-0.035
 grandma-0.035
 restaurant-0.033
 truth-0.033
Token with
Token ID351
Feature activation-1.37e+00

Pos logit contributions
 refrigerator+0.104
 grandma+0.104
 manager+0.103
 restaurant+0.101
 old+0.099
Neg logit contributions
 realize-0.262
pieces-0.257
 prod-0.257
aults-0.256
tip-0.246
Token a
Token ID257
Feature activation-4.02e-02

Pos logit contributions
 grandma+0.049
 manager+0.048
 restaurant+0.048
 refrigerator+0.047
 shop+0.047
Neg logit contributions
 realize-0.097
 prod-0.096
pieces-0.094
aults-0.093
tip-0.091
Token hat
Token ID6877
Feature activation-1.39e-01

Pos logit contributions
 grandma+0.002
 old+0.002
 refrigerator+0.002
 shop+0.002
 restaurant+0.002
Neg logit contributions
pieces-0.003
 realize-0.003
 prod-0.003
tip-0.003
aults-0.003
Token and
Token ID290
Feature activation-6.73e-01

Pos logit contributions
 restaurant+0.005
 manager+0.005
 chamber+0.005
 refrigerator+0.005
 grandma+0.004
Neg logit contributions
 realize-0.010
aults-0.010
 prod-0.009
pieces-0.009
hing-0.009
Token a
Token ID257
Feature activation-8.13e-02

Pos logit contributions
 grandma+0.019
 old+0.017
 Grand+0.016
 Anna+0.016
 restaurant+0.015
Neg logit contributions
pieces-0.055
 realize-0.055
aults-0.054
 prod-0.054
cking-0.052
Token words
Token ID2456
Feature activation-2.09e+00

Pos logit contributions
 manager+0.025
 Cream+0.025
keeper+0.025
man+0.025
 grandma+0.024
Neg logit contributions
 realize-0.040
pieces-0.039
aults-0.038
 prod-0.038
tip-0.037
Token.
Token ID13
Feature activation-1.29e-01

Pos logit contributions
 grandma+0.070
 refrigerator+0.069
 restaurant+0.069
 manager+0.067
 chamber+0.066
Neg logit contributions
 realize-0.152
pieces-0.149
aults-0.147
 prod-0.146
 realise-0.142
Token He
Token ID679
Feature activation-6.03e-01

Pos logit contributions
 restaurant+0.006
man+0.006
 novel+0.006
 hotel+0.006
 grandma+0.006
Neg logit contributions
 prod-0.007
 ease-0.007
 realize-0.007
pieces-0.007
 realise-0.007
Token told
Token ID1297
Feature activation-8.27e-01

Pos logit contributions
 refrigerator+0.038
eddy+0.037
omm+0.037
 manager+0.036
 Cream+0.036
Neg logit contributions
 realize-0.028
 realise-0.026
 prod-0.025
 sudden-0.025
 forth-0.023
Token a
Token ID257
Feature activation+2.09e-01

Pos logit contributions
 Cream+0.046
ber+0.046
 refrigerator+0.045
eddy+0.045
 de+0.044
Neg logit contributions
 realize-0.043
 prod-0.042
 ease-0.041
 realise-0.040
pieces-0.040
Token story
Token ID1621
Feature activation-4.94e+00

Pos logit contributions
 realize+0.017
 prod+0.016
 realise+0.016
 tumble+0.016
pieces+0.016
Neg logit contributions
 grandma-0.005
 manager-0.005
 refrigerator-0.005
 Cream-0.005
keeper-0.005
Token about
Token ID546
Feature activation-3.74e+00

Pos logit contributions
 Cream+0.161
omm+0.156
 refrigerator+0.154
 freezer+0.145
 restaurant+0.141
Neg logit contributions
 realize-0.336
wheel-0.325
 realise-0.320
hing-0.316
pieces-0.315
Token a
Token ID257
Feature activation+2.14e-01

Pos logit contributions
keeper+0.197
 manager+0.196
omm+0.193
eddy+0.189
man+0.188
Neg logit contributions
 realize-0.202
pieces-0.200
 ease-0.197
 prod-0.196
 realise-0.186
Token sleepy
Token ID42023
Feature activation-6.49e-01

Pos logit contributions
 realize+0.013
 prod+0.012
pieces+0.012
 sudden+0.012
 realise+0.012
Neg logit contributions
 manager-0.009
 Cream-0.009
keeper-0.009
omm-0.009
 refrigerator-0.009
Token bear
Token ID6842
Feature activation-9.07e-01

Pos logit contributions
 Cream+0.026
 manager+0.026
keeper+0.025
ber+0.025
uzz+0.025
Neg logit contributions
 realize-0.039
 prod-0.035
 notice-0.035
 forth-0.035
 sudden-0.035
Token and
Token ID290
Feature activation-1.01e-01

Pos logit contributions
omm+0.035
 refrigerator+0.035
 grandma+0.035
 old+0.034
 manager+0.034
Neg logit contributions
 realize-0.059
pieces-0.058
aults-0.058
ault-0.055
 realise-0.055
Token Amy
Token ID14235
Feature activation-1.13e-01

Pos logit contributions
 manager+0.047
 restaurant+0.047
man+0.047
 novel+0.045
keeper+0.045
Neg logit contributions
 realize-0.063
 prod-0.062
pieces-0.061
bs-0.059
tip-0.058
Token.
Token ID13
Feature activation+2.20e-01

Pos logit contributions
 manager+0.005
 restaurant+0.005
 refrigerator+0.004
keeper+0.004
omm+0.004
Neg logit contributions
 realize-0.007
bs-0.007
 prod-0.007
tip-0.007
aults-0.007
Token She
Token ID1375
Feature activation+6.94e-01

Pos logit contributions
 realize+0.014
 prod+0.013
pieces+0.013
 realise+0.012
 ease+0.012
Neg logit contributions
 manager-0.010
 grandma-0.010
 restaurant-0.010
keeper-0.010
man-0.009
Token was
Token ID373
Feature activation-8.55e-01

Pos logit contributions
 realize+0.037
 swung+0.035
tip+0.035
 sudden+0.034
bs+0.034
Neg logit contributions
keeper-0.037
 manager-0.036
 refrigerator-0.035
 restaurant-0.033
 Hero-0.033
Token three
Token ID1115
Feature activation-3.88e+00

Pos logit contributions
keeper+0.038
 manager+0.038
 restaurant+0.037
man+0.037
 refrigerator+0.037
Neg logit contributions
 realize-0.052
 prod-0.050
pieces-0.048
tip-0.048
 realise-0.047
Token years
Token ID812
Feature activation-4.71e+00

Pos logit contributions
man+0.158
 manager+0.157
keeper+0.153
 Hero+0.153
 refrigerator+0.147
Neg logit contributions
 realize-0.252
pieces-0.246
aults-0.241
 sudden-0.240
cking-0.238
Token old
Token ID1468
Feature activation-3.33e+00

Pos logit contributions
man+0.112
 Hero+0.105
 Shop+0.105
 Cream+0.101
 Man+0.097
Neg logit contributions
 realize-0.364
bs-0.363
tip-0.357
lest-0.347
 sudden-0.343
Token and
Token ID290
Feature activation+1.29e-01

Pos logit contributions
 manager+0.133
 restaurant+0.129
man+0.128
arium+0.126
 Hero+0.122
Neg logit contributions
 realize-0.213
 forth-0.196
 sudden-0.196
tip-0.189
 prod-0.187
Token she
Token ID673
Feature activation+5.66e-01

Pos logit contributions
 realize+0.008
 prod+0.008
tip+0.008
bs+0.008
apy+0.008
Neg logit contributions
 restaurant-0.006
 manager-0.005
keeper-0.005
 refrigerator-0.005
man-0.005
Token loved
Token ID6151
Feature activation-7.34e-01

Pos logit contributions
 realize+0.036
 prod+0.034
tip+0.034
 realise+0.033
bs+0.033
Neg logit contributions
 refrigerator-0.024
keeper-0.024
 restaurant-0.024
 manager-0.024
 grandma-0.023
Token cr
Token ID1067
Feature activation+6.42e-02

Pos logit contributions
 manager+0.027
keeper+0.027
man+0.026
 chamber+0.025
 refrigerator+0.025
Neg logit contributions
 realize-0.050
 prod-0.049
pieces-0.048
tip-0.048
 realise-0.047
Token.
Token ID13
Feature activation+7.98e-02

Pos logit contributions
 grandma+0.044
omm+0.042
 manager+0.042
 old+0.041
 chamber+0.041
Neg logit contributions
 realize-0.069
 prod-0.068
pieces-0.068
aults-0.067
 forth-0.066
Token
Token ID198
Feature activation+5.32e-01

Pos logit contributions
 realize+0.006
 prod+0.006
pieces+0.006
aults+0.005
tip+0.005
Neg logit contributions
 grandma-0.003
 manager-0.003
 restaurant-0.003
 refrigerator-0.003
 old-0.003
Token
Token ID198
Feature activation+5.45e-01

Pos logit contributions
 realize+0.036
 prod+0.034
pieces+0.034
aults+0.033
 realise+0.033
Neg logit contributions
 manager-0.021
 refrigerator-0.020
 restaurant-0.020
 grandma-0.020
 old-0.019
TokenWhen
Token ID2215
Feature activation+9.43e-01

Pos logit contributions
 prod+0.031
 realize+0.029
 ease+0.029
 unnecessary+0.029
pieces+0.028
Neg logit contributions
 grandma-0.031
 shop-0.030
 old-0.030
 Anna-0.029
 restaurant-0.029
Token the
Token ID262
Feature activation-2.43e-02

Pos logit contributions
 realize+0.059
 prod+0.059
pieces+0.058
aults+0.057
tip+0.056
Neg logit contributions
 grandma-0.044
 restaurant-0.041
 refrigerator-0.041
 old-0.041
 manager-0.041
Token waiter
Token ID46612
Feature activation-3.16e+00

Pos logit contributions
uzz+0.001
abel+0.001
keeper+0.001
 de+0.001
ber+0.001
Neg logit contributions
 realize-0.001
 sudden-0.001
 notice-0.001
tip-0.001
 tumble-0.001
Token came
Token ID1625
Feature activation-1.30e-01

Pos logit contributions
 old+0.192
 de+0.192
 novels+0.189
 truth+0.181
 chamber+0.179
Neg logit contributions
 realize-0.139
 prod-0.125
 realise-0.121
 notice-0.121
pieces-0.116
Token,
Token ID11
Feature activation+1.33e-01

Pos logit contributions
 de+0.006
 novels+0.006
 refrigerator+0.006
 truth+0.006
 manager+0.006
Neg logit contributions
 realize-0.007
aults-0.007
 prod-0.007
ging-0.007
tip-0.006
Token Lily
Token ID20037
Feature activation+4.90e-01

Pos logit contributions
 realize+0.007
pieces+0.007
 prod+0.007
apy+0.007
 tumble+0.007
Neg logit contributions
 manager-0.006
 Shop-0.006
keeper-0.006
man-0.006
 truth-0.006
Token's
Token ID338
Feature activation+7.31e-01

Pos logit contributions
 realize+0.023
 prod+0.020
 realise+0.019
 sudden+0.019
 notice+0.019
Neg logit contributions
 novels-0.030
 chamber-0.029
 truth-0.028
 manager-0.028
 refrigerator-0.028
Token mom
Token ID1995
Feature activation-1.22e+00

Pos logit contributions
 sudden+0.036
 realize+0.033
 realise+0.028
 forget+0.027
 prod+0.027
Neg logit contributions
arium-0.039
lor-0.037
 MacDonald-0.036
 ruled-0.036
 Sleep-0.035