TOP ACTIVATIONS
MAX = 0.068

smiled and said , " Att aching means to stick something
was ready for make - bel ieve fun . <|endoftext|> Once
her mom my watched in amaz ement as the machine flew
and they played make - bel ieve . Finally , they
and was amazed by the fant astical things she saw .
scattered and broken . The view is messy and ugly .
. She says , " Who did this ? Who drank
sad . " Who did this ?" she asked
, they got off the subway and completed their shopping .
They hear Spot 's sn ores from the living room .
a fire . The fire alarm rings . Sara and her
thinks they made up the story . " Tom
suddenly , a brightly - col oured belt caught her eye
night . They were so capt ivated by the fireplace รข
go ! Everyone watched in amaz ement as it took off
happy . " Who did this ?" Mom asks
. She said , " Who did this ?"
't go on any more tours and had to stay in
ittens . She was so capt ivated by the parade that
friends stopped and watched in amaz ement . The dog too

MIN ACTIVATIONS
MIN = -0.066

dolls around in a wheel bar row she had found in
it away in a wheel bar row . Once
provide you with some delicious seaw eed to eat ."
" Hi P ingu ! Do you want to
the different fish and colorful seaw eed below .
of soil into the wheel bar row . It was heavy
. He found a wheel bar row , a tr amp
back with a big wheel bar row . He asked the
the cat off the wheel bar row and put him back
It was hiding under a seaw eed . Lily said ,
mar bles , spark ly mar bles , and mar bles
found it stuck in some seaw eed . Tim was so
started to chew on some seaw eed . Mary was so
him a big red wheel bar row and filled it with
to keep playing with your mar bles all day ?" it
will make a salad with lettuce , tomato , and cheese
light en up the wheel bar row . <|endoftext|> Once there
play with you and your mar bles ?" The boy looked
She also picked some green seaw eed to show her friends
a big , spinning wheel bar row and put the pumpkin

INTERVAL 0.034 - 0.068
CONTAINS 0.138%

t eddy bear . The sun is mild . It is
's hair and she was speech less . She did not
Now , let รข โ‚ฌ โ„ข s start playing !"
play together . Today is their birthday . They are very
it was the most delicious thing she had ever tasted !

INTERVAL 0.001 - 0.034
CONTAINS 45.704%

wished he didn 't do that . He said sorry to
you , Mom !" Lily and Ben say .
doing . She told Lily that it wasn 't safe to
aunt ie and uncle 's house , Tommy saw the baby
very much . One day , Lily 's friend , Tim

INTERVAL -0.032 - 0.001
CONTAINS 53.956%

was being so stubborn . She wouldn 't listen . She
and the singing birds . He began to feel at peace
Max felt better and they sat on the bench ,
Polly said , " Let 's get rid of this !"
, Lily was always careful and made sure to ask for

INTERVAL -0.066 - -0.032
CONTAINS 0.202%

" Thank you , Ben ," Mia says . "
Once there was a pe b ble . The pe b
, " What are b erry - bus hes ?"
away as fast as he could . B
- a tree with Bright ly shining leaves . She quickly
Token smiled
Token ID13541
Feature activation-9.18e-03

Pos logit contributions
 knocks+0.013
cakes+0.012
cases+0.011
chair+0.011
five+0.010
Neg logit contributions
P-0.017
Bo-0.016
orn-0.016
un-0.015
ble-0.014
Token and
Token ID290
Feature activation-4.39e-03

Pos logit contributions
P+0.018
Bo+0.017
orn+0.017
un+0.015
ble+0.015
Neg logit contributions
 knocks-0.015
cakes-0.014
cases-0.013
chair-0.013
five-0.012
Token said
Token ID531
Feature activation-1.37e-02

Pos logit contributions
P+0.008
Bo+0.008
orn+0.008
un+0.007
ble+0.007
Neg logit contributions
 knocks-0.007
cakes-0.007
cases-0.006
chair-0.006
five-0.006
Token,
Token ID11
Feature activation-4.63e-03

Pos logit contributions
P+0.024
Bo+0.023
orn+0.022
un+0.020
ble+0.020
Neg logit contributions
 knocks-0.025
cakes-0.023
cases-0.022
chair-0.022
five-0.021
Token "
Token ID366
Feature activation-4.05e-03

Pos logit contributions
P+0.011
orn+0.010
Bo+0.010
un+0.009
ble+0.009
Neg logit contributions
 knocks-0.006
cakes-0.005
cases-0.005
chair-0.005
 knocking-0.005
TokenAtt
Token ID8086
Feature activation+6.39e-02

Pos logit contributions
P+0.007
orn+0.007
Bo+0.007
ble+0.006
un+0.006
Neg logit contributions
 knocks-0.007
cakes-0.007
chair-0.006
cases-0.006
five-0.006
Tokenaching
Token ID8103
Feature activation+1.11e-02

Pos logit contributions
 knocks+0.123
 sigh+0.101
 leaving+0.099
 knocking+0.099
chair+0.099
Neg logit contributions
P-0.112
orn-0.098
Bo-0.096
ble-0.095
un-0.093
Token means
Token ID1724
Feature activation-1.32e-02

Pos logit contributions
 knocks+0.016
cakes+0.014
cases+0.014
 James+0.013
chair+0.013
Neg logit contributions
P-0.024
orn-0.023
Bo-0.023
G-0.020
un-0.020
Token to
Token ID284
Feature activation+4.66e-03

Pos logit contributions
P+0.029
Bo+0.028
orn+0.028
un+0.026
ble+0.026
Neg logit contributions
 knocks-0.018
cakes-0.016
cases-0.014
chair-0.014
five-0.014
Token stick
Token ID4859
Feature activation-1.27e-02

Pos logit contributions
 knocks+0.006
 report+0.006
 knocking+0.005
 James+0.005
 knock+0.005
Neg logit contributions
P-0.010
orn-0.010
Bo-0.010
un-0.009
ble-0.009
Token something
Token ID1223
Feature activation+3.60e-03

Pos logit contributions
P+0.028
Bo+0.027
orn+0.026
G+0.025
un+0.024
Neg logit contributions
 knocks-0.018
cakes-0.016
chair-0.014
cases-0.014
 Amy-0.014
Token was
Token ID373
Feature activation-2.84e-03

Pos logit contributions
P+0.008
orn+0.007
Bo+0.007
ble+0.007
G+0.007
Neg logit contributions
 knocks-0.005
cases-0.004
 sigh-0.004
chair-0.004
cakes-0.004
Token ready
Token ID3492
Feature activation+5.40e-03

Pos logit contributions
P+0.006
orn+0.006
Bo+0.006
un+0.005
G+0.005
Neg logit contributions
 knocks-0.004
cakes-0.004
 leaving-0.004
 knocking-0.003
 James-0.003
Token for
Token ID329
Feature activation-4.96e-03

Pos logit contributions
 knocks+0.006
cakes+0.005
cases+0.005
chair+0.005
 leaving+0.005
Neg logit contributions
P-0.013
Bo-0.013
orn-0.013
un-0.012
G-0.012
Token make
Token ID787
Feature activation+1.43e-02

Pos logit contributions
P+0.010
Bo+0.009
orn+0.009
un+0.008
ble+0.008
Neg logit contributions
 knocks-0.008
cakes-0.007
cases-0.007
chair-0.007
five-0.007
Token-
Token ID12
Feature activation+3.14e-02

Pos logit contributions
 knocks+0.025
cakes+0.023
cases+0.021
chair+0.021
five+0.020
Neg logit contributions
P-0.026
Bo-0.025
orn-0.025
un-0.023
ble-0.022
Tokenbel
Token ID6667
Feature activation+6.19e-02

Pos logit contributions
 knocks+0.049
cakes+0.041
cases+0.039
chair+0.039
 leaving+0.038
Neg logit contributions
P-0.066
Bo-0.062
orn-0.061
un-0.058
ble-0.056
Tokenieve
Token ID12311
Feature activation+2.01e-02

Pos logit contributions
 knocks+0.098
cakes+0.090
cases+0.083
chair+0.083
five+0.079
Neg logit contributions
P-0.123
Bo-0.116
orn-0.115
un-0.106
ble-0.105
Token fun
Token ID1257
Feature activation+1.05e-02

Pos logit contributions
 knocks+0.032
cakes+0.029
cases+0.027
chair+0.027
five+0.026
Neg logit contributions
P-0.040
Bo-0.038
orn-0.037
un-0.034
ble-0.034
Token.
Token ID13
Feature activation+4.00e-03

Pos logit contributions
 knocks+0.018
cakes+0.016
cases+0.016
five+0.015
chair+0.015
Neg logit contributions
P-0.019
Bo-0.018
orn-0.017
un-0.016
ble-0.016
Token<|endoftext|>
Token ID50256
Feature activation+1.37e-02

Pos logit contributions
 knocks+0.007
cakes+0.007
cases+0.006
chair+0.006
five+0.006
Neg logit contributions
P-0.007
Bo-0.007
orn-0.007
un-0.006
ble-0.006
TokenOnce
Token ID7454
Feature activation-6.51e-03

Pos logit contributions
 knocks+0.020
cakes+0.019
chair+0.018
 James+0.018
 Amy+0.018
Neg logit contributions
P-0.026
Bo-0.025
orn-0.024
ble-0.024
un-0.023
Token her
Token ID607
Feature activation+2.35e-02

Pos logit contributions
P+0.007
Bo+0.007
orn+0.007
un+0.006
ble+0.006
Neg logit contributions
 knocks-0.006
cakes-0.005
cases-0.005
chair-0.005
five-0.005
Token mom
Token ID1995
Feature activation+1.06e-02

Pos logit contributions
 knocks+0.037
cakes+0.033
chair+0.031
cases+0.031
 leaving+0.030
Neg logit contributions
P-0.048
Bo-0.046
orn-0.045
un-0.041
G-0.041
Tokenmy
Token ID1820
Feature activation+2.68e-03

Pos logit contributions
 knocks+0.017
cakes+0.015
cases+0.014
chair+0.014
five+0.014
Neg logit contributions
P-0.021
Bo-0.020
orn-0.020
un-0.018
ble-0.018
Token watched
Token ID7342
Feature activation+1.58e-02

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.003
chair+0.003
 knocking+0.003
Neg logit contributions
P-0.006
Bo-0.005
orn-0.005
un-0.005
G-0.005
Token in
Token ID287
Feature activation+2.23e-03

Pos logit contributions
 knocks+0.028
cakes+0.025
cases+0.024
chair+0.024
five+0.023
Neg logit contributions
P-0.028
Bo-0.027
orn-0.026
un-0.024
ble-0.024
Token amaz
Token ID40642
Feature activation+6.01e-02

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.004
chair+0.004
five+0.003
Neg logit contributions
P-0.004
Bo-0.003
orn-0.003
un-0.003
ble-0.003
Tokenement
Token ID972
Feature activation+2.50e-02

Pos logit contributions
 knocks+0.091
 knocking+0.073
cakes+0.073
five+0.069
cases+0.069
Neg logit contributions
P-0.127
orn-0.120
Bo-0.112
ble-0.109
un-0.105
Token as
Token ID355
Feature activation-1.19e-02

Pos logit contributions
 knocks+0.045
cakes+0.039
five+0.037
 divorce+0.037
chair+0.036
Neg logit contributions
P-0.042
Bo-0.040
orn-0.039
ble-0.038
un-0.038
Token the
Token ID262
Feature activation+2.51e-03

Pos logit contributions
P+0.026
Bo+0.024
orn+0.024
un+0.022
ble+0.022
Neg logit contributions
 knocks-0.017
cakes-0.016
chair-0.015
cases-0.014
 James-0.014
Token machine
Token ID4572
Feature activation+2.09e-02

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.004
five+0.004
chair+0.004
Neg logit contributions
P-0.005
Bo-0.004
orn-0.004
un-0.004
ble-0.004
Token flew
Token ID13112
Feature activation-1.55e-02

Pos logit contributions
 knocks+0.035
cakes+0.032
cases+0.030
chair+0.029
five+0.028
Neg logit contributions
P-0.040
Bo-0.038
orn-0.038
un-0.034
ble-0.034
Token and
Token ID290
Feature activation-5.25e-04

Pos logit contributions
P+0.030
Bo+0.028
orn+0.028
un+0.025
G+0.025
Neg logit contributions
 knocks-0.020
cakes-0.019
cases-0.017
chair-0.017
 knocking-0.016
Token they
Token ID484
Feature activation+5.11e-04

Pos logit contributions
P+0.001
Bo+0.001
orn+0.001
un+0.001
ble+0.001
Neg logit contributions
 knocks-0.001
cakes-0.001
cases-0.001
chair-0.001
 James-0.001
Token played
Token ID2826
Feature activation-3.60e-03

Pos logit contributions
 knocks+0.001
cakes+0.001
cases+0.001
chair+0.001
 James+0.001
Neg logit contributions
P-0.001
orn-0.001
Bo-0.001
ble-0.001
G-0.001
Token make
Token ID787
Feature activation+1.22e-02

Pos logit contributions
P+0.008
orn+0.008
Bo+0.008
un+0.008
G+0.008
Neg logit contributions
 knocks-0.004
cakes-0.004
 James-0.004
chair-0.004
cases-0.003
Token-
Token ID12
Feature activation+3.31e-02

Pos logit contributions
 knocks+0.021
cakes+0.020
cases+0.019
chair+0.019
five+0.018
Neg logit contributions
P-0.022
Bo-0.021
orn-0.021
un-0.019
ble-0.018
Tokenbel
Token ID6667
Feature activation+5.99e-02

Pos logit contributions
 knocks+0.056
cakes+0.047
 leaving+0.045
 sigh+0.045
chair+0.045
Neg logit contributions
P-0.063
Bo-0.060
orn-0.059
un-0.057
G-0.054
Tokenieve
Token ID12311
Feature activation+1.96e-02

Pos logit contributions
 knocks+0.098
cakes+0.089
cases+0.083
chair+0.083
five+0.079
Neg logit contributions
P-0.116
Bo-0.109
orn-0.108
un-0.100
ble-0.099
Token.
Token ID13
Feature activation+6.47e-03

Pos logit contributions
 knocks+0.028
cakes+0.025
chair+0.024
cases+0.024
 James+0.023
Neg logit contributions
P-0.042
Bo-0.040
orn-0.040
un-0.037
G-0.036
Token Finally
Token ID9461
Feature activation-5.15e-03

Pos logit contributions
 knocks+0.009
cakes+0.008
cases+0.008
chair+0.008
five+0.007
Neg logit contributions
P-0.014
Bo-0.013
orn-0.013
un-0.012
ble-0.012
Token,
Token ID11
Feature activation-2.91e-03

Pos logit contributions
P+0.009
Bo+0.008
orn+0.008
un+0.008
ble+0.007
Neg logit contributions
 knocks-0.009
cakes-0.009
cases-0.008
chair-0.008
five-0.008
Token they
Token ID484
Feature activation-1.35e-03

Pos logit contributions
P+0.007
Bo+0.006
orn+0.006
un+0.006
ble+0.006
Neg logit contributions
 knocks-0.004
cakes-0.003
cases-0.003
chair-0.003
five-0.003
Token and
Token ID290
Feature activation+9.12e-03

Pos logit contributions
 knocks+0.014
cakes+0.013
chair+0.012
cases+0.012
 James+0.011
Neg logit contributions
P-0.021
Bo-0.020
orn-0.020
un-0.018
ble-0.018
Token was
Token ID373
Feature activation+5.17e-03

Pos logit contributions
 knocks+0.014
cakes+0.013
chair+0.012
cases+0.012
 James+0.012
Neg logit contributions
P-0.019
Bo-0.018
orn-0.018
un-0.016
G-0.016
Token amazed
Token ID24562
Feature activation+2.01e-02

Pos logit contributions
 knocks+0.009
cakes+0.008
chair+0.007
 James+0.007
 knocking+0.007
Neg logit contributions
P-0.010
Bo-0.010
orn-0.010
un-0.009
G-0.009
Token by
Token ID416
Feature activation+1.77e-02

Pos logit contributions
 knocks+0.034
cakes+0.030
cases+0.028
chair+0.028
five+0.027
Neg logit contributions
P-0.039
Bo-0.037
orn-0.036
un-0.033
ble-0.033
Token the
Token ID262
Feature activation+1.68e-02

Pos logit contributions
 knocks+0.025
cakes+0.023
chair+0.021
cases+0.021
 James+0.020
Neg logit contributions
P-0.039
Bo-0.036
orn-0.036
un-0.033
G-0.033
Token fant
Token ID5113
Feature activation+5.98e-02

Pos logit contributions
 knocks+0.030
cakes+0.028
cases+0.026
chair+0.026
five+0.025
Neg logit contributions
P-0.030
Bo-0.028
orn-0.028
un-0.025
ble-0.025
Tokenastical
Token ID32044
Feature activation+1.73e-02

Pos logit contributions
 knocks+0.085
cakes+0.074
 sigh+0.073
five+0.071
 Amy+0.070
Neg logit contributions
Bo-0.116
P-0.115
ble-0.109
ink-0.107
G-0.106
Token things
Token ID1243
Feature activation+8.63e-03

Pos logit contributions
 knocks+0.030
cakes+0.027
cases+0.026
chair+0.025
five+0.025
Neg logit contributions
P-0.031
Bo-0.030
orn-0.029
un-0.027
ble-0.026
Token she
Token ID673
Feature activation-1.68e-03

Pos logit contributions
 knocks+0.013
cakes+0.012
cases+0.011
chair+0.011
five+0.011
Neg logit contributions
P-0.017
Bo-0.017
orn-0.016
un-0.015
ble-0.015
Token saw
Token ID2497
Feature activation-3.30e-03

Pos logit contributions
P+0.003
orn+0.003
Bo+0.003
un+0.003
ble+0.003
Neg logit contributions
 knocks-0.003
cakes-0.002
chair-0.002
cases-0.002
 James-0.002
Token.
Token ID13
Feature activation+2.97e-03

Pos logit contributions
P+0.007
Bo+0.007
orn+0.007
un+0.007
G+0.006
Neg logit contributions
 knocks-0.004
cakes-0.004
chair-0.004
cases-0.004
 James-0.004
Token scattered
Token ID16830
Feature activation+8.10e-03

Pos logit contributions
 knocks+0.006
cakes+0.005
cases+0.005
chair+0.005
 knocking+0.005
Neg logit contributions
P-0.008
Bo-0.008
orn-0.008
un-0.007
G-0.007
Token and
Token ID290
Feature activation+2.17e-02

Pos logit contributions
 knocks+0.012
cakes+0.011
chair+0.010
cases+0.010
 knocking+0.010
Neg logit contributions
P-0.017
Bo-0.016
orn-0.016
G-0.015
un-0.014
Token broken
Token ID5445
Feature activation+1.50e-02

Pos logit contributions
 knocks+0.034
cakes+0.032
cases+0.030
chair+0.030
five+0.029
Neg logit contributions
P-0.043
Bo-0.040
orn-0.040
un-0.037
ble-0.037
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
 knocks+0.024
cakes+0.022
cases+0.021
chair+0.021
five+0.020
Neg logit contributions
P-0.029
Bo-0.028
orn-0.027
un-0.025
ble-0.025
Token The
Token ID383
Feature activation+2.80e-02

Pos logit contributions
cakes+0.021
 knocks+0.021
cases+0.019
lived+0.019
chair+0.019
Neg logit contributions
P-0.021
Bo-0.021
orn-0.019
G-0.019
un-0.019
Token view
Token ID1570
Feature activation+5.96e-02

Pos logit contributions
 knocks+0.046
cakes+0.044
cases+0.043
chair+0.040
five+0.039
Neg logit contributions
P-0.051
orn-0.048
Bo-0.048
ble-0.044
un-0.044
Token is
Token ID318
Feature activation+1.46e-03

Pos logit contributions
 knocks+0.096
ksh+0.096
cases+0.094
cakes+0.094
isions+0.092
Neg logit contributions
P-0.096
orn-0.096
Bo-0.090
ble-0.090
ery-0.081
Token messy
Token ID24097
Feature activation+1.33e-02

Pos logit contributions
 knocks+0.002
cakes+0.002
cases+0.002
chair+0.002
five+0.002
Neg logit contributions
P-0.003
Bo-0.003
orn-0.003
un-0.003
ble-0.003
Token and
Token ID290
Feature activation+2.69e-02

Pos logit contributions
 knocks+0.026
cakes+0.026
five+0.024
cases+0.024
 divorce+0.024
Neg logit contributions
P-0.018
orn-0.017
ble-0.016
Bo-0.016
un-0.015
Token ugly
Token ID13400
Feature activation+1.84e-02

Pos logit contributions
 knocks+0.045
cakes+0.044
five+0.042
cases+0.041
chair+0.041
Neg logit contributions
P-0.046
Bo-0.045
orn-0.043
ble-0.041
un-0.040
Token.
Token ID13
Feature activation+9.58e-03

Pos logit contributions
 knocks+0.032
cakes+0.032
cases+0.030
five+0.029
lived+0.028
Neg logit contributions
P-0.030
orn-0.029
Bo-0.028
ble-0.027
un-0.026
Token.
Token ID13
Feature activation-2.77e-03

Pos logit contributions
 knocks+0.016
cakes+0.016
ksh+0.016
 divorce+0.016
cases+0.016
Neg logit contributions
ble-0.004
P-0.004
She-0.003
Bo-0.003
un-0.003
Token She
Token ID1375
Feature activation+1.49e-02

Pos logit contributions
P+0.005
Bo+0.004
orn+0.004
un+0.004
ble+0.004
Neg logit contributions
 knocks-0.005
cakes-0.005
cases-0.005
chair-0.005
five-0.004
Token says
Token ID1139
Feature activation+3.56e-03

Pos logit contributions
cakes+0.019
 knocks+0.018
chair+0.017
cases+0.017
ksh+0.017
Neg logit contributions
P-0.032
Bo-0.030
orn-0.029
un-0.028
ble-0.027
Token,
Token ID11
Feature activation+1.39e-02

Pos logit contributions
 knocks+0.007
cakes+0.007
cases+0.007
ksh+0.006
chair+0.006
Neg logit contributions
P-0.005
Bo-0.005
orn-0.005
un-0.004
ble-0.004
Token "
Token ID366
Feature activation+1.03e-02

Pos logit contributions
 knocks+0.020
cakes+0.018
cases+0.017
chair+0.016
five+0.016
Neg logit contributions
P-0.030
Bo-0.028
orn-0.028
un-0.026
ble-0.026
TokenWho
Token ID8241
Feature activation+5.91e-02

Pos logit contributions
 knocks+0.018
cakes+0.016
cases+0.015
chair+0.015
five+0.014
Neg logit contributions
P-0.020
Bo-0.019
orn-0.018
un-0.017
G-0.017
Token did
Token ID750
Feature activation-1.00e-02

Pos logit contributions
 knocks+0.109
cakes+0.106
ksh+0.105
idents+0.101
chair+0.100
Neg logit contributions
P-0.097
orn-0.084
Bo-0.079
un-0.078
ery-0.072
Token this
Token ID428
Feature activation+1.69e-02

Pos logit contributions
P+0.016
orn+0.012
ble+0.012
Bo+0.012
un+0.011
Neg logit contributions
ksh-0.021
cakes-0.020
 knocks-0.019
cases-0.019
ounded-0.018
Token?
Token ID30
Feature activation+1.35e-02

Pos logit contributions
 knocks+0.034
cases+0.032
chair+0.032
cakes+0.031
five+0.031
Neg logit contributions
P-0.025
orn-0.021
Bo-0.021
ble-0.021
un-0.019
Token Who
Token ID5338
Feature activation+4.09e-02

Pos logit contributions
 knocks+0.015
cakes+0.014
cases+0.012
chair+0.012
five+0.011
Neg logit contributions
P-0.033
Bo-0.032
orn-0.031
un-0.030
ble-0.029
Token drank
Token ID24070
Feature activation-8.92e-03

Pos logit contributions
 knocks+0.071
cakes+0.067
chair+0.063
cases+0.063
five+0.062
Neg logit contributions
P-0.073
Bo-0.067
orn-0.067
un-0.062
ble-0.059
Token sad
Token ID6507
Feature activation+1.22e-02

Pos logit contributions
 knocks+0.051
cakes+0.049
cases+0.047
chair+0.046
five+0.046
Neg logit contributions
P-0.061
Bo-0.057
orn-0.055
un-0.054
ble-0.053
Token.
Token ID13
Feature activation-8.29e-03

Pos logit contributions
 knocks+0.026
cases+0.026
lived+0.025
cakes+0.025
ounded+0.025
Neg logit contributions
P-0.013
Bo-0.013
ble-0.012
orn-0.011
un-0.011
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
P+0.014
Bo+0.013
orn+0.013
un+0.012
ble+0.012
Neg logit contributions
 knocks-0.015
cakes-0.014
chair-0.013
cases-0.013
five-0.013
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
P+0.031
orn+0.031
Bo+0.030
ery+0.028
 feather+0.027
Neg logit contributions
 knocks-0.021
cakes-0.017
 Amy-0.017
 sigh-0.017
chair-0.017
Token"
Token ID1
Feature activation+3.18e-03

Pos logit contributions
orn+0.017
Bo+0.016
P+0.015
ery+0.014
ble+0.014
Neg logit contributions
 knocks-0.022
cakes-0.020
chair-0.019
 Amy-0.019
 knocking-0.019
TokenWho
Token ID8241
Feature activation+5.90e-02

Pos logit contributions
 knocks+0.006
cakes+0.006
cases+0.005
chair+0.005
five+0.005
Neg logit contributions
P-0.005
Bo-0.005
orn-0.005
un-0.005
ble-0.004
Token did
Token ID750
Feature activation-1.06e-02

Pos logit contributions
 knocks+0.107
cakes+0.107
ksh+0.102
chair+0.099
idents+0.099
Neg logit contributions
P-0.099
orn-0.088
Bo-0.083
un-0.082
ery-0.075
Token this
Token ID428
Feature activation+1.76e-02

Pos logit contributions
P+0.015
Bo+0.012
orn+0.012
ble+0.012
un+0.011
Neg logit contributions
cakes-0.023
ksh-0.023
cases-0.021
cake-0.020
 knocks-0.020
Token?"
Token ID1701
Feature activation-1.41e-02

Pos logit contributions
 knocks+0.037
cases+0.034
cakes+0.034
chair+0.033
five+0.032
Neg logit contributions
P-0.025
orn-0.023
Bo-0.022
ble-0.022
un-0.020
Token she
Token ID673
Feature activation-4.69e-03

Pos logit contributions
P+0.032
orn+0.031
Bo+0.031
un+0.029
ble+0.028
Neg logit contributions
 knocks-0.018
cakes-0.016
chair-0.015
 Amy-0.014
 leaving-0.014
Token asked
Token ID1965
Feature activation-2.57e-03

Pos logit contributions
P+0.009
orn+0.009
Bo+0.009
un+0.008
ble+0.008
Neg logit contributions
 knocks-0.008
cakes-0.007
cases-0.007
 knocking-0.007
chair-0.007
Token,
Token ID11
Feature activation-7.85e-04

Pos logit contributions
P+0.006
Bo+0.005
orn+0.005
un+0.005
ble+0.005
Neg logit contributions
 knocks-0.008
cakes-0.008
ksh-0.007
cases-0.007
chair-0.007
Token they
Token ID484
Feature activation+5.22e-03

Pos logit contributions
P+0.002
Bo+0.002
orn+0.002
ble+0.001
un+0.001
Neg logit contributions
 knocks-0.001
cakes-0.001
chair-0.001
cases-0.001
five-0.001
Token got
Token ID1392
Feature activation+6.05e-03

Pos logit contributions
 knocks+0.007
cakes+0.007
cases+0.006
 sigh+0.006
 James+0.006
Neg logit contributions
P-0.011
orn-0.011
Bo-0.011
ble-0.010
agons-0.009
Token off
Token ID572
Feature activation+9.62e-03

Pos logit contributions
 knocks+0.011
cakes+0.011
chair+0.010
five+0.010
cases+0.010
Neg logit contributions
P-0.010
Bo-0.009
orn-0.009
un-0.009
ble-0.008
Token the
Token ID262
Feature activation+1.86e-02

Pos logit contributions
 knocks+0.017
cakes+0.016
cases+0.015
chair+0.015
five+0.014
Neg logit contributions
P-0.017
Bo-0.016
orn-0.016
un-0.015
ble-0.014
Token subway
Token ID19612
Feature activation+5.89e-02

Pos logit contributions
 knocks+0.030
cakes+0.028
cases+0.026
chair+0.025
five+0.025
Neg logit contributions
P-0.036
Bo-0.034
orn-0.034
un-0.031
ble-0.031
Token and
Token ID290
Feature activation+8.90e-03

Pos logit contributions
 knocks+0.112
cakes+0.105
ksh+0.103
vised+0.099
 begs+0.098
Neg logit contributions
un-0.073
P-0.064
orn-0.063
Bo-0.061
boat-0.060
Token completed
Token ID5668
Feature activation+5.04e-03

Pos logit contributions
 knocks+0.015
cakes+0.014
five+0.013
ksh+0.013
chair+0.013
Neg logit contributions
P-0.015
Bo-0.015
orn-0.014
un-0.014
ble-0.013
Token their
Token ID511
Feature activation+2.52e-02

Pos logit contributions
 knocks+0.009
cakes+0.009
chair+0.008
cases+0.008
five+0.008
Neg logit contributions
P-0.008
Bo-0.008
orn-0.008
ble-0.007
un-0.007
Token shopping
Token ID9735
Feature activation+2.66e-02

Pos logit contributions
cakes+0.047
 knocks+0.044
cake+0.043
chair+0.042
five+0.041
Neg logit contributions
P-0.040
Bo-0.036
orn-0.033
 feather-0.032
ble-0.031
Token.
Token ID13
Feature activation+7.80e-03

Pos logit contributions
 knocks+0.046
cakes+0.041
chair+0.039
ksh+0.038
five+0.038
Neg logit contributions
P-0.047
orn-0.043
Bo-0.043
ble-0.041
un-0.040
Token They
Token ID1119
Feature activation-4.78e-03

Pos logit contributions
P+0.005
orn+0.004
Bo+0.004
ble+0.004
un+0.004
Neg logit contributions
 knocks-0.003
cakes-0.003
chair-0.003
cases-0.003
 James-0.003
Token hear
Token ID3285
Feature activation+1.94e-02

Pos logit contributions
P+0.011
orn+0.011
Bo+0.010
ble+0.010
un+0.009
Neg logit contributions
 knocks-0.007
 sigh-0.006
 begs-0.005
five-0.005
cases-0.005
Token Spot
Token ID15899
Feature activation+8.02e-03

Pos logit contributions
 knocks+0.032
cakes+0.030
cases+0.028
chair+0.028
five+0.027
Neg logit contributions
P-0.036
Bo-0.035
orn-0.034
un-0.032
ble-0.031
Token's
Token ID338
Feature activation+5.68e-03

Pos logit contributions
 knocks+0.013
cakes+0.013
chair+0.012
cases+0.011
five+0.011
Neg logit contributions
P-0.015
Bo-0.015
orn-0.014
un-0.014
ble-0.013
Token sn
Token ID3013
Feature activation-1.09e-03

Pos logit contributions
cakes+0.011
 knocks+0.011
cases+0.010
chair+0.009
five+0.009
Neg logit contributions
Bo-0.008
P-0.008
ble-0.008
orn-0.008
un-0.008
Tokenores
Token ID2850
Feature activation+5.87e-02

Pos logit contributions
P+0.002
Bo+0.002
orn+0.002
un+0.002
ble+0.002
Neg logit contributions
 knocks-0.002
cakes-0.002
chair-0.001
cases-0.001
five-0.001
Token from
Token ID422
Feature activation+4.36e-04

Pos logit contributions
 knocks+0.096
cakes+0.088
cases+0.082
chair+0.082
five+0.079
Neg logit contributions
P-0.112
Bo-0.107
orn-0.105
un-0.097
ble-0.095
Token the
Token ID262
Feature activation+6.18e-03

Pos logit contributions
 knocks+0.001
 knocking+0.001
 leaving+0.001
 Amy+0.001
chair+0.001
Neg logit contributions
P-0.001
orn-0.001
Bo-0.001
un-0.001
G-0.001
Token living
Token ID2877
Feature activation+1.74e-02

Pos logit contributions
 knocks+0.009
cakes+0.008
chair+0.008
cases+0.008
five+0.007
Neg logit contributions
P-0.013
Bo-0.012
orn-0.012
un-0.011
ble-0.011
Token room
Token ID2119
Feature activation+4.31e-03

Pos logit contributions
 knocks+0.021
cakes+0.019
cases+0.018
chair+0.017
five+0.016
Neg logit contributions
P-0.041
Bo-0.039
orn-0.039
un-0.036
ble-0.036
Token.
Token ID13
Feature activation-5.59e-03

Pos logit contributions
 knocks+0.007
cakes+0.007
chair+0.006
cases+0.006
five+0.006
Neg logit contributions
P-0.008
Bo-0.007
orn-0.007
un-0.007
ble-0.006
Token a
Token ID257
Feature activation+1.19e-02

Pos logit contributions
 knocks+0.023
cakes+0.021
chair+0.019
cases+0.019
 knocking+0.018
Neg logit contributions
P-0.025
Bo-0.023
orn-0.023
un-0.021
G-0.020
Token fire
Token ID2046
Feature activation+2.51e-02

Pos logit contributions
 knocks+0.019
cakes+0.017
cases+0.016
chair+0.015
five+0.015
Neg logit contributions
P-0.024
Bo-0.023
orn-0.023
un-0.021
ble-0.021
Token.
Token ID13
Feature activation+2.66e-04

Pos logit contributions
 knocks+0.039
cakes+0.038
cases+0.037
chair+0.036
five+0.036
Neg logit contributions
Bo-0.043
P-0.042
ble-0.042
orn-0.041
un-0.041
Token The
Token ID383
Feature activation+1.48e-02

Pos logit contributions
 knocks+0.000
cakes+0.000
chair+0.000
five+0.000
cases+0.000
Neg logit contributions
P-0.001
orn-0.001
Bo-0.001
un-0.001
ble-0.001
Token fire
Token ID2046
Feature activation+3.70e-02

Pos logit contributions
 knocks+0.024
cakes+0.023
cases+0.022
chair+0.022
five+0.021
Neg logit contributions
P-0.028
Bo-0.026
orn-0.026
ble-0.024
un-0.024
Token alarm
Token ID10436
Feature activation+5.85e-02

Pos logit contributions
 knocks+0.050
cakes+0.049
chair+0.047
cases+0.047
five+0.046
Neg logit contributions
P-0.071
Bo-0.071
orn-0.069
ble-0.068
un-0.065
Token rings
Token ID13917
Feature activation-6.04e-03

Pos logit contributions
 knocks+0.076
vised+0.074
cakes+0.074
lived+0.074
ounded+0.073
Neg logit contributions
P-0.107
Bo-0.104
orn-0.103
ble-0.100
un-0.097
Token.
Token ID13
Feature activation+6.40e-04

Pos logit contributions
P+0.015
Bo+0.014
orn+0.013
G+0.012
un+0.012
Neg logit contributions
 knocks-0.008
cakes-0.007
cases-0.007
 sigh-0.006
chair-0.006
Token Sara
Token ID24799
Feature activation+1.37e-02

Pos logit contributions
 knocks+0.001
cakes+0.001
chair+0.001
five+0.001
cases+0.001
Neg logit contributions
P-0.002
orn-0.001
Bo-0.001
un-0.001
ble-0.001
Token and
Token ID290
Feature activation+3.66e-03

Pos logit contributions
 knocks+0.021
cakes+0.021
cases+0.019
chair+0.019
five+0.018
Neg logit contributions
P-0.026
Bo-0.025
orn-0.024
un-0.023
ble-0.023
Token her
Token ID607
Feature activation+4.42e-03

Pos logit contributions
 knocks+0.006
cakes+0.005
chair+0.005
cases+0.005
five+0.005
Neg logit contributions
P-0.008
Bo-0.007
orn-0.007
un-0.007
ble-0.007
Token thinks
Token ID6834
Feature activation+2.71e-02

Pos logit contributions
 knocks+0.002
cakes+0.002
chair+0.002
cases+0.002
five+0.002
Neg logit contributions
P-0.003
Bo-0.003
orn-0.003
un-0.003
ble-0.003
Token they
Token ID484
Feature activation-8.80e-03

Pos logit contributions
 knocks+0.038
cakes+0.034
ksh+0.034
idents+0.033
five+0.032
Neg logit contributions
P-0.052
Bo-0.051
orn-0.050
ble-0.048
un-0.048
Token made
Token ID925
Feature activation+7.29e-03

Pos logit contributions
P+0.015
Bo+0.014
orn+0.014
ble+0.012
un+0.012
Neg logit contributions
 knocks-0.017
cakes-0.015
cases-0.015
chair-0.014
five-0.014
Token up
Token ID510
Feature activation+1.03e-02

Pos logit contributions
 knocks+0.013
cakes+0.012
cases+0.012
chair+0.012
five+0.011
Neg logit contributions
P-0.012
Bo-0.012
orn-0.011
un-0.011
ble-0.010
Token the
Token ID262
Feature activation+2.32e-02

Pos logit contributions
 knocks+0.015
cakes+0.014
cases+0.013
chair+0.013
 sigh+0.012
Neg logit contributions
P-0.022
orn-0.021
Bo-0.021
un-0.019
ble-0.019
Token story
Token ID1621
Feature activation+5.83e-02

Pos logit contributions
 knocks+0.039
cakes+0.037
cases+0.034
chair+0.034
five+0.033
Neg logit contributions
P-0.041
orn-0.039
Bo-0.039
un-0.035
 feather-0.035
Token.
Token ID13
Feature activation-5.19e-03

Pos logit contributions
 knocks+0.105
cakes+0.092
vised+0.091
ksh+0.090
idents+0.090
Neg logit contributions
P-0.085
Bo-0.081
un-0.080
orn-0.080
ble-0.078
Token
Token ID198
Feature activation-1.02e-02

Pos logit contributions
P+0.010
orn+0.009
Bo+0.009
ble+0.008
un+0.008
Neg logit contributions
 knocks-0.009
cakes-0.008
chair-0.008
cases-0.008
five-0.008
Token
Token ID198
Feature activation-2.72e-03

Pos logit contributions
P+0.022
orn+0.021
Bo+0.021
ery+0.019
 feather+0.019
Neg logit contributions
 knocks-0.014
chair-0.012
cakes-0.012
 sigh-0.012
 Amy-0.012
Token"
Token ID1
Feature activation+1.72e-03

Pos logit contributions
orn+0.004
Bo+0.004
P+0.003
ble+0.003
un+0.003
Neg logit contributions
 knocks-0.006
cakes-0.005
chair-0.005
five-0.005
cases-0.005
TokenTom
Token ID13787
Feature activation-6.57e-03

Pos logit contributions
 knocks+0.003
cakes+0.003
cases+0.003
chair+0.003
 knocking+0.003
Neg logit contributions
P-0.003
Bo-0.003
orn-0.003
un-0.003
G-0.003
Token suddenly
Token ID6451
Feature activation-4.74e-03

Pos logit contributions
P+0.045
orn+0.045
Bo+0.042
un+0.042
ery+0.040
Neg logit contributions
 knocks-0.023
cakes-0.020
 leaving-0.018
 James-0.018
 Amy-0.018
Token,
Token ID11
Feature activation-9.35e-03

Pos logit contributions
P+0.010
orn+0.009
Bo+0.009
un+0.009
ble+0.008
Neg logit contributions
 knocks-0.007
cakes-0.006
cases-0.006
chair-0.006
 sigh-0.006
Token a
Token ID257
Feature activation-1.29e-02

Pos logit contributions
P+0.020
orn+0.020
Bo+0.019
un+0.019
ble+0.017
Neg logit contributions
 knocks-0.013
cakes-0.011
cases-0.011
chair-0.010
 sigh-0.010
Token brightly
Token ID35254
Feature activation+5.12e-03

Pos logit contributions
P+0.023
Bo+0.022
orn+0.021
ble+0.019
un+0.019
Neg logit contributions
 knocks-0.023
cakes-0.021
chair-0.020
 sigh-0.020
 manager-0.020
Token-
Token ID12
Feature activation+2.25e-02

Pos logit contributions
 knocks+0.010
cakes+0.009
cases+0.009
five+0.008
chair+0.008
Neg logit contributions
P-0.009
orn-0.008
Bo-0.008
ble-0.007
un-0.007
Tokencol
Token ID4033
Feature activation+5.77e-02

Pos logit contributions
 knocks+0.029
cakes+0.026
cases+0.024
chair+0.023
five+0.022
Neg logit contributions
P-0.052
Bo-0.049
orn-0.049
un-0.046
ble-0.045
Tokenoured
Token ID8167
Feature activation-2.12e-02

Pos logit contributions
 knocks+0.067
 Amy+0.058
cakes+0.056
 manager+0.052
 James+0.052
Neg logit contributions
P-0.132
orn-0.130
un-0.126
Bo-0.123
ir-0.117
Token belt
Token ID10999
Feature activation-7.00e-03

Pos logit contributions
P+0.040
Bo+0.039
orn+0.036
G+0.035
un+0.034
Neg logit contributions
 knocks-0.033
cakes-0.032
chair-0.031
 manager-0.029
 sigh-0.028
Token caught
Token ID4978
Feature activation-1.13e-02

Pos logit contributions
P+0.012
orn+0.012
Bo+0.011
un+0.011
ble+0.010
Neg logit contributions
 knocks-0.013
cakes-0.012
five-0.011
ksh-0.011
idents-0.011
Token her
Token ID607
Feature activation+2.98e-03

Pos logit contributions
Bo+0.023
P+0.022
orn+0.021
un+0.021
G+0.020
Neg logit contributions
 knocks-0.015
cakes-0.014
 Amy-0.013
 James-0.013
 knocking-0.013
Token eye
Token ID4151
Feature activation-1.96e-02

Pos logit contributions
 knocks+0.005
chair+0.004
cakes+0.004
cases+0.004
five+0.004
Neg logit contributions
P-0.006
Bo-0.006
orn-0.006
G-0.005
un-0.005
Token night
Token ID1755
Feature activation+3.15e-02

Pos logit contributions
 knocks+0.011
cakes+0.010
chair+0.010
cases+0.010
five+0.009
Neg logit contributions
P-0.014
Bo-0.013
orn-0.013
un-0.012
ble-0.012
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
 knocks+0.054
cakes+0.048
cases+0.046
five+0.046
chair+0.045
Neg logit contributions
P-0.057
Bo-0.053
orn-0.051
un-0.048
ble-0.048
Token They
Token ID1119
Feature activation-1.54e-03

Pos logit contributions
 knocks+0.016
cakes+0.014
cases+0.013
chair+0.013
five+0.012
Neg logit contributions
P-0.022
Bo-0.021
orn-0.021
un-0.019
G-0.019
Token were
Token ID547
Feature activation+6.29e-03

Pos logit contributions
P+0.003
orn+0.003
Bo+0.003
ble+0.003
G+0.003
Neg logit contributions
 knocks-0.002
 sigh-0.002
 James-0.002
chair-0.002
cases-0.002
Token so
Token ID523
Feature activation+1.22e-02

Pos logit contributions
 knocks+0.011
cakes+0.010
cases+0.009
chair+0.009
five+0.009
Neg logit contributions
P-0.012
Bo-0.011
orn-0.011
un-0.010
ble-0.010
Token capt
Token ID3144
Feature activation+5.73e-02

Pos logit contributions
 knocks+0.018
cakes+0.017
cases+0.015
 sigh+0.015
chair+0.015
Neg logit contributions
P-0.025
orn-0.024
Bo-0.024
un-0.022
G-0.021
Tokenivated
Token ID30829
Feature activation+2.51e-02

Pos logit contributions
 knocks+0.090
cakes+0.080
cases+0.075
chair+0.072
five+0.071
Neg logit contributions
P-0.122
Bo-0.114
ble-0.109
orn-0.107
un-0.106
Token by
Token ID416
Feature activation+2.00e-02

Pos logit contributions
 knocks+0.046
cakes+0.041
cases+0.039
chair+0.038
five+0.038
Neg logit contributions
P-0.046
Bo-0.042
orn-0.041
G-0.038
un-0.037
Token the
Token ID262
Feature activation+2.59e-03

Pos logit contributions
 knocks+0.032
cakes+0.029
cases+0.027
chair+0.027
five+0.026
Neg logit contributions
P-0.039
Bo-0.037
orn-0.037
un-0.034
ble-0.033
Token fireplace
Token ID44250
Feature activation+1.61e-02

Pos logit contributions
 knocks+0.005
cakes+0.004
cases+0.004
chair+0.004
five+0.004
Neg logit contributions
P-0.005
Bo-0.004
orn-0.004
un-0.004
ble-0.004
Tokenรข
Token ID22940
Feature activation+2.48e-02

Pos logit contributions
 knocks+0.027
cakes+0.024
chair+0.023
five+0.023
cases+0.023
Neg logit contributions
P-0.029
Bo-0.028
orn-0.028
ble-0.026
un-0.026
Token go
Token ID467
Feature activation+1.49e-03

Pos logit contributions
 knocks+0.009
cakes+0.007
chair+0.007
 James+0.007
 Amy+0.007
Neg logit contributions
P-0.013
Bo-0.012
orn-0.012
un-0.011
G-0.011
Token!
Token ID0
Feature activation+1.43e-02

Pos logit contributions
 knocks+0.002
cakes+0.002
cases+0.002
chair+0.002
five+0.002
Neg logit contributions
P-0.003
Bo-0.003
orn-0.003
un-0.003
ble-0.003
Token Everyone
Token ID11075
Feature activation+3.08e-02

Pos logit contributions
 knocks+0.020
cakes+0.018
chair+0.018
 James+0.017
cases+0.017
Neg logit contributions
P-0.031
orn-0.030
Bo-0.029
ble-0.028
un-0.026
Token watched
Token ID7342
Feature activation+1.88e-02

Pos logit contributions
 knocks+0.052
cakes+0.048
cases+0.045
chair+0.045
five+0.043
Neg logit contributions
P-0.058
Bo-0.055
orn-0.054
un-0.050
ble-0.049
Token in
Token ID287
Feature activation+4.54e-04

Pos logit contributions
 knocks+0.030
cakes+0.027
cases+0.025
chair+0.025
five+0.024
Neg logit contributions
P-0.037
Bo-0.036
orn-0.035
un-0.032
ble-0.032
Token amaz
Token ID40642
Feature activation+5.66e-02

Pos logit contributions
 knocks+0.001
cakes+0.001
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.001
Bo-0.001
orn-0.001
un-0.001
ble-0.001
Tokenement
Token ID972
Feature activation+3.19e-02

Pos logit contributions
 knocks+0.083
cakes+0.067
 knocking+0.066
chair+0.063
five+0.063
Neg logit contributions
P-0.121
orn-0.114
Bo-0.110
ble-0.107
un-0.102
Token as
Token ID355
Feature activation-6.61e-03

Pos logit contributions
 knocks+0.054
cakes+0.049
chair+0.045
 divorce+0.044
five+0.044
Neg logit contributions
P-0.055
Bo-0.055
ble-0.052
orn-0.052
un-0.051
Token it
Token ID340
Feature activation-3.07e-04

Pos logit contributions
P+0.015
orn+0.014
Bo+0.014
un+0.013
G+0.013
Neg logit contributions
 knocks-0.009
cakes-0.008
chair-0.008
cases-0.008
 James-0.007
Token took
Token ID1718
Feature activation-7.69e-03

Pos logit contributions
P+0.001
orn+0.001
Bo+0.001
un+0.001
G+0.001
Neg logit contributions
 knocks-0.001
cakes-0.000
cases-0.000
chair-0.000
 sigh-0.000
Token off
Token ID572
Feature activation-4.25e-03

Pos logit contributions
P+0.016
orn+0.015
Bo+0.015
ble+0.014
un+0.014
Neg logit contributions
 knocks-0.012
cakes-0.010
five-0.010
 James-0.010
cases-0.010
Token happy
Token ID3772
Feature activation+3.04e-02

Pos logit contributions
 knocks+0.022
cakes+0.020
chair+0.019
cases+0.019
five+0.018
Neg logit contributions
P-0.022
Bo-0.021
orn-0.020
un-0.019
ble-0.019
Token.
Token ID13
Feature activation-7.54e-03

Pos logit contributions
cases+0.067
ounded+0.066
cakes+0.065
ksh+0.065
 knocks+0.064
Neg logit contributions
ble-0.031
P-0.030
Bo-0.028
orn-0.028
un-0.025
Token
Token ID198
Feature activation-1.26e-02

Pos logit contributions
P+0.013
Bo+0.012
orn+0.012
un+0.011
ble+0.011
Neg logit contributions
 knocks-0.014
cakes-0.013
cases-0.012
chair-0.012
five-0.012
Token
Token ID198
Feature activation-9.18e-03

Pos logit contributions
P+0.026
orn+0.024
Bo+0.024
un+0.022
 feather+0.021
Neg logit contributions
 knocks-0.019
cakes-0.017
chair-0.016
 Amy-0.016
 sigh-0.016
Token"
Token ID1
Feature activation+2.05e-04

Pos logit contributions
orn+0.012
P+0.012
Bo+0.012
un+0.011
ble+0.010
Neg logit contributions
 knocks-0.019
cakes-0.018
chair-0.017
 Amy-0.017
five-0.016
TokenWho
Token ID8241
Feature activation+5.62e-02

Pos logit contributions
 knocks+0.000
cakes+0.000
cases+0.000
chair+0.000
five+0.000
Neg logit contributions
P-0.000
Bo-0.000
orn-0.000
G-0.000
un-0.000
Token did
Token ID750
Feature activation-1.27e-02

Pos logit contributions
 knocks+0.099
cakes+0.093
cases+0.086
chair+0.086
 knocking+0.083
Neg logit contributions
P-0.101
orn-0.094
Bo-0.093
un-0.086
ble-0.082
Token this
Token ID428
Feature activation+1.37e-02

Pos logit contributions
P+0.021
orn+0.018
Bo+0.018
ble+0.017
un+0.016
Neg logit contributions
ksh-0.025
cakes-0.024
cases-0.023
 knocks-0.022
cake-0.021
Token?"
Token ID1701
Feature activation-1.54e-02

Pos logit contributions
 knocks+0.029
cakes+0.027
cases+0.026
chair+0.026
five+0.026
Neg logit contributions
P-0.019
orn-0.018
Bo-0.017
ble-0.016
un-0.016
Token Mom
Token ID11254
Feature activation+1.87e-02

Pos logit contributions
P+0.036
Bo+0.035
orn+0.034
un+0.032
ble+0.032
Neg logit contributions
 knocks-0.019
cakes-0.017
cases-0.015
chair-0.015
five-0.014
Token asks
Token ID7893
Feature activation+5.70e-03

Pos logit contributions
cakes+0.036
 divorce+0.034
idents+0.033
five+0.033
 inconvenience+0.033
Neg logit contributions
P-0.026
Bo-0.024
orn-0.023
ble-0.022
un-0.022
Token.
Token ID13
Feature activation-2.26e-03

Pos logit contributions
 knocks+0.010
cakes+0.010
 divorce+0.009
cases+0.009
ksh+0.009
Neg logit contributions
P-0.004
ble-0.004
Bo-0.004
orn-0.004
un-0.004
Token She
Token ID1375
Feature activation+8.81e-03

Pos logit contributions
P+0.004
orn+0.004
Bo+0.004
ble+0.004
un+0.004
Neg logit contributions
 knocks-0.004
cakes-0.003
chair-0.003
 Amy-0.003
cases-0.003
Token said
Token ID531
Feature activation-6.03e-03

Pos logit contributions
 knocks+0.013
cakes+0.012
cases+0.011
 knocking+0.011
chair+0.011
Neg logit contributions
P-0.018
orn-0.018
Bo-0.018
ble-0.016
un-0.016
Token,
Token ID11
Feature activation+1.64e-02

Pos logit contributions
P+0.008
Bo+0.007
orn+0.007
un+0.006
ble+0.006
Neg logit contributions
 knocks-0.013
cakes-0.012
ksh-0.012
cases-0.012
chair-0.012
Token "
Token ID366
Feature activation+9.59e-03

Pos logit contributions
 knocks+0.023
cakes+0.020
cases+0.018
chair+0.018
five+0.017
Neg logit contributions
P-0.036
Bo-0.035
orn-0.034
un-0.032
ble-0.031
TokenWho
Token ID8241
Feature activation+5.61e-02

Pos logit contributions
 knocks+0.016
cakes+0.015
chair+0.014
cases+0.014
five+0.013
Neg logit contributions
P-0.017
orn-0.017
Bo-0.017
un-0.015
ble-0.015
Token did
Token ID750
Feature activation-1.04e-02

Pos logit contributions
 knocks+0.097
cakes+0.091
chair+0.085
cases+0.085
ksh+0.082
Neg logit contributions
P-0.101
orn-0.093
Bo-0.092
un-0.086
ble-0.082
Token this
Token ID428
Feature activation+1.80e-02

Pos logit contributions
P+0.017
Bo+0.014
orn+0.014
ble+0.013
un+0.013
Neg logit contributions
ksh-0.021
cakes-0.020
 knocks-0.019
cases-0.019
five-0.018
Token?"
Token ID1701
Feature activation-4.51e-03

Pos logit contributions
 knocks+0.037
cakes+0.034
cases+0.033
chair+0.032
five+0.032
Neg logit contributions
P-0.027
Bo-0.025
orn-0.025
un-0.023
ble-0.022
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
P+0.008
orn+0.008
Bo+0.008
un+0.007
ble+0.007
Neg logit contributions
 knocks-0.008
cakes-0.007
chair-0.007
cases-0.007
 James-0.007
Token
Token ID198
Feature activation-7.27e-03

Pos logit contributions
orn+0.028
P+0.027
Bo+0.026
ery+0.025
 feather+0.024
Neg logit contributions
 knocks-0.016
 Amy-0.014
 sigh-0.013
cakes-0.013
chair-0.013
Token't
Token ID470
Feature activation+5.05e-03

Pos logit contributions
 knocks+0.031
cakes+0.028
cases+0.027
five+0.026
chair+0.026
Neg logit contributions
P-0.032
Bo-0.030
orn-0.030
un-0.028
ble-0.027
Token go
Token ID467
Feature activation+2.85e-02

Pos logit contributions
 knocks+0.007
cakes+0.006
chair+0.005
 sigh+0.005
 knocking+0.005
Neg logit contributions
P-0.012
orn-0.011
Bo-0.011
un-0.010
ble-0.010
Token on
Token ID319
Feature activation+3.90e-03

Pos logit contributions
 knocks+0.052
cakes+0.051
ksh+0.049
 James+0.049
five+0.048
Neg logit contributions
P-0.043
orn-0.042
un-0.041
Bo-0.040
ble-0.037
Token any
Token ID597
Feature activation+7.34e-03

Pos logit contributions
 knocks+0.007
cakes+0.007
cases+0.006
five+0.006
chair+0.006
Neg logit contributions
P-0.007
Bo-0.006
orn-0.006
un-0.006
ble-0.006
Token more
Token ID517
Feature activation+5.55e-03

Pos logit contributions
cases+0.015
cakes+0.014
ksh+0.014
 James+0.014
idents+0.013
Neg logit contributions
P-0.012
orn-0.010
Bo-0.009
 feather-0.009
ble-0.008
Token tours
Token ID21284
Feature activation+5.60e-02

Pos logit contributions
cakes+0.011
ksh+0.010
five+0.010
cake+0.010
faced+0.010
Neg logit contributions
P-0.008
Bo-0.007
 feather-0.007
orn-0.007
ble-0.006
Token and
Token ID290
Feature activation+1.34e-02

Pos logit contributions
 knocks+0.098
ksh+0.090
cakes+0.090
 begs+0.086
idents+0.084
Neg logit contributions
P-0.083
un-0.081
orn-0.077
Bo-0.072
G-0.071
Token had
Token ID550
Feature activation+1.18e-02

Pos logit contributions
 knocks+0.023
cakes+0.022
ksh+0.022
cases+0.022
five+0.021
Neg logit contributions
P-0.021
Bo-0.020
orn-0.020
ble-0.018
un-0.018
Token to
Token ID284
Feature activation+1.80e-02

Pos logit contributions
 knocks+0.016
cakes+0.014
cases+0.014
five+0.013
chair+0.013
Neg logit contributions
P-0.026
Bo-0.024
orn-0.024
un-0.023
G-0.022
Token stay
Token ID2652
Feature activation+5.71e-03

Pos logit contributions
 knocks+0.026
cakes+0.021
 sigh+0.020
 knocking+0.020
 leaving+0.020
Neg logit contributions
P-0.040
orn-0.038
Bo-0.038
un-0.035
ble-0.034
Token in
Token ID287
Feature activation+3.78e-03

Pos logit contributions
 knocks+0.010
cakes+0.009
cases+0.008
chair+0.008
five+0.008
Neg logit contributions
P-0.011
Bo-0.010
orn-0.010
un-0.009
ble-0.009
Tokenittens
Token ID34978
Feature activation-7.55e-03

Pos logit contributions
P+0.010
Bo+0.010
orn+0.010
un+0.009
ble+0.009
Neg logit contributions
 knocks-0.013
cakes-0.012
 James-0.011
 knocking-0.011
chair-0.011
Token.
Token ID13
Feature activation+5.94e-04

Pos logit contributions
P+0.016
Bo+0.015
orn+0.015
un+0.014
G+0.013
Neg logit contributions
 knocks-0.011
cakes-0.010
cases-0.010
chair-0.010
five-0.009
Token She
Token ID1375
Feature activation-5.66e-03

Pos logit contributions
 knocks+0.001
cakes+0.001
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.001
Bo-0.001
orn-0.001
un-0.001
ble-0.001
Token was
Token ID373
Feature activation+5.90e-03

Pos logit contributions
P+0.012
orn+0.012
Bo+0.011
ble+0.011
un+0.011
Neg logit contributions
 knocks-0.009
cakes-0.007
chair-0.007
cases-0.007
 sigh-0.007
Token so
Token ID523
Feature activation+2.85e-03

Pos logit contributions
 knocks+0.010
cakes+0.009
cases+0.009
chair+0.009
five+0.008
Neg logit contributions
P-0.011
Bo-0.010
orn-0.010
un-0.010
ble-0.009
Token capt
Token ID3144
Feature activation+5.59e-02

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.004
chair+0.004
five+0.003
Neg logit contributions
P-0.006
orn-0.006
Bo-0.006
un-0.005
ble-0.005
Tokenivated
Token ID30829
Feature activation+2.10e-02

Pos logit contributions
 knocks+0.095
cakes+0.081
cases+0.076
chair+0.075
idents+0.074
Neg logit contributions
P-0.113
Bo-0.105
ble-0.102
un-0.098
orn-0.097
Token by
Token ID416
Feature activation+1.79e-02

Pos logit contributions
 knocks+0.039
cakes+0.034
cases+0.033
five+0.032
 knocking+0.031
Neg logit contributions
P-0.038
Bo-0.035
orn-0.035
un-0.032
G-0.032
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
 knocks+0.028
cakes+0.026
 knocking+0.024
 James+0.024
cases+0.024
Neg logit contributions
P-0.037
orn-0.034
Bo-0.033
un-0.030
G-0.030
Token parade
Token ID16134
Feature activation+3.01e-02

Pos logit contributions
 knocks+0.019
cakes+0.016
 begs+0.016
cases+0.016
chair+0.016
Neg logit contributions
P-0.018
Bo-0.018
orn-0.016
un-0.016
 feather-0.015
Token that
Token ID326
Feature activation+3.63e-03

Pos logit contributions
 knocks+0.052
cakes+0.046
five+0.042
cases+0.042
chair+0.042
Neg logit contributions
P-0.052
Bo-0.051
orn-0.050
un-0.047
ble-0.046
Token friends
Token ID2460
Feature activation-2.15e-03

Pos logit contributions
 knocks+0.017
cakes+0.015
chair+0.014
cases+0.014
 sigh+0.013
Neg logit contributions
P-0.019
Bo-0.018
orn-0.018
un-0.016
G-0.016
Token stopped
Token ID5025
Feature activation+9.68e-03

Pos logit contributions
P+0.004
orn+0.004
Bo+0.004
un+0.004
ble+0.004
Neg logit contributions
 knocks-0.003
cakes-0.003
cases-0.003
 sigh-0.003
 James-0.003
Token and
Token ID290
Feature activation+4.89e-03

Pos logit contributions
 knocks+0.013
cakes+0.012
cases+0.011
chair+0.011
five+0.010
Neg logit contributions
P-0.021
Bo-0.020
orn-0.020
un-0.019
ble-0.019
Token watched
Token ID7342
Feature activation+1.52e-02

Pos logit contributions
 knocks+0.008
cakes+0.008
chair+0.007
cases+0.007
five+0.007
Neg logit contributions
P-0.009
Bo-0.009
orn-0.008
un-0.008
G-0.008
Token in
Token ID287
Feature activation+1.11e-03

Pos logit contributions
 knocks+0.025
cakes+0.023
chair+0.022
cases+0.022
five+0.020
Neg logit contributions
P-0.027
Bo-0.027
ble-0.025
orn-0.025
un-0.025
Token amaz
Token ID40642
Feature activation+5.59e-02

Pos logit contributions
cakes+0.002
 knocks+0.002
cases+0.002
chair+0.002
lived+0.002
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
G-0.002
Tokenement
Token ID972
Feature activation+2.97e-02

Pos logit contributions
 knocks+0.085
cakes+0.069
 knocking+0.068
chair+0.065
cases+0.064
Neg logit contributions
P-0.118
orn-0.110
Bo-0.105
ble-0.103
un-0.097
Token.
Token ID13
Feature activation-2.05e-04

Pos logit contributions
 knocks+0.056
cakes+0.050
 divorce+0.047
chair+0.046
five+0.046
Neg logit contributions
Bo-0.045
ble-0.045
P-0.045
un-0.042
orn-0.041
Token The
Token ID383
Feature activation+6.72e-03

Pos logit contributions
P+0.000
Bo+0.000
orn+0.000
un+0.000
ble+0.000
Neg logit contributions
 knocks-0.000
cakes-0.000
cases-0.000
chair-0.000
five-0.000
Token dog
Token ID3290
Feature activation+5.46e-03

Pos logit contributions
 knocks+0.012
cakes+0.011
cases+0.010
chair+0.010
five+0.010
Neg logit contributions
P-0.012
Bo-0.011
orn-0.011
un-0.010
ble-0.010
Token too
Token ID1165
Feature activation-4.28e-03

Pos logit contributions
 knocks+0.009
cakes+0.008
cases+0.008
chair+0.007
five+0.007
Neg logit contributions
P-0.011
orn-0.010
Bo-0.010
un-0.009
ble-0.009
Token dolls
Token ID36062
Feature activation-1.40e-02

Pos logit contributions
 knocks+0.014
cakes+0.013
cases+0.012
chair+0.012
five+0.011
Neg logit contributions
P-0.017
Bo-0.016
orn-0.016
un-0.015
ble-0.015
Token around
Token ID1088
Feature activation+8.83e-03

Pos logit contributions
P+0.030
Bo+0.029
orn+0.028
G+0.026
un+0.025
Neg logit contributions
 knocks-0.020
cakes-0.018
chair-0.017
cases-0.017
 knocking-0.017
Token in
Token ID287
Feature activation-9.69e-03

Pos logit contributions
 knocks+0.013
 knocking+0.011
 James+0.011
 Amy+0.011
cakes+0.010
Neg logit contributions
P-0.019
orn-0.018
Bo-0.018
un-0.017
G-0.017
Token a
Token ID257
Feature activation-7.84e-03

Pos logit contributions
P+0.021
orn+0.020
Bo+0.020
ble+0.018
un+0.018
Neg logit contributions
 knocks-0.014
 James-0.012
 knocking-0.012
 Amy-0.012
 sigh-0.012
Token wheel
Token ID7825
Feature activation-7.44e-03

Pos logit contributions
P+0.013
Bo+0.012
orn+0.012
un+0.011
ble+0.010
Neg logit contributions
 knocks-0.015
cakes-0.014
cases-0.014
chair-0.014
five-0.013
Tokenbar
Token ID5657
Feature activation-6.59e-02

Pos logit contributions
P+0.016
Bo+0.015
orn+0.015
G+0.013
ble+0.013
Neg logit contributions
 knocks-0.011
cakes-0.010
chair-0.010
cases-0.009
five-0.009
Tokenrow
Token ID808
Feature activation-2.27e-02

Pos logit contributions
Bo+0.096
Mart+0.081
G+0.081
P+0.080
Wal+0.076
Neg logit contributions
 knocks-0.126
chair-0.123
cases-0.121
cakes-0.120
five-0.114
Token she
Token ID673
Feature activation+4.17e-03

Pos logit contributions
P+0.052
Bo+0.050
orn+0.048
Ant+0.044
G+0.044
Neg logit contributions
 knocks-0.029
cakes-0.026
chair-0.026
cases-0.025
 leaving-0.024
Token had
Token ID550
Feature activation-2.13e-02

Pos logit contributions
 knocks+0.007
chair+0.006
cakes+0.006
 knocking+0.006
cases+0.006
Neg logit contributions
P-0.008
Bo-0.008
orn-0.008
ble-0.007
G-0.007
Token found
Token ID1043
Feature activation-8.21e-03

Pos logit contributions
P+0.046
orn+0.045
Bo+0.045
ble+0.040
un+0.040
Neg logit contributions
 knocks-0.031
cakes-0.027
cases-0.026
 knocking-0.025
chair-0.025
Token in
Token ID287
Feature activation-8.69e-03

Pos logit contributions
Bo+0.018
P+0.017
orn+0.016
G+0.015
un+0.015
Neg logit contributions
 knocks-0.011
cakes-0.011
chair-0.010
cases-0.009
 leaving-0.009
Token it
Token ID340
Feature activation+3.07e-03

Pos logit contributions
P+0.016
Bo+0.015
orn+0.015
un+0.013
G+0.013
Neg logit contributions
 knocks-0.011
cakes-0.010
cases-0.009
five-0.009
chair-0.009
Token away
Token ID1497
Feature activation-1.52e-02

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.003
chair+0.003
five+0.003
Neg logit contributions
P-0.007
Bo-0.007
orn-0.006
un-0.006
ble-0.006
Token in
Token ID287
Feature activation-1.19e-02

Pos logit contributions
P+0.033
Bo+0.031
orn+0.031
un+0.029
G+0.028
Neg logit contributions
 knocks-0.021
cakes-0.019
cases-0.018
chair-0.018
five-0.017
Token a
Token ID257
Feature activation-1.14e-02

Pos logit contributions
P+0.029
Bo+0.027
orn+0.027
ute+0.025
agons+0.025
Neg logit contributions
 knocks-0.015
 James-0.012
 knocking-0.012
 Amy-0.012
 leaving-0.012
Token wheel
Token ID7825
Feature activation-2.20e-03

Pos logit contributions
P+0.020
Bo+0.019
orn+0.019
un+0.018
ble+0.017
Neg logit contributions
 knocks-0.020
cakes-0.018
chair-0.017
 sigh-0.017
 knocking-0.017
Tokenbar
Token ID5657
Feature activation-6.51e-02

Pos logit contributions
P+0.005
Bo+0.004
orn+0.004
un+0.004
G+0.004
Neg logit contributions
 knocks-0.003
cakes-0.003
chair-0.003
cases-0.003
five-0.003
Tokenrow
Token ID808
Feature activation-1.46e-02

Pos logit contributions
Bo+0.091
P+0.083
Mart+0.078
G+0.077
Wal+0.069
Neg logit contributions
 knocks-0.127
cakes-0.125
cases-0.123
chair-0.120
five-0.117
Token.
Token ID13
Feature activation+8.57e-05

Pos logit contributions
P+0.031
Bo+0.030
orn+0.029
un+0.027
G+0.026
Neg logit contributions
 knocks-0.021
cakes-0.019
cases-0.018
chair-0.018
five-0.017
Token
Token ID198
Feature activation-3.88e-03

Pos logit contributions
 knocks+0.000
cakes+0.000
cases+0.000
chair+0.000
five+0.000
Neg logit contributions
P-0.000
Bo-0.000
orn-0.000
un-0.000
ble-0.000
Token
Token ID198
Feature activation+1.78e-03

Pos logit contributions
P+0.008
Bo+0.007
orn+0.007
ble+0.007
 feather+0.007
Neg logit contributions
 knocks-0.006
chair-0.005
cakes-0.005
 Amy-0.005
 James-0.005
TokenOnce
Token ID7454
Feature activation+4.13e-03

Pos logit contributions
 knocks+0.003
cakes+0.003
five+0.002
chair+0.002
 Whoever+0.002
Neg logit contributions
orn-0.003
P-0.003
ble-0.003
Bo-0.003
un-0.003
Token provide
Token ID2148
Feature activation-1.24e-02

Pos logit contributions
P+0.033
orn+0.031
Bo+0.031
un+0.029
G+0.028
Neg logit contributions
 knocks-0.019
cakes-0.016
 sigh-0.016
 knocking-0.015
 begs-0.015
Token you
Token ID345
Feature activation+2.54e-03

Pos logit contributions
Bo+0.029
P+0.029
orn+0.028
un+0.027
G+0.026
Neg logit contributions
 knocks-0.013
cakes-0.012
 pizza-0.012
five-0.012
 Amy-0.011
Token with
Token ID351
Feature activation-1.98e-02

Pos logit contributions
 knocks+0.004
cakes+0.003
cases+0.003
chair+0.003
five+0.003
Neg logit contributions
P-0.005
Bo-0.005
orn-0.005
un-0.005
ble-0.005
Token some
Token ID617
Feature activation-7.34e-03

Pos logit contributions
P+0.043
Bo+0.043
orn+0.041
un+0.039
ble+0.038
Neg logit contributions
 knocks-0.027
cakes-0.023
 leaving-0.022
 pizza-0.022
 sigh-0.022
Token delicious
Token ID12625
Feature activation-1.28e-02

Pos logit contributions
P+0.016
Bo+0.016
orn+0.015
un+0.015
ble+0.014
Neg logit contributions
 knocks-0.010
cakes-0.009
 pizza-0.008
five-0.008
chair-0.008
Token seaw
Token ID32085
Feature activation-6.46e-02

Pos logit contributions
P+0.023
Bo+0.022
orn+0.021
un+0.019
ble+0.019
Neg logit contributions
 knocks-0.023
cakes-0.021
cases-0.020
chair-0.020
five-0.019
Tokeneed
Token ID2308
Feature activation-3.59e-02

Pos logit contributions
Bo+0.093
P+0.092
orn+0.089
un+0.087
Mart+0.082
Neg logit contributions
 knocks-0.115
cakes-0.113
five-0.113
 pizza-0.104
cases-0.103
Token to
Token ID284
Feature activation-6.52e-03

Pos logit contributions
P+0.080
Bo+0.077
orn+0.075
un+0.071
G+0.070
Neg logit contributions
 knocks-0.047
cakes-0.045
cases-0.041
chair-0.040
five-0.039
Token eat
Token ID4483
Feature activation-1.58e-02

Pos logit contributions
P+0.017
Bo+0.016
un+0.016
orn+0.016
Mart+0.015
Neg logit contributions
 knocks-0.006
 report-0.005
 pizza-0.005
 leaving-0.005
 sigh-0.005
Token."
Token ID526
Feature activation-3.19e-03

Pos logit contributions
Bo+0.036
un+0.035
P+0.035
orn+0.034
Mart+0.033
Neg logit contributions
 knocks-0.018
cakes-0.018
 pizza-0.016
 anymore-0.015
 again-0.015
Token
Token ID220
Feature activation-1.84e-02

Pos logit contributions
P+0.006
orn+0.005
Bo+0.005
ble+0.005
un+0.005
Neg logit contributions
 knocks-0.005
cakes-0.005
chair-0.005
five-0.005
cases-0.005
Token
Token ID198
Feature activation-1.21e-02

Pos logit contributions
 knocks+0.001
cakes+0.001
 leaving+0.001
 begs+0.001
idents+0.001
Neg logit contributions
orn-0.002
P-0.002
Bo-0.002
ery-0.002
ble-0.001
Token
Token ID198
Feature activation-4.40e-03

Pos logit contributions
P+0.025
orn+0.025
Bo+0.024
ble+0.024
ery+0.023
Neg logit contributions
 knocks-0.015
cakes-0.013
 Whoever-0.012
 Amy-0.012
chair-0.012
Token"
Token ID1
Feature activation-1.26e-02

Pos logit contributions
orn+0.006
este+0.006
ery+0.005
ble+0.005
Bo+0.005
Neg logit contributions
 knocks-0.008
cakes-0.008
 Whoever-0.007
ulating-0.007
 knocking-0.007
TokenHi
Token ID17250
Feature activation-3.08e-03

Pos logit contributions
orn+0.022
ble+0.022
Bo+0.022
P+0.021
un+0.021
Neg logit contributions
 knocks-0.018
cakes-0.018
 Amy-0.016
five-0.016
 James-0.016
Token P
Token ID350
Feature activation-2.84e-02

Pos logit contributions
P+0.006
Bo+0.005
orn+0.005
un+0.005
G+0.005
Neg logit contributions
 knocks-0.005
cakes-0.005
cases-0.004
 James-0.004
chair-0.004
Tokeningu
Token ID6680
Feature activation-6.38e-02

Pos logit contributions
Bo+0.050
P+0.050
G+0.045
Ant+0.045
Mart+0.044
Neg logit contributions
 knocks-0.044
cakes-0.042
idents-0.040
cases-0.040
ksh-0.039
Token!
Token ID0
Feature activation+5.80e-03

Pos logit contributions
P+0.135
Bo+0.130
orn+0.123
un+0.116
G+0.116
Neg logit contributions
 knocks-0.091
cakes-0.080
chair-0.076
cases-0.074
 sigh-0.074
Token Do
Token ID2141
Feature activation-9.70e-03

Pos logit contributions
 knocks+0.008
cakes+0.008
cases+0.008
five+0.007
 report+0.007
Neg logit contributions
Bo-0.012
P-0.012
orn-0.012
un-0.011
Ant-0.010
Token you
Token ID345
Feature activation-3.80e-03

Pos logit contributions
P+0.019
Bo+0.018
orn+0.018
un+0.016
ble+0.016
Neg logit contributions
 knocks-0.016
cakes-0.015
cases-0.014
chair-0.014
five-0.013
Token want
Token ID765
Feature activation-4.38e-03

Pos logit contributions
P+0.008
Bo+0.008
orn+0.008
un+0.007
ble+0.007
Neg logit contributions
 knocks-0.005
cakes-0.005
chair-0.005
cases-0.005
five-0.004
Token to
Token ID284
Feature activation+8.41e-03

Pos logit contributions
P+0.011
Bo+0.011
orn+0.010
un+0.010
ble+0.010
Neg logit contributions
 knocks-0.005
cakes-0.004
cases-0.004
chair-0.004
five-0.003
Token the
Token ID262
Feature activation-1.73e-02

Pos logit contributions
P+0.009
Bo+0.008
orn+0.008
un+0.008
G+0.008
Neg logit contributions
 knocks-0.006
cakes-0.005
 Amy-0.005
 leaving-0.005
 sigh-0.005
Token different
Token ID1180
Feature activation-1.60e-02

Pos logit contributions
P+0.021
Bo+0.020
orn+0.019
un+0.017
G+0.016
Neg logit contributions
 knocks-0.040
cakes-0.038
cases-0.036
chair-0.036
five-0.035
Token fish
Token ID5916
Feature activation-2.38e-02

Pos logit contributions
P+0.027
Bo+0.025
orn+0.025
un+0.023
G+0.022
Neg logit contributions
 knocks-0.030
cakes-0.028
cases-0.026
chair-0.026
five-0.025
Token and
Token ID290
Feature activation-9.28e-03

Pos logit contributions
P+0.052
Bo+0.048
orn+0.048
un+0.045
G+0.044
Neg logit contributions
 knocks-0.033
cakes-0.030
cases-0.028
chair-0.028
 leaving-0.027
Token colorful
Token ID20239
Feature activation-2.22e-02

Pos logit contributions
P+0.018
orn+0.016
Bo+0.016
un+0.015
ble+0.014
Neg logit contributions
 knocks-0.016
cakes-0.014
cases-0.014
chair-0.013
 knocking-0.013
Token seaw
Token ID32085
Feature activation-6.37e-02

Pos logit contributions
P+0.038
orn+0.035
Bo+0.034
un+0.033
G+0.031
Neg logit contributions
 knocks-0.041
cakes-0.038
cases-0.035
five-0.035
chair-0.035
Tokeneed
Token ID2308
Feature activation-3.18e-02

Pos logit contributions
P+0.097
un+0.092
orn+0.092
Bo+0.090
Ant+0.080
Neg logit contributions
 knocks-0.109
cakes-0.107
five-0.104
cases-0.100
ulating-0.098
Token below
Token ID2174
Feature activation+4.61e-03

Pos logit contributions
P+0.067
Bo+0.064
orn+0.063
un+0.058
G+0.057
Neg logit contributions
 knocks-0.047
cakes-0.043
cases-0.040
chair-0.039
five-0.038
Token.
Token ID13
Feature activation-2.42e-03

Pos logit contributions
 knocks+0.005
cakes+0.005
 Amy+0.005
 anymore+0.005
 leaving+0.005
Neg logit contributions
P-0.011
Bo-0.010
orn-0.010
un-0.010
ble-0.009
Token
Token ID198
Feature activation-8.59e-03

Pos logit contributions
P+0.005
Bo+0.005
orn+0.005
ble+0.004
un+0.004
Neg logit contributions
 knocks-0.004
cakes-0.003
chair-0.003
 Amy-0.003
 James-0.003
Token
Token ID198
Feature activation-4.42e-03

Pos logit contributions
orn+0.020
P+0.020
 feather+0.019
agons+0.019
 compass+0.018
Neg logit contributions
 Amy-0.008
 knocks-0.008
cakes-0.008
chair-0.008
 James-0.007
Token of
Token ID286
Feature activation-5.05e-03

Pos logit contributions
P+0.001
Bo+0.001
orn+0.001
un+0.001
ble+0.001
Neg logit contributions
 knocks-0.001
cakes-0.001
cases-0.001
chair-0.001
five-0.001
Token soil
Token ID9260
Feature activation+1.15e-03

Pos logit contributions
P+0.009
Bo+0.009
orn+0.008
un+0.008
G+0.008
Neg logit contributions
 knocks-0.009
cakes-0.008
 knocking-0.008
chair-0.007
 sigh-0.007
Token into
Token ID656
Feature activation+1.04e-03

Pos logit contributions
 knocks+0.002
cakes+0.002
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
ble-0.002
Token the
Token ID262
Feature activation-5.13e-03

Pos logit contributions
 knocks+0.002
cakes+0.002
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
ble-0.002
Token wheel
Token ID7825
Feature activation+5.19e-03

Pos logit contributions
P+0.008
Bo+0.008
orn+0.008
un+0.008
ble+0.007
Neg logit contributions
 knocks-0.010
cakes-0.009
chair-0.008
 knocking-0.008
cases-0.008
Tokenbar
Token ID5657
Feature activation-6.18e-02

Pos logit contributions
 knocks+0.008
cakes+0.008
cases+0.007
chair+0.007
five+0.007
Neg logit contributions
P-0.011
Bo-0.010
orn-0.010
un-0.009
G-0.009
Tokenrow
Token ID808
Feature activation-6.09e-03

Pos logit contributions
Bo+0.086
P+0.078
Mart+0.073
G+0.071
orn+0.065
Neg logit contributions
cakes-0.126
cases-0.124
 knocks-0.123
chair-0.114
 manager-0.111
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
P+0.013
Bo+0.012
orn+0.012
un+0.011
G+0.011
Neg logit contributions
 knocks-0.009
cakes-0.008
cases-0.008
chair-0.008
five-0.007
Token It
Token ID632
Feature activation+6.64e-03

Pos logit contributions
 knocks+0.019
cakes+0.017
lived+0.017
chair+0.017
 divorce+0.017
Neg logit contributions
Bo-0.017
P-0.016
G-0.015
un-0.015
ble-0.014
Token was
Token ID373
Feature activation-3.32e-04

Pos logit contributions
 knocks+0.011
cakes+0.010
cases+0.009
chair+0.009
 knocking+0.009
Neg logit contributions
P-0.013
orn-0.012
Bo-0.012
un-0.011
G-0.011
Token heavy
Token ID4334
Feature activation-8.42e-03

Pos logit contributions
P+0.001
orn+0.001
Bo+0.001
un+0.001
G+0.001
Neg logit contributions
 knocks-0.000
 knocking-0.000
 leaving-0.000
cakes-0.000
 James-0.000
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
P+0.002
Bo+0.002
orn+0.002
ble+0.002
un+0.002
Neg logit contributions
 knocks-0.002
cakes-0.002
chair-0.002
five-0.002
cases-0.002
Token He
Token ID679
Feature activation-2.04e-02

Pos logit contributions
 knocks+0.021
cakes+0.021
 divorce+0.019
cases+0.019
lived+0.019
Neg logit contributions
P-0.014
Bo-0.014
orn-0.013
un-0.012
G-0.012
Token found
Token ID1043
Feature activation-8.79e-03

Pos logit contributions
P+0.043
orn+0.043
Bo+0.039
un+0.038
ble+0.038
Neg logit contributions
 knocks-0.031
cakes-0.025
cases-0.025
 knocking-0.024
chair-0.024
Token a
Token ID257
Feature activation-1.14e-02

Pos logit contributions
P+0.017
Bo+0.016
orn+0.016
G+0.015
un+0.015
Neg logit contributions
 knocks-0.014
cakes-0.013
cases-0.012
chair-0.012
 James-0.012
Token wheel
Token ID7825
Feature activation-3.64e-03

Pos logit contributions
P+0.020
orn+0.019
Bo+0.019
un+0.017
G+0.016
Neg logit contributions
 knocks-0.021
cakes-0.019
chair-0.018
cases-0.018
 knocking-0.018
Tokenbar
Token ID5657
Feature activation-6.05e-02

Pos logit contributions
P+0.007
Bo+0.007
orn+0.007
un+0.006
ble+0.006
Neg logit contributions
 knocks-0.006
cakes-0.005
cases-0.005
chair-0.005
five-0.005
Tokenrow
Token ID808
Feature activation-1.84e-02

Pos logit contributions
Bo+0.081
P+0.080
G+0.075
Mart+0.069
Lex+0.065
Neg logit contributions
 knocks-0.124
cakes-0.122
cases-0.118
five-0.113
chair-0.112
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
P+0.038
Bo+0.036
orn+0.035
un+0.032
G+0.032
Neg logit contributions
 knocks-0.028
cakes-0.026
cases-0.025
chair-0.024
 manager-0.023
Token a
Token ID257
Feature activation-1.23e-02

Pos logit contributions
P+0.027
Bo+0.026
orn+0.026
un+0.024
ble+0.023
Neg logit contributions
 knocks-0.023
cakes-0.021
cases-0.019
chair-0.019
five-0.019
Token tr
Token ID491
Feature activation-2.32e-03

Pos logit contributions
P+0.024
orn+0.022
Bo+0.022
G+0.020
umper+0.020
Neg logit contributions
 knocks-0.020
cakes-0.018
 knocking-0.017
 sigh-0.017
chair-0.017
Tokenamp
Token ID696
Feature activation+1.32e-02

Pos logit contributions
P+0.004
Bo+0.004
orn+0.004
un+0.004
ble+0.004
Neg logit contributions
 knocks-0.004
cakes-0.004
cases-0.004
chair-0.003
five-0.003
Token back
Token ID736
Feature activation-8.42e-03

Pos logit contributions
 knocks+0.005
cakes+0.004
cases+0.004
 knocking+0.004
chair+0.004
Neg logit contributions
P-0.010
orn-0.009
Bo-0.009
un-0.008
G-0.008
Token with
Token ID351
Feature activation-1.23e-02

Pos logit contributions
P+0.018
Bo+0.017
orn+0.017
un+0.016
ble+0.015
Neg logit contributions
 knocks-0.012
cakes-0.011
cases-0.010
chair-0.010
five-0.010
Token a
Token ID257
Feature activation-3.52e-03

Pos logit contributions
P+0.027
Bo+0.026
orn+0.025
ble+0.024
G+0.023
Neg logit contributions
 knocks-0.017
 sigh-0.016
 Amy-0.016
 James-0.015
 leaving-0.015
Token big
Token ID1263
Feature activation+5.22e-03

Pos logit contributions
P+0.007
Bo+0.006
orn+0.006
ble+0.006
G+0.005
Neg logit contributions
 knocks-0.006
 sigh-0.006
cakes-0.005
 knocking-0.005
 manager-0.005
Token wheel
Token ID7825
Feature activation-1.05e-02

Pos logit contributions
 knocks+0.009
cakes+0.008
cases+0.007
chair+0.007
five+0.007
Neg logit contributions
P-0.010
Bo-0.009
orn-0.009
un-0.009
ble-0.008
Tokenbar
Token ID5657
Feature activation-6.04e-02

Pos logit contributions
P+0.020
orn+0.019
Bo+0.019
un+0.017
ble+0.017
Neg logit contributions
 knocks-0.018
cakes-0.016
cases-0.015
chair-0.015
five-0.015
Tokenrow
Token ID808
Feature activation-2.00e-02

Pos logit contributions
Bo+0.082
P+0.079
G+0.072
Mart+0.067
Lex+0.061
Neg logit contributions
 knocks-0.122
cakes-0.121
cases-0.119
chair-0.115
olding-0.114
Token.
Token ID13
Feature activation+1.56e-03

Pos logit contributions
P+0.042
Bo+0.040
orn+0.039
un+0.035
G+0.035
Neg logit contributions
 knocks-0.030
cakes-0.028
cases-0.026
chair-0.026
 manager-0.024
Token He
Token ID679
Feature activation-1.13e-02

Pos logit contributions
 knocks+0.003
cakes+0.002
cases+0.002
chair+0.002
five+0.002
Neg logit contributions
P-0.003
Bo-0.003
orn-0.003
un-0.003
ble-0.003
Token asked
Token ID1965
Feature activation-1.27e-02

Pos logit contributions
P+0.025
orn+0.024
Bo+0.023
ble+0.022
un+0.022
Neg logit contributions
 knocks-0.016
cakes-0.013
cases-0.013
 sigh-0.013
chair-0.012
Token the
Token ID262
Feature activation+1.12e-02

Pos logit contributions
P+0.029
orn+0.027
Bo+0.027
ble+0.025
G+0.024
Neg logit contributions
 knocks-0.017
 Amy-0.016
 James-0.016
cakes-0.015
 manager-0.015
Token the
Token ID262
Feature activation-7.81e-03

Pos logit contributions
 knocks+0.005
cakes+0.004
chair+0.004
five+0.004
 James+0.004
Neg logit contributions
P-0.006
orn-0.006
Bo-0.006
ble-0.005
G-0.005
Token cat
Token ID3797
Feature activation+8.44e-03

Pos logit contributions
P+0.015
Bo+0.014
orn+0.013
un+0.013
G+0.012
Neg logit contributions
 knocks-0.014
cakes-0.012
 sigh-0.012
 knocking-0.011
chair-0.011
Token off
Token ID572
Feature activation-1.29e-03

Pos logit contributions
 knocks+0.013
cakes+0.012
cases+0.011
chair+0.011
five+0.011
Neg logit contributions
P-0.016
Bo-0.016
orn-0.015
un-0.014
ble-0.014
Token the
Token ID262
Feature activation-1.45e-02

Pos logit contributions
P+0.003
orn+0.003
Bo+0.003
G+0.002
ble+0.002
Neg logit contributions
 knocks-0.002
cakes-0.002
chair-0.002
 James-0.002
 Amy-0.002
Token wheel
Token ID7825
Feature activation+4.98e-03

Pos logit contributions
P+0.024
Bo+0.023
orn+0.023
un+0.022
G+0.020
Neg logit contributions
 knocks-0.028
cakes-0.025
chair-0.024
 knocking-0.023
cases-0.023
Tokenbar
Token ID5657
Feature activation-6.03e-02

Pos logit contributions
 knocks+0.008
cakes+0.007
chair+0.007
cases+0.007
five+0.006
Neg logit contributions
P-0.010
Bo-0.010
orn-0.009
G-0.009
un-0.008
Tokenrow
Token ID808
Feature activation-7.83e-03

Pos logit contributions
Bo+0.082
P+0.077
Mart+0.070
G+0.069
Wal+0.061
Neg logit contributions
cakes-0.121
 knocks-0.119
cases-0.117
five-0.116
chair-0.116
Token and
Token ID290
Feature activation+2.69e-03

Pos logit contributions
P+0.017
Bo+0.016
orn+0.016
G+0.014
un+0.014
Neg logit contributions
 knocks-0.011
cakes-0.011
cases-0.010
chair-0.010
 leaving-0.009
Token put
Token ID1234
Feature activation-1.05e-02

Pos logit contributions
 knocks+0.004
cakes+0.004
chair+0.004
cases+0.004
five+0.004
Neg logit contributions
P-0.005
Bo-0.005
orn-0.005
un-0.004
ble-0.004
Token him
Token ID683
Feature activation-4.32e-03

Pos logit contributions
P+0.023
Bo+0.022
orn+0.022
G+0.021
un+0.021
Neg logit contributions
 knocks-0.014
cakes-0.012
 James-0.011
five-0.011
chair-0.011
Token back
Token ID736
Feature activation-2.23e-02

Pos logit contributions
P+0.008
Bo+0.008
orn+0.008
un+0.007
G+0.007
Neg logit contributions
 knocks-0.007
cakes-0.006
cases-0.006
chair-0.006
five-0.006
Token It
Token ID632
Feature activation+1.39e-03

Pos logit contributions
P+0.024
Bo+0.023
orn+0.023
un+0.021
ble+0.021
Neg logit contributions
 knocks-0.017
cakes-0.015
cases-0.014
chair-0.014
five-0.013
Token was
Token ID373
Feature activation-4.98e-03

Pos logit contributions
 knocks+0.002
cakes+0.002
chair+0.002
 sigh+0.002
cases+0.002
Neg logit contributions
P-0.003
orn-0.003
Bo-0.003
un-0.003
ble-0.002
Token hiding
Token ID11816
Feature activation-9.93e-03

Pos logit contributions
P+0.011
orn+0.011
Bo+0.010
un+0.009
G+0.009
Neg logit contributions
 knocks-0.007
 leaving-0.007
 knocking-0.006
 Amy-0.006
 pizza-0.006
Token under
Token ID739
Feature activation-4.41e-03

Pos logit contributions
P+0.025
orn+0.024
Bo+0.023
un+0.023
G+0.022
Neg logit contributions
 knocks-0.011
cakes-0.009
cases-0.008
 Amy-0.008
chair-0.008
Token a
Token ID257
Feature activation-1.52e-02

Pos logit contributions
P+0.009
orn+0.008
Bo+0.008
un+0.008
ble+0.007
Neg logit contributions
 knocks-0.007
cakes-0.006
 Amy-0.006
 James-0.006
 sigh-0.006
Token seaw
Token ID32085
Feature activation-5.94e-02

Pos logit contributions
P+0.026
Bo+0.024
orn+0.023
un+0.023
G+0.022
Neg logit contributions
 knocks-0.028
 sigh-0.025
chair-0.025
cakes-0.025
 knocking-0.023
Tokeneed
Token ID2308
Feature activation-3.79e-02

Pos logit contributions
P+0.086
Bo+0.084
orn+0.081
un+0.076
Mart+0.074
Neg logit contributions
 knocks-0.111
five-0.103
cakes-0.100
 sigh-0.098
cases-0.098
Token.
Token ID13
Feature activation-5.49e-03

Pos logit contributions
P+0.072
Bo+0.068
orn+0.067
un+0.062
G+0.060
Neg logit contributions
 knocks-0.064
cakes-0.057
cases-0.055
chair-0.054
five-0.052
Token Lily
Token ID20037
Feature activation-6.32e-03

Pos logit contributions
P+0.012
Bo+0.012
orn+0.011
un+0.011
ble+0.011
Neg logit contributions
 knocks-0.008
cakes-0.007
chair-0.006
cases-0.006
five-0.006
Token said
Token ID531
Feature activation-1.49e-02

Pos logit contributions
P+0.013
orn+0.013
Bo+0.012
un+0.011
ble+0.011
Neg logit contributions
 knocks-0.010
cakes-0.008
cases-0.008
chair-0.008
 sigh-0.008
Token,
Token ID11
Feature activation+2.22e-03

Pos logit contributions
P+0.030
Bo+0.028
orn+0.028
un+0.026
G+0.025
Neg logit contributions
 knocks-0.024
cakes-0.022
five-0.020
cases-0.020
chair-0.020
Token mar
Token ID1667
Feature activation-5.42e-02

Pos logit contributions
P+0.012
Bo+0.011
orn+0.011
G+0.011
Bee+0.011
Neg logit contributions
 knocks-0.009
cakes-0.007
 turns-0.007
cases-0.007
 pizza-0.007
Tokenbles
Token ID7689
Feature activation-3.78e-02

Pos logit contributions
Bo+0.065
Lex+0.062
G+0.062
orn+0.061
Bee+0.060
Neg logit contributions
cakes-0.114
cases-0.109
 knocks-0.099
oke-0.095
isions-0.093
Token,
Token ID11
Feature activation-7.19e-04

Pos logit contributions
P+0.086
Bo+0.085
orn+0.080
G+0.076
Ant+0.076
Neg logit contributions
 knocks-0.046
cakes-0.043
cases-0.040
chair-0.038
five-0.037
Token spark
Token ID9009
Feature activation-1.18e-02

Pos logit contributions
P+0.002
orn+0.001
Bo+0.001
un+0.001
ble+0.001
Neg logit contributions
 knocks-0.001
cakes-0.001
cases-0.001
 leaving-0.001
 James-0.001
Tokenly
Token ID306
Feature activation-2.90e-02

Pos logit contributions
P+0.032
Bo+0.031
orn+0.030
un+0.029
G+0.028
Neg logit contributions
 knocks-0.010
cakes-0.009
cases-0.008
chair-0.008
five-0.007
Token mar
Token ID1667
Feature activation-5.94e-02

Pos logit contributions
P+0.056
Bo+0.053
orn+0.052
G+0.050
ble+0.047
Neg logit contributions
 knocks-0.046
cakes-0.040
cases-0.039
chair-0.037
five-0.037
Tokenbles
Token ID7689
Feature activation-4.63e-02

Pos logit contributions
Bo+0.070
G+0.068
Lex+0.067
orn+0.066
P+0.065
Neg logit contributions
cakes-0.123
cases-0.118
 knocks-0.109
oke-0.103
five-0.101
Token,
Token ID11
Feature activation-6.16e-03

Pos logit contributions
P+0.103
Bo+0.100
orn+0.096
G+0.091
un+0.089
Neg logit contributions
 knocks-0.059
cakes-0.055
cases-0.051
chair-0.049
five-0.048
Token and
Token ID290
Feature activation+4.76e-03

Pos logit contributions
P+0.014
orn+0.014
Bo+0.014
un+0.013
G+0.013
Neg logit contributions
 knocks-0.008
 leaving-0.006
cakes-0.006
cases-0.006
 knocking-0.006
Token mar
Token ID1667
Feature activation-4.82e-02

Pos logit contributions
 knocks+0.008
cakes+0.007
cases+0.006
chair+0.006
 knocking+0.006
Neg logit contributions
P-0.010
Bo-0.009
orn-0.009
un-0.008
ble-0.008
Tokenbles
Token ID7689
Feature activation-3.33e-02

Pos logit contributions
orn+0.058
Bo+0.057
Lex+0.056
P+0.056
G+0.056
Neg logit contributions
cakes-0.099
cases-0.096
 knocks-0.087
oke-0.084
five-0.081
Token found
Token ID1043
Feature activation+4.72e-04

Pos logit contributions
P+0.017
orn+0.016
Bo+0.016
un+0.015
ble+0.015
Neg logit contributions
 knocks-0.014
cakes-0.012
cases-0.012
 sigh-0.011
 leaving-0.011
Token it
Token ID340
Feature activation+6.30e-03

Pos logit contributions
 knocks+0.001
cakes+0.001
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.001
Bo-0.001
orn-0.001
un-0.001
ble-0.001
Token stuck
Token ID7819
Feature activation-8.51e-03

Pos logit contributions
 knocks+0.009
cakes+0.008
cases+0.007
chair+0.007
 leaving+0.007
Neg logit contributions
P-0.014
Bo-0.013
orn-0.013
un-0.012
G-0.012
Token in
Token ID287
Feature activation-8.04e-03

Pos logit contributions
P+0.020
Bo+0.019
orn+0.019
G+0.017
un+0.017
Neg logit contributions
 knocks-0.011
 James-0.009
 leaving-0.009
 knocking-0.009
 Amy-0.009
Token some
Token ID617
Feature activation-1.05e-02

Pos logit contributions
P+0.017
orn+0.017
Bo+0.016
un+0.015
ble+0.015
Neg logit contributions
 knocks-0.012
 James-0.010
 Amy-0.010
 knocking-0.010
cakes-0.010
Token seaw
Token ID32085
Feature activation-5.93e-02

Pos logit contributions
P+0.020
Bo+0.020
un+0.018
orn+0.018
ble+0.017
Neg logit contributions
 knocks-0.018
cakes-0.015
chair-0.015
 knocking-0.015
 leaving-0.015
Tokeneed
Token ID2308
Feature activation-2.22e-02

Pos logit contributions
P+0.088
Bo+0.087
orn+0.087
un+0.080
ble+0.076
Neg logit contributions
 knocks-0.105
five-0.099
cases-0.094
cakes-0.093
ulating-0.092
Token.
Token ID13
Feature activation+2.84e-03

Pos logit contributions
P+0.042
Bo+0.040
orn+0.040
un+0.037
ble+0.036
Neg logit contributions
 knocks-0.037
cakes-0.034
chair-0.031
cases-0.031
five-0.030
Token Tim
Token ID5045
Feature activation-5.49e-04

Pos logit contributions
 knocks+0.005
cakes+0.005
cases+0.005
 divorce+0.005
lived+0.005
Neg logit contributions
P-0.005
Bo-0.005
orn-0.004
un-0.004
G-0.004
Token was
Token ID373
Feature activation+8.16e-03

Pos logit contributions
P+0.001
Bo+0.001
orn+0.001
G+0.001
ble+0.001
Neg logit contributions
 knocks-0.001
cakes-0.001
 knocking-0.001
 sigh-0.001
chair-0.001
Token so
Token ID523
Feature activation+7.48e-03

Pos logit contributions
 knocks+0.014
cakes+0.013
chair+0.012
cases+0.012
 knocking+0.012
Neg logit contributions
P-0.015
Bo-0.014
orn-0.014
un-0.013
G-0.013
Token started
Token ID2067
Feature activation-1.69e-02

Pos logit contributions
 knocks+0.008
cakes+0.007
cases+0.007
chair+0.006
five+0.006
Neg logit contributions
P-0.008
Bo-0.008
orn-0.008
un-0.007
ble-0.007
Token to
Token ID284
Feature activation+1.40e-03

Pos logit contributions
P+0.037
Bo+0.037
orn+0.035
un+0.033
ble+0.033
Neg logit contributions
 knocks-0.022
cakes-0.021
chair-0.020
cases-0.020
five-0.018
Token chew
Token ID34722
Feature activation-8.31e-03

Pos logit contributions
 knocks+0.002
 sigh+0.002
 knocking+0.002
cakes+0.002
 leaving+0.002
Neg logit contributions
P-0.003
orn-0.003
Bo-0.003
un-0.003
G-0.003
Token on
Token ID319
Feature activation-1.35e-03

Pos logit contributions
un+0.020
P+0.019
Bo+0.019
orn+0.019
ully+0.018
Neg logit contributions
 knocks-0.008
cakes-0.008
 again-0.008
 Amy-0.008
 anymore-0.007
Token some
Token ID617
Feature activation-1.14e-02

Pos logit contributions
P+0.003
Bo+0.003
orn+0.003
un+0.002
ble+0.002
Neg logit contributions
 knocks-0.002
cakes-0.002
cases-0.002
chair-0.002
 James-0.002
Token seaw
Token ID32085
Feature activation-5.89e-02

Pos logit contributions
P+0.022
Bo+0.022
orn+0.021
un+0.020
G+0.019
Neg logit contributions
 knocks-0.018
cakes-0.017
 sigh-0.015
chair-0.015
five-0.015
Tokeneed
Token ID2308
Feature activation-2.42e-02

Pos logit contributions
P+0.086
orn+0.084
Bo+0.082
un+0.078
Mart+0.072
Neg logit contributions
 knocks-0.110
cakes-0.105
five-0.104
cases-0.100
ksh-0.096
Token.
Token ID13
Feature activation+3.11e-03

Pos logit contributions
P+0.050
Bo+0.048
orn+0.047
un+0.043
G+0.042
Neg logit contributions
 knocks-0.037
cakes-0.033
cases-0.031
chair-0.031
five-0.029
Token Mary
Token ID5335
Feature activation+1.46e-02

Pos logit contributions
 knocks+0.005
cakes+0.005
cases+0.004
ksh+0.004
 divorce+0.004
Neg logit contributions
P-0.006
Bo-0.006
orn-0.005
G-0.005
un-0.005
Token was
Token ID373
Feature activation+1.86e-03

Pos logit contributions
 knocks+0.023
cakes+0.019
chair+0.018
 sigh+0.018
 James+0.017
Neg logit contributions
P-0.031
orn-0.029
Bo-0.029
un-0.027
ble-0.026
Token so
Token ID523
Feature activation+2.96e-03

Pos logit contributions
 knocks+0.003
cakes+0.003
cases+0.003
chair+0.003
five+0.003
Neg logit contributions
P-0.004
Bo-0.003
orn-0.003
un-0.003
ble-0.003
Token him
Token ID683
Feature activation+1.05e-02

Pos logit contributions
P+0.011
orn+0.011
Bo+0.011
ble+0.010
un+0.009
Neg logit contributions
 knocks-0.009
cakes-0.009
cases-0.008
 James-0.008
five-0.008
Token a
Token ID257
Feature activation+4.88e-03

Pos logit contributions
 knocks+0.016
cakes+0.015
cases+0.014
chair+0.013
five+0.013
Neg logit contributions
P-0.021
Bo-0.021
orn-0.020
G-0.019
un-0.019
Token big
Token ID1263
Feature activation+1.12e-02

Pos logit contributions
 knocks+0.008
cakes+0.008
chair+0.007
cases+0.007
five+0.007
Neg logit contributions
P-0.009
Bo-0.009
orn-0.008
ble-0.008
G-0.008
Token red
Token ID2266
Feature activation-9.30e-03

Pos logit contributions
 knocks+0.018
cakes+0.017
cases+0.016
chair+0.015
five+0.015
Neg logit contributions
P-0.021
Bo-0.020
orn-0.020
un-0.019
ble-0.018
Token wheel
Token ID7825
Feature activation-4.66e-03

Pos logit contributions
P+0.021
Bo+0.021
orn+0.019
G+0.019
ble+0.018
Neg logit contributions
 knocks-0.012
cakes-0.011
chair-0.011
five-0.010
 knocking-0.010
Tokenbar
Token ID5657
Feature activation-5.86e-02

Pos logit contributions
P+0.009
Bo+0.009
orn+0.009
un+0.008
ble+0.008
Neg logit contributions
 knocks-0.007
cakes-0.007
cases-0.006
chair-0.006
five-0.006
Tokenrow
Token ID808
Feature activation-1.45e-02

Pos logit contributions
Bo+0.097
P+0.092
G+0.086
Mart+0.081
Wal+0.079
Neg logit contributions
cakes-0.109
 knocks-0.107
cases-0.105
chair-0.102
five-0.099
Token and
Token ID290
Feature activation-1.74e-03

Pos logit contributions
P+0.031
Bo+0.030
orn+0.029
un+0.027
G+0.026
Neg logit contributions
 knocks-0.021
cakes-0.019
cases-0.018
chair-0.018
five-0.017
Token filled
Token ID5901
Feature activation+5.64e-03

Pos logit contributions
P+0.003
orn+0.003
Bo+0.003
ble+0.003
un+0.003
Neg logit contributions
 knocks-0.003
cakes-0.003
cases-0.002
chair-0.002
 James-0.002
Token it
Token ID340
Feature activation+1.74e-03

Pos logit contributions
 knocks+0.009
cakes+0.008
cases+0.008
chair+0.007
five+0.007
Neg logit contributions
P-0.011
Bo-0.011
orn-0.011
un-0.010
ble-0.009
Token with
Token ID351
Feature activation+5.96e-04

Pos logit contributions
 knocks+0.003
cakes+0.002
chair+0.002
cases+0.002
five+0.002
Neg logit contributions
P-0.003
Bo-0.003
orn-0.003
ble-0.003
un-0.003
Token to
Token ID284
Feature activation-2.35e-03

Pos logit contributions
 knocks+0.014
cakes+0.012
cases+0.011
chair+0.011
five+0.010
Neg logit contributions
P-0.026
Bo-0.025
orn-0.025
un-0.023
ble-0.023
Token keep
Token ID1394
Feature activation-1.08e-02

Pos logit contributions
P+0.006
orn+0.005
Bo+0.005
un+0.005
ery+0.005
Neg logit contributions
 knocks-0.003
 sigh-0.002
 report-0.002
 knocking-0.002
 leaving-0.002
Token playing
Token ID2712
Feature activation-2.03e-02

Pos logit contributions
P+0.022
Bo+0.021
orn+0.021
un+0.019
G+0.019
Neg logit contributions
 knocks-0.016
cakes-0.014
 knocking-0.014
 leaving-0.013
 James-0.013
Token with
Token ID351
Feature activation-1.89e-02

Pos logit contributions
Bo+0.041
P+0.040
orn+0.039
un+0.037
G+0.037
Neg logit contributions
 knocks-0.030
cakes-0.027
chair-0.026
 James-0.025
 Amy-0.024
Token your
Token ID534
Feature activation-1.51e-02

Pos logit contributions
Bo+0.038
P+0.037
orn+0.036
G+0.034
un+0.034
Neg logit contributions
 knocks-0.027
 James-0.024
cakes-0.024
 Amy-0.023
 pizza-0.023
Token mar
Token ID1667
Feature activation-5.83e-02

Pos logit contributions
P+0.028
Bo+0.026
orn+0.026
G+0.025
un+0.024
Neg logit contributions
 knocks-0.026
cakes-0.022
 leaving-0.021
 sigh-0.021
 pizza-0.021
Tokenbles
Token ID7689
Feature activation-5.17e-02

Pos logit contributions
Bo+0.072
G+0.063
Lex+0.063
Mo+0.061
Ant+0.059
Neg logit contributions
cakes-0.132
cases-0.124
 knocks-0.114
 manager-0.112
cake-0.108
Token all
Token ID477
Feature activation-1.90e-02

Pos logit contributions
Bo+0.103
P+0.095
orn+0.089
G+0.086
Mart+0.086
Neg logit contributions
 knocks-0.080
cakes-0.080
chair-0.078
cases-0.075
 sigh-0.070
Token day
Token ID1110
Feature activation-1.30e-02

Pos logit contributions
P+0.038
orn+0.036
Bo+0.036
un+0.033
ble+0.032
Neg logit contributions
 knocks-0.030
cakes-0.028
cases-0.026
chair-0.025
five-0.024
Token?"
Token ID1701
Feature activation+6.79e-04

Pos logit contributions
P+0.023
Bo+0.022
orn+0.022
un+0.020
ble+0.020
Neg logit contributions
 knocks-0.023
cakes-0.021
cases-0.020
chair-0.020
five-0.019
Token it
Token ID340
Feature activation+1.69e-03

Pos logit contributions
 knocks+0.001
cakes+0.001
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.001
Bo-0.001
orn-0.001
un-0.001
ble-0.001
Token will
Token ID481
Feature activation-1.28e-02

Pos logit contributions
 knocks+0.010
cakes+0.009
five+0.008
cases+0.008
chair+0.008
Neg logit contributions
P-0.013
Bo-0.012
orn-0.012
ble-0.011
un-0.011
Token make
Token ID787
Feature activation+7.47e-03

Pos logit contributions
P+0.032
orn+0.032
Bo+0.029
G+0.027
un+0.027
Neg logit contributions
 knocks-0.016
 knocking-0.012
 report-0.012
 knock-0.012
 sigh-0.012
Token a
Token ID257
Feature activation+4.22e-03

Pos logit contributions
 knocks+0.012
cakes+0.011
cases+0.010
chair+0.010
five+0.010
Neg logit contributions
P-0.014
Bo-0.014
orn-0.014
un-0.013
ble-0.012
Token salad
Token ID20698
Feature activation-2.10e-02

Pos logit contributions
 knocks+0.006
cakes+0.005
cases+0.005
chair+0.005
five+0.004
Neg logit contributions
P-0.009
Bo-0.009
orn-0.009
un-0.008
ble-0.008
Token with
Token ID351
Feature activation-6.94e-03

Pos logit contributions
P+0.044
Bo+0.042
orn+0.042
un+0.039
ble+0.038
Neg logit contributions
 knocks-0.031
cakes-0.028
cases-0.026
chair-0.026
five-0.025
Token lettuce
Token ID39406
Feature activation-5.82e-02

Pos logit contributions
Bo+0.015
P+0.015
orn+0.015
un+0.015
ble+0.014
Neg logit contributions
 knocks-0.009
cakes-0.008
 pizza-0.008
 James-0.007
 Amy-0.007
Token,
Token ID11
Feature activation-4.54e-03

Pos logit contributions
P+0.123
Bo+0.120
orn+0.117
un+0.110
G+0.107
Neg logit contributions
 knocks-0.084
cakes-0.077
cases-0.070
chair-0.066
 leaving-0.065
Token tomato
Token ID24240
Feature activation-3.70e-02

Pos logit contributions
P+0.009
Bo+0.009
orn+0.009
un+0.008
ble+0.008
Neg logit contributions
 knocks-0.007
cakes-0.006
cases-0.006
chair-0.006
five-0.005
Token,
Token ID11
Feature activation-9.03e-03

Pos logit contributions
P+0.076
Bo+0.075
orn+0.071
un+0.070
G+0.067
Neg logit contributions
 knocks-0.056
cakes-0.052
cases-0.045
chair-0.045
 pizza-0.044
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
P+0.018
Bo+0.017
orn+0.017
un+0.016
ble+0.016
Neg logit contributions
 knocks-0.014
cakes-0.013
cases-0.012
chair-0.011
 sigh-0.011
Token cheese
Token ID9891
Feature activation-3.52e-02

Pos logit contributions
P+0.021
Bo+0.020
orn+0.020
un+0.018
ble+0.018
Neg logit contributions
 knocks-0.019
cakes-0.018
cases-0.017
chair-0.017
five-0.016
Token light
Token ID1657
Feature activation+1.30e-02

Pos logit contributions
 knocks+0.014
cakes+0.012
 sigh+0.012
 knocking+0.012
five+0.012
Neg logit contributions
P-0.023
orn-0.021
Bo-0.021
un-0.019
G-0.019
Tokenen
Token ID268
Feature activation-3.12e-04

Pos logit contributions
 knocks+0.021
cakes+0.018
cases+0.018
chair+0.017
 James+0.017
Neg logit contributions
P-0.027
Bo-0.025
orn-0.025
G-0.023
un-0.022
Token up
Token ID510
Feature activation-7.70e-03

Pos logit contributions
P+0.001
Bo+0.001
orn+0.001
G+0.001
un+0.001
Neg logit contributions
 knocks-0.001
cakes-0.000
cases-0.000
chair-0.000
 knocking-0.000
Token the
Token ID262
Feature activation-6.12e-03

Pos logit contributions
P+0.018
Bo+0.018
orn+0.017
un+0.016
G+0.016
Neg logit contributions
 knocks-0.009
cakes-0.008
 leaving-0.008
cases-0.008
chair-0.008
Token wheel
Token ID7825
Feature activation+7.29e-04

Pos logit contributions
P+0.011
Bo+0.011
orn+0.010
un+0.010
G+0.009
Neg logit contributions
 knocks-0.011
 knocking-0.009
cakes-0.009
 sigh-0.009
chair-0.009
Tokenbar
Token ID5657
Feature activation-5.80e-02

Pos logit contributions
 knocks+0.001
cakes+0.001
chair+0.001
cases+0.001
 knocking+0.001
Neg logit contributions
P-0.002
Bo-0.001
orn-0.001
G-0.001
un-0.001
Tokenrow
Token ID808
Feature activation-9.64e-03

Pos logit contributions
Bo+0.078
P+0.074
G+0.066
Mart+0.066
orn+0.058
Neg logit contributions
cakes-0.116
 knocks-0.114
cases-0.113
olding-0.107
chair-0.107
Token.
Token ID13
Feature activation+2.12e-03

Pos logit contributions
P+0.022
Bo+0.021
orn+0.020
G+0.018
un+0.018
Neg logit contributions
 knocks-0.013
cakes-0.012
cases-0.011
chair-0.011
 manager-0.011
Token<|endoftext|>
Token ID50256
Feature activation+1.00e-02

Pos logit contributions
 knocks+0.004
cakes+0.003
chair+0.003
cases+0.003
 James+0.003
Neg logit contributions
P-0.004
orn-0.004
Bo-0.004
ble-0.004
G-0.003
TokenOnce
Token ID7454
Feature activation-8.49e-03

Pos logit contributions
 knocks+0.016
cakes+0.014
chair+0.014
 James+0.014
 Amy+0.014
Neg logit contributions
P-0.019
Bo-0.018
orn-0.017
ble-0.017
un-0.016
Token there
Token ID612
Feature activation+1.39e-02

Pos logit contributions
P+0.012
Bo+0.011
orn+0.011
un+0.010
ble+0.010
Neg logit contributions
 knocks-0.018
cakes-0.017
cases-0.016
chair-0.016
five-0.016
Token play
Token ID711
Feature activation-1.43e-02

Pos logit contributions
P+0.028
orn+0.028
Bo+0.028
ery+0.027
este+0.026
Neg logit contributions
 knocks-0.010
cakes-0.008
 buys-0.008
 knock-0.008
five-0.008
Token with
Token ID351
Feature activation-1.51e-02

Pos logit contributions
Bo+0.035
orn+0.034
P+0.034
G+0.032
este+0.032
Neg logit contributions
 knocks-0.015
cakes-0.013
five-0.011
 pizza-0.011
 manager-0.010
Token you
Token ID345
Feature activation-1.90e-02

Pos logit contributions
orn+0.035
Bo+0.035
P+0.034
ery+0.032
G+0.032
Neg logit contributions
 knocks-0.017
cakes-0.014
 pizza-0.014
 James-0.013
 we-0.013
Token and
Token ID290
Feature activation-7.56e-03

Pos logit contributions
Bo+0.037
P+0.037
orn+0.035
G+0.033
Mo+0.032
Neg logit contributions
 knocks-0.031
cakes-0.026
 James-0.026
 anymore-0.026
 again-0.026
Token your
Token ID534
Feature activation-1.31e-02

Pos logit contributions
P+0.017
orn+0.016
Bo+0.016
un+0.015
ble+0.015
Neg logit contributions
 knocks-0.011
cakes-0.009
 James-0.009
 Amy-0.009
chair-0.008
Token mar
Token ID1667
Feature activation-5.77e-02

Pos logit contributions
P+0.025
orn+0.024
G+0.024
Bo+0.023
Lex+0.022
Neg logit contributions
 knocks-0.022
cakes-0.018
 pizza-0.017
 buys-0.017
 manager-0.017
Tokenbles
Token ID7689
Feature activation-4.97e-02

Pos logit contributions
Bo+0.074
G+0.072
Lex+0.071
Mo+0.068
Bee+0.067
Neg logit contributions
cakes-0.123
cases-0.111
 knocks-0.107
 manager-0.103
oke-0.101
Token?"
Token ID1701
Feature activation-4.96e-03

Pos logit contributions
Bo+0.106
P+0.097
Mo+0.096
G+0.094
Mart+0.090
Neg logit contributions
 knocks-0.071
cakes-0.069
 anymore-0.067
chair-0.063
 again-0.061
Token The
Token ID383
Feature activation-2.05e-03

Pos logit contributions
P+0.010
Bo+0.009
orn+0.009
ble+0.008
un+0.008
Neg logit contributions
 knocks-0.008
cakes-0.007
chair-0.007
five-0.007
 James-0.007
Token boy
Token ID2933
Feature activation-4.72e-03

Pos logit contributions
P+0.004
Bo+0.004
orn+0.004
ble+0.003
un+0.003
Neg logit contributions
 knocks-0.003
cakes-0.003
cases-0.003
chair-0.003
lived-0.003
Token looked
Token ID3114
Feature activation-9.16e-03

Pos logit contributions
P+0.008
Bo+0.008
orn+0.007
un+0.007
ble+0.007
Neg logit contributions
 knocks-0.008
cakes-0.008
cases-0.008
chair-0.007
 inconvenience-0.007
Token She
Token ID1375
Feature activation-6.54e-03

Pos logit contributions
P+0.012
Bo+0.012
orn+0.012
ble+0.011
un+0.011
Neg logit contributions
 knocks-0.010
cakes-0.009
chair-0.008
cases-0.008
five-0.008
Token also
Token ID635
Feature activation-7.78e-05

Pos logit contributions
P+0.014
orn+0.014
Bo+0.013
un+0.013
ble+0.012
Neg logit contributions
 knocks-0.010
cases-0.008
cakes-0.008
chair-0.008
 James-0.007
Token picked
Token ID6497
Feature activation+1.08e-04

Pos logit contributions
orn+0.000
P+0.000
Bo+0.000
ble+0.000
un+0.000
Neg logit contributions
 knocks-0.000
 knocking-0.000
 buys-0.000
cases-0.000
cakes-0.000
Token some
Token ID617
Feature activation-2.52e-03

Pos logit contributions
 knocks+0.000
cakes+0.000
cases+0.000
five+0.000
 James+0.000
Neg logit contributions
P-0.000
Bo-0.000
orn-0.000
un-0.000
G-0.000
Token green
Token ID4077
Feature activation-1.46e-02

Pos logit contributions
P+0.005
Bo+0.005
orn+0.005
G+0.004
un+0.004
Neg logit contributions
 knocks-0.004
cakes-0.004
cases-0.003
chair-0.003
 pizza-0.003
Token seaw
Token ID32085
Feature activation-5.77e-02

Pos logit contributions
P+0.028
Bo+0.025
orn+0.025
un+0.024
G+0.024
Neg logit contributions
 knocks-0.024
cakes-0.023
cases-0.021
chair-0.020
five-0.020
Tokeneed
Token ID2308
Feature activation-2.51e-02

Pos logit contributions
P+0.087
orn+0.084
Bo+0.081
un+0.077
Ant+0.073
Neg logit contributions
 knocks-0.102
cakes-0.102
cases-0.098
five-0.097
isions-0.091
Token to
Token ID284
Feature activation-4.73e-03

Pos logit contributions
P+0.054
Bo+0.051
orn+0.050
un+0.047
G+0.046
Neg logit contributions
 knocks-0.036
cakes-0.034
cases-0.032
chair-0.030
five-0.029
Token show
Token ID905
Feature activation+1.07e-02

Pos logit contributions
P+0.011
orn+0.011
Bo+0.011
un+0.010
Ant+0.009
Neg logit contributions
 knocks-0.005
 report-0.005
 knock-0.005
cakes-0.005
five-0.004
Token her
Token ID607
Feature activation-4.07e-03

Pos logit contributions
 knocks+0.018
cakes+0.017
cases+0.016
 James+0.015
 Amy+0.015
Neg logit contributions
P-0.021
orn-0.020
Bo-0.019
un-0.017
G-0.017
Token friends
Token ID2460
Feature activation+4.56e-03

Pos logit contributions
P+0.008
orn+0.007
Bo+0.007
G+0.006
un+0.006
Neg logit contributions
 knocks-0.007
cakes-0.007
cases-0.007
chair-0.006
 James-0.006
Token a
Token ID257
Feature activation-1.40e-03

Pos logit contributions
 knocks+0.027
cakes+0.025
cases+0.024
chair+0.023
five+0.023
Neg logit contributions
P-0.027
Bo-0.026
orn-0.025
un-0.023
ble-0.023
Token big
Token ID1263
Feature activation+4.14e-03

Pos logit contributions
Bo+0.003
P+0.003
orn+0.002
ble+0.002
un+0.002
Neg logit contributions
 knocks-0.002
cakes-0.002
 sigh-0.002
 manager-0.002
 pizza-0.002
Token,
Token ID11
Feature activation+5.13e-03

Pos logit contributions
 knocks+0.007
cakes+0.006
chair+0.006
 sigh+0.006
cases+0.006
Neg logit contributions
P-0.008
Bo-0.007
orn-0.007
un-0.007
ble-0.006
Token spinning
Token ID19493
Feature activation+1.78e-03

Pos logit contributions
 knocks+0.010
cakes+0.009
cases+0.008
chair+0.008
five+0.008
Neg logit contributions
P-0.009
Bo-0.008
orn-0.008
un-0.007
ble-0.007
Token wheel
Token ID7825
Feature activation+3.27e-03

Pos logit contributions
 knocks+0.003
cakes+0.003
chair+0.003
five+0.002
 sigh+0.002
Neg logit contributions
Bo-0.003
P-0.003
orn-0.003
G-0.003
un-0.003
Tokenbar
Token ID5657
Feature activation-5.73e-02

Pos logit contributions
 knocks+0.005
cakes+0.005
chair+0.004
cases+0.004
five+0.004
Neg logit contributions
P-0.007
Bo-0.006
orn-0.006
G-0.006
un-0.006
Tokenrow
Token ID808
Feature activation-1.37e-02

Pos logit contributions
Bo+0.086
P+0.082
G+0.077
Mart+0.073
Wal+0.068
Neg logit contributions
cakes-0.108
 knocks-0.107
cases-0.106
five-0.102
chair-0.101
Token and
Token ID290
Feature activation-1.76e-03

Pos logit contributions
P+0.029
Bo+0.028
orn+0.027
un+0.025
G+0.025
Neg logit contributions
 knocks-0.020
cakes-0.019
cases-0.018
chair-0.017
 manager-0.016
Token put
Token ID1234
Feature activation-9.79e-03

Pos logit contributions
P+0.004
orn+0.003
Bo+0.003
un+0.003
ble+0.003
Neg logit contributions
 knocks-0.003
cakes-0.002
 knocking-0.002
cases-0.002
chair-0.002
Token the
Token ID262
Feature activation-7.41e-03

Pos logit contributions
P+0.022
Bo+0.021
orn+0.020
G+0.019
un+0.019
Neg logit contributions
 knocks-0.013
cakes-0.012
five-0.011
 James-0.011
 knocking-0.011
Token pumpkin
Token ID30089
Feature activation-1.01e-02

Pos logit contributions
P+0.013
Bo+0.013
orn+0.012
un+0.012
G+0.011
Neg logit contributions
 knocks-0.013
cakes-0.012
five-0.011
 knocking-0.011
cases-0.011
Token t
Token ID256
Feature activation-6.52e-03

Pos logit contributions
 knocks+0.010
 sigh+0.008
chair+0.008
cakes+0.008
 leaving+0.008
Neg logit contributions
P-0.013
Bo-0.012
orn-0.012
G-0.011
un-0.011
Tokeneddy
Token ID21874
Feature activation-1.07e-02

Pos logit contributions
P+0.014
Bo+0.013
orn+0.012
G+0.012
Ant+0.011
Neg logit contributions
 knocks-0.009
cakes-0.009
chair-0.009
cases-0.008
five-0.008
Token bear
Token ID6842
Feature activation-8.34e-03

Pos logit contributions
P+0.020
Bo+0.019
G+0.018
orn+0.017
un+0.017
Neg logit contributions
 knocks-0.019
chair-0.016
cakes-0.016
 James-0.016
 sigh-0.015
Token.
Token ID13
Feature activation-2.52e-03

Pos logit contributions
P+0.017
Bo+0.017
orn+0.016
un+0.015
G+0.015
Neg logit contributions
 knocks-0.013
cakes-0.011
cases-0.010
chair-0.010
five-0.010
Token The
Token ID383
Feature activation+5.29e-03

Pos logit contributions
P+0.005
Bo+0.005
orn+0.004
un+0.004
G+0.004
Neg logit contributions
 knocks-0.004
cakes-0.004
cases-0.004
chair-0.004
 sigh-0.003
Token sun
Token ID4252
Feature activation+3.47e-02

Pos logit contributions
 knocks+0.008
chair+0.007
 sigh+0.006
 manager+0.006
 leaving+0.006
Neg logit contributions
P-0.011
Bo-0.011
orn-0.010
G-0.010
un-0.010
Token is
Token ID318
Feature activation-1.08e-02

Pos logit contributions
 knocks+0.057
cakes+0.052
cases+0.049
chair+0.048
five+0.046
Neg logit contributions
P-0.067
Bo-0.064
orn-0.063
un-0.058
ble-0.057
Token mild
Token ID11607
Feature activation+6.85e-03

Pos logit contributions
P+0.023
orn+0.023
Bo+0.022
un+0.020
Mart+0.020
Neg logit contributions
 knocks-0.015
 leaving-0.015
 knocking-0.013
cakes-0.013
 sigh-0.012
Token.
Token ID13
Feature activation+3.17e-03

Pos logit contributions
 knocks+0.016
 divorce+0.015
cakes+0.014
lived+0.014
cases+0.014
Neg logit contributions
P-0.007
orn-0.007
Bo-0.006
ble-0.006
ery-0.006
Token It
Token ID632
Feature activation+1.15e-02

Pos logit contributions
 knocks+0.005
cakes+0.004
cases+0.004
chair+0.004
five+0.004
Neg logit contributions
P-0.007
Bo-0.006
orn-0.006
un-0.006
ble-0.006
Token is
Token ID318
Feature activation-3.60e-03

Pos logit contributions
 knocks+0.017
cakes+0.016
cases+0.015
chair+0.015
five+0.014
Neg logit contributions
P-0.023
Bo-0.022
orn-0.022
un-0.020
ble-0.020
Token's
Token ID338
Feature activation+1.73e-02

Pos logit contributions
cakes+0.021
 knocks+0.021
ksh+0.019
five+0.019
cases+0.019
Neg logit contributions
P-0.023
orn-0.022
un-0.022
ble-0.021
Bo-0.021
Token hair
Token ID4190
Feature activation-4.98e-03

Pos logit contributions
cakes+0.029
 knocks+0.029
ksh+0.027
cases+0.027
chair+0.026
Neg logit contributions
P-0.028
orn-0.027
ble-0.027
un-0.026
Bo-0.026
Token and
Token ID290
Feature activation+9.62e-04

Pos logit contributions
P+0.009
Bo+0.009
orn+0.009
un+0.008
G+0.008
Neg logit contributions
 knocks-0.008
cakes-0.008
cases-0.007
chair-0.007
five-0.007
Token she
Token ID673
Feature activation+1.28e-03

Pos logit contributions
 knocks+0.002
cakes+0.002
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
ble-0.001
Token was
Token ID373
Feature activation+4.84e-03

Pos logit contributions
 knocks+0.002
cakes+0.002
cases+0.002
chair+0.002
five+0.002
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
ble-0.002
Token speech
Token ID4046
Feature activation+3.66e-02

Pos logit contributions
 knocks+0.008
cakes+0.007
cases+0.007
chair+0.007
five+0.006
Neg logit contributions
P-0.010
Bo-0.009
orn-0.009
un-0.008
ble-0.008
Tokenless
Token ID1203
Feature activation+7.20e-04

Pos logit contributions
 knocks+0.054
cakes+0.043
five+0.040
 inconvenience+0.039
 Amy+0.039
Neg logit contributions
un-0.078
P-0.076
Bo-0.072
ble-0.070
orn-0.069
Token.
Token ID13
Feature activation-4.00e-03

Pos logit contributions
 knocks+0.001
cakes+0.001
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.001
Bo-0.001
orn-0.001
un-0.001
ble-0.001
Token She
Token ID1375
Feature activation-5.02e-03

Pos logit contributions
P+0.007
Bo+0.007
orn+0.006
un+0.006
ble+0.006
Neg logit contributions
 knocks-0.007
cakes-0.007
chair-0.006
cases-0.006
five-0.006
Token did
Token ID750
Feature activation-1.68e-03

Pos logit contributions
P+0.010
orn+0.010
Bo+0.010
ble+0.009
un+0.009
Neg logit contributions
 knocks-0.008
cakes-0.007
cases-0.006
chair-0.006
 sigh-0.006
Token not
Token ID407
Feature activation+6.78e-03

Pos logit contributions
P+0.002
ble+0.002
Bo+0.002
un+0.002
G+0.002
Neg logit contributions
ksh-0.003
 knocks-0.003
cakes-0.003
chair-0.003
cases-0.003
Token Now
Token ID2735
Feature activation-1.22e-04

Pos logit contributions
 knocks+0.001
cakes+0.001
chair+0.001
 James+0.001
 Amy+0.001
Neg logit contributions
P-0.001
orn-0.001
Bo-0.001
ble-0.001
un-0.001
Token,
Token ID11
Feature activation+5.15e-03

Pos logit contributions
P+0.000
Bo+0.000
orn+0.000
un+0.000
ble+0.000
Neg logit contributions
 knocks-0.000
cakes-0.000
cases-0.000
chair-0.000
five-0.000
Token let
Token ID1309
Feature activation+3.87e-03

Pos logit contributions
 knocks+0.007
cakes+0.006
cases+0.006
chair+0.006
five+0.005
Neg logit contributions
P-0.011
Bo-0.011
orn-0.011
un-0.010
ble-0.010
Tokenรข
Token ID22940
Feature activation+1.15e-02

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.003
chair+0.003
five+0.003
Neg logit contributions
P-0.010
Bo-0.009
orn-0.009
un-0.009
ble-0.008
Tokenโ‚ฌ
Token ID26391
Feature activation+2.62e-03

Pos logit contributions
 knocks+0.018
cakes+0.016
cases+0.016
chair+0.015
 leaving+0.014
Neg logit contributions
P-0.024
orn-0.022
Bo-0.022
un-0.020
ble-0.020
Tokenโ„ข
Token ID8151
Feature activation+3.95e-02

Pos logit contributions
 knocks+0.005
cakes+0.005
 knocking+0.004
 manager+0.004
lived+0.004
Neg logit contributions
P-0.004
Bo-0.004
orn-0.004
un-0.004
G-0.004
Tokens
Token ID82
Feature activation-4.74e-03

Pos logit contributions
 knocks+0.058
cakes+0.054
cases+0.050
chair+0.049
five+0.047
Neg logit contributions
P-0.083
Bo-0.080
orn-0.078
ble-0.071
un-0.071
Token start
Token ID923
Feature activation+6.30e-03

Pos logit contributions
P+0.011
Bo+0.010
orn+0.010
G+0.009
un+0.009
Neg logit contributions
 knocks-0.007
cakes-0.006
chair-0.005
 sigh-0.005
 Amy-0.005
Token playing
Token ID2712
Feature activation+3.03e-03

Pos logit contributions
 knocks+0.009
cakes+0.008
chair+0.008
cases+0.008
five+0.008
Neg logit contributions
P-0.013
Bo-0.013
orn-0.012
un-0.011
G-0.011
Token!"
Token ID2474
Feature activation-8.26e-03

Pos logit contributions
 knocks+0.004
cakes+0.004
chair+0.003
five+0.003
 Amy+0.003
Neg logit contributions
Bo-0.007
P-0.007
orn-0.007
un-0.006
G-0.006
Token
Token ID220
Feature activation-1.16e-02

Pos logit contributions
P+0.017
orn+0.017
Bo+0.017
ble+0.015
ery+0.015
Neg logit contributions
 knocks-0.012
cakes-0.011
chair-0.010
 James-0.010
cases-0.010
Token play
Token ID711
Feature activation-1.21e-02

Pos logit contributions
 knocks+0.018
cakes+0.016
 sigh+0.014
chair+0.014
five+0.014
Neg logit contributions
P-0.035
orn-0.034
Bo-0.034
un-0.031
ble-0.031
Token together
Token ID1978
Feature activation+8.87e-03

Pos logit contributions
Bo+0.026
orn+0.026
P+0.025
un+0.025
G+0.024
Neg logit contributions
 knocks-0.015
cakes-0.013
chair-0.012
 Amy-0.012
 James-0.012
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
 knocks+0.011
cakes+0.009
chair+0.009
 anymore+0.009
 James+0.008
Neg logit contributions
Bo-0.020
P-0.020
orn-0.020
un-0.019
G-0.019
Token Today
Token ID6288
Feature activation+1.24e-02

Pos logit contributions
 knocks+0.022
cakes+0.021
cases+0.020
lived+0.020
chair+0.020
Neg logit contributions
P-0.018
Bo-0.017
orn-0.016
G-0.015
un-0.015
Token is
Token ID318
Feature activation+1.13e-02

Pos logit contributions
ksh+0.025
 knocks+0.025
lived+0.025
cakes+0.024
five+0.024
Neg logit contributions
P-0.016
orn-0.014
Bo-0.014
ble-0.013
un-0.013
Token their
Token ID511
Feature activation+3.83e-02

Pos logit contributions
 knocks+0.021
cakes+0.020
cases+0.019
five+0.019
chair+0.019
Neg logit contributions
P-0.019
Bo-0.017
orn-0.017
un-0.016
ble-0.016
Token birthday
Token ID10955
Feature activation+3.12e-02

Pos logit contributions
 knocks+0.057
cakes+0.053
cases+0.049
chair+0.048
five+0.047
Neg logit contributions
P-0.078
Bo-0.074
orn-0.073
un-0.068
ble-0.068
Token.
Token ID13
Feature activation+6.00e-03

Pos logit contributions
 knocks+0.056
cakes+0.049
cases+0.048
five+0.047
lived+0.047
Neg logit contributions
P-0.054
orn-0.048
Bo-0.048
un-0.045
ble-0.043
Token They
Token ID1119
Feature activation+2.55e-03

Pos logit contributions
 knocks+0.010
cakes+0.009
cases+0.009
chair+0.009
lived+0.009
Neg logit contributions
P-0.011
Bo-0.011
orn-0.010
G-0.010
un-0.009
Token are
Token ID389
Feature activation+1.03e-02

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.003
chair+0.003
five+0.003
Neg logit contributions
P-0.005
Bo-0.005
orn-0.005
un-0.005
ble-0.004
Token very
Token ID845
Feature activation+2.29e-02

Pos logit contributions
 knocks+0.020
cakes+0.020
chair+0.019
five+0.019
cases+0.018
Neg logit contributions
P-0.015
Bo-0.013
un-0.012
ble-0.012
orn-0.012
Token it
Token ID340
Feature activation+1.13e-02

Pos logit contributions
 knocks+0.021
cakes+0.020
cases+0.018
chair+0.018
five+0.018
Neg logit contributions
P-0.025
Bo-0.024
orn-0.024
un-0.022
ble-0.021
Token was
Token ID373
Feature activation-3.04e-04

Pos logit contributions
 knocks+0.018
cakes+0.017
cases+0.015
chair+0.015
 sigh+0.015
Neg logit contributions
P-0.023
Bo-0.022
orn-0.021
un-0.020
G-0.019
Token the
Token ID262
Feature activation+2.26e-02

Pos logit contributions
P+0.001
Bo+0.001
orn+0.001
un+0.001
G+0.001
Neg logit contributions
 knocks-0.000
cakes-0.000
 leaving-0.000
 James-0.000
 knocking-0.000
Token most
Token ID749
Feature activation+1.87e-02

Pos logit contributions
 knocks+0.041
cakes+0.038
cases+0.036
chair+0.036
five+0.035
Neg logit contributions
P-0.040
Bo-0.038
orn-0.037
un-0.034
ble-0.033
Token delicious
Token ID12625
Feature activation+6.68e-03

Pos logit contributions
 knocks+0.039
cases+0.034
cakes+0.033
 begs+0.032
chair+0.032
Neg logit contributions
P-0.029
orn-0.026
Bo-0.026
ble-0.024
G-0.023
Token thing
Token ID1517
Feature activation+3.62e-02

Pos logit contributions
 knocks+0.012
cases+0.012
ounded+0.010
cakes+0.010
ksh+0.010
Neg logit contributions
P-0.011
orn-0.011
Bo-0.010
 feather-0.010
ble-0.010
Token she
Token ID673
Feature activation+1.77e-02

Pos logit contributions
 knocks+0.064
cakes+0.055
cases+0.054
five+0.053
ksh+0.053
Neg logit contributions
P-0.063
orn-0.060
Bo-0.059
un-0.055
ble-0.054
Token had
Token ID550
Feature activation-1.83e-02

Pos logit contributions
 knocks+0.032
cakes+0.030
cases+0.029
chair+0.028
five+0.027
Neg logit contributions
P-0.031
Bo-0.029
orn-0.029
un-0.026
G-0.025
Token ever
Token ID1683
Feature activation-3.39e-04

Pos logit contributions
P+0.036
Bo+0.034
orn+0.034
un+0.031
ble+0.030
Neg logit contributions
 knocks-0.030
cakes-0.027
cases-0.025
chair-0.025
five-0.025
Token tasted
Token ID29187
Feature activation+2.49e-03

Pos logit contributions
P+0.001
Bo+0.001
orn+0.001
G+0.001
un+0.001
Neg logit contributions
 knocks-0.001
cakes-0.001
chair-0.001
cases-0.001
 James-0.000
Token!
Token ID0
Feature activation+6.01e-03

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.003
chair+0.003
five+0.003
Neg logit contributions
P-0.005
Bo-0.005
orn-0.005
un-0.004
ble-0.004
Token wished
Token ID16555
Feature activation+8.18e-03

Pos logit contributions
P+0.001
Bo+0.001
orn+0.001
ble+0.001
un+0.001
Neg logit contributions
 knocks-0.001
cakes-0.001
chair-0.001
 sigh-0.001
cases-0.001
Token he
Token ID339
Feature activation+3.77e-03

Pos logit contributions
 knocks+0.014
cakes+0.012
cases+0.012
chair+0.012
five+0.012
Neg logit contributions
P-0.015
Bo-0.014
orn-0.014
ble-0.013
un-0.013
Token didn
Token ID1422
Feature activation+9.74e-03

Pos logit contributions
 knocks+0.008
cakes+0.007
five+0.007
cases+0.007
 begs+0.007
Neg logit contributions
P-0.006
orn-0.005
Bo-0.005
un-0.004
ble-0.004
Token't
Token ID470
Feature activation-9.98e-04

Pos logit contributions
 knocks+0.017
cakes+0.016
cases+0.015
chair+0.015
five+0.014
Neg logit contributions
P-0.017
Bo-0.016
orn-0.016
un-0.015
ble-0.015
Token do
Token ID466
Feature activation+1.11e-02

Pos logit contributions
P+0.002
Bo+0.002
orn+0.002
un+0.002
G+0.001
Neg logit contributions
cakes-0.002
cases-0.002
 knocks-0.002
ksh-0.002
chair-0.002
Token that
Token ID326
Feature activation+2.23e-02

Pos logit contributions
cakes+0.022
 knocks+0.022
ksh+0.022
cases+0.022
chair+0.021
Neg logit contributions
P-0.016
ble-0.015
Bo-0.014
orn-0.014
un-0.012
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
cases+0.041
 knocks+0.040
cakes+0.039
five+0.037
ounded+0.037
Neg logit contributions
P-0.034
orn-0.033
Bo-0.032
ble-0.030
un-0.029
Token He
Token ID679
Feature activation-5.54e-03

Pos logit contributions
P+0.020
Bo+0.019
orn+0.019
un+0.017
ble+0.017
Neg logit contributions
 knocks-0.019
cakes-0.017
chair-0.017
cases-0.016
 James-0.016
Token said
Token ID531
Feature activation-8.86e-03

Pos logit contributions
P+0.010
orn+0.010
Bo+0.010
un+0.009
ble+0.009
Neg logit contributions
 knocks-0.010
cakes-0.008
chair-0.008
 sigh-0.008
cases-0.008
Token sorry
Token ID7926
Feature activation+2.81e-03

Pos logit contributions
P+0.021
Bo+0.020
orn+0.020
un+0.018
G+0.018
Neg logit contributions
 knocks-0.011
cakes-0.010
cases-0.009
chair-0.009
five-0.008
Token to
Token ID284
Feature activation+5.03e-03

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.003
chair+0.003
lived+0.003
Neg logit contributions
P-0.006
Bo-0.006
orn-0.005
un-0.005
ble-0.005
Token you
Token ID345
Feature activation+4.37e-03

Pos logit contributions
 knocks+0.002
cakes+0.002
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
ble-0.002
Token,
Token ID11
Feature activation+1.66e-02

Pos logit contributions
 knocks+0.008
cakes+0.007
cases+0.007
chair+0.007
five+0.006
Neg logit contributions
P-0.008
Bo-0.007
orn-0.007
un-0.007
ble-0.007
Token Mom
Token ID11254
Feature activation-7.22e-03

Pos logit contributions
 knocks+0.024
cakes+0.022
cases+0.021
chair+0.021
five+0.020
Neg logit contributions
P-0.035
Bo-0.033
orn-0.032
un-0.030
ble-0.030
Token!"
Token ID2474
Feature activation-1.10e-02

Pos logit contributions
P+0.012
orn+0.012
Bo+0.011
un+0.011
ble+0.010
Neg logit contributions
 knocks-0.013
cakes-0.012
cases-0.012
chair-0.011
five-0.011
Token Lily
Token ID20037
Feature activation+3.11e-04

Pos logit contributions
P+0.026
Bo+0.024
orn+0.023
un+0.023
G+0.022
Neg logit contributions
 knocks-0.014
cakes-0.012
cases-0.011
five-0.011
chair-0.011
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
cakes+0.001
 knocks+0.001
lived+0.001
 divorce+0.001
cases+0.000
Neg logit contributions
P-0.000
Bo-0.000
orn-0.000
un-0.000
ble-0.000
Token Ben
Token ID3932
Feature activation-1.06e-02

Pos logit contributions
cakes+0.026
 knocks+0.025
cases+0.025
lived+0.024
chair+0.024
Neg logit contributions
P-0.020
Bo-0.019
orn-0.017
un-0.017
ble-0.016
Token say
Token ID910
Feature activation-5.55e-03

Pos logit contributions
P+0.021
Bo+0.020
orn+0.020
un+0.018
ble+0.018
Neg logit contributions
 knocks-0.016
cakes-0.015
chair-0.014
cases-0.014
 knocking-0.013
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
P+0.008
Bo+0.008
un+0.008
ble+0.007
orn+0.007
Neg logit contributions
 knocks-0.011
cakes-0.010
chair-0.010
cases-0.010
 delays-0.010
Token
Token ID198
Feature activation-2.11e-02

Pos logit contributions
P+0.021
Bo+0.020
orn+0.019
un+0.018
G+0.018
Neg logit contributions
 knocks-0.019
cakes-0.018
cases-0.017
chair-0.017
five-0.016
Token
Token ID198
Feature activation-1.44e-02

Pos logit contributions
P+0.042
orn+0.042
Bo+0.040
ble+0.036
un+0.036
Neg logit contributions
 knocks-0.033
cakes-0.028
chair-0.027
 Amy-0.026
five-0.026
Token doing
Token ID1804
Feature activation-5.53e-03

Pos logit contributions
P+0.016
Bo+0.015
orn+0.015
ble+0.014
un+0.014
Neg logit contributions
 knocks-0.016
cakes-0.015
chair-0.014
cases-0.014
five-0.014
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
P+0.009
orn+0.008
Bo+0.008
ble+0.008
un+0.008
Neg logit contributions
 knocks-0.010
cakes-0.010
cases-0.009
five-0.009
chair-0.009
Token She
Token ID1375
Feature activation-5.09e-03

Pos logit contributions
 knocks+0.034
cases+0.033
five+0.033
lived+0.033
cakes+0.033
Neg logit contributions
P-0.019
Bo-0.016
un-0.015
orn-0.015
G-0.013
Token told
Token ID1297
Feature activation+9.06e-03

Pos logit contributions
P+0.010
Bo+0.010
orn+0.010
ble+0.009
un+0.009
Neg logit contributions
 knocks-0.008
cakes-0.007
chair-0.007
cases-0.007
 knocking-0.007
Token Lily
Token ID20037
Feature activation+1.49e-02

Pos logit contributions
 knocks+0.019
cases+0.018
cakes+0.018
lived+0.018
chair+0.017
Neg logit contributions
P-0.013
Bo-0.011
orn-0.011
un-0.010
G-0.009
Token that
Token ID326
Feature activation+2.95e-03

Pos logit contributions
 knocks+0.022
ksh+0.021
five+0.020
cakes+0.019
 timer+0.018
Neg logit contributions
P-0.029
orn-0.028
un-0.027
Bo-0.027
ble-0.026
Token it
Token ID340
Feature activation+1.72e-02

Pos logit contributions
 knocks+0.005
cakes+0.004
cases+0.004
five+0.004
chair+0.004
Neg logit contributions
P-0.006
Bo-0.005
orn-0.005
un-0.005
ble-0.005
Token wasn
Token ID2492
Feature activation+6.07e-03

Pos logit contributions
 knocks+0.031
cakes+0.029
five+0.028
cases+0.027
chair+0.027
Neg logit contributions
P-0.029
orn-0.027
Bo-0.027
un-0.026
ble-0.025
Token't
Token ID470
Feature activation-1.80e-02

Pos logit contributions
 knocks+0.009
cakes+0.009
chair+0.008
cases+0.008
 knocking+0.008
Neg logit contributions
P-0.012
Bo-0.012
orn-0.012
un-0.010
ble-0.010
Token safe
Token ID3338
Feature activation+5.49e-03

Pos logit contributions
P+0.037
Bo+0.036
orn+0.036
G+0.032
ble+0.031
Neg logit contributions
 knocks-0.027
cakes-0.023
 knocking-0.023
 leaving-0.022
chair-0.022
Token to
Token ID284
Feature activation+1.02e-02

Pos logit contributions
 knocks+0.008
cakes+0.008
five+0.007
chair+0.007
cases+0.007
Neg logit contributions
P-0.011
Bo-0.010
orn-0.010
un-0.010
ble-0.010
Token aunt
Token ID25949
Feature activation+3.57e-02

Pos logit contributions
 knocks+0.007
cakes+0.006
cases+0.006
chair+0.006
five+0.006
Neg logit contributions
P-0.007
Bo-0.007
orn-0.007
un-0.006
ble-0.006
Tokenie
Token ID494
Feature activation+1.09e-02

Pos logit contributions
 knocks+0.052
cakes+0.046
five+0.044
idents+0.043
chair+0.042
Neg logit contributions
P-0.070
orn-0.067
Bo-0.064
un-0.063
ble-0.058
Token and
Token ID290
Feature activation-4.02e-03

Pos logit contributions
 knocks+0.016
cakes+0.015
five+0.013
cases+0.013
chair+0.013
Neg logit contributions
P-0.022
orn-0.021
Bo-0.021
un-0.019
G-0.018
Token uncle
Token ID7711
Feature activation+2.10e-02

Pos logit contributions
P+0.009
orn+0.008
Bo+0.008
ble+0.008
un+0.008
Neg logit contributions
 knocks-0.006
cakes-0.005
cases-0.005
chair-0.005
 James-0.005
Token's
Token ID338
Feature activation+9.61e-05

Pos logit contributions
 knocks+0.034
cakes+0.030
chair+0.029
cases+0.028
five+0.027
Neg logit contributions
P-0.041
Bo-0.039
orn-0.038
un-0.036
ble-0.034
Token house
Token ID2156
Feature activation+1.97e-02

Pos logit contributions
 knocks+0.000
cakes+0.000
cases+0.000
chair+0.000
five+0.000
Neg logit contributions
P-0.000
Bo-0.000
orn-0.000
un-0.000
ble-0.000
Token,
Token ID11
Feature activation+1.71e-04

Pos logit contributions
 knocks+0.034
cakes+0.032
cases+0.030
chair+0.030
five+0.029
Neg logit contributions
P-0.036
Bo-0.034
orn-0.033
un-0.031
ble-0.030
Token Tommy
Token ID19919
Feature activation+9.91e-04

Pos logit contributions
 knocks+0.000
cakes+0.000
cases+0.000
chair+0.000
 James+0.000
Neg logit contributions
P-0.000
orn-0.000
Bo-0.000
un-0.000
ble-0.000
Token saw
Token ID2497
Feature activation+1.23e-02

Pos logit contributions
 knocks+0.002
cakes+0.001
cases+0.001
 knocking+0.001
 James+0.001
Neg logit contributions
P-0.002
orn-0.002
Bo-0.002
ble-0.002
un-0.002
Token the
Token ID262
Feature activation+1.26e-02

Pos logit contributions
 knocks+0.019
cakes+0.018
 James+0.017
cases+0.016
chair+0.016
Neg logit contributions
P-0.025
Bo-0.024
orn-0.023
ble-0.021
un-0.021
Token baby
Token ID5156
Feature activation+6.31e-03

Pos logit contributions
 knocks+0.021
cakes+0.019
cases+0.018
chair+0.018
five+0.017
Neg logit contributions
P-0.024
Bo-0.023
orn-0.023
un-0.021
ble-0.021
Token very
Token ID845
Feature activation+1.44e-03

Pos logit contributions
 knocks+0.015
cakes+0.014
cases+0.013
chair+0.013
five+0.013
Neg logit contributions
P-0.012
Bo-0.011
orn-0.011
un-0.010
ble-0.010
Token much
Token ID881
Feature activation+3.48e-03

Pos logit contributions
 knocks+0.003
cakes+0.003
cases+0.002
chair+0.002
five+0.002
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
ble-0.002
Token.
Token ID13
Feature activation+5.19e-03

Pos logit contributions
 knocks+0.007
cakes+0.006
cases+0.006
 knocking+0.006
 divorce+0.006
Neg logit contributions
P-0.006
orn-0.005
Bo-0.005
un-0.005
G-0.005
Token One
Token ID1881
Feature activation-1.39e-02

Pos logit contributions
 knocks+0.009
cakes+0.008
chair+0.007
 James+0.007
cases+0.007
Neg logit contributions
P-0.010
Bo-0.010
orn-0.009
ble-0.009
un-0.009
Token day
Token ID1110
Feature activation+8.61e-03

Pos logit contributions
P+0.023
orn+0.022
Bo+0.022
un+0.019
ble+0.019
Neg logit contributions
 knocks-0.027
cakes-0.025
cases-0.024
chair-0.023
five-0.023
Token,
Token ID11
Feature activation+3.66e-03

Pos logit contributions
 knocks+0.017
cakes+0.017
cases+0.016
chair+0.015
ksh+0.015
Neg logit contributions
P-0.013
Bo-0.012
orn-0.012
un-0.011
ble-0.010
Token Lily
Token ID20037
Feature activation-5.13e-03

Pos logit contributions
 knocks+0.006
cakes+0.005
cases+0.005
five+0.005
chair+0.005
Neg logit contributions
P-0.007
Bo-0.007
orn-0.007
un-0.006
ble-0.006
Token's
Token ID338
Feature activation+7.57e-03

Pos logit contributions
P+0.011
Bo+0.011
orn+0.011
un+0.010
ble+0.010
Neg logit contributions
 knocks-0.007
cakes-0.007
cases-0.006
chair-0.006
five-0.006
Token friend
Token ID1545
Feature activation+2.84e-03

Pos logit contributions
 knocks+0.012
cakes+0.011
chair+0.010
cases+0.010
five+0.010
Neg logit contributions
P-0.015
Bo-0.014
orn-0.014
un-0.013
ble-0.013
Token,
Token ID11
Feature activation-2.28e-03

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.004
chair+0.004
five+0.004
Neg logit contributions
P-0.006
Bo-0.005
orn-0.005
un-0.005
ble-0.005
Token Tim
Token ID5045
Feature activation+4.01e-03

Pos logit contributions
P+0.005
Bo+0.005
orn+0.005
un+0.005
ble+0.005
Neg logit contributions
 knocks-0.003
cakes-0.003
cases-0.002
chair-0.002
five-0.002
Token was
Token ID373
Feature activation+7.42e-03

Pos logit contributions
 knocks+0.044
cakes+0.038
chair+0.037
cases+0.036
 sigh+0.036
Neg logit contributions
P-0.058
Bo-0.056
orn-0.054
un-0.050
ble-0.049
Token being
Token ID852
Feature activation+2.29e-02

Pos logit contributions
 knocks+0.014
cakes+0.014
cases+0.013
idents+0.013
chair+0.012
Neg logit contributions
P-0.012
Bo-0.011
ble-0.011
orn-0.011
un-0.010
Token so
Token ID523
Feature activation+1.54e-03

Pos logit contributions
 knocks+0.041
cakes+0.037
cases+0.036
chair+0.035
five+0.034
Neg logit contributions
P-0.043
Bo-0.039
orn-0.038
un-0.036
ble-0.035
Token stubborn
Token ID24444
Feature activation+6.62e-03

Pos logit contributions
 knocks+0.002
cakes+0.002
cases+0.002
chair+0.002
 sigh+0.002
Neg logit contributions
P-0.003
orn-0.003
Bo-0.003
un-0.003
ble-0.003
Token.
Token ID13
Feature activation+4.36e-03

Pos logit contributions
 knocks+0.012
cakes+0.011
cases+0.010
five+0.010
chair+0.010
Neg logit contributions
P-0.012
orn-0.011
Bo-0.011
un-0.010
ble-0.010
Token She
Token ID1375
Feature activation-1.94e-03

Pos logit contributions
 knocks+0.008
cakes+0.007
chair+0.007
cases+0.007
five+0.006
Neg logit contributions
P-0.008
Bo-0.007
orn-0.007
un-0.007
ble-0.007
Token wouldn
Token ID3636
Feature activation+6.28e-03

Pos logit contributions
P+0.004
orn+0.004
Bo+0.004
ble+0.003
un+0.003
Neg logit contributions
 knocks-0.003
cakes-0.003
chair-0.003
cases-0.003
 sigh-0.003
Token't
Token ID470
Feature activation-8.40e-03

Pos logit contributions
 knocks+0.008
cakes+0.007
chair+0.007
cases+0.007
 knocking+0.007
Neg logit contributions
P-0.015
orn-0.014
Bo-0.014
G-0.012
ery-0.012
Token listen
Token ID6004
Feature activation+1.34e-02

Pos logit contributions
P+0.018
Bo+0.017
orn+0.017
un+0.016
ble+0.015
Neg logit contributions
 knocks-0.012
cakes-0.011
cases-0.010
chair-0.010
five-0.010
Token.
Token ID13
Feature activation+4.30e-03

Pos logit contributions
 knocks+0.018
cakes+0.016
chair+0.014
cases+0.014
five+0.014
Neg logit contributions
P-0.029
Bo-0.028
orn-0.028
un-0.026
ble-0.026
Token She
Token ID1375
Feature activation+2.75e-03

Pos logit contributions
 knocks+0.007
cakes+0.007
cases+0.006
chair+0.006
five+0.006
Neg logit contributions
P-0.008
Bo-0.008
orn-0.008
un-0.007
ble-0.007
Token and
Token ID290
Feature activation-3.47e-03

Pos logit contributions
 knocks+0.026
cakes+0.024
cases+0.022
five+0.022
chair+0.022
Neg logit contributions
P-0.025
Bo-0.025
orn-0.025
ble-0.023
un-0.022
Token the
Token ID262
Feature activation+1.17e-03

Pos logit contributions
P+0.007
Bo+0.006
orn+0.006
un+0.006
ble+0.006
Neg logit contributions
 knocks-0.006
cakes-0.005
chair-0.005
cases-0.005
five-0.005
Token singing
Token ID13777
Feature activation+2.43e-03

Pos logit contributions
 knocks+0.002
cakes+0.002
chair+0.002
 sigh+0.002
cases+0.002
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
ble-0.002
Token birds
Token ID10087
Feature activation-1.63e-03

Pos logit contributions
 knocks+0.004
cakes+0.003
chair+0.003
cases+0.003
five+0.003
Neg logit contributions
P-0.005
Bo-0.005
orn-0.005
G-0.004
un-0.004
Token.
Token ID13
Feature activation-6.37e-03

Pos logit contributions
P+0.003
Bo+0.003
orn+0.003
un+0.003
ble+0.003
Neg logit contributions
 knocks-0.003
cakes-0.003
cases-0.002
chair-0.002
five-0.002
Token He
Token ID679
Feature activation-2.08e-02

Pos logit contributions
P+0.010
Bo+0.009
orn+0.009
un+0.008
G+0.008
Neg logit contributions
 knocks-0.013
cakes-0.012
cases-0.011
chair-0.011
 divorce-0.011
Token began
Token ID2540
Feature activation-1.23e-02

Pos logit contributions
P+0.045
orn+0.044
un+0.041
Bo+0.040
ble+0.040
Neg logit contributions
 knocks-0.029
 sigh-0.024
chair-0.024
cases-0.023
cakes-0.022
Token to
Token ID284
Feature activation-1.52e-03

Pos logit contributions
P+0.030
Bo+0.029
orn+0.028
un+0.027
ble+0.026
Neg logit contributions
 knocks-0.014
cakes-0.013
cases-0.011
chair-0.011
five-0.010
Token feel
Token ID1254
Feature activation+1.08e-03

Pos logit contributions
P+0.003
orn+0.003
Bo+0.003
un+0.003
G+0.003
Neg logit contributions
 knocks-0.002
 sigh-0.002
 knocking-0.002
cakes-0.002
chair-0.002
Token at
Token ID379
Feature activation+1.91e-02

Pos logit contributions
 knocks+0.001
 sigh+0.001
 leaving+0.001
 knocking+0.001
chair+0.001
Neg logit contributions
P-0.003
orn-0.002
Bo-0.002
un-0.002
G-0.002
Token peace
Token ID4167
Feature activation+9.67e-03

Pos logit contributions
 knocks+0.036
cakes+0.034
cases+0.032
chair+0.032
five+0.031
Neg logit contributions
P-0.031
Bo-0.030
orn-0.030
un-0.027
ble-0.027
Token
Token ID198
Feature activation-5.27e-03

Pos logit contributions
orn+0.022
P+0.020
ble+0.020
Bo+0.020
ery+0.019
Neg logit contributions
 Amy-0.013
 James-0.012
 knocks-0.012
cakes-0.012
five-0.011
TokenMax
Token ID11518
Feature activation+1.02e-02

Pos logit contributions
orn+0.010
ble+0.009
Bo+0.009
P+0.009
ery+0.008
Neg logit contributions
 knocks-0.008
cakes-0.008
 James-0.008
 Amy-0.008
five-0.007
Token felt
Token ID2936
Feature activation-3.98e-03

Pos logit contributions
 knocks+0.016
cakes+0.014
cases+0.013
chair+0.013
five+0.013
Neg logit contributions
P-0.021
Bo-0.020
orn-0.019
un-0.018
ble-0.018
Token better
Token ID1365
Feature activation-9.48e-03

Pos logit contributions
P+0.009
orn+0.008
Bo+0.008
un+0.008
ble+0.007
Neg logit contributions
 knocks-0.006
cakes-0.005
 sigh-0.005
chair-0.005
five-0.005
Token and
Token ID290
Feature activation+6.15e-03

Pos logit contributions
P+0.018
Bo+0.017
orn+0.017
un+0.016
ble+0.016
Neg logit contributions
 knocks-0.016
cakes-0.015
cases-0.014
chair-0.014
five-0.013
Token they
Token ID484
Feature activation-5.60e-03

Pos logit contributions
 knocks+0.011
cakes+0.011
isions+0.010
cases+0.010
chair+0.010
Neg logit contributions
P-0.010
Bo-0.009
orn-0.009
ble-0.008
G-0.008
Token sat
Token ID3332
Feature activation+1.23e-02

Pos logit contributions
P+0.012
orn+0.012
Bo+0.012
ble+0.011
un+0.011
Neg logit contributions
 knocks-0.007
 sigh-0.007
cakes-0.006
cases-0.006
chair-0.006
Token on
Token ID319
Feature activation+2.50e-03

Pos logit contributions
 knocks+0.017
cakes+0.015
cases+0.014
chair+0.014
five+0.013
Neg logit contributions
P-0.027
Bo-0.026
orn-0.025
un-0.024
G-0.023
Token the
Token ID262
Feature activation+2.96e-03

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.004
chair+0.004
five+0.004
Neg logit contributions
P-0.004
Bo-0.004
orn-0.004
un-0.004
ble-0.004
Token bench
Token ID7624
Feature activation+1.52e-02

Pos logit contributions
 knocks+0.004
cakes+0.004
cases+0.004
 James+0.003
chair+0.003
Neg logit contributions
P-0.006
Bo-0.006
orn-0.006
ble-0.006
un-0.005
Token,
Token ID11
Feature activation+8.29e-03

Pos logit contributions
 knocks+0.026
cakes+0.023
cases+0.021
chair+0.021
five+0.021
Neg logit contributions
P-0.027
Bo-0.026
orn-0.026
un-0.024
ble-0.024
Token Polly
Token ID36898
Feature activation-1.25e-02

Pos logit contributions
 knocks+0.015
cakes+0.014
chair+0.013
cases+0.013
five+0.012
Neg logit contributions
P-0.020
Bo-0.020
orn-0.019
un-0.018
ble-0.017
Token said
Token ID531
Feature activation-1.88e-02

Pos logit contributions
P+0.028
orn+0.027
Bo+0.026
un+0.025
ble+0.024
Neg logit contributions
 knocks-0.017
cakes-0.015
 sigh-0.014
cases-0.014
 leaving-0.014
Token,
Token ID11
Feature activation-1.83e-03

Pos logit contributions
P+0.037
orn+0.035
Bo+0.035
un+0.032
G+0.031
Neg logit contributions
 knocks-0.030
cakes-0.028
cases-0.026
chair-0.025
five-0.025
Token "
Token ID366
Feature activation-3.06e-03

Pos logit contributions
P+0.004
orn+0.004
Bo+0.004
un+0.004
ble+0.004
Neg logit contributions
 knocks-0.002
cakes-0.002
cases-0.002
chair-0.002
five-0.002
TokenLet
Token ID5756
Feature activation+3.44e-03

Pos logit contributions
orn+0.006
ble+0.006
P+0.006
Bo+0.006
un+0.006
Neg logit contributions
cakes-0.004
 knocks-0.004
ksh-0.004
five-0.004
 Whoever-0.003
Token's
Token ID338
Feature activation-8.33e-03

Pos logit contributions
 knocks+0.004
cakes+0.003
cases+0.003
chair+0.003
five+0.003
Neg logit contributions
P-0.009
Bo-0.008
orn-0.008
un-0.008
ble-0.007
Token get
Token ID651
Feature activation+6.28e-03

Pos logit contributions
P+0.019
Bo+0.018
orn+0.018
un+0.016
G+0.016
Neg logit contributions
 knocks-0.011
cakes-0.009
 knocking-0.009
 sigh-0.009
 leaving-0.009
Token rid
Token ID5755
Feature activation-5.66e-03

Pos logit contributions
 knocks+0.011
cakes+0.010
cases+0.010
chair+0.010
five+0.009
Neg logit contributions
P-0.012
Bo-0.011
orn-0.011
ble-0.010
un-0.010
Token of
Token ID286
Feature activation-1.94e-03

Pos logit contributions
P+0.013
Bo+0.012
orn+0.012
G+0.011
ble+0.011
Neg logit contributions
 knocks-0.007
cakes-0.007
chair-0.006
five-0.006
 Amy-0.006
Token this
Token ID428
Feature activation-7.97e-03

Pos logit contributions
P+0.004
Bo+0.004
orn+0.004
un+0.003
ble+0.003
Neg logit contributions
 knocks-0.003
cakes-0.003
cases-0.003
chair-0.003
five-0.002
Token!"
Token ID2474
Feature activation-1.14e-02

Pos logit contributions
P+0.016
Bo+0.016
orn+0.015
un+0.014
ble+0.014
Neg logit contributions
 knocks-0.012
cakes-0.011
cases-0.010
chair-0.010
five-0.010
Token,
Token ID11
Feature activation+8.25e-04

Pos logit contributions
P+0.022
Bo+0.021
orn+0.020
G+0.019
ble+0.019
Neg logit contributions
 knocks-0.017
cakes-0.014
chair-0.014
 Amy-0.014
 James-0.014
Token Lily
Token ID20037
Feature activation-2.25e-02

Pos logit contributions
 knocks+0.001
cakes+0.001
cases+0.001
chair+0.001
five+0.001
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
ble-0.002
Token was
Token ID373
Feature activation-4.26e-03

Pos logit contributions
P+0.045
Bo+0.043
ble+0.041
orn+0.041
G+0.039
Neg logit contributions
 knocks-0.034
chair-0.027
 knocking-0.027
 sigh-0.026
cakes-0.026
Token always
Token ID1464
Feature activation-5.67e-03

Pos logit contributions
P+0.007
Bo+0.007
orn+0.007
un+0.006
ble+0.006
Neg logit contributions
 knocks-0.008
cakes-0.007
chair-0.007
cases-0.007
 knocking-0.007
Token careful
Token ID8161
Feature activation+8.30e-03

Pos logit contributions
P+0.011
Bo+0.010
orn+0.010
un+0.009
ble+0.009
Neg logit contributions
 knocks-0.009
cakes-0.008
cases-0.008
chair-0.008
five-0.008
Token and
Token ID290
Feature activation-3.66e-03

Pos logit contributions
 knocks+0.014
cakes+0.013
cases+0.011
five+0.011
ksh+0.011
Neg logit contributions
P-0.015
Bo-0.015
orn-0.015
un-0.015
ble-0.014
Token made
Token ID925
Feature activation-5.13e-03

Pos logit contributions
P+0.007
Bo+0.006
orn+0.006
un+0.005
ble+0.005
Neg logit contributions
 knocks-0.007
cakes-0.006
chair-0.006
cases-0.006
 knocking-0.006
Token sure
Token ID1654
Feature activation+1.31e-02

Pos logit contributions
P+0.011
Bo+0.010
orn+0.010
un+0.009
ble+0.009
Neg logit contributions
 knocks-0.008
cakes-0.007
cases-0.007
chair-0.007
five-0.006
Token to
Token ID284
Feature activation+5.34e-03

Pos logit contributions
 knocks+0.016
cakes+0.014
cases+0.013
chair+0.013
five+0.013
Neg logit contributions
P-0.030
Bo-0.029
orn-0.029
un-0.027
ble-0.026
Token ask
Token ID1265
Feature activation-2.25e-04

Pos logit contributions
 knocks+0.007
 knocking+0.006
 knock+0.006
 report+0.006
 sigh+0.006
Neg logit contributions
P-0.012
orn-0.012
Bo-0.011
un-0.010
G-0.010
Token for
Token ID329
Feature activation-1.79e-02

Pos logit contributions
P+0.001
orn+0.001
Bo+0.001
un+0.000
G+0.000
Neg logit contributions
 knocks-0.000
cakes-0.000
 knocking-0.000
 leaving-0.000
 James-0.000
Token
Token ID198
Feature activation-1.98e-02

Pos logit contributions
P+0.051
orn+0.049
Bo+0.047
ery+0.045
 feather+0.045
Neg logit contributions
 knocks-0.026
 Amy-0.022
 sigh-0.020
chair-0.020
 James-0.020
Token"
Token ID1
Feature activation-1.02e-02

Pos logit contributions
orn+0.030
Bo+0.028
un+0.027
P+0.027
ery+0.026
Neg logit contributions
 knocks-0.036
cakes-0.032
 Amy-0.032
chair-0.032
five-0.031
TokenThank
Token ID10449
Feature activation-1.02e-02

Pos logit contributions
orn+0.019
P+0.018
Bo+0.018
un+0.018
ble+0.017
Neg logit contributions
cakes-0.015
 knocks-0.015
five-0.014
chair-0.013
cake-0.013
Token you
Token ID345
Feature activation-4.46e-03

Pos logit contributions
P+0.021
Bo+0.020
orn+0.020
un+0.018
ble+0.018
Neg logit contributions
 knocks-0.016
cakes-0.014
cases-0.013
chair-0.013
five-0.013
Token,
Token ID11
Feature activation-1.87e-03

Pos logit contributions
P+0.009
Bo+0.008
orn+0.008
un+0.008
ble+0.007
Neg logit contributions
 knocks-0.007
cakes-0.007
chair-0.006
cases-0.006
five-0.006
Token Ben
Token ID3932
Feature activation-4.34e-02

Pos logit contributions
P+0.005
Bo+0.005
orn+0.005
un+0.004
G+0.004
Neg logit contributions
 knocks-0.002
cakes-0.002
 sigh-0.002
 Amy-0.001
chair-0.001
Token,"
Token ID553
Feature activation-3.03e-02

Pos logit contributions
P+0.091
Bo+0.088
orn+0.085
un+0.080
G+0.078
Neg logit contributions
 knocks-0.064
cakes-0.056
chair-0.053
cases-0.052
 sigh-0.052
Token Mia
Token ID32189
Feature activation+8.32e-04

Pos logit contributions
P+0.079
orn+0.076
Bo+0.076
un+0.072
ble+0.071
Neg logit contributions
 knocks-0.030
cakes-0.026
chair-0.022
cases-0.021
 sigh-0.021
Token says
Token ID1139
Feature activation-2.67e-03

Pos logit contributions
 knocks+0.001
cakes+0.001
cases+0.001
chair+0.001
 knocking+0.001
Neg logit contributions
P-0.002
Bo-0.002
orn-0.002
un-0.002
ble-0.002
Token.
Token ID13
Feature activation-2.30e-02

Pos logit contributions
ble+0.003
P+0.003
un+0.003
Bo+0.003
orn+0.003
Neg logit contributions
 knocks-0.006
cakes-0.006
 inconvenience-0.005
isions-0.005
chair-0.005
Token "
Token ID366
Feature activation-8.45e-03

Pos logit contributions
P+0.053
orn+0.051
Bo+0.051
un+0.047
ble+0.046
Neg logit contributions
 knocks-0.030
cakes-0.026
chair-0.023
cases-0.023
 knocking-0.022
TokenOnce
Token ID7454
Feature activation-5.49e-03

Pos logit contributions
 knocks+0.031
cakes+0.028
chair+0.027
 James+0.026
 Amy+0.026
Neg logit contributions
P-0.035
Bo-0.034
orn-0.032
ble-0.031
un-0.030
Token there
Token ID612
Feature activation+1.22e-02

Pos logit contributions
P+0.008
Bo+0.008
orn+0.007
un+0.007
ble+0.006
Neg logit contributions
 knocks-0.012
cakes-0.011
cases-0.010
chair-0.010
five-0.010
Token was
Token ID373
Feature activation-5.70e-03

Pos logit contributions
 knocks+0.019
cakes+0.017
chair+0.016
 James+0.016
 Amy+0.016
Neg logit contributions
P-0.025
Bo-0.023
orn-0.023
un-0.021
G-0.021
Token a
Token ID257
Feature activation-8.50e-03

Pos logit contributions
P+0.011
Bo+0.010
orn+0.010
un+0.009
ble+0.009
Neg logit contributions
 knocks-0.009
cakes-0.009
chair-0.008
 James-0.008
five-0.008
Token pe
Token ID613
Feature activation-5.54e-03

Pos logit contributions
P+0.016
Bo+0.016
orn+0.015
un+0.014
ble+0.014
Neg logit contributions
 knocks-0.014
cakes-0.013
chair-0.012
cases-0.012
five-0.011
Tokenb
Token ID65
Feature activation-3.44e-02

Pos logit contributions
Bo+0.007
P+0.007
orn+0.006
un+0.006
G+0.006
Neg logit contributions
 knocks-0.012
cakes-0.011
cases-0.011
chair-0.011
lived-0.011
Tokenble
Token ID903
Feature activation-2.24e-02

Pos logit contributions
P+0.033
Bo+0.029
orn+0.024
G+0.023
Mart+0.023
Neg logit contributions
 knocks-0.073
cakes-0.071
cases-0.071
 manager-0.068
 where-0.068
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
P+0.049
Bo+0.048
orn+0.046
un+0.044
G+0.043
Neg logit contributions
 knocks-0.029
chair-0.025
cakes-0.024
 knocking-0.023
 leaving-0.022
Token The
Token ID383
Feature activation-4.18e-03

Pos logit contributions
 knocks+0.019
cakes+0.016
chair+0.015
cases+0.015
five+0.014
Neg logit contributions
P-0.034
Bo-0.033
orn-0.032
un-0.030
ble-0.030
Token pe
Token ID613
Feature activation-9.31e-03

Pos logit contributions
P+0.008
un+0.007
Bo+0.007
orn+0.007
Mart+0.006
Neg logit contributions
 knocks-0.007
 sigh-0.006
five-0.006
 knock-0.006
 knocking-0.006
Tokenb
Token ID65
Feature activation-3.42e-02

Pos logit contributions
P+0.012
Bo+0.012
un+0.011
G+0.010
orn+0.010
Neg logit contributions
 knocks-0.019
chair-0.018
cakes-0.017
cases-0.017
lived-0.017
Token,
Token ID11
Feature activation-8.77e-03

Pos logit contributions
P+0.017
Bo+0.016
orn+0.015
ble+0.014
un+0.014
Neg logit contributions
 knocks-0.015
cakes-0.014
 Amy-0.014
 James-0.014
five-0.013
Token "
Token ID366
Feature activation-9.85e-03

Pos logit contributions
P+0.022
orn+0.021
Bo+0.020
un+0.019
ble+0.018
Neg logit contributions
 knocks-0.010
cakes-0.008
 knocking-0.008
 leaving-0.008
 James-0.008
TokenWhat
Token ID2061
Feature activation+1.77e-02

Pos logit contributions
P+0.021
orn+0.020
Bo+0.020
un+0.019
ble+0.019
Neg logit contributions
 knocks-0.013
cakes-0.012
five-0.011
cases-0.011
chair-0.011
Token are
Token ID389
Feature activation-2.14e-02

Pos logit contributions
 knocks+0.029
cakes+0.026
chair+0.024
cases+0.024
 James+0.023
Neg logit contributions
P-0.035
Bo-0.034
orn-0.032
un-0.030
ble-0.030
Token b
Token ID275
Feature activation-2.57e-02

Pos logit contributions
P+0.046
Bo+0.046
orn+0.045
un+0.042
ble+0.041
Neg logit contributions
 knocks-0.029
cakes-0.027
cases-0.024
chair-0.023
 leaving-0.022
Tokenerry
Token ID6996
Feature activation-3.68e-02

Pos logit contributions
P+0.045
Bo+0.043
orn+0.040
Mart+0.038
ble+0.037
Neg logit contributions
cakes-0.045
 knocks-0.044
chair-0.040
cases-0.039
olding-0.038
Token-
Token ID12
Feature activation+1.00e-02

Pos logit contributions
P+0.067
Bo+0.066
orn+0.062
un+0.059
G+0.057
Neg logit contributions
 knocks-0.063
cakes-0.062
chair-0.056
cases-0.056
 leaving-0.054
Tokenbus
Token ID10885
Feature activation-1.31e-02

Pos logit contributions
 knocks+0.015
cakes+0.015
chair+0.013
cases+0.013
five+0.013
Neg logit contributions
P-0.020
Bo-0.019
orn-0.019
ble-0.018
un-0.017
Tokenhes
Token ID956
Feature activation-1.03e-02

Pos logit contributions
P+0.023
Bo+0.022
orn+0.021
Mart+0.019
G+0.019
Neg logit contributions
 knocks-0.024
cakes-0.022
cases-0.021
chair-0.020
 leaving-0.020
Token?"
Token ID1701
Feature activation-6.90e-03

Pos logit contributions
Bo+0.021
P+0.021
orn+0.018
Mart+0.018
G+0.018
Neg logit contributions
cakes-0.016
 knocks-0.016
chair-0.015
cases-0.014
 Amy-0.014
Token
Token ID198
Feature activation-8.29e-03

Pos logit contributions
P+0.014
orn+0.014
Bo+0.014
ble+0.012
ery+0.012
Neg logit contributions
 knocks-0.010
cakes-0.009
chair-0.009
 James-0.009
 Amy-0.009
Token away
Token ID1497
Feature activation-1.26e-02

Pos logit contributions
P+0.027
Bo+0.026
orn+0.025
ble+0.023
G+0.023
Neg logit contributions
 knocks-0.016
cakes-0.013
chair-0.013
 sigh-0.013
cases-0.013
Token as
Token ID355
Feature activation-1.11e-02

Pos logit contributions
P+0.028
orn+0.027
Bo+0.026
un+0.024
G+0.024
Neg logit contributions
 knocks-0.018
cakes-0.015
cases-0.014
 leaving-0.014
chair-0.014
Token fast
Token ID3049
Feature activation-2.42e-03

Pos logit contributions
P+0.025
orn+0.022
Bo+0.022
G+0.021
un+0.020
Neg logit contributions
 knocks-0.016
chair-0.014
 Amy-0.013
 James-0.013
 knocking-0.013
Token as
Token ID355
Feature activation-1.87e-02

Pos logit contributions
P+0.005
Bo+0.005
orn+0.005
un+0.005
ble+0.005
Neg logit contributions
cakes-0.003
 knocks-0.003
cases-0.003
 divorce-0.003
 manager-0.003
Token he
Token ID339
Feature activation-1.69e-02

Pos logit contributions
P+0.047
orn+0.042
Bo+0.042
G+0.040
un+0.039
Neg logit contributions
 knocks-0.022
chair-0.017
 Amy-0.017
 James-0.017
 leaving-0.017
Token could
Token ID714
Feature activation-3.67e-02

Pos logit contributions
P+0.035
orn+0.032
Bo+0.031
ble+0.031
agons+0.030
Neg logit contributions
 knocks-0.026
chair-0.022
cases-0.021
 sigh-0.020
 knocking-0.020
Token.
Token ID13
Feature activation+4.27e-03

Pos logit contributions
P+0.090
Bo+0.081
orn+0.080
un+0.074
G+0.073
Neg logit contributions
 knocks-0.047
 sigh-0.038
 leaving-0.038
chair-0.038
 knocking-0.037
Token
Token ID220
Feature activation-8.77e-03

Pos logit contributions
 knocks+0.008
cakes+0.008
cases+0.007
chair+0.007
five+0.007
Neg logit contributions
P-0.007
Bo-0.007
orn-0.007
un-0.006
G-0.006
Token
Token ID198
Feature activation-7.60e-03

Pos logit contributions
P+0.018
Bo+0.017
orn+0.017
 feather+0.015
G+0.015
Neg logit contributions
 knocks-0.013
cakes-0.012
chair-0.011
cases-0.011
 Amy-0.011
Token
Token ID198
Feature activation+1.43e-03

Pos logit contributions
P+0.015
orn+0.014
Bo+0.014
ble+0.014
un+0.013
Neg logit contributions
 knocks-0.011
cakes-0.010
chair-0.010
 Amy-0.009
 James-0.009
TokenB
Token ID33
Feature activation-5.23e-06

Pos logit contributions
 knocks+0.002
cakes+0.002
ulating+0.002
chair+0.002
 knocking+0.002
Neg logit contributions
orn-0.002
Bo-0.002
ble-0.002
P-0.002
un-0.002
Token -
Token ID532
Feature activation-1.14e-02

Pos logit contributions
P+0.012
Bo+0.012
orn+0.012
G+0.011
ble+0.011
Neg logit contributions
 knocks-0.012
cakes-0.012
chair-0.011
cases-0.011
five-0.010
Token a
Token ID257
Feature activation-1.50e-02

Pos logit contributions
P+0.024
Bo+0.023
orn+0.022
un+0.021
G+0.020
Neg logit contributions
 knocks-0.017
cakes-0.016
chair-0.015
cases-0.014
 leaving-0.014
Token tree
Token ID5509
Feature activation-6.42e-04

Pos logit contributions
P+0.028
orn+0.026
Bo+0.026
un+0.024
ble+0.023
Neg logit contributions
 knocks-0.025
 knocking-0.022
 sigh-0.022
cakes-0.022
 manager-0.021
Token with
Token ID351
Feature activation-6.28e-03

Pos logit contributions
P+0.001
Bo+0.001
orn+0.001
un+0.001
ble+0.001
Neg logit contributions
 knocks-0.001
cakes-0.001
cases-0.001
chair-0.001
five-0.001
Token Bright
Token ID17558
Feature activation-3.63e-03

Pos logit contributions
P+0.013
Bo+0.012
orn+0.012
ble+0.012
un+0.011
Neg logit contributions
 knocks-0.009
cakes-0.008
 Amy-0.008
 knocking-0.008
 leaving-0.008
Tokenly
Token ID306
Feature activation-3.59e-02

Pos logit contributions
P+0.007
Bo+0.006
orn+0.006
un+0.006
G+0.006
Neg logit contributions
 knocks-0.006
cakes-0.006
chair-0.006
cases-0.005
 sigh-0.005
Token shining
Token ID22751
Feature activation-1.59e-02

Pos logit contributions
P+0.065
Bo+0.061
orn+0.055
G+0.054
ble+0.054
Neg logit contributions
 knocks-0.064
chair-0.059
cakes-0.055
 sigh-0.054
five-0.053
Token leaves
Token ID5667
Feature activation-2.90e-02

Pos logit contributions
P+0.031
Bo+0.029
orn+0.027
G+0.026
un+0.025
Neg logit contributions
 knocks-0.026
cakes-0.025
chair-0.024
 sigh-0.023
cases-0.022
Token.
Token ID13
Feature activation-2.95e-03

Pos logit contributions
P+0.062
Bo+0.059
orn+0.057
un+0.053
G+0.052
Neg logit contributions
 knocks-0.042
cakes-0.038
cases-0.036
chair-0.036
 leaving-0.034
Token She
Token ID1375
Feature activation-1.38e-02

Pos logit contributions
P+0.005
Bo+0.004
orn+0.004
un+0.004
G+0.004
Neg logit contributions
 knocks-0.006
cakes-0.005
cases-0.005
 knocking-0.005
 sigh-0.005
Token quickly
Token ID2952
Feature activation-1.45e-02

Pos logit contributions
P+0.030
orn+0.029
agons+0.027
un+0.027
Bo+0.027
Neg logit contributions
 knocks-0.019
 sigh-0.015
chair-0.015
cakes-0.015
cases-0.015