TOP ACTIVATIONS
MAX = 0.084

just like in a fair yt ale ! <|endoftext|> Once upon
them over for a fair yt ale tea party . They
was walking through a fair yt ale . She fell asleep
and saw a magical fair yt ale creature called a n
went on adventures , had pic n ics , and even
their adventure . After a year of traveling , they finally
liked to play and have pic n ics . One day
roof . They would have pic n ics and play games
, ready to have more pic n ics in the future
something . A little fair yt ale bird was singing a
. <|endoftext|> Ben loves his sc ooter . He rides it
see the bird and have pic n ics . <|endoftext|> Once
and you can make a bou quet for mom ," he
go outside for a whole year ! Eventually ,
the park , and have pic n ics outside .
near the cliff and have pic n ics . One day
to be careful when having pic n ics and not to
<|endoftext|> Tom my loved his sc ooter . He liked to
day , a little boy named Ben wanted to play with
<|endoftext|> Tom my loved his sc ooter . He rode it

MIN ACTIVATIONS
MIN = -0.078

his grandfather . He asked : " Well , what do
iron ing clothes and asked nicely , " Mom , can
daughter was curious and asked : " What does he record
up to him and asked : " Why are you cl
The mother asked : " Why are you sad
to the flower and asked : " Hello flower , what
the bear because he asked nicely . The bear learned that
M ummy smiled and said : " They are scissors ,
her mom my and asked : " It looks so tough
Mark y asked : " What should I bring
Her mum smiled and said : " It 's a net
his side . She asked : " What is wrong ,
his cap , and said : " I don 't have
The man asked : � � € � �
the park her " use ful - park ". <|endoftext|> Once
to the bear and ask nicely if Hop can borrow the
mum was surprised and asked : " What will you do
other animals came to see Beef dance in the empty box
kitchen and see their mom chopping carrots and potatoes . She
bit scared . She asked : " What if it hurts

INTERVAL 0.043 - 0.084
CONTAINS 0.127%

time , a little boy named Andy had a fragile bicycle
there was a little girl named Lily . She loved to
there was a little boy named Tim . Tim loved to
Once there was a boy named Bob . He was very
there was a lazy cat named Tom . Tom loved to

INTERVAL 0.003 - 0.043
CONTAINS 41.486%

her . She sighed and said , " I wish I
has a long handle on the side . "
off to play . Little girl reached out to hug Mama
future . You will be a great veterinarian someday . You
to the sky . Mom looked up . She

INTERVAL -0.038 - 0.003
CONTAINS 57.958%

people who got on and off . He saw trees and
up from her nap , she saw the dog . She
felt weak and she felt as though she would stop running
� � € � � It 's under the ground now
seed and he thought it was boring . But his mom

INTERVAL -0.078 - -0.038
CONTAINS 0.430%

spicy sauce . The caul iflower was so happy , it
. He saw that the salt sh aker was open and
bird , so he ran inside to find his mom .
funny hat . He said , " I just wanted to
day , Cic i heard about a big tornado coming nearby
Token just
Token ID655
Feature activation-9.91e-04

Pos logit contributions
 praying+0.000
 whist+0.000
 asking+0.000
 being+0.000
 telling+0.000
Neg logit contributions
tted-0.001
umbs-0.001
okingly-0.001
aney-0.001
hell-0.001
Token like
Token ID588
Feature activation+9.07e-03

Pos logit contributions
 shaving+0.002
 praying+0.002
 whist+0.002
 asking+0.002
 Potato+0.002
Neg logit contributions
tted-0.001
umbs-0.001
bridge-0.001
quet-0.001
atha-0.001
Token in
Token ID287
Feature activation+5.86e-03

Pos logit contributions
tted+0.022
umbs+0.019
fy+0.018
atters+0.018
atha+0.017
Neg logit contributions
 praying-0.011
 whist-0.010
 asking-0.010
 shaving-0.010
 being-0.009
Token a
Token ID257
Feature activation+9.86e-03

Pos logit contributions
tted+0.014
umbs+0.012
ives+0.012
atha+0.012
quet+0.012
Neg logit contributions
 praying-0.006
 shaving-0.006
 whist-0.006
 asking-0.006
 being-0.005
Token fair
Token ID3148
Feature activation+4.61e-03

Pos logit contributions
tted+0.018
umbs+0.015
adders+0.015
okingly+0.015
ives+0.015
Neg logit contributions
 whist-0.015
 praying-0.015
 shaving-0.014
 asking-0.013
 recording-0.013
Tokenyt
Token ID20760
Feature activation+8.35e-02

Pos logit contributions
tted+0.010
adders+0.008
umbs+0.008
aid+0.008
ives+0.008
Neg logit contributions
 whist-0.006
 praying-0.006
 shaving-0.005
 asking-0.005
 daddy-0.005
Tokenale
Token ID1000
Feature activation-9.97e-03

Pos logit contributions
tted+0.161
quet+0.129
 queens+0.128
drops+0.126
There+0.123
Neg logit contributions
 grandfather-0.099
 asking-0.098
 grandson-0.098
 temperature-0.097
 shaving-0.096
Token!
Token ID0
Feature activation-5.13e-03

Pos logit contributions
 praying+0.013
 whist+0.013
 shaving+0.012
 asking+0.012
 smoking+0.011
Neg logit contributions
tted-0.021
umbs-0.017
atha-0.017
okingly-0.017
atters-0.017
Token<|endoftext|>
Token ID50256
Feature activation-2.06e-02

Pos logit contributions
 praying+0.003
 shaving+0.003
 whist+0.003
 Potato+0.002
 Brother+0.002
Neg logit contributions
tted-0.014
bridge-0.013
quet-0.012
atha-0.012
adders-0.012
TokenOnce
Token ID7454
Feature activation+2.53e-02

Pos logit contributions
 praying+0.023
 shaving+0.022
 whist+0.022
 pap+0.021
 asking+0.020
Neg logit contributions
tted-0.046
umbs-0.040
bridge-0.038
ay-0.037
hell-0.037
Token upon
Token ID2402
Feature activation-2.21e-02

Pos logit contributions
tted+0.054
quet+0.046
bridge+0.046
umbs+0.046
atha+0.045
Neg logit contributions
 praying-0.029
 shaving-0.029
 whist-0.029
 asking-0.026
 Potato-0.024
Token them
Token ID606
Feature activation-9.99e-03

Pos logit contributions
tted+0.008
umbs+0.007
atters+0.007
fy+0.007
bridge+0.007
Neg logit contributions
 asking-0.003
 whist-0.003
 praying-0.003
 shaving-0.003
 being-0.002
Token over
Token ID625
Feature activation-2.82e-02

Pos logit contributions
 praying+0.009
 whist+0.009
 asking+0.009
 shaving+0.008
 being+0.007
Neg logit contributions
tted-0.025
bridge-0.022
umbs-0.022
hell-0.022
quet-0.021
Token for
Token ID329
Feature activation-2.47e-02

Pos logit contributions
 shaving+0.032
 praying+0.032
 whist+0.031
 asking+0.029
 Potato+0.026
Neg logit contributions
tted-0.062
umbs-0.053
bridge-0.053
quet-0.052
atha-0.051
Token a
Token ID257
Feature activation+1.33e-02

Pos logit contributions
 shaving+0.029
 praying+0.029
 whist+0.029
 asking+0.027
 being+0.024
Neg logit contributions
tted-0.055
umbs-0.048
bridge-0.046
atha-0.046
okingly-0.045
Token fair
Token ID3148
Feature activation+7.02e-03

Pos logit contributions
tted+0.025
umbs+0.022
adders+0.022
bridge+0.021
atters+0.021
Neg logit contributions
 whist-0.020
 praying-0.020
 shaving-0.019
 recording-0.017
 asking-0.017
Tokenyt
Token ID20760
Feature activation+8.30e-02

Pos logit contributions
tted+0.017
umbs+0.015
adders+0.015
aled+0.014
atters+0.014
Neg logit contributions
 whist-0.007
 praying-0.007
 shaving-0.006
 asking-0.006
 daddy-0.006
Tokenale
Token ID1000
Feature activation-7.74e-03

Pos logit contributions
tted+0.160
quet+0.135
There+0.135
 queens+0.134
bridge+0.129
Neg logit contributions
 grandson-0.096
 grandfather-0.094
 temperature-0.092
 asking-0.089
pa-0.087
Token tea
Token ID8887
Feature activation+5.36e-03

Pos logit contributions
 whist+0.010
 praying+0.010
 shaving+0.009
 asking+0.009
 grandson+0.008
Neg logit contributions
tted-0.016
umbs-0.014
atha-0.014
okingly-0.013
igsaw-0.013
Token party
Token ID2151
Feature activation+2.66e-02

Pos logit contributions
tted+0.010
atters+0.008
okingly+0.008
umbs+0.008
igsaw+0.008
Neg logit contributions
 whist-0.008
 praying-0.008
 shaving-0.008
 asking-0.007
 Potato-0.007
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
tted+0.064
atters+0.057
umbs+0.056
ollen+0.056
okingly+0.056
Neg logit contributions
 praying-0.023
 whist-0.022
 shaving-0.021
 asking-0.021
 saying-0.019
Token They
Token ID1119
Feature activation+7.15e-03

Pos logit contributions
 shaving+0.024
 praying+0.023
 whist+0.023
 asking+0.022
 Potato+0.022
Neg logit contributions
tted-0.031
bridge-0.027
quet-0.027
hell-0.026
atha-0.026
Token was
Token ID373
Feature activation-1.91e-03

Pos logit contributions
tted+0.001
atha+0.001
bridge+0.001
umbs+0.001
quet+0.001
Neg logit contributions
 shaving-0.001
 praying-0.001
 whist-0.001
 Potato-0.001
 asking-0.001
Token walking
Token ID6155
Feature activation+3.84e-03

Pos logit contributions
 praying+0.002
 whist+0.002
 shaving+0.002
 asking+0.002
 smoking+0.002
Neg logit contributions
tted-0.004
umbs-0.003
bridge-0.003
quet-0.003
igsaw-0.003
Token through
Token ID832
Feature activation-6.53e-03

Pos logit contributions
tted+0.008
umbs+0.008
atters+0.008
ives+0.008
aney+0.008
Neg logit contributions
 whist-0.004
 asking-0.003
 praying-0.003
 being-0.003
 was-0.003
Token a
Token ID257
Feature activation+8.02e-03

Pos logit contributions
 whist+0.008
 praying+0.008
 shaving+0.007
 asking+0.007
 smoking+0.006
Neg logit contributions
tted-0.014
umbs-0.012
quet-0.012
bridge-0.012
atha-0.012
Token fair
Token ID3148
Feature activation-1.01e-03

Pos logit contributions
tted+0.014
umbs+0.012
bridge+0.012
quet+0.012
adders+0.011
Neg logit contributions
 whist-0.012
 shaving-0.012
 praying-0.012
 asking-0.011
 Potato-0.011
Tokenyt
Token ID20760
Feature activation+7.95e-02

Pos logit contributions
 whist+0.001
 praying+0.001
 shaving+0.001
 asking+0.001
 recording+0.001
Neg logit contributions
tted-0.002
umbs-0.002
adders-0.002
bridge-0.002
atters-0.002
Tokenale
Token ID1000
Feature activation-1.22e-02

Pos logit contributions
tted+0.147
 queens+0.123
There+0.123
quet+0.120
bridge+0.119
Neg logit contributions
 grandson-0.090
 grandfather-0.088
 asking-0.088
 temperature-0.088
ise-0.083
Token.
Token ID13
Feature activation-4.08e-03

Pos logit contributions
 whist+0.016
 praying+0.015
 shaving+0.015
 asking+0.014
 smoking+0.013
Neg logit contributions
tted-0.025
umbs-0.021
atha-0.021
bridge-0.021
okingly-0.021
Token She
Token ID1375
Feature activation+8.45e-03

Pos logit contributions
 shaving+0.005
 asking+0.005
 praying+0.005
 whist+0.005
 Brother+0.004
Neg logit contributions
tted-0.008
quet-0.007
bridge-0.007
aults-0.007
vell-0.007
Token fell
Token ID3214
Feature activation-1.38e-03

Pos logit contributions
tted+0.018
atha+0.015
bridge+0.015
umbs+0.015
quet+0.014
Neg logit contributions
 shaving-0.010
 praying-0.010
 whist-0.009
 Potato-0.009
 asking-0.009
Token asleep
Token ID16039
Feature activation-1.56e-02

Pos logit contributions
 whist+0.001
 praying+0.001
 shaving+0.001
 asking+0.001
 temperature+0.001
Neg logit contributions
tted-0.003
umbs-0.003
atha-0.003
bridge-0.003
quet-0.003
Token and
Token ID290
Feature activation-1.01e-02

Pos logit contributions
 shaving+0.027
 praying+0.026
 whist+0.026
 Potato+0.026
 Brother+0.025
Neg logit contributions
tted-0.031
quet-0.024
bridge-0.024
atha-0.024
 chains-0.023
Token saw
Token ID2497
Feature activation-6.67e-03

Pos logit contributions
 shaving+0.014
 praying+0.013
 whist+0.012
 Potato+0.012
 temperature+0.011
Neg logit contributions
tted-0.020
atha-0.017
bridge-0.016
okingly-0.016
quet-0.016
Token a
Token ID257
Feature activation-1.62e-02

Pos logit contributions
 whist+0.008
 shaving+0.008
 praying+0.008
 asking+0.007
 Potato+0.007
Neg logit contributions
tted-0.014
umbs-0.012
quet-0.012
bridge-0.012
atha-0.012
Token magical
Token ID10883
Feature activation-1.22e-02

Pos logit contributions
 shaving+0.029
 praying+0.028
 asking+0.026
ise+0.026
 whist+0.026
Neg logit contributions
tted-0.023
quet-0.017
 frame-0.017
hell-0.017
igsaw-0.017
Token fair
Token ID3148
Feature activation-1.41e-02

Pos logit contributions
 shaving+0.019
 praying+0.018
 whist+0.018
 Brother+0.017
 asking+0.017
Neg logit contributions
tted-0.020
quet-0.016
bridge-0.016
igsaw-0.016
 frame-0.016
Tokenyt
Token ID20760
Feature activation+7.27e-02

Pos logit contributions
 whist+0.019
 praying+0.019
 shaving+0.019
 asking+0.017
 Potato+0.016
Neg logit contributions
tted-0.028
umbs-0.024
adders-0.023
bridge-0.023
atha-0.023
Tokenale
Token ID1000
Feature activation-2.13e-02

Pos logit contributions
tted+0.132
 queens+0.114
quet+0.111
umbs+0.110
There+0.109
Neg logit contributions
 asking-0.088
 temperature-0.088
 grandfather-0.087
 whist-0.087
 grandson-0.086
Token creature
Token ID7185
Feature activation-1.59e-02

Pos logit contributions
 shaving+0.027
 whist+0.027
 praying+0.027
 asking+0.024
 Potato+0.023
Neg logit contributions
tted-0.043
umbs-0.037
atha-0.036
bridge-0.036
quet-0.036
Token called
Token ID1444
Feature activation-1.88e-02

Pos logit contributions
 shaving+0.019
 praying+0.019
 whist+0.019
 asking+0.017
 Potato+0.017
Neg logit contributions
tted-0.033
bridge-0.028
umbs-0.028
quet-0.028
atha-0.027
Token a
Token ID257
Feature activation-2.50e-02

Pos logit contributions
 shaving+0.025
 praying+0.025
 whist+0.025
 asking+0.022
 Potato+0.022
Neg logit contributions
tted-0.036
umbs-0.030
bridge-0.030
quet-0.030
atha-0.030
Token n
Token ID299
Feature activation-1.69e-02

Pos logit contributions
 shaving+0.041
 praying+0.040
 whist+0.038
 asking+0.038
ise+0.037
Neg logit contributions
tted-0.038
quet-0.031
 frame-0.029
igsaw-0.029
hell-0.029
Token went
Token ID1816
Feature activation+6.37e-03

Pos logit contributions
tted+0.029
umbs+0.025
quet+0.024
bridge+0.024
igsaw+0.024
Neg logit contributions
 whist-0.020
 praying-0.019
 shaving-0.019
 asking-0.017
 Potato-0.016
Token on
Token ID319
Feature activation-4.10e-03

Pos logit contributions
tted+0.015
umbs+0.014
bridge+0.014
aney+0.014
quet+0.013
Neg logit contributions
 whist-0.007
 praying-0.006
 asking-0.006
 shaving-0.006
 being-0.005
Token adventures
Token ID17545
Feature activation+1.25e-02

Pos logit contributions
 shaving+0.006
 praying+0.006
 whist+0.006
 asking+0.006
 Potato+0.006
Neg logit contributions
tted-0.007
umbs-0.006
bridge-0.006
quet-0.006
atha-0.006
Token,
Token ID11
Feature activation-2.34e-03

Pos logit contributions
tted+0.024
umbs+0.020
bridge+0.020
quet+0.020
atha+0.020
Neg logit contributions
 shaving-0.017
 praying-0.017
 whist-0.017
 asking-0.015
 Potato-0.015
Token had
Token ID550
Feature activation+2.84e-02

Pos logit contributions
 whist+0.003
 praying+0.002
 shaving+0.002
 asking+0.002
 being+0.002
Neg logit contributions
tted-0.005
quet-0.005
igsaw-0.005
umbs-0.005
okingly-0.005
Token pic
Token ID8301
Feature activation+6.76e-02

Pos logit contributions
tted+0.050
quet+0.042
igsaw+0.042
bridge+0.041
okingly+0.041
Neg logit contributions
 whist-0.045
 shaving-0.042
 praying-0.042
 asking-0.040
 Potato-0.038
Tokenn
Token ID77
Feature activation+3.42e-02

Pos logit contributions
tted+0.117
bridge+0.104
umbs+0.102
adders+0.100
okingly+0.099
Neg logit contributions
 whist-0.098
 shaving-0.095
 praying-0.093
 asking-0.086
 being-0.083
Tokenics
Token ID873
Feature activation+1.48e-02

Pos logit contributions
tted+0.056
igsaw+0.049
fy+0.046
okingly+0.046
quet+0.046
Neg logit contributions
 whist-0.057
 praying-0.054
 shaving-0.051
 asking-0.049
 being-0.048
Token,
Token ID11
Feature activation-4.17e-03

Pos logit contributions
tted+0.033
umbs+0.029
igsaw+0.029
bridge+0.029
quet+0.029
Neg logit contributions
 whist-0.017
 praying-0.015
 shaving-0.015
 asking-0.013
 being-0.013
Token and
Token ID290
Feature activation+8.53e-03

Pos logit contributions
 whist+0.005
 praying+0.004
 shaving+0.004
 asking+0.004
 being+0.003
Neg logit contributions
tted-0.010
igsaw-0.009
quet-0.009
okingly-0.009
umbs-0.008
Token even
Token ID772
Feature activation-1.34e-02

Pos logit contributions
tted+0.019
igsaw+0.016
quet+0.016
umbs+0.016
okingly+0.016
Neg logit contributions
 whist-0.012
 praying-0.010
 asking-0.010
 shaving-0.010
 being-0.009
Token their
Token ID511
Feature activation+2.99e-02

Pos logit contributions
 shaving+0.015
 whist+0.015
 praying+0.015
 Potato+0.014
 pap+0.014
Neg logit contributions
tted-0.018
igsaw-0.016
atha-0.015
quet-0.014
okingly-0.014
Token adventure
Token ID8855
Feature activation+1.67e-02

Pos logit contributions
tted+0.055
umbs+0.045
bridge+0.045
atha+0.044
quet+0.044
Neg logit contributions
 shaving-0.043
 praying-0.043
 whist-0.043
 asking-0.039
 Potato-0.038
Token.
Token ID13
Feature activation-3.59e-03

Pos logit contributions
tted+0.035
umbs+0.030
bridge+0.029
quet+0.029
atha+0.029
Neg logit contributions
 shaving-0.020
 praying-0.020
 whist-0.020
 asking-0.018
 Potato-0.017
Token After
Token ID2293
Feature activation+1.36e-03

Pos logit contributions
 praying+0.005
 shaving+0.005
 whist+0.005
 asking+0.005
 Potato+0.005
Neg logit contributions
tted-0.006
bridge-0.005
quet-0.005
umbs-0.005
atha-0.005
Token a
Token ID257
Feature activation+2.49e-02

Pos logit contributions
tted+0.003
umbs+0.003
atha+0.002
bridge+0.002
quet+0.002
Neg logit contributions
 praying-0.002
 whist-0.002
 shaving-0.002
 asking-0.001
 Potato-0.001
Token year
Token ID614
Feature activation+6.72e-02

Pos logit contributions
tted+0.039
umbs+0.031
bridge+0.031
quet+0.030
atha+0.030
Neg logit contributions
 shaving-0.043
 praying-0.043
 whist-0.043
 asking-0.040
 Potato-0.039
Token of
Token ID286
Feature activation+2.23e-02

Pos logit contributions
tted+0.150
bridge+0.132
umbs+0.131
quet+0.127
atha+0.125
Neg logit contributions
 shaving-0.072
 praying-0.072
 whist-0.071
 asking-0.065
 Potato-0.058
Token traveling
Token ID11300
Feature activation+1.81e-02

Pos logit contributions
tted+0.044
umbs+0.037
bridge+0.036
quet+0.036
atha+0.036
Neg logit contributions
 shaving-0.030
 praying-0.030
 whist-0.030
 asking-0.027
 Potato-0.026
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
tted+0.043
umbs+0.040
bridge+0.038
quet+0.038
hell+0.037
Neg logit contributions
 whist-0.016
 praying-0.016
 shaving-0.015
 asking-0.014
 being-0.013
Token they
Token ID484
Feature activation+9.42e-03

Pos logit contributions
tted+0.027
umbs+0.023
quet+0.022
bridge+0.022
hell+0.022
Neg logit contributions
 praying-0.018
 whist-0.017
 shaving-0.017
 asking-0.016
 Potato-0.015
Token finally
Token ID3443
Feature activation+5.23e-03

Pos logit contributions
tted+0.018
umbs+0.015
bridge+0.015
quet+0.015
atha+0.015
Neg logit contributions
 shaving-0.013
 praying-0.013
 whist-0.013
 asking-0.011
 Potato-0.011
Token liked
Token ID8288
Feature activation-1.12e-02

Pos logit contributions
tted+0.016
quet+0.015
umbs+0.014
igsaw+0.014
bridge+0.014
Neg logit contributions
 whist-0.008
 shaving-0.007
 praying-0.007
 asking-0.007
 Potato-0.006
Token to
Token ID284
Feature activation-8.93e-03

Pos logit contributions
 shaving+0.017
 praying+0.016
 whist+0.016
 Potato+0.015
 Brother+0.015
Neg logit contributions
tted-0.018
 queens-0.015
umbs-0.015
adders-0.015
bridge-0.015
Token play
Token ID711
Feature activation+1.22e-02

Pos logit contributions
 whist+0.012
 shaving+0.012
 praying+0.011
 asking+0.010
 Potato+0.010
Neg logit contributions
tted-0.017
umbs-0.015
quet-0.015
bridge-0.015
igsaw-0.014
Token and
Token ID290
Feature activation+7.47e-03

Pos logit contributions
tted+0.018
bridge+0.014
quet+0.014
adders+0.014
atha+0.013
Neg logit contributions
 praying-0.022
 shaving-0.022
 whist-0.021
 asking-0.020
 Brother-0.020
Token have
Token ID423
Feature activation+2.95e-04

Pos logit contributions
tted+0.012
umbs+0.010
quet+0.010
bridge+0.010
igsaw+0.009
Neg logit contributions
 whist-0.013
 shaving-0.012
 praying-0.012
 asking-0.012
 Potato-0.011
Token pic
Token ID8301
Feature activation+6.69e-02

Pos logit contributions
tted+0.001
quet+0.001
umbs+0.001
okingly+0.001
igsaw+0.001
Neg logit contributions
 whist-0.000
 shaving-0.000
 praying-0.000
 being-0.000
 asking-0.000
Tokenn
Token ID77
Feature activation+2.82e-02

Pos logit contributions
tted+0.111
bridge+0.094
umbs+0.094
quet+0.090
adders+0.089
Neg logit contributions
 shaving-0.105
 whist-0.104
 praying-0.103
 asking-0.095
 Potato-0.090
Tokenics
Token ID873
Feature activation+8.81e-03

Pos logit contributions
tted+0.042
quet+0.033
umbs+0.033
igsaw+0.032
bridge+0.032
Neg logit contributions
 whist-0.051
 shaving-0.050
 praying-0.050
 asking-0.046
 Potato-0.044
Token.
Token ID13
Feature activation-7.15e-03

Pos logit contributions
tted+0.017
umbs+0.015
bridge+0.015
quet+0.015
igsaw+0.015
Neg logit contributions
 whist-0.012
 praying-0.011
 shaving-0.011
 asking-0.010
 Potato-0.010
Token One
Token ID1881
Feature activation-4.57e-03

Pos logit contributions
 praying+0.012
 whist+0.012
 shaving+0.011
 pap+0.010
 temperature+0.010
Neg logit contributions
tted-0.012
atha-0.010
igsaw-0.010
umbs-0.010
okingly-0.010
Token day
Token ID1110
Feature activation-1.95e-02

Pos logit contributions
 shaving+0.007
 praying+0.007
 whist+0.007
 asking+0.007
 Potato+0.006
Neg logit contributions
tted-0.008
umbs-0.007
igator-0.006
hell-0.006
atha-0.006
Token roof
Token ID9753
Feature activation+4.42e-03

Pos logit contributions
tted+0.035
umbs+0.030
quet+0.029
bridge+0.028
okingly+0.028
Neg logit contributions
 praying-0.025
 shaving-0.025
 whist-0.024
 asking-0.022
 smoking-0.021
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
tted+0.010
atha+0.008
igsaw+0.008
bridge+0.008
quet+0.008
Neg logit contributions
 praying-0.005
 whist-0.005
 shaving-0.004
 asking-0.004
 working-0.004
Token They
Token ID1119
Feature activation+1.64e-02

Pos logit contributions
tted+0.027
quet+0.025
bridge+0.024
atha+0.023
fy+0.023
Neg logit contributions
 whist-0.016
 praying-0.016
 Brother-0.016
 shaving-0.016
 Potato-0.015
Token would
Token ID561
Feature activation-3.21e-03

Pos logit contributions
tted+0.035
umbs+0.030
igsaw+0.030
quet+0.030
bridge+0.029
Neg logit contributions
 whist-0.020
 praying-0.019
 shaving-0.019
 asking-0.017
 pap-0.016
Token have
Token ID423
Feature activation+9.33e-03

Pos logit contributions
 whist+0.004
 praying+0.004
 shaving+0.004
 asking+0.004
 pap+0.003
Neg logit contributions
tted-0.007
umbs-0.006
igsaw-0.006
bridge-0.006
adders-0.006
Token pic
Token ID8301
Feature activation+6.68e-02

Pos logit contributions
tted+0.018
quet+0.015
igsaw+0.014
bridge+0.014
atha+0.014
Neg logit contributions
 whist-0.014
 praying-0.013
 shaving-0.013
 asking-0.012
 pap-0.011
Tokenn
Token ID77
Feature activation+3.52e-02

Pos logit contributions
tted+0.114
bridge+0.099
umbs+0.096
adders+0.094
okingly+0.093
Neg logit contributions
 whist-0.101
 shaving-0.100
 praying-0.098
 asking-0.090
 being-0.084
Tokenics
Token ID873
Feature activation+1.13e-02

Pos logit contributions
tted+0.060
igsaw+0.051
fy+0.050
okingly+0.049
igator+0.049
Neg logit contributions
 whist-0.056
 praying-0.054
 shaving-0.051
 being-0.049
 asking-0.048
Token and
Token ID290
Feature activation+1.21e-02

Pos logit contributions
tted+0.025
umbs+0.022
igsaw+0.021
bridge+0.021
quet+0.021
Neg logit contributions
 whist-0.013
 praying-0.012
 shaving-0.011
 asking-0.010
 being-0.010
Token play
Token ID711
Feature activation+1.94e-02

Pos logit contributions
tted+0.022
umbs+0.018
bridge+0.018
quet+0.018
hell+0.018
Neg logit contributions
 praying-0.018
 whist-0.018
 shaving-0.018
 asking-0.017
 pap-0.016
Token games
Token ID1830
Feature activation+2.52e-02

Pos logit contributions
tted+0.033
umbs+0.028
quet+0.027
okingly+0.027
atha+0.026
Neg logit contributions
 whist-0.030
 praying-0.030
 shaving-0.030
 asking-0.027
 Potato-0.026
Token,
Token ID11
Feature activation-5.80e-03

Pos logit contributions
 shaving+0.020
 praying+0.019
 whist+0.019
 Potato+0.018
 Brother+0.017
Neg logit contributions
tted-0.027
quet-0.022
umbs-0.022
atha-0.021
hell-0.021
Token ready
Token ID3492
Feature activation-1.14e-02

Pos logit contributions
 whist+0.008
 shaving+0.008
 praying+0.008
 asking+0.007
 Potato+0.007
Neg logit contributions
tted-0.012
umbs-0.010
bridge-0.010
quet-0.010
atha-0.009
Token to
Token ID284
Feature activation+1.66e-02

Pos logit contributions
 shaving+0.014
 praying+0.014
 whist+0.014
 asking+0.012
 Potato+0.012
Neg logit contributions
tted-0.023
umbs-0.020
bridge-0.019
quet-0.019
atha-0.019
Token have
Token ID423
Feature activation+1.75e-02

Pos logit contributions
tted+0.029
umbs+0.024
bridge+0.024
adders+0.024
okingly+0.024
Neg logit contributions
 whist-0.026
 praying-0.026
 shaving-0.026
 asking-0.023
 Potato-0.022
Token more
Token ID517
Feature activation+9.23e-04

Pos logit contributions
tted+0.034
umbs+0.028
bridge+0.028
quet+0.028
atha+0.027
Neg logit contributions
 shaving-0.024
 praying-0.024
 whist-0.024
 asking-0.021
 Potato-0.021
Token pic
Token ID8301
Feature activation+6.68e-02

Pos logit contributions
tted+0.002
bridge+0.001
umbs+0.001
quet+0.001
atha+0.001
Neg logit contributions
 shaving-0.001
 whist-0.001
 praying-0.001
 asking-0.001
 Potato-0.001
Tokenn
Token ID77
Feature activation+4.11e-02

Pos logit contributions
tted+0.112
bridge+0.100
umbs+0.096
adders+0.094
okingly+0.092
Neg logit contributions
 shaving-0.101
 whist-0.100
 praying-0.099
 asking-0.091
 being-0.084
Tokenics
Token ID873
Feature activation+1.29e-02

Pos logit contributions
tted+0.061
bridge+0.047
quet+0.047
igsaw+0.047
umbs+0.046
Neg logit contributions
 whist-0.073
 praying-0.073
 shaving-0.073
 asking-0.068
 smoking-0.064
Token in
Token ID287
Feature activation+1.79e-02

Pos logit contributions
tted+0.029
umbs+0.026
bridge+0.026
quet+0.025
atters+0.025
Neg logit contributions
 whist-0.013
 praying-0.013
 shaving-0.013
 asking-0.011
 being-0.010
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
tted+0.037
bridge+0.031
umbs+0.031
quet+0.030
atha+0.030
Neg logit contributions
 shaving-0.023
 praying-0.023
 whist-0.022
 asking-0.020
 Potato-0.019
Token future
Token ID2003
Feature activation+1.53e-03

Pos logit contributions
tted+0.027
umbs+0.022
bridge+0.022
quet+0.022
atha+0.022
Neg logit contributions
 shaving-0.018
 praying-0.018
 whist-0.017
 asking-0.016
 Potato-0.015
Token something
Token ID1223
Feature activation-1.34e-02

Pos logit contributions
tted+0.006
bridge+0.005
umbs+0.005
quet+0.005
atha+0.005
Neg logit contributions
 shaving-0.003
 praying-0.003
 whist-0.003
 asking-0.003
 Potato-0.003
Token.
Token ID13
Feature activation+7.85e-03

Pos logit contributions
 Brother+0.021
 shaving+0.021
 Potato+0.020
 grandson+0.020
 praying+0.020
Neg logit contributions
tted-0.021
hell-0.018
quet-0.017
bridge-0.017
 chains-0.017
Token A
Token ID317
Feature activation+2.18e-03

Pos logit contributions
tted+0.012
umbs+0.009
bridge+0.009
quet+0.009
atha+0.009
Neg logit contributions
 shaving-0.014
 whist-0.014
 praying-0.014
 asking-0.013
 Potato-0.012
Token little
Token ID1310
Feature activation+2.06e-02

Pos logit contributions
tted+0.003
quet+0.003
bridge+0.002
umbs+0.002
igsaw+0.002
Neg logit contributions
 shaving-0.004
 praying-0.004
 whist-0.004
 asking-0.004
 Potato-0.003
Token fair
Token ID3148
Feature activation-9.81e-03

Pos logit contributions
tted+0.037
umbs+0.031
bridge+0.031
quet+0.030
atha+0.030
Neg logit contributions
 whist-0.031
 shaving-0.031
 praying-0.031
 asking-0.028
 Potato-0.027
Tokenyt
Token ID20760
Feature activation+6.53e-02

Pos logit contributions
 whist+0.013
 praying+0.012
 shaving+0.012
 pap+0.011
 Potato+0.011
Neg logit contributions
tted-0.021
umbs-0.019
adders-0.017
bridge-0.017
ollen-0.017
Tokenale
Token ID1000
Feature activation-1.50e-02

Pos logit contributions
tted+0.117
umbs+0.096
drops+0.094
 queens+0.093
quet+0.091
Neg logit contributions
 whist-0.091
 asking-0.087
 praying-0.086
 shaving-0.084
 grandfather-0.084
Token bird
Token ID6512
Feature activation-1.03e-02

Pos logit contributions
 whist+0.022
 praying+0.021
 shaving+0.019
 asking+0.018
 smoking+0.018
Neg logit contributions
tted-0.030
umbs-0.026
atha-0.025
fy-0.025
adders-0.024
Token was
Token ID373
Feature activation-1.05e-02

Pos logit contributions
 shaving+0.017
 whist+0.017
 praying+0.017
 asking+0.015
 Potato+0.015
Neg logit contributions
tted-0.017
umbs-0.014
bridge-0.013
quet-0.013
atha-0.013
Token singing
Token ID13777
Feature activation+7.96e-03

Pos logit contributions
 Brother+0.014
 grandson+0.013
 Potato+0.013
 Man+0.013
 shaving+0.013
Neg logit contributions
tted-0.018
bridge-0.016
 free-0.016
quet-0.015
 Sue-0.015
Token a
Token ID257
Feature activation+7.90e-03

Pos logit contributions
tted+0.017
umbs+0.015
atha+0.015
okingly+0.015
igsaw+0.015
Neg logit contributions
 whist-0.009
 praying-0.009
 shaving-0.009
 asking-0.008
 smoking-0.007
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
 shaving+0.011
 whist+0.010
 praying+0.010
 Potato+0.010
 Brother+0.010
Neg logit contributions
tted-0.012
 of-0.010
umbs-0.010
hell-0.010
 Sue-0.009
Token<|endoftext|>
Token ID50256
Feature activation-2.75e-02

Pos logit contributions
tted+0.022
umbs+0.020
bridge+0.018
atters+0.018
quet+0.018
Neg logit contributions
 praying-0.012
 shaving-0.012
 whist-0.012
 asking-0.011
 razor-0.010
TokenBen
Token ID11696
Feature activation+4.50e-03

Pos logit contributions
 praying+0.031
 pap+0.028
 whist+0.028
 asking+0.027
 shaving+0.027
Neg logit contributions
tted-0.062
umbs-0.055
There-0.051
ay-0.051
bridge-0.051
Token loves
Token ID10408
Feature activation-2.13e-02

Pos logit contributions
tted+0.009
atha+0.007
umbs+0.007
ives+0.007
okingly+0.007
Neg logit contributions
 praying-0.006
 shaving-0.006
 whist-0.006
 asking-0.005
 Potato-0.005
Token his
Token ID465
Feature activation-6.32e-04

Pos logit contributions
 praying+0.030
 shaving+0.029
 whist+0.028
ise+0.028
 Potato+0.028
Neg logit contributions
tted-0.031
 queens-0.029
 of-0.027
quet-0.027
igator-0.027
Token sc
Token ID629
Feature activation+6.50e-02

Pos logit contributions
 praying+0.001
 shaving+0.001
 whist+0.001
 asking+0.001
 smoking+0.001
Neg logit contributions
tted-0.001
umbs-0.001
quet-0.001
igsaw-0.001
bridge-0.001
Tokenooter
Token ID25141
Feature activation+3.51e-02

Pos logit contributions
tted+0.091
bridge+0.090
There+0.078
quet+0.074
atha+0.073
Neg logit contributions
 shaving-0.111
 whist-0.106
 praying-0.105
 grandson-0.104
 pap-0.097
Token.
Token ID13
Feature activation+3.28e-02

Pos logit contributions
tted+0.077
umbs+0.065
bridge+0.065
quet+0.065
atha+0.064
Neg logit contributions
 shaving-0.038
 praying-0.038
 whist-0.037
 asking-0.033
 Potato-0.032
Token He
Token ID679
Feature activation-1.42e-04

Pos logit contributions
tted+0.063
quet+0.053
umbs+0.053
bridge+0.052
atha+0.052
Neg logit contributions
 shaving-0.045
 whist-0.045
 praying-0.044
 asking-0.040
 Potato-0.040
Token rides
Token ID17445
Feature activation+2.40e-02

Pos logit contributions
 shaving+0.000
 whist+0.000
 praying+0.000
 asking+0.000
 Potato+0.000
Neg logit contributions
tted-0.000
umbs-0.000
bridge-0.000
quet-0.000
atha-0.000
Token it
Token ID340
Feature activation+5.79e-03

Pos logit contributions
tted+0.051
umbs+0.045
quet+0.044
bridge+0.043
atha+0.043
Neg logit contributions
 whist-0.028
 shaving-0.028
 praying-0.027
 asking-0.024
 Potato-0.023
Token see
Token ID766
Feature activation-1.13e-03

Pos logit contributions
tted+0.032
umbs+0.028
bridge+0.027
quet+0.026
okingly+0.026
Neg logit contributions
 whist-0.029
 praying-0.029
 shaving-0.029
 asking-0.026
 Potato-0.025
Token the
Token ID262
Feature activation-7.29e-03

Pos logit contributions
 shaving+0.001
 praying+0.001
 whist+0.001
 asking+0.001
 Potato+0.001
Neg logit contributions
tted-0.002
umbs-0.002
bridge-0.002
quet-0.002
atha-0.002
Token bird
Token ID6512
Feature activation-1.31e-02

Pos logit contributions
 whist+0.011
 praying+0.011
 shaving+0.011
 asking+0.010
 Potato+0.010
Neg logit contributions
tted-0.013
umbs-0.011
quet-0.011
bridge-0.011
atha-0.011
Token and
Token ID290
Feature activation+3.31e-03

Pos logit contributions
 whist+0.014
 praying+0.014
 shaving+0.013
 asking+0.012
 smoking+0.011
Neg logit contributions
tted-0.030
atha-0.025
igsaw-0.025
bridge-0.025
okingly-0.025
Token have
Token ID423
Feature activation+8.00e-03

Pos logit contributions
tted+0.007
umbs+0.006
quet+0.006
bridge+0.006
igsaw+0.006
Neg logit contributions
 whist-0.005
 praying-0.004
 shaving-0.004
 asking-0.004
 pap-0.004
Token pic
Token ID8301
Feature activation+6.46e-02

Pos logit contributions
tted+0.016
quet+0.013
igsaw+0.013
umbs+0.013
bridge+0.013
Neg logit contributions
 whist-0.011
 praying-0.011
 shaving-0.011
 asking-0.010
 pap-0.009
Tokenn
Token ID77
Feature activation+3.26e-02

Pos logit contributions
tted+0.111
bridge+0.096
umbs+0.094
adders+0.092
 queens+0.091
Neg logit contributions
 whist-0.096
 shaving-0.095
 praying-0.094
 asking-0.087
 being-0.080
Tokenics
Token ID873
Feature activation+6.03e-03

Pos logit contributions
tted+0.052
igsaw+0.045
fy+0.044
okingly+0.043
quet+0.042
Neg logit contributions
 whist-0.053
 praying-0.052
 shaving-0.050
 asking-0.047
 being-0.046
Token.
Token ID13
Feature activation-3.75e-03

Pos logit contributions
tted+0.013
umbs+0.011
igsaw+0.011
bridge+0.011
atters+0.011
Neg logit contributions
 whist-0.007
 praying-0.007
 shaving-0.007
 asking-0.006
 being-0.006
Token<|endoftext|>
Token ID50256
Feature activation-2.35e-02

Pos logit contributions
 whist+0.002
 praying+0.002
 shaving+0.002
 Potato+0.002
 Brother+0.002
Neg logit contributions
tted-0.010
bridge-0.009
quet-0.009
atha-0.009
adders-0.009
TokenOnce
Token ID7454
Feature activation+2.63e-02

Pos logit contributions
 praying+0.031
 shaving+0.030
 pap+0.029
 whist+0.029
 asking+0.028
Neg logit contributions
tted-0.049
umbs-0.041
hell-0.038
bridge-0.038
ay-0.038
Token and
Token ID290
Feature activation+1.42e-02

Pos logit contributions
tted+0.012
umbs+0.010
atha+0.010
quet+0.010
bridge+0.010
Neg logit contributions
 whist-0.005
 praying-0.005
 shaving-0.005
 asking-0.004
 Potato-0.004
Token you
Token ID345
Feature activation-2.85e-04

Pos logit contributions
tted+0.028
umbs+0.025
quet+0.024
bridge+0.023
okingly+0.023
Neg logit contributions
 whist-0.021
 praying-0.020
 shaving-0.019
 asking-0.018
 being-0.018
Token can
Token ID460
Feature activation+1.12e-03

Pos logit contributions
 whist+0.000
 praying+0.000
 shaving+0.000
 asking+0.000
 was+0.000
Neg logit contributions
tted-0.001
bridge-0.001
igsaw-0.001
quet-0.000
umbs-0.000
Token make
Token ID787
Feature activation-1.38e-03

Pos logit contributions
tted+0.002
bridge+0.002
umbs+0.002
igsaw+0.002
quet+0.002
Neg logit contributions
 whist-0.002
 shaving-0.002
 praying-0.002
 asking-0.001
 Potato-0.001
Token a
Token ID257
Feature activation+3.42e-03

Pos logit contributions
 shaving+0.002
 whist+0.002
 praying+0.002
 asking+0.002
 Potato+0.002
Neg logit contributions
tted-0.003
umbs-0.002
bridge-0.002
quet-0.002
atha-0.002
Token bou
Token ID35833
Feature activation+6.44e-02

Pos logit contributions
tted+0.007
umbs+0.006
bridge+0.006
adders+0.005
ives+0.005
Neg logit contributions
 whist-0.005
 praying-0.005
 shaving-0.005
 pap-0.004
 asking-0.004
Tokenquet
Token ID21108
Feature activation+3.29e-02

Pos logit contributions
tted+0.053
okingly+0.048
 Pam+0.048
aney+0.047
igator+0.047
Neg logit contributions
 whist-0.130
 praying-0.130
 shaving-0.124
ning-0.123
 recording-0.119
Token for
Token ID329
Feature activation-7.53e-03

Pos logit contributions
tted+0.082
Meg+0.072
okingly+0.072
atha+0.071
Tweet+0.070
Neg logit contributions
 whist-0.020
 praying-0.020
 being-0.017
 living-0.017
 working-0.017
Token mom
Token ID1995
Feature activation+1.82e-02

Pos logit contributions
 shaving+0.010
 praying+0.010
 whist+0.010
 asking+0.009
 Potato+0.009
Neg logit contributions
tted-0.014
umbs-0.012
bridge-0.012
quet-0.012
atha-0.012
Token,"
Token ID553
Feature activation+4.33e-03

Pos logit contributions
tted+0.042
bridge+0.035
quet+0.035
umbs+0.035
atha+0.035
Neg logit contributions
 shaving-0.019
 praying-0.019
 whist-0.019
 asking-0.017
 Potato-0.015
Token he
Token ID339
Feature activation+3.02e-03

Pos logit contributions
tted+0.007
 Pam+0.006
quet+0.006
okingly+0.006
atha+0.006
Neg logit contributions
 shaving-0.006
 praying-0.006
 whist-0.006
 temperature-0.006
 asking-0.005
Token go
Token ID467
Feature activation+9.94e-04

Pos logit contributions
tted+0.027
umbs+0.023
bridge+0.022
quet+0.022
igsaw+0.022
Neg logit contributions
 whist-0.024
 praying-0.024
 shaving-0.023
 asking-0.022
 Potato-0.021
Token outside
Token ID2354
Feature activation-1.26e-03

Pos logit contributions
tted+0.002
umbs+0.002
aney+0.002
igsaw+0.002
bridge+0.002
Neg logit contributions
 asking-0.001
 whist-0.001
 shaving-0.001
 praying-0.001
 being-0.001
Token for
Token ID329
Feature activation-9.92e-03

Pos logit contributions
 praying+0.001
 whist+0.001
 shaving+0.001
 asking+0.001
 Potato+0.001
Neg logit contributions
tted-0.003
umbs-0.002
bridge-0.002
quet-0.002
atha-0.002
Token a
Token ID257
Feature activation+2.41e-02

Pos logit contributions
 shaving+0.016
 praying+0.015
 whist+0.015
 asking+0.014
 Potato+0.014
Neg logit contributions
tted-0.017
umbs-0.014
bridge-0.014
quet-0.014
atha-0.014
Token whole
Token ID2187
Feature activation+3.04e-02

Pos logit contributions
tted+0.042
bridge+0.037
umbs+0.037
quet+0.035
adders+0.035
Neg logit contributions
 whist-0.038
 praying-0.037
 shaving-0.036
 asking-0.033
 pap-0.032
Token year
Token ID614
Feature activation+6.44e-02

Pos logit contributions
tted+0.057
umbs+0.048
bridge+0.047
quet+0.047
atha+0.047
Neg logit contributions
 shaving-0.043
 praying-0.042
 whist-0.042
 asking-0.038
 Potato-0.037
Token!
Token ID0
Feature activation-8.16e-03

Pos logit contributions
tted+0.155
bridge+0.137
umbs+0.136
quet+0.132
atha+0.129
Neg logit contributions
 praying-0.059
 shaving-0.058
 whist-0.056
 asking-0.053
 temperature-0.046
Token
Token ID198
Feature activation-5.62e-04

Pos logit contributions
 whist+0.010
 shaving+0.010
 praying+0.010
 asking+0.008
 Potato+0.008
Neg logit contributions
tted-0.018
umbs-0.015
atha-0.014
okingly-0.014
quet-0.014
Token
Token ID198
Feature activation+1.38e-02

Pos logit contributions
 shaving+0.001
 praying+0.001
 whist+0.001
 asking+0.000
 Potato+0.000
Neg logit contributions
tted-0.001
umbs-0.001
quet-0.001
atha-0.001
bridge-0.001
TokenEventually
Token ID28724
Feature activation+9.59e-03

Pos logit contributions
tted+0.027
umbs+0.025
igsaw+0.025
bridge+0.024
fy+0.024
Neg logit contributions
 shaving-0.016
 whist-0.016
 praying-0.016
 asking-0.015
Playing-0.013
Token,
Token ID11
Feature activation+1.31e-02

Pos logit contributions
tted+0.025
umbs+0.022
atha+0.021
ives+0.021
bridge+0.021
Neg logit contributions
 praying-0.009
 whist-0.008
 shaving-0.008
 asking-0.007
 being-0.006
Token the
Token ID262
Feature activation+1.13e-02

Pos logit contributions
tted+0.028
umbs+0.023
bridge+0.023
quet+0.023
atha+0.023
Neg logit contributions
 shaving-0.018
 praying-0.018
 whist-0.018
 asking-0.016
 Potato-0.015
Token park
Token ID3952
Feature activation+2.02e-02

Pos logit contributions
tted+0.018
bridge+0.014
fy+0.014
 chains+0.013
ay+0.013
Neg logit contributions
 praying-0.019
 shaving-0.019
 asking-0.018
 whist-0.018
ise-0.017
Token,
Token ID11
Feature activation-1.85e-03

Pos logit contributions
tted+0.045
umbs+0.039
quet+0.038
bridge+0.038
atha+0.038
Neg logit contributions
 whist-0.021
 shaving-0.021
 praying-0.021
 asking-0.019
 Potato-0.018
Token and
Token ID290
Feature activation-3.55e-04

Pos logit contributions
 whist+0.002
 praying+0.002
 shaving+0.002
 asking+0.002
 Potato+0.002
Neg logit contributions
tted-0.004
quet-0.004
umbs-0.004
igsaw-0.003
atha-0.003
Token have
Token ID423
Feature activation+1.22e-02

Pos logit contributions
 whist+0.000
 praying+0.000
 shaving+0.000
 asking+0.000
 smoking+0.000
Neg logit contributions
tted-0.001
umbs-0.001
quet-0.001
bridge-0.001
hell-0.001
Token pic
Token ID8301
Feature activation+6.44e-02

Pos logit contributions
tted+0.022
quet+0.019
umbs+0.018
igsaw+0.018
okingly+0.018
Neg logit contributions
 whist-0.018
 shaving-0.017
 praying-0.017
 asking-0.016
 pap-0.015
Tokenn
Token ID77
Feature activation+2.75e-02

Pos logit contributions
tted+0.112
bridge+0.099
umbs+0.098
 queens+0.096
okingly+0.095
Neg logit contributions
 whist-0.091
 shaving-0.090
 praying-0.087
 asking-0.081
 being-0.076
Tokenics
Token ID873
Feature activation+7.48e-03

Pos logit contributions
tted+0.042
igsaw+0.035
quet+0.035
fy+0.034
okingly+0.034
Neg logit contributions
 whist-0.047
 praying-0.045
 shaving-0.045
 asking-0.042
 being-0.040
Token outside
Token ID2354
Feature activation-1.62e-04

Pos logit contributions
tted+0.016
umbs+0.014
quet+0.014
bridge+0.014
atters+0.014
Neg logit contributions
 whist-0.009
 praying-0.008
 shaving-0.008
 asking-0.007
 Potato-0.007
Token.
Token ID13
Feature activation+4.47e-03

Pos logit contributions
 praying+0.000
 whist+0.000
 shaving+0.000
 asking+0.000
 Potato+0.000
Neg logit contributions
tted-0.000
umbs-0.000
quet-0.000
bridge-0.000
atha-0.000
Token
Token ID220
Feature activation-2.05e-02

Pos logit contributions
tted+0.009
quet+0.008
bridge+0.007
hell+0.007
atha+0.007
Neg logit contributions
 whist-0.006
 shaving-0.006
 praying-0.006
 Brother-0.006
 Potato-0.006
Token near
Token ID1474
Feature activation-1.59e-02

Pos logit contributions
tted+0.046
bridge+0.040
umbs+0.040
quet+0.040
igsaw+0.039
Neg logit contributions
 whist-0.026
 praying-0.025
 shaving-0.024
 asking-0.022
 Potato-0.020
Token the
Token ID262
Feature activation+4.97e-03

Pos logit contributions
 shaving+0.027
 praying+0.026
 whist+0.025
 Potato+0.025
 Brother+0.024
Neg logit contributions
tted-0.023
adders-0.019
 chains-0.019
 frame-0.019
 Sue-0.018
Token cliff
Token ID19516
Feature activation+3.28e-03

Pos logit contributions
tted+0.007
umbs+0.005
quet+0.005
bridge+0.005
atha+0.005
Neg logit contributions
 shaving-0.010
 praying-0.010
 whist-0.009
 asking-0.009
 Potato-0.009
Token and
Token ID290
Feature activation-2.24e-03

Pos logit contributions
tted+0.007
bridge+0.006
umbs+0.006
quet+0.006
atha+0.006
Neg logit contributions
 whist-0.003
 praying-0.003
 shaving-0.003
 asking-0.003
 Potato-0.003
Token have
Token ID423
Feature activation-5.23e-05

Pos logit contributions
 shaving+0.004
 praying+0.004
 whist+0.004
 Potato+0.004
 asking+0.004
Neg logit contributions
tted-0.003
atha-0.003
bridge-0.003
umbs-0.003
quet-0.002
Token pic
Token ID8301
Feature activation+6.42e-02

Pos logit contributions
 whist+0.000
 shaving+0.000
 praying+0.000
 asking+0.000
 being+0.000
Neg logit contributions
tted-0.000
quet-0.000
igsaw-0.000
bridge-0.000
okingly-0.000
Tokenn
Token ID77
Feature activation+2.85e-02

Pos logit contributions
tted+0.105
bridge+0.088
umbs+0.088
quet+0.084
okingly+0.084
Neg logit contributions
 shaving-0.102
 whist-0.102
 praying-0.101
 asking-0.094
 Potato-0.088
Tokenics
Token ID873
Feature activation+6.04e-03

Pos logit contributions
tted+0.044
igsaw+0.037
fy+0.036
quet+0.036
okingly+0.035
Neg logit contributions
 whist-0.048
 praying-0.047
 shaving-0.046
 asking-0.043
 being-0.041
Token.
Token ID13
Feature activation-1.64e-03

Pos logit contributions
tted+0.013
umbs+0.011
bridge+0.011
igsaw+0.011
quet+0.011
Neg logit contributions
 whist-0.007
 praying-0.007
 shaving-0.007
 asking-0.006
 Potato-0.006
Token One
Token ID1881
Feature activation+6.92e-04

Pos logit contributions
 shaving+0.003
 praying+0.003
 whist+0.003
 temperature+0.003
 razor+0.003
Neg logit contributions
tted-0.002
 Sue-0.002
umbs-0.002
aney-0.002
ay-0.002
Token day
Token ID1110
Feature activation-1.69e-03

Pos logit contributions
tted+0.001
umbs+0.001
atha+0.001
quet+0.001
bridge+0.001
Neg logit contributions
 shaving-0.001
 praying-0.001
 whist-0.001
 asking-0.001
 Potato-0.001
Token to
Token ID284
Feature activation+1.39e-02

Pos logit contributions
tted+0.050
quet+0.039
atha+0.039
umbs+0.039
hell+0.038
Neg logit contributions
 shaving-0.055
 praying-0.053
 whist-0.052
 Potato-0.050
 asking-0.049
Token be
Token ID307
Feature activation+1.82e-02

Pos logit contributions
tted+0.026
umbs+0.022
bridge+0.022
quet+0.022
atha+0.021
Neg logit contributions
 shaving-0.019
 whist-0.019
 praying-0.019
 asking-0.018
 Potato-0.017
Token careful
Token ID8161
Feature activation-1.91e-02

Pos logit contributions
tted+0.038
igsaw+0.035
umbs+0.035
quet+0.034
bridge+0.033
Neg logit contributions
 whist-0.022
 praying-0.021
 shaving-0.021
 asking-0.020
 being-0.019
Token when
Token ID618
Feature activation+1.02e-02

Pos logit contributions
 shaving+0.024
 praying+0.024
 whist+0.024
 Potato+0.021
 asking+0.021
Neg logit contributions
tted-0.039
umbs-0.032
bridge-0.032
quet-0.032
atha-0.032
Token having
Token ID1719
Feature activation+2.09e-02

Pos logit contributions
tted+0.015
 queens+0.013
adders+0.013
 Pam+0.013
bridge+0.013
Neg logit contributions
 shaving-0.015
 whist-0.015
 Potato-0.015
 praying-0.014
 grandson-0.014
Token pic
Token ID8301
Feature activation+6.37e-02

Pos logit contributions
tted+0.037
umbs+0.031
bridge+0.030
quet+0.030
atha+0.029
Neg logit contributions
 shaving-0.031
 praying-0.031
 whist-0.031
 asking-0.028
 Potato-0.028
Tokenn
Token ID77
Feature activation+2.51e-02

Pos logit contributions
tted+0.098
umbs+0.077
quet+0.076
bridge+0.075
atha+0.075
Neg logit contributions
 shaving-0.113
 praying-0.112
 whist-0.111
 asking-0.103
 Potato-0.102
Tokenics
Token ID873
Feature activation+7.54e-03

Pos logit contributions
tted+0.038
bridge+0.029
umbs+0.029
quet+0.029
atha+0.029
Neg logit contributions
 shaving-0.045
 praying-0.044
 whist-0.044
 asking-0.041
 Potato-0.040
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
tted+0.016
umbs+0.014
bridge+0.013
quet+0.013
atha+0.013
Neg logit contributions
 shaving-0.009
 praying-0.009
 whist-0.009
 asking-0.008
 Potato-0.007
Token not
Token ID407
Feature activation-2.98e-03

Pos logit contributions
tted+0.022
quet+0.018
 queens+0.018
atha+0.018
umbs+0.018
Neg logit contributions
 shaving-0.014
 whist-0.013
 praying-0.013
 Potato-0.013
 Brother-0.012
Token to
Token ID284
Feature activation+1.43e-02

Pos logit contributions
 shaving+0.004
 praying+0.004
 whist+0.004
 asking+0.004
 Potato+0.004
Neg logit contributions
tted-0.005
umbs-0.004
quet-0.004
bridge-0.004
atha-0.004
Token<|endoftext|>
Token ID50256
Feature activation-2.38e-02

Pos logit contributions
tted+0.009
quet+0.007
bridge+0.007
umbs+0.007
atha+0.007
Neg logit contributions
 shaving-0.006
 whist-0.005
 praying-0.005
 Potato-0.005
 asking-0.005
TokenTom
Token ID13787
Feature activation+1.50e-03

Pos logit contributions
 praying+0.026
 shaving+0.023
 whist+0.023
 pap+0.023
 asking+0.022
Neg logit contributions
tted-0.055
umbs-0.048
bridge-0.045
ives-0.045
ay-0.045
Tokenmy
Token ID1820
Feature activation+4.01e-03

Pos logit contributions
tted+0.003
atha+0.002
umbs+0.002
okingly+0.002
atters+0.002
Neg logit contributions
 praying-0.002
 shaving-0.002
 whist-0.002
 asking-0.002
 Potato-0.002
Token loved
Token ID6151
Feature activation-1.04e-02

Pos logit contributions
tted+0.008
atha+0.007
bridge+0.007
atters+0.006
okingly+0.006
Neg logit contributions
 shaving-0.005
 praying-0.005
 whist-0.005
 asking-0.004
 Potato-0.004
Token his
Token ID465
Feature activation+4.66e-04

Pos logit contributions
 praying+0.015
 shaving+0.015
 whist+0.014
 Potato+0.014
ise+0.014
Neg logit contributions
tted-0.015
 queens-0.014
quet-0.014
 chains-0.014
bridge-0.013
Token sc
Token ID629
Feature activation+6.31e-02

Pos logit contributions
tted+0.001
igsaw+0.001
quet+0.001
umbs+0.001
bridge+0.001
Neg logit contributions
 praying-0.001
 shaving-0.001
 whist-0.001
 asking-0.001
 smoking-0.001
Tokenooter
Token ID25141
Feature activation+3.41e-02

Pos logit contributions
bridge+0.095
tted+0.090
There+0.079
 queens+0.078
drops+0.076
Neg logit contributions
 shaving-0.102
 grandson-0.096
 whist-0.094
 praying-0.093
 grandfather-0.089
Token.
Token ID13
Feature activation+3.57e-02

Pos logit contributions
tted+0.078
umbs+0.067
bridge+0.067
quet+0.066
atha+0.066
Neg logit contributions
 shaving-0.034
 whist-0.034
 praying-0.034
 asking-0.029
 Potato-0.028
Token He
Token ID679
Feature activation+2.07e-04

Pos logit contributions
tted+0.071
quet+0.062
umbs+0.060
atha+0.060
bridge+0.059
Neg logit contributions
 shaving-0.046
 whist-0.044
 praying-0.043
 Potato-0.040
 asking-0.040
Token liked
Token ID8288
Feature activation-4.45e-03

Pos logit contributions
tted+0.000
bridge+0.000
umbs+0.000
atha+0.000
quet+0.000
Neg logit contributions
 shaving-0.000
 praying-0.000
 whist-0.000
 asking-0.000
 Potato-0.000
Token to
Token ID284
Feature activation+9.47e-04

Pos logit contributions
 shaving+0.006
 praying+0.006
 whist+0.006
 asking+0.005
 Potato+0.005
Neg logit contributions
tted-0.009
umbs-0.008
bridge-0.008
quet-0.008
atha-0.007
Token day
Token ID1110
Feature activation-5.67e-04

Pos logit contributions
tted+0.011
umbs+0.009
hell+0.009
igsaw+0.009
atha+0.009
Neg logit contributions
 shaving-0.010
 praying-0.009
 whist-0.009
 asking-0.009
 Potato-0.009
Token,
Token ID11
Feature activation+3.56e-02

Pos logit contributions
 whist+0.001
 praying+0.001
 shaving+0.001
 Potato+0.001
 asking+0.001
Neg logit contributions
tted-0.001
umbs-0.001
okingly-0.001
hell-0.001
adders-0.001
Token a
Token ID257
Feature activation-4.19e-03

Pos logit contributions
tted+0.077
umbs+0.066
quet+0.066
bridge+0.066
atha+0.064
Neg logit contributions
 shaving-0.041
 whist-0.041
 praying-0.040
 asking-0.035
 Potato-0.034
Token little
Token ID1310
Feature activation+1.81e-02

Pos logit contributions
 praying+0.007
 shaving+0.007
 asking+0.007
 whist+0.007
 temperature+0.006
Neg logit contributions
tted-0.006
hell-0.005
igsaw-0.005
atha-0.005
fy-0.005
Token boy
Token ID2933
Feature activation+1.03e-02

Pos logit contributions
tted+0.037
umbs+0.031
quet+0.031
bridge+0.031
atha+0.030
Neg logit contributions
 shaving-0.023
 praying-0.022
 whist-0.022
 asking-0.020
 Potato-0.019
Token named
Token ID3706
Feature activation+6.27e-02

Pos logit contributions
tted+0.019
umbs+0.016
quet+0.015
bridge+0.015
atha+0.015
Neg logit contributions
 shaving-0.016
 whist-0.016
 praying-0.015
 asking-0.014
 Potato-0.014
Token Ben
Token ID3932
Feature activation+2.51e-03

Pos logit contributions
tted+0.120
quet+0.109
bridge+0.107
umbs+0.102
atha+0.100
Neg logit contributions
 shaving-0.088
 whist-0.083
 praying-0.081
 asking-0.078
 Potato-0.077
Token wanted
Token ID2227
Feature activation+1.47e-03

Pos logit contributions
tted+0.005
umbs+0.004
bridge+0.004
atha+0.004
hell+0.004
Neg logit contributions
 shaving-0.004
 praying-0.003
 whist-0.003
 asking-0.003
 Potato-0.003
Token to
Token ID284
Feature activation+8.34e-03

Pos logit contributions
 of+0.001
tted+0.001
 Sue+0.001
 frame+0.001
 queens+0.001
Neg logit contributions
ise-0.003
ica-0.003
 praying-0.003
 Amen-0.003
 Man-0.003
Token play
Token ID711
Feature activation+1.68e-02

Pos logit contributions
tted+0.016
umbs+0.014
bridge+0.014
hell+0.013
fy+0.013
Neg logit contributions
 shaving-0.012
 whist-0.012
 praying-0.011
 razor-0.010
 Potato-0.010
Token with
Token ID351
Feature activation+1.48e-02

Pos logit contributions
tted+0.027
bridge+0.022
quet+0.022
 frame+0.020
hell+0.020
Neg logit contributions
 praying-0.028
 shaving-0.026
 whist-0.026
 pap-0.025
 Brother-0.025
Token<|endoftext|>
Token ID50256
Feature activation-1.83e-02

Pos logit contributions
tted+0.020
umbs+0.016
bridge+0.016
atha+0.016
quet+0.016
Neg logit contributions
 shaving-0.016
 praying-0.016
 whist-0.016
 asking-0.015
 Potato-0.014
TokenTom
Token ID13787
Feature activation+5.59e-03

Pos logit contributions
 praying+0.020
 pap+0.019
 whist+0.018
 asking+0.018
 shaving+0.018
Neg logit contributions
tted-0.042
umbs-0.036
ay-0.035
ives-0.034
bridge-0.034
Tokenmy
Token ID1820
Feature activation+1.01e-02

Pos logit contributions
tted+0.010
umbs+0.009
atha+0.008
atters+0.008
ives+0.008
Neg logit contributions
 praying-0.008
 shaving-0.008
 whist-0.007
 asking-0.007
 Potato-0.007
Token loved
Token ID6151
Feature activation-1.03e-02

Pos logit contributions
tted+0.020
bridge+0.017
atha+0.016
atters+0.016
okingly+0.016
Neg logit contributions
 shaving-0.012
 praying-0.012
 whist-0.011
 Potato-0.011
 asking-0.011
Token his
Token ID465
Feature activation+6.64e-03

Pos logit contributions
 praying+0.014
 shaving+0.014
 whist+0.014
 Potato+0.013
 asking+0.013
Neg logit contributions
tted-0.017
quet-0.015
umbs-0.015
 queens-0.015
okingly-0.015
Token sc
Token ID629
Feature activation+6.25e-02

Pos logit contributions
tted+0.016
umbs+0.014
bridge+0.014
quet+0.013
atha+0.013
Neg logit contributions
 shaving-0.006
 praying-0.006
 whist-0.006
 asking-0.005
 Potato-0.005
Tokenooter
Token ID25141
Feature activation+3.50e-02

Pos logit contributions
bridge+0.092
tted+0.086
There+0.077
umbs+0.075
 queens+0.074
Neg logit contributions
 shaving-0.101
 grandson-0.099
 whist-0.092
 praying-0.090
 grandfather-0.086
Token.
Token ID13
Feature activation+3.45e-02

Pos logit contributions
tted+0.081
umbs+0.070
bridge+0.069
atha+0.069
quet+0.069
Neg logit contributions
 whist-0.034
 shaving-0.034
 praying-0.034
 asking-0.029
 Potato-0.027
Token He
Token ID679
Feature activation+1.48e-03

Pos logit contributions
tted+0.074
quet+0.064
atha+0.063
adders+0.061
vell+0.061
Neg logit contributions
 shaving-0.039
 whist-0.038
 Potato-0.037
 praying-0.037
 Brother-0.037
Token rode
Token ID22075
Feature activation+1.96e-02

Pos logit contributions
tted+0.003
umbs+0.003
quet+0.003
bridge+0.003
atha+0.003
Neg logit contributions
 whist-0.001
 shaving-0.001
 praying-0.001
 asking-0.001
 Potato-0.001
Token it
Token ID340
Feature activation+7.35e-03

Pos logit contributions
tted+0.044
umbs+0.041
quet+0.040
igsaw+0.038
adders+0.038
Neg logit contributions
 whist-0.022
 shaving-0.019
 asking-0.018
 praying-0.018
 being-0.016
Token his
Token ID465
Feature activation-9.91e-03

Pos logit contributions
 shaving+0.030
 praying+0.030
 whist+0.029
 Brother+0.028
 Potato+0.028
Neg logit contributions
tted-0.029
bridge-0.024
igator-0.024
quet-0.022
 Pam-0.022
Token grandfather
Token ID17695
Feature activation-1.07e-02

Pos logit contributions
 shaving+0.013
 praying+0.013
 whist+0.013
 asking+0.012
 Potato+0.011
Neg logit contributions
tted-0.020
umbs-0.016
quet-0.016
bridge-0.016
atha-0.016
Token.
Token ID13
Feature activation-1.78e-02

Pos logit contributions
 shaving+0.012
 whist+0.012
 praying+0.012
 Potato+0.011
 Brother+0.010
Neg logit contributions
tted-0.022
quet-0.019
umbs-0.019
atha-0.019
bridge-0.018
Token He
Token ID679
Feature activation-1.67e-02

Pos logit contributions
 praying+0.029
 shaving+0.028
 whist+0.028
 temperature+0.026
 razor+0.025
Neg logit contributions
tted-0.030
 Pam-0.025
 Sue-0.023
atha-0.022
ives-0.022
Token asked
Token ID1965
Feature activation-2.31e-02

Pos logit contributions
 shaving+0.022
 praying+0.020
 Potato+0.019
 temperature+0.018
 Brother+0.018
Neg logit contributions
tted-0.033
atha-0.028
 Pam-0.027
quet-0.026
Tweet-0.026
Token:
Token ID25
Feature activation-7.80e-02

Pos logit contributions
 shaving+0.031
 whist+0.031
 praying+0.030
 Potato+0.027
 Brother+0.027
Neg logit contributions
tted-0.043
adders-0.036
bridge-0.036
 Pam-0.036
hell-0.036
Token "
Token ID366
Feature activation-1.67e-02

Pos logit contributions
 temperature+0.113
 shaving+0.108
 whist+0.106
 grandson+0.101
 Brother+0.101
Neg logit contributions
tted-0.126
 Pam-0.107
 Sue-0.107
ill-0.098
 of-0.095
TokenWell
Token ID5779
Feature activation-6.74e-03

Pos logit contributions
 shaving+0.022
 praying+0.022
 whist+0.022
 Brother+0.019
 asking+0.018
Neg logit contributions
tted-0.032
bridge-0.027
quet-0.027
atha-0.027
umbs-0.026
Token,
Token ID11
Feature activation-1.58e-02

Pos logit contributions
 praying+0.007
 whist+0.007
 shaving+0.007
 asking+0.006
 being+0.005
Neg logit contributions
tted-0.016
umbs-0.014
quet-0.014
bridge-0.013
fy-0.013
Token what
Token ID644
Feature activation-4.47e-02

Pos logit contributions
 praying+0.019
 whist+0.019
 shaving+0.018
 asking+0.017
 Potato+0.016
Neg logit contributions
tted-0.034
umbs-0.030
quet-0.029
bridge-0.029
atha-0.028
Token do
Token ID466
Feature activation+3.54e-03

Pos logit contributions
 shaving+0.070
 praying+0.068
 whist+0.066
 asking+0.064
 Potato+0.062
Neg logit contributions
tted-0.074
bridge-0.061
adders-0.061
hell-0.061
umbs-0.060
Token iron
Token ID6953
Feature activation-2.00e-02

Pos logit contributions
 whist+0.001
 shaving+0.001
 praying+0.001
 Potato+0.001
 pap+0.001
Neg logit contributions
tted-0.003
quet-0.002
umbs-0.002
bridge-0.002
atha-0.002
Tokening
Token ID278
Feature activation-2.67e-02

Pos logit contributions
 praying+0.028
 shaving+0.028
 whist+0.027
 asking+0.025
 Potato+0.025
Neg logit contributions
tted-0.037
bridge-0.031
quet-0.030
umbs-0.030
atha-0.029
Token clothes
Token ID8242
Feature activation+6.57e-03

Pos logit contributions
 shaving+0.035
 praying+0.035
 whist+0.035
 asking+0.031
 Potato+0.031
Neg logit contributions
tted-0.052
umbs-0.043
bridge-0.043
quet-0.043
atha-0.043
Token and
Token ID290
Feature activation-1.97e-02

Pos logit contributions
tted+0.013
umbs+0.011
bridge+0.011
quet+0.011
atha+0.011
Neg logit contributions
 shaving-0.008
 praying-0.008
 whist-0.008
 asking-0.007
 Potato-0.007
Token asked
Token ID1965
Feature activation-2.72e-02

Pos logit contributions
 shaving+0.025
 praying+0.024
 whist+0.024
 Potato+0.022
 temperature+0.021
Neg logit contributions
tted-0.037
bridge-0.032
quet-0.031
atha-0.031
umbs-0.031
Token nicely
Token ID16576
Feature activation-7.68e-02

Pos logit contributions
 whist+0.033
 shaving+0.033
 praying+0.031
 Potato+0.028
 Brother+0.027
Neg logit contributions
tted-0.053
adders-0.047
bridge-0.047
 Pam-0.046
quet-0.045
Token,
Token ID11
Feature activation-4.40e-02

Pos logit contributions
 whist+0.118
 shaving+0.115
 praying+0.109
 Potato+0.101
 Brother+0.101
Neg logit contributions
tted-0.111
igsaw-0.101
 of-0.100
ill-0.099
hell-0.098
Token "
Token ID366
Feature activation+2.52e-03

Pos logit contributions
 shaving+0.048
 whist+0.047
 temperature+0.045
 praying+0.044
 Potato+0.042
Neg logit contributions
tted-0.080
 Pam-0.072
 Sue-0.070
 of-0.069
bridge-0.067
TokenMom
Token ID29252
Feature activation-8.73e-03

Pos logit contributions
tted+0.005
bridge+0.005
umbs+0.004
quet+0.004
atha+0.004
Neg logit contributions
 praying-0.003
 whist-0.003
 shaving-0.003
 asking-0.003
 Potato-0.002
Token,
Token ID11
Feature activation-2.46e-02

Pos logit contributions
 whist+0.012
 shaving+0.012
 praying+0.011
 Potato+0.010
 asking+0.010
Neg logit contributions
tted-0.016
umbs-0.013
bridge-0.013
atha-0.013
quet-0.013
Token can
Token ID460
Feature activation-5.77e-04

Pos logit contributions
 shaving+0.034
 praying+0.034
 whist+0.034
 asking+0.031
 Potato+0.030
Neg logit contributions
tted-0.047
umbs-0.039
bridge-0.039
quet-0.038
atha-0.038
Token daughter
Token ID4957
Feature activation-2.90e-03

Pos logit contributions
 shaving+0.023
 whist+0.022
 praying+0.020
 Potato+0.019
 smoking+0.019
Neg logit contributions
tted-0.034
atha-0.030
 frame-0.029
quet-0.029
fy-0.029
Token was
Token ID373
Feature activation-2.75e-02

Pos logit contributions
 shaving+0.004
 praying+0.004
 Potato+0.003
 smoking+0.003
 whist+0.003
Neg logit contributions
tted-0.005
atha-0.005
aney-0.005
okingly-0.004
bridge-0.004
Token curious
Token ID11040
Feature activation+1.39e-03

Pos logit contributions
 Potato+0.050
 shaving+0.048
 razor+0.048
 Brother+0.048
 pistol+0.047
Neg logit contributions
tted-0.038
atha-0.028
 free-0.028
adders-0.027
bridge-0.027
Token and
Token ID290
Feature activation-1.78e-03

Pos logit contributions
 of+0.002
tted+0.002
fy+0.002
hell+0.002
ets+0.002
Neg logit contributions
 shaving-0.002
 Potato-0.002
 whist-0.002
 praying-0.002
 pap-0.002
Token asked
Token ID1965
Feature activation-2.04e-02

Pos logit contributions
 shaving+0.002
 praying+0.002
 whist+0.002
 Potato+0.002
 smoking+0.002
Neg logit contributions
tted-0.004
atha-0.003
quet-0.003
bridge-0.003
okingly-0.003
Token:
Token ID25
Feature activation-7.53e-02

Pos logit contributions
 whist+0.036
 shaving+0.034
rob+0.033
 Potato+0.033
 razor+0.033
Neg logit contributions
 of-0.026
tted-0.023
 Sue-0.023
 Pam-0.022
adders-0.022
Token "
Token ID366
Feature activation-8.51e-03

Pos logit contributions
 temperature+0.127
ise+0.120
 shaving+0.119
pa+0.119
 lung+0.119
Neg logit contributions
tted-0.093
 Sue-0.084
 of-0.079
 Pam-0.079
There-0.075
TokenWhat
Token ID2061
Feature activation-3.00e-02

Pos logit contributions
 whist+0.012
 shaving+0.012
 praying+0.011
 asking+0.010
 temperature+0.010
Neg logit contributions
tted-0.015
bridge-0.014
There-0.013
atha-0.013
quet-0.013
Token does
Token ID857
Feature activation+1.74e-02

Pos logit contributions
 asking+0.047
 Brother+0.047
 shaving+0.046
 temperature+0.046
ica+0.045
Neg logit contributions
 of-0.040
tted-0.038
adders-0.038
 belong-0.037
hell-0.036
Token he
Token ID339
Feature activation-5.49e-03

Pos logit contributions
tted+0.028
bridge+0.024
igsaw+0.023
igator+0.023
quet+0.023
Neg logit contributions
 shaving-0.025
 whist-0.025
 praying-0.024
 asking-0.023
 Potato-0.023
Token record
Token ID1700
Feature activation-9.40e-03

Pos logit contributions
 Brother+0.009
 Potato+0.008
 shaving+0.008
 praying+0.008
 whist+0.008
Neg logit contributions
tted-0.008
 belong-0.006
atha-0.006
bridge-0.006
ives-0.006
Token up
Token ID510
Feature activation-2.94e-02

Pos logit contributions
tted+0.002
umbs+0.002
bridge+0.002
quet+0.002
atha+0.002
Neg logit contributions
 shaving-0.002
 praying-0.002
 whist-0.002
 asking-0.001
 Potato-0.001
Token to
Token ID284
Feature activation-7.55e-03

Pos logit contributions
 shaving+0.037
 praying+0.037
 whist+0.037
 asking+0.033
 Potato+0.032
Neg logit contributions
tted-0.060
umbs-0.050
bridge-0.050
quet-0.050
atha-0.049
Token him
Token ID683
Feature activation-3.53e-02

Pos logit contributions
 shaving+0.012
 praying+0.012
 whist+0.011
 Potato+0.011
 temperature+0.011
Neg logit contributions
tted-0.011
 Sue-0.010
 Pam-0.009
atha-0.009
ollen-0.009
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
 shaving+0.059
 praying+0.055
 Brother+0.055
 Potato+0.055
 whist+0.055
Neg logit contributions
tted-0.047
quet-0.039
 of-0.038
hell-0.037
 free-0.037
Token asked
Token ID1965
Feature activation-2.98e-02

Pos logit contributions
 shaving+0.023
 praying+0.021
 temperature+0.021
 Potato+0.019
 lung+0.019
Neg logit contributions
tted-0.030
atha-0.027
 Sue-0.027
 Pam-0.027
igator-0.026
Token:
Token ID25
Feature activation-7.46e-02

Pos logit contributions
 shaving+0.056
 whist+0.055
 praying+0.052
 recorder+0.051
 Potato+0.051
Neg logit contributions
 Sue-0.033
 of-0.031
tted-0.029
hell-0.029
 Pam-0.029
Token "
Token ID366
Feature activation-1.40e-02

Pos logit contributions
 temperature+0.112
ise+0.103
 shaving+0.102
pa+0.101
 whist+0.100
Neg logit contributions
tted-0.115
 Sue-0.103
 Pam-0.098
ill-0.095
 of-0.093
TokenWhy
Token ID5195
Feature activation-4.56e-02

Pos logit contributions
 praying+0.020
 shaving+0.019
 whist+0.019
 temperature+0.017
 asking+0.016
Neg logit contributions
tted-0.025
bridge-0.022
quet-0.022
 queens-0.021
There-0.021
Token are
Token ID389
Feature activation+1.39e-02

Pos logit contributions
 shaving+0.101
 praying+0.100
 asking+0.098
 whist+0.097
 temperature+0.094
Neg logit contributions
tted-0.043
hell-0.032
bridge-0.029
adders-0.029
atha-0.028
Token you
Token ID345
Feature activation-1.81e-02

Pos logit contributions
tted+0.025
umbs+0.020
bridge+0.020
quet+0.020
atha+0.020
Neg logit contributions
 shaving-0.021
 praying-0.021
 whist-0.021
 asking-0.019
 Potato-0.019
Token cl
Token ID537
Feature activation+6.62e-03

Pos logit contributions
 shaving+0.021
 praying+0.020
 whist+0.019
 temperature+0.019
 pap+0.018
Neg logit contributions
tted-0.032
atha-0.029
 Pam-0.028
umbs-0.028
aney-0.028
Token
Token ID198
Feature activation+6.65e-05

Pos logit contributions
 whist+0.026
 shaving+0.026
 praying+0.025
 razor+0.023
 Potato+0.022
Neg logit contributions
tted-0.029
umbs-0.026
 Sue-0.024
aney-0.024
 Pam-0.024
Token
Token ID198
Feature activation+1.10e-02

Pos logit contributions
tted+0.000
ollen+0.000
ay+0.000
umbs+0.000
Tweet+0.000
Neg logit contributions
 praying-0.000
 shaving-0.000
 whist-0.000
 pap-0.000
 Potato-0.000
TokenThe
Token ID464
Feature activation-1.22e-02

Pos logit contributions
tted+0.014
quet+0.011
atha+0.011
umbs+0.010
ollen+0.010
Neg logit contributions
 shaving-0.022
 praying-0.022
 whist-0.022
 Potato-0.020
 asking-0.020
Token mother
Token ID2802
Feature activation+3.88e-03

Pos logit contributions
 whist+0.015
 shaving+0.015
 praying+0.014
 Potato+0.013
 asking+0.012
Neg logit contributions
tted-0.024
atha-0.020
quet-0.020
fy-0.020
 queens-0.020
Token asked
Token ID1965
Feature activation-1.59e-02

Pos logit contributions
tted+0.008
quet+0.007
umbs+0.007
atha+0.007
bridge+0.007
Neg logit contributions
 shaving-0.004
 praying-0.004
 whist-0.004
 Potato-0.004
 asking-0.004
Token:
Token ID25
Feature activation-7.23e-02

Pos logit contributions
 whist+0.027
 shaving+0.027
 praying+0.026
 Potato+0.026
 razor+0.025
Neg logit contributions
tted-0.024
 Sue-0.019
adders-0.019
aney-0.018
 of-0.018
Token "
Token ID366
Feature activation-1.11e-02

Pos logit contributions
 temperature+0.112
 shaving+0.108
pa+0.108
ise+0.107
 whist+0.107
Neg logit contributions
tted-0.103
 Sue-0.093
 Pam-0.087
 of-0.082
There-0.081
TokenWhy
Token ID5195
Feature activation-4.09e-02

Pos logit contributions
 praying+0.015
 shaving+0.015
 whist+0.014
 Potato+0.013
 pap+0.013
Neg logit contributions
tted-0.020
There-0.018
atha-0.017
bridge-0.016
 queens-0.016
Token are
Token ID389
Feature activation+9.65e-03

Pos logit contributions
 shaving+0.077
 whist+0.073
 asking+0.071
 praying+0.071
 Potato+0.069
Neg logit contributions
tted-0.048
adders-0.041
hell-0.040
atha-0.038
ives-0.038
Token you
Token ID345
Feature activation-1.70e-02

Pos logit contributions
tted+0.017
umbs+0.014
bridge+0.014
quet+0.014
atha+0.013
Neg logit contributions
 shaving-0.015
 praying-0.015
 whist-0.015
 asking-0.014
 Potato-0.013
Token sad
Token ID6507
Feature activation+3.37e-03

Pos logit contributions
 shaving+0.023
 Man+0.021
 praying+0.021
 whist+0.021
 Potato+0.021
Neg logit contributions
tted-0.027
umbs-0.024
atha-0.024
aney-0.023
 frame-0.023
Token to
Token ID284
Feature activation+5.55e-03

Pos logit contributions
 shaving+0.038
 praying+0.036
 Potato+0.035
 whist+0.034
 Brother+0.034
Neg logit contributions
tted-0.035
quet-0.029
okingly-0.029
umbs-0.029
 chains-0.029
Token the
Token ID262
Feature activation-1.17e-02

Pos logit contributions
tted+0.010
umbs+0.008
bridge+0.008
quet+0.008
atha+0.008
Neg logit contributions
 shaving-0.009
 praying-0.009
 whist-0.009
 asking-0.008
 Potato-0.008
Token flower
Token ID15061
Feature activation-2.04e-02

Pos logit contributions
 shaving+0.021
 praying+0.021
 whist+0.021
 asking+0.019
 Potato+0.019
Neg logit contributions
tted-0.018
umbs-0.014
bridge-0.014
quet-0.014
atha-0.014
Token and
Token ID290
Feature activation-8.74e-03

Pos logit contributions
 shaving+0.025
 praying+0.024
 whist+0.024
 asking+0.022
 Potato+0.021
Neg logit contributions
tted-0.042
quet-0.035
umbs-0.035
bridge-0.035
atha-0.035
Token asked
Token ID1965
Feature activation-2.02e-02

Pos logit contributions
 shaving+0.011
 praying+0.011
 whist+0.011
 asking+0.009
 Potato+0.009
Neg logit contributions
tted-0.018
umbs-0.015
bridge-0.015
quet-0.015
atha-0.015
Token:
Token ID25
Feature activation-7.21e-02

Pos logit contributions
 shaving+0.031
 whist+0.030
 praying+0.030
 Potato+0.027
 cinema+0.026
Neg logit contributions
tted-0.034
 Pam-0.028
 Sue-0.028
adders-0.028
okingly-0.027
Token "
Token ID366
Feature activation-7.57e-03

Pos logit contributions
 temperature+0.106
 shaving+0.101
pa+0.099
ise+0.098
 whist+0.097
Neg logit contributions
tted-0.116
 Sue-0.100
 Pam-0.099
ill-0.093
aney-0.093
TokenHello
Token ID15496
Feature activation-1.43e-02

Pos logit contributions
 shaving+0.009
 praying+0.008
 whist+0.008
 asking+0.007
 Potato+0.007
Neg logit contributions
tted-0.016
umbs-0.014
bridge-0.014
quet-0.014
atha-0.014
Token flower
Token ID15061
Feature activation-2.97e-02

Pos logit contributions
 praying+0.018
 shaving+0.018
 whist+0.018
 asking+0.016
 Potato+0.015
Neg logit contributions
tted-0.030
umbs-0.025
bridge-0.025
atha-0.024
quet-0.024
Token,
Token ID11
Feature activation-2.45e-02

Pos logit contributions
 shaving+0.035
 praying+0.035
 whist+0.034
 asking+0.031
 Potato+0.030
Neg logit contributions
tted-0.062
umbs-0.053
bridge-0.053
quet-0.052
atha-0.052
Token what
Token ID644
Feature activation-4.83e-02

Pos logit contributions
 praying+0.034
 whist+0.033
 shaving+0.033
 asking+0.030
 Potato+0.030
Neg logit contributions
tted-0.049
umbs-0.041
bridge-0.041
atha-0.040
quet-0.039
Token the
Token ID262
Feature activation-4.16e-03

Pos logit contributions
tted+0.016
umbs+0.014
bridge+0.014
quet+0.014
atha+0.014
Neg logit contributions
 shaving-0.010
 praying-0.010
 whist-0.009
 asking-0.008
 Potato-0.008
Token bear
Token ID6842
Feature activation-4.39e-03

Pos logit contributions
 shaving+0.006
 praying+0.006
 whist+0.006
 asking+0.005
 Potato+0.005
Neg logit contributions
tted-0.008
umbs-0.007
bridge-0.007
quet-0.006
atha-0.006
Token because
Token ID780
Feature activation+3.77e-03

Pos logit contributions
 whist+0.004
 praying+0.004
 shaving+0.004
 asking+0.003
 Potato+0.003
Neg logit contributions
tted-0.011
bridge-0.009
atha-0.009
umbs-0.009
quet-0.009
Token he
Token ID339
Feature activation-2.05e-02

Pos logit contributions
tted+0.008
umbs+0.007
atha+0.007
quet+0.007
bridge+0.006
Neg logit contributions
 praying-0.005
 whist-0.005
 shaving-0.005
 asking-0.004
 smoking-0.004
Token asked
Token ID1965
Feature activation-1.82e-02

Pos logit contributions
 shaving+0.025
 praying+0.024
 whist+0.023
 asking+0.021
 Potato+0.021
Neg logit contributions
tted-0.043
atha-0.036
umbs-0.036
bridge-0.036
quet-0.035
Token nicely
Token ID16576
Feature activation-7.20e-02

Pos logit contributions
 shaving+0.022
 whist+0.022
 praying+0.021
 Potato+0.019
 asking+0.018
Neg logit contributions
tted-0.037
bridge-0.031
umbs-0.031
adders-0.031
atha-0.031
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
 shaving+0.106
 whist+0.105
 praying+0.100
 Brother+0.091
 Potato+0.090
Neg logit contributions
tted-0.120
adders-0.101
hell-0.101
ives-0.101
aney-0.100
Token The
Token ID383
Feature activation-6.13e-03

Pos logit contributions
 shaving+0.028
 praying+0.028
 whist+0.027
 asking+0.024
 temperature+0.023
Neg logit contributions
tted-0.041
umbs-0.034
atha-0.033
quet-0.033
okingly-0.033
Token bear
Token ID6842
Feature activation-1.36e-02

Pos logit contributions
 shaving+0.009
 praying+0.008
 whist+0.008
 asking+0.008
 Potato+0.008
Neg logit contributions
tted-0.011
bridge-0.009
umbs-0.009
quet-0.009
hell-0.009
Token learned
Token ID4499
Feature activation+1.99e-02

Pos logit contributions
 shaving+0.013
 praying+0.012
 whist+0.012
 asking+0.010
 Potato+0.010
Neg logit contributions
tted-0.032
umbs-0.028
bridge-0.027
quet-0.027
atha-0.027
Token that
Token ID326
Feature activation-5.35e-03

Pos logit contributions
tted+0.034
quet+0.028
bridge+0.027
umbs+0.027
hell+0.027
Neg logit contributions
 shaving-0.031
 praying-0.030
 whist-0.030
 Potato-0.028
 asking-0.027
TokenM
Token ID44
Feature activation+1.67e-03

Pos logit contributions
tted+0.013
umbs+0.011
bridge+0.011
quet+0.011
atha+0.011
Neg logit contributions
 shaving-0.009
 praying-0.009
 whist-0.009
 asking-0.008
 Potato-0.008
Tokenummy
Token ID13513
Feature activation-1.92e-02

Pos logit contributions
tted+0.002
igsaw+0.002
okingly+0.002
ay+0.002
umbs+0.002
Neg logit contributions
 whist-0.003
 praying-0.003
 shaving-0.003
 asking-0.003
 Potato-0.003
Token smiled
Token ID13541
Feature activation+1.38e-03

Pos logit contributions
 shaving+0.024
 praying+0.023
 whist+0.021
 smoking+0.020
 Potato+0.020
Neg logit contributions
tted-0.035
aney-0.031
atha-0.031
ill-0.030
quet-0.030
Token and
Token ID290
Feature activation-4.76e-03

Pos logit contributions
tted+0.002
quet+0.002
bridge+0.002
umbs+0.002
okingly+0.002
Neg logit contributions
 shaving-0.002
 praying-0.002
 whist-0.002
 Potato-0.002
 asking-0.002
Token said
Token ID531
Feature activation+1.43e-02

Pos logit contributions
 shaving+0.005
 praying+0.005
 whist+0.005
 temperature+0.004
 Potato+0.004
Neg logit contributions
tted-0.010
quet-0.009
atha-0.008
 Pam-0.008
okingly-0.008
Token:
Token ID25
Feature activation-7.17e-02

Pos logit contributions
tted+0.028
quet+0.023
bridge+0.023
umbs+0.023
hell+0.023
Neg logit contributions
 shaving-0.019
 whist-0.019
 praying-0.018
 asking-0.017
 Potato-0.017
Token "
Token ID366
Feature activation+2.82e-03

Pos logit contributions
 temperature+0.098
 grandson+0.094
 shaving+0.092
 whist+0.091
pa+0.088
Neg logit contributions
tted-0.116
 Sue-0.098
 Pam-0.096
quet-0.096
aney-0.095
TokenThey
Token ID2990
Feature activation+4.50e-04

Pos logit contributions
tted+0.005
bridge+0.004
umbs+0.004
quet+0.004
atha+0.004
Neg logit contributions
 shaving-0.004
 whist-0.004
 praying-0.004
 Potato-0.004
 asking-0.004
Token are
Token ID389
Feature activation-6.96e-03

Pos logit contributions
tted+0.001
bridge+0.001
umbs+0.001
quet+0.001
igsaw+0.001
Neg logit contributions
 whist-0.001
 praying-0.001
 shaving-0.001
 asking-0.001
 being-0.001
Token scissors
Token ID41786
Feature activation-1.41e-02

Pos logit contributions
 shaving+0.009
 praying+0.009
 whist+0.009
 asking+0.009
 Potato+0.008
Neg logit contributions
tted-0.014
umbs-0.012
bridge-0.012
quet-0.012
atha-0.011
Token,
Token ID11
Feature activation-1.51e-02

Pos logit contributions
 praying+0.016
 whist+0.016
 shaving+0.016
 asking+0.014
 Potato+0.013
Neg logit contributions
tted-0.031
bridge-0.026
atha-0.026
umbs-0.026
quet-0.026
Token her
Token ID607
Feature activation-1.29e-02

Pos logit contributions
tted+0.012
umbs+0.010
bridge+0.010
quet+0.010
atha+0.009
Neg logit contributions
 shaving-0.009
 praying-0.009
 whist-0.009
 asking-0.008
 Potato-0.008
Token mom
Token ID1995
Feature activation-1.75e-03

Pos logit contributions
 whist+0.016
 praying+0.016
 shaving+0.016
 Potato+0.015
 asking+0.015
Neg logit contributions
tted-0.023
igsaw-0.021
 frame-0.020
umbs-0.020
quet-0.020
Tokenmy
Token ID1820
Feature activation-1.73e-02

Pos logit contributions
 shaving+0.002
 whist+0.002
 praying+0.002
 Potato+0.002
 asking+0.002
Neg logit contributions
tted-0.004
umbs-0.003
quet-0.003
bridge-0.003
atha-0.003
Token and
Token ID290
Feature activation-1.28e-02

Pos logit contributions
 shaving+0.022
 whist+0.021
 praying+0.021
 Potato+0.019
 pap+0.018
Neg logit contributions
tted-0.034
umbs-0.029
quet-0.029
atha-0.028
igsaw-0.028
Token asked
Token ID1965
Feature activation-2.20e-02

Pos logit contributions
 shaving+0.017
 cinema+0.016
 Potato+0.016
 temperature+0.015
 Tech+0.015
Neg logit contributions
atha-0.019
 Pam-0.018
 Sue-0.018
bs-0.017
tted-0.017
Token:
Token ID25
Feature activation-7.10e-02

Pos logit contributions
 whist+0.041
 shaving+0.039
 Potato+0.038
 Brother+0.036
 lung+0.035
Neg logit contributions
tted-0.026
adders-0.023
 Sue-0.023
 of-0.023
aney-0.022
Token "
Token ID366
Feature activation-3.19e-03

Pos logit contributions
 temperature+0.095
 whist+0.094
 shaving+0.089
 lung+0.088
pa+0.087
Neg logit contributions
tted-0.119
 Pam-0.103
 Sue-0.102
ill-0.100
aney-0.098
TokenIt
Token ID1026
Feature activation-2.50e-02

Pos logit contributions
 whist+0.004
 praying+0.004
 shaving+0.004
 pap+0.004
 Potato+0.004
Neg logit contributions
tted-0.006
bridge-0.005
quet-0.005
atha-0.005
umbs-0.005
Token looks
Token ID3073
Feature activation-5.03e-03

Pos logit contributions
 shaving+0.022
 whist+0.020
 praying+0.019
 Potato+0.017
 asking+0.016
Neg logit contributions
tted-0.060
umbs-0.052
atha-0.052
bridge-0.052
quet-0.052
Token so
Token ID523
Feature activation-2.18e-02

Pos logit contributions
 shaving+0.009
 praying+0.008
 whist+0.008
 asking+0.008
 Potato+0.008
Neg logit contributions
tted-0.008
umbs-0.006
bridge-0.006
quet-0.006
atha-0.006
Token tough
Token ID5802
Feature activation-7.64e-03

Pos logit contributions
 shaving+0.037
 praying+0.037
 whist+0.036
 asking+0.034
 Potato+0.033
Neg logit contributions
tted-0.035
umbs-0.028
bridge-0.028
quet-0.028
atha-0.027
Token
Token ID198
Feature activation-7.76e-03

Pos logit contributions
 shaving+0.029
 whist+0.029
 praying+0.028
 Potato+0.026
 razor+0.025
Neg logit contributions
tted-0.035
umbs-0.030
aney-0.029
atha-0.029
bridge-0.029
Token
Token ID198
Feature activation+3.36e-03

Pos logit contributions
 shaving+0.010
 whist+0.009
 praying+0.009
 Potato+0.009
 pap+0.009
Neg logit contributions
tted-0.016
atha-0.013
umbs-0.013
quet-0.013
ay-0.013
TokenMark
Token ID9704
Feature activation-1.92e-02

Pos logit contributions
tted+0.005
bridge+0.004
umbs+0.004
quet+0.004
atha+0.004
Neg logit contributions
 shaving-0.006
 whist-0.006
 praying-0.005
 asking-0.005
 Potato-0.005
Tokeny
Token ID88
Feature activation-1.44e-02

Pos logit contributions
 shaving+0.024
 praying+0.022
 whist+0.021
 Potato+0.020
 smoking+0.020
Neg logit contributions
tted-0.039
atha-0.034
aney-0.034
fy-0.033
umbs-0.033
Token asked
Token ID1965
Feature activation-1.23e-02

Pos logit contributions
 shaving+0.019
 praying+0.017
 Potato+0.016
 smoking+0.016
 whist+0.015
Neg logit contributions
tted-0.027
atha-0.025
aney-0.024
case-0.023
bridge-0.022
Token:
Token ID25
Feature activation-7.08e-02

Pos logit contributions
 whist+0.019
 shaving+0.018
 Potato+0.017
 praying+0.017
 pap+0.016
Neg logit contributions
tted-0.020
 of-0.017
 Sue-0.017
aney-0.016
adders-0.016
Token "
Token ID366
Feature activation-7.74e-03

Pos logit contributions
 temperature+0.109
ise+0.108
 shaving+0.106
 whist+0.104
 lung+0.099
Neg logit contributions
tted-0.100
 Sue-0.093
 of-0.084
aney-0.083
 Pam-0.081
TokenWhat
Token ID2061
Feature activation-3.66e-02

Pos logit contributions
 whist+0.010
 shaving+0.010
 praying+0.010
 Potato+0.009
 temperature+0.008
Neg logit contributions
tted-0.014
bridge-0.013
atha-0.013
quet-0.013
 queens-0.012
Token should
Token ID815
Feature activation+6.92e-03

Pos logit contributions
 shaving+0.051
 asking+0.051
 whist+0.049
 Brother+0.049
 temperature+0.048
Neg logit contributions
 of-0.055
tted-0.055
ay-0.052
adders-0.052
hell-0.052
Token I
Token ID314
Feature activation+3.47e-03

Pos logit contributions
tted+0.010
quet+0.008
bridge+0.008
atha+0.008
umbs+0.008
Neg logit contributions
 whist-0.012
 shaving-0.012
 praying-0.012
 asking-0.011
 Potato-0.011
Token bring
Token ID2222
Feature activation+9.36e-03

Pos logit contributions
tted+0.006
atha+0.005
bridge+0.005
quet+0.004
adders+0.004
Neg logit contributions
 whist-0.005
 shaving-0.005
 praying-0.005
 Potato-0.005
 Brother-0.005
TokenHer
Token ID9360
Feature activation-1.32e-03

Pos logit contributions
tted+0.008
umbs+0.007
bridge+0.007
adders+0.007
hell+0.006
Neg logit contributions
 shaving-0.005
 praying-0.005
 whist-0.005
 asking-0.005
 Potato-0.004
Token mum
Token ID25682
Feature activation-9.00e-04

Pos logit contributions
 shaving+0.002
 praying+0.002
 whist+0.001
 asking+0.001
 smoking+0.001
Neg logit contributions
tted-0.003
igsaw-0.002
 frame-0.002
quet-0.002
atha-0.002
Token smiled
Token ID13541
Feature activation-6.78e-03

Pos logit contributions
 shaving+0.001
 praying+0.001
 Potato+0.001
 whist+0.001
 temperature+0.001
Neg logit contributions
tted-0.002
quet-0.001
atha-0.001
ill-0.001
aney-0.001
Token and
Token ID290
Feature activation-1.25e-02

Pos logit contributions
 shaving+0.011
 praying+0.010
 Potato+0.010
 Brother+0.010
 cinema+0.010
Neg logit contributions
tted-0.010
ill-0.008
quet-0.008
 of-0.008
bridge-0.008
Token said
Token ID531
Feature activation+7.91e-03

Pos logit contributions
 shaving+0.017
 temperature+0.016
 praying+0.015
 Potato+0.015
 cinema+0.015
Neg logit contributions
tted-0.021
 Pam-0.019
atha-0.019
 Sue-0.019
quet-0.019
Token:
Token ID25
Feature activation-7.05e-02

Pos logit contributions
tted+0.014
quet+0.012
hell+0.012
adders+0.011
bridge+0.011
Neg logit contributions
 shaving-0.012
 whist-0.011
 praying-0.011
 Potato-0.011
 Brother-0.010
Token "
Token ID366
Feature activation-7.24e-04

Pos logit contributions
 temperature+0.094
 shaving+0.093
 grandson+0.093
 Brother+0.089
 whist+0.086
Neg logit contributions
tted-0.114
 Sue-0.103
 Pam-0.100
quet-0.095
hell-0.094
TokenIt
Token ID1026
Feature activation-2.82e-02

Pos logit contributions
 whist+0.001
 shaving+0.001
 praying+0.001
 Potato+0.001
 asking+0.001
Neg logit contributions
tted-0.001
umbs-0.001
adders-0.001
okingly-0.001
bridge-0.001
Token's
Token ID338
Feature activation-1.68e-02

Pos logit contributions
 shaving+0.027
 praying+0.023
 whist+0.022
 Brother+0.019
 Potato+0.019
Neg logit contributions
tted-0.067
atha-0.058
atters-0.057
bridge-0.057
quet-0.057
Token a
Token ID257
Feature activation-6.61e-03

Pos logit contributions
 shaving+0.023
 whist+0.023
 praying+0.022
 Potato+0.020
 asking+0.020
Neg logit contributions
tted-0.031
bridge-0.026
atha-0.026
quet-0.026
umbs-0.026
Token net
Token ID2010
Feature activation-1.18e-02

Pos logit contributions
 shaving+0.012
 praying+0.012
 whist+0.011
 asking+0.011
 Brother+0.011
Neg logit contributions
tted-0.009
quet-0.007
atha-0.007
 frame-0.007
igsaw-0.007
Token his
Token ID465
Feature activation-2.64e-02

Pos logit contributions
 shaving+0.019
 praying+0.019
 whist+0.019
 Potato+0.017
 asking+0.017
Neg logit contributions
tted-0.025
atha-0.021
quet-0.020
umbs-0.020
bridge-0.020
Token side
Token ID1735
Feature activation-8.29e-03

Pos logit contributions
 whist+0.032
 shaving+0.031
 praying+0.031
 Potato+0.028
 Brother+0.026
Neg logit contributions
tted-0.054
igsaw-0.045
atha-0.045
quet-0.044
okingly-0.044
Token.
Token ID13
Feature activation-7.95e-03

Pos logit contributions
 shaving+0.011
 praying+0.011
 asking+0.011
 whist+0.010
 saying+0.009
Neg logit contributions
tted-0.017
bridge-0.014
atha-0.014
umbs-0.014
quet-0.014
Token She
Token ID1375
Feature activation+7.13e-03

Pos logit contributions
 whist+0.016
 shaving+0.016
 praying+0.016
 Potato+0.015
 temperature+0.014
Neg logit contributions
tted-0.010
 Pam-0.007
umbs-0.007
igsaw-0.007
okingly-0.007
Token asked
Token ID1965
Feature activation-1.43e-02

Pos logit contributions
tted+0.014
atha+0.012
quet+0.012
bridge+0.011
fy+0.011
Neg logit contributions
 shaving-0.009
 praying-0.009
 whist-0.009
 Potato-0.008
 asking-0.007
Token:
Token ID25
Feature activation-7.01e-02

Pos logit contributions
 whist+0.019
 shaving+0.019
 praying+0.018
 Potato+0.017
 recorder+0.016
Neg logit contributions
tted-0.025
adders-0.022
 Pam-0.022
 Sue-0.021
hell-0.021
Token "
Token ID366
Feature activation-1.44e-02

Pos logit contributions
 whist+0.095
 temperature+0.095
 shaving+0.090
pa+0.089
 lung+0.089
Neg logit contributions
tted-0.120
 Pam-0.100
 Sue-0.100
There-0.093
igsaw-0.090
TokenWhat
Token ID2061
Feature activation-2.45e-02

Pos logit contributions
 praying+0.018
 whist+0.018
 shaving+0.018
 Potato+0.015
 pap+0.015
Neg logit contributions
tted-0.028
quet-0.024
atha-0.024
bridge-0.024
 queens-0.023
Token is
Token ID318
Feature activation+2.30e-02

Pos logit contributions
 shaving+0.028
 praying+0.027
 whist+0.027
 asking+0.026
 Potato+0.025
Neg logit contributions
tted-0.052
umbs-0.043
hell-0.043
bridge-0.043
atha-0.043
Token wrong
Token ID2642
Feature activation-5.11e-03

Pos logit contributions
tted+0.041
bridge+0.036
igsaw+0.036
adders+0.036
 frame+0.035
Neg logit contributions
 whist-0.028
 shaving-0.028
 praying-0.028
 Potato-0.026
 grandson-0.026
Token,
Token ID11
Feature activation+8.30e-03

Pos logit contributions
 whist+0.007
 shaving+0.007
 praying+0.007
 asking+0.006
 Potato+0.006
Neg logit contributions
tted-0.009
igsaw-0.008
okingly-0.007
quet-0.007
bridge-0.007
Token his
Token ID465
Feature activation+1.29e-02

Pos logit contributions
tted+0.019
bridge+0.016
atha+0.016
quet+0.016
umbs+0.016
Neg logit contributions
 praying-0.011
 shaving-0.011
 whist-0.011
 asking-0.010
 Potato-0.009
Token cap
Token ID1451
Feature activation+1.73e-02

Pos logit contributions
tted+0.023
umbs+0.019
bridge+0.019
quet+0.019
atha+0.018
Neg logit contributions
 shaving-0.020
 whist-0.019
 praying-0.019
 asking-0.018
 Potato-0.017
Token,
Token ID11
Feature activation-3.35e-02

Pos logit contributions
tted+0.039
umbs+0.033
bridge+0.033
quet+0.033
atha+0.033
Neg logit contributions
 whist-0.018
 shaving-0.018
 praying-0.017
 asking-0.016
 Potato-0.015
Token and
Token ID290
Feature activation-1.70e-02

Pos logit contributions
 shaving+0.052
 temperature+0.051
 praying+0.050
 Potato+0.050
isha+0.050
Neg logit contributions
tted-0.038
 Sue-0.037
 free-0.037
 chains-0.036
 Pam-0.036
Token said
Token ID531
Feature activation+1.32e-02

Pos logit contributions
 shaving+0.022
 praying+0.021
 whist+0.020
 Potato+0.019
 temperature+0.019
Neg logit contributions
tted-0.032
bridge-0.028
atha-0.027
 Pam-0.027
quet-0.027
Token:
Token ID25
Feature activation-7.01e-02

Pos logit contributions
tted+0.030
hell+0.025
bridge+0.025
quet+0.025
okingly+0.025
Neg logit contributions
 shaving-0.013
 praying-0.013
 whist-0.013
 Potato-0.012
 temperature-0.012
Token "
Token ID366
Feature activation+1.06e-02

Pos logit contributions
 temperature+0.106
 grandson+0.101
pa+0.100
 shaving+0.097
 praying+0.096
Neg logit contributions
tted-0.105
 Pam-0.088
 Sue-0.087
bridge-0.080
 free-0.080
TokenI
Token ID40
Feature activation-1.21e-02

Pos logit contributions
tted+0.017
umbs+0.014
bridge+0.014
quet+0.013
atha+0.013
Neg logit contributions
 shaving-0.018
 whist-0.018
 praying-0.017
 asking-0.016
 Potato-0.016
Token don
Token ID836
Feature activation-8.71e-03

Pos logit contributions
 shaving+0.018
 whist+0.018
 praying+0.018
 asking+0.016
 Potato+0.016
Neg logit contributions
tted-0.022
umbs-0.018
quet-0.018
bridge-0.018
atha-0.017
Token't
Token ID470
Feature activation-1.71e-02

Pos logit contributions
 shaving+0.015
 praying+0.015
 whist+0.015
 asking+0.013
 Potato+0.013
Neg logit contributions
tted-0.014
umbs-0.011
bridge-0.011
quet-0.011
atha-0.011
Token have
Token ID423
Feature activation-1.50e-03

Pos logit contributions
 whist+0.027
 shaving+0.026
 asking+0.025
 praying+0.024
 being+0.024
Neg logit contributions
tted-0.030
bridge-0.027
quet-0.027
umbs-0.026
adders-0.026
Token
Token ID198
Feature activation-2.33e-03

Pos logit contributions
 shaving+0.036
 praying+0.036
 whist+0.036
 asking+0.032
 Potato+0.032
Neg logit contributions
tted-0.049
umbs-0.040
bridge-0.040
quet-0.040
atha-0.039
Token
Token ID198
Feature activation+1.40e-02

Pos logit contributions
 praying+0.003
 shaving+0.003
 whist+0.003
 pap+0.002
 asking+0.002
Neg logit contributions
tted-0.005
umbs-0.004
hell-0.004
atha-0.004
ollen-0.004
TokenThe
Token ID464
Feature activation-9.06e-03

Pos logit contributions
tted+0.024
bridge+0.020
umbs+0.020
adders+0.020
quet+0.019
Neg logit contributions
 shaving-0.022
 whist-0.022
 praying-0.021
 Potato-0.020
 asking-0.020
Token man
Token ID582
Feature activation-3.88e-03

Pos logit contributions
 shaving+0.015
 praying+0.015
 whist+0.014
 Potato+0.014
 Brother+0.014
Neg logit contributions
tted-0.014
bridge-0.012
quet-0.011
 chains-0.011
 frame-0.011
Token asked
Token ID1965
Feature activation-2.20e-02

Pos logit contributions
 shaving+0.005
 praying+0.004
 whist+0.004
 Potato+0.004
 asking+0.004
Neg logit contributions
tted-0.008
bridge-0.007
atha-0.007
umbs-0.007
quet-0.007
Token:
Token ID25
Feature activation-6.99e-02

Pos logit contributions
 praying+0.033
 shaving+0.033
 whist+0.033
 recorder+0.029
 recording+0.029
Neg logit contributions
tted-0.035
 of-0.030
 Sue-0.030
 Pam-0.029
adders-0.029
Token �
Token ID6184
Feature activation-5.07e-04

Pos logit contributions
 temperature+0.099
 shaving+0.094
 whist+0.092
ise+0.090
 praying+0.090
Neg logit contributions
tted-0.118
 Sue-0.100
 Pam-0.097
aney-0.092
ill-0.092
Token�
Token ID95
Feature activation+5.25e-03

Pos logit contributions
 praying+0.001
 shaving+0.001
 whist+0.001
 asking+0.001
 Brother+0.001
Neg logit contributions
tted-0.001
quet-0.001
bridge-0.001
atha-0.001
igsaw-0.001
Token€
Token ID26391
Feature activation-2.37e-02

Pos logit contributions
tted+0.009
bridge+0.007
atha+0.007
quet+0.007
atters+0.007
Neg logit contributions
 praying-0.009
 whist-0.008
 shaving-0.008
 asking-0.008
 Brother-0.008
Token�
Token ID129
Feature activation+1.05e-02

Pos logit contributions
 shaving+0.034
 praying+0.033
 whist+0.033
 asking+0.031
 Potato+0.030
Neg logit contributions
tted-0.044
umbs-0.036
bridge-0.036
quet-0.036
atha-0.036
Token�
Token ID241
Feature activation-5.42e-03

Pos logit contributions
tted+0.018
atters+0.015
atha+0.014
bridge+0.014
hell+0.014
Neg logit contributions
 praying-0.016
 shaving-0.016
 whist-0.016
 Brother-0.015
 Potato-0.015
Token the
Token ID262
Feature activation-2.35e-02

Pos logit contributions
 praying+0.014
 shaving+0.014
 whist+0.014
 asking+0.013
 Potato+0.012
Neg logit contributions
tted-0.019
umbs-0.016
bridge-0.015
quet-0.015
atha-0.015
Token park
Token ID3952
Feature activation+1.04e-02

Pos logit contributions
 shaving+0.033
 praying+0.033
 whist+0.032
 asking+0.029
 Potato+0.029
Neg logit contributions
tted-0.044
bridge-0.036
quet-0.036
umbs-0.036
atha-0.035
Token her
Token ID607
Feature activation-1.30e-02

Pos logit contributions
tted+0.022
umbs+0.020
atters+0.019
bridge+0.019
okingly+0.019
Neg logit contributions
 praying-0.012
 whist-0.012
 shaving-0.011
 asking-0.011
 pap-0.010
Token "
Token ID366
Feature activation+1.79e-02

Pos logit contributions
 praying+0.016
 whist+0.016
 shaving+0.016
 asking+0.014
 Potato+0.014
Neg logit contributions
tted-0.027
umbs-0.023
bridge-0.023
quet-0.023
atha-0.022
Tokenuse
Token ID1904
Feature activation-5.94e-04

Pos logit contributions
tted+0.029
adders+0.025
igsaw+0.025
okingly+0.024
bridge+0.024
Neg logit contributions
 whist-0.028
 shaving-0.027
 praying-0.026
 asking-0.025
 Potato-0.024
Tokenful
Token ID913
Feature activation-6.96e-02

Pos logit contributions
 shaving+0.001
 whist+0.001
 praying+0.001
 Potato+0.000
 smoking+0.000
Neg logit contributions
tted-0.001
quet-0.001
atha-0.001
umbs-0.001
aney-0.001
Token-
Token ID12
Feature activation-1.19e-02

Pos logit contributions
 shaving+0.102
 whist+0.097
 praying+0.096
 asking+0.086
 smoking+0.085
Neg logit contributions
tted-0.124
umbs-0.106
quet-0.105
aney-0.102
hell-0.102
Tokenpark
Token ID20928
Feature activation-2.95e-02

Pos logit contributions
 praying+0.018
 shaving+0.018
 whist+0.018
 asking+0.017
 Potato+0.016
Neg logit contributions
tted-0.021
umbs-0.017
bridge-0.017
quet-0.016
atha-0.016
Token".
Token ID1911
Feature activation-2.62e-02

Pos logit contributions
 shaving+0.046
 whist+0.045
 praying+0.045
 asking+0.040
 smoking+0.039
Neg logit contributions
tted-0.052
quet-0.042
umbs-0.042
atha-0.041
bridge-0.041
Token<|endoftext|>
Token ID50256
Feature activation-2.02e-02

Pos logit contributions
 shaving+0.026
 praying+0.026
 whist+0.026
 asking+0.023
 Potato+0.022
Neg logit contributions
tted-0.060
bridge-0.052
quet-0.051
umbs-0.051
atha-0.050
TokenOnce
Token ID7454
Feature activation+2.55e-02

Pos logit contributions
 praying+0.022
 shaving+0.021
 whist+0.021
 pap+0.020
 asking+0.019
Neg logit contributions
tted-0.046
umbs-0.040
bridge-0.038
hell-0.037
ay-0.037
Token to
Token ID284
Feature activation+6.03e-04

Pos logit contributions
 whist+0.018
 praying+0.018
 shaving+0.018
 asking+0.016
 Potato+0.015
Neg logit contributions
tted-0.036
bridge-0.031
umbs-0.030
quet-0.030
atha-0.030
Token the
Token ID262
Feature activation-1.46e-02

Pos logit contributions
tted+0.001
 Pam+0.001
 free+0.001
 Sue+0.001
quet+0.001
Neg logit contributions
 shaving-0.001
 praying-0.001
 whist-0.001
 temperature-0.001
 Brother-0.001
Token bear
Token ID6842
Feature activation-1.41e-02

Pos logit contributions
 praying+0.023
 shaving+0.023
 whist+0.022
 asking+0.021
 Brother+0.020
Neg logit contributions
tted-0.024
bridge-0.020
quet-0.020
umbs-0.019
hell-0.019
Token and
Token ID290
Feature activation-2.00e-03

Pos logit contributions
 shaving+0.017
 praying+0.017
 whist+0.017
 asking+0.015
 Potato+0.014
Neg logit contributions
tted-0.029
umbs-0.025
bridge-0.025
quet-0.025
atha-0.024
Token ask
Token ID1265
Feature activation-3.13e-02

Pos logit contributions
 whist+0.003
 praying+0.003
 shaving+0.003
 asking+0.003
 pap+0.003
Neg logit contributions
tted-0.004
umbs-0.004
bridge-0.003
aney-0.003
hell-0.003
Token nicely
Token ID16576
Feature activation-6.94e-02

Pos logit contributions
 shaving+0.049
 whist+0.047
 praying+0.046
 smoking+0.041
 razor+0.041
Neg logit contributions
tted-0.049
adders-0.041
 Pam-0.041
 free-0.041
atha-0.041
Token if
Token ID611
Feature activation-1.47e-02

Pos logit contributions
 shaving+0.116
 whist+0.112
 praying+0.110
 Potato+0.105
 razor+0.104
Neg logit contributions
tted-0.092
 queens-0.083
ives-0.081
adders-0.080
aney-0.080
Token Hop
Token ID9996
Feature activation-1.29e-02

Pos logit contributions
 shaving+0.028
 praying+0.026
 whist+0.026
 Potato+0.024
 asking+0.024
Neg logit contributions
tted-0.019
 queens-0.015
quet-0.014
adders-0.014
 Pam-0.014
Token can
Token ID460
Feature activation-1.21e-02

Pos logit contributions
 whist+0.016
 shaving+0.016
 praying+0.016
 asking+0.014
 Potato+0.014
Neg logit contributions
tted-0.026
bridge-0.022
umbs-0.022
quet-0.022
atha-0.021
Token borrow
Token ID8804
Feature activation+4.00e-03

Pos logit contributions
 shaving+0.021
 whist+0.021
 praying+0.021
 asking+0.020
 Potato+0.019
Neg logit contributions
tted-0.018
bridge-0.015
umbs-0.015
quet-0.014
atha-0.014
Token the
Token ID262
Feature activation-2.28e-03

Pos logit contributions
tted+0.007
umbs+0.006
quet+0.006
bridge+0.006
hell+0.006
Neg logit contributions
 shaving-0.006
 praying-0.006
 whist-0.006
 asking-0.005
 Potato-0.005
Token mum
Token ID25682
Feature activation+7.12e-03

Pos logit contributions
 praying+0.007
 whist+0.007
 shaving+0.007
 smoking+0.007
 asking+0.007
Neg logit contributions
tted-0.008
 Pam-0.007
okingly-0.007
aney-0.007
igsaw-0.007
Token was
Token ID373
Feature activation-1.87e-02

Pos logit contributions
tted+0.014
quet+0.012
atha+0.012
bridge+0.012
umbs+0.012
Neg logit contributions
 shaving-0.009
 praying-0.009
 whist-0.008
 Potato-0.008
 asking-0.008
Token surprised
Token ID6655
Feature activation+2.78e-02

Pos logit contributions
 praying+0.029
 whist+0.029
 shaving+0.029
 Potato+0.028
 Brother+0.027
Neg logit contributions
tted-0.031
bridge-0.025
atha-0.024
quet-0.024
umbs-0.024
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
tted+0.063
umbs+0.054
bridge+0.054
quet+0.052
adders+0.052
Neg logit contributions
 shaving-0.030
 praying-0.030
 whist-0.030
 asking-0.028
 Potato-0.025
Token asked
Token ID1965
Feature activation-2.13e-02

Pos logit contributions
 shaving+0.016
 praying+0.016
 whist+0.016
 Potato+0.014
 asking+0.014
Neg logit contributions
tted-0.031
quet-0.026
bridge-0.026
atha-0.026
umbs-0.026
Token:
Token ID25
Feature activation-6.93e-02

Pos logit contributions
 shaving+0.029
 whist+0.028
 praying+0.028
 Potato+0.025
 asking+0.025
Neg logit contributions
tted-0.042
bridge-0.034
quet-0.034
umbs-0.034
atha-0.034
Token "
Token ID366
Feature activation+2.40e-03

Pos logit contributions
 temperature+0.093
 shaving+0.090
 whist+0.088
 lung+0.086
 praying+0.085
Neg logit contributions
tted-0.120
 Sue-0.102
 Pam-0.098
ill-0.098
aney-0.096
TokenWhat
Token ID2061
Feature activation-2.04e-02

Pos logit contributions
tted+0.005
umbs+0.004
bridge+0.004
quet+0.004
atha+0.004
Neg logit contributions
 shaving-0.003
 whist-0.003
 praying-0.003
 asking-0.003
 Potato-0.002
Token will
Token ID481
Feature activation+9.14e-03

Pos logit contributions
 shaving+0.029
 whist+0.029
 praying+0.028
 asking+0.027
 Potato+0.027
Neg logit contributions
tted-0.034
adders-0.030
hell-0.030
atha-0.030
 frame-0.029
Token you
Token ID345
Feature activation+1.65e-02

Pos logit contributions
tted+0.017
umbs+0.014
bridge+0.014
quet+0.014
atha+0.014
Neg logit contributions
 shaving-0.013
 praying-0.013
 whist-0.013
 asking-0.012
 Potato-0.012
Token do
Token ID466
Feature activation+1.25e-02

Pos logit contributions
tted+0.030
umbs+0.025
atha+0.025
bridge+0.025
quet+0.024
Neg logit contributions
 shaving-0.024
 praying-0.024
 whist-0.023
 asking-0.021
 Potato-0.021
Token other
Token ID584
Feature activation-1.68e-02

Pos logit contributions
 shaving+0.007
 praying+0.007
 whist+0.007
 Potato+0.006
 asking+0.006
Neg logit contributions
tted-0.011
umbs-0.010
quet-0.010
bridge-0.010
atha-0.009
Token animals
Token ID4695
Feature activation+1.32e-02

Pos logit contributions
 shaving+0.022
 whist+0.021
 praying+0.021
 Potato+0.019
 asking+0.018
Neg logit contributions
tted-0.034
umbs-0.029
quet-0.028
bridge-0.028
atha-0.028
Token came
Token ID1625
Feature activation-1.41e-02

Pos logit contributions
tted+0.027
umbs+0.023
quet+0.023
bridge+0.023
igsaw+0.022
Neg logit contributions
 whist-0.017
 praying-0.017
 shaving-0.017
 asking-0.015
 Potato-0.014
Token to
Token ID284
Feature activation-7.13e-03

Pos logit contributions
 shaving+0.020
 praying+0.020
 whist+0.019
 Potato+0.018
 asking+0.017
Neg logit contributions
tted-0.026
umbs-0.022
atha-0.022
quet-0.021
bridge-0.021
Token see
Token ID766
Feature activation-1.54e-02

Pos logit contributions
 shaving+0.012
 praying+0.012
ise+0.011
ning+0.011
 Brother+0.011
Neg logit contributions
tted-0.009
 Sue-0.008
 free-0.008
 Pam-0.007
 queens-0.007
Token Beef
Token ID35309
Feature activation-6.85e-02

Pos logit contributions
 shaving+0.024
 praying+0.022
 whist+0.021
 temperature+0.021
 recorder+0.021
Neg logit contributions
tted-0.022
umbs-0.020
 Sue-0.020
 of-0.019
hell-0.019
Token dance
Token ID9280
Feature activation+3.70e-03

Pos logit contributions
 shaving+0.106
 praying+0.097
 Brother+0.096
 razor+0.095
 grandson+0.093
Neg logit contributions
tted-0.093
umbs-0.085
 of-0.082
ives-0.081
aults-0.080
Token in
Token ID287
Feature activation+2.95e-03

Pos logit contributions
tted+0.009
umbs+0.008
quet+0.008
okingly+0.008
bridge+0.008
Neg logit contributions
 whist-0.003
 praying-0.003
 shaving-0.003
 asking-0.002
 Potato-0.002
Token the
Token ID262
Feature activation+8.96e-03

Pos logit contributions
tted+0.005
umbs+0.004
bridge+0.004
quet+0.004
igsaw+0.004
Neg logit contributions
 shaving-0.004
 praying-0.004
 whist-0.004
 asking-0.004
 Potato-0.004
Token empty
Token ID6565
Feature activation-9.50e-03

Pos logit contributions
tted+0.019
umbs+0.018
okingly+0.017
adders+0.017
aney+0.017
Neg logit contributions
 whist-0.009
 shaving-0.009
 praying-0.009
 living-0.009
 radio-0.008
Token box
Token ID3091
Feature activation+1.15e-02

Pos logit contributions
 shaving+0.013
 praying+0.013
 whist+0.013
 asking+0.012
 Potato+0.011
Neg logit contributions
tted-0.018
umbs-0.015
bridge-0.015
quet-0.015
atha-0.015
Token kitchen
Token ID9592
Feature activation-2.21e-02

Pos logit contributions
 whist+0.017
 praying+0.016
 shaving+0.015
ise+0.015
 Potato+0.014
Neg logit contributions
tted-0.036
bridge-0.031
umbs-0.031
 frame-0.030
quet-0.030
Token and
Token ID290
Feature activation-1.01e-02

Pos logit contributions
 whist+0.033
 shaving+0.032
 praying+0.031
 Potato+0.031
 Brother+0.029
Neg logit contributions
tted-0.036
umbs-0.030
quet-0.030
aney-0.029
bridge-0.029
Token see
Token ID766
Feature activation-1.91e-02

Pos logit contributions
 shaving+0.017
 praying+0.016
 whist+0.016
 Potato+0.015
 asking+0.015
Neg logit contributions
tted-0.016
atha-0.012
bridge-0.012
umbs-0.012
adders-0.012
Token their
Token ID511
Feature activation-1.34e-02

Pos logit contributions
 whist+0.028
 shaving+0.028
 praying+0.026
 Potato+0.026
 asking+0.025
Neg logit contributions
tted-0.030
adders-0.027
bridge-0.026
umbs-0.025
 of-0.025
Token mom
Token ID1995
Feature activation-1.12e-02

Pos logit contributions
 whist+0.016
 praying+0.016
 shaving+0.016
 Potato+0.015
 asking+0.014
Neg logit contributions
tted-0.026
umbs-0.022
igsaw-0.022
quet-0.021
aney-0.021
Token chopping
Token ID45274
Feature activation-6.85e-02

Pos logit contributions
 whist+0.013
 shaving+0.012
 praying+0.012
 Potato+0.012
 Brother+0.011
Neg logit contributions
tted-0.023
umbs-0.020
bridge-0.020
quet-0.020
aney-0.019
Token carrots
Token ID34397
Feature activation-2.68e-02

Pos logit contributions
 praying+0.113
 whist+0.112
 shaving+0.105
 grandfather+0.101
ica+0.101
Neg logit contributions
tted-0.100
umbs-0.091
bridge-0.084
igsaw-0.081
adders-0.081
Token and
Token ID290
Feature activation-2.58e-02

Pos logit contributions
 praying+0.032
 shaving+0.032
 whist+0.032
 asking+0.028
 Potato+0.027
Neg logit contributions
tted-0.056
umbs-0.048
quet-0.047
bridge-0.047
hell-0.046
Token potatoes
Token ID18821
Feature activation-2.99e-02

Pos logit contributions
 grandson+0.032
 whist+0.032
 Man+0.032
 grandfather+0.032
 praying+0.031
Neg logit contributions
 queens-0.037
tted-0.035
bridge-0.035
umbs-0.034
quet-0.033
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
 praying+0.039
 whist+0.038
 shaving+0.038
 Potato+0.035
 grandson+0.035
Neg logit contributions
tted-0.058
umbs-0.050
bridge-0.048
quet-0.048
hell-0.047
Token She
Token ID1375
Feature activation-7.93e-03

Pos logit contributions
 whist+0.025
 praying+0.024
 shaving+0.024
ise+0.023
ning+0.022
Neg logit contributions
tted-0.014
umbs-0.011
 Sue-0.010
 Pam-0.010
aney-0.010
Token bit
Token ID1643
Feature activation+1.50e-02

Pos logit contributions
tted+0.040
umbs+0.031
quet+0.031
fy+0.030
atha+0.030
Neg logit contributions
 shaving-0.045
 praying-0.044
 whist-0.043
 asking-0.041
 Potato-0.040
Token scared
Token ID12008
Feature activation+2.12e-02

Pos logit contributions
tted+0.030
bridge+0.026
quet+0.025
umbs+0.025
atha+0.025
Neg logit contributions
 whist-0.020
 shaving-0.020
 praying-0.020
 asking-0.018
 Potato-0.017
Token.
Token ID13
Feature activation-1.14e-03

Pos logit contributions
tted+0.047
umbs+0.040
bridge+0.040
quet+0.040
atha+0.040
Neg logit contributions
 praying-0.024
 shaving-0.024
 whist-0.024
 asking-0.022
 Potato-0.020
Token She
Token ID1375
Feature activation+8.17e-03

Pos logit contributions
 shaving+0.002
 whist+0.002
 praying+0.002
 asking+0.002
 Potato+0.002
Neg logit contributions
tted-0.002
umbs-0.001
atha-0.001
quet-0.001
bridge-0.001
Token asked
Token ID1965
Feature activation-7.23e-03

Pos logit contributions
tted+0.016
atha+0.013
fy+0.013
bridge+0.012
okingly+0.012
Neg logit contributions
 shaving-0.011
 praying-0.011
 Potato-0.010
 whist-0.009
 smoking-0.009
Token:
Token ID25
Feature activation-6.85e-02

Pos logit contributions
 whist+0.011
 shaving+0.011
 praying+0.011
 Potato+0.010
 Brother+0.010
Neg logit contributions
tted-0.012
adders-0.010
atha-0.010
okingly-0.010
igsaw-0.010
Token "
Token ID366
Feature activation-8.42e-03

Pos logit contributions
 temperature+0.105
 shaving+0.101
pa+0.100
 whist+0.099
ise+0.098
Neg logit contributions
tted-0.102
 Sue-0.086
 Pam-0.083
There-0.078
aney-0.078
TokenWhat
Token ID2061
Feature activation-2.53e-02

Pos logit contributions
 shaving+0.012
 praying+0.012
 whist+0.012
 asking+0.010
 Potato+0.010
Neg logit contributions
tted-0.015
bridge-0.013
There-0.013
atha-0.013
quet-0.013
Token if
Token ID611
Feature activation-5.37e-03

Pos logit contributions
 shaving+0.037
 asking+0.035
 whist+0.034
 Potato+0.033
 praying+0.033
Neg logit contributions
tted-0.041
adders-0.038
 of-0.036
hell-0.036
 frame-0.036
Token it
Token ID340
Feature activation-1.37e-02

Pos logit contributions
 shaving+0.010
 whist+0.010
 praying+0.009
 asking+0.009
rev+0.009
Neg logit contributions
tted-0.006
 of-0.005
adders-0.005
igator-0.005
 frame-0.005
Token hurts
Token ID20406
Feature activation+1.25e-03

Pos logit contributions
 shaving+0.018
 praying+0.017
 whist+0.016
 Potato+0.016
 asking+0.015
Neg logit contributions
tted-0.026
umbs-0.021
igator-0.021
atters-0.021
ives-0.021
Token time
Token ID640
Feature activation-8.77e-05

Pos logit contributions
 shaving+0.014
 praying+0.014
 whist+0.014
 asking+0.013
 Potato+0.013
Neg logit contributions
tted-0.019
umbs-0.016
bridge-0.015
quet-0.015
atha-0.015
Token,
Token ID11
Feature activation+1.85e-02

Pos logit contributions
 shaving+0.000
 praying+0.000
 whist+0.000
 asking+0.000
 Potato+0.000
Neg logit contributions
tted-0.000
umbs-0.000
bridge-0.000
quet-0.000
atha-0.000
Token a
Token ID257
Feature activation-1.52e-02

Pos logit contributions
tted+0.029
umbs+0.023
bridge+0.022
quet+0.022
atha+0.022
Neg logit contributions
 shaving-0.032
 praying-0.032
 whist-0.032
 asking-0.029
 Potato-0.029
Token little
Token ID1310
Feature activation+5.11e-03

Pos logit contributions
 praying+0.026
 shaving+0.026
 asking+0.024
 whist+0.024
 temperature+0.023
Neg logit contributions
tted-0.024
atha-0.018
bridge-0.018
hell-0.018
igsaw-0.018
Token boy
Token ID2933
Feature activation-7.75e-03

Pos logit contributions
tted+0.011
umbs+0.009
bridge+0.009
quet+0.009
atha+0.009
Neg logit contributions
 whist-0.006
 shaving-0.006
 praying-0.006
 asking-0.006
 Potato-0.006
Token named
Token ID3706
Feature activation+4.70e-02

Pos logit contributions
 shaving+0.015
 praying+0.015
 whist+0.014
 Potato+0.014
 asking+0.013
Neg logit contributions
tted-0.010
bridge-0.008
atha-0.007
okingly-0.007
umbs-0.007
Token Andy
Token ID12382
Feature activation-1.72e-02

Pos logit contributions
tted+0.094
quet+0.083
umbs+0.078
bridge+0.077
atha+0.075
Neg logit contributions
 shaving-0.065
 whist-0.064
 praying-0.063
 asking-0.060
 Brother-0.058
Token had
Token ID550
Feature activation-5.60e-03

Pos logit contributions
 shaving+0.022
 praying+0.022
 whist+0.020
 Potato+0.020
 asking+0.020
Neg logit contributions
tted-0.033
bridge-0.028
atha-0.028
hell-0.028
umbs-0.027
Token a
Token ID257
Feature activation+8.21e-03

Pos logit contributions
 praying+0.007
 shaving+0.007
 whist+0.007
 Potato+0.007
 asking+0.007
Neg logit contributions
tted-0.011
bridge-0.009
umbs-0.009
atha-0.009
quet-0.008
Token fragile
Token ID21049
Feature activation-1.03e-02

Pos logit contributions
tted+0.012
igsaw+0.010
 frame+0.009
fy+0.009
atha+0.009
Neg logit contributions
 shaving-0.014
 praying-0.014
 asking-0.013
 whist-0.013
 Potato-0.012
Token bicycle
Token ID17026
Feature activation+1.63e-02

Pos logit contributions
 praying+0.016
 shaving+0.016
 whist+0.015
 asking+0.015
 Brother+0.014
Neg logit contributions
tted-0.016
 frame-0.014
igsaw-0.013
hell-0.013
 chains-0.013
Token there
Token ID612
Feature activation-4.62e-03

Pos logit contributions
tted+0.039
quet+0.035
umbs+0.034
ives+0.033
bridge+0.033
Neg logit contributions
 whist-0.028
 praying-0.028
 shaving-0.028
 asking-0.025
 smoking-0.024
Token was
Token ID373
Feature activation-9.77e-03

Pos logit contributions
 praying+0.006
 shaving+0.005
 whist+0.005
 asking+0.005
 grandson+0.005
Neg logit contributions
tted-0.009
 Sue-0.008
 of-0.008
ay-0.008
hell-0.008
Token a
Token ID257
Feature activation-1.82e-02

Pos logit contributions
 shaving+0.011
 whist+0.011
 praying+0.011
 asking+0.009
 Potato+0.009
Neg logit contributions
tted-0.021
quet-0.018
umbs-0.018
bridge-0.018
atha-0.018
Token little
Token ID1310
Feature activation+2.26e-03

Pos logit contributions
 shaving+0.029
 praying+0.028
 whist+0.027
 asking+0.026
 temperature+0.025
Neg logit contributions
tted-0.030
quet-0.024
bridge-0.024
atha-0.024
umbs-0.023
Token girl
Token ID2576
Feature activation+6.23e-04

Pos logit contributions
tted+0.005
bridge+0.004
umbs+0.004
atha+0.004
quet+0.004
Neg logit contributions
 whist-0.003
 shaving-0.003
 praying-0.003
 Potato-0.002
 Brother-0.002
Token named
Token ID3706
Feature activation+4.35e-02

Pos logit contributions
tted+0.001
umbs+0.001
bridge+0.001
quet+0.001
atha+0.001
Neg logit contributions
 shaving-0.001
 praying-0.001
 whist-0.001
 asking-0.001
 Potato-0.001
Token Lily
Token ID20037
Feature activation-5.76e-03

Pos logit contributions
tted+0.094
bridge+0.090
umbs+0.081
quet+0.081
hell+0.079
Neg logit contributions
 asking-0.052
 praying-0.049
 Gr-0.048
 shaving-0.047
 daddy-0.045
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
 shaving+0.009
 praying+0.008
 Potato+0.008
 whist+0.008
 Brother+0.007
Neg logit contributions
tted-0.010
hell-0.008
atha-0.008
atters-0.008
 frame-0.008
Token She
Token ID1375
Feature activation-2.96e-03

Pos logit contributions
tted+0.031
quet+0.027
bridge+0.027
vell+0.026
atha+0.025
Neg logit contributions
 shaving-0.023
 praying-0.022
 asking-0.022
 whist-0.021
 Potato-0.021
Token loved
Token ID6151
Feature activation-1.68e-02

Pos logit contributions
 shaving+0.004
 praying+0.004
 Potato+0.004
 asking+0.004
 Brother+0.003
Neg logit contributions
tted-0.006
atha-0.005
hell-0.005
bridge-0.005
Tweet-0.005
Token to
Token ID284
Feature activation-9.20e-03

Pos logit contributions
 shaving+0.024
 praying+0.023
 whist+0.023
 Potato+0.022
 Brother+0.022
Neg logit contributions
tted-0.026
 queens-0.023
quet-0.023
umbs-0.023
ollen-0.022
Token there
Token ID612
Feature activation-3.48e-03

Pos logit contributions
tted+0.036
umbs+0.031
quet+0.030
bridge+0.030
atha+0.029
Neg logit contributions
 whist-0.032
 shaving-0.032
 praying-0.031
 asking-0.028
 Potato-0.027
Token was
Token ID373
Feature activation-6.70e-03

Pos logit contributions
 praying+0.005
 shaving+0.004
 grandson+0.004
 asking+0.004
 whist+0.004
Neg logit contributions
tted-0.007
 Sue-0.006
 of-0.006
ay-0.006
hell-0.006
Token a
Token ID257
Feature activation-1.72e-02

Pos logit contributions
 shaving+0.007
 whist+0.007
 praying+0.006
 asking+0.005
 Potato+0.005
Neg logit contributions
tted-0.016
quet-0.014
atha-0.013
umbs-0.013
bridge-0.013
Token little
Token ID1310
Feature activation+5.26e-03

Pos logit contributions
 shaving+0.028
 praying+0.027
 whist+0.026
 asking+0.026
 temperature+0.024
Neg logit contributions
tted-0.028
bridge-0.022
quet-0.022
atha-0.022
hell-0.021
Token boy
Token ID2933
Feature activation-9.88e-03

Pos logit contributions
tted+0.011
umbs+0.009
bridge+0.009
atha+0.009
quet+0.009
Neg logit contributions
 whist-0.006
 shaving-0.006
 praying-0.006
 Potato-0.006
 asking-0.005
Token named
Token ID3706
Feature activation+4.66e-02

Pos logit contributions
 shaving+0.018
 praying+0.017
 whist+0.017
 asking+0.016
 Potato+0.016
Neg logit contributions
tted-0.014
quet-0.011
umbs-0.011
bridge-0.011
atha-0.011
Token Tim
Token ID5045
Feature activation-7.47e-03

Pos logit contributions
tted+0.073
umbs+0.058
bridge+0.057
quet+0.056
atha+0.056
Neg logit contributions
 shaving-0.082
 praying-0.081
 whist-0.081
 asking-0.075
 Potato-0.073
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
 shaving+0.011
 praying+0.010
 whist+0.010
 Potato+0.009
 pap+0.009
Neg logit contributions
tted-0.013
hell-0.011
atha-0.011
umbs-0.011
atters-0.011
Token Tim
Token ID5045
Feature activation+1.74e-03

Pos logit contributions
tted+0.036
quet+0.030
atha+0.030
bridge+0.030
umbs+0.029
Neg logit contributions
 shaving-0.033
 whist-0.032
 praying-0.032
 asking-0.030
 Potato-0.029
Token loved
Token ID6151
Feature activation-1.61e-02

Pos logit contributions
tted+0.003
hell+0.003
atha+0.003
igator+0.003
bridge+0.003
Neg logit contributions
 praying-0.002
 shaving-0.002
 pap-0.002
 Potato-0.002
 asking-0.002
Token to
Token ID284
Feature activation-5.20e-03

Pos logit contributions
 praying+0.024
 shaving+0.024
 Potato+0.024
ise+0.023
 whist+0.023
Neg logit contributions
tted-0.022
 queens-0.020
 chains-0.019
quet-0.019
 frame-0.019
TokenOnce
Token ID7454
Feature activation+2.61e-02

Pos logit contributions
 praying+0.018
 shaving+0.018
 whist+0.017
 pap+0.016
 asking+0.016
Neg logit contributions
tted-0.038
umbs-0.032
bridge-0.031
hell-0.030
ay-0.030
Token there
Token ID612
Feature activation+1.68e-02

Pos logit contributions
tted+0.054
umbs+0.045
bridge+0.045
quet+0.045
atha+0.044
Neg logit contributions
 shaving-0.032
 praying-0.032
 whist-0.032
 asking-0.028
 Potato-0.027
Token was
Token ID373
Feature activation+5.34e-04

Pos logit contributions
tted+0.036
 Sue+0.031
aney+0.031
hell+0.031
ill+0.030
Neg logit contributions
 shaving-0.018
 praying-0.018
 whist-0.017
 asking-0.015
 Potato-0.015
Token a
Token ID257
Feature activation-1.97e-02

Pos logit contributions
tted+0.001
quet+0.001
atha+0.001
umbs+0.001
bridge+0.001
Neg logit contributions
 whist-0.001
 shaving-0.001
 praying-0.001
 asking-0.000
 Potato-0.000
Token boy
Token ID2933
Feature activation-5.63e-03

Pos logit contributions
 shaving+0.033
 praying+0.032
 asking+0.031
 whist+0.030
 temperature+0.029
Neg logit contributions
tted-0.030
atha-0.024
hell-0.024
fy-0.023
quet-0.023
Token named
Token ID3706
Feature activation+5.21e-02

Pos logit contributions
 shaving+0.011
 praying+0.011
 whist+0.011
 Potato+0.011
 temperature+0.010
Neg logit contributions
tted-0.006
okingly-0.005
atters-0.004
hell-0.004
fy-0.004
Token Bob
Token ID5811
Feature activation-1.89e-02

Pos logit contributions
tted+0.078
umbs+0.061
bridge+0.059
atha+0.059
quet+0.058
Neg logit contributions
 shaving-0.095
 praying-0.094
 whist-0.094
 asking-0.087
 Potato-0.085
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
 shaving+0.027
 praying+0.027
 Potato+0.024
 asking+0.024
 whist+0.024
Neg logit contributions
tted-0.033
hell-0.028
umbs-0.027
atha-0.027
igator-0.026
Token He
Token ID679
Feature activation-1.48e-02

Pos logit contributions
tted+0.031
umbs+0.026
bridge+0.025
hell+0.025
quet+0.024
Neg logit contributions
 shaving-0.028
 praying-0.028
 whist-0.028
 asking-0.025
 Potato-0.025
Token was
Token ID373
Feature activation-1.88e-02

Pos logit contributions
 praying+0.021
 asking+0.020
 shaving+0.020
 temperature+0.020
 grandson+0.019
Neg logit contributions
tted-0.027
bridge-0.022
hell-0.021
Tweet-0.021
atha-0.021
Token very
Token ID845
Feature activation-1.07e-02

Pos logit contributions
 shaving+0.024
 praying+0.024
 whist+0.023
 asking+0.021
 Potato+0.021
Neg logit contributions
tted-0.037
bridge-0.031
umbs-0.031
quet-0.030
atha-0.030
Token there
Token ID612
Feature activation-9.42e-04

Pos logit contributions
tted+0.036
umbs+0.029
bridge+0.029
quet+0.029
atha+0.029
Neg logit contributions
 shaving-0.037
 praying-0.037
 whist-0.036
 asking-0.033
 Potato-0.033
Token was
Token ID373
Feature activation-6.37e-03

Pos logit contributions
 praying+0.001
 shaving+0.001
 asking+0.001
 grandson+0.001
 whist+0.001
Neg logit contributions
tted-0.002
 of-0.002
 Sue-0.002
ay-0.002
hell-0.002
Token a
Token ID257
Feature activation-1.65e-02

Pos logit contributions
 shaving+0.007
 whist+0.007
 praying+0.006
 asking+0.005
 Potato+0.005
Neg logit contributions
tted-0.015
quet-0.013
umbs-0.013
atha-0.012
bridge-0.012
Token lazy
Token ID16931
Feature activation-2.14e-02

Pos logit contributions
 shaving+0.027
 praying+0.026
 whist+0.025
 asking+0.025
 temperature+0.024
Neg logit contributions
tted-0.026
quet-0.021
atha-0.021
bridge-0.021
hell-0.020
Token cat
Token ID3797
Feature activation-3.83e-03

Pos logit contributions
 shaving+0.031
 praying+0.030
 asking+0.028
 whist+0.028
 temperature+0.026
Neg logit contributions
tted-0.036
hell-0.031
quet-0.031
igsaw-0.031
bridge-0.030
Token named
Token ID3706
Feature activation+4.53e-02

Pos logit contributions
 shaving+0.007
 praying+0.007
 whist+0.006
 asking+0.006
 Potato+0.006
Neg logit contributions
tted-0.006
quet-0.004
bridge-0.004
umbs-0.004
atha-0.004
Token Tom
Token ID4186
Feature activation-1.01e-02

Pos logit contributions
tted+0.079
bridge+0.064
umbs+0.064
quet+0.064
atha+0.063
Neg logit contributions
 shaving-0.071
 praying-0.070
 whist-0.070
 asking-0.064
 Potato-0.063
Token.
Token ID13
Feature activation+1.65e-02

Pos logit contributions
 shaving+0.015
 praying+0.014
 Potato+0.013
 Brother+0.013
 whist+0.013
Neg logit contributions
tted-0.017
hell-0.015
igator-0.014
atters-0.014
umbs-0.014
Token Tom
Token ID4186
Feature activation-1.93e-03

Pos logit contributions
tted+0.030
umbs+0.024
bridge+0.024
quet+0.024
atha+0.024
Neg logit contributions
 shaving-0.025
 praying-0.025
 whist-0.025
 asking-0.022
 Potato-0.022
Token loved
Token ID6151
Feature activation-1.46e-02

Pos logit contributions
 shaving+0.003
 praying+0.003
 asking+0.003
 Potato+0.003
 pap+0.003
Neg logit contributions
tted-0.003
igator-0.003
hell-0.003
bridge-0.003
 frame-0.003
Token to
Token ID284
Feature activation-2.81e-03

Pos logit contributions
 shaving+0.023
 praying+0.023
 Potato+0.022
 whist+0.022
ise+0.022
Neg logit contributions
tted-0.019
 queens-0.017
 chains-0.016
adders-0.016
umbs-0.016
Token her
Token ID607
Feature activation-5.45e-03

Pos logit contributions
tted+0.020
bridge+0.017
umbs+0.017
quet+0.017
atha+0.017
Neg logit contributions
 shaving-0.010
 praying-0.010
 whist-0.010
 asking-0.009
 Potato-0.009
Token.
Token ID13
Feature activation+9.70e-05

Pos logit contributions
 shaving+0.007
 praying+0.007
 whist+0.007
 Potato+0.006
 asking+0.006
Neg logit contributions
tted-0.010
hell-0.008
quet-0.008
bridge-0.008
atha-0.008
Token She
Token ID1375
Feature activation+5.64e-03

Pos logit contributions
tted+0.000
atha+0.000
bridge+0.000
umbs+0.000
igsaw+0.000
Neg logit contributions
 whist-0.000
 shaving-0.000
 praying-0.000
 Potato-0.000
 pap-0.000
Token sighed
Token ID21893
Feature activation-1.37e-02

Pos logit contributions
tted+0.011
atha+0.009
fy+0.009
bridge+0.008
 Pam+0.008
Neg logit contributions
 shaving-0.008
 praying-0.007
 Potato-0.007
 smoking-0.006
 asking-0.006
Token and
Token ID290
Feature activation-4.70e-03

Pos logit contributions
 shaving+0.024
 Potato+0.023
rev+0.022
 Brother+0.022
 pap+0.022
Neg logit contributions
tted-0.016
 free-0.014
 of-0.013
hell-0.012
fy-0.012
Token said
Token ID531
Feature activation+2.25e-02

Pos logit contributions
 shaving+0.007
 praying+0.006
 Potato+0.006
 whist+0.006
 temperature+0.005
Neg logit contributions
tted-0.009
atha-0.007
 Pam-0.007
okingly-0.007
 Sue-0.007
Token,
Token ID11
Feature activation-3.47e-02

Pos logit contributions
tted+0.041
bridge+0.034
adders+0.034
hell+0.034
quet+0.034
Neg logit contributions
 shaving-0.031
 whist-0.030
 praying-0.030
 Potato-0.028
 asking-0.028
Token "
Token ID366
Feature activation+1.02e-02

Pos logit contributions
 shaving+0.046
 temperature+0.043
 cinema+0.042
 whist+0.041
 praying+0.041
Neg logit contributions
tted-0.057
drops-0.050
 Sue-0.047
 Pam-0.046
 free-0.046
TokenI
Token ID40
Feature activation-1.10e-02

Pos logit contributions
tted+0.019
umbs+0.016
bridge+0.016
quet+0.016
atha+0.016
Neg logit contributions
 shaving-0.014
 praying-0.014
 whist-0.014
 asking-0.013
 Potato-0.013
Token wish
Token ID4601
Feature activation+9.44e-03

Pos logit contributions
 shaving+0.018
 praying+0.018
 whist+0.017
 asking+0.016
 Potato+0.016
Neg logit contributions
tted-0.019
atha-0.014
fy-0.014
umbs-0.014
okingly-0.014
Token I
Token ID314
Feature activation-8.33e-03

Pos logit contributions
tted+0.016
umbs+0.013
bridge+0.013
quet+0.013
atha+0.013
Neg logit contributions
 shaving-0.015
 praying-0.015
 whist-0.015
 asking-0.013
 Potato-0.013
Token has
Token ID468
Feature activation+1.02e-03

Pos logit contributions
 shaving+0.007
 praying+0.007
 whist+0.006
 asking+0.006
 Potato+0.006
Neg logit contributions
tted-0.011
bridge-0.009
umbs-0.009
atha-0.009
adders-0.009
Token a
Token ID257
Feature activation+6.29e-03

Pos logit contributions
tted+0.002
quet+0.002
okingly+0.002
igsaw+0.002
atha+0.002
Neg logit contributions
 whist-0.001
 shaving-0.001
 praying-0.001
 asking-0.001
 Potato-0.001
Token long
Token ID890
Feature activation-2.74e-03

Pos logit contributions
tted+0.009
bridge+0.007
 frame+0.007
hell+0.006
ay+0.006
Neg logit contributions
 praying-0.012
 shaving-0.011
 whist-0.011
ise-0.011
 Brother-0.011
Token handle
Token ID5412
Feature activation-2.66e-03

Pos logit contributions
 praying+0.005
 shaving+0.004
 whist+0.004
 Potato+0.004
 Brother+0.004
Neg logit contributions
tted-0.004
bridge-0.004
hell-0.003
umbs-0.003
atters-0.003
Token on
Token ID319
Feature activation+1.28e-02

Pos logit contributions
 shaving+0.003
 praying+0.003
 whist+0.003
 asking+0.003
 Potato+0.003
Neg logit contributions
tted-0.006
umbs-0.005
bridge-0.005
quet-0.005
atha-0.005
Token the
Token ID262
Feature activation+2.10e-02

Pos logit contributions
tted+0.026
umbs+0.022
quet+0.022
atha+0.022
bridge+0.021
Neg logit contributions
 shaving-0.016
 whist-0.016
 praying-0.016
 asking-0.015
 Potato-0.014
Token side
Token ID1735
Feature activation+1.65e-02

Pos logit contributions
tted+0.034
umbs+0.026
bridge+0.025
adders+0.025
atha+0.025
Neg logit contributions
 praying-0.037
 whist-0.036
 shaving-0.036
 asking-0.032
 Potato-0.032
Token.
Token ID13
Feature activation+8.08e-03

Pos logit contributions
tted+0.033
igsaw+0.028
quet+0.028
atha+0.028
bridge+0.028
Neg logit contributions
 shaving-0.021
 praying-0.020
 whist-0.020
 asking-0.019
 smoking-0.018
Token
Token ID198
Feature activation-4.92e-03

Pos logit contributions
tted+0.019
atha+0.017
quet+0.017
fy+0.017
bridge+0.017
Neg logit contributions
 shaving-0.007
 whist-0.007
 praying-0.006
 asking-0.006
 Potato-0.006
Token
Token ID198
Feature activation+1.47e-02

Pos logit contributions
 praying+0.005
 shaving+0.005
 whist+0.005
 asking+0.004
 Potato+0.004
Neg logit contributions
tted-0.011
umbs-0.010
bridge-0.010
quet-0.010
atha-0.009
Token"
Token ID1
Feature activation+2.91e-02

Pos logit contributions
tted+0.028
igsaw+0.026
umbs+0.026
bridge+0.026
okingly+0.025
Neg logit contributions
 shaving-0.018
 whist-0.017
 asking-0.017
 praying-0.016
 smoking-0.016
Token off
Token ID572
Feature activation-1.17e-02

Pos logit contributions
tted+0.012
bridge+0.011
umbs+0.010
quet+0.010
okingly+0.010
Neg logit contributions
 praying-0.008
 whist-0.008
 shaving-0.008
 asking-0.007
 temperature-0.007
Token to
Token ID284
Feature activation+6.34e-03

Pos logit contributions
 praying+0.013
 whist+0.013
 shaving+0.013
 asking+0.012
 saying+0.010
Neg logit contributions
tted-0.026
bridge-0.023
umbs-0.023
atha-0.022
adders-0.022
Token play
Token ID711
Feature activation+1.09e-02

Pos logit contributions
tted+0.010
quet+0.008
atha+0.008
igsaw+0.008
bridge+0.008
Neg logit contributions
 shaving-0.010
 whist-0.010
 praying-0.010
 Potato-0.009
 Brother-0.009
Token.
Token ID13
Feature activation-1.90e-02

Pos logit contributions
tted+0.023
umbs+0.020
bridge+0.020
quet+0.020
atha+0.020
Neg logit contributions
 shaving-0.012
 whist-0.012
 praying-0.012
 asking-0.011
 Potato-0.010
Token Little
Token ID7703
Feature activation-1.03e-02

Pos logit contributions
 shaving+0.023
 whist+0.023
 praying+0.022
 asking+0.020
 Potato+0.019
Neg logit contributions
tted-0.040
umbs-0.034
quet-0.033
atha-0.033
bridge-0.033
Token girl
Token ID2576
Feature activation+8.54e-03

Pos logit contributions
 praying+0.014
 shaving+0.014
 whist+0.014
 asking+0.013
 Brother+0.012
Neg logit contributions
tted-0.020
umbs-0.017
bridge-0.017
quet-0.017
adders-0.017
Token reached
Token ID4251
Feature activation-2.86e-02

Pos logit contributions
tted+0.020
umbs+0.018
bridge+0.018
quet+0.018
atha+0.017
Neg logit contributions
 shaving-0.008
 praying-0.007
 whist-0.007
 asking-0.006
 Potato-0.006
Token out
Token ID503
Feature activation-2.26e-02

Pos logit contributions
 shaving+0.042
 praying+0.042
 whist+0.042
 asking+0.038
 Potato+0.037
Neg logit contributions
tted-0.052
umbs-0.043
bridge-0.043
quet-0.043
atha-0.042
Token to
Token ID284
Feature activation-6.31e-03

Pos logit contributions
 praying+0.028
 whist+0.027
 shaving+0.026
 asking+0.026
 saying+0.022
Neg logit contributions
tted-0.050
bridge-0.044
umbs-0.043
atha-0.042
atters-0.042
Token hug
Token ID16225
Feature activation-7.68e-03

Pos logit contributions
 shaving+0.010
 whist+0.010
 praying+0.010
 asking+0.009
 Potato+0.009
Neg logit contributions
tted-0.011
umbs-0.009
bridge-0.009
quet-0.009
hell-0.009
Token Mama
Token ID40084
Feature activation-1.07e-02

Pos logit contributions
 shaving+0.013
 whist+0.013
 praying+0.012
 Potato+0.011
 asking+0.011
Neg logit contributions
tted-0.011
quet-0.009
bridge-0.009
umbs-0.009
atha-0.009
Token future
Token ID2003
Feature activation-1.97e-03

Pos logit contributions
tted+0.001
bridge+0.001
quet+0.001
umbs+0.001
igsaw+0.001
Neg logit contributions
 shaving-0.001
 praying-0.001
 whist-0.001
 asking-0.001
 Potato-0.001
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
 shaving+0.003
 praying+0.003
 whist+0.003
 Potato+0.003
 Brother+0.002
Neg logit contributions
tted-0.003
quet-0.003
bridge-0.003
umbs-0.003
adders-0.003
Token You
Token ID921
Feature activation+2.69e-03

Pos logit contributions
tted+0.035
umbs+0.029
bridge+0.029
quet+0.029
atha+0.029
Neg logit contributions
 shaving-0.026
 praying-0.026
 whist-0.026
 asking-0.023
 Potato-0.023
Token will
Token ID481
Feature activation-5.47e-03

Pos logit contributions
tted+0.004
quet+0.004
umbs+0.004
bridge+0.004
igsaw+0.004
Neg logit contributions
 whist-0.005
 praying-0.004
 shaving-0.004
 asking-0.004
 Potato-0.004
Token be
Token ID307
Feature activation+3.94e-03

Pos logit contributions
 shaving+0.008
 praying+0.008
 whist+0.008
 asking+0.007
 Potato+0.007
Neg logit contributions
tted-0.010
umbs-0.008
quet-0.008
atha-0.008
bridge-0.008
Token a
Token ID257
Feature activation+2.90e-03

Pos logit contributions
tted+0.008
umbs+0.007
quet+0.007
igsaw+0.007
ives+0.007
Neg logit contributions
 whist-0.005
 praying-0.005
 shaving-0.005
 asking-0.005
 pap-0.005
Token great
Token ID1049
Feature activation+3.75e-03

Pos logit contributions
tted+0.004
quet+0.003
bridge+0.003
atha+0.003
fy+0.003
Neg logit contributions
 shaving-0.006
 praying-0.005
 whist-0.005
 asking-0.005
 smoking-0.005
Token veterinarian
Token ID41404
Feature activation-3.10e-03

Pos logit contributions
tted+0.007
quet+0.006
bridge+0.006
umbs+0.006
atha+0.006
Neg logit contributions
 shaving-0.005
 praying-0.005
 whist-0.005
 asking-0.005
 Potato-0.005
Token someday
Token ID25580
Feature activation+1.09e-02

Pos logit contributions
 shaving+0.004
 praying+0.004
 whist+0.004
 Potato+0.004
 asking+0.004
Neg logit contributions
tted-0.006
atha-0.005
bridge-0.005
quet-0.005
umbs-0.005
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
tted+0.016
umbs+0.013
quet+0.013
atha+0.013
adders+0.013
Neg logit contributions
 shaving-0.018
 whist-0.017
 praying-0.017
 razor-0.017
 pap-0.016
Token You
Token ID921
Feature activation+4.19e-03

Pos logit contributions
tted+0.024
umbs+0.019
bridge+0.019
quet+0.019
atha+0.019
Neg logit contributions
 shaving-0.021
 whist-0.020
 praying-0.020
 asking-0.018
 smoking-0.018
Token to
Token ID284
Feature activation-9.32e-04

Pos logit contributions
 shaving+0.044
 praying+0.043
 whist+0.043
 asking+0.038
 Potato+0.038
Neg logit contributions
tted-0.058
bridge-0.047
umbs-0.047
atha-0.047
quet-0.047
Token the
Token ID262
Feature activation-5.05e-03

Pos logit contributions
 shaving+0.001
 praying+0.001
 whist+0.001
 Potato+0.001
 asking+0.001
Neg logit contributions
tted-0.002
bridge-0.001
atha-0.001
umbs-0.001
quet-0.001
Token sky
Token ID6766
Feature activation+2.23e-03

Pos logit contributions
 shaving+0.007
 praying+0.007
 whist+0.007
 Potato+0.006
 asking+0.006
Neg logit contributions
tted-0.010
bridge-0.008
umbs-0.008
atha-0.008
quet-0.008
Token.
Token ID13
Feature activation-7.29e-03

Pos logit contributions
tted+0.005
umbs+0.004
bridge+0.004
quet+0.004
atha+0.004
Neg logit contributions
 shaving-0.002
 praying-0.002
 whist-0.002
 asking-0.002
 Potato-0.002
Token
Token ID198
Feature activation-8.30e-03

Pos logit contributions
 whist+0.009
 shaving+0.009
 praying+0.009
ise+0.008
 temperature+0.008
Neg logit contributions
tted-0.015
 Sue-0.012
 Pam-0.012
There-0.012
adders-0.012
Token
Token ID198
Feature activation+5.22e-03

Pos logit contributions
 praying+0.012
 whist+0.012
 shaving+0.012
 Potato+0.012
 pap+0.012
Neg logit contributions
tted-0.015
atters-0.012
adders-0.012
ay-0.012
ollen-0.012
TokenMom
Token ID29252
Feature activation+7.01e-03

Pos logit contributions
tted+0.009
atha+0.007
quet+0.007
bridge+0.007
umbs+0.007
Neg logit contributions
 whist-0.009
 shaving-0.009
 praying-0.009
 Potato-0.008
 asking-0.008
Token looked
Token ID3114
Feature activation-5.83e-03

Pos logit contributions
tted+0.013
bridge+0.011
atha+0.011
umbs+0.010
aney+0.010
Neg logit contributions
 shaving-0.010
 praying-0.010
 Potato-0.009
 whist-0.009
 asking-0.008
Token up
Token ID510
Feature activation-1.48e-02

Pos logit contributions
 shaving+0.010
 praying+0.010
 Potato+0.010
 whist+0.010
 pap+0.009
Neg logit contributions
tted-0.008
fy-0.006
adders-0.006
okingly-0.006
igator-0.006
Token.
Token ID13
Feature activation+2.92e-03

Pos logit contributions
 shaving+0.018
 praying+0.018
 whist+0.018
 asking+0.016
 Potato+0.015
Neg logit contributions
tted-0.031
umbs-0.026
quet-0.026
bridge-0.026
atha-0.025
Token She
Token ID1375
Feature activation-1.42e-03

Pos logit contributions
tted+0.005
adders+0.003
bridge+0.003
umbs+0.003
atha+0.003
Neg logit contributions
 whist-0.005
 shaving-0.005
 praying-0.005
 asking-0.005
 Potato-0.005
Token people
Token ID661
Feature activation+1.42e-02

Pos logit contributions
 whist+0.010
 shaving+0.010
 praying+0.010
 asking+0.009
 Brother+0.008
Neg logit contributions
tted-0.020
umbs-0.017
quet-0.017
bridge-0.017
atha-0.017
Token who
Token ID508
Feature activation-1.06e-02

Pos logit contributions
tted+0.027
umbs+0.023
bridge+0.022
quet+0.022
atha+0.022
Neg logit contributions
 shaving-0.020
 praying-0.020
 whist-0.019
 asking-0.018
 Potato-0.017
Token got
Token ID1392
Feature activation-2.45e-03

Pos logit contributions
 shaving+0.014
 praying+0.014
 whist+0.014
 asking+0.012
 Potato+0.012
Neg logit contributions
tted-0.021
umbs-0.018
bridge-0.017
quet-0.017
atha-0.017
Token on
Token ID319
Feature activation+4.49e-03

Pos logit contributions
 whist+0.003
 shaving+0.003
 praying+0.003
 asking+0.003
 smoking+0.003
Neg logit contributions
tted-0.005
umbs-0.004
quet-0.004
bridge-0.004
igsaw-0.004
Token and
Token ID290
Feature activation+1.15e-03

Pos logit contributions
tted+0.008
bridge+0.007
quet+0.007
umbs+0.007
atha+0.007
Neg logit contributions
 shaving-0.006
 praying-0.006
 whist-0.006
 Potato-0.006
 asking-0.005
Token off
Token ID572
Feature activation-7.28e-03

Pos logit contributions
tted+0.002
umbs+0.002
quet+0.002
bridge+0.002
atha+0.002
Neg logit contributions
 whist-0.001
 praying-0.001
 shaving-0.001
 asking-0.001
 Potato-0.001
Token.
Token ID13
Feature activation-2.88e-03

Pos logit contributions
 whist+0.009
 shaving+0.009
 praying+0.009
 asking+0.008
 Potato+0.007
Neg logit contributions
tted-0.015
umbs-0.013
quet-0.013
bridge-0.013
atha-0.013
Token He
Token ID679
Feature activation-8.49e-03

Pos logit contributions
 shaving+0.005
 praying+0.005
 whist+0.005
 asking+0.004
 Potato+0.004
Neg logit contributions
tted-0.005
umbs-0.004
bridge-0.004
quet-0.004
atha-0.004
Token saw
Token ID2497
Feature activation-5.03e-03

Pos logit contributions
 shaving+0.010
 praying+0.010
 Potato+0.009
 whist+0.009
 smoking+0.008
Neg logit contributions
tted-0.018
atha-0.015
bridge-0.015
fy-0.014
umbs-0.014
Token trees
Token ID7150
Feature activation+6.73e-03

Pos logit contributions
 shaving+0.007
 whist+0.007
 praying+0.007
 asking+0.006
 Potato+0.006
Neg logit contributions
tted-0.010
umbs-0.008
quet-0.008
bridge-0.008
atha-0.008
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
tted+0.015
atha+0.013
umbs+0.013
bridge+0.013
okingly+0.013
Neg logit contributions
 whist-0.007
 praying-0.007
 shaving-0.007
 asking-0.006
 smoking-0.006
Token up
Token ID510
Feature activation-6.87e-03

Pos logit contributions
tted+0.028
okingly+0.025
quet+0.025
atha+0.024
atters+0.024
Neg logit contributions
 shaving-0.029
 whist-0.027
 praying-0.027
 Potato-0.026
 pap-0.025
Token from
Token ID422
Feature activation+4.67e-03

Pos logit contributions
 whist+0.007
 shaving+0.006
 praying+0.006
 asking+0.006
 Potato+0.005
Neg logit contributions
tted-0.017
bridge-0.015
umbs-0.015
adders-0.015
atha-0.015
Token her
Token ID607
Feature activation+1.59e-02

Pos logit contributions
tted+0.010
umbs+0.009
bridge+0.008
quet+0.008
atha+0.008
Neg logit contributions
 whist-0.006
 praying-0.006
 shaving-0.006
 asking-0.005
 smoking-0.005
Token nap
Token ID25422
Feature activation-4.29e-03

Pos logit contributions
tted+0.031
umbs+0.026
bridge+0.026
quet+0.026
atha+0.026
Neg logit contributions
 shaving-0.021
 praying-0.021
 whist-0.021
 asking-0.019
 Potato-0.018
Token,
Token ID11
Feature activation+5.37e-03

Pos logit contributions
 shaving+0.005
 praying+0.005
 whist+0.005
 asking+0.004
 Potato+0.004
Neg logit contributions
tted-0.009
umbs-0.008
bridge-0.008
atha-0.008
quet-0.008
Token she
Token ID673
Feature activation-3.56e-04

Pos logit contributions
tted+0.010
umbs+0.008
quet+0.008
atha+0.008
bridge+0.008
Neg logit contributions
 whist-0.008
 praying-0.008
 shaving-0.008
 asking-0.007
 Potato-0.007
Token saw
Token ID2497
Feature activation-2.96e-03

Pos logit contributions
 shaving+0.001
 praying+0.001
 Potato+0.000
 grandson+0.000
 Brother+0.000
Neg logit contributions
tted-0.001
atha-0.000
Tweet-0.000
bridge-0.000
hell-0.000
Token the
Token ID262
Feature activation-8.50e-04

Pos logit contributions
 shaving+0.004
 praying+0.004
 whist+0.004
 asking+0.004
 Potato+0.004
Neg logit contributions
tted-0.006
umbs-0.005
bridge-0.005
quet-0.005
atha-0.004
Token dog
Token ID3290
Feature activation-1.75e-02

Pos logit contributions
 praying+0.001
 shaving+0.001
 whist+0.001
 asking+0.001
 Potato+0.001
Neg logit contributions
tted-0.001
umbs-0.001
bridge-0.001
hell-0.001
quet-0.001
Token.
Token ID13
Feature activation+9.87e-05

Pos logit contributions
 shaving+0.021
 praying+0.020
 whist+0.020
 Potato+0.019
 Brother+0.018
Neg logit contributions
tted-0.036
umbs-0.030
hell-0.030
quet-0.030
bridge-0.030
Token She
Token ID1375
Feature activation+7.66e-03

Pos logit contributions
tted+0.000
umbs+0.000
quet+0.000
bridge+0.000
atha+0.000
Neg logit contributions
 shaving-0.000
 praying-0.000
 whist-0.000
 asking-0.000
 Potato-0.000
Token felt
Token ID2936
Feature activation+3.11e-02

Pos logit contributions
tted+0.002
quet+0.002
igsaw+0.002
bridge+0.002
umbs+0.002
Neg logit contributions
 whist-0.002
 shaving-0.002
 praying-0.002
 asking-0.001
 smoking-0.001
Token weak
Token ID4939
Feature activation-3.50e-04

Pos logit contributions
tted+0.055
umbs+0.045
bridge+0.045
quet+0.045
atha+0.044
Neg logit contributions
 shaving-0.047
 praying-0.047
 whist-0.047
 asking-0.043
 Potato-0.042
Token and
Token ID290
Feature activation+9.08e-03

Pos logit contributions
 shaving+0.000
 praying+0.000
 whist+0.000
 Potato+0.000
 pap+0.000
Neg logit contributions
tted-0.001
umbs-0.001
bridge-0.001
hell-0.001
quet-0.001
Token she
Token ID673
Feature activation+3.12e-03

Pos logit contributions
tted+0.017
umbs+0.015
quet+0.015
bridge+0.014
atha+0.014
Neg logit contributions
 whist-0.013
 shaving-0.013
 praying-0.013
 asking-0.012
 Brother-0.011
Token felt
Token ID2936
Feature activation+2.80e-02

Pos logit contributions
tted+0.006
umbs+0.005
bridge+0.005
atha+0.005
quet+0.005
Neg logit contributions
 praying-0.004
 shaving-0.004
 whist-0.004
 asking-0.003
 Potato-0.003
Token as
Token ID355
Feature activation+7.07e-04

Pos logit contributions
tted+0.053
umbs+0.044
bridge+0.044
quet+0.044
atha+0.043
Neg logit contributions
 shaving-0.039
 praying-0.039
 whist-0.039
 asking-0.035
 Potato-0.034
Token though
Token ID996
Feature activation-1.00e-02

Pos logit contributions
tted+0.001
umbs+0.001
quet+0.001
atters+0.001
igsaw+0.001
Neg logit contributions
 whist-0.001
 praying-0.001
 shaving-0.001
 asking-0.001
 pap-0.001
Token she
Token ID673
Feature activation+1.31e-02

Pos logit contributions
 shaving+0.016
 praying+0.016
 whist+0.016
 asking+0.015
 Potato+0.015
Neg logit contributions
tted-0.017
umbs-0.014
bridge-0.013
quet-0.013
atha-0.013
Token would
Token ID561
Feature activation+5.92e-03

Pos logit contributions
tted+0.026
atha+0.021
umbs+0.021
bridge+0.021
hell+0.020
Neg logit contributions
 praying-0.018
 shaving-0.018
 whist-0.016
 asking-0.016
 Potato-0.015
Token stop
Token ID2245
Feature activation-2.02e-02

Pos logit contributions
tted+0.010
umbs+0.008
bridge+0.008
quet+0.008
atha+0.008
Neg logit contributions
 whist-0.010
 shaving-0.009
 praying-0.009
 asking-0.009
 Potato-0.008
Token running
Token ID2491
Feature activation+1.53e-02

Pos logit contributions
 shaving+0.024
 praying+0.024
 whist+0.023
 grandson+0.022
 Potato+0.022
Neg logit contributions
tted-0.037
drops-0.031
 Pam-0.031
quet-0.031
 chains-0.031
Token �
Token ID6184
Feature activation+1.48e-02

Pos logit contributions
 shaving+0.034
 praying+0.030
 whist+0.029
 temperature+0.029
 razor+0.028
Neg logit contributions
tted-0.062
 Sue-0.056
 Pam-0.055
quet-0.054
bridge-0.053
Token�
Token ID95
Feature activation+1.60e-02

Pos logit contributions
tted+0.025
umbs+0.020
bridge+0.020
quet+0.020
atha+0.019
Neg logit contributions
 shaving-0.024
 praying-0.024
 whist-0.024
 asking-0.022
 Potato-0.022
Token€
Token ID26391
Feature activation-1.62e-02

Pos logit contributions
tted+0.030
atha+0.024
bridge+0.023
quet+0.023
igsaw+0.023
Neg logit contributions
 praying-0.025
 whist-0.024
 shaving-0.024
 asking-0.022
 pap-0.021
Token�
Token ID129
Feature activation+2.05e-02

Pos logit contributions
 shaving+0.014
 whist+0.014
 praying+0.014
 asking+0.012
 Potato+0.011
Neg logit contributions
tted-0.039
quet-0.034
umbs-0.034
bridge-0.034
atha-0.033
Token�
Token ID241
Feature activation-1.46e-02

Pos logit contributions
tted+0.036
atha+0.029
atters+0.028
igsaw+0.028
quet+0.027
Neg logit contributions
 praying-0.033
 whist-0.032
 shaving-0.031
 pap-0.029
 Potato-0.028
TokenIt
Token ID1026
Feature activation-2.69e-02

Pos logit contributions
 praying+0.018
 shaving+0.017
 whist+0.017
 asking+0.015
 pap+0.015
Neg logit contributions
tted-0.031
bridge-0.026
umbs-0.026
atha-0.025
quet-0.025
Token's
Token ID338
Feature activation-1.39e-02

Pos logit contributions
 shaving+0.021
 praying+0.020
 whist+0.019
 asking+0.016
 Potato+0.015
Neg logit contributions
tted-0.068
bridge-0.059
atha-0.058
umbs-0.058
quet-0.058
Token under
Token ID739
Feature activation-1.47e-03

Pos logit contributions
 shaving+0.021
 whist+0.020
 praying+0.020
 Potato+0.019
 razor+0.018
Neg logit contributions
tted-0.023
bridge-0.019
atha-0.018
adders-0.018
fy-0.018
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
 shaving+0.002
 whist+0.002
 praying+0.002
 asking+0.002
 Potato+0.001
Neg logit contributions
tted-0.003
umbs-0.003
quet-0.003
atha-0.003
bridge-0.003
Token ground
Token ID2323
Feature activation+4.46e-03

Pos logit contributions
tted+0.032
umbs+0.027
atha+0.027
okingly+0.027
quet+0.027
Neg logit contributions
 shaving-0.020
 whist-0.019
 praying-0.019
 asking-0.017
 Potato-0.017
Token now
Token ID783
Feature activation+1.44e-02

Pos logit contributions
tted+0.009
umbs+0.008
bridge+0.008
quet+0.008
atha+0.008
Neg logit contributions
 shaving-0.005
 praying-0.005
 whist-0.005
 asking-0.004
 Potato-0.004
Token seed
Token ID9403
Feature activation-3.03e-02

Pos logit contributions
 praying+0.038
 shaving+0.037
 whist+0.037
 asking+0.035
 pap+0.033
Neg logit contributions
tted-0.034
quet-0.028
bridge-0.028
umbs-0.028
hell-0.027
Token and
Token ID290
Feature activation+3.50e-03

Pos logit contributions
 shaving+0.047
 praying+0.046
 whist+0.045
 grandson+0.044
 Brother+0.043
Neg logit contributions
tted-0.050
quet-0.042
bridge-0.041
umbs-0.041
hell-0.040
Token he
Token ID339
Feature activation-2.05e-02

Pos logit contributions
tted+0.007
bridge+0.006
quet+0.006
atha+0.006
umbs+0.006
Neg logit contributions
 shaving-0.004
 praying-0.004
 whist-0.004
 Potato-0.004
 Brother-0.004
Token thought
Token ID1807
Feature activation+3.09e-03

Pos logit contributions
 shaving+0.026
 praying+0.026
 whist+0.024
 Potato+0.023
 Brother+0.023
Neg logit contributions
tted-0.041
atha-0.034
bridge-0.034
quet-0.034
hell-0.034
Token it
Token ID340
Feature activation-3.08e-02

Pos logit contributions
tted+0.006
bridge+0.005
umbs+0.005
quet+0.005
atha+0.005
Neg logit contributions
 shaving-0.004
 praying-0.004
 whist-0.004
 asking-0.004
 Potato-0.003
Token was
Token ID373
Feature activation-2.37e-03

Pos logit contributions
 shaving+0.038
 praying+0.035
 Brother+0.034
 grandson+0.034
 whist+0.033
Neg logit contributions
tted-0.064
hell-0.053
bridge-0.053
atha-0.052
quet-0.051
Token boring
Token ID14262
Feature activation+1.07e-03

Pos logit contributions
 shaving+0.004
 whist+0.003
 praying+0.003
 asking+0.003
 Potato+0.003
Neg logit contributions
tted-0.004
bridge-0.003
umbs-0.003
quet-0.003
atha-0.003
Token.
Token ID13
Feature activation-5.91e-03

Pos logit contributions
tted+0.002
umbs+0.002
bridge+0.002
quet+0.002
atha+0.002
Neg logit contributions
 shaving-0.001
 praying-0.001
 whist-0.001
 asking-0.001
 Potato-0.001
Token But
Token ID887
Feature activation-5.10e-03

Pos logit contributions
 praying+0.010
 whist+0.010
 shaving+0.010
 pap+0.009
 temperature+0.009
Neg logit contributions
tted-0.009
okingly-0.008
igsaw-0.007
atha-0.007
 Pam-0.007
Token his
Token ID465
Feature activation-6.16e-03

Pos logit contributions
 shaving+0.008
 praying+0.007
 whist+0.007
 Potato+0.007
 temperature+0.006
Neg logit contributions
tted-0.008
 Pam-0.007
okingly-0.007
 Sue-0.007
igator-0.007
Token mom
Token ID1995
Feature activation+1.88e-02

Pos logit contributions
 shaving+0.007
 whist+0.007
 praying+0.007
 asking+0.006
 Potato+0.006
Neg logit contributions
tted-0.013
atha-0.011
quet-0.011
umbs-0.011
igsaw-0.011
Token spicy
Token ID26880
Feature activation-2.88e-02

Pos logit contributions
 praying+0.008
 shaving+0.008
 whist+0.008
 asking+0.007
 Potato+0.007
Neg logit contributions
tted-0.008
umbs-0.007
quet-0.006
bridge-0.006
hell-0.006
Token sauce
Token ID10746
Feature activation-1.88e-02

Pos logit contributions
 praying+0.043
 shaving+0.041
 whist+0.041
 grandson+0.039
 cinema+0.039
Neg logit contributions
tted-0.044
umbs-0.042
quet-0.039
hell-0.039
ives-0.038
Token.
Token ID13
Feature activation-1.82e-04

Pos logit contributions
 shaving+0.018
 praying+0.018
 whist+0.018
 asking+0.016
 Potato+0.015
Neg logit contributions
tted-0.044
atha-0.038
bridge-0.037
quet-0.037
igsaw-0.037
Token The
Token ID383
Feature activation-9.68e-03

Pos logit contributions
 shaving+0.000
 praying+0.000
 whist+0.000
 asking+0.000
 Brother+0.000
Neg logit contributions
tted-0.000
bridge-0.000
atha-0.000
hell-0.000
quet-0.000
Token caul
Token ID46087
Feature activation-5.28e-02

Pos logit contributions
 shaving+0.012
 yelling+0.012
 praying+0.012
 whist+0.011
ning+0.011
Neg logit contributions
quet-0.014
 queens-0.014
 frame-0.014
hire-0.014
tted-0.014
Tokeniflower
Token ID42642
Feature activation-3.80e-02

Pos logit contributions
 grandson+0.070
 shaving+0.069
 whist+0.067
 praying+0.067
 Brother+0.065
Neg logit contributions
tted-0.091
quet-0.089
umbs-0.083
hell-0.083
bridge-0.080
Token was
Token ID373
Feature activation-1.14e-02

Pos logit contributions
 shaving+0.039
 praying+0.038
 whist+0.036
 asking+0.033
 Potato+0.032
Neg logit contributions
tted-0.085
bridge-0.073
umbs-0.072
quet-0.072
hell-0.070
Token so
Token ID523
Feature activation-1.31e-02

Pos logit contributions
 shaving+0.018
 praying+0.018
 whist+0.018
 Potato+0.016
 asking+0.016
Neg logit contributions
tted-0.019
umbs-0.016
bridge-0.015
quet-0.015
hell-0.015
Token happy
Token ID3772
Feature activation+1.16e-02

Pos logit contributions
 shaving+0.024
 praying+0.024
 whist+0.024
 asking+0.022
 Potato+0.022
Neg logit contributions
tted-0.019
umbs-0.015
quet-0.015
bridge-0.015
hell-0.015
Token,
Token ID11
Feature activation-9.29e-03

Pos logit contributions
tted+0.025
umbs+0.021
bridge+0.021
atha+0.021
quet+0.021
Neg logit contributions
 whist-0.014
 praying-0.014
 shaving-0.014
 asking-0.012
 Potato-0.011
Token it
Token ID340
Feature activation-2.68e-02

Pos logit contributions
 whist+0.013
 asking+0.012
 praying+0.012
 saying+0.012
 being+0.011
Neg logit contributions
tted-0.021
adders-0.017
aved-0.017
umbs-0.017
atha-0.017
Token.
Token ID13
Feature activation-5.41e-03

Pos logit contributions
 shaving+0.011
 praying+0.010
 whist+0.010
 Potato+0.009
 asking+0.009
Neg logit contributions
tted-0.019
umbs-0.016
bridge-0.016
quet-0.016
atha-0.015
Token He
Token ID679
Feature activation-6.92e-03

Pos logit contributions
 praying+0.008
 shaving+0.008
 whist+0.008
 asking+0.007
 Potato+0.007
Neg logit contributions
tted-0.009
umbs-0.008
bridge-0.008
quet-0.007
atha-0.007
Token saw
Token ID2497
Feature activation-4.48e-03

Pos logit contributions
 shaving+0.011
 praying+0.010
 Potato+0.009
 temperature+0.008
 Brother+0.008
Neg logit contributions
tted-0.013
atha-0.010
bridge-0.010
fy-0.010
Tweet-0.009
Token that
Token ID326
Feature activation-1.63e-02

Pos logit contributions
 shaving+0.006
 praying+0.006
 whist+0.006
 asking+0.005
 Potato+0.005
Neg logit contributions
tted-0.009
bridge-0.007
umbs-0.007
quet-0.007
atha-0.007
Token the
Token ID262
Feature activation-8.37e-03

Pos logit contributions
 shaving+0.024
 praying+0.023
 whist+0.022
 asking+0.021
 Potato+0.021
Neg logit contributions
tted-0.028
bridge-0.024
hell-0.023
umbs-0.023
adders-0.023
Token salt
Token ID8268
Feature activation-5.11e-02

Pos logit contributions
 praying+0.014
 shaving+0.012
 whist+0.012
 asking+0.012
ica+0.012
Neg logit contributions
tted-0.013
bridge-0.011
 chains-0.011
drops-0.010
hell-0.010
Token sh
Token ID427
Feature activation-2.87e-03

Pos logit contributions
 shaving+0.050
 praying+0.050
 whist+0.049
 asking+0.043
 grandson+0.042
Neg logit contributions
tted-0.117
bridge-0.101
umbs-0.100
quet-0.098
hell-0.098
Tokenaker
Token ID3110
Feature activation-2.92e-02

Pos logit contributions
 praying+0.005
 shaving+0.004
 whist+0.004
 asking+0.004
 grandson+0.004
Neg logit contributions
tted-0.005
umbs-0.004
bridge-0.004
quet-0.004
hell-0.004
Token was
Token ID373
Feature activation-7.75e-03

Pos logit contributions
 shaving+0.026
 praying+0.026
 whist+0.025
 asking+0.022
 Potato+0.021
Neg logit contributions
tted-0.069
bridge-0.059
umbs-0.059
quet-0.058
hell-0.058
Token open
Token ID1280
Feature activation+9.61e-03

Pos logit contributions
 praying+0.011
 shaving+0.011
 whist+0.011
 Potato+0.010
 grandson+0.010
Neg logit contributions
tted-0.014
bridge-0.011
umbs-0.011
quet-0.011
hell-0.011
Token and
Token ID290
Feature activation-9.41e-03

Pos logit contributions
tted+0.020
umbs+0.017
bridge+0.017
quet+0.017
atha+0.017
Neg logit contributions
 shaving-0.011
 praying-0.011
 whist-0.011
 asking-0.010
 Potato-0.010
Token bird
Token ID6512
Feature activation+4.34e-03

Pos logit contributions
tted+0.000
umbs+0.000
bridge+0.000
quet+0.000
atha+0.000
Neg logit contributions
 shaving-0.000
 praying-0.000
 whist-0.000
 asking-0.000
 Potato-0.000
Token,
Token ID11
Feature activation-1.34e-03

Pos logit contributions
tted+0.011
bridge+0.009
umbs+0.009
igsaw+0.009
atha+0.009
Neg logit contributions
 whist-0.004
 praying-0.004
 shaving-0.004
 asking-0.004
 smoking-0.003
Token so
Token ID523
Feature activation+1.80e-03

Pos logit contributions
 shaving+0.002
 praying+0.002
 whist+0.002
 Potato+0.002
 asking+0.002
Neg logit contributions
tted-0.002
umbs-0.002
bridge-0.002
hell-0.002
quet-0.002
Token he
Token ID339
Feature activation-1.21e-02

Pos logit contributions
tted+0.003
bridge+0.003
quet+0.003
umbs+0.003
hell+0.003
Neg logit contributions
 shaving-0.002
 praying-0.002
 whist-0.002
 Potato-0.002
 asking-0.002
Token ran
Token ID4966
Feature activation+1.70e-02

Pos logit contributions
 shaving+0.018
 praying+0.017
 whist+0.017
 asking+0.016
 Potato+0.015
Neg logit contributions
tted-0.022
umbs-0.018
bridge-0.018
quet-0.018
atha-0.018
Token inside
Token ID2641
Feature activation-3.85e-02

Pos logit contributions
bridge+0.038
tted+0.038
umbs+0.037
quet+0.037
aney+0.035
Neg logit contributions
 whist-0.016
 asking-0.016
 shaving-0.015
 temperature-0.013
 praying-0.013
Token to
Token ID284
Feature activation-5.66e-03

Pos logit contributions
 shaving+0.066
 Potato+0.066
 praying+0.066
 pap+0.065
 whist+0.064
Neg logit contributions
 of-0.051
tted-0.047
 Sue-0.044
ill-0.041
 free-0.041
Token find
Token ID1064
Feature activation-1.39e-02

Pos logit contributions
 shaving+0.009
 praying+0.009
 whist+0.009
 asking+0.008
 Potato+0.008
Neg logit contributions
tted-0.010
umbs-0.008
bridge-0.008
quet-0.008
atha-0.008
Token his
Token ID465
Feature activation-8.27e-03

Pos logit contributions
 shaving+0.019
 praying+0.019
 whist+0.019
 Potato+0.017
 asking+0.017
Neg logit contributions
tted-0.026
bridge-0.021
quet-0.021
umbs-0.021
adders-0.021
Token mom
Token ID1995
Feature activation+1.58e-03

Pos logit contributions
 shaving+0.014
 praying+0.013
 whist+0.013
 Potato+0.013
 Brother+0.013
Neg logit contributions
tted-0.012
 frame-0.010
quet-0.010
 chains-0.010
umbs-0.010
Token.
Token ID13
Feature activation-4.60e-03

Pos logit contributions
tted+0.003
umbs+0.002
atha+0.002
hell+0.002
fy+0.002
Neg logit contributions
 shaving-0.002
 whist-0.002
 praying-0.002
 Potato-0.002
 pap-0.002
Token funny
Token ID8258
Feature activation-2.07e-02

Pos logit contributions
tted+0.014
bridge+0.011
quet+0.011
atha+0.011
igsaw+0.011
Neg logit contributions
 praying-0.013
 shaving-0.013
 whist-0.013
 asking-0.012
 Potato-0.012
Token hat
Token ID6877
Feature activation-3.56e-03

Pos logit contributions
 praying+0.029
 shaving+0.029
 whist+0.028
 Brother+0.027
 Potato+0.026
Neg logit contributions
tted-0.036
bridge-0.030
igsaw-0.030
hell-0.030
atha-0.029
Token.
Token ID13
Feature activation-1.32e-03

Pos logit contributions
 shaving+0.004
 praying+0.004
 whist+0.004
 Potato+0.004
 Brother+0.004
Neg logit contributions
tted-0.007
bridge-0.006
umbs-0.006
quet-0.006
atha-0.006
Token He
Token ID679
Feature activation-1.58e-02

Pos logit contributions
 praying+0.002
 whist+0.002
 shaving+0.002
ise+0.002
 temperature+0.002
Neg logit contributions
tted-0.002
 Sue-0.002
umbs-0.002
 Pam-0.002
igsaw-0.002
Token said
Token ID531
Feature activation+1.59e-02

Pos logit contributions
 shaving+0.023
 praying+0.022
 temperature+0.022
 cinema+0.021
 Potato+0.020
Neg logit contributions
tted-0.026
Tweet-0.023
atha-0.022
 Sue-0.022
 Pam-0.022
Token,
Token ID11
Feature activation-4.64e-02

Pos logit contributions
tted+0.031
bridge+0.026
quet+0.025
hell+0.025
umbs+0.025
Neg logit contributions
 shaving-0.021
 praying-0.021
 whist-0.020
 asking-0.019
 Potato-0.018
Token "
Token ID366
Feature activation+5.59e-03

Pos logit contributions
 temperature+0.066
 cinema+0.066
 shaving+0.064
pa+0.063
ise+0.061
Neg logit contributions
tted-0.068
 Sue-0.062
 Pam-0.057
 free-0.057
 of-0.055
TokenI
Token ID40
Feature activation-1.73e-02

Pos logit contributions
tted+0.011
adders+0.010
bridge+0.009
umbs+0.009
igsaw+0.009
Neg logit contributions
 whist-0.007
 shaving-0.007
 praying-0.007
 asking-0.006
 Potato-0.006
Token just
Token ID655
Feature activation-3.12e-02

Pos logit contributions
 whist+0.026
 shaving+0.024
 praying+0.024
 asking+0.022
 Potato+0.021
Neg logit contributions
tted-0.031
quet-0.028
umbs-0.028
bridge-0.027
igsaw-0.026
Token wanted
Token ID2227
Feature activation-6.40e-03

Pos logit contributions
 shaving+0.052
 praying+0.052
 whist+0.052
 asking+0.048
 Potato+0.046
Neg logit contributions
tted-0.051
umbs-0.041
bridge-0.040
quet-0.040
atha-0.040
Token to
Token ID284
Feature activation+2.23e-03

Pos logit contributions
 shaving+0.008
 praying+0.008
 whist+0.008
 asking+0.007
 Potato+0.007
Neg logit contributions
tted-0.013
umbs-0.011
bridge-0.011
quet-0.011
atha-0.010
Token day
Token ID1110
Feature activation-1.18e-02

Pos logit contributions
 shaving+0.003
 praying+0.003
 whist+0.003
 asking+0.003
 Potato+0.002
Neg logit contributions
tted-0.004
umbs-0.003
bridge-0.003
quet-0.003
atha-0.003
Token,
Token ID11
Feature activation+1.10e-02

Pos logit contributions
 shaving+0.015
 praying+0.015
 whist+0.015
 asking+0.013
 Potato+0.013
Neg logit contributions
tted-0.023
umbs-0.019
quet-0.019
bridge-0.019
atha-0.019
Token Cic
Token ID37212
Feature activation-3.26e-02

Pos logit contributions
tted+0.022
bridge+0.019
umbs+0.017
atters+0.017
atha+0.017
Neg logit contributions
 praying-0.016
 whist-0.016
 shaving-0.016
 pap-0.015
 Potato-0.015
Tokeni
Token ID72
Feature activation-1.34e-02

Pos logit contributions
 shaving+0.055
 praying+0.051
 whist+0.050
 smoking+0.047
 grandson+0.046
Neg logit contributions
tted-0.045
atha-0.041
okingly-0.041
als-0.041
aney-0.040
Token heard
Token ID2982
Feature activation-4.49e-03

Pos logit contributions
 shaving+0.014
 praying+0.014
 whist+0.014
 asking+0.012
 Potato+0.012
Neg logit contributions
tted-0.030
umbs-0.025
atha-0.025
bridge-0.025
quet-0.025
Token about
Token ID546
Feature activation-5.54e-02

Pos logit contributions
 shaving+0.006
 praying+0.006
 whist+0.006
 asking+0.005
 Potato+0.005
Neg logit contributions
tted-0.009
umbs-0.008
bridge-0.008
quet-0.008
atha-0.007
Token a
Token ID257
Feature activation-1.16e-03

Pos logit contributions
 shaving+0.090
 Brother+0.089
 Potato+0.087
 whist+0.083
 Man+0.083
Neg logit contributions
tted-0.070
 Sue-0.064
 frame-0.064
 chains-0.063
 queens-0.062
Token big
Token ID1263
Feature activation+2.48e-03

Pos logit contributions
 shaving+0.002
 praying+0.002
 whist+0.002
 asking+0.002
 Brother+0.002
Neg logit contributions
tted-0.002
quet-0.001
igsaw-0.001
 frame-0.001
atha-0.001
Token tornado
Token ID33718
Feature activation-1.33e-02

Pos logit contributions
tted+0.004
igsaw+0.003
quet+0.003
bridge+0.003
 frame+0.003
Neg logit contributions
 shaving-0.004
 whist-0.004
 praying-0.004
 asking-0.003
 Potato-0.003
Token coming
Token ID2406
Feature activation-1.04e-02

Pos logit contributions
 shaving+0.019
 praying+0.018
 whist+0.018
 Brother+0.017
 Potato+0.017
Neg logit contributions
tted-0.024
quet-0.021
bridge-0.020
umbs-0.020
atha-0.019
Token nearby
Token ID6716
Feature activation-5.20e-03

Pos logit contributions
 praying+0.014
 whist+0.014
 shaving+0.014
 asking+0.013
 Potato+0.012
Neg logit contributions
tted-0.021
umbs-0.018
bridge-0.018
quet-0.017
igsaw-0.017