TOP ACTIVATIONS
MAX = 0.073

She also picked some green seaw eed to show her friends
ugg led up in some seaw eed and felt very cozy
see shells , rocks , seaw eed , crabs and fish
necklace . They found some seaw eed to use as string
home . He grabbed some seaw eed and started to build
together . They played hop sc ot ch and picked more
He also saw some tasty seaw eed and decided to give
the different fish and colorful seaw eed below .
sea . She took some seaw eed , made a big
. It was a sun sh iny day and there were
. Barry enjoyed playing hop sc ot ch with them so
cane , the river became shall ower and shall ower until
of shells , rocks and seaw eed , but Jane was
beach was full of messy seaw eed . The girl tried
love . They played hop sc ot ch , jumped rope
He found it wrapped in seaw eed and brought it back
They started to play hop sc ot ch and had lots
shore , collecting shells and seaw eed . Jimmy took time
started to chew on some seaw eed . Mary was so
found it stuck in some seaw eed . Tim was so

MIN ACTIVATIONS
MIN = -0.079

noises like " who osh " and " be ep "
when they want a new hairst yle or when their hair
What are we going to do ? ”
there anything else I can do for you ?" she asked
" and " be ep " and wear helmets made of
to go play at the gym ?" Jack said
to join us for a class ? ”
what are we going to do ?" His friend replied ,
me some jelly from the store ?" Jane said
you help me lift the drawer ?" Her mom
little boy who wasn 't feeling very well . He was
with the word " news " on it . Tim was
ila and Ben 's Club " and hung it on the
day , Emily wasn 't feeling very well . So her
Why are you tearing the table ?" asked Tim .
words like " ele phant " and " rain bow ".
wanted to have a cool hairst yle , but he didn
choose a mineral from my bag ?" L ily
love you , Grand ma " with big letters . They
Can we go to the stable and show him to the

INTERVAL 0.035 - 0.073
CONTAINS 0.089%

making " p ew p ew " sounds as he played
of the yard . It might cause trouble ." Tom said
. He said it would be much warmer outside then .
," said Lily . " Do you want to jump in
filled the air . " Uh - oh ," said Jack

INTERVAL -0.003 - 0.035
CONTAINS 63.861%

he was safe . The bear never returned the open pool
, but Tim my kept asking and asking . Finally ,
the flashlight on them . " Yes , we
her happy for a long time . From that day on
, and it was a bright silver beacon in the countryside

INTERVAL -0.041 - -0.003
CONTAINS 36.040%

<|endoftext|> L ily and Tom were playing in their room .
But then he smiled at her and said he
a time there was an ugly start . It was so
started to get dark , Jack had to go home .
knew he shouldn 't be there , but he couldn 't

INTERVAL -0.079 - -0.041
CONTAINS 0.011%

the most adorable creature he had ever seen . Henry wanted
Which one do you want , Tom ?" "
something to put on the plate ?" He asked .
reminds me of when I was young . I used to
. Do you have any money ?" Dad says , "
Token She
Token ID1375
Feature activation+8.08e-03

Pos logit contributions
 may+0.006
gh+0.005
xy+0.005
 omega+0.005
ww+0.005
Neg logit contributions
 older-0.005
 below-0.005
ivated-0.005
but-0.005
itating-0.005
Token also
Token ID635
Feature activation+4.18e-03

Pos logit contributions
 may+0.012
xy+0.011
 omega+0.011
ww+0.011
gh+0.010
Neg logit contributions
 older-0.012
 below-0.012
 hesitant-0.012
 youngest-0.011
ivated-0.011
Token picked
Token ID6497
Feature activation-9.76e-03

Pos logit contributions
 may+0.007
xy+0.006
 omega+0.006
gh+0.006
of+0.006
Neg logit contributions
 older-0.006
 below-0.006
 hesitant-0.005
ivated-0.005
 youngest-0.005
Token some
Token ID617
Feature activation+1.26e-02

Pos logit contributions
 older+0.010
 below+0.010
 into+0.008
 behind+0.008
 bored+0.008
Neg logit contributions
 may-0.021
 omega-0.020
ww-0.020
xy-0.019
gh-0.019
Token green
Token ID4077
Feature activation+1.75e-02

Pos logit contributions
xy+0.020
 may+0.020
gh+0.020
 omega+0.019
rick+0.019
Neg logit contributions
 older-0.018
 below-0.017
 into-0.015
 behind-0.015
 hesitant-0.015
Token seaw
Token ID32085
Feature activation+7.26e-02

Pos logit contributions
gh+0.029
ww+0.029
xy+0.028
 omega+0.027
rick+0.027
Neg logit contributions
 below-0.023
 older-0.023
 into-0.022
 nearest-0.020
 behind-0.020
Tokeneed
Token ID2308
Feature activation+2.25e-02

Pos logit contributions
 may+0.109
 might+0.102
xy+0.094
ww+0.094
gly+0.094
Neg logit contributions
ider-0.106
but-0.097
 older-0.094
rier-0.092
 below-0.092
Token to
Token ID284
Feature activation+7.32e-03

Pos logit contributions
 may+0.036
 omega+0.034
xy+0.033
gh+0.032
 might+0.032
Neg logit contributions
 below-0.031
 older-0.031
but-0.029
itating-0.029
ivated-0.028
Token show
Token ID905
Feature activation+1.23e-02

Pos logit contributions
 may+0.012
ww+0.011
xy+0.011
 omega+0.011
rick+0.011
Neg logit contributions
 below-0.010
 older-0.010
ivated-0.009
 youngest-0.009
 behind-0.009
Token her
Token ID607
Feature activation-1.07e-02

Pos logit contributions
 may+0.019
 omega+0.019
xy+0.018
ww+0.018
rick+0.018
Neg logit contributions
 older-0.018
 below-0.017
ivated-0.016
but-0.016
itating-0.016
Token friends
Token ID2460
Feature activation-2.75e-03

Pos logit contributions
 older+0.013
 below+0.012
ivated+0.012
itating+0.012
ider+0.012
Neg logit contributions
 may-0.019
xy-0.018
 might-0.018
 omega-0.018
of-0.017
Tokenugg
Token ID6837
Feature activation-1.61e-02

Pos logit contributions
 below+0.005
ider+0.005
ivated+0.005
but+0.005
 older+0.005
Neg logit contributions
 may-0.007
 might-0.006
 omega-0.006
xy-0.006
rick-0.006
Tokenled
Token ID992
Feature activation+8.14e-03

Pos logit contributions
 below+0.016
 older+0.015
ivated+0.014
ider+0.014
but+0.014
Neg logit contributions
 may-0.032
 omega-0.030
gh-0.030
orm-0.030
 might-0.030
Token up
Token ID510
Feature activation+1.21e-02

Pos logit contributions
 may+0.019
 omega+0.018
xy+0.018
rick+0.018
 might+0.018
Neg logit contributions
 older-0.005
 below-0.005
ivated-0.005
 hesitant-0.004
itating-0.004
Token in
Token ID287
Feature activation+1.20e-02

Pos logit contributions
 may+0.023
 might+0.021
xy+0.021
 omega+0.021
of+0.021
Neg logit contributions
 older-0.013
ivated-0.012
 below-0.012
itating-0.012
ider-0.012
Token some
Token ID617
Feature activation+1.57e-02

Pos logit contributions
 may+0.021
xy+0.021
 omega+0.019
rick+0.019
ww+0.019
Neg logit contributions
 older-0.016
 below-0.015
 hesitant-0.014
ivated-0.013
 behind-0.013
Token seaw
Token ID32085
Feature activation+7.22e-02

Pos logit contributions
xy+0.026
 may+0.025
 omega+0.024
rick+0.024
gh+0.024
Neg logit contributions
 older-0.023
 below-0.021
 behind-0.019
 bored-0.018
 into-0.018
Tokeneed
Token ID2308
Feature activation+2.04e-02

Pos logit contributions
 may+0.113
 might+0.102
xy+0.097
gly+0.096
 meanings+0.093
Neg logit contributions
ider-0.107
but-0.095
 older-0.093
rier-0.093
 below-0.088
Token and
Token ID290
Feature activation+6.44e-03

Pos logit contributions
 may+0.032
 omega+0.029
xy+0.029
 might+0.028
gh+0.028
Neg logit contributions
 older-0.029
 below-0.029
ivated-0.027
ider-0.027
itating-0.027
Token felt
Token ID2936
Feature activation-1.69e-02

Pos logit contributions
 may+0.010
xy+0.010
 omega+0.010
gly+0.010
rick+0.010
Neg logit contributions
 older-0.008
 below-0.008
 bored-0.008
 hesitant-0.008
 startled-0.008
Token very
Token ID845
Feature activation-1.06e-02

Pos logit contributions
ider+0.025
 older+0.024
itating+0.024
ivated+0.024
 below+0.023
Neg logit contributions
 may-0.026
 might-0.023
 omega-0.023
of-0.022
gh-0.022
Token cozy
Token ID37438
Feature activation+8.19e-03

Pos logit contributions
 older+0.014
ivated+0.014
 below+0.013
itating+0.013
ider+0.013
Neg logit contributions
 may-0.018
 might-0.016
 omega-0.016
xy-0.016
of-0.015
Token see
Token ID766
Feature activation+5.68e-03

Pos logit contributions
 may+0.014
 might+0.013
 omega+0.012
of+0.012
xy+0.012
Neg logit contributions
 older-0.013
ider-0.012
ivated-0.012
 below-0.012
itating-0.012
Token shells
Token ID19679
Feature activation+1.31e-02

Pos logit contributions
 may+0.010
xy+0.009
gh+0.009
ww+0.009
rick+0.009
Neg logit contributions
 older-0.008
 below-0.008
 bored-0.007
 youngest-0.007
 startled-0.006
Token,
Token ID11
Feature activation+1.11e-02

Pos logit contributions
 omega+0.021
haw+0.021
gh+0.021
ww+0.021
 may+0.020
Neg logit contributions
 below-0.019
 older-0.018
 behind-0.016
 into-0.016
ivated-0.016
Token rocks
Token ID12586
Feature activation+2.91e-02

Pos logit contributions
xy+0.021
gh+0.020
 may+0.020
ww+0.019
gly+0.019
Neg logit contributions
 older-0.014
 below-0.013
 bored-0.012
ivated-0.011
 behind-0.011
Token,
Token ID11
Feature activation+2.89e-02

Pos logit contributions
haw+0.051
 omega+0.051
 meanings+0.050
OW+0.050
ww+0.050
Neg logit contributions
 below-0.038
 older-0.036
 into-0.033
 behind-0.033
but-0.029
Token seaw
Token ID32085
Feature activation+7.14e-02

Pos logit contributions
xy+0.056
gh+0.054
 may+0.052
gly+0.051
ww+0.051
Neg logit contributions
 older-0.037
 below-0.034
 behind-0.032
 into-0.032
 bored-0.030
Tokeneed
Token ID2308
Feature activation+4.53e-02

Pos logit contributions
 may+0.111
xy+0.101
 might+0.101
gly+0.095
 meanings+0.093
Neg logit contributions
ider-0.100
rier-0.093
 older-0.092
 below-0.089
but-0.088
Token,
Token ID11
Feature activation+3.61e-02

Pos logit contributions
haw+0.080
ww+0.078
 omega+0.078
 may+0.078
 meanings+0.077
Neg logit contributions
 below-0.055
 older-0.053
 into-0.050
 behind-0.048
ider-0.045
Token crabs
Token ID49172
Feature activation+3.47e-02

Pos logit contributions
xy+0.075
gly+0.071
ww+0.070
 may+0.070
gh+0.069
Neg logit contributions
 older-0.040
 into-0.037
 below-0.036
 behind-0.036
 bored-0.031
Token and
Token ID290
Feature activation+7.16e-03

Pos logit contributions
 may+0.057
 omega+0.056
haw+0.056
 meanings+0.056
xy+0.055
Neg logit contributions
 below-0.046
 older-0.044
 behind-0.040
 into-0.039
ider-0.039
Token fish
Token ID5916
Feature activation+2.44e-02

Pos logit contributions
xy+0.012
 may+0.011
gh+0.011
gly+0.010
rick+0.010
Neg logit contributions
 older-0.011
 below-0.010
 bored-0.009
 youngest-0.009
vious-0.009
Token necklace
Token ID38987
Feature activation-4.49e-03

Pos logit contributions
 older+0.010
 below+0.009
 youngest+0.009
ider+0.008
itating+0.008
Neg logit contributions
 may-0.010
of-0.009
xy-0.009
 omega-0.009
rick-0.009
Token.
Token ID13
Feature activation-2.84e-03

Pos logit contributions
 older+0.005
 below+0.005
ider+0.005
itating+0.005
 hesitant+0.005
Neg logit contributions
 may-0.008
 omega-0.008
xy-0.007
rick-0.007
of-0.007
Token They
Token ID1119
Feature activation+8.94e-03

Pos logit contributions
 older+0.005
ivated+0.005
ider+0.005
 below+0.005
itating+0.005
Neg logit contributions
 may-0.003
xy-0.003
 omega-0.003
 might-0.003
rick-0.002
Token found
Token ID1043
Feature activation-8.95e-03

Pos logit contributions
 may+0.012
 omega+0.011
gh+0.011
xy+0.011
ww+0.011
Neg logit contributions
 older-0.014
 below-0.014
ivated-0.014
 hesitant-0.013
 youngest-0.013
Token some
Token ID617
Feature activation+6.44e-03

Pos logit contributions
 older+0.013
 below+0.012
 able+0.011
 hesitant+0.011
 into+0.011
Neg logit contributions
 may-0.014
gh-0.014
xy-0.014
of-0.014
ww-0.014
Token seaw
Token ID32085
Feature activation+6.87e-02

Pos logit contributions
xy+0.009
 may+0.008
rick+0.008
haw+0.008
of+0.008
Neg logit contributions
 older-0.011
 below-0.010
 hesitant-0.009
 elderly-0.009
 behind-0.009
Tokeneed
Token ID2308
Feature activation+1.88e-02

Pos logit contributions
 may+0.107
 might+0.098
gly+0.096
xy+0.096
haw+0.092
Neg logit contributions
ider-0.099
but-0.089
rier-0.085
 below-0.081
 older-0.081
Token to
Token ID284
Feature activation+8.45e-03

Pos logit contributions
 may+0.030
 omega+0.028
xy+0.028
of+0.027
gh+0.027
Neg logit contributions
 below-0.026
 older-0.026
ider-0.024
but-0.024
itating-0.023
Token use
Token ID779
Feature activation+3.00e-03

Pos logit contributions
 may+0.013
ww+0.013
xy+0.012
rick+0.012
 omega+0.012
Neg logit contributions
 older-0.012
 below-0.012
ivated-0.011
 youngest-0.010
 hesitant-0.010
Token as
Token ID355
Feature activation+2.34e-03

Pos logit contributions
ww+0.005
haw+0.005
 omega+0.005
xy+0.005
rick+0.005
Neg logit contributions
 below-0.004
 older-0.004
 into-0.003
 behind-0.003
 able-0.003
Token string
Token ID4731
Feature activation+1.33e-02

Pos logit contributions
of+0.004
 may+0.004
 omega+0.003
rick+0.003
gly+0.003
Neg logit contributions
 older-0.003
 below-0.003
 able-0.003
 bored-0.003
 hesitant-0.003
Token home
Token ID1363
Feature activation-1.29e-02

Pos logit contributions
 older+0.005
 below+0.005
 hesitant+0.004
 glimpse+0.004
 startled+0.004
Neg logit contributions
xy-0.004
 may-0.004
 omega-0.004
rick-0.004
gly-0.004
Token.
Token ID13
Feature activation-8.92e-03

Pos logit contributions
 older+0.016
 below+0.016
ivated+0.015
ider+0.015
itating+0.015
Neg logit contributions
 may-0.023
 omega-0.021
xy-0.021
 might-0.021
rick-0.021
Token He
Token ID679
Feature activation+6.62e-03

Pos logit contributions
 older+0.015
ivated+0.015
 below+0.015
itating+0.015
ider+0.015
Neg logit contributions
 may-0.011
 might-0.010
of-0.010
 omega-0.010
xy-0.010
Token grabbed
Token ID13176
Feature activation-3.03e-03

Pos logit contributions
 may+0.010
xy+0.009
 omega+0.009
gh+0.009
of+0.009
Neg logit contributions
 older-0.010
 below-0.009
 hesitant-0.009
ivated-0.009
ider-0.009
Token some
Token ID617
Feature activation+4.87e-03

Pos logit contributions
 older+0.004
 below+0.004
 behind+0.003
 into+0.003
ivated+0.003
Neg logit contributions
 may-0.006
xy-0.005
gh-0.005
rick-0.005
ww-0.005
Token seaw
Token ID32085
Feature activation+6.50e-02

Pos logit contributions
xy+0.008
 may+0.007
gh+0.007
rick+0.007
 omega+0.007
Neg logit contributions
 older-0.007
 below-0.007
 hesitant-0.006
 startled-0.006
 into-0.006
Tokeneed
Token ID2308
Feature activation+1.56e-02

Pos logit contributions
 may+0.094
 might+0.085
xy+0.082
gly+0.078
 meanings+0.077
Neg logit contributions
ider-0.103
but-0.094
rier-0.093
 older-0.092
 below-0.090
Token and
Token ID290
Feature activation+4.95e-03

Pos logit contributions
 may+0.023
xy+0.021
 might+0.021
 omega+0.021
of+0.020
Neg logit contributions
 older-0.024
 below-0.023
ivated-0.023
itating-0.022
ider-0.022
Token started
Token ID2067
Feature activation+1.47e-02

Pos logit contributions
 may+0.008
xy+0.008
gh+0.007
of+0.007
gly+0.007
Neg logit contributions
 older-0.007
 below-0.006
 hesitant-0.006
 startled-0.006
 able-0.006
Token to
Token ID284
Feature activation+1.06e-02

Pos logit contributions
 may+0.027
xy+0.025
 omega+0.025
of+0.025
 might+0.025
Neg logit contributions
 older-0.018
 below-0.017
ivated-0.016
 hesitant-0.016
itating-0.016
Token build
Token ID1382
Feature activation-2.05e-03

Pos logit contributions
xy+0.017
ww+0.017
 may+0.016
 omega+0.016
rick+0.016
Neg logit contributions
 older-0.015
 below-0.014
ivated-0.013
 youngest-0.013
 hesitant-0.013
Token together
Token ID1978
Feature activation-5.88e-03

Pos logit contributions
 older+0.001
itating+0.000
ivated+0.000
rier+0.000
 below+0.000
Neg logit contributions
 may-0.001
 might-0.000
xy-0.000
gh-0.000
rick-0.000
Token.
Token ID13
Feature activation-7.06e-04

Pos logit contributions
 older+0.010
ivated+0.009
itating+0.009
rier+0.009
ider+0.009
Neg logit contributions
 may-0.008
 might-0.008
gh-0.007
xy-0.007
orm-0.007
Token They
Token ID1119
Feature activation-5.63e-03

Pos logit contributions
 below+0.001
 older+0.001
ivated+0.001
itating+0.001
ider+0.001
Neg logit contributions
 may-0.001
 might-0.001
rick-0.001
xy-0.001
 omega-0.001
Token played
Token ID2826
Feature activation+6.09e-04

Pos logit contributions
 older+0.008
 below+0.008
ivated+0.008
 bored+0.008
 youngest+0.007
Neg logit contributions
 may-0.008
xy-0.008
 omega-0.008
gh-0.008
ww-0.008
Token hop
Token ID1725
Feature activation+2.22e-02

Pos logit contributions
 may+0.001
 omega+0.001
xy+0.001
 might+0.001
rick+0.001
Neg logit contributions
 older-0.001
 below-0.001
ivated-0.001
 hesitant-0.001
ider-0.001
Tokensc
Token ID1416
Feature activation+6.48e-02

Pos logit contributions
 may+0.039
 omega+0.038
xy+0.036
 fiber+0.036
gh+0.036
Neg logit contributions
 below-0.027
 older-0.026
ivated-0.025
 hesitant-0.024
 behind-0.024
Tokenot
Token ID313
Feature activation+1.25e-02

Pos logit contributions
 may+0.088
 might+0.077
xy+0.077
 omega+0.072
 meanings+0.070
Neg logit contributions
ivated-0.100
 into-0.098
ider-0.096
apping-0.096
but-0.095
Tokench
Token ID354
Feature activation+2.75e-02

Pos logit contributions
 omega+0.020
 may+0.020
 fiber+0.019
xy+0.019
 meanings+0.019
Neg logit contributions
 into-0.016
 below-0.016
vious-0.016
 behind-0.015
ivated-0.015
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
 may+0.042
 omega+0.038
rick+0.036
 might+0.036
xy+0.036
Neg logit contributions
 below-0.040
 older-0.040
ivated-0.039
but-0.039
ider-0.038
Token picked
Token ID6497
Feature activation-1.52e-02

Pos logit contributions
 omega+0.014
 may+0.014
xy+0.014
gh+0.013
 fiber+0.013
Neg logit contributions
 below-0.015
 bored-0.015
 behind-0.015
 older-0.014
 into-0.014
Token more
Token ID517
Feature activation+1.34e-02

Pos logit contributions
 below+0.017
 behind+0.015
 older+0.015
 into+0.015
 up+0.015
Neg logit contributions
 may-0.029
 fiber-0.028
 omega-0.028
xy-0.027
gh-0.027
Token He
Token ID679
Feature activation+9.53e-03

Pos logit contributions
 older+0.013
ivated+0.013
 below+0.013
itating+0.012
ider+0.012
Neg logit contributions
 may-0.011
 might-0.010
xy-0.010
 omega-0.010
of-0.010
Token also
Token ID635
Feature activation-4.33e-04

Pos logit contributions
 may+0.012
 omega+0.012
xy+0.011
gh+0.011
ww+0.011
Neg logit contributions
 older-0.016
 below-0.016
 hesitant-0.015
ivated-0.015
itating-0.015
Token saw
Token ID2497
Feature activation-1.08e-02

Pos logit contributions
 older+0.001
 below+0.001
ivated+0.001
itating+0.001
 hesitant+0.001
Neg logit contributions
 may-0.001
xy-0.001
 omega-0.001
 might-0.001
of-0.001
Token some
Token ID617
Feature activation+3.78e-03

Pos logit contributions
 older+0.015
 below+0.015
 hesitant+0.013
 youngest+0.013
 startled+0.013
Neg logit contributions
 may-0.018
 omega-0.018
xy-0.018
gh-0.017
rick-0.017
Token tasty
Token ID25103
Feature activation+2.28e-02

Pos logit contributions
xy+0.006
 may+0.006
 omega+0.006
gh+0.006
rick+0.006
Neg logit contributions
 older-0.006
 below-0.005
 hesitant-0.005
 startled-0.005
ivated-0.005
Token seaw
Token ID32085
Feature activation+6.44e-02

Pos logit contributions
xy+0.039
rick+0.037
gh+0.036
 may+0.036
 omega+0.036
Neg logit contributions
 below-0.032
 older-0.030
 into-0.027
 behind-0.026
 baker-0.026
Tokeneed
Token ID2308
Feature activation+1.10e-02

Pos logit contributions
 may+0.088
 might+0.078
xy+0.075
 omega+0.073
gly+0.072
Neg logit contributions
ider-0.105
 older-0.099
 below-0.099
but-0.098
rier-0.094
Token and
Token ID290
Feature activation+1.57e-03

Pos logit contributions
 may+0.018
 omega+0.017
xy+0.017
rick+0.016
gh+0.016
Neg logit contributions
 below-0.015
 older-0.015
itating-0.013
ider-0.013
ivated-0.013
Token decided
Token ID3066
Feature activation-6.88e-03

Pos logit contributions
 may+0.002
xy+0.002
of+0.002
gh+0.002
rick+0.002
Neg logit contributions
 older-0.002
 below-0.002
 hesitant-0.002
 startled-0.002
 bored-0.002
Token to
Token ID284
Feature activation-2.56e-04

Pos logit contributions
 older+0.009
ivated+0.009
 below+0.009
ider+0.008
itating+0.008
Neg logit contributions
 may-0.012
 might-0.011
xy-0.010
 omega-0.010
of-0.010
Token give
Token ID1577
Feature activation+1.48e-02

Pos logit contributions
 older+0.000
 below+0.000
ivated+0.000
itating+0.000
 hesitant+0.000
Neg logit contributions
 may-0.000
xy-0.000
 omega-0.000
ww-0.000
rick-0.000
Token the
Token ID262
Feature activation-1.38e-03

Pos logit contributions
 omega+0.015
xy+0.015
ww+0.015
 may+0.014
rick+0.014
Neg logit contributions
 below-0.012
 older-0.010
 behind-0.010
 into-0.009
ivated-0.009
Token different
Token ID1180
Feature activation+1.92e-02

Pos logit contributions
 older+0.002
ivated+0.002
 below+0.002
itating+0.002
ider+0.002
Neg logit contributions
 may-0.002
 might-0.002
xy-0.002
 omega-0.002
of-0.002
Token fish
Token ID5916
Feature activation+1.47e-02

Pos logit contributions
 may+0.027
xy+0.024
 omega+0.024
gh+0.024
rick+0.024
Neg logit contributions
 older-0.031
 below-0.030
ivated-0.029
itating-0.029
 hesitant-0.028
Token and
Token ID290
Feature activation+6.24e-03

Pos logit contributions
 omega+0.028
 may+0.026
gh+0.026
ww+0.026
haw+0.025
Neg logit contributions
 below-0.018
 older-0.016
itating-0.015
but-0.015
ivated-0.014
Token colorful
Token ID20239
Feature activation+2.13e-02

Pos logit contributions
xy+0.010
 may+0.010
gh+0.010
 omega+0.010
ww+0.009
Neg logit contributions
 below-0.009
 older-0.008
 youngest-0.008
 startled-0.008
 into-0.008
Token seaw
Token ID32085
Feature activation+6.40e-02

Pos logit contributions
xy+0.033
 may+0.032
 omega+0.031
gh+0.031
rick+0.030
Neg logit contributions
 below-0.031
 older-0.031
 youngest-0.027
ivated-0.026
 into-0.026
Tokeneed
Token ID2308
Feature activation+2.07e-02

Pos logit contributions
 may+0.092
 might+0.084
xy+0.080
gly+0.079
haw+0.078
Neg logit contributions
ider-0.100
rier-0.092
 below-0.092
but-0.088
 older-0.088
Token below
Token ID2174
Feature activation+1.42e-02

Pos logit contributions
 may+0.037
 omega+0.035
xy+0.034
rick+0.034
 might+0.034
Neg logit contributions
 below-0.024
 older-0.024
ider-0.022
ivated-0.022
itating-0.022
Token.
Token ID13
Feature activation+7.10e-03

Pos logit contributions
 omega+0.028
ww+0.027
 may+0.027
xy+0.027
haw+0.026
Neg logit contributions
 below-0.017
 older-0.014
 behind-0.013
ivated-0.013
 startled-0.013
Token
Token ID198
Feature activation-1.66e-03

Pos logit contributions
 may+0.011
 omega+0.010
xy+0.010
 might+0.010
gh+0.010
Neg logit contributions
 older-0.010
 below-0.010
ivated-0.010
itating-0.010
 hesitant-0.010
Token
Token ID198
Feature activation+2.61e-03

Pos logit contributions
 older+0.002
 below+0.002
ivated+0.002
itating+0.002
ider+0.002
Neg logit contributions
 may-0.002
xy-0.002
 might-0.002
 omega-0.002
of-0.002
Token sea
Token ID5417
Feature activation+2.04e-02

Pos logit contributions
 may+0.002
 might+0.001
of+0.001
rick+0.001
 looks+0.001
Neg logit contributions
ivated-0.002
ider-0.002
itating-0.002
 hesitant-0.002
 older-0.002
Token.
Token ID13
Feature activation+6.07e-03

Pos logit contributions
 may+0.035
 might+0.032
xy+0.032
 omega+0.032
of+0.031
Neg logit contributions
 older-0.026
 below-0.025
ivated-0.025
itating-0.024
ider-0.024
Token She
Token ID1375
Feature activation+1.40e-03

Pos logit contributions
 may+0.009
xy+0.008
 omega+0.008
rick+0.008
gh+0.008
Neg logit contributions
 older-0.009
 below-0.009
ivated-0.009
itating-0.008
 hesitant-0.008
Token took
Token ID1718
Feature activation-2.70e-03

Pos logit contributions
 may+0.002
xy+0.002
 omega+0.002
gh+0.002
of+0.002
Neg logit contributions
 older-0.002
 below-0.002
 hesitant-0.002
ivated-0.002
itating-0.002
Token some
Token ID617
Feature activation+2.43e-03

Pos logit contributions
 older+0.003
 below+0.003
 into+0.003
ivated+0.003
 able+0.003
Neg logit contributions
xy-0.005
 may-0.005
 omega-0.005
rick-0.005
 might-0.005
Token seaw
Token ID32085
Feature activation+6.39e-02

Pos logit contributions
 may+0.004
xy+0.004
 omega+0.004
rick+0.004
gh+0.004
Neg logit contributions
 older-0.003
 below-0.003
ivated-0.003
 hesitant-0.003
but-0.003
Tokeneed
Token ID2308
Feature activation+1.63e-02

Pos logit contributions
 may+0.090
 might+0.082
xy+0.079
gly+0.074
 meanings+0.073
Neg logit contributions
ider-0.100
but-0.096
 older-0.095
 below-0.093
rier-0.091
Token,
Token ID11
Feature activation+5.40e-03

Pos logit contributions
 may+0.025
xy+0.023
 might+0.023
 omega+0.023
of+0.022
Neg logit contributions
 older-0.023
 below-0.023
ivated-0.022
itating-0.022
ider-0.022
Token made
Token ID925
Feature activation+1.78e-02

Pos logit contributions
 may+0.009
xy+0.008
gh+0.008
gly+0.008
ww+0.008
Neg logit contributions
 older-0.008
 below-0.007
 hesitant-0.007
but-0.007
ivated-0.007
Token a
Token ID257
Feature activation+1.18e-02

Pos logit contributions
 may+0.028
xy+0.025
 omega+0.025
 might+0.024
rick+0.024
Neg logit contributions
 older-0.027
 below-0.026
ivated-0.024
 hesitant-0.024
but-0.024
Token big
Token ID1263
Feature activation+3.29e-05

Pos logit contributions
 may+0.017
xy+0.016
 omega+0.015
 might+0.015
of+0.015
Neg logit contributions
 older-0.019
 below-0.018
ivated-0.017
itating-0.017
 hesitant-0.017
Token.
Token ID13
Feature activation+7.86e-04

Pos logit contributions
ivated+0.017
 older+0.017
itating+0.017
 hesitant+0.016
 below+0.016
Neg logit contributions
 may-0.017
 might-0.015
rick-0.014
xy-0.014
of-0.014
Token It
Token ID632
Feature activation+1.85e-02

Pos logit contributions
 may+0.001
xy+0.001
 omega+0.001
gh+0.001
rick+0.001
Neg logit contributions
 older-0.001
 below-0.001
ivated-0.001
itating-0.001
 hesitant-0.001
Token was
Token ID373
Feature activation+2.53e-02

Pos logit contributions
haw+0.027
 omega+0.026
ww+0.026
 meanings+0.026
oof+0.025
Neg logit contributions
 below-0.028
 hesitant-0.027
 startled-0.027
 older-0.027
 behind-0.026
Token a
Token ID257
Feature activation+1.64e-02

Pos logit contributions
 may+0.046
of+0.046
xy+0.045
haw+0.044
rick+0.043
Neg logit contributions
 older-0.034
 hesitant-0.030
 below-0.029
 startled-0.028
 bored-0.028
Token sun
Token ID4252
Feature activation+2.02e-02

Pos logit contributions
xy+0.025
of+0.024
 may+0.023
haw+0.023
rick+0.023
Neg logit contributions
 older-0.024
 hesitant-0.023
 below-0.022
 startled-0.021
 bored-0.021
Tokensh
Token ID1477
Feature activation+6.34e-02

Pos logit contributions
 may+0.038
 omega+0.037
haw+0.036
xy+0.036
orm+0.035
Neg logit contributions
 below-0.022
 older-0.022
itating-0.020
but-0.020
ider-0.019
Tokeniny
Token ID3541
Feature activation+1.00e-02

Pos logit contributions
 may+0.119
 omega+0.107
haw+0.103
 might+0.102
gly+0.100
Neg logit contributions
 older-0.080
 below-0.080
but-0.075
itating-0.073
ider-0.068
Token day
Token ID1110
Feature activation+1.17e-02

Pos logit contributions
 omega+0.020
haw+0.019
xy+0.019
rick+0.019
 may+0.019
Neg logit contributions
 below-0.010
 behind-0.009
 older-0.008
ider-0.008
 into-0.008
Token and
Token ID290
Feature activation+2.43e-03

Pos logit contributions
 may+0.020
 omega+0.020
haw+0.020
gly+0.019
xy+0.019
Neg logit contributions
 below-0.015
 older-0.014
 behind-0.013
but-0.013
ivated-0.013
Token there
Token ID612
Feature activation-7.74e-03

Pos logit contributions
gh+0.004
xy+0.004
rick+0.004
orm+0.004
 omega+0.004
Neg logit contributions
 older-0.003
 below-0.003
 bored-0.003
 startled-0.003
 into-0.003
Token were
Token ID547
Feature activation-4.55e-03

Pos logit contributions
 below+0.012
 startled+0.011
 hesitant+0.010
 amazed+0.010
 surprised+0.010
Neg logit contributions
ww-0.013
 omega-0.013
haw-0.012
 fiber-0.012
osaurs-0.012
Token.
Token ID13
Feature activation+2.25e-03

Pos logit contributions
 may+0.012
 omega+0.011
xy+0.011
rick+0.011
 might+0.011
Neg logit contributions
 older-0.008
 below-0.008
ivated-0.008
itating-0.008
 hesitant-0.008
Token Barry
Token ID14488
Feature activation+7.06e-03

Pos logit contributions
 may+0.003
 omega+0.003
gh+0.003
of+0.003
xy+0.003
Neg logit contributions
 older-0.004
 below-0.004
ivated-0.003
itating-0.003
 hesitant-0.003
Token enjoyed
Token ID8359
Feature activation+6.38e-03

Pos logit contributions
 may+0.010
 omega+0.009
xy+0.009
gh+0.009
orm+0.008
Neg logit contributions
 older-0.011
 below-0.011
 hesitant-0.011
ivated-0.011
itating-0.011
Token playing
Token ID2712
Feature activation+5.66e-03

Pos logit contributions
 may+0.013
of+0.012
 omega+0.012
 might+0.012
rick+0.012
Neg logit contributions
 below-0.007
 older-0.007
ivated-0.006
but-0.006
 hesitant-0.006
Token hop
Token ID1725
Feature activation+2.47e-02

Pos logit contributions
 may+0.011
 omega+0.010
of+0.010
rick+0.010
xy+0.010
Neg logit contributions
 below-0.006
 older-0.006
ivated-0.006
 hesitant-0.006
but-0.006
Tokensc
Token ID1416
Feature activation+6.17e-02

Pos logit contributions
 omega+0.047
 may+0.046
 fiber+0.043
xy+0.043
rick+0.043
Neg logit contributions
 below-0.028
 older-0.026
ivated-0.025
but-0.024
 hesitant-0.024
Tokenot
Token ID313
Feature activation+1.29e-02

Pos logit contributions
 may+0.089
 might+0.082
 meanings+0.074
 would+0.073
xy+0.073
Neg logit contributions
apping-0.087
but-0.087
ider-0.085
 into-0.084
ivated-0.083
Tokench
Token ID354
Feature activation+2.58e-02

Pos logit contributions
 may+0.019
 omega+0.019
xy+0.018
 meanings+0.017
 might+0.017
Neg logit contributions
vious-0.018
 below-0.018
 older-0.018
 into-0.017
but-0.017
Token with
Token ID351
Feature activation+1.60e-02

Pos logit contributions
 may+0.043
 omega+0.040
 might+0.038
rick+0.038
xy+0.037
Neg logit contributions
 below-0.034
 older-0.034
ivated-0.033
but-0.033
ider-0.033
Token them
Token ID606
Feature activation+4.63e-03

Pos logit contributions
xy+0.027
 omega+0.027
 may+0.027
ww+0.026
gh+0.025
Neg logit contributions
 below-0.022
 older-0.022
 behind-0.019
 into-0.019
 youngest-0.019
Token so
Token ID523
Feature activation+5.66e-03

Pos logit contributions
 may+0.009
 omega+0.008
xy+0.008
rick+0.008
ww+0.008
Neg logit contributions
 older-0.005
 below-0.005
ivated-0.005
itating-0.005
 hesitant-0.005
Token cane
Token ID33009
Feature activation+9.63e-04

Pos logit contributions
 may+0.024
 might+0.021
xy+0.021
 omega+0.021
rick+0.021
Neg logit contributions
 older-0.021
ivated-0.021
 below-0.021
itating-0.020
ider-0.020
Token,
Token ID11
Feature activation-1.64e-05

Pos logit contributions
 omega+0.001
 may+0.001
xy+0.001
ww+0.001
rick+0.001
Neg logit contributions
 below-0.001
 older-0.001
 hesitant-0.001
ivated-0.001
itating-0.001
Token the
Token ID262
Feature activation+1.68e-03

Pos logit contributions
 below+0.000
 older+0.000
ivated+0.000
 hesitant+0.000
 startled+0.000
Neg logit contributions
gh-0.000
 may-0.000
 omega-0.000
rick-0.000
xy-0.000
Token river
Token ID7850
Feature activation+1.06e-02

Pos logit contributions
 may+0.003
xy+0.002
of+0.002
 omega+0.002
rick+0.002
Neg logit contributions
 older-0.002
 below-0.002
ivated-0.002
 hesitant-0.002
itating-0.002
Token became
Token ID2627
Feature activation+2.67e-02

Pos logit contributions
 may+0.015
 omega+0.014
xy+0.014
of+0.014
rick+0.014
Neg logit contributions
 older-0.016
 below-0.016
 hesitant-0.015
ivated-0.015
itating-0.015
Token shall
Token ID2236
Feature activation+6.17e-02

Pos logit contributions
 may+0.044
 might+0.040
 omega+0.039
xy+0.039
rick+0.039
Neg logit contributions
 older-0.036
ivated-0.035
 below-0.034
ider-0.034
itating-0.034
Tokenower
Token ID789
Feature activation+1.46e-02

Pos logit contributions
 may+0.098
 omega+0.091
xy+0.090
rick+0.088
 might+0.087
Neg logit contributions
 older-0.089
 below-0.087
itating-0.083
ivated-0.083
 hesitant-0.082
Token and
Token ID290
Feature activation+2.76e-03

Pos logit contributions
 may+0.028
 omega+0.027
xy+0.026
rick+0.026
 might+0.026
Neg logit contributions
 older-0.016
 below-0.016
itating-0.014
ivated-0.014
 hesitant-0.014
Token shall
Token ID2236
Feature activation+4.26e-02

Pos logit contributions
 may+0.004
 omega+0.004
xy+0.004
gh+0.004
rick+0.004
Neg logit contributions
 older-0.004
 below-0.004
 hesitant-0.003
 startled-0.003
ivated-0.003
Tokenower
Token ID789
Feature activation+6.07e-03

Pos logit contributions
 may+0.058
 might+0.051
of+0.048
xy+0.047
 omega+0.045
Neg logit contributions
 older-0.072
ider-0.071
ivated-0.070
itating-0.068
 youngest-0.067
Token until
Token ID1566
Feature activation-1.18e-02

Pos logit contributions
 may+0.011
 omega+0.010
xy+0.010
 might+0.010
rick+0.010
Neg logit contributions
 older-0.007
 below-0.007
ivated-0.007
itating-0.006
ider-0.006
Token of
Token ID286
Feature activation+2.25e-02

Pos logit contributions
 may+0.024
xy+0.022
 might+0.022
 omega+0.022
of+0.022
Neg logit contributions
 older-0.009
 below-0.009
ivated-0.009
itating-0.008
ider-0.008
Token shells
Token ID19679
Feature activation+1.35e-02

Pos logit contributions
xy+0.035
 may+0.035
 omega+0.034
gh+0.033
rick+0.033
Neg logit contributions
 older-0.035
 below-0.034
ivated-0.029
 bored-0.029
saw-0.029
Token,
Token ID11
Feature activation+1.38e-02

Pos logit contributions
 omega+0.022
 may+0.022
xy+0.021
gh+0.021
haw+0.020
Neg logit contributions
 below-0.019
 older-0.018
ivated-0.017
itating-0.017
but-0.016
Token rocks
Token ID12586
Feature activation+2.91e-02

Pos logit contributions
xy+0.023
 may+0.022
gh+0.022
ww+0.022
 omega+0.021
Neg logit contributions
 older-0.020
 below-0.019
ivated-0.017
 hesitant-0.017
 behind-0.017
Token and
Token ID290
Feature activation+8.06e-03

Pos logit contributions
 omega+0.048
 may+0.046
haw+0.045
orm+0.045
ww+0.045
Neg logit contributions
 below-0.042
 older-0.039
itating-0.036
but-0.035
ivated-0.035
Token seaw
Token ID32085
Feature activation+6.16e-02

Pos logit contributions
xy+0.013
 may+0.012
gh+0.012
 omega+0.011
ww+0.011
Neg logit contributions
 older-0.012
 below-0.011
 youngest-0.010
ivated-0.010
 hesitant-0.010
Tokeneed
Token ID2308
Feature activation+2.56e-02

Pos logit contributions
 may+0.092
 might+0.082
xy+0.081
ww+0.076
gly+0.075
Neg logit contributions
ider-0.093
 older-0.085
rier-0.084
 below-0.084
but-0.084
Token,
Token ID11
Feature activation+1.36e-02

Pos logit contributions
 may+0.044
 omega+0.043
xy+0.041
rick+0.041
ww+0.041
Neg logit contributions
 below-0.032
 older-0.032
ider-0.029
itating-0.029
ivated-0.028
Token but
Token ID475
Feature activation-1.46e-02

Pos logit contributions
 may+0.024
xy+0.024
ww+0.023
 omega+0.023
gh+0.023
Neg logit contributions
 older-0.017
 below-0.017
 hesitant-0.015
ivated-0.015
but-0.015
Token Jane
Token ID12091
Feature activation+1.61e-03

Pos logit contributions
 older+0.020
 below+0.020
ivated+0.019
 hesitant+0.019
itating+0.019
Neg logit contributions
 may-0.023
 omega-0.022
xy-0.022
of-0.021
rick-0.021
Token was
Token ID373
Feature activation-7.65e-03

Pos logit contributions
 omega+0.002
 may+0.002
ww+0.002
xy+0.002
haw+0.002
Neg logit contributions
 older-0.002
 below-0.002
 hesitant-0.002
itating-0.002
ivated-0.002
Token beach
Token ID10481
Feature activation+3.26e-03

Pos logit contributions
ivated+0.002
itating+0.002
 older+0.002
but+0.002
ider+0.002
Neg logit contributions
 may-0.002
 might-0.002
 omega-0.002
orm-0.002
ww-0.002
Token was
Token ID373
Feature activation+1.04e-02

Pos logit contributions
 may+0.003
 omega+0.003
xy+0.003
of+0.003
rick+0.003
Neg logit contributions
 older-0.006
 below-0.006
 hesitant-0.006
 startled-0.006
itating-0.006
Token full
Token ID1336
Feature activation+8.38e-03

Pos logit contributions
 may+0.015
 might+0.014
gh+0.013
 hard+0.012
 omega+0.012
Neg logit contributions
ivated-0.017
ider-0.016
itating-0.016
but-0.016
 below-0.016
Token of
Token ID286
Feature activation+9.98e-03

Pos logit contributions
 may+0.016
 might+0.015
of+0.015
 omega+0.015
rick+0.015
Neg logit contributions
ivated-0.009
itating-0.008
 older-0.008
ider-0.008
 below-0.008
Token messy
Token ID24097
Feature activation+6.38e-03

Pos logit contributions
 may+0.016
xy+0.016
gh+0.015
of+0.015
rick+0.015
Neg logit contributions
 older-0.015
 below-0.014
 bored-0.013
ivated-0.013
 hesitant-0.013
Token seaw
Token ID32085
Feature activation+6.03e-02

Pos logit contributions
 may+0.009
xy+0.009
of+0.009
 omega+0.009
rick+0.008
Neg logit contributions
 older-0.010
 below-0.010
ivated-0.009
ider-0.009
 hesitant-0.009
Tokeneed
Token ID2308
Feature activation+9.10e-03

Pos logit contributions
 may+0.092
 might+0.083
gly+0.078
 meanings+0.077
xy+0.076
Neg logit contributions
ider-0.090
but-0.082
 below-0.081
 older-0.079
rier-0.079
Token.
Token ID13
Feature activation+2.51e-03

Pos logit contributions
 may+0.016
 omega+0.015
xy+0.015
rick+0.015
of+0.015
Neg logit contributions
 older-0.011
 below-0.011
ider-0.010
itating-0.010
ivated-0.010
Token The
Token ID383
Feature activation-4.54e-03

Pos logit contributions
 may+0.004
xy+0.003
 omega+0.003
 might+0.003
gh+0.003
Neg logit contributions
 older-0.004
 below-0.004
ivated-0.004
itating-0.004
ider-0.004
Token girl
Token ID2576
Feature activation-4.21e-03

Pos logit contributions
ivated+0.007
 older+0.007
itating+0.007
 below+0.007
ider+0.007
Neg logit contributions
 may-0.006
 might-0.006
 omega-0.006
of-0.005
orm-0.005
Token tried
Token ID3088
Feature activation+5.30e-03

Pos logit contributions
 older+0.007
 below+0.007
ivated+0.007
itating+0.007
ider+0.007
Neg logit contributions
 may-0.005
xy-0.005
 omega-0.005
 might-0.004
of-0.004
Token love
Token ID1842
Feature activation+1.86e-02

Pos logit contributions
 may+0.021
xy+0.021
gh+0.020
 omega+0.019
rick+0.019
Neg logit contributions
 older-0.018
 below-0.017
 bored-0.016
 youngest-0.016
 hesitant-0.016
Token.
Token ID13
Feature activation-9.07e-03

Pos logit contributions
 may+0.033
 omega+0.033
xy+0.032
of+0.032
rick+0.031
Neg logit contributions
 below-0.023
 older-0.021
 behind-0.019
but-0.019
 hesitant-0.018
Token They
Token ID1119
Feature activation+3.99e-03

Pos logit contributions
 older+0.016
ivated+0.016
 below+0.015
ider+0.015
itating+0.015
Neg logit contributions
 may-0.011
 might-0.010
xy-0.010
rick-0.010
 omega-0.010
Token played
Token ID2826
Feature activation+4.00e-03

Pos logit contributions
 may+0.005
 omega+0.005
xy+0.005
gh+0.005
rick+0.005
Neg logit contributions
 older-0.006
 below-0.006
ivated-0.006
 hesitant-0.006
 youngest-0.006
Token hop
Token ID1725
Feature activation+2.33e-02

Pos logit contributions
 may+0.007
 omega+0.006
xy+0.006
rick+0.006
 might+0.006
Neg logit contributions
 older-0.005
 below-0.005
ivated-0.005
 hesitant-0.005
ider-0.005
Tokensc
Token ID1416
Feature activation+6.02e-02

Pos logit contributions
 omega+0.041
 may+0.041
rick+0.038
 fiber+0.038
xy+0.038
Neg logit contributions
 below-0.029
 older-0.027
ivated-0.026
itating-0.025
 hesitant-0.025
Tokenot
Token ID313
Feature activation+1.44e-02

Pos logit contributions
 may+0.085
xy+0.077
 might+0.077
 meanings+0.074
 omega+0.072
Neg logit contributions
 into-0.083
apping-0.083
ider-0.079
ivated-0.078
ashes-0.078
Tokench
Token ID354
Feature activation+2.63e-02

Pos logit contributions
 may+0.021
 omega+0.021
xy+0.020
 meanings+0.020
 fiber+0.019
Neg logit contributions
 into-0.020
 below-0.020
vious-0.019
 older-0.019
but-0.018
Token,
Token ID11
Feature activation+6.24e-03

Pos logit contributions
 may+0.044
 omega+0.041
rick+0.039
 meanings+0.038
xy+0.038
Neg logit contributions
 below-0.034
 older-0.033
but-0.033
ivated-0.033
ider-0.032
Token jumped
Token ID11687
Feature activation+6.92e-03

Pos logit contributions
 may+0.010
xy+0.010
 omega+0.010
gh+0.009
rick+0.009
Neg logit contributions
 older-0.009
 below-0.009
ivated-0.008
 hesitant-0.008
but-0.008
Token rope
Token ID17182
Feature activation+1.73e-02

Pos logit contributions
 may+0.012
 might+0.011
xy+0.011
 omega+0.011
of+0.011
Neg logit contributions
 older-0.009
ivated-0.008
 below-0.008
itating-0.008
ider-0.008
Token He
Token ID679
Feature activation+1.11e-02

Pos logit contributions
 may+0.007
 might+0.006
 omega+0.006
xy+0.006
rick+0.006
Neg logit contributions
 older-0.009
ivated-0.009
 below-0.009
ider-0.009
itating-0.009
Token found
Token ID1043
Feature activation-8.54e-03

Pos logit contributions
 may+0.017
xy+0.016
 omega+0.015
gh+0.015
of+0.015
Neg logit contributions
 older-0.016
 below-0.015
 hesitant-0.015
ivated-0.015
itating-0.015
Token it
Token ID340
Feature activation+3.64e-03

Pos logit contributions
 below+0.012
 older+0.012
 hesitant+0.012
 startled+0.011
 able+0.011
Neg logit contributions
 may-0.013
xy-0.012
rick-0.012
gh-0.012
of-0.012
Token wrapped
Token ID12908
Feature activation-4.40e-03

Pos logit contributions
 may+0.007
 omega+0.007
xy+0.007
ww+0.007
rick+0.006
Neg logit contributions
 older-0.004
 below-0.004
 hesitant-0.004
ivated-0.003
itating-0.003
Token in
Token ID287
Feature activation+2.28e-02

Pos logit contributions
 below+0.003
 behind+0.003
 older+0.003
but+0.003
 into+0.003
Neg logit contributions
 may-0.010
 omega-0.010
ww-0.010
xy-0.010
 fiber-0.010
Token seaw
Token ID32085
Feature activation+5.92e-02

Pos logit contributions
 may+0.043
xy+0.043
ww+0.041
of+0.040
gh+0.040
Neg logit contributions
 older-0.027
 below-0.025
 elderly-0.025
 into-0.023
 behind-0.023
Tokeneed
Token ID2308
Feature activation+2.13e-02

Pos logit contributions
 may+0.088
 might+0.080
xy+0.077
gly+0.074
ww+0.073
Neg logit contributions
ider-0.086
but-0.080
 older-0.079
rier-0.078
 below-0.075
Token and
Token ID290
Feature activation+1.81e-02

Pos logit contributions
 may+0.031
 omega+0.028
xy+0.028
 might+0.028
of+0.027
Neg logit contributions
 older-0.033
 below-0.032
ivated-0.031
itating-0.031
ider-0.031
Token brought
Token ID3181
Feature activation+5.00e-03

Pos logit contributions
 may+0.031
xy+0.029
of+0.029
gh+0.029
rick+0.028
Neg logit contributions
 older-0.023
 below-0.022
 hesitant-0.021
 startled-0.021
 bored-0.020
Token it
Token ID340
Feature activation+8.64e-03

Pos logit contributions
xy+0.008
ww+0.008
rick+0.008
 omega+0.008
 may+0.008
Neg logit contributions
 below-0.007
 into-0.007
 older-0.006
 behind-0.006
 able-0.006
Token back
Token ID736
Feature activation+1.60e-02

Pos logit contributions
 may+0.017
 might+0.016
of+0.015
xy+0.015
gh+0.015
Neg logit contributions
 older-0.009
ivated-0.009
itating-0.009
 below-0.008
ider-0.008
Token They
Token ID1119
Feature activation+6.79e-04

Pos logit contributions
 may+0.003
xy+0.003
gh+0.003
 omega+0.003
ww+0.003
Neg logit contributions
 older-0.003
ivated-0.003
but-0.003
 below-0.003
 hesitant-0.003
Token started
Token ID2067
Feature activation+1.41e-02

Pos logit contributions
 may+0.001
xy+0.001
 omega+0.001
 might+0.001
of+0.001
Neg logit contributions
 older-0.001
 below-0.001
ivated-0.001
itating-0.001
 hesitant-0.001
Token to
Token ID284
Feature activation+1.18e-02

Pos logit contributions
 may+0.024
 might+0.022
xy+0.021
 omega+0.021
rick+0.021
Neg logit contributions
 older-0.018
 below-0.018
ider-0.018
ivated-0.018
itating-0.017
Token play
Token ID711
Feature activation+2.73e-03

Pos logit contributions
 may+0.018
xy+0.017
 omega+0.017
ww+0.017
gh+0.016
Neg logit contributions
 older-0.018
ivated-0.017
 below-0.017
 hesitant-0.016
itating-0.016
Token hop
Token ID1725
Feature activation+1.82e-02

Pos logit contributions
 may+0.004
 might+0.004
xy+0.004
gh+0.004
orm+0.004
Neg logit contributions
 older-0.004
ivated-0.004
itating-0.004
ider-0.004
 below-0.004
Tokensc
Token ID1416
Feature activation+5.90e-02

Pos logit contributions
 may+0.030
 omega+0.028
xy+0.027
rick+0.027
 might+0.026
Neg logit contributions
 older-0.025
 below-0.024
ivated-0.024
itating-0.023
 hesitant-0.023
Tokenot
Token ID313
Feature activation+6.71e-03

Pos logit contributions
 may+0.073
xy+0.065
 omega+0.063
 might+0.061
 meanings+0.060
Neg logit contributions
 older-0.093
ider-0.092
 hesitant-0.092
ivated-0.090
 into-0.090
Tokench
Token ID354
Feature activation+2.20e-02

Pos logit contributions
 omega+0.010
 may+0.010
xy+0.010
 fiber+0.010
 meanings+0.010
Neg logit contributions
vious-0.009
 below-0.009
 older-0.009
 into-0.009
 startled-0.009
Token and
Token ID290
Feature activation+7.11e-03

Pos logit contributions
 may+0.038
 omega+0.035
rick+0.033
 might+0.033
xy+0.033
Neg logit contributions
 older-0.028
 below-0.028
ivated-0.027
but-0.027
itating-0.026
Token had
Token ID550
Feature activation-6.36e-03

Pos logit contributions
 may+0.010
 omega+0.010
xy+0.009
rick+0.009
gh+0.009
Neg logit contributions
 below-0.011
 older-0.011
ivated-0.010
 hesitant-0.010
itating-0.010
Token lots
Token ID6041
Feature activation-1.23e-03

Pos logit contributions
 older+0.009
 hesitant+0.008
 below+0.008
 bored+0.008
ivated+0.008
Neg logit contributions
 may-0.010
xy-0.010
gly-0.010
of-0.009
rick-0.009
Token shore
Token ID15191
Feature activation+1.31e-02

Pos logit contributions
 may+0.010
 might+0.009
xy+0.009
of+0.009
rick+0.009
Neg logit contributions
ivated-0.005
ider-0.005
 older-0.005
 hesitant-0.005
itating-0.005
Token,
Token ID11
Feature activation+1.21e-02

Pos logit contributions
 may+0.020
 omega+0.019
xy+0.018
 might+0.018
rick+0.018
Neg logit contributions
 older-0.019
 below-0.018
ivated-0.018
itating-0.018
ider-0.018
Token collecting
Token ID13157
Feature activation+8.41e-03

Pos logit contributions
 may+0.022
 might+0.020
xy+0.020
 omega+0.020
of+0.019
Neg logit contributions
 older-0.015
ivated-0.014
 below-0.014
ider-0.014
itating-0.014
Token shells
Token ID19679
Feature activation+2.06e-02

Pos logit contributions
 may+0.014
xy+0.014
 omega+0.013
rick+0.013
gh+0.013
Neg logit contributions
 below-0.012
 older-0.011
 into-0.010
 behind-0.010
 startled-0.010
Token and
Token ID290
Feature activation+7.57e-03

Pos logit contributions
 may+0.034
 omega+0.034
xy+0.032
gh+0.032
rick+0.031
Neg logit contributions
 below-0.028
 older-0.027
but-0.025
itating-0.025
ivated-0.024
Token seaw
Token ID32085
Feature activation+5.87e-02

Pos logit contributions
xy+0.013
 may+0.012
ww+0.012
gh+0.012
 omega+0.012
Neg logit contributions
 below-0.010
 older-0.010
 into-0.009
 youngest-0.009
 startled-0.009
Tokeneed
Token ID2308
Feature activation+2.44e-02

Pos logit contributions
 may+0.087
 might+0.078
xy+0.075
gly+0.071
rick+0.070
Neg logit contributions
ider-0.090
but-0.083
 below-0.083
rier-0.082
 older-0.082
Token.
Token ID13
Feature activation+2.75e-03

Pos logit contributions
 may+0.044
 omega+0.041
xy+0.040
rick+0.040
 might+0.040
Neg logit contributions
 below-0.029
 older-0.029
ivated-0.027
ider-0.027
itating-0.026
Token Jimmy
Token ID12963
Feature activation+8.38e-04

Pos logit contributions
 may+0.004
xy+0.004
 might+0.004
 omega+0.004
rick+0.004
Neg logit contributions
 older-0.004
 below-0.004
ivated-0.004
 hesitant-0.004
itating-0.004
Token took
Token ID1718
Feature activation+1.34e-03

Pos logit contributions
 omega+0.001
xy+0.001
 may+0.001
gh+0.001
ww+0.001
Neg logit contributions
 below-0.001
 older-0.001
 hesitant-0.001
ivated-0.001
 startled-0.001
Token time
Token ID640
Feature activation+1.78e-02

Pos logit contributions
xy+0.003
 may+0.003
 omega+0.002
rick+0.002
 might+0.002
Neg logit contributions
 older-0.002
 below-0.002
 into-0.001
 able-0.001
 hesitant-0.001
Token started
Token ID2067
Feature activation+1.62e-02

Pos logit contributions
 may+0.008
xy+0.007
 omega+0.007
 might+0.007
of+0.007
Neg logit contributions
 older-0.009
 below-0.009
ivated-0.009
itating-0.008
ider-0.008
Token to
Token ID284
Feature activation+1.36e-02

Pos logit contributions
 may+0.026
 might+0.024
xy+0.023
 omega+0.023
gh+0.023
Neg logit contributions
 older-0.022
ivated-0.022
ider-0.021
itating-0.021
 below-0.021
Token chew
Token ID34722
Feature activation+2.88e-03

Pos logit contributions
 may+0.020
xy+0.020
 omega+0.019
ww+0.019
rick+0.019
Neg logit contributions
 older-0.021
 below-0.020
ivated-0.019
 startled-0.019
 hesitant-0.019
Token on
Token ID319
Feature activation+1.06e-02

Pos logit contributions
 may+0.006
 omega+0.006
haw+0.006
rick+0.006
iverse+0.006
Neg logit contributions
 below-0.003
 behind-0.002
 into-0.002
 older-0.002
 again-0.002
Token some
Token ID617
Feature activation+1.00e-02

Pos logit contributions
 omega+0.021
 may+0.020
ww+0.020
xy+0.020
iverse+0.019
Neg logit contributions
 below-0.012
 older-0.010
 behind-0.010
 into-0.009
rier-0.009
Token seaw
Token ID32085
Feature activation+5.87e-02

Pos logit contributions
 may+0.015
xy+0.015
 omega+0.014
rick+0.014
gh+0.013
Neg logit contributions
 older-0.015
 below-0.014
ivated-0.013
 hesitant-0.013
ider-0.013
Tokeneed
Token ID2308
Feature activation+1.29e-02

Pos logit contributions
 may+0.085
 might+0.075
xy+0.072
gly+0.071
 meanings+0.067
Neg logit contributions
ider-0.091
 older-0.086
rier-0.083
but-0.083
 below-0.081
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
 may+0.021
 might+0.020
xy+0.019
of+0.019
 omega+0.019
Neg logit contributions
 older-0.017
ivated-0.017
 below-0.016
itating-0.016
ider-0.016
Token Mary
Token ID5335
Feature activation-6.20e-04

Pos logit contributions
 may+0.019
xy+0.017
 omega+0.017
 might+0.017
rick+0.017
Neg logit contributions
 older-0.018
 below-0.018
ivated-0.017
itating-0.017
 hesitant-0.017
Token was
Token ID373
Feature activation-7.37e-03

Pos logit contributions
 older+0.001
 below+0.001
 hesitant+0.001
 startled+0.001
ivated+0.001
Neg logit contributions
 may-0.001
xy-0.001
 omega-0.001
ww-0.001
gh-0.001
Token so
Token ID523
Feature activation-6.07e-03

Pos logit contributions
ider+0.013
 register+0.012
saw+0.012
ays+0.011
itating+0.011
Neg logit contributions
 hard-0.009
 might-0.008
 may-0.008
gh-0.008
 long-0.007
Token found
Token ID1043
Feature activation-1.34e-02

Pos logit contributions
 older+0.009
 below+0.009
ivated+0.009
itating+0.009
ider+0.009
Neg logit contributions
 may-0.010
 might-0.009
xy-0.009
 omega-0.009
of-0.009
Token it
Token ID340
Feature activation+4.59e-03

Pos logit contributions
 older+0.020
 below+0.020
ivated+0.018
 hesitant+0.018
itating+0.018
Neg logit contributions
 may-0.020
 omega-0.018
xy-0.018
of-0.018
rick-0.018
Token stuck
Token ID7819
Feature activation-5.72e-03

Pos logit contributions
 may+0.009
 omega+0.008
xy+0.008
rick+0.008
of+0.008
Neg logit contributions
 older-0.005
 below-0.005
ivated-0.005
itating-0.005
ider-0.005
Token in
Token ID287
Feature activation+1.06e-02

Pos logit contributions
 older+0.005
 below+0.005
ivated+0.005
 hesitant+0.005
itating+0.005
Neg logit contributions
 may-0.012
 omega-0.011
xy-0.011
of-0.011
rick-0.011
Token some
Token ID617
Feature activation+6.94e-03

Pos logit contributions
 may+0.018
xy+0.017
gh+0.016
of+0.016
 omega+0.016
Neg logit contributions
 older-0.015
 below-0.014
 youngest-0.013
 hesitant-0.013
ivated-0.013
Token seaw
Token ID32085
Feature activation+5.85e-02

Pos logit contributions
 may+0.010
xy+0.010
gh+0.009
 omega+0.009
of+0.009
Neg logit contributions
 older-0.011
 below-0.011
ivated-0.010
 hesitant-0.010
 bored-0.010
Tokeneed
Token ID2308
Feature activation+1.31e-02

Pos logit contributions
 may+0.079
 might+0.069
xy+0.068
gly+0.066
haw+0.063
Neg logit contributions
ider-0.094
 older-0.092
 below-0.088
but-0.086
rier-0.086
Token.
Token ID13
Feature activation+7.48e-03

Pos logit contributions
 may+0.020
 might+0.018
xy+0.018
of+0.017
rick+0.017
Neg logit contributions
 older-0.019
ivated-0.019
itating-0.018
ider-0.018
 hesitant-0.018
Token Tim
Token ID5045
Feature activation-2.78e-04

Pos logit contributions
 may+0.011
 omega+0.010
xy+0.010
gh+0.009
rick+0.009
Neg logit contributions
 older-0.012
 below-0.011
ivated-0.011
itating-0.011
 hesitant-0.011
Token was
Token ID373
Feature activation-7.91e-03

Pos logit contributions
 older+0.000
 below+0.000
 hesitant+0.000
 startled+0.000
ivated+0.000
Neg logit contributions
 may-0.000
xy-0.000
gh-0.000
 omega-0.000
ww-0.000
Token so
Token ID523
Feature activation-4.46e-03

Pos logit contributions
ider+0.015
but+0.014
saw+0.014
itating+0.014
 register+0.014
Neg logit contributions
 might-0.008
 may-0.008
 hard-0.008
gh-0.007
 omega-0.007
Token noises
Token ID26782
Feature activation+1.08e-02

Pos logit contributions
 may+0.009
 omega+0.008
xy+0.008
rick+0.008
of+0.008
Neg logit contributions
 older-0.010
 below-0.010
ivated-0.010
 hesitant-0.009
itating-0.009
Token like
Token ID588
Feature activation+1.66e-02

Pos logit contributions
 may+0.015
 might+0.014
of+0.013
xy+0.013
 omega+0.013
Neg logit contributions
 older-0.017
ivated-0.016
itating-0.016
ider-0.016
 below-0.016
Token "
Token ID366
Feature activation-3.08e-03

Pos logit contributions
 may+0.027
 omega+0.027
rick+0.027
ww+0.026
of+0.026
Neg logit contributions
 older-0.023
 below-0.023
saw-0.019
 behind-0.019
 bored-0.019
Tokenwho
Token ID8727
Feature activation+2.51e-03

Pos logit contributions
 below+0.003
saw+0.003
but+0.003
ider+0.003
 older+0.003
Neg logit contributions
 may-0.006
 omega-0.005
gly-0.005
rick-0.005
gh-0.005
Tokenosh
Token ID3768
Feature activation-1.29e-03

Pos logit contributions
 omega+0.003
xy+0.003
 may+0.003
rick+0.003
gh+0.003
Neg logit contributions
 older-0.004
 below-0.004
vious-0.004
 announced-0.004
 able-0.004
Token"
Token ID1
Feature activation-5.76e-02

Pos logit contributions
 older+0.002
 below+0.002
ivated+0.001
itating+0.001
ider+0.001
Neg logit contributions
 may-0.002
 omega-0.002
rick-0.002
xy-0.002
 might-0.002
Token and
Token ID290
Feature activation-2.57e-03

Pos logit contributions
 older+0.069
ivated+0.066
 below+0.066
itating+0.064
ider+0.064
Neg logit contributions
 may-0.103
xy-0.094
 might-0.094
 omega-0.093
of-0.093
Token "
Token ID366
Feature activation-1.61e-02

Pos logit contributions
 older+0.004
 below+0.004
ivated+0.004
ider+0.003
 hesitant+0.003
Neg logit contributions
 may-0.004
gh-0.004
rick-0.004
of-0.004
 omega-0.003
Tokenbe
Token ID1350
Feature activation+3.06e-02

Pos logit contributions
 below+0.023
 older+0.022
ider+0.022
ivated+0.022
itating+0.021
Neg logit contributions
 may-0.026
 omega-0.023
 might-0.023
xy-0.022
rick-0.022
Tokenep
Token ID538
Feature activation+6.50e-03

Pos logit contributions
 may+0.059
xy+0.053
 omega+0.051
 might+0.050
rick+0.049
Neg logit contributions
 older-0.036
 bored-0.031
 able-0.031
 youngest-0.031
vious-0.029
Token"
Token ID1
Feature activation-5.47e-02

Pos logit contributions
 may+0.010
 omega+0.009
xy+0.009
rick+0.009
 might+0.009
Neg logit contributions
 older-0.010
 below-0.009
ider-0.009
itating-0.008
ivated-0.008
Token when
Token ID618
Feature activation-4.26e-03

Pos logit contributions
 older+0.015
 below+0.014
ivated+0.014
ider+0.014
itating+0.014
Neg logit contributions
 may-0.017
 omega-0.016
xy-0.015
rick-0.015
gh-0.015
Token they
Token ID484
Feature activation-7.00e-04

Pos logit contributions
 older+0.006
 below+0.006
 bored+0.005
ivated+0.005
ider+0.005
Neg logit contributions
 omega-0.006
 may-0.006
rick-0.006
of-0.006
orm-0.006
Token want
Token ID765
Feature activation+3.34e-03

Pos logit contributions
 older+0.001
 below+0.001
ivated+0.001
 bored+0.001
itating+0.001
Neg logit contributions
 omega-0.001
xy-0.001
gh-0.001
 may-0.001
rick-0.001
Token a
Token ID257
Feature activation-1.50e-02

Pos logit contributions
 may+0.006
xy+0.006
 omega+0.006
rick+0.006
orm+0.006
Neg logit contributions
 older-0.004
 below-0.003
but-0.003
 bored-0.003
ivated-0.003
Token new
Token ID649
Feature activation-1.86e-02

Pos logit contributions
 older+0.022
ivated+0.022
 below+0.022
itating+0.021
ider+0.021
Neg logit contributions
 may-0.023
 might-0.020
of-0.020
 omega-0.020
xy-0.019
Token hairst
Token ID42658
Feature activation-5.69e-02

Pos logit contributions
ivated+0.030
itating+0.030
 below+0.029
 older+0.029
ider+0.028
Neg logit contributions
 may-0.026
 might-0.024
gh-0.023
 omega-0.023
of-0.022
Tokenyle
Token ID2349
Feature activation-2.15e-02

Pos logit contributions
 older+0.093
 entrance+0.088
 startled+0.088
 hesitant+0.088
 terrified+0.086
Neg logit contributions
w-0.068
xy-0.066
of-0.065
 may-0.064
gh-0.062
Token or
Token ID393
Feature activation-1.71e-02

Pos logit contributions
 older+0.027
ivated+0.026
itating+0.025
 below+0.025
 hesitant+0.024
Neg logit contributions
 may-0.039
 might-0.036
xy-0.035
of-0.035
gh-0.034
Token when
Token ID618
Feature activation-5.22e-03

Pos logit contributions
 older+0.027
 below+0.024
 bored+0.024
 hesitant+0.023
 youngest+0.023
Neg logit contributions
 may-0.025
xy-0.025
rick-0.024
of-0.024
gh-0.023
Token their
Token ID511
Feature activation-1.29e-02

Pos logit contributions
 older+0.008
 below+0.007
 bored+0.007
ider+0.007
 youngest+0.007
Neg logit contributions
 may-0.008
 omega-0.007
of-0.007
rick-0.007
xy-0.007
Token hair
Token ID4190
Feature activation-1.15e-02

Pos logit contributions
 older+0.019
ivated+0.019
 below+0.019
itating+0.018
ider+0.018
Neg logit contributions
 may-0.019
 might-0.017
 omega-0.017
xy-0.017
of-0.017
Token What
Token ID1867
Feature activation+1.97e-03

Pos logit contributions
 older+0.017
 below+0.016
ivated+0.016
itating+0.016
ider+0.016
Neg logit contributions
 may-0.015
xy-0.013
 might-0.013
 omega-0.013
of-0.013
Token are
Token ID389
Feature activation-1.13e-02

Pos logit contributions
xy+0.003
 omega+0.002
orm+0.002
rick+0.002
haw+0.002
Neg logit contributions
 older-0.003
 hesitant-0.003
 bored-0.003
 startled-0.003
 below-0.003
Token we
Token ID356
Feature activation-3.26e-02

Pos logit contributions
 older+0.020
 below+0.019
ivated+0.018
ider+0.018
itating+0.018
Neg logit contributions
 may-0.014
of-0.013
 omega-0.013
xy-0.012
rick-0.012
Token going
Token ID1016
Feature activation-3.53e-02

Pos logit contributions
ider+0.052
 below+0.052
itating+0.051
saw+0.050
but+0.050
Neg logit contributions
 may-0.045
 might-0.041
 omega-0.037
xy-0.037
gh-0.037
Token to
Token ID284
Feature activation-3.66e-02

Pos logit contributions
ivated+0.063
rier+0.062
ider+0.062
 youngest+0.060
itating+0.060
Neg logit contributions
 may-0.042
 might-0.038
 would-0.036
 looks-0.036
gh-0.035
Token do
Token ID466
Feature activation-5.48e-02

Pos logit contributions
 older+0.068
 below+0.066
ivated+0.066
itating+0.065
ider+0.064
Neg logit contributions
 may-0.042
xy-0.036
 might-0.036
 omega-0.036
of-0.035
Token?
Token ID30
Feature activation-1.74e-02

Pos logit contributions
 older+0.071
ivated+0.071
itating+0.070
 below+0.070
 youngest+0.068
Neg logit contributions
 may-0.094
 might-0.087
xy-0.084
gh-0.084
of-0.080
Tokenâ
Token ID22940
Feature activation-1.04e-02

Pos logit contributions
 older+0.029
 below+0.028
ivated+0.028
itating+0.027
ider+0.027
Neg logit contributions
 may-0.024
xy-0.021
 might-0.021
 omega-0.021
of-0.020
Token
Token ID26391
Feature activation+7.42e-03

Pos logit contributions
 below+0.014
ivated+0.014
 older+0.014
ider+0.014
itating+0.014
Neg logit contributions
 may-0.017
 might-0.015
 omega-0.015
of-0.014
xy-0.014
Token
Token ID220
Feature activation-6.65e-03

Pos logit contributions
 may+0.011
xy+0.010
 omega+0.010
of+0.010
 might+0.010
Neg logit contributions
 older-0.011
 below-0.011
ivated-0.011
itating-0.011
ider-0.010
Token
Token ID198
Feature activation-1.29e-02

Pos logit contributions
 older+0.011
ivated+0.011
 below+0.010
ider+0.010
itating+0.010
Neg logit contributions
 may-0.009
xy-0.008
 might-0.008
of-0.008
 omega-0.008
Token there
Token ID612
Feature activation-2.93e-02

Pos logit contributions
 may+0.004
 omega+0.004
xy+0.004
rick+0.004
gh+0.004
Neg logit contributions
 older-0.004
 below-0.004
ivated-0.004
itating-0.003
 bored-0.003
Token anything
Token ID1997
Feature activation-1.96e-02

Pos logit contributions
 older+0.038
 below+0.036
ivated+0.033
itating+0.033
 bored+0.032
Neg logit contributions
 may-0.049
rick-0.049
 omega-0.048
xy-0.048
gh-0.046
Token else
Token ID2073
Feature activation-7.13e-03

Pos logit contributions
 older+0.030
ider+0.030
ivated+0.030
itating+0.029
 below+0.029
Neg logit contributions
 may-0.030
 might-0.027
xy-0.024
of-0.024
 omega-0.024
Token I
Token ID314
Feature activation-3.49e-02

Pos logit contributions
 older+0.008
 below+0.008
ivated+0.008
itating+0.007
ider+0.007
Neg logit contributions
 may-0.013
xy-0.012
 might-0.012
 omega-0.012
of-0.012
Token can
Token ID460
Feature activation-1.99e-02

Pos logit contributions
 older+0.073
ivated+0.072
 below+0.072
itating+0.071
ider+0.071
Neg logit contributions
 may-0.033
 might-0.027
 omega-0.025
xy-0.025
of-0.025
Token do
Token ID466
Feature activation-5.47e-02

Pos logit contributions
 older+0.034
 below+0.033
ivated+0.032
itating+0.032
ider+0.032
Neg logit contributions
 may-0.027
 might-0.023
xy-0.022
of-0.022
 omega-0.022
Token for
Token ID329
Feature activation-4.02e-02

Pos logit contributions
 older+0.070
itating+0.069
ivated+0.068
 below+0.067
 youngest+0.067
Neg logit contributions
 may-0.095
 might-0.087
gh-0.084
xy-0.084
of-0.082
Token you
Token ID345
Feature activation-2.74e-02

Pos logit contributions
 older+0.066
 below+0.061
ivated+0.060
ider+0.059
but+0.058
Neg logit contributions
 may-0.056
rick-0.052
xy-0.051
 omega-0.050
of-0.050
Token?"
Token ID1701
Feature activation-5.06e-03

Pos logit contributions
ider+0.033
ivated+0.033
itating+0.033
rier+0.032
 youngest+0.032
Neg logit contributions
 may-0.047
 might-0.043
 would-0.041
of-0.041
gh-0.040
Token she
Token ID673
Feature activation-1.91e-02

Pos logit contributions
 older+0.008
ider+0.007
ivated+0.007
 below+0.007
itating+0.007
Neg logit contributions
 may-0.007
 might-0.007
of-0.007
 omega-0.007
xy-0.007
Token asked
Token ID1965
Feature activation-2.79e-03

Pos logit contributions
itating+0.023
ider+0.023
 below+0.022
ivated+0.022
 older+0.021
Neg logit contributions
 may-0.034
 might-0.032
 omega-0.031
of-0.031
rick-0.030
Token"
Token ID1
Feature activation-5.76e-02

Pos logit contributions
 older+0.002
 below+0.002
ivated+0.001
itating+0.001
ider+0.001
Neg logit contributions
 may-0.002
 omega-0.002
rick-0.002
xy-0.002
 might-0.002
Token and
Token ID290
Feature activation-2.57e-03

Pos logit contributions
 older+0.069
ivated+0.066
 below+0.066
itating+0.064
ider+0.064
Neg logit contributions
 may-0.103
xy-0.094
 might-0.094
 omega-0.093
of-0.093
Token "
Token ID366
Feature activation-1.61e-02

Pos logit contributions
 older+0.004
 below+0.004
ivated+0.004
ider+0.003
 hesitant+0.003
Neg logit contributions
 may-0.004
gh-0.004
rick-0.004
of-0.004
 omega-0.003
Tokenbe
Token ID1350
Feature activation+3.06e-02

Pos logit contributions
 below+0.023
 older+0.022
ider+0.022
ivated+0.022
itating+0.021
Neg logit contributions
 may-0.026
 omega-0.023
 might-0.023
xy-0.022
rick-0.022
Tokenep
Token ID538
Feature activation+6.50e-03

Pos logit contributions
 may+0.059
xy+0.053
 omega+0.051
 might+0.050
rick+0.049
Neg logit contributions
 older-0.036
 bored-0.031
 able-0.031
 youngest-0.031
vious-0.029
Token"
Token ID1
Feature activation-5.47e-02

Pos logit contributions
 may+0.010
 omega+0.009
xy+0.009
rick+0.009
 might+0.009
Neg logit contributions
 older-0.010
 below-0.009
ider-0.009
itating-0.008
ivated-0.008
Token and
Token ID290
Feature activation+1.84e-03

Pos logit contributions
 older+0.072
 below+0.069
ivated+0.068
itating+0.067
ider+0.066
Neg logit contributions
 may-0.092
xy-0.083
 might-0.083
 omega-0.083
of-0.082
Token wear
Token ID5806
Feature activation+9.72e-03

Pos logit contributions
 may+0.003
gh+0.003
rick+0.003
of+0.003
xy+0.003
Neg logit contributions
 older-0.002
 below-0.002
ivated-0.002
ider-0.002
itating-0.002
Token helmets
Token ID28359
Feature activation+5.55e-03

Pos logit contributions
 may+0.014
rick+0.014
 omega+0.014
xy+0.014
of+0.013
Neg logit contributions
 below-0.013
 older-0.013
 into-0.012
 youngest-0.012
 behind-0.012
Token made
Token ID925
Feature activation+4.22e-03

Pos logit contributions
 may+0.009
 might+0.008
xy+0.008
of+0.008
 omega+0.008
Neg logit contributions
 older-0.008
ivated-0.008
 below-0.007
ider-0.007
 hesitant-0.007
Token of
Token ID286
Feature activation+4.20e-03

Pos logit contributions
 may+0.008
 omega+0.007
xy+0.007
rick+0.007
 might+0.007
Neg logit contributions
 older-0.005
 below-0.005
ivated-0.005
 hesitant-0.004
ider-0.004
Token to
Token ID284
Feature activation-9.33e-03

Pos logit contributions
 may+0.014
 might+0.013
of+0.012
 omega+0.012
gh+0.012
Neg logit contributions
ivated-0.013
itating-0.013
ider-0.013
 below-0.013
 older-0.012
Token go
Token ID467
Feature activation-2.56e-03

Pos logit contributions
ivated+0.018
 below+0.017
ider+0.017
itating+0.017
 older+0.017
Neg logit contributions
 may-0.011
 might-0.009
of-0.009
gh-0.009
 omega-0.009
Token play
Token ID711
Feature activation-1.68e-02

Pos logit contributions
rier+0.004
ivated+0.004
itating+0.004
ider+0.004
 youngest+0.004
Neg logit contributions
 may-0.003
 might-0.003
of-0.003
 looks-0.003
gh-0.003
Token at
Token ID379
Feature activation-2.81e-02

Pos logit contributions
itating+0.027
 below+0.026
ivated+0.025
 youngest+0.025
 register+0.025
Neg logit contributions
 may-0.022
gh-0.022
 might-0.020
of-0.020
xy-0.019
Token the
Token ID262
Feature activation-1.08e-02

Pos logit contributions
ider+0.051
itating+0.049
ivated+0.049
 hesitant+0.048
but+0.047
Neg logit contributions
 may-0.034
 might-0.031
xy-0.030
 omega-0.029
of-0.029
Token gym
Token ID11550
Feature activation-5.35e-02

Pos logit contributions
ivated+0.018
ider+0.018
 hesitant+0.017
itating+0.017
rier+0.017
Neg logit contributions
 may-0.015
 might-0.014
of-0.012
 looks-0.012
 is-0.012
Token?"
Token ID1701
Feature activation-7.23e-03

Pos logit contributions
rier+0.090
arted+0.086
ivated+0.085
 youngest+0.085
 paramedics+0.084
Neg logit contributions
 may-0.061
 might-0.061
of-0.060
 looks-0.059
 would-0.058
Token
Token ID198
Feature activation-9.55e-03

Pos logit contributions
 older+0.012
 below+0.012
ivated+0.012
itating+0.012
ider+0.012
Neg logit contributions
 may-0.009
 might-0.008
xy-0.008
 omega-0.008
of-0.008
Token
Token ID198
Feature activation-5.74e-03

Pos logit contributions
 older+0.014
 below+0.014
ivated+0.014
itating+0.014
ider+0.014
Neg logit contributions
 may-0.014
xy-0.013
 might-0.013
 omega-0.013
of-0.013
TokenJack
Token ID14295
Feature activation+7.63e-04

Pos logit contributions
 older+0.010
 below+0.010
ivated+0.010
itating+0.010
ider+0.010
Neg logit contributions
 may-0.007
xy-0.006
 might-0.006
 omega-0.006
of-0.006
Token said
Token ID531
Feature activation+7.40e-03

Pos logit contributions
 may+0.001
 omega+0.001
xy+0.001
gh+0.001
of+0.001
Neg logit contributions
 older-0.001
 hesitant-0.001
 below-0.001
ivated-0.001
itating-0.001
Token to
Token ID284
Feature activation-1.24e-02

Pos logit contributions
 may+0.004
 might+0.004
xy+0.004
 omega+0.004
of+0.004
Neg logit contributions
 older-0.003
 below-0.003
ivated-0.003
itating-0.003
ider-0.003
Token join
Token ID4654
Feature activation+7.37e-03

Pos logit contributions
 older+0.021
 below+0.020
ivated+0.020
itating+0.020
ider+0.019
Neg logit contributions
 may-0.016
xy-0.015
 omega-0.014
 might-0.014
rick-0.014
Token us
Token ID514
Feature activation-2.13e-02

Pos logit contributions
 may+0.011
 omega+0.010
xy+0.010
 might+0.010
rick+0.010
Neg logit contributions
 older-0.011
 below-0.011
ivated-0.010
itating-0.010
 hesitant-0.010
Token for
Token ID329
Feature activation-2.42e-02

Pos logit contributions
ivated+0.029
ider+0.029
rier+0.028
itating+0.028
saw+0.028
Neg logit contributions
 may-0.033
of-0.031
 might-0.030
 thumbs-0.028
 would-0.028
Token a
Token ID257
Feature activation-1.18e-02

Pos logit contributions
 older+0.036
 below+0.034
ivated+0.033
ider+0.032
 hesitant+0.032
Neg logit contributions
 may-0.038
xy-0.034
 omega-0.034
 might-0.034
rick-0.034
Token class
Token ID1398
Feature activation-5.31e-02

Pos logit contributions
 older+0.020
 below+0.019
ivated+0.019
itating+0.018
ider+0.018
Neg logit contributions
 may-0.016
xy-0.014
 omega-0.014
 might-0.014
rick-0.014
Token?
Token ID30
Feature activation-9.02e-03

Pos logit contributions
ivated+0.083
rier+0.081
itating+0.081
 terrified+0.079
 youngest+0.077
Neg logit contributions
 may-0.070
of-0.069
 might-0.066
 would-0.065
gh-0.061
Tokenâ
Token ID22940
Feature activation-1.81e-03

Pos logit contributions
 older+0.015
 below+0.015
ivated+0.015
ider+0.014
itating+0.014
Neg logit contributions
 may-0.012
 might-0.011
of-0.011
 omega-0.010
xy-0.010
Token
Token ID26391
Feature activation+9.65e-03

Pos logit contributions
 older+0.002
 below+0.002
ivated+0.002
itating+0.002
ider+0.002
Neg logit contributions
 may-0.003
 omega-0.003
 might-0.003
xy-0.003
rick-0.003
Token
Token ID198
Feature activation-2.41e-03

Pos logit contributions
 may+0.015
 omega+0.014
xy+0.014
 might+0.013
rick+0.013
Neg logit contributions
 older-0.014
 below-0.013
ivated-0.013
itating-0.013
 hesitant-0.013
Token
Token ID198
Feature activation+4.58e-03

Pos logit contributions
 older+0.004
ider+0.004
ivated+0.004
 below+0.004
itating+0.004
Neg logit contributions
 may-0.003
of-0.003
 might-0.003
xy-0.003
 omega-0.003
Token what
Token ID644
Feature activation+9.20e-03

Pos logit contributions
 older+0.020
 below+0.017
ivated+0.016
rier+0.016
 able+0.016
Neg logit contributions
xy-0.026
gh-0.024
 may-0.024
of-0.024
rick-0.023
Token are
Token ID389
Feature activation-1.13e-02

Pos logit contributions
xy+0.010
 may+0.010
 omega+0.009
of+0.009
rick+0.009
Neg logit contributions
 older-0.017
 bored-0.016
 below-0.016
vious-0.015
 startled-0.015
Token we
Token ID356
Feature activation-2.92e-02

Pos logit contributions
 older+0.021
 below+0.020
ivated+0.020
ider+0.020
itating+0.020
Neg logit contributions
 may-0.013
 omega-0.011
xy-0.011
of-0.011
 might-0.011
Token going
Token ID1016
Feature activation-2.77e-02

Pos logit contributions
ider+0.049
 below+0.049
saw+0.049
but+0.048
itating+0.047
Neg logit contributions
 may-0.038
 might-0.035
 are-0.032
gh-0.032
w-0.030
Token to
Token ID284
Feature activation-3.11e-02

Pos logit contributions
ivated+0.051
rier+0.049
itating+0.049
ider+0.048
 youngest+0.047
Neg logit contributions
 may-0.031
 might-0.028
 would-0.028
 can-0.027
 looks-0.026
Token do
Token ID466
Feature activation-5.30e-02

Pos logit contributions
ider+0.055
 below+0.055
itating+0.055
 older+0.055
ivated+0.055
Neg logit contributions
 may-0.038
 might-0.032
 omega-0.032
xy-0.031
gh-0.031
Token?"
Token ID1701
Feature activation+9.00e-04

Pos logit contributions
ivated+0.071
itating+0.070
 below+0.070
saw+0.069
 youngest+0.069
Neg logit contributions
 may-0.091
gh-0.083
 might-0.083
 is-0.079
xy-0.079
Token His
Token ID2399
Feature activation+1.09e-02

Pos logit contributions
 may+0.001
 might+0.001
xy+0.001
 omega+0.001
of+0.001
Neg logit contributions
 older-0.001
ivated-0.001
 below-0.001
itating-0.001
ider-0.001
Token friend
Token ID1545
Feature activation-5.43e-03

Pos logit contributions
 may+0.023
 might+0.022
xy+0.022
 omega+0.022
of+0.021
Neg logit contributions
 older-0.009
 below-0.009
ivated-0.009
itating-0.008
ider-0.008
Token replied
Token ID8712
Feature activation+7.45e-03

Pos logit contributions
 older+0.008
 below+0.007
 hesitant+0.007
ivated+0.007
itating+0.007
Neg logit contributions
 may-0.008
 omega-0.008
xy-0.008
rick-0.008
 might-0.007
Token,
Token ID11
Feature activation+2.10e-03

Pos logit contributions
 may+0.010
 omega+0.009
xy+0.009
rick+0.009
of+0.009
Neg logit contributions
 older-0.012
 below-0.012
ivated-0.012
 hesitant-0.012
ider-0.012
Token me
Token ID502
Feature activation-1.84e-02

Pos logit contributions
 may+0.003
 might+0.002
 omega+0.002
gh+0.002
xy+0.002
Neg logit contributions
itating-0.004
ivated-0.004
 below-0.004
ider-0.004
 hesitant-0.003
Token some
Token ID617
Feature activation-2.12e-02

Pos logit contributions
ivated+0.025
itating+0.024
 below+0.024
ider+0.023
 older+0.023
Neg logit contributions
 may-0.029
 might-0.027
of-0.027
xy-0.026
 omega-0.026
Token jelly
Token ID27644
Feature activation-3.51e-02

Pos logit contributions
 older+0.038
 below+0.036
ivated+0.036
itating+0.036
ider+0.036
Neg logit contributions
 may-0.026
xy-0.023
 might-0.023
 omega-0.022
of-0.022
Token from
Token ID422
Feature activation-3.00e-02

Pos logit contributions
ivated+0.050
itating+0.049
saw+0.047
 below+0.046
 youngest+0.045
Neg logit contributions
 may-0.057
ww-0.053
gh-0.052
 might-0.051
xy-0.050
Token the
Token ID262
Feature activation-2.27e-02

Pos logit contributions
itating+0.051
ider+0.051
ivated+0.050
 older+0.050
 below+0.050
Neg logit contributions
 may-0.037
 might-0.033
 omega-0.033
xy-0.032
of-0.032
Token store
Token ID3650
Feature activation-5.24e-02

Pos logit contributions
ivated+0.034
itating+0.033
 hesitant+0.033
 older+0.032
ider+0.032
Neg logit contributions
 may-0.034
 might-0.031
 omega-0.030
gh-0.029
xy-0.029
Token?"
Token ID1701
Feature activation-1.14e-02

Pos logit contributions
ivated+0.093
 terrified+0.090
arted+0.088
itating+0.087
rier+0.086
Neg logit contributions
 would-0.052
 is-0.050
 looks-0.049
of-0.048
 cr-0.048
Token
Token ID220
Feature activation-9.96e-03

Pos logit contributions
ider+0.023
rier+0.022
 older+0.021
itating+0.021
 register+0.021
Neg logit contributions
 may-0.013
 might-0.012
 looks-0.010
of-0.010
 lived-0.010
Token
Token ID198
Feature activation-3.04e-03

Pos logit contributions
ider+0.022
ivated+0.020
vious+0.020
 older+0.020
 arrested+0.019
Neg logit contributions
 may-0.008
 might-0.008
xy-0.008
of-0.007
rick-0.006
TokenJane
Token ID41083
Feature activation-1.17e-02

Pos logit contributions
ider+0.006
 older+0.006
rier+0.006
itating+0.006
ivated+0.006
Neg logit contributions
of-0.003
 may-0.003
 might-0.003
rick-0.002
gh-0.002
Token said
Token ID531
Feature activation-1.75e-03

Pos logit contributions
 older+0.017
 below+0.016
 hesitant+0.016
ivated+0.016
itating+0.016
Neg logit contributions
 may-0.018
 omega-0.017
xy-0.017
ww-0.016
gh-0.016
Token you
Token ID345
Feature activation-1.64e-03

Pos logit contributions
orm+0.012
 may+0.012
xy+0.012
rick+0.012
 omega+0.011
Neg logit contributions
 older-0.011
ivated-0.010
 below-0.010
ider-0.009
itating-0.009
Token help
Token ID1037
Feature activation-5.69e-03

Pos logit contributions
 older+0.003
 below+0.003
ivated+0.003
itating+0.003
ider+0.003
Neg logit contributions
 may-0.002
 might-0.002
 omega-0.002
of-0.002
xy-0.001
Token me
Token ID502
Feature activation-1.19e-02

Pos logit contributions
 older+0.010
 below+0.010
ivated+0.010
ider+0.010
itating+0.010
Neg logit contributions
 may-0.007
 might-0.006
 omega-0.006
of-0.006
xy-0.006
Token lift
Token ID10303
Feature activation-1.17e-02

Pos logit contributions
ider+0.021
ivated+0.020
overed+0.020
itating+0.020
arted+0.020
Neg logit contributions
 may-0.014
 might-0.013
 coal-0.012
 cloth-0.012
of-0.012
Token the
Token ID262
Feature activation-1.20e-02

Pos logit contributions
 older+0.017
 below+0.016
ivated+0.015
itating+0.015
but+0.015
Neg logit contributions
 may-0.019
 omega-0.017
xy-0.017
rick-0.016
gh-0.016
Token drawer
Token ID33451
Feature activation-5.24e-02

Pos logit contributions
ivated+0.022
itating+0.021
 below+0.021
 hesitant+0.021
 older+0.021
Neg logit contributions
 may-0.015
 might-0.013
 omega-0.013
gh-0.012
rick-0.012
Token?"
Token ID1701
Feature activation-2.80e-03

Pos logit contributions
ivated+0.082
 paramedics+0.077
 youngest+0.076
arted+0.076
saw+0.074
Neg logit contributions
 may-0.070
 looks-0.066
 might-0.065
 is-0.064
 can-0.064
Token
Token ID198
Feature activation-6.43e-03

Pos logit contributions
 older+0.004
 below+0.004
ivated+0.004
itating+0.004
ider+0.004
Neg logit contributions
 may-0.004
xy-0.004
 omega-0.004
 might-0.004
of-0.004
Token
Token ID198
Feature activation-4.50e-03

Pos logit contributions
 older+0.010
 below+0.010
ivated+0.010
ider+0.009
itating+0.009
Neg logit contributions
 may-0.009
 might-0.008
xy-0.008
 omega-0.008
of-0.008
TokenHer
Token ID9360
Feature activation-8.90e-03

Pos logit contributions
 older+0.006
ivated+0.006
 below+0.006
itating+0.006
ider+0.006
Neg logit contributions
 may-0.008
xy-0.007
 omega-0.007
 might-0.007
rick-0.007
Token mom
Token ID1995
Feature activation+9.31e-04

Pos logit contributions
ivated+0.017
 below+0.016
itating+0.016
but+0.015
vious+0.015
Neg logit contributions
 may-0.011
 might-0.010
 thumbs-0.010
 omega-0.010
 pencil-0.010
Token little
Token ID1310
Feature activation+1.02e-02

Pos logit contributions
 older+0.002
 below+0.002
ivated+0.002
itating+0.002
ider+0.002
Neg logit contributions
 may-0.002
xy-0.002
 omega-0.002
 might-0.002
of-0.002
Token boy
Token ID2933
Feature activation+7.34e-03

Pos logit contributions
 may+0.021
gh+0.020
xy+0.019
 omega+0.019
 might+0.019
Neg logit contributions
 older-0.011
 below-0.010
ivated-0.009
 bored-0.009
 hesitant-0.009
Token who
Token ID508
Feature activation+1.51e-02

Pos logit contributions
 may+0.013
 might+0.012
xy+0.012
 omega+0.012
of+0.012
Neg logit contributions
 older-0.009
ivated-0.009
 below-0.009
itating-0.008
ider-0.008
Token wasn
Token ID2492
Feature activation-3.14e-02

Pos logit contributions
of+0.026
xy+0.026
 Entry+0.026
 silk+0.025
rick+0.025
Neg logit contributions
 bored-0.019
 alone-0.017
 disappointed-0.016
 thirsty-0.015
 jealous-0.015
Token't
Token ID470
Feature activation-1.43e-02

Pos logit contributions
pit+0.057
rier+0.057
 older+0.057
 youngest+0.057
 below+0.057
Neg logit contributions
 may-0.032
of-0.029
gh-0.029
 might-0.028
xy-0.027
Token feeling
Token ID4203
Feature activation-5.20e-02

Pos logit contributions
ider+0.021
itating+0.020
ivated+0.020
 older+0.019
 below+0.019
Neg logit contributions
 may-0.020
 omega-0.019
 might-0.019
orm-0.019
gh-0.018
Token very
Token ID845
Feature activation-3.12e-02

Pos logit contributions
ider+0.072
itating+0.071
ivated+0.069
but+0.068
 older+0.067
Neg logit contributions
 may-0.082
 might-0.075
 omega-0.069
gh-0.067
orm-0.066
Token well
Token ID880
Feature activation-2.18e-02

Pos logit contributions
 older+0.035
 below+0.034
ivated+0.034
itating+0.033
ider+0.033
Neg logit contributions
 may-0.058
 might-0.053
 omega-0.053
xy-0.053
of-0.052
Token.
Token ID13
Feature activation-1.39e-02

Pos logit contributions
 older+0.030
ivated+0.029
itating+0.029
 below+0.028
ider+0.028
Neg logit contributions
 may-0.035
 might-0.032
xy-0.031
gh-0.030
rick-0.030
Token He
Token ID679
Feature activation-1.11e-03

Pos logit contributions
 older+0.020
 below+0.019
 hesitant+0.018
 youngest+0.017
 bored+0.017
Neg logit contributions
 may-0.022
xy-0.021
ww-0.021
gh-0.021
of-0.020
Token was
Token ID373
Feature activation-2.73e-02

Pos logit contributions
 bored+0.002
 older+0.002
 disappointed+0.002
 below+0.001
 hesitant+0.001
Neg logit contributions
 omega-0.002
ww-0.002
of-0.002
xy-0.002
 may-0.001
Token with
Token ID351
Feature activation-2.32e-04

Pos logit contributions
 older+0.018
ivated+0.018
rier+0.017
itating+0.017
 youngest+0.017
Neg logit contributions
 may-0.017
 might-0.016
xy-0.015
gh-0.015
of-0.015
Token the
Token ID262
Feature activation-2.52e-02

Pos logit contributions
 older+0.000
 below+0.000
 youngest+0.000
 bored+0.000
saw+0.000
Neg logit contributions
 may-0.000
xy-0.000
of-0.000
gh-0.000
rick-0.000
Token word
Token ID1573
Feature activation-4.51e-03

Pos logit contributions
 older+0.042
 below+0.041
ider+0.039
 youngest+0.038
ivated+0.038
Neg logit contributions
 may-0.034
xy-0.032
of-0.031
 omega-0.031
rick-0.030
Token "
Token ID366
Feature activation-1.89e-02

Pos logit contributions
 older+0.006
 below+0.006
ivated+0.006
itating+0.006
ider+0.006
Neg logit contributions
 may-0.007
xy-0.007
 omega-0.007
 might-0.007
of-0.007
Tokennews
Token ID10827
Feature activation-4.77e-03

Pos logit contributions
 below+0.027
ivated+0.026
 older+0.026
but+0.025
itating+0.025
Neg logit contributions
 may-0.031
 omega-0.027
 might-0.027
xy-0.027
rick-0.027
Token"
Token ID1
Feature activation-5.17e-02

Pos logit contributions
 older+0.007
 below+0.006
ivated+0.006
ider+0.006
itating+0.006
Neg logit contributions
 may-0.007
 omega-0.007
rick-0.007
xy-0.007
 might-0.007
Token on
Token ID319
Feature activation-4.37e-03

Pos logit contributions
 older+0.062
 below+0.061
 hesitant+0.057
but+0.057
itating+0.056
Neg logit contributions
 may-0.097
 omega-0.088
xy-0.086
rick-0.086
 might-0.085
Token it
Token ID340
Feature activation+2.75e-03

Pos logit contributions
 below+0.005
 behind+0.004
 older+0.004
 elderly+0.004
 before+0.004
Neg logit contributions
 may-0.009
ww-0.008
of-0.008
xy-0.008
rick-0.008
Token.
Token ID13
Feature activation-3.18e-03

Pos logit contributions
 may+0.004
 might+0.004
xy+0.004
of+0.004
rick+0.004
Neg logit contributions
ivated-0.004
 older-0.004
itating-0.004
 below-0.004
ider-0.004
Token Tim
Token ID5045
Feature activation-2.66e-03

Pos logit contributions
 older+0.006
 below+0.005
ivated+0.005
itating+0.005
ider+0.005
Neg logit contributions
 may-0.004
xy-0.003
 omega-0.003
 might-0.003
of-0.003
Token was
Token ID373
Feature activation-7.93e-03

Pos logit contributions
 older+0.004
 below+0.004
 hesitant+0.004
 bored+0.004
 startled+0.004
Neg logit contributions
 may-0.004
xy-0.004
 omega-0.004
gh-0.004
ww-0.004
Tokenila
Token ID10102
Feature activation-1.66e-03

Pos logit contributions
 may+0.005
 might+0.005
 omega+0.004
 thumbs+0.004
xy+0.004
Neg logit contributions
ider-0.003
 older-0.003
 below-0.003
vious-0.003
but-0.003
Token and
Token ID290
Feature activation-8.91e-03

Pos logit contributions
 older+0.002
 below+0.002
ider+0.002
but+0.002
ivated+0.002
Neg logit contributions
 may-0.003
 omega-0.002
xy-0.002
rick-0.002
gh-0.002
Token Ben
Token ID3932
Feature activation+1.27e-02

Pos logit contributions
 older+0.013
 below+0.012
 youngest+0.011
ivated+0.011
 hesitant+0.011
Neg logit contributions
 may-0.014
xy-0.013
gh-0.013
 might-0.013
of-0.013
Token's
Token ID338
Feature activation-1.16e-02

Pos logit contributions
 may+0.020
 omega+0.020
xy+0.019
of+0.019
rick+0.019
Neg logit contributions
 older-0.017
 below-0.016
itating-0.015
ider-0.015
 youngest-0.015
Token Club
Token ID6289
Feature activation-1.12e-02

Pos logit contributions
 older+0.018
 below+0.017
ivated+0.016
ider+0.016
itating+0.016
Neg logit contributions
 may-0.017
 omega-0.016
xy-0.016
of-0.016
gh-0.016
Token"
Token ID1
Feature activation-5.11e-02

Pos logit contributions
 older+0.015
ivated+0.014
 below+0.014
itating+0.014
ider+0.014
Neg logit contributions
 may-0.019
 might-0.017
xy-0.017
of-0.017
 omega-0.017
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
 older+0.063
 below+0.061
but+0.057
 hesitant+0.057
itating+0.057
Neg logit contributions
 may-0.093
 omega-0.085
xy-0.083
rick-0.083
 might-0.082
Token hung
Token ID9174
Feature activation-2.31e-03

Pos logit contributions
 older+0.017
 below+0.016
ivated+0.016
 hesitant+0.016
ider+0.016
Neg logit contributions
 may-0.017
xy-0.015
 omega-0.015
of-0.015
 might-0.015
Token it
Token ID340
Feature activation-3.28e-03

Pos logit contributions
 older+0.003
 below+0.003
ivated+0.003
itating+0.003
ider+0.003
Neg logit contributions
 may-0.004
 omega-0.004
 might-0.004
xy-0.004
of-0.004
Token on
Token ID319
Feature activation-8.63e-03

Pos logit contributions
 older+0.004
ivated+0.004
itating+0.003
ider+0.003
 hesitant+0.003
Neg logit contributions
 may-0.006
 might-0.006
xy-0.006
gh-0.005
of-0.005
Token the
Token ID262
Feature activation-8.98e-03

Pos logit contributions
 older+0.010
 below+0.010
 youngest+0.009
 elderly+0.008
 behind+0.008
Neg logit contributions
 may-0.017
xy-0.016
 omega-0.016
ww-0.015
haw-0.015
Token day
Token ID1110
Feature activation-2.38e-03

Pos logit contributions
 may+0.016
xy+0.014
 omega+0.014
 might+0.014
gh+0.014
Neg logit contributions
 older-0.022
 below-0.022
ivated-0.021
itating-0.021
 hesitant-0.021
Token,
Token ID11
Feature activation-1.81e-02

Pos logit contributions
 older+0.004
ivated+0.004
 below+0.004
ider+0.004
 hesitant+0.004
Neg logit contributions
 may-0.003
xy-0.003
 omega-0.003
 might-0.003
rick-0.003
Token Emily
Token ID17608
Feature activation-3.53e-03

Pos logit contributions
 older+0.032
 below+0.032
ivated+0.031
itating+0.031
ider+0.031
Neg logit contributions
 may-0.022
 might-0.019
 omega-0.019
of-0.018
xy-0.018
Token wasn
Token ID2492
Feature activation-3.64e-02

Pos logit contributions
 older+0.006
ivated+0.006
ider+0.006
 below+0.006
itating+0.006
Neg logit contributions
 may-0.005
 might-0.004
rick-0.004
of-0.004
xy-0.004
Token't
Token ID470
Feature activation-1.79e-02

Pos logit contributions
rier+0.073
pit+0.073
 below+0.072
 youngest+0.070
ider+0.068
Neg logit contributions
 may-0.037
 might-0.031
gh-0.030
of-0.029
 hard-0.025
Token feeling
Token ID4203
Feature activation-5.10e-02

Pos logit contributions
ider+0.026
itating+0.024
but+0.023
saw+0.023
 register+0.022
Neg logit contributions
 may-0.026
 might-0.025
 hard-0.024
 omega-0.023
ww-0.023
Token very
Token ID845
Feature activation-2.93e-02

Pos logit contributions
ider+0.072
itating+0.069
but+0.068
ivated+0.067
 below+0.066
Neg logit contributions
 may-0.079
 might-0.072
 omega-0.066
gh-0.065
 hard-0.065
Token well
Token ID880
Feature activation-1.69e-02

Pos logit contributions
 older+0.038
 below+0.038
ider+0.037
ivated+0.037
itating+0.037
Neg logit contributions
 may-0.048
 might-0.043
xy-0.042
 omega-0.041
rick-0.041
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
 older+0.023
ivated+0.023
itating+0.023
saw+0.023
ider+0.022
Neg logit contributions
 may-0.026
 might-0.024
xy-0.023
gh-0.023
of-0.022
Token So
Token ID1406
Feature activation-2.99e-03

Pos logit contributions
 older+0.023
 below+0.023
ivated+0.022
itating+0.022
ider+0.022
Neg logit contributions
 may-0.020
xy-0.018
 omega-0.018
 might-0.018
of-0.018
Token her
Token ID607
Feature activation-1.03e-02

Pos logit contributions
 older+0.005
ivated+0.004
 below+0.004
 hesitant+0.004
ider+0.004
Neg logit contributions
 may-0.004
xy-0.004
 omega-0.004
of-0.004
rick-0.004
TokenWhy
Token ID5195
Feature activation+9.39e-03

Pos logit contributions
 below+0.002
ivated+0.002
 older+0.002
itating+0.002
but+0.002
Neg logit contributions
 may-0.003
xy-0.002
 might-0.002
 omega-0.002
gh-0.002
Token are
Token ID389
Feature activation-1.12e-02

Pos logit contributions
xy+0.011
rick+0.010
 omega+0.010
 silk+0.010
orm+0.009
Neg logit contributions
 older-0.017
 startled-0.016
 surprised-0.015
 bored-0.015
 ashamed-0.015
Token you
Token ID345
Feature activation-1.92e-03

Pos logit contributions
 older+0.021
ivated+0.019
 below+0.019
itating+0.019
ider+0.019
Neg logit contributions
 may-0.013
 omega-0.012
xy-0.012
of-0.011
rick-0.011
Token tearing
Token ID24447
Feature activation-5.62e-03

Pos logit contributions
 below+0.002
ider+0.002
but+0.002
itating+0.002
saw+0.002
Neg logit contributions
 may-0.003
 might-0.003
gh-0.003
 omega-0.003
rick-0.003
Token the
Token ID262
Feature activation-1.15e-02

Pos logit contributions
 older+0.007
 below+0.007
ivated+0.007
itating+0.007
 hesitant+0.006
Neg logit contributions
 may-0.010
xy-0.009
 omega-0.009
 might-0.009
of-0.009
Token table
Token ID3084
Feature activation-5.10e-02

Pos logit contributions
ivated+0.020
 hesitant+0.020
itating+0.020
 below+0.019
but+0.019
Neg logit contributions
 may-0.015
 might-0.014
 powder-0.013
 omega-0.013
 looks-0.013
Token?"
Token ID1701
Feature activation-3.92e-03

Pos logit contributions
ivated+0.064
ider+0.062
itating+0.061
 youngest+0.060
 below+0.059
Neg logit contributions
 may-0.087
 might-0.080
 looks-0.078
gh-0.076
of-0.075
Token asked
Token ID1965
Feature activation-2.59e-03

Pos logit contributions
 older+0.006
 below+0.006
ivated+0.006
itating+0.006
ider+0.006
Neg logit contributions
 may-0.006
 might-0.005
xy-0.005
 omega-0.005
of-0.005
Token Tim
Token ID5045
Feature activation-2.08e-02

Pos logit contributions
 older+0.005
 below+0.004
ivated+0.004
 hesitant+0.004
itating+0.004
Neg logit contributions
 may-0.003
 omega-0.003
xy-0.003
rick-0.003
gh-0.003
Token.
Token ID13
Feature activation-2.40e-04

Pos logit contributions
ider+0.036
ivated+0.034
 paramedics+0.033
 youngest+0.033
rier+0.033
Neg logit contributions
 may-0.026
 might-0.025
of-0.024
rick-0.021
gh-0.021
Token
Token ID198
Feature activation-6.88e-03

Pos logit contributions
 older+0.000
 below+0.000
ivated+0.000
itating+0.000
 hesitant+0.000
Neg logit contributions
 may-0.000
 omega-0.000
gh-0.000
of-0.000
xy-0.000
Token words
Token ID2456
Feature activation+6.80e-03

Pos logit contributions
 older+0.038
 below+0.033
vious+0.032
 bored+0.032
 into+0.031
Neg logit contributions
xy-0.037
 omega-0.035
gh-0.034
ww-0.034
rick-0.033
Token like
Token ID588
Feature activation+1.68e-02

Pos logit contributions
 may+0.012
 omega+0.011
xy+0.011
rick+0.011
of+0.011
Neg logit contributions
 older-0.009
 below-0.008
itating-0.008
ivated-0.008
 hesitant-0.008
Token "
Token ID366
Feature activation-1.09e-02

Pos logit contributions
 may+0.027
xy+0.027
 omega+0.027
rick+0.026
yn+0.026
Neg logit contributions
 older-0.025
 below-0.021
 bored-0.019
 youngest-0.019
 elderly-0.019
Tokenele
Token ID11129
Feature activation-4.45e-03

Pos logit contributions
 below+0.015
 older+0.015
but+0.015
itating+0.014
ivated+0.014
Neg logit contributions
 may-0.018
 omega-0.016
xy-0.015
gly-0.015
rick-0.015
Tokenphant
Token ID33959
Feature activation-4.67e-03

Pos logit contributions
 older+0.007
ider+0.006
vious+0.006
 able+0.006
 bored+0.006
Neg logit contributions
 omega-0.006
 may-0.006
rick-0.005
xy-0.005
ww-0.005
Token"
Token ID1
Feature activation-5.03e-02

Pos logit contributions
 older+0.006
 below+0.005
ider+0.005
ivated+0.005
itating+0.005
Neg logit contributions
 may-0.008
 omega-0.008
xy-0.008
 might-0.008
rick-0.008
Token and
Token ID290
Feature activation-4.60e-03

Pos logit contributions
 older+0.058
 below+0.056
but+0.053
 hesitant+0.052
ider+0.052
Neg logit contributions
 may-0.095
 omega-0.087
xy-0.086
rick-0.085
 might-0.084
Token "
Token ID366
Feature activation-1.40e-02

Pos logit contributions
 older+0.006
 below+0.006
ivated+0.005
 youngest+0.005
 bored+0.005
Neg logit contributions
 may-0.008
xy-0.007
of-0.007
rick-0.007
gh-0.007
Tokenrain
Token ID3201
Feature activation+1.56e-02

Pos logit contributions
 below+0.019
 older+0.019
but+0.018
ivated+0.018
itating+0.018
Neg logit contributions
 may-0.023
 omega-0.021
 might-0.020
xy-0.020
rick-0.020
Tokenbow
Token ID8176
Feature activation-1.21e-03

Pos logit contributions
 may+0.022
 omega+0.022
rick+0.021
orm+0.021
 might+0.020
Neg logit contributions
 older-0.022
but-0.022
 below-0.022
itating-0.021
ider-0.021
Token".
Token ID1911
Feature activation-1.05e-02

Pos logit contributions
 older+0.002
 below+0.002
itating+0.002
ider+0.002
 hesitant+0.001
Neg logit contributions
 may-0.002
 omega-0.002
xy-0.002
rick-0.002
orm-0.002
Token wanted
Token ID2227
Feature activation-1.26e-02

Pos logit contributions
 may+0.006
 omega+0.006
of+0.006
xy+0.005
ww+0.005
Neg logit contributions
 older-0.007
 hesitant-0.007
 bored-0.006
 youngest-0.006
 below-0.006
Token to
Token ID284
Feature activation-4.54e-03

Pos logit contributions
 older+0.014
 below+0.012
 bored+0.011
 hesitant+0.011
but+0.011
Neg logit contributions
 may-0.026
xy-0.024
 omega-0.024
of-0.024
ww-0.024
Token have
Token ID423
Feature activation-5.02e-03

Pos logit contributions
 older+0.007
 below+0.007
ivated+0.007
itating+0.007
 hesitant+0.007
Neg logit contributions
 may-0.006
xy-0.006
 omega-0.006
of-0.006
rick-0.006
Token a
Token ID257
Feature activation-1.44e-02

Pos logit contributions
 older+0.007
 below+0.006
 bored+0.006
 hesitant+0.006
 able+0.006
Neg logit contributions
 may-0.008
xy-0.008
of-0.008
rick-0.008
 omega-0.007
Token cool
Token ID3608
Feature activation-5.56e-03

Pos logit contributions
 older+0.024
 below+0.022
ivated+0.022
 hesitant+0.022
itating+0.022
Neg logit contributions
 may-0.020
xy-0.018
 omega-0.018
of-0.017
rick-0.017
Token hairst
Token ID42658
Feature activation-5.02e-02

Pos logit contributions
 older+0.008
 below+0.008
 hesitant+0.007
 bored+0.007
ider+0.007
Neg logit contributions
xy-0.009
 may-0.008
 omega-0.008
rick-0.008
of-0.008
Tokenyle
Token ID2349
Feature activation-1.38e-02

Pos logit contributions
 older+0.083
 paramedics+0.077
 entrance+0.076
 sidelines+0.075
 below+0.075
Neg logit contributions
w-0.060
 pencil-0.060
gh-0.059
 may-0.058
of-0.058
Token,
Token ID11
Feature activation-1.00e-02

Pos logit contributions
 older+0.020
ivated+0.020
 below+0.019
itating+0.019
 youngest+0.018
Neg logit contributions
 may-0.022
 might-0.020
of-0.019
xy-0.019
gh-0.019
Token but
Token ID475
Feature activation-2.58e-02

Pos logit contributions
 older+0.009
 hesitant+0.008
 below+0.007
ivated+0.007
 bored+0.007
Neg logit contributions
xy-0.022
 may-0.022
of-0.021
gly-0.021
rick-0.021
Token he
Token ID339
Feature activation+5.71e-05

Pos logit contributions
 older+0.039
 below+0.038
ivated+0.037
 hesitant+0.036
itating+0.036
Neg logit contributions
 may-0.038
 omega-0.035
xy-0.035
of-0.035
rick-0.035
Token didn
Token ID1422
Feature activation-7.77e-03

Pos logit contributions
 may+0.000
xy+0.000
 omega+0.000
of+0.000
 might+0.000
Neg logit contributions
 older-0.000
 below-0.000
ivated-0.000
ider-0.000
 hesitant-0.000
Token choose
Token ID3853
Feature activation+3.05e-03

Pos logit contributions
 older+0.014
 below+0.013
ivated+0.013
itating+0.013
ider+0.013
Neg logit contributions
 may-0.010
xy-0.009
 omega-0.009
 might-0.009
rick-0.009
Token a
Token ID257
Feature activation-1.01e-02

Pos logit contributions
 may+0.005
 might+0.004
xy+0.004
 omega+0.004
of+0.004
Neg logit contributions
 older-0.004
ivated-0.004
 below-0.004
itating-0.004
ider-0.004
Token mineral
Token ID18352
Feature activation-3.72e-02

Pos logit contributions
 older+0.018
 below+0.016
 hesitant+0.016
itating+0.016
 youngest+0.016
Neg logit contributions
xy-0.012
 may-0.012
rick-0.011
 omega-0.011
of-0.011
Token from
Token ID422
Feature activation-2.44e-02

Pos logit contributions
ivated+0.055
itating+0.055
arted+0.051
saw+0.051
 youngest+0.050
Neg logit contributions
 may-0.059
 might-0.054
 pencil-0.052
 would-0.050
ww-0.049
Token my
Token ID616
Feature activation-1.64e-02

Pos logit contributions
 older+0.038
ivated+0.038
itating+0.038
ider+0.038
 below+0.037
Neg logit contributions
 may-0.034
 might-0.031
xy-0.030
 omega-0.030
of-0.029
Token bag
Token ID6131
Feature activation-5.02e-02

Pos logit contributions
 older+0.020
 below+0.018
ider+0.017
 youngest+0.017
ivated+0.017
Neg logit contributions
 may-0.030
xy-0.028
 omega-0.028
of-0.027
rick-0.027
Token?"
Token ID1701
Feature activation-2.57e-03

Pos logit contributions
ivated+0.074
itating+0.068
 terrified+0.068
 youngest+0.068
saw+0.067
Neg logit contributions
 may-0.076
 might-0.071
 would-0.070
of-0.067
 can-0.064
Token
Token ID198
Feature activation-1.14e-03

Pos logit contributions
 older+0.004
ivated+0.004
itating+0.004
ider+0.004
 below+0.004
Neg logit contributions
 may-0.003
 might-0.003
of-0.003
xy-0.003
 omega-0.003
Token
Token ID198
Feature activation+2.88e-03

Pos logit contributions
 older+0.002
ivated+0.002
ider+0.002
itating+0.002
 below+0.002
Neg logit contributions
 may-0.001
of-0.001
 might-0.001
xy-0.001
 omega-0.001
TokenL
Token ID43
Feature activation-9.82e-03

Pos logit contributions
 may+0.003
of+0.003
 might+0.003
xy+0.003
rick+0.003
Neg logit contributions
 older-0.005
ivated-0.005
 below-0.005
itating-0.005
 hesitant-0.005
Tokenily
Token ID813
Feature activation-9.68e-03

Pos logit contributions
 older+0.013
ivated+0.012
itating+0.012
 below+0.012
 hesitant+0.012
Neg logit contributions
 may-0.016
of-0.016
xy-0.015
 might-0.015
 omega-0.015
Token love
Token ID1842
Feature activation+6.87e-03

Pos logit contributions
 older+0.010
 below+0.010
 youngest+0.010
ivated+0.010
 announced+0.010
Neg logit contributions
gh-0.007
xy-0.007
rick-0.007
 omega-0.007
 may-0.007
Token you
Token ID345
Feature activation+1.19e-02

Pos logit contributions
 may+0.011
gh+0.010
 omega+0.010
rick+0.010
xy+0.010
Neg logit contributions
 older-0.010
 below-0.009
 elderly-0.008
 into-0.008
 youngest-0.008
Token,
Token ID11
Feature activation-3.54e-02

Pos logit contributions
 may+0.019
 omega+0.019
xy+0.018
rick+0.018
of+0.017
Neg logit contributions
 older-0.016
ivated-0.014
ider-0.014
itating-0.014
 below-0.014
Token Grand
Token ID5675
Feature activation+6.07e-03

Pos logit contributions
 older+0.051
 hesitant+0.044
 below+0.044
ivated+0.043
itating+0.042
Neg logit contributions
 may-0.057
xy-0.055
gh-0.053
rick-0.053
 omega-0.053
Tokenma
Token ID2611
Feature activation+1.12e-02

Pos logit contributions
 may+0.012
 silk+0.011
 omega+0.011
 might+0.011
haw+0.011
Neg logit contributions
 older-0.007
 register-0.006
 manager-0.005
ider-0.005
but-0.005
Token"
Token ID1
Feature activation-5.01e-02

Pos logit contributions
 may+0.022
 omega+0.020
xy+0.020
rick+0.020
gh+0.019
Neg logit contributions
 older-0.012
 below-0.011
ider-0.011
itating-0.011
but-0.011
Token with
Token ID351
Feature activation+1.23e-02

Pos logit contributions
 older+0.067
 below+0.064
ivated+0.063
itating+0.062
ider+0.062
Neg logit contributions
 may-0.084
xy-0.076
 might-0.076
 omega-0.076
of-0.074
Token big
Token ID1263
Feature activation-9.01e-03

Pos logit contributions
ww+0.023
 may+0.023
xy+0.023
gh+0.023
haw+0.022
Neg logit contributions
 older-0.016
 elderly-0.013
 youngest-0.013
 bored-0.012
 below-0.012
Token letters
Token ID7475
Feature activation+1.35e-02

Pos logit contributions
 older+0.015
 bored+0.013
 below+0.013
 youngest+0.012
 window+0.012
Neg logit contributions
xy-0.014
gh-0.012
 may-0.012
ww-0.012
of-0.012
Token.
Token ID13
Feature activation-6.26e-03

Pos logit contributions
 may+0.021
 might+0.019
xy+0.019
 omega+0.019
of+0.019
Neg logit contributions
 older-0.019
 below-0.018
ivated-0.018
itating-0.018
ider-0.018
Token They
Token ID1119
Feature activation+9.62e-03

Pos logit contributions
ivated+0.011
 below+0.011
 older+0.011
ider+0.010
itating+0.010
Neg logit contributions
 may-0.008
 might-0.007
 omega-0.007
xy-0.007
of-0.007
Token Can
Token ID1680
Feature activation+1.74e-02

Pos logit contributions
 may+0.014
gh+0.013
xy+0.013
orm+0.013
 might+0.013
Neg logit contributions
 older-0.013
 below-0.012
ider-0.012
 hesitant-0.011
but-0.011
Token we
Token ID356
Feature activation-4.45e-03

Pos logit contributions
 may+0.028
orm+0.028
 omega+0.028
rick+0.028
xy+0.027
Neg logit contributions
 older-0.021
ider-0.020
but-0.020
 below-0.020
ivated-0.020
Token go
Token ID467
Feature activation-1.07e-03

Pos logit contributions
 older+0.007
 below+0.007
ivated+0.007
itating+0.007
ider+0.007
Neg logit contributions
 may-0.006
xy-0.005
 omega-0.005
rick-0.005
gh-0.005
Token to
Token ID284
Feature activation-1.98e-02

Pos logit contributions
rier+0.002
ivated+0.002
 youngest+0.002
itating+0.002
 older+0.002
Neg logit contributions
 may-0.002
of-0.001
 might-0.001
 would-0.001
 will-0.001
Token the
Token ID262
Feature activation-6.40e-03

Pos logit contributions
ider+0.039
itating+0.037
ivated+0.037
 hesitant+0.036
but+0.036
Neg logit contributions
 may-0.023
 might-0.020
of-0.020
 omega-0.020
xy-0.019
Token stable
Token ID8245
Feature activation-5.01e-02

Pos logit contributions
ivated+0.009
itating+0.008
 older+0.008
ider+0.008
 hesitant+0.008
Neg logit contributions
 may-0.011
 might-0.010
 omega-0.009
xy-0.009
of-0.009
Token and
Token ID290
Feature activation-3.34e-03

Pos logit contributions
ivated+0.092
arted+0.091
ider+0.083
itating+0.082
vious+0.082
Neg logit contributions
 would-0.058
 will-0.057
 is-0.055
 might-0.055
 may-0.053
Token show
Token ID905
Feature activation-7.55e-03

Pos logit contributions
ider+0.006
 older+0.005
ivated+0.005
itating+0.005
 below+0.005
Neg logit contributions
 may-0.005
 might-0.004
of-0.004
 omega-0.004
xy-0.004
Token him
Token ID683
Feature activation-1.88e-02

Pos logit contributions
 older+0.015
 below+0.014
ivated+0.014
itating+0.014
ider+0.014
Neg logit contributions
 may-0.008
xy-0.007
 might-0.007
 omega-0.007
rick-0.007
Token to
Token ID284
Feature activation-1.25e-02

Pos logit contributions
ivated+0.031
ider+0.031
itating+0.029
 hesitant+0.028
 startled+0.027
Neg logit contributions
 may-0.028
 might-0.026
of-0.025
 would-0.024
 looks-0.024
Token the
Token ID262
Feature activation-1.24e-02

Pos logit contributions
 older+0.019
 below+0.018
ivated+0.018
itating+0.017
 youngest+0.017
Neg logit contributions
 may-0.019
xy-0.017
 omega-0.017
rick-0.017
 might-0.017
Token making
Token ID1642
Feature activation+2.09e-02

Pos logit contributions
 may+0.021
 might+0.019
xy+0.018
 omega+0.018
rick+0.018
Neg logit contributions
 older-0.016
ider-0.015
ivated-0.015
itating-0.015
 below-0.015
Token "
Token ID366
Feature activation-9.10e-03

Pos logit contributions
 may+0.032
xy+0.029
 omega+0.028
rick+0.028
 might+0.028
Neg logit contributions
 older-0.033
 below-0.031
ivated-0.030
 hesitant-0.029
 startled-0.029
Tokenp
Token ID79
Feature activation+2.30e-02

Pos logit contributions
 older+0.013
 below+0.013
ivated+0.013
ider+0.013
itating+0.013
Neg logit contributions
 may-0.014
 omega-0.012
 might-0.012
xy-0.012
rick-0.012
Tokenew
Token ID413
Feature activation+3.78e-02

Pos logit contributions
 omega+0.034
xy+0.033
 may+0.032
rick+0.032
orm+0.030
Neg logit contributions
 older-0.034
 startled-0.032
 below-0.032
 hesitant-0.032
ivated-0.031
Token p
Token ID279
Feature activation+2.08e-02

Pos logit contributions
xy+0.062
 may+0.062
 omega+0.059
 meanings+0.058
 might+0.057
Neg logit contributions
 older-0.050
 startled-0.045
 into-0.045
 below-0.044
ivated-0.043
Tokenew
Token ID413
Feature activation+4.16e-02

Pos logit contributions
 may+0.034
 might+0.031
 omega+0.030
 would+0.029
gh+0.029
Neg logit contributions
 older-0.029
ider-0.028
itating-0.028
ivated-0.027
 below-0.026
Token"
Token ID1
Feature activation-3.90e-02

Pos logit contributions
 may+0.069
xy+0.068
 omega+0.065
 might+0.064
 meanings+0.062
Neg logit contributions
 older-0.056
 startled-0.050
 below-0.049
 into-0.049
ivated-0.048
Token sounds
Token ID5238
Feature activation+4.59e-02

Pos logit contributions
 older+0.047
 below+0.045
ivated+0.043
 hesitant+0.043
itating+0.043
Neg logit contributions
 may-0.071
 omega-0.065
xy-0.064
rick-0.064
 might-0.064
Token as
Token ID355
Feature activation+5.88e-03

Pos logit contributions
 may+0.068
 might+0.061
of+0.059
 omega+0.059
gh+0.058
Neg logit contributions
 older-0.068
ider-0.067
ivated-0.067
itating-0.066
 below-0.064
Token he
Token ID339
Feature activation-7.96e-04

Pos logit contributions
 may+0.009
of+0.008
rick+0.008
xy+0.008
 omega+0.008
Neg logit contributions
 older-0.009
 below-0.009
 hesitant-0.008
ivated-0.008
 startled-0.008
Token played
Token ID2826
Feature activation+8.90e-03

Pos logit contributions
 older+0.001
 below+0.001
 startled+0.001
ider+0.001
 announced+0.001
Neg logit contributions
 may-0.001
 omega-0.001
xy-0.001
ww-0.001
gh-0.001
Token of
Token ID286
Feature activation-2.93e-03

Pos logit contributions
 may+0.025
 might+0.024
xy+0.023
of+0.023
rick+0.023
Neg logit contributions
ivated-0.014
 older-0.014
 below-0.013
itating-0.013
 hesitant-0.013
Token the
Token ID262
Feature activation+2.33e-03

Pos logit contributions
 older+0.004
 below+0.004
ivated+0.004
itating+0.003
 hesitant+0.003
Neg logit contributions
 may-0.005
 omega-0.004
xy-0.004
 might-0.004
of-0.004
Token yard
Token ID12699
Feature activation-2.40e-02

Pos logit contributions
 may+0.003
 omega+0.003
xy+0.003
of+0.003
gh+0.003
Neg logit contributions
 older-0.004
 below-0.003
itating-0.003
ider-0.003
ivated-0.003
Token.
Token ID13
Feature activation+7.46e-03

Pos logit contributions
ivated+0.034
 older+0.031
itating+0.031
 hesitant+0.029
ider+0.029
Neg logit contributions
 may-0.039
 might-0.037
xy-0.035
of-0.034
rick-0.033
Token It
Token ID632
Feature activation+1.79e-02

Pos logit contributions
 may+0.010
 might+0.009
 omega+0.009
xy+0.009
of+0.009
Neg logit contributions
 older-0.012
 below-0.012
ivated-0.012
itating-0.011
ider-0.011
Token might
Token ID1244
Feature activation+3.67e-02

Pos logit contributions
 may+0.015
 omega+0.014
xy+0.014
gh+0.014
rick+0.014
Neg logit contributions
 older-0.037
 below-0.036
itating-0.035
 hesitant-0.035
ivated-0.035
Token cause
Token ID2728
Feature activation+5.64e-03

Pos logit contributions
xy+0.047
 may+0.047
 omega+0.045
rick+0.044
gly+0.044
Neg logit contributions
 older-0.062
 below-0.059
ivated-0.057
 hesitant-0.056
 bored-0.055
Token trouble
Token ID5876
Feature activation+1.56e-02

Pos logit contributions
 may+0.010
xy+0.010
gly+0.010
 omega+0.009
rick+0.009
Neg logit contributions
 older-0.008
 below-0.006
 bored-0.006
 hesitant-0.006
vious-0.006
Token."
Token ID526
Feature activation-3.15e-03

Pos logit contributions
 may+0.026
 might+0.024
xy+0.023
 omega+0.023
of+0.023
Neg logit contributions
 older-0.020
ivated-0.020
 below-0.019
itating-0.019
ider-0.019
Token Tom
Token ID4186
Feature activation-6.58e-04

Pos logit contributions
 older+0.005
 below+0.004
ivated+0.004
itating+0.004
 hesitant+0.004
Neg logit contributions
 may-0.005
xy-0.004
 omega-0.004
 might-0.004
rick-0.004
Token said
Token ID531
Feature activation+6.76e-03

Pos logit contributions
 older+0.001
 hesitant+0.001
ivated+0.001
 below+0.001
but+0.001
Neg logit contributions
 may-0.001
xy-0.001
 omega-0.001
gh-0.001
ww-0.001
Token.
Token ID13
Feature activation+4.77e-03

Pos logit contributions
 may+0.030
 might+0.027
xy+0.027
of+0.027
 omega+0.026
Neg logit contributions
 older-0.022
ivated-0.021
 below-0.021
ider-0.021
itating-0.021
Token He
Token ID679
Feature activation+1.78e-02

Pos logit contributions
 may+0.006
 might+0.005
xy+0.005
 omega+0.005
of+0.005
Neg logit contributions
 older-0.008
ivated-0.008
 below-0.008
ider-0.008
itating-0.008
Token said
Token ID531
Feature activation+1.32e-02

Pos logit contributions
 may+0.027
xy+0.024
 might+0.024
 omega+0.024
of+0.024
Neg logit contributions
 older-0.026
 below-0.026
ivated-0.025
itating-0.025
ider-0.025
Token it
Token ID340
Feature activation+4.69e-03

Pos logit contributions
 may+0.023
rick+0.022
 omega+0.022
xy+0.022
of+0.022
Neg logit contributions
 older-0.017
 below-0.016
 hesitant-0.015
but-0.015
ivated-0.015
Token would
Token ID561
Feature activation+3.88e-02

Pos logit contributions
 omega+0.007
ww+0.007
oof+0.007
orm+0.007
 fiber+0.007
Neg logit contributions
 older-0.007
 hesitant-0.007
 startled-0.007
 below-0.007
 nearest-0.007
Token be
Token ID307
Feature activation+3.71e-02

Pos logit contributions
xy+0.061
 omega+0.059
gly+0.058
ww+0.058
rick+0.057
Neg logit contributions
 older-0.056
 below-0.052
 hesitant-0.049
 longer-0.048
ivated-0.047
Token much
Token ID881
Feature activation+2.57e-02

Pos logit contributions
 may+0.060
of+0.055
rick+0.055
xy+0.054
 omega+0.054
Neg logit contributions
 older-0.057
 hesitant-0.051
 below-0.050
 bored-0.049
ivated-0.048
Token warmer
Token ID23254
Feature activation+2.71e-02

Pos logit contributions
 may+0.040
 might+0.038
 hard+0.034
 thumbs+0.034
 omega+0.034
Neg logit contributions
ivated-0.036
ider-0.036
itating-0.035
 below-0.034
but-0.034
Token outside
Token ID2354
Feature activation-2.87e-03

Pos logit contributions
 may+0.047
 omega+0.043
of+0.043
xy+0.043
rick+0.042
Neg logit contributions
 older-0.035
 below-0.034
ider-0.032
 hesitant-0.031
ivated-0.031
Token then
Token ID788
Feature activation+1.16e-02

Pos logit contributions
 older+0.004
 below+0.004
ivated+0.004
 hesitant+0.003
ider+0.003
Neg logit contributions
 may-0.005
 omega-0.005
xy-0.004
rick-0.004
of-0.004
Token.
Token ID13
Feature activation+2.34e-03

Pos logit contributions
 omega+0.020
 may+0.019
rick+0.019
xy+0.019
orm+0.019
Neg logit contributions
 below-0.015
 older-0.015
 behind-0.013
 hesitant-0.012
but-0.012
Token,"
Token ID553
Feature activation+8.92e-03

Pos logit contributions
 may+0.015
 might+0.014
of+0.013
xy+0.013
rick+0.013
Neg logit contributions
 older-0.016
ivated-0.016
itating-0.015
ider-0.015
 youngest-0.015
Token said
Token ID531
Feature activation-4.30e-03

Pos logit contributions
 may+0.014
xy+0.013
 omega+0.013
gh+0.012
rick+0.012
Neg logit contributions
 older-0.013
 below-0.013
ivated-0.012
 hesitant-0.012
itating-0.012
Token Lily
Token ID20037
Feature activation-8.65e-03

Pos logit contributions
 older+0.007
ider+0.007
ivated+0.007
itating+0.007
 below+0.007
Neg logit contributions
 may-0.006
 might-0.005
of-0.005
xy-0.005
 omega-0.005
Token.
Token ID13
Feature activation-5.85e-03

Pos logit contributions
ivated+0.014
ider+0.014
itating+0.014
 older+0.014
saw+0.013
Neg logit contributions
 may-0.011
 might-0.010
of-0.010
gh-0.010
yn-0.009
Token "
Token ID366
Feature activation+1.50e-02

Pos logit contributions
 older+0.010
itating+0.009
ivated+0.009
ider+0.009
 below+0.009
Neg logit contributions
 may-0.008
 might-0.007
 omega-0.007
of-0.007
xy-0.006
TokenDo
Token ID5211
Feature activation+4.51e-02

Pos logit contributions
 may+0.025
of+0.023
xy+0.023
 might+0.022
 omega+0.022
Neg logit contributions
 older-0.021
 below-0.019
ider-0.018
ivated-0.018
itating-0.018
Token you
Token ID345
Feature activation+9.12e-03

Pos logit contributions
xy+0.075
 omega+0.074
yn+0.072
ool+0.071
oc+0.070
Neg logit contributions
 older-0.053
ivated-0.047
 bored-0.045
 jealous-0.044
ider-0.043
Token want
Token ID765
Feature activation+1.34e-02

Pos logit contributions
 may+0.014
 might+0.013
 omega+0.012
xy+0.012
of+0.012
Neg logit contributions
 older-0.013
 below-0.013
ivated-0.013
itating-0.013
ider-0.013
Token to
Token ID284
Feature activation-4.94e-03

Pos logit contributions
 may+0.022
 might+0.020
of+0.019
 omega+0.019
gh+0.019
Neg logit contributions
ivated-0.018
itating-0.018
 older-0.018
ider-0.018
 below-0.018
Token jump
Token ID4391
Feature activation-1.76e-03

Pos logit contributions
ider+0.010
itating+0.010
ivated+0.010
 below+0.009
 older+0.009
Neg logit contributions
 may-0.005
of-0.005
 might-0.005
gh-0.004
 omega-0.004
Token in
Token ID287
Feature activation+1.10e-02

Pos logit contributions
 older+0.002
ider+0.002
ivated+0.002
 youngest+0.002
itating+0.002
Neg logit contributions
 may-0.003
of-0.003
 might-0.003
gh-0.003
xy-0.003
Token filled
Token ID5901
Feature activation+3.72e-03

Pos logit contributions
 may+0.028
 omega+0.026
xy+0.026
 might+0.025
rick+0.025
Neg logit contributions
 older-0.023
 below-0.023
ivated-0.022
itating-0.022
ider-0.022
Token the
Token ID262
Feature activation+1.26e-02

Pos logit contributions
 may+0.006
xy+0.005
 might+0.005
 omega+0.005
of+0.005
Neg logit contributions
 older-0.006
 below-0.006
ivated-0.005
itating-0.005
ider-0.005
Token air
Token ID1633
Feature activation+2.13e-02

Pos logit contributions
 may+0.021
 might+0.019
 omega+0.018
xy+0.018
rick+0.018
Neg logit contributions
 older-0.017
ivated-0.017
 below-0.016
itating-0.016
ider-0.016
Token.
Token ID13
Feature activation+1.87e-03

Pos logit contributions
 may+0.035
 might+0.032
xy+0.032
of+0.031
gh+0.031
Neg logit contributions
 older-0.028
ivated-0.027
ider-0.026
itating-0.026
 below-0.026
Token "
Token ID366
Feature activation+1.11e-02

Pos logit contributions
 may+0.002
 might+0.002
 omega+0.002
rick+0.002
xy+0.002
Neg logit contributions
 older-0.003
ider-0.003
ivated-0.003
 below-0.003
itating-0.003
TokenUh
Token ID34653
Feature activation+4.08e-02

Pos logit contributions
 may+0.016
xy+0.014
of+0.014
 might+0.014
 omega+0.014
Neg logit contributions
 older-0.017
 below-0.016
ivated-0.016
ider-0.016
itating-0.016
Token-
Token ID12
Feature activation-1.74e-03

Pos logit contributions
 may+0.061
 omega+0.056
rick+0.053
xy+0.053
 might+0.053
Neg logit contributions
 older-0.062
 below-0.060
ider-0.060
ivated-0.059
 hesitant-0.058
Tokenoh
Token ID1219
Feature activation-1.45e-03

Pos logit contributions
 older+0.003
 below+0.003
ivated+0.003
ider+0.003
itating+0.003
Neg logit contributions
 may-0.002
xy-0.002
 might-0.002
of-0.002
 omega-0.002
Token,"
Token ID553
Feature activation+2.60e-03

Pos logit contributions
 older+0.002
ivated+0.002
itating+0.002
 below+0.002
ider+0.002
Neg logit contributions
 may-0.002
gh-0.002
 might-0.002
ww-0.002
of-0.002
Token said
Token ID531
Feature activation-1.45e-02

Pos logit contributions
 may+0.004
 might+0.003
rick+0.003
 omega+0.003
xy+0.003
Neg logit contributions
ider-0.004
 older-0.004
itating-0.004
ivated-0.004
 below-0.004
Token Jack
Token ID3619
Feature activation+3.06e-04

Pos logit contributions
ider+0.026
itating+0.025
ivated+0.024
but+0.024
saw+0.023
Neg logit contributions
 may-0.017
gh-0.016
orm-0.016
gly-0.016
 might-0.015
Token he
Token ID339
Feature activation-1.28e-02

Pos logit contributions
 older+0.018
ivated+0.018
itating+0.017
ider+0.017
 below+0.016
Neg logit contributions
 may-0.023
 might-0.021
xy-0.020
gh-0.020
of-0.020
Token was
Token ID373
Feature activation-3.80e-03

Pos logit contributions
 older+0.024
 below+0.024
ivated+0.023
ider+0.023
itating+0.023
Neg logit contributions
 may-0.013
 omega-0.012
xy-0.012
rick-0.012
of-0.012
Token safe
Token ID3338
Feature activation-6.04e-03

Pos logit contributions
ider+0.007
saw+0.006
itating+0.006
ivated+0.006
 youngest+0.006
Neg logit contributions
gh-0.005
 may-0.004
 might-0.004
 hard-0.004
 would-0.004
Token.
Token ID13
Feature activation+3.23e-03

Pos logit contributions
ivated+0.009
 older+0.008
itating+0.008
saw+0.008
ider+0.008
Neg logit contributions
 may-0.009
 might-0.009
xy-0.008
gh-0.008
rick-0.008
Token The
Token ID383
Feature activation-3.65e-03

Pos logit contributions
 may+0.005
gh+0.005
 omega+0.005
xy+0.005
 might+0.005
Neg logit contributions
 below-0.004
 older-0.004
but-0.004
 hesitant-0.004
itating-0.004
Token bear
Token ID6842
Feature activation+1.24e-03

Pos logit contributions
 older+0.005
 below+0.005
ivated+0.005
ider+0.005
itating+0.005
Neg logit contributions
 may-0.006
xy-0.005
 omega-0.005
 might-0.005
of-0.005
Token never
Token ID1239
Feature activation-1.02e-02

Pos logit contributions
 may+0.002
 omega+0.001
xy+0.001
 might+0.001
rick+0.001
Neg logit contributions
 older-0.002
 below-0.002
ivated-0.002
 hesitant-0.002
itating-0.002
Token returned
Token ID4504
Feature activation+3.38e-03

Pos logit contributions
 older+0.015
 below+0.015
ivated+0.015
itating+0.014
ider+0.014
Neg logit contributions
 may-0.016
xy-0.014
 omega-0.014
 might-0.014
of-0.014
Token the
Token ID262
Feature activation-7.80e-03

Pos logit contributions
 may+0.006
 omega+0.006
xy+0.006
 might+0.006
of+0.006
Neg logit contributions
 older-0.004
 below-0.004
ivated-0.004
itating-0.003
ider-0.003
Token open
Token ID1280
Feature activation-1.29e-02

Pos logit contributions
ivated+0.011
 older+0.011
itating+0.011
 below+0.011
but+0.011
Neg logit contributions
 may-0.012
 might-0.011
 omega-0.010
rick-0.010
xy-0.010
Token pool
Token ID5933
Feature activation-8.83e-03

Pos logit contributions
ivated+0.020
itating+0.019
ider+0.019
but+0.018
 older+0.018
Neg logit contributions
 may-0.020
 might-0.018
of-0.017
rick-0.016
gh-0.016
Token,
Token ID11
Feature activation-6.92e-03

Pos logit contributions
 older+0.005
 below+0.005
ivated+0.005
itating+0.005
ider+0.005
Neg logit contributions
 may-0.005
xy-0.004
 might-0.004
 omega-0.004
of-0.004
Token but
Token ID475
Feature activation-7.93e-03

Pos logit contributions
 older+0.008
 below+0.008
ivated+0.008
 hesitant+0.008
itating+0.007
Neg logit contributions
 may-0.013
xy-0.012
 omega-0.012
of-0.012
rick-0.012
Token Tim
Token ID5045
Feature activation+1.40e-03

Pos logit contributions
 older+0.011
 below+0.011
ivated+0.011
 hesitant+0.011
itating+0.010
Neg logit contributions
 may-0.013
 omega-0.012
xy-0.012
of-0.011
rick-0.011
Tokenmy
Token ID1820
Feature activation+2.43e-02

Pos logit contributions
 may+0.002
 omega+0.002
xy+0.002
 silk+0.002
ww+0.002
Neg logit contributions
 older-0.002
 hesitant-0.002
 disappointed-0.002
 youngest-0.002
itating-0.002
Token kept
Token ID4030
Feature activation+1.34e-02

Pos logit contributions
 may+0.034
 omega+0.031
xy+0.031
of+0.030
rick+0.030
Neg logit contributions
 older-0.038
 hesitant-0.037
 below-0.036
ivated-0.036
itating-0.036
Token asking
Token ID4737
Feature activation+1.68e-02

Pos logit contributions
 may+0.023
 might+0.021
xy+0.021
 omega+0.020
gh+0.020
Neg logit contributions
 older-0.017
ider-0.017
 below-0.017
ivated-0.017
itating-0.016
Token and
Token ID290
Feature activation+5.74e-03

Pos logit contributions
ww+0.036
 may+0.033
 omega+0.033
xy+0.033
haw+0.032
Neg logit contributions
 older-0.019
but-0.017
 before-0.016
 manager-0.015
 below-0.015
Token asking
Token ID4737
Feature activation+1.88e-02

Pos logit contributions
 may+0.011
xy+0.010
 omega+0.010
of+0.010
rick+0.010
Neg logit contributions
 older-0.007
 below-0.006
ivated-0.006
 hesitant-0.006
ider-0.006
Token.
Token ID13
Feature activation+3.80e-03

Pos logit contributions
ww+0.040
 omega+0.037
 may+0.036
xy+0.035
rick+0.035
Neg logit contributions
 older-0.022
but-0.018
 below-0.018
 manager-0.017
 before-0.017
Token Finally
Token ID9461
Feature activation+4.83e-03

Pos logit contributions
 may+0.006
 omega+0.005
of+0.005
xy+0.005
rick+0.005
Neg logit contributions
 older-0.006
 below-0.006
 hesitant-0.005
but-0.005
ivated-0.005
Token,
Token ID11
Feature activation-6.00e-03

Pos logit contributions
 may+0.006
xy+0.006
 omega+0.006
rick+0.006
of+0.006
Neg logit contributions
 older-0.008
ivated-0.008
 below-0.008
 hesitant-0.008
ider-0.007
Token the
Token ID262
Feature activation-4.88e-03

Pos logit contributions
 may+0.012
xy+0.011
 omega+0.011
 might+0.011
of+0.011
Neg logit contributions
 older-0.010
 below-0.010
ivated-0.010
itating-0.009
ider-0.009
Token flashlight
Token ID38371
Feature activation-1.57e-03

Pos logit contributions
 older+0.008
 below+0.007
ider+0.007
 youngest+0.007
 bored+0.007
Neg logit contributions
 may-0.007
xy-0.007
of-0.006
 omega-0.006
rick-0.006
Token on
Token ID319
Feature activation-4.79e-03

Pos logit contributions
ivated+0.002
 older+0.002
itating+0.002
ider+0.002
 hesitant+0.002
Neg logit contributions
 may-0.002
 might-0.002
xy-0.002
of-0.002
gh-0.002
Token them
Token ID606
Feature activation+9.22e-03

Pos logit contributions
 older+0.007
 below+0.007
ivated+0.007
itating+0.006
 hesitant+0.006
Neg logit contributions
 may-0.008
xy-0.007
 omega-0.007
 might-0.007
of-0.007
Token.
Token ID13
Feature activation+1.73e-03

Pos logit contributions
 may+0.016
 omega+0.015
xy+0.015
 might+0.015
rick+0.015
Neg logit contributions
 older-0.011
 below-0.011
ivated-0.011
itating-0.010
ider-0.010
Token\n
Token ID198
Feature activation-2.00e-04

Pos logit contributions
 may+0.002
 might+0.002
 omega+0.002
xy+0.002
gh+0.002
Neg logit contributions
ivated-0.003
 older-0.003
itating-0.003
 below-0.003
ider-0.003
Token\n
Token ID198
Feature activation+1.30e-03

Pos logit contributions
 older+0.000
ivated+0.000
 below+0.000
ider+0.000
itating+0.000
Neg logit contributions
 may-0.000
xy-0.000
 might-0.000
of-0.000
 omega-0.000
Token"
Token ID1
Feature activation+9.02e-03

Pos logit contributions
 may+0.001
of+0.001
rick+0.001
xy+0.001
 omega+0.001
Neg logit contributions
 older-0.003
ivated-0.003
 below-0.002
itating-0.002
ider-0.002
TokenYes
Token ID5297
Feature activation+5.21e-03

Pos logit contributions
 may+0.015
 might+0.013
xy+0.013
 omega+0.013
of+0.013
Neg logit contributions
 older-0.013
 below-0.012
ivated-0.012
itating-0.012
ider-0.012
Token,
Token ID11
Feature activation-1.17e-02

Pos logit contributions
 may+0.006
 might+0.005
xy+0.005
 omega+0.005
of+0.005
Neg logit contributions
 older-0.010
 below-0.010
ivated-0.010
itating-0.009
ider-0.009
Token we
Token ID356
Feature activation+7.66e-03

Pos logit contributions
 older+0.018
 below+0.016
ivated+0.016
ider+0.016
 hesitant+0.016
Neg logit contributions
 may-0.018
xy-0.017
 omega-0.016
rick-0.016
of-0.016
Token her
Token ID607
Feature activation-1.07e-02

Pos logit contributions
haw+0.021
 may+0.021
 omega+0.021
xy+0.020
osaurs+0.020
Neg logit contributions
 older-0.014
 again-0.012
 longer-0.011
 into-0.011
 bored-0.010
Token happy
Token ID3772
Feature activation-1.25e-02

Pos logit contributions
ider+0.017
ivated+0.017
itating+0.017
 below+0.016
 older+0.016
Neg logit contributions
 may-0.015
 might-0.014
 omega-0.013
of-0.012
 thumbs-0.012
Token for
Token ID329
Feature activation-2.81e-03

Pos logit contributions
ivated+0.013
 older+0.013
ider+0.013
itating+0.013
 below+0.012
Neg logit contributions
 may-0.024
 might-0.022
gh-0.021
 omega-0.021
xy-0.021
Token a
Token ID257
Feature activation-2.78e-03

Pos logit contributions
 older+0.003
 longer+0.003
 before+0.003
 again+0.003
 below+0.003
Neg logit contributions
haw-0.006
ww-0.005
rick-0.005
xy-0.005
osaurs-0.005
Token long
Token ID890
Feature activation-9.91e-03

Pos logit contributions
 older+0.005
 below+0.004
itating+0.004
ivated+0.004
 hesitant+0.004
Neg logit contributions
 may-0.004
xy-0.004
 omega-0.004
rick-0.003
 might-0.003
Token time
Token ID640
Feature activation+1.20e-02

Pos logit contributions
 older+0.016
 below+0.016
 hesitant+0.015
ivated+0.015
 youngest+0.015
Neg logit contributions
 may-0.013
xy-0.013
 omega-0.011
rick-0.011
haw-0.011
Token.
Token ID13
Feature activation+2.89e-04

Pos logit contributions
 may+0.020
 might+0.018
xy+0.018
 omega+0.018
of+0.017
Neg logit contributions
 older-0.016
 below-0.016
ivated-0.016
itating-0.015
ider-0.015
Token From
Token ID3574
Feature activation-5.40e-03

Pos logit contributions
 may+0.000
xy+0.000
 omega+0.000
 might+0.000
gh+0.000
Neg logit contributions
 older-0.000
 below-0.000
ivated-0.000
itating-0.000
 hesitant-0.000
Token that
Token ID326
Feature activation-1.73e-04

Pos logit contributions
ider+0.008
 older+0.008
ivated+0.008
itating+0.008
 hesitant+0.007
Neg logit contributions
 may-0.008
 might-0.007
xy-0.007
gly-0.007
of-0.007
Token day
Token ID1110
Feature activation-6.60e-03

Pos logit contributions
 older+0.000
 below+0.000
 bored+0.000
 hesitant+0.000
 disappointed+0.000
Neg logit contributions
 omega-0.000
xy-0.000
 may-0.000
 meanings-0.000
orm-0.000
Token on
Token ID319
Feature activation+6.39e-03

Pos logit contributions
 older+0.007
ivated+0.007
 below+0.007
ider+0.006
itating+0.006
Neg logit contributions
 may-0.013
 omega-0.012
xy-0.012
 might-0.012
rick-0.012
Token,
Token ID11
Feature activation+8.22e-03

Pos logit contributions
 older+0.013
 below+0.013
ivated+0.012
ider+0.012
itating+0.012
Neg logit contributions
 may-0.017
 omega-0.016
xy-0.016
rick-0.016
of-0.016
Token and
Token ID290
Feature activation+3.78e-03

Pos logit contributions
 may+0.015
 omega+0.015
xy+0.015
of+0.015
rick+0.015
Neg logit contributions
 below-0.010
 older-0.009
 hesitant-0.009
but-0.008
ivated-0.008
Token it
Token ID340
Feature activation+1.32e-02

Pos logit contributions
 omega+0.006
gh+0.006
 may+0.006
xy+0.006
rick+0.006
Neg logit contributions
 below-0.005
 older-0.005
 hesitant-0.005
 startled-0.005
ivated-0.005
Token was
Token ID373
Feature activation+1.65e-02

Pos logit contributions
 may+0.016
 omega+0.014
xy+0.014
 might+0.014
of+0.014
Neg logit contributions
 older-0.023
 below-0.023
ivated-0.022
itating-0.022
ider-0.022
Token a
Token ID257
Feature activation+1.46e-02

Pos logit contributions
 may+0.026
 might+0.024
xy+0.023
 omega+0.023
gh+0.023
Neg logit contributions
ivated-0.023
 older-0.023
ider-0.023
itating-0.022
 below-0.022
Token bright
Token ID6016
Feature activation+1.26e-02

Pos logit contributions
 may+0.020
xy+0.020
 omega+0.019
rick+0.018
of+0.018
Neg logit contributions
 older-0.023
 below-0.022
ivated-0.022
 hesitant-0.022
itating-0.022
Token silver
Token ID8465
Feature activation+1.15e-02

Pos logit contributions
 may+0.017
 might+0.015
 omega+0.015
of+0.015
xy+0.015
Neg logit contributions
 older-0.020
ivated-0.020
itating-0.019
ider-0.019
 below-0.019
Token beacon
Token ID34538
Feature activation+1.16e-02

Pos logit contributions
 may+0.017
xy+0.015
 omega+0.015
 might+0.015
of+0.015
Neg logit contributions
 older-0.018
 below-0.017
ivated-0.017
ider-0.017
itating-0.016
Token in
Token ID287
Feature activation+2.03e-02

Pos logit contributions
 may+0.022
 might+0.020
xy+0.020
 omega+0.020
of+0.020
Neg logit contributions
 older-0.013
 below-0.012
ivated-0.012
itating-0.012
ider-0.012
Token the
Token ID262
Feature activation+5.42e-03

Pos logit contributions
 may+0.034
 omega+0.032
xy+0.032
of+0.031
haw+0.031
Neg logit contributions
 below-0.028
 older-0.027
ivated-0.024
itating-0.024
 hesitant-0.024
Token countryside
Token ID25708
Feature activation+1.03e-02

Pos logit contributions
 may+0.011
xy+0.011
 omega+0.011
gh+0.010
rick+0.010
Neg logit contributions
 older-0.006
 below-0.005
 youngest-0.005
itating-0.004
ivated-0.004
Token<|endoftext|>
Token ID50256
Feature activation-1.71e-02

Pos logit contributions
 may+0.000
gh+0.000
xy+0.000
 omega+0.000
 might+0.000
Neg logit contributions
 older-0.000
 below-0.000
 hesitant-0.000
but-0.000
ivated-0.000
TokenL
Token ID43
Feature activation+9.61e-03

Pos logit contributions
 older+0.018
 below+0.018
ivated+0.017
itating+0.017
ider+0.016
Neg logit contributions
 may-0.033
 omega-0.031
xy-0.030
rick-0.030
 might-0.030
Tokenily
Token ID813
Feature activation+4.11e-03

Pos logit contributions
 may+0.016
xy+0.014
 omega+0.014
 might+0.014
rick+0.014
Neg logit contributions
 older-0.013
 below-0.012
ivated-0.012
itating-0.012
ider-0.012
Token and
Token ID290
Feature activation+9.12e-03

Pos logit contributions
 may+0.006
xy+0.006
gh+0.006
 omega+0.006
orm+0.006
Neg logit contributions
 below-0.005
 older-0.005
 bored-0.005
 hesitant-0.005
 startled-0.005
Token Tom
Token ID4186
Feature activation+1.31e-02

Pos logit contributions
 may+0.014
xy+0.013
gh+0.013
 omega+0.012
orm+0.012
Neg logit contributions
 older-0.014
 below-0.013
ivated-0.013
 hesitant-0.012
 bored-0.012
Token were
Token ID547
Feature activation-1.74e-02

Pos logit contributions
xy+0.019
gh+0.018
 silk+0.018
 omega+0.017
 may+0.017
Neg logit contributions
 below-0.020
 bored-0.019
 older-0.019
vious-0.019
 startled-0.018
Token playing
Token ID2712
Feature activation-1.33e-02

Pos logit contributions
 below+0.028
itating+0.028
but+0.028
ider+0.027
ivated+0.027
Neg logit contributions
 may-0.021
 might-0.021
 hard-0.020
gh-0.020
 imagination-0.019
Token in
Token ID287
Feature activation-7.29e-03

Pos logit contributions
 older+0.018
itating+0.018
ivated+0.018
ider+0.017
 below+0.017
Neg logit contributions
 may-0.021
 might-0.020
gh-0.019
xy-0.019
orm-0.019
Token their
Token ID511
Feature activation+1.47e-02

Pos logit contributions
 older+0.009
 below+0.009
ivated+0.009
 youngest+0.008
itating+0.008
Neg logit contributions
 may-0.013
 omega-0.012
xy-0.012
gh-0.012
 might-0.012
Token room
Token ID2119
Feature activation-8.65e-03

Pos logit contributions
 may+0.026
 might+0.024
xy+0.024
 omega+0.024
rick+0.023
Neg logit contributions
 older-0.018
 below-0.017
ivated-0.017
itating-0.017
ider-0.017
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
itating+0.015
ivated+0.015
vious+0.014
 startled+0.014
saw+0.014
Neg logit contributions
 might-0.009
 may-0.009
gh-0.009
orm-0.008
 cr-0.008
Token\n
Token ID198
Feature activation-3.07e-03

Pos logit contributions
 older+0.008
 below+0.008
ivated+0.008
itating+0.008
ider+0.008
Neg logit contributions
 may-0.008
 omega-0.007
xy-0.007
 might-0.007
of-0.007
Token\n
Token ID198
Feature activation+4.61e-04

Pos logit contributions
 older+0.004
 below+0.004
ivated+0.004
itating+0.004
 hesitant+0.004
Neg logit contributions
 may-0.005
 omega-0.004
 might-0.004
xy-0.004
rick-0.004
TokenBut
Token ID1537
Feature activation-2.17e-02

Pos logit contributions
 may+0.001
xy+0.001
 might+0.001
 omega+0.001
of+0.001
Neg logit contributions
 older-0.001
 below-0.001
ivated-0.001
itating-0.001
ider-0.001
Token then
Token ID788
Feature activation+8.75e-03

Pos logit contributions
 older+0.028
 below+0.026
ivated+0.026
itating+0.026
ider+0.025
Neg logit contributions
 may-0.038
 might-0.034
xy-0.034
 omega-0.034
of-0.034
Token he
Token ID339
Feature activation-4.50e-03

Pos logit contributions
xy+0.014
 omega+0.014
 may+0.014
rick+0.013
of+0.013
Neg logit contributions
 older-0.012
 below-0.012
 startled-0.011
 hesitant-0.011
 bored-0.011
Token smiled
Token ID13541
Feature activation-1.59e-02

Pos logit contributions
 older+0.006
ivated+0.006
 below+0.006
itating+0.006
ider+0.006
Neg logit contributions
 may-0.007
 might-0.006
xy-0.006
 omega-0.006
of-0.006
Token at
Token ID379
Feature activation+1.02e-02

Pos logit contributions
 older+0.024
ivated+0.024
 below+0.023
itating+0.023
ider+0.023
Neg logit contributions
 may-0.024
 might-0.021
xy-0.021
 omega-0.020
of-0.020
Token her
Token ID607
Feature activation+6.19e-04

Pos logit contributions
 may+0.018
 omega+0.017
ww+0.017
gh+0.016
xy+0.016
Neg logit contributions
 below-0.014
 older-0.014
 behind-0.012
 youngest-0.012
ivated-0.011
Token and
Token ID290
Feature activation+8.18e-03

Pos logit contributions
 may+0.001
 might+0.001
xy+0.001
of+0.001
 omega+0.001
Neg logit contributions
ivated-0.001
ider-0.001
 older-0.001
itating-0.001
 below-0.001
Token said
Token ID531
Feature activation+1.58e-02

Pos logit contributions
 may+0.013
 might+0.011
xy+0.011
 omega+0.011
of+0.011
Neg logit contributions
 older-0.012
ivated-0.012
ider-0.012
itating-0.012
 below-0.011
Token he
Token ID339
Feature activation+9.49e-03

Pos logit contributions
 may+0.022
 omega+0.021
rick+0.021
xy+0.021
of+0.021
Neg logit contributions
 older-0.025
 below-0.024
 hesitant-0.024
ivated-0.023
ider-0.023
Token a
Token ID257
Feature activation+1.25e-02

Pos logit contributions
 may+0.007
xy+0.007
rick+0.006
gly+0.006
 omega+0.006
Neg logit contributions
 older-0.005
 below-0.005
ivated-0.005
 hesitant-0.005
 startled-0.005
Token time
Token ID640
Feature activation+1.69e-02

Pos logit contributions
 may+0.016
xy+0.014
 omega+0.014
haw+0.013
rick+0.013
Neg logit contributions
 older-0.021
 hesitant-0.021
ider-0.020
 below-0.020
ivated-0.020
Token there
Token ID612
Feature activation+8.95e-04

Pos logit contributions
 omega+0.027
 may+0.025
xy+0.025
rick+0.025
 meanings+0.025
Neg logit contributions
 older-0.024
 below-0.022
 hesitant-0.022
ivated-0.022
ider-0.021
Token was
Token ID373
Feature activation+1.09e-02

Pos logit contributions
 fiber+0.002
 omega+0.002
ww+0.002
rick+0.001
xy+0.001
Neg logit contributions
 below-0.001
 startled-0.001
 hesitant-0.001
 surprised-0.001
 older-0.001
Token an
Token ID281
Feature activation-1.80e-02

Pos logit contributions
 may+0.020
of+0.018
xy+0.018
 might+0.018
rick+0.018
Neg logit contributions
 older-0.013
 below-0.012
 bored-0.012
ivated-0.012
 hesitant-0.012
Token ugly
Token ID13400
Feature activation-1.55e-02

Pos logit contributions
itating+0.022
ider+0.022
ivated+0.021
 below+0.021
 older+0.021
Neg logit contributions
 may-0.030
 omega-0.028
rick-0.028
orm-0.028
xy-0.028
Token start
Token ID923
Feature activation-8.76e-03

Pos logit contributions
 older+0.022
 below+0.021
ivated+0.020
ider+0.020
 hesitant+0.020
Neg logit contributions
 may-0.024
xy-0.022
 omega-0.022
of-0.022
 might-0.022
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
ivated+0.012
 older+0.011
itating+0.011
 below+0.011
 hesitant+0.010
Neg logit contributions
 may-0.015
 might-0.014
xy-0.013
rick-0.013
gh-0.013
Token It
Token ID632
Feature activation+5.09e-03

Pos logit contributions
 below+0.016
 older+0.015
 hesitant+0.015
 startled+0.015
ivated+0.014
Neg logit contributions
 may-0.020
by-0.019
xy-0.019
gh-0.019
of-0.019
Token was
Token ID373
Feature activation+1.09e-02

Pos logit contributions
 omega+0.006
 may+0.006
xy+0.005
of+0.005
gh+0.005
Neg logit contributions
 older-0.009
 below-0.009
 hesitant-0.009
 startled-0.009
 bored-0.009
Token so
Token ID523
Feature activation+9.62e-03

Pos logit contributions
 may+0.017
xy+0.016
of+0.016
 omega+0.015
rick+0.015
Neg logit contributions
 older-0.017
 below-0.015
 hesitant-0.015
ivated-0.015
 bored-0.015
Token started
Token ID2067
Feature activation+2.56e-02

Pos logit contributions
 may+0.012
xy+0.010
 omega+0.010
 might+0.010
of+0.009
Neg logit contributions
 older-0.026
 below-0.025
ivated-0.025
itating-0.025
ider-0.025
Token to
Token ID284
Feature activation+2.28e-02

Pos logit contributions
 may+0.049
 omega+0.047
xy+0.046
of+0.046
rick+0.045
Neg logit contributions
 older-0.030
 below-0.027
 hesitant-0.026
 bored-0.025
ivated-0.025
Token get
Token ID651
Feature activation+3.03e-02

Pos logit contributions
 may+0.038
xy+0.036
 omega+0.035
rick+0.035
 might+0.034
Neg logit contributions
 older-0.033
 below-0.031
ivated-0.030
itating-0.029
 hesitant-0.028
Token dark
Token ID3223
Feature activation+1.70e-02

Pos logit contributions
 may+0.048
xy+0.044
 omega+0.043
 might+0.043
of+0.043
Neg logit contributions
 older-0.044
 below-0.042
ivated-0.041
itating-0.040
 hesitant-0.040
Token,
Token ID11
Feature activation-1.12e-02

Pos logit contributions
 may+0.025
 omega+0.023
xy+0.023
rick+0.022
of+0.022
Neg logit contributions
 older-0.027
 below-0.026
ivated-0.025
ider-0.024
itating-0.024
Token Jack
Token ID3619
Feature activation-5.51e-03

Pos logit contributions
 older+0.018
 below+0.018
ivated+0.018
itating+0.017
ider+0.017
Neg logit contributions
 may-0.015
xy-0.014
 might-0.014
 omega-0.014
of-0.013
Token had
Token ID550
Feature activation-1.99e-02

Pos logit contributions
 older+0.009
 below+0.008
ivated+0.008
itating+0.008
 hesitant+0.008
Neg logit contributions
 may-0.008
xy-0.007
 omega-0.007
 might-0.007
of-0.007
Token to
Token ID284
Feature activation-1.50e-03

Pos logit contributions
ivated+0.030
 below+0.030
ider+0.030
 older+0.030
itating+0.030
Neg logit contributions
 may-0.029
 might-0.027
 omega-0.026
rick-0.025
gh-0.025
Token go
Token ID467
Feature activation-3.25e-03

Pos logit contributions
 older+0.002
 below+0.002
ivated+0.002
 youngest+0.002
 hesitant+0.002
Neg logit contributions
 may-0.002
xy-0.002
 omega-0.002
 might-0.002
ww-0.002
Token home
Token ID1363
Feature activation-1.80e-05

Pos logit contributions
ivated+0.004
ider+0.004
 youngest+0.004
itating+0.004
rier+0.004
Neg logit contributions
 may-0.005
 might-0.005
of-0.005
 looks-0.004
 would-0.004
Token.
Token ID13
Feature activation+8.72e-04

Pos logit contributions
ivated+0.000
saw+0.000
itating+0.000
vious+0.000
ider+0.000
Neg logit contributions
 may-0.000
 might-0.000
gh-0.000
of-0.000
 hard-0.000
Token knew
Token ID2993
Feature activation-2.05e-02

Pos logit contributions
 may+0.012
xy+0.012
 omega+0.012
ww+0.011
gh+0.011
Neg logit contributions
 older-0.011
 below-0.011
 hesitant-0.010
 youngest-0.010
 startled-0.010
Token he
Token ID339
Feature activation+2.10e-03

Pos logit contributions
 older+0.028
 below+0.027
 hesitant+0.025
ivated+0.025
itating+0.024
Neg logit contributions
 may-0.033
 omega-0.032
xy-0.031
of-0.031
rick-0.031
Token shouldn
Token ID6584
Feature activation-4.00e-03

Pos logit contributions
ww+0.003
 omega+0.003
xy+0.003
of+0.003
gh+0.003
Neg logit contributions
 older-0.003
 below-0.003
 hesitant-0.003
 startled-0.003
 bored-0.003
Token't
Token ID470
Feature activation-6.95e-03

Pos logit contributions
 older+0.005
 below+0.005
itating+0.004
 hesitant+0.004
ivated+0.004
Neg logit contributions
 may-0.007
 omega-0.007
orm-0.007
xy-0.006
of-0.006
Token be
Token ID307
Feature activation-1.69e-02

Pos logit contributions
 older+0.011
 below+0.011
ivated+0.011
itating+0.011
 hesitant+0.011
Neg logit contributions
 may-0.009
xy-0.009
 omega-0.009
ww-0.008
rick-0.008
Token there
Token ID612
Feature activation-1.50e-02

Pos logit contributions
ider+0.019
itating+0.018
saw+0.017
but+0.017
ivated+0.017
Neg logit contributions
 may-0.030
 might-0.030
gh-0.028
orm-0.027
 omega-0.027
Token,
Token ID11
Feature activation-2.95e-02

Pos logit contributions
 older+0.018
 below+0.017
ivated+0.017
itating+0.017
ider+0.017
Neg logit contributions
 may-0.027
 omega-0.025
xy-0.024
 might-0.024
rick-0.024
Token but
Token ID475
Feature activation-2.81e-02

Pos logit contributions
 older+0.029
 below+0.028
 hesitant+0.025
ivated+0.024
 bored+0.024
Neg logit contributions
xy-0.061
 may-0.058
 omega-0.058
rick-0.058
gh-0.057
Token he
Token ID339
Feature activation-1.67e-03

Pos logit contributions
 older+0.043
 below+0.042
ivated+0.041
itating+0.040
 hesitant+0.040
Neg logit contributions
 may-0.041
xy-0.038
 omega-0.038
of-0.037
rick-0.037
Token couldn
Token ID3521
Feature activation-5.10e-03

Pos logit contributions
 older+0.003
 below+0.003
 hesitant+0.003
ider+0.003
ivated+0.003
Neg logit contributions
 may-0.002
xy-0.002
 omega-0.002
of-0.002
rick-0.002
Token't
Token ID470
Feature activation-3.87e-03

Pos logit contributions
 older+0.006
 below+0.006
ivated+0.006
itating+0.006
ider+0.005
Neg logit contributions
 may-0.009
 omega-0.009
rick-0.008
orm-0.008
of-0.008
Token the
Token ID262
Feature activation-2.86e-02

Pos logit contributions
 may+0.004
xy+0.003
of+0.003
 omega+0.003
rick+0.003
Neg logit contributions
 older-0.002
 below-0.002
 hesitant-0.002
 startled-0.002
ivated-0.002
Token most
Token ID749
Feature activation-5.05e-03

Pos logit contributions
 older+0.043
 below+0.040
 youngest+0.039
 hesitant+0.038
itating+0.037
Neg logit contributions
 may-0.045
xy-0.042
of-0.041
 omega-0.041
gh-0.040
Token adorable
Token ID29012
Feature activation-2.47e-04

Pos logit contributions
ivated+0.009
 older+0.008
itating+0.008
ider+0.008
 below+0.008
Neg logit contributions
 may-0.006
 might-0.006
 omega-0.006
xy-0.005
rick-0.005
Token creature
Token ID7185
Feature activation-4.66e-03

Pos logit contributions
 below+0.000
 older+0.000
 startled+0.000
 youngest+0.000
 behind+0.000
Neg logit contributions
 may-0.000
xy-0.000
 omega-0.000
of-0.000
rick-0.000
Token he
Token ID339
Feature activation-2.34e-02

Pos logit contributions
 older+0.006
ivated+0.006
itating+0.006
 below+0.006
ider+0.006
Neg logit contributions
 may-0.008
 might-0.007
xy-0.007
of-0.007
 omega-0.007
Token had
Token ID550
Feature activation-4.20e-02

Pos logit contributions
 below+0.042
 older+0.040
 behind+0.039
 startled+0.039
ider+0.039
Neg logit contributions
 may-0.025
 omega-0.025
xy-0.024
rick-0.024
ww-0.024
Token ever
Token ID1683
Feature activation-1.62e-02

Pos logit contributions
 older+0.097
ivated+0.096
itating+0.093
 paramedics+0.092
rier+0.092
Neg logit contributions
gh-0.031
 may-0.031
 might-0.029
of-0.024
orm-0.024
Token seen
Token ID1775
Feature activation-9.12e-03

Pos logit contributions
 older+0.016
 below+0.015
ivated+0.015
itating+0.014
ider+0.014
Neg logit contributions
 may-0.033
 might-0.030
xy-0.030
 omega-0.030
of-0.030
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
ivated+0.013
 older+0.013
itating+0.013
 hesitant+0.012
ider+0.012
Neg logit contributions
 may-0.014
 might-0.013
gh-0.013
xy-0.013
of-0.012
Token Henry
Token ID8616
Feature activation-1.65e-03

Pos logit contributions
 older+0.022
ider+0.022
ivated+0.022
itating+0.021
 below+0.021
Neg logit contributions
 may-0.016
 might-0.014
 omega-0.013
xy-0.013
of-0.013
Token wanted
Token ID2227
Feature activation-5.34e-03

Pos logit contributions
 older+0.002
 hesitant+0.002
 below+0.002
 startled+0.002
ivated+0.002
Neg logit contributions
xy-0.002
 omega-0.002
 may-0.002
gh-0.002
ww-0.002
Token Which
Token ID9022
Feature activation+1.22e-02

Pos logit contributions
 older+0.001
ivated+0.001
 below+0.001
itating+0.001
ider+0.001
Neg logit contributions
 may-0.001
 might-0.001
 omega-0.001
xy-0.001
of-0.001
Token one
Token ID530
Feature activation-1.90e-02

Pos logit contributions
 may+0.022
xy+0.021
rick+0.021
 omega+0.020
gh+0.020
Neg logit contributions
 older-0.014
 below-0.013
 hesitant-0.012
 bored-0.012
ivated-0.012
Token do
Token ID466
Feature activation-8.19e-03

Pos logit contributions
 older+0.033
 below+0.032
ivated+0.032
itating+0.032
ider+0.031
Neg logit contributions
 may-0.024
xy-0.021
 omega-0.021
 might-0.021
of-0.020
Token you
Token ID345
Feature activation-5.88e-03

Pos logit contributions
 older+0.012
 below+0.011
ider+0.011
 hesitant+0.010
ivated+0.010
Neg logit contributions
 may-0.012
rick-0.011
xy-0.011
 omega-0.011
orm-0.011
Token want
Token ID765
Feature activation-1.66e-02

Pos logit contributions
 older+0.009
itating+0.009
ivated+0.009
ider+0.009
 below+0.009
Neg logit contributions
 may-0.009
 might-0.008
 omega-0.007
xy-0.007
of-0.007
Token,
Token ID11
Feature activation-4.68e-02

Pos logit contributions
ivated+0.019
itating+0.019
 older+0.018
 below+0.018
ider+0.018
Neg logit contributions
 may-0.031
 might-0.030
xy-0.028
gh-0.028
of-0.028
Token Tom
Token ID4186
Feature activation-2.62e-02

Pos logit contributions
ider+0.079
itating+0.079
ivated+0.079
 below+0.078
 older+0.075
Neg logit contributions
 may-0.061
 omega-0.054
 might-0.054
of-0.050
rick-0.049
Token?"
Token ID1701
Feature activation-8.53e-03

Pos logit contributions
ivated+0.035
ider+0.034
itating+0.032
 youngest+0.031
but+0.031
Neg logit contributions
 may-0.045
 might-0.042
xy-0.041
 would-0.040
 looks-0.040
Token
Token ID198
Feature activation-4.69e-03

Pos logit contributions
ider+0.017
itating+0.016
ivated+0.016
rier+0.016
 older+0.016
Neg logit contributions
 may-0.009
 might-0.008
of-0.007
 omega-0.007
rick-0.007
Token
Token ID198
Feature activation+3.71e-03

Pos logit contributions
 older+0.008
ivated+0.008
ider+0.007
 below+0.007
itating+0.007
Neg logit contributions
 may-0.006
of-0.006
 might-0.006
xy-0.006
 omega-0.006
Token"
Token ID1
Feature activation+2.03e-03

Pos logit contributions
 may+0.004
of+0.003
rick+0.003
 omega+0.003
 might+0.003
Neg logit contributions
 older-0.008
 below-0.007
ivated-0.007
itating-0.007
ider-0.007
Token something
Token ID1223
Feature activation-2.04e-03

Pos logit contributions
 may+0.004
 might+0.004
of+0.004
 omega+0.004
gh+0.004
Neg logit contributions
itating-0.005
ivated-0.005
ider-0.005
 below-0.005
 older-0.005
Token to
Token ID284
Feature activation-8.40e-03

Pos logit contributions
ivated+0.003
 older+0.003
 below+0.003
itating+0.003
ider+0.003
Neg logit contributions
 may-0.003
 might-0.003
xy-0.003
 omega-0.003
gh-0.003
Token put
Token ID1234
Feature activation-1.43e-02

Pos logit contributions
 older+0.013
 below+0.013
ivated+0.012
 hesitant+0.012
 youngest+0.011
Neg logit contributions
 may-0.012
xy-0.011
rick-0.011
 omega-0.011
of-0.011
Token on
Token ID319
Feature activation-1.50e-02

Pos logit contributions
 below+0.017
 older+0.017
 behind+0.016
 into+0.015
 bored+0.015
Neg logit contributions
 may-0.025
 omega-0.025
rick-0.024
xy-0.024
of-0.024
Token the
Token ID262
Feature activation-1.45e-02

Pos logit contributions
 older+0.022
 below+0.021
ivated+0.020
 hesitant+0.019
but+0.019
Neg logit contributions
 may-0.023
 omega-0.022
xy-0.021
of-0.021
rick-0.021
Token plate
Token ID7480
Feature activation-4.35e-02

Pos logit contributions
 older+0.023
 below+0.022
ider+0.021
ivated+0.021
itating+0.020
Neg logit contributions
 may-0.021
xy-0.019
 omega-0.019
of-0.019
rick-0.018
Token?"
Token ID1701
Feature activation-1.05e-02

Pos logit contributions
ivated+0.061
itating+0.057
 youngest+0.057
 older+0.056
ider+0.054
Neg logit contributions
 may-0.067
of-0.064
 might-0.063
gh-0.061
 looks-0.060
Token He
Token ID679
Feature activation-3.37e-03

Pos logit contributions
ider+0.018
itating+0.018
ivated+0.017
 older+0.017
 below+0.017
Neg logit contributions
 may-0.013
 might-0.012
of-0.012
rick-0.011
xy-0.011
Token asked
Token ID1965
Feature activation-3.37e-03

Pos logit contributions
itating+0.004
ivated+0.004
 below+0.004
ider+0.003
 older+0.003
Neg logit contributions
 may-0.007
 might-0.006
 omega-0.006
of-0.006
rick-0.006
Token.
Token ID13
Feature activation-1.37e-03

Pos logit contributions
 older+0.005
ivated+0.005
itating+0.005
 below+0.005
ider+0.005
Neg logit contributions
 may-0.005
 might-0.005
xy-0.004
of-0.004
 omega-0.004
Token
Token ID220
Feature activation-1.86e-04

Pos logit contributions
ivated+0.002
 older+0.002
itating+0.002
 below+0.002
ider+0.002
Neg logit contributions
 may-0.002
 might-0.002
of-0.002
xy-0.002
 omega-0.002
Token reminds
Token ID17603
Feature activation+7.25e-03

Pos logit contributions
 may+0.012
 omega+0.011
xy+0.011
rick+0.011
gh+0.011
Neg logit contributions
 older-0.036
 below-0.035
itating-0.034
ivated-0.034
 hesitant-0.033
Token me
Token ID502
Feature activation-9.91e-03

Pos logit contributions
 may+0.013
 omega+0.012
xy+0.012
rick+0.012
haw+0.011
Neg logit contributions
 older-0.010
 below-0.008
ivated-0.008
 youngest-0.008
 elderly-0.008
Token of
Token ID286
Feature activation-7.59e-03

Pos logit contributions
 older+0.012
ivated+0.012
 below+0.012
itating+0.012
ider+0.012
Neg logit contributions
 may-0.017
 might-0.016
of-0.015
xy-0.015
 omega-0.015
Token when
Token ID618
Feature activation-4.37e-03

Pos logit contributions
 older+0.011
 below+0.010
 bored+0.009
 youngest+0.009
ivated+0.009
Neg logit contributions
 may-0.012
 omega-0.011
xy-0.011
rick-0.011
of-0.011
Token I
Token ID314
Feature activation-1.35e-02

Pos logit contributions
 older+0.007
 below+0.007
ivated+0.007
ider+0.007
itating+0.007
Neg logit contributions
 may-0.006
 omega-0.005
xy-0.005
rick-0.005
of-0.005
Token was
Token ID373
Feature activation-4.20e-02

Pos logit contributions
 older+0.025
ivated+0.024
 below+0.024
itating+0.024
ider+0.023
Neg logit contributions
 may-0.016
 might-0.014
of-0.014
xy-0.014
 omega-0.014
Token young
Token ID1862
Feature activation-2.02e-02

Pos logit contributions
but+0.063
owner+0.060
ider+0.059
rolley+0.059
saw+0.055
Neg logit contributions
 hard-0.059
 long-0.054
 might-0.054
 may-0.053
gh-0.053
Token.
Token ID13
Feature activation-2.36e-03

Pos logit contributions
ivated+0.036
itating+0.036
saw+0.036
but+0.035
uct+0.034
Neg logit contributions
 might-0.021
 may-0.021
 is-0.018
 hard-0.018
 can-0.018
Token I
Token ID314
Feature activation-1.91e-02

Pos logit contributions
 older+0.003
 below+0.003
ivated+0.003
ider+0.003
itating+0.003
Neg logit contributions
 may-0.004
xy-0.003
 might-0.003
 omega-0.003
of-0.003
Token used
Token ID973
Feature activation-6.82e-03

Pos logit contributions
 older+0.037
 below+0.037
ivated+0.036
itating+0.036
ider+0.036
Neg logit contributions
 may-0.019
xy-0.017
 omega-0.017
 might-0.016
of-0.016
Token to
Token ID284
Feature activation-5.51e-03

Pos logit contributions
 older+0.010
 below+0.009
 bored+0.008
 hesitant+0.008
 into+0.008
Neg logit contributions
xy-0.011
 may-0.010
rick-0.010
 omega-0.010
gh-0.010
Token.
Token ID13
Feature activation+5.12e-03

Pos logit contributions
 older+0.011
 below+0.010
ider+0.010
ivated+0.010
 hesitant+0.009
Neg logit contributions
 may-0.018
xy-0.017
 omega-0.016
rick-0.016
gh-0.016
Token Do
Token ID2141
Feature activation+3.89e-02

Pos logit contributions
 may+0.007
xy+0.006
 might+0.006
 omega+0.006
of+0.006
Neg logit contributions
 older-0.009
 below-0.008
ivated-0.008
ider-0.008
itating-0.008
Token you
Token ID345
Feature activation-1.19e-02

Pos logit contributions
 omega+0.062
xy+0.060
rick+0.059
oc+0.059
yn+0.059
Neg logit contributions
 older-0.046
ider-0.039
saw-0.039
but-0.038
ivated-0.037
Token have
Token ID423
Feature activation-2.66e-02

Pos logit contributions
 older+0.017
 below+0.017
ivated+0.017
itating+0.016
ider+0.016
Neg logit contributions
 may-0.019
 might-0.017
 omega-0.017
xy-0.016
of-0.016
Token any
Token ID597
Feature activation-3.65e-02

Pos logit contributions
 older+0.039
ivated+0.039
 below+0.039
itating+0.038
ider+0.038
Neg logit contributions
 may-0.040
 might-0.036
 omega-0.036
of-0.035
xy-0.035
Token money
Token ID1637
Feature activation-4.15e-02

Pos logit contributions
ivated+0.059
itating+0.059
 below+0.056
but+0.055
ider+0.054
Neg logit contributions
 may-0.052
 might-0.048
 omega-0.045
gh-0.045
ww-0.044
Token?"
Token ID1701
Feature activation-5.70e-03

Pos logit contributions
ivated+0.051
itating+0.049
 below+0.047
 hesitant+0.046
 older+0.046
Neg logit contributions
 may-0.074
 might-0.068
xy-0.067
 omega-0.067
gh-0.066
Token Dad
Token ID17415
Feature activation+7.06e-03

Pos logit contributions
ider+0.010
ivated+0.010
itating+0.010
 below+0.010
 older+0.010
Neg logit contributions
 may-0.007
 might-0.006
 omega-0.006
of-0.006
rick-0.006
Token says
Token ID1139
Feature activation+1.21e-02

Pos logit contributions
 may+0.010
xy+0.009
 omega+0.009
 might+0.009
gh+0.009
Neg logit contributions
 older-0.011
 below-0.011
ivated-0.011
itating-0.010
ider-0.010
Token,
Token ID11
Feature activation+6.74e-03

Pos logit contributions
 may+0.015
xy+0.013
 omega+0.013
 might+0.013
rick+0.012
Neg logit contributions
 older-0.022
 below-0.021
ivated-0.021
ider-0.021
itating-0.021
Token "
Token ID366
Feature activation+7.53e-03

Pos logit contributions
 may+0.009
 omega+0.008
 might+0.008
xy+0.008
of+0.008
Neg logit contributions
 older-0.010
itating-0.010
ivated-0.010
 below-0.010
vious-0.010