TOP ACTIVATIONS
MAX = 5.296

so he asked the ch ame leon to play with him
One day the ch ame leon was looking around and
sign that said ' Law yer ', and he went inside
. The ch ame leon was so grateful and
was a young bear called Be art ie . Be art
could go see the fire work show . Sophie and her
and the lightning struck the hyd rant . Suddenly water was
vegetables laughed at the boast ful onion . One of them
is mine !" The ch ame leon could tell he was
the park her " use ful - park ". <|endoftext|> Once
there was a girl named Be cca . Be cca had
of the body , the lung ." All the students looked
help heal the boo - bo o .
town , with Mr sm ith blinking the car lights to
Ho ppy and one called Cotton . They lived in a
that it was nearly sund own when they emerged from the
. It was a ch ame leon . The crocod ile
was happy and asked the mechanic to explain more things to
special , it was the only kind of its kind .
and the doctor said the patient was very helpful . They

MIN ACTIVATIONS
MIN = -5.176

She had entered a sand pit and it was getting bigger
. She had a red tr icycle with a big wheel
he saw a big sand pit filled with toys . He
did not know what a tr uffle was . They did
was playing in the sand pit when she saw something crazy
. She had a big tr icycle that she would ride
. Tim had a big maze in his backyard . He
books . She had many books in her room . She
to playing with his sand pit , but he could not
outside . He saw a tr amp oline ! He was
. He had a colorful sandbox filled with shiny toys and
. They had a big sandbox and many toys . They
books . She had many books in her room . She
books . She had many books with pictures and words .
books . She had many books on her shelf in her
books . She had many books in her room . She
books . She had many books in her room . She
. They had a big sandbox where they made castles and
When they reached the sand pit , Jessie could see that
books . She had many books in her room . Some

INTERVAL 2.678 - 5.296
CONTAINS 0.094%

the beautiful , colorful fire work displays spark in the sky
took their three - year - old daughter to the circus
" Uh - oh , what is that ?"
his grandparents would practice the healing style every day . <|endoftext|>
many different kinds of sa us ages everywhere she looked .

INTERVAL 0.060 - 2.678
CONTAINS 45.934%

John realized that measuring things made his heart happy and cool
, close his eyes , and spread his arms wide .
she gave him a big hug and said , " Thank
Dave was a big , strong bear and Amy was a
waiting . They walked her to the kitchen and showed her

INTERVAL -2.558 - 0.060
CONTAINS 53.736%

he finished , he bent down and whispered something wise in
Ben and Lily screamed . They were hurt by the blocks
ship . " I 'm the wind !"
love you . He likes to see them happy and dry
make a square and a circle . One day

INTERVAL -5.176 - -2.558
CONTAINS 0.236%

She makes a sad face at Tom . "
books on the books he lf . He wanted to find
guilty for not doing what her mom my said . She
and didn 't know what to do .
the boat and was amazed at all the new sights .
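
The four INTERVAL blocks above look like equal-width bins between MIN and MAX. The dump does not state the binning rule, so this is an inference from the printed bounds, not a documented behaviour. A minimal sketch that reproduces the bounds under that assumption (the function name `bin_edges` is illustrative only):

```python
def bin_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]

# MIN and MAX as printed in the dump.
edges = bin_edges(-5.176, 5.296)

# Shares printed under CONTAINS, listed from the lowest interval to the highest.
shares = [0.236, 53.736, 45.934, 0.094]
total_share = sum(shares)

print(edges)
print(total_share)
```

Under this assumption the computed edges match the printed interval bounds, and the four CONTAINS shares add up to 100%, so the intervals appear to cover every recorded activation.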
Token so
Token ID523
Feature activation-2.15e-01

Pos logit contributions
 Will+0.002
 louder+0.001
 three+0.001
 Ch+0.001
 3+0.001
Neg logit contributions
he-0.002
aren-0.002
pit-0.002
case-0.002
icycle-0.002
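
Each per-token record fuses labels and values: a contribution line is the token string followed directly by a signed decimal (for example ` Will+0.002` or `he-0.002`), and a leading space is part of the token. A small sketch for splitting such lines, assuming the value is always the trailing signed number; the helper names are hypothetical and not part of any tool that produced this dump:

```python
import re

# Token text (possibly empty or starting with a space), then a signed decimal.
_CONTRIB = re.compile(r"^(?P<token>.*?)(?P<value>[+-]\d+\.\d+)$")
# "Feature activation" followed directly by a signed number in scientific notation.
_ACTIVATION = re.compile(r"^Feature activation(?P<value>[+-]\d+\.\d+e[+-]\d+)$")

def parse_contribution(line):
    """Split a line such as ' Will+0.002' into (' Will', 0.002)."""
    m = _CONTRIB.match(line)
    if m is None:
        raise ValueError(f"not a contribution line: {line!r}")
    return m.group("token"), float(m.group("value"))

def parse_activation(line):
    """Extract the float from a line such as 'Feature activation-2.15e-01'."""
    m = _ACTIVATION.match(line)
    if m is None:
        raise ValueError(f"not an activation line: {line!r}")
    return float(m.group("value"))

print(parse_contribution(" Will+0.002"))
print(parse_contribution("he-0.002"))
print(parse_activation("Feature activation-2.15e-01"))
```

The lines are deliberately not stripped before matching, because the leading space distinguishes a word-initial token such as ` Will` from a word-internal one such as `he`.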
Token he
Token ID339
Feature activation-2.79e-01

Pos logit contributions
pit+0.007
he+0.006
aren+0.006
case+0.006
 benches+0.006
Neg logit contributions
 Will-0.006
 crew-0.006
 Ch-0.006
 three-0.006
 louder-0.006
Token asked
Token ID1965
Feature activation+8.07e-01

Pos logit contributions
he+0.009
pit+0.009
 benches+0.009
aren+0.009
case+0.008
Neg logit contributions
 Will-0.007
 crew-0.007
 pap-0.006
 three-0.006
 louder-0.006
Token the
Token ID262
Feature activation+4.27e-01

Pos logit contributions
 Will+0.023
ch+0.023
 crew+0.022
ksh+0.022
 pap+0.022
Neg logit contributions
 benches-0.021
he-0.020
pit-0.020
aren-0.019
case-0.019
Token ch
Token ID442
Feature activation+1.00e+00

Pos logit contributions
 Will+0.012
ksh+0.012
 Chick+0.012
 Amen+0.011
free+0.011
Neg logit contributions
pit-0.012
 bench-0.011
 benches-0.011
 sandbox-0.011
 swing-0.010
Tokename
Token ID480
Feature activation+5.06e+00

Pos logit contributions
 Will+0.031
 crew+0.028
 pap+0.027
 Chick+0.027
free+0.026
Neg logit contributions
pit-0.027
he-0.027
 benches-0.026
aren-0.025
case-0.025
Tokenleon
Token ID38970
Feature activation+2.36e+00

Pos logit contributions
 Will+0.117
V+0.112
 crew+0.110
 Chick+0.110
 pap+0.110
Neg logit contributions
pit-0.157
he-0.156
case-0.146
 benches-0.144
ement-0.142
Token to
Token ID284
Feature activation+9.55e-01

Pos logit contributions
 Will+0.057
 crew+0.053
 pap+0.051
 three+0.050
 Ch+0.048
Neg logit contributions
he-0.082
 benches-0.079
aren-0.077
pit-0.077
case-0.075
Token play
Token ID711
Feature activation-5.14e-01

Pos logit contributions
 Will+0.016
 crew+0.014
 three+0.013
 pap+0.013
 louder+0.012
Neg logit contributions
pit-0.039
he-0.039
 benches-0.038
case-0.038
awed-0.038
Token with
Token ID351
Feature activation-1.75e-01

Pos logit contributions
aren+0.013
pit+0.013
 benches+0.012
he+0.012
case+0.012
Neg logit contributions
 Will-0.017
 three-0.015
 crew-0.015
 louder-0.015
 Sp-0.015
Token him
Token ID683
Feature activation-2.57e-01

Pos logit contributions
pit+0.006
he+0.006
case+0.006
aren+0.006
 benches+0.006
Neg logit contributions
 Will-0.004
 crew-0.004
 three-0.003
 Sp-0.003
 louder-0.003
Token
Token ID198
Feature activation-3.85e-01

Pos logit contributions
he+0.011
case+0.010
 benches+0.010
aren+0.010
pit+0.009
Neg logit contributions
 Will-0.011
 crew-0.011
 pap-0.011
 louder-0.010
 Chick-0.010
TokenOne
Token ID3198
Feature activation-4.37e-01

Pos logit contributions
he+0.010
pit+0.010
 benches+0.010
case+0.010
aren+0.010
Neg logit contributions
 Will-0.012
 crew-0.011
 pap-0.011
 Chick-0.010
ch-0.010
Token day
Token ID1110
Feature activation+1.20e+00

Pos logit contributions
he+0.015
case+0.013
 bench+0.013
pit+0.013
aren+0.013
Neg logit contributions
 Will-0.010
 crew-0.010
V-0.009
 pap-0.009
ksh-0.009
Token the
Token ID262
Feature activation-3.36e-01

Pos logit contributions
 Will+0.045
 crew+0.042
 pap+0.041
 three+0.040
 Chick+0.040
Neg logit contributions
he-0.024
pit-0.024
 benches-0.023
case-0.023
aren-0.022
Token ch
Token ID442
Feature activation+5.86e-01

Pos logit contributions
 bench+0.009
 benches+0.009
pit+0.009
he+0.008
 slide+0.008
Neg logit contributions
ksh-0.010
 Will-0.009
free-0.009
 Amen-0.009
 Chick-0.009
Tokename
Token ID480
Feature activation+4.71e+00

Pos logit contributions
 Will+0.016
 crew+0.015
 pap+0.014
 Chick+0.014
 three+0.014
Neg logit contributions
he-0.018
pit-0.017
 benches-0.017
case-0.017
aren-0.017
Tokenleon
Token ID38970
Feature activation+1.96e+00

Pos logit contributions
 Will+0.114
 pap+0.107
V+0.107
 Yoga+0.106
 crew+0.105
Neg logit contributions
he-0.143
pit-0.142
case-0.137
 benches-0.131
awed-0.130
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
 Will+0.049
 crew+0.045
 pap+0.042
 three+0.041
 Ch+0.040
Neg logit contributions
 benches-0.068
he-0.067
pit-0.064
aren-0.063
ement-0.061
Token looking
Token ID2045
Feature activation+1.77e+00

Pos logit contributions
 Will+0.032
 crew+0.030
 pap+0.028
 Chick+0.028
free+0.028
Neg logit contributions
pit-0.034
he-0.034
case-0.033
 benches-0.033
aren-0.032
Token around
Token ID1088
Feature activation+2.36e-01

Pos logit contributions
 Will+0.037
 crew+0.034
 pap+0.032
 Chick+0.031
V+0.030
Neg logit contributions
he-0.064
pit-0.063
 benches-0.062
case-0.061
aren-0.060
Token and
Token ID290
Feature activation+3.35e-01

Pos logit contributions
 Will+0.007
 crew+0.006
 three+0.006
 pap+0.006
 louder+0.006
Neg logit contributions
pit-0.007
case-0.007
he-0.007
aren-0.007
 benches-0.007
Token sign
Token ID1051
Feature activation-4.82e-01

Pos logit contributions
 Will+0.009
 crew+0.009
 pap+0.008
 three+0.008
ch+0.008
Neg logit contributions
he-0.014
pit-0.014
 benches-0.014
aren-0.014
ue-0.013
Token that
Token ID326
Feature activation+1.78e+00

Pos logit contributions
pit+0.016
ue+0.016
aren+0.016
awed+0.016
he+0.015
Neg logit contributions
 three-0.011
 Will-0.011
 louder-0.010
ch-0.010
 crew-0.009
Token said
Token ID531
Feature activation+5.65e-01

Pos logit contributions
 Will+0.046
 crew+0.042
 pap+0.041
 Ch+0.039
 Chick+0.039
Neg logit contributions
pit-0.058
 benches-0.056
he-0.055
aren-0.055
ue-0.053
Token '
Token ID705
Feature activation+1.06e+00

Pos logit contributions
 Will+0.011
 crew+0.010
 pap+0.010
 Chick+0.009
ch+0.009
Neg logit contributions
he-0.021
pit-0.020
 benches-0.020
case-0.020
aren-0.020
TokenLaw
Token ID16966
Feature activation+1.21e+00

Pos logit contributions
 Will+0.027
 crew+0.026
 pap+0.025
 Chick+0.024
 louder+0.024
Neg logit contributions
he-0.033
pit-0.032
 benches-0.031
aren-0.031
case-0.031
Tokenyer
Token ID9860
Feature activation+4.61e+00

Pos logit contributions
 Will+0.034
 crew+0.031
 Chick+0.031
 pap+0.031
 Ch+0.030
Neg logit contributions
he-0.035
pit-0.034
case-0.034
aren-0.033
 benches-0.032
Token',
Token ID3256
Feature activation+1.19e+00

Pos logit contributions
 Will+0.120
 crew+0.118
 Chick+0.114
 pap+0.114
 louder+0.111
Neg logit contributions
he-0.130
pit-0.125
case-0.125
aren-0.124
 benches-0.121
Token and
Token ID290
Feature activation+6.18e-01

Pos logit contributions
 Will+0.037
 three+0.031
 crew+0.031
 Ch+0.030
 pap+0.030
Neg logit contributions
pit-0.037
aren-0.036
he-0.034
 benches-0.034
awed-0.034
Token he
Token ID339
Feature activation+8.25e-01

Pos logit contributions
 Will+0.017
 crew+0.015
 pap+0.015
 three+0.015
 Chick+0.014
Neg logit contributions
pit-0.019
he-0.019
 benches-0.018
aren-0.018
case-0.018
Token went
Token ID1816
Feature activation-2.16e-01

Pos logit contributions
 Will+0.023
 crew+0.021
 pap+0.020
 Chick+0.020
ch+0.020
Neg logit contributions
he-0.024
pit-0.024
 benches-0.023
case-0.023
aren-0.023
Token inside
Token ID2641
Feature activation+6.89e-01

Pos logit contributions
he+0.007
pit+0.007
 benches+0.007
case+0.007
aren+0.007
Neg logit contributions
 Will-0.005
 crew-0.005
 pap-0.004
 Chick-0.004
 three-0.004
Token.
Token ID13
Feature activation+7.39e-01

Pos logit contributions
pit+0.025
 benches+0.024
ue+0.024
icycle+0.024
aren+0.024
Neg logit contributions
 Will-0.025
 crew-0.023
 three-0.022
free-0.021
 Ch-0.021
Token
Token ID198
Feature activation+5.49e-01

Pos logit contributions
 Will+0.018
 Chick+0.015
 Ch+0.015
 Sp+0.015
 crew+0.015
Neg logit contributions
pit-0.026
he-0.025
 benches-0.025
case-0.024
aren-0.024
Token
Token ID198
Feature activation+6.06e-01

Pos logit contributions
 Will+0.013
 crew+0.012
 pap+0.012
 louder+0.011
 younger+0.011
Neg logit contributions
he-0.020
case-0.018
 benches-0.018
aren-0.017
pit-0.017
TokenThe
Token ID464
Feature activation+3.12e-01

Pos logit contributions
 Will+0.018
 crew+0.017
 pap+0.016
 Chick+0.016
ch+0.016
Neg logit contributions
he-0.016
pit-0.016
 benches-0.016
case-0.016
aren-0.015
Token ch
Token ID442
Feature activation+7.49e-01

Pos logit contributions
ksh+0.009
 Will+0.009
 Chick+0.009
 Amen+0.008
 Him+0.008
Neg logit contributions
pit-0.008
 benches-0.008
 slide-0.008
 sandbox-0.008
 swing-0.008
Tokename
Token ID480
Feature activation+4.57e+00

Pos logit contributions
 Will+0.021
 crew+0.019
 pap+0.019
 Chick+0.018
 Amen+0.018
Neg logit contributions
he-0.022
pit-0.022
 benches-0.021
case-0.020
aren-0.020
Tokenleon
Token ID38970
Feature activation+1.67e+00

Pos logit contributions
 Will+0.106
 crew+0.097
 pap+0.095
 Yoga+0.093
 Chick+0.092
Neg logit contributions
he-0.150
pit-0.149
case-0.142
 benches-0.141
aren-0.139
Token was
Token ID373
Feature activation+9.86e-01

Pos logit contributions
 Will+0.041
 crew+0.037
 pap+0.036
 three+0.035
 Ch+0.034
Neg logit contributions
 benches-0.062
he-0.057
pit-0.056
 purse-0.054
aren-0.054
Token so
Token ID523
Feature activation-2.84e-01

Pos logit contributions
 Will+0.022
 crew+0.020
 pap+0.020
 Chick+0.019
 three+0.019
Neg logit contributions
he-0.034
pit-0.034
 benches-0.033
case-0.033
aren-0.032
Token grateful
Token ID14066
Feature activation-2.72e-01

Pos logit contributions
he+0.010
pit+0.010
 benches+0.010
case+0.010
aren+0.010
Neg logit contributions
 Will-0.006
 crew-0.006
 pap-0.005
 Chick-0.005
ch-0.005
Token and
Token ID290
Feature activation-9.07e-02

Pos logit contributions
he+0.011
 benches+0.011
pit+0.011
case+0.011
aren+0.011
Neg logit contributions
 Will-0.005
 crew-0.004
 pap-0.004
 three-0.004
 louder-0.004
Token was
Token ID373
Feature activation+9.86e-01

Pos logit contributions
 Will+0.007
 Sp+0.006
 Ch+0.006
 three+0.006
 Chick+0.006
Neg logit contributions
pit-0.010
 benches-0.010
aren-0.009
he-0.009
case-0.009
Token a
Token ID257
Feature activation+3.22e-01

Pos logit contributions
 crew+0.019
 Will+0.019
 pap+0.018
ye+0.017
 louder+0.016
Neg logit contributions
he-0.036
 benches-0.035
case-0.034
pit-0.034
aren-0.033
Token young
Token ID1862
Feature activation+2.07e+00

Pos logit contributions
 Will+0.007
 crew+0.006
ksh+0.006
free+0.006
 Him+0.006
Neg logit contributions
 bench-0.011
he-0.011
pit-0.010
 sandbox-0.010
 purse-0.010
Token bear
Token ID6842
Feature activation+1.44e-01

Pos logit contributions
 Will+0.046
 crew+0.043
 pap+0.041
 three+0.040
 Chick+0.039
Neg logit contributions
he-0.072
pit-0.072
 benches-0.071
case-0.070
aren-0.069
Token called
Token ID1444
Feature activation+2.20e+00

Pos logit contributions
 Will+0.004
 crew+0.004
 Chick+0.004
 pap+0.004
 Ch+0.004
Neg logit contributions
pit-0.004
 benches-0.004
he-0.004
aren-0.004
case-0.004
Token Be
Token ID1355
Feature activation+4.54e+00

Pos logit contributions
 louder+0.062
 crew+0.061
 Amen+0.060
free+0.059
ksh+0.058
Neg logit contributions
 benches-0.055
he-0.053
aren-0.052
ue-0.051
pit-0.051
Tokenart
Token ID433
Feature activation+1.92e+00

Pos logit contributions
 Will+0.130
 Yoga+0.123
 Would+0.120
 Amen+0.120
ksh+0.119
Neg logit contributions
 than-0.117
aw-0.113
he-0.109
pit-0.107
awed-0.104
Tokenie
Token ID494
Feature activation+2.26e+00

Pos logit contributions
 Will+0.074
 pap+0.071
 crew+0.071
 Yoga+0.070
 Chick+0.069
Neg logit contributions
he-0.034
pit-0.031
case-0.029
 benches-0.028
aren-0.028
Token.
Token ID13
Feature activation+1.30e-01

Pos logit contributions
 Will+0.089
 crew+0.084
 pap+0.083
ch+0.082
V+0.082
Neg logit contributions
he-0.037
pit-0.036
 benches-0.034
case-0.034
aren-0.032
Token Be
Token ID1355
Feature activation+3.02e+00

Pos logit contributions
 crew+0.004
 Will+0.004
 pap+0.004
 louder+0.004
 Amen+0.004
Neg logit contributions
he-0.003
 benches-0.003
pit-0.003
case-0.003
aren-0.003
Tokenart
Token ID433
Feature activation+3.57e-01

Pos logit contributions
 Will+0.097
 Yoga+0.096
ksh+0.090
 July+0.089
 Would+0.088
Neg logit contributions
 than-0.075
he-0.071
aw-0.066
pit-0.064
awed-0.064
Token could
Token ID714
Feature activation+2.05e+00

Pos logit contributions
 pap+0.068
Ro+0.065
ch+0.064
V+0.063
 Will+0.063
Neg logit contributions
he-0.046
case-0.045
aren-0.039
 benches-0.037
hel-0.037
Token go
Token ID467
Feature activation-4.31e-02

Pos logit contributions
 Will+0.045
 crew+0.041
 pap+0.040
 Chick+0.039
ch+0.038
Neg logit contributions
he-0.072
pit-0.069
case-0.069
 benches-0.068
aren-0.067
Token see
Token ID766
Feature activation+1.34e+00

Pos logit contributions
he+0.001
 benches+0.001
pit+0.001
case+0.001
aren+0.001
Neg logit contributions
 Will-0.001
 pap-0.001
 crew-0.001
 Chick-0.001
V-0.001
Token the
Token ID262
Feature activation+7.41e-02

Pos logit contributions
 Will+0.033
 crew+0.031
 pap+0.031
ch+0.030
V+0.030
Neg logit contributions
he-0.042
 benches-0.041
pit-0.040
case-0.039
aren-0.039
Token fire
Token ID2046
Feature activation+2.63e+00

Pos logit contributions
 Will+0.002
 pap+0.002
 Chick+0.002
V+0.002
 Sp+0.002
Neg logit contributions
he-0.002
case-0.002
pit-0.002
 benches-0.002
ement-0.002
Tokenwork
Token ID1818
Feature activation+4.51e+00

Pos logit contributions
 Will+0.047
 pap+0.042
 crew+0.041
V+0.040
 Chick+0.039
Neg logit contributions
he-0.102
pit-0.102
case-0.098
 benches-0.097
aren-0.094
Token show
Token ID905
Feature activation+1.55e+00

Pos logit contributions
 Will+0.140
 crew+0.132
 pap+0.127
 three+0.124
ch+0.123
Neg logit contributions
he-0.116
pit-0.115
 benches-0.113
aren-0.110
case-0.109
Token.
Token ID13
Feature activation+6.58e-01

Pos logit contributions
 Will+0.050
 crew+0.047
 pap+0.045
 three+0.044
 Chick+0.044
Neg logit contributions
he-0.038
pit-0.038
 benches-0.037
case-0.036
aren-0.036
Token Sophie
Token ID35331
Feature activation-8.46e-01

Pos logit contributions
 Will+0.017
 three+0.016
 Yoga+0.015
 3+0.015
 younger+0.015
Neg logit contributions
he-0.020
pit-0.020
case-0.020
 benches-0.019
opl-0.019
Token and
Token ID290
Feature activation+2.71e-01

Pos logit contributions
he+0.025
pit+0.025
 benches+0.025
aren+0.024
case+0.023
Neg logit contributions
 Will-0.024
 crew-0.022
 pap-0.021
 three-0.021
 Chick-0.020
Token her
Token ID607
Feature activation-5.13e-01

Pos logit contributions
 crew+0.010
 Will+0.010
ch+0.010
ye+0.009
 louder+0.009
Neg logit contributions
he-0.005
pit-0.005
 benches-0.005
aren-0.005
 bench-0.005
Token and
Token ID290
Feature activation+3.01e-01

Pos logit contributions
 Will+0.014
 crew+0.012
 pap+0.012
 three+0.011
 Chick+0.011
Neg logit contributions
he-0.022
pit-0.022
 benches-0.022
case-0.022
aren-0.021
Token the
Token ID262
Feature activation-9.71e-01

Pos logit contributions
 Will+0.007
 crew+0.007
 pap+0.007
 Amen+0.007
free+0.006
Neg logit contributions
he-0.009
 benches-0.009
pit-0.009
case-0.009
aren-0.009
Token lightning
Token ID14357
Feature activation+1.63e+00

Pos logit contributions
 benches+0.032
he+0.031
pit+0.031
case+0.030
 bench+0.029
Neg logit contributions
 Will-0.024
 pap-0.021
V-0.020
 crew-0.020
 Chick-0.020
Token struck
Token ID7425
Feature activation+1.78e+00

Pos logit contributions
 Will+0.044
 crew+0.040
ch+0.039
 pap+0.039
 Chick+0.038
Neg logit contributions
pit-0.049
he-0.048
 benches-0.047
aren-0.047
case-0.046
Token the
Token ID262
Feature activation+7.11e-02

Pos logit contributions
 Will+0.043
 crew+0.042
 pap+0.040
 Amen+0.038
 Chick+0.038
Neg logit contributions
he-0.057
 benches-0.056
pit-0.056
case-0.053
aren-0.052
Token hyd
Token ID7409
Feature activation+4.43e+00

Pos logit contributions
 Will+0.002
 crew+0.002
 pap+0.002
 Chick+0.002
 Yoga+0.002
Neg logit contributions
he-0.002
 benches-0.002
pit-0.002
case-0.002
aren-0.002
Tokenrant
Token ID5250
Feature activation+2.65e+00

Pos logit contributions
 Will+0.128
 pap+0.123
 crew+0.115
 Mass+0.110
V+0.107
Neg logit contributions
he-0.132
pit-0.130
case-0.126
ement-0.122
 benches-0.121
Token.
Token ID13
Feature activation+3.55e-01

Pos logit contributions
 Will+0.079
 crew+0.074
 three+0.073
ch+0.073
free+0.071
Neg logit contributions
aren-0.071
he-0.070
pit-0.069
case-0.065
 benches-0.065
Token Suddenly
Token ID24975
Feature activation+2.35e-01

Pos logit contributions
 Will+0.009
 crew+0.009
 pap+0.009
 Chick+0.008
 louder+0.008
Neg logit contributions
he-0.011
 benches-0.010
pit-0.010
case-0.010
aren-0.010
Token water
Token ID1660
Feature activation+6.07e-01

Pos logit contributions
 Will+0.009
 crew+0.008
 pap+0.008
 Chick+0.008
 three+0.008
Neg logit contributions
he-0.005
pit-0.005
 benches-0.005
case-0.005
aren-0.005
Token was
Token ID373
Feature activation+3.46e-01

Pos logit contributions
 Will+0.016
 crew+0.015
 pap+0.014
ch+0.014
 Chick+0.014
Neg logit contributions
pit-0.018
he-0.018
aren-0.018
 benches-0.018
case-0.018
Token vegetables
Token ID13701
Feature activation-3.17e-01

Pos logit contributions
 Will+0.001
 crew+0.001
 pap+0.001
 Chick+0.001
ch+0.001
Neg logit contributions
he-0.001
pit-0.001
 benches-0.001
case-0.001
aren-0.001
Token laughed
Token ID13818
Feature activation+2.41e-01

Pos logit contributions
aren+0.010
 benches+0.010
pit+0.010
ue+0.010
icycle+0.009
Neg logit contributions
 Will-0.008
 three-0.007
 roasted-0.007
free-0.007
 pap-0.007
Token at
Token ID379
Feature activation-1.63e+00

Pos logit contributions
 Will+0.010
 crew+0.009
 pap+0.009
 louder+0.009
 three+0.009
Neg logit contributions
he-0.004
pit-0.004
aren-0.004
 benches-0.004
case-0.004
Token the
Token ID262
Feature activation+3.76e-01

Pos logit contributions
he+0.055
pit+0.055
case+0.054
 benches+0.052
aren+0.052
Neg logit contributions
 Will-0.040
 crew-0.035
 pap-0.034
 three-0.033
 Chick-0.033
Token boast
Token ID28020
Feature activation+3.57e+00

Pos logit contributions
 Will+0.009
 crew+0.008
 pap+0.008
 Chick+0.007
ch+0.007
Neg logit contributions
he-0.013
pit-0.012
 benches-0.012
case-0.012
aren-0.012
Tokenful
Token ID913
Feature activation+4.38e+00

Pos logit contributions
 Will+0.088
 crew+0.081
 pap+0.078
 three+0.075
 Chick+0.075
Neg logit contributions
he-0.115
pit-0.114
 benches-0.113
case-0.110
aren-0.110
Token onion
Token ID21670
Feature activation+1.77e+00

Pos logit contributions
 Will+0.136
 crew+0.128
 pap+0.125
 Chick+0.121
 three+0.120
Neg logit contributions
he-0.112
pit-0.110
 benches-0.109
case-0.106
aren-0.105
Token.
Token ID13
Feature activation-2.71e-01

Pos logit contributions
 Will+0.063
 pap+0.059
 crew+0.059
 three+0.058
 louder+0.058
Neg logit contributions
aren-0.040
he-0.039
 benches-0.038
pit-0.037
case-0.036
Token One
Token ID1881
Feature activation-3.54e-01

Pos logit contributions
he+0.009
pit+0.008
 benches+0.008
case+0.008
aren+0.008
Neg logit contributions
 Will-0.007
 pap-0.006
 crew-0.006
 Chick-0.006
 Yoga-0.006
Token of
Token ID286
Feature activation+4.48e-01

Pos logit contributions
he+0.013
pit+0.013
 benches+0.013
case+0.013
aren+0.013
Neg logit contributions
 Will-0.007
 crew-0.006
 pap-0.006
 three-0.006
ch-0.006
Token them
Token ID606
Feature activation-7.13e-01

Pos logit contributions
 Will+0.010
 crew+0.010
 pap+0.009
 Chick+0.009
ch+0.009
Neg logit contributions
he-0.015
pit-0.015
 benches-0.015
case-0.014
aren-0.014
Token is
Token ID318
Feature activation+8.92e-02

Pos logit contributions
 Will+0.009
 pap+0.008
 crew+0.008
 three+0.008
 Chick+0.008
Neg logit contributions
he-0.011
 benches-0.011
pit-0.011
aren-0.011
case-0.010
Token mine
Token ID6164
Feature activation-1.43e+00

Pos logit contributions
 Will+0.002
 crew+0.002
 pap+0.002
V+0.002
 Chick+0.002
Neg logit contributions
he-0.003
pit-0.003
 benches-0.003
case-0.003
aren-0.003
Token!"
Token ID2474
Feature activation+4.04e-01

Pos logit contributions
he+0.043
aren+0.042
 benches+0.041
awed+0.039
hel+0.039
Neg logit contributions
 Will-0.040
 three-0.039
 Sp-0.037
free-0.035
 Ch-0.035
Token The
Token ID383
Feature activation+3.09e-01

Pos logit contributions
 Will+0.009
 Sp+0.007
 three+0.007
 Ch+0.007
 Big+0.007
Neg logit contributions
pit-0.016
aren-0.015
he-0.015
case-0.015
 benches-0.015
Token ch
Token ID442
Feature activation+3.74e-01

Pos logit contributions
 Will+0.010
ksh+0.010
 Amen+0.010
 Chick+0.009
 Him+0.009
Neg logit contributions
 bench-0.007
pit-0.007
 benches-0.007
 sandbox-0.007
 statue-0.006
Tokename
Token ID480
Feature activation+4.36e+00

Pos logit contributions
 Will+0.011
 crew+0.010
 pap+0.010
 Chick+0.010
ch+0.010
Neg logit contributions
he-0.010
pit-0.010
 benches-0.010
case-0.010
aren-0.010
Tokenleon
Token ID38970
Feature activation+1.71e+00

Pos logit contributions
 Will+0.102
 crew+0.096
 pap+0.094
 Chick+0.090
 Yoga+0.089
Neg logit contributions
he-0.142
pit-0.140
case-0.136
 benches-0.134
aren-0.132
Token could
Token ID714
Feature activation+3.90e-01

Pos logit contributions
 Will+0.044
 three+0.038
 crew+0.038
 pap+0.037
 Ch+0.037
Neg logit contributions
 benches-0.061
he-0.057
pit-0.055
aren-0.053
 bench-0.052
Token tell
Token ID1560
Feature activation-4.07e-02

Pos logit contributions
 Will+0.008
 three+0.007
 help+0.006
 Ch+0.006
 pap+0.006
Neg logit contributions
pit-0.017
 benches-0.015
ement-0.015
aren-0.015
awed-0.015
Token he
Token ID339
Feature activation+8.97e-01

Pos logit contributions
pit+0.001
he+0.001
case+0.001
aren+0.001
 benches+0.001
Neg logit contributions
 Will-0.001
 crew-0.001
 pap-0.001
 three-0.001
 Ch-0.001
Token was
Token ID373
Feature activation+1.59e-01

Pos logit contributions
 Will+0.021
 three+0.018
 crew+0.018
 louder+0.017
 pap+0.017
Neg logit contributions
he-0.032
pit-0.032
 benches-0.032
aren-0.030
case-0.029
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
 Will+0.024
 crew+0.022
 pap+0.022
 three+0.021
 Chick+0.021
Neg logit contributions
pit-0.025
he-0.025
 benches-0.024
case-0.024
aren-0.023
Token park
Token ID3952
Feature activation-7.23e-02

Pos logit contributions
 Will+0.032
 crew+0.030
 pap+0.029
 Chick+0.028
ch+0.028
Neg logit contributions
he-0.045
pit-0.044
 benches-0.044
case-0.043
aren-0.042
Token her
Token ID607
Feature activation+8.13e-01

Pos logit contributions
he+0.002
pit+0.002
awed+0.002
 benches+0.002
aren+0.002
Neg logit contributions
 Will-0.002
 three-0.002
 crew-0.002
 louder-0.002
 pap-0.002
Token "
Token ID366
Feature activation-1.67e-01

Pos logit contributions
 Will+0.018
 crew+0.018
 pap+0.017
 three+0.017
 Ch+0.016
Neg logit contributions
he-0.028
pit-0.027
 benches-0.027
awed-0.027
aren-0.027
Tokenuse
Token ID1904
Feature activation+1.87e+00

Pos logit contributions
pit+0.005
 benches+0.005
case+0.005
he+0.005
aren+0.005
Neg logit contributions
 Will-0.004
ch-0.004
free-0.004
 crew-0.004
 three-0.004
Tokenful
Token ID913
Feature activation+4.32e+00

Pos logit contributions
 Will+0.048
 crew+0.045
 pap+0.043
 Chick+0.042
ch+0.042
Neg logit contributions
he-0.057
pit-0.056
 benches-0.056
aren-0.054
case-0.054
Token-
Token ID12
Feature activation+2.70e+00

Pos logit contributions
 Will+0.121
 crew+0.113
V+0.110
 Chick+0.109
ch+0.108
Neg logit contributions
pit-0.119
he-0.119
case-0.113
aren-0.110
ue-0.110
Tokenpark
Token ID20928
Feature activation+3.71e+00

Pos logit contributions
 Will+0.072
 crew+0.071
 pap+0.068
 Chick+0.065
 younger+0.064
Neg logit contributions
he-0.080
aren-0.076
pit-0.076
case-0.074
 benches-0.074
Token".
Token ID1911
Feature activation+1.01e+00

Pos logit contributions
 Will+0.105
 crew+0.100
 pap+0.096
 Chick+0.094
V+0.091
Neg logit contributions
he-0.106
pit-0.100
case-0.099
aren-0.098
 benches-0.095
Token<|endoftext|>
Token ID50256
Feature activation-7.77e-02

Pos logit contributions
 Will+0.025
 Gr+0.020
 Chick+0.019
 Ch+0.019
 Sp+0.019
Neg logit contributions
pit-0.039
 benches-0.037
castle-0.036
awed-0.036
ement-0.035
TokenOnce
Token ID7454
Feature activation-4.04e-01

Pos logit contributions
pit+0.003
 benches+0.003
case+0.002
he+0.002
aren+0.002
Neg logit contributions
 Will-0.002
ch-0.002
 Ch-0.002
 Sp-0.002
V-0.002
Token there
Token ID612
Feature activation+2.02e-01

Pos logit contributions
he+0.009
 benches+0.009
pit+0.009
case+0.008
aren+0.008
Neg logit contributions
 Will-0.014
 three-0.012
 pap-0.012
 crew-0.012
 Ch-0.012
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
 Will+0.005
 Sp+0.004
 Ch+0.004
 three+0.004
 Chick+0.004
Neg logit contributions
 benches-0.007
pit-0.007
aren-0.006
he-0.006
case-0.006
Token a
Token ID257
Feature activation+5.92e-01

Pos logit contributions
 crew+0.026
 Will+0.025
 pap+0.024
ye+0.023
cient+0.022
Neg logit contributions
he-0.042
 benches-0.039
case-0.039
pit-0.038
aren-0.037
Token girl
Token ID2576
Feature activation+3.18e-01

Pos logit contributions
 Will+0.013
 crew+0.012
ksh+0.012
free+0.012
 Him+0.012
Neg logit contributions
 bench-0.019
he-0.019
 sandbox-0.018
pit-0.018
case-0.018
Token named
Token ID3706
Feature activation+1.49e+00

Pos logit contributions
 Will+0.010
 crew+0.009
 pap+0.009
 Chick+0.009
ch+0.009
Neg logit contributions
he-0.008
pit-0.008
 benches-0.008
case-0.008
aren-0.008
Token Be
Token ID1355
Feature activation+4.31e+00

Pos logit contributions
 crew+0.051
 pap+0.046
 Will+0.045
ye+0.045
 Amen+0.044
Neg logit contributions
he-0.037
ue-0.036
case-0.036
 benches-0.036
aren-0.036
Tokencca
Token ID13227
Feature activation+2.20e+00

Pos logit contributions
 Will+0.117
 pap+0.104
ye+0.104
 crew+0.104
 Chick+0.104
Neg logit contributions
he-0.129
 than-0.121
aw-0.121
pit-0.115
awed-0.115
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
 Will+0.085
 crew+0.080
ch+0.080
 louder+0.079
 Yoga+0.079
Neg logit contributions
he-0.038
pit-0.036
 benches-0.034
case-0.033
aren-0.033
Token Be
Token ID1355
Feature activation+3.27e+00

Pos logit contributions
 Will+0.007
 pap+0.006
 Ch+0.006
 Yoga+0.006
 three+0.006
Neg logit contributions
pit-0.007
he-0.007
case-0.006
 benches-0.006
aren-0.006
Tokencca
Token ID13227
Feature activation+1.06e+00

Pos logit contributions
 Will+0.093
ye+0.087
 Yoga+0.085
 pap+0.085
 Chick+0.084
Neg logit contributions
he-0.094
 than-0.091
pit-0.083
awed-0.083
aw-0.082
Token had
Token ID550
Feature activation+1.88e+00

Pos logit contributions
 Will+0.025
 crew+0.022
 three+0.022
 pap+0.021
 Chick+0.020
Neg logit contributions
pit-0.037
 benches-0.036
he-0.036
aren-0.035
case-0.035
Token of
Token ID286
Feature activation+1.33e-01

Pos logit contributions
 Will+0.026
 three+0.024
 crew+0.023
free+0.022
 louder+0.022
Neg logit contributions
awed-0.029
 benches-0.029
aren-0.029
pit-0.028
he-0.027
Token the
Token ID262
Feature activation+4.13e-01

Pos logit contributions
 Will+0.003
 crew+0.003
 pap+0.003
 Chick+0.003
ch+0.003
Neg logit contributions
he-0.004
 benches-0.004
pit-0.004
case-0.004
aren-0.004
Token body
Token ID1767
Feature activation+2.41e+00

Pos logit contributions
 Will+0.011
 crew+0.010
 pap+0.010
 three+0.009
ch+0.009
Neg logit contributions
he-0.013
pit-0.013
 benches-0.012
aren-0.012
case-0.012
Token,
Token ID11
Feature activation+1.44e+00

Pos logit contributions
 Will+0.071
 three+0.065
 crew+0.065
free+0.063
 louder+0.062
Neg logit contributions
aren-0.066
 benches-0.065
pit-0.063
awed-0.063
he-0.062
Token the
Token ID262
Feature activation+6.34e-01

Pos logit contributions
 Will+0.043
 crew+0.039
 three+0.038
 louder+0.038
 pap+0.038
Neg logit contributions
he-0.042
pit-0.042
aren-0.041
awed-0.041
 benches-0.040
Token lung
Token ID12317
Feature activation+4.28e+00

Pos logit contributions
 Will+0.015
 crew+0.014
 pap+0.014
 three+0.013
 Chick+0.013
Neg logit contributions
he-0.021
pit-0.020
 benches-0.020
case-0.020
aren-0.020
Token."
Token ID526
Feature activation+1.01e+00

Pos logit contributions
 Will+0.100
 crew+0.092
 pap+0.087
ch+0.085
 three+0.084
Neg logit contributions
 benches-0.146
aren-0.145
he-0.144
pit-0.142
awed-0.135
Token All
Token ID1439
Feature activation+1.05e-01

Pos logit contributions
 Will+0.022
 Sp+0.018
 three+0.017
 Yoga+0.016
 Ch+0.016
Neg logit contributions
pit-0.040
he-0.038
case-0.037
aren-0.037
 benches-0.037
Token the
Token ID262
Feature activation-6.80e-01

Pos logit contributions
 Will+0.003
 crew+0.003
 pap+0.002
 Chick+0.002
ch+0.002
Neg logit contributions
he-0.003
pit-0.003
 benches-0.003
case-0.003
aren-0.003
Token students
Token ID2444
Feature activation-2.54e-01

Pos logit contributions
he+0.023
pit+0.022
 benches+0.022
case+0.022
aren+0.021
Neg logit contributions
 Will-0.016
 crew-0.015
 pap-0.015
 three-0.014
 Chick-0.014
Token looked
Token ID3114
Feature activation-1.97e-02

Pos logit contributions
pit+0.008
 benches+0.008
aren+0.007
ue+0.007
he+0.007
Neg logit contributions
 Will-0.007
 three-0.007
 crew-0.006
 louder-0.006
 3-0.006
Token help
Token ID1037
Feature activation+2.04e+00

Pos logit contributions
 Will+0.017
 crew+0.016
 pap+0.015
 Chick+0.014
 three+0.014
Neg logit contributions
he-0.028
 benches-0.027
pit-0.026
case-0.026
aren-0.026
Token heal
Token ID12035
Feature activation+2.78e+00

Pos logit contributions
 Will+0.050
 crew+0.044
 Chick+0.044
 pap+0.043
ch+0.043
Neg logit contributions
pit-0.069
he-0.066
 benches-0.065
case-0.065
aren-0.065
Token the
Token ID262
Feature activation+6.12e-01

Pos logit contributions
 Will+0.094
 crew+0.091
 pap+0.088
 three+0.085
ye+0.084
Neg logit contributions
he-0.062
 benches-0.059
pit-0.057
case-0.055
aren-0.054
Token boo
Token ID22513
Feature activation+2.57e+00

Pos logit contributions
 Will+0.016
 crew+0.015
 pap+0.015
 Chick+0.015
 Sp+0.014
Neg logit contributions
he-0.019
 benches-0.018
 bench-0.018
pit-0.018
case-0.017
Token-
Token ID12
Feature activation+1.60e+00

Pos logit contributions
 Will+0.086
 crew+0.081
 pap+0.080
 Yoga+0.076
 three+0.076
Neg logit contributions
he-0.065
pit-0.058
case-0.056
 benches-0.053
ue-0.051
Tokenbo
Token ID2127
Feature activation+4.25e+00

Pos logit contributions
 crew+0.046
 Will+0.043
 pap+0.043
 mate+0.042
ye+0.040
Neg logit contributions
he-0.052
case-0.044
awed-0.043
aren-0.042
handed-0.042
Tokeno
Token ID78
Feature activation+1.83e+00

Pos logit contributions
 Will+0.110
 crew+0.102
 pap+0.099
 Chick+0.095
 three+0.094
Neg logit contributions
he-0.132
pit-0.130
 benches-0.127
case-0.126
aren-0.124
Token.
Token ID13
Feature activation+7.93e-01

Pos logit contributions
 Will+0.058
 crew+0.055
 pap+0.054
 Chick+0.052
 three+0.051
Neg logit contributions
he-0.047
pit-0.044
case-0.043
 benches-0.043
aren-0.041
Token
Token ID220
Feature activation+9.67e-01

Pos logit contributions
 Will+0.017
 Yoga+0.014
 Sp+0.014
 Ch+0.013
 Chick+0.013
Neg logit contributions
pit-0.030
 benches-0.029
he-0.028
case-0.027
awed-0.027
Token
Token ID198
Feature activation+4.00e-01

Pos logit contributions
 Will+0.026
 crew+0.024
 pap+0.023
 Chick+0.022
ch+0.022
Neg logit contributions
pit-0.029
he-0.029
 benches-0.028
case-0.028
aren-0.027
Token
Token ID198
Feature activation+4.45e-01

Pos logit contributions
 Will+0.010
 crew+0.009
 pap+0.009
 Chick+0.009
ch+0.009
Neg logit contributions
pit-0.013
 benches-0.012
he-0.012
case-0.012
aren-0.012
Token town
Token ID3240
Feature activation-8.73e-02

Pos logit contributions
pit+0.003
he+0.003
case+0.003
aren+0.003
 benches+0.003
Neg logit contributions
 Will-0.002
 crew-0.002
 three-0.002
 pap-0.002
 Ch-0.002
Token,
Token ID11
Feature activation+5.35e-01

Pos logit contributions
he+0.002
ue+0.002
aren+0.002
 benches+0.002
pit+0.002
Neg logit contributions
 Will-0.003
 crew-0.003
 three-0.003
 louder-0.003
 3-0.002
Token with
Token ID351
Feature activation-4.47e-03

Pos logit contributions
 Will+0.014
 three+0.012
 louder+0.012
 crew+0.011
 3+0.011
Neg logit contributions
he-0.018
case-0.018
ue-0.018
ement-0.018
pit-0.017
Token Mr
Token ID1770
Feature activation+1.92e+00

Pos logit contributions
he+0.000
pit+0.000
 benches+0.000
case+0.000
aren+0.000
Neg logit contributions
 Will-0.000
 crew-0.000
 pap-0.000
 Chick-0.000
ch-0.000
Token sm
Token ID895
Feature activation+9.56e-01

Pos logit contributions
 Will+0.066
 crew+0.062
 pap+0.061
 three+0.059
 Chick+0.059
Neg logit contributions
he-0.046
pit-0.044
 benches-0.043
case-0.043
aren-0.042
Tokenith
Token ID342
Feature activation+4.25e+00

Pos logit contributions
 Will+0.024
 three+0.024
 pap+0.022
 crew+0.022
 3+0.022
Neg logit contributions
 benches-0.030
case-0.029
he-0.028
pit-0.027
hel-0.027
Token blinking
Token ID43196
Feature activation+2.61e+00

Pos logit contributions
 Will+0.120
 crew+0.112
 pap+0.109
 Chick+0.105
 three+0.105
Neg logit contributions
he-0.122
pit-0.120
 benches-0.118
case-0.116
aren-0.114
Token the
Token ID262
Feature activation-5.57e-01

Pos logit contributions
 Will+0.065
 crew+0.065
 pap+0.063
 three+0.063
 3+0.059
Neg logit contributions
he-0.083
aren-0.079
 benches-0.078
pit-0.078
case-0.077
Token car
Token ID1097
Feature activation+1.68e-01

Pos logit contributions
he+0.019
pit+0.018
 benches+0.018
aren+0.018
case+0.018
Neg logit contributions
 Will-0.013
 crew-0.012
 pap-0.012
 three-0.012
ch-0.011
Token lights
Token ID7588
Feature activation+4.25e-01

Pos logit contributions
 Will+0.005
 three+0.005
 crew+0.005
 louder+0.005
free+0.004
Neg logit contributions
aren-0.005
pit-0.004
 benches-0.004
ue-0.004
he-0.004
Token to
Token ID284
Feature activation-2.33e-01

Pos logit contributions
 three+0.012
 Will+0.012
 louder+0.012
free+0.011
 crew+0.011
Neg logit contributions
aren-0.012
ue-0.012
pit-0.011
 benches-0.011
handed-0.011
Token Ho
Token ID9544
Feature activation+2.53e+00

Pos logit contributions
 Will+0.049
 crew+0.049
 Amen+0.047
 louder+0.046
 pap+0.044
Neg logit contributions
 benches-0.073
pit-0.072
aren-0.071
he-0.071
case-0.069
Tokenppy
Token ID14097
Feature activation+2.85e+00

Pos logit contributions
 Will+0.061
 crew+0.056
 pap+0.054
 louder+0.054
 Yoga+0.054
Neg logit contributions
pit-0.082
he-0.081
aren-0.078
case-0.077
 benches-0.077
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
 Will+0.111
 crew+0.107
 pap+0.105
 Chick+0.103
 three+0.101
Neg logit contributions
he-0.053
pit-0.051
 benches-0.051
case-0.048
aren-0.047
Token one
Token ID530
Feature activation+1.49e+00

Pos logit contributions
ksh+0.049
ch+0.047
V+0.047
 louder+0.047
 crew+0.046
Neg logit contributions
 benches-0.045
 Lib-0.044
aren-0.043
pit-0.043
 bench-0.042
Token called
Token ID1444
Feature activation+1.98e+00

Pos logit contributions
 Will+0.039
 crew+0.036
 pap+0.035
 Chick+0.034
 louder+0.034
Neg logit contributions
he-0.045
pit-0.045
 benches-0.044
case-0.043
aren-0.043
Token Cotton
Token ID23915
Feature activation+4.22e+00

Pos logit contributions
 crew+0.040
 Amen+0.038
 louder+0.038
 Will+0.038
ch+0.038
Neg logit contributions
 benches-0.064
pit-0.063
he-0.062
aren-0.061
 purse-0.061
Token.
Token ID13
Feature activation+7.02e-01

Pos logit contributions
V+0.175
 Will+0.172
ch+0.171
 Amen+0.169
 July+0.167
Neg logit contributions
pit-0.052
 benches-0.045
he-0.044
 than-0.042
 bench-0.041
Token They
Token ID1119
Feature activation+6.33e-02

Pos logit contributions
 crew+0.015
 July+0.014
 louder+0.014
free+0.014
 Amen+0.014
Neg logit contributions
 benches-0.021
he-0.021
pit-0.020
 bench-0.020
awed-0.020
Token lived
Token ID5615
Feature activation+2.58e-01

Pos logit contributions
 Will+0.002
 crew+0.002
 pap+0.002
ch+0.002
 louder+0.002
Neg logit contributions
he-0.002
pit-0.002
 benches-0.001
case-0.001
aren-0.001
Token in
Token ID287
Feature activation+1.37e-01

Pos logit contributions
 Will+0.008
 crew+0.007
 pap+0.007
 Chick+0.007
ch+0.007
Neg logit contributions
he-0.007
pit-0.007
 benches-0.007
case-0.006
aren-0.006
Token a
Token ID257
Feature activation+2.87e-01

Pos logit contributions
 crew+0.003
 Will+0.003
ye+0.003
 louder+0.003
ch+0.003
Neg logit contributions
he-0.005
 benches-0.004
case-0.004
 bench-0.004
aren-0.004
Token that
Token ID326
Feature activation+9.74e-01

Pos logit contributions
he+0.024
ue+0.024
pit+0.024
case+0.024
aren+0.023
Neg logit contributions
 Will-0.019
 crew-0.018
 three-0.017
 sailors-0.016
 louder-0.016
Token it
Token ID340
Feature activation+1.01e+00

Pos logit contributions
 Will+0.021
 crew+0.019
 pap+0.018
 Chick+0.017
 three+0.017
Neg logit contributions
pit-0.035
he-0.035
 benches-0.035
case-0.034
aren-0.034
Token was
Token ID373
Feature activation+1.45e+00

Pos logit contributions
 Will+0.025
 crew+0.022
 pap+0.021
 Ch+0.020
 Chick+0.020
Neg logit contributions
 benches-0.034
pit-0.034
he-0.034
aren-0.033
 purse-0.032
Token nearly
Token ID3016
Feature activation+1.37e+00

Pos logit contributions
 Will+0.034
 crew+0.031
 pap+0.030
 Chick+0.029
 three+0.028
Neg logit contributions
he-0.049
pit-0.049
 benches-0.048
case-0.047
aren-0.047
Token sund
Token ID37437
Feature activation-1.50e-01

Pos logit contributions
 Will+0.036
 crew+0.033
 pap+0.033
 Chick+0.031
 Amen+0.031
Neg logit contributions
he-0.042
pit-0.041
 benches-0.041
case-0.040
aren-0.039
Tokenown
Token ID593
Feature activation+4.16e+00

Pos logit contributions
he+0.004
pit+0.004
 benches+0.004
case+0.004
aren+0.004
Neg logit contributions
 Will-0.004
 crew-0.004
 pap-0.004
 Chick-0.004
 three-0.004
Token when
Token ID618
Feature activation+1.66e+00

Pos logit contributions
 Will+0.134
 crew+0.131
 pap+0.126
 three+0.124
free+0.121
Neg logit contributions
 benches-0.102
he-0.099
pit-0.097
aren-0.096
case-0.095
Token they
Token ID484
Feature activation+3.03e-01

Pos logit contributions
 Will+0.040
 crew+0.037
 pap+0.036
 Chick+0.034
 three+0.034
Neg logit contributions
he-0.055
pit-0.054
 benches-0.053
case-0.052
aren-0.052
Token emerged
Token ID9349
Feature activation+3.48e-01

Pos logit contributions
 Will+0.008
 crew+0.007
 three+0.007
 roasted+0.007
 Ch+0.006
Neg logit contributions
pit-0.010
ement-0.009
 benches-0.009
aren-0.009
case-0.009
Token from
Token ID422
Feature activation-8.82e-01

Pos logit contributions
 Will+0.010
 crew+0.009
 louder+0.009
 three+0.009
 pap+0.009
Neg logit contributions
pit-0.010
aren-0.010
he-0.010
 benches-0.010
case-0.010
Token the
Token ID262
Feature activation-1.88e-02

Pos logit contributions
pit+0.033
he+0.032
case+0.032
ue+0.032
aren+0.031
Neg logit contributions
 Will-0.019
 crew-0.017
 three-0.017
 pap-0.016
 Ch-0.015
Token.
Token ID13
Feature activation-9.69e-02

Pos logit contributions
 Will+0.079
 crew+0.073
 pap+0.071
 three+0.070
 louder+0.070
Neg logit contributions
he-0.069
pit-0.067
 benches-0.066
case-0.066
aren-0.064
Token It
Token ID632
Feature activation+6.46e-01

Pos logit contributions
pit+0.003
he+0.003
 benches+0.003
case+0.003
aren+0.003
Neg logit contributions
 Will-0.003
 crew-0.002
 three-0.002
 Ch-0.002
 Sp-0.002
Token was
Token ID373
Feature activation+3.60e-01

Pos logit contributions
 Will+0.018
 crew+0.016
 pap+0.016
 Ch+0.015
 Chick+0.015
Neg logit contributions
he-0.020
pit-0.020
 benches-0.020
aren-0.019
case-0.018
Token a
Token ID257
Feature activation-1.52e-01

Pos logit contributions
 Will+0.008
 crew+0.007
 pap+0.007
free+0.007
 Chick+0.007
Neg logit contributions
he-0.012
pit-0.012
 benches-0.012
case-0.012
aren-0.012
Token ch
Token ID442
Feature activation-3.82e-01

Pos logit contributions
he+0.005
pit+0.005
 benches+0.005
case+0.005
aren+0.005
Neg logit contributions
 Will-0.003
 crew-0.003
 pap-0.003
 Chick-0.003
free-0.003
Tokename
Token ID480
Feature activation+4.16e+00

Pos logit contributions
pit+0.016
 benches+0.016
he+0.016
case+0.016
aren+0.016
Neg logit contributions
 Will-0.006
 crew-0.005
ch-0.005
 pap-0.005
 three-0.004
Tokenleon
Token ID38970
Feature activation+1.49e+00

Pos logit contributions
 Will+0.106
 crew+0.099
 pap+0.098
 Chick+0.095
V+0.093
Neg logit contributions
he-0.128
pit-0.125
case-0.121
 benches-0.120
aren-0.117
Token.
Token ID13
Feature activation-1.44e-01

Pos logit contributions
 Will+0.046
 crew+0.043
 three+0.042
 Ch+0.041
ch+0.041
Neg logit contributions
 benches-0.042
he-0.042
aren-0.039
case-0.037
pit-0.037
Token The
Token ID383
Feature activation-6.17e-01

Pos logit contributions
pit+0.005
he+0.005
 benches+0.005
case+0.004
aren+0.004
Neg logit contributions
 Will-0.004
 crew-0.003
 Ch-0.003
 Sp-0.003
 three-0.003
Token crocod
Token ID37565
Feature activation+2.25e-01

Pos logit contributions
 bench+0.016
 benches+0.016
pit+0.016
he+0.016
 sandbox+0.015
Neg logit contributions
 Will-0.017
 Chick-0.017
ksh-0.016
free-0.016
 Amen-0.016
Tokenile
Token ID576
Feature activation+2.93e-01

Pos logit contributions
 Will+0.005
ch+0.005
 crew+0.005
free+0.004
 three+0.004
Neg logit contributions
 benches-0.008
aren-0.008
 purse-0.008
pit-0.007
case-0.007
Token was
Token ID373
Feature activation+3.78e-01

Pos logit contributions
 Will+0.027
 crew+0.023
 three+0.023
 pap+0.023
 Ch+0.021
Neg logit contributions
 benches-0.040
pit-0.037
he-0.037
aren-0.035
ue-0.034
Token happy
Token ID3772
Feature activation-3.76e-01

Pos logit contributions
 Will+0.009
 crew+0.009
 pap+0.008
 Chick+0.008
ch+0.008
Neg logit contributions
he-0.012
pit-0.012
 benches-0.012
case-0.012
aren-0.012
Token and
Token ID290
Feature activation-2.47e-01

Pos logit contributions
 benches+0.013
he+0.013
pit+0.012
aren+0.012
case+0.012
Neg logit contributions
 Will-0.010
 three-0.009
 crew-0.008
 louder-0.008
 pap-0.008
Token asked
Token ID1965
Feature activation+1.30e+00

Pos logit contributions
pit+0.007
he+0.007
 benches+0.007
aren+0.007
case+0.007
Neg logit contributions
 Will-0.007
 crew-0.006
 pap-0.006
 three-0.006
 Chick-0.006
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
 crew+0.033
ksh+0.032
 Amen+0.032
 Will+0.031
ch+0.031
Neg logit contributions
he-0.038
 benches-0.036
pit-0.035
aren-0.035
 bench-0.034
Token mechanic
Token ID21600
Feature activation+4.14e+00

Pos logit contributions
 Will+0.032
 Chick+0.030
ksh+0.029
 roasted+0.029
ch+0.028
Neg logit contributions
he-0.037
case-0.035
pit-0.034
 purse-0.033
 benches-0.032
Token to
Token ID284
Feature activation+1.62e+00

Pos logit contributions
 Will+0.101
 crew+0.094
 pap+0.091
 Chick+0.089
ch+0.087
Neg logit contributions
pit-0.132
he-0.132
 benches-0.128
case-0.126
aren-0.125
Token explain
Token ID4727
Feature activation+5.76e-01

Pos logit contributions
 Will+0.028
 crew+0.025
 pap+0.024
 three+0.024
 Amen+0.021
Neg logit contributions
pit-0.067
 benches-0.065
awed-0.065
aren-0.064
he-0.063
Token more
Token ID517
Feature activation+1.78e+00

Pos logit contributions
 Will+0.016
 crew+0.015
 pap+0.015
 Chick+0.014
ch+0.014
Neg logit contributions
he-0.016
pit-0.016
 benches-0.016
case-0.015
aren-0.015
Token things
Token ID1243
Feature activation-1.40e+00

Pos logit contributions
 Will+0.045
 crew+0.042
 pap+0.041
 Chick+0.040
ch+0.039
Neg logit contributions
he-0.055
 benches-0.054
case-0.054
pit-0.054
aren-0.051
Token to
Token ID284
Feature activation+2.87e-01

Pos logit contributions
pit+0.047
 benches+0.046
awed+0.045
aren+0.045
he+0.044
Neg logit contributions
 Will-0.034
free-0.030
 louder-0.029
 crew-0.029
 three-0.029
Token special
Token ID2041
Feature activation+1.01e+00

Pos logit contributions
case+0.014
he+0.014
 bench+0.013
 benches+0.013
pit+0.013
Neg logit contributions
 Will-0.011
 crew-0.010
ksh-0.010
 Amen-0.010
ch-0.010
Token,
Token ID11
Feature activation+5.19e-01

Pos logit contributions
 Will+0.031
 three+0.029
 crew+0.028
 Sp+0.027
 pap+0.027
Neg logit contributions
he-0.030
aren-0.030
ue-0.029
awed-0.028
pit-0.028
Token it
Token ID340
Feature activation+1.33e+00

Pos logit contributions
 Will+0.012
 three+0.010
 Ch+0.010
 Sp+0.009
 Chick+0.008
Neg logit contributions
aren-0.021
he-0.020
pit-0.020
handed-0.020
ue-0.019
Token was
Token ID373
Feature activation+1.07e+00

Pos logit contributions
 Will+0.032
 pap+0.028
 crew+0.028
 Chick+0.027
 Sp+0.027
Neg logit contributions
he-0.046
pit-0.046
 benches-0.046
aren-0.045
ue-0.043
Token the
Token ID262
Feature activation-6.28e-01

Pos logit contributions
 Will+0.030
ksh+0.029
V+0.028
free+0.028
 Amen+0.028
Neg logit contributions
pit-0.027
case-0.027
 bench-0.026
he-0.026
 than-0.026
Token only
Token ID691
Feature activation+4.13e+00

Pos logit contributions
pit+0.020
he+0.020
 benches+0.020
case+0.020
 bench+0.019
Neg logit contributions
 Will-0.015
 crew-0.013
 Chick-0.013
ch-0.013
 pap-0.013
Token kind
Token ID1611
Feature activation+3.24e+00

Pos logit contributions
 Will+0.102
 crew+0.097
 pap+0.094
 Amen+0.093
 Chick+0.093
Neg logit contributions
pit-0.126
he-0.125
 benches-0.124
case-0.119
aren-0.118
Token of
Token ID286
Feature activation+4.26e-01

Pos logit contributions
 Will+0.079
 crew+0.072
 three+0.070
 pap+0.068
free+0.066
Neg logit contributions
pit-0.107
 benches-0.106
he-0.106
case-0.104
aren-0.104
Token its
Token ID663
Feature activation+1.30e+00

Pos logit contributions
 Will+0.010
 crew+0.010
 Chick+0.010
 pap+0.010
ch+0.010
Neg logit contributions
 benches-0.013
he-0.013
pit-0.013
aren-0.012
 bench-0.012
Token kind
Token ID1611
Feature activation+2.65e+00

Pos logit contributions
 Will+0.029
 crew+0.027
 pap+0.026
 Chick+0.025
ch+0.025
Neg logit contributions
he-0.044
pit-0.044
 benches-0.044
case-0.043
aren-0.042
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
 Will+0.065
 crew+0.060
 three+0.058
 pap+0.057
 3+0.054
Neg logit contributions
he-0.088
pit-0.086
 benches-0.086
aren-0.085
case-0.085
Token and
Token ID290
Feature activation+5.47e-01

Pos logit contributions
he+0.009
aren+0.009
pit+0.009
 benches+0.009
icycle+0.009
Neg logit contributions
 Will-0.006
 three-0.006
 crew-0.005
 louder-0.005
 sure-0.005
Token the
Token ID262
Feature activation+8.55e-02

Pos logit contributions
 Will+0.015
 crew+0.014
 pap+0.013
 Chick+0.013
ch+0.013
Neg logit contributions
he-0.016
pit-0.016
 benches-0.016
case-0.015
aren-0.015
Token doctor
Token ID6253
Feature activation+2.39e+00

Pos logit contributions
 Will+0.002
 crew+0.002
 Chick+0.002
 pap+0.002
ksh+0.002
Neg logit contributions
he-0.003
case-0.003
pit-0.003
 bench-0.003
 benches-0.003
Token said
Token ID531
Feature activation+6.93e-01

Pos logit contributions
 Will+0.060
 crew+0.054
 three+0.050
 pap+0.050
 Ch+0.049
Neg logit contributions
 benches-0.081
pit-0.081
he-0.076
aren-0.075
ue-0.074
Token the
Token ID262
Feature activation+9.26e-01

Pos logit contributions
 Will+0.020
 crew+0.019
 pap+0.019
ch+0.018
 Chick+0.018
Neg logit contributions
he-0.019
pit-0.018
 benches-0.018
case-0.018
aren-0.017
Token patient
Token ID5827
Feature activation+4.12e+00

Pos logit contributions
 Chick+0.021
 Will+0.021
 crew+0.021
 Ch+0.019
 pap+0.019
Neg logit contributions
case-0.031
he-0.030
pit-0.029
 bench-0.027
 benches-0.027
Token was
Token ID373
Feature activation+2.18e+00

Pos logit contributions
 Will+0.116
 crew+0.107
 pap+0.104
 three+0.102
free+0.100
Neg logit contributions
he-0.118
pit-0.118
 benches-0.118
aren-0.113
case-0.112
Token very
Token ID845
Feature activation+1.14e+00

Pos logit contributions
 Will+0.051
 crew+0.047
 pap+0.045
 Chick+0.044
ch+0.044
Neg logit contributions
he-0.072
pit-0.069
case-0.068
aren-0.068
 benches-0.067
Token helpful
Token ID7613
Feature activation+2.11e+00

Pos logit contributions
 Will+0.020
 crew+0.017
ch+0.017
 Chick+0.017
 pap+0.016
Neg logit contributions
he-0.044
case-0.043
pit-0.043
aren-0.042
 benches-0.041
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
 Will+0.067
 three+0.062
 crew+0.060
free+0.059
 louder+0.059
Neg logit contributions
 benches-0.057
pit-0.055
he-0.055
aren-0.054
awed-0.054
Token They
Token ID1119
Feature activation+1.34e-01

Pos logit contributions
 Will+0.029
 Sp+0.022
 Chick+0.021
 Gr+0.021
 Yoga+0.021
Neg logit contributions
pit-0.047
 benches-0.044
castle-0.043
awed-0.043
he-0.042
Token She
Token ID1375
Feature activation-6.15e-01

Pos logit contributions
pit+0.006
he+0.006
 benches+0.006
aren+0.006
icycle+0.006
Neg logit contributions
 Will-0.009
 three-0.009
 Sp-0.008
 Yoga-0.008
 3-0.008
Token had
Token ID550
Feature activation+4.25e-01

Pos logit contributions
he+0.020
pit+0.019
 benches+0.019
aren+0.019
case+0.018
Neg logit contributions
 Will-0.017
 crew-0.015
 pap-0.014
 three-0.014
 Chick-0.014
Token entered
Token ID5982
Feature activation-7.40e-02

Pos logit contributions
 Will+0.010
 crew+0.009
 pap+0.009
 three+0.009
 Chick+0.009
Neg logit contributions
he-0.014
pit-0.014
 benches-0.013
aren-0.013
case-0.013
Token a
Token ID257
Feature activation+5.70e-02

Pos logit contributions
case+0.003
pit+0.003
ue+0.003
awed+0.003
aren+0.003
Neg logit contributions
 three-0.001
 Will-0.001
 louder-0.001
 3-0.001
 Sp-0.001
Token sand
Token ID6450
Feature activation-2.51e+00

Pos logit contributions
 Will+0.002
 crew+0.001
 pap+0.001
 Chick+0.001
free+0.001
Neg logit contributions
he-0.002
pit-0.002
 benches-0.002
case-0.002
aren-0.002
Tokenpit
Token ID15544
Feature activation-5.18e+00

Pos logit contributions
aren+0.073
ue+0.062
Susan+0.059
Lib+0.059
chair+0.058
Neg logit contributions
 mer-0.079
 mate-0.077
 three-0.076
 crew-0.075
 sw-0.073
Token and
Token ID290
Feature activation+4.19e-01

Pos logit contributions
aren+0.181
ue+0.176
opl+0.171
mm+0.171
awed+0.167
Neg logit contributions
 three-0.127
 3-0.104
 for-0.099
 working-0.087
 to-0.087
Token it
Token ID340
Feature activation+8.14e-01

Pos logit contributions
 Will+0.011
 three+0.010
 pap+0.009
 roasted+0.009
 crew+0.009
Neg logit contributions
he-0.014
ement-0.014
aren-0.014
ue-0.013
pit-0.013
Token was
Token ID373
Feature activation+4.80e-01

Pos logit contributions
 Will+0.018
ch+0.016
 Sp+0.016
 three+0.016
 pap+0.016
Neg logit contributions
aren-0.029
he-0.028
 benches-0.027
pit-0.027
 purse-0.026
Token getting
Token ID1972
Feature activation+1.55e+00

Pos logit contributions
 Will+0.010
 pap+0.009
 crew+0.009
 three+0.009
 louder+0.009
Neg logit contributions
he-0.017
pit-0.017
aren-0.017
 benches-0.017
case-0.016
Token bigger
Token ID5749
Feature activation+4.81e-01

Pos logit contributions
 crew+0.040
 Will+0.040
 Amen+0.039
 pap+0.037
 Mass+0.037
Neg logit contributions
 benches-0.047
he-0.046
pit-0.043
case-0.041
ries-0.040
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
 Will+0.046
 crew+0.042
 pap+0.042
 Chick+0.042
 three+0.041
Neg logit contributions
pit-0.035
 benches-0.034
he-0.033
aren-0.033
case-0.032
Token She
Token ID1375
Feature activation+1.23e-01

Pos logit contributions
he+0.002
pit+0.002
 benches+0.002
case+0.002
aren+0.002
Neg logit contributions
 Will-0.006
 crew-0.006
 pap-0.006
ch-0.006
 Chick-0.006
Token had
Token ID550
Feature activation+5.24e-01

Pos logit contributions
 crew+0.004
 Will+0.004
 louder+0.003
 pap+0.003
 younger+0.003
Neg logit contributions
he-0.003
case-0.003
 benches-0.003
pit-0.003
 bench-0.003
Token a
Token ID257
Feature activation-6.68e-01

Pos logit contributions
 crew+0.011
 Will+0.011
 pap+0.010
ye+0.010
 Amen+0.010
Neg logit contributions
 benches-0.018
he-0.018
case-0.018
 purse-0.017
 bench-0.017
Token red
Token ID2266
Feature activation-1.95e+00

Pos logit contributions
pit+0.021
he+0.021
case+0.021
 purse+0.021
 benches+0.021
Neg logit contributions
 Will-0.016
 crew-0.015
free-0.014
 louder-0.014
ch-0.014
Token tr
Token ID491
Feature activation-4.86e+00

Pos logit contributions
he+0.056
aren+0.055
 benches+0.055
pit+0.052
case+0.052
Neg logit contributions
 Will-0.055
 three-0.055
 pap-0.052
 crew-0.051
ch-0.050
Tokenicycle
Token ID35298
Feature activation-2.78e+00

Pos logit contributions
 benches+0.106
 bench+0.096
aren+0.087
 purse+0.086
pit+0.082
Neg logit contributions
 Will-0.161
 Sp-0.152
 three-0.149
V-0.147
 Chick-0.145
Token with
Token ID351
Feature activation+9.50e-02

Pos logit contributions
 benches+0.100
pit+0.098
aren+0.097
ue+0.095
awed+0.089
Neg logit contributions
 three-0.067
 Will-0.058
 3-0.057
ch-0.054
free-0.053
Token a
Token ID257
Feature activation+5.64e-02

Pos logit contributions
 Will+0.002
 crew+0.002
 pap+0.002
 Amen+0.002
 Chick+0.002
Neg logit contributions
he-0.003
 benches-0.003
case-0.003
pit-0.003
aren-0.003
Token big
Token ID1263
Feature activation-1.34e+00

Pos logit contributions
 Will+0.001
 crew+0.001
 pap+0.001
 three+0.001
ch+0.001
Neg logit contributions
he-0.002
pit-0.002
 benches-0.002
aren-0.002
case-0.002
Token wheel
Token ID7825
Feature activation+1.35e-02

Pos logit contributions
he+0.051
handed+0.050
aren+0.048
awed+0.048
 benches+0.047
Neg logit contributions
 three-0.027
 Will-0.027
 3-0.024
 pap-0.024
ch-0.024
Token he
Token ID339
Feature activation+5.42e-01

Pos logit contributions
 Will+0.019
 crew+0.017
 three+0.017
 pap+0.017
 Ch+0.017
Neg logit contributions
he-0.015
pit-0.014
aren-0.013
case-0.013
 benches-0.013
Token saw
Token ID2497
Feature activation+1.68e-01

Pos logit contributions
 Will+0.014
 crew+0.013
 three+0.012
 pap+0.012
ch+0.012
Neg logit contributions
he-0.018
aren-0.017
pit-0.017
 benches-0.017
case-0.017
Token a
Token ID257
Feature activation+6.98e-02

Pos logit contributions
 Will+0.003
 crew+0.003
 pap+0.003
 Chick+0.002
ch+0.002
Neg logit contributions
he-0.006
pit-0.006
 benches-0.006
case-0.006
aren-0.006
Token big
Token ID1263
Feature activation-1.98e+00

Pos logit contributions
 Will+0.002
 crew+0.002
 pap+0.001
 three+0.001
ch+0.001
Neg logit contributions
he-0.002
pit-0.002
 benches-0.002
aren-0.002
case-0.002
Token sand
Token ID6450
Feature activation-2.24e+00

Pos logit contributions
he+0.059
awed+0.058
case+0.056
pit+0.055
handed+0.055
Neg logit contributions
 three-0.054
 3-0.051
 Will-0.050
 pap-0.050
ch-0.048
Tokenpit
Token ID15544
Feature activation-4.75e+00

Pos logit contributions
aren+0.059
Lib+0.051
handed+0.051
he+0.049
Susan+0.047
Neg logit contributions
 three-0.068
 sw-0.068
 mer-0.067
 crew-0.066
 mate-0.065
Token filled
Token ID5901
Feature activation+1.07e+00

Pos logit contributions
aren+0.168
ue+0.156
orns+0.152
Lib+0.152
 benches+0.152
Neg logit contributions
 three-0.105
 3-0.087
 sw-0.079
 working-0.075
 Will-0.071
Token with
Token ID351
Feature activation+3.90e-01

Pos logit contributions
 Will+0.031
 three+0.030
 3+0.028
 louder+0.028
 Sp+0.028
Neg logit contributions
aren-0.028
pit-0.028
 benches-0.027
ement-0.027
case-0.026
Token toys
Token ID14958
Feature activation-2.24e+00

Pos logit contributions
 Will+0.010
 three+0.010
 crew+0.010
 pap+0.009
 louder+0.009
Neg logit contributions
he-0.012
case-0.012
aren-0.012
pit-0.011
ue-0.011
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
ue+0.077
aren+0.077
awed+0.067
he+0.066
ement+0.065
Neg logit contributions
 three-0.053
 Will-0.045
 working-0.045
 sw-0.045
 louder-0.044
Token He
Token ID679
Feature activation-3.33e-01

Pos logit contributions
he+0.010
pit+0.010
 benches+0.010
aren+0.010
case+0.010
Neg logit contributions
 Will-0.010
 three-0.009
 Sp-0.009
 3-0.008
 Ch-0.008
Token did
Token ID750
Feature activation-3.54e-01

Pos logit contributions
opl+0.058
orns+0.057
aren+0.056
icycle+0.056
pit+0.056
Neg logit contributions
 three-0.039
 Will-0.039
 pap-0.032
 roasted-0.032
 blow-0.032
Token not
Token ID407
Feature activation-1.53e+00

Pos logit contributions
pit+0.012
aren+0.012
he+0.012
 benches+0.012
awed+0.011
Neg logit contributions
 Will-0.008
 louder-0.007
 crew-0.007
 three-0.007
 pap-0.007
Token know
Token ID760
Feature activation-1.30e+00

Pos logit contributions
pit+0.058
he+0.058
 benches+0.057
case+0.056
aren+0.056
Neg logit contributions
 Will-0.030
 crew-0.026
 pap-0.025
 three-0.024
 louder-0.024
Token what
Token ID644
Feature activation-1.25e+00

Pos logit contributions
he+0.047
pit+0.046
case+0.045
aren+0.045
icycle+0.042
Neg logit contributions
 Will-0.029
 pap-0.026
 crew-0.025
 louder-0.025
 Ch-0.025
Token a
Token ID257
Feature activation-1.22e+00

Pos logit contributions
 benches+0.037
case+0.037
he+0.036
 bench+0.036
pit+0.035
Neg logit contributions
 Will-0.031
 crew-0.030
ksh-0.029
free-0.028
V-0.028
Token tr
Token ID491
Feature activation-4.69e+00

Pos logit contributions
 bench+0.037
he+0.037
pit+0.037
 purse+0.035
 benches+0.035
Neg logit contributions
 crew-0.028
 Will-0.027
 Ch-0.027
ch-0.027
V-0.026
Tokenuffle
Token ID18137
Feature activation-6.77e-01

Pos logit contributions
 benches+0.130
pit+0.121
 bench+0.116
 sandbox+0.115
aren+0.114
Neg logit contributions
 Will-0.138
 Chick-0.127
 pap-0.124
 Sp-0.123
 crew-0.119
Token was
Token ID373
Feature activation-2.30e+00

Pos logit contributions
pit+0.022
 benches+0.021
he+0.021
case+0.021
aren+0.020
Neg logit contributions
 Will-0.017
 crew-0.015
 pap-0.015
ch-0.015
 three-0.015
Token.
Token ID13
Feature activation-8.61e-01

Pos logit contributions
he+0.054
pit+0.053
 benches+0.052
case+0.051
aren+0.050
Neg logit contributions
 Will-0.076
 crew-0.073
 pap-0.070
 three-0.070
 louder-0.069
Token They
Token ID1119
Feature activation-1.34e+00

Pos logit contributions
he+0.032
pit+0.031
 benches+0.030
aren+0.030
case+0.030
Neg logit contributions
 Will-0.018
 Yoga-0.016
 three-0.015
 Sp-0.015
 pap-0.015
Token did
Token ID750
Feature activation-3.90e-01

Pos logit contributions
opl+0.051
pit+0.051
 benches+0.049
aren+0.049
he+0.049
Neg logit contributions
 Will-0.029
 three-0.027
 pap-0.023
 crew-0.023
 roasted-0.023
Token was
Token ID373
Feature activation+9.57e-01

Pos logit contributions
ement+0.037
+0.034
pit+0.034
 benches+0.033
aren+0.032
Neg logit contributions
 Will-0.026
 sw-0.022
 three-0.022
 roasted-0.021
 Chick-0.020
Token playing
Token ID2712
Feature activation-2.27e-01

Pos logit contributions
 Will+0.027
 crew+0.026
 pap+0.025
ye+0.025
 Amen+0.024
Neg logit contributions
he-0.027
pit-0.026
case-0.025
 benches-0.025
aren-0.024
Token in
Token ID287
Feature activation+4.27e-01

Pos logit contributions
pit+0.005
he+0.005
 benches+0.005
aren+0.005
case+0.005
Neg logit contributions
 Will-0.008
 crew-0.007
 pap-0.007
 three-0.007
 Chick-0.007
Token the
Token ID262
Feature activation-5.19e-01

Pos logit contributions
 Will+0.009
 three+0.008
 Sp+0.007
V+0.007
 Ch+0.007
Neg logit contributions
ement-0.017
pit-0.017
case-0.016
aren-0.016
awed-0.016
Token sand
Token ID6450
Feature activation-2.09e+00

Pos logit contributions
he+0.010
pit+0.009
aren+0.009
 benches+0.009
case+0.009
Neg logit contributions
 Will-0.020
 crew-0.019
 pap-0.018
ch-0.018
 three-0.018
Tokenpit
Token ID15544
Feature activation-4.59e+00

Pos logit contributions
aren+0.084
ue+0.069
 Karen+0.068
Lib+0.067
 Lib+0.067
Neg logit contributions
 three-0.046
 Will-0.036
 mer-0.036
 mate-0.035
 3-0.035
Token when
Token ID618
Feature activation+1.49e+00

Pos logit contributions
aren+0.185
ue+0.169
 benches+0.165
icycle+0.156
Lib+0.156
Neg logit contributions
 three-0.095
 3-0.075
 Will-0.069
 folding-0.061
free-0.059
Token she
Token ID673
Feature activation+3.40e-01

Pos logit contributions
 Will+0.052
 three+0.048
 pap+0.045
 Sp+0.045
 Chick+0.045
Neg logit contributions
pit-0.038
he-0.037
case-0.036
ement-0.035
aren-0.035
Token saw
Token ID2497
Feature activation+6.19e-01

Pos logit contributions
 Will+0.009
 crew+0.008
 pap+0.008
 three+0.008
ch+0.008
Neg logit contributions
he-0.010
 benches-0.010
pit-0.010
aren-0.010
case-0.010
Token something
Token ID1223
Feature activation+5.47e-01

Pos logit contributions
 Will+0.011
 crew+0.010
 pap+0.010
 Chick+0.009
ch+0.009
Neg logit contributions
he-0.024
pit-0.023
 benches-0.023
case-0.023
aren-0.023
Token crazy
Token ID7165
Feature activation+1.10e-01

Pos logit contributions
 Will+0.011
 crew+0.010
 pap+0.009
 three+0.009
 louder+0.009
Neg logit contributions
he-0.020
pit-0.020
aren-0.020
 benches-0.019
case-0.019
Token.
Token ID13
Feature activation-7.04e-01

Pos logit contributions
pit+0.019
aren+0.018
ue+0.017
he+0.017
awed+0.017
Neg logit contributions
 Will-0.012
 three-0.011
 Yoga-0.011
 Ch-0.011
ch-0.010
Token She
Token ID1375
Feature activation+1.26e-02

Pos logit contributions
pit+0.017
awed+0.016
he+0.016
 benches+0.015
ement+0.015
Neg logit contributions
 Will-0.025
 Yoga-0.024
 three-0.023
 Sp-0.022
V-0.022
Token had
Token ID550
Feature activation+5.31e-01

Pos logit contributions
 Will+0.000
 crew+0.000
 pap+0.000
 Chick+0.000
ch+0.000
Neg logit contributions
he-0.000
pit-0.000
 benches-0.000
case-0.000
aren-0.000
Token a
Token ID257
Feature activation-4.17e-01

Pos logit contributions
 Will+0.008
 crew+0.007
 three+0.006
 pap+0.006
ch+0.006
Neg logit contributions
pit-0.023
he-0.022
 benches-0.022
aren-0.022
case-0.021
Token big
Token ID1263
Feature activation-1.82e+00

Pos logit contributions
he+0.015
pit+0.015
 benches+0.015
aren+0.014
case+0.014
Neg logit contributions
 Will-0.009
 crew-0.008
 pap-0.008
 three-0.008
 Chick-0.007
Token tr
Token ID491
Feature activation-4.55e+00

Pos logit contributions
he+0.060
awed+0.059
handed+0.059
 benches+0.058
aren+0.055
Neg logit contributions
 three-0.047
 pap-0.043
 3-0.043
 crew-0.043
 Will-0.041
Tokenicycle
Token ID35298
Feature activation-2.56e+00

Pos logit contributions
 benches+0.089
 bench+0.079
 purse+0.072
pit+0.065
aren+0.065
Neg logit contributions
 Will-0.144
 three-0.139
 Amen-0.139
 July-0.137
V-0.137
Token that
Token ID326
Feature activation+1.24e+00

Pos logit contributions
ue+0.093
 benches+0.093
aren+0.092
pit+0.092
awed+0.088
Neg logit contributions
 three-0.058
 3-0.048
 Will-0.047
ch-0.046
free-0.043
Token she
Token ID673
Feature activation+2.98e-01

Pos logit contributions
 Will+0.042
 pap+0.038
 three+0.038
 Ch+0.038
 Sp+0.038
Neg logit contributions
pit-0.030
 benches-0.029
he-0.029
ement-0.028
ue-0.027
Token would
Token ID561
Feature activation-1.16e-01

Pos logit contributions
 Will+0.008
 crew+0.007
 pap+0.007
 Chick+0.007
 louder+0.006
Neg logit contributions
he-0.009
pit-0.009
 benches-0.009
case-0.009
aren-0.009
Token ride
Token ID6594
Feature activation-1.50e-01

Pos logit contributions
awed+0.004
pit+0.004
aren+0.004
ement+0.004
ries+0.004
Neg logit contributions
 Will-0.003
 three-0.002
ch-0.002
 crew-0.002
 blow-0.002
Token.
Token ID13
Feature activation-2.23e-01

Pos logit contributions
 Will+0.016
 three+0.015
 crew+0.014
 pap+0.014
free+0.014
Neg logit contributions
 benches-0.018
pit-0.017
case-0.017
ement-0.017
aren-0.017
Token Tim
Token ID5045
Feature activation-6.85e-01

Pos logit contributions
pit+0.007
he+0.007
 benches+0.007
aren+0.007
case+0.007
Neg logit contributions
 Will-0.006
 Sp-0.005
 Yoga-0.005
 Ch-0.005
 Chick-0.005
Token had
Token ID550
Feature activation+7.23e-01

Pos logit contributions
 benches+0.028
pit+0.028
ement+0.028
aren+0.027
ue+0.026
Neg logit contributions
 Will-0.012
 three-0.010
 Chick-0.010
 Sp-0.009
 Ch-0.009
Token a
Token ID257
Feature activation-4.82e-01

Pos logit contributions
 crew+0.015
 Will+0.015
 pap+0.014
ye+0.014
 Amen+0.014
Neg logit contributions
he-0.025
 benches-0.024
case-0.024
pit-0.023
 bench-0.023
Token big
Token ID1263
Feature activation-1.81e+00

Pos logit contributions
 purse+0.014
pit+0.014
case+0.014
 bench+0.014
he+0.013
Neg logit contributions
 Will-0.012
free-0.012
 crew-0.011
ksh-0.011
 louder-0.011
Token maze
Token ID31237
Feature activation-4.54e+00

Pos logit contributions
he+0.059
 benches+0.057
aren+0.056
pit+0.055
awed+0.055
Neg logit contributions
 Will-0.045
 pap-0.044
 crew-0.044
 three-0.044
 3-0.040
Token in
Token ID287
Feature activation-6.36e-01

Pos logit contributions
aren+0.163
 benches+0.158
ue+0.154
ries+0.152
ement+0.149
Neg logit contributions
 three-0.099
ch-0.086
 important-0.082
 Will-0.082
 3-0.081
Token his
Token ID465
Feature activation-1.14e+00

Pos logit contributions
pit+0.023
aren+0.022
awed+0.022
he+0.022
ement+0.022
Neg logit contributions
 Will-0.014
 three-0.012
 Ch-0.012
 crew-0.012
 pap-0.012
Token backyard
Token ID24296
Feature activation-2.47e+00

Pos logit contributions
he+0.028
aren+0.027
ement+0.027
 benches+0.026
pit+0.026
Neg logit contributions
 crew-0.036
 Will-0.036
 pap-0.036
 three-0.035
 Yoga-0.034
Token.
Token ID13
Feature activation-6.43e-01

Pos logit contributions
aren+0.084
 benches+0.079
he+0.076
awed+0.075
hel+0.075
Neg logit contributions
 three-0.060
 Will-0.054
 crew-0.054
 pap-0.053
 Ch-0.052
Token He
Token ID679
Feature activation-3.25e-01

Pos logit contributions
he+0.017
pit+0.017
 benches+0.016
case+0.016
aren+0.016
Neg logit contributions
 Will-0.020
 crew-0.019
 pap-0.018
 Chick-0.018
 three-0.018
Token books
Token ID3835
Feature activation-3.66e+00

Pos logit contributions
pit+0.041
aren+0.039
he+0.039
case+0.038
 benches+0.037
Neg logit contributions
 Will-0.032
 three-0.028
 Sp-0.027
 Chick-0.027
 Ch-0.027
Token.
Token ID13
Feature activation-8.59e-01

Pos logit contributions
 benches+0.114
pit+0.113
aren+0.112
 sandbox+0.112
awed+0.110
Neg logit contributions
 Will-0.095
 three-0.089
 louder-0.082
 Ch-0.081
free-0.080
Token She
Token ID1375
Feature activation-7.56e-01

Pos logit contributions
he+0.019
pit+0.018
 benches+0.018
aren+0.018
case+0.017
Neg logit contributions
 Will-0.031
 three-0.028
 Yoga-0.028
 pap-0.028
 Chick-0.027
Token had
Token ID550
Feature activation+2.39e-01

Pos logit contributions
pit+0.026
ement+0.026
he+0.026
 benches+0.025
aren+0.025
Neg logit contributions
 Will-0.019
 three-0.016
 pap-0.015
 Ch-0.015
 Chick-0.015
Token many
Token ID867
Feature activation-1.02e+00

Pos logit contributions
 Will+0.003
 three+0.003
 crew+0.002
 pap+0.002
 Chick+0.002
Neg logit contributions
he-0.011
pit-0.011
ement-0.010
aren-0.010
ue-0.010
Token books
Token ID3835
Feature activation-4.52e+00

Pos logit contributions
pit+0.035
he+0.035
aren+0.035
ue+0.034
hel+0.033
Neg logit contributions
 Will-0.023
 three-0.022
 important-0.021
 younger-0.021
 pap-0.020
Token in
Token ID287
Feature activation-1.19e+00

Pos logit contributions
 benches+0.173
awed+0.170
 arrest+0.166
 sandbox+0.166
pit+0.162
Neg logit contributions
 three-0.084
 for-0.072
 Will-0.072
 important-0.068
free-0.067
Token her
Token ID607
Feature activation-1.93e+00

Pos logit contributions
pit+0.038
awed+0.038
case+0.036
he+0.036
ement+0.036
Neg logit contributions
 Will-0.030
 three-0.029
 Yoga-0.026
 pap-0.026
 Sp-0.025
Token room
Token ID2119
Feature activation-2.51e+00

Pos logit contributions
he+0.055
aren+0.053
pit+0.053
 benches+0.053
ue+0.051
Neg logit contributions
 Will-0.054
 pap-0.053
 crew-0.052
 three-0.051
 Yoga-0.049
Token.
Token ID13
Feature activation-6.18e-01

Pos logit contributions
aren+0.086
 benches+0.081
icycle+0.081
ite+0.079
awed+0.078
Neg logit contributions
 three-0.059
 Will-0.051
 Gr-0.048
 pap-0.048
 3-0.047
Token She
Token ID1375
Feature activation-1.03e+00

Pos logit contributions
he+0.017
pit+0.017
 benches+0.017
case+0.016
aren+0.016
Neg logit contributions
 Will-0.018
 crew-0.017
 pap-0.017
 Chick-0.016
 three-0.016
Token to
Token ID284
Feature activation-2.19e-01

Pos logit contributions
he+0.036
pit+0.035
case+0.035
 benches+0.035
aren+0.034
Neg logit contributions
 Will-0.015
 crew-0.013
 three-0.012
 pap-0.011
 louder-0.011
Token playing
Token ID2712
Feature activation+2.85e-02

Pos logit contributions
 benches+0.006
he+0.006
pit+0.006
case+0.006
 bench+0.006
Neg logit contributions
 Will-0.006
 crew-0.006
 pap-0.006
 Chick-0.005
 Amen-0.005
Token with
Token ID351
Feature activation-2.54e-01

Pos logit contributions
 Will+0.001
 three+0.001
ch+0.001
 louder+0.001
 crew+0.001
Neg logit contributions
aren-0.001
ue-0.001
pit-0.001
he-0.001
case-0.001
Token his
Token ID465
Feature activation-3.12e-01

Pos logit contributions
pit+0.009
he+0.009
case+0.008
aren+0.008
 benches+0.008
Neg logit contributions
 Will-0.006
 crew-0.005
 pap-0.005
 three-0.005
 Chick-0.005
Token sand
Token ID6450
Feature activation-1.88e+00

Pos logit contributions
he+0.011
case+0.011
pit+0.011
aren+0.011
 benches+0.011
Neg logit contributions
 Will-0.007
 crew-0.007
 pap-0.006
 three-0.006
 Ch-0.006
Tokenpit
Token ID15544
Feature activation-4.51e+00

Pos logit contributions
aren+0.048
he+0.041
Lib+0.038
 Lib+0.037
awed+0.036
Neg logit contributions
 crew-0.064
 three-0.061
 Will-0.061
 pap-0.060
 louder-0.058
Token,
Token ID11
Feature activation+5.99e-02

Pos logit contributions
aren+0.154
awed+0.143
ue+0.143
 benches+0.140
he+0.135
Neg logit contributions
 three-0.115
 crew-0.102
 Will-0.101
 3-0.097
 louder-0.091
Token but
Token ID475
Feature activation-1.51e+00

Pos logit contributions
 Will+0.002
 three+0.002
 louder+0.002
 pap+0.002
 crew+0.002
Neg logit contributions
he-0.002
case-0.002
aren-0.002
ue-0.002
awed-0.002
Token he
Token ID339
Feature activation-2.90e-01

Pos logit contributions
he+0.043
pit+0.043
 benches+0.042
aren+0.041
case+0.040
Neg logit contributions
 Will-0.044
 crew-0.040
 pap-0.039
 three-0.038
 louder-0.038
Token could
Token ID714
Feature activation+3.99e-01

Pos logit contributions
 benches+0.012
pit+0.011
he+0.011
aren+0.011
ue+0.010
Neg logit contributions
 Will-0.006
 crew-0.005
 three-0.005
 pap-0.005
 3-0.004
Token not
Token ID407
Feature activation-1.23e+00

Pos logit contributions
 Will+0.007
 three+0.006
 help+0.006
 sure+0.005
 Ch+0.005
Neg logit contributions
awed-0.018
pit-0.017
aren-0.017
 benches-0.017
ement-0.017
Token outside
Token ID2354
Feature activation+6.52e-01

Pos logit contributions
aren+0.026
ue+0.025
he+0.025
case+0.025
pit+0.024
Neg logit contributions
 Will-0.016
 three-0.015
 louder-0.014
 3-0.013
 crew-0.013
Token.
Token ID13
Feature activation-4.85e-01

Pos logit contributions
 Will+0.022
 crew+0.020
 three+0.020
 louder+0.020
 pap+0.020
Neg logit contributions
aren-0.015
pit-0.015
he-0.015
case-0.015
 benches-0.015
Token He
Token ID679
Feature activation-1.62e-01

Pos logit contributions
case+0.011
 benches+0.011
pit+0.010
he+0.010
aren+0.010
Neg logit contributions
 crew-0.016
 Will-0.016
 pap-0.016
 Amen-0.015
 Chick-0.015
Token saw
Token ID2497
Feature activation-1.86e-01

Pos logit contributions
he+0.005
pit+0.005
 benches+0.005
aren+0.005
case+0.005
Neg logit contributions
 Will-0.004
 crew-0.004
 pap-0.004
 three-0.004
 Chick-0.004
Token a
Token ID257
Feature activation-5.24e-01

Pos logit contributions
 benches+0.007
he+0.007
pit+0.007
case+0.006
aren+0.006
Neg logit contributions
 Will-0.004
 crew-0.003
 pap-0.003
 Amen-0.003
ch-0.003
Token tr
Token ID491
Feature activation-4.50e+00

Pos logit contributions
he+0.019
pit+0.019
 benches+0.018
aren+0.018
case+0.018
Neg logit contributions
 Will-0.011
 crew-0.010
 pap-0.010
 three-0.009
ch-0.009
Tokenamp
Token ID696
Feature activation-2.34e+00

Pos logit contributions
 benches+0.090
 bench+0.086
 purse+0.082
 than+0.079
 Lib+0.076
Neg logit contributions
 Will-0.133
V-0.128
ch-0.128
 Sp-0.128
 July-0.125
Tokenoline
Token ID14453
Feature activation-1.46e+00

Pos logit contributions
aren+0.075
pit+0.074
 sandbox+0.072
 purse+0.070
 wrist+0.067
Neg logit contributions
 Will-0.055
ch-0.050
 Sp-0.048
 three-0.044
 July-0.042
Token!
Token ID0
Feature activation-4.95e-01

Pos logit contributions
aren+0.047
ue+0.043
handed+0.041
 benches+0.040
Lib+0.039
Neg logit contributions
 Will-0.040
 three-0.039
ch-0.036
 louder-0.036
 3-0.033
Token He
Token ID679
Feature activation-3.88e-01

Pos logit contributions
he+0.013
pit+0.013
 benches+0.013
case+0.012
aren+0.012
Neg logit contributions
 Will-0.015
 crew-0.014
 pap-0.014
 Chick-0.014
ch-0.013
Token was
Token ID373
Feature activation-8.43e-01

Pos logit contributions
he+0.011
pit+0.011
 benches+0.011
aren+0.011
case+0.011
Neg logit contributions
 Will-0.011
 crew-0.010
 pap-0.010
 three-0.010
ch-0.010
Token.
Token ID13
Feature activation-3.20e-01

Pos logit contributions
aren+0.057
 benches+0.052
he+0.052
ue+0.052
awed+0.052
Neg logit contributions
 three-0.052
 Will-0.052
 crew-0.049
 pap-0.047
 Yoga-0.047
Token He
Token ID679
Feature activation+1.30e-01

Pos logit contributions
he+0.010
pit+0.010
 benches+0.010
case+0.009
aren+0.009
Neg logit contributions
 Will-0.008
 crew-0.008
 pap-0.007
 Chick-0.007
 three-0.007
Token had
Token ID550
Feature activation+2.22e-01

Pos logit contributions
 Will+0.003
 crew+0.003
 pap+0.003
 Chick+0.003
 three+0.003
Neg logit contributions
he-0.004
pit-0.004
 benches-0.004
aren-0.004
case-0.004
Token a
Token ID257
Feature activation-6.92e-01

Pos logit contributions
 Will+0.004
 crew+0.004
 pap+0.004
 Amen+0.004
free+0.004
Neg logit contributions
 benches-0.008
he-0.008
pit-0.008
case-0.008
aren-0.008
Token colorful
Token ID20239
Feature activation-1.08e+00

Pos logit contributions
pit+0.023
he+0.023
case+0.022
 benches+0.022
aren+0.022
Neg logit contributions
 Will-0.016
 crew-0.015
 pap-0.014
free-0.014
 Chick-0.014
Token sandbox
Token ID35204
Feature activation-4.48e+00

Pos logit contributions
he+0.034
aren+0.033
 benches+0.032
hel+0.032
awed+0.032
Neg logit contributions
 crew-0.028
 three-0.028
 Will-0.027
 pap-0.027
ch-0.026
Token filled
Token ID5901
Feature activation+7.48e-01

Pos logit contributions
aren+0.183
ue+0.162
 benches+0.154
Susan+0.150
awed+0.149
Neg logit contributions
 three-0.098
 for-0.084
 a-0.077
 crew-0.074
 mate-0.073
Token with
Token ID351
Feature activation+4.08e-01

Pos logit contributions
 Will+0.024
 three+0.022
 Sp+0.022
free+0.021
 crew+0.021
Neg logit contributions
aren-0.018
pit-0.018
 benches-0.018
ement-0.017
case-0.017
Token shiny
Token ID22441
Feature activation-1.15e+00

Pos logit contributions
 Will+0.010
 three+0.010
 crew+0.010
 pap+0.009
 louder+0.009
Neg logit contributions
aren-0.013
he-0.013
case-0.013
ue-0.013
pit-0.013
Token toys
Token ID14958
Feature activation-2.20e+00

Pos logit contributions
aren+0.048
he+0.046
case+0.045
handed+0.044
ue+0.043
Neg logit contributions
 three-0.023
 potatoes-0.020
 Will-0.019
 crew-0.019
 sailors-0.019
Token and
Token ID290
Feature activation+4.85e-01

Pos logit contributions
aren+0.082
ue+0.078
awed+0.071
ement+0.068
he+0.067
Neg logit contributions
 three-0.055
 Will-0.049
 crew-0.045
 mate-0.044
 sailors-0.042
Token.
Token ID13
Feature activation-7.59e-01

Pos logit contributions
aren+0.055
 benches+0.052
he+0.052
hel+0.052
ement+0.051
Neg logit contributions
 three-0.046
 Will-0.043
 crew-0.041
 3-0.040
 pap-0.040
Token They
Token ID1119
Feature activation-1.07e+00

Pos logit contributions
he+0.023
pit+0.023
 benches+0.022
aren+0.021
ement+0.021
Neg logit contributions
 Will-0.022
 three-0.018
 Ch-0.018
 Yoga-0.018
 Sp-0.018
Token had
Token ID550
Feature activation+1.72e-01

Pos logit contributions
ement+0.038
pit+0.037
awed+0.037
aren+0.036
ue+0.035
Neg logit contributions
 Will-0.026
 three-0.023
 roasted-0.022
 crew-0.022
 pap-0.021
Token a
Token ID257
Feature activation-8.06e-01

Pos logit contributions
 Will+0.004
 crew+0.003
 pap+0.003
 Chick+0.003
 three+0.003
Neg logit contributions
he-0.006
pit-0.006
 benches-0.006
case-0.006
aren-0.006
Token big
Token ID1263
Feature activation-1.77e+00

Pos logit contributions
he+0.028
pit+0.028
 benches+0.027
case+0.027
aren+0.027
Neg logit contributions
 Will-0.018
 crew-0.016
 pap-0.015
 Chick-0.015
ch-0.015
Token sandbox
Token ID35204
Feature activation-4.48e+00

Pos logit contributions
he+0.057
awed+0.054
handed+0.052
hel+0.052
aren+0.051
Neg logit contributions
 three-0.047
 crew-0.046
 pap-0.045
 Will-0.044
 3-0.043
Token and
Token ID290
Feature activation-2.19e-01

Pos logit contributions
aren+0.153
icycle+0.140
he+0.136
chair+0.135
ue+0.133
Neg logit contributions
 three-0.108
 mate-0.101
 crew-0.095
 for-0.093
 louder-0.089
Token many
Token ID867
Feature activation-1.03e+00

Pos logit contributions
he+0.009
pit+0.009
aren+0.009
ement+0.009
case+0.009
Neg logit contributions
 Will-0.003
 crew-0.003
 three-0.003
 pap-0.003
 roasted-0.003
Token toys
Token ID14958
Feature activation-2.43e+00

Pos logit contributions
he+0.038
aren+0.036
case+0.035
hel+0.035
ue+0.034
Neg logit contributions
 three-0.023
 crew-0.022
 louder-0.021
 Will-0.021
 smaller-0.020
Token.
Token ID13
Feature activation-7.38e-01

Pos logit contributions
ue+0.089
aren+0.087
he+0.081
icycle+0.080
ement+0.080
Neg logit contributions
 three-0.048
 crew-0.045
 mate-0.044
 sw-0.042
 Will-0.040
Token They
Token ID1119
Feature activation-8.73e-01

Pos logit contributions
he+0.023
pit+0.022
 benches+0.022
case+0.022
aren+0.021
Neg logit contributions
 Will-0.019
 crew-0.018
 pap-0.017
 Chick-0.017
ch-0.017
Token books
Token ID3835
Feature activation-3.69e+00

Pos logit contributions
pit+0.041
aren+0.039
he+0.039
case+0.038
ue+0.036
Neg logit contributions
 Will-0.033
 three-0.029
 Sp-0.028
 Chick-0.027
 Ch-0.027
Token.
Token ID13
Feature activation-7.93e-01

Pos logit contributions
 benches+0.114
 sandbox+0.114
aren+0.113
pit+0.112
awed+0.111
Neg logit contributions
 Will-0.092
 three-0.089
 louder-0.081
 Ch-0.079
free-0.078
Token She
Token ID1375
Feature activation-6.96e-01

Pos logit contributions
he+0.018
pit+0.017
 benches+0.017
case+0.016
aren+0.016
Neg logit contributions
 Will-0.028
 crew-0.026
 pap-0.026
 Chick-0.025
ch-0.025
Token had
Token ID550
Feature activation+3.65e-01

Pos logit contributions
pit+0.025
ement+0.025
 benches+0.025
he+0.025
aren+0.024
Neg logit contributions
 Will-0.017
 three-0.014
 pap-0.013
 Sp-0.013
 Ch-0.013
Token many
Token ID867
Feature activation-9.77e-01

Pos logit contributions
 Will+0.005
 three+0.005
 crew+0.004
 pap+0.004
 Chick+0.004
Neg logit contributions
he-0.016
pit-0.016
ement-0.015
aren-0.015
ue-0.015
Token books
Token ID3835
Feature activation-4.47e+00

Pos logit contributions
he+0.034
aren+0.033
pit+0.033
ue+0.032
castle+0.032
Neg logit contributions
 Will-0.022
 three-0.021
 important-0.021
 younger-0.020
 3-0.019
Token in
Token ID287
Feature activation-1.37e+00

Pos logit contributions
 benches+0.162
awed+0.159
 sandbox+0.159
 arrest+0.156
pit+0.152
Neg logit contributions
 three-0.088
 for-0.078
 Will-0.075
free-0.072
 a-0.069
Token her
Token ID607
Feature activation-2.05e+00

Pos logit contributions
awed+0.044
pit+0.044
case+0.042
he+0.042
aren+0.041
Neg logit contributions
 Will-0.035
 three-0.034
 Yoga-0.030
 pap-0.030
 3-0.030
Token room
Token ID2119
Feature activation-2.46e+00

Pos logit contributions
he+0.063
aren+0.061
 benches+0.060
pit+0.060
ue+0.059
Neg logit contributions
 pap-0.053
 Will-0.053
 three-0.052
 crew-0.050
 3-0.049
Token.
Token ID13
Feature activation-4.67e-01

Pos logit contributions
aren+0.086
 benches+0.082
icycle+0.082
ite+0.080
orns+0.079
Neg logit contributions
 three-0.056
 Will-0.046
 3-0.044
 Gr-0.044
 pap-0.042
Token She
Token ID1375
Feature activation-1.02e+00

Pos logit contributions
he+0.014
pit+0.014
 benches+0.014
case+0.013
aren+0.013
Neg logit contributions
 Will-0.012
 crew-0.012
 pap-0.011
 Chick-0.011
ch-0.011
Token books
Token ID3835
Feature activation-3.39e+00

Pos logit contributions
pit+0.031
aren+0.029
he+0.029
case+0.029
 benches+0.027
Neg logit contributions
 Will-0.027
 three-0.024
 Sp-0.023
 Chick-0.023
 Ch-0.023
Token.
Token ID13
Feature activation-9.45e-01

Pos logit contributions
 benches+0.113
pit+0.112
 sandbox+0.112
aren+0.112
awed+0.110
Neg logit contributions
 Will-0.081
 three-0.076
 louder-0.071
 Ch-0.066
 Gr-0.066
Token She
Token ID1375
Feature activation-7.79e-01

Pos logit contributions
he+0.020
pit+0.019
 benches+0.019
aren+0.018
case+0.018
Neg logit contributions
 Will-0.035
 three-0.032
 Yoga-0.032
 pap-0.032
 crew-0.031
Token had
Token ID550
Feature activation+3.93e-01

Pos logit contributions
pit+0.030
ement+0.030
 benches+0.029
he+0.029
aren+0.028
Neg logit contributions
 Will-0.018
 three-0.015
 Ch-0.013
 Sp-0.013
 pap-0.013
Token many
Token ID867
Feature activation-8.59e-01

Pos logit contributions
 Will+0.005
 three+0.005
 pap+0.004
 crew+0.004
 3+0.004
Neg logit contributions
pit-0.018
he-0.018
ement-0.017
aren-0.017
ue-0.017
Token books
Token ID3835
Feature activation-4.43e+00

Pos logit contributions
he+0.031
pit+0.030
aren+0.029
handed+0.029
castle+0.029
Neg logit contributions
 three-0.019
 important-0.018
 Will-0.017
 younger-0.017
 questions-0.016
Token with
Token ID351
Feature activation-5.27e-01

Pos logit contributions
 benches+0.166
 arrest+0.166
awed+0.164
 sandbox+0.160
handed+0.156
Neg logit contributions
 three-0.088
 Will-0.074
 for-0.073
 a-0.065
 important-0.065
Token pictures
Token ID5986
Feature activation-1.33e+00

Pos logit contributions
pit+0.017
he+0.017
aren+0.016
case+0.016
ue+0.016
Neg logit contributions
 Will-0.014
 three-0.013
 pap-0.012
 Ch-0.012
 Chick-0.012
Token and
Token ID290
Feature activation-5.11e-01

Pos logit contributions
handed+0.051
aren+0.049
awed+0.048
icycle+0.046
pit+0.046
Neg logit contributions
 three-0.026
 Will-0.025
ch-0.022
 Ch-0.021
 Tw-0.020
Token words
Token ID2456
Feature activation-2.43e-02

Pos logit contributions
he+0.016
pit+0.015
case+0.015
aren+0.015
 benches+0.015
Neg logit contributions
 Will-0.014
 crew-0.013
 pap-0.012
 three-0.012
 Chick-0.012
Token.
Token ID13
Feature activation-6.17e-01

Pos logit contributions
pit+0.001
 benches+0.001
aren+0.001
he+0.001
awed+0.001
Neg logit contributions
 Will-0.001
 louder-0.001
 three-0.001
free-0.001
 crew-0.001
Token books
Token ID3835
Feature activation-3.52e+00

Pos logit contributions
pit+0.036
aren+0.034
he+0.034
case+0.033
 benches+0.032
Neg logit contributions
 Will-0.029
 three-0.025
 Sp-0.024
 Chick-0.024
 Ch-0.024
Token.
Token ID13
Feature activation-8.49e-01

Pos logit contributions
 benches+0.110
pit+0.109
aren+0.108
 sandbox+0.108
awed+0.107
Neg logit contributions
 Will-0.092
 three-0.086
 louder-0.078
 Ch-0.076
 Sp-0.076
Token She
Token ID1375
Feature activation-7.54e-01

Pos logit contributions
he+0.022
pit+0.022
 benches+0.021
case+0.021
aren+0.020
Neg logit contributions
 Will-0.027
 crew-0.025
 pap-0.024
 Chick-0.024
 three-0.024
Token had
Token ID550
Feature activation+3.64e-01

Pos logit contributions
pit+0.028
ement+0.028
he+0.027
 benches+0.027
aren+0.027
Neg logit contributions
 Will-0.018
 three-0.015
 pap-0.014
 Ch-0.014
 Sp-0.014
Token many
Token ID867
Feature activation-9.35e-01

Pos logit contributions
 Will+0.005
 three+0.004
 crew+0.004
 pap+0.004
 Chick+0.004
Neg logit contributions
he-0.016
pit-0.016
ement-0.015
aren-0.015
ue-0.015
Token books
Token ID3835
Feature activation-4.42e+00

Pos logit contributions
pit+0.032
he+0.032
aren+0.032
ue+0.031
castle+0.030
Neg logit contributions
 Will-0.021
 three-0.021
 important-0.020
 younger-0.019
 questions-0.018
Token on
Token ID319
Feature activation-2.55e+00

Pos logit contributions
 benches+0.164
awed+0.161
 sandbox+0.158
 arrest+0.155
pit+0.154
Neg logit contributions
 three-0.088
 Will-0.077
 for-0.077
 important-0.072
free-0.070
Token her
Token ID607
Feature activation-1.68e+00

Pos logit contributions
pit+0.097
awed+0.095
aren+0.094
ite+0.091
icycle+0.089
Neg logit contributions
 Will-0.056
 three-0.049
 Chick-0.046
 Yoga-0.046
 Sp-0.046
Token shelf
Token ID18316
Feature activation-2.84e+00

Pos logit contributions
aren+0.061
pit+0.058
he+0.057
awed+0.056
ue+0.055
Neg logit contributions
 pap-0.035
 Will-0.035
 three-0.035
 younger-0.033
 crew-0.032
Token in
Token ID287
Feature activation-4.89e-01

Pos logit contributions
aren+0.097
awed+0.097
 benches+0.096
icycle+0.089
ue+0.088
Neg logit contributions
 three-0.066
 Will-0.055
 Gr-0.053
 3-0.053
 Sp-0.050
Token her
Token ID607
Feature activation-1.58e+00

Pos logit contributions
pit+0.016
awed+0.016
aren+0.015
 benches+0.015
case+0.015
Neg logit contributions
 Will-0.012
 three-0.011
 pap-0.011
 Yoga-0.010
 3-0.010
Token books
Token ID3835
Feature activation-3.55e+00

Pos logit contributions
pit+0.036
aren+0.034
he+0.033
case+0.033
 benches+0.031
Neg logit contributions
 Will-0.030
 three-0.026
 Sp-0.025
 Chick-0.025
 Yoga-0.025
Token.
Token ID13
Feature activation-7.94e-01

Pos logit contributions
 benches+0.114
 sandbox+0.113
aren+0.113
pit+0.113
awed+0.110
Neg logit contributions
 Will-0.088
 three-0.083
 louder-0.075
 Ch-0.074
 Sp-0.074
Token She
Token ID1375
Feature activation-6.51e-01

Pos logit contributions
he+0.022
pit+0.022
 benches+0.021
case+0.021
aren+0.020
Neg logit contributions
 Will-0.023
 crew-0.022
 pap-0.021
 Chick-0.021
ch-0.021
Token had
Token ID550
Feature activation+4.77e-01

Pos logit contributions
pit+0.023
he+0.022
ement+0.022
 benches+0.022
aren+0.021
Neg logit contributions
 Will-0.016
 three-0.014
 pap-0.013
 Chick-0.013
 Sp-0.013
Token many
Token ID867
Feature activation-8.76e-01

Pos logit contributions
 Will+0.007
 three+0.006
 crew+0.006
 pap+0.006
 Chick+0.006
Neg logit contributions
he-0.020
pit-0.020
aren-0.019
 benches-0.019
ement-0.019
Token books
Token ID3835
Feature activation-4.41e+00

Pos logit contributions
pit+0.030
he+0.030
aren+0.030
ue+0.029
castle+0.029
Neg logit contributions
 Will-0.019
 three-0.019
 important-0.018
 younger-0.017
 questions-0.017
Token in
Token ID287
Feature activation-1.18e+00

Pos logit contributions
 benches+0.166
awed+0.161
 sandbox+0.160
 arrest+0.157
pit+0.156
Neg logit contributions
 three-0.088
 Will-0.078
 for-0.074
 important-0.070
free-0.070
Token her
Token ID607
Feature activation-1.92e+00

Pos logit contributions
pit+0.039
awed+0.039
case+0.037
aren+0.036
he+0.036
Neg logit contributions
 Will-0.029
 three-0.028
 Yoga-0.026
 Gr-0.025
 pap-0.025
Token room
Token ID2119
Feature activation-2.44e+00

Pos logit contributions
he+0.056
aren+0.055
pit+0.054
 benches+0.054
ue+0.053
Neg logit contributions
 Will-0.052
 pap-0.051
 three-0.050
 crew-0.049
 Yoga-0.047
Token.
Token ID13
Feature activation-5.53e-01

Pos logit contributions
aren+0.085
 benches+0.081
icycle+0.080
ite+0.078
orns+0.078
Neg logit contributions
 three-0.056
 Will-0.049
 Gr-0.046
 pap-0.045
 3-0.045
Token She
Token ID1375
Feature activation-9.70e-01

Pos logit contributions
he+0.017
pit+0.016
 benches+0.016
case+0.016
aren+0.016
Neg logit contributions
 Will-0.015
 crew-0.014
 pap-0.014
 Chick-0.013
ch-0.013
Token books
Token ID3835
Feature activation-3.50e+00

Pos logit contributions
pit+0.033
aren+0.031
he+0.031
case+0.030
 benches+0.030
Neg logit contributions
 Will-0.028
 three-0.025
 Sp-0.024
 Chick-0.024
 Ch-0.023
Token.
Token ID13
Feature activation-7.96e-01

Pos logit contributions
aren+0.111
 benches+0.110
pit+0.110
 sandbox+0.109
awed+0.107
Neg logit contributions
 Will-0.090
 three-0.083
 louder-0.078
 Ch-0.075
 Sp-0.073
Token She
Token ID1375
Feature activation-8.93e-01

Pos logit contributions
he+0.016
pit+0.016
 benches+0.016
aren+0.015
case+0.015
Neg logit contributions
 Will-0.029
 crew-0.027
 pap-0.027
 three-0.026
 Chick-0.026
Token had
Token ID550
Feature activation+3.48e-01

Pos logit contributions
pit+0.032
ement+0.032
he+0.031
 benches+0.031
aren+0.030
Neg logit contributions
 Will-0.022
 three-0.018
 pap-0.017
 Ch-0.017
V-0.017
Token many
Token ID867
Feature activation-8.78e-01

Pos logit contributions
 Will+0.005
 three+0.004
 crew+0.004
 pap+0.004
 Chick+0.004
Neg logit contributions
pit-0.015
he-0.015
aren-0.014
ement-0.014
 benches-0.014
Token books
Token ID3835
Feature activation-4.40e+00

Pos logit contributions
he+0.029
pit+0.029
aren+0.028
ue+0.027
case+0.027
Neg logit contributions
 Will-0.022
 three-0.021
 important-0.019
 younger-0.019
 crew-0.019
Token in
Token ID287
Feature activation-1.15e+00

Pos logit contributions
 benches+0.169
awed+0.165
 sandbox+0.161
 arrest+0.160
aren+0.158
Neg logit contributions
 three-0.085
 Will-0.074
 for-0.070
 important-0.066
free-0.064
Token her
Token ID607
Feature activation-1.92e+00

Pos logit contributions
pit+0.038
awed+0.037
he+0.035
case+0.035
aren+0.035
Neg logit contributions
 Will-0.030
 three-0.028
 Yoga-0.026
 pap-0.025
 Gr-0.025
Token room
Token ID2119
Feature activation-2.42e+00

Pos logit contributions
he+0.054
pit+0.053
aren+0.053
 benches+0.052
ue+0.050
Neg logit contributions
 Will-0.055
 pap-0.053
 crew-0.052
 three-0.052
 Yoga-0.049
Token.
Token ID13
Feature activation-6.36e-01

Pos logit contributions
aren+0.086
 benches+0.082
icycle+0.079
orns+0.079
ite+0.078
Neg logit contributions
 three-0.055
 Will-0.047
 Gr-0.044
 3-0.044
 pap-0.043
Token She
Token ID1375
Feature activation-1.15e+00

Pos logit contributions
he+0.017
pit+0.017
 benches+0.016
case+0.016
aren+0.016
Neg logit contributions
 Will-0.019
 crew-0.018
 pap-0.018
 Chick-0.017
 three-0.017
Token.
Token ID13
Feature activation-5.53e-01

Pos logit contributions
aren+0.030
awed+0.027
he+0.027
ue+0.027
icycle+0.026
Neg logit contributions
 three-0.026
 Will-0.025
 crew-0.025
 pap-0.024
 louder-0.023
Token They
Token ID1119
Feature activation-6.36e-01

Pos logit contributions
pit+0.017
he+0.017
 benches+0.017
case+0.016
ue+0.016
Neg logit contributions
 crew-0.014
 Will-0.014
free-0.013
 pap-0.013
ch-0.013
Token had
Token ID550
Feature activation+5.12e-01

Pos logit contributions
pit+0.020
he+0.020
 benches+0.020
ement+0.019
case+0.019
Neg logit contributions
 Will-0.017
 crew-0.015
 three-0.015
 pap-0.015
 Chick-0.014
Token a
Token ID257
Feature activation-5.85e-01

Pos logit contributions
 Will+0.010
 crew+0.009
 pap+0.009
 Amen+0.009
free+0.009
Neg logit contributions
 benches-0.019
pit-0.018
he-0.018
case-0.018
aren-0.018
Token big
Token ID1263
Feature activation-2.02e+00

Pos logit contributions
he+0.021
pit+0.021
 benches+0.021
case+0.020
aren+0.020
Neg logit contributions
 Will-0.012
 crew-0.011
 pap-0.011
 Chick-0.010
 three-0.010
Token sandbox
Token ID35204
Feature activation-4.40e+00

Pos logit contributions
he+0.060
awed+0.057
handed+0.055
hel+0.054
aren+0.053
Neg logit contributions
 three-0.057
 pap-0.056
 crew-0.054
 Will-0.053
 3-0.052
Token where
Token ID810
Feature activation+5.38e-01

Pos logit contributions
aren+0.163
handed+0.150
icycle+0.148
he+0.148
ement+0.145
Neg logit contributions
 three-0.091
 mate-0.088
 for-0.086
 crew-0.081
 pap-0.076
Token they
Token ID484
Feature activation-5.97e-01

Pos logit contributions
 Will+0.016
 crew+0.015
 pap+0.015
 Amen+0.014
ch+0.014
Neg logit contributions
 benches-0.015
he-0.014
pit-0.014
case-0.014
aren-0.013
Token made
Token ID925
Feature activation+1.36e+00

Pos logit contributions
he+0.020
pit+0.020
aren+0.019
 benches+0.019
ement+0.019
Neg logit contributions
 Will-0.015
 crew-0.013
 three-0.013
 pap-0.012
 roasted-0.012
Token castles
Token ID49203
Feature activation-7.97e-01

Pos logit contributions
 Will+0.029
 crew+0.026
 pap+0.025
 Chick+0.024
ch+0.024
Neg logit contributions
he-0.049
pit-0.048
 benches-0.047
case-0.047
aren-0.046
Token and
Token ID290
Feature activation-1.18e-01

Pos logit contributions
aren+0.028
ement+0.024
awed+0.024
handed+0.023
icycle+0.022
Neg logit contributions
 louder-0.021
 three-0.020
 crew-0.020
 mate-0.019
 smaller-0.019
Token When
Token ID1649
Feature activation-3.24e-01

Pos logit contributions
 Will+0.019
 pap+0.017
 crew+0.016
 Chick+0.016
 three+0.016
Neg logit contributions
he-0.025
pit-0.025
 benches-0.024
case-0.024
aren-0.023
Token they
Token ID484
Feature activation-1.30e+00

Pos logit contributions
pit+0.011
he+0.011
case+0.010
 benches+0.010
awed+0.010
Neg logit contributions
 Will-0.008
 pap-0.007
 crew-0.007
 Chick-0.007
 three-0.007
Token reached
Token ID4251
Feature activation+7.58e-01

Pos logit contributions
pit+0.047
ement+0.044
opl+0.044
 benches+0.044
 than+0.044
Neg logit contributions
 Will-0.033
 three-0.027
 roasted-0.025
 3-0.024
 Ch-0.024
Token the
Token ID262
Feature activation-3.30e-01

Pos logit contributions
 Will+0.013
 three+0.012
 pap+0.011
 Ch+0.011
 crew+0.010
Neg logit contributions
he-0.031
case-0.031
awed-0.031
pit-0.030
ement-0.030
Token sand
Token ID6450
Feature activation-1.75e+00

Pos logit contributions
he+0.009
aren+0.009
awed+0.009
hel+0.008
case+0.008
Neg logit contributions
 three-0.009
 Will-0.009
 crew-0.009
 3-0.009
 younger-0.008
Tokenpit
Token ID15544
Feature activation-4.39e+00

Pos logit contributions
aren+0.047
chair+0.043
he+0.043
case+0.042
opl+0.041
Neg logit contributions
 three-0.052
 Will-0.050
 mate-0.047
 sw-0.047
 mer-0.047
Token,
Token ID11
Feature activation-2.58e-02

Pos logit contributions
opl+0.166
orns+0.164
awed+0.154
aren+0.153
icycle+0.152
Neg logit contributions
 three-0.091
 Will-0.080
 3-0.072
 M-0.068
 pap-0.067
Token Jessie
Token ID43425
Feature activation+3.61e-01

Pos logit contributions
pit+0.001
he+0.001
case+0.001
 benches+0.001
awed+0.001
Neg logit contributions
 Will-0.001
 three-0.001
 pap-0.001
 crew-0.001
 Chick-0.001
Token could
Token ID714
Feature activation+4.05e-01

Pos logit contributions
 Will+0.010
 pap+0.009
 crew+0.009
 three+0.008
 Ch+0.008
Neg logit contributions
he-0.012
 benches-0.012
pit-0.011
case-0.011
aren-0.011
Token see
Token ID766
Feature activation-9.52e-01

Pos logit contributions
 Will+0.008
 three+0.007
 sure+0.006
 blow+0.006
 help+0.006
Neg logit contributions
orns-0.018
aren-0.017
awed-0.017
pit-0.017
ement-0.017
Token that
Token ID326
Feature activation+3.99e-01

Pos logit contributions
case+0.035
he+0.035
ue+0.035
pit+0.035
aren+0.034
Neg logit contributions
 Will-0.020
 three-0.018
 pap-0.017
 crew-0.016
 Ch-0.016
Token books
Token ID3835
Feature activation-3.35e+00

Pos logit contributions
pit+0.032
aren+0.030
he+0.030
case+0.029
ue+0.028
Neg logit contributions
 Will-0.022
 three-0.020
 Ch-0.019
 Yoga-0.018
 Sp-0.018
Token.
Token ID13
Feature activation-8.52e-01

Pos logit contributions
pit+0.106
 benches+0.106
aren+0.105
 sandbox+0.105
awed+0.104
Neg logit contributions
 Will-0.083
 three-0.079
 louder-0.074
 Yoga-0.070
 Ch-0.069
Token She
Token ID1375
Feature activation-7.34e-01

Pos logit contributions
he+0.020
pit+0.019
 benches+0.019
aren+0.018
case+0.018
Neg logit contributions
 Will-0.029
 crew-0.027
 pap-0.027
 three-0.026
 Chick-0.026
Token had
Token ID550
Feature activation+1.71e-01

Pos logit contributions
pit+0.027
ement+0.027
he+0.027
 benches+0.026
aren+0.025
Neg logit contributions
 Will-0.017
 three-0.015
 pap-0.014
 Ch-0.014
 Chick-0.013
Token many
Token ID867
Feature activation-1.05e+00

Pos logit contributions
 Will+0.002
 three+0.002
 crew+0.002
 pap+0.002
 Chick+0.002
Neg logit contributions
he-0.007
pit-0.007
ement-0.007
aren-0.007
ue-0.007
Token books
Token ID3835
Feature activation-4.39e+00

Pos logit contributions
he+0.036
pit+0.035
aren+0.035
castle+0.033
+0.033
Neg logit contributions
 three-0.024
 Will-0.024
 important-0.023
 younger-0.022
 pap-0.021
Token in
Token ID287
Feature activation-1.33e+00

Pos logit contributions
 benches+0.159
awed+0.158
 arrest+0.156
 sandbox+0.156
pit+0.151
Neg logit contributions
 three-0.088
 for-0.073
 Will-0.073
 important-0.070
free-0.067
Token her
Token ID607
Feature activation-1.97e+00

Pos logit contributions
awed+0.044
pit+0.044
case+0.042
ement+0.042
he+0.042
Neg logit contributions
 Will-0.033
 three-0.032
 pap-0.028
 Yoga-0.028
 3-0.028
Token room
Token ID2119
Feature activation-2.29e+00

Pos logit contributions
he+0.062
aren+0.061
 benches+0.060
pit+0.059
ement+0.058
Neg logit contributions
 pap-0.050
 three-0.048
 Will-0.048
 crew-0.047
 3-0.044
Token.
Token ID13
Feature activation-6.43e-01

Pos logit contributions
aren+0.080
 benches+0.076
icycle+0.075
awed+0.074
orns+0.074
Neg logit contributions
 three-0.053
 Will-0.043
 Gr-0.042
 3-0.042
 pap-0.041
Token Some
Token ID2773
Feature activation+8.22e-01

Pos logit contributions
he+0.021
pit+0.021
 benches+0.021
case+0.020
aren+0.020
Neg logit contributions
 Will-0.016
 crew-0.014
 pap-0.014
 Chick-0.013
 three-0.013
Token the
Token ID262
Feature activation-1.10e-01

Pos logit contributions
 Will+0.026
 crew+0.025
 pap+0.024
 Chick+0.023
ch+0.022
Neg logit contributions
he-0.037
 benches-0.036
pit-0.036
case-0.035
aren-0.034
Token beautiful
Token ID4950
Feature activation+1.05e+00

Pos logit contributions
he+0.004
 benches+0.003
case+0.003
pit+0.003
ement+0.003
Neg logit contributions
 Will-0.003
 pap-0.003
 Chick-0.002
 crew-0.002
 Sp-0.002
Token,
Token ID11
Feature activation+8.33e-01

Pos logit contributions
 Will+0.023
 crew+0.021
 pap+0.020
 three+0.020
ch+0.020
Neg logit contributions
pit-0.036
he-0.036
 benches-0.035
case-0.035
aren-0.035
Token colorful
Token ID20239
Feature activation+4.68e-01

Pos logit contributions
 Will+0.021
 crew+0.019
 three+0.019
 louder+0.018
 pap+0.018
Neg logit contributions
he-0.027
pit-0.027
case-0.026
aren-0.026
 benches-0.026
Token fire
Token ID2046
Feature activation+1.62e+00

Pos logit contributions
 Will+0.009
 crew+0.008
 three+0.008
 pap+0.007
ch+0.007
Neg logit contributions
pit-0.018
he-0.017
aren-0.017
case-0.017
 benches-0.017
Tokenwork
Token ID1818
Feature activation+3.43e+00

Pos logit contributions
 Will+0.027
 crew+0.024
 pap+0.022
 three+0.022
ch+0.021
Neg logit contributions
he-0.066
 benches-0.065
pit-0.064
aren-0.064
case-0.063
Token displays
Token ID11298
Feature activation+8.85e-01

Pos logit contributions
 Will+0.104
 crew+0.098
 three+0.093
ch+0.093
 pap+0.093
Neg logit contributions
aren-0.093
pit-0.092
he-0.091
 benches-0.090
case-0.087
Token spark
Token ID9009
Feature activation+8.68e-01

Pos logit contributions
 Will+0.023
 three+0.021
 crew+0.020
 louder+0.020
ch+0.020
Neg logit contributions
pit-0.028
aren-0.027
he-0.027
 benches-0.027
case-0.026
Token in
Token ID287
Feature activation-4.36e-01

Pos logit contributions
 Will+0.025
 crew+0.024
ch+0.023
 pap+0.023
 louder+0.023
Neg logit contributions
he-0.024
pit-0.024
aren-0.023
case-0.023
 benches-0.023
Token the
Token ID262
Feature activation-8.68e-01

Pos logit contributions
pit+0.016
he+0.016
case+0.015
aren+0.015
 benches+0.015
Neg logit contributions
 Will-0.009
 crew-0.008
 pap-0.008
 three-0.008
 Chick-0.008
Token sky
Token ID6766
Feature activation+1.34e+00

Pos logit contributions
pit+0.026
he+0.025
aren+0.025
 benches+0.024
case+0.024
Neg logit contributions
 Will-0.023
 crew-0.022
 three-0.022
ch-0.021
 pap-0.021
Token took
Token ID1718
Feature activation+2.14e+00

Pos logit contributions
 Will+0.031
 three+0.027
 crew+0.026
 pap+0.026
 Chick+0.025
Neg logit contributions
 benches-0.029
pit-0.029
case-0.029
he-0.028
ement-0.027
Token their
Token ID511
Feature activation+1.52e+00

Pos logit contributions
 Will+0.057
 crew+0.055
 pap+0.053
 Chick+0.050
free+0.050
Neg logit contributions
he-0.063
 benches-0.061
pit-0.060
aren-0.060
case-0.059
Token three
Token ID1115
Feature activation+1.43e+00

Pos logit contributions
 Will+0.032
 crew+0.030
 pap+0.028
 Chick+0.028
ch+0.028
Neg logit contributions
he-0.053
 benches-0.053
pit-0.052
aren-0.051
case-0.051
Token-
Token ID12
Feature activation+2.85e+00

Pos logit contributions
 Will+0.044
 crew+0.041
 three+0.040
 pap+0.040
ch+0.039
Neg logit contributions
he-0.037
pit-0.037
 benches-0.035
aren-0.035
case-0.035
Tokenyear
Token ID1941
Feature activation+2.72e+00

Pos logit contributions
 crew+0.080
 Will+0.078
 pap+0.078
 Chick+0.071
 louder+0.071
Neg logit contributions
he-0.084
aren-0.081
 benches-0.077
aw-0.073
pit-0.073
Token-
Token ID12
Feature activation+2.81e+00

Pos logit contributions
 Will+0.087
 crew+0.080
 pap+0.078
 Chick+0.077
ch+0.076
Neg logit contributions
he-0.068
pit-0.067
 benches-0.066
case-0.065
aren-0.064
Tokenold
Token ID727
Feature activation+2.14e+00

Pos logit contributions
 Will+0.099
 crew+0.094
 pap+0.092
 Chick+0.089
 louder+0.088
Neg logit contributions
he-0.062
aren-0.058
 benches-0.058
pit-0.058
case-0.056
Token daughter
Token ID4957
Feature activation+1.29e+00

Pos logit contributions
 Will+0.040
 crew+0.035
 pap+0.034
 three+0.032
 Chick+0.032
Neg logit contributions
he-0.083
pit-0.082
 benches-0.080
case-0.080
aren-0.079
Token to
Token ID284
Feature activation+1.93e+00

Pos logit contributions
 Will+0.032
 three+0.028
 crew+0.027
 pap+0.027
 3+0.026
Neg logit contributions
pit-0.044
he-0.043
 benches-0.042
case-0.042
aren-0.041
Token the
Token ID262
Feature activation+9.72e-02

Pos logit contributions
 Will+0.045
 crew+0.042
 pap+0.040
 Chick+0.039
ch+0.038
Neg logit contributions
he-0.065
pit-0.064
 benches-0.063
case-0.062
aren-0.061
Token circus
Token ID33125
Feature activation+6.00e-01

Pos logit contributions
 Will+0.004
 crew+0.004
 pap+0.004
 Chick+0.003
 three+0.003
Neg logit contributions
he-0.002
pit-0.002
 benches-0.002
case-0.002
aren-0.002
Token\n
Token ID198
Feature activation-2.09e-01

Pos logit contributions
 benches+0.012
case+0.011
pit+0.011
he+0.011
aren+0.011
Neg logit contributions
 crew-0.014
 Will-0.013
 pap-0.013
free-0.013
ch-0.013
Token\n
Token ID198
Feature activation-1.12e-01

Pos logit contributions
he+0.006
 benches+0.005
case+0.005
pit+0.005
aren+0.005
Neg logit contributions
 Will-0.006
 crew-0.006
 pap-0.006
 three-0.006
free-0.006
Token"
Token ID1
Feature activation-8.28e-01

Pos logit contributions
he+0.003
pit+0.003
 benches+0.003
case+0.003
aren+0.003
Neg logit contributions
 Will-0.003
 crew-0.003
 pap-0.003
 Chick-0.003
ch-0.003
TokenUh
Token ID34653
Feature activation-7.36e-02

Pos logit contributions
 benches+0.024
he+0.024
pit+0.023
aren+0.022
case+0.022
Neg logit contributions
 crew-0.023
 Will-0.022
 pap-0.022
 younger-0.020
ye-0.020
Token-
Token ID12
Feature activation+4.87e-01

Pos logit contributions
pit+0.001
he+0.001
aren+0.001
 benches+0.001
ue+0.001
Neg logit contributions
 Will-0.003
 younger-0.002
 crew-0.002
 pap-0.002
ksh-0.002
Tokenoh
Token ID1219
Feature activation+2.73e+00

Pos logit contributions
 crew+0.015
 Will+0.015
 pap+0.014
 younger+0.014
 Amen+0.013
Neg logit contributions
he-0.013
aren-0.012
pit-0.012
 benches-0.012
case-0.012
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
 Will+0.100
 crew+0.095
 pap+0.092
 Chick+0.090
 Yoga+0.089
Neg logit contributions
he-0.054
pit-0.054
 benches-0.052
case-0.050
aren-0.050
Token what
Token ID644
Feature activation-6.89e-01

Pos logit contributions
 Will+0.003
 crew+0.003
 pap+0.003
free+0.003
ch+0.003
Neg logit contributions
he-0.004
pit-0.003
 benches-0.003
case-0.003
aren-0.003
Token is
Token ID318
Feature activation-1.81e+00

Pos logit contributions
case+0.019
he+0.019
 benches+0.018
pit+0.018
aren+0.017
Neg logit contributions
 Will-0.019
 crew-0.019
 pap-0.019
free-0.018
 Amen-0.018
Token that
Token ID326
Feature activation+3.37e-01

Pos logit contributions
he+0.036
 benches+0.035
case+0.035
 bench+0.031
 purse+0.030
Neg logit contributions
 Will-0.066
 crew-0.063
ksh-0.062
Gr-0.062
 pap-0.062
Token?"
Token ID1701
Feature activation-1.82e-01

Pos logit contributions
 Will+0.010
 crew+0.009
 pap+0.009
free+0.009
 Chick+0.009
Neg logit contributions
he-0.010
case-0.009
pit-0.009
 benches-0.009
ue-0.009
Token his
Token ID465
Feature activation+7.68e-01

Pos logit contributions
 Will+0.026
 crew+0.024
 pap+0.023
 Chick+0.022
 three+0.022
Neg logit contributions
he-0.032
pit-0.031
 benches-0.031
case-0.030
aren-0.030
Token grandparents
Token ID28571
Feature activation+2.87e-01

Pos logit contributions
 Will+0.017
 crew+0.015
 Chick+0.015
 pap+0.015
ch+0.015
Neg logit contributions
he-0.026
 benches-0.025
case-0.025
pit-0.025
aren-0.024
Token would
Token ID561
Feature activation+1.21e+00

Pos logit contributions
 Will+0.008
 crew+0.007
 three+0.007
free+0.006
 pap+0.006
Neg logit contributions
pit-0.009
 benches-0.009
ue-0.009
he-0.009
case-0.009
Token practice
Token ID3357
Feature activation+1.02e+00

Pos logit contributions
 Will+0.027
 crew+0.024
 pap+0.023
 three+0.022
 Chick+0.022
Neg logit contributions
pit-0.044
 benches-0.043
he-0.043
aren-0.043
awed-0.042
Token the
Token ID262
Feature activation+4.71e-01

Pos logit contributions
 Will+0.026
 crew+0.024
 pap+0.023
 Chick+0.022
ch+0.022
Neg logit contributions
he-0.032
pit-0.032
 benches-0.031
case-0.030
aren-0.030
Token healing
Token ID11516
Feature activation+2.82e+00

Pos logit contributions
 Will+0.012
 Chick+0.011
 crew+0.011
 Amen+0.010
 pap+0.010
Neg logit contributions
 benches-0.015
 bench-0.014
he-0.014
case-0.014
pit-0.014
Token style
Token ID3918
Feature activation-1.48e-01

Pos logit contributions
 Will+0.065
 crew+0.060
 pap+0.059
ch+0.057
 three+0.056
Neg logit contributions
he-0.095
pit-0.095
 benches-0.093
aren-0.092
case-0.091
Token every
Token ID790
Feature activation+4.87e-01

Pos logit contributions
pit+0.004
aren+0.004
 benches+0.004
awed+0.004
ue+0.004
Neg logit contributions
 Will-0.004
 crew-0.004
free-0.004
 three-0.004
ch-0.004
Token day
Token ID1110
Feature activation+1.83e+00

Pos logit contributions
 Will+0.011
 crew+0.010
 pap+0.010
 Chick+0.010
V+0.010
Neg logit contributions
he-0.016
pit-0.015
case-0.015
 benches-0.015
aren-0.015
Token.
Token ID13
Feature activation+6.99e-01

Pos logit contributions
 Will+0.058
 crew+0.055
 three+0.053
free+0.052
 louder+0.051
Neg logit contributions
pit-0.050
he-0.047
aren-0.047
case-0.046
ue-0.045
Token<|endoftext|>
Token ID50256
Feature activation+1.03e-01

Pos logit contributions
 Will+0.018
 crew+0.017
 pap+0.016
 Chick+0.016
ch+0.016
Neg logit contributions
pit-0.022
he-0.021
 benches-0.021
case-0.021
aren-0.020
Token many
Token ID867
Feature activation-1.57e+00

Pos logit contributions
pit+0.041
aren+0.039
 benches+0.038
he+0.038
ue+0.038
Neg logit contributions
 Will-0.015
 three-0.013
 louder-0.012
 pap-0.011
 sure-0.011
Token different
Token ID1180
Feature activation+1.21e-03

Pos logit contributions
pit+0.059
he+0.058
case+0.057
 benches+0.056
aren+0.056
Neg logit contributions
 Will-0.032
 crew-0.028
 pap-0.027
 three-0.026
 Chick-0.026
Token kinds
Token ID6982
Feature activation-1.65e+00

Pos logit contributions
 Will+0.000
 three+0.000
 crew+0.000
 pap+0.000
 3+0.000
Neg logit contributions
he-0.000
case-0.000
pit-0.000
aren-0.000
ue-0.000
Token of
Token ID286
Feature activation-5.77e-01

Pos logit contributions
icycle+0.075
pit+0.073
ue+0.071
aren+0.071
 slams+0.069
Neg logit contributions
 three-0.022
 crew-0.018
 Will-0.017
 important-0.017
free-0.016
Token sa
Token ID473
Feature activation+6.02e-02

Pos logit contributions
he+0.019
pit+0.018
 benches+0.018
case+0.018
aren+0.018
Neg logit contributions
 Will-0.014
 crew-0.013
 pap-0.013
 Chick-0.012
ch-0.012
Tokenus
Token ID385
Feature activation+2.79e+00

Pos logit contributions
 Will+0.001
 crew+0.001
 pap+0.001
 Chick+0.001
ch+0.001
Neg logit contributions
he-0.003
pit-0.003
 benches-0.003
case-0.003
aren-0.003
Tokenages
Token ID1095
Feature activation+6.10e-01

Pos logit contributions
 Will+0.069
 crew+0.065
 pap+0.062
 Chick+0.060
ch+0.059
Neg logit contributions
he-0.090
pit-0.089
 benches-0.087
case-0.086
aren-0.085
Token everywhere
Token ID8347
Feature activation-1.62e+00

Pos logit contributions
 three+0.016
 Will+0.016
free+0.015
 crew+0.015
 roasted+0.015
Neg logit contributions
ue-0.018
aren-0.018
pit-0.017
icycle-0.017
 benches-0.016
Token she
Token ID673
Feature activation-2.40e-01

Pos logit contributions
aren+0.047
pit+0.044
awed+0.044
icycle+0.043
 benches+0.042
Neg logit contributions
 Will-0.049
 three-0.045
 Chick-0.044
free-0.044
 Sp-0.043
Token looked
Token ID3114
Feature activation+7.20e-01

Pos logit contributions
pit+0.008
he+0.008
 benches+0.008
aren+0.008
ement+0.008
Neg logit contributions
 Will-0.006
 crew-0.005
 three-0.005
 pap-0.005
 roasted-0.004
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
 Will+0.017
 three+0.016
 louder+0.016
 crew+0.016
free+0.015
Neg logit contributions
aren-0.024
ue-0.024
pit-0.024
he-0.024
case-0.023
Token John
Token ID1757
Feature activation-6.51e-01

Pos logit contributions
he+0.011
pit+0.011
 benches+0.011
aren+0.010
case+0.010
Neg logit contributions
 Will-0.012
 crew-0.011
 pap-0.011
 three-0.010
 Chick-0.010
Token realized
Token ID6939
Feature activation-2.07e+00

Pos logit contributions
 benches+0.021
he+0.021
pit+0.021
aren+0.020
case+0.020
Neg logit contributions
 Will-0.017
 crew-0.015
 three-0.015
 pap-0.015
 Chick-0.014
Token that
Token ID326
Feature activation+1.21e-01

Pos logit contributions
pit+0.071
he+0.069
case+0.067
ue+0.067
awed+0.067
Neg logit contributions
 Will-0.052
 crew-0.048
 three-0.045
 louder-0.045
 pap-0.045
Token measuring
Token ID15964
Feature activation-2.54e-01

Pos logit contributions
 Will+0.003
 crew+0.003
 pap+0.003
 three+0.003
 Sp+0.003
Neg logit contributions
he-0.004
pit-0.004
aren-0.004
 benches-0.004
case-0.004
Token things
Token ID1243
Feature activation-2.43e+00

Pos logit contributions
he+0.008
pit+0.008
aren+0.008
case+0.008
 benches+0.008
Neg logit contributions
 Will-0.006
 crew-0.005
 three-0.005
 pap-0.005
 louder-0.005
Token made
Token ID925
Feature activation+1.39e+00

Pos logit contributions
pit+0.090
aren+0.088
ue+0.084
awed+0.083
 benches+0.083
Neg logit contributions
 Will-0.053
free-0.048
 louder-0.044
 three-0.044
 sure-0.044
Token his
Token ID465
Feature activation-2.67e-01

Pos logit contributions
 Will+0.032
 crew+0.028
 three+0.028
 louder+0.027
 pap+0.027
Neg logit contributions
pit-0.050
 benches-0.048
aren-0.047
awed-0.047
he-0.046
Token heart
Token ID2612
Feature activation+1.82e+00

Pos logit contributions
aren+0.009
pit+0.009
he+0.009
awed+0.009
ue+0.009
Neg logit contributions
 Will-0.006
 crew-0.006
 three-0.006
 pap-0.006
 younger-0.005
Token happy
Token ID3772
Feature activation+2.19e-01

Pos logit contributions
 Will+0.041
 crew+0.038
 louder+0.038
 pap+0.037
 three+0.037
Neg logit contributions
aren-0.062
pit-0.061
 benches-0.061
he-0.060
awed-0.058
Token and
Token ID290
Feature activation+2.23e-02

Pos logit contributions
 Will+0.007
 louder+0.006
 crew+0.006
 three+0.006
 pap+0.006
Neg logit contributions
aren-0.007
he-0.006
 benches-0.006
ue-0.006
awed-0.006
Token cool
Token ID3608
Feature activation+2.96e-02

Pos logit contributions
 Will+0.001
 crew+0.001
 pap+0.001
 Chick+0.001
ch+0.001
Neg logit contributions
he-0.001
pit-0.001
 benches-0.001
case-0.001
aren-0.001
Token,
Token ID11
Feature activation+4.30e-01

Pos logit contributions
 Will+0.043
 crew+0.041
 pap+0.039
 three+0.038
ch+0.038
Neg logit contributions
pit-0.040
he-0.040
aren-0.040
 benches-0.040
ue-0.038
Token close
Token ID1969
Feature activation+7.52e-01

Pos logit contributions
 Will+0.009
 crew+0.008
 pap+0.008
 Chick+0.008
 Amen+0.007
Neg logit contributions
he-0.015
pit-0.015
 benches-0.015
case-0.015
aren-0.014
Token his
Token ID465
Feature activation+8.33e-01

Pos logit contributions
 Will+0.017
 three+0.016
 pap+0.015
 louder+0.015
 crew+0.015
Neg logit contributions
he-0.027
case-0.026
aren-0.026
ue-0.026
pit-0.026
Token eyes
Token ID2951
Feature activation+1.03e+00

Pos logit contributions
 Will+0.026
 crew+0.024
 pap+0.023
 Chick+0.023
ch+0.023
Neg logit contributions
he-0.021
pit-0.021
 benches-0.021
case-0.020
aren-0.020
Token,
Token ID11
Feature activation+2.97e-01

Pos logit contributions
 Will+0.036
 three+0.035
 louder+0.034
free+0.033
ch+0.033
Neg logit contributions
aren-0.024
pit-0.022
ue-0.021
 benches-0.021
he-0.020
Token and
Token ID290
Feature activation+5.80e-01

Pos logit contributions
 Will+0.010
 three+0.009
 pap+0.009
 louder+0.009
 crew+0.009
Neg logit contributions
aren-0.008
he-0.008
ue-0.008
pit-0.008
case-0.008
Token spread
Token ID4104
Feature activation+1.51e+00

Pos logit contributions
 Will+0.015
 crew+0.014
 pap+0.013
 Chick+0.013
ch+0.013
Neg logit contributions
he-0.018
pit-0.018
 benches-0.018
case-0.017
aren-0.017
Token his
Token ID465
Feature activation-1.43e-01

Pos logit contributions
 Will+0.040
 crew+0.039
 pap+0.037
 Chick+0.036
 Amen+0.035
Neg logit contributions
he-0.044
 benches-0.044
pit-0.043
case-0.042
aren-0.040
Token arms
Token ID5101
Feature activation+9.18e-01

Pos logit contributions
he+0.004
 benches+0.004
case+0.004
pit+0.004
 purse+0.004
Neg logit contributions
 Will-0.004
 crew-0.003
 Chick-0.003
 pap-0.003
V-0.003
Token wide
Token ID3094
Feature activation-1.69e-01

Pos logit contributions
 Will+0.026
 crew+0.024
free+0.024
 three+0.024
 louder+0.024
Neg logit contributions
pit-0.026
 benches-0.025
aren-0.025
he-0.025
ue-0.024
Token.
Token ID13
Feature activation+3.49e-01

Pos logit contributions
aren+0.004
he+0.004
pit+0.004
ue+0.004
 benches+0.004
Neg logit contributions
 Will-0.005
 three-0.005
 louder-0.005
 crew-0.005
free-0.005
Token she
Token ID673
Feature activation+2.05e-01

Pos logit contributions
 Will+0.006
 three+0.005
 louder+0.005
 pap+0.005
 sure+0.005
Neg logit contributions
he-0.008
icycle-0.007
awed-0.007
case-0.007
orns-0.007
Token gave
Token ID2921
Feature activation+4.24e-01

Pos logit contributions
 Will+0.006
 crew+0.005
 pap+0.005
free+0.005
 Chick+0.005
Neg logit contributions
he-0.006
pit-0.006
 benches-0.006
case-0.006
aren-0.006
Token him
Token ID683
Feature activation+3.20e-03

Pos logit contributions
 Will+0.011
 three+0.010
 louder+0.010
 crew+0.010
 pap+0.009
Neg logit contributions
case-0.013
pit-0.013
he-0.013
 benches-0.013
aren-0.012
Token a
Token ID257
Feature activation+1.63e+00

Pos logit contributions
 Will+0.000
 crew+0.000
 pap+0.000
 Chick+0.000
ch+0.000
Neg logit contributions
he-0.000
pit-0.000
 benches-0.000
case-0.000
aren-0.000
Token big
Token ID1263
Feature activation-5.40e-01

Pos logit contributions
 Will+0.041
 crew+0.040
 Chick+0.037
 Amen+0.037
 Ch+0.036
Neg logit contributions
he-0.051
case-0.048
 bench-0.048
pit-0.047
 purse-0.046
Token hug
Token ID16225
Feature activation+3.76e-01

Pos logit contributions
he+0.015
pit+0.015
 benches+0.014
case+0.014
aren+0.014
Neg logit contributions
 Will-0.016
 crew-0.015
 pap-0.014
 Chick-0.014
free-0.014
Token and
Token ID290
Feature activation+9.52e-01

Pos logit contributions
 Will+0.013
 three+0.012
free+0.011
 louder+0.011
 crew+0.011
Neg logit contributions
 benches-0.009
aren-0.009
pit-0.009
ue-0.009
awed-0.009
Token said
Token ID531
Feature activation+5.29e-01

Pos logit contributions
 Will+0.024
 crew+0.022
 pap+0.022
 Chick+0.021
 three+0.021
Neg logit contributions
he-0.031
pit-0.030
 benches-0.030
case-0.029
aren-0.029
Token,
Token ID11
Feature activation+1.35e+00

Pos logit contributions
 Will+0.019
 crew+0.019
 pap+0.018
ch+0.018
V+0.018
Neg logit contributions
he-0.010
pit-0.010
 benches-0.010
case-0.010
aren-0.009
Token "
Token ID366
Feature activation+6.56e-01

Pos logit contributions
 Will+0.026
 crew+0.025
 pap+0.023
ch+0.022
 Chick+0.022
Neg logit contributions
he-0.049
pit-0.048
 benches-0.047
case-0.047
aren-0.046
TokenThank
Token ID10449
Feature activation-1.19e-01

Pos logit contributions
 Will+0.018
 crew+0.016
ch+0.016
 pap+0.016
 three+0.015
Neg logit contributions
pit-0.020
 benches-0.020
he-0.019
case-0.019
aren-0.018
Token Dave
Token ID9935
Feature activation+1.49e-01

Pos logit contributions
 Will+0.003
 Yoga+0.002
 three+0.002
 Fish+0.002
 names+0.002
Neg logit contributions
he-0.007
icycle-0.007
 purse-0.007
pit-0.007
ite-0.007
Token was
Token ID373
Feature activation+1.52e-02

Pos logit contributions
 Will+0.004
 crew+0.003
 three+0.003
 pap+0.003
 roasted+0.003
Neg logit contributions
 benches-0.005
pit-0.005
he-0.005
ement-0.005
aren-0.005
Token a
Token ID257
Feature activation-1.61e-01

Pos logit contributions
ksh+0.000
 Will+0.000
 crew+0.000
ye+0.000
ch+0.000
Neg logit contributions
he-0.000
case-0.000
pit-0.000
aren-0.000
 Lib-0.000
Token big
Token ID1263
Feature activation-1.68e+00

Pos logit contributions
pit+0.005
he+0.005
 bench+0.005
case+0.005
 sandbox+0.005
Neg logit contributions
 Will-0.004
free-0.003
ksh-0.003
 Him-0.003
ch-0.003
Token,
Token ID11
Feature activation+4.00e-01

Pos logit contributions
he+0.053
 benches+0.052
case+0.051
pit+0.051
awed+0.051
Neg logit contributions
 Will-0.044
 crew-0.041
 three-0.041
 pap-0.040
 3-0.038
Token strong
Token ID1913
Feature activation+5.50e-01

Pos logit contributions
 Will+0.007
 crew+0.007
 pap+0.006
 three+0.006
ch+0.006
Neg logit contributions
he-0.015
pit-0.015
 benches-0.015
aren-0.015
case-0.015
Token bear
Token ID6842
Feature activation+3.14e-01

Pos logit contributions
 Will+0.013
 three+0.013
 crew+0.013
 pap+0.012
 3+0.012
Neg logit contributions
he-0.018
 benches-0.018
awed-0.018
case-0.017
pit-0.017
Token and
Token ID290
Feature activation+9.32e-01

Pos logit contributions
 Will+0.009
 crew+0.009
 three+0.008
 Ch+0.008
 Sp+0.008
Neg logit contributions
 benches-0.009
he-0.009
pit-0.009
aren-0.009
ement-0.009
Token Amy
Token ID14235
Feature activation-5.47e-01

Pos logit contributions
 Will+0.030
 crew+0.027
 pap+0.027
 three+0.026
 Chick+0.026
Neg logit contributions
he-0.025
pit-0.024
 benches-0.024
aren-0.023
case-0.022
Token was
Token ID373
Feature activation+5.23e-02

Pos logit contributions
 benches+0.022
pit+0.021
he+0.021
aren+0.020
ement+0.020
Neg logit contributions
 Will-0.011
 three-0.009
 Ch-0.009
 pap-0.009
 crew-0.009
Token a
Token ID257
Feature activation+3.62e-02

Pos logit contributions
 Will+0.001
ksh+0.001
 crew+0.001
ye+0.001
ch+0.001
Neg logit contributions
he-0.001
case-0.001
pit-0.001
aren-0.001
 benches-0.001
Token waiting
Token ID4953
Feature activation-3.20e-01

Pos logit contributions
awed+0.046
 benches+0.046
opl+0.044
aren+0.044
icycle+0.044
Neg logit contributions
 Will-0.019
 three-0.017
 crew-0.016
 pap-0.015
 3-0.014
Token.
Token ID13
Feature activation+9.36e-01

Pos logit contributions
 benches+0.014
aren+0.014
he+0.014
awed+0.014
pit+0.014
Neg logit contributions
 Will-0.004
 three-0.004
 pap-0.003
 crew-0.003
 louder-0.003
Token They
Token ID1119
Feature activation+1.32e-01

Pos logit contributions
 Will+0.025
 crew+0.023
 pap+0.022
 Chick+0.021
ch+0.021
Neg logit contributions
he-0.029
pit-0.028
 benches-0.028
case-0.027
aren-0.027
Token walked
Token ID6807
Feature activation+3.69e-01

Pos logit contributions
 Will+0.004
 crew+0.003
 pap+0.003
 Chick+0.003
 three+0.003
Neg logit contributions
he-0.004
pit-0.004
 benches-0.004
case-0.004
aren-0.004
Token her
Token ID607
Feature activation+5.85e-01

Pos logit contributions
 Will+0.009
 three+0.008
 crew+0.008
 louder+0.008
ch+0.008
Neg logit contributions
pit-0.012
he-0.012
 benches-0.012
case-0.012
aren-0.012
Token to
Token ID284
Feature activation+1.79e+00

Pos logit contributions
 Will+0.015
 pap+0.014
 three+0.014
 3+0.014
 crew+0.014
Neg logit contributions
he-0.019
aren-0.018
ement-0.018
ue-0.018
opl-0.017
Token the
Token ID262
Feature activation+1.56e-02

Pos logit contributions
 Will+0.041
 crew+0.038
 pap+0.036
ch+0.036
free+0.035
Neg logit contributions
he-0.060
 benches-0.059
pit-0.058
aren-0.057
case-0.057
Token kitchen
Token ID9592
Feature activation+4.80e-01

Pos logit contributions
 Will+0.000
 crew+0.000
 pap+0.000
 three+0.000
 Chick+0.000
Neg logit contributions
he-0.000
pit-0.000
 benches-0.000
aren-0.000
case-0.000
Token and
Token ID290
Feature activation+5.98e-01

Pos logit contributions
 three+0.015
 Will+0.014
 pap+0.013
 3+0.013
 crew+0.013
Neg logit contributions
aren-0.013
he-0.013
ement-0.013
 benches-0.012
ue-0.012
Token showed
Token ID3751
Feature activation+1.31e+00

Pos logit contributions
 Will+0.017
 crew+0.015
 pap+0.015
 Chick+0.015
 three+0.014
Neg logit contributions
he-0.018
pit-0.018
 benches-0.017
case-0.017
aren-0.017
Token her
Token ID607
Feature activation+3.04e-01

Pos logit contributions
 Will+0.043
 crew+0.041
 pap+0.040
 Chick+0.038
ch+0.038
Neg logit contributions
he-0.032
pit-0.031
 benches-0.031
case-0.030
aren-0.030
Token he
Token ID339
Feature activation-1.29e+00

Pos logit contributions
he+0.007
pit+0.007
 benches+0.007
case+0.007
aren+0.007
Neg logit contributions
 Will-0.006
 crew-0.006
 pap-0.006
 Chick-0.005
 three-0.005
Token finished
Token ID5201
Feature activation-1.61e+00

Pos logit contributions
pit+0.048
 benches+0.048
he+0.046
case+0.045
ears+0.045
Neg logit contributions
 Will-0.031
 crew-0.027
 pap-0.025
 three-0.025
 Ch-0.023
Token,
Token ID11
Feature activation-9.12e-01

Pos logit contributions
he+0.052
case+0.052
pit+0.049
aren+0.049
 benches+0.049
Neg logit contributions
 Will-0.043
 pap-0.039
 crew-0.038
 three-0.038
 louder-0.037
Token he
Token ID339
Feature activation+1.99e-01

Pos logit contributions
pit+0.031
he+0.031
 benches+0.030
case+0.030
aren+0.029
Neg logit contributions
 Will-0.022
 crew-0.020
 pap-0.019
 three-0.019
 louder-0.018
Token bent
Token ID17157
Feature activation-4.97e-01

Pos logit contributions
 Will+0.005
 crew+0.005
 pap+0.004
 three+0.004
 louder+0.004
Neg logit contributions
he-0.007
 benches-0.007
pit-0.007
aren-0.006
case-0.006
Token down
Token ID866
Feature activation-1.36e+00

Pos logit contributions
case+0.017
he+0.016
ue+0.016
aren+0.015
 benches+0.015
Neg logit contributions
 Will-0.012
 three-0.011
 louder-0.011
 pap-0.011
 crew-0.011
Token and
Token ID290
Feature activation+3.28e-01

Pos logit contributions
pit+0.035
case+0.035
ue+0.034
aren+0.033
 benches+0.033
Neg logit contributions
 Will-0.045
 three-0.043
 louder-0.042
 crew-0.040
 pap-0.039
Token whispered
Token ID25029
Feature activation-2.37e-01

Pos logit contributions
 Will+0.009
 crew+0.009
 pap+0.008
 Chick+0.008
ch+0.008
Neg logit contributions
he-0.009
pit-0.009
 benches-0.009
case-0.009
aren-0.009
Token something
Token ID1223
Feature activation+9.73e-01

Pos logit contributions
pit+0.006
 benches+0.006
he+0.006
case+0.006
ement+0.006
Neg logit contributions
 Will-0.007
 crew-0.006
V-0.006
ch-0.006
 Chick-0.006
Token wise
Token ID10787
Feature activation+1.76e+00

Pos logit contributions
 Will+0.028
 crew+0.026
 pap+0.025
 Chick+0.024
ch+0.024
Neg logit contributions
he-0.028
pit-0.027
 benches-0.027
case-0.026
aren-0.026
Token in
Token ID287
Feature activation+1.73e-01

Pos logit contributions
 Will+0.062
 crew+0.058
 pap+0.057
 three+0.055
 Chick+0.055
Neg logit contributions
he-0.039
pit-0.038
 benches-0.037
case-0.036
aren-0.036
TokenBen
Token ID11696
Feature activation-1.46e-01

Pos logit contributions
 Will+0.012
 crew+0.012
 pap+0.011
 Amen+0.011
 Chick+0.011
Neg logit contributions
he-0.010
 benches-0.009
case-0.009
pit-0.009
aren-0.009
Token and
Token ID290
Feature activation+4.25e-01

Pos logit contributions
ement+0.004
aren+0.004
pit+0.004
he+0.004
 Lib+0.004
Neg logit contributions
 Will-0.005
 Ch-0.004
 Bear-0.004
 three-0.004
ch-0.004
Token Lily
Token ID20037
Feature activation-7.97e-01

Pos logit contributions
cient+0.016
 crew+0.016
 Amen+0.016
ksh+0.016
 pap+0.016
Neg logit contributions
 benches-0.007
 bench-0.007
 Sue-0.006
 purse-0.006
he-0.006
Token screamed
Token ID25421
Feature activation-1.57e+00

Pos logit contributions
pit+0.028
ement+0.027
aren+0.027
icycle+0.026
ue+0.026
Neg logit contributions
 Will-0.019
 three-0.018
 Ch-0.018
 sw-0.017
 qu-0.016
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
he+0.029
pit+0.028
 benches+0.027
case+0.026
aren+0.025
Neg logit contributions
 Will-0.060
 crew-0.058
 pap-0.057
 Chick-0.055
ch-0.055
Token They
Token ID1119
Feature activation-1.46e+00

Pos logit contributions
he+0.030
 benches+0.030
pit+0.030
case+0.029
aren+0.028
Neg logit contributions
 Will-0.031
 crew-0.030
 pap-0.029
ch-0.029
free-0.028
Token were
Token ID547
Feature activation-1.51e+00

Pos logit contributions
pit+0.053
ement+0.050
aren+0.048
castle+0.046
he+0.046
Neg logit contributions
 Will-0.037
 Ch-0.034
 three-0.032
 Chick-0.031
 crew-0.031
Token hurt
Token ID5938
Feature activation-5.07e-01

Pos logit contributions
pit+0.052
he+0.052
aren+0.050
 benches+0.050
ue+0.049
Neg logit contributions
 Will-0.035
 crew-0.032
 pap-0.031
 louder-0.030
 three-0.030
Token by
Token ID416
Feature activation-1.30e+00

Pos logit contributions
case+0.014
pit+0.014
aren+0.014
he+0.013
ue+0.013
Neg logit contributions
 louder-0.016
 Will-0.016
 three-0.015
free-0.015
 Ch-0.014
Token the
Token ID262
Feature activation-8.85e-01

Pos logit contributions
 benches+0.040
he+0.038
pit+0.037
 bench+0.036
aren+0.035
Neg logit contributions
 Will-0.034
 crew-0.032
ch-0.031
 Amen-0.031
ye-0.031
Token blocks
Token ID7021
Feature activation-2.30e+00

Pos logit contributions
pit+0.031
he+0.031
aren+0.030
 benches+0.030
case+0.030
Neg logit contributions
 Will-0.019
 crew-0.018
 pap-0.016
 louder-0.016
 three-0.016
Token ship
Token ID4074
Feature activation+1.46e+00

Pos logit contributions
 benches+0.004
he+0.004
pit+0.004
 bench+0.004
case+0.004
Neg logit contributions
 Chick-0.004
 Will-0.004
 Yoga-0.004
ksh-0.004
V-0.004
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
 Will+0.048
 crew+0.045
 pap+0.044
 Chick+0.044
ch+0.043
Neg logit contributions
he-0.035
pit-0.034
 benches-0.033
case-0.033
aren-0.031
Token (space)
Token ID220
Feature activation+1.38e+00

Pos logit contributions
 crew+0.032
 Will+0.032
 pap+0.031
free+0.030
ch+0.030
Neg logit contributions
he-0.031
 benches-0.030
case-0.030
pit-0.030
aren-0.029
Token\n
Token ID198
Feature activation+6.87e-01

Pos logit contributions
 Will+0.041
 crew+0.039
 pap+0.038
 Chick+0.036
ch+0.036
Neg logit contributions
he-0.038
case-0.036
pit-0.036
 benches-0.036
aren-0.035
Token\n
Token ID198
Feature activation+7.71e-01

Pos logit contributions
 Will+0.022
 crew+0.022
 pap+0.022
ksh+0.020
 roasted+0.020
Neg logit contributions
he-0.017
case-0.016
aren-0.015
ue-0.015
 benches-0.015
Token"
Token ID1
Feature activation-9.97e-01

Pos logit contributions
 Will+0.025
 pap+0.024
 crew+0.023
 younger+0.023
 roasted+0.022
Neg logit contributions
he-0.018
case-0.017
ue-0.017
aren-0.017
pit-0.016
TokenI
Token ID40
Feature activation-3.40e-01

Pos logit contributions
pit+0.029
he+0.029
 benches+0.028
case+0.028
aren+0.027
Neg logit contributions
 Will-0.028
 crew-0.026
 pap-0.025
 three-0.024
 Chick-0.024
Token'm
Token ID1101
Feature activation+9.74e-01

Pos logit contributions
pit+0.013
 benches+0.013
he+0.013
aren+0.012
awed+0.012
Neg logit contributions
 Will-0.007
 crew-0.006
 pap-0.006
 Ch-0.005
 Chick-0.005
Token the
Token ID262
Feature activation-9.03e-01

Pos logit contributions
 Will+0.024
 crew+0.023
 pap+0.022
V+0.022
ch+0.021
Neg logit contributions
he-0.030
pit-0.029
 benches-0.029
case-0.029
aren-0.028
Token wind
Token ID2344
Feature activation+1.79e+00

Pos logit contributions
he+0.031
 benches+0.031
pit+0.031
case+0.030
ement+0.029
Neg logit contributions
 Will-0.019
 Chick-0.017
V-0.016
 Yoga-0.016
free-0.016
Token!"
Token ID2474
Feature activation+1.42e-01

Pos logit contributions
 Will+0.054
 crew+0.051
 pap+0.049
 three+0.049
free+0.048
Neg logit contributions
 benches-0.048
pit-0.046
he-0.046
aren-0.045
case-0.045
Token love
Token ID1842
Feature activation-7.05e-01

Pos logit contributions
 Will+0.027
 crew+0.027
 pap+0.025
 Chick+0.024
 three+0.023
Neg logit contributions
he-0.035
case-0.033
pit-0.033
 benches-0.032
aren-0.032
Token you
Token ID345
Feature activation+1.31e-01

Pos logit contributions
 benches+0.019
ries+0.019
 bench+0.017
he+0.017
aren+0.016
Neg logit contributions
 crew-0.020
V-0.019
free-0.018
fortable-0.018
ye-0.018
Token.
Token ID13
Feature activation-2.49e-01

Pos logit contributions
 Will+0.005
 crew+0.005
 pap+0.005
ch+0.005
V+0.005
Neg logit contributions
he-0.002
pit-0.002
 benches-0.002
case-0.002
aren-0.002
Token He
Token ID679
Feature activation+8.92e-02

Pos logit contributions
he+0.005
case+0.005
 benches+0.005
pit+0.005
aren+0.005
Neg logit contributions
 crew-0.008
 pap-0.008
 Will-0.008
 roasted-0.008
ch-0.008
Token likes
Token ID7832
Feature activation-1.59e-01

Pos logit contributions
 Will+0.002
 pap+0.002
 crew+0.002
 Chick+0.002
ch+0.002
Neg logit contributions
pit-0.003
he-0.003
 benches-0.003
aren-0.003
case-0.003
Token to
Token ID284
Feature activation-2.66e-01

Pos logit contributions
 benches+0.004
 bench+0.004
ries+0.004
he+0.004
 than+0.004
Neg logit contributions
 crew-0.004
V-0.004
 Amen-0.004
free-0.004
hetic-0.004
Token see
Token ID766
Feature activation+6.42e-01

Pos logit contributions
he+0.009
pit+0.009
 benches+0.009
case+0.009
aren+0.009
Neg logit contributions
 Will-0.006
 crew-0.005
 pap-0.005
 louder-0.005
 three-0.005
Token them
Token ID606
Feature activation-8.70e-01

Pos logit contributions
 Will+0.014
 crew+0.013
 pap+0.013
 Chick+0.012
ch+0.012
Neg logit contributions
he-0.022
pit-0.022
 benches-0.022
case-0.021
aren-0.021
Token happy
Token ID3772
Feature activation+2.42e-01

Pos logit contributions
aren+0.021
pit+0.020
he+0.020
 benches+0.020
case+0.020
Neg logit contributions
 Will-0.029
 crew-0.027
 louder-0.026
 pap-0.026
 three-0.026
Token and
Token ID290
Feature activation-5.71e-02

Pos logit contributions
 Will+0.010
 crew+0.009
 pap+0.009
 three+0.009
 louder+0.009
Neg logit contributions
aren-0.004
he-0.004
case-0.004
pit-0.004
 benches-0.004
Token dry
Token ID5894
Feature activation-3.63e-02

Pos logit contributions
 benches+0.002
he+0.002
pit+0.002
case+0.002
 bench+0.002
Neg logit contributions
 Will-0.001
 crew-0.001
ch-0.001
V-0.001
 Chick-0.001
Token make
Token ID787
Feature activation+7.73e-01

Pos logit contributions
pit+0.034
he+0.033
 benches+0.033
aren+0.033
case+0.032
Neg logit contributions
 Will-0.021
 three-0.019
 crew-0.019
 pap-0.018
ch-0.018
Token a
Token ID257
Feature activation+3.25e-01

Pos logit contributions
 Will+0.018
 crew+0.017
 Amen+0.017
free+0.016
 pap+0.016
Neg logit contributions
 benches-0.024
he-0.024
pit-0.024
case-0.023
ement-0.022
Token square
Token ID6616
Feature activation+2.50e-01

Pos logit contributions
 Will+0.009
 crew+0.008
 pap+0.008
 Chick+0.008
free+0.008
Neg logit contributions
he-0.009
pit-0.009
 benches-0.009
case-0.009
aren-0.009
Token and
Token ID290
Feature activation+5.36e-02

Pos logit contributions
 three+0.007
 Will+0.007
 crew+0.006
 louder+0.006
 pap+0.006
Neg logit contributions
aren-0.008
he-0.007
awed-0.007
 benches-0.007
ue-0.007
Token a
Token ID257
Feature activation+3.16e-02

Pos logit contributions
 Will+0.001
 crew+0.001
 pap+0.001
 Chick+0.001
 Amen+0.001
Neg logit contributions
pit-0.002
 benches-0.002
he-0.002
case-0.002
 bench-0.002
Token circle
Token ID9197
Feature activation-5.21e-01

Pos logit contributions
 Will+0.001
 crew+0.001
 pap+0.001
 three+0.001
 Chick+0.001
Neg logit contributions
he-0.001
pit-0.001
 benches-0.001
aren-0.001
case-0.001
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
aren+0.016
 benches+0.015
he+0.014
awed+0.014
ue+0.014
Neg logit contributions
 Will-0.014
 three-0.014
ch-0.014
 louder-0.014
 crew-0.013
Token [newline]
Token ID198
Feature activation-1.70e-01

Pos logit contributions
he+0.006
pit+0.006
 benches+0.006
case+0.006
aren+0.006
Neg logit contributions
 Will-0.008
 crew-0.008
 pap-0.007
free-0.007
 Chick-0.007
Token [newline]
Token ID198
Feature activation-3.40e-01

Pos logit contributions
he+0.004
case+0.004
aren+0.004
 benches+0.004
ue+0.004
Neg logit contributions
 crew-0.006
 pap-0.005
 Will-0.005
ksh-0.005
 louder-0.005
TokenOne
Token ID3198
Feature activation-3.75e-01

Pos logit contributions
he+0.011
pit+0.010
case+0.010
 benches+0.010
aren+0.010
Neg logit contributions
 Will-0.009
 crew-0.009
 pap-0.008
 Chick-0.008
 Ch-0.008
Token day
Token ID1110
Feature activation+1.12e+00

Pos logit contributions
he+0.012
case+0.011
pit+0.011
 bench+0.011
 benches+0.010
Neg logit contributions
 Will-0.010
 crew-0.009
ksh-0.009
 Chick-0.009
 Amen-0.009
Token She
Token ID1375
Feature activation-3.09e-01

Pos logit contributions
 Will+0.004
 Yoga+0.004
V+0.004
 three+0.004
 Ch+0.004
Neg logit contributions
pit-0.006
castle-0.005
he-0.005
 benches-0.005
ement-0.005
Token makes
Token ID1838
Feature activation+1.43e+00

Pos logit contributions
he+0.010
 benches+0.010
pit+0.010
aren+0.010
case+0.010
Neg logit contributions
 Will-0.008
 pap-0.007
 crew-0.007
 louder-0.006
 three-0.006
Token a
Token ID257
Feature activation+1.42e-01

Pos logit contributions
 crew+0.037
 Will+0.036
 pap+0.034
 Amen+0.034
ye+0.034
Neg logit contributions
he-0.044
 benches-0.042
pit-0.041
case-0.040
 bench-0.040
Token sad
Token ID6507
Feature activation+5.05e-01

Pos logit contributions
 Will+0.004
 crew+0.004
 pap+0.003
 Chick+0.003
free+0.003
Neg logit contributions
he-0.004
pit-0.004
 benches-0.004
case-0.004
 bench-0.004
Token face
Token ID1986
Feature activation+6.70e-01

Pos logit contributions
 Will+0.014
 crew+0.013
 pap+0.013
 louder+0.013
 three+0.012
Neg logit contributions
he-0.015
pit-0.015
 benches-0.015
case-0.014
aren-0.014
Token at
Token ID379
Feature activation-2.63e+00

Pos logit contributions
 Will+0.024
 crew+0.023
 pap+0.022
 Chick+0.022
 three+0.022
Neg logit contributions
he-0.014
pit-0.014
 benches-0.013
aren-0.013
case-0.013
Token Tom
Token ID4186
Feature activation-8.80e-01

Pos logit contributions
case+0.083
he+0.080
pit+0.080
aren+0.077
awed+0.077
Neg logit contributions
 Will-0.069
 three-0.062
 louder-0.060
 younger-0.060
 pap-0.058
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
aren+0.025
pit+0.024
 benches+0.024
he+0.022
case+0.022
Neg logit contributions
 Will-0.027
 louder-0.024
free-0.024
 three-0.024
 younger-0.023
Token [newline]
Token ID198
Feature activation-1.33e-01

Pos logit contributions
pit+0.011
castle+0.010
 benches+0.010
he+0.010
case+0.009
Neg logit contributions
 Will-0.007
 Yoga-0.006
V-0.006
 Big-0.006
 Sp-0.006
Token [newline]
Token ID198
Feature activation-1.79e-01

Pos logit contributions
he+0.004
pit+0.004
 benches+0.004
case+0.004
aren+0.004
Neg logit contributions
 Will-0.004
 crew-0.004
 pap-0.003
 Chick-0.003
 three-0.003
Token"
Token ID1
Feature activation-9.39e-01

Pos logit contributions
pit+0.006
ement+0.006
 benches+0.005
case+0.005
he+0.005
Neg logit contributions
V-0.004
 louder-0.004
ch-0.004
 Will-0.004
ye-0.004
Token books
Token ID3835
Feature activation-4.19e+00

Pos logit contributions
aren+0.037
pit+0.037
he+0.036
awed+0.036
+0.035
Neg logit contributions
 Will-0.032
 three-0.031
 3-0.028
 younger-0.028
 Sp-0.028
Token on
Token ID319
Feature activation-2.72e+00

Pos logit contributions
aren+0.132
 benches+0.127
ue+0.125
handed+0.125
awed+0.124
Neg logit contributions
 three-0.093
 working-0.083
 for-0.083
free-0.080
 Will-0.079
Token the
Token ID262
Feature activation-1.24e+00

Pos logit contributions
pit+0.113
awed+0.113
aren+0.112
he+0.109
ement+0.107
Neg logit contributions
 Will-0.048
 three-0.040
 Sp-0.038
 Ch-0.037
 Chick-0.037
Token books
Token ID3835
Feature activation-3.87e+00

Pos logit contributions
aren+0.034
awed+0.034
pit+0.033
ement+0.033
he+0.031
Neg logit contributions
 Will-0.036
 three-0.034
 younger-0.032
V-0.032
 crew-0.032
Tokenhe
Token ID258
Feature activation-3.24e+00

Pos logit contributions
aren+0.075
awed+0.075
 purse+0.071
 benches+0.071
ears+0.069
Neg logit contributions
 Will-0.127
 three-0.126
 weekend-0.116
 a-0.113
ch-0.112
Tokenlf
Token ID1652
Feature activation-2.87e+00

Pos logit contributions
aren+0.103
ite+0.092
aw+0.087
awed+0.087
ement+0.085
Neg logit contributions
 Will-0.090
 three-0.075
 Sp-0.074
 films-0.072
 crew-0.072
Token.
Token ID13
Feature activation-6.83e-01

Pos logit contributions
aren+0.092
 benches+0.085
ue+0.085
awed+0.079
 ey+0.078
Neg logit contributions
 three-0.068
 Will-0.064
 crew-0.063
 3-0.058
 Ch-0.057
Token He
Token ID679
Feature activation-9.35e-01

Pos logit contributions
pit+0.021
he+0.020
ement+0.020
aren+0.019
 benches+0.019
Neg logit contributions
 Will-0.019
 Yoga-0.017
V-0.017
 three-0.017
 Ch-0.016
Token wanted
Token ID2227
Feature activation-9.70e-01

Pos logit contributions
he+0.028
pit+0.028
 benches+0.027
case+0.026
aren+0.026
Neg logit contributions
 Will-0.026
 crew-0.024
 pap-0.023
 Chick-0.022
 three-0.022
Token to
Token ID284
Feature activation-1.27e+00

Pos logit contributions
he+0.039
pit+0.038
case+0.037
aren+0.037
 benches+0.036
Neg logit contributions
 Will-0.018
 three-0.015
 crew-0.015
 pap-0.014
 Ch-0.014
Token find
Token ID1064
Feature activation-5.68e-01

Pos logit contributions
awed+0.051
ement+0.050
pit+0.049
aren+0.047
aw+0.045
Neg logit contributions
 Will-0.027
 three-0.026
 help-0.023
 3-0.021
 younger-0.021
Token guilty
Token ID6717
Feature activation-5.59e-01

Pos logit contributions
he+0.020
case+0.019
 benches+0.018
pit+0.017
aren+0.017
Neg logit contributions
 Will-0.015
 crew-0.014
 pap-0.013
ye-0.013
 Chick-0.013
Token for
Token ID329
Feature activation-1.24e+00

Pos logit contributions
pit+0.016
aren+0.016
he+0.016
 benches+0.016
case+0.015
Neg logit contributions
 Will-0.017
 louder-0.015
 three-0.015
 crew-0.015
 pap-0.015
Token not
Token ID407
Feature activation-8.89e-03

Pos logit contributions
 benches+0.036
he+0.036
pit+0.035
aren+0.033
case+0.033
Neg logit contributions
 Will-0.032
 crew-0.031
ch-0.030
 pap-0.029
ye-0.029
Token doing
Token ID1804
Feature activation-5.37e-01

Pos logit contributions
he+0.000
pit+0.000
 benches+0.000
case+0.000
aren+0.000
Neg logit contributions
 Will-0.000
 crew-0.000
 pap-0.000
 Chick-0.000
ch-0.000
Token what
Token ID644
Feature activation-1.20e+00

Pos logit contributions
pit+0.020
he+0.019
aren+0.019
case+0.019
 benches+0.019
Neg logit contributions
 Will-0.012
 louder-0.010
 three-0.010
 pap-0.010
 Yoga-0.010
Token her
Token ID607
Feature activation-2.68e+00

Pos logit contributions
pit+0.033
he+0.033
 benches+0.032
aren+0.031
case+0.030
Neg logit contributions
 Will-0.037
 pap-0.033
 crew-0.033
 three-0.032
 Chick-0.032
Token mom
Token ID1995
Feature activation-1.37e+00

Pos logit contributions
he+0.105
 benches+0.103
aren+0.103
pit+0.102
awed+0.099
Neg logit contributions
 Will-0.053
 pap-0.045
 three-0.044
 crew-0.043
 younger-0.042
Tokenmy
Token ID1820
Feature activation-8.75e-01

Pos logit contributions
 benches+0.049
pit+0.046
case+0.044
he+0.044
castle+0.043
Neg logit contributions
 Will-0.032
 three-0.026
 younger-0.026
 pap-0.026
 Ch-0.026
Token said
Token ID531
Feature activation-1.23e+00

Pos logit contributions
 benches+0.034
pit+0.033
he+0.030
case+0.030
aren+0.030
Neg logit contributions
 Will-0.019
 pap-0.015
 three-0.014
 sure-0.014
free-0.014
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
pit+0.033
aren+0.033
 benches+0.032
ue+0.031
he+0.031
Neg logit contributions
 Will-0.037
free-0.035
 three-0.034
 louder-0.033
 crew-0.033
Token She
Token ID1375
Feature activation-6.48e-01

Pos logit contributions
he+0.006
pit+0.006
case+0.006
 benches+0.006
aren+0.006
Neg logit contributions
 Will-0.018
 crew-0.018
 pap-0.017
ch-0.017
ye-0.016
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
 Will+0.020
 three+0.019
 louder+0.019
 Chick+0.017
 Gr+0.017
Neg logit contributions
aren-0.025
he-0.023
ue-0.023
icycle-0.022
case-0.022
Token didn
Token ID1422
Feature activation-8.18e-01

Pos logit contributions
pit+0.041
he+0.041
 benches+0.040
aren+0.039
case+0.039
Neg logit contributions
 Will-0.032
 louder-0.028
 pap-0.028
 crew-0.027
 three-0.027
Token't
Token ID470
Feature activation-1.04e+00

Pos logit contributions
he+0.024
pit+0.024
 benches+0.023
case+0.023
aren+0.023
Neg logit contributions
 Will-0.022
 crew-0.021
 pap-0.020
 Chick-0.020
ch-0.019
Token know
Token ID760
Feature activation-7.21e-03

Pos logit contributions
pit+0.041
he+0.041
 benches+0.041
aren+0.040
case+0.040
Neg logit contributions
 Will-0.018
 crew-0.016
 pap-0.015
 three-0.015
free-0.015
Token what
Token ID644
Feature activation-9.21e-01

Pos logit contributions
he+0.000
pit+0.000
case+0.000
aren+0.000
 benches+0.000
Neg logit contributions
 Will-0.000
 crew-0.000
 pap-0.000
 three-0.000
 louder-0.000
Token to
Token ID284
Feature activation-3.46e+00

Pos logit contributions
case+0.027
 benches+0.027
he+0.027
aren+0.026
pit+0.026
Neg logit contributions
 crew-0.023
 Will-0.023
 Amen-0.022
 pap-0.022
ksh-0.022
Token do
Token ID466
Feature activation-1.81e+00

Pos logit contributions
pit+0.139
 benches+0.136
aren+0.135
awed+0.131
he+0.130
Neg logit contributions
 Will-0.059
 crew-0.048
 three-0.048
 pap-0.046
ch-0.046
Token.
Token ID13
Feature activation-9.02e-01

Pos logit contributions
aren+0.063
pit+0.061
 benches+0.061
 sandbox+0.060
awed+0.059
Neg logit contributions
 Will-0.043
free-0.040
 three-0.039
 important-0.037
 Sp-0.036
Token [space]
Token ID220
Feature activation+6.04e-01

Pos logit contributions
pit+0.030
he+0.030
 benches+0.028
castle+0.028
awed+0.027
Neg logit contributions
 Will-0.023
 three-0.020
 Sp-0.020
 Yoga-0.020
 Ch-0.019
Token [newline]
Token ID198
Feature activation-5.98e-01

Pos logit contributions
 Will+0.015
 crew+0.013
 Chick+0.013
 pap+0.013
 Ch+0.013
Neg logit contributions
pit-0.019
 benches-0.019
he-0.019
aren-0.018
case-0.018
Token [newline]
Token ID198
Feature activation-4.82e-01

Pos logit contributions
pit+0.019
he+0.018
 benches+0.018
case+0.017
aren+0.017
Neg logit contributions
 Will-0.016
 crew-0.014
 pap-0.013
 Chick-0.013
 three-0.013
Token the
Token ID262
Feature activation-3.17e-01

Pos logit contributions
pit+0.055
awed+0.051
aren+0.051
ue+0.051
ears+0.050
Neg logit contributions
 Will-0.028
 three-0.026
 louder-0.024
 Ch-0.023
 3-0.022
Token boat
Token ID8848
Feature activation+9.48e-01

Pos logit contributions
 benches+0.009
he+0.009
pit+0.008
case+0.008
 bench+0.008
Neg logit contributions
 Will-0.010
 Chick-0.009
 crew-0.009
 pap-0.009
free-0.008
Token and
Token ID290
Feature activation+1.64e-01

Pos logit contributions
 crew+0.030
 Will+0.029
 three+0.029
 louder+0.029
ch+0.027
Neg logit contributions
aren-0.023
awed-0.023
ears-0.022
pit-0.022
ries-0.022
Token was
Token ID373
Feature activation-2.39e-01

Pos logit contributions
 Will+0.005
 crew+0.004
 pap+0.004
 three+0.004
 louder+0.004
Neg logit contributions
he-0.005
pit-0.005
case-0.005
aren-0.005
ement-0.005
Token amazed
Token ID24562
Feature activation-4.71e-01

Pos logit contributions
he+0.008
pit+0.008
 benches+0.008
aren+0.007
ue+0.007
Neg logit contributions
 Will-0.006
 crew-0.005
 pap-0.005
 three-0.005
 louder-0.005
Token at
Token ID379
Feature activation-3.02e+00

Pos logit contributions
case+0.015
pit+0.015
 benches+0.015
he+0.014
aren+0.014
Neg logit contributions
 Will-0.013
 louder-0.012
 three-0.012
 3-0.011
free-0.011
Token all
Token ID477
Feature activation-1.30e+00

Pos logit contributions
he+0.100
pit+0.099
case+0.096
 benches+0.096
aren+0.095
Neg logit contributions
 Will-0.073
 crew-0.066
 pap-0.065
 three-0.063
 Chick-0.062
Token the
Token ID262
Feature activation-6.06e-01

Pos logit contributions
pit+0.056
awed+0.055
+0.055
case+0.053
icycle+0.052
Neg logit contributions
 Will-0.018
 three-0.016
 Ch-0.015
 pap-0.015
 younger-0.015
Token new
Token ID649
Feature activation-2.52e-01

Pos logit contributions
he+0.020
 benches+0.020
pit+0.020
case+0.019
aren+0.019
Neg logit contributions
 Will-0.014
 crew-0.013
 pap-0.013
 Chick-0.013
ch-0.012
Token sights
Token ID21343
Feature activation-8.85e-01

Pos logit contributions
he+0.008
pit+0.008
case+0.008
aren+0.008
ue+0.008
Neg logit contributions
 crew-0.006
 Will-0.006
 pap-0.006
 three-0.005
 louder-0.005
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
pit+0.030
icycle+0.030
ue+0.028
aren+0.028
 sandbox+0.027
Neg logit contributions
 Will-0.021
 louder-0.020
free-0.020
ch-0.019
 three-0.019