TOP ACTIVATIONS
MAX = 0.105

. He kept throwing a tant rum . His mom was
surprise . He threw a tant rum . He kicked and
placed it onto the wheel bar row . John
uffy had made the wheel bar row much lighter . The
to eat something scr um pt ious .
crown ." The king we pt even more and said ,
and started to throw a tant rum . Suddenly , she
kept crying and throwing a tant rum . Finally
sad . She threw a tant rum and cried loudly .
stubborn . She throws a tant rum . She kicks and
his feet and threw a tant rum . "
her feet and threw a tant rum . She grabbed the
listen . She threw a tant rum in the store .
He unh ooks Anna from the zip line and carries
. They all covered each other in furry hugs , and
the cat off the wheel bar row and put him back
of soil into the wheel bar row . It was heavy
He jumped onto the wheel bar row and started rolling around
The two players covered each other in the game . One
much lighter . The wheel bar row was now much easier

MIN ACTIVATIONS
MIN = -0.122

They were not grateful for Farmer Joe . He was mean
And he was grateful for Farmer Joe 's help . <|endoftext|>
. They were grateful for Farmer Sue . She was nice
. S asha looked out of her window
The moral of Leah 's story is that it
big . S asha was about to touch the
was sad , because her pap a never had time to
. He didn 't want Farmer Dan to be mad .
bird began to tell Pen elope it 's story .
on , she still pret ends to be monsters but she
park to play . So Sasha asked her dad if she
my said goodbye and introduced Rachel to the nice lady at
there ." S asha smiled and rushed off to
very compassionate and kindly convinced Lex to keep going so they
felt so sorry and promised Ashley that she would get the
were the best little mag icians in the whole town .
cooking . S andra 's mom saw her and
care of it , Ab bi ?" Ab bi
. He was so eng ross ed that he didn 't
to your friends , remember Heather 's story . Listen to

INTERVAL 0.048 - 0.105
CONTAINS 0.885%

came out and sh ined brightly . Susan smiled and knew
learn your lesson ?" she asks . " Yes
shouts . " Wow , Sara . You are
" Yes , it was ," agreed Daisy . And they
bed . " Hello ," said Emily . "

INTERVAL -0.009 - 0.048
CONTAINS 68.951%

" It is moving !" " Wow , Tom
She used a small knife to cut it into pieces .
water or shampoo . She did not like to be disturbed
the tree , he found a small box . He opened
he lf . Mom my was so scared that the books

INTERVAL -0.066 - -0.009
CONTAINS 30.040%

! " Oh wow ," Sarah exclaimed . She wanted to
it more and more each time . Soon , Beat rice
its wing and couldn 't fly . Nut ty felt sad
wanted someone who was kind and helpful . When she found
't because the spider couldn 't talk . Then

INTERVAL -0.122 - -0.066
CONTAINS 0.123%

and they continued walking . Along the way they found a
up the weapon . But as soon as he touched it
's okay , as long as they can help me on
world . One day , as she was walking with her
led away until the next spring . They never forgot about
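
NOTE ON THE INTERVALS
The four INTERVAL blocks above look like equal-width bins spanning MIN to MAX, with CONTAINS giving the share of token activations that fall in each bin (the four shares sum to roughly 100%). The sketch below illustrates that reading and is not the dashboard's actual code. The function names `interval_edges` and `interval_shares` are hypothetical, and the sample activations are a handful of values copied from the token listing, not the full dataset. Because MIN and MAX are displayed rounded, the computed edges may differ from the printed ones in the last digit.

```python
from typing import List, Sequence, Tuple


def interval_edges(lo: float, hi: float, n_bins: int = 4) -> List[float]:
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning scheme)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


def interval_shares(
    acts: Sequence[float], lo: float, hi: float, n_bins: int = 4
) -> List[Tuple[float, float, float]]:
    """Return (lower edge, upper edge, percent of activations) for each bin."""
    edges = interval_edges(lo, hi, n_bins)
    counts = [0] * n_bins
    for a in acts:
        # Map the activation to a bin index; clamp so that a == hi lands in the last bin.
        idx = int((a - lo) / (hi - lo) * n_bins)
        idx = max(0, min(idx, n_bins - 1))
        counts[idx] += 1
    total = len(acts)
    return [
        (edges[i], edges[i + 1], 100.0 * counts[i] / total) for i in range(n_bins)
    ]


# MIN and MAX as displayed above (rounded values).
edges = interval_edges(-0.122, 0.105)

# A few activations copied from the token listing, for illustration only.
sample = [-0.0144, 0.0115, 0.0192, 0.0121, 0.0112, 0.105, 0.0357, -0.00388]
shares = interval_shares(sample, -0.122, 0.105)

for lower, upper, pct in shares:
    print(f"INTERVAL {lower:.3f} - {upper:.3f}  CONTAINS {pct:.3f}%")
```

With the full set of activations in place of `sample`, the printed shares would be directly comparable to the CONTAINS figures above, provided the equal-width assumption holds.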
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
asured+0.047
king+0.046
 awaiting+0.044
 tradition+0.042
illa+0.041
Neg logit contributions
hell-0.046
 are-0.045
 conclusions-0.043
 vest-0.042
 cry-0.041
Token He
Token ID679
Feature activation+1.15e-02

Pos logit contributions
 knowing+0.032
 to+0.031
king+0.030
 than+0.030
 when+0.030
Neg logit contributions
Found-0.017
 ah-0.017
Raw-0.016
 spoiler-0.016
 conclusions-0.015
Token kept
Token ID4030
Feature activation+1.92e-02

Pos logit contributions
Found+0.018
Raw+0.017
 ah+0.017
Got+0.016
 spoiler+0.016
Neg logit contributions
 knowing-0.021
king-0.020
 to-0.020
 than-0.019
 for-0.019
Token throwing
Token ID9644
Feature activation+1.21e-02

Pos logit contributions
 ah+0.036
Found+0.034
 spoiler+0.034
Raw+0.033
 ey+0.032
Neg logit contributions
ess-0.026
king-0.026
 than-0.025
 since-0.025
 knowing-0.024
Token a
Token ID257
Feature activation+1.12e-02

Pos logit contributions
Raw+0.027
 ah+0.026
Found+0.026
ickets+0.025
Boo+0.025
Neg logit contributions
 to-0.017
 for-0.016
 when-0.016
 than-0.016
 as-0.016
Token tant
Token ID24246
Feature activation+1.05e-01

Pos logit contributions
 ah+0.019
bridge+0.018
Raw+0.018
Found+0.018
 spoiler+0.017
Neg logit contributions
 to-0.020
 for-0.019
 than-0.018
 when-0.018
 knowing-0.017
Tokenrum
Token ID6582
Feature activation+3.57e-02

Pos logit contributions
Found+0.144
Raw+0.137
 ah+0.137
 spoiler+0.133
Got+0.129
Neg logit contributions
 knowing-0.210
king-0.203
 to-0.200
 than-0.199
 for-0.196
Token.
Token ID13
Feature activation-3.88e-03

Pos logit contributions
 spoiler+0.073
Found+0.073
 ah+0.073
bridge+0.071
 are+0.070
Neg logit contributions
king-0.039
 awaiting-0.038
 knowing-0.037
asured-0.035
app-0.034
Token His
Token ID2399
Feature activation-4.82e-03

Pos logit contributions
 knowing+0.008
 to+0.008
 when+0.008
 than+0.008
king+0.008
Neg logit contributions
Found-0.005
 ah-0.005
Raw-0.005
 spoiler-0.005
 conclusions-0.005
Token mom
Token ID1995
Feature activation-1.58e-02

Pos logit contributions
 to+0.006
 knowing+0.005
 for+0.005
 when+0.005
 than+0.005
Neg logit contributions
Found-0.011
Raw-0.011
 ah-0.011
hing-0.010
 conclusions-0.010
Token was
Token ID373
Feature activation+1.71e-02

Pos logit contributions
 knowing+0.029
king+0.028
 than+0.028
 awaiting+0.027
asured+0.026
Neg logit contributions
Found-0.024
 ah-0.023
Got-0.022
Raw-0.022
 spoiler-0.021
Token surprise
Token ID5975
Feature activation-2.02e-02

Pos logit contributions
 knowing+0.028
 to+0.028
 for+0.028
 than+0.027
 when+0.026
Neg logit contributions
Found-0.025
 ah-0.024
Raw-0.024
 spoiler-0.023
hing-0.023
Token.
Token ID13
Feature activation-6.96e-03

Pos logit contributions
king+0.022
 knowing+0.021
asured+0.019
 awaiting+0.018
app+0.018
Neg logit contributions
 ah-0.044
Found-0.044
 spoiler-0.043
Raw-0.042
 tube-0.042
Token He
Token ID679
Feature activation+1.70e-02

Pos logit contributions
 to+0.013
 knowing+0.013
 when+0.013
 where+0.012
 than+0.012
Neg logit contributions
 ah-0.011
Found-0.011
hing-0.010
 conclusions-0.010
Raw-0.010
Token threw
Token ID9617
Feature activation+1.57e-02

Pos logit contributions
Found+0.028
Raw+0.027
 ah+0.027
 spoiler+0.026
Got+0.026
Neg logit contributions
 knowing-0.029
king-0.028
 to-0.027
 than-0.027
 for-0.027
Token a
Token ID257
Feature activation+4.71e-03

Pos logit contributions
Raw+0.037
 ah+0.037
Found+0.037
 ey+0.035
hing+0.035
Neg logit contributions
 to-0.019
 for-0.019
 when-0.017
 as-0.017
 where-0.016
Token tant
Token ID24246
Feature activation+1.03e-01

Pos logit contributions
Found+0.008
 ah+0.007
 conclusions+0.007
Raw+0.007
hing+0.007
Neg logit contributions
 to-0.009
 for-0.008
 than-0.007
 where-0.007
 when-0.007
Tokenrum
Token ID6582
Feature activation+3.71e-02

Pos logit contributions
Found+0.145
Raw+0.139
 ah+0.138
 spoiler+0.134
Got+0.131
Neg logit contributions
 knowing-0.204
king-0.196
 than-0.195
 to-0.194
 for-0.190
Token.
Token ID13
Feature activation-1.83e-03

Pos logit contributions
Found+0.081
 ah+0.080
 spoiler+0.080
Raw+0.078
bridge+0.077
Neg logit contributions
king-0.037
 knowing-0.037
 awaiting-0.035
asured-0.034
app-0.032
Token He
Token ID679
Feature activation+2.20e-02

Pos logit contributions
 to+0.003
 knowing+0.003
 when+0.003
 where+0.003
 before+0.003
Neg logit contributions
Found-0.003
 ah-0.003
Raw-0.003
 conclusions-0.003
hing-0.003
Token kicked
Token ID12165
Feature activation+1.58e-02

Pos logit contributions
Found+0.036
 ah+0.035
Raw+0.035
 spoiler+0.033
Got+0.033
Neg logit contributions
 knowing-0.038
king-0.037
 to-0.036
 than-0.036
 for-0.035
Token and
Token ID290
Feature activation+1.42e-02

Pos logit contributions
Found+0.040
 ah+0.040
Raw+0.039
 ey+0.038
bridge+0.038
Neg logit contributions
 to-0.016
 for-0.015
 when-0.015
 before-0.015
 than-0.014
Token placed
Token ID4624
Feature activation-3.80e-03

Pos logit contributions
Raw+0.037
Found+0.037
 spoiler+0.037
 conclusions+0.036
 ah+0.036
Neg logit contributions
 to-0.024
 knowing-0.024
 when-0.024
 for-0.023
 before-0.023
Token it
Token ID340
Feature activation+2.75e-02

Pos logit contributions
 to+0.005
 knowing+0.005
 for+0.005
 when+0.005
 before+0.005
Neg logit contributions
Found-0.008
Raw-0.008
 ah-0.007
 spoiler-0.007
 conclusions-0.007
Token onto
Token ID4291
Feature activation+1.69e-02

Pos logit contributions
Found+0.061
 ah+0.059
Raw+0.058
 spoiler+0.057
bridge+0.056
Neg logit contributions
 knowing-0.031
king-0.029
 than-0.027
 awaiting-0.025
asured-0.025
Token the
Token ID262
Feature activation+9.92e-03

Pos logit contributions
Found+0.032
 ah+0.031
Raw+0.031
 spoiler+0.031
 conclusions+0.030
Neg logit contributions
 knowing-0.024
king-0.024
 than-0.022
asured-0.022
 to-0.021
Token wheel
Token ID7825
Feature activation+1.04e-02

Pos logit contributions
Found+0.015
Raw+0.014
 ah+0.014
 spoiler+0.014
Got+0.013
Neg logit contributions
 knowing-0.019
 to-0.019
 for-0.018
 when-0.018
 before-0.018
Tokenbar
Token ID5657
Feature activation+9.97e-02

Pos logit contributions
 spoiler+0.020
bridge+0.020
 ah+0.020
 tube+0.020
hing+0.019
Neg logit contributions
 knowing-0.013
king-0.012
asured-0.012
 awaiting-0.012
 than-0.011
Tokenrow
Token ID808
Feature activation+3.19e-02

Pos logit contributions
Found+0.157
Raw+0.152
 ah+0.152
 spoiler+0.146
Got+0.143
Neg logit contributions
 knowing-0.179
king-0.172
 to-0.172
 for-0.169
 than-0.168
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
bridge+0.065
 ah+0.064
 tube+0.063
 spoiler+0.062
Found+0.062
Neg logit contributions
 knowing-0.035
king-0.034
asured-0.034
 awaiting-0.031
ind-0.031
Token
Token ID198
Feature activation+8.43e-03

Pos logit contributions
Found+0.017
 ah+0.017
Raw+0.017
 spoiler+0.016
 conclusions+0.016
Neg logit contributions
 knowing-0.021
king-0.020
 to-0.020
 than-0.019
 for-0.019
Token
Token ID198
Feature activation+2.67e-03

Pos logit contributions
 ah+0.017
Found+0.017
 spoiler+0.017
Raw+0.016
 ey+0.016
Neg logit contributions
 to-0.011
 knowing-0.011
 when-0.011
 before-0.010
 for-0.010
TokenJohn
Token ID7554
Feature activation-1.32e-02

Pos logit contributions
Found+0.005
 ah+0.005
Raw+0.005
 spoiler+0.005
 ey+0.004
Neg logit contributions
 knowing-0.004
king-0.004
 to-0.004
 when-0.004
 than-0.004
Tokenuffy
Token ID15352
Feature activation-4.94e-02

Pos logit contributions
 knowing+0.043
 to+0.039
king+0.038
 before+0.038
 when+0.038
Neg logit contributions
Found-0.043
Raw-0.043
 ah-0.043
 spoiler-0.042
 tube-0.041
Token had
Token ID550
Feature activation-9.71e-03

Pos logit contributions
 knowing+0.084
king+0.081
 to+0.078
 than+0.077
 for+0.077
Neg logit contributions
Found-0.082
 ah-0.079
Raw-0.079
 spoiler-0.076
Got-0.075
Token made
Token ID925
Feature activation-8.08e-03

Pos logit contributions
king+0.013
 knowing+0.013
 than+0.012
 awaiting+0.012
 for+0.012
Neg logit contributions
Found-0.019
Raw-0.018
 ah-0.018
Got-0.018
 spoiler-0.017
Token the
Token ID262
Feature activation+2.61e-02

Pos logit contributions
 knowing+0.011
king+0.011
 to+0.011
 for+0.010
 than+0.010
Neg logit contributions
Found-0.016
 ah-0.015
Raw-0.015
 spoiler-0.015
 conclusions-0.015
Token wheel
Token ID7825
Feature activation+1.80e-02

Pos logit contributions
Found+0.045
 ah+0.045
Raw+0.043
 spoiler+0.042
Got+0.040
Neg logit contributions
 knowing-0.045
 to-0.045
 for-0.043
 when-0.043
 than-0.042
Tokenbar
Token ID5657
Feature activation+9.93e-02

Pos logit contributions
Found+0.035
 ah+0.035
 spoiler+0.035
Raw+0.034
 tube+0.034
Neg logit contributions
 knowing-0.023
king-0.022
 than-0.021
 awaiting-0.021
asured-0.020
Tokenrow
Token ID808
Feature activation+3.20e-02

Pos logit contributions
Found+0.165
Raw+0.161
 ah+0.158
 spoiler+0.151
Got+0.150
Neg logit contributions
 knowing-0.174
 to-0.168
 for-0.168
king-0.166
 than-0.166
Token much
Token ID881
Feature activation+2.40e-02

Pos logit contributions
Found+0.069
 ah+0.067
Raw+0.066
 spoiler+0.066
 conclusions+0.064
Neg logit contributions
 knowing-0.037
king-0.036
 than-0.033
 to-0.033
 awaiting-0.032
Token lighter
Token ID14871
Feature activation+1.45e-04

Pos logit contributions
Found+0.039
 ah+0.038
Raw+0.038
 spoiler+0.037
 conclusions+0.036
Neg logit contributions
 knowing-0.041
king-0.040
 to-0.039
 than-0.039
 for-0.038
Token.
Token ID13
Feature activation-2.79e-03

Pos logit contributions
 ah+0.000
 are+0.000
 tube+0.000
 conclusions+0.000
Found+0.000
Neg logit contributions
king-0.000
 knowing-0.000
 awaiting-0.000
app-0.000
asured-0.000
Token The
Token ID383
Feature activation+6.14e-03

Pos logit contributions
 knowing+0.005
king+0.005
 to+0.005
 than+0.005
 for+0.005
Neg logit contributions
Found-0.004
 ah-0.004
Raw-0.004
 spoiler-0.004
 conclusions-0.004
Token to
Token ID284
Feature activation-1.60e-02

Pos logit contributions
Found+0.083
Raw+0.082
 ah+0.082
 conclusions+0.080
 spoiler+0.080
Neg logit contributions
king-0.008
 knowing-0.008
 than-0.005
 awaiting-0.004
asured-0.004
Token eat
Token ID4483
Feature activation+3.74e-03

Pos logit contributions
 to+0.026
 when+0.025
 knowing+0.024
 before+0.023
 for+0.023
Neg logit contributions
Found-0.031
 ah-0.029
 spoiler-0.029
Got-0.028
Raw-0.028
Token something
Token ID1223
Feature activation-3.60e-03

Pos logit contributions
Found+0.008
 spoiler+0.007
Raw+0.007
 ah+0.007
Got+0.007
Neg logit contributions
 for-0.006
 to-0.006
 when-0.006
 before-0.005
 knowing-0.005
Token scr
Token ID6040
Feature activation-7.07e-03

Pos logit contributions
king+0.006
 knowing+0.005
asured+0.005
 awaiting+0.005
app+0.005
Neg logit contributions
 ah-0.006
Found-0.006
Raw-0.006
 tube-0.006
 pear-0.006
Tokenum
Token ID388
Feature activation-1.38e-02

Pos logit contributions
 knowing+0.010
king+0.010
 to+0.010
 than+0.009
 for+0.009
Neg logit contributions
Found-0.013
 ah-0.013
Raw-0.013
 spoiler-0.013
 conclusions-0.012
Tokenpt
Token ID457
Feature activation+9.92e-02

Pos logit contributions
king+0.018
 knowing+0.017
 awaiting+0.016
asured+0.016
app+0.015
Neg logit contributions
Found-0.028
 ah-0.027
 are-0.027
Got-0.025
 tube-0.025
Tokenious
Token ID699
Feature activation+3.42e-02

Pos logit contributions
 ah+0.218
Found+0.214
Raw+0.213
bridge+0.211
 spoiler+0.211
Neg logit contributions
king-0.091
 knowing-0.085
asured-0.085
 awaiting-0.084
app-0.081
Token.
Token ID13
Feature activation+5.59e-03

Pos logit contributions
hell+0.067
Found+0.066
 ah+0.065
 seaw+0.064
 are+0.064
Neg logit contributions
king-0.047
asured-0.039
ind-0.038
 knowing-0.037
rou-0.035
Token
Token ID220
Feature activation+1.48e-02

Pos logit contributions
Found+0.009
 ah+0.009
Raw+0.008
 spoiler+0.008
 conclusions+0.008
Neg logit contributions
 knowing-0.010
 to-0.010
king-0.010
 than-0.009
 for-0.009
Token
Token ID198
Feature activation+6.65e-03

Pos logit contributions
Found+0.028
 ah+0.027
Raw+0.027
 spoiler+0.027
 conclusions+0.026
Neg logit contributions
 knowing-0.022
 to-0.021
king-0.021
 for-0.021
 when-0.021
Token
Token ID198
Feature activation+9.09e-05

Pos logit contributions
Found+0.011
Raw+0.011
 ah+0.011
 spoiler+0.011
Got+0.010
Neg logit contributions
 knowing-0.011
king-0.011
 to-0.010
 than-0.010
 for-0.010
Token crown
Token ID12389
Feature activation+1.74e-02

Pos logit contributions
Found+0.003
Raw+0.003
 ah+0.003
 spoiler+0.002
Got+0.002
Neg logit contributions
 knowing-0.006
 to-0.006
 for-0.006
 than-0.006
 when-0.006
Token."
Token ID526
Feature activation+1.46e-02

Pos logit contributions
 are+0.032
hell+0.031
bridge+0.030
 ah+0.029
 vest+0.029
Neg logit contributions
king-0.024
 knowing-0.023
asured-0.022
 awaiting-0.022
 introducing-0.021
Token The
Token ID383
Feature activation+1.88e-02

Pos logit contributions
Found+0.024
 ah+0.024
Raw+0.024
 spoiler+0.023
Got+0.022
Neg logit contributions
 knowing-0.025
king-0.024
 than-0.023
 for-0.022
asured-0.022
Token king
Token ID5822
Feature activation-1.14e-03

Pos logit contributions
Found+0.034
 ah+0.033
 spoiler+0.033
Raw+0.033
 conclusions+0.032
Neg logit contributions
 knowing-0.030
 to-0.028
 for-0.028
 when-0.028
 before-0.027
Token we
Token ID356
Feature activation+3.86e-02

Pos logit contributions
 knowing+0.002
king+0.002
 to+0.002
 than+0.002
 for+0.002
Neg logit contributions
Found-0.002
 ah-0.002
Raw-0.002
 spoiler-0.002
Got-0.002
Tokenpt
Token ID457
Feature activation+9.81e-02

Pos logit contributions
Found+0.051
Raw+0.050
 conclusions+0.049
 ey+0.048
Got+0.046
Neg logit contributions
 for-0.083
 to-0.082
 when-0.079
 as-0.078
 knowing-0.077
Token even
Token ID772
Feature activation-6.41e-03

Pos logit contributions
 ah+0.203
Found+0.194
 spoiler+0.189
Raw+0.189
bridge+0.185
Neg logit contributions
asured-0.105
king-0.105
 awaiting-0.100
 knowing-0.097
 than-0.096
Token more
Token ID517
Feature activation+2.27e-03

Pos logit contributions
king+0.010
 knowing+0.010
 than+0.009
 awaiting+0.009
 to+0.009
Neg logit contributions
Found-0.012
 ah-0.011
Raw-0.011
Got-0.011
 spoiler-0.011
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
 ah+0.005
Found+0.004
hell+0.004
bridge+0.004
 spoiler+0.004
Neg logit contributions
king-0.003
 awaiting-0.002
 knowing-0.002
app-0.002
asured-0.002
Token said
Token ID531
Feature activation-1.66e-02

Pos logit contributions
king+0.016
 knowing+0.015
 than+0.014
 to+0.014
 awaiting+0.014
Neg logit contributions
Found-0.020
 ah-0.020
Raw-0.019
 spoiler-0.018
Got-0.018
Token,
Token ID11
Feature activation+1.90e-03

Pos logit contributions
 knowing+0.023
 to+0.022
king+0.022
 for+0.021
 when+0.021
Neg logit contributions
Found-0.033
Raw-0.032
 ah-0.032
 spoiler-0.031
 conclusions-0.031
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
 are+0.073
 ah+0.071
Found+0.070
 vest+0.069
 tube+0.068
Neg logit contributions
 awaiting-0.065
king-0.063
asured-0.063
ind-0.062
app-0.062
Token started
Token ID2067
Feature activation-4.13e-02

Pos logit contributions
king+0.021
 than+0.021
 knowing+0.021
asured+0.020
 awaiting+0.019
Neg logit contributions
 ah-0.013
Raw-0.012
Found-0.012
Got-0.011
 are-0.011
Token to
Token ID284
Feature activation-1.45e-02

Pos logit contributions
king+0.038
 than+0.036
asured+0.033
ess+0.033
iring+0.031
Neg logit contributions
Raw-0.093
 cry-0.090
 ah-0.089
 mud-0.088
 scrap-0.087
Token throw
Token ID3714
Feature activation-4.03e-03

Pos logit contributions
 knowing+0.036
 to+0.036
king+0.035
 when+0.035
 for+0.035
Neg logit contributions
Found-0.013
 ah-0.012
Raw-0.012
 spoiler-0.011
Got-0.011
Token a
Token ID257
Feature activation+4.10e-03

Pos logit contributions
 to+0.005
 for+0.004
 when+0.004
 before+0.004
 than+0.004
Neg logit contributions
Found-0.009
 spoiler-0.009
 ah-0.009
Raw-0.009
 ey-0.009
Token tant
Token ID24246
Feature activation+9.74e-02

Pos logit contributions
Found+0.007
 spoiler+0.006
Raw+0.006
 ah+0.006
Got+0.006
Neg logit contributions
 to-0.008
 for-0.007
king-0.007
 knowing-0.007
 when-0.007
Tokenrum
Token ID6582
Feature activation+3.30e-02

Pos logit contributions
Found+0.131
 ah+0.126
Raw+0.125
 spoiler+0.121
Got+0.117
Neg logit contributions
 knowing-0.197
king-0.190
 to-0.188
 than-0.186
 for-0.184
Token.
Token ID13
Feature activation+1.10e-03

Pos logit contributions
 ah+0.058
Found+0.058
 spoiler+0.057
 are+0.057
bridge+0.056
Neg logit contributions
asured-0.042
king-0.042
 awaiting-0.041
app-0.039
 tradition-0.039
Token Suddenly
Token ID24975
Feature activation-1.92e-03

Pos logit contributions
Found+0.002
Raw+0.002
 ah+0.001
 spoiler+0.001
 ey+0.001
Neg logit contributions
king-0.002
 knowing-0.002
 than-0.002
asured-0.002
 awaiting-0.002
Token,
Token ID11
Feature activation+2.02e-02

Pos logit contributions
 to+0.002
 when+0.002
 for+0.002
 knowing+0.002
 before+0.002
Neg logit contributions
Found-0.004
Raw-0.004
 spoiler-0.004
 ah-0.004
 conclusions-0.004
Token she
Token ID673
Feature activation-4.42e-03

Pos logit contributions
Raw+0.045
Found+0.045
 spoiler+0.045
 ey+0.044
 ah+0.044
Neg logit contributions
 when-0.024
 to-0.023
 knowing-0.023
 for-0.022
 before-0.022
Token kept
Token ID4030
Feature activation+1.40e-02

Pos logit contributions
Found+0.010
Raw+0.010
 ah+0.010
 spoiler+0.010
Got+0.009
Neg logit contributions
 knowing-0.010
king-0.010
 than-0.009
 to-0.009
 for-0.009
Token crying
Token ID13774
Feature activation+7.20e-02

Pos logit contributions
 ah+0.028
Found+0.027
Raw+0.027
 spoiler+0.025
 ey+0.025
Neg logit contributions
king-0.018
 than-0.018
ess-0.017
 knowing-0.016
 awaiting-0.016
Token and
Token ID290
Feature activation+3.10e-03

Pos logit contributions
Found+0.158
 ah+0.154
 spoiler+0.153
 conclusions+0.152
Raw+0.151
Neg logit contributions
king-0.072
 knowing-0.067
 awaiting-0.067
asured-0.066
 to-0.064
Token throwing
Token ID9644
Feature activation+5.17e-03

Pos logit contributions
 ah+0.006
Found+0.006
Raw+0.006
Got+0.006
 spoiler+0.005
Neg logit contributions
king-0.004
 than-0.004
 knowing-0.004
 awaiting-0.004
app-0.004
Token a
Token ID257
Feature activation+3.26e-03

Pos logit contributions
Raw+0.012
Found+0.012
 ah+0.012
 spoiler+0.012
 ey+0.012
Neg logit contributions
 to-0.006
 when-0.006
 for-0.006
 as-0.006
 before-0.005
Token tant
Token ID24246
Feature activation+9.73e-02

Pos logit contributions
 ah+0.005
Found+0.005
 spoiler+0.005
Raw+0.005
bridge+0.004
Neg logit contributions
 to-0.007
 for-0.006
 when-0.006
 knowing-0.006
 where-0.006
Tokenrum
Token ID6582
Feature activation+3.01e-02

Pos logit contributions
Found+0.137
Raw+0.132
 ah+0.131
 spoiler+0.128
Got+0.124
Neg logit contributions
 knowing-0.192
king-0.184
 to-0.182
 than-0.182
 for-0.178
Token.
Token ID13
Feature activation+3.92e-03

Pos logit contributions
Found+0.060
 ah+0.058
 are+0.057
 spoiler+0.057
 tube+0.056
Neg logit contributions
king-0.035
asured-0.033
 awaiting-0.033
app-0.032
 knowing-0.031
Token
Token ID198
Feature activation+1.82e-03

Pos logit contributions
Found+0.007
 ah+0.007
Raw+0.007
 spoiler+0.007
 conclusions+0.006
Neg logit contributions
 knowing-0.006
 to-0.006
king-0.006
 when-0.006
 for-0.006
Token
Token ID198
Feature activation-2.74e-03

Pos logit contributions
Found+0.003
 ah+0.003
Raw+0.003
 spoiler+0.003
 conclusions+0.003
Neg logit contributions
 knowing-0.003
king-0.003
 to-0.003
 than-0.003
 for-0.003
TokenFinally
Token ID11158
Feature activation-1.07e-02

Pos logit contributions
 knowing+0.004
king+0.004
 to+0.004
 for+0.004
 than+0.004
Neg logit contributions
Found-0.005
Raw-0.005
 ah-0.005
Got-0.005
 spoiler-0.005
Token sad
Token ID6507
Feature activation+1.50e-02

Pos logit contributions
king+0.034
 knowing+0.033
 than+0.032
asured+0.031
 awaiting+0.030
Neg logit contributions
Found-0.026
 ah-0.025
Raw-0.025
Got-0.024
 spoiler-0.023
Token.
Token ID13
Feature activation-8.98e-03

Pos logit contributions
 are+0.028
 ah+0.027
Found+0.026
 tube+0.026
 conclusions+0.026
Neg logit contributions
king-0.020
asured-0.019
ind-0.018
app-0.018
 awaiting-0.017
Token She
Token ID1375
Feature activation-9.64e-04

Pos logit contributions
king+0.017
 knowing+0.017
 than+0.016
 for+0.016
 awaiting+0.016
Neg logit contributions
Found-0.013
Raw-0.013
 ah-0.012
 spoiler-0.012
Got-0.012
Token threw
Token ID9617
Feature activation+1.21e-02

Pos logit contributions
 knowing+0.002
king+0.002
 than+0.002
 to+0.001
 for+0.001
Neg logit contributions
Found-0.002
Raw-0.002
 ah-0.002
 spoiler-0.001
Got-0.001
Token a
Token ID257
Feature activation-3.09e-03

Pos logit contributions
Raw+0.031
 ah+0.030
Found+0.030
 spoiler+0.029
bridge+0.029
Neg logit contributions
 to-0.012
 for-0.012
 when-0.011
 as-0.010
 before-0.010
Token tant
Token ID24246
Feature activation+9.73e-02

Pos logit contributions
 to+0.006
 for+0.006
 when+0.005
 where+0.005
 knowing+0.005
Neg logit contributions
Raw-0.005
 ah-0.005
 conclusions-0.005
Found-0.005
bridge-0.004
Tokenrum
Token ID6582
Feature activation+3.72e-02

Pos logit contributions
Found+0.135
Raw+0.129
 ah+0.128
 spoiler+0.126
Got+0.121
Neg logit contributions
 knowing-0.194
king-0.186
 to-0.184
 than-0.184
 for-0.180
Token and
Token ID290
Feature activation+5.99e-04

Pos logit contributions
Found+0.077
 ah+0.075
 spoiler+0.074
Raw+0.072
 are+0.071
Neg logit contributions
king-0.040
asured-0.039
 awaiting-0.038
 knowing-0.037
app-0.037
Token cried
Token ID16896
Feature activation+3.97e-02

Pos logit contributions
Found+0.001
 ah+0.001
Raw+0.001
 spoiler+0.001
Got+0.001
Neg logit contributions
 knowing-0.001
king-0.001
 than-0.001
 to-0.001
 for-0.001
Token loudly
Token ID23112
Feature activation-5.08e-03

Pos logit contributions
Found+0.086
 ah+0.085
 spoiler+0.084
Raw+0.083
 conclusions+0.083
Neg logit contributions
asured-0.039
king-0.039
 knowing-0.038
 than-0.037
 awaiting-0.037
Token.
Token ID13
Feature activation-4.00e-03

Pos logit contributions
asured+0.006
king+0.005
ind+0.005
 tradition+0.005
 knowing+0.005
Neg logit contributions
 ah-0.010
Found-0.010
Raw-0.009
 spoiler-0.009
 are-0.009
Token stubborn
Token ID24444
Feature activation+1.65e-02

Pos logit contributions
Found+0.000
 ah+0.000
Raw+0.000
 pear+0.000
 are+0.000
Neg logit contributions
king-0.000
 since-0.000
ess-0.000
 tradition-0.000
 knowing-0.000
Token.
Token ID13
Feature activation-7.71e-03

Pos logit contributions
 are+0.031
Found+0.030
 ah+0.028
 spoiler+0.027
hell+0.027
Neg logit contributions
king-0.022
asured-0.022
 awaiting-0.021
app-0.021
ind-0.020
Token She
Token ID1375
Feature activation+1.59e-02

Pos logit contributions
 knowing+0.012
 to+0.012
 when+0.012
 than+0.012
 for+0.011
Neg logit contributions
Found-0.014
 ah-0.014
 spoiler-0.013
Raw-0.013
 conclusions-0.013
Token throws
Token ID12542
Feature activation+1.92e-02

Pos logit contributions
Found+0.026
 ah+0.026
 spoiler+0.025
Raw+0.025
 conclusions+0.024
Neg logit contributions
 knowing-0.026
 to-0.026
 for-0.026
 when-0.026
 than-0.025
Token a
Token ID257
Feature activation+3.60e-04

Pos logit contributions
Found+0.045
Raw+0.044
 ah+0.044
 spoiler+0.044
 ey+0.042
Neg logit contributions
 to-0.023
 for-0.021
 when-0.020
 before-0.019
 as-0.019
Token tant
Token ID24246
Feature activation+9.70e-02

Pos logit contributions
Found+0.000
 spoiler+0.000
 ah+0.000
Raw+0.000
bridge+0.000
Neg logit contributions
 to-0.001
 for-0.001
 than-0.001
 knowing-0.001
king-0.001
Tokenrum
Token ID6582
Feature activation+3.61e-02

Pos logit contributions
Found+0.134
Raw+0.128
 ah+0.127
 spoiler+0.125
Got+0.121
Neg logit contributions
 knowing-0.194
king-0.187
 than-0.185
 to-0.185
 for-0.181
Token.
Token ID13
Feature activation-5.47e-04

Pos logit contributions
Found+0.079
 ah+0.078
 spoiler+0.076
Raw+0.075
 are+0.075
Neg logit contributions
 knowing-0.037
king-0.037
 awaiting-0.035
asured-0.035
app-0.032
Token She
Token ID1375
Feature activation+1.98e-02

Pos logit contributions
 to+0.001
 knowing+0.001
 when+0.001
 than+0.001
 for+0.001
Neg logit contributions
Found-0.001
 ah-0.001
 spoiler-0.001
Raw-0.001
 conclusions-0.001
Token kicks
Token ID17364
Feature activation+2.53e-02

Pos logit contributions
Found+0.031
 ah+0.030
Raw+0.029
 spoiler+0.029
 conclusions+0.028
Neg logit contributions
 to-0.035
 for-0.034
 knowing-0.034
 when-0.034
 than-0.034
Token and
Token ID290
Feature activation+1.50e-02

Pos logit contributions
Found+0.063
Raw+0.062
 ah+0.062
 spoiler+0.061
bridge+0.060
Neg logit contributions
 to-0.026
 when-0.025
 for-0.025
 before-0.023
 than-0.023
Token his
Token ID465
Feature activation+2.15e-02

Pos logit contributions
Found+0.041
Raw+0.040
 ah+0.040
 spoiler+0.039
 ey+0.038
Neg logit contributions
 to-0.041
 knowing-0.041
 for-0.039
 when-0.039
king-0.039
Token feet
Token ID3625
Feature activation+3.29e-02

Pos logit contributions
Found+0.035
 ah+0.034
Raw+0.034
 spoiler+0.033
 conclusions+0.032
Neg logit contributions
 knowing-0.037
king-0.036
 to-0.035
 than-0.035
 for-0.034
Token and
Token ID290
Feature activation+2.04e-02

Pos logit contributions
 spoiler+0.058
 are+0.058
hell+0.057
hing+0.057
 tube+0.057
Neg logit contributions
asured-0.040
 knowing-0.040
 awaiting-0.039
king-0.038
cess-0.037
Token threw
Token ID9617
Feature activation+1.68e-02

Pos logit contributions
Found+0.031
 ah+0.031
Raw+0.030
 spoiler+0.029
Got+0.028
Neg logit contributions
 knowing-0.037
king-0.036
 than-0.035
 to-0.035
asured-0.034
Token a
Token ID257
Feature activation+6.88e-03

Pos logit contributions
Found+0.036
Raw+0.035
 ey+0.035
 ah+0.035
 conclusions+0.033
Neg logit contributions
 to-0.023
 for-0.022
 when-0.021
 as-0.021
 before-0.020
Token tant
Token ID24246
Feature activation+9.69e-02

Pos logit contributions
 ah+0.010
 conclusions+0.010
Found+0.010
Raw+0.010
 spoiler+0.010
Neg logit contributions
 to-0.013
 for-0.012
king-0.011
 than-0.011
 when-0.011
Tokenrum
Token ID6582
Feature activation+3.49e-02

Pos logit contributions
Found+0.127
 ah+0.122
Raw+0.121
 spoiler+0.117
Got+0.113
Neg logit contributions
 knowing-0.199
king-0.193
 to-0.190
 than-0.188
 for-0.186
Token.
Token ID13
Feature activation+6.03e-03

Pos logit contributions
 ah+0.077
Found+0.076
 spoiler+0.075
Raw+0.074
bridge+0.073
Neg logit contributions
 knowing-0.035
king-0.034
 awaiting-0.032
asured-0.032
app-0.030
Token
Token ID198
Feature activation+7.70e-03

Pos logit contributions
Found+0.011
 ah+0.011
 spoiler+0.011
 conclusions+0.011
Raw+0.011
Neg logit contributions
 to-0.009
 knowing-0.009
 when-0.009
 where-0.008
 for-0.008
Token
Token ID198
Feature activation+2.87e-03

Pos logit contributions
Found+0.016
 ah+0.016
 spoiler+0.015
Raw+0.015
 ey+0.015
Neg logit contributions
 to-0.010
 knowing-0.010
 when-0.010
 for-0.009
 where-0.009
Token"
Token ID1
Feature activation-1.69e-02

Pos logit contributions
 ah+0.005
Found+0.005
 spoiler+0.005
 ey+0.005
Raw+0.005
Neg logit contributions
 when-0.004
 to-0.004
 knowing-0.004
king-0.004
 than-0.004
Token her
Token ID607
Feature activation+1.24e-02

Pos logit contributions
Found+0.037
 ah+0.036
Raw+0.036
 spoiler+0.035
 conclusions+0.034
Neg logit contributions
 knowing-0.025
king-0.024
 than-0.022
 to-0.022
 for-0.022
Token feet
Token ID3625
Feature activation+2.94e-02

Pos logit contributions
 ah+0.015
 spoiler+0.015
Found+0.015
 tube+0.015
 ey+0.015
Neg logit contributions
 knowing-0.025
king-0.025
 awaiting-0.024
asured-0.023
app-0.023
Token and
Token ID290
Feature activation+1.17e-02

Pos logit contributions
 ah+0.055
Found+0.055
 are+0.055
 spoiler+0.054
 tube+0.054
Neg logit contributions
asured-0.034
 knowing-0.034
 awaiting-0.033
king-0.032
ind-0.030
Token threw
Token ID9617
Feature activation+1.25e-02

Pos logit contributions
Found+0.019
 ah+0.018
Raw+0.018
 spoiler+0.018
 conclusions+0.017
Neg logit contributions
 knowing-0.020
king-0.020
 to-0.019
 than-0.019
 for-0.019
Token a
Token ID257
Feature activation+1.85e-03

Pos logit contributions
Raw+0.029
Found+0.028
 ah+0.028
 ey+0.027
 spoiler+0.027
Neg logit contributions
 to-0.015
 for-0.014
 when-0.014
 as-0.014
 before-0.013
Token tant
Token ID24246
Feature activation+9.65e-02

Pos logit contributions
Raw+0.002
Found+0.002
 ah+0.002
 conclusions+0.002
bridge+0.002
Neg logit contributions
 to-0.004
 for-0.004
king-0.004
 than-0.004
 knowing-0.003
Tokenrum
Token ID6582
Feature activation+3.18e-02

Pos logit contributions
Found+0.131
Raw+0.125
 ah+0.125
 spoiler+0.121
Got+0.117
Neg logit contributions
 knowing-0.195
king-0.188
 to-0.186
 than-0.185
 for-0.182
Token.
Token ID13
Feature activation+5.40e-04

Pos logit contributions
 ah+0.067
Found+0.066
 spoiler+0.064
 are+0.063
bridge+0.062
Neg logit contributions
king-0.034
asured-0.033
 knowing-0.032
 awaiting-0.032
app-0.030
Token She
Token ID1375
Feature activation+1.23e-02

Pos logit contributions
Found+0.001
Raw+0.001
 ah+0.001
 spoiler+0.001
 ey+0.001
Neg logit contributions
 knowing-0.001
king-0.001
 than-0.001
 for-0.001
asured-0.001
Token grabbed
Token ID13176
Feature activation+1.20e-02

Pos logit contributions
Found+0.021
 ah+0.020
Raw+0.020
 spoiler+0.020
 conclusions+0.019
Neg logit contributions
 knowing-0.021
 to-0.020
king-0.020
 when-0.020
 than-0.019
Token the
Token ID262
Feature activation+1.80e-02

Pos logit contributions
Raw+0.028
Found+0.028
 spoiler+0.026
 ah+0.026
 ey+0.026
Neg logit contributions
 to-0.013
 for-0.013
 when-0.013
 knowing-0.012
 before-0.012
Token listen
Token ID6004
Feature activation-3.63e-02

Pos logit contributions
 knowing+0.060
 to+0.058
 for+0.057
 when+0.057
 than+0.057
Neg logit contributions
Found-0.066
 ah-0.065
Raw-0.064
 ey-0.063
 spoiler-0.062
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
asured+0.045
illa+0.042
king+0.041
 awaiting+0.041
ich+0.040
Neg logit contributions
hell-0.065
 are-0.065
l-0.058
Found-0.056
 cry-0.056
Token She
Token ID1375
Feature activation+5.43e-03

Pos logit contributions
 knowing+0.030
 to+0.029
king+0.029
 than+0.028
 when+0.028
Neg logit contributions
Found-0.028
 ah-0.028
Raw-0.027
 spoiler-0.027
 conclusions-0.026
Token threw
Token ID9617
Feature activation+1.39e-02

Pos logit contributions
Found+0.009
Raw+0.009
 ah+0.009
Got+0.008
 spoiler+0.008
Neg logit contributions
 knowing-0.009
king-0.009
 to-0.008
 than-0.008
 for-0.008
Token a
Token ID257
Feature activation-1.09e-03

Pos logit contributions
Raw+0.034
 ah+0.033
Found+0.033
 spoiler+0.032
 ey+0.032
Neg logit contributions
 to-0.016
 for-0.015
 when-0.014
 as-0.014
 before-0.014
Token tant
Token ID24246
Feature activation+9.63e-02

Pos logit contributions
 to+0.002
 for+0.002
 than+0.002
 when+0.002
king+0.002
Neg logit contributions
bridge-0.002
Found-0.002
 ah-0.002
Raw-0.001
 conclusions-0.001
Tokenrum
Token ID6582
Feature activation+3.57e-02

Pos logit contributions
Found+0.140
Raw+0.135
 ah+0.133
 spoiler+0.132
Got+0.128
Neg logit contributions
 knowing-0.185
king-0.178
 than-0.177
 to-0.176
 for-0.172
Token in
Token ID287
Feature activation+1.42e-02

Pos logit contributions
Found+0.073
 ah+0.071
 spoiler+0.070
 are+0.069
Raw+0.068
Neg logit contributions
king-0.040
 awaiting-0.039
asured-0.038
 knowing-0.037
app-0.036
Token the
Token ID262
Feature activation+2.45e-02

Pos logit contributions
Found+0.031
Raw+0.030
 ah+0.030
 spoiler+0.030
bridge+0.029
Neg logit contributions
 to-0.017
 knowing-0.017
 when-0.016
 for-0.016
 than-0.016
Token store
Token ID3650
Feature activation-7.92e-03

Pos logit contributions
 ah+0.036
Found+0.035
bridge+0.035
Raw+0.035
hing+0.032
Neg logit contributions
 for-0.046
 to-0.046
 when-0.044
 where-0.043
 before-0.041
Token.
Token ID13
Feature activation-4.71e-03

Pos logit contributions
asured+0.012
ind+0.011
king+0.011
 supposed+0.011
 awaiting+0.010
Neg logit contributions
 are-0.013
 bin-0.012
 tube-0.012
 ah-0.012
Found-0.012
Token\n
Token ID198
Feature activation+9.27e-03

Pos logit contributions
Found+0.034
 ah+0.034
 spoiler+0.033
Raw+0.033
 ey+0.032
Neg logit contributions
 knowing-0.024
 to-0.023
 when-0.022
king-0.022
 for-0.022
TokenHe
Token ID1544
Feature activation+1.29e-02

Pos logit contributions
Found+0.015
 ah+0.014
 spoiler+0.014
Raw+0.014
 ey+0.014
Neg logit contributions
 knowing-0.016
king-0.016
 to-0.016
 than-0.016
 when-0.016
Token unh
Token ID14274
Feature activation+4.03e-02

Pos logit contributions
Found+0.020
 ah+0.019
Raw+0.019
 spoiler+0.018
Got+0.018
Neg logit contributions
 knowing-0.024
king-0.023
 to-0.022
 than-0.022
 for-0.022
Tokenooks
Token ID31085
Feature activation-9.17e-03

Pos logit contributions
 spoiler+0.080
Raw+0.076
Found+0.072
 ah+0.070
 conclusions+0.070
Neg logit contributions
 knowing-0.056
 before-0.053
 to-0.050
king-0.049
 awaiting-0.048
Token Anna
Token ID11735
Feature activation+2.44e-02

Pos logit contributions
 knowing+0.012
king+0.012
 awaiting+0.012
asured+0.011
app+0.011
Neg logit contributions
 ah-0.018
Found-0.017
 are-0.016
 ey-0.016
 spoiler-0.016
Token from
Token ID422
Feature activation+9.61e-02

Pos logit contributions
Found+0.036
 are+0.034
 ah+0.034
 tube+0.032
Got+0.031
Neg logit contributions
 knowing-0.047
asured-0.044
 than-0.041
 awaiting-0.041
king-0.040
Token the
Token ID262
Feature activation+4.62e-02

Pos logit contributions
Found+0.176
Raw+0.173
 ah+0.172
 ey+0.170
 conclusions+0.168
Neg logit contributions
 knowing-0.145
king-0.141
 awaiting-0.125
 than-0.123
 to-0.122
Token zip
Token ID19974
Feature activation+2.26e-02

Pos logit contributions
Found+0.071
 ah+0.070
Raw+0.068
 spoiler+0.067
Got+0.064
Neg logit contributions
 to-0.084
 knowing-0.082
 for-0.081
 where-0.080
king-0.079
Token line
Token ID1627
Feature activation-1.14e-03

Pos logit contributions
 tube+0.045
 ah+0.045
 spoiler+0.044
Found+0.043
 are+0.043
Neg logit contributions
 knowing-0.029
asured-0.027
king-0.026
 supposed-0.026
 awaiting-0.025
Token and
Token ID290
Feature activation+3.96e-02

Pos logit contributions
 knowing+0.001
king+0.001
asured+0.001
ind+0.001
 supposed+0.001
Neg logit contributions
 are-0.002
 tube-0.002
 ah-0.002
 spoiler-0.002
Found-0.002
Token carries
Token ID10732
Feature activation+1.55e-02

Pos logit contributions
Found+0.073
Raw+0.071
 spoiler+0.070
 ah+0.068
 conclusions+0.067
Neg logit contributions
 to-0.061
 for-0.060
 when-0.059
 knowing-0.058
 before-0.057
Token.
Token ID13
Feature activation-5.16e-03

Pos logit contributions
asured+0.048
king+0.041
 awaiting+0.040
 knowing+0.040
rou+0.040
Neg logit contributions
 ah-0.063
 are-0.060
 tube-0.057
hing-0.057
Found-0.057
Token They
Token ID1119
Feature activation+8.24e-03

Pos logit contributions
 knowing+0.009
king+0.008
 to+0.008
 than+0.008
 for+0.008
Neg logit contributions
Found-0.009
 ah-0.008
Raw-0.008
 spoiler-0.008
 conclusions-0.008
Token all
Token ID477
Feature activation+2.94e-02

Pos logit contributions
Found+0.015
Raw+0.014
 spoiler+0.014
 ey+0.014
 conclusions+0.014
Neg logit contributions
 knowing-0.013
 to-0.013
 for-0.012
 when-0.012
king-0.012
Token covered
Token ID5017
Feature activation+9.36e-03

Pos logit contributions
Found+0.052
 ah+0.050
Raw+0.050
 spoiler+0.049
 conclusions+0.047
Neg logit contributions
 knowing-0.047
king-0.045
 to-0.045
 than-0.044
 for-0.044
Token each
Token ID1123
Feature activation+7.76e-02

Pos logit contributions
Found+0.021
Raw+0.021
 ah+0.021
 spoiler+0.020
 conclusions+0.020
Neg logit contributions
 knowing-0.010
king-0.010
 to-0.010
 than-0.009
 for-0.009
Token other
Token ID584
Feature activation+9.57e-02

Pos logit contributions
Found+0.086
Raw+0.082
 ah+0.082
 spoiler+0.079
Got+0.075
Neg logit contributions
 knowing-0.175
king-0.170
 to-0.169
 for-0.167
 than-0.166
Token in
Token ID287
Feature activation+2.85e-02

Pos logit contributions
 ah+0.198
Found+0.196
 spoiler+0.188
 conclusions+0.187
Raw+0.185
Neg logit contributions
 knowing-0.117
king-0.108
 to-0.105
 awaiting-0.104
 than-0.101
Token furry
Token ID40857
Feature activation+3.28e-02

Pos logit contributions
Found+0.052
Raw+0.051
 ah+0.049
 spoiler+0.049
Got+0.048
Neg logit contributions
 knowing-0.045
 to-0.043
 for-0.042
 when-0.042
 than-0.042
Token hugs
Token ID40657
Feature activation+2.39e-02

Pos logit contributions
 spoiler+0.056
Found+0.054
Raw+0.053
 ah+0.052
bridge+0.051
Neg logit contributions
 to-0.059
 for-0.057
 when-0.056
 than-0.056
 knowing-0.055
Token,
Token ID11
Feature activation+2.91e-02

Pos logit contributions
 ah+0.051
Found+0.051
 are+0.049
hing+0.048
Got+0.047
Neg logit contributions
asured-0.026
king-0.025
 knowing-0.025
 awaiting-0.023
app-0.022
Token and
Token ID290
Feature activation+2.60e-03

Pos logit contributions
 ah+0.063
Found+0.062
Raw+0.060
 spoiler+0.059
Got+0.058
Neg logit contributions
king-0.033
 knowing-0.031
 than-0.030
 awaiting-0.030
asured-0.029
Token the
Token ID262
Feature activation+2.38e-02

Pos logit contributions
 knowing+0.016
 to+0.016
king+0.015
 for+0.015
 when+0.015
Neg logit contributions
Found-0.026
 ah-0.025
Raw-0.025
 spoiler-0.025
 ey-0.024
Token cat
Token ID3797
Feature activation+1.17e-02

Pos logit contributions
Found+0.036
 ah+0.036
Raw+0.034
 spoiler+0.033
hing+0.032
Neg logit contributions
 to-0.051
 for-0.047
 knowing-0.046
 when-0.046
 before-0.044
Token off
Token ID572
Feature activation+3.94e-02

Pos logit contributions
Found+0.026
 ah+0.026
Raw+0.026
 spoiler+0.025
 conclusions+0.025
Neg logit contributions
 knowing-0.013
king-0.012
 than-0.011
 to-0.011
 for-0.011
Token the
Token ID262
Feature activation+5.29e-03

Pos logit contributions
 ah+0.076
Found+0.073
Raw+0.071
 tube+0.069
 spoiler+0.068
Neg logit contributions
king-0.056
 knowing-0.055
asured-0.054
 than-0.051
 awaiting-0.051
Token wheel
Token ID7825
Feature activation+6.97e-03

Pos logit contributions
Found+0.007
 ah+0.007
Raw+0.006
 spoiler+0.006
Got+0.006
Neg logit contributions
 to-0.011
 knowing-0.011
 for-0.011
 before-0.011
 when-0.011
Tokenbar
Token ID5657
Feature activation+9.49e-02

Pos logit contributions
 ah+0.014
 spoiler+0.014
Found+0.014
 tube+0.014
hing+0.014
Neg logit contributions
 knowing-0.008
king-0.008
asured-0.008
 awaiting-0.007
 than-0.007
Tokenrow
Token ID808
Feature activation+2.35e-02

Pos logit contributions
Found+0.154
Raw+0.149
 ah+0.148
 spoiler+0.142
Got+0.140
Neg logit contributions
 knowing-0.167
 to-0.161
king-0.161
 for-0.159
 than-0.158
Token and
Token ID290
Feature activation+1.45e-02

Pos logit contributions
 ah+0.046
Found+0.044
 tube+0.044
 spoiler+0.044
bridge+0.044
Neg logit contributions
asured-0.028
king-0.027
 knowing-0.027
ind-0.027
 awaiting-0.026
Token put
Token ID1234
Feature activation+1.81e-02

Pos logit contributions
Found+0.026
 ah+0.026
Raw+0.025
 spoiler+0.025
 conclusions+0.024
Neg logit contributions
 knowing-0.022
king-0.022
 to-0.021
 than-0.021
 for-0.021
Token him
Token ID683
Feature activation+5.50e-02

Pos logit contributions
Raw+0.038
Found+0.037
 ey+0.037
 conclusions+0.037
 spoiler+0.037
Neg logit contributions
 to-0.024
 for-0.024
 when-0.023
 knowing-0.022
 before-0.022
Token back
Token ID736
Feature activation+3.64e-02

Pos logit contributions
Raw+0.130
Found+0.129
 ah+0.128
 spoiler+0.125
 conclusions+0.123
Neg logit contributions
 knowing-0.055
 to-0.054
 for-0.052
 when-0.052
king-0.052
Token of
Token ID286
Feature activation+2.96e-02

Pos logit contributions
Found+0.068
 ah+0.067
 spoiler+0.066
 tube+0.066
bridge+0.065
Neg logit contributions
 knowing-0.038
king-0.037
 than-0.033
 awaiting-0.032
asured-0.032
Token soil
Token ID9260
Feature activation+3.82e-03

Pos logit contributions
Found+0.048
 spoiler+0.047
Raw+0.045
Boo+0.045
bridge+0.044
Neg logit contributions
 to-0.059
 for-0.058
 when-0.057
 where-0.055
 than-0.052
Token into
Token ID656
Feature activation+1.23e-02

Pos logit contributions
 ah+0.008
 tube+0.008
Found+0.008
 spoiler+0.008
 are+0.008
Neg logit contributions
 knowing-0.004
king-0.004
asured-0.004
 awaiting-0.004
 than-0.003
Token the
Token ID262
Feature activation+1.37e-02

Pos logit contributions
Found+0.026
Raw+0.025
 ah+0.025
 spoiler+0.025
Got+0.024
Neg logit contributions
 knowing-0.015
 to-0.014
king-0.014
 for-0.014
 when-0.014
Token wheel
Token ID7825
Feature activation+6.56e-03

Pos logit contributions
Found+0.020
 ah+0.020
 spoiler+0.019
Raw+0.019
hing+0.018
Neg logit contributions
 to-0.027
 when-0.026
 knowing-0.026
 for-0.026
 before-0.025
Tokenbar
Token ID5657
Feature activation+9.45e-02

Pos logit contributions
 ah+0.013
 spoiler+0.013
 tube+0.012
bridge+0.012
Found+0.012
Neg logit contributions
 knowing-0.009
king-0.008
asured-0.008
 awaiting-0.008
 than-0.008
Tokenrow
Token ID808
Feature activation+2.31e-02

Pos logit contributions
Found+0.152
Raw+0.147
 ah+0.146
 spoiler+0.141
Got+0.138
Neg logit contributions
 knowing-0.167
king-0.161
 to-0.161
 for-0.158
 than-0.158
Token.
Token ID13
Feature activation+1.34e-03

Pos logit contributions
 ah+0.046
 tube+0.046
 spoiler+0.045
bridge+0.045
 are+0.044
Neg logit contributions
 knowing-0.026
king-0.026
asured-0.025
 awaiting-0.024
ind-0.024
Token It
Token ID632
Feature activation+3.28e-02

Pos logit contributions
Found+0.002
 ah+0.002
Raw+0.002
 spoiler+0.002
Got+0.002
Neg logit contributions
king-0.002
 knowing-0.002
 than-0.002
asured-0.002
 for-0.002
Token was
Token ID373
Feature activation+8.96e-03

Pos logit contributions
Found+0.062
 conclusions+0.060
Raw+0.060
 ah+0.059
 spoiler+0.059
Neg logit contributions
 to-0.048
 knowing-0.047
 for-0.047
 when-0.046
king-0.045
Token heavy
Token ID4334
Feature activation+1.79e-02

Pos logit contributions
 ah+0.016
Found+0.016
 spoiler+0.016
Raw+0.016
 tube+0.015
Neg logit contributions
king-0.013
 knowing-0.013
 than-0.011
asured-0.011
 awaiting-0.011
Token He
Token ID679
Feature activation+1.75e-02

Pos logit contributions
 knowing+0.003
king+0.003
 to+0.003
 for+0.003
 than+0.003
Neg logit contributions
Found-0.003
 ah-0.003
Raw-0.003
 spoiler-0.003
 conclusions-0.003
Token jumped
Token ID11687
Feature activation-1.96e-02

Pos logit contributions
Found+0.030
 ah+0.029
Raw+0.029
 spoiler+0.028
Got+0.028
Neg logit contributions
 knowing-0.029
king-0.027
 to-0.027
 than-0.027
 for-0.026
Token onto
Token ID4291
Feature activation+1.62e-02

Pos logit contributions
 knowing+0.021
 to+0.020
king+0.020
 for+0.019
 when+0.019
Neg logit contributions
Found-0.045
 ah-0.044
Raw-0.044
 spoiler-0.043
 conclusions-0.043
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
Found+0.033
 ah+0.033
Raw+0.032
 spoiler+0.032
 conclusions+0.031
Neg logit contributions
 knowing-0.021
king-0.020
 to-0.020
 than-0.019
 for-0.019
Token wheel
Token ID7825
Feature activation+8.18e-03

Pos logit contributions
Found+0.019
Raw+0.018
 ah+0.018
 spoiler+0.017
Got+0.017
Neg logit contributions
 to-0.028
 knowing-0.028
 for-0.027
 before-0.027
 when-0.027
Tokenbar
Token ID5657
Feature activation+9.42e-02

Pos logit contributions
 ah+0.017
 spoiler+0.017
Found+0.016
 tube+0.016
hing+0.016
Neg logit contributions
 knowing-0.010
king-0.009
asured-0.009
 awaiting-0.009
 than-0.008
Tokenrow
Token ID808
Feature activation+2.64e-02

Pos logit contributions
Found+0.159
Raw+0.155
 ah+0.152
 spoiler+0.145
Got+0.145
Neg logit contributions
 knowing-0.162
 to-0.157
king-0.156
 for-0.155
 than-0.155
Token and
Token ID290
Feature activation+2.25e-02

Pos logit contributions
 ah+0.049
Found+0.048
 spoiler+0.047
 tube+0.047
bridge+0.046
Neg logit contributions
asured-0.033
king-0.033
 knowing-0.033
 awaiting-0.031
ind-0.030
Token started
Token ID2067
Feature activation-3.04e-02

Pos logit contributions
Found+0.039
Raw+0.038
 ah+0.037
 spoiler+0.037
 conclusions+0.036
Neg logit contributions
 knowing-0.037
 to-0.036
king-0.035
 when-0.035
 for-0.035
Token rolling
Token ID10708
Feature activation-1.05e-02

Pos logit contributions
king+0.022
 than+0.022
asured+0.022
 knowing+0.021
ind+0.019
Neg logit contributions
 ah-0.075
Raw-0.072
Found-0.071
 tube-0.070
 spoiler-0.068
Token around
Token ID1088
Feature activation-1.44e-02

Pos logit contributions
 to+0.013
 when+0.011
 for+0.011
 than+0.011
 knowing+0.011
Neg logit contributions
Found-0.024
 ah-0.024
Raw-0.024
 spoiler-0.023
 ey-0.023
TokenThe
Token ID464
Feature activation+8.59e-03

Pos logit contributions
 to+0.018
 when+0.017
 for+0.016
 before+0.016
 knowing+0.016
Neg logit contributions
 spoiler-0.024
 ah-0.023
 conclusions-0.023
 ey-0.022
Found-0.022
Token two
Token ID734
Feature activation+2.83e-02

Pos logit contributions
Found+0.017
Raw+0.017
 spoiler+0.017
 ah+0.017
 conclusions+0.016
Neg logit contributions
 knowing-0.012
 to-0.011
 when-0.011
 for-0.011
 before-0.011
Token players
Token ID1938
Feature activation+2.43e-02

Pos logit contributions
Found+0.049
Raw+0.048
 ah+0.047
 spoiler+0.046
 conclusions+0.045
Neg logit contributions
 knowing-0.046
 to-0.045
king-0.045
 for-0.044
 when-0.044
Token covered
Token ID5017
Feature activation+1.06e-02

Pos logit contributions
Found+0.040
Raw+0.039
 ah+0.038
 spoiler+0.038
 ey+0.037
Neg logit contributions
 knowing-0.041
 to-0.040
 when-0.040
king-0.039
 for-0.039
Token each
Token ID1123
Feature activation+7.91e-02

Pos logit contributions
Found+0.024
Raw+0.023
 ah+0.023
 spoiler+0.023
Got+0.022
Neg logit contributions
 knowing-0.012
 to-0.012
king-0.011
 when-0.011
 for-0.011
Token other
Token ID584
Feature activation+9.41e-02

Pos logit contributions
Found+0.080
 ah+0.076
Raw+0.073
 spoiler+0.069
 conclusions+0.069
Neg logit contributions
 knowing-0.188
king-0.183
 to-0.176
 than-0.175
 awaiting-0.174
Token in
Token ID287
Feature activation+2.73e-02

Pos logit contributions
 ah+0.200
Found+0.194
 spoiler+0.192
 are+0.191
 tube+0.188
Neg logit contributions
 knowing-0.106
king-0.099
asured-0.093
 awaiting-0.093
 to-0.089
Token the
Token ID262
Feature activation+2.52e-02

Pos logit contributions
Found+0.053
Raw+0.052
 ah+0.050
 spoiler+0.049
Got+0.049
Neg logit contributions
 knowing-0.039
 to-0.038
 for-0.038
 when-0.037
 than-0.036
Token game
Token ID983
Feature activation+3.17e-02

Pos logit contributions
Found+0.042
 ah+0.041
Raw+0.041
 spoiler+0.039
Got+0.038
Neg logit contributions
 to-0.043
 knowing-0.042
 for-0.042
 when-0.042
king-0.041
Token.
Token ID13
Feature activation+2.81e-03

Pos logit contributions
Found+0.070
 ah+0.069
Raw+0.067
 spoiler+0.067
 tube+0.066
Neg logit contributions
 knowing-0.035
king-0.034
asured-0.030
 than-0.030
 to-0.030
Token One
Token ID1881
Feature activation-1.18e-03

Pos logit contributions
Found+0.005
 ah+0.005
Raw+0.005
 ey+0.005
Got+0.005
Neg logit contributions
 to-0.004
 when-0.004
 knowing-0.004
 before-0.004
 for-0.004
Token much
Token ID881
Feature activation+2.40e-02

Pos logit contributions
Found+0.069
 ah+0.067
Raw+0.066
 spoiler+0.066
 conclusions+0.064
Neg logit contributions
 knowing-0.037
king-0.036
 than-0.033
 to-0.033
 awaiting-0.032
Token lighter
Token ID14871
Feature activation+1.45e-04

Pos logit contributions
Found+0.039
 ah+0.038
Raw+0.038
 spoiler+0.037
 conclusions+0.036
Neg logit contributions
 knowing-0.041
king-0.040
 to-0.039
 than-0.039
 for-0.038
Token.
Token ID13
Feature activation-2.79e-03

Pos logit contributions
 ah+0.000
 are+0.000
 tube+0.000
 conclusions+0.000
Found+0.000
Neg logit contributions
king-0.000
 knowing-0.000
 awaiting-0.000
app-0.000
asured-0.000
Token The
Token ID383
Feature activation+6.14e-03

Pos logit contributions
 knowing+0.005
king+0.005
 to+0.005
 than+0.005
 for+0.005
Neg logit contributions
Found-0.004
 ah-0.004
Raw-0.004
 spoiler-0.004
 conclusions-0.004
Token wheel
Token ID7825
Feature activation+6.47e-03

Pos logit contributions
 ah+0.010
Found+0.010
hing+0.009
Raw+0.009
 conclusions+0.009
Neg logit contributions
 for-0.011
 to-0.011
 knowing-0.011
 when-0.011
 next-0.010
Tokenbar
Token ID5657
Feature activation+9.36e-02

Pos logit contributions
Found+0.011
Raw+0.011
 ah+0.011
 spoiler+0.010
 conclusions+0.010
Neg logit contributions
 knowing-0.011
 to-0.010
king-0.010
 when-0.010
 for-0.010
Tokenrow
Token ID808
Feature activation+2.42e-02

Pos logit contributions
Raw+0.178
Found+0.177
 conclusions+0.169
 ah+0.162
Got+0.162
Neg logit contributions
 for-0.145
 knowing-0.143
 to-0.143
 when-0.139
 than-0.139
Token was
Token ID373
Feature activation+1.01e-02

Pos logit contributions
Found+0.042
Raw+0.041
 ah+0.041
 spoiler+0.040
 conclusions+0.040
Neg logit contributions
 knowing-0.039
 to-0.038
king-0.038
 for-0.037
 when-0.037
Token now
Token ID783
Feature activation+3.35e-02

Pos logit contributions
Found+0.018
 spoiler+0.018
 ah+0.018
Raw+0.017
 tube+0.017
Neg logit contributions
king-0.014
 knowing-0.014
 than-0.013
app-0.013
 awaiting-0.013
Token much
Token ID881
Feature activation-7.35e-03

Pos logit contributions
Found+0.057
 ah+0.056
 spoiler+0.056
Raw+0.055
 tube+0.054
Neg logit contributions
king-0.052
 knowing-0.049
 than-0.046
app-0.045
 awaiting-0.045
Token easier
Token ID4577
Feature activation-2.88e-02

Pos logit contributions
king+0.013
 knowing+0.013
 than+0.012
app+0.012
asured+0.012
Neg logit contributions
Found-0.011
Raw-0.011
 ah-0.011
 spoiler-0.011
Got-0.010
Token They
Token ID1119
Feature activation+1.17e-02

Pos logit contributions
 knowing+0.015
king+0.014
 to+0.014
 than+0.014
 for+0.014
Neg logit contributions
Found-0.015
 ah-0.015
Raw-0.015
 spoiler-0.014
 conclusions-0.014
Token were
Token ID547
Feature activation+3.13e-02

Pos logit contributions
Found+0.020
 ah+0.019
Raw+0.019
 spoiler+0.019
 conclusions+0.018
Neg logit contributions
 knowing-0.020
king-0.019
 to-0.019
 than-0.018
 for-0.018
Token not
Token ID407
Feature activation-2.94e-02

Pos logit contributions
 ah+0.049
Found+0.044
 are+0.043
bridge+0.042
 spoiler+0.040
Neg logit contributions
king-0.056
 than-0.050
 before-0.050
 awaiting-0.050
 knowing-0.049
Token grateful
Token ID14066
Feature activation-1.17e-02

Pos logit contributions
king+0.049
 knowing+0.049
 to+0.047
 than+0.046
 awaiting+0.046
Neg logit contributions
 ah-0.047
Found-0.046
Raw-0.044
 spoiler-0.044
Got-0.041
Token for
Token ID329
Feature activation-6.04e-03

Pos logit contributions
king+0.015
app+0.014
asured+0.014
icing+0.012
 awaiting+0.012
Neg logit contributions
 are-0.023
 ah-0.021
 conclusions-0.020
 stuff-0.020
 shovel-0.019
Token Farmer
Token ID37755
Feature activation-1.22e-01

Pos logit contributions
 knowing+0.009
king+0.009
asured+0.009
 than+0.008
 awaiting+0.008
Neg logit contributions
 ah-0.011
Found-0.010
 conclusions-0.010
Raw-0.010
 spoiler-0.010
Token Joe
Token ID5689
Feature activation-5.29e-02

Pos logit contributions
asured+0.181
 knowing+0.180
king+0.178
 awaiting+0.173
 patiently+0.172
Neg logit contributions
 ah-0.209
Found-0.200
Raw-0.184
 are-0.184
Got-0.179
Token.
Token ID13
Feature activation-9.50e-03

Pos logit contributions
king+0.071
 awaiting+0.067
asured+0.067
 knowing+0.065
 patiently+0.063
Neg logit contributions
 are-0.100
hell-0.095
 ah-0.091
Found-0.090
 conclusions-0.088
Token He
Token ID679
Feature activation+1.15e-02

Pos logit contributions
 knowing+0.016
king+0.015
 to+0.015
 than+0.015
 for+0.015
Neg logit contributions
Found-0.016
 ah-0.016
Raw-0.015
 spoiler-0.015
 conclusions-0.015
Token was
Token ID373
Feature activation+7.95e-03

Pos logit contributions
Found+0.018
Raw+0.017
 ah+0.017
Got+0.017
 spoiler+0.016
Neg logit contributions
king-0.021
 knowing-0.020
 than-0.019
 to-0.019
 awaiting-0.019
Token mean
Token ID1612
Feature activation+3.36e-02

Pos logit contributions
 ah+0.012
Found+0.011
bridge+0.011
 are+0.010
Raw+0.010
Neg logit contributions
king-0.014
 than-0.013
 since-0.012
 awaiting-0.012
iously-0.012
Token And
Token ID843
Feature activation-1.28e-02

Pos logit contributions
king+0.015
 knowing+0.015
 than+0.014
 awaiting+0.014
 to+0.014
Neg logit contributions
Found-0.010
Raw-0.010
 ah-0.010
 spoiler-0.010
 ey-0.009
Token he
Token ID339
Feature activation-2.03e-02

Pos logit contributions
king+0.020
 knowing+0.017
 awaiting+0.017
asured+0.017
app+0.016
Neg logit contributions
 ah-0.023
Found-0.022
 conclusions-0.022
 are-0.021
Raw-0.021
Token was
Token ID373
Feature activation-6.99e-03

Pos logit contributions
 knowing+0.029
king+0.029
 to+0.027
 than+0.027
 for+0.027
Neg logit contributions
Found-0.039
Raw-0.037
 ah-0.037
 spoiler-0.036
Got-0.036
Token grateful
Token ID14066
Feature activation-8.16e-03

Pos logit contributions
king+0.011
 than+0.009
 awaiting+0.009
app+0.009
ind+0.009
Neg logit contributions
Found-0.012
 ah-0.012
 are-0.011
bridge-0.011
 spoiler-0.011
Token for
Token ID329
Feature activation-8.18e-03

Pos logit contributions
king+0.009
ind+0.008
asured+0.008
app+0.007
astical+0.007
Neg logit contributions
 are-0.017
 tube-0.016
 rock-0.015
 vest-0.015
 weapon-0.015
Token Farmer
Token ID37755
Feature activation-1.16e-01

Pos logit contributions
king+0.012
asured+0.011
 than+0.011
 knowing+0.011
app+0.011
Neg logit contributions
 ah-0.015
Found-0.014
 conclusions-0.014
 spoiler-0.013
Raw-0.013
Token Joe
Token ID5689
Feature activation-4.58e-02

Pos logit contributions
king+0.213
 knowing+0.211
asured+0.207
 awaiting+0.205
 where+0.199
Neg logit contributions
 ah-0.157
Found-0.148
Raw-0.136
 spoiler-0.135
 tube-0.134
Token's
Token ID338
Feature activation-3.65e-02

Pos logit contributions
king+0.063
 knowing+0.058
asured+0.057
 awaiting+0.056
 than+0.055
Neg logit contributions
 are-0.083
 ah-0.083
Found-0.082
 tube-0.078
hell-0.078
Token help
Token ID1037
Feature activation+2.46e-02

Pos logit contributions
 knowing+0.063
 to+0.061
 for+0.059
king+0.058
 when+0.058
Neg logit contributions
Found-0.062
 ah-0.060
Raw-0.060
 spoiler-0.059
Got-0.057
Token.
Token ID13
Feature activation-5.49e-03

Pos logit contributions
 ah+0.049
 conclusions+0.049
 are+0.049
Found+0.048
hing+0.048
Neg logit contributions
king-0.028
asured-0.028
 knowing-0.026
 awaiting-0.026
app-0.024
Token<|endoftext|>
Token ID50256
Feature activation-1.40e-02

Pos logit contributions
king+0.010
 knowing+0.010
 than+0.010
asured+0.009
 awaiting+0.009
Neg logit contributions
Found-0.008
 ah-0.008
Raw-0.008
 spoiler-0.007
Got-0.007
Token.
Token ID13
Feature activation+7.18e-04

Pos logit contributions
 ah+0.035
 are+0.034
 tube+0.033
 shovel+0.032
 bin+0.032
Neg logit contributions
king-0.034
asured-0.033
 supposed-0.033
ind-0.033
elia-0.032
Token They
Token ID1119
Feature activation+1.94e-02

Pos logit contributions
Found+0.001
Raw+0.001
 ah+0.001
 spoiler+0.001
 conclusions+0.001
Neg logit contributions
king-0.001
 knowing-0.001
 than-0.001
asured-0.001
 for-0.001
Token were
Token ID547
Feature activation+3.75e-02

Pos logit contributions
Found+0.034
Raw+0.034
 ah+0.033
 spoiler+0.033
 conclusions+0.032
Neg logit contributions
 knowing-0.031
 to-0.030
king-0.029
 for-0.029
 when-0.029
Token grateful
Token ID14066
Feature activation+4.81e-03

Pos logit contributions
 ah+0.059
Found+0.057
bridge+0.054
 spoiler+0.054
Raw+0.052
Neg logit contributions
king-0.062
 knowing-0.059
 than-0.058
 awaiting-0.058
app-0.057
Token for
Token ID329
Feature activation+2.02e-03

Pos logit contributions
 are+0.010
 ah+0.010
 seaw+0.009
 conclusions+0.009
 shovel+0.009
Neg logit contributions
king-0.005
asured-0.005
app-0.005
icing-0.004
 supposed-0.004
Token Farmer
Token ID37755
Feature activation-1.15e-01

Pos logit contributions
 ah+0.004
Found+0.003
 conclusions+0.003
 are+0.003
Raw+0.003
Neg logit contributions
king-0.003
 knowing-0.003
 than-0.003
asured-0.003
 to-0.003
Token Sue
Token ID26089
Feature activation-6.87e-02

Pos logit contributions
king+0.192
 knowing+0.192
asured+0.192
 awaiting+0.187
 patiently+0.181
Neg logit contributions
 ah-0.174
Found-0.165
 are-0.148
Raw-0.147
 spoiler-0.147
Token.
Token ID13
Feature activation-3.99e-03

Pos logit contributions
king+0.094
 awaiting+0.089
asured+0.088
 knowing+0.085
 than+0.080
Neg logit contributions
 are-0.129
hell-0.119
 ah-0.117
Found-0.117
 vest-0.113
Token She
Token ID1375
Feature activation+1.39e-02

Pos logit contributions
king+0.007
 knowing+0.007
asured+0.007
 to+0.007
 than+0.007
Neg logit contributions
Found-0.006
Raw-0.006
 ah-0.006
 spoiler-0.006
Got-0.005
Token was
Token ID373
Feature activation+1.15e-02

Pos logit contributions
Found+0.024
 ah+0.023
Raw+0.023
Got+0.022
 spoiler+0.022
Neg logit contributions
king-0.023
 knowing-0.022
 than-0.021
 to-0.021
 for-0.021
Token nice
Token ID3621
Feature activation+1.44e-02

Pos logit contributions
 ah+0.018
Found+0.017
bridge+0.017
 are+0.016
Raw+0.015
Neg logit contributions
king-0.020
 than-0.019
 awaiting-0.017
 since-0.017
 knowing-0.017
Token.
Token ID13
Feature activation-1.51e-03

Pos logit contributions
 are+0.004
 conclusions+0.004
 tube+0.004
Found+0.004
hell+0.004
Neg logit contributions
king-0.003
 knowing-0.002
asured-0.002
 awaiting-0.002
app-0.002
Token
Token ID220
Feature activation+5.99e-03

Pos logit contributions
 knowing+0.003
king+0.003
asured+0.003
 than+0.003
 for+0.002
Neg logit contributions
Found-0.002
Raw-0.002
 ah-0.002
Got-0.002
 conclusions-0.002
Token\n
Token ID198
Feature activation-1.38e-03

Pos logit contributions
Found+0.012
 ah+0.011
Raw+0.011
 spoiler+0.011
 conclusions+0.011
Neg logit contributions
 knowing-0.008
 to-0.008
 when-0.008
 for-0.008
king-0.008
Token\n
Token ID198
Feature activation-6.21e-03

Pos logit contributions
 knowing+0.002
king+0.002
 than+0.002
 to+0.002
 for+0.002
Neg logit contributions
Found-0.002
Raw-0.002
 ah-0.002
 spoiler-0.002
Got-0.002
TokenS
Token ID50
Feature activation-3.62e-02

Pos logit contributions
 knowing+0.012
asured+0.012
king+0.012
 for+0.011
 before+0.011
Neg logit contributions
Found-0.009
Raw-0.009
Got-0.008
AAAA-0.008
Hey-0.008
Tokenasha
Token ID14715
Feature activation-1.10e-01

Pos logit contributions
 knowing+0.064
king+0.059
asured+0.059
 where+0.059
 before+0.057
Neg logit contributions
Found-0.059
Hey-0.054
Got-0.054
orn-0.053
 ey-0.052
Token looked
Token ID3114
Feature activation-2.88e-02

Pos logit contributions
 knowing+0.190
king+0.185
 than+0.178
 for+0.172
 awaiting+0.169
Neg logit contributions
Found-0.180
 ah-0.173
Raw-0.173
Got-0.163
 spoiler-0.159
Token out
Token ID503
Feature activation-2.40e-02

Pos logit contributions
king+0.032
 knowing+0.031
 than+0.029
 letting+0.029
asured+0.028
Neg logit contributions
Found-0.060
 ey-0.058
Raw-0.058
 ah-0.058
 tube-0.056
Token of
Token ID286
Feature activation-3.27e-03

Pos logit contributions
king+0.039
asured+0.037
illa+0.035
cess+0.035
 knowing+0.035
Neg logit contributions
Found-0.039
 ey-0.038
 ah-0.037
Raw-0.037
 are-0.037
Token her
Token ID607
Feature activation+4.26e-03

Pos logit contributions
 to+0.004
 knowing+0.003
 for+0.003
 where+0.003
 when+0.003
Neg logit contributions
 ah-0.008
Raw-0.008
Found-0.007
 spoiler-0.007
bridge-0.007
Token window
Token ID4324
Feature activation-2.06e-02

Pos logit contributions
 ey+0.006
Found+0.006
 tube+0.006
 ah+0.006
 spoiler+0.006
Neg logit contributions
 knowing-0.008
king-0.008
 than-0.007
 awaiting-0.007
asured-0.007
Token\n
Token ID198
Feature activation-1.54e-03

Pos logit contributions
Found+0.004
Raw+0.004
 ah+0.004
 spoiler+0.003
Got+0.003
Neg logit contributions
 knowing-0.004
king-0.004
 to-0.004
 than-0.004
 for-0.004
Token\n
Token ID198
Feature activation-6.93e-03

Pos logit contributions
 knowing+0.003
 to+0.003
 for+0.003
 when+0.003
 where+0.003
Neg logit contributions
 spoiler-0.002
 ah-0.002
Found-0.002
Raw-0.002
 ey-0.002
TokenThe
Token ID464
Feature activation-1.65e-02

Pos logit contributions
 knowing+0.011
king+0.010
 to+0.010
 than+0.010
 for+0.010
Neg logit contributions
Found-0.013
 ah-0.012
Raw-0.012
 spoiler-0.012
 conclusions-0.012
Token moral
Token ID6573
Feature activation-2.47e-02

Pos logit contributions
 knowing+0.024
 for+0.023
 to+0.022
 when+0.022
 before+0.022
Neg logit contributions
Found-0.032
Raw-0.031
 ah-0.031
 spoiler-0.031
 conclusions-0.030
Token of
Token ID286
Feature activation-5.55e-02

Pos logit contributions
king+0.038
 knowing+0.038
 than+0.034
 for+0.034
 awaiting+0.034
Neg logit contributions
Found-0.045
Raw-0.043
 ah-0.042
Got-0.041
 conclusions-0.041
Token Leah
Token ID31232
Feature activation-1.10e-01

Pos logit contributions
king+0.086
 knowing+0.079
 than+0.076
 awaiting+0.076
asured+0.074
Neg logit contributions
 ah-0.101
Found-0.100
 conclusions-0.097
Raw-0.094
 ey-0.093
Token's
Token ID338
Feature activation-6.68e-02

Pos logit contributions
king+0.155
 knowing+0.152
 than+0.139
 awaiting+0.136
 to+0.136
Neg logit contributions
Found-0.211
 ah-0.206
Raw-0.204
 spoiler-0.195
Got-0.193
Token story
Token ID1621
Feature activation-4.55e-02

Pos logit contributions
king+0.141
 knowing+0.137
 than+0.135
 awaiting+0.130
 to+0.129
Neg logit contributions
 ah-0.079
Found-0.079
Raw-0.078
 conclusions-0.073
 ey-0.073
Token is
Token ID318
Feature activation-4.09e-02

Pos logit contributions
king+0.085
 knowing+0.077
asured+0.074
 awaiting+0.072
rying+0.070
Neg logit contributions
 are-0.070
Found-0.065
 tube-0.063
 ey-0.061
Raw-0.060
Token that
Token ID326
Feature activation-1.50e-02

Pos logit contributions
king+0.056
asured+0.040
app+0.035
ind+0.035
 than+0.035
Neg logit contributions
 are-0.083
 tube-0.077
 am-0.075
 conclusions-0.074
Found-0.072
Token it
Token ID340
Feature activation-1.25e-02

Pos logit contributions
 knowing+0.022
king+0.022
 to+0.021
 than+0.020
 for+0.020
Neg logit contributions
Found-0.028
 ah-0.027
Raw-0.027
 spoiler-0.026
 conclusions-0.026
Token big
Token ID1263
Feature activation-6.14e-04

Pos logit contributions
 knowing+0.028
king+0.028
asured+0.026
 to+0.026
 awaiting+0.025
Neg logit contributions
Found-0.026
 ah-0.025
Raw-0.025
 spoiler-0.024
Got-0.024
Token.
Token ID13
Feature activation-6.52e-03

Pos logit contributions
king+0.001
asured+0.001
 knowing+0.001
ind+0.001
app+0.001
Neg logit contributions
Found-0.001
 ah-0.001
 are-0.001
 spoiler-0.001
Raw-0.001
Token
Token ID198
Feature activation-5.68e-03

Pos logit contributions
 knowing+0.011
king+0.011
 to+0.010
 than+0.010
 for+0.010
Neg logit contributions
Found-0.011
Raw-0.011
 ah-0.011
 spoiler-0.010
Got-0.010
Token
Token ID198
Feature activation-8.91e-03

Pos logit contributions
 knowing+0.008
 to+0.008
king+0.008
 for+0.008
 when+0.008
Neg logit contributions
Found-0.011
 ah-0.011
 spoiler-0.010
Raw-0.010
 conclusions-0.010
TokenS
Token ID50
Feature activation-3.80e-02

Pos logit contributions
 knowing+0.017
king+0.016
asured+0.016
 for+0.016
 where+0.015
Neg logit contributions
Found-0.013
Raw-0.013
Got-0.012
AAAA-0.012
 ah-0.011
Tokenasha
Token ID14715
Feature activation-1.09e-01

Pos logit contributions
 knowing+0.067
 where+0.062
asured+0.061
king+0.061
 before+0.059
Neg logit contributions
Found-0.062
orn-0.058
Got-0.058
 ey-0.058
Hey-0.057
Token was
Token ID373
Feature activation+1.53e-04

Pos logit contributions
 knowing+0.191
king+0.185
 than+0.176
 for+0.172
 awaiting+0.171
Neg logit contributions
Found-0.177
 ah-0.172
Raw-0.169
Got-0.160
 spoiler-0.157
Token about
Token ID546
Feature activation-4.41e-02

Pos logit contributions
Found+0.000
 ah+0.000
Raw+0.000
 are+0.000
 pear+0.000
Neg logit contributions
king-0.000
 knowing-0.000
ind-0.000
 than-0.000
 awaiting-0.000
Token to
Token ID284
Feature activation-1.89e-02

Pos logit contributions
king+0.028
asured+0.023
app+0.018
 knowing+0.018
 awaiting+0.017
Neg logit contributions
Raw-0.116
Found-0.115
 conclusions-0.113
 ah-0.112
 ey-0.112
Token touch
Token ID3638
Feature activation-6.60e-03

Pos logit contributions
 to+0.031
 knowing+0.030
 when+0.030
 before+0.029
 for+0.029
Neg logit contributions
Found-0.034
Raw-0.032
 spoiler-0.032
bridge-0.031
 ah-0.031
Token the
Token ID262
Feature activation+5.26e-03

Pos logit contributions
 to+0.008
 knowing+0.008
 when+0.008
 for+0.007
 before+0.007
Neg logit contributions
Found-0.015
Raw-0.015
 ah-0.014
 spoiler-0.014
Got-0.014
Token was
Token ID373
Feature activation+4.33e-03

Pos logit contributions
Found+0.004
Raw+0.004
 ah+0.004
 spoiler+0.004
Got+0.004
Neg logit contributions
 knowing-0.003
king-0.003
 to-0.003
 than-0.003
 for-0.003
Token sad
Token ID6507
Feature activation+1.22e-02

Pos logit contributions
 ah+0.007
Raw+0.006
Found+0.006
 are+0.006
 pear+0.006
Neg logit contributions
king-0.008
 than-0.007
 knowing-0.007
 awaiting-0.007
asured-0.007
Token,
Token ID11
Feature activation+5.64e-03

Pos logit contributions
 are+0.024
 tube+0.024
 vest+0.023
 flashlight+0.023
 cloth+0.023
Neg logit contributions
king-0.016
app-0.013
 awaiting-0.013
asured-0.013
illa-0.012
Token because
Token ID780
Feature activation-4.58e-02

Pos logit contributions
Found+0.009
Raw+0.009
 ah+0.009
 are+0.009
 spoiler+0.009
Neg logit contributions
asured-0.009
king-0.009
 awaiting-0.008
rou-0.007
eter-0.007
Token her
Token ID607
Feature activation-3.12e-02

Pos logit contributions
 knowing+0.060
 to+0.058
 when+0.057
 for+0.056
 than+0.055
Neg logit contributions
Found-0.095
Raw-0.094
 ah-0.092
 spoiler-0.092
Got-0.089
Token pap
Token ID20461
Feature activation-1.09e-01

Pos logit contributions
 knowing+0.046
 to+0.044
king+0.042
 for+0.042
 when+0.042
Neg logit contributions
Found-0.060
 ah-0.059
Raw-0.058
 spoiler-0.057
 conclusions-0.056
Tokena
Token ID64
Feature activation-5.47e-02

Pos logit contributions
 knowing+0.145
king+0.139
 to+0.134
 before+0.134
 when+0.132
Neg logit contributions
Found-0.214
 ah-0.210
Raw-0.204
 spoiler-0.201
 tube-0.199
Token never
Token ID1239
Feature activation-1.89e-02

Pos logit contributions
 knowing+0.094
king+0.092
 to+0.087
 than+0.086
 for+0.086
Neg logit contributions
Found-0.089
 ah-0.086
Raw-0.085
 spoiler-0.082
Got-0.080
Token had
Token ID550
Feature activation-1.99e-02

Pos logit contributions
king+0.033
 knowing+0.031
 than+0.031
 patiently+0.031
 awaiting+0.030
Neg logit contributions
Found-0.029
 ah-0.027
 are-0.027
Got-0.026
Raw-0.026
Token time
Token ID640
Feature activation-5.41e-02

Pos logit contributions
king+0.031
 knowing+0.029
 than+0.028
 awaiting+0.027
asured+0.027
Neg logit contributions
Found-0.035
 ah-0.033
Raw-0.032
Got-0.032
 conclusions-0.032
Token to
Token ID284
Feature activation-9.55e-03

Pos logit contributions
king+0.039
asured+0.030
ind+0.028
izing+0.028
app+0.027
Neg logit contributions
 are-0.133
Found-0.130
 ah-0.127
 tube-0.124
 conclusions-0.123
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
asured+0.013
king+0.012
app+0.012
ind+0.012
ich+0.011
Neg logit contributions
 are-0.013
 conclusions-0.012
 ah-0.011
 g-0.011
Found-0.011
Token He
Token ID679
Feature activation-4.88e-04

Pos logit contributions
 knowing+0.024
king+0.023
 than+0.022
 for+0.022
 to+0.022
Neg logit contributions
Found-0.023
Raw-0.023
 ah-0.022
 spoiler-0.022
Got-0.021
Token didn
Token ID1422
Feature activation+7.53e-03

Pos logit contributions
 knowing+0.001
king+0.001
 than+0.001
 for+0.001
 where+0.001
Neg logit contributions
Found-0.001
Raw-0.001
 ah-0.001
Got-0.001
 spoiler-0.001
Token't
Token ID470
Feature activation-1.25e-02

Pos logit contributions
Found+0.014
 ah+0.014
Raw+0.014
 spoiler+0.013
Got+0.013
Neg logit contributions
 knowing-0.011
 to-0.011
king-0.011
 for-0.010
 when-0.010
Token want
Token ID765
Feature activation+4.04e-03

Pos logit contributions
 knowing+0.021
 to+0.021
 for+0.020
 than+0.020
 when+0.020
Neg logit contributions
Found-0.022
 ah-0.022
 spoiler-0.021
 ey-0.021
Raw-0.021
Token Farmer
Token ID37755
Feature activation-1.08e-01

Pos logit contributions
Found+0.011
Raw+0.011
 ah+0.011
 conclusions+0.011
Got+0.011
Neg logit contributions
king-0.002
 knowing-0.002
asured-0.001
 awaiting-0.001
 than-0.001
Token Dan
Token ID6035
Feature activation-5.35e-02

Pos logit contributions
king+0.184
 knowing+0.183
asured+0.181
 patiently+0.177
ind+0.177
Neg logit contributions
 ah-0.163
Found-0.158
 are-0.146
Raw-0.144
 spoiler-0.142
Token to
Token ID284
Feature activation-1.32e-02

Pos logit contributions
king+0.035
 knowing+0.033
asured+0.031
 awaiting+0.029
 patiently+0.028
Neg logit contributions
 are-0.136
Found-0.132
 ah-0.131
hell-0.130
 conclusions-0.129
Token be
Token ID307
Feature activation-4.46e-04

Pos logit contributions
 knowing+0.023
king+0.022
 to+0.021
 than+0.021
 for+0.021
Neg logit contributions
Found-0.022
 ah-0.021
Raw-0.021
 spoiler-0.021
 conclusions-0.020
Token mad
Token ID8805
Feature activation+1.92e-02

Pos logit contributions
 knowing+0.001
king+0.001
 awaiting+0.001
 to+0.001
 than+0.001
Neg logit contributions
Found-0.001
 ah-0.001
Raw-0.001
 spoiler-0.001
Got-0.001
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 ah+0.037
Found+0.036
 are+0.036
 spoiler+0.035
Raw+0.034
Neg logit contributions
asured-0.022
app-0.021
king-0.021
ind-0.020
 awaiting-0.020
Token bird
Token ID6512
Feature activation+3.25e-03

Pos logit contributions
Found+0.002
Raw+0.002
 ah+0.002
 spoiler+0.002
 conclusions+0.002
Neg logit contributions
 knowing-0.003
 to-0.003
 when-0.002
 for-0.002
king-0.002
Token began
Token ID2540
Feature activation-4.86e-02

Pos logit contributions
Found+0.005
 ah+0.005
Raw+0.005
 spoiler+0.004
Got+0.004
Neg logit contributions
 knowing-0.006
king-0.006
 than-0.006
 to-0.006
 for-0.006
Token to
Token ID284
Feature activation-1.57e-02

Pos logit contributions
asured+0.031
king+0.029
 than+0.027
becca+0.026
ind+0.025
Neg logit contributions
 ah-0.120
 ey-0.117
Raw-0.116
Found-0.114
 tube-0.113
Token tell
Token ID1560
Feature activation-4.78e-03

Pos logit contributions
 knowing+0.029
 to+0.028
 when+0.027
 for+0.027
king+0.027
Neg logit contributions
Found-0.025
 spoiler-0.024
 ah-0.023
bridge-0.023
Raw-0.023
Token Pen
Token ID7507
Feature activation-7.01e-02

Pos logit contributions
king+0.008
 knowing+0.007
asured+0.007
 than+0.007
 awaiting+0.007
Neg logit contributions
 ah-0.008
Found-0.008
Raw-0.008
 conclusions-0.008
 are-0.008
Tokenelope
Token ID47329
Feature activation-1.06e-01

Pos logit contributions
king+0.119
 knowing+0.119
 awaiting+0.117
asured+0.117
 patiently+0.112
Neg logit contributions
Found-0.091
 ah-0.089
Raw-0.089
hell-0.086
orn-0.084
Token it
Token ID340
Feature activation-1.53e-02

Pos logit contributions
king+0.134
 than+0.130
 knowing+0.128
asured+0.126
 awaiting+0.121
Neg logit contributions
 ah-0.211
Found-0.205
 are-0.198
 pear-0.193
 conclusions-0.192
Token's
Token ID338
Feature activation-4.09e-02

Pos logit contributions
king+0.023
 knowing+0.022
asured+0.022
 than+0.021
 awaiting+0.020
Neg logit contributions
Found-0.026
 ah-0.026
 are-0.026
Raw-0.024
 tube-0.024
Token story
Token ID1621
Feature activation-2.89e-02

Pos logit contributions
king+0.078
 knowing+0.076
 than+0.073
 where+0.072
 when+0.071
Neg logit contributions
Found-0.055
Raw-0.055
 ah-0.054
 conclusions-0.051
 tube-0.051
Token.
Token ID13
Feature activation-4.73e-04

Pos logit contributions
asured+0.035
king+0.034
 knowing+0.032
ind+0.030
rying+0.030
Neg logit contributions
hell-0.057
bridge-0.057
Found-0.056
 are-0.056
 ah-0.055
Token
Token ID220
Feature activation+9.11e-03

Pos logit contributions
 knowing+0.001
 to+0.001
king+0.001
 when+0.001
 for+0.001
Neg logit contributions
Found-0.001
 ah-0.001
Raw-0.001
 spoiler-0.001
 conclusions-0.001
Token on
Token ID319
Feature activation-3.99e-03

Pos logit contributions
 knowing+0.048
 to+0.046
king+0.045
 for+0.045
 when+0.044
Neg logit contributions
Found-0.076
 ah-0.075
Raw-0.074
 spoiler-0.073
 conclusions-0.072
Token,
Token ID11
Feature activation-8.60e-03

Pos logit contributions
 knowing+0.006
king+0.005
 to+0.005
 for+0.005
 than+0.005
Neg logit contributions
Found-0.008
 ah-0.008
Raw-0.008
 spoiler-0.008
 conclusions-0.007
Token she
Token ID673
Feature activation-3.10e-02

Pos logit contributions
 knowing+0.013
king+0.012
 to+0.012
 than+0.012
 for+0.012
Neg logit contributions
Found-0.016
 ah-0.016
Raw-0.016
 spoiler-0.015
Got-0.015
Token still
Token ID991
Feature activation-2.69e-02

Pos logit contributions
king+0.043
 knowing+0.042
 than+0.040
 for+0.039
 to+0.039
Neg logit contributions
Found-0.060
Raw-0.058
 ah-0.058
Got-0.056
 spoiler-0.055
Token pret
Token ID2181
Feature activation-1.78e-02

Pos logit contributions
 knowing+0.041
 to+0.039
king+0.038
 than+0.038
 when+0.038
Neg logit contributions
Found-0.050
 ah-0.049
Raw-0.048
 spoiler-0.048
 conclusions-0.047
Tokenends
Token ID2412
Feature activation-1.06e-01

Pos logit contributions
 knowing+0.026
 before+0.024
 for+0.024
 than+0.024
 to+0.023
Neg logit contributions
 ah-0.034
Found-0.033
 spoiler-0.032
Got-0.031
Boo-0.031
Token to
Token ID284
Feature activation-4.55e-02

Pos logit contributions
king+0.105
asured+0.088
 knowing+0.087
 awaiting+0.079
app+0.078
Neg logit contributions
Found-0.253
 ah-0.248
Raw-0.242
Got-0.237
 tube-0.231
Token be
Token ID307
Feature activation-2.75e-02

Pos logit contributions
 to+0.068
 knowing+0.067
 when+0.066
 for+0.064
 than+0.064
Neg logit contributions
Found-0.089
 ey-0.084
 ah-0.084
 spoiler-0.082
bridge-0.082
Token monsters
Token ID10963
Feature activation+6.72e-03

Pos logit contributions
 knowing+0.040
king+0.039
 to+0.037
 than+0.037
 for+0.036
Neg logit contributions
Found-0.052
 ah-0.050
Raw-0.050
 spoiler-0.048
Got-0.047
Token but
Token ID475
Feature activation-9.91e-03

Pos logit contributions
Found+0.014
 ah+0.014
Raw+0.013
 are+0.013
 spoiler+0.013
Neg logit contributions
 knowing-0.008
king-0.008
asured-0.007
 awaiting-0.007
app-0.007
Token she
Token ID673
Feature activation-4.60e-03

Pos logit contributions
 knowing+0.013
 to+0.012
king+0.012
 for+0.012
 than+0.012
Neg logit contributions
Found-0.020
 ah-0.020
Raw-0.020
 spoiler-0.020
 conclusions-0.019
Token park
Token ID3952
Feature activation-1.06e-02

Pos logit contributions
Found+0.018
Raw+0.017
 ah+0.017
 spoiler+0.016
 conclusions+0.016
Neg logit contributions
 knowing-0.026
king-0.026
 than-0.025
 to-0.025
 for-0.025
Token to
Token ID284
Feature activation+2.23e-03

Pos logit contributions
king+0.017
ross+0.015
ind+0.015
asured+0.015
rou+0.015
Neg logit contributions
 slide-0.019
 are-0.018
 vest-0.017
 pear-0.017
 tube-0.016
Token play
Token ID711
Feature activation+2.33e-02

Pos logit contributions
Found+0.004
 ah+0.004
 spoiler+0.004
Raw+0.004
 ey+0.004
Neg logit contributions
 to-0.003
 where-0.003
 for-0.003
 when-0.003
 knowing-0.003
Token.
Token ID13
Feature activation+1.67e-04

Pos logit contributions
Found+0.061
 ah+0.060
Raw+0.060
 spoiler+0.059
 ey+0.058
Neg logit contributions
 knowing-0.018
 to-0.017
king-0.016
 for-0.016
 when-0.016
Token So
Token ID1406
Feature activation+1.83e-02

Pos logit contributions
Found+0.000
Raw+0.000
 ah+0.000
 ey+0.000
Got+0.000
Neg logit contributions
 knowing-0.000
king-0.000
asured-0.000
 than-0.000
 for-0.000
Token Sasha
Token ID32151
Feature activation-1.06e-01

Pos logit contributions
Found+0.030
Raw+0.028
 ah+0.028
 are+0.028
Got+0.027
Neg logit contributions
king-0.032
asured-0.030
 knowing-0.030
rou-0.027
ind-0.027
Token asked
Token ID1965
Feature activation+1.50e-02

Pos logit contributions
 knowing+0.182
king+0.178
 than+0.167
 awaiting+0.157
 for+0.156
Neg logit contributions
Found-0.175
Raw-0.170
 ah-0.167
Got-0.159
 tube-0.157
Token her
Token ID607
Feature activation-6.78e-03

Pos logit contributions
Found+0.030
 ah+0.029
Raw+0.029
 spoiler+0.028
Got+0.028
Neg logit contributions
 knowing-0.021
king-0.020
 to-0.019
 than-0.019
 for-0.019
Token dad
Token ID9955
Feature activation-4.34e-03

Pos logit contributions
king+0.012
 knowing+0.011
 than+0.011
asured+0.010
app+0.010
Neg logit contributions
Found-0.011
 ah-0.011
 tube-0.011
Raw-0.010
 are-0.010
Token if
Token ID611
Feature activation+1.79e-02

Pos logit contributions
king+0.007
asured+0.006
 knowing+0.006
 awaiting+0.006
 than+0.006
Neg logit contributions
 are-0.008
Found-0.007
 ah-0.007
 pear-0.007
 tube-0.007
Token she
Token ID673
Feature activation-2.56e-02

Pos logit contributions
 ah+0.030
Found+0.029
Raw+0.028
Got+0.027
 are+0.026
Neg logit contributions
king-0.031
 knowing-0.029
 than-0.028
asured-0.028
app-0.027
Tokenmy
Token ID1820
Feature activation+4.66e-05

Pos logit contributions
 knowing+0.000
 than+0.000
king+0.000
 awaiting+0.000
asured+0.000
Neg logit contributions
Found-0.000
 ah-0.000
Raw-0.000
 are-0.000
 tube-0.000
Token said
Token ID531
Feature activation-4.77e-03

Pos logit contributions
 ah+0.000
Found+0.000
Raw+0.000
 tube+0.000
Got+0.000
Neg logit contributions
 than-0.000
 knowing-0.000
king-0.000
asured-0.000
 awaiting-0.000
Token goodbye
Token ID24829
Feature activation-1.31e-03

Pos logit contributions
 knowing+0.006
king+0.006
 to+0.006
 for+0.006
 than+0.006
Neg logit contributions
Found-0.010
 ah-0.009
Raw-0.009
 spoiler-0.009
 conclusions-0.009
Token and
Token ID290
Feature activation-7.47e-03

Pos logit contributions
 supposed+0.002
asured+0.002
rou+0.002
ind+0.002
iring+0.002
Neg logit contributions
 shovel-0.001
 Boom-0.001
 vest-0.001
 rock-0.001
 -0.001
Token introduced
Token ID5495
Feature activation-4.85e-02

Pos logit contributions
king+0.013
 knowing+0.013
 than+0.013
asured+0.012
 awaiting+0.012
Neg logit contributions
Found-0.012
 ah-0.012
Raw-0.011
Got-0.011
 tube-0.011
Token Rachel
Token ID15984
Feature activation-1.05e-01

Pos logit contributions
king+0.087
 knowing+0.086
 awaiting+0.084
 than+0.082
asured+0.081
Neg logit contributions
Found-0.074
 ah-0.072
 conclusions-0.069
 ey-0.066
Raw-0.066
Token to
Token ID284
Feature activation-2.51e-02

Pos logit contributions
king+0.105
asured+0.084
izing+0.080
iring+0.078
 awaiting+0.073
Neg logit contributions
 are-0.231
Found-0.229
hell-0.228
 pear-0.225
 vest-0.222
Token the
Token ID262
Feature activation-6.20e-04

Pos logit contributions
king+0.041
 knowing+0.038
 than+0.038
asured+0.038
 awaiting+0.036
Neg logit contributions
Found-0.044
 ah-0.043
 conclusions-0.042
Raw-0.041
 are-0.040
Token nice
Token ID3621
Feature activation+9.61e-03

Pos logit contributions
king+0.001
asured+0.001
 knowing+0.001
app+0.001
 than+0.001
Neg logit contributions
Found-0.001
 tube-0.001
Raw-0.001
 vest-0.001
 fog-0.001
Token lady
Token ID10846
Feature activation-5.70e-02

Pos logit contributions
 tube+0.011
Found+0.011
 ey+0.011
 vest+0.011
 rock+0.011
Neg logit contributions
king-0.021
asured-0.019
 knowing-0.019
app-0.018
 letting-0.018
Token at
Token ID379
Feature activation-1.42e-02

Pos logit contributions
king+0.075
 knowing+0.069
asured+0.063
app+0.061
 awaiting+0.060
Neg logit contributions
Found-0.110
 are-0.107
 ah-0.105
 tube-0.102
 pear-0.102
Token there
Token ID612
Feature activation+1.27e-02

Pos logit contributions
 knowing+0.036
king+0.034
 to+0.034
 than+0.033
 for+0.033
Neg logit contributions
Found-0.055
 ah-0.054
Raw-0.054
 spoiler-0.052
 conclusions-0.051
Token."
Token ID526
Feature activation+1.18e-02

Pos logit contributions
 ah+0.024
Found+0.024
Raw+0.024
 are+0.024
 conclusions+0.023
Neg logit contributions
king-0.017
 knowing-0.017
asured-0.015
 awaiting-0.015
 than-0.014
Token
Token ID198
Feature activation+4.39e-03

Pos logit contributions
Found+0.021
 ah+0.020
Raw+0.020
 spoiler+0.020
 conclusions+0.019
Neg logit contributions
 knowing-0.019
king-0.018
 to-0.018
 than-0.017
 for-0.017
Token
Token ID198
Feature activation-2.38e-03

Pos logit contributions
Found+0.007
Raw+0.007
 ah+0.007
 conclusions+0.006
Got+0.006
Neg logit contributions
 knowing-0.008
king-0.007
 than-0.007
 awaiting-0.007
asured-0.007
TokenS
Token ID50
Feature activation-3.29e-02

Pos logit contributions
 knowing+0.005
king+0.005
asured+0.005
 before+0.005
 where+0.005
Neg logit contributions
Found-0.003
Raw-0.003
Got-0.003
 conclusions-0.003
AAAA-0.003
Tokenasha
Token ID14715
Feature activation-1.05e-01

Pos logit contributions
 knowing+0.057
king+0.053
 where+0.053
asured+0.051
 before+0.051
Neg logit contributions
Found-0.054
 ey-0.051
orn-0.050
Got-0.050
Hey-0.050
Token smiled
Token ID13541
Feature activation-2.35e-02

Pos logit contributions
 knowing+0.188
king+0.182
 than+0.176
 for+0.171
 to+0.170
Neg logit contributions
Found-0.164
 ah-0.160
Raw-0.157
 spoiler-0.149
Got-0.149
Token and
Token ID290
Feature activation-2.06e-02

Pos logit contributions
 supposed+0.035
 awaiting+0.034
king+0.033
app+0.033
illa+0.032
Neg logit contributions
 vest-0.037
 are-0.033
Found-0.033
 spoiler-0.033
 rock-0.032
Token rushed
Token ID13999
Feature activation-1.25e-02

Pos logit contributions
king+0.040
 than+0.038
 knowing+0.038
asured+0.037
ind+0.037
Neg logit contributions
Found-0.028
 ah-0.028
Got-0.026
Raw-0.026
AAAA-0.023
Token off
Token ID572
Feature activation+2.20e-02

Pos logit contributions
king+0.009
asured+0.009
 knowing+0.008
 than+0.008
rou+0.008
Neg logit contributions
Found-0.030
Raw-0.029
 ah-0.029
 tube-0.028
 conclusions-0.028
Token to
Token ID284
Feature activation-9.65e-03

Pos logit contributions
 tube+0.043
 ah+0.043
 rocks+0.042
 rock+0.042
 pear+0.042
Neg logit contributions
king-0.025
asured-0.023
ross-0.021
illa-0.021
rou-0.021
Token very
Token ID845
Feature activation-1.86e-03

Pos logit contributions
Found+0.038
 ah+0.036
Raw+0.036
Got+0.034
 spoiler+0.034
Neg logit contributions
 knowing-0.032
king-0.031
 than-0.031
 awaiting-0.029
 before-0.029
Token compassionate
Token ID32533
Feature activation-1.81e-02

Pos logit contributions
 knowing+0.003
king+0.003
 where+0.003
 than+0.003
 awaiting+0.003
Neg logit contributions
Raw-0.003
Found-0.003
 ah-0.003
 spoiler-0.003
 tube-0.003
Token and
Token ID290
Feature activation-8.42e-03

Pos logit contributions
king+0.025
ind+0.024
 awaiting+0.023
asured+0.023
app+0.023
Neg logit contributions
 are-0.032
Found-0.031
 tube-0.030
 vest-0.029
 rock-0.028
Token kindly
Token ID26820
Feature activation-2.58e-03

Pos logit contributions
king+0.014
 than+0.013
 knowing+0.013
app+0.012
asured+0.012
Neg logit contributions
 ah-0.014
Found-0.013
Raw-0.013
 are-0.012
Got-0.012
Token convinced
Token ID9431
Feature activation-2.49e-02

Pos logit contributions
asured+0.004
king+0.004
ind+0.004
 knowing+0.004
app+0.004
Neg logit contributions
 are-0.004
Found-0.004
 ah-0.004
Got-0.003
 tube-0.003
Token Lex
Token ID17210
Feature activation-1.04e-01

Pos logit contributions
asured+0.047
king+0.045
 than+0.044
 knowing+0.043
rou+0.041
Neg logit contributions
 ah-0.036
 are-0.034
 conclusions-0.033
Found-0.033
Raw-0.031
Token to
Token ID284
Feature activation-3.45e-02

Pos logit contributions
asured+0.146
 knowing+0.142
king+0.138
 introducing+0.134
app+0.134
Neg logit contributions
 ah-0.161
Found-0.157
 pear-0.156
orn-0.155
 are-0.154
Token keep
Token ID1394
Feature activation+2.62e-02

Pos logit contributions
 knowing+0.059
king+0.057
 to+0.056
 than+0.055
 for+0.054
Neg logit contributions
Found-0.057
 ah-0.056
Raw-0.055
 spoiler-0.054
 conclusions-0.052
Token going
Token ID1016
Feature activation-1.65e-02

Pos logit contributions
 ah+0.048
 tube+0.043
Found+0.043
Raw+0.042
 conclusions+0.042
Neg logit contributions
king-0.035
 than-0.034
 knowing-0.034
asured-0.034
ess-0.033
Token so
Token ID523
Feature activation+1.14e-02

Pos logit contributions
king+0.018
 knowing+0.018
asured+0.017
 awaiting+0.016
 than+0.016
Neg logit contributions
 spoiler-0.033
 tube-0.032
 conclusions-0.032
Raw-0.031
Found-0.031
Token they
Token ID484
Feature activation-1.43e-02

Pos logit contributions
Raw+0.021
Found+0.021
 ah+0.021
 conclusions+0.021
Got+0.020
Neg logit contributions
king-0.015
 knowing-0.014
asured-0.013
 than-0.013
 awaiting-0.013
Token felt
Token ID2936
Feature activation+6.39e-03

Pos logit contributions
 than+0.001
 knowing+0.001
king+0.001
asured+0.001
 awaiting+0.001
Neg logit contributions
Found-0.001
 ah-0.001
Raw-0.001
Got-0.001
 tube-0.001
Token so
Token ID523
Feature activation+2.28e-02

Pos logit contributions
Raw+0.012
 spoiler+0.012
 ah+0.011
Found+0.011
 ey+0.011
Neg logit contributions
 knowing-0.011
 to-0.010
 when-0.010
 for-0.010
 where-0.010
Token sorry
Token ID7926
Feature activation+4.38e-02

Pos logit contributions
Found+0.036
Raw+0.035
 ah+0.035
 spoiler+0.034
 conclusions+0.033
Neg logit contributions
 knowing-0.043
 to-0.041
 for-0.040
 when-0.040
king-0.039
Token and
Token ID290
Feature activation-2.14e-02

Pos logit contributions
 are+0.096
 ah+0.095
Found+0.094
 tube+0.090
Raw+0.090
Neg logit contributions
king-0.045
asured-0.045
app-0.038
ind-0.037
 knowing-0.036
Token promised
Token ID8072
Feature activation+9.74e-03

Pos logit contributions
king+0.038
 than+0.036
 knowing+0.035
asured+0.034
app+0.033
Neg logit contributions
 ah-0.036
Found-0.035
Raw-0.032
Got-0.032
 are-0.030
Token Ashley
Token ID20899
Feature activation-1.03e-01

Pos logit contributions
Found+0.015
 ah+0.015
 are+0.015
Raw+0.015
 conclusions+0.015
Neg logit contributions
king-0.016
asured-0.015
 than-0.014
 knowing-0.014
app-0.013
Token that
Token ID326
Feature activation-1.77e-02

Pos logit contributions
king+0.090
 than+0.072
asured+0.071
ind+0.068
 awaiting+0.059
Neg logit contributions
 are-0.256
Found-0.250
 ah-0.244
hell-0.233
 tube-0.232
Token she
Token ID673
Feature activation-3.11e-02

Pos logit contributions
king+0.029
 knowing+0.027
asured+0.026
 than+0.026
app+0.025
Neg logit contributions
Found-0.031
 ah-0.030
Raw-0.030
 conclusions-0.028
Got-0.028
Token would
Token ID561
Feature activation-2.47e-02

Pos logit contributions
king+0.051
 knowing+0.051
 than+0.048
 awaiting+0.046
 to+0.046
Neg logit contributions
Found-0.052
Raw-0.050
 ah-0.049
 spoiler-0.048
Got-0.047
Token get
Token ID651
Feature activation-9.21e-04

Pos logit contributions
 knowing+0.028
 to+0.026
king+0.025
 for+0.025
 when+0.024
Neg logit contributions
Found-0.056
 ah-0.055
Raw-0.054
 spoiler-0.053
Got-0.052
Token the
Token ID262
Feature activation-4.92e-03

Pos logit contributions
 knowing+0.001
king+0.001
 than+0.001
 to+0.001
 when+0.001
Neg logit contributions
Found-0.002
 ah-0.002
Raw-0.002
 spoiler-0.002
Got-0.002
Token were
Token ID547
Feature activation+2.59e-02

Pos logit contributions
Found+0.021
Raw+0.021
 spoiler+0.020
 ah+0.020
 conclusions+0.020
Neg logit contributions
 knowing-0.017
 to-0.016
 for-0.016
 when-0.016
 than-0.016
Token the
Token ID262
Feature activation+1.37e-02

Pos logit contributions
 ah+0.046
Found+0.041
 are+0.041
bridge+0.041
 spoiler+0.040
Neg logit contributions
king-0.042
 knowing-0.037
 awaiting-0.036
 than-0.036
rying-0.035
Token best
Token ID1266
Feature activation-2.16e-02

Pos logit contributions
Found+0.022
 ah+0.021
Raw+0.021
 spoiler+0.021
Got+0.020
Neg logit contributions
 knowing-0.024
king-0.023
 to-0.023
 than-0.022
asured-0.022
Token little
Token ID1310
Feature activation-5.56e-02

Pos logit contributions
king+0.039
asured+0.035
 knowing+0.034
app+0.033
 awaiting+0.033
Neg logit contributions
 ah-0.034
 tube-0.033
Found-0.031
 are-0.030
bridge-0.030
Token mag
Token ID2153
Feature activation-5.48e-02

Pos logit contributions
 knowing+0.095
 to+0.091
 than+0.089
 for+0.089
king+0.088
Neg logit contributions
Found-0.094
Raw-0.091
 ah-0.090
 spoiler-0.088
 conclusions-0.086
Tokenicians
Token ID5106
Feature activation-1.03e-01

Pos logit contributions
 knowing+0.110
 to+0.108
 for+0.106
 where+0.105
 when+0.104
Neg logit contributions
 ah-0.076
Found-0.073
bridge-0.072
hing-0.066
Raw-0.064
Token in
Token ID287
Feature activation-1.54e-02

Pos logit contributions
asured+0.135
king+0.135
ind+0.124
ixed+0.117
 elderly+0.112
Neg logit contributions
 are-0.198
 ah-0.196
Found-0.177
bridge-0.176
hing-0.175
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
king+0.022
 knowing+0.021
 awaiting+0.019
 to+0.019
asured+0.019
Neg logit contributions
 ah-0.030
Found-0.029
Raw-0.029
 spoiler-0.028
 conclusions-0.028
Token whole
Token ID2187
Feature activation-1.07e-03

Pos logit contributions
Found+0.022
 ah+0.022
Raw+0.022
 spoiler+0.021
 conclusions+0.021
Neg logit contributions
 knowing-0.023
 to-0.022
king-0.022
 for-0.021
 than-0.021
Token town
Token ID3240
Feature activation-1.18e-02

Pos logit contributions
king+0.002
 knowing+0.002
asured+0.002
 for+0.002
 awaiting+0.002
Neg logit contributions
Found-0.002
Raw-0.001
 ah-0.001
 tube-0.001
 spoiler-0.001
Token.
Token ID13
Feature activation-3.68e-03

Pos logit contributions
king+0.015
asured+0.014
 knowing+0.014
 awaiting+0.013
ind+0.012
Neg logit contributions
 ah-0.024
 are-0.023
Found-0.021
bridge-0.021
hing-0.021
Token cooking
Token ID10801
Feature activation-3.30e-02

Pos logit contributions
 than+0.040
king+0.039
asured+0.037
 knowing+0.036
ess+0.036
Neg logit contributions
 ah-0.104
 tube-0.098
 shovel-0.095
Raw-0.095
 cloth-0.095
Token.
Token ID13
Feature activation+3.66e-03

Pos logit contributions
 knowing+0.031
king+0.030
 to+0.027
 than+0.027
asured+0.026
Neg logit contributions
Found-0.078
 ah-0.077
Raw-0.076
 spoiler-0.075
 conclusions-0.075
Token
Token ID198
Feature activation+3.30e-04

Pos logit contributions
Found+0.005
Raw+0.005
Got+0.005
 ey+0.005
 spoiler+0.005
Neg logit contributions
king-0.007
asured-0.007
 knowing-0.007
ind-0.006
 than-0.006
Token
Token ID198
Feature activation-4.68e-03

Pos logit contributions
Found+0.001
Raw+0.001
 ah+0.001
Got+0.000
 spoiler+0.000
Neg logit contributions
 knowing-0.001
king-0.001
 than-0.001
 for-0.001
 to-0.001
TokenS
Token ID50
Feature activation-3.52e-02

Pos logit contributions
king+0.008
 knowing+0.008
asured+0.008
 for+0.008
 patiently+0.007
Neg logit contributions
Found-0.008
Got-0.007
Raw-0.007
Boo-0.007
 conclusions-0.007
Tokenandra
Token ID15918
Feature activation-1.03e-01

Pos logit contributions
 knowing+0.060
king+0.057
asured+0.057
 where+0.056
 before+0.055
Neg logit contributions
Found-0.055
 ey-0.054
Got-0.051
Ble-0.051
Hey-0.051
Token's
Token ID338
Feature activation-2.65e-02

Pos logit contributions
 knowing+0.190
king+0.189
 than+0.183
 awaiting+0.173
 for+0.168
Neg logit contributions
Found-0.153
 ah-0.148
Raw-0.142
Got-0.136
 tube-0.134
Token mom
Token ID1995
Feature activation-1.99e-02

Pos logit contributions
 knowing+0.041
king+0.040
 than+0.038
 to+0.038
 when+0.037
Neg logit contributions
Found-0.048
 ah-0.047
Raw-0.046
 spoiler-0.045
 ey-0.044
Token saw
Token ID2497
Feature activation+3.21e-03

Pos logit contributions
 than+0.041
 knowing+0.040
king+0.040
asured+0.038
 awaiting+0.037
Neg logit contributions
 ah-0.026
Found-0.026
 are-0.023
Got-0.022
Raw-0.022
Token her
Token ID607
Feature activation+1.26e-03

Pos logit contributions
Found+0.006
 ah+0.006
 ey+0.006
Got+0.006
Raw+0.006
Neg logit contributions
king-0.004
 knowing-0.004
asured-0.004
 than-0.004
 awaiting-0.004
Token and
Token ID290
Feature activation+1.47e-03

Pos logit contributions
 ah+0.002
 tube+0.002
Found+0.002
 are+0.002
 ey+0.002
Neg logit contributions
king-0.002
 knowing-0.002
asured-0.002
 than-0.002
app-0.002
Token care
Token ID1337
Feature activation-6.63e-03

Pos logit contributions
 knowing+0.007
king+0.007
 to+0.007
 than+0.007
 for+0.007
Neg logit contributions
Found-0.004
 ah-0.004
Raw-0.004
 spoiler-0.004
 conclusions-0.004
Token of
Token ID286
Feature activation-1.65e-02

Pos logit contributions
king+0.011
 knowing+0.010
 awaiting+0.010
asured+0.010
app+0.010
Neg logit contributions
 ah-0.010
Found-0.010
 are-0.010
 conclusions-0.010
 ey-0.009
Token it
Token ID340
Feature activation-2.98e-03

Pos logit contributions
 knowing+0.026
king+0.026
 to+0.025
 than+0.024
 for+0.024
Neg logit contributions
Found-0.028
 ah-0.028
Raw-0.028
 spoiler-0.027
 conclusions-0.027
Token,
Token ID11
Feature activation+1.74e-02

Pos logit contributions
king+0.004
 knowing+0.004
asured+0.004
 awaiting+0.003
 than+0.003
Neg logit contributions
 ah-0.006
Found-0.005
 are-0.005
Raw-0.005
 conclusions-0.005
Token Ab
Token ID2275
Feature activation-1.90e-02

Pos logit contributions
Found+0.035
 ah+0.035
Raw+0.034
 spoiler+0.034
 conclusions+0.033
Neg logit contributions
 knowing-0.022
king-0.022
 to-0.020
 than-0.020
 for-0.020
Tokenbi
Token ID8482
Feature activation-1.02e-01

Pos logit contributions
 knowing+0.032
king+0.030
 when+0.029
 to+0.029
 before+0.028
Neg logit contributions
Found-0.030
 ey-0.030
 ah-0.029
orn-0.028
 pear-0.027
Token?"
Token ID1701
Feature activation+1.68e-02

Pos logit contributions
king+0.161
 knowing+0.148
asured+0.138
 awaiting+0.138
 than+0.133
Neg logit contributions
 ah-0.176
 are-0.173
Found-0.172
Raw-0.163
orn-0.157
Token
Token ID198
Feature activation+8.23e-03

Pos logit contributions
Found+0.031
Raw+0.030
 ah+0.030
 spoiler+0.030
Got+0.029
Neg logit contributions
 knowing-0.026
 to-0.026
 when-0.025
 for-0.024
king-0.024
Token
Token ID198
Feature activation-6.00e-04

Pos logit contributions
 spoiler+0.019
Found+0.018
 ah+0.018
Raw+0.017
 ey+0.017
Neg logit contributions
 to-0.010
 knowing-0.010
 when-0.009
 for-0.009
 where-0.009
TokenAb
Token ID4826
Feature activation-1.19e-02

Pos logit contributions
 knowing+0.001
 to+0.001
 when+0.001
king+0.001
 than+0.001
Neg logit contributions
Found-0.001
 spoiler-0.001
 ah-0.001
Raw-0.001
 ey-0.001
Tokenbi
Token ID8482
Feature activation-8.75e-02

Pos logit contributions
 knowing+0.020
king+0.018
 before+0.018
 when+0.018
 to+0.017
Neg logit contributions
Found-0.020
 ey-0.020
 ah-0.019
Got-0.018
 are-0.018
Token.
Token ID13
Feature activation+3.03e-03

Pos logit contributions
 knowing+0.011
king+0.011
asured+0.010
 awaiting+0.009
 than+0.009
Neg logit contributions
 tube-0.019
 spoiler-0.018
 vest-0.018
 ah-0.018
Found-0.018
Token He
Token ID679
Feature activation+1.99e-02

Pos logit contributions
Found+0.005
 ah+0.005
Raw+0.005
 spoiler+0.004
 ey+0.004
Neg logit contributions
 knowing-0.005
 to-0.005
 when-0.005
king-0.005
 for-0.005
Token was
Token ID373
Feature activation+1.25e-02

Pos logit contributions
Found+0.036
 ah+0.035
Raw+0.035
 spoiler+0.034
 conclusions+0.033
Neg logit contributions
 knowing-0.031
king-0.029
 to-0.029
 than-0.028
 for-0.028
Token so
Token ID523
Feature activation+2.14e-02

Pos logit contributions
 ah+0.022
Found+0.021
Raw+0.021
bridge+0.021
 spoiler+0.020
Neg logit contributions
king-0.017
 than-0.016
app-0.016
 knowing-0.016
ess-0.016
Token eng
Token ID1786
Feature activation-4.12e-02

Pos logit contributions
 ah+0.038
Found+0.037
Raw+0.036
 spoiler+0.035
Got+0.035
Neg logit contributions
king-0.033
 knowing-0.033
 to-0.030
 for-0.030
 than-0.030
Tokenross
Token ID1214
Feature activation-1.02e-01

Pos logit contributions
king+0.048
 knowing+0.047
 to+0.045
 telling+0.043
 letting+0.043
Neg logit contributions
bridge-0.078
 ey-0.072
Found-0.071
orn-0.068
hing-0.068
Tokened
Token ID276
Feature activation-5.32e-02

Pos logit contributions
 knowing+0.203
king+0.200
asured+0.196
 where+0.189
 patiently+0.187
Neg logit contributions
Found-0.129
 ah-0.116
Raw-0.112
Got-0.112
bridge-0.111
Token that
Token ID326
Feature activation-5.02e-02

Pos logit contributions
king+0.056
asured+0.054
 knowing+0.050
app+0.049
 awaiting+0.046
Neg logit contributions
 ah-0.112
Found-0.107
 tube-0.105
 are-0.104
 spoiler-0.102
Token he
Token ID339
Feature activation-3.53e-02

Pos logit contributions
 knowing+0.085
 to+0.083
 when+0.081
 for+0.080
 before+0.078
Neg logit contributions
Found-0.088
Raw-0.085
 conclusions-0.083
 ah-0.083
 spoiler-0.083
Token didn
Token ID1422
Feature activation-5.24e-03

Pos logit contributions
 knowing+0.058
king+0.057
 than+0.054
 to+0.054
 for+0.054
Neg logit contributions
Found-0.059
 ah-0.058
Raw-0.058
 spoiler-0.054
Got-0.054
Token't
Token ID470
Feature activation-1.74e-02

Pos logit contributions
 knowing+0.008
king+0.008
 to+0.008
 than+0.008
 for+0.008
Neg logit contributions
Found-0.009
Raw-0.009
 ah-0.009
 spoiler-0.009
 conclusions-0.008
Token to
Token ID284
Feature activation-1.64e-02

Pos logit contributions
king+0.028
asured+0.026
app+0.023
 knowing+0.023
 awaiting+0.021
Neg logit contributions
 are-0.083
Found-0.082
 ah-0.081
 conclusions-0.081
Raw-0.081
Token your
Token ID534
Feature activation+2.05e-03

Pos logit contributions
king+0.024
 knowing+0.024
 to+0.022
 than+0.022
asured+0.022
Neg logit contributions
 ah-0.030
Found-0.029
 conclusions-0.029
Raw-0.029
 spoiler-0.028
Token friends
Token ID2460
Feature activation+2.89e-02

Pos logit contributions
Found+0.004
Raw+0.004
 ah+0.004
 spoiler+0.003
 conclusions+0.003
Neg logit contributions
 knowing-0.003
king-0.003
 to-0.003
 than-0.003
 when-0.003
Token,
Token ID11
Feature activation+1.92e-02

Pos logit contributions
Found+0.057
 ah+0.056
Raw+0.054
 spoiler+0.054
 conclusions+0.054
Neg logit contributions
 knowing-0.038
king-0.038
asured-0.034
 than-0.033
 to-0.033
Token remember
Token ID3505
Feature activation-3.48e-02

Pos logit contributions
Found+0.037
Raw+0.036
 ah+0.036
 spoiler+0.035
 conclusions+0.034
Neg logit contributions
 knowing-0.028
 to-0.026
king-0.026
 for-0.026
 when-0.026
Token Heather
Token ID24253
Feature activation-1.02e-01

Pos logit contributions
king+0.038
 knowing+0.034
asured+0.031
app+0.030
 than+0.029
Neg logit contributions
 ah-0.077
Found-0.076
 conclusions-0.074
 are-0.074
Raw-0.074
Token's
Token ID338
Feature activation-3.79e-02

Pos logit contributions
king+0.135
 knowing+0.124
asured+0.118
 than+0.116
app+0.116
Neg logit contributions
 ah-0.197
Found-0.196
 are-0.193
hell-0.183
 conclusions-0.183
Token story
Token ID1621
Feature activation-3.43e-02

Pos logit contributions
 knowing+0.064
king+0.062
 to+0.061
 than+0.060
 for+0.060
Neg logit contributions
Found-0.063
 ah-0.061
Raw-0.061
 spoiler-0.059
 conclusions-0.058
Token.
Token ID13
Feature activation-4.13e-03

Pos logit contributions
king+0.042
 knowing+0.038
asured+0.038
rying+0.034
 awaiting+0.034
Neg logit contributions
 are-0.066
Found-0.066
hell-0.065
bridge-0.065
 ah-0.064
Token Listen
Token ID20600
Feature activation-1.30e-02

Pos logit contributions
 knowing+0.007
king+0.007
 to+0.007
 than+0.006
 for+0.006
Neg logit contributions
Found-0.007
 ah-0.007
Raw-0.007
 spoiler-0.006
Got-0.006
Token to
Token ID284
Feature activation-1.11e-02

Pos logit contributions
asured+0.015
king+0.014
 letting+0.013
app+0.012
 knowing+0.012
Neg logit contributions
 are-0.024
Raw-0.024
 ah-0.023
Found-0.023
AAAA-0.023
Token came
Token ID1625
Feature activation+1.11e-03

Pos logit contributions
 knowing+0.023
king+0.022
 than+0.022
asured+0.021
 where+0.021
Neg logit contributions
Found-0.019
 ah-0.018
Raw-0.018
 spoiler-0.017
Got-0.017
Token out
Token ID503
Feature activation-9.21e-04

Pos logit contributions
Found+0.002
 ah+0.002
Raw+0.002
 spoiler+0.002
hing+0.002
Neg logit contributions
 knowing-0.002
asured-0.002
 than-0.002
king-0.002
 awaiting-0.002
Token and
Token ID290
Feature activation+1.19e-02

Pos logit contributions
asured+0.001
king+0.001
 knowing+0.001
app+0.001
 supposed+0.001
Neg logit contributions
 ah-0.002
Found-0.002
 ey-0.002
 spoiler-0.002
 tube-0.002
Token sh
Token ID427
Feature activation+3.10e-02

Pos logit contributions
Found+0.022
 ah+0.022
Raw+0.022
 spoiler+0.021
 conclusions+0.021
Neg logit contributions
 knowing-0.018
king-0.017
 to-0.016
 than-0.016
 for-0.016
Tokenined
Token ID1389
Feature activation-2.22e-02

Pos logit contributions
Found+0.043
 ah+0.042
Raw+0.042
 spoiler+0.041
hing+0.040
Neg logit contributions
 knowing-0.060
 to-0.059
 for-0.058
 than-0.057
 before-0.056
Token brightly
Token ID35254
Feature activation+4.96e-02

Pos logit contributions
 knowing+0.033
king+0.032
asured+0.031
 than+0.030
 awaiting+0.030
Neg logit contributions
Found-0.039
 ah-0.039
Raw-0.037
 spoiler-0.037
Got-0.036
Token.
Token ID13
Feature activation+1.59e-03

Pos logit contributions
 ah+0.103
Found+0.100
 spoiler+0.098
hing+0.097
Raw+0.096
Neg logit contributions
 knowing-0.054
king-0.054
asured-0.051
 letting-0.048
 supposed-0.047
Token Susan
Token ID14919
Feature activation-4.84e-02

Pos logit contributions
Found+0.003
 ah+0.003
Raw+0.003
 spoiler+0.003
 conclusions+0.003
Neg logit contributions
 knowing-0.003
 to-0.003
 when-0.002
king-0.002
 than-0.002
Token smiled
Token ID13541
Feature activation-1.71e-02

Pos logit contributions
 knowing+0.086
king+0.085
 than+0.082
 for+0.081
 to+0.080
Neg logit contributions
Found-0.075
 ah-0.073
Raw-0.070
Got-0.067
 spoiler-0.066
Token and
Token ID290
Feature activation-1.15e-02

Pos logit contributions
 supposed+0.022
asured+0.021
king+0.021
app+0.020
 letting+0.020
Neg logit contributions
Found-0.030
 ah-0.030
 vest-0.030
 spoiler-0.030
 ey-0.029
Token knew
Token ID2993
Feature activation+1.77e-02

Pos logit contributions
king+0.020
 knowing+0.019
 than+0.019
 to+0.018
asured+0.018
Neg logit contributions
 ah-0.018
Found-0.018
Raw-0.017
Got-0.017
 spoiler-0.016
Token learn
Token ID2193
Feature activation-2.37e-02

Pos logit contributions
Found+0.076
 ey+0.073
Got+0.071
Raw+0.070
 ah+0.068
Neg logit contributions
 when-0.065
 to-0.064
 for-0.063
 as-0.060
 knowing-0.059
Token your
Token ID534
Feature activation-2.24e-02

Pos logit contributions
 to+0.026
 for+0.024
 where+0.023
 when+0.023
 knowing+0.023
Neg logit contributions
Found-0.057
 ah-0.056
Raw-0.056
 spoiler-0.055
bridge-0.054
Token lesson
Token ID11483
Feature activation-7.95e-03

Pos logit contributions
 to+0.041
 for+0.039
 knowing+0.039
 when+0.038
 than+0.037
Neg logit contributions
Found-0.037
 ah-0.036
 spoiler-0.035
Got-0.034
Raw-0.034
Token?"
Token ID1701
Feature activation+3.41e-02

Pos logit contributions
king+0.015
 knowing+0.015
 awaiting+0.013
asured+0.013
 than+0.013
Neg logit contributions
Found-0.011
 ah-0.011
Raw-0.011
 spoiler-0.010
 are-0.010
Token she
Token ID673
Feature activation+3.16e-02

Pos logit contributions
Found+0.078
hing+0.076
 spoiler+0.076
Raw+0.074
bridge+0.074
Neg logit contributions
 to-0.041
 when-0.039
 knowing-0.038
 for-0.037
 where-0.036
Token asks
Token ID7893
Feature activation+5.70e-02

Pos logit contributions
Found+0.059
 ah+0.057
 spoiler+0.056
 conclusions+0.055
bridge+0.055
Neg logit contributions
 to-0.048
 when-0.047
 for-0.046
 knowing-0.046
king-0.045
Token.
Token ID13
Feature activation+5.25e-03

Pos logit contributions
Found+0.139
Raw+0.136
 ah+0.136
 spoiler+0.134
Got+0.131
Neg logit contributions
 knowing-0.053
king-0.049
 to-0.049
 for-0.047
 when-0.047
Token
Token ID198
Feature activation+1.56e-02

Pos logit contributions
Found+0.011
 ah+0.010
 spoiler+0.010
Raw+0.010
 conclusions+0.010
Neg logit contributions
 knowing-0.007
 to-0.007
 when-0.007
 for-0.007
king-0.007
Token
Token ID198
Feature activation+8.96e-03

Pos logit contributions
 ah+0.037
 spoiler+0.037
Found+0.036
bridge+0.036
hing+0.035
Neg logit contributions
 to-0.018
 when-0.015
 knowing-0.015
 where-0.015
 for-0.014
Token"
Token ID1
Feature activation-1.43e-02

Pos logit contributions
 spoiler+0.019
 ah+0.019
Found+0.018
bridge+0.018
 ey+0.018
Neg logit contributions
 when-0.011
 to-0.011
 than-0.010
 knowing-0.010
 where-0.010
TokenYes
Token ID5297
Feature activation+2.55e-02

Pos logit contributions
 knowing+0.020
 to+0.020
 than+0.020
 when+0.019
king+0.019
Neg logit contributions
Found-0.027
 spoiler-0.027
 ah-0.027
 ey-0.026
Raw-0.026
Token shouts
Token ID34757
Feature activation+2.81e-02

Pos logit contributions
 knowing+0.028
king+0.028
 to+0.027
 than+0.026
 when+0.026
Neg logit contributions
Found-0.019
 ah-0.018
Raw-0.018
 spoiler-0.017
Got-0.017
Token.
Token ID13
Feature activation-5.93e-03

Pos logit contributions
Found+0.056
 ah+0.055
Raw+0.054
 spoiler+0.053
 conclusions+0.052
Neg logit contributions
 knowing-0.036
asured-0.035
king-0.035
 awaiting-0.032
 than-0.032
Token
Token ID198
Feature activation+4.35e-03

Pos logit contributions
 knowing+0.009
 to+0.009
 when+0.008
king+0.008
 for+0.008
Neg logit contributions
Found-0.011
 ah-0.011
 spoiler-0.011
Raw-0.011
 conclusions-0.011
Token
Token ID198
Feature activation+9.69e-04

Pos logit contributions
Found+0.008
 ah+0.008
 spoiler+0.008
Raw+0.008
 ey+0.008
Neg logit contributions
 knowing-0.006
 to-0.006
 when-0.006
 for-0.006
king-0.006
Token"
Token ID1
Feature activation-1.61e-02

Pos logit contributions
Found+0.002
 ah+0.002
 spoiler+0.002
Raw+0.002
 ey+0.002
Neg logit contributions
 knowing-0.001
 to-0.001
king-0.001
 when-0.001
 than-0.001
TokenWow
Token ID22017
Feature activation+6.52e-02

Pos logit contributions
king+0.029
 when+0.028
 than+0.027
 knowing+0.027
 letting+0.027
Neg logit contributions
 ey-0.025
 tube-0.023
 ah-0.023
 spoiler-0.022
 flashlight-0.022
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
Found+0.134
Raw+0.131
 ah+0.129
 spoiler+0.128
 conclusions+0.125
Neg logit contributions
 knowing-0.085
 to-0.082
 for-0.081
king-0.081
 when-0.079
Token Sara
Token ID24799
Feature activation-1.59e-02

Pos logit contributions
Found+0.044
Raw+0.043
 spoiler+0.042
 ey+0.041
 ah+0.041
Neg logit contributions
 to-0.030
 for-0.029
 knowing-0.029
 where-0.029
 when-0.028
Token.
Token ID13
Feature activation-8.16e-03

Pos logit contributions
king+0.028
 knowing+0.028
asured+0.027
 awaiting+0.026
 to+0.025
Neg logit contributions
 ah-0.025
Found-0.023
 are-0.023
Raw-0.022
 spoiler-0.021
Token You
Token ID921
Feature activation+3.70e-02

Pos logit contributions
 to+0.012
 knowing+0.011
 when+0.011
 where+0.011
 for+0.011
Neg logit contributions
Found-0.015
Raw-0.015
 ah-0.015
 conclusions-0.015
 spoiler-0.015
Token are
Token ID389
Feature activation+1.54e-02

Pos logit contributions
 conclusions+0.038
 ey+0.035
Raw+0.035
 ah+0.034
Found+0.034
Neg logit contributions
 when-0.086
 for-0.086
 to-0.085
 where-0.083
 than-0.082
Token "
Token ID366
Feature activation+1.75e-03

Pos logit contributions
 knowing+0.004
 to+0.004
 when+0.004
 for+0.004
king+0.004
Neg logit contributions
Found-0.004
 ah-0.004
 spoiler-0.004
 conclusions-0.004
Raw-0.004
TokenYes
Token ID5297
Feature activation+1.93e-02

Pos logit contributions
Found+0.003
Raw+0.003
 ah+0.003
Got+0.003
 spoiler+0.003
Neg logit contributions
 knowing-0.003
king-0.003
 to-0.003
 for-0.003
 where-0.003
Token,
Token ID11
Feature activation+1.68e-02

Pos logit contributions
 ah+0.036
Found+0.036
Raw+0.035
 spoiler+0.034
 conclusions+0.034
Neg logit contributions
 knowing-0.029
king-0.028
 than-0.026
 to-0.026
asured-0.026
Token it
Token ID340
Feature activation-1.46e-03

Pos logit contributions
Found+0.033
Raw+0.032
 ah+0.032
 spoiler+0.032
 conclusions+0.031
Neg logit contributions
 knowing-0.023
 to-0.022
king-0.022
 for-0.022
 when-0.021
Token was
Token ID373
Feature activation-3.85e-03

Pos logit contributions
 knowing+0.002
king+0.002
 to+0.002
 than+0.002
 for+0.002
Neg logit contributions
Found-0.003
 ah-0.002
Raw-0.002
 spoiler-0.002
 conclusions-0.002
Token,"
Token ID553
Feature activation+5.16e-02

Pos logit contributions
king+0.006
 knowing+0.006
asured+0.005
 awaiting+0.005
 than+0.005
Neg logit contributions
 ah-0.007
Found-0.006
Raw-0.006
 are-0.006
 spoiler-0.006
Token agreed
Token ID4987
Feature activation+1.72e-02

Pos logit contributions
 spoiler+0.112
Found+0.111
Raw+0.109
 conclusions+0.108
 ey+0.108
Neg logit contributions
 to-0.067
 knowing-0.065
 when-0.064
 for-0.062
 as-0.057
Token Daisy
Token ID40355
Feature activation-3.84e-02

Pos logit contributions
 ah+0.035
Found+0.032
 are+0.032
hell+0.032
 tube+0.032
Neg logit contributions
king-0.022
asured-0.021
ind-0.020
 knowing-0.019
 than-0.019
Token.
Token ID13
Feature activation-6.83e-03

Pos logit contributions
king+0.050
ind+0.045
 knowing+0.045
asured+0.044
 awaiting+0.043
Neg logit contributions
 ah-0.076
Found-0.074
 are-0.072
hell-0.069
 tube-0.068
Token And
Token ID843
Feature activation-1.42e-02

Pos logit contributions
 knowing+0.011
king+0.011
 to+0.010
 than+0.010
 for+0.010
Neg logit contributions
Found-0.012
 ah-0.012
Raw-0.012
 spoiler-0.011
 conclusions-0.011
Token they
Token ID484
Feature activation-2.40e-02

Pos logit contributions
king+0.018
asured+0.016
ind+0.016
 knowing+0.016
 than+0.016
Neg logit contributions
Found-0.029
 ah-0.029
Got-0.026
 are-0.026
Raw-0.026
Token bed
Token ID3996
Feature activation+1.87e-03

Pos logit contributions
 knowing+0.028
king+0.028
 than+0.026
 when+0.026
 to+0.026
Neg logit contributions
Found-0.022
 ah-0.021
Raw-0.021
 spoiler-0.021
 ey-0.021
Token.
Token ID13
Feature activation-7.40e-03

Pos logit contributions
 ah+0.004
bridge+0.004
 tube+0.004
 are+0.003
hell+0.003
Neg logit contributions
asured-0.002
king-0.002
 knowing-0.002
 awaiting-0.002
 than-0.002
Token
Token ID198
Feature activation-3.58e-04

Pos logit contributions
 knowing+0.012
 to+0.012
 when+0.012
king+0.012
 where+0.011
Neg logit contributions
Found-0.012
 ah-0.012
Raw-0.012
 spoiler-0.012
 conclusions-0.012
Token
Token ID198
Feature activation-5.07e-03

Pos logit contributions
 to+0.000
 knowing+0.000
 when+0.000
 where+0.000
 for+0.000
Neg logit contributions
 spoiler-0.001
 ah-0.001
Found-0.001
 ey-0.001
 conclusions-0.001
Token"
Token ID1
Feature activation-2.40e-02

Pos logit contributions
 knowing+0.007
 to+0.007
 when+0.007
king+0.007
 than+0.006
Neg logit contributions
 spoiler-0.010
 ah-0.010
Found-0.010
 ey-0.009
Raw-0.009
TokenHello
Token ID15496
Feature activation+5.79e-02

Pos logit contributions
king+0.041
 when+0.041
 knowing+0.040
 letting+0.038
 awaiting+0.038
Neg logit contributions
 spoiler-0.037
 ey-0.036
 conclusions-0.036
 tube-0.035
hell-0.034
Token,"
Token ID553
Feature activation+5.54e-02

Pos logit contributions
Found+0.109
 ah+0.107
Raw+0.106
 spoiler+0.103
Got+0.101
Neg logit contributions
 knowing-0.085
king-0.082
 to-0.079
 than-0.078
 for-0.077
Token said
Token ID531
Feature activation+2.32e-02

Pos logit contributions
Found+0.118
 spoiler+0.114
Raw+0.114
 conclusions+0.112
 ey+0.111
Neg logit contributions
 to-0.072
 knowing-0.069
 when-0.069
 for-0.068
 before-0.064
Token Emily
Token ID17608
Feature activation-3.06e-02

Pos logit contributions
Found+0.047
Raw+0.046
 ah+0.045
 spoiler+0.045
 conclusions+0.044
Neg logit contributions
 knowing-0.031
 to-0.029
king-0.029
 for-0.028
 when-0.028
Token.
Token ID13
Feature activation-3.99e-03

Pos logit contributions
king+0.038
 than+0.037
 knowing+0.037
ind+0.036
asured+0.036
Neg logit contributions
 ah-0.057
Found-0.053
 spoiler-0.052
 vest-0.051
 are-0.050
Token "
Token ID366
Feature activation+6.89e-04

Pos logit contributions
 knowing+0.006
 to+0.005
king+0.005
 when+0.005
 for+0.005
Neg logit contributions
Found-0.008
 ah-0.008
Raw-0.007
 spoiler-0.007
 conclusions-0.007
Token "
Token ID366
Feature activation+1.00e-02

Pos logit contributions
 to+0.003
 knowing+0.003
 when+0.003
 for+0.003
 where+0.003
Neg logit contributions
Found-0.004
 ah-0.003
 conclusions-0.003
 spoiler-0.003
Raw-0.003
TokenIt
Token ID1026
Feature activation+2.95e-02

Pos logit contributions
 ah+0.018
 ey+0.017
 tube+0.017
Found+0.017
 spoiler+0.017
Neg logit contributions
 than-0.016
 knowing-0.015
 when-0.015
 to-0.014
king-0.014
Token is
Token ID318
Feature activation+1.36e-03

Pos logit contributions
Found+0.046
 ah+0.044
Raw+0.044
 spoiler+0.043
 conclusions+0.042
Neg logit contributions
 knowing-0.053
king-0.052
 to-0.051
 than-0.050
 for-0.050
Token moving
Token ID3867
Feature activation+6.34e-03

Pos logit contributions
 ah+0.002
Found+0.002
Raw+0.002
 spoiler+0.002
 ey+0.002
Neg logit contributions
king-0.002
 knowing-0.002
 awaiting-0.002
asured-0.002
 than-0.002
Token!"
Token ID2474
Feature activation+4.17e-02

Pos logit contributions
Found+0.014
 ah+0.014
Raw+0.014
 spoiler+0.014
 conclusions+0.014
Neg logit contributions
 knowing-0.007
king-0.006
 awaiting-0.006
 to-0.006
asured-0.006
Token
Token ID198
Feature activation+1.84e-02

Pos logit contributions
Found+0.079
 ah+0.077
Raw+0.076
 spoiler+0.076
 conclusions+0.074
Neg logit contributions
 knowing-0.062
 to-0.061
 when-0.059
 for-0.058
 than-0.058
Token
Token ID198
Feature activation+9.99e-03

Pos logit contributions
Found+0.037
 ah+0.037
 spoiler+0.036
Raw+0.036
 ey+0.035
Neg logit contributions
 to-0.025
 knowing-0.024
 for-0.024
 when-0.024
 where-0.023
Token"
Token ID1
Feature activation-1.30e-02

Pos logit contributions
Found+0.018
 ah+0.018
Raw+0.018
 spoiler+0.017
Got+0.017
Neg logit contributions
 knowing-0.015
king-0.015
 to-0.015
 than-0.014
 for-0.014
TokenWow
Token ID22017
Feature activation+6.47e-02

Pos logit contributions
 than+0.021
 knowing+0.021
king+0.020
 when+0.020
 letting+0.019
Neg logit contributions
 ey-0.021
 conclusions-0.020
 spoiler-0.020
 tube-0.020
 ah-0.020
Token,
Token ID11
Feature activation+2.03e-02

Pos logit contributions
Found+0.140
Raw+0.138
 conclusions+0.133
 spoiler+0.133
 ah+0.132
Neg logit contributions
 for-0.078
 to-0.078
 knowing-0.077
 when-0.075
king-0.074
Token Tom
Token ID4186
Feature activation-2.01e-02

Pos logit contributions
Found+0.045
Raw+0.045
 spoiler+0.043
 conclusions+0.043
 ey+0.042
Neg logit contributions
 to-0.024
 where-0.024
 for-0.024
 when-0.023
 knowing-0.022
Token She
Token ID1375
Feature activation+9.70e-03

Pos logit contributions
 knowing+0.005
king+0.005
 than+0.004
asured+0.004
 to+0.004
Neg logit contributions
Found-0.004
Raw-0.003
 ah-0.003
 spoiler-0.003
Got-0.003
Token used
Token ID973
Feature activation-1.93e-02

Pos logit contributions
Found+0.017
 ah+0.017
Raw+0.017
 spoiler+0.016
Got+0.016
Neg logit contributions
 knowing-0.015
king-0.015
 than-0.014
 to-0.014
 awaiting-0.014
Token a
Token ID257
Feature activation-6.71e-03

Pos logit contributions
 to+0.027
 for+0.026
 knowing+0.026
 when+0.025
 where+0.024
Neg logit contributions
Found-0.040
Raw-0.039
 ah-0.038
 spoiler-0.038
 conclusions-0.037
Token small
Token ID1402
Feature activation+9.99e-03

Pos logit contributions
 to+0.011
 for+0.011
 knowing+0.010
 than+0.010
 when+0.010
Neg logit contributions
Found-0.012
 conclusions-0.012
Raw-0.012
 ah-0.012
 spoiler-0.012
Token knife
Token ID9845
Feature activation-2.29e-05

Pos logit contributions
Found+0.018
Raw+0.018
hing+0.018
 conclusions+0.018
Got+0.017
Neg logit contributions
 to-0.017
 for-0.017
 than-0.015
 knowing-0.015
 when-0.015
Token to
Token ID284
Feature activation+3.14e-03

Pos logit contributions
king+0.000
 knowing+0.000
asured+0.000
ind+0.000
 awaiting+0.000
Neg logit contributions
 are-0.000
 tube-0.000
Found-0.000
 ah-0.000
 shovel-0.000
Token cut
Token ID2005
Feature activation+1.58e-02

Pos logit contributions
Found+0.005
Raw+0.005
 ah+0.005
 spoiler+0.005
Got+0.005
Neg logit contributions
 knowing-0.006
 to-0.005
king-0.005
 for-0.005
 when-0.005
Token it
Token ID340
Feature activation+2.91e-02

Pos logit contributions
Found+0.035
Raw+0.034
 ah+0.033
 spoiler+0.032
 conclusions+0.032
Neg logit contributions
 to-0.020
 for-0.020
 when-0.019
 knowing-0.019
 before-0.018
Token into
Token ID656
Feature activation+1.93e-02

Pos logit contributions
 are+0.059
 tube+0.058
 pear+0.057
 ah+0.057
hell+0.057
Neg logit contributions
 knowing-0.033
king-0.032
 awaiting-0.029
asured-0.029
ind-0.027
Token pieces
Token ID5207
Feature activation+3.39e-02

Pos logit contributions
Found+0.036
Raw+0.035
 spoiler+0.033
 ah+0.033
Got+0.033
Neg logit contributions
 knowing-0.030
 to-0.030
 for-0.029
 when-0.029
 than-0.028
Token.
Token ID13
Feature activation+5.21e-03

Pos logit contributions
 ah+0.068
 are+0.068
hell+0.067
 tube+0.066
Found+0.066
Neg logit contributions
king-0.040
 knowing-0.037
 awaiting-0.036
asured-0.036
ind-0.033
Token water
Token ID1660
Feature activation-1.49e-02

Pos logit contributions
king+0.071
 knowing+0.069
 than+0.066
 awaiting+0.066
asured+0.063
Neg logit contributions
Found-0.092
 ah-0.091
Raw-0.090
 conclusions-0.088
 ey-0.088
Token or
Token ID393
Feature activation+1.15e-02

Pos logit contributions
 knowing+0.018
king+0.017
 to+0.016
 than+0.016
 for+0.016
Neg logit contributions
Found-0.032
 ah-0.031
Raw-0.031
 spoiler-0.030
 conclusions-0.030
Token shampoo
Token ID47259
Feature activation-2.19e-02

Pos logit contributions
Found+0.019
Raw+0.018
 spoiler+0.018
bridge+0.017
 ah+0.017
Neg logit contributions
 to-0.022
 for-0.021
 when-0.021
 knowing-0.021
 where-0.020
Token.
Token ID13
Feature activation-9.06e-03

Pos logit contributions
 knowing+0.025
king+0.023
asured+0.023
 awaiting+0.022
app+0.021
Neg logit contributions
 ah-0.046
 are-0.046
Found-0.046
 tube-0.044
 spoiler-0.044
Token She
Token ID1375
Feature activation+1.37e-02

Pos logit contributions
 to+0.012
 when+0.012
 knowing+0.011
 where+0.011
 because+0.011
Neg logit contributions
 ah-0.020
Found-0.019
 spoiler-0.019
bridge-0.018
hing-0.018
Token did
Token ID750
Feature activation+8.89e-03

Pos logit contributions
Found+0.024
 ah+0.023
Raw+0.023
 spoiler+0.023
 conclusions+0.022
Neg logit contributions
 knowing-0.022
 to-0.021
king-0.021
 when-0.021
 than-0.021
Token not
Token ID407
Feature activation-3.42e-02

Pos logit contributions
 conclusions+0.020
Raw+0.020
Found+0.020
 are+0.020
 ah+0.020
Neg logit contributions
king-0.009
 knowing-0.008
 awaiting-0.008
asured-0.007
 than-0.007
Token like
Token ID588
Feature activation-4.83e-02

Pos logit contributions
 knowing+0.036
 to+0.036
 for+0.035
 than+0.035
 when+0.034
Neg logit contributions
 ah-0.082
Found-0.081
 spoiler-0.080
 ey-0.080
Raw-0.078
Token to
Token ID284
Feature activation-1.93e-02

Pos logit contributions
king+0.072
 knowing+0.070
 awaiting+0.067
 than+0.066
asured+0.064
Neg logit contributions
 ah-0.087
Found-0.087
Raw-0.085
 conclusions-0.084
 ey-0.083
Token be
Token ID307
Feature activation-6.10e-03

Pos logit contributions
 knowing+0.031
 to+0.031
 when+0.031
 for+0.030
 than+0.030
Neg logit contributions
Found-0.034
Raw-0.032
 ah-0.032
 spoiler-0.031
Got-0.031
Token disturbed
Token ID24069
Feature activation-2.38e-02

Pos logit contributions
 knowing+0.010
king+0.010
 awaiting+0.010
 than+0.009
app+0.009
Neg logit contributions
 ah-0.010
Found-0.010
Raw-0.009
Got-0.009
 conclusions-0.009
Token the
Token ID262
Feature activation-9.47e-03

Pos logit contributions
Found+0.009
Raw+0.009
 ah+0.009
Got+0.009
 spoiler+0.009
Neg logit contributions
 to-0.005
 when-0.004
 where-0.004
 before-0.004
 knowing-0.004
Token tree
Token ID5509
Feature activation-1.41e-02

Pos logit contributions
 to+0.021
 for+0.020
 knowing+0.020
 where+0.019
 when+0.019
Neg logit contributions
 ah-0.013
Found-0.012
 spoiler-0.012
Raw-0.011
bridge-0.011
Token,
Token ID11
Feature activation+1.74e-02

Pos logit contributions
 knowing+0.024
king+0.023
asured+0.022
 than+0.022
 awaiting+0.021
Neg logit contributions
Found-0.023
 ah-0.023
Raw-0.022
 tube-0.022
 conclusions-0.021
Token he
Token ID339
Feature activation-5.80e-03

Pos logit contributions
 ey+0.040
Found+0.039
 spoiler+0.038
 conclusions+0.038
Raw+0.038
Neg logit contributions
 when-0.024
 to-0.023
 where-0.022
 before-0.021
 for-0.021
Token found
Token ID1043
Feature activation+4.04e-03

Pos logit contributions
 knowing+0.011
king+0.011
 than+0.011
 awaiting+0.010
 for+0.010
Neg logit contributions
Found-0.008
Raw-0.008
 ah-0.008
Got-0.008
 spoiler-0.007
Token a
Token ID257
Feature activation+2.62e-03

Pos logit contributions
Raw+0.009
Found+0.009
 ah+0.009
 spoiler+0.009
 ey+0.008
Neg logit contributions
 to-0.005
 knowing-0.005
 for-0.005
 when-0.005
 where-0.005
Token small
Token ID1402
Feature activation+1.31e-02

Pos logit contributions
Found+0.005
 ah+0.005
Raw+0.004
 spoiler+0.004
 conclusions+0.004
Neg logit contributions
 knowing-0.004
 to-0.004
 for-0.004
 when-0.004
king-0.004
Token box
Token ID3091
Feature activation+2.72e-02

Pos logit contributions
Found+0.020
 ah+0.019
Raw+0.019
hing+0.019
Got+0.018
Neg logit contributions
 to-0.026
 for-0.025
 knowing-0.024
 than-0.024
 when-0.024
Token.
Token ID13
Feature activation-1.97e-03

Pos logit contributions
bridge+0.046
 tube+0.046
 rock+0.045
 vest+0.044
Found+0.044
Neg logit contributions
cess-0.040
king-0.039
elia-0.039
ind-0.038
asured-0.038
Token He
Token ID679
Feature activation+1.50e-02

Pos logit contributions
 knowing+0.004
 to+0.004
 for+0.004
king+0.004
 when+0.004
Neg logit contributions
Found-0.003
 ah-0.003
Raw-0.003
 spoiler-0.002
 conclusions-0.002
Token opened
Token ID4721
Feature activation-3.13e-02

Pos logit contributions
Found+0.025
Raw+0.024
 ah+0.023
Got+0.022
 spoiler+0.022
Neg logit contributions
king-0.026
 knowing-0.026
 than-0.024
 for-0.024
 awaiting-0.023
Tokenhe
Token ID258
Feature activation+3.37e-02

Pos logit contributions
Found+0.014
 are+0.014
 spoiler+0.014
 ah+0.014
 tube+0.014
Neg logit contributions
 knowing-0.018
king-0.018
asured-0.018
 awaiting-0.017
 than-0.016
Tokenlf
Token ID1652
Feature activation+5.01e-03

Pos logit contributions
Found+0.041
 ah+0.040
Raw+0.040
bridge+0.038
 ey+0.037
Neg logit contributions
king-0.072
 knowing-0.071
 than-0.066
 to-0.065
asured-0.065
Token.
Token ID13
Feature activation+4.95e-03

Pos logit contributions
 tube+0.009
bridge+0.009
 are+0.009
 spoiler+0.008
 ah+0.008
Neg logit contributions
king-0.007
 knowing-0.007
asured-0.007
ind-0.007
 awaiting-0.007
Token Mom
Token ID11254
Feature activation-1.89e-02

Pos logit contributions
Found+0.009
 ah+0.009
Raw+0.009
 spoiler+0.008
 conclusions+0.008
Neg logit contributions
 knowing-0.008
king-0.007
 to-0.007
 than-0.007
 for-0.007
Tokenmy
Token ID1820
Feature activation+5.03e-03

Pos logit contributions
 knowing+0.035
king+0.034
 than+0.033
 awaiting+0.032
asured+0.031
Neg logit contributions
Found-0.029
 ah-0.028
Raw-0.027
Got-0.025
hing-0.025
Token was
Token ID373
Feature activation+1.67e-02

Pos logit contributions
Found+0.010
 ah+0.010
Raw+0.010
Got+0.009
 spoiler+0.009
Neg logit contributions
 knowing-0.007
king-0.007
 than-0.006
 to-0.006
 for-0.006
Token so
Token ID523
Feature activation+1.82e-02

Pos logit contributions
 ah+0.025
Found+0.024
Raw+0.023
 are+0.023
 fog+0.023
Neg logit contributions
 than-0.027
 knowing-0.027
 since-0.026
king-0.026
ess-0.025
Token scared
Token ID12008
Feature activation+7.36e-03

Pos logit contributions
 ah+0.028
Found+0.027
Raw+0.027
 are+0.026
Got+0.025
Neg logit contributions
 knowing-0.031
king-0.031
 than-0.030
 before-0.029
asured-0.029
Token that
Token ID326
Feature activation-1.30e-02

Pos logit contributions
 ah+0.014
Found+0.014
Hey+0.013
Raw+0.013
 are+0.013
Neg logit contributions
asured-0.010
king-0.010
 awaiting-0.009
 knowing-0.008
app-0.008
Token the
Token ID262
Feature activation-1.46e-02

Pos logit contributions
king+0.023
asured+0.021
 knowing+0.021
 awaiting+0.020
 than+0.020
Neg logit contributions
 ah-0.021
Found-0.021
Raw-0.020
 conclusions-0.019
 are-0.019
Token books
Token ID3835
Feature activation-1.58e-02

Pos logit contributions
 knowing+0.027
 to+0.026
 for+0.026
 when+0.026
king+0.026
Neg logit contributions
Found-0.022
 ah-0.022
Raw-0.021
 spoiler-0.021
 conclusions-0.020
Token!
Token ID0
Feature activation-1.55e-03

Pos logit contributions
Found+0.019
 ah+0.019
Raw+0.018
 spoiler+0.018
 conclusions+0.018
Neg logit contributions
 knowing-0.008
king-0.008
 than-0.008
asured-0.007
 to-0.007
Token "
Token ID366
Feature activation+3.54e-03

Pos logit contributions
 knowing+0.003
 to+0.002
king+0.002
 for+0.002
 when+0.002
Neg logit contributions
Found-0.003
 ah-0.003
Raw-0.003
 spoiler-0.003
 conclusions-0.002
TokenOh
Token ID5812
Feature activation+3.76e-02

Pos logit contributions
 spoiler+0.005
 ey+0.005
 ah+0.005
 tube+0.005
bridge+0.005
Neg logit contributions
king-0.006
 knowing-0.006
 when-0.006
 than-0.006
 to-0.006
Token wow
Token ID28796
Feature activation-1.39e-02

Pos logit contributions
Found+0.076
Raw+0.074
 spoiler+0.072
 ah+0.072
 conclusions+0.072
Neg logit contributions
 knowing-0.051
 to-0.049
 for-0.048
 when-0.047
king-0.047
Token,"
Token ID553
Feature activation+5.95e-02

Pos logit contributions
 knowing+0.020
king+0.019
asured+0.018
 than+0.017
 to+0.017
Neg logit contributions
 ah-0.027
Found-0.026
Raw-0.026
 spoiler-0.025
Got-0.024
Token Sarah
Token ID10490
Feature activation-1.37e-02

Pos logit contributions
Found+0.122
Raw+0.120
 conclusions+0.118
 spoiler+0.117
 ah+0.117
Neg logit contributions
 to-0.079
 knowing-0.078
 when-0.076
 for-0.075
 than-0.072
Token exclaimed
Token ID33552
Feature activation+3.35e-02

Pos logit contributions
 knowing+0.024
king+0.024
 than+0.023
 for+0.022
 awaiting+0.022
Neg logit contributions
 ah-0.021
Found-0.021
Raw-0.021
 spoiler-0.020
Got-0.019
Token.
Token ID13
Feature activation-1.03e-03

Pos logit contributions
 ah+0.076
Found+0.073
Raw+0.072
 spoiler+0.072
 conclusions+0.070
Neg logit contributions
king-0.035
 knowing-0.035
asured-0.033
 than-0.032
 awaiting-0.030
Token She
Token ID1375
Feature activation+9.20e-03

Pos logit contributions
 knowing+0.001
king+0.001
 than+0.001
asured+0.001
 for+0.001
Neg logit contributions
Found-0.002
 ah-0.002
Raw-0.002
 spoiler-0.002
Got-0.002
Token wanted
Token ID2227
Feature activation+3.99e-02

Pos logit contributions
Found+0.016
 ah+0.015
Raw+0.015
 spoiler+0.015
Got+0.014
Neg logit contributions
 knowing-0.015
king-0.015
 to-0.014
 than-0.014
 for-0.014
Token to
Token ID284
Feature activation-2.19e-02

Pos logit contributions
Found+0.120
Raw+0.118
 ah+0.118
 conclusions+0.115
Got+0.114
Neg logit contributions
king-0.014
 knowing-0.012
 than-0.009
asured-0.008
 awaiting-0.007
Token it
Token ID340
Feature activation+9.46e-03

Pos logit contributions
 knowing+0.021
 to+0.020
king+0.020
 when+0.020
 for+0.020
Neg logit contributions
Found-0.038
 ah-0.037
Raw-0.037
 spoiler-0.036
 ey-0.035
Token more
Token ID517
Feature activation+6.38e-03

Pos logit contributions
Found+0.020
Raw+0.020
 ah+0.019
 spoiler+0.019
 tube+0.019
Neg logit contributions
 knowing-0.011
king-0.011
asured-0.010
 than-0.009
 awaiting-0.009
Token and
Token ID290
Feature activation-4.49e-03

Pos logit contributions
hell+0.011
 conclusions+0.011
 tube+0.011
 rock+0.011
 ah+0.011
Neg logit contributions
king-0.009
 knowing-0.008
asured-0.008
 awaiting-0.008
illa-0.007
Token more
Token ID517
Feature activation-8.31e-04

Pos logit contributions
 knowing+0.007
 to+0.006
king+0.006
 when+0.006
 for+0.006
Neg logit contributions
Found-0.008
Raw-0.008
 ah-0.008
 spoiler-0.008
 ey-0.008
Token each
Token ID1123
Feature activation+2.93e-02

Pos logit contributions
king+0.001
 knowing+0.001
 where+0.001
 to+0.001
 for+0.001
Neg logit contributions
Found-0.002
 ah-0.002
Raw-0.001
 conclusions-0.001
 spoiler-0.001
Token time
Token ID640
Feature activation-4.88e-02

Pos logit contributions
Found+0.051
 ah+0.050
Raw+0.049
 spoiler+0.048
 conclusions+0.047
Neg logit contributions
 knowing-0.048
king-0.046
 to-0.045
 than-0.044
 for-0.044
Token.
Token ID13
Feature activation-4.05e-03

Pos logit contributions
asured+0.070
king+0.058
 knowing+0.057
ind+0.056
 tradition+0.055
Neg logit contributions
 ah-0.090
Found-0.089
 are-0.085
Got-0.082
 tube-0.082
Token Soon
Token ID15894
Feature activation-2.12e-02

Pos logit contributions
 knowing+0.007
king+0.006
 to+0.006
 than+0.006
 for+0.006
Neg logit contributions
Found-0.007
 ah-0.007
Raw-0.007
 spoiler-0.007
 conclusions-0.006
Token,
Token ID11
Feature activation-1.94e-03

Pos logit contributions
 knowing+0.032
 to+0.030
 when+0.030
 for+0.030
king+0.029
Neg logit contributions
Found-0.039
Raw-0.039
 ah-0.039
 spoiler-0.038
 conclusions-0.037
Token Beat
Token ID12568
Feature activation-4.25e-02

Pos logit contributions
 knowing+0.003
 to+0.003
 when+0.003
 for+0.003
 before+0.002
Neg logit contributions
Raw-0.004
Found-0.004
 ah-0.004
 spoiler-0.004
 ey-0.004
Tokenrice
Token ID20970
Feature activation-4.41e-02

Pos logit contributions
asured+0.047
 preparing+0.045
 knowing+0.043
 for+0.042
 patiently+0.042
Neg logit contributions
Found-0.086
Boo-0.085
Got-0.079
Raw-0.078
 ah-0.078
Token its
Token ID663
Feature activation+3.67e-02

Pos logit contributions
Found+0.092
 ah+0.090
Raw+0.088
 spoiler+0.085
bridge+0.084
Neg logit contributions
 knowing-0.071
 to-0.070
 when-0.067
 for-0.066
 than-0.065
Token wing
Token ID8539
Feature activation+3.52e-02

Pos logit contributions
Found+0.052
Raw+0.051
 spoiler+0.050
 ah+0.050
 ey+0.049
Neg logit contributions
king-0.068
 knowing-0.067
 awaiting-0.062
 than-0.062
 to-0.062
Token and
Token ID290
Feature activation+9.38e-03

Pos logit contributions
hell+0.069
 tube+0.069
Found+0.069
 are+0.068
 vest+0.067
Neg logit contributions
king-0.040
 awaiting-0.040
 knowing-0.039
asured-0.037
 patiently-0.036
Token couldn
Token ID3521
Feature activation+2.85e-02

Pos logit contributions
Found+0.018
 ah+0.017
Raw+0.017
 spoiler+0.017
Got+0.016
Neg logit contributions
 knowing-0.014
king-0.013
 to-0.013
 than-0.013
 for-0.013
Token't
Token ID470
Feature activation-1.01e-02

Pos logit contributions
Found+0.061
 ah+0.060
 spoiler+0.058
 ey+0.058
Raw+0.057
Neg logit contributions
 to-0.036
 when-0.033
 knowing-0.033
 for-0.032
 before-0.031
Token fly
Token ID6129
Feature activation-5.45e-02

Pos logit contributions
 to+0.016
 when+0.016
 knowing+0.015
 as+0.015
 before+0.015
Neg logit contributions
 ah-0.020
Found-0.020
 ey-0.019
 spoiler-0.019
Got-0.019
Token.
Token ID13
Feature activation-5.50e-03

Pos logit contributions
 knowing+0.073
king+0.071
 awaiting+0.068
asured+0.066
 patiently+0.063
Neg logit contributions
Found-0.105
Raw-0.103
 conclusions-0.101
 ah-0.100
 spoiler-0.100
Token Nut
Token ID11959
Feature activation-1.15e-02

Pos logit contributions
 knowing+0.012
king+0.011
 to+0.011
 for+0.011
 when+0.011
Neg logit contributions
Found-0.007
 ah-0.007
Raw-0.007
 spoiler-0.006
 conclusions-0.006
Tokenty
Token ID774
Feature activation-1.13e-02

Pos logit contributions
 knowing+0.016
king+0.015
 to+0.015
 than+0.015
 awaiting+0.015
Neg logit contributions
Found-0.022
Raw-0.021
 ah-0.020
Got-0.020
 spoiler-0.020
Token felt
Token ID2936
Feature activation+7.29e-03

Pos logit contributions
 knowing+0.020
king+0.019
 to+0.019
 than+0.018
 for+0.018
Neg logit contributions
Found-0.018
 ah-0.017
Raw-0.017
 spoiler-0.017
Got-0.016
Token sad
Token ID6507
Feature activation+1.87e-02

Pos logit contributions
 ah+0.015
Found+0.015
 spoiler+0.015
Raw+0.015
bridge+0.014
Neg logit contributions
 knowing-0.011
 when-0.011
 to-0.010
 for-0.010
 as-0.010
Token wanted
Token ID2227
Feature activation+3.15e-02

Pos logit contributions
Found+0.007
Raw+0.007
 ah+0.007
Got+0.007
 spoiler+0.006
Neg logit contributions
 knowing-0.008
king-0.008
 than-0.007
 for-0.007
 before-0.007
Token someone
Token ID2130
Feature activation-7.84e-03

Pos logit contributions
Found+0.087
 ah+0.086
Raw+0.085
 conclusions+0.083
Got+0.083
Neg logit contributions
king-0.017
 knowing-0.015
 than-0.013
asured-0.012
 awaiting-0.011
Token who
Token ID508
Feature activation+4.58e-03

Pos logit contributions
king+0.006
 knowing+0.004
asured+0.004
ind+0.004
rou+0.004
Neg logit contributions
 are-0.020
Found-0.020
 pear-0.019
 vest-0.019
hell-0.018
Token was
Token ID373
Feature activation+4.81e-03

Pos logit contributions
Found+0.007
 ah+0.006
Raw+0.006
 are+0.006
Got+0.006
Neg logit contributions
king-0.009
 knowing-0.009
asured-0.008
 than-0.008
 to-0.008
Token kind
Token ID1611
Feature activation+1.98e-03

Pos logit contributions
 ah+0.008
Found+0.008
 are+0.008
Raw+0.008
 pear+0.008
Neg logit contributions
king-0.007
app-0.006
 knowing-0.006
asured-0.006
 awaiting-0.006
Token and
Token ID290
Feature activation-9.09e-03

Pos logit contributions
 are+0.004
Found+0.003
 ah+0.003
hell+0.003
 vest+0.003
Neg logit contributions
king-0.003
asured-0.003
ind-0.002
app-0.002
 patiently-0.002
Token helpful
Token ID7613
Feature activation-5.87e-03

Pos logit contributions
king+0.014
 knowing+0.013
 than+0.012
asured+0.012
 to+0.012
Neg logit contributions
Found-0.016
 ah-0.016
Raw-0.016
Got-0.015
 spoiler-0.015
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
king+0.007
asured+0.006
 knowing+0.006
app+0.006
 patiently+0.005
Neg logit contributions
 are-0.012
Found-0.012
 ah-0.012
 conclusions-0.012
 spoiler-0.012
Token When
Token ID1649
Feature activation-3.40e-02

Pos logit contributions
 knowing+0.018
king+0.018
 to+0.017
 than+0.017
 for+0.017
Neg logit contributions
Found-0.017
 ah-0.017
Raw-0.016
 spoiler-0.016
 conclusions-0.016
Token she
Token ID673
Feature activation-1.74e-02

Pos logit contributions
king+0.053
 knowing+0.051
asured+0.050
 than+0.050
 awaiting+0.046
Neg logit contributions
 ah-0.061
Found-0.061
Raw-0.059
 spoiler-0.056
 conclusions-0.056
Token found
Token ID1043
Feature activation+9.90e-03

Pos logit contributions
 knowing+0.035
king+0.034
 than+0.033
 for+0.032
asured+0.031
Neg logit contributions
Found-0.025
 ah-0.024
Raw-0.023
Got-0.022
 ey-0.021
Token't
Token ID470
Feature activation-9.44e-03

Pos logit contributions
Found+0.061
 ah+0.061
 spoiler+0.059
Raw+0.059
 conclusions+0.058
Neg logit contributions
 to-0.040
 knowing-0.039
 for-0.038
 when-0.038
 before-0.037
Token because
Token ID780
Feature activation-3.76e-02

Pos logit contributions
 knowing+0.014
king+0.014
 than+0.013
 to+0.013
 for+0.013
Neg logit contributions
Found-0.017
Raw-0.016
 conclusions-0.016
 ah-0.016
 spoiler-0.016
Token the
Token ID262
Feature activation-9.40e-03

Pos logit contributions
 knowing+0.059
king+0.058
 to+0.055
 than+0.054
 for+0.054
Neg logit contributions
Found-0.067
 ah-0.065
Raw-0.065
 spoiler-0.063
 conclusions-0.062
Token spider
Token ID19230
Feature activation-2.92e-02

Pos logit contributions
 knowing+0.022
king+0.021
 to+0.021
 than+0.021
 for+0.021
Neg logit contributions
Found-0.010
 ah-0.009
Raw-0.009
 spoiler-0.009
 conclusions-0.008
Token couldn
Token ID3521
Feature activation+2.79e-02

Pos logit contributions
 knowing+0.052
king+0.051
 than+0.047
 to+0.047
 awaiting+0.047
Neg logit contributions
Found-0.046
 ah-0.044
Raw-0.043
 spoiler-0.041
Got-0.041
Token't
Token ID470
Feature activation-9.65e-03

Pos logit contributions
Found+0.055
 ah+0.054
Raw+0.052
 spoiler+0.052
 ey+0.051
Neg logit contributions
 to-0.038
 knowing-0.038
 for-0.036
 when-0.036
king-0.036
Token talk
Token ID1561
Feature activation-4.28e-02

Pos logit contributions
 knowing+0.014
king+0.014
 to+0.013
 than+0.013
 for+0.012
Neg logit contributions
Found-0.018
Raw-0.018
 ah-0.018
 spoiler-0.017
 conclusions-0.017
Token.
Token ID13
Feature activation-7.14e-03

Pos logit contributions
king+0.044
asured+0.042
 knowing+0.042
 awaiting+0.040
 patiently+0.039
Neg logit contributions
Found-0.093
 are-0.093
hell-0.089
 conclusions-0.089
Raw-0.089
Token\n
Token ID198
Feature activation-4.64e-03

Pos logit contributions
 knowing+0.013
king+0.012
 to+0.012
 than+0.012
 for+0.012
Neg logit contributions
Found-0.011
 ah-0.011
Raw-0.011
 spoiler-0.011
Got-0.010
Token\n
Token ID198
Feature activation-8.35e-03

Pos logit contributions
 knowing+0.007
 to+0.007
king+0.007
 when+0.006
 for+0.006
Neg logit contributions
Found-0.009
 ah-0.008
Raw-0.008
 spoiler-0.008
 conclusions-0.008
TokenThen
Token ID6423
Feature activation-1.85e-02

Pos logit contributions
 knowing+0.013
king+0.013
 to+0.013
 when+0.012
 than+0.012
Neg logit contributions
Found-0.015
 ah-0.015
Raw-0.014
 spoiler-0.014
 conclusions-0.014
Token and
Token ID290
Feature activation+1.23e-03

Pos logit contributions
 ah+0.053
Found+0.050
Got+0.046
 shovel+0.046
Raw+0.046
Neg logit contributions
asured-0.037
ind-0.036
 than-0.036
king-0.035
app-0.034
Token they
Token ID484
Feature activation-2.78e-03

Pos logit contributions
 ah+0.002
Found+0.002
Got+0.002
Raw+0.002
 are+0.002
Neg logit contributions
king-0.002
 than-0.002
 knowing-0.002
asured-0.002
app-0.002
Token continued
Token ID3767
Feature activation-3.22e-02

Pos logit contributions
 knowing+0.004
king+0.004
 to+0.004
 when+0.004
 for+0.004
Neg logit contributions
Found-0.005
Raw-0.005
 ah-0.005
 spoiler-0.005
 conclusions-0.005
Token walking
Token ID6155
Feature activation-2.37e-02

Pos logit contributions
asured+0.042
king+0.042
 than+0.040
iring+0.040
cess+0.039
Neg logit contributions
 ah-0.057
 tube-0.056
Raw-0.055
 are-0.054
Found-0.054
Token.
Token ID13
Feature activation-5.16e-03

Pos logit contributions
 knowing+0.026
king+0.026
asured+0.024
 than+0.023
 awaiting+0.023
Neg logit contributions
Found-0.051
 ah-0.050
Raw-0.049
 conclusions-0.049
 spoiler-0.048
Token Along
Token ID17159
Feature activation-7.77e-02

Pos logit contributions
 knowing+0.009
king+0.009
 than+0.008
asured+0.008
 awaiting+0.008
Neg logit contributions
Found-0.009
Raw-0.008
Got-0.008
 ah-0.008
 spoiler-0.008
Token the
Token ID262
Feature activation-2.08e-02

Pos logit contributions
king+0.129
 knowing+0.126
asured+0.117
 than+0.115
 awaiting+0.113
Neg logit contributions
Found-0.133
 ah-0.132
 are-0.129
Got-0.123
 tube-0.123
Token way
Token ID835
Feature activation-1.47e-02

Pos logit contributions
 knowing+0.035
king+0.034
 than+0.032
 to+0.032
asured+0.031
Neg logit contributions
Found-0.036
Raw-0.034
 ah-0.034
 spoiler-0.033
 conclusions-0.033
Token they
Token ID484
Feature activation-1.82e-02

Pos logit contributions
 knowing+0.021
king+0.021
 than+0.019
asured+0.019
 awaiting+0.019
Neg logit contributions
Found-0.028
 ah-0.027
Raw-0.027
 spoiler-0.026
 conclusions-0.026
Token found
Token ID1043
Feature activation+1.97e-03

Pos logit contributions
 knowing+0.030
 to+0.029
king+0.028
 when+0.028
 for+0.028
Neg logit contributions
Found-0.032
 ah-0.031
Raw-0.031
 spoiler-0.030
 conclusions-0.029
Token a
Token ID257
Feature activation-3.16e-03

Pos logit contributions
Raw+0.004
 spoiler+0.004
Found+0.004
 ah+0.004
 ey+0.004
Neg logit contributions
 knowing-0.003
 to-0.003
 for-0.003
 when-0.003
 where-0.002
Token up
Token ID510
Feature activation-2.20e-02

Pos logit contributions
 to+0.027
 for+0.027
 when+0.025
 knowing+0.024
 where+0.024
Neg logit contributions
Raw-0.049
 ah-0.049
Found-0.049
 spoiler-0.049
 ey-0.046
Token the
Token ID262
Feature activation-1.55e-03

Pos logit contributions
 to+0.025
 knowing+0.024
 for+0.024
 when+0.024
 where+0.023
Neg logit contributions
Found-0.051
Raw-0.050
 ah-0.049
 spoiler-0.049
Got-0.048
Token weapon
Token ID4282
Feature activation+8.53e-03

Pos logit contributions
 to+0.004
 for+0.003
 knowing+0.003
 when+0.003
 than+0.003
Neg logit contributions
 ah-0.002
Found-0.002
Raw-0.002
 conclusions-0.002
 spoiler-0.002
Token.
Token ID13
Feature activation-8.30e-03

Pos logit contributions
 tube+0.016
hell+0.016
 are+0.015
 vest+0.015
Found+0.015
Neg logit contributions
king-0.011
 knowing-0.010
asured-0.010
ind-0.010
 awaiting-0.010
Token But
Token ID887
Feature activation+6.06e-03

Pos logit contributions
 knowing+0.021
king+0.021
 than+0.020
asured+0.019
ind+0.019
Neg logit contributions
Raw-0.007
Found-0.007
 spoiler-0.006
 ah-0.006
Got-0.006
Token as
Token ID355
Feature activation-7.28e-02

Pos logit contributions
Found+0.013
 ah+0.013
Raw+0.012
 spoiler+0.012
 conclusions+0.012
Neg logit contributions
king-0.007
 knowing-0.007
 than-0.006
asured-0.006
 awaiting-0.006
Token soon
Token ID2582
Feature activation-1.46e-02

Pos logit contributions
 knowing+0.123
king+0.121
 to+0.115
 than+0.114
 for+0.113
Neg logit contributions
Found-0.121
 ah-0.119
Raw-0.118
 spoiler-0.113
 conclusions-0.110
Token as
Token ID355
Feature activation-7.22e-02

Pos logit contributions
king+0.009
 knowing+0.008
asured+0.007
 than+0.006
ind+0.006
Neg logit contributions
Found-0.040
 ah-0.039
Raw-0.039
Got-0.038
AAAA-0.038
Token he
Token ID339
Feature activation-9.35e-03

Pos logit contributions
 knowing+0.121
 to+0.116
 when+0.114
king+0.113
 for+0.113
Neg logit contributions
Found-0.124
 ah-0.119
Raw-0.119
 spoiler-0.117
 conclusions-0.114
Token touched
Token ID12615
Feature activation-1.54e-02

Pos logit contributions
 knowing+0.016
king+0.015
 to+0.015
 than+0.015
 for+0.015
Neg logit contributions
Found-0.016
 ah-0.015
Raw-0.015
 spoiler-0.015
 conclusions-0.014
Token it
Token ID340
Feature activation+1.77e-02

Pos logit contributions
 to+0.018
 when+0.017
 for+0.016
 where+0.016
 before+0.016
Neg logit contributions
Found-0.039
Raw-0.038
Got-0.037
 ah-0.036
bridge-0.036
Token's
Token ID338
Feature activation-4.10e-02

Pos logit contributions
Found+0.050
Raw+0.050
 ah+0.049
 spoiler+0.047
 tube+0.046
Neg logit contributions
 knowing-0.047
king-0.046
 than-0.043
asured-0.043
 to-0.042
Token okay
Token ID8788
Feature activation+6.61e-03

Pos logit contributions
king+0.071
 knowing+0.068
 to+0.063
 awaiting+0.062
asured+0.061
Neg logit contributions
Found-0.063
Raw-0.062
 are-0.061
 spoiler-0.061
 ah-0.059
Token,
Token ID11
Feature activation+9.08e-03

Pos logit contributions
 ah+0.011
Raw+0.011
Found+0.011
 are+0.010
 conclusions+0.010
Neg logit contributions
king-0.011
 knowing-0.010
asured-0.010
 awaiting-0.010
app-0.009
Token as
Token ID355
Feature activation-7.18e-02

Pos logit contributions
Found+0.016
 ah+0.016
Raw+0.015
 spoiler+0.015
 conclusions+0.014
Neg logit contributions
 knowing-0.015
king-0.014
 to-0.014
 than-0.013
asured-0.013
Token long
Token ID890
Feature activation-2.08e-03

Pos logit contributions
king+0.108
 knowing+0.107
 than+0.099
 to+0.099
asured+0.098
Neg logit contributions
 ah-0.129
Found-0.128
Raw-0.128
 spoiler-0.122
 conclusions-0.121
Token as
Token ID355
Feature activation-6.77e-02

Pos logit contributions
king+0.002
 knowing+0.001
asured+0.001
rou+0.001
ind+0.001
Neg logit contributions
 ah-0.005
Raw-0.005
Found-0.005
 spoiler-0.005
bridge-0.005
Token they
Token ID484
Feature activation-6.18e-03

Pos logit contributions
king+0.109
 knowing+0.105
asured+0.099
 awaiting+0.096
 than+0.095
Neg logit contributions
 ah-0.116
Raw-0.115
Found-0.113
 conclusions-0.108
 spoiler-0.106
Token can
Token ID460
Feature activation+2.62e-03

Pos logit contributions
 knowing+0.014
king+0.013
 to+0.013
 for+0.013
 than+0.013
Neg logit contributions
Found-0.007
 ah-0.007
Raw-0.007
 spoiler-0.006
 conclusions-0.006
Token help
Token ID1037
Feature activation+2.43e-02

Pos logit contributions
Found+0.005
 ah+0.004
Raw+0.004
 spoiler+0.004
 conclusions+0.004
Neg logit contributions
 knowing-0.004
king-0.004
 to-0.004
 than-0.004
 when-0.004
Token me
Token ID502
Feature activation+3.08e-02

Pos logit contributions
 conclusions+0.043
 ah+0.042
Raw+0.042
 are+0.041
Found+0.040
Neg logit contributions
 knowing-0.037
king-0.035
asured-0.034
 awaiting-0.034
 than-0.031
Token on
Token ID319
Feature activation+5.87e-03

Pos logit contributions
 are+0.051
 conclusions+0.051
 ah+0.050
Raw+0.049
hing+0.049
Neg logit contributions
 knowing-0.047
king-0.045
 awaiting-0.042
asured-0.041
 than-0.041
Token world
Token ID995
Feature activation-8.79e-03

Pos logit contributions
 knowing+0.001
 to+0.001
king+0.001
 than+0.001
 when+0.001
Neg logit contributions
Found-0.001
 ah-0.001
Raw-0.001
 spoiler-0.001
 conclusions-0.001
Token.
Token ID13
Feature activation-8.76e-04

Pos logit contributions
king+0.013
 knowing+0.012
asured+0.012
 awaiting+0.011
 patiently+0.010
Neg logit contributions
 are-0.016
 tube-0.016
Found-0.016
hell-0.016
hing-0.015
Token One
Token ID1881
Feature activation-3.69e-03

Pos logit contributions
king+0.002
 knowing+0.002
 than+0.002
asured+0.002
 awaiting+0.002
Neg logit contributions
Found-0.001
Raw-0.001
 spoiler-0.001
 ey-0.001
 tube-0.001
Token day
Token ID1110
Feature activation-2.68e-02

Pos logit contributions
king+0.007
 knowing+0.006
 awaiting+0.006
asured+0.006
 than+0.006
Neg logit contributions
Found-0.006
 tube-0.006
Raw-0.005
 vest-0.005
 are-0.005
Token,
Token ID11
Feature activation+9.73e-03

Pos logit contributions
 knowing+0.043
king+0.042
asured+0.040
 than+0.039
 awaiting+0.038
Neg logit contributions
Found-0.047
 ah-0.046
Raw-0.045
 spoiler-0.043
Got-0.043
Token as
Token ID355
Feature activation-6.83e-02

Pos logit contributions
 tube+0.012
 vest+0.012
 ah+0.012
 are+0.012
Found+0.012
Neg logit contributions
asured-0.020
king-0.019
 supposed-0.018
ind-0.017
rying-0.017
Token she
Token ID673
Feature activation-1.37e-02

Pos logit contributions
king+0.120
asured+0.111
 awaiting+0.107
 knowing+0.104
 than+0.101
Neg logit contributions
 ah-0.107
Found-0.104
Raw-0.104
 tube-0.100
Got-0.098
Token was
Token ID373
Feature activation+2.64e-03

Pos logit contributions
 to+0.020
 knowing+0.020
 when+0.020
 for+0.019
 than+0.019
Neg logit contributions
 spoiler-0.025
 ah-0.025
Found-0.025
Raw-0.025
 conclusions-0.024
Token walking
Token ID6155
Feature activation-1.83e-02

Pos logit contributions
Found+0.005
Raw+0.005
 ah+0.005
 ey+0.005
AAAA+0.005
Neg logit contributions
king-0.004
 knowing-0.003
 than-0.003
asured-0.003
app-0.003
Token with
Token ID351
Feature activation+6.44e-03

Pos logit contributions
 to+0.018
 for+0.017
 when+0.015
 knowing+0.014
 where+0.014
Neg logit contributions
 spoiler-0.049
 ah-0.049
Raw-0.048
Found-0.047
 conclusions-0.046
Token her
Token ID607
Feature activation-4.46e-03

Pos logit contributions
Raw+0.017
 spoiler+0.016
Found+0.016
 ah+0.016
 conclusions+0.016
Neg logit contributions
 to-0.006
 when-0.005
 knowing-0.005
 where-0.005
 for-0.005
Tokenled
Token ID992
Feature activation-2.57e-02

Pos logit contributions
Found+0.051
 ah+0.051
Raw+0.049
Got+0.047
 spoiler+0.047
Neg logit contributions
 knowing-0.054
king-0.051
 to-0.050
 than-0.050
 where-0.050
Token away
Token ID1497
Feature activation+3.58e-02

Pos logit contributions
 knowing+0.034
king+0.032
 awaiting+0.031
cess+0.031
 supposed+0.031
Neg logit contributions
Found-0.046
Raw-0.045
 ah-0.045
 seaw-0.044
 spoiler-0.043
Token until
Token ID1566
Feature activation-2.20e-02

Pos logit contributions
 are+0.053
 rock+0.051
 tube+0.050
 pear+0.050
 shovel+0.048
Neg logit contributions
becca-0.059
ind-0.057
athy-0.057
resa-0.057
elia-0.055
Token the
Token ID262
Feature activation-6.40e-03

Pos logit contributions
 knowing+0.030
king+0.029
 than+0.028
 to+0.027
 for+0.027
Neg logit contributions
Found-0.044
 ah-0.043
Raw-0.043
 spoiler-0.042
 conclusions-0.041
Token next
Token ID1306
Feature activation-5.81e-03

Pos logit contributions
 knowing+0.013
king+0.012
 to+0.012
 for+0.012
 when+0.012
Neg logit contributions
Found-0.009
 ah-0.009
Raw-0.008
 spoiler-0.008
 conclusions-0.008
Token spring
Token ID6076
Feature activation-6.64e-02

Pos logit contributions
 knowing+0.010
king+0.010
asured+0.009
 awaiting+0.008
 than+0.008
Neg logit contributions
Raw-0.010
 tube-0.010
Found-0.009
 ah-0.009
 pear-0.009
Token.
Token ID13
Feature activation-3.82e-03

Pos logit contributions
 knowing+0.123
king+0.117
asured+0.115
 than+0.113
 awaiting+0.109
Neg logit contributions
Found-0.100
 ah-0.097
Raw-0.097
 tube-0.094
bridge-0.091
Token They
Token ID1119
Feature activation+9.23e-03

Pos logit contributions
 knowing+0.007
king+0.007
 than+0.007
asured+0.007
 awaiting+0.007
Neg logit contributions
Raw-0.006
Found-0.005
 ah-0.005
 spoiler-0.005
Got-0.005
Token never
Token ID1239
Feature activation-8.22e-03

Pos logit contributions
Found+0.015
Raw+0.015
 spoiler+0.015
 ah+0.015
 ey+0.015
Neg logit contributions
 knowing-0.016
 to-0.015
 for-0.015
king-0.015
 when-0.015
Token forgot
Token ID16453
Feature activation-3.72e-02

Pos logit contributions
 knowing+0.013
king+0.013
 to+0.012
 than+0.012
 for+0.012
Neg logit contributions
Found-0.015
 ah-0.014
Raw-0.014
 spoiler-0.014
 conclusions-0.013
Token about
Token ID546
Feature activation-5.77e-02

Pos logit contributions
king+0.046
 knowing+0.043
 than+0.043
 awaiting+0.041
app+0.041
Neg logit contributions
 ah-0.074
Found-0.074
 conclusions-0.072
 are-0.071
Raw-0.071