TOP ACTIVATIONS
MAX = 6.823

Lily assured her that L izzy was very friendly and loved
, mum and dad told Maria that they had an exciting
there was a girl called Min nie . She was very
day Mia 's mom introduced Mia to a diamond . It
sign that said " Safety First ". Lily remembered what her
dad made sure to encourage Maria to try all of the
a baby squirrel called T izzy . She was very small
inside , â said Ali â s mom
somewhere . She told L izzy to stay put and not
of the items and told Min nie the cost . Min
was another bear called L izzy . L izzy
walked the more thinking L izzy did . Finally , when
a little girl called L izzy . She was three years
teacher and she could teach Bella how to swim . Bella
of something y ummy told Bella it was ready . She
was a girl called L izzy . She loved going to
a little girl called L izzy . One day , L
today , so she showed Emily her new purple bag .
friendly , and he taught Ted lots of new things .
Her mom showed Julie how to use the comb

MIN ACTIVATIONS
MIN = -7.297

kept crying and throwing a tant rum . Finally
to cry and threw a tant rum . She didn 't
listen . She threw a tant rum in the store .
sad . She threw a tant rum and cried loudly .
. He kept throwing a tant rum . His mom was
She cried and threw a tant rum . Her mom tried
and started to throw a tant rum . Suddenly , she
and so she threw a tant rum when her parents said
her feet and threw a tant rum . She grabbed the
Who goes there ?" a voice said from inside .
stubborn . She throws a tant rum . She kicks and
surprise . He threw a tant rum . He kicked and
are you playing ?" a voice says . They
his feet and threw a tant rum . "
Leave them alone !" a voice said . L
Who is it ?" a voice asked . "
we join you ?" a voice asked . It was Mia
Hello , Lily ," a voice said . Lily turned around
are you doing ?" a voice said . They
going on here ?" a voice said . They

INTERVAL 3.293 - 6.823
CONTAINS 0.577%

" OK , Mia , let 's go .
very friendly . Frank showed Gerry all the best places to
front of them and hugged Lily . The dog
two friends , Max and Julie . They liked to share
was trying to make it hard for him to play ,

INTERVAL -0.237 - 3.293
CONTAINS 59.621%

Yes !" replied Joyce . " There 's a big billboard
think I did ?" L ily said , "
the discovery he had made . They were all amazed at
cup board . But she could not find her pretty mug
. But soon the boy had to go back

INTERVAL -3.767 - -0.237
CONTAINS 39.601%

She wanted to have the prett iest bicycle in town .
garden . He found a bag of seeds and he went
hello to the night . They thank Mom for explaining the
We should go inside and see if Mom and Dad are
very nice . He lived in a big tree with his

INTERVAL -7.297 - -3.767
CONTAINS 0.200%

, he heard a little voice say , " Wait for
questioned what it was and why it was there .
the sky . Lily was sad and cried . Her mom
a lot of noise and whim pered . She was very
L ily sees Tom crying . She feels sorry for
Token Lily
Token ID20037
Feature activation+1.58e+00

Pos logit contributions
 and+0.026
+0.025
 so+0.025
 called+0.024
 convey+0.024
Neg logit contributions
 disl-0.046
 Majesty-0.043
 Sorry-0.043
 winners-0.041
 majesty-0.041
Token assured
Token ID13933
Feature activation+9.79e-01

Pos logit contributions
 and+0.033
 so+0.032
+0.031
 called+0.030
sc+0.030
Neg logit contributions
 disl-0.050
 Majesty-0.046
 Sorry-0.046
 winners-0.044
 majesty-0.044
Token her
Token ID607
Feature activation+3.03e+00

Pos logit contributions
sc+0.019
On+0.018
+0.018
 convey+0.018
 and+0.017
Neg logit contributions
 disl-0.031
 Sorry-0.028
 yeah-0.028
 Majesty-0.027
 sincere-0.027
Token that
Token ID326
Feature activation+1.31e+00

Pos logit contributions
 and+0.064
 so+0.060
+0.059
 called+0.057
 even+0.056
Neg logit contributions
 disl-0.101
 Sorry-0.096
 Majesty-0.094
 winners-0.091
 majesty-0.088
Token L
Token ID406
Feature activation+4.39e+00

Pos logit contributions
 and+0.024
 so+0.023
+0.023
 called+0.022
On+0.022
Neg logit contributions
 disl-0.045
 Majesty-0.041
 Sorry-0.041
 winners-0.040
 majesty-0.039
Tokenizzy
Token ID40593
Feature activation+6.82e+00

Pos logit contributions
 convey+0.121
 so+0.116
+0.114
 and+0.112
 shelf+0.111
Neg logit contributions
 disl-0.104
rav-0.092
 Majesty-0.088
 yeah-0.088
 Sorry-0.087
Token was
Token ID373
Feature activation+2.53e+00

Pos logit contributions
 and+0.095
 so+0.091
+0.089
 called+0.086
On+0.081
Neg logit contributions
 disl-0.264
 Majesty-0.247
 Sorry-0.244
 winners-0.239
 majesty-0.237
Token very
Token ID845
Feature activation+7.48e-01

Pos logit contributions
 so+0.048
 and+0.048
 called+0.044
+0.044
 even+0.044
Neg logit contributions
 disl-0.091
 Sorry-0.088
 Majesty-0.087
 majesty-0.083
rav-0.083
Token friendly
Token ID8030
Feature activation+1.38e-01

Pos logit contributions
 and+0.018
+0.017
 called+0.017
 so+0.017
 carefully+0.016
Neg logit contributions
 disl-0.022
 Majesty-0.022
 Sorry-0.022
 winners-0.021
 yeah-0.020
Token and
Token ID290
Feature activation+1.47e+00

Pos logit contributions
 convey+0.001
On+0.001
At+0.001
sc+0.001
+0.001
Neg logit contributions
 disl-0.006
 winners-0.005
 nods-0.005
 Majesty-0.005
joy-0.005
Token loved
Token ID6151
Feature activation+2.01e+00

Pos logit contributions
 so+0.030
 and+0.030
 called+0.030
+0.028
 even+0.028
Neg logit contributions
 Majesty-0.049
 disl-0.048
 winners-0.046
joy-0.046
 Sorry-0.046
Token,
Token ID11
Feature activation-2.87e-01

Pos logit contributions
 disl+0.019
 Majesty+0.018
 winners+0.018
 majesty+0.017
 sincere+0.017
Neg logit contributions
 and-0.004
 so-0.003
 called-0.003
-0.002
 even-0.002
Token mum
Token ID25682
Feature activation-1.26e+00

Pos logit contributions
 disl+0.010
 Majesty+0.010
 winners+0.009
 majesty+0.009
 Sorry+0.009
Neg logit contributions
 and-0.006
 so-0.005
 called-0.005
-0.005
 even-0.005
Token and
Token ID290
Feature activation+1.96e+00

Pos logit contributions
 disl+0.041
 Majesty+0.037
 Sorry+0.037
 majesty+0.036
 winners+0.036
Neg logit contributions
sc-0.024
 convey-0.023
 so-0.023
On-0.022
-0.022
Token dad
Token ID9955
Feature activation-2.50e+00

Pos logit contributions
 and+0.036
 so+0.035
 called+0.033
 even+0.031
+0.031
Neg logit contributions
 disl-0.073
 Majesty-0.067
 winners-0.065
 majesty-0.065
 Sorry-0.065
Token told
Token ID1297
Feature activation+7.78e-01

Pos logit contributions
 disl+0.078
 Sorry+0.075
 winners+0.073
 Majesty+0.071
 yeah+0.068
Neg logit contributions
sc-0.046
 convey-0.045
 so-0.044
 shelf-0.044
 and-0.044
Token Maria
Token ID14200
Feature activation+6.77e+00

Pos logit contributions
 and+0.012
+0.012
 so+0.012
 called+0.011
sc+0.011
Neg logit contributions
 disl-0.029
 Majesty-0.027
 Sorry-0.026
 winners-0.026
 majesty-0.025
Token that
Token ID326
Feature activation+7.15e-01

Pos logit contributions
 and+0.091
+0.088
 so+0.088
 called+0.085
sc+0.084
Neg logit contributions
 disl-0.261
 Majesty-0.243
 Sorry-0.239
 winners-0.235
 majesty-0.233
Token they
Token ID484
Feature activation+2.15e+00

Pos logit contributions
 and+0.013
 so+0.012
+0.012
 called+0.012
sc+0.011
Neg logit contributions
 disl-0.025
 Majesty-0.023
 Sorry-0.023
 winners-0.022
 majesty-0.022
Token had
Token ID550
Feature activation+1.48e+00

Pos logit contributions
 and+0.037
 so+0.035
+0.035
 even+0.033
 called+0.033
Neg logit contributions
 disl-0.083
 Majesty-0.080
 majesty-0.076
 Sorry-0.075
 winners-0.073
Token an
Token ID281
Feature activation+1.54e+00

Pos logit contributions
 and+0.019
sc+0.018
+0.018
 convey+0.017
On+0.017
Neg logit contributions
 disl-0.057
 Majesty-0.053
 winners-0.053
 Sorry-0.053
 majesty-0.051
Token exciting
Token ID7895
Feature activation-1.16e+00

Pos logit contributions
sc+0.050
On+0.043
At+0.042
and+0.041
+0.039
Neg logit contributions
 disl-0.035
 Majesty-0.033
 sorry-0.032
 winners-0.030
 sincere-0.028
Token there
Token ID612
Feature activation-1.30e+00

Pos logit contributions
 and+0.004
 so+0.004
 called+0.004
+0.004
 convey+0.004
Neg logit contributions
 disl-0.013
 Sorry-0.012
 Majesty-0.012
 winners-0.012
 majesty-0.012
Token was
Token ID373
Feature activation-3.51e-02

Pos logit contributions
 disl+0.048
 Sorry+0.045
 Majesty+0.043
 yeah+0.043
 winners+0.042
Neg logit contributions
 convey-0.018
sc-0.018
 so-0.017
 and-0.017
On-0.017
Token a
Token ID257
Feature activation+1.61e+00

Pos logit contributions
 disl+0.001
 winners+0.001
 wow+0.001
 Sorry+0.001
 Majesty+0.001
Neg logit contributions
 convey-0.001
sc-0.001
 shelf-0.001
 aisle-0.001
 floor-0.001
Token girl
Token ID2576
Feature activation-8.82e-01

Pos logit contributions
 and+0.026
 so+0.025
+0.024
 called+0.024
 even+0.022
Neg logit contributions
 disl-0.060
 Majesty-0.056
 Sorry-0.055
 winners-0.054
 majesty-0.053
Token called
Token ID1444
Feature activation+1.29e+00

Pos logit contributions
 disl+0.019
 writer+0.017
 Sorry+0.017
 sincere+0.016
 sorry+0.015
Neg logit contributions
sc-0.025
 convey-0.023
On-0.023
 floor-0.022
 off-0.022
Token Min
Token ID1855
Feature activation+6.70e+00

Pos logit contributions
 and+0.030
On+0.029
 convey+0.029
 so+0.029
sc+0.029
Neg logit contributions
 disl-0.036
 Sorry-0.033
 Majesty-0.033
 majesty-0.032
 winners-0.031
Tokennie
Token ID11952
Feature activation+3.60e+00

Pos logit contributions
 so+0.137
On+0.133
+0.132
 convey+0.130
sc+0.127
Neg logit contributions
 disl-0.197
 Majesty-0.185
 majesty-0.179
 writer-0.176
rav-0.173
Token.
Token ID13
Feature activation-7.99e-02

Pos logit contributions
sc+0.047
 convey+0.044
On+0.042
+0.041
 so+0.041
Neg logit contributions
 disl-0.136
 Majesty-0.126
 Sorry-0.125
 winners-0.123
 majesty-0.122
Token She
Token ID1375
Feature activation+7.54e-01

Pos logit contributions
 disl+0.002
 Sorry+0.002
 Majesty+0.002
 yeah+0.002
osity+0.002
Neg logit contributions
 and-0.002
 so-0.002
 called-0.002
-0.002
sc-0.002
Token was
Token ID373
Feature activation+1.77e-01

Pos logit contributions
sc+0.017
 convey+0.017
 so+0.017
 and+0.016
+0.016
Neg logit contributions
 disl-0.021
 Sorry-0.019
 Majesty-0.018
 sincere-0.018
 majesty-0.018
Token very
Token ID845
Feature activation-5.03e-01

Pos logit contributions
 and+0.002
sc+0.002
+0.002
On+0.002
 convey+0.002
Neg logit contributions
 disl-0.006
 Majesty-0.006
 Sorry-0.006
 winners-0.006
 majesty-0.006
Token day
Token ID1110
Feature activation-4.01e-01

Pos logit contributions
 and+0.022
 so+0.021
+0.020
 called+0.020
 even+0.019
Neg logit contributions
 disl-0.049
 Majesty-0.046
 Sorry-0.046
 winners-0.045
 majesty-0.044
Token Mia
Token ID32189
Feature activation+2.65e+00

Pos logit contributions
 disl+0.017
 Majesty+0.016
 Sorry+0.016
 winners+0.016
 majesty+0.015
Neg logit contributions
 and-0.005
 so-0.004
-0.004
 called-0.004
 even-0.004
Token's
Token ID338
Feature activation+1.38e-01

Pos logit contributions
 convey+0.057
sc+0.054
 shelf+0.052
 so+0.049
ula+0.048
Neg logit contributions
 disl-0.075
 Majesty-0.069
 Sorry-0.067
 nods-0.066
 majesty-0.064
Token mom
Token ID1995
Feature activation-1.44e+00

Pos logit contributions
sc+0.003
 and+0.003
+0.003
 so+0.003
On+0.003
Neg logit contributions
 disl-0.004
 Majesty-0.004
 Sorry-0.004
 winners-0.004
 majesty-0.004
Token introduced
Token ID5495
Feature activation+1.29e+00

Pos logit contributions
 disl+0.038
 Majesty+0.035
 Sorry+0.035
 majesty+0.034
 winners+0.033
Neg logit contributions
sc-0.033
 convey-0.032
 shelf-0.030
On-0.030
 so-0.029
Token Mia
Token ID32189
Feature activation+6.52e+00

Pos logit contributions
+0.021
 convey+0.021
sc+0.021
 and+0.021
On+0.020
Neg logit contributions
 disl-0.045
 Majesty-0.041
 Sorry-0.040
 winners-0.040
 majesty-0.039
Token to
Token ID284
Feature activation+8.14e-01

Pos logit contributions
 convey+0.092
On+0.075
 floor+0.073
sc+0.073
At+0.071
Neg logit contributions
 disl-0.229
 Majesty-0.214
 winners-0.211
 writer-0.206
osity-0.205
Token a
Token ID257
Feature activation+1.79e+00

Pos logit contributions
sc+0.014
+0.013
 and+0.013
 convey+0.013
 called+0.012
Neg logit contributions
 disl-0.028
 Majesty-0.026
 Sorry-0.026
 winners-0.025
 majesty-0.025
Token diamond
Token ID15291
Feature activation-1.58e+00

Pos logit contributions
 and+0.037
sc+0.034
+0.034
 so+0.034
 called+0.034
Neg logit contributions
 disl-0.053
 Majesty-0.050
 Sorry-0.050
 winners-0.049
 majesty-0.047
Token.
Token ID13
Feature activation+8.12e-01

Pos logit contributions
 disl+0.056
 Majesty+0.054
 winners+0.054
 Sorry+0.054
 writer+0.053
Neg logit contributions
-0.021
 convey-0.019
On-0.019
sc-0.018
At-0.018
Token It
Token ID632
Feature activation-1.94e+00

Pos logit contributions
 and+0.018
 convey+0.018
 so+0.017
+0.017
 even+0.017
Neg logit contributions
 disl-0.024
 Sorry-0.022
 Majesty-0.022
 yeah-0.021
 majesty-0.021
Token sign
Token ID1051
Feature activation-1.62e+00

Pos logit contributions
 and+0.002
 so+0.002
+0.002
 called+0.002
 even+0.002
Neg logit contributions
 disl-0.005
 Majesty-0.005
 Sorry-0.005
osity-0.005
 majesty-0.005
Token that
Token ID326
Feature activation-8.33e-01

Pos logit contributions
 disl+0.053
 winners+0.053
 Sorry+0.051
 sincere+0.049
 sorry+0.049
Neg logit contributions
 convey-0.023
sc-0.023
-0.023
On-0.022
ern-0.021
Token said
Token ID531
Feature activation+1.11e+00

Pos logit contributions
 disl+0.027
 Sorry+0.025
 Majesty+0.025
 winners+0.024
 majesty+0.024
Neg logit contributions
 and-0.016
-0.016
 so-0.015
On-0.014
 convey-0.014
Token "
Token ID366
Feature activation+3.20e+00

Pos logit contributions
 and+0.020
 so+0.019
+0.017
 called+0.017
 even+0.016
Neg logit contributions
 disl-0.041
 Majesty-0.038
 Sorry-0.038
 majesty-0.037
 winners-0.036
TokenSafety
Token ID45372
Feature activation+1.28e+00

Pos logit contributions
 and+0.072
 so+0.070
+0.069
 called+0.067
 even+0.065
Neg logit contributions
 disl-0.096
 Sorry-0.088
 Majesty-0.087
 winners-0.085
 majesty-0.082
Token First
Token ID3274
Feature activation+6.47e+00

Pos logit contributions
 and+0.024
 so+0.023
+0.023
 called+0.023
 convey+0.022
Neg logit contributions
 disl-0.042
 Majesty-0.039
 Sorry-0.039
 winners-0.038
 majesty-0.037
Token".
Token ID1911
Feature activation-8.07e-01

Pos logit contributions
On+0.111
 convey+0.110
sc+0.106
 unw+0.098
 called+0.097
Neg logit contributions
 disl-0.208
 Majesty-0.204
 Sorry-0.201
 winners-0.195
 majesty-0.194
Token Lily
Token ID20037
Feature activation+8.39e-01

Pos logit contributions
 disl+0.025
 Sorry+0.023
 Majesty+0.022
 winners+0.022
 yeah+0.022
Neg logit contributions
 and-0.016
-0.016
 called-0.016
 convey-0.016
 even-0.015
Token remembered
Token ID12086
Feature activation+1.47e+00

Pos logit contributions
sc+0.016
 convey+0.015
 so+0.015
 shelf+0.014
On+0.014
Neg logit contributions
 disl-0.026
 Majesty-0.026
 majesty-0.025
 nods-0.024
 winners-0.023
Token what
Token ID644
Feature activation-1.16e+00

Pos logit contributions
sc+0.033
+0.030
 convey+0.030
 shelf+0.028
On+0.027
Neg logit contributions
 disl-0.047
 Majesty-0.042
 majesty-0.040
 yeah-0.039
 sincere-0.039
Token her
Token ID607
Feature activation+3.61e-01

Pos logit contributions
 disl+0.039
 Sorry+0.035
 Majesty+0.035
 winners+0.035
 yeah+0.034
Neg logit contributions
sc-0.022
 and-0.021
 convey-0.020
-0.020
 so-0.020
Token dad
Token ID9955
Feature activation-2.27e+00

Pos logit contributions
 and+0.032
 so+0.031
+0.030
 called+0.029
 even+0.028
Neg logit contributions
 disl-0.052
 Majesty-0.047
 Sorry-0.047
 winners-0.046
 majesty-0.045
Token made
Token ID925
Feature activation+1.66e+00

Pos logit contributions
 disl+0.070
 Majesty+0.068
 winners+0.067
 Sorry+0.067
 majesty+0.065
Neg logit contributions
sc-0.041
 convey-0.041
 so-0.039
On-0.039
 shelf-0.039
Token sure
Token ID1654
Feature activation+1.82e+00

Pos logit contributions
On+0.028
 and+0.028
+0.028
 convey+0.028
 shelf+0.027
Neg logit contributions
 disl-0.056
 Majesty-0.052
 Sorry-0.051
 winners-0.051
 majesty-0.049
Token to
Token ID284
Feature activation+2.49e-02

Pos logit contributions
 and+0.025
 so+0.024
+0.023
 called+0.023
 even+0.021
Neg logit contributions
 disl-0.072
 Majesty-0.067
 Sorry-0.066
 winners-0.065
 majesty-0.064
Token encourage
Token ID7898
Feature activation-2.27e-01

Pos logit contributions
sc+0.001
 and+0.001
+0.001
 so+0.000
 convey+0.000
Neg logit contributions
 disl-0.001
 Sorry-0.001
 Majesty-0.001
 winners-0.001
 majesty-0.001
Token Maria
Token ID14200
Feature activation+6.44e+00

Pos logit contributions
 disl+0.007
 Sorry+0.007
 Majesty+0.007
 winners+0.007
 majesty+0.006
Neg logit contributions
On-0.004
sc-0.004
 convey-0.004
 aisle-0.004
-0.004
Token to
Token ID284
Feature activation+3.38e-01

Pos logit contributions
 convey+0.071
On+0.060
sc+0.058
 shelf+0.055
 floor+0.053
Neg logit contributions
 disl-0.255
 Majesty-0.242
 winners-0.238
 majesty-0.234
osity-0.230
Token try
Token ID1949
Feature activation+1.78e+00

Pos logit contributions
sc+0.007
 and+0.007
 so+0.007
+0.007
 convey+0.006
Neg logit contributions
 disl-0.011
 Majesty-0.010
 Sorry-0.010
 winners-0.009
 majesty-0.009
Token all
Token ID477
Feature activation+1.51e+00

Pos logit contributions
sc+0.030
 convey+0.029
+0.026
 aisle+0.026
 called+0.026
Neg logit contributions
 disl-0.060
joy-0.054
 Majesty-0.054
 winners-0.052
 wow-0.051
Token of
Token ID286
Feature activation+3.25e+00

Pos logit contributions
 convey+0.026
sc+0.024
On+0.024
 called+0.023
+0.022
Neg logit contributions
 disl-0.053
 winners-0.048
 Sorry-0.045
 yeah-0.045
 Majesty-0.045
Token the
Token ID262
Feature activation+1.56e+00

Pos logit contributions
sc+0.060
+0.059
On+0.057
 called+0.055
 and+0.054
Neg logit contributions
 disl-0.108
 Majesty-0.098
 winners-0.096
 yeah-0.094
 wow-0.093
Token a
Token ID257
Feature activation+1.55e+00

Pos logit contributions
 convey+0.001
sc+0.001
 shelf+0.001
 aisle+0.001
 floor+0.001
Neg logit contributions
 disl-0.001
 winners-0.001
 Sorry-0.001
 wow-0.001
 Majesty-0.001
Token baby
Token ID5156
Feature activation-7.13e-01

Pos logit contributions
 and+0.025
 so+0.023
+0.023
 called+0.022
 even+0.021
Neg logit contributions
 disl-0.059
 Majesty-0.055
 Sorry-0.054
 winners-0.053
 majesty-0.052
Token squirrel
Token ID33039
Feature activation-1.30e+00

Pos logit contributions
 disl+0.023
 Majesty+0.021
 Sorry+0.021
 winners+0.021
 majesty+0.020
Neg logit contributions
-0.014
 and-0.014
 so-0.014
sc-0.013
On-0.013
Token called
Token ID1444
Feature activation+1.36e+00

Pos logit contributions
 disl+0.042
 Sorry+0.038
 sincere+0.038
 Majesty+0.038
 winners+0.037
Neg logit contributions
 convey-0.023
-0.022
sc-0.022
 so-0.021
 shelf-0.020
Token T
Token ID309
Feature activation+3.26e+00

Pos logit contributions
 and+0.029
 so+0.028
+0.028
 called+0.027
 even+0.026
Neg logit contributions
 disl-0.043
 Majesty-0.040
 Sorry-0.040
 winners-0.039
 majesty-0.038
Tokenizzy
Token ID40593
Feature activation+6.41e+00

Pos logit contributions
sc+0.083
 so+0.082
+0.080
 convey+0.080
 called+0.080
Neg logit contributions
 disl-0.083
 Majesty-0.081
 majesty-0.078
rav-0.074
osity-0.073
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
 convey+0.088
sc+0.088
On+0.087
+0.086
At+0.082
Neg logit contributions
 disl-0.230
 Majesty-0.222
 majesty-0.213
osity-0.203
 winners-0.202
Token She
Token ID1375
Feature activation+8.25e-01

Pos logit contributions
+0.007
sc+0.006
 and+0.006
 carefully+0.006
 so+0.006
Neg logit contributions
 disl-0.006
 Sorry-0.006
 Majesty-0.006
 yeah-0.006
joy-0.006
Token was
Token ID373
Feature activation+2.61e-01

Pos logit contributions
 convey+0.018
 so+0.018
+0.018
 and+0.018
sc+0.018
Neg logit contributions
 disl-0.024
 Sorry-0.021
 Majesty-0.021
 majesty-0.020
 sincere-0.020
Token very
Token ID845
Feature activation-3.76e-01

Pos logit contributions
 and+0.003
+0.003
sc+0.003
 so+0.003
On+0.003
Neg logit contributions
 disl-0.011
 Majesty-0.010
 Sorry-0.010
 winners-0.010
 majesty-0.010
Token small
Token ID1402
Feature activation+2.06e+00

Pos logit contributions
 disl+0.012
 Majesty+0.012
osity+0.011
 winners+0.011
 Sorry+0.011
Neg logit contributions
 so-0.008
 and-0.008
 called-0.008
-0.007
 even-0.007
Token inside
Token ID2641
Feature activation-3.75e-01

Pos logit contributions
 disl+0.007
 Majesty+0.006
 Sorry+0.006
 winners+0.006
 majesty+0.006
Neg logit contributions
 and-0.003
-0.003
 so-0.003
 called-0.002
sc-0.002
Token,
Token ID11
Feature activation-2.27e-01

Pos logit contributions
 disl+0.012
 Sorry+0.012
 winners+0.011
 Majesty+0.011
 yeah+0.011
Neg logit contributions
On-0.006
 convey-0.006
sc-0.006
At-0.006
Days-0.005
Tokenâ
Token ID22940
Feature activation+1.74e+00

Pos logit contributions
 disl+0.009
 Majesty+0.008
 Sorry+0.008
 winners+0.008
 majesty+0.008
Neg logit contributions
sc-0.002
 and-0.002
 called-0.002
On-0.002
 convey-0.002
Token
Token ID26391
Feature activation+3.88e-01

Pos logit contributions
 and+0.044
+0.044
 so+0.042
 called+0.042
sc+0.040
Neg logit contributions
 disl-0.048
 Majesty-0.046
 Sorry-0.044
 winners-0.043
 majesty-0.043
Token said
Token ID531
Feature activation+1.13e+00

Pos logit contributions
 and+0.009
+0.008
 so+0.008
 called+0.008
On+0.007
Neg logit contributions
 disl-0.013
 Majesty-0.012
 winners-0.011
 Sorry-0.011
osity-0.011
Token Ali
Token ID12104
Feature activation+6.40e+00

Pos logit contributions
 and+0.020
 so+0.019
+0.019
 called+0.018
 even+0.017
Neg logit contributions
 disl-0.041
 Majesty-0.038
 Sorry-0.037
 winners-0.037
 majesty-0.036
Tokenâ
Token ID22940
Feature activation+3.30e+00

Pos logit contributions
 and+0.067
 so+0.060
+0.059
 called+0.053
 even+0.047
Neg logit contributions
 disl-0.280
 Sorry-0.266
 Majesty-0.262
 winners-0.258
 majesty-0.253
Token
Token ID26391
Feature activation+2.87e+00

Pos logit contributions
 and+0.082
+0.081
 so+0.078
 called+0.077
sc+0.075
Neg logit contributions
 disl-0.093
 Majesty-0.089
 Sorry-0.085
 winners-0.084
 majesty-0.083
Token
Token ID8151
Feature activation+2.94e+00

Pos logit contributions
 and+0.062
+0.059
 so+0.058
 called+0.056
On+0.053
Neg logit contributions
 disl-0.094
 Majesty-0.085
 Sorry-0.085
 winners-0.083
osity-0.082
Tokens
Token ID82
Feature activation+2.01e+00

Pos logit contributions
 convey+0.075
 called+0.068
On+0.067
 shelf+0.067
 so+0.067
Neg logit contributions
 disl-0.074
rav-0.069
 Sorry-0.068
 winners-0.067
 yeah-0.064
Token mom
Token ID1995
Feature activation+1.91e+00

Pos logit contributions
 and+0.060
 so+0.056
+0.056
 called+0.053
 shelf+0.052
Neg logit contributions
 disl-0.051
 Sorry-0.044
 winners-0.043
 Majesty-0.043
 yeah-0.042
Token somewhere
Token ID7382
Feature activation-1.33e+00

Pos logit contributions
sc+0.010
+0.010
So+0.009
 floor+0.009
 shelf+0.009
Neg logit contributions
 disl-0.017
 winners-0.017
 Majesty-0.017
joy-0.016
 Sorry-0.016
Token.
Token ID13
Feature activation+2.92e-01

Pos logit contributions
 disl+0.038
 winners+0.035
 sorry+0.034
 Sorry+0.034
 Majesty+0.033
Neg logit contributions
sc-0.026
 convey-0.026
ern-0.025
 shelf-0.024
 strips-0.024
Token She
Token ID1375
Feature activation+3.59e-01

Pos logit contributions
+0.007
 even+0.007
 and+0.007
 convey+0.007
sc+0.007
Neg logit contributions
 disl-0.007
 Sorry-0.007
 yeah-0.007
 Majesty-0.007
osity-0.006
Token told
Token ID1297
Feature activation+1.88e-01

Pos logit contributions
sc+0.008
 convey+0.008
 shelf+0.008
 floor+0.008
 so+0.007
Neg logit contributions
 disl-0.009
 Sorry-0.009
 Majesty-0.008
 nods-0.008
 sincere-0.008
Token L
Token ID406
Feature activation+3.75e+00

Pos logit contributions
sc+0.004
+0.004
On+0.004
 convey+0.004
 aisle+0.004
Neg logit contributions
 disl-0.005
 Sorry-0.004
 hey-0.004
 Majesty-0.004
 yeah-0.004
Tokenizzy
Token ID40593
Feature activation+6.40e+00

Pos logit contributions
 convey+0.120
 so+0.113
 shelf+0.111
+0.111
 and+0.108
Neg logit contributions
 disl-0.070
rav-0.066
 yeah-0.060
osity-0.058
 Majesty-0.058
Token to
Token ID284
Feature activation-1.10e-01

Pos logit contributions
 and+0.088
 so+0.083
+0.082
 called+0.079
 even+0.074
Neg logit contributions
 disl-0.250
 Majesty-0.234
 Sorry-0.232
 winners-0.227
 majesty-0.225
Token stay
Token ID2652
Feature activation-1.13e+00

Pos logit contributions
 disl+0.003
 Sorry+0.003
 Majesty+0.003
 winners+0.003
 majesty+0.003
Neg logit contributions
sc-0.002
 and-0.002
 so-0.002
-0.002
 convey-0.002
Token put
Token ID1234
Feature activation+1.50e+00

Pos logit contributions
 disl+0.038
 Majesty+0.037
 sincere+0.036
 winners+0.036
 Sorry+0.036
Neg logit contributions
-0.018
sc-0.018
 convey-0.016
 shelf-0.015
 floor-0.015
Token and
Token ID290
Feature activation+7.36e-01

Pos logit contributions
+0.024
On+0.023
sc+0.021
 convey+0.021
At+0.020
Neg logit contributions
 disl-0.051
 Sorry-0.048
 winners-0.048
 Majesty-0.047
 sorry-0.046
Token not
Token ID407
Feature activation-1.61e+00

Pos logit contributions
 and+0.015
 so+0.014
 called+0.014
+0.014
 even+0.013
Neg logit contributions
 disl-0.025
 Majesty-0.024
 winners-0.023
 Sorry-0.023
osity-0.023
Token of
Token ID286
Feature activation+2.23e+00

Pos logit contributions
 convey+0.014
On+0.012
 shelf+0.012
sc+0.012
 called+0.012
Neg logit contributions
 disl-0.021
 winners-0.020
 nods-0.020
 yeah-0.019
osity-0.019
Token the
Token ID262
Feature activation+6.25e-01

Pos logit contributions
+0.042
 convey+0.041
sc+0.039
On+0.037
 and+0.037
Neg logit contributions
 disl-0.067
 winners-0.064
 yeah-0.062
 Majesty-0.061
 majesty-0.061
Token items
Token ID3709
Feature activation+1.24e+00

Pos logit contributions
+0.013
 and+0.013
On+0.012
 so+0.012
 called+0.012
Neg logit contributions
 disl-0.019
 Majesty-0.018
 winners-0.017
 majesty-0.017
 yeah-0.016
Token and
Token ID290
Feature activation+3.48e-01

Pos logit contributions
 convey+0.017
On+0.016
sc+0.016
 shelf+0.016
 floor+0.015
Neg logit contributions
 winners-0.041
 disl-0.041
 yeah-0.039
osity-0.039
 nods-0.038
Token told
Token ID1297
Feature activation+3.95e-01

Pos logit contributions
+0.005
 and+0.005
On+0.005
 convey+0.005
sc+0.005
Neg logit contributions
 disl-0.012
 Majesty-0.011
 Sorry-0.011
 winners-0.011
 majesty-0.011
Token Min
Token ID1855
Feature activation+6.38e+00

Pos logit contributions
+0.007
 and+0.006
On+0.006
 called+0.006
 so+0.006
Neg logit contributions
 disl-0.014
 Majesty-0.013
 Sorry-0.012
 winners-0.012
 majesty-0.012
Tokennie
Token ID11952
Feature activation+4.56e+00

Pos logit contributions
 so+0.123
 and+0.120
On+0.120
 convey+0.120
 called+0.119
Neg logit contributions
 disl-0.202
 Majesty-0.187
 majesty-0.179
 winners-0.177
rav-0.175
Token the
Token ID262
Feature activation+2.30e+00

Pos logit contributions
 and+0.069
 so+0.065
+0.063
 called+0.059
 even+0.058
Neg logit contributions
 disl-0.178
 Sorry-0.168
 Majesty-0.167
 winners-0.162
 majesty-0.160
Token cost
Token ID1575
Feature activation+1.54e+00

Pos logit contributions
 and+0.049
+0.047
 called+0.047
 so+0.046
On+0.045
Neg logit contributions
 disl-0.072
 Majesty-0.067
 winners-0.065
 Sorry-0.065
 majesty-0.063
Token.
Token ID13
Feature activation+4.08e-01

Pos logit contributions
 convey+0.028
On+0.026
 shelf+0.026
 aisle+0.024
 floor+0.024
Neg logit contributions
 winners-0.049
 disl-0.048
 yeah-0.043
joy-0.043
rav-0.043
Token Min
Token ID1855
Feature activation+4.12e+00

Pos logit contributions
 and+0.009
 so+0.009
 called+0.009
+0.008
 even+0.008
Neg logit contributions
 disl-0.012
 Majesty-0.011
 Sorry-0.011
 majesty-0.011
 winners-0.011
Token was
Token ID373
Feature activation+9.10e-01

Pos logit contributions
 disl+0.065
 Sorry+0.060
 Majesty+0.059
 winners+0.057
 majesty+0.057
Neg logit contributions
 and-0.030
 so-0.029
-0.028
 convey-0.028
On-0.027
Token another
Token ID1194
Feature activation+9.93e-01

Pos logit contributions
sc+0.017
 convey+0.016
+0.015
 aisle+0.015
 and+0.015
Neg logit contributions
 disl-0.030
 sincere-0.028
 sorry-0.027
 winners-0.027
 Majesty-0.026
Token bear
Token ID6842
Feature activation+5.13e-01

Pos logit contributions
+0.024
 so+0.024
 and+0.024
sc+0.023
On+0.023
Neg logit contributions
 disl-0.027
 Majesty-0.025
 Sorry-0.024
 winners-0.024
 majesty-0.023
Token called
Token ID1444
Feature activation+2.10e+00

Pos logit contributions
 and+0.006
 so+0.006
+0.006
 called+0.006
On+0.006
Neg logit contributions
 disl-0.020
 Majesty-0.019
 Sorry-0.019
 winners-0.018
 majesty-0.018
Token L
Token ID406
Feature activation+4.33e+00

Pos logit contributions
 and+0.043
 so+0.041
+0.041
 called+0.040
 even+0.038
Neg logit contributions
 disl-0.068
 Majesty-0.063
 Sorry-0.062
 winners-0.060
 majesty-0.060
Tokenizzy
Token ID40593
Feature activation+6.33e+00

Pos logit contributions
 convey+0.105
 so+0.103
 and+0.102
+0.101
 shelf+0.100
Neg logit contributions
 disl-0.120
 Majesty-0.108
 yeah-0.104
 winners-0.104
 Sorry-0.103
Token.
Token ID13
Feature activation+4.81e-01

Pos logit contributions
 and+0.067
 so+0.064
+0.062
 called+0.060
On+0.056
Neg logit contributions
 disl-0.263
 Majesty-0.249
 Sorry-0.244
 winners-0.241
 majesty-0.239
Token
Token ID198
Feature activation+7.63e-01

Pos logit contributions
 and+0.010
 so+0.010
+0.010
 called+0.010
 convey+0.010
Neg logit contributions
 disl-0.015
 Majesty-0.014
 Sorry-0.014
 winners-0.013
 majesty-0.013
Token
Token ID198
Feature activation+2.42e-01

Pos logit contributions
 convey+0.020
 shelf+0.018
 floor+0.018
 strips+0.016
 so+0.016
Neg logit contributions
 disl-0.020
 Sorry-0.017
haw-0.016
rav-0.016
 yeah-0.016
TokenL
Token ID43
Feature activation+1.53e+00

Pos logit contributions
 convey+0.005
 so+0.004
 and+0.004
 shelf+0.004
 called+0.004
Neg logit contributions
 disl-0.008
 Sorry-0.007
 sincere-0.007
 yeah-0.007
 Majesty-0.007
Tokenizzy
Token ID40593
Feature activation+3.16e+00

Pos logit contributions
 convey+0.042
 shelf+0.039
+0.039
On+0.038
 so+0.038
Neg logit contributions
 disl-0.039
 Majesty-0.034
 majesty-0.033
 yeah-0.033
 winners-0.032
Token walked
Token ID6807
Feature activation+7.65e-01

Pos logit contributions
 and+0.011
+0.010
 so+0.010
 called+0.010
 convey+0.010
Neg logit contributions
 disl-0.015
 Majesty-0.014
 Sorry-0.014
 winners-0.014
 majesty-0.013
Token the
Token ID262
Feature activation+2.15e-01

Pos logit contributions
 convey+0.010
+0.010
 shelf+0.010
 floor+0.009
 called+0.009
Neg logit contributions
 disl-0.027
 winners-0.026
 Majesty-0.026
 Sorry-0.026
joy-0.025
Token more
Token ID517
Feature activation+9.00e-02

Pos logit contributions
 and+0.003
 so+0.003
+0.003
 called+0.003
sc+0.003
Neg logit contributions
 disl-0.008
 Majesty-0.008
 Sorry-0.007
 winners-0.007
 majesty-0.007
Token thinking
Token ID3612
Feature activation+2.64e-02

Pos logit contributions
 convey+0.001
 unw+0.001
sc+0.001
On+0.001
ern+0.001
Neg logit contributions
 disl-0.003
 winners-0.003
 Majesty-0.003
 nods-0.002
joy-0.002
Token L
Token ID406
Feature activation+3.72e+00

Pos logit contributions
 convey+0.000
On+0.000
+0.000
 so+0.000
sc+0.000
Neg logit contributions
 disl-0.001
 winners-0.001
 Majesty-0.001
 Sorry-0.001
osity-0.001
Tokenizzy
Token ID40593
Feature activation+6.26e+00

Pos logit contributions
 so+0.092
 and+0.092
 convey+0.092
+0.090
 even+0.087
Neg logit contributions
 disl-0.099
 Majesty-0.090
 majesty-0.087
rav-0.087
 winners-0.086
Token did
Token ID750
Feature activation+1.80e+00

Pos logit contributions
 and+0.081
 so+0.081
 convey+0.080
+0.079
On+0.078
Neg logit contributions
 disl-0.237
 Majesty-0.221
 Sorry-0.214
 winners-0.213
 majesty-0.212
Token.
Token ID13
Feature activation+2.42e-01

Pos logit contributions
 and+0.018
 so+0.017
+0.017
 called+0.016
 even+0.014
Neg logit contributions
 disl-0.077
 Majesty-0.072
 Sorry-0.071
 winners-0.070
 majesty-0.069
Token Finally
Token ID9461
Feature activation+2.51e+00

Pos logit contributions
 and+0.006
 convey+0.005
 called+0.005
 so+0.005
 even+0.005
Neg logit contributions
 disl-0.007
 Sorry-0.006
 Majesty-0.006
osity-0.006
 yeah-0.006
Token,
Token ID11
Feature activation+6.42e-01

Pos logit contributions
sc+0.039
 convey+0.038
+0.038
 shelf+0.037
 and+0.037
Neg logit contributions
 disl-0.086
 Sorry-0.083
 Majesty-0.083
 winners-0.081
 yeah-0.080
Token when
Token ID618
Feature activation+7.56e-01

Pos logit contributions
+0.012
 convey+0.012
sc+0.012
 shelf+0.012
 and+0.012
Neg logit contributions
 disl-0.019
 Sorry-0.018
 Majesty-0.018
 yeah-0.017
 winners-0.017
Token a
Token ID257
Feature activation+1.67e+00

Pos logit contributions
 convey+0.010
sc+0.010
 shelf+0.009
 aisle+0.009
+0.009
Neg logit contributions
 disl-0.014
 winners-0.013
 Sorry-0.013
 wow-0.013
 Majesty-0.013
Token little
Token ID1310
Feature activation+9.13e-01

Pos logit contributions
 and+0.027
 so+0.025
 called+0.025
+0.024
 even+0.023
Neg logit contributions
 disl-0.064
 Majesty-0.059
 Sorry-0.058
 winners-0.057
osity-0.057
Token girl
Token ID2576
Feature activation-6.07e-01

Pos logit contributions
 and+0.027
 so+0.025
 called+0.024
+0.024
 even+0.023
Neg logit contributions
 disl-0.025
 Majesty-0.023
 Sorry-0.021
osity-0.021
 majesty-0.021
Token called
Token ID1444
Feature activation+1.66e+00

Pos logit contributions
 disl+0.014
 writer+0.012
 Sorry+0.012
 sincere+0.012
 sorry+0.011
Neg logit contributions
sc-0.017
 convey-0.015
On-0.015
 floor-0.015
-0.015
Token L
Token ID406
Feature activation+4.53e+00

Pos logit contributions
 and+0.038
 so+0.036
+0.036
On+0.036
 convey+0.036
Neg logit contributions
 disl-0.048
 Sorry-0.045
 Majesty-0.044
 majesty-0.043
 winners-0.042
Tokenizzy
Token ID40593
Feature activation+6.25e+00

Pos logit contributions
 so+0.139
 convey+0.138
 even+0.130
+0.129
 shelf+0.129
Neg logit contributions
rav-0.085
 disl-0.082
 yeah-0.074
 Majesty-0.073
 winners-0.072
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
sc+0.093
On+0.090
+0.089
 convey+0.087
At+0.085
Neg logit contributions
 disl-0.226
 Majesty-0.209
 majesty-0.202
 Sorry-0.194
 writer-0.193
Token She
Token ID1375
Feature activation+7.96e-01

Pos logit contributions
+0.004
 convey+0.004
sc+0.004
 and+0.004
 carefully+0.004
Neg logit contributions
 disl-0.003
 Sorry-0.003
 yeah-0.003
rav-0.003
 Majesty-0.003
Token was
Token ID373
Feature activation+1.74e-01

Pos logit contributions
sc+0.019
 convey+0.019
 so+0.019
+0.019
 and+0.018
Neg logit contributions
 disl-0.021
 Sorry-0.019
 Majesty-0.018
 majesty-0.018
 sincere-0.017
Token three
Token ID1115
Feature activation+1.53e+00

Pos logit contributions
 and+0.002
+0.002
 so+0.002
sc+0.002
 called+0.002
Neg logit contributions
 disl-0.007
 Majesty-0.007
 Sorry-0.007
 winners-0.006
 majesty-0.006
Token years
Token ID812
Feature activation+1.47e+00

Pos logit contributions
sc+0.035
On+0.032
+0.031
 convey+0.029
At+0.028
Neg logit contributions
 disl-0.044
 sincere-0.040
 Majesty-0.039
 nods-0.039
 majesty-0.039
Token teacher
Token ID4701
Feature activation-8.35e-01

Pos logit contributions
 disl+0.010
 Sorry+0.009
 Majesty+0.009
 winners+0.009
 majesty+0.008
Neg logit contributions
-0.008
 and-0.008
 so-0.008
 convey-0.008
On-0.008
Token and
Token ID290
Feature activation+5.55e-01

Pos logit contributions
 disl+0.035
 Majesty+0.033
 winners+0.033
 Sorry+0.032
 majesty+0.032
Neg logit contributions
On-0.007
 convey-0.007
 so-0.007
sc-0.006
 and-0.006
Token she
Token ID673
Feature activation+1.79e-01

Pos logit contributions
 and+0.010
 so+0.009
+0.009
 called+0.009
 even+0.009
Neg logit contributions
 disl-0.019
 Majesty-0.018
 Sorry-0.018
 winners-0.017
 majesty-0.017
Token could
Token ID714
Feature activation+3.19e-01

Pos logit contributions
 and+0.003
 so+0.003
+0.003
 called+0.003
 even+0.003
Neg logit contributions
 disl-0.006
 Majesty-0.006
 Sorry-0.006
 winners-0.006
 majesty-0.006
Token teach
Token ID4545
Feature activation-6.85e-01

Pos logit contributions
 and+0.008
 so+0.007
+0.007
 called+0.007
sc+0.007
Neg logit contributions
 disl-0.009
 Majesty-0.008
 Sorry-0.008
 winners-0.008
 majesty-0.008
Token Bella
Token ID39304
Feature activation+6.23e+00

Pos logit contributions
 disl+0.023
 Sorry+0.021
 Majesty+0.021
 winners+0.021
 yeah+0.020
Neg logit contributions
On-0.012
 convey-0.012
 and-0.011
sc-0.011
-0.011
Token how
Token ID703
Feature activation+3.18e-01

Pos logit contributions
 and+0.090
 so+0.085
+0.085
 called+0.083
 convey+0.079
Neg logit contributions
 disl-0.238
 Majesty-0.221
 Sorry-0.218
 winners-0.215
 majesty-0.212
Token to
Token ID284
Feature activation-5.90e-01

Pos logit contributions
 and+0.002
 so+0.002
+0.002
 even+0.001
 called+0.001
Neg logit contributions
 disl-0.015
 Majesty-0.015
 Sorry-0.015
 majesty-0.014
 winners-0.014
Token swim
Token ID9422
Feature activation-3.83e-02

Pos logit contributions
 disl+0.018
 Majesty+0.017
 Sorry+0.017
 winners+0.016
 majesty+0.016
Neg logit contributions
 and-0.013
 so-0.012
-0.012
 called-0.012
 even-0.012
Token.
Token ID13
Feature activation+1.00e-01

Pos logit contributions
 disl+0.001
 winners+0.001
 Sorry+0.001
 Majesty+0.001
 nods+0.001
Neg logit contributions
 convey-0.001
 shelf-0.000
-0.000
 called-0.000
 floor-0.000
Token Bella
Token ID39304
Feature activation+2.94e+00

Pos logit contributions
 and+0.002
 convey+0.002
 so+0.002
+0.002
 called+0.002
Neg logit contributions
 disl-0.003
 Majesty-0.003
 Sorry-0.003
 winners-0.002
 yeah-0.002
Token of
Token ID286
Feature activation+1.59e+00

Pos logit contributions
 disl+0.049
 Sorry+0.048
 winners+0.046
 yeah+0.045
 sorry+0.044
Neg logit contributions
 convey-0.029
On-0.026
sc-0.024
 shelf-0.023
 called-0.023
Token something
Token ID1223
Feature activation-1.35e+00

Pos logit contributions
 and+0.023
+0.022
 so+0.022
 called+0.021
sc+0.020
Neg logit contributions
 disl-0.061
 Majesty-0.057
 Sorry-0.057
 winners-0.056
 majesty-0.055
Token y
Token ID331
Feature activation+1.21e+00

Pos logit contributions
 disl+0.042
 Sorry+0.038
 winners+0.036
 yeah+0.036
 Majesty+0.035
Neg logit contributions
 convey-0.027
On-0.027
sc-0.026
 aisle-0.024
 and-0.024
Tokenummy
Token ID13513
Feature activation+1.89e+00

Pos logit contributions
 called+0.027
 so+0.027
 convey+0.027
+0.026
 and+0.026
Neg logit contributions
 disl-0.034
 Sorry-0.033
 Majesty-0.030
 winners-0.030
osity-0.030
Token told
Token ID1297
Feature activation+7.81e-01

Pos logit contributions
On+0.026
 convey+0.025
sc+0.024
+0.023
 called+0.023
Neg logit contributions
 disl-0.071
 Sorry-0.064
 winners-0.064
 Majesty-0.064
 majesty-0.062
Token Bella
Token ID39304
Feature activation+6.23e+00

Pos logit contributions
 convey+0.016
On+0.016
 aisle+0.016
sc+0.016
+0.016
Neg logit contributions
 disl-0.023
 Majesty-0.021
 Sorry-0.021
 winners-0.020
 yeah-0.020
Token it
Token ID340
Feature activation+1.24e+00

Pos logit contributions
 convey+0.077
sc+0.071
On+0.071
 called+0.069
 and+0.069
Neg logit contributions
 disl-0.247
 Majesty-0.226
 winners-0.220
 majesty-0.220
 Sorry-0.218
Token was
Token ID373
Feature activation+2.94e+00

Pos logit contributions
 convey+0.020
On+0.018
sc+0.017
 aisle+0.017
At+0.016
Neg logit contributions
 disl-0.043
 Sorry-0.039
 Majesty-0.039
 winners-0.038
 nods-0.038
Token ready
Token ID3492
Feature activation+3.80e+00

Pos logit contributions
 convey+0.056
sc+0.055
 aisle+0.050
and+0.047
So+0.047
Neg logit contributions
 disl-0.094
 sorry-0.089
 sincere-0.087
 Majesty-0.087
 majesty-0.084
Token.
Token ID13
Feature activation+4.27e-01

Pos logit contributions
 convey+0.072
sc+0.067
 aisle+0.064
On+0.062
At+0.060
Neg logit contributions
 disl-0.112
 Majesty-0.105
 winners-0.104
joy-0.104
rav-0.103
Token She
Token ID1375
Feature activation+2.76e-01

Pos logit contributions
 and+0.009
 so+0.009
 convey+0.009
 even+0.008
+0.008
Neg logit contributions
 disl-0.013
 Sorry-0.012
 Majesty-0.012
 majesty-0.011
osity-0.011
Token was
Token ID373
Feature activation+3.87e-01

Pos logit contributions
 disl+0.047
 Sorry+0.044
 yeah+0.042
 Majesty+0.041
 nods+0.041
Neg logit contributions
 convey-0.020
sc-0.019
-0.018
 so-0.018
 called-0.017
Token a
Token ID257
Feature activation+1.44e+00

Pos logit contributions
 convey+0.008
sc+0.008
 shelf+0.007
 aisle+0.007
+0.007
Neg logit contributions
 disl-0.011
 winners-0.011
 Sorry-0.010
 wow-0.010
 Majesty-0.010
Token girl
Token ID2576
Feature activation-8.04e-01

Pos logit contributions
 and+0.023
 so+0.020
 called+0.020
+0.019
 even+0.019
Neg logit contributions
 disl-0.057
 Majesty-0.052
 Sorry-0.051
 winners-0.051
osity-0.051
Token called
Token ID1444
Feature activation+1.48e+00

Pos logit contributions
 disl+0.017
 writer+0.016
 Sorry+0.015
 sincere+0.015
 sorry+0.014
Neg logit contributions
sc-0.023
 convey-0.021
On-0.021
-0.020
 off-0.020
Token L
Token ID406
Feature activation+4.38e+00

Pos logit contributions
 and+0.034
+0.034
 so+0.033
 even+0.032
 convey+0.032
Neg logit contributions
 disl-0.042
 Sorry-0.039
 Majesty-0.038
 majesty-0.037
 winners-0.037
Tokenizzy
Token ID40593
Feature activation+6.22e+00

Pos logit contributions
 so+0.145
 convey+0.145
+0.140
 shelf+0.137
 even+0.137
Neg logit contributions
rav-0.071
 disl-0.071
 Majesty-0.061
 majesty-0.061
 winners-0.061
Token.
Token ID13
Feature activation+7.85e-02

Pos logit contributions
sc+0.105
+0.102
 convey+0.100
On+0.100
At+0.095
Neg logit contributions
 disl-0.206
 Majesty-0.191
 majesty-0.187
 writer-0.180
 winners-0.177
Token She
Token ID1375
Feature activation+7.92e-01

Pos logit contributions
+0.002
 convey+0.002
 and+0.002
 so+0.002
 carefully+0.002
Neg logit contributions
 disl-0.002
 Sorry-0.002
rav-0.002
 yeah-0.002
 Majesty-0.002
Token loved
Token ID6151
Feature activation+1.33e+00

Pos logit contributions
+0.018
 so+0.017
 convey+0.017
sc+0.017
 and+0.017
Neg logit contributions
 disl-0.023
 Sorry-0.021
 Majesty-0.020
 majesty-0.020
 winners-0.019
Token going
Token ID1016
Feature activation-1.22e-01

Pos logit contributions
+0.020
sc+0.019
On+0.016
So+0.015
 and+0.015
Neg logit contributions
 disl-0.052
 Sorry-0.047
 Majesty-0.045
 winners-0.045
 majesty-0.045
Token to
Token ID284
Feature activation+9.28e-02

Pos logit contributions
 winners+0.004
rav+0.004
 Sorry+0.004
 disl+0.004
 Majesty+0.004
Neg logit contributions
-0.002
sc-0.002
 floor-0.002
So-0.002
 closely-0.002
Token a
Token ID257
Feature activation+1.53e+00

Pos logit contributions
 convey+0.007
sc+0.007
 shelf+0.007
 aisle+0.007
 floor+0.006
Neg logit contributions
 disl-0.011
 winners-0.010
 Sorry-0.009
 wow-0.009
 Majesty-0.009
Token little
Token ID1310
Feature activation+8.05e-01

Pos logit contributions
 and+0.024
 so+0.022
 called+0.022
+0.022
 even+0.021
Neg logit contributions
 disl-0.059
 Majesty-0.054
 Sorry-0.054
 winners-0.053
osity-0.052
Token girl
Token ID2576
Feature activation-7.61e-01

Pos logit contributions
 and+0.023
 so+0.022
 called+0.021
+0.021
 even+0.021
Neg logit contributions
 disl-0.022
 Majesty-0.020
 Sorry-0.018
 winners-0.018
osity-0.018
Token called
Token ID1444
Feature activation+1.48e+00

Pos logit contributions
 disl+0.018
 Sorry+0.016
 writer+0.016
 sincere+0.016
 tweet+0.015
Neg logit contributions
sc-0.020
 convey-0.019
On-0.019
 floor-0.018
 so-0.018
Token L
Token ID406
Feature activation+4.37e+00

Pos logit contributions
 and+0.034
 so+0.032
+0.032
On+0.032
 convey+0.032
Neg logit contributions
 disl-0.043
 Sorry-0.040
 Majesty-0.040
 majesty-0.038
 winners-0.038
Tokenizzy
Token ID40593
Feature activation+6.21e+00

Pos logit contributions
 so+0.126
 convey+0.126
 even+0.118
+0.118
 shelf+0.117
Neg logit contributions
 disl-0.096
rav-0.089
 Majesty-0.084
osity-0.082
 majesty-0.082
Token.
Token ID13
Feature activation+7.92e-02

Pos logit contributions
sc+0.101
On+0.098
 convey+0.094
+0.094
At+0.094
Neg logit contributions
 disl-0.215
 Majesty-0.198
 majesty-0.192
osity-0.184
 writer-0.184
Token One
Token ID1881
Feature activation+1.47e+00

Pos logit contributions
+0.002
 convey+0.002
 and+0.002
sc+0.002
 so+0.002
Neg logit contributions
 disl-0.002
 Sorry-0.002
 yeah-0.002
 Majesty-0.002
rav-0.002
Token day
Token ID1110
Feature activation-3.40e-01

Pos logit contributions
 and+0.025
 so+0.024
+0.024
 called+0.023
 convey+0.023
Neg logit contributions
 disl-0.052
 Sorry-0.048
 Majesty-0.047
 winners-0.046
 majesty-0.045
Token,
Token ID11
Feature activation-1.03e-01

Pos logit contributions
 disl+0.014
 Majesty+0.013
 Sorry+0.013
 winners+0.013
 majesty+0.013
Neg logit contributions
 and-0.004
 so-0.004
-0.004
 called-0.004
 even-0.003
Token L
Token ID406
Feature activation+2.56e+00

Pos logit contributions
 disl+0.003
 Sorry+0.003
 Majesty+0.003
 winners+0.003
 yeah+0.003
Neg logit contributions
-0.002
 and-0.002
 so-0.002
 convey-0.002
sc-0.002
Token today
Token ID1909
Feature activation+1.58e+00

Pos logit contributions
 disl+0.072
 Sorry+0.068
 Majesty+0.068
 winners+0.066
 majesty+0.065
Neg logit contributions
-0.018
 and-0.017
sc-0.017
 so-0.017
 convey-0.016
Token,
Token ID11
Feature activation+3.73e-01

Pos logit contributions
 convey+0.022
sc+0.021
On+0.019
+0.018
At+0.017
Neg logit contributions
 disl-0.055
 Sorry-0.054
 winners-0.052
 Majesty-0.052
 yeah-0.051
Token so
Token ID523
Feature activation-2.33e-01

Pos logit contributions
sc+0.006
 convey+0.006
+0.005
On+0.005
 shelf+0.005
Neg logit contributions
 disl-0.013
 Sorry-0.012
 Majesty-0.011
 winners-0.011
 wow-0.011
Token she
Token ID673
Feature activation+2.14e+00

Pos logit contributions
 disl+0.007
 Sorry+0.007
 Majesty+0.007
 winners+0.007
 majesty+0.006
Neg logit contributions
sc-0.005
 convey-0.005
 and-0.005
 shelf-0.004
-0.004
Token showed
Token ID3751
Feature activation+2.17e-01

Pos logit contributions
 convey+0.039
 shelf+0.038
sc+0.038
On+0.038
 so+0.037
Neg logit contributions
 disl-0.067
 Sorry-0.063
 Majesty-0.062
 majesty-0.060
 winners-0.059
Token Emily
Token ID17608
Feature activation+6.21e+00

Pos logit contributions
 convey+0.004
On+0.004
sc+0.004
 shelf+0.004
+0.004
Neg logit contributions
 disl-0.007
 Majesty-0.006
 Sorry-0.006
 winners-0.006
 yeah-0.006
Token her
Token ID607
Feature activation+2.16e+00

Pos logit contributions
sc+0.124
 convey+0.121
 floor+0.105
 shelf+0.105
At+0.104
Neg logit contributions
 disl-0.189
 wow-0.170
 Majesty-0.168
joy-0.162
 winners-0.161
Token new
Token ID649
Feature activation+8.43e-01

Pos logit contributions
sc+0.046
On+0.044
 and+0.044
 convey+0.044
 called+0.043
Neg logit contributions
 disl-0.065
 Majesty-0.060
 Sorry-0.059
 winners-0.058
 majesty-0.056
Token purple
Token ID14032
Feature activation+1.88e+00

Pos logit contributions
+0.018
 and+0.018
 so+0.018
 called+0.017
sc+0.017
Neg logit contributions
 disl-0.026
 Majesty-0.024
 Sorry-0.023
 winners-0.023
 majesty-0.023
Token bag
Token ID6131
Feature activation+1.32e+00

Pos logit contributions
 and+0.042
 so+0.041
+0.040
 called+0.039
On+0.038
Neg logit contributions
 disl-0.057
 Majesty-0.053
 Sorry-0.052
 winners-0.051
 majesty-0.050
Token.
Token ID13
Feature activation+4.53e-01

Pos logit contributions
 convey+0.020
+0.018
sc+0.018
On+0.018
At+0.017
Neg logit contributions
 winners-0.041
 disl-0.040
osity-0.040
 sincere-0.038
 Sorry-0.038
Token friendly
Token ID8030
Feature activation-1.13e+00

Pos logit contributions
 disl+0.024
 Majesty+0.023
osity+0.022
rav+0.022
 Sorry+0.022
Neg logit contributions
 so-0.020
 and-0.019
 called-0.018
-0.018
 even-0.018
Token,
Token ID11
Feature activation+3.68e-02

Pos logit contributions
 disl+0.048
joy+0.046
 winners+0.046
 Majesty+0.045
 yeah+0.044
Neg logit contributions
sc-0.009
-0.009
 convey-0.008
On-0.008
At-0.008
Token and
Token ID290
Feature activation+7.91e-01

Pos logit contributions
sc+0.000
 convey+0.000
 aisle+0.000
 shelf+0.000
ool+0.000
Neg logit contributions
 disl-0.001
 winners-0.001
joy-0.001
 Majesty-0.001
 yeah-0.001
Token he
Token ID339
Feature activation-4.56e-01

Pos logit contributions
 and+0.017
 so+0.016
 called+0.016
+0.015
 even+0.015
Neg logit contributions
 disl-0.026
 Majesty-0.024
 Sorry-0.024
 winners-0.023
 majesty-0.023
Token taught
Token ID7817
Feature activation+7.03e-01

Pos logit contributions
 disl+0.015
 Majesty+0.014
 Sorry+0.013
 majesty+0.013
 winners+0.013
Neg logit contributions
 and-0.009
 so-0.009
 convey-0.009
-0.009
sc-0.008
Token Ted
Token ID11396
Feature activation+6.13e+00

Pos logit contributions
 and+0.015
+0.014
 so+0.014
 called+0.014
sc+0.013
Neg logit contributions
 disl-0.022
 Majesty-0.020
 Sorry-0.020
 winners-0.020
 majesty-0.019
Token lots
Token ID6041
Feature activation+3.47e+00

Pos logit contributions
 and+0.091
 so+0.086
+0.083
 called+0.079
 even+0.076
Neg logit contributions
 disl-0.237
 Sorry-0.225
 Majesty-0.224
 winners-0.216
 majesty-0.215
Token of
Token ID286
Feature activation+1.96e+00

Pos logit contributions
 convey+0.058
On+0.050
+0.050
 shelf+0.048
 aisle+0.047
Neg logit contributions
 disl-0.123
 winners-0.113
 yeah-0.107
haw-0.106
 nods-0.105
Token new
Token ID649
Feature activation-5.06e-01

Pos logit contributions
 and+0.049
 so+0.047
 called+0.046
+0.044
 even+0.044
Neg logit contributions
 disl-0.056
 Sorry-0.053
 Majesty-0.053
osity-0.049
 winners-0.049
Token things
Token ID1243
Feature activation+5.54e-01

Pos logit contributions
 disl+0.013
 Sorry+0.012
 Majesty+0.012
 winners+0.011
 majesty+0.011
Neg logit contributions
 and-0.014
 so-0.013
-0.013
 called-0.013
 even-0.012
Token.
Token ID13
Feature activation+6.34e-01

Pos logit contributions
 convey+0.006
+0.006
 and+0.006
 called+0.005
On+0.005
Neg logit contributions
 disl-0.023
 winners-0.021
 Sorry-0.021
 Majesty-0.020
 yeah-0.020
Token
Token ID198
Feature activation+9.10e-01

Pos logit contributions
 disl+0.021
 Sorry+0.019
 Majesty+0.019
 winners+0.019
 yeah+0.018
Neg logit contributions
 convey-0.009
 so-0.008
 called-0.008
 and-0.008
-0.008
Token
Token ID198
Feature activation+4.63e-01

Pos logit contributions
 convey+0.015
 so+0.015
 shelf+0.015
sc+0.015
 and+0.014
Neg logit contributions
 disl-0.031
 Majesty-0.028
 Sorry-0.028
rav-0.027
 winners-0.027
TokenHer
Token ID9360
Feature activation-5.27e-01

Pos logit contributions
 convey+0.011
 and+0.010
 so+0.010
 shelf+0.010
 called+0.010
Neg logit contributions
 disl-0.013
 Sorry-0.012
 Majesty-0.012
 winners-0.011
rav-0.011
Token mom
Token ID1995
Feature activation-1.24e+00

Pos logit contributions
 disl+0.016
 Majesty+0.015
 majesty+0.015
 Sorry+0.015
 winners+0.015
Neg logit contributions
sc-0.011
 so-0.011
-0.011
 and-0.011
On-0.011
Token showed
Token ID3751
Feature activation+1.46e+00

Pos logit contributions
 disl+0.031
 nods+0.030
 winners+0.028
 Majesty+0.028
 yeah+0.028
Neg logit contributions
 convey-0.029
sc-0.029
 so-0.028
On-0.028
 shelf-0.028
Token Julie
Token ID21946
Feature activation+6.12e+00

Pos logit contributions
+0.024
 convey+0.024
On+0.024
 called+0.023
 and+0.023
Neg logit contributions
 disl-0.050
 Majesty-0.047
 winners-0.045
 Sorry-0.044
 majesty-0.044
Token how
Token ID703
Feature activation-4.70e-01

Pos logit contributions
 convey+0.118
sc+0.117
 called+0.112
On+0.107
 shelf+0.107
Neg logit contributions
 disl-0.196
 Majesty-0.181
 winners-0.172
 majesty-0.171
 nods-0.170
Token to
Token ID284
Feature activation-7.18e-01

Pos logit contributions
 disl+0.021
 Sorry+0.019
 Majesty+0.019
 winners+0.018
 majesty+0.018
Neg logit contributions
 and-0.004
 so-0.004
-0.004
sc-0.004
 called-0.004
Token use
Token ID779
Feature activation+2.85e+00

Pos logit contributions
 disl+0.022
 Majesty+0.019
 yeah+0.019
 Sorry+0.019
 winners+0.019
Neg logit contributions
sc-0.014
 so-0.014
-0.014
 convey-0.013
 and-0.013
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
sc+0.057
On+0.057
So+0.055
and+0.055
At+0.053
Neg logit contributions
 disl-0.082
 winners-0.073
 yeah-0.070
 wow-0.070
 Majesty-0.070
Token comb
Token ID1974
Feature activation-5.62e-01

Pos logit contributions
+0.029
 and+0.028
 called+0.028
On+0.028
 so+0.028
Neg logit contributions
 disl-0.043
 Majesty-0.041
 winners-0.040
 majesty-0.039
 yeah-0.038
Token kept
Token ID4030
Feature activation+1.01e+00

Pos logit contributions
sc+0.024
and+0.022
 convey+0.022
 aisle+0.022
On+0.021
Neg logit contributions
 disl-0.020
 nods-0.019
 sincere-0.019
 Sorry-0.019
 Majesty-0.019
Token crying
Token ID13774
Feature activation-3.62e+00

Pos logit contributions
sc+0.026
+0.025
On+0.024
 shelf+0.023
So+0.023
Neg logit contributions
 disl-0.026
 sincere-0.024
joy-0.023
 sorry-0.023
 Sorry-0.023
Token and
Token ID290
Feature activation+9.68e-01

Pos logit contributions
 disl+0.145
 sorry+0.145
 sincere+0.142
 winners+0.142
 Majesty+0.140
Neg logit contributions
sc-0.030
-0.029
 convey-0.026
 aisle-0.025
On-0.024
Token throwing
Token ID9644
Feature activation-4.03e-01

Pos logit contributions
 and+0.021
+0.020
 so+0.020
sc+0.020
 convey+0.019
Neg logit contributions
 disl-0.030
 Sorry-0.027
 Majesty-0.027
 winners-0.026
 majesty-0.026
Token a
Token ID257
Feature activation+9.09e-01

Pos logit contributions
 disl+0.013
 winners+0.011
 nods+0.011
 sorry+0.011
 Sorry+0.011
Neg logit contributions
-0.007
 convey-0.007
On-0.007
 called-0.007
sc-0.007
Token tant
Token ID24246
Feature activation-7.30e+00

Pos logit contributions
 called+0.023
 so+0.023
+0.022
 convey+0.022
 even+0.021
Neg logit contributions
 disl-0.022
 Sorry-0.020
 winners-0.019
 yeah-0.018
 majesty-0.018
Tokenrum
Token ID6582
Feature activation-2.31e-01

Pos logit contributions
 disl+0.157
 Majesty+0.135
 Sorry+0.133
 winners+0.128
 majesty+0.125
Neg logit contributions
 and-0.229
 so-0.225
-0.225
 called-0.219
On-0.215
Token.
Token ID13
Feature activation+2.31e-01

Pos logit contributions
 disl+0.009
 Majesty+0.009
 Sorry+0.009
 winners+0.008
osity+0.008
Neg logit contributions
sc-0.003
-0.003
 convey-0.002
On-0.002
 called-0.002
Token
Token ID198
Feature activation+7.69e-01

Pos logit contributions
 and+0.004
 convey+0.004
+0.004
sc+0.004
 so+0.004
Neg logit contributions
 disl-0.007
 Sorry-0.007
 Majesty-0.007
 yeah-0.006
 sincere-0.006
Token
Token ID198
Feature activation+2.93e-01

Pos logit contributions
 convey+0.020
 shelf+0.018
sc+0.018
 floor+0.017
 strips+0.017
Neg logit contributions
 disl-0.018
 Sorry-0.017
rav-0.016
 sincere-0.015
 understands-0.015
TokenFinally
Token ID11158
Feature activation+2.15e+00

Pos logit contributions
 convey+0.006
 so+0.006
 and+0.006
sc+0.005
 called+0.005
Neg logit contributions
 disl-0.009
 Sorry-0.008
 sincere-0.008
 Majesty-0.008
 hey-0.008
Token to
Token ID284
Feature activation-8.67e-01

Pos logit contributions
sc+0.013
+0.012
 convey+0.011
So+0.011
 called+0.011
Neg logit contributions
 disl-0.034
joy-0.030
 Majesty-0.030
 sincere-0.029
 winners-0.029
Token cry
Token ID3960
Feature activation-3.44e+00

Pos logit contributions
 disl+0.022
 Majesty+0.020
 Sorry+0.020
 winners+0.019
 majesty+0.019
Neg logit contributions
 and-0.024
 so-0.023
-0.023
 called-0.022
sc-0.022
Token and
Token ID290
Feature activation+4.96e-01

Pos logit contributions
 disl+0.144
 Sorry+0.138
 Majesty+0.138
 sorry+0.135
 sincere+0.134
Neg logit contributions
sc-0.027
-0.025
 aisle-0.023
 convey-0.023
On-0.021
Token threw
Token ID9617
Feature activation-2.75e-01

Pos logit contributions
 and+0.009
 so+0.008
+0.008
 called+0.008
 even+0.008
Neg logit contributions
 disl-0.018
 Majesty-0.017
 Sorry-0.016
 winners-0.016
 majesty-0.016
Token a
Token ID257
Feature activation+4.28e-01

Pos logit contributions
 disl+0.009
 Sorry+0.008
 winners+0.008
 Majesty+0.008
 yeah+0.008
Neg logit contributions
On-0.005
-0.005
 convey-0.005
 called-0.004
sc-0.004
Token tant
Token ID24246
Feature activation-7.17e+00

Pos logit contributions
 and+0.009
 so+0.009
+0.009
 called+0.009
 convey+0.009
Neg logit contributions
 disl-0.013
 Sorry-0.012
 winners-0.011
 Majesty-0.011
 yeah-0.011
Tokenrum
Token ID6582
Feature activation-5.79e-01

Pos logit contributions
 disl+0.161
 Majesty+0.142
 Sorry+0.140
 winners+0.134
 majesty+0.132
Neg logit contributions
 and-0.218
 so-0.213
-0.212
 called-0.208
On-0.202
Token.
Token ID13
Feature activation+2.22e-01

Pos logit contributions
 disl+0.023
 Majesty+0.022
 Sorry+0.022
 winners+0.021
 sincere+0.021
Neg logit contributions
-0.006
sc-0.006
On-0.006
 convey-0.006
 so-0.006
Token She
Token ID1375
Feature activation+2.58e-01

Pos logit contributions
 and+0.005
+0.005
 so+0.005
 called+0.005
sc+0.005
Neg logit contributions
 disl-0.007
 Majesty-0.006
 Sorry-0.006
 majesty-0.006
 winners-0.006
Token didn
Token ID1422
Feature activation-4.78e-01

Pos logit contributions
sc+0.007
 aisle+0.006
 convey+0.006
On+0.006
 shelf+0.006
Neg logit contributions
 disl-0.006
 sincere-0.006
 Sorry-0.006
 Majesty-0.005
 nods-0.005
Token't
Token ID470
Feature activation-3.93e-01

Pos logit contributions
 disl+0.004
 Sorry+0.004
 sorry+0.004
rav+0.003
 winners+0.003
Neg logit contributions
 called-0.018
 convey-0.017
sc-0.017
 shelf-0.017
 floor-0.017
Token listen
Token ID6004
Feature activation+9.11e-01

Pos logit contributions
 Majesty+0.051
 majesty+0.048
 disl+0.048
 Sorry+0.047
joy+0.046
Neg logit contributions
 and-0.042
 even-0.039
-0.037
 so-0.037
 called-0.037
Token.
Token ID13
Feature activation-3.16e-01

Pos logit contributions
 convey+0.010
+0.009
sc+0.009
 called+0.009
 shelf+0.008
Neg logit contributions
 disl-0.037
 Majesty-0.033
 Sorry-0.033
 winners-0.033
 sincere-0.033
Token She
Token ID1375
Feature activation-2.46e-01

Pos logit contributions
 disl+0.009
 Majesty+0.009
 Sorry+0.009
 majesty+0.008
 winners+0.008
Neg logit contributions
 and-0.007
-0.007
 so-0.007
 called-0.007
sc-0.007
Token threw
Token ID9617
Feature activation+2.26e-01

Pos logit contributions
 disl+0.007
 Majesty+0.006
 Sorry+0.006
 nods+0.006
 majesty+0.006
Neg logit contributions
 convey-0.005
sc-0.005
 so-0.005
-0.005
 shelf-0.005
Token a
Token ID257
Feature activation+2.42e-01

Pos logit contributions
+0.004
On+0.004
 called+0.004
 convey+0.003
 and+0.003
Neg logit contributions
 disl-0.008
 winners-0.007
 Sorry-0.007
 Majesty-0.007
 yeah-0.007
Token tant
Token ID24246
Feature activation-7.02e+00

Pos logit contributions
 and+0.006
 so+0.006
+0.006
 called+0.006
 even+0.005
Neg logit contributions
 disl-0.007
 Sorry-0.006
 Majesty-0.006
 winners-0.006
 majesty-0.006
Tokenrum
Token ID6582
Feature activation-5.62e-01

Pos logit contributions
 disl+0.170
 Majesty+0.159
 Sorry+0.156
 winners+0.149
 majesty+0.147
Neg logit contributions
 and-0.201
 so-0.192
 called-0.189
-0.187
 even-0.183
Token in
Token ID287
Feature activation+2.82e-02

Pos logit contributions
 disl+0.025
 Majesty+0.024
 Sorry+0.024
 winners+0.024
 majesty+0.023
Neg logit contributions
-0.003
sc-0.003
 convey-0.003
 called-0.003
 so-0.003
Token the
Token ID262
Feature activation-2.79e+00

Pos logit contributions
sc+0.001
On+0.001
 convey+0.001
At+0.001
+0.001
Neg logit contributions
 disl-0.001
 winners-0.001
 sorry-0.001
joy-0.001
 Sorry-0.001
Token store
Token ID3650
Feature activation+2.20e-01

Pos logit contributions
 disl+0.101
 Majesty+0.095
 Sorry+0.093
 winners+0.091
 majesty+0.090
Neg logit contributions
 and-0.048
 so-0.046
-0.045
 called-0.044
 even-0.041
Token.
Token ID13
Feature activation-6.51e-01

Pos logit contributions
+0.002
On+0.002
sc+0.002
 convey+0.002
At+0.002
Neg logit contributions
 disl-0.009
 winners-0.009
 Majesty-0.008
 Sorry-0.008
 sincere-0.008
Token sad
Token ID6507
Feature activation-4.21e+00

Pos logit contributions
 and+0.015
 so+0.015
 called+0.015
+0.014
 even+0.014
Neg logit contributions
 disl-0.022
 Majesty-0.021
 winners-0.020
 majesty-0.020
 Sorry-0.020
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
 disl+0.177
 Sorry+0.166
 Majesty+0.165
 winners+0.162
 majesty+0.159
Neg logit contributions
-0.042
 and-0.040
 so-0.039
 called-0.038
sc-0.037
Token She
Token ID1375
Feature activation+4.12e-01

Pos logit contributions
 and+0.002
+0.002
sc+0.002
 convey+0.002
 so+0.002
Neg logit contributions
 disl-0.004
 Sorry-0.004
 Majesty-0.004
 yeah-0.004
 majesty-0.004
Token threw
Token ID9617
Feature activation-1.74e-01

Pos logit contributions
sc+0.010
 aisle+0.010
 convey+0.010
+0.009
 shelf+0.009
Neg logit contributions
 disl-0.011
 Sorry-0.010
 sincere-0.010
 Majesty-0.010
 nods-0.009
Token a
Token ID257
Feature activation+4.74e-01

Pos logit contributions
 disl+0.006
 winners+0.006
 Sorry+0.005
 Majesty+0.005
 yeah+0.005
Neg logit contributions
-0.003
On-0.003
 called-0.003
 convey-0.003
sc-0.002
Token tant
Token ID24246
Feature activation-7.02e+00

Pos logit contributions
 so+0.011
 and+0.011
 called+0.011
+0.011
 even+0.011
Neg logit contributions
 disl-0.013
 Sorry-0.012
 winners-0.012
 Majesty-0.011
 yeah-0.011
Tokenrum
Token ID6582
Feature activation-6.22e-01

Pos logit contributions
 disl+0.157
 Majesty+0.140
 Sorry+0.138
 winners+0.132
 majesty+0.129
Neg logit contributions
 and-0.214
 so-0.209
-0.208
 called-0.204
 even-0.198
Token and
Token ID290
Feature activation-3.82e-01

Pos logit contributions
 disl+0.029
 Majesty+0.027
 Sorry+0.027
 winners+0.027
 majesty+0.026
Neg logit contributions
-0.003
sc-0.003
 so-0.003
 called-0.003
 and-0.003
Token cried
Token ID16896
Feature activation-2.16e+00

Pos logit contributions
 disl+0.012
 Sorry+0.011
 Majesty+0.011
 winners+0.010
 yeah+0.010
Neg logit contributions
-0.008
sc-0.008
 and-0.007
On-0.007
 so-0.007
Token loudly
Token ID23112
Feature activation-4.15e-01

Pos logit contributions
 disl+0.077
 sorry+0.074
 winners+0.073
 Majesty+0.073
 Sorry+0.073
Neg logit contributions
-0.027
sc-0.027
 aisle-0.027
 convey-0.026
On-0.025
Token.
Token ID13
Feature activation+3.38e-01

Pos logit contributions
 disl+0.018
 Majesty+0.017
 Sorry+0.017
 winners+0.017
 majesty+0.017
Neg logit contributions
 and-0.004
 so-0.004
-0.003
 called-0.003
 even-0.003
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
sc+0.016
 convey+0.016
+0.015
 called+0.015
On+0.013
Neg logit contributions
 disl-0.041
 Sorry-0.037
joy-0.036
 sincere-0.036
 winners-0.036
Token He
Token ID679
Feature activation-6.85e-01

Pos logit contributions
 disl+0.008
 Majesty+0.007
 Sorry+0.007
 winners+0.007
 majesty+0.007
Neg logit contributions
 and-0.007
 so-0.006
-0.006
 called-0.006
On-0.006
Token kept
Token ID4030
Feature activation+1.00e+00

Pos logit contributions
 disl+0.019
 Sorry+0.017
 Majesty+0.017
 nods+0.017
 sincere+0.016
Neg logit contributions
-0.015
 so-0.015
sc-0.015
 convey-0.015
 and-0.015
Token throwing
Token ID9644
Feature activation+4.33e-02

Pos logit contributions
+0.021
 and+0.021
 so+0.020
sc+0.020
 called+0.020
Neg logit contributions
 disl-0.032
 Majesty-0.029
 Sorry-0.029
 winners-0.028
 majesty-0.027
Token a
Token ID257
Feature activation+2.72e-01

Pos logit contributions
+0.001
 called+0.001
On+0.001
 so+0.001
 and+0.001
Neg logit contributions
 disl-0.001
 Sorry-0.001
 winners-0.001
 Majesty-0.001
 majesty-0.001
Token tant
Token ID24246
Feature activation-6.91e+00

Pos logit contributions
 and+0.005
 so+0.005
+0.005
 called+0.005
 even+0.005
Neg logit contributions
 disl-0.009
 Majesty-0.008
 Sorry-0.008
 winners-0.008
 majesty-0.008
Tokenrum
Token ID6582
Feature activation-1.80e-01

Pos logit contributions
 disl+0.169
 Majesty+0.160
 Sorry+0.157
 winners+0.150
 majesty+0.148
Neg logit contributions
 and-0.196
 so-0.186
 called-0.183
-0.182
 even-0.177
Token.
Token ID13
Feature activation-6.07e-01

Pos logit contributions
 disl+0.007
 Majesty+0.007
 Sorry+0.007
 winners+0.007
 sincere+0.006
Neg logit contributions
sc-0.002
-0.002
 convey-0.002
On-0.002
 called-0.002
Token His
Token ID2399
Feature activation-1.59e+00

Pos logit contributions
 disl+0.018
 Majesty+0.017
 Sorry+0.016
 winners+0.016
 majesty+0.016
Neg logit contributions
 and-0.015
 so-0.014
-0.014
 called-0.013
On-0.013
Token mom
Token ID1995
Feature activation-9.90e-01

Pos logit contributions
 disl+0.061
 Majesty+0.056
 Sorry+0.055
osity+0.055
 winners+0.054
Neg logit contributions
 and-0.026
 so-0.023
 called-0.022
 shelf-0.022
-0.021
Token was
Token ID373
Feature activation+4.30e-01

Pos logit contributions
 disl+0.027
 nods+0.026
 sincere+0.026
 winners+0.025
 yeah+0.025
Neg logit contributions
sc-0.020
On-0.019
 shelf-0.019
 convey-0.019
-0.019
Token She
Token ID1375
Feature activation+5.17e-01

Pos logit contributions
 and+0.004
 so+0.004
+0.004
 called+0.004
sc+0.004
Neg logit contributions
 disl-0.006
 Majesty-0.005
 Sorry-0.005
 majesty-0.005
 winners-0.005
Token cried
Token ID16896
Feature activation-1.71e+00

Pos logit contributions
sc+0.013
 convey+0.012
and+0.012
 shelf+0.012
 aisle+0.012
Neg logit contributions
 disl-0.012
 nods-0.012
 Majesty-0.011
 sincere-0.011
 Sorry-0.011
Token and
Token ID290
Feature activation+1.95e-01

Pos logit contributions
 disl+0.073
 sorry+0.070
 Sorry+0.070
 Majesty+0.070
 winners+0.070
Neg logit contributions
-0.010
sc-0.009
 convey-0.008
On-0.008
 aisle-0.008
Token threw
Token ID9617
Feature activation+1.49e-01

Pos logit contributions
+0.004
 and+0.004
 so+0.004
sc+0.004
On+0.004
Neg logit contributions
 disl-0.006
 Sorry-0.006
 Majesty-0.006
 winners-0.005
 majesty-0.005
Token a
Token ID257
Feature activation+5.18e-01

Pos logit contributions
+0.003
On+0.003
 convey+0.002
 called+0.002
sc+0.002
Neg logit contributions
 disl-0.005
 winners-0.004
 Sorry-0.004
 Majesty-0.004
 sorry-0.004
Token tant
Token ID24246
Feature activation-6.88e+00

Pos logit contributions
 and+0.012
 so+0.012
+0.012
 called+0.011
 even+0.011
Neg logit contributions
 disl-0.015
 Majesty-0.014
 Sorry-0.014
 winners-0.013
 majesty-0.013
Tokenrum
Token ID6582
Feature activation-5.25e-01

Pos logit contributions
 disl+0.162
 Majesty+0.148
 Sorry+0.146
 winners+0.139
 majesty+0.137
Neg logit contributions
 and-0.202
 so-0.195
-0.192
 called-0.192
 even-0.186
Token.
Token ID13
Feature activation+2.39e-02

Pos logit contributions
 disl+0.022
 Majesty+0.021
 Sorry+0.021
 winners+0.021
 majesty+0.020
Neg logit contributions
-0.005
 and-0.005
 so-0.004
 called-0.004
sc-0.004
Token Her
Token ID2332
Feature activation-8.32e-01

Pos logit contributions
 and+0.001
 so+0.000
+0.000
 called+0.000
 even+0.000
Neg logit contributions
 disl-0.001
 Majesty-0.001
 Sorry-0.001
 winners-0.001
 majesty-0.001
Token mom
Token ID1995
Feature activation-6.85e-01

Pos logit contributions
 disl+0.034
osity+0.031
 Majesty+0.031
 Sorry+0.031
 yeah+0.031
Neg logit contributions
 and-0.012
 so-0.011
 called-0.010
 even-0.010
 shelf-0.010
Token tried
Token ID3088
Feature activation+1.62e+00

Pos logit contributions
 disl+0.018
 sincere+0.017
 winners+0.017
 Sorry+0.017
 nods+0.017
Neg logit contributions
 shelf-0.014
On-0.014
sc-0.014
 convey-0.014
 aisle-0.014
Token and
Token ID290
Feature activation+7.05e-01

Pos logit contributions
 disl+0.144
 Majesty+0.136
 winners+0.136
 sorry+0.134
 Sorry+0.133
Neg logit contributions
sc-0.023
On-0.022
 convey-0.020
-0.020
At-0.019
Token started
Token ID2067
Feature activation+1.31e+00

Pos logit contributions
 and+0.013
+0.013
 so+0.013
sc+0.012
On+0.012
Neg logit contributions
 disl-0.024
 Sorry-0.022
 Majesty-0.021
 winners-0.021
 majesty-0.020
Token to
Token ID284
Feature activation-7.29e-01

Pos logit contributions
sc+0.019
+0.018
 convey+0.017
 shelf+0.016
On+0.015
Neg logit contributions
 disl-0.050
joy-0.043
 Sorry-0.043
 Majesty-0.043
 sincere-0.043
Token throw
Token ID3714
Feature activation-1.32e-01

Pos logit contributions
 disl+0.017
 Majesty+0.015
 Sorry+0.015
 winners+0.014
 majesty+0.014
Neg logit contributions
 and-0.021
 so-0.021
-0.021
 called-0.020
sc-0.020
Token a
Token ID257
Feature activation+6.64e-01

Pos logit contributions
 disl+0.004
 Sorry+0.004
 Majesty+0.004
 winners+0.004
 nods+0.004
Neg logit contributions
On-0.002
 convey-0.002
-0.002
 called-0.002
sc-0.002
Token tant
Token ID24246
Feature activation-6.84e+00

Pos logit contributions
 and+0.013
+0.013
 so+0.013
 called+0.012
 convey+0.012
Neg logit contributions
 disl-0.021
 Majesty-0.020
 Sorry-0.020
 winners-0.019
 majesty-0.019
Tokenrum
Token ID6582
Feature activation-1.93e-01

Pos logit contributions
 disl+0.151
 Majesty+0.132
 Sorry+0.130
 winners+0.124
 majesty+0.122
Neg logit contributions
 and-0.211
 so-0.207
-0.207
 called-0.202
On-0.197
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
 disl+0.007
 Sorry+0.007
 Majesty+0.007
 winners+0.006
 sincere+0.006
Neg logit contributions
sc-0.003
 convey-0.002
On-0.002
At-0.002
-0.002
Token Suddenly
Token ID24975
Feature activation+4.64e-02

Pos logit contributions
 and+0.007
+0.007
 so+0.007
 called+0.006
sc+0.006
Neg logit contributions
 disl-0.008
 Sorry-0.008
 Majesty-0.008
 majesty-0.007
 yeah-0.007
Token,
Token ID11
Feature activation-2.77e-01

Pos logit contributions
 and+0.000
 so+0.000
+0.000
 called+0.000
 even+0.000
Neg logit contributions
 disl-0.002
 Majesty-0.002
 Sorry-0.002
 winners-0.002
 majesty-0.002
Token she
Token ID673
Feature activation+2.64e-01

Pos logit contributions
 disl+0.012
 Majesty+0.012
 majesty+0.011
osity+0.011
haw+0.011
Neg logit contributions
 and-0.004
 so-0.004
 even-0.003
 called-0.003
-0.003
Token and
Token ID290
Feature activation+5.77e-01

Pos logit contributions
 winners+0.010
joy+0.010
 disl+0.010
 nods+0.010
 yeah+0.009
Neg logit contributions
 convey-0.003
sc-0.003
On-0.003
-0.003
At-0.003
Token so
Token ID523
Feature activation-1.28e+00

Pos logit contributions
 and+0.013
 so+0.013
+0.012
 called+0.012
 even+0.012
Neg logit contributions
 disl-0.018
 Majesty-0.017
 Sorry-0.016
 winners-0.016
 majesty-0.016
Token she
Token ID673
Feature activation+1.46e+00

Pos logit contributions
 disl+0.049
 Majesty+0.046
 winners+0.045
 Sorry+0.044
 majesty+0.043
Neg logit contributions
 and-0.023
 so-0.021
-0.020
 called-0.020
 even-0.019
Token threw
Token ID9617
Feature activation-4.02e-01

Pos logit contributions
 and+0.030
sc+0.030
On+0.029
 convey+0.029
+0.029
Neg logit contributions
 disl-0.040
 Sorry-0.039
 Majesty-0.037
 sincere-0.036
haw-0.036
Token a
Token ID257
Feature activation+5.95e-01

Pos logit contributions
 disl+0.012
 winners+0.011
 nods+0.011
 Sorry+0.011
joy+0.010
Neg logit contributions
 convey-0.008
 shelf-0.007
On-0.007
sc-0.007
 aisle-0.007
Token tant
Token ID24246
Feature activation-6.84e+00

Pos logit contributions
+0.011
 and+0.011
 so+0.011
 called+0.011
On+0.010
Neg logit contributions
 disl-0.019
 Sorry-0.018
 winners-0.017
 Majesty-0.017
 majesty-0.017
Tokenrum
Token ID6582
Feature activation-6.56e-01

Pos logit contributions
 disl+0.153
 Majesty+0.136
 Sorry+0.133
 winners+0.128
 majesty+0.126
Neg logit contributions
 and-0.208
 so-0.204
-0.203
 called-0.199
On-0.193
Token when
Token ID618
Feature activation+1.06e+00

Pos logit contributions
 disl+0.025
 Majesty+0.025
 Sorry+0.024
 winners+0.024
osity+0.023
Neg logit contributions
-0.007
 convey-0.007
sc-0.007
On-0.007
 called-0.006
Token her
Token ID607
Feature activation-5.94e-01

Pos logit contributions
 and+0.018
 so+0.017
+0.017
 called+0.017
 even+0.016
Neg logit contributions
 disl-0.039
 Majesty-0.036
 Sorry-0.035
 majesty-0.035
 winners-0.035
Token parents
Token ID3397
Feature activation-2.67e+00

Pos logit contributions
 disl+0.019
 Majesty+0.018
 Sorry+0.018
 winners+0.017
 majesty+0.017
Neg logit contributions
 and-0.012
 so-0.012
-0.012
 called-0.011
 even-0.011
Token said
Token ID531
Feature activation+9.62e-01

Pos logit contributions
 disl+0.064
 Sorry+0.062
 winners+0.061
 sincere+0.058
 yeah+0.057
Neg logit contributions
On-0.059
 shelf-0.059
 convey-0.058
sc-0.057
 floor-0.056
Token her
Token ID607
Feature activation-1.05e+00

Pos logit contributions
 disl+0.001
 Majesty+0.001
 Sorry+0.001
 winners+0.001
joy+0.001
Neg logit contributions
-0.001
On-0.001
sc-0.001
 convey-0.001
 called-0.001
Token feet
Token ID3625
Feature activation+6.68e-01

Pos logit contributions
 disl+0.031
 Majesty+0.031
 winners+0.030
 majesty+0.030
 Sorry+0.029
Neg logit contributions
-0.022
sc-0.022
On-0.021
 called-0.021
 so-0.020
Token and
Token ID290
Feature activation-7.07e-01

Pos logit contributions
 convey+0.006
 pic+0.005
sc+0.005
On+0.005
 shelf+0.004
Neg logit contributions
 winners-0.025
 disl-0.024
 sorry-0.024
 sincere-0.023
osity-0.023
Token threw
Token ID9617
Feature activation+2.31e-01

Pos logit contributions
 disl+0.025
 Majesty+0.023
 Sorry+0.023
 winners+0.022
 majesty+0.022
Neg logit contributions
 and-0.013
 so-0.012
-0.012
 called-0.012
 even-0.011
Token a
Token ID257
Feature activation+4.87e-01

Pos logit contributions
+0.004
On+0.004
 called+0.004
 convey+0.004
 and+0.004
Neg logit contributions
 disl-0.008
 winners-0.007
 Sorry-0.007
 Majesty-0.007
 yeah-0.007
Token tant
Token ID24246
Feature activation-6.81e+00

Pos logit contributions
 and+0.012
 so+0.012
+0.012
 called+0.012
 even+0.011
Neg logit contributions
 disl-0.013
 Majesty-0.012
 Sorry-0.012
 winners-0.012
 majesty-0.011
Tokenrum
Token ID6582
Feature activation-4.92e-01

Pos logit contributions
 disl+0.148
 Majesty+0.129
 Sorry+0.126
 winners+0.121
 majesty+0.119
Neg logit contributions
 and-0.212
 so-0.209
-0.209
 called-0.203
On-0.200
Token.
Token ID13
Feature activation-5.00e-02

Pos logit contributions
 disl+0.020
 Majesty+0.019
 Sorry+0.019
 winners+0.019
 majesty+0.018
Neg logit contributions
-0.005
 and-0.005
 so-0.005
sc-0.005
 called-0.004
Token She
Token ID1375
Feature activation-1.81e-01

Pos logit contributions
 disl+0.001
 Majesty+0.001
 Sorry+0.001
 majesty+0.001
 winners+0.001
Neg logit contributions
 and-0.001
-0.001
 so-0.001
 called-0.001
sc-0.001
Token grabbed
Token ID13176
Feature activation-8.25e-01

Pos logit contributions
 disl+0.005
 Sorry+0.005
 sincere+0.004
 Majesty+0.004
 nods+0.004
Neg logit contributions
-0.004
sc-0.004
On-0.004
 aisle-0.004
 shelf-0.004
Token the
Token ID262
Feature activation-4.17e-02

Pos logit contributions
 disl+0.027
 winners+0.025
 Sorry+0.024
 Majesty+0.024
 yeah+0.024
Neg logit contributions
On-0.014
-0.014
sc-0.014
 and-0.013
 called-0.013
TokenWho
Token ID8241
Feature activation-3.01e+00

Pos logit contributions
 disl+0.010
 Majesty+0.010
 Sorry+0.009
 majesty+0.009
 winners+0.009
Neg logit contributions
sc-0.013
-0.013
 and-0.013
 so-0.013
 convey-0.012
Token goes
Token ID2925
Feature activation-1.38e-01

Pos logit contributions
 disl+0.105
 Majesty+0.098
 Sorry+0.095
joy+0.094
osity+0.093
Neg logit contributions
 and-0.059
-0.057
 so-0.057
 called-0.054
 even-0.053
Token there
Token ID612
Feature activation-1.45e+00

Pos logit contributions
 disl+0.005
 Majesty+0.005
 Sorry+0.005
 winners+0.004
 majesty+0.004
Neg logit contributions
 and-0.002
-0.002
 so-0.002
sc-0.002
 called-0.002
Token?"
Token ID1701
Feature activation-2.67e+00

Pos logit contributions
 disl+0.048
 Sorry+0.045
 Majesty+0.045
 winners+0.044
 majesty+0.043
Neg logit contributions
 convey-0.026
sc-0.024
-0.024
 called-0.023
On-0.023
Token a
Token ID257
Feature activation-1.47e+00

Pos logit contributions
 disl+0.096
 Sorry+0.090
 Majesty+0.089
 winners+0.086
 majesty+0.085
Neg logit contributions
 and-0.044
 so-0.042
-0.041
 called-0.040
sc-0.039
Token voice
Token ID3809
Feature activation-6.78e+00

Pos logit contributions
 disl+0.047
 Majesty+0.042
osity+0.041
 Sorry+0.040
 winners+0.040
Neg logit contributions
 and-0.035
 called-0.033
 so-0.033
-0.032
 quickly-0.031
Token said
Token ID531
Feature activation-8.76e-01

Pos logit contributions
 disl+0.238
 Majesty+0.224
 Sorry+0.217
 winners+0.215
 majesty+0.213
Neg logit contributions
 and-0.126
 so-0.120
-0.119
 called-0.118
 even-0.109
Token from
Token ID422
Feature activation-3.29e-01

Pos logit contributions
 disl+0.039
 Majesty+0.037
 Sorry+0.036
 winners+0.036
 majesty+0.035
Neg logit contributions
 and-0.009
 so-0.008
-0.008
 called-0.007
 even-0.006
Token inside
Token ID2641
Feature activation-6.38e-01

Pos logit contributions
 disl+0.014
 Majesty+0.013
 Sorry+0.013
 winners+0.013
 majesty+0.013
Neg logit contributions
 and-0.004
 so-0.004
 called-0.004
-0.003
 from-0.003
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
 disl+0.025
 Sorry+0.023
 Majesty+0.023
 winners+0.022
 sincere+0.022
Neg logit contributions
-0.008
sc-0.007
 so-0.007
 convey-0.007
 and-0.007
Token
Token ID198
Feature activation+5.25e-01

Pos logit contributions
 and+0.003
 so+0.003
+0.003
 called+0.003
 even+0.002
Neg logit contributions
 disl-0.008
 Majesty-0.007
 Sorry-0.007
 winners-0.007
 majesty-0.007
Token stubborn
Token ID24444
Feature activation+1.11e-01

Pos logit contributions
 disl+0.035
 Sorry+0.034
 Majesty+0.033
rav+0.032
osity+0.031
Neg logit contributions
 so-0.027
 and-0.027
 called-0.026
 even-0.024
-0.023
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 convey+0.001
On+0.001
sc+0.001
+0.001
At+0.001
Neg logit contributions
 disl-0.004
 winners-0.004
 Majesty-0.004
 Sorry-0.004
 nods-0.004
Token She
Token ID1375
Feature activation-6.83e-01

Pos logit contributions
 disl+0.029
 Sorry+0.027
 Majesty+0.026
 winners+0.025
 majesty+0.025
Neg logit contributions
 and-0.023
 so-0.023
 called-0.023
-0.023
sc-0.023
Token throws
Token ID12542
Feature activation-3.10e-01

Pos logit contributions
 disl+0.018
 Majesty+0.016
 Sorry+0.016
osity+0.016
 winners+0.015
Neg logit contributions
 and-0.020
-0.019
 so-0.018
 called-0.018
 carefully-0.018
Token a
Token ID257
Feature activation+7.54e-02

Pos logit contributions
 disl+0.011
 winners+0.010
 Sorry+0.010
 Majesty+0.010
 yeah+0.009
Neg logit contributions
-0.005
 called-0.005
 convey-0.005
On-0.005
sc-0.005
Token tant
Token ID24246
Feature activation-6.64e+00

Pos logit contributions
 so+0.002
+0.002
 called+0.002
 and+0.002
 convey+0.002
Neg logit contributions
 disl-0.002
 Sorry-0.002
 Majesty-0.002
 winners-0.002
 majesty-0.002
Tokenrum
Token ID6582
Feature activation-6.59e-01

Pos logit contributions
 disl+0.162
 Majesty+0.153
 Sorry+0.152
 winners+0.144
 majesty+0.141
Neg logit contributions
 and-0.187
 so-0.177
 called-0.175
-0.174
 even-0.169
Token.
Token ID13
Feature activation-8.16e-01

Pos logit contributions
 disl+0.028
 Majesty+0.027
 Sorry+0.027
 winners+0.026
 majesty+0.026
Neg logit contributions
 and-0.006
-0.006
 so-0.005
 called-0.005
 convey-0.005
Token She
Token ID1375
Feature activation-7.24e-01

Pos logit contributions
 disl+0.025
 Sorry+0.023
 Majesty+0.023
 winners+0.022
 majesty+0.022
Neg logit contributions
 and-0.018
 so-0.017
-0.017
 called-0.017
sc-0.017
Token kicks
Token ID17364
Feature activation-5.56e-02

Pos logit contributions
 disl+0.019
 Majesty+0.017
 Sorry+0.017
 winners+0.016
 majesty+0.016
Neg logit contributions
 and-0.020
 so-0.019
-0.019
 called-0.019
 even-0.018
Token and
Token ID290
Feature activation-1.80e+00

Pos logit contributions
 disl+0.002
 nods+0.002
 winners+0.002
 yeah+0.002
 Sorry+0.002
Neg logit contributions
 convey-0.001
 called-0.001
On-0.001
-0.001
sc-0.001
Token surprise
Token ID5975
Feature activation+2.98e-01

Pos logit contributions
 and+0.003
 so+0.003
+0.003
 called+0.003
 shelf+0.003
Neg logit contributions
 disl-0.004
 Majesty-0.004
 Sorry-0.004
osity-0.004
 majesty-0.004
Token.
Token ID13
Feature activation-6.23e-01

Pos logit contributions
On+0.004
+0.004
sc+0.004
At+0.004
 convey+0.004
Neg logit contributions
 disl-0.010
 Sorry-0.010
 winners-0.010
 Majesty-0.009
 yeah-0.009
Token He
Token ID679
Feature activation-1.09e+00

Pos logit contributions
 disl+0.019
 Majesty+0.018
 Sorry+0.017
 winners+0.017
 majesty+0.016
Neg logit contributions
 and-0.015
 so-0.014
-0.014
 called-0.013
On-0.013
Token threw
Token ID9617
Feature activation+4.49e-01

Pos logit contributions
 disl+0.028
 Sorry+0.025
 sincere+0.025
 Majesty+0.025
 nods+0.025
Neg logit contributions
-0.027
 so-0.026
 convey-0.026
sc-0.026
 and-0.025
Token a
Token ID257
Feature activation+5.60e-01

Pos logit contributions
+0.007
 called+0.007
On+0.007
 and+0.007
 so+0.007
Neg logit contributions
 disl-0.016
 Sorry-0.014
 Majesty-0.014
 winners-0.014
 yeah-0.014
Token tant
Token ID24246
Feature activation-6.37e+00

Pos logit contributions
 and+0.011
 so+0.011
+0.010
 called+0.010
 even+0.010
Neg logit contributions
 disl-0.019
 Majesty-0.017
 Sorry-0.017
 winners-0.017
 majesty-0.016
Tokenrum
Token ID6582
Feature activation-3.50e-01

Pos logit contributions
 disl+0.153
 Majesty+0.143
 Sorry+0.141
 winners+0.134
 majesty+0.132
Neg logit contributions
 and-0.183
 so-0.175
 called-0.172
-0.171
 even-0.167
Token.
Token ID13
Feature activation-6.90e-01

Pos logit contributions
 disl+0.015
 Majesty+0.014
 Sorry+0.014
 winners+0.014
 majesty+0.013
Neg logit contributions
-0.003
 so-0.003
 called-0.003
 convey-0.003
 and-0.003
Token He
Token ID679
Feature activation-1.14e+00

Pos logit contributions
 disl+0.020
 Majesty+0.019
 Sorry+0.018
 winners+0.018
 majesty+0.018
Neg logit contributions
 and-0.016
 so-0.016
-0.016
 called-0.015
 even-0.015
Token kicked
Token ID12165
Feature activation+2.14e-01

Pos logit contributions
 disl+0.029
 Sorry+0.027
 sincere+0.026
haw+0.026
 winners+0.025
Neg logit contributions
-0.027
On-0.026
 convey-0.025
 so-0.025
 shelf-0.025
Token and
Token ID290
Feature activation-1.60e+00

Pos logit contributions
On+0.004
 convey+0.004
+0.004
 called+0.004
 aisle+0.004
Neg logit contributions
 disl-0.006
 winners-0.006
 sorry-0.006
 Sorry-0.006
joy-0.006
Token are
Token ID389
Feature activation-8.17e-01

Pos logit contributions
 disl+0.102
 Majesty+0.096
 Sorry+0.091
 majesty+0.090
osity+0.088
Neg logit contributions
 and-0.075
-0.073
 so-0.071
 called-0.067
 even-0.066
Token you
Token ID345
Feature activation-1.85e+00

Pos logit contributions
 disl+0.024
 Majesty+0.022
 Sorry+0.022
 winners+0.021
 majesty+0.021
Neg logit contributions
 and-0.020
 so-0.019
-0.019
 called-0.018
 even-0.018
Token playing
Token ID2712
Feature activation-1.02e+00

Pos logit contributions
 disl+0.056
 Sorry+0.051
 Majesty+0.051
 winners+0.049
 majesty+0.049
Neg logit contributions
 and-0.044
 so-0.043
 called-0.042
-0.041
 even-0.040
Token?"
Token ID1701
Feature activation-2.97e+00

Pos logit contributions
 disl+0.034
 Sorry+0.032
 winners+0.031
 Majesty+0.031
 yeah+0.029
Neg logit contributions
-0.018
 and-0.017
 so-0.017
 convey-0.017
 called-0.017
Token a
Token ID257
Feature activation-1.96e+00

Pos logit contributions
 disl+0.095
 Majesty+0.095
 majesty+0.090
 winners+0.088
osity+0.087
Neg logit contributions
 and-0.067
 called-0.065
-0.064
 so-0.063
 even-0.059
Token voice
Token ID3809
Feature activation-6.25e+00

Pos logit contributions
 disl+0.060
 Majesty+0.055
 Sorry+0.053
osity+0.053
 winners+0.052
Neg logit contributions
 and-0.050
 called-0.048
 so-0.046
-0.043
 quickly-0.043
Token says
Token ID1139
Feature activation-1.68e+00

Pos logit contributions
 disl+0.174
 Majesty+0.162
 Sorry+0.157
 winners+0.153
 majesty+0.152
Neg logit contributions
 and-0.161
 so-0.154
 called-0.153
-0.153
 even-0.144
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
 disl+0.076
 Majesty+0.072
 Sorry+0.072
 winners+0.070
 majesty+0.070
Neg logit contributions
 and-0.014
 so-0.012
-0.012
 called-0.011
 even-0.009
Token
Token ID198
Feature activation+5.29e-01

Pos logit contributions
 and+0.004
 so+0.004
+0.004
 called+0.003
 even+0.003
Neg logit contributions
 disl-0.009
 Majesty-0.009
 Sorry-0.008
 winners-0.008
 majesty-0.008
Token
Token ID198
Feature activation+9.45e-02

Pos logit contributions
 so+0.008
 convey+0.008
sc+0.008
 shelf+0.008
 and+0.008
Neg logit contributions
 disl-0.019
 Sorry-0.017
 Majesty-0.017
 winners-0.017
 majesty-0.017
TokenThey
Token ID2990
Feature activation-4.48e-01

Pos logit contributions
+0.001
 and+0.001
 called+0.001
On+0.001
 so+0.001
Neg logit contributions
 disl-0.004
 Majesty-0.004
 Sorry-0.004
 majesty-0.004
 winners-0.004
Token his
Token ID465
Feature activation-1.69e+00

Pos logit contributions
 disl+0.066
 Sorry+0.061
 winners+0.059
 Majesty+0.058
 yeah+0.058
Neg logit contributions
-0.053
 convey-0.052
 shelf-0.050
On-0.049
 aisle-0.048
Token feet
Token ID3625
Feature activation+1.09e+00

Pos logit contributions
 Majesty+0.046
 disl+0.046
 majesty+0.045
 winners+0.044
 Sorry+0.042
Neg logit contributions
On-0.039
sc-0.039
-0.039
 called-0.037
At-0.037
Token and
Token ID290
Feature activation-1.47e+00

Pos logit contributions
 pic+0.014
 convey+0.013
ern+0.011
 shelf+0.010
At+0.010
Neg logit contributions
 winners-0.036
 sorry-0.035
 disl-0.033
osity-0.032
 sincere-0.032
Token threw
Token ID9617
Feature activation+4.50e-01

Pos logit contributions
 disl+0.052
 Majesty+0.048
 Sorry+0.048
 winners+0.047
 majesty+0.046
Neg logit contributions
 and-0.026
-0.025
 so-0.025
 called-0.024
 even-0.023
Token a
Token ID257
Feature activation+5.69e-01

Pos logit contributions
+0.008
On+0.008
 called+0.008
 convey+0.008
 and+0.008
Neg logit contributions
 disl-0.014
 Sorry-0.013
 winners-0.013
 Majesty-0.013
 nods-0.012
Token tant
Token ID24246
Feature activation-6.24e+00

Pos logit contributions
 and+0.011
 so+0.011
+0.010
 called+0.010
 even+0.010
Neg logit contributions
 disl-0.019
 Majesty-0.018
 Sorry-0.017
 winners-0.017
 majesty-0.017
Tokenrum
Token ID6582
Feature activation-2.65e-01

Pos logit contributions
 disl+0.147
 Majesty+0.135
 Sorry+0.135
 winners+0.127
 majesty+0.125
Neg logit contributions
 and-0.181
 so-0.174
-0.172
 called-0.171
 even-0.166
Token.
Token ID13
Feature activation-4.12e-01

Pos logit contributions
 disl+0.011
 Majesty+0.010
 Sorry+0.010
 winners+0.010
 majesty+0.010
Neg logit contributions
-0.003
 so-0.002
 and-0.002
 called-0.002
 convey-0.002
Token
Token ID198
Feature activation+4.23e-01

Pos logit contributions
 disl+0.014
 Majesty+0.013
 Sorry+0.013
 winners+0.013
 majesty+0.013
Neg logit contributions
 and-0.007
 so-0.007
-0.007
 called-0.007
 even-0.006
Token
Token ID198
Feature activation-5.48e-02

Pos logit contributions
 convey+0.009
 so+0.009
 shelf+0.008
 floor+0.008
 even+0.008
Neg logit contributions
 disl-0.013
 Sorry-0.011
 sincere-0.011
rav-0.010
 Majesty-0.010
Token"
Token ID1
Feature activation-1.15e+00

Pos logit contributions
 disl+0.002
 Sorry+0.002
 Majesty+0.002
 winners+0.002
 majesty+0.002
Neg logit contributions
 so-0.001
 and-0.001
 convey-0.001
 called-0.001
 even-0.001
Token Leave
Token ID17446
Feature activation-2.28e+00

Pos logit contributions
 disl+0.022
 Majesty+0.022
 winners+0.020
 majesty+0.020
osity+0.019
Neg logit contributions
-0.026
 and-0.025
 so-0.024
 called-0.024
 from-0.022
Token them
Token ID606
Feature activation-3.23e+00

Pos logit contributions
 disl+0.083
 Majesty+0.077
 Sorry+0.077
 winners+0.075
 majesty+0.074
Neg logit contributions
 and-0.038
 so-0.036
-0.036
 called-0.035
 even-0.033
Token alone
Token ID3436
Feature activation-2.98e+00

Pos logit contributions
 disl+0.094
 winners+0.087
 Sorry+0.086
 Majesty+0.083
 sincere+0.079
Neg logit contributions
On-0.068
-0.068
 convey-0.067
 called-0.066
 and-0.065
Token!"
Token ID2474
Feature activation-2.65e+00

Pos logit contributions
 disl+0.105
 winners+0.098
 Sorry+0.097
 Majesty+0.097
 majesty+0.092
Neg logit contributions
 convey-0.046
-0.045
On-0.045
 shelf-0.043
 called-0.042
Token a
Token ID257
Feature activation-1.79e+00

Pos logit contributions
 disl+0.099
 Sorry+0.092
 Majesty+0.091
 winners+0.088
 majesty+0.087
Neg logit contributions
 and-0.040
 so-0.038
-0.038
 called-0.036
 convey-0.036
Token voice
Token ID3809
Feature activation-6.23e+00

Pos logit contributions
 disl+0.056
 Majesty+0.051
osity+0.050
 winners+0.048
rav+0.047
Neg logit contributions
 and-0.043
 called-0.041
 so-0.040
 quickly-0.039
-0.039
Token said
Token ID531
Feature activation-7.89e-01

Pos logit contributions
 disl+0.206
 Sorry+0.189
 Majesty+0.188
 winners+0.182
 majesty+0.179
Neg logit contributions
 and-0.122
 so-0.118
-0.117
 called-0.112
On-0.110
Token.
Token ID13
Feature activation-9.88e-02

Pos logit contributions
 disl+0.036
 Majesty+0.034
 Sorry+0.034
 winners+0.033
 majesty+0.033
Neg logit contributions
 and-0.007
 so-0.006
-0.006
 called-0.006
 even-0.005
Token
Token ID198
Feature activation+3.27e-01

Pos logit contributions
 disl+0.004
 Majesty+0.004
 winners+0.004
 Sorry+0.004
 majesty+0.004
Neg logit contributions
 and-0.001
-0.001
 so-0.001
 called-0.001
On-0.001
Token
Token ID198
Feature activation-8.90e-02

Pos logit contributions
 so+0.005
 and+0.005
 even+0.005
sc+0.005
 convey+0.005
Neg logit contributions
 disl-0.012
 Sorry-0.011
 Majesty-0.011
 majesty-0.010
 winners-0.010
TokenL
Token ID43
Feature activation+6.26e-01

Pos logit contributions
 disl+0.003
 Majesty+0.003
 Sorry+0.003
 winners+0.003
 majesty+0.003
Neg logit contributions
 and-0.001
-0.001
 so-0.001
 called-0.001
On-0.001
TokenWho
Token ID8241
Feature activation-2.31e+00

Pos logit contributions
 disl+0.029
 Majesty+0.028
 majesty+0.026
 Sorry+0.026
 winners+0.025
Neg logit contributions
-0.022
 and-0.022
On-0.021
 so-0.021
sc-0.021
Token is
Token ID318
Feature activation-8.06e-01

Pos logit contributions
 disl+0.071
 Majesty+0.065
 Sorry+0.063
 winners+0.061
 majesty+0.060
Neg logit contributions
 and-0.054
-0.052
 so-0.051
 called-0.051
 even-0.049
Token it
Token ID340
Feature activation-1.85e+00

Pos logit contributions
 disl+0.030
 Majesty+0.028
 Sorry+0.028
 majesty+0.027
 winners+0.027
Neg logit contributions
 and-0.013
 so-0.013
-0.012
 called-0.012
 even-0.011
Token?"
Token ID1701
Feature activation-2.90e+00

Pos logit contributions
 disl+0.063
 Majesty+0.059
 Sorry+0.058
 winners+0.056
 majesty+0.056
Neg logit contributions
 and-0.036
 so-0.035
-0.034
 called-0.033
 even-0.032
Token a
Token ID257
Feature activation-1.27e+00

Pos logit contributions
 disl+0.097
 Majesty+0.090
 Sorry+0.088
 winners+0.087
 majesty+0.086
Neg logit contributions
 and-0.058
 so-0.056
-0.055
 called-0.054
 even-0.051
Token voice
Token ID3809
Feature activation-6.18e+00

Pos logit contributions
 disl+0.038
 Majesty+0.034
 Sorry+0.033
osity+0.032
 winners+0.032
Neg logit contributions
 and-0.033
 called-0.031
 so-0.031
-0.030
 quickly-0.029
Token asked
Token ID1965
Feature activation-1.10e+00

Pos logit contributions
 disl+0.211
 Sorry+0.195
 Majesty+0.189
 winners+0.186
 sincere+0.182
Neg logit contributions
 and-0.107
-0.106
 so-0.105
On-0.100
sc-0.099
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
 disl+0.043
 Sorry+0.041
 Majesty+0.041
 winners+0.040
 sincere+0.039
Neg logit contributions
On-0.011
-0.011
 so-0.011
 convey-0.011
 and-0.011
Token
Token ID198
Feature activation+6.92e-01

Pos logit contributions
 and+0.004
 so+0.004
+0.004
 called+0.004
 even+0.004
Neg logit contributions
 disl-0.010
 Majesty-0.010
 Sorry-0.010
 winners-0.009
 majesty-0.009
Token
Token ID198
Feature activation+3.16e-01

Pos logit contributions
 so+0.012
sc+0.011
 and+0.011
 even+0.011
 convey+0.011
Neg logit contributions
 disl-0.024
 Sorry-0.022
 Majesty-0.021
 majesty-0.021
 sincere-0.021
Token"
Token ID1
Feature activation-4.71e-01

Pos logit contributions
 and+0.004
+0.004
 so+0.004
 called+0.004
On+0.004
Neg logit contributions
 disl-0.012
 Majesty-0.012
 Sorry-0.011
 winners-0.011
 majesty-0.011
Token we
Token ID356
Feature activation-1.71e+00

Pos logit contributions
 disl+0.001
 Majesty+0.001
 majesty+0.001
 winners+0.001
osity+0.001
Neg logit contributions
-0.000
 and-0.000
 so-0.000
 called-0.000
 even-0.000
Token join
Token ID4654
Feature activation+1.67e-01

Pos logit contributions
 disl+0.054
 Majesty+0.050
 Sorry+0.049
 winners+0.047
 majesty+0.047
Neg logit contributions
 and-0.038
 so-0.036
-0.036
 called-0.035
 even-0.034
Token you
Token ID345
Feature activation-1.43e+00

Pos logit contributions
+0.003
 convey+0.002
 and+0.002
 so+0.002
On+0.002
Neg logit contributions
 disl-0.006
 Sorry-0.006
 Majesty-0.006
 winners-0.006
 majesty-0.005
Token?"
Token ID1701
Feature activation-3.02e+00

Pos logit contributions
 disl+0.055
 Majesty+0.050
 Sorry+0.050
 majesty+0.049
 winners+0.048
Neg logit contributions
 and-0.024
 so-0.023
-0.021
 called-0.021
 even-0.020
Token a
Token ID257
Feature activation-1.89e+00

Pos logit contributions
 disl+0.104
 Majesty+0.097
 Sorry+0.094
 winners+0.093
 majesty+0.092
Neg logit contributions
 and-0.059
 so-0.056
-0.056
 called-0.055
 even-0.051
Token voice
Token ID3809
Feature activation-6.13e+00

Pos logit contributions
 disl+0.057
 Majesty+0.052
osity+0.049
 Sorry+0.048
rav+0.047
Neg logit contributions
 and-0.049
 called-0.048
 so-0.046
-0.043
 quickly-0.043
Token asked
Token ID1965
Feature activation-1.38e+00

Pos logit contributions
 disl+0.211
 Sorry+0.195
 Majesty+0.191
 winners+0.188
 majesty+0.182
Neg logit contributions
 and-0.107
-0.105
 so-0.104
On-0.098
 convey-0.097
Token.
Token ID13
Feature activation+7.64e-02

Pos logit contributions
 disl+0.055
 Sorry+0.052
 Majesty+0.052
 winners+0.052
 sincere+0.050
Neg logit contributions
-0.014
 and-0.014
 so-0.014
 convey-0.013
On-0.013
Token It
Token ID632
Feature activation-2.10e+00

Pos logit contributions
 and+0.001
 so+0.001
+0.001
 called+0.001
On+0.001
Neg logit contributions
 disl-0.003
 Majesty-0.003
 Sorry-0.003
 winners-0.003
 majesty-0.003
Token was
Token ID373
Feature activation+7.98e-01

Pos logit contributions
 disl+0.089
 Majesty+0.084
 Sorry+0.083
 winners+0.081
 majesty+0.081
Neg logit contributions
 and-0.025
 so-0.023
 called-0.022
-0.022
 even-0.020
Token Mia
Token ID32189
Feature activation+3.10e+00

Pos logit contributions
 and+0.014
+0.014
 so+0.014
sc+0.013
 called+0.013
Neg logit contributions
 disl-0.027
 Majesty-0.025
 Sorry-0.025
 winners-0.025
 majesty-0.024
TokenHello
Token ID15496
Feature activation-3.83e+00

Pos logit contributions
 disl+0.026
 Majesty+0.025
 Sorry+0.024
 winners+0.024
 majesty+0.024
Neg logit contributions
 and-0.008
 so-0.007
-0.007
 called-0.007
On-0.007
Token,
Token ID11
Feature activation-4.37e-01

Pos logit contributions
 disl+0.156
 winners+0.145
 Sorry+0.145
osity+0.143
 Majesty+0.143
Neg logit contributions
 and-0.060
 so-0.053
-0.047
 called-0.045
 even-0.044
Token Lily
Token ID20037
Feature activation+1.11e+00

Pos logit contributions
 disl+0.014
 Majesty+0.013
 Sorry+0.013
 winners+0.012
osity+0.012
Neg logit contributions
 and-0.010
 so-0.010
-0.010
 called-0.009
 even-0.009
Token,"
Token ID553
Feature activation-2.30e+00

Pos logit contributions
 and+0.011
 called+0.008
 so+0.008
+0.008
 even+0.006
Neg logit contributions
 disl-0.051
 Sorry-0.049
 winners-0.048
 Majesty-0.048
osity-0.047
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
 disl+0.083
 Majesty+0.080
 winners+0.078
osity+0.078
 majesty+0.076
Neg logit contributions
 and-0.042
 so-0.040
-0.039
 called-0.039
 even-0.035
Token voice
Token ID3809
Feature activation-6.12e+00

Pos logit contributions
 disl+0.032
 Majesty+0.029
osity+0.027
 Sorry+0.027
 winners+0.027
Neg logit contributions
 and-0.026
 called-0.026
 so-0.025
-0.025
 carefully-0.024
Token said
Token ID531
Feature activation-7.78e-01

Pos logit contributions
 disl+0.195
 Majesty+0.183
 Sorry+0.178
 winners+0.175
 majesty+0.173
Neg logit contributions
 and-0.132
 so-0.127
 called-0.126
-0.125
 even-0.117
Token.
Token ID13
Feature activation+3.94e-01

Pos logit contributions
 disl+0.037
 Majesty+0.036
 Sorry+0.035
 winners+0.035
 majesty+0.035
Neg logit contributions
 and-0.006
 so-0.005
 called-0.004
-0.004
 from-0.004
Token Lily
Token ID20037
Feature activation+4.26e-01

Pos logit contributions
 and+0.006
 so+0.006
+0.006
 called+0.005
On+0.005
Neg logit contributions
 disl-0.015
 Majesty-0.014
 Sorry-0.014
 winners-0.014
 majesty-0.014
Token turned
Token ID2900
Feature activation-2.92e-01

Pos logit contributions
sc+0.009
 convey+0.008
 so+0.008
 shelf+0.008
+0.008
Neg logit contributions
 disl-0.013
 Majesty-0.012
 Sorry-0.012
 nods-0.012
 majesty-0.012
Token around
Token ID1088
Feature activation-1.14e+00

Pos logit contributions
 disl+0.010
 Sorry+0.009
 nods+0.009
 sincere+0.009
 sorry+0.009
Neg logit contributions
 shelf-0.004
 convey-0.004
On-0.004
-0.004
 floor-0.004
Token are
Token ID389
Feature activation+9.96e-02

Pos logit contributions
 disl+0.053
 Majesty+0.049
 Sorry+0.047
 majesty+0.045
 winners+0.045
Neg logit contributions
 and-0.050
-0.049
 so-0.048
 called-0.046
 even-0.044
Token you
Token ID345
Feature activation-1.62e+00

Pos logit contributions
 and+0.002
 so+0.002
+0.002
 called+0.002
 even+0.002
Neg logit contributions
 disl-0.003
 Majesty-0.003
 Sorry-0.003
 winners-0.003
 majesty-0.003
Token doing
Token ID1804
Feature activation+1.98e-01

Pos logit contributions
 disl+0.045
 Majesty+0.041
 Sorry+0.040
 winners+0.039
 majesty+0.038
Neg logit contributions
 and-0.041
 so-0.040
-0.040
 called-0.039
 even-0.037
Token?"
Token ID1701
Feature activation-3.04e+00

Pos logit contributions
 and+0.004
sc+0.004
+0.004
 so+0.004
On+0.004
Neg logit contributions
 disl-0.006
 Sorry-0.006
 Majesty-0.006
 winners-0.005
 sincere-0.005
Token a
Token ID257
Feature activation-1.61e+00

Pos logit contributions
 disl+0.106
 Majesty+0.100
 winners+0.096
 majesty+0.095
osity+0.094
Neg logit contributions
 and-0.059
 so-0.057
 called-0.056
-0.055
 even-0.051
Token voice
Token ID3809
Feature activation-6.11e+00

Pos logit contributions
 disl+0.052
 Majesty+0.048
osity+0.047
rav+0.045
 winners+0.045
Neg logit contributions
 and-0.039
 called-0.038
 so-0.036
 quickly-0.035
-0.033
Token said
Token ID531
Feature activation-1.57e+00

Pos logit contributions
 disl+0.208
 Sorry+0.192
 Majesty+0.188
 winners+0.184
 majesty+0.180
Neg logit contributions
 and-0.109
-0.107
 so-0.106
On-0.100
sc-0.099
Token.
Token ID13
Feature activation+2.60e-02

Pos logit contributions
 disl+0.069
 Majesty+0.065
 Sorry+0.064
 winners+0.063
 majesty+0.063
Neg logit contributions
 and-0.015
 so-0.014
-0.013
 called-0.013
 even-0.011
Token
Token ID198
Feature activation+5.29e-01

Pos logit contributions
 and+0.001
 so+0.001
+0.001
 called+0.001
On+0.000
Neg logit contributions
 disl-0.001
 Majesty-0.001
 winners-0.001
 Sorry-0.001
 majesty-0.001
Token
Token ID198
Feature activation+9.46e-02

Pos logit contributions
 so+0.008
 and+0.008
sc+0.007
 even+0.007
 called+0.007
Neg logit contributions
 disl-0.019
 Sorry-0.018
 Majesty-0.017
 majesty-0.017
 winners-0.017
TokenThey
Token ID2990
Feature activation-5.16e-01

Pos logit contributions
+0.001
 and+0.001
 called+0.001
On+0.001
 so+0.001
Neg logit contributions
 disl-0.004
 Majesty-0.004
 Sorry-0.004
 winners-0.004
 majesty-0.004
Token going
Token ID1016
Feature activation+1.00e-01

Pos logit contributions
 disl+0.017
 Majesty+0.016
 Sorry+0.016
 winners+0.015
 majesty+0.015
Neg logit contributions
 and-0.009
 so-0.009
-0.008
 called-0.008
 even-0.008
Token on
Token ID319
Feature activation+1.42e+00

Pos logit contributions
 and+0.001
+0.001
 so+0.001
 called+0.001
 even+0.001
Neg logit contributions
 disl-0.004
 Sorry-0.004
 Majesty-0.004
 winners-0.004
 majesty-0.004
Token here
Token ID994
Feature activation+4.72e-01

Pos logit contributions
sc+0.036
+0.033
On+0.033
 convey+0.033
 even+0.033
Neg logit contributions
 disl-0.036
 Sorry-0.035
 winners-0.034
 Majesty-0.033
 sincere-0.031
Token?"
Token ID1701
Feature activation-3.12e+00

Pos logit contributions
 convey+0.012
sc+0.012
On+0.011
+0.011
 carefully+0.011
Neg logit contributions
 disl-0.011
 Sorry-0.011
 Majesty-0.010
 winners-0.010
 yeah-0.010
Token a
Token ID257
Feature activation-2.08e+00

Pos logit contributions
 disl+0.112
 Majesty+0.106
osity+0.101
 majesty+0.100
 winners+0.100
Neg logit contributions
 and-0.060
 so-0.057
 called-0.055
-0.055
 even-0.051
Token voice
Token ID3809
Feature activation-6.09e+00

Pos logit contributions
 disl+0.071
 Majesty+0.064
osity+0.064
rav+0.061
 Sorry+0.060
Neg logit contributions
 and-0.050
 called-0.046
 so-0.045
 quickly-0.043
 shelf-0.042
Token said
Token ID531
Feature activation-1.69e+00

Pos logit contributions
 disl+0.212
 Sorry+0.196
 Majesty+0.192
 winners+0.189
 majesty+0.183
Neg logit contributions
 and-0.103
-0.102
 so-0.100
On-0.094
sc-0.093
Token.
Token ID13
Feature activation+2.85e-02

Pos logit contributions
 disl+0.079
 Majesty+0.075
 Sorry+0.074
 majesty+0.072
 winners+0.072
Neg logit contributions
 and-0.014
 so-0.012
-0.011
 called-0.011
 even-0.008
Token
Token ID198
Feature activation+2.92e-01

Pos logit contributions
 and+0.001
 so+0.000
+0.000
 called+0.000
On+0.000
Neg logit contributions
 disl-0.001
 Majesty-0.001
 winners-0.001
 Sorry-0.001
 majesty-0.001
Token
Token ID198
Feature activation-1.54e-01

Pos logit contributions
 so+0.004
 and+0.004
+0.004
 called+0.004
sc+0.003
Neg logit contributions
 disl-0.011
 Sorry-0.010
 Majesty-0.010
 winners-0.010
 majesty-0.010
TokenThey
Token ID2990
Feature activation-6.74e-01

Pos logit contributions
 disl+0.007
 Majesty+0.006
osity+0.006
 Sorry+0.006
 majesty+0.006
Neg logit contributions
 and-0.002
-0.002
On-0.002
 so-0.002
 called-0.002
Token
Token ID198
Feature activation+3.43e-01

Pos logit contributions
 and+0.001
 so+0.001
+0.001
 called+0.001
 even+0.001
Neg logit contributions
 disl-0.003
 Majesty-0.003
 Sorry-0.003
 winners-0.003
 majesty-0.003
Token
Token ID198
Feature activation-4.92e-02

Pos logit contributions
sc+0.006
 so+0.006
 convey+0.006
+0.005
 even+0.005
Neg logit contributions
 disl-0.012
 Sorry-0.011
 Majesty-0.011
 majesty-0.010
 nods-0.010
Token"
Token ID1
Feature activation-9.94e-01

Pos logit contributions
 disl+0.002
 Sorry+0.001
 Majesty+0.001
 majesty+0.001
 winners+0.001
Neg logit contributions
 so-0.001
 and-0.001
 convey-0.001
 called-0.001
sc-0.001
TokenOK
Token ID11380
Feature activation+1.00e+00

Pos logit contributions
 disl+0.024
 Majesty+0.023
 Sorry+0.021
 majesty+0.021
 winners+0.020
Neg logit contributions
 and-0.029
 so-0.028
-0.028
On-0.027
 called-0.027
Token,
Token ID11
Feature activation+8.87e-01

Pos logit contributions
 convey+0.014
sc+0.014
On+0.014
+0.014
 called+0.013
Neg logit contributions
 disl-0.037
 Majesty-0.035
 Sorry-0.035
 winners-0.034
 majesty-0.034
Token Mia
Token ID32189
Feature activation+3.71e+00

Pos logit contributions
 and+0.018
 so+0.017
+0.016
 called+0.016
 even+0.016
Neg logit contributions
 disl-0.031
 Majesty-0.028
 Sorry-0.028
 winners-0.027
osity-0.027
Token,
Token ID11
Feature activation+6.04e-01

Pos logit contributions
 and+0.033
 so+0.025
+0.023
 called+0.023
 even+0.017
Neg logit contributions
 disl-0.174
 Sorry-0.165
 Majesty-0.164
 winners-0.160
osity-0.159
Token let
Token ID1309
Feature activation-3.39e-01

Pos logit contributions
 and+0.014
 so+0.013
+0.013
 called+0.013
sc+0.013
Neg logit contributions
 disl-0.018
 Majesty-0.016
 Sorry-0.016
 winners-0.016
 majesty-0.016
Token's
Token ID338
Feature activation-1.32e+00

Pos logit contributions
 disl+0.013
 Majesty+0.013
 Sorry+0.012
 majesty+0.012
 nods+0.012
Neg logit contributions
On-0.004
sc-0.004
 convey-0.004
 called-0.004
 so-0.004
Token go
Token ID467
Feature activation+1.60e-01

Pos logit contributions
 disl+0.046
 Majesty+0.042
 Sorry+0.041
 winners+0.041
osity+0.041
Neg logit contributions
 and-0.026
 so-0.025
-0.024
 called-0.024
 even-0.023
Token.
Token ID13
Feature activation+4.92e-01

Pos logit contributions
+0.002
 convey+0.002
On+0.002
 so+0.002
sc+0.002
Neg logit contributions
 disl-0.006
 Sorry-0.006
 Majesty-0.005
 winners-0.005
 majesty-0.005
Token very
Token ID845
Feature activation-4.80e-01

Pos logit contributions
 and+0.002
 so+0.001
+0.001
 called+0.001
 even+0.001
Neg logit contributions
 disl-0.005
 Majesty-0.004
 Sorry-0.004
 winners-0.004
 majesty-0.004
Token friendly
Token ID8030
Feature activation-9.33e-01

Pos logit contributions
 disl+0.014
 Majesty+0.014
 Sorry+0.013
rav+0.013
osity+0.013
Neg logit contributions
 so-0.011
 and-0.011
 called-0.011
-0.011
 even-0.011
Token.
Token ID13
Feature activation+3.64e-01

Pos logit contributions
 disl+0.040
 Majesty+0.038
 winners+0.038
 Sorry+0.037
joy+0.037
Neg logit contributions
-0.007
 convey-0.007
On-0.007
sc-0.006
At-0.006
Token Frank
Token ID5278
Feature activation+7.82e-01

Pos logit contributions
 and+0.007
 so+0.006
+0.006
 called+0.006
 even+0.006
Neg logit contributions
 disl-0.013
 Majesty-0.012
 Sorry-0.012
 winners-0.011
 majesty-0.011
Token showed
Token ID3751
Feature activation+7.80e-01

Pos logit contributions
 convey+0.013
sc+0.013
 so+0.012
 shelf+0.012
On+0.012
Neg logit contributions
 disl-0.027
 Majesty-0.025
 majesty-0.024
 Sorry-0.024
 nods-0.024
Token Gerry
Token ID42196
Feature activation+3.71e+00

Pos logit contributions
 and+0.017
 so+0.017
+0.016
 called+0.015
 even+0.015
Neg logit contributions
 disl-0.025
 Majesty-0.023
 Sorry-0.023
 winners-0.023
 majesty-0.022
Token all
Token ID477
Feature activation+7.08e-01

Pos logit contributions
 and+0.058
 so+0.054
+0.053
 called+0.050
 even+0.049
Neg logit contributions
 disl-0.142
 Sorry-0.135
 Majesty-0.133
 winners-0.130
 majesty-0.127
Token the
Token ID262
Feature activation+1.60e+00

Pos logit contributions
 and+0.012
+0.011
 so+0.011
 called+0.011
 convey+0.011
Neg logit contributions
 disl-0.025
 Majesty-0.023
 Sorry-0.023
 winners-0.023
 majesty-0.022
Token best
Token ID1266
Feature activation+3.10e-01

Pos logit contributions
 and+0.032
+0.031
 so+0.030
 called+0.030
On+0.029
Neg logit contributions
 disl-0.052
 Majesty-0.048
 Sorry-0.046
 winners-0.046
 majesty-0.046
Token places
Token ID4113
Feature activation+1.29e+00

Pos logit contributions
+0.007
On+0.007
 convey+0.007
 and+0.007
 called+0.007
Neg logit contributions
 disl-0.009
 Majesty-0.008
 winners-0.008
 majesty-0.008
 Sorry-0.008
Token to
Token ID284
Feature activation-3.09e-01

Pos logit contributions
 and+0.021
+0.020
 so+0.020
 called+0.019
 convey+0.019
Neg logit contributions
 disl-0.047
 Sorry-0.043
 Majesty-0.043
 winners-0.042
 majesty-0.041
Token front
Token ID2166
Feature activation+4.09e-01

Pos logit contributions
sc+0.017
+0.016
 convey+0.016
 especially+0.015
 closely+0.015
Neg logit contributions
 disl-0.017
joy-0.017
 sorry-0.016
 sincere-0.016
 winners-0.016
Token of
Token ID286
Feature activation+1.23e+00

Pos logit contributions
sc+0.007
+0.007
On+0.007
 especially+0.007
 even+0.006
Neg logit contributions
 nods-0.012
 disl-0.012
joy-0.012
 sorry-0.011
 winners-0.011
Token them
Token ID606
Feature activation-7.28e-01

Pos logit contributions
 and+0.020
+0.020
 so+0.019
 called+0.019
sc+0.018
Neg logit contributions
 disl-0.044
 Majesty-0.041
 Sorry-0.041
 winners-0.040
 majesty-0.039
Token and
Token ID290
Feature activation-1.35e+00

Pos logit contributions
 disl+0.026
 winners+0.025
 sorry+0.025
 nods+0.025
 sincere+0.024
Neg logit contributions
 convey-0.008
sc-0.007
ern-0.007
On-0.007
At-0.006
Token hugged
Token ID41130
Feature activation-8.50e-01

Pos logit contributions
 disl+0.046
 Majesty+0.043
 Sorry+0.043
 winners+0.041
 majesty+0.041
Neg logit contributions
 and-0.024
-0.024
 so-0.024
sc-0.022
 called-0.022
Token Lily
Token ID20037
Feature activation+3.60e+00

Pos logit contributions
 disl+0.030
 Sorry+0.028
 Majesty+0.027
 winners+0.026
 yeah+0.026
Neg logit contributions
-0.014
 so-0.013
 and-0.013
 called-0.013
On-0.013
Token.
Token ID13
Feature activation-6.92e-02

Pos logit contributions
 convey+0.048
On+0.045
At+0.045
sc+0.042
 called+0.040
Neg logit contributions
 winners-0.120
 disl-0.120
 nods-0.118
 Majesty-0.116
 sorry-0.116
Token
Token ID198
Feature activation+4.00e-01

Pos logit contributions
 disl+0.003
 Sorry+0.002
 Majesty+0.002
 majesty+0.002
 sincere+0.002
Neg logit contributions
-0.001
sc-0.001
 so-0.001
 and-0.001
 convey-0.001
Token
Token ID198
Feature activation-1.04e-01

Pos logit contributions
 convey+0.009
sc+0.009
 shelf+0.009
 especially+0.009
 so+0.009
Neg logit contributions
 disl-0.011
 Sorry-0.009
 majesty-0.009
 sorry-0.009
rav-0.009
TokenThe
Token ID464
Feature activation-5.65e-01

Pos logit contributions
 disl+0.003
 Sorry+0.003
 Majesty+0.003
 winners+0.003
 majesty+0.003
Neg logit contributions
 so-0.002
 convey-0.002
 and-0.002
 even-0.002
 called-0.002
Token dog
Token ID3290
Feature activation-1.91e+00

Pos logit contributions
 disl+0.018
osity+0.017
 Majesty+0.016
 Sorry+0.016
 yeah+0.015
Neg logit contributions
 and-0.015
 so-0.013
-0.013
 shelf-0.013
 called-0.013
Token two
Token ID734
Feature activation+1.58e+00

Pos logit contributions
 and+0.005
 shelf+0.004
 convey+0.004
sc+0.004
 called+0.004
Neg logit contributions
 disl-0.006
 Sorry-0.006
 winners-0.006
 Majesty-0.006
 majesty-0.005
Token friends
Token ID2460
Feature activation-2.14e+00

Pos logit contributions
+0.046
sc+0.046
On+0.045
 and+0.045
 so+0.045
Neg logit contributions
 disl-0.034
 Majesty-0.032
 winners-0.030
 majesty-0.030
 Sorry-0.029
Token,
Token ID11
Feature activation-3.70e-01

Pos logit contributions
 disl+0.067
 sincere+0.063
 winners+0.061
 sorry+0.061
 Majesty+0.060
Neg logit contributions
sc-0.044
 convey-0.041
 shelf-0.040
-0.039
 aisle-0.038
Token Max
Token ID5436
Feature activation+3.17e+00

Pos logit contributions
 disl+0.011
 Sorry+0.010
 winners+0.010
 Majesty+0.010
 yeah+0.010
Neg logit contributions
-0.008
 and-0.008
 so-0.008
 convey-0.007
sc-0.007
Token and
Token ID290
Feature activation+5.89e-01

Pos logit contributions
sc+0.027
 convey+0.027
On+0.022
 shelf+0.020
 aisle+0.020
Neg logit contributions
 disl-0.135
 nods-0.123
 Majesty-0.121
 majesty-0.121
 winners-0.120
Token Julie
Token ID21946
Feature activation+3.48e+00

Pos logit contributions
 so+0.012
 and+0.012
+0.011
 called+0.011
 even+0.011
Neg logit contributions
 disl-0.020
 Majesty-0.019
 Sorry-0.018
 majesty-0.018
osity-0.018
Token.
Token ID13
Feature activation-3.51e-01

Pos logit contributions
 convey+0.057
On+0.054
sc+0.053
 shelf+0.052
 aisle+0.050
Neg logit contributions
 disl-0.119
 winners-0.111
 Majesty-0.108
rav-0.105
 tweet-0.105
Token They
Token ID1119
Feature activation+3.62e-01

Pos logit contributions
 disl+0.010
 Sorry+0.009
 Majesty+0.009
 winners+0.009
 nods+0.008
Neg logit contributions
 so-0.008
-0.008
 and-0.008
 called-0.008
 even-0.008
Token liked
Token ID8288
Feature activation+1.82e+00

Pos logit contributions
 and+0.007
 called+0.007
 so+0.007
 even+0.007
+0.006
Neg logit contributions
 disl-0.013
 Majesty-0.013
 majesty-0.012
 Sorry-0.012
 yeah-0.012
Token to
Token ID284
Feature activation-1.61e+00

Pos logit contributions
+0.017
 and+0.016
sc+0.015
 called+0.015
 so+0.015
Neg logit contributions
 disl-0.079
 Majesty-0.073
 Sorry-0.073
 winners-0.072
 majesty-0.070
Token share
Token ID2648
Feature activation-5.84e-01

Pos logit contributions
 disl+0.051
 Majesty+0.048
 Sorry+0.047
 winners+0.046
osity+0.045
Neg logit contributions
 and-0.035
-0.033
 so-0.033
 called-0.033
 even-0.032
Token was
Token ID373
Feature activation+2.57e+00

Pos logit contributions
 and+0.007
 so+0.007
+0.007
 called+0.007
 convey+0.007
Neg logit contributions
 disl-0.021
 Sorry-0.019
 Majesty-0.019
 winners-0.019
osity-0.018
Token trying
Token ID2111
Feature activation+1.91e+00

Pos logit contributions
sc+0.047
 and+0.047
 convey+0.046
+0.046
 aisle+0.044
Neg logit contributions
 disl-0.084
 winners-0.076
 Sorry-0.075
 Majesty-0.075
 sincere-0.073
Token to
Token ID284
Feature activation+1.09e+00

Pos logit contributions
 convey+0.027
 aisle+0.026
sc+0.024
 shelf+0.023
 called+0.023
Neg logit contributions
 disl-0.073
 Majesty-0.064
 winners-0.064
 Sorry-0.063
 sincere-0.063
Token make
Token ID787
Feature activation+1.29e+00

Pos logit contributions
 and+0.023
sc+0.023
 so+0.022
+0.022
 convey+0.022
Neg logit contributions
 disl-0.033
 Sorry-0.030
 winners-0.030
 Majesty-0.029
 sincere-0.028
Token it
Token ID340
Feature activation+5.07e-01

Pos logit contributions
 and+0.024
 so+0.023
+0.022
 called+0.022
sc+0.021
Neg logit contributions
 disl-0.044
 Majesty-0.041
 Sorry-0.041
 winners-0.039
 majesty-0.039
Token hard
Token ID1327
Feature activation+3.40e+00

Pos logit contributions
 and+0.009
 so+0.008
+0.008
 even+0.008
 called+0.008
Neg logit contributions
 disl-0.019
 Majesty-0.017
 Sorry-0.017
 majesty-0.017
 winners-0.017
Token for
Token ID329
Feature activation+1.26e+00

Pos logit contributions
 convey+0.044
 called+0.044
On+0.044
 shelf+0.042
sc+0.042
Neg logit contributions
 disl-0.126
 winners-0.116
 Majesty-0.115
 Sorry-0.114
osity-0.109
Token him
Token ID683
Feature activation+1.07e+00

Pos logit contributions
sc+0.028
+0.027
 and+0.026
On+0.025
 so+0.025
Neg logit contributions
 disl-0.038
 Sorry-0.034
 Majesty-0.034
 winners-0.033
 sincere-0.032
Token to
Token ID284
Feature activation-5.62e-01

Pos logit contributions
 convey+0.012
sc+0.011
On+0.011
 aisle+0.011
 called+0.011
Neg logit contributions
 disl-0.043
 Majesty-0.039
 winners-0.039
 Sorry-0.039
 majesty-0.038
Token play
Token ID711
Feature activation+6.94e-01

Pos logit contributions
 disl+0.018
 Majesty+0.016
 Sorry+0.016
 winners+0.016
 majesty+0.016
Neg logit contributions
 and-0.012
 so-0.011
-0.011
 called-0.011
sc-0.011
Token,
Token ID11
Feature activation-5.14e-01

Pos logit contributions
+0.009
 convey+0.009
 called+0.009
On+0.008
 aisle+0.008
Neg logit contributions
 disl-0.026
 winners-0.024
 Sorry-0.024
 Majesty-0.024
 sincere-0.023
TokenYes
Token ID5297
Feature activation+5.88e-01

Pos logit contributions
 disl+0.018
 Sorry+0.016
 Majesty+0.016
 winners+0.016
 yeah+0.015
Neg logit contributions
 and-0.014
 so-0.014
 called-0.013
-0.013
 convey-0.013
Token!"
Token ID2474
Feature activation-1.73e+00

Pos logit contributions
sc+0.009
On+0.009
 convey+0.008
+0.008
 called+0.008
Neg logit contributions
 disl-0.021
 Majesty-0.021
 Sorry-0.020
 majesty-0.020
 yeah-0.020
Token replied
Token ID8712
Feature activation-2.03e+00

Pos logit contributions
 disl+0.057
 Majesty+0.053
 Sorry+0.051
 winners+0.051
 majesty+0.050
Neg logit contributions
 and-0.037
 so-0.035
-0.035
 called-0.034
 even-0.032
Token Joyce
Token ID25936
Feature activation+1.40e+00

Pos logit contributions
 disl+0.073
 Majesty+0.067
 Sorry+0.065
 winners+0.064
 majesty+0.063
Neg logit contributions
 and-0.039
 so-0.037
-0.035
 called-0.035
 even-0.033
Token.
Token ID13
Feature activation+8.97e-01

Pos logit contributions
 and+0.014
 so+0.013
+0.012
 called+0.011
 even+0.010
Neg logit contributions
 disl-0.063
 Sorry-0.059
 Majesty-0.059
 winners-0.057
 majesty-0.057
Token "
Token ID366
Feature activation+2.69e-01

Pos logit contributions
 convey+0.019
sc+0.019
+0.019
 so+0.018
 even+0.018
Neg logit contributions
 disl-0.026
 Sorry-0.026
 Majesty-0.024
 majesty-0.023
osity-0.023
TokenThere
Token ID1858
Feature activation+9.21e-01

Pos logit contributions
 and+0.006
 so+0.006
 called+0.006
+0.006
 convey+0.006
Neg logit contributions
 disl-0.008
 Sorry-0.007
 Majesty-0.007
 winners-0.007
 majesty-0.007
Token's
Token ID338
Feature activation+2.08e-01

Pos logit contributions
 and+0.014
+0.013
 so+0.013
 called+0.012
 even+0.011
Neg logit contributions
 disl-0.036
 Majesty-0.033
 Sorry-0.033
 winners-0.033
 majesty-0.032
Token a
Token ID257
Feature activation+7.96e-01

Pos logit contributions
 and+0.004
sc+0.003
 convey+0.003
+0.003
On+0.003
Neg logit contributions
 disl-0.007
 Majesty-0.007
 Sorry-0.007
 winners-0.006
 majesty-0.006
Token big
Token ID1263
Feature activation+6.44e-01

Pos logit contributions
 and+0.017
 so+0.017
+0.017
 called+0.016
 even+0.016
Neg logit contributions
 disl-0.025
 Majesty-0.023
 Sorry-0.023
 majesty-0.022
 winners-0.022
Token billboard
Token ID41448
Feature activation-1.60e+00

Pos logit contributions
 and+0.016
 so+0.015
+0.015
 called+0.015
 even+0.014
Neg logit contributions
 disl-0.019
 Majesty-0.017
 Sorry-0.017
 majesty-0.017
osity-0.016
Token think
Token ID892
Feature activation-2.97e-01

Pos logit contributions
 Majesty+0.092
 disl+0.091
 majesty+0.086
 Sorry+0.083
 winners+0.083
Neg logit contributions
 and-0.070
-0.067
 so-0.066
 called-0.065
 even-0.062
Token I
Token ID314
Feature activation-1.03e+00

Pos logit contributions
 disl+0.008
 Sorry+0.008
 winners+0.008
 Majesty+0.008
 majesty+0.007
Neg logit contributions
sc-0.007
On-0.006
 convey-0.006
-0.006
 and-0.006
Token did
Token ID750
Feature activation+7.67e-01

Pos logit contributions
 disl+0.035
 Majesty+0.034
 Sorry+0.032
 winners+0.032
 majesty+0.032
Neg logit contributions
 and-0.021
-0.020
 called-0.019
 so-0.019
 even-0.018
Token?"
Token ID1701
Feature activation-1.38e+00

Pos logit contributions
 and+0.011
 so+0.011
+0.010
 even+0.010
 called+0.010
Neg logit contributions
 disl-0.030
 Majesty-0.029
 Sorry-0.029
 majesty-0.028
osity-0.028
Token
Token ID198
Feature activation+3.25e-01

Pos logit contributions
 disl+0.052
 Sorry+0.048
 Majesty+0.047
 winners+0.046
 yeah+0.046
Neg logit contributions
-0.020
 and-0.019
 so-0.019
 called-0.018
 convey-0.018
Token
Token ID198
Feature activation+5.25e-03

Pos logit contributions
 so+0.005
 convey+0.005
+0.005
 and+0.005
sc+0.005
Neg logit contributions
 disl-0.012
 Sorry-0.011
 Majesty-0.010
 sincere-0.010
 majesty-0.010
TokenL
Token ID43
Feature activation+6.19e-01

Pos logit contributions
 and+0.000
 so+0.000
+0.000
 called+0.000
 convey+0.000
Neg logit contributions
 disl-0.000
 Majesty-0.000
 Sorry-0.000
 winners-0.000
 majesty-0.000
Tokenily
Token ID813
Feature activation+3.71e-01

Pos logit contributions
 and+0.013
 so+0.012
+0.012
 called+0.012
 even+0.011
Neg logit contributions
 disl-0.020
 Majesty-0.019
 Sorry-0.018
 winners-0.018
 majesty-0.018
Token said
Token ID531
Feature activation-6.94e-01

Pos logit contributions
 and+0.008
 so+0.008
+0.008
sc+0.007
 convey+0.007
Neg logit contributions
 disl-0.011
 Majesty-0.010
 Sorry-0.010
 winners-0.010
 sincere-0.010
Token,
Token ID11
Feature activation+1.09e+00

Pos logit contributions
 disl+0.034
 Majesty+0.032
 Sorry+0.031
 majesty+0.031
 winners+0.030
Neg logit contributions
 and-0.006
 so-0.005
-0.004
 even-0.003
 from-0.003
Token "
Token ID366
Feature activation+7.84e-01

Pos logit contributions
 and+0.021
 so+0.019
+0.018
 called+0.018
 even+0.017
Neg logit contributions
 disl-0.039
 Majesty-0.037
 Sorry-0.036
 majesty-0.036
 winners-0.036
Token the
Token ID262
Feature activation+2.66e+00

Pos logit contributions
 disl+0.001
 Majesty+0.001
 winners+0.001
osity+0.001
 yeah+0.001
Neg logit contributions
sc-0.000
 convey-0.000
On-0.000
 called-0.000
-0.000
Token discovery
Token ID9412
Feature activation+7.09e-01

Pos logit contributions
 and+0.049
 so+0.047
+0.046
 called+0.045
 even+0.042
Neg logit contributions
 disl-0.093
 Sorry-0.087
 Majesty-0.087
 winners-0.084
 majesty-0.083
Token he
Token ID339
Feature activation+1.36e+00

Pos logit contributions
 convey+0.010
On+0.010
sc+0.009
+0.009
At+0.009
Neg logit contributions
 disl-0.025
 winners-0.024
 Sorry-0.023
 Majesty-0.022
 yeah-0.022
Token had
Token ID550
Feature activation+1.05e+00

Pos logit contributions
 and+0.028
 so+0.027
+0.027
 called+0.026
 even+0.025
Neg logit contributions
 disl-0.044
 Majesty-0.041
 Sorry-0.040
 winners-0.039
 majesty-0.038
Token made
Token ID925
Feature activation+1.49e+00

Pos logit contributions
 convey+0.023
sc+0.023
 shelf+0.023
+0.023
 and+0.023
Neg logit contributions
 disl-0.031
 Majesty-0.028
 winners-0.027
 Sorry-0.027
 majesty-0.026
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
 and+0.018
+0.018
 so+0.017
 called+0.017
 convey+0.017
Neg logit contributions
 disl-0.058
 Majesty-0.054
 Sorry-0.053
 winners-0.053
 majesty-0.052
Token They
Token ID1119
Feature activation+2.50e-01

Pos logit contributions
 and+0.003
 so+0.002
+0.002
 called+0.002
 even+0.002
Neg logit contributions
 disl-0.005
 Majesty-0.005
 Sorry-0.005
 winners-0.005
 majesty-0.005
Token were
Token ID547
Feature activation-1.45e-01

Pos logit contributions
 and+0.004
 so+0.004
+0.004
 called+0.004
sc+0.004
Neg logit contributions
 disl-0.009
 Majesty-0.008
 Sorry-0.008
 winners-0.008
 majesty-0.008
Token all
Token ID477
Feature activation+3.23e-01

Pos logit contributions
 disl+0.006
 Majesty+0.006
 winners+0.006
 Sorry+0.006
 majesty+0.006
Neg logit contributions
 and-0.001
sc-0.001
-0.001
On-0.001
 convey-0.001
Token amazed
Token ID24562
Feature activation+2.56e-01

Pos logit contributions
 so+0.006
 and+0.006
+0.006
 called+0.006
 even+0.005
Neg logit contributions
 disl-0.012
 Majesty-0.011
 Sorry-0.011
 majesty-0.010
rav-0.010
Token at
Token ID379
Feature activation+1.90e-01

Pos logit contributions
 and+0.002
 so+0.002
+0.002
 called+0.002
 even+0.002
Neg logit contributions
 disl-0.011
 Majesty-0.011
 Sorry-0.011
 winners-0.011
 majesty-0.010
Token cup
Token ID6508
Feature activation+9.80e-01

Pos logit contributions
 disl+0.009
 Majesty+0.009
 Sorry+0.009
 winners+0.008
 majesty+0.008
Neg logit contributions
 and-0.006
-0.005
 so-0.005
 called-0.005
 even-0.005
Tokenboard
Token ID3526
Feature activation+1.50e+00

Pos logit contributions
+0.028
 especially+0.027
 convey+0.027
 even+0.027
On+0.026
Neg logit contributions
 disl-0.018
 Sorry-0.018
 nods-0.017
 yeah-0.017
 winners-0.016
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
 convey+0.026
 especially+0.024
 pic+0.024
Days+0.024
+0.024
Neg logit contributions
 nods-0.041
 disl-0.040
 winners-0.039
 wow-0.039
 Thank-0.038
Token But
Token ID887
Feature activation+3.44e-01

Pos logit contributions
 disl+0.008
 Sorry+0.008
 Majesty+0.007
 majesty+0.007
 yeah+0.007
Neg logit contributions
 and-0.007
-0.007
 even-0.007
 so-0.007
sc-0.007
Token she
Token ID673
Feature activation+4.34e-01

Pos logit contributions
 and+0.005
+0.005
sc+0.005
 so+0.005
 convey+0.005
Neg logit contributions
 disl-0.012
 Sorry-0.012
 Majesty-0.011
 winners-0.011
 yeah-0.011
Token could
Token ID714
Feature activation-8.96e-02

Pos logit contributions
sc+0.010
 convey+0.009
+0.009
 so+0.009
On+0.009
Neg logit contributions
 disl-0.012
 Majesty-0.011
 Sorry-0.011
 majesty-0.011
 sincere-0.011
Token not
Token ID407
Feature activation-1.69e+00

Pos logit contributions
 disl+0.003
 Sorry+0.003
 Majesty+0.003
 winners+0.003
 sincere+0.003
Neg logit contributions
sc-0.002
 and-0.002
-0.002
 convey-0.002
 so-0.002
Token find
Token ID1064
Feature activation-3.06e-01

Pos logit contributions
 disl+0.032
 hey+0.030
 Sorry+0.029
 sincere+0.029
 bully+0.028
Neg logit contributions
sc-0.051
 convey-0.044
ion-0.041
 strips-0.041
 aisle-0.039
Token her
Token ID607
Feature activation+8.70e-01

Pos logit contributions
 disl+0.010
 Majesty+0.009
 Sorry+0.009
 winners+0.009
 majesty+0.009
Neg logit contributions
-0.006
 and-0.006
On-0.005
sc-0.005
 so-0.005
Token pretty
Token ID2495
Feature activation-1.55e-01

Pos logit contributions
sc+0.020
On+0.019
+0.019
At+0.018
 convey+0.018
Neg logit contributions
 disl-0.026
 Majesty-0.024
 majesty-0.023
 sincere-0.022
 winners-0.022
Token mug
Token ID25152
Feature activation+2.94e-01

Pos logit contributions
 disl+0.005
 Majesty+0.005
 Sorry+0.005
 majesty+0.005
 winners+0.005
Neg logit contributions
-0.003
 and-0.003
 so-0.003
 called-0.003
On-0.003
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
 convey+0.028
On+0.025
 shelf+0.024
+0.024
At+0.024
Neg logit contributions
 disl-0.070
 winners-0.068
 Majesty-0.064
osity-0.064
 tweet-0.062
Token
Token ID198
Feature activation+6.81e-01

Pos logit contributions
 and+0.005
 so+0.005
 called+0.004
+0.004
 convey+0.004
Neg logit contributions
 disl-0.006
 Majesty-0.006
 Sorry-0.006
 winners-0.005
 majesty-0.005
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
 convey+0.014
 shelf+0.013
 floor+0.013
 so+0.013
 and+0.012
Neg logit contributions
 disl-0.021
 Majesty-0.019
 winners-0.019
 majesty-0.018
 Sorry-0.018
TokenBut
Token ID1537
Feature activation+5.21e-03

Pos logit contributions
 and+0.004
 so+0.004
 called+0.003
 convey+0.003
 shelf+0.003
Neg logit contributions
 disl-0.008
 Majesty-0.007
 Sorry-0.007
 winners-0.007
 majesty-0.007
Token soon
Token ID2582
Feature activation+2.07e+00

Pos logit contributions
 and+0.000
 so+0.000
+0.000
 called+0.000
 even+0.000
Neg logit contributions
 disl-0.000
 Majesty-0.000
 Sorry-0.000
 winners-0.000
 majesty-0.000
Token the
Token ID262
Feature activation+3.76e-01

Pos logit contributions
 and+0.034
 so+0.033
 even+0.030
+0.029
 called+0.029
Neg logit contributions
 disl-0.083
 Sorry-0.076
 Majesty-0.075
 majesty-0.073
 winners-0.073
Token boy
Token ID2933
Feature activation-1.61e+00

Pos logit contributions
 and+0.008
 so+0.008
 called+0.008
 shelf+0.007
 even+0.007
Neg logit contributions
 disl-0.013
 Sorry-0.012
osity-0.011
 Majesty-0.011
rav-0.011
Token had
Token ID550
Feature activation+4.28e-01

Pos logit contributions
 disl+0.053
 Majesty+0.050
 Sorry+0.049
 winners+0.048
 sincere+0.048
Neg logit contributions
 convey-0.027
sc-0.026
 so-0.026
On-0.026
 and-0.026
Token to
Token ID284
Feature activation-1.43e+00

Pos logit contributions
 and+0.006
 so+0.005
+0.005
 called+0.005
 even+0.005
Neg logit contributions
 disl-0.017
 Majesty-0.016
 Sorry-0.016
 winners-0.015
 majesty-0.015
Token go
Token ID467
Feature activation+2.56e-01

Pos logit contributions
 disl+0.047
 Majesty+0.044
 Sorry+0.043
 winners+0.042
 majesty+0.041
Neg logit contributions
 and-0.027
sc-0.027
 so-0.026
 called-0.026
-0.026
Token back
Token ID736
Feature activation-1.42e+00

Pos logit contributions
+0.005
sc+0.004
 convey+0.004
 shelf+0.004
 called+0.004
Neg logit contributions
 disl-0.008
 Majesty-0.008
 winners-0.008
 Sorry-0.007
joy-0.007
Token She
Token ID1375
Feature activation+4.02e-01

Pos logit contributions
 disl+0.001
 Majesty+0.001
 Sorry+0.001
 majesty+0.001
osity+0.001
Neg logit contributions
 and-0.001
-0.001
 so-0.001
 called-0.001
sc-0.001
Token wanted
Token ID2227
Feature activation+5.36e-01

Pos logit contributions
sc+0.010
 aisle+0.009
 convey+0.009
 shelf+0.009
On+0.009
Neg logit contributions
 disl-0.011
 Majesty-0.010
 sincere-0.010
 Sorry-0.010
 nods-0.009
Token to
Token ID284
Feature activation-5.24e-01

Pos logit contributions
sc+0.008
+0.008
 convey+0.007
On+0.007
So+0.007
Neg logit contributions
 disl-0.019
 winners-0.018
 Sorry-0.017
 Majesty-0.017
 sincere-0.017
Token have
Token ID423
Feature activation+8.58e-01

Pos logit contributions
 disl+0.016
 Sorry+0.015
 Majesty+0.015
 winners+0.014
 majesty+0.014
Neg logit contributions
 and-0.011
 so-0.011
-0.011
 called-0.011
sc-0.011
Token the
Token ID262
Feature activation+6.50e-01

Pos logit contributions
 and+0.013
 so+0.013
+0.012
 called+0.012
 even+0.011
Neg logit contributions
 disl-0.033
 Majesty-0.031
 Sorry-0.030
 majesty-0.029
 winners-0.029
Token prett
Token ID46442
Feature activation-2.27e+00

Pos logit contributions
 and+0.015
 so+0.015
+0.014
 even+0.014
 shelf+0.013
Neg logit contributions
 disl-0.021
 Majesty-0.019
 Sorry-0.019
 majesty-0.018
osity-0.018
Tokeniest
Token ID6386
Feature activation-6.75e-01

Pos logit contributions
 disl+0.057
 Majesty+0.052
 Sorry+0.049
 majesty+0.047
 sincere+0.045
Neg logit contributions
 and-0.068
 so-0.065
-0.062
 called-0.062
 even-0.061
Token bicycle
Token ID17026
Feature activation-2.51e-01

Pos logit contributions
 disl+0.020
 Majesty+0.018
 Sorry+0.018
 winners+0.017
 majesty+0.017
Neg logit contributions
 and-0.016
 so-0.015
-0.015
 called-0.015
 even-0.014
Token in
Token ID287
Feature activation+5.51e-01

Pos logit contributions
 disl+0.008
 winners+0.008
 Sorry+0.008
 Majesty+0.008
 yeah+0.008
Neg logit contributions
On-0.004
 convey-0.004
-0.004
sc-0.004
At-0.004
Token town
Token ID3240
Feature activation-1.48e+00

Pos logit contributions
sc+0.010
 convey+0.009
+0.009
 and+0.009
 called+0.009
Neg logit contributions
 disl-0.019
 Sorry-0.018
 winners-0.017
 Majesty-0.017
 sincere-0.016
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 disl+0.061
 Majesty+0.057
 winners+0.057
 Sorry+0.056
 majesty+0.054
Neg logit contributions
-0.015
 and-0.015
 convey-0.014
On-0.014
 called-0.014
Token garden
Token ID11376
Feature activation+2.17e+00

Pos logit contributions
On+0.011
 so+0.011
 called+0.011
sc+0.011
 and+0.011
Neg logit contributions
 disl-0.016
 Majesty-0.014
 winners-0.014
 Sorry-0.014
 majesty-0.013
Token.
Token ID13
Feature activation+3.05e-01

Pos logit contributions
sc+0.028
On+0.028
 convey+0.028
+0.027
At+0.027
Neg logit contributions
 disl-0.078
 Majesty-0.073
 winners-0.072
 majesty-0.069
joy-0.067
Token He
Token ID679
Feature activation-2.54e-01

Pos logit contributions
 called+0.008
sc+0.008
 convey+0.008
 and+0.008
 so+0.008
Neg logit contributions
 disl-0.008
 Sorry-0.007
 Majesty-0.007
osity-0.006
 yeah-0.006
Token found
Token ID1043
Feature activation-7.44e-02

Pos logit contributions
 disl+0.007
 Majesty+0.007
 nods+0.007
 majesty+0.007
 sincere+0.006
Neg logit contributions
sc-0.005
 convey-0.005
 shelf-0.005
 aisle-0.005
 so-0.004
Token a
Token ID257
Feature activation+1.08e+00

Pos logit contributions
 disl+0.003
 Majesty+0.003
 winners+0.003
 Sorry+0.003
 majesty+0.002
Neg logit contributions
 and-0.001
-0.001
 so-0.001
 called-0.001
On-0.001
Token bag
Token ID6131
Feature activation-3.88e-01

Pos logit contributions
 and+0.019
 so+0.018
+0.018
 called+0.018
sc+0.018
Neg logit contributions
 disl-0.036
 Majesty-0.034
 Sorry-0.034
 winners-0.033
 majesty-0.033
Token of
Token ID286
Feature activation+5.75e-01

Pos logit contributions
 winners+0.013
 disl+0.013
osity+0.013
 yeah+0.013
 Sorry+0.012
Neg logit contributions
 convey-0.005
sc-0.004
-0.004
On-0.004
ern-0.004
Token seeds
Token ID11904
Feature activation+5.93e-01

Pos logit contributions
 and+0.010
+0.010
 so+0.010
On+0.010
sc+0.010
Neg logit contributions
 disl-0.019
 Majesty-0.017
 winners-0.017
 Sorry-0.017
 majesty-0.017
Token and
Token ID290
Feature activation+3.73e-01

Pos logit contributions
 convey+0.006
On+0.005
sc+0.005
+0.005
So+0.004
Neg logit contributions
 disl-0.023
 winners-0.022
osity-0.021
 Majesty-0.021
 yeah-0.021
Token he
Token ID339
Feature activation-3.66e-01

Pos logit contributions
sc+0.006
+0.006
 convey+0.006
On+0.006
 and+0.006
Neg logit contributions
 disl-0.012
 Majesty-0.011
 majesty-0.011
 Sorry-0.011
 winners-0.011
Token went
Token ID1816
Feature activation+1.40e+00

Pos logit contributions
 disl+0.011
 Majesty+0.010
 majesty+0.010
 winners+0.009
 Sorry+0.009
Neg logit contributions
 convey-0.007
sc-0.007
 aisle-0.006
On-0.006
 shelf-0.006
Token hello
Token ID23748
Feature activation-3.13e+00

Pos logit contributions
 disl+0.056
 Majesty+0.052
 Sorry+0.050
 winners+0.050
 majesty+0.049
Neg logit contributions
 and-0.033
-0.030
 so-0.030
 called-0.029
 even-0.028
Token to
Token ID284
Feature activation-1.60e+00

Pos logit contributions
 disl+0.155
 Majesty+0.144
 winners+0.142
 Sorry+0.141
 majesty+0.140
Neg logit contributions
 and-0.026
 so-0.019
 from-0.014
 even-0.013
-0.013
Token the
Token ID262
Feature activation-4.66e-01

Pos logit contributions
 disl+0.073
 Majesty+0.070
 majesty+0.068
 winners+0.068
osity+0.067
Neg logit contributions
 and-0.017
 so-0.015
 even-0.013
-0.012
 called-0.012
Token night
Token ID1755
Feature activation+7.23e-02

Pos logit contributions
 disl+0.015
 Majesty+0.014
 Sorry+0.014
 majesty+0.014
 yeah+0.014
Neg logit contributions
 and-0.011
 so-0.010
-0.009
 even-0.009
 called-0.009
Token.
Token ID13
Feature activation-2.54e-01

Pos logit contributions
 and+0.001
 so+0.001
+0.001
 called+0.001
 even+0.001
Neg logit contributions
 disl-0.003
 Majesty-0.003
 Sorry-0.003
 winners-0.003
 majesty-0.003
Token They
Token ID1119
Feature activation-5.33e-01

Pos logit contributions
 disl+0.008
 Sorry+0.008
 Majesty+0.007
 majesty+0.007
 winners+0.007
Neg logit contributions
 and-0.005
 so-0.005
-0.005
 called-0.005
 convey-0.005
Token thank
Token ID5875
Feature activation-1.30e+00

Pos logit contributions
 disl+0.016
 Majesty+0.014
 Sorry+0.014
 winners+0.014
osity+0.013
Neg logit contributions
 and-0.013
-0.013
 so-0.013
 called-0.012
 even-0.012
Token Mom
Token ID11254
Feature activation+1.95e+00

Pos logit contributions
 disl+0.061
 Majesty+0.058
 majesty+0.056
osity+0.055
 winners+0.054
Neg logit contributions
 and-0.015
 so-0.012
 called-0.011
 carefully-0.011
 from-0.011
Token for
Token ID329
Feature activation-5.22e-01

Pos logit contributions
 and+0.005
+0.005
 so+0.005
 called+0.004
sc+0.003
Neg logit contributions
 disl-0.096
 Majesty-0.091
 Sorry-0.090
 winners-0.089
 majesty-0.089
Token explaining
Token ID11170
Feature activation+1.08e+00

Pos logit contributions
 disl+0.019
 Majesty+0.018
 Sorry+0.018
 winners+0.018
 majesty+0.017
Neg logit contributions
 and-0.008
-0.008
 so-0.008
 called-0.007
sc-0.007
Token the
Token ID262
Feature activation+2.56e+00

Pos logit contributions
+0.017
sc+0.016
 and+0.016
 called+0.016
On+0.016
Neg logit contributions
 disl-0.040
 Majesty-0.036
 Sorry-0.036
 winners-0.035
 majesty-0.034
Token We
Token ID775
Feature activation-1.58e+00

Pos logit contributions
 disl+0.011
 Majesty+0.010
 winners+0.010
 majesty+0.009
 Sorry+0.009
Neg logit contributions
-0.008
 and-0.008
 so-0.008
 called-0.007
On-0.007
Token should
Token ID815
Feature activation+1.04e+00

Pos logit contributions
 disl+0.049
 Majesty+0.047
osity+0.044
 majesty+0.044
 Sorry+0.043
Neg logit contributions
 and-0.039
-0.038
 called-0.036
 so-0.036
 even-0.036
Token go
Token ID467
Feature activation-7.98e-01

Pos logit contributions
 and+0.017
+0.016
 so+0.015
 called+0.015
 even+0.014
Neg logit contributions
joy-0.038
 disl-0.038
rav-0.038
osity-0.037
 Majesty-0.037
Token inside
Token ID2641
Feature activation+1.29e+00

Pos logit contributions
 disl+0.027
 Sorry+0.026
 Majesty+0.026
 winners+0.026
 majesty+0.025
Neg logit contributions
-0.013
sc-0.012
 convey-0.012
On-0.012
 and-0.012
Token and
Token ID290
Feature activation-1.13e+00

Pos logit contributions
 convey+0.020
sc+0.020
ern+0.019
On+0.017
Days+0.017
Neg logit contributions
 disl-0.043
 winners-0.039
 Majesty-0.039
 sorry-0.039
joy-0.038
Token see
Token ID766
Feature activation-6.37e-01

Pos logit contributions
 disl+0.039
 Majesty+0.037
 winners+0.036
 Sorry+0.035
osity+0.035
Neg logit contributions
 and-0.022
 so-0.021
-0.020
 called-0.020
 even-0.019
Token if
Token ID611
Feature activation+1.69e+00

Pos logit contributions
 disl+0.019
 Majesty+0.017
 Sorry+0.017
 winners+0.016
 sorry+0.016
Neg logit contributions
sc-0.014
On-0.013
-0.013
 convey-0.013
 aisle-0.013
Token Mom
Token ID11254
Feature activation-2.71e-01

Pos logit contributions
 and+0.031
 so+0.031
+0.030
 called+0.029
 even+0.028
Neg logit contributions
 disl-0.061
 Majesty-0.055
 winners-0.054
osity-0.054
 majesty-0.053
Token and
Token ID290
Feature activation+1.93e-01

Pos logit contributions
 disl+0.009
 Majesty+0.009
 Sorry+0.009
 winners+0.008
 majesty+0.008
Neg logit contributions
 and-0.005
 so-0.005
-0.005
 called-0.005
 even-0.004
Token Dad
Token ID17415
Feature activation-9.03e-01

Pos logit contributions
 and+0.003
 so+0.003
+0.003
 called+0.003
 even+0.003
Neg logit contributions
 disl-0.007
 Majesty-0.006
 Sorry-0.006
 winners-0.006
osity-0.006
Token are
Token ID389
Feature activation+2.64e-01

Pos logit contributions
 disl+0.028
 Majesty+0.026
 Sorry+0.026
 majesty+0.025
 winners+0.025
Neg logit contributions
 and-0.021
 so-0.020
-0.020
 called-0.019
 even-0.018
Token very
Token ID845
Feature activation-3.98e-01

Pos logit contributions
 so+0.008
 and+0.008
 called+0.007
 even+0.007
+0.007
Neg logit contributions
 disl-0.017
 Sorry-0.016
 Majesty-0.016
rav-0.015
osity-0.015
Token nice
Token ID3621
Feature activation-4.84e-01

Pos logit contributions
 disl+0.013
 Majesty+0.012
osity+0.012
 winners+0.012
 yeah+0.011
Neg logit contributions
 and-0.009
 so-0.009
 called-0.009
 carefully-0.008
 quickly-0.008
Token.
Token ID13
Feature activation+1.65e-01

Pos logit contributions
 disl+0.018
 winners+0.017
 Majesty+0.017
joy+0.017
 Sorry+0.016
Neg logit contributions
-0.007
sc-0.007
On-0.006
At-0.006
 convey-0.006
Token He
Token ID679
Feature activation+1.38e-01

Pos logit contributions
 and+0.003
 so+0.003
+0.003
 called+0.003
On+0.002
Neg logit contributions
 disl-0.006
 Majesty-0.006
 winners-0.006
 Sorry-0.006
 majesty-0.005
Token lived
Token ID5615
Feature activation+5.21e-01

Pos logit contributions
+0.003
sc+0.003
 convey+0.003
 so+0.003
 and+0.003
Neg logit contributions
 disl-0.004
 Majesty-0.003
 Sorry-0.003
 majesty-0.003
 sincere-0.003
Token in
Token ID287
Feature activation-1.35e+00

Pos logit contributions
 and+0.005
 so+0.004
 called+0.004
+0.004
 even+0.003
Neg logit contributions
 disl-0.024
 Majesty-0.022
 Sorry-0.022
 majesty-0.022
 winners-0.021
Token a
Token ID257
Feature activation-2.10e+00

Pos logit contributions
 disl+0.053
 Majesty+0.049
 Sorry+0.049
 winners+0.048
 majesty+0.047
Neg logit contributions
 and-0.019
 so-0.018
-0.018
 called-0.017
sc-0.016
Token big
Token ID1263
Feature activation-2.44e+00

Pos logit contributions
 disl+0.084
 Majesty+0.079
 Sorry+0.078
 winners+0.076
 majesty+0.076
Neg logit contributions
 and-0.028
 so-0.026
-0.026
 called-0.025
 even-0.023
Token tree
Token ID5509
Feature activation-9.61e-01

Pos logit contributions
 disl+0.088
 Majesty+0.082
 Sorry+0.081
 winners+0.079
 majesty+0.078
Neg logit contributions
 and-0.040
-0.039
 so-0.039
 called-0.037
sc-0.036
Token with
Token ID351
Feature activation+2.01e-01

Pos logit contributions
 disl+0.033
 winners+0.032
 Sorry+0.031
 sincere+0.031
 Majesty+0.031
Neg logit contributions
 convey-0.013
-0.013
sc-0.012
 so-0.011
 even-0.011
Token his
Token ID465
Feature activation-9.96e-01

Pos logit contributions
 and+0.003
 so+0.003
 called+0.003
+0.003
 even+0.002
Neg logit contributions
 disl-0.008
 Majesty-0.008
 Sorry-0.007
 winners-0.007
osity-0.007
Token,
Token ID11
Feature activation-4.72e-01

Pos logit contributions
 disl+0.044
 Majesty+0.042
 Sorry+0.042
 winners+0.041
 majesty+0.041
Neg logit contributions
-0.015
 and-0.015
 so-0.014
On-0.014
 called-0.013
Token he
Token ID339
Feature activation-4.22e-01

Pos logit contributions
 disl+0.017
 Majesty+0.016
 Sorry+0.015
 winners+0.015
 majesty+0.015
Neg logit contributions
 so-0.009
 and-0.009
 called-0.009
 even-0.008
-0.008
Token heard
Token ID2982
Feature activation-4.78e-01

Pos logit contributions
 disl+0.010
 Majesty+0.010
 Sorry+0.009
 majesty+0.009
 sincere+0.009
Neg logit contributions
-0.011
 and-0.011
 convey-0.010
 so-0.010
sc-0.010
Token a
Token ID257
Feature activation-6.43e-01

Pos logit contributions
 disl+0.020
 Sorry+0.019
 Majesty+0.018
 winners+0.018
 majesty+0.018
Neg logit contributions
 and-0.006
 so-0.006
 called-0.006
-0.006
 even-0.005
Token little
Token ID1310
Feature activation-1.11e+00

Pos logit contributions
 disl+0.020
 Majesty+0.019
 winners+0.018
 Sorry+0.018
 majesty+0.018
Neg logit contributions
 and-0.015
 so-0.014
-0.014
 called-0.014
 even-0.013
Token voice
Token ID3809
Feature activation-4.66e+00

Pos logit contributions
 disl+0.033
 Majesty+0.031
 majesty+0.030
 winners+0.030
 Sorry+0.029
Neg logit contributions
 and-0.030
 so-0.029
-0.028
 called-0.027
 even-0.026
Token say
Token ID910
Feature activation-1.44e-01

Pos logit contributions
 disl+0.175
 Sorry+0.159
 Majesty+0.153
 sincere+0.153
 tweet+0.152
Neg logit contributions
sc-0.059
-0.058
 so-0.058
On-0.058
 shelf-0.057
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
 disl+0.006
 Majesty+0.006
 Sorry+0.006
 majesty+0.006
 winners+0.006
Neg logit contributions
 and-0.002
 so-0.002
-0.001
 called-0.001
 from-0.001
Token "
Token ID366
Feature activation+1.33e+00

Pos logit contributions
 and+0.031
 so+0.029
+0.028
 called+0.027
 even+0.025
Neg logit contributions
 disl-0.061
 Majesty-0.058
 winners-0.057
 Sorry-0.057
 majesty-0.056
TokenWait
Token ID21321
Feature activation-1.04e+00

Pos logit contributions
+0.031
 and+0.031
 so+0.029
 called+0.029
On+0.029
Neg logit contributions
 disl-0.041
 Majesty-0.039
 majesty-0.037
 winners-0.037
 Sorry-0.037
Token for
Token ID329
Feature activation+4.00e-01

Pos logit contributions
 disl+0.035
 Majesty+0.033
 Sorry+0.033
 tweet+0.033
 winners+0.032
Neg logit contributions
sc-0.016
On-0.015
 convey-0.015
At-0.015
 aisle-0.015
Token questioned
Token ID11434
Feature activation-1.30e+00

Pos logit contributions
sc+0.008
 convey+0.007
+0.007
 aisle+0.007
ion+0.007
Neg logit contributions
 disl-0.006
 nods-0.005
 sincere-0.005
 Majesty-0.005
 majesty-0.005
Token what
Token ID644
Feature activation-7.97e-01

Pos logit contributions
 disl+0.033
 sorry+0.031
 tweet+0.031
 yeah+0.030
 wow+0.030
Neg logit contributions
sc-0.035
 aisle-0.032
On-0.032
 convey-0.031
At-0.031
Token it
Token ID340
Feature activation-9.69e-01

Pos logit contributions
 disl+0.023
 Sorry+0.020
 writer+0.020
 nods+0.019
 sincere+0.019
Neg logit contributions
sc-0.017
 convey-0.016
 into-0.015
On-0.015
 hind-0.015
Token was
Token ID373
Feature activation+2.22e+00

Pos logit contributions
 disl+0.033
 nods+0.029
 Sorry+0.029
 yeah+0.029
 sincere+0.029
Neg logit contributions
 convey-0.015
sc-0.015
-0.014
 so-0.014
On-0.013
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
sc+0.038
 convey+0.038
+0.037
On+0.035
At+0.034
Neg logit contributions
 disl-0.073
 Majesty-0.064
 sincere-0.062
 winners-0.062
 sorry-0.061
Token why
Token ID1521
Feature activation-4.00e+00

Pos logit contributions
 and+0.019
 so+0.018
 called+0.018
+0.017
 even+0.017
Neg logit contributions
 disl-0.038
 Majesty-0.036
 Sorry-0.035
 winners-0.035
 majesty-0.034
Token it
Token ID340
Feature activation-2.34e+00

Pos logit contributions
 disl+0.154
 Majesty+0.148
 Sorry+0.143
 winners+0.142
 majesty+0.141
Neg logit contributions
 and-0.063
 so-0.057
-0.055
 called-0.055
 even-0.053
Token was
Token ID373
Feature activation+1.38e+00

Pos logit contributions
 disl+0.082
 Sorry+0.075
 Majesty+0.072
 yeah+0.071
 sincere+0.071
Neg logit contributions
-0.035
 convey-0.035
 and-0.035
sc-0.034
 so-0.033
Token there
Token ID612
Feature activation-5.25e-01

Pos logit contributions
sc+0.032
ches+0.029
text+0.027
So+0.027
ion+0.027
Neg logit contributions
 sorry-0.037
 sincere-0.037
 disl-0.037
 winners-0.034
 ashamed-0.032
Token.
Token ID13
Feature activation+4.02e-01

Pos logit contributions
 disl+0.014
 Sorry+0.013
 sorry+0.012
 sincere+0.012
 wow+0.012
Neg logit contributions
sc-0.012
 convey-0.011
Days-0.011
So-0.010
ches-0.010
Token
Token ID220
Feature activation-7.57e-01

Pos logit contributions
 and+0.007
 so+0.006
+0.006
 called+0.006
 convey+0.006
Neg logit contributions
 disl-0.014
 Sorry-0.013
 Majesty-0.013
 yeah-0.013
 majesty-0.013
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
sc+0.018
+0.016
ches+0.016
 convey+0.016
 especially+0.015
Neg logit contributions
 disl-0.019
joy-0.018
 Sorry-0.018
 winners-0.017
 sincere-0.016
Token sky
Token ID6766
Feature activation-2.03e+00

Pos logit contributions
 disl+0.059
 Majesty+0.054
 Sorry+0.053
 winners+0.052
 majesty+0.051
Neg logit contributions
 and-0.045
 so-0.043
-0.043
 called-0.041
 even-0.039
Token.
Token ID13
Feature activation+1.74e-01

Pos logit contributions
 disl+0.070
 winners+0.066
 Majesty+0.065
 Sorry+0.065
osity+0.063
Neg logit contributions
 convey-0.030
-0.028
sc-0.027
On-0.026
At-0.026
Token Lily
Token ID20037
Feature activation+1.23e+00

Pos logit contributions
 and+0.003
+0.003
 so+0.003
 convey+0.003
 called+0.003
Neg logit contributions
 disl-0.006
 Sorry-0.006
 Majesty-0.006
 yeah-0.005
 majesty-0.005
Token was
Token ID373
Feature activation-1.68e-01

Pos logit contributions
 convey+0.025
On+0.025
sc+0.025
 shelf+0.024
 aisle+0.024
Neg logit contributions
 disl-0.034
 Sorry-0.032
 nods-0.032
 Majesty-0.031
 sincere-0.031
Token sad
Token ID6507
Feature activation-3.77e+00

Pos logit contributions
 disl+0.005
 Majesty+0.004
 winners+0.004
 Sorry+0.004
 sincere+0.004
Neg logit contributions
 and-0.004
-0.004
sc-0.004
 convey-0.004
On-0.004
Token and
Token ID290
Feature activation+7.73e-01

Pos logit contributions
 disl+0.147
 yeah+0.138
 Sorry+0.137
 Majesty+0.137
 winners+0.135
Neg logit contributions
sc-0.046
-0.043
 convey-0.042
 aisle-0.036
 shelf-0.035
Token cried
Token ID16896
Feature activation-2.14e+00

Pos logit contributions
 so+0.016
 called+0.016
 and+0.016
+0.014
 even+0.014
Neg logit contributions
 disl-0.026
 Majesty-0.025
 winners-0.025
 majesty-0.024
rav-0.024
Token.
Token ID13
Feature activation+3.80e-01

Pos logit contributions
 disl+0.077
 Majesty+0.074
 Sorry+0.074
 sorry+0.073
 winners+0.073
Neg logit contributions
-0.028
sc-0.027
 aisle-0.026
 convey-0.026
On-0.024
Token Her
Token ID2332
Feature activation-5.79e-01

Pos logit contributions
 and+0.006
 so+0.005
+0.005
 called+0.005
 even+0.005
Neg logit contributions
 disl-0.015
 Majesty-0.014
 Sorry-0.014
 winners-0.013
 majesty-0.013
Token mom
Token ID1995
Feature activation-7.43e-01

Pos logit contributions
 disl+0.022
 Majesty+0.020
 Sorry+0.020
osity+0.020
 yeah+0.019
Neg logit contributions
 and-0.010
 so-0.009
 even-0.008
 shelf-0.008
 carefully-0.008
Token a
Token ID257
Feature activation-6.19e-01

Pos logit contributions
 and+0.009
 so+0.009
+0.008
 even+0.008
 called+0.008
Neg logit contributions
 disl-0.023
 Sorry-0.022
 Majesty-0.022
 winners-0.021
 majesty-0.021
Token lot
Token ID1256
Feature activation-4.83e-01

Pos logit contributions
 disl+0.021
 Majesty+0.019
 Sorry+0.019
 winners+0.019
 majesty+0.018
Neg logit contributions
 and-0.012
 so-0.011
-0.011
 called-0.011
 even-0.010
Token of
Token ID286
Feature activation+1.55e-01

Pos logit contributions
 disl+0.020
 winners+0.019
 Sorry+0.018
 Majesty+0.018
 majesty+0.018
Neg logit contributions
 and-0.005
-0.005
 convey-0.005
 so-0.005
 called-0.005
Token noise
Token ID7838
Feature activation-3.51e+00

Pos logit contributions
 and+0.004
 so+0.003
+0.003
 called+0.003
 even+0.003
Neg logit contributions
 disl-0.005
 Majesty-0.005
 Sorry-0.005
 majesty-0.004
osity-0.004
Token and
Token ID290
Feature activation+9.00e-02

Pos logit contributions
 disl+0.145
 winners+0.137
 Sorry+0.133
 Majesty+0.132
osity+0.131
Neg logit contributions
 convey-0.030
On-0.028
-0.028
sc-0.028
 shelf-0.027
Token whim
Token ID29923
Feature activation-3.83e+00

Pos logit contributions
 and+0.002
 so+0.002
 called+0.002
+0.002
 even+0.001
Neg logit contributions
 disl-0.003
 Majesty-0.003
 Sorry-0.003
 winners-0.003
 majesty-0.003
Tokenpered
Token ID13653
Feature activation+1.33e+00

Pos logit contributions
 disl+0.107
 Majesty+0.104
 sincere+0.097
 winners+0.096
 majesty+0.094
Neg logit contributions
-0.087
On-0.086
 and-0.085
sc-0.085
At-0.085
Token.
Token ID13
Feature activation-1.22e-01

Pos logit contributions
+0.018
 and+0.017
 called+0.017
On+0.017
 convey+0.017
Neg logit contributions
 disl-0.049
 Majesty-0.046
 Sorry-0.046
 winners-0.045
 majesty-0.044
Token She
Token ID1375
Feature activation+1.38e-01

Pos logit contributions
 disl+0.004
 Majesty+0.004
 Sorry+0.004
 winners+0.004
 majesty+0.004
Neg logit contributions
 and-0.002
 so-0.002
-0.002
 called-0.002
 even-0.002
Token was
Token ID373
Feature activation-5.47e-02

Pos logit contributions
sc+0.003
 convey+0.003
On+0.003
 shelf+0.003
 aisle+0.003
Neg logit contributions
 disl-0.004
 Majesty-0.004
 Sorry-0.004
 sincere-0.004
 nods-0.004
Token very
Token ID845
Feature activation-7.92e-01

Pos logit contributions
 disl+0.002
 Majesty+0.002
 Sorry+0.002
 winners+0.002
 majesty+0.002
Neg logit contributions
 and-0.001
 so-0.001
-0.001
 called-0.001
 even-0.001
Token
Token ID198
Feature activation-2.75e-01

Pos logit contributions
 so+0.002
 convey+0.002
sc+0.002
+0.002
 and+0.002
Neg logit contributions
 disl-0.004
 Sorry-0.004
 Majesty-0.003
 sincere-0.003
 majesty-0.003
TokenL
Token ID43
Feature activation+6.75e-01

Pos logit contributions
 disl+0.010
 Sorry+0.009
 Majesty+0.009
 winners+0.009
 majesty+0.009
Neg logit contributions
 and-0.004
 so-0.004
-0.004
 called-0.004
 convey-0.004
Tokenily
Token ID813
Feature activation+8.04e-01

Pos logit contributions
 and+0.014
 so+0.013
+0.013
 called+0.013
 even+0.012
Neg logit contributions
 disl-0.022
 Majesty-0.021
 Sorry-0.020
 winners-0.020
 majesty-0.019
Token sees
Token ID7224
Feature activation-1.38e+00

Pos logit contributions
 so+0.014
On+0.014
 convey+0.014
sc+0.014
+0.014
Neg logit contributions
 disl-0.026
 Majesty-0.024
 Sorry-0.024
 nods-0.024
 winners-0.023
Token Tom
Token ID4186
Feature activation+1.15e+00

Pos logit contributions
 disl+0.049
 Majesty+0.049
 Sorry+0.047
 majesty+0.046
 winners+0.046
Neg logit contributions
 and-0.026
 so-0.024
 called-0.022
-0.022
 even-0.021
Token crying
Token ID13774
Feature activation-4.23e+00

Pos logit contributions
sc+0.016
On+0.016
 convey+0.016
 so+0.015
 called+0.015
Neg logit contributions
 disl-0.041
 Sorry-0.038
 sincere-0.038
 winners-0.037
 Majesty-0.037
Token.
Token ID13
Feature activation-4.17e-01

Pos logit contributions
 disl+0.196
 Sorry+0.188
 Majesty+0.183
 winners+0.183
 sincere+0.181
Neg logit contributions
-0.020
 convey-0.015
sc-0.014
On-0.012
 so-0.012
Token She
Token ID1375
Feature activation-5.27e-01

Pos logit contributions
 disl+0.014
 Majesty+0.013
 Sorry+0.012
 winners+0.012
 majesty+0.012
Neg logit contributions
 and-0.009
 so-0.008
-0.008
 called-0.008
 even-0.008
Token feels
Token ID5300
Feature activation-1.07e+00

Pos logit contributions
 disl+0.015
 Majesty+0.013
 Sorry+0.013
 winners+0.013
 majesty+0.013
Neg logit contributions
 and-0.014
 so-0.013
-0.013
 called-0.013
 even-0.012
Token sorry
Token ID7926
Feature activation-3.56e+00

Pos logit contributions
 Majesty+0.019
 disl+0.019
rav+0.017
 Sorry+0.017
 majesty+0.017
Neg logit contributions
 so-0.040
 and-0.040
 even-0.038
-0.038
 called-0.038
Token for
Token ID329
Feature activation+2.62e-02

Pos logit contributions
 disl+0.135
 Sorry+0.126
 Majesty+0.124
 winners+0.122
 yeah+0.120
Neg logit contributions
-0.046
 convey-0.045
 so-0.044
sc-0.044
 called-0.043