TOP ACTIVATIONS
MAX = 0.134

" Yes , it is ," said the other brave friend
walked away â Mary feeling happy and a
all smiles . John was cheered on , happy to have
" Yes it is ," her Mom agreed .
" Okay , Lily ," said Tim my . "
We will pray and hope ," Lily and Ben said .
" Yes , it did ," Mom my smiled , relieved
hands and paws , and cheered , " Y ay ,
friend a girl could have ," she said with a smile
'm glad I could help ," his mom replied . <|endoftext|>
" Yes ," Mom my said , smiling
. " Okay ," said Lily , and they
" Yes we did ," said Tom , smiling .
" Yes , it was ," his sister said . "
OK , we forgive him ," they said . " But
They ran and laughed and cheered . Lily made some new
will be , sweet heart ," his mom answered . "
, we can , Luna ," Mia says . " We
Yes , I 'm okay ," Lily said . " But
We are sorry , Mom ," they say . " We

MIN ACTIVATIONS
MIN = -0.296

as he opened the ch ocol ates and started eating them
box and share the ch ocol ates . They are happy
and started eating the ch ocol ates . The shop keeper
noticed something . The ch ocol ates were bitter . They
bled up all the ch ocol ates , and she was
park . When the ch ocol ates were all gone ,
a large selection of ch ocol ates . Ell a picked
started eating the delicious ch ocol ates . She was so
a big box of ch ocol ates . It was so
looking at all the ch ocol ates , Lex i chose
saw all the tasty ch ocol ates inside . She couldn
and they shared the ch ocol ates with everyone in the
a big box of ch ocol ates ! <|endoftext|> Once upon
a big box of ch ocol ates to share with the
her a box of ch ocol ates . Lily opened the
was a box of ch ocol ates . " For me
she paid for the ch ocol ates and the two of
son is eating our ch ocol ates !"
he ate all the ch ocol ates and sweets . Then
a big box of ch ocol ates !

INTERVAL 0.027 - 0.134
CONTAINS 21.836%

They were not depend able friends anymore . Then
turned it on , and it made a funny noise .
delicious ! Lily was happy that she found the lime and
ola was surprised to know that cob we bs could make
One day , a big wind comes and blows

INTERVAL -0.081 - 0.027
CONTAINS 75.513%

time , there was a little boy named Tim my .
We found a huge root . See ?" He showed her
, I want to keep them !" Tom says . "
few months of training , Pete 's brain felt more reliable
and he told her he was exploring . She welcomed him

INTERVAL -0.188 - -0.081
CONTAINS 2.647%

began flying around making aer opl ane noises . Jack soon
her friend when she stumbled upon a strange object . It
while picking fresh flowers in her garden , Pen elope heard
She tried to open the oy ster with her fingers ,
done ?" Mom shouted . " You ruined the chocolate and

INTERVAL -0.296 - -0.188
CONTAINS 0.004%

day , a group of mis chie vous mice came by
enjoyed her cool , ch ocol at y treat !
There 's a cool re ind eer in our yard !"
mom told him to stop mis beh aving and to be
a big box of ch ocol ates !
Token"
Token ID1
Feature activation-6.06e-02

Pos logit contributions
s+0.031
 they+0.031
 said+0.030
 +0.030
+0.028
Neg logit contributions
ocol-0.059
 property-0.058
beh-0.056
esty-0.056
asured-0.054
TokenYes
Token ID5297
Feature activation+2.66e-02

Pos logit contributions
ocol+0.153
 property+0.138
ities+0.130
asured+0.130
isher+0.129
Neg logit contributions
 they-0.084
-0.081
s-0.080
 said-0.079
 -0.079
Token,
Token ID11
Feature activation-7.41e-03

Pos logit contributions
s+0.027
 +0.024
+0.024
 they+0.024
 exclaimed+0.023
Neg logit contributions
ocol-0.071
 property-0.064
isher-0.062
ant-0.061
ities-0.060
Token it
Token ID340
Feature activation+5.50e-02

Pos logit contributions
ocol+0.021
 property+0.019
asured+0.019
beh+0.019
isher+0.018
Neg logit contributions
 they-0.008
s-0.007
 said-0.007
-0.007
 -0.007
Token is
Token ID318
Feature activation-2.27e-03

Pos logit contributions
 they+0.057
s+0.055
+0.051
 +0.050
cakes+0.047
Neg logit contributions
ocol-0.134
 property-0.133
beh-0.124
esty-0.122
asured-0.121
Token,"
Token ID553
Feature activation+1.23e-01

Pos logit contributions
ocol+0.006
 property+0.006
asured+0.006
beh+0.006
isher+0.006
Neg logit contributions
s-0.002
 they-0.002
 -0.002
 said-0.002
-0.002
Token said
Token ID531
Feature activation+7.90e-02

Pos logit contributions
s+0.041
 +0.026
cakes+0.025
+0.025
 they+0.022
Neg logit contributions
ocol-0.385
beh-0.359
 property-0.357
isher-0.357
asured-0.348
Token the
Token ID262
Feature activation+4.00e-02

Pos logit contributions
s+0.084
 they+0.073
 +0.072
+0.071
 exclaimed+0.070
Neg logit contributions
ocol-0.208
 property-0.186
beh-0.185
isher-0.185
asured-0.183
Token other
Token ID584
Feature activation-3.38e-02

Pos logit contributions
s+0.051
 they+0.049
 said+0.047
 +0.047
+0.046
Neg logit contributions
ocol-0.098
 property-0.092
beh-0.086
asured-0.086
ities-0.085
Token brave
Token ID14802
Feature activation-6.76e-03

Pos logit contributions
ocol+0.080
 property+0.073
asured+0.070
beh+0.069
isher+0.069
Neg logit contributions
s-0.047
 they-0.046
 said-0.044
 -0.044
-0.044
Token friend
Token ID1545
Feature activation+5.22e-02

Pos logit contributions
ocol+0.016
 property+0.015
asured+0.014
beh+0.014
ities+0.014
Neg logit contributions
s-0.009
 they-0.009
 said-0.009
-0.009
 -0.009
Token walked
Token ID6807
Feature activation-2.94e-02

Pos logit contributions
s+0.003
 they+0.003
 said+0.003
 +0.003
+0.003
Neg logit contributions
ocol-0.008
 property-0.008
asured-0.007
beh-0.007
isher-0.007
Token away
Token ID1497
Feature activation-6.46e-03

Pos logit contributions
ocol+0.057
 property+0.052
isher+0.050
asured+0.050
beh+0.049
Neg logit contributions
s-0.048
 they-0.046
 said-0.046
 exclaimed-0.046
 -0.046
Tokenâ
Token ID22940
Feature activation-8.70e-02

Pos logit contributions
ocol+0.018
 property+0.017
asured+0.016
beh+0.016
ities+0.015
Neg logit contributions
s-0.006
 they-0.006
 -0.006
 said-0.006
-0.005
Token
Token ID26391
Feature activation+1.32e-02

Pos logit contributions
ocol+0.182
 property+0.156
ities+0.151
asured+0.148
ubb+0.147
Neg logit contributions
 -0.147
s-0.147
-0.146
 said-0.146
 they-0.143
Token
Token ID447
Feature activation-3.99e-02

Pos logit contributions
cakes+0.015
 said+0.015
 exclaimed+0.014
 replied+0.014
 they+0.013
Neg logit contributions
 property-0.031
ocol-0.028
beh-0.028
 than-0.027
asured-0.027
Token
Token ID251
Feature activation+1.23e-01

Pos logit contributions
ocol+0.086
 property+0.080
esty+0.076
isher+0.076
beh+0.076
Neg logit contributions
 they-0.053
 said-0.052
 exclaimed-0.052
s-0.049
-0.047
TokenMary
Token ID24119
Feature activation+5.15e-02

Pos logit contributions
s+0.105
 they+0.092
 exclaimed+0.090
 said+0.088
cakes+0.088
Neg logit contributions
ocol-0.336
 property-0.308
beh-0.303
isher-0.295
asured-0.295
Token feeling
Token ID4203
Feature activation+6.94e-03

Pos logit contributions
 they+0.047
s+0.047
cakes+0.042
+0.041
 +0.040
Neg logit contributions
ocol-0.131
asured-0.124
 property-0.122
beh-0.121
isher-0.118
Token happy
Token ID3772
Feature activation+2.28e-02

Pos logit contributions
s+0.009
 they+0.009
 said+0.008
 +0.008
+0.008
Neg logit contributions
ocol-0.017
 property-0.015
asured-0.015
beh-0.015
isher-0.014
Token and
Token ID290
Feature activation+1.21e-02

Pos logit contributions
s+0.021
 they+0.019
 +0.019
 said+0.018
+0.018
Neg logit contributions
ocol-0.064
 property-0.060
asured-0.058
beh-0.057
ities-0.057
Token a
Token ID257
Feature activation+7.81e-02

Pos logit contributions
s+0.018
cakes+0.017
+0.016
 they+0.016
 exclaimed+0.016
Neg logit contributions
ocol-0.024
 property-0.023
beh-0.023
asured-0.022
esty-0.021
Token all
Token ID477
Feature activation+9.43e-03

Pos logit contributions
ocol+0.011
 property+0.010
asured+0.010
beh+0.010
isher+0.010
Neg logit contributions
s-0.006
 they-0.006
 said-0.005
 -0.005
-0.005
Token smiles
Token ID21845
Feature activation+1.01e-01

Pos logit contributions
s+0.012
 they+0.012
 +0.011
 said+0.011
+0.011
Neg logit contributions
ocol-0.023
 property-0.021
asured-0.020
beh-0.020
isher-0.020
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
s+0.114
 they+0.112
 said+0.106
+0.105
 +0.103
Neg logit contributions
ocol-0.268
 property-0.247
beh-0.236
isher-0.235
asured-0.235
Token John
Token ID1757
Feature activation+3.19e-02

Pos logit contributions
ocol+0.044
 property+0.041
asured+0.040
beh+0.040
isher+0.039
Neg logit contributions
s-0.009
 they-0.008
 said-0.008
 -0.007
 exclaimed-0.007
Token was
Token ID373
Feature activation-9.23e-03

Pos logit contributions
s+0.024
 they+0.023
 said+0.022
 +0.021
+0.021
Neg logit contributions
ocol-0.096
 property-0.090
beh-0.086
ities-0.086
isher-0.086
Token cheered
Token ID34767
Feature activation+1.22e-01

Pos logit contributions
ocol+0.024
 property+0.023
isher+0.022
ities+0.021
beh+0.021
Neg logit contributions
 they-0.010
s-0.010
 -0.010
 said-0.010
-0.010
Token on
Token ID319
Feature activation+2.21e-02

Pos logit contributions
s+0.147
 they+0.145
 said+0.136
+0.136
 +0.136
Neg logit contributions
ocol-0.312
 property-0.288
beh-0.274
asured-0.274
isher-0.273
Token,
Token ID11
Feature activation+1.66e-02

Pos logit contributions
 they+0.020
 +0.020
+0.019
s+0.019
 said+0.017
Neg logit contributions
ocol-0.065
 property-0.060
isher-0.060
beh-0.060
asured-0.059
Token happy
Token ID3772
Feature activation+2.62e-02

Pos logit contributions
s+0.018
 they+0.018
 +0.017
 said+0.017
+0.017
Neg logit contributions
ocol-0.044
 property-0.041
beh-0.039
isher-0.039
asured-0.039
Token to
Token ID284
Feature activation-1.07e-02

Pos logit contributions
s+0.023
 they+0.022
 said+0.021
 +0.021
+0.020
Neg logit contributions
ocol-0.076
 property-0.070
asured-0.068
beh-0.068
isher-0.067
Token have
Token ID423
Feature activation+6.47e-02

Pos logit contributions
ocol+0.028
 property+0.026
asured+0.025
beh+0.025
ities+0.025
Neg logit contributions
s-0.012
 they-0.011
 said-0.010
 -0.010
-0.010
Token
Token ID198
Feature activation+2.39e-02

Pos logit contributions
 they+0.017
s+0.016
 said+0.015
cakes+0.014
+0.012
Neg logit contributions
uzz-0.079
ocol-0.077
uff-0.075
 property-0.074
asured-0.073
Token"
Token ID1
Feature activation-6.15e-02

Pos logit contributions
s+0.028
 they+0.027
 +0.026
 said+0.026
+0.026
Neg logit contributions
ocol-0.062
 property-0.057
asured-0.054
beh-0.054
isher-0.054
TokenYes
Token ID5297
Feature activation+2.49e-02

Pos logit contributions
ocol+0.152
 property+0.139
isher+0.128
asured+0.128
ities+0.127
Neg logit contributions
 they-0.087
-0.086
 -0.084
s-0.082
 said-0.082
Token it
Token ID340
Feature activation+5.50e-02

Pos logit contributions
s+0.026
 +0.023
 they+0.023
+0.023
 said+0.022
Neg logit contributions
ocol-0.066
 property-0.060
isher-0.058
asured-0.057
ant-0.057
Token is
Token ID318
Feature activation-4.91e-03

Pos logit contributions
 they+0.061
s+0.060
+0.055
 +0.055
 exclaimed+0.049
Neg logit contributions
 property-0.129
ocol-0.129
beh-0.120
asured-0.117
ities-0.117
Token,"
Token ID553
Feature activation+1.22e-01

Pos logit contributions
ocol+0.014
 property+0.013
asured+0.012
beh+0.012
ities+0.012
Neg logit contributions
s-0.004
 they-0.004
 -0.004
-0.004
 said-0.004
Token her
Token ID607
Feature activation-6.02e-02

Pos logit contributions
s+0.181
 they+0.164
+0.160
 +0.160
cakes+0.156
Neg logit contributions
ocol-0.252
 property-0.226
asured-0.224
beh-0.224
isher-0.220
Token Mom
Token ID11254
Feature activation+3.32e-02

Pos logit contributions
ocol+0.136
 property+0.126
asured+0.125
isher+0.120
ities+0.118
Neg logit contributions
s-0.081
 they-0.077
 -0.073
 exclaimed-0.072
 said-0.072
Token agreed
Token ID4987
Feature activation+7.02e-02

Pos logit contributions
 they+0.033
s+0.031
+0.027
+0.027
 +0.026
Neg logit contributions
ocol-0.080
asured-0.077
beh-0.077
isher-0.075
ac-0.074
Token.
Token ID13
Feature activation-9.12e-03

Pos logit contributions
s+0.090
+0.086
cakes+0.084
lic+0.083
ves+0.082
Neg logit contributions
ocol-0.128
uzz-0.123
asured-0.122
beh-0.122
 than-0.121
Token
Token ID220
Feature activation+4.58e-02

Pos logit contributions
ocol+0.023
 property+0.021
asured+0.021
beh+0.020
uzz+0.020
Neg logit contributions
s-0.009
 said-0.009
 they-0.008
 exclaimed-0.008
cakes-0.007
Token
Token ID198
Feature activation+3.13e-02

Pos logit contributions
 they+0.025
cakes+0.024
 announced+0.024
s+0.022
 arrived+0.021
Neg logit contributions
uzz-0.090
ocol-0.089
uff-0.089
beh-0.084
asured-0.084
Token"
Token ID1
Feature activation-5.55e-02

Pos logit contributions
s+0.041
 they+0.040
 +0.038
 said+0.038
+0.036
Neg logit contributions
ocol-0.071
 property-0.068
beh-0.065
asured-0.064
isher-0.064
TokenOkay
Token ID16454
Feature activation-2.38e-02

Pos logit contributions
ocol+0.136
 property+0.122
ities+0.113
 appearance+0.113
asured+0.112
Neg logit contributions
 they-0.082
-0.077
s-0.077
 said-0.077
 -0.073
Token,
Token ID11
Feature activation-1.77e-03

Pos logit contributions
ocol+0.064
 property+0.061
isher+0.057
asured+0.057
ities+0.057
Neg logit contributions
s-0.022
 they-0.019
 -0.018
 exclaimed-0.018
-0.018
Token Lily
Token ID20037
Feature activation+4.48e-02

Pos logit contributions
ocol+0.005
 property+0.005
beh+0.005
ities+0.005
isher+0.005
Neg logit contributions
 they-0.002
 said-0.002
-0.002
 -0.002
s-0.001
Token,"
Token ID553
Feature activation+1.22e-01

Pos logit contributions
 they+0.045
s+0.044
cakes+0.044
 exclaimed+0.040
 +0.040
Neg logit contributions
 property-0.108
ocol-0.103
asured-0.098
esty-0.098
 than-0.098
Token said
Token ID531
Feature activation+7.68e-02

Pos logit contributions
s+0.030
 they+0.019
 +0.017
+0.016
cakes+0.010
Neg logit contributions
ocol-0.404
 property-0.382
beh-0.376
isher-0.372
asured-0.372
Token Tim
Token ID5045
Feature activation-5.75e-03

Pos logit contributions
s+0.114
 they+0.104
 +0.102
+0.102
 said+0.100
Neg logit contributions
ocol-0.170
 property-0.150
beh-0.149
isher-0.148
asured-0.147
Tokenmy
Token ID1820
Feature activation+4.01e-02

Pos logit contributions
ocol+0.011
 property+0.011
asured+0.010
beh+0.010
isher+0.010
Neg logit contributions
 they-0.009
s-0.009
 -0.009
-0.008
 said-0.008
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
 they+0.044
cakes+0.044
s+0.041
+0.039
+0.038
Neg logit contributions
beh-0.087
asured-0.087
ocol-0.084
 property-0.084
esty-0.083
Token "
Token ID366
Feature activation-8.18e-02

Pos logit contributions
ocol+0.040
 property+0.037
asured+0.036
beh+0.036
isher+0.035
Neg logit contributions
s-0.014
 they-0.013
 said-0.012
 exclaimed-0.012
 -0.011
Token We
Token ID775
Feature activation+2.54e-02

Pos logit contributions
ocol+0.054
asured+0.047
 property+0.047
beh+0.047
isher+0.046
Neg logit contributions
-0.031
 they-0.030
 -0.030
s-0.028
 said-0.027
Token will
Token ID481
Feature activation+4.88e-02

Pos logit contributions
s+0.039
 they+0.038
 +0.037
 said+0.037
+0.036
Neg logit contributions
ocol-0.056
 property-0.051
asured-0.049
beh-0.048
isher-0.048
Token pray
Token ID12472
Feature activation+5.56e-02

Pos logit contributions
s+0.071
 they+0.068
 +0.064
 said+0.064
 exclaimed+0.063
Neg logit contributions
ocol-0.110
 property-0.104
asured-0.097
ities-0.096
beh-0.095
Token and
Token ID290
Feature activation+1.91e-02

Pos logit contributions
 they+0.056
s+0.055
+0.052
 said+0.050
 +0.048
Neg logit contributions
ocol-0.160
 property-0.145
beh-0.143
isher-0.141
ities-0.139
Token hope
Token ID2911
Feature activation+4.52e-02

Pos logit contributions
 they+0.026
s+0.026
 said+0.025
+0.025
 +0.025
Neg logit contributions
ocol-0.046
 property-0.042
isher-0.040
beh-0.040
asured-0.040
Token,"
Token ID553
Feature activation+1.22e-01

Pos logit contributions
s+0.050
 they+0.047
 +0.046
 said+0.046
+0.045
Neg logit contributions
ocol-0.118
 property-0.110
asured-0.104
ities-0.103
beh-0.103
Token Lily
Token ID20037
Feature activation+3.87e-02

Pos logit contributions
s+0.175
 exclaimed+0.162
cakes+0.161
 +0.153
lic+0.152
Neg logit contributions
ocol-0.261
 property-0.216
isher-0.214
ities-0.211
esty-0.211
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
cakes+0.031
't+0.030
 they+0.029
s+0.029
Everyone+0.027
Neg logit contributions
asured-0.092
esty-0.085
ities-0.084
 property-0.083
ac-0.080
Token Ben
Token ID3932
Feature activation+4.36e-02

Pos logit contributions
 they+0.013
s+0.013
 said+0.013
 +0.013
+0.013
Neg logit contributions
ocol-0.026
 property-0.024
beh-0.023
asured-0.023
isher-0.022
Token said
Token ID531
Feature activation+7.19e-02

Pos logit contributions
cakes+0.029
s+0.025
 they+0.023
't+0.020
+0.019
Neg logit contributions
ities-0.110
 property-0.109
asured-0.108
esty-0.107
beh-0.105
Token.
Token ID13
Feature activation-1.78e-02

Pos logit contributions
s+0.074
cakes+0.072
 said+0.071
 exclaimed+0.071
 +0.069
Neg logit contributions
ocol-0.170
 property-0.162
ities-0.160
beh-0.158
asured-0.158
Token"
Token ID1
Feature activation-6.04e-02

Pos logit contributions
s+0.030
 they+0.029
+0.028
 +0.028
 said+0.028
Neg logit contributions
ocol-0.058
 property-0.054
asured-0.051
beh-0.051
isher-0.051
TokenYes
Token ID5297
Feature activation+2.56e-02

Pos logit contributions
ocol+0.141
 property+0.129
asured+0.120
isher+0.120
ities+0.119
Neg logit contributions
 they-0.090
s-0.088
-0.086
 -0.085
 said-0.085
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
s+0.024
 they+0.021
 said+0.021
 +0.021
+0.021
Neg logit contributions
ocol-0.071
 property-0.066
isher-0.063
asured-0.062
ities-0.062
Token it
Token ID340
Feature activation+5.40e-02

Pos logit contributions
ocol+0.032
 property+0.029
beh+0.029
asured+0.028
ities+0.028
Neg logit contributions
 they-0.011
 said-0.009
s-0.009
 -0.009
-0.009
Token did
Token ID750
Feature activation+5.32e-02

Pos logit contributions
 they+0.050
s+0.049
+0.046
 +0.045
cakes+0.043
Neg logit contributions
 property-0.134
ocol-0.133
beh-0.127
asured-0.123
esty-0.123
Token,"
Token ID553
Feature activation+1.21e-01

Pos logit contributions
s+0.051
 they+0.047
 +0.046
+0.045
 said+0.045
Neg logit contributions
ocol-0.145
 property-0.137
beh-0.129
asured-0.128
ities-0.127
Token Mom
Token ID11254
Feature activation+2.95e-02

Pos logit contributions
s+0.170
 they+0.156
 +0.151
+0.151
 exclaimed+0.147
Neg logit contributions
ocol-0.265
 property-0.239
beh-0.234
asured-0.232
isher-0.229
Tokenmy
Token ID1820
Feature activation+2.95e-02

Pos logit contributions
 they+0.053
s+0.050
+0.048
 +0.048
+0.047
Neg logit contributions
ocol-0.048
asured-0.046
beh-0.045
isher-0.044
uzz-0.043
Token smiled
Token ID13541
Feature activation+5.64e-02

Pos logit contributions
s+0.016
+0.016
 they+0.014
 +0.013
+0.013
Neg logit contributions
ocol-0.075
asured-0.074
beh-0.074
isher-0.073
ant-0.072
Token,
Token ID11
Feature activation+4.29e-03

Pos logit contributions
s+0.054
 they+0.052
 +0.049
 exclaimed+0.048
 said+0.048
Neg logit contributions
ocol-0.148
 property-0.138
asured-0.136
beh-0.133
ities-0.133
Token relieved
Token ID22598
Feature activation+3.20e-02

Pos logit contributions
s+0.005
 they+0.005
 said+0.005
+0.005
 +0.005
Neg logit contributions
ocol-0.010
 property-0.010
asured-0.009
ities-0.009
beh-0.009
Token hands
Token ID2832
Feature activation+3.06e-02

Pos logit contributions
ocol+0.066
 property+0.059
asured+0.056
beh+0.055
isher+0.055
Neg logit contributions
s-0.068
 they-0.066
 said-0.065
 -0.064
-0.063
Token and
Token ID290
Feature activation+5.53e-03

Pos logit contributions
s+0.029
 they+0.028
 said+0.027
 +0.026
+0.026
Neg logit contributions
ocol-0.084
 property-0.078
asured-0.075
beh-0.075
isher-0.074
Token paws
Token ID49193
Feature activation+1.40e-02

Pos logit contributions
s+0.005
 they+0.004
+0.004
 +0.004
 said+0.004
Neg logit contributions
ocol-0.015
 property-0.014
beh-0.014
asured-0.014
isher-0.014
Token,
Token ID11
Feature activation+2.06e-03

Pos logit contributions
s+0.015
 they+0.014
+0.014
 +0.014
 said+0.014
Neg logit contributions
ocol-0.035
 property-0.032
asured-0.031
beh-0.031
ities-0.031
Token and
Token ID290
Feature activation+1.31e-02

Pos logit contributions
 they+0.002
s+0.002
 said+0.002
 +0.002
+0.002
Neg logit contributions
ocol-0.006
 property-0.005
isher-0.005
beh-0.005
asured-0.005
Token cheered
Token ID34767
Feature activation+1.21e-01

Pos logit contributions
 they+0.013
 said+0.012
s+0.012
 +0.011
+0.011
Neg logit contributions
ocol-0.038
 property-0.036
asured-0.034
isher-0.034
beh-0.034
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 they+0.134
s+0.131
+0.121
 said+0.119
 +0.118
Neg logit contributions
ocol-0.333
 property-0.308
isher-0.295
asured-0.295
beh-0.294
Token "
Token ID366
Feature activation-7.60e-02

Pos logit contributions
s+0.013
 they+0.012
 said+0.012
 +0.012
+0.012
Neg logit contributions
ocol-0.030
 property-0.028
asured-0.026
beh-0.026
isher-0.026
TokenY
Token ID56
Feature activation-7.72e-02

Pos logit contributions
ocol+0.176
 property+0.165
ities+0.152
asured+0.151
isher+0.150
Neg logit contributions
s-0.114
 they-0.112
-0.112
 -0.109
 said-0.107
Tokenay
Token ID323
Feature activation+3.26e-03

Pos logit contributions
ocol+0.161
 property+0.145
beh+0.138
isher+0.137
asured+0.137
Neg logit contributions
 they-0.125
s-0.125
 -0.120
 said-0.120
-0.119
Token,
Token ID11
Feature activation-1.42e-02

Pos logit contributions
s+0.003
 they+0.003
 +0.003
+0.003
 said+0.003
Neg logit contributions
ocol-0.009
 property-0.008
beh-0.008
isher-0.008
ities-0.008
Token friend
Token ID1545
Feature activation+5.60e-02

Pos logit contributions
ocol+0.040
 property+0.037
asured+0.035
beh+0.035
isher+0.035
Neg logit contributions
s-0.022
 they-0.021
 -0.020
 said-0.020
-0.020
Token a
Token ID257
Feature activation+7.07e-02

Pos logit contributions
s+0.078
 they+0.075
 +0.073
 exclaimed+0.071
 said+0.071
Neg logit contributions
ocol-0.126
 property-0.116
beh-0.111
asured-0.110
isher-0.109
Token girl
Token ID2576
Feature activation+4.69e-02

Pos logit contributions
 they+0.103
s+0.102
+0.099
 +0.099
 said+0.098
Neg logit contributions
ocol-0.166
 property-0.149
ities-0.144
beh-0.142
asured-0.142
Token could
Token ID714
Feature activation+5.94e-03

Pos logit contributions
s+0.045
 they+0.043
 +0.041
+0.040
 said+0.040
Neg logit contributions
ocol-0.128
 property-0.119
asured-0.114
beh-0.114
isher-0.114
Token have
Token ID423
Feature activation+6.29e-02

Pos logit contributions
s+0.008
 they+0.007
 +0.007
 said+0.007
+0.007
Neg logit contributions
ocol-0.015
 property-0.014
asured-0.013
beh-0.013
ities-0.013
Token,"
Token ID553
Feature activation+1.21e-01

Pos logit contributions
s+0.089
 they+0.085
 +0.084
+0.081
 said+0.077
Neg logit contributions
ocol-0.140
 property-0.130
beh-0.126
asured-0.125
ities-0.119
Token she
Token ID673
Feature activation+2.78e-02

Pos logit contributions
s+0.049
 they+0.038
 +0.031
cakes+0.031
+0.030
Neg logit contributions
ocol-0.378
 property-0.354
beh-0.352
asured-0.344
isher-0.343
Token said
Token ID531
Feature activation+7.18e-02

Pos logit contributions
s+0.016
cakes+0.013
 they+0.013
+0.011
 +0.011
Neg logit contributions
beh-0.075
asured-0.074
ocol-0.074
ac-0.071
uzz-0.071
Token with
Token ID351
Feature activation-1.91e-02

Pos logit contributions
s+0.079
cakes+0.073
 they+0.070
 said+0.069
 exclaimed+0.069
Neg logit contributions
ocol-0.168
beh-0.158
 property-0.156
asured-0.155
uzz-0.153
Token a
Token ID257
Feature activation+6.43e-02

Pos logit contributions
ocol+0.052
 property+0.048
asured+0.047
ities+0.046
isher+0.046
Neg logit contributions
s-0.019
 they-0.018
 said-0.016
 -0.016
 exclaimed-0.016
Token smile
Token ID8212
Feature activation+6.10e-02

Pos logit contributions
s+0.091
cakes+0.091
 they+0.090
 exclaimed+0.084
ves+0.078
Neg logit contributions
ocol-0.125
 property-0.115
ubb-0.109
 bruised-0.107
asured-0.106
Token'm
Token ID1101
Feature activation-2.32e-02

Pos logit contributions
ocol+0.019
 property+0.017
asured+0.017
beh+0.016
isher+0.016
Neg logit contributions
s-0.013
 they-0.013
 -0.013
 said-0.012
-0.012
Token glad
Token ID9675
Feature activation+2.77e-02

Pos logit contributions
ocol+0.046
asured+0.042
 property+0.041
beh+0.040
isher+0.038
Neg logit contributions
s-0.041
 they-0.036
cakes-0.035
 exclaimed-0.035
 said-0.035
Token I
Token ID314
Feature activation-1.79e-03

Pos logit contributions
s+0.037
 they+0.036
 +0.035
 said+0.035
+0.034
Neg logit contributions
ocol-0.066
 property-0.061
asured-0.058
beh-0.057
isher-0.057
Token could
Token ID714
Feature activation+1.57e-02

Pos logit contributions
ocol+0.004
 property+0.004
asured+0.004
beh+0.004
isher+0.004
Neg logit contributions
s-0.002
 they-0.002
 -0.002
-0.002
 exclaimed-0.002
Token help
Token ID1037
Feature activation+2.48e-02

Pos logit contributions
s+0.024
 they+0.023
 +0.022
 said+0.022
+0.022
Neg logit contributions
ocol-0.034
 property-0.032
asured-0.030
beh-0.030
isher-0.030
Token,"
Token ID553
Feature activation+1.21e-01

Pos logit contributions
s+0.037
 they+0.035
 +0.035
 said+0.034
+0.033
Neg logit contributions
ocol-0.056
 property-0.052
asured-0.049
ities-0.048
beh-0.048
Token his
Token ID465
Feature activation-6.92e-02

Pos logit contributions
s+0.200
cakes+0.184
 +0.183
 exclaimed+0.181
+0.181
Neg logit contributions
ocol-0.228
asured-0.205
 property-0.200
beh-0.198
isher-0.198
Token mom
Token ID1995
Feature activation+4.43e-02

Pos logit contributions
ocol+0.179
 property+0.166
asured+0.160
isher+0.158
beh+0.156
Neg logit contributions
s-0.080
 they-0.077
 -0.072
 said-0.072
-0.071
Token replied
Token ID8712
Feature activation+6.84e-02

Pos logit contributions
 they+0.032
cakes+0.029
ves+0.029
s+0.028
+0.028
Neg logit contributions
asured-0.111
beh-0.106
ac-0.102
isher-0.102
uzz-0.100
Token.
Token ID13
Feature activation-6.52e-03

Pos logit contributions
s+0.074
 said+0.068
 they+0.068
 +0.066
 exclaimed+0.065
Neg logit contributions
ocol-0.171
asured-0.157
beh-0.156
 property-0.154
ities-0.152
Token<|endoftext|>
Token ID50256
Feature activation-1.48e-02

Pos logit contributions
ocol+0.017
uzz+0.016
 property+0.016
asured+0.016
uff+0.016
Neg logit contributions
s-0.004
 said-0.004
 exclaimed-0.004
 they-0.004
cakes-0.003
Token
Token ID220
Feature activation+4.53e-02

Pos logit contributions
ocol+0.010
 property+0.009
asured+0.008
beh+0.008
isher+0.008
Neg logit contributions
s-0.005
 they-0.005
 -0.004
 said-0.004
-0.004
Token
Token ID198
Feature activation+3.21e-02

Pos logit contributions
s+0.011
 they+0.011
 said+0.009
 exclaimed+0.006
 +0.004
Neg logit contributions
ocol-0.151
 property-0.142
asured-0.139
beh-0.138
isher-0.136
Token
Token ID198
Feature activation+2.57e-02

Pos logit contributions
 they+0.013
s+0.012
 said+0.011
 exclaimed+0.009
cakes+0.008
Neg logit contributions
ocol-0.096
asured-0.089
 property-0.088
uzz-0.087
beh-0.087
Token"
Token ID1
Feature activation-5.99e-02

Pos logit contributions
s+0.029
 they+0.029
+0.028
 +0.027
 said+0.027
Neg logit contributions
ocol-0.069
 property-0.064
ities-0.062
isher-0.061
asured-0.060
TokenYes
Token ID5297
Feature activation+2.68e-02

Pos logit contributions
ocol+0.138
 property+0.128
asured+0.119
beh+0.118
ities+0.118
Neg logit contributions
 they-0.090
s-0.089
 said-0.086
-0.086
 -0.085
Token,"
Token ID553
Feature activation+1.21e-01

Pos logit contributions
s+0.026
 they+0.024
 +0.023
+0.023
 said+0.023
Neg logit contributions
ocol-0.073
 property-0.067
isher-0.065
asured-0.064
ities-0.063
Token Mom
Token ID11254
Feature activation+3.95e-02

Pos logit contributions
s+0.081
 they+0.068
+0.064
 +0.064
 exclaimed+0.056
Neg logit contributions
ocol-0.360
 property-0.333
beh-0.325
isher-0.324
asured-0.323
Tokenmy
Token ID1820
Feature activation+3.71e-02

Pos logit contributions
 they+0.067
s+0.065
+0.060
+0.060
 +0.060
Neg logit contributions
ocol-0.070
asured-0.066
beh-0.064
isher-0.064
 property-0.061
Token said
Token ID531
Feature activation+7.62e-02

Pos logit contributions
s+0.022
+0.021
't+0.020
 they+0.019
cakes+0.018
Neg logit contributions
ocol-0.092
asured-0.090
isher-0.090
beh-0.089
ac-0.087
Token,
Token ID11
Feature activation-1.37e-04

Pos logit contributions
s+0.077
 they+0.070
+0.068
 +0.068
 said+0.067
Neg logit contributions
ocol-0.198
 property-0.181
asured-0.179
beh-0.179
isher-0.178
Token smiling
Token ID16755
Feature activation+1.56e-02

Pos logit contributions
ocol+0.000
 property+0.000
asured+0.000
beh+0.000
isher+0.000
Neg logit contributions
s-0.000
 they-0.000
 said-0.000
 -0.000
-0.000
Token.
Token ID13
Feature activation-5.99e-03

Pos logit contributions
s+0.022
 they+0.022
 +0.020
+0.020
 said+0.020
Neg logit contributions
ocol-0.069
 property-0.064
beh-0.062
asured-0.062
isher-0.061
Token
Token ID198
Feature activation+3.19e-02

Pos logit contributions
uzz+0.012
 Acc+0.011
esty+0.011
 property+0.011
asured+0.011
Neg logit contributions
ves-0.006
odic-0.006
vation-0.005
ons-0.005
 story-0.005
Token
Token ID198
Feature activation+2.47e-02

Pos logit contributions
cakes+0.035
 announced+0.032
 arrived+0.031
 couple+0.031
 timer+0.030
Neg logit contributions
uzz-0.060
uff-0.058
 property-0.057
 yourself-0.054
rum-0.054
Token"
Token ID1
Feature activation-6.19e-02

Pos logit contributions
s+0.032
 they+0.032
 +0.031
 said+0.030
+0.030
Neg logit contributions
ocol-0.056
 property-0.054
asured-0.051
beh-0.050
esty-0.050
TokenOkay
Token ID16454
Feature activation-2.63e-02

Pos logit contributions
ocol+0.142
 property+0.126
ities+0.119
asured+0.118
beh+0.118
Neg logit contributions
 they-0.099
s-0.098
 said-0.094
-0.093
 -0.090
Token,"
Token ID553
Feature activation+1.21e-01

Pos logit contributions
ocol+0.072
 property+0.069
isher+0.065
asured+0.065
ities+0.065
Neg logit contributions
s-0.025
 they-0.022
 -0.021
-0.020
 exclaimed-0.020
Token said
Token ID531
Feature activation+7.60e-02

Pos logit contributions
s+0.054
 +0.046
't+0.045
cakes+0.044
+0.042
Neg logit contributions
ocol-0.330
isher-0.315
beh-0.312
esty-0.312
 property-0.311
Token Lily
Token ID20037
Feature activation+3.20e-02

Pos logit contributions
s+0.117
cakes+0.106
 exclaimed+0.102
 +0.100
+0.099
Neg logit contributions
ocol-0.156
isher-0.133
asured-0.133
beh-0.129
uzz-0.127
Token,
Token ID11
Feature activation-6.88e-03

Pos logit contributions
 they+0.033
s+0.031
cakes+0.030
+0.028
 +0.028
Neg logit contributions
ocol-0.075
asured-0.075
 property-0.074
beh-0.074
esty-0.071
Token and
Token ID290
Feature activation+7.45e-03

Pos logit contributions
ocol+0.016
 property+0.015
isher+0.014
beh+0.014
asured+0.014
Neg logit contributions
 they-0.010
s-0.010
 -0.009
 said-0.009
-0.009
Token they
Token ID484
Feature activation+7.47e-02

Pos logit contributions
s+0.006
 +0.005
+0.005
 they+0.005
 said+0.005
Neg logit contributions
ocol-0.021
 property-0.020
asured-0.019
beh-0.019
isher-0.019
Token
Token ID198
Feature activation+2.25e-02

Pos logit contributions
 they+0.020
s+0.018
cakes+0.017
 said+0.016
 arrived+0.013
Neg logit contributions
ocol-0.075
uzz-0.070
uff-0.069
 property-0.069
ant-0.068
Token"
Token ID1
Feature activation-6.25e-02

Pos logit contributions
s+0.029
 they+0.028
 said+0.027
 +0.027
 exclaimed+0.025
Neg logit contributions
ocol-0.051
 property-0.049
beh-0.046
asured-0.046
ant-0.045
TokenYes
Token ID5297
Feature activation+2.69e-02

Pos logit contributions
ocol+0.155
 property+0.144
 appearance+0.133
asured+0.131
ities+0.131
Neg logit contributions
 they-0.087
-0.086
 -0.083
 said-0.083
s-0.081
Token we
Token ID356
Feature activation+3.80e-02

Pos logit contributions
s+0.025
 they+0.023
+0.022
 +0.022
 said+0.022
Neg logit contributions
ocol-0.074
 property-0.069
isher-0.066
asured-0.065
ities-0.065
Token did
Token ID750
Feature activation+5.27e-02

Pos logit contributions
s+0.054
 they+0.052
 +0.050
+0.050
 said+0.049
Neg logit contributions
ocol-0.088
 property-0.081
asured-0.077
ities-0.076
isher-0.076
Token,"
Token ID553
Feature activation+1.21e-01

Pos logit contributions
s+0.050
 they+0.046
 +0.044
cakes+0.043
+0.043
Neg logit contributions
ocol-0.142
 property-0.133
asured-0.124
beh-0.124
ities-0.123
Token said
Token ID531
Feature activation+7.63e-02

Pos logit contributions
s+0.045
cakes+0.032
 they+0.029
+0.029
 +0.027
Neg logit contributions
ocol-0.355
 property-0.337
beh-0.335
ities-0.328
isher-0.328
Token Tom
Token ID4186
Feature activation+3.68e-02

Pos logit contributions
s+0.111
 +0.099
+0.098
 exclaimed+0.098
 they+0.097
Neg logit contributions
ocol-0.161
isher-0.142
beh-0.142
 property-0.141
ities-0.140
Token,
Token ID11
Feature activation-3.12e-03

Pos logit contributions
s+0.032
 they+0.032
 +0.029
+0.029
 said+0.028
Neg logit contributions
ocol-0.102
 property-0.095
asured-0.093
beh-0.092
ities-0.091
Token smiling
Token ID16755
Feature activation+1.28e-02

Pos logit contributions
ocol+0.008
 property+0.007
beh+0.007
isher+0.007
asured+0.007
Neg logit contributions
 they-0.004
s-0.004
 -0.004
 said-0.004
-0.004
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
s+0.011
 they+0.010
 said+0.010
+0.009
 +0.009
Neg logit contributions
ocol-0.038
 property-0.036
beh-0.034
isher-0.034
asured-0.034
Token"
Token ID1
Feature activation-6.25e-02

Pos logit contributions
s+0.031
 they+0.030
 said+0.029
 +0.029
+0.029
Neg logit contributions
ocol-0.063
 property-0.058
asured-0.055
beh-0.055
isher-0.055
TokenYes
Token ID5297
Feature activation+2.55e-02

Pos logit contributions
ocol+0.162
 property+0.149
 appearance+0.141
ities+0.138
asured+0.137
Neg logit contributions
 they-0.089
-0.083
 said-0.080
s-0.079
 -0.076
Token,
Token ID11
Feature activation-1.10e-02

Pos logit contributions
s+0.025
 +0.022
+0.021
 exclaimed+0.021
 they+0.021
Neg logit contributions
ocol-0.069
 property-0.063
isher-0.061
ant-0.059
ities-0.059
Token it
Token ID340
Feature activation+5.42e-02

Pos logit contributions
ocol+0.031
 property+0.029
beh+0.028
asured+0.028
isher+0.028
Neg logit contributions
 they-0.012
 said-0.010
s-0.010
-0.010
 -0.010
Token was
Token ID373
Feature activation-3.05e-03

Pos logit contributions
s+0.047
 they+0.047
+0.043
 +0.043
cakes+0.039
Neg logit contributions
 property-0.140
ocol-0.140
beh-0.134
asured-0.128
esty-0.126
Token,"
Token ID553
Feature activation+1.21e-01

Pos logit contributions
ocol+0.007
 property+0.006
asured+0.006
beh+0.006
isher+0.006
Neg logit contributions
s-0.005
 they-0.004
 -0.004
-0.004
 said-0.004
Token his
Token ID465
Feature activation-7.31e-02

Pos logit contributions
s+0.155
 they+0.138
 +0.138
+0.137
cakes+0.135
Neg logit contributions
ocol-0.280
 property-0.256
beh-0.250
asured-0.249
isher-0.245
Token sister
Token ID6621
Feature activation+6.26e-02

Pos logit contributions
ocol+0.112
asured+0.100
 property+0.099
isher+0.094
ities+0.093
Neg logit contributions
s-0.151
 they-0.144
 -0.144
 exclaimed-0.142
-0.140
Token said
Token ID531
Feature activation+7.28e-02

Pos logit contributions
s+0.033
 they+0.029
cakes+0.029
+0.026
+0.024
Neg logit contributions
beh-0.164
ocol-0.163
asured-0.163
 property-0.158
ant-0.156
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
s+0.075
 they+0.068
 said+0.067
+0.066
 +0.066
Neg logit contributions
ocol-0.181
 property-0.169
beh-0.167
asured-0.166
ities-0.164
Token "
Token ID366
Feature activation-8.27e-02

Pos logit contributions
ocol+0.031
 property+0.029
asured+0.028
uzz+0.028
beh+0.028
Neg logit contributions
s-0.008
 they-0.007
cakes-0.007
 said-0.007
 exclaimed-0.007
TokenOK
Token ID11380
Feature activation-3.52e-03

Pos logit contributions
ocol+0.119
 property+0.107
beh+0.102
isher+0.101
asured+0.101
Neg logit contributions
s-0.119
 they-0.115
 -0.113
 said-0.111
-0.111
Token,
Token ID11
Feature activation-1.18e-02

Pos logit contributions
ocol+0.008
 property+0.007
 appearance+0.007
tons+0.007
 than+0.007
Neg logit contributions
cakes-0.004
s-0.004
 exclaimed-0.004
lic-0.004
 they-0.004
Token we
Token ID356
Feature activation+4.56e-02

Pos logit contributions
ocol+0.029
 property+0.026
asured+0.026
beh+0.025
isher+0.025
Neg logit contributions
 they-0.016
s-0.016
 said-0.016
-0.015
 -0.015
Token forgive
Token ID20927
Feature activation+3.65e-02

Pos logit contributions
s+0.066
 they+0.065
 said+0.064
 +0.063
+0.063
Neg logit contributions
ocol-0.107
 property-0.096
asured-0.093
beh-0.092
isher-0.092
Token him
Token ID683
Feature activation-2.85e-02

Pos logit contributions
s+0.053
 they+0.052
 +0.050
 said+0.050
+0.049
Neg logit contributions
ocol-0.084
 property-0.077
asured-0.073
beh-0.073
isher-0.072
Token,"
Token ID553
Feature activation+1.20e-01

Pos logit contributions
ocol+0.070
 property+0.070
asured+0.066
beh+0.066
ities+0.064
Neg logit contributions
 they-0.028
s-0.027
 -0.026
cakes-0.025
-0.024
Token they
Token ID484
Feature activation+7.97e-02

Pos logit contributions
s+0.084
 exclaimed+0.073
cakes+0.071
 +0.071
+0.063
Neg logit contributions
ocol-0.333
 property-0.302
beh-0.293
isher-0.293
asured-0.289
Token said
Token ID531
Feature activation+7.54e-02

Pos logit contributions
cakes+0.055
s+0.048
 they+0.044
+0.042
't+0.041
Neg logit contributions
beh-0.200
ocol-0.197
ities-0.195
esty-0.194
asured-0.191
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
cakes+0.091
+0.088
 exclaimed+0.085
s+0.084
 said+0.081
Neg logit contributions
ocol-0.159
beh-0.150
uzz-0.150
ities-0.149
 property-0.147
Token "
Token ID366
Feature activation-8.17e-02

Pos logit contributions
ocol+0.032
 property+0.032
esty+0.032
uzz+0.031
 than+0.031
Neg logit contributions
 exclaimed-0.018
cakes-0.017
s-0.017
 said-0.016
 replied-0.015
TokenBut
Token ID1537
Feature activation-3.09e-02

Pos logit contributions
ocol+0.212
 property+0.189
asured+0.181
isher+0.181
beh+0.179
Neg logit contributions
 they-0.105
s-0.104
-0.103
 -0.097
 said-0.095
Token They
Token ID1119
Feature activation+2.50e-02

Pos logit contributions
ocol+0.057
beh+0.051
 property+0.051
ities+0.050
ubb+0.050
Neg logit contributions
 said-0.013
-0.013
s-0.013
 they-0.012
 -0.011
Token ran
Token ID4966
Feature activation+2.44e-04

Pos logit contributions
s+0.029
 they+0.028
 said+0.027
 +0.027
+0.026
Neg logit contributions
ocol-0.065
 property-0.060
asured-0.058
beh-0.057
isher-0.057
Token and
Token ID290
Feature activation+1.52e-02

Pos logit contributions
s+0.000
 they+0.000
 said+0.000
 +0.000
+0.000
Neg logit contributions
ocol-0.001
 property-0.001
asured-0.001
beh-0.001
isher-0.001
Token laughed
Token ID13818
Feature activation+8.32e-02

Pos logit contributions
s+0.017
+0.015
 they+0.015
 +0.015
 exclaimed+0.014
Neg logit contributions
ocol-0.038
 property-0.036
beh-0.035
asured-0.034
isher-0.034
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
s+0.089
 they+0.086
 +0.082
 said+0.082
+0.081
Neg logit contributions
ocol-0.220
 property-0.204
asured-0.195
beh-0.194
ities-0.194
Token cheered
Token ID34767
Feature activation+1.20e-01

Pos logit contributions
s+0.011
 they+0.010
 +0.009
 said+0.009
+0.009
Neg logit contributions
ocol-0.033
 property-0.030
asured-0.029
beh-0.029
isher-0.029
Token.
Token ID13
Feature activation-9.89e-03

Pos logit contributions
 they+0.114
s+0.114
+0.104
 said+0.103
 +0.102
Neg logit contributions
ocol-0.344
 property-0.318
asured-0.305
isher-0.305
beh-0.305
Token Lily
Token ID20037
Feature activation+3.30e-02

Pos logit contributions
ocol+0.030
 property+0.028
beh+0.027
isher+0.027
asured+0.027
Neg logit contributions
 they-0.008
-0.008
s-0.008
 -0.008
 said-0.007
Token made
Token ID925
Feature activation+4.90e-02

Pos logit contributions
s+0.031
 they+0.031
 +0.029
+0.028
cakes+0.028
Neg logit contributions
asured-0.077
 property-0.077
ocol-0.077
esty-0.074
ities-0.073
Token some
Token ID617
Feature activation+2.91e-02

Pos logit contributions
s+0.052
 they+0.047
 +0.045
 said+0.044
+0.044
Neg logit contributions
ocol-0.130
 property-0.121
asured-0.117
beh-0.117
isher-0.115
Token new
Token ID649
Feature activation-2.19e-02

Pos logit contributions
s+0.037
 they+0.036
 said+0.035
 +0.034
+0.034
Neg logit contributions
ocol-0.072
 property-0.067
asured-0.065
beh-0.064
isher-0.064
Token will
Token ID481
Feature activation+3.41e-02

Pos logit contributions
 they+0.053
s+0.052
 +0.050
+0.049
 exclaimed+0.046
Neg logit contributions
ocol-0.110
 property-0.107
beh-0.099
asured-0.099
ities-0.098
Token be
Token ID307
Feature activation-4.11e-02

Pos logit contributions
s+0.045
 they+0.044
 +0.041
 said+0.041
+0.041
Neg logit contributions
ocol-0.082
 property-0.077
asured-0.072
beh-0.072
ities-0.071
Token,
Token ID11
Feature activation-3.42e-03

Pos logit contributions
ocol+0.108
 property+0.099
isher+0.096
ities+0.096
beh+0.094
Neg logit contributions
 they-0.048
s-0.048
 said-0.047
 -0.046
-0.046
Token sweet
Token ID6029
Feature activation+2.75e-02

Pos logit contributions
ocol+0.009
beh+0.008
 property+0.008
asured+0.008
ities+0.008
Neg logit contributions
 they-0.005
 said-0.005
-0.004
 -0.004
s-0.004
Tokenheart
Token ID11499
Feature activation-6.38e-03

Pos logit contributions
s+0.050
 they+0.049
 +0.048
+0.047
 said+0.047
Neg logit contributions
ocol-0.053
 property-0.048
beh-0.044
isher-0.044
asured-0.044
Token,"
Token ID553
Feature activation+1.20e-01

Pos logit contributions
ocol+0.020
 property+0.019
asured+0.018
beh+0.018
ities+0.018
Neg logit contributions
s-0.004
 they-0.004
 said-0.003
-0.003
 -0.003
Token his
Token ID465
Feature activation-7.66e-02

Pos logit contributions
s+0.147
 they+0.130
 +0.126
+0.125
cakes+0.122
Neg logit contributions
ocol-0.287
 property-0.264
asured-0.259
isher-0.257
beh-0.257
Token mom
Token ID1995
Feature activation+4.20e-02

Pos logit contributions
ocol+0.215
 property+0.198
beh+0.192
isher+0.189
ities+0.188
Neg logit contributions
s-0.075
 they-0.073
 said-0.071
 -0.070
-0.070
Token answered
Token ID9373
Feature activation+3.43e-02

Pos logit contributions
 they+0.021
s+0.020
+0.016
 +0.016
+0.014
Neg logit contributions
ocol-0.122
asured-0.118
beh-0.117
isher-0.116
ac-0.113
Token.
Token ID13
Feature activation-5.86e-03

Pos logit contributions
s+0.027
 they+0.027
 +0.025
 said+0.025
+0.025
Neg logit contributions
ocol-0.102
 property-0.096
asured-0.092
beh-0.091
isher-0.091
Token "
Token ID366
Feature activation-8.31e-02

Pos logit contributions
ocol+0.016
 property+0.015
asured+0.014
uzz+0.014
beh+0.014
Neg logit contributions
s-0.005
 they-0.004
 said-0.004
 exclaimed-0.004
cakes-0.004
Token,
Token ID11
Feature activation-1.17e-02

Pos logit contributions
 exclaimed+0.032
s+0.032
 +0.030
cakes+0.030
+0.030
Neg logit contributions
ocol-0.070
 property-0.060
isher-0.060
ac-0.059
ant-0.059
Token we
Token ID356
Feature activation+4.32e-02

Pos logit contributions
ocol+0.025
 property+0.024
ac+0.022
beh+0.022
ities+0.022
Neg logit contributions
s-0.017
 -0.016
 exclaimed-0.016
-0.015
-0.014
Token can
Token ID460
Feature activation+1.44e-02

Pos logit contributions
s+0.080
 they+0.077
 exclaimed+0.076
cakes+0.075
 +0.072
Neg logit contributions
ocol-0.076
 property-0.073
ities-0.067
ac-0.065
asured-0.064
Token,
Token ID11
Feature activation-4.24e-03

Pos logit contributions
cakes+0.017
s+0.016
 they+0.015
 exclaimed+0.015
lic+0.014
Neg logit contributions
 property-0.033
ocol-0.032
asured-0.030
 appearance-0.030
esty-0.029
Token Luna
Token ID23694
Feature activation+5.45e-02

Pos logit contributions
ocol+0.009
 property+0.009
ities+0.008
asured+0.008
esty+0.008
Neg logit contributions
s-0.006
 -0.006
 exclaimed-0.006
 they-0.005
-0.005
Token,"
Token ID553
Feature activation+1.20e-01

Pos logit contributions
s+0.048
 they+0.045
 +0.045
 exclaimed+0.043
+0.042
Neg logit contributions
ocol-0.150
 property-0.143
asured-0.135
isher-0.134
beh-0.134
Token Mia
Token ID32189
Feature activation+4.77e-02

Pos logit contributions
s+0.167
 exclaimed+0.159
 +0.157
 they+0.156
cakes+0.155
Neg logit contributions
ocol-0.259
 property-0.231
ities-0.221
beh-0.221
esty-0.217
Token says
Token ID1139
Feature activation+7.61e-02

Pos logit contributions
 they+0.044
s+0.042
cakes+0.039
 +0.038
 exclaimed+0.037
Neg logit contributions
ocol-0.119
 property-0.111
asured-0.111
beh-0.107
ities-0.107
Token.
Token ID13
Feature activation-1.93e-02

Pos logit contributions
 exclaimed+0.083
s+0.080
cakes+0.080
 said+0.077
 +0.076
Neg logit contributions
ocol-0.174
ities-0.163
 property-0.160
beh-0.157
uzz-0.157
Token "
Token ID366
Feature activation-8.49e-02

Pos logit contributions
ocol+0.045
 property+0.044
ities+0.042
esty+0.042
asured+0.041
Neg logit contributions
 exclaimed-0.021
s-0.020
 said-0.020
 they-0.020
cakes-0.019
TokenWe
Token ID1135
Feature activation+3.04e-02

Pos logit contributions
ocol+0.221
 property+0.195
asured+0.189
isher+0.187
beh+0.187
Neg logit contributions
 they-0.110
s-0.109
-0.107
 -0.102
 said-0.100
TokenYes
Token ID5297
Feature activation+2.56e-02

Pos logit contributions
ocol+0.136
 property+0.124
asured+0.116
beh+0.116
isher+0.116
Neg logit contributions
s-0.103
 they-0.102
 said-0.098
 -0.098
-0.097
Token,
Token ID11
Feature activation-9.84e-03

Pos logit contributions
s+0.025
 they+0.023
+0.022
 exclaimed+0.022
 +0.022
Neg logit contributions
ocol-0.070
 property-0.064
isher-0.061
ities-0.060
asured-0.060
Token I
Token ID314
Feature activation-8.14e-03

Pos logit contributions
ocol+0.026
 property+0.024
beh+0.024
asured+0.023
isher+0.023
Neg logit contributions
 they-0.013
 said-0.012
-0.011
s-0.011
 -0.011
Token'm
Token ID1101
Feature activation-1.96e-02

Pos logit contributions
ocol+0.015
 property+0.015
asured+0.014
beh+0.014
ities+0.013
Neg logit contributions
 they-0.014
s-0.014
 -0.013
 exclaimed-0.013
cakes-0.013
Token okay
Token ID8788
Feature activation-2.78e-02

Pos logit contributions
ocol+0.026
 property+0.023
asured+0.021
beh+0.020
ities+0.020
Neg logit contributions
s-0.047
 they-0.046
 said-0.045
 -0.045
-0.045
Token,"
Token ID553
Feature activation+1.20e-01

Pos logit contributions
ocol+0.081
 property+0.076
asured+0.073
beh+0.073
ities+0.072
Neg logit contributions
s-0.022
 they-0.021
 -0.020
 said-0.019
 exclaimed-0.019
Token Lily
Token ID20037
Feature activation+3.59e-02

Pos logit contributions
s+0.174
cakes+0.160
 +0.156
't+0.153
ves+0.153
Neg logit contributions
ocol-0.232
beh-0.199
asured-0.196
isher-0.195
 property-0.193
Token said
Token ID531
Feature activation+7.12e-02

Pos logit contributions
 they+0.023
cakes+0.023
s+0.021
't+0.017
Happy+0.016
Neg logit contributions
asured-0.095
esty-0.088
beh-0.087
 property-0.086
ities-0.086
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
s+0.081
cakes+0.074
 exclaimed+0.073
 said+0.072
 they+0.071
Neg logit contributions
ocol-0.161
asured-0.155
beh-0.153
 property-0.150
ities-0.149
Token "
Token ID366
Feature activation-8.41e-02

Pos logit contributions
ocol+0.031
uzz+0.031
uff+0.030
asured+0.030
 property+0.030
Neg logit contributions
s-0.008
ves-0.008
cakes-0.008
 exclaimed-0.007
 said-0.007
TokenBut
Token ID1537
Feature activation-3.36e-02

Pos logit contributions
ocol+0.194
 property+0.176
isher+0.166
asured+0.165
beh+0.165
Neg logit contributions
s-0.124
 they-0.122
-0.120
 -0.117
 said-0.116
TokenWe
Token ID1135
Feature activation+3.16e-02

Pos logit contributions
ocol+0.136
 property+0.125
beh+0.118
ities+0.118
isher+0.118
Neg logit contributions
s-0.100
 -0.097
 they-0.096
 exclaimed-0.093
 said-0.093
Token are
Token ID389
Feature activation+5.35e-03

Pos logit contributions
s+0.042
 they+0.041
 said+0.041
 +0.040
+0.040
Neg logit contributions
ocol-0.078
 property-0.070
asured-0.068
isher-0.068
beh-0.067
Token sorry
Token ID7926
Feature activation+4.09e-02

Pos logit contributions
s+0.011
 they+0.010
 +0.010
 said+0.010
+0.010
Neg logit contributions
ocol-0.009
 property-0.008
beh-0.008
asured-0.008
ities-0.008
Token,
Token ID11
Feature activation-5.30e-03

Pos logit contributions
s+0.040
 +0.040
 exclaimed+0.038
 said+0.036
 they+0.036
Neg logit contributions
ocol-0.107
 property-0.101
ities-0.095
 appearance-0.093
asured-0.093
Token Mom
Token ID11254
Feature activation+5.28e-02

Pos logit contributions
ocol+0.013
 property+0.012
beh+0.011
asured+0.011
isher+0.011
Neg logit contributions
 they-0.007
s-0.007
 said-0.007
 -0.007
-0.007
Token,"
Token ID553
Feature activation+1.20e-01

Pos logit contributions
s+0.053
 +0.051
 they+0.051
cakes+0.051
 exclaimed+0.050
Neg logit contributions
 property-0.125
ocol-0.125
asured-0.116
 than-0.113
isher-0.113
Token they
Token ID484
Feature activation+7.74e-02

Pos logit contributions
s+0.170
 exclaimed+0.157
 +0.154
cakes+0.151
 said+0.148
Neg logit contributions
ocol-0.267
 property-0.229
ities-0.221
isher-0.221
asured-0.216
Token say
Token ID910
Feature activation+7.77e-02

Pos logit contributions
s+0.096
cakes+0.094
 they+0.090
ves+0.081
+0.076
Neg logit contributions
ocol-0.170
 property-0.168
ities-0.156
beh-0.153
 appearance-0.152
Token.
Token ID13
Feature activation-2.16e-02

Pos logit contributions
 exclaimed+0.091
s+0.086
 said+0.084
cakes+0.082
 +0.081
Neg logit contributions
ocol-0.175
ities-0.157
 property-0.156
beh-0.155
uzz-0.153
Token "
Token ID366
Feature activation-8.40e-02

Pos logit contributions
uzz+0.043
 than+0.042
esty+0.042
ocol+0.041
ities+0.041
Neg logit contributions
 exclaimed-0.024
ons-0.024
cakes-0.024
odic-0.020
s-0.020
TokenWe
Token ID1135
Feature activation+3.26e-02

Pos logit contributions
ocol+0.181
 property+0.176
ities+0.166
beh+0.164
asured+0.163
Neg logit contributions
s-0.114
 -0.113
 said-0.112
 exclaimed-0.110
 they-0.108
Token as
Token ID355
Feature activation-2.09e-02

Pos logit contributions
ocol+0.010
 property+0.009
asured+0.009
beh+0.009
isher+0.009
Neg logit contributions
s-0.003
 they-0.003
 said-0.002
 -0.002
-0.002
Token he
Token ID339
Feature activation+5.17e-03

Pos logit contributions
ocol+0.071
 property+0.067
asured+0.065
isher+0.065
ities+0.065
Neg logit contributions
 they-0.012
s-0.008
 said-0.008
-0.007
 -0.007
Token opened
Token ID4721
Feature activation-1.03e-02

Pos logit contributions
 said+0.005
s+0.005
 they+0.005
+0.004
 +0.004
Neg logit contributions
ocol-0.015
 property-0.014
isher-0.013
ities-0.013
beh-0.013
Token the
Token ID262
Feature activation+1.74e-02

Pos logit contributions
ocol+0.030
 property+0.029
beh+0.029
ities+0.028
asured+0.027
Neg logit contributions
 they-0.009
s-0.008
 -0.008
-0.008
 said-0.008
Token ch
Token ID442
Feature activation-7.13e-02

Pos logit contributions
s+0.016
 +0.015
 they+0.015
 said+0.015
+0.015
Neg logit contributions
ocol-0.049
beh-0.046
asured-0.045
ities-0.045
 property-0.044
Tokenocol
Token ID4668
Feature activation-2.96e-01

Pos logit contributions
 property+0.067
esty+0.054
ities+0.054
beh+0.052
asured+0.050
Neg logit contributions
s-0.192
 they-0.181
 said-0.178
 exclaimed-0.175
 -0.172
Tokenates
Token ID689
Feature activation-2.24e-02

Pos logit contributions
 property+0.378
ocol+0.357
 appearance+0.339
esty+0.324
beh+0.318
Neg logit contributions
s-0.677
-0.637
 said-0.624
 they-0.622
 -0.618
Token and
Token ID290
Feature activation-5.30e-03

Pos logit contributions
ocol+0.063
 property+0.059
beh+0.056
asured+0.056
isher+0.056
Neg logit contributions
s-0.022
 they-0.021
 said-0.020
 -0.019
-0.019
Token started
Token ID2067
Feature activation-1.85e-02

Pos logit contributions
ocol+0.014
 property+0.013
ities+0.013
asured+0.013
isher+0.013
Neg logit contributions
 they-0.006
 said-0.006
s-0.006
 -0.005
 exclaimed-0.005
Token eating
Token ID6600
Feature activation+4.70e-03

Pos logit contributions
ocol+0.052
 property+0.049
isher+0.047
ities+0.047
esty+0.047
Neg logit contributions
 they-0.019
s-0.018
 -0.017
 said-0.017
-0.016
Token them
Token ID606
Feature activation+9.45e-03

Pos logit contributions
 they+0.005
 +0.004
+0.004
s+0.004
 said+0.004
Neg logit contributions
beh-0.014
ocol-0.013
 property-0.013
asured-0.013
esty-0.013
Token box
Token ID3091
Feature activation-1.71e-03

Pos logit contributions
s+0.024
 they+0.023
 +0.022
+0.022
 said+0.022
Neg logit contributions
ocol-0.059
beh-0.055
 property-0.055
asured-0.054
isher-0.053
Token and
Token ID290
Feature activation-4.92e-04

Pos logit contributions
ocol+0.004
 property+0.004
asured+0.004
isher+0.003
 than+0.003
Neg logit contributions
-0.002
 exclaimed-0.002
 -0.002
cakes-0.002
s-0.002
Token share
Token ID2648
Feature activation+2.25e-02

Pos logit contributions
ocol+0.001
 property+0.001
asured+0.001
beh+0.001
isher+0.001
Neg logit contributions
s-0.001
 they-0.001
 -0.001
 said-0.001
-0.001
Token the
Token ID262
Feature activation+2.02e-02

Pos logit contributions
s+0.022
 exclaimed+0.021
 said+0.020
 they+0.020
 +0.020
Neg logit contributions
ocol-0.059
 property-0.054
asured-0.052
ities-0.052
isher-0.051
Token ch
Token ID442
Feature activation-7.75e-02

Pos logit contributions
 +0.025
 said+0.025
+0.024
 they+0.024
s+0.024
Neg logit contributions
ocol-0.051
beh-0.049
asured-0.047
ant-0.046
ities-0.045
Tokenocol
Token ID4668
Feature activation-2.93e-01

Pos logit contributions
beh+0.051
 property+0.048
tons+0.039
asured+0.039
esty+0.038
Neg logit contributions
s-0.223
 they-0.210
-0.208
 said-0.201
 -0.199
Tokenates
Token ID689
Feature activation-1.22e-02

Pos logit contributions
ocol+0.365
 property+0.348
 appearance+0.310
beh+0.309
asured+0.306
Neg logit contributions
s-0.703
-0.663
 they-0.658
 said-0.643
 -0.641
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
ocol+0.030
 property+0.028
asured+0.028
isher+0.027
ities+0.027
Neg logit contributions
 -0.012
 said-0.012
 exclaimed-0.012
-0.011
 they-0.011
Token They
Token ID1119
Feature activation+2.34e-02

Pos logit contributions
ocol+0.050
 property+0.047
esty+0.045
ities+0.044
asured+0.044
Neg logit contributions
s-0.028
 exclaimed-0.027
 said-0.026
cakes-0.024
 they-0.024
Token are
Token ID389
Feature activation-5.64e-04

Pos logit contributions
 they+0.028
s+0.027
 said+0.027
 +0.026
+0.026
Neg logit contributions
ocol-0.061
 property-0.057
beh-0.055
asured-0.054
isher-0.054
Token happy
Token ID3772
Feature activation+3.80e-02

Pos logit contributions
ocol+0.002
 property+0.001
isher+0.001
beh+0.001
ities+0.001
Neg logit contributions
 they-0.001
s-0.001
 said-0.001
 -0.001
-0.001
Token and
Token ID290
Feature activation+3.46e-03

Pos logit contributions
 they+0.002
s+0.002
 +0.002
+0.002
+0.002
Neg logit contributions
ocol-0.005
 property-0.005
asured-0.005
isher-0.005
ities-0.005
Token started
Token ID2067
Feature activation-2.01e-02

Pos logit contributions
s+0.003
 they+0.003
 +0.003
+0.003
 said+0.003
Neg logit contributions
ocol-0.010
 property-0.009
asured-0.009
isher-0.009
beh-0.008
Token eating
Token ID6600
Feature activation+6.14e-03

Pos logit contributions
ocol+0.060
 property+0.056
asured+0.053
beh+0.053
isher+0.053
Neg logit contributions
s-0.016
 they-0.016
 -0.015
 said-0.015
-0.014
Token the
Token ID262
Feature activation+2.26e-02

Pos logit contributions
 they+0.005
s+0.004
 +0.004
+0.004
 said+0.004
Neg logit contributions
ocol-0.019
beh-0.018
 property-0.018
asured-0.018
esty-0.017
Token ch
Token ID442
Feature activation-6.89e-02

Pos logit contributions
s+0.031
 they+0.030
 said+0.030
 +0.029
+0.029
Neg logit contributions
ocol-0.054
 property-0.050
asured-0.049
beh-0.048
ities-0.047
Tokenocol
Token ID4668
Feature activation-2.91e-01

Pos logit contributions
 property+0.034
beh+0.028
asured+0.024
esty+0.021
 appearance+0.020
Neg logit contributions
s-0.215
 they-0.203
cakes-0.198
 said-0.195
-0.195
Tokenates
Token ID689
Feature activation-1.70e-02

Pos logit contributions
ocol+0.349
 property+0.343
 appearance+0.300
beh+0.293
asured+0.292
Neg logit contributions
s-0.707
-0.668
 they-0.663
 -0.655
 said-0.654
Token.
Token ID13
Feature activation-1.73e-02

Pos logit contributions
ocol+0.049
 property+0.046
beh+0.044
asured+0.044
isher+0.044
Neg logit contributions
s-0.015
 they-0.015
 said-0.014
 -0.013
-0.013
Token The
Token ID383
Feature activation+2.23e-02

Pos logit contributions
ocol+0.045
 property+0.043
beh+0.041
isher+0.040
asured+0.040
Neg logit contributions
 -0.021
-0.021
 they-0.021
s-0.021
 said-0.019
Token shop
Token ID6128
Feature activation-5.14e-02

Pos logit contributions
 said+0.028
s+0.027
 +0.027
 they+0.026
+0.026
Neg logit contributions
ocol-0.056
beh-0.053
 property-0.053
asured-0.052
ities-0.049
Tokenkeeper
Token ID13884
Feature activation+2.51e-02

Pos logit contributions
ocol+0.134
 property+0.125
asured+0.121
isher+0.121
beh+0.119
Neg logit contributions
s-0.051
 they-0.050
 -0.049
-0.049
 said-0.046
Token noticed
Token ID6810
Feature activation+1.03e-02

Pos logit contributions
 +0.028
cakes+0.027
 they+0.027
+0.025
s+0.025
Neg logit contributions
 property-0.079
ocol-0.078
ities-0.077
asured-0.075
 appearance-0.074
Token something
Token ID1223
Feature activation+1.39e-02

Pos logit contributions
s+0.010
 they+0.010
 said+0.009
+0.009
 +0.009
Neg logit contributions
ocol-0.029
 property-0.027
asured-0.026
beh-0.026
isher-0.026
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
 they+0.019
s+0.019
 said+0.019
+0.018
 +0.017
Neg logit contributions
ocol-0.034
 property-0.031
asured-0.030
beh-0.030
isher-0.030
Token The
Token ID383
Feature activation+2.10e-02

Pos logit contributions
ocol+0.053
 property+0.051
asured+0.050
isher+0.050
beh+0.049
Neg logit contributions
-0.022
 -0.021
 they-0.021
s-0.020
 said-0.019
Token ch
Token ID442
Feature activation-7.59e-02

Pos logit contributions
s+0.029
 they+0.029
 said+0.028
 +0.028
+0.027
Neg logit contributions
ocol-0.050
 property-0.046
asured-0.044
beh-0.044
isher-0.043
Tokenocol
Token ID4668
Feature activation-2.90e-01

Pos logit contributions
 property+0.082
beh+0.067
asured+0.066
esty+0.065
 appearance+0.063
Neg logit contributions
s-0.196
 they-0.184
-0.182
 said-0.180
 -0.175
Tokenates
Token ID689
Feature activation-1.81e-02

Pos logit contributions
 property+0.364
ocol+0.364
beh+0.316
 appearance+0.311
asured+0.300
Neg logit contributions
s-0.691
-0.656
 said-0.645
 they-0.642
 -0.636
Token were
Token ID547
Feature activation-1.23e-02

Pos logit contributions
ocol+0.056
 property+0.053
beh+0.052
asured+0.051
isher+0.051
Neg logit contributions
s-0.013
 they-0.013
 said-0.012
 -0.010
-0.010
Token bitter
Token ID12922
Feature activation-1.57e-02

Pos logit contributions
ocol+0.031
 property+0.031
beh+0.031
isher+0.030
esty+0.029
Neg logit contributions
 they-0.016
 said-0.015
s-0.013
-0.013
 -0.013
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
ocol+0.049
 property+0.046
beh+0.045
asured+0.045
isher+0.045
Neg logit contributions
s-0.011
 they-0.011
 said-0.010
-0.009
 -0.008
Token They
Token ID1119
Feature activation+2.30e-02

Pos logit contributions
ocol+0.054
 property+0.051
asured+0.049
beh+0.049
isher+0.049
Neg logit contributions
s-0.024
 they-0.024
-0.023
 -0.023
 said-0.022
Tokenbled
Token ID9342
Feature activation+3.16e-02

Pos logit contributions
ocol+0.055
 property+0.046
ities+0.044
asured+0.044
isher+0.044
Neg logit contributions
s-0.080
 they-0.079
 said-0.079
 -0.078
-0.077
Token up
Token ID510
Feature activation+1.30e-02

Pos logit contributions
s+0.034
 they+0.033
 +0.030
+0.030
 said+0.029
Neg logit contributions
ocol-0.088
 property-0.085
ities-0.080
beh-0.080
asured-0.080
Token all
Token ID477
Feature activation-1.20e-02

Pos logit contributions
 they+0.010
s+0.009
 +0.008
+0.008
 said+0.008
Neg logit contributions
ocol-0.040
 property-0.038
beh-0.037
isher-0.037
asured-0.036
Token the
Token ID262
Feature activation+1.61e-02

Pos logit contributions
ocol+0.032
 property+0.030
isher+0.029
asured+0.029
beh+0.028
Neg logit contributions
s-0.012
 they-0.011
 said-0.011
 -0.011
 exclaimed-0.011
Token ch
Token ID442
Feature activation-7.33e-02

Pos logit contributions
s+0.026
 said+0.025
 +0.025
 they+0.024
+0.024
Neg logit contributions
ocol-0.035
 property-0.032
beh-0.032
asured-0.031
ities-0.031
Tokenocol
Token ID4668
Feature activation-2.88e-01

Pos logit contributions
 property+0.046
esty+0.036
 appearance+0.034
beh+0.031
tons+0.030
Neg logit contributions
s-0.214
 they-0.205
ons-0.201
 said-0.194
-0.193
Tokenates
Token ID689
Feature activation-1.89e-02

Pos logit contributions
ocol+0.359
 property+0.350
 appearance+0.309
asured+0.299
beh+0.297
Neg logit contributions
s-0.688
 they-0.650
-0.648
 said-0.639
 -0.633
Token,
Token ID11
Feature activation+5.78e-04

Pos logit contributions
ocol+0.051
 property+0.047
asured+0.046
isher+0.045
beh+0.045
Neg logit contributions
s-0.018
 they-0.018
 -0.017
-0.017
 said-0.017
Token and
Token ID290
Feature activation+9.47e-03

Pos logit contributions
 they+0.001
 said+0.001
 +0.001
s+0.001
+0.001
Neg logit contributions
ocol-0.002
 property-0.001
beh-0.001
isher-0.001
ities-0.001
Token she
Token ID673
Feature activation+2.39e-02

Pos logit contributions
 they+0.009
s+0.009
 said+0.009
 +0.008
+0.008
Neg logit contributions
ocol-0.027
 property-0.025
isher-0.024
ities-0.024
beh-0.024
Token was
Token ID373
Feature activation-1.02e-02

Pos logit contributions
s+0.022
 they+0.022
 +0.020
+0.020
 said+0.019
Neg logit contributions
ocol-0.064
 property-0.060
asured-0.060
isher-0.058
beh-0.058
Token park
Token ID3952
Feature activation-4.16e-03

Pos logit contributions
 +0.022
+0.021
s+0.021
 said+0.020
 they+0.020
Neg logit contributions
ocol-0.062
asured-0.057
ities-0.056
beh-0.056
 property-0.055
Token.
Token ID13
Feature activation-7.65e-03

Pos logit contributions
ocol+0.013
 property+0.012
asured+0.011
isher+0.011
beh+0.011
Neg logit contributions
 they-0.003
s-0.003
 said-0.003
 -0.003
-0.003
Token When
Token ID1649
Feature activation-3.57e-02

Pos logit contributions
ocol+0.019
 property+0.017
asured+0.017
beh+0.017
isher+0.016
Neg logit contributions
s-0.010
 they-0.009
 said-0.009
 -0.009
-0.009
Token the
Token ID262
Feature activation+3.90e-02

Pos logit contributions
ocol+0.110
 property+0.104
beh+0.099
asured+0.099
ities+0.099
Neg logit contributions
s-0.019
 exclaimed-0.015
 -0.015
 said-0.015
-0.015
Token ch
Token ID442
Feature activation-7.66e-02

Pos logit contributions
s+0.061
 they+0.060
 said+0.058
 +0.058
+0.057
Neg logit contributions
ocol-0.086
 property-0.078
asured-0.074
beh-0.074
isher-0.073
Tokenocol
Token ID4668
Feature activation-2.88e-01

Pos logit contributions
 property+0.056
esty+0.044
ities+0.041
 appearance+0.038
beh+0.036
Neg logit contributions
s-0.226
-0.214
 said-0.214
 they-0.213
 -0.211
Tokenates
Token ID689
Feature activation-6.34e-03

Pos logit contributions
ocol+0.369
 property+0.368
 appearance+0.322
beh+0.313
asured+0.302
Neg logit contributions
s-0.679
-0.645
 said-0.640
 they-0.634
 -0.628
Token were
Token ID547
Feature activation+3.15e-03

Pos logit contributions
ocol+0.019
 property+0.018
asured+0.017
beh+0.017
isher+0.017
Neg logit contributions
s-0.005
 they-0.005
 said-0.004
 -0.004
-0.004
Token all
Token ID477
Feature activation-1.29e-03

Pos logit contributions
s+0.004
 they+0.004
 +0.003
 said+0.003
+0.003
Neg logit contributions
ocol-0.008
 property-0.008
asured-0.007
beh-0.007
isher-0.007
Token gone
Token ID3750
Feature activation+1.66e-03

Pos logit contributions
ocol+0.003
 property+0.003
asured+0.003
beh+0.003
isher+0.003
Neg logit contributions
s-0.002
 they-0.002
-0.001
 -0.001
 said-0.001
Token,
Token ID11
Feature activation+1.52e-02

Pos logit contributions
 they+0.001
s+0.001
 said+0.001
+0.001
 +0.001
Neg logit contributions
ocol-0.005
 property-0.005
beh-0.005
asured-0.005
isher-0.005
Token a
Token ID257
Feature activation+5.11e-02

Pos logit contributions
s+0.043
 they+0.041
 said+0.039
 +0.039
+0.039
Neg logit contributions
ocol-0.111
 property-0.104
asured-0.099
beh-0.099
isher-0.098
Token large
Token ID1588
Feature activation-1.36e-02

Pos logit contributions
 they+0.056
s+0.055
 +0.054
 said+0.052
+0.051
Neg logit contributions
ocol-0.139
beh-0.132
asured-0.128
ities-0.127
 property-0.127
Token selection
Token ID6356
Feature activation-5.19e-02

Pos logit contributions
beh+0.036
ocol+0.033
asured+0.033
esty+0.032
ities+0.032
Neg logit contributions
 they-0.016
 -0.015
 said-0.015
s-0.015
-0.014
Token of
Token ID286
Feature activation-4.19e-02

Pos logit contributions
ocol+0.168
 property+0.161
beh+0.155
asured+0.154
ities+0.153
Neg logit contributions
s-0.031
 they-0.030
 said-0.025
-0.023
 -0.022
Token ch
Token ID442
Feature activation-6.67e-02

Pos logit contributions
beh+0.097
ities+0.095
 property+0.095
esty+0.094
asured+0.094
Neg logit contributions
 they-0.059
 said-0.058
 -0.057
s-0.057
-0.054
Tokenocol
Token ID4668
Feature activation-2.88e-01

Pos logit contributions
 property+0.051
esty+0.037
asured+0.036
 appearance+0.034
tons+0.033
Neg logit contributions
s-0.199
 they-0.181
-0.174
ons-0.174
 of-0.173
Tokenates
Token ID689
Feature activation-1.95e-02

Pos logit contributions
 property+0.378
ocol+0.352
 appearance+0.346
esty+0.321
asured+0.320
Neg logit contributions
s-0.649
-0.611
 said-0.588
 -0.588
 they-0.587
Token.
Token ID13
Feature activation-2.39e-02

Pos logit contributions
ocol+0.061
 property+0.057
beh+0.056
asured+0.056
isher+0.055
Neg logit contributions
s-0.015
 they-0.015
 said-0.012
-0.011
 -0.010
Token Ell
Token ID7122
Feature activation-4.56e-02

Pos logit contributions
ocol+0.063
 property+0.059
beh+0.056
isher+0.056
asured+0.056
Neg logit contributions
 they-0.028
 -0.028
-0.027
s-0.027
 said-0.026
Tokena
Token ID64
Feature activation+1.25e-02

Pos logit contributions
ocol+0.076
 property+0.068
asured+0.065
beh+0.064
isher+0.064
Neg logit contributions
 they-0.092
s-0.090
 -0.089
-0.088
 said-0.087
Token picked
Token ID6497
Feature activation-1.82e-03

Pos logit contributions
s+0.010
 they+0.009
 said+0.009
 +0.009
+0.009
Neg logit contributions
ocol-0.037
 property-0.035
beh-0.033
asured-0.033
isher-0.033
Token started
Token ID2067
Feature activation-1.94e-02

Pos logit contributions
s+0.007
+0.006
 +0.006
 they+0.006
 exclaimed+0.006
Neg logit contributions
ocol-0.015
 property-0.014
isher-0.014
asured-0.013
beh-0.013
Token eating
Token ID6600
Feature activation+3.22e-03

Pos logit contributions
ocol+0.050
 property+0.046
asured+0.044
beh+0.044
isher+0.044
Neg logit contributions
s-0.021
 they-0.020
 said-0.020
 -0.020
-0.020
Token the
Token ID262
Feature activation+2.27e-02

Pos logit contributions
 they+0.003
s+0.003
 +0.003
+0.003
 said+0.003
Neg logit contributions
ocol-0.009
 property-0.009
beh-0.009
asured-0.009
isher-0.008
Token delicious
Token ID12625
Feature activation-3.59e-02

Pos logit contributions
s+0.034
 they+0.033
 said+0.032
 +0.032
+0.032
Neg logit contributions
ocol-0.052
 property-0.047
asured-0.045
beh-0.045
isher-0.045
Token ch
Token ID442
Feature activation-7.20e-02

Pos logit contributions
ocol+0.082
 property+0.079
beh+0.078
asured+0.077
esty+0.075
Neg logit contributions
s-0.051
 they-0.049
 said-0.048
 -0.048
-0.047
Tokenocol
Token ID4668
Feature activation-2.88e-01

Pos logit contributions
 property+0.031
asured+0.025
esty+0.024
beh+0.022
tons+0.021
Neg logit contributions
s-0.222
 they-0.209
ons-0.207
-0.206
cakes-0.206
Tokenates
Token ID689
Feature activation-1.63e-02

Pos logit contributions
ocol+0.352
 property+0.316
asured+0.280
beh+0.276
 appearance+0.271
Neg logit contributions
s-0.708
 they-0.678
-0.677
 said-0.667
 -0.667
Token.
Token ID13
Feature activation-7.97e-03

Pos logit contributions
ocol+0.046
 property+0.043
asured+0.041
beh+0.041
isher+0.041
Neg logit contributions
s-0.015
 they-0.014
 said-0.013
 -0.013
-0.013
Token She
Token ID1375
Feature activation-3.19e-02

Pos logit contributions
ocol+0.020
 property+0.019
uzz+0.019
esty+0.019
tons+0.019
Neg logit contributions
s-0.006
 said-0.005
 exclaimed-0.005
 they-0.005
cakes-0.004
Token was
Token ID373
Feature activation-1.38e-02

Pos logit contributions
ocol+0.089
 property+0.083
asured+0.082
isher+0.080
beh+0.079
Neg logit contributions
s-0.027
 they-0.026
 -0.024
-0.024
 said-0.022
Token so
Token ID523
Feature activation+1.87e-02

Pos logit contributions
ocol+0.039
 property+0.036
isher+0.035
ities+0.034
beh+0.034
Neg logit contributions
 they-0.013
 -0.013
 said-0.013
s-0.013
-0.013
Token a
Token ID257
Feature activation+6.96e-02

Pos logit contributions
 they+0.018
s+0.018
 said+0.017
 +0.017
+0.016
Neg logit contributions
ocol-0.058
 property-0.053
beh-0.052
asured-0.051
ities-0.051
Token big
Token ID1263
Feature activation+2.88e-02

Pos logit contributions
s+0.077
 they+0.075
 said+0.071
 +0.071
+0.070
Neg logit contributions
ocol-0.183
 property-0.170
asured-0.163
beh-0.162
isher-0.161
Token box
Token ID3091
Feature activation+1.38e-02

Pos logit contributions
 they+0.032
 said+0.031
s+0.030
 +0.029
+0.028
Neg logit contributions
beh-0.075
ocol-0.075
asured-0.071
 property-0.070
ities-0.070
Token of
Token ID286
Feature activation-3.50e-02

Pos logit contributions
s+0.016
 they+0.015
 +0.015
+0.015
 said+0.015
Neg logit contributions
ocol-0.035
 property-0.033
asured-0.031
isher-0.031
beh-0.031
Token ch
Token ID442
Feature activation-5.44e-02

Pos logit contributions
ocol+0.078
beh+0.077
asured+0.076
esty+0.076
 property+0.075
Neg logit contributions
 they-0.051
 said-0.050
s-0.047
-0.046
 -0.045
Tokenocol
Token ID4668
Feature activation-2.87e-01

Pos logit contributions
 property+0.053
tons+0.046
asured+0.046
esty+0.042
 appearance+0.042
Neg logit contributions
s-0.146
 they-0.131
ons-0.128
 said-0.124
-0.124
Tokenates
Token ID689
Feature activation-1.54e-02

Pos logit contributions
ocol+0.338
 property+0.316
 appearance+0.276
asured+0.274
beh+0.272
Neg logit contributions
s-0.698
-0.663
 they-0.660
 said-0.655
 -0.651
Token.
Token ID13
Feature activation-2.40e-02

Pos logit contributions
ocol+0.046
 property+0.043
beh+0.042
asured+0.042
isher+0.041
Neg logit contributions
s-0.013
 they-0.012
 said-0.011
-0.010
 -0.010
Token It
Token ID632
Feature activation+8.05e-03

Pos logit contributions
ocol+0.064
 property+0.061
beh+0.059
asured+0.058
isher+0.057
Neg logit contributions
 -0.028
-0.028
 they-0.027
s-0.026
 said-0.025
Token was
Token ID373
Feature activation-1.58e-02

Pos logit contributions
s+0.005
 they+0.005
 +0.004
+0.004
 said+0.004
Neg logit contributions
ocol-0.024
 property-0.023
asured-0.022
beh-0.022
isher-0.022
Token so
Token ID523
Feature activation+1.87e-02

Pos logit contributions
ocol+0.048
 property+0.048
beh+0.048
esty+0.047
isher+0.046
Neg logit contributions
 they-0.013
 said-0.012
-0.011
 -0.011
 exclaimed-0.008
Token looking
Token ID2045
Feature activation-2.02e-02

Pos logit contributions
ocol+0.114
 property+0.110
isher+0.108
esty+0.107
asured+0.106
Neg logit contributions
 they-0.041
-0.033
 -0.033
 said-0.032
s-0.030
Token at
Token ID379
Feature activation-4.49e-02

Pos logit contributions
ocol+0.047
 property+0.043
ities+0.042
beh+0.041
asured+0.041
Neg logit contributions
s-0.033
 they-0.032
 said-0.029
-0.029
 -0.028
Token all
Token ID477
Feature activation-1.26e-02

Pos logit contributions
ocol+0.137
beh+0.134
 property+0.131
ities+0.127
esty+0.125
Neg logit contributions
 they-0.035
 -0.035
-0.033
s-0.033
 said-0.031
Token the
Token ID262
Feature activation+2.26e-02

Pos logit contributions
ocol+0.040
 property+0.038
ities+0.038
beh+0.037
esty+0.037
Neg logit contributions
 they-0.009
s-0.008
 said-0.008
 -0.007
-0.007
Token ch
Token ID442
Feature activation-7.31e-02

Pos logit contributions
s+0.032
 said+0.031
 +0.031
+0.031
 they+0.029
Neg logit contributions
ocol-0.054
beh-0.051
 property-0.050
uzz-0.050
esty-0.049
Tokenocol
Token ID4668
Feature activation-2.86e-01

Pos logit contributions
 property+0.061
esty+0.053
 appearance+0.043
beh+0.037
asured+0.036
Neg logit contributions
s-0.210
ons-0.196
-0.192
cakes-0.191
 of-0.189
Tokenates
Token ID689
Feature activation-1.64e-02

Pos logit contributions
 property+0.394
ocol+0.382
 appearance+0.364
esty+0.333
beh+0.333
Neg logit contributions
s-0.629
-0.602
 -0.578
 said-0.574
 they-0.569
Token,
Token ID11
Feature activation-1.59e-03

Pos logit contributions
ocol+0.052
 property+0.049
beh+0.048
asured+0.048
ities+0.047
Neg logit contributions
s-0.012
 they-0.012
 said-0.010
 -0.009
-0.009
Token Lex
Token ID17210
Feature activation-5.82e-02

Pos logit contributions
ocol+0.004
 property+0.003
isher+0.003
asured+0.003
beh+0.003
Neg logit contributions
 they-0.003
s-0.002
-0.002
 -0.002
 said-0.002
Tokeni
Token ID72
Feature activation-3.14e-02

Pos logit contributions
ocol+0.081
 property+0.073
beh+0.068
ities+0.064
asured+0.063
Neg logit contributions
s-0.141
 said-0.136
 they-0.134
-0.133
 -0.133
Token chose
Token ID7690
Feature activation-2.05e-02

Pos logit contributions
ocol+0.094
 property+0.087
beh+0.084
isher+0.084
asured+0.084
Neg logit contributions
s-0.026
 they-0.025
 said-0.024
 -0.022
-0.022
Token saw
Token ID2497
Feature activation+1.37e-02

Pos logit contributions
 they+0.017
s+0.017
 +0.016
cakes+0.015
+0.015
Neg logit contributions
ocol-0.046
asured-0.043
 property-0.042
isher-0.041
ities-0.041
Token all
Token ID477
Feature activation-9.86e-03

Pos logit contributions
s+0.012
 they+0.011
 said+0.011
 +0.011
+0.011
Neg logit contributions
ocol-0.040
 property-0.037
asured-0.036
beh-0.035
isher-0.035
Token the
Token ID262
Feature activation+2.53e-02

Pos logit contributions
ocol+0.027
 property+0.025
asured+0.024
beh+0.024
isher+0.024
Neg logit contributions
s-0.010
 they-0.009
 -0.009
 said-0.009
-0.008
Token tasty
Token ID25103
Feature activation-4.82e-02

Pos logit contributions
 said+0.040
s+0.038
 they+0.038
 +0.038
+0.038
Neg logit contributions
ocol-0.055
beh-0.051
asured-0.049
 property-0.049
isher-0.049
Token ch
Token ID442
Feature activation-7.28e-02

Pos logit contributions
beh+0.103
ocol+0.103
asured+0.100
 property+0.098
esty+0.098
Neg logit contributions
s-0.073
 they-0.072
 said-0.072
-0.067
 -0.067
Tokenocol
Token ID4668
Feature activation-2.86e-01

Pos logit contributions
 property+0.036
esty+0.029
asured+0.028
beh+0.027
 appearance+0.027
Neg logit contributions
s-0.220
 they-0.216
ons-0.203
-0.202
 said-0.202
Tokenates
Token ID689
Feature activation-1.37e-02

Pos logit contributions
ocol+0.347
 property+0.334
 appearance+0.291
beh+0.287
asured+0.286
Neg logit contributions
s-0.688
 they-0.656
-0.654
 said-0.645
 -0.641
Token inside
Token ID2641
Feature activation+2.08e-02

Pos logit contributions
ocol+0.038
 property+0.035
asured+0.034
beh+0.033
isher+0.033
Neg logit contributions
s-0.013
 they-0.013
 said-0.012
 -0.012
-0.012
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
s+0.018
 they+0.017
 +0.016
 said+0.016
+0.016
Neg logit contributions
ocol-0.060
 property-0.056
asured-0.054
beh-0.053
isher-0.053
Token She
Token ID1375
Feature activation-2.90e-02

Pos logit contributions
ocol+0.029
 property+0.027
asured+0.025
beh+0.025
esty+0.025
Neg logit contributions
s-0.014
 they-0.014
 said-0.013
 exclaimed-0.013
cakes-0.012
Token couldn
Token ID3521
Feature activation-1.47e-02

Pos logit contributions
ocol+0.074
asured+0.070
 property+0.069
beh+0.066
isher+0.066
Neg logit contributions
s-0.030
 they-0.029
 -0.027
-0.026
cakes-0.025
Token and
Token ID290
Feature activation+1.34e-02

Pos logit contributions
s+0.060
 they+0.055
 +0.053
 said+0.053
+0.052
Neg logit contributions
ocol-0.171
 property-0.162
asured-0.155
ities-0.153
beh-0.153
Token they
Token ID484
Feature activation+7.65e-02

Pos logit contributions
s+0.007
 they+0.006
 +0.006
+0.006
 said+0.005
Neg logit contributions
ocol-0.042
 property-0.040
beh-0.039
asured-0.039
isher-0.039
Token shared
Token ID4888
Feature activation+3.46e-02

Pos logit contributions
s+0.087
 they+0.085
 said+0.081
 +0.081
+0.080
Neg logit contributions
ocol-0.199
 property-0.185
asured-0.176
beh-0.176
isher-0.175
Token the
Token ID262
Feature activation+2.17e-02

Pos logit contributions
s+0.029
 they+0.029
 said+0.027
 +0.026
+0.026
Neg logit contributions
ocol-0.100
 property-0.093
asured-0.090
beh-0.089
isher-0.089
Token ch
Token ID442
Feature activation-7.52e-02

Pos logit contributions
s+0.037
 +0.036
+0.036
 they+0.036
 said+0.036
Neg logit contributions
ocol-0.045
 property-0.041
beh-0.040
ities-0.040
asured-0.039
Tokenocol
Token ID4668
Feature activation-2.85e-01

Pos logit contributions
 property+0.056
esty+0.042
beh+0.042
 appearance+0.042
ities+0.036
Neg logit contributions
s-0.219
cakes-0.209
-0.208
 -0.205
 they-0.201
Tokenates
Token ID689
Feature activation-9.89e-03

Pos logit contributions
 property+0.384
ocol+0.371
 appearance+0.357
beh+0.317
asured+0.315
Neg logit contributions
s-0.659
-0.619
 they-0.604
 said-0.600
cakes-0.599
Token with
Token ID351
Feature activation-2.28e-02

Pos logit contributions
ocol+0.025
 property+0.024
asured+0.023
isher+0.023
beh+0.023
Neg logit contributions
s-0.010
 -0.010
 they-0.010
 said-0.010
-0.010
Token everyone
Token ID2506
Feature activation+1.61e-03

Pos logit contributions
ocol+0.055
 property+0.051
asured+0.048
beh+0.048
isher+0.048
Neg logit contributions
s-0.029
 they-0.027
 -0.027
 said-0.027
-0.026
Token in
Token ID287
Feature activation-3.47e-02

Pos logit contributions
s+0.001
 they+0.001
 said+0.001
 +0.001
+0.001
Neg logit contributions
ocol-0.005
 property-0.004
asured-0.004
beh-0.004
isher-0.004
Token the
Token ID262
Feature activation+2.14e-02

Pos logit contributions
ocol+0.105
 property+0.100
beh+0.097
asured+0.095
isher+0.095
Neg logit contributions
 they-0.027
s-0.025
-0.025
 -0.025
 said-0.024
Token a
Token ID257
Feature activation+6.36e-02

Pos logit contributions
ocol+0.049
 property+0.047
beh+0.045
asured+0.045
isher+0.045
Neg logit contributions
 they-0.014
 -0.014
 said-0.014
-0.013
s-0.013
Token big
Token ID1263
Feature activation+2.58e-02

Pos logit contributions
s+0.080
 they+0.078
 said+0.074
+0.073
 +0.073
Neg logit contributions
ocol-0.155
 property-0.144
asured-0.134
isher-0.134
beh-0.133
Token box
Token ID3091
Feature activation+1.27e-02

Pos logit contributions
 they+0.023
 said+0.023
 +0.023
s+0.022
+0.021
Neg logit contributions
ocol-0.072
beh-0.071
 property-0.069
asured-0.068
ities-0.067
Token of
Token ID286
Feature activation-4.06e-02

Pos logit contributions
s+0.011
 they+0.011
 +0.011
+0.010
 said+0.010
Neg logit contributions
ocol-0.035
 property-0.033
asured-0.032
isher-0.031
beh-0.031
Token ch
Token ID442
Feature activation-5.97e-02

Pos logit contributions
beh+0.093
ocol+0.092
esty+0.092
 property+0.091
asured+0.090
Neg logit contributions
 said-0.057
 they-0.055
 -0.052
-0.052
s-0.052
Tokenocol
Token ID4668
Feature activation-2.85e-01

Pos logit contributions
 property+0.054
esty+0.046
tons+0.042
asured+0.040
Deal+0.038
Neg logit contributions
s-0.158
 of-0.142
 they-0.142
 said-0.141
 exclaimed-0.138
Tokenates
Token ID689
Feature activation-1.35e-02

Pos logit contributions
ocol+0.347
 property+0.333
 appearance+0.294
asured+0.287
beh+0.284
Neg logit contributions
s-0.670
-0.636
 said-0.632
 they-0.624
 -0.622
Token!
Token ID0
Feature activation-4.08e-03

Pos logit contributions
ocol+0.040
 property+0.037
beh+0.036
asured+0.035
isher+0.035
Neg logit contributions
s-0.012
 they-0.012
 said-0.011
-0.010
 -0.010
Token<|endoftext|>
Token ID50256
Feature activation-2.21e-02

Pos logit contributions
ocol+0.013
 property+0.012
beh+0.011
asured+0.011
uzz+0.011
Neg logit contributions
s-0.002
 they-0.002
 said-0.002
 exclaimed-0.002
cakes-0.001
TokenOnce
Token ID7454
Feature activation-5.29e-02

Pos logit contributions
ocol+0.054
 property+0.048
ities+0.047
uzz+0.046
isher+0.046
Neg logit contributions
 they-0.024
 exclaimed-0.023
s-0.022
 said-0.022
cakes-0.021
Token upon
Token ID2402
Feature activation-9.16e-02

Pos logit contributions
ocol+0.081
uzz+0.070
uff+0.066
ac+0.066
 property+0.065
Neg logit contributions
cakes-0.103
 exclaimed-0.100
 said-0.092
-0.091
 timer-0.091
Token a
Token ID257
Feature activation+6.44e-02

Pos logit contributions
s+0.020
 they+0.019
 +0.018
 said+0.018
+0.018
Neg logit contributions
ocol-0.026
 property-0.023
asured-0.022
beh-0.022
isher-0.022
Token big
Token ID1263
Feature activation+2.34e-02

Pos logit contributions
s+0.075
 they+0.073
 +0.070
+0.069
 exclaimed+0.068
Neg logit contributions
ocol-0.157
 property-0.151
isher-0.140
asured-0.139
beh-0.136
Token box
Token ID3091
Feature activation+7.94e-03

Pos logit contributions
s+0.028
 they+0.028
 said+0.028
 +0.027
+0.026
Neg logit contributions
ocol-0.059
 property-0.054
beh-0.054
asured-0.053
ities-0.053
Token of
Token ID286
Feature activation-3.82e-02

Pos logit contributions
 +0.009
s+0.009
 exclaimed+0.008
 they+0.008
+0.008
Neg logit contributions
ocol-0.018
asured-0.018
 property-0.018
isher-0.017
beh-0.017
Token ch
Token ID442
Feature activation-6.22e-02

Pos logit contributions
ocol+0.076
beh+0.073
 property+0.073
ities+0.072
esty+0.072
Neg logit contributions
 said-0.065
 they-0.063
-0.061
s-0.061
 -0.061
Tokenocol
Token ID4668
Feature activation-2.84e-01

Pos logit contributions
 property+0.065
tons+0.060
esty+0.055
asured+0.052
 appearance+0.050
Neg logit contributions
s-0.155
ons-0.140
 they-0.139
-0.137
 said-0.135
Tokenates
Token ID689
Feature activation-1.65e-02

Pos logit contributions
ocol+0.350
 property+0.314
asured+0.271
beh+0.269
 appearance+0.268
Neg logit contributions
s-0.696
-0.663
 they-0.660
 said-0.657
 -0.653
Token to
Token ID284
Feature activation-2.73e-02

Pos logit contributions
ocol+0.045
 property+0.042
asured+0.040
isher+0.040
beh+0.040
Neg logit contributions
s-0.016
 they-0.016
 -0.015
 said-0.015
-0.015
Token share
Token ID2648
Feature activation+1.64e-02

Pos logit contributions
ocol+0.070
 property+0.065
beh+0.062
asured+0.062
isher+0.062
Neg logit contributions
 they-0.032
s-0.032
 said-0.031
 -0.031
-0.031
Token with
Token ID351
Feature activation-1.64e-02

Pos logit contributions
s+0.016
 they+0.016
+0.015
 +0.015
 said+0.014
Neg logit contributions
ocol-0.046
 property-0.043
beh-0.042
isher-0.041
asured-0.041
Token the
Token ID262
Feature activation+1.95e-02

Pos logit contributions
ocol+0.038
 property+0.035
asured+0.033
beh+0.033
isher+0.033
Neg logit contributions
s-0.024
 they-0.023
 said-0.022
 -0.022
-0.022
Token her
Token ID607
Feature activation-6.23e-02

Pos logit contributions
s+0.012
 they+0.012
cakes+0.011
 exclaimed+0.011
ves+0.011
Neg logit contributions
ocol-0.013
 property-0.012
beh-0.012
isher-0.012
asured-0.011
Token a
Token ID257
Feature activation+6.94e-02

Pos logit contributions
ocol+0.154
 property+0.148
asured+0.144
isher+0.141
beh+0.139
Neg logit contributions
s-0.069
 -0.063
 they-0.062
 exclaimed-0.061
 said-0.059
Token box
Token ID3091
Feature activation+8.04e-03

Pos logit contributions
s+0.081
 they+0.081
 +0.079
 exclaimed+0.078
+0.077
Neg logit contributions
ocol-0.159
 property-0.149
isher-0.144
asured-0.139
esty-0.134
Token of
Token ID286
Feature activation-3.74e-02

Pos logit contributions
 +0.008
s+0.008
 they+0.008
 exclaimed+0.008
+0.007
Neg logit contributions
ocol-0.020
 property-0.019
asured-0.019
isher-0.019
beh-0.018
Token ch
Token ID442
Feature activation-5.96e-02

Pos logit contributions
ocol+0.084
 property+0.082
beh+0.081
asured+0.079
esty+0.077
Neg logit contributions
 said-0.054
 they-0.054
s-0.052
 -0.049
-0.049
Tokenocol
Token ID4668
Feature activation-2.84e-01

Pos logit contributions
 property+0.079
esty+0.066
beh+0.066
olving+0.065
tons+0.064
Neg logit contributions
s-0.137
 they-0.126
 said-0.124
-0.120
 exclaimed-0.119
Tokenates
Token ID689
Feature activation-1.16e-02

Pos logit contributions
ocol+0.352
 property+0.316
beh+0.274
asured+0.272
 appearance+0.269
Neg logit contributions
s-0.696
 they-0.659
-0.658
 said-0.655
 -0.647
Token.
Token ID13
Feature activation-7.56e-03

Pos logit contributions
ocol+0.033
 property+0.031
asured+0.030
beh+0.030
isher+0.030
Neg logit contributions
s-0.010
 they-0.010
 -0.009
 said-0.009
-0.009
Token Lily
Token ID20037
Feature activation+3.05e-02

Pos logit contributions
ocol+0.019
 property+0.017
asured+0.017
isher+0.016
ities+0.016
Neg logit contributions
s-0.009
 they-0.009
 said-0.008
 -0.008
 exclaimed-0.008
Token opened
Token ID4721
Feature activation-1.26e-02

Pos logit contributions
 they+0.028
s+0.027
 +0.027
cakes+0.026
+0.026
Neg logit contributions
ocol-0.075
asured-0.075
 property-0.073
isher-0.070
esty-0.070
Token the
Token ID262
Feature activation+1.96e-02

Pos logit contributions
ocol+0.033
 property+0.030
isher+0.029
asured+0.029
ant+0.028
Neg logit contributions
s-0.013
 they-0.012
 exclaimed-0.012
 said-0.012
cakes-0.011
Token was
Token ID373
Feature activation-2.47e-02

Pos logit contributions
ocol+0.126
isher+0.112
 property+0.111
asured+0.108
ac+0.107
Neg logit contributions
 exclaimed-0.042
s-0.041
cakes-0.039
 they-0.037
 said-0.037
Token a
Token ID257
Feature activation+6.43e-02

Pos logit contributions
ocol+0.080
 property+0.074
beh+0.073
isher+0.071
esty+0.071
Neg logit contributions
 they-0.017
 said-0.015
-0.014
 -0.014
s-0.013
Token box
Token ID3091
Feature activation+1.01e-02

Pos logit contributions
 they+0.073
s+0.072
 said+0.070
 +0.070
+0.068
Neg logit contributions
ocol-0.170
beh-0.156
 property-0.155
asured-0.154
ities-0.152
Token of
Token ID286
Feature activation-3.30e-02

Pos logit contributions
s+0.010
 they+0.010
 +0.009
+0.009
 said+0.009
Neg logit contributions
ocol-0.027
 property-0.025
asured-0.024
isher-0.024
beh-0.024
Token ch
Token ID442
Feature activation-5.48e-02

Pos logit contributions
beh+0.078
esty+0.077
ocol+0.077
asured+0.076
 property+0.075
Neg logit contributions
 said-0.045
 they-0.044
 -0.039
s-0.039
-0.039
Tokenocol
Token ID4668
Feature activation-2.84e-01

Pos logit contributions
 property+0.055
asured+0.047
esty+0.046
tons+0.046
 appearance+0.043
Neg logit contributions
s-0.147
 they-0.131
 of-0.130
 said-0.130
ons-0.129
Tokenates
Token ID689
Feature activation-1.29e-02

Pos logit contributions
ocol+0.351
 property+0.346
 appearance+0.310
asured+0.300
esty+0.296
Neg logit contributions
s-0.661
-0.624
 said-0.618
 they-0.613
 -0.611
Token.
Token ID13
Feature activation-1.84e-02

Pos logit contributions
ocol+0.037
 property+0.035
beh+0.034
asured+0.034
isher+0.033
Neg logit contributions
s-0.012
 they-0.012
 said-0.011
-0.010
 -0.010
Token "
Token ID366
Feature activation-8.87e-02

Pos logit contributions
ocol+0.052
 property+0.048
asured+0.046
beh+0.046
isher+0.046
Neg logit contributions
s-0.017
 they-0.016
 said-0.015
 -0.015
-0.014
TokenFor
Token ID1890
Feature activation-6.76e-02

Pos logit contributions
ocol+0.222
 property+0.215
 appearance+0.201
asured+0.200
ities+0.194
Neg logit contributions
-0.119
 -0.116
s-0.107
 they-0.106
 said-0.105
Token me
Token ID502
Feature activation-1.08e-01

Pos logit contributions
ocol+0.183
 property+0.175
esty+0.174
isher+0.174
ities+0.171
Neg logit contributions
 they-0.070
-0.068
 said-0.068
 -0.067
 exclaimed-0.057
Token she
Token ID673
Feature activation+1.84e-02

Pos logit contributions
s+0.006
 they+0.005
 +0.005
 said+0.005
+0.005
Neg logit contributions
ocol-0.026
 property-0.024
asured-0.023
beh-0.023
isher-0.023
Token paid
Token ID3432
Feature activation-3.32e-02

Pos logit contributions
s+0.017
 they+0.016
 +0.016
+0.015
 exclaimed+0.014
Neg logit contributions
ocol-0.049
 property-0.046
asured-0.045
isher-0.044
beh-0.044
Token for
Token ID329
Feature activation+7.62e-03

Pos logit contributions
ocol+0.087
 property+0.082
asured+0.077
ities+0.077
beh+0.076
Neg logit contributions
 they-0.040
s-0.039
 said-0.038
-0.037
 -0.036
Token the
Token ID262
Feature activation+2.86e-02

Pos logit contributions
 they+0.006
 +0.005
 said+0.005
+0.005
s+0.005
Neg logit contributions
ocol-0.023
 property-0.022
beh-0.021
asured-0.021
isher-0.021
Token ch
Token ID442
Feature activation-7.41e-02

Pos logit contributions
 said+0.036
 +0.035
 they+0.034
+0.033
s+0.032
Neg logit contributions
ocol-0.072
beh-0.069
asured-0.068
 property-0.066
ities-0.064
Tokenocol
Token ID4668
Feature activation-2.83e-01

Pos logit contributions
 property+0.038
beh+0.029
esty+0.027
asured+0.023
 appearance+0.021
Neg logit contributions
s-0.224
-0.213
 -0.210
 they-0.210
 said-0.209
Tokenates
Token ID689
Feature activation-1.10e-02

Pos logit contributions
 property+0.385
ocol+0.368
 appearance+0.347
beh+0.327
asured+0.324
Neg logit contributions
s-0.642
-0.600
 said-0.579
 they-0.577
 -0.571
Token and
Token ID290
Feature activation-1.38e-03

Pos logit contributions
ocol+0.032
 property+0.029
beh+0.028
asured+0.028
ities+0.028
Neg logit contributions
s-0.011
 they-0.011
 said-0.010
-0.009
 -0.009
Token the
Token ID262
Feature activation+3.06e-02

Pos logit contributions
ocol+0.004
 property+0.004
ities+0.004
beh+0.004
asured+0.004
Neg logit contributions
 they-0.001
 said-0.001
s-0.001
 -0.001
 exclaimed-0.001
Token two
Token ID734
Feature activation+6.67e-02

Pos logit contributions
 said+0.038
 +0.037
 they+0.036
+0.035
s+0.034
Neg logit contributions
ocol-0.078
beh-0.074
asured-0.073
 property-0.072
ant-0.069
Token of
Token ID286
Feature activation-3.97e-02

Pos logit contributions
s+0.065
 they+0.063
 +0.060
+0.059
 said+0.059
Neg logit contributions
ocol-0.185
 property-0.171
asured-0.164
beh-0.163
isher-0.163
Token son
Token ID3367
Feature activation+3.56e-02

Pos logit contributions
beh+0.187
ocol+0.185
asured+0.172
 property+0.171
esty+0.165
Neg logit contributions
 said-0.110
s-0.107
 -0.104
 they-0.103
-0.102
Token is
Token ID318
Feature activation-8.28e-03

Pos logit contributions
s+0.034
 said+0.034
 they+0.034
+0.031
 +0.031
Neg logit contributions
ocol-0.105
 property-0.094
ities-0.092
asured-0.092
beh-0.092
Token eating
Token ID6600
Feature activation+7.12e-03

Pos logit contributions
ocol+0.021
isher+0.019
 property+0.019
ities+0.019
esty+0.019
Neg logit contributions
 they-0.011
 said-0.010
-0.010
 -0.010
s-0.009
Token our
Token ID674
Feature activation-5.14e-02

Pos logit contributions
 they+0.007
+0.006
s+0.006
 +0.006
 said+0.006
Neg logit contributions
beh-0.021
ocol-0.020
 property-0.020
asured-0.020
esty-0.019
Token ch
Token ID442
Feature activation-6.49e-02

Pos logit contributions
ocol+0.124
 property+0.118
beh+0.117
asured+0.114
ities+0.113
Neg logit contributions
s-0.068
 -0.065
 said-0.065
-0.062
 they-0.062
Tokenocol
Token ID4668
Feature activation-2.82e-01

Pos logit contributions
 property+0.039
esty+0.027
beh+0.026
tons+0.024
nam+0.020
Neg logit contributions
s-0.192
-0.177
ons-0.176
 said-0.173
 -0.173
Tokenates
Token ID689
Feature activation-1.31e-02

Pos logit contributions
 property+0.370
ocol+0.364
 appearance+0.340
esty+0.305
beh+0.304
Neg logit contributions
s-0.648
-0.602
 said-0.585
 -0.578
 exclaimed-0.578
Token!"
Token ID2474
Feature activation+9.22e-02

Pos logit contributions
ocol+0.038
 property+0.034
beh+0.034
asured+0.033
isher+0.033
Neg logit contributions
s-0.013
 they-0.013
 said-0.012
-0.011
 -0.011
Token
Token ID220
Feature activation+4.77e-02

Pos logit contributions
s+0.075
 they+0.069
 said+0.065
 exclaimed+0.059
cakes+0.058
Neg logit contributions
ocol-0.258
 property-0.236
asured-0.229
ities-0.229
beh-0.227
Token
Token ID198
Feature activation+2.88e-02

Pos logit contributions
 they+0.016
s+0.015
 said+0.014
cakes+0.012
 exclaimed+0.011
Neg logit contributions
ocol-0.149
 property-0.139
asured-0.137
uff-0.136
beh-0.136
Token
Token ID198
Feature activation+2.13e-02

Pos logit contributions
 they+0.027
s+0.026
 said+0.024
cakes+0.024
 exclaimed+0.021
Neg logit contributions
ocol-0.072
asured-0.065
 property-0.065
uff-0.064
uzz-0.064
Token he
Token ID339
Feature activation+8.66e-03

Pos logit contributions
 they+0.032
 +0.025
s+0.024
 said+0.024
+0.023
Neg logit contributions
ocol-0.214
 property-0.200
isher-0.196
ities-0.194
beh-0.194
Token ate
Token ID15063
Feature activation+2.54e-02

Pos logit contributions
s+0.009
 they+0.009
 said+0.009
 +0.009
+0.009
Neg logit contributions
ocol-0.023
 property-0.021
asured-0.020
beh-0.020
isher-0.020
Token all
Token ID477
Feature activation-8.44e-03

Pos logit contributions
 they+0.025
s+0.023
 +0.023
+0.023
 said+0.022
Neg logit contributions
ocol-0.072
 property-0.069
beh-0.067
asured-0.065
ities-0.065
Token the
Token ID262
Feature activation+2.15e-02

Pos logit contributions
ocol+0.023
 property+0.021
isher+0.020
asured+0.020
beh+0.020
Neg logit contributions
s-0.008
 they-0.008
 said-0.008
 -0.008
 exclaimed-0.008
Token ch
Token ID442
Feature activation-6.48e-02

Pos logit contributions
s+0.030
 said+0.028
 +0.028
+0.028
 they+0.028
Neg logit contributions
ocol-0.051
 property-0.048
asured-0.046
beh-0.046
ities-0.046
Tokenocol
Token ID4668
Feature activation-2.82e-01

Pos logit contributions
 property+0.050
esty+0.037
 appearance+0.031
tons+0.023
beh+0.023
Neg logit contributions
s-0.196
ons-0.179
 said-0.179
 they-0.179
-0.178
Tokenates
Token ID689
Feature activation-1.74e-02

Pos logit contributions
 property+0.366
ocol+0.346
 appearance+0.325
esty+0.306
beh+0.299
Neg logit contributions
s-0.656
-0.623
 said-0.609
 they-0.607
 -0.603
Token and
Token ID290
Feature activation-5.85e-03

Pos logit contributions
ocol+0.047
 property+0.044
asured+0.042
beh+0.042
isher+0.041
Neg logit contributions
s-0.018
 they-0.018
 said-0.017
 -0.017
-0.016
Token sweets
Token ID42402
Feature activation-3.88e-02

Pos logit contributions
ocol+0.017
 property+0.015
isher+0.015
beh+0.015
ities+0.015
Neg logit contributions
 they-0.006
 said-0.006
s-0.006
 -0.005
 exclaimed-0.005
Token.
Token ID13
Feature activation-1.85e-02

Pos logit contributions
ocol+0.101
 property+0.094
asured+0.090
ities+0.089
isher+0.089
Neg logit contributions
s-0.041
 they-0.040
 -0.039
-0.039
 said-0.038
Token Then
Token ID3244
Feature activation+1.03e-02

Pos logit contributions
ocol+0.053
 property+0.050
isher+0.047
beh+0.047
ities+0.047
Neg logit contributions
 -0.023
-0.023
 they-0.020
s-0.019
 said-0.018
Token a
Token ID257
Feature activation+7.40e-02

Pos logit contributions
ocol+0.091
beh+0.085
isher+0.083
asured+0.082
ities+0.081
Neg logit contributions
 they-0.024
-0.018
 -0.017
 said-0.016
 she-0.015
Token big
Token ID1263
Feature activation+2.82e-02

Pos logit contributions
 they+0.083
 said+0.082
 +0.081
s+0.080
+0.080
Neg logit contributions
ocol-0.199
beh-0.190
asured-0.182
 property-0.180
ities-0.178
Token box
Token ID3091
Feature activation+1.14e-02

Pos logit contributions
 said+0.029
 they+0.029
 +0.028
+0.026
s+0.025
Neg logit contributions
beh-0.080
ocol-0.077
asured-0.074
ant-0.072
ities-0.072
Token of
Token ID286
Feature activation-4.17e-02

Pos logit contributions
s+0.012
 they+0.011
 +0.011
 said+0.011
+0.011
Neg logit contributions
ocol-0.031
 property-0.029
asured-0.027
isher-0.027
beh-0.027
Token ch
Token ID442
Feature activation-5.90e-02

Pos logit contributions
beh+0.102
asured+0.099
esty+0.099
ant+0.096
ities+0.095
Neg logit contributions
 said-0.057
 they-0.056
 -0.052
-0.051
 big-0.048
Tokenocol
Token ID4668
Feature activation-2.82e-01

Pos logit contributions
 property+0.044
asured+0.038
tons+0.038
beh+0.037
esty+0.034
Neg logit contributions
s-0.161
ons-0.153
 they-0.148
 of-0.146
-0.146
Tokenates
Token ID689
Feature activation-1.02e-02

Pos logit contributions
ocol+0.345
 property+0.325
asured+0.285
 appearance+0.283
beh+0.282
Neg logit contributions
s-0.668
-0.639
 said-0.634
 they-0.631
 -0.629
Token!
Token ID0
Feature activation-8.76e-03

Pos logit contributions
ocol+0.030
 property+0.028
beh+0.028
asured+0.027
isher+0.027
Neg logit contributions
 they-0.009
s-0.009
 said-0.008
-0.007
 -0.007
Token
Token ID220
Feature activation+3.70e-02

Pos logit contributions
ocol+0.021
 property+0.019
beh+0.018
asured+0.018
isher+0.018
Neg logit contributions
s-0.011
 they-0.011
 said-0.011
 exclaimed-0.010
 -0.010
Token
Token ID198
Feature activation+2.29e-02

Pos logit contributions
s+0.016
 they+0.015
 said+0.014
 exclaimed+0.013
cakes+0.011
Neg logit contributions
ocol-0.110
 property-0.106
uzz-0.104
uff-0.103
asured-0.102
Token
Token ID198
Feature activation+1.62e-02

Pos logit contributions
cakes+0.027
 timer+0.025
 they+0.023
 film+0.020
 replied+0.020
Neg logit contributions
uzz-0.046
 yourself-0.042
uff-0.040
imb-0.040
ocol-0.040
Token They
Token ID1119
Feature activation+2.88e-02

Pos logit contributions
ocol+0.028
 property+0.027
esty+0.027
ities+0.027
 appearance+0.026
Neg logit contributions
cakes-0.011
s-0.011
 exclaimed-0.011
 said-0.010
 they-0.010
Token were
Token ID547
Feature activation-1.47e-04

Pos logit contributions
s+0.033
+0.030
 they+0.030
 +0.030
cakes+0.028
Neg logit contributions
ocol-0.069
 property-0.067
asured-0.064
beh-0.064
ities-0.063
Token not
Token ID407
Feature activation+2.25e-02

Pos logit contributions
ocol+0.000
 property+0.000
isher+0.000
beh+0.000
ities+0.000
Neg logit contributions
 they-0.000
 said-0.000
s-0.000
-0.000
 -0.000
Token depend
Token ID4745
Feature activation-4.25e-02

Pos logit contributions
 they+0.030
 said+0.029
+0.028
s+0.028
 +0.027
Neg logit contributions
ocol-0.056
 property-0.051
isher-0.049
esty-0.047
beh-0.047
Tokenable
Token ID540
Feature activation-1.19e-02

Pos logit contributions
ocol+0.076
 property+0.066
ubb+0.062
asured+0.061
ities+0.061
Neg logit contributions
s-0.093
 they-0.087
 said-0.086
-0.083
 -0.080
Token friends
Token ID2460
Feature activation+6.16e-02

Pos logit contributions
ocol+0.037
beh+0.033
isher+0.033
 property+0.033
asured+0.032
Neg logit contributions
 they-0.010
s-0.010
 said-0.009
-0.008
 -0.007
Token anymore
Token ID7471
Feature activation+2.04e-02

Pos logit contributions
s+0.071
 they+0.070
+0.066
 said+0.064
 +0.061
Neg logit contributions
ocol-0.171
 property-0.151
isher-0.150
beh-0.149
asured-0.148
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
s+0.016
 they+0.016
+0.014
 said+0.014
 +0.014
Neg logit contributions
ocol-0.062
 property-0.057
isher-0.056
beh-0.056
asured-0.055
Token
Token ID198
Feature activation+2.56e-02

Pos logit contributions
ocol+0.028
 property+0.027
esty+0.027
ities+0.027
 appearance+0.026
Neg logit contributions
cakes-0.008
s-0.008
 exclaimed-0.008
 said-0.007
 they-0.007
Token
Token ID198
Feature activation+1.88e-02

Pos logit contributions
cakes+0.032
 arrived+0.026
s+0.025
 timer+0.024
 they+0.024
Neg logit contributions
uzz-0.053
ocol-0.052
 property-0.051
asured-0.049
uff-0.048
TokenThen
Token ID6423
Feature activation-1.44e-02

Pos logit contributions
cakes+0.027
 +0.026
s+0.026
 said+0.026
 exclaimed+0.025
Neg logit contributions
ocol-0.038
 property-0.036
esty-0.034
ac-0.034
asured-0.033
Token turned
Token ID2900
Feature activation+3.98e-02

Pos logit contributions
ocol+0.075
 property+0.070
asured+0.069
isher+0.068
beh+0.067
Neg logit contributions
s-0.035
 they-0.034
 -0.033
-0.033
 said-0.030
Token it
Token ID340
Feature activation+4.76e-02

Pos logit contributions
s+0.038
 they+0.037
 said+0.035
 +0.035
+0.034
Neg logit contributions
ocol-0.110
 property-0.102
isher-0.098
asured-0.098
beh-0.097
Token on
Token ID319
Feature activation+1.64e-02

Pos logit contributions
s+0.059
 they+0.058
 said+0.055
 +0.054
+0.054
Neg logit contributions
ocol-0.124
 property-0.114
beh-0.108
asured-0.108
isher-0.107
Token,
Token ID11
Feature activation+1.12e-03

Pos logit contributions
+0.010
 they+0.010
 +0.009
s+0.008
 said+0.008
Neg logit contributions
ocol-0.055
beh-0.052
 property-0.050
asured-0.049
isher-0.049
Token and
Token ID290
Feature activation+8.33e-03

Pos logit contributions
 they+0.001
s+0.001
 said+0.001
 +0.001
+0.001
Neg logit contributions
ocol-0.003
 property-0.003
beh-0.003
isher-0.003
ities-0.003
Token it
Token ID340
Feature activation+4.64e-02

Pos logit contributions
 they+0.008
 said+0.007
s+0.007
 +0.007
+0.007
Neg logit contributions
ocol-0.025
 property-0.023
beh-0.022
ities-0.022
asured-0.022
Token made
Token ID925
Feature activation+4.66e-02

Pos logit contributions
 +0.042
s+0.042
+0.041
 they+0.040
 exclaimed+0.036
Neg logit contributions
ocol-0.115
asured-0.110
isher-0.109
 property-0.109
beh-0.106
Token a
Token ID257
Feature activation+7.20e-02

Pos logit contributions
s+0.049
 they+0.048
 said+0.045
 +0.045
+0.045
Neg logit contributions
ocol-0.125
 property-0.116
asured-0.111
beh-0.111
isher-0.110
Token funny
Token ID8258
Feature activation-1.12e-03

Pos logit contributions
s+0.102
 they+0.100
 said+0.095
+0.094
 +0.094
Neg logit contributions
ocol-0.157
 property-0.148
 appearance-0.134
ities-0.133
isher-0.132
Token noise
Token ID7838
Feature activation+2.52e-02

Pos logit contributions
ocol+0.002
 property+0.002
ities+0.002
 appearance+0.002
asured+0.002
Neg logit contributions
s-0.002
 they-0.002
 -0.002
 said-0.001
-0.001
Token.
Token ID13
Feature activation-1.82e-02

Pos logit contributions
 they+0.020
s+0.020
 said+0.018
+0.017
 +0.016
Neg logit contributions
ocol-0.076
 property-0.071
asured-0.069
beh-0.068
isher-0.068
Token delicious
Token ID12625
Feature activation+4.76e-03

Pos logit contributions
ocol+0.016
 property+0.015
beh+0.015
isher+0.015
esty+0.015
Neg logit contributions
 they-0.008
 said-0.008
 -0.007
-0.007
s-0.007
Token!
Token ID0
Feature activation+9.79e-04

Pos logit contributions
 they+0.004
s+0.003
 said+0.003
+0.003
 +0.003
Neg logit contributions
ocol-0.015
 property-0.014
beh-0.013
asured-0.013
isher-0.013
Token Lily
Token ID20037
Feature activation+3.75e-02

Pos logit contributions
s+0.001
 they+0.001
 said+0.001
 +0.001
 exclaimed+0.001
Neg logit contributions
ocol-0.003
 property-0.003
asured-0.002
beh-0.002
isher-0.002
Token was
Token ID373
Feature activation-1.28e-02

Pos logit contributions
 they+0.033
s+0.033
 +0.030
+0.029
cakes+0.027
Neg logit contributions
ocol-0.100
 property-0.095
asured-0.094
beh-0.091
ities-0.090
Token happy
Token ID3772
Feature activation+4.04e-02

Pos logit contributions
ocol+0.031
 property+0.029
asured+0.028
beh+0.028
ities+0.027
Neg logit contributions
s-0.017
 they-0.016
 said-0.015
 -0.015
-0.015
Token that
Token ID326
Feature activation+6.09e-02

Pos logit contributions
s+0.038
 +0.035
 they+0.034
 exclaimed+0.033
 said+0.033
Neg logit contributions
ocol-0.109
 property-0.102
asured-0.099
ities-0.097
beh-0.097
Token she
Token ID673
Feature activation+1.94e-02

Pos logit contributions
 they+0.029
 said+0.018
+0.017
 +0.016
s+0.015
Neg logit contributions
ocol-0.223
 property-0.210
isher-0.207
asured-0.203
beh-0.201
Token found
Token ID1043
Feature activation+2.04e-02

Pos logit contributions
s+0.021
 +0.020
 they+0.020
+0.019
 exclaimed+0.018
Neg logit contributions
ocol-0.048
 property-0.046
asured-0.045
beh-0.045
isher-0.044
Token the
Token ID262
Feature activation+2.70e-02

Pos logit contributions
s+0.021
 they+0.021
 said+0.020
 +0.019
+0.019
Neg logit contributions
ocol-0.055
 property-0.052
asured-0.049
beh-0.049
isher-0.049
Token lime
Token ID28738
Feature activation-3.40e-02

Pos logit contributions
 they+0.033
 said+0.033
+0.032
s+0.031
 +0.031
Neg logit contributions
ocol-0.069
beh-0.063
asured-0.061
ities-0.059
ant-0.059
Token and
Token ID290
Feature activation-1.45e-03

Pos logit contributions
ocol+0.099
 property+0.091
beh+0.088
asured+0.088
isher+0.087
Neg logit contributions
s-0.031
 they-0.031
 said-0.029
-0.028
 -0.027
Tokenola
Token ID5708
Feature activation+2.81e-02

Pos logit contributions
ocol+0.178
 property+0.165
beh+0.152
ities+0.151
asured+0.150
Neg logit contributions
s-0.144
 they-0.140
 said-0.138
-0.137
 -0.135
Token was
Token ID373
Feature activation-1.50e-02

Pos logit contributions
s+0.027
 they+0.027
 +0.025
+0.025
 said+0.023
Neg logit contributions
ocol-0.074
 property-0.070
asured-0.068
beh-0.067
isher-0.067
Token surprised
Token ID6655
Feature activation+5.35e-02

Pos logit contributions
ocol+0.034
 property+0.032
asured+0.031
beh+0.031
ities+0.030
Neg logit contributions
s-0.021
 they-0.021
 said-0.019
 -0.019
-0.019
Token to
Token ID284
Feature activation-1.27e-03

Pos logit contributions
s+0.055
 they+0.053
 +0.051
 said+0.050
+0.049
Neg logit contributions
ocol-0.144
 property-0.134
asured-0.129
beh-0.127
ities-0.127
Token know
Token ID760
Feature activation+5.36e-02

Pos logit contributions
 property+0.003
ocol+0.003
 appearance+0.003
asured+0.003
ities+0.002
Neg logit contributions
s-0.002
cakes-0.002
 they-0.002
 exclaimed-0.002
 said-0.002
Token that
Token ID326
Feature activation+5.96e-02

Pos logit contributions
 they+0.051
s+0.047
 said+0.045
+0.044
 +0.044
Neg logit contributions
ocol-0.154
 property-0.145
beh-0.140
asured-0.140
isher-0.139
Token cob
Token ID22843
Feature activation-8.83e-02

Pos logit contributions
 they+0.057
 said+0.047
+0.046
 +0.044
s+0.044
Neg logit contributions
ocol-0.185
isher-0.170
 property-0.167
beh-0.166
esty-0.165
Tokenwe
Token ID732
Feature activation-6.11e-03

Pos logit contributions
ocol+0.187
 property+0.164
beh+0.163
esty+0.161
asured+0.159
Neg logit contributions
 they-0.144
s-0.142
 said-0.136
-0.133
cakes-0.131
Tokenbs
Token ID1443
Feature activation-9.94e-03

Pos logit contributions
ocol+0.015
 property+0.014
asured+0.013
ities+0.013
isher+0.013
Neg logit contributions
s-0.009
 they-0.009
 said-0.008
 -0.008
-0.008
Token could
Token ID714
Feature activation+1.26e-02

Pos logit contributions
ocol+0.029
 property+0.026
asured+0.026
beh+0.026
isher+0.026
Neg logit contributions
 they-0.010
s-0.010
 said-0.009
-0.008
 -0.008
Token make
Token ID787
Feature activation+5.69e-02

Pos logit contributions
s+0.016
 they+0.016
 said+0.015
+0.015
 +0.015
Neg logit contributions
ocol-0.032
 property-0.029
isher-0.028
beh-0.028
asured-0.028
Token
Token ID198
Feature activation+1.74e-02

Pos logit contributions
ocol+0.038
 property+0.036
asured+0.034
beh+0.034
isher+0.034
Neg logit contributions
s-0.011
 they-0.010
 said-0.010
 exclaimed-0.009
 -0.009
Token
Token ID198
Feature activation+1.02e-02

Pos logit contributions
cakes+0.014
 exclaimed+0.013
 replied+0.013
 said+0.012
s+0.011
Neg logit contributions
uzz-0.043
ocol-0.039
ac-0.039
uff-0.039
 property-0.038
TokenOne
Token ID3198
Feature activation-8.22e-02

Pos logit contributions
s+0.014
 they+0.013
 said+0.013
 +0.013
+0.013
Neg logit contributions
ocol-0.024
 property-0.022
asured-0.021
beh-0.021
isher-0.021
Token day
Token ID1110
Feature activation+3.67e-02

Pos logit contributions
ocol+0.173
 property+0.159
asured+0.152
isher+0.148
beh+0.147
Neg logit contributions
s-0.128
 they-0.124
 -0.124
 exclaimed-0.121
 said-0.121
Token,
Token ID11
Feature activation-1.48e-02

Pos logit contributions
s+0.029
 they+0.028
 +0.026
 said+0.026
+0.026
Neg logit contributions
ocol-0.108
 property-0.101
asured-0.097
beh-0.096
isher-0.096
Token a
Token ID257
Feature activation+6.80e-02

Pos logit contributions
ocol+0.045
 property+0.042
asured+0.040
isher+0.040
ities+0.040
Neg logit contributions
s-0.010
 -0.009
 they-0.009
 said-0.009
-0.008
Token big
Token ID1263
Feature activation+1.77e-02

Pos logit contributions
s+0.087
 exclaimed+0.081
 +0.081
 they+0.077
+0.076
Neg logit contributions
ocol-0.159
 property-0.148
isher-0.139
esty-0.136
ubb-0.134
Token wind
Token ID2344
Feature activation+7.43e-03

Pos logit contributions
 they+0.023
s+0.023
 said+0.022
 +0.021
+0.021
Neg logit contributions
ocol-0.043
 property-0.040
beh-0.038
asured-0.038
ities-0.038
Token comes
Token ID2058
Feature activation-2.30e-02

Pos logit contributions
 +0.007
s+0.006
 they+0.006
 exclaimed+0.006
+0.006
Neg logit contributions
ocol-0.020
 property-0.019
isher-0.018
asured-0.018
beh-0.018
Token and
Token ID290
Feature activation-1.21e-03

Pos logit contributions
ocol+0.050
 property+0.049
asured+0.048
beh+0.048
ac+0.046
Neg logit contributions
cakes-0.027
 -0.026
 exclaimed-0.026
 they-0.025
 said-0.025
Token blows
Token ID20385
Feature activation+5.18e-02

Pos logit contributions
ocol+0.003
 property+0.003
beh+0.003
isher+0.003
asured+0.003
Neg logit contributions
s-0.001
 -0.001
 they-0.001
-0.001
 said-0.001
Token time
Token ID640
Feature activation-3.66e-02

Pos logit contributions
 they+0.125
s+0.123
cakes+0.123
 exclaimed+0.119
 said+0.117
Neg logit contributions
ocol-0.134
 property-0.126
asured-0.120
ities-0.116
ac-0.114
Token,
Token ID11
Feature activation-1.14e-02

Pos logit contributions
ocol+0.101
 property+0.093
asured+0.088
beh+0.088
ities+0.088
Neg logit contributions
s-0.032
 said-0.030
 they-0.030
 exclaimed-0.030
 -0.028
Token there
Token ID612
Feature activation+1.89e-02

Pos logit contributions
ocol+0.029
 property+0.027
beh+0.026
asured+0.026
isher+0.026
Neg logit contributions
 they-0.014
s-0.014
-0.014
 -0.013
 said-0.013
Token was
Token ID373
Feature activation-2.30e-02

Pos logit contributions
s+0.013
 they+0.013
 exclaimed+0.012
 said+0.011
cakes+0.011
Neg logit contributions
ocol-0.054
 property-0.051
asured-0.049
isher-0.048
ities-0.048
Token a
Token ID257
Feature activation+7.18e-02

Pos logit contributions
ocol+0.059
 property+0.054
asured+0.052
ities+0.052
beh+0.051
Neg logit contributions
s-0.027
 they-0.023
 said-0.023
 exclaimed-0.023
 -0.022
Token little
Token ID1310
Feature activation-6.77e-03

Pos logit contributions
s+0.082
 they+0.078
 +0.076
 said+0.075
 exclaimed+0.073
Neg logit contributions
ocol-0.184
 property-0.170
asured-0.162
isher-0.161
ities-0.160
Token boy
Token ID2933
Feature activation+2.99e-02

Pos logit contributions
ocol+0.018
 property+0.017
asured+0.017
beh+0.017
ities+0.016
Neg logit contributions
 they-0.008
s-0.008
 said-0.007
-0.007
 -0.007
Token named
Token ID3706
Feature activation-4.08e-03

Pos logit contributions
s+0.041
 they+0.040
 said+0.039
+0.039
 +0.038
Neg logit contributions
ocol-0.071
 property-0.066
asured-0.062
isher-0.062
beh-0.062
Token Tim
Token ID5045
Feature activation-1.85e-02

Pos logit contributions
ocol+0.008
 property+0.008
asured+0.007
beh+0.007
isher+0.007
Neg logit contributions
s-0.007
 they-0.006
 said-0.006
 -0.006
 exclaimed-0.006
Tokenmy
Token ID1820
Feature activation+2.12e-02

Pos logit contributions
ocol+0.038
 property+0.036
asured+0.035
isher+0.033
beh+0.033
Neg logit contributions
 they-0.029
s-0.029
cakes-0.027
 exclaimed-0.027
 -0.027
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
cakes+0.022
 they+0.022
s+0.022
 +0.021
 exclaimed+0.021
Neg logit contributions
ocol-0.053
 property-0.051
asured-0.049
beh-0.048
ities-0.048
TokenWe
Token ID1135
Feature activation+2.43e-02

Pos logit contributions
ocol+0.243
 property+0.214
isher+0.210
asured+0.209
beh+0.206
Neg logit contributions
-0.104
 they-0.104
s-0.101
 -0.092
 said-0.087
Token found
Token ID1043
Feature activation+2.37e-02

Pos logit contributions
 said+0.032
 they+0.031
s+0.029
 +0.029
+0.028
Neg logit contributions
ocol-0.063
 property-0.057
asured-0.056
beh-0.055
ities-0.055
Token a
Token ID257
Feature activation+6.01e-02

Pos logit contributions
 they+0.025
s+0.025
 said+0.023
+0.023
 +0.023
Neg logit contributions
ocol-0.064
 property-0.059
asured-0.058
beh-0.057
ities-0.057
Token huge
Token ID3236
Feature activation-1.11e-04

Pos logit contributions
s+0.071
 they+0.070
 said+0.067
 +0.067
+0.066
Neg logit contributions
ocol-0.154
 property-0.141
beh-0.137
asured-0.137
isher-0.136
Token root
Token ID6808
Feature activation-2.26e-02

Pos logit contributions
ocol+0.000
beh+0.000
asured+0.000
ities+0.000
 property+0.000
Neg logit contributions
 they-0.000
 said-0.000
s-0.000
 -0.000
-0.000
Token.
Token ID13
Feature activation-3.22e-02

Pos logit contributions
ocol+0.063
 property+0.057
beh+0.057
asured+0.055
ities+0.055
Neg logit contributions
 they-0.024
s-0.024
 said-0.022
-0.021
 -0.020
Token See
Token ID4091
Feature activation-3.81e-02

Pos logit contributions
ocol+0.082
 property+0.076
beh+0.075
asured+0.075
isher+0.074
Neg logit contributions
-0.045
 they-0.044
 -0.042
s-0.038
 said-0.036
Token?"
Token ID1701
Feature activation+8.52e-02

Pos logit contributions
ocol+0.099
 property+0.091
beh+0.088
asured+0.088
isher+0.087
Neg logit contributions
 they-0.045
s-0.045
 said-0.042
-0.042
 -0.042
Token He
Token ID679
Feature activation-2.17e-02

Pos logit contributions
cakes+0.072
s+0.065
odic+0.052
 exclaimed+0.051
lic+0.050
Neg logit contributions
ocol-0.203
uzz-0.202
asured-0.185
uff-0.182
 than-0.182
Token showed
Token ID3751
Feature activation+2.85e-03

Pos logit contributions
ocol+0.047
asured+0.045
isher+0.043
ities+0.042
ac+0.042
Neg logit contributions
s-0.027
 they-0.026
 -0.025
cakes-0.025
-0.024
Token her
Token ID607
Feature activation-6.48e-02

Pos logit contributions
s+0.004
 exclaimed+0.004
 +0.004
 they+0.004
+0.004
Neg logit contributions
ocol-0.006
isher-0.005
uzz-0.005
asured-0.005
ant-0.005
Token,
Token ID11
Feature activation-2.54e-02

Pos logit contributions
s+0.054
 they+0.053
 +0.051
cakes+0.049
 exclaimed+0.049
Neg logit contributions
ocol-0.150
 property-0.145
isher-0.134
beh-0.133
ities-0.133
Token I
Token ID314
Feature activation-1.10e-02

Pos logit contributions
ocol+0.076
asured+0.073
beh+0.072
isher+0.070
 property+0.070
Neg logit contributions
 they-0.028
 said-0.021
-0.018
 -0.018
s-0.017
Token want
Token ID765
Feature activation-3.46e-02

Pos logit contributions
ocol+0.020
 property+0.018
asured+0.017
isher+0.017
beh+0.017
Neg logit contributions
s-0.021
 they-0.021
 said-0.020
-0.020
 -0.020
Token to
Token ID284
Feature activation-3.85e-03

Pos logit contributions
ocol+0.097
 property+0.091
asured+0.086
isher+0.085
beh+0.085
Neg logit contributions
s-0.030
 -0.028
 they-0.027
 exclaimed-0.026
 said-0.026
Token keep
Token ID1394
Feature activation-1.86e-02

Pos logit contributions
ocol+0.009
 property+0.008
asured+0.008
beh+0.008
ities+0.008
Neg logit contributions
s-0.005
 they-0.005
 -0.005
 said-0.005
-0.005
Token them
Token ID606
Feature activation+2.47e-02

Pos logit contributions
ocol+0.051
 property+0.048
asured+0.046
beh+0.046
ities+0.046
Neg logit contributions
 they-0.019
s-0.018
 said-0.017
 -0.017
-0.017
Token!"
Token ID2474
Feature activation+9.73e-02

Pos logit contributions
s+0.022
 they+0.021
 said+0.020
 +0.019
+0.019
Neg logit contributions
ocol-0.072
 property-0.066
beh-0.064
asured-0.064
isher-0.064
Token Tom
Token ID4186
Feature activation+4.58e-02

Pos logit contributions
s+0.174
cakes+0.162
 exclaimed+0.145
lic+0.144
 +0.141
Neg logit contributions
ocol-0.177
 property-0.157
 yourself-0.139
asured-0.139
 appearance-0.139
Token says
Token ID1139
Feature activation+7.23e-02

Pos logit contributions
 they+0.044
s+0.042
cakes+0.041
 +0.038
+0.036
Neg logit contributions
ocol-0.113
 property-0.113
beh-0.107
asured-0.106
ities-0.106
Token.
Token ID13
Feature activation-2.59e-02

Pos logit contributions
s+0.080
 exclaimed+0.077
 said+0.076
cakes+0.075
 +0.074
Neg logit contributions
ocol-0.173
 property-0.158
beh-0.155
asured-0.155
ities-0.154
Token "
Token ID366
Feature activation-8.83e-02

Pos logit contributions
ocol+0.045
 than+0.043
 property+0.043
uzz+0.043
esty+0.042
Neg logit contributions
 exclaimed-0.039
s-0.037
cakes-0.037
 said-0.034
-0.033
Token few
Token ID1178
Feature activation-5.41e-02

Pos logit contributions
s+0.067
cakes+0.067
 exclaimed+0.065
 said+0.063
 +0.061
Neg logit contributions
ocol-0.115
 property-0.103
 appearance-0.100
esty-0.098
ubb-0.098
Token months
Token ID1933
Feature activation-3.51e-02

Pos logit contributions
ocol+0.110
 property+0.101
beh+0.095
asured+0.095
isher+0.094
Neg logit contributions
s-0.092
 they-0.092
 said-0.089
 -0.088
-0.087
Token of
Token ID286
Feature activation-3.89e-02

Pos logit contributions
ocol+0.104
 property+0.103
ities+0.096
asured+0.096
beh+0.096
Neg logit contributions
 they-0.030
s-0.028
 said-0.026
-0.025
 -0.025
Token training
Token ID3047
Feature activation-5.96e-02

Pos logit contributions
ocol+0.091
 property+0.090
esty+0.088
isher+0.086
beh+0.086
Neg logit contributions
 they-0.056
 -0.053
-0.053
 said-0.051
s-0.048
Token,
Token ID11
Feature activation+2.22e-03

Pos logit contributions
ocol+0.180
beh+0.179
 property+0.176
asured+0.172
isher+0.172
Neg logit contributions
 they-0.044
s-0.043
-0.039
 -0.037
 said-0.034
Token Pete
Token ID17542
Feature activation+3.16e-03

Pos logit contributions
 they+0.003
+0.003
s+0.003
 +0.003
 said+0.002
Neg logit contributions
ocol-0.006
 property-0.006
beh-0.005
asured-0.005
isher-0.005
Token's
Token ID338
Feature activation-4.64e-02

Pos logit contributions
s+0.003
 they+0.003
 +0.002
 said+0.002
+0.002
Neg logit contributions
ocol-0.009
 property-0.008
asured-0.008
beh-0.008
isher-0.008
Token brain
Token ID3632
Feature activation-3.77e-02

Pos logit contributions
ocol+0.118
beh+0.106
 property+0.102
isher+0.102
asured+0.100
Neg logit contributions
s-0.060
-0.060
 -0.059
 they-0.058
 said-0.058
Token felt
Token ID2936
Feature activation+4.07e-02

Pos logit contributions
ocol+0.118
 property+0.109
isher+0.105
beh+0.105
asured+0.104
Neg logit contributions
s-0.027
 they-0.025
 said-0.024
-0.022
 -0.022
Token more
Token ID517
Feature activation+1.17e-02

Pos logit contributions
 they+0.048
 +0.047
+0.047
s+0.046
 said+0.046
Neg logit contributions
ocol-0.108
 property-0.100
isher-0.097
ities-0.096
beh-0.095
Token reliable
Token ID9314
Feature activation-1.24e-02

Pos logit contributions
s+0.017
 said+0.017
+0.016
 they+0.016
 +0.016
Neg logit contributions
ocol-0.028
 property-0.025
isher-0.025
beh-0.024
esty-0.024
Token and
Token ID290
Feature activation+4.78e-03

Pos logit contributions
ocol+0.144
isher+0.132
 property+0.130
asured+0.129
beh+0.129
Neg logit contributions
 they-0.039
s-0.037
 said-0.035
-0.034
 -0.033
Token he
Token ID339
Feature activation+6.99e-03

Pos logit contributions
 they+0.004
s+0.003
 said+0.003
 +0.003
+0.003
Neg logit contributions
ocol-0.015
 property-0.014
isher-0.013
ities-0.013
asured-0.013
Token told
Token ID1297
Feature activation+4.91e-02

Pos logit contributions
s+0.004
 they+0.004
 +0.003
+0.003
cakes+0.003
Neg logit contributions
ocol-0.021
 property-0.020
asured-0.020
beh-0.019
isher-0.019
Token her
Token ID607
Feature activation-6.53e-02

Pos logit contributions
s+0.064
 they+0.062
 +0.059
 said+0.059
+0.059
Neg logit contributions
ocol-0.120
 property-0.110
asured-0.105
beh-0.105
isher-0.104
Token he
Token ID339
Feature activation+3.29e-03

Pos logit contributions
ocol+0.171
 property+0.161
asured+0.152
beh+0.152
ities+0.150
Neg logit contributions
s-0.068
 said-0.060
 -0.060
 they-0.060
-0.059
Token was
Token ID373
Feature activation-1.70e-02

Pos logit contributions
s+0.003
 they+0.003
 said+0.003
 +0.003
+0.003
Neg logit contributions
ocol-0.009
 property-0.009
asured-0.008
beh-0.008
isher-0.008
Token exploring
Token ID13504
Feature activation-3.08e-02

Pos logit contributions
ocol+0.040
 property+0.038
isher+0.036
ities+0.036
beh+0.036
Neg logit contributions
 they-0.023
 said-0.022
 -0.022
-0.022
s-0.022
Token.
Token ID13
Feature activation-6.18e-03

Pos logit contributions
ocol+0.094
beh+0.088
 property+0.087
isher+0.087
ities+0.086
Neg logit contributions
 they-0.024
-0.022
s-0.022
 -0.020
 said-0.019
Token She
Token ID1375
Feature activation-2.51e-02

Pos logit contributions
ocol+0.017
 property+0.015
esty+0.015
beh+0.015
uzz+0.015
Neg logit contributions
s-0.005
 they-0.005
 said-0.004
cakes-0.004
 exclaimed-0.004
Token welcomed
Token ID15846
Feature activation-3.68e-03

Pos logit contributions
ocol+0.067
asured+0.065
 property+0.064
beh+0.062
isher+0.061
Neg logit contributions
s-0.020
 -0.018
 they-0.018
-0.018
cakes-0.017
Token him
Token ID683
Feature activation-4.26e-02

Pos logit contributions
ocol+0.009
 property+0.008
asured+0.008
beh+0.008
isher+0.008
Neg logit contributions
s-0.005
 they-0.005
 said-0.005
 -0.005
-0.005
Token began
Token ID2540
Feature activation-2.18e-02

Pos logit contributions
 they+0.014
 said+0.014
s+0.013
 +0.013
+0.012
Neg logit contributions
ocol-0.030
 property-0.028
isher-0.027
ities-0.026
asured-0.026
Token flying
Token ID7348
Feature activation-1.72e-02

Pos logit contributions
ocol+0.068
 property+0.064
isher+0.062
asured+0.061
ities+0.061
Neg logit contributions
s-0.015
 they-0.014
 -0.012
 said-0.012
-0.012
Token around
Token ID1088
Feature activation+8.45e-03

Pos logit contributions
ocol+0.049
 property+0.045
ities+0.044
asured+0.044
isher+0.043
Neg logit contributions
s-0.020
 they-0.018
-0.017
 -0.016
 said-0.016
Token making
Token ID1642
Feature activation+1.50e-02

Pos logit contributions
 they+0.006
+0.005
s+0.005
 +0.005
 said+0.004
Neg logit contributions
ocol-0.027
 property-0.026
asured-0.026
beh-0.026
esty-0.026
Token aer
Token ID9551
Feature activation-6.87e-02

Pos logit contributions
s+0.018
 they+0.017
 said+0.016
 +0.016
+0.016
Neg logit contributions
ocol-0.038
 property-0.035
asured-0.034
isher-0.034
beh-0.034
Tokenopl
Token ID20106
Feature activation-9.31e-02

Pos logit contributions
ocol+0.061
 property+0.041
beh+0.039
asured+0.037
ities+0.032
Neg logit contributions
 they-0.207
 said-0.196
s-0.196
 -0.191
 exclaimed-0.189
Tokenane
Token ID1531
Feature activation+2.17e-02

Pos logit contributions
ocol+0.222
 property+0.198
beh+0.194
ities+0.191
asured+0.189
Neg logit contributions
 they-0.125
s-0.123
 said-0.122
 -0.120
 exclaimed-0.116
Token noises
Token ID26782
Feature activation+4.51e-02

Pos logit contributions
 they+0.028
 said+0.028
 +0.028
+0.027
 exclaimed+0.027
Neg logit contributions
ocol-0.047
 property-0.045
beh-0.044
asured-0.041
 appearance-0.040
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
s+0.036
 they+0.035
 said+0.031
+0.031
 +0.030
Neg logit contributions
ocol-0.135
 property-0.127
asured-0.122
isher-0.122
beh-0.121
Token Jack
Token ID3619
Feature activation+3.51e-02

Pos logit contributions
ocol+0.042
 property+0.040
asured+0.039
beh+0.039
isher+0.039
Neg logit contributions
s-0.004
 they-0.004
 -0.003
-0.003
 said-0.003
Token soon
Token ID2582
Feature activation+4.14e-02

Pos logit contributions
s+0.032
 they+0.030
 said+0.029
 +0.028
+0.028
Neg logit contributions
ocol-0.102
 property-0.094
beh-0.090
isher-0.090
ities-0.090
Token her
Token ID607
Feature activation-7.10e-02

Pos logit contributions
ocol+0.029
 property+0.027
asured+0.026
isher+0.026
beh+0.026
Neg logit contributions
 they-0.013
s-0.013
 -0.013
-0.013
 said-0.013
Token friend
Token ID1545
Feature activation+5.23e-02

Pos logit contributions
ocol+0.191
 property+0.170
beh+0.169
isher+0.164
asured+0.163
Neg logit contributions
s-0.091
 they-0.089
 said-0.086
-0.084
 -0.080
Token when
Token ID618
Feature activation-1.42e-02

Pos logit contributions
s+0.044
 they+0.043
 said+0.040
 +0.040
+0.040
Neg logit contributions
ocol-0.152
 property-0.142
asured-0.136
beh-0.136
isher-0.136
Token she
Token ID673
Feature activation+1.09e-02

Pos logit contributions
ocol+0.044
 property+0.041
beh+0.040
asured+0.039
isher+0.039
Neg logit contributions
s-0.008
 exclaimed-0.006
 said-0.006
 -0.006
-0.006
Token stumbled
Token ID24241
Feature activation-2.35e-02

Pos logit contributions
cakes+0.012
 they+0.011
 +0.010
s+0.010
 exclaimed+0.010
Neg logit contributions
ocol-0.023
ac-0.022
 property-0.021
asured-0.021
ra-0.021
Token upon
Token ID2402
Feature activation-8.29e-02

Pos logit contributions
ocol+0.045
 property+0.043
asured+0.041
ities+0.040
ac+0.040
Neg logit contributions
 they-0.037
s-0.035
 exclaimed-0.035
 said-0.035
 -0.034
Token a
Token ID257
Feature activation+6.69e-02

Pos logit contributions
ocol+0.203
 property+0.194
beh+0.182
ities+0.182
asured+0.178
Neg logit contributions
s-0.099
 exclaimed-0.091
 said-0.088
 they-0.086
 -0.083
Token strange
Token ID6283
Feature activation-3.86e-02

Pos logit contributions
s+0.066
 they+0.064
 said+0.061
 +0.061
+0.059
Neg logit contributions
ocol-0.182
 property-0.171
isher-0.160
asured-0.159
ities-0.159
Token object
Token ID2134
Feature activation-3.43e-02

Pos logit contributions
ocol+0.091
beh+0.084
asured+0.081
 property+0.080
ities+0.080
Neg logit contributions
s-0.053
 said-0.053
 they-0.052
-0.052
 -0.051
Token.
Token ID13
Feature activation-1.77e-02

Pos logit contributions
ocol+0.093
 property+0.086
asured+0.083
beh+0.082
isher+0.082
Neg logit contributions
s-0.036
 they-0.036
 said-0.033
-0.033
 -0.032
Token It
Token ID632
Feature activation+2.15e-03

Pos logit contributions
ocol+0.036
esty+0.033
imb+0.032
uzz+0.032
 property+0.032
Neg logit contributions
s-0.019
ves-0.019
cakes-0.018
 exclaimed-0.017
 they-0.017
Token while
Token ID981
Feature activation-5.59e-03

Pos logit contributions
ocol+0.025
tons+0.023
 property+0.023
ities+0.022
ant+0.022
Neg logit contributions
 exclaimed-0.010
s-0.010
cakes-0.009
 said-0.008
 -0.008
Token picking
Token ID10868
Feature activation-6.37e-02

Pos logit contributions
ocol+0.015
 property+0.013
ities+0.013
beh+0.013
asured+0.013
Neg logit contributions
s-0.006
 said-0.005
 they-0.005
 exclaimed-0.005
 -0.005
Token fresh
Token ID4713
Feature activation-5.60e-02

Pos logit contributions
ocol+0.163
 property+0.153
beh+0.149
asured+0.148
isher+0.146
Neg logit contributions
s-0.078
 they-0.076
-0.075
 -0.073
 said-0.073
Token flowers
Token ID12734
Feature activation-1.59e-02

Pos logit contributions
beh+0.150
 property+0.150
asured+0.149
ocol+0.145
ust+0.145
Neg logit contributions
s-0.063
 they-0.060
-0.057
 said-0.055
 -0.053
Token in
Token ID287
Feature activation-3.88e-02

Pos logit contributions
ocol+0.043
 property+0.040
asured+0.039
isher+0.039
beh+0.039
Neg logit contributions
s-0.017
 they-0.017
 said-0.015
-0.015
 -0.015
Token her
Token ID607
Feature activation-8.38e-02

Pos logit contributions
ocol+0.120
 property+0.114
beh+0.113
isher+0.112
ities+0.111
Neg logit contributions
-0.026
 they-0.026
 -0.025
s-0.023
 said-0.022
Token garden
Token ID11376
Feature activation-3.02e-02

Pos logit contributions
ocol+0.211
 property+0.190
beh+0.190
ities+0.187
asured+0.186
Neg logit contributions
s-0.109
 they-0.106
-0.105
 -0.103
 said-0.101
Token,
Token ID11
Feature activation-1.44e-03

Pos logit contributions
ocol+0.091
 property+0.085
beh+0.082
asured+0.082
isher+0.082
Neg logit contributions
s-0.023
 they-0.022
 said-0.020
 -0.020
-0.020
Token Pen
Token ID7507
Feature activation-5.64e-02

Pos logit contributions
ocol+0.004
 property+0.004
ities+0.004
asured+0.004
beh+0.004
Neg logit contributions
s-0.001
 said-0.001
 they-0.001
 exclaimed-0.001
 -0.001
Tokenelope
Token ID47329
Feature activation+1.95e-02

Pos logit contributions
ocol+0.105
 property+0.094
asured+0.087
beh+0.087
isher+0.087
Neg logit contributions
s-0.107
 they-0.105
 said-0.102
 -0.102
-0.101
Token heard
Token ID2982
Feature activation+7.99e-02

Pos logit contributions
s+0.015
 they+0.015
 +0.015
+0.014
cakes+0.014
Neg logit contributions
ocol-0.054
 property-0.052
asured-0.049
beh-0.048
ities-0.048
Token She
Token ID1375
Feature activation-2.98e-02

Pos logit contributions
ocol+0.046
 property+0.043
uff+0.042
esty+0.042
ities+0.042
Neg logit contributions
s-0.022
 exclaimed-0.022
 said-0.021
 they-0.021
cakes-0.021
Token tried
Token ID3088
Feature activation+3.76e-02

Pos logit contributions
ocol+0.071
 property+0.066
asured+0.065
isher+0.064
ities+0.063
Neg logit contributions
s-0.036
 they-0.035
 -0.034
-0.033
 exclaimed-0.031
Token to
Token ID284
Feature activation-2.27e-03

Pos logit contributions
s+0.030
 they+0.028
 said+0.028
 exclaimed+0.028
 +0.026
Neg logit contributions
ocol-0.104
 property-0.100
asured-0.097
ities-0.095
beh-0.093
Token open
Token ID1280
Feature activation-2.90e-02

Pos logit contributions
ocol+0.005
 property+0.005
beh+0.005
isher+0.005
ities+0.005
Neg logit contributions
 said-0.003
 they-0.003
s-0.003
-0.003
 -0.003
Token the
Token ID262
Feature activation+2.01e-02

Pos logit contributions
ocol+0.092
 property+0.089
beh+0.088
asured+0.085
ities+0.085
Neg logit contributions
 they-0.018
s-0.017
 -0.016
-0.016
 said-0.015
Token oy
Token ID35104
Feature activation-1.09e-01

Pos logit contributions
s+0.039
 they+0.038
 said+0.037
 +0.037
+0.037
Neg logit contributions
ocol-0.036
 property-0.032
beh-0.031
asured-0.031
isher-0.030
Tokenster
Token ID1706
Feature activation-2.50e-02

Pos logit contributions
ocol+0.214
 property+0.190
beh+0.189
asured+0.184
isher+0.175
Neg logit contributions
s-0.203
 they-0.193
 said-0.189
-0.188
 -0.186
Token with
Token ID351
Feature activation-1.48e-02

Pos logit contributions
ocol+0.079
 property+0.073
beh+0.073
asured+0.070
isher+0.070
Neg logit contributions
s-0.019
 they-0.017
 -0.017
-0.016
 said-0.016
Token her
Token ID607
Feature activation-7.70e-02

Pos logit contributions
ocol+0.041
 property+0.040
beh+0.039
asured+0.038
isher+0.038
Neg logit contributions
 they-0.015
 -0.014
s-0.014
-0.014
 said-0.014
Token fingers
Token ID9353
Feature activation-1.51e-02

Pos logit contributions
ocol+0.161
 property+0.146
beh+0.145
asured+0.138
isher+0.138
Neg logit contributions
s-0.136
 they-0.130
 -0.130
-0.129
 said-0.128
Token,
Token ID11
Feature activation-8.42e-03

Pos logit contributions
ocol+0.046
 property+0.043
beh+0.041
isher+0.041
asured+0.041
Neg logit contributions
s-0.012
 they-0.012
 said-0.011
 -0.011
-0.010
Token done
Token ID1760
Feature activation+9.58e-02

Pos logit contributions
ocol+0.100
 property+0.092
isher+0.091
beh+0.089
asured+0.088
Neg logit contributions
 said-0.042
 they-0.041
s-0.041
-0.039
 -0.039
Token?"
Token ID1701
Feature activation+9.25e-02

Pos logit contributions
s+0.098
 they+0.094
 +0.090
 said+0.089
+0.088
Neg logit contributions
ocol-0.261
 property-0.243
asured-0.231
beh-0.230
ities-0.230
Token Mom
Token ID11254
Feature activation+4.15e-02

Pos logit contributions
s+0.152
cakes+0.140
 +0.132
 exclaimed+0.131
 they+0.130
Neg logit contributions
ocol-0.187
 property-0.159
asured-0.148
ities-0.146
 appearance-0.144
Token shouted
Token ID17293
Feature activation+5.82e-02

Pos logit contributions
 they+0.038
s+0.037
cakes+0.034
+0.033
 +0.032
Neg logit contributions
ocol-0.105
asured-0.098
beh-0.095
 property-0.093
ities-0.093
Token.
Token ID13
Feature activation-2.42e-02

Pos logit contributions
s+0.057
 they+0.054
 +0.051
 said+0.051
 exclaimed+0.050
Neg logit contributions
ocol-0.157
 property-0.143
ities-0.139
asured-0.139
beh-0.138
Token "
Token ID366
Feature activation-8.92e-02

Pos logit contributions
ocol+0.055
 property+0.049
ities+0.048
uff+0.047
uzz+0.047
Neg logit contributions
s-0.030
 exclaimed-0.027
cakes-0.026
 they-0.025
 said-0.024
TokenYou
Token ID1639
Feature activation-5.63e-02

Pos logit contributions
ocol+0.199
 property+0.182
asured+0.173
isher+0.172
ities+0.169
Neg logit contributions
s-0.138
 they-0.136
-0.133
 said-0.130
 -0.129
Token ruined
Token ID20484
Feature activation+4.91e-03

Pos logit contributions
ocol+0.137
 property+0.127
beh+0.120
asured+0.120
isher+0.119
Neg logit contributions
s-0.073
 they-0.071
 -0.068
-0.068
 said-0.067
Token the
Token ID262
Feature activation+1.78e-02

Pos logit contributions
 they+0.006
+0.005
 +0.005
 said+0.005
s+0.005
Neg logit contributions
ocol-0.013
beh-0.012
asured-0.012
isher-0.012
 property-0.012
Token chocolate
Token ID11311
Feature activation-3.53e-02

Pos logit contributions
 they+0.022
 said+0.021
s+0.021
+0.021
 +0.020
Neg logit contributions
ocol-0.045
beh-0.042
asured-0.040
esty-0.040
 property-0.040
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
ocol+0.090
 property+0.085
isher+0.080
ities+0.080
asured+0.080
Neg logit contributions
s-0.038
 they-0.037
 -0.037
-0.036
 said-0.036
Token day
Token ID1110
Feature activation+4.52e-02

Pos logit contributions
 they+0.066
s+0.062
 said+0.062
+0.059
 +0.059
Neg logit contributions
ocol-0.202
 property-0.187
asured-0.182
beh-0.182
isher-0.182
Token,
Token ID11
Feature activation+4.21e-03

Pos logit contributions
s+0.043
 they+0.041
 +0.039
 said+0.039
+0.038
Neg logit contributions
ocol-0.125
 property-0.116
asured-0.111
beh-0.111
ities-0.111
Token a
Token ID257
Feature activation+6.84e-02

Pos logit contributions
 they+0.004
+0.004
 said+0.004
 +0.003
s+0.003
Neg logit contributions
ocol-0.013
 property-0.012
isher-0.011
beh-0.011
asured-0.011
Token group
Token ID1448
Feature activation+7.30e-03

Pos logit contributions
 they+0.074
s+0.074
 said+0.070
 +0.070
+0.070
Neg logit contributions
ocol-0.183
 property-0.168
beh-0.163
asured-0.163
ities-0.161
Token of
Token ID286
Feature activation-3.29e-02

Pos logit contributions
s+0.006
 +0.006
 they+0.006
 exclaimed+0.006
+0.006
Neg logit contributions
ocol-0.020
 property-0.019
isher-0.018
beh-0.018
asured-0.017
Token mis
Token ID2984
Feature activation-2.14e-01

Pos logit contributions
ocol+0.083
 property+0.081
asured+0.079
esty+0.077
ities+0.077
Neg logit contributions
 they-0.039
 said-0.038
-0.036
 -0.034
s-0.032
Tokenchie
Token ID3043
Feature activation-9.72e-02

Pos logit contributions
ocol+0.229
 property+0.191
ities+0.166
asured+0.163
 appearance+0.159
Neg logit contributions
s-0.562
 they-0.548
 said-0.540
 exclaimed-0.538
-0.538
Tokenvous
Token ID31222
Feature activation+1.38e-02

Pos logit contributions
ocol+0.161
 property+0.153
beh+0.151
asured+0.140
ac+0.139
Neg logit contributions
 they-0.186
s-0.183
 -0.180
-0.173
 said-0.171
Token mice
Token ID10693
Feature activation-3.39e-02

Pos logit contributions
 they+0.020
s+0.020
 said+0.019
+0.019
 +0.018
Neg logit contributions
ocol-0.032
 property-0.029
asured-0.029
isher-0.028
beh-0.028
Token came
Token ID1625
Feature activation-1.04e-02

Pos logit contributions
ocol+0.094
 property+0.086
isher+0.084
asured+0.084
beh+0.083
Neg logit contributions
 they-0.035
s-0.034
 said-0.034
-0.032
 -0.031
Token by
Token ID416
Feature activation-5.29e-02

Pos logit contributions
ocol+0.032
isher+0.029
ities+0.028
 property+0.028
asured+0.027
Neg logit contributions
s-0.010
 they-0.009
-0.009
 said-0.008
 -0.008
Token enjoyed
Token ID8359
Feature activation+1.68e-02

Pos logit contributions
 they+0.002
s+0.002
 said+0.002
 +0.002
+0.002
Neg logit contributions
ocol-0.007
 property-0.007
ities-0.006
asured-0.006
beh-0.006
Token her
Token ID607
Feature activation-6.39e-02

Pos logit contributions
 they+0.016
 +0.014
+0.014
s+0.014
 said+0.013
Neg logit contributions
ocol-0.049
 property-0.047
isher-0.045
beh-0.045
ities-0.044
Token cool
Token ID3608
Feature activation-8.57e-03

Pos logit contributions
ocol+0.158
 property+0.146
beh+0.139
asured+0.139
ities+0.138
Neg logit contributions
s-0.082
 they-0.081
 -0.077
 said-0.077
-0.076
Token,
Token ID11
Feature activation+3.76e-03

Pos logit contributions
ocol+0.022
 property+0.021
beh+0.021
ities+0.021
esty+0.020
Neg logit contributions
s-0.011
 they-0.010
 -0.009
-0.009
 said-0.009
Token ch
Token ID442
Feature activation-5.39e-02

Pos logit contributions
 they+0.005
 said+0.005
 +0.004
s+0.004
+0.004
Neg logit contributions
ocol-0.010
 property-0.009
isher-0.009
ities-0.009
beh-0.009
Tokenocol
Token ID4668
Feature activation-2.78e-01

Pos logit contributions
 property+0.081
esty+0.066
beh+0.064
 appearance+0.063
nam+0.061
Neg logit contributions
s-0.120
 they-0.110
 said-0.107
 exclaimed-0.104
-0.102
Tokenat
Token ID265
Feature activation+4.78e-03

Pos logit contributions
 property+0.380
ocol+0.365
 appearance+0.350
esty+0.314
beh+0.311
Neg logit contributions
s-0.621
-0.576
 said-0.571
 exclaimed-0.561
 they-0.560
Tokeny
Token ID88
Feature activation-1.19e-02

Pos logit contributions
s+0.006
 they+0.006
 said+0.006
 +0.006
+0.006
Neg logit contributions
ocol-0.012
 property-0.011
beh-0.010
asured-0.010
ities-0.010
Token treat
Token ID2190
Feature activation+1.48e-02

Pos logit contributions
ocol+0.032
 property+0.030
beh+0.029
asured+0.029
isher+0.029
Neg logit contributions
s-0.014
 they-0.013
 said-0.013
 -0.012
-0.012
Token!
Token ID0
Feature activation-4.77e-03

Pos logit contributions
 they+0.017
s+0.016
 said+0.015
+0.014
 +0.014
Neg logit contributions
ocol-0.041
 property-0.039
asured-0.037
beh-0.037
isher-0.037
Token
Token ID198
Feature activation+2.27e-02

Pos logit contributions
ocol+0.016
 property+0.015
asured+0.015
beh+0.014
isher+0.014
Neg logit contributions
s-0.002
 they-0.002
 -0.001
 said-0.001
-0.001
Token There
Token ID1318
Feature activation-2.75e-02

Pos logit contributions
ocol+0.031
 property+0.028
asured+0.027
isher+0.026
beh+0.026
Neg logit contributions
s-0.025
 they-0.025
 -0.025
-0.025
 said-0.024
Token's
Token ID338
Feature activation-5.75e-02

Pos logit contributions
ocol+0.074
 property+0.072
beh+0.066
isher+0.066
asured+0.066
Neg logit contributions
 they-0.022
s-0.022
 exclaimed-0.022
 -0.021
-0.021
Token a
Token ID257
Feature activation+5.44e-02

Pos logit contributions
ocol+0.155
 property+0.140
beh+0.139
isher+0.138
asured+0.136
Neg logit contributions
 they-0.063
s-0.061
 -0.061
-0.060
 said-0.060
Token cool
Token ID3608
Feature activation-1.85e-02

Pos logit contributions
s+0.059
 they+0.058
 +0.055
 said+0.054
+0.054
Neg logit contributions
ocol-0.142
 property-0.133
asured-0.124
isher-0.124
ities-0.123
Token re
Token ID302
Feature activation-1.44e-01

Pos logit contributions
ocol+0.044
beh+0.041
 property+0.040
ities+0.039
asured+0.039
Neg logit contributions
s-0.026
 they-0.025
-0.024
 said-0.024
 -0.024
Tokenind
Token ID521
Feature activation-1.95e-01

Pos logit contributions
ocol+0.083
 property+0.055
asured+0.039
beh+0.039
ities+0.038
Neg logit contributions
s-0.459
 they-0.451
 said-0.444
 -0.443
-0.443
Tokeneer
Token ID28153
Feature activation-1.68e-02

Pos logit contributions
ocol+0.271
 property+0.233
beh+0.220
asured+0.213
ities+0.210
Neg logit contributions
s-0.465
 they-0.450
-0.443
 said-0.442
 -0.431
Token in
Token ID287
Feature activation-4.24e-02

Pos logit contributions
ocol+0.039
 property+0.036
asured+0.034
beh+0.034
isher+0.034
Neg logit contributions
s-0.024
 they-0.023
 said-0.022
 -0.022
-0.022
Token our
Token ID674
Feature activation-7.42e-02

Pos logit contributions
ocol+0.105
 property+0.096
asured+0.092
beh+0.092
isher+0.091
Neg logit contributions
s-0.055
 they-0.054
 -0.051
 said-0.051
-0.051
Token yard
Token ID12699
Feature activation-2.48e-02

Pos logit contributions
ocol+0.191
beh+0.172
ities+0.169
ant+0.166
ubb+0.164
Neg logit contributions
s-0.099
 they-0.094
 -0.093
-0.093
 said-0.090
Token!"
Token ID2474
Feature activation+9.25e-02

Pos logit contributions
ocol+0.065
 property+0.061
asured+0.058
beh+0.058
isher+0.058
Neg logit contributions
s-0.027
 they-0.026
 -0.025
 said-0.025
-0.024
Token mom
Token ID1995
Feature activation+4.28e-02

Pos logit contributions
ocol+0.268
beh+0.238
 property+0.238
isher+0.230
ities+0.230
Neg logit contributions
 they-0.100
s-0.099
 said-0.097
 -0.094
-0.092
Token told
Token ID1297
Feature activation+5.53e-02

Pos logit contributions
 they+0.037
s+0.037
 +0.035
+0.035
cakes+0.033
Neg logit contributions
ocol-0.107
asured-0.104
 property-0.103
isher-0.102
beh-0.101
Token him
Token ID683
Feature activation-4.00e-02

Pos logit contributions
 they+0.080
s+0.078
 said+0.076
 +0.076
+0.076
Neg logit contributions
ocol-0.129
 property-0.119
asured-0.114
ities-0.113
isher-0.113
Token to
Token ID284
Feature activation-7.52e-03

Pos logit contributions
ocol+0.126
 property+0.115
isher+0.113
asured+0.112
beh+0.111
Neg logit contributions
 they-0.030
s-0.028
 -0.024
 said-0.024
-0.024
Token stop
Token ID2245
Feature activation+1.75e-02

Pos logit contributions
ocol+0.021
 property+0.018
beh+0.018
isher+0.018
ities+0.017
Neg logit contributions
 said-0.009
 they-0.009
 -0.009
-0.009
s-0.008
Token mis
Token ID2984
Feature activation-1.92e-01

Pos logit contributions
 they+0.019
+0.016
 said+0.016
s+0.016
 +0.015
Neg logit contributions
ocol-0.051
 property-0.048
beh-0.048
esty-0.048
isher-0.048
Tokenbeh
Token ID20709
Feature activation-1.39e-01

Pos logit contributions
ocol+0.274
 property+0.225
ities+0.209
asured+0.203
isher+0.202
Neg logit contributions
s-0.466
 said-0.436
 exclaimed-0.436
 they-0.434
-0.433
Tokenaving
Token ID2703
Feature activation-5.60e-03

Pos logit contributions
ocol+0.248
 property+0.221
ities+0.206
isher+0.195
asured+0.195
Neg logit contributions
s-0.284
 -0.269
-0.264
 said-0.264
 exclaimed-0.258
Token and
Token ID290
Feature activation+9.89e-03

Pos logit contributions
ocol+0.019
 property+0.017
asured+0.017
isher+0.017
ities+0.017
Neg logit contributions
 they-0.004
s-0.004
-0.003
 -0.003
 said-0.003
Token to
Token ID284
Feature activation-1.10e-02

Pos logit contributions
 they+0.009
 said+0.008
s+0.008
 +0.007
+0.007
Neg logit contributions
ocol-0.030
 property-0.028
ities-0.028
isher-0.027
asured-0.027
Token be
Token ID307
Feature activation-4.05e-02

Pos logit contributions
ocol+0.030
isher+0.027
ities+0.026
 property+0.026
beh+0.025
Neg logit contributions
 they-0.014
 said-0.013
 -0.013
-0.013
 finally-0.011
Token a
Token ID257
Feature activation+7.40e-02

Pos logit contributions
ocol+0.091
beh+0.085
isher+0.083
asured+0.082
ities+0.081
Neg logit contributions
 they-0.024
-0.018
 -0.017
 said-0.016
 she-0.015
Token big
Token ID1263
Feature activation+2.82e-02

Pos logit contributions
 they+0.083
 said+0.082
 +0.081
s+0.080
+0.080
Neg logit contributions
ocol-0.199
beh-0.190
asured-0.182
 property-0.180
ities-0.178
Token box
Token ID3091
Feature activation+1.14e-02

Pos logit contributions
 said+0.029
 they+0.029
 +0.028
+0.026
s+0.025
Neg logit contributions
beh-0.080
ocol-0.077
asured-0.074
ant-0.072
ities-0.072
Token of
Token ID286
Feature activation-4.17e-02

Pos logit contributions
s+0.012
 they+0.011
 +0.011
 said+0.011
+0.011
Neg logit contributions
ocol-0.031
 property-0.029
asured-0.027
isher-0.027
beh-0.027
Token ch
Token ID442
Feature activation-5.90e-02

Pos logit contributions
beh+0.102
asured+0.099
esty+0.099
ant+0.096
ities+0.095
Neg logit contributions
 said-0.057
 they-0.056
 -0.052
-0.051
 big-0.048
Tokenocol
Token ID4668
Feature activation-2.82e-01

Pos logit contributions
 property+0.044
asured+0.038
tons+0.038
beh+0.037
esty+0.034
Neg logit contributions
s-0.161
ons-0.153
 they-0.148
 of-0.146
-0.146
Tokenates
Token ID689
Feature activation-1.02e-02

Pos logit contributions
ocol+0.345
 property+0.325
asured+0.285
 appearance+0.283
beh+0.282
Neg logit contributions
s-0.668
-0.639
 said-0.634
 they-0.631
 -0.629
Token!
Token ID0
Feature activation-8.76e-03

Pos logit contributions
ocol+0.030
 property+0.028
beh+0.028
asured+0.027
isher+0.027
Neg logit contributions
 they-0.009
s-0.009
 said-0.008
-0.007
 -0.007
Token
Token ID220
Feature activation+3.70e-02

Pos logit contributions
ocol+0.021
 property+0.019
beh+0.018
asured+0.018
isher+0.018
Neg logit contributions
s-0.011
 they-0.011
 said-0.011
 exclaimed-0.010
 -0.010
Token
Token ID198
Feature activation+2.29e-02

Pos logit contributions
s+0.016
 they+0.015
 said+0.014
 exclaimed+0.013
cakes+0.011
Neg logit contributions
ocol-0.110
 property-0.106
uzz-0.104
uff-0.103
asured-0.102
Token
Token ID198
Feature activation+1.62e-02

Pos logit contributions
cakes+0.027
 timer+0.025
 they+0.023
 film+0.020
 replied+0.020
Neg logit contributions
uzz-0.046
 yourself-0.042
uff-0.040
imb-0.040
ocol-0.040