TOP ACTIVATIONS
MAX = 0.282

She pushed and tug ged harder than ever , but still
so sad and tried even harder . But after
Daddy took turns rolling the dice and moving their pieces around
L ily and Ben squeezed harder and harder . They did
and Ben squeezed harder and harder . They did not notice
move . He tug ged harder and harder , until finally
He tug ged harder and harder , until finally , it
happened . She rubbed it harder . Still nothing . She
away . He rolled the dice and moved his pieces around
the wind started to blow harder and harder . Suddenly ,
Suddenly , the wind blew harder , and it got cold
started to blow harder and harder . Suddenly , Lily 's
clay . He shook it harder and harder and the sparks
realized that it was much harder than it looked . He
a new puzzle that was harder and more fun .
to soar . They soared higher in the sky than the
He shook it harder and harder and the sparks flew more
long jump . He jumped higher and higher and everyone was
came and he rolled the dice . He moved his piece
and courage , Dave worked harder . In the end ,

MIN ACTIVATIONS
MIN = -0.352

with nice dogs that have owners ," the man says .
dogs . They love their owners . They are happy .
and return them to their owners . <|endoftext|> Once upon a
to have you as their owners . You know what ?
or angry . And some owners are not careful . They
with the birds , not owners . They learn to strike
If she did , the owners would be very angry .
does not know why his owners are upset . He thinks
! The owners wondered what was wrong .
in the sun . The owners loved their car very much
said . The owners smiled and thanked Sally for
" Sure ," the owners said . So
how much he loved his owners , so he ob eyed
One day , his owners took him for a long
and Jill became the excited owners of the rabbit . They
walks and playing with her owners . She always dream t
beautiful car cover . The owners be amed with joy ,
and how to ask their owners if it was okay to
The pup even followed the owners into a small cave .
keep up . When the owners said " Stop ," the

INTERVAL 0.124 - 0.282
CONTAINS 5.323%

and found a l ily pad that was just the right
hard . He was very sad . One day
curious , so she went closer . When she got to
Molly enjoyed their dinner . As they ate , Molly thought
am exploring . Do you want to come with me ?"

INTERVAL -0.035 - 0.124
CONTAINS 61.572%

in the oven and waited . When the cake
Tim my sat in the chair and the bar ber started
and Spot were happy and relieved . They opened the door
fun with her friends . One day , Lily 's mom
notebook . But when she got to school , she did

INTERVAL -0.194 - -0.035
CONTAINS 32.145%

Later that day , Lily 's grandma came to visit .
Hello , kids . Are you OK ? Did you see
anymore . I want to go outside and see the pigeon
lov ingly . " That 's so adventurous of you !"
. Jenny smiled as she knew what she was supposed to

INTERVAL -0.352 - -0.194
CONTAINS 0.959%

One day , Tom 's mom said he had to go
's so pretty ," said mom my . L
it ?" Her mom replied , " No ,
. One day , his mom made an unusual sauce for
her . Her mom took out a big white
Token She
Token ID1375
Feature activation-2.06e-02

Pos logit contributions
ues+0.005
 handful+0.005
 Whoever+0.005
 closely+0.005
suits+0.005
Neg logit contributions
 parent-0.010
 owners-0.009
 parents-0.009
father-0.008
 guardian-0.008
Token pushed
Token ID7121
Feature activation+9.69e-02

Pos logit contributions
 parent+0.042
 owners+0.039
father+0.036
 guardian+0.036
 tou+0.036
Neg logit contributions
 closely-0.030
ues-0.030
 handful-0.029
 sip-0.028
suits-0.028
Token and
Token ID290
Feature activation+8.04e-02

Pos logit contributions
ues+0.187
suits+0.181
 handful+0.178
suit+0.175
ows+0.173
Neg logit contributions
 parent-0.161
 owners-0.143
 parents-0.137
 family-0.126
 aer-0.124
Token tug
Token ID27762
Feature activation+1.93e-01

Pos logit contributions
ues+0.116
 handful+0.108
suits+0.108
inn+0.104
 closely+0.103
Neg logit contributions
 parent-0.168
 owners-0.153
 parents-0.147
 je-0.140
 family-0.140
Tokenged
Token ID2004
Feature activation+1.71e-01

Pos logit contributions
ues+0.295
suit+0.280
suits+0.276
 sip+0.274
 handful+0.271
Neg logit contributions
 parent-0.351
 owners-0.326
 parents-0.320
 family-0.299
 aer-0.298
Token harder
Token ID7069
Feature activation+2.82e-01

Pos logit contributions
ues+0.238
suits+0.229
 handful+0.223
suit+0.221
 sip+0.211
Neg logit contributions
 parent-0.366
 owners-0.340
 parents-0.327
 family-0.307
father-0.305
Token than
Token ID621
Feature activation+4.50e-02

Pos logit contributions
ues+0.424
 handful+0.420
suits+0.409
 sip+0.394
 Whoever+0.394
Neg logit contributions
 parent-0.569
 owners-0.520
 parents-0.492
 family-0.471
 guardian-0.466
Token ever
Token ID1683
Feature activation-1.66e-01

Pos logit contributions
 handful+0.063
 closely+0.063
ues+0.062
 harder+0.060
 Whoever+0.060
Neg logit contributions
 parent-0.095
 owners-0.087
 parents-0.082
 guardian-0.080
 tou-0.080
Token,
Token ID11
Feature activation+2.99e-02

Pos logit contributions
 parent+0.304
 owners+0.282
 parents+0.266
 family+0.255
 guardian+0.254
Neg logit contributions
ues-0.277
 handful-0.270
suits-0.264
inn-0.253
 Whoever-0.253
Token but
Token ID475
Feature activation+1.93e-01

Pos logit contributions
ues+0.025
 handful+0.025
suits+0.023
 closely+0.022
inn+0.022
Neg logit contributions
 parent-0.080
 owners-0.075
 parents-0.072
 guardian-0.070
 family-0.069
Token still
Token ID991
Feature activation-2.18e-02

Pos logit contributions
 closely+0.252
 harder+0.245
 handful+0.240
 sip+0.232
 Whoever+0.232
Neg logit contributions
 parent-0.423
 owners-0.381
father-0.367
 tou-0.363
 guardian-0.360
Token so
Token ID523
Feature activation+5.67e-02

Pos logit contributions
 parent+0.026
 owners+0.024
 parents+0.023
father+0.022
 guardian+0.022
Neg logit contributions
 handful-0.017
ues-0.017
 closely-0.017
suits-0.016
 sip-0.016
Token sad
Token ID6507
Feature activation+1.42e-01

Pos logit contributions
ues+0.080
 handful+0.080
 closely+0.077
suits+0.076
 sip+0.074
Neg logit contributions
 parent-0.120
 owners-0.110
 parents-0.104
father-0.101
 guardian-0.100
Token and
Token ID290
Feature activation+7.53e-02

Pos logit contributions
ues+0.178
 handful+0.177
 closely+0.172
suits+0.165
 sip+0.164
Neg logit contributions
 parent-0.322
 owners-0.296
 parents-0.282
father-0.275
 guardian-0.274
Token tried
Token ID3088
Feature activation+1.30e-01

Pos logit contributions
 closely+0.103
 handful+0.100
 slim+0.098
 sip+0.098
 harder+0.096
Neg logit contributions
 parent-0.158
 owners-0.145
father-0.144
mother-0.138
 guardian-0.137
Token even
Token ID772
Feature activation-2.80e-02

Pos logit contributions
 handful+0.199
 closely+0.194
 harder+0.191
 sip+0.189
ues+0.185
Neg logit contributions
 parent-0.257
 owners-0.231
father-0.223
 pilot-0.222
 tou-0.220
Token harder
Token ID7069
Feature activation+2.78e-01

Pos logit contributions
 parent+0.058
 owners+0.051
 parents+0.050
 family+0.049
 pilot+0.049
Neg logit contributions
ues-0.042
suits-0.041
ows-0.039
suit-0.039
 Whoever-0.037
Token.
Token ID13
Feature activation+1.47e-02

Pos logit contributions
 handful+0.392
ues+0.390
 closely+0.382
suits+0.368
 sip+0.361
Neg logit contributions
 parent-0.588
 owners-0.533
 parents-0.505
 guardian-0.494
 tou-0.492
Token
Token ID198
Feature activation-3.72e-02

Pos logit contributions
ues+0.019
 handful+0.019
 closely+0.018
suits+0.018
 Whoever+0.018
Neg logit contributions
 parent-0.033
 owners-0.030
 parents-0.029
 guardian-0.027
father-0.027
Token
Token ID198
Feature activation-3.86e-02

Pos logit contributions
 parent+0.071
 owners+0.065
 parents+0.062
 family+0.059
father+0.058
Neg logit contributions
ues-0.060
 handful-0.060
suits-0.057
 closely-0.056
suit-0.056
TokenBut
Token ID1537
Feature activation+8.90e-02

Pos logit contributions
 parent+0.085
 owners+0.078
 parents+0.075
 guardian+0.072
father+0.072
Neg logit contributions
ues-0.051
 handful-0.051
 closely-0.048
suits-0.048
 Whoever-0.047
Token after
Token ID706
Feature activation+8.77e-02

Pos logit contributions
 handful+0.104
 closely+0.101
 harder+0.096
 sip+0.093
ues+0.093
Neg logit contributions
 parent-0.208
 owners-0.192
 tou-0.183
father-0.180
 parents-0.179
Token Daddy
Token ID26926
Feature activation-8.12e-02

Pos logit contributions
 handful+0.112
 sip+0.098
 spaghetti+0.097
 slim+0.096
 harder+0.096
Neg logit contributions
 owners-0.101
 parent-0.098
 guardian-0.097
mother-0.096
 tou-0.095
Token took
Token ID1718
Feature activation+1.25e-02

Pos logit contributions
 parent+0.157
 owners+0.143
 parents+0.135
 guardian+0.132
 tou+0.130
Neg logit contributions
 handful-0.129
ues-0.126
suits-0.125
 closely-0.123
 Whoever-0.123
Token turns
Token ID4962
Feature activation+1.60e-01

Pos logit contributions
ues+0.020
 handful+0.019
suits+0.019
 closely+0.019
suit+0.019
Neg logit contributions
 parent-0.024
 owners-0.022
 parents-0.021
 family-0.020
 je-0.019
Token rolling
Token ID10708
Feature activation+6.57e-02

Pos logit contributions
 handful+0.308
suits+0.306
ues+0.305
ishy+0.302
suit+0.299
Neg logit contributions
 parent-0.242
 owners-0.215
 parents-0.204
 family-0.196
 je-0.187
Token the
Token ID262
Feature activation-3.58e-02

Pos logit contributions
ues+0.143
 appet+0.138
ishy+0.138
ows+0.136
 sip+0.136
Neg logit contributions
 parent-0.083
 owners-0.073
 parents-0.072
 aer-0.068
 family-0.064
Token dice
Token ID17963
Feature activation+2.77e-01

Pos logit contributions
 parent+0.057
 aer+0.053
 parents+0.052
 family+0.051
 je+0.050
Neg logit contributions
ues-0.065
ows-0.062
rily-0.062
 sip-0.061
rant-0.061
Token and
Token ID290
Feature activation+6.46e-02

Pos logit contributions
 handful+0.403
ues+0.397
suits+0.380
 sip+0.379
inn+0.374
Neg logit contributions
 parent-0.555
 owners-0.509
 parents-0.489
 family-0.467
 guardian-0.461
Token moving
Token ID3867
Feature activation+4.13e-02

Pos logit contributions
ues+0.107
 handful+0.106
suits+0.102
 closely+0.101
 sip+0.100
Neg logit contributions
 parent-0.121
 owners-0.109
 parents-0.104
 family-0.099
 je-0.098
Token their
Token ID511
Feature activation-9.99e-02

Pos logit contributions
ishy+0.067
 sip+0.067
ues+0.067
 handful+0.067
suits+0.066
Neg logit contributions
 parent-0.072
 owners-0.067
 parents-0.065
 aer-0.061
 family-0.061
Token pieces
Token ID5207
Feature activation+4.90e-02

Pos logit contributions
 parent+0.207
 owners+0.190
 parents+0.186
 family+0.178
 je+0.174
Neg logit contributions
ues-0.146
 handful-0.143
inn-0.136
 sip-0.134
 Whoever-0.132
Token around
Token ID1088
Feature activation+9.52e-02

Pos logit contributions
ues+0.077
 handful+0.075
suits+0.074
 Whoever+0.073
ensive+0.071
Neg logit contributions
 parent-0.092
 owners-0.083
 parents-0.080
 family-0.077
 guardian-0.076
TokenL
Token ID43
Feature activation+4.49e-02

Pos logit contributions
 parent+0.024
 owners+0.021
 parents+0.020
 guardian+0.019
father+0.019
Neg logit contributions
ues-0.020
 handful-0.020
 closely-0.019
suits-0.019
 Whoever-0.018
Tokenily
Token ID813
Feature activation+1.21e-01

Pos logit contributions
ues+0.047
inn+0.045
 handful+0.045
sc+0.042
 sip+0.042
Neg logit contributions
 parent-0.111
 owners-0.103
 parents-0.098
 tou-0.097
 guardian-0.096
Token and
Token ID290
Feature activation+9.06e-02

Pos logit contributions
 handful+0.174
ues+0.172
suits+0.167
 closely+0.166
 sip+0.166
Neg logit contributions
 parent-0.249
 owners-0.228
 parents-0.213
 guardian-0.213
 tou-0.212
Token Ben
Token ID3932
Feature activation+1.45e-01

Pos logit contributions
 handful+0.065
 sip+0.061
 harder+0.056
 closely+0.056
suits+0.053
Neg logit contributions
 parent-0.232
 guardian-0.228
 owners-0.227
father-0.218
 tou-0.218
Token squeezed
Token ID29650
Feature activation+6.40e-02

Pos logit contributions
suits+0.204
 sip+0.203
ows+0.203
 handful+0.197
 closely+0.197
Neg logit contributions
 parent-0.294
father-0.271
 owners-0.268
 guardian-0.261
mother-0.259
Token harder
Token ID7069
Feature activation+2.75e-01

Pos logit contributions
ues+0.113
 handful+0.108
suits+0.106
 Whoever+0.106
 closely+0.104
Neg logit contributions
 parent-0.115
 owners-0.104
 parents-0.100
 family-0.094
mother-0.091
Token and
Token ID290
Feature activation+8.63e-02

Pos logit contributions
ues+0.354
 handful+0.353
 closely+0.335
suits+0.335
 Whoever+0.327
Neg logit contributions
 parent-0.616
 owners-0.566
 parents-0.540
 guardian-0.519
father-0.517
Token harder
Token ID7069
Feature activation+2.75e-01

Pos logit contributions
ues+0.129
 handful+0.128
suits+0.124
 closely+0.121
inn+0.121
Neg logit contributions
 parent-0.175
 owners-0.160
 parents-0.152
 family-0.144
 guardian-0.144
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
ues+0.383
 handful+0.379
 closely+0.368
suits+0.357
 sip+0.351
Neg logit contributions
 parent-0.590
 owners-0.539
 parents-0.511
 guardian-0.498
father-0.496
Token They
Token ID1119
Feature activation+2.05e-02

Pos logit contributions
 handful+0.031
ues+0.030
suits+0.030
ishy+0.029
sc+0.029
Neg logit contributions
 parent-0.042
 owners-0.038
 parents-0.037
 babys-0.035
 family-0.035
Token did
Token ID750
Feature activation+1.17e-01

Pos logit contributions
 sip+0.028
 handful+0.028
 closely+0.027
suits+0.027
ues+0.026
Neg logit contributions
 parent-0.044
 owners-0.040
father-0.038
 guardian-0.038
 parents-0.038
Token and
Token ID290
Feature activation+9.06e-02

Pos logit contributions
 handful+0.174
ues+0.172
suits+0.167
 closely+0.166
 sip+0.166
Neg logit contributions
 parent-0.249
 owners-0.228
 parents-0.213
 guardian-0.213
 tou-0.212
Token Ben
Token ID3932
Feature activation+1.45e-01

Pos logit contributions
 handful+0.065
 sip+0.061
 harder+0.056
 closely+0.056
suits+0.053
Neg logit contributions
 parent-0.232
 guardian-0.228
 owners-0.227
father-0.218
 tou-0.218
Token squeezed
Token ID29650
Feature activation+6.40e-02

Pos logit contributions
suits+0.204
 sip+0.203
ows+0.203
 handful+0.197
 closely+0.197
Neg logit contributions
 parent-0.294
father-0.271
 owners-0.268
 guardian-0.261
mother-0.259
Token harder
Token ID7069
Feature activation+2.75e-01

Pos logit contributions
ues+0.113
 handful+0.108
suits+0.106
 Whoever+0.106
 closely+0.104
Neg logit contributions
 parent-0.115
 owners-0.104
 parents-0.100
 family-0.094
mother-0.091
Token and
Token ID290
Feature activation+8.63e-02

Pos logit contributions
ues+0.354
 handful+0.353
 closely+0.335
suits+0.335
 Whoever+0.327
Neg logit contributions
 parent-0.616
 owners-0.566
 parents-0.540
 guardian-0.519
father-0.517
Token harder
Token ID7069
Feature activation+2.75e-01

Pos logit contributions
ues+0.129
 handful+0.128
suits+0.124
 closely+0.121
inn+0.121
Neg logit contributions
 parent-0.175
 owners-0.160
 parents-0.152
 family-0.144
 guardian-0.144
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
ues+0.383
 handful+0.379
 closely+0.368
suits+0.357
 sip+0.351
Neg logit contributions
 parent-0.590
 owners-0.539
 parents-0.511
 guardian-0.498
father-0.496
Token They
Token ID1119
Feature activation+2.05e-02

Pos logit contributions
 handful+0.031
ues+0.030
suits+0.030
ishy+0.029
sc+0.029
Neg logit contributions
 parent-0.042
 owners-0.038
 parents-0.037
 babys-0.035
 family-0.035
Token did
Token ID750
Feature activation+1.17e-01

Pos logit contributions
 sip+0.028
 handful+0.028
 closely+0.027
suits+0.027
ues+0.026
Neg logit contributions
 parent-0.044
 owners-0.040
father-0.038
 guardian-0.038
 parents-0.038
Token not
Token ID407
Feature activation+4.48e-02

Pos logit contributions
suits+0.266
suit+0.262
inn+0.257
sc+0.255
ishy+0.252
Neg logit contributions
 parent-0.150
 owners-0.133
 parents-0.130
 family-0.113
 aunt-0.110
Token notice
Token ID4003
Feature activation+1.53e-01

Pos logit contributions
ues+0.061
 handful+0.061
suits+0.058
 closely+0.058
 Whoever+0.056
Neg logit contributions
 parent-0.098
 owners-0.089
 parents-0.085
 family-0.082
 guardian-0.082
Token move
Token ID1445
Feature activation+6.09e-02

Pos logit contributions
 parent+0.026
 owners+0.024
 parents+0.022
father+0.021
mother+0.021
Neg logit contributions
 handful-0.027
 closely-0.027
ues-0.026
 sip-0.026
 harder-0.026
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
ues+0.093
 handful+0.092
suits+0.088
 closely+0.088
 Whoever+0.086
Neg logit contributions
 parent-0.121
 owners-0.111
 parents-0.105
 guardian-0.100
 family-0.100
Token He
Token ID679
Feature activation+7.04e-04

Pos logit contributions
ues+0.017
 handful+0.017
 closely+0.016
suits+0.016
 sip+0.016
Neg logit contributions
 parent-0.028
 owners-0.026
 parents-0.025
 guardian-0.024
 family-0.023
Token tug
Token ID27762
Feature activation+1.86e-01

Pos logit contributions
 closely+0.001
 handful+0.001
ues+0.001
 sip+0.001
 Whoever+0.001
Neg logit contributions
 parent-0.001
 owners-0.001
 parents-0.001
father-0.001
 guardian-0.001
Tokenged
Token ID2004
Feature activation+1.64e-01

Pos logit contributions
ues+0.247
ows+0.221
 sip+0.220
suits+0.217
suit+0.217
Neg logit contributions
 parent-0.380
 parents-0.353
 owners-0.351
pa-0.341
father-0.334
Token harder
Token ID7069
Feature activation+2.75e-01

Pos logit contributions
ues+0.250
suits+0.236
suit+0.229
ows+0.222
 handful+0.219
Neg logit contributions
 parent-0.333
 owners-0.309
 parents-0.303
 family-0.283
father-0.281
Token and
Token ID290
Feature activation+7.19e-02

Pos logit contributions
ues+0.386
 handful+0.377
suits+0.373
 sip+0.362
suit+0.359
Neg logit contributions
 parent-0.579
 owners-0.534
 parents-0.511
 family-0.490
father-0.486
Token harder
Token ID7069
Feature activation+2.75e-01

Pos logit contributions
ues+0.075
suits+0.074
 handful+0.068
ows+0.067
suit+0.065
Neg logit contributions
 parent-0.177
 owners-0.166
 parents-0.161
 guardian-0.156
father-0.155
Token,
Token ID11
Feature activation+2.17e-02

Pos logit contributions
ues+0.387
 handful+0.386
suits+0.369
 closely+0.369
 Whoever+0.359
Neg logit contributions
 parent-0.580
 owners-0.530
 parents-0.504
 guardian-0.484
father-0.482
Token until
Token ID1566
Feature activation+1.45e-01

Pos logit contributions
ues+0.019
 handful+0.018
suits+0.018
 sip+0.016
suit+0.016
Neg logit contributions
 parent-0.058
 owners-0.054
 parents-0.052
 guardian-0.050
 family-0.050
Token finally
Token ID3443
Feature activation+1.84e-02

Pos logit contributions
 handful+0.179
 harder+0.170
 closely+0.168
inn+0.163
 sip+0.163
Neg logit contributions
 parent-0.337
 owners-0.310
 guardian-0.291
father-0.290
 tou-0.288
Token He
Token ID679
Feature activation+7.04e-04

Pos logit contributions
ues+0.017
 handful+0.017
 closely+0.016
suits+0.016
 sip+0.016
Neg logit contributions
 parent-0.028
 owners-0.026
 parents-0.025
 guardian-0.024
 family-0.023
Token tug
Token ID27762
Feature activation+1.86e-01

Pos logit contributions
 closely+0.001
 handful+0.001
ues+0.001
 sip+0.001
 Whoever+0.001
Neg logit contributions
 parent-0.001
 owners-0.001
 parents-0.001
father-0.001
 guardian-0.001
Tokenged
Token ID2004
Feature activation+1.64e-01

Pos logit contributions
ues+0.247
ows+0.221
 sip+0.220
suits+0.217
suit+0.217
Neg logit contributions
 parent-0.380
 parents-0.353
 owners-0.351
pa-0.341
father-0.334
Token harder
Token ID7069
Feature activation+2.75e-01

Pos logit contributions
ues+0.250
suits+0.236
suit+0.229
ows+0.222
 handful+0.219
Neg logit contributions
 parent-0.333
 owners-0.309
 parents-0.303
 family-0.283
father-0.281
Token and
Token ID290
Feature activation+7.19e-02

Pos logit contributions
ues+0.386
 handful+0.377
suits+0.373
 sip+0.362
suit+0.359
Neg logit contributions
 parent-0.579
 owners-0.534
 parents-0.511
 family-0.490
father-0.486
Token harder
Token ID7069
Feature activation+2.75e-01

Pos logit contributions
ues+0.075
suits+0.074
 handful+0.068
ows+0.067
suit+0.065
Neg logit contributions
 parent-0.177
 owners-0.166
 parents-0.161
 guardian-0.156
father-0.155
Token,
Token ID11
Feature activation+2.17e-02

Pos logit contributions
ues+0.387
 handful+0.386
suits+0.369
 closely+0.369
 Whoever+0.359
Neg logit contributions
 parent-0.580
 owners-0.530
 parents-0.504
 guardian-0.484
father-0.482
Token until
Token ID1566
Feature activation+1.45e-01

Pos logit contributions
ues+0.019
 handful+0.018
suits+0.018
 sip+0.016
suit+0.016
Neg logit contributions
 parent-0.058
 owners-0.054
 parents-0.052
 guardian-0.050
 family-0.050
Token finally
Token ID3443
Feature activation+1.84e-02

Pos logit contributions
 handful+0.179
 harder+0.170
 closely+0.168
inn+0.163
 sip+0.163
Neg logit contributions
 parent-0.337
 owners-0.310
 guardian-0.291
father-0.290
 tou-0.288
Token,
Token ID11
Feature activation+2.05e-02

Pos logit contributions
 handful+0.026
ues+0.025
 closely+0.025
 Whoever+0.024
 harder+0.024
Neg logit contributions
 parent-0.040
 owners-0.036
 parents-0.034
father-0.033
 babys-0.033
Token it
Token ID340
Feature activation+2.08e-01

Pos logit contributions
 handful+0.028
 closely+0.027
ues+0.027
 Whoever+0.026
 harder+0.026
Neg logit contributions
 parent-0.044
 owners-0.040
 parents-0.038
father-0.037
mother-0.037
Token happened
Token ID3022
Feature activation-2.44e-02

Pos logit contributions
 handful+0.186
ues+0.186
 closely+0.180
suits+0.178
 sip+0.175
Neg logit contributions
 parent-0.217
 owners-0.196
 parents-0.185
 guardian-0.177
father-0.176
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
 parent+0.048
 owners+0.043
 parents+0.041
 guardian+0.039
 family+0.039
Neg logit contributions
 handful-0.038
ues-0.038
suits-0.037
 sip-0.036
 Whoever-0.036
Token She
Token ID1375
Feature activation-2.14e-02

Pos logit contributions
 handful+0.022
ues+0.022
suits+0.021
 closely+0.021
 sip+0.020
Neg logit contributions
 parent-0.031
 owners-0.028
 parents-0.027
 family-0.026
 babys-0.025
Token rubbed
Token ID31862
Feature activation+8.31e-02

Pos logit contributions
 parent+0.044
 owners+0.040
 parents+0.038
father+0.037
 guardian+0.037
Neg logit contributions
 closely-0.031
 handful-0.031
ues-0.031
 sip-0.030
suits-0.029
Token it
Token ID340
Feature activation+2.10e-01

Pos logit contributions
ues+0.160
suits+0.157
 Whoever+0.153
 handful+0.149
suit+0.148
Neg logit contributions
 parent-0.137
 owners-0.121
 parents-0.116
 family-0.107
 aer-0.106
Token harder
Token ID7069
Feature activation+2.75e-01

Pos logit contributions
ues+0.328
suits+0.319
 Whoever+0.307
 handful+0.304
inn+0.299
Neg logit contributions
 parent-0.410
 owners-0.371
 parents-0.352
 aer-0.336
 family-0.336
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
ues+0.347
 handful+0.345
suits+0.336
 Whoever+0.325
suit+0.320
Neg logit contributions
 parent-0.623
 owners-0.572
 parents-0.549
 family-0.526
father-0.524
Token Still
Token ID7831
Feature activation+1.03e-01

Pos logit contributions
 handful+0.018
ues+0.017
suits+0.017
suit+0.017
 closely+0.016
Neg logit contributions
 parent-0.021
 owners-0.019
 parents-0.018
 family-0.017
 babys-0.017
Token nothing
Token ID2147
Feature activation+8.97e-02

Pos logit contributions
 handful+0.149
ues+0.147
 closely+0.144
 harder+0.140
 sip+0.140
Neg logit contributions
 parent-0.214
 owners-0.196
 parents-0.183
 tou-0.181
father-0.180
Token.
Token ID13
Feature activation+6.79e-03

Pos logit contributions
ues+0.133
 handful+0.133
 closely+0.128
suits+0.126
 sip+0.124
Neg logit contributions
 parent-0.183
 owners-0.167
 parents-0.158
 guardian-0.152
father-0.151
Token She
Token ID1375
Feature activation-2.78e-02

Pos logit contributions
 handful+0.011
ues+0.010
suits+0.010
 closely+0.010
suit+0.010
Neg logit contributions
 parent-0.013
 owners-0.012
 parents-0.012
 family-0.011
 babys-0.011
Token away
Token ID1497
Feature activation+3.41e-02

Pos logit contributions
ues+0.085
suits+0.084
 handful+0.083
 sip+0.082
sc+0.080
Neg logit contributions
 parent-0.083
 owners-0.078
 parents-0.076
 guardian-0.072
 family-0.071
Token.
Token ID13
Feature activation+9.65e-04

Pos logit contributions
 closely+0.044
 handful+0.044
ues+0.043
 harder+0.041
ows+0.040
Neg logit contributions
 parent-0.077
 owners-0.070
 parents-0.065
 babys-0.065
father-0.065
Token He
Token ID679
Feature activation-1.17e-02

Pos logit contributions
ues+0.001
 handful+0.001
 closely+0.001
suits+0.001
 sip+0.001
Neg logit contributions
 parent-0.002
 owners-0.002
 parents-0.002
 family-0.002
 guardian-0.002
Token rolled
Token ID11686
Feature activation+5.21e-02

Pos logit contributions
 parent+0.024
 owners+0.021
father+0.021
 parents+0.020
 guardian+0.020
Neg logit contributions
 closely-0.017
 handful-0.017
suits-0.017
 Whoever-0.017
ues-0.017
Token the
Token ID262
Feature activation-4.10e-02

Pos logit contributions
ues+0.103
 appet+0.098
ows+0.097
ishy+0.097
suits+0.096
Neg logit contributions
 parent-0.079
 parents-0.072
 owners-0.071
 family-0.067
 aer-0.065
Token dice
Token ID17963
Feature activation+2.74e-01

Pos logit contributions
 parent+0.071
 family+0.064
 owners+0.064
 je+0.062
 parents+0.061
Neg logit contributions
ues-0.077
ksh-0.070
inn-0.070
ows-0.069
suits-0.067
Token and
Token ID290
Feature activation+6.09e-02

Pos logit contributions
ues+0.376
 handful+0.367
suits+0.353
 sip+0.348
 Whoever+0.345
Neg logit contributions
 parent-0.576
 owners-0.531
 parents-0.510
 guardian-0.490
 family-0.489
Token moved
Token ID3888
Feature activation-1.14e-03

Pos logit contributions
 handful+0.095
ues+0.093
 closely+0.091
suits+0.090
 Whoever+0.089
Neg logit contributions
 parent-0.121
 owners-0.110
 parents-0.103
 guardian-0.100
father-0.100
Token his
Token ID465
Feature activation-1.03e-01

Pos logit contributions
 parent+0.002
 owners+0.002
 parents+0.002
 family+0.002
 aer+0.002
Neg logit contributions
ues-0.002
 sip-0.002
suits-0.002
ishy-0.002
ows-0.002
Token pieces
Token ID5207
Feature activation+4.73e-02

Pos logit contributions
 parent+0.212
 owners+0.193
 parents+0.184
 guardian+0.176
 family+0.175
Neg logit contributions
ues-0.152
 handful-0.151
 closely-0.144
suits-0.144
 Whoever-0.141
Token around
Token ID1088
Feature activation+9.26e-02

Pos logit contributions
ues+0.071
 handful+0.069
suits+0.068
 Whoever+0.067
 sip+0.066
Neg logit contributions
 parent-0.093
 owners-0.085
 parents-0.081
 family-0.078
 guardian-0.078
Token the
Token ID262
Feature activation-2.27e-02

Pos logit contributions
 handful+0.032
 closely+0.030
 sip+0.030
ues+0.030
 harder+0.029
Neg logit contributions
 parent-0.048
 owners-0.044
father-0.041
 guardian-0.041
 parents-0.041
Token wind
Token ID2344
Feature activation+6.59e-02

Pos logit contributions
 parent+0.041
father+0.041
mother+0.039
 owners+0.037
pa+0.037
Neg logit contributions
 handful-0.033
 harder-0.032
 slim-0.032
 sip-0.032
suits-0.031
Token started
Token ID2067
Feature activation-6.74e-02

Pos logit contributions
 handful+0.090
 closely+0.088
ues+0.088
suit+0.087
 sip+0.087
Neg logit contributions
 parent-0.141
 owners-0.129
father-0.124
 parents-0.124
 guardian-0.119
Token to
Token ID284
Feature activation-9.47e-02

Pos logit contributions
 parent+0.143
father+0.135
 owners+0.132
 parents+0.128
mother+0.127
Neg logit contributions
 harder-0.095
 handful-0.094
 closely-0.093
 sip-0.088
 slim-0.084
Token blow
Token ID6611
Feature activation+5.36e-02

Pos logit contributions
 parent+0.188
 owners+0.175
 parents+0.165
father+0.159
mother+0.157
Neg logit contributions
 handful-0.146
 closely-0.143
 sip-0.142
ues-0.140
 harder-0.138
Token harder
Token ID7069
Feature activation+2.74e-01

Pos logit contributions
ues+0.082
 handful+0.077
suits+0.077
 Whoever+0.075
inn+0.073
Neg logit contributions
 parent-0.108
 owners-0.098
 parents-0.092
 aer-0.090
 family-0.089
Token and
Token ID290
Feature activation+7.22e-02

Pos logit contributions
ues+0.337
 handful+0.337
 closely+0.323
suits+0.318
 Whoever+0.309
Neg logit contributions
 parent-0.627
 owners-0.578
 parents-0.551
 guardian-0.533
father-0.531
Token harder
Token ID7069
Feature activation+2.74e-01

Pos logit contributions
ues+0.075
 handful+0.074
suits+0.070
 closely+0.069
inn+0.066
Neg logit contributions
 parent-0.180
 owners-0.167
 parents-0.160
 guardian-0.155
 family-0.154
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
ues+0.356
 handful+0.354
 closely+0.352
 harder+0.332
ows+0.331
Neg logit contributions
 parent-0.609
 owners-0.559
 parents-0.529
father-0.524
 guardian-0.520
Token Suddenly
Token ID24975
Feature activation+1.73e-01

Pos logit contributions
ues+0.015
 handful+0.015
suits+0.014
 closely+0.014
suit+0.013
Neg logit contributions
 parent-0.023
 owners-0.021
 parents-0.020
 family-0.019
 babys-0.019
Token,
Token ID11
Feature activation+1.73e-02

Pos logit contributions
 handful+0.241
ues+0.232
 closely+0.228
 sip+0.221
suits+0.219
Neg logit contributions
 parent-0.370
 owners-0.341
 parents-0.318
father-0.317
 guardian-0.313
Token Suddenly
Token ID24975
Feature activation+1.72e-01

Pos logit contributions
 handful+0.014
ues+0.014
 closely+0.013
suits+0.013
 sip+0.012
Neg logit contributions
 parent-0.023
 owners-0.021
 parents-0.020
 family-0.019
 babys-0.019
Token,
Token ID11
Feature activation+1.45e-02

Pos logit contributions
 handful+0.218
ues+0.204
 closely+0.199
suits+0.195
 harder+0.193
Neg logit contributions
 parent-0.386
 owners-0.360
father-0.343
 guardian-0.339
 tou-0.337
Token the
Token ID262
Feature activation-3.55e-02

Pos logit contributions
 handful+0.018
ues+0.017
 closely+0.017
 sip+0.016
 Whoever+0.016
Neg logit contributions
 parent-0.033
 owners-0.030
father-0.029
 guardian-0.029
mother-0.028
Token wind
Token ID2344
Feature activation+6.12e-02

Pos logit contributions
 parent+0.064
 owners+0.058
father+0.057
mother+0.056
 guardian+0.053
Neg logit contributions
 handful-0.058
suits-0.055
suit-0.054
 sip-0.053
 dice-0.052
Token blew
Token ID17948
Feature activation+3.60e-02

Pos logit contributions
 handful+0.081
ues+0.079
 closely+0.078
 sip+0.077
suit+0.076
Neg logit contributions
 parent-0.135
 owners-0.123
father-0.119
 parents-0.118
 guardian-0.114
Token harder
Token ID7069
Feature activation+2.74e-01

Pos logit contributions
ues+0.053
 handful+0.050
suits+0.049
 Whoever+0.048
ows+0.047
Neg logit contributions
 parent-0.075
 owners-0.069
 parents-0.065
 je-0.062
 family-0.062
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
 handful+0.324
ues+0.323
 closely+0.310
suits+0.304
 sip+0.294
Neg logit contributions
 parent-0.640
 owners-0.590
 parents-0.564
 guardian-0.546
father-0.545
Token and
Token ID290
Feature activation+5.92e-02

Pos logit contributions
ues+0.016
suits+0.015
 handful+0.015
inn+0.015
suit+0.014
Neg logit contributions
 parent-0.030
 owners-0.028
 parents-0.027
 family-0.026
 je-0.026
Token it
Token ID340
Feature activation+2.06e-01

Pos logit contributions
 handful+0.089
ues+0.087
 closely+0.085
suits+0.083
 Whoever+0.083
Neg logit contributions
 parent-0.121
 owners-0.110
 parents-0.104
 guardian-0.101
father-0.100
Token got
Token ID1392
Feature activation+1.64e-02

Pos logit contributions
 handful+0.299
 closely+0.299
 harder+0.296
sc+0.290
ues+0.288
Neg logit contributions
 parent-0.419
 owners-0.373
father-0.372
 parents-0.364
mother-0.357
Token cold
Token ID4692
Feature activation+8.15e-02

Pos logit contributions
ues+0.021
 handful+0.020
suits+0.020
 closely+0.019
 Whoever+0.019
Neg logit contributions
 parent-0.037
 owners-0.034
 parents-0.033
 je-0.031
 family-0.031
Token started
Token ID2067
Feature activation-6.74e-02

Pos logit contributions
 handful+0.090
 closely+0.088
ues+0.088
suit+0.087
 sip+0.087
Neg logit contributions
 parent-0.141
 owners-0.129
father-0.124
 parents-0.124
 guardian-0.119
Token to
Token ID284
Feature activation-9.47e-02

Pos logit contributions
 parent+0.143
father+0.135
 owners+0.132
 parents+0.128
mother+0.127
Neg logit contributions
 harder-0.095
 handful-0.094
 closely-0.093
 sip-0.088
 slim-0.084
Token blow
Token ID6611
Feature activation+5.36e-02

Pos logit contributions
 parent+0.188
 owners+0.175
 parents+0.165
father+0.159
mother+0.157
Neg logit contributions
 handful-0.146
 closely-0.143
 sip-0.142
ues-0.140
 harder-0.138
Token harder
Token ID7069
Feature activation+2.74e-01

Pos logit contributions
ues+0.082
 handful+0.077
suits+0.077
 Whoever+0.075
inn+0.073
Neg logit contributions
 parent-0.108
 owners-0.098
 parents-0.092
 aer-0.090
 family-0.089
Token and
Token ID290
Feature activation+7.22e-02

Pos logit contributions
ues+0.337
 handful+0.337
 closely+0.323
suits+0.318
 Whoever+0.309
Neg logit contributions
 parent-0.627
 owners-0.578
 parents-0.551
 guardian-0.533
father-0.531
Token harder
Token ID7069
Feature activation+2.74e-01

Pos logit contributions
ues+0.075
 handful+0.074
suits+0.070
 closely+0.069
inn+0.066
Neg logit contributions
 parent-0.180
 owners-0.167
 parents-0.160
 guardian-0.155
 family-0.154
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
ues+0.356
 handful+0.354
 closely+0.352
 harder+0.332
ows+0.331
Neg logit contributions
 parent-0.609
 owners-0.559
 parents-0.529
father-0.524
 guardian-0.520
Token Suddenly
Token ID24975
Feature activation+1.73e-01

Pos logit contributions
ues+0.015
 handful+0.015
suits+0.014
 closely+0.014
suit+0.013
Neg logit contributions
 parent-0.023
 owners-0.021
 parents-0.020
 family-0.019
 babys-0.019
Token,
Token ID11
Feature activation+1.73e-02

Pos logit contributions
 handful+0.241
ues+0.232
 closely+0.228
 sip+0.221
suits+0.219
Neg logit contributions
 parent-0.370
 owners-0.341
 parents-0.318
father-0.317
 guardian-0.313
Token Lily
Token ID20037
Feature activation+1.01e-01

Pos logit contributions
 handful+0.026
ues+0.025
 closely+0.025
 sip+0.024
 Whoever+0.024
Neg logit contributions
 parent-0.035
 owners-0.032
 parents-0.030
father-0.030
 guardian-0.030
Token's
Token ID338
Feature activation-1.12e-01

Pos logit contributions
 closely+0.143
 handful+0.143
ues+0.139
suits+0.139
 sip+0.134
Neg logit contributions
 parent-0.205
 owners-0.188
 parents-0.179
 guardian-0.176
 tou-0.176
Token clay
Token ID21558
Feature activation+3.79e-03

Pos logit contributions
 parent+0.062
 owners+0.057
 parents+0.055
father+0.054
mother+0.052
Neg logit contributions
 handful-0.044
 closely-0.044
suits-0.043
sc-0.042
 sip-0.042
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
 handful+0.005
 closely+0.005
ues+0.005
sc+0.005
 sip+0.005
Neg logit contributions
 parent-0.008
 owners-0.008
 babys-0.007
 parents-0.007
father-0.007
Token He
Token ID679
Feature activation+1.46e-03

Pos logit contributions
ues+0.016
 handful+0.016
suits+0.016
 closely+0.015
 sip+0.015
Neg logit contributions
 parent-0.025
 owners-0.023
 parents-0.022
 family-0.021
 guardian-0.021
Token shook
Token ID14682
Feature activation+1.05e-01

Pos logit contributions
 handful+0.002
 closely+0.002
ues+0.002
 Whoever+0.002
suits+0.002
Neg logit contributions
 parent-0.003
 owners-0.003
 parents-0.003
father-0.002
 babys-0.002
Token it
Token ID340
Feature activation+2.08e-01

Pos logit contributions
ues+0.201
suits+0.200
sc+0.193
suit+0.191
ows+0.189
Neg logit contributions
 parent-0.175
 owners-0.156
 parents-0.151
 family-0.141
 guardian-0.140
Token harder
Token ID7069
Feature activation+2.74e-01

Pos logit contributions
ues+0.304
suits+0.293
 handful+0.286
inn+0.277
 dice+0.275
Neg logit contributions
 parent-0.432
 owners-0.399
 parents-0.374
 guardian-0.367
 family-0.359
Token and
Token ID290
Feature activation+7.06e-02

Pos logit contributions
ues+0.355
 handful+0.354
suits+0.341
 sip+0.332
 Whoever+0.329
Neg logit contributions
 parent-0.612
 owners-0.565
 parents-0.538
 guardian-0.517
 family-0.516
Token harder
Token ID7069
Feature activation+2.73e-01

Pos logit contributions
ues+0.101
suits+0.099
 handful+0.099
suit+0.094
inn+0.093
Neg logit contributions
 parent-0.147
 owners-0.136
 parents-0.130
 guardian-0.126
mother-0.124
Token and
Token ID290
Feature activation+7.06e-02

Pos logit contributions
ues+0.375
 handful+0.374
 closely+0.363
suits+0.351
 Whoever+0.346
Neg logit contributions
 parent-0.587
 owners-0.537
 parents-0.510
 guardian-0.492
father-0.490
Token the
Token ID262
Feature activation-3.16e-02

Pos logit contributions
ues+0.079
 handful+0.077
suits+0.076
suit+0.071
ows+0.071
Neg logit contributions
 parent-0.170
 owners-0.159
 parents-0.152
 guardian-0.148
 family-0.146
Token sparks
Token ID38306
Feature activation+3.57e-02

Pos logit contributions
 parent+0.061
 owners+0.056
 parents+0.053
father+0.053
mother+0.052
Neg logit contributions
 handful-0.048
 closely-0.046
ues-0.046
 sip-0.045
suits-0.045
Token realized
Token ID6939
Feature activation-9.17e-03

Pos logit contributions
 parent+0.029
 owners+0.026
 guardian+0.024
 parents+0.024
father+0.024
Neg logit contributions
ues-0.021
 closely-0.021
 handful-0.020
 sip-0.020
suits-0.020
Token that
Token ID326
Feature activation+1.90e-02

Pos logit contributions
 parent+0.020
 owners+0.019
 parents+0.018
 guardian+0.017
 tou+0.017
Neg logit contributions
 handful-0.012
 closely-0.012
ues-0.012
 Whoever-0.011
 harder-0.011
Token it
Token ID340
Feature activation+2.09e-01

Pos logit contributions
 handful+0.023
 closely+0.023
 harder+0.022
 Whoever+0.021
 sip+0.021
Neg logit contributions
 parent-0.043
 owners-0.039
 tou-0.037
 guardian-0.037
 babys-0.037
Token was
Token ID373
Feature activation-1.21e-01

Pos logit contributions
 closely+0.326
sc+0.308
 handful+0.308
 harder+0.307
ows+0.305
Neg logit contributions
 parent-0.407
 owners-0.366
 parents-0.347
 babys-0.346
 ost-0.344
Token much
Token ID881
Feature activation+9.04e-02

Pos logit contributions
 parent+0.254
 owners+0.232
 parents+0.220
 guardian+0.212
father+0.211
Neg logit contributions
 handful-0.172
ues-0.169
 closely-0.166
inn-0.160
 sip-0.159
Token harder
Token ID7069
Feature activation+2.73e-01

Pos logit contributions
ues+0.135
 handful+0.132
suits+0.131
suit+0.127
 Whoever+0.126
Neg logit contributions
 parent-0.185
 owners-0.168
 parents-0.161
 family-0.154
 guardian-0.154
Token than
Token ID621
Feature activation+4.04e-02

Pos logit contributions
ues+0.367
 handful+0.365
 closely+0.355
suits+0.347
 Whoever+0.336
Neg logit contributions
 parent-0.595
 owners-0.544
 parents-0.517
 guardian-0.501
 babys-0.500
Token it
Token ID340
Feature activation+2.08e-01

Pos logit contributions
 handful+0.053
 closely+0.052
 harder+0.052
ues+0.049
 Whoever+0.049
Neg logit contributions
 parent-0.089
 owners-0.081
 guardian-0.076
 tou-0.075
 pilot-0.075
Token looked
Token ID3114
Feature activation+1.15e-01

Pos logit contributions
 closely+0.314
 handful+0.303
ows+0.301
sc+0.299
 sip+0.296
Neg logit contributions
 parent-0.421
 owners-0.374
 parents-0.357
 pilot-0.353
 babys-0.352
Token.
Token ID13
Feature activation+9.21e-03

Pos logit contributions
ues+0.156
 Whoever+0.154
suits+0.150
inn+0.149
 handful+0.149
Neg logit contributions
 parent-0.240
 owners-0.221
 parents-0.214
 family-0.208
 guardian-0.207
Token He
Token ID679
Feature activation-2.14e-03

Pos logit contributions
ues+0.013
 handful+0.013
suits+0.012
 closely+0.012
 sip+0.012
Neg logit contributions
 parent-0.020
 owners-0.018
 parents-0.017
 family-0.017
 guardian-0.016
Token a
Token ID257
Feature activation+1.14e-01

Pos logit contributions
 handful+0.043
 harder+0.042
 closely+0.042
 sip+0.042
 spaghetti+0.040
Neg logit contributions
 parent-0.077
 guardian-0.069
 owners-0.069
 tou-0.068
 pilot-0.068
Token new
Token ID649
Feature activation-3.31e-02

Pos logit contributions
ues+0.189
 handful+0.185
 closely+0.180
suits+0.179
inn+0.176
Neg logit contributions
 parent-0.215
 owners-0.193
 parents-0.183
 guardian-0.175
 family-0.174
Token puzzle
Token ID15027
Feature activation+7.69e-02

Pos logit contributions
 parent+0.061
 owners+0.055
 parents+0.052
father+0.052
mother+0.050
Neg logit contributions
 handful-0.056
ues-0.053
 sip-0.053
 closely-0.053
suits-0.052
Token that
Token ID326
Feature activation+2.31e-02

Pos logit contributions
ues+0.102
 handful+0.102
 closely+0.097
suits+0.096
sc+0.095
Neg logit contributions
 parent-0.170
 owners-0.156
 parents-0.148
father-0.143
 babys-0.143
Token was
Token ID373
Feature activation-1.17e-01

Pos logit contributions
 closely+0.031
 handful+0.031
sc+0.031
 sip+0.031
suit+0.030
Neg logit contributions
 parent-0.048
 owners-0.043
father-0.042
 guardian-0.041
 pilot-0.041
Token harder
Token ID7069
Feature activation+2.73e-01

Pos logit contributions
 parent+0.261
 owners+0.238
 parents+0.228
father+0.222
 babys+0.220
Neg logit contributions
 handful-0.153
ues-0.150
 closely-0.148
 harder-0.139
 sip-0.138
Token and
Token ID290
Feature activation+7.96e-02

Pos logit contributions
ues+0.352
 handful+0.351
 closely+0.336
suits+0.332
 Whoever+0.323
Neg logit contributions
 parent-0.610
 owners-0.560
 parents-0.534
 guardian-0.516
father-0.514
Token more
Token ID517
Feature activation-5.07e-04

Pos logit contributions
ues+0.117
 handful+0.117
suits+0.112
 closely+0.111
inn+0.109
Neg logit contributions
 parent-0.163
 owners-0.149
 parents-0.142
 guardian-0.136
 family-0.135
Token fun
Token ID1257
Feature activation+1.03e-01

Pos logit contributions
 parent+0.001
 owners+0.001
 parents+0.001
 guardian+0.001
 tou+0.001
Neg logit contributions
ues-0.001
 handful-0.001
 closely-0.001
suits-0.001
 Whoever-0.001
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
ues+0.145
 closely+0.142
 handful+0.138
sc+0.130
suits+0.129
Neg logit contributions
 parent-0.222
 owners-0.200
 tou-0.191
 babys-0.189
 guardian-0.188
Token
Token ID198
Feature activation-3.65e-02

Pos logit contributions
 handful+0.025
ues+0.025
suits+0.024
 closely+0.024
 sip+0.024
Neg logit contributions
 parent-0.032
 owners-0.029
 parents-0.027
 family-0.026
 guardian-0.026
Token to
Token ID284
Feature activation-7.76e-02

Pos logit contributions
 parent+0.098
 owners+0.091
father+0.089
 tou+0.088
mother+0.086
Neg logit contributions
 closely-0.070
 handful-0.069
 harder-0.066
ues-0.065
 sip-0.065
Token soar
Token ID48701
Feature activation-1.04e-02

Pos logit contributions
 parent+0.148
 owners+0.139
father+0.130
 parents+0.129
mother+0.129
Neg logit contributions
 closely-0.122
 sip-0.122
 handful-0.121
ues-0.115
 harder-0.114
Token.
Token ID13
Feature activation+2.08e-02

Pos logit contributions
 parent+0.021
 owners+0.019
 parents+0.018
 guardian+0.018
 family+0.018
Neg logit contributions
ues-0.016
 handful-0.015
 Whoever-0.015
 sip-0.014
suits-0.014
Token They
Token ID1119
Feature activation+2.28e-02

Pos logit contributions
 handful+0.026
ues+0.026
 closely+0.024
 Whoever+0.024
suits+0.024
Neg logit contributions
 parent-0.047
 owners-0.043
 parents-0.041
 guardian-0.040
father-0.040
Token soared
Token ID35292
Feature activation-1.17e-02

Pos logit contributions
 handful+0.036
 closely+0.035
ues+0.034
suits+0.034
 sip+0.034
Neg logit contributions
 parent-0.045
 owners-0.041
 parents-0.039
 babys-0.038
father-0.038
Token higher
Token ID2440
Feature activation+2.73e-01

Pos logit contributions
 parent+0.023
 owners+0.022
 parents+0.021
 family+0.020
 guardian+0.020
Neg logit contributions
ues-0.017
 sip-0.016
 handful-0.016
 Whoever-0.016
 topp-0.015
Token in
Token ID287
Feature activation+2.52e-02

Pos logit contributions
ues+0.393
 handful+0.385
 sip+0.377
 Whoever+0.370
suits+0.365
Neg logit contributions
 parent-0.548
 owners-0.508
 parents-0.481
 family-0.475
 aer-0.475
Token the
Token ID262
Feature activation-2.07e-02

Pos logit contributions
ues+0.046
 handful+0.045
suits+0.044
 closely+0.043
ows+0.043
Neg logit contributions
 parent-0.044
 owners-0.039
 parents-0.037
 family-0.035
 guardian-0.035
Token sky
Token ID6766
Feature activation-1.13e-02

Pos logit contributions
 parent+0.039
 owners+0.036
 parents+0.034
father+0.032
 babys+0.032
Neg logit contributions
 handful-0.033
ues-0.033
 closely-0.032
suits-0.032
 sip-0.031
Token than
Token ID621
Feature activation+3.95e-02

Pos logit contributions
 parent+0.025
 owners+0.023
 parents+0.022
 guardian+0.021
 family+0.021
Neg logit contributions
ues-0.015
 handful-0.015
 closely-0.014
suits-0.014
 Whoever-0.014
Token the
Token ID262
Feature activation-2.29e-02

Pos logit contributions
 closely+0.067
 handful+0.066
 harder+0.066
ues+0.064
 Whoever+0.063
Neg logit contributions
 parent-0.071
 owners-0.064
father-0.058
 babys-0.058
 tou-0.057
Token He
Token ID679
Feature activation+1.46e-03

Pos logit contributions
ues+0.016
 handful+0.016
suits+0.016
 closely+0.015
 sip+0.015
Neg logit contributions
 parent-0.025
 owners-0.023
 parents-0.022
 family-0.021
 guardian-0.021
Token shook
Token ID14682
Feature activation+1.05e-01

Pos logit contributions
 handful+0.002
 closely+0.002
ues+0.002
 Whoever+0.002
suits+0.002
Neg logit contributions
 parent-0.003
 owners-0.003
 parents-0.003
father-0.002
 babys-0.002
Token it
Token ID340
Feature activation+2.08e-01

Pos logit contributions
ues+0.201
suits+0.200
sc+0.193
suit+0.191
ows+0.189
Neg logit contributions
 parent-0.175
 owners-0.156
 parents-0.151
 family-0.141
 guardian-0.140
Token harder
Token ID7069
Feature activation+2.74e-01

Pos logit contributions
ues+0.304
suits+0.293
 handful+0.286
inn+0.277
 dice+0.275
Neg logit contributions
 parent-0.432
 owners-0.399
 parents-0.374
 guardian-0.367
 family-0.359
Token and
Token ID290
Feature activation+7.06e-02

Pos logit contributions
ues+0.355
 handful+0.354
suits+0.341
 sip+0.332
 Whoever+0.329
Neg logit contributions
 parent-0.612
 owners-0.565
 parents-0.538
 guardian-0.517
 family-0.516
Token harder
Token ID7069
Feature activation+2.73e-01

Pos logit contributions
ues+0.101
suits+0.099
 handful+0.099
suit+0.094
inn+0.093
Neg logit contributions
 parent-0.147
 owners-0.136
 parents-0.130
 guardian-0.126
mother-0.124
Token and
Token ID290
Feature activation+7.06e-02

Pos logit contributions
ues+0.375
 handful+0.374
 closely+0.363
suits+0.351
 Whoever+0.346
Neg logit contributions
 parent-0.587
 owners-0.537
 parents-0.510
 guardian-0.492
father-0.490
Token the
Token ID262
Feature activation-3.16e-02

Pos logit contributions
ues+0.079
 handful+0.077
suits+0.076
suit+0.071
ows+0.071
Neg logit contributions
 parent-0.170
 owners-0.159
 parents-0.152
 guardian-0.148
 family-0.146
Token sparks
Token ID38306
Feature activation+3.57e-02

Pos logit contributions
 parent+0.061
 owners+0.056
 parents+0.053
father+0.053
mother+0.052
Neg logit contributions
 handful-0.048
 closely-0.046
ues-0.046
 sip-0.045
suits-0.045
Token flew
Token ID13112
Feature activation-3.28e-02

Pos logit contributions
 handful+0.058
 closely+0.058
ues+0.057
 sip+0.056
ows+0.054
Neg logit contributions
 parent-0.069
 owners-0.063
 parents-0.059
father-0.059
 babys-0.057
Token more
Token ID517
Feature activation-3.32e-03

Pos logit contributions
 parent+0.069
 owners+0.063
 parents+0.060
 guardian+0.058
 tou+0.058
Neg logit contributions
ues-0.046
 handful-0.045
suits-0.044
 closely-0.043
 Whoever-0.042
Token long
Token ID890
Feature activation+1.60e-01

Pos logit contributions
 handful+0.044
 closely+0.042
ues+0.042
 Whoever+0.040
suits+0.040
Neg logit contributions
 parent-0.080
 owners-0.074
father-0.070
 parents-0.070
mother-0.070
Token jump
Token ID4391
Feature activation+1.13e-01

Pos logit contributions
 handful+0.236
ues+0.228
 closely+0.223
suits+0.218
 sip+0.216
Neg logit contributions
 parent-0.328
 owners-0.300
 parents-0.284
father-0.276
 guardian-0.275
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
ues+0.172
 Whoever+0.160
 handful+0.159
inn+0.156
ensive+0.155
Neg logit contributions
 parent-0.213
 owners-0.196
 parents-0.184
 je-0.182
 aer-0.180
Token He
Token ID679
Feature activation+5.92e-03

Pos logit contributions
ues+0.025
 handful+0.025
 closely+0.023
suits+0.023
 Whoever+0.023
Neg logit contributions
 parent-0.045
 owners-0.041
 parents-0.039
 guardian-0.038
father-0.038
Token jumped
Token ID11687
Feature activation+1.10e-01

Pos logit contributions
 handful+0.009
 closely+0.009
ues+0.009
suits+0.009
 sip+0.008
Neg logit contributions
 parent-0.012
 owners-0.011
 parents-0.010
father-0.010
 guardian-0.010
Token higher
Token ID2440
Feature activation+2.73e-01

Pos logit contributions
ues+0.208
 topp+0.195
suits+0.188
 Whoever+0.184
ows+0.182
Neg logit contributions
 parent-0.157
 owners-0.145
 parents-0.144
 guardian-0.136
 family-0.135
Token and
Token ID290
Feature activation+7.77e-02

Pos logit contributions
ues+0.434
 handful+0.410
suits+0.409
 Whoever+0.407
 sip+0.406
Neg logit contributions
 parent-0.499
 owners-0.460
 parents-0.440
 aer-0.437
 family-0.433
Token higher
Token ID2440
Feature activation+2.70e-01

Pos logit contributions
ues+0.086
 handful+0.074
inn+0.072
suits+0.071
 topp+0.067
Neg logit contributions
 parent-0.178
 owners-0.167
 je-0.165
 guardian-0.164
 pilot-0.163
Token and
Token ID290
Feature activation+7.73e-02

Pos logit contributions
ues+0.405
 handful+0.392
suits+0.389
 sip+0.384
 Whoever+0.382
Neg logit contributions
 parent-0.522
 owners-0.484
 parents-0.465
 aer-0.450
 family-0.450
Token everyone
Token ID2506
Feature activation-9.09e-02

Pos logit contributions
ues+0.077
 handful+0.069
inn+0.066
suits+0.066
 sip+0.062
Neg logit contributions
 parent-0.189
 owners-0.178
 je-0.174
 parents-0.173
 guardian-0.173
Token was
Token ID373
Feature activation-1.21e-01

Pos logit contributions
 parent+0.195
 owners+0.176
 parents+0.167
father+0.164
 babys+0.162
Neg logit contributions
 handful-0.132
 closely-0.129
ues-0.128
suits-0.126
sc-0.123
Token came
Token ID1625
Feature activation-3.55e-02

Pos logit contributions
ues+0.102
 handful+0.100
suits+0.097
 sip+0.095
 closely+0.094
Neg logit contributions
 parent-0.139
 owners-0.127
 parents-0.121
 family-0.116
 je-0.115
Token and
Token ID290
Feature activation+6.45e-02

Pos logit contributions
 parent+0.082
 owners+0.076
 parents+0.073
father+0.072
 babys+0.072
Neg logit contributions
 closely-0.042
 handful-0.040
 harder-0.040
ues-0.040
 Whoever-0.039
Token he
Token ID339
Feature activation-1.70e-02

Pos logit contributions
ues+0.094
 handful+0.094
 closely+0.090
suits+0.090
 Whoever+0.088
Neg logit contributions
 parent-0.134
 owners-0.122
 parents-0.116
 guardian-0.111
father-0.111
Token rolled
Token ID11686
Feature activation+5.26e-02

Pos logit contributions
 parent+0.034
 owners+0.031
father+0.030
 parents+0.030
 guardian+0.029
Neg logit contributions
 handful-0.025
 closely-0.025
ues-0.025
suits-0.024
 Whoever-0.024
Token the
Token ID262
Feature activation-3.39e-02

Pos logit contributions
ues+0.104
ishy+0.094
ows+0.093
 sip+0.093
 appet+0.092
Neg logit contributions
 parent-0.080
 owners-0.074
 parents-0.072
 family-0.070
 aer-0.069
Token dice
Token ID17963
Feature activation+2.72e-01

Pos logit contributions
 parent+0.069
 owners+0.062
 pilot+0.059
 aer+0.059
 je+0.058
Neg logit contributions
ues-0.053
 handful-0.048
 sip-0.048
inn-0.047
 closely-0.047
Token.
Token ID13
Feature activation+5.93e-03

Pos logit contributions
ues+0.435
 handful+0.434
 sip+0.416
 appet+0.414
suits+0.413
Neg logit contributions
 parent-0.484
 owners-0.445
 parents-0.426
 family-0.405
 aer-0.402
Token He
Token ID679
Feature activation-9.01e-03

Pos logit contributions
ues+0.010
 handful+0.009
suits+0.009
 closely+0.009
 sip+0.009
Neg logit contributions
 parent-0.011
 owners-0.011
 je-0.010
 parents-0.010
 babys-0.010
Token moved
Token ID3888
Feature activation-7.89e-05

Pos logit contributions
 parent+0.018
 owners+0.016
 parents+0.015
 family+0.015
 je+0.014
Neg logit contributions
ues-0.014
 handful-0.014
 closely-0.013
suits-0.013
 sip-0.013
Token his
Token ID465
Feature activation-9.71e-02

Pos logit contributions
 parent+0.000
 owners+0.000
 aer+0.000
 parents+0.000
 family+0.000
Neg logit contributions
ues-0.000
ows-0.000
 sip-0.000
ishy-0.000
 appet-0.000
Token piece
Token ID3704
Feature activation+1.01e-01

Pos logit contributions
 parent+0.197
 owners+0.179
 parents+0.170
 guardian+0.163
father+0.163
Neg logit contributions
ues-0.146
 handful-0.146
 closely-0.140
suits-0.139
 Whoever-0.136
Token and
Token ID290
Feature activation+6.55e-02

Pos logit contributions
ues+0.006
 handful+0.006
 closely+0.006
ows+0.006
 harder+0.006
Neg logit contributions
 parent-0.012
 owners-0.011
 tou-0.010
 parents-0.010
 babys-0.010
Token courage
Token ID11917
Feature activation+2.27e-02

Pos logit contributions
 harder+0.095
 closely+0.094
 handful+0.092
ues+0.091
sc+0.090
Neg logit contributions
 parent-0.137
 babys-0.121
 owners-0.120
 parents-0.114
 tou-0.114
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
ues+0.029
 handful+0.028
 closely+0.028
ows+0.027
 harder+0.027
Neg logit contributions
 parent-0.052
 owners-0.047
 babys-0.045
 parents-0.045
father-0.044
Token Dave
Token ID9935
Feature activation+6.92e-02

Pos logit contributions
 handful+0.020
 closely+0.020
 Whoever+0.019
ues+0.019
 harder+0.019
Neg logit contributions
 parent-0.031
 owners-0.028
father-0.027
 babys-0.027
 guardian-0.026
Token worked
Token ID3111
Feature activation+2.00e-02

Pos logit contributions
 closely+0.104
ues+0.103
 handful+0.102
suits+0.099
ows+0.098
Neg logit contributions
 parent-0.136
 babys-0.124
 owners-0.123
 tou-0.120
 ost-0.118
Token harder
Token ID7069
Feature activation+2.72e-01

Pos logit contributions
 sip+0.025
suits+0.024
ues+0.024
 handful+0.024
ishy+0.023
Neg logit contributions
 parent-0.044
 owners-0.042
 parents-0.040
mother-0.039
father-0.038
Token.
Token ID13
Feature activation+8.57e-04

Pos logit contributions
 handful+0.352
ues+0.352
suits+0.339
 sip+0.330
 Whoever+0.327
Neg logit contributions
 parent-0.605
 owners-0.559
 parents-0.533
father-0.512
 family-0.510
Token In
Token ID554
Feature activation-2.12e-02

Pos logit contributions
ues+0.001
 handful+0.001
 closely+0.001
 Whoever+0.001
suits+0.001
Neg logit contributions
 parent-0.002
 owners-0.002
 parents-0.002
 guardian-0.002
 tou-0.002
Token the
Token ID262
Feature activation-3.80e-02

Pos logit contributions
 parent+0.043
 owners+0.039
 tou+0.037
 parents+0.037
 babys+0.037
Neg logit contributions
 handful-0.031
 closely-0.030
ues-0.030
inn-0.029
 harder-0.029
Token end
Token ID886
Feature activation-9.82e-02

Pos logit contributions
 parent+0.060
 owners+0.053
 parents+0.050
 tou+0.049
 babys+0.049
Neg logit contributions
 handful-0.072
 harder-0.071
 closely-0.070
 sip-0.069
 dice-0.067
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
 parent+0.213
 owners+0.193
 parents+0.181
 tou+0.179
father+0.179
Neg logit contributions
 handful-0.137
ues-0.136
 closely-0.132
 Whoever-0.125
suits-0.124
Token with
Token ID351
Feature activation+3.90e-02

Pos logit contributions
 parent+0.167
 owners+0.155
 parents+0.146
 family+0.143
 neighbors+0.137
Neg logit contributions
ues-0.136
 handful-0.132
inn-0.132
suits-0.130
 Whoever-0.129
Token nice
Token ID3621
Feature activation+8.57e-02

Pos logit contributions
 handful+0.066
ues+0.064
 closely+0.064
suits+0.062
 sip+0.061
Neg logit contributions
 parent-0.070
 owners-0.062
 guardian-0.059
 babys-0.058
 parents-0.058
Token dogs
Token ID6844
Feature activation-9.20e-02

Pos logit contributions
ues+0.145
 handful+0.142
 closely+0.140
inn+0.138
 Whoever+0.137
Neg logit contributions
 parent-0.158
 owners-0.143
 parents-0.136
 family-0.130
father-0.127
Token that
Token ID326
Feature activation+7.83e-03

Pos logit contributions
 parent+0.182
 owners+0.166
 parents+0.158
 family+0.151
mother+0.148
Neg logit contributions
ues-0.142
 handful-0.140
suits-0.133
inn-0.132
 Whoever-0.132
Token have
Token ID423
Feature activation+1.84e-02

Pos logit contributions
 sip+0.011
 handful+0.010
sc+0.010
 slim+0.010
suits+0.010
Neg logit contributions
 parent-0.017
 guardian-0.015
 owners-0.015
father-0.015
mother-0.015
Token owners
Token ID4393
Feature activation-3.52e-01

Pos logit contributions
 handful+0.025
ues+0.024
ows+0.024
suits+0.024
 topp+0.024
Neg logit contributions
 parent-0.039
father-0.034
 guardian-0.034
 babys-0.034
 owners-0.034
Token,"
Token ID553
Feature activation+9.11e-02

Pos logit contributions
 parent+0.736
 owners+0.669
 parents+0.635
 guardian+0.624
father+0.613
Neg logit contributions
 handful-0.501
ues-0.496
 closely-0.481
suits-0.481
 sip-0.471
Token the
Token ID262
Feature activation-3.32e-02

Pos logit contributions
 closely+0.105
 harder+0.103
 handful+0.102
 Whoever+0.096
 sip+0.096
Neg logit contributions
 parent-0.211
 owners-0.192
mother-0.187
 guardian-0.186
father-0.183
Token man
Token ID582
Feature activation-6.75e-02

Pos logit contributions
 parent+0.056
 owners+0.049
mother+0.048
father+0.047
 parents+0.047
Neg logit contributions
 handful-0.059
ues-0.057
suits-0.056
 closely-0.055
 harder-0.055
Token says
Token ID1139
Feature activation-1.38e-02

Pos logit contributions
 parent+0.149
 owners+0.137
 parents+0.131
 guardian+0.126
father+0.125
Neg logit contributions
ues-0.089
 handful-0.088
suits-0.084
 closely-0.084
 Whoever-0.081
Token.
Token ID13
Feature activation+2.39e-03

Pos logit contributions
 parent+0.025
 owners+0.022
 parents+0.022
 family+0.020
mother+0.020
Neg logit contributions
 handful-0.023
ues-0.023
suits-0.023
 Whoever-0.022
 sip-0.022
Token dogs
Token ID6844
Feature activation-9.01e-02

Pos logit contributions
ashes+0.066
ues+0.065
suits+0.065
ets+0.063
 minutes+0.063
Neg logit contributions
 parent-0.040
 guardian-0.038
father-0.037
 tou-0.035
mother-0.034
Token.
Token ID13
Feature activation+1.75e-03

Pos logit contributions
 parent+0.220
father+0.197
 owners+0.195
 babys+0.194
 guardian+0.191
Neg logit contributions
ows-0.098
 closely-0.096
suits-0.094
 handful-0.094
ues-0.091
Token They
Token ID1119
Feature activation+5.85e-03

Pos logit contributions
ues+0.002
 handful+0.002
 closely+0.002
suits+0.002
 sip+0.002
Neg logit contributions
 parent-0.004
 owners-0.004
 parents-0.004
 guardian-0.004
 family-0.004
Token love
Token ID1842
Feature activation-3.90e-02

Pos logit contributions
 sip+0.007
 closely+0.006
ows+0.006
 handful+0.006
 Whoever+0.006
Neg logit contributions
 parent-0.014
father-0.013
mother-0.013
 owners-0.013
 babys-0.012
Token their
Token ID511
Feature activation-1.01e-01

Pos logit contributions
 parent+0.085
 guardian+0.084
 pilot+0.080
mother+0.080
father+0.080
Neg logit contributions
 closely-0.043
 harder-0.043
 spaghetti-0.043
 handful-0.042
 Whoever-0.040
Token owners
Token ID4393
Feature activation-3.52e-01

Pos logit contributions
mother+0.174
father+0.174
 parent+0.171
 guardian+0.155
 babys+0.149
Neg logit contributions
suits-0.162
 closely-0.153
 handful-0.152
 harder-0.149
 sip-0.146
Token.
Token ID13
Feature activation+1.08e-03

Pos logit contributions
 parent+0.823
 owners+0.740
 guardian+0.709
father+0.708
 babys+0.703
Neg logit contributions
 handful-0.427
 closely-0.407
ows-0.404
suits-0.400
ues-0.396
Token They
Token ID1119
Feature activation+5.69e-03

Pos logit contributions
 handful+0.001
ues+0.001
 closely+0.001
suits+0.001
 sip+0.001
Neg logit contributions
 parent-0.003
 owners-0.002
 parents-0.002
 family-0.002
 guardian-0.002
Token are
Token ID389
Feature activation-5.83e-02

Pos logit contributions
 sip+0.007
 closely+0.006
ows+0.006
 handful+0.006
suits+0.006
Neg logit contributions
 parent-0.013
father-0.012
mother-0.012
 owners-0.012
 babys-0.012
Token happy
Token ID3772
Feature activation+7.06e-02

Pos logit contributions
 parent+0.103
father+0.099
mother+0.096
 babys+0.094
grand+0.092
Neg logit contributions
 handful-0.090
 closely-0.089
 harder-0.082
 slim-0.081
ends-0.080
Token.
Token ID13
Feature activation+9.66e-04

Pos logit contributions
ues+0.070
 closely+0.069
 handful+0.069
 harder+0.068
 sip+0.064
Neg logit contributions
 parent-0.172
 owners-0.157
father-0.156
 guardian-0.154
 tou-0.153
Token and
Token ID290
Feature activation+6.37e-02

Pos logit contributions
 closely+0.018
 handful+0.018
ues+0.018
ows+0.017
 harder+0.017
Neg logit contributions
 parent-0.037
 owners-0.034
 parents-0.032
 tou-0.032
 babys-0.032
Token return
Token ID1441
Feature activation-1.04e-01

Pos logit contributions
 closely+0.095
 sip+0.091
ues+0.088
 handful+0.087
 harder+0.087
Neg logit contributions
 parent-0.123
father-0.111
 owners-0.109
 babys-0.107
mother-0.105
Token them
Token ID606
Feature activation+1.28e-02

Pos logit contributions
 parent+0.216
 owners+0.189
 pilot+0.184
father+0.183
 je+0.182
Neg logit contributions
 handful-0.152
 closely-0.151
 harder-0.146
ues-0.141
suits-0.137
Token to
Token ID284
Feature activation-1.03e-01

Pos logit contributions
 closely+0.017
 handful+0.017
ows+0.016
suits+0.016
ues+0.015
Neg logit contributions
 parent-0.029
 owners-0.025
father-0.025
 pilot-0.025
 babys-0.025
Token their
Token ID511
Feature activation-1.03e-01

Pos logit contributions
 parent+0.190
 owners+0.174
 babys+0.167
 pilot+0.164
 je+0.164
Neg logit contributions
 closely-0.157
 sip-0.156
 handful-0.151
 harder-0.146
ues-0.145
Token owners
Token ID4393
Feature activation-3.50e-01

Pos logit contributions
 parent+0.150
father+0.149
 known+0.143
mother+0.139
 tou+0.132
Neg logit contributions
 closely-0.199
suits-0.176
 handful-0.175
 sip-0.174
 harder-0.172
Token.
Token ID13
Feature activation+3.26e-03

Pos logit contributions
 parent+0.819
 owners+0.751
 tou+0.731
 guardian+0.713
 parents+0.705
Neg logit contributions
 closely-0.425
 handful-0.408
ues-0.388
 sip-0.378
sc-0.375
Token<|endoftext|>
Token ID50256
Feature activation-1.22e-02

Pos logit contributions
 handful+0.004
ues+0.004
 Whoever+0.004
 closely+0.004
inn+0.004
Neg logit contributions
 parent-0.007
 owners-0.007
 parents-0.006
 guardian-0.006
 pilot-0.006
TokenOnce
Token ID7454
Feature activation+1.21e-02

Pos logit contributions
 parent+0.021
 owners+0.019
 parents+0.018
 family+0.018
 je+0.017
Neg logit contributions
ues-0.022
 handful-0.021
 closely-0.021
ishy-0.021
suits-0.020
Token upon
Token ID2402
Feature activation+1.20e-01

Pos logit contributions
ues+0.014
 handful+0.014
suits+0.013
sc+0.013
 Whoever+0.013
Neg logit contributions
 parent-0.029
 owners-0.026
 parents-0.026
 family-0.025
 je-0.024
Token a
Token ID257
Feature activation+9.92e-02

Pos logit contributions
ues+0.242
suit+0.223
suits+0.221
inn+0.220
ishy+0.216
Neg logit contributions
 parent-0.198
 family-0.181
 owners-0.179
 parents-0.164
 neighbors-0.161
Token to
Token ID284
Feature activation-1.03e-01

Pos logit contributions
 parent+0.016
 owners+0.014
 guardian+0.014
 tou+0.014
father+0.014
Neg logit contributions
 handful-0.009
 closely-0.009
ues-0.009
suits-0.009
 Whoever-0.008
Token have
Token ID423
Feature activation+2.12e-02

Pos logit contributions
 parent+0.223
 owners+0.216
 babys+0.204
 parents+0.203
 guardian+0.200
Neg logit contributions
 Whoever-0.121
 sip-0.120
 handful-0.110
inn-0.106
 dice-0.102
Token you
Token ID345
Feature activation-6.32e-02

Pos logit contributions
 handful+0.029
ues+0.027
 sip+0.026
 closely+0.026
 Whoever+0.026
Neg logit contributions
 parent-0.047
 owners-0.041
 guardian-0.041
 babys-0.040
 pilot-0.040
Token as
Token ID355
Feature activation+1.16e-01

Pos logit contributions
 parent+0.139
 owners+0.127
 parents+0.119
father+0.118
 tou+0.118
Neg logit contributions
 handful-0.084
ues-0.083
suits-0.082
 closely-0.079
ows-0.078
Token their
Token ID511
Feature activation-9.66e-02

Pos logit contributions
ues+0.152
 handful+0.148
 closely+0.146
suits+0.143
inn+0.139
Neg logit contributions
 parent-0.256
 owners-0.235
father-0.221
 parents-0.220
 guardian-0.219
Token owners
Token ID4393
Feature activation-3.50e-01

Pos logit contributions
 parent+0.174
mother+0.173
father+0.169
 owners+0.164
 guardian+0.158
Neg logit contributions
suits-0.148
 handful-0.147
 closely-0.143
ues-0.136
suit-0.133
Token.
Token ID13
Feature activation+7.22e-03

Pos logit contributions
 parent+0.787
 owners+0.722
 guardian+0.675
 parents+0.673
father+0.660
Neg logit contributions
 handful-0.446
suits-0.445
 sip-0.438
ues-0.433
ows-0.428
Token You
Token ID921
Feature activation+8.66e-02

Pos logit contributions
ues+0.008
 Whoever+0.008
 handful+0.007
 closely+0.007
suits+0.007
Neg logit contributions
 parent-0.017
 owners-0.016
 parents-0.015
mother-0.015
 babys-0.015
Token know
Token ID760
Feature activation-8.97e-04

Pos logit contributions
 handful+0.120
 Whoever+0.119
suits+0.117
 sip+0.116
ues+0.116
Neg logit contributions
 parent-0.179
 owners-0.165
 parents-0.158
 babys-0.154
father-0.152
Token what
Token ID644
Feature activation+1.35e-01

Pos logit contributions
 parent+0.002
 guardian+0.002
 owners+0.002
 babys+0.002
father+0.002
Neg logit contributions
 closely-0.001
 harder-0.001
 handful-0.001
ues-0.001
ows-0.001
Token?
Token ID30
Feature activation+1.64e-02

Pos logit contributions
 spaghetti+0.154
 dice+0.146
 topp+0.142
sc+0.135
 handful+0.134
Neg logit contributions
 parent-0.296
 guardian-0.278
 owners-0.273
father-0.269
mother-0.264
Token or
Token ID393
Feature activation+4.48e-02

Pos logit contributions
 handful+0.058
ues+0.058
 closely+0.057
suits+0.055
 harder+0.054
Neg logit contributions
 parent-0.094
 owners-0.086
 parents-0.082
 guardian-0.080
father-0.080
Token angry
Token ID7954
Feature activation+4.52e-02

Pos logit contributions
 handful+0.072
ues+0.070
 closely+0.070
 sip+0.070
suits+0.068
Neg logit contributions
 parent-0.085
 owners-0.077
father-0.073
 parents-0.073
mother-0.072
Token.
Token ID13
Feature activation+4.85e-03

Pos logit contributions
ues+0.064
 handful+0.064
 closely+0.062
suits+0.061
 Whoever+0.060
Neg logit contributions
 parent-0.095
 owners-0.087
 parents-0.082
 guardian-0.080
father-0.079
Token And
Token ID843
Feature activation-7.12e-02

Pos logit contributions
ues+0.005
 handful+0.005
 closely+0.005
suits+0.005
 Whoever+0.005
Neg logit contributions
 parent-0.012
 owners-0.011
 parents-0.010
 guardian-0.010
father-0.010
Token some
Token ID617
Feature activation+1.33e-01

Pos logit contributions
 parent+0.147
 owners+0.134
father+0.131
mother+0.128
 babys+0.128
Neg logit contributions
 handful-0.099
 closely-0.098
ues-0.097
 Whoever-0.095
 sip-0.093
Token owners
Token ID4393
Feature activation-3.50e-01

Pos logit contributions
 handful+0.222
 sip+0.216
sc+0.209
ues+0.207
 harder+0.206
Neg logit contributions
 parent-0.237
father-0.210
 owners-0.207
 guardian-0.206
mother-0.205
Token are
Token ID389
Feature activation-5.94e-02

Pos logit contributions
 parent+0.738
 owners+0.671
 parents+0.630
father+0.622
 guardian+0.618
Neg logit contributions
 handful-0.501
ues-0.493
 sip-0.490
ows-0.487
 closely-0.483
Token not
Token ID407
Feature activation+3.38e-02

Pos logit contributions
 parent+0.124
 owners+0.111
father+0.109
 parents+0.107
mother+0.107
Neg logit contributions
 handful-0.082
 closely-0.082
ues-0.080
 harder-0.076
 sip-0.076
Token careful
Token ID8161
Feature activation+3.68e-02

Pos logit contributions
 handful+0.048
ues+0.046
 closely+0.045
suits+0.044
ows+0.043
Neg logit contributions
 parent-0.071
 owners-0.065
 parents-0.062
father-0.061
mother-0.060
Token.
Token ID13
Feature activation+4.67e-03

Pos logit contributions
 handful+0.055
ues+0.055
suits+0.053
 closely+0.053
 sip+0.052
Neg logit contributions
 parent-0.074
 owners-0.068
 parents-0.064
 guardian-0.062
 family-0.061
Token They
Token ID1119
Feature activation+4.92e-03

Pos logit contributions
ues+0.005
 Whoever+0.004
 handful+0.004
suits+0.004
 closely+0.004
Neg logit contributions
 parent-0.012
 owners-0.011
 parents-0.010
father-0.010
mother-0.010
Token with
Token ID351
Feature activation+4.10e-02

Pos logit contributions
 parent+0.543
 owners+0.493
 parents+0.472
 babys+0.463
 je+0.462
Neg logit contributions
ues-0.308
 handful-0.306
 closely-0.300
suits-0.289
 Whoever-0.280
Token the
Token ID262
Feature activation-3.32e-02

Pos logit contributions
 handful+0.059
 closely+0.059
ues+0.058
 sip+0.055
 harder+0.054
Neg logit contributions
 parent-0.084
 owners-0.076
 guardian-0.072
 babys-0.072
 parents-0.072
Token birds
Token ID10087
Feature activation-1.42e-01

Pos logit contributions
 parent+0.051
 owners+0.045
father+0.043
 babys+0.043
mother+0.042
Neg logit contributions
 handful-0.060
 dice-0.058
 closely-0.058
 harder-0.057
 sip-0.057
Token,
Token ID11
Feature activation+1.58e-02

Pos logit contributions
 parent+0.330
 owners+0.303
 babys+0.290
 parents+0.290
 guardian+0.288
Neg logit contributions
 closely-0.168
 handful-0.162
ues-0.162
ows-0.158
suits-0.153
Token not
Token ID407
Feature activation+3.41e-02

Pos logit contributions
 handful+0.021
 closely+0.021
 harder+0.021
 Whoever+0.021
ues+0.020
Neg logit contributions
 parent-0.032
 owners-0.029
father-0.029
mother-0.028
 parents-0.028
Token owners
Token ID4393
Feature activation-3.50e-01

Pos logit contributions
 handful+0.049
 closely+0.049
ues+0.048
 sip+0.046
 harder+0.046
Neg logit contributions
 parent-0.070
 owners-0.064
 parents-0.060
father-0.060
 guardian-0.059
Token.
Token ID13
Feature activation+4.16e-03

Pos logit contributions
 parent+0.787
 owners+0.716
 parents+0.683
 guardian+0.668
father+0.660
Neg logit contributions
ues-0.448
 closely-0.443
 handful-0.441
ows-0.421
suits-0.421
Token They
Token ID1119
Feature activation+8.84e-03

Pos logit contributions
 handful+0.006
ues+0.006
 closely+0.005
inn+0.005
 sip+0.005
Neg logit contributions
 parent-0.009
 owners-0.008
 parents-0.008
 family-0.008
 guardian-0.008
Token learn
Token ID2193
Feature activation-3.27e-02

Pos logit contributions
 closely+0.012
 sip+0.012
 Whoever+0.011
ues+0.011
 handful+0.011
Neg logit contributions
 parent-0.019
 owners-0.017
 babys-0.016
father-0.016
 parents-0.016
Token to
Token ID284
Feature activation-1.04e-01

Pos logit contributions
 parent+0.072
father+0.065
 owners+0.064
 babys+0.063
mother+0.062
Neg logit contributions
 closely-0.042
ues-0.041
 handful-0.041
 harder-0.040
ows-0.038
Token strike
Token ID5587
Feature activation+5.47e-02

Pos logit contributions
 parent+0.174
 owners+0.154
 parents+0.151
 babys+0.150
father+0.145
Neg logit contributions
 sip-0.186
 closely-0.182
ues-0.181
 handful-0.179
 harder-0.175
Token If
Token ID1002
Feature activation+1.60e-01

Pos logit contributions
ues+0.006
 handful+0.006
 closely+0.006
 Whoever+0.006
suits+0.006
Neg logit contributions
 parent-0.010
 owners-0.009
 parents-0.009
 pilot-0.009
 tou-0.009
Token she
Token ID673
Feature activation-4.15e-02

Pos logit contributions
 closely+0.198
 harder+0.171
 handful+0.168
 Whoever+0.165
 sip+0.164
Neg logit contributions
 parent-0.346
 tou-0.326
 owners-0.324
 je-0.312
father-0.306
Token did
Token ID750
Feature activation+1.08e-01

Pos logit contributions
 parent+0.090
 owners+0.078
father+0.078
 parents+0.077
 ost+0.076
Neg logit contributions
 sip-0.054
 closely-0.054
 Whoever-0.054
suits-0.051
suit-0.050
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
 closely+0.129
 handful+0.122
 harder+0.121
ues+0.116
 sip+0.114
Neg logit contributions
 parent-0.253
 owners-0.228
father-0.227
mother-0.219
 babys-0.216
Token the
Token ID262
Feature activation-3.64e-02

Pos logit contributions
 closely+0.022
 handful+0.021
 harder+0.020
 sip+0.020
 while+0.019
Neg logit contributions
 parent-0.032
 owners-0.029
 tou-0.029
 je-0.029
father-0.028
Token owners
Token ID4393
Feature activation-3.50e-01

Pos logit contributions
 parent+0.051
mother+0.048
father+0.047
 tou+0.046
 bless+0.045
Neg logit contributions
 closely-0.065
 minutes-0.064
 harder-0.063
 handful-0.063
suits-0.060
Token would
Token ID561
Feature activation-1.01e-01

Pos logit contributions
 parent+0.718
 owners+0.656
 North+0.620
 tou+0.619
father+0.618
Neg logit contributions
 closely-0.501
ows-0.493
 sip-0.489
 Whoever-0.481
ues-0.477
Token be
Token ID307
Feature activation-1.52e-01

Pos logit contributions
 parent+0.182
father+0.171
 owners+0.165
mother+0.157
 je+0.155
Neg logit contributions
 sip-0.168
 closely-0.164
 Whoever-0.157
 handful-0.154
 harder-0.152
Token very
Token ID845
Feature activation-2.93e-02

Pos logit contributions
 parent+0.313
 owners+0.283
 parents+0.270
father+0.264
 je+0.261
Neg logit contributions
 handful-0.227
ues-0.221
 closely-0.217
suits-0.209
 harder-0.206
Token angry
Token ID7954
Feature activation+4.85e-02

Pos logit contributions
 parent+0.063
 owners+0.059
father+0.056
mother+0.055
 parents+0.055
Neg logit contributions
 closely-0.040
ues-0.039
 handful-0.039
suits-0.037
 sip-0.036
Token.
Token ID13
Feature activation+4.58e-03

Pos logit contributions
ues+0.064
 handful+0.063
 closely+0.062
suits+0.059
 Whoever+0.059
Neg logit contributions
 parent-0.107
 owners-0.098
 parents-0.093
 tou-0.090
 guardian-0.090
Token does
Token ID857
Feature activation+1.15e-01

Pos logit contributions
 parent+0.014
 owners+0.013
 parents+0.013
 guardian+0.012
father+0.012
Neg logit contributions
ues-0.008
 handful-0.007
 closely-0.007
suits-0.007
 Whoever-0.007
Token not
Token ID407
Feature activation+3.46e-02

Pos logit contributions
suits+0.292
suit+0.278
sc+0.272
ishy+0.264
 handful+0.264
Neg logit contributions
 parent-0.135
 parents-0.119
 owners-0.114
 family-0.102
 guardian-0.096
Token know
Token ID760
Feature activation+3.13e-04

Pos logit contributions
ues+0.050
 handful+0.050
suits+0.048
 closely+0.047
sc+0.046
Neg logit contributions
 parent-0.074
 owners-0.066
 parents-0.063
 family-0.062
 guardian-0.061
Token why
Token ID1521
Feature activation+2.95e-02

Pos logit contributions
 handful+0.000
 closely+0.000
ues+0.000
 harder+0.000
 sip+0.000
Neg logit contributions
 parent-0.001
 owners-0.001
 guardian-0.001
 parents-0.001
 tou-0.001
Token his
Token ID465
Feature activation-9.42e-02

Pos logit contributions
ues+0.038
 handful+0.036
 closely+0.035
 sip+0.035
 harder+0.034
Neg logit contributions
 parent-0.065
 owners-0.060
 guardian-0.058
 tou-0.057
father-0.057
Token owners
Token ID4393
Feature activation-3.50e-01

Pos logit contributions
 parent+0.129
father+0.118
mother+0.116
 owners+0.112
 tou+0.108
Neg logit contributions
ues-0.182
 closely-0.182
suits-0.182
 sip-0.181
 handful-0.181
Token are
Token ID389
Feature activation-5.57e-02

Pos logit contributions
 parent+0.717
 owners+0.654
 parents+0.611
 guardian+0.610
father+0.603
Neg logit contributions
 sip-0.532
 closely-0.511
ows-0.509
 handful-0.506
ues-0.505
Token upset
Token ID9247
Feature activation+4.49e-02

Pos logit contributions
 parent+0.109
 owners+0.099
 parents+0.094
 guardian+0.090
father+0.089
Neg logit contributions
 handful-0.087
ues-0.087
 closely-0.084
suits-0.083
 Whoever-0.082
Token.
Token ID13
Feature activation+3.56e-03

Pos logit contributions
ues+0.058
 handful+0.057
 closely+0.057
 harder+0.054
suits+0.054
Neg logit contributions
 parent-0.099
 owners-0.091
 guardian-0.086
 parents-0.086
 tou-0.085
Token He
Token ID679
Feature activation-6.39e-03

Pos logit contributions
ues+0.005
 handful+0.005
 closely+0.005
suits+0.005
 Whoever+0.005
Neg logit contributions
 parent-0.008
 owners-0.007
 parents-0.007
 guardian-0.006
father-0.006
Token thinks
Token ID6834
Feature activation+1.32e-01

Pos logit contributions
 parent+0.015
 owners+0.014
 parents+0.013
father+0.013
 guardian+0.013
Neg logit contributions
ues-0.007
 handful-0.007
 closely-0.007
 sip-0.007
suits-0.007
Token!
Token ID0
Feature activation+2.31e-02

Pos logit contributions
 parent+0.193
 owners+0.176
 parents+0.166
 family+0.160
mother+0.160
Neg logit contributions
ues-0.170
 handful-0.165
 Whoever-0.160
 sip-0.159
suits-0.158
Token
Token ID220
Feature activation+9.78e-02

Pos logit contributions
ues+0.028
 handful+0.028
 Whoever+0.028
 closely+0.027
inn+0.026
Neg logit contributions
 parent-0.053
 owners-0.048
 parents-0.046
father-0.045
 guardian-0.045
Token
Token ID198
Feature activation-3.88e-02

Pos logit contributions
 handful+0.153
ues+0.151
suits+0.145
 closely+0.144
 sip+0.143
Neg logit contributions
 parent-0.192
 owners-0.175
 parents-0.166
 family-0.157
father-0.156
Token
Token ID198
Feature activation-4.05e-02

Pos logit contributions
 parent+0.081
 owners+0.074
 parents+0.070
 guardian+0.067
father+0.067
Neg logit contributions
ues-0.056
 handful-0.056
 closely-0.054
suits-0.053
 Whoever-0.052
TokenThe
Token ID464
Feature activation-7.57e-02

Pos logit contributions
 parent+0.091
 owners+0.082
 parents+0.080
father+0.077
 je+0.077
Neg logit contributions
 handful-0.052
ues-0.051
 closely-0.050
 Whoever-0.049
suits-0.049
Token owners
Token ID4393
Feature activation-3.49e-01

Pos logit contributions
 parent+0.131
 owners+0.119
father+0.113
mother+0.110
 parents+0.109
Neg logit contributions
 handful-0.127
 closely-0.123
 sip-0.120
suits-0.118
ues-0.118
Token wondered
Token ID14028
Feature activation-5.46e-03

Pos logit contributions
 parent+0.690
 owners+0.628
 tou+0.591
 parents+0.587
father+0.574
Neg logit contributions
 handful-0.545
 closely-0.540
 Whoever-0.513
ues-0.509
 sip-0.508
Token what
Token ID644
Feature activation+1.40e-01

Pos logit contributions
 parent+0.013
 owners+0.012
 tou+0.012
 parents+0.012
 babys+0.012
Neg logit contributions
 closely-0.006
 handful-0.006
ues-0.006
ows-0.005
 harder-0.005
Token was
Token ID373
Feature activation-1.25e-01

Pos logit contributions
 closely+0.153
 handful+0.149
 Whoever+0.142
 harder+0.138
sc+0.135
Neg logit contributions
 parent-0.312
 babys-0.289
father-0.285
 North-0.285
 guardian-0.284
Token wrong
Token ID2642
Feature activation+1.14e-01

Pos logit contributions
 parent+0.253
 owners+0.229
 parents+0.218
father+0.208
 guardian+0.207
Neg logit contributions
 handful-0.187
ues-0.186
 closely-0.183
suits-0.177
 Whoever-0.175
Token.
Token ID13
Feature activation+6.03e-03

Pos logit contributions
ues+0.152
 handful+0.150
 closely+0.149
suits+0.144
ows+0.139
Neg logit contributions
 parent-0.245
 owners-0.223
 parents-0.211
 tou-0.210
father-0.209
Token in
Token ID287
Feature activation+2.05e-02

Pos logit contributions
ues+0.116
 Whoever+0.110
suits+0.109
 sip+0.108
 handful+0.107
Neg logit contributions
 parent-0.095
 owners-0.087
 parents-0.081
 family-0.080
 guardian-0.078
Token the
Token ID262
Feature activation-4.01e-02

Pos logit contributions
ues+0.037
 handful+0.036
suits+0.035
ows+0.035
 closely+0.034
Neg logit contributions
 parent-0.037
 owners-0.033
 parents-0.031
 family-0.030
 guardian-0.029
Token sun
Token ID4252
Feature activation-2.69e-02

Pos logit contributions
 parent+0.087
 owners+0.080
father+0.077
 parents+0.077
mother+0.077
Neg logit contributions
 handful-0.051
suit-0.050
suits-0.050
 harder-0.049
sc-0.049
Token.
Token ID13
Feature activation+5.04e-04

Pos logit contributions
 parent+0.059
 owners+0.055
 parents+0.052
 guardian+0.051
father+0.051
Neg logit contributions
 handful-0.035
ues-0.034
 closely-0.034
suits-0.033
suit-0.033
Token The
Token ID383
Feature activation-4.91e-02

Pos logit contributions
ues+0.001
 handful+0.001
suits+0.001
 sip+0.001
ishy+0.001
Neg logit contributions
 parent-0.001
 owners-0.001
 family-0.001
 parents-0.001
 je-0.001
Token owners
Token ID4393
Feature activation-3.49e-01

Pos logit contributions
 parent+0.092
 owners+0.083
father+0.082
mother+0.082
 parents+0.078
Neg logit contributions
 handful-0.077
suits-0.072
 closely-0.072
 dice-0.072
 Whoever-0.071
Token loved
Token ID6151
Feature activation-4.86e-02

Pos logit contributions
 parent+0.635
 owners+0.583
 parents+0.539
 tou+0.523
 guardian+0.522
Neg logit contributions
 handful-0.594
 closely-0.585
ues-0.584
suits-0.570
 sip-0.565
Token their
Token ID511
Feature activation-1.03e-01

Pos logit contributions
 parent+0.127
 owners+0.119
 guardian+0.117
father+0.114
mother+0.114
Neg logit contributions
 closely-0.042
 handful-0.041
ues-0.038
 Whoever-0.038
 harder-0.036
Token car
Token ID1097
Feature activation-5.89e-02

Pos logit contributions
 parent+0.203
 owners+0.184
father+0.182
mother+0.180
 tou+0.175
Neg logit contributions
 handful-0.149
 closely-0.147
suits-0.145
 Whoever-0.141
ues-0.141
Token very
Token ID845
Feature activation-3.12e-02

Pos logit contributions
 parent+0.138
 owners+0.125
father+0.122
 parents+0.121
 tou+0.121
Neg logit contributions
 handful-0.066
 closely-0.065
ows-0.065
 harder-0.064
ues-0.064
Token much
Token ID881
Feature activation+8.03e-02

Pos logit contributions
 parent+0.070
 owners+0.065
 parents+0.063
 family+0.061
 je+0.059
Neg logit contributions
 handful-0.039
ues-0.039
suits-0.037
sc-0.036
suit-0.036
Token said
Token ID531
Feature activation-4.91e-02

Pos logit contributions
 parent+0.096
 owners+0.090
father+0.085
 parents+0.084
 tou+0.084
Neg logit contributions
 handful-0.055
 closely-0.054
ues-0.053
 Whoever-0.052
suits-0.050
Token.
Token ID13
Feature activation+3.56e-03

Pos logit contributions
 parent+0.112
 owners+0.105
father+0.100
 tou+0.099
 ost+0.099
Neg logit contributions
 closely-0.062
 handful-0.060
ues-0.060
 harder-0.056
 sip-0.055
Token
Token ID198
Feature activation-4.85e-02

Pos logit contributions
 handful+0.004
ues+0.004
 Whoever+0.004
 closely+0.004
inn+0.004
Neg logit contributions
 parent-0.008
 owners-0.007
 parents-0.007
father-0.007
 pilot-0.007
Token
Token ID198
Feature activation-4.87e-02

Pos logit contributions
 parent+0.113
 owners+0.103
 parents+0.098
 guardian+0.097
 pilot+0.097
Neg logit contributions
ues-0.058
 handful-0.057
 closely-0.056
 Whoever-0.056
inn-0.055
TokenThe
Token ID464
Feature activation-8.11e-02

Pos logit contributions
 parent+0.106
 owners+0.095
 parents+0.092
father+0.089
mother+0.088
Neg logit contributions
ues-0.068
 handful-0.067
 Whoever-0.066
 closely-0.065
inn-0.063
Token owners
Token ID4393
Feature activation-3.49e-01

Pos logit contributions
father+0.085
pa+0.079
 known+0.079
 parent+0.078
mother+0.077
Neg logit contributions
 handful-0.138
 harder-0.137
 closely-0.135
 edges-0.132
 minutes-0.132
Token smiled
Token ID13541
Feature activation-3.38e-02

Pos logit contributions
 parent+0.707
 owners+0.659
 tou+0.602
 parents+0.601
 guardian+0.590
Neg logit contributions
 handful-0.543
ues-0.519
 closely-0.517
 sip-0.510
 Whoever-0.500
Token and
Token ID290
Feature activation+6.37e-02

Pos logit contributions
 parent+0.073
 owners+0.066
 parents+0.064
 family+0.061
 guardian+0.061
Neg logit contributions
ues-0.047
 handful-0.046
suits-0.045
 Whoever-0.044
sc-0.044
Token thanked
Token ID26280
Feature activation-2.09e-01

Pos logit contributions
 handful+0.087
 sip+0.080
 closely+0.079
 Whoever+0.078
ues+0.075
Neg logit contributions
 parent-0.145
 owners-0.131
father-0.130
 guardian-0.124
mother-0.122
Token Sally
Token ID25737
Feature activation+1.54e-02

Pos logit contributions
 parent+0.400
 owners+0.362
 parents+0.338
 guardian+0.332
father+0.328
Neg logit contributions
 handful-0.337
ues-0.333
 closely-0.327
 sip-0.321
 Whoever-0.317
Token for
Token ID329
Feature activation+6.77e-03

Pos logit contributions
 handful+0.019
 closely+0.019
ues+0.018
suits+0.017
ows+0.017
Neg logit contributions
 parent-0.036
 owners-0.033
 tou-0.031
 parents-0.031
 guardian-0.031
Token
Token ID198
Feature activation-4.65e-02

Pos logit contributions
 parent+0.106
 owners+0.097
 parents+0.093
 guardian+0.092
 pilot+0.092
Neg logit contributions
ues-0.056
inn-0.055
 Whoever-0.054
 handful-0.054
 closely-0.054
Token"
Token ID1
Feature activation+1.39e-02

Pos logit contributions
 parent+0.103
 owners+0.092
 parents+0.091
 je+0.087
father+0.087
Neg logit contributions
ues-0.062
 Whoever-0.062
 handful-0.061
 closely-0.060
inn-0.060
TokenSure
Token ID19457
Feature activation+3.49e-02

Pos logit contributions
ues+0.022
 handful+0.021
 closely+0.021
suits+0.021
 sip+0.020
Neg logit contributions
 parent-0.028
 owners-0.025
 parents-0.024
 guardian-0.023
father-0.023
Token,"
Token ID553
Feature activation+8.82e-02

Pos logit contributions
 handful+0.047
ues+0.046
 closely+0.044
suits+0.043
 Whoever+0.042
Neg logit contributions
 parent-0.077
 owners-0.071
 parents-0.067
father-0.065
 babys-0.064
Token the
Token ID262
Feature activation-3.16e-02

Pos logit contributions
 handful+0.096
 closely+0.087
 sip+0.086
 harder+0.081
 Whoever+0.081
Neg logit contributions
 parent-0.205
 North-0.192
 owners-0.192
father-0.187
 guardian-0.186
Token owners
Token ID4393
Feature activation-3.48e-01

Pos logit contributions
father+0.029
 known+0.029
pa+0.026
 insepar+0.026
 bless+0.026
Neg logit contributions
 lid-0.053
 handful-0.053
 closely-0.052
suits-0.052
gar-0.052
Token said
Token ID531
Feature activation-4.64e-02

Pos logit contributions
 parent+0.779
 owners+0.739
 tou+0.679
 ost+0.676
 parents+0.667
Neg logit contributions
 handful-0.459
 closely-0.449
ues-0.446
 sip-0.433
inn-0.426
Token.
Token ID13
Feature activation+7.70e-03

Pos logit contributions
 parent+0.094
 owners+0.085
 parents+0.081
 guardian+0.078
father+0.077
Neg logit contributions
ues-0.070
 handful-0.069
 closely-0.067
suits-0.066
 Whoever-0.065
Token
Token ID198
Feature activation-4.61e-02

Pos logit contributions
ues+0.011
 handful+0.011
 closely+0.010
suits+0.010
 sip+0.010
Neg logit contributions
 parent-0.016
 owners-0.015
 parents-0.014
 family-0.014
 babys-0.013
Token
Token ID198
Feature activation-4.62e-02

Pos logit contributions
 parent+0.087
 owners+0.080
 parents+0.075
 family+0.072
 babys+0.071
Neg logit contributions
ues-0.074
 handful-0.074
 closely-0.071
suits-0.071
suit-0.070
TokenSo
Token ID2396
Feature activation-9.60e-02

Pos logit contributions
 parent+0.086
 owners+0.080
 parents+0.073
 family+0.071
 tou+0.071
Neg logit contributions
ues-0.075
 handful-0.074
ows-0.072
 closely-0.072
suits-0.072
Token how
Token ID703
Feature activation+1.10e-02

Pos logit contributions
 parent+0.068
 owners+0.062
 tou+0.060
father+0.059
 pilot+0.059
Neg logit contributions
 closely-0.032
 handful-0.032
 harder-0.031
ues-0.029
 sip-0.028
Token much
Token ID881
Feature activation+8.12e-02

Pos logit contributions
 handful+0.015
 closely+0.015
ues+0.015
 sip+0.014
suits+0.014
Neg logit contributions
 parent-0.024
 owners-0.022
father-0.020
mother-0.020
 parents-0.020
Token he
Token ID339
Feature activation-1.80e-02

Pos logit contributions
 handful+0.094
 closely+0.091
 sip+0.090
 harder+0.089
 slim+0.086
Neg logit contributions
 parent-0.196
 owners-0.177
mother-0.171
father-0.170
 guardian-0.167
Token loved
Token ID6151
Feature activation-4.51e-02

Pos logit contributions
 parent+0.035
 owners+0.032
 parents+0.030
 guardian+0.029
father+0.029
Neg logit contributions
 handful-0.028
ues-0.028
 closely-0.028
suits-0.028
 Whoever-0.027
Token his
Token ID465
Feature activation-9.95e-02

Pos logit contributions
 parent+0.093
 owners+0.084
 parents+0.079
 guardian+0.079
mother+0.078
Neg logit contributions
 handful-0.066
 closely-0.065
ues-0.064
 harder-0.061
 sip-0.061
Token owners
Token ID4393
Feature activation-3.48e-01

Pos logit contributions
 parent+0.141
mother+0.136
 tou+0.128
father+0.128
 bless+0.127
Neg logit contributions
 closely-0.182
suits-0.174
 handful-0.174
 harder-0.172
ues-0.170
Token,
Token ID11
Feature activation+1.84e-02

Pos logit contributions
 parent+0.776
 owners+0.703
 parents+0.663
 guardian+0.658
father+0.657
Neg logit contributions
 handful-0.463
 closely-0.455
 sip-0.434
ues-0.429
suits-0.426
Token so
Token ID523
Feature activation+5.11e-02

Pos logit contributions
 handful+0.018
 harder+0.017
 closely+0.017
 Whoever+0.016
ues+0.015
Neg logit contributions
 parent-0.046
father-0.042
mother-0.042
 owners-0.042
 guardian-0.041
Token he
Token ID339
Feature activation-1.82e-02

Pos logit contributions
 handful+0.066
 closely+0.065
ues+0.062
 harder+0.061
 Whoever+0.059
Neg logit contributions
 parent-0.113
 owners-0.101
father-0.099
mother-0.098
 guardian-0.098
Token ob
Token ID909
Feature activation-1.13e-01

Pos logit contributions
 parent+0.036
father+0.032
 owners+0.031
 parents+0.031
mother+0.031
Neg logit contributions
 closely-0.028
 handful-0.027
 harder-0.027
 Whoever-0.027
ues-0.026
Tokeneyed
Token ID18834
Feature activation-9.22e-02

Pos logit contributions
 parent+0.191
 owners+0.170
 parents+0.159
 guardian+0.151
father+0.150
Neg logit contributions
ues-0.209
 handful-0.208
 closely-0.202
suits-0.200
 Whoever-0.197
Token
Token ID198
Feature activation-4.55e-02

Pos logit contributions
 handful+0.153
ues+0.153
 sip+0.147
 closely+0.146
suits+0.146
Neg logit contributions
 parent-0.164
 owners-0.148
 parents-0.141
 family-0.135
father-0.130
TokenOne
Token ID3198
Feature activation+6.74e-02

Pos logit contributions
 parent+0.071
 owners+0.064
 family+0.060
 parents+0.059
 je+0.055
Neg logit contributions
ues-0.091
ishy-0.089
 handful-0.087
ows-0.085
suit-0.085
Token day
Token ID1110
Feature activation-5.54e-02

Pos logit contributions
 handful+0.132
 closely+0.129
ues+0.129
suits+0.128
 sip+0.128
Neg logit contributions
 parent-0.103
 owners-0.092
 parents-0.086
father-0.082
mother-0.082
Token,
Token ID11
Feature activation+2.36e-02

Pos logit contributions
 parent+0.112
 owners+0.103
 parents+0.097
 guardian+0.094
father+0.093
Neg logit contributions
 handful-0.084
ues-0.083
 closely-0.080
suits-0.079
 sip-0.078
Token his
Token ID465
Feature activation-9.73e-02

Pos logit contributions
ues+0.042
 handful+0.041
 closely+0.040
suits+0.040
 Whoever+0.039
Neg logit contributions
 parent-0.043
 owners-0.038
 parents-0.036
 family-0.034
 guardian-0.034
Token owners
Token ID4393
Feature activation-3.48e-01

Pos logit contributions
mother+0.148
father+0.147
 ost+0.132
 known+0.132
grand+0.130
Neg logit contributions
suits-0.172
suit-0.165
 handful-0.160
 closely-0.157
 spaghetti-0.157
Token took
Token ID1718
Feature activation+1.41e-02

Pos logit contributions
 parent+0.710
 owners+0.655
 guardian+0.618
 parents+0.605
father+0.605
Neg logit contributions
 handful-0.505
suits-0.483
 closely-0.472
 sip-0.465
 Whoever-0.462
Token him
Token ID683
Feature activation-9.72e-02

Pos logit contributions
 handful+0.026
ues+0.026
 closely+0.025
suits+0.025
 Whoever+0.025
Neg logit contributions
 parent-0.024
 owners-0.021
 parents-0.019
 guardian-0.019
father-0.019
Token for
Token ID329
Feature activation+9.13e-03

Pos logit contributions
 parent+0.191
 owners+0.172
 pilot+0.162
 guardian+0.161
 parents+0.161
Neg logit contributions
 handful-0.151
 closely-0.145
suits-0.145
ues-0.141
 sip-0.140
Token a
Token ID257
Feature activation+1.04e-01

Pos logit contributions
 handful+0.011
ues+0.011
 closely+0.010
suits+0.010
 Whoever+0.010
Neg logit contributions
 parent-0.022
 owners-0.020
 parents-0.019
 guardian-0.018
father-0.018
Token long
Token ID890
Feature activation+1.46e-01

Pos logit contributions
ues+0.214
 handful+0.208
 closely+0.204
suits+0.203
ows+0.202
Neg logit contributions
 parent-0.156
 owners-0.134
 parents-0.126
 family-0.120
 je-0.120
Token and
Token ID290
Feature activation+6.18e-02

Pos logit contributions
 handful+0.113
 closely+0.111
suits+0.111
ues+0.108
suit+0.106
Neg logit contributions
 parent-0.224
 owners-0.204
 parents-0.198
 babys-0.192
 ost-0.190
Token Jill
Token ID22772
Feature activation+1.47e-01

Pos logit contributions
 handful+0.059
ues+0.059
 closely+0.055
suits+0.054
 Whoever+0.053
Neg logit contributions
 parent-0.160
 owners-0.149
 parents-0.142
 guardian-0.138
father-0.137
Token became
Token ID2627
Feature activation-1.96e-01

Pos logit contributions
 closely+0.210
 handful+0.206
suits+0.203
 sip+0.201
ues+0.199
Neg logit contributions
 parent-0.309
 owners-0.280
 babys-0.269
 parents-0.267
 ost-0.259
Token the
Token ID262
Feature activation-3.88e-02

Pos logit contributions
 parent+0.336
father+0.290
 owners+0.288
mother+0.288
 babys+0.285
Neg logit contributions
 handful-0.359
 closely-0.332
 harder-0.330
suits-0.315
ues-0.313
Token excited
Token ID6568
Feature activation+9.29e-02

Pos logit contributions
 parent+0.057
father+0.053
mother+0.053
 babys+0.052
grand+0.052
Neg logit contributions
 handful-0.065
 harder-0.063
 closely-0.061
 dice-0.059
 slim-0.059
Token owners
Token ID4393
Feature activation-3.48e-01

Pos logit contributions
 closely+0.181
 handful+0.180
 harder+0.172
ues+0.170
 sip+0.170
Neg logit contributions
 parent-0.116
father-0.111
 ost-0.104
 babys-0.103
mother-0.102
Token of
Token ID286
Feature activation-1.51e-01

Pos logit contributions
 parent+0.811
 owners+0.744
 tou+0.706
 parents+0.701
father+0.690
Neg logit contributions
 handful-0.433
 closely-0.424
ues-0.407
 sip-0.399
ows-0.395
Token the
Token ID262
Feature activation-3.83e-02

Pos logit contributions
 parent+0.308
 owners+0.273
 guardian+0.262
 parents+0.261
 babys+0.260
Neg logit contributions
 handful-0.228
 closely-0.216
ues-0.211
 sip-0.210
 harder-0.208
Token rabbit
Token ID22746
Feature activation-9.18e-02

Pos logit contributions
 parent+0.059
father+0.050
mother+0.050
 owners+0.050
 babys+0.049
Neg logit contributions
 handful-0.072
 closely-0.069
suits-0.067
 sip-0.067
 slim-0.066
Token.
Token ID13
Feature activation+5.62e-03

Pos logit contributions
 parent+0.212
 owners+0.190
 babys+0.185
father+0.182
 parents+0.182
Neg logit contributions
 handful-0.113
 closely-0.111
suits-0.103
ows-0.103
 sip-0.103
Token They
Token ID1119
Feature activation+3.22e-03

Pos logit contributions
ues+0.008
 handful+0.008
suits+0.007
 closely+0.007
ows+0.007
Neg logit contributions
 parent-0.012
 owners-0.011
 parents-0.011
 family-0.010
father-0.010
Token walks
Token ID11114
Feature activation+1.28e-02

Pos logit contributions
ues+0.013
 handful+0.012
suits+0.012
suit+0.012
inn+0.012
Neg logit contributions
 parent-0.014
 owners-0.013
 parents-0.012
 family-0.012
 aer-0.011
Token and
Token ID290
Feature activation+6.50e-02

Pos logit contributions
ues+0.018
 handful+0.018
 closely+0.017
suits+0.017
 Whoever+0.017
Neg logit contributions
 parent-0.027
 owners-0.025
 parents-0.024
 guardian-0.023
father-0.023
Token playing
Token ID2712
Feature activation-4.19e-02

Pos logit contributions
 closely+0.108
 handful+0.105
 sip+0.102
 harder+0.102
sc+0.101
Neg logit contributions
 parent-0.120
 owners-0.102
father-0.100
mother-0.098
 guardian-0.097
Token with
Token ID351
Feature activation+4.50e-02

Pos logit contributions
 parent+0.077
 owners+0.071
 parents+0.066
 family+0.065
 pilot+0.062
Neg logit contributions
ues-0.070
 handful-0.067
suits-0.066
 Whoever-0.065
inn-0.064
Token her
Token ID607
Feature activation-1.21e-01

Pos logit contributions
 handful+0.087
 closely+0.085
 sip+0.082
ues+0.080
 harder+0.080
Neg logit contributions
 parent-0.072
 owners-0.062
 guardian-0.060
 tou-0.058
father-0.058
Token owners
Token ID4393
Feature activation-3.48e-01

Pos logit contributions
 parent+0.148
father+0.145
mother+0.142
 known+0.131
 bless+0.128
Neg logit contributions
suits-0.262
 handful-0.261
suit-0.251
 closely-0.241
 dice-0.232
Token.
Token ID13
Feature activation+5.99e-03

Pos logit contributions
 parent+0.770
 owners+0.702
 parents+0.666
 tou+0.662
 guardian+0.657
Neg logit contributions
 handful-0.463
 closely-0.456
ues-0.440
suits-0.423
 sip-0.420
Token She
Token ID1375
Feature activation-3.43e-02

Pos logit contributions
ues+0.008
 handful+0.008
suits+0.008
ows+0.008
suit+0.007
Neg logit contributions
 parent-0.013
 owners-0.012
 parents-0.011
 family-0.011
 je-0.011
Token always
Token ID1464
Feature activation-1.38e-01

Pos logit contributions
 parent+0.069
 tou+0.061
father+0.061
 owners+0.061
 babys+0.060
Neg logit contributions
 closely-0.048
suits-0.047
 handful-0.044
suit-0.044
 Whoever-0.044
Token dream
Token ID4320
Feature activation-6.88e-02

Pos logit contributions
 parent+0.284
 owners+0.260
 parents+0.246
 tou+0.245
father+0.244
Neg logit contributions
 handful-0.188
suits-0.187
suit-0.182
 closely-0.177
 harder-0.176
Tokent
Token ID83
Feature activation+9.42e-03

Pos logit contributions
 parent+0.163
 owners+0.148
 parents+0.142
 tou+0.140
 je+0.139
Neg logit contributions
ues-0.083
ows-0.080
 closely-0.078
 handful-0.075
sc-0.075
Token beautiful
Token ID4950
Feature activation+1.59e-02

Pos logit contributions
 handful+0.152
ues+0.150
 closely+0.146
suits+0.143
 sip+0.142
Neg logit contributions
 parent-0.196
 owners-0.180
 parents-0.169
father-0.163
mother-0.162
Token car
Token ID1097
Feature activation-5.93e-02

Pos logit contributions
 handful+0.027
ues+0.026
 closely+0.025
 sip+0.025
suits+0.024
Neg logit contributions
 parent-0.030
 owners-0.027
 parents-0.025
father-0.025
mother-0.024
Token cover
Token ID3002
Feature activation+5.69e-02

Pos logit contributions
 parent+0.145
 owners+0.133
 parents+0.129
father+0.128
 babys+0.126
Neg logit contributions
 closely-0.060
ows-0.059
ues-0.059
 harder-0.057
 Whoever-0.057
Token.
Token ID13
Feature activation+9.65e-04

Pos logit contributions
ues+0.073
 handful+0.073
 closely+0.072
ows+0.071
suits+0.069
Neg logit contributions
 parent-0.128
 owners-0.116
 parents-0.110
 tou-0.109
father-0.108
Token The
Token ID383
Feature activation-4.78e-02

Pos logit contributions
ues+0.001
 handful+0.001
 closely+0.001
 Whoever+0.001
suits+0.001
Neg logit contributions
 parent-0.002
 owners-0.002
 parents-0.002
 guardian-0.002
father-0.002
Token owners
Token ID4393
Feature activation-3.48e-01

Pos logit contributions
beh+0.052
grand+0.051
mother+0.049
 tou+0.045
father+0.045
Neg logit contributions
 harder-0.091
 closely-0.087
 minutes-0.087
 edges-0.086
 handful-0.085
Token be
Token ID307
Feature activation-1.57e-01

Pos logit contributions
 parent+0.700
 owners+0.661
 tou+0.626
 parents+0.601
 ost+0.599
Neg logit contributions
 handful-0.519
 closely-0.509
 Whoever-0.497
ues-0.494
 sip-0.479
Tokenamed
Token ID2434
Feature activation-2.99e-02

Pos logit contributions
 parent+0.316
 owners+0.285
 parents+0.271
father+0.262
 je+0.262
Neg logit contributions
 handful-0.240
ues-0.235
 closely-0.225
suits-0.220
 Whoever-0.219
Token with
Token ID351
Feature activation+4.20e-02

Pos logit contributions
 parent+0.069
 owners+0.063
 parents+0.060
father+0.059
 tou+0.058
Neg logit contributions
 handful-0.037
ues-0.037
 closely-0.037
suits-0.034
 Whoever-0.034
Token joy
Token ID8716
Feature activation-1.03e-02

Pos logit contributions
 handful+0.063
 sip+0.059
 closely+0.058
ues+0.057
 Whoever+0.056
Neg logit contributions
 parent-0.086
 owners-0.077
father-0.073
 babys-0.073
 parents-0.072
Token,
Token ID11
Feature activation+1.22e-02

Pos logit contributions
 parent+0.025
 owners+0.023
 parents+0.022
father+0.022
 tou+0.021
Neg logit contributions
 handful-0.012
ues-0.012
 closely-0.011
suits-0.011
ows-0.010
Token and
Token ID290
Feature activation+7.06e-02

Pos logit contributions
 parent+0.186
 owners+0.166
 parents+0.163
father+0.162
 babys+0.159
Neg logit contributions
 closely-0.100
 handful-0.098
ues-0.095
suits-0.095
sc-0.095
Token how
Token ID703
Feature activation+1.51e-02

Pos logit contributions
 closely+0.109
 harder+0.109
 sip+0.107
 handful+0.101
 Whoever+0.099
Neg logit contributions
 parent-0.132
father-0.123
 babys-0.121
mother-0.118
 guardian-0.115
Token to
Token ID284
Feature activation-9.78e-02

Pos logit contributions
 handful+0.022
 closely+0.022
ues+0.021
suits+0.020
ows+0.020
Neg logit contributions
 parent-0.031
 owners-0.028
father-0.027
 babys-0.027
mother-0.027
Token ask
Token ID1265
Feature activation-1.58e-02

Pos logit contributions
 parent+0.189
 owners+0.167
father+0.164
 parents+0.164
mother+0.164
Neg logit contributions
 sip-0.156
 closely-0.152
 handful-0.148
 Whoever-0.144
 harder-0.143
Token their
Token ID511
Feature activation-9.64e-02

Pos logit contributions
 parent+0.033
 owners+0.030
 tou+0.029
 guardian+0.028
 parents+0.028
Neg logit contributions
 closely-0.022
 handful-0.022
ues-0.022
 harder-0.021
ows-0.020
Token owners
Token ID4393
Feature activation-3.48e-01

Pos logit contributions
father+0.133
mother+0.131
 parent+0.131
 tou+0.123
 pilot+0.119
Neg logit contributions
 closely-0.177
 harder-0.174
suits-0.171
 handful-0.167
ues-0.164
Token if
Token ID611
Feature activation+1.32e-01

Pos logit contributions
 parent+0.782
 owners+0.717
 tou+0.677
 parents+0.677
 guardian+0.673
Neg logit contributions
 closely-0.436
 handful-0.436
ues-0.430
suits-0.407
ows-0.404
Token it
Token ID340
Feature activation+2.07e-01

Pos logit contributions
 harder+0.095
 closely+0.085
 while+0.081
 it+0.080
 handful+0.079
Neg logit contributions
 parent-0.332
father-0.321
 guardian-0.319
mother-0.309
 babys-0.300
Token was
Token ID373
Feature activation-1.25e-01

Pos logit contributions
 harder+0.274
sc+0.273
ows+0.265
 closely+0.265
 sip+0.262
Neg logit contributions
 parent-0.438
father-0.417
mother-0.388
 grandparents-0.382
 babys-0.380
Token okay
Token ID8788
Feature activation+5.28e-02

Pos logit contributions
 parent+0.275
 owners+0.251
 parents+0.242
father+0.237
mother+0.235
Neg logit contributions
 handful-0.165
 closely-0.158
ues-0.157
 harder-0.152
 sip-0.149
Token to
Token ID284
Feature activation-1.00e-01

Pos logit contributions
 handful+0.069
ues+0.068
 closely+0.067
ows+0.065
suits+0.064
Neg logit contributions
 parent-0.115
 owners-0.105
 tou-0.100
 parents-0.100
father-0.100
TokenThe
Token ID464
Feature activation-7.99e-02

Pos logit contributions
 parent+0.095
 owners+0.086
 parents+0.081
 guardian+0.078
 family+0.078
Neg logit contributions
ues-0.075
 handful-0.074
 closely-0.072
suits-0.071
 sip-0.070
Token pup
Token ID15552
Feature activation-9.85e-02

Pos logit contributions
father+0.115
mother+0.114
 ost+0.111
 parent+0.110
 known+0.105
Neg logit contributions
 harder-0.132
 handful-0.130
 closely-0.128
suit-0.128
suits-0.127
Token even
Token ID772
Feature activation-3.62e-02

Pos logit contributions
 parent+0.213
father+0.189
 owners+0.183
 babys+0.181
mother+0.180
Neg logit contributions
 closely-0.132
 handful-0.128
 Whoever-0.126
suits-0.125
suit-0.121
Token followed
Token ID3940
Feature activation-4.41e-02

Pos logit contributions
 parent+0.070
 owners+0.063
 parents+0.060
 guardian+0.058
father+0.057
Neg logit contributions
 handful-0.058
ues-0.057
 closely-0.056
suits-0.055
 sip-0.054
Token the
Token ID262
Feature activation-3.53e-02

Pos logit contributions
 parent+0.092
 owners+0.083
father+0.081
 guardian+0.080
 parents+0.079
Neg logit contributions
 closely-0.063
 handful-0.060
 harder-0.058
ues-0.058
 Whoever-0.057
Token owners
Token ID4393
Feature activation-3.48e-01

Pos logit contributions
father+0.055
 parent+0.054
mother+0.053
pa+0.049
 ost+0.048
Neg logit contributions
 handful-0.061
 closely-0.058
 harder-0.058
suits-0.058
suit-0.057
Token into
Token ID656
Feature activation-4.03e-02

Pos logit contributions
 parent+0.787
 owners+0.728
 tou+0.686
 parents+0.679
father+0.673
Neg logit contributions
 closely-0.466
 handful-0.451
ues-0.427
 Whoever-0.422
 sip-0.421
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 parent+0.079
 owners+0.072
 parents+0.068
father+0.067
 tou+0.066
Neg logit contributions
 handful-0.063
ues-0.060
 closely-0.060
 sip-0.058
suits-0.058
Token small
Token ID1402
Feature activation+1.22e-01

Pos logit contributions
 handful+0.155
 closely+0.146
ues+0.142
suits+0.142
 sip+0.141
Neg logit contributions
 parent-0.204
 owners-0.186
 parents-0.179
father-0.176
mother-0.172
Token cave
Token ID11527
Feature activation-7.36e-02

Pos logit contributions
 handful+0.261
 closely+0.249
suits+0.249
ues+0.249
 sip+0.243
Neg logit contributions
 parent-0.170
 owners-0.150
 parents-0.140
father-0.133
mother-0.129
Token.
Token ID13
Feature activation+2.70e-03

Pos logit contributions
 parent+0.159
 owners+0.145
 parents+0.138
father+0.134
 tou+0.133
Neg logit contributions
ues-0.101
 handful-0.101
 closely-0.098
suits-0.095
ows-0.093
Token keep
Token ID1394
Feature activation-1.07e-01

Pos logit contributions
 parent+0.198
 owners+0.177
father+0.169
 babys+0.168
 parents+0.167
Neg logit contributions
 sip-0.167
 handful-0.167
 closely-0.166
 Whoever-0.158
ues-0.155
Token up
Token ID510
Feature activation+9.81e-02

Pos logit contributions
 parent+0.254
 owners+0.231
 parents+0.220
 babys+0.219
father+0.216
Neg logit contributions
 handful-0.130
 closely-0.128
ues-0.119
 Whoever-0.116
 harder-0.115
Token.
Token ID13
Feature activation+3.08e-03

Pos logit contributions
 handful+0.129
 closely+0.126
 harder+0.114
ues+0.114
suits+0.109
Neg logit contributions
 parent-0.226
 owners-0.202
father-0.195
 babys-0.192
 parents-0.192
Token When
Token ID1649
Feature activation+7.98e-02

Pos logit contributions
ues+0.004
 handful+0.004
 closely+0.003
suits+0.003
 Whoever+0.003
Neg logit contributions
 parent-0.007
 owners-0.007
 parents-0.006
 guardian-0.006
father-0.006
Token the
Token ID262
Feature activation-3.45e-02

Pos logit contributions
 handful+0.086
 harder+0.086
 closely+0.082
 Whoever+0.078
 sip+0.073
Neg logit contributions
 parent-0.195
father-0.179
 owners-0.177
mother-0.175
 guardian-0.172
Token owners
Token ID4393
Feature activation-3.48e-01

Pos logit contributions
father+0.056
 parent+0.053
mother+0.053
 known+0.052
 ost+0.051
Neg logit contributions
suit-0.055
 handful-0.054
 sip-0.052
 closely-0.051
suits-0.051
Token said
Token ID531
Feature activation-5.03e-02

Pos logit contributions
 parent+0.736
 owners+0.670
 parents+0.617
father+0.607
 guardian+0.590
Neg logit contributions
 handful-0.516
 closely-0.514
 Whoever-0.502
suits-0.491
 sip-0.487
Token "
Token ID366
Feature activation+2.77e-02

Pos logit contributions
 parent+0.110
 owners+0.101
 parents+0.095
father+0.094
 guardian+0.093
Neg logit contributions
 handful-0.068
ues-0.067
 closely-0.065
suits-0.063
 Whoever-0.062
TokenStop
Token ID19485
Feature activation+6.02e-02

Pos logit contributions
 handful+0.042
ues+0.042
 closely+0.040
suits+0.040
 Whoever+0.039
Neg logit contributions
 parent-0.056
 owners-0.050
 parents-0.048
 guardian-0.046
father-0.046
Token,"
Token ID553
Feature activation+8.60e-02

Pos logit contributions
ues+0.083
 handful+0.083
 closely+0.079
suits+0.078
 Whoever+0.077
Neg logit contributions
 parent-0.129
 owners-0.118
 parents-0.112
 guardian-0.108
father-0.108
Token the
Token ID262
Feature activation-3.48e-02

Pos logit contributions
 handful+0.116
ues+0.110
 Whoever+0.110
 closely+0.108
 sip+0.107
Neg logit contributions
 parent-0.188
 owners-0.170
father-0.163
 parents-0.162
 guardian-0.160
Token and
Token ID290
Feature activation+6.55e-02

Pos logit contributions
ues+0.002
 handful+0.001
suits+0.001
 closely+0.001
 Whoever+0.001
Neg logit contributions
 parent-0.003
 owners-0.002
 parents-0.002
 guardian-0.002
 family-0.002
Token found
Token ID1043
Feature activation+1.02e-01

Pos logit contributions
ues+0.101
 handful+0.099
suits+0.096
 closely+0.095
 Whoever+0.093
Neg logit contributions
 parent-0.131
 owners-0.119
 parents-0.113
 family-0.108
 guardian-0.108
Token a
Token ID257
Feature activation+1.03e-01

Pos logit contributions
ues+0.153
suits+0.146
 handful+0.142
 Whoever+0.139
suit+0.139
Neg logit contributions
 parent-0.215
 owners-0.196
 parents-0.188
 family-0.180
 guardian-0.179
Token l
Token ID300
Feature activation-6.83e-02

Pos logit contributions
 handful+0.151
 sip+0.137
suits+0.133
 slim+0.130
 closely+0.129
Neg logit contributions
 parent-0.214
father-0.202
 owners-0.200
mother-0.195
pa-0.192
Tokenily
Token ID813
Feature activation+1.09e-01

Pos logit contributions
 parent+0.143
 owners+0.132
 parents+0.125
 guardian+0.123
mother+0.121
Neg logit contributions
 handful-0.095
ues-0.094
suits-0.092
 closely-0.092
 Whoever-0.091
Token pad
Token ID14841
Feature activation+1.79e-01

Pos logit contributions
 handful+0.164
ues+0.162
 closely+0.157
suits+0.156
 sip+0.153
Neg logit contributions
 parent-0.220
 owners-0.200
 parents-0.190
father-0.183
 guardian-0.182
Token that
Token ID326
Feature activation+1.38e-02

Pos logit contributions
 handful+0.259
ues+0.258
suits+0.245
 closely+0.245
ows+0.242
Neg logit contributions
 parent-0.375
 owners-0.340
 parents-0.323
father-0.314
 pilot-0.309
Token was
Token ID373
Feature activation-1.23e-01

Pos logit contributions
 slim+0.020
 handful+0.019
 sip+0.019
 closely+0.019
sc+0.018
Neg logit contributions
 parent-0.029
 owners-0.026
father-0.026
 pilot-0.025
mother-0.025
Token just
Token ID655
Feature activation+1.34e-02

Pos logit contributions
 parent+0.255
 owners+0.231
 parents+0.221
father+0.218
 pilot+0.213
Neg logit contributions
 handful-0.184
 closely-0.176
ues-0.171
 sip-0.169
inn-0.167
Token the
Token ID262
Feature activation-3.59e-02

Pos logit contributions
ues+0.020
 handful+0.019
suits+0.019
 Whoever+0.018
 closely+0.018
Neg logit contributions
 parent-0.028
 owners-0.025
 parents-0.024
 guardian-0.023
 family-0.023
Token right
Token ID826
Feature activation+4.94e-02

Pos logit contributions
 parent+0.063
father+0.059
pa+0.059
mother+0.057
 owners+0.054
Neg logit contributions
 handful-0.057
 slim-0.056
 harder-0.056
 sip-0.055
 closely-0.052
Token hard
Token ID1327
Feature activation+1.70e-01

Pos logit contributions
ues+0.128
abouts+0.115
 Whoever+0.115
sc+0.114
suit+0.111
Neg logit contributions
 parent-0.166
 owners-0.145
 family-0.145
 tou-0.140
 guardian-0.140
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
 handful+0.302
ues+0.298
suits+0.292
 sip+0.288
suit+0.285
Neg logit contributions
 parent-0.298
 owners-0.271
 parents-0.254
 family-0.241
father-0.241
Token He
Token ID679
Feature activation+1.95e-03

Pos logit contributions
ues+0.020
 handful+0.020
suits+0.019
 closely+0.019
 sip+0.019
Neg logit contributions
 parent-0.030
 owners-0.027
 parents-0.026
 family-0.025
 je-0.025
Token was
Token ID373
Feature activation-1.21e-01

Pos logit contributions
 closely+0.003
 handful+0.003
ues+0.003
 Whoever+0.003
 sip+0.003
Neg logit contributions
 parent-0.004
 owners-0.004
 parents-0.004
 guardian-0.004
father-0.004
Token very
Token ID845
Feature activation-1.95e-02

Pos logit contributions
 parent+0.236
 owners+0.215
 parents+0.203
 guardian+0.195
 family+0.194
Neg logit contributions
ues-0.192
 handful-0.189
suits-0.185
 closely-0.179
sc-0.178
Token sad
Token ID6507
Feature activation+1.35e-01

Pos logit contributions
 parent+0.041
 owners+0.037
 parents+0.035
 guardian+0.034
 family+0.034
Neg logit contributions
ues-0.028
 handful-0.028
suits-0.027
 closely-0.027
 Whoever-0.027
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
ues+0.180
 handful+0.179
suits+0.171
 closely+0.170
 Whoever+0.166
Neg logit contributions
 parent-0.298
 owners-0.273
 parents-0.260
 guardian-0.250
 family-0.249
Token
Token ID198
Feature activation-3.78e-02

Pos logit contributions
ues+0.018
 handful+0.018
suits+0.017
 closely+0.017
 sip+0.017
Neg logit contributions
 parent-0.024
 owners-0.022
 parents-0.021
 family-0.020
 babys-0.020
Token
Token ID198
Feature activation-3.91e-02

Pos logit contributions
 parent+0.069
 owners+0.063
 parents+0.059
 family+0.057
father+0.056
Neg logit contributions
 handful-0.065
ues-0.065
suits-0.062
suit-0.062
ishy-0.061
TokenOne
Token ID3198
Feature activation+7.22e-02

Pos logit contributions
 parent+0.085
 owners+0.078
 parents+0.074
 guardian+0.072
father+0.071
Neg logit contributions
ues-0.053
 handful-0.053
 closely-0.050
suits-0.050
 Whoever-0.049
Token day
Token ID1110
Feature activation-5.21e-02

Pos logit contributions
 handful+0.139
 closely+0.138
 sip+0.136
inn+0.134
ues+0.133
Neg logit contributions
 parent-0.112
 owners-0.101
 parents-0.094
father-0.090
mother-0.089
Token curious
Token ID11040
Feature activation+1.10e-01

Pos logit contributions
 parent+0.263
 owners+0.238
 parents+0.227
father+0.224
mother+0.222
Neg logit contributions
 handful-0.176
 closely-0.173
ues-0.170
 harder-0.164
 Whoever-0.162
Token,
Token ID11
Feature activation+1.66e-02

Pos logit contributions
ues+0.185
 handful+0.182
suits+0.180
inn+0.176
sc+0.174
Neg logit contributions
 parent-0.207
 owners-0.188
 parents-0.180
 family-0.177
mother-0.167
Token so
Token ID523
Feature activation+5.11e-02

Pos logit contributions
ues+0.026
 handful+0.024
inn+0.024
suits+0.024
 sip+0.023
Neg logit contributions
 parent-0.033
 owners-0.031
 family-0.029
 parents-0.029
 je-0.028
Token she
Token ID673
Feature activation-3.88e-02

Pos logit contributions
 closely+0.073
 handful+0.070
 harder+0.067
 sip+0.065
ues+0.064
Neg logit contributions
 parent-0.108
 owners-0.098
father-0.095
mother-0.094
 babys-0.093
Token went
Token ID1816
Feature activation-1.42e-01

Pos logit contributions
 parent+0.073
 owners+0.066
 parents+0.063
father+0.061
mother+0.060
Neg logit contributions
 handful-0.063
ues-0.063
 closely-0.062
suits-0.061
 Whoever-0.060
Token closer
Token ID5699
Feature activation+2.04e-01

Pos logit contributions
 parent+0.317
 owners+0.293
 parents+0.281
 guardian+0.273
 babys+0.269
Neg logit contributions
ues-0.185
 handful-0.181
suits-0.170
 Whoever-0.165
 closely-0.165
Token.
Token ID13
Feature activation+3.81e-03

Pos logit contributions
ues+0.398
 handful+0.395
inn+0.381
suits+0.380
ishy+0.378
Neg logit contributions
 parent-0.315
 owners-0.283
 parents-0.269
 family-0.257
 guardian-0.251
Token When
Token ID1649
Feature activation+8.04e-02

Pos logit contributions
ues+0.006
 handful+0.006
suits+0.006
inn+0.006
sc+0.006
Neg logit contributions
 parent-0.007
 owners-0.007
 parents-0.006
 family-0.006
 babys-0.006
Token she
Token ID673
Feature activation-3.94e-02

Pos logit contributions
 harder+0.098
 closely+0.096
 handful+0.095
 sip+0.094
 while+0.089
Neg logit contributions
 parent-0.182
 owners-0.171
father-0.163
 guardian-0.159
mother-0.158
Token got
Token ID1392
Feature activation+1.61e-02

Pos logit contributions
 parent+0.088
 owners+0.081
 parents+0.077
father+0.074
 guardian+0.073
Neg logit contributions
 handful-0.053
 closely-0.051
ues-0.051
 sip-0.050
 Whoever-0.050
Token to
Token ID284
Feature activation-1.01e-01

Pos logit contributions
 handful+0.014
 closely+0.014
ues+0.013
suits+0.013
 sip+0.013
Neg logit contributions
 parent-0.043
 owners-0.040
 parents-0.038
father-0.037
 je-0.037
Token Molly
Token ID30236
Feature activation+9.05e-02

Pos logit contributions
ues+0.108
 handful+0.106
suits+0.103
 closely+0.103
 Whoever+0.101
Neg logit contributions
 parent-0.116
 owners-0.103
 parents-0.099
 family-0.093
 guardian-0.093
Token enjoyed
Token ID8359
Feature activation-4.91e-02

Pos logit contributions
 handful+0.132
 sip+0.130
 closely+0.125
ues+0.119
suits+0.118
Neg logit contributions
 parent-0.193
 owners-0.178
father-0.168
 guardian-0.165
 tou-0.165
Token their
Token ID511
Feature activation-1.03e-01

Pos logit contributions
 parent+0.086
 owners+0.077
 parents+0.073
 family+0.070
 guardian+0.068
Neg logit contributions
ues-0.088
 handful-0.087
suits-0.085
 closely-0.084
 Whoever-0.083
Token dinner
Token ID8073
Feature activation+5.31e-03

Pos logit contributions
 parent+0.217
 parents+0.199
 owners+0.196
 family+0.194
 je+0.185
Neg logit contributions
ues-0.149
 handful-0.138
 Whoever-0.137
ows-0.135
suits-0.134
Token.
Token ID13
Feature activation-1.08e-03

Pos logit contributions
 handful+0.006
ues+0.006
 closely+0.006
suits+0.006
ows+0.006
Neg logit contributions
 parent-0.013
 owners-0.011
 parents-0.011
 tou-0.011
 guardian-0.011
Token As
Token ID1081
Feature activation+1.26e-01

Pos logit contributions
 parent+0.002
 owners+0.002
 parents+0.002
 family+0.002
 babys+0.002
Neg logit contributions
ues-0.002
 handful-0.002
suits-0.002
 dice-0.002
hell-0.002
Token they
Token ID484
Feature activation-3.22e-02

Pos logit contributions
 handful+0.168
ues+0.165
 closely+0.162
 sip+0.158
suits+0.156
Neg logit contributions
 parent-0.276
 owners-0.254
 parents-0.238
 guardian-0.236
father-0.236
Token ate
Token ID15063
Feature activation+3.25e-02

Pos logit contributions
 parent+0.066
 owners+0.060
 parents+0.055
 guardian+0.055
 pilot+0.054
Neg logit contributions
 handful-0.050
 sip-0.049
ues-0.047
 closely-0.047
suits-0.047
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
ues+0.069
suits+0.067
suit+0.067
 Whoever+0.067
sc+0.065
Neg logit contributions
 parent-0.041
 parents-0.041
 family-0.039
 owners-0.035
 aer-0.033
Token Molly
Token ID30236
Feature activation+8.95e-02

Pos logit contributions
ues+0.014
 handful+0.014
suits+0.014
ows+0.013
sc+0.013
Neg logit contributions
 parent-0.026
 owners-0.024
 parents-0.023
 family-0.022
 babys-0.022
Token thought
Token ID1807
Feature activation+1.31e-01

Pos logit contributions
 closely+0.129
 handful+0.124
 sip+0.120
ues+0.119
suits+0.117
Neg logit contributions
 parent-0.184
 owners-0.166
father-0.161
 tou-0.159
 guardian-0.157
Token am
Token ID716
Feature activation-1.29e-01

Pos logit contributions
 parent+0.177
 owners+0.160
 parents+0.151
 guardian+0.144
father+0.143
Neg logit contributions
ues-0.158
 handful-0.158
 closely-0.152
suits-0.151
 Whoever-0.148
Token exploring
Token ID13504
Feature activation-5.36e-02

Pos logit contributions
 parent+0.253
 owners+0.235
father+0.216
 tou+0.214
 parents+0.213
Neg logit contributions
 handful-0.199
ues-0.195
 closely-0.194
 Whoever-0.187
 sip-0.183
Token.
Token ID13
Feature activation+8.68e-03

Pos logit contributions
 parent+0.103
 owners+0.094
 parents+0.090
 family+0.087
 guardian+0.086
Neg logit contributions
ues-0.086
 handful-0.083
suits-0.081
inn-0.081
 Whoever-0.080
Token Do
Token ID2141
Feature activation+1.71e-01

Pos logit contributions
ues+0.011
 handful+0.011
 Whoever+0.011
 closely+0.011
suits+0.010
Neg logit contributions
 parent-0.019
 owners-0.018
 parents-0.017
father-0.016
mother-0.016
Token you
Token ID345
Feature activation-6.30e-02

Pos logit contributions
 Whoever+0.239
ues+0.236
 handful+0.229
 closely+0.229
 dice+0.224
Neg logit contributions
 parent-0.332
 owners-0.304
 babys-0.301
 guardian-0.296
mother-0.286
Token want
Token ID765
Feature activation+1.44e-01

Pos logit contributions
 parent+0.146
 owners+0.137
 parents+0.128
 babys+0.126
father+0.126
Neg logit contributions
ues-0.074
 closely-0.073
 handful-0.073
 sip-0.071
 Whoever-0.070
Token to
Token ID284
Feature activation-9.74e-02

Pos logit contributions
 handful+0.208
 closely+0.198
 sip+0.197
ues+0.196
 harder+0.191
Neg logit contributions
 parent-0.300
 owners-0.272
 parents-0.254
father-0.254
 guardian-0.253
Token come
Token ID1282
Feature activation-1.05e-01

Pos logit contributions
 parent+0.192
 owners+0.185
father+0.183
mother+0.181
 babys+0.172
Neg logit contributions
 sip-0.137
 closely-0.136
 handful-0.130
 Whoever-0.125
 harder-0.121
Token with
Token ID351
Feature activation+4.43e-02

Pos logit contributions
 parent+0.226
 owners+0.209
 parents+0.198
father+0.192
 babys+0.192
Neg logit contributions
 handful-0.140
ues-0.139
 closely-0.137
suits-0.131
 Whoever-0.130
Token me
Token ID502
Feature activation-8.32e-02

Pos logit contributions
 handful+0.081
 closely+0.080
ues+0.078
 sip+0.076
suits+0.076
Neg logit contributions
 parent-0.075
 owners-0.067
 parents-0.062
 guardian-0.061
 tou-0.060
Token?"
Token ID1701
Feature activation+1.15e-01

Pos logit contributions
 parent+0.172
 owners+0.158
 parents+0.148
 tou+0.144
 je+0.143
Neg logit contributions
 handful-0.121
ues-0.120
 closely-0.117
suits-0.115
 sip-0.111
Token in
Token ID287
Feature activation+1.88e-02

Pos logit contributions
 handful+0.295
ues+0.278
 closely+0.274
suits+0.271
inn+0.264
Neg logit contributions
 parent-0.445
 owners-0.407
 parents-0.382
 guardian-0.375
 babys-0.374
Token the
Token ID262
Feature activation-3.02e-02

Pos logit contributions
ues+0.036
ows+0.035
suits+0.034
suit+0.034
sc+0.034
Neg logit contributions
 parent-0.032
 owners-0.028
 parents-0.027
 family-0.027
 babys-0.025
Token oven
Token ID14361
Feature activation+1.25e-02

Pos logit contributions
 parent+0.065
 owners+0.060
 parents+0.056
 guardian+0.055
father+0.055
Neg logit contributions
 handful-0.042
ues-0.041
suits-0.039
 closely-0.039
 sip-0.039
Token and
Token ID290
Feature activation+6.94e-02

Pos logit contributions
ues+0.015
 handful+0.015
 closely+0.014
suits+0.014
 Whoever+0.014
Neg logit contributions
 parent-0.029
 owners-0.027
 parents-0.026
 guardian-0.025
 family-0.025
Token waited
Token ID13488
Feature activation+3.23e-02

Pos logit contributions
 handful+0.100
ues+0.097
 sip+0.096
 closely+0.095
suits+0.093
Neg logit contributions
 parent-0.146
 owners-0.134
 guardian-0.126
 parents-0.124
father-0.123
Token.
Token ID13
Feature activation+9.29e-03

Pos logit contributions
suits+0.055
 Whoever+0.054
 handful+0.054
suit+0.054
war+0.054
Neg logit contributions
 parent-0.055
 parents-0.050
 family-0.047
 owners-0.046
 aer-0.043
Token
Token ID198
Feature activation-4.15e-02

Pos logit contributions
 handful+0.016
ues+0.015
hell+0.015
ishy+0.015
suits+0.015
Neg logit contributions
 parent-0.018
 owners-0.016
 parents-0.015
 family-0.014
 aunt-0.014
Token
Token ID198
Feature activation-4.16e-02

Pos logit contributions
 parent+0.076
 owners+0.068
 parents+0.065
 family+0.063
 babys+0.060
Neg logit contributions
 handful-0.072
ues-0.070
suit-0.069
suits-0.068
ishy-0.068
TokenWhen
Token ID2215
Feature activation+7.46e-02

Pos logit contributions
 parent+0.093
 owners+0.086
 parents+0.081
 family+0.078
 guardian+0.077
Neg logit contributions
 handful-0.055
ues-0.054
 closely-0.051
suits-0.051
ows-0.050
Token the
Token ID262
Feature activation-2.89e-02

Pos logit contributions
 handful+0.089
 sip+0.087
 harder+0.084
ues+0.083
 closely+0.082
Neg logit contributions
 parent-0.167
 owners-0.157
 guardian-0.156
father-0.156
mother-0.149
Token cake
Token ID12187
Feature activation+1.56e-02

Pos logit contributions
 parent+0.065
 owners+0.060
 parents+0.056
 guardian+0.056
father+0.056
Neg logit contributions
 handful-0.037
ues-0.035
 sip-0.035
 closely-0.034
suits-0.033
TokenTim
Token ID14967
Feature activation+1.06e-02

Pos logit contributions
 je+0.067
 owners+0.067
 parent+0.067
 family+0.060
 neighbors+0.058
Neg logit contributions
ues-0.094
ows-0.092
suit-0.089
 handful-0.087
ishy-0.086
Tokenmy
Token ID1820
Feature activation-2.18e-03

Pos logit contributions
 handful+0.018
 closely+0.018
ues+0.018
suits+0.018
inn+0.017
Neg logit contributions
 parent-0.019
 owners-0.017
 parents-0.016
 guardian-0.015
 tou-0.015
Token sat
Token ID3332
Feature activation-1.96e-02

Pos logit contributions
 parent+0.004
 owners+0.004
 guardian+0.004
father+0.004
 parents+0.004
Neg logit contributions
 closely-0.003
suits-0.003
 handful-0.003
 Whoever-0.003
 harder-0.003
Token in
Token ID287
Feature activation+2.21e-02

Pos logit contributions
 parent+0.040
 owners+0.037
 parents+0.035
 family+0.034
 guardian+0.033
Neg logit contributions
ues-0.030
 handful-0.029
 sip-0.029
 Whoever-0.028
sc-0.028
Token the
Token ID262
Feature activation-3.60e-02

Pos logit contributions
ues+0.041
 handful+0.039
suits+0.039
 closely+0.039
ows+0.038
Neg logit contributions
 parent-0.038
 owners-0.034
 parents-0.032
 family-0.031
 babys-0.030
Token chair
Token ID5118
Feature activation+5.35e-02

Pos logit contributions
 parent+0.083
 owners+0.075
father+0.074
mother+0.073
 guardian+0.070
Neg logit contributions
 handful-0.045
 sip-0.041
 closely-0.041
 harder-0.040
suit-0.040
Token and
Token ID290
Feature activation+6.36e-02

Pos logit contributions
ues+0.068
 handful+0.068
suits+0.064
 closely+0.063
 Whoever+0.063
Neg logit contributions
 parent-0.120
 owners-0.111
 parents-0.106
 family-0.102
mother-0.101
Token the
Token ID262
Feature activation-3.58e-02

Pos logit contributions
ues+0.103
 handful+0.103
 closely+0.099
suits+0.098
 Whoever+0.096
Neg logit contributions
 parent-0.121
 owners-0.110
 parents-0.104
 guardian-0.099
 family-0.098
Token bar
Token ID2318
Feature activation-1.20e-02

Pos logit contributions
 parent+0.078
 owners+0.070
mother+0.070
father+0.070
 guardian+0.066
Neg logit contributions
 handful-0.047
 harder-0.044
 closely-0.043
suits-0.043
 sip-0.042
Tokenber
Token ID527
Feature activation-1.43e-01

Pos logit contributions
 parent+0.019
 owners+0.016
 tou+0.015
 parents+0.015
 guardian+0.015
Neg logit contributions
ues-0.024
 handful-0.023
suits-0.023
 closely-0.023
ows-0.023
Token started
Token ID2067
Feature activation-6.79e-02

Pos logit contributions
 parent+0.286
 owners+0.254
 guardian+0.246
 tou+0.245
father+0.237
Neg logit contributions
 handful-0.218
suits-0.212
 closely-0.212
ues-0.210
ows-0.205
Token and
Token ID290
Feature activation+6.76e-02

Pos logit contributions
suits+0.045
 handful+0.044
 sip+0.043
ows+0.043
 closely+0.043
Neg logit contributions
 parent-0.062
father-0.056
 owners-0.055
mother-0.054
 parents-0.054
Token Spot
Token ID15899
Feature activation-5.64e-02

Pos logit contributions
 sip+0.055
 harder+0.054
 handful+0.052
 closely+0.045
 Whoever+0.045
Neg logit contributions
 guardian-0.163
ich-0.156
 North-0.154
 ost-0.154
father-0.153
Token were
Token ID547
Feature activation-1.29e-01

Pos logit contributions
 parent+0.110
father+0.100
 owners+0.096
 guardian+0.095
mother+0.095
Neg logit contributions
suits-0.089
 handful-0.088
 sip-0.086
ows-0.085
suit-0.082
Token happy
Token ID3772
Feature activation+7.17e-02

Pos logit contributions
 parent+0.289
father+0.261
 owners+0.257
mother+0.254
 parents+0.249
Neg logit contributions
 handful-0.161
 closely-0.160
ows-0.153
 harder-0.152
ues-0.147
Token and
Token ID290
Feature activation+6.69e-02

Pos logit contributions
 handful+0.089
ues+0.088
 closely+0.087
 sip+0.085
 harder+0.082
Neg logit contributions
 parent-0.162
 owners-0.148
father-0.141
 guardian-0.140
 tou-0.139
Token relieved
Token ID22598
Feature activation-8.83e-03

Pos logit contributions
ows+0.080
suits+0.080
suit+0.077
 handful+0.075
 slim+0.075
Neg logit contributions
father-0.151
 parent-0.148
mother-0.145
pa-0.134
 babys-0.133
Token.
Token ID13
Feature activation+7.32e-03

Pos logit contributions
 parent+0.021
 owners+0.019
father+0.018
 guardian+0.018
 pilot+0.018
Neg logit contributions
ues-0.010
 closely-0.010
 handful-0.010
 sip-0.010
 harder-0.010
Token They
Token ID1119
Feature activation+7.55e-03

Pos logit contributions
 handful+0.010
ues+0.010
 closely+0.009
inn+0.009
 sip+0.009
Neg logit contributions
 parent-0.016
 owners-0.015
 parents-0.014
 family-0.014
 tou-0.013
Token opened
Token ID4721
Feature activation+2.85e-02

Pos logit contributions
suits+0.010
 handful+0.010
 sip+0.010
ows+0.010
 Whoever+0.010
Neg logit contributions
 parent-0.016
father-0.015
 owners-0.014
mother-0.014
 parents-0.014
Token the
Token ID262
Feature activation-3.54e-02

Pos logit contributions
ues+0.058
sc+0.055
suits+0.055
suit+0.054
ows+0.054
Neg logit contributions
 parent-0.045
 owners-0.039
 parents-0.038
 family-0.035
 aer-0.035
Token door
Token ID3420
Feature activation-2.77e-03

Pos logit contributions
 parent+0.068
 owners+0.063
father+0.062
mother+0.060
 parents+0.059
Neg logit contributions
 handful-0.054
 closely-0.050
 harder-0.050
 sip-0.050
ues-0.049
Token fun
Token ID1257
Feature activation+9.35e-02

Pos logit contributions
ues+0.037
inn+0.034
suit+0.034
ows+0.033
ares+0.032
Neg logit contributions
 parent-0.041
 owners-0.038
 family-0.038
 aer-0.037
 parents-0.037
Token with
Token ID351
Feature activation+4.62e-02

Pos logit contributions
ues+0.156
suits+0.153
 handful+0.152
suit+0.149
 sip+0.145
Neg logit contributions
 parent-0.172
 owners-0.157
 parents-0.148
 family-0.146
 aer-0.143
Token her
Token ID607
Feature activation-1.23e-01

Pos logit contributions
 handful+0.096
 closely+0.094
ues+0.093
 sip+0.091
 Whoever+0.089
Neg logit contributions
 parent-0.066
 owners-0.059
 guardian-0.054
 parents-0.053
mother-0.052
Token friends
Token ID2460
Feature activation-2.37e-01

Pos logit contributions
father+0.194
mother+0.194
 known+0.194
 tou+0.184
 guardian+0.184
Neg logit contributions
 handful-0.209
suits-0.196
 spaghetti-0.196
 round-0.190
 sip-0.188
Token.
Token ID13
Feature activation+6.39e-03

Pos logit contributions
 parent+0.487
 owners+0.444
 parents+0.421
 guardian+0.407
 tou+0.404
Neg logit contributions
 handful-0.350
ues-0.348
 closely-0.338
suits-0.332
 Whoever-0.326
Token One
Token ID1881
Feature activation+8.27e-02

Pos logit contributions
ues+0.010
 handful+0.010
suits+0.009
suit+0.009
ishy+0.009
Neg logit contributions
 parent-0.013
 owners-0.011
 family-0.011
 parents-0.011
 aunt-0.011
Token day
Token ID1110
Feature activation-5.20e-02

Pos logit contributions
 handful+0.168
ues+0.165
 closely+0.164
 sip+0.162
suits+0.162
Neg logit contributions
 parent-0.122
 owners-0.108
 parents-0.100
 guardian-0.094
father-0.094
Token,
Token ID11
Feature activation+1.95e-02

Pos logit contributions
 parent+0.084
 parents+0.072
 owners+0.071
 family+0.070
 babys+0.069
Neg logit contributions
ues-0.101
ows-0.096
 closely-0.096
 Whoever-0.096
 handful-0.095
Token Lily
Token ID20037
Feature activation+9.51e-02

Pos logit contributions
ues+0.040
ows+0.040
suits+0.038
 kilograms+0.037
war+0.037
Neg logit contributions
 parent-0.032
 family-0.028
 babys-0.027
 parents-0.027
 owners-0.025
Token's
Token ID338
Feature activation-1.13e-01

Pos logit contributions
 closely+0.143
suits+0.143
 handful+0.140
ues+0.136
 sip+0.135
Neg logit contributions
 parent-0.183
 owners-0.167
 guardian-0.161
 parents-0.159
father-0.158
Token mom
Token ID1995
Feature activation-2.02e-01

Pos logit contributions
father+0.157
 ost+0.152
mother+0.151
 parent+0.145
 known+0.145
Neg logit contributions
 handful-0.209
suits-0.205
 sip-0.203
suit-0.200
 closely-0.196
Token notebook
Token ID20922
Feature activation+6.80e-02

Pos logit contributions
 parent+0.184
 owners+0.164
 parents+0.148
 tou+0.145
 je+0.145
Neg logit contributions
ues-0.237
 handful-0.233
 closely-0.232
suits-0.226
 sip-0.223
Token.
Token ID13
Feature activation+1.44e-03

Pos logit contributions
 closely+0.073
ues+0.071
ows+0.070
 handful+0.066
 harder+0.065
Neg logit contributions
 parent-0.162
 ost-0.151
 babys-0.146
 owners-0.146
 je-0.145
Token But
Token ID887
Feature activation+7.53e-02

Pos logit contributions
 handful+0.003
ues+0.003
suits+0.003
ishy+0.003
suit+0.003
Neg logit contributions
 parent-0.002
 owners-0.002
 parents-0.002
 family-0.002
 babys-0.002
Token when
Token ID618
Feature activation+5.82e-02

Pos logit contributions
ues+0.107
 handful+0.107
 closely+0.103
suits+0.101
 Whoever+0.100
Neg logit contributions
 parent-0.159
 owners-0.145
 parents-0.137
 guardian-0.132
father-0.132
Token she
Token ID673
Feature activation-3.86e-02

Pos logit contributions
 closely+0.082
 handful+0.082
 sip+0.081
 harder+0.080
ues+0.077
Neg logit contributions
 parent-0.120
 owners-0.110
father-0.106
 guardian-0.104
 je-0.104
Token got
Token ID1392
Feature activation+1.47e-02

Pos logit contributions
 parent+0.084
 owners+0.077
 parents+0.072
 ost+0.071
 guardian+0.071
Neg logit contributions
 sip-0.053
 closely-0.052
 handful-0.052
 Whoever-0.050
ues-0.050
Token to
Token ID284
Feature activation-1.02e-01

Pos logit contributions
 handful+0.026
 closely+0.026
 harder+0.025
ues+0.025
 sip+0.024
Neg logit contributions
 parent-0.025
 owners-0.023
 tou-0.021
 guardian-0.021
 ost-0.021
Token school
Token ID1524
Feature activation-5.91e-02

Pos logit contributions
 parent+0.152
 owners+0.144
 ost+0.133
 guardian+0.131
father+0.130
Neg logit contributions
 closely-0.194
 harder-0.188
 sip-0.186
 Whoever-0.185
ues-0.183
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
 parent+0.127
 owners+0.116
 parents+0.109
 tou+0.109
 guardian+0.107
Neg logit contributions
ues-0.082
 closely-0.079
 handful-0.078
 Whoever-0.076
suits-0.075
Token she
Token ID673
Feature activation-3.81e-02

Pos logit contributions
 handful+0.025
 closely+0.025
ues+0.024
 sip+0.024
 harder+0.024
Neg logit contributions
 parent-0.031
 owners-0.028
father-0.026
 guardian-0.026
 je-0.026
Token did
Token ID750
Feature activation+1.06e-01

Pos logit contributions
 parent+0.082
 owners+0.075
 ost+0.075
 guardian+0.074
 tou+0.074
Neg logit contributions
 closely-0.048
ues-0.045
suits-0.044
 harder-0.044
 Whoever-0.043
TokenLater
Token ID18602
Feature activation+5.63e-02

Pos logit contributions
 parent+0.056
 owners+0.051
 parents+0.049
 guardian+0.047
 babys+0.047
Neg logit contributions
ues-0.035
 handful-0.035
 closely-0.033
 Whoever-0.033
inn-0.033
Token that
Token ID326
Feature activation+2.09e-02

Pos logit contributions
 closely+0.058
 handful+0.058
 sip+0.056
ues+0.054
 harder+0.052
Neg logit contributions
 parent-0.133
 owners-0.126
 tou-0.117
father-0.117
 guardian-0.116
Token day
Token ID1110
Feature activation-4.54e-02

Pos logit contributions
 handful+0.038
 closely+0.037
 sip+0.037
inn+0.034
sc+0.034
Neg logit contributions
 parent-0.034
 owners-0.033
father-0.029
 parents-0.029
 tou-0.029
Token,
Token ID11
Feature activation+3.22e-02

Pos logit contributions
 parent+0.105
 owners+0.099
father+0.094
 guardian+0.093
 tou+0.091
Neg logit contributions
 handful-0.056
inn-0.051
ues-0.051
suits-0.050
 closely-0.049
Token Lily
Token ID20037
Feature activation+1.11e-01

Pos logit contributions
 handful+0.036
ues+0.034
 closely+0.033
 Whoever+0.033
suits+0.033
Neg logit contributions
 parent-0.076
 owners-0.071
father-0.067
 parents-0.067
 guardian-0.066
Token's
Token ID338
Feature activation-1.03e-01

Pos logit contributions
 closely+0.150
suits+0.143
 harder+0.140
 handful+0.133
ows+0.131
Neg logit contributions
 parent-0.223
 owners-0.213
 guardian-0.207
 ost-0.204
 tou-0.203
Token grandma
Token ID49890
Feature activation-1.55e-01

Pos logit contributions
 ost+0.149
 known+0.148
father+0.146
mother+0.143
 tou+0.142
Neg logit contributions
 handful-0.173
 closely-0.168
 spaghetti-0.162
suits-0.158
 harder-0.158
Token came
Token ID1625
Feature activation-2.64e-02

Pos logit contributions
 parent+0.326
 owners+0.319
 guardian+0.312
 tou+0.297
 ost+0.297
Neg logit contributions
 closely-0.182
 sip-0.172
 handful-0.171
 harder-0.169
ues-0.161
Token to
Token ID284
Feature activation-8.65e-02

Pos logit contributions
 parent+0.058
 tou+0.057
grand+0.056
 owners+0.056
father+0.055
Neg logit contributions
 closely-0.031
 harder-0.031
 handful-0.029
inn-0.027
suits-0.025
Token visit
Token ID3187
Feature activation-1.63e-01

Pos logit contributions
 owners+0.145
 parent+0.138
 guardian+0.127
father+0.125
 ost+0.123
Neg logit contributions
 sip-0.156
 handful-0.148
 closely-0.139
 harder-0.136
 spaghetti-0.135
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
 parent+0.297
 owners+0.269
 parents+0.250
 guardian+0.247
 tou+0.243
Neg logit contributions
 handful-0.276
ues-0.275
 closely-0.273
suits-0.265
 sip-0.263
TokenHello
Token ID15496
Feature activation-1.63e-02

Pos logit contributions
ues+0.046
 handful+0.045
 closely+0.044
suits+0.043
ows+0.042
Neg logit contributions
 parent-0.059
 owners-0.054
 parents-0.050
 guardian-0.049
father-0.049
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
 parent+0.030
 owners+0.027
 parents+0.026
 family+0.025
 je+0.024
Neg logit contributions
ues-0.027
 handful-0.027
 closely-0.026
suits-0.026
inn-0.026
Token kids
Token ID3988
Feature activation-6.19e-02

Pos logit contributions
ues+0.021
suits+0.020
 handful+0.019
 closely+0.019
inn+0.019
Neg logit contributions
 parent-0.035
 owners-0.031
 parents-0.031
 family-0.030
 je-0.030
Token.
Token ID13
Feature activation+6.44e-03

Pos logit contributions
 parent+0.144
 owners+0.134
 parents+0.125
 tou+0.121
 guardian+0.120
Neg logit contributions
 handful-0.076
ues-0.075
suits-0.073
 closely-0.071
ows-0.071
Token Are
Token ID4231
Feature activation+8.04e-02

Pos logit contributions
 handful+0.008
ues+0.008
 Whoever+0.007
suits+0.007
 closely+0.007
Neg logit contributions
 parent-0.015
 owners-0.014
 parents-0.013
father-0.013
mother-0.012
Token you
Token ID345
Feature activation-6.39e-02

Pos logit contributions
 handful+0.141
 closely+0.137
ues+0.137
 Whoever+0.136
 sip+0.133
Neg logit contributions
 parent-0.141
 owners-0.129
 guardian-0.116
 parents-0.114
 babys-0.112
Token OK
Token ID7477
Feature activation+5.09e-02

Pos logit contributions
 parent+0.124
 owners+0.112
 parents+0.107
 family+0.102
 guardian+0.102
Neg logit contributions
ues-0.103
 handful-0.101
suits-0.098
 closely-0.098
 Whoever-0.096
Token?
Token ID30
Feature activation+1.77e-02

Pos logit contributions
ues+0.061
 handful+0.061
 closely+0.059
 sip+0.058
inn+0.057
Neg logit contributions
 parent-0.112
 owners-0.108
 parents-0.099
father-0.099
 tou-0.098
Token Did
Token ID7731
Feature activation+8.42e-02

Pos logit contributions
ues+0.022
 handful+0.021
 closely+0.020
suits+0.020
 Whoever+0.020
Neg logit contributions
 parent-0.041
 owners-0.037
 parents-0.036
father-0.034
 guardian-0.034
Token you
Token ID345
Feature activation-6.49e-02

Pos logit contributions
 closely+0.125
 Whoever+0.123
 handful+0.121
ues+0.120
 sip+0.118
Neg logit contributions
 parent-0.164
 owners-0.151
 guardian-0.146
 babys-0.141
father-0.140
Token see
Token ID766
Feature activation+6.09e-02

Pos logit contributions
 parent+0.129
 owners+0.127
 parents+0.117
father+0.115
mother+0.113
Neg logit contributions
 sip-0.102
 handful-0.093
 closely-0.092
ues-0.089
 Whoever-0.087
Token anymore
Token ID7471
Feature activation+9.65e-02

Pos logit contributions
ues+0.086
 handful+0.085
suits+0.081
 closely+0.080
 Whoever+0.080
Neg logit contributions
 parent-0.107
 owners-0.097
 parents-0.092
 family-0.089
 guardian-0.087
Token.
Token ID13
Feature activation-6.38e-04

Pos logit contributions
ues+0.151
 handful+0.150
suits+0.144
 closely+0.143
inn+0.141
Neg logit contributions
 parent-0.190
 owners-0.173
 parents-0.165
 family-0.160
 guardian-0.156
Token I
Token ID314
Feature activation+1.69e-02

Pos logit contributions
 parent+0.001
 owners+0.001
 parents+0.001
father+0.001
mother+0.001
Neg logit contributions
ues-0.001
 handful-0.001
 Whoever-0.001
 closely-0.001
suits-0.001
Token want
Token ID765
Feature activation+1.39e-01

Pos logit contributions
 sip+0.026
 handful+0.026
ues+0.026
suits+0.026
 closely+0.026
Neg logit contributions
 parent-0.033
 owners-0.030
 parents-0.028
father-0.028
mother-0.027
Token to
Token ID284
Feature activation-1.04e-01

Pos logit contributions
 spaghetti+0.167
 handful+0.164
 harder+0.163
 closely+0.162
 sip+0.160
Neg logit contributions
 parent-0.312
 owners-0.290
 guardian-0.289
father-0.280
 pilot-0.278
Token go
Token ID467
Feature activation-1.19e-01

Pos logit contributions
father+0.202
 owners+0.197
 parent+0.196
mother+0.195
 ost+0.185
Neg logit contributions
 sip-0.153
 closely-0.128
 handful-0.122
 harder-0.120
 while-0.119
Token outside
Token ID2354
Feature activation+3.74e-02

Pos logit contributions
 parent+0.229
 owners+0.209
 parents+0.194
father+0.192
 tou+0.188
Neg logit contributions
 handful-0.189
ues-0.185
 closely-0.184
suits-0.179
 sip-0.177
Token and
Token ID290
Feature activation+5.96e-02

Pos logit contributions
ues+0.047
 handful+0.047
 closely+0.045
suits+0.045
 Whoever+0.043
Neg logit contributions
 parent-0.084
 owners-0.077
 parents-0.074
 guardian-0.071
father-0.071
Token see
Token ID766
Feature activation+6.14e-02

Pos logit contributions
 sip+0.098
 handful+0.094
 closely+0.093
 harder+0.090
 Whoever+0.090
Neg logit contributions
 parent-0.109
 owners-0.102
father-0.100
mother-0.099
 parents-0.092
Token the
Token ID262
Feature activation-3.50e-02

Pos logit contributions
 handful+0.079
 closely+0.078
ues+0.075
 harder+0.073
 sip+0.070
Neg logit contributions
 parent-0.136
 owners-0.126
 babys-0.118
 parents-0.118
mother-0.117
Token pigeon
Token ID46387
Feature activation-3.04e-02

Pos logit contributions
 parent+0.061
father+0.054
mother+0.053
 owners+0.053
 babys+0.051
Neg logit contributions
 handful-0.059
 closely-0.057
 harder-0.055
 slim-0.054
 sip-0.054
Token lov
Token ID42900
Feature activation-3.23e-02

Pos logit contributions
 parent+0.078
 owners+0.072
 parents+0.068
 guardian+0.066
father+0.066
Neg logit contributions
ues-0.042
 handful-0.042
 closely-0.040
suits-0.040
 Whoever-0.039
Tokeningly
Token ID4420
Feature activation-4.43e-02

Pos logit contributions
 parent+0.071
 owners+0.065
 parents+0.062
 guardian+0.059
 tou+0.059
Neg logit contributions
ues-0.044
 handful-0.043
 closely-0.042
suits-0.041
inn-0.040
Token.
Token ID13
Feature activation+7.58e-03

Pos logit contributions
 parent+0.089
 owners+0.082
 parents+0.079
 family+0.075
 guardian+0.074
Neg logit contributions
ues-0.066
 handful-0.065
suits-0.065
 Whoever-0.064
sc-0.063
Token "
Token ID366
Feature activation+3.00e-02

Pos logit contributions
ues+0.010
 handful+0.010
suits+0.009
 closely+0.009
 sip+0.009
Neg logit contributions
 parent-0.017
 owners-0.016
 parents-0.015
 guardian-0.015
 family-0.015
TokenThat
Token ID2504
Feature activation+3.86e-02

Pos logit contributions
ues+0.048
 handful+0.047
suits+0.045
 closely+0.045
ows+0.044
Neg logit contributions
 parent-0.059
 owners-0.053
 parents-0.050
 guardian-0.048
father-0.048
Token's
Token ID338
Feature activation-1.17e-01

Pos logit contributions
 closely+0.053
 handful+0.051
 harder+0.050
 sip+0.048
sc+0.048
Neg logit contributions
 parent-0.078
 owners-0.072
father-0.070
 babys-0.069
 tou-0.069
Token so
Token ID523
Feature activation+4.93e-02

Pos logit contributions
 parent+0.251
 owners+0.229
 parents+0.219
 babys+0.211
father+0.209
Neg logit contributions
 handful-0.161
ues-0.160
 closely-0.157
 Whoever-0.150
suits-0.149
Token adventurous
Token ID39770
Feature activation-2.31e-02

Pos logit contributions
 handful+0.067
ues+0.066
 closely+0.066
suits+0.063
 sip+0.062
Neg logit contributions
 parent-0.107
 owners-0.098
 parents-0.092
father-0.090
 babys-0.089
Token of
Token ID286
Feature activation-1.50e-01

Pos logit contributions
 parent+0.048
 owners+0.044
 parents+0.042
 babys+0.040
father+0.040
Neg logit contributions
 handful-0.033
ues-0.032
 closely-0.032
suits-0.031
 sip-0.031
Token you
Token ID345
Feature activation-6.33e-02

Pos logit contributions
 parent+0.290
 owners+0.260
 babys+0.248
 tou+0.238
 parents+0.238
Neg logit contributions
 closely-0.236
 handful-0.233
ues-0.233
 harder-0.223
 Whoever-0.220
Token!"
Token ID2474
Feature activation+1.24e-01

Pos logit contributions
 parent+0.134
 owners+0.126
father+0.117
 tou+0.117
 babys+0.116
Neg logit contributions
 closely-0.084
ues-0.084
 handful-0.082
ows-0.080
suits-0.078
Token.
Token ID13
Feature activation+1.10e-03

Pos logit contributions
 parent+0.113
 owners+0.103
 parents+0.098
 guardian+0.094
father+0.094
Neg logit contributions
 handful-0.072
ues-0.072
 closely-0.069
suits-0.068
 Whoever-0.067
Token Jenny
Token ID28437
Feature activation+1.06e-01

Pos logit contributions
 handful+0.002
ows+0.002
hell+0.002
suits+0.002
ues+0.002
Neg logit contributions
 parent-0.002
 parents-0.002
 owners-0.002
 babys-0.002
 family-0.002
Token smiled
Token ID13541
Feature activation-3.56e-02

Pos logit contributions
ues+0.174
 handful+0.174
 closely+0.171
suits+0.167
 Whoever+0.165
Neg logit contributions
 parent-0.199
 owners-0.180
 parents-0.169
 guardian-0.164
father-0.163
Token as
Token ID355
Feature activation+1.19e-01

Pos logit contributions
 parent+0.082
 owners+0.076
 parents+0.073
 guardian+0.070
 family+0.070
Neg logit contributions
ues-0.044
 handful-0.043
suits-0.041
 closely-0.041
 Whoever-0.040
Token she
Token ID673
Feature activation-4.18e-02

Pos logit contributions
 closely+0.189
ues+0.187
 handful+0.183
 harder+0.175
 Whoever+0.173
Neg logit contributions
 parent-0.227
 owners-0.214
father-0.196
 je-0.194
 ost-0.194
Token knew
Token ID2993
Feature activation-1.11e-01

Pos logit contributions
 parent+0.085
 owners+0.077
 parents+0.073
 babys+0.071
 guardian+0.071
Neg logit contributions
ues-0.063
 handful-0.062
 closely-0.061
suits-0.060
 Whoever-0.059
Token what
Token ID644
Feature activation+1.35e-01

Pos logit contributions
 parent+0.265
 owners+0.244
 tou+0.231
 babys+0.230
 je+0.230
Neg logit contributions
ues-0.125
 closely-0.125
 handful-0.123
 harder-0.112
 Whoever-0.111
Token she
Token ID673
Feature activation-4.18e-02

Pos logit contributions
 handful+0.160
ues+0.156
 closely+0.155
 sip+0.142
 Whoever+0.137
Neg logit contributions
 parent-0.296
 owners-0.274
 babys-0.267
 guardian-0.266
 tou-0.262
Token was
Token ID373
Feature activation-1.28e-01

Pos logit contributions
 parent+0.080
 owners+0.072
 parents+0.069
 guardian+0.065
father+0.065
Neg logit contributions
ues-0.067
 handful-0.067
 closely-0.065
suits-0.065
 Whoever-0.063
Token supposed
Token ID4385
Feature activation-4.98e-02

Pos logit contributions
 parent+0.199
 owners+0.176
 parents+0.164
 guardian+0.157
 family+0.156
Neg logit contributions
 handful-0.252
ues-0.251
suits-0.245
 sip-0.240
inn-0.239
Token to
Token ID284
Feature activation-1.03e-01

Pos logit contributions
 parent+0.098
 owners+0.089
 parents+0.082
 guardian+0.081
 babys+0.081
Neg logit contributions
 handful-0.077
ues-0.076
 closely-0.075
 Whoever-0.073
suits-0.070
TokenOne
Token ID3198
Feature activation+6.96e-02

Pos logit contributions
 parent+0.074
 owners+0.069
 parents+0.063
 guardian+0.062
father+0.062
Neg logit contributions
 handful-0.053
ues-0.052
 closely-0.050
suits-0.049
 sip-0.049
Token day
Token ID1110
Feature activation-5.48e-02

Pos logit contributions
 handful+0.139
ues+0.137
 closely+0.135
suits+0.134
 sip+0.134
Neg logit contributions
 parent-0.106
 owners-0.094
 parents-0.086
mother-0.082
father-0.082
Token,
Token ID11
Feature activation+1.96e-02

Pos logit contributions
 parent+0.115
 owners+0.105
 parents+0.098
 guardian+0.096
father+0.096
Neg logit contributions
 handful-0.080
ues-0.078
suits-0.075
 closely-0.075
 Whoever-0.073
Token Tom
Token ID4186
Feature activation+1.14e-01

Pos logit contributions
ues+0.028
 handful+0.028
 closely+0.026
suits+0.026
 sip+0.026
Neg logit contributions
 parent-0.041
 owners-0.038
 parents-0.036
 family-0.035
 guardian-0.035
Token's
Token ID338
Feature activation-1.14e-01

Pos logit contributions
suits+0.154
 closely+0.149
 handful+0.145
suit+0.145
ues+0.141
Neg logit contributions
 parent-0.240
 owners-0.217
 guardian-0.209
 parents-0.207
 ost-0.206
Token mom
Token ID1995
Feature activation-1.96e-01

Pos logit contributions
 parent+0.161
mother+0.156
father+0.152
 tou+0.144
 ost+0.143
Neg logit contributions
 handful-0.213
 closely-0.208
 harder-0.204
suits-0.201
 dice-0.201
Token said
Token ID531
Feature activation-4.60e-02

Pos logit contributions
 parent+0.398
 owners+0.365
 guardian+0.353
 parents+0.346
 ost+0.344
Neg logit contributions
 closely-0.273
suits-0.269
 handful-0.267
ows-0.264
ues-0.264
Token he
Token ID339
Feature activation-1.54e-02

Pos logit contributions
 parent+0.097
 owners+0.089
father+0.082
 parents+0.082
 guardian+0.082
Neg logit contributions
 handful-0.065
ues-0.065
 closely-0.062
suits-0.060
 Whoever-0.060
Token had
Token ID550
Feature activation-1.05e-02

Pos logit contributions
 parent+0.032
 owners+0.029
 guardian+0.028
 parents+0.028
 ost+0.028
Neg logit contributions
suits-0.021
ows-0.020
ues-0.020
 handful-0.020
suit-0.020
Token to
Token ID284
Feature activation-1.01e-01

Pos logit contributions
 parent+0.023
 owners+0.020
 guardian+0.020
mother+0.019
father+0.019
Neg logit contributions
 handful-0.015
ues-0.014
 topp-0.014
 harder-0.013
suits-0.013
Token go
Token ID467
Feature activation-1.16e-01

Pos logit contributions
 parent+0.189
 owners+0.180
 ost+0.173
father+0.167
 guardian+0.166
Neg logit contributions
 sip-0.159
 handful-0.144
 harder-0.142
 closely-0.141
ues-0.138
Token's
Token ID338
Feature activation-1.13e-01

Pos logit contributions
ues+0.388
 handful+0.369
suits+0.366
inn+0.366
 Whoever+0.360
Neg logit contributions
 parent-0.345
 owners-0.305
 parents-0.291
 family-0.282
 guardian-0.280
Token so
Token ID523
Feature activation+5.03e-02

Pos logit contributions
 parent+0.203
 owners+0.185
 family+0.181
 parents+0.175
 je+0.173
Neg logit contributions
ues-0.205
suits-0.193
 Whoever-0.188
 handful-0.185
war-0.185
Token pretty
Token ID2495
Feature activation+9.76e-02

Pos logit contributions
ues+0.082
 Whoever+0.078
inn+0.075
 handful+0.075
 harder+0.074
Neg logit contributions
 parent-0.095
 family-0.087
 owners-0.085
 parents-0.084
 tou-0.083
Token,"
Token ID553
Feature activation+8.97e-02

Pos logit contributions
ues+0.132
 Whoever+0.128
inn+0.127
 handful+0.125
suits+0.125
Neg logit contributions
 parent-0.202
 owners-0.186
 family-0.182
 parents-0.182
 aer-0.178
Token said
Token ID531
Feature activation-4.78e-02

Pos logit contributions
 handful+0.132
 closely+0.130
ues+0.125
 sip+0.124
 Whoever+0.123
Neg logit contributions
 parent-0.160
 guardian-0.155
 owners-0.152
 ost-0.150
 tou-0.150
Token mom
Token ID1995
Feature activation-1.99e-01

Pos logit contributions
 owners+0.095
 parent+0.094
 guardian+0.088
father+0.088
 tou+0.088
Neg logit contributions
 handful-0.069
 sip-0.064
ues-0.061
 closely-0.061
 harder-0.060
Tokenmy
Token ID1820
Feature activation-8.94e-03

Pos logit contributions
 parent+0.309
 owners+0.268
 parents+0.252
father+0.240
mother+0.239
Neg logit contributions
ues-0.397
 handful-0.396
suits-0.384
 closely-0.380
 Whoever-0.377
Token.
Token ID13
Feature activation+4.29e-03

Pos logit contributions
 parent+0.018
 owners+0.017
 parents+0.016
 guardian+0.015
 tou+0.015
Neg logit contributions
 handful-0.013
ues-0.013
 closely-0.013
suits-0.012
 sip-0.012
Token
Token ID198
Feature activation-4.31e-02

Pos logit contributions
ues+0.007
 handful+0.006
ishy+0.006
 closely+0.006
suits+0.006
Neg logit contributions
 parent-0.009
 owners-0.008
 parents-0.008
 family-0.007
 babys-0.007
Token
Token ID198
Feature activation-4.41e-02

Pos logit contributions
 parent+0.075
 owners+0.067
 parents+0.063
 family+0.061
 babys+0.059
Neg logit contributions
 handful-0.079
ishy-0.078
ues-0.077
suit-0.076
suits-0.074
TokenL
Token ID43
Feature activation+2.66e-02

Pos logit contributions
 parent+0.088
 owners+0.080
 parents+0.072
 family+0.071
 neighbors+0.070
Neg logit contributions
 handful-0.072
ishy-0.070
ues-0.069
ows-0.069
 sip-0.068
Token it
Token ID340
Feature activation+2.06e-01

Pos logit contributions
ues+0.104
suit+0.099
suits+0.099
ows+0.098
inn+0.093
Neg logit contributions
 parent-0.107
 owners-0.099
 parents-0.099
 family-0.096
 aer-0.092
Token?"
Token ID1701
Feature activation+1.08e-01

Pos logit contributions
ues+0.308
 handful+0.307
 closely+0.295
suits+0.294
 Whoever+0.287
Neg logit contributions
 parent-0.416
 owners-0.379
 parents-0.360
 guardian-0.345
father-0.343
Token
Token ID198
Feature activation-4.83e-02

Pos logit contributions
 closely+0.197
 sip+0.197
ues+0.196
 Whoever+0.196
 handful+0.193
Neg logit contributions
 parent-0.165
 guardian-0.154
 owners-0.149
father-0.145
 tou-0.144
Token
Token ID198
Feature activation-4.83e-02

Pos logit contributions
 parent+0.112
 owners+0.104
 guardian+0.103
 ost+0.100
father+0.100
Neg logit contributions
ues-0.052
 closely-0.051
 handful-0.050
 Whoever-0.050
inn-0.049
TokenHer
Token ID9360
Feature activation-1.26e-01

Pos logit contributions
 parent+0.083
father+0.078
 ost+0.077
 owners+0.076
mother+0.075
Neg logit contributions
 closely-0.079
ues-0.079
 Whoever-0.077
 sip-0.076
suits-0.074
Token mom
Token ID1995
Feature activation-2.01e-01

Pos logit contributions
 ost+0.170
father+0.169
 parent+0.169
mother+0.168
 tou+0.165
Neg logit contributions
 handful-0.230
 closely-0.227
 sip-0.224
suits-0.219
 harder-0.216
Token replied
Token ID8712
Feature activation-1.62e-02

Pos logit contributions
 parent+0.390
 owners+0.353
 parents+0.334
mother+0.319
father+0.319
Neg logit contributions
ues-0.324
 handful-0.321
suits-0.309
 closely-0.307
 Whoever-0.299
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
 parent+0.023
 parents+0.020
 owners+0.019
 babys+0.018
 family+0.018
Neg logit contributions
 dice-0.034
suits-0.034
war-0.033
suit-0.033
ues-0.033
Token "
Token ID366
Feature activation+2.73e-02

Pos logit contributions
ues+0.025
suits+0.024
 handful+0.023
ishy+0.023
sc+0.023
Neg logit contributions
 parent-0.033
 owners-0.030
 parents-0.029
 family-0.028
 babys-0.028
TokenNo
Token ID2949
Feature activation+2.48e-02

Pos logit contributions
ues+0.044
 handful+0.043
 closely+0.041
suits+0.041
ows+0.041
Neg logit contributions
 parent-0.053
 owners-0.048
 parents-0.045
 family-0.043
 guardian-0.043
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
ues+0.043
 handful+0.041
suits+0.041
suit+0.041
 closely+0.040
Neg logit contributions
 parent-0.046
 owners-0.041
 parents-0.040
 family-0.038
 babys-0.037
Token.
Token ID13
Feature activation+2.45e-03

Pos logit contributions
ues+0.085
 handful+0.079
suits+0.079
 Whoever+0.079
suit+0.077
Neg logit contributions
 parent-0.129
 owners-0.118
 parents-0.115
 family-0.112
 je-0.110
Token One
Token ID1881
Feature activation+8.08e-02

Pos logit contributions
ues+0.004
 handful+0.004
 closely+0.003
suits+0.003
suit+0.003
Neg logit contributions
 parent-0.005
 owners-0.005
 je-0.004
 family-0.004
 parents-0.004
Token day
Token ID1110
Feature activation-5.22e-02

Pos logit contributions
 handful+0.165
ues+0.162
 closely+0.160
suits+0.159
 sip+0.159
Neg logit contributions
 parent-0.118
 owners-0.104
 parents-0.096
father-0.091
 guardian-0.091
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
 parent+0.088
 owners+0.078
 parents+0.075
 je+0.074
 family+0.073
Neg logit contributions
ues-0.097
 handful-0.092
 closely-0.092
suits-0.091
 Whoever-0.091
Token his
Token ID465
Feature activation-9.85e-02

Pos logit contributions
ues+0.032
ows+0.030
suits+0.030
 handful+0.029
 closely+0.029
Neg logit contributions
 parent-0.025
 owners-0.022
 family-0.021
 je-0.021
 parents-0.021
Token mom
Token ID1995
Feature activation-2.02e-01

Pos logit contributions
mother+0.172
father+0.172
 known+0.162
 ost+0.159
 guardian+0.152
Neg logit contributions
 spaghetti-0.146
suits-0.139
 handful-0.137
 sip-0.135
suit-0.132
Token made
Token ID925
Feature activation+2.95e-02

Pos logit contributions
 parent+0.421
 owners+0.392
 guardian+0.379
 parents+0.370
 tou+0.366
Neg logit contributions
 handful-0.266
 closely-0.258
suits-0.255
 sip-0.254
ues-0.249
Token an
Token ID281
Feature activation+5.56e-02

Pos logit contributions
suit+0.054
suits+0.054
ues+0.054
 handful+0.051
sc+0.051
Neg logit contributions
 parent-0.051
 owners-0.045
 parents-0.045
 family-0.043
 je-0.041
Token unusual
Token ID8468
Feature activation+3.06e-02

Pos logit contributions
 closely+0.084
 handful+0.083
ues+0.082
 sip+0.077
inn+0.077
Neg logit contributions
 parent-0.110
 owners-0.101
 parents-0.094
 guardian-0.092
mother-0.091
Token sauce
Token ID10746
Feature activation+1.25e-01

Pos logit contributions
 handful+0.044
ues+0.042
 closely+0.042
 sip+0.042
suits+0.040
Neg logit contributions
 parent-0.064
 owners-0.059
 parents-0.055
father-0.054
mother-0.054
Token for
Token ID329
Feature activation+6.76e-03

Pos logit contributions
 handful+0.173
ues+0.173
 closely+0.165
 sip+0.161
suits+0.161
Neg logit contributions
 parent-0.269
 owners-0.247
 parents-0.233
 guardian-0.227
father-0.227
Token her
Token ID607
Feature activation-1.20e-01

Pos logit contributions
 parent+0.135
 owners+0.120
 guardian+0.109
 je+0.109
 parents+0.109
Neg logit contributions
ues-0.169
 handful-0.169
 closely-0.167
 sip-0.163
 Whoever-0.161
Token.
Token ID13
Feature activation-1.14e-03

Pos logit contributions
 parent+0.252
 owners+0.231
 parents+0.217
 guardian+0.211
father+0.210
Neg logit contributions
 handful-0.174
ues-0.173
 closely-0.166
suits-0.162
 sip-0.161
Token
Token ID198
Feature activation-5.14e-02

Pos logit contributions
 parent+0.002
 owners+0.002
 parents+0.001
 babys+0.001
 family+0.001
Neg logit contributions
 handful-0.002
ues-0.002
suits-0.002
 closely-0.002
sc-0.002
Token
Token ID198
Feature activation-5.17e-02

Pos logit contributions
 parent+0.105
 owners+0.096
 parents+0.091
 guardian+0.087
father+0.087
Neg logit contributions
ues-0.076
 handful-0.076
 closely-0.072
suits-0.072
 Whoever-0.070
TokenHer
Token ID9360
Feature activation-1.26e-01

Pos logit contributions
 parent+0.096
 owners+0.087
 parents+0.082
father+0.081
 guardian+0.080
Neg logit contributions
ues-0.085
 handful-0.084
 closely-0.084
 Whoever-0.082
 sip-0.082
Token mom
Token ID1995
Feature activation-2.05e-01

Pos logit contributions
 parent+0.159
 owners+0.143
 ost+0.140
 tou+0.140
father+0.137
Neg logit contributions
 handful-0.263
 closely-0.258
 sip-0.255
ues-0.255
suits-0.252
Token took
Token ID1718
Feature activation+1.17e-02

Pos logit contributions
 parent+0.409
 owners+0.371
 parents+0.352
 guardian+0.340
 tou+0.337
Neg logit contributions
ues-0.311
 handful-0.310
 closely-0.300
suits-0.296
 Whoever-0.293
Token out
Token ID503
Feature activation+9.42e-02

Pos logit contributions
ues+0.024
suits+0.023
suit+0.023
ishy+0.022
inn+0.022
Neg logit contributions
 parent-0.018
 parents-0.016
 owners-0.016
 family-0.015
mother-0.015
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
ues+0.151
suit+0.146
suits+0.145
 Whoever+0.141
sc+0.140
Neg logit contributions
 parent-0.190
 owners-0.169
 parents-0.167
 family-0.160
mother-0.156
Token big
Token ID1263
Feature activation+2.61e-02

Pos logit contributions
ues+0.161
ows+0.152
 Whoever+0.146
inn+0.143
ishy+0.141
Neg logit contributions
 parent-0.207
 family-0.179
 babys-0.177
 je-0.176
 parents-0.174
Token white
Token ID2330
Feature activation+1.04e-01

Pos logit contributions
ues+0.042
ows+0.040
 Whoever+0.039
inn+0.038
suits+0.037
Neg logit contributions
 parent-0.053
 parents-0.045
 family-0.045
 je-0.044
 babys-0.044