TOP ACTIVATIONS
MAX = 30.261

time . Fe eling sad , Tim my went
. One day , Grac ie was feeling a bit sad
around the park , Sus ie gasped with surprise . She
Bun B un and Tweet ie still weren 't talking .
" I 'm sorry sweet ie , but it 's getting
to the door and Sus ie followed it . The butterfly
a little girl named Sus ie . Sus ie loved to
a necklace for O ll ie with the stones . O
the chim ney ! Grac ie was so proud of the
called Mill ie . Mill ie wanted to get to the
One day Mill ie went out to play in
's a telephone , sweet ie . People use it to
You 're welcome , sweet ie . Do you want to
. One day , Ros ie 's mom my took her
her charming doll , Ros ie . One day , Lily
a little bird named Tweet ie . Tweet ie loved to
ily realized that the gen ie was right . She thanked
little girl called L izz ie . She liked to perform
. One day , Ros ie was playing in her room
dance lessons . H oot ie was happy that he didn

MIN ACTIVATIONS
MIN = -20.907

book . What is your name ?" " Tim
heard a voice calling her name . It was mom .
How do you know my name ?" " I
, " Hello , my name is Vin ny . I
. Go ahead . His name is Spot . He loves
the patient to say his name . The patient said his
sir . Why did you name him Wisdom ?" Ben asked
rabbit replied , " My name is Holly ."
She asked him what his name was and where he came
name ?" " My name is Pr inter ," the
a little boy . His name was Sam and he was
was happy and wrote his name next to Lily 's .
on . She wrote her name , and her mom 's
, how to write his name and how to draw a
a little girl . Her name was Claire and she was
a little boy . His name was Sam and he was
to the park . His name was Sam . He saw
when her mom called her name . She didn 't want
said " Hello ! My name is Fox and I love
could hear someone repeating her name over and over . "

INTERVAL 17.469 - 30.261
CONTAINS 0.125%

" My name is Tweet ie . I am looking for
" Hello , little bird ie ! Do you want to
get a new rubber duck ie for you to play with
girl . She was three years old , and she lived
a little girl named Mill ie . She was three years

INTERVAL 4.677 - 17.469
CONTAINS 23.212%

and said , " S ara , I give you the
special object . Every day , the girl would un pack
excited ! He laughed and pointed at the video
soon as he touched it , the weapon disappeared ! The
go . But don 't forget to rest , too .

INTERVAL -8.115 - 4.677
CONTAINS 64.965%

. She smiled and put it on her wrist .
wouldn 't get punished anymore . The little
. They played together on the lawn , running
. They made a big ig loo with blocks of snow
and ventured a few steps closer . Suddenly , from underneath

INTERVAL -20.907 - -8.115
CONTAINS 11.698%

twins . They like to play in the park with their
L ily did not listen to Max . She thought the
worked . The deer smiled to himself and knew that he
to do . From that day on ,
in a big , dark house . She was very lonely
Token time
Token ID640
Feature activation+5.51e-01

Pos logit contributions
eling+0.451
ited+0.401
i+0.395
ering+0.389
ie+0.383
Neg logit contributions
 name-0.228
 names-0.203
 fence-0.176
Circ-0.174
 to-0.173
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
eling+0.034
ited+0.029
i+0.029
ering+0.028
ie+0.027
Neg logit contributions
 name-0.028
 names-0.026
-0.025
 to-0.025
 They-0.025
Token
Token ID198
Feature activation-7.49e+00

Pos logit contributions
eling+0.025
i+0.024
heart+0.023
ie+0.023
ited+0.023
Neg logit contributions
 name-0.009
 names-0.009
Circ-0.008
 fence-0.008
 across-0.007
Token
Token ID198
Feature activation-7.82e+00

Pos logit contributions
 name+0.197
 names+0.174
 fence+0.144
 Bang+0.140
Circ+0.140
Neg logit contributions
eling-0.603
ited-0.545
ie-0.543
heart-0.539
i-0.538
TokenFe
Token ID14304
Feature activation+5.52e+00

Pos logit contributions
 name+0.379
 names+0.341
 to+0.315
 Bang+0.311
 fence+0.309
Neg logit contributions
eling-0.501
ited-0.440
i-0.428
ering-0.424
heart-0.414
Tokeneling
Token ID10809
Feature activation+2.96e+01

Pos logit contributions
itive+0.091
heart+0.087
ience+0.078
 reassured+0.071
ious+0.063
Neg logit contributions
-0.378
-0.366
 hills-0.365
 town-0.364
 to-0.360
Token sad
Token ID6507
Feature activation+8.04e+00

Pos logit contributions
eling+1.969
ited+1.706
i+1.686
ering+1.669
ie+1.611
Neg logit contributions
 name-1.396
-1.262
 to-1.231
 They-1.206
 across-1.201
Token,
Token ID11
Feature activation+1.31e+01

Pos logit contributions
eling+0.488
ited+0.425
ering+0.403
i+0.403
ie+0.382
Neg logit contributions
 name-0.429
 names-0.389
-0.381
 They-0.377
 to-0.376
Token Tim
Token ID5045
Feature activation+1.18e+00

Pos logit contributions
eling+0.885
ited+0.775
ering+0.760
i+0.758
ie+0.729
Neg logit contributions
 name-0.599
 names-0.531
-0.521
 They-0.505
 to-0.495
Tokenmy
Token ID1820
Feature activation+7.66e+00

Pos logit contributions
eling+0.075
ering+0.064
ited+0.063
izing+0.061
ience+0.060
Neg logit contributions
 name-0.057
-0.052
 to-0.052
 names-0.050
 They-0.048
Token went
Token ID1816
Feature activation-1.85e+00

Pos logit contributions
eling+0.491
ited+0.422
ering+0.417
heart+0.394
izing+0.388
Neg logit contributions
 name-0.390
-0.358
 to-0.353
 They-0.337
 names-0.336
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
 name+0.298
+0.277
 to+0.274
 names+0.274
 They+0.269
Neg logit contributions
eling-0.288
ited-0.247
i-0.238
ering-0.230
heart-0.223
Token One
Token ID1881
Feature activation+9.41e-01

Pos logit contributions
eling+0.022
i+0.021
heart+0.020
ited+0.020
ie+0.020
Neg logit contributions
 name-0.007
 names-0.006
Circ-0.006
 fence-0.005
 across-0.005
Token day
Token ID1110
Feature activation-4.01e-01

Pos logit contributions
eling+0.073
ited+0.065
i+0.064
ering+0.064
ie+0.062
Neg logit contributions
 name-0.033
 names-0.029
-0.027
 They-0.026
Circ-0.026
Token,
Token ID11
Feature activation+1.39e+01

Pos logit contributions
 name+0.022
+0.021
 to+0.021
 They+0.020
 names+0.020
Neg logit contributions
eling-0.024
ited-0.021
ering-0.020
i-0.019
ience-0.019
Token Grac
Token ID38099
Feature activation+1.17e+01

Pos logit contributions
eling+0.900
ited+0.783
i+0.772
ering+0.768
ie+0.740
Neg logit contributions
 name-0.664
 names-0.594
-0.576
 They-0.560
 to-0.551
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.337
mmm+0.306
ited+0.293
izing+0.293
 chuckled+0.284
Neg logit contributions
 name-0.808
 names-0.799
 house-0.766
 to-0.756
-0.745
Token was
Token ID373
Feature activation+3.41e+00

Pos logit contributions
eling+1.994
ited+1.741
ering+1.735
i+1.648
heart+1.631
Neg logit contributions
 name-1.327
-1.205
 to-1.162
 names-1.151
 They-1.119
Token feeling
Token ID4203
Feature activation+1.62e+00

Pos logit contributions
eling+0.230
ited+0.203
i+0.199
ering+0.197
ie+0.191
Neg logit contributions
 name-0.155
 names-0.136
-0.135
 to-0.131
 They-0.127
Token a
Token ID257
Feature activation-2.71e+00

Pos logit contributions
eling+0.116
ited+0.106
i+0.105
ering+0.104
heart+0.101
Neg logit contributions
 name-0.064
 to-0.057
-0.056
 across-0.054
 names-0.053
Token bit
Token ID1643
Feature activation+9.14e+00

Pos logit contributions
 name+0.145
 names+0.122
+0.122
 to+0.122
 corner+0.119
Neg logit contributions
eling-0.167
ering-0.143
i-0.142
ited-0.136
ie-0.135
Token sad
Token ID6507
Feature activation+7.58e+00

Pos logit contributions
eling+0.608
i+0.522
ering+0.520
ited+0.513
ie+0.499
Neg logit contributions
 name-0.425
 to-0.381
-0.380
 names-0.359
 They-0.355
Token around
Token ID1088
Feature activation-6.33e+00

Pos logit contributions
eling+0.271
ited+0.252
ie+0.249
i+0.248
ering+0.244
Neg logit contributions
 name-0.118
 names-0.101
 Bang-0.090
 Some-0.087
Circ-0.083
Token the
Token ID262
Feature activation+5.79e+00

Pos logit contributions
 name+0.278
 names+0.246
+0.236
 They+0.229
Circ+0.226
Neg logit contributions
eling-0.434
ited-0.380
i-0.377
ering-0.374
ie-0.362
Token park
Token ID3952
Feature activation-9.72e+00

Pos logit contributions
eling+0.446
i+0.401
ited+0.394
ie+0.392
ering+0.392
Neg logit contributions
 name-0.179
 names-0.156
Circ-0.151
 They-0.149
-0.147
Token,
Token ID11
Feature activation+1.39e+01

Pos logit contributions
 name+0.381
 names+0.332
+0.322
 They+0.310
 to+0.306
Neg logit contributions
eling-0.715
ited-0.631
i-0.624
ering-0.619
ie-0.600
Token Sus
Token ID8932
Feature activation+1.18e+01

Pos logit contributions
eling+0.915
ited+0.799
i+0.782
ering+0.777
ie+0.751
Neg logit contributions
 name-0.655
 names-0.582
-0.573
 They-0.557
 to-0.554
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.274
ience+0.208
mmm+0.196
ited+0.190
leness+0.184
Neg logit contributions
 to-0.913
-0.903
 name-0.899
 corner-0.872
 yard-0.863
Token gasped
Token ID45236
Feature activation+8.87e+00

Pos logit contributions
eling+1.874
ited+1.649
ering+1.572
i+1.540
heart+1.524
Neg logit contributions
 name-1.425
-1.327
 to-1.283
 names-1.243
 They-1.222
Token with
Token ID351
Feature activation-2.12e+00

Pos logit contributions
eling+0.681
ited+0.610
ience+0.598
ering+0.559
leness+0.553
Neg logit contributions
 name-0.342
 to-0.316
-0.298
 across-0.281
 They-0.268
Token surprise
Token ID5975
Feature activation-5.94e+00

Pos logit contributions
 name+0.100
 names+0.089
+0.087
 They+0.084
 to+0.082
Neg logit contributions
eling-0.141
ited-0.123
ering-0.122
i-0.122
ie-0.117
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
 name+0.248
 to+0.226
 names+0.216
+0.216
 They+0.199
Neg logit contributions
eling-0.435
ited-0.392
ience-0.369
ering-0.360
i-0.360
Token She
Token ID1375
Feature activation-7.95e-01

Pos logit contributions
 names+0.042
 name+0.040
Circ+0.038
 fence+0.037
 road+0.036
Neg logit contributions
eling-0.087
heart-0.086
i-0.084
mmm-0.080
ie-0.080
Token Bun
Token ID28515
Feature activation+7.86e+00

Pos logit contributions
eling+0.632
ie+0.606
i+0.605
heart+0.590
ited+0.590
Neg logit contributions
 name-0.208
 names-0.202
Circ-0.183
 fence-0.183
 hills-0.154
TokenB
Token ID33
Feature activation+6.16e+00

Pos logit contributions
eling+0.506
ited+0.436
ering+0.419
i+0.398
heart+0.394
Neg logit contributions
-0.372
 name-0.368
 to-0.347
 They-0.331
 across-0.323
Tokenun
Token ID403
Feature activation-1.22e+00

Pos logit contributions
eling+0.412
ited+0.360
i+0.357
ering+0.354
ie+0.343
Neg logit contributions
 name-0.282
 names-0.252
-0.242
 They-0.235
Circ-0.232
Token and
Token ID290
Feature activation+5.13e+00

Pos logit contributions
+0.057
 name+0.055
 to+0.053
 They+0.052
 The+0.049
Neg logit contributions
eling-0.078
ited-0.068
ering-0.065
i-0.065
heart-0.062
Token Tweet
Token ID18752
Feature activation-1.11e-03

Pos logit contributions
eling+0.353
ited+0.312
ering+0.299
i+0.291
heart+0.288
Neg logit contributions
 name-0.234
-0.212
 to-0.210
 names-0.208
 They-0.206
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
 name+0.000
+0.000
 to+0.000
 names+0.000
 They+0.000
Neg logit contributions
eling-0.000
ited-0.000
ering-0.000
ience-0.000
i-0.000
Token still
Token ID991
Feature activation+4.49e+00

Pos logit contributions
eling+1.895
ited+1.696
ering+1.603
heart+1.568
i+1.530
Neg logit contributions
 name-1.418
-1.337
 to-1.305
 They-1.294
 names-1.278
Token weren
Token ID6304
Feature activation+1.27e+01

Pos logit contributions
eling+0.281
ited+0.245
ering+0.238
i+0.234
heart+0.229
Neg logit contributions
 name-0.230
-0.206
 names-0.202
 to-0.201
 They-0.198
Token't
Token ID470
Feature activation+7.65e-01

Pos logit contributions
eling+0.933
ited+0.817
i+0.813
ering+0.807
ie+0.793
Neg logit contributions
 name-0.507
 names-0.452
Circ-0.403
 fence-0.401
-0.400
Token talking
Token ID3375
Feature activation+5.36e+00

Pos logit contributions
eling+0.049
ited+0.044
ering+0.042
i+0.041
ience+0.041
Neg logit contributions
 name-0.036
-0.034
 to-0.034
 They-0.032
 names-0.032
Token.
Token ID13
Feature activation-1.45e+00

Pos logit contributions
eling+0.463
ering+0.420
i+0.420
ited+0.415
ie+0.411
Neg logit contributions
 name-0.131
 names-0.103
-0.095
Circ-0.092
 They-0.089
Token "
Token ID366
Feature activation-4.81e+00

Pos logit contributions
eling+0.145
ering+0.127
i+0.126
ited+0.126
ie+0.123
Neg logit contributions
 name-0.087
 names-0.077
-0.073
Circ-0.072
 They-0.072
TokenI
Token ID40
Feature activation-2.21e+00

Pos logit contributions
 name+0.202
 names+0.181
 fence+0.166
 to+0.162
 They+0.161
Neg logit contributions
eling-0.338
ited-0.307
i-0.291
ering-0.288
heart-0.286
Token'm
Token ID1101
Feature activation-6.60e-01

Pos logit contributions
 name+0.090
 names+0.076
+0.076
 to+0.072
 They+0.071
Neg logit contributions
eling-0.162
ering-0.143
i-0.143
ited-0.141
ie-0.136
Token sorry
Token ID7926
Feature activation+7.65e+00

Pos logit contributions
 name+0.036
+0.033
 to+0.032
 names+0.031
 They+0.031
Neg logit contributions
eling-0.038
ited-0.035
i-0.033
ering-0.033
ie-0.032
Token sweet
Token ID6029
Feature activation+9.33e+00

Pos logit contributions
eling+0.441
ited+0.382
ering+0.356
i+0.353
ience+0.334
Neg logit contributions
 name-0.451
 names-0.406
-0.403
 to-0.397
 They-0.390
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.605
ited+0.507
ering+0.500
ience+0.476
i+0.476
Neg logit contributions
 name-0.488
-0.435
 names-0.429
 to-0.426
 They-0.415
Token,
Token ID11
Feature activation+1.32e+01

Pos logit contributions
eling+2.170
ited+1.908
ering+1.852
i+1.803
ience+1.712
Neg logit contributions
 name-1.204
-1.030
 to-1.021
 names-1.004
 They-0.978
Token but
Token ID475
Feature activation+8.55e+00

Pos logit contributions
eling+0.945
ited+0.834
ering+0.816
i+0.806
ie+0.770
Neg logit contributions
 name-0.589
 names-0.506
-0.502
 They-0.481
 to-0.477
Token it
Token ID340
Feature activation+2.44e+00

Pos logit contributions
eling+0.613
ited+0.587
ering+0.532
ience+0.522
itive+0.520
Neg logit contributions
 name-0.372
 to-0.332
-0.330
 names-0.308
 They-0.293
Token's
Token ID338
Feature activation+7.13e+00

Pos logit contributions
eling+0.179
ited+0.162
ering+0.160
i+0.157
itive+0.154
Neg logit contributions
 name-0.095
-0.082
 names-0.080
 to-0.080
 They-0.077
Token getting
Token ID1972
Feature activation+8.07e+00

Pos logit contributions
eling+0.441
ited+0.390
ering+0.377
i+0.373
ience+0.359
Neg logit contributions
 name-0.373
 names-0.331
-0.331
 to-0.323
 They-0.316
Token to
Token ID284
Feature activation-1.12e+01

Pos logit contributions
eling+0.190
ited+0.175
i+0.168
ering+0.168
ience+0.162
Neg logit contributions
 name-0.100
 names-0.086
-0.084
 to-0.082
 They-0.079
Token the
Token ID262
Feature activation+5.79e+00

Pos logit contributions
 name+0.435
 names+0.386
+0.365
Circ+0.363
 They+0.356
Neg logit contributions
eling-0.812
i-0.707
ited-0.703
ering-0.695
ie-0.687
Token door
Token ID3420
Feature activation-1.23e+01

Pos logit contributions
eling+0.431
i+0.381
ited+0.375
ering+0.372
ie+0.372
Neg logit contributions
 name-0.193
 They-0.169
 names-0.169
-0.167
Circ-0.167
Token and
Token ID290
Feature activation+5.13e+00

Pos logit contributions
 name+0.483
+0.416
 names+0.416
 to+0.407
 They+0.397
Neg logit contributions
eling-0.903
ited-0.797
i-0.787
ering-0.778
ience-0.754
Token Sus
Token ID8932
Feature activation+1.18e+01

Pos logit contributions
eling+0.333
ited+0.291
i+0.286
ering+0.286
ie+0.274
Neg logit contributions
 name-0.247
 names-0.220
-0.214
 They-0.209
 to-0.207
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.265
ience+0.201
ited+0.200
mmm+0.174
leness+0.173
Neg logit contributions
 name-0.936
-0.934
 to-0.926
 corner-0.882
 names-0.877
Token followed
Token ID3940
Feature activation+2.20e+00

Pos logit contributions
eling+1.759
ited+1.551
ering+1.485
i+1.446
heart+1.430
Neg logit contributions
 name-1.551
-1.416
 to-1.371
 names-1.369
 They-1.331
Token it
Token ID340
Feature activation+2.57e+00

Pos logit contributions
eling+0.131
i+0.113
ering+0.111
ited+0.111
ie+0.108
Neg logit contributions
 name-0.115
 names-0.105
-0.099
Circ-0.098
 They-0.098
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
eling+0.169
ering+0.157
i+0.153
ie+0.152
ited+0.147
Neg logit contributions
 name-0.105
Circ-0.095
 names-0.093
 Bang-0.092
 Some-0.089
Token The
Token ID383
Feature activation-1.61e+00

Pos logit contributions
 name+0.038
 names+0.037
Circ+0.034
 fence+0.033
 road+0.030
Neg logit contributions
eling-0.087
heart-0.085
i-0.083
ie-0.080
ering-0.078
Token butterfly
Token ID35113
Feature activation-4.91e-01

Pos logit contributions
 name+0.053
+0.046
 They+0.046
 names+0.046
Circ+0.045
Neg logit contributions
eling-0.123
ited-0.109
i-0.109
ering-0.108
ie-0.107
Token a
Token ID257
Feature activation-1.16e+00

Pos logit contributions
eling+0.268
leness+0.189
izing+0.188
ited+0.176
ience+0.171
Neg logit contributions
 name-0.818
-0.806
 to-0.789
 names-0.752
 corner-0.742
Token little
Token ID1310
Feature activation+2.05e+00

Pos logit contributions
 name+0.043
 names+0.040
 They+0.038
+0.037
Circ+0.037
Neg logit contributions
eling-0.083
ited-0.075
i-0.073
ering-0.073
ie-0.071
Token girl
Token ID2576
Feature activation+7.80e+00

Pos logit contributions
eling+0.148
ited+0.130
i+0.129
ering+0.128
ie+0.125
Neg logit contributions
 name-0.082
 names-0.072
-0.069
 They-0.067
Circ-0.066
Token named
Token ID3706
Feature activation+9.57e+00

Pos logit contributions
eling+0.470
ited+0.403
ering+0.397
i+0.382
heart+0.369
Neg logit contributions
 name-0.419
-0.381
 to-0.373
 names-0.370
 They-0.362
Token Sus
Token ID8932
Feature activation+1.18e+01

Pos logit contributions
eling+0.623
i+0.543
ited+0.540
ering+0.532
ie+0.525
Neg logit contributions
 name-0.451
 names-0.403
-0.382
Circ-0.376
 They-0.372
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.317
ience+0.225
ited+0.222
ering+0.194
ises+0.190
Neg logit contributions
 name-0.954
-0.945
 to-0.926
 names-0.891
 corner-0.889
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
eling+2.332
ited+2.104
ering+2.043
i+2.026
heart+1.973
Neg logit contributions
 name-0.988
-0.835
 names-0.807
 to-0.782
 They-0.750
Token Sus
Token ID8932
Feature activation+1.13e+01

Pos logit contributions
 name+0.036
 names+0.034
Circ+0.033
 fence+0.031
 road+0.029
Neg logit contributions
eling-0.066
heart-0.062
ie-0.061
i-0.061
ering-0.058
Tokenie
Token ID494
Feature activation+2.87e+01

Pos logit contributions
eling+0.325
ience+0.270
leness+0.232
ited+0.229
ises+0.229
Neg logit contributions
-0.846
 name-0.832
 to-0.813
 corner-0.787
 names-0.781
Token loved
Token ID6151
Feature activation+7.61e+00

Pos logit contributions
eling+2.123
ited+1.907
ering+1.815
itive+1.759
heart+1.746
Neg logit contributions
 name-1.195
-1.142
 to-1.073
 They-1.010
 names-0.979
Token to
Token ID284
Feature activation-1.29e+01

Pos logit contributions
eling+0.623
i+0.564
ie+0.558
ering+0.542
heart+0.539
Neg logit contributions
 name-0.212
 names-0.168
 Bang-0.163
 They-0.159
 fence-0.155
Token a
Token ID257
Feature activation-1.16e+00

Pos logit contributions
eling+0.747
i+0.677
ie+0.661
ering+0.651
ited+0.630
Neg logit contributions
 name-0.408
Circ-0.403
 Bang-0.384
 names-0.383
 They-0.380
Token necklace
Token ID38987
Feature activation+4.52e+00

Pos logit contributions
 name+0.043
 names+0.040
 They+0.038
+0.037
Circ+0.037
Neg logit contributions
eling-0.083
ited-0.075
i-0.073
ering-0.073
ie-0.071
Token for
Token ID329
Feature activation+5.29e+00

Pos logit contributions
eling+0.332
ited+0.292
i+0.286
ering+0.286
heart+0.275
Neg logit contributions
 name-0.181
 names-0.157
-0.153
 to-0.148
 They-0.146
Token O
Token ID440
Feature activation+4.83e+00

Pos logit contributions
eling+0.358
ited+0.317
i+0.310
ering+0.309
ie+0.297
Neg logit contributions
 name-0.239
 names-0.212
-0.207
 They-0.199
 to-0.198
Tokenll
Token ID297
Feature activation+1.20e+01

Pos logit contributions
eling+0.231
ited+0.186
ering+0.180
i+0.177
heart+0.170
Neg logit contributions
 name-0.304
-0.281
 names-0.273
 to-0.270
 They-0.260
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.464
ience+0.388
ited+0.385
ering+0.360
i+0.346
Neg logit contributions
 name-0.884
-0.809
 names-0.799
 to-0.791
 They-0.776
Token with
Token ID351
Feature activation-1.97e+00

Pos logit contributions
eling+2.151
i+1.913
ited+1.906
ering+1.882
ie+1.849
Neg logit contributions
 name-1.089
 names-0.955
-0.895
Circ-0.887
 Bang-0.873
Token the
Token ID262
Feature activation+4.31e+00

Pos logit contributions
 name+0.076
 names+0.066
+0.063
 They+0.061
Circ+0.060
Neg logit contributions
eling-0.147
ited-0.130
i-0.129
ering-0.128
ie-0.124
Token stones
Token ID14966
Feature activation-5.08e+00

Pos logit contributions
eling+0.298
i+0.263
ited+0.260
ering+0.257
ie+0.254
Neg logit contributions
 name-0.176
 names-0.156
Circ-0.153
-0.152
 They-0.152
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
 name+0.183
 names+0.158
+0.154
 to+0.152
 They+0.147
Neg logit contributions
eling-0.393
ited-0.349
ering-0.338
i-0.338
heart-0.329
Token O
Token ID440
Feature activation+3.75e+00

Pos logit contributions
 name+0.042
 names+0.040
Circ+0.037
 across+0.034
 fence+0.033
Neg logit contributions
eling-0.097
i-0.093
heart-0.092
ie-0.089
ited-0.086
Token the
Token ID262
Feature activation+5.73e+00

Pos logit contributions
eling+0.861
i+0.789
ering+0.786
ited+0.784
ie+0.770
Neg logit contributions
 name-0.100
 names-0.057
-0.037
Circ-0.033
 They-0.032
Token chim
Token ID18205
Feature activation-2.46e+00

Pos logit contributions
eling+0.428
i+0.379
ited+0.377
ering+0.373
ie+0.369
Neg logit contributions
 name-0.197
 names-0.174
Circ-0.170
-0.169
 They-0.167
Tokenney
Token ID1681
Feature activation-8.45e+00

Pos logit contributions
 name+0.075
 names+0.065
Circ+0.058
 Bang+0.055
+0.053
Neg logit contributions
eling-0.201
i-0.180
ering-0.180
ited-0.180
ie-0.179
Token!
Token ID0
Feature activation+1.15e+01

Pos logit contributions
 name+0.363
+0.323
 names+0.318
 to+0.315
 They+0.308
Neg logit contributions
eling-0.578
ited-0.510
i-0.504
ering-0.497
ience-0.477
Token Grac
Token ID38099
Feature activation+1.17e+01

Pos logit contributions
eling+0.884
i+0.814
heart+0.813
ited+0.796
ie+0.779
Neg logit contributions
 name-0.313
 names-0.289
Circ-0.255
 fence-0.242
 road-0.228
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.310
mmm+0.286
iable+0.283
ited+0.273
pper+0.249
Neg logit contributions
 name-0.762
-0.738
 names-0.737
 to-0.715
 corner-0.711
Token was
Token ID373
Feature activation+3.41e+00

Pos logit contributions
eling+1.954
ited+1.758
ering+1.703
i+1.647
itive+1.631
Neg logit contributions
 name-1.328
-1.227
 to-1.162
 names-1.148
 They-1.144
Token so
Token ID523
Feature activation+9.79e+00

Pos logit contributions
eling+0.225
ited+0.200
i+0.196
ering+0.194
ience+0.190
Neg logit contributions
 name-0.162
-0.142
 to-0.140
 names-0.139
 across-0.135
Token proud
Token ID6613
Feature activation+1.04e+01

Pos logit contributions
eling+0.656
i+0.581
ited+0.578
ering+0.571
ie+0.568
Neg logit contributions
 name-0.470
-0.414
 names-0.395
 to-0.392
 They-0.377
Token of
Token ID286
Feature activation-3.75e+00

Pos logit contributions
eling+0.785
ited+0.692
ering+0.679
i+0.678
ience+0.651
Neg logit contributions
 name-0.405
 names-0.351
-0.350
 to-0.341
 They-0.330
Token the
Token ID262
Feature activation+4.00e+00

Pos logit contributions
 name+0.183
+0.172
 to+0.162
 names+0.161
 They+0.158
Neg logit contributions
eling-0.241
ited-0.227
ering-0.210
ience-0.206
i-0.206
Token called
Token ID1444
Feature activation+7.85e+00

Pos logit contributions
+0.081
 to+0.075
 name+0.075
 They+0.071
 This+0.067
Neg logit contributions
eling-0.110
ited-0.098
ience-0.086
ering-0.086
i-0.086
Token Mill
Token ID9212
Feature activation+1.02e+01

Pos logit contributions
eling+0.531
i+0.468
ering+0.462
ited+0.459
ie+0.449
Neg logit contributions
 name-0.337
 names-0.298
-0.286
Circ-0.281
 fence-0.278
Tokenie
Token ID494
Feature activation+2.99e+01

Pos logit contributions
eling+0.196
ited+0.113
ering+0.093
i+0.077
ience+0.076
Neg logit contributions
 name-0.951
 names-0.905
-0.892
 to-0.882
 They-0.872
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
eling+2.103
i+1.849
ited+1.845
ering+1.824
ie+1.779
Neg logit contributions
 name-1.238
 names-1.090
-1.043
 They-1.016
Circ-1.012
Token Mill
Token ID9212
Feature activation+9.23e+00

Pos logit contributions
 name+0.011
 names+0.011
Circ+0.010
 fence+0.010
 road+0.009
Neg logit contributions
eling-0.026
heart-0.025
i-0.025
ie-0.025
ering-0.024
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.289
mmm+0.216
ited+0.214
ience+0.214
iable+0.185
Neg logit contributions
 name-0.644
-0.644
 to-0.635
 corner-0.614
 names-0.608
Token wanted
Token ID2227
Feature activation+3.06e+00

Pos logit contributions
eling+2.064
ited+1.849
ering+1.747
i+1.692
heart+1.653
Neg logit contributions
 name-1.266
-1.171
 to-1.123
 names-1.080
 They-1.080
Token to
Token ID284
Feature activation-1.26e+01

Pos logit contributions
eling+0.286
i+0.263
ering+0.259
ie+0.257
ited+0.257
Neg logit contributions
 name-0.053
 names-0.035
 They-0.030
-0.030
 fence-0.029
Token get
Token ID651
Feature activation-1.59e+00

Pos logit contributions
 name+0.462
 names+0.417
Circ+0.405
+0.395
 They+0.391
Neg logit contributions
eling-0.917
i-0.805
ited-0.793
ie-0.783
ering-0.776
Token to
Token ID284
Feature activation-1.28e+01

Pos logit contributions
 name+0.060
 names+0.053
+0.050
 They+0.050
Circ+0.050
Neg logit contributions
eling-0.117
i-0.104
ited-0.102
ering-0.102
ie-0.099
Token the
Token ID262
Feature activation+4.00e+00

Pos logit contributions
 name+0.502
 names+0.449
+0.423
Circ+0.423
 They+0.419
Neg logit contributions
eling-0.928
i-0.806
ited-0.800
ering-0.785
ie-0.780
Token
Token ID220
Feature activation-9.46e-01

Pos logit contributions
 name+0.251
 names+0.220
+0.212
 They+0.205
 to+0.201
Neg logit contributions
eling-0.439
ited-0.387
i-0.383
ering-0.381
ie-0.369
Token
Token ID198
Feature activation-7.10e+00

Pos logit contributions
 name+0.014
 fence+0.012
Circ+0.012
 road+0.012
 names+0.011
Neg logit contributions
eling-0.083
i-0.079
ering-0.079
ie-0.079
heart-0.077
TokenOne
Token ID3198
Feature activation+5.29e+00

Pos logit contributions
 name+0.161
 names+0.138
 fence+0.108
Circ+0.105
 Bang+0.101
Neg logit contributions
eling-0.610
ited-0.558
i-0.556
ie-0.554
heart-0.551
Token day
Token ID1110
Feature activation-7.31e-01

Pos logit contributions
eling+0.418
ited+0.372
i+0.368
ering+0.367
ie+0.355
Neg logit contributions
 name-0.181
 names-0.154
-0.148
 They-0.141
 to-0.138
Token Mill
Token ID9212
Feature activation+9.23e+00

Pos logit contributions
 name+0.040
+0.037
 to+0.036
 names+0.035
 They+0.035
Neg logit contributions
eling-0.046
ited-0.039
ering-0.038
i-0.036
ience-0.035
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.237
ering+0.151
ited+0.148
leness+0.135
izing+0.132
Neg logit contributions
 name-0.766
-0.747
 to-0.739
 names-0.724
 yard-0.702
Token went
Token ID1816
Feature activation-1.39e+00

Pos logit contributions
eling+1.950
ering+1.681
ited+1.680
i+1.599
heart+1.558
Neg logit contributions
 name-1.374
-1.234
 names-1.216
 to-1.200
 They-1.175
Token out
Token ID503
Feature activation-6.95e+00

Pos logit contributions
 name+0.039
 names+0.039
Circ+0.038
 Bang+0.036
 Some+0.035
Neg logit contributions
eling-0.098
ie-0.096
i-0.096
ering-0.092
heart-0.086
Token to
Token ID284
Feature activation-1.27e+01

Pos logit contributions
 name+0.146
 names+0.116
 Bang+0.099
 They+0.093
Circ+0.092
Neg logit contributions
eling-0.627
i-0.576
ering-0.568
ie-0.567
ited-0.567
Token play
Token ID711
Feature activation-1.11e+01

Pos logit contributions
 name+0.581
 names+0.556
 They+0.530
 Some+0.527
 Bang+0.527
Neg logit contributions
eling-0.788
i-0.675
ie-0.669
ited-0.626
heart-0.614
Token in
Token ID287
Feature activation-2.21e+00

Pos logit contributions
 name+0.442
 names+0.391
+0.365
 They+0.358
Circ+0.356
Neg logit contributions
eling-0.801
ited-0.703
i-0.703
ering-0.700
ie-0.684
Token's
Token ID338
Feature activation+8.79e+00

Pos logit contributions
eling+0.237
ited+0.202
ering+0.199
heart+0.193
ie+0.193
Neg logit contributions
 name-0.143
 They-0.130
Circ-0.129
-0.126
 names-0.124
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
eling+0.601
i+0.530
ering+0.518
ited+0.515
ie+0.514
Neg logit contributions
 name-0.342
Circ-0.329
 names-0.310
 Bang-0.308
-0.300
Token telephone
Token ID11426
Feature activation-2.34e+00

Pos logit contributions
 name+0.042
 names+0.038
 They+0.037
+0.037
Circ+0.035
Neg logit contributions
eling-0.077
ited-0.069
i-0.068
ering-0.067
ience-0.066
Token,
Token ID11
Feature activation+1.39e+01

Pos logit contributions
 name+0.106
 names+0.092
+0.092
 to+0.091
 They+0.088
Neg logit contributions
eling-0.160
ited-0.142
i-0.137
ering-0.137
ie-0.130
Token sweet
Token ID6029
Feature activation+9.33e+00

Pos logit contributions
eling+0.947
ited+0.832
i+0.819
ering+0.814
ie+0.786
Neg logit contributions
 name-0.620
 names-0.549
-0.532
 They-0.517
 to-0.509
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.355
ited+0.272
ering+0.261
i+0.242
ience+0.237
Neg logit contributions
 name-0.733
 names-0.671
-0.653
 to-0.652
 They-0.644
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
eling+2.125
ited+1.908
ering+1.842
i+1.807
heart+1.750
Neg logit contributions
 name-1.198
-1.020
 names-1.014
 to-0.991
 They-0.975
Token People
Token ID4380
Feature activation-4.87e+00

Pos logit contributions
 name+0.029
 names+0.028
Circ+0.024
 across+0.023
 fence+0.023
Neg logit contributions
eling-0.074
heart-0.069
i-0.069
ie-0.066
ited-0.066
Token use
Token ID779
Feature activation+2.07e+00

Pos logit contributions
 name+0.227
 names+0.203
 to+0.201
+0.195
 They+0.190
Neg logit contributions
eling-0.316
ited-0.291
ering-0.277
i-0.275
heart-0.271
Token it
Token ID340
Feature activation+2.34e+00

Pos logit contributions
eling+0.147
i+0.129
ering+0.128
ited+0.128
ie+0.124
Neg logit contributions
 name-0.083
 names-0.072
-0.071
Circ-0.069
 They-0.068
Token to
Token ID284
Feature activation-1.29e+01

Pos logit contributions
eling+0.214
i+0.194
ited+0.194
ering+0.193
ie+0.189
Neg logit contributions
 name-0.047
 names-0.035
-0.032
Circ-0.029
 They-0.029
TokenYou
Token ID1639
Feature activation+6.80e+00

Pos logit contributions
eling+0.977
ited+0.880
ering+0.861
i+0.838
ience+0.809
Neg logit contributions
 name-0.472
 names-0.399
-0.385
 They-0.370
 to-0.361
Token're
Token ID821
Feature activation+7.08e+00

Pos logit contributions
eling+0.474
ited+0.419
ering+0.418
i+0.404
ie+0.390
Neg logit contributions
 name-0.315
 names-0.277
-0.272
 They-0.258
 to-0.254
Token welcome
Token ID7062
Feature activation-8.77e-01

Pos logit contributions
eling+0.509
ited+0.471
ering+0.452
i+0.449
ience+0.444
Neg logit contributions
 name-0.301
-0.256
 names-0.255
 They-0.240
 to-0.239
Token,
Token ID11
Feature activation+1.39e+01

Pos logit contributions
 name+0.045
 names+0.040
+0.039
 They+0.038
Circ+0.038
Neg logit contributions
eling-0.054
ited-0.046
i-0.046
ering-0.046
ie-0.044
Token sweet
Token ID6029
Feature activation+9.33e+00

Pos logit contributions
eling+0.948
ited+0.831
i+0.826
ering+0.816
ie+0.793
Neg logit contributions
 name-0.604
 names-0.537
-0.515
 They-0.500
Circ-0.495
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.274
ited+0.181
ering+0.179
ience+0.146
i+0.143
Neg logit contributions
 name-0.807
 names-0.752
-0.745
 to-0.737
 They-0.734
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
eling+2.050
ited+1.798
ering+1.763
i+1.701
ience+1.624
Neg logit contributions
 name-1.301
-1.140
 to-1.116
 names-1.103
 They-1.077
Token Do
Token ID2141
Feature activation+6.75e-01

Pos logit contributions
 name+0.037
 names+0.034
Circ+0.032
 fence+0.031
 road+0.031
Neg logit contributions
eling-0.069
i-0.063
heart-0.061
ited-0.061
ience-0.060
Token you
Token ID345
Feature activation+8.00e+00

Pos logit contributions
eling+0.044
ited+0.042
ering+0.038
heart+0.036
i+0.036
Neg logit contributions
 name-0.034
-0.032
 names-0.031
 to-0.030
 They-0.029
Token want
Token ID765
Feature activation+2.90e-01

Pos logit contributions
eling+0.559
ited+0.487
ering+0.487
i+0.479
ience+0.463
Neg logit contributions
 name-0.370
 names-0.320
-0.311
 to-0.299
 They-0.293
Token to
Token ID284
Feature activation-1.29e+01

Pos logit contributions
eling+0.025
i+0.023
ie+0.023
ering+0.023
heart+0.022
Neg logit contributions
 name-0.006
 They-0.004
 fence-0.004
 names-0.004
Circ-0.004
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
 name+0.128
+0.115
 to+0.113
 names+0.113
 They+0.111
Neg logit contributions
eling-0.185
ited-0.164
i-0.156
ering-0.154
ie-0.148
Token One
Token ID1881
Feature activation+9.41e-01

Pos logit contributions
eling+0.022
i+0.021
heart+0.020
ited+0.020
ie+0.020
Neg logit contributions
 name-0.007
 names-0.006
Circ-0.006
 fence-0.005
 across-0.005
Token day
Token ID1110
Feature activation-4.01e-01

Pos logit contributions
eling+0.073
ited+0.065
i+0.064
ering+0.064
ie+0.062
Neg logit contributions
 name-0.033
 names-0.029
-0.027
 They-0.026
Circ-0.026
Token,
Token ID11
Feature activation+1.39e+01

Pos logit contributions
 name+0.022
+0.021
 to+0.021
 They+0.020
 names+0.020
Neg logit contributions
eling-0.024
ited-0.021
ering-0.020
i-0.019
ience-0.019
Token Ros
Token ID10018
Feature activation+3.75e+00

Pos logit contributions
eling+0.900
ited+0.783
i+0.772
ering+0.768
ie+0.740
Neg logit contributions
 name-0.664
 names-0.594
-0.576
 They-0.560
 to-0.551
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.093
ited+0.065
ering+0.063
ience+0.050
izing+0.049
Neg logit contributions
 name-0.315
 names-0.306
-0.301
 yard-0.294
 to-0.294
Token's
Token ID338
Feature activation+7.51e+00

Pos logit contributions
eling+2.001
ited+1.743
ering+1.736
i+1.652
heart+1.636
Neg logit contributions
 name-1.326
-1.193
 to-1.160
 names-1.157
 They-1.119
Token mom
Token ID1995
Feature activation-4.07e+00

Pos logit contributions
i+0.426
eling+0.414
ie+0.402
ering+0.377
itive+0.374
Neg logit contributions
 They-0.308
Circ-0.307
 Along-0.281
-0.280
 This-0.279
Tokenmy
Token ID1820
Feature activation+7.77e+00

Pos logit contributions
 name+0.201
+0.187
 to+0.186
 They+0.173
 names+0.173
Neg logit contributions
eling-0.264
ering-0.227
ited-0.221
ience-0.213
izing-0.211
Token took
Token ID1718
Feature activation+1.71e+00

Pos logit contributions
eling+0.486
ering+0.418
ited+0.416
heart+0.388
i+0.384
Neg logit contributions
 name-0.405
-0.369
 to-0.365
 They-0.351
 names-0.349
Token her
Token ID607
Feature activation+1.14e+01

Pos logit contributions
eling+0.091
i+0.081
ie+0.078
ited+0.077
ering+0.076
Neg logit contributions
 name-0.095
 names-0.086
 fence-0.086
 road-0.083
-0.083
Token her
Token ID607
Feature activation+1.31e+01

Pos logit contributions
 name+0.063
 names+0.062
Circ+0.061
 road+0.054
 Bang+0.052
Neg logit contributions
eling-0.124
i-0.121
ie-0.116
heart-0.114
ering-0.112
Token charming
Token ID23332
Feature activation+8.36e+00

Pos logit contributions
eling+0.950
i+0.848
ited+0.835
ering+0.831
ie+0.820
Neg logit contributions
 name-0.481
Circ-0.427
 names-0.425
-0.410
 Bang-0.406
Token doll
Token ID3654
Feature activation+7.51e-01

Pos logit contributions
eling+0.595
i+0.529
ited+0.525
ering+0.519
ie+0.515
Neg logit contributions
 name-0.326
 names-0.288
-0.280
Circ-0.274
 They-0.273
Token,
Token ID11
Feature activation+1.39e+01

Pos logit contributions
eling+0.056
ited+0.050
ering+0.049
i+0.048
heart+0.047
Neg logit contributions
 name-0.030
 names-0.025
-0.025
 to-0.024
 They-0.024
Token Ros
Token ID10018
Feature activation+3.75e+00

Pos logit contributions
eling+0.912
ited+0.795
i+0.782
ering+0.779
ie+0.749
Neg logit contributions
 name-0.655
 names-0.583
-0.568
 They-0.551
 to-0.544
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.098
ited+0.076
ering+0.064
mmm+0.052
ths+0.052
Neg logit contributions
 name-0.300
-0.290
 to-0.285
 names-0.285
 They-0.271
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
eling+1.972
ited+1.762
ering+1.692
i+1.613
heart+1.594
Neg logit contributions
 name-1.394
-1.246
 names-1.197
 to-1.194
 They-1.176
Token One
Token ID1881
Feature activation-5.39e-01

Pos logit contributions
 name+0.029
 names+0.027
Circ+0.025
 fence+0.024
 road+0.023
Neg logit contributions
eling-0.072
i-0.069
heart-0.068
ie-0.067
ience-0.065
Token day
Token ID1110
Feature activation-1.62e+00

Pos logit contributions
 name+0.014
+0.011
 to+0.011
 names+0.010
 corner+0.009
Neg logit contributions
eling-0.049
ering-0.043
ited-0.043
i-0.042
eking-0.041
Token,
Token ID11
Feature activation+1.29e+01

Pos logit contributions
 name+0.091
+0.087
 to+0.085
 They+0.082
 names+0.080
Neg logit contributions
eling-0.102
ited-0.087
ering-0.083
i-0.080
heart-0.077
Token Lily
Token ID20037
Feature activation+1.99e+00

Pos logit contributions
eling+0.829
ited+0.720
i+0.705
ering+0.704
ie+0.672
Neg logit contributions
 name-0.634
 names-0.568
-0.556
 They-0.540
 to-0.533
Token a
Token ID257
Feature activation-1.16e+00

Pos logit contributions
eling+0.386
ited+0.364
ering+0.351
ience+0.343
i+0.341
Neg logit contributions
 name-0.179
-0.156
 names-0.141
 to-0.138
 They-0.136
Token little
Token ID1310
Feature activation+2.05e+00

Pos logit contributions
 name+0.043
 names+0.040
 They+0.038
+0.037
Circ+0.037
Neg logit contributions
eling-0.083
ited-0.075
i-0.073
ering-0.073
ie-0.071
Token bird
Token ID6512
Feature activation-4.48e+00

Pos logit contributions
eling+0.148
ited+0.130
i+0.129
ering+0.128
ie+0.125
Neg logit contributions
 name-0.082
 names-0.072
-0.069
 They-0.067
Circ-0.066
Token named
Token ID3706
Feature activation+9.57e+00

Pos logit contributions
 name+0.235
+0.221
 to+0.219
 They+0.203
 names+0.200
Neg logit contributions
eling-0.264
ited-0.227
ering-0.221
i-0.213
ience-0.210
Token Tweet
Token ID18752
Feature activation-1.11e-03

Pos logit contributions
eling+0.588
i+0.520
ie+0.499
ited+0.498
ering+0.495
Neg logit contributions
 name-0.471
 names-0.431
Circ-0.403
-0.395
 fence-0.393
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
 name+0.000
+0.000
 to+0.000
 names+0.000
 They+0.000
Neg logit contributions
eling-0.000
ited-0.000
ering-0.000
ience-0.000
heart-0.000
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
eling+2.293
ited+2.063
ering+1.994
i+1.975
heart+1.940
Neg logit contributions
 name-1.024
-0.880
 names-0.860
 to-0.827
 They-0.797
Token Tweet
Token ID18752
Feature activation-5.24e-01

Pos logit contributions
 name+0.043
 names+0.042
Circ+0.042
 corner+0.039
 across+0.037
Neg logit contributions
eling-0.055
i-0.054
ie-0.053
heart-0.051
ering-0.048
Tokenie
Token ID494
Feature activation+2.87e+01

Pos logit contributions
+0.042
 name+0.039
 to+0.038
 They+0.037
 big+0.036
Neg logit contributions
eling-0.015
ited-0.011
mmm-0.011
ering-0.010
ience-0.010
Token loved
Token ID6151
Feature activation+7.61e+00

Pos logit contributions
eling+2.212
ited+2.040
ering+1.896
heart+1.877
ieved+1.819
Neg logit contributions
-1.111
 name-1.068
 to-0.990
 They-0.940
 The-0.851
Token to
Token ID284
Feature activation-1.29e+01

Pos logit contributions
eling+0.656
ie+0.619
i+0.610
ering+0.581
heart+0.579
Neg logit contributions
 name-0.173
 names-0.127
tted-0.115
 They-0.114
 Bang-0.114
Tokenily
Token ID813
Feature activation+6.08e+00

Pos logit contributions
eling+0.542
i+0.472
ering+0.462
ited+0.459
ience+0.445
Neg logit contributions
 name-0.271
-0.253
 to-0.236
 names-0.232
 They-0.231
Token realized
Token ID6939
Feature activation+1.80e+01

Pos logit contributions
eling+0.388
ering+0.335
ited+0.334
i+0.315
izing+0.309
Neg logit contributions
 name-0.311
-0.279
 names-0.278
 to-0.270
 They-0.266
Token that
Token ID326
Feature activation+6.24e+00

Pos logit contributions
eling+1.182
ited+1.101
ering+1.012
i+1.003
ience+0.969
Neg logit contributions
 name-0.836
-0.808
 to-0.804
 They-0.744
 names-0.727
Token the
Token ID262
Feature activation+5.07e+00

Pos logit contributions
eling+0.365
ited+0.317
i+0.308
ering+0.305
ie+0.290
Neg logit contributions
 name-0.343
 names-0.311
-0.309
 to-0.300
 They-0.299
Token gen
Token ID2429
Feature activation+8.68e+00

Pos logit contributions
eling+0.340
i+0.296
ited+0.292
ie+0.290
ering+0.289
Neg logit contributions
 name-0.204
 They-0.191
-0.188
Circ-0.187
 names-0.185
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.212
ited+0.157
heit+0.151
ience+0.139
i+0.138
Neg logit contributions
 name-0.674
 to-0.659
-0.641
 names-0.613
 They-0.593
Token was
Token ID373
Feature activation+3.41e+00

Pos logit contributions
eling+2.007
ited+1.799
ering+1.744
i+1.682
heart+1.666
Neg logit contributions
 name-1.274
 to-1.146
-1.113
 names-1.110
 They-1.075
Token right
Token ID826
Feature activation-1.61e+00

Pos logit contributions
eling+0.220
ited+0.203
i+0.194
ering+0.191
heart+0.189
Neg logit contributions
 name-0.164
 to-0.144
-0.143
 names-0.142
 across-0.135
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
 name+0.060
 to+0.053
+0.052
 names+0.051
 They+0.047
Neg logit contributions
eling-0.125
ited-0.111
ering-0.106
i-0.106
ience-0.103
Token She
Token ID1375
Feature activation-7.15e-01

Pos logit contributions
 name+0.043
 names+0.043
Circ+0.039
 road+0.036
 across+0.036
Neg logit contributions
eling-0.086
i-0.080
heart-0.079
ie-0.077
ience-0.077
Token thanked
Token ID26280
Feature activation+6.68e+00

Pos logit contributions
 name+0.037
+0.035
 to+0.034
 They+0.032
 names+0.031
Neg logit contributions
eling-0.044
ering-0.040
ited-0.039
heart-0.038
itive-0.037
Token little
Token ID1310
Feature activation+1.99e+00

Pos logit contributions
 names+0.255
 name+0.255
 fence+0.203
tr+0.188
 Bang+0.186
Neg logit contributions
eling-0.681
ering-0.642
ie-0.631
ited-0.616
i-0.605
Token girl
Token ID2576
Feature activation+8.19e+00

Pos logit contributions
eling+0.144
ited+0.127
i+0.125
ering+0.125
ie+0.120
Neg logit contributions
 name-0.080
 names-0.070
-0.068
 They-0.065
 to-0.065
Token called
Token ID1444
Feature activation+7.52e+00

Pos logit contributions
eling+0.499
ited+0.434
ering+0.419
i+0.405
heart+0.394
Neg logit contributions
 name-0.432
-0.398
 to-0.386
 names-0.379
 They-0.372
Token L
Token ID406
Feature activation+5.22e+00

Pos logit contributions
eling+0.436
i+0.380
ering+0.370
ited+0.366
ie+0.363
Neg logit contributions
 name-0.392
 names-0.358
Circ-0.341
-0.341
 fence-0.336
Tokenizz
Token ID6457
Feature activation+1.19e+01

Pos logit contributions
eling+0.246
ited+0.202
i+0.197
ering+0.196
ie+0.184
Neg logit contributions
 name-0.342
 names-0.316
-0.311
 They-0.303
 to-0.300
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.277
ited+0.188
ering+0.159
i+0.142
ience+0.140
Neg logit contributions
 name-1.045
-0.990
 names-0.987
 to-0.976
 They-0.963
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
eling+2.135
i+1.902
ited+1.882
ering+1.868
ie+1.838
Neg logit contributions
 name-1.100
 names-0.969
Circ-0.905
-0.901
 They-0.885
Token She
Token ID1375
Feature activation-4.81e-01

Pos logit contributions
 name+0.031
 names+0.030
Circ+0.029
 fence+0.028
 road+0.027
Neg logit contributions
eling-0.069
i-0.066
heart-0.064
ie-0.062
ering-0.060
Token liked
Token ID8288
Feature activation+6.29e+00

Pos logit contributions
 name+0.021
+0.020
 to+0.019
 names+0.018
 They+0.018
Neg logit contributions
eling-0.034
ited-0.030
ering-0.029
heart-0.028
i-0.028
Token to
Token ID284
Feature activation-1.28e+01

Pos logit contributions
eling+0.557
i+0.512
ie+0.508
ering+0.497
heart+0.495
Neg logit contributions
 name-0.127
 names-0.086
 Bang-0.079
-0.078
 They-0.076
Token perform
Token ID1620
Feature activation-1.31e+00

Pos logit contributions
 name+0.437
 names+0.377
+0.362
Circ+0.345
 They+0.345
Neg logit contributions
eling-0.998
ited-0.884
i-0.882
ering-0.870
ie-0.855
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
eling+0.819
i+0.748
ering+0.747
ited+0.742
ie+0.726
Neg logit contributions
 name-0.140
 names-0.100
-0.078
Circ-0.077
 They-0.077
Token One
Token ID1881
Feature activation+9.41e-01

Pos logit contributions
eling+0.022
i+0.021
heart+0.020
ited+0.020
ie+0.020
Neg logit contributions
 name-0.007
 names-0.006
Circ-0.006
 fence-0.005
 across-0.005
Token day
Token ID1110
Feature activation-4.01e-01

Pos logit contributions
eling+0.073
ited+0.065
i+0.064
ering+0.064
ie+0.062
Neg logit contributions
 name-0.033
 names-0.029
-0.027
 They-0.026
Circ-0.026
Token,
Token ID11
Feature activation+1.39e+01

Pos logit contributions
 name+0.022
+0.021
 to+0.021
 They+0.020
 names+0.020
Neg logit contributions
eling-0.024
ited-0.021
ering-0.020
i-0.019
ience-0.019
Token Ros
Token ID10018
Feature activation+3.75e+00

Pos logit contributions
eling+0.900
ited+0.783
i+0.772
ering+0.768
ie+0.740
Neg logit contributions
 name-0.664
 names-0.594
-0.576
 They-0.560
 to-0.551
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.093
ited+0.065
ering+0.063
ience+0.050
izing+0.049
Neg logit contributions
 name-0.315
 names-0.306
-0.301
 yard-0.294
 to-0.294
Token was
Token ID373
Feature activation+3.41e+00

Pos logit contributions
eling+2.001
ited+1.743
ering+1.736
i+1.652
heart+1.636
Neg logit contributions
 name-1.326
-1.193
 to-1.160
 names-1.157
 They-1.119
Token playing
Token ID2712
Feature activation+2.82e-01

Pos logit contributions
eling+0.228
ited+0.200
i+0.197
ering+0.196
ie+0.189
Neg logit contributions
 name-0.157
 names-0.139
-0.136
 They-0.130
 to-0.130
Token in
Token ID287
Feature activation-2.02e+00

Pos logit contributions
eling+0.021
ited+0.018
i+0.018
ering+0.018
ie+0.018
Neg logit contributions
 name-0.011
 names-0.009
-0.009
 They-0.009
Circ-0.009
Token her
Token ID607
Feature activation+1.14e+01

Pos logit contributions
 name+0.103
 names+0.093
+0.086
 They+0.086
 Bang+0.086
Neg logit contributions
eling-0.126
ited-0.107
i-0.106
ering-0.104
ie-0.103
Token room
Token ID2119
Feature activation-7.43e+00

Pos logit contributions
eling+0.918
i+0.837
ie+0.828
ering+0.818
ited+0.816
Neg logit contributions
 name-0.341
 names-0.297
 Bang-0.274
Circ-0.274
 They-0.265
Token dance
Token ID9280
Feature activation-5.91e-01

Pos logit contributions
eling+0.012
ited+0.011
ering+0.011
i+0.010
ience+0.010
Neg logit contributions
 name-0.007
 to-0.006
-0.006
 names-0.006
 They-0.006
Token lessons
Token ID11658
Feature activation-4.39e+00

Pos logit contributions
 name+0.023
 names+0.020
+0.020
 to+0.019
 They+0.019
Neg logit contributions
eling-0.044
ited-0.039
i-0.038
ering-0.038
heart-0.036
Token.
Token ID13
Feature activation-4.50e-02

Pos logit contributions
 name+0.166
+0.152
 to+0.152
 names+0.146
 They+0.142
Neg logit contributions
eling-0.327
ited-0.297
i-0.284
ering-0.280
heart-0.267
Token H
Token ID367
Feature activation+6.10e+00

Pos logit contributions
 name+0.001
 names+0.001
Circ+0.001
 fence+0.001
 to+0.001
Neg logit contributions
eling-0.004
i-0.003
heart-0.003
ie-0.003
ited-0.003
Tokenoot
Token ID1025
Feature activation+9.72e+00

Pos logit contributions
eling+0.309
ited+0.261
ering+0.246
i+0.239
heart+0.230
Neg logit contributions
 name-0.366
-0.339
 names-0.329
 They-0.317
 Bang-0.315
Tokenie
Token ID494
Feature activation+2.91e+01

Pos logit contributions
eling+0.541
ited+0.471
ering+0.440
i+0.412
heart+0.405
Neg logit contributions
 name-0.540
 to-0.510
-0.503
 names-0.497
 They-0.488
Token was
Token ID373
Feature activation+3.41e+00

Pos logit contributions
eling+1.951
ited+1.746
ering+1.663
i+1.605
itive+1.585
Neg logit contributions
 name-1.345
-1.243
 to-1.227
 names-1.158
 They-1.147
Token happy
Token ID3772
Feature activation+8.32e+00

Pos logit contributions
eling+0.237
ited+0.213
i+0.207
ering+0.204
ience+0.202
Neg logit contributions
 name-0.153
-0.137
 to-0.133
 across-0.128
 names-0.126
Token that
Token ID326
Feature activation+5.02e+00

Pos logit contributions
eling+0.723
ited+0.651
i+0.639
ering+0.638
ie+0.617
Neg logit contributions
 name-0.226
 names-0.182
-0.177
 to-0.168
 They-0.164
Token he
Token ID339
Feature activation+5.39e+00

Pos logit contributions
eling+0.316
ited+0.278
ering+0.263
i+0.261
ience+0.250
Neg logit contributions
 name-0.262
-0.239
 names-0.234
 to-0.231
 They-0.226
Token didn
Token ID1422
Feature activation+1.32e+01

Pos logit contributions
eling+0.387
ering+0.340
ited+0.332
i+0.324
heart+0.316
Neg logit contributions
 name-0.217
 to-0.210
-0.209
 across-0.192
 They-0.178
Token book
Token ID1492
Feature activation-5.43e+00

Pos logit contributions
eling+0.929
ering+0.823
i+0.822
ience+0.807
ited+0.803
Neg logit contributions
 They-0.431
Circ-0.423
-0.390
 Bang-0.385
Let-0.378
Token.
Token ID13
Feature activation-2.87e+00

Pos logit contributions
 name+0.233
 names+0.206
Circ+0.196
+0.194
 They+0.191
Neg logit contributions
eling-0.363
i-0.321
ering-0.319
ited-0.318
ie-0.312
Token What
Token ID1867
Feature activation-8.50e+00

Pos logit contributions
 fence+0.100
 name+0.098
 corner+0.096
 yard+0.096
 across+0.095
Neg logit contributions
eling-0.174
i-0.164
heart-0.159
ience-0.156
mmm-0.156
Token is
Token ID318
Feature activation+2.53e+00

Pos logit contributions
 name+0.371
 names+0.331
+0.316
 to+0.306
 They+0.300
Neg logit contributions
eling-0.606
ited-0.539
i-0.523
ering-0.522
heart-0.499
Token your
Token ID534
Feature activation+1.34e+01

Pos logit contributions
eling+0.147
i+0.125
ering+0.123
ited+0.123
ie+0.118
Neg logit contributions
 name-0.136
 names-0.124
-0.119
 They-0.118
Circ-0.116
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+1.161
i+1.095
ering+1.055
ie+1.041
ience+1.030
Neg logit contributions
Circ-0.172
 They-0.153
 Bang-0.133
 Some-0.125
Let-0.123
Token?"
Token ID1701
Feature activation+8.41e+00

Pos logit contributions
 name+1.115
+0.999
 names+0.986
 to+0.966
 They+0.937
Neg logit contributions
eling-1.284
ited-1.124
ering-1.078
i-1.069
heart-1.022
Token
Token ID198
Feature activation-1.03e+01

Pos logit contributions
eling+0.623
i+0.581
 chuckled+0.562
ie+0.557
heart+0.553
Neg logit contributions
 names-0.205
 name-0.203
 fence-0.163
tr-0.151
Circ-0.147
Token
Token ID198
Feature activation-1.03e+01

Pos logit contributions
 name+0.265
 names+0.255
 fence+0.209
 road+0.198
tr+0.194
Neg logit contributions
eling-0.780
ie-0.736
i-0.713
ise-0.711
ering-0.695
Token"
Token ID1
Feature activation+5.09e+00

Pos logit contributions
 name+0.429
 names+0.377
+0.347
 They+0.343
 to+0.339
Neg logit contributions
eling-0.739
ited-0.653
i-0.642
ering-0.636
ie-0.618
TokenTim
Token ID14967
Feature activation+1.68e+00

Pos logit contributions
eling+0.349
ited+0.313
i+0.309
ering+0.295
ience+0.291
Neg logit contributions
 name-0.205
 names-0.179
 fence-0.173
 to-0.171
 yard-0.167
Token heard
Token ID2982
Feature activation-1.04e+00

Pos logit contributions
eling+0.244
heart+0.213
ited+0.213
ering+0.212
itive+0.206
Neg logit contributions
 name-0.177
-0.165
 to-0.163
 names-0.158
 They-0.155
Token a
Token ID257
Feature activation-4.31e+00

Pos logit contributions
 name+0.041
 names+0.037
 fence+0.036
Circ+0.035
 road+0.035
Neg logit contributions
eling-0.072
i-0.064
ie-0.062
ering-0.061
ience-0.059
Token voice
Token ID3809
Feature activation-5.96e+00

Pos logit contributions
 name+0.179
 names+0.164
Circ+0.149
+0.148
 They+0.146
Neg logit contributions
eling-0.300
ering-0.259
i-0.255
ited-0.254
ie-0.253
Token calling
Token ID4585
Feature activation+8.89e-01

Pos logit contributions
 name+0.248
 names+0.217
+0.213
 to+0.204
 They+0.203
Neg logit contributions
eling-0.424
ited-0.376
i-0.370
ering-0.366
heart-0.354
Token her
Token ID607
Feature activation+9.97e+00

Pos logit contributions
eling+0.042
ie+0.041
i+0.040
ering+0.037
heart+0.037
Neg logit contributions
 name-0.045
 yard-0.044
 road-0.044
 fence-0.043
Circ-0.043
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
i+0.839
eling+0.806
ie+0.783
ering+0.760
itive+0.756
Neg logit contributions
Circ-0.148
 road-0.119
Let-0.113
 hills-0.111
 Some-0.107
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 name+0.757
 names+0.652
+0.625
 They+0.599
 to+0.588
Neg logit contributions
eling-1.602
ited-1.424
i-1.408
ering-1.401
ie-1.357
Token It
Token ID632
Feature activation-9.60e+00

Pos logit contributions
Circ+0.074
 road+0.073
 names+0.073
 name+0.070
 yard+0.068
Neg logit contributions
eling-0.202
i-0.195
ience-0.178
ering-0.178
ise-0.175
Token was
Token ID373
Feature activation+1.56e+00

Pos logit contributions
 name+0.406
+0.369
 names+0.337
 They+0.325
 to+0.320
Neg logit contributions
eling-0.705
ited-0.639
ering-0.618
i-0.610
heart-0.597
Token mom
Token ID1995
Feature activation-5.77e+00

Pos logit contributions
eling+0.100
ited+0.086
i+0.085
ering+0.085
ie+0.082
Neg logit contributions
 name-0.077
 names-0.069
-0.067
 They-0.065
 to-0.064
Token.
Token ID13
Feature activation-2.86e+00

Pos logit contributions
 name+0.165
+0.154
 to+0.151
 They+0.134
 names+0.130
Neg logit contributions
eling-0.471
ited-0.433
ience-0.415
heart-0.406
ering-0.405
Token How
Token ID1374
Feature activation-7.03e+00

Pos logit contributions
 name+0.119
 fence+0.112
 road+0.112
 names+0.112
Circ+0.109
Neg logit contributions
eling-0.161
i-0.151
heart-0.147
ience-0.144
ie-0.140
Token do
Token ID466
Feature activation-5.45e+00

Pos logit contributions
 name+0.338
 names+0.302
+0.293
Circ+0.288
 They+0.287
Neg logit contributions
eling-0.446
ited-0.385
i-0.383
ering-0.381
ie-0.371
Token you
Token ID345
Feature activation+6.42e+00

Pos logit contributions
 name+0.294
+0.268
 names+0.265
 to+0.262
 They+0.256
Neg logit contributions
eling-0.327
ited-0.300
i-0.279
ering-0.278
heart-0.259
Token know
Token ID760
Feature activation-3.21e+00

Pos logit contributions
eling+0.489
ering+0.439
ited+0.432
i+0.429
ience+0.424
Neg logit contributions
 name-0.253
 names-0.209
-0.201
 to-0.193
 They-0.182
Token my
Token ID616
Feature activation+1.44e+01

Pos logit contributions
 name+0.165
+0.152
 to+0.149
 names+0.149
 They+0.139
Neg logit contributions
eling-0.199
ited-0.184
i-0.167
ering-0.167
heart-0.161
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.951
i+0.915
ie+0.898
ience+0.885
heart+0.873
Neg logit contributions
Circ-0.435
 They-0.425
 Some-0.409
 Bang-0.381
 Along-0.373
Token?"
Token ID1701
Feature activation+8.41e+00

Pos logit contributions
 name+1.078
+0.963
 names+0.942
 to+0.941
 They+0.892
Neg logit contributions
eling-1.324
ited-1.168
ering-1.124
i-1.108
ience-1.055
Token
Token ID198
Feature activation-1.03e+01

Pos logit contributions
eling+0.549
i+0.530
ise+0.515
ie+0.514
 reassured+0.495
Neg logit contributions
 fence-0.221
 names-0.218
 road-0.210
 name-0.210
Circ-0.196
Token
Token ID198
Feature activation-1.03e+01

Pos logit contributions
 name+0.254
 names+0.249
 fence+0.215
 road+0.203
Circ+0.203
Neg logit contributions
eling-0.776
ie-0.732
i-0.720
ise-0.719
ering-0.700
Token"
Token ID1
Feature activation+5.09e+00

Pos logit contributions
 name+0.474
 names+0.426
 They+0.386
Circ+0.382
 fence+0.382
Neg logit contributions
eling-0.680
ited-0.595
i-0.590
ering-0.581
ie-0.566
TokenI
Token ID40
Feature activation-5.40e+00

Pos logit contributions
eling+0.350
ited+0.314
i+0.311
ering+0.300
ience+0.298
Neg logit contributions
 name-0.197
 names-0.171
 fence-0.163
 to-0.159
 yard-0.159
Token,
Token ID11
Feature activation+1.14e+01

Pos logit contributions
eling+0.456
ited+0.410
i+0.388
ering+0.383
ie+0.368
Neg logit contributions
 name-0.248
 to-0.220
-0.218
 names-0.212
 They-0.203
Token "
Token ID366
Feature activation-7.95e+00

Pos logit contributions
eling+0.930
ering+0.839
i+0.835
ited+0.821
ience+0.812
Neg logit contributions
 name-0.311
 names-0.269
Circ-0.235
 fence-0.217
 yard-0.216
TokenHello
Token ID15496
Feature activation-1.60e+00

Pos logit contributions
 name+0.334
 names+0.291
 to+0.273
 fence+0.272
 They+0.269
Neg logit contributions
eling-0.552
ited-0.510
i-0.491
heart-0.480
ie-0.474
Token,
Token ID11
Feature activation+1.14e+01

Pos logit contributions
 name+0.095
 names+0.086
+0.084
 to+0.082
 They+0.081
Neg logit contributions
eling-0.089
ited-0.075
ering-0.073
i-0.072
ie-0.068
Token my
Token ID616
Feature activation+1.44e+01

Pos logit contributions
eling+0.832
ited+0.736
i+0.727
ering+0.723
ie+0.699
Neg logit contributions
 name-0.458
 names-0.400
-0.384
 They-0.370
 to-0.363
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.917
i+0.863
heart+0.830
ering+0.826
ie+0.823
Neg logit contributions
Circ-0.529
 name-0.518
 They-0.489
 Bang-0.489
-0.488
Token is
Token ID318
Feature activation+2.53e+00

Pos logit contributions
 name+0.811
+0.709
 names+0.693
 to+0.669
 They+0.647
Neg logit contributions
eling-1.585
ited-1.407
ering-1.391
i-1.362
ience-1.305
Token Vin
Token ID11820
Feature activation+2.73e+00

Pos logit contributions
eling+0.166
ited+0.145
i+0.143
ering+0.142
ie+0.137
Neg logit contributions
 name-0.119
 names-0.106
-0.103
 They-0.100
 to-0.098
Tokenny
Token ID3281
Feature activation+9.56e-01

Pos logit contributions
eling+0.186
ited+0.164
i+0.163
ering+0.161
ie+0.157
Neg logit contributions
 name-0.120
 names-0.106
-0.103
 They-0.099
Circ-0.098
Token.
Token ID13
Feature activation-2.84e+00

Pos logit contributions
eling+0.061
i+0.053
ited+0.052
ering+0.052
ie+0.051
Neg logit contributions
 name-0.046
 names-0.041
-0.040
Circ-0.039
 They-0.039
Token I
Token ID314
Feature activation-1.07e+00

Pos logit contributions
 name+0.116
 names+0.114
 fence+0.110
 road+0.108
 yard+0.108
Neg logit contributions
eling-0.156
heart-0.149
i-0.148
ience-0.143
ie-0.139
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 name+0.115
 names+0.097
+0.097
 to+0.095
 They+0.089
Neg logit contributions
eling-0.306
ited-0.275
ering-0.269
i-0.264
heart-0.261
Token Go
Token ID1514
Feature activation-3.92e+00

Pos logit contributions
 name+0.109
Circ+0.100
 names+0.100
 fence+0.100
 road+0.097
Neg logit contributions
eling-0.180
i-0.166
ience-0.158
heart-0.158
ited-0.155
Token ahead
Token ID4058
Feature activation-3.82e+00

Pos logit contributions
 name+0.164
 names+0.151
 Bang+0.148
 fence+0.143
Circ+0.142
Neg logit contributions
eling-0.250
i-0.233
ited-0.233
ie-0.227
ering-0.218
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 name+0.167
 names+0.147
Circ+0.143
 Bang+0.137
 fence+0.133
Neg logit contributions
eling-0.253
i-0.227
ited-0.222
ering-0.222
ie-0.220
Token His
Token ID2399
Feature activation+1.35e+00

Pos logit contributions
 name+0.100
 fence+0.096
Circ+0.094
 to+0.092
 names+0.092
Neg logit contributions
eling-0.183
i-0.169
ience-0.161
heart-0.160
ited-0.157
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.112
i+0.109
ering+0.103
ience+0.099
ises+0.099
Neg logit contributions
Circ-0.025
 Some-0.020
 Bang-0.020
 Along-0.017
 They-0.017
Token is
Token ID318
Feature activation+2.53e+00

Pos logit contributions
 name+0.769
 names+0.670
+0.663
 to+0.627
 They+0.624
Neg logit contributions
eling-1.616
ited-1.462
ering-1.421
i-1.397
heart-1.370
Token Spot
Token ID15899
Feature activation-6.67e+00

Pos logit contributions
eling+0.179
i+0.158
ited+0.156
ering+0.155
ie+0.151
Neg logit contributions
 name-0.101
 names-0.089
-0.085
Circ-0.083
 They-0.082
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 name+0.226
 names+0.193
+0.189
 They+0.180
 to+0.178
Neg logit contributions
eling-0.532
ited-0.479
ering-0.466
i-0.461
heart-0.451
Token He
Token ID679
Feature activation-6.86e+00

Pos logit contributions
 name+0.080
 fence+0.077
Circ+0.077
 names+0.077
 across+0.074
Neg logit contributions
eling-0.192
i-0.183
heart-0.178
ience-0.175
ited-0.168
Token loves
Token ID10408
Feature activation+6.83e+00

Pos logit contributions
 name+0.292
 names+0.255
+0.245
 They+0.236
 to+0.233
Neg logit contributions
eling-0.493
ited-0.441
ering-0.433
i-0.424
heart-0.420
Token the
Token ID262
Feature activation+2.60e+00

Pos logit contributions
eling+0.019
ited+0.017
ering+0.016
i+0.016
ience+0.016
Neg logit contributions
 name-0.011
 names-0.010
-0.010
 to-0.009
 They-0.009
Token patient
Token ID5827
Feature activation+1.75e+00

Pos logit contributions
eling+0.164
ited+0.132
ering+0.131
ience+0.130
i+0.128
Neg logit contributions
Circ-0.100
 They-0.098
 name-0.098
 names-0.095
 This-0.092
Token to
Token ID284
Feature activation-1.43e+01

Pos logit contributions
eling+0.123
ited+0.107
i+0.107
ering+0.106
ie+0.104
Neg logit contributions
 name-0.073
 names-0.065
-0.062
 They-0.060
Circ-0.060
Token say
Token ID910
Feature activation+2.60e+00

Pos logit contributions
 name+0.593
 names+0.525
+0.499
 They+0.489
Circ+0.484
Neg logit contributions
eling-1.020
i-0.886
ited-0.880
ering-0.875
ie-0.851
Token his
Token ID465
Feature activation+5.09e+00

Pos logit contributions
eling+0.162
ited+0.141
i+0.139
ering+0.137
ie+0.133
Neg logit contributions
 name-0.132
 names-0.119
-0.116
 They-0.112
 to-0.112
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.366
ering+0.330
i+0.328
ience+0.326
ise+0.321
Neg logit contributions
Circ-0.149
 They-0.136
Let-0.126
 Along-0.124
 corner-0.123
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 name+0.856
 names+0.751
+0.751
 to+0.744
 They+0.700
Neg logit contributions
eling-1.537
ited-1.353
ering-1.316
i-1.310
heart-1.267
Token The
Token ID383
Feature activation-3.10e+00

Pos logit contributions
 names+0.118
 name+0.117
 fence+0.113
Circ+0.110
 across+0.106
Neg logit contributions
eling-0.160
i-0.142
ering-0.133
ited-0.132
mmm-0.131
Token patient
Token ID5827
Feature activation+1.77e+00

Pos logit contributions
 name+0.122
 names+0.120
Circ+0.118
 They+0.113
+0.108
Neg logit contributions
eling-0.203
ited-0.169
ering-0.164
ience-0.164
i-0.161
Token said
Token ID531
Feature activation+5.97e+00

Pos logit contributions
eling+0.128
ited+0.113
i+0.110
ering+0.110
ie+0.105
Neg logit contributions
 name-0.074
 names-0.064
-0.063
 to-0.061
 They-0.061
Token his
Token ID465
Feature activation+5.08e+00

Pos logit contributions
eling+0.422
ited+0.400
i+0.360
heart+0.358
ie+0.353
Neg logit contributions
 name-0.266
 to-0.258
-0.255
 names-0.228
 They-0.226
Token sir
Token ID15967
Feature activation+1.36e+00

Pos logit contributions
eling+0.784
ited+0.694
ering+0.679
i+0.660
ie+0.645
Neg logit contributions
 name-0.550
-0.473
 names-0.466
 They-0.438
 to-0.438
Token.
Token ID13
Feature activation-2.87e+00

Pos logit contributions
eling+0.104
ited+0.092
ering+0.089
i+0.087
ience+0.085
Neg logit contributions
 name-0.053
-0.046
 names-0.045
 to-0.044
 They-0.044
Token Why
Token ID4162
Feature activation-3.66e+00

Pos logit contributions
 name+0.118
 names+0.113
Circ+0.112
 fence+0.112
 across+0.108
Neg logit contributions
eling-0.168
i-0.158
heart-0.152
ience-0.149
ering-0.145
Token did
Token ID750
Feature activation+4.50e+00

Pos logit contributions
 name+0.179
 names+0.157
+0.145
 They+0.129
 to+0.127
Neg logit contributions
eling-0.257
ited-0.246
ering-0.226
i-0.222
heart-0.220
Token you
Token ID345
Feature activation+6.43e+00

Pos logit contributions
eling+0.268
ited+0.241
ering+0.228
i+0.225
heart+0.215
Neg logit contributions
 name-0.244
 names-0.222
-0.220
 to-0.215
 They-0.213
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.472
ited+0.417
i+0.413
ering+0.412
ie+0.397
Neg logit contributions
 name-0.255
 names-0.222
-0.213
 They-0.205
 to-0.202
Token him
Token ID683
Feature activation+4.36e+00

Pos logit contributions
 name+0.999
+0.902
 to+0.868
 names+0.867
 They+0.810
Neg logit contributions
eling-1.401
ited-1.272
ering-1.217
i-1.181
itive-1.140
Token Wisdom
Token ID24075
Feature activation-1.13e+00

Pos logit contributions
eling+0.256
ited+0.221
ering+0.215
i+0.213
heart+0.203
Neg logit contributions
 name-0.240
 names-0.216
-0.213
 to-0.206
 They-0.205
Token?"
Token ID1701
Feature activation+8.41e+00

Pos logit contributions
 name+0.062
 names+0.056
+0.055
 to+0.054
 They+0.052
Neg logit contributions
eling-0.066
ited-0.058
ering-0.055
i-0.055
heart-0.052
Token Ben
Token ID3932
Feature activation-2.60e+00

Pos logit contributions
eling+0.553
i+0.477
ited+0.470
ie+0.457
ering+0.457
Neg logit contributions
 name-0.374
 names-0.350
 fence-0.315
Circ-0.314
 across-0.303
Token asked
Token ID1965
Feature activation+2.54e-01

Pos logit contributions
 name+0.124
+0.117
 to+0.108
 names+0.107
 They+0.107
Neg logit contributions
eling-0.173
ited-0.150
ering-0.146
i-0.140
ience-0.139
Token rabbit
Token ID22746
Feature activation-6.23e+00

Pos logit contributions
eling+0.281
i+0.260
ie+0.253
ited+0.251
heart+0.248
Neg logit contributions
 name-0.170
 names-0.154
Circ-0.151
 They-0.145
 Bang-0.145
Token replied
Token ID8712
Feature activation+1.36e+01

Pos logit contributions
 name+0.404
+0.382
 to+0.360
 names+0.350
 They+0.347
Neg logit contributions
eling-0.327
ited-0.264
ering-0.250
itive-0.239
heart-0.232
Token,
Token ID11
Feature activation+1.14e+01

Pos logit contributions
eling+0.818
ited+0.712
i+0.647
ering+0.631
heart+0.620
Neg logit contributions
 name-0.777
 to-0.724
-0.719
 names-0.685
 They-0.678
Token "
Token ID366
Feature activation-7.94e+00

Pos logit contributions
eling+1.004
i+0.908
ering+0.903
ited+0.902
ie+0.877
Neg logit contributions
 name-0.262
 names-0.213
Circ-0.180
-0.178
 They-0.173
TokenMy
Token ID3666
Feature activation+8.20e+00

Pos logit contributions
 name+0.345
 names+0.312
 fence+0.289
 yard+0.283
 road+0.281
Neg logit contributions
eling-0.536
ited-0.480
i-0.478
ie-0.455
ering-0.454
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
i+0.747
eling+0.739
heart+0.713
ie+0.700
ering+0.691
Neg logit contributions
Circ-0.081
 Some-0.044
 hills-0.043
 Bang-0.039
 Along-0.035
Token is
Token ID318
Feature activation+2.53e+00

Pos logit contributions
 name+0.732
+0.621
 names+0.596
 to+0.578
 They+0.563
Neg logit contributions
eling-1.678
ited-1.533
ering-1.491
i-1.450
heart-1.400
Token Holly
Token ID7764
Feature activation+7.28e+00

Pos logit contributions
eling+0.159
i+0.141
ering+0.135
ited+0.135
ie+0.134
Neg logit contributions
 name-0.119
 names-0.108
Circ-0.101
-0.101
 fence-0.100
Token."
Token ID526
Feature activation-5.56e-01

Pos logit contributions
eling+0.580
ited+0.518
ering+0.504
i+0.497
heart+0.482
Neg logit contributions
 name-0.253
 names-0.214
-0.210
 They-0.201
 to-0.201
Token
Token ID198
Feature activation-1.03e+01

Pos logit contributions
 name+0.012
 road+0.011
 fence+0.011
 names+0.011
Circ+0.009
Neg logit contributions
eling-0.043
i-0.041
ering-0.040
ie-0.040
heart-0.040
Token
Token ID198
Feature activation-1.03e+01

Pos logit contributions
 name+0.198
 names+0.167
 fence+0.153
 road+0.145
 yard+0.130
Neg logit contributions
eling-0.885
ering-0.816
ie-0.810
i-0.804
ited-0.797
Token She
Token ID1375
Feature activation-2.19e+00

Pos logit contributions
Circ+0.106
 whole+0.104
 name+0.102
 corner+0.101
 road+0.100
Neg logit contributions
eling-0.145
i-0.144
heart-0.139
ie-0.127
ience-0.127
Token asked
Token ID1965
Feature activation+2.46e-01

Pos logit contributions
 name+0.117
+0.105
 names+0.104
 to+0.104
 They+0.099
Neg logit contributions
eling-0.143
ering-0.123
heart-0.119
ited-0.119
itive-0.113
Token him
Token ID683
Feature activation+4.36e+00

Pos logit contributions
eling+0.016
ited+0.015
ering+0.014
i+0.014
ience+0.013
Neg logit contributions
 name-0.012
 names-0.011
-0.011
 to-0.010
 They-0.010
Token what
Token ID644
Feature activation+5.23e+00

Pos logit contributions
eling+0.261
i+0.231
ering+0.225
ited+0.221
ie+0.221
Neg logit contributions
 name-0.217
 names-0.195
Circ-0.189
 They-0.188
-0.188
Token his
Token ID465
Feature activation+5.09e+00

Pos logit contributions
eling+0.315
ited+0.271
i+0.266
ering+0.265
ie+0.253
Neg logit contributions
 name-0.278
 names-0.252
-0.246
 They-0.239
 to-0.237
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
i+0.377
eling+0.371
ie+0.355
ering+0.343
itive+0.338
Neg logit contributions
Circ-0.120
 Some-0.117
 They-0.108
 Bang-0.105
 Along-0.099
Token was
Token ID373
Feature activation+1.55e+00

Pos logit contributions
 name+0.827
+0.748
 to+0.689
 names+0.680
 They+0.660
Neg logit contributions
eling-1.599
ited-1.442
ering-1.391
heart-1.345
i-1.339
Token and
Token ID290
Feature activation+2.64e+00

Pos logit contributions
eling+0.104
ited+0.091
ering+0.088
i+0.088
ie+0.084
Neg logit contributions
 name-0.074
 names-0.066
-0.064
 to-0.062
 They-0.062
Token where
Token ID810
Feature activation-1.55e+00

Pos logit contributions
eling+0.162
ited+0.141
ering+0.138
i+0.137
ie+0.131
Neg logit contributions
 name-0.138
 names-0.124
-0.121
 They-0.118
 to-0.117
Token he
Token ID339
Feature activation+3.93e+00

Pos logit contributions
 name+0.092
 names+0.084
+0.081
Circ+0.080
 They+0.079
Neg logit contributions
eling-0.082
i-0.068
ited-0.068
ering-0.067
ie-0.065
Token came
Token ID1625
Feature activation-3.95e-01

Pos logit contributions
eling+0.266
ering+0.234
ited+0.230
heart+0.224
i+0.219
Neg logit contributions
 name-0.188
-0.167
 names-0.166
 to-0.158
 They-0.155
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.868
i+0.818
ering+0.787
heart+0.780
ie+0.762
Neg logit contributions
Circ-0.449
 Bang-0.392
 Some-0.389
 They-0.385
tted-0.375
Token?"
Token ID1701
Feature activation+8.39e+00

Pos logit contributions
 name+1.125
+0.999
 names+0.980
 to+0.973
 They+0.923
Neg logit contributions
eling-1.274
ited-1.097
i-1.066
ering-1.064
ience-1.007
Token
Token ID198
Feature activation-1.03e+01

Pos logit contributions
eling+0.627
heart+0.595
i+0.576
 chuckled+0.573
ise+0.566
Neg logit contributions
 names-0.211
 name-0.196
Circ-0.174
 fence-0.161
ashes-0.161
Token"
Token ID1
Feature activation+5.08e+00

Pos logit contributions
 names+0.295
 name+0.257
 fence+0.226
ashes+0.225
 road+0.209
Neg logit contributions
eling-0.729
ise-0.703
heart-0.694
ering-0.689
ieved-0.687
TokenMy
Token ID3666
Feature activation+8.20e+00

Pos logit contributions
eling+0.335
ited+0.304
i+0.300
heart+0.285
ering+0.284
Neg logit contributions
 name-0.206
 names-0.190
 fence-0.173
 to-0.169
 road-0.167
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.726
i+0.719
heart+0.691
ering+0.676
ie+0.673
Neg logit contributions
Circ-0.097
 hills-0.059
 Bang-0.056
 Along-0.054
 across-0.054
Token is
Token ID318
Feature activation+2.53e+00

Pos logit contributions
 name+0.756
+0.652
 names+0.605
 to+0.597
 They+0.572
Neg logit contributions
eling-1.655
ited-1.485
ering-1.458
i-1.443
itive-1.377
Token Pr
Token ID1736
Feature activation+2.50e+00

Pos logit contributions
eling+0.153
i+0.135
ering+0.129
ie+0.129
ited+0.129
Neg logit contributions
 name-0.124
 names-0.112
-0.105
Circ-0.104
 fence-0.103
Tokeninter
Token ID3849
Feature activation-1.02e+01

Pos logit contributions
eling+0.141
i+0.135
heart+0.128
ie+0.126
ited+0.122
Neg logit contributions
 name-0.119
Circ-0.114
 names-0.111
-0.110
 road-0.109
Token,"
Token ID553
Feature activation+7.86e+00

Pos logit contributions
 name+0.490
 to+0.433
 names+0.428
+0.425
 They+0.408
Neg logit contributions
eling-0.667
ited-0.596
ering-0.548
i-0.539
ience-0.526
Token the
Token ID262
Feature activation+2.59e+00

Pos logit contributions
eling+0.455
ited+0.398
ering+0.383
i+0.360
ience+0.360
Neg logit contributions
 name-0.447
-0.418
 names-0.394
 They-0.392
 to-0.387
Token a
Token ID257
Feature activation-4.29e+00

Pos logit contributions
eling+0.126
ited+0.113
i+0.113
ering+0.111
ie+0.110
Neg logit contributions
 name-0.047
 names-0.039
 fence-0.035
-0.035
Circ-0.035
Token little
Token ID1310
Feature activation-1.16e+00

Pos logit contributions
 name+0.163
 names+0.150
+0.135
 They+0.134
 Bang+0.133
Neg logit contributions
eling-0.307
ited-0.275
i-0.274
ering-0.267
ie-0.266
Token boy
Token ID2933
Feature activation-2.18e+00

Pos logit contributions
 name+0.040
+0.033
 names+0.032
 They+0.032
 fence+0.031
Neg logit contributions
eling-0.094
ited-0.082
ering-0.082
i-0.080
ience-0.077
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 name+0.119
+0.111
 to+0.108
 names+0.106
 They+0.106
Neg logit contributions
eling-0.129
ited-0.111
ering-0.107
ience-0.103
i-0.103
Token His
Token ID2399
Feature activation+1.35e+00

Pos logit contributions
 name+0.078
 names+0.074
 across+0.069
 fence+0.068
Circ+0.066
Neg logit contributions
eling-0.185
i-0.175
ie-0.173
heart-0.170
ited-0.170
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
i+0.104
eling+0.103
ie+0.096
itive+0.095
ering+0.093
Neg logit contributions
Circ-0.024
 hills-0.020
 corner-0.019
 across-0.018
 Some-0.018
Token was
Token ID373
Feature activation+1.55e+00

Pos logit contributions
 name+0.713
+0.696
 They+0.583
 to+0.576
 names+0.558
Neg logit contributions
eling-1.693
ited-1.558
ering-1.522
heart-1.465
i-1.459
Token Sam
Token ID3409
Feature activation-6.91e+00

Pos logit contributions
eling+0.106
ited+0.093
i+0.093
ering+0.092
ie+0.089
Neg logit contributions
 name-0.068
 names-0.061
-0.058
 They-0.056
Circ-0.056
Token and
Token ID290
Feature activation+2.66e+00

Pos logit contributions
 name+0.191
+0.177
 to+0.158
 names+0.150
 They+0.149
Neg logit contributions
eling-0.600
ited-0.546
ering-0.526
ience-0.513
i-0.512
Token he
Token ID339
Feature activation+3.93e+00

Pos logit contributions
eling+0.193
ited+0.181
ering+0.172
i+0.161
heart+0.161
Neg logit contributions
 name-0.116
-0.109
 They-0.106
 to-0.100
 names-0.097
Token was
Token ID373
Feature activation+1.55e+00

Pos logit contributions
eling+0.270
ited+0.241
ering+0.239
heart+0.232
i+0.228
Neg logit contributions
 name-0.184
-0.168
 names-0.157
 They-0.156
 to-0.153
Token was
Token ID373
Feature activation+1.56e+00

Pos logit contributions
 name+0.024
+0.022
 names+0.021
 to+0.021
 They+0.020
Neg logit contributions
eling-0.024
ering-0.020
ited-0.019
itive-0.019
eking-0.019
Token happy
Token ID3772
Feature activation+6.59e+00

Pos logit contributions
eling+0.097
ited+0.083
i+0.082
ering+0.082
ie+0.079
Neg logit contributions
 name-0.080
 names-0.072
-0.070
 They-0.068
 to-0.067
Token and
Token ID290
Feature activation+2.66e+00

Pos logit contributions
eling+0.515
ited+0.460
i+0.449
ering+0.448
ience+0.434
Neg logit contributions
 name-0.238
 names-0.203
-0.203
 to-0.196
 They-0.190
Token wrote
Token ID2630
Feature activation+3.69e+00

Pos logit contributions
eling+0.151
ited+0.128
ering+0.123
i+0.120
heart+0.118
Neg logit contributions
 name-0.165
-0.148
 to-0.147
 They-0.145
 names-0.144
Token his
Token ID465
Feature activation+5.09e+00

Pos logit contributions
eling+0.265
ited+0.233
ering+0.233
i+0.233
ie+0.225
Neg logit contributions
 name-0.146
 names-0.129
-0.123
 They-0.121
Circ-0.120
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.405
ering+0.367
i+0.365
ie+0.360
ited+0.359
Neg logit contributions
 They-0.116
Circ-0.114
 hills-0.098
 Some-0.095
 Bang-0.095
Token next
Token ID1306
Feature activation-3.75e+00

Pos logit contributions
 name+0.798
 names+0.689
+0.667
 to+0.638
 They+0.635
Neg logit contributions
eling-1.584
ited-1.401
i-1.378
ering-1.373
ie-1.325
Token to
Token ID284
Feature activation-1.43e+01

Pos logit contributions
 name+0.107
 They+0.103
 Bang+0.097
Circ+0.097
 Some+0.096
Neg logit contributions
eling-0.279
ie-0.267
ited-0.265
i-0.260
heart-0.256
Token Lily
Token ID20037
Feature activation+5.97e-01

Pos logit contributions
 name+0.585
+0.513
 names+0.506
 They+0.489
 to+0.477
Neg logit contributions
eling-1.042
ited-0.936
ering-0.914
i-0.906
ience-0.884
Token's
Token ID338
Feature activation+5.67e+00

Pos logit contributions
eling+0.046
i+0.040
ited+0.040
ering+0.040
ie+0.039
Neg logit contributions
 name-0.021
 names-0.019
-0.017
Circ-0.017
 They-0.017
Token.
Token ID13
Feature activation-2.86e+00

Pos logit contributions
eling+0.357
ie+0.337
i+0.331
heart+0.319
ering+0.318
Neg logit contributions
Circ-0.210
 name-0.196
 They-0.195
 Along-0.194
 across-0.192
Tokenon
Token ID261
Feature activation+3.11e+00

Pos logit contributions
 name+0.527
 names+0.491
+0.483
 to+0.470
 They+0.469
Neg logit contributions
eling-0.302
ited-0.242
i-0.230
ering-0.229
heart-0.211
Token.
Token ID13
Feature activation-2.87e+00

Pos logit contributions
eling+0.215
i+0.200
ie+0.198
ering+0.193
ited+0.192
Neg logit contributions
 name-0.120
Circ-0.104
 names-0.104
-0.100
 Bang-0.099
Token She
Token ID1375
Feature activation-2.19e+00

Pos logit contributions
 name+0.084
Circ+0.083
 fence+0.081
 names+0.076
 across+0.075
Neg logit contributions
eling-0.194
i-0.178
heart-0.178
ited-0.170
ie-0.170
Token wrote
Token ID2630
Feature activation+3.69e+00

Pos logit contributions
 name+0.103
+0.093
 names+0.090
 to+0.090
 They+0.087
Neg logit contributions
eling-0.149
ering-0.131
ited-0.131
heart-0.131
i-0.125
Token her
Token ID607
Feature activation+9.97e+00

Pos logit contributions
eling+0.278
ited+0.248
i+0.241
ering+0.237
heart+0.231
Neg logit contributions
 name-0.148
 names-0.129
-0.124
 to-0.120
 They-0.117
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.831
i+0.773
ie+0.756
ering+0.753
ited+0.736
Neg logit contributions
 They-0.184
Circ-0.181
 fence-0.164
 name-0.163
 Bang-0.162
Token,
Token ID11
Feature activation+1.14e+01

Pos logit contributions
 name+1.011
 names+0.899
+0.894
 to+0.872
 They+0.844
Neg logit contributions
eling-1.427
ited-1.251
ering-1.192
i-1.180
heart-1.139
Token and
Token ID290
Feature activation+2.64e+00

Pos logit contributions
eling+0.730
ited+0.637
i+0.624
ering+0.624
ie+0.596
Neg logit contributions
 name-0.564
 names-0.504
-0.496
 They-0.480
 to-0.475
Token her
Token ID607
Feature activation+9.96e+00

Pos logit contributions
eling+0.170
ited+0.151
ering+0.146
i+0.141
heart+0.138
Neg logit contributions
 name-0.142
 names-0.127
 to-0.126
-0.125
 They-0.121
Token mom
Token ID1995
Feature activation-5.77e+00

Pos logit contributions
eling+0.693
i+0.627
ie+0.607
ited+0.603
ering+0.602
Neg logit contributions
 name-0.368
 They-0.334
Circ-0.331
-0.329
 names-0.325
Token's
Token ID338
Feature activation+5.65e+00

Pos logit contributions
 name+0.294
+0.271
 to+0.265
 names+0.263
 They+0.251
Neg logit contributions
eling-0.394
ited-0.338
ering-0.326
ience-0.313
heart-0.311
Token,
Token ID11
Feature activation+1.14e+01

Pos logit contributions
 name+0.031
 names+0.027
Circ+0.026
+0.026
 Bang+0.026
Neg logit contributions
eling-0.051
i-0.045
ering-0.044
ited-0.043
ie-0.043
Token how
Token ID703
Feature activation+6.26e+00

Pos logit contributions
eling+0.729
ited+0.628
i+0.623
ering+0.622
ie+0.595
Neg logit contributions
 name-0.551
 names-0.492
-0.473
Circ-0.461
 They-0.460
Token to
Token ID284
Feature activation-1.43e+01

Pos logit contributions
eling+0.471
ering+0.415
i+0.408
ie+0.403
ience+0.398
Neg logit contributions
 name-0.222
 names-0.186
 fence-0.180
 Bang-0.176
Circ-0.174
Token write
Token ID3551
Feature activation-3.67e+00

Pos logit contributions
 name+0.585
 names+0.514
+0.493
 They+0.475
 to+0.468
Neg logit contributions
eling-1.024
ited-0.907
i-0.895
ering-0.890
ie-0.860
Token his
Token ID465
Feature activation+5.09e+00

Pos logit contributions
 name+0.160
 names+0.142
+0.137
 to+0.132
 They+0.131
Neg logit contributions
eling-0.259
ited-0.232
i-0.225
ering-0.221
ie-0.214
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.404
i+0.368
ering+0.365
ited+0.356
ience+0.355
Neg logit contributions
Circ-0.113
 They-0.107
 Bang-0.105
-0.100
 name-0.098
Token and
Token ID290
Feature activation+2.66e+00

Pos logit contributions
 name+0.884
 names+0.779
+0.756
 to+0.728
 They+0.726
Neg logit contributions
eling-1.492
ited-1.323
i-1.287
ering-1.281
ie-1.230
Token how
Token ID703
Feature activation+6.26e+00

Pos logit contributions
eling+0.179
i+0.155
ering+0.153
ited+0.152
ie+0.148
Neg logit contributions
 name-0.112
 names-0.099
-0.096
Circ-0.096
 fence-0.094
Token to
Token ID284
Feature activation-1.43e+01

Pos logit contributions
eling+0.455
ering+0.400
i+0.398
ie+0.390
ience+0.383
Neg logit contributions
 name-0.239
 names-0.204
 fence-0.201
Circ-0.194
 Bang-0.193
Token draw
Token ID3197
Feature activation-7.23e+00

Pos logit contributions
 name+0.578
 names+0.496
+0.478
 They+0.448
 to+0.448
Neg logit contributions
eling-1.047
ited-0.952
ering-0.923
i-0.917
ience-0.895
Token a
Token ID257
Feature activation-4.29e+00

Pos logit contributions
 name+0.288
 names+0.252
+0.244
 They+0.234
 to+0.231
Neg logit contributions
eling-0.531
ited-0.473
i-0.463
ering-0.459
ie-0.444
Token a
Token ID257
Feature activation-4.27e+00

Pos logit contributions
eling+0.126
ited+0.114
i+0.113
ie+0.111
ering+0.110
Neg logit contributions
 name-0.049
 names-0.041
 fence-0.037
Circ-0.037
 Bang-0.036
Token little
Token ID1310
Feature activation-1.14e+00

Pos logit contributions
 name+0.166
 names+0.155
 Bang+0.138
 They+0.138
Circ+0.137
Neg logit contributions
eling-0.298
i-0.268
ited-0.267
ie-0.260
ering-0.258
Token girl
Token ID2576
Feature activation+5.01e+00

Pos logit contributions
 name+0.040
+0.033
 names+0.033
 They+0.032
 fence+0.031
Neg logit contributions
eling-0.091
ited-0.080
ering-0.079
i-0.078
ience-0.075
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
eling+0.281
ited+0.237
ering+0.231
i+0.226
heart+0.217
Neg logit contributions
 name-0.290
-0.265
 names-0.263
 to-0.257
 They-0.255
Token Her
Token ID2332
Feature activation+5.03e+00

Pos logit contributions
 name+0.093
Circ+0.091
 names+0.088
 fence+0.085
 road+0.085
Neg logit contributions
eling-0.172
i-0.168
ie-0.165
heart-0.163
ited-0.161
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.419
i+0.416
ie+0.389
ited+0.384
itive+0.384
Neg logit contributions
Circ-0.072
 hills-0.060
 Some-0.058
 Bang-0.055
 They-0.053
Token was
Token ID373
Feature activation+1.56e+00

Pos logit contributions
 name+0.686
+0.672
 to+0.551
 They+0.550
 names+0.526
Neg logit contributions
eling-1.720
ering-1.572
ited-1.565
heart-1.490
i-1.485
Token Claire
Token ID20758
Feature activation+7.39e+00

Pos logit contributions
eling+0.116
ited+0.103
ering+0.101
i+0.100
heart+0.096
Neg logit contributions
 name-0.063
 names-0.055
-0.055
 They-0.052
 to-0.050
Token and
Token ID290
Feature activation+2.66e+00

Pos logit contributions
eling+0.574
ited+0.511
i+0.506
ering+0.503
ie+0.488
Neg logit contributions
 name-0.261
 names-0.223
-0.214
 They-0.205
 to-0.200
Token she
Token ID673
Feature activation+7.74e+00

Pos logit contributions
eling+0.178
ited+0.169
ering+0.161
ience+0.149
heart+0.147
Neg logit contributions
 name-0.136
-0.125
 They-0.119
 to-0.116
 names-0.113
Token was
Token ID373
Feature activation+1.55e+00

Pos logit contributions
eling+0.539
ited+0.481
ering+0.480
heart+0.458
i+0.457
Neg logit contributions
 name-0.361
-0.328
 names-0.306
 They-0.303
 to-0.298
Token a
Token ID257
Feature activation-4.27e+00

Pos logit contributions
eling+0.125
i+0.112
ited+0.112
ie+0.109
ering+0.109
Neg logit contributions
 name-0.050
 names-0.043
 fence-0.038
Circ-0.038
 Bang-0.037
Token little
Token ID1310
Feature activation-1.14e+00

Pos logit contributions
 name+0.158
 names+0.140
+0.130
 They+0.127
Circ+0.125
Neg logit contributions
eling-0.319
ited-0.283
i-0.282
ering-0.278
ie-0.272
Token boy
Token ID2933
Feature activation-2.16e+00

Pos logit contributions
 name+0.039
 names+0.032
+0.032
 fence+0.031
 They+0.031
Neg logit contributions
eling-0.092
ited-0.081
ering-0.081
i-0.079
heart-0.076
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 name+0.124
+0.113
 names+0.112
 to+0.110
 They+0.109
Neg logit contributions
eling-0.122
ited-0.104
ering-0.100
i-0.098
ience-0.096
Token His
Token ID2399
Feature activation+1.35e+00

Pos logit contributions
 name+0.080
 names+0.076
 fence+0.072
 across+0.070
Circ+0.066
Neg logit contributions
eling-0.180
ited-0.168
i-0.168
heart-0.166
ie-0.163
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.102
i+0.102
ie+0.095
itive+0.093
ering+0.093
Neg logit contributions
Circ-0.024
 hills-0.020
 corner-0.019
 Some-0.019
 across-0.018
Token was
Token ID373
Feature activation+1.56e+00

Pos logit contributions
 name+0.766
+0.718
 They+0.619
 to+0.614
 names+0.611
Neg logit contributions
eling-1.656
ited-1.498
ering-1.482
i-1.428
heart-1.422
Token Sam
Token ID3409
Feature activation-6.90e+00

Pos logit contributions
eling+0.106
i+0.092
ited+0.092
ering+0.091
ie+0.089
Neg logit contributions
 name-0.069
 names-0.062
-0.058
 They-0.057
Circ-0.057
Token and
Token ID290
Feature activation+2.66e+00

Pos logit contributions
 name+0.192
+0.177
 to+0.158
 names+0.150
 They+0.148
Neg logit contributions
eling-0.597
ited-0.543
ering-0.525
ience-0.512
i-0.511
Token he
Token ID339
Feature activation+3.90e+00

Pos logit contributions
eling+0.191
ited+0.178
ering+0.170
i+0.161
heart+0.160
Neg logit contributions
 name-0.118
-0.110
 They-0.106
 to-0.101
 names-0.099
Token was
Token ID373
Feature activation+1.55e+00

Pos logit contributions
eling+0.258
ited+0.228
ering+0.225
i+0.219
heart+0.217
Neg logit contributions
 name-0.190
-0.171
 names-0.166
 They-0.162
 to-0.160
Token to
Token ID284
Feature activation-1.43e+01

Pos logit contributions
 name+0.009
 names+0.007
Circ+0.006
 Some+0.006
 Bang+0.006
Neg logit contributions
eling-0.030
i-0.029
ering-0.028
ie-0.028
ited-0.026
Token the
Token ID262
Feature activation+2.60e+00

Pos logit contributions
 name+0.689
 names+0.629
 fence+0.604
 across+0.590
 Bang+0.585
Neg logit contributions
eling-0.866
i-0.754
ie-0.738
ited-0.710
ering-0.679
Token park
Token ID3952
Feature activation-1.25e+01

Pos logit contributions
eling+0.185
ited+0.165
i+0.157
ie+0.157
ering+0.155
Neg logit contributions
 name-0.089
 They-0.085
Circ-0.081
 names-0.080
 This-0.076
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 name+0.558
 names+0.506
Circ+0.469
 Bang+0.463
 Some+0.429
Neg logit contributions
eling-0.825
ited-0.740
i-0.734
ering-0.733
ie-0.726
Token His
Token ID2399
Feature activation+1.35e+00

Pos logit contributions
 name+0.079
 names+0.078
Circ+0.070
 fence+0.065
 across+0.064
Neg logit contributions
eling-0.193
heart-0.179
ited-0.179
 chuckled-0.179
i-0.174
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.098
i+0.095
ering+0.091
ie+0.088
ited+0.086
Neg logit contributions
Circ-0.037
 They-0.030
 Some-0.029
 Bang-0.028
 Where-0.028
Token was
Token ID373
Feature activation+1.56e+00

Pos logit contributions
 name+0.764
+0.690
 to+0.638
 names+0.622
 They+0.622
Neg logit contributions
eling-1.634
ited-1.450
ering-1.434
i-1.411
heart-1.395
Token Sam
Token ID3409
Feature activation-6.90e+00

Pos logit contributions
eling+0.109
ited+0.096
i+0.095
ering+0.095
ie+0.091
Neg logit contributions
 name-0.066
 names-0.059
-0.057
 They-0.055
 to-0.054
Token.
Token ID13
Feature activation-2.84e+00

Pos logit contributions
 name+0.226
+0.200
 to+0.188
 names+0.185
 They+0.180
Neg logit contributions
eling-0.562
ited-0.505
ering-0.489
i-0.480
heart-0.477
Token He
Token ID679
Feature activation-6.89e+00

Pos logit contributions
 name+0.101
Circ+0.100
 names+0.099
 fence+0.091
 west+0.089
Neg logit contributions
eling-0.166
heart-0.151
 chuckled-0.151
ited-0.151
i-0.149
Token saw
Token ID2497
Feature activation-3.22e-01

Pos logit contributions
 name+0.331
+0.298
 names+0.286
 to+0.284
 They+0.280
Neg logit contributions
eling-0.459
ited-0.401
ering-0.400
heart-0.389
i-0.384
Token when
Token ID618
Feature activation+4.20e+00

Pos logit contributions
 name+0.816
 names+0.729
+0.725
 to+0.711
 They+0.698
Neg logit contributions
eling-1.175
ited-1.021
i-0.995
ering-0.986
ience-0.964
Token her
Token ID607
Feature activation+9.96e+00

Pos logit contributions
eling+0.215
ited+0.179
i+0.177
ering+0.175
ie+0.167
Neg logit contributions
 name-0.258
 names-0.237
-0.230
 They-0.225
Circ-0.224
Token mom
Token ID1995
Feature activation-5.76e+00

Pos logit contributions
eling+0.618
i+0.581
ie+0.556
ering+0.545
ited+0.540
Neg logit contributions
Circ-0.400
 name-0.400
 They-0.390
 Bang-0.379
-0.371
Token called
Token ID1444
Feature activation+4.72e+00

Pos logit contributions
 name+0.298
+0.270
 names+0.268
 to+0.263
 They+0.256
Neg logit contributions
eling-0.352
ited-0.302
ering-0.295
i-0.291
ience-0.284
Token her
Token ID607
Feature activation+9.96e+00

Pos logit contributions
eling+0.238
i+0.202
ie+0.200
ering+0.199
heart+0.192
Neg logit contributions
 name-0.270
Circ-0.250
 names-0.250
 fence-0.241
 Bang-0.241
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.597
i+0.588
ie+0.553
ering+0.549
itive+0.534
Neg logit contributions
Circ-0.361
 Bang-0.348
 name-0.343
 road-0.343
 Some-0.334
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 name+0.802
 names+0.698
+0.666
 They+0.644
Circ+0.632
Neg logit contributions
eling-1.546
ited-1.370
i-1.358
ering-1.349
ie-1.310
Token She
Token ID1375
Feature activation-2.19e+00

Pos logit contributions
 name+0.094
Circ+0.090
 names+0.089
 fence+0.083
 road+0.081
Neg logit contributions
eling-0.174
heart-0.166
i-0.159
ited-0.156
ie-0.151
Token didn
Token ID1422
Feature activation+1.19e+01

Pos logit contributions
 name+0.102
+0.093
 to+0.089
 They+0.087
 names+0.086
Neg logit contributions
eling-0.149
ering-0.131
ited-0.130
itive-0.124
i-0.123
Token't
Token ID470
Feature activation-8.35e-01

Pos logit contributions
eling+0.656
i+0.535
ie+0.530
ering+0.524
ise+0.516
Neg logit contributions
 name-0.585
 names-0.578
 fence-0.534
 yard-0.520
Circ-0.503
Token want
Token ID765
Feature activation-1.19e+00

Pos logit contributions
 name+0.036
 names+0.032
+0.031
 They+0.030
 to+0.030
Neg logit contributions
eling-0.059
ited-0.052
ering-0.051
i-0.051
ience-0.049
Token said
Token ID531
Feature activation+5.97e+00

Pos logit contributions
+0.049
 name+0.049
 to+0.047
 They+0.042
 names+0.040
Neg logit contributions
eling-0.074
ited-0.063
ering-0.060
itive-0.059
heart-0.058
Token "
Token ID366
Feature activation-7.94e+00

Pos logit contributions
eling+0.383
ited+0.335
i+0.313
ering+0.310
heart+0.297
Neg logit contributions
 name-0.314
 to-0.289
-0.284
 names-0.279
 They-0.270
TokenHello
Token ID15496
Feature activation-1.58e+00

Pos logit contributions
 name+0.300
 names+0.261
+0.248
 They+0.240
 to+0.235
Neg logit contributions
eling-0.598
ited-0.532
i-0.526
ering-0.522
ie-0.507
Token!
Token ID0
Feature activation+9.05e+00

Pos logit contributions
 name+0.095
 names+0.086
+0.085
 to+0.082
 They+0.082
Neg logit contributions
eling-0.087
ited-0.073
ering-0.071
i-0.069
ience-0.066
Token My
Token ID2011
Feature activation+9.47e+00

Pos logit contributions
eling+0.574
i+0.524
heart+0.524
ited+0.513
ience+0.509
Neg logit contributions
 name-0.353
 names-0.341
 fence-0.322
 road-0.318
 yard-0.312
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.842
i+0.812
heart+0.795
ie+0.784
ering+0.775
Neg logit contributions
Circ-0.097
 They-0.084
 Some-0.071
 Along-0.067
 hills-0.057
Token is
Token ID318
Feature activation+2.53e+00

Pos logit contributions
 name+0.678
+0.564
 names+0.544
 to+0.526
 They+0.488
Neg logit contributions
eling-1.716
ited-1.561
ering-1.543
i-1.510
ience-1.454
Token Fox
Token ID5426
Feature activation-3.91e+00

Pos logit contributions
eling+0.171
ited+0.150
i+0.147
ering+0.147
ie+0.141
Neg logit contributions
 name-0.115
 names-0.103
-0.100
 They-0.096
 to-0.095
Token and
Token ID290
Feature activation+2.66e+00

Pos logit contributions
 name+0.209
 names+0.188
+0.185
 They+0.179
 to+0.179
Neg logit contributions
eling-0.236
ited-0.203
ering-0.198
i-0.194
heart-0.185
Token I
Token ID314
Feature activation-1.08e+00

Pos logit contributions
eling+0.188
ited+0.168
ering+0.164
i+0.162
ie+0.155
Neg logit contributions
 name-0.116
 names-0.100
-0.099
 to-0.095
 They-0.095
Token love
Token ID1842
Feature activation-1.40e+00

Pos logit contributions
 name+0.037
 names+0.032
+0.030
 They+0.029
 to+0.028
Neg logit contributions
eling-0.086
ited-0.076
ering-0.076
i-0.076
ie-0.073
Token could
Token ID714
Feature activation-3.74e+00

Pos logit contributions
eling+0.520
ited+0.445
ering+0.439
i+0.432
heart+0.430
Neg logit contributions
 name-0.366
-0.331
 names-0.325
 to-0.322
 across-0.312
Token hear
Token ID3285
Feature activation-4.15e+00

Pos logit contributions
 name+0.170
 names+0.155
+0.146
Circ+0.145
 They+0.145
Neg logit contributions
eling-0.247
ited-0.214
i-0.212
ering-0.211
ie-0.202
Token someone
Token ID2130
Feature activation-7.34e+00

Pos logit contributions
 name+0.145
 names+0.123
+0.117
 They+0.113
Circ+0.112
Neg logit contributions
eling-0.321
ited-0.284
i-0.283
ering-0.281
ie-0.274
Token repeating
Token ID20394
Feature activation+7.86e+00

Pos logit contributions
 name+0.380
 names+0.339
+0.338
 to+0.331
 They+0.322
Neg logit contributions
eling-0.445
ited-0.388
i-0.378
ering-0.373
ience-0.366
Token her
Token ID607
Feature activation+9.96e+00

Pos logit contributions
eling+0.561
ited+0.500
i+0.488
ering+0.484
ie+0.471
Neg logit contributions
 name-0.329
 names-0.291
-0.279
 to-0.269
 They-0.269
Token name
Token ID1438
Feature activation-2.09e+01

Pos logit contributions
eling+0.858
i+0.804
ering+0.803
ience+0.762
itive+0.761
Neg logit contributions
Circ-0.165
 Some-0.134
 They-0.128
-0.126
 Bang-0.124
Token over
Token ID625
Feature activation-7.09e+00

Pos logit contributions
 name+0.767
 names+0.659
+0.639
 to+0.615
 They+0.601
Neg logit contributions
eling-1.611
ited-1.429
i-1.400
ering-1.394
ie-1.344
Token and
Token ID290
Feature activation+2.65e+00

Pos logit contributions
 name+0.328
 names+0.292
+0.278
Circ+0.277
 They+0.275
Neg logit contributions
eling-0.465
i-0.404
ited-0.403
ering-0.397
ie-0.387
Token over
Token ID625
Feature activation-7.08e+00

Pos logit contributions
eling+0.167
ited+0.146
i+0.142
ering+0.141
ie+0.136
Neg logit contributions
 name-0.134
 names-0.120
-0.118
 to-0.115
 They-0.114
Token.
Token ID13
Feature activation-2.87e+00

Pos logit contributions
 name+0.305
 names+0.268
Circ+0.252
+0.250
 They+0.249
Neg logit contributions
eling-0.490
ited-0.430
i-0.430
ering-0.425
ie-0.413
Token "
Token ID366
Feature activation-7.94e+00

Pos logit contributions
 road+0.092
 name+0.092
 names+0.083
Circ+0.082
 fence+0.081
Neg logit contributions
eling-0.172
heart-0.162
i-0.160
 chuckled-0.154
ience-0.150
Token "
Token ID366
Feature activation-7.21e+00

Pos logit contributions
eling+1.009
i+0.918
ering+0.913
ited+0.895
ie+0.886
Neg logit contributions
 name-0.304
 names-0.266
Circ-0.237
 fence-0.219
 road-0.213
TokenMy
Token ID3666
Feature activation+8.93e+00

Pos logit contributions
 name+0.302
 names+0.272
 fence+0.243
 to+0.240
 yard+0.238
Neg logit contributions
eling-0.501
i-0.447
ited-0.447
ie-0.433
ering-0.427
Token name
Token ID1438
Feature activation-2.01e+01

Pos logit contributions
i+0.809
eling+0.806
ie+0.764
heart+0.753
ering+0.751
Neg logit contributions
Circ-0.098
 Along-0.066
 They-0.059
 Some-0.053
 Bang-0.046
Token is
Token ID318
Feature activation+3.30e+00

Pos logit contributions
 name+0.666
+0.566
 names+0.522
 to+0.513
 They+0.489
Neg logit contributions
eling-1.670
ited-1.543
ering-1.503
i-1.451
heart-1.422
Token Tweet
Token ID18752
Feature activation-1.47e+00

Pos logit contributions
eling+0.215
i+0.190
ited+0.185
ering+0.184
ie+0.182
Neg logit contributions
 name-0.150
 names-0.135
-0.126
 They-0.125
Circ-0.125
Tokenie
Token ID494
Feature activation+2.79e+01

Pos logit contributions
 name+0.085
 to+0.077
+0.077
 names+0.076
 They+0.073
Neg logit contributions
eling-0.085
ited-0.072
ering-0.068
ieved-0.062
ience-0.062
Token.
Token ID13
Feature activation-2.06e+00

Pos logit contributions
eling+2.240
ited+2.007
ering+1.955
i+1.921
heart+1.868
Neg logit contributions
 name-0.961
 names-0.807
-0.788
 to-0.757
 They-0.741
Token I
Token ID314
Feature activation-2.41e-01

Pos logit contributions
 name+0.082
 names+0.078
Circ+0.075
 yard+0.072
 road+0.072
Neg logit contributions
eling-0.124
i-0.118
heart-0.116
ie-0.115
ience-0.113
Token am
Token ID716
Feature activation-2.32e+00

Pos logit contributions
 name+0.009
 names+0.007
+0.007
 They+0.007
 to+0.007
Neg logit contributions
eling-0.019
ited-0.017
ering-0.017
i-0.016
heart-0.016
Token looking
Token ID2045
Feature activation+2.38e+00

Pos logit contributions
 name+0.101
 names+0.090
+0.086
 They+0.084
Circ+0.082
Neg logit contributions
eling-0.160
ited-0.140
i-0.139
ering-0.138
ie-0.134
Token for
Token ID329
Feature activation+3.33e+00

Pos logit contributions
eling+0.149
i+0.131
ited+0.129
ering+0.129
ie+0.127
Neg logit contributions
 name-0.113
 names-0.103
Circ-0.098
-0.098
 They-0.097
Token "
Token ID366
Feature activation-7.87e+00

Pos logit contributions
eling+0.824
ering+0.765
i+0.765
ie+0.720
ience+0.716
Neg logit contributions
 name-0.354
 names-0.336
Circ-0.313
 yard-0.290
 road-0.286
TokenHello
Token ID15496
Feature activation-1.53e+00

Pos logit contributions
 name+0.302
 names+0.263
 to+0.239
 They+0.237
+0.232
Neg logit contributions
eling-0.582
ited-0.536
i-0.522
ie-0.508
heart-0.504
Token,
Token ID11
Feature activation+1.14e+01

Pos logit contributions
 name+0.088
 names+0.081
+0.078
 They+0.077
 to+0.076
Neg logit contributions
eling-0.084
ited-0.071
i-0.070
ering-0.070
ie-0.067
Token little
Token ID1310
Feature activation-1.10e+00

Pos logit contributions
eling+0.819
i+0.729
ie+0.718
ited+0.713
ering+0.705
Neg logit contributions
 name-0.406
 names-0.379
Circ-0.357
 They-0.350
-0.336
Token bird
Token ID6512
Feature activation-7.25e+00

Pos logit contributions
 name+0.036
 names+0.031
+0.029
 They+0.028
Circ+0.027
Neg logit contributions
eling-0.087
ited-0.078
i-0.078
ering-0.077
ie-0.076
Tokenie
Token ID494
Feature activation+2.71e+01

Pos logit contributions
 name+0.399
+0.389
 to+0.382
 fence+0.347
 They+0.346
Neg logit contributions
eling-0.416
ited-0.356
ering-0.338
ience-0.334
ieved-0.314
Token!
Token ID0
Feature activation+9.08e+00

Pos logit contributions
eling+1.827
ited+1.592
ering+1.533
i+1.460
ience+1.439
Neg logit contributions
 name-1.306
-1.175
 to-1.157
 names-1.107
 fence-1.100
Token Do
Token ID2141
Feature activation-9.97e-01

Pos logit contributions
eling+0.493
ience+0.460
i+0.456
heart+0.452
ited+0.434
Neg logit contributions
 names-0.363
 name-0.350
 yard-0.348
 road-0.347
 stairs-0.335
Token you
Token ID345
Feature activation+6.45e+00

Pos logit contributions
 name+0.055
+0.051
 names+0.051
 to+0.048
 They+0.048
Neg logit contributions
eling-0.059
ited-0.053
ering-0.050
i-0.049
heart-0.048
Token want
Token ID765
Feature activation-1.17e+00

Pos logit contributions
eling+0.461
ering+0.409
i+0.398
ited+0.394
ience+0.392
Neg logit contributions
 name-0.289
 names-0.247
-0.234
 to-0.229
 They-0.217
Token to
Token ID284
Feature activation-1.43e+01

Pos logit contributions
 name+0.028
 They+0.023
 names+0.021
 fence+0.020
 Along+0.019
Neg logit contributions
eling-0.089
ie-0.087
i-0.084
heart-0.080
ering-0.080
Token get
Token ID651
Feature activation-3.17e+00

Pos logit contributions
eling+0.188
i+0.164
ited+0.161
ering+0.160
ie+0.159
Neg logit contributions
 name-0.100
 names-0.095
Circ-0.087
 Bang-0.086
 Some-0.082
Token a
Token ID257
Feature activation-4.29e+00

Pos logit contributions
 name+0.138
 names+0.122
 fence+0.116
Circ+0.114
+0.113
Neg logit contributions
eling-0.219
i-0.190
ering-0.187
ited-0.185
ie-0.183
Token new
Token ID649
Feature activation-7.11e-01

Pos logit contributions
 names+0.161
 name+0.161
 They+0.157
+0.145
 Some+0.145
Neg logit contributions
eling-0.275
ited-0.253
ering-0.245
ie-0.243
i-0.242
Token rubber
Token ID14239
Feature activation-1.25e+00

Pos logit contributions
 name+0.027
 names+0.026
Circ+0.025
 They+0.024
+0.024
Neg logit contributions
eling-0.047
i-0.043
ie-0.042
ering-0.042
ited-0.042
Token duck
Token ID22045
Feature activation-2.29e+00

Pos logit contributions
 name+0.045
 names+0.039
+0.037
 They+0.036
Circ+0.035
Neg logit contributions
eling-0.095
i-0.084
ited-0.084
ering-0.083
ie-0.082
Tokenie
Token ID494
Feature activation+2.71e+01

Pos logit contributions
 name+0.093
 names+0.081
+0.078
 to+0.076
 They+0.075
Neg logit contributions
eling-0.167
ited-0.147
ering-0.144
i-0.144
heart-0.138
Token for
Token ID329
Feature activation+2.49e+00

Pos logit contributions
eling+1.974
ited+1.750
i+1.749
ering+1.726
ie+1.685
Neg logit contributions
 name-1.052
 names-0.918
-0.874
 They-0.849
Circ-0.847
Token you
Token ID345
Feature activation+6.43e+00

Pos logit contributions
eling+0.163
ited+0.146
i+0.141
ering+0.141
ie+0.136
Neg logit contributions
 name-0.117
 names-0.105
-0.103
 They-0.099
 to-0.099
Token to
Token ID284
Feature activation-1.43e+01

Pos logit contributions
eling+0.477
i+0.430
ited+0.427
ering+0.422
ie+0.413
Neg logit contributions
 name-0.233
 names-0.201
Circ-0.196
 Bang-0.189
-0.188
Token play
Token ID711
Feature activation-1.25e+01

Pos logit contributions
 name+0.474
 names+0.397
+0.381
 They+0.363
 to+0.354
Neg logit contributions
eling-1.141
ited-1.023
ering-1.013
i-1.010
ie-0.976
Token with
Token ID351
Feature activation-3.83e+00

Pos logit contributions
 name+0.530
 names+0.475
Circ+0.436
+0.432
 They+0.427
Neg logit contributions
eling-0.868
ering-0.759
i-0.752
ited-0.752
ie-0.740
Token girl
Token ID2576
Feature activation+5.11e+00

Pos logit contributions
 name+0.037
 names+0.032
+0.031
 Bang+0.029
 They+0.029
Neg logit contributions
eling-0.078
i-0.071
ited-0.070
ie-0.069
ering-0.069
Token.
Token ID13
Feature activation-2.74e+00

Pos logit contributions
eling+0.296
ited+0.249
ering+0.244
i+0.238
ience+0.228
Neg logit contributions
 name-0.290
-0.264
 names-0.262
 to-0.258
 They-0.256
Token She
Token ID1375
Feature activation-2.07e+00

Pos logit contributions
 name+0.106
Circ+0.102
Ar+0.100
 names+0.099
 fence+0.098
Neg logit contributions
heart-0.140
ie-0.138
eling-0.138
i-0.134
ited-0.134
Token was
Token ID373
Feature activation+1.68e+00

Pos logit contributions
 name+0.103
+0.093
 names+0.090
 They+0.088
 to+0.087
Neg logit contributions
eling-0.136
ited-0.119
ering-0.118
i-0.116
heart-0.114
Token three
Token ID1115
Feature activation-3.36e+00

Pos logit contributions
eling+0.119
ited+0.105
i+0.104
ering+0.103
ie+0.100
Neg logit contributions
 name-0.070
 names-0.062
-0.060
 They-0.058
 to-0.057
Token years
Token ID812
Feature activation+1.89e+01

Pos logit contributions
 name+0.212
+0.208
 to+0.205
 They+0.200
 big+0.200
Neg logit contributions
eling-0.153
ited-0.128
ering-0.118
ieved-0.116
iable-0.110
Token old
Token ID1468
Feature activation-3.04e+00

Pos logit contributions
eling+1.287
i+1.177
ie+1.172
ering+1.148
ited+1.136
Neg logit contributions
 name-0.751
 names-0.692
 Bang-0.638
Circ-0.627
-0.609
Token,
Token ID11
Feature activation+1.15e+01

Pos logit contributions
 name+0.124
 names+0.109
+0.107
 They+0.103
 to+0.102
Neg logit contributions
eling-0.225
ited-0.198
ering-0.195
i-0.192
ie-0.185
Token and
Token ID290
Feature activation+2.82e+00

Pos logit contributions
eling+0.808
ited+0.708
i+0.695
ering+0.695
ie+0.669
Neg logit contributions
 name-0.509
 names-0.447
-0.442
 to-0.425
 They-0.424
Token she
Token ID673
Feature activation+7.94e+00

Pos logit contributions
eling+0.201
ited+0.185
ering+0.177
i+0.171
ience+0.169
Neg logit contributions
 name-0.132
-0.121
 They-0.113
 names-0.111
 to-0.110
Token lived
Token ID5615
Feature activation+4.76e+00

Pos logit contributions
eling+0.536
ited+0.472
ering+0.467
i+0.455
heart+0.451
Neg logit contributions
 name-0.384
-0.347
 names-0.334
 to-0.329
 They-0.326
Token a
Token ID257
Feature activation-4.20e+00

Pos logit contributions
eling+0.127
i+0.113
ited+0.113
ie+0.111
ering+0.110
Neg logit contributions
 name-0.054
 names-0.047
 fence-0.042
Circ-0.042
 Bang-0.041
Token little
Token ID1310
Feature activation-1.05e+00

Pos logit contributions
 name+0.153
 names+0.133
+0.126
 They+0.122
 to+0.119
Neg logit contributions
eling-0.318
ited-0.283
i-0.280
ering-0.278
ie-0.271
Token girl
Token ID2576
Feature activation+5.08e+00

Pos logit contributions
 name+0.038
 names+0.031
+0.031
 They+0.030
 fence+0.030
Neg logit contributions
eling-0.083
ited-0.073
ering-0.073
i-0.071
ience-0.068
Token named
Token ID3706
Feature activation+7.16e+00

Pos logit contributions
eling+0.308
ited+0.263
ering+0.256
i+0.244
ience+0.243
Neg logit contributions
 name-0.276
-0.258
 to-0.249
 They-0.244
 names-0.243
Token Mill
Token ID9212
Feature activation+7.07e+00

Pos logit contributions
eling+0.426
ie+0.388
i+0.384
ited+0.375
ering+0.366
Neg logit contributions
 name-0.345
 names-0.308
Circ-0.304
 road-0.288
 Bang-0.287
Tokenie
Token ID494
Feature activation+2.71e+01

Pos logit contributions
eling+0.186
ited+0.130
ience+0.120
ering+0.116
ises+0.103
Neg logit contributions
 name-0.580
-0.569
 to-0.566
 names-0.554
 corner-0.543
Token.
Token ID13
Feature activation-2.80e+00

Pos logit contributions
eling+2.038
i+1.819
ited+1.803
ering+1.781
ie+1.768
Neg logit contributions
 name-1.007
 names-0.882
-0.809
Circ-0.806
 Bang-0.795
Token She
Token ID1375
Feature activation-2.16e+00

Pos logit contributions
 name+0.100
 names+0.088
Circ+0.084
 Bang+0.083
 fence+0.079
Neg logit contributions
eling-0.199
i-0.184
ie-0.183
heart-0.180
ited-0.178
Token was
Token ID373
Feature activation+1.60e+00

Pos logit contributions
 name+0.108
+0.098
 names+0.094
 They+0.092
 to+0.091
Neg logit contributions
eling-0.141
ited-0.124
ering-0.122
i-0.119
heart-0.118
Token three
Token ID1115
Feature activation-3.46e+00

Pos logit contributions
eling+0.110
i+0.096
ited+0.094
ering+0.093
ie+0.092
Neg logit contributions
 name-0.067
 names-0.061
-0.057
Circ-0.057
 Bang-0.057
Token years
Token ID812
Feature activation+1.88e+01

Pos logit contributions
 name+0.224
+0.219
 to+0.214
 They+0.212
 fence+0.212
Neg logit contributions
eling-0.155
ited-0.130
ering-0.118
ieved-0.117
ience-0.112
Token and
Token ID290
Feature activation+2.88e+00

Pos logit contributions
eling+0.011
ited+0.010
i+0.009
ering+0.009
heart+0.009
Neg logit contributions
 name-0.007
 to-0.006
-0.006
 names-0.006
 fence-0.006
Token said
Token ID531
Feature activation+6.19e+00

Pos logit contributions
eling+0.167
ited+0.147
ering+0.139
itive+0.133
ience+0.132
Neg logit contributions
 name-0.165
-0.151
 to-0.149
 names-0.145
 They-0.142
Token,
Token ID11
Feature activation+1.16e+01

Pos logit contributions
eling+0.358
ited+0.312
i+0.287
ering+0.285
ie+0.272
Neg logit contributions
 name-0.364
-0.330
 names-0.329
 to-0.327
 They-0.315
Token "
Token ID366
Feature activation-7.68e+00

Pos logit contributions
eling+0.891
ering+0.816
i+0.815
ience+0.781
ie+0.766
Neg logit contributions
 name-0.337
 names-0.309
Circ-0.287
 yard-0.264
 road-0.264
TokenS
Token ID50
Feature activation+4.25e+00

Pos logit contributions
 name+0.305
 names+0.264
 They+0.246
+0.244
 fence+0.239
Neg logit contributions
eling-0.567
ited-0.510
i-0.500
ering-0.488
heart-0.481
Tokenara
Token ID3301
Feature activation+1.37e+01

Pos logit contributions
eling+0.256
ering+0.215
itive+0.209
ited+0.209
ience+0.206
Neg logit contributions
 name-0.221
 names-0.204
-0.193
 to-0.189
 They-0.178
Token,
Token ID11
Feature activation+1.17e+01

Pos logit contributions
eling+0.890
ited+0.770
ering+0.715
ieved+0.683
itive+0.643
Neg logit contributions
 name-0.769
 to-0.685
-0.674
 names-0.663
 They-0.645
Token I
Token ID314
Feature activation-7.87e-01

Pos logit contributions
eling+0.727
ited+0.632
i+0.619
ering+0.615
ie+0.590
Neg logit contributions
 name-0.595
 names-0.534
-0.518
 They-0.501
 to-0.500
Token give
Token ID1577
Feature activation+5.31e-01

Pos logit contributions
 name+0.029
+0.023
 names+0.023
 to+0.021
 They+0.020
Neg logit contributions
eling-0.064
ering-0.057
ited-0.056
i-0.055
heart-0.054
Token you
Token ID345
Feature activation+6.71e+00

Pos logit contributions
eling+0.031
ited+0.026
i+0.026
ering+0.026
ie+0.025
Neg logit contributions
 name-0.029
 names-0.026
-0.026
 They-0.025
Circ-0.025
Token the
Token ID262
Feature activation+2.91e+00

Pos logit contributions
eling+0.510
i+0.455
ited+0.453
ering+0.443
ie+0.440
Neg logit contributions
 name-0.229
 names-0.194
-0.192
 They-0.192
 Bang-0.186
Token special
Token ID2041
Feature activation-1.37e+00

Pos logit contributions
eling+0.460
ited+0.409
i+0.404
ering+0.402
ie+0.393
Neg logit contributions
 name-0.242
 names-0.220
-0.212
Circ-0.211
 Bang-0.209
Token object
Token ID2134
Feature activation-9.08e-01

Pos logit contributions
 name+0.048
 names+0.045
 They+0.044
Circ+0.044
+0.044
Neg logit contributions
eling-0.094
i-0.086
ie-0.085
ited-0.084
ering-0.084
Token.
Token ID13
Feature activation-2.29e+00

Pos logit contributions
 name+0.042
 names+0.038
+0.036
Circ+0.036
 They+0.035
Neg logit contributions
eling-0.059
i-0.052
ering-0.051
ited-0.051
ie-0.050
Token Every
Token ID3887
Feature activation-2.57e+00

Pos logit contributions
 across+0.072
 names+0.071
 fence+0.070
Circ+0.069
 name+0.068
Neg logit contributions
heart-0.129
eling-0.129
mmm-0.127
i-0.125
ie-0.124
Token day
Token ID1110
Feature activation-2.71e+00

Pos logit contributions
 name+0.107
 names+0.100
 Bang+0.092
 They+0.089
Circ+0.088
Neg logit contributions
eling-0.172
ited-0.156
ie-0.155
i-0.154
heart-0.149
Token,
Token ID11
Feature activation+1.19e+01

Pos logit contributions
 name+0.140
 to+0.136
+0.133
 names+0.125
 They+0.123
Neg logit contributions
eling-0.185
ited-0.162
ering-0.154
eking-0.146
ieved-0.145
Token the
Token ID262
Feature activation+3.06e+00

Pos logit contributions
eling+0.674
ited+0.579
ering+0.564
i+0.560
ie+0.529
Neg logit contributions
 name-0.680
 names-0.618
-0.607
 They-0.589
 to-0.588
Token girl
Token ID2576
Feature activation+5.43e+00

Pos logit contributions
eling+0.205
i+0.181
ited+0.179
ie+0.178
ering+0.177
Neg logit contributions
 name-0.122
 names-0.113
 They-0.110
-0.110
Circ-0.109
Token would
Token ID561
Feature activation+1.11e+00

Pos logit contributions
eling+0.368
ited+0.317
ering+0.316
heart+0.303
i+0.303
Neg logit contributions
 name-0.254
 names-0.227
 to-0.223
-0.221
 They-0.215
Token un
Token ID555
Feature activation-6.22e+00

Pos logit contributions
eling+0.084
ited+0.075
i+0.074
ering+0.074
ie+0.071
Neg logit contributions
 name-0.041
 names-0.036
-0.034
 They-0.033
 to-0.032
Tokenpack
Token ID8002
Feature activation-2.11e+00

Pos logit contributions
 name+0.256
 names+0.228
+0.220
 They+0.214
 Bang+0.213
Neg logit contributions
eling-0.442
ited-0.390
ering-0.389
i-0.387
ie-0.376
Token excited
Token ID6568
Feature activation+1.19e+00

Pos logit contributions
eling+0.568
ited+0.500
i+0.497
ering+0.488
ie+0.486
Neg logit contributions
 name-0.374
-0.323
 names-0.323
 They-0.307
 to-0.303
Token!
Token ID0
Feature activation+9.14e+00

Pos logit contributions
eling+0.087
ited+0.078
i+0.074
ering+0.074
ience+0.073
Neg logit contributions
 name-0.047
 to-0.042
-0.042
 names-0.042
 They-0.040
Token
Token ID198
Feature activation-1.02e+01

Pos logit contributions
eling+0.514
heart+0.509
i+0.487
 chuckled+0.481
mmm+0.475
Neg logit contributions
 names-0.309
 name-0.280
 fence-0.263
tr-0.257
Circ-0.244
Token
Token ID198
Feature activation-1.02e+01

Pos logit contributions
 names+0.265
 name+0.262
 fence+0.220
tr+0.200
 road+0.189
Neg logit contributions
eling-0.792
ering-0.727
ieved-0.696
i-0.693
ie-0.690
TokenHe
Token ID1544
Feature activation-1.83e-01

Pos logit contributions
 name+0.494
 names+0.444
+0.422
 They+0.414
 to+0.407
Neg logit contributions
eling-0.661
ited-0.574
i-0.564
ering-0.563
ie-0.540
Token laughed
Token ID13818
Feature activation+1.40e+01

Pos logit contributions
 name+0.010
 to+0.009
+0.009
 names+0.009
 They+0.009
Neg logit contributions
eling-0.011
ited-0.009
ering-0.009
heart-0.009
i-0.009
Token and
Token ID290
Feature activation+2.78e+00

Pos logit contributions
eling+0.957
ited+0.837
i+0.820
ering+0.814
ie+0.787
Neg logit contributions
 name-0.641
 names-0.567
-0.555
 They-0.537
 to-0.534
Token pointed
Token ID6235
Feature activation+6.54e+00

Pos logit contributions
eling+0.164
ited+0.141
ering+0.135
ie+0.132
heart+0.131
Neg logit contributions
 name-0.155
-0.147
 to-0.145
 They-0.143
 names-0.139
Token at
Token ID379
Feature activation+1.13e+00

Pos logit contributions
eling+0.467
i+0.422
ering+0.421
ited+0.414
ie+0.404
Neg logit contributions
 name-0.244
 names-0.214
Circ-0.210
 fence-0.206
 They-0.203
Token the
Token ID262
Feature activation+2.75e+00

Pos logit contributions
eling+0.071
i+0.060
ering+0.060
ited+0.059
ie+0.059
Neg logit contributions
 name-0.055
 names-0.049
Circ-0.047
 They-0.046
-0.046
Token video
Token ID2008
Feature activation-6.39e+00

Pos logit contributions
eling+0.187
ited+0.168
ering+0.168
ie+0.160
heart+0.159
Neg logit contributions
 name-0.099
Circ-0.090
-0.086
 They-0.086
 names-0.085
Token soon
Token ID2582
Feature activation+9.41e+00

Pos logit contributions
eling+0.571
ited+0.521
ering+0.497
i+0.485
heart+0.480
Neg logit contributions
-0.313
 name-0.310
 to-0.292
 They-0.266
 big-0.261
Token as
Token ID355
Feature activation+7.74e+00

Pos logit contributions
eling+0.610
ering+0.540
ited+0.521
i+0.521
heart+0.518
Neg logit contributions
 name-0.454
 to-0.452
-0.437
 They-0.419
 across-0.405
Token he
Token ID339
Feature activation+4.76e+00

Pos logit contributions
eling+0.550
ited+0.496
ering+0.476
i+0.471
heart+0.458
Neg logit contributions
 name-0.332
-0.314
 to-0.302
 names-0.287
 They-0.283
Token touched
Token ID12615
Feature activation-2.06e+00

Pos logit contributions
eling+0.304
ering+0.263
heart+0.258
i+0.255
ited+0.252
Neg logit contributions
 name-0.232
-0.219
 to-0.210
 names-0.202
 across-0.194
Token it
Token ID340
Feature activation+1.74e+00

Pos logit contributions
 name+0.093
 names+0.081
+0.081
 to+0.078
 They+0.077
Neg logit contributions
eling-0.142
ited-0.127
i-0.122
ering-0.121
heart-0.117
Token,
Token ID11
Feature activation+1.23e+01

Pos logit contributions
eling+0.100
ited+0.087
i+0.082
ering+0.079
heart+0.078
Neg logit contributions
 name-0.100
-0.093
 to-0.092
 names-0.088
 They-0.087
Token the
Token ID262
Feature activation+3.50e+00

Pos logit contributions
eling+0.834
ited+0.731
i+0.722
ering+0.717
ie+0.693
Neg logit contributions
 name-0.550
 names-0.489
-0.473
 They-0.458
 to-0.452
Token weapon
Token ID4282
Feature activation-8.83e+00

Pos logit contributions
eling+0.206
ering+0.180
ie+0.179
ited+0.175
i+0.172
Neg logit contributions
Circ-0.148
 They-0.144
 name-0.139
 names-0.137
iffs-0.132
Token disappeared
Token ID12120
Feature activation-2.82e+00

Pos logit contributions
 name+0.433
 to+0.392
+0.385
 names+0.379
 across+0.370
Neg logit contributions
eling-0.561
ited-0.488
heart-0.480
i-0.478
ering-0.463
Token!
Token ID0
Feature activation+9.99e+00

Pos logit contributions
 name+0.124
 names+0.109
+0.106
 to+0.106
 They+0.102
Neg logit contributions
eling-0.197
ited-0.173
i-0.167
ering-0.167
ience-0.162
Token The
Token ID383
Feature activation-2.14e+00

Pos logit contributions
eling+0.617
ience+0.556
heart+0.553
mmm+0.552
 chuckled+0.536
Neg logit contributions
 name-0.319
 names-0.317
 fence-0.255
 corner-0.249
Circ-0.242
Token go
Token ID467
Feature activation-4.45e+00

Pos logit contributions
eling+0.694
ited+0.613
ering+0.605
i+0.604
heart+0.575
Neg logit contributions
 name-0.407
 names-0.353
-0.338
 to-0.325
 They-0.322
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
 name+0.181
 names+0.163
Circ+0.153
 Bang+0.151
 They+0.148
Neg logit contributions
eling-0.314
i-0.278
ited-0.275
ering-0.275
ie-0.271
Token But
Token ID887
Feature activation+8.95e-01

Pos logit contributions
 name+0.013
 names+0.012
Circ+0.012
 fence+0.012
 to+0.012
Neg logit contributions
eling-0.026
i-0.023
ited-0.023
ience-0.023
heart-0.022
Token don
Token ID836
Feature activation+4.50e+00

Pos logit contributions
eling+0.058
ited+0.054
ering+0.049
itive+0.049
ience+0.049
Neg logit contributions
 name-0.044
-0.039
 to-0.038
 names-0.037
 They-0.035
Token't
Token ID470
Feature activation+1.03e+00

Pos logit contributions
eling+0.340
ited+0.301
i+0.299
ering+0.296
ie+0.288
Neg logit contributions
 name-0.167
 names-0.145
-0.138
 They-0.133
Circ-0.131
Token forget
Token ID6044
Feature activation+1.00e+01

Pos logit contributions
eling+0.069
ited+0.060
ering+0.059
i+0.058
ience+0.056
Neg logit contributions
 name-0.049
 names-0.043
-0.043
 to-0.042
 They-0.041
Token to
Token ID284
Feature activation-1.27e+01

Pos logit contributions
eling+0.812
ited+0.725
i+0.721
ering+0.717
ie+0.698
Neg logit contributions
 name-0.322
 names-0.270
-0.256
 They-0.246
Circ-0.240
Token rest
Token ID1334
Feature activation-4.60e+00

Pos logit contributions
 name+0.495
 names+0.413
+0.402
 to+0.386
 They+0.376
Neg logit contributions
eling-0.950
ited-0.864
ering-0.845
i-0.829
ience-0.827
Token,
Token ID11
Feature activation+1.28e+01

Pos logit contributions
 name+0.184
+0.163
 names+0.160
 to+0.160
 They+0.152
Neg logit contributions
eling-0.335
ited-0.306
i-0.290
ering-0.288
ience-0.281
Token too
Token ID1165
Feature activation+1.18e+00

Pos logit contributions
eling+0.789
i+0.684
ited+0.673
ering+0.670
ie+0.652
Neg logit contributions
 name-0.623
 names-0.567
Circ-0.539
-0.539
 They-0.526
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
eling+0.088
ited+0.078
i+0.077
ering+0.077
ie+0.074
Neg logit contributions
 name-0.046
 names-0.040
-0.039
 They-0.037
 to-0.037
Token.
Token ID13
Feature activation-2.12e+00

Pos logit contributions
 name+0.132
 to+0.116
+0.112
 across+0.110
 names+0.109
Neg logit contributions
eling-0.303
ited-0.271
ering-0.259
heart-0.257
i-0.256
Token She
Token ID1375
Feature activation-1.45e+00

Pos logit contributions
 name+0.075
 names+0.071
 road+0.063
 fence+0.062
Circ+0.060
Neg logit contributions
heart-0.127
eling-0.124
i-0.123
ie-0.120
ience-0.119
Token smiled
Token ID13541
Feature activation+1.54e+01

Pos logit contributions
 name+0.073
+0.067
 to+0.065
 names+0.063
 They+0.059
Neg logit contributions
eling-0.098
ited-0.084
ering-0.083
heart-0.081
itive-0.079
Token and
Token ID290
Feature activation+3.42e+00

Pos logit contributions
eling+1.036
ited+0.907
i+0.903
ering+0.894
ie+0.866
Neg logit contributions
 name-0.689
 names-0.615
-0.590
 They-0.573
Circ-0.567
Token put
Token ID1234
Feature activation+2.67e-01

Pos logit contributions
eling+0.216
ited+0.185
ering+0.177
heart+0.173
itive+0.171
Neg logit contributions
 name-0.192
-0.179
 to-0.176
 They-0.170
 names-0.167
Token it
Token ID340
Feature activation+1.65e+00

Pos logit contributions
eling+0.018
ited+0.016
i+0.015
ering+0.015
ie+0.015
Neg logit contributions
 name-0.012
 names-0.011
-0.011
 They-0.010
 to-0.010
Token on
Token ID319
Feature activation-6.44e+00

Pos logit contributions
eling+0.121
ited+0.107
i+0.106
ering+0.106
ie+0.103
Neg logit contributions
 name-0.065
 names-0.057
-0.054
 They-0.053
Circ-0.052
Token her
Token ID607
Feature activation+1.08e+01

Pos logit contributions
 name+0.284
 names+0.254
+0.239
Circ+0.237
 They+0.235
Neg logit contributions
eling-0.435
i-0.382
ited-0.381
ering-0.377
ie-0.365
Token wrist
Token ID15980
Feature activation-2.91e+00

Pos logit contributions
eling+0.732
i+0.650
ited+0.641
ering+0.641
ie+0.621
Neg logit contributions
 name-0.450
 names-0.406
Circ-0.397
 They-0.397
 Bang-0.385
Token.
Token ID13
Feature activation-2.02e+00

Pos logit contributions
 name+0.114
 names+0.102
Circ+0.098
 They+0.095
+0.094
Neg logit contributions
eling-0.204
i-0.186
ering-0.183
ited-0.183
ie-0.180
Token
Token ID198
Feature activation-9.44e+00

Pos logit contributions
 road+0.066
 fence+0.063
 names+0.061
 name+0.060
Circ+0.055
Neg logit contributions
heart-0.111
i-0.109
mmm-0.106
ie-0.105
eling-0.103
Token wouldn
Token ID3636
Feature activation+1.14e+01

Pos logit contributions
eling+0.406
ited+0.353
ering+0.348
i+0.341
heart+0.333
Neg logit contributions
 name-0.358
 names-0.321
-0.317
 to-0.308
 They-0.304
Token't
Token ID470
Feature activation+1.42e+00

Pos logit contributions
eling+0.821
i+0.718
ited+0.711
ering+0.705
ie+0.692
Neg logit contributions
 name-0.455
 names-0.414
Circ-0.373
 They-0.371
 fence-0.365
Token get
Token ID651
Feature activation-1.15e+00

Pos logit contributions
eling+0.106
ited+0.095
ering+0.093
i+0.092
ience+0.090
Neg logit contributions
 name-0.056
-0.048
 to-0.046
 names-0.046
 They-0.044
Token punished
Token ID16851
Feature activation+1.17e+01

Pos logit contributions
 name+0.046
 names+0.040
+0.039
 to+0.039
 They+0.037
Neg logit contributions
eling-0.083
ited-0.074
i-0.072
ering-0.072
heart-0.070
Token anymore
Token ID7471
Feature activation+6.34e+00

Pos logit contributions
eling+0.848
ited+0.753
i+0.737
ering+0.732
ie+0.708
Neg logit contributions
 name-0.476
 names-0.417
-0.408
 to-0.392
 They-0.391
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
eling+0.488
ited+0.435
i+0.429
ering+0.427
ie+0.414
Neg logit contributions
 name-0.227
 names-0.196
-0.189
 They-0.181
 to-0.178
Token
Token ID220
Feature activation-2.60e+00

Pos logit contributions
 name+0.032
 names+0.030
Circ+0.025
 fence+0.024
 across+0.023
Neg logit contributions
eling-0.099
i-0.092
heart-0.091
ited-0.090
ience-0.089
Token
Token ID198
Feature activation-8.90e+00

Pos logit contributions
 fence+0.055
 names+0.055
 name+0.055
Circ+0.054
 road+0.052
Neg logit contributions
eling-0.196
ering-0.185
ieved-0.180
i-0.178
ie-0.177
Token
Token ID198
Feature activation-9.02e+00

Pos logit contributions
 name+0.222
 names+0.198
 fence+0.156
Circ+0.143
 to+0.142
Neg logit contributions
eling-0.751
ited-0.681
ering-0.673
i-0.668
ie-0.659
TokenThe
Token ID464
Feature activation+5.42e+00

Pos logit contributions
 name+0.449
 names+0.408
 They+0.372
 to+0.371
+0.371
Neg logit contributions
eling-0.568
ited-0.495
i-0.483
ering-0.482
ie-0.464
Token little
Token ID1310
Feature activation+5.07e-03

Pos logit contributions
eling+0.362
ited+0.316
i+0.313
ering+0.311
ie+0.300
Neg logit contributions
 name-0.249
 names-0.222
-0.214
 They-0.207
 to-0.204
Token.
Token ID13
Feature activation-2.77e+00

Pos logit contributions
 name+0.264
 names+0.232
Circ+0.212
 Bang+0.210
 They+0.208
Neg logit contributions
eling-0.527
i-0.472
ited-0.468
ering-0.459
ie-0.454
Token
Token ID198
Feature activation-1.02e+01

Pos logit contributions
 name+0.066
Circ+0.065
 names+0.065
 across+0.056
 corner+0.054
Neg logit contributions
eling-0.192
ie-0.190
i-0.186
heart-0.182
ering-0.180
Token
Token ID198
Feature activation-1.02e+01

Pos logit contributions
 name+0.243
 names+0.228
 fence+0.189
Circ+0.175
 road+0.170
Neg logit contributions
ie-0.817
eling-0.797
ering-0.751
i-0.736
heart-0.720
TokenThey
Token ID2990
Feature activation+1.84e+00

Pos logit contributions
 name+0.527
 names+0.476
+0.448
Circ+0.440
 They+0.439
Neg logit contributions
eling-0.623
ited-0.541
ering-0.532
i-0.531
ie-0.517
Token played
Token ID2826
Feature activation-1.66e+00

Pos logit contributions
eling+0.135
ering+0.120
heart+0.118
ited+0.114
itive+0.112
Neg logit contributions
 name-0.074
 names-0.068
 to-0.064
-0.064
 They-0.064
Token together
Token ID1978
Feature activation+2.54e+00

Pos logit contributions
 name+0.080
 names+0.072
Circ+0.068
 Bang+0.068
+0.068
Neg logit contributions
eling-0.102
ering-0.089
ited-0.086
i-0.086
ie-0.086
Token on
Token ID319
Feature activation-7.12e+00

Pos logit contributions
eling+0.167
i+0.145
ited+0.145
ering+0.144
ie+0.140
Neg logit contributions
 name-0.118
 names-0.105
-0.100
 They-0.098
Circ-0.098
Token the
Token ID262
Feature activation+2.73e+00

Pos logit contributions
 name+0.317
 names+0.280
+0.266
 They+0.258
Circ+0.256
Neg logit contributions
eling-0.490
ited-0.426
i-0.424
ering-0.420
ie-0.409
Token lawn
Token ID21133
Feature activation-9.24e+00

Pos logit contributions
eling+0.199
ering+0.177
ited+0.176
i+0.175
ie+0.175
Neg logit contributions
 name-0.100
 names-0.084
Circ-0.082
 Bang-0.080
 Where-0.080
Token,
Token ID11
Feature activation+1.15e+01

Pos logit contributions
 name+0.466
 names+0.415
Circ+0.396
+0.392
 Bang+0.388
Neg logit contributions
eling-0.560
i-0.493
ering-0.487
ited-0.487
ie-0.480
Token running
Token ID2491
Feature activation-3.18e+00

Pos logit contributions
eling+0.826
ited+0.726
i+0.715
heart+0.703
ering+0.702
Neg logit contributions
 name-0.489
 to-0.444
-0.438
 names-0.425
 They-0.424
Token.
Token ID13
Feature activation-2.35e+00

Pos logit contributions
 name+0.189
 names+0.163
Circ+0.152
+0.152
 Bang+0.149
Neg logit contributions
eling-0.358
i-0.321
ering-0.318
ie-0.316
ited-0.314
Token They
Token ID1119
Feature activation-7.75e+00

Pos logit contributions
Circ+0.072
 name+0.071
 across+0.068
 Bang+0.063
powers+0.063
Neg logit contributions
eling-0.147
i-0.142
ited-0.136
ie-0.134
heart-0.131
Token made
Token ID925
Feature activation+1.55e+00

Pos logit contributions
 name+0.357
 names+0.327
+0.324
 They+0.321
 to+0.310
Neg logit contributions
eling-0.527
ering-0.466
heart-0.454
ited-0.451
ience-0.433
Token a
Token ID257
Feature activation-3.86e+00

Pos logit contributions
eling+0.110
ering+0.098
ie+0.095
i+0.094
heart+0.090
Neg logit contributions
 name-0.058
Circ-0.050
 Along-0.050
 Where-0.049
 names-0.049
Token big
Token ID1263
Feature activation-1.01e+01

Pos logit contributions
 name+0.111
 names+0.105
 Where+0.104
Circ+0.102
 Some+0.100
Neg logit contributions
eling-0.285
ie-0.257
ering-0.257
i-0.253
ited-0.249
Token ig
Token ID45329
Feature activation+2.56e+00

Pos logit contributions
 name+0.336
 names+0.281
+0.269
 fence+0.261
 They+0.258
Neg logit contributions
eling-0.809
ited-0.727
i-0.714
ering-0.708
heart-0.684
Tokenloo
Token ID29680
Feature activation-2.63e+00

Pos logit contributions
eling+0.180
ited+0.158
ering+0.157
i+0.156
ie+0.153
Neg logit contributions
 name-0.110
 names-0.098
-0.092
 They-0.090
Circ-0.089
Token with
Token ID351
Feature activation-3.48e+00

Pos logit contributions
 name+0.113
 names+0.100
+0.096
 They+0.093
Circ+0.092
Neg logit contributions
eling-0.183
ited-0.161
i-0.160
ering-0.159
ie-0.154
Token blocks
Token ID7021
Feature activation-9.52e+00

Pos logit contributions
 name+0.123
+0.097
 names+0.097
Circ+0.095
 Along+0.090
Neg logit contributions
eling-0.258
i-0.231
ie-0.224
ering-0.223
ited-0.223
Token of
Token ID286
Feature activation-4.90e+00

Pos logit contributions
 name+0.451
 names+0.403
+0.386
Circ+0.377
 They+0.375
Neg logit contributions
eling-0.619
ited-0.538
i-0.538
ering-0.534
ie-0.516
Token snow
Token ID6729
Feature activation-5.13e+00

Pos logit contributions
 name+0.203
Circ+0.182
 Along+0.179
 names+0.174
 across+0.170
Neg logit contributions
eling-0.322
i-0.281
ie-0.279
heart-0.265
ering-0.262
Token and
Token ID290
Feature activation+3.18e+00

Pos logit contributions
eling+0.845
ited+0.743
i+0.728
ering+0.725
ie+0.709
Neg logit contributions
 name-0.501
-0.442
 names-0.437
 to-0.435
 They-0.422
Token ventured
Token ID44716
Feature activation-1.70e+00

Pos logit contributions
eling+0.206
ited+0.182
ering+0.174
i+0.173
ie+0.169
Neg logit contributions
 name-0.153
 to-0.147
-0.143
 They-0.136
 across-0.135
Token a
Token ID257
Feature activation-3.75e+00

Pos logit contributions
 name+0.083
 names+0.076
Circ+0.071
+0.069
 Bang+0.069
Neg logit contributions
eling-0.108
ering-0.094
i-0.094
ited-0.093
ie-0.090
Token few
Token ID1178
Feature activation+2.36e+00

Pos logit contributions
 name+0.157
 names+0.141
+0.131
 They+0.131
Circ+0.127
Neg logit contributions
eling-0.263
ited-0.233
ering-0.229
i-0.229
ie-0.221
Token steps
Token ID4831
Feature activation-1.08e+01

Pos logit contributions
eling+0.144
i+0.129
ited+0.126
ering+0.122
heart+0.120
Neg logit contributions
 name-0.109
 to-0.104
 names-0.101
 across-0.101
 corner-0.100
Token closer
Token ID5699
Feature activation-2.38e+00

Pos logit contributions
 name+0.402
 to+0.356
+0.351
 names+0.344
 across+0.335
Neg logit contributions
eling-0.814
ited-0.716
i-0.694
ering-0.688
heart-0.678
Token.
Token ID13
Feature activation-2.28e+00

Pos logit contributions
 name+0.075
 names+0.063
Circ+0.062
 Bang+0.058
+0.055
Neg logit contributions
eling-0.187
i-0.172
ering-0.172
ie-0.167
ited-0.166
Token Suddenly
Token ID24975
Feature activation-2.11e+00

Pos logit contributions
 names+0.073
 name+0.066
 road+0.059
 fence+0.056
Circ+0.055
Neg logit contributions
eling-0.148
mmm-0.138
i-0.135
ience-0.135
 reassured-0.133
Token,
Token ID11
Feature activation+1.20e+01

Pos logit contributions
 name+0.105
 to+0.103
+0.102
 They+0.095
 across+0.094
Neg logit contributions
eling-0.137
ited-0.124
ering-0.111
i-0.110
ieved-0.109
Token from
Token ID422
Feature activation+7.92e-01

Pos logit contributions
eling+0.856
ited+0.757
i+0.746
ering+0.741
ie+0.719
Neg logit contributions
 name-0.502
 names-0.439
-0.429
 They-0.413
 to-0.408
Token underneath
Token ID14638
Feature activation+3.07e-02

Pos logit contributions
eling+0.052
ited+0.045
i+0.045
ering+0.045
ie+0.043
Neg logit contributions
 name-0.037
 names-0.033
-0.032
 They-0.031
Circ-0.031
Token twins
Token ID20345
Feature activation+3.91e-01

Pos logit contributions
eling+0.137
ited+0.119
i+0.119
ering+0.117
ie+0.114
Neg logit contributions
 name-0.100
 names-0.090
-0.086
Circ-0.085
 They-0.084
Token.
Token ID13
Feature activation-2.70e+00

Pos logit contributions
eling+0.031
ited+0.028
i+0.027
ering+0.027
ie+0.026
Neg logit contributions
 name-0.013
 names-0.011
-0.011
 They-0.010
 to-0.010
Token They
Token ID1119
Feature activation-8.09e+00

Pos logit contributions
 name+0.048
 fence+0.037
 names+0.037
Circ+0.036
 Bang+0.035
Neg logit contributions
eling-0.226
i-0.215
ited-0.213
ie-0.207
heart-0.205
Token like
Token ID588
Feature activation+2.77e+00

Pos logit contributions
 name+0.361
 names+0.314
+0.302
 They+0.299
 to+0.283
Neg logit contributions
eling-0.580
ering-0.520
ited-0.512
i-0.503
heart-0.493
Token to
Token ID284
Feature activation-1.41e+01

Pos logit contributions
eling+0.152
ie+0.148
i+0.146
 laughed+0.135
 though+0.128
Neg logit contributions
tted-0.097
 name-0.088
 edge-0.087
 corner-0.077
 Along-0.072
Token play
Token ID711
Feature activation-1.24e+01

Pos logit contributions
 name+0.381
 names+0.324
Circ+0.320
+0.317
 Bang+0.306
Neg logit contributions
eling-1.141
i-1.029
ited-1.024
ie-1.015
ering-0.978
Token in
Token ID287
Feature activation-3.42e+00

Pos logit contributions
 name+0.552
 names+0.492
Circ+0.466
 Bang+0.456
+0.450
Neg logit contributions
eling-0.810
ering-0.713
ie-0.699
i-0.691
ited-0.671
Token the
Token ID262
Feature activation+2.79e+00

Pos logit contributions
 name+0.191
 names+0.169
 Bang+0.168
Circ+0.158
tr+0.157
Neg logit contributions
eling-0.191
ie-0.158
i-0.154
ering-0.148
ited-0.146
Token park
Token ID3952
Feature activation-1.23e+01

Pos logit contributions
eling+0.181
ie+0.166
i+0.157
ited+0.152
ering+0.149
Neg logit contributions
 name-0.104
 Bang-0.095
 names-0.090
Circ-0.089
 Some-0.087
Token with
Token ID351
Feature activation-3.62e+00

Pos logit contributions
 name+0.486
 names+0.416
Circ+0.403
 Bang+0.400
+0.384
Neg logit contributions
eling-0.877
i-0.788
ering-0.779
ited-0.775
ie-0.768
Token their
Token ID511
Feature activation+6.31e+00

Pos logit contributions
 name+0.168
 names+0.145
 Bang+0.142
+0.142
Circ+0.141
Neg logit contributions
eling-0.230
i-0.205
ie-0.197
ited-0.194
ering-0.192
TokenL
Token ID43
Feature activation+6.70e+00

Pos logit contributions
 name+0.382
+0.376
 They+0.352
 to+0.332
 names+0.328
Neg logit contributions
eling-0.736
i-0.654
ited-0.646
ering-0.631
heart-0.607
Tokenily
Token ID813
Feature activation+3.31e+00

Pos logit contributions
eling+0.460
ited+0.409
i+0.404
ering+0.392
ience+0.381
Neg logit contributions
 name-0.296
-0.269
 names-0.260
 They-0.258
 to-0.247
Token did
Token ID750
Feature activation+4.87e+00

Pos logit contributions
eling+0.246
ited+0.208
heart+0.206
ering+0.204
leness+0.201
Neg logit contributions
 name-0.135
 to-0.129
-0.128
 They-0.115
 names-0.113
Token not
Token ID407
Feature activation+1.91e+00

Pos logit contributions
eling+0.350
ited+0.337
heart+0.327
i+0.326
ience+0.316
Neg logit contributions
 name-0.197
-0.186
 to-0.181
 names-0.162
 They-0.161
Token listen
Token ID6004
Feature activation+3.20e+00

Pos logit contributions
eling+0.130
ited+0.113
ering+0.112
i+0.111
ience+0.109
Neg logit contributions
 name-0.091
 names-0.080
-0.076
 to-0.075
 They-0.073
Token to
Token ID284
Feature activation-1.39e+01

Pos logit contributions
eling+0.315
ited+0.288
i+0.284
ering+0.282
ie+0.276
Neg logit contributions
 name-0.048
 names-0.033
-0.031
 to-0.028
 They-0.026
Token Max
Token ID5436
Feature activation-3.15e+00

Pos logit contributions
 name+0.572
 names+0.502
+0.485
 They+0.469
 to+0.458
Neg logit contributions
eling-1.004
ited-0.891
i-0.876
ering-0.873
ie-0.843
Token.
Token ID13
Feature activation-2.43e+00

Pos logit contributions
 name+0.115
 names+0.099
+0.098
 to+0.095
 They+0.093
Neg logit contributions
eling-0.241
ited-0.215
i-0.210
ering-0.209
heart-0.204
Token She
Token ID1375
Feature activation-1.78e+00

Pos logit contributions
 name+0.094
Circ+0.089
 names+0.087
 fence+0.076
 corner+0.075
Neg logit contributions
eling-0.145
ering-0.130
i-0.130
heart-0.129
izing-0.123
Token thought
Token ID1807
Feature activation+6.33e+00

Pos logit contributions
 name+0.086
+0.075
 names+0.074
 to+0.073
 They+0.072
Neg logit contributions
eling-0.116
ering-0.103
heart-0.101
ited-0.098
leness-0.095
Token the
Token ID262
Feature activation+3.05e+00

Pos logit contributions
eling+0.444
ited+0.404
i+0.387
ering+0.380
heart+0.373
Neg logit contributions
 name-0.272
-0.244
 to-0.243
 names-0.239
 They-0.233
Token worked
Token ID3111
Feature activation+6.05e-01

Pos logit contributions
eling+0.128
ited+0.111
i+0.109
ering+0.109
heart+0.105
Neg logit contributions
 name-0.096
 names-0.087
-0.085
 to-0.084
 They-0.081
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
eling+0.047
ited+0.042
i+0.040
ering+0.038
heart+0.038
Neg logit contributions
 name-0.021
 to-0.021
-0.020
 across-0.019
 names-0.019
Token The
Token ID383
Feature activation-3.07e+00

Pos logit contributions
 name+0.092
 names+0.092
 fence+0.076
Circ+0.074
 road+0.070
Neg logit contributions
eling-0.198
heart-0.183
i-0.183
ience-0.178
ie-0.174
Token deer
Token ID20096
Feature activation-1.70e+00

Pos logit contributions
 name+0.121
 names+0.105
+0.101
 They+0.097
 to+0.095
Neg logit contributions
eling-0.228
ited-0.201
i-0.199
ering-0.198
ie-0.192
Token smiled
Token ID13541
Feature activation+1.46e+01

Pos logit contributions
 name+0.085
+0.079
 to+0.078
 names+0.073
 fence+0.072
Neg logit contributions
eling-0.106
ited-0.090
ering-0.089
heart-0.087
i-0.087
Token to
Token ID284
Feature activation-1.43e+01

Pos logit contributions
eling+1.045
ited+0.910
ering+0.883
i+0.882
ie+0.850
Neg logit contributions
 name-0.647
 names-0.560
-0.558
 to-0.543
 They-0.535
Token himself
Token ID2241
Feature activation+2.01e+00

Pos logit contributions
 name+0.620
+0.540
 names+0.536
 They+0.509
 to+0.496
Neg logit contributions
eling-1.006
ited-0.907
ering-0.886
i-0.876
ience-0.854
Token and
Token ID290
Feature activation+2.69e+00

Pos logit contributions
eling+0.138
ited+0.121
i+0.114
ering+0.114
heart+0.112
Neg logit contributions
 name-0.094
-0.083
 to-0.083
 names-0.080
 across-0.077
Token knew
Token ID2993
Feature activation+1.13e+01

Pos logit contributions
eling+0.165
ited+0.145
ering+0.134
itive+0.131
heart+0.129
Neg logit contributions
 name-0.143
 to-0.142
-0.141
 They-0.132
 across-0.126
Token that
Token ID326
Feature activation+3.47e+00

Pos logit contributions
eling+0.713
ited+0.639
i+0.598
ering+0.593
ie+0.574
Neg logit contributions
 name-0.573
-0.518
 names-0.511
 to-0.510
 They-0.487
Token he
Token ID339
Feature activation+3.95e+00

Pos logit contributions
eling+0.228
ited+0.204
i+0.194
ering+0.192
ie+0.185
Neg logit contributions
 name-0.165
-0.148
 names-0.148
 to-0.145
 They-0.140
Token to
Token ID284
Feature activation-1.42e+01

Pos logit contributions
eling+0.326
ited+0.282
ering+0.276
i+0.274
heart+0.261
Neg logit contributions
 name-0.277
-0.252
 names-0.251
 to-0.246
 They-0.240
Token do
Token ID466
Feature activation-5.37e+00

Pos logit contributions
 name+0.562
 names+0.464
+0.462
 to+0.425
 They+0.422
Neg logit contributions
eling-1.098
ited-0.978
ering-0.972
ience-0.962
i-0.949
Token.
Token ID13
Feature activation-2.77e+00

Pos logit contributions
 name+0.141
 to+0.134
+0.133
 names+0.114
 They+0.113
Neg logit contributions
eling-0.473
ited-0.434
i-0.415
ering-0.406
heart-0.404
Token
Token ID220
Feature activation-3.99e+00

Pos logit contributions
 name+0.108
Circ+0.097
 names+0.096
 road+0.089
 Bang+0.089
Neg logit contributions
eling-0.182
i-0.169
ie-0.167
ited-0.162
ering-0.161
Token
Token ID198
Feature activation-1.02e+01

Pos logit contributions
 road+0.091
 name+0.090
 fence+0.090
Circ+0.089
 names+0.085
Neg logit contributions
eling-0.297
ering-0.282
ie-0.278
i-0.276
ieved-0.275
Token
Token ID198
Feature activation-1.02e+01

Pos logit contributions
 name+0.215
 names+0.184
 fence+0.153
 road+0.133
Circ+0.130
Neg logit contributions
eling-0.865
ie-0.815
ering-0.800
i-0.789
ited-0.774
TokenFrom
Token ID4863
Feature activation+1.73e+01

Pos logit contributions
 name+0.508
+0.471
 names+0.458
 They+0.447
Circ+0.435
Neg logit contributions
eling-0.639
i-0.560
ering-0.544
ited-0.543
heart-0.520
Token that
Token ID326
Feature activation+3.55e+00

Pos logit contributions
eling+1.235
ited+1.114
heart+1.054
ering+1.021
itive+1.018
Neg logit contributions
 name-0.766
-0.725
 to-0.713
 names-0.624
 across-0.624
Token day
Token ID1110
Feature activation-3.09e+00

Pos logit contributions
eling+0.272
ited+0.243
i+0.241
ering+0.239
ie+0.234
Neg logit contributions
 name-0.127
 names-0.110
-0.102
 They-0.100
 Bang-0.099
Token on
Token ID319
Feature activation-7.11e+00

Pos logit contributions
 name+0.091
+0.080
 to+0.074
 They+0.073
 names+0.072
Neg logit contributions
eling-0.262
ited-0.236
i-0.230
ering-0.229
heart-0.226
Token,
Token ID11
Feature activation+1.15e+01

Pos logit contributions
 to+0.324
+0.320
 name+0.314
 They+0.303
 The+0.281
Neg logit contributions
eling-0.522
ited-0.481
ering-0.428
heart-0.413
ieved-0.407
Token in
Token ID287
Feature activation-3.21e+00

Pos logit contributions
eling+0.291
i+0.254
ering+0.252
ie+0.250
ited+0.242
Neg logit contributions
 name-0.255
 names-0.244
Circ-0.229
 They-0.228
 Bang-0.228
Token a
Token ID257
Feature activation-3.93e+00

Pos logit contributions
 name+0.125
 names+0.112
 Bang+0.104
Circ+0.100
 fence+0.100
Neg logit contributions
eling-0.231
i-0.200
ited-0.198
ie-0.193
heart-0.188
Token big
Token ID1263
Feature activation-1.01e+01

Pos logit contributions
 name+0.115
 names+0.103
 Bang+0.089
 They+0.084
Circ+0.080
Neg logit contributions
eling-0.318
ited-0.288
ie-0.287
i-0.285
ience-0.279
Token,
Token ID11
Feature activation+1.17e+01

Pos logit contributions
 name+0.289
+0.230
 names+0.230
 fence+0.221
 to+0.218
Neg logit contributions
eling-0.864
ited-0.770
ering-0.767
i-0.758
heart-0.732
Token dark
Token ID3223
Feature activation-3.97e+00

Pos logit contributions
eling+0.884
ited+0.782
i+0.775
ering+0.771
ie+0.747
Neg logit contributions
 name-0.442
 names-0.380
-0.367
 They-0.352
 to-0.346
Token house
Token ID2156
Feature activation-1.55e+01

Pos logit contributions
 name+0.132
 names+0.117
 Bang+0.103
Circ+0.101
+0.099
Neg logit contributions
eling-0.309
i-0.280
ited-0.278
ie-0.277
ering-0.271
Token.
Token ID13
Feature activation-2.58e+00

Pos logit contributions
 name+0.631
 names+0.559
 Bang+0.519
Circ+0.519
+0.498
Neg logit contributions
eling-1.098
i-0.976
ering-0.965
ited-0.965
ie-0.960
Token She
Token ID1375
Feature activation-1.94e+00

Pos logit contributions
 name+0.081
 names+0.075
Circ+0.074
 across+0.070
 Bang+0.067
Neg logit contributions
eling-0.173
heart-0.160
ie-0.158
i-0.158
ited-0.156
Token was
Token ID373
Feature activation+1.81e+00

Pos logit contributions
 name+0.089
+0.083
 to+0.078
 names+0.078
 They+0.077
Neg logit contributions
eling-0.133
ited-0.120
ering-0.119
heart-0.114
i-0.111
Token very
Token ID845
Feature activation+5.98e+00

Pos logit contributions
eling+0.120
i+0.103
ited+0.102
ering+0.101
ie+0.099
Neg logit contributions
 name-0.082
 names-0.075
-0.071
 Bang-0.070
Circ-0.070
Token lonely
Token ID21757
Feature activation+5.68e+00

Pos logit contributions
eling+0.415
ited+0.365
i+0.359
ering+0.358
ie+0.345
Neg logit contributions
 name-0.265
 names-0.231
-0.226
 They-0.218
 to-0.217