TOP ACTIVATIONS
MAX = 21.202

write . He wrote his name , he wrote shapes ,
how he could sign his name . Everybody was so happy
a little boy . His name was Joe and he wanted
She even gave it a name , " Bl ooming Y
cake with candles and his name on it . They had
someone . He heard his name . He crawled out from
his mom my calling his name . So he decided to
. She carefully wrote her name on the front of the
and continued to write her name . L
ball . She called his name and waved .
boxes . It had their name and address on it .
and continued to write her name . L
, how to write his name and how to draw a
a new name . His name was Dash . Dash said
said he had a new name . His name was Dash
inside and give it a name . One day it was
was so proud of its name that it did not want
proud . He practiced his name everywhere , even when the
She tried to write her name . L - I -
book . What is your name ?" " Tim

MIN ACTIVATIONS
MIN = -32.259

a little bird named Tweet ie . Tweet ie loved to
" I 'm sorry sweet ie , but it 's getting
little bird y named Tweet ie . Tweet ie was very
H oot ie led Tweet ie to a big tree with
a necklace for O ll ie with the stones . O
a little girl named Sus ie . Sus ie loved to
ily realized that the gen ie was right . She thanked
little girl called L izz ie . She liked to perform
One day Mill ie went out to play in
Bun B un and Tweet ie still weren 't talking .
ot ter named O ll ie . O ll ie lived
he lost his friend Brown ie . <|endoftext|> Once upon a
tried to scare the bird ie away , but the bird
by and scared the bird ie . The cat wanted to
together . <|endoftext|> T rix ie was very proud of her
there was a little bird ie named Blue . Blue loved
a band age on Tweet ie 's wing and said ,
You 're welcome , sweet ie . Do you want to
good friend ," said Tweet ie . " And one day
her charming doll , Ros ie . One day , Lily

INTERVAL 7.837 - 21.202
CONTAINS 12.240%

in two ! The family was very surprised
tied around it . J ill looked up at
gives Lily the prize . It is a long , soft
slide and climbed up the steps . At the top ,
. She had always wanted to go on an adventure .

INTERVAL -5.528 - 7.837
CONTAINS 67.871%

. It 's fun . Come on , I promise ."
uffy and his friends were afraid of the big dog .
learn . Tim my felt happy to know
inside . As he was flying back to the
" Can we eat it ?" Anna asked .

INTERVAL -18.893 - -5.528
CONTAINS 19.803%

S ara and Tom are happy . They say , "
cake . It was delicious ! Blue smiled and
things back . But he never did . The end .
" Can we pick one ?" " Sure , Lily ,"
" The cave is the best shelter ." So , it

INTERVAL -32.259 - -18.893
CONTAINS 0.085%

. Lily watched in amaz ement as the cob web grew
, Molly said to M illy , � � € �
scattered around it . Sad ie was scared and started crying
him some popcorn . Mat i was very thankful and took
a young girl named Sus ie . She was only three
Token write
Token ID3551
Feature activation+3.88e+00

Pos logit contributions
 name+0.589
 names+0.503
+0.480
 to+0.475
 They+0.466
Neg logit contributions
eling-1.031
ited-0.918
i-0.913
ering-0.907
ience-0.879
Token.
Token ID13
Feature activation+3.52e+00

Pos logit contributions
 name+0.150
 names+0.129
+0.122
 They+0.121
 to+0.120
Neg logit contributions
eling-0.286
ited-0.254
i-0.251
ering-0.247
ie-0.243
Token He
Token ID679
Feature activation+7.74e+00

Pos logit contributions
 fence+0.127
 names+0.117
Circ+0.115
 whole+0.112
 name+0.111
Neg logit contributions
mmm-0.151
eling-0.148
 then-0.144
 chuckled-0.141
 luckily-0.140
Token wrote
Token ID2630
Feature activation-3.12e+00

Pos logit contributions
 name+0.407
 names+0.363
+0.360
 to+0.359
 They+0.339
Neg logit contributions
eling-0.466
ited-0.404
heart-0.403
ering-0.401
i-0.394
Token his
Token ID465
Feature activation-4.55e+00

Pos logit contributions
eling+0.225
ited+0.199
i+0.196
ering+0.193
ie+0.190
Neg logit contributions
 name-0.126
 names-0.109
-0.104
 to-0.103
 They-0.102
Token name
Token ID1438
Feature activation+2.12e+01

Pos logit contributions
eling+0.394
i+0.360
ering+0.360
ie+0.351
ience+0.350
Neg logit contributions
 They-0.072
Circ-0.064
 fence-0.059
 Some-0.057
 road-0.052
Token,
Token ID11
Feature activation-1.14e+01

Pos logit contributions
 name+0.843
 names+0.724
 to+0.705
+0.704
 They+0.680
Neg logit contributions
eling-1.566
ited-1.394
i-1.351
ering-1.335
ie-1.299
Token he
Token ID339
Feature activation-3.20e+00

Pos logit contributions
eling+0.729
ited+0.621
i+0.620
ering+0.612
ie+0.594
Neg logit contributions
 name-0.535
 names-0.479
Circ-0.453
 fence-0.450
 They-0.448
Token wrote
Token ID2630
Feature activation-3.17e+00

Pos logit contributions
eling+0.196
ited+0.170
ering+0.166
i+0.163
heart+0.163
Neg logit contributions
 name-0.171
 names-0.152
-0.148
 to-0.146
 They-0.141
Token shapes
Token ID15268
Feature activation+6.57e+00

Pos logit contributions
eling+0.223
ited+0.197
i+0.193
ering+0.189
ie+0.186
Neg logit contributions
 name-0.135
 names-0.118
-0.113
 to-0.112
 They-0.110
Token,
Token ID11
Feature activation-1.14e+01

Pos logit contributions
 name+0.273
 names+0.237
+0.226
 They+0.224
 fence+0.221
Neg logit contributions
eling-0.458
ited-0.402
i-0.400
ering-0.395
ie-0.388
Token how
Token ID703
Feature activation-6.57e+00

Pos logit contributions
 name+0.072
 names+0.064
+0.061
 They+0.061
 fence+0.060
Neg logit contributions
eling-0.103
i-0.089
ited-0.088
ering-0.087
ie-0.086
Token he
Token ID339
Feature activation-3.42e+00

Pos logit contributions
eling+0.445
i+0.386
ited+0.384
ering+0.381
ie+0.374
Neg logit contributions
 name-0.290
 names-0.251
 fence-0.243
-0.236
 Bang-0.236
Token could
Token ID714
Feature activation+3.92e+00

Pos logit contributions
eling+0.227
ering+0.203
heart+0.201
ited+0.198
i+0.189
Neg logit contributions
 name-0.150
 to-0.132
-0.132
 names-0.130
 They-0.121
Token sign
Token ID1051
Feature activation+1.08e+01

Pos logit contributions
 name+0.144
 names+0.126
Circ+0.121
+0.118
 They+0.118
Neg logit contributions
eling-0.287
i-0.253
ited-0.252
ering-0.246
ie-0.246
Token his
Token ID465
Feature activation-4.62e+00

Pos logit contributions
 name+0.463
 names+0.405
+0.387
 They+0.384
 to+0.380
Neg logit contributions
eling-0.739
ited-0.648
i-0.641
ering-0.633
ie-0.620
Token name
Token ID1438
Feature activation+2.12e+01

Pos logit contributions
eling+0.390
i+0.366
ering+0.359
ience+0.351
ited+0.349
Neg logit contributions
Circ-0.076
 They-0.071
 Bang-0.069
 Some-0.067
 name-0.067
Token.
Token ID13
Feature activation+3.31e+00

Pos logit contributions
 name+0.816
 names+0.698
 to+0.685
+0.682
 They+0.658
Neg logit contributions
eling-1.589
ited-1.409
i-1.359
ering-1.354
ie-1.311
Token Everybody
Token ID23080
Feature activation+3.06e+00

Pos logit contributions
 fence+0.109
 name+0.106
 road+0.098
 names+0.097
Circ+0.096
Neg logit contributions
eling-0.206
i-0.192
heart-0.191
 chuckled-0.182
ieved-0.181
Token was
Token ID373
Feature activation-1.55e+00

Pos logit contributions
 name+0.137
 to+0.136
+0.131
 across+0.122
 They+0.120
Neg logit contributions
eling-0.205
ited-0.180
i-0.171
ering-0.169
ience-0.168
Token so
Token ID523
Feature activation-8.12e+00

Pos logit contributions
eling+0.096
ited+0.084
i+0.081
ering+0.080
ie+0.079
Neg logit contributions
 name-0.080
-0.070
 names-0.070
 to-0.069
 They-0.068
Token happy
Token ID3772
Feature activation-6.37e+00

Pos logit contributions
eling+0.516
i+0.445
ited+0.444
ie+0.433
ering+0.433
Neg logit contributions
 name-0.417
-0.370
 names-0.361
 They-0.356
 to-0.351
Token a
Token ID257
Feature activation+4.80e+00

Pos logit contributions
 name+0.272
 names+0.236
+0.224
 They+0.223
 to+0.220
Neg logit contributions
eling-0.460
ited-0.403
i-0.401
ering-0.396
ie-0.388
Token little
Token ID1310
Feature activation+1.60e+00

Pos logit contributions
 name+0.168
 names+0.161
 They+0.153
Circ+0.144
 Along+0.144
Neg logit contributions
eling-0.327
ited-0.296
i-0.291
ering-0.286
ie-0.286
Token boy
Token ID2933
Feature activation+1.94e+00

Pos logit contributions
 name+0.061
 names+0.055
 They+0.050
+0.050
Circ+0.049
Neg logit contributions
eling-0.113
i-0.102
ited-0.102
ie-0.099
ering-0.099
Token.
Token ID13
Feature activation+3.53e+00

Pos logit contributions
 name+0.072
 names+0.061
+0.058
 They+0.057
 to+0.057
Neg logit contributions
eling-0.145
ited-0.129
i-0.127
ering-0.126
ie-0.124
Token His
Token ID2399
Feature activation-4.22e-01

Pos logit contributions
 name+0.089
 names+0.087
 road+0.079
Circ+0.079
 fence+0.078
Neg logit contributions
eling-0.238
 chuckled-0.236
i-0.228
ience-0.224
heart-0.220
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.032
i+0.031
ering+0.030
ited+0.029
ie+0.029
Neg logit contributions
Circ-0.009
 Some-0.008
 They-0.008
 stairs-0.007
 fence-0.007
Token was
Token ID373
Feature activation-1.49e+00

Pos logit contributions
 name+0.714
+0.647
 to+0.596
 across+0.546
 They+0.536
Neg logit contributions
eling-1.691
ited-1.479
ering-1.472
i-1.458
heart-1.440
Token Joe
Token ID5689
Feature activation-1.40e+00

Pos logit contributions
eling+0.096
ited+0.083
i+0.082
ering+0.081
ie+0.079
Neg logit contributions
 name-0.070
 names-0.062
-0.060
 They-0.059
 to-0.059
Token and
Token ID290
Feature activation-2.49e+00

Pos logit contributions
eling+0.114
ited+0.103
ering+0.099
i+0.098
heart+0.096
Neg logit contributions
 name-0.044
-0.036
 to-0.036
 names-0.035
 They-0.034
Token he
Token ID339
Feature activation-3.38e+00

Pos logit contributions
eling+0.180
ited+0.167
ering+0.156
i+0.152
ience+0.150
Neg logit contributions
 name-0.105
-0.094
 to-0.092
 They-0.091
 names-0.086
Token wanted
Token ID2227
Feature activation-7.02e-01

Pos logit contributions
eling+0.234
ited+0.203
ering+0.200
i+0.193
heart+0.192
Neg logit contributions
 name-0.159
-0.140
 to-0.135
 names-0.133
 They-0.131
Token She
Token ID1375
Feature activation+2.78e+00

Pos logit contributions
 road+0.111
 names+0.106
 fence+0.105
 name+0.104
 yard+0.092
Neg logit contributions
eling-0.214
i-0.211
heart-0.206
ie-0.205
ience-0.205
Token even
Token ID772
Feature activation-8.86e+00

Pos logit contributions
 name+0.129
 to+0.123
+0.119
 names+0.115
 across+0.109
Neg logit contributions
eling-0.184
ited-0.157
ering-0.156
heart-0.156
itive-0.145
Token gave
Token ID2921
Feature activation-5.21e+00

Pos logit contributions
eling+0.573
mmm+0.494
ering+0.488
heart+0.473
itive+0.472
Neg logit contributions
 to-0.383
-0.356
 name-0.356
 across-0.344
 corner-0.338
Token it
Token ID340
Feature activation-1.62e+00

Pos logit contributions
eling+0.333
ited+0.288
i+0.286
ering+0.281
ie+0.276
Neg logit contributions
 name-0.251
 names-0.223
-0.214
 They-0.213
 to-0.210
Token a
Token ID257
Feature activation+4.71e+00

Pos logit contributions
eling+0.130
i+0.124
ie+0.123
ited+0.120
heart+0.118
Neg logit contributions
 name-0.036
 road-0.029
 They-0.028
 fence-0.028
 Bang-0.028
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
 They+0.161
 names+0.155
 name+0.154
Circ+0.153
 Some+0.147
Neg logit contributions
eling-0.297
ited-0.284
ience-0.270
i-0.266
ie-0.265
Token,
Token ID11
Feature activation-1.14e+01

Pos logit contributions
 name+0.874
 names+0.751
 to+0.742
+0.734
 They+0.708
Neg logit contributions
eling-1.528
ited-1.331
ering-1.302
i-1.298
ie-1.257
Token "
Token ID366
Feature activation+8.03e+00

Pos logit contributions
eling+0.768
ited+0.671
i+0.662
ering+0.656
ie+0.642
Neg logit contributions
 name-0.521
 names-0.456
-0.445
 to-0.439
 They-0.439
TokenBl
Token ID3629
Feature activation-3.49e-01

Pos logit contributions
 name+0.330
 names+0.286
 fence+0.284
 road+0.276
 They+0.275
Neg logit contributions
eling-0.538
ited-0.507
i-0.490
ie-0.473
heart-0.467
Tokenooming
Token ID30602
Feature activation-1.13e+01

Pos logit contributions
eling+0.019
ited+0.016
ering+0.015
heart+0.015
ie+0.014
Neg logit contributions
 name-0.021
 names-0.019
-0.018
 to-0.018
 They-0.017
Token Y
Token ID575
Feature activation+2.88e+00

Pos logit contributions
eling+0.743
ited+0.646
i+0.639
ering+0.630
ie+0.616
Neg logit contributions
 name-0.527
 names-0.466
-0.447
 They-0.443
 to-0.439
Token cake
Token ID12187
Feature activation-2.02e+00

Pos logit contributions
 name+0.448
 names+0.371
 fence+0.356
+0.352
 to+0.351
Neg logit contributions
eling-0.840
i-0.732
ited-0.728
ering-0.727
ie-0.702
Token with
Token ID351
Feature activation+4.25e+00

Pos logit contributions
eling+0.138
ited+0.120
i+0.117
ering+0.116
ience+0.113
Neg logit contributions
 name-0.091
 names-0.079
 to-0.078
-0.077
 They-0.076
Token candles
Token ID32268
Feature activation-2.58e+00

Pos logit contributions
 name+0.165
 names+0.140
+0.135
 They+0.133
 fence+0.133
Neg logit contributions
eling-0.306
i-0.270
ited-0.270
ering-0.264
ie-0.261
Token and
Token ID290
Feature activation-2.50e+00

Pos logit contributions
eling+0.183
ited+0.159
i+0.154
ering+0.154
ience+0.150
Neg logit contributions
 name-0.110
 names-0.095
 to-0.094
-0.092
 They-0.091
Token his
Token ID465
Feature activation-4.59e+00

Pos logit contributions
eling+0.194
ited+0.173
i+0.171
ering+0.170
ie+0.166
Neg logit contributions
 name-0.086
 names-0.072
-0.068
 They-0.067
 to-0.066
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.320
i+0.282
ited+0.276
ering+0.273
ie+0.269
Neg logit contributions
 name-0.172
Circ-0.158
 They-0.155
 names-0.152
-0.152
Token on
Token ID319
Feature activation+7.53e+00

Pos logit contributions
 name+0.773
 names+0.661
+0.630
 They+0.621
 to+0.619
Neg logit contributions
eling-1.598
ited-1.419
i-1.396
ering-1.384
ie-1.352
Token it
Token ID340
Feature activation-1.40e+00

Pos logit contributions
 name+0.394
 names+0.357
Circ+0.350
 fence+0.344
 across+0.337
Neg logit contributions
eling-0.427
i-0.366
ited-0.351
ering-0.351
ie-0.343
Token.
Token ID13
Feature activation+3.28e+00

Pos logit contributions
eling+0.099
i+0.090
ie+0.088
ering+0.087
ited+0.087
Neg logit contributions
 name-0.052
 names-0.045
Circ-0.044
 fence-0.043
 Bang-0.042
Token They
Token ID1119
Feature activation+8.65e+00

Pos logit contributions
 name+0.120
 fence+0.117
 across+0.114
Circ+0.113
 names+0.112
Neg logit contributions
eling-0.184
mmm-0.171
i-0.166
ited-0.160
mm-0.158
Token had
Token ID550
Feature activation-1.63e+00

Pos logit contributions
 name+0.424
 names+0.385
+0.371
 They+0.367
 to+0.361
Neg logit contributions
eling-0.551
ering-0.515
heart-0.494
ited-0.474
izing-0.472
Token someone
Token ID2130
Feature activation+7.89e+00

Pos logit contributions
 name+0.752
 names+0.673
Circ+0.647
 fence+0.646
 Bang+0.632
Neg logit contributions
eling-0.832
i-0.700
ie-0.686
ering-0.678
ited-0.678
Token.
Token ID13
Feature activation+3.37e+00

Pos logit contributions
 name+0.234
 to+0.205
+0.195
 names+0.187
 They+0.183
Neg logit contributions
eling-0.639
ited-0.589
i-0.564
ience-0.556
ering-0.553
Token He
Token ID679
Feature activation+7.58e+00

Pos logit contributions
 name+0.088
Circ+0.083
 names+0.078
 whole+0.078
 corner+0.074
Neg logit contributions
eling-0.216
i-0.200
heart-0.198
 chuckled-0.194
ering-0.193
Token heard
Token ID2982
Feature activation+1.22e+00

Pos logit contributions
 name+0.349
+0.318
 to+0.305
 names+0.300
 They+0.296
Neg logit contributions
eling-0.523
ited-0.463
ering-0.448
heart-0.440
i-0.437
Token his
Token ID465
Feature activation-4.73e+00

Pos logit contributions
 name+0.057
 names+0.049
 fence+0.048
Circ+0.046
 across+0.046
Neg logit contributions
eling-0.077
ie-0.067
i-0.066
ering-0.065
heart-0.064
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.279
i+0.254
ering+0.250
ie+0.243
ise+0.232
Neg logit contributions
Circ-0.194
 Along-0.179
 name-0.179
 Some-0.179
Let-0.176
Token.
Token ID13
Feature activation+3.41e+00

Pos logit contributions
 name+0.944
 names+0.820
+0.812
 to+0.809
 They+0.792
Neg logit contributions
eling-1.441
ited-1.263
i-1.241
ering-1.214
ience-1.188
Token He
Token ID679
Feature activation+7.61e+00

Pos logit contributions
 names+0.105
 name+0.100
Circ+0.099
ashes+0.099
 whole+0.099
Neg logit contributions
eling-0.175
heart-0.172
i-0.166
 chuckled-0.164
ie-0.163
Token crawled
Token ID45668
Feature activation+2.33e+00

Pos logit contributions
 name+0.361
+0.323
 names+0.310
 to+0.308
 They+0.304
Neg logit contributions
eling-0.520
ited-0.459
ering-0.444
i-0.432
heart-0.430
Token out
Token ID503
Feature activation+9.03e+00

Pos logit contributions
 name+0.098
 names+0.087
 road+0.081
 fence+0.081
 Bang+0.079
Neg logit contributions
eling-0.141
ie-0.133
i-0.132
ering-0.130
ited-0.127
Token from
Token ID422
Feature activation+1.27e-01

Pos logit contributions
 name+0.438
 names+0.388
 fence+0.361
Circ+0.354
 Bang+0.354
Neg logit contributions
eling-0.556
ie-0.491
i-0.484
ited-0.480
ering-0.474
Token his
Token ID465
Feature activation-4.81e+00

Pos logit contributions
 name+0.214
 names+0.185
 They+0.179
+0.178
Circ+0.177
Neg logit contributions
eling-0.392
i-0.339
ited-0.336
ering-0.330
ie-0.325
Token mom
Token ID1995
Feature activation+5.39e+00

Pos logit contributions
eling+0.346
i+0.315
ited+0.305
ering+0.304
ie+0.303
Neg logit contributions
 name-0.166
Circ-0.157
 They-0.156
 Bang-0.149
-0.148
Tokenmy
Token ID1820
Feature activation-6.45e+00

Pos logit contributions
 name+0.254
 to+0.234
+0.229
 names+0.220
 across+0.216
Neg logit contributions
eling-0.344
ited-0.295
ering-0.284
ience-0.283
i-0.280
Token calling
Token ID4585
Feature activation-8.05e-01

Pos logit contributions
eling+0.427
ited+0.369
ering+0.356
i+0.353
ience+0.343
Neg logit contributions
 name-0.296
 to-0.263
 names-0.257
-0.257
 They-0.248
Token his
Token ID465
Feature activation-4.80e+00

Pos logit contributions
eling+0.049
i+0.043
ering+0.043
ie+0.042
ited+0.041
Neg logit contributions
 name-0.038
 names-0.034
Circ-0.033
 fence-0.033
 road-0.032
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.439
i+0.433
ering+0.411
ience+0.407
ie+0.405
Neg logit contributions
Circ-0.034
 Some-0.025
 They-0.023
 road-0.020
 Bang-0.020
Token.
Token ID13
Feature activation+3.27e+00

Pos logit contributions
 name+0.852
 names+0.736
+0.708
 to+0.703
 They+0.696
Neg logit contributions
eling-1.516
ited-1.332
i-1.311
ering-1.298
ie-1.266
Token So
Token ID1406
Feature activation-3.70e+00

Pos logit contributions
 name+0.090
 names+0.085
 fence+0.073
 road+0.071
Circ+0.069
Neg logit contributions
eling-0.238
i-0.224
heart-0.221
ience-0.211
 chuckled-0.211
Token he
Token ID339
Feature activation-3.52e+00

Pos logit contributions
eling+0.279
ited+0.256
ering+0.234
i+0.229
ieved+0.224
Neg logit contributions
 name-0.148
 to-0.143
-0.137
 They-0.135
 He-0.125
Token decided
Token ID3066
Feature activation-7.07e+00

Pos logit contributions
eling+0.227
ited+0.193
ering+0.192
heart+0.181
i+0.179
Neg logit contributions
 name-0.176
 to-0.161
-0.152
 names-0.150
 They-0.145
Token to
Token ID284
Feature activation+1.41e+01

Pos logit contributions
eling+0.698
i+0.636
ited+0.635
ering+0.631
ie+0.620
Neg logit contributions
 name-0.092
 names-0.054
 They-0.039
-0.039
 fence-0.036
Token.
Token ID13
Feature activation+3.52e+00

Pos logit contributions
 name+0.084
 to+0.073
 names+0.070
+0.069
 They+0.067
Neg logit contributions
eling-0.201
ited-0.177
i-0.175
ering-0.172
ie-0.167
Token She
Token ID1375
Feature activation+2.81e+00

Pos logit contributions
 name+0.117
 road+0.112
 fence+0.111
Circ+0.110
 names+0.104
Neg logit contributions
eling-0.228
heart-0.214
i-0.211
ited-0.204
ie-0.202
Token carefully
Token ID7773
Feature activation-2.78e+00

Pos logit contributions
 name+0.141
 to+0.132
+0.130
 They+0.121
 names+0.120
Neg logit contributions
eling-0.180
ering-0.158
ited-0.153
heart-0.148
i-0.145
Token wrote
Token ID2630
Feature activation-3.31e+00

Pos logit contributions
eling+0.179
ering+0.154
heart+0.154
ited+0.152
i+0.152
Neg logit contributions
 name-0.130
 to-0.123
 names-0.117
-0.115
 They-0.110
Token her
Token ID607
Feature activation-9.78e+00

Pos logit contributions
eling+0.238
ited+0.212
i+0.206
ering+0.200
ie+0.198
Neg logit contributions
 name-0.140
 names-0.123
 to-0.120
-0.117
 across-0.112
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.862
i+0.781
ering+0.779
ited+0.762
ience+0.761
Neg logit contributions
 They-0.144
Circ-0.140
 name-0.138
 fence-0.126
 road-0.126
Token on
Token ID319
Feature activation+7.49e+00

Pos logit contributions
 name+0.941
 to+0.828
 names+0.815
+0.805
 across+0.773
Neg logit contributions
eling-1.484
ited-1.296
i-1.240
ering-1.237
ie-1.186
Token the
Token ID262
Feature activation-2.79e+00

Pos logit contributions
 name+0.321
 names+0.281
+0.275
 to+0.273
 They+0.267
Neg logit contributions
eling-0.523
ited-0.466
i-0.451
ering-0.450
ie-0.436
Token front
Token ID2166
Feature activation+4.71e+00

Pos logit contributions
eling+0.194
ering+0.174
i+0.172
ie+0.167
ited+0.164
Neg logit contributions
 name-0.095
Circ-0.093
 They-0.090
-0.089
 Some-0.086
Token of
Token ID286
Feature activation+5.62e+00

Pos logit contributions
 name+0.190
 names+0.167
Circ+0.162
 They+0.154
 Bang+0.153
Neg logit contributions
eling-0.313
i-0.286
ie-0.284
ited-0.281
ering-0.279
Token the
Token ID262
Feature activation-2.80e+00

Pos logit contributions
 name+0.256
 names+0.225
+0.216
 They+0.215
 to+0.211
Neg logit contributions
eling-0.374
ited-0.324
i-0.322
ering-0.317
ie-0.311
Token and
Token ID290
Feature activation-2.50e+00

Pos logit contributions
eling+0.975
ited+0.851
i+0.838
ering+0.830
ie+0.810
Neg logit contributions
 name-0.632
 names-0.553
-0.533
 They-0.526
 to-0.526
Token continued
Token ID3767
Feature activation-7.93e+00

Pos logit contributions
eling+0.158
ited+0.136
ering+0.128
heart+0.125
ie+0.125
Neg logit contributions
 name-0.132
 to-0.130
-0.128
 They-0.123
 across-0.112
Token to
Token ID284
Feature activation+1.43e+01

Pos logit contributions
eling+0.669
i+0.623
ering+0.615
ie+0.613
ited+0.595
Neg logit contributions
 name-0.183
 names-0.148
 fence-0.134
Circ-0.126
 Bang-0.124
Token write
Token ID3551
Feature activation+3.73e+00

Pos logit contributions
 name+0.523
 names+0.437
+0.413
 to+0.411
 They+0.404
Neg logit contributions
eling-1.090
ited-0.972
i-0.961
ering-0.960
ie-0.928
Token her
Token ID607
Feature activation-9.63e+00

Pos logit contributions
 name+0.158
 names+0.137
 to+0.132
+0.131
 They+0.127
Neg logit contributions
eling-0.267
ited-0.237
i-0.230
ering-0.224
ie-0.221
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.794
i+0.729
ering+0.719
ited+0.719
ie+0.710
Neg logit contributions
 They-0.192
Circ-0.184
 fence-0.178
 name-0.175
 road-0.174
Token.
Token ID13
Feature activation+3.40e+00

Pos logit contributions
 name+0.777
 names+0.660
+0.631
 to+0.629
 They+0.616
Neg logit contributions
eling-1.609
ited-1.425
i-1.395
ering-1.384
ie-1.348
Token
Token ID220
Feature activation+3.20e+00

Pos logit contributions
 name+0.120
 road+0.117
 fence+0.115
 names+0.111
Circ+0.107
Neg logit contributions
eling-0.206
i-0.198
heart-0.190
ie-0.189
ited-0.183
Token
Token ID198
Feature activation+1.01e+01

Pos logit contributions
 road+0.078
 fence+0.077
 name+0.069
Circ+0.068
 names+0.067
Neg logit contributions
eling-0.238
i-0.221
ering-0.220
ie-0.219
ieved-0.212
Token
Token ID198
Feature activation+1.01e+01

Pos logit contributions
 name+0.200
 names+0.169
 fence+0.148
 road+0.138
Circ+0.113
Neg logit contributions
eling-0.884
ie-0.796
ering-0.792
i-0.790
ited-0.786
TokenL
Token ID43
Feature activation-6.14e+00

Pos logit contributions
 name+0.437
 names+0.382
+0.369
 They+0.364
 to+0.359
Neg logit contributions
eling-0.697
ited-0.610
i-0.606
ering-0.597
ie-0.585
Token ball
Token ID2613
Feature activation+1.44e+01

Pos logit contributions
eling+0.297
i+0.259
ited+0.257
ering+0.253
ie+0.251
Neg logit contributions
 name-0.207
 names-0.183
 They-0.179
-0.177
Circ-0.175
Token.
Token ID13
Feature activation+3.21e+00

Pos logit contributions
 name+0.542
 names+0.464
+0.442
 They+0.437
 to+0.435
Neg logit contributions
eling-1.073
ited-0.949
i-0.939
ering-0.928
ie-0.908
Token She
Token ID1375
Feature activation+2.70e+00

Pos logit contributions
 name+0.110
Circ+0.106
 fence+0.105
 names+0.102
 whole+0.099
Neg logit contributions
eling-0.168
ited-0.156
mmm-0.153
ieved-0.152
 chuckled-0.151
Token called
Token ID1444
Feature activation-4.11e+00

Pos logit contributions
 name+0.129
 to+0.120
+0.117
 names+0.114
 They+0.110
Neg logit contributions
eling-0.176
ited-0.154
heart-0.151
ering-0.149
i-0.145
Token his
Token ID465
Feature activation-4.55e+00

Pos logit contributions
eling+0.230
ie+0.208
i+0.204
ering+0.203
heart+0.195
Neg logit contributions
 name-0.194
 names-0.175
Circ-0.175
 fence-0.170
 road-0.168
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.269
i+0.250
ie+0.239
ience+0.235
ering+0.233
Neg logit contributions
Circ-0.190
 name-0.186
 They-0.185
-0.179
 Bang-0.178
Token and
Token ID290
Feature activation-2.51e+00

Pos logit contributions
 name+0.969
 to+0.856
 names+0.852
+0.850
 They+0.824
Neg logit contributions
eling-1.437
ited-1.237
i-1.195
ering-1.195
ie-1.149
Token waved
Token ID26834
Feature activation-4.73e+00

Pos logit contributions
eling+0.146
ited+0.127
ering+0.121
i+0.120
ie+0.117
Neg logit contributions
 name-0.142
 to-0.130
 names-0.127
-0.127
 They-0.126
Token.
Token ID13
Feature activation+3.20e+00

Pos logit contributions
eling+0.332
i+0.295
ited+0.293
ering+0.288
ie+0.284
Neg logit contributions
 name-0.192
 names-0.166
 fence-0.158
-0.158
 They-0.157
Token
Token ID198
Feature activation+1.01e+01

Pos logit contributions
 name+0.153
 fence+0.150
 corner+0.148
Circ+0.147
 names+0.143
Neg logit contributions
eling-0.135
mmm-0.124
i-0.122
ering-0.121
 chuckled-0.118
Token
Token ID198
Feature activation+1.00e+01

Pos logit contributions
 name+0.202
 names+0.173
 fence+0.134
Circ+0.124
 road+0.118
Neg logit contributions
eling-0.872
ie-0.799
ering-0.794
i-0.790
ited-0.769
Token boxes
Token ID10559
Feature activation+9.17e+00

Pos logit contributions
eling+0.202
i+0.179
ering+0.178
ited+0.176
ie+0.173
Neg logit contributions
 name-0.099
Circ-0.089
 They-0.085
-0.085
 names-0.083
Token.
Token ID13
Feature activation+3.36e+00

Pos logit contributions
 name+0.312
 names+0.261
+0.252
 to+0.250
 They+0.248
Neg logit contributions
eling-0.717
ited-0.639
i-0.628
ering-0.620
ie-0.610
Token It
Token ID632
Feature activation+9.58e+00

Pos logit contributions
 road+0.091
 fence+0.087
 whole+0.087
 yard+0.085
 names+0.085
Neg logit contributions
 chuckled-0.185
mmm-0.183
eling-0.175
 reassured-0.172
i-0.170
Token had
Token ID550
Feature activation-1.72e+00

Pos logit contributions
 name+0.475
 names+0.419
+0.417
 They+0.408
 to+0.401
Neg logit contributions
eling-0.634
ited-0.556
ering-0.540
i-0.524
heart-0.524
Token their
Token ID511
Feature activation-5.93e+00

Pos logit contributions
eling+0.125
i+0.112
ited+0.107
ering+0.106
ie+0.105
Neg logit contributions
 name-0.059
Circ-0.050
-0.048
 fence-0.047
 Along-0.046
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.451
i+0.419
ering+0.399
ited+0.395
ie+0.394
Neg logit contributions
Circ-0.151
 name-0.138
-0.136
 Bang-0.135
 They-0.127
Token and
Token ID290
Feature activation-2.52e+00

Pos logit contributions
 name+0.996
 names+0.876
+0.857
 to+0.850
 They+0.842
Neg logit contributions
eling-1.398
ited-1.225
i-1.179
ering-1.174
ie-1.142
Token address
Token ID2209
Feature activation+9.38e+00

Pos logit contributions
eling+0.184
ited+0.165
ering+0.160
i+0.160
ie+0.156
Neg logit contributions
 name-0.102
 names-0.088
 They-0.084
 to-0.083
-0.083
Token on
Token ID319
Feature activation+7.53e+00

Pos logit contributions
 name+0.358
 names+0.308
+0.307
 to+0.306
 They+0.294
Neg logit contributions
eling-0.709
ited-0.632
i-0.607
ering-0.599
heart-0.588
Token it
Token ID340
Feature activation-1.50e+00

Pos logit contributions
 name+0.382
 names+0.340
Circ+0.332
+0.322
 Bang+0.320
Neg logit contributions
eling-0.443
i-0.384
ering-0.372
ited-0.366
ie-0.366
Token.
Token ID13
Feature activation+3.42e+00

Pos logit contributions
eling+0.107
i+0.096
ering+0.094
ited+0.094
ie+0.093
Neg logit contributions
 name-0.057
 names-0.049
Circ-0.048
 Bang-0.046
 fence-0.045
Token and
Token ID290
Feature activation-2.41e+00

Pos logit contributions
eling+0.975
ited+0.851
i+0.839
ering+0.830
ie+0.810
Neg logit contributions
 name-0.629
 names-0.550
-0.530
 They-0.524
 to-0.523
Token continued
Token ID3767
Feature activation-7.90e+00

Pos logit contributions
eling+0.159
ited+0.138
ering+0.130
heart+0.128
ience+0.125
Neg logit contributions
 name-0.122
 to-0.122
-0.119
 They-0.115
 names-0.104
Token to
Token ID284
Feature activation+1.43e+01

Pos logit contributions
eling+0.666
i+0.622
ie+0.614
ering+0.613
ited+0.592
Neg logit contributions
 name-0.181
 names-0.147
 fence-0.134
Circ-0.121
 Bang-0.119
Token write
Token ID3551
Feature activation+3.74e+00

Pos logit contributions
 name+0.530
 names+0.447
+0.423
 to+0.419
 They+0.415
Neg logit contributions
eling-1.082
ited-0.963
i-0.952
ering-0.948
ie-0.919
Token her
Token ID607
Feature activation-9.64e+00

Pos logit contributions
 name+0.155
 names+0.135
 to+0.129
+0.129
 They+0.125
Neg logit contributions
eling-0.271
ited-0.240
i-0.234
ering-0.228
ie-0.225
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.779
i+0.717
ering+0.706
ited+0.703
ie+0.699
Neg logit contributions
 They-0.201
Circ-0.193
 fence-0.188
 road-0.186
 name-0.181
Token.
Token ID13
Feature activation+3.44e+00

Pos logit contributions
 name+0.772
 names+0.656
+0.626
 to+0.622
 They+0.612
Neg logit contributions
eling-1.608
ited-1.425
i-1.399
ering-1.387
ie-1.353
Token
Token ID220
Feature activation+3.22e+00

Pos logit contributions
 name+0.134
 road+0.127
 fence+0.126
Circ+0.125
 names+0.124
Neg logit contributions
eling-0.195
i-0.186
ie-0.180
ieved-0.174
heart-0.173
Token
Token ID198
Feature activation+1.01e+01

Pos logit contributions
 fence+0.081
 road+0.080
 name+0.072
Circ+0.071
 names+0.071
Neg logit contributions
eling-0.234
ie-0.219
ering-0.218
i-0.217
ieved-0.212
Token
Token ID198
Feature activation+1.01e+01

Pos logit contributions
 name+0.225
 names+0.201
 fence+0.178
 road+0.162
 yard+0.140
Neg logit contributions
eling-0.828
ie-0.765
ering-0.748
i-0.742
ited-0.737
TokenL
Token ID43
Feature activation-6.17e+00

Pos logit contributions
 name+0.453
+0.400
 names+0.396
 They+0.388
 to+0.379
Neg logit contributions
eling-0.677
i-0.593
ited-0.585
ering-0.579
ie-0.564
Token,
Token ID11
Feature activation-1.15e+01

Pos logit contributions
 name+0.028
 names+0.024
Circ+0.023
 fence+0.023
 Bang+0.023
Neg logit contributions
eling-0.045
i-0.040
ering-0.039
ie-0.039
ited-0.039
Token how
Token ID703
Feature activation-6.51e+00

Pos logit contributions
eling+0.735
ited+0.634
i+0.629
ering+0.623
ie+0.605
Neg logit contributions
 name-0.548
 names-0.484
-0.463
 They-0.459
 fence-0.456
Token to
Token ID284
Feature activation+1.42e+01

Pos logit contributions
eling+0.486
ering+0.426
i+0.421
ie+0.418
ience+0.412
Neg logit contributions
 name-0.228
 fence-0.189
 names-0.188
 Bang-0.181
Circ-0.178
Token write
Token ID3551
Feature activation+3.71e+00

Pos logit contributions
 name+0.585
 names+0.508
+0.484
 They+0.477
 to+0.476
Neg logit contributions
eling-1.004
ited-0.891
i-0.877
ering-0.867
ie-0.847
Token his
Token ID465
Feature activation-4.56e+00

Pos logit contributions
 name+0.159
 names+0.140
+0.133
 to+0.133
 They+0.131
Neg logit contributions
eling-0.261
ited-0.233
i-0.226
ering-0.220
ie-0.217
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.360
i+0.328
ering+0.323
ience+0.317
ited+0.317
Neg logit contributions
Circ-0.099
 They-0.095
 Bang-0.093
 Some-0.088
 name-0.088
Token and
Token ID290
Feature activation-2.55e+00

Pos logit contributions
 name+0.880
 names+0.766
+0.736
 to+0.733
 They+0.724
Neg logit contributions
eling-1.497
ited-1.327
i-1.292
ering-1.278
ie-1.242
Token how
Token ID703
Feature activation-6.50e+00

Pos logit contributions
eling+0.169
i+0.147
ited+0.145
ering+0.144
ie+0.141
Neg logit contributions
 name-0.108
 names-0.094
Circ-0.092
 fence-0.092
-0.091
Token to
Token ID284
Feature activation+1.42e+01

Pos logit contributions
eling+0.468
i+0.410
ering+0.409
ie+0.404
ience+0.396
Neg logit contributions
 name-0.245
 fence-0.211
 names-0.206
Circ-0.199
 Bang-0.199
Token draw
Token ID3197
Feature activation+7.28e+00

Pos logit contributions
 name+0.579
 names+0.491
+0.469
 to+0.455
 They+0.449
Neg logit contributions
eling-1.026
ited-0.935
ering-0.900
i-0.898
ience-0.881
Token a
Token ID257
Feature activation+4.81e+00

Pos logit contributions
 name+0.285
 names+0.246
+0.236
 They+0.232
 to+0.231
Neg logit contributions
eling-0.533
ited-0.475
i-0.466
ering-0.458
ie-0.448
Token a
Token ID257
Feature activation+4.72e+00

Pos logit contributions
eling+0.161
i+0.143
ited+0.143
ering+0.141
ience+0.137
Neg logit contributions
 name-0.060
 fence-0.047
Circ-0.047
 names-0.046
 They-0.046
Token new
Token ID649
Feature activation+1.06e+00

Pos logit contributions
 name+0.164
 They+0.162
 names+0.155
Circ+0.155
+0.154
Neg logit contributions
eling-0.316
ited-0.287
ience-0.275
i-0.275
ering-0.274
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
 name+0.036
Circ+0.035
 They+0.034
+0.034
 names+0.033
Neg logit contributions
eling-0.069
i-0.063
ie-0.062
ited-0.062
ering-0.062
Token.
Token ID13
Feature activation+3.38e+00

Pos logit contributions
 name+0.809
 names+0.696
+0.661
 They+0.654
 to+0.646
Neg logit contributions
eling-1.542
ited-1.364
i-1.352
ering-1.335
ie-1.309
Token His
Token ID2399
Feature activation-5.64e-01

Pos logit contributions
 name+0.162
 names+0.158
 corner+0.154
 fence+0.152
Circ+0.150
Neg logit contributions
eling-0.150
heart-0.138
mmm-0.137
 chuckled-0.135
i-0.134
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.040
i+0.040
ering+0.038
ie+0.038
ise+0.037
Neg logit contributions
Circ-0.015
 Some-0.012
 They-0.012
 hills-0.012
 Bang-0.012
Token was
Token ID373
Feature activation-1.64e+00

Pos logit contributions
 name+0.760
 to+0.665
+0.662
 names+0.599
 They+0.597
Neg logit contributions
eling-1.643
ited-1.436
ering-1.423
i-1.422
heart-1.379
Token Dash
Token ID16189
Feature activation+6.51e-01

Pos logit contributions
eling+0.105
ited+0.091
i+0.091
ering+0.089
ie+0.087
Neg logit contributions
 name-0.078
 names-0.069
-0.066
 They-0.066
 to-0.065
Token.
Token ID13
Feature activation+3.35e+00

Pos logit contributions
 name+0.023
+0.019
 to+0.019
 names+0.019
 They+0.018
Neg logit contributions
eling-0.052
ited-0.047
ering-0.045
i-0.044
ie-0.043
Token Dash
Token ID16189
Feature activation+6.71e-01

Pos logit contributions
 name+0.111
 corner+0.107
 names+0.102
 fence+0.101
Circ+0.099
Neg logit contributions
eling-0.199
i-0.185
ises-0.185
ience-0.183
 chuckled-0.181
Token said
Token ID531
Feature activation-5.63e+00

Pos logit contributions
 name+0.031
 to+0.029
+0.028
 They+0.028
 names+0.027
Neg logit contributions
eling-0.047
ited-0.040
ering-0.040
heart-0.038
eking-0.038
Token said
Token ID531
Feature activation-5.63e+00

Pos logit contributions
 name+0.355
 to+0.333
+0.330
 They+0.300
 names+0.297
Neg logit contributions
eling-0.498
ited-0.436
ering-0.424
heart-0.420
iness-0.407
Token he
Token ID339
Feature activation-3.41e+00

Pos logit contributions
eling+0.363
ited+0.326
i+0.294
ering+0.285
ie+0.282
Neg logit contributions
 name-0.296
 to-0.285
-0.264
 names-0.260
 They-0.256
Token had
Token ID550
Feature activation-2.02e+00

Pos logit contributions
eling+0.210
ited+0.181
ering+0.176
i+0.175
ie+0.169
Neg logit contributions
 name-0.178
 names-0.158
-0.153
 to-0.152
 They-0.151
Token a
Token ID257
Feature activation+4.72e+00

Pos logit contributions
eling+0.161
i+0.143
ited+0.143
ering+0.141
ience+0.137
Neg logit contributions
 name-0.060
 fence-0.047
Circ-0.047
 names-0.046
 They-0.046
Token new
Token ID649
Feature activation+1.06e+00

Pos logit contributions
 name+0.164
 They+0.162
 names+0.155
Circ+0.155
+0.154
Neg logit contributions
eling-0.316
ited-0.287
ience-0.275
i-0.275
ering-0.274
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
 name+0.036
Circ+0.035
 They+0.034
+0.034
 names+0.033
Neg logit contributions
eling-0.069
i-0.063
ie-0.062
ited-0.062
ering-0.062
Token.
Token ID13
Feature activation+3.38e+00

Pos logit contributions
 name+0.809
 names+0.696
+0.661
 They+0.654
 to+0.646
Neg logit contributions
eling-1.542
ited-1.364
i-1.352
ering-1.335
ie-1.309
Token His
Token ID2399
Feature activation-5.64e-01

Pos logit contributions
 name+0.162
 names+0.158
 corner+0.154
 fence+0.152
Circ+0.150
Neg logit contributions
eling-0.150
heart-0.138
mmm-0.137
 chuckled-0.135
i-0.134
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.040
i+0.040
ering+0.038
ie+0.038
ise+0.037
Neg logit contributions
Circ-0.015
 Some-0.012
 They-0.012
 hills-0.012
 Bang-0.012
Token was
Token ID373
Feature activation-1.64e+00

Pos logit contributions
 name+0.760
 to+0.665
+0.662
 names+0.599
 They+0.597
Neg logit contributions
eling-1.643
ited-1.436
ering-1.423
i-1.422
heart-1.379
Token Dash
Token ID16189
Feature activation+6.51e-01

Pos logit contributions
eling+0.105
ited+0.091
i+0.091
ering+0.089
ie+0.087
Neg logit contributions
 name-0.078
 names-0.069
-0.066
 They-0.066
 to-0.065
Token inside
Token ID2641
Feature activation+1.02e+01

Pos logit contributions
eling+0.118
ie+0.110
i+0.109
ering+0.109
ited+0.108
Neg logit contributions
 name-0.053
 names-0.045
 They-0.043
 Some-0.043
Circ-0.042
Token and
Token ID290
Feature activation-2.25e+00

Pos logit contributions
 name+0.390
 names+0.332
+0.325
 to+0.324
 They+0.316
Neg logit contributions
eling-0.756
ited-0.669
i-0.662
ering-0.654
ie-0.637
Token give
Token ID1577
Feature activation-3.42e-01

Pos logit contributions
eling+0.158
ited+0.140
i+0.137
ering+0.137
ie+0.132
Neg logit contributions
 name-0.096
 names-0.083
 to-0.081
-0.080
 They-0.079
Token it
Token ID340
Feature activation-1.61e+00

Pos logit contributions
eling+0.020
i+0.017
ited+0.017
ering+0.017
ie+0.016
Neg logit contributions
 name-0.018
 names-0.016
 fence-0.016
 road-0.016
 They-0.016
Token a
Token ID257
Feature activation+4.72e+00

Pos logit contributions
eling+0.132
i+0.123
ie+0.122
ited+0.120
ering+0.119
Neg logit contributions
 name-0.037
 road-0.032
 fence-0.032
 They-0.031
 Some-0.029
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
 They+0.171
 road+0.160
 name+0.158
 names+0.158
 hills+0.156
Neg logit contributions
eling-0.290
ited-0.278
ience-0.258
ie-0.255
 chuckled-0.254
Token.
Token ID13
Feature activation+3.63e+00

Pos logit contributions
 name+0.792
 names+0.682
 They+0.645
+0.644
 fence+0.636
Neg logit contributions
eling-1.538
ited-1.367
i-1.364
ering-1.337
ie-1.321
Token One
Token ID1881
Feature activation+2.45e+00

Pos logit contributions
 name+0.115
 road+0.113
 fence+0.110
 names+0.109
Circ+0.101
Neg logit contributions
eling-0.229
heart-0.222
i-0.218
mmm-0.213
ience-0.209
Token day
Token ID1110
Feature activation+3.00e+00

Pos logit contributions
 name+0.083
 to+0.073
+0.067
 names+0.064
 fence+0.062
Neg logit contributions
eling-0.198
ited-0.174
ering-0.172
i-0.169
ience-0.166
Token it
Token ID340
Feature activation-1.61e+00

Pos logit contributions
 to+0.149
 name+0.147
+0.138
 across+0.129
 They+0.127
Neg logit contributions
eling-0.207
ited-0.180
ering-0.168
ieved-0.163
ience-0.162
Token was
Token ID373
Feature activation-1.51e+00

Pos logit contributions
eling+0.113
ering+0.098
ited+0.098
i+0.097
heart+0.093
Neg logit contributions
 name-0.067
 to-0.061
-0.058
 names-0.057
 across-0.056
Token was
Token ID373
Feature activation-1.60e+00

Pos logit contributions
 name+0.540
 to+0.523
+0.496
 fence+0.465
 names+0.456
Neg logit contributions
eling-0.845
ited-0.701
ering-0.698
heart-0.681
leness-0.668
Token so
Token ID523
Feature activation-8.15e+00

Pos logit contributions
eling+0.100
ited+0.086
i+0.085
ering+0.084
ie+0.082
Neg logit contributions
 name-0.080
 names-0.071
-0.069
 They-0.068
 to-0.068
Token proud
Token ID6613
Feature activation-8.25e+00

Pos logit contributions
eling+0.498
ited+0.429
i+0.424
ering+0.418
ie+0.407
Neg logit contributions
 name-0.418
 names-0.375
-0.359
 They-0.357
 to-0.354
Token of
Token ID286
Feature activation+5.60e+00

Pos logit contributions
eling+0.561
i+0.503
ie+0.493
ited+0.493
ering+0.490
Neg logit contributions
 name-0.349
 names-0.307
Circ-0.287
 Bang-0.282
 fence-0.281
Token its
Token ID663
Feature activation-5.01e+00

Pos logit contributions
 name+0.205
+0.194
 They+0.189
 to+0.182
 names+0.170
Neg logit contributions
ited-0.415
eling-0.408
i-0.368
ience-0.367
ering-0.360
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.327
i+0.293
ited+0.291
ering+0.290
ie+0.288
Neg logit contributions
Circ-0.186
 They-0.183
 Bang-0.175
 Some-0.174
 name-0.172
Token that
Token ID326
Feature activation-3.07e+00

Pos logit contributions
 name+0.785
 names+0.665
 to+0.652
+0.649
 They+0.634
Neg logit contributions
eling-1.598
ited-1.406
i-1.380
ering-1.371
ie-1.335
Token it
Token ID340
Feature activation-1.61e+00

Pos logit contributions
eling+0.203
ited+0.178
i+0.173
ering+0.172
ie+0.166
Neg logit contributions
 name-0.146
 names-0.128
-0.127
 to-0.125
 They-0.125
Token did
Token ID750
Feature activation-4.56e+00

Pos logit contributions
eling+0.118
ited+0.106
ering+0.103
i+0.101
heart+0.099
Neg logit contributions
 name-0.066
-0.061
 to-0.059
 They-0.056
 names-0.055
Token not
Token ID407
Feature activation-1.23e+00

Pos logit contributions
eling+0.307
ited+0.273
i+0.269
ering+0.263
ie+0.258
Neg logit contributions
 name-0.208
 names-0.182
 to-0.176
-0.176
 They-0.173
Token want
Token ID765
Feature activation+1.54e+00

Pos logit contributions
eling+0.084
ited+0.074
i+0.073
ering+0.073
ie+0.070
Neg logit contributions
 name-0.055
 names-0.048
-0.046
 to-0.045
 They-0.045
Token proud
Token ID6613
Feature activation-8.36e+00

Pos logit contributions
eling+0.534
i+0.462
ited+0.457
ie+0.457
ering+0.456
Neg logit contributions
 name-0.409
-0.364
 to-0.345
 They-0.338
 names-0.337
Token.
Token ID13
Feature activation+3.30e+00

Pos logit contributions
eling+0.588
i+0.521
ited+0.518
ering+0.513
ie+0.508
Neg logit contributions
 name-0.339
 names-0.295
Circ-0.275
 fence-0.275
 They-0.273
Token He
Token ID679
Feature activation+7.65e+00

Pos logit contributions
 name+0.083
 fence+0.082
 names+0.077
Circ+0.076
 road+0.070
Neg logit contributions
eling-0.237
heart-0.222
i-0.218
ience-0.215
ited-0.209
Token practiced
Token ID19893
Feature activation+3.17e+00

Pos logit contributions
 name+0.406
+0.369
 to+0.366
 They+0.348
 names+0.346
Neg logit contributions
eling-0.489
ering-0.421
ited-0.411
heart-0.393
ie-0.384
Token his
Token ID465
Feature activation-4.66e+00

Pos logit contributions
 name+0.151
 to+0.137
 names+0.134
+0.133
 They+0.128
Neg logit contributions
eling-0.205
ited-0.185
i-0.178
ie-0.172
heart-0.171
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.315
ering+0.277
i+0.276
ited+0.274
ience+0.272
Neg logit contributions
 name-0.174
 They-0.164
Circ-0.159
 Some-0.156
 Bang-0.156
Token everywhere
Token ID8347
Feature activation+4.61e+00

Pos logit contributions
 name+0.870
 names+0.756
 to+0.754
+0.736
 across+0.711
Neg logit contributions
eling-1.515
ited-1.330
i-1.293
ering-1.275
ie-1.246
Token,
Token ID11
Feature activation-1.17e+01

Pos logit contributions
 name+0.183
 to+0.181
+0.175
 They+0.164
 He+0.158
Neg logit contributions
eling-0.326
ited-0.310
i-0.280
ience-0.269
ering-0.267
Token even
Token ID772
Feature activation-8.97e+00

Pos logit contributions
eling+0.772
ited+0.673
i+0.663
ering+0.654
ie+0.641
Neg logit contributions
 name-0.540
 names-0.474
 to-0.465
-0.463
 They-0.458
Token when
Token ID618
Feature activation-3.92e+00

Pos logit contributions
eling+0.545
ited+0.469
i+0.464
ering+0.456
ie+0.444
Neg logit contributions
 name-0.460
 names-0.411
-0.398
 to-0.396
 They-0.393
Token the
Token ID262
Feature activation-2.71e+00

Pos logit contributions
eling+0.245
ited+0.214
i+0.210
ering+0.207
ie+0.201
Neg logit contributions
 name-0.193
 names-0.172
-0.168
 to-0.167
 They-0.166
Token She
Token ID1375
Feature activation+2.79e+00

Pos logit contributions
Circ+0.104
 road+0.103
 whole+0.100
 fence+0.098
 name+0.097
Neg logit contributions
i-0.185
mmm-0.182
eling-0.179
 reassured-0.167
heart-0.166
Token tried
Token ID3088
Feature activation-3.02e+00

Pos logit contributions
 name+0.133
+0.122
 to+0.115
 names+0.114
 They+0.110
Neg logit contributions
eling-0.194
ering-0.171
heart-0.169
ited-0.164
i-0.157
Token to
Token ID284
Feature activation+1.45e+01

Pos logit contributions
eling+0.260
i+0.240
ering+0.239
ie+0.234
ited+0.227
Neg logit contributions
 name-0.062
 fence-0.046
 road-0.044
Circ-0.044
 names-0.043
Token write
Token ID3551
Feature activation+3.90e+00

Pos logit contributions
 name+0.610
 names+0.514
+0.495
 to+0.485
 Bang+0.466
Neg logit contributions
eling-1.035
ited-0.921
ience-0.912
i-0.912
ering-0.909
Token her
Token ID607
Feature activation-9.46e+00

Pos logit contributions
 name+0.167
 names+0.147
 to+0.141
+0.140
 They+0.135
Neg logit contributions
eling-0.281
ited-0.251
i-0.241
ering-0.235
ie-0.232
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+0.846
i+0.774
ering+0.766
ited+0.748
ie+0.747
Neg logit contributions
 name-0.125
 They-0.124
Circ-0.123
 fence-0.116
 road-0.113
Token.
Token ID13
Feature activation+3.42e+00

Pos logit contributions
 name+0.935
 names+0.818
 to+0.817
+0.803
 They+0.770
Neg logit contributions
eling-1.498
ited-1.322
ering-1.249
i-1.243
ience-1.209
Token L
Token ID406
Feature activation-2.22e+00

Pos logit contributions
 fence+0.095
 road+0.093
 corner+0.088
Circ+0.086
 name+0.085
Neg logit contributions
eling-0.207
i-0.206
mmm-0.204
ering-0.194
heart-0.190
Token-
Token ID12
Feature activation-6.63e+00

Pos logit contributions
eling+0.113
ited+0.097
i+0.096
ering+0.092
ience+0.091
Neg logit contributions
 name-0.134
-0.124
 names-0.118
 They-0.118
 to-0.118
TokenI
Token ID40
Feature activation+5.93e+00

Pos logit contributions
eling+0.385
ited+0.329
i+0.328
ering+0.320
ie+0.315
Neg logit contributions
 name-0.355
 names-0.318
-0.305
 They-0.305
 fence-0.304
Token-
Token ID12
Feature activation-6.66e+00

Pos logit contributions
 name+0.277
+0.237
 names+0.237
 to+0.225
 They+0.222
Neg logit contributions
eling-0.405
ering-0.354
i-0.349
ie-0.338
ited-0.338
Token book
Token ID1492
Feature activation+5.27e+00

Pos logit contributions
eling+0.903
i+0.799
ering+0.794
ience+0.786
ited+0.780
Neg logit contributions
 They-0.420
Circ-0.409
 Bang-0.373
 Some-0.371
-0.370
Token.
Token ID13
Feature activation+3.39e+00

Pos logit contributions
 name+0.222
 names+0.194
Circ+0.186
 They+0.183
 fence+0.181
Neg logit contributions
eling-0.350
i-0.310
ited-0.307
ering-0.306
ie-0.303
Token What
Token ID1867
Feature activation+8.53e+00

Pos logit contributions
 fence+0.119
 name+0.115
 corner+0.115
 across+0.113
 yard+0.113
Neg logit contributions
eling-0.205
i-0.194
heart-0.186
ience-0.185
mmm-0.185
Token is
Token ID318
Feature activation-2.64e+00

Pos logit contributions
 name+0.374
 names+0.330
+0.313
 to+0.313
 They+0.302
Neg logit contributions
eling-0.602
ited-0.537
i-0.518
ering-0.515
ie-0.496
Token your
Token ID534
Feature activation-1.31e+01

Pos logit contributions
eling+0.149
i+0.128
ering+0.124
ited+0.124
ie+0.121
Neg logit contributions
 name-0.142
 names-0.129
 They-0.124
-0.124
 fence-0.123
Token name
Token ID1438
Feature activation+2.11e+01

Pos logit contributions
eling+1.128
i+1.063
ering+1.020
ie+1.016
ises+1.005
Neg logit contributions
Circ-0.163
 They-0.148
 Bang-0.126
 Some-0.125
Let-0.115
Token?"
Token ID1701
Feature activation-8.27e+00

Pos logit contributions
 name+1.108
+0.977
 names+0.970
 to+0.968
 They+0.932
Neg logit contributions
eling-1.289
ited-1.128
ering-1.075
i-1.074
ience-1.024
Token
Token ID198
Feature activation+1.02e+01

Pos logit contributions
eling+0.606
i+0.564
 chuckled+0.548
ie+0.543
ise+0.535
Neg logit contributions
 name-0.202
 names-0.200
 fence-0.167
tr-0.155
 road-0.154
Token
Token ID198
Feature activation+1.01e+01

Pos logit contributions
 name+0.260
 names+0.246
 fence+0.210
 road+0.203
tr+0.195
Neg logit contributions
eling-0.765
ie-0.723
ise-0.699
i-0.699
ering-0.677
Token"
Token ID1
Feature activation-5.17e+00

Pos logit contributions
 name+0.421
 names+0.365
 They+0.337
 to+0.337
 fence+0.333
Neg logit contributions
eling-0.722
ited-0.638
i-0.627
ering-0.617
ie-0.607
TokenTim
Token ID14967
Feature activation-1.61e+00

Pos logit contributions
eling+0.352
ited+0.315
i+0.312
ering+0.295
ience+0.294
Neg logit contributions
 name-0.206
 names-0.178
 fence-0.177
 to-0.174
 yard-0.170
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
eling+0.433
ited+0.408
ering+0.391
ience+0.386
i+0.382
Neg logit contributions
 name-0.202
-0.171
 to-0.158
 names-0.157
 They-0.154
Token little
Token ID1310
Feature activation-2.38e+00

Pos logit contributions
 name+0.038
 names+0.034
 They+0.033
Circ+0.032
+0.032
Neg logit contributions
eling-0.073
ited-0.066
i-0.064
ering-0.064
ie-0.063
Token bird
Token ID6512
Feature activation+3.18e+00

Pos logit contributions
eling+0.170
ited+0.150
i+0.149
ering+0.146
ie+0.145
Neg logit contributions
 name-0.095
 names-0.083
-0.078
 They-0.077
Circ-0.076
Token named
Token ID3706
Feature activation-1.04e+01

Pos logit contributions
 name+0.165
 to+0.155
+0.153
 They+0.143
 names+0.140
Neg logit contributions
eling-0.186
ited-0.160
ering-0.155
i-0.150
ience-0.148
Token Tweet
Token ID18752
Feature activation-8.29e-01

Pos logit contributions
eling+0.625
i+0.552
ie+0.534
ited+0.527
ering+0.522
Neg logit contributions
 name-0.511
 names-0.463
Circ-0.437
 fence-0.432
 road-0.425
Tokenie
Token ID494
Feature activation-3.06e+01

Pos logit contributions
eling+0.038
ited+0.031
ering+0.030
ience+0.029
i+0.028
Neg logit contributions
 name-0.054
-0.051
 to-0.050
 names-0.049
 They-0.048
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
eling+2.409
ited+2.164
ering+2.086
i+2.082
heart+2.026
Neg logit contributions
 name-1.047
-0.871
 names-0.865
 to-0.850
 They-0.815
Token Tweet
Token ID18752
Feature activation-2.08e-01

Pos logit contributions
 name+0.054
 names+0.052
Circ+0.051
 corner+0.048
 across+0.047
Neg logit contributions
eling-0.069
i-0.067
ie-0.066
heart-0.062
ering-0.059
Tokenie
Token ID494
Feature activation-2.96e+01

Pos logit contributions
eling+0.006
mmm+0.004
ited+0.004
ience+0.004
ering+0.004
Neg logit contributions
-0.016
 name-0.015
 to-0.015
 They-0.015
 big-0.014
Token loved
Token ID6151
Feature activation-7.98e+00

Pos logit contributions
eling+2.264
ited+2.087
ering+1.927
heart+1.907
ieved+1.876
Neg logit contributions
-1.121
 name-1.097
 to-1.031
 They-0.970
 The-0.863
Token to
Token ID284
Feature activation+1.22e+01

Pos logit contributions
eling+0.685
ie+0.650
i+0.637
ering+0.604
heart+0.601
Neg logit contributions
 name-0.178
 names-0.127
tted-0.122
 corner-0.121
 They-0.117
Token "
Token ID366
Feature activation+3.71e+00

Pos logit contributions
eling+0.164
i+0.143
ering+0.143
ited+0.142
ie+0.141
Neg logit contributions
 name-0.098
 names-0.086
 They-0.081
Circ-0.080
-0.080
TokenI
Token ID40
Feature activation+1.16e+00

Pos logit contributions
 name+0.154
 names+0.136
 fence+0.128
 to+0.125
 They+0.123
Neg logit contributions
eling-0.260
ited-0.235
i-0.224
ering-0.220
heart-0.218
Token'm
Token ID1101
Feature activation-1.17e+00

Pos logit contributions
 name+0.048
 names+0.040
+0.040
 to+0.039
 They+0.038
Neg logit contributions
eling-0.084
i-0.074
ering-0.073
ited-0.073
ie-0.071
Token sorry
Token ID7926
Feature activation-8.41e+00

Pos logit contributions
eling+0.067
ited+0.062
i+0.058
ering+0.057
ie+0.057
Neg logit contributions
 name-0.063
-0.057
 to-0.055
 They-0.054
 names-0.054
Token sweet
Token ID6029
Feature activation-1.02e+01

Pos logit contributions
eling+0.474
ited+0.408
i+0.379
ering+0.378
ience+0.359
Neg logit contributions
 name-0.496
 names-0.445
 to-0.441
-0.439
 They-0.432
Tokenie
Token ID494
Feature activation-3.05e+01

Pos logit contributions
eling+0.656
ited+0.550
ering+0.539
i+0.519
ience+0.518
Neg logit contributions
 name-0.527
 to-0.464
-0.462
 names-0.460
 They-0.449
Token,
Token ID11
Feature activation-1.43e+01

Pos logit contributions
eling+2.253
ited+1.980
ering+1.915
i+1.883
ience+1.792
Neg logit contributions
 name-1.245
 to-1.061
-1.041
 names-1.030
 They-1.014
Token but
Token ID475
Feature activation-9.06e+00

Pos logit contributions
eling+1.011
ited+0.892
ering+0.867
i+0.862
ie+0.827
Neg logit contributions
 name-0.632
 names-0.536
-0.529
 to-0.518
 They-0.518
Token it
Token ID340
Feature activation-3.56e+00

Pos logit contributions
eling+0.646
ited+0.621
ering+0.557
ience+0.553
i+0.547
Neg logit contributions
 name-0.391
 to-0.355
-0.341
 names-0.318
 They-0.308
Token's
Token ID338
Feature activation-7.93e+00

Pos logit contributions
eling+0.255
ited+0.231
ering+0.227
i+0.224
itive+0.213
Neg logit contributions
 name-0.141
 to-0.121
-0.119
 names-0.118
 They-0.114
Token getting
Token ID1972
Feature activation-8.70e+00

Pos logit contributions
eling+0.488
ited+0.429
ering+0.413
i+0.412
ience+0.397
Neg logit contributions
 name-0.411
 names-0.362
 to-0.358
-0.358
 They-0.348
Token little
Token ID1310
Feature activation-2.38e+00

Pos logit contributions
 name+0.038
 names+0.034
 They+0.033
Circ+0.032
+0.032
Neg logit contributions
eling-0.073
ited-0.066
i-0.064
ering-0.064
ie-0.063
Token bird
Token ID6512
Feature activation+3.18e+00

Pos logit contributions
eling+0.170
ited+0.150
i+0.149
ering+0.146
ie+0.145
Neg logit contributions
 name-0.095
 names-0.083
-0.078
 They-0.077
Circ-0.076
Tokeny
Token ID88
Feature activation-1.21e+01

Pos logit contributions
 name+0.165
 to+0.155
+0.153
 They+0.143
 names+0.140
Neg logit contributions
eling-0.186
ited-0.160
ering-0.155
i-0.150
ience-0.148
Token named
Token ID3706
Feature activation-1.02e+01

Pos logit contributions
eling+0.813
ited+0.711
i+0.700
ering+0.693
ie+0.674
Neg logit contributions
 name-0.537
 names-0.470
-0.452
 They-0.447
 to-0.445
Token Tweet
Token ID18752
Feature activation-6.07e-01

Pos logit contributions
eling+0.599
i+0.532
ie+0.514
ited+0.502
ering+0.496
Neg logit contributions
 name-0.513
 names-0.467
Circ-0.443
 fence-0.439
 road-0.434
Tokenie
Token ID494
Feature activation-3.04e+01

Pos logit contributions
eling+0.015
ited+0.010
ering+0.009
ience+0.008
i+0.008
Neg logit contributions
 name-0.053
-0.050
 to-0.049
 names-0.049
 They-0.048
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
eling+2.352
ited+2.099
i+2.053
ering+2.042
ie+1.985
Neg logit contributions
 name-1.066
 names-0.894
-0.867
 to-0.848
 They-0.838
Token Tweet
Token ID18752
Feature activation-7.81e-02

Pos logit contributions
 name+0.055
 names+0.053
Circ+0.053
 corner+0.050
 across+0.048
Neg logit contributions
eling-0.067
i-0.065
ie-0.064
heart-0.061
ering-0.057
Tokenie
Token ID494
Feature activation-2.95e+01

Pos logit contributions
eling+0.002
ited+0.002
mmm+0.002
ience+0.002
ering+0.001
Neg logit contributions
-0.006
 name-0.006
 to-0.006
 They-0.006
 big-0.005
Token was
Token ID373
Feature activation-3.73e+00

Pos logit contributions
eling+2.242
ited+2.061
ering+1.910
heart+1.875
ieved+1.856
Neg logit contributions
-1.122
 name-1.113
 to-1.039
 They-0.983
 fence-0.876
Token very
Token ID845
Feature activation-7.63e+00

Pos logit contributions
eling+0.265
ited+0.237
i+0.226
ering+0.226
ience+0.223
Neg logit contributions
 name-0.161
-0.138
 to-0.135
 names-0.134
 They-0.130
TokenH
Token ID39
Feature activation-1.27e+01

Pos logit contributions
 name+0.217
 names+0.194
 fence+0.170
 to+0.168
Circ+0.166
Neg logit contributions
eling-0.473
ited-0.423
i-0.416
ie-0.410
ering-0.409
Tokenoot
Token ID1025
Feature activation-1.13e+01

Pos logit contributions
eling+0.641
ited+0.517
ering+0.500
i+0.485
ience+0.474
Neg logit contributions
 name-0.733
-0.705
 They-0.672
 Along-0.667
 road-0.634
Tokenie
Token ID494
Feature activation-3.08e+01

Pos logit contributions
eling+0.545
ited+0.467
ering+0.417
mmm+0.409
ience+0.381
Neg logit contributions
 name-0.696
 to-0.671
-0.661
 names-0.631
 They-0.630
Token led
Token ID2957
Feature activation+1.74e+00

Pos logit contributions
eling+2.022
ited+1.809
ering+1.714
izing+1.618
itive+1.617
Neg logit contributions
 name-1.441
-1.340
 to-1.322
 They-1.199
 names-1.175
Token Tweet
Token ID18752
Feature activation-4.47e-01

Pos logit contributions
 name+0.092
 names+0.083
 fence+0.083
 They+0.081
Circ+0.080
Neg logit contributions
eling-0.095
i-0.087
ie-0.085
ering-0.081
heart-0.079
Tokenie
Token ID494
Feature activation-3.03e+01

Pos logit contributions
eling+0.009
ited+0.005
i+0.005
ering+0.005
ience+0.004
Neg logit contributions
 name-0.041
 names-0.038
-0.038
 to-0.038
 They-0.037
Token to
Token ID284
Feature activation+1.17e+01

Pos logit contributions
eling+2.436
i+2.257
ering+2.224
ie+2.211
ited+2.172
Neg logit contributions
 name-0.821
 Bang-0.687
Circ-0.683
 names-0.674
 They-0.654
Token a
Token ID257
Feature activation+2.17e+00

Pos logit contributions
 name+0.521
 names+0.478
 They+0.452
 fence+0.451
 Some+0.446
Neg logit contributions
eling-0.746
i-0.630
ie-0.613
ited-0.598
ering-0.596
Token big
Token ID1263
Feature activation+8.86e+00

Pos logit contributions
 name+0.072
 names+0.067
 They+0.063
 Bang+0.061
 Some+0.060
Neg logit contributions
eling-0.158
ited-0.143
ie-0.141
ering-0.140
i-0.140
Token tree
Token ID5509
Feature activation+1.08e+01

Pos logit contributions
 name+0.270
 names+0.226
+0.207
 They+0.206
 Bang+0.198
Neg logit contributions
eling-0.717
ited-0.644
i-0.639
ering-0.630
ie-0.625
Token with
Token ID351
Feature activation+2.20e+00

Pos logit contributions
 name+0.448
 names+0.398
Circ+0.373
 Bang+0.368
 They+0.350
Neg logit contributions
eling-0.756
ited-0.678
i-0.674
ering-0.668
ie-0.664
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
eling+0.755
i+0.683
ie+0.670
ering+0.653
ited+0.637
Neg logit contributions
 name-0.414
Circ-0.403
 Bang-0.386
 They-0.384
 fence-0.384
Token necklace
Token ID38987
Feature activation-4.89e+00

Pos logit contributions
 name+0.038
 names+0.034
 They+0.033
Circ+0.032
+0.032
Neg logit contributions
eling-0.073
ited-0.066
i-0.064
ering-0.064
ie-0.063
Token for
Token ID329
Feature activation-5.61e+00

Pos logit contributions
eling+0.356
ited+0.313
i+0.308
ering+0.305
ie+0.297
Neg logit contributions
 name-0.194
 names-0.167
-0.160
 to-0.159
 They-0.157
Token O
Token ID440
Feature activation-5.36e+00

Pos logit contributions
eling+0.375
ited+0.331
i+0.324
ering+0.321
ie+0.312
Neg logit contributions
 name-0.252
 names-0.222
-0.215
 to-0.212
 They-0.211
Tokenll
Token ID297
Feature activation-1.30e+01

Pos logit contributions
eling+0.253
ited+0.203
ering+0.195
i+0.194
ience+0.186
Neg logit contributions
 name-0.337
-0.307
 to-0.301
 names-0.300
 They-0.289
Tokenie
Token ID494
Feature activation-3.02e+01

Pos logit contributions
eling+0.499
ience+0.420
ited+0.413
ering+0.381
i+0.372
Neg logit contributions
 name-0.951
-0.862
 to-0.858
 names-0.855
 They-0.837
Token with
Token ID351
Feature activation+1.56e+00

Pos logit contributions
eling+2.229
i+1.989
ited+1.974
ering+1.940
ie+1.934
Neg logit contributions
 name-1.106
 names-0.957
Circ-0.903
 Bang-0.888
 They-0.883
Token the
Token ID262
Feature activation-4.98e+00

Pos logit contributions
 name+0.060
 names+0.051
+0.049
 They+0.048
 to+0.047
Neg logit contributions
eling-0.115
ited-0.102
i-0.101
ering-0.100
ie-0.098
Token stones
Token ID14966
Feature activation+4.53e+00

Pos logit contributions
eling+0.343
i+0.302
ited+0.299
ie+0.294
ering+0.294
Neg logit contributions
 name-0.200
 names-0.175
Circ-0.173
 They-0.173
-0.170
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
 name+0.161
 names+0.137
 to+0.134
+0.131
 They+0.129
Neg logit contributions
eling-0.347
ited-0.308
i-0.300
ering-0.298
ie-0.291
Token O
Token ID440
Feature activation-3.93e+00

Pos logit contributions
 name+0.050
 names+0.046
Circ+0.043
 across+0.041
 fence+0.040
Neg logit contributions
eling-0.114
i-0.109
heart-0.107
ie-0.104
ited-0.100
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
eling+0.265
leness+0.190
mmm+0.173
ited+0.173
izing+0.172
Neg logit contributions
 name-0.817
-0.798
 to-0.793
 corner-0.749
 names-0.747
Token little
Token ID1310
Feature activation-2.38e+00

Pos logit contributions
 name+0.038
 names+0.034
 They+0.033
Circ+0.032
+0.032
Neg logit contributions
eling-0.073
ited-0.066
i-0.064
ering-0.064
ie-0.063
Token girl
Token ID2576
Feature activation-8.90e+00

Pos logit contributions
eling+0.170
ited+0.150
i+0.149
ering+0.146
ie+0.145
Neg logit contributions
 name-0.095
 names-0.083
-0.078
 They-0.077
Circ-0.076
Token named
Token ID3706
Feature activation-1.02e+01

Pos logit contributions
eling+0.533
ited+0.457
ering+0.446
i+0.434
ience+0.418
Neg logit contributions
 name-0.473
 to-0.424
-0.424
 names-0.414
 They-0.409
Token Sus
Token ID8932
Feature activation-1.24e+01

Pos logit contributions
eling+0.657
i+0.571
ited+0.568
ering+0.557
ie+0.555
Neg logit contributions
 name-0.481
 names-0.427
-0.401
Circ-0.401
 fence-0.400
Tokenie
Token ID494
Feature activation-3.02e+01

Pos logit contributions
eling+0.325
ience+0.231
ited+0.226
ises+0.198
ering+0.193
Neg logit contributions
 name-0.998
-0.979
 to-0.974
 corner-0.938
 names-0.927
Token.
Token ID13
Feature activation+1.71e+00

Pos logit contributions
eling+2.412
ited+2.172
ering+2.104
i+2.102
ience+2.043
Neg logit contributions
 name-0.998
-0.814
 names-0.804
 to-0.794
 They-0.759
Token Sus
Token ID8932
Feature activation-1.16e+01

Pos logit contributions
 name+0.061
 names+0.056
Circ+0.055
 fence+0.054
 road+0.051
Neg logit contributions
eling-0.112
heart-0.105
ie-0.105
i-0.105
ering-0.098
Tokenie
Token ID494
Feature activation-2.91e+01

Pos logit contributions
eling+0.328
ience+0.273
leness+0.236
ises+0.235
ited+0.229
Neg logit contributions
-0.855
 name-0.849
 to-0.835
 corner-0.811
 names-0.792
Token loved
Token ID6151
Feature activation-7.60e+00

Pos logit contributions
eling+2.122
ited+1.903
ering+1.799
i+1.733
heart+1.729
Neg logit contributions
 name-1.219
-1.143
 to-1.108
 They-1.035
 names-0.989
Token to
Token ID284
Feature activation+1.25e+01

Pos logit contributions
eling+0.620
i+0.562
ie+0.558
ering+0.537
heart+0.533
Neg logit contributions
 name-0.208
 names-0.161
 Bang-0.159
 They-0.156
 fence-0.155
Tokenily
Token ID813
Feature activation-7.31e+00

Pos logit contributions
eling+0.557
i+0.486
ering+0.472
ited+0.472
ience+0.460
Neg logit contributions
 name-0.270
-0.247
 to-0.238
 They-0.230
 names-0.226
Token realized
Token ID6939
Feature activation-1.91e+01

Pos logit contributions
eling+0.461
ited+0.396
ering+0.395
i+0.374
ience+0.364
Neg logit contributions
 name-0.372
 names-0.330
-0.329
 to-0.327
 They-0.320
Token that
Token ID326
Feature activation-7.28e+00

Pos logit contributions
eling+1.246
ited+1.154
ering+1.058
i+1.058
ience+1.023
Neg logit contributions
 name-0.888
 to-0.855
-0.839
 They-0.790
 names-0.766
Token the
Token ID262
Feature activation-5.95e+00

Pos logit contributions
eling+0.423
ited+0.368
i+0.356
ering+0.351
ie+0.337
Neg logit contributions
 name-0.396
 names-0.356
-0.353
 to-0.350
 They-0.347
Token gen
Token ID2429
Feature activation-9.56e+00

Pos logit contributions
eling+0.396
i+0.345
ie+0.340
ited+0.340
ering+0.334
Neg logit contributions
 name-0.235
 They-0.222
Circ-0.215
-0.212
 names-0.210
Tokenie
Token ID494
Feature activation-3.01e+01

Pos logit contributions
eling+0.231
heit+0.174
ited+0.170
ience+0.153
i+0.150
Neg logit contributions
 name-0.737
 to-0.725
-0.694
 names-0.666
 big-0.653
Token was
Token ID373
Feature activation-4.47e+00

Pos logit contributions
eling+2.039
ited+1.822
ering+1.756
i+1.706
heart+1.675
Neg logit contributions
 name-1.335
 to-1.212
 names-1.154
-1.147
 They-1.133
Token right
Token ID826
Feature activation+1.32e+00

Pos logit contributions
eling+0.288
ited+0.263
i+0.253
ering+0.247
ie+0.245
Neg logit contributions
 name-0.212
 to-0.187
 names-0.183
-0.182
 across-0.176
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
 name+0.049
 to+0.043
+0.041
 names+0.041
 across+0.039
Neg logit contributions
eling-0.102
ited-0.090
ering-0.086
i-0.086
ience-0.084
Token She
Token ID1375
Feature activation+8.75e-01

Pos logit contributions
 name+0.042
 names+0.041
Circ+0.038
 road+0.036
 across+0.035
Neg logit contributions
eling-0.084
i-0.078
heart-0.077
ie-0.076
ience-0.076
Token thanked
Token ID26280
Feature activation-6.35e+00

Pos logit contributions
 name+0.045
+0.042
 to+0.042
 They+0.040
 names+0.038
Neg logit contributions
eling-0.053
ering-0.047
ited-0.047
heart-0.046
izing-0.043
Token little
Token ID1310
Feature activation-2.37e+00

Pos logit contributions
 name+0.237
 names+0.233
 fence+0.193
tr+0.178
 Bang+0.172
Neg logit contributions
eling-0.631
ering-0.590
ie-0.586
ited-0.570
ieved-0.560
Token girl
Token ID2576
Feature activation-9.29e+00

Pos logit contributions
eling+0.170
ited+0.150
i+0.148
ering+0.147
ie+0.143
Neg logit contributions
 name-0.094
 names-0.081
-0.078
 They-0.077
 to-0.077
Token called
Token ID1444
Feature activation-7.63e+00

Pos logit contributions
eling+0.563
ited+0.489
ering+0.469
i+0.457
heart+0.440
Neg logit contributions
 name-0.486
-0.441
 to-0.438
 names-0.423
 They-0.420
Token L
Token ID406
Feature activation-5.58e+00

Pos logit contributions
eling+0.437
i+0.380
ering+0.367
ited+0.366
ie+0.365
Neg logit contributions
 name-0.397
 names-0.359
 fence-0.345
Circ-0.345
 They-0.342
Tokenizz
Token ID6457
Feature activation-1.21e+01

Pos logit contributions
eling+0.261
ited+0.213
i+0.209
ering+0.206
ie+0.197
Neg logit contributions
 name-0.363
 names-0.332
-0.326
 They-0.322
 to-0.321
Tokenie
Token ID494
Feature activation-3.01e+01

Pos logit contributions
eling+0.279
ited+0.186
ering+0.154
i+0.142
ience+0.141
Neg logit contributions
 name-1.064
 names-1.000
-0.998
 to-0.998
 They-0.981
Token.
Token ID13
Feature activation+1.61e+00

Pos logit contributions
eling+2.200
i+1.967
ited+1.937
ering+1.916
ie+1.912
Neg logit contributions
 name-1.108
 names-0.964
Circ-0.914
 They-0.892
 Bang-0.888
Token She
Token ID1375
Feature activation+9.12e-01

Pos logit contributions
 name+0.049
Circ+0.047
 names+0.047
 fence+0.045
 road+0.044
Neg logit contributions
eling-0.111
i-0.105
heart-0.102
ie-0.100
ering-0.096
Token liked
Token ID8288
Feature activation-5.89e+00

Pos logit contributions
 name+0.041
+0.037
 to+0.036
 They+0.034
 names+0.034
Neg logit contributions
eling-0.064
ited-0.056
ering-0.055
heart-0.053
i-0.052
Token to
Token ID284
Feature activation+1.25e+01

Pos logit contributions
eling+0.518
i+0.477
ie+0.475
ering+0.460
heart+0.459
Neg logit contributions
 name-0.117
 names-0.076
 Bang-0.072
 They-0.070
 fence-0.070
Token perform
Token ID1620
Feature activation+1.11e+00

Pos logit contributions
 name+0.432
 names+0.368
+0.350
 They+0.344
Circ+0.341
Neg logit contributions
eling-0.952
ited-0.842
i-0.840
ering-0.824
ie-0.817
Token
Token ID220
Feature activation+2.59e-01

Pos logit contributions
 name+0.230
 names+0.199
+0.189
 They+0.187
 fence+0.184
Neg logit contributions
eling-0.401
ited-0.353
i-0.351
ering-0.346
ie-0.340
Token
Token ID198
Feature activation+6.48e+00

Pos logit contributions
 name+0.004
 fence+0.004
 road+0.004
Circ+0.003
 names+0.003
Neg logit contributions
eling-0.023
i-0.021
ie-0.021
ering-0.021
heart-0.021
TokenOne
Token ID3198
Feature activation-6.03e+00

Pos logit contributions
 name+0.149
 names+0.125
 fence+0.104
 road+0.099
Circ+0.097
Neg logit contributions
eling-0.550
ited-0.502
ie-0.501
i-0.501
heart-0.493
Token day
Token ID1110
Feature activation-2.71e-01

Pos logit contributions
eling+0.465
ited+0.414
i+0.411
ering+0.406
ie+0.399
Neg logit contributions
 name-0.211
 names-0.178
-0.168
 They-0.166
 to-0.164
Token Mill
Token ID9212
Feature activation-9.52e+00

Pos logit contributions
eling+0.017
ited+0.014
ering+0.014
i+0.013
ience+0.013
Neg logit contributions
 name-0.015
-0.013
 to-0.013
 They-0.013
 names-0.013
Tokenie
Token ID494
Feature activation-3.00e+01

Pos logit contributions
eling+0.242
ering+0.150
ited+0.150
leness+0.142
ience+0.135
Neg logit contributions
 name-0.784
 to-0.761
-0.758
 names-0.736
 yard-0.721
Token went
Token ID1816
Feature activation+5.39e-01

Pos logit contributions
eling+1.981
ited+1.703
ering+1.693
i+1.622
ience+1.571
Neg logit contributions
 name-1.424
-1.259
 to-1.258
 names-1.249
 They-1.223
Token out
Token ID503
Feature activation+6.89e+00

Pos logit contributions
 name+0.015
 names+0.015
Circ+0.014
 Bang+0.014
 Some+0.014
Neg logit contributions
eling-0.038
ie-0.037
i-0.037
ering-0.035
heart-0.033
Token to
Token ID284
Feature activation+1.21e+01

Pos logit contributions
 name+0.143
 names+0.111
 Bang+0.097
 fence+0.092
 They+0.092
Neg logit contributions
eling-0.617
i-0.567
ie-0.561
ited-0.558
ering-0.557
Token play
Token ID711
Feature activation+1.06e+01

Pos logit contributions
 name+0.552
 names+0.521
 They+0.503
 Some+0.502
 Bang+0.497
Neg logit contributions
eling-0.743
i-0.636
ie-0.633
ited-0.591
heart-0.574
Token in
Token ID287
Feature activation+2.16e+00

Pos logit contributions
 name+0.415
 names+0.363
 They+0.338
Circ+0.334
+0.333
Neg logit contributions
eling-0.761
i-0.668
ited-0.667
ering-0.662
ie-0.655
Token Bun
Token ID28515
Feature activation-9.00e+00

Pos logit contributions
eling+0.664
ie+0.637
i+0.634
ited+0.618
heart+0.614
Neg logit contributions
 name-0.225
 names-0.214
 fence-0.202
Circ-0.197
 hills-0.170
TokenB
Token ID33
Feature activation-7.66e+00

Pos logit contributions
eling+0.575
ited+0.495
ering+0.472
i+0.452
ience+0.449
Neg logit contributions
 name-0.419
-0.417
 to-0.399
 They-0.378
 across-0.371
Tokenun
Token ID403
Feature activation-1.89e-01

Pos logit contributions
eling+0.509
ited+0.445
i+0.442
ering+0.435
ie+0.427
Neg logit contributions
 name-0.346
 names-0.306
-0.291
 They-0.289
 to-0.286
Token and
Token ID290
Feature activation-5.76e+00

Pos logit contributions
eling+0.012
ited+0.010
i+0.010
ering+0.010
heart+0.009
Neg logit contributions
-0.009
 name-0.009
 to-0.008
 They-0.008
 across-0.008
Token Tweet
Token ID18752
Feature activation-8.28e-01

Pos logit contributions
eling+0.395
ited+0.349
ering+0.332
i+0.324
heart+0.319
Neg logit contributions
 name-0.260
 to-0.237
-0.233
 They-0.231
 names-0.228
Tokenie
Token ID494
Feature activation-3.00e+01

Pos logit contributions
eling+0.029
ited+0.022
ering+0.021
ience+0.020
i+0.019
Neg logit contributions
 name-0.061
-0.060
 to-0.059
 They-0.058
 names-0.058
Token still
Token ID991
Feature activation-4.79e+00

Pos logit contributions
eling+1.933
ited+1.729
ering+1.621
heart+1.585
i+1.557
Neg logit contributions
 name-1.462
 to-1.361
-1.359
 They-1.340
 names-1.305
Token weren
Token ID6304
Feature activation-1.39e+01

Pos logit contributions
eling+0.298
ited+0.259
ering+0.250
i+0.248
ience+0.242
Neg logit contributions
 name-0.243
 to-0.215
-0.215
 names-0.212
 They-0.211
Token't
Token ID470
Feature activation-1.88e+00

Pos logit contributions
eling+1.000
ited+0.872
i+0.869
ering+0.856
ie+0.854
Neg logit contributions
 name-0.562
 names-0.498
 fence-0.454
Circ-0.448
 They-0.444
Token talking
Token ID3375
Feature activation-6.09e+00

Pos logit contributions
eling+0.121
ited+0.107
ering+0.102
i+0.102
ience+0.101
Neg logit contributions
 name-0.088
 to-0.082
-0.080
 They-0.077
 names-0.077
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
eling+0.522
i+0.474
ering+0.472
ited+0.467
ie+0.467
Neg logit contributions
 name-0.146
 names-0.112
Circ-0.102
 fence-0.102
-0.101
Token ot
Token ID30972
Feature activation-8.31e+00

Pos logit contributions
eling+0.135
ited+0.120
i+0.118
ering+0.117
ie+0.114
Neg logit contributions
 name-0.071
 names-0.060
-0.058
 They-0.057
 to-0.057
Tokenter
Token ID353
Feature activation-1.25e+01

Pos logit contributions
eling+0.338
ited+0.267
i+0.263
ering+0.256
ie+0.247
Neg logit contributions
 name-0.591
 names-0.547
-0.532
 They-0.530
 to-0.527
Token named
Token ID3706
Feature activation-9.92e+00

Pos logit contributions
eling+0.794
ited+0.691
i+0.669
ering+0.666
ie+0.642
Neg logit contributions
 name-0.604
 names-0.531
-0.524
 to-0.517
 They-0.515
Token O
Token ID440
Feature activation-5.11e+00

Pos logit contributions
eling+0.572
i+0.492
ited+0.475
ering+0.473
ie+0.471
Neg logit contributions
 name-0.532
 names-0.481
Circ-0.460
 fence-0.455
 to-0.451
Tokenll
Token ID297
Feature activation-1.25e+01

Pos logit contributions
eling+0.236
ited+0.193
i+0.186
ering+0.185
ie+0.175
Neg logit contributions
 name-0.335
 names-0.305
-0.304
 They-0.298
 to-0.296
Tokenie
Token ID494
Feature activation-3.00e+01

Pos logit contributions
eling+0.193
ited+0.091
ering+0.072
i+0.071
ience+0.055
Neg logit contributions
 name-1.207
 names-1.135
-1.122
 They-1.118
 to-1.110
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
eling+2.209
ited+1.954
i+1.937
ering+1.913
ie+1.877
Neg logit contributions
 name-1.145
 names-0.984
-0.933
 They-0.923
 to-0.913
Token O
Token ID440
Feature activation-4.50e+00

Pos logit contributions
 name+0.053
 names+0.050
Circ+0.047
 fence+0.046
 across+0.045
Neg logit contributions
eling-0.088
i-0.084
heart-0.082
ie-0.081
ited-0.075
Tokenll
Token ID297
Feature activation-1.18e+01

Pos logit contributions
eling+0.219
ited+0.184
ience+0.178
ering+0.175
heart+0.173
Neg logit contributions
 name-0.255
-0.250
 They-0.224
 names-0.221
 to-0.219
Tokenie
Token ID494
Feature activation-2.92e+01

Pos logit contributions
eling+0.192
ited+0.102
ering+0.080
ience+0.079
i+0.061
Neg logit contributions
 name-1.128
 names-1.055
-1.051
 They-1.047
 to-1.033
Token lived
Token ID5615
Feature activation-6.51e+00

Pos logit contributions
eling+2.057
ited+1.839
ering+1.720
heart+1.646
i+1.635
Neg logit contributions
 name-1.304
-1.222
 to-1.210
 They-1.161
 names-1.067
Token he
Token ID339
Feature activation-7.41e+00

Pos logit contributions
eling+0.804
ited+0.702
i+0.690
ering+0.683
ie+0.663
Neg logit contributions
 name-0.591
 names-0.524
-0.504
 to-0.500
 They-0.500
Token lost
Token ID2626
Feature activation-4.42e-01

Pos logit contributions
eling+0.468
ited+0.418
ering+0.411
i+0.400
heart+0.390
Neg logit contributions
 name-0.366
-0.320
 names-0.319
 to-0.315
 They-0.307
Token his
Token ID465
Feature activation-7.53e+00

Pos logit contributions
eling+0.030
i+0.026
ie+0.026
ering+0.026
ited+0.026
Neg logit contributions
 name-0.019
 names-0.017
Circ-0.016
 Bang-0.016
 They-0.016
Token friend
Token ID1545
Feature activation+2.99e+00

Pos logit contributions
eling+0.545
i+0.491
ie+0.482
ering+0.479
ited+0.479
Neg logit contributions
 name-0.264
Circ-0.241
 They-0.233
 names-0.233
 Bang-0.226
Token Brown
Token ID4373
Feature activation+5.20e-01

Pos logit contributions
 name+0.109
 names+0.093
+0.089
 They+0.087
 to+0.087
Neg logit contributions
eling-0.225
ited-0.200
i-0.197
ering-0.195
ie-0.191
Tokenie
Token ID494
Feature activation-2.99e+01

Pos logit contributions
 name+0.042
 to+0.041
+0.040
 names+0.040
 fence+0.040
Neg logit contributions
eling-0.013
ited-0.010
ering-0.008
i-0.007
ience-0.007
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
eling+2.216
i+1.967
ited+1.965
ering+1.933
ie+1.917
Neg logit contributions
 name-1.103
 names-0.949
 They-0.885
Circ-0.885
-0.881
Token<|endoftext|>
Token ID50256
Feature activation+6.59e+00

Pos logit contributions
 name+0.038
 names+0.034
Circ+0.031
 across+0.030
 fence+0.030
Neg logit contributions
eling-0.088
i-0.082
ie-0.080
heart-0.079
ited-0.078
TokenOnce
Token ID7454
Feature activation-6.31e+00

Pos logit contributions
 names+0.190
 name+0.189
 fence+0.178
 road+0.164
Circ+0.158
Neg logit contributions
eling-0.432
ie-0.425
i-0.406
ise-0.397
ited-0.380
Token upon
Token ID2402
Feature activation-3.27e-01

Pos logit contributions
eling+0.458
ited+0.405
i+0.401
ering+0.396
ie+0.390
Neg logit contributions
 name-0.246
 names-0.213
-0.200
 They-0.200
 to-0.197
Token a
Token ID257
Feature activation+3.04e+00

Pos logit contributions
eling+0.022
ited+0.019
ie+0.019
i+0.018
ise+0.018
Neg logit contributions
 name-0.013
 fence-0.012
 names-0.012
Circ-0.011
 road-0.011
Token tried
Token ID3088
Feature activation-6.74e+00

Pos logit contributions
eling+0.143
ited+0.124
ering+0.120
i+0.117
heart+0.114
Neg logit contributions
 name-0.130
-0.117
 to-0.116
 They-0.113
 names-0.113
Token to
Token ID284
Feature activation+1.15e+01

Pos logit contributions
eling+0.637
i+0.583
ering+0.580
ited+0.572
ie+0.567
Neg logit contributions
 name-0.105
 names-0.070
Circ-0.056
 fence-0.054
 They-0.054
Token scare
Token ID19437
Feature activation+4.42e+00

Pos logit contributions
 name+0.441
 names+0.369
+0.353
 to+0.349
 They+0.342
Neg logit contributions
eling-0.851
ited-0.767
i-0.752
ering-0.749
ience-0.734
Token the
Token ID262
Feature activation-5.09e+00

Pos logit contributions
 name+0.200
 names+0.176
+0.170
 They+0.169
 to+0.168
Neg logit contributions
eling-0.297
ited-0.261
i-0.254
ering-0.253
ie-0.245
Token bird
Token ID6512
Feature activation+4.75e+00

Pos logit contributions
eling+0.317
i+0.278
ie+0.269
ering+0.266
ited+0.264
Neg logit contributions
 name-0.236
 They-0.224
 names-0.218
-0.217
Circ-0.214
Tokenie
Token ID494
Feature activation-2.99e+01

Pos logit contributions
 name+0.204
 to+0.181
 names+0.176
+0.175
 across+0.171
Neg logit contributions
eling-0.329
ited-0.289
i-0.281
ering-0.276
ience-0.272
Token away
Token ID1497
Feature activation-2.39e+00

Pos logit contributions
eling+2.140
ited+1.887
i+1.865
ering+1.841
ie+1.801
Neg logit contributions
 name-1.206
 names-1.043
-0.997
 They-0.984
 to-0.984
Token,
Token ID11
Feature activation-1.33e+01

Pos logit contributions
eling+0.168
ited+0.148
i+0.141
ering+0.139
ience+0.138
Neg logit contributions
 name-0.103
 to-0.094
-0.090
 They-0.088
 names-0.088
Token but
Token ID475
Feature activation-8.05e+00

Pos logit contributions
eling+0.863
ited+0.760
i+0.742
ering+0.727
ie+0.721
Neg logit contributions
 name-0.624
-0.551
 to-0.550
 names-0.544
 They-0.541
Token the
Token ID262
Feature activation-4.13e+00

Pos logit contributions
eling+0.592
ited+0.538
ering+0.501
i+0.495
heart+0.482
Neg logit contributions
 name-0.310
 to-0.309
-0.290
 across-0.275
 They-0.265
Token bird
Token ID6512
Feature activation+5.50e+00

Pos logit contributions
eling+0.273
i+0.243
ie+0.237
ited+0.235
ering+0.231
Neg logit contributions
 name-0.171
 They-0.160
 names-0.159
Circ-0.157
-0.151
Token by
Token ID416
Feature activation-1.94e+00

Pos logit contributions
eling+0.283
i+0.261
ering+0.258
ie+0.255
ited+0.248
Neg logit contributions
 name-0.119
Circ-0.112
 names-0.108
 road-0.100
 fence-0.099
Token and
Token ID290
Feature activation-5.37e+00

Pos logit contributions
eling+0.137
i+0.119
ited+0.119
ering+0.118
ie+0.115
Neg logit contributions
 name-0.079
 names-0.069
Circ-0.064
 They-0.064
 fence-0.063
Token scared
Token ID12008
Feature activation-1.20e+00

Pos logit contributions
eling+0.322
ited+0.283
i+0.267
ering+0.266
heart+0.257
Neg logit contributions
 name-0.283
 to-0.260
-0.257
 They-0.250
 names-0.248
Token the
Token ID262
Feature activation-5.25e+00

Pos logit contributions
eling+0.084
ited+0.075
i+0.072
ering+0.072
ience+0.070
Neg logit contributions
 name-0.050
 names-0.044
 to-0.044
-0.043
 They-0.043
Token bird
Token ID6512
Feature activation+4.57e+00

Pos logit contributions
eling+0.361
i+0.314
ited+0.311
ering+0.309
ie+0.309
Neg logit contributions
 name-0.200
 They-0.182
 names-0.180
Circ-0.179
-0.178
Tokenie
Token ID494
Feature activation-2.98e+01

Pos logit contributions
 name+0.152
 to+0.144
+0.136
 across+0.127
 They+0.127
Neg logit contributions
eling-0.348
ited-0.315
i-0.301
ering-0.298
ience-0.294
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
eling+2.182
ited+1.950
i+1.891
ering+1.870
heart+1.821
Neg logit contributions
 name-1.136
 to-0.990
-0.967
 names-0.959
 They-0.930
Token The
Token ID383
Feature activation+1.97e+00

Pos logit contributions
 name+0.043
 names+0.039
Circ+0.035
 fence+0.032
 across+0.032
Neg logit contributions
eling-0.119
i-0.110
heart-0.106
ie-0.105
ience-0.105
Token cat
Token ID3797
Feature activation+1.06e+01

Pos logit contributions
 name+0.070
 names+0.061
 They+0.059
Circ+0.057
+0.057
Neg logit contributions
eling-0.147
i-0.130
ited-0.130
ie-0.128
ering-0.127
Token wanted
Token ID2227
Feature activation-2.19e+00

Pos logit contributions
 name+0.473
 to+0.472
+0.463
 fence+0.437
 They+0.436
Neg logit contributions
eling-0.654
ited-0.583
mmm-0.560
ering-0.560
heart-0.552
Token to
Token ID284
Feature activation+1.28e+01

Pos logit contributions
eling+0.195
i+0.181
ie+0.178
ering+0.175
ited+0.173
Neg logit contributions
 name-0.044
 names-0.029
 fence-0.026
 They-0.025
Circ-0.024
Token together
Token ID1978
Feature activation-5.30e+00

Pos logit contributions
eling+0.949
ited+0.844
i+0.812
ering+0.802
ie+0.778
Neg logit contributions
 name-0.609
 to-0.537
 names-0.533
-0.531
 They-0.522
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
eling+0.356
ited+0.314
i+0.305
ering+0.303
ie+0.292
Neg logit contributions
 name-0.236
 names-0.208
 to-0.207
-0.205
 They-0.201
Token<|endoftext|>
Token ID50256
Feature activation+6.11e+00

Pos logit contributions
 name+0.031
 names+0.027
Circ+0.025
 fence+0.024
 across+0.023
Neg logit contributions
eling-0.101
i-0.094
heart-0.092
ie-0.091
ited-0.091
TokenT
Token ID51
Feature activation-7.78e+00

Pos logit contributions
 names+0.175
 name+0.173
 fence+0.159
 road+0.147
Circ+0.147
Neg logit contributions
eling-0.417
ie-0.399
i-0.387
ise-0.373
ited-0.368
Tokenrix
Token ID8609
Feature activation-8.24e+00

Pos logit contributions
eling+0.328
ited+0.261
i+0.259
ering+0.251
ie+0.244
Neg logit contributions
 name-0.542
 names-0.501
-0.486
 They-0.484
 to-0.482
Tokenie
Token ID494
Feature activation-2.97e+01

Pos logit contributions
eling+0.176
ited+0.128
ience+0.101
ering+0.100
mmm+0.089
Neg logit contributions
 name-0.704
 to-0.691
-0.684
 names-0.662
 They-0.659
Token was
Token ID373
Feature activation-3.74e+00

Pos logit contributions
eling+2.046
ited+1.847
ering+1.799
heart+1.738
i+1.715
Neg logit contributions
 name-1.251
-1.222
 to-1.163
 They-1.109
 fence-1.068
Token very
Token ID845
Feature activation-7.69e+00

Pos logit contributions
eling+0.263
ited+0.231
i+0.230
ering+0.227
ie+0.222
Neg logit contributions
 name-0.154
 names-0.137
 They-0.126
Circ-0.126
 fence-0.125
Token proud
Token ID6613
Feature activation-1.02e+01

Pos logit contributions
eling+0.534
ited+0.473
i+0.466
ering+0.462
ience+0.458
Neg logit contributions
 name-0.336
-0.282
 to-0.274
 names-0.272
 They-0.272
Token of
Token ID286
Feature activation+3.73e+00

Pos logit contributions
eling+0.707
i+0.630
ited+0.626
ie+0.625
ering+0.615
Neg logit contributions
 name-0.417
 names-0.364
Circ-0.346
 Bang-0.338
 They-0.338
Token her
Token ID607
Feature activation-1.15e+01

Pos logit contributions
 name+0.213
 names+0.193
 fence+0.185
Circ+0.184
 They+0.183
Neg logit contributions
eling-0.208
i-0.174
ited-0.172
ering-0.170
ie-0.167
Token there
Token ID612
Feature activation+3.37e+00

Pos logit contributions
eling+0.945
i+0.823
ited+0.818
ering+0.808
ie+0.797
Neg logit contributions
 name-0.617
 names-0.549
Circ-0.518
-0.511
 They-0.510
Token was
Token ID373
Feature activation-4.51e+00

Pos logit contributions
 name+0.141
+0.130
 to+0.123
 They+0.122
 names+0.121
Neg logit contributions
eling-0.244
ited-0.216
ering-0.208
i-0.208
ience-0.204
Token a
Token ID257
Feature activation+1.98e+00

Pos logit contributions
eling+0.377
ited+0.339
i+0.336
ering+0.332
ie+0.328
Neg logit contributions
 name-0.126
 names-0.102
-0.093
 They-0.092
 fence-0.090
Token little
Token ID1310
Feature activation-8.40e-01

Pos logit contributions
 name+0.069
 names+0.060
 They+0.055
+0.055
 to+0.054
Neg logit contributions
eling-0.150
ited-0.133
i-0.132
ering-0.130
ie-0.129
Token bird
Token ID6512
Feature activation+4.42e+00

Pos logit contributions
eling+0.065
ited+0.057
ering+0.057
i+0.057
ie+0.054
Neg logit contributions
 name-0.030
 names-0.025
 fence-0.024
-0.024
 to-0.024
Tokenie
Token ID494
Feature activation-2.97e+01

Pos logit contributions
 name+0.218
+0.202
 to+0.201
 They+0.190
 fence+0.188
Neg logit contributions
eling-0.274
ited-0.236
ering-0.225
ience-0.223
i-0.222
Token named
Token ID3706
Feature activation-9.32e+00

Pos logit contributions
eling+2.145
ited+1.899
ering+1.838
i+1.834
ience+1.765
Neg logit contributions
 name-1.190
 names-1.006
-1.005
 to-0.993
 They-0.969
Token Blue
Token ID4518
Feature activation+6.57e-01

Pos logit contributions
eling+0.507
i+0.459
ie+0.449
heart+0.417
ering+0.413
Neg logit contributions
 name-0.498
 names-0.469
Circ-0.452
 fence-0.433
 road-0.430
Token.
Token ID13
Feature activation+1.66e+00

Pos logit contributions
 name+0.027
+0.024
 names+0.023
 to+0.023
 They+0.023
Neg logit contributions
eling-0.047
ited-0.041
ering-0.040
i-0.039
ience-0.038
Token Blue
Token ID4518
Feature activation+1.03e+00

Pos logit contributions
 name+0.078
 names+0.074
Circ+0.070
 corner+0.067
 fence+0.067
Neg logit contributions
eling-0.093
i-0.082
ie-0.082
ience-0.081
ited-0.079
Token loved
Token ID6151
Feature activation-7.45e+00

Pos logit contributions
+0.040
 name+0.039
 They+0.035
 to+0.035
 names+0.031
Neg logit contributions
eling-0.078
ited-0.068
ering-0.067
heart-0.064
ieved-0.063
Token a
Token ID257
Feature activation+1.87e+00

Pos logit contributions
eling+0.206
ited+0.180
i+0.180
ering+0.177
ie+0.174
Neg logit contributions
 name-0.117
 names-0.102
-0.096
 They-0.095
 fence-0.095
Token band
Token ID4097
Feature activation-6.82e-01

Pos logit contributions
 name+0.068
 They+0.064
 names+0.064
+0.060
Circ+0.059
Neg logit contributions
eling-0.129
ited-0.116
ering-0.112
i-0.112
ie-0.110
Tokenage
Token ID496
Feature activation+1.10e+01

Pos logit contributions
eling+0.043
ited+0.037
i+0.037
ering+0.036
ie+0.036
Neg logit contributions
 name-0.033
 names-0.029
-0.028
 They-0.028
 to-0.028
Token on
Token ID319
Feature activation+5.08e+00

Pos logit contributions
 name+0.372
 names+0.310
 to+0.310
+0.303
 They+0.294
Neg logit contributions
eling-0.863
ited-0.767
i-0.755
ering-0.743
ie-0.727
Token Tweet
Token ID18752
Feature activation+2.39e-02

Pos logit contributions
 name+0.229
 names+0.205
Circ+0.192
 fence+0.189
 They+0.185
Neg logit contributions
eling-0.329
i-0.286
ited-0.279
ering-0.277
ie-0.275
Tokenie
Token ID494
Feature activation-2.96e+01

Pos logit contributions
 name+0.001
 names+0.001
+0.001
 to+0.001
 fence+0.001
Neg logit contributions
eling-0.002
ited-0.001
i-0.001
ering-0.001
ience-0.001
Token's
Token ID338
Feature activation-8.13e+00

Pos logit contributions
eling+2.028
i+1.785
ited+1.783
ering+1.746
ie+1.726
Neg logit contributions
 name-1.246
 names-1.105
Circ-1.044
 They-1.041
 Bang-1.036
Token wing
Token ID8539
Feature activation+4.91e+00

Pos logit contributions
eling+0.496
i+0.454
ering+0.433
ie+0.430
ise+0.422
Neg logit contributions
Circ-0.340
 name-0.325
 They-0.314
 names-0.305
 Bang-0.303
Token and
Token ID290
Feature activation-4.07e+00

Pos logit contributions
 name+0.196
 names+0.170
+0.160
 They+0.159
 to+0.156
Neg logit contributions
eling-0.352
ited-0.311
i-0.309
ering-0.305
ie-0.299
Token said
Token ID531
Feature activation-7.41e+00

Pos logit contributions
eling+0.258
ited+0.231
ering+0.220
i+0.213
heart+0.209
Neg logit contributions
 name-0.201
 to-0.185
-0.177
 They-0.176
 names-0.175
Token,
Token ID11
Feature activation-1.30e+01

Pos logit contributions
eling+0.468
ited+0.419
i+0.375
ering+0.369
ie+0.361
Neg logit contributions
 name-0.393
 to-0.375
-0.360
 names-0.345
 They-0.345
TokenYou
Token ID1639
Feature activation-8.44e+00

Pos logit contributions
eling+0.997
ited+0.898
ering+0.873
i+0.854
ience+0.827
Neg logit contributions
 name-0.484
 names-0.403
-0.386
 They-0.381
 to-0.375
Token're
Token ID821
Feature activation-9.04e+00

Pos logit contributions
eling+0.575
ited+0.507
ering+0.503
i+0.488
ie+0.473
Neg logit contributions
 name-0.399
 names-0.349
-0.340
 They-0.330
 to-0.327
Token welcome
Token ID7062
Feature activation-4.07e-01

Pos logit contributions
eling+0.646
ited+0.595
i+0.569
ering+0.568
ience+0.562
Neg logit contributions
 name-0.382
 names-0.321
-0.319
 to-0.308
 They-0.306
Token,
Token ID11
Feature activation-1.45e+01

Pos logit contributions
eling+0.025
i+0.021
ited+0.021
ering+0.021
ie+0.020
Neg logit contributions
 name-0.020
 names-0.018
-0.018
 They-0.018
 fence-0.017
Token sweet
Token ID6029
Feature activation-9.78e+00

Pos logit contributions
eling+0.980
ited+0.857
i+0.852
ering+0.837
ie+0.822
Neg logit contributions
 name-0.628
 names-0.552
-0.526
 They-0.522
 to-0.517
Tokenie
Token ID494
Feature activation-2.96e+01

Pos logit contributions
eling+0.283
ited+0.187
ering+0.180
ience+0.152
i+0.149
Neg logit contributions
 name-0.840
 names-0.779
 to-0.771
-0.768
 They-0.765
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
eling+2.068
ited+1.812
ering+1.766
i+1.723
ience+1.649
Neg logit contributions
 name-1.308
 to-1.128
-1.122
 names-1.101
 They-1.086
Token Do
Token ID2141
Feature activation-1.51e+00

Pos logit contributions
 name+0.053
 names+0.048
Circ+0.046
 fence+0.045
 road+0.045
Neg logit contributions
eling-0.100
i-0.091
ited-0.088
heart-0.088
ience-0.087
Token you
Token ID345
Feature activation-8.39e+00

Pos logit contributions
eling+0.098
ited+0.095
ering+0.085
i+0.080
heart+0.079
Neg logit contributions
 name-0.075
-0.071
 names-0.068
 to-0.066
 They-0.065
Token want
Token ID765
Feature activation-4.98e-01

Pos logit contributions
eling+0.584
ited+0.508
ering+0.505
i+0.500
ience+0.485
Neg logit contributions
 name-0.385
 names-0.329
-0.317
 to-0.314
 They-0.305
Token to
Token ID284
Feature activation+1.28e+01

Pos logit contributions
eling+0.043
ie+0.040
i+0.040
ering+0.039
ited+0.038
Neg logit contributions
 name-0.009
 fence-0.007
 They-0.006
 names-0.006
 road-0.006
Token good
Token ID922
Feature activation-8.21e+00

Pos logit contributions
 name+0.038
 names+0.034
 They+0.033
Circ+0.032
+0.032
Neg logit contributions
eling-0.073
ited-0.066
i-0.064
ering-0.064
ie-0.063
Token friend
Token ID1545
Feature activation+1.65e+00

Pos logit contributions
eling+0.569
i+0.516
ited+0.514
ie+0.512
ering+0.499
Neg logit contributions
 name-0.297
 names-0.268
Circ-0.264
-0.261
 They-0.260
Token,"
Token ID553
Feature activation-1.04e+01

Pos logit contributions
 name+0.063
 names+0.053
 to+0.053
+0.053
 They+0.051
Neg logit contributions
eling-0.123
ited-0.109
ering-0.107
i-0.106
ience-0.102
Token said
Token ID531
Feature activation-7.96e+00

Pos logit contributions
eling+0.680
ited+0.590
ering+0.574
i+0.565
ience+0.546
Neg logit contributions
 name-0.504
-0.461
 They-0.449
 to-0.448
 names-0.440
Token Tweet
Token ID18752
Feature activation-2.47e-01

Pos logit contributions
eling+0.531
ited+0.464
i+0.458
ering+0.452
ie+0.442
Neg logit contributions
 name-0.359
 names-0.315
-0.303
 They-0.300
 to-0.298
Tokenie
Token ID494
Feature activation-2.96e+01

Pos logit contributions
eling+0.016
ited+0.014
ering+0.013
ience+0.013
i+0.013
Neg logit contributions
 name-0.012
-0.011
 to-0.011
 names-0.010
 They-0.010
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
eling+2.438
ited+2.183
ering+2.114
i+2.087
ience+2.040
Neg logit contributions
 name-0.955
-0.786
 to-0.772
 names-0.760
 They-0.721
Token "
Token ID366
Feature activation+6.17e+00

Pos logit contributions
 name+0.034
 names+0.032
Circ+0.032
 road+0.032
 corner+0.030
Neg logit contributions
eling-0.104
i-0.104
ie-0.098
ience-0.097
ise-0.097
TokenAnd
Token ID1870
Feature activation-1.06e+01

Pos logit contributions
 name+0.261
 names+0.235
 fence+0.224
 yard+0.221
 road+0.216
Neg logit contributions
eling-0.386
ited-0.364
i-0.338
heart-0.330
ieved-0.327
Token one
Token ID530
Feature activation-3.31e+00

Pos logit contributions
eling+0.663
ited+0.585
i+0.561
ering+0.555
ie+0.542
Neg logit contributions
 name-0.544
 names-0.479
-0.468
 to-0.466
 They-0.460
Token day
Token ID1110
Feature activation+1.31e+00

Pos logit contributions
eling+0.249
ited+0.218
i+0.217
ering+0.215
ie+0.206
Neg logit contributions
 name-0.127
 names-0.108
-0.104
 to-0.103
 They-0.101
Token her
Token ID607
Feature activation-1.29e+01

Pos logit contributions
 name+0.058
 names+0.056
Circ+0.056
 road+0.051
 Bang+0.048
Neg logit contributions
eling-0.116
i-0.112
ie-0.108
heart-0.105
ering-0.103
Token charming
Token ID23332
Feature activation-8.43e+00

Pos logit contributions
eling+0.931
i+0.832
ited+0.817
ering+0.809
ie+0.808
Neg logit contributions
 name-0.470
Circ-0.418
 names-0.410
 They-0.398
 Bang-0.398
Token doll
Token ID3654
Feature activation-1.34e+00

Pos logit contributions
eling+0.596
i+0.530
ited+0.525
ie+0.519
ering+0.516
Neg logit contributions
 name-0.325
 names-0.283
 They-0.273
-0.273
Circ-0.273
Token,
Token ID11
Feature activation-1.42e+01

Pos logit contributions
eling+0.100
ited+0.088
ering+0.086
i+0.085
ie+0.082
Neg logit contributions
 name-0.052
 names-0.044
 to-0.043
-0.042
 They-0.042
Token Ros
Token ID10018
Feature activation-3.96e+00

Pos logit contributions
eling+0.926
ited+0.807
i+0.795
ering+0.786
ie+0.766
Neg logit contributions
 name-0.664
 names-0.585
-0.565
 They-0.560
 to-0.557
Tokenie
Token ID494
Feature activation-2.96e+01

Pos logit contributions
eling+0.102
ited+0.078
ering+0.064
mmm+0.055
ieved+0.055
Neg logit contributions
 name-0.316
-0.303
 to-0.302
 names-0.299
 They-0.286
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
eling+1.988
ited+1.774
ering+1.693
i+1.627
heart+1.593
Neg logit contributions
 name-1.406
-1.235
 to-1.216
 names-1.195
 They-1.190
Token One
Token ID1881
Feature activation+3.59e-01

Pos logit contributions
 name+0.045
 names+0.041
Circ+0.039
 fence+0.039
 road+0.037
Neg logit contributions
eling-0.113
i-0.108
ie-0.106
heart-0.105
ience-0.102
Token day
Token ID1110
Feature activation+1.33e+00

Pos logit contributions
 name+0.009
 to+0.007
+0.007
 corner+0.007
 names+0.007
Neg logit contributions
eling-0.032
ering-0.028
ited-0.028
i-0.028
ience-0.027
Token,
Token ID11
Feature activation-1.30e+01

Pos logit contributions
 name+0.075
+0.070
 to+0.070
 They+0.067
 names+0.065
Neg logit contributions
eling-0.083
ited-0.070
ering-0.066
i-0.065
ience-0.061
Token Lily
Token ID20037
Feature activation-2.00e+00

Pos logit contributions
eling+0.828
ited+0.718
i+0.702
ering+0.696
ie+0.673
Neg logit contributions
 name-0.640
 names-0.568
-0.553
 They-0.547
 to-0.545
Token in
Token ID287
Feature activation+3.23e+00

Pos logit contributions
 name+0.091
 names+0.078
+0.074
 They+0.073
 to+0.073
Neg logit contributions
eling-0.178
ited-0.158
i-0.156
ering-0.154
ie-0.151
Token two
Token ID734
Feature activation+5.83e+00

Pos logit contributions
 name+0.117
 names+0.099
Circ+0.092
 They+0.091
+0.089
Neg logit contributions
eling-0.241
ited-0.212
i-0.211
ering-0.207
ie-0.204
Token!
Token ID0
Feature activation-9.17e+00

Pos logit contributions
 name+0.198
 names+0.170
 to+0.164
+0.163
 fence+0.159
Neg logit contributions
eling-0.448
ited-0.399
i-0.396
ering-0.390
ie-0.382
Token
Token ID220
Feature activation+2.30e+00

Pos logit contributions
eling+0.621
heart+0.579
i+0.568
 chuckled+0.562
mmm+0.559
Neg logit contributions
 name-0.267
 names-0.255
 fence-0.209
 road-0.206
Circ-0.204
Token
Token ID198
Feature activation+9.30e+00

Pos logit contributions
Circ+0.053
 road+0.053
 name+0.053
 names+0.051
 fence+0.049
Neg logit contributions
eling-0.162
ering-0.155
ie-0.150
i-0.149
ise-0.145
Token
Token ID198
Feature activation+9.37e+00

Pos logit contributions
 name+0.326
 names+0.308
 fence+0.265
 road+0.264
 yard+0.240
Neg logit contributions
eling-0.623
heart-0.569
ited-0.566
i-0.559
ise-0.556
TokenThe
Token ID464
Feature activation-4.55e+00

Pos logit contributions
 name+0.457
 names+0.417
 to+0.377
 fence+0.375
 They+0.368
Neg logit contributions
eling-0.577
ited-0.511
i-0.492
ering-0.483
heart-0.482
Token family
Token ID1641
Feature activation+3.52e+00

Pos logit contributions
eling+0.316
ited+0.277
i+0.276
ie+0.271
ering+0.270
Neg logit contributions
 name-0.183
 names-0.162
 They-0.160
-0.158
Circ-0.156
Token was
Token ID373
Feature activation-2.06e+00

Pos logit contributions
 name+0.148
 to+0.142
+0.139
 They+0.136
 across+0.133
Neg logit contributions
eling-0.236
ited-0.208
i-0.200
ering-0.197
ience-0.191
Token very
Token ID845
Feature activation-6.15e+00

Pos logit contributions
eling+0.130
ited+0.112
i+0.111
ering+0.109
ie+0.107
Neg logit contributions
 name-0.101
 names-0.090
-0.087
 They-0.086
 to-0.086
Token surprised
Token ID6655
Feature activation-4.45e+00

Pos logit contributions
eling+0.386
ited+0.341
i+0.335
ie+0.325
ience+0.320
Neg logit contributions
 name-0.311
 to-0.273
-0.269
 names-0.264
 They-0.263
Token tied
Token ID8165
Feature activation-2.36e+00

Pos logit contributions
 name+0.092
 names+0.077
+0.074
 to+0.074
 They+0.073
Neg logit contributions
eling-0.191
ited-0.170
i-0.167
ering-0.165
ie-0.161
Token around
Token ID1088
Feature activation+8.35e+00

Pos logit contributions
eling+0.176
i+0.160
ering+0.157
ited+0.156
ie+0.156
Neg logit contributions
 name-0.081
 names-0.069
Circ-0.065
 They-0.064
-0.064
Token it
Token ID340
Feature activation-3.17e+00

Pos logit contributions
 name+0.393
 names+0.348
+0.335
 They+0.332
 to+0.330
Neg logit contributions
eling-0.542
ited-0.472
i-0.465
ering-0.459
ie-0.448
Token.
Token ID13
Feature activation+2.14e+00

Pos logit contributions
eling+0.224
i+0.198
ited+0.198
ering+0.195
ie+0.192
Neg logit contributions
 name-0.127
 names-0.110
-0.104
 They-0.103
Circ-0.103
Token
Token ID198
Feature activation+8.86e+00

Pos logit contributions
 name+0.066
 names+0.063
Circ+0.054
 fence+0.054
 road+0.052
Neg logit contributions
eling-0.147
heart-0.140
i-0.140
ie-0.135
ited-0.132
Token
Token ID198
Feature activation+8.92e+00

Pos logit contributions
 name+0.195
 names+0.177
 fence+0.137
 road+0.132
 Bang+0.126
Neg logit contributions
eling-0.736
ie-0.686
i-0.670
heart-0.670
ited-0.665
TokenJ
Token ID41
Feature activation-9.24e+00

Pos logit contributions
 name+0.445
 names+0.401
 to+0.371
 fence+0.369
 Bang+0.366
Neg logit contributions
eling-0.551
ited-0.483
i-0.468
ering-0.460
ie-0.454
Tokenill
Token ID359
Feature activation-5.28e+00

Pos logit contributions
eling+0.474
ited+0.395
i+0.389
ering+0.382
ie+0.370
Neg logit contributions
 name-0.560
 names-0.510
-0.495
 They-0.492
 to-0.489
Token looked
Token ID3114
Feature activation-6.61e+00

Pos logit contributions
eling+0.348
ering+0.299
ited+0.293
heart+0.285
i+0.277
Neg logit contributions
 name-0.241
-0.216
 to-0.213
 names-0.210
 They-0.198
Token up
Token ID510
Feature activation+2.23e-01

Pos logit contributions
eling+0.512
ited+0.449
i+0.443
ering+0.433
ise+0.429
Neg logit contributions
 name-0.249
 to-0.213
 names-0.204
 across-0.202
-0.200
Token at
Token ID379
Feature activation-1.20e+00

Pos logit contributions
 name+0.010
 names+0.009
 They+0.008
+0.008
 fence+0.008
Neg logit contributions
eling-0.015
ited-0.013
i-0.013
ering-0.013
ie-0.013
Token gives
Token ID3607
Feature activation-3.51e+00

Pos logit contributions
 name+0.211
 names+0.189
+0.183
 to+0.181
 They+0.181
Neg logit contributions
eling-0.239
ited-0.207
i-0.204
ering-0.202
heart-0.196
Token Lily
Token ID20037
Feature activation-7.45e-01

Pos logit contributions
eling+0.229
ited+0.199
i+0.198
ering+0.194
ie+0.193
Neg logit contributions
 name-0.165
 names-0.145
-0.138
 They-0.138
 fence-0.137
Token the
Token ID262
Feature activation-3.27e+00

Pos logit contributions
eling+0.054
i+0.049
ited+0.048
ie+0.047
ering+0.047
Neg logit contributions
 name-0.027
 names-0.023
 Bang-0.023
 road-0.022
 fence-0.022
Token prize
Token ID11596
Feature activation+3.48e+00

Pos logit contributions
eling+0.218
ited+0.190
i+0.187
ering+0.185
ience+0.183
Neg logit contributions
 They-0.116
 name-0.113
-0.111
Circ-0.110
 Along-0.105
Token.
Token ID13
Feature activation+2.55e+00

Pos logit contributions
 name+0.126
 names+0.107
Circ+0.102
 Bang+0.100
 They+0.099
Neg logit contributions
eling-0.250
i-0.230
ering-0.224
ie-0.223
ited-0.223
Token It
Token ID632
Feature activation+9.22e+00

Pos logit contributions
Circ+0.091
 name+0.089
 names+0.089
 road+0.088
 fence+0.086
Neg logit contributions
eling-0.146
heart-0.133
 reassured-0.132
ience-0.132
i-0.132
Token is
Token ID318
Feature activation-3.29e+00

Pos logit contributions
 name+0.483
 names+0.433
+0.416
Circ+0.415
 fence+0.414
Neg logit contributions
eling-0.535
i-0.456
ited-0.453
ering-0.440
ie-0.439
Token a
Token ID257
Feature activation+4.52e+00

Pos logit contributions
eling+0.254
ited+0.227
i+0.223
ering+0.221
ie+0.217
Neg logit contributions
 name-0.119
 names-0.100
-0.095
 to-0.094
 They-0.093
Token long
Token ID890
Feature activation+7.14e+00

Pos logit contributions
 name+0.163
 names+0.150
 They+0.148
 Some+0.141
Circ+0.140
Neg logit contributions
eling-0.310
ited-0.278
i-0.271
ering-0.268
ie-0.266
Token,
Token ID11
Feature activation-1.19e+01

Pos logit contributions
 name+0.261
 names+0.224
+0.212
 They+0.210
 to+0.206
Neg logit contributions
eling-0.534
ited-0.474
i-0.470
ering-0.464
ie-0.456
Token soft
Token ID2705
Feature activation+4.49e+00

Pos logit contributions
eling+0.888
ited+0.794
i+0.784
ering+0.776
ie+0.759
Neg logit contributions
 name-0.422
 names-0.364
-0.343
 They-0.340
Circ-0.337
Token slide
Token ID10649
Feature activation+9.63e+00

Pos logit contributions
eling+0.166
i+0.144
ie+0.141
ited+0.138
ering+0.135
Neg logit contributions
 name-0.102
 names-0.092
 They-0.091
Circ-0.091
 Along-0.090
Token and
Token ID290
Feature activation-2.63e+00

Pos logit contributions
 name+0.405
 names+0.353
+0.345
 to+0.341
 They+0.339
Neg logit contributions
eling-0.676
ited-0.592
i-0.584
ering-0.577
ie-0.562
Token climbed
Token ID19952
Feature activation+2.37e+00

Pos logit contributions
eling+0.158
ited+0.135
ering+0.132
i+0.132
ie+0.127
Neg logit contributions
 name-0.139
 names-0.124
-0.123
 to-0.122
 They-0.121
Token up
Token ID510
Feature activation+1.09e+00

Pos logit contributions
 name+0.109
 names+0.099
Circ+0.097
 Bang+0.090
 Some+0.090
Neg logit contributions
eling-0.147
i-0.131
ie-0.125
ering-0.125
ited-0.122
Token the
Token ID262
Feature activation-2.80e+00

Pos logit contributions
 name+0.053
 names+0.047
Circ+0.044
 Bang+0.043
 fence+0.042
Neg logit contributions
eling-0.068
i-0.059
ited-0.058
ie-0.057
ering-0.057
Token steps
Token ID4831
Feature activation+1.10e+01

Pos logit contributions
eling+0.185
ie+0.160
ering+0.158
ited+0.158
i+0.156
Neg logit contributions
 name-0.107
 names-0.099
Circ-0.094
 Some-0.084
 They-0.084
Token.
Token ID13
Feature activation+3.13e+00

Pos logit contributions
 name+0.400
 names+0.342
Circ+0.312
 They+0.306
+0.305
Neg logit contributions
eling-0.824
i-0.741
ited-0.737
ering-0.731
ie-0.719
Token At
Token ID1629
Feature activation+5.23e+00

Pos logit contributions
Circ+0.108
 road+0.106
 fuel+0.101
tr+0.097
Ar+0.097
Neg logit contributions
eling-0.179
i-0.174
heart-0.163
ie-0.158
ited-0.154
Token the
Token ID262
Feature activation-2.79e+00

Pos logit contributions
 name+0.223
 names+0.194
+0.190
 to+0.186
 They+0.185
Neg logit contributions
eling-0.363
ited-0.319
i-0.315
ering-0.312
ie-0.304
Token top
Token ID1353
Feature activation+6.06e+00

Pos logit contributions
eling+0.204
ited+0.177
i+0.172
ie+0.171
ering+0.165
Neg logit contributions
 name-0.095
Circ-0.086
 names-0.086
 They-0.079
 Bang-0.076
Token,
Token ID11
Feature activation-1.16e+01

Pos logit contributions
 name+0.253
+0.223
 to+0.220
 names+0.218
 They+0.216
Neg logit contributions
eling-0.426
ited-0.373
i-0.368
ering-0.364
ience-0.355
Token.
Token ID13
Feature activation+2.75e+00

Pos logit contributions
eling+0.315
ited+0.267
ering+0.257
i+0.252
ience+0.248
Neg logit contributions
 name-0.297
-0.270
 to-0.269
 names-0.262
 They-0.261
Token She
Token ID1375
Feature activation+2.51e+00

Pos logit contributions
 name+0.100
Circ+0.095
 names+0.094
 fence+0.090
 Bang+0.089
Neg logit contributions
i-0.149
ie-0.148
eling-0.147
heart-0.146
ited-0.141
Token had
Token ID550
Feature activation-2.43e+00

Pos logit contributions
 name+0.121
+0.108
 names+0.104
 to+0.104
 They+0.103
Neg logit contributions
eling-0.167
ited-0.148
ering-0.144
i-0.142
heart-0.140
Token always
Token ID1464
Feature activation-7.38e+00

Pos logit contributions
eling+0.142
i+0.132
ie+0.122
ise+0.118
ited+0.118
Neg logit contributions
 name-0.097
Circ-0.090
 Bang-0.086
 fence-0.085
iffs-0.083
Token wanted
Token ID2227
Feature activation-1.22e+00

Pos logit contributions
eling+0.473
ience+0.410
ited+0.405
ering+0.402
i+0.394
Neg logit contributions
 name-0.370
 to-0.331
 names-0.325
-0.320
 They-0.315
Token to
Token ID284
Feature activation+1.38e+01

Pos logit contributions
eling+0.104
i+0.097
ie+0.096
ering+0.095
heart+0.091
Neg logit contributions
 name-0.026
 They-0.017
 fence-0.016
 Bang-0.016
 names-0.016
Token go
Token ID467
Feature activation+6.54e+00

Pos logit contributions
 name+0.514
Circ+0.506
 Some+0.496
 across+0.490
 fence+0.489
Neg logit contributions
eling-0.908
i-0.791
ie-0.780
ited-0.753
ise-0.734
Token on
Token ID319
Feature activation+7.19e+00

Pos logit contributions
 name+0.240
 Bang+0.226
 names+0.222
Circ+0.218
 fence+0.209
Neg logit contributions
eling-0.438
ie-0.418
i-0.411
ering-0.397
ited-0.368
Token an
Token ID281
Feature activation-1.23e-01

Pos logit contributions
 name+0.279
 names+0.233
 They+0.223
 Bang+0.220
 fence+0.219
Neg logit contributions
eling-0.528
i-0.462
ited-0.458
ering-0.450
ie-0.447
Token adventure
Token ID8855
Feature activation+2.46e+00

Pos logit contributions
eling+0.008
ited+0.007
i+0.007
ering+0.007
ience+0.007
Neg logit contributions
 name-0.005
 names-0.005
 to-0.005
 fence-0.005
 road-0.005
Token.
Token ID13
Feature activation+2.90e+00

Pos logit contributions
 name+0.092
 names+0.079
 Bang+0.073
Circ+0.072
 They+0.071
Neg logit contributions
eling-0.182
i-0.164
ering-0.161
ited-0.161
ie-0.161
Token.
Token ID13
Feature activation+3.29e+00

Pos logit contributions
eling+0.108
i+0.096
ited+0.095
ering+0.095
ie+0.094
Neg logit contributions
 name-0.062
 names-0.053
Circ-0.051
 They-0.050
-0.050
Token It
Token ID632
Feature activation+9.57e+00

Pos logit contributions
 name+0.110
 fence+0.110
 road+0.109
 across+0.107
 corner+0.107
Neg logit contributions
eling-0.204
i-0.191
ience-0.184
heart-0.184
ise-0.183
Token's
Token ID338
Feature activation-5.73e+00

Pos logit contributions
 name+0.392
 names+0.339
+0.326
 They+0.323
 to+0.323
Neg logit contributions
eling-0.687
ited-0.619
ering-0.604
i-0.598
heart-0.585
Token fun
Token ID1257
Feature activation+2.78e+00

Pos logit contributions
eling+0.400
ited+0.353
i+0.347
ering+0.344
ie+0.336
Neg logit contributions
 name-0.245
 names-0.213
-0.203
 They-0.201
 to-0.200
Token.
Token ID13
Feature activation+3.38e+00

Pos logit contributions
 name+0.108
 names+0.093
+0.088
 They+0.087
 to+0.087
Neg logit contributions
eling-0.204
ited-0.180
i-0.178
ering-0.176
ie-0.173
Token Come
Token ID7911
Feature activation+5.26e+00

Pos logit contributions
 name+0.118
 fence+0.116
 road+0.115
Circ+0.111
 across+0.111
Neg logit contributions
eling-0.210
i-0.193
ise-0.188
heart-0.187
ience-0.187
Token on
Token ID319
Feature activation+7.58e+00

Pos logit contributions
 name+0.216
 names+0.211
Circ+0.201
 fence+0.197
 Some+0.197
Neg logit contributions
eling-0.324
i-0.299
ie-0.284
ering-0.282
ited-0.282
Token,
Token ID11
Feature activation-1.14e+01

Pos logit contributions
 name+0.437
 to+0.404
+0.399
 They+0.386
 names+0.382
Neg logit contributions
eling-0.440
ited-0.379
ering-0.352
i-0.344
ience-0.326
Token I
Token ID314
Feature activation+1.49e+00

Pos logit contributions
eling+0.719
ited+0.622
i+0.612
ering+0.607
ie+0.588
Neg logit contributions
 name-0.571
 names-0.504
-0.488
 They-0.483
 to-0.482
Token promise
Token ID6991
Feature activation-8.81e+00

Pos logit contributions
 name+0.063
 names+0.056
 They+0.054
Circ+0.054
 fence+0.054
Neg logit contributions
eling-0.101
ited-0.089
i-0.088
ering-0.086
ie-0.085
Token."
Token ID526
Feature activation+8.26e-01

Pos logit contributions
eling+0.585
i+0.520
ering+0.504
ited+0.503
ie+0.502
Neg logit contributions
 name-0.388
 names-0.344
Circ-0.328
 fence-0.322
 They-0.321
Tokenuffy
Token ID15352
Feature activation-4.82e+00

Pos logit contributions
eling+0.077
i+0.066
ering+0.064
ited+0.063
ie+0.063
Neg logit contributions
 name-0.089
 names-0.082
Circ-0.079
 to-0.079
-0.078
Token and
Token ID290
Feature activation-3.40e+00

Pos logit contributions
eling+0.330
ited+0.291
ering+0.281
heart+0.281
i+0.272
Neg logit contributions
 name-0.213
 to-0.197
-0.197
 They-0.187
 names-0.184
Token his
Token ID465
Feature activation-5.55e+00

Pos logit contributions
eling+0.219
i+0.189
ie+0.186
ited+0.184
ering+0.181
Neg logit contributions
 name-0.155
 names-0.140
Circ-0.133
 Bang-0.132
 fence-0.129
Token friends
Token ID2460
Feature activation+4.20e+00

Pos logit contributions
eling+0.236
i+0.230
ie+0.220
ering+0.206
itive+0.196
Neg logit contributions
Circ-0.301
 Where-0.285
 Bang-0.282
 This-0.280
 Along-0.278
Token were
Token ID547
Feature activation-3.89e+00

Pos logit contributions
 name+0.187
 to+0.177
+0.176
 They+0.171
 names+0.171
Neg logit contributions
eling-0.286
heart-0.245
ited-0.244
ering-0.240
mmm-0.238
Token afraid
Token ID7787
Feature activation-1.70e+00

Pos logit contributions
eling+0.234
i+0.199
ited+0.198
ering+0.197
ie+0.190
Neg logit contributions
 name-0.199
 names-0.180
Circ-0.172
-0.170
 Bang-0.170
Token of
Token ID286
Feature activation+4.77e+00

Pos logit contributions
eling+0.119
i+0.107
ie+0.106
ering+0.104
ited+0.103
Neg logit contributions
 name-0.070
 names-0.059
Circ-0.056
 fence-0.054
 Bang-0.054
Token the
Token ID262
Feature activation-3.71e+00

Pos logit contributions
 name+0.218
 names+0.189
 Bang+0.183
Circ+0.182
 They+0.177
Neg logit contributions
eling-0.323
i-0.272
ering-0.269
ited-0.262
ie-0.261
Token big
Token ID1263
Feature activation+1.01e+01

Pos logit contributions
eling+0.223
ited+0.193
ering+0.183
ie+0.178
ising+0.177
Neg logit contributions
 name-0.140
Circ-0.139
 Bang-0.135
tr-0.130
 To-0.130
Token dog
Token ID3290
Feature activation+1.06e+01

Pos logit contributions
 name+0.282
 names+0.235
Circ+0.224
 Bang+0.221
+0.205
Neg logit contributions
eling-0.807
ited-0.725
i-0.725
ie-0.718
ering-0.713
Token.
Token ID13
Feature activation+2.22e+00

Pos logit contributions
 name+0.436
 names+0.379
Circ+0.356
+0.350
 Bang+0.350
Neg logit contributions
eling-0.740
i-0.654
ited-0.650
ering-0.643
ie-0.638
Token learn
Token ID2193
Feature activation+1.97e+00

Pos logit contributions
 name+0.558
 names+0.481
+0.458
 They+0.451
 to+0.450
Neg logit contributions
eling-1.012
ited-0.896
i-0.886
ering-0.876
ie-0.856
Token.
Token ID13
Feature activation+3.20e+00

Pos logit contributions
 name+0.081
 names+0.071
+0.067
 They+0.067
 to+0.066
Neg logit contributions
eling-0.139
ited-0.122
i-0.121
ering-0.119
ie-0.117
Token
Token ID220
Feature activation+3.01e+00

Pos logit contributions
 name+0.090
 names+0.083
 fence+0.077
 road+0.076
Circ+0.075
Neg logit contributions
eling-0.235
heart-0.221
i-0.215
ieved-0.211
ie-0.211
Token
Token ID198
Feature activation+9.88e+00

Pos logit contributions
 road+0.063
 fence+0.063
 name+0.060
Circ+0.058
 corner+0.057
Neg logit contributions
eling-0.230
i-0.212
ering-0.212
mmm-0.210
ieved-0.208
Token
Token ID198
Feature activation+9.91e+00

Pos logit contributions
 name+0.215
 names+0.184
 fence+0.154
 road+0.133
 across+0.126
Neg logit contributions
eling-0.842
ie-0.768
ited-0.766
ering-0.759
i-0.757
TokenTim
Token ID14967
Feature activation-1.78e+00

Pos logit contributions
 name+0.443
 names+0.389
+0.381
 They+0.375
 to+0.370
Neg logit contributions
eling-0.668
ited-0.581
i-0.580
ering-0.570
ie-0.557
Tokenmy
Token ID1820
Feature activation-6.64e+00

Pos logit contributions
eling+0.118
ering+0.100
ited+0.096
ience+0.095
ises+0.093
Neg logit contributions
 name-0.084
 names-0.074
-0.074
 to-0.074
 They-0.069
Token felt
Token ID2936
Feature activation-8.32e+00

Pos logit contributions
eling+0.445
ering+0.377
ited+0.372
heart+0.359
i+0.356
Neg logit contributions
 name-0.328
-0.296
 to-0.294
 names-0.283
 They-0.278
Token happy
Token ID3772
Feature activation-6.62e+00

Pos logit contributions
eling+0.575
i+0.527
ie+0.518
ited+0.508
ering+0.507
Neg logit contributions
 name-0.379
-0.340
 to-0.327
 across-0.315
 They-0.308
Token to
Token ID284
Feature activation+1.40e+01

Pos logit contributions
eling+0.470
ited+0.415
i+0.402
ering+0.395
ience+0.388
Neg logit contributions
 name-0.286
 to-0.256
-0.251
 names-0.247
 They-0.239
Token know
Token ID760
Feature activation+2.82e+00

Pos logit contributions
 name+0.542
 names+0.461
+0.443
 They+0.435
 to+0.434
Neg logit contributions
eling-1.037
ited-0.920
i-0.910
ering-0.902
ie-0.879
Token inside
Token ID2641
Feature activation+8.76e+00

Pos logit contributions
 name+0.093
 to+0.085
+0.081
 names+0.079
 across+0.078
Neg logit contributions
eling-0.150
ited-0.130
ience-0.125
i-0.124
ering-0.123
Token.
Token ID13
Feature activation+2.36e+00

Pos logit contributions
 name+0.332
 to+0.298
+0.292
 names+0.281
 They+0.279
Neg logit contributions
eling-0.646
ited-0.581
i-0.566
ering-0.554
ience-0.551
Token
Token ID198
Feature activation+9.21e+00

Pos logit contributions
 names+0.072
 name+0.072
Circ+0.068
 fence+0.064
 road+0.062
Neg logit contributions
eling-0.161
i-0.153
ie-0.150
heart-0.149
ience-0.145
Token
Token ID198
Feature activation+9.28e+00

Pos logit contributions
 name+0.252
 names+0.240
 fence+0.205
 road+0.192
Circ+0.188
Neg logit contributions
eling-0.733
ie-0.681
ering-0.657
i-0.648
ise-0.640
TokenAs
Token ID1722
Feature activation-7.08e+00

Pos logit contributions
 name+0.474
 names+0.431
 fence+0.398
 to+0.395
 They+0.394
Neg logit contributions
eling-0.567
ited-0.492
i-0.477
ering-0.476
ie-0.470
Token he
Token ID339
Feature activation-4.06e+00

Pos logit contributions
eling+0.462
ited+0.401
ering+0.391
i+0.388
heart+0.369
Neg logit contributions
 name-0.337
-0.316
 to-0.309
 They-0.307
 names-0.292
Token was
Token ID373
Feature activation-2.41e+00

Pos logit contributions
eling+0.263
ering+0.234
heart+0.228
ited+0.225
i+0.223
Neg logit contributions
 name-0.176
-0.168
 to-0.160
 names-0.150
Circ-0.149
Token flying
Token ID7348
Feature activation+1.86e+00

Pos logit contributions
eling+0.158
ited+0.140
i+0.136
ering+0.133
heart+0.132
Neg logit contributions
 name-0.112
 to-0.099
-0.097
 names-0.096
 across-0.093
Token back
Token ID736
Feature activation+5.72e+00

Pos logit contributions
 name+0.091
 to+0.086
 across+0.083
+0.081
 names+0.078
Neg logit contributions
eling-0.122
ited-0.104
i-0.100
ience-0.099
ering-0.097
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
 name+0.251
 names+0.227
Circ+0.213
 Bang+0.211
 They+0.209
Neg logit contributions
eling-0.375
i-0.337
ering-0.330
ie-0.330
ited-0.324
Token the
Token ID262
Feature activation-3.39e+00

Pos logit contributions
 name+0.598
 names+0.534
 fence+0.500
 They+0.499
Circ+0.497
Neg logit contributions
eling-0.894
i-0.775
ited-0.764
ie-0.756
ering-0.755
Token
Token ID198
Feature activation+9.45e+00

Pos logit contributions
 name+0.238
 names+0.223
Circ+0.194
 road+0.192
 fence+0.190
Neg logit contributions
eling-0.719
ie-0.671
i-0.659
ering-0.656
ise-0.641
Token"
Token ID1
Feature activation-5.81e+00

Pos logit contributions
 name+0.468
 names+0.416
+0.407
 They+0.402
 to+0.396
Neg logit contributions
eling-0.598
ited-0.516
i-0.511
ering-0.503
ie-0.493
TokenCan
Token ID6090
Feature activation+3.46e+00

Pos logit contributions
eling+0.409
ited+0.376
i+0.358
ering+0.341
heart+0.340
Neg logit contributions
 name-0.223
 to-0.191
 names-0.188
 fence-0.183
 road-0.181
Token we
Token ID356
Feature activation-6.65e+00

Pos logit contributions
 name+0.164
 names+0.145
+0.140
 They+0.138
 to+0.138
Neg logit contributions
eling-0.226
ited-0.196
i-0.194
ering-0.191
ie-0.186
Token eat
Token ID4483
Feature activation+2.46e+00

Pos logit contributions
eling+0.487
i+0.440
ited+0.434
ie+0.423
ering+0.411
Neg logit contributions
 name-0.230
 names-0.206
Circ-0.200
 fence-0.193
 Bang-0.191
Token it
Token ID340
Feature activation-2.05e+00

Pos logit contributions
 name+0.102
+0.089
 to+0.089
 names+0.089
 They+0.084
Neg logit contributions
eling-0.175
ited-0.159
ering-0.150
i-0.149
heart-0.146
Token?"
Token ID1701
Feature activation-8.93e+00

Pos logit contributions
eling+0.128
ited+0.110
i+0.105
ering+0.105
ience+0.103
Neg logit contributions
 name-0.106
 to-0.097
-0.096
 names-0.093
 They-0.091
Token Anna
Token ID11735
Feature activation-1.16e+00

Pos logit contributions
eling+0.586
i+0.511
ited+0.510
ie+0.493
ering+0.489
Neg logit contributions
 name-0.397
 names-0.355
 fence-0.340
Circ-0.333
 road-0.329
Token asked
Token ID1965
Feature activation-5.53e-01

Pos logit contributions
eling+0.079
ited+0.068
ering+0.066
ience+0.065
ie+0.064
Neg logit contributions
 name-0.053
-0.050
 to-0.047
 They-0.046
 names-0.046
Token.
Token ID13
Feature activation+2.68e+00

Pos logit contributions
eling+0.042
i+0.037
ited+0.037
ering+0.037
ie+0.036
Neg logit contributions
 name-0.020
 names-0.017
 fence-0.015
 They-0.015
-0.015
Token
Token ID198
Feature activation+9.36e+00

Pos logit contributions
 road+0.088
 names+0.076
 fence+0.076
 name+0.072
tr+0.072
Neg logit contributions
i-0.146
eling-0.146
 reassured-0.142
 chuckled-0.139
mmm-0.137
TokenS
Token ID50
Feature activation-4.57e+00

Pos logit contributions
 name+0.447
 names+0.405
 to+0.375
 fence+0.373
 Bang+0.369
Neg logit contributions
eling-0.584
i-0.502
ited-0.499
ering-0.486
ie-0.482
Tokenara
Token ID3301
Feature activation-1.45e+01

Pos logit contributions
eling+0.250
ited+0.209
i+0.206
ering+0.204
ie+0.197
Neg logit contributions
 name-0.266
 names-0.242
-0.234
 They-0.232
 to-0.231
Token and
Token ID290
Feature activation-3.61e+00

Pos logit contributions
eling+0.905
i+0.796
ited+0.784
ering+0.769
ie+0.758
Neg logit contributions
 name-0.665
 names-0.597
Circ-0.579
 Bang-0.567
 fence-0.565
Token Tom
Token ID4186
Feature activation+2.54e+00

Pos logit contributions
eling+0.223
i+0.196
ited+0.188
ering+0.182
ie+0.181
Neg logit contributions
 name-0.171
 names-0.153
Circ-0.151
 corner-0.148
 across-0.147
Token are
Token ID389
Feature activation-3.47e+00

Pos logit contributions
 name+0.118
 names+0.104
+0.098
 They+0.097
 to+0.095
Neg logit contributions
eling-0.170
ited-0.149
ering-0.148
i-0.147
ie-0.143
Token happy
Token ID3772
Feature activation-7.58e+00

Pos logit contributions
eling+0.223
ited+0.197
i+0.192
ering+0.189
ie+0.186
Neg logit contributions
 name-0.173
-0.151
 names-0.150
 They-0.147
 to-0.146
Token.
Token ID13
Feature activation+2.04e+00

Pos logit contributions
eling+0.508
i+0.456
ie+0.448
ering+0.442
ited+0.432
Neg logit contributions
 name-0.315
 names-0.279
Circ-0.273
 fence-0.266
 Bang-0.257
Token They
Token ID1119
Feature activation+7.52e+00

Pos logit contributions
 corner+0.049
tr+0.048
Circ+0.046
 across+0.046
 name+0.045
Neg logit contributions
i-0.130
eling-0.124
heart-0.122
mmm-0.121
ise-0.114
Token say
Token ID910
Feature activation-4.03e+00

Pos logit contributions
 name+0.331
 names+0.282
+0.269
 They+0.267
 to+0.258
Neg logit contributions
eling-0.532
ering-0.471
ited-0.464
i-0.461
ie-0.452
Token,
Token ID11
Feature activation-1.26e+01

Pos logit contributions
eling+0.234
ited+0.203
i+0.187
ering+0.184
ie+0.177
Neg logit contributions
 name-0.238
 to-0.220
-0.215
 names-0.215
 They-0.211
Token "
Token ID366
Feature activation+6.91e+00

Pos logit contributions
eling+0.928
i+0.834
ering+0.824
ience+0.809
ited+0.805
Neg logit contributions
 name-0.367
 names-0.342
Circ-0.315
 fence-0.297
 road-0.294
Tokencake
Token ID30560
Feature activation+2.00e+00

Pos logit contributions
 name+0.218
 names+0.192
Circ+0.175
 They+0.175
+0.174
Neg logit contributions
eling-0.361
i-0.325
ited-0.324
ie-0.319
ering-0.319
Token.
Token ID13
Feature activation+2.39e+00

Pos logit contributions
 name+0.074
 to+0.063
 names+0.062
+0.062
 They+0.060
Neg logit contributions
eling-0.152
ited-0.133
ering-0.130
i-0.130
ience-0.126
Token It
Token ID632
Feature activation+8.60e+00

Pos logit contributions
 name+0.104
 names+0.104
 fence+0.096
 road+0.095
Circ+0.094
Neg logit contributions
eling-0.128
ience-0.124
i-0.116
 reassured-0.113
ering-0.113
Token was
Token ID373
Feature activation-2.49e+00

Pos logit contributions
 name+0.278
+0.276
 to+0.255
 big+0.236
 They+0.235
Neg logit contributions
eling-0.678
ering-0.598
ited-0.596
leness-0.574
eking-0.564
Token delicious
Token ID12625
Feature activation-6.71e+00

Pos logit contributions
eling+0.189
ited+0.168
ering+0.165
ience+0.162
i+0.161
Neg logit contributions
 name-0.100
-0.088
 to-0.085
 names-0.080
 They-0.079
Token!
Token ID0
Feature activation-9.71e+00

Pos logit contributions
eling+0.435
ited+0.370
ering+0.355
i+0.352
ience+0.343
Neg logit contributions
 name-0.334
 to-0.304
 names-0.295
-0.294
 They-0.288
Token
Token ID198
Feature activation+9.00e+00

Pos logit contributions
eling+0.611
ience+0.601
i+0.563
 reassured+0.560
ie+0.559
Neg logit contributions
 name-0.330
 names-0.326
 fence-0.280
 road-0.277
Circ-0.273
Token
Token ID198
Feature activation+8.98e+00

Pos logit contributions
 names+0.292
 name+0.287
 fence+0.230
 road+0.229
Circ+0.225
Neg logit contributions
eling-0.622
ie-0.581
ience-0.555
ering-0.553
ieved-0.547
TokenBlue
Token ID14573
Feature activation-2.23e-01

Pos logit contributions
 name+0.507
 names+0.458
+0.446
 They+0.442
 to+0.439
Neg logit contributions
eling-0.500
ited-0.423
i-0.419
ering-0.411
ie-0.399
Token smiled
Token ID13541
Feature activation-1.55e+01

Pos logit contributions
eling+0.015
ited+0.013
ering+0.013
heart+0.012
iable+0.012
Neg logit contributions
-0.009
 name-0.009
 to-0.009
 They-0.008
 fence-0.008
Token and
Token ID290
Feature activation-3.59e+00

Pos logit contributions
eling+1.044
ited+0.916
i+0.913
ering+0.897
ie+0.881
Neg logit contributions
 name-0.685
 names-0.606
-0.573
 They-0.570
 fence-0.566
Token things
Token ID1243
Feature activation+1.81e+00

Pos logit contributions
eling+0.434
i+0.384
ited+0.382
ie+0.379
ering+0.372
Neg logit contributions
 name-0.241
Circ-0.227
 Bang-0.222
 fence-0.213
 They-0.213
Token back
Token ID736
Feature activation+6.10e+00

Pos logit contributions
 name+0.059
 names+0.049
 to+0.049
+0.048
 They+0.047
Neg logit contributions
eling-0.144
ited-0.129
i-0.127
ering-0.126
ie-0.123
Token.
Token ID13
Feature activation+2.63e+00

Pos logit contributions
 name+0.197
 names+0.164
+0.154
 They+0.152
 to+0.150
Neg logit contributions
eling-0.485
ited-0.433
i-0.429
ering-0.425
ie-0.417
Token But
Token ID887
Feature activation+1.06e+00

Pos logit contributions
 name+0.068
 names+0.062
 fence+0.060
 corner+0.056
 across+0.054
Neg logit contributions
eling-0.174
i-0.162
mmm-0.162
heart-0.155
ise-0.154
Token he
Token ID339
Feature activation-3.82e+00

Pos logit contributions
 name+0.043
 to+0.042
+0.041
 They+0.038
 across+0.037
Neg logit contributions
eling-0.076
ited-0.071
heart-0.065
i-0.064
ience-0.064
Token never
Token ID1239
Feature activation-5.69e+00

Pos logit contributions
eling+0.247
heart+0.221
ering+0.218
ited+0.218
itive+0.204
Neg logit contributions
 name-0.178
-0.163
 to-0.160
 names-0.147
 They-0.146
Token did
Token ID750
Feature activation-5.08e+00

Pos logit contributions
eling+0.330
ering+0.277
ited+0.272
i+0.268
heart+0.264
Neg logit contributions
 name-0.314
 to-0.290
 names-0.282
-0.280
 They-0.269
Token.
Token ID13
Feature activation+2.58e+00

Pos logit contributions
eling+0.331
ited+0.293
i+0.286
ering+0.281
ie+0.277
Neg logit contributions
 name-0.242
 names-0.212
 to-0.208
-0.207
 They-0.202
Token The
Token ID383
Feature activation+3.07e+00

Pos logit contributions
 fence+0.068
 name+0.068
 names+0.065
 across+0.059
 road+0.056
Neg logit contributions
eling-0.170
mmm-0.159
ise-0.151
heart-0.151
ieved-0.150
Token end
Token ID886
Feature activation-1.02e+01

Pos logit contributions
 name+0.152
 names+0.136
Circ+0.133
 They+0.132
 Bang+0.132
Neg logit contributions
eling-0.190
ited-0.161
ering-0.156
i-0.153
ience-0.151
Token.
Token ID13
Feature activation+2.62e+00

Pos logit contributions
eling+0.559
ie+0.539
ering+0.509
i+0.507
ise+0.496
Neg logit contributions
 name-0.480
 names-0.436
Circ-0.407
 Bang-0.398
 fence-0.393
Token "
Token ID366
Feature activation+7.22e+00

Pos logit contributions
 name+0.087
Circ+0.087
 names+0.082
 road+0.075
 across+0.075
Neg logit contributions
eling-0.145
i-0.140
ie-0.136
ering-0.132
heart-0.130
TokenCan
Token ID6090
Feature activation+3.36e+00

Pos logit contributions
 name+0.307
 to+0.267
 names+0.266
 yard+0.252
 fence+0.249
Neg logit contributions
eling-0.443
ited-0.415
i-0.382
heart-0.378
ie-0.369
Token we
Token ID356
Feature activation-6.81e+00

Pos logit contributions
 name+0.179
 names+0.161
+0.155
 They+0.154
 to+0.152
Neg logit contributions
eling-0.197
ited-0.168
i-0.167
ering-0.164
ie-0.160
Token pick
Token ID2298
Feature activation+8.12e+00

Pos logit contributions
eling+0.489
i+0.443
ited+0.427
ie+0.426
ering+0.408
Neg logit contributions
 name-0.242
 names-0.218
Circ-0.215
 Bang-0.207
 They-0.206
Token one
Token ID530
Feature activation-2.37e+00

Pos logit contributions
 name+0.309
 names+0.263
+0.249
 They+0.249
 fence+0.246
Neg logit contributions
eling-0.596
i-0.527
ited-0.523
ering-0.515
ie-0.509
Token?"
Token ID1701
Feature activation-9.25e+00

Pos logit contributions
eling+0.133
i+0.114
ited+0.113
ering+0.111
ie+0.111
Neg logit contributions
 name-0.129
 names-0.117
 They-0.113
-0.111
Circ-0.111
Token "
Token ID366
Feature activation+7.17e+00

Pos logit contributions
eling+0.564
i+0.512
ie+0.507
ering+0.485
ited+0.485
Neg logit contributions
 name-0.418
 names-0.386
Circ-0.379
 fence-0.356
 Bang-0.356
TokenSure
Token ID19457
Feature activation-5.06e+00

Pos logit contributions
 name+0.331
 yard+0.291
 names+0.291
 fence+0.289
 road+0.278
Neg logit contributions
eling-0.436
ited-0.413
i-0.372
ering-0.366
ieved-0.354
Token,
Token ID11
Feature activation-1.25e+01

Pos logit contributions
eling+0.344
ited+0.296
ering+0.277
i+0.266
lling+0.265
Neg logit contributions
 to-0.254
 name-0.252
-0.223
 names-0.214
 big-0.204
Token Lily
Token ID20037
Feature activation-1.28e+00

Pos logit contributions
eling+0.895
ited+0.804
i+0.749
ering+0.744
ience+0.720
Neg logit contributions
 name-0.569
 to-0.520
 names-0.501
-0.493
 They-0.481
Token,"
Token ID553
Feature activation-9.06e+00

Pos logit contributions
eling+0.096
ited+0.083
ering+0.079
i+0.077
ience+0.076
Neg logit contributions
 name-0.054
-0.047
 to-0.047
 They-0.044
 names-0.044
Token "
Token ID366
Feature activation+7.67e+00

Pos logit contributions
eling+1.012
ited+0.908
i+0.905
ering+0.896
ie+0.881
Neg logit contributions
 name-0.314
 names-0.251
-0.226
 They-0.223
 to-0.219
TokenThe
Token ID464
Feature activation-4.03e+00

Pos logit contributions
 name+0.284
 names+0.240
 to+0.224
 They+0.217
 fence+0.215
Neg logit contributions
eling-0.569
ited-0.534
i-0.504
ie-0.496
heart-0.490
Token cave
Token ID11527
Feature activation+8.13e+00

Pos logit contributions
eling+0.234
ie+0.206
heart+0.190
ering+0.189
ience+0.188
Neg logit contributions
Circ-0.167
 name-0.151
 names-0.145
 This-0.138
iffs-0.137
Token is
Token ID318
Feature activation-3.08e+00

Pos logit contributions
 name+0.297
 to+0.251
 names+0.249
 fence+0.240
 They+0.239
Neg logit contributions
eling-0.612
ited-0.561
i-0.540
ering-0.529
heart-0.517
Token the
Token ID262
Feature activation-3.07e+00

Pos logit contributions
eling+0.200
ited+0.171
i+0.171
ering+0.169
ie+0.166
Neg logit contributions
 name-0.142
 names-0.127
Circ-0.121
 They-0.120
-0.119
Token best
Token ID1266
Feature activation-6.25e+00

Pos logit contributions
eling+0.172
ie+0.161
heart+0.155
i+0.150
ering+0.147
Neg logit contributions
Circ-0.120
 name-0.112
iffs-0.112
 This-0.110
 They-0.110
Token shelter
Token ID11772
Feature activation+1.15e+01

Pos logit contributions
eling+0.421
ie+0.416
ering+0.391
heart+0.385
i+0.378
Neg logit contributions
 name-0.207
 names-0.195
Circ-0.183
 Some-0.174
 Bang-0.169
Token."
Token ID526
Feature activation+5.68e-01

Pos logit contributions
 name+0.544
 names+0.481
 to+0.466
+0.466
 They+0.463
Neg logit contributions
eling-0.746
ited-0.649
i-0.636
ering-0.627
ie-0.610
Token So
Token ID1406
Feature activation-4.08e+00

Pos logit contributions
 name+0.022
 names+0.020
Circ+0.020
 corner+0.019
 across+0.019
Neg logit contributions
eling-0.034
i-0.031
heart-0.031
ie-0.030
ise-0.029
Token,
Token ID11
Feature activation-1.19e+01

Pos logit contributions
eling+0.289
ited+0.262
ering+0.234
i+0.225
ieved+0.220
Neg logit contributions
 name-0.203
 to-0.199
-0.197
 They-0.191
 The-0.179
Token it
Token ID340
Feature activation-1.88e+00

Pos logit contributions
eling+0.763
ited+0.657
i+0.656
ering+0.644
ie+0.634
Neg logit contributions
 name-0.566
 names-0.506
-0.474
 They-0.472
 fence-0.470
Token.
Token ID13
Feature activation+2.21e+00

Pos logit contributions
eling+0.434
ited+0.385
i+0.379
ering+0.374
ie+0.367
Neg logit contributions
 name-0.211
 names-0.181
 to-0.174
-0.172
 They-0.170
Token Lily
Token ID20037
Feature activation-1.46e+00

Pos logit contributions
 name+0.093
 names+0.085
Circ+0.083
 fence+0.080
 road+0.077
Neg logit contributions
eling-0.134
ience-0.118
ited-0.118
ie-0.116
heart-0.116
Token watched
Token ID7342
Feature activation-4.99e+00

Pos logit contributions
eling+0.100
ited+0.088
heart+0.087
ering+0.086
leness+0.084
Neg logit contributions
 to-0.061
-0.061
 name-0.060
 They-0.052
 fence-0.052
Token in
Token ID287
Feature activation+2.84e+00

Pos logit contributions
eling+0.354
ited+0.323
i+0.312
ience+0.301
ering+0.301
Neg logit contributions
 name-0.206
 to-0.185
-0.183
 names-0.174
 They-0.173
Token amaz
Token ID40642
Feature activation+1.72e+00

Pos logit contributions
 name+0.149
 names+0.134
 They+0.128
+0.128
Circ+0.127
Neg logit contributions
eling-0.169
ited-0.145
i-0.142
ering-0.140
ie-0.137
Tokenement
Token ID972
Feature activation-2.09e+01

Pos logit contributions
 to+0.109
 name+0.105
+0.103
 It+0.100
 names+0.098
Neg logit contributions
eling-0.068
ited-0.062
ience-0.059
pper-0.057
i-0.055
Token as
Token ID355
Feature activation-7.93e+00

Pos logit contributions
eling+1.478
ited+1.318
ience+1.224
i+1.209
ering+1.204
Neg logit contributions
 name-0.897
 to-0.859
-0.808
 They-0.790
 names-0.775
Token the
Token ID262
Feature activation-4.07e+00

Pos logit contributions
eling+0.536
ited+0.472
i+0.464
ering+0.458
ie+0.445
Neg logit contributions
 name-0.355
 names-0.311
-0.301
 They-0.297
 to-0.296
Token cob
Token ID22843
Feature activation-1.67e+00

Pos logit contributions
eling+0.258
ited+0.222
i+0.219
ie+0.218
ering+0.214
Neg logit contributions
 name-0.193
 names-0.173
 They-0.172
Circ-0.168
-0.167
Tokenweb
Token ID12384
Feature activation-3.18e+00

Pos logit contributions
eling+0.087
ited+0.079
i+0.071
ience+0.070
ering+0.070
Neg logit contributions
 name-0.094
 to-0.088
 names-0.085
-0.084
 fence-0.084
Token grew
Token ID6348
Feature activation-2.74e+00

Pos logit contributions
eling+0.198
ited+0.169
i+0.168
heart+0.164
ering+0.163
Neg logit contributions
 name-0.151
 to-0.149
-0.138
 names-0.135
 across-0.133
Token,
Token ID11
Feature activation-1.23e+01

Pos logit contributions
 name+0.109
 to+0.105
+0.103
 They+0.098
 names+0.096
Neg logit contributions
eling-0.130
ited-0.115
ering-0.108
i-0.105
ience-0.103
Token Molly
Token ID30236
Feature activation-3.17e+00

Pos logit contributions
eling+0.676
ited+0.571
i+0.565
ering+0.554
ie+0.540
Neg logit contributions
 name-0.701
 names-0.635
-0.613
 They-0.609
 to-0.604
Token said
Token ID531
Feature activation-6.16e+00

Pos logit contributions
eling+0.214
ering+0.183
ited+0.183
i+0.176
heart+0.173
Neg logit contributions
 name-0.145
 to-0.130
-0.128
 names-0.126
 They-0.126
Token to
Token ID284
Feature activation+1.36e+01

Pos logit contributions
eling+0.496
ited+0.444
i+0.438
ering+0.432
ie+0.425
Neg logit contributions
 name-0.201
 names-0.167
-0.157
 to-0.157
 They-0.155
Token M
Token ID337
Feature activation-2.62e+00

Pos logit contributions
 name+0.714
 names+0.638
+0.618
 They+0.611
 to+0.608
Neg logit contributions
eling-0.815
ited-0.704
i-0.690
ering-0.684
ie-0.662
Tokenilly
Token ID6548
Feature activation-2.12e+01

Pos logit contributions
eling+0.106
ited+0.084
i+0.080
ering+0.080
ie+0.074
Neg logit contributions
 name-0.188
 names-0.174
-0.170
 They-0.168
 to-0.168
Token,
Token ID11
Feature activation-1.20e+01

Pos logit contributions
eling+1.191
ited+1.013
i+0.970
ering+0.966
ie+0.923
Neg logit contributions
 name-1.218
 names-1.100
-1.069
 to-1.068
 They-1.062
Token �
Token ID6184
Feature activation-8.66e+00

Pos logit contributions
eling+0.986
i+0.899
ering+0.882
ited+0.873
ie+0.873
Neg logit contributions
 name-0.308
 names-0.259
Circ-0.229
 fence-0.228
 road-0.227
Token�
Token ID95
Feature activation-1.01e+01

Pos logit contributions
eling+0.585
ited+0.512
i+0.505
ering+0.503
ie+0.493
Neg logit contributions
 name-0.388
 names-0.336
-0.325
 They-0.324
 to-0.317
Token€
Token ID26391
Feature activation-1.32e+01

Pos logit contributions
eling+0.662
ited+0.578
i+0.567
ering+0.551
heart+0.539
Neg logit contributions
 name-0.468
 names-0.422
 to-0.399
 fence-0.393
-0.392
Token�
Token ID129
Feature activation-4.86e+00

Pos logit contributions
eling+0.793
i+0.700
ited+0.689
ering+0.672
ie+0.655
Neg logit contributions
 name-0.647
 names-0.589
Circ-0.555
 fence-0.553
 Bang-0.551
Token scattered
Token ID16830
Feature activation-5.96e-02

Pos logit contributions
 name+0.387
 names+0.338
+0.323
 They+0.320
 to+0.320
Neg logit contributions
eling-0.637
ited-0.560
i-0.551
ering-0.544
ie-0.534
Token around
Token ID1088
Feature activation+9.80e+00

Pos logit contributions
eling+0.004
ited+0.004
i+0.004
ering+0.004
ie+0.004
Neg logit contributions
 name-0.002
 names-0.002
-0.002
 They-0.002
Circ-0.002
Token it
Token ID340
Feature activation-1.74e+00

Pos logit contributions
 name+0.459
 names+0.406
+0.388
 They+0.385
 to+0.381
Neg logit contributions
eling-0.636
ited-0.552
i-0.548
ering-0.540
ie-0.529
Token.
Token ID13
Feature activation+3.34e+00

Pos logit contributions
eling+0.133
ited+0.118
i+0.117
ering+0.115
ie+0.113
Neg logit contributions
 name-0.062
 names-0.053
-0.050
 to-0.049
 They-0.049
Token Sad
Token ID14668
Feature activation-3.58e+00

Pos logit contributions
 name+0.166
 names+0.161
 road+0.153
Circ+0.151
 fence+0.150
Neg logit contributions
eling-0.166
heart-0.152
ie-0.146
i-0.144
ience-0.142
Tokenie
Token ID494
Feature activation-2.72e+01

Pos logit contributions
eling+0.117
iable+0.087
ited+0.082
mmm+0.081
ieved+0.077
Neg logit contributions
 to-0.250
 name-0.241
-0.239
 corner-0.234
 names-0.232
Token was
Token ID373
Feature activation-1.75e+00

Pos logit contributions
eling+1.940
ited+1.707
ering+1.619
i+1.600
heart+1.597
Neg logit contributions
 name-1.162
 to-1.073
-1.068
 They-0.988
 names-0.974
Token scared
Token ID12008
Feature activation+1.28e+00

Pos logit contributions
eling+0.109
ited+0.095
i+0.093
ering+0.091
ie+0.091
Neg logit contributions
 name-0.089
 names-0.078
-0.077
 to-0.076
 They-0.075
Token and
Token ID290
Feature activation-2.66e+00

Pos logit contributions
 name+0.055
 to+0.054
+0.051
 They+0.049
 across+0.048
Neg logit contributions
eling-0.090
ited-0.079
i-0.074
ience-0.074
ering-0.073
Token started
Token ID2067
Feature activation-1.32e+00

Pos logit contributions
eling+0.148
ited+0.136
i+0.127
ering+0.125
ience+0.125
Neg logit contributions
 to-0.138
 name-0.137
-0.131
 across-0.127
 They-0.127
Token crying
Token ID13774
Feature activation-3.24e-02

Pos logit contributions
eling+0.118
i+0.109
ering+0.108
ie+0.104
ited+0.103
Neg logit contributions
 name-0.026
 names-0.020
Circ-0.019
 Bang-0.017
 road-0.016
Token him
Token ID683
Feature activation-5.09e+00

Pos logit contributions
eling+0.254
i+0.235
ering+0.231
ie+0.227
ited+0.222
Neg logit contributions
 name-0.099
 names-0.080
 fence-0.079
Circ-0.075
 road-0.074
Token some
Token ID617
Feature activation+5.93e+00

Pos logit contributions
eling+0.360
i+0.331
ie+0.317
ering+0.313
ited+0.308
Neg logit contributions
 name-0.169
 road-0.155
 Bang-0.153
 fence-0.151
Circ-0.147
Token popcorn
Token ID38914
Feature activation-2.41e+00

Pos logit contributions
 name+0.215
Circ+0.192
 Bang+0.189
 They+0.184
+0.184
Neg logit contributions
eling-0.423
i-0.371
ited-0.370
ie-0.368
ering-0.365
Token.
Token ID13
Feature activation+2.41e+00

Pos logit contributions
eling+0.176
i+0.163
ering+0.158
ie+0.156
ited+0.156
Neg logit contributions
 name-0.084
 names-0.071
Circ-0.068
-0.067
 Bang-0.066
Token Mat
Token ID6550
Feature activation-4.42e+00

Pos logit contributions
 name+0.080
 names+0.077
Circ+0.071
 corner+0.069
 fence+0.067
Neg logit contributions
eling-0.160
i-0.147
ering-0.142
heart-0.140
ience-0.139
Tokeni
Token ID72
Feature activation-2.59e+01

Pos logit contributions
eling+0.173
ited+0.149
heart+0.142
itive+0.134
ering+0.132
Neg logit contributions
 name-0.269
-0.265
 to-0.255
 across-0.237
 names-0.233
Token was
Token ID373
Feature activation-2.43e+00

Pos logit contributions
eling+1.733
ited+1.497
ering+1.465
i+1.425
heart+1.421
Neg logit contributions
 name-1.213
-1.100
 to-1.092
 They-1.068
 names-1.061
Token very
Token ID845
Feature activation-6.67e+00

Pos logit contributions
eling+0.159
ited+0.139
i+0.137
ering+0.135
ie+0.133
Neg logit contributions
 name-0.116
 names-0.100
-0.099
 to-0.099
 They-0.096
Token thankful
Token ID25535
Feature activation-1.21e+01

Pos logit contributions
eling+0.451
i+0.393
ited+0.391
ie+0.382
ering+0.380
Neg logit contributions
 name-0.318
-0.271
 to-0.267
 names-0.262
 They-0.260
Token and
Token ID290
Feature activation-3.42e+00

Pos logit contributions
eling+0.930
ited+0.826
i+0.802
ering+0.795
ie+0.780
Neg logit contributions
 name-0.438
 to-0.373
 names-0.370
-0.367
 They-0.352
Token took
Token ID1718
Feature activation-9.79e-01

Pos logit contributions
eling+0.211
ited+0.187
ering+0.181
i+0.178
ie+0.172
Neg logit contributions
 name-0.173
 to-0.170
 They-0.164
-0.163
 He-0.153
Token a
Token ID257
Feature activation+4.57e+00

Pos logit contributions
eling+0.132
ited+0.119
i+0.119
ie+0.117
ering+0.114
Neg logit contributions
 name-0.056
 names-0.048
 fence-0.046
Circ-0.044
 road-0.043
Token young
Token ID1862
Feature activation+1.25e+00

Pos logit contributions
 name+0.164
 names+0.143
 They+0.132
+0.132
 to+0.131
Neg logit contributions
eling-0.341
ited-0.303
i-0.302
ering-0.296
ie-0.294
Token girl
Token ID2576
Feature activation-5.19e+00

Pos logit contributions
 name+0.044
 names+0.036
+0.035
 They+0.034
 fence+0.034
Neg logit contributions
eling-0.098
ited-0.087
ering-0.085
i-0.085
ie-0.082
Token named
Token ID3706
Feature activation-7.06e+00

Pos logit contributions
eling+0.310
ited+0.263
ering+0.255
i+0.247
ience+0.245
Neg logit contributions
 name-0.281
-0.258
 to-0.256
 They-0.250
 names-0.248
Token Sus
Token ID8932
Feature activation-9.55e+00

Pos logit contributions
eling+0.411
ie+0.375
i+0.370
ited+0.363
ering+0.351
Neg logit contributions
 name-0.343
 names-0.306
Circ-0.298
 road-0.289
 fence-0.289
Tokenie
Token ID494
Feature activation-2.76e+01

Pos logit contributions
eling+0.238
ience+0.179
ited+0.167
ises+0.147
mmm+0.145
Neg logit contributions
 to-0.758
 name-0.757
-0.744
 corner-0.733
 yard-0.721
Token.
Token ID13
Feature activation+3.13e+00

Pos logit contributions
eling+2.068
i+1.851
ited+1.825
ie+1.812
ering+1.796
Neg logit contributions
 name-1.000
 names-0.865
Circ-0.801
 Bang-0.788
 They-0.785
Token She
Token ID1375
Feature activation+2.58e+00

Pos logit contributions
 name+0.107
 names+0.089
 fence+0.088
Circ+0.088
 Bang+0.084
Neg logit contributions
eling-0.226
ie-0.206
i-0.205
ited-0.203
heart-0.201
Token was
Token ID373
Feature activation-1.83e+00

Pos logit contributions
 name+0.126
+0.112
 names+0.108
 They+0.108
 to+0.108
Neg logit contributions
eling-0.170
ited-0.149
ering-0.146
i-0.144
heart-0.141
Token only
Token ID691
Feature activation+1.14e+00

Pos logit contributions
eling+0.124
i+0.108
ited+0.107
ering+0.105
ie+0.104
Neg logit contributions
 name-0.078
 names-0.070
 fence-0.066
Circ-0.065
-0.065
Token three
Token ID1115
Feature activation+3.41e+00

Pos logit contributions
 name+0.036
+0.031
 names+0.031
 fence+0.031
 They+0.031
Neg logit contributions
eling-0.084
i-0.078
ited-0.075
ering-0.074
ie-0.073