TOP ACTIVATIONS
MAX = 3.048

candy and ride in his toy je ep . One day
ally , stay away from that saw , it 's very
can we go in the toy store ?" Her mom my
I will never go to that place again !" the boy
But Tim my was having so much fun playing that he
kind to C aryn from that day on .
grandma . She lives in that house ," Anna says and
Jones . He lives in that building over there . He
!" To her su prise , the thunder went
them . They cried and bl ed and hurt .
: But I was having so much fun !
, scary cat and got very scared . " Oh no
got off the couch and tr ud ged out of the
Her mom sighed and kn elt down next to her
little 3 year old boy cl apping his hands .
head . Ben cried and bl ed . Sam screamed and
She said , " Hi p anda ! What smells so
said , " You had so much fun outside . Let
She loved going to the toy store with her mom .
bought two hot dogs with k etchup . Max and Sam

MIN ACTIVATIONS
MIN = -2.949

Okay , Anna . That sounds like fun . But remember
him . Tom falls on the ground . He gets dirty
He threw the pieces on the ground and started to cry
, it hopped up on the stand . The mouse thought
noise as well . It sounds like a sn ore .
ily said , " That sounds great , grandma . Thank
, " OK , that sounds fun . What things do
She said , " That sounds fun . I will draw
the leaves flying all over the place . She decided to
- tap - tap on the leaf . It was a
friends found some sticks on the ground .
her hand and fell on the floor . It broke into
the rope and fell on the stick . He y el
Okay , Tom . That sounds fun . Let 's go
mom said , " That sounds very exciting . But why
the face . Little boy smiled shy ly and
. It was rolling down the hill . It was going
Sam says . " That sounds fun ." They
, " OK . That sounds good . That sounds comfortable
saw a tie lying on the ground . She picked it

INTERVAL 1.549 - 3.048
CONTAINS 0.317%

friend ?" The fake pup smiled and said "
puppy back to its family who were over joy ed to
the stick and chased the squirrel . The squirrel ran up
when they united to fly k ites in the bright blue
told the rabbit , " I am so hungry , I

INTERVAL 0.050 - 1.549
CONTAINS 45.300%

The monster stormed off and the cookies were safely back
at the girl and said , " Thank you so much
â„¢ s always worth asking for help when you need it
wild . <|endoftext|> Once upon a time , there was a
! That 's a really big squash . Maybe we can

INTERVAL -1.449 - 0.050
CONTAINS 53.866%

said , " Time to go home , Jimmy ." It
secret , but she didn 't tell anyone . She liked
The big goose flew until it reached its home .
- she had managed the waste all by herself . Plus
They said , " Thank you , mom . We love

INTERVAL -2.949 - -1.449
CONTAINS 0.516%

and his boat arrived on the shore . The boat and
had rescued the bird . From then on , he always
little girl named Grace . She was three years old and
!" Ben says . They are not happy .
sweet , satisfied smile on his face as he played in
Token candy
Token ID18550
Feature activation+8.97e-01

Pos logit contributions
idd+1.210
rol+1.202
und+1.081
iw+1.072
arre+1.059
Neg logit contributions
The-0.767
 finish-0.746
 ends-0.735
 before-0.731
In-0.729
Token and
Token ID290
Feature activation-8.76e-02

Pos logit contributions
idd+1.893
rol+1.875
und+1.717
iw+1.706
arre+1.693
Neg logit contributions
The-0.639
 finish-0.617
 before-0.611
 ends-0.594
In-0.589
Token ride
Token ID6594
Feature activation-2.05e-01

Pos logit contributions
The+0.092
In+0.087
 grant+0.087
 ends+0.087
 finish+0.087
Neg logit contributions
idd-0.149
rol-0.148
iw-0.137
und-0.135
arre-0.131
Token in
Token ID287
Feature activation+4.63e-01

Pos logit contributions
 grant+0.203
The+0.201
 finish+0.199
In+0.196
 ends+0.193
Neg logit contributions
idd-0.365
rol-0.362
und-0.333
iw-0.330
arre-0.319
Token his
Token ID465
Feature activation+9.35e-01

Pos logit contributions
idd+1.152
rol+1.144
und+1.069
iw+1.066
arre+1.051
Neg logit contributions
The-0.144
 finish-0.130
 grant-0.124
In-0.124
idi-0.122
Token toy
Token ID13373
Feature activation+3.05e+00

Pos logit contributions
rol+1.740
idd+1.716
und+1.552
iw+1.539
arre+1.521
Neg logit contributions
The-0.915
 finish-0.892
In-0.866
 ends-0.864
idi-0.864
Token je
Token ID11223
Feature activation+1.45e+00

Pos logit contributions
rol+5.749
idd+5.382
rou+5.217
arre+5.074
und+4.850
Neg logit contributions
 finish-3.395
The-3.305
 ends-3.260
 the-3.221
ers-3.212
Tokenep
Token ID538
Feature activation-3.05e-02

Pos logit contributions
rol+2.194
idd+2.160
iw+1.911
und+1.910
arre+1.885
Neg logit contributions
The-1.936
In-1.852
ers-1.835
 finish-1.810
 ends-1.779
Token.
Token ID13
Feature activation+6.58e-01

Pos logit contributions
The+0.025
 grant+0.025
idi+0.025
 finish+0.024
In+0.024
Neg logit contributions
rol-0.059
idd-0.059
und-0.054
iw-0.053
arre-0.053
Token One
Token ID1881
Feature activation-3.63e-01

Pos logit contributions
idd+1.143
rol+1.132
und+1.022
iw+1.017
arre+1.000
Neg logit contributions
The-0.709
 finish-0.686
In-0.676
idi-0.671
 ends-0.670
Token day
Token ID1110
Feature activation+5.28e-01

Pos logit contributions
 grant+0.344
ers+0.326
idi+0.324
The+0.321
 finish+0.320
Neg logit contributions
idd-0.661
rol-0.641
iw-0.601
und-0.589
arre-0.579
Tokenally
Token ID453
Feature activation-5.67e-01

Pos logit contributions
idd+1.956
rol+1.950
iw+1.741
und+1.740
arre+1.694
Neg logit contributions
The-1.204
 finish-1.163
In-1.162
 ends-1.137
 before-1.127
Token,
Token ID11
Feature activation-2.33e-01

Pos logit contributions
The+0.534
In+0.516
idi+0.514
 finish+0.510
 grant+0.509
Neg logit contributions
idd-1.067
rol-1.014
und-0.975
iw-0.945
arre-0.928
Token stay
Token ID2652
Feature activation+3.79e-01

Pos logit contributions
The+0.232
idi+0.231
ers+0.225
In+0.225
 finish+0.224
Neg logit contributions
idd-0.405
rol-0.401
und-0.367
iw-0.364
arre-0.360
Token away
Token ID1497
Feature activation-1.15e-01

Pos logit contributions
idd+0.658
rol+0.650
und+0.590
iw+0.586
arre+0.576
Neg logit contributions
The-0.404
 finish-0.389
 ends-0.387
 grant-0.387
In-0.386
Token from
Token ID422
Feature activation+1.06e+00

Pos logit contributions
The+0.093
idi+0.091
In+0.090
 grant+0.089
 ends+0.089
Neg logit contributions
idd-0.224
rol-0.223
und-0.207
iw-0.203
arre-0.201
Token that
Token ID326
Feature activation+3.03e+00

Pos logit contributions
rol+2.449
idd+2.441
und+2.252
iw+2.241
arre+2.240
Neg logit contributions
The-0.534
 finish-0.521
 before-0.514
 ends-0.481
 grant-0.471
Token saw
Token ID2497
Feature activation+1.52e+00

Pos logit contributions
rol+7.007
idd+6.745
rou+6.232
iw+5.959
und+5.906
Neg logit contributions
 finish-2.530
The-2.490
 before-2.489
 the-2.457
 entire-2.381
Token,
Token ID11
Feature activation+1.90e-01

Pos logit contributions
idd+3.050
rol+3.026
iw+2.791
arre+2.743
und+2.728
Neg logit contributions
The-1.290
 before-1.264
 finish-1.214
 ends-1.204
ers-1.170
Token it
Token ID340
Feature activation+4.85e-01

Pos logit contributions
idd+0.351
rol+0.348
und+0.318
iw+0.316
arre+0.312
Neg logit contributions
The-0.176
idi-0.170
 finish-0.170
In-0.169
 ends-0.168
Token's
Token ID338
Feature activation+1.88e-01

Pos logit contributions
idd+0.856
rol+0.849
und+0.770
iw+0.765
arre+0.754
Neg logit contributions
The-0.498
 finish-0.481
In-0.474
idi-0.472
 grant-0.471
Token very
Token ID845
Feature activation+1.46e+00

Pos logit contributions
idd+0.338
rol+0.337
und+0.306
iw+0.303
arre+0.301
Neg logit contributions
The-0.187
 grant-0.177
 finish-0.177
In-0.176
idi-0.175
Token can
Token ID460
Feature activation-4.12e-01

Pos logit contributions
idi+0.468
The+0.456
ers+0.455
In+0.438
 finish+0.430
Neg logit contributions
idd-0.753
rol-0.742
iw-0.680
und-0.670
arre-0.653
Token we
Token ID356
Feature activation+5.87e-02

Pos logit contributions
idi+0.494
ers+0.484
The+0.477
In+0.475
 finish+0.472
Neg logit contributions
idd-0.637
rol-0.618
iw-0.563
und-0.556
 Mechan-0.532
Token go
Token ID467
Feature activation+4.54e-02

Pos logit contributions
idd+0.115
rol+0.115
und+0.105
iw+0.105
arre+0.103
Neg logit contributions
The-0.047
idi-0.046
 ends-0.045
ers-0.045
 grant-0.044
Token in
Token ID287
Feature activation+5.04e-01

Pos logit contributions
idd+0.085
rol+0.084
iw+0.076
und+0.076
arre+0.074
Neg logit contributions
The-0.042
 grant-0.040
 ends-0.040
idi-0.040
In-0.040
Token the
Token ID262
Feature activation+2.05e+00

Pos logit contributions
idd+1.153
rol+1.145
und+1.063
iw+1.059
arre+1.045
Neg logit contributions
The-0.256
 finish-0.239
In-0.231
 grant-0.230
 ends-0.230
Token toy
Token ID13373
Feature activation+2.96e+00

Pos logit contributions
rol+3.566
idd+3.532
iw+3.153
und+3.151
arre+3.085
Neg logit contributions
 finish-2.491
The-2.471
In-2.330
 ends-2.325
 sun-2.246
Token store
Token ID3650
Feature activation+4.27e-01

Pos logit contributions
rol+5.797
idd+5.614
iw+5.181
arre+5.117
rou+5.079
Neg logit contributions
 finish-3.309
The-3.080
 before-3.054
 the-2.935
 ends-2.915
Token?"
Token ID1701
Feature activation-8.99e-03

Pos logit contributions
idd+0.739
rol+0.731
und+0.662
iw+0.659
arre+0.647
Neg logit contributions
The-0.452
 finish-0.437
In-0.433
 grant-0.433
idi-0.433
Token Her
Token ID2332
Feature activation+3.45e-01

Pos logit contributions
The+0.009
 ends+0.009
 grant+0.009
 finish+0.009
ers+0.009
Neg logit contributions
idd-0.016
rol-0.015
und-0.014
iw-0.014
arre-0.014
Token mom
Token ID1995
Feature activation+2.92e-01

Pos logit contributions
idd+0.764
rol+0.752
iw+0.697
und+0.697
arre+0.688
Neg logit contributions
The-0.205
 finish-0.192
In-0.188
 grant-0.188
idi-0.187
Tokenmy
Token ID1820
Feature activation+2.89e-01

Pos logit contributions
idd+0.367
rol+0.360
und+0.312
iw+0.310
arre+0.304
Neg logit contributions
The-0.451
 finish-0.442
 grant-0.440
In-0.437
 ends-0.436
TokenI
Token ID40
Feature activation+9.16e-01

Pos logit contributions
idd+0.712
rol+0.706
und+0.632
iw+0.631
arre+0.619
Neg logit contributions
The-0.502
 finish-0.493
 ends-0.484
 grant-0.484
idi-0.481
Token will
Token ID481
Feature activation+6.28e-02

Pos logit contributions
idd+1.552
rol+1.548
und+1.392
iw+1.382
arre+1.346
Neg logit contributions
The-1.020
 finish-1.017
 grant-0.984
In-0.983
 before-0.980
Token never
Token ID1239
Feature activation-7.31e-01

Pos logit contributions
idd+0.128
rol+0.127
und+0.117
iw+0.116
arre+0.114
Neg logit contributions
The-0.046
idi-0.044
 ends-0.044
In-0.043
ers-0.042
Token go
Token ID467
Feature activation-2.13e-01

Pos logit contributions
The+0.590
ers+0.571
idi+0.567
In+0.559
 ends+0.536
Neg logit contributions
idd-1.407
rol-1.392
arre-1.321
iw-1.290
und-1.283
Token to
Token ID284
Feature activation+2.68e-01

Pos logit contributions
The+0.221
 ends+0.216
In+0.215
idi+0.213
 grant+0.212
Neg logit contributions
idd-0.372
rol-0.367
iw-0.330
und-0.330
arre-0.323
Token that
Token ID326
Feature activation+2.92e+00

Pos logit contributions
idd+0.587
rol+0.582
und+0.539
iw+0.539
arre+0.528
Neg logit contributions
The-0.156
In-0.148
idi-0.146
 ends-0.145
 finish-0.144
Token place
Token ID1295
Feature activation-1.44e-01

Pos logit contributions
rol+6.134
idd+5.887
und+5.383
iw+5.354
arre+5.218
Neg logit contributions
 finish-2.928
 before-2.849
The-2.840
 the-2.703
 entire-2.640
Token again
Token ID757
Feature activation-2.47e-02

Pos logit contributions
The+0.150
In+0.146
idi+0.144
 finish+0.142
 grant+0.142
Neg logit contributions
rol-0.249
idd-0.248
und-0.224
iw-0.223
arre-0.215
Token!"
Token ID2474
Feature activation-3.73e-01

Pos logit contributions
The+0.027
In+0.026
 grant+0.026
 ends+0.026
 finish+0.026
Neg logit contributions
idd-0.041
rol-0.040
und-0.037
iw-0.037
arre-0.036
Token the
Token ID262
Feature activation+1.92e-01

Pos logit contributions
In+0.256
 ends+0.255
ers+0.255
idi+0.254
The+0.252
Neg logit contributions
idd-0.754
rol-0.747
und-0.701
iw-0.696
arre-0.672
Token boy
Token ID2933
Feature activation-1.14e-01

Pos logit contributions
idd+0.373
rol+0.363
iw+0.337
und+0.333
arre+0.330
Neg logit contributions
The-0.167
In-0.161
 grant-0.160
 finish-0.158
idi-0.157
Token But
Token ID887
Feature activation+1.75e-01

Pos logit contributions
idd+1.009
rol+1.000
und+0.902
iw+0.899
arre+0.883
Neg logit contributions
The-0.630
 finish-0.609
In-0.601
 ends-0.597
idi-0.596
Token Tim
Token ID5045
Feature activation-8.03e-02

Pos logit contributions
idd+0.281
rol+0.278
und+0.252
iw+0.250
arre+0.242
Neg logit contributions
The-0.204
 grant-0.199
 finish-0.199
 ends-0.197
idi-0.197
Tokenmy
Token ID1820
Feature activation-1.76e-01

Pos logit contributions
The+0.129
 grant+0.128
 finish+0.127
In+0.125
idi+0.125
Neg logit contributions
idd-0.095
rol-0.092
und-0.082
arre-0.080
iw-0.080
Token was
Token ID373
Feature activation+1.39e-01

Pos logit contributions
The+0.180
 grant+0.178
 finish+0.174
In+0.172
 ends+0.172
Neg logit contributions
idd-0.308
rol-0.295
und-0.277
arre-0.275
iw-0.271
Token having
Token ID1719
Feature activation+8.09e-01

Pos logit contributions
idd+0.222
rol+0.218
iw+0.195
arre+0.194
und+0.194
Neg logit contributions
The-0.166
 grant-0.162
 ends-0.162
 finish-0.161
In-0.160
Token so
Token ID523
Feature activation+2.92e+00

Pos logit contributions
idd+1.520
rol+1.504
und+1.369
iw+1.365
arre+1.333
Neg logit contributions
The-0.765
 finish-0.739
In-0.720
 before-0.719
 ends-0.717
Token much
Token ID881
Feature activation+4.76e-01

Pos logit contributions
rol+5.706
idd+5.490
und+4.758
iw+4.716
 Mechan+4.656
Neg logit contributions
The-3.460
In-3.375
 time-3.364
 had-3.022
 the-2.985
Token fun
Token ID1257
Feature activation+8.38e-01

Pos logit contributions
idd+0.788
rol+0.779
und+0.702
iw+0.700
arre+0.687
Neg logit contributions
The-0.545
 finish-0.528
 grant-0.521
In-0.521
 ends-0.520
Token playing
Token ID2712
Feature activation+7.25e-01

Pos logit contributions
idd+1.504
rol+1.502
iw+1.350
und+1.348
arre+1.329
Neg logit contributions
The-0.837
 before-0.815
 finish-0.815
In-0.795
idi-0.791
Token that
Token ID326
Feature activation+2.71e-01

Pos logit contributions
idd+1.517
rol+1.510
iw+1.382
und+1.380
arre+1.357
Neg logit contributions
The-0.524
 finish-0.501
 before-0.487
In-0.483
 grant-0.480
Token he
Token ID339
Feature activation-5.47e-03

Pos logit contributions
idd+0.511
rol+0.508
und+0.463
iw+0.463
arre+0.452
Neg logit contributions
The-0.243
In-0.231
 finish-0.231
idi-0.230
 grant-0.230
Token kind
Token ID1611
Feature activation-9.21e-02

Pos logit contributions
 grant+0.317
The+0.313
 ends+0.311
In+0.309
 before+0.301
Neg logit contributions
idd-0.547
rol-0.531
und-0.481
arre-0.478
iw-0.478
Token to
Token ID284
Feature activation-6.37e-02

Pos logit contributions
The+0.069
 grant+0.067
In+0.066
idi+0.065
 finish+0.065
Neg logit contributions
idd-0.189
rol-0.187
und-0.173
iw-0.171
arre-0.169
Token C
Token ID327
Feature activation+9.69e-01

Pos logit contributions
The+0.058
In+0.056
 finish+0.056
 ends+0.056
 grant+0.055
Neg logit contributions
idd-0.119
rol-0.117
und-0.109
iw-0.106
arre-0.104
Tokenaryn
Token ID38621
Feature activation+1.25e-01

Pos logit contributions
idd+1.414
rol+1.410
iw+1.264
und+1.243
arre+1.194
Neg logit contributions
The-1.289
 finish-1.268
In-1.242
idi-1.238
 ends-1.237
Token from
Token ID422
Feature activation+2.41e-01

Pos logit contributions
idd+0.232
rol+0.231
und+0.211
iw+0.210
arre+0.208
Neg logit contributions
The-0.114
In-0.109
 finish-0.108
idi-0.108
 ends-0.107
Token that
Token ID326
Feature activation+2.89e+00

Pos logit contributions
idd+0.495
rol+0.489
und+0.452
iw+0.450
arre+0.439
Neg logit contributions
The-0.178
In-0.168
 finish-0.168
 ends-0.166
 grant-0.165
Token day
Token ID1110
Feature activation-2.61e-01

Pos logit contributions
rol+7.272
idd+7.206
und+6.530
arre+6.484
ays+6.448
Neg logit contributions
 before-1.589
The-1.543
 finish-1.516
 the-1.494
 had-1.445
Token on
Token ID319
Feature activation+1.48e-01

Pos logit contributions
The+0.241
 grant+0.237
 ends+0.237
idi+0.235
In+0.231
Neg logit contributions
idd-0.482
rol-0.476
und-0.441
arre-0.430
iw-0.425
Token.
Token ID13
Feature activation+7.00e-01

Pos logit contributions
idd+0.276
rol+0.273
und+0.253
iw+0.247
arre+0.246
Neg logit contributions
The-0.134
 finish-0.130
 grant-0.128
In-0.128
 ends-0.128
Token
Token ID198
Feature activation-9.32e-02

Pos logit contributions
idd+1.368
rol+1.356
und+1.239
iw+1.238
arre+1.218
Neg logit contributions
The-0.591
 finish-0.562
In-0.555
idi-0.550
 grant-0.546
Token
Token ID198
Feature activation+4.48e-01

Pos logit contributions
 finish+0.076
 ends+0.075
 before+0.074
 grant+0.073
The+0.071
Neg logit contributions
idd-0.183
rol-0.182
und-0.167
iw-0.163
arre-0.163
Token grandma
Token ID49890
Feature activation-3.82e-01

Pos logit contributions
rol+1.838
idd+1.828
und+1.661
iw+1.638
arre+1.633
Neg logit contributions
The-0.912
 finish-0.893
 ends-0.877
 before-0.867
 grant-0.856
Token.
Token ID13
Feature activation+5.00e-01

Pos logit contributions
The+0.358
In+0.355
idi+0.348
 grant+0.344
 finish+0.340
Neg logit contributions
idd-0.699
rol-0.692
iw-0.635
und-0.631
arre-0.621
Token She
Token ID1375
Feature activation+1.63e-01

Pos logit contributions
idd+0.877
rol+0.870
und+0.787
iw+0.784
arre+0.770
Neg logit contributions
The-0.516
 finish-0.501
In-0.492
 ends-0.491
idi-0.490
Token lives
Token ID3160
Feature activation+2.97e-02

Pos logit contributions
idd+0.266
rol+0.262
und+0.236
iw+0.236
arre+0.230
Neg logit contributions
The-0.188
 finish-0.181
In-0.180
idi-0.179
 before-0.177
Token in
Token ID287
Feature activation+1.07e+00

Pos logit contributions
idd+0.057
rol+0.056
iw+0.052
und+0.052
arre+0.050
Neg logit contributions
The-0.025
 finish-0.025
 grant-0.024
idi-0.024
 ends-0.024
Token that
Token ID326
Feature activation+2.87e+00

Pos logit contributions
idd+2.611
rol+2.595
iw+2.398
und+2.388
arre+2.384
Neg logit contributions
The-0.448
 finish-0.391
 ends-0.368
 before-0.367
In-0.357
Token house
Token ID2156
Feature activation-1.57e-01

Pos logit contributions
idd+6.480
rol+6.380
iw+5.712
und+5.678
rou+5.596
Neg logit contributions
 finish-2.384
The-2.320
 ends-2.302
 the-2.284
 before-2.282
Token,"
Token ID553
Feature activation-1.01e+00

Pos logit contributions
The+0.138
In+0.133
idi+0.133
 grant+0.132
 finish+0.132
Neg logit contributions
idd-0.294
rol-0.293
und-0.269
iw-0.267
arre-0.261
Token Anna
Token ID11735
Feature activation+3.87e-01

Pos logit contributions
In+0.706
ers+0.695
ace+0.687
 finish+0.669
 ends+0.663
Neg logit contributions
idd-2.053
iw-1.932
rol-1.931
und-1.862
 batch-1.853
Token says
Token ID1139
Feature activation+2.05e-01

Pos logit contributions
idd+0.594
rol+0.587
und+0.524
iw+0.522
arre+0.514
Neg logit contributions
The-0.485
 finish-0.472
In-0.468
 grant-0.464
 ends-0.463
Token and
Token ID290
Feature activation+5.77e-01

Pos logit contributions
idd+0.403
rol+0.399
iw+0.364
und+0.364
arre+0.358
Neg logit contributions
The-0.169
In-0.162
 finish-0.161
idi-0.160
 ends-0.158
Token Jones
Token ID5437
Feature activation-4.66e-01

Pos logit contributions
rol+1.523
idd+1.512
und+1.326
arre+1.310
iw+1.307
Neg logit contributions
The-1.553
 grant-1.534
 finish-1.532
 before-1.526
 ends-1.499
Token.
Token ID13
Feature activation+5.51e-01

Pos logit contributions
The+0.463
In+0.449
idi+0.446
 finish+0.436
 grant+0.431
Neg logit contributions
idd-0.834
rol-0.816
iw-0.756
und-0.755
arre-0.728
Token He
Token ID679
Feature activation+2.01e-01

Pos logit contributions
idd+1.025
rol+1.017
und+0.926
iw+0.923
arre+0.908
Neg logit contributions
The-0.514
 finish-0.496
In-0.487
 ends-0.485
 grant-0.485
Token lives
Token ID3160
Feature activation+8.60e-02

Pos logit contributions
idd+0.339
rol+0.333
und+0.302
iw+0.302
arre+0.295
Neg logit contributions
The-0.223
 finish-0.213
idi-0.212
In-0.212
 before-0.210
Token in
Token ID287
Feature activation+1.05e+00

Pos logit contributions
idd+0.174
rol+0.173
iw+0.159
und+0.159
arre+0.154
Neg logit contributions
The-0.064
 finish-0.062
 grant-0.061
idi-0.061
In-0.060
Token that
Token ID326
Feature activation+2.86e+00

Pos logit contributions
idd+2.583
rol+2.572
iw+2.374
und+2.367
arre+2.367
Neg logit contributions
The-0.419
 finish-0.366
 before-0.349
 ends-0.344
In-0.331
Token building
Token ID2615
Feature activation+4.89e-01

Pos logit contributions
rol+6.329
idd+6.183
und+5.639
iw+5.608
arre+5.507
Neg logit contributions
 finish-2.658
 ends-2.505
 before-2.470
The-2.411
 the-2.370
Token over
Token ID625
Feature activation+9.78e-01

Pos logit contributions
idd+0.965
rol+0.958
und+0.878
iw+0.874
arre+0.860
Neg logit contributions
The-0.398
 finish-0.382
In-0.375
idi-0.372
 grant-0.372
Token there
Token ID612
Feature activation+6.97e-01

Pos logit contributions
idd+2.094
rol+2.086
und+1.910
iw+1.908
arre+1.898
Neg logit contributions
The-0.668
 finish-0.621
 before-0.620
 ends-0.607
In-0.597
Token.
Token ID13
Feature activation+3.94e-01

Pos logit contributions
idd+1.392
rol+1.377
und+1.262
iw+1.259
arre+1.241
Neg logit contributions
The-0.558
 finish-0.537
 ends-0.523
 before-0.522
In-0.521
Token He
Token ID679
Feature activation+4.34e-01

Pos logit contributions
idd+0.681
rol+0.675
und+0.609
iw+0.608
arre+0.594
Neg logit contributions
The-0.416
 finish-0.405
 ends-0.398
In-0.398
idi-0.396
Token!"
Token ID2474
Feature activation-3.37e-01

Pos logit contributions
idd+0.767
rol+0.759
und+0.673
iw+0.669
arre+0.655
Neg logit contributions
The-0.697
 finish-0.680
In-0.671
 ends-0.669
 grant-0.669
Token
Token ID198
Feature activation-5.48e-01

Pos logit contributions
idi+0.254
 ends+0.248
The+0.247
 finish+0.245
 grant+0.244
Neg logit contributions
idd-0.664
rol-0.662
und-0.624
iw-0.611
arre-0.603
Token
Token ID198
Feature activation-5.45e-01

Pos logit contributions
 ends+0.490
 finish+0.487
 before+0.480
 grant+0.474
idi+0.469
Neg logit contributions
idd-1.005
rol-0.999
und-0.937
iw-0.897
arre-0.888
TokenTo
Token ID2514
Feature activation+2.21e-02

Pos logit contributions
 before+0.301
 ends+0.298
ers+0.285
 finish+0.282
idi+0.282
Neg logit contributions
idd-1.182
rol-1.171
und-1.111
arre-1.096
iw-1.076
Token her
Token ID607
Feature activation-2.04e-01

Pos logit contributions
rol+0.042
idd+0.042
und+0.039
iw+0.038
arre+0.038
Neg logit contributions
The-0.018
In-0.018
idi-0.018
 ends-0.018
 finish-0.017
Token su
Token ID424
Feature activation+2.86e+00

Pos logit contributions
The+0.236
ers+0.233
In+0.230
 grant+0.227
 before+0.224
Neg logit contributions
idd-0.328
rol-0.315
und-0.293
iw-0.289
arre-0.289
Tokenprise
Token ID7919
Feature activation+2.88e-01

Pos logit contributions
idd+3.760
rol+3.737
 Mechan+3.330
iw+3.311
ays+3.252
Neg logit contributions
The-4.585
In-4.487
 finish-4.181
ers-4.079
idi-3.980
Token,
Token ID11
Feature activation-7.48e-02

Pos logit contributions
idd+0.500
rol+0.496
und+0.452
iw+0.444
arre+0.440
Neg logit contributions
The-0.304
In-0.291
 finish-0.290
 grant-0.287
idi-0.285
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
The+0.027
 finish+0.027
idi+0.026
In+0.026
 ends+0.026
Neg logit contributions
idd-0.177
rol-0.177
und-0.165
iw-0.164
arre-0.161
Token thunder
Token ID18355
Feature activation-6.10e-01

Pos logit contributions
The+0.121
 grant+0.118
idi+0.117
In+0.117
 finish+0.115
Neg logit contributions
idd-0.189
rol-0.181
arre-0.168
und-0.165
iw-0.164
Token went
Token ID1816
Feature activation+5.55e-01

Pos logit contributions
The+0.682
 grant+0.671
In+0.663
 finish+0.659
ace+0.657
Neg logit contributions
idd-1.004
rol-0.992
und-0.918
arre-0.881
iw-0.843
Token them
Token ID606
Feature activation+1.10e+00

Pos logit contributions
The+0.440
 grant+0.439
In+0.438
idi+0.429
 finish+0.421
Neg logit contributions
idd-0.689
rol-0.689
und-0.621
iw-0.614
ays-0.600
Token.
Token ID13
Feature activation-4.35e-01

Pos logit contributions
idd+2.351
rol+2.315
iw+2.158
und+2.114
arre+2.106
Neg logit contributions
The-0.781
 finish-0.762
 before-0.744
 ends-0.738
In-0.709
Token They
Token ID1119
Feature activation-2.14e-01

Pos logit contributions
idi+0.351
 finish+0.349
 ends+0.349
 grant+0.341
ers+0.338
Neg logit contributions
idd-0.846
rol-0.831
und-0.782
iw-0.766
arre-0.757
Token cried
Token ID16896
Feature activation+8.40e-01

Pos logit contributions
The+0.252
idi+0.250
 finish+0.246
 grant+0.245
In+0.244
Neg logit contributions
idd-0.338
rol-0.333
und-0.303
iw-0.298
arre-0.287
Token and
Token ID290
Feature activation+7.33e-01

Pos logit contributions
idd+1.917
rol+1.902
iw+1.767
und+1.757
arre+1.723
Neg logit contributions
The-0.463
 finish-0.431
 before-0.421
 ends-0.415
In-0.413
Token bl
Token ID698
Feature activation+2.80e+00

Pos logit contributions
idd+1.187
rol+1.177
iw+1.052
und+1.052
arre+1.036
Neg logit contributions
The-0.874
 finish-0.854
 ends-0.836
In-0.836
 grant-0.836
Tokened
Token ID276
Feature activation+9.22e-01

Pos logit contributions
rol+4.403
idd+4.364
 batch+4.093
iw+4.049
 Mechan+4.008
Neg logit contributions
The-3.342
In-3.294
ers-3.238
ace-3.226
 ends-3.210
Token and
Token ID290
Feature activation+9.26e-01

Pos logit contributions
idd+1.989
rol+1.965
iw+1.820
und+1.800
arre+1.780
Neg logit contributions
The-0.618
 finish-0.588
 before-0.582
 ends-0.578
In-0.560
Token hurt
Token ID5938
Feature activation+2.37e-01

Pos logit contributions
idd+1.608
rol+1.597
und+1.432
iw+1.430
arre+1.422
Neg logit contributions
The-1.006
 finish-0.984
 before-0.962
 ends-0.960
 grant-0.955
Token.
Token ID13
Feature activation-8.38e-01

Pos logit contributions
idd+0.467
rol+0.462
und+0.425
iw+0.419
arre+0.414
Neg logit contributions
The-0.193
 finish-0.190
 grant-0.188
idi-0.188
In-0.186
Token
Token ID198
Feature activation-5.86e-01

Pos logit contributions
 finish+0.815
isy+0.808
 ends+0.801
ers+0.794
idi+0.791
Neg logit contributions
idd-1.518
rol-1.452
und-1.401
iw-1.348
arre-1.340
Token:
Token ID25
Feature activation+3.66e-01

Pos logit contributions
The+0.826
In+0.784
 grant+0.752
 ends+0.745
 finish+0.722
Neg logit contributions
idd-2.748
rol-2.568
arre-2.487
und-2.484
rou-2.409
Token But
Token ID887
Feature activation+2.17e-01

Pos logit contributions
idd+0.702
rol+0.692
und+0.636
iw+0.632
arre+0.624
Neg logit contributions
The-0.317
 finish-0.307
 ends-0.302
In-0.301
idi-0.300
Token I
Token ID314
Feature activation+6.45e-01

Pos logit contributions
idd+0.458
rol+0.453
und+0.420
iw+0.418
arre+0.410
Neg logit contributions
The-0.145
 finish-0.138
In-0.137
 ends-0.136
 grant-0.136
Token was
Token ID373
Feature activation-7.22e-02

Pos logit contributions
idd+1.171
rol+1.165
und+1.055
iw+1.054
arre+1.030
Neg logit contributions
The-0.632
 finish-0.614
In-0.601
 grant-0.597
 ends-0.597
Token having
Token ID1719
Feature activation+9.67e-01

Pos logit contributions
The+0.086
 grant+0.083
In+0.083
 ends+0.083
idi+0.081
Neg logit contributions
idd-0.118
rol-0.114
iw-0.102
arre-0.101
und-0.101
Token so
Token ID523
Feature activation+2.79e+00

Pos logit contributions
idd+1.809
rol+1.797
iw+1.638
und+1.628
arre+1.578
Neg logit contributions
The-0.924
 finish-0.897
 ends-0.872
 before-0.868
In-0.861
Token much
Token ID881
Feature activation+4.70e-01

Pos logit contributions
rol+5.231
idd+4.878
iw+4.533
bat+4.419
 Mechan+4.326
Neg logit contributions
The-3.409
In-3.230
 time-3.178
 finish-3.135
 the-2.982
Token fun
Token ID1257
Feature activation+7.25e-01

Pos logit contributions
idd+0.770
rol+0.760
und+0.685
iw+0.681
arre+0.670
Neg logit contributions
The-0.547
 finish-0.529
In-0.523
 grant-0.523
 ends-0.522
Token!
Token ID0
Feature activation+3.72e-01

Pos logit contributions
idd+1.270
rol+1.267
iw+1.142
und+1.140
arre+1.118
Neg logit contributions
The-0.751
 finish-0.729
 before-0.717
In-0.712
 ends-0.712
Token
Token ID198
Feature activation+2.24e-01

Pos logit contributions
idd+0.707
rol+0.700
und+0.640
iw+0.637
arre+0.627
Neg logit contributions
The-0.329
 finish-0.319
 ends-0.314
 grant-0.313
idi-0.312
Token
Token ID198
Feature activation+2.16e-01

Pos logit contributions
idd+0.448
rol+0.442
und+0.408
iw+0.405
arre+0.401
Neg logit contributions
 finish-0.171
The-0.170
 ends-0.169
 grant-0.168
 before-0.166
Token,
Token ID11
Feature activation-2.84e-01

Pos logit contributions
idd+1.029
rol+1.021
und+0.931
iw+0.927
arre+0.912
Neg logit contributions
The-0.501
 finish-0.484
In-0.474
 ends-0.473
 grant-0.472
Token scary
Token ID14343
Feature activation+9.07e-01

Pos logit contributions
The+0.293
idi+0.291
 finish+0.288
In+0.287
ers+0.285
Neg logit contributions
idd-0.476
rol-0.469
iw-0.435
und-0.426
arre-0.421
Token cat
Token ID3797
Feature activation+1.01e+00

Pos logit contributions
rol+1.721
idd+1.705
und+1.541
iw+1.538
arre+1.521
Neg logit contributions
The-0.826
 finish-0.808
 ends-0.791
 grant-0.790
 before-0.782
Token and
Token ID290
Feature activation+1.91e-01

Pos logit contributions
rol+2.030
idd+2.023
iw+1.842
arre+1.827
und+1.826
Neg logit contributions
The-0.821
 finish-0.777
 before-0.765
In-0.762
 grant-0.758
Token got
Token ID1392
Feature activation+1.63e+00

Pos logit contributions
idd+0.299
rol+0.295
und+0.266
iw+0.265
arre+0.255
Neg logit contributions
The-0.229
 finish-0.227
 ends-0.223
In-0.222
idi-0.222
Token very
Token ID845
Feature activation+2.79e+00

Pos logit contributions
idd+2.832
rol+2.768
und+2.593
arre+2.551
iw+2.515
Neg logit contributions
The-1.783
 finish-1.754
 ends-1.715
 grant-1.706
 before-1.695
Token scared
Token ID12008
Feature activation+8.97e-01

Pos logit contributions
rol+5.523
idd+5.431
rob+5.197
arre+5.026
und+5.016
Neg logit contributions
The-2.745
 old-2.742
 finish-2.691
 the-2.678
 in-2.631
Token.
Token ID13
Feature activation+3.69e-01

Pos logit contributions
idd+1.905
rol+1.886
iw+1.741
und+1.733
arre+1.715
Neg logit contributions
The-0.642
 before-0.602
 finish-0.599
 ends-0.582
In-0.581
Token "
Token ID366
Feature activation+9.99e-01

Pos logit contributions
idd+0.599
rol+0.594
und+0.536
iw+0.533
arre+0.520
Neg logit contributions
The-0.426
 finish-0.418
 ends-0.412
idi-0.411
 grant-0.410
TokenOh
Token ID5812
Feature activation+2.16e+00

Pos logit contributions
idd+1.736
rol+1.722
iw+1.548
und+1.544
arre+1.516
Neg logit contributions
The-1.129
In-1.056
 finish-1.038
 grant-1.027
idi-1.023
Token no
Token ID645
Feature activation+1.65e+00

Pos logit contributions
rol+4.237
idd+4.223
arre+3.921
und+3.807
ays+3.753
Neg logit contributions
The-2.033
 grant-2.000
 before-1.955
 finish-1.941
 be-1.940
Token got
Token ID1392
Feature activation+6.55e-01

Pos logit contributions
idd+0.804
rol+0.798
und+0.724
iw+0.719
arre+0.709
Neg logit contributions
The-0.450
 finish-0.434
In-0.428
 grant-0.428
 ends-0.427
Token off
Token ID572
Feature activation+1.87e-01

Pos logit contributions
idd+1.293
rol+1.283
und+1.178
iw+1.173
arre+1.155
Neg logit contributions
The-0.547
 finish-0.528
In-0.513
 ends-0.512
idi-0.511
Token the
Token ID262
Feature activation+4.90e-01

Pos logit contributions
idd+0.401
rol+0.399
und+0.369
iw+0.367
arre+0.360
Neg logit contributions
The-0.116
In-0.110
 ends-0.110
 finish-0.110
 grant-0.109
Token couch
Token ID18507
Feature activation+1.02e+00

Pos logit contributions
idd+0.851
rol+0.844
und+0.763
iw+0.760
arre+0.747
Neg logit contributions
The-0.522
 finish-0.505
In-0.498
 grant-0.497
idi-0.496
Token and
Token ID290
Feature activation+6.49e-01

Pos logit contributions
idd+2.377
rol+2.344
iw+2.189
und+2.165
arre+2.144
Neg logit contributions
The-0.495
 finish-0.479
 before-0.468
 ends-0.438
In-0.431
Token tr
Token ID491
Feature activation+2.78e+00

Pos logit contributions
idd+1.242
rol+1.231
und+1.123
iw+1.122
arre+1.104
Neg logit contributions
The-0.574
 finish-0.555
In-0.541
 ends-0.541
idi-0.540
Tokenud
Token ID463
Feature activation+4.86e-01

Pos logit contributions
rol+2.665
idd+2.513
iw+2.356
 poss+2.205
 Good+2.182
Neg logit contributions
In-4.969
The-4.947
idi-4.861
 finish-4.736
ace-4.733
Tokenged
Token ID2004
Feature activation+9.35e-01

Pos logit contributions
idd+0.643
rol+0.635
und+0.555
iw+0.551
arre+0.537
Neg logit contributions
The-0.717
 finish-0.702
In-0.694
 ends-0.692
 grant-0.692
Token out
Token ID503
Feature activation-6.06e-02

Pos logit contributions
idd+1.784
rol+1.767
iw+1.618
und+1.601
arre+1.581
Neg logit contributions
The-0.860
 finish-0.827
 before-0.811
 ends-0.810
In-0.804
Token of
Token ID286
Feature activation+8.76e-01

Pos logit contributions
The+0.041
In+0.039
 finish+0.039
 grant+0.039
idi+0.039
Neg logit contributions
rol-0.127
idd-0.127
und-0.117
iw-0.114
arre-0.112
Token the
Token ID262
Feature activation+6.30e-01

Pos logit contributions
idd+2.141
rol+2.128
iw+1.978
und+1.974
arre+1.968
Neg logit contributions
The-0.326
 finish-0.304
 before-0.287
idi-0.283
 ends-0.283
Token
Token ID198
Feature activation-6.57e-01

Pos logit contributions
 finish+0.485
 ends+0.478
 before+0.477
ers+0.474
idi+0.469
Neg logit contributions
idd-0.988
rol-0.946
und-0.907
iw-0.887
arre-0.866
TokenHer
Token ID9360
Feature activation-6.00e-01

Pos logit contributions
 before+0.748
 ends+0.746
ers+0.741
 finish+0.729
 grant+0.721
Neg logit contributions
idd-1.074
rol-0.981
und-0.957
iw-0.945
arre-0.918
Token mom
Token ID1995
Feature activation+1.38e-01

Pos logit contributions
The+0.480
In+0.459
 grant+0.456
ers+0.454
idi+0.447
Neg logit contributions
idd-1.206
rol-1.119
iw-1.078
arre-1.063
rou-1.062
Token sighed
Token ID21893
Feature activation+4.88e-01

Pos logit contributions
idd+0.241
rol+0.236
und+0.215
iw+0.214
arre+0.211
Neg logit contributions
The-0.143
 finish-0.140
 grant-0.140
In-0.138
 ends-0.137
Token and
Token ID290
Feature activation+3.03e-01

Pos logit contributions
idd+1.074
rol+1.067
und+0.987
iw+0.984
arre+0.972
Neg logit contributions
The-0.287
 finish-0.271
In-0.264
 grant-0.262
 ends-0.262
Token kn
Token ID638
Feature activation+2.77e+00

Pos logit contributions
idd+0.515
rol+0.507
und+0.460
iw+0.460
arre+0.450
Neg logit contributions
The-0.327
 finish-0.319
 grant-0.316
In-0.315
 ends-0.315
Tokenelt
Token ID2120
Feature activation+1.27e+00

Pos logit contributions
rol+3.306
idd+3.099
und+2.884
rob+2.824
 batch+2.783
Neg logit contributions
The-4.514
In-4.339
 finish-4.207
ers-4.177
ace-4.121
Token down
Token ID866
Feature activation+4.43e-01

Pos logit contributions
idd+2.153
rol+2.129
und+1.886
iw+1.861
arre+1.855
Neg logit contributions
The-1.506
 ends-1.427
 before-1.421
 finish-1.415
In-1.397
Token next
Token ID1306
Feature activation-6.65e-01

Pos logit contributions
idd+0.897
rol+0.890
und+0.818
iw+0.815
arre+0.803
Neg logit contributions
The-0.341
 finish-0.327
In-0.321
idi-0.320
 grant-0.320
Token to
Token ID284
Feature activation+8.82e-02

Pos logit contributions
 grant+0.570
The+0.568
idi+0.523
In+0.523
 finish+0.504
Neg logit contributions
idd-1.275
rol-1.235
iw-1.164
und-1.156
arre-1.134
Token her
Token ID607
Feature activation-7.58e-01

Pos logit contributions
idd+0.154
rol+0.151
und+0.139
iw+0.138
arre+0.135
Neg logit contributions
The-0.090
 finish-0.087
In-0.087
 ends-0.087
idi-0.087
Token little
Token ID1310
Feature activation+4.44e-01

Pos logit contributions
idd+0.110
rol+0.107
iw+0.100
und+0.100
arre+0.098
Neg logit contributions
The-0.040
In-0.038
 finish-0.038
idi-0.038
 grant-0.037
Token 3
Token ID513
Feature activation+1.03e+00

Pos logit contributions
idd+0.838
rol+0.827
iw+0.756
und+0.755
arre+0.740
Neg logit contributions
The-0.402
 finish-0.387
In-0.383
 grant-0.380
idi-0.379
Token year
Token ID614
Feature activation-4.53e-01

Pos logit contributions
rol+1.459
idd+1.442
und+1.270
iw+1.264
arre+1.253
Neg logit contributions
The-1.441
idi-1.387
 finish-1.377
 ends-1.372
 before-1.370
Token old
Token ID1468
Feature activation-6.02e-01

Pos logit contributions
 grant+0.184
The+0.183
In+0.182
 finish+0.165
idi+0.164
Neg logit contributions
idd-1.089
rol-1.053
iw-1.003
und-0.988
rou-0.977
Token boy
Token ID2933
Feature activation+6.79e-01

Pos logit contributions
In+0.633
 finish+0.632
The+0.631
 grant+0.620
 ends+0.606
Neg logit contributions
idd-1.091
rol-0.994
iw-0.954
und-0.945
 Mechan-0.891
Token cl
Token ID537
Feature activation+2.76e+00

Pos logit contributions
idd+1.135
rol+1.133
und+1.017
iw+1.014
arre+1.001
Neg logit contributions
The-0.758
 finish-0.734
In-0.721
idi-0.720
 grant-0.720
Tokenapping
Token ID5912
Feature activation+3.94e-01

Pos logit contributions
rol+3.817
 batch+3.533
idd+3.496
rob+3.491
iw+3.477
Neg logit contributions
The-3.870
 finish-3.735
ace-3.656
idi-3.592
 before-3.585
Token his
Token ID465
Feature activation-6.03e-01

Pos logit contributions
idd+0.764
rol+0.758
und+0.694
iw+0.687
arre+0.678
Neg logit contributions
The-0.336
 finish-0.325
In-0.319
 grant-0.318
 ends-0.316
Token hands
Token ID2832
Feature activation+9.29e-01

Pos logit contributions
The+0.763
idi+0.751
 grant+0.740
In+0.739
 finish+0.733
Neg logit contributions
idd-0.921
rol-0.888
und-0.826
arre-0.794
iw-0.789
Token.
Token ID13
Feature activation-6.56e-01

Pos logit contributions
idd+2.058
rol+2.044
iw+1.904
und+1.884
arre+1.860
Neg logit contributions
The-0.541
 finish-0.519
 before-0.513
In-0.488
 ends-0.488
Token
Token ID220
Feature activation+4.63e-01

Pos logit contributions
 grant+0.644
ace+0.634
 ends+0.627
ers+0.619
 finish+0.614
Neg logit contributions
idd-1.180
rol-1.137
und-1.073
iw-1.062
arre-1.035
Token head
Token ID1182
Feature activation+5.59e-01

Pos logit contributions
In+1.559
idi+1.530
The+1.413
 behold+1.350
 grant+1.330
Neg logit contributions
idd-2.773
rol-2.734
und-2.458
 Mechan-2.446
iw-2.441
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
idd+1.185
rol+1.176
und+1.084
iw+1.081
arre+1.066
Neg logit contributions
The-0.377
 finish-0.358
In-0.348
 ends-0.347
 grant-0.346
Token Ben
Token ID3932
Feature activation-3.99e-01

Pos logit contributions
 ends+0.417
 grant+0.412
idi+0.411
 finish+0.410
ers+0.405
Neg logit contributions
idd-0.938
rol-0.909
und-0.864
iw-0.851
arre-0.835
Token cried
Token ID16896
Feature activation+9.58e-01

Pos logit contributions
idi+0.500
The+0.489
 finish+0.486
In+0.485
 grant+0.483
Neg logit contributions
idd-0.611
rol-0.586
und-0.543
iw-0.523
arre-0.521
Token and
Token ID290
Feature activation+7.58e-01

Pos logit contributions
idd+2.138
rol+2.117
iw+1.962
und+1.949
arre+1.905
Neg logit contributions
The-0.596
 finish-0.568
 before-0.555
 ends-0.542
In-0.531
Token bl
Token ID698
Feature activation+2.75e+00

Pos logit contributions
idd+1.180
rol+1.172
und+1.040
iw+1.039
arre+1.026
Neg logit contributions
The-0.951
 finish-0.927
 ends-0.913
 grant-0.912
In-0.909
Tokened
Token ID276
Feature activation+1.33e+00

Pos logit contributions
rol+4.302
idd+4.213
 batch+3.929
iw+3.863
rob+3.853
Neg logit contributions
The-3.464
In-3.320
 ends-3.283
ace-3.225
ers-3.165
Token.
Token ID13
Feature activation-8.01e-01

Pos logit contributions
idd+2.971
rol+2.925
iw+2.718
arre+2.661
und+2.659
Neg logit contributions
The-0.841
 before-0.834
 finish-0.816
 ends-0.792
ers-0.727
Token Sam
Token ID3409
Feature activation-4.20e-01

Pos logit contributions
 ends+0.734
 grant+0.733
isy+0.721
 finish+0.718
ers+0.717
Neg logit contributions
idd-1.483
rol-1.409
und-1.359
iw-1.329
arre-1.294
Token screamed
Token ID25421
Feature activation+1.60e+00

Pos logit contributions
The+0.435
 finish+0.433
idi+0.433
 grant+0.428
In+0.428
Neg logit contributions
idd-0.715
rol-0.693
und-0.645
iw-0.627
arre-0.621
Token and
Token ID290
Feature activation+7.17e-01

Pos logit contributions
idd+4.034
rol+4.001
iw+3.791
und+3.689
rob+3.612
Neg logit contributions
 before-0.630
The-0.606
 finish-0.595
 ends-0.505
In-0.462
Token She
Token ID1375
Feature activation+1.61e-01

Pos logit contributions
The+0.198
 ends+0.197
 finish+0.197
 grant+0.193
ers+0.193
Neg logit contributions
idd-0.339
rol-0.335
und-0.306
iw-0.304
arre-0.297
Token said
Token ID531
Feature activation+3.17e-01

Pos logit contributions
idd+0.274
rol+0.268
und+0.244
iw+0.242
arre+0.239
Neg logit contributions
The-0.174
 finish-0.171
 ends-0.168
 grant-0.167
In-0.167
Token,
Token ID11
Feature activation+2.66e-01

Pos logit contributions
idd+0.651
rol+0.645
und+0.595
iw+0.593
arre+0.583
Neg logit contributions
The-0.233
 finish-0.224
In-0.221
 ends-0.218
idi-0.218
Token "
Token ID366
Feature activation+1.57e-03

Pos logit contributions
idd+0.498
rol+0.491
und+0.448
iw+0.446
arre+0.440
Neg logit contributions
The-0.239
 finish-0.234
 ends-0.232
In-0.228
idi-0.228
TokenHi
Token ID17250
Feature activation+7.34e-01

Pos logit contributions
idd+0.003
rol+0.003
und+0.002
iw+0.002
arre+0.002
Neg logit contributions
 finish-0.002
 ends-0.002
 before-0.002
idi-0.002
The-0.002
Token p
Token ID279
Feature activation+2.74e+00

Pos logit contributions
idd+1.475
rol+1.467
und+1.342
iw+1.340
arre+1.328
Neg logit contributions
The-0.580
 finish-0.554
 ends-0.539
 before-0.539
In-0.539
Tokenanda
Token ID5282
Feature activation+3.21e-01

Pos logit contributions
rol+4.912
arre+4.765
idd+4.683
und+4.616
 Mechan+4.587
Neg logit contributions
ers-2.378
The-2.296
 before-2.210
 finish-2.170
isy-2.153
Token!
Token ID0
Feature activation+1.03e-01

Pos logit contributions
idd+0.533
rol+0.525
iw+0.474
und+0.473
arre+0.459
Neg logit contributions
The-0.366
 finish-0.356
In-0.354
 ends-0.351
idi-0.351
Token What
Token ID1867
Feature activation+4.85e-01

Pos logit contributions
idd+0.157
rol+0.155
und+0.137
iw+0.137
arre+0.132
Neg logit contributions
The-0.131
 finish-0.130
 ends-0.128
idi-0.127
ers-0.126
Token smells
Token ID25760
Feature activation-9.28e-01

Pos logit contributions
idd+0.839
rol+0.830
und+0.751
iw+0.748
arre+0.735
Neg logit contributions
The-0.516
 finish-0.500
In-0.492
idi-0.491
 grant-0.489
Token so
Token ID523
Feature activation+6.89e-01

Pos logit contributions
idi+0.955
The+0.950
ace+0.935
 grant+0.918
In+0.913
Neg logit contributions
rol-1.586
idd-1.511
und-1.441
iw-1.438
 batch-1.388
Token said
Token ID531
Feature activation+4.01e-01

Pos logit contributions
idd+0.251
rol+0.246
und+0.223
iw+0.219
arre+0.218
Neg logit contributions
The-0.171
 grant-0.166
 finish-0.165
 ends-0.164
In-0.164
Token,
Token ID11
Feature activation+3.20e-01

Pos logit contributions
idd+0.830
rol+0.823
und+0.757
iw+0.753
arre+0.744
Neg logit contributions
The-0.292
 finish-0.280
In-0.275
 ends-0.272
idi-0.272
Token "
Token ID366
Feature activation+4.82e-01

Pos logit contributions
idd+0.604
rol+0.597
und+0.545
iw+0.542
arre+0.534
Neg logit contributions
The-0.287
 finish-0.281
 ends-0.276
In-0.273
idi-0.272
TokenYou
Token ID1639
Feature activation+1.74e-02

Pos logit contributions
idd+0.756
rol+0.749
und+0.669
iw+0.666
arre+0.653
Neg logit contributions
The-0.593
 finish-0.581
 grant-0.571
 ends-0.571
In-0.570
Token had
Token ID550
Feature activation-9.16e-02

Pos logit contributions
idd+0.029
rol+0.028
und+0.025
arre+0.025
iw+0.025
Neg logit contributions
The-0.020
idi-0.019
 before-0.019
 finish-0.019
In-0.019
Token so
Token ID523
Feature activation+2.73e+00

Pos logit contributions
The+0.080
 grant+0.079
idi+0.077
In+0.076
 ends+0.075
Neg logit contributions
idd-0.173
rol-0.171
arre-0.157
und-0.157
iw-0.155
Token much
Token ID881
Feature activation+6.78e-01

Pos logit contributions
rol+4.919
idd+4.692
bat+4.357
und+4.315
arre+4.250
Neg logit contributions
The-3.205
In-3.159
 finish-3.048
 time-3.046
 ends-2.942
Token fun
Token ID1257
Feature activation+1.01e+00

Pos logit contributions
idd+1.144
rol+1.137
und+1.021
iw+1.018
arre+0.996
Neg logit contributions
The-0.757
 finish-0.741
In-0.726
 ends-0.724
 before-0.723
Token outside
Token ID2354
Feature activation+1.48e-01

Pos logit contributions
rol+1.616
idd+1.604
iw+1.440
und+1.422
arre+1.399
Neg logit contributions
The-1.210
 finish-1.198
 before-1.196
 ends-1.157
 grant-1.156
Token.
Token ID13
Feature activation-8.56e-03

Pos logit contributions
idd+0.282
rol+0.278
und+0.256
iw+0.253
arre+0.249
Neg logit contributions
The-0.132
In-0.126
idi-0.126
 finish-0.125
 grant-0.124
Token Let
Token ID3914
Feature activation+7.22e-01

Pos logit contributions
The+0.011
 finish+0.010
 ends+0.010
idi+0.010
ers+0.010
Neg logit contributions
idd-0.013
rol-0.013
und-0.012
iw-0.012
arre-0.011
Token She
Token ID1375
Feature activation-1.41e+00

Pos logit contributions
The+0.426
 grant+0.424
idi+0.423
 ends+0.416
ers+0.415
Neg logit contributions
rol-0.518
idd-0.513
iw-0.467
und-0.463
arre-0.455
Token loved
Token ID6151
Feature activation+4.79e-01

Pos logit contributions
 before+1.588
 finish+1.570
 ends+1.561
 grant+1.551
ers+1.550
Neg logit contributions
idd-2.271
rol-2.030
und-1.968
iw-1.923
 Mechan-1.916
Token going
Token ID1016
Feature activation+2.10e-02

Pos logit contributions
idd+0.999
rol+0.990
und+0.912
iw+0.911
arre+0.896
Neg logit contributions
The-0.339
 finish-0.323
In-0.315
 grant-0.314
 ends-0.314
Token to
Token ID284
Feature activation+2.34e-01

Pos logit contributions
idd+0.042
rol+0.042
iw+0.038
und+0.038
arre+0.037
Neg logit contributions
The-0.016
 grant-0.016
idi-0.016
 ends-0.016
 finish-0.015
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
idd+0.562
rol+0.557
und+0.522
iw+0.521
arre+0.512
Neg logit contributions
The-0.082
In-0.075
idi-0.074
 ends-0.074
 finish-0.074
Token toy
Token ID13373
Feature activation+2.73e+00

Pos logit contributions
idd+1.986
rol+1.979
und+1.765
iw+1.751
arre+1.727
Neg logit contributions
The-1.434
 finish-1.388
In-1.344
idi-1.320
 before-1.310
Token store
Token ID3650
Feature activation+1.05e+00

Pos logit contributions
rol+5.215
idd+5.061
rou+4.654
arre+4.621
und+4.488
Neg logit contributions
 finish-3.094
The-3.033
 before-2.957
 the-2.898
 ends-2.803
Token with
Token ID351
Feature activation+1.15e+00

Pos logit contributions
idd+2.207
rol+2.183
und+2.018
arre+1.995
iw+1.993
Neg logit contributions
The-0.769
 before-0.728
 finish-0.722
In-0.703
 ends-0.677
Token her
Token ID607
Feature activation+4.85e-01

Pos logit contributions
idd+2.082
rol+2.064
und+1.866
arre+1.849
iw+1.831
Neg logit contributions
The-1.264
 finish-1.204
 ends-1.178
 before-1.176
In-1.176
Token mom
Token ID1995
Feature activation-1.95e-02

Pos logit contributions
idd+1.058
rol+1.048
und+0.969
iw+0.968
arre+0.952
Neg logit contributions
The-0.303
 finish-0.288
In-0.279
 grant-0.279
idi-0.278
Token.
Token ID13
Feature activation-6.99e-01

Pos logit contributions
 grant+0.027
The+0.026
idi+0.026
 finish+0.026
 ends+0.026
Neg logit contributions
idd-0.027
rol-0.027
iw-0.024
und-0.024
arre-0.023
Token bought
Token ID5839
Feature activation+1.10e+00

Pos logit contributions
idd+0.587
rol+0.581
und+0.528
iw+0.526
arre+0.515
Neg logit contributions
The-0.331
 finish-0.319
In-0.316
 ends-0.315
 grant-0.313
Token two
Token ID734
Feature activation+2.73e-01

Pos logit contributions
idd+2.296
rol+2.270
iw+2.090
und+2.089
arre+2.061
Neg logit contributions
The-0.832
 finish-0.786
 ends-0.777
 before-0.769
idi-0.761
Token hot
Token ID3024
Feature activation+6.87e-02

Pos logit contributions
idd+0.450
rol+0.447
und+0.401
iw+0.400
arre+0.395
Neg logit contributions
The-0.309
 grant-0.299
 finish-0.299
In-0.298
idi-0.297
Token dogs
Token ID6844
Feature activation+5.12e-01

Pos logit contributions
idd+0.109
rol+0.107
iw+0.097
und+0.097
arre+0.095
Neg logit contributions
The-0.083
 grant-0.081
In-0.080
 finish-0.080
idi-0.079
Token with
Token ID351
Feature activation+1.33e+00

Pos logit contributions
idd+1.027
rol+1.020
und+0.935
iw+0.932
arre+0.918
Neg logit contributions
The-0.401
 finish-0.384
In-0.376
 grant-0.374
 ends-0.374
Token k
Token ID479
Feature activation+2.73e+00

Pos logit contributions
idd+2.834
rol+2.792
iw+2.578
arre+2.568
und+2.567
Neg logit contributions
The-0.974
 finish-0.905
 ends-0.884
 before-0.876
In-0.855
Tokenetchup
Token ID47132
Feature activation+1.16e-01

Pos logit contributions
rol+3.961
 Mechan+3.702
idd+3.690
arre+3.621
und+3.480
Neg logit contributions
The-3.644
ers-3.498
In-3.418
 ends-3.361
 before-3.293
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
idd+0.236
rol+0.234
und+0.216
iw+0.214
arre+0.211
Neg logit contributions
The-0.090
In-0.085
 finish-0.085
idi-0.085
 grant-0.084
Token Max
Token ID5436
Feature activation-4.71e-01

Pos logit contributions
The+0.230
 grant+0.228
 ends+0.223
 finish+0.222
In+0.221
Neg logit contributions
idd-0.395
rol-0.388
und-0.359
arre-0.355
iw-0.350
Token and
Token ID290
Feature activation+8.95e-02

Pos logit contributions
The+0.468
 grant+0.467
 finish+0.463
 ends+0.462
In+0.456
Neg logit contributions
idd-0.820
rol-0.792
und-0.741
arre-0.734
iw-0.728
Token Sam
Token ID3409
Feature activation+1.39e-01

Pos logit contributions
idd+0.159
rol+0.156
und+0.144
iw+0.142
arre+0.141
Neg logit contributions
The-0.089
 finish-0.087
 grant-0.087
In-0.087
idi-0.085
TokenOkay
Token ID16454
Feature activation-7.59e-01

Pos logit contributions
idd+0.433
rol+0.429
iw+0.384
und+0.381
arre+0.371
Neg logit contributions
The-0.358
 finish-0.357
 grant-0.352
 ends-0.351
idi-0.350
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
In+0.721
The+0.720
idi+0.698
 grant+0.690
 ends+0.660
Neg logit contributions
idd-1.348
rol-1.318
iw-1.217
und-1.212
arre-1.200
Token Anna
Token ID11735
Feature activation+6.79e-02

Pos logit contributions
The+0.968
In+0.960
idi+0.930
 grant+0.924
ers+0.913
Neg logit contributions
rol-1.903
idd-1.869
iw-1.757
 batch-1.719
und-1.701
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
idd+0.132
rol+0.129
und+0.120
iw+0.119
arre+0.117
Neg logit contributions
The-0.058
idi-0.055
 grant-0.055
In-0.055
 finish-0.054
Token That
Token ID1320
Feature activation+9.96e-01

Pos logit contributions
The+0.132
idi+0.129
 finish+0.128
 ends+0.126
In+0.126
Neg logit contributions
idd-0.190
rol-0.188
iw-0.171
und-0.168
arre-0.161
Token sounds
Token ID5238
Feature activation-2.95e+00

Pos logit contributions
idd+1.863
rol+1.851
iw+1.686
und+1.664
arre+1.608
Neg logit contributions
The-0.976
 finish-0.961
 ends-0.953
In-0.923
 before-0.917
Token like
Token ID588
Feature activation+4.64e-02

Pos logit contributions
idi+5.283
 grant+4.831
oster+4.823
The+4.739
ers+4.735
Neg logit contributions
 sorts-2.817
 Good-2.764
rol-2.721
arre-2.358
 Enjoy-2.311
Token fun
Token ID1257
Feature activation-1.58e-01

Pos logit contributions
idd+0.081
rol+0.080
und+0.073
iw+0.073
arre+0.071
Neg logit contributions
The-0.047
 finish-0.045
idi-0.045
In-0.045
 grant-0.045
Token.
Token ID13
Feature activation-3.78e-01

Pos logit contributions
The+0.142
idi+0.137
In+0.136
 grant+0.135
 finish+0.133
Neg logit contributions
idd-0.300
rol-0.293
und-0.271
arre-0.266
iw-0.265
Token But
Token ID887
Feature activation+4.66e-01

Pos logit contributions
The+0.368
idi+0.366
ers+0.363
 ends+0.362
 finish+0.360
Neg logit contributions
idd-0.668
rol-0.663
iw-0.604
und-0.598
 Good-0.579
Token remember
Token ID3505
Feature activation-4.09e-01

Pos logit contributions
idd+0.885
rol+0.877
und+0.802
iw+0.800
arre+0.785
Neg logit contributions
The-0.414
 finish-0.397
In-0.390
idi-0.389
 ends-0.389
Token him
Token ID683
Feature activation+3.83e-01

Pos logit contributions
 grant+0.179
In+0.179
idi+0.177
The+0.176
 finish+0.174
Neg logit contributions
idd-0.289
rol-0.285
und-0.258
iw-0.258
arre-0.248
Token.
Token ID13
Feature activation-2.26e-01

Pos logit contributions
idd+0.729
rol+0.727
und+0.663
iw+0.660
arre+0.648
Neg logit contributions
The-0.336
 finish-0.321
In-0.319
idi-0.319
 grant-0.317
Token Tom
Token ID4186
Feature activation-2.21e-01

Pos logit contributions
 grant+0.191
 ends+0.190
The+0.189
 finish+0.188
idi+0.185
Neg logit contributions
idd-0.425
rol-0.420
und-0.392
iw-0.387
arre-0.378
Token falls
Token ID8953
Feature activation+6.84e-01

Pos logit contributions
The+0.281
In+0.273
idi+0.271
 finish+0.268
 grant+0.267
Neg logit contributions
idd-0.333
rol-0.329
und-0.296
iw-0.294
arre-0.290
Token on
Token ID319
Feature activation-6.71e-01

Pos logit contributions
idd+1.356
rol+1.340
iw+1.227
und+1.227
arre+1.206
Neg logit contributions
The-0.567
 finish-0.546
 ends-0.534
 before-0.531
In-0.531
Token the
Token ID262
Feature activation-2.90e+00

Pos logit contributions
 grant+0.300
In+0.295
 ends+0.278
 finish+0.268
ace+0.266
Neg logit contributions
idd-1.522
rol-1.492
und-1.434
iw-1.423
 batch-1.383
Token ground
Token ID2323
Feature activation+7.10e-01

Pos logit contributions
idi+3.579
In+3.558
 behold+3.213
The+3.152
 involve+3.123
Neg logit contributions
iw-4.373
idd-4.341
rol-4.258
 batch-4.210
 bur-4.023
Token.
Token ID13
Feature activation-9.88e-01

Pos logit contributions
idd+1.525
rol+1.511
und+1.392
iw+1.389
arre+1.373
Neg logit contributions
The-0.468
 finish-0.447
 before-0.434
 ends-0.430
In-0.428
Token He
Token ID679
Feature activation+6.39e-01

Pos logit contributions
 grant+0.897
isy+0.892
ace+0.881
 ends+0.857
 finish+0.849
Neg logit contributions
rol-1.782
idd-1.775
und-1.663
iw-1.631
arre-1.580
Token gets
Token ID3011
Feature activation+9.10e-01

Pos logit contributions
idd+1.017
rol+1.007
und+0.900
iw+0.896
arre+0.879
Neg logit contributions
The-0.773
 finish-0.753
 ends-0.743
 grant-0.740
In-0.739
Token dirty
Token ID11841
Feature activation+5.93e-01

Pos logit contributions
idd+1.511
rol+1.493
und+1.362
iw+1.343
arre+1.321
Neg logit contributions
The-1.054
 finish-1.028
 ends-1.009
 before-0.998
In-0.995
Token He
Token ID679
Feature activation+2.21e-02

Pos logit contributions
 grant+0.280
 finish+0.277
 ends+0.276
The+0.273
idi+0.267
Neg logit contributions
idd-0.631
rol-0.619
und-0.574
iw-0.570
arre-0.563
Token threw
Token ID9617
Feature activation+6.18e-01

Pos logit contributions
idd+0.037
rol+0.036
und+0.033
iw+0.033
arre+0.032
Neg logit contributions
The-0.024
 finish-0.023
In-0.023
 grant-0.023
 before-0.023
Token the
Token ID262
Feature activation-1.20e+00

Pos logit contributions
idd+1.311
rol+1.300
und+1.197
iw+1.193
arre+1.178
Neg logit contributions
The-0.426
 finish-0.405
 ends-0.394
In-0.392
idi-0.390
Token pieces
Token ID5207
Feature activation+6.36e-01

Pos logit contributions
idi+1.411
The+1.361
In+1.327
 before+1.311
 finish+1.307
Neg logit contributions
idd-1.878
rol-1.740
arre-1.708
iw-1.669
und-1.660
Token on
Token ID319
Feature activation-1.12e+00

Pos logit contributions
idd+1.275
rol+1.264
und+1.159
iw+1.156
arre+1.138
Neg logit contributions
The-0.505
 finish-0.484
In-0.471
 ends-0.471
 before-0.468
Token the
Token ID262
Feature activation-2.89e+00

Pos logit contributions
 grant+0.742
In+0.671
 finish+0.645
 ends+0.627
ace+0.622
Neg logit contributions
idd-2.337
rol-2.298
und-2.250
iw-2.190
 batch-2.120
Token ground
Token ID2323
Feature activation+7.31e-01

Pos logit contributions
idi+3.550
In+3.520
 behold+3.303
The+3.216
ace+3.142
Neg logit contributions
idd-4.331
 bur-4.120
iw-4.032
arre-4.016
und-3.997
Token and
Token ID290
Feature activation+9.64e-02

Pos logit contributions
idd+1.667
rol+1.654
iw+1.528
und+1.527
arre+1.507
Neg logit contributions
The-0.380
 finish-0.360
 before-0.348
 ends-0.343
In-0.341
Token started
Token ID2067
Feature activation+8.99e-01

Pos logit contributions
idd+0.174
rol+0.171
und+0.157
iw+0.156
arre+0.151
Neg logit contributions
The-0.094
In-0.091
 finish-0.090
 grant-0.090
idi-0.089
Token to
Token ID284
Feature activation+6.29e-01

Pos logit contributions
idd+1.967
rol+1.965
und+1.802
iw+1.802
arre+1.771
Neg logit contributions
The-0.554
 finish-0.534
 before-0.519
 grant-0.503
 ends-0.502
Token cry
Token ID3960
Feature activation+1.20e+00

Pos logit contributions
idd+0.975
rol+0.966
und+0.858
iw+0.856
arre+0.838
Neg logit contributions
The-0.794
 finish-0.777
 grant-0.762
 ends-0.760
In-0.760
Token,
Token ID11
Feature activation+1.50e-01

Pos logit contributions
The+0.510
idi+0.501
In+0.497
 grant+0.475
ers+0.467
Neg logit contributions
rol-1.050
idd-1.040
und-0.978
iw-0.949
arre-0.942
Token it
Token ID340
Feature activation+8.61e-02

Pos logit contributions
idd+0.321
rol+0.318
und+0.295
iw+0.294
arre+0.287
Neg logit contributions
The-0.092
 finish-0.090
In-0.088
idi-0.088
 ends-0.087
Token hopped
Token ID45230
Feature activation+6.27e-01

Pos logit contributions
idd+0.158
rol+0.157
und+0.144
iw+0.141
arre+0.141
Neg logit contributions
The-0.080
 finish-0.077
 grant-0.077
 ends-0.076
In-0.076
Token up
Token ID510
Feature activation+5.46e-01

Pos logit contributions
idd+1.192
rol+1.179
und+1.077
iw+1.075
arre+1.058
Neg logit contributions
The-0.569
 finish-0.549
In-0.537
 ends-0.535
 before-0.535
Token on
Token ID319
Feature activation+5.33e-03

Pos logit contributions
idd+1.201
rol+1.193
und+1.103
iw+1.100
arre+1.084
Neg logit contributions
The-0.329
 finish-0.311
In-0.301
 ends-0.300
 grant-0.299
Token the
Token ID262
Feature activation-2.86e+00

Pos logit contributions
idd+0.012
rol+0.012
und+0.012
iw+0.011
arre+0.011
Neg logit contributions
The-0.002
In-0.002
 grant-0.002
 finish-0.002
 ends-0.002
Token stand
Token ID1302
Feature activation+1.95e-01

Pos logit contributions
In+3.667
idi+3.560
ace+3.501
The+3.346
 behold+3.277
Neg logit contributions
iw-4.358
idd-4.218
oc-4.031
rol-3.898
 batch-3.866
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
idd+0.392
rol+0.389
und+0.359
iw+0.356
arre+0.349
Neg logit contributions
The-0.150
 finish-0.144
In-0.143
idi-0.143
 grant-0.143
Token The
Token ID383
Feature activation-6.02e-01

Pos logit contributions
The+0.091
 ends+0.090
 finish+0.089
 grant+0.088
ers+0.087
Neg logit contributions
idd-0.335
rol-0.332
und-0.311
iw-0.307
arre-0.301
Token mouse
Token ID10211
Feature activation-2.48e-01

Pos logit contributions
idi+0.713
The+0.709
In+0.700
ace+0.686
 grant+0.682
Neg logit contributions
idd-0.957
rol-0.878
iw-0.854
arre-0.822
 batch-0.812
Token thought
Token ID1807
Feature activation-1.10e+00

Pos logit contributions
The+0.224
 grant+0.220
 finish+0.219
In+0.217
 ends+0.213
Neg logit contributions
idd-0.453
rol-0.448
und-0.416
arre-0.404
iw-0.403
Token noise
Token ID7838
Feature activation-1.54e-01

Pos logit contributions
The+0.179
In+0.174
idi+0.172
 finish+0.169
 before+0.168
Neg logit contributions
idd-0.302
rol-0.296
und-0.271
iw-0.270
arre-0.263
Token as
Token ID355
Feature activation-1.20e+00

Pos logit contributions
The+0.111
idi+0.110
In+0.110
 grant+0.108
 finish+0.105
Neg logit contributions
idd-0.310
rol-0.309
und-0.284
iw-0.279
arre-0.279
Token well
Token ID880
Feature activation-4.33e-01

Pos logit contributions
In+1.209
idi+1.180
The+1.169
 finish+1.163
 grant+1.135
Neg logit contributions
idd-2.068
rol-2.039
und-1.850
rou-1.850
iw-1.835
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
The+0.374
In+0.362
idi+0.343
 grant+0.340
 finish+0.338
Neg logit contributions
idd-0.838
rol-0.831
und-0.772
iw-0.745
arre-0.736
Token It
Token ID632
Feature activation+5.06e-01

Pos logit contributions
The+0.168
 ends+0.167
 finish+0.166
 grant+0.165
In+0.162
Neg logit contributions
idd-0.418
rol-0.411
und-0.385
iw-0.380
arre-0.372
Token sounds
Token ID5238
Feature activation-2.86e+00

Pos logit contributions
idd+0.855
rol+0.848
und+0.764
iw+0.761
arre+0.747
Neg logit contributions
The-0.560
 finish-0.543
In-0.535
 grant-0.533
idi-0.532
Token like
Token ID588
Feature activation-1.94e-01

Pos logit contributions
oster+3.801
 grant+3.776
ace+3.758
idi+3.638
isy+3.540
Neg logit contributions
rol-3.918
idd-3.614
 Good-3.585
 sorts-3.475
 anyways-3.448
Token a
Token ID257
Feature activation+5.70e-01

Pos logit contributions
The+0.155
In+0.149
 finish+0.147
idi+0.147
 grant+0.146
Neg logit contributions
idd-0.381
rol-0.376
und-0.345
iw-0.344
arre-0.334
Token sn
Token ID3013
Feature activation+5.53e-01

Pos logit contributions
idd+1.057
rol+1.049
und+0.954
iw+0.950
arre+0.935
Neg logit contributions
The-0.536
 finish-0.518
In-0.508
 ends-0.507
 grant-0.506
Tokenore
Token ID382
Feature activation-7.42e-03

Pos logit contributions
idd+0.727
rol+0.718
und+0.626
iw+0.623
arre+0.608
Neg logit contributions
The-0.828
 finish-0.810
In-0.801
 ends-0.799
 grant-0.798
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
The+0.006
In+0.006
idi+0.006
 grant+0.006
 finish+0.006
Neg logit contributions
idd-0.014
rol-0.014
und-0.013
iw-0.013
arre-0.013
Tokenily
Token ID813
Feature activation+1.71e-01

Pos logit contributions
The+0.005
 finish+0.005
In+0.005
idi+0.005
 grant+0.005
Neg logit contributions
idd-0.023
rol-0.023
und-0.021
iw-0.021
arre-0.021
Token said
Token ID531
Feature activation-1.41e-02

Pos logit contributions
idd+0.340
rol+0.334
und+0.310
iw+0.308
arre+0.307
Neg logit contributions
The-0.132
 finish-0.128
 grant-0.127
In-0.125
 ends-0.125
Token,
Token ID11
Feature activation-4.83e-01

Pos logit contributions
The+0.011
In+0.011
idi+0.011
 finish+0.011
 grant+0.011
Neg logit contributions
idd-0.028
rol-0.027
iw-0.025
und-0.025
arre-0.025
Token "
Token ID366
Feature activation-5.27e-01

Pos logit contributions
 ends+0.499
 finish+0.486
idi+0.482
The+0.482
ers+0.476
Neg logit contributions
idd-0.828
rol-0.787
iw-0.739
und-0.725
arre-0.719
TokenThat
Token ID2504
Feature activation+4.74e-01

Pos logit contributions
 finish+0.690
idi+0.688
 before+0.684
 ends+0.680
 grant+0.672
Neg logit contributions
idd-0.759
rol-0.737
iw-0.691
und-0.661
arre-0.631
Token sounds
Token ID5238
Feature activation-2.85e+00

Pos logit contributions
idd+0.826
rol+0.818
und+0.741
iw+0.738
arre+0.727
Neg logit contributions
The-0.503
 finish-0.486
In-0.480
idi-0.478
 grant-0.477
Token great
Token ID1049
Feature activation+5.29e-01

Pos logit contributions
idi+4.435
oster+4.139
The+4.026
ace+3.966
ryn+3.893
Neg logit contributions
rol-3.323
 Good-3.198
 sorts-3.168
arre-3.093
idd-3.007
Token,
Token ID11
Feature activation+5.95e-01

Pos logit contributions
idd+1.101
rol+1.093
und+1.006
iw+1.003
arre+0.988
Neg logit contributions
The-0.384
 finish-0.366
In-0.358
 ends-0.356
 grant-0.356
Token grandma
Token ID49890
Feature activation-3.64e-01

Pos logit contributions
idd+1.048
rol+1.039
und+0.940
iw+0.936
arre+0.919
Neg logit contributions
The-0.620
 finish-0.601
In-0.589
 ends-0.589
 grant-0.588
Token.
Token ID13
Feature activation-7.56e-02

Pos logit contributions
The+0.346
In+0.339
idi+0.334
 finish+0.331
 grant+0.330
Neg logit contributions
idd-0.673
rol-0.641
iw-0.607
und-0.594
rou-0.580
Token Thank
Token ID6952
Feature activation-1.88e-01

Pos logit contributions
The+0.098
idi+0.097
 finish+0.096
 ends+0.095
In+0.094
Neg logit contributions
idd-0.111
rol-0.110
iw-0.098
und-0.098
arre-0.093
Token,
Token ID11
Feature activation+2.87e-03

Pos logit contributions
idd+0.114
rol+0.112
und+0.102
iw+0.102
arre+0.101
Neg logit contributions
The-0.049
In-0.047
idi-0.047
 finish-0.046
 ends-0.046
Token "
Token ID366
Feature activation+1.93e-01

Pos logit contributions
idd+0.005
rol+0.005
iw+0.005
und+0.005
arre+0.005
Neg logit contributions
The-0.003
 ends-0.003
 finish-0.003
In-0.003
idi-0.003
TokenOK
Token ID11380
Feature activation-8.06e-01

Pos logit contributions
idd+0.354
rol+0.348
iw+0.318
und+0.317
arre+0.312
Neg logit contributions
The-0.180
 finish-0.180
 grant-0.178
 ends-0.177
idi-0.177
Token,
Token ID11
Feature activation-8.70e-01

Pos logit contributions
In+0.790
The+0.782
 grant+0.772
idi+0.771
 ends+0.753
Neg logit contributions
idd-1.412
rol-1.358
und-1.256
arre-1.236
iw-1.236
Token that
Token ID326
Feature activation+2.08e+00

Pos logit contributions
In+0.869
The+0.859
idi+0.844
 ends+0.842
ers+0.833
Neg logit contributions
idd-1.504
rol-1.471
iw-1.365
und-1.307
arre-1.296
Token sounds
Token ID5238
Feature activation-2.82e+00

Pos logit contributions
rol+4.390
idd+4.287
iw+4.109
und+3.922
ays+3.800
Neg logit contributions
The-1.822
 ends-1.775
 finish-1.739
In-1.662
 before-1.624
Token fun
Token ID1257
Feature activation+1.14e-01

Pos logit contributions
idi+4.597
 grant+4.531
ace+4.312
The+4.267
isy+4.260
Neg logit contributions
 Good-3.003
rol-2.889
 sorts-2.749
idd-2.732
arre-2.639
Token.
Token ID13
Feature activation-2.90e-01

Pos logit contributions
idd+0.203
rol+0.199
und+0.181
iw+0.179
arre+0.177
Neg logit contributions
The-0.117
 grant-0.112
In-0.112
idi-0.112
 finish-0.111
Token What
Token ID1867
Feature activation+1.37e-01

Pos logit contributions
The+0.314
idi+0.313
ers+0.309
 ends+0.308
 finish+0.307
Neg logit contributions
idd-0.480
rol-0.475
iw-0.430
und-0.427
arre-0.410
Token things
Token ID1243
Feature activation-1.04e+00

Pos logit contributions
idd+0.286
rol+0.281
und+0.262
iw+0.261
arre+0.259
Neg logit contributions
The-0.093
idi-0.088
 finish-0.088
 grant-0.085
In-0.085
Token do
Token ID466
Feature activation-1.12e+00

Pos logit contributions
idi+0.803
The+0.764
ace+0.717
ers+0.695
In+0.681
Neg logit contributions
idd-2.047
rol-2.019
und-1.943
arre-1.942
iw-1.888
Token She
Token ID1375
Feature activation+2.81e-01

Pos logit contributions
idd+0.157
rol+0.155
iw+0.141
und+0.140
arre+0.136
Neg logit contributions
The-0.120
idi-0.117
 finish-0.117
 grant-0.117
In-0.117
Token said
Token ID531
Feature activation-6.69e-03

Pos logit contributions
idd+0.500
rol+0.491
und+0.449
iw+0.446
arre+0.442
Neg logit contributions
The-0.283
 finish-0.272
 grant-0.269
 before-0.268
In-0.268
Token,
Token ID11
Feature activation-3.46e-01

Pos logit contributions
The+0.005
In+0.005
 finish+0.005
idi+0.005
 ends+0.005
Neg logit contributions
idd-0.013
rol-0.013
und-0.012
iw-0.012
arre-0.012
Token "
Token ID366
Feature activation-5.17e-02

Pos logit contributions
 ends+0.346
 finish+0.344
The+0.343
idi+0.339
In+0.333
Neg logit contributions
idd-0.601
rol-0.579
iw-0.536
und-0.530
arre-0.526
TokenThat
Token ID2504
Feature activation+3.17e-01

Pos logit contributions
 finish+0.063
 grant+0.062
 ends+0.062
idi+0.062
 before+0.062
Neg logit contributions
idd-0.081
rol-0.078
iw-0.072
und-0.071
arre-0.069
Token sounds
Token ID5238
Feature activation-2.82e+00

Pos logit contributions
idd+0.545
rol+0.537
und+0.489
iw+0.484
arre+0.479
Neg logit contributions
The-0.339
 finish-0.327
In-0.325
idi-0.324
 grant-0.321
Token fun
Token ID1257
Feature activation-1.09e-01

Pos logit contributions
idi+4.038
oster+3.860
The+3.815
ers+3.558
In+3.511
Neg logit contributions
rol-3.662
 sorts-3.561
 Good-3.444
arre-3.318
idd-3.290
Token.
Token ID13
Feature activation-2.38e-01

Pos logit contributions
The+0.105
In+0.100
idi+0.100
 grant+0.099
 finish+0.098
Neg logit contributions
idd-0.201
rol-0.197
und-0.182
iw-0.178
arre-0.175
Token I
Token ID314
Feature activation+7.64e-01

Pos logit contributions
The+0.292
 finish+0.285
idi+0.284
In+0.282
 ends+0.282
Neg logit contributions
idd-0.372
rol-0.362
iw-0.328
und-0.325
arre-0.308
Token will
Token ID481
Feature activation-4.43e-01

Pos logit contributions
idd+1.351
rol+1.345
und+1.212
iw+1.208
arre+1.179
Neg logit contributions
The-0.794
 finish-0.782
 grant-0.761
In-0.761
 ends-0.758
Token draw
Token ID3197
Feature activation-1.10e-01

Pos logit contributions
idi+0.330
The+0.328
ers+0.326
 ends+0.318
In+0.312
Neg logit contributions
idd-0.874
rol-0.851
und-0.807
iw-0.791
arre-0.788
Token the
Token ID262
Feature activation-9.44e-01

Pos logit contributions
idd+3.156
rol+3.144
und+2.924
arre+2.908
iw+2.900
Neg logit contributions
The-0.491
 before-0.428
 finish-0.407
 ends-0.387
 grant-0.383
Token leaves
Token ID5667
Feature activation+1.13e-01

Pos logit contributions
idi+1.018
ers+0.984
The+0.982
In+0.963
 grant+0.956
Neg logit contributions
idd-1.516
rol-1.481
iw-1.446
und-1.393
arre-1.381
Token flying
Token ID7348
Feature activation+2.04e-01

Pos logit contributions
rol+0.215
idd+0.215
und+0.196
iw+0.194
arre+0.191
Neg logit contributions
The-0.097
idi-0.095
In-0.093
 finish-0.092
 grant-0.091
Token all
Token ID477
Feature activation-1.02e+00

Pos logit contributions
rol+0.412
idd+0.411
und+0.378
iw+0.373
arre+0.366
Neg logit contributions
The-0.156
 finish-0.150
 grant-0.149
idi-0.149
In-0.148
Token over
Token ID625
Feature activation-2.20e-01

Pos logit contributions
 grant+1.016
idi+0.912
The+0.890
In+0.879
 finish+0.864
Neg logit contributions
rol-1.945
idd-1.908
und-1.767
iw-1.729
 sorts-1.696
Token the
Token ID262
Feature activation-2.80e+00

Pos logit contributions
 grant+0.106
 finish+0.102
idi+0.098
The+0.098
In+0.096
Neg logit contributions
idd-0.506
rol-0.496
iw-0.471
und-0.468
arre-0.445
Token place
Token ID1295
Feature activation+4.25e-01

Pos logit contributions
idi+3.543
 grant+3.481
kin+3.423
oster+3.384
ace+3.308
Neg logit contributions
idd-4.303
 bur-4.215
iw-4.195
und-4.190
rol-3.901
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
idd+0.853
rol+0.847
und+0.776
iw+0.774
arre+0.760
Neg logit contributions
The-0.333
 finish-0.318
In-0.314
 grant-0.313
idi-0.313
Token She
Token ID1375
Feature activation+3.76e-01

Pos logit contributions
idd+0.211
rol+0.209
und+0.189
iw+0.188
arre+0.186
Neg logit contributions
The-0.155
 grant-0.151
 finish-0.151
idi-0.150
 ends-0.150
Token decided
Token ID3066
Feature activation-3.64e-01

Pos logit contributions
idd+0.677
rol+0.667
und+0.608
iw+0.605
arre+0.596
Neg logit contributions
The-0.375
 finish-0.362
 grant-0.356
In-0.356
 ends-0.356
Token to
Token ID284
Feature activation+3.69e-02

Pos logit contributions
 ends+0.292
ers+0.285
The+0.284
ace+0.282
 grant+0.281
Neg logit contributions
idd-0.696
rol-0.685
und-0.640
iw-0.639
arre-0.628
Token-
Token ID12
Feature activation+1.02e+00

Pos logit contributions
idd+1.921
rol+1.891
iw+1.743
und+1.710
arre+1.691
Neg logit contributions
The-1.089
 finish-1.066
 before-1.044
 grant-1.034
 ends-1.027
Tokentap
Token ID44335
Feature activation+9.55e-01

Pos logit contributions
idd+1.642
rol+1.618
iw+1.454
und+1.451
arre+1.435
Neg logit contributions
The-1.268
In-1.197
 finish-1.184
idi-1.176
 ends-1.164
Token-
Token ID12
Feature activation+8.23e-01

Pos logit contributions
idd+1.697
rol+1.667
iw+1.536
und+1.509
arre+1.508
Neg logit contributions
The-1.002
 finish-0.954
 before-0.945
 ends-0.941
 grant-0.933
Tokentap
Token ID44335
Feature activation+6.59e-01

Pos logit contributions
idd+1.405
rol+1.391
und+1.254
iw+1.252
arre+1.233
Neg logit contributions
The-0.918
 finish-0.873
In-0.866
idi-0.855
 ends-0.855
Token on
Token ID319
Feature activation-8.24e-01

Pos logit contributions
idd+1.207
rol+1.193
iw+1.087
und+1.085
arre+1.068
Neg logit contributions
The-0.644
 finish-0.619
 ends-0.607
In-0.605
 before-0.605
Token the
Token ID262
Feature activation-2.78e+00

Pos logit contributions
In+0.419
ers+0.415
idi+0.408
 ends+0.395
 finish+0.385
Neg logit contributions
idd-1.784
rol-1.768
und-1.706
iw-1.672
 batch-1.622
Token leaf
Token ID12835
Feature activation+4.47e-01

Pos logit contributions
In+3.565
idi+3.305
The+3.202
 involve+3.132
 behold+3.120
Neg logit contributions
idd-4.502
 bur-4.167
rol-4.105
iw-4.046
 batch-3.880
Token.
Token ID13
Feature activation-2.34e-01

Pos logit contributions
idd+0.909
rol+0.904
und+0.831
iw+0.826
arre+0.814
Neg logit contributions
The-0.335
 finish-0.319
In-0.314
idi-0.314
 grant-0.312
Token It
Token ID632
Feature activation+2.03e-01

Pos logit contributions
The+0.242
 finish+0.238
 ends+0.235
idi+0.233
In+0.233
Neg logit contributions
rol-0.403
idd-0.402
und-0.368
iw-0.358
arre-0.357
Token was
Token ID373
Feature activation-4.05e-01

Pos logit contributions
idd+0.392
rol+0.389
und+0.357
iw+0.351
arre+0.349
Neg logit contributions
The-0.172
 finish-0.167
 grant-0.164
In-0.163
 before-0.162
Token a
Token ID257
Feature activation+2.09e-01

Pos logit contributions
The+0.371
 grant+0.364
In+0.360
 ends+0.359
 finish+0.357
Neg logit contributions
rol-0.751
idd-0.750
und-0.678
iw-0.669
arre-0.653
Token friends
Token ID2460
Feature activation+1.48e+00

Pos logit contributions
The+0.385
 grant+0.378
In+0.370
idi+0.369
 before+0.369
Neg logit contributions
idd-0.502
rol-0.473
iw-0.427
und-0.424
arre-0.420
Token found
Token ID1043
Feature activation+6.73e-01

Pos logit contributions
rol+2.958
idd+2.953
iw+2.776
und+2.673
rou+2.636
Neg logit contributions
 finish-1.250
The-1.236
In-1.207
 before-1.152
 ends-1.150
Token some
Token ID617
Feature activation-1.58e-01

Pos logit contributions
idd+1.431
rol+1.424
und+1.310
iw+1.309
arre+1.292
Neg logit contributions
The-0.461
 finish-0.435
In-0.424
 ends-0.424
idi-0.418
Token sticks
Token ID16461
Feature activation+2.42e-02

Pos logit contributions
The+0.174
 grant+0.169
 finish+0.168
idi+0.167
In+0.166
Neg logit contributions
idd-0.269
rol-0.263
iw-0.239
und-0.238
arre-0.231
Token on
Token ID319
Feature activation-9.16e-01

Pos logit contributions
idd+0.051
rol+0.051
und+0.047
iw+0.046
arre+0.045
Neg logit contributions
The-0.016
idi-0.016
 finish-0.016
In-0.015
 grant-0.015
Token the
Token ID262
Feature activation-2.78e+00

Pos logit contributions
 grant+0.439
ace+0.424
idi+0.421
isy+0.407
In+0.404
Neg logit contributions
idd-2.045
rol-1.994
und-1.964
iw-1.892
 batch-1.863
Token ground
Token ID2323
Feature activation+4.78e-01

Pos logit contributions
idi+2.974
 behold+2.829
In+2.725
 grant+2.577
The+2.494
Neg logit contributions
idd-4.699
rol-4.695
 bur-4.459
und-4.318
iw-4.224
Token.
Token ID13
Feature activation-4.23e-01

Pos logit contributions
idd+0.968
rol+0.961
und+0.883
iw+0.879
arre+0.865
Neg logit contributions
The-0.366
 finish-0.350
In-0.343
 grant-0.342
idi-0.342
Token
Token ID220
Feature activation+1.83e-01

Pos logit contributions
 grant+0.459
idi+0.444
The+0.442
ace+0.439
 ends+0.438
Neg logit contributions
idd-0.724
rol-0.706
und-0.651
arre-0.634
iw-0.634
Token
Token ID198
Feature activation-3.43e-02

Pos logit contributions
idd+0.364
rol+0.362
und+0.332
iw+0.328
arre+0.326
Neg logit contributions
The-0.140
 finish-0.140
 grant-0.137
idi-0.136
 ends-0.135
Token
Token ID198
Feature activation+1.92e-01

Pos logit contributions
 finish+0.028
 grant+0.028
 ends+0.027
The+0.027
idi+0.027
Neg logit contributions
idd-0.067
rol-0.066
und-0.061
iw-0.060
arre-0.059
Token her
Token ID607
Feature activation-1.25e+00

Pos logit contributions
The+0.210
In+0.205
 grant+0.199
 finish+0.199
 ends+0.196
Neg logit contributions
idd-0.467
rol-0.461
iw-0.428
und-0.425
 batch-0.407
Token hand
Token ID1021
Feature activation+1.61e-01

Pos logit contributions
idi+1.503
The+1.492
In+1.490
ers+1.386
 grant+1.316
Neg logit contributions
idd-2.021
rol-1.944
iw-1.840
und-1.783
 batch-1.764
Token and
Token ID290
Feature activation+4.91e-01

Pos logit contributions
idd+0.353
rol+0.351
und+0.327
iw+0.323
arre+0.319
Neg logit contributions
The-0.094
idi-0.089
 grant-0.089
 finish-0.087
In-0.087
Token fell
Token ID3214
Feature activation+5.19e-01

Pos logit contributions
idd+0.853
rol+0.845
und+0.765
iw+0.762
arre+0.747
Neg logit contributions
The-0.519
 finish-0.503
In-0.496
idi-0.494
 grant-0.494
Token on
Token ID319
Feature activation-7.64e-01

Pos logit contributions
idd+1.002
rol+0.994
und+0.908
iw+0.905
arre+0.890
Neg logit contributions
The-0.452
 finish-0.435
In-0.427
 grant-0.425
 ends-0.425
Token the
Token ID262
Feature activation-2.76e+00

Pos logit contributions
In+0.384
 ends+0.375
 grant+0.375
idi+0.374
 finish+0.363
Neg logit contributions
idd-1.675
rol-1.637
iw-1.580
und-1.575
 batch-1.531
Token floor
Token ID4314
Feature activation+1.11e+00

Pos logit contributions
In+3.372
idi+3.352
 behold+3.226
The+3.049
ace+2.860
Neg logit contributions
idd-4.231
rol-4.146
 batch-4.015
iw-3.968
asser-3.759
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
idd+2.364
rol+2.336
iw+2.158
und+2.146
arre+2.129
Neg logit contributions
The-0.774
 before-0.760
 finish-0.754
 ends-0.709
In-0.704
Token It
Token ID632
Feature activation+1.68e-01

Pos logit contributions
 finish+0.099
The+0.098
 grant+0.098
 ends+0.097
idi+0.097
Neg logit contributions
idd-0.220
rol-0.217
und-0.200
iw-0.200
arre-0.197
Token broke
Token ID6265
Feature activation-2.35e-01

Pos logit contributions
idd+0.305
rol+0.305
und+0.278
iw+0.274
arre+0.270
Neg logit contributions
The-0.158
 finish-0.152
idi-0.152
 grant-0.151
In-0.150
Token into
Token ID656
Feature activation+1.02e+00

Pos logit contributions
 grant+0.149
The+0.148
idi+0.148
 finish+0.146
In+0.145
Neg logit contributions
idd-0.505
rol-0.501
und-0.466
iw-0.459
arre-0.449
Token the
Token ID262
Feature activation-2.29e+00

Pos logit contributions
idd+0.704
rol+0.698
und+0.652
iw+0.649
arre+0.638
Neg logit contributions
The-0.129
 finish-0.122
In-0.119
 grant-0.119
idi-0.116
Token rope
Token ID17182
Feature activation+8.74e-01

Pos logit contributions
idi+2.857
In+2.809
 grant+2.698
ace+2.690
 behold+2.634
Neg logit contributions
idd-3.373
iw-3.053
rol-3.036
 batch-2.972
 poss-2.956
Token and
Token ID290
Feature activation+8.19e-02

Pos logit contributions
idd+1.970
rol+1.943
iw+1.804
und+1.795
arre+1.779
Neg logit contributions
The-0.488
 finish-0.470
 before-0.461
 ends-0.443
In-0.434
Token fell
Token ID3214
Feature activation+8.19e-01

Pos logit contributions
idd+0.148
rol+0.147
und+0.134
iw+0.133
arre+0.129
Neg logit contributions
The-0.079
In-0.076
 grant-0.076
 finish-0.076
idi-0.075
Token on
Token ID319
Feature activation-1.68e-01

Pos logit contributions
idd+1.626
rol+1.607
iw+1.477
und+1.473
arre+1.460
Neg logit contributions
The-0.674
 finish-0.649
 before-0.641
 ends-0.633
In-0.628
Token the
Token ID262
Feature activation-2.75e+00

Pos logit contributions
In+0.061
 grant+0.061
 finish+0.059
The+0.059
 ends+0.058
Neg logit contributions
idd-0.398
rol-0.391
und-0.371
iw-0.368
arre-0.357
Token stick
Token ID4859
Feature activation+1.43e+00

Pos logit contributions
In+2.921
idi+2.835
 behold+2.614
 grant+2.568
ace+2.522
Neg logit contributions
idd-4.573
rol-4.294
 bur-4.166
iw-4.161
 poss-4.156
Token.
Token ID13
Feature activation-3.35e-01

Pos logit contributions
idd+3.204
rol+3.126
iw+2.933
und+2.894
arre+2.886
Neg logit contributions
 before-0.924
 finish-0.903
The-0.898
 ends-0.828
 grant-0.776
Token He
Token ID679
Feature activation-7.20e-02

Pos logit contributions
 grant+0.255
 ends+0.249
 finish+0.247
idi+0.243
The+0.242
Neg logit contributions
idd-0.675
rol-0.661
und-0.626
iw-0.616
arre-0.601
Token y
Token ID331
Feature activation+1.16e+00

Pos logit contributions
The+0.079
 finish+0.078
In+0.077
 grant+0.077
idi+0.077
Neg logit contributions
idd-0.120
rol-0.117
und-0.107
iw-0.106
arre-0.103
Tokenel
Token ID417
Feature activation-4.75e-01

Pos logit contributions
idd+1.450
rol+1.430
iw+1.247
und+1.207
arre+1.190
Neg logit contributions
The-1.792
 finish-1.774
 ends-1.753
In-1.733
 before-1.708
TokenOkay
Token ID16454
Feature activation-7.59e-01

Pos logit contributions
idd+0.161
rol+0.159
iw+0.144
und+0.142
arre+0.137
Neg logit contributions
 finish-0.126
idi-0.125
The-0.124
 ends-0.124
 before-0.123
Token,
Token ID11
Feature activation-4.25e-01

Pos logit contributions
In+0.752
The+0.738
idi+0.731
 grant+0.719
 ends+0.696
Neg logit contributions
idd-1.340
rol-1.293
iw-1.196
und-1.193
arre-1.160
Token Tom
Token ID4186
Feature activation-5.36e-01

Pos logit contributions
The+0.381
In+0.374
idi+0.370
 grant+0.359
 ends+0.357
Neg logit contributions
idd-0.786
rol-0.772
iw-0.715
und-0.696
arre-0.687
Token.
Token ID13
Feature activation-2.21e-02

Pos logit contributions
The+0.527
 grant+0.518
In+0.512
idi+0.511
 finish+0.501
Neg logit contributions
idd-0.969
rol-0.930
iw-0.881
und-0.875
arre-0.835
Token That
Token ID1320
Feature activation+1.29e+00

Pos logit contributions
The+0.027
idi+0.027
 finish+0.026
In+0.026
 ends+0.026
Neg logit contributions
idd-0.034
rol-0.034
iw-0.030
und-0.030
arre-0.029
Token sounds
Token ID5238
Feature activation-2.75e+00

Pos logit contributions
idd+2.595
rol+2.588
iw+2.368
und+2.332
arre+2.278
Neg logit contributions
The-1.119
 finish-1.114
 ends-1.109
 before-1.047
In-1.044
Token fun
Token ID1257
Feature activation-1.25e-01

Pos logit contributions
idi+4.881
The+4.562
 grant+4.487
ace+4.431
ers+4.385
Neg logit contributions
 sorts-2.498
rol-2.489
 Good-2.365
idd-2.266
asser-2.125
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
The+0.118
idi+0.114
In+0.113
 grant+0.112
 finish+0.111
Neg logit contributions
idd-0.232
rol-0.227
und-0.208
iw-0.207
arre-0.202
Token Let
Token ID3914
Feature activation+7.91e-01

Pos logit contributions
The+0.277
idi+0.275
 finish+0.272
 ends+0.272
ers+0.271
Neg logit contributions
idd-0.359
rol-0.354
iw-0.318
und-0.315
arre-0.301
Token's
Token ID338
Feature activation-8.88e-01

Pos logit contributions
idd+1.476
rol+1.465
iw+1.333
und+1.323
arre+1.289
Neg logit contributions
The-0.773
 finish-0.746
 ends-0.732
In-0.721
idi-0.719
Token go
Token ID467
Feature activation-2.67e-01

Pos logit contributions
idi+0.801
The+0.786
ers+0.768
In+0.768
isy+0.743
Neg logit contributions
idd-1.636
rol-1.582
und-1.509
arre-1.452
iw-1.426
Token mom
Token ID1995
Feature activation+2.35e-01

Pos logit contributions
The+0.904
In+0.881
 grant+0.827
ers+0.818
 finish+0.814
Neg logit contributions
idd-1.842
rol-1.658
iw-1.623
rou-1.605
arre-1.594
Token said
Token ID531
Feature activation+8.57e-02

Pos logit contributions
idd+0.400
rol+0.394
und+0.358
iw+0.354
arre+0.351
Neg logit contributions
The-0.258
 finish-0.251
 grant-0.249
In-0.247
 ends-0.246
Token,
Token ID11
Feature activation-1.74e-02

Pos logit contributions
idd+0.172
rol+0.169
und+0.156
iw+0.156
arre+0.153
Neg logit contributions
The-0.067
 finish-0.066
In-0.065
 ends-0.064
idi-0.064
Token "
Token ID366
Feature activation+6.10e-01

Pos logit contributions
The+0.017
 finish+0.017
 ends+0.017
idi+0.016
In+0.016
Neg logit contributions
idd-0.032
rol-0.031
und-0.028
iw-0.028
arre-0.028
TokenThat
Token ID2504
Feature activation+1.87e-01

Pos logit contributions
idd+0.973
rol+0.964
und+0.862
iw+0.857
arre+0.842
Neg logit contributions
The-0.750
 finish-0.723
In-0.718
 ends-0.712
idi-0.711
Token sounds
Token ID5238
Feature activation-2.74e+00

Pos logit contributions
idd+0.317
rol+0.312
und+0.285
iw+0.279
arre+0.279
Neg logit contributions
The-0.205
 finish-0.198
In-0.197
 grant-0.196
idi-0.195
Token very
Token ID845
Feature activation+2.11e-01

Pos logit contributions
idi+3.854
The+3.657
 grant+3.431
ace+3.429
In+3.412
Neg logit contributions
rol-3.653
 Good-3.439
 sorts-3.393
idd-3.276
und-3.246
Token exciting
Token ID7895
Feature activation-1.25e-01

Pos logit contributions
idd+0.391
rol+0.385
und+0.353
iw+0.351
arre+0.346
Neg logit contributions
The-0.196
 finish-0.188
 grant-0.187
idi-0.186
 ends-0.186
Token.
Token ID13
Feature activation-9.80e-02

Pos logit contributions
The+0.112
In+0.107
 finish+0.106
idi+0.106
 grant+0.106
Neg logit contributions
idd-0.235
rol-0.232
und-0.215
iw-0.212
arre-0.207
Token But
Token ID887
Feature activation+4.47e-01

Pos logit contributions
The+0.114
 finish+0.112
 ends+0.111
idi+0.111
In+0.111
Neg logit contributions
idd-0.159
rol-0.156
iw-0.140
und-0.140
arre-0.134
Token why
Token ID1521
Feature activation-3.32e-02

Pos logit contributions
idd+0.796
rol+0.788
und+0.716
iw+0.715
arre+0.700
Neg logit contributions
The-0.450
 finish-0.434
In-0.428
idi-0.426
 ends-0.426
Token the
Token ID262
Feature activation-9.27e-01

Pos logit contributions
In+0.683
ers+0.615
The+0.613
isy+0.598
 ends+0.593
Neg logit contributions
idd-2.329
rol-2.268
iw-2.155
und-2.109
 batch-2.055
Token face
Token ID1986
Feature activation-2.02e-01

Pos logit contributions
In+0.623
ers+0.618
The+0.603
idi+0.582
 grant+0.570
Neg logit contributions
idd-1.963
rol-1.786
iw-1.765
und-1.739
arre-1.729
Token.
Token ID13
Feature activation-2.21e-01

Pos logit contributions
The+0.174
In+0.168
 finish+0.168
 ends+0.166
idi+0.166
Neg logit contributions
idd-0.390
rol-0.382
iw-0.347
und-0.347
arre-0.334
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
 ends+0.209
 finish+0.209
The+0.207
ers+0.200
In+0.198
Neg logit contributions
idd-0.404
rol-0.392
iw-0.362
und-0.362
arre-0.346
Token
Token ID198
Feature activation-1.41e+00

Pos logit contributions
 ends+1.115
 finish+1.089
 before+1.086
 grant+1.053
ace+1.048
Neg logit contributions
idd-2.002
rol-1.856
und-1.812
iw-1.727
arre-1.669
TokenLittle
Token ID22253
Feature activation-2.74e+00

Pos logit contributions
 ends+1.453
 before+1.408
ace+1.382
 grant+1.371
 finish+1.350
Neg logit contributions
idd-2.534
rol-2.307
und-2.273
iw-2.205
oc-2.195
Token boy
Token ID2933
Feature activation-7.65e-01

Pos logit contributions
ace+3.991
 grant+3.883
isy+3.640
In+3.604
oster+3.552
Neg logit contributions
idd-4.046
und-3.608
iw-3.597
joy-3.394
 Mechan-3.315
Token smiled
Token ID13541
Feature activation+3.67e-01

Pos logit contributions
The+0.806
 ends+0.802
 grant+0.800
In+0.795
 finish+0.788
Neg logit contributions
idd-1.341
rol-1.244
und-1.156
iw-1.138
arre-1.112
Token shy
Token ID15800
Feature activation-3.29e-01

Pos logit contributions
idd+0.793
rol+0.787
und+0.727
iw+0.724
arre+0.716
Neg logit contributions
The-0.230
 finish-0.219
 grant-0.215
In-0.215
 ends-0.214
Tokenly
Token ID306
Feature activation-5.40e-01

Pos logit contributions
The+0.263
In+0.254
 grant+0.246
idi+0.240
 finish+0.240
Neg logit contributions
idd-0.664
rol-0.656
und-0.608
arre-0.600
iw-0.598
Token and
Token ID290
Feature activation+2.51e-01

Pos logit contributions
The+0.474
In+0.463
 grant+0.459
idi+0.452
 finish+0.444
Neg logit contributions
idd-1.031
rol-0.997
und-0.935
arre-0.919
iw-0.913
Token.
Token ID13
Feature activation-2.69e-01

Pos logit contributions
idd+0.899
rol+0.893
und+0.821
iw+0.816
arre+0.804
Neg logit contributions
The-0.324
 finish-0.311
 grant-0.305
In-0.304
idi-0.302
Token It
Token ID632
Feature activation+3.50e-01

Pos logit contributions
 finish+0.244
The+0.240
 ends+0.239
idi+0.238
In+0.233
Neg logit contributions
idd-0.501
rol-0.493
und-0.457
iw-0.452
arre-0.439
Token was
Token ID373
Feature activation-3.51e-01

Pos logit contributions
idd+0.686
rol+0.681
und+0.623
iw+0.620
arre+0.608
Neg logit contributions
The-0.292
 finish-0.281
 grant-0.275
In-0.275
 ends-0.272
Token rolling
Token ID10708
Feature activation+1.46e-01

Pos logit contributions
The+0.378
In+0.362
 grant+0.361
 finish+0.354
idi+0.353
Neg logit contributions
idd-0.598
rol-0.598
und-0.530
iw-0.528
rou-0.516
Token down
Token ID866
Feature activation-1.13e-02

Pos logit contributions
idd+0.291
rol+0.290
und+0.267
iw+0.262
arre+0.258
Neg logit contributions
The-0.114
 grant-0.112
 finish-0.112
idi-0.111
In-0.110
Token the
Token ID262
Feature activation-2.73e+00

Pos logit contributions
 grant+0.004
The+0.004
idi+0.004
 finish+0.004
In+0.004
Neg logit contributions
idd-0.027
rol-0.027
und-0.025
iw-0.025
arre-0.024
Token hill
Token ID12788
Feature activation+5.41e-01

Pos logit contributions
idi+4.447
 grant+4.327
ace+4.292
In+4.211
 behold+4.130
Neg logit contributions
idd-3.385
rol-3.047
rob-2.917
iw-2.913
und-2.780
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
idd+1.106
rol+1.098
und+1.009
iw+1.006
arre+0.991
Neg logit contributions
The-0.405
 finish-0.387
In-0.378
 ends-0.376
 grant-0.376
Token It
Token ID632
Feature activation+3.79e-01

Pos logit contributions
 finish+0.383
 ends+0.377
idi+0.373
The+0.372
 grant+0.363
Neg logit contributions
idd-0.753
rol-0.742
und-0.688
iw-0.680
arre-0.657
Token was
Token ID373
Feature activation-5.94e-01

Pos logit contributions
idd+0.720
rol+0.714
und+0.651
iw+0.647
arre+0.636
Neg logit contributions
The-0.340
 finish-0.327
 grant-0.322
In-0.321
idi-0.318
Token going
Token ID1016
Feature activation-3.88e-01

Pos logit contributions
The+0.675
 grant+0.665
In+0.651
idi+0.639
 ends+0.637
Neg logit contributions
rol-0.974
idd-0.969
und-0.853
iw-0.845
rou-0.839
Token Sam
Token ID3409
Feature activation+4.21e-02

Pos logit contributions
The+0.107
idi+0.106
In+0.106
 finish+0.104
 ends+0.104
Neg logit contributions
idd-0.192
rol-0.185
iw-0.174
und-0.172
arre-0.165
Token says
Token ID1139
Feature activation-6.25e-01

Pos logit contributions
idd+0.053
rol+0.053
und+0.046
iw+0.046
arre+0.045
Neg logit contributions
The-0.063
In-0.062
 grant-0.062
idi-0.061
 finish-0.061
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
In+0.623
The+0.622
idi+0.610
 grant+0.582
 ends+0.570
Neg logit contributions
idd-1.121
rol-1.087
iw-0.995
und-0.969
arre-0.964
Token "
Token ID366
Feature activation-3.66e-01

Pos logit contributions
 grant+0.247
 finish+0.247
 ends+0.244
The+0.243
idi+0.242
Neg logit contributions
idd-0.448
rol-0.433
und-0.399
iw-0.397
arre-0.387
TokenThat
Token ID2504
Feature activation+8.01e-01

Pos logit contributions
idi+0.296
 grant+0.292
 finish+0.290
 ends+0.288
 before+0.278
Neg logit contributions
idd-0.710
rol-0.688
iw-0.643
und-0.638
arre-0.631
Token sounds
Token ID5238
Feature activation-2.73e+00

Pos logit contributions
idd+1.490
rol+1.480
iw+1.344
und+1.342
arre+1.311
Neg logit contributions
The-0.777
 finish-0.750
 ends-0.744
In-0.730
 grant-0.727
Token fun
Token ID1257
Feature activation-3.91e-02

Pos logit contributions
idi+4.929
oster+4.472
 grant+4.432
The+4.405
ers+4.405
Neg logit contributions
rol-2.485
 Good-2.377
 sorts-2.374
idd-2.219
und-2.164
Token."
Token ID526
Feature activation-7.67e-01

Pos logit contributions
The+0.038
idi+0.037
In+0.037
 grant+0.037
 finish+0.036
Neg logit contributions
idd-0.071
rol-0.070
und-0.064
iw-0.063
arre-0.062
Token
Token ID198
Feature activation-2.94e-01

Pos logit contributions
 grant+0.763
 ends+0.755
idi+0.754
 finish+0.743
ers+0.718
Neg logit contributions
idd-1.366
rol-1.318
iw-1.240
und-1.237
arre-1.190
Token
Token ID198
Feature activation-3.84e-01

Pos logit contributions
 finish+0.254
 grant+0.252
idi+0.250
 before+0.246
 ends+0.244
Neg logit contributions
idd-0.556
rol-0.543
und-0.507
iw-0.497
arre-0.487
TokenThey
Token ID2990
Feature activation-3.62e-01

Pos logit contributions
 grant+0.400
idi+0.394
 finish+0.388
 before+0.388
 ends+0.384
Neg logit contributions
idd-0.683
rol-0.652
und-0.612
iw-0.603
arre-0.589
Token,
Token ID11
Feature activation-7.48e-02

Pos logit contributions
The+0.008
In+0.008
 finish+0.008
idi+0.008
 ends+0.008
Neg logit contributions
idd-0.021
rol-0.021
iw-0.019
und-0.019
arre-0.019
Token "
Token ID366
Feature activation+1.98e-01

Pos logit contributions
The+0.071
 ends+0.070
 finish+0.070
In+0.069
idi+0.068
Neg logit contributions
idd-0.136
rol-0.132
iw-0.123
und-0.121
arre-0.119
TokenOK
Token ID11380
Feature activation-7.30e-01

Pos logit contributions
idd+0.312
rol+0.306
iw+0.278
und+0.275
arre+0.267
Neg logit contributions
 finish-0.238
The-0.236
 grant-0.235
 ends-0.234
idi-0.233
Token.
Token ID13
Feature activation-1.68e-01

Pos logit contributions
In+0.698
idi+0.691
The+0.691
 grant+0.677
 ends+0.657
Neg logit contributions
idd-1.296
rol-1.257
iw-1.165
und-1.157
arre-1.149
Token That
Token ID1320
Feature activation+1.37e+00

Pos logit contributions
The+0.195
idi+0.190
 finish+0.189
In+0.187
 grant+0.187
Neg logit contributions
idd-0.271
rol-0.266
iw-0.243
und-0.238
arre-0.228
Token sounds
Token ID5238
Feature activation-2.73e+00

Pos logit contributions
idd+2.690
rol+2.681
iw+2.436
und+2.404
arre+2.309
Neg logit contributions
The-1.300
 ends-1.258
 finish-1.257
In-1.205
 before-1.187
Token good
Token ID922
Feature activation+6.05e-01

Pos logit contributions
idi+4.532
 grant+4.240
ers+4.084
ace+4.067
oster+4.053
Neg logit contributions
 Good-2.847
rol-2.781
 sorts-2.766
arre-2.560
 Enjoy-2.516
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
idd+1.233
rol+1.225
und+1.124
iw+1.119
arre+1.104
Neg logit contributions
The-0.470
 finish-0.452
 ends-0.442
In-0.440
 grant-0.439
Token That
Token ID1320
Feature activation+1.40e+00

Pos logit contributions
The+0.255
idi+0.251
 finish+0.249
 ends+0.248
ers+0.247
Neg logit contributions
idd-0.333
rol-0.327
iw-0.296
und-0.292
arre-0.280
Token sounds
Token ID5238
Feature activation-2.61e+00

Pos logit contributions
idd+2.808
rol+2.794
iw+2.532
und+2.516
arre+2.420
Neg logit contributions
The-1.272
 ends-1.247
 finish-1.244
 before-1.186
In-1.172
Token comfortable
Token ID6792
Feature activation-1.59e-01

Pos logit contributions
idi+4.308
ace+3.938
 grant+3.896
oster+3.879
ers+3.860
Neg logit contributions
 Good-2.790
 sorts-2.778
rol-2.747
arre-2.547
idd-2.527
Token saw
Token ID2497
Feature activation+6.48e-01

Pos logit contributions
idd+0.717
rol+0.708
und+0.643
iw+0.640
arre+0.631
Neg logit contributions
The-0.416
 finish-0.401
In-0.396
 grant-0.393
 ends-0.393
Token a
Token ID257
Feature activation-2.60e-02

Pos logit contributions
idd+1.348
rol+1.340
und+1.230
iw+1.227
arre+1.212
Neg logit contributions
The-0.479
 finish-0.451
In-0.441
 ends-0.440
idi-0.439
Token tie
Token ID9839
Feature activation-3.43e-01

Pos logit contributions
The+0.020
In+0.020
 finish+0.019
ers+0.019
idi+0.019
Neg logit contributions
idd-0.052
rol-0.050
iw-0.047
und-0.046
arre-0.046
Token lying
Token ID9105
Feature activation+1.71e-01

Pos logit contributions
The+0.261
idi+0.259
In+0.249
 finish+0.244
 ends+0.242
Neg logit contributions
rol-0.681
idd-0.680
und-0.625
iw-0.614
arre-0.605
Token on
Token ID319
Feature activation-7.03e-01

Pos logit contributions
idd+0.358
rol+0.356
und+0.328
iw+0.327
arre+0.318
Neg logit contributions
The-0.120
 finish-0.114
 grant-0.112
In-0.111
idi-0.111
Token the
Token ID262
Feature activation-2.72e+00

Pos logit contributions
In+0.347
 grant+0.338
idi+0.329
 finish+0.327
The+0.317
Neg logit contributions
idd-1.522
rol-1.519
und-1.466
iw-1.441
 batch-1.422
Token ground
Token ID2323
Feature activation+4.31e-01

Pos logit contributions
idi+2.411
In+2.284
 behold+2.268
The+2.025
ace+1.994
Neg logit contributions
idd-5.038
rol-4.750
iw-4.584
 poss-4.493
 batch-4.458
Token.
Token ID13
Feature activation-5.80e-01

Pos logit contributions
idd+0.873
rol+0.867
und+0.798
iw+0.794
arre+0.781
Neg logit contributions
The-0.327
 finish-0.313
 grant-0.307
idi-0.307
In-0.307
Token She
Token ID1375
Feature activation+1.30e-02

Pos logit contributions
ers+0.706
 finish+0.704
 grant+0.701
 ends+0.690
idi+0.687
Neg logit contributions
idd-0.882
rol-0.871
und-0.804
iw-0.798
arre-0.754
Token picked
Token ID6497
Feature activation+3.08e-01

Pos logit contributions
idd+0.022
rol+0.022
und+0.020
iw+0.020
arre+0.019
Neg logit contributions
The-0.014
 finish-0.013
 before-0.013
 grant-0.013
 ends-0.013
Token it
Token ID340
Feature activation-7.33e-02

Pos logit contributions
idd+0.564
rol+0.560
und+0.508
iw+0.508
arre+0.497
Neg logit contributions
The-0.292
 finish-0.284
In-0.281
 grant-0.279
idi-0.279
Token friend
Token ID1545
Feature activation-3.82e-01

Pos logit contributions
idd+1.306
rol+1.300
und+1.184
iw+1.176
arre+1.156
Neg logit contributions
The-0.685
 finish-0.663
 before-0.647
 ends-0.647
In-0.645
Token?"
Token ID1701
Feature activation-1.11e+00

Pos logit contributions
The+0.447
In+0.444
idi+0.443
 finish+0.441
 ends+0.440
Neg logit contributions
idd-0.597
rol-0.592
und-0.532
iw-0.529
arre-0.520
Token
Token ID198
Feature activation-6.95e-01

Pos logit contributions
 ends+1.041
ace+1.004
 grant+1.000
ers+0.987
idi+0.962
Neg logit contributions
idd-2.059
rol-1.957
iw-1.919
und-1.896
arre-1.824
Token
Token ID198
Feature activation-7.98e-01

Pos logit contributions
 grant+0.636
 ends+0.633
 finish+0.630
 before+0.629
ace+0.617
Neg logit contributions
idd-1.292
rol-1.243
und-1.176
iw-1.152
rou-1.123
TokenThe
Token ID464
Feature activation+9.27e-01

Pos logit contributions
 ends+0.367
 before+0.366
 grant+0.356
ace+0.347
 finish+0.334
Neg logit contributions
idd-1.846
rol-1.787
und-1.716
iw-1.685
arre-1.675
Token fake
Token ID8390
Feature activation+1.59e+00

Pos logit contributions
rol+1.867
idd+1.867
iw+1.704
und+1.702
arre+1.652
Neg logit contributions
The-0.777
 finish-0.732
In-0.714
 ends-0.712
idi-0.712
Token pup
Token ID15552
Feature activation-3.01e-01

Pos logit contributions
rol+2.896
idd+2.822
iw+2.578
und+2.552
arre+2.436
Neg logit contributions
The-1.721
 ends-1.660
 before-1.657
idi-1.649
 finish-1.635
Token smiled
Token ID13541
Feature activation+3.80e-01

Pos logit contributions
The+0.318
 finish+0.307
In+0.306
 grant+0.304
 ends+0.304
Neg logit contributions
idd-0.510
rol-0.500
und-0.455
iw-0.454
arre-0.448
Token and
Token ID290
Feature activation+6.04e-01

Pos logit contributions
idd+0.841
rol+0.835
und+0.774
iw+0.770
arre+0.764
Neg logit contributions
The-0.217
 finish-0.206
 grant-0.201
In-0.201
 ends-0.199
Token said
Token ID531
Feature activation+7.15e-01

Pos logit contributions
idd+1.010
rol+1.002
und+0.901
iw+0.898
arre+0.882
Neg logit contributions
The-0.681
 finish-0.661
In-0.650
idi-0.649
 ends-0.648
Token "
Token ID366
Feature activation+6.09e-01

Pos logit contributions
idd+1.415
rol+1.406
und+1.286
iw+1.284
arre+1.266
Neg logit contributions
The-0.592
 finish-0.564
 grant-0.557
In-0.553
 before-0.552
Token puppy
Token ID26188
Feature activation+4.95e-01

Pos logit contributions
idd+0.101
rol+0.099
iw+0.087
und+0.087
arre+0.086
Neg logit contributions
The-0.093
In-0.091
idi-0.091
 finish-0.091
 grant-0.090
Token back
Token ID736
Feature activation+4.51e-01

Pos logit contributions
idd+0.944
rol+0.937
und+0.855
iw+0.852
arre+0.839
Neg logit contributions
The-0.436
 finish-0.419
In-0.412
 grant-0.411
idi-0.410
Token to
Token ID284
Feature activation+1.25e+00

Pos logit contributions
idd+0.817
rol+0.810
und+0.736
iw+0.733
arre+0.718
Neg logit contributions
The-0.447
 finish-0.431
 grant-0.425
In-0.425
 ends-0.423
Token its
Token ID663
Feature activation-4.59e-03

Pos logit contributions
idd+2.737
rol+2.735
iw+2.501
arre+2.477
und+2.473
Neg logit contributions
 finish-0.840
The-0.839
 before-0.797
 grant-0.786
idi-0.765
Token family
Token ID1641
Feature activation-3.75e-01

Pos logit contributions
The+0.006
 grant+0.006
In+0.006
 ends+0.006
idi+0.006
Neg logit contributions
idd-0.007
rol-0.007
iw-0.006
und-0.006
arre-0.006
Token who
Token ID508
Feature activation+1.74e+00

Pos logit contributions
The+0.332
idi+0.328
In+0.327
 grant+0.323
 ends+0.319
Neg logit contributions
idd-0.697
rol-0.695
und-0.636
iw-0.632
arre-0.616
Token were
Token ID547
Feature activation-5.21e-02

Pos logit contributions
rol+3.466
idd+3.425
arre+3.137
iw+3.113
und+3.082
Neg logit contributions
The-1.537
 finish-1.444
In-1.429
 ends-1.410
idi-1.403
Token over
Token ID625
Feature activation+3.58e-01

Pos logit contributions
The+0.053
 ends+0.052
 grant+0.052
In+0.052
 finish+0.052
Neg logit contributions
idd-0.093
rol-0.092
und-0.082
iw-0.081
arre-0.080
Tokenjoy
Token ID2633
Feature activation+2.37e-01

Pos logit contributions
idd+0.688
rol+0.679
und+0.623
iw+0.622
arre+0.607
Neg logit contributions
The-0.314
 finish-0.304
 ends-0.299
 grant-0.299
In-0.298
Tokened
Token ID276
Feature activation-3.61e-01

Pos logit contributions
idd+0.304
rol+0.299
und+0.263
iw+0.257
arre+0.251
Neg logit contributions
The-0.359
 finish-0.354
 grant-0.350
idi-0.350
In-0.348
Token to
Token ID284
Feature activation+2.30e-02

Pos logit contributions
The+0.316
 grant+0.314
In+0.313
idi+0.312
 finish+0.311
Neg logit contributions
idd-0.677
rol-0.672
und-0.625
iw-0.613
arre-0.597
Token the
Token ID262
Feature activation+3.24e-01

Pos logit contributions
idd+1.477
rol+1.461
und+1.342
iw+1.339
arre+1.319
Neg logit contributions
The-0.525
 finish-0.504
 before-0.494
 ends-0.489
In-0.487
Token stick
Token ID4859
Feature activation+9.26e-02

Pos logit contributions
idd+0.614
rol+0.606
iw+0.557
und+0.555
arre+0.546
Neg logit contributions
The-0.293
 finish-0.278
idi-0.277
In-0.277
 grant-0.275
Token and
Token ID290
Feature activation+3.31e-02

Pos logit contributions
idd+0.180
rol+0.179
und+0.164
iw+0.164
arre+0.160
Neg logit contributions
The-0.077
idi-0.074
In-0.073
 finish-0.072
 grant-0.072
Token chased
Token ID26172
Feature activation-4.24e-01

Pos logit contributions
idd+0.062
rol+0.061
iw+0.057
und+0.056
arre+0.055
Neg logit contributions
The-0.030
In-0.029
 finish-0.028
 grant-0.028
idi-0.028
Token the
Token ID262
Feature activation-3.49e-01

Pos logit contributions
 grant+0.348
In+0.348
idi+0.344
The+0.340
 finish+0.336
Neg logit contributions
idd-0.808
rol-0.796
iw-0.752
und-0.741
arre-0.711
Token squirrel
Token ID33039
Feature activation+1.60e+00

Pos logit contributions
idi+0.381
The+0.379
In+0.372
 before+0.361
 grant+0.361
Neg logit contributions
idd-0.588
rol-0.568
iw-0.534
und-0.519
arre-0.503
Token.
Token ID13
Feature activation-2.71e-01

Pos logit contributions
idd+3.549
rol+3.475
arre+3.222
und+3.220
 Mechan+3.183
Neg logit contributions
The-1.084
 before-1.074
 finish-1.057
 ends-1.005
In-0.943
Token The
Token ID383
Feature activation+3.46e-01

Pos logit contributions
idi+0.255
The+0.251
ers+0.249
 ends+0.248
 grant+0.247
Neg logit contributions
idd-0.486
rol-0.478
und-0.448
iw-0.448
arre-0.438
Token squirrel
Token ID33039
Feature activation+1.15e+00

Pos logit contributions
idd+0.621
rol+0.612
iw+0.559
und+0.558
arre+0.545
Neg logit contributions
The-0.347
 finish-0.336
idi-0.333
In-0.332
 grant-0.331
Token ran
Token ID4966
Feature activation+3.47e-01

Pos logit contributions
idd+2.262
rol+2.252
und+2.044
arre+2.034
iw+2.033
Neg logit contributions
The-0.995
 finish-0.954
 ends-0.948
 before-0.937
In-0.932
Token up
Token ID510
Feature activation-1.71e-03

Pos logit contributions
idd+0.642
rol+0.638
und+0.581
iw+0.579
arre+0.567
Neg logit contributions
The-0.324
 finish-0.310
 grant-0.310
In-0.308
idi-0.307
Token when
Token ID618
Feature activation-2.72e-01

Pos logit contributions
The+0.178
 grant+0.175
 ends+0.172
 finish+0.171
In+0.171
Neg logit contributions
rol-0.323
idd-0.322
und-0.290
arre-0.287
iw-0.286
Token they
Token ID484
Feature activation+6.89e-01

Pos logit contributions
The+0.221
 grant+0.219
 finish+0.215
In+0.215
 ends+0.212
Neg logit contributions
idd-0.523
rol-0.515
und-0.479
iw-0.473
arre-0.466
Token united
Token ID16503
Feature activation-2.84e-01

Pos logit contributions
idd+1.289
rol+1.276
und+1.160
iw+1.159
arre+1.139
Neg logit contributions
The-0.644
 finish-0.625
In-0.611
 ends-0.608
 grant-0.608
Token to
Token ID284
Feature activation+1.28e-01

Pos logit contributions
The+0.243
In+0.232
 grant+0.232
idi+0.228
 finish+0.225
Neg logit contributions
rol-0.542
idd-0.537
und-0.502
arre-0.490
iw-0.482
Token fly
Token ID6129
Feature activation+5.01e-02

Pos logit contributions
idd+0.274
rol+0.273
und+0.254
iw+0.251
arre+0.248
Neg logit contributions
The-0.078
In-0.074
idi-0.072
 finish-0.072
 ends-0.072
Token k
Token ID479
Feature activation+1.93e+00

Pos logit contributions
rol+0.091
idd+0.091
und+0.083
iw+0.081
arre+0.081
Neg logit contributions
The-0.048
 finish-0.046
In-0.046
 grant-0.046
idi-0.046
Tokenites
Token ID2737
Feature activation+9.34e-02

Pos logit contributions
rol+2.783
idd+2.731
arre+2.529
 Mechan+2.472
 batch+2.449
Neg logit contributions
The-2.576
In-2.434
ers-2.429
 ends-2.404
 finish-2.369
Token in
Token ID287
Feature activation+7.76e-01

Pos logit contributions
rol+0.168
idd+0.167
und+0.153
iw+0.150
arre+0.148
Neg logit contributions
The-0.091
idi-0.089
In-0.088
 finish-0.087
 grant-0.087
Token the
Token ID262
Feature activation+2.52e-01

Pos logit contributions
idd+1.887
rol+1.875
iw+1.739
und+1.737
arre+1.724
Neg logit contributions
The-0.299
 finish-0.271
 ends-0.256
 before-0.254
In-0.251
Token bright
Token ID6016
Feature activation-7.21e-01

Pos logit contributions
idd+0.490
rol+0.488
und+0.448
iw+0.445
arre+0.437
Neg logit contributions
The-0.207
 grant-0.199
 finish-0.198
idi-0.197
In-0.197
Token blue
Token ID4171
Feature activation+6.02e-01

Pos logit contributions
 grant+0.648
The+0.630
In+0.624
idi+0.616
 finish+0.597
Neg logit contributions
rol-1.351
idd-1.346
und-1.275
iw-1.211
arre-1.198
Token told
Token ID1297
Feature activation-6.18e-02

Pos logit contributions
idd+0.139
rol+0.135
und+0.124
iw+0.122
arre+0.121
Neg logit contributions
The-0.094
 finish-0.093
 ends-0.091
 grant-0.091
In-0.091
Token the
Token ID262
Feature activation-4.13e-01

Pos logit contributions
In+0.038
The+0.038
 finish+0.036
 ends+0.035
 grant+0.035
Neg logit contributions
idd-0.132
rol-0.131
und-0.123
iw-0.122
arre-0.119
Token rabbit
Token ID22746
Feature activation-1.71e-01

Pos logit contributions
The+0.428
In+0.416
 finish+0.407
idi+0.406
 before+0.399
Neg logit contributions
idd-0.717
rol-0.695
arre-0.643
iw-0.637
und-0.636
Token,
Token ID11
Feature activation+6.13e-01

Pos logit contributions
The+0.138
 finish+0.136
In+0.133
 grant+0.130
 ends+0.130
Neg logit contributions
rol-0.333
idd-0.332
und-0.305
iw-0.303
arre-0.299
Token "
Token ID366
Feature activation+6.56e-01

Pos logit contributions
idd+1.196
rol+1.188
und+1.086
iw+1.083
arre+1.066
Neg logit contributions
The-0.521
 finish-0.498
In-0.489
 grant-0.488
idi-0.486
TokenI
Token ID40
Feature activation+1.82e+00

Pos logit contributions
idd+1.128
rol+1.119
und+1.008
iw+1.005
arre+0.988
Neg logit contributions
The-0.722
 finish-0.689
In-0.686
 grant-0.680
idi-0.678
Token am
Token ID716
Feature activation+7.56e-01

Pos logit contributions
rol+3.651
idd+3.579
und+3.237
iw+3.207
arre+3.200
Neg logit contributions
 grant-1.595
 finish-1.583
The-1.542
In-1.508
 ends-1.494
Token so
Token ID523
Feature activation+9.01e-01

Pos logit contributions
idd+1.192
rol+1.186
und+1.062
iw+1.054
arre+1.036
Neg logit contributions
The-0.924
 finish-0.905
 grant-0.890
 before-0.890
idi-0.884
Token hungry
Token ID14720
Feature activation-7.95e-02

Pos logit contributions
rol+1.396
idd+1.387
und+1.240
iw+1.223
arre+1.210
Neg logit contributions
The-1.134
 finish-1.104
 grant-1.085
In-1.083
idi-1.078
Token,
Token ID11
Feature activation-5.40e-01

Pos logit contributions
The+0.084
idi+0.082
In+0.082
 grant+0.081
 finish+0.080
Neg logit contributions
idd-0.135
rol-0.134
und-0.122
iw-0.122
arre-0.118
Token I
Token ID314
Feature activation+1.32e+00

Pos logit contributions
In+0.487
The+0.486
idi+0.482
 finish+0.468
 ends+0.466
Neg logit contributions
idd-0.986
rol-0.983
iw-0.895
und-0.892
arre-0.872
Token
Token ID198
Feature activation-2.13e-01

Pos logit contributions
 finish+0.332
 ends+0.329
 before+0.327
 grant+0.323
idi+0.313
Neg logit contributions
idd-0.730
rol-0.721
und-0.669
iw-0.648
arre-0.645
TokenThe
Token ID464
Feature activation-5.95e-01

Pos logit contributions
 ends+0.093
 before+0.092
 finish+0.092
 grant+0.088
idi+0.084
Neg logit contributions
idd-0.495
rol-0.491
und-0.461
arre-0.455
iw-0.451
Token monster
Token ID9234
Feature activation-4.93e-01

Pos logit contributions
The+0.397
In+0.378
 grant+0.373
ace+0.359
 finish+0.358
Neg logit contributions
idd-1.260
rol-1.191
iw-1.131
arre-1.130
 batch-1.113
Token stormed
Token ID33997
Feature activation-1.31e-01

Pos logit contributions
The+0.495
In+0.482
 finish+0.477
 grant+0.476
 ends+0.470
Neg logit contributions
idd-0.864
rol-0.850
und-0.781
iw-0.762
arre-0.751
Token off
Token ID572
Feature activation+3.08e-01

Pos logit contributions
The+0.138
In+0.133
 grant+0.131
 finish+0.130
idi+0.128
Neg logit contributions
rol-0.230
idd-0.228
und-0.209
iw-0.200
arre-0.199
Token and
Token ID290
Feature activation+7.13e-01

Pos logit contributions
idd+0.643
rol+0.638
und+0.589
iw+0.585
arre+0.575
Neg logit contributions
The-0.216
 finish-0.204
In-0.203
 ends-0.201
idi-0.201
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
idd+1.491
rol+1.477
und+1.359
iw+1.357
arre+1.342
Neg logit contributions
The-0.508
 finish-0.489
 ends-0.472
 before-0.471
idi-0.470
Token cookies
Token ID14746
Feature activation+3.68e-01

Pos logit contributions
The+0.808
 grant+0.770
In+0.769
idi+0.733
ace+0.705
Neg logit contributions
idd-2.075
rol-1.982
iw-1.869
arre-1.846
 batch-1.844
Token were
Token ID547
Feature activation-7.57e-02

Pos logit contributions
idd+0.652
rol+0.649
und+0.588
iw+0.582
arre+0.573
Neg logit contributions
The-0.371
 finish-0.356
In-0.353
idi-0.353
 grant-0.351
Token safely
Token ID11512
Feature activation+2.65e-01

Pos logit contributions
The+0.078
 grant+0.077
In+0.076
 finish+0.075
idi+0.075
Neg logit contributions
rol-0.132
idd-0.131
und-0.118
iw-0.116
arre-0.115
Token back
Token ID736
Feature activation+4.86e-01

Pos logit contributions
idd+0.491
rol+0.489
und+0.444
iw+0.439
arre+0.434
Neg logit contributions
The-0.248
 grant-0.236
idi-0.236
 finish-0.236
In-0.235
Token at
Token ID379
Feature activation-1.09e-01

Pos logit contributions
The+0.120
 grant+0.118
In+0.116
 finish+0.116
idi+0.115
Neg logit contributions
idd-0.288
rol-0.282
und-0.264
iw-0.261
arre-0.256
Token the
Token ID262
Feature activation-3.98e-01

Pos logit contributions
The+0.079
 finish+0.078
In+0.078
 grant+0.075
 ends+0.075
Neg logit contributions
idd-0.224
rol-0.221
und-0.207
iw-0.205
arre-0.196
Token girl
Token ID2576
Feature activation-7.03e-01

Pos logit contributions
The+0.287
In+0.283
 grant+0.273
 finish+0.267
idi+0.265
Neg logit contributions
idd-0.808
rol-0.792
iw-0.736
und-0.728
arre-0.726
Token and
Token ID290
Feature activation+3.87e-01

Pos logit contributions
The+0.553
In+0.545
idi+0.527
 grant+0.519
 ends+0.516
Neg logit contributions
idd-1.380
rol-1.376
und-1.274
iw-1.269
arre-1.245
Token said
Token ID531
Feature activation+4.70e-01

Pos logit contributions
idd+0.709
rol+0.703
und+0.640
iw+0.639
arre+0.625
Neg logit contributions
The-0.369
 finish-0.357
In-0.353
 ends-0.350
 grant-0.350
Token,
Token ID11
Feature activation+4.24e-01

Pos logit contributions
idd+0.976
rol+0.969
und+0.892
iw+0.888
arre+0.875
Neg logit contributions
The-0.338
 finish-0.323
In-0.316
 ends-0.314
idi-0.313
Token "
Token ID366
Feature activation+3.34e-01

Pos logit contributions
idd+0.808
rol+0.801
und+0.732
iw+0.729
arre+0.718
Neg logit contributions
The-0.370
 finish-0.358
 ends-0.353
In-0.351
idi-0.350
TokenThank
Token ID10449
Feature activation+3.03e-01

Pos logit contributions
idd+0.529
rol+0.525
iw+0.469
und+0.469
arre+0.456
Neg logit contributions
The-0.398
 finish-0.395
 ends-0.389
 grant-0.386
idi-0.385
Token you
Token ID345
Feature activation+1.80e-01

Pos logit contributions
idd+0.448
rol+0.442
und+0.394
iw+0.393
arre+0.378
Neg logit contributions
The-0.395
 finish-0.391
idi-0.385
 ends-0.385
In-0.384
Token so
Token ID523
Feature activation+1.17e+00

Pos logit contributions
idd+0.317
rol+0.316
und+0.285
iw+0.284
arre+0.279
Neg logit contributions
The-0.182
In-0.177
 finish-0.176
 ends-0.176
idi-0.175
Token much
Token ID881
Feature activation+7.50e-01

Pos logit contributions
rol+1.765
idd+1.756
und+1.556
arre+1.545
iw+1.542
Neg logit contributions
The-1.571
 finish-1.500
 before-1.486
 grant-1.483
In-1.483
Tokenâ„¢
Token ID8151
Feature activation+7.77e-01

Pos logit contributions
idd+0.517
rol+0.507
und+0.454
iw+0.450
arre+0.443
Neg logit contributions
The-0.458
 finish-0.449
 grant-0.444
 ends-0.443
idi-0.443
Tokens
Token ID82
Feature activation-1.37e+00

Pos logit contributions
idd+0.918
rol+0.905
iw+0.779
und+0.776
arre+0.754
Neg logit contributions
The-1.249
 finish-1.217
In-1.209
 ends-1.208
 before-1.201
Token always
Token ID1464
Feature activation+1.64e-01

Pos logit contributions
idi+1.592
The+1.497
 finish+1.415
ers+1.413
 grant+1.403
Neg logit contributions
idd-2.141
rol-2.138
arre-2.108
und-1.969
rou-1.950
Token worth
Token ID2861
Feature activation+5.59e-01

Pos logit contributions
idd+0.285
rol+0.281
und+0.256
iw+0.255
arre+0.254
Neg logit contributions
The-0.169
 finish-0.162
idi-0.161
In-0.161
 grant-0.159
Token asking
Token ID4737
Feature activation-6.97e-01

Pos logit contributions
idd+1.036
rol+1.028
und+0.935
iw+0.932
arre+0.916
Neg logit contributions
The-0.527
 finish-0.508
In-0.499
 ends-0.497
 grant-0.497
Token for
Token ID329
Feature activation+9.48e-02

Pos logit contributions
The+0.669
idi+0.666
In+0.656
 finish+0.646
 grant+0.632
Neg logit contributions
idd-1.211
rol-1.190
und-1.111
arre-1.094
iw-1.076
Token help
Token ID1037
Feature activation-8.12e-01

Pos logit contributions
idd+0.175
rol+0.171
und+0.158
iw+0.157
arre+0.155
Neg logit contributions
The-0.086
In-0.083
 finish-0.083
idi-0.082
 ends-0.082
Token when
Token ID618
Feature activation+1.18e-01

Pos logit contributions
idi+0.699
The+0.681
In+0.675
 grant+0.662
 finish+0.640
Neg logit contributions
idd-1.516
rol-1.501
und-1.414
arre-1.394
rou-1.335
Token you
Token ID345
Feature activation+3.12e-01

Pos logit contributions
idd+0.188
rol+0.182
und+0.166
iw+0.164
arre+0.162
Neg logit contributions
The-0.141
 finish-0.136
In-0.135
idi-0.135
 ends-0.134
Token need
Token ID761
Feature activation+8.67e-03

Pos logit contributions
idd+0.547
rol+0.542
und+0.494
iw+0.489
arre+0.486
Neg logit contributions
The-0.320
idi-0.305
In-0.304
 finish-0.302
 ends-0.300
Token it
Token ID340
Feature activation-5.64e-02

Pos logit contributions
idd+0.016
rol+0.016
und+0.015
iw+0.015
arre+0.014
Neg logit contributions
The-0.008
In-0.007
idi-0.007
 finish-0.007
 ends-0.007
Token wild
Token ID4295
Feature activation+2.22e-01

Pos logit contributions
idd+0.159
rol+0.156
iw+0.142
und+0.141
arre+0.137
Neg logit contributions
The-0.118
 grant-0.116
In-0.115
idi-0.115
 finish-0.115
Token.
Token ID13
Feature activation+6.98e-02

Pos logit contributions
idd+0.408
rol+0.402
und+0.370
iw+0.368
arre+0.357
Neg logit contributions
The-0.214
 finish-0.206
In-0.205
 grant-0.205
idi-0.203
Token<|endoftext|>
Token ID50256
Feature activation-5.94e-01

Pos logit contributions
idd+0.146
rol+0.145
und+0.134
iw+0.133
arre+0.130
Neg logit contributions
The-0.048
 finish-0.047
idi-0.046
 ends-0.046
 grant-0.046
TokenOnce
Token ID7454
Feature activation-3.45e-02

Pos logit contributions
 finish+0.566
 grant+0.565
idi+0.545
The+0.541
 ends+0.535
Neg logit contributions
idd-1.081
rol-1.060
und-0.997
iw-0.955
arre-0.929
Token upon
Token ID2402
Feature activation-2.31e-01

Pos logit contributions
The+0.046
 finish+0.046
 grant+0.046
idi+0.045
In+0.045
Neg logit contributions
idd-0.049
rol-0.048
und-0.043
iw-0.043
arre-0.041
Token a
Token ID257
Feature activation+6.04e-01

Pos logit contributions
 finish+0.179
idi+0.178
 ends+0.176
 grant+0.175
The+0.173
Neg logit contributions
idd-0.441
rol-0.438
und-0.405
iw-0.399
arre-0.393
Token time
Token ID640
Feature activation+1.06e+00

Pos logit contributions
idd+1.565
rol+1.556
und+1.454
iw+1.452
arre+1.433
Neg logit contributions
The-0.125
 finish-0.104
In-0.097
idi-0.092
 ends-0.091
Token,
Token ID11
Feature activation-9.08e-01

Pos logit contributions
idd+2.348
rol+2.346
iw+2.154
und+2.141
arre+2.135
Neg logit contributions
The-0.639
 before-0.592
In-0.588
 finish-0.574
 ends-0.566
Token there
Token ID612
Feature activation+1.07e-01

Pos logit contributions
 finish+1.142
 grant+1.120
idi+1.111
ers+1.107
 ends+1.082
Neg logit contributions
idd-1.384
rol-1.343
und-1.269
iw-1.197
arre-1.153
Token was
Token ID373
Feature activation-5.72e-02

Pos logit contributions
idd+0.208
rol+0.206
und+0.191
iw+0.187
arre+0.184
Neg logit contributions
The-0.087
 finish-0.086
 grant-0.084
idi-0.083
In-0.082
Token a
Token ID257
Feature activation-2.63e-01

Pos logit contributions
 grant+0.044
 finish+0.043
The+0.043
 ends+0.043
idi+0.042
Neg logit contributions
idd-0.114
rol-0.112
und-0.104
iw-0.103
arre-0.101
Token!
Token ID0
Feature activation+1.68e-01

Pos logit contributions
idd+0.561
rol+0.550
und+0.501
iw+0.498
arre+0.487
Neg logit contributions
The-0.385
 finish-0.373
In-0.371
idi-0.370
 grant-0.368
Token That
Token ID1320
Feature activation+3.81e-01

Pos logit contributions
idd+0.242
rol+0.237
und+0.210
iw+0.210
arre+0.203
Neg logit contributions
The-0.228
 finish-0.224
idi-0.222
 ends-0.222
In-0.220
Token's
Token ID338
Feature activation-1.21e+00

Pos logit contributions
idd+0.660
rol+0.651
und+0.593
iw+0.589
arre+0.583
Neg logit contributions
The-0.403
 finish-0.390
In-0.385
idi-0.384
 grant-0.384
Token a
Token ID257
Feature activation+2.24e-01

Pos logit contributions
ers+1.216
idi+1.216
 ends+1.200
 grant+1.179
oster+1.154
Neg logit contributions
rol-2.008
idd-1.978
und-1.853
iw-1.836
 Good-1.822
Token really
Token ID1107
Feature activation+4.42e-01

Pos logit contributions
idd+0.434
rol+0.425
iw+0.395
und+0.392
arre+0.388
Neg logit contributions
The-0.188
 finish-0.179
 ends-0.179
idi-0.178
 before-0.178
Token big
Token ID1263
Feature activation+8.75e-01

Pos logit contributions
idd+0.836
rol+0.825
iw+0.754
und+0.753
arre+0.739
Neg logit contributions
The-0.401
 finish-0.384
In-0.379
 ends-0.378
 grant-0.376
Token squash
Token ID34613
Feature activation+3.11e-01

Pos logit contributions
rol+1.557
idd+1.550
und+1.389
iw+1.377
arre+1.344
Neg logit contributions
The-0.899
 finish-0.893
 ends-0.872
 grant-0.864
 before-0.864
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
idd+0.563
rol+0.555
iw+0.507
und+0.505
arre+0.493
Neg logit contributions
The-0.308
In-0.294
 finish-0.293
idi-0.293
 ends-0.289
Token Maybe
Token ID6674
Feature activation-2.44e-01

Pos logit contributions
The+0.191
 finish+0.189
idi+0.189
 ends+0.189
ers+0.187
Neg logit contributions
idd-0.246
rol-0.241
iw-0.216
und-0.214
arre-0.207
Token we
Token ID356
Feature activation+1.77e-01

Pos logit contributions
The+0.252
 finish+0.251
In+0.250
idi+0.250
ers+0.246
Neg logit contributions
idd-0.415
rol-0.395
iw-0.380
und-0.368
arre-0.352
Token can
Token ID460
Feature activation+1.23e-01

Pos logit contributions
idd+0.332
rol+0.328
und+0.300
iw+0.299
arre+0.295
Neg logit contributions
The-0.164
idi-0.158
 finish-0.154
In-0.153
 before-0.152
Token said
Token ID531
Feature activation-1.45e-01

Pos logit contributions
The+0.794
In+0.774
idi+0.750
 finish+0.748
 ends+0.740
Neg logit contributions
idd-1.482
rol-1.467
und-1.390
iw-1.306
rou-1.285
Token,
Token ID11
Feature activation+2.80e-01

Pos logit contributions
The+0.114
 finish+0.112
In+0.110
 ends+0.108
idi+0.107
Neg logit contributions
idd-0.287
rol-0.286
und-0.264
iw-0.257
arre-0.255
Token "
Token ID366
Feature activation+7.07e-01

Pos logit contributions
idd+0.522
rol+0.517
und+0.473
iw+0.468
arre+0.462
Neg logit contributions
The-0.255
 finish-0.248
 ends-0.245
In-0.243
idi-0.241
TokenTime
Token ID7575
Feature activation+2.73e-01

Pos logit contributions
idd+1.166
rol+1.154
und+1.036
iw+1.036
arre+1.016
Neg logit contributions
The-0.828
 finish-0.792
In-0.790
idi-0.782
 grant-0.781
Token to
Token ID284
Feature activation-2.00e-01

Pos logit contributions
idd+0.536
rol+0.534
und+0.490
iw+0.484
arre+0.479
Neg logit contributions
The-0.222
 finish-0.208
In-0.207
idi-0.206
 grant-0.205
Token go
Token ID467
Feature activation-3.61e-01

Pos logit contributions
The+0.164
In+0.159
idi+0.157
 ends+0.155
ers+0.153
Neg logit contributions
rol-0.382
idd-0.381
und-0.355
arre-0.345
iw-0.341
Token home
Token ID1363
Feature activation+2.37e-01

Pos logit contributions
The+0.411
In+0.400
 ends+0.396
idi+0.392
 grant+0.392
Neg logit contributions
idd-0.597
rol-0.593
und-0.541
iw-0.523
arre-0.509
Token,
Token ID11
Feature activation-3.58e-01

Pos logit contributions
idd+0.453
rol+0.449
und+0.413
iw+0.408
arre+0.403
Neg logit contributions
The-0.206
In-0.197
idi-0.195
 finish-0.195
 ends-0.195
Token Jimmy
Token ID12963
Feature activation-5.43e-01

Pos logit contributions
The+0.358
In+0.350
 grant+0.342
idi+0.341
 finish+0.340
Neg logit contributions
rol-0.624
idd-0.622
und-0.564
arre-0.551
iw-0.548
Token."
Token ID526
Feature activation-1.73e-01

Pos logit contributions
The+0.660
In+0.647
idi+0.634
 ends+0.629
 grant+0.628
Neg logit contributions
idd-0.860
rol-0.833
und-0.772
iw-0.738
arre-0.731
Token It
Token ID632
Feature activation+7.24e-01

Pos logit contributions
The+0.184
 ends+0.182
 finish+0.180
 grant+0.180
In+0.179
Neg logit contributions
idd-0.291
rol-0.289
und-0.265
arre-0.258
iw-0.257
Token secret
Token ID3200
Feature activation-4.00e-01

Pos logit contributions
rol+1.773
idd+1.756
und+1.614
iw+1.586
arre+1.576
Neg logit contributions
The-0.660
 finish-0.640
In-0.620
idi-0.619
 grant-0.618
Token,
Token ID11
Feature activation-4.59e-01

Pos logit contributions
The+0.489
In+0.460
 finish+0.457
idi+0.457
 ends+0.455
Neg logit contributions
idd-0.631
rol-0.613
iw-0.566
und-0.560
 batch-0.540
Token but
Token ID475
Feature activation+1.74e-01

Pos logit contributions
 finish+0.432
 ends+0.430
The+0.420
ers+0.413
 grant+0.407
Neg logit contributions
idd-0.827
rol-0.805
iw-0.761
und-0.742
 batch-0.706
Token she
Token ID673
Feature activation-1.38e+00

Pos logit contributions
idd+0.296
rol+0.293
iw+0.266
und+0.264
arre+0.254
Neg logit contributions
The-0.186
 finish-0.184
In-0.180
 grant-0.179
idi-0.179
Token didn
Token ID1422
Feature activation-1.31e-01

Pos logit contributions
The+1.409
 finish+1.402
 grant+1.386
 before+1.368
ers+1.342
Neg logit contributions
idd-2.396
rol-2.170
iw-2.111
rou-2.073
und-2.048
Token't
Token ID470
Feature activation-8.74e-01

Pos logit contributions
The+0.137
idi+0.137
 ends+0.135
 finish+0.135
ers+0.133
Neg logit contributions
idd-0.220
rol-0.219
und-0.197
arre-0.193
iw-0.193
Token tell
Token ID1560
Feature activation+1.37e-01

Pos logit contributions
idi+0.869
The+0.865
ers+0.845
 ends+0.823
In+0.816
Neg logit contributions
idd-1.486
rol-1.478
iw-1.378
arre-1.340
 sorts-1.335
Token anyone
Token ID2687
Feature activation-3.25e-01

Pos logit contributions
rol+0.241
idd+0.241
und+0.218
iw+0.217
arre+0.212
Neg logit contributions
The-0.136
In-0.132
 finish-0.130
idi-0.130
 grant-0.128
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
 finish+0.312
The+0.312
idi+0.305
In+0.304
 grant+0.301
Neg logit contributions
rol-0.578
idd-0.575
und-0.528
iw-0.526
rou-0.508
Token She
Token ID1375
Feature activation-1.45e-01

Pos logit contributions
idd+0.522
rol+0.518
und+0.468
iw+0.467
arre+0.457
Neg logit contributions
The-0.337
 finish-0.331
 grant-0.326
 ends-0.325
idi-0.324
Token liked
Token ID8288
Feature activation-7.66e-02

Pos logit contributions
The+0.144
 finish+0.139
 grant+0.138
 before+0.136
 ends+0.136
Neg logit contributions
idd-0.259
rol-0.251
und-0.232
iw-0.228
arre-0.227
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
 finish+0.450
 before+0.440
 ends+0.439
 grant+0.439
idi+0.421
Neg logit contributions
idd-0.936
rol-0.927
und-0.853
iw-0.834
arre-0.817
TokenThe
Token ID464
Feature activation-1.21e-01

Pos logit contributions
 finish+0.051
 ends+0.049
 before+0.049
 grant+0.049
idi+0.045
Neg logit contributions
idd-0.364
rol-0.363
und-0.338
iw-0.335
arre-0.334
Token big
Token ID1263
Feature activation+1.09e+00

Pos logit contributions
 grant+0.145
The+0.145
 finish+0.144
In+0.143
idi+0.141
Neg logit contributions
idd-0.189
rol-0.183
iw-0.168
und-0.163
arre-0.161
Token goose
Token ID37246
Feature activation-3.86e-01

Pos logit contributions
rol+1.957
idd+1.945
und+1.750
arre+1.726
iw+1.715
Neg logit contributions
The-1.130
 finish-1.092
 ends-1.070
 before-1.063
idi-1.063
Token flew
Token ID13112
Feature activation-5.11e-01

Pos logit contributions
The+0.322
 grant+0.315
In+0.312
 finish+0.312
 ends+0.311
Neg logit contributions
idd-0.735
rol-0.728
und-0.676
iw-0.664
arre-0.653
Token until
Token ID1566
Feature activation-1.06e-01

Pos logit contributions
 grant+0.515
The+0.501
In+0.494
idi+0.491
 finish+0.490
Neg logit contributions
rol-0.926
idd-0.886
und-0.833
iw-0.793
arre-0.788
Token it
Token ID340
Feature activation+4.83e-04

Pos logit contributions
The+0.094
 grant+0.092
 ends+0.091
 finish+0.091
In+0.091
Neg logit contributions
idd-0.200
rol-0.198
und-0.182
iw-0.180
arre-0.173
Token reached
Token ID4251
Feature activation-4.28e-01

Pos logit contributions
rol+0.001
idd+0.001
und+0.001
iw+0.001
arre+0.001
Neg logit contributions
The-0.000
 grant-0.000
 finish-0.000
In-0.000
 ends-0.000
Token its
Token ID663
Feature activation-2.12e-01

Pos logit contributions
The+0.256
 grant+0.250
In+0.247
 finish+0.238
idi+0.237
Neg logit contributions
rol-0.918
idd-0.911
und-0.858
iw-0.845
arre-0.810
Token home
Token ID1363
Feature activation+3.85e-02

Pos logit contributions
The+0.229
In+0.223
idi+0.220
 grant+0.219
ers+0.217
Neg logit contributions
rol-0.353
idd-0.352
iw-0.323
und-0.319
arre-0.308
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
rol+0.074
idd+0.074
und+0.067
iw+0.067
arre+0.065
Neg logit contributions
The-0.033
idi-0.032
In-0.031
 grant-0.031
 ends-0.031
Token -
Token ID532
Feature activation+9.33e-01

Pos logit contributions
The+0.437
idi+0.431
In+0.424
 grant+0.421
 ends+0.419
Neg logit contributions
idd-0.657
rol-0.643
und-0.590
arre-0.587
iw-0.581
Token she
Token ID673
Feature activation-5.26e-01

Pos logit contributions
idd+1.732
rol+1.718
und+1.565
iw+1.545
arre+1.538
Neg logit contributions
The-0.910
 finish-0.875
 before-0.866
In-0.851
 ends-0.849
Token had
Token ID550
Feature activation-4.11e-01

Pos logit contributions
The+0.414
In+0.387
 grant+0.384
 before+0.382
 ends+0.381
Neg logit contributions
idd-1.060
rol-1.019
und-0.951
arre-0.943
iw-0.932
Token managed
Token ID5257
Feature activation-8.43e-01

Pos logit contributions
The+0.410
 grant+0.406
idi+0.398
 ends+0.396
In+0.386
Neg logit contributions
idd-0.719
rol-0.704
und-0.645
arre-0.645
iw-0.634
Token the
Token ID262
Feature activation-3.31e-01

Pos logit contributions
The+0.708
 grant+0.707
idi+0.684
oster+0.676
 ends+0.670
Neg logit contributions
rol-1.558
idd-1.552
arre-1.472
und-1.461
iw-1.422
Token waste
Token ID7030
Feature activation+3.37e-02

Pos logit contributions
The+0.374
idi+0.365
 grant+0.357
In+0.353
ers+0.342
Neg logit contributions
idd-0.555
rol-0.535
iw-0.494
arre-0.488
und-0.487
Token all
Token ID477
Feature activation-3.95e-01

Pos logit contributions
idd+0.064
rol+0.063
und+0.058
iw+0.058
arre+0.056
Neg logit contributions
The-0.030
idi-0.029
In-0.029
 grant-0.029
 finish-0.028
Token by
Token ID416
Feature activation+5.86e-02

Pos logit contributions
The+0.372
 grant+0.371
idi+0.358
In+0.352
 finish+0.349
Neg logit contributions
idd-0.724
rol-0.721
und-0.670
iw-0.660
arre-0.654
Token herself
Token ID5223
Feature activation-2.22e-01

Pos logit contributions
idd+0.118
rol+0.116
und+0.109
iw+0.108
arre+0.106
Neg logit contributions
The-0.044
 finish-0.043
idi-0.042
In-0.042
 ends-0.042
Token.
Token ID13
Feature activation+5.89e-02

Pos logit contributions
The+0.253
idi+0.250
In+0.244
 grant+0.243
 ends+0.239
Neg logit contributions
idd-0.357
rol-0.356
und-0.327
arre-0.319
iw-0.317
Token Plus
Token ID8227
Feature activation+6.81e-01

Pos logit contributions
idd+0.096
rol+0.096
und+0.086
iw+0.086
arre+0.085
Neg logit contributions
The-0.067
 finish-0.065
 grant-0.064
 ends-0.064
In-0.064
Token They
Token ID1119
Feature activation+5.36e-01

Pos logit contributions
ers+1.264
isy+1.261
 grant+1.254
 finish+1.218
 ends+1.197
Neg logit contributions
idd-1.798
rol-1.722
und-1.644
 Good-1.620
iw-1.614
Token said
Token ID531
Feature activation-4.83e-01

Pos logit contributions
idd+1.010
rol+1.002
und+0.913
iw+0.910
arre+0.896
Neg logit contributions
The-0.488
 finish-0.470
In-0.461
 ends-0.460
 grant-0.459
Token,
Token ID11
Feature activation-3.38e-01

Pos logit contributions
The+0.498
In+0.494
idi+0.492
 finish+0.481
 grant+0.473
Neg logit contributions
idd-0.831
rol-0.802
iw-0.738
arre-0.737
und-0.735
Token "
Token ID366
Feature activation-1.03e-01

Pos logit contributions
 ends+0.332
idi+0.330
 finish+0.330
The+0.327
ers+0.325
Neg logit contributions
idd-0.594
rol-0.571
iw-0.530
arre-0.523
und-0.523
TokenThank
Token ID10449
Feature activation-1.46e-01

Pos logit contributions
 finish+0.135
idi+0.134
 before+0.133
 ends+0.133
 grant+0.133
Neg logit contributions
idd-0.150
rol-0.144
iw-0.132
und-0.130
arre-0.127
Token you
Token ID345
Feature activation-1.48e-01

Pos logit contributions
idi+0.201
 finish+0.198
The+0.197
In+0.196
 before+0.194
Neg logit contributions
idd-0.204
rol-0.195
iw-0.176
und-0.175
arre-0.167
Token,
Token ID11
Feature activation+7.74e-01

Pos logit contributions
The+0.122
idi+0.121
In+0.120
 grant+0.118
 ends+0.116
Neg logit contributions
idd-0.287
rol-0.282
iw-0.259
und-0.258
arre-0.257
Token mom
Token ID1995
Feature activation-2.00e-01

Pos logit contributions
idd+1.695
rol+1.692
und+1.564
iw+1.545
arre+1.530
Neg logit contributions
The-0.489
 finish-0.473
 ends-0.452
 before-0.450
 grant-0.448
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
idi+0.172
The+0.172
 grant+0.171
In+0.169
 finish+0.168
Neg logit contributions
idd-0.384
rol-0.371
iw-0.346
und-0.343
arre-0.338
Token We
Token ID775
Feature activation+6.95e-01

Pos logit contributions
idd+0.182
rol+0.178
iw+0.157
und+0.157
arre+0.152
Neg logit contributions
The-0.174
idi-0.170
 finish-0.170
In-0.168
 ends-0.168
Token love
Token ID1842
Feature activation+1.07e-01

Pos logit contributions
idd+1.078
rol+1.071
und+0.952
iw+0.950
arre+0.923
Neg logit contributions
The-0.872
 finish-0.854
In-0.840
 ends-0.837
 grant-0.835
Token and
Token ID290
Feature activation+2.28e-01

Pos logit contributions
idd+0.354
rol+0.351
und+0.320
iw+0.319
arre+0.312
Neg logit contributions
The-0.172
 finish-0.167
 ends-0.166
In-0.166
 grant-0.165
Token his
Token ID465
Feature activation-6.81e-01

Pos logit contributions
idd+0.455
rol+0.452
und+0.415
iw+0.412
arre+0.404
Neg logit contributions
The-0.178
 finish-0.171
In-0.170
 grant-0.168
 ends-0.167
Token boat
Token ID8848
Feature activation-3.72e-01

Pos logit contributions
The+0.754
In+0.737
 grant+0.713
idi+0.702
 before+0.689
Neg logit contributions
idd-1.174
rol-1.095
und-1.005
iw-1.005
rou-0.987
Token arrived
Token ID5284
Feature activation+3.48e-01

Pos logit contributions
The+0.350
idi+0.340
In+0.339
 grant+0.335
 finish+0.332
Neg logit contributions
rol-0.680
idd-0.679
und-0.617
arre-0.590
iw-0.588
Token on
Token ID319
Feature activation-4.02e-01

Pos logit contributions
idd+0.622
rol+0.617
und+0.560
iw+0.555
arre+0.546
Neg logit contributions
The-0.350
 finish-0.338
 grant-0.336
In-0.335
idi-0.334
Token the
Token ID262
Feature activation-1.53e+00

Pos logit contributions
The+0.274
In+0.272
 grant+0.270
 ends+0.266
 finish+0.261
Neg logit contributions
idd-0.821
rol-0.814
und-0.769
iw-0.751
arre-0.724
Token shore
Token ID15191
Feature activation+1.93e-02

Pos logit contributions
In+1.700
ace+1.666
idi+1.653
 grant+1.624
ers+1.612
Neg logit contributions
idd-2.380
rol-2.332
und-2.069
iw-2.064
 Mechan-2.051
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
idd+0.038
rol+0.037
und+0.034
iw+0.034
arre+0.033
Neg logit contributions
The-0.016
idi-0.015
In-0.015
 grant-0.015
 ends-0.015
Token The
Token ID383
Feature activation-1.10e+00

Pos logit contributions
The+0.232
 ends+0.228
 finish+0.227
 grant+0.227
ers+0.220
Neg logit contributions
idd-0.547
rol-0.544
und-0.500
iw-0.489
arre-0.486
Token boat
Token ID8848
Feature activation-4.85e-01

Pos logit contributions
In+1.017
The+1.014
 grant+0.982
idi+0.969
ers+0.947
Neg logit contributions
idd-2.057
rol-1.919
iw-1.759
arre-1.748
rou-1.729
Token and
Token ID290
Feature activation-2.61e-01

Pos logit contributions
The+0.429
 grant+0.419
In+0.417
idi+0.412
 finish+0.411
Neg logit contributions
idd-0.907
rol-0.900
und-0.820
iw-0.792
arre-0.791
Token had
Token ID550
Feature activation-5.14e-01

Pos logit contributions
The+0.298
 finish+0.292
idi+0.282
 ends+0.281
 before+0.276
Neg logit contributions
idd-1.070
rol-1.034
und-0.974
iw-0.967
arre-0.946
Token rescued
Token ID19868
Feature activation+2.36e-01

Pos logit contributions
 grant+0.508
The+0.502
 ends+0.494
idi+0.493
 finish+0.485
Neg logit contributions
idd-0.897
rol-0.875
und-0.812
arre-0.796
iw-0.786
Token the
Token ID262
Feature activation-4.60e-01

Pos logit contributions
idd+0.550
rol+0.549
und+0.511
iw+0.510
arre+0.498
Neg logit contributions
The-0.104
In-0.097
 finish-0.096
 grant-0.095
idi-0.094
Token bird
Token ID6512
Feature activation+3.09e-01

Pos logit contributions
The+0.435
idi+0.435
In+0.426
 finish+0.423
 grant+0.419
Neg logit contributions
idd-0.842
rol-0.803
iw-0.762
und-0.737
rou-0.726
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
idd+0.600
rol+0.600
und+0.548
iw+0.546
arre+0.537
Neg logit contributions
The-0.256
 finish-0.246
In-0.245
idi-0.244
 grant-0.242
Token From
Token ID3574
Feature activation-1.72e+00

Pos logit contributions
idd+0.370
rol+0.367
und+0.337
iw+0.336
arre+0.328
Neg logit contributions
The-0.154
 finish-0.152
 ends-0.149
 grant-0.149
idi-0.148
Token then
Token ID788
Feature activation+6.38e-01

Pos logit contributions
 grant+1.680
 ends+1.646
ace+1.597
 finish+1.565
 improve+1.523
Neg logit contributions
idd-3.169
rol-3.002
und-2.972
iw-2.921
rou-2.855
Token on
Token ID319
Feature activation-9.56e-01

Pos logit contributions
idd+1.325
rol+1.316
und+1.206
iw+1.206
arre+1.188
Neg logit contributions
The-0.462
 finish-0.439
In-0.432
 ends-0.426
 grant-0.425
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
 grant+0.918
 finish+0.909
The+0.891
 ends+0.886
In+0.873
Neg logit contributions
idd-1.699
rol-1.662
und-1.630
iw-1.551
arre-1.500
Token he
Token ID339
Feature activation-1.02e-01

Pos logit contributions
The+0.087
 grant+0.086
idi+0.086
 ends+0.085
 finish+0.085
Neg logit contributions
idd-0.217
rol-0.216
iw-0.201
und-0.199
arre-0.194
Token always
Token ID1464
Feature activation+1.18e+00

Pos logit contributions
The+0.082
 finish+0.079
 grant+0.078
 ends+0.078
idi+0.077
Neg logit contributions
idd-0.200
rol-0.196
und-0.183
iw-0.181
arre-0.179
Token little
Token ID1310
Feature activation+3.56e-01

Pos logit contributions
ers+0.155
 grant+0.152
 ends+0.151
The+0.150
 before+0.146
Neg logit contributions
idd-0.676
rol-0.637
iw-0.621
und-0.601
arre-0.599
Token girl
Token ID2576
Feature activation+4.24e-02

Pos logit contributions
idd+0.678
rol+0.665
iw+0.613
und+0.609
arre+0.597
Neg logit contributions
The-0.319
 finish-0.308
In-0.305
 grant-0.304
idi-0.302
Token named
Token ID3706
Feature activation+2.62e-01

Pos logit contributions
idd+0.062
rol+0.059
iw+0.054
und+0.054
arre+0.051
Neg logit contributions
The-0.056
 finish-0.056
 ends-0.056
 grant-0.055
idi-0.055
Token Grace
Token ID16156
Feature activation-4.04e-01

Pos logit contributions
idd+0.493
rol+0.489
iw+0.445
und+0.445
arre+0.434
Neg logit contributions
The-0.240
 finish-0.232
 ends-0.229
idi-0.229
 grant-0.228
Token.
Token ID13
Feature activation+3.28e-01

Pos logit contributions
 finish+0.374
idi+0.370
The+0.370
In+0.363
 ends+0.361
Neg logit contributions
idd-0.746
rol-0.726
und-0.671
iw-0.670
rou-0.650
Token She
Token ID1375
Feature activation-1.68e+00

Pos logit contributions
idd+0.530
rol+0.530
und+0.476
iw+0.470
arre+0.462
Neg logit contributions
The-0.380
 finish-0.373
idi-0.366
 ends-0.365
 grant-0.364
Token was
Token ID373
Feature activation-2.41e-01

Pos logit contributions
 before+2.120
 finish+2.116
 ends+2.058
ers+2.030
 grant+2.029
Neg logit contributions
idd-2.529
rol-2.251
und-2.229
iw-2.097
rou-2.053
Token three
Token ID1115
Feature activation+5.51e-01

Pos logit contributions
The+0.226
 ends+0.222
idi+0.219
 grant+0.218
 finish+0.218
Neg logit contributions
idd-0.444
rol-0.438
iw-0.396
und-0.389
arre-0.383
Token years
Token ID812
Feature activation-7.62e-01

Pos logit contributions
idd+1.048
rol+1.040
und+0.949
iw+0.946
arre+0.930
Neg logit contributions
The-0.493
 finish-0.475
In-0.466
 ends-0.464
 grant-0.463
Token old
Token ID1468
Feature activation-1.08e+00

Pos logit contributions
The+0.373
 grant+0.370
In+0.360
idi+0.331
 finish+0.317
Neg logit contributions
idd-1.777
rol-1.702
iw-1.637
 batch-1.581
rou-1.577
Token and
Token ID290
Feature activation+2.14e-01

Pos logit contributions
 grant+0.975
idi+0.962
In+0.956
 finish+0.942
The+0.918
Neg logit contributions
idd-2.022
rol-1.899
iw-1.887
und-1.824
 batch-1.772
Token!"
Token ID2474
Feature activation-7.76e-01

Pos logit contributions
idd+1.505
rol+1.497
iw+1.323
und+1.319
arre+1.306
Neg logit contributions
The-1.208
 finish-1.192
 ends-1.179
 before-1.168
 grant-1.147
Token Ben
Token ID3932
Feature activation+2.12e-01

Pos logit contributions
ers+0.913
ace+0.891
idi+0.878
In+0.871
 ends+0.869
Neg logit contributions
idd-1.231
rol-1.144
iw-1.117
und-1.099
arre-1.094
Token says
Token ID1139
Feature activation-4.25e-01

Pos logit contributions
idd+0.275
rol+0.270
und+0.237
iw+0.235
arre+0.234
Neg logit contributions
The-0.316
In-0.309
idi-0.307
 grant-0.306
 finish-0.306
Token.
Token ID13
Feature activation-4.07e-01

Pos logit contributions
The+0.406
In+0.404
idi+0.398
 grant+0.380
 ends+0.374
Neg logit contributions
idd-0.774
rol-0.754
iw-0.686
und-0.682
arre-0.680
Token
Token ID198
Feature activation-8.26e-01

Pos logit contributions
 grant+0.357
 finish+0.357
 ends+0.355
idi+0.345
 before+0.340
Neg logit contributions
idd-0.761
rol-0.736
und-0.691
iw-0.690
arre-0.673
Token
Token ID198
Feature activation-1.47e+00

Pos logit contributions
 grant+0.817
idi+0.816
 before+0.791
 finish+0.790
ace+0.773
Neg logit contributions
idd-1.457
rol-1.397
und-1.341
iw-1.300
arre-1.288
TokenThey
Token ID2990
Feature activation-9.83e-01

Pos logit contributions
 grant+1.941
idi+1.832
 before+1.823
ace+1.768
 contain+1.757
Neg logit contributions
idd-2.291
arre-2.048
rol-2.029
und-2.007
iw-1.976
Token are
Token ID389
Feature activation-9.83e-01

Pos logit contributions
idi+1.020
ers+0.992
The+0.973
 before+0.947
In+0.944
Neg logit contributions
idd-1.655
rol-1.650
und-1.534
arre-1.517
iw-1.500
Token not
Token ID407
Feature activation+1.55e-01

Pos logit contributions
 grant+1.336
idi+1.304
The+1.295
In+1.263
isy+1.243
Neg logit contributions
idd-1.427
rol-1.371
arre-1.277
rou-1.207
iw-1.197
Token happy
Token ID3772
Feature activation-9.95e-02

Pos logit contributions
idd+0.278
rol+0.271
iw+0.248
und+0.246
arre+0.245
Neg logit contributions
The-0.156
 grant-0.152
idi-0.151
In-0.150
 finish-0.149
Token.
Token ID13
Feature activation-9.35e-01

Pos logit contributions
The+0.090
In+0.086
idi+0.086
 grant+0.086
 finish+0.084
Neg logit contributions
idd-0.188
rol-0.184
und-0.171
iw-0.170
arre-0.169
Token sweet
Token ID6029
Feature activation-2.39e-01

Pos logit contributions
idd+0.395
rol+0.388
iw+0.358
und+0.355
arre+0.351
Neg logit contributions
The-0.193
 grant-0.185
 finish-0.185
 ends-0.184
 before-0.184
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
The+0.214
In+0.202
 grant+0.198
 finish+0.195
idi+0.194
Neg logit contributions
idd-0.455
rol-0.443
und-0.410
iw-0.409
arre-0.408
Token satisfied
Token ID11378
Feature activation-3.01e-01

Pos logit contributions
The+0.180
 grant+0.177
 finish+0.174
 ends+0.173
In+0.173
Neg logit contributions
idd-0.365
rol-0.354
iw-0.331
und-0.324
arre-0.320
Token smile
Token ID8212
Feature activation-3.24e-01

Pos logit contributions
The+0.290
 ends+0.281
 grant+0.280
In+0.279
 finish+0.279
Neg logit contributions
idd-0.536
rol-0.528
und-0.491
iw-0.485
arre-0.473
Token on
Token ID319
Feature activation-2.88e-01

Pos logit contributions
The+0.261
In+0.255
idi+0.254
 finish+0.251
 grant+0.248
Neg logit contributions
idd-0.639
rol-0.630
und-0.586
arre-0.570
iw-0.566
Token his
Token ID465
Feature activation-1.62e+00

Pos logit contributions
 ends+0.300
 grant+0.298
In+0.296
The+0.296
 finish+0.293
Neg logit contributions
idd-0.480
rol-0.473
und-0.446
arre-0.428
iw-0.426
Token face
Token ID1986
Feature activation-6.39e-01

Pos logit contributions
In+1.303
The+1.276
ers+1.225
idi+1.215
 grant+1.173
Neg logit contributions
idd-3.040
arre-2.873
OO-2.827
rol-2.817
rou-2.810
Token as
Token ID355
Feature activation-7.74e-01

Pos logit contributions
The+0.517
idi+0.516
In+0.513
 ends+0.501
 grant+0.499
Neg logit contributions
idd-1.224
rol-1.220
und-1.131
arre-1.102
iw-1.101
Token he
Token ID339
Feature activation+1.55e-01

Pos logit contributions
The+0.750
In+0.742
 finish+0.739
 ends+0.731
idi+0.729
Neg logit contributions
idd-1.369
rol-1.336
iw-1.226
rou-1.220
und-1.211
Token played
Token ID2826
Feature activation-3.36e-01

Pos logit contributions
idd+0.295
rol+0.289
und+0.267
iw+0.264
arre+0.258
Neg logit contributions
The-0.138
 finish-0.132
 grant-0.130
idi-0.129
In-0.129
Token in
Token ID287
Feature activation+5.32e-01

Pos logit contributions
The+0.280
In+0.274
idi+0.273
 grant+0.272
 ends+0.270
Neg logit contributions
idd-0.644
rol-0.638
und-0.595
iw-0.573
arre-0.573