TOP ACTIVATIONS
MAX = 3.048

candy and ride in his toy je ep . One day
ally , stay away from that saw , it 's very
can we go in the toy store ?" Her mom my
I will never go to that place again !" the boy
But Tim my was having so much fun playing that he
kind to C aryn from that day on .
grandma . She lives in that house ," Anna says and
Jones . He lives in that building over there . He
!" To her su prise , the thunder went
them . They cried and bl ed and hurt .
: But I was having so much fun !
, scary cat and got very scared . " Oh no
got off the couch and tr ud ged out of the
Her mom sighed and kn elt down next to her
little 3 year old boy cl apping his hands .
head . Ben cried and bl ed . Sam screamed and
She said , " Hi p anda ! What smells so
said , " You had so much fun outside . Let
She loved going to the toy store with her mom .
bought two hot dogs with k etchup . Max and Sam

MIN ACTIVATIONS
MIN = -2.949

Okay , Anna . That sounds like fun . But remember
him . Tom falls on the ground . He gets dirty
He threw the pieces on the ground and started to cry
, it hopped up on the stand . The mouse thought
noise as well . It sounds like a sn ore .
ily said , " That sounds great , grandma . Thank
, " OK , that sounds fun . What things do
She said , " That sounds fun . I will draw
the leaves flying all over the place . She decided to
- tap - tap on the leaf . It was a
friends found some sticks on the ground .
her hand and fell on the floor . It broke into
the rope and fell on the stick . He y el
Okay , Tom . That sounds fun . Let 's go
mom said , " That sounds very exciting . But why
the face . Little boy smiled shy ly and
. It was rolling down the hill . It was going
Sam says . " That sounds fun ." They
, " OK . That sounds good . That sounds comfortable
saw a tie lying on the ground . She picked it

INTERVAL 1.549 - 3.048
CONTAINS 0.317%

sad and screamed , " I want my apple !"
day . At the mall , Tom saw a
, she went to the park to play with Spark les
“ I am Ty the Water Cat . I live
was happy to have a new home . It loved the

INTERVAL 0.050 - 1.549
CONTAINS 45.300%

three year old girl who loved to swim . Her parents
chair , and they read the book together . They laughed
wings . Then it looked down at them and said ,
long as he stayed in the kitchen . Tom was so
snow and decided to jump on it . When he landed

INTERVAL -1.449 - 0.050
CONTAINS 53.866%

She said , " Tom , you are thirsty . Here
money , but from being grateful for what you have .
." They hang up the phone and smile .
see him again ! It said it was sorry for not
from ear to ear . <|endoftext|> Once upon a time ,

INTERVAL -2.949 - -1.449
CONTAINS 0.516%

She was called Daisy . Daisy was very brave .
brightly that it lit up the sky . When the light
that her mom had encouraged her . She knew that she
saw a yellow cob on the ground . He picked it
could fly up high in the sky . Everyone was amazed
Token candy
Token ID18550
Feature activation+8.97e-01

Pos logit contributions
idd+1.210
rol+1.202
und+1.081
iw+1.072
arre+1.059
Neg logit contributions
The-0.767
 finish-0.746
 ends-0.735
 before-0.731
In-0.729
Token and
Token ID290
Feature activation-8.76e-02

Pos logit contributions
idd+1.893
rol+1.875
und+1.717
iw+1.706
arre+1.693
Neg logit contributions
The-0.639
 finish-0.617
 before-0.611
 ends-0.594
In-0.589
Token ride
Token ID6594
Feature activation-2.05e-01

Pos logit contributions
The+0.092
In+0.087
 grant+0.087
 ends+0.087
 finish+0.087
Neg logit contributions
idd-0.149
rol-0.148
iw-0.137
und-0.135
arre-0.131
Token in
Token ID287
Feature activation+4.63e-01

Pos logit contributions
 grant+0.203
The+0.201
 finish+0.199
In+0.196
 ends+0.193
Neg logit contributions
idd-0.365
rol-0.362
und-0.333
iw-0.330
arre-0.319
Token his
Token ID465
Feature activation+9.35e-01

Pos logit contributions
idd+1.152
rol+1.144
und+1.069
iw+1.066
arre+1.051
Neg logit contributions
The-0.144
 finish-0.130
 grant-0.124
In-0.124
idi-0.122
Token toy
Token ID13373
Feature activation+3.05e+00

Pos logit contributions
rol+1.740
idd+1.716
und+1.552
iw+1.539
arre+1.521
Neg logit contributions
The-0.915
 finish-0.892
In-0.866
 ends-0.864
idi-0.864
Token je
Token ID11223
Feature activation+1.45e+00

Pos logit contributions
rol+5.749
idd+5.382
rou+5.217
arre+5.074
und+4.850
Neg logit contributions
 finish-3.395
The-3.305
 ends-3.260
 the-3.221
ers-3.212
Tokenep
Token ID538
Feature activation-3.05e-02

Pos logit contributions
rol+2.194
idd+2.160
iw+1.911
und+1.910
arre+1.885
Neg logit contributions
The-1.936
In-1.852
ers-1.835
 finish-1.810
 ends-1.779
Token.
Token ID13
Feature activation+6.58e-01

Pos logit contributions
The+0.025
 grant+0.025
idi+0.025
 finish+0.024
In+0.024
Neg logit contributions
rol-0.059
idd-0.059
und-0.054
iw-0.053
arre-0.053
Token One
Token ID1881
Feature activation-3.63e-01

Pos logit contributions
idd+1.143
rol+1.132
und+1.022
iw+1.017
arre+1.000
Neg logit contributions
The-0.709
 finish-0.686
In-0.676
idi-0.671
 ends-0.670
Token day
Token ID1110
Feature activation+5.28e-01

Pos logit contributions
 grant+0.344
ers+0.326
idi+0.324
The+0.321
 finish+0.320
Neg logit contributions
idd-0.661
rol-0.641
iw-0.601
und-0.589
arre-0.579
Tokenally
Token ID453
Feature activation-5.67e-01

Pos logit contributions
idd+1.956
rol+1.950
iw+1.741
und+1.740
arre+1.694
Neg logit contributions
The-1.204
 finish-1.163
In-1.162
 ends-1.137
 before-1.127
Token,
Token ID11
Feature activation-2.33e-01

Pos logit contributions
The+0.534
In+0.516
idi+0.514
 finish+0.510
 grant+0.509
Neg logit contributions
idd-1.067
rol-1.014
und-0.975
iw-0.945
arre-0.928
Token stay
Token ID2652
Feature activation+3.79e-01

Pos logit contributions
The+0.232
idi+0.231
ers+0.225
In+0.225
 finish+0.224
Neg logit contributions
idd-0.405
rol-0.401
und-0.367
iw-0.364
arre-0.360
Token away
Token ID1497
Feature activation-1.15e-01

Pos logit contributions
idd+0.658
rol+0.650
und+0.590
iw+0.586
arre+0.576
Neg logit contributions
The-0.404
 finish-0.389
 ends-0.387
 grant-0.387
In-0.386
Token from
Token ID422
Feature activation+1.06e+00

Pos logit contributions
The+0.093
idi+0.091
In+0.090
 grant+0.089
 ends+0.089
Neg logit contributions
idd-0.224
rol-0.223
und-0.207
iw-0.203
arre-0.201
Token that
Token ID326
Feature activation+3.03e+00

Pos logit contributions
rol+2.449
idd+2.441
und+2.252
iw+2.241
arre+2.240
Neg logit contributions
The-0.534
 finish-0.521
 before-0.514
 ends-0.481
 grant-0.471
Token saw
Token ID2497
Feature activation+1.52e+00

Pos logit contributions
rol+7.007
idd+6.745
rou+6.232
iw+5.959
und+5.906
Neg logit contributions
 finish-2.530
The-2.490
 before-2.489
 the-2.457
 entire-2.381
Token,
Token ID11
Feature activation+1.90e-01

Pos logit contributions
idd+3.050
rol+3.026
iw+2.791
arre+2.743
und+2.728
Neg logit contributions
The-1.290
 before-1.264
 finish-1.214
 ends-1.204
ers-1.170
Token it
Token ID340
Feature activation+4.85e-01

Pos logit contributions
idd+0.351
rol+0.348
und+0.318
iw+0.316
arre+0.312
Neg logit contributions
The-0.176
idi-0.170
 finish-0.170
In-0.169
 ends-0.168
Token's
Token ID338
Feature activation+1.88e-01

Pos logit contributions
idd+0.856
rol+0.849
und+0.770
iw+0.765
arre+0.754
Neg logit contributions
The-0.498
 finish-0.481
In-0.474
idi-0.472
 grant-0.471
Token very
Token ID845
Feature activation+1.46e+00

Pos logit contributions
idd+0.338
rol+0.337
und+0.306
iw+0.303
arre+0.301
Neg logit contributions
The-0.187
 grant-0.177
 finish-0.177
In-0.176
idi-0.175
Token can
Token ID460
Feature activation-4.12e-01

Pos logit contributions
idi+0.468
The+0.456
ers+0.455
In+0.438
 finish+0.430
Neg logit contributions
idd-0.753
rol-0.742
iw-0.680
und-0.670
arre-0.653
Token we
Token ID356
Feature activation+5.87e-02

Pos logit contributions
idi+0.494
ers+0.484
The+0.477
In+0.475
 finish+0.472
Neg logit contributions
idd-0.637
rol-0.618
iw-0.563
und-0.556
 Mechan-0.532
Token go
Token ID467
Feature activation+4.54e-02

Pos logit contributions
idd+0.115
rol+0.115
und+0.105
iw+0.105
arre+0.103
Neg logit contributions
The-0.047
idi-0.046
 ends-0.045
ers-0.045
 grant-0.044
Token in
Token ID287
Feature activation+5.04e-01

Pos logit contributions
idd+0.085
rol+0.084
iw+0.076
und+0.076
arre+0.074
Neg logit contributions
The-0.042
 grant-0.040
 ends-0.040
idi-0.040
In-0.040
Token the
Token ID262
Feature activation+2.05e+00

Pos logit contributions
idd+1.153
rol+1.145
und+1.063
iw+1.059
arre+1.045
Neg logit contributions
The-0.256
 finish-0.239
In-0.231
 grant-0.230
 ends-0.230
Token toy
Token ID13373
Feature activation+2.96e+00

Pos logit contributions
rol+3.566
idd+3.532
iw+3.153
und+3.151
arre+3.085
Neg logit contributions
 finish-2.491
The-2.471
In-2.330
 ends-2.325
 sun-2.246
Token store
Token ID3650
Feature activation+4.27e-01

Pos logit contributions
rol+5.797
idd+5.614
iw+5.181
arre+5.117
rou+5.079
Neg logit contributions
 finish-3.309
The-3.080
 before-3.054
 the-2.935
 ends-2.915
Token?"
Token ID1701
Feature activation-8.99e-03

Pos logit contributions
idd+0.739
rol+0.731
und+0.662
iw+0.659
arre+0.647
Neg logit contributions
The-0.452
 finish-0.437
In-0.433
 grant-0.433
idi-0.433
Token Her
Token ID2332
Feature activation+3.45e-01

Pos logit contributions
The+0.009
 ends+0.009
 grant+0.009
 finish+0.009
ers+0.009
Neg logit contributions
idd-0.016
rol-0.015
und-0.014
iw-0.014
arre-0.014
Token mom
Token ID1995
Feature activation+2.92e-01

Pos logit contributions
idd+0.764
rol+0.752
iw+0.697
und+0.697
arre+0.688
Neg logit contributions
The-0.205
 finish-0.192
In-0.188
 grant-0.188
idi-0.187
Tokenmy
Token ID1820
Feature activation+2.89e-01

Pos logit contributions
idd+0.367
rol+0.360
und+0.312
iw+0.310
arre+0.304
Neg logit contributions
The-0.451
 finish-0.442
 grant-0.440
In-0.437
 ends-0.436
TokenI
Token ID40
Feature activation+9.16e-01

Pos logit contributions
idd+0.712
rol+0.706
und+0.632
iw+0.631
arre+0.619
Neg logit contributions
The-0.502
 finish-0.493
 ends-0.484
 grant-0.484
idi-0.481
Token will
Token ID481
Feature activation+6.28e-02

Pos logit contributions
idd+1.552
rol+1.548
und+1.392
iw+1.382
arre+1.346
Neg logit contributions
The-1.020
 finish-1.017
 grant-0.984
In-0.983
 before-0.980
Token never
Token ID1239
Feature activation-7.31e-01

Pos logit contributions
idd+0.128
rol+0.127
und+0.117
iw+0.116
arre+0.114
Neg logit contributions
The-0.046
idi-0.044
 ends-0.044
In-0.043
ers-0.042
Token go
Token ID467
Feature activation-2.13e-01

Pos logit contributions
The+0.590
ers+0.571
idi+0.567
In+0.559
 ends+0.536
Neg logit contributions
idd-1.407
rol-1.392
arre-1.321
iw-1.290
und-1.283
Token to
Token ID284
Feature activation+2.68e-01

Pos logit contributions
The+0.221
 ends+0.216
In+0.215
idi+0.213
 grant+0.212
Neg logit contributions
idd-0.372
rol-0.367
iw-0.330
und-0.330
arre-0.323
Token that
Token ID326
Feature activation+2.92e+00

Pos logit contributions
idd+0.587
rol+0.582
und+0.539
iw+0.539
arre+0.528
Neg logit contributions
The-0.156
In-0.148
idi-0.146
 ends-0.145
 finish-0.144
Token place
Token ID1295
Feature activation-1.44e-01

Pos logit contributions
rol+6.134
idd+5.887
und+5.383
iw+5.354
arre+5.218
Neg logit contributions
 finish-2.928
 before-2.849
The-2.840
 the-2.703
 entire-2.640
Token again
Token ID757
Feature activation-2.47e-02

Pos logit contributions
The+0.150
In+0.146
idi+0.144
 finish+0.142
 grant+0.142
Neg logit contributions
rol-0.249
idd-0.248
und-0.224
iw-0.223
arre-0.215
Token!"
Token ID2474
Feature activation-3.73e-01

Pos logit contributions
The+0.027
In+0.026
 grant+0.026
 ends+0.026
 finish+0.026
Neg logit contributions
idd-0.041
rol-0.040
und-0.037
iw-0.037
arre-0.036
Token the
Token ID262
Feature activation+1.92e-01

Pos logit contributions
In+0.256
 ends+0.255
ers+0.255
idi+0.254
The+0.252
Neg logit contributions
idd-0.754
rol-0.747
und-0.701
iw-0.696
arre-0.672
Token boy
Token ID2933
Feature activation-1.14e-01

Pos logit contributions
idd+0.373
rol+0.363
iw+0.337
und+0.333
arre+0.330
Neg logit contributions
The-0.167
In-0.161
 grant-0.160
 finish-0.158
idi-0.157
Token But
Token ID887
Feature activation+1.75e-01

Pos logit contributions
idd+1.009
rol+1.000
und+0.902
iw+0.899
arre+0.883
Neg logit contributions
The-0.630
 finish-0.609
In-0.601
 ends-0.597
idi-0.596
Token Tim
Token ID5045
Feature activation-8.03e-02

Pos logit contributions
idd+0.281
rol+0.278
und+0.252
iw+0.250
arre+0.242
Neg logit contributions
The-0.204
 grant-0.199
 finish-0.199
 ends-0.197
idi-0.197
Tokenmy
Token ID1820
Feature activation-1.76e-01

Pos logit contributions
The+0.129
 grant+0.128
 finish+0.127
In+0.125
idi+0.125
Neg logit contributions
idd-0.095
rol-0.092
und-0.082
arre-0.080
iw-0.080
Token was
Token ID373
Feature activation+1.39e-01

Pos logit contributions
The+0.180
 grant+0.178
 finish+0.174
In+0.172
 ends+0.172
Neg logit contributions
idd-0.308
rol-0.295
und-0.277
arre-0.275
iw-0.271
Token having
Token ID1719
Feature activation+8.09e-01

Pos logit contributions
idd+0.222
rol+0.218
iw+0.195
arre+0.194
und+0.194
Neg logit contributions
The-0.166
 grant-0.162
 ends-0.162
 finish-0.161
In-0.160
Token so
Token ID523
Feature activation+2.92e+00

Pos logit contributions
idd+1.520
rol+1.504
und+1.369
iw+1.365
arre+1.333
Neg logit contributions
The-0.765
 finish-0.739
In-0.720
 before-0.719
 ends-0.717
Token much
Token ID881
Feature activation+4.76e-01

Pos logit contributions
rol+5.706
idd+5.490
und+4.758
iw+4.716
 Mechan+4.656
Neg logit contributions
The-3.460
In-3.375
 time-3.364
 had-3.022
 the-2.985
Token fun
Token ID1257
Feature activation+8.38e-01

Pos logit contributions
idd+0.788
rol+0.779
und+0.702
iw+0.700
arre+0.687
Neg logit contributions
The-0.545
 finish-0.528
 grant-0.521
In-0.521
 ends-0.520
Token playing
Token ID2712
Feature activation+7.25e-01

Pos logit contributions
idd+1.504
rol+1.502
iw+1.350
und+1.348
arre+1.329
Neg logit contributions
The-0.837
 before-0.815
 finish-0.815
In-0.795
idi-0.791
Token that
Token ID326
Feature activation+2.71e-01

Pos logit contributions
idd+1.517
rol+1.510
iw+1.382
und+1.380
arre+1.357
Neg logit contributions
The-0.524
 finish-0.501
 before-0.487
In-0.483
 grant-0.480
Token he
Token ID339
Feature activation-5.47e-03

Pos logit contributions
idd+0.511
rol+0.508
und+0.463
iw+0.463
arre+0.452
Neg logit contributions
The-0.243
In-0.231
 finish-0.231
idi-0.230
 grant-0.230
Token kind
Token ID1611
Feature activation-9.21e-02

Pos logit contributions
 grant+0.317
The+0.313
 ends+0.311
In+0.309
 before+0.301
Neg logit contributions
idd-0.547
rol-0.531
und-0.481
arre-0.478
iw-0.478
Token to
Token ID284
Feature activation-6.37e-02

Pos logit contributions
The+0.069
 grant+0.067
In+0.066
idi+0.065
 finish+0.065
Neg logit contributions
idd-0.189
rol-0.187
und-0.173
iw-0.171
arre-0.169
Token C
Token ID327
Feature activation+9.69e-01

Pos logit contributions
The+0.058
In+0.056
 finish+0.056
 ends+0.056
 grant+0.055
Neg logit contributions
idd-0.119
rol-0.117
und-0.109
iw-0.106
arre-0.104
Tokenaryn
Token ID38621
Feature activation+1.25e-01

Pos logit contributions
idd+1.414
rol+1.410
iw+1.264
und+1.243
arre+1.194
Neg logit contributions
The-1.289
 finish-1.268
In-1.242
idi-1.238
 ends-1.237
Token from
Token ID422
Feature activation+2.41e-01

Pos logit contributions
idd+0.232
rol+0.231
und+0.211
iw+0.210
arre+0.208
Neg logit contributions
The-0.114
In-0.109
 finish-0.108
idi-0.108
 ends-0.107
Token that
Token ID326
Feature activation+2.89e+00

Pos logit contributions
idd+0.495
rol+0.489
und+0.452
iw+0.450
arre+0.439
Neg logit contributions
The-0.178
In-0.168
 finish-0.168
 ends-0.166
 grant-0.165
Token day
Token ID1110
Feature activation-2.61e-01

Pos logit contributions
rol+7.272
idd+7.206
und+6.530
arre+6.484
ays+6.448
Neg logit contributions
 before-1.589
The-1.543
 finish-1.516
 the-1.494
 had-1.445
Token on
Token ID319
Feature activation+1.48e-01

Pos logit contributions
The+0.241
 grant+0.237
 ends+0.237
idi+0.235
In+0.231
Neg logit contributions
idd-0.482
rol-0.476
und-0.441
arre-0.430
iw-0.425
Token.
Token ID13
Feature activation+7.00e-01

Pos logit contributions
idd+0.276
rol+0.273
und+0.253
iw+0.247
arre+0.246
Neg logit contributions
The-0.134
 finish-0.130
 grant-0.128
In-0.128
 ends-0.128
Token
Token ID198
Feature activation-9.32e-02

Pos logit contributions
idd+1.368
rol+1.356
und+1.239
iw+1.238
arre+1.218
Neg logit contributions
The-0.591
 finish-0.562
In-0.555
idi-0.550
 grant-0.546
Token
Token ID198
Feature activation+4.48e-01

Pos logit contributions
 finish+0.076
 ends+0.075
 before+0.074
 grant+0.073
The+0.071
Neg logit contributions
idd-0.183
rol-0.182
und-0.167
iw-0.163
arre-0.163
Token grandma
Token ID49890
Feature activation-3.82e-01

Pos logit contributions
rol+1.838
idd+1.828
und+1.661
iw+1.638
arre+1.633
Neg logit contributions
The-0.912
 finish-0.893
 ends-0.877
 before-0.867
 grant-0.856
Token.
Token ID13
Feature activation+5.00e-01

Pos logit contributions
The+0.358
In+0.355
idi+0.348
 grant+0.344
 finish+0.340
Neg logit contributions
idd-0.699
rol-0.692
iw-0.635
und-0.631
arre-0.621
Token She
Token ID1375
Feature activation+1.63e-01

Pos logit contributions
idd+0.877
rol+0.870
und+0.787
iw+0.784
arre+0.770
Neg logit contributions
The-0.516
 finish-0.501
In-0.492
 ends-0.491
idi-0.490
Token lives
Token ID3160
Feature activation+2.97e-02

Pos logit contributions
idd+0.266
rol+0.262
und+0.236
iw+0.236
arre+0.230
Neg logit contributions
The-0.188
 finish-0.181
In-0.180
idi-0.179
 before-0.177
Token in
Token ID287
Feature activation+1.07e+00

Pos logit contributions
idd+0.057
rol+0.056
iw+0.052
und+0.052
arre+0.050
Neg logit contributions
The-0.025
 finish-0.025
 grant-0.024
idi-0.024
 ends-0.024
Token that
Token ID326
Feature activation+2.87e+00

Pos logit contributions
idd+2.611
rol+2.595
iw+2.398
und+2.388
arre+2.384
Neg logit contributions
The-0.448
 finish-0.391
 ends-0.368
 before-0.367
In-0.357
Token house
Token ID2156
Feature activation-1.57e-01

Pos logit contributions
idd+6.480
rol+6.380
iw+5.712
und+5.678
rou+5.596
Neg logit contributions
 finish-2.384
The-2.320
 ends-2.302
 the-2.284
 before-2.282
Token,"
Token ID553
Feature activation-1.01e+00

Pos logit contributions
The+0.138
In+0.133
idi+0.133
 grant+0.132
 finish+0.132
Neg logit contributions
idd-0.294
rol-0.293
und-0.269
iw-0.267
arre-0.261
Token Anna
Token ID11735
Feature activation+3.87e-01

Pos logit contributions
In+0.706
ers+0.695
ace+0.687
 finish+0.669
 ends+0.663
Neg logit contributions
idd-2.053
iw-1.932
rol-1.931
und-1.862
 batch-1.853
Token says
Token ID1139
Feature activation+2.05e-01

Pos logit contributions
idd+0.594
rol+0.587
und+0.524
iw+0.522
arre+0.514
Neg logit contributions
The-0.485
 finish-0.472
In-0.468
 grant-0.464
 ends-0.463
Token and
Token ID290
Feature activation+5.77e-01

Pos logit contributions
idd+0.403
rol+0.399
iw+0.364
und+0.364
arre+0.358
Neg logit contributions
The-0.169
In-0.162
 finish-0.161
idi-0.160
 ends-0.158
Token Jones
Token ID5437
Feature activation-4.66e-01

Pos logit contributions
rol+1.523
idd+1.512
und+1.326
arre+1.310
iw+1.307
Neg logit contributions
The-1.553
 grant-1.534
 finish-1.532
 before-1.526
 ends-1.499
Token.
Token ID13
Feature activation+5.51e-01

Pos logit contributions
The+0.463
In+0.449
idi+0.446
 finish+0.436
 grant+0.431
Neg logit contributions
idd-0.834
rol-0.816
iw-0.756
und-0.755
arre-0.728
Token He
Token ID679
Feature activation+2.01e-01

Pos logit contributions
idd+1.025
rol+1.017
und+0.926
iw+0.923
arre+0.908
Neg logit contributions
The-0.514
 finish-0.496
In-0.487
 ends-0.485
 grant-0.485
Token lives
Token ID3160
Feature activation+8.60e-02

Pos logit contributions
idd+0.339
rol+0.333
und+0.302
iw+0.302
arre+0.295
Neg logit contributions
The-0.223
 finish-0.213
idi-0.212
In-0.212
 before-0.210
Token in
Token ID287
Feature activation+1.05e+00

Pos logit contributions
idd+0.174
rol+0.173
iw+0.159
und+0.159
arre+0.154
Neg logit contributions
The-0.064
 finish-0.062
 grant-0.061
idi-0.061
In-0.060
Token that
Token ID326
Feature activation+2.86e+00

Pos logit contributions
idd+2.583
rol+2.572
iw+2.374
und+2.367
arre+2.367
Neg logit contributions
The-0.419
 finish-0.366
 before-0.349
 ends-0.344
In-0.331
Token building
Token ID2615
Feature activation+4.89e-01

Pos logit contributions
rol+6.329
idd+6.183
und+5.639
iw+5.608
arre+5.507
Neg logit contributions
 finish-2.658
 ends-2.505
 before-2.470
The-2.411
 the-2.370
Token over
Token ID625
Feature activation+9.78e-01

Pos logit contributions
idd+0.965
rol+0.958
und+0.878
iw+0.874
arre+0.860
Neg logit contributions
The-0.398
 finish-0.382
In-0.375
idi-0.372
 grant-0.372
Token there
Token ID612
Feature activation+6.97e-01

Pos logit contributions
idd+2.094
rol+2.086
und+1.910
iw+1.908
arre+1.898
Neg logit contributions
The-0.668
 finish-0.621
 before-0.620
 ends-0.607
In-0.597
Token.
Token ID13
Feature activation+3.94e-01

Pos logit contributions
idd+1.392
rol+1.377
und+1.262
iw+1.259
arre+1.241
Neg logit contributions
The-0.558
 finish-0.537
 ends-0.523
 before-0.522
In-0.521
Token He
Token ID679
Feature activation+4.34e-01

Pos logit contributions
idd+0.681
rol+0.675
und+0.609
iw+0.608
arre+0.594
Neg logit contributions
The-0.416
 finish-0.405
 ends-0.398
In-0.398
idi-0.396
Token!"
Token ID2474
Feature activation-3.37e-01

Pos logit contributions
idd+0.767
rol+0.759
und+0.673
iw+0.669
arre+0.655
Neg logit contributions
The-0.697
 finish-0.680
In-0.671
 ends-0.669
 grant-0.669
Token
Token ID198
Feature activation-5.48e-01

Pos logit contributions
idi+0.254
 ends+0.248
The+0.247
 finish+0.245
 grant+0.244
Neg logit contributions
idd-0.664
rol-0.662
und-0.624
iw-0.611
arre-0.603
Token
Token ID198
Feature activation-5.45e-01

Pos logit contributions
 ends+0.490
 finish+0.487
 before+0.480
 grant+0.474
idi+0.469
Neg logit contributions
idd-1.005
rol-0.999
und-0.937
iw-0.897
arre-0.888
TokenTo
Token ID2514
Feature activation+2.21e-02

Pos logit contributions
 before+0.301
 ends+0.298
ers+0.285
 finish+0.282
idi+0.282
Neg logit contributions
idd-1.182
rol-1.171
und-1.111
arre-1.096
iw-1.076
Token her
Token ID607
Feature activation-2.04e-01

Pos logit contributions
rol+0.042
idd+0.042
und+0.039
iw+0.038
arre+0.038
Neg logit contributions
The-0.018
In-0.018
idi-0.018
 ends-0.018
 finish-0.017
Token su
Token ID424
Feature activation+2.86e+00

Pos logit contributions
The+0.236
ers+0.233
In+0.230
 grant+0.227
 before+0.224
Neg logit contributions
idd-0.328
rol-0.315
und-0.293
iw-0.289
arre-0.289
Tokenprise
Token ID7919
Feature activation+2.88e-01

Pos logit contributions
idd+3.760
rol+3.737
 Mechan+3.330
iw+3.311
ays+3.252
Neg logit contributions
The-4.585
In-4.487
 finish-4.181
ers-4.079
idi-3.980
Token,
Token ID11
Feature activation-7.48e-02

Pos logit contributions
idd+0.500
rol+0.496
und+0.452
iw+0.444
arre+0.440
Neg logit contributions
The-0.304
In-0.291
 finish-0.290
 grant-0.287
idi-0.285
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
The+0.027
 finish+0.027
idi+0.026
In+0.026
 ends+0.026
Neg logit contributions
idd-0.177
rol-0.177
und-0.165
iw-0.164
arre-0.161
Token thunder
Token ID18355
Feature activation-6.10e-01

Pos logit contributions
The+0.121
 grant+0.118
idi+0.117
In+0.117
 finish+0.115
Neg logit contributions
idd-0.189
rol-0.181
arre-0.168
und-0.165
iw-0.164
Token went
Token ID1816
Feature activation+5.55e-01

Pos logit contributions
The+0.682
 grant+0.671
In+0.663
 finish+0.659
ace+0.657
Neg logit contributions
idd-1.004
rol-0.992
und-0.918
arre-0.881
iw-0.843
Token them
Token ID606
Feature activation+1.10e+00

Pos logit contributions
The+0.440
 grant+0.439
In+0.438
idi+0.429
 finish+0.421
Neg logit contributions
idd-0.689
rol-0.689
und-0.621
iw-0.614
ays-0.600
Token.
Token ID13
Feature activation-4.35e-01

Pos logit contributions
idd+2.351
rol+2.315
iw+2.158
und+2.114
arre+2.106
Neg logit contributions
The-0.781
 finish-0.762
 before-0.744
 ends-0.738
In-0.709
Token They
Token ID1119
Feature activation-2.14e-01

Pos logit contributions
idi+0.351
 finish+0.349
 ends+0.349
 grant+0.341
ers+0.338
Neg logit contributions
idd-0.846
rol-0.831
und-0.782
iw-0.766
arre-0.757
Token cried
Token ID16896
Feature activation+8.40e-01

Pos logit contributions
The+0.252
idi+0.250
 finish+0.246
 grant+0.245
In+0.244
Neg logit contributions
idd-0.338
rol-0.333
und-0.303
iw-0.298
arre-0.287
Token and
Token ID290
Feature activation+7.33e-01

Pos logit contributions
idd+1.917
rol+1.902
iw+1.767
und+1.757
arre+1.723
Neg logit contributions
The-0.463
 finish-0.431
 before-0.421
 ends-0.415
In-0.413
Token bl
Token ID698
Feature activation+2.80e+00

Pos logit contributions
idd+1.187
rol+1.177
iw+1.052
und+1.052
arre+1.036
Neg logit contributions
The-0.874
 finish-0.854
 ends-0.836
In-0.836
 grant-0.836
Tokened
Token ID276
Feature activation+9.22e-01

Pos logit contributions
rol+4.403
idd+4.364
 batch+4.093
iw+4.049
 Mechan+4.008
Neg logit contributions
The-3.342
In-3.294
ers-3.238
ace-3.226
 ends-3.210
Token and
Token ID290
Feature activation+9.26e-01

Pos logit contributions
idd+1.989
rol+1.965
iw+1.820
und+1.800
arre+1.780
Neg logit contributions
The-0.618
 finish-0.588
 before-0.582
 ends-0.578
In-0.560
Token hurt
Token ID5938
Feature activation+2.37e-01

Pos logit contributions
idd+1.608
rol+1.597
und+1.432
iw+1.430
arre+1.422
Neg logit contributions
The-1.006
 finish-0.984
 before-0.962
 ends-0.960
 grant-0.955
Token.
Token ID13
Feature activation-8.38e-01

Pos logit contributions
idd+0.467
rol+0.462
und+0.425
iw+0.419
arre+0.414
Neg logit contributions
The-0.193
 finish-0.190
 grant-0.188
idi-0.188
In-0.186
Token
Token ID198
Feature activation-5.86e-01

Pos logit contributions
 finish+0.815
isy+0.808
 ends+0.801
ers+0.794
idi+0.791
Neg logit contributions
idd-1.518
rol-1.452
und-1.401
iw-1.348
arre-1.340
Token:
Token ID25
Feature activation+3.66e-01

Pos logit contributions
The+0.826
In+0.784
 grant+0.752
 ends+0.745
 finish+0.722
Neg logit contributions
idd-2.748
rol-2.568
arre-2.487
und-2.484
rou-2.409
Token But
Token ID887
Feature activation+2.17e-01

Pos logit contributions
idd+0.702
rol+0.692
und+0.636
iw+0.632
arre+0.624
Neg logit contributions
The-0.317
 finish-0.307
 ends-0.302
In-0.301
idi-0.300
Token I
Token ID314
Feature activation+6.45e-01

Pos logit contributions
idd+0.458
rol+0.453
und+0.420
iw+0.418
arre+0.410
Neg logit contributions
The-0.145
 finish-0.138
In-0.137
 ends-0.136
 grant-0.136
Token was
Token ID373
Feature activation-7.22e-02

Pos logit contributions
idd+1.171
rol+1.165
und+1.055
iw+1.054
arre+1.030
Neg logit contributions
The-0.632
 finish-0.614
In-0.601
 grant-0.597
 ends-0.597
Token having
Token ID1719
Feature activation+9.67e-01

Pos logit contributions
The+0.086
 grant+0.083
In+0.083
 ends+0.083
idi+0.081
Neg logit contributions
idd-0.118
rol-0.114
iw-0.102
arre-0.101
und-0.101
Token so
Token ID523
Feature activation+2.79e+00

Pos logit contributions
idd+1.809
rol+1.797
iw+1.638
und+1.628
arre+1.578
Neg logit contributions
The-0.924
 finish-0.897
 ends-0.872
 before-0.868
In-0.861
Token much
Token ID881
Feature activation+4.70e-01

Pos logit contributions
rol+5.231
idd+4.878
iw+4.533
bat+4.419
 Mechan+4.326
Neg logit contributions
The-3.409
In-3.230
 time-3.178
 finish-3.135
 the-2.982
Token fun
Token ID1257
Feature activation+7.25e-01

Pos logit contributions
idd+0.770
rol+0.760
und+0.685
iw+0.681
arre+0.670
Neg logit contributions
The-0.547
 finish-0.529
In-0.523
 grant-0.523
 ends-0.522
Token!
Token ID0
Feature activation+3.72e-01

Pos logit contributions
idd+1.270
rol+1.267
iw+1.142
und+1.140
arre+1.118
Neg logit contributions
The-0.751
 finish-0.729
 before-0.717
In-0.712
 ends-0.712
Token
Token ID198
Feature activation+2.24e-01

Pos logit contributions
idd+0.707
rol+0.700
und+0.640
iw+0.637
arre+0.627
Neg logit contributions
The-0.329
 finish-0.319
 ends-0.314
 grant-0.313
idi-0.312
Token
Token ID198
Feature activation+2.16e-01

Pos logit contributions
idd+0.448
rol+0.442
und+0.408
iw+0.405
arre+0.401
Neg logit contributions
 finish-0.171
The-0.170
 ends-0.169
 grant-0.168
 before-0.166
Token,
Token ID11
Feature activation-2.84e-01

Pos logit contributions
idd+1.029
rol+1.021
und+0.931
iw+0.927
arre+0.912
Neg logit contributions
The-0.501
 finish-0.484
In-0.474
 ends-0.473
 grant-0.472
Token scary
Token ID14343
Feature activation+9.07e-01

Pos logit contributions
The+0.293
idi+0.291
 finish+0.288
In+0.287
ers+0.285
Neg logit contributions
idd-0.476
rol-0.469
iw-0.435
und-0.426
arre-0.421
Token cat
Token ID3797
Feature activation+1.01e+00

Pos logit contributions
rol+1.721
idd+1.705
und+1.541
iw+1.538
arre+1.521
Neg logit contributions
The-0.826
 finish-0.808
 ends-0.791
 grant-0.790
 before-0.782
Token and
Token ID290
Feature activation+1.91e-01

Pos logit contributions
rol+2.030
idd+2.023
iw+1.842
arre+1.827
und+1.826
Neg logit contributions
The-0.821
 finish-0.777
 before-0.765
In-0.762
 grant-0.758
Token got
Token ID1392
Feature activation+1.63e+00

Pos logit contributions
idd+0.299
rol+0.295
und+0.266
iw+0.265
arre+0.255
Neg logit contributions
The-0.229
 finish-0.227
 ends-0.223
In-0.222
idi-0.222
Token very
Token ID845
Feature activation+2.79e+00

Pos logit contributions
idd+2.832
rol+2.768
und+2.593
arre+2.551
iw+2.515
Neg logit contributions
The-1.783
 finish-1.754
 ends-1.715
 grant-1.706
 before-1.695
Token scared
Token ID12008
Feature activation+8.97e-01

Pos logit contributions
rol+5.523
idd+5.431
rob+5.197
arre+5.026
und+5.016
Neg logit contributions
The-2.745
 old-2.742
 finish-2.691
 the-2.678
 in-2.631
Token.
Token ID13
Feature activation+3.69e-01

Pos logit contributions
idd+1.905
rol+1.886
iw+1.741
und+1.733
arre+1.715
Neg logit contributions
The-0.642
 before-0.602
 finish-0.599
 ends-0.582
In-0.581
Token "
Token ID366
Feature activation+9.99e-01

Pos logit contributions
idd+0.599
rol+0.594
und+0.536
iw+0.533
arre+0.520
Neg logit contributions
The-0.426
 finish-0.418
 ends-0.412
idi-0.411
 grant-0.410
TokenOh
Token ID5812
Feature activation+2.16e+00

Pos logit contributions
idd+1.736
rol+1.722
iw+1.548
und+1.544
arre+1.516
Neg logit contributions
The-1.129
In-1.056
 finish-1.038
 grant-1.027
idi-1.023
Token no
Token ID645
Feature activation+1.65e+00

Pos logit contributions
rol+4.237
idd+4.223
arre+3.921
und+3.807
ays+3.753
Neg logit contributions
The-2.033
 grant-2.000
 before-1.955
 finish-1.941
 be-1.940
Token got
Token ID1392
Feature activation+6.55e-01

Pos logit contributions
idd+0.804
rol+0.798
und+0.724
iw+0.719
arre+0.709
Neg logit contributions
The-0.450
 finish-0.434
In-0.428
 grant-0.428
 ends-0.427
Token off
Token ID572
Feature activation+1.87e-01

Pos logit contributions
idd+1.293
rol+1.283
und+1.178
iw+1.173
arre+1.155
Neg logit contributions
The-0.547
 finish-0.528
In-0.513
 ends-0.512
idi-0.511
Token the
Token ID262
Feature activation+4.90e-01

Pos logit contributions
idd+0.401
rol+0.399
und+0.369
iw+0.367
arre+0.360
Neg logit contributions
The-0.116
In-0.110
 ends-0.110
 finish-0.110
 grant-0.109
Token couch
Token ID18507
Feature activation+1.02e+00

Pos logit contributions
idd+0.851
rol+0.844
und+0.763
iw+0.760
arre+0.747
Neg logit contributions
The-0.522
 finish-0.505
In-0.498
 grant-0.497
idi-0.496
Token and
Token ID290
Feature activation+6.49e-01

Pos logit contributions
idd+2.377
rol+2.344
iw+2.189
und+2.165
arre+2.144
Neg logit contributions
The-0.495
 finish-0.479
 before-0.468
 ends-0.438
In-0.431
Token tr
Token ID491
Feature activation+2.78e+00

Pos logit contributions
idd+1.242
rol+1.231
und+1.123
iw+1.122
arre+1.104
Neg logit contributions
The-0.574
 finish-0.555
In-0.541
 ends-0.541
idi-0.540
Tokenud
Token ID463
Feature activation+4.86e-01

Pos logit contributions
rol+2.665
idd+2.513
iw+2.356
 poss+2.205
 Good+2.182
Neg logit contributions
In-4.969
The-4.947
idi-4.861
 finish-4.736
ace-4.733
Tokenged
Token ID2004
Feature activation+9.35e-01

Pos logit contributions
idd+0.643
rol+0.635
und+0.555
iw+0.551
arre+0.537
Neg logit contributions
The-0.717
 finish-0.702
In-0.694
 ends-0.692
 grant-0.692
Token out
Token ID503
Feature activation-6.06e-02

Pos logit contributions
idd+1.784
rol+1.767
iw+1.618
und+1.601
arre+1.581
Neg logit contributions
The-0.860
 finish-0.827
 before-0.811
 ends-0.810
In-0.804
Token of
Token ID286
Feature activation+8.76e-01

Pos logit contributions
The+0.041
In+0.039
 finish+0.039
 grant+0.039
idi+0.039
Neg logit contributions
rol-0.127
idd-0.127
und-0.117
iw-0.114
arre-0.112
Token the
Token ID262
Feature activation+6.30e-01

Pos logit contributions
idd+2.141
rol+2.128
iw+1.978
und+1.974
arre+1.968
Neg logit contributions
The-0.326
 finish-0.304
 before-0.287
idi-0.283
 ends-0.283
Token\n
Token ID198
Feature activation-6.57e-01

Pos logit contributions
 finish+0.485
 ends+0.478
 before+0.477
ers+0.474
idi+0.469
Neg logit contributions
idd-0.988
rol-0.946
und-0.907
iw-0.887
arre-0.866
TokenHer
Token ID9360
Feature activation-6.00e-01

Pos logit contributions
 before+0.748
 ends+0.746
ers+0.741
 finish+0.729
 grant+0.721
Neg logit contributions
idd-1.074
rol-0.981
und-0.957
iw-0.945
arre-0.918
Token mom
Token ID1995
Feature activation+1.38e-01

Pos logit contributions
The+0.480
In+0.459
 grant+0.456
ers+0.454
idi+0.447
Neg logit contributions
idd-1.206
rol-1.119
iw-1.078
arre-1.063
rou-1.062
Token sighed
Token ID21893
Feature activation+4.88e-01

Pos logit contributions
idd+0.241
rol+0.236
und+0.215
iw+0.214
arre+0.211
Neg logit contributions
The-0.143
 finish-0.140
 grant-0.140
In-0.138
 ends-0.137
Token and
Token ID290
Feature activation+3.03e-01

Pos logit contributions
idd+1.074
rol+1.067
und+0.987
iw+0.984
arre+0.972
Neg logit contributions
The-0.287
 finish-0.271
In-0.264
 grant-0.262
 ends-0.262
Token kn
Token ID638
Feature activation+2.77e+00

Pos logit contributions
idd+0.515
rol+0.507
und+0.460
iw+0.460
arre+0.450
Neg logit contributions
The-0.327
 finish-0.319
 grant-0.316
In-0.315
 ends-0.315
Tokenelt
Token ID2120
Feature activation+1.27e+00

Pos logit contributions
rol+3.306
idd+3.099
und+2.884
rob+2.824
 batch+2.783
Neg logit contributions
The-4.514
In-4.339
 finish-4.207
ers-4.177
ace-4.121
Token down
Token ID866
Feature activation+4.43e-01

Pos logit contributions
idd+2.153
rol+2.129
und+1.886
iw+1.861
arre+1.855
Neg logit contributions
The-1.506
 ends-1.427
 before-1.421
 finish-1.415
In-1.397
Token next
Token ID1306
Feature activation-6.65e-01

Pos logit contributions
idd+0.897
rol+0.890
und+0.818
iw+0.815
arre+0.803
Neg logit contributions
The-0.341
 finish-0.327
In-0.321
idi-0.320
 grant-0.320
Token to
Token ID284
Feature activation+8.82e-02

Pos logit contributions
 grant+0.570
The+0.568
idi+0.523
In+0.523
 finish+0.504
Neg logit contributions
idd-1.275
rol-1.235
iw-1.164
und-1.156
arre-1.134
Token her
Token ID607
Feature activation-7.58e-01

Pos logit contributions
idd+0.154
rol+0.151
und+0.139
iw+0.138
arre+0.135
Neg logit contributions
The-0.090
 finish-0.087
In-0.087
 ends-0.087
idi-0.087
Token little
Token ID1310
Feature activation+4.44e-01

Pos logit contributions
idd+0.110
rol+0.107
iw+0.100
und+0.100
arre+0.098
Neg logit contributions
The-0.040
In-0.038
 finish-0.038
idi-0.038
 grant-0.037
Token 3
Token ID513
Feature activation+1.03e+00

Pos logit contributions
idd+0.838
rol+0.827
iw+0.756
und+0.755
arre+0.740
Neg logit contributions
The-0.402
 finish-0.387
In-0.383
 grant-0.380
idi-0.379
Token year
Token ID614
Feature activation-4.53e-01

Pos logit contributions
rol+1.459
idd+1.442
und+1.270
iw+1.264
arre+1.253
Neg logit contributions
The-1.441
idi-1.387
 finish-1.377
 ends-1.372
 before-1.370
Token old
Token ID1468
Feature activation-6.02e-01

Pos logit contributions
 grant+0.184
The+0.183
In+0.182
 finish+0.165
idi+0.164
Neg logit contributions
idd-1.089
rol-1.053
iw-1.003
und-0.988
rou-0.977
Token boy
Token ID2933
Feature activation+6.79e-01

Pos logit contributions
In+0.633
 finish+0.632
The+0.631
 grant+0.620
 ends+0.606
Neg logit contributions
idd-1.091
rol-0.994
iw-0.954
und-0.945
 Mechan-0.891
Token cl
Token ID537
Feature activation+2.76e+00

Pos logit contributions
idd+1.135
rol+1.133
und+1.017
iw+1.014
arre+1.001
Neg logit contributions
The-0.758
 finish-0.734
In-0.721
idi-0.720
 grant-0.720
Tokenapping
Token ID5912
Feature activation+3.94e-01

Pos logit contributions
rol+3.817
 batch+3.533
idd+3.496
rob+3.491
iw+3.477
Neg logit contributions
The-3.870
 finish-3.735
ace-3.656
idi-3.592
 before-3.585
Token his
Token ID465
Feature activation-6.03e-01

Pos logit contributions
idd+0.764
rol+0.758
und+0.694
iw+0.687
arre+0.678
Neg logit contributions
The-0.336
 finish-0.325
In-0.319
 grant-0.318
 ends-0.316
Token hands
Token ID2832
Feature activation+9.29e-01

Pos logit contributions
The+0.763
idi+0.751
 grant+0.740
In+0.739
 finish+0.733
Neg logit contributions
idd-0.921
rol-0.888
und-0.826
arre-0.794
iw-0.789
Token.
Token ID13
Feature activation-6.56e-01

Pos logit contributions
idd+2.058
rol+2.044
iw+1.904
und+1.884
arre+1.860
Neg logit contributions
The-0.541
 finish-0.519
 before-0.513
In-0.488
 ends-0.488
Token␣
Token ID220
Feature activation+4.63e-01

Pos logit contributions
 grant+0.644
ace+0.634
 ends+0.627
ers+0.619
 finish+0.614
Neg logit contributions
idd-1.180
rol-1.137
und-1.073
iw-1.062
arre-1.035
Token head
Token ID1182
Feature activation+5.59e-01

Pos logit contributions
In+1.559
idi+1.530
The+1.413
 behold+1.350
 grant+1.330
Neg logit contributions
idd-2.773
rol-2.734
und-2.458
 Mechan-2.446
iw-2.441
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
idd+1.185
rol+1.176
und+1.084
iw+1.081
arre+1.066
Neg logit contributions
The-0.377
 finish-0.358
In-0.348
 ends-0.347
 grant-0.346
Token Ben
Token ID3932
Feature activation-3.99e-01

Pos logit contributions
 ends+0.417
 grant+0.412
idi+0.411
 finish+0.410
ers+0.405
Neg logit contributions
idd-0.938
rol-0.909
und-0.864
iw-0.851
arre-0.835
Token cried
Token ID16896
Feature activation+9.58e-01

Pos logit contributions
idi+0.500
The+0.489
 finish+0.486
In+0.485
 grant+0.483
Neg logit contributions
idd-0.611
rol-0.586
und-0.543
iw-0.523
arre-0.521
Token and
Token ID290
Feature activation+7.58e-01

Pos logit contributions
idd+2.138
rol+2.117
iw+1.962
und+1.949
arre+1.905
Neg logit contributions
The-0.596
 finish-0.568
 before-0.555
 ends-0.542
In-0.531
Token bl
Token ID698
Feature activation+2.75e+00

Pos logit contributions
idd+1.180
rol+1.172
und+1.040
iw+1.039
arre+1.026
Neg logit contributions
The-0.951
 finish-0.927
 ends-0.913
 grant-0.912
In-0.909
Tokened
Token ID276
Feature activation+1.33e+00

Pos logit contributions
rol+4.302
idd+4.213
 batch+3.929
iw+3.863
rob+3.853
Neg logit contributions
The-3.464
In-3.320
 ends-3.283
ace-3.225
ers-3.165
Token.
Token ID13
Feature activation-8.01e-01

Pos logit contributions
idd+2.971
rol+2.925
iw+2.718
arre+2.661
und+2.659
Neg logit contributions
The-0.841
 before-0.834
 finish-0.816
 ends-0.792
ers-0.727
Token Sam
Token ID3409
Feature activation-4.20e-01

Pos logit contributions
 ends+0.734
 grant+0.733
isy+0.721
 finish+0.718
ers+0.717
Neg logit contributions
idd-1.483
rol-1.409
und-1.359
iw-1.329
arre-1.294
Token screamed
Token ID25421
Feature activation+1.60e+00

Pos logit contributions
The+0.435
 finish+0.433
idi+0.433
 grant+0.428
In+0.428
Neg logit contributions
idd-0.715
rol-0.693
und-0.645
iw-0.627
arre-0.621
Token and
Token ID290
Feature activation+7.17e-01

Pos logit contributions
idd+4.034
rol+4.001
iw+3.791
und+3.689
rob+3.612
Neg logit contributions
 before-0.630
The-0.606
 finish-0.595
 ends-0.505
In-0.462
Token She
Token ID1375
Feature activation+1.61e-01

Pos logit contributions
The+0.198
 ends+0.197
 finish+0.197
 grant+0.193
ers+0.193
Neg logit contributions
idd-0.339
rol-0.335
und-0.306
iw-0.304
arre-0.297
Token said
Token ID531
Feature activation+3.17e-01

Pos logit contributions
idd+0.274
rol+0.268
und+0.244
iw+0.242
arre+0.239
Neg logit contributions
The-0.174
 finish-0.171
 ends-0.168
 grant-0.167
In-0.167
Token,
Token ID11
Feature activation+2.66e-01

Pos logit contributions
idd+0.651
rol+0.645
und+0.595
iw+0.593
arre+0.583
Neg logit contributions
The-0.233
 finish-0.224
In-0.221
 ends-0.218
idi-0.218
Token "
Token ID366
Feature activation+1.57e-03

Pos logit contributions
idd+0.498
rol+0.491
und+0.448
iw+0.446
arre+0.440
Neg logit contributions
The-0.239
 finish-0.234
 ends-0.232
In-0.228
idi-0.228
TokenHi
Token ID17250
Feature activation+7.34e-01

Pos logit contributions
idd+0.003
rol+0.003
und+0.002
iw+0.002
arre+0.002
Neg logit contributions
 finish-0.002
 ends-0.002
 before-0.002
idi-0.002
The-0.002
Token p
Token ID279
Feature activation+2.74e+00

Pos logit contributions
idd+1.475
rol+1.467
und+1.342
iw+1.340
arre+1.328
Neg logit contributions
The-0.580
 finish-0.554
 ends-0.539
 before-0.539
In-0.539
Tokenanda
Token ID5282
Feature activation+3.21e-01

Pos logit contributions
rol+4.912
arre+4.765
idd+4.683
und+4.616
 Mechan+4.587
Neg logit contributions
ers-2.378
The-2.296
 before-2.210
 finish-2.170
isy-2.153
Token!
Token ID0
Feature activation+1.03e-01

Pos logit contributions
idd+0.533
rol+0.525
iw+0.474
und+0.473
arre+0.459
Neg logit contributions
The-0.366
 finish-0.356
In-0.354
 ends-0.351
idi-0.351
Token What
Token ID1867
Feature activation+4.85e-01

Pos logit contributions
idd+0.157
rol+0.155
und+0.137
iw+0.137
arre+0.132
Neg logit contributions
The-0.131
 finish-0.130
 ends-0.128
idi-0.127
ers-0.126
Token smells
Token ID25760
Feature activation-9.28e-01

Pos logit contributions
idd+0.839
rol+0.830
und+0.751
iw+0.748
arre+0.735
Neg logit contributions
The-0.516
 finish-0.500
In-0.492
idi-0.491
 grant-0.489
Token so
Token ID523
Feature activation+6.89e-01

Pos logit contributions
idi+0.955
The+0.950
ace+0.935
 grant+0.918
In+0.913
Neg logit contributions
rol-1.586
idd-1.511
und-1.441
iw-1.438
 batch-1.388
Token said
Token ID531
Feature activation+4.01e-01

Pos logit contributions
idd+0.251
rol+0.246
und+0.223
iw+0.219
arre+0.218
Neg logit contributions
The-0.171
 grant-0.166
 finish-0.165
 ends-0.164
In-0.164
Token,
Token ID11
Feature activation+3.20e-01

Pos logit contributions
idd+0.830
rol+0.823
und+0.757
iw+0.753
arre+0.744
Neg logit contributions
The-0.292
 finish-0.280
In-0.275
 ends-0.272
idi-0.272
Token "
Token ID366
Feature activation+4.82e-01

Pos logit contributions
idd+0.604
rol+0.597
und+0.545
iw+0.542
arre+0.534
Neg logit contributions
The-0.287
 finish-0.281
 ends-0.276
In-0.273
idi-0.272
TokenYou
Token ID1639
Feature activation+1.74e-02

Pos logit contributions
idd+0.756
rol+0.749
und+0.669
iw+0.666
arre+0.653
Neg logit contributions
The-0.593
 finish-0.581
 grant-0.571
 ends-0.571
In-0.570
Token had
Token ID550
Feature activation-9.16e-02

Pos logit contributions
idd+0.029
rol+0.028
und+0.025
arre+0.025
iw+0.025
Neg logit contributions
The-0.020
idi-0.019
 before-0.019
 finish-0.019
In-0.019
Token so
Token ID523
Feature activation+2.73e+00

Pos logit contributions
The+0.080
 grant+0.079
idi+0.077
In+0.076
 ends+0.075
Neg logit contributions
idd-0.173
rol-0.171
arre-0.157
und-0.157
iw-0.155
Token much
Token ID881
Feature activation+6.78e-01

Pos logit contributions
rol+4.919
idd+4.692
bat+4.357
und+4.315
arre+4.250
Neg logit contributions
The-3.205
In-3.159
 finish-3.048
 time-3.046
 ends-2.942
Token fun
Token ID1257
Feature activation+1.01e+00

Pos logit contributions
idd+1.144
rol+1.137
und+1.021
iw+1.018
arre+0.996
Neg logit contributions
The-0.757
 finish-0.741
In-0.726
 ends-0.724
 before-0.723
Token outside
Token ID2354
Feature activation+1.48e-01

Pos logit contributions
rol+1.616
idd+1.604
iw+1.440
und+1.422
arre+1.399
Neg logit contributions
The-1.210
 finish-1.198
 before-1.196
 ends-1.157
 grant-1.156
Token.
Token ID13
Feature activation-8.56e-03

Pos logit contributions
idd+0.282
rol+0.278
und+0.256
iw+0.253
arre+0.249
Neg logit contributions
The-0.132
In-0.126
idi-0.126
 finish-0.125
 grant-0.124
Token Let
Token ID3914
Feature activation+7.22e-01

Pos logit contributions
The+0.011
 finish+0.010
 ends+0.010
idi+0.010
ers+0.010
Neg logit contributions
idd-0.013
rol-0.013
und-0.012
iw-0.012
arre-0.011
Token She
Token ID1375
Feature activation-1.41e+00

Pos logit contributions
The+0.426
 grant+0.424
idi+0.423
 ends+0.416
ers+0.415
Neg logit contributions
rol-0.518
idd-0.513
iw-0.467
und-0.463
arre-0.455
Token loved
Token ID6151
Feature activation+4.79e-01

Pos logit contributions
 before+1.588
 finish+1.570
 ends+1.561
 grant+1.551
ers+1.550
Neg logit contributions
idd-2.271
rol-2.030
und-1.968
iw-1.923
 Mechan-1.916
Token going
Token ID1016
Feature activation+2.10e-02

Pos logit contributions
idd+0.999
rol+0.990
und+0.912
iw+0.911
arre+0.896
Neg logit contributions
The-0.339
 finish-0.323
In-0.315
 grant-0.314
 ends-0.314
Token to
Token ID284
Feature activation+2.34e-01

Pos logit contributions
idd+0.042
rol+0.042
iw+0.038
und+0.038
arre+0.037
Neg logit contributions
The-0.016
 grant-0.016
idi-0.016
 ends-0.016
 finish-0.015
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
idd+0.562
rol+0.557
und+0.522
iw+0.521
arre+0.512
Neg logit contributions
The-0.082
In-0.075
idi-0.074
 ends-0.074
 finish-0.074
Token toy
Token ID13373
Feature activation+2.73e+00

Pos logit contributions
idd+1.986
rol+1.979
und+1.765
iw+1.751
arre+1.727
Neg logit contributions
The-1.434
 finish-1.388
In-1.344
idi-1.320
 before-1.310
Token store
Token ID3650
Feature activation+1.05e+00

Pos logit contributions
rol+5.215
idd+5.061
rou+4.654
arre+4.621
und+4.488
Neg logit contributions
 finish-3.094
The-3.033
 before-2.957
 the-2.898
 ends-2.803
Token with
Token ID351
Feature activation+1.15e+00

Pos logit contributions
idd+2.207
rol+2.183
und+2.018
arre+1.995
iw+1.993
Neg logit contributions
The-0.769
 before-0.728
 finish-0.722
In-0.703
 ends-0.677
Token her
Token ID607
Feature activation+4.85e-01

Pos logit contributions
idd+2.082
rol+2.064
und+1.866
arre+1.849
iw+1.831
Neg logit contributions
The-1.264
 finish-1.204
 ends-1.178
 before-1.176
In-1.176
Token mom
Token ID1995
Feature activation-1.95e-02

Pos logit contributions
idd+1.058
rol+1.048
und+0.969
iw+0.968
arre+0.952
Neg logit contributions
The-0.303
 finish-0.288
In-0.279
 grant-0.279
idi-0.278
Token.
Token ID13
Feature activation-6.99e-01

Pos logit contributions
 grant+0.027
The+0.026
idi+0.026
 finish+0.026
 ends+0.026
Neg logit contributions
idd-0.027
rol-0.027
iw-0.024
und-0.024
arre-0.023
Token bought
Token ID5839
Feature activation+1.10e+00

Pos logit contributions
idd+0.587
rol+0.581
und+0.528
iw+0.526
arre+0.515
Neg logit contributions
The-0.331
 finish-0.319
In-0.316
 ends-0.315
 grant-0.313
Token two
Token ID734
Feature activation+2.73e-01

Pos logit contributions
idd+2.296
rol+2.270
iw+2.090
und+2.089
arre+2.061
Neg logit contributions
The-0.832
 finish-0.786
 ends-0.777
 before-0.769
idi-0.761
Token hot
Token ID3024
Feature activation+6.87e-02

Pos logit contributions
idd+0.450
rol+0.447
und+0.401
iw+0.400
arre+0.395
Neg logit contributions
The-0.309
 grant-0.299
 finish-0.299
In-0.298
idi-0.297
Token dogs
Token ID6844
Feature activation+5.12e-01

Pos logit contributions
idd+0.109
rol+0.107
iw+0.097
und+0.097
arre+0.095
Neg logit contributions
The-0.083
 grant-0.081
In-0.080
 finish-0.080
idi-0.079
Token with
Token ID351
Feature activation+1.33e+00

Pos logit contributions
idd+1.027
rol+1.020
und+0.935
iw+0.932
arre+0.918
Neg logit contributions
The-0.401
 finish-0.384
In-0.376
 grant-0.374
 ends-0.374
Token k
Token ID479
Feature activation+2.73e+00

Pos logit contributions
idd+2.834
rol+2.792
iw+2.578
arre+2.568
und+2.567
Neg logit contributions
The-0.974
 finish-0.905
 ends-0.884
 before-0.876
In-0.855
Tokenetchup
Token ID47132
Feature activation+1.16e-01

Pos logit contributions
rol+3.961
 Mechan+3.702
idd+3.690
arre+3.621
und+3.480
Neg logit contributions
The-3.644
ers-3.498
In-3.418
 ends-3.361
 before-3.293
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
idd+0.236
rol+0.234
und+0.216
iw+0.214
arre+0.211
Neg logit contributions
The-0.090
In-0.085
 finish-0.085
idi-0.085
 grant-0.084
Token Max
Token ID5436
Feature activation-4.71e-01

Pos logit contributions
The+0.230
 grant+0.228
 ends+0.223
 finish+0.222
In+0.221
Neg logit contributions
idd-0.395
rol-0.388
und-0.359
arre-0.355
iw-0.350
Token and
Token ID290
Feature activation+8.95e-02

Pos logit contributions
The+0.468
 grant+0.467
 finish+0.463
 ends+0.462
In+0.456
Neg logit contributions
idd-0.820
rol-0.792
und-0.741
arre-0.734
iw-0.728
Token Sam
Token ID3409
Feature activation+1.39e-01

Pos logit contributions
idd+0.159
rol+0.156
und+0.144
iw+0.142
arre+0.141
Neg logit contributions
The-0.089
 finish-0.087
 grant-0.087
In-0.087
idi-0.085
TokenOkay
Token ID16454
Feature activation-7.59e-01

Pos logit contributions
idd+0.433
rol+0.429
iw+0.384
und+0.381
arre+0.371
Neg logit contributions
The-0.358
 finish-0.357
 grant-0.352
 ends-0.351
idi-0.350
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
In+0.721
The+0.720
idi+0.698
 grant+0.690
 ends+0.660
Neg logit contributions
idd-1.348
rol-1.318
iw-1.217
und-1.212
arre-1.200
Token Anna
Token ID11735
Feature activation+6.79e-02

Pos logit contributions
The+0.968
In+0.960
idi+0.930
 grant+0.924
ers+0.913
Neg logit contributions
rol-1.903
idd-1.869
iw-1.757
 batch-1.719
und-1.701
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
idd+0.132
rol+0.129
und+0.120
iw+0.119
arre+0.117
Neg logit contributions
The-0.058
idi-0.055
 grant-0.055
In-0.055
 finish-0.054
Token That
Token ID1320
Feature activation+9.96e-01

Pos logit contributions
The+0.132
idi+0.129
 finish+0.128
 ends+0.126
In+0.126
Neg logit contributions
idd-0.190
rol-0.188
iw-0.171
und-0.168
arre-0.161
Token sounds
Token ID5238
Feature activation-2.95e+00

Pos logit contributions
idd+1.863
rol+1.851
iw+1.686
und+1.664
arre+1.608
Neg logit contributions
The-0.976
 finish-0.961
 ends-0.953
In-0.923
 before-0.917
Token like
Token ID588
Feature activation+4.64e-02

Pos logit contributions
idi+5.283
 grant+4.831
oster+4.823
The+4.739
ers+4.735
Neg logit contributions
 sorts-2.817
 Good-2.764
rol-2.721
arre-2.358
 Enjoy-2.311
Token fun
Token ID1257
Feature activation-1.58e-01

Pos logit contributions
idd+0.081
rol+0.080
und+0.073
iw+0.073
arre+0.071
Neg logit contributions
The-0.047
 finish-0.045
idi-0.045
In-0.045
 grant-0.045
Token.
Token ID13
Feature activation-3.78e-01

Pos logit contributions
The+0.142
idi+0.137
In+0.136
 grant+0.135
 finish+0.133
Neg logit contributions
idd-0.300
rol-0.293
und-0.271
arre-0.266
iw-0.265
Token But
Token ID887
Feature activation+4.66e-01

Pos logit contributions
The+0.368
idi+0.366
ers+0.363
 ends+0.362
 finish+0.360
Neg logit contributions
idd-0.668
rol-0.663
iw-0.604
und-0.598
 Good-0.579
Token remember
Token ID3505
Feature activation-4.09e-01

Pos logit contributions
idd+0.885
rol+0.877
und+0.802
iw+0.800
arre+0.785
Neg logit contributions
The-0.414
 finish-0.397
In-0.390
idi-0.389
 ends-0.389
Token him
Token ID683
Feature activation+3.83e-01

Pos logit contributions
 grant+0.179
In+0.179
idi+0.177
The+0.176
 finish+0.174
Neg logit contributions
idd-0.289
rol-0.285
und-0.258
iw-0.258
arre-0.248
Token.
Token ID13
Feature activation-2.26e-01

Pos logit contributions
idd+0.729
rol+0.727
und+0.663
iw+0.660
arre+0.648
Neg logit contributions
The-0.336
 finish-0.321
In-0.319
idi-0.319
 grant-0.317
Token Tom
Token ID4186
Feature activation-2.21e-01

Pos logit contributions
 grant+0.191
 ends+0.190
The+0.189
 finish+0.188
idi+0.185
Neg logit contributions
idd-0.425
rol-0.420
und-0.392
iw-0.387
arre-0.378
Token falls
Token ID8953
Feature activation+6.84e-01

Pos logit contributions
The+0.281
In+0.273
idi+0.271
 finish+0.268
 grant+0.267
Neg logit contributions
idd-0.333
rol-0.329
und-0.296
iw-0.294
arre-0.290
Token on
Token ID319
Feature activation-6.71e-01

Pos logit contributions
idd+1.356
rol+1.340
iw+1.227
und+1.227
arre+1.206
Neg logit contributions
The-0.567
 finish-0.546
 ends-0.534
 before-0.531
In-0.531
Token the
Token ID262
Feature activation-2.90e+00

Pos logit contributions
 grant+0.300
In+0.295
 ends+0.278
 finish+0.268
ace+0.266
Neg logit contributions
idd-1.522
rol-1.492
und-1.434
iw-1.423
 batch-1.383
Token ground
Token ID2323
Feature activation+7.10e-01

Pos logit contributions
idi+3.579
In+3.558
 behold+3.213
The+3.152
 involve+3.123
Neg logit contributions
iw-4.373
idd-4.341
rol-4.258
 batch-4.210
 bur-4.023
Token.
Token ID13
Feature activation-9.88e-01

Pos logit contributions
idd+1.525
rol+1.511
und+1.392
iw+1.389
arre+1.373
Neg logit contributions
The-0.468
 finish-0.447
 before-0.434
 ends-0.430
In-0.428
Token He
Token ID679
Feature activation+6.39e-01

Pos logit contributions
 grant+0.897
isy+0.892
ace+0.881
 ends+0.857
 finish+0.849
Neg logit contributions
rol-1.782
idd-1.775
und-1.663
iw-1.631
arre-1.580
Token gets
Token ID3011
Feature activation+9.10e-01

Pos logit contributions
idd+1.017
rol+1.007
und+0.900
iw+0.896
arre+0.879
Neg logit contributions
The-0.773
 finish-0.753
 ends-0.743
 grant-0.740
In-0.739
Token dirty
Token ID11841
Feature activation+5.93e-01

Pos logit contributions
idd+1.511
rol+1.493
und+1.362
iw+1.343
arre+1.321
Neg logit contributions
The-1.054
 finish-1.028
 ends-1.009
 before-0.998
In-0.995
Token He
Token ID679
Feature activation+2.21e-02

Pos logit contributions
 grant+0.280
 finish+0.277
 ends+0.276
The+0.273
idi+0.267
Neg logit contributions
idd-0.631
rol-0.619
und-0.574
iw-0.570
arre-0.563
Token threw
Token ID9617
Feature activation+6.18e-01

Pos logit contributions
idd+0.037
rol+0.036
und+0.033
iw+0.033
arre+0.032
Neg logit contributions
The-0.024
 finish-0.023
In-0.023
 grant-0.023
 before-0.023
Token the
Token ID262
Feature activation-1.20e+00

Pos logit contributions
idd+1.311
rol+1.300
und+1.197
iw+1.193
arre+1.178
Neg logit contributions
The-0.426
 finish-0.405
 ends-0.394
In-0.392
idi-0.390
Token pieces
Token ID5207
Feature activation+6.36e-01

Pos logit contributions
idi+1.411
The+1.361
In+1.327
 before+1.311
 finish+1.307
Neg logit contributions
idd-1.878
rol-1.740
arre-1.708
iw-1.669
und-1.660
Token on
Token ID319
Feature activation-1.12e+00

Pos logit contributions
idd+1.275
rol+1.264
und+1.159
iw+1.156
arre+1.138
Neg logit contributions
The-0.505
 finish-0.484
In-0.471
 ends-0.471
 before-0.468
Token the
Token ID262
Feature activation-2.89e+00

Pos logit contributions
 grant+0.742
In+0.671
 finish+0.645
 ends+0.627
ace+0.622
Neg logit contributions
idd-2.337
rol-2.298
und-2.250
iw-2.190
 batch-2.120
Token ground
Token ID2323
Feature activation+7.31e-01

Pos logit contributions
idi+3.550
In+3.520
 behold+3.303
The+3.216
ace+3.142
Neg logit contributions
idd-4.331
 bur-4.120
iw-4.032
arre-4.016
und-3.997
Token and
Token ID290
Feature activation+9.64e-02

Pos logit contributions
idd+1.667
rol+1.654
iw+1.528
und+1.527
arre+1.507
Neg logit contributions
The-0.380
 finish-0.360
 before-0.348
 ends-0.343
In-0.341
Token started
Token ID2067
Feature activation+8.99e-01

Pos logit contributions
idd+0.174
rol+0.171
und+0.157
iw+0.156
arre+0.151
Neg logit contributions
The-0.094
In-0.091
 finish-0.090
 grant-0.090
idi-0.089
Token to
Token ID284
Feature activation+6.29e-01

Pos logit contributions
idd+1.967
rol+1.965
und+1.802
iw+1.802
arre+1.771
Neg logit contributions
The-0.554
 finish-0.534
 before-0.519
 grant-0.503
 ends-0.502
Token cry
Token ID3960
Feature activation+1.20e+00

Pos logit contributions
idd+0.975
rol+0.966
und+0.858
iw+0.856
arre+0.838
Neg logit contributions
The-0.794
 finish-0.777
 grant-0.762
 ends-0.760
In-0.760
Token,
Token ID11
Feature activation+1.50e-01

Pos logit contributions
The+0.510
idi+0.501
In+0.497
 grant+0.475
ers+0.467
Neg logit contributions
rol-1.050
idd-1.040
und-0.978
iw-0.949
arre-0.942
Token it
Token ID340
Feature activation+8.61e-02

Pos logit contributions
idd+0.321
rol+0.318
und+0.295
iw+0.294
arre+0.287
Neg logit contributions
The-0.092
 finish-0.090
In-0.088
idi-0.088
 ends-0.087
Token hopped
Token ID45230
Feature activation+6.27e-01

Pos logit contributions
idd+0.158
rol+0.157
und+0.144
iw+0.141
arre+0.141
Neg logit contributions
The-0.080
 finish-0.077
 grant-0.077
 ends-0.076
In-0.076
Token up
Token ID510
Feature activation+5.46e-01

Pos logit contributions
idd+1.192
rol+1.179
und+1.077
iw+1.075
arre+1.058
Neg logit contributions
The-0.569
 finish-0.549
In-0.537
 ends-0.535
 before-0.535
Token on
Token ID319
Feature activation+5.33e-03

Pos logit contributions
idd+1.201
rol+1.193
und+1.103
iw+1.100
arre+1.084
Neg logit contributions
The-0.329
 finish-0.311
In-0.301
 ends-0.300
 grant-0.299
Token the
Token ID262
Feature activation-2.86e+00

Pos logit contributions
idd+0.012
rol+0.012
und+0.012
iw+0.011
arre+0.011
Neg logit contributions
The-0.002
In-0.002
 grant-0.002
 finish-0.002
 ends-0.002
Token stand
Token ID1302
Feature activation+1.95e-01

Pos logit contributions
In+3.667
idi+3.560
ace+3.501
The+3.346
 behold+3.277
Neg logit contributions
iw-4.358
idd-4.218
oc-4.031
rol-3.898
 batch-3.866
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
idd+0.392
rol+0.389
und+0.359
iw+0.356
arre+0.349
Neg logit contributions
The-0.150
 finish-0.144
In-0.143
idi-0.143
 grant-0.143
Token The
Token ID383
Feature activation-6.02e-01

Pos logit contributions
The+0.091
 ends+0.090
 finish+0.089
 grant+0.088
ers+0.087
Neg logit contributions
idd-0.335
rol-0.332
und-0.311
iw-0.307
arre-0.301
Token mouse
Token ID10211
Feature activation-2.48e-01

Pos logit contributions
idi+0.713
The+0.709
In+0.700
ace+0.686
 grant+0.682
Neg logit contributions
idd-0.957
rol-0.878
iw-0.854
arre-0.822
 batch-0.812
Token thought
Token ID1807
Feature activation-1.10e+00

Pos logit contributions
The+0.224
 grant+0.220
 finish+0.219
In+0.217
 ends+0.213
Neg logit contributions
idd-0.453
rol-0.448
und-0.416
arre-0.404
iw-0.403
Token noise
Token ID7838
Feature activation-1.54e-01

Pos logit contributions
The+0.179
In+0.174
idi+0.172
 finish+0.169
 before+0.168
Neg logit contributions
idd-0.302
rol-0.296
und-0.271
iw-0.270
arre-0.263
Token as
Token ID355
Feature activation-1.20e+00

Pos logit contributions
The+0.111
idi+0.110
In+0.110
 grant+0.108
 finish+0.105
Neg logit contributions
idd-0.310
rol-0.309
und-0.284
iw-0.279
arre-0.279
Token well
Token ID880
Feature activation-4.33e-01

Pos logit contributions
In+1.209
idi+1.180
The+1.169
 finish+1.163
 grant+1.135
Neg logit contributions
idd-2.068
rol-2.039
und-1.850
rou-1.850
iw-1.835
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
The+0.374
In+0.362
idi+0.343
 grant+0.340
 finish+0.338
Neg logit contributions
idd-0.838
rol-0.831
und-0.772
iw-0.745
arre-0.736
Token It
Token ID632
Feature activation+5.06e-01

Pos logit contributions
The+0.168
 ends+0.167
 finish+0.166
 grant+0.165
In+0.162
Neg logit contributions
idd-0.418
rol-0.411
und-0.385
iw-0.380
arre-0.372
Token sounds
Token ID5238
Feature activation-2.86e+00

Pos logit contributions
idd+0.855
rol+0.848
und+0.764
iw+0.761
arre+0.747
Neg logit contributions
The-0.560
 finish-0.543
In-0.535
 grant-0.533
idi-0.532
Token like
Token ID588
Feature activation-1.94e-01

Pos logit contributions
oster+3.801
 grant+3.776
ace+3.758
idi+3.638
isy+3.540
Neg logit contributions
rol-3.918
idd-3.614
 Good-3.585
 sorts-3.475
 anyways-3.448
Token a
Token ID257
Feature activation+5.70e-01

Pos logit contributions
The+0.155
In+0.149
 finish+0.147
idi+0.147
 grant+0.146
Neg logit contributions
idd-0.381
rol-0.376
und-0.345
iw-0.344
arre-0.334
Token sn
Token ID3013
Feature activation+5.53e-01

Pos logit contributions
idd+1.057
rol+1.049
und+0.954
iw+0.950
arre+0.935
Neg logit contributions
The-0.536
 finish-0.518
In-0.508
 ends-0.507
 grant-0.506
Tokenore
Token ID382
Feature activation-7.42e-03

Pos logit contributions
idd+0.727
rol+0.718
und+0.626
iw+0.623
arre+0.608
Neg logit contributions
The-0.828
 finish-0.810
In-0.801
 ends-0.799
 grant-0.798
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
The+0.006
In+0.006
idi+0.006
 grant+0.006
 finish+0.006
Neg logit contributions
idd-0.014
rol-0.014
und-0.013
iw-0.013
arre-0.013
Tokenily
Token ID813
Feature activation+1.71e-01

Pos logit contributions
The+0.005
 finish+0.005
In+0.005
idi+0.005
 grant+0.005
Neg logit contributions
idd-0.023
rol-0.023
und-0.021
iw-0.021
arre-0.021
Token said
Token ID531
Feature activation-1.41e-02

Pos logit contributions
idd+0.340
rol+0.334
und+0.310
iw+0.308
arre+0.307
Neg logit contributions
The-0.132
 finish-0.128
 grant-0.127
In-0.125
 ends-0.125
Token,
Token ID11
Feature activation-4.83e-01

Pos logit contributions
The+0.011
In+0.011
idi+0.011
 finish+0.011
 grant+0.011
Neg logit contributions
idd-0.028
rol-0.027
iw-0.025
und-0.025
arre-0.025
Token "
Token ID366
Feature activation-5.27e-01

Pos logit contributions
 ends+0.499
 finish+0.486
idi+0.482
The+0.482
ers+0.476
Neg logit contributions
idd-0.828
rol-0.787
iw-0.739
und-0.725
arre-0.719
TokenThat
Token ID2504
Feature activation+4.74e-01

Pos logit contributions
 finish+0.690
idi+0.688
 before+0.684
 ends+0.680
 grant+0.672
Neg logit contributions
idd-0.759
rol-0.737
iw-0.691
und-0.661
arre-0.631
Token sounds
Token ID5238
Feature activation-2.85e+00

Pos logit contributions
idd+0.826
rol+0.818
und+0.741
iw+0.738
arre+0.727
Neg logit contributions
The-0.503
 finish-0.486
In-0.480
idi-0.478
 grant-0.477
Token great
Token ID1049
Feature activation+5.29e-01

Pos logit contributions
idi+4.435
oster+4.139
The+4.026
ace+3.966
ryn+3.893
Neg logit contributions
rol-3.323
 Good-3.198
 sorts-3.168
arre-3.093
idd-3.007
Token,
Token ID11
Feature activation+5.95e-01

Pos logit contributions
idd+1.101
rol+1.093
und+1.006
iw+1.003
arre+0.988
Neg logit contributions
The-0.384
 finish-0.366
In-0.358
 ends-0.356
 grant-0.356
Token grandma
Token ID49890
Feature activation-3.64e-01

Pos logit contributions
idd+1.048
rol+1.039
und+0.940
iw+0.936
arre+0.919
Neg logit contributions
The-0.620
 finish-0.601
In-0.589
 ends-0.589
 grant-0.588
Token.
Token ID13
Feature activation-7.56e-02

Pos logit contributions
The+0.346
In+0.339
idi+0.334
 finish+0.331
 grant+0.330
Neg logit contributions
idd-0.673
rol-0.641
iw-0.607
und-0.594
rou-0.580
Token Thank
Token ID6952
Feature activation-1.88e-01

Pos logit contributions
The+0.098
idi+0.097
 finish+0.096
 ends+0.095
In+0.094
Neg logit contributions
idd-0.111
rol-0.110
iw-0.098
und-0.098
arre-0.093
Token,
Token ID11
Feature activation+2.87e-03

Pos logit contributions
idd+0.114
rol+0.112
und+0.102
iw+0.102
arre+0.101
Neg logit contributions
The-0.049
In-0.047
idi-0.047
 finish-0.046
 ends-0.046
Token "
Token ID366
Feature activation+1.93e-01

Pos logit contributions
idd+0.005
rol+0.005
iw+0.005
und+0.005
arre+0.005
Neg logit contributions
The-0.003
 ends-0.003
 finish-0.003
In-0.003
idi-0.003
TokenOK
Token ID11380
Feature activation-8.06e-01

Pos logit contributions
idd+0.354
rol+0.348
iw+0.318
und+0.317
arre+0.312
Neg logit contributions
The-0.180
 finish-0.180
 grant-0.178
 ends-0.177
idi-0.177
Token,
Token ID11
Feature activation-8.70e-01

Pos logit contributions
In+0.790
The+0.782
 grant+0.772
idi+0.771
 ends+0.753
Neg logit contributions
idd-1.412
rol-1.358
und-1.256
arre-1.236
iw-1.236
Token that
Token ID326
Feature activation+2.08e+00

Pos logit contributions
In+0.869
The+0.859
idi+0.844
 ends+0.842
ers+0.833
Neg logit contributions
idd-1.504
rol-1.471
iw-1.365
und-1.307
arre-1.296
Token sounds
Token ID5238
Feature activation-2.82e+00

Pos logit contributions
rol+4.390
idd+4.287
iw+4.109
und+3.922
ays+3.800
Neg logit contributions
The-1.822
 ends-1.775
 finish-1.739
In-1.662
 before-1.624
Token fun
Token ID1257
Feature activation+1.14e-01

Pos logit contributions
idi+4.597
 grant+4.531
ace+4.312
The+4.267
isy+4.260
Neg logit contributions
 Good-3.003
rol-2.889
 sorts-2.749
idd-2.732
arre-2.639
Token.
Token ID13
Feature activation-2.90e-01

Pos logit contributions
idd+0.203
rol+0.199
und+0.181
iw+0.179
arre+0.177
Neg logit contributions
The-0.117
 grant-0.112
In-0.112
idi-0.112
 finish-0.111
Token What
Token ID1867
Feature activation+1.37e-01

Pos logit contributions
The+0.314
idi+0.313
ers+0.309
 ends+0.308
 finish+0.307
Neg logit contributions
idd-0.480
rol-0.475
iw-0.430
und-0.427
arre-0.410
Token things
Token ID1243
Feature activation-1.04e+00

Pos logit contributions
idd+0.286
rol+0.281
und+0.262
iw+0.261
arre+0.259
Neg logit contributions
The-0.093
idi-0.088
 finish-0.088
 grant-0.085
In-0.085
Token do
Token ID466
Feature activation-1.12e+00

Pos logit contributions
idi+0.803
The+0.764
ace+0.717
ers+0.695
In+0.681
Neg logit contributions
idd-2.047
rol-2.019
und-1.943
arre-1.942
iw-1.888
Token She
Token ID1375
Feature activation+2.81e-01

Pos logit contributions
idd+0.157
rol+0.155
iw+0.141
und+0.140
arre+0.136
Neg logit contributions
The-0.120
idi-0.117
 finish-0.117
 grant-0.117
In-0.117
Token said
Token ID531
Feature activation-6.69e-03

Pos logit contributions
idd+0.500
rol+0.491
und+0.449
iw+0.446
arre+0.442
Neg logit contributions
The-0.283
 finish-0.272
 grant-0.269
 before-0.268
In-0.268
Token,
Token ID11
Feature activation-3.46e-01

Pos logit contributions
The+0.005
In+0.005
 finish+0.005
idi+0.005
 ends+0.005
Neg logit contributions
idd-0.013
rol-0.013
und-0.012
iw-0.012
arre-0.012
Token "
Token ID366
Feature activation-5.17e-02

Pos logit contributions
 ends+0.346
 finish+0.344
The+0.343
idi+0.339
In+0.333
Neg logit contributions
idd-0.601
rol-0.579
iw-0.536
und-0.530
arre-0.526
TokenThat
Token ID2504
Feature activation+3.17e-01

Pos logit contributions
 finish+0.063
 grant+0.062
 ends+0.062
idi+0.062
 before+0.062
Neg logit contributions
idd-0.081
rol-0.078
iw-0.072
und-0.071
arre-0.069
Token sounds
Token ID5238
Feature activation-2.82e+00

Pos logit contributions
idd+0.545
rol+0.537
und+0.489
iw+0.484
arre+0.479
Neg logit contributions
The-0.339
 finish-0.327
In-0.325
idi-0.324
 grant-0.321
Token fun
Token ID1257
Feature activation-1.09e-01

Pos logit contributions
idi+4.038
oster+3.860
The+3.815
ers+3.558
In+3.511
Neg logit contributions
rol-3.662
 sorts-3.561
 Good-3.444
arre-3.318
idd-3.290
Token.
Token ID13
Feature activation-2.38e-01

Pos logit contributions
The+0.105
In+0.100
idi+0.100
 grant+0.099
 finish+0.098
Neg logit contributions
idd-0.201
rol-0.197
und-0.182
iw-0.178
arre-0.175
Token I
Token ID314
Feature activation+7.64e-01

Pos logit contributions
The+0.292
 finish+0.285
idi+0.284
In+0.282
 ends+0.282
Neg logit contributions
idd-0.372
rol-0.362
iw-0.328
und-0.325
arre-0.308
Token will
Token ID481
Feature activation-4.43e-01

Pos logit contributions
idd+1.351
rol+1.345
und+1.212
iw+1.208
arre+1.179
Neg logit contributions
The-0.794
 finish-0.782
 grant-0.761
In-0.761
 ends-0.758
Token draw
Token ID3197
Feature activation-1.10e-01

Pos logit contributions
idi+0.330
The+0.328
ers+0.326
 ends+0.318
In+0.312
Neg logit contributions
idd-0.874
rol-0.851
und-0.807
iw-0.791
arre-0.788
Token the
Token ID262
Feature activation-9.44e-01

Pos logit contributions
idd+3.156
rol+3.144
und+2.924
arre+2.908
iw+2.900
Neg logit contributions
The-0.491
 before-0.428
 finish-0.407
 ends-0.387
 grant-0.383
Token leaves
Token ID5667
Feature activation+1.13e-01

Pos logit contributions
idi+1.018
ers+0.984
The+0.982
In+0.963
 grant+0.956
Neg logit contributions
idd-1.516
rol-1.481
iw-1.446
und-1.393
arre-1.381
Token flying
Token ID7348
Feature activation+2.04e-01

Pos logit contributions
rol+0.215
idd+0.215
und+0.196
iw+0.194
arre+0.191
Neg logit contributions
The-0.097
idi-0.095
In-0.093
 finish-0.092
 grant-0.091
Token all
Token ID477
Feature activation-1.02e+00

Pos logit contributions
rol+0.412
idd+0.411
und+0.378
iw+0.373
arre+0.366
Neg logit contributions
The-0.156
 finish-0.150
 grant-0.149
idi-0.149
In-0.148
Token over
Token ID625
Feature activation-2.20e-01

Pos logit contributions
 grant+1.016
idi+0.912
The+0.890
In+0.879
 finish+0.864
Neg logit contributions
rol-1.945
idd-1.908
und-1.767
iw-1.729
 sorts-1.696
Token the
Token ID262
Feature activation-2.80e+00

Pos logit contributions
 grant+0.106
 finish+0.102
idi+0.098
The+0.098
In+0.096
Neg logit contributions
idd-0.506
rol-0.496
iw-0.471
und-0.468
arre-0.445
Token place
Token ID1295
Feature activation+4.25e-01

Pos logit contributions
idi+3.543
 grant+3.481
kin+3.423
oster+3.384
ace+3.308
Neg logit contributions
idd-4.303
 bur-4.215
iw-4.195
und-4.190
rol-3.901
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
idd+0.853
rol+0.847
und+0.776
iw+0.774
arre+0.760
Neg logit contributions
The-0.333
 finish-0.318
In-0.314
 grant-0.313
idi-0.313
Token She
Token ID1375
Feature activation+3.76e-01

Pos logit contributions
idd+0.211
rol+0.209
und+0.189
iw+0.188
arre+0.186
Neg logit contributions
The-0.155
 grant-0.151
 finish-0.151
idi-0.150
 ends-0.150
Token decided
Token ID3066
Feature activation-3.64e-01

Pos logit contributions
idd+0.677
rol+0.667
und+0.608
iw+0.605
arre+0.596
Neg logit contributions
The-0.375
 finish-0.362
 grant-0.356
In-0.356
 ends-0.356
Token to
Token ID284
Feature activation+3.69e-02

Pos logit contributions
 ends+0.292
ers+0.285
The+0.284
ace+0.282
 grant+0.281
Neg logit contributions
idd-0.696
rol-0.685
und-0.640
iw-0.639
arre-0.628
Token-
Token ID12
Feature activation+1.02e+00

Pos logit contributions
idd+1.921
rol+1.891
iw+1.743
und+1.710
arre+1.691
Neg logit contributions
The-1.089
 finish-1.066
 before-1.044
 grant-1.034
 ends-1.027
Tokentap
Token ID44335
Feature activation+9.55e-01

Pos logit contributions
idd+1.642
rol+1.618
iw+1.454
und+1.451
arre+1.435
Neg logit contributions
The-1.268
In-1.197
 finish-1.184
idi-1.176
 ends-1.164
Token-
Token ID12
Feature activation+8.23e-01

Pos logit contributions
idd+1.697
rol+1.667
iw+1.536
und+1.509
arre+1.508
Neg logit contributions
The-1.002
 finish-0.954
 before-0.945
 ends-0.941
 grant-0.933
Tokentap
Token ID44335
Feature activation+6.59e-01

Pos logit contributions
idd+1.405
rol+1.391
und+1.254
iw+1.252
arre+1.233
Neg logit contributions
The-0.918
 finish-0.873
In-0.866
idi-0.855
 ends-0.855
Token on
Token ID319
Feature activation-8.24e-01

Pos logit contributions
idd+1.207
rol+1.193
iw+1.087
und+1.085
arre+1.068
Neg logit contributions
The-0.644
 finish-0.619
 ends-0.607
In-0.605
 before-0.605
Token the
Token ID262
Feature activation-2.78e+00

Pos logit contributions
In+0.419
ers+0.415
idi+0.408
 ends+0.395
 finish+0.385
Neg logit contributions
idd-1.784
rol-1.768
und-1.706
iw-1.672
 batch-1.622
Token leaf
Token ID12835
Feature activation+4.47e-01

Pos logit contributions
In+3.565
idi+3.305
The+3.202
 involve+3.132
 behold+3.120
Neg logit contributions
idd-4.502
 bur-4.167
rol-4.105
iw-4.046
 batch-3.880
Token.
Token ID13
Feature activation-2.34e-01

Pos logit contributions
idd+0.909
rol+0.904
und+0.831
iw+0.826
arre+0.814
Neg logit contributions
The-0.335
 finish-0.319
In-0.314
idi-0.314
 grant-0.312
Token It
Token ID632
Feature activation+2.03e-01

Pos logit contributions
The+0.242
 finish+0.238
 ends+0.235
idi+0.233
In+0.233
Neg logit contributions
rol-0.403
idd-0.402
und-0.368
iw-0.358
arre-0.357
Token was
Token ID373
Feature activation-4.05e-01

Pos logit contributions
idd+0.392
rol+0.389
und+0.357
iw+0.351
arre+0.349
Neg logit contributions
The-0.172
 finish-0.167
 grant-0.164
In-0.163
 before-0.162
Token a
Token ID257
Feature activation+2.09e-01

Pos logit contributions
The+0.371
 grant+0.364
In+0.360
 ends+0.359
 finish+0.357
Neg logit contributions
rol-0.751
idd-0.750
und-0.678
iw-0.669
arre-0.653
Token friends
Token ID2460
Feature activation+1.48e+00

Pos logit contributions
The+0.385
 grant+0.378
In+0.370
idi+0.369
 before+0.369
Neg logit contributions
idd-0.502
rol-0.473
iw-0.427
und-0.424
arre-0.420
Token found
Token ID1043
Feature activation+6.73e-01

Pos logit contributions
rol+2.958
idd+2.953
iw+2.776
und+2.673
rou+2.636
Neg logit contributions
 finish-1.250
The-1.236
In-1.207
 before-1.152
 ends-1.150
Token some
Token ID617
Feature activation-1.58e-01

Pos logit contributions
idd+1.431
rol+1.424
und+1.310
iw+1.309
arre+1.292
Neg logit contributions
The-0.461
 finish-0.435
In-0.424
 ends-0.424
idi-0.418
Token sticks
Token ID16461
Feature activation+2.42e-02

Pos logit contributions
The+0.174
 grant+0.169
 finish+0.168
idi+0.167
In+0.166
Neg logit contributions
idd-0.269
rol-0.263
iw-0.239
und-0.238
arre-0.231
Token on
Token ID319
Feature activation-9.16e-01

Pos logit contributions
idd+0.051
rol+0.051
und+0.047
iw+0.046
arre+0.045
Neg logit contributions
The-0.016
idi-0.016
 finish-0.016
In-0.015
 grant-0.015
Token the
Token ID262
Feature activation-2.78e+00

Pos logit contributions
 grant+0.439
ace+0.424
idi+0.421
isy+0.407
In+0.404
Neg logit contributions
idd-2.045
rol-1.994
und-1.964
iw-1.892
 batch-1.863
Token ground
Token ID2323
Feature activation+4.78e-01

Pos logit contributions
idi+2.974
 behold+2.829
In+2.725
 grant+2.577
The+2.494
Neg logit contributions
idd-4.699
rol-4.695
 bur-4.459
und-4.318
iw-4.224
Token.
Token ID13
Feature activation-4.23e-01

Pos logit contributions
idd+0.968
rol+0.961
und+0.883
iw+0.879
arre+0.865
Neg logit contributions
The-0.366
 finish-0.350
In-0.343
 grant-0.342
idi-0.342
Token
Token ID220
Feature activation+1.83e-01

Pos logit contributions
 grant+0.459
idi+0.444
The+0.442
ace+0.439
 ends+0.438
Neg logit contributions
idd-0.724
rol-0.706
und-0.651
arre-0.634
iw-0.634
Token
Token ID198
Feature activation-3.43e-02

Pos logit contributions
idd+0.364
rol+0.362
und+0.332
iw+0.328
arre+0.326
Neg logit contributions
The-0.140
 finish-0.140
 grant-0.137
idi-0.136
 ends-0.135
Token
Token ID198
Feature activation+1.92e-01

Pos logit contributions
 finish+0.028
 grant+0.028
 ends+0.027
The+0.027
idi+0.027
Neg logit contributions
idd-0.067
rol-0.066
und-0.061
iw-0.060
arre-0.059
Token her
Token ID607
Feature activation-1.25e+00

Pos logit contributions
The+0.210
In+0.205
 grant+0.199
 finish+0.199
 ends+0.196
Neg logit contributions
idd-0.467
rol-0.461
iw-0.428
und-0.425
 batch-0.407
Token hand
Token ID1021
Feature activation+1.61e-01

Pos logit contributions
idi+1.503
The+1.492
In+1.490
ers+1.386
 grant+1.316
Neg logit contributions
idd-2.021
rol-1.944
iw-1.840
und-1.783
 batch-1.764
Token and
Token ID290
Feature activation+4.91e-01

Pos logit contributions
idd+0.353
rol+0.351
und+0.327
iw+0.323
arre+0.319
Neg logit contributions
The-0.094
idi-0.089
 grant-0.089
 finish-0.087
In-0.087
Token fell
Token ID3214
Feature activation+5.19e-01

Pos logit contributions
idd+0.853
rol+0.845
und+0.765
iw+0.762
arre+0.747
Neg logit contributions
The-0.519
 finish-0.503
In-0.496
idi-0.494
 grant-0.494
Token on
Token ID319
Feature activation-7.64e-01

Pos logit contributions
idd+1.002
rol+0.994
und+0.908
iw+0.905
arre+0.890
Neg logit contributions
The-0.452
 finish-0.435
In-0.427
 grant-0.425
 ends-0.425
Token the
Token ID262
Feature activation-2.76e+00

Pos logit contributions
In+0.384
 ends+0.375
 grant+0.375
idi+0.374
 finish+0.363
Neg logit contributions
idd-1.675
rol-1.637
iw-1.580
und-1.575
 batch-1.531
Token floor
Token ID4314
Feature activation+1.11e+00

Pos logit contributions
In+3.372
idi+3.352
 behold+3.226
The+3.049
ace+2.860
Neg logit contributions
idd-4.231
rol-4.146
 batch-4.015
iw-3.968
asser-3.759
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
idd+2.364
rol+2.336
iw+2.158
und+2.146
arre+2.129
Neg logit contributions
The-0.774
 before-0.760
 finish-0.754
 ends-0.709
In-0.704
Token It
Token ID632
Feature activation+1.68e-01

Pos logit contributions
 finish+0.099
The+0.098
 grant+0.098
 ends+0.097
idi+0.097
Neg logit contributions
idd-0.220
rol-0.217
und-0.200
iw-0.200
arre-0.197
Token broke
Token ID6265
Feature activation-2.35e-01

Pos logit contributions
idd+0.305
rol+0.305
und+0.278
iw+0.274
arre+0.270
Neg logit contributions
The-0.158
 finish-0.152
idi-0.152
 grant-0.151
In-0.150
Token into
Token ID656
Feature activation+1.02e+00

Pos logit contributions
 grant+0.149
The+0.148
idi+0.148
 finish+0.146
In+0.145
Neg logit contributions
idd-0.505
rol-0.501
und-0.466
iw-0.459
arre-0.449
Token the
Token ID262
Feature activation-2.29e+00

Pos logit contributions
idd+0.704
rol+0.698
und+0.652
iw+0.649
arre+0.638
Neg logit contributions
The-0.129
 finish-0.122
In-0.119
 grant-0.119
idi-0.116
Token rope
Token ID17182
Feature activation+8.74e-01

Pos logit contributions
idi+2.857
In+2.809
 grant+2.698
ace+2.690
 behold+2.634
Neg logit contributions
idd-3.373
iw-3.053
rol-3.036
 batch-2.972
 poss-2.956
Token and
Token ID290
Feature activation+8.19e-02

Pos logit contributions
idd+1.970
rol+1.943
iw+1.804
und+1.795
arre+1.779
Neg logit contributions
The-0.488
 finish-0.470
 before-0.461
 ends-0.443
In-0.434
Token fell
Token ID3214
Feature activation+8.19e-01

Pos logit contributions
idd+0.148
rol+0.147
und+0.134
iw+0.133
arre+0.129
Neg logit contributions
The-0.079
In-0.076
 grant-0.076
 finish-0.076
idi-0.075
Token on
Token ID319
Feature activation-1.68e-01

Pos logit contributions
idd+1.626
rol+1.607
iw+1.477
und+1.473
arre+1.460
Neg logit contributions
The-0.674
 finish-0.649
 before-0.641
 ends-0.633
In-0.628
Token the
Token ID262
Feature activation-2.75e+00

Pos logit contributions
In+0.061
 grant+0.061
 finish+0.059
The+0.059
 ends+0.058
Neg logit contributions
idd-0.398
rol-0.391
und-0.371
iw-0.368
arre-0.357
Token stick
Token ID4859
Feature activation+1.43e+00

Pos logit contributions
In+2.921
idi+2.835
 behold+2.614
 grant+2.568
ace+2.522
Neg logit contributions
idd-4.573
rol-4.294
 bur-4.166
iw-4.161
 poss-4.156
Token.
Token ID13
Feature activation-3.35e-01

Pos logit contributions
idd+3.204
rol+3.126
iw+2.933
und+2.894
arre+2.886
Neg logit contributions
 before-0.924
 finish-0.903
The-0.898
 ends-0.828
 grant-0.776
Token He
Token ID679
Feature activation-7.20e-02

Pos logit contributions
 grant+0.255
 ends+0.249
 finish+0.247
idi+0.243
The+0.242
Neg logit contributions
idd-0.675
rol-0.661
und-0.626
iw-0.616
arre-0.601
Token y
Token ID331
Feature activation+1.16e+00

Pos logit contributions
The+0.079
 finish+0.078
In+0.077
 grant+0.077
idi+0.077
Neg logit contributions
idd-0.120
rol-0.117
und-0.107
iw-0.106
arre-0.103
Tokenel
Token ID417
Feature activation-4.75e-01

Pos logit contributions
idd+1.450
rol+1.430
iw+1.247
und+1.207
arre+1.190
Neg logit contributions
The-1.792
 finish-1.774
 ends-1.753
In-1.733
 before-1.708
TokenOkay
Token ID16454
Feature activation-7.59e-01

Pos logit contributions
idd+0.161
rol+0.159
iw+0.144
und+0.142
arre+0.137
Neg logit contributions
 finish-0.126
idi-0.125
The-0.124
 ends-0.124
 before-0.123
Token,
Token ID11
Feature activation-4.25e-01

Pos logit contributions
In+0.752
The+0.738
idi+0.731
 grant+0.719
 ends+0.696
Neg logit contributions
idd-1.340
rol-1.293
iw-1.196
und-1.193
arre-1.160
Token Tom
Token ID4186
Feature activation-5.36e-01

Pos logit contributions
The+0.381
In+0.374
idi+0.370
 grant+0.359
 ends+0.357
Neg logit contributions
idd-0.786
rol-0.772
iw-0.715
und-0.696
arre-0.687
Token.
Token ID13
Feature activation-2.21e-02

Pos logit contributions
The+0.527
 grant+0.518
In+0.512
idi+0.511
 finish+0.501
Neg logit contributions
idd-0.969
rol-0.930
iw-0.881
und-0.875
arre-0.835
Token That
Token ID1320
Feature activation+1.29e+00

Pos logit contributions
The+0.027
idi+0.027
 finish+0.026
In+0.026
 ends+0.026
Neg logit contributions
idd-0.034
rol-0.034
iw-0.030
und-0.030
arre-0.029
Token sounds
Token ID5238
Feature activation-2.75e+00

Pos logit contributions
idd+2.595
rol+2.588
iw+2.368
und+2.332
arre+2.278
Neg logit contributions
The-1.119
 finish-1.114
 ends-1.109
 before-1.047
In-1.044
Token fun
Token ID1257
Feature activation-1.25e-01

Pos logit contributions
idi+4.881
The+4.562
 grant+4.487
ace+4.431
ers+4.385
Neg logit contributions
 sorts-2.498
rol-2.489
 Good-2.365
idd-2.266
asser-2.125
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
The+0.118
idi+0.114
In+0.113
 grant+0.112
 finish+0.111
Neg logit contributions
idd-0.232
rol-0.227
und-0.208
iw-0.207
arre-0.202
Token Let
Token ID3914
Feature activation+7.91e-01

Pos logit contributions
The+0.277
idi+0.275
 finish+0.272
 ends+0.272
ers+0.271
Neg logit contributions
idd-0.359
rol-0.354
iw-0.318
und-0.315
arre-0.301
Token's
Token ID338
Feature activation-8.88e-01

Pos logit contributions
idd+1.476
rol+1.465
iw+1.333
und+1.323
arre+1.289
Neg logit contributions
The-0.773
 finish-0.746
 ends-0.732
In-0.721
idi-0.719
Token go
Token ID467
Feature activation-2.67e-01

Pos logit contributions
idi+0.801
The+0.786
ers+0.768
In+0.768
isy+0.743
Neg logit contributions
idd-1.636
rol-1.582
und-1.509
arre-1.452
iw-1.426
Token mom
Token ID1995
Feature activation+2.35e-01

Pos logit contributions
The+0.904
In+0.881
 grant+0.827
ers+0.818
 finish+0.814
Neg logit contributions
idd-1.842
rol-1.658
iw-1.623
rou-1.605
arre-1.594
Token said
Token ID531
Feature activation+8.57e-02

Pos logit contributions
idd+0.400
rol+0.394
und+0.358
iw+0.354
arre+0.351
Neg logit contributions
The-0.258
 finish-0.251
 grant-0.249
In-0.247
 ends-0.246
Token,
Token ID11
Feature activation-1.74e-02

Pos logit contributions
idd+0.172
rol+0.169
und+0.156
iw+0.156
arre+0.153
Neg logit contributions
The-0.067
 finish-0.066
In-0.065
 ends-0.064
idi-0.064
Token "
Token ID366
Feature activation+6.10e-01

Pos logit contributions
The+0.017
 finish+0.017
 ends+0.017
idi+0.016
In+0.016
Neg logit contributions
idd-0.032
rol-0.031
und-0.028
iw-0.028
arre-0.028
TokenThat
Token ID2504
Feature activation+1.87e-01

Pos logit contributions
idd+0.973
rol+0.964
und+0.862
iw+0.857
arre+0.842
Neg logit contributions
The-0.750
 finish-0.723
In-0.718
 ends-0.712
idi-0.711
Token sounds
Token ID5238
Feature activation-2.74e+00

Pos logit contributions
idd+0.317
rol+0.312
und+0.285
iw+0.279
arre+0.279
Neg logit contributions
The-0.205
 finish-0.198
In-0.197
 grant-0.196
idi-0.195
Token very
Token ID845
Feature activation+2.11e-01

Pos logit contributions
idi+3.854
The+3.657
 grant+3.431
ace+3.429
In+3.412
Neg logit contributions
rol-3.653
 Good-3.439
 sorts-3.393
idd-3.276
und-3.246
Token exciting
Token ID7895
Feature activation-1.25e-01

Pos logit contributions
idd+0.391
rol+0.385
und+0.353
iw+0.351
arre+0.346
Neg logit contributions
The-0.196
 finish-0.188
 grant-0.187
idi-0.186
 ends-0.186
Token.
Token ID13
Feature activation-9.80e-02

Pos logit contributions
The+0.112
In+0.107
 finish+0.106
idi+0.106
 grant+0.106
Neg logit contributions
idd-0.235
rol-0.232
und-0.215
iw-0.212
arre-0.207
Token But
Token ID887
Feature activation+4.47e-01

Pos logit contributions
The+0.114
 finish+0.112
 ends+0.111
idi+0.111
In+0.111
Neg logit contributions
idd-0.159
rol-0.156
iw-0.140
und-0.140
arre-0.134
Token why
Token ID1521
Feature activation-3.32e-02

Pos logit contributions
idd+0.796
rol+0.788
und+0.716
iw+0.715
arre+0.700
Neg logit contributions
The-0.450
 finish-0.434
In-0.428
idi-0.426
 ends-0.426
Token the
Token ID262
Feature activation-9.27e-01

Pos logit contributions
In+0.683
ers+0.615
The+0.613
isy+0.598
 ends+0.593
Neg logit contributions
idd-2.329
rol-2.268
iw-2.155
und-2.109
 batch-2.055
Token face
Token ID1986
Feature activation-2.02e-01

Pos logit contributions
In+0.623
ers+0.618
The+0.603
idi+0.582
 grant+0.570
Neg logit contributions
idd-1.963
rol-1.786
iw-1.765
und-1.739
arre-1.729
Token.
Token ID13
Feature activation-2.21e-01

Pos logit contributions
The+0.174
In+0.168
 finish+0.168
 ends+0.166
idi+0.166
Neg logit contributions
idd-0.390
rol-0.382
iw-0.347
und-0.347
arre-0.334
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
 ends+0.209
 finish+0.209
The+0.207
ers+0.200
In+0.198
Neg logit contributions
idd-0.404
rol-0.392
iw-0.362
und-0.362
arre-0.346
Token
Token ID198
Feature activation-1.41e+00

Pos logit contributions
 ends+1.115
 finish+1.089
 before+1.086
 grant+1.053
ace+1.048
Neg logit contributions
idd-2.002
rol-1.856
und-1.812
iw-1.727
arre-1.669
TokenLittle
Token ID22253
Feature activation-2.74e+00

Pos logit contributions
 ends+1.453
 before+1.408
ace+1.382
 grant+1.371
 finish+1.350
Neg logit contributions
idd-2.534
rol-2.307
und-2.273
iw-2.205
oc-2.195
Token boy
Token ID2933
Feature activation-7.65e-01

Pos logit contributions
ace+3.991
 grant+3.883
isy+3.640
In+3.604
oster+3.552
Neg logit contributions
idd-4.046
und-3.608
iw-3.597
joy-3.394
 Mechan-3.315
Token smiled
Token ID13541
Feature activation+3.67e-01

Pos logit contributions
The+0.806
 ends+0.802
 grant+0.800
In+0.795
 finish+0.788
Neg logit contributions
idd-1.341
rol-1.244
und-1.156
iw-1.138
arre-1.112
Token shy
Token ID15800
Feature activation-3.29e-01

Pos logit contributions
idd+0.793
rol+0.787
und+0.727
iw+0.724
arre+0.716
Neg logit contributions
The-0.230
 finish-0.219
 grant-0.215
In-0.215
 ends-0.214
Tokenly
Token ID306
Feature activation-5.40e-01

Pos logit contributions
The+0.263
In+0.254
 grant+0.246
idi+0.240
 finish+0.240
Neg logit contributions
idd-0.664
rol-0.656
und-0.608
arre-0.600
iw-0.598
Token and
Token ID290
Feature activation+2.51e-01

Pos logit contributions
The+0.474
In+0.463
 grant+0.459
idi+0.452
 finish+0.444
Neg logit contributions
idd-1.031
rol-0.997
und-0.935
arre-0.919
iw-0.913
Token.
Token ID13
Feature activation-2.69e-01

Pos logit contributions
idd+0.899
rol+0.893
und+0.821
iw+0.816
arre+0.804
Neg logit contributions
The-0.324
 finish-0.311
 grant-0.305
In-0.304
idi-0.302
Token It
Token ID632
Feature activation+3.50e-01

Pos logit contributions
 finish+0.244
The+0.240
 ends+0.239
idi+0.238
In+0.233
Neg logit contributions
idd-0.501
rol-0.493
und-0.457
iw-0.452
arre-0.439
Token was
Token ID373
Feature activation-3.51e-01

Pos logit contributions
idd+0.686
rol+0.681
und+0.623
iw+0.620
arre+0.608
Neg logit contributions
The-0.292
 finish-0.281
 grant-0.275
In-0.275
 ends-0.272
Token rolling
Token ID10708
Feature activation+1.46e-01

Pos logit contributions
The+0.378
In+0.362
 grant+0.361
 finish+0.354
idi+0.353
Neg logit contributions
idd-0.598
rol-0.598
und-0.530
iw-0.528
rou-0.516
Token down
Token ID866
Feature activation-1.13e-02

Pos logit contributions
idd+0.291
rol+0.290
und+0.267
iw+0.262
arre+0.258
Neg logit contributions
The-0.114
 grant-0.112
 finish-0.112
idi-0.111
In-0.110
Token the
Token ID262
Feature activation-2.73e+00

Pos logit contributions
 grant+0.004
The+0.004
idi+0.004
 finish+0.004
In+0.004
Neg logit contributions
idd-0.027
rol-0.027
und-0.025
iw-0.025
arre-0.024
Token hill
Token ID12788
Feature activation+5.41e-01

Pos logit contributions
idi+4.447
 grant+4.327
ace+4.292
In+4.211
 behold+4.130
Neg logit contributions
idd-3.385
rol-3.047
rob-2.917
iw-2.913
und-2.780
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
idd+1.106
rol+1.098
und+1.009
iw+1.006
arre+0.991
Neg logit contributions
The-0.405
 finish-0.387
In-0.378
 ends-0.376
 grant-0.376
Token It
Token ID632
Feature activation+3.79e-01

Pos logit contributions
 finish+0.383
 ends+0.377
idi+0.373
The+0.372
 grant+0.363
Neg logit contributions
idd-0.753
rol-0.742
und-0.688
iw-0.680
arre-0.657
Token was
Token ID373
Feature activation-5.94e-01

Pos logit contributions
idd+0.720
rol+0.714
und+0.651
iw+0.647
arre+0.636
Neg logit contributions
The-0.340
 finish-0.327
 grant-0.322
In-0.321
idi-0.318
Token going
Token ID1016
Feature activation-3.88e-01

Pos logit contributions
The+0.675
 grant+0.665
In+0.651
idi+0.639
 ends+0.637
Neg logit contributions
rol-0.974
idd-0.969
und-0.853
iw-0.845
rou-0.839
Token Sam
Token ID3409
Feature activation+4.21e-02

Pos logit contributions
The+0.107
idi+0.106
In+0.106
 finish+0.104
 ends+0.104
Neg logit contributions
idd-0.192
rol-0.185
iw-0.174
und-0.172
arre-0.165
Token says
Token ID1139
Feature activation-6.25e-01

Pos logit contributions
idd+0.053
rol+0.053
und+0.046
iw+0.046
arre+0.045
Neg logit contributions
The-0.063
In-0.062
 grant-0.062
idi-0.061
 finish-0.061
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
In+0.623
The+0.622
idi+0.610
 grant+0.582
 ends+0.570
Neg logit contributions
idd-1.121
rol-1.087
iw-0.995
und-0.969
arre-0.964
Token "
Token ID366
Feature activation-3.66e-01

Pos logit contributions
 grant+0.247
 finish+0.247
 ends+0.244
The+0.243
idi+0.242
Neg logit contributions
idd-0.448
rol-0.433
und-0.399
iw-0.397
arre-0.387
TokenThat
Token ID2504
Feature activation+8.01e-01

Pos logit contributions
idi+0.296
 grant+0.292
 finish+0.290
 ends+0.288
 before+0.278
Neg logit contributions
idd-0.710
rol-0.688
iw-0.643
und-0.638
arre-0.631
Token sounds
Token ID5238
Feature activation-2.73e+00

Pos logit contributions
idd+1.490
rol+1.480
iw+1.344
und+1.342
arre+1.311
Neg logit contributions
The-0.777
 finish-0.750
 ends-0.744
In-0.730
 grant-0.727
Token fun
Token ID1257
Feature activation-3.91e-02

Pos logit contributions
idi+4.929
oster+4.472
 grant+4.432
The+4.405
ers+4.405
Neg logit contributions
rol-2.485
 Good-2.377
 sorts-2.374
idd-2.219
und-2.164
Token."
Token ID526
Feature activation-7.67e-01

Pos logit contributions
The+0.038
idi+0.037
In+0.037
 grant+0.037
 finish+0.036
Neg logit contributions
idd-0.071
rol-0.070
und-0.064
iw-0.063
arre-0.062
Token
Token ID198
Feature activation-2.94e-01

Pos logit contributions
 grant+0.763
 ends+0.755
idi+0.754
 finish+0.743
ers+0.718
Neg logit contributions
idd-1.366
rol-1.318
iw-1.240
und-1.237
arre-1.190
Token
Token ID198
Feature activation-3.84e-01

Pos logit contributions
 finish+0.254
 grant+0.252
idi+0.250
 before+0.246
 ends+0.244
Neg logit contributions
idd-0.556
rol-0.543
und-0.507
iw-0.497
arre-0.487
TokenThey
Token ID2990
Feature activation-3.62e-01

Pos logit contributions
 grant+0.400
idi+0.394
 finish+0.388
 before+0.388
 ends+0.384
Neg logit contributions
idd-0.683
rol-0.652
und-0.612
iw-0.603
arre-0.589
Token,
Token ID11
Feature activation-7.48e-02

Pos logit contributions
The+0.008
In+0.008
 finish+0.008
idi+0.008
 ends+0.008
Neg logit contributions
idd-0.021
rol-0.021
iw-0.019
und-0.019
arre-0.019
Token "
Token ID366
Feature activation+1.98e-01

Pos logit contributions
The+0.071
 ends+0.070
 finish+0.070
In+0.069
idi+0.068
Neg logit contributions
idd-0.136
rol-0.132
iw-0.123
und-0.121
arre-0.119
TokenOK
Token ID11380
Feature activation-7.30e-01

Pos logit contributions
idd+0.312
rol+0.306
iw+0.278
und+0.275
arre+0.267
Neg logit contributions
 finish-0.238
The-0.236
 grant-0.235
 ends-0.234
idi-0.233
Token.
Token ID13
Feature activation-1.68e-01

Pos logit contributions
In+0.698
idi+0.691
The+0.691
 grant+0.677
 ends+0.657
Neg logit contributions
idd-1.296
rol-1.257
iw-1.165
und-1.157
arre-1.149
Token That
Token ID1320
Feature activation+1.37e+00

Pos logit contributions
The+0.195
idi+0.190
 finish+0.189
In+0.187
 grant+0.187
Neg logit contributions
idd-0.271
rol-0.266
iw-0.243
und-0.238
arre-0.228
Token sounds
Token ID5238
Feature activation-2.73e+00

Pos logit contributions
idd+2.690
rol+2.681
iw+2.436
und+2.404
arre+2.309
Neg logit contributions
The-1.300
 ends-1.258
 finish-1.257
In-1.205
 before-1.187
Token good
Token ID922
Feature activation+6.05e-01

Pos logit contributions
idi+4.532
 grant+4.240
ers+4.084
ace+4.067
oster+4.053
Neg logit contributions
 Good-2.847
rol-2.781
 sorts-2.766
arre-2.560
 Enjoy-2.516
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
idd+1.233
rol+1.225
und+1.124
iw+1.119
arre+1.104
Neg logit contributions
The-0.470
 finish-0.452
 ends-0.442
In-0.440
 grant-0.439
Token That
Token ID1320
Feature activation+1.40e+00

Pos logit contributions
The+0.255
idi+0.251
 finish+0.249
 ends+0.248
ers+0.247
Neg logit contributions
idd-0.333
rol-0.327
iw-0.296
und-0.292
arre-0.280
Token sounds
Token ID5238
Feature activation-2.61e+00

Pos logit contributions
idd+2.808
rol+2.794
iw+2.532
und+2.516
arre+2.420
Neg logit contributions
The-1.272
 ends-1.247
 finish-1.244
 before-1.186
In-1.172
Token comfortable
Token ID6792
Feature activation-1.59e-01

Pos logit contributions
idi+4.308
ace+3.938
 grant+3.896
oster+3.879
ers+3.860
Neg logit contributions
 Good-2.790
 sorts-2.778
rol-2.747
arre-2.547
idd-2.527
Token saw
Token ID2497
Feature activation+6.48e-01

Pos logit contributions
idd+0.717
rol+0.708
und+0.643
iw+0.640
arre+0.631
Neg logit contributions
The-0.416
 finish-0.401
In-0.396
 grant-0.393
 ends-0.393
Token a
Token ID257
Feature activation-2.60e-02

Pos logit contributions
idd+1.348
rol+1.340
und+1.230
iw+1.227
arre+1.212
Neg logit contributions
The-0.479
 finish-0.451
In-0.441
 ends-0.440
idi-0.439
Token tie
Token ID9839
Feature activation-3.43e-01

Pos logit contributions
The+0.020
In+0.020
 finish+0.019
ers+0.019
idi+0.019
Neg logit contributions
idd-0.052
rol-0.050
iw-0.047
und-0.046
arre-0.046
Token lying
Token ID9105
Feature activation+1.71e-01

Pos logit contributions
The+0.261
idi+0.259
In+0.249
 finish+0.244
 ends+0.242
Neg logit contributions
rol-0.681
idd-0.680
und-0.625
iw-0.614
arre-0.605
Token on
Token ID319
Feature activation-7.03e-01

Pos logit contributions
idd+0.358
rol+0.356
und+0.328
iw+0.327
arre+0.318
Neg logit contributions
The-0.120
 finish-0.114
 grant-0.112
In-0.111
idi-0.111
Token the
Token ID262
Feature activation-2.72e+00

Pos logit contributions
In+0.347
 grant+0.338
idi+0.329
 finish+0.327
The+0.317
Neg logit contributions
idd-1.522
rol-1.519
und-1.466
iw-1.441
 batch-1.422
Token ground
Token ID2323
Feature activation+4.31e-01

Pos logit contributions
idi+2.411
In+2.284
 behold+2.268
The+2.025
ace+1.994
Neg logit contributions
idd-5.038
rol-4.750
iw-4.584
 poss-4.493
 batch-4.458
Token.
Token ID13
Feature activation-5.80e-01

Pos logit contributions
idd+0.873
rol+0.867
und+0.798
iw+0.794
arre+0.781
Neg logit contributions
The-0.327
 finish-0.313
 grant-0.307
idi-0.307
In-0.307
Token She
Token ID1375
Feature activation+1.30e-02

Pos logit contributions
ers+0.706
 finish+0.704
 grant+0.701
 ends+0.690
idi+0.687
Neg logit contributions
idd-0.882
rol-0.871
und-0.804
iw-0.798
arre-0.754
Token picked
Token ID6497
Feature activation+3.08e-01

Pos logit contributions
idd+0.022
rol+0.022
und+0.020
iw+0.020
arre+0.019
Neg logit contributions
The-0.014
 finish-0.013
 before-0.013
 grant-0.013
 ends-0.013
Token it
Token ID340
Feature activation-7.33e-02

Pos logit contributions
idd+0.564
rol+0.560
und+0.508
iw+0.508
arre+0.497
Neg logit contributions
The-0.292
 finish-0.284
In-0.281
 grant-0.279
idi-0.279
Token sad
Token ID6507
Feature activation-1.54e-01

Pos logit contributions
The+0.502
In+0.498
 grant+0.497
 ends+0.492
idi+0.489
Neg logit contributions
idd-0.640
rol-0.634
iw-0.553
arre-0.551
und-0.548
Token and
Token ID290
Feature activation+2.30e-01

Pos logit contributions
The+0.128
idi+0.125
 grant+0.125
In+0.122
 ends+0.121
Neg logit contributions
idd-0.296
rol-0.293
und-0.275
iw-0.272
arre-0.270
Token screamed
Token ID25421
Feature activation+1.30e+00

Pos logit contributions
idd+0.369
rol+0.365
und+0.328
iw+0.327
arre+0.320
Neg logit contributions
The-0.271
In-0.261
 finish-0.260
 grant-0.259
 ends-0.257
Token,
Token ID11
Feature activation+1.84e-01

Pos logit contributions
idd+2.811
rol+2.772
iw+2.574
und+2.551
arre+2.484
Neg logit contributions
 before-0.944
The-0.922
 finish-0.910
 grant-0.845
 ends-0.831
Token "
Token ID366
Feature activation+8.15e-01

Pos logit contributions
idd+0.375
rol+0.374
und+0.345
iw+0.343
arre+0.339
Neg logit contributions
The-0.132
 finish-0.127
 ends-0.127
idi-0.125
In-0.125
TokenI
Token ID40
Feature activation+1.66e+00

Pos logit contributions
idd+1.534
rol+1.526
und+1.388
iw+1.380
arre+1.361
Neg logit contributions
The-0.784
 finish-0.734
In-0.732
 grant-0.719
 ends-0.718
Token want
Token ID765
Feature activation-2.01e-01

Pos logit contributions
rol+3.157
idd+3.076
und+2.768
iw+2.730
ays+2.700
Neg logit contributions
 finish-1.683
The-1.639
 grant-1.623
In-1.598
 ends-1.592
Token my
Token ID616
Feature activation+6.06e-03

Pos logit contributions
The+0.145
In+0.144
idi+0.142
 grant+0.139
 ends+0.138
Neg logit contributions
idd-0.405
rol-0.396
iw-0.376
und-0.368
 batch-0.360
Token apple
Token ID17180
Feature activation+8.62e-01

Pos logit contributions
idd+0.010
rol+0.010
iw+0.009
und+0.009
arre+0.008
Neg logit contributions
The-0.007
In-0.007
idi-0.007
 grant-0.007
 finish-0.007
Token!"
Token ID2474
Feature activation+5.18e-02

Pos logit contributions
idd+1.442
rol+1.425
und+1.282
iw+1.267
arre+1.256
Neg logit contributions
The-0.978
 finish-0.965
 before-0.946
 ends-0.941
 grant-0.927
Token
Token ID198
Feature activation-2.97e-01

Pos logit contributions
idd+0.100
rol+0.099
und+0.092
iw+0.092
arre+0.090
Neg logit contributions
The-0.042
 ends-0.042
 finish-0.042
 grant-0.041
idi-0.041
Token day
Token ID1110
Feature activation+8.69e-01

Pos logit contributions
rol+2.593
idd+2.592
iw+2.357
und+2.328
arre+2.308
Neg logit contributions
 finish-1.300
The-1.290
 before-1.246
idi-1.234
 ends-1.219
Token.
Token ID13
Feature activation-6.64e-02

Pos logit contributions
idd+1.813
rol+1.810
iw+1.669
und+1.664
arre+1.643
Neg logit contributions
The-0.624
 finish-0.606
 before-0.600
In-0.581
 ends-0.575
Token
Token ID198
Feature activation+9.27e-02

Pos logit contributions
 grant+0.057
The+0.056
 finish+0.055
idi+0.055
 ends+0.055
Neg logit contributions
idd-0.126
rol-0.125
und-0.115
iw-0.114
arre-0.113
Token
Token ID198
Feature activation+2.99e-01

Pos logit contributions
idd+0.182
rol+0.180
und+0.167
iw+0.165
arre+0.162
Neg logit contributions
 finish-0.072
 grant-0.072
The-0.071
 ends-0.070
 before-0.069
TokenAt
Token ID2953
Feature activation+1.70e-01

Pos logit contributions
idd+0.504
rol+0.500
und+0.453
iw+0.449
arre+0.443
Neg logit contributions
The-0.321
 grant-0.318
 finish-0.317
 ends-0.313
 before-0.311
Token the
Token ID262
Feature activation+1.58e+00

Pos logit contributions
idd+0.419
rol+0.415
iw+0.390
und+0.390
arre+0.380
Neg logit contributions
The-0.052
 grant-0.047
idi-0.045
In-0.045
 ends-0.045
Token mall
Token ID17374
Feature activation+3.98e-01

Pos logit contributions
idd+2.038
rol+2.009
iw+1.772
und+1.767
arre+1.715
Neg logit contributions
The-2.527
 finish-2.516
In-2.441
 ends-2.401
 before-2.377
Token,
Token ID11
Feature activation-5.61e-01

Pos logit contributions
idd+0.824
rol+0.816
und+0.751
iw+0.749
arre+0.737
Neg logit contributions
The-0.288
 finish-0.274
 grant-0.272
idi-0.270
In-0.269
Token Tom
Token ID4186
Feature activation-2.08e-01

Pos logit contributions
 grant+0.621
ace+0.602
 finish+0.596
idi+0.593
The+0.589
Neg logit contributions
idd-0.935
rol-0.915
und-0.838
iw-0.832
arre-0.795
Token saw
Token ID2497
Feature activation+7.25e-01

Pos logit contributions
 grant+0.229
The+0.229
 ends+0.221
 finish+0.221
In+0.217
Neg logit contributions
idd-0.348
rol-0.337
und-0.310
iw-0.303
arre-0.301
Token a
Token ID257
Feature activation+5.07e-01

Pos logit contributions
idd+1.452
rol+1.443
und+1.324
iw+1.321
arre+1.302
Neg logit contributions
The-0.595
 finish-0.564
 ends-0.551
In-0.551
 before-0.548
Token,
Token ID11
Feature activation-5.83e-01

Pos logit contributions
idd+0.118
rol+0.116
und+0.107
iw+0.107
arre+0.105
Neg logit contributions
The-0.046
 finish-0.045
idi-0.044
 grant-0.043
 ends-0.043
Token she
Token ID673
Feature activation+3.90e-01

Pos logit contributions
 finish+0.622
idi+0.621
ers+0.618
ace+0.616
 ends+0.613
Neg logit contributions
idd-0.951
rol-0.939
iw-0.868
und-0.850
 batch-0.819
Token went
Token ID1816
Feature activation+7.79e-02

Pos logit contributions
idd+0.714
rol+0.704
und+0.643
iw+0.639
arre+0.632
Neg logit contributions
The-0.375
 finish-0.362
 grant-0.357
In-0.356
 ends-0.355
Token to
Token ID284
Feature activation-4.79e-03

Pos logit contributions
idd+0.155
rol+0.154
iw+0.141
und+0.141
arre+0.139
Neg logit contributions
The-0.062
 ends-0.059
 grant-0.058
In-0.057
 finish-0.057
Token the
Token ID262
Feature activation+8.25e-01

Pos logit contributions
The+0.002
ers+0.002
In+0.002
 ends+0.002
idi+0.002
Neg logit contributions
idd-0.011
rol-0.011
und-0.010
iw-0.010
arre-0.010
Token park
Token ID3952
Feature activation+1.64e+00

Pos logit contributions
idd+1.340
rol+1.325
und+1.182
iw+1.178
arre+1.156
Neg logit contributions
The-0.999
 finish-0.972
In-0.950
idi-0.945
 ends-0.939
Token to
Token ID284
Feature activation-1.48e-02

Pos logit contributions
rol+3.819
idd+3.817
und+3.527
arre+3.514
iw+3.449
Neg logit contributions
The-0.914
 before-0.903
 finish-0.850
In-0.790
idi-0.732
Token play
Token ID711
Feature activation+8.76e-01

Pos logit contributions
The+0.015
 ends+0.014
In+0.014
ers+0.014
idi+0.014
Neg logit contributions
idd-0.026
rol-0.026
iw-0.024
und-0.023
arre-0.023
Token with
Token ID351
Feature activation+8.55e-01

Pos logit contributions
idd+1.849
rol+1.838
iw+1.684
und+1.682
arre+1.666
Neg logit contributions
The-0.640
 finish-0.590
In-0.582
 before-0.579
 grant-0.567
Token Spark
Token ID17732
Feature activation+9.33e-01

Pos logit contributions
idd+1.450
rol+1.431
und+1.290
iw+1.284
arre+1.275
Neg logit contributions
The-0.976
 finish-0.932
 ends-0.921
In-0.919
 before-0.919
Tokenles
Token ID829
Feature activation+7.02e-01

Pos logit contributions
idd+1.581
rol+1.571
iw+1.413
und+1.412
arre+1.372
Neg logit contributions
The-1.047
 ends-1.019
 finish-1.008
 before-1.005
In-0.994
Token�
Token ID129
Feature activation+3.82e-01

Pos logit contributions
rol+1.360
idd+1.357
iw+1.201
und+1.195
arre+1.171
Neg logit contributions
The-1.069
In-1.026
 finish-1.024
 grant-1.004
 ends-1.004
Token�
Token ID241
Feature activation-3.57e-03

Pos logit contributions
idd+0.657
rol+0.650
und+0.590
iw+0.584
arre+0.576
Neg logit contributions
The-0.406
 finish-0.393
 ends-0.389
In-0.389
 grant-0.388
TokenI
Token ID40
Feature activation+1.28e+00

Pos logit contributions
 finish+0.004
 ends+0.004
The+0.004
 grant+0.004
idi+0.004
Neg logit contributions
idd-0.006
rol-0.006
und-0.005
iw-0.005
arre-0.005
Token am
Token ID716
Feature activation+3.87e-01

Pos logit contributions
idd+2.282
rol+2.270
iw+2.040
und+2.035
arre+2.012
Neg logit contributions
The-1.324
 finish-1.305
 grant-1.279
In-1.279
idi-1.249
Token Ty
Token ID7039
Feature activation-5.73e-01

Pos logit contributions
idd+0.695
rol+0.687
und+0.624
iw+0.621
arre+0.609
Neg logit contributions
The-0.388
 finish-0.374
In-0.370
 ends-0.369
idi-0.367
Token the
Token ID262
Feature activation+1.59e+00

Pos logit contributions
The+0.706
In+0.706
 finish+0.704
 ends+0.700
idi+0.688
Neg logit contributions
idd-0.889
rol-0.855
und-0.777
iw-0.760
rou-0.739
Token Water
Token ID5638
Feature activation+1.33e-01

Pos logit contributions
idd+3.278
rol+3.276
und+3.027
arre+2.932
iw+2.918
Neg logit contributions
The-1.332
 finish-1.298
 grant-1.221
 before-1.202
In-1.185
Token Cat
Token ID5181
Feature activation-1.70e-01

Pos logit contributions
idd+0.235
rol+0.234
und+0.212
iw+0.209
arre+0.205
Neg logit contributions
The-0.137
 finish-0.131
In-0.131
idi-0.131
 grant-0.129
Token.
Token ID13
Feature activation+2.73e-01

Pos logit contributions
The+0.175
 finish+0.169
In+0.168
idi+0.166
 grant+0.163
Neg logit contributions
idd-0.299
rol-0.296
und-0.270
iw-0.265
arre-0.258
Token I
Token ID314
Feature activation+1.02e+00

Pos logit contributions
idd+0.496
rol+0.492
und+0.447
iw+0.445
arre+0.436
Neg logit contributions
The-0.263
 finish-0.257
 ends-0.254
In-0.251
idi-0.251
Token live
Token ID2107
Feature activation+3.31e-01

Pos logit contributions
idd+1.832
rol+1.818
iw+1.646
und+1.634
arre+1.609
Neg logit contributions
The-1.030
 finish-1.021
 grant-0.998
In-0.992
 ends-0.985
Token was
Token ID373
Feature activation-3.60e-01

Pos logit contributions
The+0.412
 grant+0.409
 finish+0.401
In+0.398
idi+0.388
Neg logit contributions
rol-0.897
idd-0.893
und-0.825
arre-0.793
iw-0.790
Token happy
Token ID3772
Feature activation-1.19e-01

Pos logit contributions
The+0.345
 grant+0.344
 ends+0.338
idi+0.332
In+0.332
Neg logit contributions
rol-0.650
idd-0.636
und-0.578
arre-0.562
iw-0.558
Token to
Token ID284
Feature activation-4.13e-01

Pos logit contributions
The+0.101
 grant+0.097
idi+0.096
In+0.096
 finish+0.095
Neg logit contributions
idd-0.230
rol-0.228
und-0.210
iw-0.209
arre-0.204
Token have
Token ID423
Feature activation-3.45e-01

Pos logit contributions
The+0.211
ers+0.210
idi+0.205
 ends+0.204
In+0.202
Neg logit contributions
rol-0.899
idd-0.896
und-0.847
iw-0.818
arre-0.811
Token a
Token ID257
Feature activation+8.25e-01

Pos logit contributions
The+0.363
 grant+0.361
 finish+0.355
 ends+0.348
In+0.346
Neg logit contributions
idd-0.585
rol-0.577
und-0.533
arre-0.514
iw-0.506
Token new
Token ID649
Feature activation+1.59e+00

Pos logit contributions
idd+1.631
rol+1.626
und+1.488
iw+1.466
arre+1.453
Neg logit contributions
The-0.693
 finish-0.679
In-0.650
idi-0.650
 before-0.648
Token home
Token ID1363
Feature activation-5.55e-01

Pos logit contributions
rol+3.221
idd+3.205
und+2.899
arre+2.868
iw+2.839
Neg logit contributions
 finish-1.356
 before-1.331
The-1.323
idi-1.294
 ends-1.278
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
The+0.523
idi+0.521
In+0.510
 grant+0.503
 ends+0.497
Neg logit contributions
rol-1.011
idd-1.010
und-0.914
iw-0.908
rou-0.882
Token It
Token ID632
Feature activation+2.82e-01

Pos logit contributions
The+0.147
 finish+0.144
idi+0.144
 grant+0.142
 ends+0.142
Neg logit contributions
idd-0.331
rol-0.330
und-0.305
iw-0.298
arre-0.296
Token loved
Token ID6151
Feature activation+4.70e-01

Pos logit contributions
idd+0.584
rol+0.582
und+0.536
iw+0.531
arre+0.524
Neg logit contributions
The-0.196
 finish-0.187
 grant-0.183
In-0.182
 ends-0.182
Token the
Token ID262
Feature activation+3.09e-01

Pos logit contributions
idd+0.953
rol+0.946
und+0.869
iw+0.866
arre+0.851
Neg logit contributions
The-0.362
 finish-0.347
In-0.339
 ends-0.338
 grant-0.338
Token three
Token ID1115
Feature activation+2.85e-02

Pos logit contributions
ers+0.158
The+0.155
 ends+0.155
 grant+0.154
In+0.151
Neg logit contributions
idd-0.389
rol-0.370
iw-0.353
und-0.341
arre-0.339
Token year
Token ID614
Feature activation-7.15e-02

Pos logit contributions
idd+0.043
rol+0.041
iw+0.037
und+0.037
arre+0.036
Neg logit contributions
The-0.037
 grant-0.036
 finish-0.036
In-0.036
 ends-0.035
Token old
Token ID1468
Feature activation-6.07e-01

Pos logit contributions
The+0.024
 grant+0.023
In+0.022
 finish+0.021
idi+0.021
Neg logit contributions
idd-0.175
rol-0.172
iw-0.162
und-0.161
arre-0.159
Token girl
Token ID2576
Feature activation-6.39e-01

Pos logit contributions
In+0.628
The+0.623
 grant+0.608
 finish+0.607
 ends+0.596
Neg logit contributions
idd-1.099
rol-1.029
iw-0.980
und-0.954
rou-0.920
Token who
Token ID508
Feature activation+3.01e-01

Pos logit contributions
The+0.646
 ends+0.641
 finish+0.639
 grant+0.636
idi+0.633
Neg logit contributions
idd-1.141
rol-1.089
und-1.010
iw-1.008
rou-0.960
Token loved
Token ID6151
Feature activation+4.37e-01

Pos logit contributions
idd+0.575
rol+0.569
und+0.522
iw+0.518
arre+0.511
Neg logit contributions
The-0.263
 finish-0.252
 grant-0.248
 ends-0.248
 before-0.247
Token to
Token ID284
Feature activation-5.77e-01

Pos logit contributions
idd+0.886
rol+0.879
und+0.808
iw+0.807
arre+0.792
Neg logit contributions
The-0.334
 finish-0.319
In-0.313
 grant-0.311
 ends-0.311
Token swim
Token ID9422
Feature activation-6.25e-03

Pos logit contributions
The+0.555
ers+0.554
In+0.545
idi+0.544
 ends+0.538
Neg logit contributions
idd-0.999
rol-0.979
und-0.925
iw-0.901
arre-0.890
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
The+0.004
 grant+0.004
idi+0.004
 finish+0.004
 ends+0.004
Neg logit contributions
rol-0.013
idd-0.013
und-0.012
iw-0.012
arre-0.012
Token Her
Token ID2332
Feature activation-6.01e-01

Pos logit contributions
ers+0.794
The+0.777
 grant+0.774
idi+0.772
 ends+0.763
Neg logit contributions
rol-0.834
idd-0.828
arre-0.759
iw-0.749
und-0.747
Token parents
Token ID3397
Feature activation+2.46e-02

Pos logit contributions
The+0.641
In+0.617
ers+0.609
 grant+0.607
idi+0.598
Neg logit contributions
idd-1.035
rol-1.013
und-0.914
arre-0.894
iw-0.888
Tokenchair
Token ID16337
Feature activation+6.16e-01

Pos logit contributions
idd+0.038
rol+0.038
und+0.035
iw+0.034
arre+0.034
Neg logit contributions
The-0.022
 grant-0.022
idi-0.022
In-0.022
 finish-0.022
Token,
Token ID11
Feature activation-6.36e-02

Pos logit contributions
idd+1.276
rol+1.266
und+1.163
iw+1.161
arre+1.145
Neg logit contributions
The-0.444
 finish-0.425
In-0.413
 ends-0.411
 before-0.411
Token and
Token ID290
Feature activation-3.15e-01

Pos logit contributions
The+0.058
 ends+0.057
 finish+0.057
 grant+0.057
In+0.056
Neg logit contributions
idd-0.116
rol-0.116
iw-0.106
und-0.105
arre-0.103
Token they
Token ID484
Feature activation+5.15e-01

Pos logit contributions
The+0.288
 grant+0.280
In+0.280
 finish+0.278
 ends+0.275
Neg logit contributions
idd-0.577
rol-0.572
iw-0.525
und-0.524
arre-0.506
Token read
Token ID1100
Feature activation+3.43e-01

Pos logit contributions
idd+0.944
rol+0.936
und+0.851
iw+0.848
arre+0.834
Neg logit contributions
The-0.495
 finish-0.478
In-0.469
 ends-0.468
 grant-0.468
Token the
Token ID262
Feature activation+7.80e-01

Pos logit contributions
idd+0.648
rol+0.644
und+0.589
iw+0.585
arre+0.574
Neg logit contributions
The-0.306
 finish-0.297
 grant-0.292
In-0.292
idi-0.290
Token book
Token ID1492
Feature activation+5.73e-01

Pos logit contributions
idd+1.592
rol+1.575
und+1.449
iw+1.441
arre+1.418
Neg logit contributions
The-0.616
 finish-0.587
 ends-0.578
In-0.573
 grant-0.570
Token together
Token ID1978
Feature activation-2.57e-02

Pos logit contributions
idd+0.758
rol+0.748
und+0.653
iw+0.651
arre+0.635
Neg logit contributions
The-0.843
 finish-0.824
In-0.814
 ends-0.813
 grant-0.811
Token.
Token ID13
Feature activation-5.14e-01

Pos logit contributions
The+0.022
idi+0.021
In+0.021
 finish+0.021
 grant+0.021
Neg logit contributions
rol-0.049
idd-0.049
und-0.045
iw-0.044
arre-0.044
Token They
Token ID1119
Feature activation+1.72e-01

Pos logit contributions
The+0.491
 ends+0.487
 grant+0.486
ers+0.477
 finish+0.474
Neg logit contributions
rol-0.919
idd-0.915
und-0.834
arre-0.827
iw-0.820
Token laughed
Token ID13818
Feature activation+2.62e-02

Pos logit contributions
idd+0.310
rol+0.308
und+0.281
iw+0.280
arre+0.273
Neg logit contributions
The-0.165
 finish-0.159
 grant-0.157
 ends-0.157
idi-0.157
Token wings
Token ID12098
Feature activation+1.31e-02

Pos logit contributions
The+0.862
idi+0.854
 before+0.824
In+0.816
 grant+0.810
Neg logit contributions
rol-1.543
idd-1.522
und-1.414
iw-1.379
arre-1.377
Token.
Token ID13
Feature activation-1.59e-01

Pos logit contributions
rol+0.026
idd+0.025
und+0.023
iw+0.023
arre+0.023
Neg logit contributions
The-0.010
idi-0.010
In-0.010
 grant-0.010
 ends-0.010
Token Then
Token ID3244
Feature activation-2.14e-01

Pos logit contributions
The+0.132
 ends+0.127
 finish+0.126
 grant+0.125
In+0.124
Neg logit contributions
idd-0.308
rol-0.305
und-0.281
arre-0.276
iw-0.276
Token it
Token ID340
Feature activation-1.36e-01

Pos logit contributions
The+0.173
In+0.168
 grant+0.166
 ends+0.164
 finish+0.164
Neg logit contributions
idd-0.419
rol-0.409
und-0.384
iw-0.377
arre-0.368
Token looked
Token ID3114
Feature activation+9.53e-02

Pos logit contributions
The+0.138
 finish+0.134
In+0.133
 grant+0.133
idi+0.131
Neg logit contributions
idd-0.236
rol-0.234
und-0.212
arre-0.209
iw-0.208
Token down
Token ID866
Feature activation+3.21e-01

Pos logit contributions
rol+0.158
idd+0.156
und+0.141
iw+0.139
arre+0.138
Neg logit contributions
The-0.107
 grant-0.104
In-0.103
 finish-0.103
 ends-0.102
Token at
Token ID379
Feature activation+4.31e-01

Pos logit contributions
idd+0.567
rol+0.563
und+0.512
iw+0.509
arre+0.498
Neg logit contributions
The-0.325
 finish-0.316
 grant-0.313
idi-0.312
 ends-0.312
Token them
Token ID606
Feature activation+8.70e-02

Pos logit contributions
idd+1.047
rol+1.038
und+0.970
iw+0.967
arre+0.953
Neg logit contributions
The-0.159
 finish-0.147
In-0.141
 grant-0.140
idi-0.138
Token and
Token ID290
Feature activation+5.62e-01

Pos logit contributions
idd+0.175
rol+0.175
und+0.161
iw+0.159
arre+0.158
Neg logit contributions
The-0.065
idi-0.063
In-0.063
 finish-0.061
 grant-0.061
Token said
Token ID531
Feature activation+4.54e-01

Pos logit contributions
idd+0.987
rol+0.979
und+0.886
iw+0.883
arre+0.867
Neg logit contributions
The-0.586
 finish-0.567
In-0.557
 ends-0.556
 grant-0.555
Token,
Token ID11
Feature activation+3.64e-01

Pos logit contributions
idd+0.922
rol+0.915
und+0.841
iw+0.836
arre+0.825
Neg logit contributions
The-0.345
 finish-0.331
In-0.324
 ends-0.322
idi-0.321
Token long
Token ID890
Feature activation-1.01e+00

Pos logit contributions
idd+0.572
rol+0.565
und+0.518
iw+0.518
arre+0.505
Neg logit contributions
The-0.272
 finish-0.263
In-0.260
 grant-0.260
 ends-0.260
Token as
Token ID355
Feature activation+1.43e-01

Pos logit contributions
 grant+0.674
The+0.661
In+0.642
 ends+0.622
 finish+0.606
Neg logit contributions
idd-2.116
rol-2.076
und-1.974
iw-1.943
 batch-1.902
Token he
Token ID339
Feature activation+2.51e-01

Pos logit contributions
idd+0.265
rol+0.261
iw+0.240
und+0.238
arre+0.231
Neg logit contributions
The-0.131
 grant-0.126
 finish-0.126
In-0.126
 ends-0.124
Token stayed
Token ID9658
Feature activation-1.58e-01

Pos logit contributions
idd+0.472
rol+0.464
und+0.426
iw+0.425
arre+0.417
Neg logit contributions
The-0.230
 finish-0.220
 grant-0.218
In-0.217
 ends-0.216
Token in
Token ID287
Feature activation+5.42e-01

Pos logit contributions
The+0.168
 grant+0.164
In+0.163
 ends+0.162
 finish+0.159
Neg logit contributions
idd-0.270
rol-0.265
und-0.242
arre-0.242
iw-0.238
Token the
Token ID262
Feature activation+3.99e-01

Pos logit contributions
idd+1.293
rol+1.284
und+1.195
iw+1.192
arre+1.177
Neg logit contributions
The-0.227
 finish-0.209
In-0.200
 ends-0.198
 grant-0.198
Token kitchen
Token ID9592
Feature activation+8.55e-01

Pos logit contributions
idd+0.649
rol+0.642
und+0.576
iw+0.576
arre+0.567
Neg logit contributions
The-0.467
 finish-0.451
 grant-0.450
In-0.449
 ends-0.447
Token.
Token ID13
Feature activation+6.93e-01

Pos logit contributions
idd+1.807
rol+1.790
iw+1.643
und+1.641
arre+1.612
Neg logit contributions
The-0.589
 finish-0.576
 before-0.560
 ends-0.545
In-0.541
Token Tom
Token ID4186
Feature activation-2.30e-01

Pos logit contributions
idd+1.376
rol+1.367
und+1.247
iw+1.246
arre+1.224
Neg logit contributions
The-0.569
 finish-0.546
In-0.535
idi-0.530
 ends-0.528
Token was
Token ID373
Feature activation-5.47e-01

Pos logit contributions
The+0.208
 grant+0.207
In+0.201
 finish+0.200
 ends+0.197
Neg logit contributions
idd-0.425
rol-0.414
arre-0.384
und-0.384
iw-0.379
Token so
Token ID523
Feature activation+3.22e-01

Pos logit contributions
 grant+0.583
The+0.578
 ends+0.574
In+0.574
idi+0.553
Neg logit contributions
idd-0.931
rol-0.922
arre-0.820
und-0.806
rou-0.800
Token snow
Token ID6729
Feature activation-7.36e-02

Pos logit contributions
idd+0.291
rol+0.289
und+0.259
iw+0.259
arre+0.249
Neg logit contributions
The-0.211
 finish-0.206
 grant-0.206
idi-0.206
In-0.204
Token and
Token ID290
Feature activation-3.48e-01

Pos logit contributions
The+0.059
idi+0.056
 grant+0.056
 finish+0.056
In+0.056
Neg logit contributions
idd-0.148
rol-0.146
und-0.136
iw-0.133
arre-0.130
Token decided
Token ID3066
Feature activation-7.04e-01

Pos logit contributions
 finish+0.406
 grant+0.404
The+0.403
 ends+0.397
In+0.396
Neg logit contributions
idd-0.546
rol-0.536
iw-0.485
und-0.481
arre-0.460
Token to
Token ID284
Feature activation+7.62e-02

Pos logit contributions
 ends+0.623
ace+0.615
 finish+0.605
ers+0.603
 grant+0.589
Neg logit contributions
idd-1.296
rol-1.264
und-1.186
iw-1.182
rou-1.141
Token jump
Token ID4391
Feature activation-6.73e-02

Pos logit contributions
idd+0.149
rol+0.148
und+0.136
iw+0.134
arre+0.132
Neg logit contributions
The-0.062
idi-0.060
In-0.059
 ends-0.058
 before-0.058
Token on
Token ID319
Feature activation+7.86e-01

Pos logit contributions
The+0.047
idi+0.046
 grant+0.045
In+0.045
 finish+0.045
Neg logit contributions
rol-0.140
idd-0.139
und-0.128
iw-0.125
arre-0.124
Token it
Token ID340
Feature activation+1.02e+00

Pos logit contributions
idd+1.601
rol+1.590
iw+1.448
und+1.448
arre+1.444
Neg logit contributions
The-0.622
 finish-0.586
 before-0.574
 ends-0.569
In-0.567
Token.
Token ID13
Feature activation+1.71e-01

Pos logit contributions
idd+2.192
rol+2.145
iw+1.991
und+1.973
arre+1.962
Neg logit contributions
The-0.708
 finish-0.690
 before-0.684
 ends-0.654
In-0.635
Token When
Token ID1649
Feature activation+1.86e-01

Pos logit contributions
idd+0.327
rol+0.324
und+0.299
iw+0.297
arre+0.295
Neg logit contributions
The-0.145
 grant-0.142
 finish-0.141
 ends-0.141
idi-0.138
Token he
Token ID339
Feature activation-1.13e-01

Pos logit contributions
idd+0.346
rol+0.340
iw+0.314
und+0.314
arre+0.307
Neg logit contributions
The-0.167
 ends-0.163
 grant-0.162
 finish-0.162
idi-0.160
Token landed
Token ID11406
Feature activation-5.53e-01

Pos logit contributions
The+0.114
 grant+0.110
ers+0.109
 finish+0.109
In+0.108
Neg logit contributions
idd-0.201
rol-0.196
und-0.181
iw-0.180
arre-0.175
Token She
Token ID1375
Feature activation-1.46e-01

Pos logit contributions
 finish+0.328
 grant+0.321
The+0.320
idi+0.319
 before+0.316
Neg logit contributions
idd-0.487
rol-0.480
und-0.440
iw-0.435
arre-0.406
Token said
Token ID531
Feature activation+2.74e-01

Pos logit contributions
The+0.169
 finish+0.167
 grant+0.166
 ends+0.164
 before+0.164
Neg logit contributions
idd-0.236
rol-0.227
iw-0.207
und-0.204
arre-0.202
Token,
Token ID11
Feature activation-2.52e-01

Pos logit contributions
idd+0.559
rol+0.554
und+0.510
iw+0.510
arre+0.499
Neg logit contributions
The-0.205
 finish-0.197
In-0.194
idi-0.191
 ends-0.191
Token "
Token ID366
Feature activation+1.40e-01

Pos logit contributions
 finish+0.250
 ends+0.249
The+0.249
idi+0.244
In+0.240
Neg logit contributions
idd-0.443
rol-0.432
und-0.395
iw-0.393
arre-0.387
TokenTom
Token ID13787
Feature activation-4.81e-01

Pos logit contributions
idd+0.219
rol+0.217
iw+0.196
und+0.195
arre+0.186
Neg logit contributions
 finish-0.169
The-0.168
 before-0.166
 ends-0.166
 grant-0.166
Token,
Token ID11
Feature activation-9.35e-02

Pos logit contributions
The+0.435
 grant+0.419
In+0.416
 finish+0.410
idi+0.405
Neg logit contributions
idd-0.925
rol-0.882
und-0.840
iw-0.831
arre-0.813
Token you
Token ID345
Feature activation+6.73e-01

Pos logit contributions
The+0.100
idi+0.097
 finish+0.096
In+0.096
 ends+0.094
Neg logit contributions
idd-0.158
rol-0.158
iw-0.142
und-0.142
arre-0.136
Token are
Token ID389
Feature activation-7.75e-01

Pos logit contributions
idd+1.179
rol+1.170
und+1.056
iw+1.056
arre+1.029
Neg logit contributions
The-0.715
 finish-0.698
In-0.683
 grant-0.683
 ends-0.682
Token thirsty
Token ID47124
Feature activation+6.95e-01

Pos logit contributions
The+0.849
idi+0.832
In+0.822
 ends+0.803
 grant+0.801
Neg logit contributions
idd-1.330
rol-1.294
iw-1.182
und-1.133
arre-1.128
Token.
Token ID13
Feature activation+4.90e-01

Pos logit contributions
idd+1.419
rol+1.411
und+1.293
iw+1.288
arre+1.269
Neg logit contributions
The-0.526
 finish-0.509
 before-0.493
 ends-0.491
In-0.487
Token Here
Token ID3423
Feature activation+2.35e-01

Pos logit contributions
idd+0.702
rol+0.693
und+0.613
iw+0.611
arre+0.595
Neg logit contributions
The-0.669
 finish-0.653
In-0.645
 ends-0.644
idi-0.643
Token money
Token ID1637
Feature activation-1.41e-01

Pos logit contributions
The+0.068
idi+0.066
In+0.065
ers+0.064
 ends+0.063
Neg logit contributions
idd-0.152
rol-0.151
und-0.140
iw-0.139
arre-0.138
Token,
Token ID11
Feature activation+1.94e-01

Pos logit contributions
The+0.121
idi+0.118
In+0.115
 finish+0.113
 grant+0.113
Neg logit contributions
rol-0.267
idd-0.267
und-0.246
iw-0.242
arre-0.239
Token but
Token ID475
Feature activation+7.36e-01

Pos logit contributions
idd+0.366
rol+0.365
und+0.334
iw+0.333
arre+0.327
Neg logit contributions
The-0.169
 finish-0.166
idi-0.164
 ends-0.163
In-0.162
Token from
Token ID422
Feature activation-5.59e-01

Pos logit contributions
idd+1.508
rol+1.498
und+1.373
iw+1.364
arre+1.348
Neg logit contributions
The-0.561
 finish-0.538
 grant-0.524
 before-0.523
In-0.522
Token being
Token ID852
Feature activation+1.54e-02

Pos logit contributions
The+0.448
In+0.443
 ends+0.435
 finish+0.430
ers+0.422
Neg logit contributions
idd-1.089
rol-1.062
und-0.995
iw-0.994
rou-0.966
Token grateful
Token ID14066
Feature activation-9.02e-01

Pos logit contributions
idd+0.028
rol+0.027
und+0.025
iw+0.025
arre+0.024
Neg logit contributions
The-0.015
 finish-0.015
 grant-0.015
 ends-0.015
In-0.015
Token for
Token ID329
Feature activation+2.35e-01

Pos logit contributions
The+0.929
idi+0.908
In+0.873
 finish+0.868
 grant+0.862
Neg logit contributions
rol-1.547
idd-1.541
und-1.454
iw-1.403
rou-1.395
Token what
Token ID644
Feature activation-7.71e-01

Pos logit contributions
idd+0.481
rol+0.476
und+0.441
iw+0.438
arre+0.430
Neg logit contributions
The-0.169
 finish-0.164
In-0.161
 ends-0.160
idi-0.160
Token you
Token ID345
Feature activation+9.57e-04

Pos logit contributions
 finish+0.872
The+0.869
idi+0.853
In+0.837
 grant+0.825
Neg logit contributions
idd-1.242
rol-1.175
iw-1.102
und-1.096
arre-1.081
Token have
Token ID423
Feature activation-3.48e-03

Pos logit contributions
idd+0.002
rol+0.002
und+0.002
iw+0.002
arre+0.002
Neg logit contributions
The-0.001
idi-0.001
In-0.001
 finish-0.001
 ends-0.001
Token.
Token ID13
Feature activation+5.51e-01

Pos logit contributions
The+0.003
idi+0.003
 finish+0.003
 ends+0.003
In+0.003
Neg logit contributions
idd-0.007
rol-0.007
und-0.006
iw-0.006
arre-0.006
Token."
Token ID526
Feature activation+5.22e-01

Pos logit contributions
idd+0.436
rol+0.429
und+0.399
iw+0.392
arre+0.389
Neg logit contributions
The-0.173
In-0.163
idi-0.162
 grant-0.162
 finish-0.161
Token
Token ID198
Feature activation-9.94e-02

Pos logit contributions
idd+1.044
rol+1.035
und+0.950
iw+0.947
arre+0.932
Neg logit contributions
The-0.417
 finish-0.401
In-0.392
 ends-0.391
 grant-0.390
Token
Token ID198
Feature activation+1.54e-01

Pos logit contributions
 finish+0.081
 grant+0.079
idi+0.079
 ends+0.079
 before+0.078
Neg logit contributions
idd-0.193
rol-0.189
und-0.176
iw-0.174
arre-0.170
TokenThey
Token ID2990
Feature activation-2.84e-01

Pos logit contributions
idd+0.294
rol+0.289
und+0.268
iw+0.266
arre+0.262
Neg logit contributions
 finish-0.133
 grant-0.131
 ends-0.131
 before-0.130
The-0.129
Token hang
Token ID8181
Feature activation-1.73e-01

Pos logit contributions
The+0.262
idi+0.250
 before+0.248
In+0.247
ers+0.244
Neg logit contributions
rol-0.516
idd-0.514
und-0.472
iw-0.465
arre-0.458
Token up
Token ID510
Feature activation-5.27e-01

Pos logit contributions
The+0.176
 grant+0.175
idi+0.174
In+0.174
 finish+0.173
Neg logit contributions
rol-0.306
idd-0.302
und-0.273
iw-0.271
arre-0.265
Token the
Token ID262
Feature activation-3.06e-01

Pos logit contributions
 grant+0.405
idi+0.400
The+0.395
In+0.390
 finish+0.380
Neg logit contributions
idd-1.050
rol-1.035
iw-0.974
und-0.968
arre-0.933
Token phone
Token ID3072
Feature activation+9.66e-02

Pos logit contributions
The+0.352
idi+0.344
In+0.342
 finish+0.329
 before+0.327
Neg logit contributions
idd-0.510
rol-0.490
iw-0.451
und-0.442
arre-0.436
Token and
Token ID290
Feature activation-3.69e-01

Pos logit contributions
idd+0.197
rol+0.197
und+0.182
iw+0.179
arre+0.178
Neg logit contributions
The-0.070
idi-0.066
 grant-0.066
In-0.066
 finish-0.064
Token smile
Token ID8212
Feature activation+6.66e-01

Pos logit contributions
The+0.338
In+0.327
ers+0.322
idi+0.322
 grant+0.309
Neg logit contributions
idd-0.667
rol-0.662
iw-0.614
und-0.610
arre-0.588
Token.
Token ID13
Feature activation-4.02e-01

Pos logit contributions
idd+1.343
rol+1.334
iw+1.220
und+1.220
arre+1.197
Neg logit contributions
The-0.527
 finish-0.507
 ends-0.494
 before-0.492
In-0.491
Token see
Token ID766
Feature activation+5.06e-01

Pos logit contributions
ers+0.544
idi+0.540
The+0.528
 ends+0.527
In+0.524
Neg logit contributions
idd-1.840
rol-1.802
und-1.747
arre-1.663
iw-1.654
Token him
Token ID683
Feature activation-3.12e-01

Pos logit contributions
idd+1.121
rol+1.113
und+1.030
iw+1.026
arre+1.012
Neg logit contributions
The-0.294
 finish-0.278
In-0.270
 grant-0.268
 ends-0.268
Token again
Token ID757
Feature activation+1.17e-01

Pos logit contributions
idi+0.293
The+0.293
In+0.289
 ends+0.279
ers+0.275
Neg logit contributions
rol-0.560
idd-0.555
und-0.515
arre-0.503
iw-0.501
Token!
Token ID0
Feature activation+3.20e-01

Pos logit contributions
idd+0.226
rol+0.222
und+0.208
iw+0.205
arre+0.202
Neg logit contributions
The-0.096
idi-0.093
In-0.093
 ends-0.093
 finish-0.092
Token It
Token ID632
Feature activation+2.35e-01

Pos logit contributions
idd+0.635
rol+0.630
und+0.579
iw+0.574
arre+0.568
Neg logit contributions
The-0.258
 finish-0.247
 grant-0.244
 ends-0.244
In-0.243
Token said
Token ID531
Feature activation-4.53e-01

Pos logit contributions
idd+0.458
rol+0.456
und+0.419
iw+0.415
arre+0.410
Neg logit contributions
The-0.190
 finish-0.184
 grant-0.180
 ends-0.180
In-0.180
Token it
Token ID340
Feature activation+1.06e-01

Pos logit contributions
In+0.450
 ends+0.449
The+0.448
 finish+0.445
idi+0.438
Neg logit contributions
idd-0.784
rol-0.782
und-0.718
arre-0.696
iw-0.696
Token was
Token ID373
Feature activation-3.95e-01

Pos logit contributions
idd+0.206
rol+0.205
und+0.189
iw+0.185
arre+0.185
Neg logit contributions
The-0.087
 finish-0.084
In-0.082
 grant-0.081
idi-0.081
Token sorry
Token ID7926
Feature activation-4.70e-01

Pos logit contributions
The+0.454
 ends+0.441
In+0.439
 finish+0.439
 grant+0.439
Neg logit contributions
idd-0.627
rol-0.627
und-0.562
iw-0.551
arre-0.548
Token for
Token ID329
Feature activation+3.04e-01

Pos logit contributions
The+0.508
 grant+0.493
idi+0.493
In+0.488
 finish+0.484
Neg logit contributions
rol-0.779
idd-0.766
und-0.713
arre-0.697
iw-0.683
Token not
Token ID407
Feature activation+3.14e-01

Pos logit contributions
idd+0.479
rol+0.474
und+0.427
iw+0.422
arre+0.414
Neg logit contributions
The-0.366
 finish-0.355
In-0.353
 ends-0.352
 grant-0.350
Token from
Token ID422
Feature activation-3.67e-01

Pos logit contributions
idd+0.225
rol+0.225
und+0.206
iw+0.202
arre+0.201
Neg logit contributions
The-0.103
In-0.100
 finish-0.099
idi-0.099
 grant-0.099
Token ear
Token ID1027
Feature activation+3.33e-01

Pos logit contributions
The+0.371
In+0.365
 grant+0.351
 ends+0.349
ers+0.346
Neg logit contributions
idd-0.652
rol-0.637
und-0.583
iw-0.581
rou-0.557
Token to
Token ID284
Feature activation+1.92e-01

Pos logit contributions
idd+0.677
rol+0.672
und+0.617
iw+0.614
arre+0.603
Neg logit contributions
The-0.252
 finish-0.241
In-0.237
idi-0.236
 grant-0.235
Token ear
Token ID1027
Feature activation+4.40e-01

Pos logit contributions
idd+0.364
rol+0.361
und+0.332
iw+0.328
arre+0.322
Neg logit contributions
The-0.170
In-0.163
 finish-0.161
 ends-0.160
idi-0.158
Token.
Token ID13
Feature activation+1.94e-01

Pos logit contributions
idd+0.892
rol+0.885
und+0.813
iw+0.809
arre+0.796
Neg logit contributions
The-0.334
 finish-0.320
In-0.314
idi-0.312
 grant-0.312
Token<|endoftext|>
Token ID50256
Feature activation-3.18e-01

Pos logit contributions
idd+0.390
rol+0.387
und+0.356
iw+0.353
arre+0.347
Neg logit contributions
The-0.150
 finish-0.146
 grant-0.143
 ends-0.143
idi-0.141
TokenOnce
Token ID7454
Feature activation+1.75e-01

Pos logit contributions
 finish+0.443
 grant+0.437
The+0.436
idi+0.432
 before+0.430
Neg logit contributions
idd-0.444
rol-0.433
und-0.397
iw-0.374
arre-0.363
Token upon
Token ID2402
Feature activation-2.89e-01

Pos logit contributions
idd+0.245
rol+0.242
und+0.215
iw+0.211
arre+0.207
Neg logit contributions
The-0.239
 finish-0.237
 grant-0.234
 ends-0.232
idi-0.232
Token a
Token ID257
Feature activation+3.58e-01

Pos logit contributions
 finish+0.230
idi+0.226
 ends+0.226
 grant+0.222
The+0.220
Neg logit contributions
idd-0.548
rol-0.544
und-0.502
iw-0.492
arre-0.486
Token time
Token ID640
Feature activation+9.75e-01

Pos logit contributions
idd+0.909
rol+0.901
und+0.847
iw+0.843
arre+0.837
Neg logit contributions
The-0.086
 finish-0.076
 grant-0.073
 ends-0.071
 before-0.068
Token,
Token ID11
Feature activation-5.95e-01

Pos logit contributions
idd+2.130
rol+2.124
iw+1.955
und+1.943
arre+1.932
Neg logit contributions
The-0.611
 before-0.562
In-0.560
 finish-0.553
idi-0.540
Token She
Token ID1375
Feature activation-1.64e+00

Pos logit contributions
ers+1.219
idi+1.218
 finish+1.185
 ends+1.184
The+1.183
Neg logit contributions
rol-1.321
idd-1.306
iw-1.220
und-1.203
 sorts-1.128
Token was
Token ID373
Feature activation-4.06e-01

Pos logit contributions
 before+1.966
 finish+1.965
 ends+1.886
idi+1.881
 grant+1.878
Neg logit contributions
idd-2.538
rol-2.264
und-2.220
iw-2.146
 Mechan-2.069
Token called
Token ID1444
Feature activation-1.58e-01

Pos logit contributions
The+0.412
 ends+0.410
idi+0.407
ers+0.404
 grant+0.397
Neg logit contributions
idd-0.715
rol-0.702
iw-0.638
und-0.616
rou-0.609
Token Daisy
Token ID40355
Feature activation-7.62e-01

Pos logit contributions
The+0.161
 ends+0.156
In+0.156
 finish+0.155
idi+0.155
Neg logit contributions
idd-0.274
rol-0.272
iw-0.247
und-0.245
arre-0.237
Token.
Token ID13
Feature activation-5.39e-01

Pos logit contributions
idi+0.723
 ends+0.703
 finish+0.703
 grant+0.703
The+0.688
Neg logit contributions
idd-1.356
rol-1.329
iw-1.238
und-1.219
rou-1.194
Token Daisy
Token ID40355
Feature activation-1.48e+00

Pos logit contributions
ers+0.663
 ends+0.656
 grant+0.656
 finish+0.654
The+0.653
Neg logit contributions
rol-0.797
idd-0.793
und-0.732
iw-0.721
arre-0.710
Token was
Token ID373
Feature activation-4.07e-01

Pos logit contributions
 grant+1.680
 finish+1.623
 before+1.597
 ends+1.596
The+1.574
Neg logit contributions
idd-2.434
rol-2.218
und-2.178
iw-2.140
arre-2.026
Token very
Token ID845
Feature activation+4.25e-01

Pos logit contributions
 ends+0.397
The+0.395
idi+0.386
 grant+0.385
ers+0.385
Neg logit contributions
idd-0.733
rol-0.717
iw-0.657
und-0.634
rou-0.625
Token brave
Token ID14802
Feature activation+2.15e-02

Pos logit contributions
idd+0.768
rol+0.758
iw+0.689
und+0.687
arre+0.674
Neg logit contributions
The-0.423
 finish-0.408
 ends-0.403
In-0.402
 grant-0.402
Token.
Token ID13
Feature activation-3.60e-01

Pos logit contributions
idd+0.045
rol+0.044
und+0.041
iw+0.041
arre+0.040
Neg logit contributions
The-0.015
 grant-0.015
In-0.014
 finish-0.014
idi-0.014
Token\n
Token ID198
Feature activation-2.55e-01

Pos logit contributions
ers+0.421
 grant+0.421
The+0.420
 finish+0.420
 ends+0.416
Neg logit contributions
rol-0.553
idd-0.552
und-0.503
iw-0.500
arre-0.494
Token brightly
Token ID35254
Feature activation-8.21e-01

Pos logit contributions
 grant+0.196
The+0.196
 finish+0.194
idi+0.193
ers+0.193
Neg logit contributions
idd-0.339
rol-0.337
und-0.308
iw-0.304
rou-0.296
Token that
Token ID326
Feature activation+5.05e-01

Pos logit contributions
The+0.684
In+0.676
 ends+0.667
 finish+0.661
idi+0.657
Neg logit contributions
rol-1.575
idd-1.561
und-1.500
iw-1.422
rou-1.375
Token it
Token ID340
Feature activation+2.28e-02

Pos logit contributions
idd+1.023
rol+1.016
und+0.933
iw+0.929
arre+0.916
Neg logit contributions
The-0.386
 finish-0.370
In-0.362
 grant-0.360
 ends-0.360
Token lit
Token ID6578
Feature activation-2.48e-01

Pos logit contributions
idd+0.044
rol+0.044
und+0.041
iw+0.040
arre+0.040
Neg logit contributions
The-0.019
 finish-0.018
In-0.018
 grant-0.017
idi-0.017
Token up
Token ID510
Feature activation-7.76e-01

Pos logit contributions
The+0.279
In+0.279
 finish+0.276
 grant+0.274
idi+0.270
Neg logit contributions
rol-0.400
idd-0.394
und-0.365
iw-0.348
arre-0.346
Token the
Token ID262
Feature activation-1.83e+00

Pos logit contributions
 grant+0.732
idi+0.715
 ends+0.711
In+0.707
 finish+0.706
Neg logit contributions
rol-1.384
idd-1.370
und-1.287
iw-1.238
rou-1.222
Token sky
Token ID6766
Feature activation-6.72e-01

Pos logit contributions
idi+1.744
ers+1.635
In+1.625
ace+1.589
 grant+1.560
Neg logit contributions
idd-3.416
arre-3.110
rol-3.099
rou-3.062
iw-2.986
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
idi+0.623
The+0.621
In+0.617
 grant+0.602
 ends+0.581
Neg logit contributions
rol-1.222
idd-1.198
und-1.119
arre-1.076
iw-1.071
Token When
Token ID1649
Feature activation-9.00e-02

Pos logit contributions
The+0.178
 finish+0.176
 ends+0.176
 grant+0.174
idi+0.172
Neg logit contributions
idd-0.412
rol-0.407
und-0.376
iw-0.368
arre-0.367
Token the
Token ID262
Feature activation-6.43e-01

Pos logit contributions
The+0.067
 grant+0.064
In+0.064
idi+0.063
 ends+0.063
Neg logit contributions
idd-0.181
rol-0.178
und-0.166
iw-0.164
arre-0.161
Token light
Token ID1657
Feature activation-1.54e+00

Pos logit contributions
In+0.677
The+0.673
idi+0.672
 grant+0.652
ers+0.643
Neg logit contributions
idd-1.117
rol-1.056
arre-0.980
iw-0.961
 batch-0.957
Token that
Token ID326
Feature activation+8.07e-01

Pos logit contributions
idd+0.633
rol+0.628
und+0.576
iw+0.575
arre+0.565
Neg logit contributions
The-0.260
 finish-0.248
In-0.246
idi-0.244
 grant-0.244
Token her
Token ID607
Feature activation-9.19e-01

Pos logit contributions
idd+1.464
rol+1.446
und+1.315
iw+1.306
arre+1.286
Neg logit contributions
The-0.826
 finish-0.806
 before-0.794
 ends-0.786
In-0.784
Token mom
Token ID1995
Feature activation+1.82e-01

Pos logit contributions
The+1.017
In+0.977
idi+0.910
 grant+0.904
ers+0.885
Neg logit contributions
idd-1.618
rol-1.492
iw-1.428
rou-1.411
arre-1.372
Token had
Token ID550
Feature activation-1.43e-01

Pos logit contributions
idd+0.368
rol+0.363
und+0.334
iw+0.333
arre+0.328
Neg logit contributions
The-0.140
 finish-0.133
idi-0.132
In-0.132
 grant-0.132
Token encouraged
Token ID10085
Feature activation-6.61e-01

Pos logit contributions
The+0.142
 ends+0.137
idi+0.136
 grant+0.136
In+0.135
Neg logit contributions
idd-0.252
rol-0.247
und-0.226
arre-0.224
iw-0.223
Token her
Token ID607
Feature activation-1.60e+00

Pos logit contributions
The+0.900
 grant+0.894
In+0.887
idi+0.882
 finish+0.881
Neg logit contributions
rol-0.890
idd-0.869
und-0.801
arre-0.783
iw-0.778
Token.
Token ID13
Feature activation+7.59e-01

Pos logit contributions
idi+1.597
The+1.565
In+1.561
oster+1.501
ers+1.471
Neg logit contributions
idd-2.735
rol-2.733
rou-2.585
arre-2.558
und-2.519
Token She
Token ID1375
Feature activation+1.62e-01

Pos logit contributions
idd+1.393
rol+1.381
und+1.252
iw+1.250
arre+1.226
Neg logit contributions
The-0.743
 finish-0.711
In-0.704
idi-0.696
 before-0.695
Token knew
Token ID2993
Feature activation-1.28e-01

Pos logit contributions
idd+0.303
rol+0.298
und+0.273
iw+0.271
arre+0.268
Neg logit contributions
The-0.148
 finish-0.141
In-0.140
 grant-0.140
 ends-0.140
Token that
Token ID326
Feature activation+1.44e+00

Pos logit contributions
The+0.110
 finish+0.109
 grant+0.108
 ends+0.108
In+0.106
Neg logit contributions
idd-0.243
rol-0.240
und-0.221
iw-0.220
arre-0.214
Token she
Token ID673
Feature activation-5.73e-01

Pos logit contributions
idd+2.860
rol+2.842
und+2.581
iw+2.539
arre+2.505
Neg logit contributions
 before-1.326
The-1.306
 finish-1.248
 ends-1.216
In-1.202
Token saw
Token ID2497
Feature activation+5.26e-01

Pos logit contributions
idd+0.083
rol+0.081
und+0.074
iw+0.073
arre+0.072
Neg logit contributions
The-0.042
 finish-0.041
 grant-0.040
In-0.040
 before-0.040
Token a
Token ID257
Feature activation-1.45e-01

Pos logit contributions
idd+1.111
rol+1.103
und+1.016
iw+1.013
arre+0.998
Neg logit contributions
The-0.361
 finish-0.344
In-0.335
 ends-0.334
 grant-0.333
Token yellow
Token ID7872
Feature activation+2.43e-01

Pos logit contributions
The+0.124
In+0.120
 finish+0.119
 grant+0.118
 before+0.118
Neg logit contributions
idd-0.276
rol-0.268
iw-0.252
und-0.247
arre-0.245
Token cob
Token ID22843
Feature activation-1.15e-01

Pos logit contributions
idd+0.427
rol+0.422
iw+0.386
und+0.385
arre+0.374
Neg logit contributions
The-0.248
 finish-0.240
In-0.239
idi-0.236
 grant-0.234
Token on
Token ID319
Feature activation-4.41e-01

Pos logit contributions
The+0.129
 finish+0.128
In+0.125
 ends+0.124
 grant+0.123
Neg logit contributions
idd-0.197
rol-0.192
iw-0.176
und-0.175
arre-0.165
Token the
Token ID262
Feature activation-1.59e+00

Pos logit contributions
In+0.197
 grant+0.194
idi+0.192
 finish+0.191
The+0.187
Neg logit contributions
idd-0.981
rol-0.980
und-0.933
iw-0.916
 batch-0.891
Token ground
Token ID2323
Feature activation+1.03e-01

Pos logit contributions
In+1.438
idi+1.414
The+1.326
 grant+1.312
 behold+1.247
Neg logit contributions
rol-2.886
idd-2.838
iw-2.713
 batch-2.611
und-2.568
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
idd+0.203
rol+0.202
und+0.187
iw+0.185
arre+0.181
Neg logit contributions
The-0.081
idi-0.078
 grant-0.078
 finish-0.077
In-0.077
Token He
Token ID679
Feature activation-4.05e-01

Pos logit contributions
 finish+0.274
 grant+0.268
 ends+0.266
The+0.264
idi+0.263
Neg logit contributions
idd-0.643
rol-0.628
iw-0.589
und-0.587
arre-0.554
Token picked
Token ID6497
Feature activation+4.95e-01

Pos logit contributions
 finish+0.431
 grant+0.422
The+0.416
 before+0.414
 ends+0.412
Neg logit contributions
idd-0.695
rol-0.659
iw-0.615
und-0.607
arre-0.594
Token it
Token ID340
Feature activation+5.27e-01

Pos logit contributions
idd+0.922
rol+0.915
und+0.833
iw+0.830
arre+0.815
Neg logit contributions
The-0.461
 finish-0.445
In-0.438
 grant-0.436
 ends-0.435
Token could
Token ID714
Feature activation-8.29e-01

Pos logit contributions
idd+0.227
rol+0.225
iw+0.208
und+0.206
arre+0.201
Neg logit contributions
The-0.082
 finish-0.079
 grant-0.078
In-0.077
 ends-0.077
Token fly
Token ID6129
Feature activation-1.93e-01

Pos logit contributions
idi+0.560
ers+0.553
 ends+0.541
In+0.532
The+0.526
Neg logit contributions
rol-1.673
idd-1.658
und-1.549
iw-1.532
rou-1.494
Token up
Token ID510
Feature activation-2.93e-01

Pos logit contributions
The+0.175
idi+0.174
In+0.172
 grant+0.171
 finish+0.169
Neg logit contributions
rol-0.359
idd-0.353
und-0.329
iw-0.321
arre-0.312
Token high
Token ID1029
Feature activation-1.30e+00

Pos logit contributions
The+0.231
idi+0.227
 grant+0.226
In+0.224
 finish+0.220
Neg logit contributions
rol-0.574
idd-0.573
und-0.538
iw-0.525
arre-0.509
Token in
Token ID287
Feature activation-7.87e-02

Pos logit contributions
In+1.178
The+1.176
 grant+1.158
idi+1.118
 finish+1.051
Neg logit contributions
idd-2.456
rol-2.421
und-2.377
iw-2.279
ays-2.191
Token the
Token ID262
Feature activation-1.79e+00

Pos logit contributions
 grant+0.024
The+0.023
idi+0.023
In+0.023
 finish+0.023
Neg logit contributions
idd-0.192
rol-0.190
und-0.181
iw-0.178
arre-0.173
Token sky
Token ID6766
Feature activation-2.36e-01

Pos logit contributions
 grant+1.525
isy+1.495
idi+1.491
 before+1.425
ace+1.357
Neg logit contributions
idd-3.620
rol-3.379
und-3.249
arre-3.205
iw-3.178
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
The+0.208
idi+0.207
In+0.206
 grant+0.203
 ends+0.200
Neg logit contributions
rol-0.440
idd-0.438
und-0.404
iw-0.397
arre-0.387
Token Everyone
Token ID11075
Feature activation-7.63e-01

Pos logit contributions
 grant+0.113
The+0.111
 finish+0.111
 ends+0.111
idi+0.107
Neg logit contributions
idd-0.270
rol-0.268
und-0.249
iw-0.246
arre-0.240
Token was
Token ID373
Feature activation-4.46e-01

Pos logit contributions
 finish+0.707
 grant+0.704
The+0.684
idi+0.680
In+0.677
Neg logit contributions
idd-1.445
rol-1.377
und-1.294
iw-1.246
arre-1.236
Token amazed
Token ID24562
Feature activation-6.92e-01

Pos logit contributions
 ends+0.471
 grant+0.470
idi+0.466
In+0.464
The+0.463
Neg logit contributions
idd-0.785
rol-0.770
und-0.680
iw-0.677
arre-0.674