TOP ACTIVATIONS
MAX = 34.374

for treasure . They found some coins and jewels in a
bowls . They also had some flour and water and salt
and doors . They found some rocks and sticks and put
Mom gave them some money . They ran to
soup . Can we have some ?" Mom said , "
truck to Tom and dumps some sand next to him .
on , let 's pick some herbs and take them to
open the closet and put some things inside . We are
me . Let 's have some pie together ."
the bucket and p ours some water on the new plant
not careful . They spill some cereal on the table and
with Lily . He took some of her blocks and put
on , let 's have some water and cookies ." Tom
me . You can carry some of the small boxes and
my t eddy bear and some flour !" Lily said ,
sp oons . We need some flags . We can use
." They eat some apples . They are sweet
talk . He teaches Sara some signs . S
and hungry . They eat some snacks and drink some juice
the shovel to scoop up some sand and put it in

MIN ACTIVATIONS
MIN = -74.741

make it whole again . Amen ." Then ,
family and our friends . Amen ." Anna said
gentle . M oral : Being respectful and gentle
house and our family . Amen ." Mom smiled
H ola , me ll amo Emma ." L
end . M oral value : The story teaches
kind . Thank you . Amen ." Then she
us safe and happy . Amen ." " A
up ." M oral of the story : Be
Thank you , God . Amen ." He opened
them . So he hopped over to the
it ." So Benny went over to the
happen . So the turtles decided that whenever
playing . So , Tim my had a
ride . So she took off down the
safe to get too close .'' P oppy understood
yard ! So , she dragged her wheel
forever . So one day they decided to
away . M oral of the story : When
like me ." So , Billy started to practice

INTERVAL 7.095 - 34.374
CONTAINS 36.965%

pear . It was hard and not very tasty .
why her dad said to stay away from it ! From
Once there was a sad little boy . He was very
and cry . They drop the cookies and juice . They
diamond ? Give it back to me , now !"

INTERVAL -20.184 - 7.095
CONTAINS 51.137%

not want to make the lady mad . But they also
run to their mom and tell her what happened . Mom
slide and looks down . He feels nervous , but he
share it with Sam when he comes over ."
Inside the gate , Spot saw many other dogs

INTERVAL -47.462 - -20.184
CONTAINS 11.771%

fly and jump around . So the butterfly
can have a bite . But only a small one .
and go on adventures . One day , Tim my and
Tom , what is wrong ? Why are you not playing
, Can I play with the

INTERVAL -74.741 - -47.462
CONTAINS 0.127%

my back ." So , the little mouse climbed
fishing rod . Together , they spent the whole
' and ' thank you '. One day , Sam asked
Feed the Ele ph ants '. Jim looked and felt a
the woman said , Let me
Token for
Token ID329
Feature activation+1.59e+01

Pos logit contributions
 the+0.188
 a+0.179
 her+0.141
 my+0.131
 your+0.127
Neg logit contributions
oke-0.414
arry-0.405
jo-0.383
oc-0.378
uddy-0.377
Token treasure
Token ID14068
Feature activation+9.88e+00

Pos logit contributions
 up+0.293
 the+0.253
 on+0.244
 in+0.241
 out+0.241
Neg logit contributions
lor-0.506
arry-0.477
oc-0.473
 unison-0.469
obby-0.455
Token.
Token ID13
Feature activation-1.24e+01

Pos logit contributions
 the+0.239
 a+0.217
 all+0.184
 up+0.175
 out+0.174
Neg logit contributions
lor-0.392
arry-0.354
uddy-0.352
ides-0.339
jo-0.336
Token They
Token ID1119
Feature activation-1.30e+01

Pos logit contributions
lor+0.345
arry+0.286
och+0.278
ides+0.269
etsy+0.268
Neg logit contributions
 the-0.506
 a-0.483
 all-0.438
 in-0.421
 out-0.419
Token found
Token ID1043
Feature activation+1.51e+01

Pos logit contributions
lor+0.405
arry+0.378
uddy+0.377
jo+0.371
etsy+0.369
Neg logit contributions
 the-0.339
 a-0.314
 up-0.304
 out-0.284
 more-0.274
Token some
Token ID617
Feature activation+3.44e+01

Pos logit contributions
 the+0.124
 her+0.095
 up+0.088
 all+0.088
 in+0.083
Neg logit contributions
lor-0.625
uddy-0.578
apa-0.553
oke-0.552
oc-0.543
Token coins
Token ID10796
Feature activation+9.64e+00

Pos logit contributions
 the+0.741
 a+0.617
 up+0.616
 all+0.605
 her+0.598
Neg logit contributions
lor-1.120
oc-1.099
uddy-1.059
och-1.059
arry-1.057
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
 the+0.185
 a+0.168
 all+0.139
 her+0.131
 up+0.130
Neg logit contributions
lor-0.384
och-0.356
arry-0.353
uddy-0.348
oke-0.342
Token jewels
Token ID42701
Feature activation-4.89e+00

Pos logit contributions
 the+0.331
 up+0.323
 all+0.290
 out+0.280
 her+0.257
Neg logit contributions
lor-0.525
oc-0.501
anas-0.479
uddy-0.478
ides-0.470
Token in
Token ID287
Feature activation+1.92e+01

Pos logit contributions
lor+0.191
och+0.172
arry+0.172
uddy+0.171
oke+0.169
Neg logit contributions
 the-0.100
 a-0.092
 all-0.075
 up-0.074
 her-0.071
Token a
Token ID257
Feature activation+2.92e+01

Pos logit contributions
 the+0.080
 up+0.073
 out+0.055
 a+0.044
 all+0.044
Neg logit contributions
lor-0.992
arry-0.919
-0.910
uddy-0.906
obby-0.901
Token bowls
Token ID34796
Feature activation-5.26e+00

Pos logit contributions
 up+0.463
 the+0.454
 out+0.422
 a+0.420
 all+0.405
Neg logit contributions
anas-0.389
oc-0.383
Really-0.358
obby-0.357
aron-0.354
Token.
Token ID13
Feature activation-1.21e+01

Pos logit contributions
lor+0.167
arry+0.158
ides+0.157
uddy+0.157
oke+0.155
Neg logit contributions
 the-0.131
 a-0.121
 up-0.102
 all-0.102
 more-0.099
Token They
Token ID1119
Feature activation-1.26e+01

Pos logit contributions
lor+0.518
ides+0.435
reetings+0.424
istent+0.423
jo+0.420
Neg logit contributions
 the-0.422
 a-0.383
 their-0.323
 all-0.319
 some-0.311
Token also
Token ID635
Feature activation+5.14e+00

Pos logit contributions
lor+0.395
uddy+0.380
arry+0.379
jo+0.379
etsy+0.375
Neg logit contributions
 the-0.327
 a-0.296
 up-0.283
 out-0.270
 all-0.254
Token had
Token ID550
Feature activation+2.21e+00

Pos logit contributions
 the+0.137
 up+0.122
 a+0.120
 out+0.117
 all+0.113
Neg logit contributions
lor-0.151
jo-0.148
oc-0.148
oke-0.147
oral-0.146
Token some
Token ID617
Feature activation+3.44e+01

Pos logit contributions
 up+0.025
 the+0.023
 out+0.020
 down+0.016
 in+0.016
Neg logit contributions
lor-0.092
anas-0.084
oral-0.083
Finally-0.082
obby-0.081
Token flour
Token ID10601
Feature activation-4.23e+00

Pos logit contributions
 the+0.827
 a+0.702
 all+0.649
 up+0.644
 out+0.610
Neg logit contributions
lor-1.121
arry-1.119
oral-1.119
-1.096
uddy-1.093
Token and
Token ID290
Feature activation+1.69e+01

Pos logit contributions
lor+0.157
+0.154
uddy+0.149
och+0.147
arry+0.146
Neg logit contributions
 the-0.082
 a-0.077
 all-0.063
 more-0.061
 his-0.059
Token water
Token ID1660
Feature activation+2.82e+01

Pos logit contributions
 the+0.424
 up+0.410
 out+0.396
 all+0.387
 a+0.374
Neg logit contributions
lor-0.468
mmmm-0.448
mmm-0.448
-0.448
etsy-0.435
Token and
Token ID290
Feature activation+1.70e+01

Pos logit contributions
 the+0.507
 a+0.480
 more+0.396
 all+0.389
 up+0.359
Neg logit contributions
lor-1.117
-1.067
uddy-1.054
oke-1.042
arry-1.038
Token salt
Token ID8268
Feature activation+5.14e+00

Pos logit contributions
 up+0.408
 the+0.402
 out+0.380
 all+0.362
 back+0.354
Neg logit contributions
lor-0.439
mmmm-0.434
oc-0.433
mmm-0.432
-0.431
Token and
Token ID290
Feature activation+1.68e+01

Pos logit contributions
 the+0.124
 a+0.112
 all+0.094
 her+0.084
 his+0.083
Neg logit contributions
lor-0.235
arry-0.227
ides-0.218
umbs-0.216
jo-0.216
Token doors
Token ID8215
Feature activation+6.11e+00

Pos logit contributions
 up+0.350
 the+0.333
 out+0.326
 all+0.320
 a+0.302
Neg logit contributions
Really-0.508
oral-0.465
oc-0.442
lor-0.437
ider-0.432
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
 the+0.152
 a+0.140
 all+0.125
 up+0.119
 his+0.117
Neg logit contributions
lor-0.187
arry-0.176
ides-0.174
och-0.170
-0.167
Token They
Token ID1119
Feature activation-1.28e+01

Pos logit contributions
lor+0.447
istent+0.380
ides+0.377
reetings+0.376
och+0.367
Neg logit contributions
 the-0.452
 a-0.427
 their-0.355
 all-0.347
 some-0.342
Token found
Token ID1043
Feature activation+1.54e+01

Pos logit contributions
lor+0.379
arry+0.357
oke+0.354
jo+0.353
ides+0.349
Neg logit contributions
 the-0.323
 up-0.298
 a-0.293
 out-0.281
 back-0.249
Token some
Token ID617
Feature activation+3.43e+01

Pos logit contributions
 the+0.121
 my+0.102
 your+0.095
 away+0.095
 in+0.094
Neg logit contributions
lor-0.544
oral-0.521
apa-0.512
oke-0.510
umbs-0.509
Token rocks
Token ID12586
Feature activation+1.55e+01

Pos logit contributions
 the+0.737
 a+0.641
 up+0.638
 all+0.609
 out+0.585
Neg logit contributions
oral-1.069
umm-1.027
arry-1.019
oke-1.017
lor-1.013
Token and
Token ID290
Feature activation+1.68e+01

Pos logit contributions
 the+0.306
 a+0.294
 his+0.236
 an+0.234
 her+0.233
Neg logit contributions
lor-0.521
och-0.503
oke-0.502
arry-0.494
-0.486
Token sticks
Token ID16461
Feature activation+1.86e+01

Pos logit contributions
 up+0.360
 the+0.321
 out+0.317
 all+0.307
 back+0.287
Neg logit contributions
pill-0.466
Really-0.432
lor-0.424
oc-0.421
oral-0.420
Token and
Token ID290
Feature activation+1.69e+01

Pos logit contributions
 the+0.369
 a+0.351
 all+0.296
 up+0.276
 her+0.271
Neg logit contributions
lor-0.629
oke-0.572
uddy-0.572
arry-0.571
ides-0.570
Token put
Token ID1234
Feature activation+1.46e+01

Pos logit contributions
 up+0.348
 the+0.347
 all+0.318
 out+0.304
 a+0.280
Neg logit contributions
pill-0.480
lor-0.476
oc-0.472
Really-0.460
jo-0.456
Token
Token ID198
Feature activation-2.18e+01

Pos logit contributions
lor+0.546
ides+0.457
reetings+0.449
istent+0.436
ider+0.434
Neg logit contributions
 the-0.434
 a-0.373
 all-0.326
 their-0.323
 her-0.310
Token
Token ID198
Feature activation-2.18e+01

Pos logit contributions
umm+0.927
lor+0.915
ides+0.843
istent+0.837
ider+0.826
Neg logit contributions
 the-0.719
 a-0.643
 all-0.529
 big-0.514
 in-0.510
TokenMom
Token ID29252
Feature activation-3.00e+01

Pos logit contributions
lor+0.664
arry+0.540
ides+0.537
uddy+0.533
umm+0.530
Neg logit contributions
 the-0.859
 a-0.804
 all-0.729
 in-0.707
 more-0.691
Token gave
Token ID2921
Feature activation+9.68e-01

Pos logit contributions
lor+1.161
arry+1.062
uddy+1.035
etsy+1.016
+1.008
Neg logit contributions
 the-0.797
 a-0.729
 all-0.654
 out-0.606
 in-0.602
Token them
Token ID606
Feature activation+1.66e+01

Pos logit contributions
 the+0.016
 shiny+0.015
 a+0.015
 in+0.015
 strong+0.014
Neg logit contributions
etsy-0.029
obby-0.028
uddy-0.028
enny-0.027
lor-0.026
Token some
Token ID617
Feature activation+3.43e+01

Pos logit contributions
 the+0.189
 a+0.134
 out+0.127
 up+0.119
 in+0.112
Neg logit contributions
lor-0.717
arry-0.700
etsy-0.699
uddy-0.695
obby-0.690
Token money
Token ID1637
Feature activation+1.53e+01

Pos logit contributions
 the+0.661
 a+0.558
 all+0.498
 out+0.474
 up+0.459
Neg logit contributions
lor-1.377
arry-1.323
oral-1.309
-1.274
etsy-1.270
Token.
Token ID13
Feature activation-1.20e+01

Pos logit contributions
 the+0.311
 a+0.265
 all+0.236
 out+0.217
 up+0.212
Neg logit contributions
lor-0.627
arry-0.591
uddy-0.582
och-0.576
etsy-0.575
Token They
Token ID1119
Feature activation-1.25e+01

Pos logit contributions
lor+0.430
arry+0.380
uddy+0.369
etsy+0.362
jo+0.362
Neg logit contributions
 the-0.378
 a-0.348
 all-0.316
 out-0.294
 in-0.293
Token ran
Token ID4966
Feature activation-5.28e-01

Pos logit contributions
lor+0.391
+0.373
etsy+0.370
arry+0.367
ongo+0.365
Neg logit contributions
 the-0.330
 a-0.305
 up-0.285
 out-0.285
 more-0.264
Token to
Token ID284
Feature activation+1.44e+01

Pos logit contributions
etsy+0.020
uddy+0.020
lor+0.019
obby+0.019
arry+0.019
Neg logit contributions
 the-0.008
 a-0.007
 shiny-0.006
 all-0.006
 more-0.006
Token soup
Token ID17141
Feature activation+4.58e+00

Pos logit contributions
 the+0.104
 all+0.079
 her+0.070
 a+0.059
 in+0.056
Neg logit contributions
lor-0.691
mmmm-0.612
oc-0.607
obby-0.604
apa-0.604
Token.
Token ID13
Feature activation-1.21e+01

Pos logit contributions
 the+0.090
 a+0.080
 all+0.066
 out+0.061
 up+0.060
Neg logit contributions
lor-0.196
arry-0.179
uddy-0.177
-0.174
ides-0.172
Token Can
Token ID1680
Feature activation-2.84e+01

Pos logit contributions
lor+0.317
arry+0.282
jo+0.277
etsy+0.267
ongo+0.265
Neg logit contributions
 the-0.493
 a-0.484
 in-0.415
 out-0.415
 some-0.412
Token we
Token ID356
Feature activation-7.36e+00

Pos logit contributions
lor+0.973
arry+0.878
uddy+0.857
etsy+0.839
+0.838
Neg logit contributions
 the-0.863
 a-0.804
 all-0.735
 out-0.695
 in-0.685
Token have
Token ID423
Feature activation+1.24e+01

Pos logit contributions
lor+0.292
arry+0.272
+0.272
uddy+0.265
etsy+0.263
Neg logit contributions
 the-0.165
 a-0.148
 all-0.126
 up-0.125
 out-0.121
Token some
Token ID617
Feature activation+3.43e+01

Pos logit contributions
 the+0.139
 up+0.131
 out+0.111
 in+0.109
 all+0.102
Neg logit contributions
lor-0.526
mmmm-0.482
mmm-0.479
-0.474
oral-0.463
Token?"
Token ID1701
Feature activation-2.50e+01

Pos logit contributions
 the+0.906
 a+0.811
 all+0.735
 up+0.727
 out+0.713
Neg logit contributions
lor-1.165
oral-1.062
arry-1.050
ides-1.040
mmm-1.036
Token Mom
Token ID11254
Feature activation-4.68e+00

Pos logit contributions
lor+1.221
ides+1.082
reetings+1.063
istent+1.058
arry+1.049
Neg logit contributions
 the-0.642
 a-0.554
 her-0.442
 all-0.441
 their-0.429
Token said
Token ID531
Feature activation-3.49e+00

Pos logit contributions
lor+0.171
arry+0.156
uddy+0.151
etsy+0.148
+0.147
Neg logit contributions
 the-0.135
 a-0.124
 all-0.113
 out-0.105
 in-0.105
Token,
Token ID11
Feature activation+5.25e+00

Pos logit contributions
lor+0.144
+0.134
arry+0.134
etsy+0.134
uddy+0.133
Neg logit contributions
 the-0.067
 a-0.059
 all-0.055
 up-0.046
 out-0.045
Token "
Token ID366
Feature activation-1.21e+01

Pos logit contributions
 the+0.172
 a+0.160
 all+0.148
 out+0.140
 in+0.138
Neg logit contributions
lor-0.167
arry-0.150
uddy-0.145
-0.144
etsy-0.143
Token truck
Token ID7779
Feature activation+4.22e+00

Pos logit contributions
 the+0.611
 a+0.591
 all+0.525
 her+0.506
 out+0.497
Neg logit contributions
lor-0.708
uddy-0.679
-0.643
och-0.640
oral-0.635
Token to
Token ID284
Feature activation+1.45e+01

Pos logit contributions
 the+0.062
 a+0.052
 all+0.040
 her+0.034
 more+0.032
Neg logit contributions
lor-0.192
uddy-0.187
arry-0.181
jo-0.178
och-0.176
Token Tom
Token ID4186
Feature activation+7.57e+00

Pos logit contributions
 the+0.101
 all+0.096
 a+0.085
 in+0.084
 on+0.083
Neg logit contributions
-0.656
lor-0.653
arry-0.620
uddy-0.608
oral-0.607
Token and
Token ID290
Feature activation+1.62e+01

Pos logit contributions
 the+0.186
 a+0.171
 all+0.152
 out+0.137
 more+0.137
Neg logit contributions
lor-0.293
arry-0.265
uddy-0.262
jo-0.258
ides-0.254
Token dumps
Token ID45514
Feature activation-5.67e+00

Pos logit contributions
 the+0.437
 all+0.433
 a+0.427
 in+0.386
 shiny+0.378
Neg logit contributions
oke-0.361
lor-0.358
uddy-0.356
 Deal-0.354
 Maybe-0.353
Token some
Token ID617
Feature activation+3.43e+01

Pos logit contributions
och+0.232
lor+0.227
+0.227
umbs+0.217
umm+0.217
Neg logit contributions
 the-0.035
 a-0.031
 all-0.025
 more-0.018
 in-0.018
Token sand
Token ID6450
Feature activation+1.97e+01

Pos logit contributions
 the+0.707
 a+0.617
 all+0.590
 her+0.575
 up+0.518
Neg logit contributions
oc-1.090
umm-1.078
oral-1.077
och-1.075
oke-1.058
Token next
Token ID1306
Feature activation+1.41e+01

Pos logit contributions
 the+0.295
 a+0.262
 her+0.187
 more+0.179
 all+0.179
Neg logit contributions
lor-0.838
ider-0.782
ides-0.780
arry-0.770
oke-0.761
Token to
Token ID284
Feature activation+1.45e+01

Pos logit contributions
 the+0.234
 a+0.202
 all+0.174
 her+0.140
 out+0.131
Neg logit contributions
lor-0.617
arry-0.578
uddy-0.568
ides-0.564
obby-0.563
Token him
Token ID683
Feature activation+1.85e+01

Pos logit contributions
 the+0.283
 a+0.275
 all+0.261
 in+0.252
 shiny+0.244
Neg logit contributions
lor-0.488
-0.486
arry-0.448
uddy-0.447
och-0.445
Token.
Token ID13
Feature activation-1.21e+01

Pos logit contributions
 the+0.476
 a+0.433
 all+0.389
 out+0.357
 more+0.352
Neg logit contributions
lor-0.691
arry-0.633
uddy-0.619
-0.604
jo-0.604
Token on
Token ID319
Feature activation+2.01e+01

Pos logit contributions
lor+1.585
+1.515
arry+1.494
uddy+1.481
etsy+1.451
Neg logit contributions
 the-0.338
 a-0.268
 all-0.206
 more-0.138
 her-0.131
Token,
Token ID11
Feature activation+5.29e+00

Pos logit contributions
 the+0.360
 a+0.326
 all+0.279
 out+0.248
 in+0.243
Neg logit contributions
lor-0.912
uddy-0.845
arry-0.844
-0.820
etsy-0.812
Token let
Token ID1309
Feature activation-5.66e+00

Pos logit contributions
 the+0.144
 a+0.134
 all+0.125
 out+0.115
 up+0.114
Neg logit contributions
lor-0.185
uddy-0.168
etsy-0.167
arry-0.166
oke-0.159
Token's
Token ID338
Feature activation+1.01e+01

Pos logit contributions
lor+0.214
arry+0.189
uddy+0.188
etsy+0.184
jo+0.182
Neg logit contributions
 the-0.146
 a-0.135
 all-0.123
 in-0.116
 out-0.112
Token pick
Token ID2298
Feature activation+1.40e+01

Pos logit contributions
 the+0.201
 a+0.178
 all+0.148
 out+0.143
 more+0.139
Neg logit contributions
lor-0.410
-0.383
arry-0.372
uddy-0.368
ides-0.367
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.139
 all+0.106
 a+0.100
 shiny+0.092
 in+0.084
Neg logit contributions
lor-0.633
mmmm-0.569
'."-0.564
mmm-0.562
oc-0.562
Token herbs
Token ID25411
Feature activation-2.31e+00

Pos logit contributions
 the+0.815
 a+0.743
 all+0.682
 out+0.640
 her+0.637
Neg logit contributions
lor-1.149
oral-1.115
oc-1.107
arry-1.079
mmmm-1.076
Token and
Token ID290
Feature activation+1.66e+01

Pos logit contributions
lor+0.097
arry+0.088
och+0.086
+0.085
etsy+0.085
Neg logit contributions
 the-0.045
 a-0.042
 all-0.034
 his-0.032
 out-0.031
Token take
Token ID1011
Feature activation+1.41e+01

Pos logit contributions
 the+0.336
 up+0.310
 all+0.298
 out+0.296
 a+0.296
Neg logit contributions
lor-0.546
-0.519
Really-0.510
mmm-0.509
etsy-0.503
Token them
Token ID606
Feature activation+1.65e+01

Pos logit contributions
 the+0.206
 a+0.157
 all+0.155
 up+0.144
 shiny+0.144
Neg logit contributions
lor-0.608
etsy-0.545
ides-0.538
'."-0.535
-0.528
Token to
Token ID284
Feature activation+1.45e+01

Pos logit contributions
 the+0.276
 a+0.241
 all+0.185
 shiny+0.172
 his+0.169
Neg logit contributions
lor-0.713
uddy-0.646
etsy-0.645
arry-0.643
ides-0.636
Token open
Token ID1280
Feature activation+3.19e+01

Pos logit contributions
lor+0.247
arry+0.228
uddy+0.221
+0.220
etsy+0.220
Neg logit contributions
 the-0.122
 a-0.109
 all-0.094
 out-0.088
 in-0.086
Token the
Token ID262
Feature activation+2.63e+01

Pos logit contributions
 a+0.375
 the+0.335
 all+0.300
 shiny+0.292
 in+0.279
Neg logit contributions
lor-1.267
'."-1.162
obby-1.156
.'"-1.136
'.-1.128
Token closet
Token ID21615
Feature activation-9.90e+00

Pos logit contributions
 the+0.743
 a+0.687
 all+0.611
 his+0.568
 her+0.563
Neg logit contributions
lor-0.916
arry-0.811
och-0.796
oral-0.795
uddy-0.793
Token and
Token ID290
Feature activation+1.71e+01

Pos logit contributions
lor+0.428
arry+0.384
+0.374
uddy+0.371
och+0.370
Neg logit contributions
 the-0.202
 a-0.182
 all-0.148
 his-0.139
 out-0.137
Token put
Token ID1234
Feature activation+1.46e+01

Pos logit contributions
 the+0.333
 up+0.311
 a+0.305
 all+0.300
 out+0.283
Neg logit contributions
lor-0.520
grand-0.510
 Deal-0.504
Really-0.492
och-0.492
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.225
 a+0.203
 all+0.184
 her+0.180
 shiny+0.179
Neg logit contributions
lor-0.512
och-0.443
'."-0.435
-0.422
'.-0.421
Token things
Token ID1243
Feature activation+1.11e+01

Pos logit contributions
 the+0.710
 a+0.605
 all+0.538
 up+0.510
 out+0.486
Neg logit contributions
lor-1.261
och-1.210
oral-1.165
arry-1.144
ider-1.144
Token inside
Token ID2641
Feature activation+1.61e+01

Pos logit contributions
 the+0.159
 a+0.137
 all+0.106
 his+0.099
 her+0.096
Neg logit contributions
lor-0.487
och-0.464
ider-0.445
etsy-0.441
arry-0.439
Token.
Token ID13
Feature activation-1.17e+01

Pos logit contributions
 the+0.452
 a+0.434
 all+0.394
 up+0.375
 out+0.375
Neg logit contributions
lor-0.500
arry-0.423
och-0.422
obby-0.416
jo-0.414
Token We
Token ID775
Feature activation-2.65e+01

Pos logit contributions
lor+0.317
arry+0.281
jo+0.273
etsy+0.269
ongo+0.269
Neg logit contributions
 the-0.484
 a-0.476
 some-0.395
 out-0.393
 in-0.391
Token are
Token ID389
Feature activation+9.22e+00

Pos logit contributions
lor+0.838
arry+0.769
uddy+0.746
etsy+0.745
+0.741
Neg logit contributions
 the-0.827
 a-0.757
 all-0.678
 up-0.664
 out-0.661
Token me
Token ID502
Feature activation+1.54e+01

Pos logit contributions
 the+0.377
 a+0.350
 in+0.341
 up+0.333
 all+0.330
Neg logit contributions
lor-0.617
mmm-0.505
mmmm-0.504
'."-0.502
etsy-0.498
Token.
Token ID13
Feature activation-1.19e+01

Pos logit contributions
 the+0.430
 a+0.395
 all+0.354
 out+0.334
 up+0.326
Neg logit contributions
lor-0.551
arry-0.494
uddy-0.480
jo-0.471
etsy-0.470
Token Let
Token ID3914
Feature activation-2.21e+01

Pos logit contributions
lor+0.375
arry+0.343
jo+0.325
oral+0.324
ides+0.320
Neg logit contributions
 the-0.436
 a-0.429
 out-0.347
 in-0.345
 all-0.344
Token's
Token ID338
Feature activation+1.04e+01

Pos logit contributions
lor+0.813
arry+0.708
uddy+0.700
etsy+0.695
jo+0.689
Neg logit contributions
 the-0.572
 a-0.532
 all-0.481
 in-0.455
 out-0.442
Token have
Token ID423
Feature activation+1.23e+01

Pos logit contributions
 the+0.199
 a+0.173
 all+0.146
 up+0.142
 out+0.142
Neg logit contributions
lor-0.422
-0.385
arry-0.374
 Amen-0.370
ides-0.369
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.120
 up+0.096
 out+0.089
 all+0.087
 in+0.080
Neg logit contributions
lor-0.564
mmmm-0.505
mmm-0.493
-0.492
etsy-0.488
Token pie
Token ID2508
Feature activation-3.73e-01

Pos logit contributions
 the+0.759
 a+0.640
 all+0.607
 up+0.583
 out+0.570
Neg logit contributions
lor-1.283
oral-1.240
-1.206
unch-1.181
arry-1.178
Token together
Token ID1978
Feature activation+1.49e+01

Pos logit contributions
lor+0.015
arry+0.013
uddy+0.013
+0.013
etsy+0.013
Neg logit contributions
 the-0.009
 a-0.008
 all-0.007
 out-0.007
 up-0.007
Token."
Token ID526
Feature activation-2.54e+01

Pos logit contributions
 the+0.478
 a+0.444
 all+0.408
 out+0.386
 in+0.380
Neg logit contributions
lor-0.483
arry-0.431
uddy-0.418
etsy-0.410
-0.406
Token
Token ID198
Feature activation-2.17e+01

Pos logit contributions
lor+1.052
umm+0.963
istent+0.952
reetings+0.951
ides+0.947
Neg logit contributions
 the-0.889
 a-0.794
 her-0.683
 their-0.659
 his-0.645
Token
Token ID198
Feature activation-2.18e+01

Pos logit contributions
umm+0.814
lor+0.807
istent+0.764
ides+0.749
ider+0.736
Neg logit contributions
 the-0.782
 a-0.701
 her-0.586
 all-0.581
 his-0.575
Token the
Token ID262
Feature activation+2.60e+01

Pos logit contributions
 out+0.126
 in+0.065
 up+0.054
 the+0.052
 all+0.050
Neg logit contributions
lor-1.445
-1.411
omo-1.342
azz-1.337
oral-1.337
Token bucket
Token ID19236
Feature activation+6.86e+00

Pos logit contributions
 the+0.730
 a+0.680
 all+0.622
 her+0.606
 his+0.599
Neg logit contributions
lor-0.809
oral-0.785
-0.771
arry-0.766
oke-0.753
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
 the+0.133
 a+0.124
 all+0.102
 out+0.092
 an+0.091
Neg logit contributions
lor-0.253
arry-0.236
uddy-0.235
-0.233
oke-0.233
Token p
Token ID279
Feature activation+1.20e+01

Pos logit contributions
 a+0.428
 all+0.422
 the+0.409
 out+0.403
 in+0.402
Neg logit contributions
Really-0.349
lor-0.337
pill-0.334
 Maybe-0.334
-0.323
Tokenours
Token ID4662
Feature activation-1.56e+01

Pos logit contributions
 the+0.524
 a+0.501
 all+0.467
 out+0.447
 in+0.446
Neg logit contributions
lor-0.253
arry-0.219
uddy-0.206
-0.197
etsy-0.197
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
+0.637
umbs+0.617
mmmm+0.611
oral+0.608
lor+0.600
Neg logit contributions
 the-0.096
 a-0.083
 all-0.071
 shiny-0.061
 out-0.060
Token water
Token ID1660
Feature activation+2.79e+01

Pos logit contributions
 the+0.561
 a+0.481
 all+0.451
 his+0.445
 her+0.408
Neg logit contributions
-1.213
umbs-1.196
oral-1.172
lor-1.171
arry-1.151
Token on
Token ID319
Feature activation+2.02e+01

Pos logit contributions
 the+0.366
 a+0.354
 all+0.277
 more+0.249
 shiny+0.239
Neg logit contributions
lor-1.133
-1.096
oke-1.064
oling-1.062
umbs-1.051
Token the
Token ID262
Feature activation+2.59e+01

Pos logit contributions
 the+0.074
 a+0.057
 all+0.035
 out+0.022
 in+0.012
Neg logit contributions
lor-1.066
-1.041
uddy-0.983
etsy-0.982
arry-0.977
Token new
Token ID649
Feature activation+2.78e+01

Pos logit contributions
 the+0.641
 a+0.591
 all+0.541
 her+0.503
 his+0.495
Neg logit contributions
lor-0.930
-0.860
arry-0.835
etsy-0.833
uddy-0.832
Token plant
Token ID4618
Feature activation+7.20e+00

Pos logit contributions
 the+0.702
 a+0.642
 all+0.592
 her+0.545
 his+0.544
Neg logit contributions
lor-0.926
arry-0.872
-0.846
obby-0.833
oral-0.829
Token not
Token ID407
Feature activation+8.59e+00

Pos logit contributions
 the+0.197
 a+0.168
 his+0.164
 out+0.159
 up+0.156
Neg logit contributions
ider-0.291
oling-0.266
umpy-0.255
oke-0.253
uddy-0.249
Token careful
Token ID8161
Feature activation+3.97e+00

Pos logit contributions
 the+0.158
 a+0.137
 all+0.132
 up+0.120
 some+0.115
Neg logit contributions
aron-0.298
oke-0.295
unch-0.288
obby-0.284
oling-0.284
Token.
Token ID13
Feature activation-1.23e+01

Pos logit contributions
 the+0.101
 a+0.092
 all+0.082
 out+0.076
 more+0.074
Neg logit contributions
lor-0.148
arry-0.138
uddy-0.136
jo-0.133
obby-0.130
Token They
Token ID1119
Feature activation-1.25e+01

Pos logit contributions
lor+0.384
arry+0.320
och+0.317
jo+0.310
etsy+0.308
Neg logit contributions
 the-0.482
 a-0.455
 all-0.403
 out-0.388
 in-0.384
Token spill
Token ID19431
Feature activation+9.52e+00

Pos logit contributions
lor+0.414
+0.404
arry+0.402
uddy+0.397
ides+0.395
Neg logit contributions
 the-0.309
 a-0.285
 out-0.250
 up-0.242
 all-0.241
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.084
 a+0.065
 adventurous+0.064
 up+0.060
 all+0.059
Neg logit contributions
umbs-0.363
-0.343
oling-0.333
lor-0.330
oled-0.327
Token cereal
Token ID33158
Feature activation-1.49e-02

Pos logit contributions
 the+0.752
 a+0.690
 all+0.591
 up+0.571
 out+0.567
Neg logit contributions
lor-1.156
umm-1.136
-1.131
umbs-1.118
oral-1.106
Token on
Token ID319
Feature activation+2.01e+01

Pos logit contributions
lor+0.001
+0.001
ider+0.001
oling+0.001
uddy+0.001
Neg logit contributions
 the-0.000
 a-0.000
 an-0.000
 all-0.000
 her-0.000
Token the
Token ID262
Feature activation+2.62e+01

Pos logit contributions
 the+0.100
 a+0.081
 out+0.076
 up+0.076
 all+0.075
Neg logit contributions
-0.959
lor-0.950
mmm-0.872
mmmm-0.871
oral-0.870
Token table
Token ID3084
Feature activation+9.38e-01

Pos logit contributions
 the+0.854
 a+0.792
 all+0.735
 out+0.686
 more+0.670
Neg logit contributions
lor-0.782
uddy-0.680
-0.663
arry-0.658
etsy-0.655
Token and
Token ID290
Feature activation+1.66e+01

Pos logit contributions
 the+0.024
 a+0.022
 all+0.020
 up+0.018
 out+0.018
Neg logit contributions
lor-0.033
-0.029
arry-0.029
ides-0.029
uddy-0.029
Token with
Token ID351
Feature activation+1.69e+01

Pos logit contributions
 the+0.163
 a+0.156
 out+0.132
 up+0.127
 all+0.123
Neg logit contributions
lor-0.373
arry-0.352
oc-0.347
aron-0.343
oke-0.339
Token Lily
Token ID20037
Feature activation+7.72e+00

Pos logit contributions
 the+0.347
 a+0.327
 out+0.290
 up+0.284
 in+0.278
Neg logit contributions
lor-0.643
uddy-0.610
arry-0.601
jo-0.588
etsy-0.584
Token.
Token ID13
Feature activation-1.21e+01

Pos logit contributions
 the+0.183
 a+0.167
 all+0.145
 out+0.142
 up+0.134
Neg logit contributions
lor-0.281
uddy-0.265
jo-0.260
arry-0.259
oke-0.256
Token He
Token ID679
Feature activation-7.01e+00

Pos logit contributions
lor+0.381
ides+0.306
och+0.306
arry+0.305
reetings+0.304
Neg logit contributions
 the-0.482
 a-0.451
 all-0.415
 his-0.392
 more-0.385
Token took
Token ID1718
Feature activation+1.15e+01

Pos logit contributions
lor+0.235
arry+0.218
uddy+0.217
jo+0.213
etsy+0.211
Neg logit contributions
 the-0.201
 a-0.184
 out-0.164
 all-0.163
 up-0.156
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.077
 going+0.067
 in+0.060
 shiny+0.059
 a+0.054
Neg logit contributions
enny-0.425
lor-0.420
etsy-0.413
lying-0.413
 Eventually-0.408
Token of
Token ID286
Feature activation+1.44e+01

Pos logit contributions
 the+0.861
 a+0.785
 all+0.719
 up+0.699
 her+0.679
Neg logit contributions
oke-1.005
arry-0.974
oral-0.973
oc-0.968
uddy-0.967
Token her
Token ID607
Feature activation+2.22e+01

Pos logit contributions
 a+0.095
 the+0.084
 in+0.084
 out+0.083
 up+0.077
Neg logit contributions
lor-0.622
uddy-0.618
arry-0.618
-0.616
enny-0.616
Token blocks
Token ID7021
Feature activation+7.77e+00

Pos logit contributions
 the+0.498
 a+0.453
 all+0.403
 up+0.397
 out+0.386
Neg logit contributions
uddy-0.746
lor-0.745
-0.744
oke-0.737
arry-0.729
Token and
Token ID290
Feature activation+1.68e+01

Pos logit contributions
 the+0.142
 a+0.129
 all+0.100
 up+0.091
 more+0.091
Neg logit contributions
lor-0.304
arry-0.303
uddy-0.298
och-0.296
-0.289
Token put
Token ID1234
Feature activation+1.45e+01

Pos logit contributions
 the+0.379
 up+0.358
 a+0.356
 out+0.349
 all+0.331
Neg logit contributions
lor-0.425
jo-0.415
oc-0.410
anne-0.407
uddy-0.402
Token on
Token ID319
Feature activation+2.00e+01

Pos logit contributions
lor+1.591
arry+1.492
+1.474
uddy+1.473
etsy+1.448
Neg logit contributions
 the-0.363
 a-0.291
 all-0.225
 more-0.157
 up-0.149
Token,
Token ID11
Feature activation+5.35e+00

Pos logit contributions
 the+0.369
 a+0.336
 all+0.293
 out+0.261
 in+0.259
Neg logit contributions
lor-0.885
uddy-0.817
arry-0.811
-0.787
jo-0.786
Token let
Token ID1309
Feature activation-5.81e+00

Pos logit contributions
 the+0.142
 a+0.133
 all+0.123
 up+0.113
 out+0.112
Neg logit contributions
lor-0.189
uddy-0.173
arry-0.170
etsy-0.170
jo-0.163
Token's
Token ID338
Feature activation+1.03e+01

Pos logit contributions
lor+0.221
arry+0.195
uddy+0.194
etsy+0.190
jo+0.189
Neg logit contributions
 the-0.150
 a-0.139
 all-0.126
 in-0.118
 out-0.115
Token have
Token ID423
Feature activation+1.19e+01

Pos logit contributions
 the+0.194
 a+0.169
 all+0.141
 more+0.134
 in+0.132
Neg logit contributions
lor-0.434
-0.391
arry-0.391
uddy-0.378
ides-0.375
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.119
 up+0.091
 all+0.084
 in+0.084
 out+0.082
Neg logit contributions
lor-0.559
etsy-0.494
-0.485
mmmm-0.484
Really-0.483
Token water
Token ID1660
Feature activation+2.77e+01

Pos logit contributions
 the+0.734
 a+0.608
 all+0.571
 up+0.554
 out+0.527
Neg logit contributions
lor-1.336
oral-1.257
arry-1.232
-1.230
uddy-1.222
Token and
Token ID290
Feature activation+1.65e+01

Pos logit contributions
 the+0.564
 a+0.500
 all+0.430
 out+0.384
 up+0.380
Neg logit contributions
lor-1.188
arry-1.087
uddy-1.072
etsy-1.045
-1.041
Token cookies
Token ID14746
Feature activation+7.21e+00

Pos logit contributions
 the+0.270
 up+0.246
 all+0.235
 out+0.217
 a+0.200
Neg logit contributions
lor-0.613
uddy-0.567
grand-0.566
oral-0.554
-0.554
Token."
Token ID526
Feature activation-2.54e+01

Pos logit contributions
 the+0.183
 a+0.167
 all+0.148
 out+0.138
 up+0.135
Neg logit contributions
lor-0.280
arry-0.253
uddy-0.248
-0.244
etsy-0.243
Token Tom
Token ID4186
Feature activation+7.86e+00

Pos logit contributions
lor+1.098
ides+0.947
reetings+0.941
istent+0.913
wl+0.899
Neg logit contributions
 the-0.872
 a-0.775
 her-0.652
 all-0.641
 their-0.622
Token me
Token ID502
Feature activation+1.50e+01

Pos logit contributions
 the+0.149
 a+0.140
 all+0.128
 out+0.118
 up+0.116
Neg logit contributions
lor-0.218
arry-0.204
ides-0.193
obby-0.192
uddy-0.191
Token.
Token ID13
Feature activation-1.26e+01

Pos logit contributions
 the+0.384
 a+0.339
 all+0.319
 his+0.295
 her+0.292
Neg logit contributions
lor-0.511
arry-0.475
ides-0.459
-0.454
oc-0.437
Token You
Token ID921
Feature activation-2.64e+01

Pos logit contributions
lor+0.394
jo+0.340
anne+0.339
arry+0.334
abby+0.332
Neg logit contributions
 the-0.469
 a-0.447
 his-0.368
 in-0.360
 some-0.359
Token can
Token ID460
Feature activation-1.04e+00

Pos logit contributions
lor+0.823
arry+0.764
uddy+0.729
ides+0.727
+0.724
Neg logit contributions
 the-0.795
 a-0.715
 up-0.674
 his-0.651
 her-0.649
Token carry
Token ID3283
Feature activation+1.15e+01

Pos logit contributions
lor+0.046
arry+0.043
uddy+0.042
etsy+0.041
+0.041
Neg logit contributions
 the-0.021
 a-0.019
 all-0.016
 out-0.015
 in-0.015
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.102
 all+0.083
 a+0.082
 up+0.078
 out+0.073
Neg logit contributions
lor-0.482
oc-0.446
mmm-0.439
'."-0.438
och-0.437
Token of
Token ID286
Feature activation+1.44e+01

Pos logit contributions
 the+0.744
 a+0.656
 all+0.605
 up+0.581
 her+0.564
Neg logit contributions
lor-1.197
och-1.149
oc-1.138
arry-1.111
oral-1.109
Token the
Token ID262
Feature activation+2.59e+01

Pos logit contributions
 up+0.162
 in+0.137
 out+0.130
 a+0.129
 all+0.122
Neg logit contributions
lor-0.588
och-0.550
mmm-0.536
mmmm-0.534
oc-0.533
Token small
Token ID1402
Feature activation+2.90e+01

Pos logit contributions
 the+0.675
 a+0.613
 all+0.557
 her+0.540
 up+0.536
Neg logit contributions
lor-0.870
arry-0.821
oral-0.812
uddy-0.806
oke-0.804
Token boxes
Token ID10559
Feature activation+6.29e+00

Pos logit contributions
 the+0.752
 a+0.698
 all+0.651
 up+0.621
 her+0.609
Neg logit contributions
lor-0.927
arry-0.916
oc-0.910
oke-0.874
obby-0.862
Token and
Token ID290
Feature activation+1.61e+01

Pos logit contributions
 the+0.121
 a+0.109
 all+0.090
 her+0.084
 his+0.082
Neg logit contributions
lor-0.260
arry-0.241
och-0.237
uddy-0.230
etsy-0.230
Token my
Token ID616
Feature activation+1.86e+01

Pos logit contributions
 the+0.332
 up+0.326
 in+0.287
 out+0.285
 a+0.284
Neg logit contributions
lor-0.548
mmmm-0.534
mmm-0.525
umm-0.502
umbs-0.487
Token t
Token ID256
Feature activation+1.28e+01

Pos logit contributions
 the+0.470
 a+0.413
 all+0.383
 up+0.350
 out+0.347
Neg logit contributions
lor-0.662
-0.612
uddy-0.606
arry-0.604
ides-0.597
Tokeneddy
Token ID21874
Feature activation-2.36e+01

Pos logit contributions
 the+0.531
 a+0.514
 all+0.491
 in+0.474
 out+0.472
Neg logit contributions
lor-0.274
arry-0.232
reetings-0.229
uddy-0.226
istent-0.221
Token bear
Token ID6842
Feature activation+1.09e-02

Pos logit contributions
lor+0.807
oc+0.728
jo+0.705
uddy+0.704
+0.697
Neg logit contributions
 the-0.631
 a-0.584
 all-0.526
 out-0.495
 more-0.494
Token and
Token ID290
Feature activation+1.69e+01

Pos logit contributions
 the+0.000
 a+0.000
 all+0.000
 more+0.000
 out+0.000
Neg logit contributions
lor-0.000
arry-0.000
uddy-0.000
ides-0.000
jo-0.000
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.333
 up+0.300
 a+0.286
 all+0.274
 out+0.273
Neg logit contributions
lor-0.617
mmm-0.557
oc-0.555
mmmm-0.550
uddy-0.548
Token flour
Token ID10601
Feature activation-4.11e+00

Pos logit contributions
 the+0.812
 a+0.719
 all+0.649
 up+0.623
 out+0.600
Neg logit contributions
lor-1.250
arry-1.129
oral-1.118
och-1.117
ides-1.112
Token!"
Token ID2474
Feature activation-2.74e+01

Pos logit contributions
lor+0.124
arry+0.109
ides+0.108
oke+0.104
+0.104
Neg logit contributions
 the-0.125
 a-0.117
 all-0.103
 more-0.100
 his-0.099
Token Lily
Token ID20037
Feature activation+7.27e+00

Pos logit contributions
lor+1.201
arry+1.090
uddy+1.063
ides+1.047
etsy+1.045
Neg logit contributions
 the-0.647
 a-0.579
 all-0.499
 in-0.452
 out-0.449
Token said
Token ID531
Feature activation-3.28e+00

Pos logit contributions
 the+0.209
 a+0.193
 all+0.175
 in+0.162
 out+0.162
Neg logit contributions
lor-0.269
arry-0.243
uddy-0.237
etsy-0.232
-0.231
Token,
Token ID11
Feature activation+5.31e+00

Pos logit contributions
lor+0.130
arry+0.122
uddy+0.120
etsy+0.118
jo+0.117
Neg logit contributions
 the-0.076
 a-0.068
 all-0.063
 more-0.055
 out-0.054
Token sp
Token ID599
Feature activation+1.06e+01

Pos logit contributions
 up+0.321
 the+0.301
 all+0.295
 out+0.288
 a+0.284
Neg logit contributions
lor-0.544
 Deal-0.517
och-0.515
oke-0.505
uddy-0.504
Tokenoons
Token ID13022
Feature activation-1.66e+01

Pos logit contributions
 the+0.495
 a+0.471
 all+0.447
 out+0.429
 in+0.428
Neg logit contributions
lor-0.194
arry-0.159
uddy-0.151
etsy-0.145
-0.142
Token.
Token ID13
Feature activation-1.18e+01

Pos logit contributions
lor+0.613
arry+0.563
och+0.548
ides+0.547
oke+0.543
Neg logit contributions
 the-0.352
 a-0.319
 all-0.279
 up-0.268
 out-0.260
Token We
Token ID775
Feature activation-2.64e+01

Pos logit contributions
lor+0.334
arry+0.288
jo+0.281
ongo+0.281
uddy+0.276
Neg logit contributions
 the-0.472
 a-0.468
 some-0.395
 out-0.393
 in-0.387
Token need
Token ID761
Feature activation+3.37e+00

Pos logit contributions
lor+0.781
arry+0.738
ides+0.727
+0.722
uddy+0.715
Neg logit contributions
 the-0.807
 a-0.734
 up-0.668
 his-0.649
 all-0.645
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.039
 up+0.038
 all+0.032
 in+0.030
 out+0.028
Neg logit contributions
lor-0.144
obby-0.130
oral-0.130
-0.127
'."-0.126
Token flags
Token ID9701
Feature activation-9.85e-01

Pos logit contributions
 the+0.764
 a+0.657
 all+0.637
 up+0.634
 out+0.582
Neg logit contributions
lor-1.250
ides-1.164
oral-1.159
arry-1.154
och-1.145
Token.
Token ID13
Feature activation-1.18e+01

Pos logit contributions
lor+0.041
arry+0.038
och+0.037
ides+0.037
uddy+0.036
Neg logit contributions
 the-0.018
 a-0.015
 all-0.014
 up-0.013
 his-0.012
Token We
Token ID775
Feature activation-2.64e+01

Pos logit contributions
lor+0.319
jo+0.276
arry+0.276
ongo+0.272
uddy+0.265
Neg logit contributions
 the-0.482
 a-0.480
 some-0.410
 out-0.405
 in-0.402
Token can
Token ID460
Feature activation-7.33e-01

Pos logit contributions
lor+0.778
arry+0.736
ides+0.732
+0.721
etsy+0.716
Neg logit contributions
 the-0.803
 a-0.727
 up-0.669
 his-0.647
 all-0.639
Token use
Token ID779
Feature activation+1.53e+01

Pos logit contributions
lor+0.031
arry+0.029
+0.028
uddy+0.028
ides+0.028
Neg logit contributions
 the-0.015
 a-0.014
 all-0.011
 out-0.011
 in-0.011
Token."
Token ID526
Feature activation-2.54e+01

Pos logit contributions
 the+0.174
 a+0.161
 all+0.145
 out+0.139
 up+0.136
Neg logit contributions
lor-0.208
arry-0.188
uddy-0.183
-0.179
etsy-0.178
Token
Token ID198
Feature activation-2.16e+01

Pos logit contributions
lor+1.111
reetings+0.969
ides+0.969
istent+0.951
wl+0.936
Neg logit contributions
 the-0.864
 a-0.776
 their-0.674
 all-0.645
 her-0.636
Token
Token ID198
Feature activation-2.17e+01

Pos logit contributions
lor+0.878
umm+0.828
ides+0.793
istent+0.790
ider+0.762
Neg logit contributions
 the-0.747
 a-0.665
 all-0.556
 their-0.554
 his-0.537
TokenThey
Token ID2990
Feature activation-2.06e+01

Pos logit contributions
lor+0.726
umm+0.649
ides+0.640
istent+0.638
arry+0.614
Neg logit contributions
 the-0.833
 a-0.766
 their-0.677
 in-0.671
 more-0.669
Token eat
Token ID4483
Feature activation+1.76e+01

Pos logit contributions
lor+0.737
arry+0.709
+0.706
oral+0.679
uddy+0.679
Neg logit contributions
 the-0.455
 a-0.425
 up-0.361
 more-0.358
 out-0.357
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.095
 a+0.092
 out+0.081
 all+0.064
 in+0.050
Neg logit contributions
-0.727
oral-0.698
aron-0.685
Really-0.659
etsy-0.658
Token apples
Token ID22514
Feature activation+7.61e+00

Pos logit contributions
 the+0.793
 a+0.735
 out+0.653
 all+0.649
 up+0.636
Neg logit contributions
oral-1.142
-1.111
arry-1.096
lor-1.085
och-1.067
Token.
Token ID13
Feature activation-1.20e+01

Pos logit contributions
 the+0.146
 a+0.139
 all+0.112
 out+0.111
 up+0.104
Neg logit contributions
lor-0.276
-0.273
arry-0.264
uddy-0.263
och-0.260
Token They
Token ID1119
Feature activation-1.28e+01

Pos logit contributions
lor+0.417
arry+0.353
jo+0.348
wl+0.344
och+0.341
Neg logit contributions
 the-0.440
 a-0.424
 all-0.363
 their-0.343
 some-0.340
Token are
Token ID389
Feature activation+8.99e+00

Pos logit contributions
lor+0.455
arry+0.434
+0.434
uddy+0.421
oral+0.415
Neg logit contributions
 the-0.311
 a-0.287
 out-0.256
 up-0.250
 all-0.243
Token sweet
Token ID6029
Feature activation+9.60e+00

Pos logit contributions
 the+0.158
 out+0.140
 a+0.137
 any+0.123
 up+0.123
Neg logit contributions
omo-0.303
aron-0.292
ongo-0.287
uddy-0.285
 Amen-0.283
Token talk
Token ID1561
Feature activation+1.72e+01

Pos logit contributions
 the+0.310
 a+0.285
 up+0.268
 all+0.267
 out+0.255
Neg logit contributions
lor-0.529
ides-0.484
arry-0.479
jo-0.479
-0.475
Token.
Token ID13
Feature activation-1.20e+01

Pos logit contributions
 the+0.317
 a+0.272
 all+0.243
 up+0.210
 out+0.206
Neg logit contributions
lor-0.750
arry-0.690
jo-0.682
uddy-0.679
ides-0.657
Token He
Token ID679
Feature activation-6.98e+00

Pos logit contributions
lor+0.442
istent+0.407
wl+0.400
ides+0.399
jo+0.392
Neg logit contributions
 the-0.402
 a-0.392
 his-0.329
 her-0.306
 some-0.302
Token teaches
Token ID17324
Feature activation-1.46e+01

Pos logit contributions
lor+0.211
arry+0.187
uddy+0.182
jo+0.180
oke+0.179
Neg logit contributions
 the-0.225
 a-0.210
 all-0.192
 out-0.188
 in-0.183
Token Sara
Token ID24799
Feature activation-4.99e+00

Pos logit contributions
lor+0.636
arry+0.600
oral+0.585
jo+0.577
uddy+0.572
Neg logit contributions
 the-0.200
 a-0.173
 out-0.170
 up-0.164
 all-0.156
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
lor+0.220
jo+0.206
uddy+0.204
arry+0.196
etsy+0.192
Neg logit contributions
 the-0.075
 a-0.061
 out-0.057
 all-0.057
 up-0.051
Token signs
Token ID5895
Feature activation+6.66e+00

Pos logit contributions
 the+0.909
 a+0.817
 all+0.761
 up+0.735
 out+0.733
Neg logit contributions
lor-1.185
arry-1.112
oral-1.091
uddy-1.084
ides-1.076
Token.
Token ID13
Feature activation-1.21e+01

Pos logit contributions
 the+0.131
 a+0.114
 all+0.101
 out+0.089
 up+0.089
Neg logit contributions
lor-0.287
arry-0.266
uddy-0.262
jo-0.258
och-0.257
Token
Token ID198
Feature activation-2.17e+01

Pos logit contributions
lor+0.425
istent+0.397
jo+0.386
ides+0.385
wl+0.381
Neg logit contributions
 the-0.404
 a-0.390
 his-0.327
 her-0.309
 all-0.308
Token
Token ID198
Feature activation-2.18e+01

Pos logit contributions
umm+0.845
lor+0.834
istent+0.793
ider+0.784
ides+0.769
Neg logit contributions
 the-0.751
 a-0.678
 all-0.567
 his-0.555
 big-0.542
TokenS
Token ID50
Feature activation-2.30e+01

Pos logit contributions
lor+0.612
istent+0.570
jo+0.567
umm+0.556
+0.555
Neg logit contributions
 the-0.855
 a-0.804
 all-0.714
 more-0.713
 his-0.698
Token and
Token ID290
Feature activation+1.66e+01

Pos logit contributions
 the+0.176
 a+0.162
 all+0.139
 up+0.136
 her+0.124
Neg logit contributions
lor-0.309
arry-0.295
umm-0.285
uddy-0.282
oke-0.282
Token hungry
Token ID14720
Feature activation+1.52e+01

Pos logit contributions
 the+0.382
 a+0.336
 all+0.334
 out+0.320
 up+0.302
Neg logit contributions
ider-0.506
 Amen-0.487
lor-0.486
Really-0.481
etsy-0.472
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
 the+0.408
 a+0.376
 all+0.343
 up+0.329
 out+0.323
Neg logit contributions
lor-0.502
arry-0.455
ides-0.441
uddy-0.441
-0.439
Token They
Token ID1119
Feature activation-1.26e+01

Pos logit contributions
lor+0.496
ides+0.445
istent+0.437
och+0.435
reetings+0.429
Neg logit contributions
 the-0.433
 a-0.411
 their-0.345
 some-0.311
 his-0.311
Token eat
Token ID4483
Feature activation+1.81e+01

Pos logit contributions
lor+0.429
arry+0.400
+0.393
ides+0.389
oral+0.387
Neg logit contributions
 the-0.319
 a-0.292
 out-0.260
 up-0.257
 all-0.253
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 a+0.077
 the+0.072
 out+0.068
 all+0.064
 hurt+0.045
Neg logit contributions
-0.687
oral-0.681
unch-0.659
lor-0.639
mmmm-0.627
Token snacks
Token ID28487
Feature activation+1.33e+01

Pos logit contributions
 the+0.697
 a+0.644
 all+0.603
 up+0.601
 out+0.598
Neg logit contributions
oral-1.155
lor-1.142
-1.080
unch-1.078
arry-1.074
Token and
Token ID290
Feature activation+1.67e+01

Pos logit contributions
 the+0.269
 a+0.248
 all+0.208
 up+0.202
 out+0.197
Neg logit contributions
lor-0.501
-0.467
och-0.462
uddy-0.461
arry-0.454
Token drink
Token ID4144
Feature activation+1.55e+01

Pos logit contributions
 the+0.350
 up+0.340
 out+0.332
 all+0.312
 a+0.306
Neg logit contributions
lor-0.531
-0.496
oral-0.493
anas-0.486
Really-0.484
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 the+0.221
 a+0.200
 all+0.178
 out+0.162
 up+0.157
Neg logit contributions
lor-0.582
-0.568
oral-0.554
umbs-0.551
etsy-0.536
Token juice
Token ID13135
Feature activation+1.14e+01

Pos logit contributions
 the+0.686
 a+0.607
 all+0.545
 out+0.517
 up+0.508
Neg logit contributions
lor-1.289
oral-1.224
-1.204
arry-1.199
umm-1.189
Token the
Token ID262
Feature activation+2.61e+01

Pos logit contributions
+0.249
lor+0.247
oral+0.242
unch+0.238
arry+0.237
Neg logit contributions
 up-0.035
 the-0.026
 out-0.025
 in-0.023
 a-0.017
Token shovel
Token ID33677
Feature activation-3.00e+00

Pos logit contributions
 the+0.755
 a+0.689
 all+0.650
 her+0.618
 out+0.618
Neg logit contributions
lor-0.748
arry-0.728
oral-0.700
-0.698
obby-0.685
Token to
Token ID284
Feature activation+1.43e+01

Pos logit contributions
lor+0.077
arry+0.077
obby+0.074
oke+0.073
och+0.072
Neg logit contributions
 the-0.089
 a-0.086
 all-0.075
 more-0.073
 her-0.072
Token scoop
Token ID23529
Feature activation+8.02e+00

Pos logit contributions
 the+0.287
 up+0.277
 a+0.272
 all+0.260
 out+0.259
Neg logit contributions
-0.409
ides-0.378
mmm-0.376
Finally-0.370
arry-0.368
Token up
Token ID510
Feature activation+3.16e+01

Pos logit contributions
 the+0.078
 a+0.070
 all+0.054
 shiny+0.044
 her+0.043
Neg logit contributions
lor-0.337
arry-0.320
oke-0.318
umbs-0.316
umm-0.306
Token some
Token ID617
Feature activation+3.42e+01

Pos logit contributions
 out+0.139
 the+0.122
 a+0.092
 up+0.075
 going+0.070
Neg logit contributions
lor-1.197
-1.175
oles-1.166
arry-1.165
umbs-1.164
Token sand
Token ID6450
Feature activation+2.00e+01

Pos logit contributions
 the+0.686
 a+0.631
 her+0.545
 all+0.544
 out+0.505
Neg logit contributions
umm-1.102
arry-1.090
oral-1.086
oke-1.062
lor-1.056
Token and
Token ID290
Feature activation+1.68e+01

Pos logit contributions
 the+0.425
 a+0.392
 all+0.315
 more+0.303
 her+0.301
Neg logit contributions
lor-0.779
arry-0.723
ides-0.703
uddy-0.695
-0.694
Token put
Token ID1234
Feature activation+1.46e+01

Pos logit contributions
 the+0.300
 a+0.278
 all+0.276
 out+0.262
 up+0.239
Neg logit contributions
Really-0.508
-0.497
mmm-0.490
lor-0.475
''-0.469
Token it
Token ID340
Feature activation+1.20e+01

Pos logit contributions
 the+0.138
 a+0.127
 more+0.101
 all+0.100
 scary+0.099
Neg logit contributions
lor-0.527
-0.512
och-0.482
Really-0.467
aron-0.466
Token in
Token ID287
Feature activation+1.92e+01

Pos logit contributions
 the+0.161
 a+0.118
 her+0.079
 more+0.078
 all+0.073
Neg logit contributions
ider-0.500
lor-0.467
-0.444
etsy-0.441
uddy-0.439
Token make
Token ID787
Feature activation+9.89e+00

Pos logit contributions
 the+0.139
 all+0.130
 up+0.124
 a+0.123
 out+0.117
Neg logit contributions
lor-0.368
och-0.353
grand-0.351
''-0.350
Really-0.349
Token it
Token ID340
Feature activation+5.99e+00

Pos logit contributions
 the+0.104
 in+0.079
 all+0.074
 a+0.073
 on+0.071
Neg logit contributions
lor-0.444
mmm-0.400
mmmm-0.395
arry-0.391
unch-0.390
Token whole
Token ID2187
Feature activation+6.72e+00

Pos logit contributions
 the+0.105
 a+0.076
 all+0.073
 her+0.066
 in+0.061
Neg logit contributions
lor-0.219
arry-0.205
ider-0.204
uer-0.203
ides-0.199
Token again
Token ID757
Feature activation+4.20e+00

Pos logit contributions
 the+0.204
 a+0.182
 her+0.165
 all+0.164
 out+0.162
Neg logit contributions
lor-0.199
-0.170
umm-0.169
arry-0.168
oral-0.164
Token.
Token ID13
Feature activation-1.74e+01

Pos logit contributions
 the+0.132
 a+0.123
 all+0.114
 out+0.109
 up+0.106
Neg logit contributions
lor-0.132
arry-0.119
uddy-0.113
jo-0.112
etsy-0.112
Token Amen
Token ID34717
Feature activation-6.63e+01

Pos logit contributions
lor+0.598
ides+0.535
abby+0.517
arry+0.507
jo+0.502
Neg logit contributions
 the-0.631
 a-0.615
 some-0.484
 an-0.475
 his-0.474
Token."
Token ID526
Feature activation-3.00e+01

Pos logit contributions
lor+2.594
umm+2.426
+2.421
etsy+2.374
arry+2.319
Neg logit contributions
 the-1.934
 a-1.793
 all-1.589
 in-1.508
 more-1.463
Token
Token ID198
Feature activation-2.62e+01

Pos logit contributions
lor+1.219
ides+1.103
umm+1.069
reetings+1.050
istent+1.025
Neg logit contributions
 the-1.054
 a-0.945
 her-0.808
 his-0.801
 their-0.788
Token
Token ID198
Feature activation-2.62e+01

Pos logit contributions
umm+1.205
lor+1.110
ides+1.089
istent+1.070
ider+1.067
Neg logit contributions
 the-0.850
 a-0.754
 his-0.629
 her-0.600
 their-0.596
TokenThen
Token ID6423
Feature activation-3.15e+01

Pos logit contributions
umm+0.918
lor+0.914
ides+0.860
istent+0.820
+0.788
Neg logit contributions
 the-0.920
 a-0.852
 her-0.737
 his-0.737
 their-0.726
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
lor+1.144
uddy+1.074
arry+1.051
+1.041
etsy+1.029
Neg logit contributions
 the-0.566
 a-0.525
 all-0.517
 up-0.508
 out-0.459
Token family
Token ID1641
Feature activation-9.00e+00

Pos logit contributions
 the+0.324
 a+0.298
 all+0.268
 out+0.259
 in+0.246
Neg logit contributions
lor-0.417
arry-0.383
uddy-0.381
oral-0.375
jo-0.375
Token and
Token ID290
Feature activation+1.28e+01

Pos logit contributions
lor+0.292
arry+0.251
jo+0.244
ides+0.243
etsy+0.239
Neg logit contributions
 the-0.259
 a-0.240
 all-0.220
 out-0.214
 up-0.210
Token our
Token ID674
Feature activation+1.17e+01

Pos logit contributions
 the+0.283
 a+0.251
 up+0.245
 all+0.243
 out+0.238
Neg logit contributions
lor-0.449
 Amen-0.414
jo-0.399
grand-0.396
oral-0.390
Token friends
Token ID2460
Feature activation+1.30e+00

Pos logit contributions
 the+0.295
 a+0.270
 out+0.239
 all+0.238
 up+0.220
Neg logit contributions
lor-0.418
oral-0.380
arry-0.380
uddy-0.379
jo-0.376
Token.
Token ID13
Feature activation-1.56e+01

Pos logit contributions
 the+0.042
 a+0.039
 all+0.035
 out+0.034
 up+0.034
Neg logit contributions
lor-0.041
arry-0.036
uddy-0.034
ides-0.034
etsy-0.034
Token Amen
Token ID34717
Feature activation-6.50e+01

Pos logit contributions
lor+0.431
arry+0.398
ides+0.382
ongo+0.378
abby+0.373
Neg logit contributions
 the-0.631
 a-0.629
 in-0.517
 out-0.513
 some-0.510
Token."
Token ID526
Feature activation-2.93e+01

Pos logit contributions
lor+2.550
etsy+2.308
+2.304
arry+2.293
och+2.278
Neg logit contributions
 the-1.905
 a-1.797
 all-1.567
 in-1.502
 more-1.465
Token
Token ID198
Feature activation-2.58e+01

Pos logit contributions
lor+1.236
umm+1.208
ides+1.181
istent+1.163
icious+1.134
Neg logit contributions
 the-0.972
 a-0.876
 her-0.771
 his-0.697
 their-0.694
Token
Token ID198
Feature activation-2.59e+01

Pos logit contributions
umm+1.055
lor+1.003
istent+0.979
ides+0.977
arry+0.940
Neg logit contributions
 the-0.901
 a-0.808
 her-0.683
 his-0.657
 their-0.642
TokenAnna
Token ID31160
Feature activation-2.41e+01

Pos logit contributions
umm+0.753
lor+0.747
arry+0.714
ides+0.711
istent+0.707
Neg logit contributions
 the-1.023
 a-0.952
 her-0.863
 more-0.827
 in-0.811
Token said
Token ID531
Feature activation-7.47e+00

Pos logit contributions
lor+0.823
arry+0.728
uddy+0.705
jo+0.700
etsy+0.697
Neg logit contributions
 the-0.738
 a-0.678
 all-0.624
 out-0.596
 in-0.579
Token gentle
Token ID10296
Feature activation+1.12e+00

Pos logit contributions
umm+1.111
lor+1.007
ides+0.982
istent+0.961
ider+0.955
Neg logit contributions
 the-0.892
 a-0.813
 all-0.671
 his-0.666
 big-0.642
Token.
Token ID13
Feature activation-2.36e+01

Pos logit contributions
 the+0.024
 a+0.022
 all+0.020
 out+0.018
 in+0.017
Neg logit contributions
lor-0.046
arry-0.042
jo-0.042
uddy-0.042
etsy-0.041
Token
Token ID198
Feature activation-3.43e+01

Pos logit contributions
lor+0.806
ides+0.730
umm+0.685
arry+0.678
oral+0.668
Neg logit contributions
 the-0.866
 a-0.809
 all-0.681
 his-0.668
 her-0.657
Token
Token ID198
Feature activation-3.40e+01

Pos logit contributions
umm+1.486
lor+1.379
ides+1.355
istent+1.336
ider+1.327
Neg logit contributions
 the-1.054
 a-0.980
 his-0.779
 her-0.748
 all-0.743
TokenM
Token ID44
Feature activation-4.08e+01

Pos logit contributions
umm+1.168
lor+1.101
ides+1.081
istent+1.030
oke+1.011
Neg logit contributions
 the-1.174
 a-1.121
 her-0.934
 his-0.927
 all-0.887
Tokenoral
Token ID6864
Feature activation-6.50e+01

Pos logit contributions
lor+0.858
oling+0.637
etsy+0.608
jo+0.606
och+0.602
Neg logit contributions
 the-1.800
 a-1.737
 all-1.619
 in-1.584
 more-1.575
Token:
Token ID25
Feature activation-2.95e+01

Pos logit contributions
lor+2.658
arry+2.442
uddy+2.381
etsy+2.337
jo+2.324
Neg logit contributions
 the-1.587
 a-1.439
 all-1.278
 out-1.177
 in-1.166
Token Being
Token ID11204
Feature activation-3.49e+01

Pos logit contributions
lor+0.784
arry+0.658
uddy+0.656
etsy+0.643
+0.640
Neg logit contributions
 the-1.206
 a-1.127
 all-1.048
 in-0.995
 out-0.983
Token respectful
Token ID25923
Feature activation-1.84e+01

Pos logit contributions
lor+1.100
reetings+1.092
arry+1.054
oral+0.998
aring+0.988
Neg logit contributions
 the-0.658
 all-0.571
 out-0.535
 a-0.521
 up-0.520
Token and
Token ID290
Feature activation+7.11e+00

Pos logit contributions
lor+0.763
arry+0.711
uddy+0.697
jo+0.687
oke+0.676
Neg logit contributions
 the-0.386
 a-0.349
 all-0.305
 out-0.281
 up-0.269
Token gentle
Token ID10296
Feature activation+9.43e+00

Pos logit contributions
 the+0.190
 a+0.181
 out+0.170
 all+0.166
 up+0.161
Neg logit contributions
lor-0.205
arry-0.204
reetings-0.200
oral-0.195
aring-0.191
Token house
Token ID2156
Feature activation-7.17e-01

Pos logit contributions
 the+0.333
 a+0.308
 out+0.286
 all+0.276
 up+0.267
Neg logit contributions
lor-0.398
oral-0.391
uddy-0.379
-0.373
arry-0.367
Token and
Token ID290
Feature activation+1.33e+01

Pos logit contributions
lor+0.026
arry+0.024
uddy+0.023
oral+0.023
etsy+0.023
Neg logit contributions
 the-0.018
 a-0.016
 all-0.015
 out-0.014
 up-0.014
Token our
Token ID674
Feature activation+1.20e+01

Pos logit contributions
 the+0.239
 a+0.223
 all+0.214
 up+0.213
 out+0.211
Neg logit contributions
lor-0.473
 Amen-0.446
grand-0.436
Really-0.436
oral-0.433
Token family
Token ID1641
Feature activation-8.67e+00

Pos logit contributions
 the+0.302
 a+0.278
 out+0.250
 all+0.241
 up+0.231
Neg logit contributions
lor-0.421
oral-0.401
uddy-0.397
arry-0.388
-0.388
Token.
Token ID13
Feature activation-1.57e+01

Pos logit contributions
lor+0.277
arry+0.244
uddy+0.235
etsy+0.233
ides+0.232
Neg logit contributions
 the-0.271
 a-0.253
 all-0.231
 out-0.223
 up-0.218
Token Amen
Token ID34717
Feature activation-6.47e+01

Pos logit contributions
lor+0.462
ides+0.407
arry+0.406
unch+0.389
anne+0.387
Neg logit contributions
 the-0.630
 a-0.620
 in-0.509
 their-0.502
 some-0.495
Token."
Token ID526
Feature activation-2.90e+01

Pos logit contributions
lor+2.618
etsy+2.365
+2.351
uddy+2.333
umm+2.332
Neg logit contributions
 the-1.853
 a-1.733
 all-1.520
 in-1.467
 more-1.429
Token
Token ID198
Feature activation-2.53e+01

Pos logit contributions
lor+1.121
ides+1.064
istent+1.058
reetings+1.045
umm+1.027
Neg logit contributions
 the-1.012
 a-0.915
 their-0.786
 his-0.759
 her-0.755
Token
Token ID198
Feature activation-2.54e+01

Pos logit contributions
umm+0.985
lor+0.965
istent+0.955
ides+0.935
arry+0.877
Neg logit contributions
 the-0.877
 a-0.798
 their-0.646
 his-0.635
 all-0.628
TokenMom
Token ID29252
Feature activation-3.34e+01

Pos logit contributions
istent+0.774
lor+0.766
umm+0.757
ides+0.746
jo+0.718
Neg logit contributions
 the-0.977
 a-0.917
 their-0.796
 more-0.786
 in-0.767
Token smiled
Token ID13541
Feature activation-5.11e+00

Pos logit contributions
lor+1.370
arry+1.259
uddy+1.231
etsy+1.207
jo+1.204
Neg logit contributions
 the-0.804
 a-0.728
 all-0.646
 out-0.594
 in-0.586
TokenH
Token ID39
Feature activation-4.01e+01

Pos logit contributions
lor+0.784
ides+0.718
arry+0.717
uddy+0.704
jo+0.694
Neg logit contributions
 the-0.943
 a-0.886
 in-0.750
 her-0.734
 more-0.730
Tokenola
Token ID5708
Feature activation-3.54e+01

Pos logit contributions
lor+0.909
Really+0.737
jo+0.727
umm+0.668
uddy+0.665
Neg logit contributions
 the-1.758
 a-1.631
 all-1.552
 in-1.466
 more-1.411
Token,
Token ID11
Feature activation-5.43e+00

Pos logit contributions
lor+1.588
arry+1.475
uddy+1.437
+1.423
etsy+1.421
Neg logit contributions
 the-0.658
 a-0.595
 all-0.512
 out-0.452
 more-0.450
Token me
Token ID502
Feature activation+5.02e+00

Pos logit contributions
lor+0.189
arry+0.170
etsy+0.168
uddy+0.167
oke+0.166
Neg logit contributions
 the-0.134
 a-0.125
 all-0.113
 up-0.110
 out-0.109
Token ll
Token ID32660
Feature activation-2.92e+01

Pos logit contributions
 the+0.112
 a+0.101
 all+0.089
 more+0.080
 out+0.080
Neg logit contributions
lor-0.205
arry-0.190
uddy-0.186
-0.185
reetings-0.182
Tokenamo
Token ID18811
Feature activation-6.46e+01

Pos logit contributions
lor+0.685
oral+0.642
jo+0.604
arry+0.599
istent+0.596
Neg logit contributions
 the-1.147
 a-1.102
 in-1.008
 out-0.985
 all-0.978
Token Emma
Token ID18966
Feature activation-1.43e+01

Pos logit contributions
lor+2.134
arry+1.921
uddy+1.857
etsy+1.823
jo+1.808
Neg logit contributions
 the-1.988
 a-1.863
 all-1.710
 more-1.593
 out-1.590
Token."
Token ID526
Feature activation-3.31e+01

Pos logit contributions
lor+0.545
arry+0.496
uddy+0.483
etsy+0.474
jo+0.471
Neg logit contributions
 the-0.385
 a-0.354
 all-0.318
 out-0.295
 in-0.293
Token
Token ID198
Feature activation-2.95e+01

Pos logit contributions
lor+1.150
ides+1.040
umm+1.003
oral+0.981
arry+0.977
Neg logit contributions
 the-1.227
 a-1.131
 her-0.982
 all-0.972
 his-0.943
Token
Token ID198
Feature activation-2.95e+01

Pos logit contributions
umm+1.175
ides+1.094
lor+1.090
istent+1.043
ider+1.007
Neg logit contributions
 the-0.986
 a-0.924
 her-0.768
 his-0.738
 all-0.701
TokenL
Token ID43
Feature activation-2.97e+01

Pos logit contributions
lor+0.881
umm+0.853
ides+0.827
uddy+0.791
+0.784
Neg logit contributions
 the-1.106
 a-1.068
 her-0.923
 all-0.888
 in-0.887
Token end
Token ID886
Feature activation-2.82e+01

Pos logit contributions
lor+0.760
arry+0.705
uddy+0.700
oke+0.691
jo+0.688
Neg logit contributions
 the-0.647
 a-0.603
 all-0.549
 out-0.536
 up-0.516
Token.
Token ID13
Feature activation-2.37e+01

Pos logit contributions
lor+1.176
arry+1.086
uddy+1.059
etsy+1.040
jo+1.035
Neg logit contributions
 the-0.654
 a-0.591
 all-0.523
 out-0.480
 in-0.471
Token
Token ID198
Feature activation-3.32e+01

Pos logit contributions
lor+0.848
ides+0.806
umm+0.761
arry+0.738
istent+0.736
Neg logit contributions
 the-0.825
 a-0.782
 all-0.625
 his-0.623
 her-0.612
Token
Token ID198
Feature activation-3.31e+01

Pos logit contributions
umm+1.585
lor+1.488
istent+1.485
ides+1.469
arians+1.417
Neg logit contributions
 the-0.849
 a-0.796
 his-0.595
 all-0.575
 her-0.556
TokenM
Token ID44
Feature activation-4.01e+01

Pos logit contributions
umm+1.180
ides+1.058
lor+1.056
istent+1.056
oke+1.002
Neg logit contributions
 the-1.132
 a-1.088
 his-0.895
 her-0.893
 in-0.859
Tokenoral
Token ID6864
Feature activation-6.41e+01

Pos logit contributions
lor+0.875
oling+0.700
etsy+0.669
istent+0.665
oled+0.662
Neg logit contributions
 the-1.704
 a-1.652
 all-1.536
 more-1.498
 in-1.498
Token value
Token ID1988
Feature activation-1.37e+01

Pos logit contributions
lor+2.630
arry+2.416
uddy+2.359
etsy+2.318
+2.303
Neg logit contributions
 the-1.565
 a-1.421
 all-1.259
 out-1.156
 in-1.149
Token:
Token ID25
Feature activation-2.86e+01

Pos logit contributions
lor+0.535
arry+0.501
uddy+0.478
oral+0.474
reetings+0.472
Neg logit contributions
 the-0.301
 a-0.290
 all-0.247
 out-0.241
 in-0.228
Token The
Token ID383
Feature activation-1.05e+01

Pos logit contributions
lor+0.878
arry+0.780
uddy+0.757
etsy+0.739
+0.733
Neg logit contributions
 the-0.997
 a-0.931
 all-0.859
 out-0.811
 in-0.811
Token story
Token ID1621
Feature activation-3.27e+01

Pos logit contributions
lor+0.364
oral+0.343
arry+0.334
uddy+0.325
+0.321
Neg logit contributions
 the-0.286
 a-0.253
 all-0.237
 out-0.235
 up-0.227
Token teaches
Token ID17324
Feature activation-2.36e+01

Pos logit contributions
lor+0.987
arry+0.872
oral+0.867
oke+0.854
uddy+0.851
Neg logit contributions
 the-1.045
 a-0.988
 all-0.897
 out-0.867
 his-0.862
Token kind
Token ID1611
Feature activation+1.09e+01

Pos logit contributions
 the+0.284
 all+0.240
 a+0.233
 up+0.227
 out+0.225
Neg logit contributions
lor-0.557
aron-0.510
reetings-0.504
 Amen-0.501
oc-0.498
Token.
Token ID13
Feature activation-1.45e+01

Pos logit contributions
 the+0.269
 a+0.244
 all+0.217
 out+0.200
 in+0.196
Neg logit contributions
lor-0.437
arry-0.402
uddy-0.392
etsy-0.384
jo-0.382
Token Thank
Token ID6952
Feature activation-3.41e+01

Pos logit contributions
lor+0.471
etsy+0.425
abby+0.421
ides+0.420
anne+0.416
Neg logit contributions
 the-0.504
 a-0.495
 his-0.422
 some-0.381
 her-0.380
Token you
Token ID345
Feature activation+3.54e+00

Pos logit contributions
lor+1.361
arry+1.247
uddy+1.211
etsy+1.193
+1.189
Neg logit contributions
 the-0.846
 a-0.772
 all-0.693
 out-0.643
 in-0.628
Token.
Token ID13
Feature activation-1.46e+01

Pos logit contributions
 the+0.042
 a+0.034
 all+0.026
 up+0.022
 out+0.021
Neg logit contributions
lor-0.182
arry-0.172
uddy-0.171
etsy-0.167
oral-0.165
Token Amen
Token ID34717
Feature activation-6.35e+01

Pos logit contributions
lor+0.480
ides+0.430
anne+0.430
arry+0.423
etsy+0.414
Neg logit contributions
 the-0.516
 a-0.495
 his-0.412
 in-0.403
 all-0.397
Token."
Token ID526
Feature activation-2.79e+01

Pos logit contributions
lor+2.570
etsy+2.327
+2.316
umm+2.268
arry+2.266
Neg logit contributions
 the-1.833
 a-1.708
 all-1.464
 in-1.410
 more-1.367
Token
Token ID198
Feature activation-2.43e+01

Pos logit contributions
lor+1.205
ides+1.108
umm+1.066
istent+1.065
jo+1.061
Neg logit contributions
 the-0.922
 a-0.858
 her-0.731
 all-0.664
 in-0.658
Token
Token ID198
Feature activation-2.43e+01

Pos logit contributions
umm+0.982
lor+0.970
ides+0.914
istent+0.914
ider+0.899
Neg logit contributions
 the-0.817
 a-0.744
 her-0.600
 all-0.592
 big-0.580
TokenThen
Token ID6423
Feature activation-2.98e+01

Pos logit contributions
lor+0.751
umm+0.729
ides+0.709
jo+0.708
istent+0.702
Neg logit contributions
 the-0.916
 a-0.857
 her-0.763
 more-0.727
 all-0.723
Token she
Token ID673
Feature activation-2.43e+00

Pos logit contributions
lor+0.972
+0.950
etsy+0.896
arry+0.894
uddy+0.885
Neg logit contributions
 the-0.556
 a-0.523
 up-0.522
 all-0.495
 shiny-0.474
Token us
Token ID514
Feature activation+9.35e+00

Pos logit contributions
 the+0.245
 a+0.233
 all+0.216
 shiny+0.212
 more+0.208
Neg logit contributions
lor-0.394
 Amen-0.346
jo-0.338
'."-0.321
anna-0.317
Token safe
Token ID3338
Feature activation+2.25e+01

Pos logit contributions
 the+0.201
 a+0.171
 all+0.151
 his+0.148
 out+0.146
Neg logit contributions
lor-0.347
arry-0.309
etsy-0.306
ides-0.299
'."-0.299
Token and
Token ID290
Feature activation+1.51e+01

Pos logit contributions
 the+0.736
 a+0.687
 all+0.630
 out+0.604
 his+0.587
Neg logit contributions
lor-0.661
arry-0.582
jo-0.547
obby-0.544
uddy-0.539
Token happy
Token ID3772
Feature activation+1.58e+01

Pos logit contributions
 the+0.334
 a+0.304
 all+0.290
 out+0.288
 up+0.265
Neg logit contributions
lor-0.509
 Amen-0.484
Really-0.482
arry-0.469
reetings-0.459
Token.
Token ID13
Feature activation-1.35e+01

Pos logit contributions
 the+0.494
 a+0.457
 all+0.417
 out+0.396
 up+0.388
Neg logit contributions
lor-0.514
arry-0.464
uddy-0.442
ides-0.434
jo-0.434
Token Amen
Token ID34717
Feature activation-6.25e+01

Pos logit contributions
lor+0.410
ides+0.371
arry+0.366
abby+0.356
unch+0.351
Neg logit contributions
 the-0.525
 a-0.522
 in-0.422
 some-0.417
 out-0.413
Token."
Token ID526
Feature activation-2.68e+01

Pos logit contributions
lor+2.526
+2.350
etsy+2.290
umm+2.287
och+2.247
Neg logit contributions
 the-1.799
 a-1.697
 all-1.438
 in-1.411
 more-1.362
Token
Token ID198
Feature activation-2.33e+01

Pos logit contributions
lor+1.004
ides+0.938
istent+0.919
umm+0.913
arry+0.880
Neg logit contributions
 the-0.988
 a-0.906
 his-0.763
 her-0.724
 their-0.719
Token
Token ID198
Feature activation-2.34e+01

Pos logit contributions
umm+0.931
istent+0.904
lor+0.894
ides+0.893
arry+0.835
Neg logit contributions
 the-0.814
 a-0.736
 big-0.585
 his-0.583
 their-0.564
Token"
Token ID1
Feature activation-2.72e+01

Pos logit contributions
umm+0.696
istent+0.692
ides+0.688
lor+0.682
+0.658
Neg logit contributions
 the-0.910
 a-0.845
 more-0.735
 in-0.712
 his-0.706
TokenA
Token ID32
Feature activation-1.07e+01

Pos logit contributions
lor+0.673
arry+0.597
uddy+0.585
jo+0.576
ides+0.568
Neg logit contributions
 the-1.142
 a-1.110
 more-0.982
 in-0.981
 all-0.974
Token up
Token ID510
Feature activation+2.10e+01

Pos logit contributions
lor+0.012
etsy+0.010
ongo+0.010
arry+0.010
enny+0.010
Neg logit contributions
 the-0.005
 a-0.004
 their-0.004
 all-0.003
 shiny-0.003
Token."
Token ID526
Feature activation-3.57e+01

Pos logit contributions
 the+0.466
 a+0.426
 all+0.384
 out+0.373
 up+0.341
Neg logit contributions
lor-0.779
arry-0.711
etsy-0.698
jo-0.682
reetings-0.677
Token
Token ID198
Feature activation-3.06e+01

Pos logit contributions
lor+1.304
ides+1.204
umm+1.174
reetings+1.122
arry+1.105
Neg logit contributions
 the-1.300
 a-1.164
 his-1.030
 her-1.026
 all-1.002
Token
Token ID198
Feature activation-3.02e+01

Pos logit contributions
umm+1.210
ides+1.139
lor+1.115
ider+1.099
istent+1.091
Neg logit contributions
 the-1.035
 a-0.933
 his-0.805
 her-0.787
 all-0.725
TokenM
Token ID44
Feature activation-3.77e+01

Pos logit contributions
umm+0.948
ides+0.896
lor+0.892
istent+0.843
+0.820
Neg logit contributions
 the-1.099
 a-1.051
 his-0.905
 her-0.905
 all-0.853
Tokenoral
Token ID6864
Feature activation-6.22e+01

Pos logit contributions
lor+0.850
oling+0.680
oled+0.641
plets+0.629
istent+0.628
Neg logit contributions
 the-1.574
 a-1.528
 all-1.432
 in-1.387
 more-1.373
Token of
Token ID286
Feature activation+6.67e+00

Pos logit contributions
lor+2.633
arry+2.404
uddy+2.381
etsy+2.373
+2.338
Neg logit contributions
 the-1.485
 a-1.358
 all-1.179
 in-1.086
 out-1.060
Token the
Token ID262
Feature activation+1.85e+01

Pos logit contributions
 the+0.008
 a+0.006
 out-0.004
 all-0.007
 up-0.010
Neg logit contributions
lor-0.384
jo-0.357
oral-0.351
uddy-0.351
arry-0.350
Token story
Token ID1621
Feature activation-3.11e+01

Pos logit contributions
 the+0.714
 a+0.672
 all+0.626
 out+0.595
 in+0.593
Neg logit contributions
lor-0.499
arry-0.437
uddy-0.420
etsy-0.409
-0.404
Token:
Token ID25
Feature activation-2.63e+01

Pos logit contributions
lor+1.119
arry+1.002
etsy+0.990
+0.989
uddy+0.978
Neg logit contributions
 the-0.934
 a-0.859
 all-0.784
 in-0.732
 out-0.721
Token Be
Token ID1355
Feature activation-1.08e+01

Pos logit contributions
lor+0.756
arry+0.653
uddy+0.648
etsy+0.634
ongo+0.624
Neg logit contributions
 the-1.021
 a-0.951
 all-0.875
 in-0.828
 out-0.814
Token Thank
Token ID6952
Feature activation-3.20e+01

Pos logit contributions
lor+0.365
arry+0.325
ongo+0.312
ides+0.310
jo+0.304
Neg logit contributions
 the-0.499
 a-0.496
 some-0.408
 in-0.406
 their-0.394
Token you
Token ID345
Feature activation+5.85e+00

Pos logit contributions
lor+1.298
arry+1.190
uddy+1.159
etsy+1.141
+1.135
Neg logit contributions
 the-0.783
 a-0.712
 all-0.634
 out-0.585
 in-0.577
Token,
Token ID11
Feature activation+5.13e+00

Pos logit contributions
 the+0.086
 a+0.073
 all+0.059
 out+0.050
 in+0.050
Neg logit contributions
lor-0.291
arry-0.272
uddy-0.268
etsy-0.263
jo-0.261
Token God
Token ID1793
Feature activation-1.08e+01

Pos logit contributions
 the+0.148
 a+0.138
 all+0.125
 up+0.124
 out+0.123
Neg logit contributions
lor-0.169
arry-0.154
oke-0.152
uddy-0.151
oral-0.150
Token.
Token ID13
Feature activation-1.21e+01

Pos logit contributions
lor+0.380
arry+0.344
uddy+0.336
etsy+0.327
jo+0.327
Neg logit contributions
 the-0.312
 a-0.287
 all-0.263
 out-0.246
 up-0.242
Token Amen
Token ID34717
Feature activation-6.13e+01

Pos logit contributions
lor+0.314
arry+0.284
ides+0.276
abs+0.272
jo+0.268
Neg logit contributions
 the-0.513
 a-0.503
 in-0.427
 some-0.411
 his-0.409
Token."
Token ID526
Feature activation-2.54e+01

Pos logit contributions
lor+2.492
umm+2.275
+2.274
etsy+2.243
arry+2.210
Neg logit contributions
 the-1.748
 a-1.628
 all-1.398
 in-1.373
 more-1.319
Token
Token ID198
Feature activation-2.19e+01

Pos logit contributions
lor+0.902
ides+0.796
umm+0.781
arry+0.777
istent+0.768
Neg logit contributions
 the-0.956
 a-0.878
 his-0.741
 all-0.738
 her-0.730
Token
Token ID198
Feature activation-2.19e+01

Pos logit contributions
umm+0.887
istent+0.828
lor+0.827
ides+0.810
ider+0.791
Neg logit contributions
 the-0.746
 a-0.687
 his-0.543
 big-0.518
 her-0.516
TokenHe
Token ID1544
Feature activation-1.91e+01

Pos logit contributions
umm+0.741
istent+0.698
ides+0.684
+0.662
lor+0.653
Neg logit contributions
 the-0.820
 a-0.765
 his-0.646
 her-0.628
 their-0.626
Token opened
Token ID4721
Feature activation+8.26e+00

Pos logit contributions
lor+0.658
oral+0.559
oke+0.552
arry+0.546
+0.531
Neg logit contributions
 the-0.518
 a-0.465
 all-0.441
 out-0.432
 up-0.408
Token them
Token ID606
Feature activation-1.32e+00

Pos logit contributions
 the+0.751
 a+0.681
 all+0.634
 out+0.593
 up+0.587
Neg logit contributions
lor-0.872
arry-0.810
uddy-0.795
jo-0.770
-0.768
Token.
Token ID13
Feature activation-2.55e+01

Pos logit contributions
lor+0.053
arry+0.048
jo+0.048
uddy+0.048
ides+0.048
Neg logit contributions
 the-0.025
 a-0.022
 all-0.018
 more-0.017
 her-0.017
Token
Token ID220
Feature activation-3.79e+01

Pos logit contributions
lor+0.889
ides+0.787
umm+0.761
arry+0.737
wl+0.732
Neg logit contributions
 the-0.936
 a-0.882
 all-0.737
 his-0.735
 her-0.707
Token
Token ID198
Feature activation-3.43e+01

Pos logit contributions
lor+1.372
umm+1.341
ider+1.249
istent+1.197
arry+1.189
Neg logit contributions
 the-1.291
 a-1.209
 all-1.032
 out-1.005
 up-0.987
Token
Token ID198
Feature activation-3.39e+01

Pos logit contributions
umm+1.405
lor+1.366
ides+1.305
istent+1.260
ider+1.230
Neg logit contributions
 the-1.097
 a-1.024
 his-0.825
 her-0.785
 all-0.778
TokenSo
Token ID2396
Feature activation-6.10e+01

Pos logit contributions
lor+1.085
umm+1.045
ides+1.007
istent+0.939
oke+0.924
Neg logit contributions
 the-1.216
 a-1.171
 his-0.976
 her-0.963
 all-0.962
Token he
Token ID339
Feature activation-3.50e+00

Pos logit contributions
lor+2.567
arry+2.442
uddy+2.410
etsy+2.387
obby+2.347
Neg logit contributions
 the-0.844
 a-0.784
 up-0.661
 out-0.651
 all-0.649
Token hopped
Token ID45230
Feature activation-1.12e+01

Pos logit contributions
lor+0.125
arry+0.114
uddy+0.112
etsy+0.111
jo+0.110
Neg logit contributions
 the-0.091
 a-0.083
 all-0.072
 out-0.072
 more-0.070
Token over
Token ID625
Feature activation+1.00e+01

Pos logit contributions
lor+0.479
arry+0.453
etsy+0.440
+0.439
uddy+0.437
Neg logit contributions
 the-0.153
 a-0.130
 all-0.103
 more-0.097
 shiny-0.094
Token to
Token ID284
Feature activation+6.09e+00

Pos logit contributions
 the+0.108
 a+0.095
 all+0.085
 out+0.079
 up+0.073
Neg logit contributions
lor-0.413
-0.403
arry-0.401
obby-0.400
jo-0.394
Token the
Token ID262
Feature activation+1.81e+01

Pos logit contributions
 the+0.033
 a+0.028
 in+0.018
 all+0.017
 more+0.016
Neg logit contributions
lor-0.296
arry-0.286
-0.282
jo-0.280
obby-0.279
Token it
Token ID340
Feature activation-3.59e+00

Pos logit contributions
lor+0.477
arry+0.438
uddy+0.423
etsy+0.413
jo+0.411
Neg logit contributions
 the-0.459
 a-0.425
 all-0.389
 out-0.364
 in-0.364
Token."
Token ID526
Feature activation-3.94e+01

Pos logit contributions
lor+0.137
arry+0.122
etsy+0.119
+0.119
ides+0.118
Neg logit contributions
 the-0.085
 a-0.076
 all-0.065
 out-0.061
 more-0.061
Token
Token ID220
Feature activation-3.89e+01

Pos logit contributions
lor+1.522
ides+1.343
umm+1.317
arry+1.280
reetings+1.260
Neg logit contributions
 the-1.391
 a-1.273
 her-1.084
 his-1.084
 all-1.061
Token
Token ID198
Feature activation-3.47e+01

Pos logit contributions
lor+1.573
umm+1.572
ider+1.499
ides+1.405
Really+1.400
Neg logit contributions
 the-1.223
 a-1.115
 all-0.929
 up-0.913
 out-0.898
Token
Token ID198
Feature activation-3.41e+01

Pos logit contributions
umm+1.483
lor+1.434
ides+1.391
istent+1.319
ider+1.303
Neg logit contributions
 the-1.109
 a-1.009
 his-0.841
 her-0.833
 all-0.725
TokenSo
Token ID2396
Feature activation-6.10e+01

Pos logit contributions
lor+1.133
umm+1.109
ides+1.065
istent+0.979
oke+0.966
Neg logit contributions
 the-1.219
 a-1.157
 her-1.000
 his-0.985
 in-0.917
Token Benny
Token ID44275
Feature activation-1.00e+01

Pos logit contributions
lor+2.514
arry+2.397
uddy+2.383
etsy+2.350
jo+2.302
Neg logit contributions
 the-0.990
 a-0.908
 all-0.783
 out-0.750
 up-0.746
Token went
Token ID1816
Feature activation-6.01e+00

Pos logit contributions
lor+0.393
arry+0.368
jo+0.364
uddy+0.361
etsy+0.358
Neg logit contributions
 the-0.225
 a-0.203
 all-0.182
 out-0.173
 more-0.165
Token over
Token ID625
Feature activation+9.77e+00

Pos logit contributions
lor+0.240
uddy+0.229
arry+0.229
och+0.222
jo+0.222
Neg logit contributions
 the-0.088
 a-0.079
 all-0.065
 shiny-0.058
 any-0.052
Token to
Token ID284
Feature activation+5.87e+00

Pos logit contributions
 the+0.123
 a+0.110
 all+0.109
 more+0.091
 out+0.090
Neg logit contributions
lor-0.394
obby-0.388
jo-0.387
arry-0.386
-0.386
Token the
Token ID262
Feature activation+1.81e+01

Pos logit contributions
 a+0.020
 all+0.019
 more+0.018
 the+0.017
 on+0.014
Neg logit contributions
obby-0.274
lor-0.274
jo-0.273
-0.271
arry-0.270
Token happen
Token ID1645
Feature activation-1.31e+01

Pos logit contributions
 the+0.127
 up+0.124
 out+0.118
 a+0.113
 back+0.100
Neg logit contributions
lor-0.438
-0.423
mmm-0.420
mmmm-0.408
oney-0.397
Token.
Token ID13
Feature activation-2.48e+01

Pos logit contributions
lor+0.508
arry+0.455
uddy+0.440
jo+0.440
oral+0.437
Neg logit contributions
 the-0.309
 a-0.283
 all-0.247
 out-0.234
 in-0.221
Token
Token ID220
Feature activation-3.79e+01

Pos logit contributions
lor+0.837
ides+0.759
umm+0.721
arry+0.712
oral+0.699
Neg logit contributions
 the-0.918
 a-0.859
 all-0.730
 his-0.718
 in-0.703
Token
Token ID198
Feature activation-3.43e+01

Pos logit contributions
lor+1.401
umm+1.379
ider+1.303
ides+1.249
istent+1.224
Neg logit contributions
 the-1.265
 a-1.169
 all-0.992
 up-0.950
 out-0.950
Token
Token ID198
Feature activation-3.40e+01

Pos logit contributions
umm+1.408
lor+1.334
ides+1.303
istent+1.248
ider+1.248
Neg logit contributions
 the-1.109
 a-1.029
 his-0.838
 her-0.816
 all-0.799
TokenSo
Token ID2396
Feature activation-6.08e+01

Pos logit contributions
umm+1.084
lor+1.066
ides+1.030
oke+0.963
istent+0.957
Neg logit contributions
 the-1.213
 a-1.168
 her-0.976
 his-0.968
 all-0.951
Token the
Token ID262
Feature activation+1.53e+01

Pos logit contributions
lor+2.562
arry+2.432
uddy+2.410
etsy+2.381
jo+2.329
Neg logit contributions
 the-0.916
 a-0.856
 all-0.713
 out-0.700
 up-0.698
Token turtles
Token ID36288
Feature activation-1.66e+01

Pos logit contributions
 the+0.369
 a+0.328
 all+0.299
 out+0.287
 up+0.273
Neg logit contributions
lor-0.599
arry-0.555
uddy-0.546
oral-0.535
etsy-0.534
Token decided
Token ID3066
Feature activation-1.65e+01

Pos logit contributions
lor+0.560
arry+0.527
uddy+0.510
etsy+0.503
jo+0.501
Neg logit contributions
 the-0.442
 a-0.416
 out-0.350
 up-0.346
 all-0.345
Token that
Token ID326
Feature activation-3.00e-01

Pos logit contributions
lor+0.681
arry+0.616
etsy+0.610
uddy+0.602
jo+0.587
Neg logit contributions
 the-0.236
 a-0.214
 all-0.186
 out-0.174
 more-0.165
Token whenever
Token ID8797
Feature activation-1.87e+01

Pos logit contributions
lor+0.012
arry+0.011
uddy+0.011
oral+0.011
obby+0.011
Neg logit contributions
 the-0.005
 a-0.004
 all-0.004
 out-0.004
 up-0.004
Token playing
Token ID2712
Feature activation+2.25e+00

Pos logit contributions
 the+0.161
 a+0.142
 all+0.110
 more+0.109
 her+0.108
Neg logit contributions
uddy-0.344
arry-0.338
-0.336
lor-0.328
etsy-0.323
Token.
Token ID13
Feature activation-2.31e+01

Pos logit contributions
 the+0.034
 a+0.028
 all+0.023
 out+0.021
 more+0.021
Neg logit contributions
lor-0.096
uddy-0.088
-0.088
jo-0.087
arry-0.087
Token
Token ID220
Feature activation-3.71e+01

Pos logit contributions
lor+0.827
ides+0.736
umm+0.694
arry+0.689
oral+0.683
Neg logit contributions
 the-0.833
 a-0.786
 all-0.672
 his-0.650
 her-0.638
Token
Token ID198
Feature activation-3.35e+01

Pos logit contributions
lor+1.404
umm+1.401
ider+1.319
ides+1.251
Really+1.243
Neg logit contributions
 the-1.230
 a-1.134
 all-0.980
 out-0.938
 up-0.922
Token
Token ID198
Feature activation-3.33e+01

Pos logit contributions
umm+1.364
lor+1.305
ides+1.275
ider+1.215
istent+1.209
Neg logit contributions
 the-1.081
 a-1.011
 his-0.811
 her-0.796
 all-0.790
TokenSo
Token ID2396
Feature activation-6.07e+01

Pos logit contributions
lor+1.078
umm+1.044
ides+1.016
oke+0.937
istent+0.929
Neg logit contributions
 the-1.191
 a-1.144
 her-0.951
 all-0.939
 his-0.937
Token,
Token ID11
Feature activation-4.83e+00

Pos logit contributions
lor+2.554
arry+2.443
uddy+2.416
etsy+2.390
obby+2.336
Neg logit contributions
 the-0.856
 a-0.786
 up-0.659
 out-0.650
 all-0.645
Token Tim
Token ID5045
Feature activation-1.16e+00

Pos logit contributions
lor+0.200
arry+0.190
uddy+0.188
etsy+0.186
jo+0.182
Neg logit contributions
 the-0.084
 a-0.076
 all-0.066
 up-0.063
 out-0.063
Tokenmy
Token ID1820
Feature activation-6.37e+00

Pos logit contributions
lor+0.032
arry+0.029
jo+0.029
uddy+0.028
ongo+0.027
Neg logit contributions
 the-0.040
 a-0.038
 all-0.035
 out-0.034
 more-0.033
Token had
Token ID550
Feature activation-5.95e+00

Pos logit contributions
lor+0.243
jo+0.230
arry+0.227
uddy+0.224
etsy+0.218
Neg logit contributions
 the-0.139
 a-0.127
 all-0.113
 out-0.107
 more-0.106
Token a
Token ID257
Feature activation+2.13e+01

Pos logit contributions
lor+0.275
jo+0.242
enny+0.237
anne+0.236
obby+0.236
Neg logit contributions
 the-0.056
 up-0.046
 out-0.039
 in-0.039
 all-0.034
Token ride
Token ID6594
Feature activation-1.02e+01

Pos logit contributions
lor+0.561
etsy+0.451
uddy+0.443
wl+0.426
och+0.424
Neg logit contributions
 the-0.684
 a-0.642
 all-0.601
 his-0.573
 out-0.537
Token.
Token ID13
Feature activation-2.36e+01

Pos logit contributions
lor+0.488
arry+0.452
jo+0.450
uddy+0.445
etsy+0.440
Neg logit contributions
 the-0.124
 a-0.105
 all-0.083
 more-0.073
 out-0.069
Token
Token ID220
Feature activation-3.72e+01

Pos logit contributions
lor+0.822
ides+0.742
umm+0.692
arry+0.688
oral+0.679
Neg logit contributions
 the-0.861
 a-0.811
 all-0.694
 his-0.672
 her-0.657
Token
Token ID198
Feature activation-3.40e+01

Pos logit contributions
umm+1.362
lor+1.357
ider+1.275
ides+1.211
Really+1.204
Neg logit contributions
 the-1.236
 a-1.154
 all-0.992
 out-0.953
 up-0.935
Token
Token ID198
Feature activation-3.37e+01

Pos logit contributions
umm+1.363
lor+1.304
ides+1.280
ider+1.248
istent+1.227
Neg logit contributions
 the-1.099
 a-1.024
 his-0.829
 her-0.809
 all-0.808
TokenSo
Token ID2396
Feature activation-6.04e+01

Pos logit contributions
lor+1.054
umm+1.030
ides+0.998
oke+0.937
istent+0.927
Neg logit contributions
 the-1.218
 a-1.171
 her-0.981
 his-0.972
 all-0.971
Token she
Token ID673
Feature activation-9.86e+00

Pos logit contributions
lor+2.570
arry+2.449
uddy+2.412
etsy+2.390
obby+2.332
Neg logit contributions
 the-0.882
 a-0.816
 all-0.669
 up-0.669
 out-0.668
Token took
Token ID1718
Feature activation+1.50e+00

Pos logit contributions
lor+0.360
arry+0.329
uddy+0.319
jo+0.319
etsy+0.315
Neg logit contributions
 the-0.256
 a-0.235
 all-0.205
 out-0.201
 more-0.197
Token off
Token ID572
Feature activation+7.25e+00

Pos logit contributions
 the+0.007
 dirty+0.006
 shiny+0.005
 all+0.005
 wet+0.004
Neg logit contributions
lor-0.065
etsy-0.061
-0.061
jo-0.060
uddy-0.060
Token down
Token ID866
Feature activation+1.45e+01

Pos logit contributions
 the+0.064
 more+0.062
 a+0.056
 shiny+0.050
 fair+0.047
Neg logit contributions
-0.293
lor-0.289
jo-0.280
arry-0.279
etsy-0.278
Token the
Token ID262
Feature activation+1.77e+01

Pos logit contributions
 up+0.040
 all+0.023
 more+0.023
 shiny+0.020
 in+0.018
Neg logit contributions
lor-0.627
-0.621
arry-0.620
och-0.606
jo-0.605
Token safe
Token ID3338
Feature activation+8.25e+00

Pos logit contributions
lor+0.309
reetings+0.303
arry+0.302
ongo+0.299
oke+0.298
Neg logit contributions
 the-0.193
 a-0.168
 all-0.160
 up-0.153
 out-0.146
Token to
Token ID284
Feature activation+1.58e-01

Pos logit contributions
 the+0.177
 a+0.167
 all+0.143
 more+0.133
 out+0.131
Neg logit contributions
lor-0.315
arry-0.279
uddy-0.276
etsy-0.274
jo-0.272
Token get
Token ID651
Feature activation+6.18e+00

Pos logit contributions
 the+0.003
 a+0.003
 all+0.003
 out+0.003
 up+0.003
Neg logit contributions
lor-0.006
-0.005
arry-0.005
ides-0.005
uddy-0.005
Token too
Token ID1165
Feature activation-1.51e-01

Pos logit contributions
 the+0.079
 a+0.068
 all+0.059
 shiny+0.046
 more+0.044
Neg logit contributions
lor-0.244
etsy-0.235
och-0.222
reetings-0.221
obby-0.219
Token close
Token ID1969
Feature activation+1.12e+01

Pos logit contributions
lor+0.006
arry+0.006
etsy+0.006
uddy+0.005
obby+0.005
Neg logit contributions
 the-0.003
 a-0.003
 all-0.002
 out-0.002
 up-0.002
Token.''
Token ID13531
Feature activation-6.03e+01

Pos logit contributions
 the+0.216
 a+0.200
 all+0.174
 out+0.154
 more+0.150
Neg logit contributions
lor-0.467
arry-0.429
jo-0.419
uddy-0.415
obby-0.413
Token
Token ID198
Feature activation-3.13e+01

Pos logit contributions
lor+1.969
ides+1.758
umm+1.745
arry+1.726
oral+1.680
Neg logit contributions
 the-2.281
 a-2.110
 all-1.825
 in-1.796
 his-1.781
Token
Token ID198
Feature activation-3.10e+01

Pos logit contributions
umm+1.183
lor+1.141
ides+1.112
istent+1.048
ider+1.028
Neg logit contributions
 the-1.054
 a-0.981
 his-0.804
 all-0.789
 her-0.754
TokenP
Token ID47
Feature activation-4.00e+01

Pos logit contributions
umm+1.005
lor+0.970
ides+0.945
istent+0.884
arry+0.873
Neg logit contributions
 the-1.092
 a-1.058
 his-0.864
 all-0.859
 her-0.835
Tokenoppy
Token ID26696
Feature activation-3.26e+01

Pos logit contributions
lor+0.705
oling+0.557
umm+0.552
arry+0.551
jo+0.542
Neg logit contributions
 the-1.864
 a-1.776
 all-1.707
 more-1.606
 in-1.590
Token understood
Token ID7247
Feature activation-1.29e+01

Pos logit contributions
lor+1.223
arry+1.111
uddy+1.076
jo+1.062
etsy+1.059
Neg logit contributions
 the-0.878
 a-0.808
 all-0.724
 out-0.693
 in-0.671
Token yard
Token ID12699
Feature activation-1.64e+01

Pos logit contributions
lor+0.526
ides+0.434
reetings+0.425
arry+0.420
oral+0.419
Neg logit contributions
 the-0.663
 a-0.619
 all-0.564
 out-0.539
 in-0.528
Token!
Token ID0
Feature activation-2.71e+01

Pos logit contributions
lor+0.685
arry+0.624
uddy+0.617
+0.613
jo+0.609
Neg logit contributions
 the-0.334
 a-0.297
 all-0.254
 out-0.236
 more-0.234
Token
Token ID220
Feature activation-3.64e+01

Pos logit contributions
lor+0.895
ides+0.784
arry+0.765
umm+0.747
oral+0.735
Neg logit contributions
 the-1.002
 a-0.954
 all-0.818
 in-0.793
 out-0.777
Token
Token ID198
Feature activation-3.36e+01

Pos logit contributions
lor+1.272
umm+1.245
ider+1.157
Really+1.122
ides+1.119
Neg logit contributions
 the-1.276
 a-1.186
 all-1.038
 out-0.999
 her-0.973
Token
Token ID198
Feature activation-3.35e+01

Pos logit contributions
umm+1.318
lor+1.260
ides+1.246
istent+1.198
ider+1.183
Neg logit contributions
 the-1.108
 a-1.045
 his-0.840
 all-0.832
 her-0.824
TokenSo
Token ID2396
Feature activation-6.03e+01

Pos logit contributions
lor+1.019
umm+0.999
ides+0.972
istent+0.910
oke+0.903
Neg logit contributions
 the-1.236
 a-1.197
 her-0.995
 all-0.993
 his-0.986
Token,
Token ID11
Feature activation-4.97e+00

Pos logit contributions
lor+2.516
arry+2.383
uddy+2.350
etsy+2.334
obby+2.282
Neg logit contributions
 the-0.893
 a-0.829
 up-0.695
 out-0.685
 all-0.682
Token she
Token ID673
Feature activation-8.39e+00

Pos logit contributions
lor+0.204
arry+0.191
uddy+0.189
etsy+0.188
jo+0.183
Neg logit contributions
 the-0.090
 a-0.082
 all-0.072
 more-0.069
 up-0.068
Token dragged
Token ID17901
Feature activation-1.31e+01

Pos logit contributions
lor+0.309
arry+0.283
uddy+0.278
etsy+0.276
jo+0.275
Neg logit contributions
 the-0.213
 a-0.195
 all-0.171
 out-0.164
 more-0.162
Token her
Token ID607
Feature activation+1.46e+01

Pos logit contributions
+0.556
etsy+0.539
uddy+0.531
arry+0.522
och+0.517
Neg logit contributions
 the-0.050
 shiny-0.048
 more-0.046
 a-0.040
 fair-0.033
Token wheel
Token ID7825
Feature activation-7.33e+00

Pos logit contributions
 the+0.361
 a+0.343
 all+0.311
 more+0.293
 his+0.280
Neg logit contributions
uddy-0.481
lor-0.465
etsy-0.461
-0.457
jo-0.441
Token forever
Token ID8097
Feature activation-3.03e+01

Pos logit contributions
lor+0.177
arry+0.162
jo+0.159
uddy+0.159
ides+0.156
Neg logit contributions
 the-0.092
 a-0.084
 all-0.071
 out-0.070
 up-0.069
Token.
Token ID13
Feature activation-2.45e+01

Pos logit contributions
lor+1.274
arry+1.166
uddy+1.130
etsy+1.121
jo+1.120
Neg logit contributions
 the-0.639
 a-0.580
 all-0.497
 out-0.476
 up-0.455
Token
Token ID220
Feature activation-3.72e+01

Pos logit contributions
lor+0.836
ides+0.809
umm+0.766
arry+0.737
abs+0.732
Neg logit contributions
 the-0.869
 a-0.813
 all-0.689
 his-0.663
 her-0.642
Token
Token ID198
Feature activation-3.36e+01

Pos logit contributions
lor+1.301
umm+1.269
ider+1.179
ides+1.155
arry+1.146
Neg logit contributions
 the-1.272
 a-1.197
 all-1.032
 out-0.986
 up-0.977
Token
Token ID198
Feature activation-3.32e+01

Pos logit contributions
umm+1.290
lor+1.261
ides+1.242
istent+1.201
ider+1.178
Neg logit contributions
 the-1.091
 a-1.007
 all-0.838
 his-0.818
 her-0.805
TokenSo
Token ID2396
Feature activation-6.03e+01

Pos logit contributions
umm+1.073
lor+1.040
ides+1.039
istent+0.995
arry+0.976
Neg logit contributions
 the-1.169
 a-1.109
 all-0.941
 her-0.919
 his-0.905
Token one
Token ID530
Feature activation+7.39e+00

Pos logit contributions
lor+2.653
arry+2.497
uddy+2.467
etsy+2.446
obby+2.391
Neg logit contributions
 the-0.874
 a-0.824
 all-0.661
 out-0.656
 up-0.635
Token day
Token ID1110
Feature activation-9.68e+00

Pos logit contributions
 the+0.193
 a+0.175
 all+0.168
 out+0.165
 up+0.156
Neg logit contributions
lor-0.234
jo-0.228
uddy-0.225
arry-0.224
oral-0.219
Token they
Token ID484
Feature activation-8.24e+00

Pos logit contributions
lor+0.398
arry+0.376
etsy+0.364
uddy+0.364
oke+0.360
Neg logit contributions
 the-0.144
 a-0.137
 all-0.132
 up-0.122
 out-0.115
Token decided
Token ID3066
Feature activation-1.57e+01

Pos logit contributions
lor+0.282
arry+0.264
etsy+0.251
jo+0.250
uddy+0.249
Neg logit contributions
 the-0.220
 a-0.206
 up-0.176
 out-0.174
 more-0.173
Token to
Token ID284
Feature activation+6.66e+00

Pos logit contributions
lor+0.648
arry+0.583
etsy+0.572
uddy+0.556
anne+0.556
Neg logit contributions
 the-0.219
 a-0.198
 all-0.167
 out-0.158
 more-0.155
Token away
Token ID1497
Feature activation+1.82e+01

Pos logit contributions
lor+0.514
arry+0.504
uddy+0.489
etsy+0.485
jo+0.474
Neg logit contributions
 the-0.196
 a-0.172
 all-0.128
 shiny-0.115
 big-0.107
Token.
Token ID13
Feature activation-1.93e+01

Pos logit contributions
 the+0.410
 a+0.382
 all+0.328
 more+0.308
 out+0.299
Neg logit contributions
lor-0.657
arry-0.648
uddy-0.610
etsy-0.602
jo-0.587
Token
Token ID198
Feature activation-2.91e+01

Pos logit contributions
lor+0.692
ides+0.668
umm+0.619
reetings+0.613
oral+0.613
Neg logit contributions
 the-0.674
 a-0.626
 all-0.530
 his-0.523
 her-0.494
Token
Token ID198
Feature activation-2.88e+01

Pos logit contributions
umm+1.298
ides+1.129
lor+1.125
istent+1.113
ider+1.075
Neg logit contributions
 the-0.916
 a-0.855
 his-0.679
 all-0.663
 her-0.641
TokenM
Token ID44
Feature activation-3.50e+01

Pos logit contributions
umm+0.989
lor+0.918
ides+0.898
istent+0.863
oke+0.847
Neg logit contributions
 the-0.992
 a-0.954
 all-0.790
 his-0.788
 her-0.775
Tokenoral
Token ID6864
Feature activation-6.02e+01

Pos logit contributions
lor+0.673
etsy+0.496
+0.485
oling+0.485
och+0.471
Neg logit contributions
 the-1.565
 a-1.509
 all-1.436
 in-1.397
 more-1.379
Token of
Token ID286
Feature activation+8.35e+00

Pos logit contributions
lor+2.492
arry+2.286
uddy+2.238
etsy+2.209
+2.192
Neg logit contributions
 the-1.461
 a-1.328
 all-1.171
 in-1.072
 out-1.072
Token the
Token ID262
Feature activation+1.98e+01

Pos logit contributions
 a+0.023
 the+0.022
 out+0.017
 all+0.011
 up+0.008
Neg logit contributions
lor-0.454
oral-0.417
jo-0.415
uddy-0.410
reetings-0.408
Token story
Token ID1621
Feature activation-2.99e+01

Pos logit contributions
 the+0.797
 a+0.753
 all+0.697
 out+0.685
 in+0.674
Neg logit contributions
lor-0.480
arry-0.416
uddy-0.400
oral-0.394
-0.377
Token:
Token ID25
Feature activation-2.50e+01

Pos logit contributions
lor+1.041
arry+0.924
+0.918
etsy+0.915
uddy+0.898
Neg logit contributions
 the-0.931
 a-0.858
 all-0.784
 in-0.738
 out-0.729
Token When
Token ID1649
Feature activation-2.95e+01

Pos logit contributions
lor+0.683
arry+0.588
uddy+0.573
etsy+0.566
+0.559
Neg logit contributions
 the-0.988
 a-0.923
 all-0.857
 in-0.816
 out-0.808
Token like
Token ID588
Feature activation-1.04e+01

Pos logit contributions
lor+0.544
ides+0.487
arry+0.475
oral+0.461
etsy+0.457
Neg logit contributions
 the-0.661
 a-0.635
 all-0.544
 in-0.522
 out-0.522
Token me
Token ID502
Feature activation+3.40e-01

Pos logit contributions
lor+0.439
jo+0.413
pill+0.402
uddy+0.401
arry+0.395
Neg logit contributions
 the-0.113
 a-0.096
 up-0.091
 out-0.086
 all-0.083
Token."
Token ID526
Feature activation-3.88e+01

Pos logit contributions
 the+0.009
 a+0.008
 all+0.007
 out+0.006
 more+0.006
Neg logit contributions
lor-0.013
arry-0.012
uddy-0.012
jo-0.012
etsy-0.012
Token
Token ID198
Feature activation-3.30e+01

Pos logit contributions
lor+1.497
ides+1.340
umm+1.309
arry+1.242
reetings+1.226
Neg logit contributions
 the-1.399
 a-1.270
 her-1.095
 his-1.093
 all-1.054
Token
Token ID198
Feature activation-3.27e+01

Pos logit contributions
umm+1.520
lor+1.376
ides+1.359
istent+1.340
ider+1.329
Neg logit contributions
 the-1.030
 a-0.932
 his-0.780
 her-0.758
 their-0.693
TokenSo
Token ID2396
Feature activation-6.02e+01

Pos logit contributions
umm+1.150
lor+1.089
ides+1.044
istent+1.002
oke+0.982
Neg logit contributions
 the-1.136
 a-1.073
 her-0.938
 his-0.927
 their-0.867
Token,
Token ID11
Feature activation-4.82e+00

Pos logit contributions
lor+2.604
arry+2.468
uddy+2.449
etsy+2.393
jo+2.371
Neg logit contributions
 the-0.974
 a-0.883
 all-0.760
 out-0.708
 up-0.698
Token Billy
Token ID15890
Feature activation-4.72e+00

Pos logit contributions
lor+0.178
arry+0.166
uddy+0.166
etsy+0.162
jo+0.159
Neg logit contributions
 the-0.113
 a-0.105
 all-0.096
 out-0.091
 up-0.090
Token started
Token ID2067
Feature activation+1.81e+00

Pos logit contributions
lor+0.190
arry+0.177
jo+0.174
uddy+0.172
etsy+0.169
Neg logit contributions
 the-0.101
 a-0.093
 all-0.083
 out-0.077
 more-0.075
Token to
Token ID284
Feature activation+7.04e+00

Pos logit contributions
 the+0.026
 a+0.023
 all+0.022
 more+0.020
 out+0.019
Neg logit contributions
lor-0.067
jo-0.066
uddy-0.064
obby-0.064
arry-0.063
Token practice
Token ID3357
Feature activation+5.18e+00

Pos logit contributions
 the+0.176
 a+0.162
 all+0.148
 out+0.140
 in+0.139
Neg logit contributions
lor-0.235
arry-0.227
jo-0.221
istent-0.219
uddy-0.218
Token pear
Token ID25286
Feature activation-7.58e+00

Pos logit contributions
 the+0.095
 a+0.086
 all+0.077
 out+0.074
 up+0.074
Neg logit contributions
lor-0.110
arry-0.105
etsy-0.103
uddy-0.102
-0.100
Token.
Token ID13
Feature activation-1.59e+01

Pos logit contributions
lor+0.286
arry+0.260
uddy+0.258
etsy+0.254
+0.254
Neg logit contributions
 the-0.187
 a-0.172
 all-0.143
 out-0.141
 up-0.136
Token It
Token ID632
Feature activation-7.57e+00

Pos logit contributions
lor+0.758
arry+0.675
reetings+0.675
ides+0.671
jo+0.671
Neg logit contributions
 the-0.419
 a-0.366
 all-0.302
 her-0.290
 out-0.276
Token was
Token ID373
Feature activation+8.87e+00

Pos logit contributions
lor+0.219
uddy+0.203
etsy+0.202
+0.202
jo+0.193
Neg logit contributions
 the-0.210
 a-0.195
 out-0.180
 all-0.177
 more-0.173
Token hard
Token ID1327
Feature activation+1.40e+01

Pos logit contributions
 the+0.116
 out+0.108
 up+0.099
 back+0.096
 away+0.086
Neg logit contributions
omo-0.286
etsy-0.282
omm-0.274
uddy-0.272
-0.270
Token and
Token ID290
Feature activation+1.27e+01

Pos logit contributions
 the+0.253
 a+0.201
 their+0.188
 more+0.156
 out+0.156
Neg logit contributions
arry-0.501
lor-0.497
-0.494
jo-0.489
ides-0.488
Token not
Token ID407
Feature activation+4.94e+00

Pos logit contributions
 out+0.242
 the+0.238
 a+0.237
 up+0.214
 in+0.204
Neg logit contributions
etsy-0.421
Really-0.418
uddy-0.406
-0.403
lor-0.403
Token very
Token ID845
Feature activation+4.94e+00

Pos logit contributions
 the+0.081
 up+0.070
 a+0.068
 out+0.061
 their+0.057
Neg logit contributions
etsy-0.173
istent-0.167
lor-0.162
ongo-0.162
arry-0.161
Token tasty
Token ID25103
Feature activation+1.65e+01

Pos logit contributions
 the+0.091
 a+0.083
 out+0.072
 all+0.066
 more+0.065
Neg logit contributions
arry-0.196
etsy-0.189
ongo-0.185
uddy-0.185
lor-0.184
Token.
Token ID13
Feature activation-1.64e+01

Pos logit contributions
 the+0.396
 a+0.354
 all+0.303
 out+0.283
 up+0.281
Neg logit contributions
lor-0.618
arry-0.574
uddy-0.571
-0.561
umm-0.561
Token
Token ID198
Feature activation-2.62e+01

Pos logit contributions
lor+0.813
reetings+0.743
ides+0.739
jo+0.731
arry+0.728
Neg logit contributions
 the-0.420
 a-0.360
 all-0.300
 her-0.291
 up-0.268
Token why
Token ID1521
Feature activation-2.73e+01

Pos logit contributions
lor+0.096
arry+0.090
Really+0.090
obby+0.089
uddy+0.088
Neg logit contributions
 the-0.039
 a-0.032
 all-0.030
 out-0.029
 up-0.026
Token her
Token ID607
Feature activation+1.16e+01

Pos logit contributions
lor+1.169
arry+1.109
uddy+1.099
oke+1.088
+1.080
Neg logit contributions
 the-0.418
 a-0.398
 out-0.346
 all-0.340
 up-0.323
Token dad
Token ID9955
Feature activation-1.93e+01

Pos logit contributions
 the+0.302
 a+0.275
 all+0.249
 out+0.235
 in+0.227
Neg logit contributions
lor-0.437
arry-0.399
uddy-0.392
etsy-0.383
jo-0.382
Token said
Token ID531
Feature activation-1.37e+01

Pos logit contributions
lor+0.694
arry+0.634
uddy+0.612
jo+0.606
+0.606
Neg logit contributions
 the-0.531
 a-0.493
 all-0.443
 out-0.421
 in-0.408
Token to
Token ID284
Feature activation+5.26e+00

Pos logit contributions
lor+0.576
arry+0.516
oke+0.515
etsy+0.514
uddy+0.513
Neg logit contributions
 the-0.242
 a-0.222
 all-0.197
 out-0.182
 up-0.177
Token stay
Token ID2652
Feature activation+9.68e+00

Pos logit contributions
 the+0.092
 a+0.084
 all+0.074
 out+0.071
 up+0.070
Neg logit contributions
lor-0.225
arry-0.210
-0.208
etsy-0.204
uddy-0.203
Token away
Token ID1497
Feature activation+1.81e+01

Pos logit contributions
 the+0.154
 a+0.133
 all+0.121
 more+0.096
 out+0.092
Neg logit contributions
lor-0.409
arry-0.370
 Amen-0.364
etsy-0.360
obby-0.359
Token from
Token ID422
Feature activation+1.96e-01

Pos logit contributions
 the+0.384
 a+0.343
 all+0.303
 out+0.276
 up+0.271
Neg logit contributions
lor-0.721
arry-0.684
uddy-0.650
obby-0.649
jo-0.643
Token it
Token ID340
Feature activation+4.31e+00

Pos logit contributions
 the+0.002
 a+0.002
 all+0.002
 out+0.001
 shiny+0.001
Neg logit contributions
lor-0.008
jo-0.008
-0.008
reetings-0.008
obby-0.008
Token!
Token ID0
Feature activation-2.24e+01

Pos logit contributions
 the+0.102
 a+0.092
 all+0.081
 out+0.075
 up+0.074
Neg logit contributions
lor-0.173
arry-0.158
uddy-0.155
jo-0.153
etsy-0.152
Token From
Token ID3574
Feature activation-3.13e+01

Pos logit contributions
lor+0.835
ides+0.830
umm+0.754
istent+0.739
ider+0.737
Neg logit contributions
 the-0.753
 a-0.675
 her-0.561
 his-0.549
 all-0.528
TokenOnce
Token ID7454
Feature activation-4.09e+01

Pos logit contributions
lor+1.021
istent+1.006
ides+0.986
uer+0.970
umm+0.968
Neg logit contributions
 the-1.243
 a-1.145
 on-1.004
 in-1.000
 all-0.992
Token there
Token ID612
Feature activation+7.02e+00

Pos logit contributions
lor+1.397
arry+1.216
ides+1.181
uddy+1.173
+1.171
Neg logit contributions
 the-1.373
 a-1.252
 all-1.165
 in-1.088
 out-1.083
Token was
Token ID373
Feature activation+9.76e+00

Pos logit contributions
 the+0.171
 a+0.154
 all+0.136
 in+0.124
 out+0.124
Neg logit contributions
lor-0.293
arry-0.269
uddy-0.262
etsy-0.257
ides-0.256
Token a
Token ID257
Feature activation+2.62e+01

Pos logit contributions
 the+0.051
 all+0.012
 a+0.007
 out-0.002
 up-0.008
Neg logit contributions
lor-0.529
arry-0.507
uddy-0.507
etsy-0.505
unch-0.505
Token sad
Token ID6507
Feature activation+1.22e+01

Pos logit contributions
 the+0.473
 a+0.417
 all+0.336
 in+0.304
 out+0.289
Neg logit contributions
lor-1.288
arry-1.168
ides-1.149
uddy-1.142
jo-1.131
Token little
Token ID1310
Feature activation+1.46e+01

Pos logit contributions
 the+0.304
 a+0.276
 all+0.245
 out+0.231
 up+0.226
Neg logit contributions
lor-0.471
arry-0.445
uddy-0.433
etsy-0.423
jo-0.422
Token boy
Token ID2933
Feature activation-1.36e+01

Pos logit contributions
 the+0.462
 a+0.429
 all+0.399
 out+0.382
 up+0.377
Neg logit contributions
arry-0.401
lor-0.399
uddy-0.392
etsy-0.378
och-0.378
Token.
Token ID13
Feature activation-1.48e+01

Pos logit contributions
lor+0.458
arry+0.411
uddy+0.398
etsy+0.390
+0.387
Neg logit contributions
 the-0.438
 a-0.407
 all-0.371
 in-0.351
 out-0.350
Token He
Token ID679
Feature activation-9.13e+00

Pos logit contributions
lor+0.504
ides+0.408
etsy+0.408
oral+0.402
arry+0.399
Neg logit contributions
 the-0.573
 a-0.534
 his-0.473
 all-0.469
 in-0.445
Token was
Token ID373
Feature activation+1.03e+01

Pos logit contributions
lor+0.285
jo+0.260
arry+0.256
uddy+0.253
etsy+0.246
Neg logit contributions
 the-0.276
 a-0.252
 out-0.236
 all-0.234
 up-0.223
Token very
Token ID845
Feature activation+6.97e+00

Pos logit contributions
 the+0.218
 off+0.176
 away+0.175
 more+0.174
 out+0.174
Neg logit contributions
 Amen-0.282
uddy-0.275
umpy-0.272
-0.271
ider-0.270
Token and
Token ID290
Feature activation+1.62e+01

Pos logit contributions
oling+0.142
uddy+0.141
oled+0.140
arry+0.139
oc+0.137
Neg logit contributions
 the-0.078
 a-0.067
 all-0.059
 shiny-0.053
 up-0.052
Token cry
Token ID3960
Feature activation-5.12e+00

Pos logit contributions
 the+0.479
 a+0.459
 all+0.439
 in+0.420
 more+0.416
Neg logit contributions
lor-0.414
 Deal-0.388
uddy-0.382
-0.374
arry-0.374
Token.
Token ID13
Feature activation-1.24e+01

Pos logit contributions
lor+0.190
arry+0.181
uddy+0.176
jo+0.170
oling+0.170
Neg logit contributions
 the-0.129
 a-0.113
 all-0.100
 up-0.092
 in-0.090
Token They
Token ID1119
Feature activation-1.28e+01

Pos logit contributions
lor+0.447
ides+0.397
istent+0.388
arry+0.382
och+0.382
Neg logit contributions
 the-0.459
 a-0.432
 his-0.368
 their-0.355
 all-0.353
Token drop
Token ID4268
Feature activation+1.90e+01

Pos logit contributions
lor+0.408
arry+0.385
uddy+0.384
+0.368
ides+0.367
Neg logit contributions
 the-0.327
 a-0.303
 out-0.265
 up-0.261
 more-0.256
Token the
Token ID262
Feature activation+2.57e+01

Pos logit contributions
 going+0.106
 all+0.102
 a+0.094
 shiny+0.091
 in+0.082
Neg logit contributions
umbs-0.741
uddy-0.734
lor-0.715
oc-0.712
azz-0.706
Token cookies
Token ID14746
Feature activation+7.22e+00

Pos logit contributions
 the+0.723
 a+0.652
 all+0.606
 his+0.569
 out+0.553
Neg logit contributions
lor-0.846
uddy-0.794
-0.788
oral-0.781
etsy-0.776
Token and
Token ID290
Feature activation+1.61e+01

Pos logit contributions
 the+0.131
 a+0.124
 shiny+0.093
 all+0.093
 an+0.090
Neg logit contributions
lor-0.278
-0.267
uddy-0.258
oling-0.256
och-0.256
Token juice
Token ID13135
Feature activation+1.09e+01

Pos logit contributions
 a+0.206
 up+0.204
 all+0.204
 the+0.196
 out+0.190
Neg logit contributions
oc-0.537
 Deal-0.524
lor-0.520
 Amen-0.516
-0.509
Token.
Token ID13
Feature activation-1.25e+01

Pos logit contributions
 the+0.223
 a+0.206
 all+0.162
 more+0.152
 up+0.149
Neg logit contributions
lor-0.433
uddy-0.396
-0.395
oling-0.387
och-0.384
Token They
Token ID1119
Feature activation-1.28e+01

Pos logit contributions
lor+0.437
ides+0.383
istent+0.379
arry+0.379
reetings+0.379
Neg logit contributions
 the-0.469
 a-0.441
 their-0.370
 his-0.367
 all-0.360
Token diamond
Token ID15291
Feature activation-1.05e+01

Pos logit contributions
 the+0.518
 a+0.477
 all+0.433
 out+0.420
 more+0.408
Neg logit contributions
lor-0.570
arry-0.505
-0.485
etsy-0.483
oral-0.477
Token?
Token ID30
Feature activation-1.69e+01

Pos logit contributions
lor+0.315
arry+0.272
ongo+0.259
uddy+0.253
jo+0.252
Neg logit contributions
 the-0.345
 a-0.319
 all-0.295
 more-0.279
 out-0.277
Token Give
Token ID13786
Feature activation-2.52e+01

Pos logit contributions
lor+0.464
oral+0.426
arry+0.413
umm+0.410
ides+0.408
Neg logit contributions
 the-0.657
 a-0.642
 in-0.559
 out-0.549
 on-0.539
Token it
Token ID340
Feature activation+1.04e+01

Pos logit contributions
lor+1.098
arry+0.954
ongo+0.948
etsy+0.934
uddy+0.925
Neg logit contributions
 the-0.395
 a-0.369
 all-0.307
 more-0.302
 in-0.297
Token back
Token ID736
Feature activation+2.55e+01

Pos logit contributions
 the+0.149
 a+0.116
 all+0.110
 in+0.098
 more+0.092
Neg logit contributions
lor-0.432
uddy-0.399
ongo-0.397
arry-0.388
etsy-0.386
Token to
Token ID284
Feature activation+1.37e+01

Pos logit contributions
 the+0.704
 a+0.658
 all+0.619
 more+0.574
 out+0.568
Neg logit contributions
lor-0.834
arry-0.744
jo-0.723
uddy-0.719
obby-0.708
Token me
Token ID502
Feature activation+1.36e+01

Pos logit contributions
 the+0.305
 a+0.292
 all+0.255
 in+0.250
 out+0.249
Neg logit contributions
lor-0.530
arry-0.468
-0.452
mmmm-0.452
uddy-0.452
Token,
Token ID11
Feature activation+4.27e+00

Pos logit contributions
 the+0.446
 a+0.413
 all+0.378
 out+0.359
 more+0.356
Neg logit contributions
lor-0.423
arry-0.377
uddy-0.357
jo-0.352
etsy-0.348
Token now
Token ID783
Feature activation+1.87e+01

Pos logit contributions
 the+0.099
 a+0.091
 all+0.081
 more+0.078
 up+0.076
Neg logit contributions
lor-0.158
uddy-0.143
arry-0.142
oke-0.141
och-0.139
Token!"
Token ID2474
Feature activation-2.89e+01

Pos logit contributions
 the+0.546
 a+0.517
 all+0.474
 more+0.456
 out+0.447
Neg logit contributions
lor-0.598
arry-0.518
uddy-0.492
-0.491
etsy-0.482
Token
Token ID198
Feature activation-2.27e+01

Pos logit contributions
lor+1.023
arry+0.887
ides+0.880
uddy+0.872
umm+0.867
Neg logit contributions
 the-0.994
 a-0.914
 all-0.801
 in-0.765
 out-0.759
Token not
Token ID407
Feature activation+7.83e+00

Pos logit contributions
 the+0.043
 a+0.038
 out+0.036
 all+0.035
 in+0.033
Neg logit contributions
lor-0.070
oral-0.066
-0.066
arry-0.065
uddy-0.065
Token want
Token ID765
Feature activation-4.28e+00

Pos logit contributions
 the+0.242
 a+0.221
 all+0.202
 up+0.189
 in+0.187
Neg logit contributions
lor-0.244
arry-0.237
uddy-0.228
jo-0.223
ides-0.222
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
uddy+0.153
etsy+0.137
lor+0.136
jo+0.135
anne+0.133
Neg logit contributions
 the-0.056
 up-0.055
 all-0.053
 on-0.053
 in-0.053
Token make
Token ID787
Feature activation+1.50e+01

Pos logit contributions
 the+0.268
 a+0.255
 all+0.214
 out+0.204
 up+0.193
Neg logit contributions
lor-0.497
arry-0.493
uddy-0.486
ongo-0.472
-0.472
Token the
Token ID262
Feature activation+2.52e+01

Pos logit contributions
 the+0.119
 in+0.104
 on+0.086
 a+0.086
 all+0.082
Neg logit contributions
lor-0.674
arry-0.631
uddy-0.619
jo-0.615
obby-0.611
Token lady
Token ID10846
Feature activation-9.19e+00

Pos logit contributions
 the+0.778
 a+0.724
 all+0.653
 in+0.625
 out+0.624
Neg logit contributions
lor-0.826
arry-0.760
uddy-0.726
oral-0.723
-0.711
Token mad
Token ID8805
Feature activation+5.45e+00

Pos logit contributions
lor+0.331
jo+0.325
oke+0.323
arry+0.318
uddy+0.316
Neg logit contributions
 the-0.191
 a-0.175
 all-0.162
 out-0.133
 in-0.131
Token.
Token ID13
Feature activation-1.33e+01

Pos logit contributions
 the+0.131
 a+0.119
 all+0.104
 out+0.094
 in+0.094
Neg logit contributions
lor-0.217
arry-0.203
uddy-0.198
jo-0.192
etsy-0.192
Token But
Token ID887
Feature activation-2.66e+01

Pos logit contributions
lor+0.417
reetings+0.351
och+0.349
ides+0.345
unch+0.338
Neg logit contributions
 the-0.520
 a-0.488
 all-0.440
 in-0.416
 out-0.414
Token they
Token ID484
Feature activation+6.97e-02

Pos logit contributions
Really+0.820
uddy+0.813
+0.801
ama+0.780
elope+0.778
Neg logit contributions
 shiny-0.368
 big-0.350
 up-0.335
 out-0.323
 all-0.320
Token also
Token ID635
Feature activation+3.93e+00

Pos logit contributions
 the+0.002
 a+0.002
 up+0.002
 out+0.002
 more+0.002
Neg logit contributions
lor-0.002
arry-0.002
uddy-0.002
etsy-0.002
-0.002
Token run
Token ID1057
Feature activation+1.22e+01

Pos logit contributions
lor+0.435
arry+0.410
uddy+0.396
+0.393
ongo+0.388
Neg logit contributions
 the-0.338
 a-0.314
 out-0.278
 up-0.273
 all-0.270
Token to
Token ID284
Feature activation+1.46e+01

Pos logit contributions
 the+0.193
 a+0.161
 all+0.138
 her+0.129
 shiny+0.127
Neg logit contributions
lor-0.455
uddy-0.453
arry-0.448
etsy-0.440
obby-0.428
Token their
Token ID511
Feature activation+2.43e+01

Pos logit contributions
 in+0.064
 on+0.061
 a+0.054
 shiny+0.046
 out+0.040
Neg logit contributions
 Today-0.600
obby-0.594
-0.591
 Deal-0.589
 Amen-0.587
Token mom
Token ID1995
Feature activation-6.40e+00

Pos logit contributions
 the+0.669
 a+0.646
 out+0.563
 all+0.553
 her+0.524
Neg logit contributions
lor-0.771
uddy-0.726
arry-0.702
obby-0.695
-0.690
Token and
Token ID290
Feature activation+1.67e+01

Pos logit contributions
lor+0.274
arry+0.252
jo+0.251
uddy+0.250
och+0.247
Neg logit contributions
 the-0.120
 a-0.109
 all-0.097
 out-0.081
 more-0.079
Token tell
Token ID1560
Feature activation-2.01e+00

Pos logit contributions
 the+0.409
 a+0.408
 all+0.381
 more+0.376
 in+0.348
Neg logit contributions
lor-0.484
uddy-0.463
arry-0.453
 Deal-0.447
och-0.440
Token her
Token ID607
Feature activation+2.23e+01

Pos logit contributions
lor+0.075
oral+0.072
jo+0.070
 Amen+0.069
arry+0.069
Neg logit contributions
 the-0.032
 a-0.030
 in-0.027
 all-0.026
 on-0.026
Token what
Token ID644
Feature activation+4.32e+00

Pos logit contributions
 the+0.434
 a+0.388
 all+0.337
 out+0.333
 in+0.326
Neg logit contributions
lor-0.922
arry-0.841
uddy-0.820
oral-0.810
jo-0.801
Token happened
Token ID3022
Feature activation-9.68e+00

Pos logit contributions
 the+0.117
 a+0.109
 all+0.105
 out+0.101
 up+0.099
Neg logit contributions
lor-0.147
uddy-0.133
arry-0.132
oral-0.131
jo-0.130
Token.
Token ID13
Feature activation-1.21e+01

Pos logit contributions
lor+0.398
arry+0.365
uddy+0.356
etsy+0.350
jo+0.348
Neg logit contributions
 the-0.233
 a-0.211
 all-0.187
 out-0.171
 in-0.170
Token Mom
Token ID11254
Feature activation-5.01e+00

Pos logit contributions
lor+0.485
ides+0.451
umm+0.440
istent+0.439
reetings+0.431
Neg logit contributions
 the-0.410
 a-0.378
 their-0.332
 her-0.324
 his-0.311
Token slide
Token ID10649
Feature activation+4.17e+00

Pos logit contributions
 the+0.762
 a+0.728
 out+0.646
 all+0.646
 in+0.621
Neg logit contributions
lor-0.774
uddy-0.702
-0.697
arry-0.690
ongo-0.669
Token and
Token ID290
Feature activation+1.58e+01

Pos logit contributions
 the+0.074
 a+0.067
 all+0.053
 out+0.048
 in+0.045
Neg logit contributions
lor-0.177
arry-0.169
uddy-0.165
-0.164
jo-0.162
Token looks
Token ID3073
Feature activation+9.96e+00

Pos logit contributions
 a+0.465
 the+0.465
 all+0.421
 out+0.402
 in+0.399
Neg logit contributions
lor-0.405
Really-0.367
oling-0.359
-0.357
arry-0.355
Token down
Token ID866
Feature activation+2.33e+01

Pos logit contributions
 the+0.169
 a+0.160
 all+0.119
 an+0.104
 some+0.103
Neg logit contributions
oke-0.355
arry-0.355
jo-0.344
lor-0.343
oling-0.342
Token.
Token ID13
Feature activation-1.23e+01

Pos logit contributions
 the+0.544
 a+0.503
 all+0.438
 out+0.411
 more+0.396
Neg logit contributions
lor-0.792
arry-0.769
ider-0.749
-0.728
oling-0.717
Token He
Token ID679
Feature activation-7.31e+00

Pos logit contributions
lor+0.399
ides+0.350
istent+0.342
arry+0.335
jo+0.330
Neg logit contributions
 the-0.467
 a-0.446
 his-0.385
 all-0.381
 out-0.367
Token feels
Token ID5300
Feature activation+9.35e-01

Pos logit contributions
lor+0.214
arry+0.195
oke+0.190
jo+0.188
+0.183
Neg logit contributions
 the-0.222
 a-0.211
 all-0.192
 out-0.191
 in-0.186
Token nervous
Token ID10927
Feature activation+5.38e-01

Pos logit contributions
 all+0.008
 the+0.007
 a+0.006
 out+0.005
 in+0.005
Neg logit contributions
lor-0.039
och-0.038
unch-0.037
oling-0.037
arry-0.037
Token,
Token ID11
Feature activation+4.85e+00

Pos logit contributions
 the+0.013
 a+0.012
 all+0.011
 out+0.009
 up+0.009
Neg logit contributions
lor-0.019
arry-0.018
jo-0.018
oke-0.018
uddy-0.017
Token but
Token ID475
Feature activation-5.72e+00

Pos logit contributions
 the+0.141
 a+0.134
 all+0.128
 up+0.127
 out+0.125
Neg logit contributions
lor-0.122
Really-0.116
oling-0.115
uddy-0.113
etsy-0.112
Token he
Token ID339
Feature activation+6.73e+00

Pos logit contributions
Really+0.147
oke+0.139
lor+0.138
Well+0.138
uddy+0.137
Neg logit contributions
 the-0.150
 a-0.143
 out-0.142
 all-0.137
 up-0.128
Token share
Token ID2648
Feature activation+8.74e+00

Pos logit contributions
lor+0.103
arry+0.095
uddy+0.093
etsy+0.091
jo+0.091
Neg logit contributions
 the-0.054
 a-0.049
 all-0.043
 out-0.039
 in-0.038
Token it
Token ID340
Feature activation+1.07e+01

Pos logit contributions
 the+0.111
 a+0.101
 all+0.089
 out+0.088
 up+0.078
Neg logit contributions
lor-0.385
oc-0.344
mmm-0.340
jo-0.337
arry-0.337
Token with
Token ID351
Feature activation+1.63e+01

Pos logit contributions
 the+0.218
 a+0.189
 out+0.160
 all+0.158
 up+0.142
Neg logit contributions
lor-0.435
arry-0.397
uddy-0.387
etsy-0.381
ider-0.381
Token Sam
Token ID3409
Feature activation+1.15e+01

Pos logit contributions
 the+0.336
 a+0.300
 all+0.267
 up+0.257
 out+0.255
Neg logit contributions
lor-0.675
arry-0.617
uddy-0.613
jo-0.608
etsy-0.595
Token when
Token ID618
Feature activation+5.08e+00

Pos logit contributions
 the+0.282
 a+0.257
 all+0.229
 out+0.217
 up+0.207
Neg logit contributions
lor-0.445
arry-0.403
uddy-0.397
jo-0.391
etsy-0.390
Token he
Token ID339
Feature activation+6.19e+00

Pos logit contributions
 the+0.134
 a+0.124
 all+0.116
 out+0.109
 up+0.108
Neg logit contributions
lor-0.171
arry-0.155
uddy-0.154
jo-0.149
obby-0.147
Token comes
Token ID2058
Feature activation+1.77e+00

Pos logit contributions
 the+0.190
 a+0.176
 all+0.161
 out+0.152
 in+0.149
Neg logit contributions
lor-0.205
arry-0.184
ides-0.176
uddy-0.175
oke-0.175
Token over
Token ID625
Feature activation+1.77e+01

Pos logit contributions
 the+0.029
 a+0.025
 all+0.023
 more+0.020
 out+0.017
Neg logit contributions
lor-0.076
arry-0.072
uddy-0.070
jo-0.069
obby-0.069
Token."
Token ID526
Feature activation-2.67e+01

Pos logit contributions
 the+0.459
 a+0.425
 all+0.394
 out+0.375
 up+0.363
Neg logit contributions
lor-0.629
arry-0.569
jo-0.545
obby-0.544
-0.543
Token
Token ID198
Feature activation-2.26e+01

Pos logit contributions
lor+1.109
umm+0.984
ides+0.980
reetings+0.950
istent+0.944
Neg logit contributions
 the-0.931
 a-0.850
 her-0.721
 all-0.671
 to-0.638
Token
Token ID198
Feature activation-2.26e+01

Pos logit contributions
umm+0.888
lor+0.877
ider+0.821
ides+0.819
istent+0.813
Neg logit contributions
 the-0.796
 a-0.708
 her-0.583
 all-0.578
 big-0.562
Token
Token ID198
Feature activation-2.68e+01

Pos logit contributions
lor+0.579
ides+0.521
reetings+0.519
och+0.509
oral+0.504
Neg logit contributions
 the-0.639
 a-0.603
 all-0.532
 his-0.500
 out-0.485
Token
Token ID198
Feature activation-2.67e+01

Pos logit contributions
umm+1.111
lor+1.002
ides+0.996
istent+0.977
ider+0.960
Neg logit contributions
 the-0.873
 a-0.810
 all-0.659
 in-0.640
 on-0.637
TokenInside
Token ID24441
Feature activation-2.00e+01

Pos logit contributions
lor+0.757
umm+0.705
ides+0.685
arry+0.676
oke+0.663
Neg logit contributions
 the-1.024
 a-0.980
 all-0.870
 in-0.847
 on-0.838
Token the
Token ID262
Feature activation+2.13e+01

Pos logit contributions
lor+1.058
uddy+1.006
+0.999
arry+0.996
obby+0.963
Neg logit contributions
 the-0.108
 a-0.101
 all-0.052
 more-0.049
 out-0.044
Token gate
Token ID8946
Feature activation-5.60e+00

Pos logit contributions
 the+0.684
 a+0.628
 all+0.577
 his+0.567
 in+0.548
Neg logit contributions
lor-0.670
arry-0.608
uddy-0.583
oral-0.570
etsy-0.564
Token,
Token ID11
Feature activation+1.01e+00

Pos logit contributions
lor+0.242
arry+0.228
uddy+0.221
+0.215
jo+0.215
Neg logit contributions
 the-0.105
 a-0.095
 all-0.081
 out-0.073
 up-0.071
Token Spot
Token ID15899
Feature activation+3.04e-01

Pos logit contributions
 the+0.025
 a+0.023
 more+0.021
 all+0.021
 out+0.020
Neg logit contributions
lor-0.034
uddy-0.033
arry-0.032
etsy-0.032
-0.031
Token saw
Token ID2497
Feature activation+7.41e+00

Pos logit contributions
 the+0.007
 a+0.006
 all+0.006
 out+0.005
 in+0.005
Neg logit contributions
lor-0.013
arry-0.012
uddy-0.012
etsy-0.011
jo-0.011
Token many
Token ID867
Feature activation+1.57e+01

Pos logit contributions
 the+0.045
 in+0.022
 out+0.021
 up+0.017
 all+0.013
Neg logit contributions
lor-0.378
uddy-0.348
arry-0.344
ongo-0.341
Really-0.339
Token other
Token ID584
Feature activation+2.17e+01

Pos logit contributions
 the+0.304
 a+0.292
 out+0.264
 up+0.243
 his+0.234
Neg logit contributions
arry-0.579
lor-0.559
oke-0.540
oc-0.528
oral-0.523
Token dogs
Token ID6844
Feature activation+2.67e+00

Pos logit contributions
 the+0.462
 a+0.420
 his+0.369
 out+0.356
 up+0.349
Neg logit contributions
lor-0.811
arry-0.805
uddy-0.767
ari-0.739
etsy-0.739
Token fly
Token ID6129
Feature activation+1.07e+01

Pos logit contributions
 the+0.145
 up+0.126
 all+0.112
 a+0.108
 in+0.103
Neg logit contributions
lor-0.446
etsy-0.432
uddy-0.431
arry-0.418
anne-0.417
Token and
Token ID290
Feature activation+1.55e+01

Pos logit contributions
 the+0.185
 a+0.150
 ice+0.134
 his+0.127
 all+0.121
Neg logit contributions
lor-0.397
arry-0.388
jo-0.364
ark-0.360
uddy-0.358
Token jump
Token ID4391
Feature activation+1.31e+01

Pos logit contributions
 the+0.358
 a+0.334
 all+0.302
 his+0.293
 in+0.290
Neg logit contributions
lor-0.479
-0.451
arry-0.439
uddy-0.431
Really-0.430
Token around
Token ID1088
Feature activation+1.42e+01

Pos logit contributions
 the+0.174
 a+0.129
 his+0.103
 more+0.102
 all+0.086
Neg logit contributions
lor-0.571
arry-0.547
uddy-0.532
etsy-0.525
-0.520
Token.
Token ID13
Feature activation-1.32e+01

Pos logit contributions
 the+0.278
 a+0.261
 more+0.228
 all+0.225
 out+0.223
Neg logit contributions
lor-0.485
-0.468
arry-0.450
jo-0.434
'."-0.423
Token
Token ID220
Feature activation-2.60e+01

Pos logit contributions
lor+0.481
oke+0.422
ides+0.420
reetings+0.413
oral+0.412
Neg logit contributions
 the-0.487
 a-0.459
 all-0.397
 her-0.391
 his-0.374
Token
Token ID198
Feature activation-2.33e+01

Pos logit contributions
umm+1.036
lor+1.022
ider+0.958
istent+0.942
ides+0.937
Neg logit contributions
 the-0.830
 a-0.771
 all-0.672
 out-0.641
 in-0.626
Token
Token ID198
Feature activation-2.34e+01

Pos logit contributions
umm+0.967
lor+0.935
ides+0.897
ider+0.878
istent+0.868
Neg logit contributions
 the-0.747
 a-0.695
 all-0.565
 her-0.545
 in-0.541
TokenSo
Token ID2396
Feature activation-4.99e+01

Pos logit contributions
lor+0.748
ides+0.664
oke+0.661
umm+0.657
+0.634
Neg logit contributions
 the-0.896
 a-0.867
 all-0.744
 her-0.731
 in-0.727
Token the
Token ID262
Feature activation+2.47e+01

Pos logit contributions
arry+1.690
lor+1.667
uddy+1.648
 Today+1.607
etsy+1.569
Neg logit contributions
 the-0.918
 a-0.851
 up-0.788
 out-0.779
 all-0.760
Token butterfly
Token ID35113
Feature activation-6.30e+00

Pos logit contributions
 the+0.720
 a+0.660
 his+0.616
 out+0.603
 all+0.595
Neg logit contributions
lor-0.793
arry-0.726
-0.711
etsy-0.704
pillar-0.694
Token can
Token ID460
Feature activation-6.61e+00

Pos logit contributions
 the+0.011
 a+0.010
 all+0.009
 out+0.008
 in+0.008
Neg logit contributions
lor-0.013
arry-0.012
uddy-0.012
etsy-0.011
-0.011
Token have
Token ID423
Feature activation+6.77e+00

Pos logit contributions
lor+0.288
arry+0.266
uddy+0.260
etsy+0.255
+0.255
Neg logit contributions
 the-0.144
 a-0.129
 all-0.112
 out-0.102
 in-0.101
Token a
Token ID257
Feature activation+2.38e+01

Pos logit contributions
 up+0.050
 the+0.049
 out+0.047
 in+0.041
 all+0.038
Neg logit contributions
lor-0.316
mmmm-0.281
mmm-0.278
-0.271
obby-0.271
Token bite
Token ID13197
Feature activation+8.34e+00

Pos logit contributions
 the+0.602
 a+0.511
 all+0.493
 her+0.449
 in+0.446
Neg logit contributions
lor-0.887
uddy-0.832
arry-0.814
-0.811
etsy-0.796
Token.
Token ID13
Feature activation-1.68e+01

Pos logit contributions
 the+0.215
 a+0.195
 all+0.177
 up+0.162
 out+0.161
Neg logit contributions
lor-0.309
arry-0.282
uddy-0.281
etsy-0.269
jo-0.268
Token But
Token ID887
Feature activation-3.00e+01

Pos logit contributions
lor+0.482
arry+0.418
jo+0.411
anne+0.395
etsy+0.393
Neg logit contributions
 the-0.648
 a-0.634
 some-0.524
 more-0.522
 all-0.521
Token only
Token ID691
Feature activation+1.03e+01

Pos logit contributions
lor+0.956
Really+0.910
mmm+0.902
arry+0.880
+0.878
Neg logit contributions
 the-0.689
 a-0.644
 all-0.594
 out-0.592
 up-0.586
Token a
Token ID257
Feature activation+2.46e+01

Pos logit contributions
 the+0.160
 all+0.143
 a+0.127
 up+0.121
 out+0.119
Neg logit contributions
lor-0.385
mmm-0.376
Really-0.371
reetings-0.369
mmmm-0.364
Token small
Token ID1402
Feature activation+2.47e+01

Pos logit contributions
 the+0.551
 a+0.477
 all+0.446
 up+0.397
 out+0.393
Neg logit contributions
lor-1.009
uddy-0.930
arry-0.926
etsy-0.906
-0.900
Token one
Token ID530
Feature activation+1.29e+01

Pos logit contributions
 the+0.599
 a+0.524
 all+0.499
 up+0.461
 her+0.450
Neg logit contributions
lor-0.902
arry-0.848
uddy-0.845
oc-0.838
jo-0.832
Token.
Token ID13
Feature activation-1.60e+01

Pos logit contributions
 the+0.336
 a+0.311
 all+0.289
 out+0.264
 up+0.261
Neg logit contributions
lor-0.462
arry-0.421
uddy-0.420
oke-0.413
jo-0.410
Token and
Token ID290
Feature activation+1.56e+01

Pos logit contributions
 a+0.060
 the+0.055
 out+0.039
 all+0.034
 away+0.031
Neg logit contributions
arry-0.459
lor-0.449
oc-0.446
jo-0.436
obby-0.425
Token go
Token ID467
Feature activation+1.44e+01

Pos logit contributions
 the+0.377
 a+0.354
 all+0.296
 out+0.295
 more+0.291
Neg logit contributions
lor-0.507
jo-0.485
arry-0.483
 Amen-0.470
oc-0.467
Token on
Token ID319
Feature activation+1.92e+01

Pos logit contributions
 the+0.185
 a+0.151
 all+0.097
 more+0.097
 her+0.091
Neg logit contributions
lor-0.628
arry-0.627
uddy-0.589
ongo-0.588
jo-0.578
Token adventures
Token ID17545
Feature activation-7.46e+00

Pos logit contributions
 the+0.452
 out+0.406
 a+0.401
 in+0.394
 all+0.371
Neg logit contributions
lor-0.637
arry-0.579
-0.566
jo-0.546
oral-0.540
Token.
Token ID13
Feature activation-1.34e+01

Pos logit contributions
lor+0.255
arry+0.244
uddy+0.237
jo+0.236
+0.233
Neg logit contributions
 the-0.188
 a-0.167
 all-0.143
 out-0.142
 more-0.138
Token One
Token ID1881
Feature activation-2.69e+01

Pos logit contributions
lor+0.485
ider+0.395
ides+0.382
och+0.380
etsy+0.378
Neg logit contributions
 the-0.516
 a-0.477
 all-0.440
 his-0.437
 out-0.402
Token day
Token ID1110
Feature activation-1.36e+00

Pos logit contributions
lor+0.854
uddy+0.836
arry+0.829
jo+0.823
ongo+0.786
Neg logit contributions
 the-0.737
 a-0.669
 all-0.643
 out-0.607
 up-0.592
Token,
Token ID11
Feature activation+4.08e+00

Pos logit contributions
lor+0.057
arry+0.056
uddy+0.056
+0.054
etsy+0.054
Neg logit contributions
 the-0.025
 a-0.022
 all-0.020
 more-0.018
 up-0.018
Token Tim
Token ID5045
Feature activation+7.16e+00

Pos logit contributions
 the+0.107
 a+0.099
 more+0.096
 all+0.096
 up+0.094
Neg logit contributions
-0.121
uddy-0.116
arry-0.111
ongo-0.109
etsy-0.107
Tokenmy
Token ID1820
Feature activation+3.64e-01

Pos logit contributions
 the+0.237
 a+0.221
 all+0.203
 out+0.191
 in+0.191
Neg logit contributions
lor-0.233
arry-0.209
uddy-0.202
etsy-0.198
-0.197
Token and
Token ID290
Feature activation+1.54e+01

Pos logit contributions
 the+0.009
 a+0.008
 all+0.007
 more+0.007
 out+0.007
Neg logit contributions
lor-0.014
arry-0.013
uddy-0.013
jo-0.013
etsy-0.013
Token Tom
Token ID4186
Feature activation-4.05e-01

Pos logit contributions
 the+0.170
 a+0.156
 all+0.149
 up+0.146
 out+0.145
Neg logit contributions
lor-0.226
arry-0.205
uddy-0.202
etsy-0.200
oke-0.197
Token,
Token ID11
Feature activation-3.58e+00

Pos logit contributions
lor+0.018
arry+0.016
uddy+0.016
etsy+0.016
+0.016
Neg logit contributions
 the-0.008
 a-0.007
 all-0.006
 out-0.006
 up-0.006
Token what
Token ID644
Feature activation-3.08e+00

Pos logit contributions
lor+0.124
uddy+0.114
oke+0.114
arry+0.112
etsy+0.111
Neg logit contributions
 the-0.090
 a-0.084
 all-0.077
 up-0.075
 out-0.072
Token is
Token ID318
Feature activation-1.22e+00

Pos logit contributions
lor+0.096
oke+0.090
uddy+0.085
arry+0.084
oral+0.084
Neg logit contributions
 the-0.085
 a-0.076
 all-0.075
 up-0.073
 out-0.072
Token wrong
Token ID2642
Feature activation-6.65e+00

Pos logit contributions
lor+0.037
oke+0.034
och+0.033
wl+0.032
mmm+0.032
Neg logit contributions
 the-0.026
 a-0.024
 more-0.023
 all-0.023
 back-0.023
Token?
Token ID30
Feature activation-2.27e+01

Pos logit contributions
lor+0.226
arry+0.206
uddy+0.199
jo+0.193
etsy+0.192
Neg logit contributions
 the-0.197
 a-0.181
 all-0.168
 out-0.157
 more-0.154
Token Why
Token ID4162
Feature activation-4.16e+01

Pos logit contributions
lor+0.627
arry+0.539
ides+0.532
umm+0.523
ongo+0.515
Neg logit contributions
 the-0.924
 a-0.874
 in-0.770
 all-0.756
 out-0.741
Token are
Token ID389
Feature activation+3.26e+00

Pos logit contributions
lor+1.191
oke+1.159
uddy+1.132
arry+1.093
ides+1.043
Neg logit contributions
 the-1.153
 a-1.114
 out-1.038
 more-1.019
 up-1.015
Token you
Token ID345
Feature activation+8.88e-01

Pos logit contributions
 the+0.088
 out+0.085
 up+0.083
 a+0.083
 all+0.077
Neg logit contributions
lor-0.089
uddy-0.084
arry-0.081
oke-0.081
mmm-0.080
Token not
Token ID407
Feature activation+3.74e+00

Pos logit contributions
 the+0.021
 a+0.018
 more+0.017
 his+0.016
 all+0.015
Neg logit contributions
lor-0.029
istent-0.028
oke-0.028
oling-0.028
obby-0.028
Token playing
Token ID2712
Feature activation+1.24e+01

Pos logit contributions
 the+0.088
 a+0.078
 all+0.070
 up+0.068
 more+0.066
Neg logit contributions
lor-0.125
oke-0.123
arry-0.120
reetings-0.120
aring-0.117
Token,
Token ID11
Feature activation+1.28e+00

Pos logit contributions
arry+0.475
lor+0.474
enny+0.469
obby+0.466
oral+0.461
Neg logit contributions
 the-0.197
 a-0.192
 more-0.159
 up-0.157
 shiny-0.153
Token
Token ID6184
Feature activation-3.73e+01

Pos logit contributions
 the+0.040
 a+0.038
 all+0.034
 in+0.032
 out+0.032
Neg logit contributions
lor-0.046
arry-0.041
uddy-0.040
jo-0.039
ides-0.039
Token
Token ID95
Feature activation-5.06e+01

Pos logit contributions
lor+0.934
uddy+0.784
arry+0.779
jo+0.771
etsy+0.762
Neg logit contributions
 the-1.522
 a-1.432
 all-1.330
 in-1.285
 out-1.268
Token
Token ID26391
Feature activation-3.80e+01

Pos logit contributions
lor+1.149
uddy+1.016
arry+1.015
etsy+1.001
jo+0.938
Neg logit contributions
 the-2.170
 a-2.058
 all-1.877
 in-1.820
 out-1.812
Token
Token ID129
Feature activation-3.62e+01

Pos logit contributions
lor+0.841
oral+0.738
arry+0.713
etsy+0.690
och+0.688
Neg logit contributions
 the-1.694
 a-1.598
 all-1.450
 in-1.398
 out-1.380
Token
Token ID241
Feature activation-3.90e+01

Pos logit contributions
lor+0.855
arry+0.733
uddy+0.702
etsy+0.679
+0.670
Neg logit contributions
 the-1.512
 a-1.431
 all-1.340
 out-1.282
 in-1.277
TokenCan
Token ID6090
Feature activation-2.94e+01

Pos logit contributions
lor+1.268
ides+1.106
uddy+1.096
arry+1.096
oke+1.049
Neg logit contributions
 the-1.435
 a-1.408
 all-1.186
 in-1.136
 her-1.124
Token I
Token ID314
Feature activation-1.27e+01

Pos logit contributions
lor+0.882
arry+0.799
uddy+0.770
+0.765
etsy+0.754
Neg logit contributions
 the-0.951
 a-0.885
 all-0.830
 out-0.805
 up-0.785
Token play
Token ID711
Feature activation+1.48e+01

Pos logit contributions
lor+0.478
arry+0.460
+0.452
uddy+0.436
ides+0.432
Neg logit contributions
 the-0.286
 a-0.252
 all-0.226
 up-0.223
 more-0.212
Token with
Token ID351
Feature activation+1.26e+01

Pos logit contributions
 the+0.263
 a+0.218
 all+0.204
 her+0.202
 up+0.186
Neg logit contributions
lor-0.592
-0.539
oc-0.531
arry-0.519
''-0.518
Token the
Token ID262
Feature activation+2.15e+01

Pos logit contributions
 the+0.241
 up+0.228
 a+0.225
 out+0.225
 in+0.216
Neg logit contributions
lor-0.444
mmm-0.404
mmmm-0.396
och-0.394
-0.394
Token my
Token ID616
Feature activation+1.67e+01

Pos logit contributions
 the+0.091
 a+0.073
 out+0.071
 in+0.058
 all+0.041
Neg logit contributions
lor-0.975
-0.865
arry-0.864
mmm-0.844
'."-0.844
Token back
Token ID736
Feature activation+2.39e+01

Pos logit contributions
 the+0.437
 a+0.408
 out+0.365
 all+0.348
 up+0.339
Neg logit contributions
lor-0.589
-0.516
uddy-0.514
jo-0.508
arry-0.508
Token."
Token ID526
Feature activation-2.77e+01

Pos logit contributions
 the+0.702
 a+0.652
 all+0.589
 out+0.552
 in+0.549
Neg logit contributions
lor-0.840
arry-0.759
uddy-0.732
jo-0.718
etsy-0.705
Token
Token ID198
Feature activation-2.40e+01

Pos logit contributions
lor+1.020
reetings+0.940
ider+0.903
umm+0.899
och+0.898
Neg logit contributions
 the-0.985
 a-0.902
 all-0.760
 in-0.722
 his-0.718
Token
Token ID198
Feature activation-2.41e+01

Pos logit contributions
umm+0.952
lor+0.910
ider+0.877
istent+0.874
och+0.855
Neg logit contributions
 the-0.796
 a-0.733
 all-0.611
 in-0.584
 big-0.576
TokenSo
Token ID2396
Feature activation-5.06e+01

Pos logit contributions
lor+0.738
+0.666
och+0.658
oke+0.657
arry+0.655
Neg logit contributions
 the-0.874
 a-0.825
 all-0.747
 more-0.707
 in-0.696
Token,
Token ID11
Feature activation+3.24e+00

Pos logit contributions
arry+1.810
lor+1.807
uddy+1.765
oc+1.706
+1.678
Neg logit contributions
 a-0.866
 the-0.831
 out-0.745
 up-0.736
 all-0.702
Token the
Token ID262
Feature activation+2.37e+01

Pos logit contributions
 up+0.008
 out+0.006
 a+0.005
 back+0.003
 the+0.003
Neg logit contributions
-0.155
Really-0.150
oc-0.149
lor-0.147
arry-0.145
Token little
Token ID1310
Feature activation+1.50e+01

Pos logit contributions
 the+0.518
 a+0.444
 all+0.408
 out+0.406
 up+0.389
Neg logit contributions
lor-0.941
uddy-0.883
arry-0.872
oral-0.872
-0.871
Token mouse
Token ID10211
Feature activation-5.65e+00

Pos logit contributions
 the+0.491
 a+0.446
 all+0.441
 out+0.425
 up+0.421
Neg logit contributions
lor-0.376
uddy-0.369
arry-0.362
-0.357
oc-0.355
Token climbed
Token ID19952
Feature activation-1.22e+00

Pos logit contributions
lor+0.234
arry+0.214
jo+0.210
uddy+0.209
oc+0.207
Neg logit contributions
 the-0.117
 a-0.106
 all-0.089
 out-0.088
 up-0.083
Token fishing
Token ID12478
Feature activation+7.24e+00

Pos logit contributions
 the+0.602
 a+0.546
 her+0.522
 all+0.520
 in+0.487
Neg logit contributions
lor-0.680
oral-0.634
arry-0.620
uddy-0.606
oke-0.597
Token rod
Token ID15299
Feature activation-1.44e+00

Pos logit contributions
 the+0.226
 a+0.210
 all+0.188
 her+0.185
 out+0.177
Neg logit contributions
lor-0.231
arry-0.210
uddy-0.203
-0.199
oral-0.194
Token.
Token ID13
Feature activation-1.66e+01

Pos logit contributions
lor+0.057
arry+0.056
uddy+0.054
och+0.053
enny+0.052
Neg logit contributions
 the-0.026
 a-0.022
 all-0.017
 shiny-0.015
 more-0.015
Token
Token ID198
Feature activation-2.62e+01

Pos logit contributions
lor+0.648
ides+0.598
oral+0.577
och+0.569
jo+0.559
Neg logit contributions
 the-0.556
 a-0.501
 all-0.453
 up-0.418
 out-0.413
Token
Token ID198
Feature activation-2.63e+01

Pos logit contributions
umm+1.062
lor+1.023
ider+0.993
istent+0.963
ides+0.954
Neg logit contributions
 the-0.859
 a-0.771
 all-0.675
 in-0.619
 big-0.609
TokenTogether
Token ID41631
Feature activation-4.99e+01

Pos logit contributions
lor+0.714
och+0.647
ides+0.625
jo+0.612
oke+0.608
Neg logit contributions
 the-1.067
 a-0.997
 all-0.936
 in-0.892
 big-0.843
Token,
Token ID11
Feature activation+4.65e-01

Pos logit contributions
uddy+1.877
arry+1.845
lor+1.806
etsy+1.755
obby+1.731
Neg logit contributions
 the-0.954
 a-0.916
 out-0.754
 all-0.741
 up-0.724
Token they
Token ID484
Feature activation-2.94e+00

Pos logit contributions
 the+0.011
 a+0.011
 out+0.010
 all+0.010
 more+0.010
Neg logit contributions
-0.015
uddy-0.014
arry-0.014
etsy-0.014
lor-0.014
Token spent
Token ID3377
Feature activation-5.96e+00

Pos logit contributions
uddy+0.083
oped+0.083
+0.083
arry+0.083
lor+0.082
Neg logit contributions
 the-0.077
 a-0.072
 up-0.066
 her-0.064
 out-0.063
Token the
Token ID262
Feature activation+2.12e+01

Pos logit contributions
lor+0.249
oc+0.247
+0.244
Finally+0.233
elope+0.233
Neg logit contributions
 out-0.025
 back-0.014
 up-0.013
 shiny-0.008
 for-0.006
Token whole
Token ID2187
Feature activation+7.81e+00

Pos logit contributions
 the+0.550
 a+0.501
 all+0.448
 out+0.419
 in+0.410
Neg logit contributions
lor-0.825
arry-0.757
uddy-0.744
-0.724
etsy-0.723
Token'
Token ID6
Feature activation-1.73e+01

Pos logit contributions
lor+0.796
arry+0.696
uddy+0.675
etsy+0.652
jo+0.647
Neg logit contributions
 the-1.162
 a-1.095
 all-1.016
 out-0.968
 in-0.967
Token and
Token ID290
Feature activation+1.11e+01

Pos logit contributions
lor+0.805
arry+0.749
uddy+0.730
etsy+0.721
+0.718
Neg logit contributions
 the-0.313
 a-0.274
 all-0.231
 out-0.205
 in-0.200
Token '
Token ID705
Feature activation-2.06e+01

Pos logit contributions
 the+0.387
 a+0.362
 all+0.336
 out+0.320
 in+0.318
Neg logit contributions
lor-0.328
arry-0.291
uddy-0.277
-0.276
etsy-0.275
Tokenthank
Token ID40716
Feature activation-2.33e+01

Pos logit contributions
lor+0.537
uddy+0.466
arry+0.448
jo+0.430
ides+0.415
Neg logit contributions
 the-0.841
 a-0.799
 all-0.723
 in-0.715
 out-0.693
Token you
Token ID345
Feature activation+2.14e+00

Pos logit contributions
lor+0.915
arry+0.841
uddy+0.808
etsy+0.799
jo+0.793
Neg logit contributions
 the-0.578
 a-0.528
 all-0.478
 out-0.442
 in-0.433
Token'.
Token ID4458
Feature activation-5.16e+01

Pos logit contributions
 the+0.104
 a+0.099
 all+0.094
 out+0.090
 in+0.090
Neg logit contributions
lor-0.036
arry-0.029
uddy-0.027
etsy-0.026
-0.025
Token One
Token ID1881
Feature activation-3.03e+01

Pos logit contributions
lor+1.689
ides+1.472
oling+1.435
etsy+1.407
oke+1.378
Neg logit contributions
 the-2.026
 a-1.979
 all-1.730
 his-1.677
 some-1.617
Token day
Token ID1110
Feature activation-4.21e+00

Pos logit contributions
lor+0.961
arry+0.944
jo+0.919
oral+0.903
uddy+0.901
Neg logit contributions
 the-0.820
 a-0.752
 all-0.699
 out-0.685
 up-0.645
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
arry+0.142
lor+0.140
uddy+0.136
+0.133
enny+0.132
Neg logit contributions
 the-0.091
 a-0.081
 all-0.081
 up-0.080
 more-0.076
Token Sam
Token ID3409
Feature activation+8.61e+00

Pos logit contributions
 the+0.024
 up+0.022
 more+0.022
 a+0.021
 all+0.021
Neg logit contributions
arry-0.032
-0.032
uddy-0.031
ongo-0.030
mmm-0.030
Token asked
Token ID1965
Feature activation-1.31e+01

Pos logit contributions
 the+0.220
 a+0.201
 all+0.182
 out+0.169
 more+0.165
Neg logit contributions
lor-0.331
arry-0.301
uddy-0.295
jo-0.293
etsy-0.292
TokenFeed
Token ID18332
Feature activation-2.28e+01

Pos logit contributions
lor+0.502
arry+0.428
uddy+0.421
oral+0.393
ides+0.392
Neg logit contributions
 the-0.803
 a-0.760
 all-0.677
 in-0.677
 on-0.662
Token the
Token ID262
Feature activation+2.22e+01

Pos logit contributions
lor+0.867
arry+0.786
uddy+0.747
etsy+0.743
obby+0.741
Neg logit contributions
 the-0.572
 a-0.533
 all-0.480
 out-0.447
 more-0.442
Token Ele
Token ID15987
Feature activation-2.71e+00

Pos logit contributions
 the+0.592
 a+0.539
 all+0.482
 out+0.452
 in+0.449
Neg logit contributions
lor-0.842
arry-0.767
uddy-0.749
-0.733
etsy-0.733
Tokenph
Token ID746
Feature activation-1.43e+01

Pos logit contributions
lor+0.080
arry+0.069
jo+0.064
uddy+0.064
etsy+0.064
Neg logit contributions
 the-0.095
 a-0.089
 all-0.084
 out-0.078
 his-0.078
Tokenants
Token ID1187
Feature activation-2.47e+01

Pos logit contributions
lor+0.288
arry+0.244
uddy+0.234
+0.226
etsy+0.225
Neg logit contributions
 the-0.641
 a-0.607
 all-0.572
 out-0.549
 in-0.549
Token'.
Token ID4458
Feature activation-5.15e+01

Pos logit contributions
lor+0.718
arry+0.640
+0.605
uddy+0.596
etsy+0.591
Neg logit contributions
 the-0.840
 a-0.793
 all-0.722
 out-0.692
 up-0.686
Token Jim
Token ID5395
Feature activation-2.48e+00

Pos logit contributions
lor+1.626
arry+1.450
uddy+1.405
etsy+1.375
+1.360
Neg logit contributions
 the-1.751
 a-1.636
 all-1.504
 out-1.420
 in-1.414
Token looked
Token ID3114
Feature activation+6.76e+00

Pos logit contributions
lor+0.091
arry+0.083
uddy+0.081
jo+0.081
etsy+0.079
Neg logit contributions
 the-0.067
 a-0.061
 all-0.055
 out-0.054
 up-0.051
Token and
Token ID290
Feature activation+1.22e+01

Pos logit contributions
 the+0.122
 a+0.117
 her+0.094
 more+0.087
 all+0.085
Neg logit contributions
oc-0.209
oke-0.208
arry-0.207
obby-0.197
etsy-0.197
Token felt
Token ID2936
Feature activation+2.14e+00

Pos logit contributions
 the+0.298
 a+0.288
 out+0.258
 all+0.255
 on+0.253
Neg logit contributions
lor-0.383
arry-0.372
jo-0.363
Really-0.361
obby-0.354
Token a
Token ID257
Feature activation+2.46e+01

Pos logit contributions
 the+0.031
 all+0.027
 a+0.025
 up+0.025
 in+0.025
Neg logit contributions
lor-0.073
''-0.071
oc-0.071
 Amen-0.071
Really-0.069
Token the
Token ID262
Feature activation+2.40e+01

Pos logit contributions
 the+0.394
 a+0.376
 out+0.344
 all+0.341
 up+0.332
Neg logit contributions
lor-0.459
arry-0.405
etsy-0.404
uddy-0.388
-0.383
Token woman
Token ID2415
Feature activation-2.13e+01

Pos logit contributions
 the+0.898
 a+0.832
 all+0.812
 in+0.786
 out+0.786
Neg logit contributions
lor-0.614
arry-0.510
oral-0.509
etsy-0.481
jo-0.480
Token said
Token ID531
Feature activation-5.55e+00

Pos logit contributions
lor+0.721
etsy+0.637
arry+0.637
uddy+0.630
+0.626
Neg logit contributions
 the-0.591
 a-0.558
 all-0.495
 out-0.486
 up-0.473
Token,
Token ID11
Feature activation+2.84e+00

Pos logit contributions
lor+0.244
arry+0.224
etsy+0.219
uddy+0.217
+0.214
Neg logit contributions
 the-0.103
 a-0.092
 all-0.082
 out-0.074
 up-0.073
Token
Token ID6184
Feature activation-3.59e+01

Pos logit contributions
 the+0.090
 a+0.083
 all+0.076
 in+0.071
 out+0.071
Neg logit contributions
lor-0.098
arry-0.089
uddy-0.087
jo-0.085
ides-0.084
Token
Token ID95
Feature activation-4.95e+01

Pos logit contributions
lor+0.761
jo+0.690
uddy+0.663
obby+0.624
umbs+0.624
Neg logit contributions
 the-1.557
 a-1.473
 all-1.347
 in-1.343
 up-1.297
Token
Token ID26391
Feature activation-3.64e+01

Pos logit contributions
uddy+1.016
lor+1.013
obby+0.977
arry+0.947
jo+0.942
Neg logit contributions
 the-2.082
 a-2.034
 in-1.765
 up-1.719
 all-1.713
Token
Token ID129
Feature activation-3.49e+01

Pos logit contributions
lor+0.800
umm+0.783
oral+0.757
arry+0.738
reetings+0.716
Neg logit contributions
 the-1.566
 a-1.479
 all-1.320
 in-1.273
 on-1.260
Token
Token ID241
Feature activation-3.76e+01

Pos logit contributions
lor+0.832
arry+0.714
uddy+0.699
etsy+0.679
och+0.674
Neg logit contributions
 the-1.479
 a-1.396
 all-1.300
 up-1.250
 in-1.230
TokenLet
Token ID5756
Feature activation-1.80e+01

Pos logit contributions
lor+1.163
ides+0.990
arry+0.966
uddy+0.955
och+0.932
Neg logit contributions
 the-1.459
 a-1.434
 all-1.193
 in-1.171
 his-1.159
Token me
Token ID502
Feature activation+1.26e+01

Pos logit contributions
lor+0.681
arry+0.617
uddy+0.597
etsy+0.591
jo+0.584
Neg logit contributions
 the-0.482
 a-0.444
 all-0.401
 in-0.370
 out-0.370