TOP ACTIVATIONS
MAX = 35.488

then covered the hole with some leaves and went back inside
blank piece of paper and some colourful paints .
to the store to buy some food . She wanted to
of the garden and plant some flowers . She watered them
cr ay on and added some squ igg les to the
they used it to buy some things they wanted .
the window to go do some other activities . Although he
's go inside and have some hot cocoa !"
upset that the earthquake broke some of our things . But
and dad . They had some stories to tell and showed
some butter and sprinkled on some pepper . Then with a
She grabbed some paper , markers and scissors
baking dish . Tom puts some flour and water in another
Next , she found some paints . She quickly unp
to the truck and saw some men wearing yellow hats and
, Spot decided to investigate some far away bushes . When
flour , some eggs , some milk and some sugar .
you please supply me with some past el colors ? I
a balloon . I have some money . You can choose
mom offered to buy him some pizza . Sammy 's eyes

MIN ACTIVATIONS
MIN = -74.491

H ola , me ll amo Emma ." L
end . M oral value : The story teaches
gentle . M oral : Being respectful and gentle
up ." M oral of the story : Be
away . M oral of the story : When
said to Dave , Hold on
and then said , No ,
with a smile . Do you
His mom smiled . That â
as a gift . Thank you
asked her mum , How did
said mom my , but don
smiled and said , Thank you
But Jim said : No ,
his mom said , Tim my
was sitting nearby . It was
smiled and said , I knew
can keep it forever things are
and tried to say I 'm
the boys said , We are

INTERVAL 7.993 - 35.488
CONTAINS 34.907%

ummy treats . Lily forgot all about her leg and was
garden . They dance to music , to songs , and
tries to make the bird look nice , but it looks
They got into the helicopter , and the boy
that the monster wasn 't real , and he enjoyed the

INTERVAL -19.502 - 7.993
CONTAINS 53.236%

L ily said , " Don 't worry , I
sun shone and birds ch ir ped . The
too much and they only gathered enough to make a small
." Ben is surprised . He thinks Lily is
the fence outside . He asked Tim my to stay inside

INTERVAL -46.997 - -19.502
CONTAINS 11.693%

There is a spider !" Tom dropped the skull
find another book ." But Sara does not want
swim away . Just then , D ucky 's
the cheap navy paint ! <|endoftext|> Once upon a time there
the apple for me ." Mom my smiled and

INTERVAL -74.491 - -46.997
CONTAINS 0.165%

stay too . So the cube opened the door
p anda and said Come with
of plans . Unfortunately , Winn ie got sick
brave and said , Hey ,
. â So , the mom my bird
Token then
Token ID788
Feature activation+4.88e+00

Pos logit contributions
lor+0.443
ushed+0.393
anas+0.383
uer+0.382
ides+0.381
Neg logit contributions
 the-0.365
 a-0.323
 her-0.287
 his-0.283
 all-0.279
Token covered
Token ID5017
Feature activation-2.48e+00

Pos logit contributions
 the+0.142
 a+0.131
 all+0.115
 his+0.113
 their+0.107
Neg logit contributions
lor-0.158
och-0.142
enny-0.140
oke-0.140
oped-0.139
Token the
Token ID262
Feature activation+2.67e+01

Pos logit contributions
lor+0.113
umm+0.101
ase+0.101
aron+0.101
+0.099
Neg logit contributions
 the-0.021
 a-0.020
 more-0.019
 big-0.017
 good-0.015
Token hole
Token ID7604
Feature activation-1.04e+01

Pos logit contributions
 the+0.928
 a+0.864
 her+0.780
 his+0.770
 all+0.739
Neg logit contributions
lor-0.862
umm-0.828
ides-0.770
arry-0.762
uer-0.755
Token with
Token ID351
Feature activation+1.63e+01

Pos logit contributions
ider+0.452
lor+0.433
ides+0.401
umm+0.399
oling+0.396
Neg logit contributions
 the-0.202
 a-0.170
 his-0.145
 her-0.130
 all-0.126
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.284
 up+0.239
 a+0.232
 in+0.231
 all+0.217
Neg logit contributions
lor-0.554
umm-0.545
-0.538
och-0.527
etsy-0.527
Token leaves
Token ID5667
Feature activation+1.66e+01

Pos logit contributions
 the+0.915
 a+0.808
 all+0.719
 her+0.680
 his+0.659
Neg logit contributions
umm-1.313
lor-1.311
ides-1.291
och-1.288
ider-1.266
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
 the+0.372
 a+0.338
 his+0.286
 all+0.260
 her+0.256
Neg logit contributions
lor-0.696
-0.644
ides-0.640
och-0.634
istent-0.629
Token went
Token ID1816
Feature activation+1.64e+00

Pos logit contributions
 the+0.358
 a+0.304
 all+0.300
 up+0.285
 out+0.283
Neg logit contributions
lor-0.510
pill-0.494
plets-0.477
umm-0.473
anas-0.468
Token back
Token ID736
Feature activation+2.62e+01

Pos logit contributions
 the+0.031
 a+0.028
 her+0.020
 all+0.019
 his+0.019
Neg logit contributions
lor-0.068
ider-0.068
och-0.065
oling-0.065
-0.064
Token inside
Token ID2641
Feature activation+1.43e+01

Pos logit contributions
 the+0.474
 a+0.421
 more+0.336
 all+0.333
 his+0.322
Neg logit contributions
lor-1.099
obby-1.065
arry-1.045
-1.026
jo-1.020
Token blank
Token ID9178
Feature activation-7.27e-02

Pos logit contributions
 the+0.989
 a+0.896
 his+0.889
 all+0.840
 her+0.758
Neg logit contributions
-1.136
och-1.100
umm-1.099
oke-1.090
lor-1.077
Token piece
Token ID3704
Feature activation+8.19e+00

Pos logit contributions
lor+0.003
ides+0.003
och+0.003
umm+0.003
arry+0.002
Neg logit contributions
 the-0.002
 a-0.002
 his-0.002
 all-0.002
 her-0.002
Token of
Token ID286
Feature activation+1.65e+01

Pos logit contributions
 the+0.239
 a+0.211
 all+0.182
 his+0.181
 her+0.170
Neg logit contributions
lor-0.323
och-0.302
ider-0.293
ides-0.288
umm-0.287
Token paper
Token ID3348
Feature activation+1.37e+01

Pos logit contributions
 the+0.437
 a+0.410
 his+0.400
 all+0.384
 in+0.349
Neg logit contributions
lor-0.559
och-0.545
ides-0.542
mmmm-0.536
umm-0.532
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
 the+0.304
 a+0.267
 his+0.215
 all+0.208
 her+0.191
Neg logit contributions
lor-0.675
ides-0.617
och-0.616
ider-0.605
umm-0.600
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.434
 a+0.383
 his+0.375
 all+0.362
 out+0.328
Neg logit contributions
lor-0.498
och-0.466
-0.462
ides-0.459
ider-0.458
Token colourful
Token ID44799
Feature activation+2.21e+01

Pos logit contributions
 the+0.989
 a+0.883
 all+0.806
 his+0.805
 her+0.710
Neg logit contributions
lor-1.369
och-1.333
ides-1.322
oke-1.290
umm-1.287
Token paints
Token ID32281
Feature activation-5.16e+00

Pos logit contributions
 the+0.652
 a+0.592
 his+0.543
 all+0.542
 up+0.479
Neg logit contributions
lor-0.846
ides-0.799
och-0.774
umm-0.766
-0.753
Token.
Token ID13
Feature activation-1.23e+01

Pos logit contributions
lor+0.224
och+0.206
ides+0.205
umm+0.200
ider+0.198
Neg logit contributions
 the-0.132
 a-0.119
 his-0.104
 all-0.100
 her-0.091
Token
Token ID198
Feature activation-1.92e+01

Pos logit contributions
lor+0.454
ides+0.403
och+0.390
ider+0.388
oke+0.385
Neg logit contributions
 the-0.501
 a-0.471
 all-0.420
 her-0.408
 his-0.389
Token
Token ID198
Feature activation-1.92e+01

Pos logit contributions
lor+0.860
umm+0.857
ider+0.844
ides+0.822
istent+0.802
Neg logit contributions
 the-0.643
 a-0.585
 all-0.488
 her-0.472
 his-0.457
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
 the+0.178
 a+0.150
 all+0.126
 his+0.107
 more+0.099
Neg logit contributions
lor-0.500
jo-0.483
grand-0.467
och-0.463
ides-0.459
Token the
Token ID262
Feature activation+2.66e+01

Pos logit contributions
 on+0.032
 in+0.027
 up+0.015
 all+0.015
 the+0.012
Neg logit contributions
lor-0.674
-0.655
obby-0.634
arry-0.625
istent-0.620
Token store
Token ID3650
Feature activation+7.22e+00

Pos logit contributions
 the+1.238
 a+1.165
 all+1.078
 his+1.052
 in+1.025
Neg logit contributions
lor-0.678
och-0.523
ider-0.516
umm-0.512
arry-0.505
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
 the+0.151
 a+0.132
 all+0.105
 his+0.099
 her+0.096
Neg logit contributions
lor-0.351
arry-0.320
och-0.320
ider-0.319
ides-0.314
Token buy
Token ID2822
Feature activation+3.69e+00

Pos logit contributions
 the+0.446
 a+0.428
 on+0.421
 in+0.414
 all+0.412
Neg logit contributions
lor-0.352
-0.330
arry-0.296
och-0.295
grand-0.290
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.037
 a+0.025
 in+0.022
 all+0.019
 her+0.017
Neg logit contributions
lor-0.174
oke-0.163
och-0.161
arry-0.159
umm-0.156
Token food
Token ID2057
Feature activation+2.52e+01

Pos logit contributions
 the+0.915
 a+0.799
 all+0.702
 his+0.673
 her+0.630
Neg logit contributions
lor-1.463
och-1.408
ides-1.398
arry-1.373
oke-1.368
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
 the+0.624
 a+0.557
 all+0.462
 his+0.424
 her+0.423
Neg logit contributions
lor-1.155
och-1.059
ides-1.044
arry-1.033
ider-1.023
Token She
Token ID1375
Feature activation-1.22e+01

Pos logit contributions
lor+0.406
ides+0.347
och+0.334
ider+0.327
umm+0.327
Neg logit contributions
 the-0.516
 a-0.484
 all-0.438
 his-0.419
 her-0.414
Token wanted
Token ID2227
Feature activation-6.77e+00

Pos logit contributions
lor+0.450
etsy+0.398
oke+0.396
och+0.396
jo+0.392
Neg logit contributions
 the-0.373
 a-0.341
 all-0.296
 his-0.286
 out-0.281
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
lor+0.303
obby+0.268
jo+0.267
och+0.266
grand+0.265
Neg logit contributions
 the-0.087
 a-0.072
 up-0.059
 all-0.057
 in-0.055
Token of
Token ID286
Feature activation+1.65e+01

Pos logit contributions
 the+0.114
 a+0.106
 all+0.093
 his+0.090
 her+0.087
Neg logit contributions
lor-0.149
oke-0.137
och-0.135
arry-0.131
aring-0.131
Token the
Token ID262
Feature activation+2.66e+01

Pos logit contributions
 the+0.058
 a+0.054
 in+0.017
 out+0.012
 all+0.009
Neg logit contributions
lor-0.925
-0.850
och-0.841
arians-0.827
ides-0.825
Token garden
Token ID11376
Feature activation+1.37e+00

Pos logit contributions
 the+0.937
 a+0.856
 all+0.768
 his+0.743
 her+0.713
Neg logit contributions
lor-1.001
ides-0.878
umm-0.856
arry-0.842
och-0.834
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
 the+0.033
 a+0.030
 his+0.025
 all+0.023
 her+0.022
Neg logit contributions
lor-0.061
umm-0.055
ides-0.055
och-0.055
arry-0.054
Token plant
Token ID4618
Feature activation+4.14e+00

Pos logit contributions
 the+0.372
 a+0.351
 all+0.302
 his+0.297
 big+0.271
Neg logit contributions
lor-0.547
etsy-0.494
och-0.494
grand-0.492
Really-0.490
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.073
 a+0.068
 his+0.061
 all+0.056
 big+0.046
Neg logit contributions
lor-0.189
ides-0.162
owers-0.162
ossom-0.159
-0.159
Token flowers
Token ID12734
Feature activation+1.16e+01

Pos logit contributions
 the+0.928
 a+0.831
 all+0.740
 his+0.728
 her+0.654
Neg logit contributions
lor-1.390
umm-1.348
ides-1.334
och-1.323
arry-1.298
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
 the+0.303
 a+0.281
 his+0.243
 all+0.227
 her+0.213
Neg logit contributions
lor-0.498
och-0.449
ides-0.449
ider-0.437
-0.434
Token She
Token ID1375
Feature activation-1.22e+01

Pos logit contributions
lor+0.409
ides+0.349
och+0.339
ider+0.330
jo+0.329
Neg logit contributions
 the-0.520
 a-0.492
 all-0.444
 her-0.423
 his-0.409
Token watered
Token ID48024
Feature activation-4.75e+00

Pos logit contributions
lor+0.432
etsy+0.377
ides+0.376
ushed+0.369
uer+0.369
Neg logit contributions
 the-0.377
 a-0.346
 his-0.322
 all-0.296
 out-0.288
Token them
Token ID606
Feature activation+1.75e+01

Pos logit contributions
lor+0.212
oc+0.195
jo+0.192
ides+0.192
arry+0.189
Neg logit contributions
 the-0.042
 a-0.041
 his-0.032
 back-0.030
 angry-0.029
Token cr
Token ID1067
Feature activation+9.98e+00

Pos logit contributions
 the+0.814
 a+0.738
 all+0.685
 his+0.650
 her+0.645
Neg logit contributions
lor-1.036
umm-0.964
ides-0.949
ider-0.949
och-0.940
Tokenay
Token ID323
Feature activation-3.37e-01

Pos logit contributions
 the+0.462
 a+0.444
 all+0.410
 his+0.386
 her+0.385
Neg logit contributions
lor-0.257
ider-0.204
reetings-0.202
Really-0.201
etsy-0.201
Tokenon
Token ID261
Feature activation-4.53e+00

Pos logit contributions
lor+0.009
ides+0.008
och+0.008
oke+0.008
arry+0.008
Neg logit contributions
 the-0.015
 a-0.014
 all-0.013
 his-0.012
 her-0.012
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
lor+0.199
ides+0.185
ider+0.183
uer+0.179
och+0.178
Neg logit contributions
 the-0.108
 a-0.095
 her-0.074
 all-0.074
 his-0.071
Token added
Token ID2087
Feature activation-3.59e+00

Pos logit contributions
 the+0.427
 a+0.418
 out+0.345
 all+0.339
 her+0.330
Neg logit contributions
lor-0.519
ushed-0.468
umm-0.466
oke-0.460
och-0.459
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
lor+0.182
umm+0.169
ides+0.166
oke+0.166
och+0.165
Neg logit contributions
 the-0.040
 a-0.030
 all-0.023
 her-0.021
 his-0.019
Token squ
Token ID2809
Feature activation+1.32e+01

Pos logit contributions
 the+0.908
 a+0.791
 all+0.735
 his+0.702
 her+0.690
Neg logit contributions
lor-1.384
ides-1.367
umm-1.357
oke-1.348
och-1.335
Tokenigg
Token ID6950
Feature activation-1.18e+01

Pos logit contributions
 the+0.657
 a+0.623
 all+0.574
 his+0.552
 her+0.548
Neg logit contributions
lor-0.306
ides-0.266
umm-0.253
arry-0.248
och-0.243
Tokenles
Token ID829
Feature activation+4.72e+00

Pos logit contributions
lor+0.303
ides+0.272
arry+0.258
oke+0.257
umm+0.255
Neg logit contributions
 the-0.531
 a-0.511
 all-0.462
 her-0.441
 his-0.437
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
 the+0.103
 a+0.093
 all+0.074
 her+0.069
 his+0.068
Neg logit contributions
lor-0.208
ides-0.192
och-0.190
oke-0.186
umm-0.185
Token the
Token ID262
Feature activation+2.66e+01

Pos logit contributions
 the+0.185
 a+0.174
 all+0.117
 in+0.117
 up+0.115
Neg logit contributions
lor-0.657
ides-0.620
och-0.615
arry-0.614
umm-0.608
Token they
Token ID484
Feature activation+2.69e+00

Pos logit contributions
 the+0.410
 a+0.392
 out+0.321
 all+0.321
 up+0.314
Neg logit contributions
lor-0.580
grand-0.492
person-0.491
jo-0.488
arry-0.482
Token used
Token ID973
Feature activation+2.50e+00

Pos logit contributions
 the+0.054
 a+0.049
 his+0.035
 her+0.033
 out+0.032
Neg logit contributions
lor-0.130
och-0.119
ides-0.118
arry-0.116
jo-0.115
Token it
Token ID340
Feature activation+1.07e+01

Pos logit contributions
 the+0.019
 a+0.014
 in+0.010
 all+0.009
 up+0.007
Neg logit contributions
lor-0.120
ides-0.111
jo-0.109
-0.109
obby-0.109
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
 the+0.185
 a+0.144
 all+0.106
 her+0.102
 his+0.097
Neg logit contributions
lor-0.526
ides-0.492
uddy-0.492
arry-0.483
etsy-0.478
Token buy
Token ID2822
Feature activation+3.69e+00

Pos logit contributions
 the+0.404
 a+0.376
 all+0.334
 out+0.324
 up+0.314
Neg logit contributions
lor-0.495
ides-0.462
-0.454
arry-0.447
istent-0.436
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.043
 a+0.034
 all+0.026
 her+0.025
 in+0.023
Neg logit contributions
lor-0.172
umm-0.159
ides-0.158
och-0.156
arry-0.155
Token things
Token ID1243
Feature activation+8.14e+00

Pos logit contributions
 the+0.886
 a+0.783
 all+0.656
 her+0.640
 his+0.618
Neg logit contributions
lor-1.491
umm-1.422
ides-1.403
och-1.387
arry-1.378
Token they
Token ID484
Feature activation+2.69e+00

Pos logit contributions
 the+0.179
 a+0.160
 all+0.128
 her+0.120
 his+0.116
Neg logit contributions
lor-0.366
och-0.347
arry-0.335
ides-0.334
-0.327
Token wanted
Token ID2227
Feature activation-6.79e+00

Pos logit contributions
 the+0.093
 a+0.087
 all+0.075
 his+0.073
 her+0.072
Neg logit contributions
lor-0.097
ides-0.089
och-0.086
arry-0.085
istent-0.085
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
lor+0.301
ides+0.283
och+0.270
arry+0.269
oke+0.269
Neg logit contributions
 the-0.154
 a-0.136
 all-0.115
 her-0.103
 his-0.103
Token
Token ID198
Feature activation-1.92e+01

Pos logit contributions
lor+0.477
ider+0.454
ides+0.449
uer+0.426
reetings+0.421
Neg logit contributions
 the-0.477
 a-0.436
 his-0.390
 all-0.387
 her-0.350
Token the
Token ID262
Feature activation+2.66e+01

Pos logit contributions
 the+0.052
 a+0.047
 all+0.028
 more+0.019
 big+0.012
Neg logit contributions
lor-0.398
-0.397
jo-0.391
och-0.389
umm-0.382
Token window
Token ID4324
Feature activation+2.97e+00

Pos logit contributions
 the+0.906
 a+0.840
 all+0.754
 his+0.740
 her+0.722
Neg logit contributions
lor-0.964
ider-0.864
umm-0.862
ides-0.859
arry-0.842
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
 the+0.065
 a+0.059
 all+0.048
 his+0.043
 her+0.043
Neg logit contributions
lor-0.148
ides-0.133
och-0.131
arry-0.131
ider-0.131
Token go
Token ID467
Feature activation+1.41e+01

Pos logit contributions
 the+0.347
 a+0.327
 all+0.280
 more+0.256
 in+0.252
Neg logit contributions
lor-0.542
ides-0.485
reetings-0.479
arry-0.469
-0.467
Token do
Token ID466
Feature activation+1.47e+01

Pos logit contributions
 the+0.310
 a+0.289
 all+0.234
 his+0.210
 her+0.201
Neg logit contributions
lor-0.653
arry-0.600
och-0.599
ider-0.586
ides-0.585
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.286
 a+0.241
 all+0.207
 her+0.186
 his+0.173
Neg logit contributions
lor-0.714
ides-0.632
jo-0.621
istent-0.611
oke-0.609
Token other
Token ID584
Feature activation+2.88e+01

Pos logit contributions
 the+0.966
 a+0.826
 all+0.763
 his+0.719
 her+0.718
Neg logit contributions
lor-1.396
ides-1.344
istent-1.301
arry-1.289
oke-1.276
Token activities
Token ID4568
Feature activation-5.39e+00

Pos logit contributions
 the+0.749
 a+0.679
 all+0.600
 his+0.576
 her+0.555
Neg logit contributions
lor-1.181
arry-1.101
ides-1.100
umm-1.097
oke-1.095
Token.
Token ID13
Feature activation-1.23e+01

Pos logit contributions
lor+0.234
och+0.218
ider+0.213
ides+0.212
+0.209
Neg logit contributions
 the-0.132
 a-0.119
 all-0.097
 his-0.092
 her-0.091
Token Although
Token ID4900
Feature activation-2.91e+01

Pos logit contributions
lor+0.490
ides+0.468
ider+0.455
uer+0.431
umm+0.423
Neg logit contributions
 the-0.466
 a-0.424
 his-0.367
 all-0.362
 her-0.347
Token he
Token ID339
Feature activation+8.46e+00

Pos logit contributions
lor+1.306
arry+1.183
och+1.148
etsy+1.141
uddy+1.139
Neg logit contributions
 the-0.487
 a-0.473
 out-0.391
 all-0.361
 in-0.335
Token's
Token ID338
Feature activation+1.06e+01

Pos logit contributions
lor+0.919
ides+0.765
jo+0.757
etsy+0.750
oke+0.740
Neg logit contributions
 the-0.700
 a-0.665
 all-0.589
 his-0.586
 in-0.540
Token go
Token ID467
Feature activation+1.41e+01

Pos logit contributions
 the+0.261
 a+0.231
 his+0.200
 all+0.187
 her+0.184
Neg logit contributions
lor-0.461
ides-0.422
-0.420
istent-0.416
arry-0.403
Token inside
Token ID2641
Feature activation+1.43e+01

Pos logit contributions
 the+0.292
 a+0.256
 all+0.203
 his+0.199
 her+0.192
Neg logit contributions
lor-0.698
arry-0.631
ides-0.627
och-0.627
uddy-0.624
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
 the+0.320
 a+0.301
 his+0.255
 all+0.254
 her+0.247
Neg logit contributions
lor-0.615
istent-0.543
ides-0.543
arry-0.536
obby-0.536
Token have
Token ID423
Feature activation+1.23e+01

Pos logit contributions
 the+0.407
 a+0.373
 his+0.327
 all+0.326
 her+0.307
Neg logit contributions
lor-0.596
-0.538
ider-0.534
och-0.533
etsy-0.530
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.149
 all+0.093
 his+0.088
 a+0.086
 in+0.083
Neg logit contributions
lor-0.631
rd-0.550
-0.549
etsy-0.538
mmmm-0.535
Token hot
Token ID3024
Feature activation+2.99e+01

Pos logit contributions
 the+1.001
 a+0.850
 all+0.774
 his+0.755
 her+0.727
Neg logit contributions
lor-1.383
ider-1.317
-1.304
ides-1.295
umm-1.276
Token cocoa
Token ID35845
Feature activation+1.31e+00

Pos logit contributions
 the+0.990
 a+0.891
 all+0.812
 his+0.771
 her+0.749
Neg logit contributions
lor-1.120
umm-1.022
ides-1.015
-0.993
ider-0.977
Token!"
Token ID2474
Feature activation-3.00e+01

Pos logit contributions
 the+0.046
 a+0.043
 all+0.037
 his+0.037
 her+0.036
Neg logit contributions
lor-0.049
ides-0.043
umm-0.042
ider-0.042
och-0.042
Token
Token ID198
Feature activation-1.92e+01

Pos logit contributions
lor+1.086
umm+0.970
ides+0.967
ider+0.954
och+0.939
Neg logit contributions
 the-1.197
 a-1.116
 all-0.992
 her-0.951
 his-0.949
Token
Token ID198
Feature activation-1.92e+01

Pos logit contributions
lor+0.758
umm+0.756
ides+0.717
ider+0.710
istent+0.701
Neg logit contributions
 the-0.706
 a-0.656
 all-0.565
 her-0.536
 his-0.530
Token upset
Token ID9247
Feature activation+1.08e+01

Pos logit contributions
 the+0.281
 a+0.246
 all+0.231
 his+0.224
 her+0.215
Neg logit contributions
lor-0.431
oke-0.429
istent-0.422
ider-0.421
reetings-0.412
Token that
Token ID326
Feature activation+8.91e+00

Pos logit contributions
 the+0.277
 a+0.252
 all+0.212
 his+0.204
 her+0.197
Neg logit contributions
lor-0.454
arry-0.425
oke-0.424
ides-0.424
ider-0.422
Token the
Token ID262
Feature activation+2.67e+01

Pos logit contributions
 the+0.210
 a+0.197
 all+0.170
 his+0.158
 her+0.153
Neg logit contributions
lor-0.361
oke-0.332
umm-0.329
ider-0.327
jo-0.327
Token earthquake
Token ID16295
Feature activation-3.14e+00

Pos logit contributions
 the+1.036
 a+0.960
 his+0.844
 all+0.843
 her+0.829
Neg logit contributions
lor-0.852
umm-0.774
ider-0.746
ides-0.739
-0.733
Token broke
Token ID6265
Feature activation+8.93e+00

Pos logit contributions
lor+0.131
ides+0.119
oke+0.117
och+0.116
ider+0.115
Neg logit contributions
 the-0.094
 a-0.087
 all-0.076
 his-0.072
 her-0.070
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.228
 a+0.206
 all+0.182
 his+0.175
 her+0.169
Neg logit contributions
lor-0.363
ides-0.336
ider-0.336
arry-0.330
jo-0.329
Token of
Token ID286
Feature activation+1.65e+01

Pos logit contributions
 the+0.993
 a+0.932
 his+0.771
 all+0.770
 her+0.747
Neg logit contributions
lor-1.421
ides-1.359
umm-1.327
oke-1.313
och-1.301
Token our
Token ID674
Feature activation+1.60e+01

Pos logit contributions
 the+0.184
 a+0.181
 all+0.124
 up+0.122
 out+0.117
Neg logit contributions
lor-0.809
och-0.752
umm-0.746
ides-0.744
-0.735
Token things
Token ID1243
Feature activation+8.08e+00

Pos logit contributions
 the+0.486
 a+0.452
 all+0.373
 his+0.370
 her+0.356
Neg logit contributions
lor-0.617
ides-0.588
umm-0.586
arry-0.570
uddy-0.565
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
 the+0.220
 a+0.202
 his+0.172
 all+0.169
 her+0.164
Neg logit contributions
lor-0.330
ides-0.307
och-0.302
ider-0.297
oke-0.296
Token But
Token ID887
Feature activation-2.62e+01

Pos logit contributions
lor+0.443
ides+0.399
uer+0.378
och+0.368
arry+0.361
Neg logit contributions
 the-0.510
 a-0.494
 his-0.403
 all-0.388
 some-0.384
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
lor+0.698
oke+0.659
och+0.644
arry+0.635
ides+0.634
Neg logit contributions
 the-0.288
 a-0.261
 his-0.203
 all-0.198
 her-0.172
Token dad
Token ID9955
Feature activation-1.03e+01

Pos logit contributions
 the+0.456
 a+0.439
 all+0.376
 up+0.361
 his+0.359
Neg logit contributions
lor-0.542
och-0.480
arry-0.477
oke-0.473
etsy-0.465
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
lor+0.507
oke+0.474
och+0.473
ides+0.471
arry+0.463
Neg logit contributions
 the-0.184
 a-0.166
 his-0.119
 all-0.113
 her-0.101
Token They
Token ID1119
Feature activation-1.26e+01

Pos logit contributions
lor+0.433
ides+0.376
reetings+0.360
jo+0.358
och+0.352
Neg logit contributions
 the-0.497
 a-0.468
 all-0.416
 her-0.401
 his-0.390
Token had
Token ID550
Feature activation+2.50e+00

Pos logit contributions
lor+0.452
arry+0.419
istent+0.416
och+0.415
ides+0.414
Neg logit contributions
 the-0.368
 a-0.343
 his-0.291
 up-0.277
 out-0.273
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.042
 up+0.034
 a+0.032
 in+0.031
 out+0.031
Neg logit contributions
lor-0.118
ides-0.103
etsy-0.100
jo-0.099
ari-0.099
Token stories
Token ID3923
Feature activation-2.61e+00

Pos logit contributions
 the+0.968
 a+0.827
 all+0.740
 his+0.713
 her+0.666
Neg logit contributions
lor-1.409
ides-1.364
och-1.340
arry-1.317
umm-1.314
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
lor+0.124
och+0.114
ides+0.112
ari+0.111
ider+0.111
Neg logit contributions
 the-0.055
 a-0.047
 his-0.039
 all-0.036
 her-0.034
Token tell
Token ID1560
Feature activation-3.94e+00

Pos logit contributions
 the+0.387
 a+0.358
 all+0.312
 out+0.303
 in+0.302
Neg logit contributions
lor-0.498
ides-0.483
arry-0.456
istent-0.447
jo-0.442
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
lor+0.209
ides+0.196
och+0.191
arry+0.191
oke+0.189
Neg logit contributions
 the-0.056
 a-0.050
 his-0.033
 all-0.033
 out-0.027
Token showed
Token ID3751
Feature activation+1.74e+00

Pos logit contributions
 the+0.384
 a+0.356
 all+0.293
 his+0.286
 out+0.282
Neg logit contributions
lor-0.631
ides-0.569
och-0.565
oke-0.559
umm-0.559
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
lor+0.173
umm+0.160
umbs+0.155
owers+0.154
ides+0.154
Neg logit contributions
 the-0.038
 a-0.029
 all-0.021
 his-0.021
 her-0.017
Token butter
Token ID9215
Feature activation-4.08e+00

Pos logit contributions
 the+1.005
 a+0.908
 all+0.816
 her+0.801
 his+0.801
Neg logit contributions
lor-1.243
umm-1.230
ides-1.193
pill-1.138
oke-1.126
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
lor+0.197
ides+0.175
och+0.174
uddy+0.172
etsy+0.172
Neg logit contributions
 the-0.079
 a-0.072
 all-0.057
 his-0.054
 more-0.050
Token sprinkled
Token ID49692
Feature activation-1.23e+01

Pos logit contributions
 the+0.453
 a+0.417
 all+0.405
 out+0.387
 up+0.387
Neg logit contributions
lor-0.471
umm-0.442
pill-0.438
plets-0.424
mmm-0.417
Token on
Token ID319
Feature activation+2.00e+01

Pos logit contributions
lor+0.610
+0.576
umm+0.574
plets+0.555
ides+0.549
Neg logit contributions
 the-0.134
 a-0.117
 all-0.084
 his-0.080
 up-0.064
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.114
 a+0.091
 all+0.052
 out+0.038
 in+0.034
Neg logit contributions
lor-1.104
-1.067
pill-1.024
umm-1.007
etsy-0.995
Token pepper
Token ID13385
Feature activation-3.41e-02

Pos logit contributions
 the+0.976
 a+0.904
 all+0.795
 his+0.753
 her+0.742
Neg logit contributions
lor-1.381
umm-1.332
ides-1.293
och-1.251
-1.220
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
lor+0.002
ides+0.002
umm+0.001
och+0.001
arry+0.001
Neg logit contributions
 the-0.001
 a-0.001
 all-0.001
 his-0.001
 her-0.000
Token Then
Token ID3244
Feature activation-1.44e+01

Pos logit contributions
lor+0.450
ides+0.398
ider+0.383
uer+0.380
reetings+0.376
Neg logit contributions
 the-0.492
 a-0.466
 all-0.396
 her-0.393
 his-0.374
Token with
Token ID351
Feature activation+1.63e+01

Pos logit contributions
lor+0.576
+0.509
umm+0.502
arry+0.499
och+0.493
Neg logit contributions
 the-0.330
 a-0.312
 all-0.276
 his-0.262
 up-0.251
Token a
Token ID257
Feature activation+3.07e+01

Pos logit contributions
 the+0.096
 a+0.034
 up+0.032
 in+0.021
 out+0.017
Neg logit contributions
lor-0.850
-0.819
jo-0.809
och-0.804
anne-0.788
Token
Token ID220
Feature activation-2.23e+01

Pos logit contributions
lor+0.408
ides+0.352
umm+0.337
och+0.336
ider+0.330
Neg logit contributions
 the-0.514
 a-0.482
 all-0.435
 her-0.416
 his-0.410
Token
Token ID198
Feature activation-1.91e+01

Pos logit contributions
lor+0.998
umm+0.967
ider+0.910
ides+0.908
uer+0.885
Neg logit contributions
 the-0.751
 a-0.705
 all-0.579
 her-0.575
 his-0.535
Token
Token ID198
Feature activation-1.91e+01

Pos logit contributions
umm+0.820
lor+0.806
ides+0.765
ider+0.759
istent+0.747
Neg logit contributions
 the-0.671
 a-0.613
 all-0.513
 her-0.511
 his-0.490
TokenShe
Token ID3347
Feature activation-2.55e+01

Pos logit contributions
lor+0.626
umm+0.586
ides+0.565
istent+0.538
+0.537
Neg logit contributions
 the-0.829
 a-0.775
 all-0.693
 her-0.682
 his-0.658
Token grabbed
Token ID13176
Feature activation+1.03e+01

Pos logit contributions
lor+0.962
jo+0.879
uddy+0.871
ides+0.869
oke+0.859
Neg logit contributions
 the-0.794
 a-0.709
 all-0.622
 his-0.595
 her-0.587
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.077
 in+0.042
 a+0.039
 more+0.039
 all+0.035
Neg logit contributions
-0.458
och-0.440
pill-0.422
etsy-0.417
uddy-0.416
Token paper
Token ID3348
Feature activation+1.37e+01

Pos logit contributions
 the+1.042
 a+0.900
 all+0.822
 his+0.795
 her+0.731
Neg logit contributions
och-1.211
pill-1.185
pillar-1.177
umm-1.176
ider-1.168
Token,
Token ID11
Feature activation+4.40e+00

Pos logit contributions
 the+0.298
 a+0.257
 his+0.198
 all+0.191
 her+0.177
Neg logit contributions
lor-0.667
och-0.608
ides-0.605
uddy-0.594
arry-0.590
Token markers
Token ID19736
Feature activation-6.88e+00

Pos logit contributions
 the+0.094
 all+0.078
 a+0.072
 his+0.069
 up+0.068
Neg logit contributions
ider-0.153
och-0.151
lor-0.147
plets-0.145
pill-0.143
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
lor+0.314
ides+0.294
och+0.292
uddy+0.282
arry+0.281
Neg logit contributions
 the-0.148
 a-0.130
 his-0.106
 all-0.099
 more-0.090
Token scissors
Token ID41786
Feature activation-8.43e-01

Pos logit contributions
 the+0.406
 all+0.339
 a+0.332
 up+0.317
 his+0.310
Neg logit contributions
lor-0.459
och-0.452
plets-0.449
ides-0.443
jo-0.435
Token baking
Token ID16871
Feature activation+6.86e+00

Pos logit contributions
 the+0.872
 a+0.800
 all+0.702
 his+0.629
 her+0.621
Neg logit contributions
lor-1.350
ides-1.203
och-1.197
arry-1.169
umm-1.169
Token dish
Token ID9433
Feature activation-5.52e+00

Pos logit contributions
 the+0.206
 a+0.194
 all+0.174
 his+0.160
 her+0.152
Neg logit contributions
lor-0.272
ides-0.237
umm-0.234
och-0.231
-0.226
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
lor+0.248
ides+0.225
och+0.222
umm+0.221
ider+0.220
Neg logit contributions
 the-0.146
 a-0.132
 all-0.108
 his-0.103
 her-0.100
Token Tom
Token ID4186
Feature activation+8.89e+00

Pos logit contributions
lor+0.478
ides+0.413
och+0.393
jo+0.389
ider+0.388
Neg logit contributions
 the-0.481
 a-0.452
 all-0.388
 her-0.379
 his-0.367
Token puts
Token ID7584
Feature activation-3.91e+00

Pos logit contributions
 the+0.304
 a+0.287
 all+0.258
 his+0.247
 her+0.238
Neg logit contributions
lor-0.306
ides-0.272
oke-0.272
istent-0.266
oling-0.264
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
+0.193
lor+0.187
och+0.183
umbs+0.179
aron+0.178
Neg logit contributions
 the-0.011
 a-0.007
 all-0.003
 to+0.003
 in+0.005
Token flour
Token ID10601
Feature activation-6.82e+00

Pos logit contributions
 the+1.031
 a+0.926
 all+0.896
 his+0.810
 up+0.772
Neg logit contributions
lor-1.183
och-1.152
ider-1.125
umm-1.124
-1.115
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
lor+0.298
och+0.293
+0.289
oling+0.287
ider+0.281
Neg logit contributions
 the-0.107
 a-0.102
 all-0.076
 an-0.069
 her-0.066
Token water
Token ID1660
Feature activation+2.58e+01

Pos logit contributions
 the+0.424
 a+0.399
 all+0.397
 out+0.359
 in+0.358
Neg logit contributions
lor-0.508
-0.483
pill-0.481
plets-0.471
umm-0.464
Token in
Token ID287
Feature activation+1.90e+01

Pos logit contributions
 a+0.327
 the+0.325
 all+0.229
 an+0.180
 more+0.169
Neg logit contributions
lor-1.276
-1.212
oling-1.200
oke-1.197
och-1.189
Token another
Token ID1194
Feature activation+3.01e+01

Pos logit contributions
 the+0.079
 a+0.056
 all+0.044
 in+0.006
 up+0.002
Neg logit contributions
lor-1.050
-1.047
umbs-0.982
och-0.979
umm-0.971
Token
Token ID198
Feature activation-1.91e+01

Pos logit contributions
umm+0.874
lor+0.847
ider+0.822
ides+0.806
istent+0.788
Neg logit contributions
 the-0.655
 a-0.592
 all-0.500
 her-0.485
 his-0.471
TokenNext
Token ID10019
Feature activation-1.88e+01

Pos logit contributions
lor+0.688
umm+0.638
ides+0.636
oke+0.617
och+0.609
Neg logit contributions
 the-0.767
 a-0.715
 all-0.645
 her-0.621
 in-0.580
Token,
Token ID11
Feature activation+4.41e+00

Pos logit contributions
lor+0.859
+0.811
arry+0.794
oke+0.776
oral+0.770
Neg logit contributions
 the-0.323
 a-0.302
 all-0.257
 his-0.252
 up-0.227
Token she
Token ID673
Feature activation+1.72e+00

Pos logit contributions
 his+0.045
 a+0.045
 the+0.041
 all+0.033
 out+0.029
Neg logit contributions
-0.196
lor-0.193
umm-0.192
Really-0.188
owers-0.187
Token found
Token ID1043
Feature activation+1.35e+01

Pos logit contributions
 the+0.051
 a+0.046
 all+0.039
 his+0.038
 her+0.037
Neg logit contributions
lor-0.071
ides-0.064
arry-0.062
jo-0.062
och-0.062
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.071
 his+0.047
 all+0.033
 in+0.028
 their+0.025
Neg logit contributions
lor-0.703
och-0.604
pillar-0.601
pill-0.592
etsy-0.591
Token paints
Token ID32281
Feature activation-5.18e+00

Pos logit contributions
 the+0.876
 a+0.749
 all+0.670
 out+0.623
 his+0.613
Neg logit contributions
och-1.361
lor-1.341
umm-1.311
pill-1.304
pillar-1.302
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
lor+0.244
och+0.233
umm+0.218
ider+0.218
+0.218
Neg logit contributions
 the-0.095
 a-0.083
 his-0.077
 all-0.063
 their-0.056
Token She
Token ID1375
Feature activation-1.22e+01

Pos logit contributions
lor+0.436
ides+0.381
ider+0.362
och+0.360
reetings+0.360
Neg logit contributions
 the-0.494
 a-0.466
 all-0.409
 her-0.396
 his-0.375
Token quickly
Token ID2952
Feature activation+1.44e+01

Pos logit contributions
lor+0.457
oke+0.415
ides+0.415
+0.410
etsy+0.407
Neg logit contributions
 the-0.363
 a-0.336
 his-0.307
 all-0.282
 out-0.274
Token unp
Token ID8593
Feature activation-1.80e+01

Pos logit contributions
 the+0.422
 a+0.393
 his+0.348
 all+0.334
 her+0.308
Neg logit contributions
lor-0.548
ides-0.522
istent-0.504
oke-0.501
och-0.497
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
uddy+0.130
arry+0.126
lor+0.122
ider+0.122
etsy+0.120
Neg logit contributions
 the-0.058
 a-0.051
 all-0.041
 her-0.040
 his-0.038
Token the
Token ID262
Feature activation+2.66e+01

Pos logit contributions
 a+0.023
 the+0.019
 all+0.013
 more+0.010
 in+0.009
Neg logit contributions
-0.678
uddy-0.665
arry-0.663
obby-0.654
istent-0.652
Token truck
Token ID7779
Feature activation+2.78e+00

Pos logit contributions
 the+0.946
 a+0.883
 all+0.785
 her+0.757
 his+0.741
Neg logit contributions
lor-0.983
ides-0.850
och-0.846
umm-0.842
ider-0.841
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
 the+0.050
 a+0.043
 all+0.033
 her+0.029
 his+0.028
Neg logit contributions
lor-0.152
ides-0.139
arry-0.138
och-0.138
uddy-0.136
Token saw
Token ID2497
Feature activation+1.09e+01

Pos logit contributions
 the+0.438
 a+0.435
 all+0.412
 her+0.393
 more+0.376
Neg logit contributions
oke-0.426
lor-0.424
och-0.417
uddy-0.413
etsy-0.402
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.046
 in+0.024
 all+0.023
 a+0.018
 her+0.017
Neg logit contributions
lor-0.571
oke-0.555
och-0.532
uddy-0.523
oc-0.519
Token men
Token ID1450
Feature activation-8.09e+00

Pos logit contributions
 the+1.005
 a+0.916
 her+0.815
 all+0.799
 his+0.746
Neg logit contributions
oke-1.331
lor-1.316
ides-1.260
och-1.257
arry-1.256
Token wearing
Token ID5762
Feature activation-2.07e+00

Pos logit contributions
lor+0.371
ides+0.347
oke+0.346
och+0.346
ider+0.345
Neg logit contributions
 the-0.183
 a-0.167
 all-0.133
 her-0.129
 his-0.124
Token yellow
Token ID7872
Feature activation+1.94e+01

Pos logit contributions
lor+0.101
+0.095
aron+0.094
och+0.094
anas+0.094
Neg logit contributions
 the-0.013
 a-0.008
 all-0.007
 her-0.006
 out-0.005
Token hats
Token ID23910
Feature activation-1.60e+00

Pos logit contributions
 the+0.604
 a+0.545
 all+0.493
 her+0.455
 his+0.433
Neg logit contributions
lor-0.763
ides-0.708
-0.671
arry-0.665
och-0.663
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
lor+0.075
ides+0.068
och+0.067
ider+0.066
arry+0.065
Neg logit contributions
 the-0.041
 a-0.036
 all-0.030
 his-0.029
 her-0.028
Token,
Token ID11
Feature activation+4.40e+00

Pos logit contributions
lor+0.062
uddy+0.061
+0.058
oling+0.057
etsy+0.057
Neg logit contributions
 the-0.029
 a-0.026
 all-0.024
 her-0.021
 up-0.020
Token Spot
Token ID15899
Feature activation+5.40e+00

Pos logit contributions
 the+0.142
 a+0.130
 all+0.126
 her+0.117
 out+0.116
Neg logit contributions
uddy-0.134
-0.131
lor-0.126
etsy-0.122
ongo-0.117
Token decided
Token ID3066
Feature activation-8.94e+00

Pos logit contributions
 the+0.158
 a+0.143
 all+0.127
 his+0.118
 her+0.117
Neg logit contributions
lor-0.229
uddy-0.200
arry-0.199
ides-0.199
jo-0.198
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
lor+0.402
ongo+0.360
uddy+0.356
aring+0.355
arry+0.355
Neg logit contributions
 the-0.154
 a-0.135
 all-0.125
 her-0.104
 out-0.103
Token investigate
Token ID9161
Feature activation+9.99e-01

Pos logit contributions
 the+0.338
 a+0.307
 all+0.255
 his+0.242
 her+0.237
Neg logit contributions
lor-0.611
ides-0.545
arry-0.532
-0.530
uddy-0.529
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.016
 a+0.015
 all+0.011
 her+0.011
 out+0.010
Neg logit contributions
lor-0.044
jo-0.042
arry-0.041
oke-0.040
oc-0.039
Token far
Token ID1290
Feature activation+2.33e+01

Pos logit contributions
 the+1.003
 a+0.940
 all+0.820
 his+0.748
 her+0.747
Neg logit contributions
lor-1.357
ides-1.254
arry-1.253
oke-1.247
oral-1.232
Token away
Token ID1497
Feature activation+2.51e+01

Pos logit contributions
 a+0.467
 the+0.451
 all+0.361
 her+0.306
 an+0.305
Neg logit contributions
lor-1.057
och-0.999
arry-0.994
oke-0.976
jo-0.971
Token bushes
Token ID37413
Feature activation+1.18e+01

Pos logit contributions
 the+0.535
 a+0.492
 all+0.418
 her+0.386
 his+0.347
Neg logit contributions
lor-1.115
arry-1.050
umm-0.994
ider-0.990
jo-0.989
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
 the+0.274
 a+0.240
 all+0.201
 his+0.195
 her+0.194
Neg logit contributions
lor-0.528
ider-0.478
ides-0.472
och-0.472
arry-0.469
Token When
Token ID1649
Feature activation-2.49e+01

Pos logit contributions
lor+0.474
ides+0.443
och+0.426
umm+0.426
ider+0.419
Neg logit contributions
 the-0.471
 a-0.441
 his-0.379
 all-0.378
 her-0.348
Token flour
Token ID10601
Feature activation-6.80e+00

Pos logit contributions
 the+1.078
 a+0.951
 all+0.872
 his+0.818
 up+0.811
Neg logit contributions
lor-1.228
ider-1.193
umm-1.190
ides-1.171
och-1.166
Token,
Token ID11
Feature activation+4.40e+00

Pos logit contributions
lor+0.301
+0.286
ides+0.281
oke+0.281
och+0.279
Neg logit contributions
 the-0.146
 a-0.138
 his-0.110
 all-0.106
 her-0.103
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.085
 a+0.073
 all+0.071
 his+0.068
 out+0.065
Neg logit contributions
lor-0.168
plets-0.163
aron-0.162
-0.160
mmmm-0.160
Token eggs
Token ID9653
Feature activation-6.58e+00

Pos logit contributions
 the+1.114
 a+1.008
 all+0.928
 his+0.879
 up+0.857
Neg logit contributions
lor-1.206
ides-1.138
umm-1.134
ider-1.125
och-1.125
Token,
Token ID11
Feature activation+4.39e+00

Pos logit contributions
lor+0.311
ides+0.293
och+0.288
oke+0.287
+0.286
Neg logit contributions
 the-0.136
 a-0.126
 his-0.099
 all-0.097
 her-0.089
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.086
 a+0.069
 all+0.069
 his+0.068
 up+0.062
Neg logit contributions
lor-0.177
aron-0.168
umbs-0.165
-0.164
plets-0.163
Token milk
Token ID7545
Feature activation+4.83e+00

Pos logit contributions
 the+1.108
 a+1.009
 all+0.921
 his+0.876
 up+0.857
Neg logit contributions
lor-1.232
ides-1.162
umm-1.142
och-1.137
ider-1.129
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
 the+0.100
 a+0.091
 his+0.074
 all+0.071
 her+0.066
Neg logit contributions
lor-0.233
ides-0.215
-0.213
och-0.212
umm-0.210
Token some
Token ID617
Feature activation+3.54e+01

Pos logit contributions
 the+0.241
 all+0.178
 up+0.178
 his+0.169
 out+0.160
Neg logit contributions
lor-0.663
-0.624
ider-0.623
mmmm-0.620
plets-0.620
Token sugar
Token ID7543
Feature activation-1.10e+00

Pos logit contributions
 the+1.079
 a+0.979
 all+0.882
 his+0.835
 her+0.801
Neg logit contributions
lor-1.322
ides-1.231
umm-1.209
och-1.200
ider-1.197
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
lor+0.048
ides+0.044
ider+0.043
och+0.043
umm+0.042
Neg logit contributions
 the-0.030
 a-0.027
 all-0.023
 his-0.022
 her-0.021
Token you
Token ID345
Feature activation+7.81e+00

Pos logit contributions
lor+0.004
ides+0.003
arry+0.003
och+0.003
umm+0.003
Neg logit contributions
 the-0.003
 a-0.003
 all-0.003
 his-0.003
 her-0.003
Token please
Token ID3387
Feature activation+1.28e+00

Pos logit contributions
 the+0.213
 a+0.187
 his+0.169
 all+0.161
 up+0.150
Neg logit contributions
lor-0.318
ides-0.296
-0.291
och-0.288
arry-0.287
Token supply
Token ID5127
Feature activation+9.89e+00

Pos logit contributions
 the+0.035
 a+0.031
 his+0.029
 all+0.027
 her+0.026
Neg logit contributions
lor-0.051
ides-0.048
-0.048
ider-0.046
och-0.046
Token me
Token ID502
Feature activation+1.53e+01

Pos logit contributions
 the+0.158
 a+0.137
 all+0.124
 in+0.111
 on+0.110
Neg logit contributions
lor-0.440
mmmm-0.389
och-0.379
mmm-0.378
grand-0.377
Token with
Token ID351
Feature activation+1.63e+01

Pos logit contributions
 the+0.290
 a+0.236
 his+0.219
 her+0.212
 all+0.209
Neg logit contributions
lor-0.671
ider-0.570
ase-0.569
och-0.568
aj-0.563
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.281
 a+0.242
 up+0.231
 all+0.212
 out+0.208
Neg logit contributions
lor-0.687
och-0.644
ides-0.615
mmmm-0.614
-0.613
Token past
Token ID1613
Feature activation+1.99e+01

Pos logit contributions
 the+0.974
 a+0.863
 all+0.774
 his+0.755
 her+0.704
Neg logit contributions
lor-1.382
och-1.326
umm-1.310
ides-1.300
oral-1.269
Tokenel
Token ID417
Feature activation+1.26e+01

Pos logit contributions
 the+0.804
 a+0.762
 all+0.711
 his+0.680
 her+0.666
Neg logit contributions
lor-0.562
ides-0.466
och-0.456
umm-0.447
arry-0.438
Token colors
Token ID7577
Feature activation+3.60e+00

Pos logit contributions
 the+0.438
 a+0.397
 all+0.357
 his+0.341
 her+0.337
Neg logit contributions
lor-0.465
ides-0.419
och-0.400
arry-0.395
ider-0.393
Token?
Token ID30
Feature activation-1.60e+01

Pos logit contributions
 the+0.132
 a+0.123
 all+0.108
 his+0.106
 her+0.105
Neg logit contributions
lor-0.122
ides-0.106
och-0.105
arry-0.103
ider-0.102
Token I
Token ID314
Feature activation-7.33e+00

Pos logit contributions
lor+0.589
ides+0.509
umm+0.488
uer+0.474
istent+0.462
Neg logit contributions
 the-0.648
 a-0.629
 all-0.525
 in-0.504
 her-0.499
Token a
Token ID257
Feature activation+3.07e+01

Pos logit contributions
 the+0.119
 a+0.089
 all+0.085
 out+0.077
 in+0.073
Neg logit contributions
lor-0.661
mmmm-0.586
'."-0.580
mmm-0.580
grand-0.574
Token balloon
Token ID21190
Feature activation+3.97e-01

Pos logit contributions
 the+1.005
 a+0.905
 all+0.829
 his+0.771
 her+0.769
Neg logit contributions
lor-1.227
ides-1.073
och-1.071
arry-1.055
-1.052
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
 the+0.011
 a+0.010
 all+0.008
 his+0.008
 her+0.008
Neg logit contributions
lor-0.018
ides-0.016
och-0.016
ider-0.016
arry-0.016
Token I
Token ID314
Feature activation-7.31e+00

Pos logit contributions
lor+0.448
ides+0.392
jo+0.387
uer+0.379
anne+0.370
Neg logit contributions
 the-0.484
 a-0.469
 her-0.372
 all-0.370
 some-0.368
Token have
Token ID423
Feature activation+1.23e+01

Pos logit contributions
lor+0.297
ides+0.264
och+0.258
arry+0.258
umm+0.255
Neg logit contributions
 the-0.239
 a-0.220
 all-0.194
 his-0.182
 her-0.180
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.127
 a+0.079
 all+0.079
 up+0.071
 his+0.065
Neg logit contributions
lor-0.685
ides-0.598
och-0.595
mmmm-0.584
ider-0.584
Token money
Token ID1637
Feature activation+1.32e+01

Pos logit contributions
 the+0.978
 a+0.866
 all+0.774
 her+0.715
 his+0.713
Neg logit contributions
lor-1.497
ides-1.385
och-1.353
ider-1.346
umm-1.321
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
 the+0.270
 a+0.229
 all+0.194
 his+0.182
 her+0.175
Neg logit contributions
lor-0.638
och-0.604
ider-0.602
ides-0.591
oke-0.576
Token You
Token ID921
Feature activation-2.63e+01

Pos logit contributions
lor+0.424
ides+0.368
jo+0.365
uer+0.358
anne+0.352
Neg logit contributions
 the-0.501
 a-0.491
 all-0.396
 some-0.392
 his-0.390
Token can
Token ID460
Feature activation-1.54e-01

Pos logit contributions
lor+0.929
ides+0.859
och+0.832
arry+0.813
oke+0.805
Neg logit contributions
 the-0.877
 a-0.796
 his-0.725
 her-0.716
 all-0.694
Token choose
Token ID3853
Feature activation+1.06e+01

Pos logit contributions
lor+0.007
ides+0.006
och+0.006
arry+0.006
ider+0.006
Neg logit contributions
 the-0.005
 a-0.004
 all-0.004
 his-0.003
 her-0.003
Token mom
Token ID1995
Feature activation-7.31e+00

Pos logit contributions
 the+0.890
 a+0.782
 all+0.677
 her+0.631
 his+0.612
Neg logit contributions
lor-1.463
ider-1.347
ides-1.333
uddy-1.324
umm-1.322
Token offered
Token ID4438
Feature activation-1.50e+01

Pos logit contributions
lor+0.235
och+0.206
oke+0.204
ides+0.203
jo+0.202
Neg logit contributions
 the-0.283
 a-0.267
 all-0.237
 his-0.227
 her-0.221
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
enny+0.484
lor+0.479
ongo+0.468
uddy+0.467
abs+0.461
Neg logit contributions
 the-0.326
 a-0.310
 all-0.284
 big-0.258
 in-0.252
Token buy
Token ID2822
Feature activation+3.70e+00

Pos logit contributions
 the+0.330
 a+0.304
 all+0.267
 on+0.246
 in+0.238
Neg logit contributions
lor-0.564
-0.503
ides-0.502
arry-0.495
etsy-0.492
Token him
Token ID683
Feature activation+1.99e+01

Pos logit contributions
 the+0.067
 in+0.060
 all+0.056
 a+0.056
 on+0.052
Neg logit contributions
lor-0.139
ongo-0.131
umm-0.130
och-0.127
-0.127
Token some
Token ID617
Feature activation+3.55e+01

Pos logit contributions
 the+0.148
 all+0.071
 a+0.050
 big+0.029
 out+0.023
Neg logit contributions
lor-1.003
etsy-0.919
ongo-0.917
grand-0.908
aj-0.907
Token pizza
Token ID14256
Feature activation+9.46e+00

Pos logit contributions
 the+0.983
 a+0.854
 all+0.777
 his+0.692
 her+0.684
Neg logit contributions
lor-1.397
ides-1.308
och-1.306
umm-1.302
ider-1.288
Token.
Token ID13
Feature activation-1.22e+01

Pos logit contributions
 the+0.235
 a+0.207
 all+0.172
 her+0.160
 his+0.157
Neg logit contributions
lor-0.441
ides-0.399
och-0.396
ider-0.393
umm-0.388
Token Sammy
Token ID44067
Feature activation+3.57e+00

Pos logit contributions
lor+0.505
ides+0.452
ider+0.436
och+0.431
umm+0.429
Neg logit contributions
 the-0.440
 a-0.405
 all-0.363
 her-0.338
 his-0.337
Token's
Token ID338
Feature activation+1.06e+01

Pos logit contributions
 the+0.107
 a+0.101
 out+0.089
 all+0.088
 her+0.085
Neg logit contributions
lor-0.136
-0.115
oke-0.114
etsy-0.114
istent-0.114
Token eyes
Token ID2951
Feature activation-3.37e+00

Pos logit contributions
 the+0.313
 a+0.272
 all+0.242
 her+0.242
 out+0.232
Neg logit contributions
lor-0.395
uddy-0.366
-0.363
ider-0.362
umm-0.359
TokenH
Token ID39
Feature activation-4.00e+01

Pos logit contributions
lor+0.982
ides+0.928
umm+0.861
istent+0.854
arry+0.844
Neg logit contributions
 the-1.190
 a-1.120
 her-0.946
 his-0.942
 in-0.903
Tokenola
Token ID5708
Feature activation-3.76e+01

Pos logit contributions
lor+0.996
umm+0.791
ides+0.755
jo+0.754
arians+0.742
Neg logit contributions
 the-2.010
 a-1.874
 all-1.747
 in-1.615
 his-1.601
Token,
Token ID11
Feature activation-8.47e+00

Pos logit contributions
lor+1.951
ides+1.772
arry+1.751
och+1.751
oke+1.751
Neg logit contributions
 the-0.751
 a-0.676
 all-0.544
 his-0.488
 her-0.474
Token me
Token ID502
Feature activation+3.39e+00

Pos logit contributions
lor+0.336
oke+0.298
och+0.293
ides+0.293
reetings+0.289
Neg logit contributions
 the-0.237
 a-0.221
 all-0.194
 his-0.185
 her-0.180
Token ll
Token ID32660
Feature activation-3.21e+01

Pos logit contributions
 the+0.086
 a+0.077
 all+0.066
 his+0.063
 her+0.062
Neg logit contributions
lor-0.157
ides-0.142
ider-0.140
oke-0.140
arry-0.140
Tokenamo
Token ID18811
Feature activation-7.18e+01

Pos logit contributions
lor+0.842
ides+0.753
istent+0.746
och+0.734
umm+0.728
Neg logit contributions
 the-1.451
 a-1.395
 all-1.220
 in-1.216
 out-1.172
Token Emma
Token ID18966
Feature activation-1.46e+01

Pos logit contributions
lor+2.741
ides+2.377
och+2.366
arry+2.354
oke+2.319
Neg logit contributions
 the-2.387
 a-2.243
 all-1.993
 his-1.866
 her-1.807
Token."
Token ID526
Feature activation-3.62e+01

Pos logit contributions
lor+0.632
ides+0.564
och+0.552
arry+0.552
umm+0.547
Neg logit contributions
 the-0.435
 a-0.400
 all-0.347
 his-0.322
 her-0.317
Token
Token ID198
Feature activation-2.80e+01

Pos logit contributions
lor+1.481
ides+1.371
umm+1.334
ider+1.283
oke+1.262
Neg logit contributions
 the-1.366
 a-1.259
 her-1.082
 all-1.062
 his-1.057
Token
Token ID198
Feature activation-2.75e+01

Pos logit contributions
umm+1.287
lor+1.228
ides+1.227
istent+1.164
ider+1.161
Neg logit contributions
 the-0.955
 a-0.889
 her-0.736
 his-0.719
 all-0.670
TokenL
Token ID43
Feature activation-2.62e+01

Pos logit contributions
lor+0.938
umm+0.920
ides+0.900
oke+0.831
istent+0.823
Neg logit contributions
 the-1.142
 a-1.098
 her-0.956
 his-0.930
 all-0.908
Token end
Token ID886
Feature activation-3.14e+01

Pos logit contributions
lor+0.960
ides+0.883
oke+0.877
och+0.870
istent+0.857
Neg logit contributions
 the-0.770
 a-0.717
 all-0.629
 his-0.599
 her-0.595
Token.
Token ID13
Feature activation-2.94e+01

Pos logit contributions
lor+1.479
ides+1.338
arry+1.321
och+1.314
ider+1.313
Neg logit contributions
 the-0.791
 a-0.714
 all-0.609
 his-0.551
 her-0.538
Token
Token ID198
Feature activation-3.46e+01

Pos logit contributions
lor+1.234
ides+1.187
umm+1.137
uer+1.111
istent+1.091
Neg logit contributions
 the-1.071
 a-1.008
 his-0.819
 her-0.801
 all-0.800
Token
Token ID198
Feature activation-3.31e+01

Pos logit contributions
umm+1.776
lor+1.712
ides+1.687
istent+1.669
arians+1.645
Neg logit contributions
 the-0.981
 a-0.913
 his-0.710
 all-0.672
 her-0.671
TokenM
Token ID44
Feature activation-3.89e+01

Pos logit contributions
umm+1.334
ides+1.226
lor+1.216
istent+1.198
plets+1.141
Neg logit contributions
 the-1.256
 a-1.202
 his-1.012
 her-1.003
 all-0.944
Tokenoral
Token ID6864
Feature activation-6.80e+01

Pos logit contributions
lor+0.963
oling+0.773
istent+0.763
och+0.748
ides+0.740
Neg logit contributions
 the-1.875
 a-1.815
 all-1.656
 her-1.576
 in-1.574
Token value
Token ID1988
Feature activation-1.78e+01

Pos logit contributions
lor+3.155
ides+2.837
arry+2.788
och+2.783
umm+2.765
Neg logit contributions
 the-1.843
 a-1.671
 all-1.430
 his-1.313
 her-1.300
Token:
Token ID25
Feature activation-3.07e+01

Pos logit contributions
lor+0.797
arry+0.716
oke+0.709
reetings+0.704
ides+0.699
Neg logit contributions
 the-0.426
 a-0.412
 all-0.334
 his-0.317
 her-0.312
Token The
Token ID383
Feature activation-9.88e+00

Pos logit contributions
lor+1.064
ides+0.924
arry+0.902
och+0.897
umm+0.891
Neg logit contributions
 the-1.179
 a-1.106
 all-0.993
 his-0.940
 her-0.935
Token story
Token ID1621
Feature activation-3.52e+01

Pos logit contributions
lor+0.383
oral+0.341
umm+0.339
och+0.335
arry+0.334
Neg logit contributions
 the-0.312
 a-0.279
 all-0.252
 his-0.247
 her-0.245
Token teaches
Token ID17324
Feature activation-2.49e+01

Pos logit contributions
lor+1.224
ides+1.087
oke+1.064
och+1.030
arry+1.027
Neg logit contributions
 the-1.268
 a-1.197
 all-1.059
 his-1.057
 her-1.034
Token gentle
Token ID10296
Feature activation+9.08e-01

Pos logit contributions
umm+1.147
lor+1.077
ides+1.055
ider+1.037
istent+1.019
Neg logit contributions
 the-0.810
 a-0.736
 his-0.606
 all-0.592
 big-0.540
Token.
Token ID13
Feature activation-2.94e+01

Pos logit contributions
 the+0.022
 a+0.020
 all+0.017
 his+0.015
 her+0.015
Neg logit contributions
lor-0.043
ides-0.039
umm-0.038
arry-0.038
jo-0.038
Token
Token ID198
Feature activation-3.46e+01

Pos logit contributions
lor+1.179
ides+1.098
umm+1.048
uer+1.014
ider+1.005
Neg logit contributions
 the-1.128
 a-1.051
 his-0.881
 all-0.871
 her-0.862
Token
Token ID198
Feature activation-3.31e+01

Pos logit contributions
umm+1.678
lor+1.603
ides+1.577
ider+1.555
istent+1.531
Neg logit contributions
 the-1.106
 a-1.023
 his-0.825
 her-0.792
 all-0.772
TokenM
Token ID44
Feature activation-3.89e+01

Pos logit contributions
umm+1.288
lor+1.221
ides+1.214
istent+1.142
oke+1.113
Neg logit contributions
 the-1.270
 a-1.209
 his-1.024
 her-1.022
 all-0.956
Tokenoral
Token ID6864
Feature activation-6.80e+01

Pos logit contributions
lor+0.925
oling+0.695
istent+0.686
och+0.684
ides+0.684
Neg logit contributions
 the-1.941
 a-1.873
 all-1.714
 her-1.648
 in-1.635
Token:
Token ID25
Feature activation-3.14e+01

Pos logit contributions
lor+3.143
ides+2.823
arry+2.781
och+2.772
umm+2.750
Neg logit contributions
 the-1.844
 a-1.671
 all-1.432
 his-1.318
 her-1.305
Token Being
Token ID11204
Feature activation-3.53e+01

Pos logit contributions
lor+0.974
ides+0.831
umm+0.807
och+0.800
ider+0.791
Neg logit contributions
 the-1.379
 a-1.294
 all-1.175
 his-1.118
 her-1.105
Token respectful
Token ID25923
Feature activation-2.06e+01

Pos logit contributions
lor+1.281
reetings+1.249
aring+1.168
arry+1.163
oke+1.143
Neg logit contributions
 the-0.776
 all-0.642
 a-0.630
 her-0.605
 his-0.566
Token and
Token ID290
Feature activation+5.89e+00

Pos logit contributions
lor+0.981
ides+0.891
arry+0.879
oke+0.878
och+0.873
Neg logit contributions
 the-0.482
 a-0.436
 all-0.363
 his-0.335
 her-0.330
Token gentle
Token ID10296
Feature activation+1.05e+01

Pos logit contributions
 the+0.187
 a+0.178
 all+0.159
 her+0.152
 out+0.152
Neg logit contributions
lor-0.190
reetings-0.182
istent-0.179
aring-0.178
arry-0.177
Token up
Token ID510
Feature activation+2.05e+01

Pos logit contributions
lor+0.095
uer+0.083
ides+0.082
ongo+0.082
etsy+0.081
Neg logit contributions
 the-0.045
 a-0.039
 their-0.030
 all-0.029
 his-0.028
Token."
Token ID526
Feature activation-3.80e+01

Pos logit contributions
 the+0.497
 a+0.454
 all+0.389
 out+0.345
 his+0.344
Neg logit contributions
lor-0.882
ider-0.791
ides-0.787
och-0.784
arry-0.772
Token
Token ID198
Feature activation-2.96e+01

Pos logit contributions
lor+1.633
ides+1.536
umm+1.505
ider+1.453
istent+1.406
Neg logit contributions
 the-1.396
 a-1.256
 his-1.102
 her-1.090
 all-1.056
Token
Token ID198
Feature activation-2.90e+01

Pos logit contributions
umm+1.358
ides+1.305
lor+1.287
ider+1.274
istent+1.241
Neg logit contributions
 the-1.018
 a-0.919
 his-0.791
 her-0.770
 all-0.704
TokenM
Token ID44
Feature activation-3.54e+01

Pos logit contributions
umm+1.035
ides+0.992
lor+0.979
istent+0.923
plets+0.904
Neg logit contributions
 the-1.175
 a-1.120
 his-0.981
 her-0.974
 all-0.907
Tokenoral
Token ID6864
Feature activation-6.49e+01

Pos logit contributions
lor+0.888
plets+0.722
oling+0.708
istent+0.683
ides+0.678
Neg logit contributions
 the-1.691
 a-1.639
 all-1.507
 her-1.462
 his-1.444
Token of
Token ID286
Feature activation+8.12e+00

Pos logit contributions
lor+3.094
ides+2.806
umm+2.746
och+2.735
arry+2.726
Neg logit contributions
 the-1.727
 a-1.575
 all-1.322
 his-1.202
 her-1.181
Token the
Token ID262
Feature activation+1.87e+01

Pos logit contributions
 a+0.012
 the+0.011
 all-0.011
 out-0.017
 in-0.023
Neg logit contributions
lor-0.518
jo-0.476
reetings-0.465
oral-0.464
oke-0.461
Token story
Token ID1621
Feature activation-3.35e+01

Pos logit contributions
 the+0.809
 a+0.763
 all+0.694
 his+0.663
 her+0.658
Neg logit contributions
lor-0.566
ides-0.479
arry-0.465
och-0.464
umm-0.462
Token:
Token ID25
Feature activation-2.74e+01

Pos logit contributions
lor+1.379
ides+1.235
umm+1.224
och+1.208
istent+1.198
Neg logit contributions
 the-1.103
 a-1.017
 all-0.896
 his-0.826
 her-0.821
Token Be
Token ID1355
Feature activation-1.08e+01

Pos logit contributions
lor+0.918
ides+0.797
umm+0.778
och+0.767
ider+0.766
Neg logit contributions
 the-1.144
 a-1.069
 all-0.962
 his-0.916
 her-0.906
Token away
Token ID1497
Feature activation+1.46e+01

Pos logit contributions
lor+0.760
ider+0.733
arry+0.715
uddy+0.696
etsy+0.689
Neg logit contributions
 the-0.365
 a-0.328
 all-0.243
 her-0.231
 his-0.214
Token.
Token ID13
Feature activation-2.20e+01

Pos logit contributions
 the+0.355
 a+0.331
 all+0.270
 her+0.250
 his+0.241
Neg logit contributions
lor-0.607
arry-0.575
ider-0.570
uer-0.555
uddy-0.545
Token
Token ID198
Feature activation-2.85e+01

Pos logit contributions
lor+0.912
ides+0.883
umm+0.835
och+0.815
uer+0.801
Neg logit contributions
 the-0.813
 a-0.755
 his-0.637
 all-0.630
 her-0.606
Token
Token ID198
Feature activation-2.80e+01

Pos logit contributions
umm+1.400
lor+1.283
ides+1.275
istent+1.238
ider+1.233
Neg logit contributions
 the-0.930
 a-0.863
 his-0.696
 all-0.666
 her-0.660
TokenM
Token ID44
Feature activation-3.44e+01

Pos logit contributions
umm+1.078
lor+1.014
ides+1.002
istent+0.950
oke+0.927
Neg logit contributions
 the-1.073
 a-1.028
 his-0.870
 her-0.852
 all-0.845
Tokenoral
Token ID6864
Feature activation-6.40e+01

Pos logit contributions
lor+0.749
och+0.551
oling+0.548
ides+0.548
istent+0.544
Neg logit contributions
 the-1.744
 a-1.682
 all-1.570
 his-1.501
 in-1.493
Token of
Token ID286
Feature activation+8.88e+00

Pos logit contributions
lor+2.993
ides+2.695
arry+2.646
och+2.643
umm+2.628
Neg logit contributions
 the-1.731
 a-1.570
 all-1.338
 his-1.225
 her-1.213
Token the
Token ID262
Feature activation+1.94e+01

Pos logit contributions
 a+0.033
 the+0.028
 all+0.011
 out+0.006
 in-0.004
Neg logit contributions
lor-0.536
jo-0.483
reetings-0.482
oral-0.481
grand-0.472
Token story
Token ID1621
Feature activation-3.28e+01

Pos logit contributions
 the+0.875
 a+0.828
 all+0.749
 his+0.731
 her+0.729
Neg logit contributions
lor-0.542
ides-0.447
arry-0.438
umm-0.436
oke-0.430
Token:
Token ID25
Feature activation-2.68e+01

Pos logit contributions
lor+1.317
ides+1.174
umm+1.159
och+1.148
istent+1.137
Neg logit contributions
 the-1.115
 a-1.030
 all-0.910
 his-0.845
 her-0.835
Token When
Token ID1649
Feature activation-3.12e+01

Pos logit contributions
lor+0.844
ides+0.721
umm+0.700
och+0.697
arry+0.692
Neg logit contributions
 the-1.153
 a-1.083
 all-0.982
 his-0.934
 her-0.926
Token said
Token ID531
Feature activation-2.23e+01

Pos logit contributions
 the+0.369
 a+0.341
 all+0.288
 his+0.275
 her+0.274
Neg logit contributions
lor-0.529
ides-0.472
umm-0.462
arry-0.459
ider-0.454
Token to
Token ID284
Feature activation-3.80e+00

Pos logit contributions
lor+1.110
ides+0.997
oke+0.983
och+0.980
ider+0.980
Neg logit contributions
 the-0.464
 a-0.403
 all-0.347
 his-0.292
 her-0.283
Token Dave
Token ID9935
Feature activation-1.94e+01

Pos logit contributions
lor+0.179
ides+0.162
uddy+0.160
arry+0.160
jo+0.159
Neg logit contributions
 the-0.081
 a-0.074
 all-0.063
 in-0.055
 his-0.053
Token,
Token ID11
Feature activation-9.57e+00

Pos logit contributions
lor+0.971
ides+0.880
arry+0.873
oke+0.869
och+0.866
Neg logit contributions
 the-0.419
 a-0.378
 all-0.309
 his-0.268
 her-0.267
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+0.365
ides+0.320
och+0.314
arry+0.314
oke+0.310
Neg logit contributions
 the-0.332
 a-0.309
 all-0.275
 his-0.258
 her-0.256
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.408
jo+1.207
ides+1.185
uddy+1.151
umm+1.143
Neg logit contributions
 the-2.269
 a-2.108
 all-1.904
 his-1.830
 in-1.821
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.666
uddy+1.517
ides+1.496
umm+1.481
etsy+1.454
Neg logit contributions
 the-3.003
 a-2.810
 his-2.454
 all-2.440
 in-2.339
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.846
lor+1.639
oral+1.636
ides+1.627
ider+1.582
Neg logit contributions
 the-1.477
 a-1.371
 all-1.055
 in-1.019
 it-1.016
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.051
umm+0.912
ides+0.869
och+0.856
uddy+0.845
Neg logit contributions
 the-2.065
 a-1.918
 all-1.742
 his-1.724
 her-1.691
TokenHold
Token ID26807
Feature activation-3.13e+01

Pos logit contributions
lor+1.768
umm+1.621
ides+1.602
arians+1.495
och+1.486
Neg logit contributions
 the-1.825
 a-1.767
 his-1.452
 all-1.420
 her-1.406
Token on
Token ID319
Feature activation+1.12e+01

Pos logit contributions
lor+1.648
och+1.480
ides+1.473
oke+1.472
arry+1.463
Neg logit contributions
 the-0.517
 a-0.457
 all-0.370
 his-0.300
 her-0.299
Token and
Token ID290
Feature activation-3.23e+00

Pos logit contributions
lor+0.029
ides+0.025
och+0.025
arry+0.025
umm+0.025
Neg logit contributions
 the-0.028
 a-0.026
 all-0.023
 his-0.021
 her-0.021
Token then
Token ID788
Feature activation-1.23e+01

Pos logit contributions
lor+0.134
jo+0.124
och+0.122
ides+0.121
oke+0.120
Neg logit contributions
 the-0.072
 a-0.066
 all-0.057
 out-0.049
 his-0.049
Token said
Token ID531
Feature activation-1.93e+01

Pos logit contributions
lor+0.530
och+0.482
ides+0.473
oke+0.470
jo+0.469
Neg logit contributions
 the-0.278
 a-0.258
 all-0.220
 his-0.188
 her-0.186
Token,
Token ID11
Feature activation-9.57e+00

Pos logit contributions
lor+0.966
och+0.855
ides+0.852
arry+0.851
ider+0.851
Neg logit contributions
 the-0.353
 a-0.297
 all-0.267
 his-0.206
 her-0.202
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+0.331
ides+0.287
och+0.282
arry+0.282
istent+0.279
Neg logit contributions
 the-0.357
 a-0.334
 all-0.302
 his-0.283
 her-0.281
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.214
jo+1.005
ides+0.987
umm+0.978
uddy+0.975
Neg logit contributions
 the-2.409
 a-2.287
 all-2.041
 in-1.982
 her-1.940
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.468
uddy+1.328
umm+1.307
ides+1.266
etsy+1.234
Neg logit contributions
 the-3.169
 a-3.008
 all-2.620
 her-2.556
 his-2.547
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.655
lor+1.533
ides+1.467
oral+1.446
ider+1.432
Neg logit contributions
 the-1.637
 a-1.521
 all-1.247
 in-1.165
 it-1.151
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.054
umm+0.931
och+0.878
ides+0.857
uddy+0.853
Neg logit contributions
 the-2.050
 a-1.916
 all-1.732
 her-1.692
 his-1.691
TokenNo
Token ID2949
Feature activation-2.84e+01

Pos logit contributions
lor+1.697
umm+1.533
ides+1.525
uer+1.417
och+1.409
Neg logit contributions
 the-1.884
 a-1.821
 his-1.492
 all-1.487
 her-1.475
Token,
Token ID11
Feature activation-4.43e+00

Pos logit contributions
lor+1.463
ides+1.334
arry+1.317
och+1.304
istent+1.304
Neg logit contributions
 the-0.603
 a-0.537
 all-0.435
 his-0.387
 her-0.382
Token with
Token ID351
Feature activation-2.11e+00

Pos logit contributions
lor+0.022
uddy+0.022
umm+0.022
+0.021
arry+0.021
Neg logit contributions
 the-0.023
 a-0.022
 her-0.020
 out-0.019
 more-0.018
Token a
Token ID257
Feature activation+1.35e+01

Pos logit contributions
lor+0.107
jo+0.098
ides+0.098
och+0.098
umm+0.097
Neg logit contributions
 the-0.025
 a-0.020
 all-0.014
 out-0.013
 in-0.012
Token smile
Token ID8212
Feature activation-2.35e+01

Pos logit contributions
 the+0.367
 a+0.325
 all+0.285
 her+0.267
 his+0.264
Neg logit contributions
lor-0.605
och-0.538
ides-0.535
arry-0.531
ider-0.529
Token.
Token ID13
Feature activation-2.61e+01

Pos logit contributions
lor+1.207
ides+1.077
och+1.072
jo+1.061
ider+1.057
Neg logit contributions
 the-0.465
 a-0.408
 all-0.327
 his-0.279
 her-0.269
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+1.115
ides+1.054
umm+0.991
uer+0.973
istent+0.948
Neg logit contributions
 the-0.972
 a-0.907
 his-0.761
 her-0.745
 all-0.736
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.178
umm+1.023
jo+1.016
ides+0.985
uddy+0.973
Neg logit contributions
 the-2.460
 a-2.300
 all-2.045
 in-2.014
 her-1.967
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.501
uddy+1.428
umm+1.406
ides+1.356
unch+1.312
Neg logit contributions
 the-3.118
 a-2.935
 all-2.530
 her-2.508
 his-2.486
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.828
lor+1.682
ider+1.639
ides+1.615
oral+1.597
Neg logit contributions
 the-1.516
 a-1.391
 all-1.123
 it-1.039
 in-1.014
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.011
umm+0.878
ides+0.821
och+0.813
uddy+0.803
Neg logit contributions
 the-2.117
 a-1.968
 all-1.796
 her-1.749
 his-1.746
TokenDo
Token ID5211
Feature activation-2.94e+01

Pos logit contributions
lor+1.771
ides+1.585
umm+1.585
arians+1.490
och+1.478
Neg logit contributions
 the-1.831
 a-1.793
 his-1.452
 all-1.443
 her-1.427
Token you
Token ID345
Feature activation-1.02e+00

Pos logit contributions
lor+1.338
ides+1.200
arry+1.180
och+1.171
umm+1.160
Neg logit contributions
 the-0.815
 a-0.740
 all-0.638
 his-0.585
 her-0.577
TokenHis
Token ID6653
Feature activation-2.62e+01

Pos logit contributions
 the+0.128
 a+0.116
 his+0.099
 her+0.099
 all+0.099
Neg logit contributions
lor-0.163
ides-0.145
umm-0.145
jo-0.141
ider-0.139
Token mom
Token ID1995
Feature activation-2.45e+01

Pos logit contributions
lor+1.146
ides+1.027
arry+1.003
ider+1.001
uddy+1.000
Neg logit contributions
 the-0.733
 a-0.662
 all-0.572
 her-0.523
 his-0.512
Token smiled
Token ID13541
Feature activation-1.82e+01

Pos logit contributions
lor+1.034
ides+0.927
och+0.908
oke+0.902
arry+0.902
Neg logit contributions
 the-0.742
 a-0.685
 all-0.593
 his-0.556
 her-0.544
Token.
Token ID13
Feature activation-2.61e+01

Pos logit contributions
lor+0.854
istent+0.791
arry+0.786
ides+0.784
umm+0.781
Neg logit contributions
 the-0.364
 a-0.313
 all-0.256
 his-0.211
 her-0.207
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+1.179
ides+1.118
umm+1.045
uer+1.024
ider+1.012
Neg logit contributions
 the-0.919
 a-0.830
 his-0.712
 all-0.673
 her-0.673
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.236
ides+0.995
umm+0.988
jo+0.981
umbs+0.966
Neg logit contributions
 the-2.442
 a-2.282
 in-2.004
 his-1.995
 all-1.989
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.543
umm+1.413
uddy+1.381
ides+1.364
etsy+1.331
Neg logit contributions
 the-3.112
 a-2.918
 his-2.533
 all-2.466
 in-2.454
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.846
lor+1.707
ides+1.633
ider+1.624
oral+1.618
Neg logit contributions
 the-1.515
 a-1.378
 all-1.104
 it-1.041
 his-1.026
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.058
umm+0.915
och+0.863
ides+0.859
uddy+0.837
Neg logit contributions
 the-2.071
 a-1.911
 all-1.730
 his-1.723
 her-1.697
TokenThat
Token ID2504
Feature activation-2.98e+01

Pos logit contributions
lor+1.735
ides+1.558
umm+1.543
arians+1.453
uer+1.440
Neg logit contributions
 the-1.874
 a-1.812
 his-1.495
 all-1.474
 her-1.446
Tokenâ
Token ID22940
Feature activation-4.36e+01

Pos logit contributions
lor+1.164
jo+0.991
ides+0.987
istent+0.982
arry+0.979
Neg logit contributions
 the-0.940
 a-0.860
 all-0.769
 her-0.757
 his-0.739
Token as
Token ID355
Feature activation-1.28e+01

Pos logit contributions
 the+0.091
 a+0.089
 her+0.069
 all+0.060
 his+0.059
Neg logit contributions
oke-0.190
arry-0.189
aring-0.187
istent-0.186
oling-0.184
Token a
Token ID257
Feature activation+1.35e+01

Pos logit contributions
lor+0.537
+0.510
etsy+0.504
och+0.500
uddy+0.497
Neg logit contributions
 the-0.245
 a-0.221
 all-0.180
 her-0.162
 out-0.160
Token gift
Token ID6979
Feature activation-2.59e+01

Pos logit contributions
 the+0.429
 a+0.388
 all+0.355
 her+0.340
 his+0.333
Neg logit contributions
lor-0.523
umm-0.468
och-0.465
-0.464
uddy-0.460
Token.
Token ID13
Feature activation-2.61e+01

Pos logit contributions
lor+1.292
ides+1.160
och+1.147
arry+1.125
umm+1.124
Neg logit contributions
 the-0.543
 a-0.480
 all-0.389
 his-0.362
 her-0.332
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+1.063
ides+0.995
umm+0.945
uer+0.930
istent+0.914
Neg logit contributions
 the-0.994
 a-0.937
 his-0.774
 her-0.761
 all-0.761
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.102
jo+0.954
umm+0.937
uddy+0.928
ides+0.897
Neg logit contributions
 the-2.475
 a-2.362
 all-2.112
 in-2.058
 her-2.013
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.442
uddy+1.400
umm+1.359
ides+1.292
etsy+1.281
Neg logit contributions
 the-3.109
 a-2.988
 all-2.554
 her-2.514
 his-2.504
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.697
lor+1.555
ider+1.506
ides+1.500
oral+1.485
Neg logit contributions
 the-1.603
 a-1.492
 all-1.220
 it-1.132
 in-1.131
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.019
umm+0.890
och+0.845
ides+0.838
uddy+0.824
Neg logit contributions
 the-2.079
 a-1.947
 all-1.774
 her-1.718
 his-1.716
TokenThank
Token ID10449
Feature activation-3.36e+01

Pos logit contributions
lor+1.726
umm+1.563
ides+1.549
och+1.453
arians+1.438
Neg logit contributions
 the-1.863
 a-1.813
 his-1.480
 all-1.466
 her-1.452
Token you
Token ID345
Feature activation-1.02e+00

Pos logit contributions
lor+1.570
ides+1.433
umm+1.403
arry+1.389
och+1.389
Neg logit contributions
 the-0.948
 a-0.865
 all-0.725
 his-0.663
 her-0.661
Token asked
Token ID1965
Feature activation-2.86e+01

Pos logit contributions
 the+0.162
 a+0.144
 her+0.113
 all+0.107
 his+0.105
Neg logit contributions
lor-0.417
ider-0.380
ari-0.373
och-0.370
arry-0.367
Token her
Token ID607
Feature activation+6.93e+00

Pos logit contributions
lor+1.370
och+1.253
etsy+1.249
ides+1.245
uddy+1.244
Neg logit contributions
 the-0.485
 a-0.422
 all-0.358
 big-0.292
 more-0.279
Token mum
Token ID25682
Feature activation-3.05e+01

Pos logit contributions
 the+0.176
 a+0.156
 all+0.135
 his+0.125
 her+0.121
Neg logit contributions
lor-0.317
ides-0.280
och-0.279
arry-0.278
uddy-0.277
Token,
Token ID11
Feature activation-9.57e+00

Pos logit contributions
lor+1.421
oke+1.291
och+1.289
ides+1.282
istent+1.266
Neg logit contributions
 the-0.651
 a-0.577
 all-0.483
 his-0.448
 her-0.412
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+0.342
och+0.305
oke+0.299
arry+0.297
etsy+0.297
Neg logit contributions
 the-0.321
 a-0.298
 all-0.272
 his-0.252
 her-0.246
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.169
ides+0.987
jo+0.983
uddy+0.975
umm+0.970
Neg logit contributions
 the-2.429
 a-2.271
 all-2.048
 in-2.021
 her-1.982
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.455
umm+1.362
uddy+1.349
ides+1.338
unch+1.274
Neg logit contributions
 the-3.134
 a-2.967
 all-2.563
 her-2.542
 in-2.500
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.870
lor+1.679
oral+1.669
ides+1.662
unn+1.612
Neg logit contributions
 the-1.454
 a-1.353
 all-1.018
 in-1.002
 it-0.969
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.070
umm+0.951
ides+0.907
uddy+0.877
och+0.868
Neg logit contributions
 the-2.033
 a-1.890
 all-1.698
 her-1.676
 his-1.651
TokenHow
Token ID2437
Feature activation-3.86e+01

Pos logit contributions
lor+1.788
umm+1.685
ides+1.650
uer+1.548
istent+1.524
Neg logit contributions
 the-1.787
 a-1.722
 her-1.389
 all-1.387
 his-1.364
Token did
Token ID750
Feature activation-5.72e+00

Pos logit contributions
lor+1.682
ides+1.520
arry+1.494
oke+1.480
och+1.473
Neg logit contributions
 the-1.085
 a-0.977
 all-0.850
 his-0.805
 her-0.790
Token said
Token ID531
Feature activation-2.23e+01

Pos logit contributions
 the+0.031
 her+0.022
 his+0.021
 all+0.020
 a+0.016
Neg logit contributions
lor-0.162
'."-0.140
mmmm-0.140
arry-0.139
''-0.138
Token mom
Token ID1995
Feature activation-2.45e+01

Pos logit contributions
lor+1.110
ides+0.997
oke+0.983
och+0.980
ider+0.980
Neg logit contributions
 the-0.464
 a-0.403
 all-0.347
 his-0.292
 her-0.283
Tokenmy
Token ID1820
Feature activation-1.01e+01

Pos logit contributions
lor+1.012
oke+0.919
ides+0.915
jo+0.909
och+0.901
Neg logit contributions
 the-0.672
 a-0.612
 all-0.517
 his-0.506
 her-0.471
Token,
Token ID11
Feature activation-9.57e+00

Pos logit contributions
lor+0.372
ides+0.335
istent+0.333
oke+0.331
jo+0.326
Neg logit contributions
 the-0.317
 a-0.294
 his-0.249
 all-0.246
 her-0.237
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+0.339
och+0.301
oke+0.300
ides+0.296
uddy+0.296
Neg logit contributions
 the-0.325
 a-0.303
 all-0.274
 his-0.256
 her-0.251
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.231
ides+1.013
jo+1.010
umm+1.006
uddy+0.960
Neg logit contributions
 the-2.441
 a-2.259
 all-2.058
 in-2.009
 her-1.979
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.540
umm+1.464
uddy+1.407
ides+1.398
unch+1.343
Neg logit contributions
 the-3.094
 a-2.909
 all-2.514
 her-2.495
 in-2.447
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.873
lor+1.687
oral+1.666
ides+1.662
istent+1.609
Neg logit contributions
 the-1.477
 a-1.352
 all-1.048
 in-1.018
 it-1.003
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.092
umm+0.983
ides+0.908
och+0.883
uddy+0.876
Neg logit contributions
 the-2.041
 a-1.885
 all-1.705
 her-1.676
 his-1.655
Tokenbut
Token ID4360
Feature activation-3.77e+01

Pos logit contributions
lor+1.761
umm+1.618
ides+1.595
arians+1.483
uer+1.483
Neg logit contributions
 the-1.846
 a-1.808
 her-1.444
 his-1.440
 all-1.434
Token don
Token ID836
Feature activation-1.04e+01

Pos logit contributions
lor+1.426
och+1.271
oke+1.262
reetings+1.260
istent+1.259
Neg logit contributions
 the-1.030
 a-0.986
 all-0.853
 his-0.834
 her-0.812
Token smiled
Token ID13541
Feature activation-2.13e+01

Pos logit contributions
lor+0.778
ides+0.758
umm+0.728
ider+0.706
icious+0.699
Neg logit contributions
 the-0.542
 a-0.493
 all-0.404
 her-0.391
 his-0.391
Token and
Token ID290
Feature activation-1.98e+00

Pos logit contributions
lor+0.988
arry+0.901
umm+0.895
jo+0.890
ides+0.890
Neg logit contributions
 the-0.419
 a-0.370
 all-0.299
 his-0.249
 her-0.247
Token said
Token ID531
Feature activation-1.93e+01

Pos logit contributions
lor+0.069
etsy+0.061
och+0.060
jo+0.060
arry+0.059
Neg logit contributions
 the-0.059
 a-0.056
 all-0.050
 more-0.046
 his-0.045
Token,
Token ID11
Feature activation-9.57e+00

Pos logit contributions
lor+0.953
arry+0.838
oke+0.836
ides+0.835
och+0.835
Neg logit contributions
 the-0.392
 a-0.340
 all-0.301
 his-0.250
 her-0.239
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+0.361
ides+0.317
och+0.311
arry+0.310
umm+0.307
Neg logit contributions
 the-0.337
 a-0.313
 all-0.279
 his-0.262
 her-0.260
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.152
jo+0.987
umm+0.949
uddy+0.947
umbs+0.938
Neg logit contributions
 the-2.426
 a-2.307
 all-2.039
 in-2.025
 her-1.949
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.490
uddy+1.417
umm+1.382
ides+1.329
unch+1.285
Neg logit contributions
 the-3.120
 a-2.948
 all-2.544
 her-2.491
 in-2.487
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.762
lor+1.612
ides+1.559
oral+1.546
ider+1.529
Neg logit contributions
 the-1.565
 a-1.440
 all-1.158
 in-1.081
 it-1.077
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.058
umm+0.952
och+0.894
uddy+0.878
ides+0.870
Neg logit contributions
 the-2.039
 a-1.902
 all-1.710
 her-1.679
 his-1.672
TokenThank
Token ID10449
Feature activation-3.36e+01

Pos logit contributions
lor+1.757
umm+1.597
ides+1.596
arians+1.480
uer+1.479
Neg logit contributions
 the-1.829
 a-1.775
 his-1.434
 all-1.429
 her-1.423
Token you
Token ID345
Feature activation-1.02e+00

Pos logit contributions
lor+1.540
ides+1.402
umm+1.381
arry+1.362
istent+1.358
Neg logit contributions
 the-0.985
 a-0.901
 all-0.755
 her-0.696
 his-0.692
Token But
Token ID887
Feature activation-4.46e+01

Pos logit contributions
 the+0.760
 a+0.691
 all+0.608
 her+0.581
 his+0.563
Neg logit contributions
lor-1.095
ides-0.969
och-0.960
arry-0.957
oke-0.952
Token Jim
Token ID5395
Feature activation-1.49e+01

Pos logit contributions
lor+2.024
och+1.829
uddy+1.827
+1.812
jo+1.810
Neg logit contributions
 the-0.836
 a-0.778
 all-0.655
 his-0.575
 out-0.559
Token said
Token ID531
Feature activation-1.93e+01

Pos logit contributions
lor+0.639
ides+0.570
och+0.558
arry+0.557
umm+0.556
Neg logit contributions
 the-0.460
 a-0.424
 all-0.369
 his-0.342
 her-0.339
Token:
Token ID25
Feature activation-3.42e+01

Pos logit contributions
lor+0.898
oke+0.805
ider+0.800
och+0.795
uddy+0.794
Neg logit contributions
 the-0.396
 a-0.347
 all-0.317
 her-0.262
 out-0.250
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+1.412
ides+1.261
umm+1.233
och+1.221
arry+1.216
Neg logit contributions
 the-1.148
 a-1.065
 all-0.916
 his-0.872
 her-0.850
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.527
ides+1.317
jo+1.251
umm+1.241
etsy+1.221
Neg logit contributions
 the-2.152
 a-2.020
 all-1.769
 his-1.722
 in-1.699
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.705
ides+1.485
etsy+1.471
umm+1.423
uddy+1.405
Neg logit contributions
 the-2.959
 a-2.796
 his-2.421
 all-2.361
 in-2.295
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.927
lor+1.756
ides+1.754
oral+1.720
ider+1.698
Neg logit contributions
 the-1.369
 a-1.267
 all-0.940
 his-0.914
 it-0.907
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.013
umm+0.838
och+0.818
ides+0.813
ider+0.773
Neg logit contributions
 the-2.106
 a-1.966
 all-1.786
 his-1.780
 her-1.748
TokenNo
Token ID2949
Feature activation-2.84e+01

Pos logit contributions
lor+1.882
umm+1.735
ides+1.721
och+1.614
arians+1.604
Neg logit contributions
 the-1.720
 a-1.665
 his-1.367
 all-1.301
 her-1.297
Token,
Token ID11
Feature activation-4.43e+00

Pos logit contributions
lor+1.430
ides+1.299
och+1.275
arry+1.274
umm+1.271
Neg logit contributions
 the-0.672
 a-0.602
 all-0.496
 his-0.446
 her-0.439
Token his
Token ID465
Feature activation+1.45e+01

Pos logit contributions
lor+0.110
ides+0.100
arry+0.100
ider+0.098
jo+0.098
Neg logit contributions
 the-0.067
 a-0.061
 all-0.049
 her-0.049
 his-0.048
Token mom
Token ID1995
Feature activation-2.45e+01

Pos logit contributions
 the+0.429
 a+0.385
 all+0.335
 her+0.311
 his+0.305
Neg logit contributions
lor-0.607
ides-0.542
ider-0.534
uddy-0.533
och-0.528
Token said
Token ID531
Feature activation-1.93e+01

Pos logit contributions
lor+1.092
ides+0.985
och+0.967
oke+0.958
arry+0.956
Neg logit contributions
 the-0.678
 a-0.621
 all-0.530
 his-0.494
 her-0.483
Token,
Token ID11
Feature activation-9.57e+00

Pos logit contributions
lor+0.907
ides+0.819
oke+0.818
och+0.813
istent+0.813
Neg logit contributions
 the-0.411
 a-0.363
 all-0.316
 his-0.271
 her-0.263
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+0.359
ides+0.316
och+0.316
oke+0.313
istent+0.313
Neg logit contributions
 the-0.310
 a-0.288
 all-0.261
 his-0.239
 her-0.238
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.175
jo+0.964
umm+0.947
ides+0.944
reetings+0.915
Neg logit contributions
 the-2.443
 a-2.298
 all-2.046
 in-2.013
 his-1.973
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.537
umm+1.418
uddy+1.402
ides+1.356
etsy+1.354
Neg logit contributions
 the-3.046
 a-2.893
 his-2.466
 all-2.429
 in-2.389
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.762
lor+1.591
oral+1.551
ides+1.550
och+1.502
Neg logit contributions
 the-1.561
 a-1.446
 all-1.147
 it-1.088
 in-1.086
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.069
umm+0.963
och+0.885
ides+0.873
uddy+0.862
Neg logit contributions
 the-2.037
 a-1.893
 all-1.701
 his-1.680
 her-1.664
TokenTim
Token ID14967
Feature activation-2.38e+01

Pos logit contributions
lor+1.705
umm+1.532
ides+1.525
arians+1.419
och+1.416
Neg logit contributions
 the-1.890
 a-1.852
 his-1.513
 all-1.485
 her-1.471
Tokenmy
Token ID1820
Feature activation-3.54e+00

Pos logit contributions
lor+1.068
ides+0.959
arry+0.942
och+0.939
umm+0.931
Neg logit contributions
 the-0.669
 a-0.613
 all-0.529
 his-0.484
 her-0.480
Token was
Token ID373
Feature activation-4.74e+00

Pos logit contributions
lor+0.168
och+0.149
pillar+0.147
arry+0.146
pill+0.145
Neg logit contributions
 the-0.030
 a-0.016
 all-0.015
 his-0.013
 out-0.013
Token sitting
Token ID5586
Feature activation-2.46e+01

Pos logit contributions
lor+0.220
ider+0.208
istent+0.202
jo+0.198
+0.197
Neg logit contributions
 the-0.076
 a-0.059
 all-0.049
 his-0.045
 her-0.045
Token nearby
Token ID6716
Feature activation+9.42e-01

Pos logit contributions
lor+1.291
ider+1.229
och+1.173
+1.160
oling+1.146
Neg logit contributions
 the-0.346
 a-0.259
 all-0.175
 her-0.164
 their-0.158
Token.
Token ID13
Feature activation-2.61e+01

Pos logit contributions
 the+0.019
 a+0.017
 all+0.014
 his+0.012
 her+0.012
Neg logit contributions
lor-0.046
ides-0.042
istent-0.041
ider-0.041
och-0.041
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+1.063
ides+0.969
umm+0.937
uer+0.892
ider+0.888
Neg logit contributions
 the-0.998
 a-0.941
 his-0.792
 her-0.778
 all-0.777
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.259
umm+1.037
ides+1.022
jo+0.982
uddy+0.977
Neg logit contributions
 the-2.434
 a-2.275
 all-2.032
 in-1.982
 her-1.962
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.528
umm+1.367
uddy+1.336
ides+1.325
unch+1.292
Neg logit contributions
 the-3.184
 a-2.982
 her-2.598
 all-2.591
 his-2.580
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.740
lor+1.649
oral+1.563
ides+1.544
ider+1.523
Neg logit contributions
 the-1.569
 a-1.454
 all-1.187
 in-1.092
 her-1.089
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.055
umm+0.908
och+0.873
ides+0.855
uddy+0.836
Neg logit contributions
 the-2.063
 a-1.921
 all-1.750
 her-1.721
 his-1.716
TokenIt
Token ID1026
Feature activation-2.10e+01

Pos logit contributions
lor+1.786
umm+1.606
ides+1.597
arians+1.492
och+1.490
Neg logit contributions
 the-1.834
 a-1.778
 his-1.452
 all-1.441
 her-1.433
Token was
Token ID373
Feature activation+4.89e+00

Pos logit contributions
lor+0.837
ides+0.730
arry+0.718
istent+0.713
oke+0.713
Neg logit contributions
 the-0.681
 a-0.627
 all-0.553
 his-0.521
 her-0.520
Token smiled
Token ID13541
Feature activation-2.13e+01

Pos logit contributions
lor+0.530
ides+0.487
istent+0.477
arry+0.476
uddy+0.462
Neg logit contributions
 the-0.314
 a-0.283
 all-0.237
 his-0.236
 her-0.231
Token and
Token ID290
Feature activation-1.98e+00

Pos logit contributions
lor+0.988
arry+0.901
umm+0.895
jo+0.890
ides+0.890
Neg logit contributions
 the-0.419
 a-0.370
 all-0.299
 his-0.249
 her-0.247
Token said
Token ID531
Feature activation-1.93e+01

Pos logit contributions
lor+0.069
etsy+0.061
och+0.060
jo+0.060
arry+0.059
Neg logit contributions
 the-0.059
 a-0.056
 all-0.050
 more-0.046
 his-0.045
Token,
Token ID11
Feature activation-9.57e+00

Pos logit contributions
lor+0.953
arry+0.838
oke+0.836
ides+0.835
och+0.835
Neg logit contributions
 the-0.392
 a-0.340
 all-0.301
 his-0.250
 her-0.239
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+0.361
ides+0.317
och+0.311
arry+0.310
umm+0.307
Neg logit contributions
 the-0.337
 a-0.313
 all-0.279
 his-0.262
 her-0.260
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.152
jo+0.987
umm+0.949
uddy+0.947
umbs+0.938
Neg logit contributions
 the-2.426
 a-2.307
 all-2.039
 in-2.025
 her-1.949
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.490
uddy+1.417
umm+1.382
ides+1.329
unch+1.285
Neg logit contributions
 the-3.120
 a-2.948
 all-2.544
 her-2.491
 in-2.487
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.762
lor+1.612
ides+1.559
oral+1.546
ider+1.529
Neg logit contributions
 the-1.565
 a-1.440
 all-1.158
 in-1.081
 it-1.077
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.058
umm+0.952
och+0.894
uddy+0.878
ides+0.870
Neg logit contributions
 the-2.039
 a-1.902
 all-1.710
 her-1.679
 his-1.672
TokenI
Token ID40
Feature activation-3.23e+01

Pos logit contributions
lor+1.757
umm+1.597
ides+1.596
arians+1.480
uer+1.479
Neg logit contributions
 the-1.829
 a-1.775
 his-1.434
 all-1.429
 her-1.423
Token knew
Token ID2993
Feature activation-1.84e+01

Pos logit contributions
lor+1.135
umm+1.000
ides+0.976
och+0.957
ider+0.938
Neg logit contributions
 the-1.283
 a-1.223
 all-1.078
 his-1.023
 her-1.013
Token can
Token ID460
Feature activation-1.86e+01

Pos logit contributions
lor+0.541
ides+0.485
+0.477
istent+0.473
arry+0.470
Neg logit contributions
 the-0.331
 a-0.302
 all-0.250
 his-0.239
 her-0.232
Token keep
Token ID1394
Feature activation-6.40e+00

Pos logit contributions
lor+0.771
ides+0.698
+0.683
ider+0.682
istent+0.680
Neg logit contributions
 the-0.515
 a-0.466
 all-0.403
 his-0.385
 her-0.368
Token it
Token ID340
Feature activation-4.71e+00

Pos logit contributions
lor+0.277
och+0.257
+0.249
ides+0.246
arians+0.245
Neg logit contributions
 the-0.112
 a-0.100
 all-0.083
 big-0.073
 in-0.068
Token forever
Token ID8097
Feature activation-2.95e+01

Pos logit contributions
lor+0.201
ider+0.184
ides+0.182
uer+0.179
och+0.175
Neg logit contributions
 the-0.113
 a-0.096
 his-0.080
 her-0.078
 all-0.076
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+1.087
ides+0.963
reetings+0.923
ider+0.922
istent+0.920
Neg logit contributions
 the-0.959
 a-0.889
 all-0.794
 his-0.767
 her-0.755
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.243
jo+1.105
uddy+1.080
oke+1.058
umm+1.049
Neg logit contributions
 the-2.339
 a-2.259
 all-2.032
 in-1.917
 her-1.888
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.412
uddy+1.405
umm+1.327
ides+1.292
oke+1.263
Neg logit contributions
 the-3.104
 a-3.018
 all-2.606
 her-2.522
 in-2.490
Token
Token ID447
Feature activation-2.06e+01

Pos logit contributions
umm+1.671
lor+1.504
ides+1.489
och+1.453
oral+1.435
Neg logit contributions
 the-1.619
 a-1.513
 all-1.240
 in-1.168
 it-1.163
Token
Token ID250
Feature activation-4.75e+01

Pos logit contributions
jo+0.516
ides+0.510
lor+0.508
anne+0.477
uddy+0.472
Neg logit contributions
 the-0.965
 a-0.930
 all-0.808
 very-0.763
 in-0.758
Token things
Token ID1243
Feature activation-1.18e+00

Pos logit contributions
lor+1.504
ides+1.317
uddy+1.273
umm+1.256
oke+1.253
Neg logit contributions
 the-2.064
 a-1.997
 all-1.700
 her-1.669
 in-1.635
Token are
Token ID389
Feature activation+1.44e+00

Pos logit contributions
lor+0.042
ides+0.039
och+0.038
istent+0.038
oke+0.037
Neg logit contributions
 the-0.037
 a-0.035
 his-0.031
 all-0.029
 her-0.029
Token and
Token ID290
Feature activation-3.23e+00

Pos logit contributions
lor+0.537
istent+0.525
reetings+0.509
oke+0.508
oling+0.500
Neg logit contributions
 the-0.243
 a-0.221
 all-0.201
 her-0.180
 some-0.180
Token tried
Token ID3088
Feature activation-1.37e+01

Pos logit contributions
lor+0.134
jo+0.124
och+0.122
ides+0.121
oke+0.120
Neg logit contributions
 the-0.072
 a-0.066
 all-0.057
 out-0.049
 his-0.049
Token to
Token ID284
Feature activation-2.01e+00

Pos logit contributions
lor+0.600
uddy+0.567
+0.566
etsy+0.560
och+0.552
Neg logit contributions
 the-0.203
 a-0.176
 all-0.140
 in-0.109
 big-0.104
Token say
Token ID910
Feature activation-1.94e+01

Pos logit contributions
lor+0.076
ides+0.072
+0.070
istent+0.069
uddy+0.068
Neg logit contributions
 the-0.054
 a-0.050
 all-0.043
 in-0.040
 on-0.039
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+0.865
oke+0.786
uddy+0.776
och+0.772
ides+0.766
Neg logit contributions
 the-0.341
 a-0.289
 all-0.253
 in-0.221
 up-0.215
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.370
umm+1.148
jo+1.132
ides+1.110
oke+1.092
Neg logit contributions
 the-2.241
 a-2.127
 all-1.929
 in-1.818
 his-1.806
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.546
umm+1.354
uddy+1.294
ides+1.285
etsy+1.229
Neg logit contributions
 the-3.107
 a-2.959
 all-2.637
 her-2.539
 his-2.538
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.794
lor+1.638
ides+1.572
oral+1.568
och+1.523
Neg logit contributions
 the-1.530
 a-1.431
 all-1.143
 in-1.081
 it-1.078
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.038
umm+0.906
och+0.851
ides+0.843
uddy+0.811
Neg logit contributions
 the-2.076
 a-1.940
 all-1.766
 his-1.734
 her-1.726
TokenI
Token ID40
Feature activation-3.23e+01

Pos logit contributions
lor+1.738
umm+1.582
ides+1.552
och+1.451
uer+1.446
Neg logit contributions
 the-1.851
 a-1.755
 all-1.463
 his-1.447
 her-1.430
Token'm
Token ID1101
Feature activation-2.05e+01

Pos logit contributions
lor+1.149
umm+1.002
ides+0.992
och+0.974
arry+0.958
Neg logit contributions
 the-1.262
 a-1.191
 all-1.055
 his-1.004
 her-0.992
Token the
Token ID262
Feature activation+8.18e+00

Pos logit contributions
 the+0.098
 a+0.086
 in+0.061
 up+0.060
 all+0.059
Neg logit contributions
lor-0.419
-0.398
och-0.392
arry-0.388
ides-0.388
Token boys
Token ID6510
Feature activation-2.40e+01

Pos logit contributions
 the+0.259
 a+0.238
 all+0.210
 his+0.199
 her+0.198
Neg logit contributions
lor-0.328
ides-0.288
umm-0.287
och-0.286
ider-0.283
Token said
Token ID531
Feature activation-1.93e+01

Pos logit contributions
lor+1.086
ides+0.985
och+0.966
oke+0.962
arry+0.960
Neg logit contributions
 the-0.636
 a-0.582
 all-0.486
 his-0.456
 her-0.453
Token,
Token ID11
Feature activation-9.57e+00

Pos logit contributions
lor+0.884
oke+0.801
ides+0.801
istent+0.794
ider+0.788
Neg logit contributions
 the-0.429
 a-0.382
 all-0.331
 her-0.290
 his-0.288
Token
Token ID6184
Feature activation-4.87e+01

Pos logit contributions
lor+0.349
ides+0.307
istent+0.306
oke+0.306
arry+0.304
Neg logit contributions
 the-0.318
 a-0.296
 all-0.267
 her-0.248
 his-0.247
Token
Token ID95
Feature activation-6.24e+01

Pos logit contributions
lor+1.363
jo+1.065
ides+1.050
umm+1.028
och+1.013
Neg logit contributions
 the-2.351
 a-2.228
 all-2.009
 in-1.918
 his-1.902
Token
Token ID26391
Feature activation-4.29e+01

Pos logit contributions
lor+1.689
etsy+1.370
uddy+1.350
ides+1.346
umm+1.338
Neg logit contributions
 the-3.062
 a-2.923
 all-2.548
 his-2.513
 in-2.430
Token
Token ID129
Feature activation-4.12e+01

Pos logit contributions
umm+1.651
lor+1.580
ides+1.472
och+1.456
oral+1.449
Neg logit contributions
 the-1.623
 a-1.519
 all-1.236
 in-1.153
 it-1.127
Token
Token ID241
Feature activation-4.61e+01

Pos logit contributions
lor+1.058
umm+0.904
och+0.870
ides+0.840
uddy+0.820
Neg logit contributions
 the-2.064
 a-1.930
 all-1.746
 his-1.722
 her-1.692
TokenWe
Token ID1135
Feature activation-3.84e+01

Pos logit contributions
lor+1.696
umm+1.515
ides+1.500
och+1.413
uer+1.390
Neg logit contributions
 the-1.908
 a-1.864
 his-1.530
 all-1.514
 her-1.484
Token are
Token ID389
Feature activation+1.44e+00

Pos logit contributions
lor+1.518
ides+1.357
arry+1.321
och+1.317
istent+1.307
Neg logit contributions
 the-1.274
 a-1.176
 all-1.030
 his-0.979
 her-0.970
Tokenummy
Token ID13513
Feature activation-1.45e+01

Pos logit contributions
 the+0.448
 a+0.426
 all+0.396
 his+0.380
 her+0.378
Neg logit contributions
lor-0.189
ides-0.146
och-0.141
arry-0.139
ider-0.136
Token treats
Token ID18432
Feature activation+4.70e+00

Pos logit contributions
lor+0.532
umm+0.512
ides+0.501
oke+0.494
umbs+0.494
Neg logit contributions
 the-0.372
 a-0.322
 all-0.298
 her-0.286
 his-0.264
Token.
Token ID13
Feature activation-1.49e+01

Pos logit contributions
 the+0.121
 a+0.110
 all+0.089
 his+0.088
 her+0.084
Neg logit contributions
lor-0.206
och-0.189
ides-0.186
ider-0.186
umm-0.183
Token Lily
Token ID20037
Feature activation+7.09e+00

Pos logit contributions
lor+0.730
ides+0.689
reetings+0.670
ider+0.659
uer+0.645
Neg logit contributions
 the-0.500
 a-0.452
 her-0.369
 all-0.361
 their-0.351
Token forgot
Token ID16453
Feature activation-1.71e+01

Pos logit contributions
 the+0.205
 a+0.192
 all+0.165
 her+0.155
 his+0.154
Neg logit contributions
lor-0.292
oke-0.258
ides-0.256
jo-0.254
istent-0.253
Token all
Token ID477
Feature activation+1.94e+01

Pos logit contributions
lor+0.490
oke+0.462
aring+0.433
uts+0.431
unch+0.428
Neg logit contributions
 a-0.420
 the-0.406
 big-0.337
 out-0.327
 in-0.320
Token about
Token ID546
Feature activation-2.01e+00

Pos logit contributions
 a+0.559
 the+0.556
 all+0.459
 out+0.457
 big+0.427
Neg logit contributions
oke-0.657
lor-0.640
arry-0.601
jo-0.600
oral-0.598
Token her
Token ID607
Feature activation+2.20e+01

Pos logit contributions
+0.100
lor+0.098
umm+0.093
Really+0.092
mmm+0.091
Neg logit contributions
 a-0.017
 out-0.016
 the-0.015
 in-0.012
 on-0.009
Token leg
Token ID1232
Feature activation+1.01e+01

Pos logit contributions
 the+0.571
 a+0.524
 all+0.460
 out+0.451
 her+0.438
Neg logit contributions
-0.879
lor-0.874
umm-0.836
ides-0.827
ider-0.825
Token and
Token ID290
Feature activation+1.33e+01

Pos logit contributions
 the+0.238
 a+0.217
 all+0.175
 his+0.163
 her+0.161
Neg logit contributions
lor-0.454
ides-0.413
-0.412
och-0.412
jo-0.404
Token was
Token ID373
Feature activation+1.19e+01

Pos logit contributions
 a+0.343
 the+0.333
 out+0.313
 big+0.290
 all+0.281
Neg logit contributions
lor-0.433
grand-0.393
anas-0.391
ushed-0.388
jo-0.386
Token garden
Token ID11376
Feature activation-2.53e-01

Pos logit contributions
 the+0.891
 a+0.812
 all+0.742
 his+0.711
 her+0.678
Neg logit contributions
lor-0.841
umm-0.758
ider-0.730
och-0.717
-0.712
Token.
Token ID13
Feature activation-1.37e+01

Pos logit contributions
lor+0.011
ider+0.010
ides+0.010
arry+0.010
oke+0.010
Neg logit contributions
 the-0.006
 a-0.005
 all-0.005
 his-0.004
 her-0.004
Token They
Token ID1119
Feature activation-1.40e+01

Pos logit contributions
lor+0.563
ides+0.488
uer+0.473
och+0.464
istent+0.459
Neg logit contributions
 the-0.535
 a-0.509
 all-0.434
 his-0.404
 some-0.399
Token dance
Token ID9280
Feature activation+5.81e+00

Pos logit contributions
lor+0.469
ides+0.465
jo+0.455
och+0.452
istent+0.451
Neg logit contributions
 the-0.392
 a-0.377
 his-0.341
 all-0.300
 more-0.299
Token to
Token ID284
Feature activation+1.22e+01

Pos logit contributions
 the+0.113
 a+0.091
 his+0.090
 her+0.073
 all+0.072
Neg logit contributions
lor-0.251
ider-0.235
jo-0.234
ides-0.231
uddy-0.229
Token music
Token ID2647
Feature activation+1.23e+01

Pos logit contributions
 the+0.087
 a+0.082
 on+0.081
 in+0.067
 out+0.065
Neg logit contributions
anas-0.530
jo-0.526
lor-0.523
ides-0.513
mmm-0.503
Token,
Token ID11
Feature activation+3.24e+00

Pos logit contributions
 the+0.255
 a+0.218
 his+0.184
 all+0.180
 her+0.167
Neg logit contributions
lor-0.566
ides-0.522
jo-0.516
ider-0.513
anas-0.500
Token to
Token ID284
Feature activation+1.24e+01

Pos logit contributions
 the+0.079
 a+0.073
 all+0.065
 his+0.064
 her+0.057
Neg logit contributions
lor-0.107
ider-0.107
ides-0.105
oke-0.104
jo-0.104
Token songs
Token ID7259
Feature activation-9.44e+00

Pos logit contributions
 the+0.238
 a+0.230
 on+0.201
 out+0.198
 up+0.193
Neg logit contributions
lor-0.419
jo-0.412
ides-0.406
arians-0.396
icious-0.394
Token,
Token ID11
Feature activation+3.42e+00

Pos logit contributions
lor+0.452
ides+0.418
uddy+0.405
oke+0.405
ider+0.403
Neg logit contributions
 the-0.192
 a-0.171
 his-0.133
 all-0.131
 her-0.121
Token and
Token ID290
Feature activation+1.43e+01

Pos logit contributions
 the+0.077
 a+0.068
 his+0.061
 all+0.059
 up+0.053
Neg logit contributions
lor-0.121
ides-0.120
ider-0.116
anas-0.115
ives-0.115
Token tries
Token ID8404
Feature activation-5.27e+00

Pos logit contributions
lor+0.255
ides+0.231
ider+0.229
oke+0.226
istent+0.222
Neg logit contributions
 the-0.284
 a-0.263
 all-0.234
 her-0.227
 his-0.224
Token to
Token ID284
Feature activation+1.05e+01

Pos logit contributions
lor+0.229
uddy+0.218
arry+0.210
oling+0.209
enny+0.208
Neg logit contributions
 the-0.085
 a-0.076
 all-0.068
 in-0.052
 her-0.052
Token make
Token ID787
Feature activation+1.15e+01

Pos logit contributions
 the+0.312
 a+0.288
 all+0.256
 his+0.239
 her+0.234
Neg logit contributions
lor-0.352
ides-0.347
istent-0.339
-0.336
arry-0.327
Token the
Token ID262
Feature activation+2.36e+01

Pos logit contributions
 the+0.133
 in+0.107
 a+0.103
 all+0.100
 on+0.086
Neg logit contributions
lor-0.536
umm-0.494
ides-0.483
unch-0.481
mmm-0.481
Token bird
Token ID6512
Feature activation-7.09e+00

Pos logit contributions
 the+0.752
 a+0.678
 all+0.621
 his+0.589
 her+0.587
Neg logit contributions
lor-0.908
ider-0.826
umm-0.822
-0.820
ides-0.810
Token look
Token ID804
Feature activation+1.64e+01

Pos logit contributions
lor+0.254
oing+0.251
aring+0.249
uer+0.247
oc+0.246
Neg logit contributions
 the-0.162
 a-0.145
 all-0.132
 his-0.115
 an-0.112
Token nice
Token ID3621
Feature activation+2.33e+01

Pos logit contributions
 the+0.303
 a+0.269
 all+0.241
 her+0.215
 his+0.182
Neg logit contributions
oke-0.581
oling-0.575
istent-0.569
aring-0.564
och-0.556
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
 the+0.579
 a+0.517
 all+0.443
 his+0.417
 her+0.410
Neg logit contributions
lor-0.961
umm-0.919
oke-0.900
ides-0.900
ider-0.893
Token but
Token ID475
Feature activation-9.79e+00

Pos logit contributions
 the+0.034
 a+0.033
 all+0.031
 her+0.028
 out+0.028
Neg logit contributions
lor-0.030
istent-0.029
-0.029
ider-0.028
oling-0.028
Token it
Token ID340
Feature activation+7.27e+00

Pos logit contributions
lor+0.374
och+0.359
oke+0.357
ider+0.354
Really+0.351
Neg logit contributions
 a-0.176
 the-0.164
 all-0.148
 out-0.145
 big-0.128
Token looks
Token ID3073
Feature activation+5.59e+00

Pos logit contributions
 the+0.272
 a+0.254
 all+0.222
 her+0.221
 his+0.220
Neg logit contributions
lor-0.241
ides-0.212
ider-0.208
oke-0.207
uddy-0.201
Token
Token ID198
Feature activation-1.94e+01

Pos logit contributions
lor+1.123
ides+1.032
ider+1.007
reetings+0.995
umm+0.991
Neg logit contributions
 the-1.036
 a-0.940
 his-0.807
 all-0.805
 her-0.750
Token
Token ID198
Feature activation-1.94e+01

Pos logit contributions
umm+0.936
ider+0.891
lor+0.891
ides+0.880
istent+0.854
Neg logit contributions
 the-0.638
 a-0.570
 his-0.465
 all-0.454
 big-0.416
TokenThey
Token ID2990
Feature activation-2.02e+01

Pos logit contributions
lor+0.732
ides+0.687
umm+0.683
ider+0.656
istent+0.636
Neg logit contributions
 the-0.758
 a-0.698
 all-0.615
 his-0.607
 her-0.566
Token got
Token ID1392
Feature activation+1.05e+01

Pos logit contributions
lor+0.667
oped+0.617
ides+0.614
ari+0.606
arry+0.605
Neg logit contributions
 the-0.625
 a-0.574
 his-0.496
 her-0.485
 more-0.483
Token into
Token ID656
Feature activation+2.23e+01

Pos logit contributions
 the+0.154
 a+0.117
 her+0.096
 all+0.095
 his+0.084
Neg logit contributions
obby-0.410
 Amen-0.399
och-0.398
amb-0.397
rolley-0.397
Token the
Token ID262
Feature activation+2.65e+01

Pos logit contributions
 the+0.031
 a+0.017
 up+0.015
 in+0.013
 out+0.012
Neg logit contributions
lor-1.173
-1.148
obby-1.141
jo-1.133
oc-1.115
Token helicopter
Token ID15392
Feature activation-1.14e+01

Pos logit contributions
 the+0.883
 a+0.823
 all+0.743
 her+0.711
 his+0.699
Neg logit contributions
lor-0.984
och-0.826
umm-0.825
-0.800
ari-0.795
Token,
Token ID11
Feature activation+4.34e+00

Pos logit contributions
lor+0.556
ides+0.508
arry+0.503
jo+0.502
istent+0.499
Neg logit contributions
 the-0.241
 a-0.213
 all-0.166
 his-0.166
 her-0.164
Token and
Token ID290
Feature activation+1.51e+01

Pos logit contributions
 the+0.085
 a+0.079
 her+0.068
 all+0.067
 more+0.067
Neg logit contributions
lor-0.154
istent-0.152
och-0.150
anas-0.149
aron-0.149
Token the
Token ID262
Feature activation+2.65e+01

Pos logit contributions
 a+0.194
 the+0.185
 her+0.148
 big+0.137
 all+0.135
Neg logit contributions
lor-0.663
jo-0.637
uddy-0.631
oc-0.629
och-0.625
Token boy
Token ID2933
Feature activation-1.23e+01

Pos logit contributions
 the+0.716
 a+0.638
 her+0.557
 all+0.549
 his+0.537
Neg logit contributions
lor-1.148
ides-1.004
umm-1.003
ider-1.001
och-0.992
Token that
Token ID326
Feature activation+8.80e+00

Pos logit contributions
lor+0.714
och+0.633
mmm+0.612
oral+0.612
unch+0.608
Neg logit contributions
 the-0.307
 a-0.301
 up-0.290
 out-0.280
 big-0.236
Token the
Token ID262
Feature activation+2.66e+01

Pos logit contributions
 the+0.129
 a+0.127
 up+0.107
 out+0.104
 all+0.104
Neg logit contributions
lor-0.370
jo-0.348
grand-0.347
oc-0.340
och-0.340
Token monster
Token ID9234
Feature activation+2.17e+00

Pos logit contributions
 the+0.983
 a+0.892
 her+0.815
 all+0.800
 his+0.784
Neg logit contributions
lor-0.922
ider-0.792
umm-0.784
-0.753
ides-0.750
Token wasn
Token ID2492
Feature activation-1.08e+01

Pos logit contributions
 the+0.068
 a+0.063
 all+0.055
 her+0.052
 his+0.051
Neg logit contributions
lor-0.084
jo-0.075
uddy-0.074
ides-0.074
oke-0.074
Token't
Token ID470
Feature activation-7.93e+00

Pos logit contributions
lor+0.304
+0.249
ides+0.249
istent+0.248
och+0.246
Neg logit contributions
 the-0.463
 a-0.434
 all-0.397
 his-0.387
 her-0.384
Token real
Token ID1103
Feature activation+2.65e+01

Pos logit contributions
reetings+0.271
oke+0.271
umpy+0.271
istent+0.270
lor+0.265
Neg logit contributions
 the-0.161
 a-0.142
 some-0.130
 all-0.125
 her-0.114
Token,
Token ID11
Feature activation+4.37e+00

Pos logit contributions
 the+0.726
 a+0.664
 all+0.574
 her+0.540
 his+0.510
Neg logit contributions
lor-1.094
ides-1.007
oke-0.997
umm-0.979
jo-0.978
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
 the+0.106
 a+0.102
 all+0.086
 out+0.083
 up+0.082
Neg logit contributions
lor-0.168
istent-0.158
och-0.155
jo-0.152
oke-0.151
Token he
Token ID339
Feature activation+8.49e+00

Pos logit contributions
 the+0.365
 a+0.360
 all+0.305
 out+0.304
 big+0.282
Neg logit contributions
lor-0.579
grand-0.506
jo-0.502
oke-0.494
oc-0.489
Token enjoyed
Token ID8359
Feature activation+6.05e+00

Pos logit contributions
 the+0.287
 a+0.265
 all+0.232
 his+0.218
 her+0.218
Neg logit contributions
lor-0.333
ides-0.290
arry-0.285
och-0.282
istent-0.280
Token the
Token ID262
Feature activation+2.67e+01

Pos logit contributions
 the+0.076
 a+0.070
 out+0.060
 all+0.051
 up+0.049
Neg logit contributions
lor-0.301
-0.280
oling-0.262
ides-0.262
istent-0.261
Token
Token ID198
Feature activation-1.91e+01

Pos logit contributions
umm+0.879
lor+0.874
ides+0.860
istent+0.846
ider+0.814
Neg logit contributions
 the-0.622
 a-0.565
 all-0.458
 her-0.446
 his-0.430
TokenL
Token ID43
Feature activation-1.82e+01

Pos logit contributions
lor+0.623
ides+0.593
umm+0.586
istent+0.570
och+0.546
Neg logit contributions
 the-0.827
 a-0.786
 all-0.691
 her-0.681
 his-0.657
Tokenily
Token ID813
Feature activation-1.14e+01

Pos logit contributions
lor+0.675
ides+0.597
umm+0.593
istent+0.593
och+0.586
Neg logit contributions
 the-0.675
 a-0.639
 all-0.568
 her-0.547
 in-0.524
Token said
Token ID531
Feature activation-3.91e+00

Pos logit contributions
lor+0.479
ides+0.422
arry+0.420
uddy+0.415
istent+0.411
Neg logit contributions
 the-0.345
 a-0.315
 all-0.272
 his-0.262
 her-0.255
Token,
Token ID11
Feature activation+4.40e+00

Pos logit contributions
lor+0.166
etsy+0.155
uddy+0.155
ider+0.153
aring+0.149
Neg logit contributions
 the-0.089
 a-0.079
 all-0.069
 his-0.064
 big-0.056
Token "
Token ID366
Feature activation-1.15e+01

Pos logit contributions
 the+0.154
 a+0.143
 all+0.127
 his+0.121
 her+0.119
Neg logit contributions
lor-0.160
ides-0.139
och-0.138
ider-0.137
arry-0.137
TokenDon
Token ID3987
Feature activation-2.84e+01

Pos logit contributions
lor+0.393
ides+0.381
oke+0.351
istent+0.334
och+0.333
Neg logit contributions
 the-0.487
 a-0.453
 her-0.403
 all-0.391
 in-0.376
Token't
Token ID470
Feature activation-7.91e+00

Pos logit contributions
lor+1.034
ides+0.926
och+0.905
umm+0.896
arry+0.878
Neg logit contributions
 the-1.099
 a-1.023
 all-0.914
 her-0.851
 his-0.840
Token worry
Token ID5490
Feature activation-3.14e+00

Pos logit contributions
lor+0.298
ides+0.269
arry+0.267
istent+0.261
uddy+0.261
Neg logit contributions
 the-0.267
 a-0.250
 all-0.221
 his-0.208
 her-0.202
Token,
Token ID11
Feature activation+4.28e+00

Pos logit contributions
lor+0.169
ides+0.155
arry+0.154
uddy+0.153
umm+0.152
Neg logit contributions
 the-0.059
 a-0.052
 all-0.041
 his-0.036
 her-0.034
Token I
Token ID314
Feature activation-7.38e+00

Pos logit contributions
 the+0.150
 a+0.142
 all+0.128
 his+0.126
 her+0.121
Neg logit contributions
lor-0.146
uddy-0.129
arry-0.126
umm-0.124
oke-0.123
Token sun
Token ID4252
Feature activation-9.69e-01

Pos logit contributions
 the+0.703
 a+0.641
 all+0.537
 her+0.518
 his+0.515
Neg logit contributions
lor-1.176
ides-1.065
arry-1.063
umm-1.055
och-1.049
Token shone
Token ID44193
Feature activation-5.18e+00

Pos logit contributions
lor+0.038
ides+0.034
jo+0.033
och+0.033
+0.033
Neg logit contributions
 the-0.028
 a-0.027
 all-0.023
 his-0.023
 her-0.022
Token and
Token ID290
Feature activation+1.48e+01

Pos logit contributions
umm+0.206
lor+0.205
plets+0.197
arry+0.194
ides+0.193
Neg logit contributions
 the-0.088
 a-0.078
 his-0.065
 her-0.064
 all-0.057
Token birds
Token ID10087
Feature activation+1.87e+00

Pos logit contributions
 a+0.106
 the+0.106
 his+0.084
 her+0.064
 all+0.061
Neg logit contributions
lor-0.696
Really-0.675
umm-0.675
plets-0.662
 lifetime-0.658
Token ch
Token ID442
Feature activation+1.38e+01

Pos logit contributions
 the+0.045
 a+0.041
 his+0.034
 her+0.032
 all+0.031
Neg logit contributions
lor-0.080
ides-0.075
oling-0.074
istent-0.074
jo-0.073
Tokenir
Token ID343
Feature activation+2.87e+00

Pos logit contributions
 the+0.695
 a+0.664
 all+0.609
 his+0.590
 her+0.586
Neg logit contributions
lor-0.312
ides-0.266
arry-0.252
och-0.244
umm-0.242
Tokenped
Token ID9124
Feature activation+4.70e+00

Pos logit contributions
 the+0.122
 a+0.115
 all+0.104
 his+0.100
 her+0.100
Neg logit contributions
lor-0.085
ides-0.076
arry-0.071
istent-0.071
umm-0.070
Token.
Token ID13
Feature activation-1.28e+01

Pos logit contributions
 the+0.092
 a+0.074
 his+0.062
 her+0.061
 all+0.059
Neg logit contributions
lor-0.203
ides-0.192
uer-0.187
istent-0.187
ider-0.186
Token
Token ID198
Feature activation-1.98e+01

Pos logit contributions
lor+0.564
ides+0.505
ider+0.504
och+0.502
uer+0.494
Neg logit contributions
 the-0.468
 a-0.442
 all-0.370
 his-0.340
 her-0.329
Token
Token ID198
Feature activation-1.98e+01

Pos logit contributions
umm+0.907
lor+0.897
ider+0.855
ides+0.844
och+0.825
Neg logit contributions
 the-0.664
 a-0.614
 all-0.509
 his-0.482
 in-0.458
TokenThe
Token ID464
Feature activation-7.72e+00

Pos logit contributions
lor+0.644
umm+0.579
ides+0.563
och+0.561
oke+0.543
Neg logit contributions
 the-0.883
 a-0.851
 all-0.748
 his-0.710
 in-0.694
Token too
Token ID1165
Feature activation+9.96e+00

Pos logit contributions
 the+0.053
 a+0.042
 all+0.042
 his+0.034
 shiny+0.021
Neg logit contributions
lor-0.502
-0.476
ides-0.471
ables-0.457
umm-0.456
Token much
Token ID881
Feature activation+3.53e+00

Pos logit contributions
 the+0.227
 a+0.203
 his+0.167
 all+0.167
 her+0.161
Neg logit contributions
arry-0.371
lor-0.369
och-0.362
uddy-0.356
etsy-0.355
Token and
Token ID290
Feature activation+1.28e+01

Pos logit contributions
 the+0.087
 a+0.077
 all+0.074
 his+0.067
 her+0.061
Neg logit contributions
ides-0.137
lor-0.134
och-0.134
oke-0.131
arry-0.130
Token they
Token ID484
Feature activation+2.14e-01

Pos logit contributions
 the+0.341
 a+0.330
 all+0.273
 big+0.268
 his+0.266
Neg logit contributions
lor-0.466
etsy-0.430
-0.423
uddy-0.422
ides-0.411
Token only
Token ID691
Feature activation+1.33e+01

Pos logit contributions
 the+0.007
 a+0.006
 his+0.005
 all+0.005
 her+0.005
Neg logit contributions
lor-0.009
ides-0.008
etsy-0.008
och-0.008
arry-0.008
Token gathered
Token ID9272
Feature activation+2.08e+00

Pos logit contributions
 the+0.429
 a+0.400
 all+0.345
 his+0.341
 her+0.327
Neg logit contributions
lor-0.475
ides-0.452
istent-0.448
oke-0.445
och-0.444
Token enough
Token ID1576
Feature activation-4.84e+00

Pos logit contributions
 the+0.022
 a+0.020
 his+0.015
 my+0.014
 your+0.012
Neg logit contributions
umm-0.084
-0.082
ement-0.080
ushed-0.079
obby-0.079
Token to
Token ID284
Feature activation+1.07e+01

Pos logit contributions
lor+0.162
och+0.158
oke+0.157
umm+0.156
ides+0.156
Neg logit contributions
 the-0.117
 a-0.111
 his-0.096
 all-0.094
 her-0.091
Token make
Token ID787
Feature activation+1.16e+01

Pos logit contributions
 the+0.295
 a+0.275
 his+0.236
 all+0.232
 her+0.221
Neg logit contributions
lor-0.419
-0.405
ides-0.400
istent-0.389
arry-0.384
Token a
Token ID257
Feature activation+2.78e+01

Pos logit contributions
 the+0.108
 in+0.082
 a+0.080
 all+0.073
 his+0.071
Neg logit contributions
lor-0.588
umm-0.520
ides-0.508
apa-0.506
jo-0.504
Token small
Token ID1402
Feature activation+2.93e+01

Pos logit contributions
 the+0.817
 a+0.739
 all+0.645
 his+0.621
 her+0.605
Neg logit contributions
lor-1.172
umm-1.061
arry-1.043
ides-1.035
uer-1.031
Token."
Token ID526
Feature activation-3.10e+01

Pos logit contributions
lor+0.927
umm+0.825
ides+0.818
och+0.810
ider+0.799
Neg logit contributions
 the-0.633
 a-0.568
 all-0.499
 his-0.477
 her-0.459
Token
Token ID198
Feature activation-2.34e+01

Pos logit contributions
lor+1.450
ides+1.335
reetings+1.262
umm+1.260
uer+1.259
Neg logit contributions
 the-1.094
 a-0.993
 her-0.858
 his-0.820
 all-0.810
Token
Token ID198
Feature activation-2.35e+01

Pos logit contributions
lor+1.100
umm+1.060
ides+1.024
ider+1.013
istent+0.988
Neg logit contributions
 the-0.811
 a-0.727
 her-0.602
 his-0.590
 all-0.582
TokenBen
Token ID11696
Feature activation-2.53e+01

Pos logit contributions
lor+0.740
ides+0.655
umm+0.642
istent+0.617
oke+0.616
Neg logit contributions
 the-1.066
 a-1.005
 her-0.895
 all-0.885
 his-0.880
Token is
Token ID318
Feature activation+1.73e+00

Pos logit contributions
lor+0.836
ides+0.741
oke+0.738
arry+0.727
istent+0.714
Neg logit contributions
 the-0.931
 a-0.866
 all-0.772
 his-0.742
 her-0.740
Token surprised
Token ID6655
Feature activation-2.22e-01

Pos logit contributions
 the+0.045
 a+0.037
 all+0.036
 her+0.032
 his+0.032
Neg logit contributions
ider-0.060
oling-0.056
oke-0.055
lor-0.054
aron-0.054
Token.
Token ID13
Feature activation-1.69e+01

Pos logit contributions
lor+0.011
ides+0.010
arry+0.010
och+0.010
umm+0.010
Neg logit contributions
 the-0.005
 a-0.004
 all-0.003
 his-0.003
 her-0.003
Token He
Token ID679
Feature activation-9.88e+00

Pos logit contributions
lor+0.638
ides+0.570
och+0.529
istent+0.525
reetings+0.518
Neg logit contributions
 the-0.680
 a-0.637
 his-0.560
 all-0.556
 her-0.534
Token thinks
Token ID6834
Feature activation-2.04e+01

Pos logit contributions
lor+0.305
ides+0.271
oke+0.266
arry+0.264
istent+0.260
Neg logit contributions
 the-0.391
 a-0.366
 all-0.327
 his-0.315
 her-0.313
Token Lily
Token ID20037
Feature activation+4.71e+00

Pos logit contributions
oke+0.847
lor+0.793
och+0.773
ider+0.766
enny+0.765
Neg logit contributions
 the-0.381
 a-0.375
 all-0.299
 out-0.292
 up-0.281
Token is
Token ID318
Feature activation+1.21e+00

Pos logit contributions
 the+0.158
 a+0.148
 all+0.129
 his+0.122
 her+0.121
Neg logit contributions
lor-0.180
ides-0.161
arry-0.157
oke-0.157
och-0.156
Token the
Token ID262
Feature activation+1.55e+01

Pos logit contributions
lor+0.345
oc+0.333
umm+0.328
och+0.325
obby+0.322
Neg logit contributions
 the-0.043
 a-0.035
 all-0.020
 her-0.011
 out-0.010
Token fence
Token ID13990
Feature activation-1.69e+01

Pos logit contributions
 the+0.500
 a+0.452
 all+0.413
 her+0.400
 his+0.389
Neg logit contributions
lor-0.551
umm-0.516
och-0.506
ider-0.501
oke-0.487
Token outside
Token ID2354
Feature activation+9.79e+00

Pos logit contributions
lor+0.710
och+0.659
ider+0.649
ides+0.646
umm+0.645
Neg logit contributions
 the-0.413
 a-0.380
 her-0.307
 all-0.306
 his-0.302
Token.
Token ID13
Feature activation-2.15e+01

Pos logit contributions
 the+0.209
 a+0.199
 all+0.161
 her+0.148
 his+0.139
Neg logit contributions
lor-0.436
umm-0.396
ides-0.391
och-0.390
jo-0.390
Token He
Token ID679
Feature activation-1.39e+01

Pos logit contributions
lor+1.088
ides+1.041
umm+0.956
ider+0.951
uer+0.946
Neg logit contributions
 the-0.663
 a-0.591
 his-0.503
 all-0.479
 her-0.450
Token asked
Token ID1965
Feature activation-1.85e+01

Pos logit contributions
lor+0.510
oke+0.475
jo+0.469
ides+0.457
uddy+0.456
Neg logit contributions
 the-0.431
 a-0.398
 all-0.344
 her-0.332
 his-0.330
Token Tim
Token ID5045
Feature activation+1.88e+00

Pos logit contributions
lor+0.778
oke+0.764
arry+0.751
obby+0.746
etsy+0.744
Neg logit contributions
 the-0.304
 a-0.292
 all-0.228
 big-0.219
 up-0.209
Tokenmy
Token ID1820
Feature activation-2.33e+00

Pos logit contributions
 the+0.074
 a+0.070
 all+0.063
 his+0.059
 her+0.059
Neg logit contributions
lor-0.058
oke-0.051
och-0.051
arry-0.050
uddy-0.050
Token to
Token ID284
Feature activation+6.18e+00

Pos logit contributions
lor+0.099
oke+0.095
aring+0.092
och+0.092
arry+0.091
Neg logit contributions
 the-0.053
 a-0.049
 all-0.040
 her-0.036
 his-0.035
Token stay
Token ID2652
Feature activation+1.09e+01

Pos logit contributions
 the+0.164
 a+0.150
 all+0.132
 on+0.120
 his+0.119
Neg logit contributions
lor-0.241
ides-0.227
-0.226
och-0.220
arry-0.219
Token inside
Token ID2641
Feature activation+7.70e+00

Pos logit contributions
 the+0.199
 a+0.172
 all+0.152
 her+0.123
 more+0.117
Neg logit contributions
lor-0.439
aj-0.427
ider-0.426
ivering-0.418
 Amen-0.412
TokenThere
Token ID1858
Feature activation-1.93e+01

Pos logit contributions
lor+0.541
ides+0.478
umm+0.476
arry+0.458
istent+0.451
Neg logit contributions
 the-0.634
 a-0.604
 all-0.510
 his-0.508
 her-0.501
Token is
Token ID318
Feature activation+2.73e+00

Pos logit contributions
lor+0.755
ides+0.657
oke+0.632
umm+0.629
uddy+0.624
Neg logit contributions
 the-0.607
 a-0.575
 all-0.502
 his-0.480
 her-0.466
Token a
Token ID257
Feature activation+2.73e+01

Pos logit contributions
 the+0.024
 all+0.012
 her+0.010
 a+0.009
 his+0.008
Neg logit contributions
lor-0.141
ider-0.136
-0.130
uddy-0.129
pill-0.128
Token spider
Token ID19230
Feature activation-7.05e+00

Pos logit contributions
 the+0.973
 a+0.891
 all+0.806
 her+0.771
 his+0.770
Neg logit contributions
lor-0.956
ider-0.841
uddy-0.841
umm-0.837
och-0.824
Token!"
Token ID2474
Feature activation-3.31e+01

Pos logit contributions
lor+0.337
ider+0.303
ides+0.299
och+0.296
oke+0.296
Neg logit contributions
 the-0.162
 a-0.147
 all-0.122
 his-0.118
 her-0.112
Token
Token ID198
Feature activation-2.21e+01

Pos logit contributions
lor+1.307
umm+1.155
ides+1.149
och+1.099
arry+1.092
Neg logit contributions
 the-1.264
 a-1.193
 all-1.017
 her-0.981
 his-0.963
Token
Token ID198
Feature activation-2.19e+01

Pos logit contributions
umm+1.036
lor+1.022
ides+0.954
ider+0.948
istent+0.940
Neg logit contributions
 the-0.761
 a-0.697
 his-0.549
 her-0.547
 all-0.538
TokenTom
Token ID13787
Feature activation-2.23e+01

Pos logit contributions
lor+0.828
umm+0.827
ides+0.757
istent+0.732
oke+0.713
Neg logit contributions
 the-0.902
 a-0.846
 her-0.713
 his-0.688
 all-0.671
Token dropped
Token ID5710
Feature activation+6.91e+00

Pos logit contributions
lor+0.945
ides+0.856
oke+0.846
arry+0.840
uddy+0.837
Neg logit contributions
 the-0.636
 a-0.577
 all-0.498
 his-0.470
 her-0.463
Token the
Token ID262
Feature activation+2.42e+01

Pos logit contributions
 a+0.014
 the+0.011
 all+0.011
 in+0.006
 very-0.006
Neg logit contributions
och-0.318
oc-0.318
-0.314
lor-0.314
pillar-0.308
Token skull
Token ID14511
Feature activation-9.29e+00

Pos logit contributions
 the+0.942
 a+0.861
 all+0.806
 his+0.796
 their+0.788
Neg logit contributions
lor-0.706
pillar-0.655
ider-0.655
och-0.627
uddy-0.618
Token find
Token ID1064
Feature activation+5.04e+00

Pos logit contributions
 the+0.193
 a+0.172
 his+0.146
 all+0.142
 her+0.138
Neg logit contributions
lor-0.311
ides-0.291
-0.280
arry-0.277
istent-0.273
Token another
Token ID1194
Feature activation+2.66e+01

Pos logit contributions
 the+0.050
 his+0.039
 all+0.037
 a+0.035
 in+0.027
Neg logit contributions
lor-0.248
'."-0.222
ides-0.222
och-0.220
mmmm-0.219
Token book
Token ID1492
Feature activation-6.71e+00

Pos logit contributions
 the+0.890
 a+0.817
 all+0.727
 his+0.697
 her+0.679
Neg logit contributions
lor-1.045
ides-0.921
och-0.902
arry-0.891
umm-0.887
Token."
Token ID526
Feature activation-3.05e+01

Pos logit contributions
lor+0.338
ides+0.309
ider+0.306
och+0.303
oke+0.300
Neg logit contributions
 the-0.146
 a-0.126
 all-0.102
 his-0.101
 her-0.095
Token
Token ID198
Feature activation-2.29e+01

Pos logit contributions
lor+1.347
ides+1.235
istent+1.209
och+1.202
reetings+1.193
Neg logit contributions
 the-1.083
 a-0.982
 all-0.831
 her-0.817
 his-0.786
Token
Token ID198
Feature activation-2.29e+01

Pos logit contributions
lor+0.990
umm+0.930
ides+0.917
istent+0.908
och+0.893
Neg logit contributions
 the-0.827
 a-0.751
 all-0.633
 her-0.608
 his-0.606
TokenBut
Token ID1537
Feature activation-3.13e+01

Pos logit contributions
lor+0.660
ides+0.561
och+0.544
istent+0.538
jo+0.534
Neg logit contributions
 the-1.077
 a-1.015
 all-0.929
 her-0.890
 his-0.881
Token Sara
Token ID24799
Feature activation-8.78e+00

Pos logit contributions
lor+1.123
ider+1.060
+1.032
etsy+1.016
Really+1.002
Neg logit contributions
 the-0.816
 a-0.787
 all-0.687
 his-0.677
 big-0.649
Token does
Token ID857
Feature activation-5.08e+00

Pos logit contributions
lor+0.334
ides+0.300
arry+0.290
oke+0.288
ider+0.287
Neg logit contributions
 the-0.304
 a-0.282
 all-0.250
 his-0.236
 her-0.233
Token not
Token ID407
Feature activation+4.71e+00

Pos logit contributions
lor+0.249
ides+0.229
ider+0.226
arry+0.225
och+0.225
Neg logit contributions
 the-0.106
 a-0.095
 all-0.081
 his-0.069
 her-0.068
Token want
Token ID765
Feature activation-8.98e+00

Pos logit contributions
 the+0.156
 a+0.144
 all+0.128
 his+0.118
 her+0.117
Neg logit contributions
lor-0.172
ides-0.162
arry-0.157
och-0.156
oke-0.153
Token swim
Token ID9422
Feature activation+8.81e+00

Pos logit contributions
 the+0.358
 a+0.334
 all+0.278
 his+0.256
 her+0.253
Neg logit contributions
lor-0.570
ides-0.528
-0.509
istent-0.509
arry-0.508
Token away
Token ID1497
Feature activation+2.50e+01

Pos logit contributions
 the+0.195
 a+0.173
 her+0.134
 all+0.132
 his+0.122
Neg logit contributions
lor-0.412
arry-0.378
ider-0.377
ides-0.371
etsy-0.369
Token.
Token ID13
Feature activation-1.23e+01

Pos logit contributions
 the+0.687
 a+0.635
 all+0.526
 her+0.519
 his+0.491
Neg logit contributions
lor-0.984
ider-0.904
arry-0.890
ides-0.859
istent-0.853
Token
Token ID198
Feature activation-1.92e+01

Pos logit contributions
lor+0.532
ides+0.532
ider+0.499
arians+0.495
och+0.491
Neg logit contributions
 the-0.424
 a-0.383
 all-0.334
 his-0.330
 her-0.295
Token
Token ID198
Feature activation-1.92e+01

Pos logit contributions
umm+0.911
lor+0.872
ides+0.863
ider+0.854
arians+0.842
Neg logit contributions
 the-0.621
 a-0.563
 all-0.466
 his-0.463
 her-0.415
TokenJust
Token ID5703
Feature activation-3.54e+01

Pos logit contributions
lor+0.576
ides+0.547
umm+0.530
jo+0.525
och+0.513
Neg logit contributions
 the-0.847
 a-0.808
 all-0.733
 his-0.712
 in-0.678
Token then
Token ID788
Feature activation+4.85e+00

Pos logit contributions
lor+1.219
ider+1.155
ides+1.109
mmm+1.108
reetings+1.088
Neg logit contributions
 the-0.976
 a-0.900
 all-0.847
 her-0.769
 big-0.760
Token,
Token ID11
Feature activation+4.38e+00

Pos logit contributions
 the+0.098
 a+0.084
 all+0.080
 more+0.062
 her+0.061
Neg logit contributions
lor-0.224
arry-0.209
uddy-0.200
etsy-0.199
reetings-0.199
Token D
Token ID360
Feature activation-5.65e+00

Pos logit contributions
 the+0.100
 a+0.086
 all+0.081
 more+0.074
 up+0.072
Neg logit contributions
lor-0.172
-0.160
etsy-0.158
arry-0.156
uddy-0.148
Tokenucky
Token ID5309
Feature activation-9.90e+00

Pos logit contributions
lor+0.180
arians+0.152
ides+0.148
jo+0.148
och+0.146
Neg logit contributions
 the-0.242
 a-0.234
 all-0.207
 in-0.201
 her-0.199
Token's
Token ID338
Feature activation+1.06e+01

Pos logit contributions
lor+0.411
ides+0.361
oke+0.357
arry+0.357
jo+0.355
Neg logit contributions
 the-0.295
 a-0.270
 all-0.238
 her-0.220
 his-0.218
Token the
Token ID262
Feature activation+2.50e+01

Pos logit contributions
 the+0.118
 a+0.094
 in+0.053
 all+0.052
 out+0.046
Neg logit contributions
lor-0.711
jo-0.687
ides-0.686
-0.684
istent-0.684
Token cheap
Token ID7026
Feature activation+2.08e+01

Pos logit contributions
 the+0.659
 a+0.592
 her+0.510
 all+0.506
 his+0.472
Neg logit contributions
lor-1.044
och-0.968
ides-0.960
umm-0.956
oke-0.943
Token navy
Token ID23956
Feature activation+1.93e+00

Pos logit contributions
 the+0.571
 a+0.517
 all+0.458
 her+0.456
 his+0.408
Neg logit contributions
lor-0.804
och-0.749
ides-0.734
arry-0.715
oke-0.710
Token paint
Token ID7521
Feature activation+1.29e+01

Pos logit contributions
 the+0.055
 a+0.051
 her+0.043
 all+0.043
 his+0.041
Neg logit contributions
lor-0.082
ides-0.073
och-0.073
oke-0.071
arry-0.071
Token!
Token ID0
Feature activation-1.87e+01

Pos logit contributions
 the+0.326
 a+0.296
 all+0.246
 her+0.234
 his+0.234
Neg logit contributions
lor-0.591
och-0.536
ides-0.535
umm-0.525
-0.513
Token<|endoftext|>
Token ID50256
Feature activation-3.32e+01

Pos logit contributions
lor+0.871
ider+0.852
ides+0.811
uer+0.800
arians+0.773
Neg logit contributions
 the-0.644
 a-0.571
 his-0.503
 all-0.494
 their-0.442
TokenOnce
Token ID7454
Feature activation-4.06e+01

Pos logit contributions
lor+1.233
ides+1.168
istent+1.151
umm+1.137
ider+1.136
Neg logit contributions
 the-1.348
 a-1.244
 all-1.077
 in-1.034
 his-1.022
Token upon
Token ID2402
Feature activation-7.08e+00

Pos logit contributions
lor+1.499
ides+1.312
umm+1.274
och+1.273
arry+1.269
Neg logit contributions
 the-1.502
 a-1.398
 all-1.251
 his-1.179
 her-1.169
Token a
Token ID257
Feature activation+2.97e+01

Pos logit contributions
lor+0.420
unch+0.378
jo+0.377
ides+0.376
arry+0.374
Neg logit contributions
 the-0.044
 all-0.012
 a-0.000
 out+0.006
 up+0.007
Token time
Token ID640
Feature activation-5.29e+00

Pos logit contributions
 the+0.648
 a+0.618
 all+0.464
 her+0.406
 in+0.406
Neg logit contributions
lor-1.594
och-1.456
umm-1.427
ides-1.427
arry-1.402
Token there
Token ID612
Feature activation+9.21e+00

Pos logit contributions
lor+0.295
ides+0.271
arry+0.266
och+0.266
umm+0.265
Neg logit contributions
 the-0.094
 a-0.081
 all-0.062
 his-0.052
 her-0.051
Token the
Token ID262
Feature activation+1.47e+01

Pos logit contributions
lor+0.111
aj+0.105
och+0.105
etsy+0.104
reetings+0.104
Neg logit contributions
 the-0.029
 a-0.022
 all-0.020
 his-0.018
 her-0.016
Token apple
Token ID17180
Feature activation-3.43e+00

Pos logit contributions
 the+0.460
 a+0.407
 her+0.369
 his+0.368
 all+0.365
Neg logit contributions
lor-0.578
umm-0.525
ides-0.507
-0.503
och-0.499
Token for
Token ID329
Feature activation+4.10e+00

Pos logit contributions
lor+0.146
ides+0.130
umm+0.130
och+0.128
arry+0.127
Neg logit contributions
 the-0.097
 a-0.089
 all-0.075
 his-0.074
 her-0.072
Token me
Token ID502
Feature activation+5.47e+00

Pos logit contributions
 the+0.086
 a+0.077
 out+0.067
 all+0.067
 up+0.067
Neg logit contributions
lor-0.185
mmm-0.163
ides-0.162
mmmm-0.162
umm-0.159
Token."
Token ID526
Feature activation-3.62e+01

Pos logit contributions
 the+0.172
 a+0.158
 all+0.138
 his+0.130
 her+0.128
Neg logit contributions
lor-0.229
ides-0.205
arry-0.199
och-0.199
ider-0.197
Token
Token ID198
Feature activation-2.80e+01

Pos logit contributions
lor+1.512
ides+1.427
umm+1.390
ider+1.349
reetings+1.324
Neg logit contributions
 the-1.345
 a-1.207
 his-1.042
 all-1.035
 her-1.031
Token
Token ID198
Feature activation-2.75e+01

Pos logit contributions
umm+1.337
lor+1.272
ides+1.266
ider+1.254
istent+1.227
Neg logit contributions
 the-0.929
 a-0.822
 his-0.694
 her-0.669
 all-0.650
TokenMom
Token ID29252
Feature activation-3.83e+01

Pos logit contributions
umm+1.046
ides+1.007
lor+0.999
istent+0.950
plets+0.916
Neg logit contributions
 the-1.065
 a-0.994
 his-0.857
 her-0.851
 all-0.815
Tokenmy
Token ID1820
Feature activation-2.33e+00

Pos logit contributions
lor+1.499
ides+1.318
oke+1.316
arry+1.303
och+1.298
Neg logit contributions
 the-1.253
 a-1.172
 all-1.025
 his-0.979
 her-0.957
Token smiled
Token ID13541
Feature activation-1.00e+01

Pos logit contributions
lor+0.098
oke+0.087
ides+0.086
arry+0.085
istent+0.085
Neg logit contributions
 the-0.069
 a-0.064
 all-0.056
 his-0.053
 her-0.051
Token and
Token ID290
Feature activation+8.31e+00

Pos logit contributions
lor+0.471
arry+0.435
umm+0.428
ider+0.424
istent+0.424
Neg logit contributions
 the-0.206
 a-0.175
 all-0.145
 his-0.122
 her-0.119
Token stay
Token ID2652
Feature activation+1.77e+01

Pos logit contributions
 the+0.349
 a+0.318
 all+0.270
 his+0.250
 her+0.245
Neg logit contributions
lor-0.579
ides-0.530
arry-0.511
-0.504
istent-0.502
Token too
Token ID1165
Feature activation+1.22e+01

Pos logit contributions
 the+0.335
 a+0.278
 all+0.241
 his+0.203
 her+0.197
Neg logit contributions
lor-0.767
ider-0.724
aj-0.705
ides-0.704
uer-0.703
Token.
Token ID13
Feature activation-1.23e+01

Pos logit contributions
 the+0.310
 a+0.278
 all+0.233
 his+0.218
 her+0.216
Neg logit contributions
lor-0.558
ides-0.511
arry-0.507
ider-0.504
och-0.497
Token
Token ID198
Feature activation-1.93e+01

Pos logit contributions
lor+0.576
ides+0.533
reetings+0.510
ider+0.507
umm+0.499
Neg logit contributions
 the-0.419
 a-0.372
 her-0.309
 all-0.305
 his-0.286
Token
Token ID198
Feature activation-1.93e+01

Pos logit contributions
umm+0.956
lor+0.919
ider+0.888
ides+0.886
istent+0.846
Neg logit contributions
 the-0.621
 a-0.550
 her-0.441
 all-0.427
 his-0.423
TokenSo
Token ID2396
Feature activation-5.02e+01

Pos logit contributions
lor+0.823
ides+0.768
umm+0.760
oke+0.727
istent+0.715
Neg logit contributions
 the-0.660
 a-0.602
 all-0.509
 her-0.504
 his-0.471
Token the
Token ID262
Feature activation+2.64e+01

Pos logit contributions
lor+2.627
umm+2.571
arry+2.551
+2.526
etsy+2.484
Neg logit contributions
 a-0.217
 the-0.133
 out-0.044
 his-0.031
 all-0.023
Token cube
Token ID23441
Feature activation-6.95e+00

Pos logit contributions
 the+0.841
 a+0.772
 all+0.675
 her+0.644
 his+0.639
Neg logit contributions
lor-1.028
umm-0.954
arry-0.930
oke-0.924
och-0.923
Token opened
Token ID4721
Feature activation+5.63e+00

Pos logit contributions
lor+0.283
ides+0.249
jo+0.246
arry+0.246
ider+0.245
Neg logit contributions
 the-0.218
 a-0.199
 all-0.172
 her-0.161
 his-0.160
Token the
Token ID262
Feature activation+2.63e+01

Pos logit contributions
 the+0.043
 a+0.043
 more+0.029
 all+0.025
 good+0.022
Neg logit contributions
lor-0.267
aman-0.245
aron-0.245
arry-0.244
jo-0.241
Token door
Token ID3420
Feature activation+2.92e+00

Pos logit contributions
 the+0.861
 a+0.790
 all+0.692
 her+0.661
 his+0.653
Neg logit contributions
lor-1.064
umm-0.947
ides-0.935
och-0.927
arry-0.924
Token p
Token ID279
Feature activation+9.97e+00

Pos logit contributions
 the+0.856
 a+0.774
 all+0.688
 his+0.686
 her+0.683
Neg logit contributions
lor-0.858
umm-0.787
ider-0.755
ides-0.753
och-0.747
Tokenanda
Token ID5282
Feature activation-3.18e+01

Pos logit contributions
 the+0.514
 a+0.490
 all+0.467
 in+0.438
 her+0.433
Neg logit contributions
lor-0.203
ides-0.162
istent-0.156
umm-0.148
reetings-0.141
Token and
Token ID290
Feature activation+1.31e+01

Pos logit contributions
lor+1.443
ides+1.283
och+1.278
oke+1.268
istent+1.261
Neg logit contributions
 the-0.719
 a-0.654
 all-0.528
 his-0.513
 her-0.510
Token said
Token ID531
Feature activation-5.96e+00

Pos logit contributions
 the+0.390
 a+0.368
 all+0.327
 out+0.303
 her+0.297
Neg logit contributions
lor-0.495
arry-0.435
oke-0.430
jo-0.428
och-0.426
Token
Token ID6184
Feature activation-3.78e+01

Pos logit contributions
lor+0.291
oke+0.260
arry+0.260
ider+0.255
och+0.255
Neg logit contributions
 the-0.126
 a-0.111
 all-0.097
 her-0.081
 his-0.080
Token
Token ID95
Feature activation-5.24e+01

Pos logit contributions
lor+0.958
ides+0.775
umm+0.740
jo+0.736
och+0.729
Neg logit contributions
 the-1.849
 a-1.749
 all-1.595
 in-1.526
 his-1.513
Token
Token ID26391
Feature activation-3.35e+01

Pos logit contributions
lor+1.303
ides+1.116
och+1.103
unch+1.098
oral+1.090
Neg logit contributions
 the-2.523
 a-2.422
 all-2.098
 his-2.064
 in-2.045
Token
Token ID129
Feature activation-3.24e+01

Pos logit contributions
umm+0.981
lor+0.973
oral+0.910
ides+0.900
och+0.872
Neg logit contributions
 the-1.485
 a-1.406
 all-1.227
 in-1.154
 his-1.138
Token
Token ID241
Feature activation-3.78e+01

Pos logit contributions
lor+0.801
ides+0.648
och+0.631
umm+0.625
arry+0.621
Neg logit contributions
 the-1.601
 a-1.518
 all-1.399
 his-1.344
 her-1.336
TokenCome
Token ID16773
Feature activation-2.88e+01

Pos logit contributions
lor+1.236
ides+1.103
umm+1.063
och+1.013
istent+1.000
Neg logit contributions
 the-1.653
 a-1.592
 his-1.357
 all-1.350
 her-1.329
Token with
Token ID351
Feature activation+1.49e+01

Pos logit contributions
lor+1.573
och+1.432
ides+1.430
arry+1.423
+1.412
Neg logit contributions
 the-0.506
 a-0.434
 all-0.338
 his-0.293
 her-0.284
Token of
Token ID286
Feature activation+7.67e+00

Pos logit contributions
lor+0.406
oke+0.372
ider+0.371
och+0.369
umm+0.368
Neg logit contributions
 the-0.350
 a-0.320
 her-0.276
 his-0.273
 all-0.265
Token plans
Token ID3352
Feature activation-1.17e+01

Pos logit contributions
 the+0.186
 a+0.174
 out+0.148
 all+0.143
 in+0.139
Neg logit contributions
lor-0.291
pill-0.291
jo-0.280
oke-0.279
umm-0.277
Token.
Token ID13
Feature activation-2.02e+01

Pos logit contributions
lor+0.507
ides+0.476
och+0.456
arry+0.454
istent+0.452
Neg logit contributions
 the-0.268
 a-0.240
 his-0.197
 her-0.194
 all-0.189
Token
Token ID198
Feature activation-2.68e+01

Pos logit contributions
lor+0.845
ides+0.798
ider+0.768
umm+0.758
uer+0.745
Neg logit contributions
 the-0.744
 a-0.694
 all-0.585
 his-0.559
 her-0.543
Token
Token ID198
Feature activation-2.64e+01

Pos logit contributions
umm+1.225
lor+1.157
ides+1.146
ider+1.124
istent+1.090
Neg logit contributions
 the-0.901
 a-0.839
 all-0.682
 his-0.678
 her-0.653
TokenUnfortunately
Token ID13898
Feature activation-4.77e+01

Pos logit contributions
lor+0.928
umm+0.917
ides+0.888
oke+0.826
istent+0.823
Neg logit contributions
 the-1.042
 a-0.992
 all-0.849
 his-0.832
 her-0.830
Token,
Token ID11
Feature activation-2.23e+00

Pos logit contributions
lor+1.839
arry+1.763
oke+1.737
ides+1.714
istent+1.704
Neg logit contributions
 a-1.012
 the-0.964
 big-0.840
 all-0.806
 his-0.732
Token Winn
Token ID19454
Feature activation-3.06e+01

Pos logit contributions
lor+0.103
+0.095
etsy+0.095
umm+0.094
uddy+0.093
Neg logit contributions
 the-0.032
 a-0.031
 her-0.024
 up-0.023
 all-0.023
Tokenie
Token ID494
Feature activation-1.34e+01

Pos logit contributions
lor+0.980
ides+0.861
jo+0.860
oke+0.844
och+0.819
Neg logit contributions
 the-1.198
 a-1.161
 all-1.001
 his-0.986
 out-0.956
Token got
Token ID1392
Feature activation+4.86e+00

Pos logit contributions
lor+0.543
ides+0.482
oke+0.473
istent+0.472
arry+0.467
Neg logit contributions
 the-0.418
 a-0.389
 all-0.336
 his-0.326
 her-0.321
Token sick
Token ID6639
Feature activation+5.86e+00

Pos logit contributions
 the+0.095
 a+0.077
 his+0.065
 all+0.064
 her+0.062
Neg logit contributions
lor-0.182
etsy-0.179
och-0.177
-0.177
oling-0.174
Token brave
Token ID14802
Feature activation+1.27e+01

Pos logit contributions
 the+0.208
 a+0.189
 all+0.170
 her+0.155
 out+0.146
Neg logit contributions
lor-0.295
oke-0.289
ider-0.286
arry-0.285
uer-0.283
Token and
Token ID290
Feature activation+1.36e+01

Pos logit contributions
 the+0.258
 a+0.233
 all+0.187
 her+0.173
 his+0.167
Neg logit contributions
lor-0.612
oke-0.566
umm-0.562
ides-0.560
jo-0.560
Token said
Token ID531
Feature activation-5.50e+00

Pos logit contributions
 the+0.409
 a+0.394
 all+0.358
 out+0.349
 her+0.338
Neg logit contributions
lor-0.442
uer-0.401
oke-0.399
ider-0.394
arry-0.391
Token,
Token ID11
Feature activation+2.91e+00

Pos logit contributions
lor+0.262
oke+0.236
arry+0.234
ider+0.231
reetings+0.231
Neg logit contributions
 the-0.116
 a-0.102
 all-0.090
 her-0.079
 his-0.074
Token
Token ID6184
Feature activation-3.72e+01

Pos logit contributions
 the+0.100
 a+0.093
 all+0.083
 his+0.078
 her+0.077
Neg logit contributions
lor-0.113
ides-0.099
arry-0.097
och-0.097
umm-0.096
Token
Token ID95
Feature activation-5.19e+01

Pos logit contributions
lor+0.927
ides+0.758
uddy+0.754
och+0.751
jo+0.744
Neg logit contributions
 the-1.811
 a-1.726
 all-1.554
 his-1.499
 in-1.488
Token
Token ID26391
Feature activation-3.30e+01

Pos logit contributions
lor+1.199
uddy+1.121
och+1.096
etsy+1.074
oke+1.058
Neg logit contributions
 the-2.550
 a-2.452
 his-2.140
 all-2.132
 in-2.063
Token
Token ID129
Feature activation-3.20e+01

Pos logit contributions
lor+0.865
umm+0.862
och+0.815
ides+0.790
ider+0.777
Neg logit contributions
 the-1.531
 a-1.461
 all-1.284
 his-1.219
 in-1.204
Token
Token ID241
Feature activation-3.74e+01

Pos logit contributions
lor+0.778
ides+0.626
och+0.618
umm+0.613
arry+0.603
Neg logit contributions
 the-1.594
 a-1.512
 all-1.393
 his-1.340
 her-1.329
TokenHey
Token ID10814
Feature activation-1.35e+01

Pos logit contributions
lor+1.221
ides+1.093
umm+1.044
och+1.011
oke+1.001
Neg logit contributions
 the-1.649
 a-1.616
 his-1.371
 all-1.344
 her-1.327
Token,
Token ID11
Feature activation+3.41e+00

Pos logit contributions
lor+0.619
oke+0.552
ides+0.547
arry+0.545
och+0.541
Neg logit contributions
 the-0.349
 a-0.317
 all-0.280
 her-0.263
 his-0.258
Token.
Token ID13
Feature activation-1.30e+01

Pos logit contributions
 the+0.170
 a+0.152
 all+0.126
 his+0.115
 her+0.113
Neg logit contributions
lor-0.399
arry-0.356
ides-0.350
reetings-0.349
jo-0.347
Tokenâ
Token ID22940
Feature activation-3.57e+01

Pos logit contributions
lor+0.476
ides+0.456
uer+0.436
ider+0.421
wl+0.420
Neg logit contributions
 the-0.525
 a-0.489
 all-0.402
 her-0.396
 his-0.391
Token
Token ID26391
Feature activation-3.26e+01

Pos logit contributions
lor+0.846
umm+0.666
uddy+0.653
ides+0.628
pill+0.610
Neg logit contributions
 the-1.842
 a-1.723
 all-1.558
 in-1.523
 his-1.503
Token
Token ID198
Feature activation-2.01e+01

Pos logit contributions
lor+0.944
umm+0.913
ides+0.856
ider+0.848
och+0.832
Neg logit contributions
 the-1.522
 a-1.420
 all-1.251
 in-1.160
 it-1.153
Token
Token ID198
Feature activation-2.02e+01

Pos logit contributions
umm+0.954
lor+0.903
ides+0.899
ider+0.887
uer+0.882
Neg logit contributions
 the-0.651
 a-0.580
 all-0.478
 his-0.459
 her-0.443
TokenSo
Token ID2396
Feature activation-5.11e+01

Pos logit contributions
lor+0.748
ides+0.733
umm+0.712
jo+0.688
uer+0.667
Neg logit contributions
 the-0.770
 a-0.702
 all-0.625
 his-0.588
 her-0.586
Token,
Token ID11
Feature activation+3.32e+00

Pos logit contributions
lor+2.556
arry+2.331
och+2.301
umm+2.293
ides+2.273
Neg logit contributions
 the-0.928
 a-0.881
 all-0.661
 his-0.587
 out-0.584
Token the
Token ID262
Feature activation+2.55e+01

Pos logit contributions
 the+0.063
 a+0.058
 all+0.046
 his+0.041
 her+0.040
Neg logit contributions
lor-0.165
och-0.148
arry-0.148
umm-0.146
-0.146
Token mom
Token ID1995
Feature activation-8.49e+00

Pos logit contributions
 the+0.726
 a+0.649
 all+0.570
 his+0.551
 her+0.533
Neg logit contributions
lor-1.116
umm-0.986
ider-0.976
ides-0.976
uddy-0.970
Tokenmy
Token ID1820
Feature activation+4.03e+00

Pos logit contributions
lor+0.290
ides+0.251
och+0.251
oke+0.250
arry+0.248
Neg logit contributions
 the-0.323
 a-0.304
 all-0.271
 his-0.259
 her-0.256
Token bird
Token ID6512
Feature activation-5.33e+00

Pos logit contributions
 the+0.101
 a+0.091
 all+0.076
 his+0.072
 her+0.072
Neg logit contributions
lor-0.189
ides-0.171
oke-0.169
arry-0.169
och-0.167