TOP ACTIVATIONS
MAX = 4.613

and happy . Mar ley and T illy cheered
the razor was the most tre asured thing in all of
and her puppy developed a tre asured friendship . Together ,
it soon became his most tre asured possession . He called
would help her find the tre as ue . She wore
one of Lucy 's most tre asured memories ! <|endoftext|> Once
out ." So Mar ley and T illy used
advice . The owl told B ongo to use a clean
. Suddenly , Jon ny remembered . He remembered
One day Jon ny and his parents went
From that day on , Tim my always made sure to
enjoyed the sunny day . Jon ny looked around and saw
stretch to reach it . Tig gy stretched and stretched ,
loved . Jon ny smiled at his parents
From that day on , Tim my always wore his yellow
stuck in a cage . Jon ny asked the deer ,
bedroom . Jon ny 's mom was really
. When he reached the tre et op , he asked
was . Snow y told Tig gy that it was up
blue z ig zag . Tig was so happy that he

MIN ACTIVATIONS
MIN = -3.715

both jumped in and were so silly ! They laughed and
pushed the buttons and the images changed back to the way
All the people around were so happy , they cheered and
!" Tom and Mia say together . They hold up the
on their papers . Their images start to fade . They
All the kids around were so jealous of the juice maker
melted and the flame f lic kered . He realized that
and went black . The pictures and sounds were gone .
go !" The birds and bugs cheered back , " You
the birds stopped singing and there was silence . The big
Ben and they watch the video on the screen . They
the bees buzzing in and out of the small holes .
!" Sam and Tom say together . " But
mom were talking . Sue said , " Mom , I
a funny face . The sandwiches are spicy ! They have
It starts to rain . The rain drops fall on their
. Everyone around her was so proud ! Jen
on the flowers and make them happy . She also liked
the couch and watch the film . They see a girl
Inside the box , there are two dolls . One

INTERVAL 2.531 - 4.613
CONTAINS 0.311%

brighter . Steve was amazed - his wings
boy named Tim my . Tim my loved sa us ages
gave it to him . Tim my loved the soup and
shrugged and laughed . She gave Tommy some of the candy
j ogged back home . Tim my couldn 't wait to

INTERVAL 0.449 - 2.531
CONTAINS 30.268%

chain . She was happy with her chain .
Oh , thank you ! I 'll
" I don 't understand ," she said .
, I loved it ! Can we come back and join
, Anna . Come on , let 's go and get

INTERVAL -1.633 - 0.449
CONTAINS 66.428%

to the little boy . The little boy was
! <|endoftext|> One bright and sunny day , Jack and his
my 's makeup and decided to put it on her face
L ily and Max are twins . They like to play
and he realized he could draw and make himself feel a

INTERVAL -3.715 - -1.633
CONTAINS 2.993%

out her flowers and displays them on a bench . She
a tree . They drew a picture of a rainbow and
toys , and sometimes they fight over them .
animals with the clay . They had red clay , blue
put the puzzle pieces together . Sue and her big hand

Token and
Token ID290
Feature activation+1.17e-01

Pos logit contributions
apy+1.282
 floats+1.119
 are+1.107
ades+1.080
 were+1.066
Neg logit contributions
umper-0.953
bur-0.939
othy-0.932
asured-0.913
ior-0.904
Token happy
Token ID3772
Feature activation-1.49e+00

Pos logit contributions
umper+0.237
bur+0.232
othy+0.232
ior+0.228
asured+0.227
Neg logit contributions
apy-0.214
 are-0.185
 floats-0.184
 were-0.178
ades-0.176
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
apy+3.168
 floats+2.758
 are+2.723
ades+2.689
 were+2.552
Neg logit contributions
asured-2.538
othy-2.502
bur-2.467
umper-2.454
 Tooth-2.446
Token
Token ID198
Feature activation+9.85e-02

Pos logit contributions
umper+3.229
othy+3.170
bur+3.157
asured+3.148
 Tooth+3.097
Neg logit contributions
apy-1.854
 are-1.516
 were-1.494
 floats-1.442
 they-1.439
Token
Token ID198
Feature activation+3.23e-01

Pos logit contributions
umper+0.229
bur+0.225
othy+0.225
ior+0.221
asured+0.221
Neg logit contributions
apy-0.150
 floats-0.126
 are-0.126
 were-0.121
ades-0.117
TokenMar
Token ID7676
Feature activation+4.61e+00

Pos logit contributions
umper+0.659
bur+0.644
othy+0.644
ior+0.635
asured+0.632
Neg logit contributions
apy-0.591
 are-0.509
 floats-0.506
 were-0.494
ades-0.482
Tokenley
Token ID1636
Feature activation-1.11e-01

Pos logit contributions
othy+6.064
asured+5.976
 Tooth+5.934
umper+5.633
ior+5.298
Neg logit contributions
apy-11.581
 are-10.986
 were-10.846
 they-10.708
 vows-10.565
Token and
Token ID290
Feature activation+2.45e+00

Pos logit contributions
apy+0.220
 floats+0.190
 are+0.190
 were+0.183
ades+0.182
Neg logit contributions
umper-0.207
othy-0.204
bur-0.203
ior-0.199
asured-0.199
Token T
Token ID309
Feature activation+2.25e+00

Pos logit contributions
asured+4.196
umper+4.081
bur+4.070
ior+4.020
othy+3.990
Neg logit contributions
apy-5.404
 were-5.028
 are-4.999
 they-4.965
 floats-4.583
Tokenilly
Token ID6548
Feature activation-1.41e+00

Pos logit contributions
asured+2.379
othy+2.367
ior+2.350
bur+2.307
umper+2.224
Neg logit contributions
apy-6.080
 are-5.809
ades-5.711
 were-5.702
 vows-5.537
Token cheered
Token ID34767
Feature activation-6.09e-02

Pos logit contributions
apy+2.804
 floats+2.390
ades+2.360
 are+2.257
cakes+2.173
Neg logit contributions
othy-2.732
bur-2.698
umper-2.674
ior-2.596
asured-2.583
Token the
Token ID262
Feature activation+1.81e+00

Pos logit contributions
apy+0.956
 floats+0.845
 are+0.829
ades+0.828
 were+0.792
Neg logit contributions
bur-0.877
umper-0.871
othy-0.865
ior-0.847
asured-0.842
Token razor
Token ID26811
Feature activation+3.62e-01

Pos logit contributions
umper+3.714
asured+3.612
ior+3.606
bur+3.597
othy+3.584
Neg logit contributions
apy-3.413
 are-3.067
 were-3.044
 floats-2.968
 they-2.769
Token was
Token ID373
Feature activation+1.58e+00

Pos logit contributions
umper+0.742
bur+0.725
othy+0.723
ior+0.715
asured+0.710
Neg logit contributions
apy-0.653
 are-0.564
 floats-0.560
 were-0.549
ades-0.530
Token the
Token ID262
Feature activation+1.91e+00

Pos logit contributions
umper+3.182
ior+3.145
bur+3.015
othy+2.964
 Tooth+2.958
Neg logit contributions
apy-3.079
 were-2.618
 are-2.612
 they-2.534
 floats-2.509
Token most
Token ID749
Feature activation+1.35e+00

Pos logit contributions
umper+3.971
asured+3.860
ior+3.859
othy+3.855
bur+3.741
Neg logit contributions
apy-3.288
 are-3.184
 were-3.074
 floats-2.995
 they-2.853
Token tre
Token ID2054
Feature activation+4.57e+00

Pos logit contributions
umper+2.866
bur+2.838
othy+2.788
ior+2.764
 Tooth+2.639
Neg logit contributions
apy-2.488
 are-2.104
 floats-2.098
 were-2.076
 they-2.000
Tokenasured
Token ID34006
Feature activation+3.54e+00

Pos logit contributions
ior+7.116
umper+7.070
bur+6.720
othy+6.513
 Tooth+6.351
Neg logit contributions
apy-11.547
 floats-9.934
 are-9.875
ades-9.772
ves-9.569
Token thing
Token ID1517
Feature activation+9.41e-01

Pos logit contributions
bur+7.411
umper+7.406
ior+7.351
othy+6.946
ppa+6.816
Neg logit contributions
apy-7.529
 are-6.524
 were-6.515
 they-6.478
 floats-6.231
Token in
Token ID287
Feature activation+1.45e+00

Pos logit contributions
umper+1.878
othy+1.826
bur+1.817
ior+1.800
asured+1.785
Neg logit contributions
apy-1.762
 are-1.572
 were-1.543
 floats-1.529
 they-1.494
Token all
Token ID477
Feature activation+1.00e+00

Pos logit contributions
umper+2.854
ior+2.829
bur+2.770
othy+2.732
asured+2.722
Neg logit contributions
apy-2.814
 are-2.507
 were-2.461
 floats-2.372
 they-2.366
Token of
Token ID286
Feature activation+1.20e+00

Pos logit contributions
umper+2.181
bur+2.121
othy+2.116
ior+2.100
 Tooth+2.094
Neg logit contributions
apy-1.701
 are-1.520
 were-1.480
 floats-1.479
 they-1.411
Token and
Token ID290
Feature activation+5.63e-01

Pos logit contributions
umper+1.231
bur+1.199
othy+1.186
 Tooth+1.179
ior+1.177
Neg logit contributions
apy-1.209
 are-1.070
 floats-1.054
 were-1.051
ades-0.991
Token her
Token ID607
Feature activation+7.78e-03

Pos logit contributions
umper+1.108
bur+1.081
othy+1.071
asured+1.061
ior+1.060
Neg logit contributions
apy-1.077
 are-0.942
 floats-0.925
 were-0.922
 they-0.876
Token puppy
Token ID26188
Feature activation-1.78e+00

Pos logit contributions
umper+0.015
othy+0.015
bur+0.015
ior+0.014
 Tooth+0.014
Neg logit contributions
apy-0.015
 are-0.013
 floats-0.013
 were-0.013
ades-0.012
Token developed
Token ID4166
Feature activation-9.10e-02

Pos logit contributions
apy+3.385
ades+2.939
 floats+2.885
cakes+2.671
 are+2.656
Neg logit contributions
othy-3.548
bur-3.529
umper-3.515
ior-3.426
asured-3.323
Token a
Token ID257
Feature activation-1.14e-01

Pos logit contributions
apy+0.159
 floats+0.136
 are+0.136
 were+0.131
ades+0.129
Neg logit contributions
umper-0.190
othy-0.187
bur-0.187
asured-0.184
ior-0.183
Token tre
Token ID2054
Feature activation+4.57e+00

Pos logit contributions
apy+0.209
 floats+0.181
 are+0.181
 were+0.175
ades+0.172
Neg logit contributions
umper-0.228
othy-0.225
bur-0.225
ior-0.220
 Tooth-0.220
Tokenasured
Token ID34006
Feature activation+1.72e+00

Pos logit contributions
 Tooth+8.335
umper+8.326
ior+7.801
bur+7.496
gie+7.212
Neg logit contributions
apy-9.343
ves-8.925
ades-8.911
 flags-8.801
 are-8.702
Token friendship
Token ID14738
Feature activation-3.82e-01

Pos logit contributions
umper+3.412
 Tooth+3.234
bur+3.209
ior+3.175
othy+3.147
Neg logit contributions
apy-3.430
 are-3.053
 floats-3.008
 were-2.982
 they-2.885
Token.
Token ID13
Feature activation-6.22e-02

Pos logit contributions
apy+0.782
 floats+0.683
 are+0.672
ades+0.650
 were+0.649
Neg logit contributions
umper-0.684
bur-0.672
othy-0.669
asured-0.659
 Tooth-0.653
Token Together
Token ID17083
Feature activation-8.80e-01

Pos logit contributions
apy+0.100
 are+0.084
 floats+0.084
 were+0.081
ades+0.080
Neg logit contributions
umper-0.140
othy-0.137
bur-0.137
ior-0.136
 Tooth-0.134
Token,
Token ID11
Feature activation-4.42e-01

Pos logit contributions
apy+1.416
 floats+1.225
 are+1.197
ades+1.194
 were+1.107
Neg logit contributions
bur-1.893
othy-1.893
umper-1.880
ior-1.862
 Tooth-1.845
Token it
Token ID340
Feature activation+1.73e-01

Pos logit contributions
umper+2.001
bur+1.941
ior+1.917
othy+1.907
 Tooth+1.887
Neg logit contributions
apy-1.760
 are-1.534
 floats-1.523
 were-1.515
 they-1.454
Token soon
Token ID2582
Feature activation-8.20e-01

Pos logit contributions
umper+0.344
bur+0.336
othy+0.336
ior+0.330
asured+0.329
Neg logit contributions
apy-0.324
 are-0.280
 floats-0.279
 were-0.271
ades-0.265
Token became
Token ID2627
Feature activation+1.34e+00

Pos logit contributions
apy+1.577
 floats+1.336
 are+1.322
ades+1.293
 were+1.246
Neg logit contributions
umper-1.595
othy-1.579
bur-1.554
asured-1.541
 Tooth-1.538
Token his
Token ID465
Feature activation+6.85e-01

Pos logit contributions
umper+2.605
ior+2.537
othy+2.526
bur+2.495
 Tooth+2.473
Neg logit contributions
apy-2.588
 were-2.274
 are-2.262
 floats-2.224
 they-2.170
Token most
Token ID749
Feature activation+1.09e+00

Pos logit contributions
umper+1.331
bur+1.290
othy+1.279
ior+1.267
 Tooth+1.251
Neg logit contributions
apy-1.331
 are-1.173
 floats-1.155
 were-1.141
ades-1.097
Token tre
Token ID2054
Feature activation+4.48e+00

Pos logit contributions
umper+2.155
bur+2.143
othy+2.102
ior+2.084
 Tooth+2.017
Neg logit contributions
apy-2.109
 are-1.841
 floats-1.830
 were-1.812
ades-1.756
Tokenasured
Token ID34006
Feature activation+2.33e+00

Pos logit contributions
ior+5.990
umper+5.749
 Tooth+5.260
bur+5.229
my+5.189
Neg logit contributions
apy-11.836
 vows-10.648
ades-10.509
ves-10.469
 are-10.429
Token possession
Token ID7797
Feature activation+4.91e-01

Pos logit contributions
umper+4.439
bur+4.272
ior+4.175
othy+4.058
 Tooth+4.011
Neg logit contributions
apy-4.862
 are-4.352
 were-4.271
 floats-4.181
 they-4.167
Token.
Token ID13
Feature activation+4.16e-01

Pos logit contributions
umper+1.056
bur+1.033
othy+1.029
ior+1.017
 Tooth+1.008
Neg logit contributions
apy-0.838
 are-0.721
 floats-0.711
 were-0.701
ades-0.672
Token He
Token ID679
Feature activation+6.88e-01

Pos logit contributions
umper+0.930
bur+0.913
othy+0.910
asured+0.898
ior+0.895
Neg logit contributions
apy-0.676
 are-0.574
 floats-0.568
 were-0.554
ades-0.534
Token called
Token ID1444
Feature activation+5.85e-02

Pos logit contributions
umper+1.442
bur+1.407
othy+1.397
ior+1.394
 Tooth+1.379
Neg logit contributions
apy-1.212
 are-1.062
 floats-1.049
 were-1.044
ades-0.979
Token would
Token ID561
Feature activation-7.96e-01

Pos logit contributions
apy+0.631
 floats+0.536
 are+0.525
ades+0.511
 were+0.500
Neg logit contributions
umper-0.664
bur-0.662
othy-0.661
asured-0.648
 Tooth-0.644
Token help
Token ID1037
Feature activation-1.81e-01

Pos logit contributions
apy+1.498
 floats+1.292
ades+1.221
 are+1.188
 were+1.161
Neg logit contributions
umper-1.585
bur-1.584
othy-1.573
 Tooth-1.560
asured-1.547
Token her
Token ID607
Feature activation+3.66e-01

Pos logit contributions
apy+0.358
 floats+0.313
 are+0.306
ades+0.298
 were+0.297
Neg logit contributions
umper-0.341
othy-0.334
bur-0.333
 Tooth-0.330
ior-0.326
Token find
Token ID1064
Feature activation+2.46e-01

Pos logit contributions
umper+0.740
bur+0.723
othy+0.722
ior+0.713
asured+0.706
Neg logit contributions
apy-0.679
 are-0.591
 floats-0.584
 were-0.573
ades-0.556
Token the
Token ID262
Feature activation+9.66e-01

Pos logit contributions
umper+0.517
bur+0.506
othy+0.506
ior+0.498
asured+0.496
Neg logit contributions
apy-0.431
 are-0.370
 floats-0.368
 were-0.358
ades-0.348
Token tre
Token ID2054
Feature activation+4.45e+00

Pos logit contributions
umper+1.932
othy+1.879
ior+1.866
bur+1.860
asured+1.837
Neg logit contributions
apy-1.759
 are-1.638
 floats-1.590
 were-1.579
 they-1.480
Tokenas
Token ID292
Feature activation-8.60e-01

Pos logit contributions
umper+6.876
ior+6.653
 Tooth+6.407
bur+6.113
gie+6.017
Neg logit contributions
 flags-9.944
ades-9.872
 floats-9.839
apy-9.838
 are-9.797
Tokenue
Token ID518
Feature activation-7.11e-02

Pos logit contributions
apy+1.778
 floats+1.538
ades+1.463
 are+1.453
 were+1.413
Neg logit contributions
umper-1.566
bur-1.541
othy-1.505
asured-1.490
 Tooth-1.464
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
apy+0.144
 floats+0.124
 are+0.123
 were+0.119
ades+0.119
Neg logit contributions
umper-0.131
bur-0.129
othy-0.128
asured-0.126
ior-0.125
Token She
Token ID1375
Feature activation+5.38e-01

Pos logit contributions
apy+0.200
 floats+0.173
 are+0.172
ades+0.166
 were+0.166
Neg logit contributions
umper-0.205
othy-0.201
bur-0.200
ior-0.198
 Tooth-0.197
Token wore
Token ID12408
Feature activation-4.53e-01

Pos logit contributions
umper+1.035
bur+1.008
othy+1.001
ior+0.998
 Tooth+0.989
Neg logit contributions
apy-1.045
 are-0.916
 floats-0.909
 were-0.901
ades-0.863
Token one
Token ID530
Feature activation+8.17e-01

Pos logit contributions
umper+1.845
ior+1.771
bur+1.759
othy+1.759
 Tooth+1.743
Neg logit contributions
apy-1.612
 are-1.410
 were-1.386
 floats-1.376
 they-1.307
Token of
Token ID286
Feature activation+8.72e-02

Pos logit contributions
umper+1.980
bur+1.919
othy+1.907
ior+1.898
 Tooth+1.880
Neg logit contributions
apy-1.216
 are-1.019
 floats-1.006
 were-0.985
 they-0.935
Token Lucy
Token ID22162
Feature activation+4.50e-01

Pos logit contributions
umper+0.193
othy+0.189
bur+0.189
ior+0.186
asured+0.186
Neg logit contributions
apy-0.143
 floats-0.121
 are-0.121
 were-0.116
ades-0.114
Token's
Token ID338
Feature activation+7.65e-01

Pos logit contributions
umper+0.846
bur+0.823
othy+0.823
ior+0.809
asured+0.806
Neg logit contributions
apy-0.887
 are-0.780
 floats-0.773
 were-0.761
ades-0.734
Token most
Token ID749
Feature activation+9.51e-01

Pos logit contributions
umper+1.448
bur+1.401
othy+1.397
ior+1.373
asured+1.372
Neg logit contributions
apy-1.510
 are-1.344
 floats-1.317
 were-1.312
ades-1.250
Token tre
Token ID2054
Feature activation+4.43e+00

Pos logit contributions
umper+2.028
bur+1.987
othy+1.965
ior+1.950
 Tooth+1.908
Neg logit contributions
apy-1.694
 are-1.468
 floats-1.457
 were-1.443
ades-1.382
Tokenasured
Token ID34006
Feature activation+1.73e+00

Pos logit contributions
umper+5.699
 Tooth+5.620
ior+5.578
othy+4.840
my+4.827
Neg logit contributions
apy-11.582
 vows-10.879
 floats-10.761
 flags-10.649
ades-10.646
Token memories
Token ID9846
Feature activation-3.68e-01

Pos logit contributions
umper+3.070
bur+2.916
ior+2.911
 Tooth+2.895
othy+2.875
Neg logit contributions
apy-3.714
 are-3.367
 were-3.321
 floats-3.300
 they-3.230
Token!
Token ID0
Feature activation-8.35e-02

Pos logit contributions
apy+0.696
 floats+0.600
 are+0.591
ades+0.571
 were+0.569
Neg logit contributions
umper-0.714
bur-0.704
othy-0.699
 Tooth-0.688
asured-0.688
Token<|endoftext|>
Token ID50256
Feature activation-2.45e-01

Pos logit contributions
apy+0.131
 floats+0.111
 are+0.111
 were+0.106
ades+0.106
Neg logit contributions
umper-0.189
othy-0.186
bur-0.185
ior-0.183
 Tooth-0.182
TokenOnce
Token ID7454
Feature activation-7.11e-01

Pos logit contributions
apy+0.434
 floats+0.370
 are+0.363
ades+0.350
 were+0.347
Neg logit contributions
umper-0.515
bur-0.504
othy-0.502
ior-0.492
 Tooth-0.491
Token out
Token ID503
Feature activation-2.08e-01

Pos logit contributions
apy+0.571
 floats+0.498
 are+0.497
 were+0.483
ades+0.480
Neg logit contributions
umper-0.483
bur-0.476
othy-0.468
asured-0.463
ior-0.457
Token."
Token ID526
Feature activation+4.61e-01

Pos logit contributions
apy+0.381
 floats+0.327
 are+0.324
 were+0.314
ades+0.311
Neg logit contributions
umper-0.418
bur-0.410
othy-0.409
asured-0.404
ior-0.400
Token
Token ID198
Feature activation+6.71e-02

Pos logit contributions
umper+1.048
othy+1.025
bur+1.025
asured+1.012
ior+1.005
Neg logit contributions
apy-0.738
 are-0.620
 floats-0.615
 were-0.602
ades-0.575
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
umper+0.153
bur+0.150
othy+0.150
ior+0.148
 Tooth+0.147
Neg logit contributions
apy-0.106
 are-0.090
 floats-0.090
 were-0.086
ades-0.083
TokenSo
Token ID2396
Feature activation+4.29e-01

Pos logit contributions
apy+0.309
 are+0.274
 floats+0.273
 were+0.263
ades+0.257
Neg logit contributions
umper-0.295
bur-0.289
othy-0.289
ior-0.283
asured-0.281
Token Mar
Token ID1526
Feature activation+4.38e+00

Pos logit contributions
umper+0.971
othy+0.948
bur+0.946
asured+0.934
ior+0.931
Neg logit contributions
apy-0.691
 are-0.585
 floats-0.580
 were-0.567
ades-0.538
Tokenley
Token ID1636
Feature activation+1.26e-01

Pos logit contributions
othy+5.308
 Tooth+5.156
asured+5.081
umper+4.731
bur+4.631
Neg logit contributions
apy-11.505
 are-11.133
 were-11.101
 vows-10.723
 they-10.473
Token and
Token ID290
Feature activation+1.83e+00

Pos logit contributions
umper+0.216
othy+0.211
bur+0.211
ior+0.207
asured+0.206
Neg logit contributions
apy-0.269
 are-0.237
 floats-0.236
 were-0.230
ades-0.226
Token T
Token ID309
Feature activation+1.89e+00

Pos logit contributions
umper+3.628
asured+3.560
bur+3.557
ior+3.481
 Tooth+3.454
Neg logit contributions
apy-3.485
 are-3.137
 were-3.115
 they-2.990
 floats-2.888
Tokenilly
Token ID6548
Feature activation-1.14e+00

Pos logit contributions
asured+2.544
othy+2.532
bur+2.523
umper+2.480
ior+2.479
Neg logit contributions
apy-4.529
 are-4.292
 were-4.201
ades-4.146
 floats-4.006
Token used
Token ID973
Feature activation-5.96e-01

Pos logit contributions
apy+2.332
 floats+2.007
ades+1.932
 are+1.913
 were+1.805
Neg logit contributions
othy-2.128
umper-2.092
bur-2.082
ior-2.031
asured-2.009
Token advice
Token ID5608
Feature activation+1.20e+00

Pos logit contributions
umper+2.767
ior+2.708
bur+2.706
othy+2.685
asured+2.657
Neg logit contributions
apy-1.451
 are-1.184
 were-1.129
 floats-1.120
 they-1.096
Token.
Token ID13
Feature activation+5.36e-01

Pos logit contributions
umper+2.733
bur+2.678
othy+2.675
ior+2.635
 Tooth+2.611
Neg logit contributions
apy-1.919
 are-1.685
 were-1.634
 floats-1.580
 they-1.578
Token The
Token ID383
Feature activation+9.27e-01

Pos logit contributions
umper+0.971
othy+0.947
bur+0.947
asured+0.925
ior+0.922
Neg logit contributions
apy-1.108
 are-0.968
 floats-0.966
 were-0.951
ades-0.919
Token owl
Token ID39610
Feature activation+3.82e-01

Pos logit contributions
umper+1.298
ior+1.253
bur+1.252
asured+1.251
othy+1.239
Neg logit contributions
apy-2.258
 are-2.098
 were-2.053
 floats-2.047
ades-1.976
Token told
Token ID1297
Feature activation+1.72e+00

Pos logit contributions
umper+0.735
bur+0.716
othy+0.714
ior+0.704
 Tooth+0.700
Neg logit contributions
apy-0.739
 are-0.643
 floats-0.640
 were-0.629
ades-0.609
Token B
Token ID347
Feature activation+4.37e+00

Pos logit contributions
asured+3.217
othy+3.203
umper+3.200
ior+3.134
bur+3.133
Neg logit contributions
apy-3.354
 were-3.163
 are-3.135
 they-2.976
 floats-2.959
Tokenongo
Token ID25162
Feature activation+3.21e-01

Pos logit contributions
othy+5.819
asured+5.520
 Tooth+5.391
bur+5.261
nie+5.143
Neg logit contributions
apy-10.994
ades-10.658
 were-10.617
 are-10.599
cakes-10.520
Token to
Token ID284
Feature activation+9.94e-01

Pos logit contributions
umper+0.611
bur+0.598
othy+0.596
ior+0.587
asured+0.584
Neg logit contributions
apy-0.629
 are-0.551
 floats-0.545
 were-0.535
ades-0.520
Token use
Token ID779
Feature activation+1.40e+00

Pos logit contributions
umper+2.280
ior+2.220
bur+2.210
asured+2.204
othy+2.188
Neg logit contributions
apy-1.536
 are-1.389
 were-1.312
 floats-1.279
 they-1.214
Token a
Token ID257
Feature activation+6.36e-01

Pos logit contributions
umper+2.850
ior+2.776
othy+2.743
bur+2.739
 Tooth+2.707
Neg logit contributions
apy-2.695
 are-2.377
 were-2.310
 floats-2.221
 they-2.219
Token clean
Token ID3424
Feature activation+1.85e+00

Pos logit contributions
umper+1.383
bur+1.343
othy+1.340
ior+1.333
asured+1.326
Neg logit contributions
apy-1.097
 are-0.938
 floats-0.914
 were-0.901
ades-0.873
Token.
Token ID13
Feature activation+9.61e-01

Pos logit contributions
apy+1.368
 floats+1.210
 are+1.176
ades+1.125
 were+1.120
Neg logit contributions
umper-1.318
othy-1.308
bur-1.301
asured-1.297
 Tooth-1.266
Token
Token ID198
Feature activation+5.67e-01

Pos logit contributions
umper+2.364
othy+2.315
bur+2.308
 Tooth+2.297
asured+2.292
Neg logit contributions
apy-1.387
 are-1.158
 floats-1.143
 were-1.140
 they-1.068
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
umper+1.349
othy+1.322
bur+1.319
ior+1.307
 Tooth+1.305
Neg logit contributions
apy-0.846
 are-0.705
 floats-0.693
 were-0.684
ades-0.659
TokenSuddenly
Token ID38582
Feature activation+2.49e-01

Pos logit contributions
umper+2.161
 Tooth+2.111
othy+2.108
bur+2.088
ior+2.074
Neg logit contributions
apy-2.188
 floats-1.889
 are-1.886
 were-1.875
ades-1.826
Token,
Token ID11
Feature activation+8.79e-01

Pos logit contributions
umper+0.558
bur+0.546
othy+0.546
ior+0.538
asured+0.537
Neg logit contributions
apy-0.404
 are-0.341
 floats-0.339
 were-0.329
ades-0.319
Token Jon
Token ID5966
Feature activation+4.36e+00

Pos logit contributions
umper+1.716
othy+1.666
asured+1.652
bur+1.644
ior+1.643
Neg logit contributions
apy-1.739
 are-1.503
 were-1.495
 floats-1.488
 they-1.453
Tokenny
Token ID3281
Feature activation-2.44e-01

Pos logit contributions
 Tooth+8.430
asured+8.286
umper+7.596
othy+7.116
ppa+7.103
Neg logit contributions
 were-9.533
 are-9.173
 they-8.917
 floats-8.438
apy-8.351
Token remembered
Token ID12086
Feature activation+7.63e-01

Pos logit contributions
apy+0.476
 are+0.412
 floats+0.409
ades+0.393
 were+0.393
Neg logit contributions
umper-0.462
othy-0.456
bur-0.451
ior-0.443
 Tooth-0.441
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
umper+1.515
bur+1.481
othy+1.473
ior+1.473
asured+1.443
Neg logit contributions
apy-1.435
 are-1.261
 were-1.239
 floats-1.237
 they-1.191
Token He
Token ID679
Feature activation+8.87e-01

Pos logit contributions
bur+2.837
umper+2.822
othy+2.729
asured+2.707
 Tooth+2.700
Neg logit contributions
apy-2.963
 floats-2.544
 are-2.541
 were-2.532
 they-2.451
Token remembered
Token ID12086
Feature activation+1.14e+00

Pos logit contributions
umper+1.797
bur+1.761
ior+1.725
othy+1.723
 Tooth+1.719
Neg logit contributions
apy-1.625
 are-1.422
 were-1.421
 floats-1.420
 they-1.330
Token
Token ID220
Feature activation-1.77e-01

Pos logit contributions
umper+4.255
 Tooth+4.176
othy+4.169
asured+4.154
bur+4.096
Neg logit contributions
apy-2.340
 were-1.954
 are-1.943
 floats-1.890
 they-1.848
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
apy+0.286
 floats+0.244
 are+0.239
 were+0.229
ades+0.224
Neg logit contributions
umper-0.393
bur-0.386
othy-0.383
ior-0.378
asured-0.376
Token
Token ID198
Feature activation+2.00e+00

Pos logit contributions
umper+2.693
othy+2.643
 Tooth+2.636
ior+2.615
asured+2.609
Neg logit contributions
apy-1.504
 are-1.257
 were-1.243
 floats-1.204
ades-1.177
TokenOne
Token ID3198
Feature activation+1.36e+00

Pos logit contributions
 Tooth+4.338
umper+4.325
othy+4.191
asured+4.104
ior+4.100
Neg logit contributions
apy-3.555
 were-3.121
 floats-3.069
 are-3.065
ades-3.001
Token day
Token ID1110
Feature activation+7.11e-01

Pos logit contributions
umper+2.673
bur+2.584
othy+2.511
ior+2.508
asured+2.505
Neg logit contributions
apy-2.731
 are-2.378
 floats-2.354
 were-2.353
 they-2.324
Token Jon
Token ID5966
Feature activation+4.34e+00

Pos logit contributions
umper+1.656
othy+1.619
bur+1.608
asured+1.603
 Tooth+1.600
Neg logit contributions
apy-1.126
 are-0.939
 floats-0.928
 were-0.922
 they-0.875
Tokenny
Token ID3281
Feature activation+1.10e-01

Pos logit contributions
 Tooth+8.060
asured+7.634
umper+7.238
ppa+6.934
othy+6.926
Neg logit contributions
 were-9.975
 are-9.535
 they-9.207
 floats-8.826
apy-8.380
Token and
Token ID290
Feature activation+2.39e+00

Pos logit contributions
umper+0.213
othy+0.209
bur+0.208
ior+0.205
asured+0.204
Neg logit contributions
apy-0.211
 are-0.182
 floats-0.182
 were-0.176
ades-0.173
Token his
Token ID465
Feature activation-1.31e-01

Pos logit contributions
bur+4.170
umper+4.169
ior+4.046
asured+4.034
ppa+3.986
Neg logit contributions
apy-5.172
 are-4.709
 they-4.700
 were-4.678
 floats-4.472
Token parents
Token ID3397
Feature activation-5.40e-01

Pos logit contributions
apy+0.248
 are+0.214
 floats+0.213
 were+0.206
ades+0.202
Neg logit contributions
umper-0.260
othy-0.253
bur-0.252
 Tooth-0.248
asured-0.247
Token went
Token ID1816
Feature activation+1.30e-01

Pos logit contributions
apy+1.039
 floats+0.888
ades+0.863
 are+0.862
 were+0.805
Neg logit contributions
umper-1.070
bur-1.045
othy-1.038
ior-1.005
asured-1.004
Token From
Token ID3574
Feature activation+2.84e+00

Pos logit contributions
umper+2.964
bur+2.912
othy+2.832
asured+2.808
 Tooth+2.786
Neg logit contributions
apy-2.809
 were-2.375
 are-2.365
 floats-2.332
 they-2.297
Token that
Token ID326
Feature activation+1.47e+00

Pos logit contributions
umper+6.663
ior+6.298
 Tooth+6.166
asured+6.151
othy+6.132
Neg logit contributions
apy-4.879
 they-4.557
 floats-4.278
 were-4.253
 are-4.098
Token day
Token ID1110
Feature activation+8.30e-01

Pos logit contributions
umper+2.893
asured+2.776
othy+2.766
bur+2.760
ior+2.751
Neg logit contributions
apy-2.868
 are-2.623
 were-2.571
 they-2.484
 floats-2.451
Token on
Token ID319
Feature activation+9.48e-01

Pos logit contributions
umper+1.699
bur+1.669
othy+1.668
asured+1.663
ior+1.659
Neg logit contributions
apy-1.534
 are-1.323
 were-1.283
 floats-1.267
 they-1.225
Token,
Token ID11
Feature activation-6.89e-02

Pos logit contributions
umper+2.301
bur+2.236
asured+2.232
othy+2.231
ior+2.211
Neg logit contributions
apy-1.446
 are-1.207
 were-1.175
 floats-1.164
 they-1.137
Token Tim
Token ID5045
Feature activation+4.33e+00

Pos logit contributions
apy+0.128
 floats+0.111
 are+0.110
ades+0.107
 were+0.106
Neg logit contributions
umper-0.137
bur-0.135
othy-0.134
ior-0.132
asured-0.132
Tokenmy
Token ID1820
Feature activation+2.24e+00

Pos logit contributions
 Tooth+4.843
asured+4.413
ior+4.252
ergus+4.172
umper+3.867
Neg logit contributions
 were-12.960
 are-12.769
 they-10.958
 floats-10.938
apy-10.720
Token always
Token ID1464
Feature activation+2.65e+00

Pos logit contributions
umper+4.371
 Tooth+4.271
ior+4.267
bur+4.244
othy+4.182
Neg logit contributions
apy-4.274
 were-4.101
 are-4.032
 floats-3.815
 they-3.745
Token made
Token ID925
Feature activation+1.41e+00

Pos logit contributions
umper+5.574
bur+5.369
 Tooth+5.343
ior+5.230
othy+5.074
Neg logit contributions
apy-5.008
 are-4.742
 were-4.700
 they-4.406
 floats-4.289
Token sure
Token ID1654
Feature activation+1.12e+00

Pos logit contributions
umper+2.376
bur+2.314
othy+2.275
ior+2.264
asured+2.234
Neg logit contributions
apy-3.147
 are-2.749
 floats-2.724
ades-2.686
 were-2.674
Token to
Token ID284
Feature activation+2.43e+00

Pos logit contributions
umper+2.164
othy+2.103
ior+2.098
bur+2.086
 Tooth+2.058
Neg logit contributions
apy-2.213
 are-1.950
 were-1.921
 floats-1.914
 they-1.892
Token enjoyed
Token ID8359
Feature activation-3.90e-01

Pos logit contributions
umper+1.925
bur+1.859
othy+1.858
 Tooth+1.838
ior+1.829
Neg logit contributions
apy-1.325
 are-1.115
 floats-1.115
 were-1.114
 they-1.056
Token the
Token ID262
Feature activation-8.52e-01

Pos logit contributions
apy+0.664
 are+0.566
 floats+0.565
 were+0.538
ades+0.536
Neg logit contributions
umper-0.827
bur-0.823
othy-0.821
asured-0.805
 Tooth-0.802
Token sunny
Token ID27737
Feature activation-1.12e-01

Pos logit contributions
apy+1.560
 floats+1.261
 are+1.243
 were+1.215
ades+1.213
Neg logit contributions
bur-1.751
umper-1.734
othy-1.731
 Tooth-1.719
ior-1.716
Token day
Token ID1110
Feature activation-5.49e-01

Pos logit contributions
apy+0.249
 are+0.219
 floats+0.218
 were+0.213
ades+0.211
Neg logit contributions
umper-0.180
bur-0.178
othy-0.177
ior-0.172
asured-0.172
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
apy+1.046
 floats+0.900
 are+0.899
ades+0.864
 were+0.858
Neg logit contributions
umper-1.062
bur-1.038
othy-1.034
asured-1.027
ior-1.016
Token Jon
Token ID5966
Feature activation+4.33e+00

Pos logit contributions
umper+3.002
othy+2.907
 Tooth+2.902
bur+2.897
asured+2.880
Neg logit contributions
apy-1.873
 are-1.565
 were-1.555
 floats-1.549
 they-1.463
Tokenny
Token ID3281
Feature activation-3.54e-01

Pos logit contributions
 Tooth+8.536
asured+7.648
umper+7.516
ppa+7.113
othy+7.058
Neg logit contributions
 were-9.799
 are-9.417
 floats-8.711
 they-8.700
apy-8.434
Token looked
Token ID3114
Feature activation+4.17e-01

Pos logit contributions
apy+0.692
 floats+0.593
 are+0.592
ades+0.574
 were+0.563
Neg logit contributions
othy-0.663
umper-0.662
bur-0.652
asured-0.645
ior-0.644
Token around
Token ID1088
Feature activation+3.60e-01

Pos logit contributions
umper+0.749
bur+0.732
othy+0.731
ior+0.714
 Tooth+0.714
Neg logit contributions
apy-0.861
 are-0.759
 floats-0.756
 were-0.740
ades-0.720
Token and
Token ID290
Feature activation+4.60e-01

Pos logit contributions
umper+0.642
bur+0.625
othy+0.623
ior+0.612
asured+0.609
Neg logit contributions
apy-0.750
 are-0.662
 floats-0.657
 were-0.646
ades-0.628
Token saw
Token ID2497
Feature activation-1.28e-01

Pos logit contributions
umper+0.918
bur+0.894
othy+0.889
ior+0.878
asured+0.874
Neg logit contributions
apy-0.866
 are-0.750
 floats-0.747
 were-0.736
ades-0.702
Token stretch
Token ID7539
Feature activation+1.76e-01

Pos logit contributions
umper+0.003
othy+0.003
bur+0.003
 Tooth+0.003
ior+0.003
Neg logit contributions
apy-0.003
 floats-0.002
 are-0.002
 were-0.002
ades-0.002
Token to
Token ID284
Feature activation+1.20e-01

Pos logit contributions
umper+0.379
bur+0.372
othy+0.371
ior+0.365
asured+0.364
Neg logit contributions
apy-0.300
 are-0.256
 floats-0.255
 were-0.247
ades-0.241
Token reach
Token ID3151
Feature activation+4.96e-01

Pos logit contributions
umper+0.254
othy+0.249
bur+0.249
ior+0.245
asured+0.244
Neg logit contributions
apy-0.209
 floats-0.178
 are-0.178
 were-0.172
ades-0.169
Token it
Token ID340
Feature activation-9.84e-01

Pos logit contributions
umper+1.084
othy+1.064
bur+1.058
ior+1.045
asured+1.041
Neg logit contributions
apy-0.834
 are-0.717
 floats-0.708
 were-0.690
ades-0.662
Token.
Token ID13
Feature activation+1.44e+00

Pos logit contributions
apy+2.014
 floats+1.760
 are+1.692
ades+1.673
 were+1.643
Neg logit contributions
othy-1.760
bur-1.756
umper-1.755
asured-1.737
 Tooth-1.710
Token Tig
Token ID14189
Feature activation+4.32e+00

Pos logit contributions
umper+3.160
asured+3.096
othy+3.059
bur+3.048
 Tooth+2.970
Neg logit contributions
apy-2.587
 are-2.170
 floats-2.159
 were-2.149
 they-2.131
Tokengy
Token ID1360
Feature activation-6.80e-02

Pos logit contributions
umper+6.844
bur+6.530
asured+6.225
ppa+6.128
 Tooth+6.099
Neg logit contributions
apy-10.023
 are-9.664
 were-9.563
ves-9.092
 floats-9.009
Token stretched
Token ID19110
Feature activation-1.19e-01

Pos logit contributions
apy+0.133
 floats+0.115
 are+0.115
 were+0.111
ades+0.111
Neg logit contributions
umper-0.128
othy-0.126
bur-0.125
ior-0.123
asured-0.123
Token and
Token ID290
Feature activation+1.77e-01

Pos logit contributions
apy+0.242
 floats+0.215
 are+0.212
 were+0.205
ades+0.205
Neg logit contributions
umper-0.213
othy-0.209
bur-0.208
asured-0.204
ior-0.203
Token stretched
Token ID19110
Feature activation-1.42e-01

Pos logit contributions
umper+0.340
othy+0.332
bur+0.332
ior+0.326
asured+0.325
Neg logit contributions
apy-0.346
 are-0.301
 floats-0.301
 were-0.292
ades-0.287
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
apy+0.276
 floats+0.243
 are+0.238
 were+0.231
ades+0.230
Neg logit contributions
umper-0.269
othy-0.264
bur-0.264
asured-0.258
ior-0.258
Token loved
Token ID6151
Feature activation+1.32e+00

Pos logit contributions
umper+2.588
bur+2.501
othy+2.459
ior+2.445
 Tooth+2.414
Neg logit contributions
apy-2.007
 are-1.734
 were-1.709
 floats-1.691
 they-1.649
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
umper+2.789
othy+2.694
bur+2.667
ior+2.656
 Tooth+2.627
Neg logit contributions
apy-2.361
 are-2.095
 were-2.089
 floats-2.048
 they-2.012
Token " "
Token ID220
Feature activation+1.77e-01

Pos logit contributions
umper+3.695
othy+3.621
 Tooth+3.593
asured+3.579
bur+3.570
Neg logit contributions
apy-1.947
 are-1.648
 were-1.637
 floats-1.582
 they-1.527
Token "\n"
Token ID198
Feature activation+7.11e-01

Pos logit contributions
umper+0.405
bur+0.397
othy+0.396
ior+0.391
asured+0.389
Neg logit contributions
apy-0.275
 are-0.231
 floats-0.230
 were-0.222
ades-0.215
Token "\n"
Token ID198
Feature activation+1.23e+00

Pos logit contributions
umper+1.705
othy+1.674
bur+1.661
asured+1.651
 Tooth+1.649
Neg logit contributions
apy-1.045
 are-0.873
 were-0.850
 floats-0.850
ades-0.816
TokenJon
Token ID18219
Feature activation+4.31e+00

Pos logit contributions
umper+2.485
othy+2.446
 Tooth+2.431
asured+2.392
bur+2.389
Neg logit contributions
apy-2.311
 are-2.012
 were-2.004
 floats-1.983
ades-1.939
Tokenny
Token ID3281
Feature activation+2.66e-01

Pos logit contributions
 Tooth+8.070
asured+7.611
umper+7.469
ppa+6.968
othy+6.811
Neg logit contributions
 were-9.526
 are-9.381
 they-8.753
 floats-8.713
apy-8.629
Token smiled
Token ID13541
Feature activation+8.81e-01

Pos logit contributions
umper+0.514
bur+0.502
othy+0.501
ior+0.493
asured+0.491
Neg logit contributions
apy-0.507
 are-0.441
 floats-0.439
 were-0.429
ades-0.417
Token at
Token ID379
Feature activation+2.90e+00

Pos logit contributions
umper+1.557
bur+1.532
othy+1.504
ior+1.485
 Tooth+1.467
Neg logit contributions
apy-1.868
 are-1.649
 were-1.630
 floats-1.619
 they-1.571
Token his
Token ID465
Feature activation-3.37e-01

Pos logit contributions
umper+6.748
 Tooth+6.652
ior+6.629
asured+6.464
bur+6.462
Neg logit contributions
apy-4.637
 were-4.289
 are-4.249
 they-4.154
 floats-3.917
Token parents
Token ID3397
Feature activation+1.09e+00

Pos logit contributions
apy+0.658
 are+0.569
 floats+0.567
 were+0.554
ades+0.541
Neg logit contributions
umper-0.647
othy-0.635
bur-0.628
 Tooth-0.628
asured-0.627
Token From
Token ID3574
Feature activation+3.00e+00

Pos logit contributions
umper+1.711
bur+1.673
othy+1.655
asured+1.645
 Tooth+1.628
Neg logit contributions
apy-1.366
 are-1.167
 were-1.150
 floats-1.141
 they-1.087
Token that
Token ID326
Feature activation+1.12e+00

Pos logit contributions
umper+7.208
asured+6.999
ior+6.979
othy+6.696
bur+6.635
Neg logit contributions
apy-4.893
 they-4.555
 floats-4.222
 were-4.134
 are-4.031
Token day
Token ID1110
Feature activation+3.52e-01

Pos logit contributions
umper+2.101
asured+2.040
othy+2.031
bur+2.025
ior+2.022
Neg logit contributions
apy-2.283
 are-2.075
 were-2.031
 floats-1.962
 they-1.954
Token on
Token ID319
Feature activation+8.08e-01

Pos logit contributions
umper+0.690
bur+0.675
othy+0.674
ior+0.666
asured+0.663
Neg logit contributions
apy-0.668
 are-0.580
 floats-0.573
 were-0.563
ades-0.545
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
umper+1.943
bur+1.887
asured+1.886
othy+1.881
ior+1.873
Neg logit contributions
apy-1.239
 are-1.038
 were-1.010
 floats-1.000
 they-0.965
Token Tim
Token ID5045
Feature activation+4.29e+00

Pos logit contributions
apy+0.283
 floats+0.248
 are+0.245
ades+0.239
 were+0.235
Neg logit contributions
umper-0.292
bur-0.287
othy-0.287
ior-0.280
asured-0.279
Tokenmy
Token ID1820
Feature activation+2.49e+00

Pos logit contributions
 Tooth+4.654
ior+4.490
asured+4.436
umper+4.033
ergus+3.960
Neg logit contributions
 were-12.945
 are-12.625
apy-10.910
 floats-10.859
 they-10.699
Token always
Token ID1464
Feature activation+2.96e+00

Pos logit contributions
umper+5.520
ior+5.442
bur+5.364
 Tooth+5.313
othy+5.256
Neg logit contributions
apy-4.197
 were-4.124
 are-3.952
 floats-3.735
 they-3.637
Token wore
Token ID12408
Feature activation+1.85e+00

Pos logit contributions
umper+6.534
 Tooth+6.402
ior+6.392
bur+6.217
asured+6.021
Neg logit contributions
apy-5.304
 were-5.123
 are-4.985
 they-4.643
 floats-4.566
Token his
Token ID465
Feature activation+1.72e+00

Pos logit contributions
umper+3.184
ior+3.031
bur+2.990
asured+2.990
 Tooth+2.939
Neg logit contributions
apy-4.215
 are-3.703
 floats-3.640
 were-3.611
 they-3.569
Token yellow
Token ID7872
Feature activation+1.52e+00

Pos logit contributions
umper+2.705
ior+2.482
asured+2.456
othy+2.442
bur+2.442
Neg logit contributions
apy-4.088
 are-3.793
 floats-3.698
 were-3.658
ades-3.547
Token stuck
Token ID7819
Feature activation+2.12e+00

Pos logit contributions
umper+0.704
bur+0.688
othy+0.687
ior+0.678
asured+0.675
Neg logit contributions
apy-0.676
 are-0.592
 floats-0.587
 were-0.575
ades-0.556
Token in
Token ID287
Feature activation+1.45e+00

Pos logit contributions
ior+4.846
umper+4.785
 Tooth+4.742
bur+4.564
othy+4.533
Neg logit contributions
apy-3.576
 were-3.176
 are-3.159
 floats-2.871
 they-2.845
Token a
Token ID257
Feature activation+7.31e-01

Pos logit contributions
umper+3.439
ior+3.336
bur+3.314
asured+3.277
othy+3.242
Neg logit contributions
apy-2.257
 are-2.001
 were-1.960
 floats-1.860
 they-1.818
Token cage
Token ID17186
Feature activation+7.19e-01

Pos logit contributions
umper+1.603
bur+1.567
asured+1.562
othy+1.556
ior+1.550
Neg logit contributions
apy-1.230
 are-1.062
 floats-1.043
 were-1.023
ades-0.980
Token.
Token ID13
Feature activation+1.80e+00

Pos logit contributions
umper+1.589
bur+1.547
ior+1.536
othy+1.529
asured+1.518
Neg logit contributions
apy-1.206
 are-1.047
 were-1.019
 floats-1.018
ades-0.971
Token Jon
Token ID5966
Feature activation+4.25e+00

Pos logit contributions
umper+4.258
bur+4.140
asured+4.059
othy+4.015
ppa+4.001
Neg logit contributions
apy-2.979
 were-2.475
 are-2.429
 floats-2.426
 they-2.360
Tokenny
Token ID3281
Feature activation-1.94e-01

Pos logit contributions
 Tooth+8.152
asured+7.830
umper+7.612
ppa+7.013
father+6.734
Neg logit contributions
 were-9.733
 are-9.325
 they-8.543
apy-8.260
 floats-8.232
Token asked
Token ID1965
Feature activation+1.50e+00

Pos logit contributions
apy+0.374
 floats+0.325
 are+0.324
ades+0.311
 were+0.308
Neg logit contributions
umper-0.370
othy-0.368
bur-0.362
asured-0.357
ior-0.357
Token the
Token ID262
Feature activation-2.53e-01

Pos logit contributions
umper+3.271
bur+3.148
 Tooth+3.143
asured+3.138
othy+3.132
Neg logit contributions
apy-2.606
 are-2.396
 were-2.347
 they-2.251
 floats-2.189
Token deer
Token ID20096
Feature activation+3.68e-01

Pos logit contributions
apy+0.576
 are+0.502
 floats+0.500
 were+0.487
ades+0.486
Neg logit contributions
umper-0.409
bur-0.403
othy-0.402
 Tooth-0.390
ior-0.388
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
umper+0.740
bur+0.724
othy+0.724
ior+0.713
 Tooth+0.710
Neg logit contributions
apy-0.677
 are-0.591
 floats-0.584
 were-0.573
ades-0.554
Token bedroom
Token ID14043
Feature activation+5.03e-01

Pos logit contributions
umper+1.163
bur+1.127
othy+1.124
ior+1.106
asured+1.104
Neg logit contributions
apy-1.114
 are-0.973
 floats-0.964
 were-0.945
ades-0.911
Token.
Token ID13
Feature activation+2.30e+00

Pos logit contributions
umper+0.999
bur+0.974
othy+0.971
ior+0.956
asured+0.950
Neg logit contributions
apy-0.937
 are-0.820
 floats-0.815
 were-0.797
ades-0.775
Token " "
Token ID220
Feature activation+1.76e-01

Pos logit contributions
umper+5.978
 Tooth+5.750
ppa+5.725
othy+5.676
asured+5.609
Neg logit contributions
apy-3.234
 were-2.826
 are-2.790
 floats-2.706
 they-2.650
Token "\n"
Token ID198
Feature activation+1.33e+00

Pos logit contributions
umper+0.402
bur+0.394
othy+0.394
ior+0.388
asured+0.387
Neg logit contributions
apy-0.273
 are-0.229
 floats-0.229
 were-0.220
ades-0.214
Token "\n"
Token ID198
Feature activation+2.15e+00

Pos logit contributions
umper+3.403
 Tooth+3.334
othy+3.323
ior+3.295
asured+3.290
Neg logit contributions
apy-1.772
 are-1.506
 were-1.502
 floats-1.427
ades-1.406
TokenJon
Token ID18219
Feature activation+4.24e+00

Pos logit contributions
umper+4.750
 Tooth+4.698
othy+4.543
asured+4.475
ior+4.462
Neg logit contributions
apy-3.713
 were-3.338
 floats-3.281
 are-3.260
ades-3.216
Tokenny
Token ID3281
Feature activation+5.98e-01

Pos logit contributions
 Tooth+7.914
umper+7.423
asured+7.127
ppa+7.028
othy+6.452
Neg logit contributions
 were-9.667
 are-9.404
 floats-8.755
 they-8.608
apy-8.512
Token's
Token ID338
Feature activation+3.37e-01

Pos logit contributions
umper+1.193
bur+1.164
othy+1.147
asured+1.136
ior+1.133
Neg logit contributions
apy-1.108
 are-0.976
 floats-0.968
 were-0.959
ades-0.907
Token mom
Token ID1995
Feature activation+1.30e+00

Pos logit contributions
umper+0.669
bur+0.653
othy+0.651
ior+0.641
asured+0.639
Neg logit contributions
apy-0.633
 are-0.549
 floats-0.545
 were-0.533
ades-0.519
Token was
Token ID373
Feature activation+6.43e-01

Pos logit contributions
umper+2.248
 Tooth+2.174
asured+2.174
ior+2.158
bur+2.154
Neg logit contributions
apy-2.712
 were-2.562
 are-2.549
 floats-2.445
 they-2.410
Token really
Token ID1107
Feature activation+3.75e-01

Pos logit contributions
umper+1.189
bur+1.153
othy+1.147
ior+1.145
 Tooth+1.123
Neg logit contributions
apy-1.298
 are-1.141
 floats-1.125
 were-1.114
ades-1.080
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
apy+2.982
 are+2.582
 floats+2.582
ades+2.535
 vows+2.457
Neg logit contributions
othy-2.908
umper-2.899
 Tooth-2.831
asured-2.805
bur-2.776
Token When
Token ID1649
Feature activation-8.70e-01

Pos logit contributions
umper+0.532
bur+0.519
othy+0.516
ior+0.505
asured+0.505
Neg logit contributions
apy-0.652
 are-0.575
 floats-0.573
 were-0.561
ades-0.548
Token he
Token ID339
Feature activation-3.55e-01

Pos logit contributions
apy+1.824
 are+1.622
ades+1.621
 floats+1.615
 were+1.535
Neg logit contributions
othy-1.469
bur-1.462
umper-1.459
 Tooth-1.457
ior-1.424
Token reached
Token ID4251
Feature activation+6.11e-01

Pos logit contributions
apy+0.639
 floats+0.540
 are+0.532
ades+0.522
 were+0.506
Neg logit contributions
umper-0.743
othy-0.732
bur-0.731
 Tooth-0.710
asured-0.705
Token the
Token ID262
Feature activation+4.33e-01

Pos logit contributions
umper+1.342
bur+1.303
othy+1.300
ior+1.296
asured+1.288
Neg logit contributions
apy-1.037
 are-0.895
 floats-0.877
 were-0.860
 they-0.823
Token tre
Token ID2054
Feature activation+4.24e+00

Pos logit contributions
umper+0.958
bur+0.934
othy+0.933
asured+0.923
ior+0.923
Neg logit contributions
apy-0.722
 are-0.620
 floats-0.617
 were-0.598
ades-0.576
Tokenet
Token ID316
Feature activation-5.11e-01

Pos logit contributions
umper+7.734
 Tooth+7.292
ior+7.161
bur+6.846
ongo+6.395
Neg logit contributions
apy-8.727
 floats-8.650
ades-8.222
 flags-8.215
ves-8.122
Tokenop
Token ID404
Feature activation-5.19e-01

Pos logit contributions
apy+0.962
 floats+0.825
 are+0.811
ades+0.791
 were+0.782
Neg logit contributions
umper-1.016
othy-1.007
bur-1.006
 Tooth-0.968
ior-0.967
Token,
Token ID11
Feature activation-3.31e-01

Pos logit contributions
apy+0.973
 floats+0.834
 are+0.822
ades+0.800
 were+0.794
Neg logit contributions
umper-1.032
othy-1.019
bur-1.018
 Tooth-0.984
ior-0.983
Token he
Token ID339
Feature activation-6.87e-01

Pos logit contributions
apy+0.667
 floats+0.590
 are+0.585
ades+0.573
 were+0.561
Neg logit contributions
umper-0.591
bur-0.582
othy-0.580
 Tooth-0.574
ior-0.566
Token asked
Token ID1965
Feature activation+3.15e-01

Pos logit contributions
apy+1.388
 floats+1.192
 are+1.189
ades+1.150
 were+1.115
Neg logit contributions
umper-1.266
othy-1.262
bur-1.243
asured-1.219
 Tooth-1.211
Token was
Token ID373
Feature activation-1.67e-01

Pos logit contributions
apy+3.102
ades+2.695
 floats+2.606
cakes+2.453
 vows+2.435
Neg logit contributions
othy-2.734
umper-2.697
bur-2.646
asured-2.594
 Tooth-2.566
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
apy+0.326
 floats+0.279
 are+0.275
 were+0.267
ades+0.266
Neg logit contributions
umper-0.323
bur-0.316
othy-0.316
asured-0.310
 Tooth-0.310
Token Snow
Token ID7967
Feature activation+1.87e+00

Pos logit contributions
umper+2.270
asured+2.203
bur+2.200
othy+2.192
 Tooth+2.128
Neg logit contributions
apy-2.159
 floats-1.834
 are-1.830
 were-1.815
 they-1.786
Tokeny
Token ID88
Feature activation+1.29e-01

Pos logit contributions
 Tooth+3.027
ior+2.991
othy+2.989
asured+2.950
umper+2.940
Neg logit contributions
apy-4.061
 were-3.660
ades-3.603
 are-3.578
 floats-3.526
Token told
Token ID1297
Feature activation+1.86e+00

Pos logit contributions
umper+0.247
othy+0.242
bur+0.241
ior+0.237
asured+0.236
Neg logit contributions
apy-0.248
 are-0.216
 floats-0.215
 were-0.209
ades-0.205
Token Tig
Token ID14189
Feature activation+4.24e+00

Pos logit contributions
asured+3.287
umper+3.258
othy+3.144
bur+3.130
ior+3.101
Neg logit contributions
apy-3.985
 are-3.713
 were-3.637
 they-3.511
 floats-3.464
Tokengy
Token ID1360
Feature activation-1.09e+00

Pos logit contributions
umper+7.008
bur+6.765
asured+6.535
ior+6.251
ppa+6.133
Neg logit contributions
apy-9.160
 are-8.979
 were-8.919
ves-8.703
 vows-8.565
Token that
Token ID326
Feature activation+2.99e-01

Pos logit contributions
apy+2.190
 floats+1.908
ades+1.829
 are+1.805
 were+1.715
Neg logit contributions
othy-2.022
umper-1.980
asured-1.944
bur-1.936
ior-1.910
Token it
Token ID340
Feature activation-1.51e-02

Pos logit contributions
umper+0.642
bur+0.629
othy+0.629
ior+0.619
asured+0.617
Neg logit contributions
apy-0.512
 are-0.438
 floats-0.436
 were-0.424
ades-0.410
Token was
Token ID373
Feature activation+1.57e+00

Pos logit contributions
apy+0.027
 floats+0.023
 are+0.023
 were+0.022
ades+0.022
Neg logit contributions
umper-0.031
othy-0.030
bur-0.030
asured-0.030
 Tooth-0.030
Token up
Token ID510
Feature activation+2.13e+00

Pos logit contributions
umper+3.398
ior+3.324
bur+3.286
othy+3.282
asured+3.166
Neg logit contributions
apy-2.681
 are-2.465
 were-2.418
 they-2.349
 floats-2.332
Token blue
Token ID4171
Feature activation+3.35e-01

Pos logit contributions
umper+1.793
ior+1.724
bur+1.719
othy+1.716
asured+1.710
Neg logit contributions
apy-1.480
 are-1.324
 floats-1.307
 were-1.277
ades-1.211
Token z
Token ID1976
Feature activation+1.56e+00

Pos logit contributions
umper+0.660
bur+0.643
othy+0.643
ior+0.635
asured+0.632
Neg logit contributions
apy-0.639
 are-0.557
 floats-0.554
 were-0.540
ades-0.526
Tokenig
Token ID328
Feature activation+5.33e-02

Pos logit contributions
umper+2.105
ior+2.095
 Tooth+2.035
bur+2.017
asured+1.999
Neg logit contributions
apy-4.030
 are-3.608
 floats-3.607
ades-3.583
 were-3.424
Tokenzag
Token ID50183
Feature activation-1.26e+00

Pos logit contributions
umper+0.091
bur+0.089
othy+0.089
ior+0.086
asured+0.086
Neg logit contributions
apy-0.114
 are-0.101
 floats-0.101
 were-0.098
ades-0.096
Token.
Token ID13
Feature activation+3.60e-01

Pos logit contributions
apy+2.588
 floats+2.224
 are+2.168
ades+2.136
 were+2.095
Neg logit contributions
othy-2.240
bur-2.215
asured-2.166
ior-2.147
umper-2.133
Token Tig
Token ID14189
Feature activation+4.23e+00

Pos logit contributions
umper+0.956
bur+0.940
othy+0.940
asured+0.928
ior+0.926
Neg logit contributions
apy-0.435
 are-0.344
 floats-0.341
 were-0.327
ades-0.309
Token was
Token ID373
Feature activation-1.78e-01

Pos logit contributions
umper+8.708
asured+7.658
bur+7.489
ppa+7.189
father+6.896
Neg logit contributions
apy-9.018
 are-8.300
 were-8.079
 floats-8.062
 vows-7.978
Token so
Token ID523
Feature activation-2.11e+00

Pos logit contributions
apy+0.399
 floats+0.354
 are+0.354
ades+0.342
 were+0.342
Neg logit contributions
umper-0.285
othy-0.284
bur-0.282
asured-0.278
ior-0.273
Token happy
Token ID3772
Feature activation+1.68e-01

Pos logit contributions
apy+3.751
 floats+3.377
ades+3.325
 are+3.209
 widened+3.030
Neg logit contributions
asured-4.255
othy-4.136
bur-4.015
umper-3.914
ior-3.895
Token that
Token ID326
Feature activation+3.82e-01

Pos logit contributions
umper+0.297
bur+0.290
othy+0.289
ior+0.284
asured+0.283
Neg logit contributions
apy-0.352
 are-0.310
 floats-0.309
 were-0.301
ades-0.295
Token he
Token ID339
Feature activation+6.06e-01

Pos logit contributions
umper+1.178
othy+1.160
bur+1.159
ior+1.148
asured+1.146
Neg logit contributions
apy-0.306
 are-0.210
 floats-0.204
 were-0.193
ades-0.169
Token both
Token ID1111
Feature activation-1.73e+00

Pos logit contributions
apy+4.588
ades+4.132
 floats+4.077
 vows+3.927
cakes+3.870
Neg logit contributions
othy-3.540
bur-3.429
umper-3.418
asured-3.368
my-3.303
Token jumped
Token ID11687
Feature activation-1.61e+00

Pos logit contributions
apy+3.779
ades+3.440
 floats+3.314
 are+3.206
cakes+3.191
Neg logit contributions
othy-2.997
asured-2.850
umper-2.841
ior-2.772
bur-2.748
Token in
Token ID287
Feature activation-1.28e+00

Pos logit contributions
apy+4.030
 floats+3.592
 are+3.480
ades+3.471
 vows+3.423
Neg logit contributions
umper-2.095
bur-2.038
othy-2.034
asured-2.021
 Tooth-1.924
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
apy+3.215
 floats+2.854
ades+2.778
 are+2.737
cakes+2.727
Neg logit contributions
othy-1.701
bur-1.640
asured-1.629
umper-1.601
ior-1.571
Token were
Token ID547
Feature activation-1.50e+00

Pos logit contributions
apy+2.884
ades+2.678
 are+2.502
 floats+2.476
 vows+2.388
Neg logit contributions
othy-3.008
bur-2.955
umper-2.887
ior-2.857
asured-2.851
Token so
Token ID523
Feature activation-3.71e+00

Pos logit contributions
apy+3.455
ades+3.091
 floats+3.084
 are+3.058
 vows+2.923
Neg logit contributions
asured-2.333
othy-2.329
bur-2.241
umper-2.196
ior-2.107
Token silly
Token ID14397
Feature activation-4.82e-01

Pos logit contributions
 vows+7.651
apy+7.126
ades+7.016
 floats+6.860
ves+6.809
Neg logit contributions
asured-6.919
othy-6.221
my-6.114
umper-5.673
bur-5.658
Token!
Token ID0
Feature activation-6.33e-01

Pos logit contributions
apy+0.968
 are+0.838
 floats+0.827
 were+0.808
ades+0.803
Neg logit contributions
umper-0.873
othy-0.859
bur-0.853
asured-0.849
ior-0.842
Token They
Token ID1119
Feature activation-2.11e+00

Pos logit contributions
apy+0.988
 are+0.840
 floats+0.812
ades+0.792
 were+0.785
Neg logit contributions
umper-1.444
ior-1.428
othy-1.426
bur-1.409
 Tooth-1.392
Token laughed
Token ID13818
Feature activation-1.50e+00

Pos logit contributions
apy+4.319
ades+3.940
 floats+3.692
cakes+3.585
 vows+3.557
Neg logit contributions
othy-3.900
umper-3.891
bur-3.833
asured-3.776
my-3.633
Token and
Token ID290
Feature activation-7.01e-01

Pos logit contributions
apy+4.060
 floats+3.657
 are+3.631
ades+3.623
 vows+3.506
Neg logit contributions
othy-1.610
umper-1.609
asured-1.582
ior-1.577
 Tooth-1.538
Token pushed
Token ID7121
Feature activation+4.39e-01

Pos logit contributions
apy+2.674
 floats+2.315
 are+2.286
ades+2.251
 vows+2.123
Neg logit contributions
othy-2.299
umper-2.244
asured-2.203
bur-2.157
ior-2.144
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
umper+0.960
bur+0.943
othy+0.938
asured+0.927
ior+0.925
Neg logit contributions
apy-0.738
 are-0.632
 floats-0.623
 were-0.611
ades-0.584
Token buttons
Token ID12163
Feature activation-1.14e+00

Pos logit contributions
apy+4.146
ades+3.199
 are+3.179
 were+3.061
 floats+3.056
Neg logit contributions
nie-4.745
othy-4.732
 Tooth-4.623
umper-4.616
ior-4.603
Token and
Token ID290
Feature activation-9.87e-01

Pos logit contributions
apy+2.716
 floats+2.446
ades+2.369
 are+2.363
 vows+2.317
Neg logit contributions
umper-1.629
othy-1.614
bur-1.604
asured-1.556
ior-1.516
Token the
Token ID262
Feature activation-2.39e+00

Pos logit contributions
apy+1.926
ades+1.702
 are+1.695
 floats+1.686
 vows+1.605
Neg logit contributions
othy-1.819
umper-1.807
bur-1.782
 Tooth-1.764
asured-1.749
Token images
Token ID4263
Feature activation-3.65e+00

Pos logit contributions
apy+5.421
 are+4.579
ades+4.549
 floats+4.420
 were+4.340
Neg logit contributions
othy-4.132
 Tooth-4.052
nie-3.958
umper-3.900
my-3.778
Token changed
Token ID3421
Feature activation-1.35e+00

Pos logit contributions
apy+7.239
ades+6.740
 floats+6.704
 vows+6.458
 Sorry+6.024
Neg logit contributions
othy-6.457
asured-6.269
my-6.143
bur-6.056
umper-5.865
Token back
Token ID736
Feature activation-7.38e-01

Pos logit contributions
apy+2.789
 floats+2.415
 are+2.399
ades+2.346
 were+2.271
Neg logit contributions
asured-2.346
othy-2.346
bur-2.322
umper-2.313
ior-2.259
Token to
Token ID284
Feature activation-1.66e+00

Pos logit contributions
apy+1.466
 floats+1.272
 are+1.251
ades+1.218
 were+1.204
Neg logit contributions
othy-1.353
bur-1.349
umper-1.347
asured-1.344
ior-1.309
Token the
Token ID262
Feature activation-1.58e+00

Pos logit contributions
apy+3.148
ades+2.714
 floats+2.680
cakes+2.526
 are+2.510
Neg logit contributions
othy-3.410
 Tooth-3.276
asured-3.261
ior-3.219
bur-3.214
Token way
Token ID835
Feature activation-1.01e+00

Pos logit contributions
apy+2.927
 are+2.273
ades+2.258
 floats+2.200
 were+2.168
Neg logit contributions
othy-3.309
 Tooth-3.299
ior-3.281
umper-3.277
bur-3.205
Token All
Token ID1439
Feature activation-1.30e+00

Pos logit contributions
apy+2.038
 are+1.825
 floats+1.812
ades+1.757
 were+1.702
Neg logit contributions
umper-2.173
othy-2.145
 Tooth-2.141
ior-2.140
bur-2.083
Token the
Token ID262
Feature activation-4.92e-01

Pos logit contributions
apy+2.253
ades+2.006
 floats+1.902
 are+1.901
 vows+1.881
Neg logit contributions
othy-2.674
umper-2.673
asured-2.625
bur-2.593
ior-2.543
Token people
Token ID661
Feature activation-2.14e+00

Pos logit contributions
apy+0.928
 are+0.782
 floats+0.778
ades+0.760
 were+0.744
Neg logit contributions
umper-0.988
bur-0.964
othy-0.956
ior-0.948
 Tooth-0.948
Token around
Token ID1088
Feature activation-2.28e+00

Pos logit contributions
apy+4.390
ades+3.949
 floats+3.846
 vows+3.731
cakes+3.648
Neg logit contributions
umper-3.823
asured-3.720
bur-3.656
othy-3.646
my-3.545
Token were
Token ID547
Feature activation-1.72e+00

Pos logit contributions
apy+4.902
ades+4.316
 floats+4.097
cakes+4.096
 vows+3.912
Neg logit contributions
umper-3.994
othy-3.993
bur-3.896
asured-3.842
ior-3.754
Token so
Token ID523
Feature activation-3.62e+00

Pos logit contributions
apy+4.015
 floats+3.550
ades+3.477
 are+3.461
cakes+3.333
Neg logit contributions
asured-2.703
othy-2.628
bur-2.608
umper-2.570
ppa-2.415
Token happy
Token ID3772
Feature activation-7.05e-01

Pos logit contributions
apy+7.806
ades+7.505
 vows+7.300
cakes+7.296
 floats+7.230
Neg logit contributions
asured-6.304
othy-5.405
my-5.356
umper-5.075
ior-4.914
Token,
Token ID11
Feature activation-9.30e-01

Pos logit contributions
apy+1.589
 are+1.378
 floats+1.377
ades+1.348
 were+1.319
Neg logit contributions
umper-1.107
othy-1.098
asured-1.096
bur-1.088
 Tooth-1.073
Token they
Token ID484
Feature activation-1.98e+00

Pos logit contributions
apy+1.367
ades+1.115
 are+1.103
 floats+1.075
 were+1.023
Neg logit contributions
umper-2.148
bur-2.136
othy-2.127
 Tooth-2.125
asured-2.117
Token cheered
Token ID34767
Feature activation-4.78e-01

Pos logit contributions
apy+4.298
ades+3.702
 floats+3.558
cakes+3.403
 are+3.401
Neg logit contributions
umper-3.650
bur-3.501
othy-3.452
asured-3.391
ppa-3.352
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
apy+1.147
 floats+1.012
 are+1.005
ades+0.980
 were+0.976
Neg logit contributions
umper-0.702
othy-0.686
bur-0.681
asured-0.678
ior-0.674
Token!"
Token ID2474
Feature activation-2.68e+00

Pos logit contributions
apy+0.752
 floats+0.632
 are+0.622
 were+0.607
ades+0.607
Neg logit contributions
umper-0.835
bur-0.820
othy-0.819
 Tooth-0.806
ior-0.799
Token Tom
Token ID4186
Feature activation+6.13e-01

Pos logit contributions
apy+5.760
ades+5.463
 vows+5.268
 floats+5.181
cakes+4.964
Neg logit contributions
othy-4.570
 Tooth-4.327
ior-4.140
umper-4.060
bur-3.958
Token and
Token ID290
Feature activation-4.36e-02

Pos logit contributions
umper+1.418
bur+1.394
othy+1.381
asured+1.362
 Tooth+1.361
Neg logit contributions
apy-0.927
 are-0.813
 floats-0.801
 were-0.774
ades-0.737
Token Mia
Token ID32189
Feature activation-1.65e+00

Pos logit contributions
apy+0.064
 are+0.052
 floats+0.052
 were+0.050
ades+0.050
Neg logit contributions
umper-0.104
othy-0.104
bur-0.102
ior-0.101
 Tooth-0.101
Token say
Token ID910
Feature activation-1.97e+00

Pos logit contributions
apy+3.774
ades+3.124
 floats+3.068
cakes+2.948
ves+2.893
Neg logit contributions
othy-2.893
umper-2.875
ior-2.815
bur-2.778
 Tooth-2.714
Token together
Token ID1978
Feature activation-3.61e+00

Pos logit contributions
apy+4.174
 floats+3.880
ades+3.655
 are+3.581
 were+3.566
Neg logit contributions
othy-3.221
umper-3.193
asured-3.129
 Tooth-3.125
ior-3.026
Token.
Token ID13
Feature activation-2.66e+00

Pos logit contributions
apy+9.437
ades+8.429
 floats+8.346
cakes+7.858
ves+7.445
Neg logit contributions
asured-4.986
othy-4.972
my-4.955
 Tooth-4.638
bur-4.360
Token They
Token ID1119
Feature activation-1.65e+00

Pos logit contributions
apy+5.344
 are+4.692
ades+4.642
 floats+4.599
 vows+4.339
Neg logit contributions
othy-4.785
ior-4.776
 Tooth-4.688
bur-4.535
umper-4.485
Token hold
Token ID1745
Feature activation+7.15e-02

Pos logit contributions
apy+3.304
ades+2.733
 floats+2.686
cakes+2.597
ves+2.553
Neg logit contributions
othy-3.332
umper-3.294
bur-3.181
asured-3.150
ior-3.109
Token up
Token ID510
Feature activation-7.30e-01

Pos logit contributions
umper+0.157
bur+0.154
othy+0.154
ior+0.151
asured+0.151
Neg logit contributions
apy-0.119
 are-0.101
 floats-0.101
 were-0.097
ades-0.096
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
apy+1.439
 floats+1.252
 are+1.242
ades+1.189
 were+1.178
Neg logit contributions
umper-1.360
othy-1.357
bur-1.345
 Tooth-1.332
asured-1.331
Token on
Token ID319
Feature activation-5.80e-01

Pos logit contributions
apy+3.275
 floats+2.895
ades+2.873
 are+2.822
 vows+2.796
Neg logit contributions
bur-2.132
umper-2.111
othy-2.045
ior-1.992
asured-1.970
Token their
Token ID511
Feature activation-2.23e+00

Pos logit contributions
apy+0.985
 floats+0.839
 are+0.828
ades+0.803
 were+0.787
Neg logit contributions
umper-1.241
bur-1.233
othy-1.226
 Tooth-1.211
ior-1.194
Token papers
Token ID9473
Feature activation-1.83e+00

Pos logit contributions
apy+4.842
ades+4.085
 were+4.047
 are+4.007
cakes+3.797
Neg logit contributions
 Tooth-3.786
othy-3.754
ior-3.745
asured-3.732
bur-3.683
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
apy+4.146
 floats+3.609
ades+3.510
 vows+3.407
cakes+3.340
Neg logit contributions
bur-2.929
umper-2.881
othy-2.855
 Tooth-2.807
asured-2.795
Token Their
Token ID5334
Feature activation-2.45e+00

Pos logit contributions
apy+1.810
 floats+1.564
 are+1.562
ades+1.517
 were+1.501
Neg logit contributions
umper-2.188
 Tooth-2.151
othy-2.131
ior-2.104
bur-2.101
Token images
Token ID4263
Feature activation-3.59e+00

Pos logit contributions
apy+4.751
 are+3.933
 were+3.929
ades+3.874
 widened+3.752
Neg logit contributions
 Tooth-4.745
othy-4.699
asured-4.604
ior-4.552
bur-4.534
Token start
Token ID923
Feature activation-1.70e+00

Pos logit contributions
apy+6.570
ades+5.635
cakes+5.467
 vows+5.153
 floats+5.051
Neg logit contributions
bur-7.167
othy-7.166
asured-7.137
my-7.023
umper-7.003
Token to
Token ID284
Feature activation-2.90e+00

Pos logit contributions
apy+4.133
 floats+3.690
 are+3.659
ades+3.555
 were+3.490
Neg logit contributions
asured-2.449
othy-2.335
bur-2.303
umper-2.208
 Tooth-2.204
Token fade
Token ID22100
Feature activation+4.00e-01

Pos logit contributions
apy+5.662
ades+5.030
 vows+4.764
cakes+4.739
 floats+4.690
Neg logit contributions
bur-6.092
othy-5.979
 Tooth-5.920
asured-5.593
my-5.558
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
umper+0.760
othy+0.741
bur+0.740
ior+0.725
asured+0.724
Neg logit contributions
apy-0.785
 are-0.690
 floats-0.687
 were-0.671
ades-0.651
Token They
Token ID1119
Feature activation-2.39e+00

Pos logit contributions
apy+1.725
 are+1.481
 floats+1.470
ades+1.425
 were+1.417
Neg logit contributions
umper-2.207
 Tooth-2.169
othy-2.144
ior-2.128
bur-2.128
Token All
Token ID1439
Feature activation-8.87e-01

Pos logit contributions
apy+1.440
 are+1.259
 floats+1.253
ades+1.228
 were+1.194
Neg logit contributions
umper-1.566
othy-1.548
ior-1.545
 Tooth-1.544
bur-1.491
Token the
Token ID262
Feature activation-3.09e-01

Pos logit contributions
apy+1.558
ades+1.367
 floats+1.338
 are+1.323
 vows+1.280
Neg logit contributions
umper-1.823
othy-1.807
asured-1.773
bur-1.772
ior-1.748
Token kids
Token ID3988
Feature activation-1.69e+00

Pos logit contributions
apy+0.558
 floats+0.475
 are+0.472
ades+0.457
 were+0.456
Neg logit contributions
umper-0.641
bur-0.626
othy-0.625
 Tooth-0.619
ior-0.618
Token around
Token ID1088
Feature activation-2.19e+00

Pos logit contributions
apy+3.467
 floats+3.045
ades+3.032
 vows+2.861
 are+2.819
Neg logit contributions
umper-3.051
asured-2.979
othy-2.966
bur-2.931
my-2.893
Token were
Token ID547
Feature activation-1.52e+00

Pos logit contributions
apy+4.732
ades+4.151
 floats+3.973
cakes+3.834
 vows+3.770
Neg logit contributions
umper-3.901
othy-3.869
bur-3.783
ior-3.718
 Tooth-3.716
Token so
Token ID523
Feature activation-3.58e+00

Pos logit contributions
apy+3.425
 floats+3.111
 are+3.008
ades+3.007
 vows+2.842
Neg logit contributions
asured-2.452
othy-2.417
bur-2.376
umper-2.366
 Tooth-2.280
Token jealous
Token ID19354
Feature activation-5.62e-01

Pos logit contributions
apy+6.690
 floats+6.616
ades+6.566
 vows+6.418
 curtain+6.287
Neg logit contributions
asured-6.678
my-6.196
othy-6.023
umper-5.842
ior-5.837
Token of
Token ID286
Feature activation-7.84e-02

Pos logit contributions
apy+1.094
 floats+0.934
 are+0.928
ades+0.908
 were+0.887
Neg logit contributions
umper-1.068
othy-1.035
asured-1.033
bur-1.021
 Tooth-1.020
Token the
Token ID262
Feature activation-7.86e-01

Pos logit contributions
apy+0.130
 floats+0.110
 are+0.109
 were+0.105
ades+0.105
Neg logit contributions
umper-0.173
othy-0.170
bur-0.170
ior-0.167
asured-0.167
Token juice
Token ID13135
Feature activation-1.39e+00

Pos logit contributions
apy+1.557
 floats+1.336
 are+1.307
ades+1.293
 were+1.291
Neg logit contributions
umper-1.471
 Tooth-1.461
othy-1.453
bur-1.448
ior-1.421
Token maker
Token ID16009
Feature activation-5.33e-01

Pos logit contributions
apy+2.916
 floats+2.492
ades+2.433
 are+2.374
 vows+2.268
Neg logit contributions
umper-2.476
bur-2.453
 Tooth-2.434
othy-2.420
asured-2.351
Token melted
Token ID24178
Feature activation+4.56e-01

Pos logit contributions
apy+3.855
 floats+3.355
ades+3.346
 are+3.343
 were+3.062
Neg logit contributions
bur-3.921
othy-3.844
umper-3.794
asured-3.786
 Tooth-3.719
Token and
Token ID290
Feature activation+9.68e-02

Pos logit contributions
umper+0.917
othy+0.895
bur+0.891
ior+0.878
asured+0.876
Neg logit contributions
apy-0.851
 are-0.738
 floats-0.735
 were-0.718
ades-0.693
Token the
Token ID262
Feature activation-1.14e+00

Pos logit contributions
umper+0.203
bur+0.199
othy+0.199
asured+0.195
 Tooth+0.195
Neg logit contributions
apy-0.170
 are-0.146
 floats-0.145
 were-0.141
ades-0.138
Token flame
Token ID16506
Feature activation-2.10e+00

Pos logit contributions
apy+1.947
 are+1.584
 floats+1.568
ades+1.549
 were+1.526
Neg logit contributions
 Tooth-2.499
bur-2.441
othy-2.406
umper-2.400
asured-2.378
Token f
Token ID277
Feature activation-6.94e-01

Pos logit contributions
apy+4.068
ades+3.596
 floats+3.536
 are+3.465
cakes+3.319
Neg logit contributions
bur-3.817
othy-3.797
asured-3.750
umper-3.656
 Tooth-3.639
Tokenlic
Token ID677
Feature activation-3.53e+00

Pos logit contributions
apy+1.481
 are+1.348
 floats+1.332
 were+1.300
 they+1.276
Neg logit contributions
umper-1.212
bur-1.156
othy-1.152
asured-1.141
ior-1.119
Tokenkered
Token ID28970
Feature activation-1.20e+00

Pos logit contributions
apy+5.083
 are+4.951
 floats+4.811
 sip+4.336
 contest+4.265
Neg logit contributions
asured-7.919
othy-7.695
umper-7.462
nie-7.405
ppa-7.247
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
apy+2.497
 floats+2.235
 are+2.211
ades+2.130
 were+2.106
Neg logit contributions
othy-2.037
bur-2.011
umper-2.009
asured-1.981
 Tooth-1.947
Token He
Token ID679
Feature activation-8.39e-02

Pos logit contributions
umper+2.387
bur+2.325
othy+2.268
asured+2.265
ior+2.234
Neg logit contributions
apy-2.164
 were-1.807
 are-1.807
 floats-1.801
 they-1.727
Token realized
Token ID6939
Feature activation+1.68e+00

Pos logit contributions
apy+0.161
 are+0.140
 floats+0.140
ades+0.134
 were+0.134
Neg logit contributions
umper-0.161
othy-0.159
bur-0.158
asured-0.156
 Tooth-0.155
Token that
Token ID326
Feature activation+6.61e-01

Pos logit contributions
umper+3.805
ior+3.772
bur+3.709
othy+3.709
asured+3.637
Neg logit contributions
apy-2.953
 they-2.570
 were-2.472
 are-2.432
 floats-2.399
Token and
Token ID290
Feature activation-1.38e+00

Pos logit contributions
apy+0.506
 floats+0.451
 are+0.449
 were+0.436
ades+0.433
Neg logit contributions
umper-0.331
othy-0.324
bur-0.323
ior-0.315
asured-0.314
Token went
Token ID1816
Feature activation-8.69e-01

Pos logit contributions
apy+2.708
ades+2.433
 are+2.378
 floats+2.374
 vows+2.321
Neg logit contributions
bur-2.465
othy-2.449
 Tooth-2.430
umper-2.419
asured-2.354
Token black
Token ID2042
Feature activation-6.60e-01

Pos logit contributions
apy+1.898
 are+1.655
 floats+1.633
ades+1.596
 were+1.593
Neg logit contributions
umper-1.445
bur-1.440
othy-1.413
asured-1.387
ior-1.362
Token.
Token ID13
Feature activation-2.47e+00

Pos logit contributions
apy+1.380
 floats+1.184
 are+1.176
ades+1.143
 were+1.139
Neg logit contributions
umper-1.158
othy-1.148
bur-1.130
ior-1.122
asured-1.115
Token The
Token ID383
Feature activation-1.70e+00

Pos logit contributions
apy+4.768
 floats+4.555
ades+4.497
 are+4.377
 vows+4.214
Neg logit contributions
 Tooth-4.681
ior-4.442
umper-4.340
othy-4.233
asured-4.129
Token pictures
Token ID5986
Feature activation-3.52e+00

Pos logit contributions
apy+3.157
ades+2.543
 are+2.489
 floats+2.457
 were+2.451
Neg logit contributions
 Tooth-3.456
othy-3.424
umper-3.375
ior-3.363
bur-3.324
Token and
Token ID290
Feature activation-1.58e+00

Pos logit contributions
apy+7.621
ades+7.146
 vows+6.765
 floats+6.675
cakes+6.387
Neg logit contributions
bur-5.611
asured-5.425
umper-5.397
othy-5.350
my-5.320
Token sounds
Token ID5238
Feature activation-2.85e+00

Pos logit contributions
apy+3.009
ades+2.611
 are+2.451
 floats+2.344
 were+2.337
Neg logit contributions
othy-3.080
 Tooth-3.049
bur-3.040
ior-3.017
umper-3.012
Token were
Token ID547
Feature activation-1.21e+00

Pos logit contributions
apy+7.040
ades+6.498
 vows+6.258
 floats+6.117
cakes+6.102
Neg logit contributions
 Tooth-3.619
asured-3.558
umper-3.557
ior-3.437
othy-3.430
Token gone
Token ID3750
Feature activation-7.42e-01

Pos logit contributions
apy+2.642
 floats+2.339
 are+2.328
ades+2.293
 were+2.233
Neg logit contributions
asured-2.029
bur-1.974
othy-1.949
umper-1.939
 Tooth-1.888
Token.
Token ID13
Feature activation-2.52e+00

Pos logit contributions
apy+1.489
 floats+1.270
 are+1.235
ades+1.216
 were+1.193
Neg logit contributions
bur-1.371
umper-1.370
asured-1.350
othy-1.345
ior-1.312
Token go
Token ID467
Feature activation+7.58e-01

Pos logit contributions
apy+0.865
 floats+0.719
ades+0.710
 are+0.698
 were+0.685
Neg logit contributions
umper-1.016
bur-1.006
othy-1.000
 Tooth-0.977
ior-0.975
Token!"
Token ID2474
Feature activation-1.15e+00

Pos logit contributions
umper+1.483
bur+1.433
 Tooth+1.432
othy+1.429
ior+1.425
Neg logit contributions
apy-1.449
 are-1.301
 floats-1.285
 were-1.254
ades-1.205
Token The
Token ID383
Feature activation-2.25e+00

Pos logit contributions
apy+1.918
ades+1.718
 are+1.709
 floats+1.684
 were+1.581
Neg logit contributions
ior-2.368
othy-2.348
umper-2.343
bur-2.313
 Tooth-2.304
Token birds
Token ID10087
Feature activation-2.47e+00

Pos logit contributions
apy+4.619
ades+3.632
 floats+3.523
 vows+3.511
 are+3.408
Neg logit contributions
othy-4.791
bur-4.531
umper-4.400
 Tooth-4.355
my-4.348
Token and
Token ID290
Feature activation-9.33e-01

Pos logit contributions
apy+5.277
ades+4.584
cakes+4.424
 floats+4.412
 vows+4.304
Neg logit contributions
bur-4.569
umper-4.512
othy-4.466
my-4.309
i-4.263
Token bugs
Token ID11316
Feature activation-3.51e+00

Pos logit contributions
apy+1.755
ades+1.538
 are+1.509
 floats+1.465
 were+1.417
Neg logit contributions
othy-1.830
bur-1.829
umper-1.794
ior-1.784
 Tooth-1.730
Token cheered
Token ID34767
Feature activation-1.01e+00

Pos logit contributions
apy+7.735
ades+6.521
cakes+6.191
 Sorry+6.163
 vows+6.113
Neg logit contributions
bur-6.737
othy-6.418
umper-6.328
i-6.261
my-6.174
Token back
Token ID736
Feature activation-1.02e+00

Pos logit contributions
apy+2.471
 floats+2.190
 are+2.148
ades+2.130
 were+2.076
Neg logit contributions
umper-1.410
ior-1.409
othy-1.401
bur-1.393
asured-1.381
Token,
Token ID11
Feature activation-6.24e-01

Pos logit contributions
apy+2.019
 floats+1.764
 are+1.762
ades+1.712
 were+1.693
Neg logit contributions
bur-1.837
othy-1.834
asured-1.821
ior-1.812
umper-1.811
Token "
Token ID366
Feature activation-4.54e-01

Pos logit contributions
apy+1.492
 are+1.323
 floats+1.308
ades+1.304
 were+1.270
Neg logit contributions
bur-0.899
othy-0.889
umper-0.889
asured-0.876
ior-0.866
TokenYou
Token ID1639
Feature activation-1.84e+00

Pos logit contributions
apy+0.812
 are+0.696
 floats+0.684
ades+0.664
 were+0.663
Neg logit contributions
umper-0.943
othy-0.925
bur-0.925
ior-0.901
asured-0.897
Token the
Token ID262
Feature activation-2.45e+00

Pos logit contributions
apy+5.307
ades+5.268
 vows+4.852
cakes+4.726
 are+4.722
Neg logit contributions
 Tooth-4.475
othy-4.328
bur-4.327
umper-4.164
ior-4.077
Token birds
Token ID10087
Feature activation-2.35e+00

Pos logit contributions
apy+5.384
ades+4.750
 floats+4.515
 vows+4.503
cakes+4.381
Neg logit contributions
othy-4.203
 Tooth-4.178
asured-3.985
my-3.911
umper-3.866
Token stopped
Token ID5025
Feature activation+4.45e-01

Pos logit contributions
apy+5.075
ades+4.332
 vows+4.267
cakes+4.240
 floats+4.226
Neg logit contributions
umper-4.407
bur-4.322
othy-4.110
my-4.070
i-3.971
Token singing
Token ID13777
Feature activation-2.45e+00

Pos logit contributions
umper+0.842
othy+0.822
bur+0.821
ior+0.810
 Tooth+0.808
Neg logit contributions
apy-0.882
 are-0.772
 floats-0.766
 were-0.752
ades-0.728
Token and
Token ID290
Feature activation-2.23e+00

Pos logit contributions
apy+6.426
 floats+5.539
ades+5.452
 are+5.407
 vows+5.236
Neg logit contributions
othy-3.212
umper-3.144
 Tooth-3.138
asured-3.094
bur-3.077
Token there
Token ID612
Feature activation-3.51e+00

Pos logit contributions
apy+4.405
ades+4.088
 are+3.557
ves+3.551
cakes+3.546
Neg logit contributions
othy-4.335
bur-4.316
 Tooth-4.159
umper-4.139
ior-4.092
Token was
Token ID373
Feature activation-1.82e+00

Pos logit contributions
apy+8.273
ades+7.342
 floats+7.267
cakes+7.018
oms+6.666
Neg logit contributions
othy-5.450
asured-5.296
my-5.122
bur-5.099
i-5.085
Token silence
Token ID9550
Feature activation-2.28e+00

Pos logit contributions
apy+3.943
 floats+3.425
ades+3.405
 are+3.383
 were+3.172
Neg logit contributions
othy-3.170
bur-3.107
 Tooth-3.086
asured-3.074
umper-2.938
Token.
Token ID13
Feature activation-2.04e+00

Pos logit contributions
apy+5.242
 floats+4.480
 are+4.440
ades+4.383
 vows+4.200
Neg logit contributions
asured-3.614
othy-3.500
 Tooth-3.454
bur-3.436
umper-3.333
Token The
Token ID383
Feature activation-2.62e+00

Pos logit contributions
apy+4.168
 are+3.636
ades+3.616
 floats+3.467
 were+3.408
Neg logit contributions
ior-3.793
 Tooth-3.704
othy-3.632
umper-3.599
asured-3.485
Token big
Token ID1263
Feature activation-1.62e+00

Pos logit contributions
apy+6.121
ades+5.240
 vows+4.919
 are+4.894
cakes+4.877
Neg logit contributions
 Tooth-4.364
othy-4.354
asured-4.088
my-4.001
umper-3.982
Token Ben
Token ID3932
Feature activation+9.76e-01

Pos logit contributions
umper+0.110
bur+0.108
othy+0.108
ior+0.106
 Tooth+0.106
Neg logit contributions
apy-0.093
 floats-0.080
 are-0.079
 were-0.077
ades-0.076
Token and
Token ID290
Feature activation+2.34e-01

Pos logit contributions
umper+2.042
bur+2.003
othy+1.998
ior+1.957
asured+1.954
Neg logit contributions
apy-1.696
 are-1.537
 were-1.471
 floats-1.470
 they-1.421
Token they
Token ID484
Feature activation-2.19e+00

Pos logit contributions
umper+0.491
bur+0.481
othy+0.480
ior+0.473
asured+0.471
Neg logit contributions
apy-0.410
 are-0.351
 floats-0.350
 were-0.340
ades-0.331
Token watch
Token ID2342
Feature activation-1.76e+00

Pos logit contributions
apy+4.124
ades+3.522
 floats+3.215
cakes+3.199
ves+3.141
Neg logit contributions
bur-4.617
umper-4.613
othy-4.612
ior-4.388
my-4.314
Token the
Token ID262
Feature activation-2.20e+00

Pos logit contributions
apy+3.634
ades+3.171
 floats+3.107
 are+3.051
 were+2.995
Neg logit contributions
othy-3.146
 Tooth-3.145
umper-3.093
ior-3.071
bur-3.017
Token video
Token ID2008
Feature activation-3.49e+00

Pos logit contributions
apy+4.267
 were+3.400
 are+3.365
ades+3.302
 they+3.187
Neg logit contributions
 Tooth-4.507
othy-4.408
umper-4.355
bur-4.289
ior-4.142
Token on
Token ID319
Feature activation-5.17e-01

Pos logit contributions
apy+8.896
ades+7.282
 are+7.225
 floats+7.192
 vows+7.110
Neg logit contributions
othy-5.023
 Tooth-4.950
my-4.832
asured-4.773
umper-4.707
Token the
Token ID262
Feature activation-8.86e-01

Pos logit contributions
apy+0.922
 floats+0.785
 are+0.779
ades+0.752
 were+0.751
Neg logit contributions
umper-1.064
othy-1.059
bur-1.056
 Tooth-1.052
ior-1.034
Token screen
Token ID3159
Feature activation-2.59e+00

Pos logit contributions
apy+1.595
 are+1.270
 were+1.266
 floats+1.265
ades+1.260
Neg logit contributions
othy-1.853
 Tooth-1.849
umper-1.843
bur-1.830
ior-1.812
Token.
Token ID13
Feature activation-9.36e-01

Pos logit contributions
apy+6.090
 floats+5.061
ades+5.031
 are+4.928
 vows+4.832
Neg logit contributions
othy-4.173
asured-4.015
 Tooth-3.967
my-3.952
bur-3.881
Token They
Token ID1119
Feature activation-2.29e+00

Pos logit contributions
apy+1.368
 are+1.175
 floats+1.158
ades+1.151
 were+1.126
Neg logit contributions
umper-2.193
ior-2.155
 Tooth-2.145
othy-2.144
bur-2.107
Token the
Token ID262
Feature activation-5.61e-01

Pos logit contributions
apy+3.698
ades+3.294
 floats+3.171
 are+3.144
cakes+3.005
Neg logit contributions
 Tooth-3.345
othy-3.297
umper-3.252
bur-3.228
asured-3.108
Token bees
Token ID17002
Feature activation-1.75e+00

Pos logit contributions
apy+1.053
 floats+0.892
 are+0.882
 were+0.858
ades+0.858
Neg logit contributions
umper-1.109
bur-1.106
 Tooth-1.098
othy-1.095
ior-1.072
Token buzzing
Token ID41719
Feature activation-1.94e+00

Pos logit contributions
apy+4.178
 floats+3.665
ades+3.651
 vows+3.534
 are+3.451
Neg logit contributions
bur-2.554
umper-2.532
othy-2.481
 Tooth-2.400
asured-2.391
Token in
Token ID287
Feature activation-1.50e+00

Pos logit contributions
apy+4.648
 floats+4.058
ades+4.031
 are+3.981
 vows+3.884
Neg logit contributions
umper-2.774
othy-2.772
ior-2.678
bur-2.642
asured-2.610
Token and
Token ID290
Feature activation-1.40e+00

Pos logit contributions
apy+2.943
ades+2.546
 floats+2.522
cakes+2.488
 vows+2.378
Neg logit contributions
othy-2.890
bur-2.785
umper-2.774
 Tooth-2.752
ior-2.718
Token out
Token ID503
Feature activation-3.49e+00

Pos logit contributions
apy+2.713
ades+2.459
 floats+2.364
 are+2.334
cakes+2.214
Neg logit contributions
othy-2.613
bur-2.565
umper-2.534
 Tooth-2.504
asured-2.485
Token of
Token ID286
Feature activation-4.91e-01

Pos logit contributions
apy+7.072
ades+6.549
 are+6.534
 floats+6.530
cakes+6.427
Neg logit contributions
othy-6.119
asured-5.644
umper-5.467
 Tooth-5.353
i-5.157
Token the
Token ID262
Feature activation-2.40e-01

Pos logit contributions
apy+0.827
 floats+0.698
 are+0.689
ades+0.680
 were+0.660
Neg logit contributions
othy-1.059
umper-1.056
bur-1.054
 Tooth-1.032
asured-1.025
Token small
Token ID1402
Feature activation-3.18e-01

Pos logit contributions
apy+0.449
 are+0.383
 floats+0.383
 were+0.372
ades+0.368
Neg logit contributions
umper-0.476
bur-0.470
othy-0.469
 Tooth-0.462
ior-0.461
Token holes
Token ID10421
Feature activation-1.93e+00

Pos logit contributions
apy+0.598
 floats+0.509
 are+0.507
ades+0.495
 were+0.491
Neg logit contributions
umper-0.628
othy-0.617
bur-0.616
ior-0.609
 Tooth-0.604
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
apy+4.082
 floats+3.621
ades+3.399
 vows+3.396
 are+3.378
Neg logit contributions
othy-3.283
bur-3.245
asured-3.226
umper-3.206
 Tooth-2.997
Token!"
Token ID2474
Feature activation-2.54e+00

Pos logit contributions
apy+3.054
 floats+2.679
 were+2.637
 are+2.626
ades+2.575
Neg logit contributions
othy-2.477
 Tooth-2.451
bur-2.448
ior-2.424
asured-2.402
Token Sam
Token ID3409
Feature activation+2.82e-01

Pos logit contributions
apy+5.663
ades+5.202
 vows+5.029
 floats+4.832
ves+4.688
Neg logit contributions
othy-4.117
ior-4.050
 Tooth-4.035
my-3.898
umper-3.848
Token and
Token ID290
Feature activation-2.39e-01

Pos logit contributions
umper+0.467
bur+0.454
othy+0.453
ior+0.444
asured+0.443
Neg logit contributions
apy-0.617
 are-0.549
 floats-0.547
 were-0.535
ades-0.523
Token Tom
Token ID4186
Feature activation-1.28e+00

Pos logit contributions
apy+0.420
 are+0.357
 floats+0.357
ades+0.352
 were+0.347
Neg logit contributions
umper-0.504
othy-0.503
bur-0.493
ior-0.489
 Tooth-0.488
Token say
Token ID910
Feature activation-2.09e+00

Pos logit contributions
apy+2.808
 floats+2.297
ades+2.288
ves+2.125
 vows+2.112
Neg logit contributions
umper-2.334
ior-2.319
othy-2.318
bur-2.300
 Tooth-2.216
Token together
Token ID1978
Feature activation-3.48e+00

Pos logit contributions
apy+4.426
 floats+4.173
ades+3.877
 were+3.840
 are+3.828
Neg logit contributions
othy-3.390
umper-3.342
asured-3.284
 Tooth-3.269
ior-3.249
Token.
Token ID13
Feature activation-2.05e+00

Pos logit contributions
apy+9.230
 floats+8.516
ades+8.397
 vows+7.555
cakes+7.554
Neg logit contributions
asured-4.682
my-4.565
othy-4.443
 Tooth-4.342
ior-4.131
Token\n
Token ID198
Feature activation-1.21e+00

Pos logit contributions
apy+3.911
 are+3.524
 floats+3.506
ades+3.376
 vows+3.336
Neg logit contributions
ior-3.823
othy-3.768
 Tooth-3.747
bur-3.677
umper-3.665
Token\n
Token ID198
Feature activation-1.29e+00

Pos logit contributions
apy+2.216
 floats+1.978
 are+1.903
 vows+1.841
 were+1.824
Neg logit contributions
umper-2.389
bur-2.369
othy-2.357
ior-2.337
 Tooth-2.316
Token"
Token ID1
Feature activation-1.91e+00

Pos logit contributions
apy+2.494
 are+2.233
 floats+2.215
 were+2.135
 vows+2.116
Neg logit contributions
othy-2.435
bur-2.407
umper-2.406
ior-2.377
asured-2.320
TokenBut
Token ID1537
Feature activation+2.65e-01

Pos logit contributions
apy+4.110
 are+3.654
 floats+3.540
ades+3.524
 were+3.512
Neg logit contributions
othy-3.175
umper-3.170
 Tooth-3.139
bur-3.111
ior-3.013
Token mom
Token ID1995
Feature activation+8.54e-02

Pos logit contributions
apy+2.171
 floats+1.902
ades+1.850
 are+1.804
 were+1.755
Neg logit contributions
umper-1.977
othy-1.963
bur-1.956
 Tooth-1.946
ior-1.893
Token were
Token ID547
Feature activation-3.59e-01

Pos logit contributions
umper+0.171
bur+0.167
othy+0.167
ior+0.164
asured+0.163
Neg logit contributions
apy-0.160
 floats-0.137
 are-0.137
 were-0.132
ades-0.131
Token talking
Token ID3375
Feature activation-1.85e+00

Pos logit contributions
apy+0.776
 floats+0.683
 are+0.677
ades+0.662
 were+0.658
Neg logit contributions
umper-0.609
bur-0.594
othy-0.588
asured-0.578
 Tooth-0.573
Token.
Token ID13
Feature activation-2.49e+00

Pos logit contributions
apy+4.640
 floats+4.101
 are+4.069
 were+3.847
ades+3.834
Neg logit contributions
 Tooth-2.609
othy-2.588
umper-2.540
bur-2.502
asured-2.489
Token Sue
Token ID26089
Feature activation+1.38e-01

Pos logit contributions
apy+4.847
ades+4.443
 floats+4.106
 vows+3.900
 are+3.846
Neg logit contributions
othy-4.880
bur-4.607
ior-4.580
my-4.430
asured-4.423
Token said
Token ID531
Feature activation-3.47e+00

Pos logit contributions
umper+0.271
bur+0.265
othy+0.265
ior+0.260
asured+0.259
Neg logit contributions
apy-0.260
 are-0.224
 floats-0.223
 were-0.217
ades-0.213
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
apy+7.871
ades+7.282
 floats+7.175
 widened+7.088
 are+6.578
Neg logit contributions
othy-4.937
 Tooth-4.820
my-4.786
umper-4.605
i-4.491
Token "
Token ID366
Feature activation-3.44e-01

Pos logit contributions
apy+1.929
 floats+1.686
 are+1.673
ades+1.627
 were+1.551
Neg logit contributions
 Tooth-2.258
umper-2.249
bur-2.235
othy-2.232
ior-2.189
TokenMom
Token ID29252
Feature activation+3.79e-01

Pos logit contributions
apy+0.642
 are+0.545
 floats+0.542
ades+0.527
 were+0.518
Neg logit contributions
umper-0.690
bur-0.684
othy-0.682
ior-0.668
 Tooth-0.661
Token,
Token ID11
Feature activation-7.79e-01

Pos logit contributions
umper+0.806
bur+0.786
othy+0.785
ior+0.775
asured+0.775
Neg logit contributions
apy-0.651
 are-0.566
 floats-0.558
 were-0.546
ades-0.525
Token I
Token ID314
Feature activation-1.22e-02

Pos logit contributions
apy+1.520
ades+1.282
 floats+1.266
 are+1.216
 were+1.208
Neg logit contributions
umper-1.485
bur-1.461
othy-1.456
 Tooth-1.452
ior-1.416
Token a
Token ID257
Feature activation-2.14e+00

Pos logit contributions
apy+3.220
 floats+2.776
 are+2.744
ades+2.703
 were+2.679
Neg logit contributions
othy-3.046
 Tooth-3.030
bur-2.995
umper-2.992
ior-2.948
Token funny
Token ID8258
Feature activation-1.18e+00

Pos logit contributions
apy+3.820
 floats+3.283
 are+3.236
 were+3.224
ades+3.140
Neg logit contributions
 Tooth-4.360
ior-4.262
othy-4.173
bur-4.168
umper-4.072
Token face
Token ID1986
Feature activation-1.22e+00

Pos logit contributions
apy+2.032
 floats+1.718
 are+1.704
 were+1.665
ades+1.628
Neg logit contributions
othy-2.447
ior-2.405
 Tooth-2.399
bur-2.395
umper-2.392
Token.
Token ID13
Feature activation-2.47e+00

Pos logit contributions
apy+2.417
 floats+2.102
 are+2.061
ades+2.044
 were+1.990
Neg logit contributions
othy-2.240
umper-2.235
bur-2.210
 Tooth-2.200
ior-2.169
Token The
Token ID383
Feature activation-1.43e+00

Pos logit contributions
apy+4.799
ades+4.598
 are+4.456
 floats+4.349
 were+4.212
Neg logit contributions
ior-4.574
 Tooth-4.434
asured-4.223
othy-4.190
umper-3.969
Token sandwiches
Token ID29777
Feature activation-3.46e+00

Pos logit contributions
apy+2.526
 floats+1.993
ades+1.988
 are+1.944
 were+1.896
Neg logit contributions
 Tooth-3.047
othy-3.042
bur-3.029
umper-2.979
ior-2.935
Token are
Token ID389
Feature activation-9.85e-01

Pos logit contributions
apy+6.646
ades+6.017
 floats+5.510
cakes+5.306
 vows+5.292
Neg logit contributions
bur-6.519
othy-6.306
 Tooth-6.285
my-6.220
umper-6.195
Token spicy
Token ID26880
Feature activation-3.77e-01

Pos logit contributions
apy+1.885
 floats+1.661
 are+1.612
ades+1.599
 were+1.534
Neg logit contributions
bur-1.874
asured-1.849
othy-1.831
umper-1.830
 Tooth-1.811
Token!
Token ID0
Feature activation-2.37e+00

Pos logit contributions
apy+0.802
 floats+0.697
 are+0.690
ades+0.676
 were+0.672
Neg logit contributions
umper-0.640
bur-0.632
othy-0.631
ior-0.622
 Tooth-0.616
Token They
Token ID1119
Feature activation-2.24e+00

Pos logit contributions
apy+4.257
ades+3.881
 floats+3.873
 are+3.786
 were+3.632
Neg logit contributions
ior-4.781
 Tooth-4.707
umper-4.486
asured-4.471
othy-4.388
Token have
Token ID423
Feature activation-8.43e-01

Pos logit contributions
apy+4.000
ades+3.574
 floats+3.243
cakes+3.235
ves+3.061
Neg logit contributions
bur-4.686
othy-4.686
umper-4.624
ior-4.569
 Tooth-4.530
Token It
Token ID632
Feature activation-2.09e+00

Pos logit contributions
apy+2.164
 are+1.875
 floats+1.865
ades+1.835
 were+1.815
Neg logit contributions
umper-2.446
 Tooth-2.438
othy-2.387
ior-2.352
bur-2.321
Token starts
Token ID4940
Feature activation-9.94e-01

Pos logit contributions
apy+4.255
ades+3.641
 were+3.371
cakes+3.339
 are+3.323
Neg logit contributions
othy-3.869
asured-3.793
bur-3.714
 Tooth-3.683
umper-3.602
Token to
Token ID284
Feature activation-2.52e+00

Pos logit contributions
apy+2.235
 floats+2.027
 are+2.004
 were+1.924
ades+1.918
Neg logit contributions
asured-1.553
othy-1.541
umper-1.531
bur-1.520
 Tooth-1.498
Token rain
Token ID6290
Feature activation-1.27e+00

Pos logit contributions
apy+4.920
ades+4.273
 vows+4.068
cakes+3.953
 floats+3.909
Neg logit contributions
othy-5.267
bur-5.259
 Tooth-5.241
umper-5.043
asured-4.949
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
apy+2.659
 floats+2.190
 are+2.169
ades+2.162
 vows+2.118
Neg logit contributions
bur-2.282
othy-2.273
umper-2.201
asured-2.196
ior-2.170
Token The
Token ID383
Feature activation-3.45e+00

Pos logit contributions
apy+1.737
 floats+1.505
 are+1.504
ades+1.463
 were+1.450
Neg logit contributions
umper-2.044
 Tooth-2.019
othy-2.000
ior-1.961
bur-1.959
Token rain
Token ID6290
Feature activation-2.51e+00

Pos logit contributions
apy+7.724
ades+6.587
ves+6.077
 widened+5.930
 were+5.922
Neg logit contributions
 Tooth-6.560
othy-6.165
my-5.868
 owl-5.841
bur-5.703
Tokendrops
Token ID49253
Feature activation-2.79e+00

Pos logit contributions
apy+4.892
ades+4.163
 vows+3.933
 were+3.846
 are+3.803
Neg logit contributions
bur-4.619
othy-4.611
asured-4.459
my-4.324
ior-4.280
Token fall
Token ID2121
Feature activation-1.41e+00

Pos logit contributions
apy+5.042
ades+4.698
 vows+4.370
cakes+4.301
 floats+4.266
Neg logit contributions
bur-5.578
othy-5.311
umper-5.149
asured-5.104
my-5.027
Token on
Token ID319
Feature activation-5.80e-01

Pos logit contributions
apy+3.275
 floats+2.895
ades+2.873
 are+2.822
 vows+2.796
Neg logit contributions
bur-2.132
umper-2.111
othy-2.045
ior-1.992
asured-1.970
Token their
Token ID511
Feature activation-2.23e+00

Pos logit contributions
apy+0.985
 floats+0.839
 are+0.828
ades+0.803
 were+0.787
Neg logit contributions
umper-1.241
bur-1.233
othy-1.226
 Tooth-1.211
ior-1.194
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
apy+1.761
 floats+1.545
 are+1.476
ades+1.452
 were+1.423
Neg logit contributions
umper-1.685
othy-1.684
asured-1.664
bur-1.653
 Tooth-1.626
Token Everyone
Token ID11075
Feature activation-1.12e+00

Pos logit contributions
umper+0.032
othy+0.031
bur+0.031
ior+0.031
asured+0.031
Neg logit contributions
apy-0.028
 floats-0.024
 are-0.024
 were-0.023
ades-0.023
Token around
Token ID1088
Feature activation-1.48e+00

Pos logit contributions
apy+2.211
 floats+1.943
 are+1.928
ades+1.899
 vows+1.839
Neg logit contributions
othy-2.053
asured-2.041
umper-2.021
bur-1.965
ior-1.949
Token her
Token ID607
Feature activation-4.44e-01

Pos logit contributions
apy+2.825
 floats+2.391
ades+2.373
 vows+2.255
 are+2.202
Neg logit contributions
umper-2.943
othy-2.938
bur-2.892
 Tooth-2.782
asured-2.773
Token was
Token ID373
Feature activation-6.06e-01

Pos logit contributions
apy+0.901
 floats+0.783
 are+0.768
ades+0.750
 were+0.731
Neg logit contributions
umper-0.808
othy-0.806
bur-0.798
ior-0.792
asured-0.789
Token so
Token ID523
Feature activation-3.45e+00

Pos logit contributions
apy+1.217
 floats+1.080
 are+1.063
ades+1.020
 were+1.008
Neg logit contributions
othy-1.112
asured-1.099
umper-1.099
bur-1.094
 Tooth-1.065
Token proud
Token ID6613
Feature activation+1.19e-01

Pos logit contributions
 floats+7.207
apy+7.067
 vows+6.925
ades+6.806
ves+6.692
Neg logit contributions
asured-6.227
othy-5.678
my-5.470
umper-5.212
nie-5.043
Token!
Token ID0
Feature activation-1.11e+00

Pos logit contributions
umper+0.274
othy+0.269
bur+0.269
ior+0.265
asured+0.265
Neg logit contributions
apy-0.183
 are-0.153
 floats-0.152
 were-0.147
ades-0.143
Token\n
Token ID198
Feature activation-8.22e-01

Pos logit contributions
apy+1.988
 floats+1.713
ades+1.676
 are+1.676
 vows+1.592
Neg logit contributions
othy-2.263
umper-2.218
bur-2.190
ior-2.190
asured-2.138
Token\n
Token ID198
Feature activation-1.01e+00

Pos logit contributions
apy+1.428
 floats+1.268
 are+1.201
ades+1.149
 vows+1.144
Neg logit contributions
othy-1.701
umper-1.701
bur-1.686
ior-1.623
asured-1.622
TokenJen
Token ID44875
Feature activation+4.54e-01

Pos logit contributions
apy+1.989
 floats+1.779
 are+1.714
ades+1.665
 vows+1.654
Neg logit contributions
othy-1.893
umper-1.844
bur-1.836
asured-1.780
ior-1.751
Token on
Token ID319
Feature activation-9.23e-01

Pos logit contributions
apy+3.536
 floats+3.088
ades+3.059
 are+2.988
 vows+2.955
Neg logit contributions
bur-2.085
umper-2.052
othy-1.972
 Tooth-1.915
asured-1.898
Token the
Token ID262
Feature activation-1.73e+00

Pos logit contributions
apy+1.784
 floats+1.524
 are+1.492
ades+1.479
 were+1.445
Neg logit contributions
umper-1.759
othy-1.744
bur-1.736
 Tooth-1.715
ior-1.664
Token flowers
Token ID12734
Feature activation-1.26e+00

Pos logit contributions
apy+3.485
 are+2.777
ades+2.777
 were+2.764
 floats+2.700
Neg logit contributions
bur-3.271
 Tooth-3.268
umper-3.216
othy-3.212
ior-3.124
Token and
Token ID290
Feature activation-8.52e-01

Pos logit contributions
apy+2.962
 floats+2.622
ades+2.563
 vows+2.508
 are+2.464
Neg logit contributions
umper-1.891
bur-1.877
othy-1.855
asured-1.791
 Tooth-1.770
Token make
Token ID787
Feature activation-2.00e+00

Pos logit contributions
apy+1.410
ades+1.197
 floats+1.146
 are+1.126
 were+1.091
Neg logit contributions
othy-1.842
umper-1.837
bur-1.824
 Tooth-1.801
ior-1.788
Token them
Token ID606
Feature activation-3.43e+00

Pos logit contributions
apy+3.608
 are+3.028
 were+2.965
 floats+2.965
ades+2.908
Neg logit contributions
othy-4.035
 Tooth-3.936
umper-3.933
bur-3.905
asured-3.881
Token happy
Token ID3772
Feature activation-2.07e+00

Pos logit contributions
apy+6.846
ades+6.090
 floats+5.778
 vows+5.736
cakes+5.698
Neg logit contributions
asured-6.574
othy-6.217
bur-6.154
my-6.147
umper-5.838
Token.
Token ID13
Feature activation+3.70e-01

Pos logit contributions
apy+4.756
ades+3.975
 floats+3.958
 vows+3.773
 are+3.761
Neg logit contributions
asured-3.383
 Tooth-3.287
othy-3.263
umper-3.220
bur-3.204
Token She
Token ID1375
Feature activation-6.57e-01

Pos logit contributions
umper+0.843
bur+0.827
othy+0.826
asured+0.815
ior+0.813
Neg logit contributions
apy-0.586
 are-0.497
 floats-0.491
 were-0.479
ades-0.460
Token also
Token ID635
Feature activation-7.69e-01

Pos logit contributions
apy+1.325
 are+1.120
 floats+1.116
ades+1.111
 were+1.061
Neg logit contributions
othy-1.198
umper-1.197
asured-1.184
bur-1.183
 Tooth-1.152
Token liked
Token ID8288
Feature activation-2.89e-01

Pos logit contributions
apy+1.579
ades+1.349
 floats+1.340
 are+1.311
 were+1.248
Neg logit contributions
othy-1.389
umper-1.383
asured-1.361
bur-1.351
 Tooth-1.332
Token the
Token ID262
Feature activation-1.35e+00

Pos logit contributions
apy+2.219
 floats+1.898
ades+1.878
 are+1.835
 were+1.777
Neg logit contributions
 Tooth-2.190
othy-2.145
bur-2.140
ior-2.101
umper-2.077
Token couch
Token ID18507
Feature activation-1.18e+00

Pos logit contributions
apy+2.548
 were+2.050
ades+2.033
 are+2.030
 floats+2.018
Neg logit contributions
 Tooth-2.790
ior-2.707
othy-2.695
bur-2.669
umper-2.624
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
apy+2.816
 floats+2.467
 are+2.411
ades+2.392
 were+2.322
Neg logit contributions
othy-1.742
asured-1.699
bur-1.697
 Tooth-1.693
ior-1.685
Token watch
Token ID2342
Feature activation-2.68e+00

Pos logit contributions
apy+2.148
ades+1.864
 floats+1.713
cakes+1.676
 were+1.671
Neg logit contributions
othy-2.682
bur-2.635
 Tooth-2.615
ior-2.606
umper-2.578
Token the
Token ID262
Feature activation-2.44e+00

Pos logit contributions
apy+5.919
ades+5.225
 floats+4.976
 are+4.850
 were+4.784
Neg logit contributions
 Tooth-4.692
othy-4.459
ior-4.405
asured-4.268
bur-4.176
Token film
Token ID2646
Feature activation-3.42e+00

Pos logit contributions
apy+4.179
 were+3.163
 are+3.098
ades+3.084
 floats+2.823
Neg logit contributions
 Tooth-5.794
ior-5.404
othy-5.267
bur-5.233
nie-5.181
Token.
Token ID13
Feature activation-2.29e+00

Pos logit contributions
apy+8.793
ades+7.355
 floats+7.281
 are+6.940
 were+6.844
Neg logit contributions
 Tooth-4.901
asured-4.866
othy-4.787
my-4.591
bur-4.542
Token They
Token ID1119
Feature activation-1.94e+00

Pos logit contributions
apy+3.990
ades+3.802
 are+3.444
 floats+3.402
 were+3.328
Neg logit contributions
 Tooth-4.807
ior-4.767
othy-4.582
asured-4.459
umper-4.370
Token see
Token ID766
Feature activation-2.02e+00

Pos logit contributions
apy+3.539
ades+2.892
cakes+2.725
 floats+2.682
ves+2.610
Neg logit contributions
othy-4.198
bur-4.172
umper-4.110
 Tooth-4.032
ior-3.978
Token a
Token ID257
Feature activation-1.90e+00

Pos logit contributions
apy+4.010
ades+3.474
cakes+3.257
 floats+3.211
 are+3.200
Neg logit contributions
 Tooth-3.805
othy-3.666
umper-3.629
bur-3.589
ior-3.562
Token girl
Token ID2576
Feature activation-2.07e-01

Pos logit contributions
apy+3.453
 are+2.914
 were+2.831
ades+2.757
 floats+2.753
Neg logit contributions
 Tooth-3.945
othy-3.677
ior-3.664
bur-3.663
umper-3.636
Token\n
Token ID198
Feature activation-2.33e+00

Pos logit contributions
apy+3.668
 floats+3.322
 are+3.139
 vows+3.074
 were+3.007
Neg logit contributions
umper-3.427
 Tooth-3.416
othy-3.397
bur-3.369
ior-3.347
TokenInside
Token ID24441
Feature activation-2.27e+00

Pos logit contributions
apy+5.161
 floats+4.734
 are+4.672
 were+4.507
 vows+4.436
Neg logit contributions
 Tooth-3.742
umper-3.740
bur-3.704
othy-3.655
ior-3.633
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
apy+4.963
 floats+4.206
ades+4.129
 vows+3.943
cakes+3.880
Neg logit contributions
bur-4.053
othy-3.857
umper-3.748
my-3.732
 Tooth-3.698
Token box
Token ID3091
Feature activation-2.48e+00

Pos logit contributions
apy+3.017
 floats+2.469
ades+2.406
 were+2.295
 are+2.286
Neg logit contributions
 Tooth-2.569
bur-2.514
othy-2.491
umper-2.477
ior-2.440
Token,
Token ID11
Feature activation-2.53e+00

Pos logit contributions
apy+5.380
 floats+4.593
ades+4.405
 vows+4.239
 widened+4.187
Neg logit contributions
othy-4.341
bur-4.151
my-4.130
asured-4.103
umper-3.960
Token there
Token ID612
Feature activation-3.41e+00

Pos logit contributions
apy+4.640
ades+4.162
 floats+3.958
 widened+3.755
 vows+3.685
Neg logit contributions
bur-4.897
asured-4.843
my-4.729
othy-4.722
 Tooth-4.706
Token are
Token ID389
Feature activation-2.10e+00

Pos logit contributions
apy+7.857
ades+6.653
 floats+6.521
cakes+6.277
 banners+6.140
Neg logit contributions
my-5.727
othy-5.726
bur-5.594
asured-5.376
i-5.294
Token two
Token ID734
Feature activation-2.52e+00

Pos logit contributions
apy+3.532
 floats+2.829
ades+2.775
 are+2.589
 widened+2.532
Neg logit contributions
othy-4.573
asured-4.541
bur-4.496
 Tooth-4.451
my-4.366
Token dolls
Token ID36062
Feature activation-1.76e+00

Pos logit contributions
apy+5.337
ades+4.203
 floats+4.150
cakes+4.041
 vows+3.997
Neg logit contributions
asured-4.405
othy-4.386
umper-4.335
 Tooth-4.328
ior-4.285
Token.
Token ID13
Feature activation-2.84e+00

Pos logit contributions
apy+4.059
 floats+3.405
ades+3.383
 are+3.220
 vows+3.199
Neg logit contributions
bur-2.826
umper-2.822
othy-2.781
asured-2.743
 Tooth-2.673
Token One
Token ID1881
Feature activation-2.39e+00

Pos logit contributions
apy+5.640
ades+5.062
 are+4.479
 floats+4.456
 were+4.447
Neg logit contributions
ior-5.792
 Tooth-5.770
othy-5.341
asured-5.329
umper-5.142
Token brighter
Token ID26083
Feature activation+9.01e-01

Pos logit contributions
umper+0.470
bur+0.460
othy+0.460
ior+0.452
asured+0.451
Neg logit contributions
apy-0.400
 are-0.344
 floats-0.343
 were-0.333
ades-0.324
Token.
Token ID13
Feature activation-2.46e-01

Pos logit contributions
umper+1.948
bur+1.876
othy+1.872
ior+1.850
 Tooth+1.833
Neg logit contributions
apy-1.582
 are-1.362
 floats-1.359
 were-1.339
 they-1.286
Token Steve
Token ID6542
Feature activation-3.55e-01

Pos logit contributions
apy+0.490
 are+0.427
 floats+0.426
 were+0.412
ades+0.408
Neg logit contributions
umper-0.456
othy-0.452
ior-0.443
bur-0.443
 Tooth-0.438
Token was
Token ID373
Feature activation+4.01e-03

Pos logit contributions
apy+0.688
 are+0.590
 floats+0.590
ades+0.571
 were+0.559
Neg logit contributions
umper-0.679
othy-0.671
bur-0.664
asured-0.654
 Tooth-0.648
Token amazed
Token ID24562
Feature activation+1.78e+00

Pos logit contributions
umper+0.008
othy+0.008
bur+0.008
asured+0.008
 Tooth+0.008
Neg logit contributions
apy-0.008
 floats-0.007
 are-0.007
 were-0.006
ades-0.006
Tokenâ
Token ID22940
Feature activation+2.66e+00

Pos logit contributions
umper+3.823
othy+3.735
bur+3.734
ior+3.669
 Tooth+3.667
Neg logit contributions
apy-3.043
 were-2.817
 they-2.748
 are-2.748
 floats-2.713
Token€
Token ID26391
Feature activation-7.30e-01

Pos logit contributions
umper+4.812
asured+4.597
bur+4.527
 Tooth+4.425
othy+4.366
Neg logit contributions
apy-5.353
 are-5.191
 floats-5.024
 were-4.990
 they-4.742
Token�
Token ID447
Feature activation-1.12e+00

Pos logit contributions
apy+1.456
 floats+1.276
 are+1.268
 were+1.207
ades+1.197
Neg logit contributions
umper-1.330
bur-1.317
othy-1.315
ior-1.279
 Tooth-1.272
Token�
Token ID251
Feature activation+1.26e+00

Pos logit contributions
apy+1.743
 floats+1.473
 are+1.454
ades+1.379
 were+1.359
Neg logit contributions
umper-2.510
bur-2.500
othy-2.479
ior-2.454
 Tooth-2.435
Tokenhis
Token ID14363
Feature activation-2.71e-01

Pos logit contributions
umper+2.929
bur+2.906
asured+2.864
othy+2.839
ior+2.812
Neg logit contributions
apy-2.037
 are-1.750
 were-1.706
 floats-1.694
 they-1.577
Token wings
Token ID12098
Feature activation-1.23e+00

Pos logit contributions
apy+0.519
 are+0.451
 floats+0.449
 were+0.437
ades+0.427
Neg logit contributions
umper-0.521
othy-0.517
bur-0.509
 Tooth-0.505
asured-0.505
Token boy
Token ID2933
Feature activation+1.73e+00

Pos logit contributions
umper+0.652
othy+0.637
bur+0.636
asured+0.624
ior+0.621
Neg logit contributions
apy-0.784
 are-0.691
 floats-0.691
 were-0.673
ades-0.656
Token named
Token ID3706
Feature activation-1.65e-01

Pos logit contributions
umper+2.962
bur+2.907
ior+2.840
asured+2.818
othy+2.817
Neg logit contributions
apy-3.818
 are-3.427
 were-3.405
 floats-3.357
 they-3.349
Token Tim
Token ID5045
Feature activation+2.60e+00

Pos logit contributions
apy+0.317
 floats+0.272
 are+0.271
 were+0.262
ades+0.261
Neg logit contributions
umper-0.325
bur-0.319
othy-0.318
ior-0.313
 Tooth-0.311
Tokenmy
Token ID1820
Feature activation+1.81e+00

Pos logit contributions
 Tooth+2.129
umper+1.974
ior+1.970
asured+1.937
othy+1.878
Neg logit contributions
 are-7.827
 were-7.711
apy-7.328
 they-7.081
 floats-7.021
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
umper+4.345
bur+4.250
othy+4.219
asured+4.203
ior+4.200
Neg logit contributions
apy-2.640
 are-2.346
 were-2.324
 they-2.186
 floats-2.172
Token Tim
Token ID5045
Feature activation+2.78e+00

Pos logit contributions
apy+0.309
 floats+0.268
 are+0.267
ades+0.257
 were+0.257
Neg logit contributions
umper-0.301
bur-0.297
othy-0.297
ior-0.292
asured-0.289
Tokenmy
Token ID1820
Feature activation+1.58e+00

Pos logit contributions
 Tooth+2.102
umper+2.042
asured+1.805
ior+1.790
ppa+1.750
Neg logit contributions
 were-8.679
 are-8.669
apy-8.256
 floats-7.954
 they-7.631
Token loved
Token ID6151
Feature activation+7.06e-01

Pos logit contributions
umper+3.529
bur+3.414
asured+3.336
 Tooth+3.331
othy+3.329
Neg logit contributions
apy-2.665
 were-2.419
 are-2.382
 floats-2.290
 they-2.217
Token sa
Token ID473
Feature activation+9.84e-01

Pos logit contributions
umper+1.289
othy+1.253
bur+1.244
ior+1.233
asured+1.225
Neg logit contributions
apy-1.438
 floats-1.276
 are-1.264
 were-1.238
ades-1.203
Tokenus
Token ID385
Feature activation+2.46e-01

Pos logit contributions
othy+1.106
umper+1.085
 Tooth+1.055
bur+1.043
asured+1.041
Neg logit contributions
apy-2.696
 are-2.454
 were-2.417
 floats-2.412
ades-2.380
Tokenages
Token ID1095
Feature activation+5.12e-02

Pos logit contributions
umper+0.392
bur+0.381
othy+0.380
ior+0.372
asured+0.371
Neg logit contributions
apy-0.558
 are-0.497
 floats-0.495
 were-0.485
ades-0.475
Token gave
Token ID2921
Feature activation+1.23e+00

Pos logit contributions
umper+0.068
othy+0.066
bur+0.066
ior+0.065
 Tooth+0.065
Neg logit contributions
apy-0.056
 are-0.048
 floats-0.048
 were-0.046
ades-0.046
Token it
Token ID340
Feature activation-9.46e-02

Pos logit contributions
umper+2.369
asured+2.316
othy+2.298
ior+2.253
bur+2.252
Neg logit contributions
apy-2.444
 are-2.184
 were-2.147
 floats-2.112
 they-2.039
Token to
Token ID284
Feature activation+4.56e-01

Pos logit contributions
apy+0.200
 floats+0.176
 are+0.175
 were+0.170
ades+0.168
Neg logit contributions
umper-0.165
bur-0.162
othy-0.161
 Tooth-0.158
asured-0.158
Token him
Token ID683
Feature activation+3.37e-01

Pos logit contributions
umper+0.901
bur+0.874
othy+0.873
asured+0.862
ior+0.861
Neg logit contributions
apy-0.871
 are-0.763
 floats-0.753
 were-0.740
ades-0.713
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
umper+0.700
bur+0.684
othy+0.682
ior+0.672
asured+0.669
Neg logit contributions
apy-0.603
 are-0.519
 floats-0.514
 were-0.503
ades-0.487
Token Tim
Token ID5045
Feature activation+3.03e+00

Pos logit contributions
apy+0.564
 floats+0.482
 are+0.481
ades+0.460
 were+0.455
Neg logit contributions
umper-0.714
bur-0.709
othy-0.706
ior-0.699
 Tooth-0.686
Tokenmy
Token ID1820
Feature activation+3.89e-01

Pos logit contributions
 Tooth+2.521
umper+2.415
asured+2.146
ior+2.002
bur+1.894
Neg logit contributions
 were-9.274
 are-9.217
apy-8.786
 floats-8.245
ades-7.994
Token loved
Token ID6151
Feature activation+7.55e-01

Pos logit contributions
umper+0.788
bur+0.769
othy+0.766
ior+0.754
 Tooth+0.751
Neg logit contributions
apy-0.714
 are-0.619
 floats-0.614
 were-0.604
ades-0.581
Token the
Token ID262
Feature activation-2.23e-01

Pos logit contributions
umper+1.553
othy+1.502
bur+1.496
ior+1.485
asured+1.471
Neg logit contributions
apy-1.394
 are-1.205
 floats-1.201
 were-1.173
 they-1.142
Token soup
Token ID17141
Feature activation+4.15e-01

Pos logit contributions
apy+0.390
 floats+0.330
 are+0.324
 were+0.314
ades+0.311
Neg logit contributions
umper-0.475
bur-0.468
othy-0.466
 Tooth-0.464
ior-0.458
Token and
Token ID290
Feature activation+7.95e-01

Pos logit contributions
umper+0.760
othy+0.738
bur+0.738
ior+0.726
asured+0.721
Neg logit contributions
apy-0.844
 are-0.746
 floats-0.737
 were-0.727
ades-0.703
Token shrugged
Token ID28271
Feature activation+6.88e-01

Pos logit contributions
umper+2.462
bur+2.411
ior+2.374
 Tooth+2.363
othy+2.357
Neg logit contributions
apy-1.753
 were-1.506
 are-1.494
 floats-1.452
 they-1.365
Token and
Token ID290
Feature activation+6.64e-01

Pos logit contributions
umper+1.296
bur+1.259
othy+1.246
ior+1.228
 Tooth+1.225
Neg logit contributions
apy-1.374
 are-1.218
 were-1.198
 floats-1.189
 they-1.140
Token laughed
Token ID13818
Feature activation-9.30e-01

Pos logit contributions
umper+1.368
bur+1.332
othy+1.319
ior+1.310
asured+1.306
Neg logit contributions
apy-1.218
 are-1.053
 floats-1.040
 were-1.037
 they-0.991
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
apy+1.864
 floats+1.684
 are+1.632
ades+1.595
 were+1.547
Neg logit contributions
umper-1.668
othy-1.643
asured-1.635
 Tooth-1.627
ior-1.614
Token She
Token ID1375
Feature activation+1.10e+00

Pos logit contributions
umper+2.498
bur+2.441
othy+2.438
asured+2.428
 Tooth+2.372
Neg logit contributions
apy-1.726
 are-1.446
 were-1.433
 floats-1.408
 they-1.365
Token gave
Token ID2921
Feature activation+2.86e+00

Pos logit contributions
umper+2.310
bur+2.241
ior+2.208
othy+2.189
 Tooth+2.172
Neg logit contributions
apy-2.000
 were-1.737
 are-1.720
 floats-1.710
 they-1.635
Token Tommy
Token ID19919
Feature activation+1.50e+00

Pos logit contributions
asured+5.690
umper+5.424
othy+5.270
ior+5.192
 Tooth+5.078
Neg logit contributions
apy-5.635
 are-5.502
 were-5.466
 they-5.201
 floats-4.962
Token some
Token ID617
Feature activation+5.36e-01

Pos logit contributions
umper+3.382
bur+3.292
ior+3.263
asured+3.252
othy+3.246
Neg logit contributions
apy-2.444
 are-2.152
 were-2.087
 floats-1.999
 they-1.998
Token of
Token ID286
Feature activation+1.57e+00

Pos logit contributions
umper+1.113
bur+1.093
othy+1.088
ior+1.071
asured+1.069
Neg logit contributions
apy-0.961
 are-0.834
 floats-0.820
 were-0.805
ades-0.779
Token the
Token ID262
Feature activation+7.64e-01

Pos logit contributions
umper+3.153
asured+3.102
othy+3.074
ior+3.064
bur+3.054
Neg logit contributions
apy-2.944
 are-2.619
 were-2.553
 floats-2.493
 they-2.452
Token candy
Token ID18550
Feature activation-8.78e-02

Pos logit contributions
umper+1.545
bur+1.507
othy+1.506
asured+1.493
ior+1.481
Neg logit contributions
apy-1.414
 are-1.268
 floats-1.224
 were-1.220
ades-1.158
Token j
Token ID474
Feature activation+3.60e-01

Pos logit contributions
apy+0.049
 are+0.042
 floats+0.042
ades+0.041
 were+0.041
Neg logit contributions
umper-0.054
othy-0.053
bur-0.053
asured-0.052
ior-0.052
Tokenogged
Token ID42545
Feature activation-3.14e-01

Pos logit contributions
umper+0.736
bur+0.722
othy+0.720
 Tooth+0.709
ior+0.709
Neg logit contributions
apy-0.661
 are-0.567
 floats-0.564
 were-0.550
ades-0.538
Token back
Token ID736
Feature activation+7.10e-01

Pos logit contributions
apy+0.660
 floats+0.576
 are+0.575
 were+0.554
ades+0.552
Neg logit contributions
umper-0.547
othy-0.544
bur-0.541
asured-0.530
ior-0.527
Token home
Token ID1363
Feature activation-8.82e-01

Pos logit contributions
umper+1.076
bur+1.025
othy+1.021
 Tooth+1.016
ior+1.014
Neg logit contributions
apy-1.682
 are-1.522
 floats-1.513
 were-1.491
 they-1.449
Token.
Token ID13
Feature activation+5.37e-02

Pos logit contributions
apy+1.738
 floats+1.565
 are+1.521
ades+1.466
 were+1.439
Neg logit contributions
asured-1.634
othy-1.611
bur-1.590
umper-1.590
 Tooth-1.574
Token Tim
Token ID5045
Feature activation+3.57e+00

Pos logit contributions
umper+0.113
othy+0.111
bur+0.111
ior+0.109
asured+0.109
Neg logit contributions
apy-0.094
 are-0.081
 floats-0.080
 were-0.078
ades-0.077
Tokenmy
Token ID1820
Feature activation+1.05e+00

Pos logit contributions
 Tooth+3.384
ior+2.881
asured+2.862
umper+2.788
othy+2.601
Neg logit contributions
 were-10.927
 are-10.845
 floats-9.793
apy-9.449
 they-9.281
Token couldn
Token ID3521
Feature activation-3.94e-01

Pos logit contributions
umper+2.268
bur+2.203
ior+2.188
othy+2.174
 Tooth+2.153
Neg logit contributions
apy-1.790
 were-1.593
 are-1.589
 floats-1.560
 they-1.466
Token't
Token ID470
Feature activation+3.85e-01

Pos logit contributions
apy+0.556
 floats+0.462
 are+0.446
 were+0.428
ades+0.428
Neg logit contributions
umper-0.957
othy-0.947
bur-0.942
 Tooth-0.920
ior-0.915
Token wait
Token ID4043
Feature activation+7.25e-01

Pos logit contributions
umper+0.885
bur+0.867
othy+0.865
ior+0.856
asured+0.852
Neg logit contributions
apy-0.598
 are-0.509
 floats-0.500
 were-0.490
ades-0.466
Token to
Token ID284
Feature activation+7.43e-01

Pos logit contributions
umper+1.341
bur+1.297
ior+1.287
othy+1.281
 Tooth+1.269
Neg logit contributions
apy-1.483
 are-1.315
 floats-1.289
 were-1.282
 they-1.250
Token chain
Token ID6333
Feature activation-2.45e-01

Pos logit contributions
umper+0.050
othy+0.049
bur+0.049
asured+0.048
 Tooth+0.047
Neg logit contributions
apy-0.050
 floats-0.043
 are-0.043
 were-0.042
ades-0.041
Token.
Token ID13
Feature activation-9.93e-01

Pos logit contributions
apy+0.459
 floats+0.396
 are+0.391
 were+0.378
ades+0.374
Neg logit contributions
umper-0.481
othy-0.479
bur-0.475
asured-0.468
 Tooth-0.464
Token She
Token ID1375
Feature activation-2.85e-01

Pos logit contributions
apy+1.765
 floats+1.574
 are+1.522
ades+1.481
 were+1.471
Neg logit contributions
umper-1.980
othy-1.975
bur-1.949
 Tooth-1.946
ior-1.929
Token was
Token ID373
Feature activation+2.66e-01

Pos logit contributions
apy+0.543
 are+0.468
 floats+0.467
ades+0.449
 were+0.446
Neg logit contributions
umper-0.548
othy-0.546
bur-0.537
asured-0.534
 Tooth-0.526
Token happy
Token ID3772
Feature activation-2.09e-01

Pos logit contributions
umper+0.561
bur+0.549
othy+0.548
ior+0.541
asured+0.538
Neg logit contributions
apy-0.470
 are-0.403
 floats-0.401
 were-0.390
ades-0.380
Token with
Token ID351
Feature activation+1.18e+00

Pos logit contributions
apy+0.404
 floats+0.349
 are+0.347
 were+0.335
ades+0.333
Neg logit contributions
umper-0.404
othy-0.398
bur-0.397
asured-0.392
 Tooth-0.391
Token her
Token ID607
Feature activation+1.10e+00

Pos logit contributions
umper+2.615
ior+2.558
bur+2.535
othy+2.510
asured+2.506
Neg logit contributions
apy-2.006
 are-1.758
 floats-1.689
 were-1.688
 they-1.627
Token chain
Token ID6333
Feature activation-3.73e-02

Pos logit contributions
umper+2.023
ior+1.947
bur+1.945
othy+1.935
asured+1.906
Neg logit contributions
apy-2.263
 are-2.076
 were-2.020
 floats-2.000
 they-1.917
Token.
Token ID13
Feature activation-6.36e-01

Pos logit contributions
apy+0.067
 floats+0.058
 are+0.057
 were+0.055
ades+0.055
Neg logit contributions
umper-0.076
othy-0.075
bur-0.075
asured-0.073
 Tooth-0.073
Token\n
Token ID198
Feature activation-4.22e-01

Pos logit contributions
apy+1.118
 floats+0.973
 are+0.945
ades+0.923
 were+0.917
Neg logit contributions
umper-1.311
othy-1.297
bur-1.285
 Tooth-1.277
ior-1.276
Token\n
Token ID198
Feature activation-2.59e-01

Pos logit contributions
apy+0.706
 floats+0.610
 are+0.589
 were+0.568
ades+0.559
Neg logit contributions
umper-0.918
othy-0.901
bur-0.901
ior-0.882
 Tooth-0.881
Token�
Token ID241
Feature activation+1.09e+00

Pos logit contributions
umper+2.592
bur+2.486
asured+2.425
 Tooth+2.411
othy+2.405
Neg logit contributions
apy-3.200
 floats-2.868
 are-2.865
ades-2.756
 were-2.750
TokenOh
Token ID5812
Feature activation+2.58e-01

Pos logit contributions
umper+2.371
bur+2.306
othy+2.286
asured+2.282
ior+2.281
Neg logit contributions
apy-1.894
 are-1.632
 were-1.593
 floats-1.580
 they-1.501
Token,
Token ID11
Feature activation+6.30e-01

Pos logit contributions
umper+0.528
bur+0.516
othy+0.515
ior+0.507
asured+0.506
Neg logit contributions
apy-0.469
 are-0.405
 floats-0.403
 were-0.392
ades-0.381
Token thank
Token ID5875
Feature activation+1.42e+00

Pos logit contributions
umper+1.267
bur+1.236
othy+1.231
asured+1.219
ior+1.218
Neg logit contributions
apy-1.166
 are-1.026
 floats-1.010
 were-0.991
 they-0.949
Token you
Token ID345
Feature activation+2.11e-01

Pos logit contributions
umper+2.955
asured+2.927
bur+2.878
othy+2.873
ior+2.830
Neg logit contributions
apy-2.573
 are-2.330
 floats-2.221
 were-2.198
 they-2.174
Token!
Token ID0
Feature activation+1.28e+00

Pos logit contributions
umper+0.407
bur+0.397
othy+0.397
ior+0.390
asured+0.389
Neg logit contributions
apy-0.405
 are-0.352
 floats-0.351
 were-0.342
ades-0.334
Token I
Token ID314
Feature activation-1.36e-02

Pos logit contributions
umper+2.455
bur+2.453
asured+2.427
othy+2.414
ior+2.366
Neg logit contributions
apy-2.491
 are-2.190
 floats-2.134
 were-2.106
 they-2.037
Tokenâ
Token ID22940
Feature activation+1.61e+00

Pos logit contributions
apy+0.023
 floats+0.019
 are+0.019
 were+0.018
ades+0.018
Neg logit contributions
umper-0.030
othy-0.029
bur-0.029
asured-0.029
ior-0.029
Token€
Token ID26391
Feature activation-4.69e-01

Pos logit contributions
umper+2.786
asured+2.761
 Tooth+2.677
bur+2.676
othy+2.646
Neg logit contributions
apy-3.314
 are-3.178
 floats-3.030
 were-3.007
ades-2.804
Tokenâ„¢
Token ID8151
Feature activation-1.01e+00

Pos logit contributions
apy+0.929
 floats+0.805
 are+0.804
 were+0.781
ades+0.771
Neg logit contributions
umper-0.865
bur-0.856
othy-0.852
ior-0.833
 Tooth-0.823
Tokenll
Token ID297
Feature activation+2.30e-01

Pos logit contributions
apy+2.174
 floats+1.933
 are+1.825
ades+1.798
 vows+1.756
Neg logit contributions
umper-1.743
bur-1.732
othy-1.705
ior-1.643
asured-1.622
Token\n
Token ID198
Feature activation+3.28e-01

Pos logit contributions
umper+1.362
bur+1.338
othy+1.331
asured+1.320
ior+1.307
Neg logit contributions
apy-0.945
 are-0.790
 floats-0.775
 were-0.770
ades-0.725
Token\n
Token ID198
Feature activation-4.26e-02

Pos logit contributions
umper+0.762
bur+0.747
othy+0.746
ior+0.737
asured+0.736
Neg logit contributions
apy-0.504
 are-0.421
 floats-0.416
 were-0.406
ades-0.393
Token"
Token ID1
Feature activation+2.84e-01

Pos logit contributions
apy+0.077
 floats+0.067
 are+0.067
 were+0.065
ades+0.063
Neg logit contributions
umper-0.087
othy-0.086
bur-0.085
ior-0.084
 Tooth-0.083
TokenI
Token ID40
Feature activation+1.13e+00

Pos logit contributions
umper+0.521
othy+0.507
bur+0.507
ior+0.498
asured+0.497
Neg logit contributions
apy-0.580
 are-0.508
 floats-0.506
 were-0.495
ades-0.483
Token don
Token ID836
Feature activation-8.59e-01

Pos logit contributions
umper+2.701
othy+2.617
bur+2.617
ior+2.591
 Tooth+2.577
Neg logit contributions
apy-1.768
 are-1.573
 were-1.511
 floats-1.416
 they-1.374
Token't
Token ID470
Feature activation+1.33e+00

Pos logit contributions
apy+1.257
 floats+1.099
 are+1.029
ades+1.004
 were+0.999
Neg logit contributions
umper-2.021
bur-1.977
othy-1.954
 Tooth-1.930
asured-1.890
Token understand
Token ID1833
Feature activation+1.07e+00

Pos logit contributions
umper+2.932
bur+2.874
othy+2.869
asured+2.860
ior+2.846
Neg logit contributions
apy-2.197
 are-2.079
 were-1.944
 floats-1.828
 they-1.770
Token,"
Token ID553
Feature activation+3.11e-01

Pos logit contributions
umper+2.277
othy+2.181
bur+2.169
asured+2.169
ior+2.139
Neg logit contributions
apy-1.876
 are-1.674
 floats-1.604
 were-1.601
 they-1.528
Token she
Token ID673
Feature activation+8.84e-01

Pos logit contributions
umper+0.684
bur+0.670
othy+0.668
asured+0.658
ior+0.658
Neg logit contributions
apy-0.518
 are-0.440
 floats-0.436
 were-0.425
ades-0.409
Token said
Token ID531
Feature activation+1.68e-01

Pos logit contributions
umper+1.926
bur+1.876
othy+1.840
asured+1.830
ior+1.825
Neg logit contributions
apy-1.530
 are-1.302
 were-1.291
 floats-1.279
 they-1.225
Token.
Token ID13
Feature activation+8.82e-02

Pos logit contributions
umper+0.348
othy+0.340
bur+0.340
ior+0.335
asured+0.334
Neg logit contributions
apy-0.298
 are-0.257
 floats-0.256
 were-0.248
ades-0.243
Token,
Token ID11
Feature activation+7.87e-01

Pos logit contributions
apy+1.629
 floats+1.405
 are+1.363
ades+1.340
 were+1.333
Neg logit contributions
umper-1.789
othy-1.784
bur-1.758
ior-1.730
 Tooth-1.720
Token I
Token ID314
Feature activation-5.31e-01

Pos logit contributions
umper+1.670
bur+1.644
othy+1.632
ior+1.620
asured+1.606
Neg logit contributions
apy-1.388
 are-1.215
 floats-1.195
 were-1.170
 they-1.139
Token loved
Token ID6151
Feature activation+5.84e-01

Pos logit contributions
apy+1.015
 floats+0.877
 are+0.842
ades+0.834
 were+0.804
Neg logit contributions
umper-1.029
othy-1.024
bur-1.013
asured-1.007
 Tooth-0.985
Token it
Token ID340
Feature activation+8.44e-02

Pos logit contributions
umper+1.127
bur+1.102
othy+1.093
ior+1.080
 Tooth+1.071
Neg logit contributions
apy-1.127
 floats-0.993
 are-0.990
 were-0.957
ades-0.938
Token!
Token ID0
Feature activation+8.79e-01

Pos logit contributions
umper+0.153
bur+0.149
othy+0.149
ior+0.146
asured+0.146
Neg logit contributions
apy-0.172
 floats-0.151
 are-0.150
 were-0.146
ades-0.144
Token Can
Token ID1680
Feature activation+6.00e-01

Pos logit contributions
umper+1.769
bur+1.749
othy+1.708
asured+1.697
ior+1.686
Neg logit contributions
apy-1.656
 are-1.436
 floats-1.426
 were-1.383
 they-1.334
Token we
Token ID356
Feature activation-8.38e-01

Pos logit contributions
umper+1.239
bur+1.208
othy+1.205
ior+1.195
asured+1.191
Neg logit contributions
apy-1.090
 are-0.948
 floats-0.938
 were-0.917
 they-0.886
Token come
Token ID1282
Feature activation+6.72e-01

Pos logit contributions
apy+1.365
 floats+1.107
ades+1.066
 are+1.034
 were+1.017
Neg logit contributions
umper-1.912
bur-1.897
othy-1.896
asured-1.825
 Tooth-1.822
Token back
Token ID736
Feature activation+3.88e-01

Pos logit contributions
umper+1.214
othy+1.170
bur+1.170
ior+1.164
 Tooth+1.161
Neg logit contributions
apy-1.371
 are-1.228
 floats-1.222
 were-1.189
ades-1.160
Token and
Token ID290
Feature activation-2.66e-01

Pos logit contributions
umper+0.665
bur+0.644
othy+0.644
ior+0.633
 Tooth+0.630
Neg logit contributions
apy-0.831
 are-0.739
 floats-0.735
 were-0.719
ades-0.703
Token join
Token ID4654
Feature activation-7.02e-01

Pos logit contributions
apy+0.401
 are+0.325
 floats+0.325
ades+0.314
 were+0.314
Neg logit contributions
umper-0.624
othy-0.620
bur-0.618
asured-0.607
 Tooth-0.605
Token,
Token ID11
Feature activation+1.28e+00

Pos logit contributions
apy+1.030
 floats+0.899
 are+0.877
ades+0.865
 were+0.857
Neg logit contributions
umper-1.055
othy-1.042
ior-1.036
 Tooth-1.034
bur-1.033
Token Anna
Token ID11735
Feature activation+7.44e-01

Pos logit contributions
umper+2.429
othy+2.369
bur+2.337
ior+2.326
asured+2.307
Neg logit contributions
apy-2.498
 are-2.360
 were-2.230
 floats-2.219
 they-2.187
Token.
Token ID13
Feature activation+1.91e+00

Pos logit contributions
umper+1.651
bur+1.611
othy+1.610
ior+1.577
asured+1.577
Neg logit contributions
apy-1.209
 are-1.071
 floats-1.031
 were-1.027
 they-0.975
Token Come
Token ID7911
Feature activation+1.42e-01

Pos logit contributions
umper+3.992
bur+3.981
asured+3.869
othy+3.810
ppa+3.777
Neg logit contributions
apy-3.566
 are-3.229
 floats-3.029
 were-2.957
 they-2.917
Token on
Token ID319
Feature activation-3.39e-02

Pos logit contributions
umper+0.271
bur+0.265
othy+0.264
ior+0.259
asured+0.258
Neg logit contributions
apy-0.277
 are-0.241
 floats-0.241
 were-0.234
ades-0.230
Token,
Token ID11
Feature activation+7.20e-01

Pos logit contributions
apy+0.057
 floats+0.048
 are+0.048
 were+0.046
ades+0.045
Neg logit contributions
umper-0.074
othy-0.073
bur-0.073
ior-0.072
 Tooth-0.072
Token let
Token ID1309
Feature activation+1.56e-01

Pos logit contributions
umper+1.499
bur+1.453
othy+1.449
asured+1.439
ior+1.430
Neg logit contributions
apy-1.286
 are-1.147
 floats-1.112
 were-1.093
 they-1.051
Token's
Token ID338
Feature activation+1.64e-01

Pos logit contributions
umper+0.272
bur+0.265
othy+0.265
ior+0.260
 Tooth+0.258
Neg logit contributions
apy-0.331
 are-0.291
 floats-0.291
 were-0.284
ades-0.279
Token go
Token ID467
Feature activation+9.79e-01

Pos logit contributions
umper+0.425
othy+0.418
bur+0.418
ior+0.412
asured+0.411
Neg logit contributions
apy-0.209
 are-0.167
 floats-0.167
 were-0.159
ades-0.154
Token and
Token ID290
Feature activation+5.81e-01

Pos logit contributions
umper+1.882
ior+1.820
 Tooth+1.820
othy+1.816
bur+1.806
Neg logit contributions
apy-1.901
 are-1.732
 floats-1.674
 were-1.654
 they-1.601
Token get
Token ID651
Feature activation+2.75e-01

Pos logit contributions
umper+1.453
bur+1.411
othy+1.406
ior+1.399
asured+1.397
Neg logit contributions
apy-0.805
 are-0.674
 floats-0.657
 were-0.637
 they-0.600
Token to
Token ID284
Feature activation-1.49e-01

Pos logit contributions
umper+1.724
bur+1.682
othy+1.674
ior+1.654
 Tooth+1.652
Neg logit contributions
apy-1.297
 are-1.116
 floats-1.092
 were-1.089
 they-1.037
Token the
Token ID262
Feature activation-6.75e-01

Pos logit contributions
apy+0.272
 floats+0.234
 are+0.232
 were+0.226
ades+0.225
Neg logit contributions
umper-0.303
othy-0.299
bur-0.297
 Tooth-0.291
ior-0.291
Token little
Token ID1310
Feature activation+5.26e-01

Pos logit contributions
apy+1.403
 are+1.207
 floats+1.197
ades+1.170
 were+1.160
Neg logit contributions
othy-1.205
umper-1.198
bur-1.186
 Tooth-1.179
ior-1.170
Token boy
Token ID2933
Feature activation+6.28e-01

Pos logit contributions
umper+0.951
bur+0.921
othy+0.919
asured+0.906
 Tooth+0.901
Neg logit contributions
apy-1.082
 are-0.961
 floats-0.960
 were-0.937
ades-0.900
Token.
Token ID13
Feature activation-6.49e-01

Pos logit contributions
umper+1.369
bur+1.341
othy+1.331
 Tooth+1.317
ior+1.311
Neg logit contributions
apy-1.055
 are-0.907
 floats-0.890
 were-0.877
ades-0.836
Token\n
Token ID198
Feature activation-6.20e-01

Pos logit contributions
apy+1.123
 floats+0.960
 are+0.955
ades+0.927
 were+0.902
Neg logit contributions
umper-1.360
ior-1.356
othy-1.353
bur-1.350
 Tooth-1.315
Token\n
Token ID198
Feature activation-7.65e-01

Pos logit contributions
apy+1.072
 floats+0.930
 are+0.904
 were+0.854
ades+0.845
Neg logit contributions
umper-1.314
bur-1.307
othy-1.293
ior-1.280
 Tooth-1.257
TokenThe
Token ID464
Feature activation-4.89e-01

Pos logit contributions
apy+1.390
 floats+1.190
 are+1.188
 were+1.129
ades+1.119
Neg logit contributions
umper-1.578
othy-1.564
ior-1.559
bur-1.557
 Tooth-1.508
Token little
Token ID1310
Feature activation+5.18e-01

Pos logit contributions
apy+0.963
 are+0.837
 floats+0.833
ades+0.804
 were+0.803
Neg logit contributions
othy-0.912
umper-0.902
bur-0.896
 Tooth-0.890
ior-0.881
Token boy
Token ID2933
Feature activation+2.35e-01

Pos logit contributions
umper+0.953
bur+0.921
othy+0.917
asured+0.904
 Tooth+0.904
Neg logit contributions
apy-1.066
 are-0.943
 floats-0.941
 were-0.921
ades-0.883
Token was
Token ID373
Feature activation+8.98e-01

Pos logit contributions
umper+0.470
bur+0.460
othy+0.459
ior+0.452
asured+0.450
Neg logit contributions
apy-0.437
 are-0.379
 floats-0.377
 were-0.367
ades-0.358
Token!
Token ID0
Feature activation-1.05e-01

Pos logit contributions
apy+0.897
 floats+0.777
 are+0.774
ades+0.745
 were+0.736
Neg logit contributions
umper-0.927
othy-0.911
bur-0.907
ior-0.894
 Tooth-0.891
Token<|endoftext|>
Token ID50256
Feature activation-6.01e-01

Pos logit contributions
apy+0.169
 floats+0.143
 are+0.141
 were+0.135
ades+0.135
Neg logit contributions
umper-0.238
bur-0.233
othy-0.232
ior-0.230
 Tooth-0.228
TokenOne
Token ID3198
Feature activation-9.88e-03

Pos logit contributions
apy+1.105
 floats+0.942
 are+0.911
ades+0.891
 vows+0.867
Neg logit contributions
umper-1.232
bur-1.204
othy-1.193
 Tooth-1.167
ior-1.164
Token bright
Token ID6016
Feature activation+4.45e-01

Pos logit contributions
apy+0.022
 floats+0.019
 are+0.019
 were+0.019
ades+0.019
Neg logit contributions
umper-0.016
othy-0.016
bur-0.016
asured-0.016
ior-0.016
Token and
Token ID290
Feature activation+6.99e-01

Pos logit contributions
umper+0.738
bur+0.719
othy+0.717
asured+0.704
ior+0.703
Neg logit contributions
apy-0.979
 floats-0.870
 are-0.868
 were-0.849
ades-0.829
Token sunny
Token ID27737
Feature activation+1.23e-01

Pos logit contributions
umper+1.275
bur+1.251
asured+1.241
othy+1.239
ior+1.228
Neg logit contributions
apy-1.423
 are-1.249
 floats-1.245
 were-1.227
 they-1.175
Token day
Token ID1110
Feature activation-1.62e+00

Pos logit contributions
umper+0.203
othy+0.199
bur+0.198
ior+0.194
asured+0.193
Neg logit contributions
apy-0.270
 are-0.239
 floats-0.238
 were-0.232
ades-0.229
Token,
Token ID11
Feature activation-8.98e-01

Pos logit contributions
apy+3.016
 floats+2.784
 are+2.688
ades+2.607
cakes+2.559
Neg logit contributions
umper-3.037
othy-3.036
bur-2.977
 Tooth-2.868
ior-2.855
Token Jack
Token ID3619
Feature activation+9.43e-01

Pos logit contributions
apy+1.757
 floats+1.570
ades+1.538
 are+1.527
 vows+1.469
Neg logit contributions
othy-1.641
umper-1.633
bur-1.610
ior-1.568
 Tooth-1.545
Token and
Token ID290
Feature activation+1.60e+00

Pos logit contributions
umper+1.991
bur+1.960
 Tooth+1.939
othy+1.935
asured+1.928
Neg logit contributions
apy-1.623
 are-1.461
 were-1.439
 floats-1.425
 they-1.331
Token his
Token ID465
Feature activation-1.14e+00

Pos logit contributions
bur+2.961
umper+2.948
asured+2.900
ior+2.832
 Tooth+2.799
Neg logit contributions
apy-3.201
 are-2.966
 were-2.917
 floats-2.842
 they-2.834
Tokenmy
Token ID1820
Feature activation+1.25e+00

Pos logit contributions
umper+2.733
othy+2.634
ior+2.618
asured+2.617
 Tooth+2.550
Neg logit contributions
apy-3.371
 are-3.222
 were-3.214
 they-3.050
 floats-3.010
Token's
Token ID338
Feature activation+1.05e-01

Pos logit contributions
umper+2.424
othy+2.365
ior+2.335
asured+2.324
bur+2.320
Neg logit contributions
apy-2.389
 are-2.175
 were-2.153
 floats-2.075
 they-2.056
Token makeup
Token ID16029
Feature activation+4.54e-01

Pos logit contributions
umper+0.210
othy+0.206
bur+0.206
ior+0.202
 Tooth+0.202
Neg logit contributions
apy-0.195
 are-0.168
 floats-0.168
 were-0.163
ades-0.160
Token and
Token ID290
Feature activation-2.26e-01

Pos logit contributions
umper+0.913
bur+0.889
othy+0.888
ior+0.879
asured+0.874
Neg logit contributions
apy-0.840
 are-0.732
 floats-0.725
 were-0.714
ades-0.688
Token decided
Token ID3066
Feature activation-1.51e-01

Pos logit contributions
apy+0.406
 are+0.349
 floats+0.349
ades+0.340
 were+0.333
Neg logit contributions
umper-0.455
othy-0.451
bur-0.447
 Tooth-0.442
ior-0.439
Token to
Token ID284
Feature activation-1.14e+00

Pos logit contributions
apy+0.332
 are+0.292
 floats+0.292
 were+0.283
ades+0.281
Neg logit contributions
umper-0.250
othy-0.246
bur-0.245
 Tooth-0.241
asured-0.239
Token put
Token ID1234
Feature activation+1.38e+00

Pos logit contributions
apy+2.033
 floats+1.735
ades+1.691
 are+1.614
 were+1.581
Neg logit contributions
othy-2.384
umper-2.349
 Tooth-2.297
bur-2.274
asured-2.269
Token it
Token ID340
Feature activation+4.85e-01

Pos logit contributions
umper+3.169
asured+3.101
ior+3.091
othy+3.091
bur+3.047
Neg logit contributions
apy-2.274
 are-1.968
 were-1.888
 floats-1.878
 they-1.824
Token on
Token ID319
Feature activation+8.19e-02

Pos logit contributions
umper+0.947
othy+0.924
bur+0.918
ior+0.914
asured+0.903
Neg logit contributions
apy-0.925
 are-0.806
 floats-0.797
 were-0.784
ades-0.763
Token her
Token ID607
Feature activation+3.36e-01

Pos logit contributions
umper+0.161
bur+0.158
othy+0.158
ior+0.155
 Tooth+0.155
Neg logit contributions
apy-0.155
 floats-0.134
 are-0.134
 were-0.130
ades-0.128
Token face
Token ID1986
Feature activation+4.34e-01

Pos logit contributions
umper+0.683
bur+0.668
othy+0.666
ior+0.655
asured+0.652
Neg logit contributions
apy-0.620
 are-0.537
 floats-0.533
 were-0.520
ades-0.506
TokenL
Token ID43
Feature activation+1.21e+00

Pos logit contributions
apy+1.539
 floats+1.304
 are+1.254
ades+1.235
 vows+1.231
Neg logit contributions
umper-1.571
bur-1.515
othy-1.498
 Tooth-1.478
ior-1.469
Tokenily
Token ID813
Feature activation+1.15e+00

Pos logit contributions
umper+2.671
bur+2.653
othy+2.649
asured+2.636
ior+2.630
Neg logit contributions
apy-1.908
 are-1.757
 were-1.683
 floats-1.602
ades-1.575
Token and
Token ID290
Feature activation-3.11e-01

Pos logit contributions
umper+1.979
bur+1.931
othy+1.877
asured+1.877
ior+1.847
Neg logit contributions
apy-2.434
 are-2.281
 were-2.241
 floats-2.217
 they-2.104
Token Max
Token ID5436
Feature activation-1.01e+00

Pos logit contributions
apy+0.582
 floats+0.502
ades+0.495
 are+0.486
 were+0.471
Neg logit contributions
umper-0.625
othy-0.613
bur-0.612
ior-0.601
 Tooth-0.597
Token are
Token ID389
Feature activation+2.09e-01

Pos logit contributions
apy+1.813
ades+1.474
 floats+1.462
cakes+1.350
 are+1.324
Neg logit contributions
umper-2.158
othy-2.141
bur-2.119
ior-2.115
 Tooth-2.059
Token twins
Token ID20345
Feature activation-4.90e-01

Pos logit contributions
umper+0.403
bur+0.394
othy+0.393
ior+0.386
asured+0.385
Neg logit contributions
apy-0.408
 are-0.355
 floats-0.354
 were-0.345
ades-0.337
Token.
Token ID13
Feature activation-8.73e-01

Pos logit contributions
apy+0.941
 floats+0.804
 are+0.775
ades+0.772
 were+0.747
Neg logit contributions
umper-0.957
bur-0.938
othy-0.932
ior-0.925
asured-0.919
Token They
Token ID1119
Feature activation-1.31e+00

Pos logit contributions
apy+0.957
 floats+0.760
ades+0.751
 are+0.718
 were+0.645
Neg logit contributions
ior-2.382
umper-2.374
othy-2.366
bur-2.347
 Tooth-2.328
Token like
Token ID588
Feature activation-8.44e-01

Pos logit contributions
apy+2.987
ades+2.570
 floats+2.565
cakes+2.424
ves+2.354
Neg logit contributions
umper-2.137
othy-2.122
bur-2.077
ior-2.030
asured-2.012
Token to
Token ID284
Feature activation-1.52e+00

Pos logit contributions
apy+2.004
 floats+1.759
 are+1.747
ades+1.718
 were+1.696
Neg logit contributions
umper-1.240
bur-1.227
othy-1.216
 Tooth-1.198
asured-1.193
Token play
Token ID711
Feature activation-1.55e+00

Pos logit contributions
apy+2.570
 floats+2.112
ades+2.062
 are+1.976
 were+1.956
Neg logit contributions
othy-3.457
umper-3.274
bur-3.257
asured-3.171
 Tooth-3.148
Token and
Token ID290
Feature activation+4.02e-01

Pos logit contributions
umper+0.367
bur+0.358
othy+0.358
ior+0.351
asured+0.350
Neg logit contributions
apy-0.397
 are-0.347
 floats-0.346
 were-0.337
ades-0.330
Token he
Token ID339
Feature activation+3.91e-01

Pos logit contributions
umper+0.777
bur+0.759
othy+0.753
ior+0.745
asured+0.740
Neg logit contributions
apy-0.783
 are-0.680
 floats-0.678
 were-0.664
ades-0.641
Token realized
Token ID6939
Feature activation+9.69e-01

Pos logit contributions
umper+0.736
bur+0.720
othy+0.713
ior+0.706
 Tooth+0.701
Neg logit contributions
apy-0.777
 are-0.678
 floats-0.675
 were-0.664
ades-0.644
Token he
Token ID339
Feature activation+1.33e+00

Pos logit contributions
umper+2.075
bur+2.034
ior+2.033
othy+2.024
asured+1.999
Neg logit contributions
apy-1.713
 are-1.460
 floats-1.440
 were-1.437
 they-1.409
Token could
Token ID714
Feature activation-9.21e-02

Pos logit contributions
umper+2.983
bur+2.957
ior+2.908
 Tooth+2.867
othy+2.841
Neg logit contributions
apy-2.164
 were-1.962
 are-1.950
 floats-1.844
 they-1.740
Token draw
Token ID3197
Feature activation+8.20e-02

Pos logit contributions
apy+0.156
 floats+0.134
 are+0.131
 were+0.127
ades+0.126
Neg logit contributions
umper-0.199
othy-0.197
bur-0.195
asured-0.192
 Tooth-0.192
Token and
Token ID290
Feature activation-1.70e-02

Pos logit contributions
umper+0.164
othy+0.160
bur+0.160
ior+0.157
 Tooth+0.157
Neg logit contributions
apy-0.152
 are-0.131
 floats-0.131
 were-0.127
ades-0.124
Token make
Token ID787
Feature activation-1.86e-01

Pos logit contributions
apy+0.029
 floats+0.025
 are+0.025
 were+0.024
ades+0.024
Neg logit contributions
umper-0.036
othy-0.035
bur-0.035
asured-0.035
 Tooth-0.035
Token himself
Token ID2241
Feature activation-1.55e+00

Pos logit contributions
apy+0.348
 floats+0.300
 are+0.300
 were+0.291
ades+0.284
Neg logit contributions
umper-0.368
othy-0.363
bur-0.359
 Tooth-0.355
asured-0.354
Token feel
Token ID1254
Feature activation-3.77e-01

Pos logit contributions
apy+2.801
 floats+2.470
ades+2.356
 are+2.349
 were+2.300
Neg logit contributions
othy-3.151
asured-3.108
umper-3.043
bur-2.985
 Tooth-2.983
Token a
Token ID257
Feature activation+1.98e-01

Pos logit contributions
apy+0.720
 floats+0.630
 are+0.617
ades+0.607
 were+0.598
Neg logit contributions
umper-0.727
othy-0.708
asured-0.703
 Tooth-0.703
bur-0.698
Token out
Token ID503
Feature activation-7.60e-01

Pos logit contributions
apy+0.778
 are+0.647
 floats+0.643
ades+0.627
 were+0.624
Neg logit contributions
umper-1.022
othy-1.006
bur-1.002
ior-0.986
 Tooth-0.978
Token her
Token ID607
Feature activation-6.22e-01

Pos logit contributions
apy+1.529
 floats+1.333
 are+1.314
ades+1.284
 were+1.273
Neg logit contributions
umper-1.391
othy-1.391
bur-1.360
 Tooth-1.334
asured-1.329
Token flowers
Token ID12734
Feature activation-8.64e-01

Pos logit contributions
apy+1.140
 floats+0.969
 are+0.966
 were+0.947
ades+0.932
Neg logit contributions
umper-1.242
bur-1.229
othy-1.227
 Tooth-1.206
asured-1.204
Token and
Token ID290
Feature activation-6.87e-01

Pos logit contributions
apy+2.180
 floats+1.938
 are+1.881
ades+1.859
 were+1.830
Neg logit contributions
bur-1.188
umper-1.178
othy-1.137
asured-1.105
 Tooth-1.103
Token displays
Token ID11298
Feature activation-1.47e+00

Pos logit contributions
apy+1.228
ades+1.041
 are+1.015
 floats+1.003
 were+0.994
Neg logit contributions
othy-1.405
umper-1.395
bur-1.390
 Tooth-1.364
ior-1.364
Token them
Token ID606
Feature activation-2.39e+00

Pos logit contributions
apy+2.678
 floats+2.266
 are+2.224
ades+2.171
 were+2.128
Neg logit contributions
othy-2.962
bur-2.958
umper-2.952
 Tooth-2.816
asured-2.811
Token on
Token ID319
Feature activation-9.57e-01

Pos logit contributions
apy+5.789
 floats+4.947
 are+4.830
ades+4.825
 were+4.714
Neg logit contributions
asured-3.650
bur-3.624
othy-3.601
umper-3.498
 Tooth-3.351
Token a
Token ID257
Feature activation-1.73e+00

Pos logit contributions
apy+1.794
 floats+1.500
 are+1.495
ades+1.449
 were+1.431
Neg logit contributions
othy-1.903
bur-1.891
umper-1.887
 Tooth-1.876
asured-1.831
Token bench
Token ID7624
Feature activation-1.11e+00

Pos logit contributions
apy+3.281
 are+2.647
 were+2.643
 floats+2.591
ades+2.464
Neg logit contributions
bur-3.474
othy-3.470
 Tooth-3.452
umper-3.427
ior-3.409
Token.
Token ID13
Feature activation-8.20e-01

Pos logit contributions
apy+2.275
 floats+1.912
 are+1.870
ades+1.842
 were+1.795
Neg logit contributions
othy-2.041
bur-2.030
umper-2.008
asured-1.982
 Tooth-1.971
Token She
Token ID1375
Feature activation-1.12e+00

Pos logit contributions
apy+1.544
 are+1.339
 floats+1.336
ades+1.312
 were+1.287
Neg logit contributions
umper-1.602
othy-1.576
 Tooth-1.564
ior-1.560
bur-1.532
Token a
Token ID257
Feature activation-2.15e+00

Pos logit contributions
apy+3.623
 floats+3.169
ades+3.165
 are+3.090
cakes+3.014
Neg logit contributions
othy-3.573
 Tooth-3.531
umper-3.514
bur-3.483
ior-3.285
Token tree
Token ID5509
Feature activation-1.10e+00

Pos logit contributions
apy+4.061
 are+3.393
 floats+3.373
 vows+3.296
 were+3.295
Neg logit contributions
 Tooth-4.307
bur-4.230
umper-4.204
othy-4.195
ior-3.996
Token.
Token ID13
Feature activation-5.13e-02

Pos logit contributions
apy+2.230
 floats+1.909
 are+1.850
ades+1.841
 vows+1.790
Neg logit contributions
bur-2.042
othy-2.027
umper-2.015
asured-1.976
 Tooth-1.948
Token They
Token ID1119
Feature activation-1.77e+00

Pos logit contributions
apy+0.084
 are+0.072
 floats+0.071
 were+0.068
ades+0.068
Neg logit contributions
umper-0.113
othy-0.111
bur-0.111
ior-0.110
asured-0.109
Token drew
Token ID9859
Feature activation-2.36e+00

Pos logit contributions
apy+3.714
ades+3.295
 floats+3.165
 are+2.978
cakes+2.975
Neg logit contributions
othy-3.220
bur-3.214
umper-3.203
asured-3.141
my-2.997
Token a
Token ID257
Feature activation-2.26e+00

Pos logit contributions
apy+4.677
 floats+3.837
ades+3.818
 are+3.789
 vows+3.776
Neg logit contributions
othy-4.358
bur-4.208
umper-4.201
my-4.132
 Tooth-4.115
Token picture
Token ID4286
Feature activation-1.60e+00

Pos logit contributions
apy+3.987
 are+3.453
 floats+3.301
 were+3.272
ades+3.267
Neg logit contributions
 Tooth-4.644
othy-4.595
bur-4.513
ior-4.390
umper-4.388
Token of
Token ID286
Feature activation-1.30e+00

Pos logit contributions
apy+2.840
 are+2.382
 floats+2.374
ades+2.298
 vows+2.263
Neg logit contributions
othy-3.229
bur-3.178
umper-3.172
 Tooth-3.086
asured-3.082
Token a
Token ID257
Feature activation-1.42e+00

Pos logit contributions
apy+2.546
ades+2.174
 floats+2.159
 are+2.109
cakes+2.044
Neg logit contributions
othy-2.438
bur-2.426
 Tooth-2.412
umper-2.387
asured-2.346
Token rainbow
Token ID27223
Feature activation-9.04e-01

Pos logit contributions
apy+2.607
 are+2.272
 floats+2.211
ades+2.202
 were+2.154
Neg logit contributions
 Tooth-2.822
bur-2.780
othy-2.765
umper-2.742
ior-2.672
Token and
Token ID290
Feature activation-8.16e-01

Pos logit contributions
apy+2.001
 are+1.746
 floats+1.717
ades+1.683
 were+1.677
Neg logit contributions
bur-1.460
othy-1.448
umper-1.436
 Tooth-1.387
asured-1.385
Token toys
Token ID14958
Feature activation-1.15e+00

Pos logit contributions
apy+2.793
ades+2.258
 floats+2.244
 are+2.213
 were+2.193
Neg logit contributions
othy-2.569
ior-2.568
asured-2.556
bur-2.537
umper-2.502
Token,
Token ID11
Feature activation-3.14e-01

Pos logit contributions
apy+2.697
 floats+2.329
ades+2.238
 are+2.210
 were+2.176
Neg logit contributions
asured-1.765
bur-1.761
umper-1.755
othy-1.728
ior-1.700
Token and
Token ID290
Feature activation-3.14e-01

Pos logit contributions
apy+0.760
 floats+0.670
 are+0.668
ades+0.658
 were+0.651
Neg logit contributions
umper-0.441
bur-0.434
othy-0.433
asured-0.427
ior-0.427
Token sometimes
Token ID3360
Feature activation-1.45e+00

Pos logit contributions
apy+0.459
 floats+0.368
 are+0.366
ades+0.364
 were+0.350
Neg logit contributions
umper-0.740
othy-0.736
bur-0.730
ior-0.724
asured-0.724
Token they
Token ID484
Feature activation-1.21e+00

Pos logit contributions
apy+1.628
ades+1.289
 floats+1.154
cakes+1.093
 are+1.088
Neg logit contributions
bur-3.850
othy-3.850
umper-3.839
asured-3.817
 Tooth-3.816
Token fight
Token ID1907
Feature activation-1.83e+00

Pos logit contributions
apy+2.268
ades+1.841
 floats+1.818
cakes+1.690
 were+1.671
Neg logit contributions
umper-2.528
bur-2.515
othy-2.480
ior-2.454
asured-2.453
Token over
Token ID625
Feature activation-9.86e-01

Pos logit contributions
apy+4.418
 floats+3.727
ades+3.640
 are+3.615
 vows+3.521
Neg logit contributions
asured-2.762
othy-2.719
umper-2.710
ior-2.657
bur-2.624
Token them
Token ID606
Feature activation-2.24e+00

Pos logit contributions
apy+2.006
 floats+1.687
 are+1.651
ades+1.632
 vows+1.590
Neg logit contributions
umper-1.789
bur-1.768
othy-1.751
asured-1.747
ior-1.722
Token.
Token ID13
Feature activation-8.10e-01

Pos logit contributions
apy+5.206
 floats+4.368
ades+4.202
 vows+4.127
 are+3.976
Neg logit contributions
asured-3.692
bur-3.656
umper-3.648
my-3.620
othy-3.614
Token
Token ID198
Feature activation-7.62e-01

Pos logit contributions
apy+1.378
 floats+1.151
 are+1.150
ades+1.114
 were+1.097
Neg logit contributions
umper-1.735
ior-1.721
bur-1.714
othy-1.701
 Tooth-1.669
Token
Token ID198
Feature activation-2.44e-01

Pos logit contributions
apy+1.338
 floats+1.165
 are+1.134
 were+1.078
 vows+1.059
Neg logit contributions
umper-1.576
bur-1.569
othy-1.556
ior-1.521
asured-1.515
Token animals
Token ID4695
Feature activation-2.12e+00

Pos logit contributions
apy+1.340
ades+1.097
 floats+1.083
 are+1.070
 were+1.013
Neg logit contributions
othy-1.874
umper-1.866
asured-1.852
bur-1.845
ior-1.830
Token with
Token ID351
Feature activation-2.08e+00

Pos logit contributions
apy+4.900
 floats+4.219
ades+4.113
 vows+3.980
cakes+3.979
Neg logit contributions
umper-3.407
bur-3.317
asured-3.265
 Tooth-3.183
othy-3.182
Token the
Token ID262
Feature activation-1.62e+00

Pos logit contributions
apy+3.341
cakes+2.825
 floats+2.805
ades+2.768
 vows+2.718
Neg logit contributions
umper-4.562
othy-4.527
bur-4.483
asured-4.399
 Tooth-4.393
Token clay
Token ID21558
Feature activation-2.23e+00

Pos logit contributions
apy+2.839
 floats+2.257
ades+2.252
 are+2.170
 were+2.128
Neg logit contributions
umper-3.435
bur-3.389
 Tooth-3.373
othy-3.364
ior-3.317
Token.
Token ID13
Feature activation-7.24e-01

Pos logit contributions
apy+4.972
ades+4.110
 floats+4.073
 are+3.919
 widened+3.881
Neg logit contributions
umper-3.720
bur-3.638
asured-3.570
 Tooth-3.569
othy-3.562
Token They
Token ID1119
Feature activation-1.67e+00

Pos logit contributions
apy+1.200
 floats+0.999
 are+0.991
ades+0.966
 were+0.937
Neg logit contributions
bur-1.574
umper-1.572
ior-1.568
othy-1.566
asured-1.521
Token had
Token ID550
Feature activation-1.06e+00

Pos logit contributions
apy+3.700
ades+3.251
 floats+3.151
cakes+3.061
 vows+3.034
Neg logit contributions
umper-2.896
othy-2.833
bur-2.829
asured-2.709
my-2.703
Token red
Token ID2266
Feature activation-1.47e+00

Pos logit contributions
apy+2.039
 floats+1.681
 are+1.632
ades+1.625
cakes+1.558
Neg logit contributions
othy-2.113
bur-2.080
umper-2.057
asured-2.046
 Tooth-2.019
Token clay
Token ID21558
Feature activation-1.85e+00

Pos logit contributions
apy+3.086
 floats+2.655
ades+2.609
 are+2.572
 vows+2.559
Neg logit contributions
othy-2.463
umper-2.440
bur-2.431
ior-2.391
asured-2.353
Token,
Token ID11
Feature activation-6.60e-01

Pos logit contributions
apy+4.463
 floats+3.744
ades+3.697
 are+3.667
 vows+3.600
Neg logit contributions
umper-2.693
bur-2.650
asured-2.592
othy-2.590
 Tooth-2.542
Token blue
Token ID4171
Feature activation-1.86e+00

Pos logit contributions
apy+1.236
 are+1.054
 floats+1.042
ades+1.027
 were+1.009
Neg logit contributions
umper-1.275
othy-1.269
bur-1.252
 Tooth-1.242
asured-1.228
Token put
Token ID1234
Feature activation-4.44e-01

Pos logit contributions
apy+0.427
 floats+0.367
 are+0.358
 were+0.349
ades+0.348
Neg logit contributions
umper-0.480
othy-0.477
bur-0.474
asured-0.465
 Tooth-0.462
Token the
Token ID262
Feature activation-1.58e+00

Pos logit contributions
apy+0.793
 floats+0.675
 are+0.662
ades+0.654
 were+0.649
Neg logit contributions
umper-0.903
bur-0.901
othy-0.887
ior-0.879
 Tooth-0.876
Token puzzle
Token ID15027
Feature activation-1.04e+00

Pos logit contributions
apy+2.896
 floats+2.429
ades+2.328
 were+2.281
 vows+2.223
Neg logit contributions
 Tooth-3.277
umper-3.232
bur-3.167
othy-3.166
asured-3.137
Token pieces
Token ID5207
Feature activation-1.30e+00

Pos logit contributions
apy+2.263
 floats+2.008
 are+1.905
ades+1.898
 were+1.860
Neg logit contributions
umper-1.734
bur-1.726
asured-1.717
othy-1.713
 Tooth-1.649
Token together
Token ID1978
Feature activation-6.63e-01

Pos logit contributions
apy+2.685
 floats+2.377
 vows+2.249
ades+2.249
 are+2.207
Neg logit contributions
umper-2.317
bur-2.308
asured-2.250
othy-2.212
ior-2.106
Token.
Token ID13
Feature activation-2.31e+00

Pos logit contributions
apy+1.282
 floats+1.120
 are+1.059
ades+1.054
 were+1.008
Neg logit contributions
umper-1.267
bur-1.263
othy-1.248
asured-1.236
ior-1.218
Token Sue
Token ID26089
Feature activation-2.50e-02

Pos logit contributions
apy+4.738
ades+4.373
 are+4.229
 floats+4.191
 vows+3.918
Neg logit contributions
othy-4.125
ior-3.979
asured-3.898
bur-3.703
 Tooth-3.506
Token and
Token ID290
Feature activation-4.33e-01

Pos logit contributions
apy+0.049
 floats+0.042
 are+0.042
 were+0.041
ades+0.040
Neg logit contributions
umper-0.048
othy-0.047
bur-0.047
asured-0.046
ior-0.046
Token her
Token ID607
Feature activation-1.15e+00

Pos logit contributions
apy+0.905
 floats+0.784
 are+0.768
ades+0.768
 were+0.748
Neg logit contributions
othy-0.787
umper-0.782
bur-0.782
ior-0.766
 Tooth-0.752
Token big
Token ID1263
Feature activation-6.95e-01

Pos logit contributions
apy+2.910
 floats+2.595
 are+2.552
ades+2.524
 were+2.481
Neg logit contributions
othy-1.584
umper-1.528
bur-1.519
asured-1.510
 Tooth-1.496
Token hand
Token ID1021
Feature activation-1.74e+00

Pos logit contributions
apy+1.747
 floats+1.536
 are+1.531
ades+1.526
 were+1.513
Neg logit contributions
umper-0.943
othy-0.941
ior-0.900
bur-0.899
 Tooth-0.895