TOP ACTIVATIONS
MAX = 3.693

little bird named Twe ety . Twe ety loved to fly
small bird named Twe ety . Twe ety loved to fly
bird ies to help . Together , they flew to the
little bird named Twe ety . Twe ety loved to fly
little bird named Twe ety . Twe ety loved to dance
bird ie named Twe ety . Twe ety lived in a
little bird named Twe ety . Twe ety was very flexible
they made . They had ten pairs of ac orns .
yellow bird named Twe ety . Twe ety loved to sing
little bird named Twe ety . Twe ety loved to whistle
little bird named Twe ety . Twe ety wanted to fly
little bird named Twe ety . Twe ety had an unusual
bird ie named Twe ety . Twe ety loved to fly
and were very happy . When they returned home , Twe
small bird named Twe ety . Twe ety lived in a
little bird named Twe ety . Twe ety lived in a
in the dirt . One day , Gold heard
little bird named Twe ety . Twe ety was very smart
. But when Twe ety and the owl
little bird named Twe ety . He had an orange be

MIN ACTIVATIONS
MIN = -3.698

so big ! She touched the screen and it lit up
over and accidentally stepped on the cricket . The cricket started
through the small hole in the screen . The butterfly was
she ran and showed it to her mom . Mom was
was surprised . He touched the screen and it made such
and pushed the button on the television . The screen lit
with a long cable . The cable was harmless , so
Tim stood up and pushed the button on the television .
closer look . Suddenly , the screen was making loud noises
Sara sees a light on the screen . The light is
Billy slowly approached the computer again , he slowly
. She peek ed around the corner and saw a pretty
M andy showed her hoop to her father .
step to get closer to the bug . He wanted to
to fit under the table the wires started to spark .
. Sam pushed the ball towards the shadow ,
frustrated . The buttons on the television were different and she
. He touched the keyboard , expecting the computer
trick ! She stood on the stage , holding a shiny
John nodded . He grabbed the cricket and put it carefully

INTERVAL 1.845 - 3.693
CONTAINS 0.532%

Thank you , Sara , this is a very nice pe
ily felt sorry for the little boy and said , "
to see the stars and couldn 't wait to tell her
back to living happily in the stable . <|endoftext|> Once there
wet . The girl was very excited .

INTERVAL -0.003 - 1.845
CONTAINS 49.463%

smiling . When they arrived at the museum
But you need to share and be nice to each other
voice was gentle , so he agreed . The
ily looked up and saw a friendly butterfly hovering above .
. " But I think it belongs to someone else .

INTERVAL -1.851 - -0.003
CONTAINS 49.594%

very happy together . They welcomed each other into their lives
picked it up with his mouth and passed it to the
his son climbed into one of the big trucks . But
me test for my bracelet ?" Mom my smiled
You spoiled my game !" Lily felt sad and didn 't

INTERVAL -3.698 - -1.851
CONTAINS 0.412%

and Tom cried and hugged each other . They felt lonely
on her head . She was shocked and angry . She
He says , " Y uck ! This apple is bad
. But you should also respect each other 's privacy .
off the bed , pushed the shiny knob again , and
Token little
Token ID1310
Feature activation+1.10e-01

Pos logit contributions
leaf+0.204
ting+0.202
orry+0.185
unk+0.183
 ear+0.179
Neg logit contributions
 ne-0.246
 One-0.238
 cheaper-0.234
 coll-0.233
 swe-0.233
Token bird
Token ID6512
Feature activation+4.02e-02

Pos logit contributions
 ne+0.153
 One+0.149
 cheaper+0.146
 swe+0.144
 coll+0.144
Neg logit contributions
leaf-0.173
ting-0.171
orry-0.160
unk-0.158
 ear-0.156
Token named
Token ID3706
Feature activation+2.91e-01

Pos logit contributions
 ne+0.063
 One+0.061
 cheaper+0.060
 coll+0.060
 swe+0.060
Neg logit contributions
leaf-0.057
ting-0.056
orry-0.052
unk-0.051
 ear-0.051
Token Twe
Token ID24205
Feature activation+1.07e+00

Pos logit contributions
 ne+0.294
 One+0.274
 cheaper+0.273
 coll+0.271
 swe+0.270
Neg logit contributions
leaf-0.578
ting-0.575
orry-0.544
unk-0.540
 ear-0.537
Tokenety
Token ID2963
Feature activation+1.57e-01

Pos logit contributions
 ne+1.033
 One+0.946
 swe+0.944
 cheaper+0.934
 coll+0.882
Neg logit contributions
ting-2.189
leaf-2.189
orry-2.105
unk-2.102
 upper-2.051
Token.
Token ID13
Feature activation+3.69e+00

Pos logit contributions
 ne+0.218
 One+0.210
 cheaper+0.207
 swe+0.205
 coll+0.205
Neg logit contributions
leaf-0.250
ting-0.248
orry-0.232
unk-0.229
 ear-0.227
Token Twe
Token ID24205
Feature activation+1.56e+00

Pos logit contributions
 coll+4.313
 cheaper+4.125
 swe+3.986
 scar+3.963
 problems+3.928
Neg logit contributions
ting-7.873
 ear-7.221
 Stop-6.483
leaf-6.424
rified-6.357
Tokenety
Token ID2963
Feature activation+1.11e+00

Pos logit contributions
 ne+1.657
 swe+1.529
 cheaper+1.522
 One+1.496
 saf+1.436
Neg logit contributions
ting-3.072
leaf-3.020
unk-2.966
orry-2.961
other-2.895
Token loved
Token ID6151
Feature activation+3.96e-01

Pos logit contributions
 cheaper+2.099
 ne+2.078
 coll+2.068
 swe+2.061
 One+2.019
Neg logit contributions
ting-1.145
leaf-1.136
orry-1.077
unk-1.049
 ear-1.043
Token to
Token ID284
Feature activation+9.86e-01

Pos logit contributions
 ne+0.535
 One+0.518
 cheaper+0.510
 swe+0.498
 coll+0.497
Neg logit contributions
leaf-0.650
ting-0.642
orry-0.591
 ear-0.589
unk-0.589
Token fly
Token ID6129
Feature activation+1.10e+00

Pos logit contributions
 cheaper+1.359
 ne+1.345
 swe+1.291
 One+1.284
 coll+1.269
Neg logit contributions
leaf-1.566
ting-1.517
 ear-1.479
 upper-1.430
orry-1.425
Token small
Token ID1402
Feature activation+3.56e-01

Pos logit contributions
leaf+0.307
ting+0.302
orry+0.277
unk+0.274
 ear+0.268
Neg logit contributions
 ne-0.362
 One-0.350
 cheaper-0.343
 coll-0.342
 swe-0.342
Token bird
Token ID6512
Feature activation-5.25e-02

Pos logit contributions
 ne+0.539
 One+0.524
 cheaper+0.516
 swe+0.511
One+0.509
Neg logit contributions
leaf-0.515
ting-0.512
orry-0.475
unk-0.472
 ear-0.471
Token named
Token ID3706
Feature activation+3.63e-01

Pos logit contributions
leaf+0.075
ting+0.074
orry+0.069
unk+0.068
 ear+0.067
Neg logit contributions
 ne-0.082
 One-0.079
 cheaper-0.077
 coll-0.077
 swe-0.077
Token Twe
Token ID24205
Feature activation+1.05e+00

Pos logit contributions
 ne+0.398
 cheaper+0.373
 One+0.372
 coll+0.369
 swe+0.368
Neg logit contributions
leaf-0.688
ting-0.684
orry-0.644
unk-0.640
 ear-0.635
Tokenety
Token ID2963
Feature activation+7.13e-02

Pos logit contributions
 ne+1.008
 One+0.928
 swe+0.904
 cheaper+0.903
 saf+0.855
Neg logit contributions
ting-2.154
leaf-2.153
orry-2.082
unk-2.068
 upper-2.019
Token.
Token ID13
Feature activation+3.69e+00

Pos logit contributions
 ne+0.097
 One+0.094
 cheaper+0.092
 swe+0.091
 coll+0.091
Neg logit contributions
leaf-0.115
ting-0.114
orry-0.107
unk-0.106
 ear-0.105
Token Twe
Token ID24205
Feature activation+1.46e+00

Pos logit contributions
 coll+4.289
 cheaper+4.077
 former+4.038
 scar+3.969
 problems+3.961
Neg logit contributions
ting-7.705
 ear-7.151
leaf-6.476
 Stop-6.424
other-6.300
Tokenety
Token ID2963
Feature activation+1.02e+00

Pos logit contributions
 ne+1.521
 cheaper+1.386
 One+1.384
 swe+1.378
 saf+1.323
Neg logit contributions
ting-2.879
leaf-2.849
orry-2.807
unk-2.784
other-2.710
Token loved
Token ID6151
Feature activation+3.67e-01

Pos logit contributions
 cheaper+1.913
 ne+1.902
 coll+1.882
 swe+1.878
 One+1.851
Neg logit contributions
ting-1.051
leaf-1.050
orry-0.987
unk-0.963
 ear-0.952
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
 ne+0.494
 One+0.479
 cheaper+0.471
 swe+0.461
 coll+0.459
Neg logit contributions
leaf-0.604
ting-0.597
orry-0.550
unk-0.547
 ear-0.547
Token fly
Token ID6129
Feature activation+1.13e+00

Pos logit contributions
 cheaper+1.542
 ne+1.509
 swe+1.457
 One+1.442
 coll+1.428
Neg logit contributions
leaf-1.754
ting-1.699
 ear-1.665
 upper-1.603
orry-1.590
Token bird
Token ID6512
Feature activation+1.11e+00

Pos logit contributions
 ne+1.777
 One+1.729
 coll+1.676
One+1.673
 cheaper+1.668
Neg logit contributions
ting-1.355
leaf-1.352
 ear-1.228
unk-1.178
orry-1.162
Tokenies
Token ID444
Feature activation-3.17e-01

Pos logit contributions
 ne+1.591
 cheaper+1.547
 One+1.542
 coll+1.516
One+1.507
Neg logit contributions
ting-1.716
leaf-1.645
unk-1.578
 ear-1.564
orry-1.561
Token to
Token ID284
Feature activation-2.38e-01

Pos logit contributions
leaf+0.535
ting+0.520
orry+0.498
unk+0.489
 upper+0.484
Neg logit contributions
 ne-0.416
 One-0.402
 swe-0.389
 cheaper-0.388
 coll-0.386
Token help
Token ID1037
Feature activation+4.88e-01

Pos logit contributions
leaf+0.352
ting+0.346
orry+0.324
unk+0.321
 upper+0.311
Neg logit contributions
 ne-0.364
 One-0.354
 cheaper-0.343
 swe-0.342
 coll-0.342
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
 ne+0.694
 One+0.667
 cheaper+0.660
 coll+0.652
 swe+0.649
Neg logit contributions
ting-0.768
leaf-0.754
orry-0.692
 ear-0.691
unk-0.686
Token Together
Token ID17083
Feature activation+3.67e+00

Pos logit contributions
 ne+2.641
 coll+2.501
 swe+2.494
 cheaper+2.471
One+2.420
Neg logit contributions
ting-3.130
leaf-2.968
 ear-2.754
 Stop-2.671
unk-2.653
Token,
Token ID11
Feature activation+2.41e+00

Pos logit contributions
 former+6.318
 shin+6.038
One+5.947
 coll+5.924
 cheaper+5.870
Neg logit contributions
ting-5.788
 ear-4.728
 behind-4.518
 hand-4.467
 around-4.445
Token they
Token ID484
Feature activation+1.98e-01

Pos logit contributions
 ne+2.657
One+2.654
 coll+2.612
 One+2.592
 swe+2.546
Neg logit contributions
ting-4.997
leaf-4.495
 ear-4.007
 expecting-4.000
other-3.925
Token flew
Token ID13112
Feature activation+1.00e+00

Pos logit contributions
 ne+0.256
 One+0.250
 cheaper+0.244
 coll+0.242
 swe+0.241
Neg logit contributions
leaf-0.333
ting-0.331
orry-0.309
unk-0.305
 ear-0.302
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
 ne+1.868
 One+1.843
 cheaper+1.838
 coll+1.821
One+1.815
Neg logit contributions
ting-1.128
leaf-1.084
 ear-0.961
unk-0.938
orry-0.932
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
 ne+2.008
 One+1.896
 swe+1.887
 cheaper+1.886
One+1.871
Neg logit contributions
leaf-1.854
ting-1.846
 ear-1.681
other-1.598
orry-1.597
Token little
Token ID1310
Feature activation+1.06e-02

Pos logit contributions
leaf+0.322
ting+0.318
orry+0.292
unk+0.288
 ear+0.282
Neg logit contributions
 ne-0.377
 One-0.364
 cheaper-0.357
 coll-0.356
 swe-0.356
Token bird
Token ID6512
Feature activation-5.01e-02

Pos logit contributions
 ne+0.015
 One+0.014
 cheaper+0.014
 coll+0.014
 swe+0.014
Neg logit contributions
leaf-0.017
ting-0.017
orry-0.016
unk-0.015
 ear-0.015
Token named
Token ID3706
Feature activation+3.70e-01

Pos logit contributions
leaf+0.073
ting+0.072
orry+0.067
unk+0.066
 ear+0.065
Neg logit contributions
 ne-0.077
 One-0.075
 cheaper-0.073
 coll-0.073
 swe-0.072
Token Twe
Token ID24205
Feature activation+9.49e-01

Pos logit contributions
 ne+0.367
 cheaper+0.342
 One+0.340
 coll+0.338
 swe+0.337
Neg logit contributions
leaf-0.741
ting-0.737
orry-0.698
unk-0.694
 ear-0.689
Tokenety
Token ID2963
Feature activation-2.55e-04

Pos logit contributions
 ne+0.890
 One+0.819
 swe+0.806
 cheaper+0.801
 coll+0.758
Neg logit contributions
ting-1.950
leaf-1.947
unk-1.877
orry-1.876
 upper-1.836
Token.
Token ID13
Feature activation+3.63e+00

Pos logit contributions
leaf+0.000
ting+0.000
orry+0.000
unk+0.000
 ear+0.000
Neg logit contributions
 ne-0.000
 One-0.000
 cheaper-0.000
 coll-0.000
 swe-0.000
Token Twe
Token ID24205
Feature activation+1.36e+00

Pos logit contributions
 coll+4.199
 cheaper+4.007
 problems+3.888
 scar+3.827
 swe+3.812
Neg logit contributions
ting-7.782
 ear-7.086
 Stop-6.405
leaf-6.336
ada-6.312
Tokenety
Token ID2963
Feature activation+9.81e-01

Pos logit contributions
 ne+1.395
 swe+1.277
 cheaper+1.276
 One+1.267
 coll+1.203
Neg logit contributions
ting-2.700
leaf-2.666
unk-2.617
orry-2.608
other-2.537
Token loved
Token ID6151
Feature activation+2.96e-01

Pos logit contributions
 cheaper+1.810
 ne+1.797
 coll+1.781
 swe+1.776
 One+1.747
Neg logit contributions
ting-1.054
leaf-1.050
orry-0.991
unk-0.968
 ear-0.960
Token to
Token ID284
Feature activation+1.02e+00

Pos logit contributions
 ne+0.395
 One+0.382
 cheaper+0.376
 swe+0.368
 coll+0.368
Neg logit contributions
leaf-0.490
ting-0.484
orry-0.448
unk-0.445
 ear-0.444
Token fly
Token ID6129
Feature activation+1.07e+00

Pos logit contributions
 cheaper+1.393
 ne+1.374
 swe+1.319
 One+1.308
 coll+1.292
Neg logit contributions
leaf-1.631
ting-1.583
 ear-1.542
 upper-1.489
orry-1.488
Token little
Token ID1310
Feature activation+9.64e-02

Pos logit contributions
leaf+0.207
ting+0.205
orry+0.188
unk+0.186
 ear+0.182
Neg logit contributions
 ne-0.242
 One-0.234
 cheaper-0.229
 coll-0.228
 swe-0.228
Token bird
Token ID6512
Feature activation-1.07e-01

Pos logit contributions
 ne+0.136
 One+0.132
 cheaper+0.129
 swe+0.128
 coll+0.128
Neg logit contributions
leaf-0.152
ting-0.150
orry-0.140
unk-0.138
 ear-0.137
Token named
Token ID3706
Feature activation+6.29e-01

Pos logit contributions
leaf+0.156
ting+0.153
orry+0.142
unk+0.140
 ear+0.137
Neg logit contributions
 ne-0.166
 One-0.161
 cheaper-0.157
 coll-0.156
 swe-0.156
Token Twe
Token ID24205
Feature activation+1.09e+00

Pos logit contributions
 ne+0.635
 cheaper+0.594
 coll+0.589
 swe+0.584
 One+0.576
Neg logit contributions
leaf-1.238
ting-1.237
orry-1.170
unk-1.163
 ear-1.162
Tokenety
Token ID2963
Feature activation-9.29e-02

Pos logit contributions
 ne+1.054
 One+0.964
 swe+0.948
 cheaper+0.944
 saf+0.889
Neg logit contributions
ting-2.217
leaf-2.213
orry-2.131
unk-2.129
 upper-2.077
Token.
Token ID13
Feature activation+3.62e+00

Pos logit contributions
leaf+0.153
ting+0.151
orry+0.141
unk+0.140
 ear+0.138
Neg logit contributions
 ne-0.125
 One-0.121
 cheaper-0.117
 coll-0.117
 swe-0.116
Token Twe
Token ID24205
Feature activation+1.55e+00

Pos logit contributions
 coll+4.189
 cheaper+4.121
 swe+3.915
 problems+3.855
 former+3.825
Neg logit contributions
ting-7.637
 ear-7.079
 Stop-6.329
leaf-6.266
ges-6.199
Tokenety
Token ID2963
Feature activation+9.58e-01

Pos logit contributions
 ne+1.641
 cheaper+1.502
 swe+1.487
 One+1.482
 saf+1.419
Neg logit contributions
ting-3.036
leaf-2.980
orry-2.929
unk-2.929
other-2.855
Token loved
Token ID6151
Feature activation+3.07e-01

Pos logit contributions
 cheaper+1.759
 ne+1.754
 coll+1.733
 swe+1.732
 One+1.701
Neg logit contributions
ting-1.034
leaf-1.030
orry-0.969
unk-0.949
 ear-0.943
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
 ne+0.411
 One+0.398
 cheaper+0.391
 swe+0.383
 coll+0.382
Neg logit contributions
leaf-0.509
ting-0.502
orry-0.465
unk-0.463
 ear-0.461
Token dance
Token ID9280
Feature activation-2.02e-03

Pos logit contributions
 cheaper+1.372
 ne+1.353
 swe+1.300
 One+1.289
 coll+1.275
Neg logit contributions
leaf-1.616
ting-1.569
 ear-1.532
 upper-1.480
orry-1.474
Token bird
Token ID6512
Feature activation+1.13e-01

Pos logit contributions
 ne+0.080
 One+0.078
 cheaper+0.076
 coll+0.076
 swe+0.076
Neg logit contributions
leaf-0.090
ting-0.089
orry-0.083
unk-0.082
 ear-0.081
Tokenie
Token ID494
Feature activation-2.26e-01

Pos logit contributions
 ne+0.179
 One+0.174
 cheaper+0.172
 swe+0.170
 coll+0.170
Neg logit contributions
leaf-0.158
ting-0.156
orry-0.144
unk-0.143
 ear-0.141
Token named
Token ID3706
Feature activation+5.48e-01

Pos logit contributions
leaf+0.344
ting+0.336
orry+0.313
unk+0.308
 upper+0.303
Neg logit contributions
 ne-0.335
 One-0.326
 coll-0.315
 cheaper-0.314
One-0.313
Token Twe
Token ID24205
Feature activation+1.09e+00

Pos logit contributions
 ne+0.562
 cheaper+0.528
 coll+0.525
 One+0.517
 swe+0.514
Neg logit contributions
ting-1.070
leaf-1.066
orry-1.005
 ear-1.003
unk-1.001
Tokenety
Token ID2963
Feature activation+1.93e-01

Pos logit contributions
 ne+1.048
 One+0.967
 cheaper+0.954
 swe+0.928
 saf+0.904
Neg logit contributions
ting-2.224
leaf-2.191
unk-2.111
orry-2.110
 upper-2.062
Token.
Token ID13
Feature activation+3.62e+00

Pos logit contributions
 ne+0.268
 One+0.258
 cheaper+0.256
 swe+0.253
 coll+0.252
Neg logit contributions
leaf-0.306
ting-0.305
orry-0.284
unk-0.282
 ear-0.279
Token Twe
Token ID24205
Feature activation+1.41e+00

Pos logit contributions
 coll+4.278
 cheaper+3.996
 problems+3.836
 swe+3.817
 scar+3.813
Neg logit contributions
ting-7.592
 ear-7.068
leaf-6.312
 Stop-6.304
ges-6.175
Tokenety
Token ID2963
Feature activation+1.19e+00

Pos logit contributions
 ne+1.462
 cheaper+1.348
 One+1.333
 swe+1.308
 saf+1.282
Neg logit contributions
ting-2.809
leaf-2.742
orry-2.691
unk-2.688
other-2.623
Token lived
Token ID5615
Feature activation-2.52e-01

Pos logit contributions
 cheaper+2.249
 ne+2.218
 coll+2.212
 swe+2.204
 One+2.154
Neg logit contributions
ting-1.234
leaf-1.219
orry-1.158
unk-1.127
 ear-1.126
Token in
Token ID287
Feature activation+1.28e+00

Pos logit contributions
leaf+0.284
ting+0.276
orry+0.253
unk+0.244
 upper+0.240
Neg logit contributions
 ne-0.475
 One-0.460
 cheaper-0.450
One-0.449
 swe-0.449
Token a
Token ID257
Feature activation+9.84e-01

Pos logit contributions
 ne+1.860
 cheaper+1.836
 swe+1.816
 coll+1.793
 One+1.732
Neg logit contributions
leaf-1.923
ting-1.916
 ear-1.822
orry-1.734
unk-1.728
Token little
Token ID1310
Feature activation+1.25e-01

Pos logit contributions
leaf+0.124
ting+0.122
orry+0.112
unk+0.111
 ear+0.109
Neg logit contributions
 ne-0.146
 One-0.142
 cheaper-0.139
 coll-0.138
 swe-0.138
Token bird
Token ID6512
Feature activation+9.04e-02

Pos logit contributions
 ne+0.175
 One+0.170
 cheaper+0.166
 coll+0.165
 swe+0.165
Neg logit contributions
leaf-0.196
ting-0.194
orry-0.181
unk-0.179
 ear-0.178
Token named
Token ID3706
Feature activation+4.58e-01

Pos logit contributions
 ne+0.140
 One+0.136
 cheaper+0.133
 swe+0.132
 coll+0.132
Neg logit contributions
leaf-0.130
ting-0.128
orry-0.119
unk-0.118
 ear-0.116
Token Twe
Token ID24205
Feature activation+1.31e+00

Pos logit contributions
 ne+0.484
 cheaper+0.453
 coll+0.450
 swe+0.446
 One+0.446
Neg logit contributions
leaf-0.885
ting-0.882
orry-0.833
unk-0.828
 ear-0.825
Tokenety
Token ID2963
Feature activation+5.35e-02

Pos logit contributions
 ne+1.300
 swe+1.196
 One+1.196
 cheaper+1.180
 coll+1.112
Neg logit contributions
ting-2.625
leaf-2.601
orry-2.522
unk-2.521
 upper-2.463
Token.
Token ID13
Feature activation+3.61e+00

Pos logit contributions
 ne+0.073
 One+0.071
 cheaper+0.069
 swe+0.069
 coll+0.069
Neg logit contributions
leaf-0.086
ting-0.086
orry-0.080
unk-0.079
 ear-0.078
Token Twe
Token ID24205
Feature activation+1.67e+00

Pos logit contributions
 coll+4.086
 cheaper+3.912
 former+3.763
 swe+3.720
 problems+3.708
Neg logit contributions
ting-7.800
 ear-7.091
 Stop-6.433
leaf-6.389
rified-6.329
Tokenety
Token ID2963
Feature activation+1.08e+00

Pos logit contributions
 ne+1.784
 swe+1.662
 cheaper+1.651
 One+1.626
 coll+1.556
Neg logit contributions
ting-3.259
leaf-3.183
orry-3.145
unk-3.141
other-3.069
Token was
Token ID373
Feature activation+9.89e-01

Pos logit contributions
 cheaper+2.064
 ne+2.046
 coll+2.033
 swe+2.025
 One+1.988
Neg logit contributions
ting-1.099
leaf-1.094
orry-1.033
unk-1.004
 ear-0.999
Token very
Token ID845
Feature activation-3.52e-02

Pos logit contributions
 ne+1.507
 cheaper+1.476
 swe+1.465
 coll+1.457
 One+1.452
Neg logit contributions
ting-1.409
leaf-1.406
 ear-1.340
unk-1.335
orry-1.300
Token flexible
Token ID12846
Feature activation-5.28e-01

Pos logit contributions
leaf+0.056
ting+0.055
orry+0.052
unk+0.051
 ear+0.050
Neg logit contributions
 ne-0.049
 One-0.048
 cheaper-0.047
 swe-0.046
 coll-0.046
Token they
Token ID484
Feature activation+2.20e-01

Pos logit contributions
 ne+0.193
 One+0.187
 cheaper+0.181
 coll+0.181
 swe+0.181
Neg logit contributions
leaf-0.282
ting-0.280
orry-0.263
unk-0.260
 ear-0.257
Token made
Token ID925
Feature activation+1.04e+00

Pos logit contributions
 ne+0.309
 One+0.302
 cheaper+0.295
 coll+0.295
 swe+0.294
Neg logit contributions
leaf-0.344
ting-0.339
orry-0.317
unk-0.312
 ear-0.309
Token.
Token ID13
Feature activation+9.34e-01

Pos logit contributions
 ne+1.529
 One+1.510
One+1.476
 cheaper+1.446
 swe+1.431
Neg logit contributions
ting-1.542
leaf-1.541
unk-1.414
orry-1.404
 ear-1.385
Token They
Token ID1119
Feature activation+1.82e-01

Pos logit contributions
 ne+1.079
 cheaper+0.985
 swe+0.984
 coll+0.979
 saf+0.926
Neg logit contributions
leaf-1.712
ting-1.708
orry-1.637
unk-1.614
 ear-1.613
Token had
Token ID550
Feature activation+1.81e+00

Pos logit contributions
 ne+0.258
 One+0.251
 cheaper+0.246
 coll+0.245
 swe+0.245
Neg logit contributions
leaf-0.285
ting-0.280
orry-0.263
unk-0.258
 ear-0.257
Token ten
Token ID3478
Feature activation+3.61e+00

Pos logit contributions
 ne+2.569
 One+2.528
 coll+2.461
One+2.433
 swe+2.432
Neg logit contributions
leaf-2.726
ting-2.583
 ear-2.503
orry-2.485
 upper-2.445
Token pairs
Token ID14729
Feature activation-2.79e-02

Pos logit contributions
 former+4.965
 someday+4.923
oral+4.805
One+4.724
 One+4.721
Neg logit contributions
ting-6.238
 ear-5.615
leaf-5.509
ges-5.336
 hand-5.332
Token of
Token ID286
Feature activation+1.83e+00

Pos logit contributions
leaf+0.034
ting+0.034
orry+0.031
unk+0.031
 ear+0.030
Neg logit contributions
 ne-0.049
 One-0.047
 cheaper-0.047
 coll-0.046
 swe-0.046
Token ac
Token ID936
Feature activation+1.27e+00

Pos logit contributions
 ne+2.332
One+2.251
 One+2.250
 cheaper+2.217
 someday+2.210
Neg logit contributions
leaf-3.048
ting-2.968
 ear-2.827
other-2.630
orry-2.624
Tokenorns
Token ID19942
Feature activation+3.03e-01

Pos logit contributions
 One+1.595
 ne+1.586
 swe+1.539
 cheaper+1.528
 coll+1.477
Neg logit contributions
ting-2.257
leaf-2.185
unk-2.018
orry-2.008
other-1.959
Token.
Token ID13
Feature activation+7.46e-01

Pos logit contributions
 ne+0.421
 One+0.411
 cheaper+0.398
 swe+0.398
 coll+0.396
Neg logit contributions
leaf-0.480
ting-0.479
orry-0.442
unk-0.440
 ear-0.431
Token yellow
Token ID7872
Feature activation-1.88e-01

Pos logit contributions
leaf+0.029
ting+0.029
orry+0.027
unk+0.026
 ear+0.026
Neg logit contributions
 ne-0.036
 One-0.035
 cheaper-0.034
 coll-0.034
 swe-0.034
Token bird
Token ID6512
Feature activation+3.39e-02

Pos logit contributions
leaf+0.313
ting+0.308
orry+0.289
unk+0.285
 upper+0.280
Neg logit contributions
 ne-0.251
 One-0.243
 coll-0.236
 cheaper-0.235
 swe-0.235
Token named
Token ID3706
Feature activation+4.26e-01

Pos logit contributions
 ne+0.053
 One+0.051
 cheaper+0.050
 coll+0.050
 swe+0.050
Neg logit contributions
leaf-0.048
ting-0.048
orry-0.044
unk-0.044
 ear-0.043
Token Twe
Token ID24205
Feature activation+9.52e-01

Pos logit contributions
 ne+0.418
 cheaper+0.390
 One+0.384
 coll+0.383
 swe+0.383
Neg logit contributions
leaf-0.856
ting-0.854
orry-0.806
unk-0.800
 ear-0.799
Tokenety
Token ID2963
Feature activation-1.26e-01

Pos logit contributions
 ne+0.892
 One+0.823
 cheaper+0.807
 swe+0.806
 saf+0.758
Neg logit contributions
ting-1.967
leaf-1.956
orry-1.880
unk-1.879
 upper-1.836
Token.
Token ID13
Feature activation+3.60e+00

Pos logit contributions
leaf+0.209
ting+0.205
orry+0.192
unk+0.190
 ear+0.187
Neg logit contributions
 ne-0.168
 One-0.163
 cheaper-0.158
 coll-0.157
 swe-0.157
Token Twe
Token ID24205
Feature activation+1.33e+00

Pos logit contributions
 coll+4.076
 cheaper+3.995
 problems+3.863
 former+3.795
 swe+3.743
Neg logit contributions
ting-7.803
 ear-7.110
leaf-6.397
 Stop-6.364
ges-6.294
Tokenety
Token ID2963
Feature activation+8.89e-01

Pos logit contributions
 ne+1.354
 cheaper+1.243
 swe+1.241
 One+1.238
 saf+1.176
Neg logit contributions
ting-2.664
leaf-2.629
unk-2.568
orry-2.568
other-2.489
Token loved
Token ID6151
Feature activation+9.15e-02

Pos logit contributions
 ne+1.646
 cheaper+1.646
 swe+1.623
 coll+1.620
 One+1.602
Neg logit contributions
leaf-0.945
ting-0.943
orry-0.879
unk-0.859
 ear-0.852
Token to
Token ID284
Feature activation+7.99e-01

Pos logit contributions
 ne+0.119
 One+0.115
 cheaper+0.113
 swe+0.111
 coll+0.111
Neg logit contributions
leaf-0.154
ting-0.152
orry-0.142
unk-0.141
 ear-0.140
Token sing
Token ID1702
Feature activation+2.08e-02

Pos logit contributions
 cheaper+1.090
 ne+1.089
 One+1.042
 swe+1.042
 coll+1.026
Neg logit contributions
leaf-1.277
ting-1.241
 ear-1.191
orry-1.159
 upper-1.155
Token little
Token ID1310
Feature activation+8.00e-02

Pos logit contributions
leaf+0.236
ting+0.233
orry+0.214
unk+0.211
 ear+0.207
Neg logit contributions
 ne-0.275
 One-0.266
 cheaper-0.261
 coll-0.260
 swe-0.259
Token bird
Token ID6512
Feature activation-1.72e-02

Pos logit contributions
 ne+0.111
 One+0.108
 cheaper+0.105
 coll+0.105
 swe+0.105
Neg logit contributions
leaf-0.127
ting-0.126
orry-0.118
unk-0.116
 ear-0.115
Token named
Token ID3706
Feature activation+5.32e-01

Pos logit contributions
leaf+0.025
ting+0.024
orry+0.023
unk+0.022
 ear+0.022
Neg logit contributions
 ne-0.027
 One-0.026
 cheaper-0.025
 coll-0.025
 swe-0.025
Token Twe
Token ID24205
Feature activation+9.62e-01

Pos logit contributions
 ne+0.557
 cheaper+0.519
 coll+0.514
 swe+0.510
 One+0.508
Neg logit contributions
leaf-1.031
ting-1.031
orry-0.977
unk-0.969
 ear-0.967
Tokenety
Token ID2963
Feature activation-1.65e-01

Pos logit contributions
 ne+0.906
 One+0.837
 swe+0.821
 cheaper+0.819
 coll+0.768
Neg logit contributions
ting-1.977
leaf-1.976
orry-1.905
unk-1.895
 upper-1.855
Token.
Token ID13
Feature activation+3.58e+00

Pos logit contributions
leaf+0.277
ting+0.271
orry+0.254
unk+0.252
 ear+0.248
Neg logit contributions
 ne-0.219
 One-0.213
 cheaper-0.205
 coll-0.205
One-0.204
Token Twe
Token ID24205
Feature activation+1.36e+00

Pos logit contributions
 cheaper+4.167
 coll+4.114
 swe+3.896
 problems+3.829
 scar+3.676
Neg logit contributions
ting-7.757
 ear-7.040
ges-6.354
 Stop-6.328
rified-6.260
Tokenety
Token ID2963
Feature activation+7.24e-01

Pos logit contributions
 ne+1.393
 One+1.276
 cheaper+1.275
 swe+1.275
 saf+1.199
Neg logit contributions
ting-2.707
leaf-2.673
orry-2.613
unk-2.605
other-2.540
Token loved
Token ID6151
Feature activation+3.31e-01

Pos logit contributions
 ne+1.311
 cheaper+1.300
 coll+1.285
 swe+1.283
 One+1.278
Neg logit contributions
ting-0.807
leaf-0.806
orry-0.750
unk-0.735
 ear-0.723
Token to
Token ID284
Feature activation+1.29e+00

Pos logit contributions
 ne+0.443
 One+0.429
 cheaper+0.421
 swe+0.413
 coll+0.412
Neg logit contributions
leaf-0.547
ting-0.540
orry-0.498
unk-0.496
 ear-0.495
Token whistle
Token ID16121
Feature activation+6.61e-02

Pos logit contributions
 cheaper+1.848
 ne+1.794
 swe+1.735
 One+1.700
 coll+1.688
Neg logit contributions
leaf-1.990
ting-1.921
 ear-1.903
 upper-1.827
orry-1.797
Token little
Token ID1310
Feature activation-4.02e-03

Pos logit contributions
leaf+0.127
ting+0.126
orry+0.116
unk+0.114
 ear+0.112
Neg logit contributions
 ne-0.151
 One-0.146
 cheaper-0.143
 coll-0.143
 swe-0.143
Token bird
Token ID6512
Feature activation-1.27e-01

Pos logit contributions
leaf+0.006
ting+0.006
orry+0.006
unk+0.006
 ear+0.006
Neg logit contributions
 ne-0.006
 One-0.005
 cheaper-0.005
 coll-0.005
 swe-0.005
Token named
Token ID3706
Feature activation+3.27e-01

Pos logit contributions
leaf+0.185
ting+0.181
orry+0.168
unk+0.166
 ear+0.163
Neg logit contributions
 ne-0.196
 One-0.191
 cheaper-0.185
 coll-0.185
 swe-0.185
Token Twe
Token ID24205
Feature activation+8.83e-01

Pos logit contributions
 ne+0.395
 cheaper+0.371
 One+0.371
 coll+0.369
 swe+0.367
Neg logit contributions
leaf-0.585
ting-0.580
orry-0.547
unk-0.542
 ear-0.539
Tokenety
Token ID2963
Feature activation-6.00e-02

Pos logit contributions
 ne+0.817
 One+0.754
 swe+0.738
 cheaper+0.733
 coll+0.695
Neg logit contributions
leaf-1.836
ting-1.833
orry-1.758
unk-1.753
 upper-1.711
Token.
Token ID13
Feature activation+3.56e+00

Pos logit contributions
leaf+0.099
ting+0.097
orry+0.091
unk+0.090
 ear+0.089
Neg logit contributions
 ne-0.081
 One-0.078
 cheaper-0.076
 coll-0.076
 swe-0.075
Token Twe
Token ID24205
Feature activation+1.40e+00

Pos logit contributions
 coll+4.056
 cheaper+3.928
 swe+3.810
 problems+3.739
 former+3.704
Neg logit contributions
ting-7.512
 ear-6.999
leaf-6.393
 Stop-6.276
rified-6.202
Tokenety
Token ID2963
Feature activation+9.53e-01

Pos logit contributions
 ne+1.446
 swe+1.326
 cheaper+1.322
 One+1.316
 coll+1.237
Neg logit contributions
ting-2.793
leaf-2.759
orry-2.699
unk-2.696
other-2.623
Token wanted
Token ID2227
Feature activation+9.20e-01

Pos logit contributions
 cheaper+1.738
 ne+1.732
 coll+1.712
 swe+1.709
 One+1.684
Neg logit contributions
ting-1.038
leaf-1.036
orry-0.974
unk-0.951
 ear-0.945
Token to
Token ID284
Feature activation+3.85e-01

Pos logit contributions
 ne+1.352
 One+1.278
 coll+1.273
 cheaper+1.264
 swe+1.256
Neg logit contributions
leaf-1.429
ting-1.382
 ear-1.278
 upper-1.262
orry-1.255
Token fly
Token ID6129
Feature activation+1.14e+00

Pos logit contributions
 ne+0.492
 One+0.472
 cheaper+0.471
 swe+0.464
 coll+0.464
Neg logit contributions
leaf-0.652
ting-0.638
orry-0.602
 ear-0.601
unk-0.598
Token little
Token ID1310
Feature activation+1.52e-02

Pos logit contributions
leaf+0.251
ting+0.248
orry+0.227
unk+0.224
 ear+0.220
Neg logit contributions
 ne-0.292
 One-0.282
 cheaper-0.277
 coll-0.276
 swe-0.275
Token bird
Token ID6512
Feature activation+8.78e-02

Pos logit contributions
 ne+0.021
 One+0.021
 cheaper+0.020
 coll+0.020
 swe+0.020
Neg logit contributions
leaf-0.024
ting-0.024
orry-0.022
unk-0.022
 ear-0.022
Token named
Token ID3706
Feature activation+2.61e-01

Pos logit contributions
 ne+0.138
 One+0.134
 cheaper+0.132
 swe+0.131
 coll+0.131
Neg logit contributions
leaf-0.123
ting-0.122
orry-0.113
unk-0.111
 ear-0.110
Token Twe
Token ID24205
Feature activation+1.08e+00

Pos logit contributions
 ne+0.273
 One+0.256
 cheaper+0.254
 coll+0.252
 swe+0.251
Neg logit contributions
leaf-0.510
ting-0.507
orry-0.479
unk-0.476
 ear-0.473
Tokenety
Token ID2963
Feature activation+1.52e-01

Pos logit contributions
 ne+1.033
 One+0.944
 swe+0.930
 cheaper+0.927
 saf+0.872
Neg logit contributions
ting-2.204
leaf-2.203
orry-2.119
unk-2.117
 upper-2.066
Token.
Token ID13
Feature activation+3.56e+00

Pos logit contributions
 ne+0.210
 One+0.203
 cheaper+0.200
 swe+0.198
 coll+0.198
Neg logit contributions
leaf-0.242
ting-0.241
orry-0.225
unk-0.223
 ear-0.220
Token Twe
Token ID24205
Feature activation+1.58e+00

Pos logit contributions
 coll+3.997
 cheaper+3.876
 swe+3.714
 former+3.657
 problems+3.597
Neg logit contributions
ting-7.686
 ear-7.046
leaf-6.319
 Stop-6.310
rified-6.254
Tokenety
Token ID2963
Feature activation+1.10e+00

Pos logit contributions
 ne+1.664
 cheaper+1.524
 swe+1.516
 One+1.504
 saf+1.448
Neg logit contributions
ting-3.100
leaf-3.044
orry-2.988
unk-2.988
other-2.911
Token had
Token ID550
Feature activation+5.68e-01

Pos logit contributions
 cheaper+2.063
 ne+2.041
 coll+2.031
 swe+2.026
 One+1.983
Neg logit contributions
ting-1.141
leaf-1.139
orry-1.075
unk-1.047
 ear-1.044
Token an
Token ID281
Feature activation+7.85e-01

Pos logit contributions
 ne+0.741
 cheaper+0.709
 One+0.705
 swe+0.698
 coll+0.694
Neg logit contributions
leaf-0.959
ting-0.937
 ear-0.890
orry-0.883
unk-0.883
Token unusual
Token ID8468
Feature activation+1.45e+00

Pos logit contributions
 ne+1.140
 cheaper+1.066
 One+1.064
 swe+1.063
 coll+1.052
Neg logit contributions
leaf-1.209
ting-1.167
 ear-1.146
orry-1.131
unk-1.120
Token bird
Token ID6512
Feature activation-4.91e-02

Pos logit contributions
leaf+0.127
ting+0.125
orry+0.117
unk+0.116
 ear+0.114
Neg logit contributions
 ne-0.109
 One-0.105
 cheaper-0.103
 coll-0.103
 swe-0.102
Tokenie
Token ID494
Feature activation-3.99e-01

Pos logit contributions
leaf+0.072
ting+0.070
orry+0.065
unk+0.064
 ear+0.063
Neg logit contributions
 ne-0.075
 One-0.073
 cheaper-0.071
 coll-0.071
 swe-0.071
Token named
Token ID3706
Feature activation+1.72e-01

Pos logit contributions
leaf+0.645
ting+0.626
orry+0.586
unk+0.577
 upper+0.571
Neg logit contributions
 ne-0.560
 One-0.545
 coll-0.525
One-0.522
 cheaper-0.519
Token Twe
Token ID24205
Feature activation+8.47e-01

Pos logit contributions
 ne+0.193
 One+0.183
 cheaper+0.181
 coll+0.179
 swe+0.178
Neg logit contributions
leaf-0.323
ting-0.321
orry-0.303
unk-0.300
 ear-0.298
Tokenety
Token ID2963
Feature activation+8.12e-02

Pos logit contributions
 ne+0.780
 One+0.721
 cheaper+0.702
 swe+0.689
 saf+0.667
Neg logit contributions
ting-1.758
leaf-1.750
orry-1.682
unk-1.677
 upper-1.638
Token.
Token ID13
Feature activation+3.56e+00

Pos logit contributions
 ne+0.111
 One+0.107
 cheaper+0.105
 swe+0.104
 coll+0.104
Neg logit contributions
leaf-0.131
ting-0.130
orry-0.121
unk-0.120
 ear-0.119
Token Twe
Token ID24205
Feature activation+1.15e+00

Pos logit contributions
 coll+4.057
 cheaper+3.756
 scar+3.669
 problems+3.629
 swe+3.628
Neg logit contributions
ting-7.549
 ear-6.943
leaf-6.346
 Stop-6.317
ada-6.161
Tokenety
Token ID2963
Feature activation+1.16e+00

Pos logit contributions
 ne+1.132
 One+1.030
 cheaper+1.028
 swe+1.004
 saf+0.980
Neg logit contributions
ting-2.318
leaf-2.293
orry-2.241
unk-2.233
other-2.163
Token loved
Token ID6151
Feature activation+4.00e-01

Pos logit contributions
 cheaper+2.189
 ne+2.162
 coll+2.160
 swe+2.150
 One+2.102
Neg logit contributions
ting-1.195
leaf-1.185
orry-1.120
unk-1.089
 ear-1.086
Token to
Token ID284
Feature activation+9.78e-01

Pos logit contributions
 ne+0.541
 One+0.524
 cheaper+0.515
 swe+0.504
 coll+0.502
Neg logit contributions
leaf-0.657
ting-0.648
orry-0.596
 ear-0.593
unk-0.593
Token fly
Token ID6129
Feature activation+1.04e+00

Pos logit contributions
 cheaper+1.349
 ne+1.333
 swe+1.284
 One+1.274
 coll+1.261
Neg logit contributions
leaf-1.556
ting-1.507
 ear-1.463
 upper-1.414
orry-1.410
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
 ne+0.070
 One+0.068
 cheaper+0.066
 coll+0.066
 swe+0.066
Neg logit contributions
leaf-0.087
ting-0.086
orry-0.081
unk-0.080
 ear-0.079
Token were
Token ID547
Feature activation+6.53e-01

Pos logit contributions
 One+1.598
 ne+1.561
One+1.549
 cheaper+1.528
 coll+1.507
Neg logit contributions
leaf-1.955
ting-1.945
 ear-1.745
orry-1.741
 upper-1.685
Token very
Token ID845
Feature activation+3.03e-01

Pos logit contributions
 ne+1.040
 One+1.032
 cheaper+0.998
One+0.998
 coll+0.995
Neg logit contributions
leaf-0.890
ting-0.880
unk-0.815
orry-0.814
 ear-0.811
Token happy
Token ID3772
Feature activation+1.80e-02

Pos logit contributions
 ne+0.446
 One+0.439
 coll+0.426
 cheaper+0.424
 swe+0.424
Neg logit contributions
leaf-0.457
ting-0.444
orry-0.418
unk-0.413
 ear-0.410
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
 ne+0.025
 One+0.025
 cheaper+0.024
 coll+0.024
 swe+0.024
Neg logit contributions
leaf-0.028
ting-0.028
orry-0.026
unk-0.026
 ear-0.026
Token When
Token ID1649
Feature activation+3.53e+00

Pos logit contributions
 ne+2.420
 swe+2.303
 cheaper+2.264
 coll+2.243
One+2.163
Neg logit contributions
ting-2.903
leaf-2.728
 ear-2.567
orry-2.562
unk-2.486
Token they
Token ID484
Feature activation+1.32e-01

Pos logit contributions
 coll+7.232
 former+7.095
 swe+6.999
 ne+6.867
onna+6.852
Neg logit contributions
ting-4.193
 ear-3.348
leaf-2.997
 expecting-2.851
 adults-2.754
Token returned
Token ID4504
Feature activation+9.21e-01

Pos logit contributions
 ne+0.175
 One+0.170
 cheaper+0.167
 coll+0.166
 swe+0.166
Neg logit contributions
leaf-0.218
ting-0.216
orry-0.202
unk-0.199
 ear-0.197
Token home
Token ID1363
Feature activation+9.43e-01

Pos logit contributions
 ne+1.338
 One+1.312
One+1.298
 coll+1.290
 cheaper+1.287
Neg logit contributions
ting-1.426
leaf-1.373
 ear-1.298
unk-1.265
orry-1.256
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
 One+0.990
 ne+0.988
One+0.977
 coll+0.958
 swe+0.939
Neg logit contributions
ting-1.789
leaf-1.759
unk-1.666
orry-1.664
 ear-1.649
Token Twe
Token ID24205
Feature activation+1.20e+00

Pos logit contributions
 ne+1.839
 coll+1.834
One+1.787
 swe+1.764
 One+1.740
Neg logit contributions
ting-3.324
leaf-3.046
 ear-2.957
orry-2.843
 expecting-2.813
Token small
Token ID1402
Feature activation+3.98e-01

Pos logit contributions
leaf+0.269
ting+0.266
orry+0.244
unk+0.241
 ear+0.236
Neg logit contributions
 ne-0.318
 One-0.308
 cheaper-0.302
 coll-0.301
 swe-0.301
Token bird
Token ID6512
Feature activation+1.26e-02

Pos logit contributions
 ne+0.611
 One+0.596
 cheaper+0.586
 swe+0.580
One+0.579
Neg logit contributions
leaf-0.567
ting-0.563
orry-0.522
unk-0.520
 ear-0.520
Token named
Token ID3706
Feature activation+4.79e-01

Pos logit contributions
 ne+0.020
 One+0.019
 cheaper+0.019
 coll+0.019
 swe+0.019
Neg logit contributions
leaf-0.018
ting-0.017
orry-0.016
unk-0.016
 ear-0.016
Token Twe
Token ID24205
Feature activation+5.93e-01

Pos logit contributions
 ne+0.502
 cheaper+0.469
 coll+0.463
 One+0.462
 swe+0.462
Neg logit contributions
leaf-0.928
ting-0.924
orry-0.870
unk-0.865
 ear-0.863
Tokenety
Token ID2963
Feature activation-2.01e-02

Pos logit contributions
 ne+0.516
 One+0.476
 cheaper+0.461
 swe+0.455
 coll+0.437
Neg logit contributions
leaf-1.259
ting-1.256
orry-1.200
unk-1.195
 upper-1.172
Token.
Token ID13
Feature activation+3.51e+00

Pos logit contributions
leaf+0.033
ting+0.033
orry+0.030
unk+0.030
 ear+0.030
Neg logit contributions
 ne-0.027
 One-0.026
 cheaper-0.026
 coll-0.025
 swe-0.025
Token Twe
Token ID24205
Feature activation+8.79e-01

Pos logit contributions
 coll+3.919
 cheaper+3.692
 problems+3.609
 scar+3.578
 former+3.559
Neg logit contributions
ting-7.522
 ear-6.948
leaf-6.281
 Stop-6.270
ges-6.087
Tokenety
Token ID2963
Feature activation+8.99e-01

Pos logit contributions
 ne+0.823
 One+0.749
 cheaper+0.742
 swe+0.727
 saf+0.709
Neg logit contributions
ting-1.815
leaf-1.812
orry-1.747
unk-1.740
other-1.687
Token lived
Token ID5615
Feature activation-3.36e-01

Pos logit contributions
 cheaper+1.641
 ne+1.640
 coll+1.617
 swe+1.617
 One+1.595
Neg logit contributions
leaf-0.979
ting-0.977
orry-0.913
unk-0.894
 ear-0.887
Token in
Token ID287
Feature activation+1.36e+00

Pos logit contributions
leaf+0.391
ting+0.379
orry+0.350
unk+0.337
 upper+0.333
Neg logit contributions
 ne-0.622
 One-0.601
One-0.588
 cheaper-0.587
 swe-0.585
Token a
Token ID257
Feature activation+1.11e+00

Pos logit contributions
 ne+1.992
 cheaper+1.975
 swe+1.959
 coll+1.929
 One+1.858
Neg logit contributions
leaf-2.024
ting-2.012
 ear-1.932
orry-1.812
unk-1.812
Token little
Token ID1310
Feature activation+1.68e-01

Pos logit contributions
leaf+0.137
ting+0.135
orry+0.124
unk+0.123
 ear+0.120
Neg logit contributions
 ne-0.163
 One-0.158
 cheaper-0.155
 coll-0.154
 swe-0.154
Token bird
Token ID6512
Feature activation-4.40e-02

Pos logit contributions
 ne+0.237
 One+0.230
 cheaper+0.225
 swe+0.223
 coll+0.223
Neg logit contributions
leaf-0.263
ting-0.260
orry-0.242
unk-0.240
 ear-0.238
Token named
Token ID3706
Feature activation+4.43e-01

Pos logit contributions
leaf+0.063
ting+0.062
orry+0.058
unk+0.057
 ear+0.056
Neg logit contributions
 ne-0.068
 One-0.066
 cheaper-0.065
 coll-0.064
 swe-0.064
Token Twe
Token ID24205
Feature activation+1.06e+00

Pos logit contributions
 ne+0.476
 cheaper+0.445
 One+0.442
 coll+0.442
 swe+0.439
Neg logit contributions
leaf-0.849
ting-0.845
orry-0.799
unk-0.794
 ear-0.788
Tokenety
Token ID2963
Feature activation-1.07e-02

Pos logit contributions
 ne+1.018
 One+0.937
 swe+0.925
 cheaper+0.915
 saf+0.860
Neg logit contributions
ting-2.168
leaf-2.168
orry-2.087
unk-2.085
 upper-2.029
Token.
Token ID13
Feature activation+3.50e+00

Pos logit contributions
leaf+0.017
ting+0.017
orry+0.016
unk+0.016
 ear+0.016
Neg logit contributions
 ne-0.015
 One-0.014
 cheaper-0.014
 coll-0.014
 swe-0.014
Token Twe
Token ID24205
Feature activation+1.46e+00

Pos logit contributions
 coll+3.952
 cheaper+3.763
 swe+3.644
 scar+3.594
 problems+3.501
Neg logit contributions
ting-7.479
 ear-6.865
leaf-6.328
 Stop-6.278
rified-6.176
Tokenety
Token ID2963
Feature activation+9.29e-01

Pos logit contributions
 ne+1.523
 swe+1.407
 cheaper+1.396
 One+1.385
 saf+1.316
Neg logit contributions
ting-2.892
leaf-2.855
unk-2.796
orry-2.795
other-2.722
Token lived
Token ID5615
Feature activation-3.35e-01

Pos logit contributions
 cheaper+1.688
 ne+1.681
 coll+1.666
 swe+1.661
 One+1.639
Neg logit contributions
ting-1.022
leaf-1.015
orry-0.956
unk-0.933
 ear-0.926
Token in
Token ID287
Feature activation+1.35e+00

Pos logit contributions
leaf+0.385
ting+0.373
orry+0.343
unk+0.331
 upper+0.326
Neg logit contributions
 ne-0.626
 One-0.606
One-0.593
 cheaper-0.591
 swe-0.589
Token a
Token ID257
Feature activation+1.15e+00

Pos logit contributions
 ne+1.980
 cheaper+1.967
 swe+1.946
 coll+1.916
 One+1.849
Neg logit contributions
leaf-2.016
ting-2.006
 ear-1.921
orry-1.816
unk-1.810
Token in
Token ID287
Feature activation+7.03e-01

Pos logit contributions
 ne+1.189
 One+1.176
 cheaper+1.157
 coll+1.157
One+1.147
Neg logit contributions
ting-0.957
leaf-0.944
orry-0.875
unk-0.863
 ear-0.863
Token the
Token ID262
Feature activation-2.74e-01

Pos logit contributions
 ne+1.048
 One+1.011
 coll+1.001
 cheaper+0.997
 swe+0.986
Neg logit contributions
ting-1.029
leaf-1.014
 ear-0.959
orry-0.934
unk-0.923
Token dirt
Token ID13647
Feature activation-5.00e-02

Pos logit contributions
leaf+0.394
ting+0.385
orry+0.357
unk+0.354
 ear+0.343
Neg logit contributions
 ne-0.431
 One-0.422
 swe-0.412
 cheaper-0.409
 coll-0.407
Token.
Token ID13
Feature activation+2.43e+00

Pos logit contributions
leaf+0.082
ting+0.081
orry+0.076
unk+0.075
 ear+0.074
Neg logit contributions
 ne-0.067
 One-0.065
 cheaper-0.063
 swe-0.063
 coll-0.063
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
 ne+3.631
 swe+3.490
 cheaper+3.408
 coll+3.244
 spe+3.109
Neg logit contributions
ting-3.941
 ear-3.701
leaf-3.626
other-3.606
orry-3.518
Token
Token ID198
Feature activation+3.50e+00

Pos logit contributions
 ne+1.319
 swe+1.278
 cheaper+1.244
 coll+1.237
 shin+1.146
Neg logit contributions
leaf-1.728
ting-1.681
orry-1.633
unk-1.618
other-1.595
TokenOne
Token ID3198
Feature activation-7.59e-01

Pos logit contributions
 swe+2.700
 ne+2.504
 cheaper+2.343
 spe+2.157
 coll+2.124
Neg logit contributions
ting-8.447
other-8.060
leaf-7.924
orry-7.830
 ear-7.562
Token day
Token ID1110
Feature activation+1.77e+00

Pos logit contributions
leaf+1.407
ting+1.404
unk+1.285
orry+1.278
 upper+1.263
Neg logit contributions
 One-0.908
 ne-0.902
One-0.862
 swe-0.831
 saf-0.830
Token,
Token ID11
Feature activation+1.94e+00

Pos logit contributions
 swe+2.088
 cheaper+2.075
 coll+2.073
 former+2.052
 ne+1.970
Neg logit contributions
ting-3.301
leaf-3.042
 ear-2.966
orry-2.890
unk-2.839
Token Gold
Token ID3561
Feature activation+1.41e+00

Pos logit contributions
 swe+2.324
 cheaper+2.195
 coll+2.174
 ne+2.158
 former+2.122
Neg logit contributions
ting-3.671
leaf-3.401
 ear-3.308
orry-3.136
rified-3.088
Token heard
Token ID2982
Feature activation+1.21e+00

Pos logit contributions
 cheaper+2.219
 ne+2.216
 One+2.173
 swe+2.143
 coll+2.120
Neg logit contributions
ting-1.999
unk-1.872
leaf-1.869
orry-1.865
other-1.748
Token little
Token ID1310
Feature activation-3.98e-02

Pos logit contributions
leaf+0.281
ting+0.277
orry+0.254
unk+0.251
 ear+0.246
Neg logit contributions
 ne-0.335
 One-0.324
 cheaper-0.318
 coll-0.317
 swe-0.317
Token bird
Token ID6512
Feature activation-7.44e-02

Pos logit contributions
leaf+0.064
ting+0.063
orry+0.059
unk+0.058
 ear+0.057
Neg logit contributions
 ne-0.055
 One-0.054
 cheaper-0.052
 coll-0.052
 swe-0.052
Token named
Token ID3706
Feature activation+3.51e-01

Pos logit contributions
leaf+0.107
ting+0.105
orry+0.098
unk+0.096
 ear+0.095
Neg logit contributions
 ne-0.116
 One-0.113
 cheaper-0.110
 coll-0.109
 swe-0.109
Token Twe
Token ID24205
Feature activation+1.10e+00

Pos logit contributions
 ne+0.384
 cheaper+0.359
 One+0.357
 coll+0.357
 swe+0.355
Neg logit contributions
leaf-0.667
ting-0.665
orry-0.627
unk-0.622
 ear-0.620
Tokenety
Token ID2963
Feature activation-1.70e-01

Pos logit contributions
 ne+1.062
 One+0.977
 swe+0.963
 cheaper+0.957
 saf+0.899
Neg logit contributions
leaf-2.233
ting-2.233
orry-2.150
unk-2.148
 upper-2.097
Token.
Token ID13
Feature activation+3.50e+00

Pos logit contributions
leaf+0.284
ting+0.278
orry+0.261
unk+0.258
 ear+0.254
Neg logit contributions
 ne-0.226
 One-0.220
 cheaper-0.212
 coll-0.211
One-0.211
Token Twe
Token ID24205
Feature activation+1.46e+00

Pos logit contributions
 coll+3.920
 cheaper+3.764
 swe+3.570
 problems+3.503
 former+3.464
Neg logit contributions
ting-7.654
 ear-6.997
 Stop-6.318
leaf-6.230
rified-6.214
Tokenety
Token ID2963
Feature activation+8.57e-01

Pos logit contributions
 ne+1.522
 cheaper+1.396
 swe+1.394
 One+1.389
 saf+1.314
Neg logit contributions
ting-2.873
leaf-2.846
unk-2.786
orry-2.782
other-2.713
Token was
Token ID373
Feature activation+9.66e-01

Pos logit contributions
 ne+1.581
 cheaper+1.579
 coll+1.560
 swe+1.557
 One+1.538
Neg logit contributions
ting-0.918
leaf-0.917
orry-0.855
unk-0.836
 ear-0.826
Token very
Token ID845
Feature activation-2.48e-01

Pos logit contributions
 ne+1.461
 cheaper+1.429
 swe+1.421
 coll+1.409
 One+1.405
Neg logit contributions
ting-1.386
leaf-1.386
 ear-1.318
unk-1.314
orry-1.280
Token smart
Token ID4451
Feature activation-1.30e+00

Pos logit contributions
leaf+0.403
ting+0.402
orry+0.372
unk+0.366
 ear+0.361
Neg logit contributions
 ne-0.342
 One-0.331
 cheaper-0.321
 swe-0.318
 coll-0.317
Token.
Token ID13
Feature activation+2.20e+00

Pos logit contributions
 ne+0.418
 One+0.404
 swe+0.396
 coll+0.395
 cheaper+0.395
Neg logit contributions
leaf-0.396
ting-0.392
orry-0.360
unk-0.360
 ear-0.354
Token
Token ID220
Feature activation+2.78e-01

Pos logit contributions
 ne+3.100
 coll+2.938
 cheaper+2.934
 swe+2.888
 scar+2.756
Neg logit contributions
ting-3.709
leaf-3.475
 ear-3.256
 Stop-3.149
other-3.118
Token
Token ID198
Feature activation+1.44e+00

Pos logit contributions
 ne+0.311
 One+0.291
 cheaper+0.290
 swe+0.288
 coll+0.287
Neg logit contributions
leaf-0.518
ting-0.513
orry-0.487
unk-0.484
 ear-0.474
Token
Token ID198
Feature activation+1.83e+00

Pos logit contributions
 ne+1.942
 swe+1.873
 cheaper+1.859
 coll+1.853
 enough+1.780
Neg logit contributions
leaf-2.430
ting-2.381
other-2.180
unk-2.140
orry-2.138
TokenBut
Token ID1537
Feature activation+2.28e+00

Pos logit contributions
 ne+2.741
 swe+2.651
 coll+2.650
 cheaper+2.618
 enough+2.586
Neg logit contributions
leaf-2.820
ting-2.807
other-2.461
orry-2.405
 Stop-2.380
Token when
Token ID618
Feature activation+3.46e+00

Pos logit contributions
 coll+3.468
 ne+3.318
 swe+3.274
 cheaper+3.236
One+3.213
Neg logit contributions
ting-3.779
 ear-3.338
leaf-3.107
 expecting-2.991
rified-2.925
Token Twe
Token ID24205
Feature activation+1.89e+00

Pos logit contributions
 coll+5.893
 cheaper+5.184
 swe+5.129
 former+5.060
onna+5.037
Neg logit contributions
ting-5.845
 ear-4.992
 oh-4.669
 Stop-4.644
rified-4.573
Tokenety
Token ID2963
Feature activation+1.62e+00

Pos logit contributions
 ne+2.175
 cheaper+1.991
 One+1.964
 saf+1.954
 swe+1.939
Neg logit contributions
ting-3.619
unk-3.544
orry-3.529
leaf-3.490
other-3.413
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
 cheaper+2.484
 coll+2.470
 swe+2.443
 ne+2.382
 One+2.360
Neg logit contributions
ting-2.450
leaf-2.245
orry-2.242
unk-2.198
 ear-2.117
Token the
Token ID262
Feature activation+1.11e+00

Pos logit contributions
 cheaper+2.818
 ne+2.816
One+2.812
 coll+2.767
 One+2.760
Neg logit contributions
 ear-2.667
ting-2.666
leaf-2.481
 finger-2.313
 hand-2.305
Token owl
Token ID39610
Feature activation+1.93e+00

Pos logit contributions
 ne+1.747
 One+1.688
 cheaper+1.668
 coll+1.654
 swe+1.636
Neg logit contributions
leaf-1.537
ting-1.535
 ear-1.512
 upper-1.437
unk-1.426
Token little
Token ID1310
Feature activation+8.93e-03

Pos logit contributions
leaf+0.147
ting+0.145
orry+0.133
unk+0.132
 ear+0.129
Neg logit contributions
 ne-0.173
 One-0.168
 cheaper-0.165
 coll-0.164
 swe-0.164
Token bird
Token ID6512
Feature activation-1.89e-01

Pos logit contributions
 ne+0.012
 One+0.012
 cheaper+0.012
 coll+0.012
 swe+0.012
Neg logit contributions
leaf-0.014
ting-0.014
orry-0.013
unk-0.013
 ear-0.013
Token named
Token ID3706
Feature activation+4.24e-01

Pos logit contributions
leaf+0.279
ting+0.273
orry+0.254
unk+0.250
 upper+0.246
Neg logit contributions
 ne-0.288
 One-0.280
 coll-0.271
 cheaper-0.271
One-0.270
Token Twe
Token ID24205
Feature activation+7.84e-01

Pos logit contributions
 ne+0.486
 cheaper+0.456
 One+0.454
 coll+0.453
 swe+0.450
Neg logit contributions
leaf-0.780
ting-0.776
orry-0.731
unk-0.726
 ear-0.721
Tokenety
Token ID2963
Feature activation-1.64e-02

Pos logit contributions
 ne+0.722
 One+0.664
 swe+0.650
 cheaper+0.649
 coll+0.618
Neg logit contributions
leaf-1.632
ting-1.624
orry-1.565
unk-1.558
 upper-1.522
Token.
Token ID13
Feature activation+3.46e+00

Pos logit contributions
leaf+0.027
ting+0.026
orry+0.025
unk+0.024
 ear+0.024
Neg logit contributions
 ne-0.022
 One-0.022
 cheaper-0.021
 coll-0.021
 swe-0.021
Token He
Token ID679
Feature activation+9.26e-01

Pos logit contributions
 coll+3.886
 cheaper+3.884
 swe+3.630
 problems+3.572
 scar+3.550
Neg logit contributions
ting-7.256
 ear-6.840
leaf-6.058
 Stop-6.054
rified-5.995
Token had
Token ID550
Feature activation+7.57e-01

Pos logit contributions
 cheaper+1.656
 ne+1.627
 swe+1.621
 One+1.612
 coll+1.608
Neg logit contributions
leaf-1.075
ting-1.065
orry-0.989
 ear-0.987
unk-0.984
Token an
Token ID281
Feature activation+6.70e-01

Pos logit contributions
 ne+1.015
 cheaper+0.980
 swe+0.964
 One+0.963
 coll+0.954
Neg logit contributions
leaf-1.249
ting-1.218
 ear-1.169
unk-1.153
orry-1.148
Token orange
Token ID10912
Feature activation+6.64e-01

Pos logit contributions
 ne+0.975
 cheaper+0.914
 One+0.914
 swe+0.912
 coll+0.903
Neg logit contributions
leaf-1.029
ting-0.994
 ear-0.967
orry-0.960
unk-0.950
Token be
Token ID307
Feature activation+6.23e-02

Pos logit contributions
 ne+0.889
 cheaper+0.852
 One+0.842
 swe+0.842
 coll+0.819
Neg logit contributions
leaf-1.067
ting-1.064
 ear-1.024
unk-1.012
orry-1.000
Token so
Token ID523
Feature activation-6.87e-01

Pos logit contributions
leaf+2.151
ting+2.085
unk+2.041
orry+1.972
other+1.935
Neg logit contributions
 ne-2.118
 cheaper-2.077
 One-2.022
 swe-1.973
One-1.933
Token big
Token ID1263
Feature activation-7.17e-01

Pos logit contributions
leaf+1.238
ting+1.230
unk+1.157
orry+1.133
 upper+1.112
Neg logit contributions
 ne-0.841
 One-0.817
 cheaper-0.792
 swe-0.776
One-0.766
Token!
Token ID0
Feature activation-4.68e-01

Pos logit contributions
leaf+1.177
ting+1.107
orry+1.085
unk+1.085
 upper+1.053
Neg logit contributions
 ne-0.984
 One-0.950
 cheaper-0.932
 swe-0.922
 coll-0.902
Token She
Token ID1375
Feature activation-8.97e-02

Pos logit contributions
leaf+0.848
ting+0.843
unk+0.790
orry+0.787
 upper+0.772
Neg logit contributions
 One-0.561
 ne-0.548
One-0.513
 cheaper-0.511
 swe-0.507
Token touched
Token ID12615
Feature activation-1.33e+00

Pos logit contributions
leaf+0.142
ting+0.140
orry+0.131
unk+0.130
 ear+0.127
Neg logit contributions
 ne-0.127
 One-0.123
 cheaper-0.120
 swe-0.119
 coll-0.119
Token the
Token ID262
Feature activation-3.70e+00

Pos logit contributions
leaf+2.428
ting+2.250
unk+2.235
orry+2.220
 upper+2.178
Neg logit contributions
 One-1.636
 ne-1.600
 swe-1.512
One-1.501
 cheaper-1.494
Token screen
Token ID3159
Feature activation-1.92e-01

Pos logit contributions
idget+6.709
orry+6.563
udge+6.530
asing+6.469
ting+6.432
Neg logit contributions
 One-4.899
 saf-4.884
 ne-4.795
 coll-4.735
 cheaper-4.616
Token and
Token ID290
Feature activation-1.15e+00

Pos logit contributions
leaf+0.322
ting+0.316
orry+0.298
unk+0.295
 ear+0.289
Neg logit contributions
 ne-0.253
 One-0.244
 cheaper-0.238
 swe-0.236
 coll-0.234
Token it
Token ID340
Feature activation-2.59e-01

Pos logit contributions
leaf+2.025
ting+1.939
unk+1.910
orry+1.868
 upper+1.833
Neg logit contributions
 ne-1.508
 One-1.408
 cheaper-1.350
 swe-1.346
 coll-1.319
Token lit
Token ID6578
Feature activation+9.05e-01

Pos logit contributions
leaf+0.469
ting+0.459
orry+0.432
unk+0.430
 ear+0.424
Neg logit contributions
 ne-0.316
 One-0.300
 cheaper-0.292
 swe-0.291
 coll-0.287
Token up
Token ID510
Feature activation+2.63e-01

Pos logit contributions
 ne+1.674
 One+1.647
 coll+1.638
 cheaper+1.623
One+1.614
Neg logit contributions
ting-1.010
leaf-0.982
 ear-0.881
orry-0.870
unk-0.851
Token over
Token ID625
Feature activation-4.44e-01

Pos logit contributions
leaf+1.338
ting+1.248
orry+1.230
 upper+1.175
unk+1.132
Neg logit contributions
 One-1.966
 ne-1.961
One-1.924
 swe-1.890
 coll-1.859
Token and
Token ID290
Feature activation-4.15e-01

Pos logit contributions
leaf+0.789
ting+0.771
orry+0.743
unk+0.729
 upper+0.720
Neg logit contributions
 ne-0.544
 One-0.532
 coll-0.508
One-0.507
 swe-0.506
Token accidentally
Token ID14716
Feature activation-1.29e+00

Pos logit contributions
leaf+0.712
ting+0.698
orry+0.672
unk+0.666
 upper+0.652
Neg logit contributions
 ne-0.541
 One-0.516
 swe-0.504
 cheaper-0.499
 coll-0.491
Token stepped
Token ID10764
Feature activation-1.34e+00

Pos logit contributions
leaf+2.177
ting+2.098
orry+1.993
other+1.952
 upper+1.910
Neg logit contributions
 ne-1.820
 One-1.664
 swe-1.662
 cheaper-1.601
 coll-1.596
Token on
Token ID319
Feature activation+3.66e-01

Pos logit contributions
leaf+2.204
orry+2.026
ting+2.001
 upper+1.981
unk+1.911
Neg logit contributions
 ne-1.898
 One-1.868
 swe-1.807
One-1.779
 cheaper-1.765
Token the
Token ID262
Feature activation-3.63e+00

Pos logit contributions
 ne+0.526
 One+0.503
 cheaper+0.503
 coll+0.494
 swe+0.494
Neg logit contributions
leaf-0.566
ting-0.564
orry-0.519
 ear-0.518
unk-0.515
Token cricket
Token ID18836
Feature activation-1.16e+00

Pos logit contributions
leaf+5.840
orry+5.766
udge+5.344
idget+5.344
unk+5.337
Neg logit contributions
 swe-5.958
 coll-5.941
 ne-5.629
 One-5.547
 scar-5.396
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
leaf+2.178
ting+2.050
orry+2.005
unk+1.951
 upper+1.938
Neg logit contributions
 One-1.407
 ne-1.377
 coll-1.308
 swe-1.297
 saf-1.286
Token The
Token ID383
Feature activation-2.54e+00

Pos logit contributions
 ne+0.163
 One+0.156
 cheaper+0.153
 coll+0.152
 swe+0.152
Neg logit contributions
leaf-0.212
ting-0.210
orry-0.197
unk-0.195
 ear-0.193
Token cricket
Token ID18836
Feature activation-5.65e-01

Pos logit contributions
leaf+3.013
ting+2.795
orry+2.651
unk+2.412
asp+2.389
Neg logit contributions
 One-5.016
 coll-4.988
 swe-4.980
One-4.885
 ne-4.816
Token started
Token ID2067
Feature activation-2.29e-01

Pos logit contributions
leaf+0.998
ting+0.965
orry+0.909
unk+0.896
 upper+0.887
Neg logit contributions
 ne-0.736
 One-0.696
 swe-0.671
 cheaper-0.659
One-0.657
Token through
Token ID832
Feature activation-5.28e-01

Pos logit contributions
leaf+1.657
ting+1.503
orry+1.465
unk+1.419
 upper+1.373
Neg logit contributions
 ne-1.729
 One-1.667
 swe-1.645
 cheaper-1.606
 coll-1.586
Token the
Token ID262
Feature activation-3.29e+00

Pos logit contributions
leaf+0.896
ting+0.862
orry+0.828
unk+0.820
 upper+0.797
Neg logit contributions
 ne-0.709
 One-0.686
 cheaper-0.657
 swe-0.656
One-0.649
Token small
Token ID1402
Feature activation-1.27e+00

Pos logit contributions
unk+5.047
leaf+5.008
ting+4.982
orry+4.902
udge+4.722
Neg logit contributions
 coll-5.303
 One-5.165
 saf-4.979
 swe-4.955
One-4.906
Token hole
Token ID7604
Feature activation-9.57e-01

Pos logit contributions
leaf+2.067
ting+1.960
orry+1.894
unk+1.835
 upper+1.781
Neg logit contributions
 One-1.798
 coll-1.784
 ne-1.784
 swe-1.723
One-1.677
Token in
Token ID287
Feature activation-4.93e-01

Pos logit contributions
leaf+1.769
ting+1.669
orry+1.600
unk+1.554
 ear+1.545
Neg logit contributions
 ne-1.182
 One-1.127
 cheaper-1.094
 swe-1.077
One-1.051
Token the
Token ID262
Feature activation-3.57e+00

Pos logit contributions
leaf+0.846
ting+0.818
orry+0.787
unk+0.777
 upper+0.750
Neg logit contributions
 ne-0.653
 One-0.642
 cheaper-0.611
One-0.609
 swe-0.607
Token screen
Token ID3159
Feature activation-1.04e+00

Pos logit contributions
leaf+6.383
ying+6.367
unk+6.249
asing+6.205
orry+6.157
Neg logit contributions
 One-5.017
 coll-4.877
 swe-4.727
 saf-4.720
One-4.544
Token.
Token ID13
Feature activation-3.91e-01

Pos logit contributions
leaf+1.913
ting+1.812
orry+1.756
unk+1.731
 ear+1.679
Neg logit contributions
 ne-1.269
 One-1.244
 cheaper-1.174
 swe-1.159
 coll-1.156
Token The
Token ID383
Feature activation-3.15e+00

Pos logit contributions
leaf+0.749
ting+0.741
orry+0.703
unk+0.694
 ear+0.681
Neg logit contributions
 ne-0.425
 One-0.423
 cheaper-0.390
 swe-0.385
One-0.384
Token butterfly
Token ID35113
Feature activation-1.03e-01

Pos logit contributions
leaf+5.584
idget+5.578
ting+5.572
udge+5.530
ash+5.492
Neg logit contributions
 coll-4.231
 One-3.959
 ne-3.892
 swe-3.882
 saf-3.830
Token was
Token ID373
Feature activation-3.13e-01

Pos logit contributions
leaf+0.183
ting+0.180
orry+0.170
unk+0.169
 ear+0.166
Neg logit contributions
 ne-0.126
 One-0.120
 cheaper-0.117
 swe-0.116
 coll-0.116
Token she
Token ID673
Feature activation-2.00e+00

Pos logit contributions
leaf+1.426
ting+1.378
unk+1.325
orry+1.311
 upper+1.294
Neg logit contributions
 ne-0.636
 One-0.625
 swe-0.581
 cheaper-0.571
One-0.567
Token ran
Token ID4966
Feature activation-1.20e+00

Pos logit contributions
leaf+3.433
ting+3.397
other+3.342
unk+3.329
orry+3.301
Neg logit contributions
 ne-2.884
 swe-2.432
 cheaper-2.356
 One-2.307
 coll-2.244
Token and
Token ID290
Feature activation-2.13e+00

Pos logit contributions
leaf+1.893
ting+1.810
orry+1.781
 upper+1.778
unk+1.770
Neg logit contributions
 ne-1.719
 One-1.681
 cheaper-1.622
 swe-1.588
One-1.585
Token showed
Token ID3751
Feature activation-2.32e+00

Pos logit contributions
leaf+3.845
unk+3.708
orry+3.588
ting+3.572
 upper+3.531
Neg logit contributions
 ne-3.018
 swe-2.606
 One-2.551
 scar-2.430
 cheaper-2.430
Token it
Token ID340
Feature activation-7.98e-01

Pos logit contributions
leaf+4.601
ting+4.460
unk+4.390
 upper+4.288
iders+4.262
Neg logit contributions
 One-2.591
 swe-2.515
 enough-2.513
 ne-2.499
 coll-2.306
Token to
Token ID284
Feature activation-3.52e+00

Pos logit contributions
leaf+1.472
ting+1.454
unk+1.366
orry+1.362
 upper+1.345
Neg logit contributions
 One-0.927
 ne-0.919
 swe-0.888
 cheaper-0.877
 coll-0.863
Token her
Token ID607
Feature activation-1.03e+00

Pos logit contributions
leaf+8.826
ater+8.461
ting+8.363
ash+8.337
rified+8.324
Neg logit contributions
 One-3.336
 ne-2.467
 enough-2.354
 saf-2.336
 better-2.308
Token mom
Token ID1995
Feature activation+4.93e-01

Pos logit contributions
leaf+1.556
ting+1.534
orry+1.374
 upper+1.363
unk+1.349
Neg logit contributions
 ne-1.622
 One-1.562
 cheaper-1.538
 swe-1.530
 coll-1.528
Token.
Token ID13
Feature activation-1.75e+00

Pos logit contributions
 ne+0.708
 One+0.671
 coll+0.664
 cheaper+0.662
 swe+0.652
Neg logit contributions
ting-0.766
leaf-0.755
orry-0.697
 ear-0.695
unk-0.694
Token Mom
Token ID11254
Feature activation-5.25e-01

Pos logit contributions
leaf+3.797
ting+3.795
orry+3.466
 upper+3.404
vell+3.400
Neg logit contributions
 One-1.881
 ne-1.479
One-1.477
 swe-1.410
 coll-1.345
Token was
Token ID373
Feature activation-8.43e-01

Pos logit contributions
leaf+0.893
ting+0.879
orry+0.828
unk+0.820
 upper+0.817
Neg logit contributions
 ne-0.699
 One-0.653
 cheaper-0.641
 swe-0.640
 coll-0.629
Token was
Token ID373
Feature activation-1.33e+00

Pos logit contributions
leaf+1.006
ting+0.981
orry+0.922
unk+0.909
 ear+0.902
Neg logit contributions
 ne-0.714
 One-0.677
 cheaper-0.641
 swe-0.640
One-0.639
Token surprised
Token ID6655
Feature activation-6.13e-01

Pos logit contributions
leaf+2.105
ting+2.028
orry+1.910
 upper+1.847
other+1.828
Neg logit contributions
 ne-1.998
 One-1.881
 cheaper-1.872
 swe-1.802
 enough-1.791
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
leaf+1.074
ting+1.026
orry+0.993
unk+0.984
 upper+0.970
Neg logit contributions
 ne-0.792
 One-0.767
 cheaper-0.735
 swe-0.734
 coll-0.722
Token He
Token ID679
Feature activation-5.30e-01

Pos logit contributions
leaf+1.137
ting+1.130
orry+1.076
unk+1.064
 ear+1.044
Neg logit contributions
 One-0.480
 ne-0.466
One-0.426
 cheaper-0.424
 swe-0.407
Token touched
Token ID12615
Feature activation-1.54e+00

Pos logit contributions
leaf+0.839
ting+0.836
unk+0.772
orry+0.771
 upper+0.749
Neg logit contributions
 ne-0.767
 One-0.735
 cheaper-0.704
 swe-0.701
One-0.699
Token the
Token ID262
Feature activation-3.51e+00

Pos logit contributions
leaf+2.984
ting+2.744
unk+2.730
orry+2.725
 upper+2.626
Neg logit contributions
 One-1.805
 ne-1.777
 swe-1.667
 cheaper-1.660
 enough-1.645
Token screen
Token ID3159
Feature activation+2.09e-01

Pos logit contributions
orry+5.248
leaf+5.224
unk+5.219
ting+5.164
idget+5.027
Neg logit contributions
 One-5.684
 coll-5.563
 saf-5.497
 ne-5.426
One-5.340
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 ne+0.285
 One+0.276
 cheaper+0.270
 coll+0.269
 swe+0.268
Neg logit contributions
leaf-0.335
ting-0.333
orry-0.310
unk-0.307
 ear-0.305
Token it
Token ID340
Feature activation-1.64e-01

Pos logit contributions
leaf+2.364
ting+2.249
unk+2.222
orry+2.185
asp+2.124
Neg logit contributions
 ne-1.612
 One-1.466
 cheaper-1.431
 swe-1.426
 coll-1.368
Token made
Token ID925
Feature activation-3.83e-01

Pos logit contributions
leaf+0.281
ting+0.276
orry+0.260
unk+0.258
 ear+0.254
Neg logit contributions
 ne-0.212
 One-0.203
 cheaper-0.198
 swe-0.197
 coll-0.195
Token such
Token ID884
Feature activation-4.33e-01

Pos logit contributions
leaf+0.666
ting+0.657
orry+0.619
unk+0.616
 ear+0.605
Neg logit contributions
 ne-0.483
 One-0.468
 cheaper-0.451
 swe-0.449
 coll-0.448
Token and
Token ID290
Feature activation-1.24e+00

Pos logit contributions
 coll+2.303
 swe+2.240
 One+2.239
 cheaper+2.226
 ne+2.220
Neg logit contributions
ting-2.106
leaf-2.015
 ear-1.926
unk-1.851
 upper-1.837
Token pushed
Token ID7121
Feature activation-6.20e-01

Pos logit contributions
leaf+2.267
ting+2.161
unk+2.146
orry+2.103
other+2.076
Neg logit contributions
 ne-1.599
 One-1.455
 swe-1.411
 cheaper-1.390
One-1.333
Token the
Token ID262
Feature activation-3.45e+00

Pos logit contributions
leaf+1.111
ting+1.066
unk+1.011
orry+1.009
 upper+0.976
Neg logit contributions
 ne-0.789
 One-0.771
 swe-0.733
 cheaper-0.728
One-0.725
Token button
Token ID4936
Feature activation+2.42e-01

Pos logit contributions
orry+5.605
udge+5.305
asing+5.205
ting+5.103
leaf+5.071
Neg logit contributions
 One-5.511
 swe-5.374
 ne-5.124
 cheaper-5.072
 works-5.058
Token on
Token ID319
Feature activation-4.34e-01

Pos logit contributions
 ne+0.357
 One+0.347
 cheaper+0.339
 coll+0.339
 swe+0.336
Neg logit contributions
leaf-0.360
ting-0.360
orry-0.333
unk-0.329
 ear-0.327
Token the
Token ID262
Feature activation-3.46e+00

Pos logit contributions
leaf+0.755
ting+0.721
orry+0.687
unk+0.677
 upper+0.658
Neg logit contributions
 ne-0.570
 One-0.561
 cheaper-0.533
 swe-0.528
One-0.526
Token television
Token ID5581
Feature activation-1.18e+00

Pos logit contributions
orry+5.407
leaf+5.138
udge+5.135
asing+5.039
unk+5.032
Neg logit contributions
 One-5.803
 swe-5.523
 cheaper-5.386
 saf-5.384
 coll-5.376
Token.
Token ID13
Feature activation-9.75e-01

Pos logit contributions
leaf+2.222
ting+2.115
orry+2.045
unk+2.034
 upper+1.944
Neg logit contributions
 ne-1.413
 One-1.385
 cheaper-1.308
 coll-1.291
 swe-1.287
Token The
Token ID383
Feature activation-3.13e+00

Pos logit contributions
leaf+2.031
ting+2.001
orry+1.870
unk+1.829
other+1.790
Neg logit contributions
 One-1.020
 ne-0.952
One-0.861
 cheaper-0.853
 swe-0.851
Token screen
Token ID3159
Feature activation-5.05e-01

Pos logit contributions
orry+4.832
udge+4.707
ting+4.670
leaf+4.579
asing+4.510
Neg logit contributions
 One-5.088
 swe-4.923
 ne-4.847
 cheaper-4.753
 saf-4.748
Token lit
Token ID6578
Feature activation+5.93e-01

Pos logit contributions
leaf+0.900
ting+0.886
orry+0.826
unk+0.824
 ear+0.807
Neg logit contributions
 ne-0.635
 One-0.600
 cheaper-0.582
 swe-0.576
 coll-0.570
Token with
Token ID351
Feature activation-4.78e-02

Pos logit contributions
leaf+0.540
ting+0.531
orry+0.502
unk+0.496
 upper+0.487
Neg logit contributions
 ne-0.478
 One-0.464
 coll-0.448
 cheaper-0.446
 swe-0.445
Token a
Token ID257
Feature activation-5.89e-01

Pos logit contributions
leaf+0.081
ting+0.080
orry+0.075
unk+0.075
 ear+0.074
Neg logit contributions
 ne-0.062
 One-0.060
 cheaper-0.058
 coll-0.058
 swe-0.058
Token long
Token ID890
Feature activation+1.53e-01

Pos logit contributions
ting+0.908
leaf+0.896
orry+0.836
unk+0.820
 upper+0.805
Neg logit contributions
 ne-0.871
 One-0.836
 cheaper-0.830
 coll-0.827
 swe-0.827
Token cable
Token ID7862
Feature activation-3.45e-01

Pos logit contributions
 ne+0.220
 One+0.213
 cheaper+0.210
 swe+0.207
 coll+0.207
Neg logit contributions
leaf-0.235
ting-0.232
orry-0.217
unk-0.215
 ear-0.213
Token.
Token ID13
Feature activation-2.42e-01

Pos logit contributions
leaf+0.566
ting+0.554
orry+0.526
unk+0.516
 upper+0.510
Neg logit contributions
 ne-0.466
 One-0.453
 cheaper-0.440
 coll-0.437
 swe-0.434
Token The
Token ID383
Feature activation-3.45e+00

Pos logit contributions
leaf+0.449
ting+0.441
orry+0.420
unk+0.417
 ear+0.409
Neg logit contributions
 One-0.274
 ne-0.274
 cheaper-0.255
 swe-0.255
 coll-0.254
Token cable
Token ID7862
Feature activation+1.87e-01

Pos logit contributions
orry+6.499
leaf+6.319
unk+6.264
udge+6.182
ocket+6.150
Neg logit contributions
 One-4.363
 swe-4.252
 cheaper-4.165
 coll-4.159
 ne-4.103
Token was
Token ID373
Feature activation+6.06e-01

Pos logit contributions
 ne+0.225
 One+0.217
 cheaper+0.212
 coll+0.211
 swe+0.210
Neg logit contributions
leaf-0.331
ting-0.328
orry-0.308
unk-0.305
 ear-0.303
Token harmless
Token ID23585
Feature activation-2.94e-01

Pos logit contributions
 ne+0.805
 One+0.786
 coll+0.767
 cheaper+0.755
 swe+0.755
Neg logit contributions
leaf-0.982
ting-0.967
unk-0.909
 ear-0.905
orry-0.902
Token,
Token ID11
Feature activation+2.85e-01

Pos logit contributions
leaf+0.515
ting+0.502
orry+0.479
unk+0.473
 upper+0.467
Neg logit contributions
 ne-0.368
 One-0.355
 cheaper-0.345
 swe-0.344
 coll-0.342
Token so
Token ID523
Feature activation+3.57e-01

Pos logit contributions
 ne+0.438
 One+0.425
 cheaper+0.417
 coll+0.415
 swe+0.412
Neg logit contributions
leaf-0.408
ting-0.407
orry-0.372
unk-0.367
 ear-0.365
Token Tim
Token ID5045
Feature activation-6.65e-01

Pos logit contributions
leaf+1.238
ting+1.217
orry+1.146
unk+1.131
 ear+1.116
Neg logit contributions
 One-0.978
 ne-0.954
One-0.884
 swe-0.880
 coll-0.875
Token stood
Token ID6204
Feature activation-7.07e-01

Pos logit contributions
leaf+1.193
ting+1.181
orry+1.085
unk+1.078
 ear+1.065
Neg logit contributions
 ne-0.849
 One-0.799
 swe-0.765
One-0.754
 cheaper-0.749
Token up
Token ID510
Feature activation+1.48e+00

Pos logit contributions
leaf+1.095
ting+1.036
orry+0.988
unk+0.952
 upper+0.939
Neg logit contributions
 ne-1.089
 One-1.029
 cheaper-1.010
 swe-1.010
One-1.002
Token and
Token ID290
Feature activation-1.24e+00

Pos logit contributions
 coll+2.303
 swe+2.240
 One+2.239
 cheaper+2.226
 ne+2.220
Neg logit contributions
ting-2.106
leaf-2.015
 ear-1.926
unk-1.851
 upper-1.837
Token pushed
Token ID7121
Feature activation-6.20e-01

Pos logit contributions
leaf+2.267
ting+2.161
unk+2.146
orry+2.103
other+2.076
Neg logit contributions
 ne-1.599
 One-1.455
 swe-1.411
 cheaper-1.390
One-1.333
Token the
Token ID262
Feature activation-3.45e+00

Pos logit contributions
leaf+1.111
ting+1.066
unk+1.011
orry+1.009
 upper+0.976
Neg logit contributions
 ne-0.789
 One-0.771
 swe-0.733
 cheaper-0.728
One-0.725
Token button
Token ID4936
Feature activation+2.42e-01

Pos logit contributions
orry+5.605
udge+5.305
asing+5.205
ting+5.103
leaf+5.071
Neg logit contributions
 One-5.511
 swe-5.374
 ne-5.124
 cheaper-5.072
 works-5.058
Token on
Token ID319
Feature activation-4.34e-01

Pos logit contributions
 ne+0.357
 One+0.347
 cheaper+0.339
 coll+0.339
 swe+0.336
Neg logit contributions
leaf-0.360
ting-0.360
orry-0.333
unk-0.329
 ear-0.327
Token the
Token ID262
Feature activation-3.46e+00

Pos logit contributions
leaf+0.755
ting+0.721
orry+0.687
unk+0.677
 upper+0.658
Neg logit contributions
 ne-0.570
 One-0.561
 cheaper-0.533
 swe-0.528
One-0.526
Token television
Token ID5581
Feature activation-1.18e+00

Pos logit contributions
orry+5.407
leaf+5.138
udge+5.135
asing+5.039
unk+5.032
Neg logit contributions
 One-5.803
 swe-5.523
 cheaper-5.386
 saf-5.384
 coll-5.376
Token.
Token ID13
Feature activation-9.75e-01

Pos logit contributions
leaf+2.222
ting+2.115
orry+2.045
unk+2.034
 upper+1.944
Neg logit contributions
 ne-1.413
 One-1.385
 cheaper-1.308
 coll-1.291
 swe-1.287
Token closer
Token ID5699
Feature activation+8.04e-01

Pos logit contributions
ting+0.839
leaf+0.828
orry+0.777
unk+0.770
 ear+0.757
Neg logit contributions
 ne-0.678
 One-0.660
 cheaper-0.648
 coll-0.639
 swe-0.636
Token look
Token ID804
Feature activation-3.11e-01

Pos logit contributions
 ne+1.082
 cheaper+1.028
 swe+1.023
 One+1.006
 coll+1.001
Neg logit contributions
leaf-1.330
ting-1.291
 ear-1.222
orry-1.210
unk-1.209
Token.
Token ID13
Feature activation-3.68e-01

Pos logit contributions
leaf+0.520
ting+0.504
orry+0.480
unk+0.474
 upper+0.468
Neg logit contributions
 ne-0.415
 One-0.404
 cheaper-0.394
 swe-0.392
 coll-0.388
Token Suddenly
Token ID24975
Feature activation+7.63e-01

Pos logit contributions
leaf+0.706
ting+0.702
orry+0.664
unk+0.658
 ear+0.646
Neg logit contributions
 One-0.399
 ne-0.396
 cheaper-0.369
One-0.367
 swe-0.359
Token,
Token ID11
Feature activation-3.72e-01

Pos logit contributions
 ne+0.819
 coll+0.819
 cheaper+0.787
 swe+0.786
 One+0.784
Neg logit contributions
ting-1.420
leaf-1.409
orry-1.333
 ear-1.328
unk-1.317
Token the
Token ID262
Feature activation-3.43e+00

Pos logit contributions
leaf+0.710
ting+0.700
orry+0.669
unk+0.666
 ear+0.649
Neg logit contributions
 ne-0.405
 One-0.391
 cheaper-0.374
 swe-0.366
One-0.365
Token screen
Token ID3159
Feature activation-3.59e-01

Pos logit contributions
ting+6.011
leaf+5.781
unk+5.592
orry+5.579
udge+5.546
Neg logit contributions
 One-5.068
 saf-4.933
 ne-4.927
 cheaper-4.834
 coll-4.799
Token was
Token ID373
Feature activation+2.07e-01

Pos logit contributions
leaf+0.580
ting+0.567
orry+0.532
unk+0.526
 ear+0.517
Neg logit contributions
 ne-0.503
 One-0.482
 cheaper-0.468
 swe-0.463
 coll-0.461
Token making
Token ID1642
Feature activation+3.79e-01

Pos logit contributions
 ne+0.313
 One+0.306
 coll+0.298
 cheaper+0.297
 swe+0.296
Neg logit contributions
leaf-0.303
ting-0.299
orry-0.277
unk-0.275
 ear-0.272
Token loud
Token ID7812
Feature activation+1.31e+00

Pos logit contributions
 ne+0.525
 One+0.509
 cheaper+0.497
One+0.493
 swe+0.493
Neg logit contributions
leaf-0.603
ting-0.598
orry-0.557
unk-0.550
 ear-0.545
Token noises
Token ID26782
Feature activation-8.66e-01

Pos logit contributions
 ne+1.727
One+1.723
 One+1.716
 cheaper+1.689
 swe+1.618
Neg logit contributions
ting-2.220
leaf-2.117
 ear-1.974
orry-1.965
unk-1.944
Token Sara
Token ID24799
Feature activation+5.43e-01

Pos logit contributions
 ne+0.499
 One+0.481
 coll+0.478
 cheaper+0.473
 swe+0.469
Neg logit contributions
leaf-0.619
ting-0.618
orry-0.570
 ear-0.567
unk-0.566
Token sees
Token ID7224
Feature activation-7.31e-01

Pos logit contributions
 ne+0.746
 cheaper+0.722
 One+0.718
 swe+0.708
 coll+0.706
Neg logit contributions
leaf-0.866
ting-0.862
orry-0.804
unk-0.801
 ear-0.793
Token a
Token ID257
Feature activation-5.47e-01

Pos logit contributions
leaf+1.362
ting+1.334
orry+1.264
unk+1.250
 upper+1.240
Neg logit contributions
 ne-0.837
 One-0.820
 cheaper-0.788
 swe-0.780
 coll-0.780
Token light
Token ID1657
Feature activation-3.35e-01

Pos logit contributions
leaf+0.960
ting+0.959
orry+0.898
unk+0.881
 upper+0.864
Neg logit contributions
 ne-0.685
 cheaper-0.654
 One-0.650
 coll-0.647
 swe-0.638
Token on
Token ID319
Feature activation+1.63e-02

Pos logit contributions
leaf+0.467
ting+0.458
orry+0.429
unk+0.419
 upper+0.413
Neg logit contributions
 ne-0.537
 One-0.518
 cheaper-0.510
 coll-0.509
 swe-0.507
Token the
Token ID262
Feature activation-3.43e+00

Pos logit contributions
 ne+0.023
 One+0.022
 cheaper+0.022
 coll+0.022
 swe+0.022
Neg logit contributions
leaf-0.026
ting-0.025
orry-0.024
unk-0.024
 ear-0.023
Token screen
Token ID3159
Feature activation-9.21e-01

Pos logit contributions
unk+4.625
orry+4.538
asing+4.467
leaf+4.452
udge+4.409
Neg logit contributions
 saf-6.536
 coll-6.273
 One-6.193
 cheaper-5.937
 ne-5.915
Token.
Token ID13
Feature activation-5.53e-02

Pos logit contributions
leaf+1.677
ting+1.616
orry+1.546
unk+1.533
 upper+1.482
Neg logit contributions
 One-1.133
 ne-1.119
One-1.054
 cheaper-1.036
 coll-1.032
Token The
Token ID383
Feature activation-2.78e+00

Pos logit contributions
leaf+0.098
ting+0.097
orry+0.091
unk+0.090
 ear+0.089
Neg logit contributions
 ne-0.067
 One-0.065
 cheaper-0.063
 swe-0.062
 coll-0.062
Token light
Token ID1657
Feature activation-1.73e-01

Pos logit contributions
ting+4.734
orry+4.676
idget+4.650
leaf+4.644
unk+4.590
Neg logit contributions
 One-3.887
 saf-3.867
 ne-3.778
 coll-3.769
One-3.638
Token is
Token ID318
Feature activation-4.49e-01

Pos logit contributions
leaf+0.319
ting+0.316
orry+0.298
unk+0.295
 upper+0.291
Neg logit contributions
 ne-0.196
 One-0.188
 cheaper-0.183
 coll-0.182
 swe-0.181
Token
Token ID198
Feature activation-5.82e-01

Pos logit contributions
leaf+1.292
ting+1.280
orry+1.212
unk+1.206
 upper+1.179
Neg logit contributions
 One-0.657
 ne-0.623
 cheaper-0.583
One-0.580
 swe-0.566
Token
Token ID198
Feature activation-1.92e-01

Pos logit contributions
ting+1.182
leaf+1.180
orry+1.108
unk+1.098
 ear+1.096
Neg logit contributions
 One-0.564
 ne-0.554
One-0.536
 cheaper-0.514
 swe-0.496
TokenBilly
Token ID49640
Feature activation+2.16e-01

Pos logit contributions
leaf+0.317
ting+0.313
orry+0.293
unk+0.291
 ear+0.288
Neg logit contributions
 ne-0.257
 One-0.251
 cheaper-0.244
One-0.243
 swe-0.241
Token slowly
Token ID6364
Feature activation+4.57e-01

Pos logit contributions
 ne+0.300
 One+0.293
 cheaper+0.288
 coll+0.286
 swe+0.285
Neg logit contributions
leaf-0.342
ting-0.340
orry-0.317
unk-0.313
 ear-0.310
Token approached
Token ID10448
Feature activation-9.30e-01

Pos logit contributions
 ne+0.669
 One+0.664
 cheaper+0.650
 coll+0.648
 swe+0.643
Neg logit contributions
leaf-0.684
ting-0.679
orry-0.628
unk-0.610
 ear-0.609
Token the
Token ID262
Feature activation-3.36e+00

Pos logit contributions
leaf+1.668
ting+1.592
orry+1.571
unk+1.560
 upper+1.504
Neg logit contributions
 ne-1.158
 One-1.149
 cheaper-1.073
 swe-1.059
One-1.056
Token computer
Token ID3644
Feature activation+3.84e-02

Pos logit contributions
leaf+5.001
ting+4.841
unk+4.837
asc+4.782
orry+4.760
Neg logit contributions
 One-5.780
One-5.617
 coll-5.552
 cheaper-5.465
 saf-5.428
Token again
Token ID757
Feature activation+1.36e-01

Pos logit contributions
 ne+0.049
 One+0.047
 cheaper+0.046
 coll+0.046
 swe+0.046
Neg logit contributions
leaf-0.065
ting-0.064
orry-0.060
unk-0.060
 ear-0.059
Token,
Token ID11
Feature activation-8.95e-01

Pos logit contributions
 ne+0.175
 One+0.169
 cheaper+0.165
 swe+0.164
 coll+0.164
Neg logit contributions
leaf-0.231
ting-0.229
orry-0.215
unk-0.212
 ear-0.211
Token he
Token ID339
Feature activation-6.53e-01

Pos logit contributions
leaf+1.465
ting+1.400
unk+1.373
orry+1.372
 upper+1.319
Neg logit contributions
 ne-1.252
 One-1.189
 cheaper-1.169
 swe-1.153
 coll-1.136
Token slowly
Token ID6364
Feature activation+2.07e-01

Pos logit contributions
leaf+1.044
ting+1.023
orry+0.946
unk+0.945
 ear+0.922
Neg logit contributions
 ne-0.950
 One-0.893
 cheaper-0.872
 swe-0.865
 coll-0.857
Token.
Token ID13
Feature activation-9.72e-01

Pos logit contributions
leaf+1.397
ting+1.324
orry+1.281
unk+1.269
 upper+1.248
Neg logit contributions
 One-0.950
 ne-0.941
One-0.878
 coll-0.866
 cheaper-0.865
Token She
Token ID1375
Feature activation+2.73e-01

Pos logit contributions
leaf+1.948
ting+1.917
orry+1.844
unk+1.825
 upper+1.780
Neg logit contributions
 One-1.058
 ne-0.980
One-0.902
 swe-0.879
 coll-0.864
Token peek
Token ID27185
Feature activation-1.58e+00

Pos logit contributions
 ne+0.352
 One+0.343
 cheaper+0.341
 coll+0.338
 swe+0.335
Neg logit contributions
leaf-0.458
ting-0.453
orry-0.427
unk-0.421
 ear-0.418
Tokened
Token ID276
Feature activation-1.06e+00

Pos logit contributions
leaf+1.427
ting+1.306
unk+1.275
 upper+1.272
orry+1.249
Neg logit contributions
 ne-3.330
 One-3.192
 swe-3.138
One-3.087
 saf-3.077
Token around
Token ID1088
Feature activation-6.12e-01

Pos logit contributions
leaf+1.290
ting+1.186
orry+1.173
unk+1.162
 upper+1.141
Neg logit contributions
 ne-1.946
 One-1.899
 swe-1.840
 cheaper-1.814
 coll-1.793
Token the
Token ID262
Feature activation-3.36e+00

Pos logit contributions
leaf+1.104
ting+1.067
orry+1.030
unk+1.030
 upper+1.003
Neg logit contributions
 ne-0.747
 One-0.728
 swe-0.682
 coll-0.681
One-0.679
Token corner
Token ID5228
Feature activation-1.32e+00

Pos logit contributions
leaf+5.613
asing+5.407
ting+5.350
unk+5.252
orry+5.142
Neg logit contributions
One-4.936
 One-4.918
 ne-4.815
 saf-4.795
 swe-4.788
Token and
Token ID290
Feature activation-1.72e+00

Pos logit contributions
leaf+2.598
ting+2.482
unk+2.362
orry+2.325
 upper+2.319
Neg logit contributions
 One-1.473
 ne-1.462
One-1.378
 cheaper-1.346
 coll-1.318
Token saw
Token ID2497
Feature activation-1.08e+00

Pos logit contributions
leaf+3.114
unk+2.999
 upper+2.846
asp+2.845
ting+2.838
Neg logit contributions
 ne-2.227
 One-2.107
One-1.960
 swe-1.923
 saf-1.901
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
leaf+2.040
ting+1.992
orry+1.915
unk+1.908
 upper+1.893
Neg logit contributions
 ne-1.180
 One-1.162
 cheaper-1.095
 coll-1.093
 swe-1.088
Token pretty
Token ID2495
Feature activation-1.27e-01

Pos logit contributions
leaf+0.219
ting+0.216
orry+0.202
unk+0.199
 ear+0.196
Neg logit contributions
 ne-0.193
 One-0.186
 cheaper-0.183
 coll-0.181
 swe-0.181
Token M
Token ID337
Feature activation-9.40e-01

Pos logit contributions
leaf+1.690
ting+1.563
unk+1.458
orry+1.452
 upper+1.435
Neg logit contributions
 ne-2.124
 One-2.100
 swe-1.996
One-1.978
 coll-1.937
Tokenandy
Token ID10757
Feature activation-1.89e-01

Pos logit contributions
leaf+1.490
ting+1.472
 ear+1.324
 upper+1.297
unk+1.275
Neg logit contributions
 ne-1.352
 One-1.338
 swe-1.282
One-1.279
 coll-1.274
Token showed
Token ID3751
Feature activation-1.70e+00

Pos logit contributions
leaf+0.310
ting+0.305
orry+0.284
unk+0.281
 ear+0.279
Neg logit contributions
 ne-0.256
 One-0.248
 cheaper-0.239
 swe-0.239
 coll-0.239
Token her
Token ID607
Feature activation-1.68e+00

Pos logit contributions
leaf+3.194
ting+3.126
unk+2.935
 upper+2.915
orry+2.909
Neg logit contributions
 One-2.050
 swe-1.976
 ne-1.957
 enough-1.834
 coll-1.829
Token hoop
Token ID44792
Feature activation-7.06e-01

Pos logit contributions
leaf+2.926
ting+2.852
 upper+2.495
 Stop+2.464
orry+2.448
Neg logit contributions
 swe-2.380
 One-2.364
 ne-2.312
 coll-2.286
 cheaper-2.202
Token to
Token ID284
Feature activation-3.35e+00

Pos logit contributions
leaf+1.284
ting+1.252
orry+1.181
unk+1.170
 upper+1.152
Neg logit contributions
 ne-0.853
 One-0.837
 swe-0.806
 coll-0.800
 cheaper-0.797
Token her
Token ID607
Feature activation-9.65e-01

Pos logit contributions
leaf+8.518
ting+8.444
unk+7.979
umbling+7.919
orry+7.892
Neg logit contributions
 One-3.265
 ne-2.615
One-2.424
 saf-2.338
 Tweet-2.233
Token father
Token ID2988
Feature activation+6.20e-01

Pos logit contributions
leaf+1.637
ting+1.597
orry+1.449
 upper+1.431
unk+1.423
Neg logit contributions
 ne-1.341
 One-1.299
 swe-1.288
 coll-1.276
 cheaper-1.269
Token.
Token ID13
Feature activation-6.94e-01

Pos logit contributions
 ne+0.875
 cheaper+0.826
 One+0.822
 coll+0.821
 swe+0.812
Neg logit contributions
ting-0.979
leaf-0.967
 ear-0.898
orry-0.894
unk-0.892
Token
Token ID220
Feature activation-5.25e-01

Pos logit contributions
leaf+1.220
ting+1.195
orry+1.139
unk+1.108
 upper+1.084
Neg logit contributions
 One-0.894
 ne-0.866
 swe-0.809
 coll-0.803
One-0.799
Token
Token ID198
Feature activation-1.04e+00

Pos logit contributions
leaf+1.065
ting+1.057
orry+0.983
 ear+0.977
 upper+0.973
Neg logit contributions
 One-0.517
 ne-0.501
One-0.475
 cheaper-0.465
 coll-0.459
Token step
Token ID2239
Feature activation-1.40e+00

Pos logit contributions
ting+1.339
leaf+1.337
orry+1.231
unk+1.210
 upper+1.195
Neg logit contributions
 ne-1.132
 One-1.088
 coll-1.049
 swe-1.039
 cheaper-1.025
Token to
Token ID284
Feature activation-1.95e+00

Pos logit contributions
leaf+2.373
ting+2.269
orry+2.211
unk+2.176
 upper+2.152
Neg logit contributions
 ne-1.839
 One-1.797
One-1.712
 cheaper-1.706
 enough-1.671
Token get
Token ID651
Feature activation-6.99e-01

Pos logit contributions
leaf+3.700
ting+3.665
unk+3.642
asp+3.495
orry+3.487
Neg logit contributions
 One-2.322
 ne-2.272
One-2.049
 saf-2.007
 cheaper-1.975
Token closer
Token ID5699
Feature activation+3.73e-01

Pos logit contributions
leaf+1.169
ting+1.153
orry+1.109
unk+1.098
 upper+1.077
Neg logit contributions
 ne-0.927
 One-0.892
 cheaper-0.867
 swe-0.864
 coll-0.839
Token to
Token ID284
Feature activation-1.97e+00

Pos logit contributions
 ne+0.508
 One+0.487
 cheaper+0.481
 coll+0.477
 swe+0.476
Neg logit contributions
leaf-0.603
ting-0.600
unk-0.554
orry-0.554
 ear-0.551
Token the
Token ID262
Feature activation-3.34e+00

Pos logit contributions
leaf+3.872
unk+3.747
orry+3.623
asp+3.579
ting+3.578
Neg logit contributions
 One-2.382
 ne-2.322
 saf-2.097
One-2.087
 Nib-2.021
Token bug
Token ID5434
Feature activation-3.38e-01

Pos logit contributions
leaf+5.834
ting+5.512
asing+5.293
ash+5.179
 upper+5.042
Neg logit contributions
 One-4.876
One-4.774
 ne-4.627
 problem-4.616
 swe-4.560
Token.
Token ID13
Feature activation-5.73e-02

Pos logit contributions
leaf+0.582
ting+0.565
orry+0.536
unk+0.527
 upper+0.521
Neg logit contributions
 ne-0.439
 One-0.426
 coll-0.412
 swe-0.412
 cheaper-0.408
Token He
Token ID679
Feature activation-1.86e-01

Pos logit contributions
leaf+0.098
ting+0.097
orry+0.091
unk+0.090
 ear+0.089
Neg logit contributions
 ne-0.073
 One-0.071
 cheaper-0.069
 coll-0.068
 swe-0.068
Token wanted
Token ID2227
Feature activation-1.18e+00

Pos logit contributions
leaf+0.292
ting+0.289
orry+0.268
unk+0.265
 ear+0.262
Neg logit contributions
 ne-0.270
 One-0.259
 cheaper-0.252
 swe-0.251
 coll-0.250
Token to
Token ID284
Feature activation-1.24e+00

Pos logit contributions
leaf+2.287
ting+2.258
unk+2.176
orry+2.170
 upper+2.094
Neg logit contributions
 ne-1.280
 One-1.268
 cheaper-1.213
 swe-1.172
One-1.167
Token to
Token ID284
Feature activation-1.71e+00

Pos logit contributions
 ne+0.605
 One+0.581
 coll+0.572
 cheaper+0.569
 swe+0.566
Neg logit contributions
ting-0.738
leaf-0.737
orry-0.684
unk-0.678
 ear-0.677
Token fit
Token ID4197
Feature activation-8.20e-01

Pos logit contributions
ting+2.802
leaf+2.801
unk+2.681
orry+2.646
asp+2.495
Neg logit contributions
 ne-2.505
 One-2.385
One-2.286
 swe-2.250
 cheaper-2.233
Token under
Token ID739
Feature activation-2.49e-01

Pos logit contributions
leaf+1.392
ting+1.301
orry+1.285
unk+1.268
 upper+1.247
Neg logit contributions
 ne-1.122
 One-1.072
 swe-1.041
 cheaper-1.036
 coll-0.995
Token the
Token ID262
Feature activation-3.24e+00

Pos logit contributions
leaf+0.418
ting+0.407
orry+0.386
unk+0.382
 upper+0.373
Neg logit contributions
 ne-0.332
 One-0.323
 cheaper-0.310
 swe-0.310
 coll-0.307
Token table
Token ID3084
Feature activation-6.56e-01

Pos logit contributions
orry+4.473
unk+4.455
leaf+4.364
ting+4.268
asing+4.265
Neg logit contributions
 saf-5.829
 coll-5.698
 One-5.629
 swe-5.613
 ne-5.529
Token the
Token ID262
Feature activation-3.34e+00

Pos logit contributions
leaf+1.423
ting+1.370
orry+1.316
unk+1.296
 ear+1.283
Neg logit contributions
 ne-0.582
 One-0.557
 cheaper-0.527
 swe-0.514
 coll-0.506
Token wires
Token ID19474
Feature activation-1.00e+00

Pos logit contributions
orry+5.023
leaf+4.775
asing+4.749
unk+4.723
udge+4.631
Neg logit contributions
 swe-5.591
 coll-5.568
 cheaper-5.548
 saf-5.508
 ne-5.401
Token started
Token ID2067
Feature activation+5.54e-01

Pos logit contributions
leaf+2.063
ting+1.943
orry+1.916
unk+1.890
 ear+1.855
Neg logit contributions
 ne-1.022
 One-0.975
 cheaper-0.922
 swe-0.892
One-0.875
Token to
Token ID284
Feature activation+3.92e-01

Pos logit contributions
 ne+0.744
 One+0.728
 cheaper+0.712
 coll+0.708
One+0.707
Neg logit contributions
ting-0.896
leaf-0.882
unk-0.826
orry-0.823
 ear-0.814
Token spark
Token ID9009
Feature activation+1.41e-01

Pos logit contributions
 ne+0.583
 One+0.571
 coll+0.563
 cheaper+0.561
 swe+0.556
Neg logit contributions
leaf-0.580
ting-0.578
orry-0.530
unk-0.522
 ear-0.522
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
 ne+0.265
 One+0.259
 cheaper+0.255
 coll+0.253
 swe+0.252
Neg logit contributions
leaf-0.157
ting-0.156
orry-0.140
unk-0.138
 ear-0.135
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 ne+0.818
 coll+0.790
 One+0.785
 cheaper+0.784
 swe+0.773
Neg logit contributions
ting-0.910
leaf-0.886
orry-0.832
 ear-0.832
unk-0.827
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
leaf+2.401
ting+2.385
orry+2.225
unk+2.206
 upper+2.202
Neg logit contributions
 One-1.087
 ne-0.955
One-0.924
 saf-0.843
 coll-0.834
Token
Token ID198
Feature activation-1.19e+00

Pos logit contributions
leaf+0.210
ting+0.209
orry+0.197
unk+0.195
 ear+0.193
Neg logit contributions
 ne-0.112
 One-0.109
 cheaper-0.104
One-0.103
 coll-0.103
TokenSam
Token ID16305
Feature activation-3.51e-01

Pos logit contributions
leaf+2.398
ting+2.383
unk+2.239
 ear+2.232
orry+2.191
Neg logit contributions
One-1.311
 One-1.272
 ne-1.236
 saf-1.106
 coll-1.064
Token pushed
Token ID7121
Feature activation-5.02e-01

Pos logit contributions
leaf+0.585
ting+0.572
orry+0.537
unk+0.534
 ear+0.527
Neg logit contributions
 ne-0.476
 One-0.456
 cheaper-0.438
 swe-0.436
 coll-0.434
Token the
Token ID262
Feature activation-3.33e+00

Pos logit contributions
leaf+0.833
ting+0.803
unk+0.755
orry+0.754
 upper+0.738
Neg logit contributions
 ne-0.693
 One-0.677
 swe-0.643
One-0.641
 cheaper-0.639
Token ball
Token ID2613
Feature activation-3.14e-01

Pos logit contributions
leaf+6.268
ting+6.074
idget+5.793
asing+5.790
orry+5.721
Neg logit contributions
 ne-4.530
 saf-4.451
 One-4.399
One-4.330
 problem-4.288
Token towards
Token ID3371
Feature activation-1.45e+00

Pos logit contributions
leaf+0.524
ting+0.508
orry+0.478
unk+0.475
 ear+0.463
Neg logit contributions
 ne-0.425
 One-0.416
 cheaper-0.398
 swe-0.395
One-0.394
Token the
Token ID262
Feature activation-3.15e+00

Pos logit contributions
leaf+2.608
ting+2.363
unk+2.256
orry+2.219
asp+2.208
Neg logit contributions
 One-1.960
 ne-1.888
 saf-1.818
 coll-1.795
One-1.790
Token shadow
Token ID9082
Feature activation-5.66e-01

Pos logit contributions
leaf+6.670
ting+6.333
asing+6.321
orry+6.188
idget+6.152
Neg logit contributions
 ne-3.564
 saf-3.504
 One-3.461
One-3.352
 problem-3.259
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
leaf+1.061
ting+1.021
orry+0.971
unk+0.969
 upper+0.944
Neg logit contributions
 ne-0.662
 One-0.641
 cheaper-0.606
 swe-0.600
 coll-0.597
Token frustrated
Token ID14718
Feature activation-4.63e-01

Pos logit contributions
leaf+0.172
ting+0.170
orry+0.158
unk+0.156
 ear+0.154
Neg logit contributions
 ne-0.160
 One-0.154
 cheaper-0.151
 swe-0.150
 coll-0.149
Token.
Token ID13
Feature activation-1.62e-01

Pos logit contributions
leaf+0.814
ting+0.785
orry+0.743
unk+0.737
 upper+0.727
Neg logit contributions
 ne-0.589
 One-0.576
 cheaper-0.557
 swe-0.552
 coll-0.545
Token The
Token ID383
Feature activation-2.78e+00

Pos logit contributions
leaf+0.311
ting+0.307
orry+0.290
unk+0.288
 ear+0.284
Neg logit contributions
 ne-0.173
 One-0.171
 cheaper-0.161
 swe-0.159
One-0.159
Token buttons
Token ID12163
Feature activation+3.80e-01

Pos logit contributions
leaf+4.298
udge+4.221
orry+4.218
ting+4.216
unk+4.122
Neg logit contributions
 One-4.395
 swe-4.294
 ne-4.189
 cheaper-4.143
One-4.135
Token on
Token ID319
Feature activation-1.46e-01

Pos logit contributions
 ne+0.528
 One+0.515
 cheaper+0.504
 coll+0.502
 swe+0.501
Neg logit contributions
leaf-0.594
ting-0.594
orry-0.551
unk-0.543
 ear-0.539
Token the
Token ID262
Feature activation-3.32e+00

Pos logit contributions
leaf+0.236
ting+0.232
orry+0.218
unk+0.215
 ear+0.211
Neg logit contributions
 ne-0.203
 One-0.198
 cheaper-0.192
 swe-0.191
 coll-0.191
Token television
Token ID5581
Feature activation-5.44e-01

Pos logit contributions
orry+4.932
leaf+4.818
udge+4.793
unk+4.748
asing+4.699
Neg logit contributions
 swe-5.521
 One-5.419
 cheaper-5.400
 coll-5.362
 saf-5.325
Token were
Token ID547
Feature activation-3.46e-01

Pos logit contributions
leaf+0.972
ting+0.940
orry+0.894
unk+0.889
 ear+0.869
Neg logit contributions
 ne-0.670
 One-0.646
 cheaper-0.620
 coll-0.618
 swe-0.613
Token different
Token ID1180
Feature activation-2.51e-01

Pos logit contributions
leaf+0.567
ting+0.558
orry+0.524
unk+0.516
 upper+0.507
Neg logit contributions
 ne-0.474
 One-0.453
 cheaper-0.450
 swe-0.446
 coll-0.440
Token and
Token ID290
Feature activation-3.08e-01

Pos logit contributions
leaf+0.413
ting+0.403
orry+0.381
unk+0.375
 ear+0.371
Neg logit contributions
 ne-0.341
 One-0.328
 cheaper-0.323
 swe-0.321
 coll-0.319
Token she
Token ID673
Feature activation-9.21e-01

Pos logit contributions
leaf+0.525
ting+0.515
orry+0.485
unk+0.483
 upper+0.471
Neg logit contributions
 ne-0.402
 One-0.386
 cheaper-0.380
 swe-0.378
 coll-0.374
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
leaf+0.018
ting+0.018
orry+0.017
unk+0.017
 ear+0.016
Neg logit contributions
 ne-0.015
 One-0.015
 cheaper-0.015
 coll-0.015
 swe-0.015
Token
Token ID198
Feature activation+4.54e-01

Pos logit contributions
 ne+1.241
 coll+1.204
 swe+1.187
 cheaper+1.158
 saf+1.089
Neg logit contributions
leaf-1.807
ting-1.793
orry-1.691
 ear-1.672
unk-1.659
Token
Token ID198
Feature activation+8.92e-01

Pos logit contributions
 ne+0.524
 coll+0.493
 swe+0.492
 cheaper+0.486
 One+0.482
Neg logit contributions
leaf-0.832
ting-0.814
orry-0.779
unk-0.770
 ear-0.757
TokenHe
Token ID1544
Feature activation+9.94e-01

Pos logit contributions
 ne+1.056
 coll+1.019
 swe+1.001
 cheaper+0.971
 One+0.961
Neg logit contributions
leaf-1.609
ting-1.564
orry-1.510
unk-1.465
 upper-1.459
Token touched
Token ID12615
Feature activation-1.05e+00

Pos logit contributions
 coll+1.404
 cheaper+1.391
 ne+1.390
 One+1.366
 swe+1.364
Neg logit contributions
leaf-1.506
ting-1.484
orry-1.417
 ear-1.393
unk-1.362
Token the
Token ID262
Feature activation-3.31e+00

Pos logit contributions
leaf+1.871
ting+1.764
orry+1.713
unk+1.702
 upper+1.667
Neg logit contributions
 ne-1.333
 One-1.324
 cheaper-1.247
 swe-1.245
One-1.214
Token keyboard
Token ID10586
Feature activation-8.53e-02

Pos logit contributions
leaf+5.222
ting+4.992
opher+4.942
unk+4.892
asc+4.833
Neg logit contributions
 coll-5.461
 One-5.333
 cheaper-5.157
 swe-5.090
One-5.056
Token,
Token ID11
Feature activation-5.73e-01

Pos logit contributions
leaf+0.145
ting+0.143
orry+0.135
unk+0.133
 ear+0.131
Neg logit contributions
 ne-0.109
 One-0.106
 cheaper-0.103
 swe-0.102
 coll-0.102
Token expecting
Token ID12451
Feature activation-1.07e+00

Pos logit contributions
leaf+0.944
ting+0.918
orry+0.881
unk+0.875
 upper+0.842
Neg logit contributions
 ne-0.784
 One-0.755
 swe-0.734
 cheaper-0.733
 coll-0.722
Token the
Token ID262
Feature activation-2.68e+00

Pos logit contributions
leaf+1.955
ting+1.941
unk+1.858
orry+1.848
asp+1.784
Neg logit contributions
 ne-1.243
 One-1.234
 cheaper-1.199
 swe-1.163
 coll-1.135
Token computer
Token ID3644
Feature activation+3.78e-01

Pos logit contributions
ting+4.102
leaf+4.065
orry+3.910
unk+3.891
asp+3.788
Neg logit contributions
 One-4.217
 coll-4.183
 cheaper-4.160
 swe-4.053
One-3.979
Token trick
Token ID6908
Feature activation-1.42e+00

Pos logit contributions
ting+0.965
leaf+0.958
orry+0.886
unk+0.867
 upper+0.860
Neg logit contributions
 ne-0.898
 One-0.874
 coll-0.863
 cheaper-0.859
 swe-0.852
Token!
Token ID0
Feature activation-5.29e-01

Pos logit contributions
leaf+2.637
ting+2.477
orry+2.421
 upper+2.391
unk+2.385
Neg logit contributions
 ne-1.639
 One-1.611
 coll-1.571
 cheaper-1.569
 swe-1.540
Token She
Token ID1375
Feature activation-1.26e-01

Pos logit contributions
leaf+1.012
ting+0.999
unk+0.947
orry+0.940
 upper+0.914
Neg logit contributions
 One-0.592
 ne-0.566
 coll-0.532
 cheaper-0.531
One-0.530
Token stood
Token ID6204
Feature activation+1.13e-01

Pos logit contributions
leaf+0.211
ting+0.210
orry+0.195
unk+0.194
 ear+0.191
Neg logit contributions
 ne-0.167
 One-0.161
 cheaper-0.157
 swe-0.156
 coll-0.156
Token on
Token ID319
Feature activation+2.83e-01

Pos logit contributions
 ne+0.183
 One+0.178
 cheaper+0.175
 coll+0.174
 swe+0.174
Neg logit contributions
leaf-0.155
ting-0.154
orry-0.141
unk-0.140
 ear-0.138
Token the
Token ID262
Feature activation-3.31e+00

Pos logit contributions
 ne+0.385
 One+0.371
 cheaper+0.363
 swe+0.360
 coll+0.358
Neg logit contributions
leaf-0.459
ting-0.455
orry-0.424
 ear-0.420
unk-0.418
Token stage
Token ID3800
Feature activation-9.14e-01

Pos logit contributions
udge+5.526
asing+5.456
ting+5.403
orry+5.389
unk+5.325
Neg logit contributions
 coll-4.856
 ne-4.855
 saf-4.812
 cheaper-4.728
 swe-4.703
Token,
Token ID11
Feature activation-1.16e+00

Pos logit contributions
leaf+1.692
ting+1.631
unk+1.547
orry+1.547
 upper+1.506
Neg logit contributions
 ne-1.093
 One-1.056
 coll-1.013
 cheaper-0.998
 swe-0.987
Token holding
Token ID4769
Feature activation-2.00e+00

Pos logit contributions
leaf+1.990
unk+1.875
ting+1.873
orry+1.870
other+1.782
Neg logit contributions
 ne-1.569
 One-1.460
 swe-1.439
 cheaper-1.417
 coll-1.414
Token a
Token ID257
Feature activation-1.22e+00

Pos logit contributions
leaf+3.879
ting+3.759
unk+3.478
orry+3.458
coon+3.408
Neg logit contributions
 ne-2.334
 One-2.189
 swe-2.172
 coll-2.147
 enough-2.140
Token shiny
Token ID22441
Feature activation-6.69e-01

Pos logit contributions
ting+2.305
leaf+2.216
orry+2.113
unk+2.040
other+2.028
Neg logit contributions
 ne-1.485
 cheaper-1.412
 coll-1.409
 swe-1.378
 One-1.333
TokenJohn
Token ID7554
Feature activation-2.11e-01

Pos logit contributions
leaf+0.405
ting+0.403
orry+0.372
unk+0.371
 ear+0.366
Neg logit contributions
 ne-0.477
 One-0.468
One-0.457
 cheaper-0.455
 swe-0.450
Token nodded
Token ID14464
Feature activation+7.22e-01

Pos logit contributions
leaf+0.338
ting+0.334
orry+0.310
unk+0.306
 ear+0.302
Neg logit contributions
 ne-0.296
 One-0.286
 cheaper-0.276
 swe-0.276
 coll-0.274
Token.
Token ID13
Feature activation-8.61e-01

Pos logit contributions
 ne+1.022
 cheaper+1.006
 One+1.004
 coll+1.004
One+0.977
Neg logit contributions
ting-1.095
leaf-1.087
 ear-1.019
unk-0.999
orry-0.993
Token He
Token ID679
Feature activation-6.74e-01

Pos logit contributions
leaf+1.781
ting+1.751
orry+1.683
unk+1.649
 upper+1.603
Neg logit contributions
 One-0.864
 ne-0.819
 cheaper-0.753
 swe-0.747
One-0.743
Token grabbed
Token ID13176
Feature activation-1.58e+00

Pos logit contributions
ting+1.069
leaf+1.051
orry+0.963
unk+0.957
 ear+0.945
Neg logit contributions
 ne-0.993
 One-0.945
 swe-0.916
 cheaper-0.910
One-0.899
Token the
Token ID262
Feature activation-3.31e+00

Pos logit contributions
leaf+3.037
ting+2.975
orry+2.893
unk+2.859
 upper+2.806
Neg logit contributions
 ne-1.813
 One-1.767
 swe-1.653
 cheaper-1.637
 enough-1.609
Token cricket
Token ID18836
Feature activation-1.20e-01

Pos logit contributions
leaf+4.496
ting+4.450
orry+4.184
idget+4.045
asc+4.005
Neg logit contributions
 One-6.076
 coll-5.944
 swe-5.832
One-5.782
 ne-5.748
Token and
Token ID290
Feature activation-9.38e-01

Pos logit contributions
leaf+0.202
ting+0.199
orry+0.187
unk+0.184
 ear+0.182
Neg logit contributions
 ne-0.156
 One-0.150
 cheaper-0.146
 swe-0.146
 coll-0.145
Token put
Token ID1234
Feature activation-1.62e+00

Pos logit contributions
leaf+1.510
ting+1.487
orry+1.425
unk+1.420
 upper+1.377
Neg logit contributions
 ne-1.369
 swe-1.263
 One-1.250
 cheaper-1.227
 coll-1.223
Token it
Token ID340
Feature activation-1.22e-01

Pos logit contributions
leaf+3.016
ting+2.837
orry+2.797
unk+2.769
 upper+2.737
Neg logit contributions
 ne-2.006
 One-1.902
 swe-1.811
 cheaper-1.802
 enough-1.785
Token carefully
Token ID7773
Feature activation+7.55e-01

Pos logit contributions
leaf+0.146
ting+0.142
orry+0.130
unk+0.128
 ear+0.125
Neg logit contributions
 ne-0.220
 One-0.213
 cheaper-0.210
 swe-0.209
 coll-0.208
TokenThank
Token ID10449
Feature activation-2.92e-01

Pos logit contributions
 ne+1.176
 One+1.144
 coll+1.137
 cheaper+1.132
 swe+1.120
Neg logit contributions
leaf-1.103
ting-1.084
orry-0.978
 upper-0.971
 ear-0.970
Token you
Token ID345
Feature activation+7.41e-01

Pos logit contributions
leaf+0.339
ting+0.333
orry+0.305
unk+0.300
 upper+0.293
Neg logit contributions
 ne-0.537
 One-0.522
 coll-0.509
One-0.509
 cheaper-0.508
Token,
Token ID11
Feature activation+1.88e-01

Pos logit contributions
 ne+0.821
 cheaper+0.772
 One+0.769
 swe+0.753
 coll+0.753
Neg logit contributions
ting-1.367
leaf-1.363
orry-1.302
unk-1.291
 ear-1.290
Token Sara
Token ID24799
Feature activation+1.13e+00

Pos logit contributions
 ne+0.248
 One+0.240
 coll+0.233
 cheaper+0.233
 swe+0.232
Neg logit contributions
leaf-0.314
ting-0.310
orry-0.291
unk-0.288
 ear-0.286
Token,
Token ID11
Feature activation-7.45e-01

Pos logit contributions
 ne+1.646
 One+1.606
 cheaper+1.602
One+1.576
 swe+1.566
Neg logit contributions
ting-1.673
leaf-1.649
unk-1.581
 ear-1.571
orry-1.552
Token this
Token ID428
Feature activation+1.93e+00

Pos logit contributions
leaf+1.133
ting+1.106
unk+1.051
orry+1.042
 upper+1.031
Neg logit contributions
 ne-1.104
 One-1.090
 coll-1.045
 cheaper-1.039
 swe-1.037
Token is
Token ID318
Feature activation-5.31e-03

Pos logit contributions
 ne+2.570
 coll+2.551
 swe+2.530
 cheaper+2.511
 One+2.506
Neg logit contributions
leaf-3.125
unk-3.027
orry-3.014
ting-2.966
 ear-2.918
Token a
Token ID257
Feature activation+6.55e-01

Pos logit contributions
leaf+0.009
ting+0.009
orry+0.008
unk+0.008
 ear+0.008
Neg logit contributions
 ne-0.007
 One-0.007
 cheaper-0.007
 swe-0.007
 coll-0.007
Token very
Token ID845
Feature activation+5.57e-01

Pos logit contributions
 ne+0.815
 One+0.805
 coll+0.766
 swe+0.764
 cheaper+0.762
Neg logit contributions
leaf-1.120
ting-1.082
 ear-1.046
orry-1.039
unk-1.025
Token nice
Token ID3621
Feature activation+1.02e+00

Pos logit contributions
 ne+0.727
 One+0.710
 coll+0.692
 swe+0.682
 cheaper+0.678
Neg logit contributions
leaf-0.929
ting-0.884
orry-0.866
 ear-0.852
unk-0.847
Token pe
Token ID613
Feature activation+1.02e+00

Pos logit contributions
 ne+1.535
 One+1.487
 swe+1.479
 cheaper+1.471
 coll+1.470
Neg logit contributions
leaf-1.498
ting-1.451
 ear-1.424
unk-1.388
orry-1.380
Tokenily
Token ID813
Feature activation+4.39e-01

Pos logit contributions
leaf+0.320
ting+0.316
orry+0.254
 ear+0.253
unk+0.246
Neg logit contributions
 ne-1.076
 One-1.057
 cheaper-1.044
 coll-1.035
 swe-1.034
Token felt
Token ID2936
Feature activation+8.12e-01

Pos logit contributions
 ne+0.612
 One+0.600
 cheaper+0.595
 coll+0.589
 swe+0.583
Neg logit contributions
leaf-0.681
ting-0.676
orry-0.637
unk-0.628
 ear-0.621
Token sorry
Token ID7926
Feature activation+7.12e-01

Pos logit contributions
 ne+1.143
 coll+1.115
 One+1.087
 cheaper+1.081
 saf+1.073
Neg logit contributions
leaf-1.256
ting-1.254
 ear-1.178
orry-1.163
unk-1.134
Token for
Token ID329
Feature activation+1.27e-01

Pos logit contributions
 ne+1.151
 cheaper+1.088
 One+1.080
 coll+1.078
One+1.067
Neg logit contributions
ting-0.990
leaf-0.953
orry-0.907
 ear-0.901
unk-0.878
Token the
Token ID262
Feature activation+1.71e-01

Pos logit contributions
 ne+0.186
 One+0.180
 cheaper+0.177
 coll+0.176
 swe+0.175
Neg logit contributions
leaf-0.192
ting-0.192
orry-0.177
unk-0.175
 ear-0.174
Token little
Token ID1310
Feature activation+1.91e+00

Pos logit contributions
 ne+0.267
 One+0.260
 cheaper+0.254
 coll+0.253
 swe+0.252
Neg logit contributions
leaf-0.245
ting-0.240
orry-0.224
unk-0.221
 ear-0.218
Token boy
Token ID2933
Feature activation+6.03e-01

Pos logit contributions
One+3.056
 One+3.036
 coll+2.989
 ne+2.987
 former+2.933
Neg logit contributions
ting-2.542
leaf-2.515
 ear-2.365
 finger-2.306
 hand-2.300
Token and
Token ID290
Feature activation+8.30e-01

Pos logit contributions
 ne+0.828
 One+0.786
 cheaper+0.777
 coll+0.776
One+0.775
Neg logit contributions
ting-0.977
leaf-0.959
orry-0.903
 ear-0.898
unk-0.890
Token said
Token ID531
Feature activation+1.01e+00

Pos logit contributions
 ne+1.164
 One+1.153
One+1.143
 coll+1.130
 cheaper+1.129
Neg logit contributions
ting-1.280
leaf-1.273
 ear-1.192
orry-1.174
unk-1.135
Token,
Token ID11
Feature activation-4.28e-01

Pos logit contributions
 ne+1.096
 coll+1.067
 cheaper+1.050
 One+1.041
One+1.036
Neg logit contributions
ting-1.882
leaf-1.847
 ear-1.786
orry-1.771
unk-1.756
Token "
Token ID366
Feature activation+4.04e-01

Pos logit contributions
leaf+0.602
ting+0.573
orry+0.549
unk+0.538
 upper+0.524
Neg logit contributions
 ne-0.686
 One-0.676
 cheaper-0.650
 swe-0.649
 coll-0.644
Token to
Token ID284
Feature activation-1.04e+00

Pos logit contributions
leaf+0.103
ting+0.102
orry+0.095
unk+0.094
 ear+0.093
Neg logit contributions
 ne-0.086
 One-0.083
 cheaper-0.081
 coll-0.081
 swe-0.081
Token see
Token ID766
Feature activation-6.74e-01

Pos logit contributions
ting+1.781
leaf+1.748
unk+1.653
orry+1.639
rified+1.609
Neg logit contributions
 One-1.398
 ne-1.350
One-1.285
 saf-1.251
 cheaper-1.234
Token the
Token ID262
Feature activation+9.63e-02

Pos logit contributions
leaf+1.112
ting+1.088
orry+1.048
unk+1.042
 upper+1.024
Neg logit contributions
 ne-0.901
 One-0.885
 cheaper-0.871
 swe-0.869
 coll-0.861
Token stars
Token ID5788
Feature activation+1.04e+00

Pos logit contributions
 ne+0.130
 One+0.125
 cheaper+0.122
 coll+0.122
 swe+0.121
Neg logit contributions
leaf-0.159
ting-0.157
orry-0.147
unk-0.145
 ear-0.144
Token and
Token ID290
Feature activation-3.71e-01

Pos logit contributions
 One+1.635
 ne+1.632
One+1.597
 coll+1.588
 cheaper+1.588
Neg logit contributions
ting-1.468
leaf-1.403
 ear-1.267
orry-1.261
unk-1.247
Token couldn
Token ID3521
Feature activation+1.85e+00

Pos logit contributions
leaf+0.592
ting+0.585
orry+0.553
unk+0.552
 upper+0.533
Neg logit contributions
 ne-0.521
 One-0.501
 swe-0.490
 cheaper-0.489
 coll-0.488
Token't
Token ID470
Feature activation-5.20e-01

Pos logit contributions
 ne+3.572
 coll+3.515
 swe+3.511
 cheaper+3.468
 scar+3.446
Neg logit contributions
ting-2.043
leaf-2.009
unk-1.828
orry-1.816
 Stop-1.750
Token wait
Token ID4043
Feature activation+4.67e-01

Pos logit contributions
leaf+0.653
ting+0.651
unk+0.600
orry+0.588
 upper+0.556
Neg logit contributions
 ne-0.902
 One-0.893
One-0.860
 cheaper-0.854
 coll-0.840
Token to
Token ID284
Feature activation-9.42e-01

Pos logit contributions
 ne+0.645
 One+0.618
 cheaper+0.608
 coll+0.605
 swe+0.601
Neg logit contributions
leaf-0.751
ting-0.740
orry-0.690
 ear-0.685
unk-0.681
Token tell
Token ID1560
Feature activation+1.07e-01

Pos logit contributions
ting+1.590
leaf+1.585
unk+1.510
orry+1.496
 upper+1.435
Neg logit contributions
 ne-1.258
 One-1.235
 saf-1.160
One-1.155
 swe-1.148
Token her
Token ID607
Feature activation+3.35e-01

Pos logit contributions
 ne+0.143
 One+0.138
 cheaper+0.135
 coll+0.134
 swe+0.133
Neg logit contributions
leaf-0.177
ting-0.175
orry-0.164
unk-0.162
 ear-0.161
Token back
Token ID736
Feature activation+5.61e-01

Pos logit contributions
 ne+2.237
 One+2.205
One+2.173
 cheaper+2.156
 coll+2.139
Neg logit contributions
ting-1.476
leaf-1.419
 ear-1.282
unk-1.253
other-1.211
Token to
Token ID284
Feature activation+1.09e-01

Pos logit contributions
 ne+0.879
 One+0.854
 cheaper+0.840
 coll+0.835
 swe+0.833
Neg logit contributions
leaf-0.803
ting-0.800
unk-0.724
orry-0.723
 ear-0.719
Token living
Token ID2877
Feature activation-1.59e+00

Pos logit contributions
 ne+0.155
 One+0.150
 cheaper+0.147
 swe+0.146
 coll+0.146
Neg logit contributions
leaf-0.170
ting-0.168
orry-0.156
unk-0.154
 ear-0.153
Token happily
Token ID18177
Feature activation+4.86e-01

Pos logit contributions
leaf+2.652
ting+2.518
orry+2.428
unk+2.311
 upper+2.306
Neg logit contributions
 ne-2.248
 One-2.104
 swe-2.087
 coll-2.082
 enough-2.053
Token in
Token ID287
Feature activation+3.99e-01

Pos logit contributions
 ne+0.724
 One+0.712
 cheaper+0.697
 swe+0.691
 coll+0.688
Neg logit contributions
leaf-0.722
ting-0.716
orry-0.659
unk-0.655
 ear-0.654
Token the
Token ID262
Feature activation+1.94e+00

Pos logit contributions
 ne+0.568
 One+0.547
 cheaper+0.540
 swe+0.537
 coll+0.535
Neg logit contributions
leaf-0.623
ting-0.616
orry-0.572
 ear-0.572
unk-0.570
Token stable
Token ID8245
Feature activation+2.24e-01

Pos logit contributions
 ne+2.405
 coll+2.310
 cheaper+2.274
 Nib+2.274
 One+2.188
Neg logit contributions
ting-3.320
 ear-3.230
leaf-3.131
 upper-3.033
other-3.015
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
 ne+0.297
 One+0.287
 cheaper+0.281
 swe+0.280
 coll+0.278
Neg logit contributions
leaf-0.368
ting-0.366
orry-0.342
unk-0.339
 ear-0.336
Token<|endoftext|>
Token ID50256
Feature activation+1.37e-01

Pos logit contributions
leaf+0.912
ting+0.904
orry+0.852
unk+0.849
 upper+0.845
Neg logit contributions
 One-0.479
 ne-0.469
One-0.434
 cheaper-0.432
 coll-0.431
TokenOnce
Token ID7454
Feature activation-1.78e-01

Pos logit contributions
 ne+0.144
 One+0.137
 cheaper+0.135
 coll+0.134
 swe+0.134
Neg logit contributions
leaf-0.263
ting-0.261
orry-0.246
unk-0.245
 ear-0.241
Token there
Token ID612
Feature activation+2.50e-02

Pos logit contributions
leaf+0.295
ting+0.291
orry+0.270
unk+0.267
 ear+0.265
Neg logit contributions
 ne-0.237
 One-0.232
 cheaper-0.224
 swe-0.222
 coll-0.221
Token wet
Token ID9583
Feature activation-1.63e-01

Pos logit contributions
 ne+1.096
 One+1.081
 coll+1.066
 swe+1.042
One+1.042
Neg logit contributions
leaf-1.219
ting-1.192
 ear-1.134
orry-1.123
unk-1.116
Token.
Token ID13
Feature activation-7.59e-01

Pos logit contributions
leaf+0.271
ting+0.263
orry+0.250
unk+0.247
 ear+0.242
Neg logit contributions
 ne-0.220
 One-0.213
 cheaper-0.207
 swe-0.206
 coll-0.205
Token
Token ID220
Feature activation-7.45e-01

Pos logit contributions
leaf+1.563
ting+1.530
orry+1.456
unk+1.455
 upper+1.428
Neg logit contributions
 One-0.788
 ne-0.734
One-0.697
 swe-0.669
 coll-0.653
Token
Token ID198
Feature activation-4.11e-02

Pos logit contributions
leaf+1.574
ting+1.556
orry+1.453
 ear+1.444
unk+1.438
Neg logit contributions
 One-0.698
 ne-0.654
One-0.644
 cheaper-0.609
 swe-0.595
Token
Token ID198
Feature activation-5.41e-01

Pos logit contributions
leaf+0.079
ting+0.078
orry+0.074
unk+0.073
 ear+0.073
Neg logit contributions
 ne-0.043
 One-0.042
 cheaper-0.040
 coll-0.040
 swe-0.040
TokenThe
Token ID464
Feature activation+1.91e+00

Pos logit contributions
leaf+1.012
ting+0.999
unk+0.949
orry+0.943
 ear+0.922
Neg logit contributions
 ne-0.631
 One-0.630
One-0.615
 swe-0.579
 cheaper-0.575
Token girl
Token ID2576
Feature activation+5.77e-02

Pos logit contributions
 One+3.385
 ne+3.370
 coll+3.299
 cheaper+3.286
 swe+3.255
Neg logit contributions
ting-2.228
leaf-2.124
 ear-2.118
orry-2.014
unk-1.950
Token was
Token ID373
Feature activation-2.08e-01

Pos logit contributions
 ne+0.090
 One+0.088
 cheaper+0.086
 coll+0.086
 swe+0.086
Neg logit contributions
leaf-0.082
ting-0.081
orry-0.075
unk-0.074
 ear-0.073
Token very
Token ID845
Feature activation-4.50e-01

Pos logit contributions
leaf+0.304
ting+0.299
orry+0.277
unk+0.271
 upper+0.267
Neg logit contributions
 ne-0.323
 One-0.313
 cheaper-0.307
 swe-0.304
 coll-0.302
Token excited
Token ID6568
Feature activation-7.62e-01

Pos logit contributions
ting+0.701
leaf+0.701
orry+0.643
unk+0.631
 upper+0.619
Neg logit contributions
 ne-0.665
 One-0.636
 cheaper-0.625
 swe-0.620
One-0.608
Token.
Token ID13
Feature activation-7.74e-01

Pos logit contributions
leaf+1.403
ting+1.362
orry+1.312
unk+1.288
 upper+1.276
Neg logit contributions
 One-0.913
 ne-0.908
One-0.847
 swe-0.843
 cheaper-0.841
Token smiling
Token ID16755
Feature activation-4.14e-01

Pos logit contributions
 ne+1.231
 One+1.198
 cheaper+1.190
 coll+1.180
 swe+1.173
Neg logit contributions
ting-0.900
leaf-0.851
 ear-0.796
orry-0.781
unk-0.780
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
leaf+0.702
ting+0.675
orry+0.646
unk+0.637
 upper+0.623
Neg logit contributions
 ne-0.549
 One-0.531
 swe-0.518
 cheaper-0.516
 coll-0.507
Token
Token ID220
Feature activation-8.28e-01

Pos logit contributions
leaf+0.324
ting+0.318
orry+0.299
unk+0.297
 upper+0.291
Neg logit contributions
 ne-0.222
 One-0.219
 cheaper-0.208
 swe-0.208
 coll-0.207
Token
Token ID198
Feature activation-2.83e-01

Pos logit contributions
leaf+1.761
ting+1.738
orry+1.613
 ear+1.611
 upper+1.602
Neg logit contributions
 One-0.777
 ne-0.719
One-0.704
 cheaper-0.672
 swe-0.657
Token
Token ID198
Feature activation-2.26e-01

Pos logit contributions
leaf+0.563
ting+0.557
orry+0.526
unk+0.522
 ear+0.518
Neg logit contributions
 ne-0.284
 One-0.281
One-0.265
 cheaper-0.263
 swe-0.259
TokenWhen
Token ID2215
Feature activation+1.59e+00

Pos logit contributions
leaf+0.412
ting+0.406
orry+0.381
unk+0.379
 ear+0.374
Neg logit contributions
 ne-0.269
 One-0.262
One-0.252
 cheaper-0.251
 swe-0.249
Token they
Token ID484
Feature activation-7.36e-01

Pos logit contributions
 ne+1.975
 cheaper+1.958
 coll+1.912
 swe+1.904
One+1.893
Neg logit contributions
ting-2.776
leaf-2.578
 ear-2.574
orry-2.529
unk-2.442
Token arrived
Token ID5284
Feature activation-8.61e-02

Pos logit contributions
leaf+1.372
ting+1.359
unk+1.293
orry+1.270
 upper+1.253
Neg logit contributions
 ne-0.868
 One-0.843
One-0.783
 cheaper-0.774
 saf-0.759
Token at
Token ID379
Feature activation+6.50e-02

Pos logit contributions
leaf+0.112
ting+0.110
orry+0.101
unk+0.099
 ear+0.098
Neg logit contributions
 ne-0.146
 One-0.142
 cheaper-0.140
 swe-0.139
 coll-0.139
Token the
Token ID262
Feature activation-8.32e-01

Pos logit contributions
 ne+0.093
 One+0.090
 cheaper+0.088
 swe+0.087
 coll+0.087
Neg logit contributions
leaf-0.101
ting-0.100
orry-0.093
unk-0.092
 ear-0.091
Token museum
Token ID13257
Feature activation+3.08e-01

Pos logit contributions
leaf+1.554
ting+1.551
unk+1.469
orry+1.449
 ear+1.421
Neg logit contributions
 ne-0.946
 One-0.940
 cheaper-0.903
 saf-0.900
One-0.895
TokenBut
Token ID1537
Feature activation+5.30e-01

Pos logit contributions
leaf+0.426
ting+0.420
orry+0.396
unk+0.390
 ear+0.385
Neg logit contributions
 ne-0.454
 One-0.441
 cheaper-0.432
One-0.431
 coll-0.428
Token you
Token ID345
Feature activation-4.03e-02

Pos logit contributions
 ne+0.809
 One+0.789
 coll+0.775
 cheaper+0.767
 swe+0.761
Neg logit contributions
ting-0.767
leaf-0.765
orry-0.705
 ear-0.702
unk-0.690
Token need
Token ID761
Feature activation+1.39e-01

Pos logit contributions
leaf+0.066
ting+0.065
orry+0.061
unk+0.060
 ear+0.059
Neg logit contributions
 ne-0.055
 One-0.053
 cheaper-0.052
 swe-0.051
 coll-0.051
Token to
Token ID284
Feature activation+1.53e+00

Pos logit contributions
 ne+0.182
 One+0.175
 cheaper+0.171
 coll+0.170
 swe+0.169
Neg logit contributions
leaf-0.234
ting-0.230
orry-0.216
unk-0.213
 ear-0.212
Token share
Token ID2648
Feature activation-5.16e-01

Pos logit contributions
 ne+2.388
 coll+2.306
 cheaper+2.304
 One+2.286
 swe+2.270
Neg logit contributions
leaf-2.078
 ear-2.047
ting-2.036
orry-1.927
 upper-1.908
Token and
Token ID290
Feature activation+6.04e-01

Pos logit contributions
leaf+0.891
ting+0.867
orry+0.821
unk+0.811
 upper+0.790
Neg logit contributions
 ne-0.670
 One-0.651
 swe-0.642
 cheaper-0.640
 coll-0.633
Token be
Token ID307
Feature activation+8.86e-01

Pos logit contributions
 ne+0.784
 One+0.766
 cheaper+0.741
 coll+0.734
 swe+0.731
Neg logit contributions
leaf-1.012
ting-0.980
 ear-0.931
orry-0.919
 upper-0.904
Token nice
Token ID3621
Feature activation-4.24e-01

Pos logit contributions
 ne+1.391
 One+1.387
 coll+1.332
 cheaper+1.319
 saf+1.317
Neg logit contributions
leaf-1.232
ting-1.195
 ear-1.144
orry-1.117
unk-1.101
Token to
Token ID284
Feature activation-1.08e-01

Pos logit contributions
leaf+0.727
ting+0.713
orry+0.678
unk+0.663
 upper+0.651
Neg logit contributions
 ne-0.549
 One-0.532
 cheaper-0.521
 swe-0.519
 coll-0.513
Token each
Token ID1123
Feature activation-1.55e+00

Pos logit contributions
leaf+0.155
ting+0.152
orry+0.141
unk+0.139
 upper+0.136
Neg logit contributions
 ne-0.169
 One-0.166
 cheaper-0.161
 coll-0.161
 swe-0.160
Token other
Token ID584
Feature activation+1.90e-01

Pos logit contributions
leaf+2.333
ting+2.188
orry+2.175
unk+2.116
opher+2.046
Neg logit contributions
 coll-2.342
 One-2.329
One-2.303
 swe-2.253
 ne-2.233
Token voice
Token ID3809
Feature activation+5.10e-01

Pos logit contributions
 ne+0.116
 One+0.112
 cheaper+0.110
 coll+0.109
 swe+0.109
Neg logit contributions
leaf-0.109
ting-0.108
orry-0.100
unk-0.099
 ear-0.098
Token was
Token ID373
Feature activation-2.14e-01

Pos logit contributions
 ne+0.704
 One+0.698
 coll+0.683
 cheaper+0.679
One+0.673
Neg logit contributions
leaf-0.799
ting-0.797
orry-0.737
unk-0.724
 ear-0.721
Token gentle
Token ID10296
Feature activation-3.62e-01

Pos logit contributions
leaf+0.328
ting+0.324
orry+0.302
unk+0.297
 upper+0.291
Neg logit contributions
 ne-0.317
 One-0.306
 cheaper-0.301
 swe-0.298
 coll-0.295
Token,
Token ID11
Feature activation+3.61e-01

Pos logit contributions
leaf+0.633
ting+0.614
orry+0.588
unk+0.578
 upper+0.568
Neg logit contributions
 ne-0.458
 One-0.443
 cheaper-0.428
 swe-0.427
 coll-0.423
Token so
Token ID523
Feature activation+7.37e-01

Pos logit contributions
 ne+0.541
 One+0.526
 cheaper+0.515
 coll+0.514
One+0.510
Neg logit contributions
leaf-0.530
ting-0.530
orry-0.482
 ear-0.476
unk-0.476
Token he
Token ID339
Feature activation+1.92e-01

Pos logit contributions
 ne+0.901
 One+0.895
 coll+0.881
 cheaper+0.868
One+0.864
Neg logit contributions
ting-1.269
leaf-1.250
 ear-1.169
orry-1.169
unk-1.147
Token agreed
Token ID4987
Feature activation-1.89e-01

Pos logit contributions
 ne+0.266
 One+0.261
 cheaper+0.254
 coll+0.253
 swe+0.251
Neg logit contributions
leaf-0.305
ting-0.301
orry-0.281
unk-0.277
 ear-0.275
Token.
Token ID13
Feature activation+3.44e-02

Pos logit contributions
leaf+0.324
ting+0.319
orry+0.301
unk+0.298
 ear+0.293
Neg logit contributions
 ne-0.244
 One-0.236
 cheaper-0.229
 swe-0.228
 coll-0.226
Token
Token ID220
Feature activation-1.59e-01

Pos logit contributions
 ne+0.040
 One+0.038
 cheaper+0.037
 coll+0.037
 swe+0.037
Neg logit contributions
leaf-0.063
ting-0.062
orry-0.059
unk-0.058
 ear-0.057
Token
Token ID198
Feature activation-3.35e-02

Pos logit contributions
leaf+0.309
ting+0.306
orry+0.289
unk+0.285
 ear+0.285
Neg logit contributions
 ne-0.164
 One-0.160
 cheaper-0.153
 coll-0.151
 swe-0.151
TokenThe
Token ID464
Feature activation+1.23e+00

Pos logit contributions
leaf+0.059
ting+0.059
orry+0.055
unk+0.055
 ear+0.054
Neg logit contributions
 ne-0.041
 One-0.039
 cheaper-0.038
 coll-0.038
 swe-0.038
Tokenily
Token ID813
Feature activation+6.48e-01

Pos logit contributions
ting+1.940
leaf+1.750
 upper+1.559
 ear+1.455
 expecting+1.437
Neg logit contributions
 ne-3.442
 One-3.382
 cheaper-3.267
One-3.248
 coll-3.245
Token looked
Token ID3114
Feature activation-5.85e-01

Pos logit contributions
 ne+0.865
 One+0.841
 cheaper+0.832
 coll+0.823
One+0.820
Neg logit contributions
leaf-1.049
ting-1.041
orry-0.983
unk-0.974
 ear-0.965
Token up
Token ID510
Feature activation+2.60e-01

Pos logit contributions
leaf+0.852
ting+0.834
unk+0.780
orry+0.779
 upper+0.759
Neg logit contributions
 ne-0.910
 One-0.880
 swe-0.873
 cheaper-0.872
 coll-0.851
Token and
Token ID290
Feature activation-3.15e-01

Pos logit contributions
 ne+0.419
 One+0.407
 cheaper+0.401
 coll+0.400
 swe+0.398
Neg logit contributions
leaf-0.352
ting-0.350
orry-0.320
unk-0.317
 ear-0.315
Token saw
Token ID2497
Feature activation-7.25e-01

Pos logit contributions
leaf+0.510
ting+0.501
orry+0.472
unk+0.471
 upper+0.460
Neg logit contributions
 ne-0.437
 One-0.419
 swe-0.410
 cheaper-0.409
 coll-0.403
Token a
Token ID257
Feature activation+4.31e-01

Pos logit contributions
leaf+1.145
ting+1.115
orry+1.064
unk+1.056
 upper+1.048
Neg logit contributions
 ne-1.030
 One-0.998
 cheaper-0.986
 swe-0.982
 coll-0.969
Token friendly
Token ID8030
Feature activation-3.14e-01

Pos logit contributions
 ne+0.596
 One+0.587
 cheaper+0.561
One+0.559
 coll+0.559
Neg logit contributions
leaf-0.678
ting-0.666
orry-0.625
unk-0.623
 ear-0.618
Token butterfly
Token ID35113
Feature activation+1.47e-01

Pos logit contributions
leaf+0.468
ting+0.461
orry+0.430
unk+0.422
 upper+0.417
Neg logit contributions
 ne-0.473
 One-0.456
 coll-0.447
 cheaper-0.446
 swe-0.446
Token hovering
Token ID33627
Feature activation-8.92e-01

Pos logit contributions
 ne+0.213
 One+0.206
 cheaper+0.202
 swe+0.200
 coll+0.200
Neg logit contributions
leaf-0.226
ting-0.225
orry-0.208
unk-0.206
 ear-0.204
Token above
Token ID2029
Feature activation-3.35e-01

Pos logit contributions
leaf+1.056
ting+0.989
orry+0.974
unk+0.943
 upper+0.919
Neg logit contributions
 ne-1.658
 One-1.603
One-1.576
 swe-1.550
 cheaper-1.545
Token.
Token ID13
Feature activation-6.18e-02

Pos logit contributions
leaf+0.562
ting+0.546
orry+0.520
unk+0.514
 upper+0.505
Neg logit contributions
 ne-0.442
 One-0.433
 swe-0.416
 coll-0.415
 cheaper-0.413
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
leaf+1.138
ting+1.099
orry+1.044
unk+1.027
 upper+1.015
Neg logit contributions
 ne-0.733
 One-0.710
 swe-0.682
 cheaper-0.676
 coll-0.670
Token "
Token ID366
Feature activation-2.76e-01

Pos logit contributions
 ne+0.210
 One+0.202
 cheaper+0.199
 coll+0.198
 swe+0.198
Neg logit contributions
leaf-0.232
ting-0.230
orry-0.214
unk-0.212
 ear-0.210
TokenBut
Token ID1537
Feature activation-2.72e-01

Pos logit contributions
leaf+0.390
ting+0.387
orry+0.360
unk+0.358
 ear+0.349
Neg logit contributions
 ne-0.430
 One-0.423
One-0.412
 cheaper-0.410
 coll-0.407
Token I
Token ID314
Feature activation-2.13e-01

Pos logit contributions
leaf+0.386
ting+0.375
orry+0.348
unk+0.344
 upper+0.335
Neg logit contributions
 ne-0.433
 One-0.422
 cheaper-0.412
 swe-0.411
 coll-0.406
Token think
Token ID892
Feature activation-6.07e-01

Pos logit contributions
leaf+0.308
ting+0.307
orry+0.282
unk+0.278
 ear+0.275
Neg logit contributions
 ne-0.329
 One-0.321
 cheaper-0.312
 swe-0.309
 coll-0.308
Token it
Token ID340
Feature activation+7.54e-01

Pos logit contributions
leaf+0.873
ting+0.850
orry+0.787
unk+0.786
 upper+0.761
Neg logit contributions
 ne-0.959
 One-0.940
 cheaper-0.923
 swe-0.920
 coll-0.897
Token belongs
Token ID14448
Feature activation-4.27e-01

Pos logit contributions
 ne+0.908
 One+0.873
 coll+0.867
 cheaper+0.865
One+0.847
Neg logit contributions
ting-1.315
leaf-1.305
orry-1.236
unk-1.234
 ear-1.208
Token to
Token ID284
Feature activation-7.92e-01

Pos logit contributions
leaf+0.759
ting+0.751
orry+0.704
unk+0.692
 ear+0.683
Neg logit contributions
 ne-0.524
 One-0.509
 cheaper-0.497
 swe-0.491
One-0.487
Token someone
Token ID2130
Feature activation-8.92e-01

Pos logit contributions
leaf+1.253
ting+1.238
orry+1.130
unk+1.129
 upper+1.095
Neg logit contributions
 ne-1.150
 One-1.138
 cheaper-1.098
 swe-1.096
 coll-1.092
Token else
Token ID2073
Feature activation-8.87e-01

Pos logit contributions
leaf+1.338
ting+1.297
orry+1.230
unk+1.211
 upper+1.199
Neg logit contributions
 ne-1.353
 cheaper-1.311
 One-1.305
 swe-1.296
 coll-1.258
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
leaf+1.539
ting+1.498
orry+1.417
unk+1.385
 upper+1.375
Neg logit contributions
 ne-1.134
 One-1.126
 cheaper-1.100
 swe-1.072
 coll-1.058
Token very
Token ID845
Feature activation-4.06e-01

Pos logit contributions
leaf+0.411
ting+0.405
orry+0.378
unk+0.372
 upper+0.366
Neg logit contributions
 ne-0.394
 One-0.379
 cheaper-0.372
 swe-0.370
 coll-0.368
Token happy
Token ID3772
Feature activation-6.31e-02

Pos logit contributions
ting+0.616
leaf+0.613
orry+0.568
unk+0.561
 upper+0.551
Neg logit contributions
 ne-0.612
 One-0.583
 cheaper-0.581
 swe-0.575
 coll-0.565
Token together
Token ID1978
Feature activation-4.44e-01

Pos logit contributions
leaf+0.105
ting+0.103
orry+0.097
unk+0.096
 ear+0.094
Neg logit contributions
 ne-0.084
 One-0.082
 cheaper-0.080
 coll-0.079
 swe-0.079
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
leaf+0.759
ting+0.739
orry+0.705
unk+0.698
 upper+0.687
Neg logit contributions
 ne-0.579
 One-0.560
 cheaper-0.541
 swe-0.536
 coll-0.536
Token They
Token ID1119
Feature activation-5.74e-01

Pos logit contributions
leaf+0.218
ting+0.216
orry+0.206
unk+0.204
 ear+0.201
Neg logit contributions
 ne-0.088
 One-0.085
 cheaper-0.080
 swe-0.079
 coll-0.079
Token welcomed
Token ID15846
Feature activation-9.84e-01

Pos logit contributions
leaf+0.970
ting+0.958
unk+0.912
orry+0.902
 upper+0.889
Neg logit contributions
 ne-0.761
 One-0.725
 swe-0.694
 cheaper-0.693
 coll-0.690
Token each
Token ID1123
Feature activation-2.15e+00

Pos logit contributions
leaf+1.386
ting+1.336
orry+1.323
unk+1.290
 upper+1.260
Neg logit contributions
 ne-1.572
 One-1.536
 swe-1.486
 cheaper-1.478
 coll-1.464
Token other
Token ID584
Feature activation-3.22e-01

Pos logit contributions
leaf+3.349
ting+3.195
unk+3.190
orry+3.189
iders+3.098
Neg logit contributions
 One-3.184
 coll-3.155
 swe-3.085
One-3.066
 ne-3.063
Token into
Token ID656
Feature activation-3.66e-01

Pos logit contributions
leaf+0.511
ting+0.503
orry+0.474
unk+0.468
 upper+0.459
Neg logit contributions
 ne-0.453
 One-0.438
 cheaper-0.429
 swe-0.429
 coll-0.427
Token their
Token ID511
Feature activation+4.06e-01

Pos logit contributions
leaf+0.635
ting+0.630
orry+0.595
unk+0.589
 upper+0.575
Neg logit contributions
 ne-0.463
 One-0.445
 cheaper-0.436
 coll-0.435
 swe-0.433
Token lives
Token ID3160
Feature activation+2.84e-01

Pos logit contributions
 ne+0.601
 One+0.581
 cheaper+0.569
One+0.563
 coll+0.561
Neg logit contributions
leaf-0.604
ting-0.597
orry-0.562
 ear-0.558
unk-0.553
Token picked
Token ID6497
Feature activation-2.66e-01

Pos logit contributions
 ne+0.819
 One+0.813
 cheaper+0.806
 coll+0.801
 swe+0.793
Neg logit contributions
leaf-0.893
ting-0.881
orry-0.831
unk-0.816
 ear-0.814
Token it
Token ID340
Feature activation-2.03e-01

Pos logit contributions
leaf+0.406
ting+0.394
orry+0.374
unk+0.369
 upper+0.363
Neg logit contributions
 ne-0.391
 One-0.381
 cheaper-0.372
 swe-0.369
 coll-0.368
Token up
Token ID510
Feature activation+3.58e-01

Pos logit contributions
leaf+0.267
ting+0.258
orry+0.241
unk+0.236
 ear+0.232
Neg logit contributions
 ne-0.344
 One-0.336
 cheaper-0.328
 swe-0.327
 coll-0.324
Token with
Token ID351
Feature activation-2.02e-01

Pos logit contributions
 ne+0.486
 One+0.474
 coll+0.466
 cheaper+0.463
 swe+0.462
Neg logit contributions
leaf-0.576
ting-0.576
orry-0.534
unk-0.531
 ear-0.527
Token his
Token ID465
Feature activation-3.40e-01

Pos logit contributions
leaf+0.366
ting+0.361
orry+0.342
unk+0.339
 upper+0.331
Neg logit contributions
 ne-0.246
 One-0.237
 cheaper-0.229
 swe-0.228
 coll-0.227
Token mouth
Token ID5422
Feature activation-2.71e-01

Pos logit contributions
leaf+0.485
ting+0.476
orry+0.440
unk+0.428
 upper+0.417
Neg logit contributions
 ne-0.544
 One-0.532
 cheaper-0.520
 coll-0.520
 swe-0.515
Token and
Token ID290
Feature activation+9.54e-02

Pos logit contributions
leaf+0.474
ting+0.462
orry+0.439
unk+0.431
 ear+0.426
Neg logit contributions
 ne-0.342
 One-0.329
 cheaper-0.320
 swe-0.318
 coll-0.315
Token passed
Token ID3804
Feature activation-4.46e-01

Pos logit contributions
 ne+0.134
 One+0.131
 cheaper+0.127
 coll+0.127
 swe+0.126
Neg logit contributions
leaf-0.150
ting-0.149
orry-0.138
unk-0.137
 ear-0.136
Token it
Token ID340
Feature activation-2.74e-02

Pos logit contributions
leaf+0.708
ting+0.689
orry+0.658
unk+0.648
 upper+0.637
Neg logit contributions
 ne-0.642
 One-0.619
 cheaper-0.605
 swe-0.596
 coll-0.593
Token to
Token ID284
Feature activation-1.09e+00

Pos logit contributions
leaf+0.043
ting+0.042
orry+0.039
unk+0.039
 ear+0.038
Neg logit contributions
 ne-0.039
 One-0.038
 cheaper-0.037
 swe-0.037
 coll-0.037
Token the
Token ID262
Feature activation-6.52e-01

Pos logit contributions
leaf+1.888
ting+1.777
unk+1.757
orry+1.735
asp+1.701
Neg logit contributions
 One-1.469
 ne-1.468
 coll-1.380
 cheaper-1.374
 saf-1.362
Token his
Token ID465
Feature activation-1.07e+00

Pos logit contributions
 ne+0.765
 coll+0.750
 One+0.747
One+0.733
 cheaper+0.732
Neg logit contributions
ting-1.296
leaf-1.294
orry-1.208
 ear-1.201
unk-1.182
Token son
Token ID3367
Feature activation-5.33e-01

Pos logit contributions
ting+1.266
leaf+1.217
orry+1.097
unk+1.056
other+1.020
Neg logit contributions
 ne-2.022
 One-1.942
 coll-1.934
 cheaper-1.930
 swe-1.917
Token climbed
Token ID19952
Feature activation+1.40e+00

Pos logit contributions
leaf+0.860
ting+0.838
orry+0.789
unk+0.782
 upper+0.767
Neg logit contributions
 ne-0.751
 One-0.714
 cheaper-0.694
 swe-0.691
 coll-0.688
Token into
Token ID656
Feature activation+1.32e-01

Pos logit contributions
 ne+2.603
 One+2.598
 coll+2.583
 cheaper+2.578
One+2.558
Neg logit contributions
ting-1.679
leaf-1.501
 ear-1.361
unk-1.305
other-1.244
Token one
Token ID530
Feature activation-1.06e+00

Pos logit contributions
 ne+0.186
 One+0.180
 cheaper+0.176
 swe+0.175
 coll+0.175
Neg logit contributions
leaf-0.209
ting-0.206
orry-0.192
unk-0.190
 ear-0.188
Token of
Token ID286
Feature activation-9.10e-01

Pos logit contributions
leaf+1.412
ting+1.366
orry+1.227
unk+1.226
 Stop+1.158
Neg logit contributions
 ne-1.841
 One-1.830
 cheaper-1.786
 swe-1.767
 coll-1.764
Token the
Token ID262
Feature activation-5.72e-01

Pos logit contributions
leaf+1.593
ting+1.553
orry+1.519
unk+1.504
 upper+1.462
Neg logit contributions
 ne-1.167
 One-1.131
 cheaper-1.085
 swe-1.085
 coll-1.073
Token big
Token ID1263
Feature activation+1.88e-01

Pos logit contributions
leaf+0.978
ting+0.959
orry+0.902
unk+0.899
 ear+0.872
Neg logit contributions
 ne-0.753
 One-0.734
 coll-0.717
 cheaper-0.714
 swe-0.714
Token trucks
Token ID13960
Feature activation-7.77e-01

Pos logit contributions
 ne+0.239
 One+0.231
 cheaper+0.226
 coll+0.223
 swe+0.223
Neg logit contributions
leaf-0.319
ting-0.317
orry-0.297
unk-0.294
 ear-0.293
Token.
Token ID13
Feature activation-4.25e-01

Pos logit contributions
leaf+1.389
ting+1.336
orry+1.295
unk+1.281
 ear+1.254
Neg logit contributions
 ne-0.964
 One-0.927
 cheaper-0.889
 coll-0.873
One-0.860
Token But
Token ID887
Feature activation+7.39e-02

Pos logit contributions
leaf+0.827
ting+0.816
orry+0.771
unk+0.766
 upper+0.757
Neg logit contributions
 One-0.455
 ne-0.450
 cheaper-0.415
One-0.412
 swe-0.410
Token me
Token ID502
Feature activation+4.01e-01

Pos logit contributions
leaf+1.251
ting+1.192
orry+1.134
unk+1.099
 upper+1.063
Neg logit contributions
 ne-1.591
 One-1.576
One-1.524
 cheaper-1.523
 swe-1.518
Token test
Token ID1332
Feature activation-3.35e-01

Pos logit contributions
 ne+0.626
 One+0.606
 coll+0.596
 cheaper+0.593
 swe+0.592
Neg logit contributions
leaf-0.563
ting-0.557
orry-0.514
unk-0.512
 ear-0.509
Token for
Token ID329
Feature activation-2.55e-01

Pos logit contributions
leaf+0.532
ting+0.521
orry+0.491
unk+0.486
 upper+0.475
Neg logit contributions
 ne-0.472
 One-0.459
 cheaper-0.452
 swe-0.449
 coll-0.446
Token my
Token ID616
Feature activation+2.26e-01

Pos logit contributions
leaf+0.389
ting+0.383
orry+0.358
unk+0.354
 upper+0.344
Neg logit contributions
 ne-0.378
 One-0.368
 cheaper-0.360
 swe-0.356
 coll-0.355
Token bracelet
Token ID42893
Feature activation+1.08e+00

Pos logit contributions
 ne+0.284
 One+0.275
 cheaper+0.267
 coll+0.266
 swe+0.264
Neg logit contributions
leaf-0.385
ting-0.381
orry-0.358
 ear-0.355
unk-0.355
Token?"
Token ID1701
Feature activation-3.13e-01

Pos logit contributions
 ne+1.691
 coll+1.612
 One+1.607
 swe+1.582
 cheaper+1.577
Neg logit contributions
ting-1.482
leaf-1.455
unk-1.363
 ear-1.358
orry-1.343
Token
Token ID198
Feature activation-3.39e-01

Pos logit contributions
leaf+0.623
ting+0.620
orry+0.584
unk+0.576
 upper+0.566
Neg logit contributions
 ne-0.318
 One-0.315
 cheaper-0.292
One-0.292
 swe-0.289
Token
Token ID198
Feature activation-6.51e-01

Pos logit contributions
leaf+0.675
ting+0.673
orry+0.631
unk+0.622
 ear+0.617
Neg logit contributions
 ne-0.343
 One-0.338
One-0.320
 cheaper-0.316
 swe-0.313
TokenMom
Token ID29252
Feature activation-2.61e-01

Pos logit contributions
ting+1.177
leaf+1.173
orry+1.088
unk+1.077
 ear+1.050
Neg logit contributions
 ne-0.797
 One-0.787
One-0.772
 cheaper-0.737
 swe-0.732
Tokenmy
Token ID1820
Feature activation-1.05e-01

Pos logit contributions
leaf+0.477
ting+0.471
orry+0.443
unk+0.437
 ear+0.434
Neg logit contributions
 ne-0.308
 One-0.297
 cheaper-0.284
 swe-0.284
 coll-0.281
Token smiled
Token ID13541
Feature activation+1.03e+00

Pos logit contributions
leaf+0.166
ting+0.164
orry+0.153
unk+0.151
 ear+0.150
Neg logit contributions
 ne-0.148
 One-0.143
 cheaper-0.139
 swe-0.139
 coll-0.138
TokenYou
Token ID1639
Feature activation-1.11e+00

Pos logit contributions
leaf+0.307
ting+0.297
orry+0.279
unk+0.275
 ear+0.270
Neg logit contributions
 ne-0.396
 One-0.384
 cheaper-0.375
One-0.374
 swe-0.374
Token spoiled
Token ID34781
Feature activation-1.84e+00

Pos logit contributions
leaf+1.947
ting+1.916
orry+1.764
unk+1.758
 upper+1.758
Neg logit contributions
 ne-1.430
 One-1.387
 cheaper-1.301
One-1.273
 swe-1.273
Token my
Token ID616
Feature activation-8.52e-02

Pos logit contributions
leaf+3.313
orry+3.061
unk+3.009
ting+3.000
 upper+2.946
Neg logit contributions
 swe-2.423
 ne-2.386
 One-2.327
 coll-2.310
 cheaper-2.264
Token game
Token ID983
Feature activation-1.63e+00

Pos logit contributions
leaf+0.118
ting+0.116
orry+0.108
unk+0.106
 ear+0.104
Neg logit contributions
 ne-0.138
 One-0.134
 cheaper-0.131
 swe-0.131
 coll-0.131
Token!"
Token ID2474
Feature activation+6.53e-01

Pos logit contributions
leaf+2.961
ting+2.823
orry+2.652
 upper+2.573
 ear+2.567
Neg logit contributions
 ne-2.103
 One-2.049
 cheaper-2.006
 swe-1.951
 enough-1.934
Token Lily
Token ID20037
Feature activation-5.73e-01

Pos logit contributions
 ne+0.871
 cheaper+0.835
One+0.826
 One+0.826
 coll+0.817
Neg logit contributions
ting-1.066
leaf-1.060
orry-0.989
 ear-0.984
unk-0.979
Token felt
Token ID2936
Feature activation-3.09e-01

Pos logit contributions
leaf+1.064
ting+1.034
orry+0.968
unk+0.959
 ear+0.949
Neg logit contributions
 ne-0.691
 One-0.654
 swe-0.625
 coll-0.617
 cheaper-0.616
Token sad
Token ID6507
Feature activation+1.70e-01

Pos logit contributions
leaf+0.506
ting+0.498
orry+0.468
unk+0.465
 upper+0.454
Neg logit contributions
 ne-0.420
 One-0.409
 cheaper-0.397
 swe-0.394
 coll-0.390
Token and
Token ID290
Feature activation-1.25e-01

Pos logit contributions
 ne+0.235
 One+0.226
 cheaper+0.221
 coll+0.221
 swe+0.220
Neg logit contributions
leaf-0.272
ting-0.272
orry-0.253
unk-0.250
 ear-0.248
Token didn
Token ID1422
Feature activation+4.23e-01

Pos logit contributions
leaf+0.193
ting+0.189
orry+0.177
unk+0.176
 ear+0.171
Neg logit contributions
 ne-0.183
 One-0.177
 cheaper-0.172
 swe-0.172
 coll-0.171
Token't
Token ID470
Feature activation+4.68e-02

Pos logit contributions
 ne+0.691
 coll+0.662
 swe+0.659
 One+0.658
 cheaper+0.658
Neg logit contributions
leaf-0.573
ting-0.565
orry-0.525
unk-0.517
 ear-0.506
Token and
Token ID290
Feature activation-6.15e-01

Pos logit contributions
leaf+3.221
ting+3.054
orry+2.908
unk+2.878
other+2.863
Neg logit contributions
 ne-1.854
 One-1.579
 swe-1.578
 cheaper-1.555
 coll-1.531
Token Tom
Token ID4186
Feature activation-1.05e+00

Pos logit contributions
leaf+0.857
ting+0.818
orry+0.758
unk+0.757
 upper+0.713
Neg logit contributions
 ne-1.034
 One-1.016
 swe-0.984
 coll-0.975
 cheaper-0.974
Token cried
Token ID16896
Feature activation+4.92e-01

Pos logit contributions
leaf+1.851
ting+1.783
unk+1.693
orry+1.669
 ear+1.621
Neg logit contributions
 ne-1.423
 One-1.313
 swe-1.254
 cheaper-1.253
One-1.238
Token and
Token ID290
Feature activation-1.32e+00

Pos logit contributions
 ne+0.703
 cheaper+0.670
 One+0.669
 coll+0.659
 swe+0.655
Neg logit contributions
ting-0.769
leaf-0.754
orry-0.704
 ear-0.704
unk-0.692
Token hugged
Token ID41130
Feature activation-6.74e-02

Pos logit contributions
leaf+2.047
ting+2.028
unk+1.928
orry+1.923
other+1.876
Neg logit contributions
 ne-2.064
 swe-1.887
 One-1.874
 cheaper-1.860
 coll-1.835
Token each
Token ID1123
Feature activation-1.96e+00

Pos logit contributions
leaf+0.098
ting+0.096
orry+0.090
unk+0.089
 ear+0.087
Neg logit contributions
 ne-0.105
 One-0.102
 cheaper-0.100
 swe-0.100
 coll-0.099
Token other
Token ID584
Feature activation+4.70e-01

Pos logit contributions
ting+3.030
leaf+3.016
orry+2.997
unk+2.979
iders+2.799
Neg logit contributions
 One-3.041
One-2.921
 swe-2.834
 coll-2.808
 cheaper-2.735
Token.
Token ID13
Feature activation+5.75e-01

Pos logit contributions
 ne+0.666
 One+0.639
 cheaper+0.628
 coll+0.627
One+0.618
Neg logit contributions
ting-0.729
leaf-0.724
 ear-0.674
orry-0.673
unk-0.664
Token They
Token ID1119
Feature activation-1.93e-01

Pos logit contributions
 ne+0.558
 cheaper+0.507
 coll+0.504
 swe+0.499
 One+0.486
Neg logit contributions
leaf-1.161
ting-1.157
 ear-1.106
orry-1.102
unk-1.098
Token felt
Token ID2936
Feature activation-6.29e-01

Pos logit contributions
leaf+0.311
ting+0.309
orry+0.288
unk+0.287
 ear+0.279
Neg logit contributions
 ne-0.268
 One-0.258
 cheaper-0.251
 swe-0.249
 coll-0.248
Token lonely
Token ID21757
Feature activation-4.18e-02

Pos logit contributions
leaf+0.989
ting+0.970
orry+0.918
unk+0.907
 upper+0.875
Neg logit contributions
 ne-0.915
 One-0.883
 cheaper-0.871
 swe-0.869
 coll-0.842
Token on
Token ID319
Feature activation-1.60e-01

Pos logit contributions
leaf+1.909
ting+1.847
orry+1.771
unk+1.760
 upper+1.706
Neg logit contributions
 ne-1.469
 swe-1.377
 cheaper-1.371
 One-1.355
 coll-1.335
Token her
Token ID607
Feature activation-7.87e-01

Pos logit contributions
leaf+0.258
ting+0.253
orry+0.237
unk+0.234
 ear+0.230
Neg logit contributions
 ne-0.224
 One-0.218
 cheaper-0.213
 swe-0.212
 coll-0.211
Token head
Token ID1182
Feature activation-4.42e-01

Pos logit contributions
leaf+1.134
ting+1.123
orry+1.014
unk+0.982
other+0.957
Neg logit contributions
 ne-1.274
 One-1.230
 swe-1.224
 cheaper-1.208
 coll-1.207
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
leaf+0.768
ting+0.743
orry+0.706
unk+0.692
 ear+0.681
Neg logit contributions
 ne-0.569
 One-0.548
 cheaper-0.537
 swe-0.535
 coll-0.528
Token She
Token ID1375
Feature activation-4.89e-01

Pos logit contributions
leaf+0.540
ting+0.531
orry+0.501
unk+0.497
 upper+0.489
Neg logit contributions
 One-0.306
 ne-0.306
 cheaper-0.287
 swe-0.287
 coll-0.283
Token was
Token ID373
Feature activation-1.98e+00

Pos logit contributions
leaf+0.840
ting+0.837
orry+0.770
unk+0.765
 ear+0.758
Neg logit contributions
 ne-0.646
 One-0.607
 cheaper-0.593
 swe-0.593
 coll-0.583
Token shocked
Token ID11472
Feature activation-6.74e-01

Pos logit contributions
leaf+3.787
ting+3.722
 zone+3.430
other+3.421
rified+3.414
Neg logit contributions
 ne-2.476
 cheaper-2.372
 swe-2.273
 One-2.234
 enough-2.187
Token and
Token ID290
Feature activation-1.51e+00

Pos logit contributions
leaf+1.209
ting+1.148
orry+1.118
unk+1.105
 upper+1.102
Neg logit contributions
 ne-0.844
 One-0.822
 swe-0.793
 cheaper-0.785
One-0.776
Token angry
Token ID7954
Feature activation-4.48e-01

Pos logit contributions
leaf+2.652
ting+2.521
orry+2.427
unk+2.425
other+2.400
Neg logit contributions
 ne-2.134
 swe-1.956
 cheaper-1.911
 One-1.891
 coll-1.819
Token.
Token ID13
Feature activation-8.91e-01

Pos logit contributions
leaf+0.788
ting+0.752
orry+0.720
unk+0.711
 upper+0.707
Neg logit contributions
 ne-0.569
 One-0.555
 swe-0.532
 coll-0.531
 cheaper-0.530
Token She
Token ID1375
Feature activation-7.63e-01

Pos logit contributions
leaf+1.852
ting+1.786
orry+1.687
unk+1.675
 upper+1.670
Neg logit contributions
 One-0.938
 ne-0.848
One-0.807
 swe-0.796
 cheaper-0.787
Token He
Token ID679
Feature activation+7.85e-01

Pos logit contributions
 ne+0.315
 One+0.294
 cheaper+0.294
 coll+0.293
 swe+0.291
Neg logit contributions
leaf-0.534
ting-0.529
orry-0.501
unk-0.498
 ear-0.494
Token says
Token ID1139
Feature activation+1.32e+00

Pos logit contributions
 ne+1.061
 One+1.040
 cheaper+1.034
 coll+1.026
One+1.021
Neg logit contributions
leaf-1.248
ting-1.210
orry-1.148
 ear-1.140
unk-1.126
Token,
Token ID11
Feature activation-3.48e-01

Pos logit contributions
 ne+1.478
 coll+1.472
 cheaper+1.447
One+1.445
 former+1.428
Neg logit contributions
ting-2.363
leaf-2.345
unk-2.292
 ear-2.291
orry-2.286
Token "
Token ID366
Feature activation+8.04e-01

Pos logit contributions
leaf+0.482
ting+0.466
orry+0.438
unk+0.428
 upper+0.419
Neg logit contributions
 ne-0.565
 One-0.555
 cheaper-0.534
 swe-0.533
 coll-0.530
TokenY
Token ID56
Feature activation+6.71e-01

Pos logit contributions
 ne+1.289
 cheaper+1.249
 One+1.240
 swe+1.237
 coll+1.235
Neg logit contributions
leaf-1.121
ting-1.107
 upper-1.012
orry-1.011
unk-0.996
Tokenuck
Token ID1347
Feature activation-1.89e+00

Pos logit contributions
 ne+1.378
 cheaper+1.338
 One+1.332
 swe+1.321
 coll+1.317
Neg logit contributions
leaf-0.620
ting-0.613
orry-0.561
unk-0.547
 upper-0.528
Token!
Token ID0
Feature activation+8.71e-02

Pos logit contributions
leaf+4.016
ting+3.618
 upper+3.610
orry+3.555
unk+3.470
Neg logit contributions
 ne-1.963
 enough-1.947
 One-1.794
 swe-1.788
 coll-1.749
Token This
Token ID770
Feature activation+1.24e+00

Pos logit contributions
 ne+0.146
 One+0.141
 cheaper+0.139
 coll+0.138
 swe+0.138
Neg logit contributions
leaf-0.115
ting-0.113
orry-0.104
unk-0.103
 ear-0.101
Token apple
Token ID17180
Feature activation-2.63e-01

Pos logit contributions
 ne+1.582
 swe+1.452
 coll+1.450
 cheaper+1.444
 One+1.438
Neg logit contributions
ting-2.107
leaf-2.074
unk-2.011
 upper-1.973
 ear-1.973
Token is
Token ID318
Feature activation+2.10e-01

Pos logit contributions
leaf+0.532
ting+0.522
orry+0.495
unk+0.489
 ear+0.485
Neg logit contributions
 ne-0.261
 One-0.247
 cheaper-0.239
 swe-0.237
 coll-0.236
Token bad
Token ID2089
Feature activation-1.77e+00

Pos logit contributions
 ne+0.278
 One+0.270
 coll+0.262
 cheaper+0.261
 swe+0.260
Neg logit contributions
leaf-0.347
ting-0.343
orry-0.322
unk-0.319
 ear-0.316
Token.
Token ID13
Feature activation-7.19e-01

Pos logit contributions
leaf+1.341
ting+1.328
orry+1.252
unk+1.229
 upper+1.209
Neg logit contributions
 ne-1.269
 cheaper-1.240
 swe-1.226
 One-1.223
 coll-1.190
Token But
Token ID887
Feature activation-3.86e-01

Pos logit contributions
leaf+1.065
ting+1.052
orry+0.971
unk+0.971
 upper+0.938
Neg logit contributions
 One-1.113
 ne-1.068
One-1.036
 cheaper-1.028
 swe-1.022
Token you
Token ID345
Feature activation-1.34e-01

Pos logit contributions
leaf+0.592
ting+0.575
unk+0.538
orry+0.537
 upper+0.523
Neg logit contributions
 ne-0.574
 One-0.556
 cheaper-0.546
 swe-0.543
 coll-0.535
Token should
Token ID815
Feature activation+3.85e-01

Pos logit contributions
leaf+0.234
ting+0.232
orry+0.217
unk+0.215
 ear+0.213
Neg logit contributions
 ne-0.168
 One-0.163
 cheaper-0.158
 swe-0.155
 coll-0.155
Token also
Token ID635
Feature activation+8.32e-02

Pos logit contributions
 ne+0.511
 One+0.485
 coll+0.482
 cheaper+0.482
 swe+0.478
Neg logit contributions
leaf-0.636
ting-0.624
orry-0.592
 ear-0.588
unk-0.582
Token respect
Token ID2461
Feature activation-1.89e+00

Pos logit contributions
 ne+0.111
 One+0.108
 cheaper+0.105
 coll+0.105
 swe+0.104
Neg logit contributions
leaf-0.136
ting-0.134
orry-0.126
unk-0.125
 ear-0.124
Token each
Token ID1123
Feature activation-2.54e+00

Pos logit contributions
leaf+3.521
ting+3.366
orry+3.254
unk+3.239
rified+3.129
Neg logit contributions
 One-2.293
 cheaper-2.242
 swe-2.228
 coll-2.224
One-2.188
Token other
Token ID584
Feature activation-1.05e+00

Pos logit contributions
leaf+4.281
idget+4.088
orry+4.083
ting+4.059
unk+4.000
Neg logit contributions
 One-3.614
One-3.585
 coll-3.426
 someday-3.302
 cheaper-3.294
Token's
Token ID338
Feature activation-1.12e+00

Pos logit contributions
leaf+1.915
ting+1.780
orry+1.741
unk+1.719
 ear+1.666
Neg logit contributions
 ne-1.300
 One-1.270
 swe-1.256
 cheaper-1.250
 coll-1.214
Token privacy
Token ID6782
Feature activation-8.89e-01

Pos logit contributions
leaf+1.939
ting+1.919
orry+1.776
unk+1.762
 upper+1.695
Neg logit contributions
 ne-1.445
 coll-1.424
 One-1.421
 swe-1.406
 cheaper-1.393
Token.
Token ID13
Feature activation-2.93e-01

Pos logit contributions
leaf+1.619
ting+1.545
orry+1.493
unk+1.474
 upper+1.437
Neg logit contributions
 ne-1.072
 One-1.064
 cheaper-1.035
 swe-1.013
One-1.008
Token off
Token ID572
Feature activation+2.98e-01

Pos logit contributions
leaf+0.168
ting+0.163
orry+0.151
unk+0.148
 upper+0.144
Neg logit contributions
 ne-0.250
 One-0.243
 cheaper-0.238
 swe-0.238
 coll-0.236
Token the
Token ID262
Feature activation-1.79e+00

Pos logit contributions
 ne+0.424
 One+0.413
 cheaper+0.407
 coll+0.405
 swe+0.403
Neg logit contributions
leaf-0.462
ting-0.461
orry-0.423
 ear-0.421
unk-0.419
Token bed
Token ID3996
Feature activation-1.15e-01

Pos logit contributions
leaf+2.900
ting+2.846
unk+2.737
orry+2.723
asing+2.666
Neg logit contributions
 ne-2.686
 swe-2.638
 saf-2.530
 One-2.522
 coll-2.459
Token,
Token ID11
Feature activation-8.40e-01

Pos logit contributions
leaf+0.206
ting+0.202
orry+0.191
unk+0.189
 ear+0.186
Neg logit contributions
 ne-0.140
 One-0.134
 cheaper-0.130
 coll-0.130
 swe-0.129
Token pushed
Token ID7121
Feature activation-5.76e-01

Pos logit contributions
leaf+1.440
ting+1.389
orry+1.360
unk+1.349
 upper+1.306
Neg logit contributions
 ne-1.133
 One-1.050
 swe-1.045
 cheaper-1.020
 coll-1.018
Token the
Token ID262
Feature activation-2.32e+00

Pos logit contributions
leaf+0.968
ting+0.929
orry+0.892
unk+0.890
 upper+0.866
Neg logit contributions
 ne-0.786
 One-0.756
 swe-0.732
 coll-0.720
 cheaper-0.720
Token shiny
Token ID22441
Feature activation-1.54e+00

Pos logit contributions
leaf+3.818
ting+3.709
orry+3.632
udge+3.552
unk+3.526
Neg logit contributions
 swe-3.423
 ne-3.405
 coll-3.241
 cheaper-3.184
 scar-3.125
Token knob
Token ID36842
Feature activation+3.34e-01

Pos logit contributions
leaf+2.298
ting+2.203
orry+2.124
unk+2.055
asing+1.970
Neg logit contributions
 ne-2.399
 swe-2.340
 coll-2.290
 cheaper-2.288
 One-2.276
Token again
Token ID757
Feature activation+1.43e-01

Pos logit contributions
 ne+0.541
 One+0.527
 cheaper+0.518
 coll+0.517
 swe+0.514
Neg logit contributions
ting-0.449
leaf-0.446
orry-0.407
 ear-0.404
unk-0.401
Token,
Token ID11
Feature activation-5.74e-01

Pos logit contributions
 ne+0.179
 One+0.172
 cheaper+0.168
 coll+0.167
 swe+0.167
Neg logit contributions
leaf-0.247
ting-0.246
orry-0.230
unk-0.228
 ear-0.227
Token and
Token ID290
Feature activation-4.05e-01

Pos logit contributions
leaf+0.996
ting+0.961
orry+0.934
unk+0.927
 upper+0.897
Neg logit contributions
 ne-0.744
 One-0.698
 swe-0.688
 cheaper-0.683
 coll-0.680