TOP ACTIVATIONS
MAX = 3.693

little bird named Twe ety . Twe ety loved to fly
small bird named Twe ety . Twe ety loved to fly
bird ies to help . Together , they flew to the
little bird named Twe ety . Twe ety loved to fly
little bird named Twe ety . Twe ety loved to dance
bird ie named Twe ety . Twe ety lived in a
little bird named Twe ety . Twe ety was very flexible
they made . They had ten pairs of ac orns .
yellow bird named Twe ety . Twe ety loved to sing
little bird named Twe ety . Twe ety loved to whistle
little bird named Twe ety . Twe ety wanted to fly
little bird named Twe ety . Twe ety had an unusual
bird ie named Twe ety . Twe ety loved to fly
and were very happy . When they returned home , Twe
small bird named Twe ety . Twe ety lived in a
little bird named Twe ety . Twe ety lived in a
in the dirt . One day , Gold heard
little bird named Twe ety . Twe ety was very smart
. But when Twe ety and the owl
little bird named Twe ety . He had an orange be

MIN ACTIVATIONS
MIN = -3.698

so big ! She touched the screen and it lit up
over and accidentally stepped on the cricket . The cricket started
through the small hole in the screen . The butterfly was
she ran and showed it to her mom . Mom was
was surprised . He touched the screen and it made such
and pushed the button on the television . The screen lit
with a long cable . The cable was harmless , so
Tim stood up and pushed the button on the television .
closer look . Suddenly , the screen was making loud noises
Sara sees a light on the screen . The light is
Billy slowly approached the computer again , he slowly
. She peek ed around the corner and saw a pretty
M andy showed her hoop to her father .
step to get closer to the bug . He wanted to
to fit under the table the wires started to spark .
. Sam pushed the ball towards the shadow ,
frustrated . The buttons on the television were different and she
. He touched the keyboard , expecting the computer
trick ! She stood on the stage , holding a shiny
John nodded . He grabbed the cricket and put it carefully

INTERVAL 1.845 to 3.693
CONTAINS 0.532%

a really high and long climb up the mountains ide .
bird recovered . Soon the time had come for
girl and she liked going on adventures . One day ,
about this ! So Bobby started packing his bag
with soap and water . It shines again .

INTERVAL -0.003 to 1.845
CONTAINS 49.463%

happy and healthy . The lab had so much
sad and wanted to find it . Max went on a
house . He had lots of fun and thoroughly enjoyed himself
all about their adventure . She smiled and hugged them .
. We will be good twins ," they say .

INTERVAL -1.851 to -0.003
CONTAINS 49.594%

the stretching was done , everyone shared lunch together . Max
to hop around too ! Together , Sammy and
aps her hands . They run after the man
She said , " No , it is mine . I
to Emma . With a smile on her

INTERVAL -3.698 to -1.851
CONTAINS 0.412%

It crawled onto Tim my 's hand and they became friends
had fun together . The enormous insect was never alone again
't wait to show it to her mum .
He held his toy with both hands and showed it to
So he ran to his house and grabbed a

PER-TOKEN DETAILS (TOP ACTIVATIONS)

Token little
Token ID1310
Feature activation+1.10e-01

Pos logit contributions
leaf+0.204
ting+0.202
orry+0.185
unk+0.183
 ear+0.179
Neg logit contributions
 ne-0.246
 One-0.238
 cheaper-0.234
 coll-0.233
 swe-0.233
Token bird
Token ID6512
Feature activation+4.02e-02

Pos logit contributions
 ne+0.153
 One+0.149
 cheaper+0.146
 swe+0.144
 coll+0.144
Neg logit contributions
leaf-0.173
ting-0.171
orry-0.160
unk-0.158
 ear-0.156
Token named
Token ID3706
Feature activation+2.91e-01

Pos logit contributions
 ne+0.063
 One+0.061
 cheaper+0.060
 coll+0.060
 swe+0.060
Neg logit contributions
leaf-0.057
ting-0.056
orry-0.052
unk-0.051
 ear-0.051
Token Twe
Token ID24205
Feature activation+1.07e+00

Pos logit contributions
 ne+0.294
 One+0.274
 cheaper+0.273
 coll+0.271
 swe+0.270
Neg logit contributions
leaf-0.578
ting-0.575
orry-0.544
unk-0.540
 ear-0.537
Tokenety
Token ID2963
Feature activation+1.57e-01

Pos logit contributions
 ne+1.033
 One+0.946
 swe+0.944
 cheaper+0.934
 coll+0.882
Neg logit contributions
ting-2.189
leaf-2.189
orry-2.105
unk-2.102
 upper-2.051
Token.
Token ID13
Feature activation+3.69e+00

Pos logit contributions
 ne+0.218
 One+0.210
 cheaper+0.207
 swe+0.205
 coll+0.205
Neg logit contributions
leaf-0.250
ting-0.248
orry-0.232
unk-0.229
 ear-0.227
Token Twe
Token ID24205
Feature activation+1.56e+00

Pos logit contributions
 coll+4.313
 cheaper+4.125
 swe+3.986
 scar+3.963
 problems+3.928
Neg logit contributions
ting-7.873
 ear-7.221
 Stop-6.483
leaf-6.424
rified-6.357
Tokenety
Token ID2963
Feature activation+1.11e+00

Pos logit contributions
 ne+1.657
 swe+1.529
 cheaper+1.522
 One+1.496
 saf+1.436
Neg logit contributions
ting-3.072
leaf-3.020
unk-2.966
orry-2.961
other-2.895
Token loved
Token ID6151
Feature activation+3.96e-01

Pos logit contributions
 cheaper+2.099
 ne+2.078
 coll+2.068
 swe+2.061
 One+2.019
Neg logit contributions
ting-1.145
leaf-1.136
orry-1.077
unk-1.049
 ear-1.043
Token to
Token ID284
Feature activation+9.86e-01

Pos logit contributions
 ne+0.535
 One+0.518
 cheaper+0.510
 swe+0.498
 coll+0.497
Neg logit contributions
leaf-0.650
ting-0.642
orry-0.591
 ear-0.589
unk-0.589
Token fly
Token ID6129
Feature activation+1.10e+00

Pos logit contributions
 cheaper+1.359
 ne+1.345
 swe+1.291
 One+1.284
 coll+1.269
Neg logit contributions
leaf-1.566
ting-1.517
 ear-1.479
 upper-1.430
orry-1.425
Token small
Token ID1402
Feature activation+3.56e-01

Pos logit contributions
leaf+0.307
ting+0.302
orry+0.277
unk+0.274
 ear+0.268
Neg logit contributions
 ne-0.362
 One-0.350
 cheaper-0.343
 coll-0.342
 swe-0.342
Token bird
Token ID6512
Feature activation-5.25e-02

Pos logit contributions
 ne+0.539
 One+0.524
 cheaper+0.516
 swe+0.511
One+0.509
Neg logit contributions
leaf-0.515
ting-0.512
orry-0.475
unk-0.472
 ear-0.471
Token named
Token ID3706
Feature activation+3.63e-01

Pos logit contributions
leaf+0.075
ting+0.074
orry+0.069
unk+0.068
 ear+0.067
Neg logit contributions
 ne-0.082
 One-0.079
 cheaper-0.077
 coll-0.077
 swe-0.077
Token Twe
Token ID24205
Feature activation+1.05e+00

Pos logit contributions
 ne+0.398
 cheaper+0.373
 One+0.372
 coll+0.369
 swe+0.368
Neg logit contributions
leaf-0.688
ting-0.684
orry-0.644
unk-0.640
 ear-0.635
Tokenety
Token ID2963
Feature activation+7.13e-02

Pos logit contributions
 ne+1.008
 One+0.928
 swe+0.904
 cheaper+0.903
 saf+0.855
Neg logit contributions
ting-2.154
leaf-2.153
orry-2.082
unk-2.068
 upper-2.019
Token.
Token ID13
Feature activation+3.69e+00

Pos logit contributions
 ne+0.097
 One+0.094
 cheaper+0.092
 swe+0.091
 coll+0.091
Neg logit contributions
leaf-0.115
ting-0.114
orry-0.107
unk-0.106
 ear-0.105
Token Twe
Token ID24205
Feature activation+1.46e+00

Pos logit contributions
 coll+4.289
 cheaper+4.077
 former+4.038
 scar+3.969
 problems+3.961
Neg logit contributions
ting-7.705
 ear-7.151
leaf-6.476
 Stop-6.424
other-6.300
Tokenety
Token ID2963
Feature activation+1.02e+00

Pos logit contributions
 ne+1.521
 cheaper+1.386
 One+1.384
 swe+1.378
 saf+1.323
Neg logit contributions
ting-2.879
leaf-2.849
orry-2.807
unk-2.784
other-2.710
Token loved
Token ID6151
Feature activation+3.67e-01

Pos logit contributions
 cheaper+1.913
 ne+1.902
 coll+1.882
 swe+1.878
 One+1.851
Neg logit contributions
ting-1.051
leaf-1.050
orry-0.987
unk-0.963
 ear-0.952
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
 ne+0.494
 One+0.479
 cheaper+0.471
 swe+0.461
 coll+0.459
Neg logit contributions
leaf-0.604
ting-0.597
orry-0.550
unk-0.547
 ear-0.547
Token fly
Token ID6129
Feature activation+1.13e+00

Pos logit contributions
 cheaper+1.542
 ne+1.509
 swe+1.457
 One+1.442
 coll+1.428
Neg logit contributions
leaf-1.754
ting-1.699
 ear-1.665
 upper-1.603
orry-1.590
Token bird
Token ID6512
Feature activation+1.11e+00

Pos logit contributions
 ne+1.777
 One+1.729
 coll+1.676
One+1.673
 cheaper+1.668
Neg logit contributions
ting-1.355
leaf-1.352
 ear-1.228
unk-1.178
orry-1.162
Tokenies
Token ID444
Feature activation-3.17e-01

Pos logit contributions
 ne+1.591
 cheaper+1.547
 One+1.542
 coll+1.516
One+1.507
Neg logit contributions
ting-1.716
leaf-1.645
unk-1.578
 ear-1.564
orry-1.561
Token to
Token ID284
Feature activation-2.38e-01

Pos logit contributions
leaf+0.535
ting+0.520
orry+0.498
unk+0.489
 upper+0.484
Neg logit contributions
 ne-0.416
 One-0.402
 swe-0.389
 cheaper-0.388
 coll-0.386
Token help
Token ID1037
Feature activation+4.88e-01

Pos logit contributions
leaf+0.352
ting+0.346
orry+0.324
unk+0.321
 upper+0.311
Neg logit contributions
 ne-0.364
 One-0.354
 cheaper-0.343
 swe-0.342
 coll-0.342
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
 ne+0.694
 One+0.667
 cheaper+0.660
 coll+0.652
 swe+0.649
Neg logit contributions
ting-0.768
leaf-0.754
orry-0.692
 ear-0.691
unk-0.686
Token Together
Token ID17083
Feature activation+3.67e+00

Pos logit contributions
 ne+2.641
 coll+2.501
 swe+2.494
 cheaper+2.471
One+2.420
Neg logit contributions
ting-3.130
leaf-2.968
 ear-2.754
 Stop-2.671
unk-2.653
Token,
Token ID11
Feature activation+2.41e+00

Pos logit contributions
 former+6.318
 shin+6.038
One+5.947
 coll+5.924
 cheaper+5.870
Neg logit contributions
ting-5.788
 ear-4.728
 behind-4.518
 hand-4.467
 around-4.445
Token they
Token ID484
Feature activation+1.98e-01

Pos logit contributions
 ne+2.657
One+2.654
 coll+2.612
 One+2.592
 swe+2.546
Neg logit contributions
ting-4.997
leaf-4.495
 ear-4.007
 expecting-4.000
other-3.925
Token flew
Token ID13112
Feature activation+1.00e+00

Pos logit contributions
 ne+0.256
 One+0.250
 cheaper+0.244
 coll+0.242
 swe+0.241
Neg logit contributions
leaf-0.333
ting-0.331
orry-0.309
unk-0.305
 ear-0.302
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
 ne+1.868
 One+1.843
 cheaper+1.838
 coll+1.821
One+1.815
Neg logit contributions
ting-1.128
leaf-1.084
 ear-0.961
unk-0.938
orry-0.932
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
 ne+2.008
 One+1.896
 swe+1.887
 cheaper+1.886
One+1.871
Neg logit contributions
leaf-1.854
ting-1.846
 ear-1.681
other-1.598
orry-1.597
Token little
Token ID1310
Feature activation+1.06e-02

Pos logit contributions
leaf+0.322
ting+0.318
orry+0.292
unk+0.288
 ear+0.282
Neg logit contributions
 ne-0.377
 One-0.364
 cheaper-0.357
 coll-0.356
 swe-0.356
Token bird
Token ID6512
Feature activation-5.01e-02

Pos logit contributions
 ne+0.015
 One+0.014
 cheaper+0.014
 coll+0.014
 swe+0.014
Neg logit contributions
leaf-0.017
ting-0.017
orry-0.016
unk-0.015
 ear-0.015
Token named
Token ID3706
Feature activation+3.70e-01

Pos logit contributions
leaf+0.073
ting+0.072
orry+0.067
unk+0.066
 ear+0.065
Neg logit contributions
 ne-0.077
 One-0.075
 cheaper-0.073
 coll-0.073
 swe-0.072
Token Twe
Token ID24205
Feature activation+9.49e-01

Pos logit contributions
 ne+0.367
 cheaper+0.342
 One+0.340
 coll+0.338
 swe+0.337
Neg logit contributions
leaf-0.741
ting-0.737
orry-0.698
unk-0.694
 ear-0.689
Tokenety
Token ID2963
Feature activation-2.55e-04

Pos logit contributions
 ne+0.890
 One+0.819
 swe+0.806
 cheaper+0.801
 coll+0.758
Neg logit contributions
ting-1.950
leaf-1.947
unk-1.877
orry-1.876
 upper-1.836
Token.
Token ID13
Feature activation+3.63e+00

Pos logit contributions
leaf+0.000
ting+0.000
orry+0.000
unk+0.000
 ear+0.000
Neg logit contributions
 ne-0.000
 One-0.000
 cheaper-0.000
 coll-0.000
 swe-0.000
Token Twe
Token ID24205
Feature activation+1.36e+00

Pos logit contributions
 coll+4.199
 cheaper+4.007
 problems+3.888
 scar+3.827
 swe+3.812
Neg logit contributions
ting-7.782
 ear-7.086
 Stop-6.405
leaf-6.336
ada-6.312
Tokenety
Token ID2963
Feature activation+9.81e-01

Pos logit contributions
 ne+1.395
 swe+1.277
 cheaper+1.276
 One+1.267
 coll+1.203
Neg logit contributions
ting-2.700
leaf-2.666
unk-2.617
orry-2.608
other-2.537
Token loved
Token ID6151
Feature activation+2.96e-01

Pos logit contributions
 cheaper+1.810
 ne+1.797
 coll+1.781
 swe+1.776
 One+1.747
Neg logit contributions
ting-1.054
leaf-1.050
orry-0.991
unk-0.968
 ear-0.960
Token to
Token ID284
Feature activation+1.02e+00

Pos logit contributions
 ne+0.395
 One+0.382
 cheaper+0.376
 swe+0.368
 coll+0.368
Neg logit contributions
leaf-0.490
ting-0.484
orry-0.448
unk-0.445
 ear-0.444
Token fly
Token ID6129
Feature activation+1.07e+00

Pos logit contributions
 cheaper+1.393
 ne+1.374
 swe+1.319
 One+1.308
 coll+1.292
Neg logit contributions
leaf-1.631
ting-1.583
 ear-1.542
 upper-1.489
orry-1.488
Token little
Token ID1310
Feature activation+9.64e-02

Pos logit contributions
leaf+0.207
ting+0.205
orry+0.188
unk+0.186
 ear+0.182
Neg logit contributions
 ne-0.242
 One-0.234
 cheaper-0.229
 coll-0.228
 swe-0.228
Token bird
Token ID6512
Feature activation-1.07e-01

Pos logit contributions
 ne+0.136
 One+0.132
 cheaper+0.129
 swe+0.128
 coll+0.128
Neg logit contributions
leaf-0.152
ting-0.150
orry-0.140
unk-0.138
 ear-0.137
Token named
Token ID3706
Feature activation+6.29e-01

Pos logit contributions
leaf+0.156
ting+0.153
orry+0.142
unk+0.140
 ear+0.137
Neg logit contributions
 ne-0.166
 One-0.161
 cheaper-0.157
 coll-0.156
 swe-0.156
Token Twe
Token ID24205
Feature activation+1.09e+00

Pos logit contributions
 ne+0.635
 cheaper+0.594
 coll+0.589
 swe+0.584
 One+0.576
Neg logit contributions
leaf-1.238
ting-1.237
orry-1.170
unk-1.163
 ear-1.162
Tokenety
Token ID2963
Feature activation-9.29e-02

Pos logit contributions
 ne+1.054
 One+0.964
 swe+0.948
 cheaper+0.944
 saf+0.889
Neg logit contributions
ting-2.217
leaf-2.213
orry-2.131
unk-2.129
 upper-2.077
Token.
Token ID13
Feature activation+3.62e+00

Pos logit contributions
leaf+0.153
ting+0.151
orry+0.141
unk+0.140
 ear+0.138
Neg logit contributions
 ne-0.125
 One-0.121
 cheaper-0.117
 coll-0.117
 swe-0.116
Token Twe
Token ID24205
Feature activation+1.55e+00

Pos logit contributions
 coll+4.189
 cheaper+4.121
 swe+3.915
 problems+3.855
 former+3.825
Neg logit contributions
ting-7.637
 ear-7.079
 Stop-6.329
leaf-6.266
ges-6.199
Tokenety
Token ID2963
Feature activation+9.58e-01

Pos logit contributions
 ne+1.641
 cheaper+1.502
 swe+1.487
 One+1.482
 saf+1.419
Neg logit contributions
ting-3.036
leaf-2.980
orry-2.929
unk-2.929
other-2.855
Token loved
Token ID6151
Feature activation+3.07e-01

Pos logit contributions
 cheaper+1.759
 ne+1.754
 coll+1.733
 swe+1.732
 One+1.701
Neg logit contributions
ting-1.034
leaf-1.030
orry-0.969
unk-0.949
 ear-0.943
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
 ne+0.411
 One+0.398
 cheaper+0.391
 swe+0.383
 coll+0.382
Neg logit contributions
leaf-0.509
ting-0.502
orry-0.465
unk-0.463
 ear-0.461
Token dance
Token ID9280
Feature activation-2.02e-03

Pos logit contributions
 cheaper+1.372
 ne+1.353
 swe+1.300
 One+1.289
 coll+1.275
Neg logit contributions
leaf-1.616
ting-1.569
 ear-1.532
 upper-1.480
orry-1.474
Token bird
Token ID6512
Feature activation+1.13e-01

Pos logit contributions
 ne+0.080
 One+0.078
 cheaper+0.076
 coll+0.076
 swe+0.076
Neg logit contributions
leaf-0.090
ting-0.089
orry-0.083
unk-0.082
 ear-0.081
Tokenie
Token ID494
Feature activation-2.26e-01

Pos logit contributions
 ne+0.179
 One+0.174
 cheaper+0.172
 swe+0.170
 coll+0.170
Neg logit contributions
leaf-0.158
ting-0.156
orry-0.144
unk-0.143
 ear-0.141
Token named
Token ID3706
Feature activation+5.48e-01

Pos logit contributions
leaf+0.344
ting+0.336
orry+0.313
unk+0.308
 upper+0.303
Neg logit contributions
 ne-0.335
 One-0.326
 coll-0.315
 cheaper-0.314
One-0.313
Token Twe
Token ID24205
Feature activation+1.09e+00

Pos logit contributions
 ne+0.562
 cheaper+0.528
 coll+0.525
 One+0.517
 swe+0.514
Neg logit contributions
ting-1.070
leaf-1.066
orry-1.005
 ear-1.003
unk-1.001
Tokenety
Token ID2963
Feature activation+1.93e-01

Pos logit contributions
 ne+1.048
 One+0.967
 cheaper+0.954
 swe+0.928
 saf+0.904
Neg logit contributions
ting-2.224
leaf-2.191
unk-2.111
orry-2.110
 upper-2.062
Token.
Token ID13
Feature activation+3.62e+00

Pos logit contributions
 ne+0.268
 One+0.258
 cheaper+0.256
 swe+0.253
 coll+0.252
Neg logit contributions
leaf-0.306
ting-0.305
orry-0.284
unk-0.282
 ear-0.279
Token Twe
Token ID24205
Feature activation+1.41e+00

Pos logit contributions
 coll+4.278
 cheaper+3.996
 problems+3.836
 swe+3.817
 scar+3.813
Neg logit contributions
ting-7.592
 ear-7.068
leaf-6.312
 Stop-6.304
ges-6.175
Tokenety
Token ID2963
Feature activation+1.19e+00

Pos logit contributions
 ne+1.462
 cheaper+1.348
 One+1.333
 swe+1.308
 saf+1.282
Neg logit contributions
ting-2.809
leaf-2.742
orry-2.691
unk-2.688
other-2.623
Token lived
Token ID5615
Feature activation-2.52e-01

Pos logit contributions
 cheaper+2.249
 ne+2.218
 coll+2.212
 swe+2.204
 One+2.154
Neg logit contributions
ting-1.234
leaf-1.219
orry-1.158
unk-1.127
 ear-1.126
Token in
Token ID287
Feature activation+1.28e+00

Pos logit contributions
leaf+0.284
ting+0.276
orry+0.253
unk+0.244
 upper+0.240
Neg logit contributions
 ne-0.475
 One-0.460
 cheaper-0.450
One-0.449
 swe-0.449
Token a
Token ID257
Feature activation+9.84e-01

Pos logit contributions
 ne+1.860
 cheaper+1.836
 swe+1.816
 coll+1.793
 One+1.732
Neg logit contributions
leaf-1.923
ting-1.916
 ear-1.822
orry-1.734
unk-1.728
Token little
Token ID1310
Feature activation+1.25e-01

Pos logit contributions
leaf+0.124
ting+0.122
orry+0.112
unk+0.111
 ear+0.109
Neg logit contributions
 ne-0.146
 One-0.142
 cheaper-0.139
 coll-0.138
 swe-0.138
Token bird
Token ID6512
Feature activation+9.04e-02

Pos logit contributions
 ne+0.175
 One+0.170
 cheaper+0.166
 coll+0.165
 swe+0.165
Neg logit contributions
leaf-0.196
ting-0.194
orry-0.181
unk-0.179
 ear-0.178
Token named
Token ID3706
Feature activation+4.58e-01

Pos logit contributions
 ne+0.140
 One+0.136
 cheaper+0.133
 swe+0.132
 coll+0.132
Neg logit contributions
leaf-0.130
ting-0.128
orry-0.119
unk-0.118
 ear-0.116
Token Twe
Token ID24205
Feature activation+1.31e+00

Pos logit contributions
 ne+0.484
 cheaper+0.453
 coll+0.450
 swe+0.446
 One+0.446
Neg logit contributions
leaf-0.885
ting-0.882
orry-0.833
unk-0.828
 ear-0.825
Tokenety
Token ID2963
Feature activation+5.35e-02

Pos logit contributions
 ne+1.300
 swe+1.196
 One+1.196
 cheaper+1.180
 coll+1.112
Neg logit contributions
ting-2.625
leaf-2.601
orry-2.522
unk-2.521
 upper-2.463
Token.
Token ID13
Feature activation+3.61e+00

Pos logit contributions
 ne+0.073
 One+0.071
 cheaper+0.069
 swe+0.069
 coll+0.069
Neg logit contributions
leaf-0.086
ting-0.086
orry-0.080
unk-0.079
 ear-0.078
Token Twe
Token ID24205
Feature activation+1.67e+00

Pos logit contributions
 coll+4.086
 cheaper+3.912
 former+3.763
 swe+3.720
 problems+3.708
Neg logit contributions
ting-7.800
 ear-7.091
 Stop-6.433
leaf-6.389
rified-6.329
Tokenety
Token ID2963
Feature activation+1.08e+00

Pos logit contributions
 ne+1.784
 swe+1.662
 cheaper+1.651
 One+1.626
 coll+1.556
Neg logit contributions
ting-3.259
leaf-3.183
orry-3.145
unk-3.141
other-3.069
Token was
Token ID373
Feature activation+9.89e-01

Pos logit contributions
 cheaper+2.064
 ne+2.046
 coll+2.033
 swe+2.025
 One+1.988
Neg logit contributions
ting-1.099
leaf-1.094
orry-1.033
unk-1.004
 ear-0.999
Token very
Token ID845
Feature activation-3.52e-02

Pos logit contributions
 ne+1.507
 cheaper+1.476
 swe+1.465
 coll+1.457
 One+1.452
Neg logit contributions
ting-1.409
leaf-1.406
 ear-1.340
unk-1.335
orry-1.300
Token flexible
Token ID12846
Feature activation-5.28e-01

Pos logit contributions
leaf+0.056
ting+0.055
orry+0.052
unk+0.051
 ear+0.050
Neg logit contributions
 ne-0.049
 One-0.048
 cheaper-0.047
 swe-0.046
 coll-0.046
Token they
Token ID484
Feature activation+2.20e-01

Pos logit contributions
 ne+0.193
 One+0.187
 cheaper+0.181
 coll+0.181
 swe+0.181
Neg logit contributions
leaf-0.282
ting-0.280
orry-0.263
unk-0.260
 ear-0.257
Token made
Token ID925
Feature activation+1.04e+00

Pos logit contributions
 ne+0.309
 One+0.302
 cheaper+0.295
 coll+0.295
 swe+0.294
Neg logit contributions
leaf-0.344
ting-0.339
orry-0.317
unk-0.312
 ear-0.309
Token.
Token ID13
Feature activation+9.34e-01

Pos logit contributions
 ne+1.529
 One+1.510
One+1.476
 cheaper+1.446
 swe+1.431
Neg logit contributions
ting-1.542
leaf-1.541
unk-1.414
orry-1.404
 ear-1.385
Token They
Token ID1119
Feature activation+1.82e-01

Pos logit contributions
 ne+1.079
 cheaper+0.985
 swe+0.984
 coll+0.979
 saf+0.926
Neg logit contributions
leaf-1.712
ting-1.708
orry-1.637
unk-1.614
 ear-1.613
Token had
Token ID550
Feature activation+1.81e+00

Pos logit contributions
 ne+0.258
 One+0.251
 cheaper+0.246
 coll+0.245
 swe+0.245
Neg logit contributions
leaf-0.285
ting-0.280
orry-0.263
unk-0.258
 ear-0.257
Token ten
Token ID3478
Feature activation+3.61e+00

Pos logit contributions
 ne+2.569
 One+2.528
 coll+2.461
One+2.433
 swe+2.432
Neg logit contributions
leaf-2.726
ting-2.583
 ear-2.503
orry-2.485
 upper-2.445
Token pairs
Token ID14729
Feature activation-2.79e-02

Pos logit contributions
 former+4.965
 someday+4.923
oral+4.805
One+4.724
 One+4.721
Neg logit contributions
ting-6.238
 ear-5.615
leaf-5.509
ges-5.336
 hand-5.332
Token of
Token ID286
Feature activation+1.83e+00

Pos logit contributions
leaf+0.034
ting+0.034
orry+0.031
unk+0.031
 ear+0.030
Neg logit contributions
 ne-0.049
 One-0.047
 cheaper-0.047
 coll-0.046
 swe-0.046
Token ac
Token ID936
Feature activation+1.27e+00

Pos logit contributions
 ne+2.332
One+2.251
 One+2.250
 cheaper+2.217
 someday+2.210
Neg logit contributions
leaf-3.048
ting-2.968
 ear-2.827
other-2.630
orry-2.624
Tokenorns
Token ID19942
Feature activation+3.03e-01

Pos logit contributions
 One+1.595
 ne+1.586
 swe+1.539
 cheaper+1.528
 coll+1.477
Neg logit contributions
ting-2.257
leaf-2.185
unk-2.018
orry-2.008
other-1.959
Token.
Token ID13
Feature activation+7.46e-01

Pos logit contributions
 ne+0.421
 One+0.411
 cheaper+0.398
 swe+0.398
 coll+0.396
Neg logit contributions
leaf-0.480
ting-0.479
orry-0.442
unk-0.440
 ear-0.431
Token yellow
Token ID7872
Feature activation-1.88e-01

Pos logit contributions
leaf+0.029
ting+0.029
orry+0.027
unk+0.026
 ear+0.026
Neg logit contributions
 ne-0.036
 One-0.035
 cheaper-0.034
 coll-0.034
 swe-0.034
Token bird
Token ID6512
Feature activation+3.39e-02

Pos logit contributions
leaf+0.313
ting+0.308
orry+0.289
unk+0.285
 upper+0.280
Neg logit contributions
 ne-0.251
 One-0.243
 coll-0.236
 cheaper-0.235
 swe-0.235
Token named
Token ID3706
Feature activation+4.26e-01

Pos logit contributions
 ne+0.053
 One+0.051
 cheaper+0.050
 coll+0.050
 swe+0.050
Neg logit contributions
leaf-0.048
ting-0.048
orry-0.044
unk-0.044
 ear-0.043
Token Twe
Token ID24205
Feature activation+9.52e-01

Pos logit contributions
 ne+0.418
 cheaper+0.390
 One+0.384
 coll+0.383
 swe+0.383
Neg logit contributions
leaf-0.856
ting-0.854
orry-0.806
unk-0.800
 ear-0.799
Tokenety
Token ID2963
Feature activation-1.26e-01

Pos logit contributions
 ne+0.892
 One+0.823
 cheaper+0.807
 swe+0.806
 saf+0.758
Neg logit contributions
ting-1.967
leaf-1.956
orry-1.880
unk-1.879
 upper-1.836
Token.
Token ID13
Feature activation+3.60e+00

Pos logit contributions
leaf+0.209
ting+0.205
orry+0.192
unk+0.190
 ear+0.187
Neg logit contributions
 ne-0.168
 One-0.163
 cheaper-0.158
 coll-0.157
 swe-0.157
Token Twe
Token ID24205
Feature activation+1.33e+00

Pos logit contributions
 coll+4.076
 cheaper+3.995
 problems+3.863
 former+3.795
 swe+3.743
Neg logit contributions
ting-7.803
 ear-7.110
leaf-6.397
 Stop-6.364
ges-6.294
Tokenety
Token ID2963
Feature activation+8.89e-01

Pos logit contributions
 ne+1.354
 cheaper+1.243
 swe+1.241
 One+1.238
 saf+1.176
Neg logit contributions
ting-2.664
leaf-2.629
unk-2.568
orry-2.568
other-2.489
Token loved
Token ID6151
Feature activation+9.15e-02

Pos logit contributions
 ne+1.646
 cheaper+1.646
 swe+1.623
 coll+1.620
 One+1.602
Neg logit contributions
leaf-0.945
ting-0.943
orry-0.879
unk-0.859
 ear-0.852
Token to
Token ID284
Feature activation+7.99e-01

Pos logit contributions
 ne+0.119
 One+0.115
 cheaper+0.113
 swe+0.111
 coll+0.111
Neg logit contributions
leaf-0.154
ting-0.152
orry-0.142
unk-0.141
 ear-0.140
Token sing
Token ID1702
Feature activation+2.08e-02

Pos logit contributions
 cheaper+1.090
 ne+1.089
 One+1.042
 swe+1.042
 coll+1.026
Neg logit contributions
leaf-1.277
ting-1.241
 ear-1.191
orry-1.159
 upper-1.155
Token little
Token ID1310
Feature activation+8.00e-02

Pos logit contributions
leaf+0.236
ting+0.233
orry+0.214
unk+0.211
 ear+0.207
Neg logit contributions
 ne-0.275
 One-0.266
 cheaper-0.261
 coll-0.260
 swe-0.259
Token bird
Token ID6512
Feature activation-1.72e-02

Pos logit contributions
 ne+0.111
 One+0.108
 cheaper+0.105
 coll+0.105
 swe+0.105
Neg logit contributions
leaf-0.127
ting-0.126
orry-0.118
unk-0.116
 ear-0.115
Token named
Token ID3706
Feature activation+5.32e-01

Pos logit contributions
leaf+0.025
ting+0.024
orry+0.023
unk+0.022
 ear+0.022
Neg logit contributions
 ne-0.027
 One-0.026
 cheaper-0.025
 coll-0.025
 swe-0.025
Token Twe
Token ID24205
Feature activation+9.62e-01

Pos logit contributions
 ne+0.557
 cheaper+0.519
 coll+0.514
 swe+0.510
 One+0.508
Neg logit contributions
leaf-1.031
ting-1.031
orry-0.977
unk-0.969
 ear-0.967
Tokenety
Token ID2963
Feature activation-1.65e-01

Pos logit contributions
 ne+0.906
 One+0.837
 swe+0.821
 cheaper+0.819
 coll+0.768
Neg logit contributions
ting-1.977
leaf-1.976
orry-1.905
unk-1.895
 upper-1.855
Token.
Token ID13
Feature activation+3.58e+00

Pos logit contributions
leaf+0.277
ting+0.271
orry+0.254
unk+0.252
 ear+0.248
Neg logit contributions
 ne-0.219
 One-0.213
 cheaper-0.205
 coll-0.205
One-0.204
Token Twe
Token ID24205
Feature activation+1.36e+00

Pos logit contributions
 cheaper+4.167
 coll+4.114
 swe+3.896
 problems+3.829
 scar+3.676
Neg logit contributions
ting-7.757
 ear-7.040
ges-6.354
 Stop-6.328
rified-6.260
Tokenety
Token ID2963
Feature activation+7.24e-01

Pos logit contributions
 ne+1.393
 One+1.276
 cheaper+1.275
 swe+1.275
 saf+1.199
Neg logit contributions
ting-2.707
leaf-2.673
orry-2.613
unk-2.605
other-2.540
Token loved
Token ID6151
Feature activation+3.31e-01

Pos logit contributions
 ne+1.311
 cheaper+1.300
 coll+1.285
 swe+1.283
 One+1.278
Neg logit contributions
ting-0.807
leaf-0.806
orry-0.750
unk-0.735
 ear-0.723
Token to
Token ID284
Feature activation+1.29e+00

Pos logit contributions
 ne+0.443
 One+0.429
 cheaper+0.421
 swe+0.413
 coll+0.412
Neg logit contributions
leaf-0.547
ting-0.540
orry-0.498
unk-0.496
 ear-0.495
Token whistle
Token ID16121
Feature activation+6.61e-02

Pos logit contributions
 cheaper+1.848
 ne+1.794
 swe+1.735
 One+1.700
 coll+1.688
Neg logit contributions
leaf-1.990
ting-1.921
 ear-1.903
 upper-1.827
orry-1.797
Token little
Token ID1310
Feature activation-4.02e-03

Pos logit contributions
leaf+0.127
ting+0.126
orry+0.116
unk+0.114
 ear+0.112
Neg logit contributions
 ne-0.151
 One-0.146
 cheaper-0.143
 coll-0.143
 swe-0.143
Token bird
Token ID6512
Feature activation-1.27e-01

Pos logit contributions
leaf+0.006
ting+0.006
orry+0.006
unk+0.006
 ear+0.006
Neg logit contributions
 ne-0.006
 One-0.005
 cheaper-0.005
 coll-0.005
 swe-0.005
Token named
Token ID3706
Feature activation+3.27e-01

Pos logit contributions
leaf+0.185
ting+0.181
orry+0.168
unk+0.166
 ear+0.163
Neg logit contributions
 ne-0.196
 One-0.191
 cheaper-0.185
 coll-0.185
 swe-0.185
Token Twe
Token ID24205
Feature activation+8.83e-01

Pos logit contributions
 ne+0.395
 cheaper+0.371
 One+0.371
 coll+0.369
 swe+0.367
Neg logit contributions
leaf-0.585
ting-0.580
orry-0.547
unk-0.542
 ear-0.539
Tokenety
Token ID2963
Feature activation-6.00e-02

Pos logit contributions
 ne+0.817
 One+0.754
 swe+0.738
 cheaper+0.733
 coll+0.695
Neg logit contributions
leaf-1.836
ting-1.833
orry-1.758
unk-1.753
 upper-1.711
Token.
Token ID13
Feature activation+3.56e+00

Pos logit contributions
leaf+0.099
ting+0.097
orry+0.091
unk+0.090
 ear+0.089
Neg logit contributions
 ne-0.081
 One-0.078
 cheaper-0.076
 coll-0.076
 swe-0.075
Token Twe
Token ID24205
Feature activation+1.40e+00

Pos logit contributions
 coll+4.056
 cheaper+3.928
 swe+3.810
 problems+3.739
 former+3.704
Neg logit contributions
ting-7.512
 ear-6.999
leaf-6.393
 Stop-6.276
rified-6.202
Tokenety
Token ID2963
Feature activation+9.53e-01

Pos logit contributions
 ne+1.446
 swe+1.326
 cheaper+1.322
 One+1.316
 coll+1.237
Neg logit contributions
ting-2.793
leaf-2.759
orry-2.699
unk-2.696
other-2.623
Token wanted
Token ID2227
Feature activation+9.20e-01

Pos logit contributions
 cheaper+1.738
 ne+1.732
 coll+1.712
 swe+1.709
 One+1.684
Neg logit contributions
ting-1.038
leaf-1.036
orry-0.974
unk-0.951
 ear-0.945
Token to
Token ID284
Feature activation+3.85e-01

Pos logit contributions
 ne+1.352
 One+1.278
 coll+1.273
 cheaper+1.264
 swe+1.256
Neg logit contributions
leaf-1.429
ting-1.382
 ear-1.278
 upper-1.262
orry-1.255
Token fly
Token ID6129
Feature activation+1.14e+00

Pos logit contributions
 ne+0.492
 One+0.472
 cheaper+0.471
 swe+0.464
 coll+0.464
Neg logit contributions
leaf-0.652
ting-0.638
orry-0.602
 ear-0.601
unk-0.598
Token little
Token ID1310
Feature activation+1.52e-02

Pos logit contributions
leaf+0.251
ting+0.248
orry+0.227
unk+0.224
 ear+0.220
Neg logit contributions
 ne-0.292
 One-0.282
 cheaper-0.277
 coll-0.276
 swe-0.275
Token bird
Token ID6512
Feature activation+8.78e-02

Pos logit contributions
 ne+0.021
 One+0.021
 cheaper+0.020
 coll+0.020
 swe+0.020
Neg logit contributions
leaf-0.024
ting-0.024
orry-0.022
unk-0.022
 ear-0.022
Token named
Token ID3706
Feature activation+2.61e-01

Pos logit contributions
 ne+0.138
 One+0.134
 cheaper+0.132
 swe+0.131
 coll+0.131
Neg logit contributions
leaf-0.123
ting-0.122
orry-0.113
unk-0.111
 ear-0.110
Token Twe
Token ID24205
Feature activation+1.08e+00

Pos logit contributions
 ne+0.273
 One+0.256
 cheaper+0.254
 coll+0.252
 swe+0.251
Neg logit contributions
leaf-0.510
ting-0.507
orry-0.479
unk-0.476
 ear-0.473
Tokenety
Token ID2963
Feature activation+1.52e-01

Pos logit contributions
 ne+1.033
 One+0.944
 swe+0.930
 cheaper+0.927
 saf+0.872
Neg logit contributions
ting-2.204
leaf-2.203
orry-2.119
unk-2.117
 upper-2.066
Token.
Token ID13
Feature activation+3.56e+00

Pos logit contributions
 ne+0.210
 One+0.203
 cheaper+0.200
 swe+0.198
 coll+0.198
Neg logit contributions
leaf-0.242
ting-0.241
orry-0.225
unk-0.223
 ear-0.220
Token Twe
Token ID24205
Feature activation+1.58e+00

Pos logit contributions
 coll+3.997
 cheaper+3.876
 swe+3.714
 former+3.657
 problems+3.597
Neg logit contributions
ting-7.686
 ear-7.046
leaf-6.319
 Stop-6.310
rified-6.254
Tokenety
Token ID2963
Feature activation+1.10e+00

Pos logit contributions
 ne+1.664
 cheaper+1.524
 swe+1.516
 One+1.504
 saf+1.448
Neg logit contributions
ting-3.100
leaf-3.044
orry-2.988
unk-2.988
other-2.911
Token had
Token ID550
Feature activation+5.68e-01

Pos logit contributions
 cheaper+2.063
 ne+2.041
 coll+2.031
 swe+2.026
 One+1.983
Neg logit contributions
ting-1.141
leaf-1.139
orry-1.075
unk-1.047
 ear-1.044
Token an
Token ID281
Feature activation+7.85e-01

Pos logit contributions
 ne+0.741
 cheaper+0.709
 One+0.705
 swe+0.698
 coll+0.694
Neg logit contributions
leaf-0.959
ting-0.937
 ear-0.890
orry-0.883
unk-0.883
Token unusual
Token ID8468
Feature activation+1.45e+00

Pos logit contributions
 ne+1.140
 cheaper+1.066
 One+1.064
 swe+1.063
 coll+1.052
Neg logit contributions
leaf-1.209
ting-1.167
 ear-1.146
orry-1.131
unk-1.120
Token bird
Token ID6512
Feature activation-4.91e-02

Pos logit contributions
leaf+0.127
ting+0.125
orry+0.117
unk+0.116
 ear+0.114
Neg logit contributions
 ne-0.109
 One-0.105
 cheaper-0.103
 coll-0.103
 swe-0.102
Tokenie
Token ID494
Feature activation-3.99e-01

Pos logit contributions
leaf+0.072
ting+0.070
orry+0.065
unk+0.064
 ear+0.063
Neg logit contributions
 ne-0.075
 One-0.073
 cheaper-0.071
 coll-0.071
 swe-0.071
Token named
Token ID3706
Feature activation+1.72e-01

Pos logit contributions
leaf+0.645
ting+0.626
orry+0.586
unk+0.577
 upper+0.571
Neg logit contributions
 ne-0.560
 One-0.545
 coll-0.525
One-0.522
 cheaper-0.519
Token Twe
Token ID24205
Feature activation+8.47e-01

Pos logit contributions
 ne+0.193
 One+0.183
 cheaper+0.181
 coll+0.179
 swe+0.178
Neg logit contributions
leaf-0.323
ting-0.321
orry-0.303
unk-0.300
 ear-0.298
Tokenety
Token ID2963
Feature activation+8.12e-02

Pos logit contributions
 ne+0.780
 One+0.721
 cheaper+0.702
 swe+0.689
 saf+0.667
Neg logit contributions
ting-1.758
leaf-1.750
orry-1.682
unk-1.677
 upper-1.638
Token.
Token ID13
Feature activation+3.56e+00

Pos logit contributions
 ne+0.111
 One+0.107
 cheaper+0.105
 swe+0.104
 coll+0.104
Neg logit contributions
leaf-0.131
ting-0.130
orry-0.121
unk-0.120
 ear-0.119
Token Twe
Token ID24205
Feature activation+1.15e+00

Pos logit contributions
 coll+4.057
 cheaper+3.756
 scar+3.669
 problems+3.629
 swe+3.628
Neg logit contributions
ting-7.549
 ear-6.943
leaf-6.346
 Stop-6.317
ada-6.161
Tokenety
Token ID2963
Feature activation+1.16e+00

Pos logit contributions
 ne+1.132
 One+1.030
 cheaper+1.028
 swe+1.004
 saf+0.980
Neg logit contributions
ting-2.318
leaf-2.293
orry-2.241
unk-2.233
other-2.163
Token loved
Token ID6151
Feature activation+4.00e-01

Pos logit contributions
 cheaper+2.189
 ne+2.162
 coll+2.160
 swe+2.150
 One+2.102
Neg logit contributions
ting-1.195
leaf-1.185
orry-1.120
unk-1.089
 ear-1.086
Token to
Token ID284
Feature activation+9.78e-01

Pos logit contributions
 ne+0.541
 One+0.524
 cheaper+0.515
 swe+0.504
 coll+0.502
Neg logit contributions
leaf-0.657
ting-0.648
orry-0.596
 ear-0.593
unk-0.593
Token fly
Token ID6129
Feature activation+1.04e+00

Pos logit contributions
 cheaper+1.349
 ne+1.333
 swe+1.284
 One+1.274
 coll+1.261
Neg logit contributions
leaf-1.556
ting-1.507
 ear-1.463
 upper-1.414
orry-1.410
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
 ne+0.070
 One+0.068
 cheaper+0.066
 coll+0.066
 swe+0.066
Neg logit contributions
leaf-0.087
ting-0.086
orry-0.081
unk-0.080
 ear-0.079
Token were
Token ID547
Feature activation+6.53e-01

Pos logit contributions
 One+1.598
 ne+1.561
One+1.549
 cheaper+1.528
 coll+1.507
Neg logit contributions
leaf-1.955
ting-1.945
 ear-1.745
orry-1.741
 upper-1.685
Token very
Token ID845
Feature activation+3.03e-01

Pos logit contributions
 ne+1.040
 One+1.032
 cheaper+0.998
One+0.998
 coll+0.995
Neg logit contributions
leaf-0.890
ting-0.880
unk-0.815
orry-0.814
 ear-0.811
Token happy
Token ID3772
Feature activation+1.80e-02

Pos logit contributions
 ne+0.446
 One+0.439
 coll+0.426
 cheaper+0.424
 swe+0.424
Neg logit contributions
leaf-0.457
ting-0.444
orry-0.418
unk-0.413
 ear-0.410
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
 ne+0.025
 One+0.025
 cheaper+0.024
 coll+0.024
 swe+0.024
Neg logit contributions
leaf-0.028
ting-0.028
orry-0.026
unk-0.026
 ear-0.026
Token When
Token ID1649
Feature activation+3.53e+00

Pos logit contributions
 ne+2.420
 swe+2.303
 cheaper+2.264
 coll+2.243
One+2.163
Neg logit contributions
ting-2.903
leaf-2.728
 ear-2.567
orry-2.562
unk-2.486
Token they
Token ID484
Feature activation+1.32e-01

Pos logit contributions
 coll+7.232
 former+7.095
 swe+6.999
 ne+6.867
onna+6.852
Neg logit contributions
ting-4.193
 ear-3.348
leaf-2.997
 expecting-2.851
 adults-2.754
Token returned
Token ID4504
Feature activation+9.21e-01

Pos logit contributions
 ne+0.175
 One+0.170
 cheaper+0.167
 coll+0.166
 swe+0.166
Neg logit contributions
leaf-0.218
ting-0.216
orry-0.202
unk-0.199
 ear-0.197
Token home
Token ID1363
Feature activation+9.43e-01

Pos logit contributions
 ne+1.338
 One+1.312
One+1.298
 coll+1.290
 cheaper+1.287
Neg logit contributions
ting-1.426
leaf-1.373
 ear-1.298
unk-1.265
orry-1.256
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
 One+0.990
 ne+0.988
One+0.977
 coll+0.958
 swe+0.939
Neg logit contributions
ting-1.789
leaf-1.759
unk-1.666
orry-1.664
 ear-1.649
Token Twe
Token ID24205
Feature activation+1.20e+00

Pos logit contributions
 ne+1.839
 coll+1.834
One+1.787
 swe+1.764
 One+1.740
Neg logit contributions
ting-3.324
leaf-3.046
 ear-2.957
orry-2.843
 expecting-2.813
Token small
Token ID1402
Feature activation+3.98e-01

Pos logit contributions
leaf+0.269
ting+0.266
orry+0.244
unk+0.241
 ear+0.236
Neg logit contributions
 ne-0.318
 One-0.308
 cheaper-0.302
 coll-0.301
 swe-0.301
Token bird
Token ID6512
Feature activation+1.26e-02

Pos logit contributions
 ne+0.611
 One+0.596
 cheaper+0.586
 swe+0.580
One+0.579
Neg logit contributions
leaf-0.567
ting-0.563
orry-0.522
unk-0.520
 ear-0.520
Token named
Token ID3706
Feature activation+4.79e-01

Pos logit contributions
 ne+0.020
 One+0.019
 cheaper+0.019
 coll+0.019
 swe+0.019
Neg logit contributions
leaf-0.018
ting-0.017
orry-0.016
unk-0.016
 ear-0.016
Token Twe
Token ID24205
Feature activation+5.93e-01

Pos logit contributions
 ne+0.502
 cheaper+0.469
 coll+0.463
 One+0.462
 swe+0.462
Neg logit contributions
leaf-0.928
ting-0.924
orry-0.870
unk-0.865
 ear-0.863
Tokenety
Token ID2963
Feature activation-2.01e-02

Pos logit contributions
 ne+0.516
 One+0.476
 cheaper+0.461
 swe+0.455
 coll+0.437
Neg logit contributions
leaf-1.259
ting-1.256
orry-1.200
unk-1.195
 upper-1.172
Token.
Token ID13
Feature activation+3.51e+00

Pos logit contributions
leaf+0.033
ting+0.033
orry+0.030
unk+0.030
 ear+0.030
Neg logit contributions
 ne-0.027
 One-0.026
 cheaper-0.026
 coll-0.025
 swe-0.025
Token Twe
Token ID24205
Feature activation+8.79e-01

Pos logit contributions
 coll+3.919
 cheaper+3.692
 problems+3.609
 scar+3.578
 former+3.559
Neg logit contributions
ting-7.522
 ear-6.948
leaf-6.281
 Stop-6.270
ges-6.087
Tokenety
Token ID2963
Feature activation+8.99e-01

Pos logit contributions
 ne+0.823
 One+0.749
 cheaper+0.742
 swe+0.727
 saf+0.709
Neg logit contributions
ting-1.815
leaf-1.812
orry-1.747
unk-1.740
other-1.687
Token lived
Token ID5615
Feature activation-3.36e-01

Pos logit contributions
 cheaper+1.641
 ne+1.640
 coll+1.617
 swe+1.617
 One+1.595
Neg logit contributions
leaf-0.979
ting-0.977
orry-0.913
unk-0.894
 ear-0.887
Token in
Token ID287
Feature activation+1.36e+00

Pos logit contributions
leaf+0.391
ting+0.379
orry+0.350
unk+0.337
 upper+0.333
Neg logit contributions
 ne-0.622
 One-0.601
One-0.588
 cheaper-0.587
 swe-0.585
Token a
Token ID257
Feature activation+1.11e+00

Pos logit contributions
 ne+1.992
 cheaper+1.975
 swe+1.959
 coll+1.929
 One+1.858
Neg logit contributions
leaf-2.024
ting-2.012
 ear-1.932
orry-1.812
unk-1.812
Token little
Token ID1310
Feature activation+1.68e-01

Pos logit contributions
leaf+0.137
ting+0.135
orry+0.124
unk+0.123
 ear+0.120
Neg logit contributions
 ne-0.163
 One-0.158
 cheaper-0.155
 coll-0.154
 swe-0.154
Token bird
Token ID6512
Feature activation-4.40e-02

Pos logit contributions
 ne+0.237
 One+0.230
 cheaper+0.225
 swe+0.223
 coll+0.223
Neg logit contributions
leaf-0.263
ting-0.260
orry-0.242
unk-0.240
 ear-0.238
Token named
Token ID3706
Feature activation+4.43e-01

Pos logit contributions
leaf+0.063
ting+0.062
orry+0.058
unk+0.057
 ear+0.056
Neg logit contributions
 ne-0.068
 One-0.066
 cheaper-0.065
 coll-0.064
 swe-0.064
Token Twe
Token ID24205
Feature activation+1.06e+00

Pos logit contributions
 ne+0.476
 cheaper+0.445
 One+0.442
 coll+0.442
 swe+0.439
Neg logit contributions
leaf-0.849
ting-0.845
orry-0.799
unk-0.794
 ear-0.788
Tokenety
Token ID2963
Feature activation-1.07e-02

Pos logit contributions
 ne+1.018
 One+0.937
 swe+0.925
 cheaper+0.915
 saf+0.860
Neg logit contributions
ting-2.168
leaf-2.168
orry-2.087
unk-2.085
 upper-2.029
Token.
Token ID13
Feature activation+3.50e+00

Pos logit contributions
leaf+0.017
ting+0.017
orry+0.016
unk+0.016
 ear+0.016
Neg logit contributions
 ne-0.015
 One-0.014
 cheaper-0.014
 coll-0.014
 swe-0.014
Token Twe
Token ID24205
Feature activation+1.46e+00

Pos logit contributions
 coll+3.952
 cheaper+3.763
 swe+3.644
 scar+3.594
 problems+3.501
Neg logit contributions
ting-7.479
 ear-6.865
leaf-6.328
 Stop-6.278
rified-6.176
Tokenety
Token ID2963
Feature activation+9.29e-01

Pos logit contributions
 ne+1.523
 swe+1.407
 cheaper+1.396
 One+1.385
 saf+1.316
Neg logit contributions
ting-2.892
leaf-2.855
unk-2.796
orry-2.795
other-2.722
Token lived
Token ID5615
Feature activation-3.35e-01

Pos logit contributions
 cheaper+1.688
 ne+1.681
 coll+1.666
 swe+1.661
 One+1.639
Neg logit contributions
ting-1.022
leaf-1.015
orry-0.956
unk-0.933
 ear-0.926
Token in
Token ID287
Feature activation+1.35e+00

Pos logit contributions
leaf+0.385
ting+0.373
orry+0.343
unk+0.331
 upper+0.326
Neg logit contributions
 ne-0.626
 One-0.606
One-0.593
 cheaper-0.591
 swe-0.589
Token a
Token ID257
Feature activation+1.15e+00

Pos logit contributions
 ne+1.980
 cheaper+1.967
 swe+1.946
 coll+1.916
 One+1.849
Neg logit contributions
leaf-2.016
ting-2.006
 ear-1.921
orry-1.816
unk-1.810
Token in
Token ID287
Feature activation+7.03e-01

Pos logit contributions
 ne+1.189
 One+1.176
 cheaper+1.157
 coll+1.157
One+1.147
Neg logit contributions
ting-0.957
leaf-0.944
orry-0.875
unk-0.863
 ear-0.863
Token the
Token ID262
Feature activation-2.74e-01

Pos logit contributions
 ne+1.048
 One+1.011
 coll+1.001
 cheaper+0.997
 swe+0.986
Neg logit contributions
ting-1.029
leaf-1.014
 ear-0.959
orry-0.934
unk-0.923
Token dirt
Token ID13647
Feature activation-5.00e-02

Pos logit contributions
leaf+0.394
ting+0.385
orry+0.357
unk+0.354
 ear+0.343
Neg logit contributions
 ne-0.431
 One-0.422
 swe-0.412
 cheaper-0.409
 coll-0.407
Token.
Token ID13
Feature activation+2.43e+00

Pos logit contributions
leaf+0.082
ting+0.081
orry+0.076
unk+0.075
 ear+0.074
Neg logit contributions
 ne-0.067
 One-0.065
 cheaper-0.063
 swe-0.063
 coll-0.063
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
 ne+3.631
 swe+3.490
 cheaper+3.408
 coll+3.244
 spe+3.109
Neg logit contributions
ting-3.941
 ear-3.701
leaf-3.626
other-3.606
orry-3.518
Token
Token ID198
Feature activation+3.50e+00

Pos logit contributions
 ne+1.319
 swe+1.278
 cheaper+1.244
 coll+1.237
 shin+1.146
Neg logit contributions
leaf-1.728
ting-1.681
orry-1.633
unk-1.618
other-1.595
TokenOne
Token ID3198
Feature activation-7.59e-01

Pos logit contributions
 swe+2.700
 ne+2.504
 cheaper+2.343
 spe+2.157
 coll+2.124
Neg logit contributions
ting-8.447
other-8.060
leaf-7.924
orry-7.830
 ear-7.562
Token day
Token ID1110
Feature activation+1.77e+00

Pos logit contributions
leaf+1.407
ting+1.404
unk+1.285
orry+1.278
 upper+1.263
Neg logit contributions
 One-0.908
 ne-0.902
One-0.862
 swe-0.831
 saf-0.830
Token,
Token ID11
Feature activation+1.94e+00

Pos logit contributions
 swe+2.088
 cheaper+2.075
 coll+2.073
 former+2.052
 ne+1.970
Neg logit contributions
ting-3.301
leaf-3.042
 ear-2.966
orry-2.890
unk-2.839
Token Gold
Token ID3561
Feature activation+1.41e+00

Pos logit contributions
 swe+2.324
 cheaper+2.195
 coll+2.174
 ne+2.158
 former+2.122
Neg logit contributions
ting-3.671
leaf-3.401
 ear-3.308
orry-3.136
rified-3.088
Token heard
Token ID2982
Feature activation+1.21e+00

Pos logit contributions
 cheaper+2.219
 ne+2.216
 One+2.173
 swe+2.143
 coll+2.120
Neg logit contributions
ting-1.999
unk-1.872
leaf-1.869
orry-1.865
other-1.748
Token little
Token ID1310
Feature activation-3.98e-02

Pos logit contributions
leaf+0.281
ting+0.277
orry+0.254
unk+0.251
 ear+0.246
Neg logit contributions
 ne-0.335
 One-0.324
 cheaper-0.318
 coll-0.317
 swe-0.317
Token bird
Token ID6512
Feature activation-7.44e-02

Pos logit contributions
leaf+0.064
ting+0.063
orry+0.059
unk+0.058
 ear+0.057
Neg logit contributions
 ne-0.055
 One-0.054
 cheaper-0.052
 coll-0.052
 swe-0.052
Token named
Token ID3706
Feature activation+3.51e-01

Pos logit contributions
leaf+0.107
ting+0.105
orry+0.098
unk+0.096
 ear+0.095
Neg logit contributions
 ne-0.116
 One-0.113
 cheaper-0.110
 coll-0.109
 swe-0.109
Token Twe
Token ID24205
Feature activation+1.10e+00

Pos logit contributions
 ne+0.384
 cheaper+0.359
 One+0.357
 coll+0.357
 swe+0.355
Neg logit contributions
leaf-0.667
ting-0.665
orry-0.627
unk-0.622
 ear-0.620
Tokenety
Token ID2963
Feature activation-1.70e-01

Pos logit contributions
 ne+1.062
 One+0.977
 swe+0.963
 cheaper+0.957
 saf+0.899
Neg logit contributions
leaf-2.233
ting-2.233
orry-2.150
unk-2.148
 upper-2.097
Token.
Token ID13
Feature activation+3.50e+00

Pos logit contributions
leaf+0.284
ting+0.278
orry+0.261
unk+0.258
 ear+0.254
Neg logit contributions
 ne-0.226
 One-0.220
 cheaper-0.212
 coll-0.211
One-0.211
Token Twe
Token ID24205
Feature activation+1.46e+00

Pos logit contributions
 coll+3.920
 cheaper+3.764
 swe+3.570
 problems+3.503
 former+3.464
Neg logit contributions
ting-7.654
 ear-6.997
 Stop-6.318
leaf-6.230
rified-6.214
Tokenety
Token ID2963
Feature activation+8.57e-01

Pos logit contributions
 ne+1.522
 cheaper+1.396
 swe+1.394
 One+1.389
 saf+1.314
Neg logit contributions
ting-2.873
leaf-2.846
unk-2.786
orry-2.782
other-2.713
Token was
Token ID373
Feature activation+9.66e-01

Pos logit contributions
 ne+1.581
 cheaper+1.579
 coll+1.560
 swe+1.557
 One+1.538
Neg logit contributions
ting-0.918
leaf-0.917
orry-0.855
unk-0.836
 ear-0.826
Token very
Token ID845
Feature activation-2.48e-01

Pos logit contributions
 ne+1.461
 cheaper+1.429
 swe+1.421
 coll+1.409
 One+1.405
Neg logit contributions
ting-1.386
leaf-1.386
 ear-1.318
unk-1.314
orry-1.280
Token smart
Token ID4451
Feature activation-1.30e+00

Pos logit contributions
leaf+0.403
ting+0.402
orry+0.372
unk+0.366
 ear+0.361
Neg logit contributions
 ne-0.342
 One-0.331
 cheaper-0.321
 swe-0.318
 coll-0.317
Token.
Token ID13
Feature activation+2.20e+00

Pos logit contributions
 ne+0.418
 One+0.404
 swe+0.396
 coll+0.395
 cheaper+0.395
Neg logit contributions
leaf-0.396
ting-0.392
orry-0.360
unk-0.360
 ear-0.354
Token
Token ID220
Feature activation+2.78e-01

Pos logit contributions
 ne+3.100
 coll+2.938
 cheaper+2.934
 swe+2.888
 scar+2.756
Neg logit contributions
ting-3.709
leaf-3.475
 ear-3.256
 Stop-3.149
other-3.118
Token
Token ID198
Feature activation+1.44e+00

Pos logit contributions
 ne+0.311
 One+0.291
 cheaper+0.290
 swe+0.288
 coll+0.287
Neg logit contributions
leaf-0.518
ting-0.513
orry-0.487
unk-0.484
 ear-0.474
Token
Token ID198
Feature activation+1.83e+00

Pos logit contributions
 ne+1.942
 swe+1.873
 cheaper+1.859
 coll+1.853
 enough+1.780
Neg logit contributions
leaf-2.430
ting-2.381
other-2.180
unk-2.140
orry-2.138
TokenBut
Token ID1537
Feature activation+2.28e+00

Pos logit contributions
 ne+2.741
 swe+2.651
 coll+2.650
 cheaper+2.618
 enough+2.586
Neg logit contributions
leaf-2.820
ting-2.807
other-2.461
orry-2.405
 Stop-2.380
Token when
Token ID618
Feature activation+3.46e+00

Pos logit contributions
 coll+3.468
 ne+3.318
 swe+3.274
 cheaper+3.236
One+3.213
Neg logit contributions
ting-3.779
 ear-3.338
leaf-3.107
 expecting-2.991
rified-2.925
Token Twe
Token ID24205
Feature activation+1.89e+00

Pos logit contributions
 coll+5.893
 cheaper+5.184
 swe+5.129
 former+5.060
onna+5.037
Neg logit contributions
ting-5.845
 ear-4.992
 oh-4.669
 Stop-4.644
rified-4.573
Tokenety
Token ID2963
Feature activation+1.62e+00

Pos logit contributions
 ne+2.175
 cheaper+1.991
 One+1.964
 saf+1.954
 swe+1.939
Neg logit contributions
ting-3.619
unk-3.544
orry-3.529
leaf-3.490
other-3.413
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
 cheaper+2.484
 coll+2.470
 swe+2.443
 ne+2.382
 One+2.360
Neg logit contributions
ting-2.450
leaf-2.245
orry-2.242
unk-2.198
 ear-2.117
Token the
Token ID262
Feature activation+1.11e+00

Pos logit contributions
 cheaper+2.818
 ne+2.816
One+2.812
 coll+2.767
 One+2.760
Neg logit contributions
 ear-2.667
ting-2.666
leaf-2.481
 finger-2.313
 hand-2.305
Token owl
Token ID39610
Feature activation+1.93e+00

Pos logit contributions
 ne+1.747
 One+1.688
 cheaper+1.668
 coll+1.654
 swe+1.636
Neg logit contributions
leaf-1.537
ting-1.535
 ear-1.512
 upper-1.437
unk-1.426
Token little
Token ID1310
Feature activation+8.93e-03

Pos logit contributions
leaf+0.147
ting+0.145
orry+0.133
unk+0.132
 ear+0.129
Neg logit contributions
 ne-0.173
 One-0.168
 cheaper-0.165
 coll-0.164
 swe-0.164
Token bird
Token ID6512
Feature activation-1.89e-01

Pos logit contributions
 ne+0.012
 One+0.012
 cheaper+0.012
 coll+0.012
 swe+0.012
Neg logit contributions
leaf-0.014
ting-0.014
orry-0.013
unk-0.013
 ear-0.013
Token named
Token ID3706
Feature activation+4.24e-01

Pos logit contributions
leaf+0.279
ting+0.273
orry+0.254
unk+0.250
 upper+0.246
Neg logit contributions
 ne-0.288
 One-0.280
 coll-0.271
 cheaper-0.271
One-0.270
Token Twe
Token ID24205
Feature activation+7.84e-01

Pos logit contributions
 ne+0.486
 cheaper+0.456
 One+0.454
 coll+0.453
 swe+0.450
Neg logit contributions
leaf-0.780
ting-0.776
orry-0.731
unk-0.726
 ear-0.721
Tokenety
Token ID2963
Feature activation-1.64e-02

Pos logit contributions
 ne+0.722
 One+0.664
 swe+0.650
 cheaper+0.649
 coll+0.618
Neg logit contributions
leaf-1.632
ting-1.624
orry-1.565
unk-1.558
 upper-1.522
Token.
Token ID13
Feature activation+3.46e+00

Pos logit contributions
leaf+0.027
ting+0.026
orry+0.025
unk+0.024
 ear+0.024
Neg logit contributions
 ne-0.022
 One-0.022
 cheaper-0.021
 coll-0.021
 swe-0.021
Token He
Token ID679
Feature activation+9.26e-01

Pos logit contributions
 coll+3.886
 cheaper+3.884
 swe+3.630
 problems+3.572
 scar+3.550
Neg logit contributions
ting-7.256
 ear-6.840
leaf-6.058
 Stop-6.054
rified-5.995
Token had
Token ID550
Feature activation+7.57e-01

Pos logit contributions
 cheaper+1.656
 ne+1.627
 swe+1.621
 One+1.612
 coll+1.608
Neg logit contributions
leaf-1.075
ting-1.065
orry-0.989
 ear-0.987
unk-0.984
Token an
Token ID281
Feature activation+6.70e-01

Pos logit contributions
 ne+1.015
 cheaper+0.980
 swe+0.964
 One+0.963
 coll+0.954
Neg logit contributions
leaf-1.249
ting-1.218
 ear-1.169
unk-1.153
orry-1.148
Token orange
Token ID10912
Feature activation+6.64e-01

Pos logit contributions
 ne+0.975
 cheaper+0.914
 One+0.914
 swe+0.912
 coll+0.903
Neg logit contributions
leaf-1.029
ting-0.994
 ear-0.967
orry-0.960
unk-0.950
Token be
Token ID307
Feature activation+6.23e-02

Pos logit contributions
 ne+0.889
 cheaper+0.852
 One+0.842
 swe+0.842
 coll+0.819
Neg logit contributions
leaf-1.067
ting-1.064
 ear-1.024
unk-1.012
orry-1.000
Token so
Token ID523
Feature activation-6.87e-01

Pos logit contributions
leaf+2.151
ting+2.085
unk+2.041
orry+1.972
other+1.935
Neg logit contributions
 ne-2.118
 cheaper-2.077
 One-2.022
 swe-1.973
One-1.933
Token big
Token ID1263
Feature activation-7.17e-01

Pos logit contributions
leaf+1.238
ting+1.230
unk+1.157
orry+1.133
 upper+1.112
Neg logit contributions
 ne-0.841
 One-0.817
 cheaper-0.792
 swe-0.776
One-0.766
Token!
Token ID0
Feature activation-4.68e-01

Pos logit contributions
leaf+1.177
ting+1.107
orry+1.085
unk+1.085
 upper+1.053
Neg logit contributions
 ne-0.984
 One-0.950
 cheaper-0.932
 swe-0.922
 coll-0.902
Token She
Token ID1375
Feature activation-8.97e-02

Pos logit contributions
leaf+0.848
ting+0.843
unk+0.790
orry+0.787
 upper+0.772
Neg logit contributions
 One-0.561
 ne-0.548
One-0.513
 cheaper-0.511
 swe-0.507
Token touched
Token ID12615
Feature activation-1.33e+00

Pos logit contributions
leaf+0.142
ting+0.140
orry+0.131
unk+0.130
 ear+0.127
Neg logit contributions
 ne-0.127
 One-0.123
 cheaper-0.120
 swe-0.119
 coll-0.119
Token the
Token ID262
Feature activation-3.70e+00

Pos logit contributions
leaf+2.428
ting+2.250
unk+2.235
orry+2.220
 upper+2.178
Neg logit contributions
 One-1.636
 ne-1.600
 swe-1.512
One-1.501
 cheaper-1.494
Token screen
Token ID3159
Feature activation-1.92e-01

Pos logit contributions
idget+6.709
orry+6.563
udge+6.530
asing+6.469
ting+6.432
Neg logit contributions
 One-4.899
 saf-4.884
 ne-4.795
 coll-4.735
 cheaper-4.616
Token and
Token ID290
Feature activation-1.15e+00

Pos logit contributions
leaf+0.322
ting+0.316
orry+0.298
unk+0.295
 ear+0.289
Neg logit contributions
 ne-0.253
 One-0.244
 cheaper-0.238
 swe-0.236
 coll-0.234
Token it
Token ID340
Feature activation-2.59e-01

Pos logit contributions
leaf+2.025
ting+1.939
unk+1.910
orry+1.868
 upper+1.833
Neg logit contributions
 ne-1.508
 One-1.408
 cheaper-1.350
 swe-1.346
 coll-1.319
Token lit
Token ID6578
Feature activation+9.05e-01

Pos logit contributions
leaf+0.469
ting+0.459
orry+0.432
unk+0.430
 ear+0.424
Neg logit contributions
 ne-0.316
 One-0.300
 cheaper-0.292
 swe-0.291
 coll-0.287
Token up
Token ID510
Feature activation+2.63e-01

Pos logit contributions
 ne+1.674
 One+1.647
 coll+1.638
 cheaper+1.623
One+1.614
Neg logit contributions
ting-1.010
leaf-0.982
 ear-0.881
orry-0.870
unk-0.851
Token over
Token ID625
Feature activation-4.44e-01

Pos logit contributions
leaf+1.338
ting+1.248
orry+1.230
 upper+1.175
unk+1.132
Neg logit contributions
 One-1.966
 ne-1.961
One-1.924
 swe-1.890
 coll-1.859
Token and
Token ID290
Feature activation-4.15e-01

Pos logit contributions
leaf+0.789
ting+0.771
orry+0.743
unk+0.729
 upper+0.720
Neg logit contributions
 ne-0.544
 One-0.532
 coll-0.508
One-0.507
 swe-0.506
Token accidentally
Token ID14716
Feature activation-1.29e+00

Pos logit contributions
leaf+0.712
ting+0.698
orry+0.672
unk+0.666
 upper+0.652
Neg logit contributions
 ne-0.541
 One-0.516
 swe-0.504
 cheaper-0.499
 coll-0.491
Token stepped
Token ID10764
Feature activation-1.34e+00

Pos logit contributions
leaf+2.177
ting+2.098
orry+1.993
other+1.952
 upper+1.910
Neg logit contributions
 ne-1.820
 One-1.664
 swe-1.662
 cheaper-1.601
 coll-1.596
Token on
Token ID319
Feature activation+3.66e-01

Pos logit contributions
leaf+2.204
orry+2.026
ting+2.001
 upper+1.981
unk+1.911
Neg logit contributions
 ne-1.898
 One-1.868
 swe-1.807
One-1.779
 cheaper-1.765
Token the
Token ID262
Feature activation-3.63e+00

Pos logit contributions
 ne+0.526
 One+0.503
 cheaper+0.503
 coll+0.494
 swe+0.494
Neg logit contributions
leaf-0.566
ting-0.564
orry-0.519
 ear-0.518
unk-0.515
Token cricket
Token ID18836
Feature activation-1.16e+00

Pos logit contributions
leaf+5.840
orry+5.766
udge+5.344
idget+5.344
unk+5.337
Neg logit contributions
 swe-5.958
 coll-5.941
 ne-5.629
 One-5.547
 scar-5.396
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
leaf+2.178
ting+2.050
orry+2.005
unk+1.951
 upper+1.938
Neg logit contributions
 One-1.407
 ne-1.377
 coll-1.308
 swe-1.297
 saf-1.286
Token The
Token ID383
Feature activation-2.54e+00

Pos logit contributions
 ne+0.163
 One+0.156
 cheaper+0.153
 coll+0.152
 swe+0.152
Neg logit contributions
leaf-0.212
ting-0.210
orry-0.197
unk-0.195
 ear-0.193
Token cricket
Token ID18836
Feature activation-5.65e-01

Pos logit contributions
leaf+3.013
ting+2.795
orry+2.651
unk+2.412
asp+2.389
Neg logit contributions
 One-5.016
 coll-4.988
 swe-4.980
One-4.885
 ne-4.816
Token started
Token ID2067
Feature activation-2.29e-01

Pos logit contributions
leaf+0.998
ting+0.965
orry+0.909
unk+0.896
 upper+0.887
Neg logit contributions
 ne-0.736
 One-0.696
 swe-0.671
 cheaper-0.659
One-0.657
Token through
Token ID832
Feature activation-5.28e-01

Pos logit contributions
leaf+1.657
ting+1.503
orry+1.465
unk+1.419
 upper+1.373
Neg logit contributions
 ne-1.729
 One-1.667
 swe-1.645
 cheaper-1.606
 coll-1.586
Token the
Token ID262
Feature activation-3.29e+00

Pos logit contributions
leaf+0.896
ting+0.862
orry+0.828
unk+0.820
 upper+0.797
Neg logit contributions
 ne-0.709
 One-0.686
 cheaper-0.657
 swe-0.656
One-0.649
Token small
Token ID1402
Feature activation-1.27e+00

Pos logit contributions
unk+5.047
leaf+5.008
ting+4.982
orry+4.902
udge+4.722
Neg logit contributions
 coll-5.303
 One-5.165
 saf-4.979
 swe-4.955
One-4.906
Token hole
Token ID7604
Feature activation-9.57e-01

Pos logit contributions
leaf+2.067
ting+1.960
orry+1.894
unk+1.835
 upper+1.781
Neg logit contributions
 One-1.798
 coll-1.784
 ne-1.784
 swe-1.723
One-1.677
Token in
Token ID287
Feature activation-4.93e-01

Pos logit contributions
leaf+1.769
ting+1.669
orry+1.600
unk+1.554
 ear+1.545
Neg logit contributions
 ne-1.182
 One-1.127
 cheaper-1.094
 swe-1.077
One-1.051
Token the
Token ID262
Feature activation-3.57e+00

Pos logit contributions
leaf+0.846
ting+0.818
orry+0.787
unk+0.777
 upper+0.750
Neg logit contributions
 ne-0.653
 One-0.642
 cheaper-0.611
One-0.609
 swe-0.607
Token screen
Token ID3159
Feature activation-1.04e+00

Pos logit contributions
leaf+6.383
ying+6.367
unk+6.249
asing+6.205
orry+6.157
Neg logit contributions
 One-5.017
 coll-4.877
 swe-4.727
 saf-4.720
One-4.544
Token.
Token ID13
Feature activation-3.91e-01

Pos logit contributions
leaf+1.913
ting+1.812
orry+1.756
unk+1.731
 ear+1.679
Neg logit contributions
 ne-1.269
 One-1.244
 cheaper-1.174
 swe-1.159
 coll-1.156
Token The
Token ID383
Feature activation-3.15e+00

Pos logit contributions
leaf+0.749
ting+0.741
orry+0.703
unk+0.694
 ear+0.681
Neg logit contributions
 ne-0.425
 One-0.423
 cheaper-0.390
 swe-0.385
One-0.384
Token butterfly
Token ID35113
Feature activation-1.03e-01

Pos logit contributions
leaf+5.584
idget+5.578
ting+5.572
udge+5.530
ash+5.492
Neg logit contributions
 coll-4.231
 One-3.959
 ne-3.892
 swe-3.882
 saf-3.830
Token was
Token ID373
Feature activation-3.13e-01

Pos logit contributions
leaf+0.183
ting+0.180
orry+0.170
unk+0.169
 ear+0.166
Neg logit contributions
 ne-0.126
 One-0.120
 cheaper-0.117
 swe-0.116
 coll-0.116
Token she
Token ID673
Feature activation-2.00e+00

Pos logit contributions
leaf+1.426
ting+1.378
unk+1.325
orry+1.311
 upper+1.294
Neg logit contributions
 ne-0.636
 One-0.625
 swe-0.581
 cheaper-0.571
One-0.567
Token ran
Token ID4966
Feature activation-1.20e+00

Pos logit contributions
leaf+3.433
ting+3.397
other+3.342
unk+3.329
orry+3.301
Neg logit contributions
 ne-2.884
 swe-2.432
 cheaper-2.356
 One-2.307
 coll-2.244
Token and
Token ID290
Feature activation-2.13e+00

Pos logit contributions
leaf+1.893
ting+1.810
orry+1.781
 upper+1.778
unk+1.770
Neg logit contributions
 ne-1.719
 One-1.681
 cheaper-1.622
 swe-1.588
One-1.585
Token showed
Token ID3751
Feature activation-2.32e+00

Pos logit contributions
leaf+3.845
unk+3.708
orry+3.588
ting+3.572
 upper+3.531
Neg logit contributions
 ne-3.018
 swe-2.606
 One-2.551
 scar-2.430
 cheaper-2.430
Token it
Token ID340
Feature activation-7.98e-01

Pos logit contributions
leaf+4.601
ting+4.460
unk+4.390
 upper+4.288
iders+4.262
Neg logit contributions
 One-2.591
 swe-2.515
 enough-2.513
 ne-2.499
 coll-2.306
Token to
Token ID284
Feature activation-3.52e+00

Pos logit contributions
leaf+1.472
ting+1.454
unk+1.366
orry+1.362
 upper+1.345
Neg logit contributions
 One-0.927
 ne-0.919
 swe-0.888
 cheaper-0.877
 coll-0.863
Token her
Token ID607
Feature activation-1.03e+00

Pos logit contributions
leaf+8.826
ater+8.461
ting+8.363
ash+8.337
rified+8.324
Neg logit contributions
 One-3.336
 ne-2.467
 enough-2.354
 saf-2.336
 better-2.308
Token mom
Token ID1995
Feature activation+4.93e-01

Pos logit contributions
leaf+1.556
ting+1.534
orry+1.374
 upper+1.363
unk+1.349
Neg logit contributions
 ne-1.622
 One-1.562
 cheaper-1.538
 swe-1.530
 coll-1.528
Token.
Token ID13
Feature activation-1.75e+00

Pos logit contributions
 ne+0.708
 One+0.671
 coll+0.664
 cheaper+0.662
 swe+0.652
Neg logit contributions
ting-0.766
leaf-0.755
orry-0.697
 ear-0.695
unk-0.694
Token Mom
Token ID11254
Feature activation-5.25e-01

Pos logit contributions
leaf+3.797
ting+3.795
orry+3.466
 upper+3.404
vell+3.400
Neg logit contributions
 One-1.881
 ne-1.479
One-1.477
 swe-1.410
 coll-1.345
Token was
Token ID373
Feature activation-8.43e-01

Pos logit contributions
leaf+0.893
ting+0.879
orry+0.828
unk+0.820
 upper+0.817
Neg logit contributions
 ne-0.699
 One-0.653
 cheaper-0.641
 swe-0.640
 coll-0.629
Token was
Token ID373
Feature activation-1.33e+00

Pos logit contributions
leaf+1.006
ting+0.981
orry+0.922
unk+0.909
 ear+0.902
Neg logit contributions
 ne-0.714
 One-0.677
 cheaper-0.641
 swe-0.640
One-0.639
Token surprised
Token ID6655
Feature activation-6.13e-01

Pos logit contributions
leaf+2.105
ting+2.028
orry+1.910
 upper+1.847
other+1.828
Neg logit contributions
 ne-1.998
 One-1.881
 cheaper-1.872
 swe-1.802
 enough-1.791
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
leaf+1.074
ting+1.026
orry+0.993
unk+0.984
 upper+0.970
Neg logit contributions
 ne-0.792
 One-0.767
 cheaper-0.735
 swe-0.734
 coll-0.722
Token He
Token ID679
Feature activation-5.30e-01

Pos logit contributions
leaf+1.137
ting+1.130
orry+1.076
unk+1.064
 ear+1.044
Neg logit contributions
 One-0.480
 ne-0.466
One-0.426
 cheaper-0.424
 swe-0.407
Token touched
Token ID12615
Feature activation-1.54e+00

Pos logit contributions
leaf+0.839
ting+0.836
unk+0.772
orry+0.771
 upper+0.749
Neg logit contributions
 ne-0.767
 One-0.735
 cheaper-0.704
 swe-0.701
One-0.699
Token the
Token ID262
Feature activation-3.51e+00

Pos logit contributions
leaf+2.984
ting+2.744
unk+2.730
orry+2.725
 upper+2.626
Neg logit contributions
 One-1.805
 ne-1.777
 swe-1.667
 cheaper-1.660
 enough-1.645
Token screen
Token ID3159
Feature activation+2.09e-01

Pos logit contributions
orry+5.248
leaf+5.224
unk+5.219
ting+5.164
idget+5.027
Neg logit contributions
 One-5.684
 coll-5.563
 saf-5.497
 ne-5.426
One-5.340
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 ne+0.285
 One+0.276
 cheaper+0.270
 coll+0.269
 swe+0.268
Neg logit contributions
leaf-0.335
ting-0.333
orry-0.310
unk-0.307
 ear-0.305
Token it
Token ID340
Feature activation-1.64e-01

Pos logit contributions
leaf+2.364
ting+2.249
unk+2.222
orry+2.185
asp+2.124
Neg logit contributions
 ne-1.612
 One-1.466
 cheaper-1.431
 swe-1.426
 coll-1.368
Token made
Token ID925
Feature activation-3.83e-01

Pos logit contributions
leaf+0.281
ting+0.276
orry+0.260
unk+0.258
 ear+0.254
Neg logit contributions
 ne-0.212
 One-0.203
 cheaper-0.198
 swe-0.197
 coll-0.195
Token such
Token ID884
Feature activation-4.33e-01

Pos logit contributions
leaf+0.666
ting+0.657
orry+0.619
unk+0.616
 ear+0.605
Neg logit contributions
 ne-0.483
 One-0.468
 cheaper-0.451
 swe-0.449
 coll-0.448
Token and
Token ID290
Feature activation-1.24e+00

Pos logit contributions
 coll+2.303
 swe+2.240
 One+2.239
 cheaper+2.226
 ne+2.220
Neg logit contributions
ting-2.106
leaf-2.015
 ear-1.926
unk-1.851
 upper-1.837
Token pushed
Token ID7121
Feature activation-6.20e-01

Pos logit contributions
leaf+2.267
ting+2.161
unk+2.146
orry+2.103
other+2.076
Neg logit contributions
 ne-1.599
 One-1.455
 swe-1.411
 cheaper-1.390
One-1.333
Token the
Token ID262
Feature activation-3.45e+00

Pos logit contributions
leaf+1.111
ting+1.066
unk+1.011
orry+1.009
 upper+0.976
Neg logit contributions
 ne-0.789
 One-0.771
 swe-0.733
 cheaper-0.728
One-0.725
Token button
Token ID4936
Feature activation+2.42e-01

Pos logit contributions
orry+5.605
udge+5.305
asing+5.205
ting+5.103
leaf+5.071
Neg logit contributions
 One-5.511
 swe-5.374
 ne-5.124
 cheaper-5.072
 works-5.058
Token on
Token ID319
Feature activation-4.34e-01

Pos logit contributions
 ne+0.357
 One+0.347
 cheaper+0.339
 coll+0.339
 swe+0.336
Neg logit contributions
leaf-0.360
ting-0.360
orry-0.333
unk-0.329
 ear-0.327
Token the
Token ID262
Feature activation-3.46e+00

Pos logit contributions
leaf+0.755
ting+0.721
orry+0.687
unk+0.677
 upper+0.658
Neg logit contributions
 ne-0.570
 One-0.561
 cheaper-0.533
 swe-0.528
One-0.526
Token television
Token ID5581
Feature activation-1.18e+00

Pos logit contributions
orry+5.407
leaf+5.138
udge+5.135
asing+5.039
unk+5.032
Neg logit contributions
 One-5.803
 swe-5.523
 cheaper-5.386
 saf-5.384
 coll-5.376
Token.
Token ID13
Feature activation-9.75e-01

Pos logit contributions
leaf+2.222
ting+2.115
orry+2.045
unk+2.034
 upper+1.944
Neg logit contributions
 ne-1.413
 One-1.385
 cheaper-1.308
 coll-1.291
 swe-1.287
Token The
Token ID383
Feature activation-3.13e+00

Pos logit contributions
leaf+2.031
ting+2.001
orry+1.870
unk+1.829
other+1.790
Neg logit contributions
 One-1.020
 ne-0.952
One-0.861
 cheaper-0.853
 swe-0.851
Token screen
Token ID3159
Feature activation-5.05e-01

Pos logit contributions
orry+4.832
udge+4.707
ting+4.670
leaf+4.579
asing+4.510
Neg logit contributions
 One-5.088
 swe-4.923
 ne-4.847
 cheaper-4.753
 saf-4.748
Token lit
Token ID6578
Feature activation+5.93e-01

Pos logit contributions
leaf+0.900
ting+0.886
orry+0.826
unk+0.824
 ear+0.807
Neg logit contributions
 ne-0.635
 One-0.600
 cheaper-0.582
 swe-0.576
 coll-0.570
Token with
Token ID351
Feature activation-4.78e-02

Pos logit contributions
leaf+0.540
ting+0.531
orry+0.502
unk+0.496
 upper+0.487
Neg logit contributions
 ne-0.478
 One-0.464
 coll-0.448
 cheaper-0.446
 swe-0.445
Token a
Token ID257
Feature activation-5.89e-01

Pos logit contributions
leaf+0.081
ting+0.080
orry+0.075
unk+0.075
 ear+0.074
Neg logit contributions
 ne-0.062
 One-0.060
 cheaper-0.058
 coll-0.058
 swe-0.058
Token long
Token ID890
Feature activation+1.53e-01

Pos logit contributions
ting+0.908
leaf+0.896
orry+0.836
unk+0.820
 upper+0.805
Neg logit contributions
 ne-0.871
 One-0.836
 cheaper-0.830
 coll-0.827
 swe-0.827
Token cable
Token ID7862
Feature activation-3.45e-01

Pos logit contributions
 ne+0.220
 One+0.213
 cheaper+0.210
 swe+0.207
 coll+0.207
Neg logit contributions
leaf-0.235
ting-0.232
orry-0.217
unk-0.215
 ear-0.213
Token.
Token ID13
Feature activation-2.42e-01

Pos logit contributions
leaf+0.566
ting+0.554
orry+0.526
unk+0.516
 upper+0.510
Neg logit contributions
 ne-0.466
 One-0.453
 cheaper-0.440
 coll-0.437
 swe-0.434
Token The
Token ID383
Feature activation-3.45e+00

Pos logit contributions
leaf+0.449
ting+0.441
orry+0.420
unk+0.417
 ear+0.409
Neg logit contributions
 One-0.274
 ne-0.274
 cheaper-0.255
 swe-0.255
 coll-0.254
Token cable
Token ID7862
Feature activation+1.87e-01

Pos logit contributions
orry+6.499
leaf+6.319
unk+6.264
udge+6.182
ocket+6.150
Neg logit contributions
 One-4.363
 swe-4.252
 cheaper-4.165
 coll-4.159
 ne-4.103
Token was
Token ID373
Feature activation+6.06e-01

Pos logit contributions
 ne+0.225
 One+0.217
 cheaper+0.212
 coll+0.211
 swe+0.210
Neg logit contributions
leaf-0.331
ting-0.328
orry-0.308
unk-0.305
 ear-0.303
Token harmless
Token ID23585
Feature activation-2.94e-01

Pos logit contributions
 ne+0.805
 One+0.786
 coll+0.767
 cheaper+0.755
 swe+0.755
Neg logit contributions
leaf-0.982
ting-0.967
unk-0.909
 ear-0.905
orry-0.902
Token,
Token ID11
Feature activation+2.85e-01

Pos logit contributions
leaf+0.515
ting+0.502
orry+0.479
unk+0.473
 upper+0.467
Neg logit contributions
 ne-0.368
 One-0.355
 cheaper-0.345
 swe-0.344
 coll-0.342
Token so
Token ID523
Feature activation+3.57e-01

Pos logit contributions
 ne+0.438
 One+0.425
 cheaper+0.417
 coll+0.415
 swe+0.412
Neg logit contributions
leaf-0.408
ting-0.407
orry-0.372
unk-0.367
 ear-0.365
Token Tim
Token ID5045
Feature activation-6.65e-01

Pos logit contributions
leaf+1.238
ting+1.217
orry+1.146
unk+1.131
 ear+1.116
Neg logit contributions
 One-0.978
 ne-0.954
One-0.884
 swe-0.880
 coll-0.875
Token stood
Token ID6204
Feature activation-7.07e-01

Pos logit contributions
leaf+1.193
ting+1.181
orry+1.085
unk+1.078
 ear+1.065
Neg logit contributions
 ne-0.849
 One-0.799
 swe-0.765
One-0.754
 cheaper-0.749
Token up
Token ID510
Feature activation+1.48e+00

Pos logit contributions
leaf+1.095
ting+1.036
orry+0.988
unk+0.952
 upper+0.939
Neg logit contributions
 ne-1.089
 One-1.029
 cheaper-1.010
 swe-1.010
One-1.002
Token and
Token ID290
Feature activation-1.24e+00

Pos logit contributions
 coll+2.303
 swe+2.240
 One+2.239
 cheaper+2.226
 ne+2.220
Neg logit contributions
ting-2.106
leaf-2.015
 ear-1.926
unk-1.851
 upper-1.837
Token pushed
Token ID7121
Feature activation-6.20e-01

Pos logit contributions
leaf+2.267
ting+2.161
unk+2.146
orry+2.103
other+2.076
Neg logit contributions
 ne-1.599
 One-1.455
 swe-1.411
 cheaper-1.390
One-1.333
Token the
Token ID262
Feature activation-3.45e+00

Pos logit contributions
leaf+1.111
ting+1.066
unk+1.011
orry+1.009
 upper+0.976
Neg logit contributions
 ne-0.789
 One-0.771
 swe-0.733
 cheaper-0.728
One-0.725
Token button
Token ID4936
Feature activation+2.42e-01

Pos logit contributions
orry+5.605
udge+5.305
asing+5.205
ting+5.103
leaf+5.071
Neg logit contributions
 One-5.511
 swe-5.374
 ne-5.124
 cheaper-5.072
 works-5.058
Token on
Token ID319
Feature activation-4.34e-01

Pos logit contributions
 ne+0.357
 One+0.347
 cheaper+0.339
 coll+0.339
 swe+0.336
Neg logit contributions
leaf-0.360
ting-0.360
orry-0.333
unk-0.329
 ear-0.327
Token the
Token ID262
Feature activation-3.46e+00

Pos logit contributions
leaf+0.755
ting+0.721
orry+0.687
unk+0.677
 upper+0.658
Neg logit contributions
 ne-0.570
 One-0.561
 cheaper-0.533
 swe-0.528
One-0.526
Token television
Token ID5581
Feature activation-1.18e+00

Pos logit contributions
orry+5.407
leaf+5.138
udge+5.135
asing+5.039
unk+5.032
Neg logit contributions
 One-5.803
 swe-5.523
 cheaper-5.386
 saf-5.384
 coll-5.376
Token.
Token ID13
Feature activation-9.75e-01

Pos logit contributions
leaf+2.222
ting+2.115
orry+2.045
unk+2.034
 upper+1.944
Neg logit contributions
 ne-1.413
 One-1.385
 cheaper-1.308
 coll-1.291
 swe-1.287
Token closer
Token ID5699
Feature activation+8.04e-01

Pos logit contributions
ting+0.839
leaf+0.828
orry+0.777
unk+0.770
 ear+0.757
Neg logit contributions
 ne-0.678
 One-0.660
 cheaper-0.648
 coll-0.639
 swe-0.636
Token look
Token ID804
Feature activation-3.11e-01

Pos logit contributions
 ne+1.082
 cheaper+1.028
 swe+1.023
 One+1.006
 coll+1.001
Neg logit contributions
leaf-1.330
ting-1.291
 ear-1.222
orry-1.210
unk-1.209
Token.
Token ID13
Feature activation-3.68e-01

Pos logit contributions
leaf+0.520
ting+0.504
orry+0.480
unk+0.474
 upper+0.468
Neg logit contributions
 ne-0.415
 One-0.404
 cheaper-0.394
 swe-0.392
 coll-0.388
Token Suddenly
Token ID24975
Feature activation+7.63e-01

Pos logit contributions
leaf+0.706
ting+0.702
orry+0.664
unk+0.658
 ear+0.646
Neg logit contributions
 One-0.399
 ne-0.396
 cheaper-0.369
One-0.367
 swe-0.359
Token,
Token ID11
Feature activation-3.72e-01

Pos logit contributions
 ne+0.819
 coll+0.819
 cheaper+0.787
 swe+0.786
 One+0.784
Neg logit contributions
ting-1.420
leaf-1.409
orry-1.333
 ear-1.328
unk-1.317
Token the
Token ID262
Feature activation-3.43e+00

Pos logit contributions
leaf+0.710
ting+0.700
orry+0.669
unk+0.666
 ear+0.649
Neg logit contributions
 ne-0.405
 One-0.391
 cheaper-0.374
 swe-0.366
One-0.365
Token screen
Token ID3159
Feature activation-3.59e-01

Pos logit contributions
ting+6.011
leaf+5.781
unk+5.592
orry+5.579
udge+5.546
Neg logit contributions
 One-5.068
 saf-4.933
 ne-4.927
 cheaper-4.834
 coll-4.799
Token was
Token ID373
Feature activation+2.07e-01

Pos logit contributions
leaf+0.580
ting+0.567
orry+0.532
unk+0.526
 ear+0.517
Neg logit contributions
 ne-0.503
 One-0.482
 cheaper-0.468
 swe-0.463
 coll-0.461
Token making
Token ID1642
Feature activation+3.79e-01

Pos logit contributions
 ne+0.313
 One+0.306
 coll+0.298
 cheaper+0.297
 swe+0.296
Neg logit contributions
leaf-0.303
ting-0.299
orry-0.277
unk-0.275
 ear-0.272
Token loud
Token ID7812
Feature activation+1.31e+00

Pos logit contributions
 ne+0.525
 One+0.509
 cheaper+0.497
One+0.493
 swe+0.493
Neg logit contributions
leaf-0.603
ting-0.598
orry-0.557
unk-0.550
 ear-0.545
Token noises
Token ID26782
Feature activation-8.66e-01

Pos logit contributions
 ne+1.727
One+1.723
 One+1.716
 cheaper+1.689
 swe+1.618
Neg logit contributions
ting-2.220
leaf-2.117
 ear-1.974
orry-1.965
unk-1.944
Token Sara
Token ID24799
Feature activation+5.43e-01

Pos logit contributions
 ne+0.499
 One+0.481
 coll+0.478
 cheaper+0.473
 swe+0.469
Neg logit contributions
leaf-0.619
ting-0.618
orry-0.570
 ear-0.567
unk-0.566
Token sees
Token ID7224
Feature activation-7.31e-01

Pos logit contributions
 ne+0.746
 cheaper+0.722
 One+0.718
 swe+0.708
 coll+0.706
Neg logit contributions
leaf-0.866
ting-0.862
orry-0.804
unk-0.801
 ear-0.793
Token a
Token ID257
Feature activation-5.47e-01

Pos logit contributions
leaf+1.362
ting+1.334
orry+1.264
unk+1.250
 upper+1.240
Neg logit contributions
 ne-0.837
 One-0.820
 cheaper-0.788
 swe-0.780
 coll-0.780
Token light
Token ID1657
Feature activation-3.35e-01

Pos logit contributions
leaf+0.960
ting+0.959
orry+0.898
unk+0.881
 upper+0.864
Neg logit contributions
 ne-0.685
 cheaper-0.654
 One-0.650
 coll-0.647
 swe-0.638
Token on
Token ID319
Feature activation+1.63e-02

Pos logit contributions
leaf+0.467
ting+0.458
orry+0.429
unk+0.419
 upper+0.413
Neg logit contributions
 ne-0.537
 One-0.518
 cheaper-0.510
 coll-0.509
 swe-0.507
Token the
Token ID262
Feature activation-3.43e+00

Pos logit contributions
 ne+0.023
 One+0.022
 cheaper+0.022
 coll+0.022
 swe+0.022
Neg logit contributions
leaf-0.026
ting-0.025
orry-0.024
unk-0.024
 ear-0.023
Token screen
Token ID3159
Feature activation-9.21e-01

Pos logit contributions
unk+4.625
orry+4.538
asing+4.467
leaf+4.452
udge+4.409
Neg logit contributions
 saf-6.536
 coll-6.273
 One-6.193
 cheaper-5.937
 ne-5.915
Token.
Token ID13
Feature activation-5.53e-02

Pos logit contributions
leaf+1.677
ting+1.616
orry+1.546
unk+1.533
 upper+1.482
Neg logit contributions
 One-1.133
 ne-1.119
One-1.054
 cheaper-1.036
 coll-1.032
Token The
Token ID383
Feature activation-2.78e+00

Pos logit contributions
leaf+0.098
ting+0.097
orry+0.091
unk+0.090
 ear+0.089
Neg logit contributions
 ne-0.067
 One-0.065
 cheaper-0.063
 swe-0.062
 coll-0.062
Token light
Token ID1657
Feature activation-1.73e-01

Pos logit contributions
ting+4.734
orry+4.676
idget+4.650
leaf+4.644
unk+4.590
Neg logit contributions
 One-3.887
 saf-3.867
 ne-3.778
 coll-3.769
One-3.638
Token is
Token ID318
Feature activation-4.49e-01

Pos logit contributions
leaf+0.319
ting+0.316
orry+0.298
unk+0.295
 upper+0.291
Neg logit contributions
 ne-0.196
 One-0.188
 cheaper-0.183
 coll-0.182
 swe-0.181
Token
Token ID198
Feature activation-5.82e-01

Pos logit contributions
leaf+1.292
ting+1.280
orry+1.212
unk+1.206
 upper+1.179
Neg logit contributions
 One-0.657
 ne-0.623
 cheaper-0.583
One-0.580
 swe-0.566
Token
Token ID198
Feature activation-1.92e-01

Pos logit contributions
ting+1.182
leaf+1.180
orry+1.108
unk+1.098
 ear+1.096
Neg logit contributions
 One-0.564
 ne-0.554
One-0.536
 cheaper-0.514
 swe-0.496
TokenBilly
Token ID49640
Feature activation+2.16e-01

Pos logit contributions
leaf+0.317
ting+0.313
orry+0.293
unk+0.291
 ear+0.288
Neg logit contributions
 ne-0.257
 One-0.251
 cheaper-0.244
One-0.243
 swe-0.241
Token slowly
Token ID6364
Feature activation+4.57e-01

Pos logit contributions
 ne+0.300
 One+0.293
 cheaper+0.288
 coll+0.286
 swe+0.285
Neg logit contributions
leaf-0.342
ting-0.340
orry-0.317
unk-0.313
 ear-0.310
Token approached
Token ID10448
Feature activation-9.30e-01

Pos logit contributions
 ne+0.669
 One+0.664
 cheaper+0.650
 coll+0.648
 swe+0.643
Neg logit contributions
leaf-0.684
ting-0.679
orry-0.628
unk-0.610
 ear-0.609
Token the
Token ID262
Feature activation-3.36e+00

Pos logit contributions
leaf+1.668
ting+1.592
orry+1.571
unk+1.560
 upper+1.504
Neg logit contributions
 ne-1.158
 One-1.149
 cheaper-1.073
 swe-1.059
One-1.056
Token computer
Token ID3644
Feature activation+3.84e-02

Pos logit contributions
leaf+5.001
ting+4.841
unk+4.837
asc+4.782
orry+4.760
Neg logit contributions
 One-5.780
One-5.617
 coll-5.552
 cheaper-5.465
 saf-5.428
Token again
Token ID757
Feature activation+1.36e-01

Pos logit contributions
 ne+0.049
 One+0.047
 cheaper+0.046
 coll+0.046
 swe+0.046
Neg logit contributions
leaf-0.065
ting-0.064
orry-0.060
unk-0.060
 ear-0.059
Token,
Token ID11
Feature activation-8.95e-01

Pos logit contributions
 ne+0.175
 One+0.169
 cheaper+0.165
 swe+0.164
 coll+0.164
Neg logit contributions
leaf-0.231
ting-0.229
orry-0.215
unk-0.212
 ear-0.211
Token he
Token ID339
Feature activation-6.53e-01

Pos logit contributions
leaf+1.465
ting+1.400
unk+1.373
orry+1.372
 upper+1.319
Neg logit contributions
 ne-1.252
 One-1.189
 cheaper-1.169
 swe-1.153
 coll-1.136
Token slowly
Token ID6364
Feature activation+2.07e-01

Pos logit contributions
leaf+1.044
ting+1.023
orry+0.946
unk+0.945
 ear+0.922
Neg logit contributions
 ne-0.950
 One-0.893
 cheaper-0.872
 swe-0.865
 coll-0.857
Token.
Token ID13
Feature activation-9.72e-01

Pos logit contributions
leaf+1.397
ting+1.324
orry+1.281
unk+1.269
 upper+1.248
Neg logit contributions
 One-0.950
 ne-0.941
One-0.878
 coll-0.866
 cheaper-0.865
Token She
Token ID1375
Feature activation+2.73e-01

Pos logit contributions
leaf+1.948
ting+1.917
orry+1.844
unk+1.825
 upper+1.780
Neg logit contributions
 One-1.058
 ne-0.980
One-0.902
 swe-0.879
 coll-0.864
Token peek
Token ID27185
Feature activation-1.58e+00

Pos logit contributions
 ne+0.352
 One+0.343
 cheaper+0.341
 coll+0.338
 swe+0.335
Neg logit contributions
leaf-0.458
ting-0.453
orry-0.427
unk-0.421
 ear-0.418
Tokened
Token ID276
Feature activation-1.06e+00

Pos logit contributions
leaf+1.427
ting+1.306
unk+1.275
 upper+1.272
orry+1.249
Neg logit contributions
 ne-3.330
 One-3.192
 swe-3.138
One-3.087
 saf-3.077
Token around
Token ID1088
Feature activation-6.12e-01

Pos logit contributions
leaf+1.290
ting+1.186
orry+1.173
unk+1.162
 upper+1.141
Neg logit contributions
 ne-1.946
 One-1.899
 swe-1.840
 cheaper-1.814
 coll-1.793
Token the
Token ID262
Feature activation-3.36e+00

Pos logit contributions
leaf+1.104
ting+1.067
orry+1.030
unk+1.030
 upper+1.003
Neg logit contributions
 ne-0.747
 One-0.728
 swe-0.682
 coll-0.681
One-0.679
Token corner
Token ID5228
Feature activation-1.32e+00

Pos logit contributions
leaf+5.613
asing+5.407
ting+5.350
unk+5.252
orry+5.142
Neg logit contributions
One-4.936
 One-4.918
 ne-4.815
 saf-4.795
 swe-4.788
Token and
Token ID290
Feature activation-1.72e+00

Pos logit contributions
leaf+2.598
ting+2.482
unk+2.362
orry+2.325
 upper+2.319
Neg logit contributions
 One-1.473
 ne-1.462
One-1.378
 cheaper-1.346
 coll-1.318
Token saw
Token ID2497
Feature activation-1.08e+00

Pos logit contributions
leaf+3.114
unk+2.999
 upper+2.846
asp+2.845
ting+2.838
Neg logit contributions
 ne-2.227
 One-2.107
One-1.960
 swe-1.923
 saf-1.901
Token a
Token ID257
Feature activation-1.38e-01

Pos logit contributions
leaf+2.040
ting+1.992
orry+1.915
unk+1.908
 upper+1.893
Neg logit contributions
 ne-1.180
 One-1.162
 cheaper-1.095
 coll-1.093
 swe-1.088
Token pretty
Token ID2495
Feature activation-1.27e-01

Pos logit contributions
leaf+0.219
ting+0.216
orry+0.202
unk+0.199
 ear+0.196
Neg logit contributions
 ne-0.193
 One-0.186
 cheaper-0.183
 coll-0.181
 swe-0.181
Token M
Token ID337
Feature activation-9.40e-01

Pos logit contributions
leaf+1.690
ting+1.563
unk+1.458
orry+1.452
 upper+1.435
Neg logit contributions
 ne-2.124
 One-2.100
 swe-1.996
One-1.978
 coll-1.937
Tokenandy
Token ID10757
Feature activation-1.89e-01

Pos logit contributions
leaf+1.490
ting+1.472
 ear+1.324
 upper+1.297
unk+1.275
Neg logit contributions
 ne-1.352
 One-1.338
 swe-1.282
One-1.279
 coll-1.274
Token showed
Token ID3751
Feature activation-1.70e+00

Pos logit contributions
leaf+0.310
ting+0.305
orry+0.284
unk+0.281
 ear+0.279
Neg logit contributions
 ne-0.256
 One-0.248
 cheaper-0.239
 swe-0.239
 coll-0.239
Token her
Token ID607
Feature activation-1.68e+00

Pos logit contributions
leaf+3.194
ting+3.126
unk+2.935
 upper+2.915
orry+2.909
Neg logit contributions
 One-2.050
 swe-1.976
 ne-1.957
 enough-1.834
 coll-1.829
Token hoop
Token ID44792
Feature activation-7.06e-01

Pos logit contributions
leaf+2.926
ting+2.852
 upper+2.495
 Stop+2.464
orry+2.448
Neg logit contributions
 swe-2.380
 One-2.364
 ne-2.312
 coll-2.286
 cheaper-2.202
Token to
Token ID284
Feature activation-3.35e+00

Pos logit contributions
leaf+1.284
ting+1.252
orry+1.181
unk+1.170
 upper+1.152
Neg logit contributions
 ne-0.853
 One-0.837
 swe-0.806
 coll-0.800
 cheaper-0.797
Token her
Token ID607
Feature activation-9.65e-01

Pos logit contributions
leaf+8.518
ting+8.444
unk+7.979
umbling+7.919
orry+7.892
Neg logit contributions
 One-3.265
 ne-2.615
One-2.424
 saf-2.338
 Tweet-2.233
Token father
Token ID2988
Feature activation+6.20e-01

Pos logit contributions
leaf+1.637
ting+1.597
orry+1.449
 upper+1.431
unk+1.423
Neg logit contributions
 ne-1.341
 One-1.299
 swe-1.288
 coll-1.276
 cheaper-1.269
Token.
Token ID13
Feature activation-6.94e-01

Pos logit contributions
 ne+0.875
 cheaper+0.826
 One+0.822
 coll+0.821
 swe+0.812
Neg logit contributions
ting-0.979
leaf-0.967
 ear-0.898
orry-0.894
unk-0.892
Token
Token ID220
Feature activation-5.25e-01

Pos logit contributions
leaf+1.220
ting+1.195
orry+1.139
unk+1.108
 upper+1.084
Neg logit contributions
 One-0.894
 ne-0.866
 swe-0.809
 coll-0.803
One-0.799
Token
Token ID198
Feature activation-1.04e+00

Pos logit contributions
leaf+1.065
ting+1.057
orry+0.983
 ear+0.977
 upper+0.973
Neg logit contributions
 One-0.517
 ne-0.501
One-0.475
 cheaper-0.465
 coll-0.459
Token step
Token ID2239
Feature activation-1.40e+00

Pos logit contributions
ting+1.339
leaf+1.337
orry+1.231
unk+1.210
 upper+1.195
Neg logit contributions
 ne-1.132
 One-1.088
 coll-1.049
 swe-1.039
 cheaper-1.025
Token to
Token ID284
Feature activation-1.95e+00

Pos logit contributions
leaf+2.373
ting+2.269
orry+2.211
unk+2.176
 upper+2.152
Neg logit contributions
 ne-1.839
 One-1.797
One-1.712
 cheaper-1.706
 enough-1.671
Token get
Token ID651
Feature activation-6.99e-01

Pos logit contributions
leaf+3.700
ting+3.665
unk+3.642
asp+3.495
orry+3.487
Neg logit contributions
 One-2.322
 ne-2.272
One-2.049
 saf-2.007
 cheaper-1.975
Token closer
Token ID5699
Feature activation+3.73e-01

Pos logit contributions
leaf+1.169
ting+1.153
orry+1.109
unk+1.098
 upper+1.077
Neg logit contributions
 ne-0.927
 One-0.892
 cheaper-0.867
 swe-0.864
 coll-0.839
Token to
Token ID284
Feature activation-1.97e+00

Pos logit contributions
 ne+0.508
 One+0.487
 cheaper+0.481
 coll+0.477
 swe+0.476
Neg logit contributions
leaf-0.603
ting-0.600
unk-0.554
orry-0.554
 ear-0.551
Token the
Token ID262
Feature activation-3.34e+00

Pos logit contributions
leaf+3.872
unk+3.747
orry+3.623
asp+3.579
ting+3.578
Neg logit contributions
 One-2.382
 ne-2.322
 saf-2.097
One-2.087
 Nib-2.021
Token bug
Token ID5434
Feature activation-3.38e-01

Pos logit contributions
leaf+5.834
ting+5.512
asing+5.293
ash+5.179
 upper+5.042
Neg logit contributions
 One-4.876
One-4.774
 ne-4.627
 problem-4.616
 swe-4.560
Token.
Token ID13
Feature activation-5.73e-02

Pos logit contributions
leaf+0.582
ting+0.565
orry+0.536
unk+0.527
 upper+0.521
Neg logit contributions
 ne-0.439
 One-0.426
 coll-0.412
 swe-0.412
 cheaper-0.408
Token He
Token ID679
Feature activation-1.86e-01

Pos logit contributions
leaf+0.098
ting+0.097
orry+0.091
unk+0.090
 ear+0.089
Neg logit contributions
 ne-0.073
 One-0.071
 cheaper-0.069
 coll-0.068
 swe-0.068
Token wanted
Token ID2227
Feature activation-1.18e+00

Pos logit contributions
leaf+0.292
ting+0.289
orry+0.268
unk+0.265
 ear+0.262
Neg logit contributions
 ne-0.270
 One-0.259
 cheaper-0.252
 swe-0.251
 coll-0.250
Token to
Token ID284
Feature activation-1.24e+00

Pos logit contributions
leaf+2.287
ting+2.258
unk+2.176
orry+2.170
 upper+2.094
Neg logit contributions
 ne-1.280
 One-1.268
 cheaper-1.213
 swe-1.172
One-1.167
Token to
Token ID284
Feature activation-1.71e+00

Pos logit contributions
 ne+0.605
 One+0.581
 coll+0.572
 cheaper+0.569
 swe+0.566
Neg logit contributions
ting-0.738
leaf-0.737
orry-0.684
unk-0.678
 ear-0.677
Token fit
Token ID4197
Feature activation-8.20e-01

Pos logit contributions
ting+2.802
leaf+2.801
unk+2.681
orry+2.646
asp+2.495
Neg logit contributions
 ne-2.505
 One-2.385
One-2.286
 swe-2.250
 cheaper-2.233
Token under
Token ID739
Feature activation-2.49e-01

Pos logit contributions
leaf+1.392
ting+1.301
orry+1.285
unk+1.268
 upper+1.247
Neg logit contributions
 ne-1.122
 One-1.072
 swe-1.041
 cheaper-1.036
 coll-0.995
Token the
Token ID262
Feature activation-3.24e+00

Pos logit contributions
leaf+0.418
ting+0.407
orry+0.386
unk+0.382
 upper+0.373
Neg logit contributions
 ne-0.332
 One-0.323
 cheaper-0.310
 swe-0.310
 coll-0.307
Token table
Token ID3084
Feature activation-6.56e-01

Pos logit contributions
orry+4.473
unk+4.455
leaf+4.364
ting+4.268
asing+4.265
Neg logit contributions
 saf-5.829
 coll-5.698
 One-5.629
 swe-5.613
 ne-5.529
Token the
Token ID262
Feature activation-3.34e+00

Pos logit contributions
leaf+1.423
ting+1.370
orry+1.316
unk+1.296
 ear+1.283
Neg logit contributions
 ne-0.582
 One-0.557
 cheaper-0.527
 swe-0.514
 coll-0.506
Token wires
Token ID19474
Feature activation-1.00e+00

Pos logit contributions
orry+5.023
leaf+4.775
asing+4.749
unk+4.723
udge+4.631
Neg logit contributions
 swe-5.591
 coll-5.568
 cheaper-5.548
 saf-5.508
 ne-5.401
Token started
Token ID2067
Feature activation+5.54e-01

Pos logit contributions
leaf+2.063
ting+1.943
orry+1.916
unk+1.890
 ear+1.855
Neg logit contributions
 ne-1.022
 One-0.975
 cheaper-0.922
 swe-0.892
One-0.875
Token to
Token ID284
Feature activation+3.92e-01

Pos logit contributions
 ne+0.744
 One+0.728
 cheaper+0.712
 coll+0.708
One+0.707
Neg logit contributions
ting-0.896
leaf-0.882
unk-0.826
orry-0.823
 ear-0.814
Token spark
Token ID9009
Feature activation+1.41e-01

Pos logit contributions
 ne+0.583
 One+0.571
 coll+0.563
 cheaper+0.561
 swe+0.556
Neg logit contributions
leaf-0.580
ting-0.578
orry-0.530
unk-0.522
 ear-0.522
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
 ne+0.265
 One+0.259
 cheaper+0.255
 coll+0.253
 swe+0.252
Neg logit contributions
leaf-0.157
ting-0.156
orry-0.140
unk-0.138
 ear-0.135
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 ne+0.818
 coll+0.790
 One+0.785
 cheaper+0.784
 swe+0.773
Neg logit contributions
ting-0.910
leaf-0.886
orry-0.832
 ear-0.832
unk-0.827
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
leaf+2.401
ting+2.385
orry+2.225
unk+2.206
 upper+2.202
Neg logit contributions
 One-1.087
 ne-0.955
One-0.924
 saf-0.843
 coll-0.834
Token
Token ID198
Feature activation-1.19e+00

Pos logit contributions
leaf+0.210
ting+0.209
orry+0.197
unk+0.195
 ear+0.193
Neg logit contributions
 ne-0.112
 One-0.109
 cheaper-0.104
One-0.103
 coll-0.103
TokenSam
Token ID16305
Feature activation-3.51e-01

Pos logit contributions
leaf+2.398
ting+2.383
unk+2.239
 ear+2.232
orry+2.191
Neg logit contributions
One-1.311
 One-1.272
 ne-1.236
 saf-1.106
 coll-1.064
Token pushed
Token ID7121
Feature activation-5.02e-01

Pos logit contributions
leaf+0.585
ting+0.572
orry+0.537
unk+0.534
 ear+0.527
Neg logit contributions
 ne-0.476
 One-0.456
 cheaper-0.438
 swe-0.436
 coll-0.434
Token the
Token ID262
Feature activation-3.33e+00

Pos logit contributions
leaf+0.833
ting+0.803
unk+0.755
orry+0.754
 upper+0.738
Neg logit contributions
 ne-0.693
 One-0.677
 swe-0.643
One-0.641
 cheaper-0.639
Token ball
Token ID2613
Feature activation-3.14e-01

Pos logit contributions
leaf+6.268
ting+6.074
idget+5.793
asing+5.790
orry+5.721
Neg logit contributions
 ne-4.530
 saf-4.451
 One-4.399
One-4.330
 problem-4.288
Token towards
Token ID3371
Feature activation-1.45e+00

Pos logit contributions
leaf+0.524
ting+0.508
orry+0.478
unk+0.475
 ear+0.463
Neg logit contributions
 ne-0.425
 One-0.416
 cheaper-0.398
 swe-0.395
One-0.394
Token the
Token ID262
Feature activation-3.15e+00

Pos logit contributions
leaf+2.608
ting+2.363
unk+2.256
orry+2.219
asp+2.208
Neg logit contributions
 One-1.960
 ne-1.888
 saf-1.818
 coll-1.795
One-1.790
Token shadow
Token ID9082
Feature activation-5.66e-01

Pos logit contributions
leaf+6.670
ting+6.333
asing+6.321
orry+6.188
idget+6.152
Neg logit contributions
 ne-3.564
 saf-3.504
 One-3.461
One-3.352
 problem-3.259
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
leaf+1.061
ting+1.021
orry+0.971
unk+0.969
 upper+0.944
Neg logit contributions
 ne-0.662
 One-0.641
 cheaper-0.606
 swe-0.600
 coll-0.597
Token frustrated
Token ID14718
Feature activation-4.63e-01

Pos logit contributions
leaf+0.172
ting+0.170
orry+0.158
unk+0.156
 ear+0.154
Neg logit contributions
 ne-0.160
 One-0.154
 cheaper-0.151
 swe-0.150
 coll-0.149
Token.
Token ID13
Feature activation-1.62e-01

Pos logit contributions
leaf+0.814
ting+0.785
orry+0.743
unk+0.737
 upper+0.727
Neg logit contributions
 ne-0.589
 One-0.576
 cheaper-0.557
 swe-0.552
 coll-0.545
Token The
Token ID383
Feature activation-2.78e+00

Pos logit contributions
leaf+0.311
ting+0.307
orry+0.290
unk+0.288
 ear+0.284
Neg logit contributions
 ne-0.173
 One-0.171
 cheaper-0.161
 swe-0.159
One-0.159
Token buttons
Token ID12163
Feature activation+3.80e-01

Pos logit contributions
leaf+4.298
udge+4.221
orry+4.218
ting+4.216
unk+4.122
Neg logit contributions
 One-4.395
 swe-4.294
 ne-4.189
 cheaper-4.143
One-4.135
Token on
Token ID319
Feature activation-1.46e-01

Pos logit contributions
 ne+0.528
 One+0.515
 cheaper+0.504
 coll+0.502
 swe+0.501
Neg logit contributions
leaf-0.594
ting-0.594
orry-0.551
unk-0.543
 ear-0.539
Token the
Token ID262
Feature activation-3.32e+00

Pos logit contributions
leaf+0.236
ting+0.232
orry+0.218
unk+0.215
 ear+0.211
Neg logit contributions
 ne-0.203
 One-0.198
 cheaper-0.192
 swe-0.191
 coll-0.191
Token television
Token ID5581
Feature activation-5.44e-01

Pos logit contributions
orry+4.932
leaf+4.818
udge+4.793
unk+4.748
asing+4.699
Neg logit contributions
 swe-5.521
 One-5.419
 cheaper-5.400
 coll-5.362
 saf-5.325
Token were
Token ID547
Feature activation-3.46e-01

Pos logit contributions
leaf+0.972
ting+0.940
orry+0.894
unk+0.889
 ear+0.869
Neg logit contributions
 ne-0.670
 One-0.646
 cheaper-0.620
 coll-0.618
 swe-0.613
Token different
Token ID1180
Feature activation-2.51e-01

Pos logit contributions
leaf+0.567
ting+0.558
orry+0.524
unk+0.516
 upper+0.507
Neg logit contributions
 ne-0.474
 One-0.453
 cheaper-0.450
 swe-0.446
 coll-0.440
Token and
Token ID290
Feature activation-3.08e-01

Pos logit contributions
leaf+0.413
ting+0.403
orry+0.381
unk+0.375
 ear+0.371
Neg logit contributions
 ne-0.341
 One-0.328
 cheaper-0.323
 swe-0.321
 coll-0.319
Token she
Token ID673
Feature activation-9.21e-01

Pos logit contributions
leaf+0.525
ting+0.515
orry+0.485
unk+0.483
 upper+0.471
Neg logit contributions
 ne-0.402
 One-0.386
 cheaper-0.380
 swe-0.378
 coll-0.374
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
leaf+0.018
ting+0.018
orry+0.017
unk+0.017
 ear+0.016
Neg logit contributions
 ne-0.015
 One-0.015
 cheaper-0.015
 coll-0.015
 swe-0.015
Token
Token ID198
Feature activation+4.54e-01

Pos logit contributions
 ne+1.241
 coll+1.204
 swe+1.187
 cheaper+1.158
 saf+1.089
Neg logit contributions
leaf-1.807
ting-1.793
orry-1.691
 ear-1.672
unk-1.659
Token
Token ID198
Feature activation+8.92e-01

Pos logit contributions
 ne+0.524
 coll+0.493
 swe+0.492
 cheaper+0.486
 One+0.482
Neg logit contributions
leaf-0.832
ting-0.814
orry-0.779
unk-0.770
 ear-0.757
TokenHe
Token ID1544
Feature activation+9.94e-01

Pos logit contributions
 ne+1.056
 coll+1.019
 swe+1.001
 cheaper+0.971
 One+0.961
Neg logit contributions
leaf-1.609
ting-1.564
orry-1.510
unk-1.465
 upper-1.459
Token touched
Token ID12615
Feature activation-1.05e+00

Pos logit contributions
 coll+1.404
 cheaper+1.391
 ne+1.390
 One+1.366
 swe+1.364
Neg logit contributions
leaf-1.506
ting-1.484
orry-1.417
 ear-1.393
unk-1.362
Token the
Token ID262
Feature activation-3.31e+00

Pos logit contributions
leaf+1.871
ting+1.764
orry+1.713
unk+1.702
 upper+1.667
Neg logit contributions
 ne-1.333
 One-1.324
 cheaper-1.247
 swe-1.245
One-1.214
Token keyboard
Token ID10586
Feature activation-8.53e-02

Pos logit contributions
leaf+5.222
ting+4.992
opher+4.942
unk+4.892
asc+4.833
Neg logit contributions
 coll-5.461
 One-5.333
 cheaper-5.157
 swe-5.090
One-5.056
Token,
Token ID11
Feature activation-5.73e-01

Pos logit contributions
leaf+0.145
ting+0.143
orry+0.135
unk+0.133
 ear+0.131
Neg logit contributions
 ne-0.109
 One-0.106
 cheaper-0.103
 swe-0.102
 coll-0.102
Token expecting
Token ID12451
Feature activation-1.07e+00

Pos logit contributions
leaf+0.944
ting+0.918
orry+0.881
unk+0.875
 upper+0.842
Neg logit contributions
 ne-0.784
 One-0.755
 swe-0.734
 cheaper-0.733
 coll-0.722
Token the
Token ID262
Feature activation-2.68e+00

Pos logit contributions
leaf+1.955
ting+1.941
unk+1.858
orry+1.848
asp+1.784
Neg logit contributions
 ne-1.243
 One-1.234
 cheaper-1.199
 swe-1.163
 coll-1.135
Token computer
Token ID3644
Feature activation+3.78e-01

Pos logit contributions
ting+4.102
leaf+4.065
orry+3.910
unk+3.891
asp+3.788
Neg logit contributions
 One-4.217
 coll-4.183
 cheaper-4.160
 swe-4.053
One-3.979
Token trick
Token ID6908
Feature activation-1.42e+00

Pos logit contributions
ting+0.965
leaf+0.958
orry+0.886
unk+0.867
 upper+0.860
Neg logit contributions
 ne-0.898
 One-0.874
 coll-0.863
 cheaper-0.859
 swe-0.852
Token!
Token ID0
Feature activation-5.29e-01

Pos logit contributions
leaf+2.637
ting+2.477
orry+2.421
 upper+2.391
unk+2.385
Neg logit contributions
 ne-1.639
 One-1.611
 coll-1.571
 cheaper-1.569
 swe-1.540
Token She
Token ID1375
Feature activation-1.26e-01

Pos logit contributions
leaf+1.012
ting+0.999
unk+0.947
orry+0.940
 upper+0.914
Neg logit contributions
 One-0.592
 ne-0.566
 coll-0.532
 cheaper-0.531
One-0.530
Token stood
Token ID6204
Feature activation+1.13e-01

Pos logit contributions
leaf+0.211
ting+0.210
orry+0.195
unk+0.194
 ear+0.191
Neg logit contributions
 ne-0.167
 One-0.161
 cheaper-0.157
 swe-0.156
 coll-0.156
Token on
Token ID319
Feature activation+2.83e-01

Pos logit contributions
 ne+0.183
 One+0.178
 cheaper+0.175
 coll+0.174
 swe+0.174
Neg logit contributions
leaf-0.155
ting-0.154
orry-0.141
unk-0.140
 ear-0.138
Token the
Token ID262
Feature activation-3.31e+00

Pos logit contributions
 ne+0.385
 One+0.371
 cheaper+0.363
 swe+0.360
 coll+0.358
Neg logit contributions
leaf-0.459
ting-0.455
orry-0.424
 ear-0.420
unk-0.418
Token stage
Token ID3800
Feature activation-9.14e-01

Pos logit contributions
udge+5.526
asing+5.456
ting+5.403
orry+5.389
unk+5.325
Neg logit contributions
 coll-4.856
 ne-4.855
 saf-4.812
 cheaper-4.728
 swe-4.703
Token,
Token ID11
Feature activation-1.16e+00

Pos logit contributions
leaf+1.692
ting+1.631
unk+1.547
orry+1.547
 upper+1.506
Neg logit contributions
 ne-1.093
 One-1.056
 coll-1.013
 cheaper-0.998
 swe-0.987
Token holding
Token ID4769
Feature activation-2.00e+00

Pos logit contributions
leaf+1.990
unk+1.875
ting+1.873
orry+1.870
other+1.782
Neg logit contributions
 ne-1.569
 One-1.460
 swe-1.439
 cheaper-1.417
 coll-1.414
Token a
Token ID257
Feature activation-1.22e+00

Pos logit contributions
leaf+3.879
ting+3.759
unk+3.478
orry+3.458
coon+3.408
Neg logit contributions
 ne-2.334
 One-2.189
 swe-2.172
 coll-2.147
 enough-2.140
Token shiny
Token ID22441
Feature activation-6.69e-01

Pos logit contributions
ting+2.305
leaf+2.216
orry+2.113
unk+2.040
other+2.028
Neg logit contributions
 ne-1.485
 cheaper-1.412
 coll-1.409
 swe-1.378
 One-1.333
TokenJohn
Token ID7554
Feature activation-2.11e-01

Pos logit contributions
leaf+0.405
ting+0.403
orry+0.372
unk+0.371
 ear+0.366
Neg logit contributions
 ne-0.477
 One-0.468
One-0.457
 cheaper-0.455
 swe-0.450
Token nodded
Token ID14464
Feature activation+7.22e-01

Pos logit contributions
leaf+0.338
ting+0.334
orry+0.310
unk+0.306
 ear+0.302
Neg logit contributions
 ne-0.296
 One-0.286
 cheaper-0.276
 swe-0.276
 coll-0.274
Token.
Token ID13
Feature activation-8.61e-01

Pos logit contributions
 ne+1.022
 cheaper+1.006
 One+1.004
 coll+1.004
One+0.977
Neg logit contributions
ting-1.095
leaf-1.087
 ear-1.019
unk-0.999
orry-0.993
Token He
Token ID679
Feature activation-6.74e-01

Pos logit contributions
leaf+1.781
ting+1.751
orry+1.683
unk+1.649
 upper+1.603
Neg logit contributions
 One-0.864
 ne-0.819
 cheaper-0.753
 swe-0.747
One-0.743
Token grabbed
Token ID13176
Feature activation-1.58e+00

Pos logit contributions
ting+1.069
leaf+1.051
orry+0.963
unk+0.957
 ear+0.945
Neg logit contributions
 ne-0.993
 One-0.945
 swe-0.916
 cheaper-0.910
One-0.899
Token the
Token ID262
Feature activation-3.31e+00

Pos logit contributions
leaf+3.037
ting+2.975
orry+2.893
unk+2.859
 upper+2.806
Neg logit contributions
 ne-1.813
 One-1.767
 swe-1.653
 cheaper-1.637
 enough-1.609
Token cricket
Token ID18836
Feature activation-1.20e-01

Pos logit contributions
leaf+4.496
ting+4.450
orry+4.184
idget+4.045
asc+4.005
Neg logit contributions
 One-6.076
 coll-5.944
 swe-5.832
One-5.782
 ne-5.748
Token and
Token ID290
Feature activation-9.38e-01

Pos logit contributions
leaf+0.202
ting+0.199
orry+0.187
unk+0.184
 ear+0.182
Neg logit contributions
 ne-0.156
 One-0.150
 cheaper-0.146
 swe-0.146
 coll-0.145
Token put
Token ID1234
Feature activation-1.62e+00

Pos logit contributions
leaf+1.510
ting+1.487
orry+1.425
unk+1.420
 upper+1.377
Neg logit contributions
 ne-1.369
 swe-1.263
 One-1.250
 cheaper-1.227
 coll-1.223
Token it
Token ID340
Feature activation-1.22e-01

Pos logit contributions
leaf+3.016
ting+2.837
orry+2.797
unk+2.769
 upper+2.737
Neg logit contributions
 ne-2.006
 One-1.902
 swe-1.811
 cheaper-1.802
 enough-1.785
Token carefully
Token ID7773
Feature activation+7.55e-01

Pos logit contributions
leaf+0.146
ting+0.142
orry+0.130
unk+0.128
 ear+0.125
Neg logit contributions
 ne-0.220
 One-0.213
 cheaper-0.210
 swe-0.209
 coll-0.208
Token a
Token ID257
Feature activation+7.94e-01

Pos logit contributions
 ne+0.382
 One+0.368
 cheaper+0.363
 swe+0.359
 coll+0.358
Neg logit contributions
leaf-0.451
ting-0.448
orry-0.417
 ear-0.411
unk-0.410
Token really
Token ID1107
Feature activation+2.29e-01

Pos logit contributions
 ne+0.972
 One+0.928
 swe+0.918
 coll+0.916
 cheaper+0.914
Neg logit contributions
leaf-1.392
ting-1.333
orry-1.283
 ear-1.256
unk-1.250
Token high
Token ID1029
Feature activation+1.48e+00

Pos logit contributions
 ne+0.275
 One+0.265
 cheaper+0.259
 coll+0.258
 swe+0.256
Neg logit contributions
leaf-0.406
ting-0.398
orry-0.377
unk-0.373
 ear-0.372
Token and
Token ID290
Feature activation+5.69e-02

Pos logit contributions
 cheaper+2.218
 ne+2.209
 One+2.206
 coll+2.199
One+2.180
Neg logit contributions
ting-2.119
leaf-2.044
 ear-1.951
unk-1.876
orry-1.837
Token long
Token ID890
Feature activation+1.09e+00

Pos logit contributions
 ne+0.078
 One+0.076
 cheaper+0.074
 coll+0.073
 swe+0.073
Neg logit contributions
leaf-0.092
ting-0.091
orry-0.085
unk-0.084
 ear-0.083
Token climb
Token ID12080
Feature activation+2.26e+00

Pos logit contributions
 cheaper+1.503
 ne+1.498
 swe+1.463
 One+1.461
One+1.445
Neg logit contributions
ting-1.676
leaf-1.655
 ear-1.626
unk-1.563
orry-1.551
Token up
Token ID510
Feature activation+2.15e+00

Pos logit contributions
 cheaper+4.176
 coll+4.156
One+4.118
 swe+4.097
 ne+4.087
Neg logit contributions
ting-2.839
leaf-2.322
 behind-2.193
 ear-2.191
 around-2.122
Token the
Token ID262
Feature activation+9.21e-01

Pos logit contributions
 coll+3.579
 cheaper+3.558
 swe+3.553
 ne+3.466
One+3.422
Neg logit contributions
ting-2.878
leaf-2.727
 ear-2.598
 behind-2.542
orry-2.400
Token mountains
Token ID12269
Feature activation+4.32e-01

Pos logit contributions
 ne+1.366
 coll+1.326
 One+1.316
 cheaper+1.311
 swe+1.304
Neg logit contributions
leaf-1.351
ting-1.348
 ear-1.237
orry-1.231
 upper-1.218
Tokenide
Token ID485
Feature activation+4.91e-01

Pos logit contributions
 ne+0.597
 One+0.579
 cheaper+0.572
 swe+0.571
 coll+0.570
Neg logit contributions
ting-0.683
leaf-0.677
orry-0.625
unk-0.620
 ear-0.612
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 ne+0.836
 One+0.816
 cheaper+0.812
 coll+0.806
 swe+0.803
Neg logit contributions
ting-0.628
leaf-0.613
unk-0.559
orry-0.558
 ear-0.550
Token bird
Token ID6512
Feature activation+4.37e-01

Pos logit contributions
 ne+0.479
 One+0.465
 cheaper+0.452
 swe+0.447
 coll+0.447
Neg logit contributions
leaf-0.591
ting-0.588
orry-0.549
unk-0.545
 ear-0.545
Token recovered
Token ID11911
Feature activation+7.86e-01

Pos logit contributions
 ne+0.581
 One+0.568
 cheaper+0.557
 coll+0.550
 swe+0.549
Neg logit contributions
ting-0.711
leaf-0.709
orry-0.661
unk-0.655
 ear-0.650
Token.
Token ID13
Feature activation+2.20e-02

Pos logit contributions
 One+1.200
 ne+1.194
One+1.162
 cheaper+1.150
 coll+1.141
Neg logit contributions
ting-1.164
leaf-1.116
unk-1.021
 ear-1.016
orry-1.012
Token\n
Token ID198
Feature activation+1.07e-01

Pos logit contributions
 ne+0.026
 One+0.025
 cheaper+0.024
 coll+0.024
 swe+0.024
Neg logit contributions
leaf-0.040
ting-0.039
orry-0.037
unk-0.037
 ear-0.036
Token\n
Token ID198
Feature activation+1.81e-01

Pos logit contributions
 ne+0.120
 One+0.114
 cheaper+0.112
 swe+0.111
 coll+0.111
Neg logit contributions
leaf-0.200
ting-0.197
orry-0.187
unk-0.185
 ear-0.183
TokenSoon
Token ID28093
Feature activation+2.20e+00

Pos logit contributions
 ne+0.241
 One+0.231
 cheaper+0.228
 swe+0.226
 coll+0.226
Neg logit contributions
leaf-0.303
ting-0.298
orry-0.280
unk-0.276
 ear-0.272
Token the
Token ID262
Feature activation+7.94e-01

Pos logit contributions
 coll+2.976
One+2.974
 cheaper+2.898
 swe+2.866
 One+2.847
Neg logit contributions
ting-3.896
 ear-3.435
leaf-3.423
orry-3.386
 Tony-3.378
Token time
Token ID640
Feature activation+1.21e+00

Pos logit contributions
 ne+1.059
 One+1.033
 cheaper+1.007
 coll+1.003
 swe+0.999
Neg logit contributions
leaf-1.293
ting-1.282
 ear-1.220
orry-1.214
unk-1.206
Token had
Token ID550
Feature activation+1.62e+00

Pos logit contributions
One+1.751
 ne+1.739
 coll+1.737
 One+1.734
 swe+1.733
Neg logit contributions
ting-1.860
leaf-1.782
orry-1.677
unk-1.645
 ear-1.608
Token come
Token ID1282
Feature activation+1.68e+00

Pos logit contributions
 ne+2.171
 coll+2.149
One+2.136
 cheaper+2.134
 One+2.129
Neg logit contributions
leaf-2.576
ting-2.488
orry-2.367
unk-2.345
 ear-2.330
Token for
Token ID329
Feature activation+8.50e-01

Pos logit contributions
 ne+3.177
 cheaper+3.042
 One+3.005
One+3.000
 coll+2.995
Neg logit contributions
ting-1.989
leaf-1.763
 ear-1.746
unk-1.732
orry-1.612
Token girl
Token ID2576
Feature activation-2.98e-01

Pos logit contributions
leaf+0.153
ting+0.151
orry+0.140
unk+0.138
 ear+0.135
Neg logit contributions
 ne-0.162
 One-0.158
 cheaper-0.154
 coll-0.154
 swe-0.153
Token and
Token ID290
Feature activation-4.76e-01

Pos logit contributions
leaf+0.489
ting+0.479
orry+0.448
unk+0.443
 upper+0.437
Neg logit contributions
 ne-0.405
 One-0.398
 cheaper-0.383
 coll-0.380
 swe-0.379
Token she
Token ID673
Feature activation-6.67e-02

Pos logit contributions
leaf+0.704
ting+0.702
unk+0.649
orry+0.648
 upper+0.625
Neg logit contributions
 ne-0.734
 One-0.716
 swe-0.692
 cheaper-0.690
 coll-0.683
Token liked
Token ID8288
Feature activation+2.80e-01

Pos logit contributions
leaf+0.076
ting+0.074
orry+0.067
unk+0.066
 ear+0.065
Neg logit contributions
 ne-0.125
 One-0.122
 cheaper-0.119
 swe-0.119
 coll-0.118
Token going
Token ID1016
Feature activation+1.65e+00

Pos logit contributions
 ne+0.376
 One+0.362
 cheaper+0.356
 coll+0.352
 swe+0.351
Neg logit contributions
leaf-0.461
ting-0.457
orry-0.423
unk-0.419
 ear-0.418
Token on
Token ID319
Feature activation+2.46e+00

Pos logit contributions
 ne+2.881
 cheaper+2.760
 swe+2.757
 coll+2.754
 One+2.716
Neg logit contributions
ting-2.173
leaf-2.062
 ear-1.853
other-1.805
unk-1.791
Token adventures
Token ID17545
Feature activation+8.39e-01

Pos logit contributions
 ne+2.318
 cheaper+2.233
 shin+2.062
 Nib+2.033
 One+2.002
Neg logit contributions
ting-4.971
leaf-4.913
orry-4.605
 ear-4.564
 expecting-4.515
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
 ne+1.376
 cheaper+1.349
 swe+1.337
 coll+1.332
 One+1.326
Neg logit contributions
leaf-1.097
ting-1.092
 ear-0.975
orry-0.972
unk-0.961
Token One
Token ID1881
Feature activation-7.64e-01

Pos logit contributions
 ne+0.078
 One+0.069
 cheaper+0.069
 coll+0.067
 swe+0.067
Neg logit contributions
leaf-0.315
ting-0.313
orry-0.300
unk-0.297
 ear-0.296
Token day
Token ID1110
Feature activation-6.98e-01

Pos logit contributions
leaf+1.424
ting+1.414
unk+1.310
orry+1.305
 upper+1.273
Neg logit contributions
 One-0.902
 ne-0.894
One-0.857
 saf-0.834
 swe-0.827
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
leaf+1.586
ting+1.533
orry+1.480
unk+1.463
 upper+1.460
Neg logit contributions
 One-0.543
 ne-0.539
One-0.480
 swe-0.476
 coll-0.452
Token about
Token ID546
Feature activation+1.55e-01

Pos logit contributions
leaf+0.855
ting+0.834
orry+0.796
unk+0.786
 upper+0.768
Neg logit contributions
 ne-0.617
 One-0.613
 cheaper-0.587
 swe-0.580
One-0.575
Token this
Token ID428
Feature activation+7.63e-01

Pos logit contributions
 ne+0.228
 One+0.221
 cheaper+0.216
 coll+0.215
 swe+0.214
Neg logit contributions
leaf-0.236
ting-0.233
orry-0.216
unk-0.213
 ear-0.212
Token!
Token ID0
Feature activation+8.92e-01

Pos logit contributions
 ne+1.031
 One+0.977
 coll+0.966
 swe+0.957
 cheaper+0.954
Neg logit contributions
leaf-1.233
ting-1.222
 ear-1.144
orry-1.138
unk-1.125
Token\n
Token ID198
Feature activation+1.88e-01

Pos logit contributions
 ne+1.103
 coll+1.041
 cheaper+1.033
 swe+1.021
One+0.963
Neg logit contributions
leaf-1.550
ting-1.537
 ear-1.436
orry-1.433
unk-1.421
Token\n
Token ID198
Feature activation+1.07e+00

Pos logit contributions
 ne+0.208
 One+0.196
 cheaper+0.194
 swe+0.193
 coll+0.192
Neg logit contributions
leaf-0.353
ting-0.348
orry-0.330
unk-0.327
 ear-0.323
TokenSo
Token ID2396
Feature activation+1.86e+00

Pos logit contributions
 ne+1.196
 coll+1.163
 cheaper+1.121
 swe+1.107
 One+1.080
Neg logit contributions
leaf-1.995
ting-1.990
orry-1.851
other-1.814
unk-1.804
Token Bobby
Token ID17155
Feature activation+1.25e+00

Pos logit contributions
 coll+2.407
 cheaper+2.316
 ne+2.270
 swe+2.202
One+2.145
Neg logit contributions
ting-3.207
 ear-3.085
leaf-2.976
orry-2.856
other-2.795
Token started
Token ID2067
Feature activation+1.20e+00

Pos logit contributions
 cheaper+1.701
 ne+1.694
 swe+1.678
 coll+1.661
 One+1.640
Neg logit contributions
ting-2.024
leaf-2.001
orry-1.834
 ear-1.824
unk-1.814
Token packing
Token ID24157
Feature activation+2.06e-01

Pos logit contributions
 ne+1.858
 cheaper+1.827
 One+1.777
 swe+1.770
 coll+1.757
Neg logit contributions
ting-1.729
leaf-1.711
 ear-1.587
unk-1.530
orry-1.502
Token his
Token ID465
Feature activation+1.70e-01

Pos logit contributions
 ne+0.278
 One+0.269
 cheaper+0.263
 coll+0.262
 swe+0.261
Neg logit contributions
leaf-0.335
ting-0.332
orry-0.309
unk-0.306
 ear-0.305
Token bag
Token ID6131
Feature activation-6.83e-01

Pos logit contributions
 ne+0.213
 One+0.205
 cheaper+0.200
 coll+0.199
 swe+0.198
Neg logit contributions
leaf-0.293
ting-0.289
orry-0.272
unk-0.271
 ear-0.269
Token with
Token ID351
Feature activation-5.57e-01

Pos logit contributions
 ne+1.102
 One+1.080
One+1.057
 cheaper+1.033
 coll+1.033
Neg logit contributions
ting-1.176
leaf-1.164
orry-1.085
 ear-1.080
unk-1.071
Token soap
Token ID19533
Feature activation+2.19e-01

Pos logit contributions
leaf+0.976
ting+0.964
orry+0.909
unk+0.903
 upper+0.870
Neg logit contributions
 ne-0.709
 One-0.687
 swe-0.670
 cheaper-0.669
 coll-0.669
Token and
Token ID290
Feature activation+3.06e-01

Pos logit contributions
 ne+0.296
 One+0.287
 cheaper+0.279
 coll+0.277
 swe+0.276
Neg logit contributions
leaf-0.354
ting-0.353
orry-0.327
unk-0.326
 ear-0.322
Token water
Token ID1660
Feature activation+3.45e-02

Pos logit contributions
 ne+0.455
 One+0.445
 cheaper+0.434
One+0.429
 coll+0.428
Neg logit contributions
leaf-0.455
ting-0.448
orry-0.415
 ear-0.411
unk-0.408
Token.
Token ID13
Feature activation+4.14e-01

Pos logit contributions
 ne+0.047
 One+0.045
 cheaper+0.044
 coll+0.044
 swe+0.044
Neg logit contributions
leaf-0.056
ting-0.055
orry-0.052
unk-0.051
 ear-0.051
Token It
Token ID632
Feature activation+2.03e+00

Pos logit contributions
 ne+0.400
 cheaper+0.366
 coll+0.363
 One+0.362
 swe+0.360
Neg logit contributions
leaf-0.844
ting-0.833
orry-0.789
 ear-0.785
unk-0.782
Token shines
Token ID32481
Feature activation+7.07e-01

Pos logit contributions
One+2.740
 ne+2.736
 One+2.734
 saf+2.704
 coll+2.692
Neg logit contributions
leaf-3.199
ting-3.051
orry-2.952
 ear-2.875
unk-2.844
Token again
Token ID757
Feature activation+1.64e-01

Pos logit contributions
 ne+1.166
 One+1.158
One+1.124
 coll+1.119
 cheaper+1.116
Neg logit contributions
ting-0.940
leaf-0.916
orry-0.829
 ear-0.827
unk-0.816
Token.
Token ID13
Feature activation+2.91e-01

Pos logit contributions
 ne+0.221
 One+0.214
 cheaper+0.209
 coll+0.207
 swe+0.207
Neg logit contributions
leaf-0.268
ting-0.267
orry-0.249
unk-0.247
 ear-0.245
Token\n
Token ID198
Feature activation+2.08e-01

Pos logit contributions
 ne+0.325
 One+0.303
 cheaper+0.303
 coll+0.301
 swe+0.300
Neg logit contributions
leaf-0.546
ting-0.539
orry-0.509
unk-0.505
 ear-0.503
Token\n
Token ID198
Feature activation-1.92e-02

Pos logit contributions
 ne+0.242
 One+0.229
 cheaper+0.226
 swe+0.225
 coll+0.224
Neg logit contributions
leaf-0.380
ting-0.374
orry-0.355
unk-0.351
 ear-0.347
Token happy
Token ID3772
Feature activation+1.32e-01

Pos logit contributions
 ne+0.377
 One+0.368
 cheaper+0.360
 coll+0.359
 swe+0.357
Neg logit contributions
leaf-0.311
ting-0.309
orry-0.284
unk-0.282
 ear-0.279
Token and
Token ID290
Feature activation+4.92e-01

Pos logit contributions
 ne+0.178
 One+0.172
 cheaper+0.169
 coll+0.167
 swe+0.167
Neg logit contributions
leaf-0.215
ting-0.213
orry-0.199
unk-0.197
 ear-0.195
Token healthy
Token ID5448
Feature activation+2.42e-01

Pos logit contributions
 ne+0.698
 One+0.684
 cheaper+0.669
 coll+0.669
One+0.661
Neg logit contributions
leaf-0.755
ting-0.747
 ear-0.694
orry-0.689
unk-0.677
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
 ne+0.332
 One+0.322
 cheaper+0.315
 coll+0.313
 swe+0.311
Neg logit contributions
ting-0.388
leaf-0.387
orry-0.357
unk-0.354
 ear-0.353
Token\n
Token ID198
Feature activation+1.05e-01

Pos logit contributions
 ne+0.167
 One+0.157
 cheaper+0.156
 coll+0.155
 swe+0.154
Neg logit contributions
leaf-0.278
ting-0.276
orry-0.261
unk-0.258
 ear-0.256
Token\n
Token ID198
Feature activation+2.63e-01

Pos logit contributions
 ne+0.139
 One+0.133
 cheaper+0.131
 swe+0.130
 coll+0.130
Neg logit contributions
leaf-0.176
ting-0.173
orry-0.163
unk-0.161
 ear-0.159
TokenThe
Token ID464
Feature activation-5.96e-01

Pos logit contributions
 ne+0.297
 One+0.281
 cheaper+0.277
 swe+0.276
 coll+0.276
Neg logit contributions
leaf-0.492
ting-0.483
orry-0.461
unk-0.454
 ear-0.447
Token lab
Token ID2248
Feature activation-2.12e-01

Pos logit contributions
leaf+0.984
ting+0.966
orry+0.907
unk+0.901
 upper+0.883
Neg logit contributions
 ne-0.809
 One-0.785
 swe-0.764
 coll-0.762
 cheaper-0.762
Token had
Token ID550
Feature activation+3.51e-01

Pos logit contributions
leaf+0.361
ting+0.357
orry+0.334
unk+0.331
 ear+0.327
Neg logit contributions
 ne-0.273
 One-0.263
 cheaper-0.255
 swe-0.253
 coll-0.252
Token so
Token ID523
Feature activation+9.03e-01

Pos logit contributions
 ne+0.473
 One+0.459
 coll+0.447
 cheaper+0.447
 swe+0.443
Neg logit contributions
leaf-0.571
ting-0.559
orry-0.524
unk-0.518
 ear-0.513
Token much
Token ID881
Feature activation+8.76e-01

Pos logit contributions
 ne+0.839
 coll+0.796
 One+0.792
 swe+0.788
 cheaper+0.780
Neg logit contributions
leaf-1.857
ting-1.796
orry-1.757
unk-1.730
 ear-1.710
Token sad
Token ID6507
Feature activation-4.01e-01

Pos logit contributions
leaf+0.563
ting+0.559
orry+0.514
unk+0.504
 upper+0.498
Neg logit contributions
 ne-0.482
 One-0.465
 cheaper-0.453
 swe-0.447
 coll-0.445
Token and
Token ID290
Feature activation-1.54e-01

Pos logit contributions
leaf+0.656
ting+0.632
orry+0.597
unk+0.589
 upper+0.578
Neg logit contributions
 ne-0.562
 One-0.554
 cheaper-0.531
One-0.525
 swe-0.525
Token wanted
Token ID2227
Feature activation+1.25e-01

Pos logit contributions
leaf+0.233
ting+0.228
orry+0.213
unk+0.211
 upper+0.206
Neg logit contributions
 ne-0.230
 One-0.223
 cheaper-0.216
 swe-0.215
 coll-0.214
Token to
Token ID284
Feature activation-3.78e-01

Pos logit contributions
 ne+0.164
 One+0.158
 cheaper+0.154
 coll+0.154
 swe+0.153
Neg logit contributions
leaf-0.211
ting-0.209
orry-0.196
unk-0.194
 ear-0.192
Token find
Token ID1064
Feature activation-1.42e+00

Pos logit contributions
ting+0.599
leaf+0.599
orry+0.552
unk+0.548
 upper+0.529
Neg logit contributions
 ne-0.543
 One-0.530
 swe-0.506
 cheaper-0.505
 coll-0.504
Token it
Token ID340
Feature activation+7.13e-01

Pos logit contributions
leaf+2.796
ting+2.761
unk+2.659
orry+2.652
asp+2.581
Neg logit contributions
 One-1.535
 ne-1.519
 cheaper-1.458
 swe-1.456
 coll-1.391
Token.
Token ID13
Feature activation-4.11e-01

Pos logit contributions
 ne+1.052
 coll+1.030
 One+1.003
 cheaper+1.000
 swe+0.990
Neg logit contributions
ting-1.044
leaf-1.013
orry-0.958
 ear-0.956
unk-0.943
Token Max
Token ID5436
Feature activation+6.86e-01

Pos logit contributions
leaf+0.833
ting+0.817
orry+0.777
unk+0.769
 ear+0.754
Neg logit contributions
 One-0.418
 ne-0.415
One-0.383
 cheaper-0.377
 swe-0.376
Token went
Token ID1816
Feature activation+8.71e-01

Pos logit contributions
 ne+1.036
 cheaper+1.022
 coll+1.017
 One+1.006
 swe+0.997
Neg logit contributions
ting-0.972
leaf-0.957
orry-0.900
 ear-0.895
unk-0.893
Token on
Token ID319
Feature activation+1.30e+00

Pos logit contributions
 ne+1.382
 coll+1.341
 One+1.339
 cheaper+1.325
 swe+1.324
Neg logit contributions
ting-1.206
leaf-1.196
 ear-1.080
unk-1.058
orry-1.057
Token a
Token ID257
Feature activation+1.42e+00

Pos logit contributions
 ne+1.890
 swe+1.866
 cheaper+1.832
 coll+1.832
 One+1.782
Neg logit contributions
leaf-1.901
ting-1.891
 ear-1.811
orry-1.749
unk-1.730
Token house
Token ID2156
Feature activation-1.26e-01

Pos logit contributions
leaf+0.278
ting+0.273
orry+0.253
unk+0.251
 ear+0.245
Neg logit contributions
 ne-0.294
 One-0.286
 cheaper-0.280
 coll-0.278
 swe-0.278
Token.
Token ID13
Feature activation+3.78e-01

Pos logit contributions
leaf+0.213
ting+0.209
orry+0.197
unk+0.195
 ear+0.192
Neg logit contributions
 ne-0.163
 One-0.157
 cheaper-0.153
 coll-0.152
 swe-0.151
Token He
Token ID679
Feature activation-8.23e-03

Pos logit contributions
 ne+0.439
 coll+0.410
 swe+0.409
 cheaper+0.409
 One+0.407
Neg logit contributions
leaf-0.691
ting-0.687
orry-0.646
 ear-0.639
unk-0.637
Token had
Token ID550
Feature activation+9.48e-01

Pos logit contributions
leaf+0.014
ting+0.014
orry+0.013
unk+0.013
 ear+0.013
Neg logit contributions
 ne-0.011
 One-0.010
 cheaper-0.010
 coll-0.010
 swe-0.010
Token lots
Token ID6041
Feature activation+1.93e-01

Pos logit contributions
 ne+1.422
 One+1.385
 coll+1.377
 swe+1.367
 cheaper+1.351
Neg logit contributions
leaf-1.399
ting-1.355
orry-1.242
 ear-1.242
 upper-1.221
Token of
Token ID286
Feature activation+1.13e+00

Pos logit contributions
 ne+0.393
 One+0.386
 cheaper+0.379
 coll+0.378
 swe+0.377
Neg logit contributions
leaf-0.185
ting-0.180
orry-0.160
unk-0.155
 ear-0.153
Token fun
Token ID1257
Feature activation+3.14e-01

Pos logit contributions
 ne+1.006
 One+0.978
 cheaper+0.934
 coll+0.929
 swe+0.924
Neg logit contributions
leaf-2.404
ting-2.267
 ear-2.203
orry-2.164
 upper-2.162
Token and
Token ID290
Feature activation-4.29e-02

Pos logit contributions
 ne+0.457
 One+0.446
 cheaper+0.438
 swe+0.436
 coll+0.433
Neg logit contributions
leaf-0.481
ting-0.477
orry-0.441
unk-0.436
 ear-0.433
Token thoroughly
Token ID13770
Feature activation+3.22e-02

Pos logit contributions
leaf+0.073
ting+0.072
orry+0.067
unk+0.067
 ear+0.066
Neg logit contributions
 ne-0.056
 One-0.054
 cheaper-0.052
 swe-0.052
 coll-0.052
Token enjoyed
Token ID8359
Feature activation+5.50e-01

Pos logit contributions
 ne+0.047
 One+0.046
 cheaper+0.045
 coll+0.045
 swe+0.045
Neg logit contributions
leaf-0.049
ting-0.048
orry-0.045
unk-0.044
 ear-0.044
Token himself
Token ID2241
Feature activation+6.29e-01

Pos logit contributions
 ne+0.830
 One+0.818
 swe+0.801
 cheaper+0.800
 coll+0.793
Neg logit contributions
ting-0.804
leaf-0.803
 ear-0.728
orry-0.725
unk-0.716
Token all
Token ID477
Feature activation-8.92e-01

Pos logit contributions
 ne+1.428
 One+1.364
 coll+1.361
 cheaper+1.354
One+1.344
Neg logit contributions
leaf-0.880
ting-0.848
orry-0.801
 ear-0.779
unk-0.776
Token about
Token ID546
Feature activation+1.80e-01

Pos logit contributions
ting+1.124
leaf+1.092
unk+1.029
 upper+1.026
orry+1.016
Neg logit contributions
 One-1.578
 ne-1.541
 swe-1.520
 coll-1.497
 cheaper-1.495
Token their
Token ID511
Feature activation+4.93e-01

Pos logit contributions
 ne+0.232
 One+0.225
 cheaper+0.218
 coll+0.217
 swe+0.216
Neg logit contributions
leaf-0.305
ting-0.302
orry-0.282
unk-0.279
 ear-0.278
Token adventure
Token ID8855
Feature activation+2.00e-01

Pos logit contributions
 ne+0.613
 One+0.597
 cheaper+0.575
One+0.570
 swe+0.569
Neg logit contributions
leaf-0.857
ting-0.834
orry-0.799
unk-0.790
 ear-0.787
Token.
Token ID13
Feature activation+1.38e-01

Pos logit contributions
 ne+0.274
 One+0.266
 cheaper+0.258
 swe+0.258
 coll+0.257
Neg logit contributions
leaf-0.323
ting-0.321
orry-0.298
unk-0.295
 ear-0.292
Token She
Token ID1375
Feature activation+2.18e-01

Pos logit contributions
 ne+0.165
 One+0.156
 cheaper+0.154
 coll+0.153
 swe+0.152
Neg logit contributions
leaf-0.248
ting-0.245
orry-0.231
unk-0.229
 ear-0.227
Token smiled
Token ID13541
Feature activation+7.10e-01

Pos logit contributions
 ne+0.301
 One+0.294
 cheaper+0.286
 coll+0.286
 swe+0.284
Neg logit contributions
leaf-0.349
ting-0.342
orry-0.322
unk-0.317
 ear-0.313
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
 ne+1.024
 One+0.990
 cheaper+0.982
 coll+0.976
One+0.958
Neg logit contributions
ting-1.086
leaf-1.055
 ear-1.000
orry-0.979
unk-0.976
Token hugged
Token ID41130
Feature activation-1.77e-01

Pos logit contributions
leaf+1.717
ting+1.715
orry+1.667
unk+1.660
other+1.601
Neg logit contributions
 ne-1.766
 swe-1.634
 cheaper-1.609
 One-1.602
 coll-1.597
Token them
Token ID606
Feature activation+1.09e-02

Pos logit contributions
leaf+0.331
ting+0.325
orry+0.310
unk+0.307
 upper+0.300
Neg logit contributions
 ne-0.203
 One-0.197
 swe-0.190
 cheaper-0.189
 coll-0.188
Token.
Token ID13
Feature activation-5.79e-02

Pos logit contributions
 ne+0.016
 One+0.015
 cheaper+0.015
 coll+0.015
 swe+0.015
Neg logit contributions
leaf-0.017
ting-0.017
orry-0.016
unk-0.016
 ear-0.015
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
 ne+1.077
 One+1.030
 coll+1.021
 cheaper+1.011
One+1.004
Neg logit contributions
ting-1.298
leaf-1.276
 ear-1.192
orry-1.189
unk-1.182
Token We
Token ID775
Feature activation+4.02e-01

Pos logit contributions
leaf+0.323
ting+0.318
orry+0.298
unk+0.295
 upper+0.287
Neg logit contributions
 ne-0.297
 One-0.292
 cheaper-0.282
 swe-0.280
 coll-0.280
Token will
Token ID481
Feature activation+1.11e+00

Pos logit contributions
 ne+0.569
 One+0.548
 cheaper+0.542
 coll+0.541
 swe+0.538
Neg logit contributions
leaf-0.630
ting-0.614
orry-0.580
unk-0.570
 ear-0.567
Token be
Token ID307
Feature activation+4.27e-02

Pos logit contributions
 ne+1.674
 coll+1.603
 cheaper+1.592
 swe+1.579
 One+1.572
Neg logit contributions
leaf-1.584
ting-1.557
 ear-1.547
orry-1.473
 upper-1.457
Token good
Token ID922
Feature activation+9.40e-01

Pos logit contributions
 ne+0.062
 One+0.060
 cheaper+0.058
 coll+0.058
 swe+0.058
Neg logit contributions
leaf-0.066
ting-0.065
orry-0.061
unk-0.060
 ear-0.060
Token twins
Token ID20345
Feature activation+6.04e-01

Pos logit contributions
 ne+1.406
 One+1.361
 cheaper+1.335
One+1.326
 coll+1.319
Neg logit contributions
ting-1.416
leaf-1.386
 ear-1.302
unk-1.290
orry-1.252
Token,"
Token ID553
Feature activation+1.32e+00

Pos logit contributions
 ne+0.794
 One+0.770
 coll+0.747
 swe+0.738
One+0.735
Neg logit contributions
ting-0.992
leaf-0.990
orry-0.913
 ear-0.910
unk-0.906
Token they
Token ID484
Feature activation+3.47e-01

Pos logit contributions
 ne+2.732
 cheaper+2.669
 coll+2.663
 saf+2.644
One+2.638
Neg logit contributions
leaf-1.183
ting-1.105
 ear-1.048
orry-0.935
 hand-0.909
Token say
Token ID910
Feature activation+5.12e-01

Pos logit contributions
 ne+0.567
 One+0.546
 cheaper+0.545
 coll+0.538
 swe+0.534
Neg logit contributions
leaf-0.469
ting-0.458
orry-0.419
 ear-0.416
unk-0.411
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
 ne+0.729
 One+0.705
 cheaper+0.693
One+0.687
 coll+0.687
Neg logit contributions
leaf-0.792
ting-0.790
 ear-0.726
orry-0.724
unk-0.720
Token\n
Token ID198
Feature activation+3.04e-01

Pos logit contributions
 ne+0.188
 One+0.179
 cheaper+0.176
 coll+0.174
 swe+0.174
Neg logit contributions
leaf-0.318
ting-0.314
orry-0.296
unk-0.294
 ear-0.293
Token the
Token ID262
Feature activation-3.58e-01

Pos logit contributions
 ne+1.853
 coll+1.841
 swe+1.831
 One+1.825
 cheaper+1.824
Neg logit contributions
ting-1.757
leaf-1.644
 ear-1.589
orry-1.569
unk-1.474
Token stretching
Token ID20880
Feature activation+4.05e-02

Pos logit contributions
leaf+0.508
ting+0.499
orry+0.470
unk+0.463
 upper+0.448
Neg logit contributions
 ne-0.574
 One-0.554
 swe-0.543
 coll-0.542
 cheaper-0.541
Token was
Token ID373
Feature activation-5.79e-01

Pos logit contributions
 ne+0.051
 One+0.049
 cheaper+0.048
 coll+0.048
 swe+0.048
Neg logit contributions
leaf-0.070
ting-0.069
orry-0.065
unk-0.064
 ear-0.063
Token done
Token ID1760
Feature activation+1.03e+00

Pos logit contributions
leaf+0.783
ting+0.769
orry+0.719
unk+0.708
 ear+0.684
Neg logit contributions
 ne-0.973
 One-0.937
 cheaper-0.922
 swe-0.922
One-0.902
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
 ne+1.097
 cheaper+1.077
 coll+1.073
 One+1.070
One+1.064
Neg logit contributions
ting-1.968
leaf-1.920
 ear-1.798
unk-1.798
orry-1.794
Token everyone
Token ID2506
Feature activation-1.67e+00

Pos logit contributions
 ne+0.355
 One+0.346
 cheaper+0.340
 coll+0.339
 swe+0.338
Neg logit contributions
ting-0.296
leaf-0.296
orry-0.270
unk-0.265
 ear-0.264
Token shared
Token ID4888
Feature activation-4.56e-02

Pos logit contributions
leaf+2.972
ting+2.926
 upper+2.894
unk+2.799
orry+2.784
Neg logit contributions
 ne-2.193
 One-1.878
 swe-1.852
 cheaper-1.839
One-1.777
Token lunch
Token ID9965
Feature activation-4.68e-01

Pos logit contributions
leaf+0.080
ting+0.079
orry+0.075
unk+0.074
 ear+0.073
Neg logit contributions
 ne-0.056
 One-0.054
 cheaper-0.053
 swe-0.053
 coll-0.053
Token together
Token ID1978
Feature activation-6.95e-01

Pos logit contributions
leaf+0.804
ting+0.795
orry+0.751
unk+0.741
 ear+0.734
Neg logit contributions
 ne-0.604
 One-0.567
 cheaper-0.565
 swe-0.560
 coll-0.559
Token.
Token ID13
Feature activation-2.89e-01

Pos logit contributions
leaf+1.239
ting+1.206
orry+1.160
unk+1.141
 upper+1.124
Neg logit contributions
 ne-0.869
 One-0.827
 cheaper-0.801
 swe-0.797
 coll-0.795
Token Max
Token ID5436
Feature activation-1.95e-01

Pos logit contributions
leaf+0.416
ting+0.406
orry+0.378
unk+0.377
 upper+0.367
Neg logit contributions
 ne-0.449
 One-0.447
 cheaper-0.426
 coll-0.425
 swe-0.423
Token to
Token ID284
Feature activation+6.45e-02

Pos logit contributions
 ne+0.312
 One+0.302
 cheaper+0.298
 coll+0.294
 swe+0.293
Neg logit contributions
leaf-0.328
ting-0.326
orry-0.302
unk-0.300
 ear-0.297
Token hop
Token ID1725
Feature activation+4.41e-01

Pos logit contributions
 ne+0.096
 One+0.093
 cheaper+0.091
 coll+0.091
 swe+0.091
Neg logit contributions
leaf-0.097
ting-0.095
orry-0.089
unk-0.088
 ear-0.087
Token around
Token ID1088
Feature activation-5.89e-01

Pos logit contributions
 ne+0.914
 One+0.897
 cheaper+0.889
 coll+0.887
 swe+0.883
Neg logit contributions
leaf-0.395
ting-0.395
orry-0.334
 ear-0.327
unk-0.326
Token too
Token ID1165
Feature activation-9.85e-02

Pos logit contributions
leaf+1.009
ting+0.968
orry+0.938
unk+0.925
 upper+0.911
Neg logit contributions
 ne-0.776
 One-0.745
 coll-0.712
 cheaper-0.708
 swe-0.705
Token!
Token ID0
Feature activation-3.99e-01

Pos logit contributions
leaf+0.169
ting+0.167
orry+0.157
unk+0.155
 ear+0.153
Neg logit contributions
 ne-0.126
 One-0.122
 cheaper-0.118
 coll-0.117
 swe-0.117
Token\n
Token ID198
Feature activation-6.59e-01

Pos logit contributions
leaf+0.792
ting+0.780
orry+0.740
unk+0.735
 upper+0.728
Neg logit contributions
 One-0.412
 ne-0.406
One-0.373
 coll-0.372
 swe-0.370
Token\n
Token ID198
Feature activation-4.22e-01

Pos logit contributions
leaf+1.355
ting+1.352
 ear+1.265
orry+1.260
 upper+1.258
Neg logit contributions
 One-0.631
 ne-0.609
One-0.599
 cheaper-0.554
 coll-0.551
TokenTogether
Token ID41631
Feature activation+1.66e+00

Pos logit contributions
leaf+0.748
ting+0.737
unk+0.694
 ear+0.693
orry+0.693
Neg logit contributions
 ne-0.519
 One-0.510
One-0.500
 coll-0.483
 cheaper-0.482
Token,
Token ID11
Feature activation-9.54e-02

Pos logit contributions
 cheaper+1.949
 ne+1.947
 former+1.904
 swe+1.858
 coll+1.849
Neg logit contributions
ting-2.982
leaf-2.844
 ear-2.786
orry-2.780
unk-2.686
Token Sammy
Token ID44067
Feature activation+2.73e-01

Pos logit contributions
leaf+0.142
ting+0.139
orry+0.130
unk+0.129
 ear+0.127
Neg logit contributions
 ne-0.144
 One-0.140
 cheaper-0.136
 coll-0.136
 swe-0.136
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
 ne+0.377
 One+0.367
 cheaper+0.362
 coll+0.358
 swe+0.357
Neg logit contributions
leaf-0.434
ting-0.434
orry-0.402
unk-0.397
 ear-0.390
Tokenaps
Token ID1686
Feature activation-3.15e-01

Pos logit contributions
leaf+1.222
ting+1.206
orry+1.071
 upper+1.058
unk+1.056
Neg logit contributions
 One-1.440
 ne-1.429
One-1.353
 coll-1.347
 cheaper-1.321
Token her
Token ID607
Feature activation-6.22e-01

Pos logit contributions
leaf+0.543
ting+0.535
orry+0.506
unk+0.500
 upper+0.488
Neg logit contributions
 ne-0.406
 One-0.395
 swe-0.380
 cheaper-0.377
 coll-0.377
Token hands
Token ID2832
Feature activation+6.97e-01

Pos logit contributions
leaf+0.495
ting+0.487
orry+0.420
unk+0.390
 upper+0.368
Neg logit contributions
 ne-1.410
 One-1.363
 swe-1.356
 coll-1.354
 cheaper-1.339
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
 ne+1.007
 One+0.967
 coll+0.965
 cheaper+0.963
 swe+0.957
Neg logit contributions
ting-1.061
leaf-1.052
 ear-0.968
orry-0.951
unk-0.948
Token\n
Token ID198
Feature activation+6.70e-02

Pos logit contributions
leaf+0.301
ting+0.297
orry+0.283
unk+0.279
 ear+0.276
Neg logit contributions
 ne-0.166
 One-0.162
 cheaper-0.153
 swe-0.153
 coll-0.153
Token\n
Token ID198
Feature activation-9.62e-01

Pos logit contributions
 ne+0.072
 One+0.069
 cheaper+0.067
 swe+0.067
 coll+0.067
Neg logit contributions
leaf-0.128
ting-0.126
orry-0.120
unk-0.118
 ear-0.117
TokenThey
Token ID2990
Feature activation-5.02e-01

Pos logit contributions
leaf+1.769
ting+1.708
orry+1.670
unk+1.663
 ear+1.647
Neg logit contributions
 ne-1.177
One-1.142
 One-1.115
 swe-1.062
 cheaper-1.058
Token run
Token ID1057
Feature activation+3.49e-01

Pos logit contributions
leaf+0.855
ting+0.847
unk+0.811
orry+0.799
 upper+0.774
Neg logit contributions
 ne-0.666
 One-0.642
One-0.605
 swe-0.604
 cheaper-0.600
Token after
Token ID706
Feature activation-3.50e-02

Pos logit contributions
 ne+0.553
 One+0.536
 cheaper+0.530
 coll+0.527
 swe+0.527
Neg logit contributions
leaf-0.487
ting-0.486
orry-0.439
 ear-0.435
unk-0.432
Token the
Token ID262
Feature activation-7.26e-01

Pos logit contributions
leaf+0.053
ting+0.052
orry+0.048
unk+0.048
 ear+0.047
Neg logit contributions
 ne-0.052
 One-0.051
 cheaper-0.049
 coll-0.049
 swe-0.049
Token man
Token ID582
Feature activation+6.08e-01

Pos logit contributions
leaf+1.268
ting+1.260
orry+1.240
unk+1.217
 upper+1.165
Neg logit contributions
 ne-0.931
 One-0.867
 coll-0.863
 cheaper-0.858
 swe-0.851
Token She
Token ID1375
Feature activation+2.80e-01

Pos logit contributions
 ne+0.124
 One+0.119
 cheaper+0.117
 coll+0.116
 swe+0.116
Neg logit contributions
leaf-0.165
ting-0.163
orry-0.153
unk-0.152
 ear-0.150
Token said
Token ID531
Feature activation+9.07e-01

Pos logit contributions
 ne+0.396
 One+0.387
 cheaper+0.380
 coll+0.380
 swe+0.377
Neg logit contributions
leaf-0.433
ting-0.426
orry-0.401
unk-0.394
 ear-0.392
Token,
Token ID11
Feature activation-7.82e-01

Pos logit contributions
 ne+0.951
 coll+0.916
 One+0.914
One+0.905
 cheaper+0.903
Neg logit contributions
ting-1.721
leaf-1.701
 ear-1.633
orry-1.628
unk-1.617
Token "
Token ID366
Feature activation+1.82e-02

Pos logit contributions
leaf+1.172
ting+1.099
orry+1.062
unk+1.034
other+1.016
Neg logit contributions
 One-1.209
 ne-1.205
 cheaper-1.134
 swe-1.133
One-1.130
TokenNo
Token ID2949
Feature activation+8.48e-01

Pos logit contributions
 ne+0.030
 One+0.029
 cheaper+0.029
 coll+0.029
 swe+0.029
Neg logit contributions
leaf-0.024
ting-0.024
orry-0.022
unk-0.022
 ear-0.021
Token,
Token ID11
Feature activation-6.44e-01

Pos logit contributions
 ne+0.910
 One+0.842
 cheaper+0.839
One+0.829
 coll+0.820
Neg logit contributions
ting-1.609
leaf-1.593
 ear-1.526
orry-1.522
unk-1.503
Token it
Token ID340
Feature activation+1.37e-01

Pos logit contributions
leaf+0.905
ting+0.866
unk+0.810
orry+0.800
 upper+0.779
Neg logit contributions
 ne-1.057
 One-1.033
 cheaper-1.006
 swe-1.002
 coll-0.999
Token is
Token ID318
Feature activation+8.57e-01

Pos logit contributions
 ne+0.171
 One+0.165
 cheaper+0.161
 coll+0.160
 swe+0.160
Neg logit contributions
leaf-0.237
ting-0.235
orry-0.221
unk-0.219
 ear-0.216
Token mine
Token ID6164
Feature activation-7.69e-01

Pos logit contributions
 ne+1.575
 One+1.531
 coll+1.516
One+1.495
 swe+1.478
Neg logit contributions
leaf-0.933
ting-0.914
 ear-0.871
orry-0.852
unk-0.844
Token.
Token ID13
Feature activation-4.33e-01

Pos logit contributions
leaf+1.345
ting+1.286
orry+1.223
unk+1.185
 upper+1.174
Neg logit contributions
 ne-0.995
 One-0.971
 cheaper-0.959
 swe-0.936
One-0.927
Token I
Token ID314
Feature activation+8.21e-01

Pos logit contributions
leaf+0.505
ting+0.498
orry+0.452
unk+0.447
 upper+0.431
Neg logit contributions
 One-0.789
 ne-0.784
 cheaper-0.755
One-0.749
 swe-0.747
Token to
Token ID284
Feature activation-1.84e+00

Pos logit contributions
leaf+0.743
ting+0.725
orry+0.685
unk+0.676
 upper+0.671
Neg logit contributions
 ne-0.596
 One-0.577
 swe-0.555
 cheaper-0.552
 coll-0.547
Token Emma
Token ID18966
Feature activation+5.40e-01

Pos logit contributions
leaf+3.303
unk+3.218
orry+3.181
asp+3.119
ting+3.117
Neg logit contributions
 One-2.405
 ne-2.370
 swe-2.201
One-2.099
 someday-2.090
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
 ne+0.741
 One+0.710
 cheaper+0.703
 coll+0.699
 swe+0.694
Neg logit contributions
ting-0.866
leaf-0.849
 ear-0.793
orry-0.793
unk-0.791
Token
Token ID220
Feature activation-1.18e-01

Pos logit contributions
leaf+0.249
ting+0.246
orry+0.234
unk+0.231
 ear+0.228
Neg logit contributions
 ne-0.142
 One-0.139
 cheaper-0.133
 swe-0.132
 coll-0.131
Token\n
Token ID198
Feature activation-5.15e-01

Pos logit contributions
leaf+0.229
ting+0.226
orry+0.214
unk+0.212
 ear+0.211
Neg logit contributions
 ne-0.123
 One-0.119
 cheaper-0.114
 swe-0.113
 coll-0.113
Token\n
Token ID198
Feature activation-5.47e-01

Pos logit contributions
leaf+1.044
ting+1.040
orry+0.983
unk+0.980
 ear+0.971
Neg logit contributions
 One-0.496
 ne-0.494
One-0.467
 swe-0.456
 coll-0.456
TokenWith
Token ID3152
Feature activation-3.43e-01

Pos logit contributions
leaf+0.957
ting+0.952
unk+0.897
orry+0.889
 ear+0.886
Neg logit contributions
 ne-0.681
 One-0.665
One-0.660
 swe-0.644
 cheaper-0.641
Token a
Token ID257
Feature activation-5.81e-01

Pos logit contributions
leaf+0.585
ting+0.581
orry+0.551
unk+0.548
 upper+0.534
Neg logit contributions
 ne-0.444
 One-0.431
 cheaper-0.418
 swe-0.416
 coll-0.413
Token smile
Token ID8212
Feature activation+4.09e-01

Pos logit contributions
ting+0.970
leaf+0.952
orry+0.897
unk+0.890
 ear+0.863
Neg logit contributions
 ne-0.799
 One-0.748
 cheaper-0.745
 swe-0.742
 coll-0.734
Token on
Token ID319
Feature activation-7.62e-01

Pos logit contributions
 ne+0.485
 One+0.473
 cheaper+0.462
 coll+0.455
 swe+0.454
Neg logit contributions
ting-0.735
leaf-0.726
orry-0.678
unk-0.677
 ear-0.677
Token her
Token ID607
Feature activation-5.55e-01

Pos logit contributions
leaf+1.342
ting+1.308
orry+1.234
unk+1.214
 upper+1.196
Neg logit contributions
 One-0.990
 ne-0.982
 swe-0.940
 cheaper-0.911
 coll-0.905
Token It
Token ID632
Feature activation+2.56e-01

Pos logit contributions
 ne+0.000
 One+0.000
 cheaper+0.000
 coll+0.000
 swe+0.000
Neg logit contributions
leaf-0.000
ting-0.000
orry-0.000
unk-0.000
 ear-0.000
Token crawled
Token ID45668
Feature activation-4.70e-01

Pos logit contributions
 ne+0.336
 One+0.329
 cheaper+0.323
 coll+0.322
 swe+0.318
Neg logit contributions
leaf-0.421
ting-0.420
orry-0.394
unk-0.388
 ear-0.384
Token onto
Token ID4291
Feature activation+7.47e-01

Pos logit contributions
leaf+0.583
ting+0.557
orry+0.527
unk+0.523
 upper+0.508
Neg logit contributions
 ne-0.837
 One-0.820
 cheaper-0.791
 swe-0.791
One-0.787
Token Tim
Token ID5045
Feature activation-9.85e-01

Pos logit contributions
 ne+1.144
 swe+1.099
 cheaper+1.095
 coll+1.089
 One+1.078
Neg logit contributions
ting-1.083
leaf-1.059
 ear-0.998
orry-0.988
unk-0.968
Tokenmy
Token ID1820
Feature activation-1.32e+00

Pos logit contributions
leaf+2.040
ting+1.929
orry+1.847
unk+1.821
 ear+1.793
Neg logit contributions
 One-0.984
 ne-0.982
One-0.936
 swe-0.925
 coll-0.918
Token's
Token ID338
Feature activation-2.37e+00

Pos logit contributions
leaf+2.554
ting+2.376
orry+2.315
unk+2.309
 upper+2.231
Neg logit contributions
 ne-1.552
 One-1.515
 swe-1.432
 coll-1.427
 cheaper-1.410
Token hand
Token ID1021
Feature activation-7.48e-01

Pos logit contributions
leaf+3.537
ting+3.159
orry+2.929
asc+2.785
asing+2.779
Neg logit contributions
 swe-4.230
 One-4.191
 coll-4.103
 ne-4.084
 cheaper-3.959
Token and
Token ID290
Feature activation-4.16e-01

Pos logit contributions
leaf+1.397
ting+1.319
orry+1.288
unk+1.270
 upper+1.231
Neg logit contributions
 ne-0.887
 One-0.861
 cheaper-0.825
 swe-0.818
 coll-0.810
Token they
Token ID484
Feature activation-9.02e-01

Pos logit contributions
leaf+0.696
ting+0.673
orry+0.642
unk+0.642
 upper+0.624
Neg logit contributions
 ne-0.573
 One-0.538
 swe-0.530
 cheaper-0.527
 coll-0.522
Token became
Token ID2627
Feature activation+5.22e-01

Pos logit contributions
leaf+1.531
ting+1.466
unk+1.401
orry+1.365
 upper+1.340
Neg logit contributions
 ne-1.275
 One-1.204
 swe-1.130
One-1.129
 saf-1.120
Token friends
Token ID2460
Feature activation-6.55e-01

Pos logit contributions
 ne+0.781
 One+0.771
 cheaper+0.749
One+0.747
 coll+0.741
Neg logit contributions
ting-0.767
leaf-0.763
orry-0.706
 ear-0.704
unk-0.703
Token had
Token ID550
Feature activation+8.28e-01

Pos logit contributions
 ne+0.077
 One+0.075
 cheaper+0.073
 swe+0.072
 coll+0.072
Neg logit contributions
leaf-0.098
ting-0.097
orry-0.091
unk-0.090
 ear-0.089
Token fun
Token ID1257
Feature activation-1.92e-01

Pos logit contributions
 ne+1.237
 cheaper+1.191
 One+1.187
 swe+1.168
 coll+1.161
Neg logit contributions
leaf-1.222
ting-1.188
orry-1.119
unk-1.085
 ear-1.085
Token together
Token ID1978
Feature activation-7.15e-01

Pos logit contributions
leaf+0.333
ting+0.328
orry+0.309
unk+0.306
 ear+0.302
Neg logit contributions
 ne-0.242
 One-0.234
 coll-0.226
 cheaper-0.225
 swe-0.224
Token.
Token ID13
Feature activation-3.24e-01

Pos logit contributions
leaf+1.259
ting+1.210
orry+1.153
unk+1.145
 upper+1.130
Neg logit contributions
 ne-0.921
 One-0.894
 cheaper-0.851
One-0.850
 coll-0.846
Token The
Token ID383
Feature activation-1.55e+00

Pos logit contributions
leaf+0.604
ting+0.594
orry+0.558
unk+0.557
 upper+0.550
Neg logit contributions
 One-0.368
 ne-0.364
One-0.340
 coll-0.340
 cheaper-0.340
Token enormous
Token ID9812
Feature activation-1.93e+00

Pos logit contributions
leaf+2.702
ting+2.655
unk+2.389
 Stop+2.383
idget+2.353
Neg logit contributions
 ne-2.093
 One-2.083
One-2.036
 swe-1.989
 coll-1.976
Token insect
Token ID12833
Feature activation-3.20e-01

Pos logit contributions
leaf+2.325
ting+2.221
idget+1.993
 upper+1.872
 Stop+1.813
Neg logit contributions
 ne-3.720
 coll-3.620
 saf-3.607
 One-3.605
 swe-3.492
Token was
Token ID373
Feature activation+2.00e-01

Pos logit contributions
leaf+0.571
ting+0.563
orry+0.527
unk+0.524
 upper+0.516
Neg logit contributions
 ne-0.390
 One-0.372
 swe-0.359
 cheaper-0.359
 coll-0.358
Token never
Token ID1239
Feature activation-3.64e-01

Pos logit contributions
 ne+0.301
 One+0.293
 cheaper+0.286
 coll+0.285
 swe+0.284
Neg logit contributions
leaf-0.294
ting-0.291
orry-0.271
unk-0.268
 ear-0.265
Token alone
Token ID3436
Feature activation-1.05e-01

Pos logit contributions
leaf+0.537
ting+0.527
orry+0.489
unk+0.485
 ear+0.475
Neg logit contributions
 ne-0.560
 One-0.538
 cheaper-0.533
 swe-0.528
 coll-0.520
Token again
Token ID757
Feature activation+4.90e-01

Pos logit contributions
leaf+0.214
ting+0.211
orry+0.201
unk+0.199
 ear+0.197
Neg logit contributions
 ne-0.099
 One-0.094
 cheaper-0.091
 swe-0.089
 coll-0.089
Token't
Token ID470
Feature activation-6.30e-01

Pos logit contributions
 ne+2.634
 swe+2.601
 coll+2.581
 cheaper+2.548
 saf+2.453
Neg logit contributions
leaf-1.689
ting-1.669
orry-1.515
unk-1.497
 Stop-1.434
Token wait
Token ID4043
Feature activation+1.60e-01

Pos logit contributions
ting+0.791
leaf+0.779
unk+0.729
orry+0.706
 upper+0.667
Neg logit contributions
 ne-1.103
 One-1.088
One-1.049
 cheaper-1.048
 coll-1.027
Token to
Token ID284
Feature activation-6.69e-01

Pos logit contributions
 ne+0.212
 One+0.205
 cheaper+0.200
 coll+0.199
 swe+0.198
Neg logit contributions
leaf-0.267
ting-0.264
orry-0.247
unk-0.244
 ear-0.242
Token show
Token ID905
Feature activation-1.70e+00

Pos logit contributions
ting+0.993
leaf+0.992
unk+0.930
orry+0.921
 upper+0.880
Neg logit contributions
 ne-1.023
 One-1.001
 coll-0.962
 swe-0.961
 cheaper-0.961
Token it
Token ID340
Feature activation-5.72e-01

Pos logit contributions
leaf+3.112
ting+2.975
unk+2.965
asp+2.843
 upper+2.817
Neg logit contributions
 One-2.134
 swe-2.072
 ne-2.071
 enough-2.069
 cheaper-2.003
Token to
Token ID284
Feature activation-2.49e+00

Pos logit contributions
leaf+0.942
ting+0.922
orry+0.869
unk+0.868
 upper+0.855
Neg logit contributions
 ne-0.772
 One-0.760
 cheaper-0.741
 swe-0.739
 coll-0.729
Token her
Token ID607
Feature activation-1.81e-01

Pos logit contributions
leaf+5.365
unk+5.238
ting+5.155
orry+4.970
asp+4.967
Neg logit contributions
 One-2.831
 ne-2.446
 swe-2.365
One-2.360
 saf-2.315
Token mum
Token ID25682
Feature activation+4.41e-01

Pos logit contributions
leaf+0.259
ting+0.255
orry+0.235
unk+0.231
 upper+0.227
Neg logit contributions
 ne-0.288
 One-0.280
 cheaper-0.274
 swe-0.273
 coll-0.273
Token.
Token ID13
Feature activation-3.10e-01

Pos logit contributions
 ne+0.622
 One+0.593
 cheaper+0.587
 coll+0.582
 swe+0.577
Neg logit contributions
ting-0.695
leaf-0.691
orry-0.633
unk-0.630
 ear-0.629
Token
Token ID220
Feature activation-6.98e-01

Pos logit contributions
leaf+0.616
ting+0.609
orry+0.575
unk+0.572
 upper+0.561
Neg logit contributions
 One-0.316
 ne-0.313
 coll-0.291
 swe-0.291
One-0.290
Token\n
Token ID198
Feature activation+2.60e-04

Pos logit contributions
leaf+1.453
ting+1.441
orry+1.345
 ear+1.338
 upper+1.332
Neg logit contributions
 One-0.662
 ne-0.628
One-0.610
 cheaper-0.587
 swe-0.576
Token He
Token ID679
Feature activation-1.77e-01

Pos logit contributions
leaf+1.055
ting+1.036
orry+0.988
unk+0.978
 ear+0.968
Neg logit contributions
 One-0.416
 ne-0.402
One-0.360
 cheaper-0.358
 swe-0.354
Token held
Token ID2714
Feature activation-1.32e+00

Pos logit contributions
leaf+0.270
ting+0.268
orry+0.249
unk+0.247
 ear+0.242
Neg logit contributions
 ne-0.262
 One-0.253
 cheaper-0.246
 swe-0.245
 coll-0.245
Token his
Token ID465
Feature activation-1.09e+00

Pos logit contributions
leaf+2.631
ting+2.547
orry+2.459
 upper+2.428
unk+2.407
Neg logit contributions
 ne-1.420
 One-1.396
 swe-1.303
 cheaper-1.255
 coll-1.245
Token toy
Token ID13373
Feature activation-1.20e+00

Pos logit contributions
leaf+1.949
ting+1.934
orry+1.827
unk+1.719
asp+1.711
Neg logit contributions
 ne-1.395
 swe-1.343
 cheaper-1.335
 One-1.329
 coll-1.310
Token with
Token ID351
Feature activation-8.91e-01

Pos logit contributions
leaf+2.168
ting+2.036
orry+2.018
unk+1.937
 upper+1.888
Neg logit contributions
 ne-1.543
 One-1.488
 swe-1.422
 cheaper-1.419
 coll-1.416
Token both
Token ID1111
Feature activation-1.87e+00

Pos logit contributions
leaf+1.687
ting+1.643
unk+1.589
orry+1.581
 upper+1.516
Neg logit contributions
 ne-1.068
 One-1.020
 swe-0.979
 coll-0.960
 cheaper-0.946
Token hands
Token ID2832
Feature activation-5.54e-01

Pos logit contributions
ting+2.750
leaf+2.740
orry+2.570
unk+2.479
 Stop+2.350
Neg logit contributions
 ne-3.165
 swe-3.079
 One-3.055
 coll-2.932
 cheaper-2.918
Token and
Token ID290
Feature activation-1.09e-01

Pos logit contributions
leaf+0.993
ting+0.959
orry+0.924
unk+0.906
 ear+0.892
Neg logit contributions
 ne-0.681
 One-0.661
 cheaper-0.640
 coll-0.627
 swe-0.627
Token showed
Token ID3751
Feature activation-1.24e+00

Pos logit contributions
leaf+0.179
ting+0.176
orry+0.165
unk+0.164
 ear+0.160
Neg logit contributions
 ne-0.149
 One-0.143
 cheaper-0.140
 coll-0.139
 swe-0.139
Token it
Token ID340
Feature activation-1.01e-01

Pos logit contributions
leaf+2.463
ting+2.386
unk+2.329
orry+2.306
 upper+2.304
Neg logit contributions
 ne-1.293
 One-1.293
 swe-1.245
 coll-1.198
 cheaper-1.196
Token to
Token ID284
Feature activation-1.78e+00

Pos logit contributions
leaf+0.173
ting+0.172
orry+0.161
unk+0.160
 ear+0.157
Neg logit contributions
 ne-0.128
 One-0.124
 cheaper-0.120
 swe-0.120
 coll-0.120
Token\n
Token ID198
Feature activation+5.14e-04

Pos logit contributions
 ne+0.007
 One+0.007
 cheaper+0.007
 coll+0.007
 swe+0.007
Neg logit contributions
leaf-0.013
ting-0.012
orry-0.012
unk-0.012
 ear-0.012
Token\n
Token ID198
Feature activation-3.42e-01

Pos logit contributions
 ne+0.001
 One+0.001
 cheaper+0.001
 coll+0.001
 swe+0.001
Neg logit contributions
leaf-0.001
ting-0.001
orry-0.001
unk-0.001
 ear-0.001
TokenSo
Token ID2396
Feature activation+9.58e-01

Pos logit contributions
leaf+0.622
ting+0.613
orry+0.572
unk+0.570
 ear+0.566
Neg logit contributions
 ne-0.418
 One-0.405
One-0.394
 cheaper-0.386
 swe-0.380
Token he
Token ID339
Feature activation+8.08e-01

Pos logit contributions
 coll+1.062
 cheaper+1.033
 swe+1.027
 ne+1.027
One+0.990
Neg logit contributions
ting-1.785
leaf-1.735
 ear-1.717
orry-1.663
unk-1.619
Token ran
Token ID4966
Feature activation-3.14e-01

Pos logit contributions
 One+1.167
 cheaper+1.156
 coll+1.152
 ne+1.152
 swe+1.125
Neg logit contributions
ting-1.217
leaf-1.206
orry-1.115
 ear-1.095
unk-1.095
Token to
Token ID284
Feature activation-2.26e+00

Pos logit contributions
leaf+0.454
ting+0.443
orry+0.418
unk+0.415
 upper+0.406
Neg logit contributions
 ne-0.489
 One-0.479
 cheaper-0.464
 swe-0.459
One-0.458
Token his
Token ID465
Feature activation-1.04e+00

Pos logit contributions
ting+4.595
leaf+4.533
unk+4.488
orry+4.448
asp+4.433
Neg logit contributions
 One-2.531
 ne-2.444
 saf-2.245
One-2.145
 Nib-2.082
Token house
Token ID2156
Feature activation-4.01e-01

Pos logit contributions
leaf+1.654
ting+1.639
orry+1.482
unk+1.427
 Stop+1.410
Neg logit contributions
 ne-1.558
 coll-1.501
 One-1.497
 swe-1.494
 cheaper-1.491
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
leaf+0.709
ting+0.693
orry+0.659
unk+0.648
 upper+0.636
Neg logit contributions
 ne-0.500
 One-0.488
 cheaper-0.469
 coll-0.465
 swe-0.462
Token grabbed
Token ID13176
Feature activation-1.37e+00

Pos logit contributions
leaf+2.921
ting+2.835
unk+2.763
orry+2.760
 upper+2.720
Neg logit contributions
 ne-1.976
 One-1.769
 swe-1.722
 cheaper-1.715
 saf-1.640
Token a
Token ID257
Feature activation-6.58e-01

Pos logit contributions
leaf+2.638
ting+2.571
unk+2.503
orry+2.490
 upper+2.440
Neg logit contributions
 ne-1.571
 One-1.507
 swe-1.418
 cheaper-1.392
 enough-1.363