TOP ACTIVATIONS
MAX = 4.137

he decided to go to the river . He looked in
He went to the market , found the perfect
day he would go to the field to do his work
" Let 's go to the west !" said Jill .
special . She went to the forest and she looked for
but when she arrived at the first place , they were
So they went to the store and asked the shop
and her daughter went to the field to test the corn
, Sam would go to the market with his family .
time to sleep !" Dave sighed . He wanted
carrot . He went to the garden to find one .
One day he went to the pond to drink some water
So , he went to the garden to pick some fresh
hungry . He headed to the west and began to twist
So , she went to the gate and opened it .
field . Johnny ran to the middle and noticed a big
day , he went to the forest and spotted a beautiful
You have to go to the west and twist off a
, so she went to the kitchen . She spotted a
and her parents went to the market . Jane tried to

MIN ACTIVATIONS
MIN = -4.008

computer . He would spend hours and hours every day playing
day Mill ie would spend hours making new routes for the
be ep . They have fun and laugh a lot .
arrow . He would spend hours shooting arrows at targets in
He would spend hours and hours every day playing games and
soft . They both had fun and were happy . The
help her . They all kept spinning , but she still
ep sounds . They have fun , but they also want
her . They both had fun and created many more things
They were having fun with their paper toys .
The cats had lots of fun playing , but after a
string . She would spend hours on end weaving patterns and
. He was very good at fixing them . One day
John and Jane spent the day practicing tennis . When they
trumpet and had lots of fun . After a
together . They both had fun and Lily realized that it
shiny pieces . She worked and worked , putting each piece
. They had lots of fun that day and Lily was
my . They were having fun catching butterflies when Lily noticed
They were having fun . But then they saw

INTERVAL 2.101 to 4.137
CONTAINS 0.487%

a blanket . They told them their plan . Mom and
re ind eer ! P ete was very curious
saw a big tree and decided to play hide and seek
Anna and Ben nod . They want to be good and
a long time . Lily learned that she should listen to

INTERVAL 0.065 to 2.101
CONTAINS 45.140%

their hands . They show them how to watch the waves
was sm elly . He lived with his farmer who looked
heard a loud noise . It was a plane flying over
dull . Now , she had all kinds of friends to
Suddenly , she heard a voice . It was coming from

INTERVAL -1.971 to 0.065
CONTAINS 53.605%

She should not get a cookie . She should get a
outside . They asked her to join them , but Lily
Hello , mom ! I 'm so glad you 're here
so busy !" Ben said . " They are
, taking in the bright colours and sunshine .

INTERVAL -4.008 to -1.971
CONTAINS 0.767%

drawings . A few days later , Lily and
so much joy . From then on , the candle was
both cry . They are not friends anymore .
with . The next day , Jane asked her
and they both had a great time .
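
The four interval boundaries listed above look consistent with splitting the range from MIN to MAX into four equal-width bins. That is an inference from the numbers, not something the dump states. A minimal sketch of that reading, where the helper name `equal_width_bins` is hypothetical:

```python
def equal_width_bins(lo, hi, n_bins=4):
    """Return (lower, upper) pairs for n_bins equal-width intervals over [lo, hi]."""
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    edges[-1] = hi  # pin the last edge to hi to avoid floating-point drift
    return list(zip(edges[:-1], edges[1:]))


# MIN and MAX as reported at the top of this dump (already rounded to 3 decimals).
bins = equal_width_bins(-4.008, 4.137)

# The dump lists the highest interval first, so print in reverse order.
for lower, upper in reversed(bins):
    print(f"INTERVAL {lower:.3f} to {upper:.3f}")
```

The computed edges agree with the listed ones to within about 0.001, which is what one would expect if the displayed MIN and MAX were rounded before being shown.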
Token he
Token ID339
Feature activation+1.26e+00

Pos logit contributions
 stranger+0.995
 keeper+0.944
 stall+0.937
Stan+0.891
 truth+0.884
Neg logit contributions
ug-1.110
idd-1.007
ooming-0.975
zz-0.957
irled-0.955
Token decided
Token ID3066
Feature activation+2.49e+00

Pos logit contributions
 stranger+2.241
 keeper+2.126
 stall+2.071
 Heidi+2.046
 truth+2.010
Neg logit contributions
ug-2.347
idd-2.173
zz-2.108
irled-2.084
ged-2.049
Token to
Token ID284
Feature activation+1.74e+00

Pos logit contributions
 stall+4.241
 stranger+4.156
 keeper+4.039
Stan+3.932
 butcher+3.905
Neg logit contributions
ug-4.572
ged-4.312
ooming-4.164
idd-4.138
zz-3.972
Token go
Token ID467
Feature activation+6.44e-01

Pos logit contributions
 stranger+2.711
 keeper+2.555
 stall+2.417
 ranger+2.369
 Heidi+2.338
Neg logit contributions
ug-3.589
idd-3.291
zz-3.239
irled-3.130
ooming-3.113
Token to
Token ID284
Feature activation+1.67e+00

Pos logit contributions
 stranger+1.006
 keeper+0.960
 stall+0.923
 Heidi+0.896
 truth+0.872
Neg logit contributions
ug-1.329
idd-1.257
ooming-1.234
zz-1.194
ips-1.187
Token the
Token ID262
Feature activation+4.14e+00

Pos logit contributions
 stranger+2.862
 keeper+2.784
 stall+2.580
 seller+2.551
 ranger+2.551
Neg logit contributions
ug-3.049
idd-2.842
ooming-2.797
zz-2.710
ips-2.686
Token river
Token ID7850
Feature activation+1.05e+00

Pos logit contributions
 Heidi+6.366
 keeper+5.903
 stranger+5.870
holder+5.749
 Sonia+5.662
Neg logit contributions
idd-8.072
 stacking-7.426
ooming-7.404
ug-7.198
bed-7.179
Token.
Token ID13
Feature activation+8.22e-01

Pos logit contributions
 stranger+1.565
 Heidi+1.414
 keeper+1.406
 stall+1.373
Stan+1.366
Neg logit contributions
ug-2.291
idd-2.129
ooming-2.070
ips-1.976
zz-1.972
Token He
Token ID679
Feature activation+4.91e-01

Pos logit contributions
 stranger+1.051
 keeper+0.961
 stall+0.921
Stan+0.886
 Heidi+0.870
Neg logit contributions
ug-1.959
idd-1.835
ooming-1.765
anged-1.738
zz-1.735
Token looked
Token ID3114
Feature activation+2.49e-01

Pos logit contributions
 stranger+0.791
 keeper+0.744
 stall+0.722
 Heidi+0.705
 truth+0.690
Neg logit contributions
ug-1.002
idd-0.945
ooming-0.902
zz-0.889
irled-0.888
Token in
Token ID287
Feature activation+1.26e+00

Pos logit contributions
 stranger+0.429
 keeper+0.415
 stall+0.404
 Heidi+0.384
 ranger+0.384
Neg logit contributions
ug-0.472
idd-0.440
ooming-0.430
irled-0.414
zz-0.412
Token\n
Token ID198
Feature activation-2.56e-01

Pos logit contributions
ug+0.068
idd+0.065
ooming+0.063
irled+0.061
med+0.061
Neg logit contributions
 stranger-0.040
 keeper-0.037
 stall-0.037
 Heidi-0.034
 seller-0.034
Token\n
Token ID198
Feature activation+9.14e-01

Pos logit contributions
ug+0.598
idd+0.568
ooming+0.554
irled+0.535
zz+0.535
Neg logit contributions
 stranger-0.333
 keeper-0.313
 stall-0.308
 Heidi-0.283
 ranger-0.282
TokenHe
Token ID1544
Feature activation+8.02e-01

Pos logit contributions
 stranger+1.296
 keeper+1.213
 stall+1.209
 Heidi+1.146
 camel+1.140
Neg logit contributions
ug-2.040
idd-1.909
ooming-1.882
med-1.847
irled-1.843
Token went
Token ID1816
Feature activation+1.07e+00

Pos logit contributions
 stranger+1.370
 keeper+1.273
 stall+1.258
 Heidi+1.233
 truth+1.194
Neg logit contributions
ug-1.564
idd-1.481
ooming-1.412
irled-1.400
zz-1.383
Token to
Token ID284
Feature activation+1.48e+00

Pos logit contributions
 stranger+1.584
 keeper+1.481
 truth+1.444
 stall+1.444
 Heidi+1.441
Neg logit contributions
ug-2.238
idd-2.163
ooming-2.108
ips-2.053
ged-2.021
Token the
Token ID262
Feature activation+4.13e+00

Pos logit contributions
 stranger+2.414
 keeper+2.310
 stall+2.194
 seller+2.150
Stan+2.149
Neg logit contributions
ug-2.795
idd-2.676
ooming-2.574
ips-2.483
zz-2.483
Token market
Token ID1910
Feature activation+2.01e+00

Pos logit contributions
asha+5.240
oto+5.153
isi+5.092
 Heidi+5.037
iri+4.917
Neg logit contributions
idd-9.580
 stacking-8.940
ooming-8.817
ug-8.225
ips-8.202
Token,
Token ID11
Feature activation+3.95e-01

Pos logit contributions
 stranger+3.840
Stan+3.548
 Heidi+3.539
 camel+3.512
 truth+3.484
Neg logit contributions
ug-3.407
idd-3.272
ooming-3.159
ged-3.009
ips-2.985
Token found
Token ID1043
Feature activation+2.13e+00

Pos logit contributions
 stranger+0.616
 keeper+0.581
 stall+0.568
 Heidi+0.539
 truth+0.537
Neg logit contributions
ug-0.808
idd-0.772
ooming-0.749
med-0.719
ged-0.717
Token the
Token ID262
Feature activation+1.86e+00

Pos logit contributions
 stranger+3.131
 keeper+2.961
 stall+2.861
Stan+2.733
 ranger+2.714
Neg logit contributions
ug-4.645
idd-4.536
med-4.343
ged-4.238
ips-4.102
Token perfect
Token ID2818
Feature activation+5.78e-01

Pos logit contributions
 stranger+2.366
 Heidi+2.286
 keeper+2.269
Stan+2.225
 Stanley+2.173
Neg logit contributions
idd-3.926
ug-3.883
ooming-3.719
ged-3.588
ips-3.551
Token day
Token ID1110
Feature activation-6.30e-01

Pos logit contributions
 stranger+1.960
 keeper+1.937
 stall+1.887
 Heidi+1.848
 seller+1.826
Neg logit contributions
ug-1.471
idd-1.304
zz-1.235
ooming-1.229
irled-1.192
Token he
Token ID339
Feature activation+2.37e-01

Pos logit contributions
ug+1.376
idd+1.316
ooming+1.281
irled+1.240
ips+1.239
Neg logit contributions
 stranger-0.930
 keeper-0.867
 stall-0.860
 Heidi-0.799
 ranger-0.793
Token would
Token ID561
Feature activation+1.59e+00

Pos logit contributions
 stranger+0.273
 keeper+0.256
 stall+0.250
 Heidi+0.233
Stan+0.225
Neg logit contributions
ug-0.588
idd-0.558
ooming-0.544
zz-0.533
irled-0.531
Token go
Token ID467
Feature activation+4.58e-01

Pos logit contributions
 stranger+2.284
 keeper+2.209
 stall+2.101
 Heidi+2.076
 ranger+2.060
Neg logit contributions
ug-3.441
idd-3.131
ooming-3.130
zz-3.097
irled-3.002
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
 stranger+0.709
 keeper+0.674
 stall+0.661
 Heidi+0.636
Stan+0.627
Neg logit contributions
ug-0.951
idd-0.895
ooming-0.881
zz-0.848
ips-0.839
Token the
Token ID262
Feature activation+4.08e+00

Pos logit contributions
 stranger+1.654
 keeper+1.616
 stall+1.526
 ranger+1.491
Stan+1.476
Neg logit contributions
ug-2.123
idd-1.971
ooming-1.955
zz-1.863
ips-1.837
Token field
Token ID2214
Feature activation+1.18e+00

Pos logit contributions
 Heidi+6.644
 Stanley+6.186
Stan+6.051
isi+6.022
 keeper+5.999
Neg logit contributions
ug-7.938
idd-7.923
ooming-7.616
 stacking-7.393
 spinning-7.248
Token to
Token ID284
Feature activation+1.67e+00

Pos logit contributions
 stranger+1.931
 keeper+1.778
 Heidi+1.737
 stall+1.718
Stan+1.714
Neg logit contributions
ug-2.406
ooming-2.199
idd-2.196
ips-2.103
zz-2.060
Token do
Token ID466
Feature activation+4.17e-02

Pos logit contributions
 stranger+2.752
 keeper+2.662
 stall+2.512
Stan+2.465
 ranger+2.454
Neg logit contributions
ug-3.225
idd-2.996
ooming-2.944
zz-2.880
irled-2.790
Token his
Token ID465
Feature activation+1.63e+00

Pos logit contributions
 stranger+0.050
 keeper+0.047
 stall+0.046
 Heidi+0.042
 seller+0.042
Neg logit contributions
ug-0.102
idd-0.096
ooming-0.095
irled-0.092
med-0.092
Token work
Token ID670
Feature activation-5.46e-01

Pos logit contributions
 stranger+2.842
 keeper+2.671
 Heidi+2.568
 stall+2.562
Stan+2.543
Neg logit contributions
ug-3.046
ooming-2.866
idd-2.802
irled-2.784
zz-2.733
Token"
Token ID1
Feature activation+8.90e-01

Pos logit contributions
 stranger+1.340
 stall+1.252
 keeper+1.212
 Heidi+1.177
 Stanley+1.166
Neg logit contributions
ug-2.045
idd-1.963
ooming-1.873
irled-1.851
med-1.847
TokenLet
Token ID5756
Feature activation+1.43e+00

Pos logit contributions
 stranger+1.412
 stall+1.361
 keeper+1.342
 Heidi+1.267
 camel+1.266
Neg logit contributions
ug-1.782
idd-1.731
ooming-1.620
irled-1.585
zz-1.585
Token's
Token ID338
Feature activation+9.28e-01

Pos logit contributions
 stranger+1.808
 stall+1.790
 keeper+1.767
 ranger+1.663
 seller+1.589
Neg logit contributions
ug-3.343
idd-3.123
ooming-3.086
zz-3.062
irled-3.018
Token go
Token ID467
Feature activation-3.24e-01

Pos logit contributions
 stranger+1.227
 keeper+1.157
 stall+1.114
 ranger+1.066
 Heidi+1.049
Neg logit contributions
ug-2.153
idd-2.006
ooming-1.967
zz-1.947
irled-1.900
Token to
Token ID284
Feature activation+7.39e-01

Pos logit contributions
ug+0.690
idd+0.651
ooming+0.633
irled+0.611
med+0.609
Neg logit contributions
 stranger-0.492
 keeper-0.467
 stall-0.461
 Heidi-0.428
 ranger-0.427
Token the
Token ID262
Feature activation+4.06e+00

Pos logit contributions
 stranger+1.133
 keeper+1.088
 stall+1.054
 seller+1.008
 ranger+0.992
Neg logit contributions
ug-1.532
idd-1.437
ooming-1.399
zz-1.358
med-1.340
Token west
Token ID7421
Feature activation+4.71e-01

Pos logit contributions
 Heidi+5.287
 keeper+5.068
Stan+4.966
holder+4.933
 ranger+4.908
Neg logit contributions
ug-8.511
idd-8.384
ooming-8.220
 spinning-8.135
 pract-8.094
Token!"
Token ID2474
Feature activation-3.71e-01

Pos logit contributions
 stranger+0.759
 keeper+0.703
 stall+0.700
 Heidi+0.679
Stan+0.671
Neg logit contributions
ug-0.952
idd-0.889
ooming-0.866
zz-0.830
ips-0.822
Token said
Token ID531
Feature activation-6.95e-01

Pos logit contributions
ug+0.838
idd+0.793
ooming+0.776
zz+0.744
ips+0.744
Neg logit contributions
 stranger-0.518
 keeper-0.491
 stall-0.479
 Heidi-0.446
 ranger-0.445
Token Jill
Token ID22772
Feature activation+4.70e-02

Pos logit contributions
ug+1.268
idd+1.186
ooming+1.161
ips+1.103
zz+1.099
Neg logit contributions
 stranger-1.271
 keeper-1.228
 stall-1.189
 Heidi-1.142
 ranger-1.141
Token.
Token ID13
Feature activation-3.61e-01

Pos logit contributions
 stranger+0.062
 keeper+0.058
 stall+0.057
 Heidi+0.053
Stan+0.053
Neg logit contributions
ug-0.109
idd-0.103
ooming-0.101
med-0.098
zz-0.097
Token special
Token ID2041
Feature activation+1.03e+00

Pos logit contributions
 stranger+2.470
 keeper+2.430
 stall+2.390
 Heidi+2.292
Stan+2.289
Neg logit contributions
ug-3.838
idd-3.581
ooming-3.477
irled-3.457
zz-3.378
Token.
Token ID13
Feature activation+3.92e-01

Pos logit contributions
 stranger+1.590
 keeper+1.511
 stall+1.469
Stan+1.442
 Heidi+1.430
Neg logit contributions
ug-2.134
idd-1.986
ooming-1.910
ged-1.883
ips-1.880
Token She
Token ID1375
Feature activation+3.82e-01

Pos logit contributions
 stranger+0.489
 keeper+0.458
 stall+0.442
 seller+0.411
 ranger+0.407
Neg logit contributions
ug-0.933
idd-0.885
ooming-0.867
zz-0.847
med-0.846
Token went
Token ID1816
Feature activation+6.33e-01

Pos logit contributions
 stranger+0.586
 keeper+0.551
 stall+0.540
 Heidi+0.518
Stan+0.506
Neg logit contributions
ug-0.807
idd-0.766
ooming-0.737
irled-0.724
zz-0.718
Token to
Token ID284
Feature activation+1.37e+00

Pos logit contributions
 stranger+0.996
 keeper+0.950
 stall+0.927
 Heidi+0.894
Stan+0.880
Neg logit contributions
ug-1.287
idd-1.212
ooming-1.199
zz-1.157
ips-1.149
Token the
Token ID262
Feature activation+4.02e+00

Pos logit contributions
 stranger+2.170
 keeper+2.118
 stall+2.009
 seller+1.968
 ranger+1.948
Neg logit contributions
ug-2.714
idd-2.524
ooming-2.507
zz-2.476
irled-2.405
Token forest
Token ID8222
Feature activation+1.21e+00

Pos logit contributions
 Heidi+6.509
 keeper+5.945
 Stanley+5.924
 Clover+5.869
Stan+5.802
Neg logit contributions
idd-8.273
 mar-7.470
 stacking-7.436
ooming-7.400
 pol-7.387
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
 stranger+2.065
 keeper+1.884
 Heidi+1.872
Stan+1.859
 stall+1.848
Neg logit contributions
ug-2.336
idd-2.178
ooming-2.154
zz-2.033
ged-2.028
Token she
Token ID673
Feature activation+7.90e-01

Pos logit contributions
 stranger+2.129
 keeper+2.008
 stall+1.976
 truth+1.944
Stan+1.895
Neg logit contributions
ug-2.498
idd-2.437
ged-2.357
zz-2.324
ooming-2.315
Token looked
Token ID3114
Feature activation+8.91e-01

Pos logit contributions
 stranger+1.242
 keeper+1.147
 stall+1.127
 Heidi+1.099
 truth+1.071
Neg logit contributions
ug-1.647
idd-1.592
ooming-1.517
irled-1.499
zz-1.482
Token for
Token ID329
Feature activation+1.86e+00

Pos logit contributions
 stranger+1.633
 keeper+1.608
 stall+1.546
Stan+1.488
 Heidi+1.487
Neg logit contributions
ug-1.588
idd-1.475
ooming-1.448
ged-1.410
irled-1.402
Token but
Token ID475
Feature activation+1.42e+00

Pos logit contributions
 stranger+1.121
 keeper+1.051
 stall+1.035
 Heidi+0.996
 seller+0.992
Neg logit contributions
ug-1.375
idd-1.270
ooming-1.256
zz-1.210
med-1.200
Token when
Token ID618
Feature activation+4.36e-01

Pos logit contributions
 stranger+2.226
 stall+2.091
 keeper+2.074
 butcher+2.016
Stan+2.013
Neg logit contributions
ug-2.905
idd-2.745
ged-2.686
ooming-2.615
irled-2.613
Token she
Token ID673
Feature activation+1.37e-01

Pos logit contributions
 stranger+0.595
 keeper+0.567
 stall+0.563
 seller+0.521
Stan+0.517
Neg logit contributions
ug-0.972
idd-0.923
ooming-0.901
med-0.874
irled-0.872
Token arrived
Token ID5284
Feature activation+1.42e+00

Pos logit contributions
 stranger+0.215
 keeper+0.203
 stall+0.200
 Heidi+0.190
 ranger+0.187
Neg logit contributions
ug-0.285
idd-0.270
ooming-0.263
zz-0.253
irled-0.253
Token at
Token ID379
Feature activation+7.03e-01

Pos logit contributions
 stranger+2.639
 keeper+2.559
 stall+2.531
Stan+2.475
 truth+2.463
Neg logit contributions
ug-2.496
ooming-2.422
idd-2.322
ged-2.258
ips-2.208
Token the
Token ID262
Feature activation+4.00e+00

Pos logit contributions
 stranger+0.991
 keeper+0.931
 stall+0.915
 seller+0.867
Stan+0.866
Neg logit contributions
ug-1.545
idd-1.462
ooming-1.414
ips-1.379
zz-1.376
Token first
Token ID717
Feature activation+1.52e+00

Pos logit contributions
 Heidi+7.183
 stranger+6.225
 Stanley+6.199
Stan+6.195
 Sonia+6.141
Neg logit contributions
ug-7.799
idd-7.271
 spinning-6.977
 pract-6.932
ooming-6.895
Token place
Token ID1295
Feature activation+9.56e-01

Pos logit contributions
 stranger+2.023
 keeper+1.920
 butcher+1.855
 Heidi+1.829
holder+1.821
Neg logit contributions
ug-3.520
idd-3.132
ooming-3.106
zz-3.068
ged-2.963
Token,
Token ID11
Feature activation+8.13e-01

Pos logit contributions
 stranger+1.617
 stall+1.513
Stan+1.497
 keeper+1.480
 butcher+1.455
Neg logit contributions
ug-1.864
idd-1.741
ooming-1.698
ips-1.629
irled-1.623
Token they
Token ID484
Feature activation+1.24e+00

Pos logit contributions
 stranger+1.091
 keeper+1.054
 stall+1.047
 seller+0.965
Stan+0.964
Neg logit contributions
ug-1.817
idd-1.715
ooming-1.663
med-1.635
ged-1.628
Token were
Token ID547
Feature activation+2.59e-01

Pos logit contributions
 stranger+2.052
 keeper+1.980
 stall+1.974
 Heidi+1.908
Stan+1.856
Neg logit contributions
ug-2.462
idd-2.284
ged-2.221
ooming-2.203
zz-2.146
Token\n
Token ID198
Feature activation+6.46e-01

Pos logit contributions
ug+0.736
idd+0.700
ooming+0.683
irled+0.660
zz+0.659
Neg logit contributions
 stranger-0.404
 keeper-0.380
 stall-0.372
 Heidi-0.343
 ranger-0.342
TokenSo
Token ID2396
Feature activation-1.01e+00

Pos logit contributions
 stranger+1.199
 stall+1.158
 keeper+1.144
 Heidi+1.100
 seller+1.085
Neg logit contributions
ug-1.147
idd-1.070
ooming-1.031
med-1.007
irled-0.987
Token they
Token ID484
Feature activation+8.58e-01

Pos logit contributions
ug+2.069
idd+2.017
ooming+1.928
ips+1.874
zz+1.836
Neg logit contributions
 stranger-1.608
 keeper-1.535
 stall-1.481
 Stanley-1.429
 Heidi-1.419
Token went
Token ID1816
Feature activation+7.08e-01

Pos logit contributions
 stranger+1.500
 keeper+1.442
 stall+1.403
 Heidi+1.347
 truth+1.337
Neg logit contributions
ug-1.641
idd-1.525
zz-1.470
ooming-1.450
med-1.447
Token to
Token ID284
Feature activation+1.14e+00

Pos logit contributions
 stranger+0.984
 keeper+0.933
 stall+0.902
 truth+0.865
 Heidi+0.863
Neg logit contributions
ug-1.564
idd-1.500
ooming-1.467
ips-1.425
zz-1.414
Token the
Token ID262
Feature activation+4.00e+00

Pos logit contributions
 stranger+1.824
 keeper+1.731
 stall+1.646
 seller+1.590
 ranger+1.579
Neg logit contributions
ug-2.240
idd-2.150
ooming-2.103
zz-2.020
ged-1.994
Token store
Token ID3650
Feature activation+1.35e+00

Pos logit contributions
 Heidi+6.826
 stranger+6.470
 Sonia+6.468
holder+6.394
Stan+6.394
Neg logit contributions
idd-6.961
 stacking-6.572
ooming-6.399
bed-6.194
 shaping-6.054
Token and
Token ID290
Feature activation+8.64e-01

Pos logit contributions
 stranger+2.372
Stan+2.130
 camel+2.129
 Heidi+2.127
 keeper+2.107
Neg logit contributions
ug-2.539
idd-2.464
ooming-2.372
ged-2.258
ips-2.229
Token asked
Token ID1965
Feature activation+6.83e-01

Pos logit contributions
 stranger+1.439
 keeper+1.351
 stall+1.321
 truth+1.295
Stan+1.259
Neg logit contributions
ug-1.661
idd-1.619
ooming-1.531
med-1.520
ged-1.516
Token the
Token ID262
Feature activation+3.01e+00

Pos logit contributions
 stranger+1.197
 keeper+1.151
 stall+1.138
 ranger+1.070
 bench+1.069
Neg logit contributions
ug-1.318
idd-1.186
ooming-1.163
med-1.134
ged-1.130
Token shop
Token ID6128
Feature activation+3.03e+00

Pos logit contributions
 Heidi+3.727
 Stanley+3.625
 stranger+3.475
Stan+3.462
 Sonia+3.381
Neg logit contributions
idd-6.650
ug-6.229
ooming-6.055
ips-6.050
 stacking-5.693
Token and
Token ID290
Feature activation+6.68e-01

Pos logit contributions
ug+1.408
idd+1.322
ooming+1.294
irled+1.245
ips+1.234
Neg logit contributions
 stranger-1.074
 keeper-1.016
 stall-1.004
 seller-0.948
 ranger-0.932
Token her
Token ID607
Feature activation+3.63e-01

Pos logit contributions
 stranger+0.888
 keeper+0.863
 stall+0.841
Stan+0.795
 ranger+0.780
Neg logit contributions
ug-1.516
idd-1.431
ooming-1.367
zz-1.352
ged-1.346
Token daughter
Token ID4957
Feature activation+3.07e-01

Pos logit contributions
 stranger+0.530
 keeper+0.505
 stall+0.493
 Heidi+0.468
Stan+0.467
Neg logit contributions
ug-0.775
idd-0.736
ooming-0.715
zz-0.692
irled-0.686
Token went
Token ID1816
Feature activation+9.34e-02

Pos logit contributions
 stranger+0.387
 keeper+0.366
 stall+0.356
 Heidi+0.330
 ranger+0.330
Neg logit contributions
ug-0.723
idd-0.686
ooming-0.669
zz-0.651
med-0.646
Token to
Token ID284
Feature activation+4.10e-01

Pos logit contributions
 stranger+0.122
 keeper+0.115
 stall+0.112
 Heidi+0.105
Stan+0.104
Neg logit contributions
ug-0.217
idd-0.206
ooming-0.202
ips-0.195
zz-0.194
Token the
Token ID262
Feature activation+3.99e+00

Pos logit contributions
 stranger+0.608
 keeper+0.583
 stall+0.563
 ranger+0.534
 seller+0.531
Neg logit contributions
ug-0.872
idd-0.826
ooming-0.806
zz-0.781
irled-0.775
Token field
Token ID2214
Feature activation+1.36e+00

Pos logit contributions
 keeper+5.740
 ranger+5.727
 stranger+5.178
 sir+5.065
 Heidi+5.035
Neg logit contributions
idd-7.875
ooming-7.741
 pract-7.732
 sudden-7.711
 stacking-7.699
Token to
Token ID284
Feature activation+1.53e+00

Pos logit contributions
 stranger+2.063
 keeper+1.859
Stan+1.839
 Heidi+1.837
 butcher+1.812
Neg logit contributions
ug-2.931
idd-2.735
ooming-2.701
ips-2.616
irled-2.561
Token test
Token ID1332
Feature activation-3.36e-01

Pos logit contributions
 stranger+2.454
 keeper+2.383
 stall+2.208
 ranger+2.189
Stan+2.133
Neg logit contributions
ug-2.982
idd-2.841
ooming-2.772
zz-2.757
irled-2.662
Token the
Token ID262
Feature activation+2.17e+00

Pos logit contributions
ug+0.769
idd+0.730
ooming+0.712
irled+0.686
zz+0.686
Neg logit contributions
 stranger-0.454
 keeper-0.427
 stall-0.420
 Heidi-0.389
 ranger-0.385
Token corn
Token ID11676
Feature activation+8.49e-01

Pos logit contributions
 ranger+2.230
 keeper+2.204
 stranger+2.146
 Heidi+1.829
Stan+1.824
Neg logit contributions
ug-5.359
zz-5.055
idd-5.011
ooming-4.986
irled-4.942
Token,
Token ID11
Feature activation-3.72e-01

Pos logit contributions
ug+0.822
idd+0.781
ooming+0.757
irled+0.727
med+0.725
Neg logit contributions
 stranger-0.636
 keeper-0.604
 stall-0.594
 ranger-0.557
 Heidi-0.557
Token Sam
Token ID3409
Feature activation+1.41e+00

Pos logit contributions
ug+0.862
idd+0.822
ooming+0.799
irled+0.772
zz+0.770
Neg logit contributions
 stranger-0.496
 keeper-0.466
 stall-0.457
 Heidi-0.423
 ranger-0.422
Token would
Token ID561
Feature activation+1.58e+00

Pos logit contributions
 stranger+2.007
 keeper+1.916
 stall+1.824
 Heidi+1.793
Stan+1.759
Neg logit contributions
ug-3.051
idd-2.890
ooming-2.809
zz-2.765
irled-2.741
Token go
Token ID467
Feature activation+6.27e-01

Pos logit contributions
 stranger+2.228
 keeper+2.154
Stan+2.038
 Heidi+2.038
 stall+2.014
Neg logit contributions
ug-3.362
idd-3.138
ooming-3.083
zz-2.998
irled-2.981
Token to
Token ID284
Feature activation+1.99e+00

Pos logit contributions
 stranger+0.939
 keeper+0.886
 stall+0.859
 Heidi+0.841
Stan+0.829
Neg logit contributions
ug-1.316
idd-1.252
ooming-1.228
ips-1.180
zz-1.176
Token the
Token ID262
Feature activation+3.99e+00

Pos logit contributions
 stranger+3.437
 keeper+3.377
Stan+3.179
 ranger+3.151
 camel+3.108
Neg logit contributions
ug-3.427
idd-3.288
ooming-3.206
ips-3.081
 pract-3.028
Token market
Token ID1910
Feature activation+2.06e+00

Pos logit contributions
 Heidi+4.380
 Stanley+4.014
 Clover+3.938
asha+3.909
oto+3.851
Neg logit contributions
idd-9.982
 stacking-9.180
ooming-9.144
ips-8.862
ug-8.786
Token with
Token ID351
Feature activation+1.37e+00

Pos logit contributions
 stranger+4.300
 Heidi+4.008
 camel+3.964
Stan+3.936
 Stanley+3.908
Neg logit contributions
ug-3.205
idd-2.941
ooming-2.897
ips-2.770
bed-2.694
Token his
Token ID465
Feature activation+1.33e+00

Pos logit contributions
 stranger+2.100
 keeper+2.009
 stall+1.920
Stan+1.865
 ranger+1.827
Neg logit contributions
ug-2.882
idd-2.721
med-2.610
ips-2.599
ooming-2.594
Token family
Token ID1641
Feature activation+1.17e+00

Pos logit contributions
 stranger+2.103
 keeper+1.970
 Heidi+1.947
Stan+1.946
 Clover+1.881
Neg logit contributions
ug-2.544
idd-2.497
ooming-2.362
ips-2.247
irled-2.239
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 stranger+1.776
 keeper+1.679
 Heidi+1.641
Stan+1.611
 stall+1.594
Neg logit contributions
ug-2.445
idd-2.283
ooming-2.232
med-2.138
ips-2.136
Token time
Token ID640
Feature activation-1.08e-01

Pos logit contributions
 stranger+2.266
 keeper+2.203
 stall+2.181
 seller+2.095
 ranger+2.069
Neg logit contributions
ug-1.720
idd-1.532
zz-1.457
ooming-1.448
irled-1.435
Token to
Token ID284
Feature activation+1.43e+00

Pos logit contributions
ug+0.247
idd+0.234
ooming+0.228
irled+0.220
zz+0.220
Neg logit contributions
 stranger-0.144
 keeper-0.136
 stall-0.134
 Heidi-0.124
 ranger-0.123
Token sleep
Token ID3993
Feature activation+2.96e-01

Pos logit contributions
 stranger+2.117
 keeper+2.008
 stall+2.000
 ranger+1.845
 Heidi+1.840
Neg logit contributions
ug-3.153
zz-2.878
ooming-2.871
idd-2.821
irled-2.729
Token!"
Token ID2474
Feature activation+2.36e+00

Pos logit contributions
 stranger+0.455
 keeper+0.434
 stall+0.426
 Heidi+0.400
 seller+0.399
Neg logit contributions
ug-0.621
idd-0.580
ooming-0.568
irled-0.546
zz-0.543
Token\n
Token ID198
Feature activation+2.76e+00

Pos logit contributions
 keeper+3.889
 stranger+3.874
 butcher+3.771
 stall+3.726
Stan+3.705
Neg logit contributions
ug-4.300
ged-3.983
irled-3.982
idd-3.956
bed-3.792
Token\n
Token ID198
Feature activation+3.99e+00

Pos logit contributions
 stranger+4.822
 keeper+4.762
 butcher+4.504
 ranc+4.503
 camel+4.486
Neg logit contributions
ug-5.007
idd-4.749
ooming-4.534
irled-4.458
ged-4.374
TokenDave
Token ID27984
Feature activation+4.02e-01

Pos logit contributions
 keeper+6.388
 truth+6.352
 address+6.254
 chest+6.217
 camel+6.165
Neg logit contributions
idd-7.679
ug-7.661
ooming-7.422
irled-7.302
ged-7.276
Token sighed
Token ID21893
Feature activation+7.41e-01

Pos logit contributions
 stranger+0.607
 stall+0.564
 keeper+0.562
 Heidi+0.535
Stan+0.521
Neg logit contributions
ug-0.867
idd-0.813
ooming-0.794
zz-0.777
irled-0.773
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
 stranger+1.159
 keeper+1.097
 stall+1.091
Stan+1.029
 Heidi+1.029
Neg logit contributions
ug-1.524
ooming-1.406
idd-1.398
ged-1.354
irled-1.347
Token He
Token ID679
Feature activation+1.51e+00

Pos logit contributions
 stranger+1.770
 keeper+1.603
 stall+1.582
 camel+1.547
Stan+1.541
Neg logit contributions
ug-2.973
idd-2.865
ooming-2.714
ips-2.654
irled-2.648
Token wanted
Token ID2227
Feature activation+1.40e+00

Pos logit contributions
 stranger+2.598
 stall+2.509
 keeper+2.485
 Heidi+2.345
 seller+2.331
Neg logit contributions
ug-2.859
idd-2.674
irled-2.671
zz-2.646
ooming-2.581
Token carrot
Token ID40562
Feature activation+1.47e+00

Pos logit contributions
 keeper+2.050
 stranger+2.020
 Heidi+1.990
 stall+1.903
Stan+1.892
Neg logit contributions
ug-3.379
idd-3.309
ooming-3.235
zz-3.111
apy-3.051
Token.
Token ID13
Feature activation+6.95e-01

Pos logit contributions
 stranger+2.325
 keeper+2.153
 Heidi+2.106
 stall+2.089
Stan+2.074
Neg logit contributions
ug-3.042
idd-2.828
ooming-2.764
bed-2.683
zz-2.673
Token He
Token ID679
Feature activation+1.77e+00

Pos logit contributions
 stranger+0.921
 keeper+0.852
 stall+0.838
Stan+0.774
 seller+0.768
Neg logit contributions
ug-1.628
idd-1.512
ooming-1.501
zz-1.454
irled-1.451
Token went
Token ID1816
Feature activation+8.33e-01

Pos logit contributions
 stranger+3.223
 keeper+3.080
 stall+3.028
 truth+2.934
 Heidi+2.927
Neg logit contributions
ug-3.155
idd-2.950
irled-2.915
zz-2.875
ged-2.846
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
 stranger+1.158
 keeper+1.103
 stall+1.073
 Heidi+1.030
 truth+1.014
Neg logit contributions
ug-1.814
idd-1.732
ooming-1.707
zz-1.666
ips-1.656
Token the
Token ID262
Feature activation+3.96e+00

Pos logit contributions
 stranger+2.032
 keeper+1.961
 stall+1.872
 seller+1.789
 ranger+1.769
Neg logit contributions
ug-2.543
idd-2.390
ooming-2.336
zz-2.305
irled-2.247
Token garden
Token ID11376
Feature activation+1.02e+00

Pos logit contributions
 Heidi+6.668
 keeper+6.616
 stranger+6.486
 Sonia+6.308
 Clover+6.199
Neg logit contributions
idd-6.939
bed-6.575
 stacking-6.571
ooming-6.484
ug-6.345
Token to
Token ID284
Feature activation+2.18e+00

Pos logit contributions
 stranger+1.597
 keeper+1.467
 stall+1.403
 Heidi+1.401
 truth+1.381
Neg logit contributions
ug-2.110
idd-2.007
ooming-1.966
zz-1.872
ips-1.871
Token find
Token ID1064
Feature activation+2.27e+00

Pos logit contributions
 stranger+3.656
 keeper+3.521
 stall+3.334
Stan+3.254
holder+3.170
Neg logit contributions
ug-3.891
idd-3.839
zz-3.715
ooming-3.695
irled-3.625
Token one
Token ID530
Feature activation+1.94e+00

Pos logit contributions
 keeper+3.665
 stranger+3.590
 stall+3.409
 ranger+3.336
 bench+3.139
Neg logit contributions
ug-4.675
idd-4.346
med-4.227
ged-4.142
zz-4.104
Token.
Token ID13
Feature activation-4.88e-01

Pos logit contributions
 stranger+3.331
 keeper+3.081
Stan+3.028
 Heidi+2.994
 stall+2.962
Neg logit contributions
ug-3.621
idd-3.359
zz-3.228
ooming-3.174
ig-3.143
Token One
Token ID1881
Feature activation+1.59e+00

Pos logit contributions
 stranger+0.249
 keeper+0.232
 stall+0.228
 seller+0.211
 Heidi+0.210
Neg logit contributions
ug-0.415
idd-0.394
ooming-0.382
med-0.369
irled-0.369
Token day
Token ID1110
Feature activation-3.40e-01

Pos logit contributions
 stranger+3.527
 stall+3.502
 keeper+3.487
Stan+3.382
 seller+3.379
Neg logit contributions
ug-2.179
zz-1.848
idd-1.826
ooming-1.723
anged-1.655
Token he
Token ID339
Feature activation+8.61e-01

Pos logit contributions
ug+0.704
idd+0.665
ooming+0.645
irled+0.622
ips+0.620
Neg logit contributions
 stranger-0.539
 keeper-0.510
 stall-0.504
 Heidi-0.473
 ranger-0.469
Token went
Token ID1816
Feature activation+1.44e+00

Pos logit contributions
 stranger+1.301
 stall+1.251
 keeper+1.243
 Heidi+1.181
Stan+1.144
Neg logit contributions
ug-1.817
idd-1.727
ooming-1.666
irled-1.646
zz-1.642
Token to
Token ID284
Feature activation+8.97e-01

Pos logit contributions
 stranger+2.265
 keeper+2.220
 stall+2.187
Stan+2.092
 Heidi+2.085
Neg logit contributions
ug-2.842
ooming-2.693
idd-2.641
ips-2.572
med-2.556
Token the
Token ID262
Feature activation+3.96e+00

Pos logit contributions
 stranger+1.368
 keeper+1.316
 stall+1.276
 seller+1.221
 ranger+1.202
Neg logit contributions
ug-1.838
idd-1.723
ooming-1.699
zz-1.641
med-1.614
Token pond
Token ID16723
Feature activation+1.52e+00

Pos logit contributions
 Heidi+5.723
 keeper+5.471
 Sonia+5.394
 Clover+5.393
 stranger+5.228
Neg logit contributions
idd-7.945
ug-7.673
ooming-7.651
 stacking-7.579
 drummer-7.486
Token to
Token ID284
Feature activation+1.30e+00

Pos logit contributions
 stranger+2.664
 keeper+2.471
 Heidi+2.451
 stall+2.398
Stan+2.397
Neg logit contributions
ug-2.905
idd-2.681
ooming-2.675
ips-2.494
bed-2.491
Token drink
Token ID4144
Feature activation+1.42e+00

Pos logit contributions
 stranger+2.263
 keeper+2.160
 stall+2.137
Stan+2.032
 Heidi+2.016
Neg logit contributions
ug-2.371
idd-2.239
ooming-2.197
zz-2.124
irled-2.088
Token some
Token ID617
Feature activation+2.72e-01

Pos logit contributions
 stranger+2.176
 keeper+2.130
 stall+2.094
 Heidi+1.985
Stan+1.951
Neg logit contributions
ug-2.970
idd-2.736
med-2.679
ooming-2.659
ged-2.623
Token water
Token ID1660
Feature activation+8.79e-01

Pos logit contributions
 stranger+0.375
 keeper+0.361
 stall+0.352
 Heidi+0.330
 ranger+0.327
Neg logit contributions
ug-0.609
idd-0.575
ooming-0.560
irled-0.545
med-0.541
Token So
Token ID1406
Feature activation-8.82e-01

Pos logit contributions
 stranger+0.523
 keeper+0.485
 stall+0.478
 Heidi+0.443
Stan+0.440
Neg logit contributions
ug-0.845
idd-0.799
ooming-0.777
irled-0.752
zz-0.749
Token,
Token ID11
Feature activation-7.48e-01

Pos logit contributions
ug+1.918
idd+1.865
ooming+1.781
ips+1.745
med+1.709
Neg logit contributions
 stranger-1.310
 keeper-1.239
 stall-1.211
 Stanley-1.158
 Heidi-1.152
Token he
Token ID339
Feature activation+8.11e-01

Pos logit contributions
ug+1.671
idd+1.601
ooming+1.543
ips+1.500
zz+1.491
Neg logit contributions
 stranger-1.081
 keeper-1.004
 stall-0.986
 Stanley-0.934
 Heidi-0.931
Token went
Token ID1816
Feature activation+8.34e-01

Pos logit contributions
 stranger+1.336
 keeper+1.273
 stall+1.260
 Heidi+1.214
 truth+1.185
Neg logit contributions
ug-1.598
idd-1.509
zz-1.430
ooming-1.429
irled-1.425
Token to
Token ID284
Feature activation+5.70e-01

Pos logit contributions
 stranger+1.312
 keeper+1.256
 stall+1.232
 Heidi+1.182
 truth+1.165
Neg logit contributions
ug-1.675
idd-1.601
ooming-1.564
zz-1.521
ips-1.509
Token the
Token ID262
Feature activation+3.95e+00

Pos logit contributions
 stranger+0.843
 keeper+0.800
 stall+0.781
Stan+0.730
 Heidi+0.728
Neg logit contributions
ug-1.215
idd-1.153
ooming-1.114
zz-1.082
irled-1.071
Token garden
Token ID11376
Feature activation+9.74e-01

Pos logit contributions
 Heidi+6.147
 keeper+5.910
 stranger+5.899
 Sonia+5.779
holder+5.731
Neg logit contributions
idd-7.393
bed-6.960
ooming-6.939
 stacking-6.818
ug-6.723
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
 stranger+1.571
 keeper+1.445
 Heidi+1.402
 stall+1.401
Stan+1.377
Neg logit contributions
ug-1.969
idd-1.873
ooming-1.830
med-1.730
ips-1.729
Token pick
Token ID2298
Feature activation+1.18e+00

Pos logit contributions
 stranger+2.023
 keeper+1.927
 stall+1.864
Stan+1.801
 Heidi+1.785
Neg logit contributions
ug-2.480
idd-2.404
ooming-2.282
zz-2.264
irled-2.230
Token some
Token ID617
Feature activation+9.63e-01

Pos logit contributions
 stranger+1.739
 keeper+1.717
 stall+1.609
 ranger+1.568
 bench+1.483
Neg logit contributions
ug-2.530
idd-2.407
med-2.347
ooming-2.309
ged-2.287
Token fresh
Token ID4713
Feature activation-2.02e-01

Pos logit contributions
 stranger+1.137
 keeper+1.118
 stall+1.027
 ranger+1.010
 Heidi+0.992
Neg logit contributions
ug-2.319
idd-2.178
ooming-2.137
med-2.117
irled-2.087
Token hungry
Token ID14720
Feature activation+1.60e+00

Pos logit contributions
 stranger+2.462
 stall+2.456
 keeper+2.344
 bench+2.259
 Heidi+2.176
Neg logit contributions
ug-4.073
idd-3.901
irled-3.675
ooming-3.656
ged-3.585
Token.
Token ID13
Feature activation+7.89e-01

Pos logit contributions
 stranger+2.534
 stall+2.396
 keeper+2.385
Stan+2.359
 Heidi+2.337
Neg logit contributions
ug-3.240
idd-2.950
ooming-2.882
ips-2.842
med-2.837
Token He
Token ID679
Feature activation+1.10e+00

Pos logit contributions
 stranger+1.042
 keeper+0.943
 stall+0.936
 camel+0.866
 bench+0.857
Neg logit contributions
ug-1.847
idd-1.752
ooming-1.674
med-1.640
ips-1.632
Token headed
Token ID9153
Feature activation+9.91e-01

Pos logit contributions
 stranger+1.793
 stall+1.689
 keeper+1.671
 Heidi+1.621
 truth+1.595
Neg logit contributions
ug-2.158
idd-2.050
irled-1.948
zz-1.928
ooming-1.925
Token to
Token ID284
Feature activation+1.69e+00

Pos logit contributions
 stranger+1.507
 keeper+1.426
 stall+1.382
 truth+1.359
Stan+1.334
Neg logit contributions
ug-2.047
idd-1.937
ooming-1.907
zz-1.854
ged-1.846
Token the
Token ID262
Feature activation+3.95e+00

Pos logit contributions
 stranger+2.790
 keeper+2.681
 stall+2.543
 seller+2.474
 ranger+2.464
Neg logit contributions
ug-3.183
idd-3.014
ooming-2.929
ips-2.828
zz-2.820
Token west
Token ID7421
Feature activation+1.59e+00

Pos logit contributions
 Heidi+5.007
 keeper+4.981
oto+4.915
Stan+4.820
 Stanley+4.709
Neg logit contributions
ooming-7.919
ug-7.852
idd-7.852
 pract-7.624
 practicing-7.502
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 stranger+2.845
 Heidi+2.621
Stan+2.614
 butcher+2.560
 truth+2.559
Neg logit contributions
ug-2.898
idd-2.645
ooming-2.611
ips-2.461
zz-2.423
Token began
Token ID2540
Feature activation+3.07e-01

Pos logit contributions
 stranger+1.853
 stall+1.736
 keeper+1.730
 truth+1.708
 Heidi+1.635
Neg logit contributions
ug-2.051
idd-1.980
ged-1.869
ooming-1.862
med-1.861
Token to
Token ID284
Feature activation+1.43e+00

Pos logit contributions
 stranger+0.419
 keeper+0.392
 stall+0.382
 Heidi+0.357
 seller+0.355
Neg logit contributions
ug-0.697
idd-0.664
ooming-0.650
med-0.628
irled-0.626
Token twist
Token ID14528
Feature activation-1.58e-01

Pos logit contributions
 stranger+2.216
 keeper+2.095
 stall+2.045
 ranger+1.929
 butcher+1.899
Neg logit contributions
ug-2.818
ooming-2.701
idd-2.689
zz-2.649
irled-2.554
Token So
Token ID1406
Feature activation+5.05e-01

Pos logit contributions
 stranger+1.572
 keeper+1.456
 stall+1.442
 camel+1.356
 seller+1.354
Neg logit contributions
ug-2.052
idd-1.970
ooming-1.870
irled-1.846
ips-1.828
Token,
Token ID11
Feature activation+6.32e-01

Pos logit contributions
 stranger+0.893
 keeper+0.850
 stall+0.846
 seller+0.791
 ranger+0.791
Neg logit contributions
ug-0.950
idd-0.872
ooming-0.858
irled-0.833
ged-0.831
Token she
Token ID673
Feature activation+1.85e-01

Pos logit contributions
 stranger+0.842
 keeper+0.806
 stall+0.800
 seller+0.727
 ranger+0.727
Neg logit contributions
ug-1.435
idd-1.354
ooming-1.325
ged-1.302
irled-1.297
Token went
Token ID1816
Feature activation+6.35e-01

Pos logit contributions
 stranger+0.305
 keeper+0.290
 stall+0.287
 Heidi+0.272
 ranger+0.267
Neg logit contributions
ug-0.367
idd-0.346
ooming-0.333
irled-0.324
zz-0.324
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
 stranger+0.897
 keeper+0.845
 stall+0.824
 Heidi+0.795
 truth+0.790
Neg logit contributions
ug-1.394
idd-1.331
ooming-1.304
ips-1.264
zz-1.253
Token the
Token ID262
Feature activation+3.95e+00

Pos logit contributions
 stranger+1.742
 keeper+1.674
 stall+1.598
 seller+1.536
 ranger+1.520
Neg logit contributions
ug-2.285
idd-2.167
ooming-2.114
zz-2.070
ips-2.024
Token gate
Token ID8946
Feature activation+1.30e+00

Pos logit contributions
 Clover+5.556
 Heidi+5.452
 Stanley+5.381
 Sonia+5.247
 keeper+5.084
Neg logit contributions
idd-8.202
ooming-7.833
 sudden-7.625
ug-7.612
 stacking-7.560
Token and
Token ID290
Feature activation+7.41e-01

Pos logit contributions
 stranger+2.168
Stan+1.991
 Stanley+1.963
 stall+1.957
 Heidi+1.945
Neg logit contributions
ug-2.551
idd-2.367
ooming-2.367
ips-2.257
med-2.234
Token opened
Token ID4721
Feature activation+1.32e+00

Pos logit contributions
 stranger+1.261
 keeper+1.190
 stall+1.177
 truth+1.139
Stan+1.124
Neg logit contributions
ug-1.396
idd-1.343
ooming-1.263
zz-1.252
med-1.243
Token it
Token ID340
Feature activation+2.24e+00

Pos logit contributions
 stranger+2.317
 keeper+2.201
 stall+2.096
 ranger+2.073
 butcher+2.063
Neg logit contributions
ug-2.474
idd-2.342
med-2.251
ooming-2.246
irled-2.220
Token.
Token ID13
Feature activation+7.15e-01

Pos logit contributions
 stranger+4.302
Stan+3.854
 camel+3.832
 butcher+3.740
 Heidi+3.707
Neg logit contributions
ug-4.072
bed-3.851
med-3.847
ooming-3.812
idd-3.764
Token field
Token ID2214
Feature activation+1.81e+00

Pos logit contributions
 stranger+1.694
 Heidi+1.618
 keeper+1.616
Stan+1.600
 Stanley+1.527
Neg logit contributions
ug-3.285
idd-3.168
ooming-3.012
ips-2.896
zz-2.892
Token.
Token ID13
Feature activation-4.36e-02

Pos logit contributions
 stranger+2.957
 Heidi+2.781
Stan+2.716
 Stanley+2.663
 butcher+2.657
Neg logit contributions
ug-3.731
idd-3.523
ooming-3.317
med-3.218
ips-3.201
Token Johnny
Token ID15470
Feature activation+7.74e-01

Pos logit contributions
ug+0.085
idd+0.079
ooming+0.077
irled+0.074
med+0.074
Neg logit contributions
 stranger-0.075
 keeper-0.071
 stall-0.070
 Heidi-0.066
 seller-0.066
Token ran
Token ID4966
Feature activation+5.94e-01

Pos logit contributions
 stranger+1.194
 keeper+1.114
 stall+1.103
 Heidi+1.056
 truth+1.036
Neg logit contributions
ug-1.607
idd-1.543
ooming-1.491
med-1.452
irled-1.448
Token to
Token ID284
Feature activation+1.10e+00

Pos logit contributions
 stranger+0.906
 keeper+0.860
 stall+0.858
 truth+0.811
Stan+0.805
Neg logit contributions
ug-1.224
idd-1.159
ooming-1.123
med-1.096
ged-1.095
Token the
Token ID262
Feature activation+3.93e+00

Pos logit contributions
 stranger+1.710
 keeper+1.650
 stall+1.593
 ranger+1.529
 seller+1.525
Neg logit contributions
ug-2.240
idd-2.100
ooming-2.048
zz-1.989
ged-1.973
Token middle
Token ID3504
Feature activation+1.67e-01

Pos logit contributions
 Heidi+5.706
Stan+5.656
 keeper+5.435
isi+5.332
 stranger+5.286
Neg logit contributions
idd-7.914
ug-7.717
ooming-7.539
 spinning-7.504
bed-7.408
Token and
Token ID290
Feature activation+8.77e-01

Pos logit contributions
 stranger+0.327
 keeper+0.311
 stall+0.307
 Heidi+0.296
Stan+0.294
Neg logit contributions
ug-0.281
idd-0.259
ooming-0.251
med-0.240
zz-0.238
Token noticed
Token ID6810
Feature activation+8.46e-01

Pos logit contributions
 stranger+1.426
 keeper+1.357
 stall+1.352
 truth+1.316
Stan+1.275
Neg logit contributions
ug-1.710
idd-1.660
ooming-1.574
ged-1.563
med-1.553
Token a
Token ID257
Feature activation+9.41e-01

Pos logit contributions
 stranger+1.004
 stall+0.953
 keeper+0.948
 Heidi+0.871
Stan+0.862
Neg logit contributions
ug-2.074
idd-1.965
ooming-1.900
med-1.872
ged-1.851
Token big
Token ID1263
Feature activation+4.51e-01

Pos logit contributions
 stranger+1.176
 keeper+1.139
 Heidi+1.110
 stall+1.105
 Stanley+1.065
Neg logit contributions
ug-2.107
idd-2.052
ooming-1.970
zz-1.896
irled-1.864
Token day
Token ID1110
Feature activation+4.43e-01

Pos logit contributions
 stranger+3.022
 keeper+2.989
 stall+2.959
 seller+2.886
 Heidi+2.878
Neg logit contributions
ug-1.980
zz-1.694
idd-1.671
ooming-1.580
irled-1.526
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
 stranger+0.779
 keeper+0.747
 stall+0.733
 seller+0.691
 ranger+0.690
Neg logit contributions
ug-0.841
idd-0.778
ooming-0.761
ged-0.735
zz-0.730
Token he
Token ID339
Feature activation+6.68e-01

Pos logit contributions
 stranger+1.910
 keeper+1.807
 stall+1.792
 truth+1.713
 Clover+1.698
Neg logit contributions
ug-3.028
idd-2.777
ged-2.753
ooming-2.731
zz-2.658
Token went
Token ID1816
Feature activation+9.52e-01

Pos logit contributions
 stranger+1.005
 stall+0.950
 keeper+0.950
 Heidi+0.902
 Stanley+0.869
Neg logit contributions
ug-1.425
idd-1.363
ooming-1.306
irled-1.289
zz-1.278
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
 stranger+1.619
 keeper+1.568
 stall+1.539
 Heidi+1.481
Stan+1.463
Neg logit contributions
ug-1.806
ooming-1.681
idd-1.677
ips-1.615
zz-1.596
Token the
Token ID262
Feature activation+3.91e+00

Pos logit contributions
 stranger+2.319
 keeper+2.245
 stall+2.149
 seller+2.094
 ranger+2.039
Neg logit contributions
ug-2.993
idd-2.773
ooming-2.754
zz-2.706
 pract-2.584
Token forest
Token ID8222
Feature activation+1.34e+00

Pos logit contributions
 Heidi+6.314
 keeper+5.839
 stranger+5.833
 Clover+5.721
 Sonia+5.720
Neg logit contributions
idd-7.655
ug-7.520
ooming-7.249
zz-7.140
 stacking-7.130
Token and
Token ID290
Feature activation+1.99e+00

Pos logit contributions
 stranger+2.170
 Heidi+1.994
Stan+1.966
 keeper+1.963
 stall+1.949
Neg logit contributions
ug-2.703
idd-2.490
ooming-2.490
anged-2.348
zz-2.332
Token spotted
Token ID13489
Feature activation+2.06e+00

Pos logit contributions
 stranger+3.385
 stall+3.290
 truth+3.230
 keeper+3.191
Stan+3.133
Neg logit contributions
ug-3.608
idd-3.606
ged-3.439
zz-3.368
anged-3.358
Token a
Token ID257
Feature activation+9.65e-01

Pos logit contributions
 stranger+3.007
 stall+2.941
 keeper+2.913
Stan+2.650
 Hut+2.625
Neg logit contributions
ug-4.470
idd-4.279
med-4.063
ged-4.007
ooming-3.966
Token beautiful
Token ID4950
Feature activation+5.98e-01

Pos logit contributions
 stranger+1.187
 keeper+1.151
 stall+1.126
 Heidi+1.124
 Stanley+1.068
Neg logit contributions
ug-2.190
idd-2.124
ooming-2.044
zz-1.973
irled-1.920
TokenYou
Token ID1639
Feature activation+8.30e-01

Pos logit contributions
 stranger+1.134
 stall+1.074
 keeper+1.049
 Heidi+0.990
 Stanley+0.978
Neg logit contributions
ug-1.807
idd-1.740
ooming-1.664
med-1.627
zz-1.623
Token have
Token ID423
Feature activation+9.04e-01

Pos logit contributions
 stranger+1.348
 keeper+1.295
 stall+1.276
 Heidi+1.198
 ranger+1.197
Neg logit contributions
ug-1.674
idd-1.578
ooming-1.502
zz-1.483
irled-1.445
Token to
Token ID284
Feature activation+1.89e+00

Pos logit contributions
 stranger+1.223
 keeper+1.158
 stall+1.149
 seller+1.054
 Heidi+1.051
Neg logit contributions
ug-2.025
idd-1.950
ooming-1.855
ips-1.830
med-1.825
Token go
Token ID467
Feature activation+1.60e-01

Pos logit contributions
 stranger+3.028
 keeper+2.810
 stall+2.772
 ranger+2.570
keeper+2.559
Neg logit contributions
ug-3.838
idd-3.515
zz-3.508
ooming-3.431
irled-3.331
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
 stranger+0.226
 keeper+0.212
 stall+0.208
 Heidi+0.196
 seller+0.193
Neg logit contributions
ug-0.356
idd-0.337
ooming-0.329
zz-0.317
ips-0.316
Token the
Token ID262
Feature activation+3.90e+00

Pos logit contributions
 stranger+2.022
 keeper+1.910
 stall+1.865
 seller+1.792
holder+1.741
Neg logit contributions
ug-2.585
idd-2.377
ooming-2.350
zz-2.266
ips-2.265
Token west
Token ID7421
Feature activation+1.08e+00

Pos logit contributions
 Heidi+5.573
 stranger+5.467
 keeper+5.371
 Stanley+5.275
holder+5.085
Neg logit contributions
idd-7.823
ooming-7.464
ug-7.401
ips-7.386
zz-7.363
Token and
Token ID290
Feature activation+9.99e-01

Pos logit contributions
 stranger+1.735
 Heidi+1.573
 keeper+1.570
 stall+1.568
Stan+1.548
Neg logit contributions
ug-2.184
idd-2.010
ooming-1.956
ips-1.876
zz-1.870
Token twist
Token ID14528
Feature activation-2.68e-01

Pos logit contributions
 stranger+1.479
 stall+1.397
 keeper+1.386
Stan+1.298
 Heidi+1.295
Neg logit contributions
ug-2.069
idd-1.954
ooming-1.895
zz-1.841
irled-1.823
Token off
Token ID572
Feature activation+6.42e-02

Pos logit contributions
ug+0.542
idd+0.511
ooming+0.496
irled+0.476
zz+0.475
Neg logit contributions
 stranger-0.434
 keeper-0.413
 stall-0.407
 Heidi-0.382
 ranger-0.380
Token a
Token ID257
Feature activation+1.36e+00

Pos logit contributions
 stranger+0.098
 keeper+0.093
 stall+0.092
 Heidi+0.086
 ranger+0.085
Neg logit contributions
ug-0.135
idd-0.127
ooming-0.124
med-0.119
irled-0.119
Token,
Token ID11
Feature activation+8.22e-01

Pos logit contributions
 stranger+3.240
 keeper+3.128
 stall+2.925
 ranger+2.901
Stan+2.895
Neg logit contributions
ug-3.768
idd-3.468
ged-3.381
ips-3.361
ooming-3.352
Token so
Token ID523
Feature activation+1.12e+00

Pos logit contributions
 stranger+1.296
 keeper+1.232
 stall+1.199
Stan+1.139
 seller+1.130
Neg logit contributions
ug-1.641
idd-1.545
ooming-1.515
med-1.466
ged-1.461
Token she
Token ID673
Feature activation+1.50e+00

Pos logit contributions
 stranger+1.458
 keeper+1.439
 stall+1.370
 ranger+1.314
 seller+1.282
Neg logit contributions
ug-2.554
idd-2.427
ooming-2.383
irled-2.358
ged-2.354
Token went
Token ID1816
Feature activation+1.15e+00

Pos logit contributions
 stranger+2.516
 keeper+2.393
 stall+2.381
 Heidi+2.290
 truth+2.283
Neg logit contributions
ug-2.895
idd-2.792
irled-2.669
ged-2.602
zz-2.598
Token to
Token ID284
Feature activation+2.28e+00

Pos logit contributions
 stranger+1.681
 keeper+1.603
 stall+1.539
 Heidi+1.522
Stan+1.489
Neg logit contributions
ug-2.405
idd-2.340
ooming-2.304
ips-2.245
zz-2.219
Token the
Token ID262
Feature activation+3.89e+00

Pos logit contributions
 keeper+3.902
 stranger+3.884
 ranger+3.624
 stall+3.563
holder+3.535
Neg logit contributions
ug-3.874
idd-3.757
ooming-3.721
zz-3.687
 pract-3.631
Token kitchen
Token ID9592
Feature activation+1.37e+00

Pos logit contributions
 Heidi+6.534
 Stanley+6.146
 Sonia+6.075
 keeper+6.069
Stan+6.016
Neg logit contributions
idd-7.333
 stacking-6.505
ooming-6.463
 mar-6.229
 pol-6.211
Token.
Token ID13
Feature activation+1.66e+00

Pos logit contributions
 stranger+2.103
 keeper+1.939
 Heidi+1.888
Stan+1.859
 Stanley+1.836
Neg logit contributions
ug-2.814
idd-2.743
ooming-2.679
ips-2.581
zz-2.534
Token She
Token ID1375
Feature activation+4.62e-01

Pos logit contributions
 stranger+2.715
 keeper+2.703
 camel+2.485
 ranger+2.453
 seller+2.436
Neg logit contributions
ug-3.095
idd-3.092
ips-3.005
ooming-2.947
ged-2.946
Token spotted
Token ID13489
Feature activation+1.66e+00

Pos logit contributions
 stranger+0.685
 keeper+0.643
 stall+0.626
 Heidi+0.594
Stan+0.588
Neg logit contributions
ug-0.984
idd-0.949
ooming-0.906
irled-0.894
zz-0.888
Token a
Token ID257
Feature activation+6.10e-01

Pos logit contributions
 stranger+2.243
 keeper+2.238
 stall+2.153
 ranger+2.059
Stan+1.987
Neg logit contributions
ug-3.635
idd-3.565
med-3.455
ged-3.368
ooming-3.347
Token and
Token ID290
Feature activation+1.77e+00

Pos logit contributions
 stranger+0.769
 keeper+0.726
 stall+0.705
 Heidi+0.672
 ranger+0.661
Neg logit contributions
ug-1.209
idd-1.159
ooming-1.121
zz-1.087
irled-1.087
Token her
Token ID607
Feature activation+1.71e+00

Pos logit contributions
 keeper+2.631
 stranger+2.575
 ranger+2.460
 stall+2.458
 camel+2.322
Neg logit contributions
ug-3.670
zz-3.505
irled-3.423
ooming-3.402
idd-3.393
Token parents
Token ID3397
Feature activation+1.01e+00

Pos logit contributions
 stranger+2.513
 keeper+2.490
 Heidi+2.377
 ranger+2.343
 stall+2.334
Neg logit contributions
ug-3.388
idd-3.316
ooming-3.189
irled-3.122
zz-3.083
Token went
Token ID1816
Feature activation+7.27e-01

Pos logit contributions
 stranger+1.653
 keeper+1.611
 stall+1.548
 Heidi+1.505
 ranger+1.503
Neg logit contributions
ug-2.017
idd-1.908
ooming-1.813
ged-1.801
zz-1.793
Token to
Token ID284
Feature activation+1.43e+00

Pos logit contributions
 stranger+1.051
 keeper+1.002
 stall+0.961
 Heidi+0.937
Stan+0.922
Neg logit contributions
ug-1.572
idd-1.500
ooming-1.476
ips-1.430
med-1.405
Token the
Token ID262
Feature activation+3.89e+00

Pos logit contributions
 stranger+2.368
 keeper+2.310
 stall+2.162
 ranger+2.142
 camel+2.129
Neg logit contributions
ug-2.656
idd-2.552
ooming-2.480
zz-2.382
ips-2.342
Token market
Token ID1910
Feature activation+2.21e+00

Pos logit contributions
 Heidi+4.074
 camel+3.721
oto+3.701
 Clover+3.688
 keeper+3.681
Neg logit contributions
idd-9.545
 stacking-8.778
ooming-8.765
ug-8.439
 pract-8.366
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
 stranger+4.260
 camel+3.921
Stan+3.862
 Heidi+3.824
 Stanley+3.807
Neg logit contributions
ug-3.793
idd-3.728
ooming-3.586
ips-3.437
bed-3.379
Token Jane
Token ID12091
Feature activation+7.53e-01

Pos logit contributions
 stranger+1.705
 keeper+1.601
Stan+1.546
 stall+1.532
 camel+1.523
Neg logit contributions
ug-2.053
idd-1.968
irled-1.855
ooming-1.853
anged-1.842
Token tried
Token ID3088
Feature activation+7.85e-01

Pos logit contributions
 stranger+1.140
 keeper+1.056
 stall+1.022
 Heidi+0.998
 truth+0.982
Neg logit contributions
ug-1.590
idd-1.543
ooming-1.473
irled-1.430
ged-1.424
Token to
Token ID284
Feature activation+1.99e+00

Pos logit contributions
 stranger+1.120
 keeper+1.074
 stall+1.037
 seller+0.991
 camel+0.978
Neg logit contributions
ug-1.686
idd-1.590
ooming-1.583
ips-1.537
med-1.520
Token computer
Token ID3644
Feature activation-6.01e-01

Pos logit contributions
ug+2.688
idd+2.435
ooming+2.367
med+2.363
ips+2.336
Neg logit contributions
 stranger-2.186
 stall-2.068
 keeper-2.047
 seller-1.934
 bench-1.917
Token.
Token ID13
Feature activation-4.04e-01

Pos logit contributions
ug+1.419
idd+1.358
ooming+1.315
irled+1.279
zz+1.279
Neg logit contributions
 stranger-0.771
 keeper-0.728
 stall-0.721
 seller-0.662
 ranger-0.656
Token He
Token ID679
Feature activation-1.68e+00

Pos logit contributions
ug+1.049
idd+1.002
ooming+0.982
irled+0.951
med+0.951
Neg logit contributions
 stranger-0.424
 keeper-0.396
 stall-0.388
 Heidi-0.349
 ranger-0.347
Token would
Token ID561
Feature activation-1.15e+00

Pos logit contributions
ug+4.730
ooming+4.550
idd+4.535
ips+4.447
med+4.343
Neg logit contributions
 stranger-1.438
 keeper-1.287
 stall-1.258
 seller-1.179
 bench-1.132
Token spend
Token ID4341
Feature activation-2.76e+00

Pos logit contributions
ug+2.591
idd+2.484
ooming+2.411
ips+2.335
med+2.319
Neg logit contributions
 stranger-1.652
 keeper-1.536
 stall-1.533
 seller-1.448
 bench-1.432
Token hours
Token ID2250
Feature activation-4.01e+00

Pos logit contributions
ug+7.428
idd+7.307
ooming+7.027
anged+6.792
zz+6.650
Neg logit contributions
 stranger-3.103
 stall-2.913
 bench-2.840
 keeper-2.730
 Heidi-2.647
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
ug+8.287
idd+8.209
irled+7.879
ooming+7.685
anged+7.657
Neg logit contributions
 stall-6.621
 nearby-6.500
 stranger-6.277
 bench-6.149
 keeper-6.048
Token hours
Token ID2250
Feature activation-3.91e+00

Pos logit contributions
ug+3.358
idd+3.188
ooming+3.083
ips+2.927
irled+2.897
Neg logit contributions
 stranger-2.326
 keeper-2.224
 stall-2.175
 bench-2.087
 ranger-2.078
Token every
Token ID790
Feature activation-1.98e+00

Pos logit contributions
ug+9.615
idd+9.610
irled+9.422
ooming+9.255
zz+9.090
Neg logit contributions
 stall-5.007
 nearby-4.726
 stranger-4.651
 bench-4.614
 seller-4.555
Token day
Token ID1110
Feature activation-2.19e+00

Pos logit contributions
ug+4.193
idd+4.029
ooming+3.881
ips+3.754
irled+3.726
Neg logit contributions
 stranger-3.472
 stall-3.164
 keeper-3.076
 bench-3.006
 ranger-2.864
Token playing
Token ID2712
Feature activation-1.52e+00

Pos logit contributions
ug+5.883
idd+5.775
irled+5.666
ooming+5.617
zz+5.591
Neg logit contributions
 stranger-2.133
 stall-2.022
 keeper-1.942
 Stanley-1.745
 bench-1.716
Token day
Token ID1110
Feature activation-7.82e-01

Pos logit contributions
ug+0.946
idd+0.893
ooming+0.863
med+0.820
ips+0.818
Neg logit contributions
 stranger-0.982
 keeper-0.933
 stall-0.926
 Heidi-0.870
 ranger-0.867
Token Mill
Token ID9212
Feature activation-9.74e-01

Pos logit contributions
ug+1.698
idd+1.630
ooming+1.576
ips+1.532
irled+1.526
Neg logit contributions
 stranger-1.164
 keeper-1.087
 stall-1.066
 Heidi-1.020
 Stanley-0.996
Tokenie
Token ID494
Feature activation-1.48e+00

Pos logit contributions
ug+1.740
idd+1.644
ooming+1.586
ips+1.501
med+1.493
Neg logit contributions
 stranger-1.809
 keeper-1.713
 stall-1.691
 seller-1.607
 Heidi-1.603
Token would
Token ID561
Feature activation-9.45e-01

Pos logit contributions
ug+4.270
idd+4.133
ooming+4.050
ips+3.986
med+3.934
Neg logit contributions
 stranger-1.100
 stall-0.959
 keeper-0.942
 seller-0.843
 bench-0.791
Token spend
Token ID4341
Feature activation-2.48e+00

Pos logit contributions
ug+2.252
idd+2.175
ooming+2.106
ips+2.051
med+2.047
Neg logit contributions
 stranger-1.194
 keeper-1.103
 stall-1.100
 bench-1.010
 seller-1.007
Token hours
Token ID2250
Feature activation-3.94e+00

Pos logit contributions
ug+6.786
idd+6.667
ooming+6.406
anged+6.169
zz+6.035
Neg logit contributions
 stranger-2.687
 stall-2.423
 bench-2.385
 Heidi-2.324
 keeper-2.273
Token making
Token ID1642
Feature activation-1.78e+00

Pos logit contributions
ug+8.038
idd+8.007
irled+7.919
anged+7.424
ooming+7.372
Neg logit contributions
 stall-6.430
 nearby-6.227
 stranger-6.223
 bench-6.093
 keeper-5.927
Token new
Token ID649
Feature activation-2.25e+00

Pos logit contributions
ug+4.293
idd+4.196
ooming+4.046
anged+3.907
ged+3.892
Neg logit contributions
 stranger-2.227
 keeper-2.050
 stall-1.985
 truth-1.901
 bench-1.868
Token routes
Token ID11926
Feature activation-6.20e-01

Pos logit contributions
ug+4.897
idd+4.738
ged+4.588
zz+4.550
ips+4.530
Neg logit contributions
 stranger-3.386
 stall-3.274
 keeper-3.264
 bench-2.967
 truth-2.922
Token for
Token ID329
Feature activation-1.59e+00

Pos logit contributions
ug+1.315
idd+1.256
ooming+1.213
irled+1.170
zz+1.165
Neg logit contributions
 stranger-0.931
 keeper-0.893
 stall-0.871
 ranger-0.813
 seller-0.808
Token the
Token ID262
Feature activation-1.52e+00

Pos logit contributions
ug+4.259
idd+4.176
ooming+4.052
irled+3.932
zz+3.913
Neg logit contributions
 stranger-1.613
 keeper-1.452
 stall-1.428
 truth-1.336
 Heidi-1.314
Tokenbe
Token ID1350
Feature activation-2.98e+00

Pos logit contributions
ug+2.943
irled+2.791
ooming+2.750
ips+2.693
idd+2.665
Neg logit contributions
 keeper-6.501
 stranger-6.492
 ranger-6.150
 stall-6.056
 nearby-6.018
Tokenep
Token ID538
Feature activation-1.20e+00

Pos logit contributions
idd+4.645
ug+4.490
ooming+4.239
ips+3.955
anged+3.913
Neg logit contributions
 stranger-5.918
 keeper-5.851
 stall-5.660
 truth-5.480
 bench-5.457
Token.
Token ID13
Feature activation+6.63e-01

Pos logit contributions
ug+2.653
idd+2.593
ooming+2.479
irled+2.434
ged+2.404
Neg logit contributions
 stranger-1.618
 keeper-1.615
 stall-1.501
 ranger-1.431
 seller-1.405
Token They
Token ID1119
Feature activation-1.49e+00

Pos logit contributions
 stranger+1.034
 keeper+0.986
 stall+0.932
 ranger+0.897
 camel+0.885
Neg logit contributions
ug-1.364
idd-1.296
ooming-1.253
ips-1.241
irled-1.230
Token have
Token ID423
Feature activation-7.92e-01

Pos logit contributions
ug+3.505
idd+3.370
ooming+3.304
ips+3.174
irled+3.173
Neg logit contributions
 stranger-1.983
 keeper-1.844
 stall-1.801
 seller-1.738
 bench-1.689
Token fun
Token ID1257
Feature activation-3.93e+00

Pos logit contributions
ug+2.009
idd+1.915
ooming+1.888
irled+1.811
zz+1.808
Neg logit contributions
 stranger-0.890
 keeper-0.834
 stall-0.818
 ranger-0.740
 truth-0.732
Token and
Token ID290
Feature activation-1.80e+00

Pos logit contributions
idd+10.784
ug+10.671
anged+10.398
irled+10.218
zz+10.146
Neg logit contributions
 stranger-3.936
 nearby-3.771
 bench-3.718
 keeper-3.684
 stall-3.659
Token laugh
Token ID6487
Feature activation+1.24e-01

Pos logit contributions
ug+3.865
idd+3.754
ooming+3.679
ips+3.417
irled+3.415
Neg logit contributions
 stranger-2.786
 keeper-2.707
 stall-2.581
 ranger-2.511
 bench-2.499
Token a
Token ID257
Feature activation-6.69e-01

Pos logit contributions
 stranger+0.199
 keeper+0.189
 stall+0.187
 Heidi+0.176
 ranger+0.174
Neg logit contributions
ug-0.253
idd-0.236
ooming-0.231
med-0.224
irled-0.222
Token lot
Token ID1256
Feature activation+4.41e-01

Pos logit contributions
ug+1.236
idd+1.156
ooming+1.117
ips+1.075
irled+1.069
Neg logit contributions
 stranger-1.232
 keeper-1.164
 stall-1.160
 ranger-1.083
 bench-1.082
Token.
Token ID13
Feature activation+3.12e-01

Pos logit contributions
 stranger+0.608
 keeper+0.562
 stall+0.561
 Heidi+0.534
Stan+0.526
Neg logit contributions
ug-1.006
idd-0.931
ooming-0.914
med-0.885
zz-0.884
Token arrow
Token ID15452
Feature activation-6.35e-01

Pos logit contributions
ug+1.752
idd+1.680
ooming+1.634
ips+1.583
irled+1.569
Neg logit contributions
 stranger-1.000
 keeper-0.934
 stall-0.923
 seller-0.855
 ranger-0.847
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
ug+1.532
idd+1.466
ooming+1.427
irled+1.386
zz+1.385
Neg logit contributions
 stranger-0.783
 keeper-0.737
 stall-0.721
 seller-0.671
 ranger-0.661
Token He
Token ID679
Feature activation-1.26e+00

Pos logit contributions
ug+0.556
idd+0.530
ooming+0.519
irled+0.503
med+0.502
Neg logit contributions
 stranger-0.231
 keeper-0.214
 stall-0.210
 Heidi-0.189
 ranger-0.188
Token would
Token ID561
Feature activation-1.03e+00

Pos logit contributions
ug+3.399
idd+3.260
ooming+3.226
ips+3.152
med+3.088
Neg logit contributions
 stranger-1.218
 keeper-1.133
 stall-1.095
 seller-1.034
 bench-0.996
Token spend
Token ID4341
Feature activation-2.34e+00

Pos logit contributions
ug+2.248
idd+2.159
ooming+2.081
ips+2.017
med+2.001
Neg logit contributions
 stranger-1.554
 keeper-1.459
 stall-1.454
 seller-1.382
 bench-1.359
Token hours
Token ID2250
Feature activation-3.93e+00

Pos logit contributions
ug+6.167
idd+6.033
ooming+5.820
anged+5.554
zz+5.533
Neg logit contributions
 stranger-2.677
 stall-2.454
 keeper-2.372
 bench-2.345
 Heidi-2.292
Token shooting
Token ID4395
Feature activation-1.31e+00

Pos logit contributions
ug+8.162
idd+8.097
irled+7.749
zz+7.639
apy+7.604
Neg logit contributions
 stall-6.415
 nearby-6.240
 stranger-6.190
 bench-5.952
 seller-5.828
Token arrows
Token ID20507
Feature activation-7.44e-01

Pos logit contributions
ug+2.877
idd+2.839
ooming+2.714
ips+2.610
irled+2.609
Neg logit contributions
 stranger-1.918
 keeper-1.788
 stall-1.761
 truth-1.611
 ranger-1.605
Token at
Token ID379
Feature activation-1.50e+00

Pos logit contributions
ug+1.467
idd+1.405
ooming+1.343
irled+1.312
zz+1.286
Neg logit contributions
 stranger-1.236
 keeper-1.174
 stall-1.160
 seller-1.091
 ranger-1.081
Token targets
Token ID6670
Feature activation-1.00e+00

Pos logit contributions
ug+3.713
idd+3.583
ooming+3.530
irled+3.409
med+3.381
Neg logit contributions
 stranger-1.789
 keeper-1.705
 stall-1.695
 truth-1.529
 ranger-1.519
Token in
Token ID287
Feature activation-1.31e+00

Pos logit contributions
ug+2.156
idd+2.082
ooming+2.005
irled+1.961
zz+1.956
Neg logit contributions
 stranger-1.477
 keeper-1.397
 stall-1.381
 seller-1.282
 ranger-1.279
Token He
Token ID679
Feature activation-1.68e+00

Pos logit contributions
ug+1.049
idd+1.002
ooming+0.982
irled+0.951
med+0.951
Neg logit contributions
 stranger-0.424
 keeper-0.396
 stall-0.388
 Heidi-0.349
 ranger-0.347
Token would
Token ID561
Feature activation-1.15e+00

Pos logit contributions
ug+4.730
ooming+4.550
idd+4.535
ips+4.447
med+4.343
Neg logit contributions
 stranger-1.438
 keeper-1.287
 stall-1.258
 seller-1.179
 bench-1.132
Token spend
Token ID4341
Feature activation-2.76e+00

Pos logit contributions
ug+2.591
idd+2.484
ooming+2.411
ips+2.335
med+2.319
Neg logit contributions
 stranger-1.652
 keeper-1.536
 stall-1.533
 seller-1.448
 bench-1.432
Token hours
Token ID2250
Feature activation-4.01e+00

Pos logit contributions
ug+7.428
idd+7.307
ooming+7.027
anged+6.792
zz+6.650
Neg logit contributions
 stranger-3.103
 stall-2.913
 bench-2.840
 keeper-2.730
 Heidi-2.647
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
ug+8.287
idd+8.209
irled+7.879
ooming+7.685
anged+7.657
Neg logit contributions
 stall-6.621
 nearby-6.500
 stranger-6.277
 bench-6.149
 keeper-6.048
Token hours
Token ID2250
Feature activation-3.91e+00

Pos logit contributions
ug+3.358
idd+3.188
ooming+3.083
ips+2.927
irled+2.897
Neg logit contributions
 stranger-2.326
 keeper-2.224
 stall-2.175
 bench-2.087
 ranger-2.078
Token every
Token ID790
Feature activation-1.98e+00

Pos logit contributions
ug+9.615
idd+9.610
irled+9.422
ooming+9.255
zz+9.090
Neg logit contributions
 stall-5.007
 nearby-4.726
 stranger-4.651
 bench-4.614
 seller-4.555
Token day
Token ID1110
Feature activation-2.19e+00

Pos logit contributions
ug+4.193
idd+4.029
ooming+3.881
ips+3.754
irled+3.726
Neg logit contributions
 stranger-3.472
 stall-3.164
 keeper-3.076
 bench-3.006
 ranger-2.864
Token playing
Token ID2712
Feature activation-1.52e+00

Pos logit contributions
ug+5.883
idd+5.775
irled+5.666
ooming+5.617
zz+5.591
Neg logit contributions
 stranger-2.133
 stall-2.022
 keeper-1.942
 Stanley-1.745
 bench-1.716
Token games
Token ID1830
Feature activation-2.79e-01

Pos logit contributions
ug+3.269
idd+3.143
ooming+2.973
anged+2.948
zz+2.940
Neg logit contributions
 stranger-2.358
 stall-2.195
 keeper-2.187
 ranger-2.029
 bench-2.024
Token and
Token ID290
Feature activation-2.04e+00

Pos logit contributions
ug+0.594
idd+0.562
ooming+0.546
irled+0.527
zz+0.525
Neg logit contributions
 stranger-0.422
 keeper-0.401
 stall-0.395
 Heidi-0.367
 ranger-0.367
Token soft
Token ID2705
Feature activation-2.38e+00

Pos logit contributions
ug+5.143
idd+4.974
ooming+4.825
ips+4.596
irled+4.572
Neg logit contributions
 keeper-2.891
 stranger-2.773
 ranger-2.691
 stall-2.465
 bench-2.448
Token.
Token ID13
Feature activation-8.56e-01

Pos logit contributions
ug+6.463
idd+6.409
ooming+6.096
irled+5.906
ips+5.829
Neg logit contributions
 stranger-2.260
 keeper-2.217
 ranger-2.005
 stall-1.990
 bench-1.910
Token They
Token ID1119
Feature activation-1.52e+00

Pos logit contributions
ug+1.602
idd+1.487
ooming+1.482
zz+1.400
med+1.397
Neg logit contributions
 stranger-1.524
 keeper-1.492
 stall-1.451
 ranger-1.403
 Heidi-1.383
Token both
Token ID1111
Feature activation-2.10e+00

Pos logit contributions
ug+3.218
idd+3.093
ooming+3.013
ips+2.892
irled+2.808
Neg logit contributions
 stranger-2.322
 keeper-2.242
 stall-2.124
 seller-2.086
 ranger-2.080
Token had
Token ID550
Feature activation-3.99e-01

Pos logit contributions
ug+4.485
idd+4.322
ooming+4.303
ips+4.148
med+3.909
Neg logit contributions
 stranger-3.195
 keeper-3.095
 ranger-2.878
 bench-2.853
 stall-2.833
Token fun
Token ID1257
Feature activation-3.88e+00

Pos logit contributions
ug+0.914
idd+0.867
ooming+0.846
zz+0.815
irled+0.814
Neg logit contributions
 stranger-0.543
 keeper-0.514
 stall-0.503
 ranger-0.466
 Heidi-0.465
Token and
Token ID290
Feature activation-2.06e+00

Pos logit contributions
idd+10.403
ug+10.388
zz+9.692
ips+9.573
ka+9.519
Neg logit contributions
 bench-3.886
 stranger-3.882
 nearby-3.749
 truth-3.725
 keeper-3.666
Token were
Token ID547
Feature activation-1.85e+00

Pos logit contributions
ug+4.578
idd+4.474
ooming+4.455
ips+4.201
irled+4.070
Neg logit contributions
 keeper-2.993
 stranger-2.986
 ranger-2.822
 bench-2.615
 stall-2.596
Token happy
Token ID3772
Feature activation-2.29e+00

Pos logit contributions
ug+4.718
idd+4.513
ooming+4.473
ged+4.325
zz+4.309
Neg logit contributions
 keeper-2.075
 stranger-2.055
 ranger-1.837
 stall-1.829
 Heidi-1.728
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
ug+6.056
idd+5.638
ooming+5.492
zz+5.468
irled+5.467
Neg logit contributions
 stranger-2.677
 keeper-2.602
 ranger-2.428
 truth-2.277
 stall-2.143
Token The
Token ID383
Feature activation-1.37e+00

Pos logit contributions
ug+0.491
idd+0.461
ooming+0.448
irled+0.429
med+0.429
Neg logit contributions
 stranger-0.431
 keeper-0.412
 stall-0.406
 Heidi-0.382
 ranger-0.381
Token help
Token ID1037
Feature activation-2.01e-01

Pos logit contributions
ug+1.291
idd+1.231
ooming+1.184
irled+1.148
med+1.144
Neg logit contributions
 stranger-0.921
 keeper-0.872
 stall-0.846
 seller-0.796
 truth-0.795
Token her
Token ID607
Feature activation-1.14e+00

Pos logit contributions
ug+0.501
idd+0.477
ooming+0.466
irled+0.452
med+0.451
Neg logit contributions
 stranger-0.233
 keeper-0.217
 stall-0.212
 Heidi-0.194
 ranger-0.192
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
ug+2.796
idd+2.718
ooming+2.538
med+2.517
ged+2.500
Neg logit contributions
 stranger-1.367
 keeper-1.302
 stall-1.234
 truth-1.176
 bench-1.158
Token They
Token ID1119
Feature activation-6.17e-01

Pos logit contributions
 stranger+0.473
 keeper+0.440
 stall+0.438
Stan+0.414
 Heidi+0.413
Neg logit contributions
ug-0.640
idd-0.602
ooming-0.580
med-0.565
irled-0.563
Token all
Token ID477
Feature activation-1.89e+00

Pos logit contributions
ug+1.341
idd+1.271
ooming+1.243
med+1.188
irled+1.188
Neg logit contributions
 stranger-0.912
 keeper-0.858
 stall-0.839
 seller-0.792
 ranger-0.784
Token kept
Token ID4030
Feature activation-3.86e+00

Pos logit contributions
ug+4.093
idd+3.981
ooming+3.910
ips+3.707
med+3.621
Neg logit contributions
 stranger-2.864
 keeper-2.670
 ranger-2.462
 stall-2.457
 bench-2.420
Token spinning
Token ID19493
Feature activation-2.22e+00

Pos logit contributions
ug+8.474
idd+7.917
ooming+7.294
akes+7.225
anged+7.100
Neg logit contributions
 stranger-6.206
 nearby-6.168
 keeper-5.990
 bench-5.878
 truth-5.728
Token,
Token ID11
Feature activation-2.49e+00

Pos logit contributions
idd+5.403
ug+5.318
ooming+4.951
anged+4.943
ips+4.824
Neg logit contributions
 stranger-2.663
 keeper-2.632
 ranger-2.362
 stall-2.314
 nearby-2.303
Token but
Token ID475
Feature activation-7.72e-02

Pos logit contributions
ug+6.681
idd+6.548
ooming+6.102
irled+6.017
anged+5.940
Neg logit contributions
 stranger-2.591
 keeper-2.568
 bench-2.421
 ranger-2.325
 stall-2.271
Token she
Token ID673
Feature activation-9.91e-01

Pos logit contributions
ug+0.159
idd+0.150
ooming+0.145
irled+0.140
med+0.140
Neg logit contributions
 stranger-0.123
 keeper-0.116
 stall-0.115
 Heidi-0.107
 seller-0.107
Token still
Token ID991
Feature activation-1.48e+00

Pos logit contributions
ug+2.243
idd+2.114
ooming+2.091
med+2.020
ips+2.018
Neg logit contributions
 stranger-1.365
 keeper-1.304
 stall-1.230
 seller-1.179
 ranger-1.178
Tokenep
Token ID538
Feature activation-9.48e-01

Pos logit contributions
ug+4.268
ooming+4.119
anged+3.934
idd+3.857
zz+3.783
Neg logit contributions
 keeper-5.139
 stranger-5.110
 stall-4.932
 nearby-4.889
 ranger-4.750
Token sounds
Token ID5238
Feature activation-1.72e+00

Pos logit contributions
ug+2.144
idd+2.068
ooming+2.001
irled+1.948
ips+1.929
Neg logit contributions
 stranger-1.265
 keeper-1.235
 stall-1.162
 ranger-1.099
 seller-1.077
Token.
Token ID13
Feature activation-7.93e-01

Pos logit contributions
ug+4.408
idd+4.303
ooming+4.190
anged+4.095
irled+4.091
Neg logit contributions
 stranger-1.939
 keeper-1.764
 stall-1.705
 bench-1.530
 truth-1.526
Token They
Token ID1119
Feature activation-8.16e-01

Pos logit contributions
ug+1.810
idd+1.707
ooming+1.704
irled+1.632
zz+1.631
Neg logit contributions
 stranger-1.068
 keeper-1.018
 stall-1.001
 Heidi-0.942
 ranger-0.933
Token have
Token ID423
Feature activation-1.03e+00

Pos logit contributions
ug+1.841
idd+1.746
ooming+1.706
irled+1.641
med+1.635
Neg logit contributions
 stranger-1.151
 keeper-1.084
 stall-1.061
 seller-1.002
 ranger-0.983
Token fun
Token ID1257
Feature activation-3.85e+00

Pos logit contributions
ug+2.519
idd+2.376
ooming+2.357
zz+2.258
irled+2.251
Neg logit contributions
 stranger-1.282
 keeper-1.208
 stall-1.183
 truth-1.099
 ranger-1.089
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
ug+10.610
idd+10.393
zz+10.000
anged+9.920
irled+9.916
Neg logit contributions
 stranger-3.821
 nearby-3.682
 bench-3.608
 keeper-3.548
 stall-3.531
Token but
Token ID475
Feature activation-4.19e-01

Pos logit contributions
ug+2.866
idd+2.763
ooming+2.672
irled+2.613
med+2.583
Neg logit contributions
 stranger-1.524
 keeper-1.447
 stall-1.405
 ranger-1.321
 bench-1.314
Token they
Token ID484
Feature activation+7.71e-02

Pos logit contributions
ug+0.936
idd+0.886
ooming+0.868
irled+0.833
zz+0.833
Neg logit contributions
 stranger-0.598
 keeper-0.568
 stall-0.555
 Heidi-0.517
 ranger-0.516
Token also
Token ID635
Feature activation-9.88e-01

Pos logit contributions
 stranger+0.113
 keeper+0.107
 stall+0.105
 Heidi+0.099
 ranger+0.097
Neg logit contributions
ug-0.167
idd-0.158
ooming-0.154
zz-0.149
irled-0.148
Token want
Token ID765
Feature activation+3.11e-01

Pos logit contributions
ug+2.239
idd+2.113
ooming+2.075
med+1.988
irled+1.986
Neg logit contributions
 stranger-1.393
 keeper-1.323
 stall-1.287
 ranger-1.204
 seller-1.199
Token her
Token ID607
Feature activation-1.35e+00

Pos logit contributions
ug+1.361
idd+1.292
ooming+1.266
zz+1.221
irled+1.220
Neg logit contributions
 stranger-0.736
 keeper-0.690
 stall-0.675
 Heidi-0.629
 ranger-0.620
Token.
Token ID13
Feature activation-3.09e-01

Pos logit contributions
ug+2.838
idd+2.676
ooming+2.563
zz+2.511
irled+2.504
Neg logit contributions
 stranger-2.150
 keeper-2.077
 stall-2.026
 ranger-1.910
 seller-1.907
Token They
Token ID1119
Feature activation-7.88e-01

Pos logit contributions
ug+0.747
idd+0.709
ooming+0.694
irled+0.671
zz+0.670
Neg logit contributions
 stranger-0.379
 keeper-0.357
 stall-0.350
 Heidi-0.322
 ranger-0.320
Token both
Token ID1111
Feature activation-1.44e+00

Pos logit contributions
ug+1.493
idd+1.402
ooming+1.377
ips+1.306
irled+1.286
Neg logit contributions
 stranger-1.388
 keeper-1.327
 stall-1.310
 seller-1.247
 ranger-1.241
Token had
Token ID550
Feature activation+6.97e-03

Pos logit contributions
ug+3.233
ooming+3.082
idd+3.075
ips+2.966
irled+2.844
Neg logit contributions
 stranger-2.022
 keeper-1.922
 stall-1.861
 ranger-1.766
 seller-1.756
Token fun
Token ID1257
Feature activation-3.84e+00

Pos logit contributions
 stranger+0.009
 keeper+0.009
 stall+0.009
 Heidi+0.008
 seller+0.008
Neg logit contributions
ug-0.016
idd-0.015
ooming-0.015
med-0.014
irled-0.014
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
ug+9.769
idd+9.557
zz+9.038
ooming+8.770
anged+8.749
Neg logit contributions
 stranger-4.502
 nearby-4.456
 bench-4.439
 stall-4.102
 keeper-4.047
Token created
Token ID2727
Feature activation-3.34e-01

Pos logit contributions
ug+2.866
idd+2.737
ooming+2.720
ips+2.574
zz+2.527
Neg logit contributions
 stranger-2.055
 keeper-2.009
 stall-1.883
 ranger-1.873
 Heidi-1.805
Token many
Token ID867
Feature activation-1.45e+00

Pos logit contributions
ug+0.866
idd+0.826
ooming+0.808
zz+0.783
irled+0.781
Neg logit contributions
 stranger-0.350
 keeper-0.325
 stall-0.317
 Heidi-0.286
 ranger-0.283
Token more
Token ID517
Feature activation-2.31e+00

Pos logit contributions
ug+3.642
idd+3.431
ooming+3.345
zz+3.343
ged+3.285
Neg logit contributions
 stranger-1.804
 keeper-1.653
 stall-1.608
 truth-1.494
 bench-1.466
Token things
Token ID1243
Feature activation-1.54e-01

Pos logit contributions
ug+5.740
idd+5.410
ooming+5.314
ged+5.259
zz+5.224
Neg logit contributions
 stranger-3.125
 keeper-2.850
 stall-2.757
 ranger-2.613
 bench-2.612
Token\n
Token ID198
Feature activation-5.45e-01

Pos logit contributions
ug+1.490
idd+1.421
ooming+1.383
irled+1.343
med+1.340
Neg logit contributions
 stranger-0.750
 keeper-0.702
 stall-0.691
 Heidi-0.646
 ranger-0.633
Token\n
Token ID198
Feature activation+4.21e-02

Pos logit contributions
ug+1.301
idd+1.243
ooming+1.214
med+1.173
irled+1.172
Neg logit contributions
 stranger-0.673
 keeper-0.636
 stall-0.620
 Heidi-0.571
 ranger-0.570
TokenThey
Token ID2990
Feature activation-1.14e+00

Pos logit contributions
 stranger+0.057
 keeper+0.054
 stall+0.053
 Heidi+0.049
 seller+0.049
Neg logit contributions
ug-0.098
idd-0.092
ooming-0.090
zz-0.087
irled-0.087
Token were
Token ID547
Feature activation-1.18e+00

Pos logit contributions
ug+2.327
idd+2.194
ooming+2.128
ips+2.058
irled+2.002
Neg logit contributions
 stranger-1.850
 keeper-1.731
 stall-1.730
 seller-1.650
 bench-1.641
Token having
Token ID1719
Feature activation-5.03e-01

Pos logit contributions
ug+2.613
idd+2.453
ooming+2.420
zz+2.308
ged+2.286
Neg logit contributions
 stranger-1.718
 keeper-1.664
 stall-1.635
 Heidi-1.523
 ranger-1.511
Token fun
Token ID1257
Feature activation-3.84e+00

Pos logit contributions
ug+1.134
idd+1.075
ooming+1.051
zz+1.011
irled+1.007
Neg logit contributions
 stranger-0.706
 keeper-0.663
 stall-0.653
 Heidi-0.606
 truth-0.604
Token with
Token ID351
Feature activation-1.07e+00

Pos logit contributions
ug+10.814
idd+10.167
anged+9.857
zz+9.708
ooming+9.526
Neg logit contributions
 bench-3.616
 nearby-3.546
 stranger-3.511
 stall-3.342
 keeper-3.200
Token their
Token ID511
Feature activation-2.16e+00

Pos logit contributions
ug+2.491
idd+2.384
ooming+2.342
zz+2.230
irled+2.227
Neg logit contributions
 stranger-1.423
 keeper-1.319
 stall-1.318
 Heidi-1.218
 bench-1.211
Token paper
Token ID3348
Feature activation-1.02e+00

Pos logit contributions
ug+5.080
idd+4.716
med+4.565
ged+4.502
ooming+4.497
Neg logit contributions
 stranger-3.180
 stall-2.990
 keeper-2.953
 bench-2.849
 seller-2.720
Token toys
Token ID14958
Feature activation-3.66e-01

Pos logit contributions
ug+2.144
idd+2.031
ooming+1.957
irled+1.901
ged+1.894
Neg logit contributions
 stranger-1.572
 keeper-1.528
 stall-1.491
 seller-1.404
 ranger-1.394
Token.
Token ID13
Feature activation-3.69e-01

Pos logit contributions
ug+0.823
idd+0.780
ooming+0.759
irled+0.735
med+0.731
Neg logit contributions
 stranger-0.511
 keeper-0.484
 stall-0.476
 ranger-0.440
 seller-0.438
TokenThe
Token ID464
Feature activation-7.39e-01

Pos logit contributions
 stranger+0.048
 keeper+0.045
 stall+0.044
 Heidi+0.040
 seller+0.040
Neg logit contributions
ug-0.099
idd-0.094
ooming-0.092
med-0.089
irled-0.089
Token cats
Token ID11875
Feature activation-8.70e-01

Pos logit contributions
ug+1.629
idd+1.533
ooming+1.499
zz+1.455
irled+1.451
Neg logit contributions
 stranger-1.109
 keeper-1.049
 stall-1.030
 ranger-0.961
 seller-0.960
Token had
Token ID550
Feature activation-3.06e-01

Pos logit contributions
ug+1.788
idd+1.700
ooming+1.653
ips+1.600
irled+1.590
Neg logit contributions
 stranger-1.378
 keeper-1.302
 stall-1.279
 seller-1.215
 ranger-1.192
Token lots
Token ID6041
Feature activation-2.41e+00

Pos logit contributions
ug+0.747
idd+0.711
ooming+0.695
irled+0.672
zz+0.671
Neg logit contributions
 stranger-0.368
 keeper-0.344
 stall-0.337
 Heidi-0.308
 ranger-0.307
Token of
Token ID286
Feature activation-2.72e+00

Pos logit contributions
idd+5.089
ug+5.088
zz+4.785
irled+4.767
ooming+4.738
Neg logit contributions
 stranger-3.982
 keeper-3.618
 stall-3.501
 ranger-3.405
 truth-3.362
Token fun
Token ID1257
Feature activation-3.80e+00

Pos logit contributions
ug+7.028
idd+6.895
ooming+6.737
zz+6.577
med+6.441
Neg logit contributions
 stranger-3.355
 truth-3.110
 stall-2.953
 keeper-2.796
 bench-2.745
Token playing
Token ID2712
Feature activation-1.06e+00

Pos logit contributions
idd+9.599
ug+9.322
zz+9.089
anged+8.803
ips+8.765
Neg logit contributions
 stranger-4.832
 bench-4.674
 stall-4.671
 nearby-4.317
 keeper-4.281
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
ug+2.188
idd+2.098
ooming+2.003
zz+1.969
anged+1.944
Neg logit contributions
 stranger-1.669
 keeper-1.592
 stall-1.584
 ranger-1.471
 bench-1.461
Token but
Token ID475
Feature activation-3.20e-01

Pos logit contributions
ug+3.224
idd+3.064
ooming+2.972
irled+2.891
ips+2.888
Neg logit contributions
 stranger-2.063
 keeper-1.991
 stall-1.921
 ranger-1.831
 bench-1.802
Token after
Token ID706
Feature activation-1.63e+00

Pos logit contributions
ug+0.667
idd+0.630
ooming+0.614
irled+0.589
zz+0.589
Neg logit contributions
 stranger-0.502
 keeper-0.478
 stall-0.470
 Heidi-0.440
 ranger-0.439
Token a
Token ID257
Feature activation-1.01e+00

Pos logit contributions
ug+4.391
idd+4.249
ooming+4.137
zz+4.034
irled+4.018
Neg logit contributions
 stranger-1.597
 keeper-1.456
 stall-1.435
 bench-1.318
 ranger-1.295
Token string
Token ID4731
Feature activation+3.85e-01

Pos logit contributions
ug+3.103
idd+2.952
ooming+2.903
zz+2.794
med+2.785
Neg logit contributions
 stranger-1.097
 stall-0.979
 keeper-0.956
 bench-0.881
 truth-0.860
Token.
Token ID13
Feature activation+3.62e-01

Pos logit contributions
 stranger+0.538
 keeper+0.507
 stall+0.497
 Heidi+0.476
Stan+0.472
Neg logit contributions
ug-0.866
idd-0.807
ooming-0.791
med-0.768
ips-0.764
Token She
Token ID1375
Feature activation-1.74e+00

Pos logit contributions
 stranger+0.453
 keeper+0.428
 stall+0.410
 ranger+0.383
 seller+0.379
Neg logit contributions
ug-0.856
idd-0.813
ooming-0.796
ips-0.783
zz-0.776
Token would
Token ID561
Feature activation-1.18e+00

Pos logit contributions
ug+4.772
idd+4.624
ooming+4.617
ips+4.488
med+4.393
Neg logit contributions
 stranger-1.511
 stall-1.377
 keeper-1.354
 bench-1.270
 seller-1.264
Token spend
Token ID4341
Feature activation-2.04e+00

Pos logit contributions
ug+2.650
idd+2.575
ooming+2.468
ips+2.393
med+2.389
Neg logit contributions
 stranger-1.677
 stall-1.585
 keeper-1.556
 seller-1.483
 bench-1.482
Token hours
Token ID2250
Feature activation-3.80e+00

Pos logit contributions
ug+5.083
idd+4.966
ooming+4.723
anged+4.533
zz+4.459
Neg logit contributions
 stranger-2.625
 stall-2.413
 bench-2.347
 Heidi-2.346
 keeper-2.301
Token on
Token ID319
Feature activation-1.04e+00

Pos logit contributions
idd+8.049
ug+8.014
anged+7.626
irled+7.562
zz+7.355
Neg logit contributions
 stall-6.091
 stranger-5.845
 nearby-5.840
 bench-5.687
 seller-5.378
Token end
Token ID886
Feature activation-1.60e+00

Pos logit contributions
ug+2.715
idd+2.596
ooming+2.495
zz+2.434
irled+2.426
Neg logit contributions
 stranger-1.144
 stall-1.066
 keeper-1.035
 Heidi-0.982
 bench-0.968
Token weaving
Token ID44889
Feature activation-1.03e+00

Pos logit contributions
ug+3.719
idd+3.604
ooming+3.494
anged+3.384
ged+3.365
Neg logit contributions
 stranger-2.146
 stall-2.045
 keeper-2.023
 bench-1.905
 Heidi-1.865
Token patterns
Token ID7572
Feature activation-9.96e-01

Pos logit contributions
ug+2.450
idd+2.382
ooming+2.303
irled+2.218
zz+2.190
Neg logit contributions
 stranger-1.331
 keeper-1.248
 stall-1.239
 Heidi-1.138
 ranger-1.114
Token and
Token ID290
Feature activation-2.41e+00

Pos logit contributions
ug+2.054
idd+1.982
ooming+1.883
irled+1.829
anged+1.800
Neg logit contributions
 stranger-1.571
 keeper-1.505
 stall-1.479
 seller-1.387
 ranger-1.374
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
ug+4.328
idd+4.311
ooming+4.182
irled+4.150
zz+4.027
Neg logit contributions
 stranger-1.968
 keeper-1.935
 stall-1.838
 seller-1.689
 ranger-1.689
Token He
Token ID679
Feature activation-1.98e+00

Pos logit contributions
ug+0.053
idd+0.050
ooming+0.049
med+0.048
irled+0.048
Neg logit contributions
 stranger-0.023
 keeper-0.021
 stall-0.021
 Heidi-0.019
Stan-0.019
Token was
Token ID373
Feature activation-1.84e+00

Pos logit contributions
ug+4.676
ooming+4.576
ips+4.463
idd+4.458
ged+4.242
Neg logit contributions
 stranger-2.482
 keeper-2.466
 stall-2.261
 ranger-2.235
 seller-2.230
Token very
Token ID845
Feature activation-1.43e+00

Pos logit contributions
ug+4.918
ooming+4.717
idd+4.704
zz+4.589
irled+4.537
Neg logit contributions
 stranger-1.874
 keeper-1.767
 stall-1.677
 seller-1.556
 ranger-1.524
Token good
Token ID922
Feature activation-2.17e+00

Pos logit contributions
ug+3.555
idd+3.411
ooming+3.389
med+3.316
ips+3.278
Neg logit contributions
 stranger-1.706
 keeper-1.597
 stall-1.530
 seller-1.431
 ranger-1.417
Token at
Token ID379
Feature activation-3.79e+00

Pos logit contributions
ug+4.720
idd+4.603
irled+4.531
med+4.425
ooming+4.398
Neg logit contributions
 stranger-3.298
 keeper-2.986
 stall-2.899
 seller-2.781
 ranger-2.774
Token fixing
Token ID18682
Feature activation-1.63e+00

Pos logit contributions
ug+8.621
idd+8.425
med+8.023
ooming+8.013
irled+7.852
Neg logit contributions
 truth-6.263
 keeper-5.921
 stranger-5.844
 market-5.587
 nearby-5.531
Token them
Token ID606
Feature activation-7.22e-01

Pos logit contributions
ug+3.650
idd+3.573
ooming+3.431
zz+3.373
irled+3.352
Neg logit contributions
 stranger-2.241
 keeper-2.154
 stall-2.131
 truth-1.990
 ranger-1.905
Token.
Token ID13
Feature activation-4.09e-01

Pos logit contributions
ug+1.713
idd+1.654
ooming+1.589
irled+1.558
zz+1.551
Neg logit contributions
 stranger-0.903
 keeper-0.864
 stall-0.841
 seller-0.771
 ranger-0.765
Token One
Token ID1881
Feature activation-1.02e+00

Pos logit contributions
ug+1.014
idd+0.965
ooming+0.948
irled+0.917
zz+0.916
Neg logit contributions
 stranger-0.481
 keeper-0.451
 stall-0.443
 Heidi-0.404
 ranger-0.402
Token day
Token ID1110
Feature activation-1.08e+00

Pos logit contributions
ug+1.905
idd+1.821
ooming+1.772
ips+1.720
irled+1.706
Neg logit contributions
 stranger-1.840
 keeper-1.742
 stall-1.711
 ranger-1.616
 seller-1.600
TokenJohn
Token ID7554
Feature activation-1.04e+00

Pos logit contributions
 stranger+2.471
 stall+2.392
 keeper+2.343
 camel+2.284
 ranger+2.225
Neg logit contributions
ug-2.901
idd-2.784
ooming-2.665
med-2.642
irled-2.605
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
ug+2.334
idd+2.211
ooming+2.152
ips+2.098
ged+2.036
Neg logit contributions
 stranger-1.459
 keeper-1.421
 stall-1.350
 seller-1.284
 ranger-1.274
Token Jane
Token ID12091
Feature activation-1.06e+00

Pos logit contributions
 stranger+0.015
 keeper+0.014
 stall+0.014
Stan+0.013
 Heidi+0.013
Neg logit contributions
ug-0.024
idd-0.022
ooming-0.022
zz-0.021
med-0.021
Token spent
Token ID3377
Feature activation-1.73e+00

Pos logit contributions
ug+2.125
idd+2.022
ooming+1.951
ips+1.890
irled+1.836
Neg logit contributions
 stranger-1.695
 keeper-1.637
 stall-1.592
 seller-1.528
 ranger-1.485
Token the
Token ID262
Feature activation-2.03e+00

Pos logit contributions
ug+4.434
idd+4.297
ooming+4.129
zz+4.013
anged+3.987
Neg logit contributions
 stranger-1.967
 stall-1.872
 keeper-1.770
 bench-1.715
 Heidi-1.693
Token day
Token ID1110
Feature activation-3.79e+00

Pos logit contributions
ug+4.972
idd+4.786
anged+4.616
ooming+4.497
ips+4.448
Neg logit contributions
 stranger-2.628
 keeper-2.371
 stall-2.316
 Heidi-2.238
 bench-2.222
Token practicing
Token ID18207
Feature activation-2.45e+00

Pos logit contributions
ug+7.211
zz+6.866
idd+6.794
anged+6.431
irled+6.271
Neg logit contributions
 stranger-7.150
 stall-6.724
 nearby-6.523
 bench-6.421
 keeper-6.323
Token tennis
Token ID20790
Feature activation-1.34e+00

Pos logit contributions
idd+5.780
ug+5.739
zz+5.301
ooming+5.213
anged+5.138
Neg logit contributions
 stranger-3.415
 stall-3.228
 keeper-3.182
 bench-3.035
 truth-2.951
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
ug+2.884
idd+2.807
ooming+2.654
zz+2.592
irled+2.592
Neg logit contributions
 stranger-2.045
 keeper-1.971
 stall-1.934
 ranger-1.810
 seller-1.787
Token When
Token ID1649
Feature activation-6.22e-01

Pos logit contributions
ug+2.969
idd+2.784
ooming+2.781
zz+2.650
irled+2.639
Neg logit contributions
 stranger-1.734
 keeper-1.687
 stall-1.667
 Heidi-1.603
 Stanley-1.545
Token they
Token ID484
Feature activation-9.38e-01

Pos logit contributions
ug+1.322
idd+1.245
ooming+1.211
irled+1.161
zz+1.160
Neg logit contributions
 stranger-0.971
 keeper-0.918
 stall-0.900
 Heidi-0.855
 ranger-0.837
Token trumpet
Token ID40334
Feature activation-7.95e-01

Pos logit contributions
ug+3.354
idd+3.039
anged+2.979
ooming+2.868
ips+2.806
Neg logit contributions
 stranger-3.847
 keeper-3.502
 stall-3.476
 truth-3.382
 bench-3.355
Token and
Token ID290
Feature activation-2.33e+00

Pos logit contributions
ug+1.675
idd+1.603
ooming+1.536
irled+1.500
zz+1.479
Neg logit contributions
 stranger-1.218
 keeper-1.174
 stall-1.147
 seller-1.076
 ranger-1.072
Token had
Token ID550
Feature activation+1.27e-01

Pos logit contributions
ug+4.668
idd+4.267
ooming+4.139
ips+3.938
irled+3.883
Neg logit contributions
 stranger-4.203
 keeper-4.194
 ranger-3.985
 stall-3.889
 seller-3.875
Token lots
Token ID6041
Feature activation-2.07e+00

Pos logit contributions
 stranger+0.168
 keeper+0.157
 stall+0.155
 Heidi+0.142
 seller+0.142
Neg logit contributions
ug-0.294
idd-0.280
ooming-0.272
med-0.265
irled-0.265
Token of
Token ID286
Feature activation-2.36e+00

Pos logit contributions
ug+4.347
idd+4.191
zz+4.028
ooming+3.976
irled+3.956
Neg logit contributions
 stranger-3.493
 keeper-3.176
 stall-3.039
 ranger-3.031
 truth-2.993
Token fun
Token ID1257
Feature activation-3.78e+00

Pos logit contributions
ug+6.525
idd+6.183
ooming+6.013
zz+5.877
anged+5.870
Neg logit contributions
 stranger-2.646
 truth-2.468
 stall-2.202
 keeper-2.139
 bench-2.113
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
ug+11.770
idd+11.429
zz+11.297
anged+10.739
ooming+10.618
Neg logit contributions
 stranger-2.791
 bench-2.271
 stall-2.225
 truth-2.130
 keeper-2.109
Token\n
Token ID198
Feature activation-1.07e+00

Pos logit contributions
ug+3.060
idd+2.923
ooming+2.855
zz+2.736
irled+2.709
Neg logit contributions
 stranger-1.677
 keeper-1.616
 stall-1.582
 Heidi-1.519
 ranger-1.504
Token\n
Token ID198
Feature activation-7.53e-01

Pos logit contributions
ug+2.677
idd+2.588
ooming+2.522
irled+2.420
zz+2.419
Neg logit contributions
 stranger-1.212
 keeper-1.131
 stall-1.114
 Heidi-1.030
Stan-1.026
TokenAfter
Token ID3260
Feature activation-2.17e+00

Pos logit contributions
ug+1.691
idd+1.617
ooming+1.564
ips+1.504
zz+1.503
Neg logit contributions
 stranger-1.062
 keeper-0.997
 stall-0.978
Stan-0.918
 Heidi-0.916
Token a
Token ID257
Feature activation-1.22e+00

Pos logit contributions
ug+5.812
idd+5.472
ooming+5.282
zz+5.272
irled+5.207
Neg logit contributions
 stranger-2.392
 keeper-2.086
 stall-2.082
 ranger-2.016
 bench-1.973
Token together
Token ID1978
Feature activation-9.55e-01

Pos logit contributions
ug+3.661
idd+3.604
ooming+3.323
irled+3.289
zz+3.260
Neg logit contributions
 stranger-2.620
 keeper-2.541
 stall-2.402
 ranger-2.316
 bench-2.304
Token.
Token ID13
Feature activation-1.40e+00

Pos logit contributions
ug+2.259
idd+2.194
ooming+2.094
irled+2.033
zz+2.031
Neg logit contributions
 stranger-1.238
 keeper-1.177
 stall-1.160
 bench-1.055
 seller-1.052
Token They
Token ID1119
Feature activation-7.51e-01

Pos logit contributions
ug+3.414
idd+3.212
ooming+3.176
zz+3.049
med+2.985
Neg logit contributions
 stranger-1.731
 keeper-1.686
 stall-1.624
 Heidi-1.568
 ranger-1.559
Token both
Token ID1111
Feature activation-1.35e+00

Pos logit contributions
ug+1.515
idd+1.434
ooming+1.398
ips+1.336
irled+1.320
Neg logit contributions
 stranger-1.227
 keeper-1.169
 stall-1.141
 seller-1.085
 ranger-1.084
Token had
Token ID550
Feature activation-4.38e-01

Pos logit contributions
ug+2.925
idd+2.781
ooming+2.768
ips+2.661
irled+2.556
Neg logit contributions
 stranger-2.012
 keeper-1.920
 stall-1.827
 ranger-1.769
 seller-1.753
Token fun
Token ID1257
Feature activation-3.78e+00

Pos logit contributions
ug+1.049
idd+0.996
ooming+0.975
zz+0.940
irled+0.937
Neg logit contributions
 stranger-0.554
 keeper-0.521
 stall-0.510
 Heidi-0.470
 ranger-0.468
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
ug+9.429
idd+9.307
zz+8.801
ips+8.701
anged+8.553
Neg logit contributions
 stranger-4.879
 bench-4.722
 nearby-4.488
 stall-4.440
 keeper-4.349
Token Lily
Token ID20037
Feature activation-1.12e+00

Pos logit contributions
ug+3.767
idd+3.644
ooming+3.624
ips+3.431
zz+3.368
Neg logit contributions
 stranger-2.307
 keeper-2.230
 ranger-2.067
 stall-2.007
 Heidi-1.971
Token realized
Token ID6939
Feature activation+7.92e-01

Pos logit contributions
ug+2.732
idd+2.597
ooming+2.514
ips+2.446
zz+2.418
Neg logit contributions
 stranger-1.342
 keeper-1.302
 stall-1.221
 ranger-1.159
 seller-1.142
Token that
Token ID326
Feature activation-9.58e-01

Pos logit contributions
 stranger+1.011
 stall+0.998
 keeper+0.974
Stan+0.902
 seller+0.898
Neg logit contributions
ug-1.834
idd-1.723
ooming-1.688
med-1.657
irled-1.641
Token it
Token ID340
Feature activation-8.27e-01

Pos logit contributions
ug+2.074
idd+1.965
ooming+1.910
zz+1.841
irled+1.832
Neg logit contributions
 stranger-1.478
 keeper-1.383
 stall-1.327
 truth-1.287
 Heidi-1.287
Token shiny
Token ID22441
Feature activation-2.06e+00

Pos logit contributions
ug+3.826
idd+3.643
ooming+3.613
ips+3.378
zz+3.331
Neg logit contributions
 stranger-2.771
 keeper-2.732
 stall-2.584
 truth-2.527
 seller-2.511
Token pieces
Token ID5207
Feature activation-1.02e+00

Pos logit contributions
ug+4.666
idd+4.483
ooming+4.289
irled+4.139
zz+4.113
Neg logit contributions
 stranger-3.023
 keeper-2.862
 stall-2.842
 truth-2.712
 seller-2.709
Token.
Token ID13
Feature activation+6.98e-02

Pos logit contributions
ug+2.505
idd+2.400
ooming+2.348
irled+2.266
zz+2.252
Neg logit contributions
 stranger-1.221
 keeper-1.150
 stall-1.111
 seller-1.034
 truth-1.022
Token She
Token ID1375
Feature activation-2.39e+00

Pos logit contributions
 stranger+0.094
 keeper+0.089
 stall+0.087
 ranger+0.080
 seller+0.080
Neg logit contributions
ug-0.160
idd-0.152
ooming-0.148
zz-0.143
irled-0.143
Token worked
Token ID3111
Feature activation-1.99e+00

Pos logit contributions
ug+5.452
ooming+5.282
idd+5.146
ips+5.088
ipping+4.742
Neg logit contributions
 stranger-3.214
 keeper-3.186
 seller-3.182
 stall-3.055
 truth-2.860
Token and
Token ID290
Feature activation-3.78e+00

Pos logit contributions
ug+4.827
idd+4.769
ooming+4.500
anged+4.317
zz+4.302
Neg logit contributions
 stranger-2.481
 keeper-2.412
 stall-2.321
 truth-2.175
 ranger-2.112
Token worked
Token ID3111
Feature activation-2.03e+00

Pos logit contributions
ug+8.452
idd+8.091
ooming+8.067
ips+7.414
ang+7.065
Neg logit contributions
 keeper-5.451
 nearby-5.370
 seller-5.323
 stranger-5.198
 stall-5.167
Token,
Token ID11
Feature activation-2.76e+00

Pos logit contributions
ug+4.895
idd+4.885
ooming+4.567
anged+4.442
zz+4.420
Neg logit contributions
 stranger-2.514
 keeper-2.475
 stall-2.373
 truth-2.248
 ranger-2.160
Token putting
Token ID5137
Feature activation-1.86e+00

Pos logit contributions
ug+6.538
idd+6.451
ooming+5.972
irled+5.968
ips+5.781
Neg logit contributions
 stranger-3.676
 keeper-3.670
 stall-3.657
 seller-3.427
 nearby-3.328
Token each
Token ID1123
Feature activation-2.43e+00

Pos logit contributions
ug+4.520
idd+4.342
ooming+4.246
anged+4.027
irled+3.993
Neg logit contributions
 stranger-2.352
 stall-2.269
 keeper-2.239
 truth-2.038
 seller-2.027
Token piece
Token ID3704
Feature activation-1.27e+00

Pos logit contributions
ug+5.942
idd+5.776
ooming+5.669
irled+5.436
ips+5.320
Neg logit contributions
 stranger-3.071
 stall-3.017
 keeper-2.818
 bench-2.708
 seller-2.691
Token.
Token ID13
Feature activation+2.43e-02

Pos logit contributions
ug+1.453
idd+1.380
ooming+1.332
irled+1.297
zz+1.289
Neg logit contributions
 stranger-0.954
 keeper-0.912
 stall-0.884
 ranger-0.833
 seller-0.820
Token They
Token ID1119
Feature activation-1.31e+00

Pos logit contributions
 stranger+0.034
 keeper+0.032
 stall+0.031
 seller+0.029
 Heidi+0.029
Neg logit contributions
ug-0.055
idd-0.052
ooming-0.051
irled-0.049
ips-0.049
Token had
Token ID550
Feature activation-4.24e-01

Pos logit contributions
ug+2.695
idd+2.574
ooming+2.528
ips+2.393
irled+2.339
Neg logit contributions
 stranger-2.093
 keeper-1.981
 stall-1.922
 seller-1.856
 bench-1.837
Token lots
Token ID6041
Feature activation-2.15e+00

Pos logit contributions
ug+1.030
idd+0.979
ooming+0.958
zz+0.922
irled+0.922
Neg logit contributions
 stranger-0.519
 keeper-0.486
 stall-0.476
 Heidi-0.438
 ranger-0.435
Token of
Token ID286
Feature activation-2.25e+00

Pos logit contributions
ug+4.450
idd+4.401
ooming+4.097
zz+4.042
ged+4.029
Neg logit contributions
 stranger-3.634
 keeper-3.286
 stall-3.154
 ranger-3.125
 bench-3.090
Token fun
Token ID1257
Feature activation-3.78e+00

Pos logit contributions
ug+6.145
idd+5.853
ooming+5.694
ged+5.590
anged+5.572
Neg logit contributions
 stranger-2.476
 truth-2.266
 stall-2.098
 keeper-2.085
 bench-2.070
Token that
Token ID326
Feature activation-2.00e+00

Pos logit contributions
ug+10.022
idd+9.742
anged+9.193
zz+9.080
med+8.845
Neg logit contributions
 stranger-4.249
 bench-4.182
 stall-3.763
 keeper-3.692
 nearby-3.619
Token day
Token ID1110
Feature activation-2.06e+00

Pos logit contributions
ug+4.299
idd+4.166
ooming+3.945
ips+3.834
med+3.822
Neg logit contributions
 stranger-3.257
 keeper-3.125
 bench-2.962
 stall-2.937
 ranger-2.855
Token and
Token ID290
Feature activation-1.61e+00

Pos logit contributions
ug+5.222
idd+5.052
ooming+4.845
irled+4.825
zz+4.773
Neg logit contributions
 stranger-2.291
 keeper-2.132
 stall-2.096
 bench-1.948
 ranger-1.892
Token Lily
Token ID20037
Feature activation-9.63e-01

Pos logit contributions
ug+3.588
idd+3.414
ooming+3.389
ips+3.200
zz+3.153
Neg logit contributions
 stranger-2.382
 keeper-2.283
 ranger-2.113
 stall-2.111
 Heidi-2.077
Token was
Token ID373
Feature activation-1.26e+00

Pos logit contributions
ug+2.267
idd+2.149
ooming+2.086
ips+2.016
med+1.999
Neg logit contributions
 stranger-1.240
 keeper-1.183
 stall-1.137
 ranger-1.058
 seller-1.057
Tokenmy
Token ID1820
Feature activation-7.26e-01

Pos logit contributions
ug+1.114
idd+1.052
ooming+1.013
irled+0.988
med+0.978
Neg logit contributions
 stranger-0.920
 keeper-0.882
 stall-0.861
 ranger-0.814
 seller-0.812
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
ug+1.783
idd+1.692
ooming+1.644
irled+1.612
ips+1.612
Neg logit contributions
 stranger-0.874
 keeper-0.831
 stall-0.809
 ranger-0.743
 seller-0.738
Token They
Token ID1119
Feature activation-2.83e-01

Pos logit contributions
ug+2.364
idd+2.208
ooming+2.198
irled+2.129
zz+2.129
Neg logit contributions
 stranger-1.321
 keeper-1.274
 stall-1.251
 Heidi-1.184
 ranger-1.175
Token were
Token ID547
Feature activation-1.76e+00

Pos logit contributions
ug+0.510
idd+0.477
ooming+0.462
irled+0.440
med+0.439
Neg logit contributions
 stranger-0.522
 keeper-0.499
 stall-0.493
 Heidi-0.466
 seller-0.465
Token having
Token ID1719
Feature activation-9.70e-01

Pos logit contributions
ug+3.214
ooming+3.043
idd+2.983
ged+2.893
anged+2.864
Neg logit contributions
 stranger-3.149
 keeper-3.060
 stall-3.035
 Heidi-2.950
 Stanley-2.940
Token fun
Token ID1257
Feature activation-3.76e+00

Pos logit contributions
ug+2.427
idd+2.310
ooming+2.278
zz+2.203
ged+2.183
Neg logit contributions
 stranger-1.120
 keeper-1.032
 stall-1.011
 Heidi-0.946
 truth-0.944
Token catching
Token ID16508
Feature activation-6.41e-01

Pos logit contributions
ug+10.067
idd+9.532
med+9.180
anged+9.141
ooming+8.927
Neg logit contributions
 stranger-3.918
 nearby-3.741
 bench-3.638
 stall-3.416
 truth-3.412
Token butterflies
Token ID44571
Feature activation-5.67e-01

Pos logit contributions
ug+1.342
idd+1.266
ooming+1.230
irled+1.183
ips+1.181
Neg logit contributions
 stranger-1.005
 keeper-0.949
 stall-0.934
 ranger-0.877
 Heidi-0.873
Token when
Token ID618
Feature activation-1.82e+00

Pos logit contributions
ug+1.133
idd+1.072
ooming+1.034
irled+1.002
zz+0.991
Neg logit contributions
 stranger-0.928
 keeper-0.887
 stall-0.873
 ranger-0.821
 seller-0.817
Token Lily
Token ID20037
Feature activation-7.35e-01

Pos logit contributions
ug+4.199
idd+3.929
ooming+3.844
zz+3.790
irled+3.712
Neg logit contributions
 stranger-2.670
 keeper-2.462
 Heidi-2.348
 stall-2.338
 ranger-2.292
Token noticed
Token ID6810
Feature activation-8.02e-02

Pos logit contributions
ug+1.662
idd+1.562
ooming+1.519
ips+1.471
irled+1.467
Neg logit contributions
 stranger-1.015
 keeper-0.970
 stall-0.943
 ranger-0.883
 seller-0.875
Token\n
Token ID198
Feature activation-5.73e-01

Pos logit contributions
ug+3.002
idd+2.850
ooming+2.822
zz+2.742
irled+2.739
Neg logit contributions
 stranger-1.303
 keeper-1.239
 stall-1.183
 ranger-1.130
 truth-1.073
Token\n
Token ID198
Feature activation-3.00e-01

Pos logit contributions
ug+1.377
idd+1.317
ooming+1.288
zz+1.243
irled+1.241
Neg logit contributions
 stranger-0.703
 keeper-0.664
 stall-0.645
 ranger-0.596
 Heidi-0.590
TokenThey
Token ID2990
Feature activation-1.40e+00

Pos logit contributions
ug+0.697
idd+0.662
ooming+0.645
irled+0.623
zz+0.623
Neg logit contributions
 stranger-0.400
 keeper-0.377
 stall-0.369
 ranger-0.340
 Heidi-0.340
Token were
Token ID547
Feature activation-1.59e+00

Pos logit contributions
ug+2.925
idd+2.738
ooming+2.675
ips+2.578
irled+2.503
Neg logit contributions
 stranger-2.215
 keeper-2.104
 stall-2.087
 seller-2.023
 bench-1.975
Token having
Token ID1719
Feature activation-8.35e-01

Pos logit contributions
ug+3.899
idd+3.661
ooming+3.628
zz+3.526
ged+3.449
Neg logit contributions
 stranger-1.952
 keeper-1.919
 stall-1.866
 ranger-1.689
 Heidi-1.681
Token fun
Token ID1257
Feature activation-3.76e+00

Pos logit contributions
ug+1.972
idd+1.874
ooming+1.832
zz+1.781
ged+1.755
Neg logit contributions
 stranger-1.091
 keeper-1.020
 stall-1.007
 truth-0.944
 Heidi-0.925
Token.
Token ID13
Feature activation-7.06e-01

Pos logit contributions
ug+9.840
idd+9.517
zz+9.161
anged+8.697
ips+8.640
Neg logit contributions
 bench-4.383
 stranger-4.372
 stall-4.267
 nearby-4.153
 keeper-4.080
Token But
Token ID887
Feature activation-9.50e-01

Pos logit contributions
ug+1.580
idd+1.507
ooming+1.474
zz+1.412
irled+1.407
Neg logit contributions
 stranger-1.011
 keeper-0.966
 stall-0.952
 ranger-0.894
 Heidi-0.882
Token then
Token ID788
Feature activation+1.70e-01

Pos logit contributions
ug+1.989
idd+1.883
ooming+1.835
zz+1.762
ips+1.754
Neg logit contributions
 stranger-1.497
 keeper-1.437
 stall-1.396
 Heidi-1.333
 ranger-1.332
Token they
Token ID484
Feature activation-1.93e-01

Pos logit contributions
 stranger+0.285
 keeper+0.273
 stall+0.271
 Heidi+0.252
 ranger+0.252
Neg logit contributions
ug-0.332
idd-0.310
ooming-0.303
irled-0.290
med-0.290
Token saw
Token ID2497
Feature activation+4.24e-01

Pos logit contributions
ug+0.448
idd+0.425
ooming+0.414
irled+0.400
med+0.400
Neg logit contributions
 stranger-0.256
 keeper-0.241
 stall-0.237
 Heidi-0.218
 ranger-0.217
Token a
Token ID257
Feature activation+2.12e+00

Pos logit contributions
 stranger+0.879
 keeper+0.822
 stall+0.785
 seller+0.744
 Heidi+0.744
Neg logit contributions
ug-1.529
idd-1.448
ooming-1.398
med-1.385
zz-1.362
Token blanket
Token ID18447
Feature activation+8.06e-01

Pos logit contributions
 stranger+2.730
 Heidi+2.709
 keeper+2.683
Stan+2.595
 truth+2.561
Neg logit contributions
ug-4.669
idd-4.371
ooming-4.363
zz-4.160
ged-4.098
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
 stranger+1.335
 keeper+1.251
 stall+1.222
 Heidi+1.205
Stan+1.194
Neg logit contributions
ug-1.598
idd-1.471
ooming-1.452
med-1.390
ips-1.384
Token They
Token ID1119
Feature activation+1.09e+00

Pos logit contributions
 stranger+0.444
 keeper+0.417
 stall+0.409
 Heidi+0.389
Stan+0.388
Neg logit contributions
ug-0.574
idd-0.540
ooming-0.520
ips-0.505
irled-0.505
Token told
Token ID1297
Feature activation+2.75e-01

Pos logit contributions
 stranger+1.855
 keeper+1.773
 stall+1.713
 Heidi+1.682
 truth+1.648
Neg logit contributions
ug-2.067
idd-1.998
ooming-1.887
ged-1.870
med-1.843
Token them
Token ID606
Feature activation+2.76e+00

Pos logit contributions
 stranger+0.493
 keeper+0.471
 stall+0.470
Stan+0.442
 Heidi+0.441
Neg logit contributions
ug-0.514
idd-0.472
ooming-0.458
med-0.442
irled-0.442
Token their
Token ID511
Feature activation+1.27e+00

Pos logit contributions
 stranger+4.923
 stall+4.920
Stan+4.803
 keeper+4.793
 butcher+4.567
Neg logit contributions
ug-5.140
apy-4.472
idd-4.456
med-4.437
ooming-4.388
Token plan
Token ID1410
Feature activation+1.91e+00

Pos logit contributions
 stranger+1.592
 Heidi+1.505
 keeper+1.494
 stall+1.479
Stan+1.428
Neg logit contributions
ug-2.927
idd-2.792
ooming-2.714
zz-2.660
irled-2.649
Token.
Token ID13
Feature activation-2.91e-01

Pos logit contributions
 stranger+3.132
 stall+2.947
Stan+2.895
 camel+2.880
 Stanley+2.862
Neg logit contributions
ug-3.875
idd-3.515
ooming-3.452
med-3.447
ged-3.437
Token Mom
Token ID11254
Feature activation+3.42e-01

Pos logit contributions
ug+0.641
idd+0.607
ooming+0.592
irled+0.570
zz+0.570
Neg logit contributions
 stranger-0.419
 keeper-0.397
 stall-0.392
 Heidi-0.363
 ranger-0.362
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
 stranger+0.561
 keeper+0.529
 stall+0.527
 Heidi+0.501
Stan+0.493
Neg logit contributions
ug-0.677
idd-0.641
ooming-0.619
med-0.602
zz-0.602
Token re
Token ID302
Feature activation-5.78e-01

Pos logit contributions
ug+0.500
idd+0.472
ooming+0.459
irled+0.442
med+0.442
Neg logit contributions
 stranger-0.352
 keeper-0.333
 stall-0.328
 Heidi-0.306
 ranger-0.304
Tokenind
Token ID521
Feature activation-4.28e-01

Pos logit contributions
ug+1.141
idd+1.069
ooming+1.044
med+1.011
irled+1.010
Neg logit contributions
 stranger-0.964
 keeper-0.919
 stall-0.909
 ranger-0.845
 seller-0.842
Tokeneer
Token ID28153
Feature activation-3.73e-01

Pos logit contributions
ug+0.895
idd+0.845
ooming+0.826
irled+0.796
ips+0.790
Neg logit contributions
 stranger-0.670
 keeper-0.631
 stall-0.624
 seller-0.580
 Heidi-0.580
Token!
Token ID0
Feature activation+1.27e+00

Pos logit contributions
ug+0.797
idd+0.751
ooming+0.732
irled+0.709
ips+0.706
Neg logit contributions
 stranger-0.560
 keeper-0.530
 stall-0.523
 ranger-0.485
 Heidi-0.484
Token\n
Token ID198
Feature activation+4.15e-01

Pos logit contributions
 stranger+1.969
 keeper+1.909
 stall+1.792
 seller+1.706
 ranger+1.705
Neg logit contributions
ug-2.664
idd-2.513
ooming-2.410
irled-2.367
ged-2.363
Token\n
Token ID198
Feature activation+2.13e+00

Pos logit contributions
 stranger+0.596
 keeper+0.567
 stall+0.553
 seller+0.520
 ranger+0.515
Neg logit contributions
ug-0.919
idd-0.861
ooming-0.836
irled-0.812
zz-0.809
TokenP
Token ID47
Feature activation+1.03e+00

Pos logit contributions
 keeper+3.832
 stranger+3.740
 stall+3.537
 Heidi+3.481
 ranger+3.441
Neg logit contributions
ug-4.119
idd-3.769
ooming-3.671
ged-3.654
irled-3.619
Tokenete
Token ID14471
Feature activation+5.09e-01

Pos logit contributions
 stranger+1.305
 keeper+1.220
 stall+1.157
 Heidi+1.109
 ranger+1.101
Neg logit contributions
ug-2.583
idd-2.407
ooming-2.387
irled-2.330
ips-2.302
Token was
Token ID373
Feature activation+1.53e-01

Pos logit contributions
 stranger+0.786
 stall+0.735
 keeper+0.733
 Heidi+0.697
 butcher+0.682
Neg logit contributions
ug-1.065
idd-1.005
ooming-0.970
irled-0.954
zz-0.951
Token very
Token ID845
Feature activation+3.04e-01

Pos logit contributions
 stranger+0.219
 keeper+0.206
 stall+0.204
 Heidi+0.189
 seller+0.189
Neg logit contributions
ug-0.337
idd-0.321
ooming-0.309
irled-0.300
med-0.300
Token curious
Token ID11040
Feature activation+1.28e+00

Pos logit contributions
 stranger+0.442
 keeper+0.419
 stall+0.415
 Heidi+0.389
 seller+0.382
Neg logit contributions
ug-0.662
idd-0.628
ooming-0.603
irled-0.589
zz-0.583
Token saw
Token ID2497
Feature activation+2.12e+00

Pos logit contributions
 stranger+1.441
 keeper+1.387
 stall+1.338
 truth+1.307
 Heidi+1.278
Neg logit contributions
ug-1.836
idd-1.734
ged-1.672
ooming-1.671
zz-1.653
Token a
Token ID257
Feature activation+5.87e-01

Pos logit contributions
 keeper+3.105
 stall+3.074
 stranger+3.001
 seller+2.831
Stan+2.815
Neg logit contributions
ug-4.515
idd-4.335
med-4.102
ged-4.028
irled-4.001
Token big
Token ID1263
Feature activation+2.33e-01

Pos logit contributions
 stranger+0.689
 keeper+0.667
 stall+0.648
 Heidi+0.628
 Stanley+0.612
Neg logit contributions
ug-1.379
idd-1.335
ooming-1.294
zz-1.252
irled-1.229
Token tree
Token ID5509
Feature activation+8.16e-01

Pos logit contributions
 stranger+0.294
 keeper+0.280
 stall+0.271
 Heidi+0.256
Stan+0.254
Neg logit contributions
ug-0.543
idd-0.518
ooming-0.504
zz-0.487
irled-0.483
Token and
Token ID290
Feature activation+7.28e-01

Pos logit contributions
 stranger+1.487
 keeper+1.383
 stall+1.367
 Heidi+1.350
 Stanley+1.345
Neg logit contributions
ug-1.486
idd-1.388
ooming-1.346
zz-1.283
ips-1.278
Token decided
Token ID3066
Feature activation+2.11e+00

Pos logit contributions
 stranger+1.127
 stall+1.096
 keeper+1.094
Stan+1.027
 seller+1.008
Neg logit contributions
ug-1.479
idd-1.402
irled-1.327
ged-1.322
ooming-1.322
Token to
Token ID284
Feature activation+1.34e+00

Pos logit contributions
 stall+3.459
 stranger+3.337
 butcher+3.258
Stan+3.181
 seller+3.175
Neg logit contributions
ug-4.029
ged-3.812
idd-3.781
ooming-3.740
irled-3.630
Token play
Token ID711
Feature activation+4.88e-01

Pos logit contributions
 stranger+2.082
 keeper+2.008
 stall+1.955
Stan+1.869
 ranger+1.847
Neg logit contributions
ug-2.755
idd-2.529
zz-2.518
ooming-2.437
irled-2.391
Token hide
Token ID7808
Feature activation+1.45e+00

Pos logit contributions
 stranger+0.856
 keeper+0.818
 stall+0.807
Stan+0.766
 Heidi+0.765
Neg logit contributions
ug-0.930
idd-0.867
ooming-0.855
irled-0.822
ips-0.813
Token and
Token ID290
Feature activation+8.96e-01

Pos logit contributions
 stranger+2.607
 keeper+2.498
 stall+2.469
 Heidi+2.458
 Stanley+2.429
Neg logit contributions
ug-2.780
idd-2.501
ooming-2.441
ged-2.365
zz-2.357
Token seek
Token ID5380
Feature activation+9.45e-01

Pos logit contributions
 stranger+1.499
 stall+1.425
 keeper+1.406
Stan+1.364
 Heidi+1.343
Neg logit contributions
ug-1.679
idd-1.590
ooming-1.531
zz-1.503
irled-1.500
TokenAnna
Token ID31160
Feature activation+1.04e+00

Pos logit contributions
 stranger+0.887
 keeper+0.838
 stall+0.834
 seller+0.777
 truth+0.774
Neg logit contributions
ug-1.270
idd-1.205
ooming-1.155
zz-1.116
irled-1.112
Token and
Token ID290
Feature activation+2.46e-01

Pos logit contributions
 stranger+1.764
 stall+1.664
 keeper+1.659
 Heidi+1.574
Stan+1.573
Neg logit contributions
ug-1.963
idd-1.868
ooming-1.846
irled-1.814
zz-1.801
Token Ben
Token ID3932
Feature activation+2.18e+00

Pos logit contributions
 stranger+0.402
 keeper+0.386
 stall+0.381
 ranger+0.355
Stan+0.354
Neg logit contributions
ug-0.486
idd-0.455
ooming-0.445
zz-0.430
ips-0.428
Token nod
Token ID18666
Feature activation+6.05e-01

Pos logit contributions
 stranger+3.324
 keeper+3.218
 stall+3.198
 butcher+3.098
 ranger+3.012
Neg logit contributions
ug-4.403
ooming-4.274
zz-4.222
ged-4.102
irled-4.074
Token.
Token ID13
Feature activation+3.91e-01

Pos logit contributions
 stranger+0.955
 stall+0.910
 keeper+0.902
 Heidi+0.857
Stan+0.850
Neg logit contributions
ug-1.238
ooming-1.147
idd-1.143
med-1.112
ged-1.108
Token They
Token ID1119
Feature activation+2.24e+00

Pos logit contributions
 stranger+0.640
 keeper+0.605
 stall+0.591
 seller+0.559
Stan+0.556
Neg logit contributions
ug-0.786
idd-0.736
ooming-0.708
irled-0.689
ips-0.689
Token want
Token ID765
Feature activation+1.48e+00

Pos logit contributions
 stranger+3.531
 keeper+3.462
 stall+3.390
 butcher+3.183
 Heidi+3.170
Neg logit contributions
ug-4.435
zz-4.240
ooming-4.093
idd-4.004
irled-3.968
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
 stranger+2.085
 stall+2.072
 keeper+2.016
Stan+1.941
 Heidi+1.929
Neg logit contributions
ug-3.108
idd-2.901
med-2.810
ooming-2.802
zz-2.767
Token be
Token ID307
Feature activation+4.47e-01

Pos logit contributions
 stranger+1.795
 stall+1.709
 keeper+1.683
 butcher+1.625
Stan+1.616
Neg logit contributions
ug-2.178
idd-2.005
zz-1.985
ooming-1.978
irled-1.904
Token good
Token ID922
Feature activation-9.27e-01

Pos logit contributions
 stranger+0.674
 keeper+0.638
 stall+0.631
 Heidi+0.591
Stan+0.591
Neg logit contributions
ug-0.953
idd-0.901
ooming-0.870
irled-0.845
med-0.843
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
ug+2.106
idd+2.006
ooming+1.936
irled+1.919
med+1.888
Neg logit contributions
 stranger-1.319
 keeper-1.238
 stall-1.197
 ranger-1.126
 seller-1.111
Token a
Token ID257
Feature activation+1.81e-01

Pos logit contributions
ug+1.976
idd+1.899
ooming+1.853
irled+1.795
zz+1.792
Neg logit contributions
 stranger-0.766
 keeper-0.703
 stall-0.682
 Heidi-0.633
 truth-0.618
Token long
Token ID890
Feature activation-1.25e+00

Pos logit contributions
 stranger+0.302
 keeper+0.291
 stall+0.288
 Heidi+0.273
 seller+0.271
Neg logit contributions
ug-0.349
idd-0.326
ooming-0.319
zz-0.305
ged-0.301
Token time
Token ID640
Feature activation-1.37e+00

Pos logit contributions
ug+2.421
idd+2.377
ooming+2.236
med+2.200
ips+2.190
Neg logit contributions
 stranger-2.215
 stall-2.078
 keeper-2.055
 bench-1.977
 Heidi-1.952
Token.
Token ID13
Feature activation+4.37e-01

Pos logit contributions
ug+3.366
idd+3.203
ooming+3.112
irled+3.106
zz+3.069
Neg logit contributions
 stranger-1.628
 keeper-1.547
 stall-1.527
 ranger-1.382
 bench-1.365
Token Lily
Token ID20037
Feature activation+2.55e-01

Pos logit contributions
 stranger+0.672
 keeper+0.629
 stall+0.626
 seller+0.586
 camel+0.575
Neg logit contributions
ug-0.916
idd-0.871
ooming-0.839
med-0.817
irled-0.814
Token learned
Token ID4499
Feature activation+2.48e+00

Pos logit contributions
 stranger+0.368
 stall+0.344
 keeper+0.344
 Heidi+0.321
Stan+0.316
Neg logit contributions
ug-0.559
idd-0.529
ooming-0.516
irled-0.500
zz-0.498
Token that
Token ID326
Feature activation-1.32e-01

Pos logit contributions
 stall+4.181
 keeper+3.902
 stranger+3.849
 seller+3.799
 camel+3.745
Neg logit contributions
ug-5.002
ged-4.450
med-4.448
irled-4.445
ooming-4.418
Token she
Token ID673
Feature activation-6.23e-01

Pos logit contributions
ug+0.276
idd+0.260
ooming+0.253
irled+0.243
zz+0.243
Neg logit contributions
 stranger-0.204
 keeper-0.194
 stall-0.191
 Heidi-0.179
 ranger-0.178
Token should
Token ID815
Feature activation+7.57e-01

Pos logit contributions
ug+1.583
idd+1.509
ooming+1.486
ips+1.428
irled+1.427
Neg logit contributions
 stranger-0.686
 keeper-0.639
 stall-0.608
 ranger-0.564
 seller-0.561
Token listen
Token ID6004
Feature activation+5.26e-01

Pos logit contributions
 stranger+1.168
 stall+1.115
 keeper+1.105
 Heidi+1.017
 camel+1.008
Neg logit contributions
ug-1.583
idd-1.462
ooming-1.421
zz-1.409
ips-1.381
Token to
Token ID284
Feature activation-1.75e-01

Pos logit contributions
 stranger+0.690
 stall+0.651
 keeper+0.650
 Heidi+0.601
 seller+0.596
Neg logit contributions
ug-1.223
idd-1.141
ooming-1.124
med-1.102
irled-1.093
Token their
Token ID511
Feature activation-1.33e+00

Pos logit contributions
 stranger+0.395
 keeper+0.375
 stall+0.368
 ranger+0.345
 Heidi+0.345
Neg logit contributions
ug-0.534
idd-0.502
ooming-0.487
med-0.472
ips-0.472
Token hands
Token ID2832
Feature activation+1.21e+00

Pos logit contributions
ug+2.872
idd+2.726
ooming+2.622
med+2.589
irled+2.554
Neg logit contributions
 stranger-2.043
 keeper-1.961
 stall-1.952
 bench-1.804
 ranger-1.789
Token.
Token ID13
Feature activation+9.96e-01

Pos logit contributions
 stranger+1.861
 keeper+1.718
 stall+1.693
 Heidi+1.667
Stan+1.643
Neg logit contributions
ug-2.598
idd-2.392
ooming-2.355
med-2.320
ged-2.294
Token They
Token ID1119
Feature activation+5.11e-01

Pos logit contributions
 stranger+1.690
 keeper+1.613
 stall+1.543
 seller+1.488
 ranger+1.457
Neg logit contributions
ug-1.931
idd-1.832
ips-1.725
ooming-1.724
irled-1.700
Token show
Token ID905
Feature activation-3.17e-01

Pos logit contributions
 stranger+0.734
 keeper+0.697
 stall+0.682
 Heidi+0.652
 ranger+0.633
Neg logit contributions
ug-1.109
idd-1.043
ooming-1.025
zz-1.000
ips-0.985
Token them
Token ID606
Feature activation+1.45e+00

Pos logit contributions
ug+0.615
idd+0.580
ooming+0.563
irled+0.538
zz+0.537
Neg logit contributions
 stranger-0.544
 keeper-0.518
 stall-0.510
 Heidi-0.482
 ranger-0.478
Token how
Token ID703
Feature activation+2.78e-01

Pos logit contributions
 stranger+2.362
 keeper+2.263
 stall+2.255
 Heidi+2.165
Stan+2.161
Neg logit contributions
ug-2.953
ooming-2.668
med-2.661
idd-2.654
irled-2.622
Token to
Token ID284
Feature activation-1.05e+00

Pos logit contributions
 stranger+0.360
 keeper+0.338
 stall+0.338
 Heidi+0.313
 seller+0.311
Neg logit contributions
ug-0.654
idd-0.618
ooming-0.601
irled-0.587
zz-0.582
Token watch
Token ID2342
Feature activation-9.67e-01

Pos logit contributions
ug+2.196
idd+2.114
ooming+2.012
med+1.970
irled+1.946
Neg logit contributions
 stranger-1.623
 keeper-1.555
 stall-1.548
 bench-1.463
 seller-1.445
Token the
Token ID262
Feature activation-4.64e-01

Pos logit contributions
ug+2.254
idd+2.195
ooming+2.111
irled+2.048
zz+2.033
Neg logit contributions
 stranger-1.248
 keeper-1.175
 stall-1.158
 bench-1.065
 truth-1.062
Token waves
Token ID9813
Feature activation-1.68e-01

Pos logit contributions
ug+0.920
idd+0.866
ooming+0.839
irled+0.809
med+0.807
Neg logit contributions
 stranger-0.779
 keeper-0.741
 stall-0.731
 ranger-0.683
 seller-0.681
Token was
Token ID373
Feature activation-8.79e-01

Pos logit contributions
 stranger+0.724
 keeper+0.678
 stall+0.672
 Heidi+0.643
Stan+0.628
Neg logit contributions
ug-1.066
idd-1.012
ooming-0.966
irled-0.955
zz-0.946
Token sm
Token ID895
Feature activation-1.09e+00

Pos logit contributions
ug+2.207
idd+2.111
ooming+2.099
med+2.012
irled+2.010
Neg logit contributions
 stranger-0.993
 keeper-0.926
 stall-0.923
 seller-0.827
 ranger-0.826
Tokenelly
Token ID6148
Feature activation+1.06e+00

Pos logit contributions
ug+2.119
idd+1.998
ooming+1.993
med+1.923
irled+1.921
Neg logit contributions
 stranger-1.832
 keeper-1.706
 stall-1.673
 ranger-1.598
Stan-1.577
Token.
Token ID13
Feature activation+4.32e-01

Pos logit contributions
 stranger+1.683
 keeper+1.616
 stall+1.593
 Heidi+1.565
Stan+1.552
Neg logit contributions
ug-2.162
idd-1.993
ooming-1.911
zz-1.830
med-1.824
Token He
Token ID679
Feature activation+7.53e-01

Pos logit contributions
 stranger+0.582
 keeper+0.542
 stall+0.527
 seller+0.490
Stan+0.489
Neg logit contributions
ug-1.005
idd-0.946
ooming-0.906
med-0.887
ips-0.887
Token lived
Token ID5615
Feature activation+2.40e-01

Pos logit contributions
 stranger+1.149
 keeper+1.096
 stall+1.084
 Heidi+1.035
Stan+1.014
Neg logit contributions
ug-1.587
idd-1.492
irled-1.424
ooming-1.420
zz-1.408
Token with
Token ID351
Feature activation+1.19e-01

Pos logit contributions
 stranger+0.461
 keeper+0.442
 stall+0.435
 Heidi+0.419
 ranger+0.411
Neg logit contributions
ug-0.414
idd-0.387
ooming-0.373
irled-0.357
ips-0.354
Token his
Token ID465
Feature activation+1.09e+00

Pos logit contributions
 stranger+0.151
 keeper+0.142
 stall+0.139
 Heidi+0.129
 ranger+0.127
Neg logit contributions
ug-0.281
idd-0.266
ooming-0.258
med-0.252
zz-0.251
Token farmer
Token ID18739
Feature activation+6.42e-01

Pos logit contributions
 stranger+1.691
 keeper+1.605
 Heidi+1.549
 stall+1.532
 ranger+1.476
Neg logit contributions
ug-2.232
idd-2.112
ooming-2.014
zz-1.966
ged-1.925
Token who
Token ID508
Feature activation+9.45e-01

Pos logit contributions
 stranger+1.075
 keeper+1.007
 stall+0.987
 Heidi+0.957
Stan+0.938
Neg logit contributions
ug-1.280
idd-1.189
ooming-1.150
med-1.114
zz-1.095
Token looked
Token ID3114
Feature activation+8.97e-01

Pos logit contributions
 stranger+1.406
 keeper+1.344
 stall+1.316
 Heidi+1.264
Stan+1.240
Neg logit contributions
ug-2.009
idd-1.913
ooming-1.809
irled-1.791
zz-1.782
Token heard
Token ID2982
Feature activation+1.59e+00

Pos logit contributions
 stranger+2.065
 keeper+1.991
 stall+1.945
 Heidi+1.907
 Stanley+1.823
Neg logit contributions
ug-2.837
idd-2.708
ooming-2.689
ged-2.641
zz-2.639
Token a
Token ID257
Feature activation+1.13e+00

Pos logit contributions
 stranger+2.181
 stall+2.152
 keeper+2.130
 seller+2.036
Stan+2.013
Neg logit contributions
ug-3.547
idd-3.221
med-3.219
ooming-3.146
irled-3.082
Token loud
Token ID7812
Feature activation-1.52e-01

Pos logit contributions
 stranger+1.559
 stall+1.531
 Heidi+1.503
 keeper+1.492
 Stanley+1.477
Neg logit contributions
ug-2.404
ooming-2.235
idd-2.222
zz-2.128
med-2.118
Token noise
Token ID7838
Feature activation+2.35e-01

Pos logit contributions
ug+0.338
idd+0.320
ooming+0.312
irled+0.301
zz+0.300
Neg logit contributions
 stranger-0.215
 keeper-0.203
 stall-0.200
 Heidi-0.186
 ranger-0.184
Token.
Token ID13
Feature activation+3.50e-01

Pos logit contributions
 stranger+0.317
 keeper+0.300
 stall+0.299
 Heidi+0.279
 seller+0.276
Neg logit contributions
ug-0.533
idd-0.500
ooming-0.492
med-0.479
zz-0.474
Token It
Token ID632
Feature activation+1.07e+00

Pos logit contributions
 stranger+0.549
 keeper+0.521
 stall+0.507
 seller+0.476
 Heidi+0.474
Neg logit contributions
ug-0.725
idd-0.687
ooming-0.660
irled-0.644
med-0.639
Token was
Token ID373
Feature activation+9.69e-01

Pos logit contributions
 stranger+1.656
 stall+1.632
 keeper+1.539
 butcher+1.510
 Heidi+1.502
Neg logit contributions
ug-2.190
idd-2.058
med-2.023
irled-2.001
zz-1.989
Token a
Token ID257
Feature activation+5.04e-01

Pos logit contributions
 stranger+1.429
 stall+1.361
 keeper+1.361
 seller+1.292
holder+1.233
Neg logit contributions
ug-2.079
idd-1.939
med-1.860
ooming-1.859
ips-1.822
Token plane
Token ID6614
Feature activation-2.74e-01

Pos logit contributions
 stranger+0.633
 keeper+0.607
 stall+0.606
 Heidi+0.577
 Stanley+0.562
Neg logit contributions
ug-1.148
idd-1.099
ooming-1.068
zz-1.034
med-1.016
Token flying
Token ID7348
Feature activation+5.11e-01

Pos logit contributions
ug+0.541
idd+0.510
ooming+0.494
irled+0.475
zz+0.473
Neg logit contributions
 stranger-0.456
 keeper-0.435
 stall-0.429
 Heidi-0.402
 ranger-0.402
Token over
Token ID625
Feature activation-4.46e-01

Pos logit contributions
 stranger+0.843
 stall+0.800
 keeper+0.794
 Heidi+0.768
 butcher+0.763
Neg logit contributions
ug-1.030
idd-0.939
ooming-0.920
zz-0.896
med-0.896
Token dull
Token ID19222
Feature activation-2.94e-01

Pos logit contributions
 stranger+0.537
 stall+0.510
 keeper+0.500
Stan+0.472
 Heidi+0.470
Neg logit contributions
ug-0.812
idd-0.768
ooming-0.737
med-0.723
irled-0.720
Token.
Token ID13
Feature activation-1.85e-01

Pos logit contributions
ug+0.664
idd+0.630
ooming+0.614
irled+0.592
zz+0.591
Neg logit contributions
 stranger-0.407
 keeper-0.384
 stall-0.377
 Heidi-0.348
 ranger-0.348
Token Now
Token ID2735
Feature activation+4.40e-01

Pos logit contributions
ug+0.420
idd+0.398
ooming+0.388
irled+0.374
med+0.374
Neg logit contributions
 stranger-0.252
 keeper-0.238
 stall-0.234
 Heidi-0.217
 ranger-0.215
Token,
Token ID11
Feature activation+7.45e-02

Pos logit contributions
 stranger+0.695
 keeper+0.657
 stall+0.655
Stan+0.606
 seller+0.605
Neg logit contributions
ug-0.905
idd-0.839
ooming-0.822
med-0.797
zz-0.794
Token she
Token ID673
Feature activation-1.31e-01

Pos logit contributions
 stranger+0.110
 keeper+0.103
 stall+0.103
 Heidi+0.095
 ranger+0.094
Neg logit contributions
ug-0.161
idd-0.152
ooming-0.148
med-0.143
zz-0.142
Token had
Token ID550
Feature activation+3.68e-01

Pos logit contributions
ug+0.299
idd+0.284
ooming+0.276
irled+0.267
zz+0.267
Neg logit contributions
 stranger-0.179
 keeper-0.168
 stall-0.166
 Heidi-0.153
 ranger-0.152
Token all
Token ID477
Feature activation-8.19e-02

Pos logit contributions
 stranger+0.471
 keeper+0.442
 stall+0.440
 seller+0.401
 ranger+0.397
Neg logit contributions
ug-0.860
idd-0.822
ooming-0.796
med-0.780
irled-0.778
Token kinds
Token ID6982
Feature activation-7.80e-01

Pos logit contributions
ug+0.180
idd+0.170
ooming+0.165
irled+0.160
med+0.160
Neg logit contributions
 stranger-0.118
 keeper-0.112
 stall-0.110
 Heidi-0.102
 ranger-0.102
Token of
Token ID286
Feature activation-5.65e-01

Pos logit contributions
ug+1.392
idd+1.308
ooming+1.277
zz+1.230
irled+1.217
Neg logit contributions
 stranger-1.459
 keeper-1.394
 stall-1.357
 ranger-1.300
 seller-1.287
Token friends
Token ID2460
Feature activation+1.21e-01

Pos logit contributions
ug+1.296
idd+1.233
ooming+1.208
zz+1.163
med+1.160
Neg logit contributions
 stranger-0.775
 keeper-0.719
 stall-0.710
 Heidi-0.657
 truth-0.654
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
 stranger+0.180
 keeper+0.170
 stall+0.168
 Heidi+0.158
Stan+0.156
Neg logit contributions
ug-0.257
idd-0.243
ooming-0.237
med-0.229
irled-0.228
TokenSuddenly
Token ID38582
Feature activation+2.22e-01

Pos logit contributions
 stranger+0.751
 keeper+0.711
 stall+0.705
 seller+0.651
 ranger+0.649
Neg logit contributions
ug-1.208
idd-1.142
ooming-1.113
med-1.085
irled-1.083
Token,
Token ID11
Feature activation+1.41e-01

Pos logit contributions
 stranger+0.376
 keeper+0.361
 stall+0.358
 seller+0.334
 ranger+0.334
Neg logit contributions
ug-0.430
idd-0.400
ooming-0.391
med-0.376
zz-0.375
Token she
Token ID673
Feature activation-7.29e-02

Pos logit contributions
 stranger+0.171
 keeper+0.162
 stall+0.160
 seller+0.145
 ranger+0.145
Neg logit contributions
ug-0.338
idd-0.321
ooming-0.314
med-0.304
zz-0.304
Token heard
Token ID2982
Feature activation+1.42e+00

Pos logit contributions
ug+0.172
idd+0.164
ooming+0.160
irled+0.155
med+0.155
Neg logit contributions
 stranger-0.093
 keeper-0.087
 stall-0.086
 Heidi-0.079
 ranger-0.078
Token a
Token ID257
Feature activation+7.88e-01

Pos logit contributions
 stall+1.810
 stranger+1.810
 keeper+1.738
 seller+1.682
Stan+1.653
Neg logit contributions
ug-3.303
med-3.076
idd-3.031
ooming-2.968
irled-2.937
Token voice
Token ID3809
Feature activation+1.07e+00

Pos logit contributions
 stranger+1.016
 stall+0.979
 keeper+0.952
 Heidi+0.940
 Stanley+0.919
Neg logit contributions
ug-1.783
idd-1.672
ooming-1.649
zz-1.592
med-1.586
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
 stranger+1.552
 stall+1.504
 keeper+1.460
Stan+1.429
 butcher+1.425
Neg logit contributions
ug-2.304
idd-2.122
ooming-2.104
med-2.080
ged-2.057
Token It
Token ID632
Feature activation+1.14e+00

Pos logit contributions
 stranger+2.515
 stall+2.355
 keeper+2.334
 seller+2.256
 butcher+2.230
Neg logit contributions
ug-3.161
idd-2.941
med-2.826
ged-2.769
zz-2.732
Token was
Token ID373
Feature activation+1.03e+00

Pos logit contributions
 stranger+1.787
 stall+1.721
 butcher+1.638
 keeper+1.632
Stan+1.588
Neg logit contributions
ug-2.299
idd-2.191
med-2.136
ged-2.083
irled-2.082
Token coming
Token ID2406
Feature activation+1.33e+00

Pos logit contributions
 stranger+1.399
 stall+1.325
 keeper+1.301
 seller+1.244
 butcher+1.221
Neg logit contributions
ug-2.308
idd-2.197
med-2.098
ooming-2.069
irled-2.028
Token from
Token ID422
Feature activation+7.71e-01

Pos logit contributions
 stranger+1.648
 stall+1.613
 butcher+1.502
 seller+1.476
 Heidi+1.470
Neg logit contributions
ug-3.190
idd-2.953
zz-2.883
ooming-2.855
med-2.820
Token She
Token ID1375
Feature activation-6.82e-01

Pos logit contributions
 stranger+0.215
 keeper+0.203
 stall+0.201
Stan+0.186
 Heidi+0.186
Neg logit contributions
ug-0.274
idd-0.259
ooming-0.248
zz-0.240
med-0.240
Token should
Token ID815
Feature activation+4.27e-01

Pos logit contributions
ug+1.578
idd+1.477
ooming+1.459
ips+1.403
med+1.401
Neg logit contributions
 stranger-0.916
 keeper-0.867
 stall-0.834
 seller-0.789
 ranger-0.785
Token not
Token ID407
Feature activation-8.39e-01

Pos logit contributions
 stranger+0.560
 stall+0.529
 keeper+0.525
Stan+0.475
 Heidi+0.475
Neg logit contributions
ug-0.978
idd-0.931
ooming-0.904
zz-0.881
irled-0.879
Token get
Token ID651
Feature activation-1.01e+00

Pos logit contributions
ug+1.892
idd+1.800
ooming+1.760
med+1.703
irled+1.699
Neg logit contributions
 stranger-1.192
 keeper-1.125
 stall-1.072
 ranger-1.029
 truth-1.021
Token a
Token ID257
Feature activation+1.60e-01

Pos logit contributions
ug+2.347
idd+2.208
ooming+2.175
irled+2.096
zz+2.083
Neg logit contributions
 stranger-1.411
 keeper-1.310
 stall-1.250
 ranger-1.186
 Heidi-1.184
Token cookie
Token ID19751
Feature activation+9.01e-03

Pos logit contributions
 stranger+0.229
 keeper+0.219
 stall+0.215
 Heidi+0.202
 ranger+0.200
Neg logit contributions
ug-0.343
idd-0.327
ooming-0.319
zz-0.307
irled-0.304
Token.
Token ID13
Feature activation+5.86e-01

Pos logit contributions
 stranger+0.012
 keeper+0.012
 stall+0.012
 Heidi+0.011
Stan+0.011
Neg logit contributions
ug-0.020
idd-0.019
ooming-0.019
zz-0.018
med-0.018
Token She
Token ID1375
Feature activation-5.99e-01

Pos logit contributions
 stranger+0.848
 keeper+0.793
 stall+0.790
Stan+0.720
 seller+0.719
Neg logit contributions
ug-1.285
idd-1.228
ooming-1.160
zz-1.143
ips-1.141
Token should
Token ID815
Feature activation+4.81e-01

Pos logit contributions
ug+1.418
idd+1.333
ooming+1.316
ips+1.265
med+1.265
Neg logit contributions
 stranger-0.773
 keeper-0.728
 stall-0.703
 seller-0.661
 ranger-0.656
Token get
Token ID651
Feature activation-6.24e-01

Pos logit contributions
 stranger+0.670
 stall+0.635
 keeper+0.632
Stan+0.573
 Heidi+0.571
Neg logit contributions
ug-1.060
idd-1.006
ooming-0.973
zz-0.952
irled-0.948
Token a
Token ID257
Feature activation+4.14e-02

Pos logit contributions
ug+1.476
idd+1.393
ooming+1.371
irled+1.315
zz+1.314
Neg logit contributions
 stranger-0.819
 keeper-0.765
 stall-0.739
 Heidi-0.692
 seller-0.689
Token outside
Token ID2354
Feature activation-6.68e-01

Pos logit contributions
ug+0.065
idd+0.061
ooming+0.059
med+0.057
irled+0.056
Neg logit contributions
 stranger-0.057
 keeper-0.055
 stall-0.054
 Heidi-0.051
 ranger-0.051
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
ug+1.498
idd+1.427
ooming+1.382
irled+1.348
zz+1.345
Neg logit contributions
 stranger-0.941
 keeper-0.890
 stall-0.871
 ranger-0.807
 Heidi-0.798
Token They
Token ID1119
Feature activation-1.72e-01

Pos logit contributions
ug+0.337
idd+0.319
ooming+0.310
irled+0.298
med+0.298
Neg logit contributions
 stranger-0.234
 keeper-0.222
 stall-0.218
 Heidi-0.204
 ranger-0.202
Token asked
Token ID1965
Feature activation+1.78e-03

Pos logit contributions
ug+0.341
idd+0.321
ooming+0.311
irled+0.299
med+0.298
Neg logit contributions
 stranger-0.285
 keeper-0.271
 stall-0.268
 Heidi-0.252
 ranger-0.250
Token her
Token ID607
Feature activation+1.18e+00

Pos logit contributions
 stranger+0.002
 keeper+0.002
 stall+0.002
 Heidi+0.002
 seller+0.002
Neg logit contributions
ug-0.004
idd-0.004
ooming-0.004
irled-0.004
med-0.004
Token to
Token ID284
Feature activation+5.53e-02

Pos logit contributions
 stranger+1.705
 stall+1.651
 keeper+1.624
Stan+1.574
 Heidi+1.546
Neg logit contributions
ug-2.515
idd-2.351
ooming-2.306
zz-2.229
med-2.218
Token join
Token ID4654
Feature activation-1.43e+00

Pos logit contributions
 stranger+0.078
 keeper+0.073
 stall+0.072
 Heidi+0.067
 ranger+0.067
Neg logit contributions
ug-0.123
idd-0.116
ooming-0.113
zz-0.110
irled-0.109
Token them
Token ID606
Feature activation-5.06e-01

Pos logit contributions
ug+3.202
idd+3.102
ooming+2.988
zz+2.878
irled+2.865
Neg logit contributions
 stranger-2.017
 stall-1.846
 keeper-1.836
 Heidi-1.700
 truth-1.692
Token,
Token ID11
Feature activation-6.55e-01

Pos logit contributions
ug+1.063
idd+1.007
ooming+0.972
irled+0.942
zz+0.936
Neg logit contributions
 stranger-0.781
 keeper-0.744
 stall-0.731
 ranger-0.683
 seller-0.679
Token but
Token ID475
Feature activation+5.73e-02

Pos logit contributions
ug+1.507
idd+1.431
ooming+1.394
irled+1.351
zz+1.343
Neg logit contributions
 stranger-0.899
 keeper-0.849
 stall-0.828
 ranger-0.773
 Heidi-0.770
Token Lily
Token ID20037
Feature activation+1.03e+00

Pos logit contributions
 stranger+0.097
 keeper+0.092
 stall+0.092
Stan+0.086
 Heidi+0.086
Neg logit contributions
ug-0.111
idd-0.104
ooming-0.101
irled-0.097
med-0.097
Token Hello
Token ID18435
Feature activation+3.66e-01

Pos logit contributions
ug+1.712
idd+1.597
ooming+1.585
irled+1.538
ips+1.513
Neg logit contributions
 stranger-1.143
 keeper-1.081
 stall-1.061
 Heidi-1.011
 ranger-0.994
Token,
Token ID11
Feature activation+5.33e-01

Pos logit contributions
 stranger+0.598
 keeper+0.568
 stall+0.565
 seller+0.531
 Heidi+0.528
Neg logit contributions
ug-0.731
idd-0.690
ooming-0.670
zz-0.647
irled-0.642
Token mom
Token ID1995
Feature activation-1.19e-01

Pos logit contributions
 stranger+0.770
 keeper+0.735
 stall+0.733
Stan+0.680
 seller+0.677
Neg logit contributions
ug-1.148
idd-1.088
ooming-1.034
zz-1.015
med-1.008
Token!
Token ID0
Feature activation-6.78e-01

Pos logit contributions
ug+0.244
idd+0.231
ooming+0.224
irled+0.215
zz+0.215
Neg logit contributions
 stranger-0.188
 keeper-0.178
 stall-0.176
 Heidi-0.165
 ranger-0.164
Token I
Token ID314
Feature activation-1.32e+00

Pos logit contributions
ug+1.471
idd+1.376
ooming+1.366
irled+1.319
ips+1.302
Neg logit contributions
 stranger-1.002
 keeper-0.948
 stall-0.930
 Heidi-0.884
 ranger-0.872
Token'm
Token ID1101
Feature activation-5.86e-01

Pos logit contributions
ug+2.934
ooming+2.673
idd+2.660
ips+2.587
irled+2.548
Neg logit contributions
 stranger-1.958
 keeper-1.853
 stall-1.779
 ranger-1.707
 seller-1.690
Token so
Token ID523
Feature activation-8.82e-01

Pos logit contributions
ug+1.309
idd+1.227
ooming+1.218
irled+1.169
zz+1.167
Neg logit contributions
 stranger-0.837
 keeper-0.789
 stall-0.771
 Heidi-0.731
 ranger-0.717
Token glad
Token ID9675
Feature activation+8.19e-01

Pos logit contributions
ug+2.017
ooming+1.880
idd+1.868
ips+1.807
med+1.805
Neg logit contributions
 stranger-1.243
 keeper-1.167
 stall-1.129
 ranger-1.051
 Heidi-1.047
Token you
Token ID345
Feature activation+3.02e-01

Pos logit contributions
 stranger+1.371
 stall+1.330
 keeper+1.315
 seller+1.250
Stan+1.231
Neg logit contributions
ug-1.584
idd-1.475
ooming-1.413
med-1.371
zz-1.367
Token're
Token ID821
Feature activation+1.75e-01

Pos logit contributions
 stranger+0.461
 keeper+0.438
 stall+0.435
 Heidi+0.407
 ranger+0.402
Neg logit contributions
ug-0.634
idd-0.605
ooming-0.585
zz-0.571
med-0.569
Token here
Token ID994
Feature activation+5.38e-01

Pos logit contributions
 stranger+0.253
 keeper+0.238
 stall+0.238
 Heidi+0.219
 seller+0.219
Neg logit contributions
ug-0.381
idd-0.362
ooming-0.350
med-0.339
irled-0.337
Token so
Token ID523
Feature activation+2.32e-01

Pos logit contributions
 stranger+0.999
 keeper+0.923
 stall+0.922
Stan+0.850
 butcher+0.849
Neg logit contributions
ug-1.622
idd-1.547
ooming-1.466
med-1.464
irled-1.449
Token busy
Token ID8179
Feature activation-9.27e-01

Pos logit contributions
 stranger+0.291
 keeper+0.273
 stall+0.269
 Heidi+0.249
 seller+0.246
Neg logit contributions
ug-0.554
idd-0.528
ooming-0.507
irled-0.499
zz-0.497
Token!"
Token ID2474
Feature activation+2.03e-01

Pos logit contributions
ug+1.964
idd+1.853
ooming+1.815
irled+1.746
med+1.743
Neg logit contributions
 stranger-1.427
 keeper-1.353
 stall-1.330
 Heidi-1.245
 ranger-1.237
Token Ben
Token ID3932
Feature activation+1.96e-01

Pos logit contributions
 stranger+0.332
 keeper+0.318
 stall+0.312
 Heidi+0.292
Stan+0.292
Neg logit contributions
ug-0.406
idd-0.380
ooming-0.369
irled-0.356
med-0.355
Token said
Token ID531
Feature activation-4.48e-01

Pos logit contributions
 stranger+0.281
 keeper+0.264
 stall+0.261
 Heidi+0.244
Stan+0.241
Neg logit contributions
ug-0.431
idd-0.408
ooming-0.395
zz-0.385
med-0.384
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
ug+1.078
idd+1.030
ooming+1.000
zz+0.969
irled+0.969
Neg logit contributions
 stranger-0.557
 keeper-0.526
 stall-0.510
 Heidi-0.471
 ranger-0.470
Token\n
Token ID198
Feature activation-2.42e-01

Pos logit contributions
ug+0.503
idd+0.477
ooming+0.466
irled+0.451
zz+0.450
Neg logit contributions
 stranger-0.268
 keeper-0.251
 stall-0.247
 Heidi-0.227
 ranger-0.225
Token\n
Token ID198
Feature activation+6.73e-01

Pos logit contributions
ug+0.566
idd+0.537
ooming+0.524
irled+0.506
med+0.506
Neg logit contributions
 stranger-0.316
 keeper-0.297
 stall-0.291
 Heidi-0.269
 ranger-0.267
Token"
Token ID1
Feature activation-4.07e-01

Pos logit contributions
 stranger+0.927
 keeper+0.853
 stall+0.843
 Heidi+0.802
 truth+0.792
Neg logit contributions
ug-1.548
idd-1.466
ooming-1.429
irled-1.382
zz-1.374
TokenThey
Token ID2990
Feature activation+4.91e-01

Pos logit contributions
ug+0.972
idd+0.920
ooming+0.902
irled+0.869
zz+0.866
Neg logit contributions
 stranger-0.521
 keeper-0.489
 stall-0.478
 ranger-0.440
 seller-0.439
Token are
Token ID389
Feature activation+9.85e-02

Pos logit contributions
 stranger+0.716
 keeper+0.677
 stall+0.670
 Heidi+0.630
 ranger+0.625
Neg logit contributions
ug-1.064
idd-1.006
ooming-0.983
zz-0.962
irled-0.950
Token,
Token ID11
Feature activation+2.94e-01

Pos logit contributions
 stranger+0.447
 keeper+0.422
 stall+0.416
 Heidi+0.395
 ranger+0.393
Neg logit contributions
ug-0.529
idd-0.494
ooming-0.487
med-0.468
zz-0.464
Token taking
Token ID2263
Feature activation-5.45e-01

Pos logit contributions
 stranger+0.471
 keeper+0.448
 stall+0.441
 Heidi+0.415
Stan+0.414
Neg logit contributions
ug-0.590
idd-0.555
ooming-0.542
med-0.527
irled-0.524
Token in
Token ID287
Feature activation+5.16e-01

Pos logit contributions
ug+1.193
idd+1.135
ooming+1.105
irled+1.058
ips+1.055
Neg logit contributions
 stranger-0.799
 keeper-0.750
 stall-0.742
 Heidi-0.691
 seller-0.681
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
 stranger+0.752
 keeper+0.708
 stall+0.689
 seller+0.658
Stan+0.651
Neg logit contributions
ug-1.106
idd-1.039
ooming-1.019
med-0.993
ips-0.989
Token bright
Token ID6016
Feature activation-5.77e-01

Pos logit contributions
 stranger+1.916
 keeper+1.846
 stall+1.784
Stan+1.766
 Heidi+1.733
Neg logit contributions
ug-2.505
idd-2.368
ooming-2.331
irled-2.282
ips-2.264
Token colours
Token ID18915
Feature activation-1.83e-01

Pos logit contributions
ug+1.157
idd+1.096
ooming+1.059
zz+1.012
irled+1.008
Neg logit contributions
 stranger-0.948
 keeper-0.898
 stall-0.896
 Heidi-0.830
 seller-0.826
Token and
Token ID290
Feature activation-1.71e-01

Pos logit contributions
ug+0.377
idd+0.356
ooming+0.346
irled+0.333
med+0.332
Neg logit contributions
 stranger-0.285
 keeper-0.270
 stall-0.267
 Heidi-0.250
 ranger-0.248
Token sunshine
Token ID34488
Feature activation+2.83e-01

Pos logit contributions
ug+0.353
idd+0.333
ooming+0.324
irled+0.312
med+0.311
Neg logit contributions
 stranger-0.268
 keeper-0.254
 stall-0.251
 Heidi-0.235
 ranger-0.234
Token.
Token ID13
Feature activation-8.86e-01

Pos logit contributions
 stranger+0.401
 keeper+0.373
 stall+0.370
 Heidi+0.346
Stan+0.345
Neg logit contributions
ug-0.622
idd-0.586
ooming-0.574
med-0.560
zz-0.555
Token\n
Token ID198
Feature activation-4.99e-01

Pos logit contributions
ug+2.153
idd+2.030
ooming+2.012
zz+1.935
irled+1.929
Neg logit contributions
 stranger-1.082
 keeper-1.028
 stall-1.020
 Heidi-0.962
 Stanley-0.928
Token\n
Token ID198
Feature activation+2.39e-01

Pos logit contributions
ug+1.191
idd+1.132
ooming+1.108
zz+1.069
irled+1.066
Neg logit contributions
 stranger-0.624
 keeper-0.586
 stall-0.576
 Heidi-0.533
Stan-0.526
Token drawings
Token ID23388
Feature activation-3.16e-01

Pos logit contributions
ug+0.233
idd+0.217
ooming+0.210
irled+0.200
med+0.199
Neg logit contributions
 stranger-0.268
 keeper-0.256
 stall-0.254
 Heidi-0.241
 ranger-0.240
Token.
Token ID13
Feature activation-5.40e-01

Pos logit contributions
ug+0.680
idd+0.644
ooming+0.624
irled+0.602
zz+0.601
Neg logit contributions
 stranger-0.469
 keeper-0.444
 stall-0.436
 ranger-0.406
 Heidi-0.406
Token\n
Token ID198
Feature activation-3.40e-01

Pos logit contributions
ug+1.340
idd+1.276
ooming+1.249
irled+1.204
zz+1.204
Neg logit contributions
 stranger-0.635
 keeper-0.597
 stall-0.582
 Heidi-0.540
 ranger-0.537
Token\n
Token ID198
Feature activation+2.06e-01

Pos logit contributions
ug+0.799
idd+0.761
ooming+0.742
irled+0.716
zz+0.715
Neg logit contributions
 stranger-0.435
 keeper-0.408
 stall-0.402
 Heidi-0.370
 ranger-0.367
TokenA
Token ID32
Feature activation-7.90e-01

Pos logit contributions
 stranger+0.294
 keeper+0.279
 stall+0.276
 Heidi+0.256
 seller+0.255
Neg logit contributions
ug-0.456
idd-0.429
ooming-0.420
med-0.408
irled-0.407
Token few
Token ID1178
Feature activation-2.36e+00

Pos logit contributions
ug+1.948
idd+1.853
ooming+1.815
ips+1.774
irled+1.769
Neg logit contributions
 stranger-0.997
 keeper-0.932
 stall-0.882
 ranger-0.841
 seller-0.813
Token days
Token ID1528
Feature activation-1.61e+00

Pos logit contributions
ug+4.223
idd+4.033
ooming+3.785
zz+3.738
ips+3.640
Neg logit contributions
 stranger-4.884
 keeper-4.348
 stall-4.282
 seller-4.090
 ranger-4.072
Token later
Token ID1568
Feature activation-8.20e-01

Pos logit contributions
ug+3.221
idd+3.025
ooming+2.946
irled+2.858
zz+2.854
Neg logit contributions
 stranger-2.686
 keeper-2.530
 stall-2.503
 ranger-2.378
 seller-2.376
Token,
Token ID11
Feature activation-8.63e-01

Pos logit contributions
ug+1.753
idd+1.657
ooming+1.605
irled+1.557
zz+1.556
Neg logit contributions
 stranger-1.272
 keeper-1.197
 stall-1.160
 ranger-1.105
 Heidi-1.101
Token Lily
Token ID20037
Feature activation+3.87e-02

Pos logit contributions
ug+1.711
idd+1.607
ooming+1.546
zz+1.492
irled+1.487
Neg logit contributions
 stranger-1.495
 keeper-1.409
 stall-1.380
 Heidi-1.333
 ranger-1.321
Token and
Token ID290
Feature activation+2.01e-01

Pos logit contributions
 stranger+0.055
 keeper+0.051
 stall+0.051
 Heidi+0.047
 ranger+0.047
Neg logit contributions
ug-0.085
idd-0.081
ooming-0.079
irled-0.076
med-0.076
Token so
Token ID523
Feature activation-1.22e-01

Pos logit contributions
 stranger+2.397
 stall+2.318
 keeper+2.301
 Heidi+2.201
Stan+2.190
Neg logit contributions
ug-3.667
med-3.418
idd-3.404
ooming-3.402
zz-3.336
Token much
Token ID881
Feature activation-2.65e-01

Pos logit contributions
ug+0.281
idd+0.266
ooming+0.260
irled+0.251
med+0.251
Neg logit contributions
 stranger-0.163
 keeper-0.154
 stall-0.152
 Heidi-0.140
 ranger-0.139
Token joy
Token ID8716
Feature activation+2.68e-01

Pos logit contributions
ug+0.615
idd+0.584
ooming+0.570
irled+0.551
zz+0.549
Neg logit contributions
 stranger-0.349
 keeper-0.328
 stall-0.322
 Heidi-0.297
 ranger-0.296
Token.
Token ID13
Feature activation-8.16e-01

Pos logit contributions
 stranger+0.372
 keeper+0.350
 stall+0.348
 Heidi+0.323
 ranger+0.320
Neg logit contributions
ug-0.605
idd-0.568
ooming-0.558
med-0.543
zz-0.538
Token From
Token ID3574
Feature activation+5.60e-01

Pos logit contributions
ug+1.996
idd+1.880
ooming+1.865
zz+1.793
irled+1.785
Neg logit contributions
 stranger-0.995
 keeper-0.949
 stall-0.921
 Heidi-0.871
 ranger-0.851
Token then
Token ID788
Feature activation-2.30e+00

Pos logit contributions
 stranger+0.947
 stall+0.901
 keeper+0.900
 Heidi+0.844
 seller+0.844
Neg logit contributions
ug-1.094
idd-1.017
ooming-0.989
irled-0.966
med-0.958
Token on
Token ID319
Feature activation-7.37e-01

Pos logit contributions
idd+4.128
ug+4.033
ooming+3.741
anged+3.581
irled+3.575
Neg logit contributions
 stranger-4.233
 stall-3.992
 Heidi-3.849
Stan-3.805
 Stanley-3.796
Token,
Token ID11
Feature activation-1.48e+00

Pos logit contributions
ug+1.563
idd+1.499
ooming+1.433
zz+1.386
med+1.380
Neg logit contributions
 stranger-1.135
 keeper-1.076
 stall-1.060
 Heidi-1.008
 Stanley-0.983
Token the
Token ID262
Feature activation-6.03e-01

Pos logit contributions
ug+3.738
idd+3.606
ooming+3.479
irled+3.357
ged+3.357
Neg logit contributions
 stranger-1.747
 keeper-1.682
 stall-1.585
 Heidi-1.501
 seller-1.461
Token candle
Token ID26839
Feature activation-9.18e-01

Pos logit contributions
ug+1.307
idd+1.229
ooming+1.192
zz+1.151
ips+1.148
Neg logit contributions
 stranger-0.930
 keeper-0.875
 stall-0.863
 seller-0.805
 truth-0.799
Token was
Token ID373
Feature activation-6.55e-01

Pos logit contributions
ug+2.160
idd+2.063
ooming+2.002
ips+1.937
ged+1.922
Neg logit contributions
 stranger-1.189
 keeper-1.137
 stall-1.087
 seller-1.022
 ranger-1.009
Token both
Token ID1111
Feature activation-1.25e+00

Pos logit contributions
ug+1.232
idd+1.169
ooming+1.130
med+1.090
irled+1.088
Neg logit contributions
 stranger-0.841
 keeper-0.795
 stall-0.778
 ranger-0.726
 seller-0.724
Token cry
Token ID3960
Feature activation-1.09e+00

Pos logit contributions
ug+2.713
idd+2.610
ooming+2.491
med+2.417
ips+2.399
Neg logit contributions
 stranger-1.889
 keeper-1.782
 stall-1.729
 bench-1.674
 truth-1.625
Token.
Token ID13
Feature activation-3.71e-01

Pos logit contributions
ug+2.575
idd+2.515
ooming+2.374
irled+2.330
med+2.316
Neg logit contributions
 stranger-1.376
 keeper-1.273
 stall-1.224
 truth-1.171
 ranger-1.149
Token They
Token ID1119
Feature activation+5.22e-01

Pos logit contributions
ug+0.813
idd+0.768
ooming+0.750
med+0.719
irled+0.719
Neg logit contributions
 stranger-0.545
 keeper-0.518
 stall-0.509
 Heidi-0.476
 ranger-0.473
Token are
Token ID389
Feature activation-1.23e+00

Pos logit contributions
 stranger+0.790
 keeper+0.758
 stall+0.756
 Heidi+0.704
Stan+0.701
Neg logit contributions
ug-1.092
idd-1.026
ooming-1.007
zz-0.985
irled-0.979
Token not
Token ID407
Feature activation-2.28e+00

Pos logit contributions
ug+2.933
idd+2.782
ooming+2.753
zz+2.663
med+2.624
Neg logit contributions
 stranger-1.642
 keeper-1.520
 stall-1.427
 ranger-1.388
 truth-1.371
Token friends
Token ID2460
Feature activation+3.70e-01

Pos logit contributions
ug+5.850
idd+5.473
ooming+5.458
med+5.331
irled+5.256
Neg logit contributions
 stranger-2.764
 keeper-2.517
 ranger-2.285
 truth-2.282
 stall-2.233
Token anymore
Token ID7471
Feature activation-1.41e+00

Pos logit contributions
 stranger+0.595
 keeper+0.568
 stall+0.566
 Stanley+0.537
Stan+0.535
Neg logit contributions
ug-0.741
idd-0.698
ooming-0.685
ips-0.652
med-0.652
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
ug+3.792
idd+3.580
ooming+3.489
irled+3.416
zz+3.414
Neg logit contributions
 stranger-1.479
 keeper-1.396
 stall-1.298
 ranger-1.242
 truth-1.235
Token\n
Token ID198
Feature activation-2.60e-01

Pos logit contributions
ug+0.743
idd+0.702
ooming+0.686
irled+0.659
zz+0.658
Neg logit contributions
 stranger-0.486
 keeper-0.460
 stall-0.452
 Heidi-0.422
 ranger-0.420
Token\n
Token ID198
Feature activation-8.70e-02

Pos logit contributions
ug+0.613
idd+0.582
ooming+0.569
irled+0.549
zz+0.548
Neg logit contributions
 stranger-0.334
 keeper-0.314
 stall-0.308
 Heidi-0.284
 ranger-0.283
Token with
Token ID351
Feature activation-3.51e-01

Pos logit contributions
ug+0.631
idd+0.594
ooming+0.576
zz+0.553
irled+0.553
Neg logit contributions
 stranger-0.528
 keeper-0.503
 stall-0.495
 Heidi-0.464
 ranger-0.464
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
ug+0.773
idd+0.732
ooming+0.712
irled+0.686
zz+0.686
Neg logit contributions
 stranger-0.508
 keeper-0.481
 stall-0.472
 Heidi-0.439
 ranger-0.437
Token\n
Token ID198
Feature activation+1.82e-01

Pos logit contributions
ug+0.353
idd+0.335
ooming+0.327
irled+0.316
med+0.316
Neg logit contributions
 stranger-0.198
 keeper-0.186
 stall-0.183
 Heidi-0.168
 ranger-0.167
Token\n
Token ID198
Feature activation+1.15e+00

Pos logit contributions
 stranger+0.251
 keeper+0.235
 stall+0.232
 seller+0.216
 ranger+0.214
Neg logit contributions
ug-0.413
idd-0.392
ooming-0.380
irled-0.371
med-0.368
TokenThe
Token ID464
Feature activation-7.02e-01

Pos logit contributions
 stranger+1.865
 stall+1.792
 keeper+1.787
 camel+1.694
 seller+1.678
Neg logit contributions
ug-2.280
idd-2.183
irled-2.097
ooming-2.093
med-2.061
Token next
Token ID1306
Feature activation-2.01e+00

Pos logit contributions
ug+1.740
idd+1.646
ooming+1.608
zz+1.561
ips+1.560
Neg logit contributions
 stranger-0.874
 keeper-0.819
 stall-0.786
 ranger-0.735
 truth-0.720
Token day
Token ID1110
Feature activation+4.53e-01

Pos logit contributions
ug+4.276
idd+4.108
ooming+4.004
ips+3.898
ged+3.883
Neg logit contributions
 stranger-3.477
 keeper-3.275
 ranger-3.037
 stall-3.035
 truth-2.895
Token,
Token ID11
Feature activation+7.17e-01

Pos logit contributions
 stranger+0.773
 keeper+0.736
 stall+0.734
Stan+0.692
 ranger+0.690
Neg logit contributions
ug-0.865
idd-0.815
ooming-0.792
med-0.767
irled-0.765
Token Jane
Token ID12091
Feature activation-5.99e-01

Pos logit contributions
 stranger+1.080
 stall+1.045
 keeper+1.032
Stan+0.974
 camel+0.967
Neg logit contributions
ug-1.475
idd-1.396
ooming-1.361
irled-1.339
med-1.330
Token asked
Token ID1965
Feature activation+1.08e+00

Pos logit contributions
ug+1.372
idd+1.294
ooming+1.264
ips+1.222
irled+1.215
Neg logit contributions
 stranger-0.817
 keeper-0.770
 stall-0.755
 seller-0.696
 ranger-0.693
Token her
Token ID607
Feature activation+1.21e+00

Pos logit contributions
 stranger+1.498
 stall+1.448
 keeper+1.431
Stan+1.349
 ranger+1.337
Neg logit contributions
ug-2.405
idd-2.209
ooming-2.180
med-2.159
ged-2.155
Token and
Token ID290
Feature activation-1.73e+00

Pos logit contributions
ug+0.975
idd+0.928
ooming+0.901
irled+0.872
zz+0.868
Neg logit contributions
 stranger-0.619
 keeper-0.589
 stall-0.577
 ranger-0.534
 seller-0.534
Token they
Token ID484
Feature activation-1.55e+00

Pos logit contributions
ug+3.703
idd+3.482
ooming+3.372
ips+3.265
irled+3.237
Neg logit contributions
 stranger-2.773
 keeper-2.746
 stall-2.590
 ranger-2.537
 Heidi-2.467
Token both
Token ID1111
Feature activation-2.15e+00

Pos logit contributions
ug+3.071
idd+2.950
ooming+2.847
ips+2.792
irled+2.694
Neg logit contributions
 stranger-2.575
 keeper-2.477
 stall-2.385
 seller-2.310
 ranger-2.282
Token had
Token ID550
Feature activation+2.34e-01

Pos logit contributions
ug+4.371
idd+4.199
ooming+4.132
ips+4.042
irled+3.802
Neg logit contributions
 stranger-3.552
 keeper-3.442
 stall-3.239
 bench-3.170
 ranger-3.154
Token a
Token ID257
Feature activation-1.44e+00

Pos logit contributions
 stranger+0.321
 keeper+0.299
 stall+0.296
 Heidi+0.274
 seller+0.274
Neg logit contributions
ug-0.534
idd-0.508
ooming-0.492
med-0.481
irled-0.480
Token great
Token ID1049
Feature activation-2.05e+00

Pos logit contributions
ug+3.383
idd+3.121
ips+3.060
med+3.023
ooming+3.011
Neg logit contributions
 stranger-2.095
 keeper-1.926
 stall-1.911
 bench-1.766
 ranger-1.757
Token time
Token ID640
Feature activation-2.21e+00

Pos logit contributions
ug+4.354
idd+4.237
med+4.110
ips+3.971
ooming+3.925
Neg logit contributions
 stranger-3.376
 keeper-3.102
 stall-3.075
 truth-2.983
 bench-2.950
Token.
Token ID13
Feature activation-4.59e-02

Pos logit contributions
ug+5.472
idd+5.164
zz+5.155
ooming+5.085
irled+5.080
Neg logit contributions
 stranger-2.562
 keeper-2.500
 stall-2.417
 ranger-2.250
 bench-2.151
Token
Token ID220
Feature activation-4.79e-01

Pos logit contributions
ug+0.096
idd+0.090
ooming+0.088
irled+0.085
med+0.084
Neg logit contributions
 stranger-0.072
 keeper-0.068
 stall-0.067
 ranger-0.063
 seller-0.062
Token\n
Token ID198
Feature activation-2.10e-03

Pos logit contributions
ug+1.130
idd+1.074
ooming+1.050
irled+1.015
zz+1.014
Neg logit contributions
 stranger-0.601
 keeper-0.572
 stall-0.561
 Heidi-0.521
 ranger-0.513
Token\n
Token ID198
Feature activation+9.34e-01

Pos logit contributions
ug+0.005
idd+0.005
ooming+0.004
irled+0.004
med+0.004
Neg logit contributions
 stranger-0.003
 keeper-0.003
 stall-0.003
 Heidi-0.002
 seller-0.002