TOP ACTIVATIONS
MAX = 4.137

he decided to go to the river . He looked in
He went to the market , found the perfect
day he would go to the field to do his work
" Let 's go to the west !" said Jill .
special . She went to the forest and she looked for
but when she arrived at the first place , they were
So they went to the store and asked the shop
and her daughter went to the field to test the corn
, Sam would go to the market with his family .
time to sleep !" Dave sighed . He wanted
carrot . He went to the garden to find one .
One day he went to the pond to drink some water
So , he went to the garden to pick some fresh
hungry . He headed to the west and began to twist
So , she went to the gate and opened it .
field . Johnny ran to the middle and noticed a big
day , he went to the forest and spotted a beautiful
You have to go to the west and twist off a
, so she went to the kitchen . She spotted a
and her parents went to the market . Jane tried to

MIN ACTIVATIONS
MIN = -4.008

computer . He would spend hours and hours every day playing
day Mill ie would spend hours making new routes for the
be ep . They have fun and laugh a lot .
arrow . He would spend hours shooting arrows at targets in
He would spend hours and hours every day playing games and
soft . They both had fun and were happy . The
help her . They all kept spinning , but she still
ep sounds . They have fun , but they also want
her . They both had fun and created many more things
They were having fun with their paper toys .
The cats had lots of fun playing , but after a
string . She would spend hours on end weaving patterns and
. He was very good at fixing them . One day
John and Jane spent the day practicing tennis . When they
trumpet and had lots of fun . After a
together . They both had fun and Lily realized that it
shiny pieces . She worked and worked , putting each piece
. They had lots of fun that day and Lily was
my . They were having fun catching butterflies when Lily noticed
They were having fun . But then they saw

INTERVAL 2.101 - 4.137
CONTAINS 0.487%

was a big farm . The farm had many animals like
had a wonderful day at the store with his friends and
laughed too . He knew the fox was too smart to
day , they went to the store and found a soft
thanked him , and gave him a smile .

INTERVAL 0.065 - 2.101
CONTAINS 45.140%

Tom says . " We should share the tree with other
decided to share her food with him . Max was very
a great job on the quiz ! You are very smart
cry . Everything is going to be alright ." But the
gift . She knew it was going to be a fun

INTERVAL -1.971 - 0.065
CONTAINS 53.605%

I did !" Billy smiled and said , "
's courage that they immediately stopped teasing Tiger . From that
became very sad and scared . Eventually , a big bird
around fragile things and her mom trusted her even more .
. They were all so happy that the king had a

INTERVAL -4.008 - -1.971
CONTAINS 0.767%

Sarah started to spin . She spun so fast ! Around
make figures . He made all sorts of figures , from
to take good care of my engine ," said Tim my
. He made engine noises with his mouth , and he
She picked it up and started to mark on the hoop
Token he
Token ID339
Feature activation+1.26e+00

Pos logit contributions
 stranger+0.995
 keeper+0.944
 stall+0.937
Stan+0.891
 truth+0.884
Neg logit contributions
ug-1.110
idd-1.007
ooming-0.975
zz-0.957
irled-0.955
Token decided
Token ID3066
Feature activation+2.49e+00

Pos logit contributions
 stranger+2.241
 keeper+2.126
 stall+2.071
 Heidi+2.046
 truth+2.010
Neg logit contributions
ug-2.347
idd-2.173
zz-2.108
irled-2.084
ged-2.049
Token to
Token ID284
Feature activation+1.74e+00

Pos logit contributions
 stall+4.241
 stranger+4.156
 keeper+4.039
Stan+3.932
 butcher+3.905
Neg logit contributions
ug-4.572
ged-4.312
ooming-4.164
idd-4.138
zz-3.972
Token go
Token ID467
Feature activation+6.44e-01

Pos logit contributions
 stranger+2.711
 keeper+2.555
 stall+2.417
 ranger+2.369
 Heidi+2.338
Neg logit contributions
ug-3.589
idd-3.291
zz-3.239
irled-3.130
ooming-3.113
Token to
Token ID284
Feature activation+1.67e+00

Pos logit contributions
 stranger+1.006
 keeper+0.960
 stall+0.923
 Heidi+0.896
 truth+0.872
Neg logit contributions
ug-1.329
idd-1.257
ooming-1.234
zz-1.194
ips-1.187
Token the
Token ID262
Feature activation+4.14e+00

Pos logit contributions
 stranger+2.862
 keeper+2.784
 stall+2.580
 seller+2.551
 ranger+2.551
Neg logit contributions
ug-3.049
idd-2.842
ooming-2.797
zz-2.710
ips-2.686
Token river
Token ID7850
Feature activation+1.05e+00

Pos logit contributions
 Heidi+6.366
 keeper+5.903
 stranger+5.870
holder+5.749
 Sonia+5.662
Neg logit contributions
idd-8.072
 stacking-7.426
ooming-7.404
ug-7.198
bed-7.179
Token.
Token ID13
Feature activation+8.22e-01

Pos logit contributions
 stranger+1.565
 Heidi+1.414
 keeper+1.406
 stall+1.373
Stan+1.366
Neg logit contributions
ug-2.291
idd-2.129
ooming-2.070
ips-1.976
zz-1.972
Token He
Token ID679
Feature activation+4.91e-01

Pos logit contributions
 stranger+1.051
 keeper+0.961
 stall+0.921
Stan+0.886
 Heidi+0.870
Neg logit contributions
ug-1.959
idd-1.835
ooming-1.765
anged-1.738
zz-1.735
Token looked
Token ID3114
Feature activation+2.49e-01

Pos logit contributions
 stranger+0.791
 keeper+0.744
 stall+0.722
 Heidi+0.705
 truth+0.690
Neg logit contributions
ug-1.002
idd-0.945
ooming-0.902
zz-0.889
irled-0.888
Token in
Token ID287
Feature activation+1.26e+00

Pos logit contributions
 stranger+0.429
 keeper+0.415
 stall+0.404
 Heidi+0.384
 ranger+0.384
Neg logit contributions
ug-0.472
idd-0.440
ooming-0.430
irled-0.414
zz-0.412
Token
Token ID198
Feature activation-2.56e-01

Pos logit contributions
ug+0.068
idd+0.065
ooming+0.063
irled+0.061
med+0.061
Neg logit contributions
 stranger-0.040
 keeper-0.037
 stall-0.037
 Heidi-0.034
 seller-0.034
Token
Token ID198
Feature activation+9.14e-01

Pos logit contributions
ug+0.598
idd+0.568
ooming+0.554
irled+0.535
zz+0.535
Neg logit contributions
 stranger-0.333
 keeper-0.313
 stall-0.308
 Heidi-0.283
 ranger-0.282
TokenHe
Token ID1544
Feature activation+8.02e-01

Pos logit contributions
 stranger+1.296
 keeper+1.213
 stall+1.209
 Heidi+1.146
 camel+1.140
Neg logit contributions
ug-2.040
idd-1.909
ooming-1.882
med-1.847
irled-1.843
Token went
Token ID1816
Feature activation+1.07e+00

Pos logit contributions
 stranger+1.370
 keeper+1.273
 stall+1.258
 Heidi+1.233
 truth+1.194
Neg logit contributions
ug-1.564
idd-1.481
ooming-1.412
irled-1.400
zz-1.383
Token to
Token ID284
Feature activation+1.48e+00

Pos logit contributions
 stranger+1.584
 keeper+1.481
 truth+1.444
 stall+1.444
 Heidi+1.441
Neg logit contributions
ug-2.238
idd-2.163
ooming-2.108
ips-2.053
ged-2.021
Token the
Token ID262
Feature activation+4.13e+00

Pos logit contributions
 stranger+2.414
 keeper+2.310
 stall+2.194
 seller+2.150
Stan+2.149
Neg logit contributions
ug-2.795
idd-2.676
ooming-2.574
ips-2.483
zz-2.483
Token market
Token ID1910
Feature activation+2.01e+00

Pos logit contributions
asha+5.240
oto+5.153
isi+5.092
 Heidi+5.037
iri+4.917
Neg logit contributions
idd-9.580
 stacking-8.940
ooming-8.817
ug-8.225
ips-8.202
Token,
Token ID11
Feature activation+3.95e-01

Pos logit contributions
 stranger+3.840
Stan+3.548
 Heidi+3.539
 camel+3.512
 truth+3.484
Neg logit contributions
ug-3.407
idd-3.272
ooming-3.159
ged-3.009
ips-2.985
Token found
Token ID1043
Feature activation+2.13e+00

Pos logit contributions
 stranger+0.616
 keeper+0.581
 stall+0.568
 Heidi+0.539
 truth+0.537
Neg logit contributions
ug-0.808
idd-0.772
ooming-0.749
med-0.719
ged-0.717
Token the
Token ID262
Feature activation+1.86e+00

Pos logit contributions
 stranger+3.131
 keeper+2.961
 stall+2.861
Stan+2.733
 ranger+2.714
Neg logit contributions
ug-4.645
idd-4.536
med-4.343
ged-4.238
ips-4.102
Token perfect
Token ID2818
Feature activation+5.78e-01

Pos logit contributions
 stranger+2.366
 Heidi+2.286
 keeper+2.269
Stan+2.225
 Stanley+2.173
Neg logit contributions
idd-3.926
ug-3.883
ooming-3.719
ged-3.588
ips-3.551
Token day
Token ID1110
Feature activation-6.30e-01

Pos logit contributions
 stranger+1.960
 keeper+1.937
 stall+1.887
 Heidi+1.848
 seller+1.826
Neg logit contributions
ug-1.471
idd-1.304
zz-1.235
ooming-1.229
irled-1.192
Token he
Token ID339
Feature activation+2.37e-01

Pos logit contributions
ug+1.376
idd+1.316
ooming+1.281
irled+1.240
ips+1.239
Neg logit contributions
 stranger-0.930
 keeper-0.867
 stall-0.860
 Heidi-0.799
 ranger-0.793
Token would
Token ID561
Feature activation+1.59e+00

Pos logit contributions
 stranger+0.273
 keeper+0.256
 stall+0.250
 Heidi+0.233
Stan+0.225
Neg logit contributions
ug-0.588
idd-0.558
ooming-0.544
zz-0.533
irled-0.531
Token go
Token ID467
Feature activation+4.58e-01

Pos logit contributions
 stranger+2.284
 keeper+2.209
 stall+2.101
 Heidi+2.076
 ranger+2.060
Neg logit contributions
ug-3.441
idd-3.131
ooming-3.130
zz-3.097
irled-3.002
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
 stranger+0.709
 keeper+0.674
 stall+0.661
 Heidi+0.636
Stan+0.627
Neg logit contributions
ug-0.951
idd-0.895
ooming-0.881
zz-0.848
ips-0.839
Token the
Token ID262
Feature activation+4.08e+00

Pos logit contributions
 stranger+1.654
 keeper+1.616
 stall+1.526
 ranger+1.491
Stan+1.476
Neg logit contributions
ug-2.123
idd-1.971
ooming-1.955
zz-1.863
ips-1.837
Token field
Token ID2214
Feature activation+1.18e+00

Pos logit contributions
 Heidi+6.644
 Stanley+6.186
Stan+6.051
isi+6.022
 keeper+5.999
Neg logit contributions
ug-7.938
idd-7.923
ooming-7.616
 stacking-7.393
 spinning-7.248
Token to
Token ID284
Feature activation+1.67e+00

Pos logit contributions
 stranger+1.931
 keeper+1.778
 Heidi+1.737
 stall+1.718
Stan+1.714
Neg logit contributions
ug-2.406
ooming-2.199
idd-2.196
ips-2.103
zz-2.060
Token do
Token ID466
Feature activation+4.17e-02

Pos logit contributions
 stranger+2.752
 keeper+2.662
 stall+2.512
Stan+2.465
 ranger+2.454
Neg logit contributions
ug-3.225
idd-2.996
ooming-2.944
zz-2.880
irled-2.790
Token his
Token ID465
Feature activation+1.63e+00

Pos logit contributions
 stranger+0.050
 keeper+0.047
 stall+0.046
 Heidi+0.042
 seller+0.042
Neg logit contributions
ug-0.102
idd-0.096
ooming-0.095
irled-0.092
med-0.092
Token work
Token ID670
Feature activation-5.46e-01

Pos logit contributions
 stranger+2.842
 keeper+2.671
 Heidi+2.568
 stall+2.562
Stan+2.543
Neg logit contributions
ug-3.046
ooming-2.866
idd-2.802
irled-2.784
zz-2.733
Token"
Token ID1
Feature activation+8.90e-01

Pos logit contributions
 stranger+1.340
 stall+1.252
 keeper+1.212
 Heidi+1.177
 Stanley+1.166
Neg logit contributions
ug-2.045
idd-1.963
ooming-1.873
irled-1.851
med-1.847
TokenLet
Token ID5756
Feature activation+1.43e+00

Pos logit contributions
 stranger+1.412
 stall+1.361
 keeper+1.342
 Heidi+1.267
 camel+1.266
Neg logit contributions
ug-1.782
idd-1.731
ooming-1.620
irled-1.585
zz-1.585
Token's
Token ID338
Feature activation+9.28e-01

Pos logit contributions
 stranger+1.808
 stall+1.790
 keeper+1.767
 ranger+1.663
 seller+1.589
Neg logit contributions
ug-3.343
idd-3.123
ooming-3.086
zz-3.062
irled-3.018
Token go
Token ID467
Feature activation-3.24e-01

Pos logit contributions
 stranger+1.227
 keeper+1.157
 stall+1.114
 ranger+1.066
 Heidi+1.049
Neg logit contributions
ug-2.153
idd-2.006
ooming-1.967
zz-1.947
irled-1.900
Token to
Token ID284
Feature activation+7.39e-01

Pos logit contributions
ug+0.690
idd+0.651
ooming+0.633
irled+0.611
med+0.609
Neg logit contributions
 stranger-0.492
 keeper-0.467
 stall-0.461
 Heidi-0.428
 ranger-0.427
Token the
Token ID262
Feature activation+4.06e+00

Pos logit contributions
 stranger+1.133
 keeper+1.088
 stall+1.054
 seller+1.008
 ranger+0.992
Neg logit contributions
ug-1.532
idd-1.437
ooming-1.399
zz-1.358
med-1.340
Token west
Token ID7421
Feature activation+4.71e-01

Pos logit contributions
 Heidi+5.287
 keeper+5.068
Stan+4.966
holder+4.933
 ranger+4.908
Neg logit contributions
ug-8.511
idd-8.384
ooming-8.220
 spinning-8.135
 pract-8.094
Token!"
Token ID2474
Feature activation-3.71e-01

Pos logit contributions
 stranger+0.759
 keeper+0.703
 stall+0.700
 Heidi+0.679
Stan+0.671
Neg logit contributions
ug-0.952
idd-0.889
ooming-0.866
zz-0.830
ips-0.822
Token said
Token ID531
Feature activation-6.95e-01

Pos logit contributions
ug+0.838
idd+0.793
ooming+0.776
zz+0.744
ips+0.744
Neg logit contributions
 stranger-0.518
 keeper-0.491
 stall-0.479
 Heidi-0.446
 ranger-0.445
Token Jill
Token ID22772
Feature activation+4.70e-02

Pos logit contributions
ug+1.268
idd+1.186
ooming+1.161
ips+1.103
zz+1.099
Neg logit contributions
 stranger-1.271
 keeper-1.228
 stall-1.189
 Heidi-1.142
 ranger-1.141
Token.
Token ID13
Feature activation-3.61e-01

Pos logit contributions
 stranger+0.062
 keeper+0.058
 stall+0.057
 Heidi+0.053
Stan+0.053
Neg logit contributions
ug-0.109
idd-0.103
ooming-0.101
med-0.098
zz-0.097
Token special
Token ID2041
Feature activation+1.03e+00

Pos logit contributions
 stranger+2.470
 keeper+2.430
 stall+2.390
 Heidi+2.292
Stan+2.289
Neg logit contributions
ug-3.838
idd-3.581
ooming-3.477
irled-3.457
zz-3.378
Token.
Token ID13
Feature activation+3.92e-01

Pos logit contributions
 stranger+1.590
 keeper+1.511
 stall+1.469
Stan+1.442
 Heidi+1.430
Neg logit contributions
ug-2.134
idd-1.986
ooming-1.910
ged-1.883
ips-1.880
Token She
Token ID1375
Feature activation+3.82e-01

Pos logit contributions
 stranger+0.489
 keeper+0.458
 stall+0.442
 seller+0.411
 ranger+0.407
Neg logit contributions
ug-0.933
idd-0.885
ooming-0.867
zz-0.847
med-0.846
Token went
Token ID1816
Feature activation+6.33e-01

Pos logit contributions
 stranger+0.586
 keeper+0.551
 stall+0.540
 Heidi+0.518
Stan+0.506
Neg logit contributions
ug-0.807
idd-0.766
ooming-0.737
irled-0.724
zz-0.718
Token to
Token ID284
Feature activation+1.37e+00

Pos logit contributions
 stranger+0.996
 keeper+0.950
 stall+0.927
 Heidi+0.894
Stan+0.880
Neg logit contributions
ug-1.287
idd-1.212
ooming-1.199
zz-1.157
ips-1.149
Token the
Token ID262
Feature activation+4.02e+00

Pos logit contributions
 stranger+2.170
 keeper+2.118
 stall+2.009
 seller+1.968
 ranger+1.948
Neg logit contributions
ug-2.714
idd-2.524
ooming-2.507
zz-2.476
irled-2.405
Token forest
Token ID8222
Feature activation+1.21e+00

Pos logit contributions
 Heidi+6.509
 keeper+5.945
 Stanley+5.924
 Clover+5.869
Stan+5.802
Neg logit contributions
idd-8.273
 mar-7.470
 stacking-7.436
ooming-7.400
 pol-7.387
Token and
Token ID290
Feature activation+1.29e+00

Pos logit contributions
 stranger+2.065
 keeper+1.884
 Heidi+1.872
Stan+1.859
 stall+1.848
Neg logit contributions
ug-2.336
idd-2.178
ooming-2.154
zz-2.033
ged-2.028
Token she
Token ID673
Feature activation+7.90e-01

Pos logit contributions
 stranger+2.129
 keeper+2.008
 stall+1.976
 truth+1.944
Stan+1.895
Neg logit contributions
ug-2.498
idd-2.437
ged-2.357
zz-2.324
ooming-2.315
Token looked
Token ID3114
Feature activation+8.91e-01

Pos logit contributions
 stranger+1.242
 keeper+1.147
 stall+1.127
 Heidi+1.099
 truth+1.071
Neg logit contributions
ug-1.647
idd-1.592
ooming-1.517
irled-1.499
zz-1.482
Token for
Token ID329
Feature activation+1.86e+00

Pos logit contributions
 stranger+1.633
 keeper+1.608
 stall+1.546
Stan+1.488
 Heidi+1.487
Neg logit contributions
ug-1.588
idd-1.475
ooming-1.448
ged-1.410
irled-1.402
Token but
Token ID475
Feature activation+1.42e+00

Pos logit contributions
 stranger+1.121
 keeper+1.051
 stall+1.035
 Heidi+0.996
 seller+0.992
Neg logit contributions
ug-1.375
idd-1.270
ooming-1.256
zz-1.210
med-1.200
Token when
Token ID618
Feature activation+4.36e-01

Pos logit contributions
 stranger+2.226
 stall+2.091
 keeper+2.074
 butcher+2.016
Stan+2.013
Neg logit contributions
ug-2.905
idd-2.745
ged-2.686
ooming-2.615
irled-2.613
Token she
Token ID673
Feature activation+1.37e-01

Pos logit contributions
 stranger+0.595
 keeper+0.567
 stall+0.563
 seller+0.521
Stan+0.517
Neg logit contributions
ug-0.972
idd-0.923
ooming-0.901
med-0.874
irled-0.872
Token arrived
Token ID5284
Feature activation+1.42e+00

Pos logit contributions
 stranger+0.215
 keeper+0.203
 stall+0.200
 Heidi+0.190
 ranger+0.187
Neg logit contributions
ug-0.285
idd-0.270
ooming-0.263
zz-0.253
irled-0.253
Token at
Token ID379
Feature activation+7.03e-01

Pos logit contributions
 stranger+2.639
 keeper+2.559
 stall+2.531
Stan+2.475
 truth+2.463
Neg logit contributions
ug-2.496
ooming-2.422
idd-2.322
ged-2.258
ips-2.208
Token the
Token ID262
Feature activation+4.00e+00

Pos logit contributions
 stranger+0.991
 keeper+0.931
 stall+0.915
 seller+0.867
Stan+0.866
Neg logit contributions
ug-1.545
idd-1.462
ooming-1.414
ips-1.379
zz-1.376
Token first
Token ID717
Feature activation+1.52e+00

Pos logit contributions
 Heidi+7.183
 stranger+6.225
 Stanley+6.199
Stan+6.195
 Sonia+6.141
Neg logit contributions
ug-7.799
idd-7.271
 spinning-6.977
 pract-6.932
ooming-6.895
Token place
Token ID1295
Feature activation+9.56e-01

Pos logit contributions
 stranger+2.023
 keeper+1.920
 butcher+1.855
 Heidi+1.829
holder+1.821
Neg logit contributions
ug-3.520
idd-3.132
ooming-3.106
zz-3.068
ged-2.963
Token,
Token ID11
Feature activation+8.13e-01

Pos logit contributions
 stranger+1.617
 stall+1.513
Stan+1.497
 keeper+1.480
 butcher+1.455
Neg logit contributions
ug-1.864
idd-1.741
ooming-1.698
ips-1.629
irled-1.623
Token they
Token ID484
Feature activation+1.24e+00

Pos logit contributions
 stranger+1.091
 keeper+1.054
 stall+1.047
 seller+0.965
Stan+0.964
Neg logit contributions
ug-1.817
idd-1.715
ooming-1.663
med-1.635
ged-1.628
Token were
Token ID547
Feature activation+2.59e-01

Pos logit contributions
 stranger+2.052
 keeper+1.980
 stall+1.974
 Heidi+1.908
Stan+1.856
Neg logit contributions
ug-2.462
idd-2.284
ged-2.221
ooming-2.203
zz-2.146
Token
Token ID198
Feature activation+6.46e-01

Pos logit contributions
ug+0.736
idd+0.700
ooming+0.683
irled+0.660
zz+0.659
Neg logit contributions
 stranger-0.404
 keeper-0.380
 stall-0.372
 Heidi-0.343
 ranger-0.342
TokenSo
Token ID2396
Feature activation-1.01e+00

Pos logit contributions
 stranger+1.199
 stall+1.158
 keeper+1.144
 Heidi+1.100
 seller+1.085
Neg logit contributions
ug-1.147
idd-1.070
ooming-1.031
med-1.007
irled-0.987
Token they
Token ID484
Feature activation+8.58e-01

Pos logit contributions
ug+2.069
idd+2.017
ooming+1.928
ips+1.874
zz+1.836
Neg logit contributions
 stranger-1.608
 keeper-1.535
 stall-1.481
 Stanley-1.429
 Heidi-1.419
Token went
Token ID1816
Feature activation+7.08e-01

Pos logit contributions
 stranger+1.500
 keeper+1.442
 stall+1.403
 Heidi+1.347
 truth+1.337
Neg logit contributions
ug-1.641
idd-1.525
zz-1.470
ooming-1.450
med-1.447
Token to
Token ID284
Feature activation+1.14e+00

Pos logit contributions
 stranger+0.984
 keeper+0.933
 stall+0.902
 truth+0.865
 Heidi+0.863
Neg logit contributions
ug-1.564
idd-1.500
ooming-1.467
ips-1.425
zz-1.414
Token the
Token ID262
Feature activation+4.00e+00

Pos logit contributions
 stranger+1.824
 keeper+1.731
 stall+1.646
 seller+1.590
 ranger+1.579
Neg logit contributions
ug-2.240
idd-2.150
ooming-2.103
zz-2.020
ged-1.994
Token store
Token ID3650
Feature activation+1.35e+00

Pos logit contributions
 Heidi+6.826
 stranger+6.470
 Sonia+6.468
holder+6.394
Stan+6.394
Neg logit contributions
idd-6.961
 stacking-6.572
ooming-6.399
bed-6.194
 shaping-6.054
Token and
Token ID290
Feature activation+8.64e-01

Pos logit contributions
 stranger+2.372
Stan+2.130
 camel+2.129
 Heidi+2.127
 keeper+2.107
Neg logit contributions
ug-2.539
idd-2.464
ooming-2.372
ged-2.258
ips-2.229
Token asked
Token ID1965
Feature activation+6.83e-01

Pos logit contributions
 stranger+1.439
 keeper+1.351
 stall+1.321
 truth+1.295
Stan+1.259
Neg logit contributions
ug-1.661
idd-1.619
ooming-1.531
med-1.520
ged-1.516
Token the
Token ID262
Feature activation+3.01e+00

Pos logit contributions
 stranger+1.197
 keeper+1.151
 stall+1.138
 ranger+1.070
 bench+1.069
Neg logit contributions
ug-1.318
idd-1.186
ooming-1.163
med-1.134
ged-1.130
Token shop
Token ID6128
Feature activation+3.03e+00

Pos logit contributions
 Heidi+3.727
 Stanley+3.625
 stranger+3.475
Stan+3.462
 Sonia+3.381
Neg logit contributions
idd-6.650
ug-6.229
ooming-6.055
ips-6.050
 stacking-5.693
Token and
Token ID290
Feature activation+6.68e-01

Pos logit contributions
ug+1.408
idd+1.322
ooming+1.294
irled+1.245
ips+1.234
Neg logit contributions
 stranger-1.074
 keeper-1.016
 stall-1.004
 seller-0.948
 ranger-0.932
Token her
Token ID607
Feature activation+3.63e-01

Pos logit contributions
 stranger+0.888
 keeper+0.863
 stall+0.841
Stan+0.795
 ranger+0.780
Neg logit contributions
ug-1.516
idd-1.431
ooming-1.367
zz-1.352
ged-1.346
Token daughter
Token ID4957
Feature activation+3.07e-01

Pos logit contributions
 stranger+0.530
 keeper+0.505
 stall+0.493
 Heidi+0.468
Stan+0.467
Neg logit contributions
ug-0.775
idd-0.736
ooming-0.715
zz-0.692
irled-0.686
Token went
Token ID1816
Feature activation+9.34e-02

Pos logit contributions
 stranger+0.387
 keeper+0.366
 stall+0.356
 Heidi+0.330
 ranger+0.330
Neg logit contributions
ug-0.723
idd-0.686
ooming-0.669
zz-0.651
med-0.646
Token to
Token ID284
Feature activation+4.10e-01

Pos logit contributions
 stranger+0.122
 keeper+0.115
 stall+0.112
 Heidi+0.105
Stan+0.104
Neg logit contributions
ug-0.217
idd-0.206
ooming-0.202
ips-0.195
zz-0.194
Token the
Token ID262
Feature activation+3.99e+00

Pos logit contributions
 stranger+0.608
 keeper+0.583
 stall+0.563
 ranger+0.534
 seller+0.531
Neg logit contributions
ug-0.872
idd-0.826
ooming-0.806
zz-0.781
irled-0.775
Token field
Token ID2214
Feature activation+1.36e+00

Pos logit contributions
 keeper+5.740
 ranger+5.727
 stranger+5.178
 sir+5.065
 Heidi+5.035
Neg logit contributions
idd-7.875
ooming-7.741
 pract-7.732
 sudden-7.711
 stacking-7.699
Token to
Token ID284
Feature activation+1.53e+00

Pos logit contributions
 stranger+2.063
 keeper+1.859
Stan+1.839
 Heidi+1.837
 butcher+1.812
Neg logit contributions
ug-2.931
idd-2.735
ooming-2.701
ips-2.616
irled-2.561
Token test
Token ID1332
Feature activation-3.36e-01

Pos logit contributions
 stranger+2.454
 keeper+2.383
 stall+2.208
 ranger+2.189
Stan+2.133
Neg logit contributions
ug-2.982
idd-2.841
ooming-2.772
zz-2.757
irled-2.662
Token the
Token ID262
Feature activation+2.17e+00

Pos logit contributions
ug+0.769
idd+0.730
ooming+0.712
irled+0.686
zz+0.686
Neg logit contributions
 stranger-0.454
 keeper-0.427
 stall-0.420
 Heidi-0.389
 ranger-0.385
Token corn
Token ID11676
Feature activation+8.49e-01

Pos logit contributions
 ranger+2.230
 keeper+2.204
 stranger+2.146
 Heidi+1.829
Stan+1.824
Neg logit contributions
ug-5.359
zz-5.055
idd-5.011
ooming-4.986
irled-4.942
Token,
Token ID11
Feature activation-3.72e-01

Pos logit contributions
ug+0.822
idd+0.781
ooming+0.757
irled+0.727
med+0.725
Neg logit contributions
 stranger-0.636
 keeper-0.604
 stall-0.594
 ranger-0.557
 Heidi-0.557
Token Sam
Token ID3409
Feature activation+1.41e+00

Pos logit contributions
ug+0.862
idd+0.822
ooming+0.799
irled+0.772
zz+0.770
Neg logit contributions
 stranger-0.496
 keeper-0.466
 stall-0.457
 Heidi-0.423
 ranger-0.422
Token would
Token ID561
Feature activation+1.58e+00

Pos logit contributions
 stranger+2.007
 keeper+1.916
 stall+1.824
 Heidi+1.793
Stan+1.759
Neg logit contributions
ug-3.051
idd-2.890
ooming-2.809
zz-2.765
irled-2.741
Token go
Token ID467
Feature activation+6.27e-01

Pos logit contributions
 stranger+2.228
 keeper+2.154
Stan+2.038
 Heidi+2.038
 stall+2.014
Neg logit contributions
ug-3.362
idd-3.138
ooming-3.083
zz-2.998
irled-2.981
Token to
Token ID284
Feature activation+1.99e+00

Pos logit contributions
 stranger+0.939
 keeper+0.886
 stall+0.859
 Heidi+0.841
Stan+0.829
Neg logit contributions
ug-1.316
idd-1.252
ooming-1.228
ips-1.180
zz-1.176
Token the
Token ID262
Feature activation+3.99e+00

Pos logit contributions
 stranger+3.437
 keeper+3.377
Stan+3.179
 ranger+3.151
 camel+3.108
Neg logit contributions
ug-3.427
idd-3.288
ooming-3.206
ips-3.081
 pract-3.028
Token market
Token ID1910
Feature activation+2.06e+00

Pos logit contributions
 Heidi+4.380
 Stanley+4.014
 Clover+3.938
asha+3.909
oto+3.851
Neg logit contributions
idd-9.982
 stacking-9.180
ooming-9.144
ips-8.862
ug-8.786
Token with
Token ID351
Feature activation+1.37e+00

Pos logit contributions
 stranger+4.300
 Heidi+4.008
 camel+3.964
Stan+3.936
 Stanley+3.908
Neg logit contributions
ug-3.205
idd-2.941
ooming-2.897
ips-2.770
bed-2.694
Token his
Token ID465
Feature activation+1.33e+00

Pos logit contributions
 stranger+2.100
 keeper+2.009
 stall+1.920
Stan+1.865
 ranger+1.827
Neg logit contributions
ug-2.882
idd-2.721
med-2.610
ips-2.599
ooming-2.594
Token family
Token ID1641
Feature activation+1.17e+00

Pos logit contributions
 stranger+2.103
 keeper+1.970
 Heidi+1.947
Stan+1.946
 Clover+1.881
Neg logit contributions
ug-2.544
idd-2.497
ooming-2.362
ips-2.247
irled-2.239
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 stranger+1.776
 keeper+1.679
 Heidi+1.641
Stan+1.611
 stall+1.594
Neg logit contributions
ug-2.445
idd-2.283
ooming-2.232
med-2.138
ips-2.136
Token time
Token ID640
Feature activation-1.08e-01

Pos logit contributions
 stranger+2.266
 keeper+2.203
 stall+2.181
 seller+2.095
 ranger+2.069
Neg logit contributions
ug-1.720
idd-1.532
zz-1.457
ooming-1.448
irled-1.435
Token to
Token ID284
Feature activation+1.43e+00

Pos logit contributions
ug+0.247
idd+0.234
ooming+0.228
irled+0.220
zz+0.220
Neg logit contributions
 stranger-0.144
 keeper-0.136
 stall-0.134
 Heidi-0.124
 ranger-0.123
Token sleep
Token ID3993
Feature activation+2.96e-01

Pos logit contributions
 stranger+2.117
 keeper+2.008
 stall+2.000
 ranger+1.845
 Heidi+1.840
Neg logit contributions
ug-3.153
zz-2.878
ooming-2.871
idd-2.821
irled-2.729
Token!"
Token ID2474
Feature activation+2.36e+00

Pos logit contributions
 stranger+0.455
 keeper+0.434
 stall+0.426
 Heidi+0.400
 seller+0.399
Neg logit contributions
ug-0.621
idd-0.580
ooming-0.568
irled-0.546
zz-0.543
Token
Token ID198
Feature activation+2.76e+00

Pos logit contributions
 keeper+3.889
 stranger+3.874
 butcher+3.771
 stall+3.726
Stan+3.705
Neg logit contributions
ug-4.300
ged-3.983
irled-3.982
idd-3.956
bed-3.792
Token
Token ID198
Feature activation+3.99e+00

Pos logit contributions
 stranger+4.822
 keeper+4.762
 butcher+4.504
 ranc+4.503
 camel+4.486
Neg logit contributions
ug-5.007
idd-4.749
ooming-4.534
irled-4.458
ged-4.374
TokenDave
Token ID27984
Feature activation+4.02e-01

Pos logit contributions
 keeper+6.388
 truth+6.352
 address+6.254
 chest+6.217
 camel+6.165
Neg logit contributions
idd-7.679
ug-7.661
ooming-7.422
irled-7.302
ged-7.276
Token sighed
Token ID21893
Feature activation+7.41e-01

Pos logit contributions
 stranger+0.607
 stall+0.564
 keeper+0.562
 Heidi+0.535
Stan+0.521
Neg logit contributions
ug-0.867
idd-0.813
ooming-0.794
zz-0.777
irled-0.773
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
 stranger+1.159
 keeper+1.097
 stall+1.091
Stan+1.029
 Heidi+1.029
Neg logit contributions
ug-1.524
ooming-1.406
idd-1.398
ged-1.354
irled-1.347
Token He
Token ID679
Feature activation+1.51e+00

Pos logit contributions
 stranger+1.770
 keeper+1.603
 stall+1.582
 camel+1.547
Stan+1.541
Neg logit contributions
ug-2.973
idd-2.865
ooming-2.714
ips-2.654
irled-2.648
Token wanted
Token ID2227
Feature activation+1.40e+00

Pos logit contributions
 stranger+2.598
 stall+2.509
 keeper+2.485
 Heidi+2.345
 seller+2.331
Neg logit contributions
ug-2.859
idd-2.674
irled-2.671
zz-2.646
ooming-2.581
Token carrot
Token ID40562
Feature activation+1.47e+00

Pos logit contributions
 keeper+2.050
 stranger+2.020
 Heidi+1.990
 stall+1.903
Stan+1.892
Neg logit contributions
ug-3.379
idd-3.309
ooming-3.235
zz-3.111
apy-3.051
Token.
Token ID13
Feature activation+6.95e-01

Pos logit contributions
 stranger+2.325
 keeper+2.153
 Heidi+2.106
 stall+2.089
Stan+2.074
Neg logit contributions
ug-3.042
idd-2.828
ooming-2.764
bed-2.683
zz-2.673
Token He
Token ID679
Feature activation+1.77e+00

Pos logit contributions
 stranger+0.921
 keeper+0.852
 stall+0.838
Stan+0.774
 seller+0.768
Neg logit contributions
ug-1.628
idd-1.512
ooming-1.501
zz-1.454
irled-1.451
Token went
Token ID1816
Feature activation+8.33e-01

Pos logit contributions
 stranger+3.223
 keeper+3.080
 stall+3.028
 truth+2.934
 Heidi+2.927
Neg logit contributions
ug-3.155
idd-2.950
irled-2.915
zz-2.875
ged-2.846
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
 stranger+1.158
 keeper+1.103
 stall+1.073
 Heidi+1.030
 truth+1.014
Neg logit contributions
ug-1.814
idd-1.732
ooming-1.707
zz-1.666
ips-1.656
Token the
Token ID262
Feature activation+3.96e+00

Pos logit contributions
 stranger+2.032
 keeper+1.961
 stall+1.872
 seller+1.789
 ranger+1.769
Neg logit contributions
ug-2.543
idd-2.390
ooming-2.336
zz-2.305
irled-2.247
Token garden
Token ID11376
Feature activation+1.02e+00

Pos logit contributions
 Heidi+6.668
 keeper+6.616
 stranger+6.486
 Sonia+6.308
 Clover+6.199
Neg logit contributions
idd-6.939
bed-6.575
 stacking-6.571
ooming-6.484
ug-6.345
Token to
Token ID284
Feature activation+2.18e+00

Pos logit contributions
 stranger+1.597
 keeper+1.467
 stall+1.403
 Heidi+1.401
 truth+1.381
Neg logit contributions
ug-2.110
idd-2.007
ooming-1.966
zz-1.872
ips-1.871
Token find
Token ID1064
Feature activation+2.27e+00

Pos logit contributions
 stranger+3.656
 keeper+3.521
 stall+3.334
Stan+3.254
holder+3.170
Neg logit contributions
ug-3.891
idd-3.839
zz-3.715
ooming-3.695
irled-3.625
Token one
Token ID530
Feature activation+1.94e+00

Pos logit contributions
 keeper+3.665
 stranger+3.590
 stall+3.409
 ranger+3.336
 bench+3.139
Neg logit contributions
ug-4.675
idd-4.346
med-4.227
ged-4.142
zz-4.104
Token.
Token ID13
Feature activation-4.88e-01

Pos logit contributions
 stranger+3.331
 keeper+3.081
Stan+3.028
 Heidi+2.994
 stall+2.962
Neg logit contributions
ug-3.621
idd-3.359
zz-3.228
ooming-3.174
ig-3.143
Token One
Token ID1881
Feature activation+1.59e+00

Pos logit contributions
 stranger+0.249
 keeper+0.232
 stall+0.228
 seller+0.211
 Heidi+0.210
Neg logit contributions
ug-0.415
idd-0.394
ooming-0.382
med-0.369
irled-0.369
Token day
Token ID1110
Feature activation-3.40e-01

Pos logit contributions
 stranger+3.527
 stall+3.502
 keeper+3.487
Stan+3.382
 seller+3.379
Neg logit contributions
ug-2.179
zz-1.848
idd-1.826
ooming-1.723
anged-1.655
Token he
Token ID339
Feature activation+8.61e-01

Pos logit contributions
ug+0.704
idd+0.665
ooming+0.645
irled+0.622
ips+0.620
Neg logit contributions
 stranger-0.539
 keeper-0.510
 stall-0.504
 Heidi-0.473
 ranger-0.469
Token went
Token ID1816
Feature activation+1.44e+00

Pos logit contributions
 stranger+1.301
 stall+1.251
 keeper+1.243
 Heidi+1.181
Stan+1.144
Neg logit contributions
ug-1.817
idd-1.727
ooming-1.666
irled-1.646
zz-1.642
Token to
Token ID284
Feature activation+8.97e-01

Pos logit contributions
 stranger+2.265
 keeper+2.220
 stall+2.187
Stan+2.092
 Heidi+2.085
Neg logit contributions
ug-2.842
ooming-2.693
idd-2.641
ips-2.572
med-2.556
Token the
Token ID262
Feature activation+3.96e+00

Pos logit contributions
 stranger+1.368
 keeper+1.316
 stall+1.276
 seller+1.221
 ranger+1.202
Neg logit contributions
ug-1.838
idd-1.723
ooming-1.699
zz-1.641
med-1.614
Token pond
Token ID16723
Feature activation+1.52e+00

Pos logit contributions
 Heidi+5.723
 keeper+5.471
 Sonia+5.394
 Clover+5.393
 stranger+5.228
Neg logit contributions
idd-7.945
ug-7.673
ooming-7.651
 stacking-7.579
 drummer-7.486
Token to
Token ID284
Feature activation+1.30e+00

Pos logit contributions
 stranger+2.664
 keeper+2.471
 Heidi+2.451
 stall+2.398
Stan+2.397
Neg logit contributions
ug-2.905
idd-2.681
ooming-2.675
ips-2.494
bed-2.491
Token drink
Token ID4144
Feature activation+1.42e+00

Pos logit contributions
 stranger+2.263
 keeper+2.160
 stall+2.137
Stan+2.032
 Heidi+2.016
Neg logit contributions
ug-2.371
idd-2.239
ooming-2.197
zz-2.124
irled-2.088
Token some
Token ID617
Feature activation+2.72e-01

Pos logit contributions
 stranger+2.176
 keeper+2.130
 stall+2.094
 Heidi+1.985
Stan+1.951
Neg logit contributions
ug-2.970
idd-2.736
med-2.679
ooming-2.659
ged-2.623
Token water
Token ID1660
Feature activation+8.79e-01

Pos logit contributions
 stranger+0.375
 keeper+0.361
 stall+0.352
 Heidi+0.330
 ranger+0.327
Neg logit contributions
ug-0.609
idd-0.575
ooming-0.560
irled-0.545
med-0.541
Token So
Token ID1406
Feature activation-8.82e-01

Pos logit contributions
 stranger+0.523
 keeper+0.485
 stall+0.478
 Heidi+0.443
Stan+0.440
Neg logit contributions
ug-0.845
idd-0.799
ooming-0.777
irled-0.752
zz-0.749
Token,
Token ID11
Feature activation-7.48e-01

Pos logit contributions
ug+1.918
idd+1.865
ooming+1.781
ips+1.745
med+1.709
Neg logit contributions
 stranger-1.310
 keeper-1.239
 stall-1.211
 Stanley-1.158
 Heidi-1.152
Token he
Token ID339
Feature activation+8.11e-01

Pos logit contributions
ug+1.671
idd+1.601
ooming+1.543
ips+1.500
zz+1.491
Neg logit contributions
 stranger-1.081
 keeper-1.004
 stall-0.986
 Stanley-0.934
 Heidi-0.931
Token went
Token ID1816
Feature activation+8.34e-01

Pos logit contributions
 stranger+1.336
 keeper+1.273
 stall+1.260
 Heidi+1.214
 truth+1.185
Neg logit contributions
ug-1.598
idd-1.509
zz-1.430
ooming-1.429
irled-1.425
Token to
Token ID284
Feature activation+5.70e-01

Pos logit contributions
 stranger+1.312
 keeper+1.256
 stall+1.232
 Heidi+1.182
 truth+1.165
Neg logit contributions
ug-1.675
idd-1.601
ooming-1.564
zz-1.521
ips-1.509
Token the
Token ID262
Feature activation+3.95e+00

Pos logit contributions
 stranger+0.843
 keeper+0.800
 stall+0.781
Stan+0.730
 Heidi+0.728
Neg logit contributions
ug-1.215
idd-1.153
ooming-1.114
zz-1.082
irled-1.071
Token garden
Token ID11376
Feature activation+9.74e-01

Pos logit contributions
 Heidi+6.147
 keeper+5.910
 stranger+5.899
 Sonia+5.779
holder+5.731
Neg logit contributions
idd-7.393
bed-6.960
ooming-6.939
 stacking-6.818
ug-6.723
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
 stranger+1.571
 keeper+1.445
 Heidi+1.402
 stall+1.401
Stan+1.377
Neg logit contributions
ug-1.969
idd-1.873
ooming-1.830
med-1.730
ips-1.729
Token pick
Token ID2298
Feature activation+1.18e+00

Pos logit contributions
 stranger+2.023
 keeper+1.927
 stall+1.864
Stan+1.801
 Heidi+1.785
Neg logit contributions
ug-2.480
idd-2.404
ooming-2.282
zz-2.264
irled-2.230
Token some
Token ID617
Feature activation+9.63e-01

Pos logit contributions
 stranger+1.739
 keeper+1.717
 stall+1.609
 ranger+1.568
 bench+1.483
Neg logit contributions
ug-2.530
idd-2.407
med-2.347
ooming-2.309
ged-2.287
Token fresh
Token ID4713
Feature activation-2.02e-01

Pos logit contributions
 stranger+1.137
 keeper+1.118
 stall+1.027
 ranger+1.010
 Heidi+0.992
Neg logit contributions
ug-2.319
idd-2.178
ooming-2.137
med-2.117
irled-2.087
Token hungry
Token ID14720
Feature activation+1.60e+00

Pos logit contributions
 stranger+2.462
 stall+2.456
 keeper+2.344
 bench+2.259
 Heidi+2.176
Neg logit contributions
ug-4.073
idd-3.901
irled-3.675
ooming-3.656
ged-3.585
Token.
Token ID13
Feature activation+7.89e-01

Pos logit contributions
 stranger+2.534
 stall+2.396
 keeper+2.385
Stan+2.359
 Heidi+2.337
Neg logit contributions
ug-3.240
idd-2.950
ooming-2.882
ips-2.842
med-2.837
Token He
Token ID679
Feature activation+1.10e+00

Pos logit contributions
 stranger+1.042
 keeper+0.943
 stall+0.936
 camel+0.866
 bench+0.857
Neg logit contributions
ug-1.847
idd-1.752
ooming-1.674
med-1.640
ips-1.632
Token headed
Token ID9153
Feature activation+9.91e-01

Pos logit contributions
 stranger+1.793
 stall+1.689
 keeper+1.671
 Heidi+1.621
 truth+1.595
Neg logit contributions
ug-2.158
idd-2.050
irled-1.948
zz-1.928
ooming-1.925
Token to
Token ID284
Feature activation+1.69e+00

Pos logit contributions
 stranger+1.507
 keeper+1.426
 stall+1.382
 truth+1.359
Stan+1.334
Neg logit contributions
ug-2.047
idd-1.937
ooming-1.907
zz-1.854
ged-1.846
Token the
Token ID262
Feature activation+3.95e+00

Pos logit contributions
 stranger+2.790
 keeper+2.681
 stall+2.543
 seller+2.474
 ranger+2.464
Neg logit contributions
ug-3.183
idd-3.014
ooming-2.929
ips-2.828
zz-2.820
Token west
Token ID7421
Feature activation+1.59e+00

Pos logit contributions
 Heidi+5.007
 keeper+4.981
oto+4.915
Stan+4.820
 Stanley+4.709
Neg logit contributions
ooming-7.919
ug-7.852
idd-7.852
 pract-7.624
 practicing-7.502
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 stranger+2.845
 Heidi+2.621
Stan+2.614
 butcher+2.560
 truth+2.559
Neg logit contributions
ug-2.898
idd-2.645
ooming-2.611
ips-2.461
zz-2.423
Token began
Token ID2540
Feature activation+3.07e-01

Pos logit contributions
 stranger+1.853
 stall+1.736
 keeper+1.730
 truth+1.708
 Heidi+1.635
Neg logit contributions
ug-2.051
idd-1.980
ged-1.869
ooming-1.862
med-1.861
Token to
Token ID284
Feature activation+1.43e+00

Pos logit contributions
 stranger+0.419
 keeper+0.392
 stall+0.382
 Heidi+0.357
 seller+0.355
Neg logit contributions
ug-0.697
idd-0.664
ooming-0.650
med-0.628
irled-0.626
Token twist
Token ID14528
Feature activation-1.58e-01

Pos logit contributions
 stranger+2.216
 keeper+2.095
 stall+2.045
 ranger+1.929
 butcher+1.899
Neg logit contributions
ug-2.818
ooming-2.701
idd-2.689
zz-2.649
irled-2.554
Token So
Token ID1406
Feature activation+5.05e-01

Pos logit contributions
 stranger+1.572
 keeper+1.456
 stall+1.442
 camel+1.356
 seller+1.354
Neg logit contributions
ug-2.052
idd-1.970
ooming-1.870
irled-1.846
ips-1.828
Token,
Token ID11
Feature activation+6.32e-01

Pos logit contributions
 stranger+0.893
 keeper+0.850
 stall+0.846
 seller+0.791
 ranger+0.791
Neg logit contributions
ug-0.950
idd-0.872
ooming-0.858
irled-0.833
ged-0.831
Token she
Token ID673
Feature activation+1.85e-01

Pos logit contributions
 stranger+0.842
 keeper+0.806
 stall+0.800
 seller+0.727
 ranger+0.727
Neg logit contributions
ug-1.435
idd-1.354
ooming-1.325
ged-1.302
irled-1.297
Token went
Token ID1816
Feature activation+6.35e-01

Pos logit contributions
 stranger+0.305
 keeper+0.290
 stall+0.287
 Heidi+0.272
 ranger+0.267
Neg logit contributions
ug-0.367
idd-0.346
ooming-0.333
irled-0.324
zz-0.324
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
 stranger+0.897
 keeper+0.845
 stall+0.824
 Heidi+0.795
 truth+0.790
Neg logit contributions
ug-1.394
idd-1.331
ooming-1.304
ips-1.264
zz-1.253
Token the
Token ID262
Feature activation+3.95e+00

Pos logit contributions
 stranger+1.742
 keeper+1.674
 stall+1.598
 seller+1.536
 ranger+1.520
Neg logit contributions
ug-2.285
idd-2.167
ooming-2.114
zz-2.070
ips-2.024
Token gate
Token ID8946
Feature activation+1.30e+00

Pos logit contributions
 Clover+5.556
 Heidi+5.452
 Stanley+5.381
 Sonia+5.247
 keeper+5.084
Neg logit contributions
idd-8.202
ooming-7.833
 sudden-7.625
ug-7.612
 stacking-7.560
Token and
Token ID290
Feature activation+7.41e-01

Pos logit contributions
 stranger+2.168
Stan+1.991
 Stanley+1.963
 stall+1.957
 Heidi+1.945
Neg logit contributions
ug-2.551
idd-2.367
ooming-2.367
ips-2.257
med-2.234
Token opened
Token ID4721
Feature activation+1.32e+00

Pos logit contributions
 stranger+1.261
 keeper+1.190
 stall+1.177
 truth+1.139
Stan+1.124
Neg logit contributions
ug-1.396
idd-1.343
ooming-1.263
zz-1.252
med-1.243
Token it
Token ID340
Feature activation+2.24e+00

Pos logit contributions
 stranger+2.317
 keeper+2.201
 stall+2.096
 ranger+2.073
 butcher+2.063
Neg logit contributions
ug-2.474
idd-2.342
med-2.251
ooming-2.246
irled-2.220
Token.
Token ID13
Feature activation+7.15e-01

Pos logit contributions
 stranger+4.302
Stan+3.854
 camel+3.832
 butcher+3.740
 Heidi+3.707
Neg logit contributions
ug-4.072
bed-3.851
med-3.847
ooming-3.812
idd-3.764
Token field
Token ID2214
Feature activation+1.81e+00

Pos logit contributions
 stranger+1.694
 Heidi+1.618
 keeper+1.616
Stan+1.600
 Stanley+1.527
Neg logit contributions
ug-3.285
idd-3.168
ooming-3.012
ips-2.896
zz-2.892
Token.
Token ID13
Feature activation-4.36e-02

Pos logit contributions
 stranger+2.957
 Heidi+2.781
Stan+2.716
 Stanley+2.663
 butcher+2.657
Neg logit contributions
ug-3.731
idd-3.523
ooming-3.317
med-3.218
ips-3.201
Token Johnny
Token ID15470
Feature activation+7.74e-01

Pos logit contributions
ug+0.085
idd+0.079
ooming+0.077
irled+0.074
med+0.074
Neg logit contributions
 stranger-0.075
 keeper-0.071
 stall-0.070
 Heidi-0.066
 seller-0.066
Token ran
Token ID4966
Feature activation+5.94e-01

Pos logit contributions
 stranger+1.194
 keeper+1.114
 stall+1.103
 Heidi+1.056
 truth+1.036
Neg logit contributions
ug-1.607
idd-1.543
ooming-1.491
med-1.452
irled-1.448
Token to
Token ID284
Feature activation+1.10e+00

Pos logit contributions
 stranger+0.906
 keeper+0.860
 stall+0.858
 truth+0.811
Stan+0.805
Neg logit contributions
ug-1.224
idd-1.159
ooming-1.123
med-1.096
ged-1.095
Token the
Token ID262
Feature activation+3.93e+00

Pos logit contributions
 stranger+1.710
 keeper+1.650
 stall+1.593
 ranger+1.529
 seller+1.525
Neg logit contributions
ug-2.240
idd-2.100
ooming-2.048
zz-1.989
ged-1.973
Token middle
Token ID3504
Feature activation+1.67e-01

Pos logit contributions
 Heidi+5.706
Stan+5.656
 keeper+5.435
isi+5.332
 stranger+5.286
Neg logit contributions
idd-7.914
ug-7.717
ooming-7.539
 spinning-7.504
bed-7.408
Token and
Token ID290
Feature activation+8.77e-01

Pos logit contributions
 stranger+0.327
 keeper+0.311
 stall+0.307
 Heidi+0.296
Stan+0.294
Neg logit contributions
ug-0.281
idd-0.259
ooming-0.251
med-0.240
zz-0.238
Token noticed
Token ID6810
Feature activation+8.46e-01

Pos logit contributions
 stranger+1.426
 keeper+1.357
 stall+1.352
 truth+1.316
Stan+1.275
Neg logit contributions
ug-1.710
idd-1.660
ooming-1.574
ged-1.563
med-1.553
Token a
Token ID257
Feature activation+9.41e-01

Pos logit contributions
 stranger+1.004
 stall+0.953
 keeper+0.948
 Heidi+0.871
Stan+0.862
Neg logit contributions
ug-2.074
idd-1.965
ooming-1.900
med-1.872
ged-1.851
Token big
Token ID1263
Feature activation+4.51e-01

Pos logit contributions
 stranger+1.176
 keeper+1.139
 Heidi+1.110
 stall+1.105
 Stanley+1.065
Neg logit contributions
ug-2.107
idd-2.052
ooming-1.970
zz-1.896
irled-1.864
Token day
Token ID1110
Feature activation+4.43e-01

Pos logit contributions
 stranger+3.022
 keeper+2.989
 stall+2.959
 seller+2.886
 Heidi+2.878
Neg logit contributions
ug-1.980
zz-1.694
idd-1.671
ooming-1.580
irled-1.526
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
 stranger+0.779
 keeper+0.747
 stall+0.733
 seller+0.691
 ranger+0.690
Neg logit contributions
ug-0.841
idd-0.778
ooming-0.761
ged-0.735
zz-0.730
Token he
Token ID339
Feature activation+6.68e-01

Pos logit contributions
 stranger+1.910
 keeper+1.807
 stall+1.792
 truth+1.713
 Clover+1.698
Neg logit contributions
ug-3.028
idd-2.777
ged-2.753
ooming-2.731
zz-2.658
Token went
Token ID1816
Feature activation+9.52e-01

Pos logit contributions
 stranger+1.005
 stall+0.950
 keeper+0.950
 Heidi+0.902
 Stanley+0.869
Neg logit contributions
ug-1.425
idd-1.363
ooming-1.306
irled-1.289
zz-1.278
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
 stranger+1.619
 keeper+1.568
 stall+1.539
 Heidi+1.481
Stan+1.463
Neg logit contributions
ug-1.806
ooming-1.681
idd-1.677
ips-1.615
zz-1.596
Token the
Token ID262
Feature activation+3.91e+00

Pos logit contributions
 stranger+2.319
 keeper+2.245
 stall+2.149
 seller+2.094
 ranger+2.039
Neg logit contributions
ug-2.993
idd-2.773
ooming-2.754
zz-2.706
 pract-2.584
Token forest
Token ID8222
Feature activation+1.34e+00

Pos logit contributions
 Heidi+6.314
 keeper+5.839
 stranger+5.833
 Clover+5.721
 Sonia+5.720
Neg logit contributions
idd-7.655
ug-7.520
ooming-7.249
zz-7.140
 stacking-7.130
Token and
Token ID290
Feature activation+1.99e+00

Pos logit contributions
 stranger+2.170
 Heidi+1.994
Stan+1.966
 keeper+1.963
 stall+1.949
Neg logit contributions
ug-2.703
idd-2.490
ooming-2.490
anged-2.348
zz-2.332
Token spotted
Token ID13489
Feature activation+2.06e+00

Pos logit contributions
 stranger+3.385
 stall+3.290
 truth+3.230
 keeper+3.191
Stan+3.133
Neg logit contributions
ug-3.608
idd-3.606
ged-3.439
zz-3.368
anged-3.358
Token a
Token ID257
Feature activation+9.65e-01

Pos logit contributions
 stranger+3.007
 stall+2.941
 keeper+2.913
Stan+2.650
 Hut+2.625
Neg logit contributions
ug-4.470
idd-4.279
med-4.063
ged-4.007
ooming-3.966
Token beautiful
Token ID4950
Feature activation+5.98e-01

Pos logit contributions
 stranger+1.187
 keeper+1.151
 stall+1.126
 Heidi+1.124
 Stanley+1.068
Neg logit contributions
ug-2.190
idd-2.124
ooming-2.044
zz-1.973
irled-1.920
TokenYou
Token ID1639
Feature activation+8.30e-01

Pos logit contributions
 stranger+1.134
 stall+1.074
 keeper+1.049
 Heidi+0.990
 Stanley+0.978
Neg logit contributions
ug-1.807
idd-1.740
ooming-1.664
med-1.627
zz-1.623
Token have
Token ID423
Feature activation+9.04e-01

Pos logit contributions
 stranger+1.348
 keeper+1.295
 stall+1.276
 Heidi+1.198
 ranger+1.197
Neg logit contributions
ug-1.674
idd-1.578
ooming-1.502
zz-1.483
irled-1.445
Token to
Token ID284
Feature activation+1.89e+00

Pos logit contributions
 stranger+1.223
 keeper+1.158
 stall+1.149
 seller+1.054
 Heidi+1.051
Neg logit contributions
ug-2.025
idd-1.950
ooming-1.855
ips-1.830
med-1.825
Token go
Token ID467
Feature activation+1.60e-01

Pos logit contributions
 stranger+3.028
 keeper+2.810
 stall+2.772
 ranger+2.570
keeper+2.559
Neg logit contributions
ug-3.838
idd-3.515
zz-3.508
ooming-3.431
irled-3.331
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
 stranger+0.226
 keeper+0.212
 stall+0.208
 Heidi+0.196
 seller+0.193
Neg logit contributions
ug-0.356
idd-0.337
ooming-0.329
zz-0.317
ips-0.316
Token the
Token ID262
Feature activation+3.90e+00

Pos logit contributions
 stranger+2.022
 keeper+1.910
 stall+1.865
 seller+1.792
holder+1.741
Neg logit contributions
ug-2.585
idd-2.377
ooming-2.350
zz-2.266
ips-2.265
Token west
Token ID7421
Feature activation+1.08e+00

Pos logit contributions
 Heidi+5.573
 stranger+5.467
 keeper+5.371
 Stanley+5.275
holder+5.085
Neg logit contributions
idd-7.823
ooming-7.464
ug-7.401
ips-7.386
zz-7.363
Token and
Token ID290
Feature activation+9.99e-01

Pos logit contributions
 stranger+1.735
 Heidi+1.573
 keeper+1.570
 stall+1.568
Stan+1.548
Neg logit contributions
ug-2.184
idd-2.010
ooming-1.956
ips-1.876
zz-1.870
Token twist
Token ID14528
Feature activation-2.68e-01

Pos logit contributions
 stranger+1.479
 stall+1.397
 keeper+1.386
Stan+1.298
 Heidi+1.295
Neg logit contributions
ug-2.069
idd-1.954
ooming-1.895
zz-1.841
irled-1.823
Token off
Token ID572
Feature activation+6.42e-02

Pos logit contributions
ug+0.542
idd+0.511
ooming+0.496
irled+0.476
zz+0.475
Neg logit contributions
 stranger-0.434
 keeper-0.413
 stall-0.407
 Heidi-0.382
 ranger-0.380
Token a
Token ID257
Feature activation+1.36e+00

Pos logit contributions
 stranger+0.098
 keeper+0.093
 stall+0.092
 Heidi+0.086
 ranger+0.085
Neg logit contributions
ug-0.135
idd-0.127
ooming-0.124
med-0.119
irled-0.119
Token,
Token ID11
Feature activation+8.22e-01

Pos logit contributions
 stranger+3.240
 keeper+3.128
 stall+2.925
 ranger+2.901
Stan+2.895
Neg logit contributions
ug-3.768
idd-3.468
ged-3.381
ips-3.361
ooming-3.352
Token so
Token ID523
Feature activation+1.12e+00

Pos logit contributions
 stranger+1.296
 keeper+1.232
 stall+1.199
Stan+1.139
 seller+1.130
Neg logit contributions
ug-1.641
idd-1.545
ooming-1.515
med-1.466
ged-1.461
Token she
Token ID673
Feature activation+1.50e+00

Pos logit contributions
 stranger+1.458
 keeper+1.439
 stall+1.370
 ranger+1.314
 seller+1.282
Neg logit contributions
ug-2.554
idd-2.427
ooming-2.383
irled-2.358
ged-2.354
Token went
Token ID1816
Feature activation+1.15e+00

Pos logit contributions
 stranger+2.516
 keeper+2.393
 stall+2.381
 Heidi+2.290
 truth+2.283
Neg logit contributions
ug-2.895
idd-2.792
irled-2.669
ged-2.602
zz-2.598
Token to
Token ID284
Feature activation+2.28e+00

Pos logit contributions
 stranger+1.681
 keeper+1.603
 stall+1.539
 Heidi+1.522
Stan+1.489
Neg logit contributions
ug-2.405
idd-2.340
ooming-2.304
ips-2.245
zz-2.219
Token the
Token ID262
Feature activation+3.89e+00

Pos logit contributions
 keeper+3.902
 stranger+3.884
 ranger+3.624
 stall+3.563
holder+3.535
Neg logit contributions
ug-3.874
idd-3.757
ooming-3.721
zz-3.687
 pract-3.631
Token kitchen
Token ID9592
Feature activation+1.37e+00

Pos logit contributions
 Heidi+6.534
 Stanley+6.146
 Sonia+6.075
 keeper+6.069
Stan+6.016
Neg logit contributions
idd-7.333
 stacking-6.505
ooming-6.463
 mar-6.229
 pol-6.211
Token.
Token ID13
Feature activation+1.66e+00

Pos logit contributions
 stranger+2.103
 keeper+1.939
 Heidi+1.888
Stan+1.859
 Stanley+1.836
Neg logit contributions
ug-2.814
idd-2.743
ooming-2.679
ips-2.581
zz-2.534
Token She
Token ID1375
Feature activation+4.62e-01

Pos logit contributions
 stranger+2.715
 keeper+2.703
 camel+2.485
 ranger+2.453
 seller+2.436
Neg logit contributions
ug-3.095
idd-3.092
ips-3.005
ooming-2.947
ged-2.946
Token spotted
Token ID13489
Feature activation+1.66e+00

Pos logit contributions
 stranger+0.685
 keeper+0.643
 stall+0.626
 Heidi+0.594
Stan+0.588
Neg logit contributions
ug-0.984
idd-0.949
ooming-0.906
irled-0.894
zz-0.888
Token a
Token ID257
Feature activation+6.10e-01

Pos logit contributions
 stranger+2.243
 keeper+2.238
 stall+2.153
 ranger+2.059
Stan+1.987
Neg logit contributions
ug-3.635
idd-3.565
med-3.455
ged-3.368
ooming-3.347
Token and
Token ID290
Feature activation+1.77e+00

Pos logit contributions
 stranger+0.769
 keeper+0.726
 stall+0.705
 Heidi+0.672
 ranger+0.661
Neg logit contributions
ug-1.209
idd-1.159
ooming-1.121
zz-1.087
irled-1.087
Token her
Token ID607
Feature activation+1.71e+00

Pos logit contributions
 keeper+2.631
 stranger+2.575
 ranger+2.460
 stall+2.458
 camel+2.322
Neg logit contributions
ug-3.670
zz-3.505
irled-3.423
ooming-3.402
idd-3.393
Token parents
Token ID3397
Feature activation+1.01e+00

Pos logit contributions
 stranger+2.513
 keeper+2.490
 Heidi+2.377
 ranger+2.343
 stall+2.334
Neg logit contributions
ug-3.388
idd-3.316
ooming-3.189
irled-3.122
zz-3.083
Token went
Token ID1816
Feature activation+7.27e-01

Pos logit contributions
 stranger+1.653
 keeper+1.611
 stall+1.548
 Heidi+1.505
 ranger+1.503
Neg logit contributions
ug-2.017
idd-1.908
ooming-1.813
ged-1.801
zz-1.793
Token to
Token ID284
Feature activation+1.43e+00

Pos logit contributions
 stranger+1.051
 keeper+1.002
 stall+0.961
 Heidi+0.937
Stan+0.922
Neg logit contributions
ug-1.572
idd-1.500
ooming-1.476
ips-1.430
med-1.405
Token the
Token ID262
Feature activation+3.89e+00

Pos logit contributions
 stranger+2.368
 keeper+2.310
 stall+2.162
 ranger+2.142
 camel+2.129
Neg logit contributions
ug-2.656
idd-2.552
ooming-2.480
zz-2.382
ips-2.342
Token market
Token ID1910
Feature activation+2.21e+00

Pos logit contributions
 Heidi+4.074
 camel+3.721
oto+3.701
 Clover+3.688
 keeper+3.681
Neg logit contributions
idd-9.545
 stacking-8.778
ooming-8.765
ug-8.439
 pract-8.366
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
 stranger+4.260
 camel+3.921
Stan+3.862
 Heidi+3.824
 Stanley+3.807
Neg logit contributions
ug-3.793
idd-3.728
ooming-3.586
ips-3.437
bed-3.379
Token Jane
Token ID12091
Feature activation+7.53e-01

Pos logit contributions
 stranger+1.705
 keeper+1.601
Stan+1.546
 stall+1.532
 camel+1.523
Neg logit contributions
ug-2.053
idd-1.968
irled-1.855
ooming-1.853
anged-1.842
Token tried
Token ID3088
Feature activation+7.85e-01

Pos logit contributions
 stranger+1.140
 keeper+1.056
 stall+1.022
 Heidi+0.998
 truth+0.982
Neg logit contributions
ug-1.590
idd-1.543
ooming-1.473
irled-1.430
ged-1.424
Token to
Token ID284
Feature activation+1.99e+00

Pos logit contributions
 stranger+1.120
 keeper+1.074
 stall+1.037
 seller+0.991
 camel+0.978
Neg logit contributions
ug-1.686
idd-1.590
ooming-1.583
ips-1.537
med-1.520
Token computer
Token ID3644
Feature activation-6.01e-01

Pos logit contributions
ug+2.688
idd+2.435
ooming+2.367
med+2.363
ips+2.336
Neg logit contributions
 stranger-2.186
 stall-2.068
 keeper-2.047
 seller-1.934
 bench-1.917
Token.
Token ID13
Feature activation-4.04e-01

Pos logit contributions
ug+1.419
idd+1.358
ooming+1.315
irled+1.279
zz+1.279
Neg logit contributions
 stranger-0.771
 keeper-0.728
 stall-0.721
 seller-0.662
 ranger-0.656
Token He
Token ID679
Feature activation-1.68e+00

Pos logit contributions
ug+1.049
idd+1.002
ooming+0.982
irled+0.951
med+0.951
Neg logit contributions
 stranger-0.424
 keeper-0.396
 stall-0.388
 Heidi-0.349
 ranger-0.347
Token would
Token ID561
Feature activation-1.15e+00

Pos logit contributions
ug+4.730
ooming+4.550
idd+4.535
ips+4.447
med+4.343
Neg logit contributions
 stranger-1.438
 keeper-1.287
 stall-1.258
 seller-1.179
 bench-1.132
Token spend
Token ID4341
Feature activation-2.76e+00

Pos logit contributions
ug+2.591
idd+2.484
ooming+2.411
ips+2.335
med+2.319
Neg logit contributions
 stranger-1.652
 keeper-1.536
 stall-1.533
 seller-1.448
 bench-1.432
Token hours
Token ID2250
Feature activation-4.01e+00

Pos logit contributions
ug+7.428
idd+7.307
ooming+7.027
anged+6.792
zz+6.650
Neg logit contributions
 stranger-3.103
 stall-2.913
 bench-2.840
 keeper-2.730
 Heidi-2.647
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
ug+8.287
idd+8.209
irled+7.879
ooming+7.685
anged+7.657
Neg logit contributions
 stall-6.621
 nearby-6.500
 stranger-6.277
 bench-6.149
 keeper-6.048
Token hours
Token ID2250
Feature activation-3.91e+00

Pos logit contributions
ug+3.358
idd+3.188
ooming+3.083
ips+2.927
irled+2.897
Neg logit contributions
 stranger-2.326
 keeper-2.224
 stall-2.175
 bench-2.087
 ranger-2.078
Token every
Token ID790
Feature activation-1.98e+00

Pos logit contributions
ug+9.615
idd+9.610
irled+9.422
ooming+9.255
zz+9.090
Neg logit contributions
 stall-5.007
 nearby-4.726
 stranger-4.651
 bench-4.614
 seller-4.555
Token day
Token ID1110
Feature activation-2.19e+00

Pos logit contributions
ug+4.193
idd+4.029
ooming+3.881
ips+3.754
irled+3.726
Neg logit contributions
 stranger-3.472
 stall-3.164
 keeper-3.076
 bench-3.006
 ranger-2.864
Token playing
Token ID2712
Feature activation-1.52e+00

Pos logit contributions
ug+5.883
idd+5.775
irled+5.666
ooming+5.617
zz+5.591
Neg logit contributions
 stranger-2.133
 stall-2.022
 keeper-1.942
 Stanley-1.745
 bench-1.716
Token day
Token ID1110
Feature activation-7.82e-01

Pos logit contributions
ug+0.946
idd+0.893
ooming+0.863
med+0.820
ips+0.818
Neg logit contributions
 stranger-0.982
 keeper-0.933
 stall-0.926
 Heidi-0.870
 ranger-0.867
Token Mill
Token ID9212
Feature activation-9.74e-01

Pos logit contributions
ug+1.698
idd+1.630
ooming+1.576
ips+1.532
irled+1.526
Neg logit contributions
 stranger-1.164
 keeper-1.087
 stall-1.066
 Heidi-1.020
 Stanley-0.996
Tokenie
Token ID494
Feature activation-1.48e+00

Pos logit contributions
ug+1.740
idd+1.644
ooming+1.586
ips+1.501
med+1.493
Neg logit contributions
 stranger-1.809
 keeper-1.713
 stall-1.691
 seller-1.607
 Heidi-1.603
Token would
Token ID561
Feature activation-9.45e-01

Pos logit contributions
ug+4.270
idd+4.133
ooming+4.050
ips+3.986
med+3.934
Neg logit contributions
 stranger-1.100
 stall-0.959
 keeper-0.942
 seller-0.843
 bench-0.791
Token spend
Token ID4341
Feature activation-2.48e+00

Pos logit contributions
ug+2.252
idd+2.175
ooming+2.106
ips+2.051
med+2.047
Neg logit contributions
 stranger-1.194
 keeper-1.103
 stall-1.100
 bench-1.010
 seller-1.007
Token hours
Token ID2250
Feature activation-3.94e+00

Pos logit contributions
ug+6.786
idd+6.667
ooming+6.406
anged+6.169
zz+6.035
Neg logit contributions
 stranger-2.687
 stall-2.423
 bench-2.385
 Heidi-2.324
 keeper-2.273
Token making
Token ID1642
Feature activation-1.78e+00

Pos logit contributions
ug+8.038
idd+8.007
irled+7.919
anged+7.424
ooming+7.372
Neg logit contributions
 stall-6.430
 nearby-6.227
 stranger-6.223
 bench-6.093
 keeper-5.927
Token new
Token ID649
Feature activation-2.25e+00

Pos logit contributions
ug+4.293
idd+4.196
ooming+4.046
anged+3.907
ged+3.892
Neg logit contributions
 stranger-2.227
 keeper-2.050
 stall-1.985
 truth-1.901
 bench-1.868
Token routes
Token ID11926
Feature activation-6.20e-01

Pos logit contributions
ug+4.897
idd+4.738
ged+4.588
zz+4.550
ips+4.530
Neg logit contributions
 stranger-3.386
 stall-3.274
 keeper-3.264
 bench-2.967
 truth-2.922
Token for
Token ID329
Feature activation-1.59e+00

Pos logit contributions
ug+1.315
idd+1.256
ooming+1.213
irled+1.170
zz+1.165
Neg logit contributions
 stranger-0.931
 keeper-0.893
 stall-0.871
 ranger-0.813
 seller-0.808
Token the
Token ID262
Feature activation-1.52e+00

Pos logit contributions
ug+4.259
idd+4.176
ooming+4.052
irled+3.932
zz+3.913
Neg logit contributions
 stranger-1.613
 keeper-1.452
 stall-1.428
 truth-1.336
 Heidi-1.314
Tokenbe
Token ID1350
Feature activation-2.98e+00

Pos logit contributions
ug+2.943
irled+2.791
ooming+2.750
ips+2.693
idd+2.665
Neg logit contributions
 keeper-6.501
 stranger-6.492
 ranger-6.150
 stall-6.056
 nearby-6.018
Tokenep
Token ID538
Feature activation-1.20e+00

Pos logit contributions
idd+4.645
ug+4.490
ooming+4.239
ips+3.955
anged+3.913
Neg logit contributions
 stranger-5.918
 keeper-5.851
 stall-5.660
 truth-5.480
 bench-5.457
Token.
Token ID13
Feature activation+6.63e-01

Pos logit contributions
ug+2.653
idd+2.593
ooming+2.479
irled+2.434
ged+2.404
Neg logit contributions
 stranger-1.618
 keeper-1.615
 stall-1.501
 ranger-1.431
 seller-1.405
Token They
Token ID1119
Feature activation-1.49e+00

Pos logit contributions
 stranger+1.034
 keeper+0.986
 stall+0.932
 ranger+0.897
 camel+0.885
Neg logit contributions
ug-1.364
idd-1.296
ooming-1.253
ips-1.241
irled-1.230
Token have
Token ID423
Feature activation-7.92e-01

Pos logit contributions
ug+3.505
idd+3.370
ooming+3.304
ips+3.174
irled+3.173
Neg logit contributions
 stranger-1.983
 keeper-1.844
 stall-1.801
 seller-1.738
 bench-1.689
Token fun
Token ID1257
Feature activation-3.93e+00

Pos logit contributions
ug+2.009
idd+1.915
ooming+1.888
irled+1.811
zz+1.808
Neg logit contributions
 stranger-0.890
 keeper-0.834
 stall-0.818
 ranger-0.740
 truth-0.732
Token and
Token ID290
Feature activation-1.80e+00

Pos logit contributions
idd+10.784
ug+10.671
anged+10.398
irled+10.218
zz+10.146
Neg logit contributions
 stranger-3.936
 nearby-3.771
 bench-3.718
 keeper-3.684
 stall-3.659
Token laugh
Token ID6487
Feature activation+1.24e-01

Pos logit contributions
ug+3.865
idd+3.754
ooming+3.679
ips+3.417
irled+3.415
Neg logit contributions
 stranger-2.786
 keeper-2.707
 stall-2.581
 ranger-2.511
 bench-2.499
Token a
Token ID257
Feature activation-6.69e-01

Pos logit contributions
 stranger+0.199
 keeper+0.189
 stall+0.187
 Heidi+0.176
 ranger+0.174
Neg logit contributions
ug-0.253
idd-0.236
ooming-0.231
med-0.224
irled-0.222
Token lot
Token ID1256
Feature activation+4.41e-01

Pos logit contributions
ug+1.236
idd+1.156
ooming+1.117
ips+1.075
irled+1.069
Neg logit contributions
 stranger-1.232
 keeper-1.164
 stall-1.160
 ranger-1.083
 bench-1.082
Token.
Token ID13
Feature activation+3.12e-01

Pos logit contributions
 stranger+0.608
 keeper+0.562
 stall+0.561
 Heidi+0.534
Stan+0.526
Neg logit contributions
ug-1.006
idd-0.931
ooming-0.914
med-0.885
zz-0.884
Token arrow
Token ID15452
Feature activation-6.35e-01

Pos logit contributions
ug+1.752
idd+1.680
ooming+1.634
ips+1.583
irled+1.569
Neg logit contributions
 stranger-1.000
 keeper-0.934
 stall-0.923
 seller-0.855
 ranger-0.847
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
ug+1.532
idd+1.466
ooming+1.427
irled+1.386
zz+1.385
Neg logit contributions
 stranger-0.783
 keeper-0.737
 stall-0.721
 seller-0.671
 ranger-0.661
Token He
Token ID679
Feature activation-1.26e+00

Pos logit contributions
ug+0.556
idd+0.530
ooming+0.519
irled+0.503
med+0.502
Neg logit contributions
 stranger-0.231
 keeper-0.214
 stall-0.210
 Heidi-0.189
 ranger-0.188
Token would
Token ID561
Feature activation-1.03e+00

Pos logit contributions
ug+3.399
idd+3.260
ooming+3.226
ips+3.152
med+3.088
Neg logit contributions
 stranger-1.218
 keeper-1.133
 stall-1.095
 seller-1.034
 bench-0.996
Token spend
Token ID4341
Feature activation-2.34e+00

Pos logit contributions
ug+2.248
idd+2.159
ooming+2.081
ips+2.017
med+2.001
Neg logit contributions
 stranger-1.554
 keeper-1.459
 stall-1.454
 seller-1.382
 bench-1.359
Token hours
Token ID2250
Feature activation-3.93e+00

Pos logit contributions
ug+6.167
idd+6.033
ooming+5.820
anged+5.554
zz+5.533
Neg logit contributions
 stranger-2.677
 stall-2.454
 keeper-2.372
 bench-2.345
 Heidi-2.292
Token shooting
Token ID4395
Feature activation-1.31e+00

Pos logit contributions
ug+8.162
idd+8.097
irled+7.749
zz+7.639
apy+7.604
Neg logit contributions
 stall-6.415
 nearby-6.240
 stranger-6.190
 bench-5.952
 seller-5.828
Token arrows
Token ID20507
Feature activation-7.44e-01

Pos logit contributions
ug+2.877
idd+2.839
ooming+2.714
ips+2.610
irled+2.609
Neg logit contributions
 stranger-1.918
 keeper-1.788
 stall-1.761
 truth-1.611
 ranger-1.605
Token at
Token ID379
Feature activation-1.50e+00

Pos logit contributions
ug+1.467
idd+1.405
ooming+1.343
irled+1.312
zz+1.286
Neg logit contributions
 stranger-1.236
 keeper-1.174
 stall-1.160
 seller-1.091
 ranger-1.081
Token targets
Token ID6670
Feature activation-1.00e+00

Pos logit contributions
ug+3.713
idd+3.583
ooming+3.530
irled+3.409
med+3.381
Neg logit contributions
 stranger-1.789
 keeper-1.705
 stall-1.695
 truth-1.529
 ranger-1.519
Token in
Token ID287
Feature activation-1.31e+00

Pos logit contributions
ug+2.156
idd+2.082
ooming+2.005
irled+1.961
zz+1.956
Neg logit contributions
 stranger-1.477
 keeper-1.397
 stall-1.381
 seller-1.282
 ranger-1.279
Token He
Token ID679
Feature activation-1.68e+00

Pos logit contributions
ug+1.049
idd+1.002
ooming+0.982
irled+0.951
med+0.951
Neg logit contributions
 stranger-0.424
 keeper-0.396
 stall-0.388
 Heidi-0.349
 ranger-0.347
Token would
Token ID561
Feature activation-1.15e+00

Pos logit contributions
ug+4.730
ooming+4.550
idd+4.535
ips+4.447
med+4.343
Neg logit contributions
 stranger-1.438
 keeper-1.287
 stall-1.258
 seller-1.179
 bench-1.132
Token spend
Token ID4341
Feature activation-2.76e+00

Pos logit contributions
ug+2.591
idd+2.484
ooming+2.411
ips+2.335
med+2.319
Neg logit contributions
 stranger-1.652
 keeper-1.536
 stall-1.533
 seller-1.448
 bench-1.432
Token hours
Token ID2250
Feature activation-4.01e+00

Pos logit contributions
ug+7.428
idd+7.307
ooming+7.027
anged+6.792
zz+6.650
Neg logit contributions
 stranger-3.103
 stall-2.913
 bench-2.840
 keeper-2.730
 Heidi-2.647
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
ug+8.287
idd+8.209
irled+7.879
ooming+7.685
anged+7.657
Neg logit contributions
 stall-6.621
 nearby-6.500
 stranger-6.277
 bench-6.149
 keeper-6.048
Token hours
Token ID2250
Feature activation-3.91e+00

Pos logit contributions
ug+3.358
idd+3.188
ooming+3.083
ips+2.927
irled+2.897
Neg logit contributions
 stranger-2.326
 keeper-2.224
 stall-2.175
 bench-2.087
 ranger-2.078
Token every
Token ID790
Feature activation-1.98e+00

Pos logit contributions
ug+9.615
idd+9.610
irled+9.422
ooming+9.255
zz+9.090
Neg logit contributions
 stall-5.007
 nearby-4.726
 stranger-4.651
 bench-4.614
 seller-4.555
Token day
Token ID1110
Feature activation-2.19e+00

Pos logit contributions
ug+4.193
idd+4.029
ooming+3.881
ips+3.754
irled+3.726
Neg logit contributions
 stranger-3.472
 stall-3.164
 keeper-3.076
 bench-3.006
 ranger-2.864
Token playing
Token ID2712
Feature activation-1.52e+00

Pos logit contributions
ug+5.883
idd+5.775
irled+5.666
ooming+5.617
zz+5.591
Neg logit contributions
 stranger-2.133
 stall-2.022
 keeper-1.942
 Stanley-1.745
 bench-1.716
Token games
Token ID1830
Feature activation-2.79e-01

Pos logit contributions
ug+3.269
idd+3.143
ooming+2.973
anged+2.948
zz+2.940
Neg logit contributions
 stranger-2.358
 stall-2.195
 keeper-2.187
 ranger-2.029
 bench-2.024
Token and
Token ID290
Feature activation-2.04e+00

Pos logit contributions
ug+0.594
idd+0.562
ooming+0.546
irled+0.527
zz+0.525
Neg logit contributions
 stranger-0.422
 keeper-0.401
 stall-0.395
 Heidi-0.367
 ranger-0.367
Token soft
Token ID2705
Feature activation-2.38e+00

Pos logit contributions
ug+5.143
idd+4.974
ooming+4.825
ips+4.596
irled+4.572
Neg logit contributions
 keeper-2.891
 stranger-2.773
 ranger-2.691
 stall-2.465
 bench-2.448
Token.
Token ID13
Feature activation-8.56e-01

Pos logit contributions
ug+6.463
idd+6.409
ooming+6.096
irled+5.906
ips+5.829
Neg logit contributions
 stranger-2.260
 keeper-2.217
 ranger-2.005
 stall-1.990
 bench-1.910
Token They
Token ID1119
Feature activation-1.52e+00

Pos logit contributions
ug+1.602
idd+1.487
ooming+1.482
zz+1.400
med+1.397
Neg logit contributions
 stranger-1.524
 keeper-1.492
 stall-1.451
 ranger-1.403
 Heidi-1.383
Token both
Token ID1111
Feature activation-2.10e+00

Pos logit contributions
ug+3.218
idd+3.093
ooming+3.013
ips+2.892
irled+2.808
Neg logit contributions
 stranger-2.322
 keeper-2.242
 stall-2.124
 seller-2.086
 ranger-2.080
Token had
Token ID550
Feature activation-3.99e-01

Pos logit contributions
ug+4.485
idd+4.322
ooming+4.303
ips+4.148
med+3.909
Neg logit contributions
 stranger-3.195
 keeper-3.095
 ranger-2.878
 bench-2.853
 stall-2.833
Token fun
Token ID1257
Feature activation-3.88e+00

Pos logit contributions
ug+0.914
idd+0.867
ooming+0.846
zz+0.815
irled+0.814
Neg logit contributions
 stranger-0.543
 keeper-0.514
 stall-0.503
 ranger-0.466
 Heidi-0.465
Token and
Token ID290
Feature activation-2.06e+00

Pos logit contributions
idd+10.403
ug+10.388
zz+9.692
ips+9.573
ka+9.519
Neg logit contributions
 bench-3.886
 stranger-3.882
 nearby-3.749
 truth-3.725
 keeper-3.666
Token were
Token ID547
Feature activation-1.85e+00

Pos logit contributions
ug+4.578
idd+4.474
ooming+4.455
ips+4.201
irled+4.070
Neg logit contributions
 keeper-2.993
 stranger-2.986
 ranger-2.822
 bench-2.615
 stall-2.596
Token happy
Token ID3772
Feature activation-2.29e+00

Pos logit contributions
ug+4.718
idd+4.513
ooming+4.473
ged+4.325
zz+4.309
Neg logit contributions
 keeper-2.075
 stranger-2.055
 ranger-1.837
 stall-1.829
 Heidi-1.728
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
ug+6.056
idd+5.638
ooming+5.492
zz+5.468
irled+5.467
Neg logit contributions
 stranger-2.677
 keeper-2.602
 ranger-2.428
 truth-2.277
 stall-2.143
Token The
Token ID383
Feature activation-1.37e+00

Pos logit contributions
ug+0.491
idd+0.461
ooming+0.448
irled+0.429
med+0.429
Neg logit contributions
 stranger-0.431
 keeper-0.412
 stall-0.406
 Heidi-0.382
 ranger-0.381
Token help
Token ID1037
Feature activation-2.01e-01

Pos logit contributions
ug+1.291
idd+1.231
ooming+1.184
irled+1.148
med+1.144
Neg logit contributions
 stranger-0.921
 keeper-0.872
 stall-0.846
 seller-0.796
 truth-0.795
Token her
Token ID607
Feature activation-1.14e+00

Pos logit contributions
ug+0.501
idd+0.477
ooming+0.466
irled+0.452
med+0.451
Neg logit contributions
 stranger-0.233
 keeper-0.217
 stall-0.212
 Heidi-0.194
 ranger-0.192
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
ug+2.796
idd+2.718
ooming+2.538
med+2.517
ged+2.500
Neg logit contributions
 stranger-1.367
 keeper-1.302
 stall-1.234
 truth-1.176
 bench-1.158
Token They
Token ID1119
Feature activation-6.17e-01

Pos logit contributions
 stranger+0.473
 keeper+0.440
 stall+0.438
Stan+0.414
 Heidi+0.413
Neg logit contributions
ug-0.640
idd-0.602
ooming-0.580
med-0.565
irled-0.563
Token all
Token ID477
Feature activation-1.89e+00

Pos logit contributions
ug+1.341
idd+1.271
ooming+1.243
med+1.188
irled+1.188
Neg logit contributions
 stranger-0.912
 keeper-0.858
 stall-0.839
 seller-0.792
 ranger-0.784
Token kept
Token ID4030
Feature activation-3.86e+00

Pos logit contributions
ug+4.093
idd+3.981
ooming+3.910
ips+3.707
med+3.621
Neg logit contributions
 stranger-2.864
 keeper-2.670
 ranger-2.462
 stall-2.457
 bench-2.420
Token spinning
Token ID19493
Feature activation-2.22e+00

Pos logit contributions
ug+8.474
idd+7.917
ooming+7.294
akes+7.225
anged+7.100
Neg logit contributions
 stranger-6.206
 nearby-6.168
 keeper-5.990
 bench-5.878
 truth-5.728
Token,
Token ID11
Feature activation-2.49e+00

Pos logit contributions
idd+5.403
ug+5.318
ooming+4.951
anged+4.943
ips+4.824
Neg logit contributions
 stranger-2.663
 keeper-2.632
 ranger-2.362
 stall-2.314
 nearby-2.303
Token but
Token ID475
Feature activation-7.72e-02

Pos logit contributions
ug+6.681
idd+6.548
ooming+6.102
irled+6.017
anged+5.940
Neg logit contributions
 stranger-2.591
 keeper-2.568
 bench-2.421
 ranger-2.325
 stall-2.271
Token she
Token ID673
Feature activation-9.91e-01

Pos logit contributions
ug+0.159
idd+0.150
ooming+0.145
irled+0.140
med+0.140
Neg logit contributions
 stranger-0.123
 keeper-0.116
 stall-0.115
 Heidi-0.107
 seller-0.107
Token still
Token ID991
Feature activation-1.48e+00

Pos logit contributions
ug+2.243
idd+2.114
ooming+2.091
med+2.020
ips+2.018
Neg logit contributions
 stranger-1.365
 keeper-1.304
 stall-1.230
 seller-1.179
 ranger-1.178
Tokenep
Token ID538
Feature activation-9.48e-01

Pos logit contributions
ug+4.268
ooming+4.119
anged+3.934
idd+3.857
zz+3.783
Neg logit contributions
 keeper-5.139
 stranger-5.110
 stall-4.932
 nearby-4.889
 ranger-4.750
Token sounds
Token ID5238
Feature activation-1.72e+00

Pos logit contributions
ug+2.144
idd+2.068
ooming+2.001
irled+1.948
ips+1.929
Neg logit contributions
 stranger-1.265
 keeper-1.235
 stall-1.162
 ranger-1.099
 seller-1.077
Token.
Token ID13
Feature activation-7.93e-01

Pos logit contributions
ug+4.408
idd+4.303
ooming+4.190
anged+4.095
irled+4.091
Neg logit contributions
 stranger-1.939
 keeper-1.764
 stall-1.705
 bench-1.530
 truth-1.526
Token They
Token ID1119
Feature activation-8.16e-01

Pos logit contributions
ug+1.810
idd+1.707
ooming+1.704
irled+1.632
zz+1.631
Neg logit contributions
 stranger-1.068
 keeper-1.018
 stall-1.001
 Heidi-0.942
 ranger-0.933
Token have
Token ID423
Feature activation-1.03e+00

Pos logit contributions
ug+1.841
idd+1.746
ooming+1.706
irled+1.641
med+1.635
Neg logit contributions
 stranger-1.151
 keeper-1.084
 stall-1.061
 seller-1.002
 ranger-0.983
Token fun
Token ID1257
Feature activation-3.85e+00

Pos logit contributions
ug+2.519
idd+2.376
ooming+2.357
zz+2.258
irled+2.251
Neg logit contributions
 stranger-1.282
 keeper-1.208
 stall-1.183
 truth-1.099
 ranger-1.089
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
ug+10.610
idd+10.393
zz+10.000
anged+9.920
irled+9.916
Neg logit contributions
 stranger-3.821
 nearby-3.682
 bench-3.608
 keeper-3.548
 stall-3.531
Token but
Token ID475
Feature activation-4.19e-01

Pos logit contributions
ug+2.866
idd+2.763
ooming+2.672
irled+2.613
med+2.583
Neg logit contributions
 stranger-1.524
 keeper-1.447
 stall-1.405
 ranger-1.321
 bench-1.314
Token they
Token ID484
Feature activation+7.71e-02

Pos logit contributions
ug+0.936
idd+0.886
ooming+0.868
irled+0.833
zz+0.833
Neg logit contributions
 stranger-0.598
 keeper-0.568
 stall-0.555
 Heidi-0.517
 ranger-0.516
Token also
Token ID635
Feature activation-9.88e-01

Pos logit contributions
 stranger+0.113
 keeper+0.107
 stall+0.105
 Heidi+0.099
 ranger+0.097
Neg logit contributions
ug-0.167
idd-0.158
ooming-0.154
zz-0.149
irled-0.148
Token want
Token ID765
Feature activation+3.11e-01

Pos logit contributions
ug+2.239
idd+2.113
ooming+2.075
med+1.988
irled+1.986
Neg logit contributions
 stranger-1.393
 keeper-1.323
 stall-1.287
 ranger-1.204
 seller-1.199
Token her
Token ID607
Feature activation-1.35e+00

Pos logit contributions
ug+1.361
idd+1.292
ooming+1.266
zz+1.221
irled+1.220
Neg logit contributions
 stranger-0.736
 keeper-0.690
 stall-0.675
 Heidi-0.629
 ranger-0.620
Token.
Token ID13
Feature activation-3.09e-01

Pos logit contributions
ug+2.838
idd+2.676
ooming+2.563
zz+2.511
irled+2.504
Neg logit contributions
 stranger-2.150
 keeper-2.077
 stall-2.026
 ranger-1.910
 seller-1.907
Token They
Token ID1119
Feature activation-7.88e-01

Pos logit contributions
ug+0.747
idd+0.709
ooming+0.694
irled+0.671
zz+0.670
Neg logit contributions
 stranger-0.379
 keeper-0.357
 stall-0.350
 Heidi-0.322
 ranger-0.320
Token both
Token ID1111
Feature activation-1.44e+00

Pos logit contributions
ug+1.493
idd+1.402
ooming+1.377
ips+1.306
irled+1.286
Neg logit contributions
 stranger-1.388
 keeper-1.327
 stall-1.310
 seller-1.247
 ranger-1.241
Token had
Token ID550
Feature activation+6.97e-03

Pos logit contributions
ug+3.233
ooming+3.082
idd+3.075
ips+2.966
irled+2.844
Neg logit contributions
 stranger-2.022
 keeper-1.922
 stall-1.861
 ranger-1.766
 seller-1.756
Token fun
Token ID1257
Feature activation-3.84e+00

Pos logit contributions
 stranger+0.009
 keeper+0.009
 stall+0.009
 Heidi+0.008
 seller+0.008
Neg logit contributions
ug-0.016
idd-0.015
ooming-0.015
med-0.014
irled-0.014
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
ug+9.769
idd+9.557
zz+9.038
ooming+8.770
anged+8.749
Neg logit contributions
 stranger-4.502
 nearby-4.456
 bench-4.439
 stall-4.102
 keeper-4.047
Token created
Token ID2727
Feature activation-3.34e-01

Pos logit contributions
ug+2.866
idd+2.737
ooming+2.720
ips+2.574
zz+2.527
Neg logit contributions
 stranger-2.055
 keeper-2.009
 stall-1.883
 ranger-1.873
 Heidi-1.805
Token many
Token ID867
Feature activation-1.45e+00

Pos logit contributions
ug+0.866
idd+0.826
ooming+0.808
zz+0.783
irled+0.781
Neg logit contributions
 stranger-0.350
 keeper-0.325
 stall-0.317
 Heidi-0.286
 ranger-0.283
Token more
Token ID517
Feature activation-2.31e+00

Pos logit contributions
ug+3.642
idd+3.431
ooming+3.345
zz+3.343
ged+3.285
Neg logit contributions
 stranger-1.804
 keeper-1.653
 stall-1.608
 truth-1.494
 bench-1.466
Token things
Token ID1243
Feature activation-1.54e-01

Pos logit contributions
ug+5.740
idd+5.410
ooming+5.314
ged+5.259
zz+5.224
Neg logit contributions
 stranger-3.125
 keeper-2.850
 stall-2.757
 ranger-2.613
 bench-2.612
Token
Token ID198
Feature activation-5.45e-01

Pos logit contributions
ug+1.490
idd+1.421
ooming+1.383
irled+1.343
med+1.340
Neg logit contributions
 stranger-0.750
 keeper-0.702
 stall-0.691
 Heidi-0.646
 ranger-0.633
Token
Token ID198
Feature activation+4.21e-02

Pos logit contributions
ug+1.301
idd+1.243
ooming+1.214
med+1.173
irled+1.172
Neg logit contributions
 stranger-0.673
 keeper-0.636
 stall-0.620
 Heidi-0.571
 ranger-0.570
TokenThey
Token ID2990
Feature activation-1.14e+00

Pos logit contributions
 stranger+0.057
 keeper+0.054
 stall+0.053
 Heidi+0.049
 seller+0.049
Neg logit contributions
ug-0.098
idd-0.092
ooming-0.090
zz-0.087
irled-0.087
Token were
Token ID547
Feature activation-1.18e+00

Pos logit contributions
ug+2.327
idd+2.194
ooming+2.128
ips+2.058
irled+2.002
Neg logit contributions
 stranger-1.850
 keeper-1.731
 stall-1.730
 seller-1.650
 bench-1.641
Token having
Token ID1719
Feature activation-5.03e-01

Pos logit contributions
ug+2.613
idd+2.453
ooming+2.420
zz+2.308
ged+2.286
Neg logit contributions
 stranger-1.718
 keeper-1.664
 stall-1.635
 Heidi-1.523
 ranger-1.511
Token fun
Token ID1257
Feature activation-3.84e+00

Pos logit contributions
ug+1.134
idd+1.075
ooming+1.051
zz+1.011
irled+1.007
Neg logit contributions
 stranger-0.706
 keeper-0.663
 stall-0.653
 Heidi-0.606
 truth-0.604
Token with
Token ID351
Feature activation-1.07e+00

Pos logit contributions
ug+10.814
idd+10.167
anged+9.857
zz+9.708
ooming+9.526
Neg logit contributions
 bench-3.616
 nearby-3.546
 stranger-3.511
 stall-3.342
 keeper-3.200
Token their
Token ID511
Feature activation-2.16e+00

Pos logit contributions
ug+2.491
idd+2.384
ooming+2.342
zz+2.230
irled+2.227
Neg logit contributions
 stranger-1.423
 keeper-1.319
 stall-1.318
 Heidi-1.218
 bench-1.211
Token paper
Token ID3348
Feature activation-1.02e+00

Pos logit contributions
ug+5.080
idd+4.716
med+4.565
ged+4.502
ooming+4.497
Neg logit contributions
 stranger-3.180
 stall-2.990
 keeper-2.953
 bench-2.849
 seller-2.720
Token toys
Token ID14958
Feature activation-3.66e-01

Pos logit contributions
ug+2.144
idd+2.031
ooming+1.957
irled+1.901
ged+1.894
Neg logit contributions
 stranger-1.572
 keeper-1.528
 stall-1.491
 seller-1.404
 ranger-1.394
Token.
Token ID13
Feature activation-3.69e-01

Pos logit contributions
ug+0.823
idd+0.780
ooming+0.759
irled+0.735
med+0.731
Neg logit contributions
 stranger-0.511
 keeper-0.484
 stall-0.476
 ranger-0.440
 seller-0.438
TokenThe
Token ID464
Feature activation-7.39e-01

Pos logit contributions
 stranger+0.048
 keeper+0.045
 stall+0.044
 Heidi+0.040
 seller+0.040
Neg logit contributions
ug-0.099
idd-0.094
ooming-0.092
med-0.089
irled-0.089
Token cats
Token ID11875
Feature activation-8.70e-01

Pos logit contributions
ug+1.629
idd+1.533
ooming+1.499
zz+1.455
irled+1.451
Neg logit contributions
 stranger-1.109
 keeper-1.049
 stall-1.030
 ranger-0.961
 seller-0.960
Token had
Token ID550
Feature activation-3.06e-01

Pos logit contributions
ug+1.788
idd+1.700
ooming+1.653
ips+1.600
irled+1.590
Neg logit contributions
 stranger-1.378
 keeper-1.302
 stall-1.279
 seller-1.215
 ranger-1.192
Token lots
Token ID6041
Feature activation-2.41e+00

Pos logit contributions
ug+0.747
idd+0.711
ooming+0.695
irled+0.672
zz+0.671
Neg logit contributions
 stranger-0.368
 keeper-0.344
 stall-0.337
 Heidi-0.308
 ranger-0.307
Token of
Token ID286
Feature activation-2.72e+00

Pos logit contributions
idd+5.089
ug+5.088
zz+4.785
irled+4.767
ooming+4.738
Neg logit contributions
 stranger-3.982
 keeper-3.618
 stall-3.501
 ranger-3.405
 truth-3.362
Token fun
Token ID1257
Feature activation-3.80e+00

Pos logit contributions
ug+7.028
idd+6.895
ooming+6.737
zz+6.577
med+6.441
Neg logit contributions
 stranger-3.355
 truth-3.110
 stall-2.953
 keeper-2.796
 bench-2.745
Token playing
Token ID2712
Feature activation-1.06e+00

Pos logit contributions
idd+9.599
ug+9.322
zz+9.089
anged+8.803
ips+8.765
Neg logit contributions
 stranger-4.832
 bench-4.674
 stall-4.671
 nearby-4.317
 keeper-4.281
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
ug+2.188
idd+2.098
ooming+2.003
zz+1.969
anged+1.944
Neg logit contributions
 stranger-1.669
 keeper-1.592
 stall-1.584
 ranger-1.471
 bench-1.461
Token but
Token ID475
Feature activation-3.20e-01

Pos logit contributions
ug+3.224
idd+3.064
ooming+2.972
irled+2.891
ips+2.888
Neg logit contributions
 stranger-2.063
 keeper-1.991
 stall-1.921
 ranger-1.831
 bench-1.802
Token after
Token ID706
Feature activation-1.63e+00

Pos logit contributions
ug+0.667
idd+0.630
ooming+0.614
irled+0.589
zz+0.589
Neg logit contributions
 stranger-0.502
 keeper-0.478
 stall-0.470
 Heidi-0.440
 ranger-0.439
Token a
Token ID257
Feature activation-1.01e+00

Pos logit contributions
ug+4.391
idd+4.249
ooming+4.137
zz+4.034
irled+4.018
Neg logit contributions
 stranger-1.597
 keeper-1.456
 stall-1.435
 bench-1.318
 ranger-1.295
Token string
Token ID4731
Feature activation+3.85e-01

Pos logit contributions
ug+3.103
idd+2.952
ooming+2.903
zz+2.794
med+2.785
Neg logit contributions
 stranger-1.097
 stall-0.979
 keeper-0.956
 bench-0.881
 truth-0.860
Token.
Token ID13
Feature activation+3.62e-01

Pos logit contributions
 stranger+0.538
 keeper+0.507
 stall+0.497
 Heidi+0.476
Stan+0.472
Neg logit contributions
ug-0.866
idd-0.807
ooming-0.791
med-0.768
ips-0.764
Token She
Token ID1375
Feature activation-1.74e+00

Pos logit contributions
 stranger+0.453
 keeper+0.428
 stall+0.410
 ranger+0.383
 seller+0.379
Neg logit contributions
ug-0.856
idd-0.813
ooming-0.796
ips-0.783
zz-0.776
Token would
Token ID561
Feature activation-1.18e+00

Pos logit contributions
ug+4.772
idd+4.624
ooming+4.617
ips+4.488
med+4.393
Neg logit contributions
 stranger-1.511
 stall-1.377
 keeper-1.354
 bench-1.270
 seller-1.264
Token spend
Token ID4341
Feature activation-2.04e+00

Pos logit contributions
ug+2.650
idd+2.575
ooming+2.468
ips+2.393
med+2.389
Neg logit contributions
 stranger-1.677
 stall-1.585
 keeper-1.556
 seller-1.483
 bench-1.482
Token hours
Token ID2250
Feature activation-3.80e+00

Pos logit contributions
ug+5.083
idd+4.966
ooming+4.723
anged+4.533
zz+4.459
Neg logit contributions
 stranger-2.625
 stall-2.413
 bench-2.347
 Heidi-2.346
 keeper-2.301
Token on
Token ID319
Feature activation-1.04e+00

Pos logit contributions
idd+8.049
ug+8.014
anged+7.626
irled+7.562
zz+7.355
Neg logit contributions
 stall-6.091
 stranger-5.845
 nearby-5.840
 bench-5.687
 seller-5.378
Token end
Token ID886
Feature activation-1.60e+00

Pos logit contributions
ug+2.715
idd+2.596
ooming+2.495
zz+2.434
irled+2.426
Neg logit contributions
 stranger-1.144
 stall-1.066
 keeper-1.035
 Heidi-0.982
 bench-0.968
Token weaving
Token ID44889
Feature activation-1.03e+00

Pos logit contributions
ug+3.719
idd+3.604
ooming+3.494
anged+3.384
ged+3.365
Neg logit contributions
 stranger-2.146
 stall-2.045
 keeper-2.023
 bench-1.905
 Heidi-1.865
Token patterns
Token ID7572
Feature activation-9.96e-01

Pos logit contributions
ug+2.450
idd+2.382
ooming+2.303
irled+2.218
zz+2.190
Neg logit contributions
 stranger-1.331
 keeper-1.248
 stall-1.239
 Heidi-1.138
 ranger-1.114
Token and
Token ID290
Feature activation-2.41e+00

Pos logit contributions
ug+2.054
idd+1.982
ooming+1.883
irled+1.829
anged+1.800
Neg logit contributions
 stranger-1.571
 keeper-1.505
 stall-1.479
 seller-1.387
 ranger-1.374
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
ug+4.328
idd+4.311
ooming+4.182
irled+4.150
zz+4.027
Neg logit contributions
 stranger-1.968
 keeper-1.935
 stall-1.838
 seller-1.689
 ranger-1.689
Token He
Token ID679
Feature activation-1.98e+00

Pos logit contributions
ug+0.053
idd+0.050
ooming+0.049
med+0.048
irled+0.048
Neg logit contributions
 stranger-0.023
 keeper-0.021
 stall-0.021
 Heidi-0.019
Stan-0.019
Token was
Token ID373
Feature activation-1.84e+00

Pos logit contributions
ug+4.676
ooming+4.576
ips+4.463
idd+4.458
ged+4.242
Neg logit contributions
 stranger-2.482
 keeper-2.466
 stall-2.261
 ranger-2.235
 seller-2.230
Token very
Token ID845
Feature activation-1.43e+00

Pos logit contributions
ug+4.918
ooming+4.717
idd+4.704
zz+4.589
irled+4.537
Neg logit contributions
 stranger-1.874
 keeper-1.767
 stall-1.677
 seller-1.556
 ranger-1.524
Token good
Token ID922
Feature activation-2.17e+00

Pos logit contributions
ug+3.555
idd+3.411
ooming+3.389
med+3.316
ips+3.278
Neg logit contributions
 stranger-1.706
 keeper-1.597
 stall-1.530
 seller-1.431
 ranger-1.417
Token at
Token ID379
Feature activation-3.79e+00

Pos logit contributions
ug+4.720
idd+4.603
irled+4.531
med+4.425
ooming+4.398
Neg logit contributions
 stranger-3.298
 keeper-2.986
 stall-2.899
 seller-2.781
 ranger-2.774
Token fixing
Token ID18682
Feature activation-1.63e+00

Pos logit contributions
ug+8.621
idd+8.425
med+8.023
ooming+8.013
irled+7.852
Neg logit contributions
 truth-6.263
 keeper-5.921
 stranger-5.844
 market-5.587
 nearby-5.531
Token them
Token ID606
Feature activation-7.22e-01

Pos logit contributions
ug+3.650
idd+3.573
ooming+3.431
zz+3.373
irled+3.352
Neg logit contributions
 stranger-2.241
 keeper-2.154
 stall-2.131
 truth-1.990
 ranger-1.905
Token.
Token ID13
Feature activation-4.09e-01

Pos logit contributions
ug+1.713
idd+1.654
ooming+1.589
irled+1.558
zz+1.551
Neg logit contributions
 stranger-0.903
 keeper-0.864
 stall-0.841
 seller-0.771
 ranger-0.765
Token One
Token ID1881
Feature activation-1.02e+00

Pos logit contributions
ug+1.014
idd+0.965
ooming+0.948
irled+0.917
zz+0.916
Neg logit contributions
 stranger-0.481
 keeper-0.451
 stall-0.443
 Heidi-0.404
 ranger-0.402
Token day
Token ID1110
Feature activation-1.08e+00

Pos logit contributions
ug+1.905
idd+1.821
ooming+1.772
ips+1.720
irled+1.706
Neg logit contributions
 stranger-1.840
 keeper-1.742
 stall-1.711
 ranger-1.616
 seller-1.600
TokenJohn
Token ID7554
Feature activation-1.04e+00

Pos logit contributions
 stranger+2.471
 stall+2.392
 keeper+2.343
 camel+2.284
 ranger+2.225
Neg logit contributions
ug-2.901
idd-2.784
ooming-2.665
med-2.642
irled-2.605
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
ug+2.334
idd+2.211
ooming+2.152
ips+2.098
ged+2.036
Neg logit contributions
 stranger-1.459
 keeper-1.421
 stall-1.350
 seller-1.284
 ranger-1.274
Token Jane
Token ID12091
Feature activation-1.06e+00

Pos logit contributions
 stranger+0.015
 keeper+0.014
 stall+0.014
Stan+0.013
 Heidi+0.013
Neg logit contributions
ug-0.024
idd-0.022
ooming-0.022
zz-0.021
med-0.021
Token spent
Token ID3377
Feature activation-1.73e+00

Pos logit contributions
ug+2.125
idd+2.022
ooming+1.951
ips+1.890
irled+1.836
Neg logit contributions
 stranger-1.695
 keeper-1.637
 stall-1.592
 seller-1.528
 ranger-1.485
Token the
Token ID262
Feature activation-2.03e+00

Pos logit contributions
ug+4.434
idd+4.297
ooming+4.129
zz+4.013
anged+3.987
Neg logit contributions
 stranger-1.967
 stall-1.872
 keeper-1.770
 bench-1.715
 Heidi-1.693
Token day
Token ID1110
Feature activation-3.79e+00

Pos logit contributions
ug+4.972
idd+4.786
anged+4.616
ooming+4.497
ips+4.448
Neg logit contributions
 stranger-2.628
 keeper-2.371
 stall-2.316
 Heidi-2.238
 bench-2.222
Token practicing
Token ID18207
Feature activation-2.45e+00

Pos logit contributions
ug+7.211
zz+6.866
idd+6.794
anged+6.431
irled+6.271
Neg logit contributions
 stranger-7.150
 stall-6.724
 nearby-6.523
 bench-6.421
 keeper-6.323
Token tennis
Token ID20790
Feature activation-1.34e+00

Pos logit contributions
idd+5.780
ug+5.739
zz+5.301
ooming+5.213
anged+5.138
Neg logit contributions
 stranger-3.415
 stall-3.228
 keeper-3.182
 bench-3.035
 truth-2.951
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
ug+2.884
idd+2.807
ooming+2.654
zz+2.592
irled+2.592
Neg logit contributions
 stranger-2.045
 keeper-1.971
 stall-1.934
 ranger-1.810
 seller-1.787
Token When
Token ID1649
Feature activation-6.22e-01

Pos logit contributions
ug+2.969
idd+2.784
ooming+2.781
zz+2.650
irled+2.639
Neg logit contributions
 stranger-1.734
 keeper-1.687
 stall-1.667
 Heidi-1.603
 Stanley-1.545
Token they
Token ID484
Feature activation-9.38e-01

Pos logit contributions
ug+1.322
idd+1.245
ooming+1.211
irled+1.161
zz+1.160
Neg logit contributions
 stranger-0.971
 keeper-0.918
 stall-0.900
 Heidi-0.855
 ranger-0.837
Token trumpet
Token ID40334
Feature activation-7.95e-01

Pos logit contributions
ug+3.354
idd+3.039
anged+2.979
ooming+2.868
ips+2.806
Neg logit contributions
 stranger-3.847
 keeper-3.502
 stall-3.476
 truth-3.382
 bench-3.355
Token and
Token ID290
Feature activation-2.33e+00

Pos logit contributions
ug+1.675
idd+1.603
ooming+1.536
irled+1.500
zz+1.479
Neg logit contributions
 stranger-1.218
 keeper-1.174
 stall-1.147
 seller-1.076
 ranger-1.072
Token had
Token ID550
Feature activation+1.27e-01

Pos logit contributions
ug+4.668
idd+4.267
ooming+4.139
ips+3.938
irled+3.883
Neg logit contributions
 stranger-4.203
 keeper-4.194
 ranger-3.985
 stall-3.889
 seller-3.875
Token lots
Token ID6041
Feature activation-2.07e+00

Pos logit contributions
 stranger+0.168
 keeper+0.157
 stall+0.155
 Heidi+0.142
 seller+0.142
Neg logit contributions
ug-0.294
idd-0.280
ooming-0.272
med-0.265
irled-0.265
Token of
Token ID286
Feature activation-2.36e+00

Pos logit contributions
ug+4.347
idd+4.191
zz+4.028
ooming+3.976
irled+3.956
Neg logit contributions
 stranger-3.493
 keeper-3.176
 stall-3.039
 ranger-3.031
 truth-2.993
Token fun
Token ID1257
Feature activation-3.78e+00

Pos logit contributions
ug+6.525
idd+6.183
ooming+6.013
zz+5.877
anged+5.870
Neg logit contributions
 stranger-2.646
 truth-2.468
 stall-2.202
 keeper-2.139
 bench-2.113
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
ug+11.770
idd+11.429
zz+11.297
anged+10.739
ooming+10.618
Neg logit contributions
 stranger-2.791
 bench-2.271
 stall-2.225
 truth-2.130
 keeper-2.109
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
ug+3.060
idd+2.923
ooming+2.855
zz+2.736
irled+2.709
Neg logit contributions
 stranger-1.677
 keeper-1.616
 stall-1.582
 Heidi-1.519
 ranger-1.504
Token
Token ID198
Feature activation-7.53e-01

Pos logit contributions
ug+2.677
idd+2.588
ooming+2.522
irled+2.420
zz+2.419
Neg logit contributions
 stranger-1.212
 keeper-1.131
 stall-1.114
 Heidi-1.030
Stan-1.026
TokenAfter
Token ID3260
Feature activation-2.17e+00

Pos logit contributions
ug+1.691
idd+1.617
ooming+1.564
ips+1.504
zz+1.503
Neg logit contributions
 stranger-1.062
 keeper-0.997
 stall-0.978
Stan-0.918
 Heidi-0.916
Token a
Token ID257
Feature activation-1.22e+00

Pos logit contributions
ug+5.812
idd+5.472
ooming+5.282
zz+5.272
irled+5.207
Neg logit contributions
 stranger-2.392
 keeper-2.086
 stall-2.082
 ranger-2.016
 bench-1.973
Token together
Token ID1978
Feature activation-9.55e-01

Pos logit contributions
ug+3.661
idd+3.604
ooming+3.323
irled+3.289
zz+3.260
Neg logit contributions
 stranger-2.620
 keeper-2.541
 stall-2.402
 ranger-2.316
 bench-2.304
Token.
Token ID13
Feature activation-1.40e+00

Pos logit contributions
ug+2.259
idd+2.194
ooming+2.094
irled+2.033
zz+2.031
Neg logit contributions
 stranger-1.238
 keeper-1.177
 stall-1.160
 bench-1.055
 seller-1.052
Token They
Token ID1119
Feature activation-7.51e-01

Pos logit contributions
ug+3.414
idd+3.212
ooming+3.176
zz+3.049
med+2.985
Neg logit contributions
 stranger-1.731
 keeper-1.686
 stall-1.624
 Heidi-1.568
 ranger-1.559
Token both
Token ID1111
Feature activation-1.35e+00

Pos logit contributions
ug+1.515
idd+1.434
ooming+1.398
ips+1.336
irled+1.320
Neg logit contributions
 stranger-1.227
 keeper-1.169
 stall-1.141
 seller-1.085
 ranger-1.084
Token had
Token ID550
Feature activation-4.38e-01

Pos logit contributions
ug+2.925
idd+2.781
ooming+2.768
ips+2.661
irled+2.556
Neg logit contributions
 stranger-2.012
 keeper-1.920
 stall-1.827
 ranger-1.769
 seller-1.753
Token fun
Token ID1257
Feature activation-3.78e+00

Pos logit contributions
ug+1.049
idd+0.996
ooming+0.975
zz+0.940
irled+0.937
Neg logit contributions
 stranger-0.554
 keeper-0.521
 stall-0.510
 Heidi-0.470
 ranger-0.468
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
ug+9.429
idd+9.307
zz+8.801
ips+8.701
anged+8.553
Neg logit contributions
 stranger-4.879
 bench-4.722
 nearby-4.488
 stall-4.440
 keeper-4.349
Token Lily
Token ID20037
Feature activation-1.12e+00

Pos logit contributions
ug+3.767
idd+3.644
ooming+3.624
ips+3.431
zz+3.368
Neg logit contributions
 stranger-2.307
 keeper-2.230
 ranger-2.067
 stall-2.007
 Heidi-1.971
Token realized
Token ID6939
Feature activation+7.92e-01

Pos logit contributions
ug+2.732
idd+2.597
ooming+2.514
ips+2.446
zz+2.418
Neg logit contributions
 stranger-1.342
 keeper-1.302
 stall-1.221
 ranger-1.159
 seller-1.142
Token that
Token ID326
Feature activation-9.58e-01

Pos logit contributions
 stranger+1.011
 stall+0.998
 keeper+0.974
Stan+0.902
 seller+0.898
Neg logit contributions
ug-1.834
idd-1.723
ooming-1.688
med-1.657
irled-1.641
Token it
Token ID340
Feature activation-8.27e-01

Pos logit contributions
ug+2.074
idd+1.965
ooming+1.910
zz+1.841
irled+1.832
Neg logit contributions
 stranger-1.478
 keeper-1.383
 stall-1.327
 truth-1.287
 Heidi-1.287
Token shiny
Token ID22441
Feature activation-2.06e+00

Pos logit contributions
ug+3.826
idd+3.643
ooming+3.613
ips+3.378
zz+3.331
Neg logit contributions
 stranger-2.771
 keeper-2.732
 stall-2.584
 truth-2.527
 seller-2.511
Token pieces
Token ID5207
Feature activation-1.02e+00

Pos logit contributions
ug+4.666
idd+4.483
ooming+4.289
irled+4.139
zz+4.113
Neg logit contributions
 stranger-3.023
 keeper-2.862
 stall-2.842
 truth-2.712
 seller-2.709
Token.
Token ID13
Feature activation+6.98e-02

Pos logit contributions
ug+2.505
idd+2.400
ooming+2.348
irled+2.266
zz+2.252
Neg logit contributions
 stranger-1.221
 keeper-1.150
 stall-1.111
 seller-1.034
 truth-1.022
Token She
Token ID1375
Feature activation-2.39e+00

Pos logit contributions
 stranger+0.094
 keeper+0.089
 stall+0.087
 ranger+0.080
 seller+0.080
Neg logit contributions
ug-0.160
idd-0.152
ooming-0.148
zz-0.143
irled-0.143
Token worked
Token ID3111
Feature activation-1.99e+00

Pos logit contributions
ug+5.452
ooming+5.282
idd+5.146
ips+5.088
ipping+4.742
Neg logit contributions
 stranger-3.214
 keeper-3.186
 seller-3.182
 stall-3.055
 truth-2.860
Token and
Token ID290
Feature activation-3.78e+00

Pos logit contributions
ug+4.827
idd+4.769
ooming+4.500
anged+4.317
zz+4.302
Neg logit contributions
 stranger-2.481
 keeper-2.412
 stall-2.321
 truth-2.175
 ranger-2.112
Token worked
Token ID3111
Feature activation-2.03e+00

Pos logit contributions
ug+8.452
idd+8.091
ooming+8.067
ips+7.414
ang+7.065
Neg logit contributions
 keeper-5.451
 nearby-5.370
 seller-5.323
 stranger-5.198
 stall-5.167
Token,
Token ID11
Feature activation-2.76e+00

Pos logit contributions
ug+4.895
idd+4.885
ooming+4.567
anged+4.442
zz+4.420
Neg logit contributions
 stranger-2.514
 keeper-2.475
 stall-2.373
 truth-2.248
 ranger-2.160
Token putting
Token ID5137
Feature activation-1.86e+00

Pos logit contributions
ug+6.538
idd+6.451
ooming+5.972
irled+5.968
ips+5.781
Neg logit contributions
 stranger-3.676
 keeper-3.670
 stall-3.657
 seller-3.427
 nearby-3.328
Token each
Token ID1123
Feature activation-2.43e+00

Pos logit contributions
ug+4.520
idd+4.342
ooming+4.246
anged+4.027
irled+3.993
Neg logit contributions
 stranger-2.352
 stall-2.269
 keeper-2.239
 truth-2.038
 seller-2.027
Token piece
Token ID3704
Feature activation-1.27e+00

Pos logit contributions
ug+5.942
idd+5.776
ooming+5.669
irled+5.436
ips+5.320
Neg logit contributions
 stranger-3.071
 stall-3.017
 keeper-2.818
 bench-2.708
 seller-2.691
Token.
Token ID13
Feature activation+2.43e-02

Pos logit contributions
ug+1.453
idd+1.380
ooming+1.332
irled+1.297
zz+1.289
Neg logit contributions
 stranger-0.954
 keeper-0.912
 stall-0.884
 ranger-0.833
 seller-0.820
Token They
Token ID1119
Feature activation-1.31e+00

Pos logit contributions
 stranger+0.034
 keeper+0.032
 stall+0.031
 seller+0.029
 Heidi+0.029
Neg logit contributions
ug-0.055
idd-0.052
ooming-0.051
irled-0.049
ips-0.049
Token had
Token ID550
Feature activation-4.24e-01

Pos logit contributions
ug+2.695
idd+2.574
ooming+2.528
ips+2.393
irled+2.339
Neg logit contributions
 stranger-2.093
 keeper-1.981
 stall-1.922
 seller-1.856
 bench-1.837
Token lots
Token ID6041
Feature activation-2.15e+00

Pos logit contributions
ug+1.030
idd+0.979
ooming+0.958
zz+0.922
irled+0.922
Neg logit contributions
 stranger-0.519
 keeper-0.486
 stall-0.476
 Heidi-0.438
 ranger-0.435
Token of
Token ID286
Feature activation-2.25e+00

Pos logit contributions
ug+4.450
idd+4.401
ooming+4.097
zz+4.042
ged+4.029
Neg logit contributions
 stranger-3.634
 keeper-3.286
 stall-3.154
 ranger-3.125
 bench-3.090
Token fun
Token ID1257
Feature activation-3.78e+00

Pos logit contributions
ug+6.145
idd+5.853
ooming+5.694
ged+5.590
anged+5.572
Neg logit contributions
 stranger-2.476
 truth-2.266
 stall-2.098
 keeper-2.085
 bench-2.070
Token that
Token ID326
Feature activation-2.00e+00

Pos logit contributions
ug+10.022
idd+9.742
anged+9.193
zz+9.080
med+8.845
Neg logit contributions
 stranger-4.249
 bench-4.182
 stall-3.763
 keeper-3.692
 nearby-3.619
Token day
Token ID1110
Feature activation-2.06e+00

Pos logit contributions
ug+4.299
idd+4.166
ooming+3.945
ips+3.834
med+3.822
Neg logit contributions
 stranger-3.257
 keeper-3.125
 bench-2.962
 stall-2.937
 ranger-2.855
Token and
Token ID290
Feature activation-1.61e+00

Pos logit contributions
ug+5.222
idd+5.052
ooming+4.845
irled+4.825
zz+4.773
Neg logit contributions
 stranger-2.291
 keeper-2.132
 stall-2.096
 bench-1.948
 ranger-1.892
Token Lily
Token ID20037
Feature activation-9.63e-01

Pos logit contributions
ug+3.588
idd+3.414
ooming+3.389
ips+3.200
zz+3.153
Neg logit contributions
 stranger-2.382
 keeper-2.283
 ranger-2.113
 stall-2.111
 Heidi-2.077
Token was
Token ID373
Feature activation-1.26e+00

Pos logit contributions
ug+2.267
idd+2.149
ooming+2.086
ips+2.016
med+1.999
Neg logit contributions
 stranger-1.240
 keeper-1.183
 stall-1.137
 ranger-1.058
 seller-1.057
Tokenmy
Token ID1820
Feature activation-7.26e-01

Pos logit contributions
ug+1.114
idd+1.052
ooming+1.013
irled+0.988
med+0.978
Neg logit contributions
 stranger-0.920
 keeper-0.882
 stall-0.861
 ranger-0.814
 seller-0.812
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
ug+1.783
idd+1.692
ooming+1.644
irled+1.612
ips+1.612
Neg logit contributions
 stranger-0.874
 keeper-0.831
 stall-0.809
 ranger-0.743
 seller-0.738
Token They
Token ID1119
Feature activation-2.83e-01

Pos logit contributions
ug+2.364
idd+2.208
ooming+2.198
irled+2.129
zz+2.129
Neg logit contributions
 stranger-1.321
 keeper-1.274
 stall-1.251
 Heidi-1.184
 ranger-1.175
Token were
Token ID547
Feature activation-1.76e+00

Pos logit contributions
ug+0.510
idd+0.477
ooming+0.462
irled+0.440
med+0.439
Neg logit contributions
 stranger-0.522
 keeper-0.499
 stall-0.493
 Heidi-0.466
 seller-0.465
Token having
Token ID1719
Feature activation-9.70e-01

Pos logit contributions
ug+3.214
ooming+3.043
idd+2.983
ged+2.893
anged+2.864
Neg logit contributions
 stranger-3.149
 keeper-3.060
 stall-3.035
 Heidi-2.950
 Stanley-2.940
Token fun
Token ID1257
Feature activation-3.76e+00

Pos logit contributions
ug+2.427
idd+2.310
ooming+2.278
zz+2.203
ged+2.183
Neg logit contributions
 stranger-1.120
 keeper-1.032
 stall-1.011
 Heidi-0.946
 truth-0.944
Token catching
Token ID16508
Feature activation-6.41e-01

Pos logit contributions
ug+10.067
idd+9.532
med+9.180
anged+9.141
ooming+8.927
Neg logit contributions
 stranger-3.918
 nearby-3.741
 bench-3.638
 stall-3.416
 truth-3.412
Token butterflies
Token ID44571
Feature activation-5.67e-01

Pos logit contributions
ug+1.342
idd+1.266
ooming+1.230
irled+1.183
ips+1.181
Neg logit contributions
 stranger-1.005
 keeper-0.949
 stall-0.934
 ranger-0.877
 Heidi-0.873
Token when
Token ID618
Feature activation-1.82e+00

Pos logit contributions
ug+1.133
idd+1.072
ooming+1.034
irled+1.002
zz+0.991
Neg logit contributions
 stranger-0.928
 keeper-0.887
 stall-0.873
 ranger-0.821
 seller-0.817
Token Lily
Token ID20037
Feature activation-7.35e-01

Pos logit contributions
ug+4.199
idd+3.929
ooming+3.844
zz+3.790
irled+3.712
Neg logit contributions
 stranger-2.670
 keeper-2.462
 Heidi-2.348
 stall-2.338
 ranger-2.292
Token noticed
Token ID6810
Feature activation-8.02e-02

Pos logit contributions
ug+1.662
idd+1.562
ooming+1.519
ips+1.471
irled+1.467
Neg logit contributions
 stranger-1.015
 keeper-0.970
 stall-0.943
 ranger-0.883
 seller-0.875
Token
Token ID198
Feature activation-5.73e-01

Pos logit contributions
ug+3.002
idd+2.850
ooming+2.822
zz+2.742
irled+2.739
Neg logit contributions
 stranger-1.303
 keeper-1.239
 stall-1.183
 ranger-1.130
 truth-1.073
Token
Token ID198
Feature activation-3.00e-01

Pos logit contributions
ug+1.377
idd+1.317
ooming+1.288
zz+1.243
irled+1.241
Neg logit contributions
 stranger-0.703
 keeper-0.664
 stall-0.645
 ranger-0.596
 Heidi-0.590
TokenThey
Token ID2990
Feature activation-1.40e+00

Pos logit contributions
ug+0.697
idd+0.662
ooming+0.645
irled+0.623
zz+0.623
Neg logit contributions
 stranger-0.400
 keeper-0.377
 stall-0.369
 ranger-0.340
 Heidi-0.340
Token were
Token ID547
Feature activation-1.59e+00

Pos logit contributions
ug+2.925
idd+2.738
ooming+2.675
ips+2.578
irled+2.503
Neg logit contributions
 stranger-2.215
 keeper-2.104
 stall-2.087
 seller-2.023
 bench-1.975
Token having
Token ID1719
Feature activation-8.35e-01

Pos logit contributions
ug+3.899
idd+3.661
ooming+3.628
zz+3.526
ged+3.449
Neg logit contributions
 stranger-1.952
 keeper-1.919
 stall-1.866
 ranger-1.689
 Heidi-1.681
Token fun
Token ID1257
Feature activation-3.76e+00

Pos logit contributions
ug+1.972
idd+1.874
ooming+1.832
zz+1.781
ged+1.755
Neg logit contributions
 stranger-1.091
 keeper-1.020
 stall-1.007
 truth-0.944
 Heidi-0.925
Token.
Token ID13
Feature activation-7.06e-01

Pos logit contributions
ug+9.840
idd+9.517
zz+9.161
anged+8.697
ips+8.640
Neg logit contributions
 bench-4.383
 stranger-4.372
 stall-4.267
 nearby-4.153
 keeper-4.080
Token But
Token ID887
Feature activation-9.50e-01

Pos logit contributions
ug+1.580
idd+1.507
ooming+1.474
zz+1.412
irled+1.407
Neg logit contributions
 stranger-1.011
 keeper-0.966
 stall-0.952
 ranger-0.894
 Heidi-0.882
Token then
Token ID788
Feature activation+1.70e-01

Pos logit contributions
ug+1.989
idd+1.883
ooming+1.835
zz+1.762
ips+1.754
Neg logit contributions
 stranger-1.497
 keeper-1.437
 stall-1.396
 Heidi-1.333
 ranger-1.332
Token they
Token ID484
Feature activation-1.93e-01

Pos logit contributions
 stranger+0.285
 keeper+0.273
 stall+0.271
 Heidi+0.252
 ranger+0.252
Neg logit contributions
ug-0.332
idd-0.310
ooming-0.303
irled-0.290
med-0.290
Token saw
Token ID2497
Feature activation+4.24e-01

Pos logit contributions
ug+0.448
idd+0.425
ooming+0.414
irled+0.400
med+0.400
Neg logit contributions
 stranger-0.256
 keeper-0.241
 stall-0.237
 Heidi-0.218
 ranger-0.217
Token was
Token ID373
Feature activation-1.25e+00

Pos logit contributions
ug+0.054
idd+0.052
ooming+0.050
irled+0.049
zz+0.049
Neg logit contributions
 stranger-0.034
 keeper-0.032
 stall-0.031
 Heidi-0.029
 ranger-0.029
Token a
Token ID257
Feature activation-3.40e-01

Pos logit contributions
ug+3.477
idd+3.317
ooming+3.296
ips+3.166
zz+3.158
Neg logit contributions
 stranger-1.101
 stall-0.980
 keeper-0.927
 Heidi-0.915
 Stanley-0.896
Token big
Token ID1263
Feature activation-8.14e-01

Pos logit contributions
ug+0.784
idd+0.742
ooming+0.724
med+0.701
irled+0.701
Neg logit contributions
 stranger-0.457
 keeper-0.428
 stall-0.422
 ranger-0.387
 seller-0.387
Token farm
Token ID5318
Feature activation+1.55e+00

Pos logit contributions
ug+1.890
idd+1.790
ooming+1.753
med+1.722
ips+1.711
Neg logit contributions
 stranger-1.120
 keeper-1.032
 stall-1.031
 seller-0.941
 ranger-0.939
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
 stranger+3.033
 Heidi+2.869
Stan+2.821
 keeper+2.800
 stall+2.787
Neg logit contributions
ug-2.642
idd-2.419
ooming-2.209
med-2.137
bed-2.081
Token The
Token ID383
Feature activation+2.17e+00

Pos logit contributions
 stranger+0.196
 keeper+0.185
 stall+0.181
 seller+0.169
 ranger+0.168
Neg logit contributions
ug-0.281
idd-0.264
ooming-0.254
med-0.248
zz-0.247
Token farm
Token ID5318
Feature activation+1.22e+00

Pos logit contributions
 stranger+2.896
 keeper+2.872
 Heidi+2.784
holder+2.595
 ranger+2.579
Neg logit contributions
ug-4.954
idd-4.607
zz-4.371
ooming-4.200
ips-4.118
Token had
Token ID550
Feature activation+1.61e+00

Pos logit contributions
 stranger+1.898
Stan+1.761
 stall+1.744
 Heidi+1.742
 keeper+1.714
Neg logit contributions
ug-2.499
idd-2.403
irled-2.216
med-2.183
ooming-2.176
Token many
Token ID867
Feature activation+7.67e-01

Pos logit contributions
 stranger+2.299
 keeper+2.238
 stall+2.112
 seller+2.066
 Heidi+2.056
Neg logit contributions
ug-3.471
idd-3.339
med-3.164
zz-3.091
ips-3.022
Token animals
Token ID4695
Feature activation-8.63e-02

Pos logit contributions
 stranger+1.135
 keeper+1.100
 stall+1.028
 Heidi+1.025
 ranger+1.009
Neg logit contributions
ug-1.656
idd-1.545
ooming-1.475
irled-1.458
med-1.443
Token like
Token ID588
Feature activation+1.54e+00

Pos logit contributions
ug+0.184
idd+0.173
ooming+0.168
irled+0.162
med+0.162
Neg logit contributions
 stranger-0.131
 keeper-0.124
 stall-0.122
 Heidi-0.114
 ranger-0.113
Token had
Token ID550
Feature activation+1.08e+00

Pos logit contributions
 stranger+0.313
 keeper+0.294
 stall+0.293
 Heidi+0.275
Stan+0.271
Neg logit contributions
ug-0.447
idd-0.423
ooming-0.411
irled-0.400
med-0.399
Token a
Token ID257
Feature activation+8.84e-02

Pos logit contributions
 stranger+1.614
 stall+1.543
 keeper+1.527
 seller+1.425
 butcher+1.407
Neg logit contributions
ug-2.250
idd-2.159
med-2.107
ooming-2.051
irled-2.051
Token wonderful
Token ID7932
Feature activation-6.20e-01

Pos logit contributions
 stranger+0.111
 keeper+0.105
 stall+0.103
 Heidi+0.096
 ranger+0.094
Neg logit contributions
ug-0.207
idd-0.198
ooming-0.193
zz-0.186
irled-0.186
Token day
Token ID1110
Feature activation-1.53e+00

Pos logit contributions
ug+1.316
idd+1.234
ooming+1.195
ips+1.157
med+1.157
Neg logit contributions
 stranger-0.974
 keeper-0.918
 stall-0.910
 ranger-0.841
 Heidi-0.841
Token at
Token ID379
Feature activation-1.83e-01

Pos logit contributions
ug+3.097
idd+2.875
ooming+2.786
irled+2.751
zz+2.724
Neg logit contributions
 stranger-2.533
 stall-2.412
 keeper-2.385
 seller-2.201
 Stanley-2.196
Token the
Token ID262
Feature activation+2.78e+00

Pos logit contributions
ug+0.412
idd+0.391
ooming+0.381
irled+0.368
med+0.367
Neg logit contributions
 stranger-0.255
 keeper-0.240
 stall-0.236
 Heidi-0.219
 ranger-0.218
Token store
Token ID3650
Feature activation+1.11e+00

Pos logit contributions
 Stanley+3.736
 Heidi+3.609
 camel+3.539
Stan+3.534
 stranger+3.468
Neg logit contributions
idd-6.038
ooming-5.952
ug-5.636
ips-5.581
med-5.365
Token with
Token ID351
Feature activation+5.28e-01

Pos logit contributions
 stranger+1.820
 stall+1.680
 Heidi+1.664
 keeper+1.663
Stan+1.659
Neg logit contributions
ug-2.190
idd-2.049
ooming-2.007
med-1.900
ips-1.892
Token his
Token ID465
Feature activation+1.02e-01

Pos logit contributions
 stranger+0.692
 keeper+0.653
 stall+0.644
 Heidi+0.595
 seller+0.592
Neg logit contributions
ug-1.218
idd-1.160
ooming-1.123
med-1.112
irled-1.088
Token friends
Token ID2460
Feature activation+6.21e-01

Pos logit contributions
 stranger+0.139
 keeper+0.130
 stall+0.128
 Heidi+0.122
Stan+0.120
Neg logit contributions
ug-0.232
idd-0.221
ooming-0.215
irled-0.207
zz-0.206
Token and
Token ID290
Feature activation+3.99e-01

Pos logit contributions
 stranger+0.969
 keeper+0.920
 stall+0.895
 Heidi+0.864
Stan+0.853
Neg logit contributions
ug-1.275
idd-1.199
ooming-1.180
med-1.142
ged-1.126
Token laughed
Token ID13818
Feature activation+5.24e-01

Pos logit contributions
ug+1.213
idd+1.146
ooming+1.119
ips+1.083
irled+1.075
Neg logit contributions
 stranger-0.786
 keeper-0.754
 stall-0.730
 ranger-0.682
 seller-0.680
Token too
Token ID1165
Feature activation+4.64e-01

Pos logit contributions
 stranger+0.905
 stall+0.865
 keeper+0.860
 Heidi+0.810
Stan+0.806
Neg logit contributions
ug-1.009
idd-0.926
ooming-0.919
med-0.895
ged-0.879
Token.
Token ID13
Feature activation+5.49e-02

Pos logit contributions
 stranger+0.766
 keeper+0.724
 stall+0.721
 seller+0.681
 Heidi+0.677
Neg logit contributions
ug-0.921
idd-0.851
ooming-0.840
med-0.799
zz-0.799
Token He
Token ID679
Feature activation+7.33e-02

Pos logit contributions
 stranger+0.067
 keeper+0.062
 stall+0.061
 Heidi+0.056
 ranger+0.055
Neg logit contributions
ug-0.133
idd-0.127
ooming-0.123
med-0.119
irled-0.119
Token knew
Token ID2993
Feature activation+6.00e-01

Pos logit contributions
 stranger+0.108
 keeper+0.101
 stall+0.101
 Heidi+0.094
Stan+0.093
Neg logit contributions
ug-0.158
idd-0.150
ooming-0.145
irled-0.141
med-0.141
Token the
Token ID262
Feature activation+2.33e+00

Pos logit contributions
 stranger+0.774
 stall+0.731
 keeper+0.723
 Heidi+0.657
 seller+0.656
Neg logit contributions
ug-1.416
idd-1.326
ooming-1.286
med-1.254
irled-1.249
Token fox
Token ID21831
Feature activation+1.07e+00

Pos logit contributions
 stranger+2.945
 stall+2.880
 keeper+2.830
 Heidi+2.759
Stan+2.662
Neg logit contributions
ug-5.120
idd-4.802
ooming-4.673
zz-4.642
 pract-4.575
Token was
Token ID373
Feature activation+7.91e-01

Pos logit contributions
 stranger+1.639
 stall+1.568
 keeper+1.533
Stan+1.474
 Heidi+1.456
Neg logit contributions
ug-2.240
idd-2.087
med-2.017
ooming-2.006
irled-1.976
Token too
Token ID1165
Feature activation+6.00e-01

Pos logit contributions
 stranger+1.109
 stall+1.057
 keeper+1.040
 seller+0.958
Stan+0.958
Neg logit contributions
ug-1.740
idd-1.663
med-1.580
ooming-1.574
irled-1.553
Token smart
Token ID4451
Feature activation+1.07e-01

Pos logit contributions
 stranger+0.844
 keeper+0.800
 stall+0.797
 seller+0.749
 Heidi+0.745
Neg logit contributions
ug-1.329
idd-1.261
ooming-1.217
zz-1.177
med-1.172
Token to
Token ID284
Feature activation+4.43e-01

Pos logit contributions
 stranger+0.159
 keeper+0.151
 stall+0.149
 Heidi+0.139
 seller+0.138
Neg logit contributions
ug-0.228
idd-0.214
ooming-0.209
med-0.202
zz-0.201
Token day
Token ID1110
Feature activation-5.51e-01

Pos logit contributions
ug+1.626
idd+1.542
ooming+1.486
med+1.422
ips+1.418
Neg logit contributions
 stranger-1.546
 keeper-1.468
 stall-1.429
 ranger-1.367
 truth-1.342
Token,
Token ID11
Feature activation-9.17e-01

Pos logit contributions
ug+1.142
idd+1.079
ooming+1.046
irled+1.007
zz+1.004
Neg logit contributions
 stranger-0.873
 keeper-0.828
 stall-0.814
 Heidi-0.766
 ranger-0.762
Token they
Token ID484
Feature activation+5.60e-01

Pos logit contributions
ug+1.821
idd+1.716
ooming+1.644
ips+1.579
zz+1.579
Neg logit contributions
 stranger-1.584
 keeper-1.507
 stall-1.469
 Heidi-1.406
 ranger-1.405
Token went
Token ID1816
Feature activation+5.08e-01

Pos logit contributions
 stranger+0.911
 keeper+0.882
 stall+0.856
 Heidi+0.819
 ranger+0.811
Neg logit contributions
ug-1.130
idd-1.065
ooming-1.024
zz-1.010
med-0.995
Token to
Token ID284
Feature activation+4.03e-01

Pos logit contributions
 stranger+0.721
 keeper+0.686
 stall+0.670
 Heidi+0.639
Stan+0.632
Neg logit contributions
ug-1.114
idd-1.061
ooming-1.047
ips-1.010
med-1.001
Token the
Token ID262
Feature activation+2.56e+00

Pos logit contributions
 stranger+0.609
 keeper+0.578
 stall+0.567
 seller+0.537
Stan+0.530
Neg logit contributions
ug-0.844
idd-0.797
ooming-0.779
zz-0.753
irled-0.746
Token store
Token ID3650
Feature activation+1.57e+00

Pos logit contributions
 Stanley+4.421
 stranger+4.378
Stan+4.353
 Heidi+4.349
 Sonia+4.332
Neg logit contributions
idd-4.239
ooming-4.234
ug-4.136
ips-3.951
med-3.900
Token and
Token ID290
Feature activation+8.47e-01

Pos logit contributions
 stranger+2.725
Stan+2.501
 Heidi+2.493
 Stanley+2.491
 camel+2.447
Neg logit contributions
ug-2.928
idd-2.851
ooming-2.781
ips-2.611
ged-2.605
Token found
Token ID1043
Feature activation+1.29e+00

Pos logit contributions
 stranger+1.510
 keeper+1.428
 stall+1.423
Stan+1.377
 truth+1.364
Neg logit contributions
ug-1.522
idd-1.474
ooming-1.384
med-1.368
ged-1.360
Token a
Token ID257
Feature activation+1.05e+00

Pos logit contributions
 stranger+1.786
 keeper+1.685
 stall+1.618
Stan+1.557
 ranger+1.526
Neg logit contributions
ug-2.921
idd-2.787
med-2.650
ooming-2.618
ged-2.612
Token soft
Token ID2705
Feature activation+2.43e-01

Pos logit contributions
 stranger+1.308
 keeper+1.254
 stall+1.218
 Heidi+1.210
 Stanley+1.186
Neg logit contributions
ug-2.306
idd-2.270
ooming-2.204
zz-2.116
irled-2.091
Token thanked
Token ID26280
Feature activation-3.02e-01

Pos logit contributions
 stranger+0.298
 keeper+0.278
 stall+0.277
 Heidi+0.260
Stan+0.258
Neg logit contributions
ug-0.429
idd-0.409
ooming-0.396
irled-0.387
med-0.384
Token him
Token ID683
Feature activation+1.01e+00

Pos logit contributions
ug+0.706
idd+0.672
ooming+0.654
irled+0.633
zz+0.631
Neg logit contributions
 stranger-0.396
 keeper-0.372
 stall-0.365
 Heidi-0.338
 ranger-0.335
Token,
Token ID11
Feature activation-3.90e-01

Pos logit contributions
 stranger+1.829
 stall+1.708
 keeper+1.687
 Heidi+1.656
Stan+1.640
Neg logit contributions
ug-1.885
ooming-1.728
idd-1.691
med-1.661
zz-1.644
Token and
Token ID290
Feature activation-2.78e-01

Pos logit contributions
ug+0.825
idd+0.779
ooming+0.756
irled+0.730
zz+0.727
Neg logit contributions
 stranger-0.600
 keeper-0.569
 stall-0.560
 Heidi-0.523
 ranger-0.522
Token gave
Token ID2921
Feature activation-2.94e-01

Pos logit contributions
ug+0.586
idd+0.552
ooming+0.538
irled+0.516
zz+0.516
Neg logit contributions
 stranger-0.429
 keeper-0.407
 stall-0.400
 Heidi-0.375
 ranger-0.373
Token him
Token ID683
Feature activation+2.17e+00

Pos logit contributions
ug+0.696
idd+0.661
ooming+0.645
irled+0.623
zz+0.622
Neg logit contributions
 stranger-0.379
 keeper-0.355
 stall-0.348
 Heidi-0.322
 ranger-0.319
Token a
Token ID257
Feature activation+3.84e-01

Pos logit contributions
 stranger+3.422
 keeper+3.267
 stall+3.244
Stan+3.121
 Heidi+3.107
Neg logit contributions
ug-4.361
med-4.223
idd-4.183
zz-4.066
ips-4.055
Token smile
Token ID8212
Feature activation+1.27e+00

Pos logit contributions
 stranger+0.492
 keeper+0.473
 stall+0.457
 Heidi+0.437
 ranger+0.427
Neg logit contributions
ug-0.873
idd-0.840
ooming-0.819
zz-0.797
med-0.788
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
 stranger+2.145
 keeper+2.040
 stall+2.005
Stan+1.971
 Heidi+1.960
Neg logit contributions
ug-2.437
idd-2.299
med-2.259
ged-2.214
ooming-2.206
Token
Token ID198
Feature activation+1.81e-02

Pos logit contributions
ug+0.803
idd+0.762
ooming+0.746
irled+0.720
zz+0.720
Neg logit contributions
 stranger-0.401
 keeper-0.376
 stall-0.369
 Heidi-0.340
 ranger-0.337
Token
Token ID198
Feature activation+4.25e-01

Pos logit contributions
 stranger+0.024
 keeper+0.023
 stall+0.023
 seller+0.021
 Heidi+0.021
Neg logit contributions
ug-0.042
idd-0.039
ooming-0.038
irled-0.037
med-0.037
Token Tom
Token ID4186
Feature activation+1.78e+00

Pos logit contributions
 stranger+0.066
 keeper+0.062
 stall+0.062
 Heidi+0.058
Stan+0.058
Neg logit contributions
ug-0.085
idd-0.080
ooming-0.078
zz-0.075
irled-0.075
Token says
Token ID1139
Feature activation-5.27e-01

Pos logit contributions
 stranger+2.992
 stall+2.910
 keeper+2.848
Stan+2.667
 bench+2.654
Neg logit contributions
ug-3.339
idd-3.096
ooming-3.063
zz-3.049
ips-3.019
Token.
Token ID13
Feature activation-4.18e-01

Pos logit contributions
ug+1.284
idd+1.232
ooming+1.192
irled+1.162
zz+1.158
Neg logit contributions
 stranger-0.633
 keeper-0.599
 stall-0.580
 ranger-0.533
 Heidi-0.531
Token "
Token ID366
Feature activation-2.19e-01

Pos logit contributions
ug+1.036
idd+0.984
ooming+0.966
irled+0.934
zz+0.932
Neg logit contributions
 stranger-0.495
 keeper-0.465
 stall-0.454
 ranger-0.416
 Heidi-0.413
TokenWe
Token ID1135
Feature activation+2.18e-01

Pos logit contributions
ug+0.523
idd+0.497
ooming+0.485
irled+0.470
zz+0.469
Neg logit contributions
 stranger-0.276
 keeper-0.258
 stall-0.254
 Heidi-0.233
 ranger-0.232
Token should
Token ID815
Feature activation+9.52e-01

Pos logit contributions
 stranger+0.319
 keeper+0.303
 stall+0.300
 Heidi+0.280
 ranger+0.276
Neg logit contributions
ug-0.474
idd-0.449
ooming-0.434
zz-0.424
irled-0.420
Token share
Token ID2648
Feature activation+7.70e-01

Pos logit contributions
 stranger+1.344
 stall+1.268
 keeper+1.263
 Heidi+1.165
 ranger+1.155
Neg logit contributions
ug-2.082
idd-1.967
ooming-1.911
zz-1.861
irled-1.860
Token the
Token ID262
Feature activation+1.03e-01

Pos logit contributions
 stranger+1.251
 keeper+1.187
 stall+1.176
 Heidi+1.126
Stan+1.121
Neg logit contributions
ug-1.514
idd-1.422
ooming-1.378
med-1.347
ips-1.323
Token tree
Token ID5509
Feature activation+6.18e-01

Pos logit contributions
 stranger+0.148
 keeper+0.141
 stall+0.138
 Heidi+0.130
Stan+0.129
Neg logit contributions
ug-0.222
idd-0.211
ooming-0.205
zz-0.197
irled-0.197
Token with
Token ID351
Feature activation-2.24e-02

Pos logit contributions
 stranger+1.046
 stall+0.972
 keeper+0.966
 Heidi+0.948
Stan+0.946
Neg logit contributions
ug-1.194
idd-1.109
ooming-1.072
med-1.028
zz-1.027
Token other
Token ID584
Feature activation-7.04e-01

Pos logit contributions
ug+0.049
idd+0.046
ooming+0.044
irled+0.043
zz+0.043
Neg logit contributions
 stranger-0.033
 keeper-0.031
 stall-0.031
 Heidi-0.029
 ranger-0.029
Token decided
Token ID3066
Feature activation+2.63e+00

Pos logit contributions
 stall+2.213
 stranger+2.196
 keeper+2.104
Stan+2.040
 truth+2.002
Neg logit contributions
ug-2.538
idd-2.385
irled-2.290
ged-2.289
ooming-2.223
Token to
Token ID284
Feature activation+1.32e+00

Pos logit contributions
 stall+4.650
 stranger+4.250
Stan+4.207
 butcher+4.126
 camel+4.099
Neg logit contributions
ged-4.675
ug-4.487
idd-4.417
ooming-4.302
bed-4.226
Token share
Token ID2648
Feature activation+1.81e+00

Pos logit contributions
 stranger+2.165
 stall+2.113
 keeper+2.053
Stan+1.954
 Heidi+1.948
Neg logit contributions
ug-2.539
idd-2.342
ooming-2.276
zz-2.241
irled-2.214
Token her
Token ID607
Feature activation+6.58e-01

Pos logit contributions
 stranger+2.932
 keeper+2.799
 stall+2.748
 camel+2.612
Stan+2.603
Neg logit contributions
ug-3.421
idd-3.310
med-3.292
ged-3.193
ips-3.187
Token food
Token ID2057
Feature activation+1.40e+00

Pos logit contributions
 stranger+0.796
 keeper+0.741
 stall+0.707
 Heidi+0.691
Stan+0.684
Neg logit contributions
ug-1.570
idd-1.518
ooming-1.475
irled-1.439
zz-1.417
Token with
Token ID351
Feature activation+1.87e-01

Pos logit contributions
 stranger+2.781
 Heidi+2.606
Stan+2.594
 stall+2.589
 keeper+2.565
Neg logit contributions
ug-2.250
idd-2.079
ooming-2.009
ips-1.919
med-1.894
Token him
Token ID683
Feature activation+5.81e-01

Pos logit contributions
 stranger+0.295
 keeper+0.279
 stall+0.278
 seller+0.257
 Heidi+0.257
Neg logit contributions
ug-0.385
idd-0.362
ooming-0.352
med-0.342
irled-0.338
Token.
Token ID13
Feature activation+4.46e-01

Pos logit contributions
 stranger+0.819
 stall+0.753
 keeper+0.752
 Heidi+0.714
Stan+0.714
Neg logit contributions
ug-1.298
idd-1.227
ooming-1.193
med-1.160
irled-1.152
Token Max
Token ID5436
Feature activation+1.35e+00

Pos logit contributions
 stranger+0.638
 keeper+0.596
 stall+0.587
 seller+0.550
 camel+0.546
Neg logit contributions
ug-0.972
idd-0.925
ooming-0.894
med-0.872
irled-0.867
Token was
Token ID373
Feature activation+5.96e-01

Pos logit contributions
 stranger+2.332
 stall+2.219
 keeper+2.159
 Heidi+2.113
Stan+2.086
Neg logit contributions
ug-2.506
idd-2.417
irled-2.303
ooming-2.284
ged-2.252
Token very
Token ID845
Feature activation+3.88e-01

Pos logit contributions
 stranger+0.934
 stall+0.884
 keeper+0.879
Stan+0.813
 Heidi+0.811
Neg logit contributions
ug-1.221
idd-1.162
ooming-1.110
irled-1.091
med-1.088
Token a
Token ID257
Feature activation-1.05e-01

Pos logit contributions
ug+1.328
idd+1.272
ooming+1.239
zz+1.199
med+1.198
Neg logit contributions
 stranger-0.570
 keeper-0.530
 stall-0.514
 Heidi-0.470
 ranger-0.464
Token great
Token ID1049
Feature activation-9.12e-01

Pos logit contributions
ug+0.238
idd+0.225
ooming+0.220
irled+0.212
zz+0.212
Neg logit contributions
 stranger-0.142
 keeper-0.134
 stall-0.132
 Heidi-0.123
 ranger-0.121
Token job
Token ID1693
Feature activation-7.25e-01

Pos logit contributions
ug+1.922
idd+1.836
ooming+1.765
med+1.738
ips+1.730
Neg logit contributions
 stranger-1.462
 keeper-1.375
 stall-1.336
 truth-1.261
 ranger-1.260
Token on
Token ID319
Feature activation-2.45e-01

Pos logit contributions
ug+1.607
idd+1.525
ooming+1.490
irled+1.437
zz+1.434
Neg logit contributions
 stranger-1.035
 keeper-0.979
 stall-0.958
 ranger-0.887
 Heidi-0.883
Token the
Token ID262
Feature activation+7.52e-01

Pos logit contributions
ug+0.551
idd+0.522
ooming+0.508
irled+0.490
zz+0.490
Neg logit contributions
 stranger-0.344
 keeper-0.324
 stall-0.319
 Heidi-0.296
 ranger-0.294
Token quiz
Token ID38964
Feature activation+1.12e-01

Pos logit contributions
 stranger+1.078
 keeper+1.022
 stall+1.006
 Heidi+0.983
Stan+0.970
Neg logit contributions
ug-1.606
idd-1.523
ooming-1.481
zz-1.419
med-1.405
Token!
Token ID0
Feature activation-5.94e-01

Pos logit contributions
 stranger+0.158
 keeper+0.149
 stall+0.147
 Heidi+0.138
Stan+0.137
Neg logit contributions
ug-0.247
idd-0.233
ooming-0.227
med-0.219
irled-0.218
Token You
Token ID921
Feature activation-3.11e-01

Pos logit contributions
ug+1.374
idd+1.300
ooming+1.290
irled+1.240
zz+1.234
Neg logit contributions
 stranger-0.779
 keeper-0.736
 stall-0.723
 Heidi-0.676
 ranger-0.666
Token are
Token ID389
Feature activation-3.19e-02

Pos logit contributions
ug+0.684
idd+0.647
ooming+0.632
irled+0.608
med+0.607
Neg logit contributions
 stranger-0.448
 keeper-0.423
 stall-0.415
 Heidi-0.386
 seller-0.385
Token very
Token ID845
Feature activation-8.20e-03

Pos logit contributions
ug+0.075
idd+0.071
ooming+0.069
med+0.067
irled+0.067
Neg logit contributions
 stranger-0.041
 keeper-0.039
 stall-0.038
 Heidi-0.035
 ranger-0.035
Token smart
Token ID4451
Feature activation-6.71e-01

Pos logit contributions
ug+0.020
idd+0.019
ooming+0.018
irled+0.018
zz+0.017
Neg logit contributions
 stranger-0.010
 keeper-0.010
 stall-0.010
 Heidi-0.009
 ranger-0.009
Token cry
Token ID3960
Feature activation+2.64e-01

Pos logit contributions
 stranger+1.044
 stall+1.034
 keeper+0.998
 butcher+0.937
 seller+0.937
Neg logit contributions
ug-1.509
idd-1.381
ooming-1.375
zz-1.346
med-1.306
Token.
Token ID13
Feature activation-9.61e-02

Pos logit contributions
 stranger+0.444
 stall+0.423
 keeper+0.422
 Heidi+0.397
 seller+0.395
Neg logit contributions
ug-0.517
idd-0.477
ooming-0.467
zz-0.448
med-0.444
Token Everything
Token ID11391
Feature activation+2.03e-01

Pos logit contributions
ug+0.215
idd+0.204
ooming+0.198
med+0.191
irled+0.191
Neg logit contributions
 stranger-0.134
 keeper-0.126
 stall-0.125
 Heidi-0.115
 ranger-0.115
Token is
Token ID318
Feature activation+5.70e-01

Pos logit contributions
 stranger+0.268
 keeper+0.251
 stall+0.250
 Heidi+0.230
Stan+0.229
Neg logit contributions
ug-0.470
idd-0.443
ooming-0.435
med-0.422
irled-0.421
Token going
Token ID1016
Feature activation+2.51e-01

Pos logit contributions
 stranger+0.773
 stall+0.736
 keeper+0.725
 seller+0.670
 ranger+0.661
Neg logit contributions
ug-1.288
idd-1.207
ooming-1.171
med-1.155
irled-1.150
Token to
Token ID284
Feature activation+7.82e-01

Pos logit contributions
 stranger+0.344
 keeper+0.322
 stall+0.322
 Heidi+0.297
 seller+0.295
Neg logit contributions
ug-0.571
idd-0.535
ooming-0.524
med-0.508
zz-0.506
Token be
Token ID307
Feature activation-3.81e-01

Pos logit contributions
 stranger+1.175
 stall+1.126
 keeper+1.101
 Heidi+1.027
 butcher+1.025
Neg logit contributions
ug-1.668
idd-1.523
ooming-1.509
zz-1.478
irled-1.447
Token alright
Token ID23036
Feature activation+7.05e-01

Pos logit contributions
ug+0.927
idd+0.883
ooming+0.863
zz+0.833
ips+0.832
Neg logit contributions
 stranger-0.461
 keeper-0.431
 stall-0.420
 Heidi-0.389
 ranger-0.384
Token."
Token ID526
Feature activation-2.09e-01

Pos logit contributions
 stranger+1.149
 stall+1.083
 keeper+1.079
 seller+1.015
 Heidi+1.010
Neg logit contributions
ug-1.429
idd-1.320
ooming-1.277
med-1.239
zz-1.236
Token But
Token ID887
Feature activation+3.29e-01

Pos logit contributions
ug+0.471
idd+0.446
ooming+0.435
irled+0.420
med+0.419
Neg logit contributions
 stranger-0.292
 keeper-0.275
 stall-0.271
 Heidi-0.251
 ranger-0.250
Token the
Token ID262
Feature activation-2.75e-01

Pos logit contributions
 stranger+0.490
 keeper+0.460
 stall+0.460
Stan+0.426
 seller+0.423
Neg logit contributions
ug-0.706
idd-0.661
ooming-0.650
med-0.630
irled-0.624
Token gift
Token ID6979
Feature activation+1.33e+00

Pos logit contributions
ug+2.100
idd+1.959
ooming+1.907
med+1.869
ips+1.861
Neg logit contributions
 stranger-1.351
 keeper-1.261
 stall-1.236
 seller-1.152
 bench-1.150
Token.
Token ID13
Feature activation-4.76e-01

Pos logit contributions
 stranger+2.010
 stall+1.889
Stan+1.866
 keeper+1.818
 Heidi+1.785
Neg logit contributions
ug-2.794
idd-2.612
ooming-2.502
med-2.483
zz-2.448
Token She
Token ID1375
Feature activation-4.43e-01

Pos logit contributions
ug+1.138
idd+1.076
ooming+1.056
zz+1.018
med+1.015
Neg logit contributions
 stranger-0.601
 keeper-0.566
 stall-0.557
 Heidi-0.518
 ranger-0.511
Token knew
Token ID2993
Feature activation+6.50e-01

Pos logit contributions
ug+0.962
idd+0.908
ooming+0.887
ips+0.853
med+0.848
Neg logit contributions
 stranger-0.656
 keeper-0.623
 stall-0.611
 seller-0.570
 ranger-0.569
Token it
Token ID340
Feature activation+1.32e+00

Pos logit contributions
 stranger+0.816
 stall+0.785
 keeper+0.766
Stan+0.696
 seller+0.696
Neg logit contributions
ug-1.544
idd-1.456
ooming-1.415
med-1.385
irled-1.380
Token was
Token ID373
Feature activation+1.78e+00

Pos logit contributions
 stranger+2.131
 stall+2.080
 camel+1.999
 keeper+1.951
Stan+1.928
Neg logit contributions
ug-2.611
idd-2.453
med-2.421
ooming-2.359
zz-2.359
Token going
Token ID1016
Feature activation-3.75e-01

Pos logit contributions
 stranger+2.756
 stall+2.711
 keeper+2.604
 camel+2.519
 seller+2.479
Neg logit contributions
ug-3.658
idd-3.386
med-3.319
irled-3.306
ips-3.195
Token to
Token ID284
Feature activation+2.96e-02

Pos logit contributions
ug+0.938
idd+0.896
ooming+0.875
irled+0.847
zz+0.844
Neg logit contributions
 stranger-0.434
 keeper-0.406
 stall-0.395
 ranger-0.361
 Heidi-0.359
Token be
Token ID307
Feature activation+3.95e-02

Pos logit contributions
 stranger+0.043
 keeper+0.040
 stall+0.040
 Heidi+0.037
 seller+0.036
Neg logit contributions
ug-0.065
idd-0.061
ooming-0.060
zz-0.058
irled-0.058
Token a
Token ID257
Feature activation+1.77e-01

Pos logit contributions
 stranger+0.048
 keeper+0.045
 stall+0.045
 seller+0.040
 Heidi+0.040
Neg logit contributions
ug-0.096
idd-0.091
ooming-0.089
irled-0.086
med-0.086
Token fun
Token ID1257
Feature activation-7.23e-01

Pos logit contributions
 stranger+0.230
 keeper+0.218
 stall+0.216
 Heidi+0.202
Stan+0.196
Neg logit contributions
ug-0.405
idd-0.389
ooming-0.377
zz-0.365
irled-0.363
Token I
Token ID314
Feature activation-1.37e+00

Pos logit contributions
ug+1.368
idd+1.310
ooming+1.276
irled+1.233
ips+1.232
Neg logit contributions
 stranger-0.897
 keeper-0.843
 stall-0.820
 Heidi-0.773
 ranger-0.767
Token did
Token ID750
Feature activation-8.71e-01

Pos logit contributions
ug+3.073
idd+2.864
ooming+2.820
med+2.800
ips+2.749
Neg logit contributions
 stranger-1.966
 keeper-1.863
 stall-1.757
 truth-1.711
 ranger-1.709
Token!"
Token ID2474
Feature activation-1.45e+00

Pos logit contributions
ug+2.020
idd+1.926
ooming+1.861
zz+1.806
ips+1.800
Neg logit contributions
 stranger-1.168
 keeper-1.104
 stall-1.061
 Heidi-1.001
 ranger-0.995
Token
Token ID198
Feature activation-1.23e+00

Pos logit contributions
ug+3.646
idd+3.538
ooming+3.391
irled+3.346
ips+3.277
Neg logit contributions
 stranger-1.611
 keeper-1.517
 stall-1.496
 Heidi-1.442
 Stanley-1.404
Token
Token ID198
Feature activation-9.96e-01

Pos logit contributions
ug+3.085
idd+2.976
ooming+2.892
irled+2.814
ips+2.808
Neg logit contributions
 stranger-1.367
 keeper-1.289
 stall-1.247
Stan-1.168
 Heidi-1.153
TokenBilly
Token ID49640
Feature activation-1.40e-01

Pos logit contributions
ug+1.995
idd+1.891
ooming+1.820
ips+1.778
irled+1.765
Neg logit contributions
 stranger-1.634
 keeper-1.555
 stall-1.513
Stan-1.454
 ranger-1.437
Token smiled
Token ID13541
Feature activation+1.04e+00

Pos logit contributions
ug+0.307
idd+0.290
ooming+0.283
irled+0.273
zz+0.272
Neg logit contributions
 stranger-0.201
 keeper-0.190
 stall-0.187
 Heidi-0.174
 ranger-0.173
Token and
Token ID290
Feature activation-8.97e-01

Pos logit contributions
 stranger+1.807
 stall+1.735
 keeper+1.695
 Heidi+1.655
Stan+1.633
Neg logit contributions
ug-1.993
ooming-1.855
idd-1.807
med-1.774
ged-1.759
Token said
Token ID531
Feature activation-3.53e-01

Pos logit contributions
ug+2.204
idd+2.094
ooming+2.066
ips+1.994
med+1.960
Neg logit contributions
 stranger-1.090
 keeper-1.025
 stall-0.972
 ranger-0.929
 Heidi-0.920
Token,
Token ID11
Feature activation-6.60e-01

Pos logit contributions
ug+0.717
idd+0.677
ooming+0.657
irled+0.630
zz+0.630
Neg logit contributions
 stranger-0.568
 keeper-0.540
 stall-0.530
 Heidi-0.499
 ranger-0.496
Token "
Token ID366
Feature activation-1.04e+00

Pos logit contributions
ug+1.730
idd+1.653
ooming+1.619
irled+1.572
ips+1.569
Neg logit contributions
 stranger-0.681
 keeper-0.635
 stall-0.615
 Heidi-0.553
 ranger-0.553
Token's
Token ID338
Feature activation+7.81e-01

Pos logit contributions
ug+0.092
idd+0.087
ooming+0.085
med+0.082
irled+0.082
Neg logit contributions
 stranger-0.056
 keeper-0.052
 stall-0.052
 Heidi-0.048
 ranger-0.048
Token courage
Token ID11917
Feature activation-8.66e-02

Pos logit contributions
 stranger+1.079
 stall+1.012
 keeper+0.989
 Heidi+0.958
Stan+0.958
Neg logit contributions
ug-1.722
idd-1.627
ooming-1.616
med-1.535
zz-1.535
Token that
Token ID326
Feature activation+2.92e-01

Pos logit contributions
ug+0.183
idd+0.172
ooming+0.168
med+0.161
irled+0.161
Neg logit contributions
 stranger-0.131
 keeper-0.124
 stall-0.122
 Heidi-0.115
 ranger-0.114
Token they
Token ID484
Feature activation+5.64e-01

Pos logit contributions
 stranger+0.384
 keeper+0.361
 stall+0.360
Stan+0.328
 seller+0.327
Neg logit contributions
ug-0.671
idd-0.632
ooming-0.616
med-0.604
irled-0.597
Token immediately
Token ID3393
Feature activation+7.85e-01

Pos logit contributions
 stranger+0.901
 stall+0.868
 keeper+0.863
 Heidi+0.813
Stan+0.804
Neg logit contributions
ug-1.142
idd-1.064
ooming-1.025
med-1.014
ged-1.008
Token stopped
Token ID5025
Feature activation-1.29e+00

Pos logit contributions
 stranger+1.280
 stall+1.212
 keeper+1.207
 Heidi+1.152
Stan+1.137
Neg logit contributions
ug-1.554
idd-1.458
ooming-1.410
med-1.376
irled-1.374
Token teasing
Token ID34417
Feature activation-1.50e-02

Pos logit contributions
ug+2.667
idd+2.517
ooming+2.399
irled+2.378
zz+2.362
Neg logit contributions
 stranger-2.074
 keeper-1.993
 stall-1.917
 ranger-1.827
 Heidi-1.798
Token Tiger
Token ID17030
Feature activation+2.46e-01

Pos logit contributions
ug+0.033
idd+0.031
ooming+0.030
med+0.029
zz+0.029
Neg logit contributions
 stranger-0.022
 keeper-0.021
 stall-0.020
 Heidi-0.019
 seller-0.019
Token.
Token ID13
Feature activation-6.08e-01

Pos logit contributions
 stranger+0.328
 keeper+0.304
 stall+0.301
 Heidi+0.281
Stan+0.278
Neg logit contributions
ug-0.566
idd-0.535
ooming-0.525
med-0.510
zz-0.505
Token From
Token ID3574
Feature activation+1.22e+00

Pos logit contributions
ug+1.420
idd+1.348
ooming+1.324
irled+1.282
zz+1.280
Neg logit contributions
 stranger-0.797
 keeper-0.757
 stall-0.743
 Heidi-0.693
 ranger-0.684
Token that
Token ID326
Feature activation-9.73e-01

Pos logit contributions
 stranger+1.994
 stall+1.916
 keeper+1.875
 truth+1.821
 butcher+1.788
Neg logit contributions
ug-2.423
idd-2.227
ooming-2.151
med-2.124
irled-2.104
Token became
Token ID2627
Feature activation-5.51e-01

Pos logit contributions
ug+0.532
idd+0.501
ooming+0.487
irled+0.466
med+0.466
Neg logit contributions
 stranger-0.430
 keeper-0.409
 stall-0.403
 Heidi-0.378
 ranger-0.377
Token very
Token ID845
Feature activation-2.79e-01

Pos logit contributions
ug+1.267
idd+1.203
ooming+1.179
zz+1.140
irled+1.131
Neg logit contributions
 stranger-0.751
 keeper-0.706
 stall-0.687
 ranger-0.642
 Heidi-0.638
Token sad
Token ID6507
Feature activation-7.15e-01

Pos logit contributions
ug+0.587
idd+0.554
ooming+0.539
med+0.518
zz+0.518
Neg logit contributions
 stranger-0.437
 keeper-0.414
 stall-0.407
 Heidi-0.380
 ranger-0.380
Token and
Token ID290
Feature activation+5.14e-01

Pos logit contributions
ug+1.566
idd+1.509
ooming+1.443
irled+1.408
med+1.403
Neg logit contributions
 stranger-1.051
 keeper-0.989
 stall-0.963
 ranger-0.908
 truth-0.892
Token scared
Token ID12008
Feature activation-1.29e-01

Pos logit contributions
 stranger+0.855
 stall+0.828
 keeper+0.813
Stan+0.770
 Heidi+0.765
Neg logit contributions
ug-1.013
idd-0.942
ooming-0.907
irled-0.897
ged-0.890
Token.
Token ID13
Feature activation-8.61e-02

Pos logit contributions
ug+0.296
idd+0.280
ooming+0.273
med+0.263
irled+0.263
Neg logit contributions
 stranger-0.175
 keeper-0.165
 stall-0.163
 Heidi-0.150
 seller-0.150
Token Eventually
Token ID16178
Feature activation-1.22e+00

Pos logit contributions
ug+0.188
idd+0.178
ooming+0.173
irled+0.167
med+0.167
Neg logit contributions
 stranger-0.126
 keeper-0.119
 stall-0.117
 Heidi-0.109
 seller-0.108
Token,
Token ID11
Feature activation-3.51e-01

Pos logit contributions
ug+2.662
idd+2.581
ooming+2.459
ips+2.391
zz+2.386
Neg logit contributions
 stranger-1.769
 keeper-1.637
 stall-1.615
 Heidi-1.548
 ranger-1.519
Token a
Token ID257
Feature activation+6.19e-01

Pos logit contributions
ug+0.829
idd+0.790
ooming+0.769
irled+0.744
zz+0.742
Neg logit contributions
 stranger-0.454
 keeper-0.425
 stall-0.418
 Heidi-0.386
 ranger-0.384
Token big
Token ID1263
Feature activation-3.47e-01

Pos logit contributions
 stranger+0.645
 stall+0.621
 keeper+0.608
 Heidi+0.579
 Stanley+0.561
Neg logit contributions
ug-1.549
idd-1.477
ooming-1.436
zz-1.391
med-1.377
Token bird
Token ID6512
Feature activation-5.14e-01

Pos logit contributions
ug+0.782
idd+0.743
ooming+0.723
irled+0.700
med+0.699
Neg logit contributions
 stranger-0.487
 keeper-0.457
 stall-0.448
 ranger-0.416
 Heidi-0.415
Token around
Token ID1088
Feature activation-7.94e-01

Pos logit contributions
ug+1.546
idd+1.495
ooming+1.433
irled+1.385
zz+1.382
Neg logit contributions
 stranger-1.004
 keeper-0.960
 stall-0.924
 Heidi-0.868
 ranger-0.865
Token fragile
Token ID21049
Feature activation-1.51e-01

Pos logit contributions
ug+1.902
idd+1.835
ooming+1.771
irled+1.725
zz+1.708
Neg logit contributions
 stranger-1.016
 keeper-0.941
 stall-0.914
 truth-0.847
 Heidi-0.847
Token things
Token ID1243
Feature activation+2.09e-01

Pos logit contributions
ug+0.306
idd+0.288
ooming+0.280
irled+0.269
med+0.268
Neg logit contributions
 stranger-0.244
 keeper-0.233
 stall-0.229
 Heidi-0.215
 ranger-0.214
Token and
Token ID290
Feature activation-7.28e-01

Pos logit contributions
 stranger+0.331
 keeper+0.312
 stall+0.311
 Heidi+0.291
Stan+0.290
Neg logit contributions
ug-0.430
idd-0.400
ooming-0.395
zz-0.379
med-0.378
Token her
Token ID607
Feature activation-1.02e+00

Pos logit contributions
ug+1.780
idd+1.705
ooming+1.665
med+1.601
ips+1.600
Neg logit contributions
 stranger-0.888
 keeper-0.832
 stall-0.795
 ranger-0.744
 Heidi-0.740
Token mom
Token ID1995
Feature activation-9.78e-01

Pos logit contributions
ug+2.464
idd+2.327
ooming+2.252
med+2.211
zz+2.186
Neg logit contributions
 stranger-1.359
 keeper-1.288
 stall-1.239
 ranger-1.160
 truth-1.147
Token trusted
Token ID13467
Feature activation-5.17e-01

Pos logit contributions
ug+2.348
idd+2.238
ooming+2.186
ips+2.110
irled+2.101
Neg logit contributions
 stranger-1.214
 keeper-1.169
 stall-1.099
 seller-1.043
 ranger-1.037
Token her
Token ID607
Feature activation+3.92e-01

Pos logit contributions
ug+1.311
idd+1.261
ooming+1.228
irled+1.189
zz+1.183
Neg logit contributions
 stranger-0.575
 keeper-0.534
 stall-0.517
 Heidi-0.476
 ranger-0.470
Token even
Token ID772
Feature activation+3.97e-02

Pos logit contributions
 stranger+0.569
 stall+0.536
 keeper+0.533
 Heidi+0.504
Stan+0.503
Neg logit contributions
ug-0.856
idd-0.795
ooming-0.792
med-0.758
zz-0.758
Token more
Token ID517
Feature activation+3.36e-01

Pos logit contributions
 stranger+0.056
 keeper+0.053
 stall+0.053
 Heidi+0.049
Stan+0.048
Neg logit contributions
ug-0.088
idd-0.083
ooming-0.081
med-0.078
irled-0.078
Token.
Token ID13
Feature activation-9.43e-02

Pos logit contributions
 stranger+0.471
 keeper+0.448
 stall+0.446
 seller+0.415
Stan+0.413
Neg logit contributions
ug-0.756
idd-0.701
ooming-0.691
med-0.668
zz-0.667
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
 stranger+0.033
 keeper+0.031
 stall+0.031
 Heidi+0.029
 ranger+0.029
Neg logit contributions
ug-0.047
idd-0.044
ooming-0.043
med-0.042
irled-0.042
Token They
Token ID1119
Feature activation+9.58e-01

Pos logit contributions
ug+0.692
idd+0.655
ooming+0.641
irled+0.618
zz+0.617
Neg logit contributions
 stranger-0.420
 keeper-0.398
 stall-0.391
 Heidi-0.363
 ranger-0.361
Token were
Token ID547
Feature activation-2.78e-01

Pos logit contributions
 stranger+1.549
 stall+1.484
 keeper+1.473
 Heidi+1.412
Stan+1.378
Neg logit contributions
ug-1.916
idd-1.795
ooming-1.701
ged-1.694
med-1.693
Token all
Token ID477
Feature activation-6.42e-01

Pos logit contributions
ug+0.619
idd+0.586
ooming+0.572
zz+0.551
irled+0.551
Neg logit contributions
 stranger-0.393
 keeper-0.372
 stall-0.365
 Heidi-0.340
 ranger-0.338
Token so
Token ID523
Feature activation-9.81e-02

Pos logit contributions
ug+1.481
idd+1.398
ooming+1.379
zz+1.328
irled+1.325
Neg logit contributions
 stranger-0.870
 keeper-0.827
 stall-0.801
 ranger-0.749
 Heidi-0.739
Token happy
Token ID3772
Feature activation-9.16e-01

Pos logit contributions
ug+0.215
idd+0.204
ooming+0.198
irled+0.191
med+0.191
Neg logit contributions
 stranger-0.143
 keeper-0.135
 stall-0.133
 Heidi-0.124
 seller-0.123
Token that
Token ID326
Feature activation+6.47e-03

Pos logit contributions
ug+2.111
idd+2.007
ooming+1.955
ips+1.897
irled+1.896
Neg logit contributions
 stranger-1.253
 keeper-1.190
 stall-1.134
 ranger-1.083
 Heidi-1.051
Token the
Token ID262
Feature activation+4.08e-01

Pos logit contributions
 stranger+0.009
 keeper+0.009
 stall+0.009
 Heidi+0.008
 seller+0.008
Neg logit contributions
ug-0.014
idd-0.013
ooming-0.013
med-0.013
irled-0.013
Token king
Token ID5822
Feature activation-4.76e-01

Pos logit contributions
 stranger+0.563
 stall+0.535
 keeper+0.535
 Heidi+0.507
Stan+0.496
Neg logit contributions
ug-0.899
idd-0.861
ooming-0.829
zz-0.799
irled-0.799
Token had
Token ID550
Feature activation+5.45e-01

Pos logit contributions
ug+1.040
idd+0.984
ooming+0.961
irled+0.926
ips+0.925
Neg logit contributions
 stranger-0.692
 keeper-0.658
 stall-0.640
 ranger-0.598
 seller-0.596
Token a
Token ID257
Feature activation+2.25e-01

Pos logit contributions
 stranger+0.904
 stall+0.856
 keeper+0.856
 seller+0.798
 Heidi+0.796
Neg logit contributions
ug-1.081
idd-1.018
ooming-0.977
med-0.967
irled-0.952
Token Sarah
Token ID10490
Feature activation+3.82e-02

Pos logit contributions
ug+0.337
idd+0.319
ooming+0.311
irled+0.300
med+0.300
Neg logit contributions
 stranger-0.212
 keeper-0.200
 stall-0.197
 Heidi-0.182
 ranger-0.182
Token started
Token ID2067
Feature activation-5.40e-02

Pos logit contributions
 stranger+0.055
 keeper+0.052
 stall+0.051
 Heidi+0.048
 ranger+0.047
Neg logit contributions
ug-0.083
idd-0.079
ooming-0.077
med-0.075
irled-0.074
Token to
Token ID284
Feature activation+6.14e-01

Pos logit contributions
ug+0.127
idd+0.121
ooming+0.118
med+0.114
irled+0.114
Neg logit contributions
 stranger-0.069
 keeper-0.065
 stall-0.063
 Heidi-0.059
 ranger-0.058
Token spin
Token ID7906
Feature activation-4.53e-01

Pos logit contributions
 stranger+0.918
 keeper+0.870
 stall+0.853
Stan+0.804
 ranger+0.803
Neg logit contributions
ug-1.271
idd-1.209
ooming-1.183
zz-1.136
irled-1.135
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
ug+0.931
idd+0.881
ooming+0.854
irled+0.819
zz+0.817
Neg logit contributions
 stranger-0.726
 keeper-0.689
 stall-0.676
 Heidi-0.635
 ranger-0.632
Token She
Token ID1375
Feature activation-2.76e+00

Pos logit contributions
ug+3.803
idd+3.620
ooming+3.597
irled+3.463
zz+3.449
Neg logit contributions
 stranger-1.158
 keeper-1.082
 stall-1.018
 Heidi-0.961
 truth-0.919
Token spun
Token ID26843
Feature activation-9.02e-01

Pos logit contributions
ug+6.709
idd+6.427
ooming+6.245
ips+6.087
anged+5.774
Neg logit contributions
 stranger-3.739
 keeper-3.539
 bench-3.425
 stall-3.181
 nearby-3.165
Token so
Token ID523
Feature activation-2.04e+00

Pos logit contributions
ug+1.989
idd+1.898
ooming+1.836
irled+1.752
anged+1.751
Neg logit contributions
 stranger-1.345
 keeper-1.264
 stall-1.230
 Heidi-1.158
 ranger-1.151
Token fast
Token ID3049
Feature activation-1.68e+00

Pos logit contributions
ug+4.939
idd+4.938
ooming+4.739
anged+4.540
ips+4.507
Neg logit contributions
 stranger-2.690
 keeper-2.447
 stall-2.250
 bench-2.212
 ranger-2.170
Token!
Token ID0
Feature activation-1.54e+00

Pos logit contributions
ug+4.278
idd+4.156
ooming+3.963
irled+3.940
zz+3.841
Neg logit contributions
 stranger-2.020
 keeper-1.808
 stall-1.747
 Heidi-1.644
 bench-1.640
Token Around
Token ID16824
Feature activation-1.02e+00

Pos logit contributions
ug+4.172
idd+3.974
ooming+3.937
irled+3.796
zz+3.772
Neg logit contributions
 stranger-1.526
 keeper-1.434
 stall-1.351
 Heidi-1.340
 truth-1.287
Token make
Token ID787
Feature activation-6.71e-01

Pos logit contributions
ug+1.073
idd+1.023
ooming+0.989
med+0.955
irled+0.955
Neg logit contributions
 stranger-0.723
 keeper-0.688
 stall-0.674
 seller-0.626
 Heidi-0.624
Token figures
Token ID5538
Feature activation-4.56e-01

Pos logit contributions
ug+1.663
idd+1.598
ooming+1.552
zz+1.508
ged+1.502
Neg logit contributions
 stranger-0.783
 keeper-0.727
 stall-0.708
 Heidi-0.650
 truth-0.649
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
ug+0.987
idd+0.940
ooming+0.908
irled+0.878
zz+0.876
Neg logit contributions
 stranger-0.671
 keeper-0.636
 stall-0.622
 ranger-0.578
 seller-0.577
Token He
Token ID679
Feature activation-1.61e+00

Pos logit contributions
ug+0.596
idd+0.568
ooming+0.556
irled+0.539
zz+0.538
Neg logit contributions
 stranger-0.251
 keeper-0.232
 stall-0.228
 Heidi-0.206
 ranger-0.204
Token made
Token ID925
Feature activation-1.47e+00

Pos logit contributions
ug+4.205
idd+4.040
ooming+4.026
ips+3.932
med+3.806
Neg logit contributions
 stranger-1.638
 keeper-1.568
 stall-1.474
 seller-1.426
 ranger-1.338
Token all
Token ID477
Feature activation-2.05e+00

Pos logit contributions
ug+3.816
idd+3.726
ooming+3.635
ips+3.532
zz+3.506
Neg logit contributions
 stranger-1.555
 keeper-1.454
 stall-1.359
 truth-1.308
 ranger-1.228
Token sorts
Token ID10524
Feature activation-1.50e+00

Pos logit contributions
ug+4.786
idd+4.646
ooming+4.598
zz+4.385
ged+4.381
Neg logit contributions
 stranger-2.840
 keeper-2.657
 stall-2.520
 truth-2.408
 bench-2.363
Token of
Token ID286
Feature activation-2.42e+00

Pos logit contributions
ug+2.814
idd+2.793
ooming+2.683
zz+2.629
irled+2.536
Neg logit contributions
 stranger-2.619
 keeper-2.495
 stall-2.359
 truth-2.315
 ranger-2.285
Token figures
Token ID5538
Feature activation-8.84e-01

Pos logit contributions
idd+6.433
ug+6.410
ooming+6.294
ips+6.199
zz+6.132
Neg logit contributions
 stranger-2.564
 truth-2.352
 keeper-2.232
 stall-2.105
 wise-2.009
Token,
Token ID11
Feature activation-1.30e+00

Pos logit contributions
ug+1.842
idd+1.763
ooming+1.691
irled+1.641
zz+1.636
Neg logit contributions
 stranger-1.377
 keeper-1.318
 stall-1.281
 ranger-1.202
 seller-1.196
Token from
Token ID422
Feature activation-2.15e+00

Pos logit contributions
ug+2.848
idd+2.753
ooming+2.684
irled+2.600
ips+2.588
Neg logit contributions
 stranger-1.941
 keeper-1.861
 stall-1.788
 ranger-1.704
 seller-1.679
Token to
Token ID284
Feature activation-8.85e-02

Pos logit contributions
ug+1.235
idd+1.178
ooming+1.147
irled+1.111
med+1.108
Neg logit contributions
 stranger-0.717
 keeper-0.680
 stall-0.651
 Heidi-0.615
 ranger-0.614
Token take
Token ID1011
Feature activation-1.67e+00

Pos logit contributions
ug+0.205
idd+0.194
ooming+0.189
zz+0.183
irled+0.183
Neg logit contributions
 stranger-0.117
 keeper-0.110
 stall-0.109
 Heidi-0.100
 ranger-0.100
Token good
Token ID922
Feature activation-1.94e+00

Pos logit contributions
ug+3.722
idd+3.606
ooming+3.538
irled+3.348
ips+3.326
Neg logit contributions
 stranger-2.463
 keeper-2.252
 Heidi-2.190
 stall-2.188
 Stanley-2.145
Token care
Token ID1337
Feature activation-6.93e-01

Pos logit contributions
ug+3.879
idd+3.756
ooming+3.571
med+3.452
irled+3.452
Neg logit contributions
 stranger-3.422
 keeper-3.059
 truth-2.964
 stall-2.944
 Heidi-2.886
Token of
Token ID286
Feature activation-4.71e-01

Pos logit contributions
ug+1.222
idd+1.161
ooming+1.127
irled+1.076
zz+1.067
Neg logit contributions
 stranger-1.298
 keeper-1.245
 stall-1.215
 ranger-1.155
 seller-1.145
Token my
Token ID616
Feature activation-2.24e+00

Pos logit contributions
ug+1.039
idd+0.987
ooming+0.962
irled+0.930
zz+0.928
Neg logit contributions
 stranger-0.674
 keeper-0.637
 stall-0.625
 Heidi-0.586
 ranger-0.577
Token engine
Token ID3113
Feature activation-3.46e-01

Pos logit contributions
ug+4.375
idd+4.237
ips+4.215
med+4.110
irled+4.066
Neg logit contributions
 keeper-3.769
 stranger-3.754
 stall-3.549
 truth-3.483
 ranger-3.380
Token,"
Token ID553
Feature activation+7.40e-02

Pos logit contributions
ug+0.739
idd+0.701
ooming+0.680
irled+0.657
med+0.654
Neg logit contributions
 stranger-0.517
 keeper-0.492
 stall-0.482
 ranger-0.449
 Heidi-0.449
Token said
Token ID531
Feature activation-4.81e-01

Pos logit contributions
 stranger+0.117
 keeper+0.111
 stall+0.111
 Heidi+0.103
 seller+0.103
Neg logit contributions
ug-0.151
idd-0.142
ooming-0.138
med-0.133
irled-0.132
Token Tim
Token ID5045
Feature activation+4.33e-01

Pos logit contributions
ug+0.946
idd+0.894
ooming+0.870
irled+0.837
ips+0.837
Neg logit contributions
 stranger-0.805
 keeper-0.768
 stall-0.750
 Heidi-0.715
 ranger-0.709
Tokenmy
Token ID1820
Feature activation+5.27e-01

Pos logit contributions
 stranger+0.795
 stall+0.758
 keeper+0.745
Stan+0.711
 Heidi+0.708
Neg logit contributions
ug-0.779
idd-0.718
ooming-0.703
zz-0.674
med-0.673
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
ug+0.904
idd+0.863
ooming+0.837
irled+0.813
zz+0.808
Neg logit contributions
 stranger-0.531
 keeper-0.505
 stall-0.493
 ranger-0.456
 seller-0.453
Token He
Token ID679
Feature activation-1.90e+00

Pos logit contributions
ug+3.912
idd+3.766
ooming+3.737
zz+3.614
irled+3.604
Neg logit contributions
 stranger-1.299
 keeper-1.241
 stall-1.203
 ranger-1.091
 Heidi-1.080
Token made
Token ID925
Feature activation-1.33e+00

Pos logit contributions
ug+4.302
ooming+4.096
idd+4.083
ips+3.983
zz+3.793
Neg logit contributions
 stranger-2.662
 keeper-2.645
 stall-2.510
 seller-2.393
 ranger-2.379
Token engine
Token ID3113
Feature activation-1.42e+00

Pos logit contributions
ug+3.320
idd+3.225
ooming+3.139
ips+3.033
zz+3.008
Neg logit contributions
 stranger-1.522
 keeper-1.426
 stall-1.356
 truth-1.246
 bench-1.243
Token noises
Token ID26782
Feature activation-1.36e+00

Pos logit contributions
ug+3.247
idd+3.203
ooming+3.038
ips+2.994
irled+2.964
Neg logit contributions
 stranger-1.860
 keeper-1.845
 stall-1.718
 ranger-1.635
 seller-1.582
Token with
Token ID351
Feature activation-2.33e+00

Pos logit contributions
ug+3.115
idd+3.044
ooming+2.911
irled+2.896
ips+2.862
Neg logit contributions
 stranger-1.855
 keeper-1.756
 stall-1.678
 ranger-1.595
 seller-1.552
Token his
Token ID465
Feature activation-2.55e+00

Pos logit contributions
ug+6.060
idd+5.914
ooming+5.881
irled+5.659
ips+5.558
Neg logit contributions
 stranger-2.564
 keeper-2.300
 stall-2.247
 truth-2.108
 bench-2.060
Token mouth
Token ID5422
Feature activation+1.08e-01

Pos logit contributions
ug+5.600
idd+5.467
ips+5.275
irled+5.216
ooming+5.190
Neg logit contributions
 keeper-3.865
 stranger-3.794
 stall-3.718
 ranger-3.420
 bench-3.414
Token,
Token ID11
Feature activation-1.83e+00

Pos logit contributions
 stranger+0.168
 keeper+0.158
 stall+0.156
 Heidi+0.148
Stan+0.145
Neg logit contributions
ug-0.227
idd-0.213
ooming-0.209
med-0.202
zz-0.200
Token and
Token ID290
Feature activation-2.07e+00

Pos logit contributions
ug+4.070
idd+3.941
ooming+3.784
ips+3.685
irled+3.656
Neg logit contributions
 stranger-2.689
 keeper-2.606
 stall-2.522
 bench-2.410
 ranger-2.402
Token he
Token ID339
Feature activation-2.26e+00

Pos logit contributions
ug+5.146
idd+5.004
ooming+4.937
ips+4.788
irled+4.652
Neg logit contributions
 stranger-2.614
 keeper-2.507
 stall-2.292
 ranger-2.260
 seller-2.212
Token She
Token ID1375
Feature activation-1.32e+00

Pos logit contributions
ug+1.478
idd+1.396
ooming+1.381
zz+1.331
irled+1.328
Neg logit contributions
 stranger-0.805
 keeper-0.761
 stall-0.748
 Heidi-0.698
 ranger-0.688
Token picked
Token ID6497
Feature activation-1.03e+00

Pos logit contributions
ug+3.000
idd+2.788
ooming+2.752
ips+2.657
zz+2.610
Neg logit contributions
 stranger-1.860
 keeper-1.765
 stall-1.711
 seller-1.637
 bench-1.590
Token it
Token ID340
Feature activation+2.78e-01

Pos logit contributions
ug+2.503
idd+2.367
ooming+2.317
irled+2.225
zz+2.202
Neg logit contributions
 stranger-1.277
 keeper-1.172
 stall-1.154
 Heidi-1.103
 seller-1.071
Token up
Token ID510
Feature activation+5.09e-01

Pos logit contributions
 stranger+0.382
 keeper+0.355
 stall+0.353
 Heidi+0.328
Stan+0.328
Neg logit contributions
ug-0.636
idd-0.597
ooming-0.589
zz-0.570
med-0.570
Token and
Token ID290
Feature activation-1.37e+00

Pos logit contributions
 stranger+0.863
 keeper+0.811
 stall+0.806
 Heidi+0.761
 camel+0.761
Neg logit contributions
ug-0.985
idd-0.917
ooming-0.900
zz-0.865
ged-0.863
Token started
Token ID2067
Feature activation-2.28e+00

Pos logit contributions
ug+3.023
idd+2.769
ooming+2.691
ips+2.609
zz+2.555
Neg logit contributions
 stranger-2.108
 keeper-2.015
 stall-1.947
 seller-1.877
 Heidi-1.860
Token to
Token ID284
Feature activation-1.67e+00

Pos logit contributions
ug+6.628
idd+6.207
ooming+5.970
irled+5.848
zz+5.846
Neg logit contributions
 stranger-2.008
 keeper-1.863
 stall-1.775
 bench-1.738
 truth-1.642
Token mark
Token ID1317
Feature activation-9.19e-01

Pos logit contributions
ug+3.523
idd+3.356
ips+3.157
ooming+3.119
med+3.044
Neg logit contributions
 stranger-2.704
 keeper-2.569
 stall-2.481
 bench-2.463
 seller-2.413
Token on
Token ID319
Feature activation-1.03e+00

Pos logit contributions
ug+2.022
idd+1.927
ooming+1.870
irled+1.791
zz+1.782
Neg logit contributions
 stranger-1.324
 keeper-1.267
 stall-1.228
 Heidi-1.155
 ranger-1.145
Token the
Token ID262
Feature activation-1.55e-01

Pos logit contributions
ug+2.486
idd+2.375
ooming+2.323
irled+2.249
zz+2.217
Neg logit contributions
 stranger-1.281
 keeper-1.211
 stall-1.196
 Heidi-1.101
 truth-1.096
Token hoop
Token ID44792
Feature activation-5.90e-01

Pos logit contributions
ug+0.365
idd+0.346
ooming+0.338
irled+0.326
zz+0.326
Neg logit contributions
 stranger-0.199
 keeper-0.187
 stall-0.184
 Heidi-0.169
 ranger-0.168