TOP ACTIVATIONS
MAX = 3.281

happy . At the store , they found some
happy that the fountain was no longer dry .
store . At the store , the little girl
day while she was at the stable it started to rain
next day , the owner saw some more che wed threads
shoes . At the taxi stop , her brother
store . At the store , they picked out
day , Lily was at the park with her mom .
big , brown bear named Bub ba . Bub ba loved
He gives them to Anna and Ben . He says sorry
take a look . At the house , they saw a
The man sighed . He saw their sad faces . He
store . At the store , the mummy bear
you ." Ben saw that Sara was crying and
got his results , he saw that he had failed .
Their mom sighed . She saw the cut on the sheep
her they were going to the park . Lily was eager
all day long . At the end of the day ,
batteries . At the store , the store -
sun . At the end of the day ,

MIN ACTIVATIONS
MIN = -3.366

. She saw a beetle walking on the grass . She
ground . He wanted to drive his car over it ,
, and Peter wanted to ride it . He asked the
horn . They wanted to ride the boat , but there
very happy and wanted to ride it . L
front . Lily liked to ride it around her town and
He was so excited to ride in it . His mom
car . He wanted to drive the car with his stick
's bicycle and wanted to ride it , but Tim my
it closer , so she moved a bit closer . She
it . The crab was walking sideways and had big claws
a boat and wanted to ride it . He asked the
wanted to catch it and hold it in her hands .
to play with it and hold it tight . When she
She saw a house and knocked on the door .
the car and wanted to drive it . He climbed in
. Fl uffy wanted to ride in the je ep .
ball , and she would roll it all around her house
He sniff ed it and rolled it with his paw .
was having so much fun driving it around when he suddenly

INTERVAL 1.619 - 3.281
CONTAINS 0.674%

fish will bite . Then it can pull me in and
doll . She started gathering the pieces : yarn , buttons
They want to play with something else . Something that does
, she was walking in the park when she saw a
They ask Ben and Sue many questions . They show

INTERVAL -0.042 - 1.619
CONTAINS 52.196%

mom saw him and helped him fill his cup . She
holding up a shiny red car . " It is very
upon a time there was a girl called Sally . Sally
. He grabbed an empty soda bottle and threw it at
talked to the man . They made him feel less sad

INTERVAL -1.704 - -0.042
CONTAINS 46.722%

Suddenly , the stick broke and Tim my fell down .
's mom helped her put the picture in an envelope and
a special material that helps keep the cold inside . Ice
like kids '. He was so mean . The
Lily !" From that day on , Tom wasn 't so

INTERVAL -3.366 - -1.704
CONTAINS 0.408%

We love you , but you have to be careful and
. " I don 't know . Maybe someone
you as long as you don 't eat them all ."
saw a man with a big mustache ! She asked her
She couldn 't find Sam because the water was so thick
Token happy
Token ID3772
Feature activation-2.46e-01

Pos logit contributions
keeping+0.281
 ever+0.259
 onwards+0.239
 fright+0.211
 it+0.196
Neg logit contributions
 hey-1.139
aman-1.139
 exception-1.078
land-1.073
 whose-1.060
Token.
Token ID13
Feature activation+4.15e-01

Pos logit contributions
 ever+0.197
 onwards+0.194
keeping+0.190
 fright+0.174
 sideways+0.166
Neg logit contributions
aman-0.472
 hey-0.461
land-0.448
 exception-0.428
 whose-0.426
Token
Token ID198
Feature activation-3.29e-01

Pos logit contributions
aman+0.774
 hey+0.762
land+0.731
 exception+0.702
 extingu+0.693
Neg logit contributions
 ever-0.367
 onwards-0.344
keeping-0.343
 it-0.324
 fright-0.309
Token
Token ID198
Feature activation+6.60e-01

Pos logit contributions
 ever+0.340
keeping+0.327
 onwards+0.318
 fright+0.295
 sideways+0.288
Neg logit contributions
aman-0.569
 hey-0.551
land-0.531
 whose-0.505
 exception-0.504
TokenAt
Token ID2953
Feature activation+4.24e-01

Pos logit contributions
aman+0.937
 hey+0.918
land+0.874
 exception+0.827
 extingu+0.822
Neg logit contributions
 ever-0.884
 onwards-0.845
keeping-0.833
 it-0.818
 fright-0.788
Token the
Token ID262
Feature activation+3.28e+00

Pos logit contributions
aman+0.792
 hey+0.779
land+0.748
 exception+0.718
 extingu+0.708
Neg logit contributions
 ever-0.381
 onwards-0.354
keeping-0.354
 it-0.336
 fright-0.320
Token store
Token ID3650
Feature activation+3.42e-01

Pos logit contributions
der+5.429
ist+5.377
 hey+5.270
aman+5.158
land+5.149
Neg logit contributions
 it-3.949
 ever-3.941
 excited-3.714
 on-3.683
 onwards-3.540
Token,
Token ID11
Feature activation+8.91e-01

Pos logit contributions
aman+0.642
 hey+0.630
land+0.604
 exception+0.581
 extingu+0.573
Neg logit contributions
 ever-0.301
keeping-0.280
 onwards-0.280
 it-0.263
 fright-0.253
Token they
Token ID484
Feature activation+1.01e-01

Pos logit contributions
aman+1.503
 hey+1.458
land+1.428
 exception+1.348
 extingu+1.333
Neg logit contributions
 ever-0.973
 it-0.934
 onwards-0.925
keeping-0.892
 excited-0.861
Token found
Token ID1043
Feature activation+9.81e-01

Pos logit contributions
aman+0.165
 hey+0.162
land+0.153
 exception+0.147
 extingu+0.144
Neg logit contributions
 ever-0.112
keeping-0.108
 onwards-0.107
 it-0.100
 fright-0.100
Token some
Token ID617
Feature activation+4.16e-01

Pos logit contributions
aman+1.701
 hey+1.662
land+1.595
 exception+1.528
 extingu+1.475
Neg logit contributions
 ever-1.034
 it-0.985
 onwards-0.970
keeping-0.961
 excited-0.909
Token happy
Token ID3772
Feature activation+5.82e-01

Pos logit contributions
keeping+0.206
 ever+0.200
 onwards+0.191
 fright+0.174
 it+0.173
Neg logit contributions
 hey-0.463
aman-0.462
land-0.439
 exception-0.422
 whose-0.417
Token that
Token ID326
Feature activation-8.44e-01

Pos logit contributions
aman+1.051
 hey+1.027
land+0.977
 exception+0.945
 extingu+0.935
Neg logit contributions
 ever-0.574
keeping-0.525
 onwards-0.523
 it-0.513
 fright-0.477
Token the
Token ID262
Feature activation+4.62e-01

Pos logit contributions
keeping+1.090
 onwards+1.009
 ever+1.009
 fright+0.985
 sideways+0.970
Neg logit contributions
 hey-1.208
aman-1.204
land-1.108
 whose-1.067
 exception-1.063
Token fountain
Token ID29420
Feature activation+2.98e-01

Pos logit contributions
aman+0.538
 hey+0.519
land+0.485
 exception+0.455
 extingu+0.445
Neg logit contributions
 ever-0.740
 onwards-0.709
keeping-0.709
 it-0.691
 fright-0.676
Token was
Token ID373
Feature activation+5.22e-01

Pos logit contributions
aman+0.535
 hey+0.525
land+0.502
 exception+0.482
 extingu+0.475
Neg logit contributions
 ever-0.282
keeping-0.266
 onwards-0.265
 it-0.249
 fright-0.242
Token no
Token ID645
Feature activation+3.28e+00

Pos logit contributions
aman+0.947
 hey+0.930
land+0.888
 exception+0.853
 extingu+0.843
Neg logit contributions
 ever-0.503
 onwards-0.465
keeping-0.459
 it-0.445
 fright-0.424
Token longer
Token ID2392
Feature activation+5.01e-01

Pos logit contributions
aman+6.659
 Majesty+6.100
land+6.075
 hey+6.059
lves+5.890
Neg logit contributions
 ever-3.968
 it-3.483
 excited-3.093
 on-2.878
 fright-2.765
Token dry
Token ID5894
Feature activation+3.89e-02

Pos logit contributions
aman+0.927
 hey+0.910
land+0.867
 exception+0.835
 extingu+0.824
Neg logit contributions
 ever-0.460
keeping-0.426
 onwards-0.426
 it-0.404
 fright-0.387
Token.
Token ID13
Feature activation+2.34e-01

Pos logit contributions
aman+0.069
 hey+0.068
land+0.065
 exception+0.062
 extingu+0.062
Neg logit contributions
 ever-0.037
 onwards-0.035
keeping-0.035
 it-0.033
 fright-0.032
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
aman+0.434
 hey+0.426
land+0.408
 exception+0.392
 extingu+0.386
Neg logit contributions
 ever-0.211
keeping-0.197
 onwards-0.196
 it-0.183
 fright-0.179
Token
Token ID198
Feature activation+2.33e-01

Pos logit contributions
keeping+0.922
 ever+0.913
 onwards+0.827
 fright+0.773
 sideways+0.767
Neg logit contributions
aman-1.912
 hey-1.877
land-1.850
 whose-1.798
 exception-1.725
Token store
Token ID3650
Feature activation+1.29e-01

Pos logit contributions
aman+3.811
ist+3.674
 hey+3.662
land+3.586
 extingu+3.479
Neg logit contributions
 it-3.487
 excited-3.305
 ever-3.301
keeping-3.214
 angry-3.180
Token.
Token ID13
Feature activation+5.44e-01

Pos logit contributions
aman+0.229
 hey+0.225
land+0.216
 exception+0.207
 extingu+0.204
Neg logit contributions
 ever-0.125
 onwards-0.118
keeping-0.117
 fright-0.109
 it-0.109
Token
Token ID198
Feature activation-4.70e-01

Pos logit contributions
aman+1.009
 hey+0.994
land+0.955
 exception+0.916
 extingu+0.902
Neg logit contributions
 ever-0.488
 onwards-0.462
keeping-0.458
 it-0.437
 fright-0.413
Token
Token ID198
Feature activation-1.07e-01

Pos logit contributions
 ever+0.493
keeping+0.478
 onwards+0.448
 fright+0.429
 sideways+0.426
Neg logit contributions
aman-0.800
 hey-0.774
land-0.744
 whose-0.710
 exception-0.705
TokenAt
Token ID2953
Feature activation+7.14e-01

Pos logit contributions
 ever+0.167
keeping+0.164
 onwards+0.160
 fright+0.154
 it+0.154
Neg logit contributions
aman-0.127
 hey-0.123
land-0.114
 exception-0.106
 whose-0.105
Token the
Token ID262
Feature activation+3.27e+00

Pos logit contributions
aman+1.360
 hey+1.346
land+1.300
 exception+1.251
 extingu+1.231
Neg logit contributions
 ever-0.613
 onwards-0.568
keeping-0.557
 it-0.556
 excited-0.499
Token store
Token ID3650
Feature activation+4.37e-01

Pos logit contributions
ist+5.027
aman+4.985
der+4.929
 hey+4.889
land+4.851
Neg logit contributions
 it-4.098
 excited-4.038
 ever-4.028
 onwards-3.830
 on-3.801
Token,
Token ID11
Feature activation+3.35e-01

Pos logit contributions
aman+0.819
 hey+0.806
land+0.773
 exception+0.744
 extingu+0.735
Neg logit contributions
 ever-0.381
 onwards-0.355
keeping-0.353
 it-0.335
 fright-0.317
Token the
Token ID262
Feature activation+1.39e+00

Pos logit contributions
aman+0.597
 hey+0.586
land+0.560
 exception+0.538
 extingu+0.530
Neg logit contributions
 ever-0.326
keeping-0.306
 onwards-0.306
 it-0.289
 fright-0.280
Token little
Token ID1310
Feature activation+6.98e-01

Pos logit contributions
aman+1.634
 hey+1.492
land+1.452
 exception+1.343
 extingu+1.310
Neg logit contributions
 ever-2.282
 excited-2.198
 onwards-2.189
 it-2.189
keeping-2.163
Token girl
Token ID2576
Feature activation+8.28e-02

Pos logit contributions
aman+0.956
 hey+0.927
land+0.885
 exception+0.823
 extingu+0.805
Neg logit contributions
 ever-0.977
keeping-0.932
 onwards-0.927
 it-0.916
 excited-0.886
Token day
Token ID1110
Feature activation+9.58e-01

Pos logit contributions
aman+1.034
 hey+1.025
land+0.977
 exception+0.942
 extingu+0.935
Neg logit contributions
 ever-0.384
 onwards-0.350
keeping-0.341
 it-0.327
 excited-0.305
Token while
Token ID981
Feature activation-8.88e-01

Pos logit contributions
aman+1.913
 hey+1.853
land+1.797
 extingu+1.737
 exception+1.730
Neg logit contributions
 ever-0.786
 onwards-0.743
 it-0.733
 excited-0.653
keeping-0.644
Token she
Token ID673
Feature activation+6.80e-01

Pos logit contributions
keeping+1.161
 ever+1.112
 fright+1.075
unts+1.038
 onwards+1.033
Neg logit contributions
 hey-1.256
aman-1.247
 whose-1.121
land-1.118
 exception-1.100
Token was
Token ID373
Feature activation+1.14e+00

Pos logit contributions
aman+1.262
 hey+1.248
land+1.181
 exception+1.142
 extingu+1.127
Neg logit contributions
 ever-0.631
 onwards-0.573
keeping-0.570
 it-0.550
 excited-0.524
Token at
Token ID379
Feature activation+1.52e+00

Pos logit contributions
aman+2.141
 hey+2.089
land+1.985
 extingu+1.926
 exception+1.897
Neg logit contributions
 ever-1.056
 onwards-1.002
 it-0.960
 excited-0.928
keeping-0.915
Token the
Token ID262
Feature activation+3.23e+00

Pos logit contributions
 hey+3.158
aman+3.102
land+2.993
 exception+2.875
ist+2.835
Neg logit contributions
 ever-1.105
 it-1.089
keeping-1.027
 onwards-0.968
 excited-0.900
Token stable
Token ID8245
Feature activation+7.15e-01

Pos logit contributions
 hey+4.484
 extingu+4.443
ist+4.240
aman+4.208
 whose+3.974
Neg logit contributions
 it-4.831
 ever-4.703
 on-4.460
 excited-4.457
 hungry-4.369
Token it
Token ID340
Feature activation+9.02e-01

Pos logit contributions
aman+1.364
 hey+1.348
land+1.285
 exception+1.246
 extingu+1.237
Neg logit contributions
 ever-0.609
 onwards-0.563
keeping-0.552
 it-0.535
 excited-0.495
Token started
Token ID2067
Feature activation+4.48e-01

Pos logit contributions
aman+1.713
 hey+1.700
land+1.609
 extingu+1.564
 exception+1.562
Neg logit contributions
 ever-0.800
 onwards-0.734
 it-0.714
keeping-0.681
 excited-0.656
Token to
Token ID284
Feature activation+5.21e-01

Pos logit contributions
aman+0.878
 hey+0.865
land+0.826
 exception+0.798
 extingu+0.788
Neg logit contributions
 ever-0.356
 onwards-0.327
keeping-0.326
 it-0.306
 fright-0.291
Token rain
Token ID6290
Feature activation+1.80e+00

Pos logit contributions
aman+0.825
 hey+0.810
land+0.766
 exception+0.730
 extingu+0.719
Neg logit contributions
 ever-0.610
keeping-0.577
 onwards-0.575
 it-0.556
 fright-0.541
Token next
Token ID1306
Feature activation-1.66e-01

Pos logit contributions
keeping+1.127
 ever+1.093
 onwards+1.090
 it+1.009
 sideways+0.993
Neg logit contributions
aman-1.254
 hey-1.238
land-1.125
 exception-1.078
 extingu-1.068
Token day
Token ID1110
Feature activation+5.20e-01

Pos logit contributions
keeping+0.139
 ever+0.137
 onwards+0.134
 it+0.122
 fright+0.120
Neg logit contributions
aman-0.316
 hey-0.314
land-0.297
 exception-0.288
 extingu-0.281
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
aman+0.994
 hey+0.976
land+0.936
 exception+0.904
 extingu+0.891
Neg logit contributions
 ever-0.448
 onwards-0.418
keeping-0.404
 it-0.394
 fright-0.368
Token the
Token ID262
Feature activation-6.11e-01

Pos logit contributions
aman+1.876
 hey+1.762
land+1.732
 exception+1.635
 extingu+1.606
Neg logit contributions
 ever-1.844
 it-1.793
 onwards-1.737
 excited-1.651
 on-1.643
Token owner
Token ID4870
Feature activation-5.03e-01

Pos logit contributions
 ever+0.823
keeping+0.822
 onwards+0.801
 fright+0.746
 it+0.737
Neg logit contributions
aman-0.872
 hey-0.842
land-0.778
 exception-0.751
 extingu-0.730
Token saw
Token ID2497
Feature activation+3.22e+00

Pos logit contributions
 onwards+0.517
 ever+0.512
keeping+0.512
 fright+0.472
 it+0.466
Neg logit contributions
 hey-0.840
aman-0.835
land-0.779
 whose-0.746
 exception-0.744
Token some
Token ID617
Feature activation+4.30e-01

Pos logit contributions
aman+7.265
 hey+7.036
ist+7.026
 exception+6.953
land+6.853
Neg logit contributions
 it-3.126
 on-2.223
 ever-2.221
 excited-1.978
 hungry-1.920
Token more
Token ID517
Feature activation-1.18e-01

Pos logit contributions
aman+0.665
 hey+0.649
land+0.616
 exception+0.587
 extingu+0.579
Neg logit contributions
 ever-0.522
 onwards-0.494
keeping-0.492
 it-0.474
 fright-0.462
Token che
Token ID1125
Feature activation+1.23e+00

Pos logit contributions
 ever+0.144
 onwards+0.140
keeping+0.139
 fright+0.129
 it+0.129
Neg logit contributions
aman-0.180
 hey-0.175
land-0.169
 exception-0.161
 whose-0.157
Tokenwed
Token ID19103
Feature activation-1.09e-01

Pos logit contributions
aman+1.654
 hey+1.594
land+1.475
 extingu+1.410
 exception+1.400
Neg logit contributions
 ever-1.809
keeping-1.741
 onwards-1.692
 it-1.638
 fright-1.566
Token threads
Token ID14390
Feature activation+2.65e-01

Pos logit contributions
 ever+0.088
keeping+0.083
 onwards+0.082
 fright+0.076
 it+0.073
Neg logit contributions
aman-0.208
 hey-0.203
land-0.197
 exception-0.189
 extingu-0.188
Token shoes
Token ID10012
Feature activation-1.36e-01

Pos logit contributions
aman+0.941
 hey+0.925
land+0.866
 exception+0.822
 extingu+0.805
Neg logit contributions
 ever-0.798
 onwards-0.757
keeping-0.752
 it-0.731
 fright-0.711
Token.
Token ID13
Feature activation+5.17e-01

Pos logit contributions
 ever+0.128
keeping+0.125
 onwards+0.122
 fright+0.115
 it+0.114
Neg logit contributions
aman-0.240
 hey-0.233
land-0.228
 exception-0.218
 extingu-0.214
Token
Token ID198
Feature activation-3.99e-01

Pos logit contributions
aman+0.949
 hey+0.936
land+0.897
 exception+0.861
 extingu+0.848
Neg logit contributions
 ever-0.474
 onwards-0.448
keeping-0.443
 it-0.425
 fright-0.402
Token
Token ID198
Feature activation-1.76e-01

Pos logit contributions
 ever+0.427
keeping+0.413
 onwards+0.394
 fright+0.373
 sideways+0.369
Neg logit contributions
aman-0.669
 hey-0.645
land-0.619
 whose-0.585
 exception-0.584
TokenAt
Token ID2953
Feature activation+4.03e-01

Pos logit contributions
 ever+0.272
keeping+0.265
 onwards+0.259
 it+0.249
 fright+0.248
Neg logit contributions
aman-0.214
 hey-0.208
land-0.191
 exception-0.179
 whose-0.177
Token the
Token ID262
Feature activation+3.21e+00

Pos logit contributions
aman+0.735
 hey+0.722
land+0.692
 exception+0.664
 extingu+0.655
Neg logit contributions
 ever-0.379
 onwards-0.354
keeping-0.354
 it-0.336
 fright-0.323
Token taxi
Token ID17536
Feature activation+1.77e+00

Pos logit contributions
ist+5.718
 hey+5.449
ware+5.444
 extingu+5.418
der+5.405
Neg logit contributions
 ever-3.542
 excited-3.498
 it-3.456
 angry-3.295
 onwards-3.163
Token stop
Token ID2245
Feature activation+3.13e-01

Pos logit contributions
 hey+3.362
aman+3.326
 extingu+3.160
 exception+3.146
land+3.137
Neg logit contributions
 ever-1.543
 onwards-1.447
 it-1.419
keeping-1.341
 excited-1.326
Token,
Token ID11
Feature activation+1.10e-03

Pos logit contributions
aman+0.598
 hey+0.587
land+0.563
 exception+0.542
 extingu+0.535
Neg logit contributions
 ever-0.264
keeping-0.245
 onwards-0.245
 it-0.228
 fright-0.221
Token her
Token ID607
Feature activation+5.70e-01

Pos logit contributions
aman+0.002
 hey+0.002
land+0.002
 exception+0.002
 whose+0.002
Neg logit contributions
 ever-0.001
keeping-0.001
 onwards-0.001
 fright-0.001
unts-0.001
Token brother
Token ID3956
Feature activation+5.67e-01

Pos logit contributions
aman+0.948
 hey+0.930
land+0.884
 exception+0.842
 extingu+0.827
Neg logit contributions
 ever-0.632
 onwards-0.595
keeping-0.590
 it-0.568
 fright-0.553
Token store
Token ID3650
Feature activation+7.15e-01

Pos logit contributions
aman+2.293
 hey+2.242
land+2.060
 exception+1.993
lves+1.977
Neg logit contributions
 ever-2.552
 it-2.408
 onwards-2.406
keeping-2.387
 excited-2.334
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
aman+1.311
 hey+1.296
land+1.216
 exception+1.178
 extingu+1.169
Neg logit contributions
 ever-0.664
keeping-0.624
 onwards-0.619
 it-0.600
 excited-0.566
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
 hey+2.156
aman+2.149
land+2.052
 exception+1.965
 extingu+1.925
Neg logit contributions
 ever-0.881
 onwards-0.871
 it-0.854
keeping-0.817
 excited-0.788
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
aman+0.227
 hey+0.221
land+0.213
 exception+0.203
 whose+0.200
Neg logit contributions
 ever-0.123
keeping-0.117
 onwards-0.114
 it-0.107
 fright-0.106
TokenAt
Token ID2953
Feature activation+6.59e-01

Pos logit contributions
aman+1.414
 hey+1.408
land+1.314
 exception+1.246
 extingu+1.243
Neg logit contributions
 ever-1.343
 onwards-1.313
 it-1.265
keeping-1.251
 excited-1.224
Token the
Token ID262
Feature activation+3.21e+00

Pos logit contributions
aman+1.245
 hey+1.231
land+1.185
 exception+1.139
 extingu+1.121
Neg logit contributions
 ever-0.574
 onwards-0.534
keeping-0.526
 it-0.519
 fright-0.474
Token store
Token ID3650
Feature activation+7.12e-01

Pos logit contributions
 hey+5.038
ist+5.020
der+4.976
lves+4.868
aman+4.816
Neg logit contributions
 it-3.982
 ever-3.969
 on-3.761
 excited-3.750
 onwards-3.717
Token,
Token ID11
Feature activation+1.09e+00

Pos logit contributions
aman+1.364
 hey+1.348
land+1.290
 exception+1.247
 extingu+1.239
Neg logit contributions
 ever-0.595
 onwards-0.556
keeping-0.540
 it-0.535
 excited-0.497
Token they
Token ID484
Feature activation+3.37e-01

Pos logit contributions
aman+1.939
 hey+1.925
land+1.853
 exception+1.769
 extingu+1.737
Neg logit contributions
 ever-1.058
 it-1.029
 onwards-1.013
keeping-0.951
 excited-0.938
Token picked
Token ID6497
Feature activation+2.80e-01

Pos logit contributions
aman+0.560
 hey+0.549
land+0.523
 exception+0.501
 extingu+0.493
Neg logit contributions
 ever-0.367
keeping-0.347
 onwards-0.347
 it-0.330
 fright-0.321
Token out
Token ID503
Feature activation+5.81e-01

Pos logit contributions
aman+0.438
 hey+0.428
land+0.407
 exception+0.388
 extingu+0.382
Neg logit contributions
 ever-0.332
keeping-0.315
 onwards-0.314
 it-0.298
 fright-0.293
Token day
Token ID1110
Feature activation+9.72e-01

Pos logit contributions
aman+0.585
 hey+0.576
land+0.553
 exception+0.534
 extingu+0.527
Neg logit contributions
 ever-0.223
keeping-0.208
 onwards-0.207
 it-0.191
 fright-0.184
Token,
Token ID11
Feature activation+1.30e+00

Pos logit contributions
aman+1.987
 hey+1.931
land+1.888
 exception+1.813
 extingu+1.798
Neg logit contributions
 ever-0.749
 onwards-0.689
 it-0.683
 excited-0.601
keeping-0.594
Token Lily
Token ID20037
Feature activation-1.51e-01

Pos logit contributions
aman+2.347
land+2.252
 hey+2.241
 extingu+2.097
 exception+2.089
Neg logit contributions
 it-1.366
 ever-1.354
 onwards-1.241
 excited-1.215
 on-1.184
Token was
Token ID373
Feature activation+1.65e+00

Pos logit contributions
 ever+0.154
 onwards+0.151
keeping+0.148
 fright+0.139
 it+0.136
Neg logit contributions
aman-0.255
 hey-0.249
land-0.239
 exception-0.229
 whose-0.227
Token at
Token ID379
Feature activation+1.26e+00

Pos logit contributions
aman+3.460
land+3.207
 hey+3.142
 extingu+3.107
 exception+2.939
Neg logit contributions
 ever-1.438
 excited-1.300
 it-1.274
 onwards-1.271
 on-1.252
Token the
Token ID262
Feature activation+3.20e+00

Pos logit contributions
aman+2.443
 hey+2.372
land+2.327
 exception+2.192
 extingu+2.164
Neg logit contributions
 ever-1.119
 it-1.017
keeping-1.015
 onwards-0.949
 excited-0.913
Token park
Token ID3952
Feature activation-5.40e-02

Pos logit contributions
aman+4.993
 extingu+4.772
 hey+4.660
lves+4.526
ist+4.475
Neg logit contributions
 ever-4.488
 it-4.229
 on-4.063
 excited-3.872
 hungry-3.865
Token with
Token ID351
Feature activation+4.34e-01

Pos logit contributions
 ever+0.066
keeping+0.065
 onwards+0.064
 fright+0.060
 it+0.060
Neg logit contributions
aman-0.080
 hey-0.079
land-0.075
 exception-0.072
 whose-0.071
Token her
Token ID607
Feature activation+1.65e+00

Pos logit contributions
aman+0.725
 hey+0.710
land+0.678
 exception+0.647
 extingu+0.637
Neg logit contributions
 ever-0.479
 onwards-0.451
keeping-0.450
 it-0.432
 fright-0.417
Token mom
Token ID1995
Feature activation+1.78e-01

Pos logit contributions
aman+3.280
 hey+3.166
land+3.071
 exception+2.921
 extingu+2.850
Neg logit contributions
 ever-1.560
 it-1.371
 onwards-1.316
 excited-1.310
 on-1.301
Token.
Token ID13
Feature activation+8.60e-02

Pos logit contributions
aman+0.309
 hey+0.303
land+0.290
 exception+0.278
 extingu+0.274
Neg logit contributions
 ever-0.180
 onwards-0.171
keeping-0.170
 it-0.160
 fright-0.158
Token big
Token ID1263
Feature activation-6.87e-01

Pos logit contributions
aman+0.386
 hey+0.375
land+0.348
 exception+0.325
 extingu+0.317
Neg logit contributions
 ever-0.565
keeping-0.544
 onwards-0.544
 it-0.526
 fright-0.517
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
keeping+0.997
 ever+0.965
 onwards+0.965
 fright+0.877
 it+0.852
Neg logit contributions
aman-0.927
 hey-0.905
land-0.821
 exception-0.797
 extingu-0.793
Token brown
Token ID7586
Feature activation+2.92e-01

Pos logit contributions
keeping+2.666
 ever+2.382
 onwards+2.261
 fright+2.157
 Go+2.058
Neg logit contributions
 hey-2.380
aman-2.351
 whose-2.211
 extingu-2.197
 exception-2.185
Token bear
Token ID6842
Feature activation+8.89e-01

Pos logit contributions
aman+0.473
 hey+0.463
land+0.441
 exception+0.421
 extingu+0.415
Neg logit contributions
 ever-0.330
 onwards-0.314
keeping-0.314
 it-0.297
 fright-0.290
Token named
Token ID3706
Feature activation+6.33e-01

Pos logit contributions
aman+1.622
 hey+1.567
land+1.497
 exception+1.439
 extingu+1.404
Neg logit contributions
 ever-0.896
keeping-0.804
 it-0.787
 onwards-0.782
 excited-0.736
Token Bub
Token ID19645
Feature activation+3.18e+00

Pos logit contributions
aman+1.032
 hey+1.009
land+0.962
 exception+0.915
 extingu+0.901
Neg logit contributions
 ever-0.725
 onwards-0.681
keeping-0.679
 it-0.675
 fright-0.636
Tokenba
Token ID7012
Feature activation+9.19e-01

Pos logit contributions
aman+4.994
land+4.450
 hey+4.449
 extingu+4.170
 exception+4.068
Neg logit contributions
 ever-4.465
 on-3.930
 it-3.843
 excited-3.798
keeping-3.783
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
aman+1.771
 hey+1.716
land+1.615
 exception+1.575
 extingu+1.546
Neg logit contributions
 ever-0.851
keeping-0.755
 onwards-0.729
 it-0.717
 excited-0.682
Token Bub
Token ID19645
Feature activation+1.91e+00

Pos logit contributions
aman+0.195
 hey+0.191
land+0.176
 exception+0.165
 extingu+0.160
Neg logit contributions
 ever-0.309
keeping-0.300
 onwards-0.298
 it-0.287
 fright-0.285
Tokenba
Token ID7012
Feature activation-3.71e-01

Pos logit contributions
aman+2.432
 hey+2.305
 extingu+2.071
land+2.045
 exception+1.956
Neg logit contributions
 ever-3.083
 onwards-2.787
keeping-2.768
 it-2.768
 excited-2.741
Token loved
Token ID6151
Feature activation-5.32e-01

Pos logit contributions
 onwards+0.490
keeping+0.487
 ever+0.483
 fright+0.464
 it+0.449
Neg logit contributions
aman-0.496
 hey-0.496
land-0.478
 whose-0.461
 exception-0.448
Token He
Token ID679
Feature activation+1.33e+00

Pos logit contributions
aman+0.799
 hey+0.785
land+0.754
 exception+0.728
 extingu+0.718
Neg logit contributions
 ever-0.287
 onwards-0.264
keeping-0.263
 it-0.245
 fright-0.233
Token gives
Token ID3607
Feature activation+6.84e-01

Pos logit contributions
aman+2.290
 hey+2.245
land+2.133
 exception+2.040
 extingu+1.987
Neg logit contributions
 ever-1.420
 it-1.342
 onwards-1.295
 excited-1.268
 fright-1.260
Token them
Token ID606
Feature activation+9.75e-01

Pos logit contributions
aman+1.126
 hey+1.095
land+1.046
 exception+1.001
 extingu+0.991
Neg logit contributions
 ever-0.778
 onwards-0.728
 it-0.727
keeping-0.726
 fright-0.677
Token to
Token ID284
Feature activation+1.93e+00

Pos logit contributions
aman+1.815
 hey+1.751
land+1.680
 exception+1.618
 extingu+1.568
Neg logit contributions
 ever-0.931
 it-0.850
 onwards-0.832
keeping-0.831
 fright-0.781
Token Anna
Token ID11735
Feature activation-1.20e-01

Pos logit contributions
aman+3.620
land+3.368
 hey+3.335
 exception+3.275
lves+3.264
Neg logit contributions
 it-2.037
 ever-1.981
 fright-1.711
 onwards-1.651
 excited-1.644
Token and
Token ID290
Feature activation+3.18e+00

Pos logit contributions
 ever+0.102
 onwards+0.095
keeping+0.095
 fright+0.086
 it+0.084
Neg logit contributions
aman-0.227
 hey-0.222
land-0.217
 whose-0.207
 exception-0.206
Token Ben
Token ID3932
Feature activation+3.29e-01

Pos logit contributions
aman+7.237
land+6.857
lves+6.855
 hey+6.757
 exception+6.653
Neg logit contributions
 it-2.766
 yawn-2.240
 excited-2.096
 ever-2.052
 on-2.009
Token.
Token ID13
Feature activation+7.40e-01

Pos logit contributions
aman+0.581
 hey+0.571
land+0.545
 exception+0.523
 extingu+0.515
Neg logit contributions
 ever-0.324
keeping-0.304
 onwards-0.304
 it-0.287
 fright-0.279
Token He
Token ID679
Feature activation+1.56e+00

Pos logit contributions
aman+1.459
 hey+1.428
land+1.359
 exception+1.320
 extingu+1.295
Neg logit contributions
 ever-0.594
 onwards-0.570
 it-0.548
keeping-0.543
 excited-0.509
Token says
Token ID1139
Feature activation-7.52e-01

Pos logit contributions
aman+2.806
 hey+2.741
land+2.618
 exception+2.501
lves+2.421
Neg logit contributions
 ever-1.569
 it-1.497
 excited-1.407
 onwards-1.402
 fright-1.372
Token sorry
Token ID7926
Feature activation-1.42e-01

Pos logit contributions
keeping+0.866
 ever+0.810
 onwards+0.759
 fright+0.748
 sideways+0.715
Neg logit contributions
 hey-1.226
aman-1.188
land-1.150
 whose-1.104
 exception-1.060
Token take
Token ID1011
Feature activation-8.31e-01

Pos logit contributions
aman+0.933
 hey+0.915
land+0.880
 exception+0.844
 extingu+0.830
Neg logit contributions
 ever-0.455
keeping-0.424
 onwards-0.418
 it-0.408
 fright-0.390
Token a
Token ID257
Feature activation+5.58e-01

Pos logit contributions
 ever+1.085
 onwards+1.051
keeping+1.050
 sideways+1.008
 fright+0.990
Neg logit contributions
aman-1.145
 hey-1.140
land-1.081
 exception-1.025
 extingu-1.021
Token look
Token ID804
Feature activation+7.20e-01

Pos logit contributions
aman+0.860
 hey+0.841
land+0.794
 exception+0.757
 extingu+0.744
Neg logit contributions
 ever-0.678
 onwards-0.640
keeping-0.635
 it-0.622
 fright-0.607
Token.
Token ID13
Feature activation+6.13e-01

Pos logit contributions
aman+1.388
 hey+1.367
land+1.283
 exception+1.240
 extingu+1.232
Neg logit contributions
 ever-0.620
 onwards-0.580
keeping-0.557
 it-0.549
 excited-0.515
Token At
Token ID1629
Feature activation+3.66e-01

Pos logit contributions
aman+1.145
 hey+1.129
land+1.075
 exception+1.037
 extingu+1.026
Neg logit contributions
 ever-0.540
 onwards-0.509
keeping-0.501
 it-0.489
 excited-0.462
Token the
Token ID262
Feature activation+3.17e+00

Pos logit contributions
aman+0.678
 hey+0.667
land+0.639
 exception+0.614
 extingu+0.605
Neg logit contributions
 ever-0.331
keeping-0.309
 onwards-0.308
 it-0.291
 fright-0.280
Token house
Token ID2156
Feature activation+7.92e-01

Pos logit contributions
ist+5.027
der+4.898
 hey+4.847
land+4.835
aman+4.766
Neg logit contributions
 it-3.820
 ever-3.771
 hungry-3.649
 on-3.644
 excited-3.616
Token,
Token ID11
Feature activation+2.28e-01

Pos logit contributions
aman+1.541
 hey+1.515
land+1.460
 exception+1.413
 extingu+1.404
Neg logit contributions
 ever-0.654
 onwards-0.595
 it-0.584
keeping-0.575
 excited-0.532
Token they
Token ID484
Feature activation-1.27e-01

Pos logit contributions
aman+0.391
 hey+0.385
land+0.366
 exception+0.351
 extingu+0.345
Neg logit contributions
 ever-0.235
keeping-0.224
 onwards-0.222
 it-0.207
 fright-0.205
Token saw
Token ID2497
Feature activation+5.77e-01

Pos logit contributions
 ever+0.151
keeping+0.149
 onwards+0.147
 fright+0.137
 it+0.135
Neg logit contributions
aman-0.195
 hey-0.193
land-0.180
 exception-0.172
 whose-0.171
Token a
Token ID257
Feature activation-1.46e-01

Pos logit contributions
aman+0.930
 hey+0.908
land+0.864
 exception+0.828
 extingu+0.809
Neg logit contributions
 ever-0.667
keeping-0.632
 onwards-0.626
 it-0.617
 fright-0.587
TokenThe
Token ID464
Feature activation-1.69e-01

Pos logit contributions
aman+0.808
 hey+0.777
land+0.742
 exception+0.711
 extingu+0.688
Neg logit contributions
 ever-0.746
 onwards-0.712
keeping-0.707
 it-0.691
 fright-0.669
Token man
Token ID582
Feature activation-1.49e-01

Pos logit contributions
 ever+0.182
keeping+0.176
 onwards+0.172
 it+0.157
 fright+0.156
Neg logit contributions
aman-0.291
 hey-0.289
land-0.272
 exception-0.261
 extingu-0.257
Token sighed
Token ID21893
Feature activation-8.84e-01

Pos logit contributions
 ever+0.142
keeping+0.141
 onwards+0.136
 it+0.123
 fright+0.121
Neg logit contributions
 hey-0.262
aman-0.262
land-0.247
 whose-0.238
 extingu-0.236
Token.
Token ID13
Feature activation+4.71e-01

Pos logit contributions
keeping+0.998
 ever+0.993
 onwards+0.928
 fright+0.857
unts+0.833
Neg logit contributions
aman-1.400
 hey-1.390
land-1.367
der-1.298
 whose-1.294
Token He
Token ID679
Feature activation+5.23e-01

Pos logit contributions
aman+0.976
 hey+0.960
land+0.922
 exception+0.891
 extingu+0.878
Neg logit contributions
 ever-0.325
 onwards-0.301
keeping-0.296
 it-0.279
 fright-0.262
Token saw
Token ID2497
Feature activation+3.17e+00

Pos logit contributions
aman+1.007
 hey+0.990
land+0.949
 exception+0.917
 extingu+0.900
Neg logit contributions
 ever-0.440
 onwards-0.402
keeping-0.396
 it-0.385
 fright-0.366
Token their
Token ID511
Feature activation-1.09e+00

Pos logit contributions
aman+7.355
land+6.824
 lookout+6.815
 hey+6.737
 exception+6.695
Neg logit contributions
 it-3.055
 on-2.254
 ever-2.114
 excited-1.980
 angry-1.960
Token sad
Token ID6507
Feature activation-7.48e-01

Pos logit contributions
 onwards+1.431
 ever+1.364
keeping+1.315
 it+1.230
 sideways+1.227
Neg logit contributions
aman-1.635
 hey-1.607
land-1.511
 extingu-1.449
 whose-1.425
Token faces
Token ID6698
Feature activation-1.71e-01

Pos logit contributions
 ever+0.881
 onwards+0.870
keeping+0.813
 fright+0.767
 it+0.766
Neg logit contributions
aman-1.179
 hey-1.159
land-1.131
 extingu-1.058
 exception-1.050
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
 ever+0.129
keeping+0.128
 onwards+0.119
 fright+0.108
 it+0.106
Neg logit contributions
aman-0.332
 hey-0.329
land-0.318
 exception-0.304
 extingu-0.303
Token He
Token ID679
Feature activation-4.53e-01

Pos logit contributions
aman+0.462
 hey+0.454
land+0.437
 exception+0.421
 extingu+0.417
Neg logit contributions
 ever-0.171
keeping-0.157
 onwards-0.156
 it-0.143
 fright-0.139
Token store
Token ID3650
Feature activation-2.67e-01

Pos logit contributions
aman+2.730
 hey+2.649
land+2.609
ist+2.478
 exception+2.385
Neg logit contributions
 ever-3.207
 it-3.104
keeping-3.003
 fright-2.960
 hungry-2.931
Token.
Token ID13
Feature activation+1.70e+00

Pos logit contributions
 ever+0.251
 onwards+0.242
keeping+0.234
 fright+0.229
 it+0.219
Neg logit contributions
aman-0.473
 hey-0.464
land-0.452
 exception-0.433
 whose-0.427
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
 hey+3.683
aman+3.660
land+3.524
 exception+3.410
 extingu+3.299
Neg logit contributions
 it-1.106
 onwards-1.010
 excited-0.982
 ever-0.954
keeping-0.930
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
 ever+0.121
keeping+0.116
 onwards+0.113
 fright+0.105
 sideways+0.103
Neg logit contributions
aman-0.205
 hey-0.199
land-0.192
 exception-0.183
 whose-0.181
TokenAt
Token ID2953
Feature activation+2.26e-01

Pos logit contributions
aman+1.815
 hey+1.793
land+1.673
 exception+1.593
 extingu+1.586
Neg logit contributions
 ever-1.725
 onwards-1.673
 it-1.649
keeping-1.621
 excited-1.593
Token the
Token ID262
Feature activation+3.16e+00

Pos logit contributions
aman+0.412
 hey+0.403
land+0.385
 exception+0.370
 extingu+0.366
Neg logit contributions
 ever-0.212
keeping-0.200
 onwards-0.199
 it-0.185
 fright-0.182
Token store
Token ID3650
Feature activation+4.24e-01

Pos logit contributions
ist+5.109
der+4.918
 hey+4.865
land+4.812
aman+4.764
Neg logit contributions
 it-4.067
 excited-3.837
 ever-3.719
 onwards-3.711
 on-3.649
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
aman+0.801
 hey+0.788
land+0.756
 exception+0.728
 extingu+0.718
Neg logit contributions
 ever-0.367
 onwards-0.341
keeping-0.340
 it-0.322
 fright-0.305
Token the
Token ID262
Feature activation+9.42e-01

Pos logit contributions
aman+2.130
 hey+2.056
land+2.028
 exception+1.927
 extingu+1.865
Neg logit contributions
 ever-1.001
 it-0.991
 onwards-0.935
 excited-0.885
keeping-0.868
Token mummy
Token ID38567
Feature activation-3.10e-02

Pos logit contributions
aman+1.361
 hey+1.326
land+1.282
 exception+1.185
 whose+1.156
Neg logit contributions
 ever-1.254
 onwards-1.186
keeping-1.181
 it-1.161
 excited-1.144
Token bear
Token ID6842
Feature activation+4.55e-01

Pos logit contributions
 ever+0.022
keeping+0.020
 onwards+0.020
 fright+0.018
 it+0.018
Neg logit contributions
aman-0.064
 hey-0.062
land-0.060
 exception-0.058
 extingu-0.057
Token you
Token ID345
Feature activation-3.38e-02

Pos logit contributions
aman+0.621
 hey+0.609
land+0.581
 exception+0.558
 extingu+0.549
Neg logit contributions
 ever-0.364
keeping-0.342
 onwards-0.341
 it-0.324
 fright-0.314
Token."
Token ID526
Feature activation-3.81e-01

Pos logit contributions
 ever+0.033
keeping+0.032
 onwards+0.032
 it+0.030
 fright+0.029
Neg logit contributions
aman-0.058
 hey-0.058
land-0.056
 exception-0.053
 whose-0.052
Token
Token ID198
Feature activation-6.91e-03

Pos logit contributions
 ever+0.411
keeping+0.401
 onwards+0.381
 fright+0.359
 sideways+0.345
Neg logit contributions
aman-0.636
 hey-0.625
land-0.591
 whose-0.564
 exception-0.555
Token
Token ID198
Feature activation+6.77e-01

Pos logit contributions
 ever+0.007
keeping+0.007
 onwards+0.006
 fright+0.006
 it+0.006
Neg logit contributions
aman-0.012
 hey-0.012
land-0.011
 whose-0.011
 exception-0.011
TokenBen
Token ID11696
Feature activation-3.39e-01

Pos logit contributions
aman+1.017
 hey+0.987
land+0.949
 exception+0.911
 extingu+0.891
Neg logit contributions
 ever-0.860
 onwards-0.820
keeping-0.809
 it-0.798
 fright-0.765
Token saw
Token ID2497
Feature activation+3.15e+00

Pos logit contributions
 ever+0.323
keeping+0.319
 onwards+0.314
 it+0.274
 sideways+0.272
Neg logit contributions
 hey-0.601
aman-0.600
land-0.563
 whose-0.544
 extingu-0.537
Token that
Token ID326
Feature activation+2.87e-01

Pos logit contributions
aman+7.141
land+6.718
 hey+6.579
ist+6.558
 lookout+6.431
Neg logit contributions
 it-3.094
 angry-2.192
 excited-2.114
 ever-2.044
 hungry-2.038
Token Sara
Token ID24799
Feature activation-5.55e-01

Pos logit contributions
aman+0.459
 hey+0.450
land+0.427
 exception+0.408
 extingu+0.401
Neg logit contributions
 ever-0.336
keeping-0.320
 onwards-0.319
 it-0.303
 fright-0.297
Token was
Token ID373
Feature activation+2.08e-01

Pos logit contributions
keeping+0.597
 ever+0.585
 onwards+0.575
 fright+0.527
 sideways+0.514
Neg logit contributions
 hey-0.902
aman-0.890
land-0.851
 whose-0.828
 extingu-0.796
Token crying
Token ID13774
Feature activation-2.74e-03

Pos logit contributions
aman+0.423
 hey+0.418
land+0.400
 exception+0.388
 extingu+0.383
Neg logit contributions
 ever-0.148
keeping-0.138
 onwards-0.136
 it-0.124
 fright-0.119
Token and
Token ID290
Feature activation-3.26e-01

Pos logit contributions
 ever+0.002
keeping+0.002
 onwards+0.002
 it+0.002
 fright+0.002
Neg logit contributions
aman-0.005
 hey-0.005
land-0.005
 exception-0.005
 whose-0.005
Token got
Token ID1392
Feature activation-1.44e+00

Pos logit contributions
aman+0.322
 hey+0.316
land+0.302
 exception+0.289
 extingu+0.284
Neg logit contributions
 ever-0.179
keeping-0.172
 onwards-0.169
 it-0.158
 fright-0.154
Token his
Token ID465
Feature activation+4.54e-01

Pos logit contributions
keeping+1.569
 ever+1.456
unts+1.414
 onwards+1.403
 Go+1.368
Neg logit contributions
 hey-2.405
aman-2.310
 exception-2.212
land-2.088
 extingu-2.081
Token results
Token ID2482
Feature activation+5.64e-01

Pos logit contributions
aman+0.619
 hey+0.602
land+0.567
 exception+0.535
 extingu+0.527
Neg logit contributions
 ever-0.639
 onwards-0.610
keeping-0.607
 it-0.587
 fright-0.576
Token,
Token ID11
Feature activation+8.73e-01

Pos logit contributions
aman+1.081
 hey+1.060
land+1.016
 exception+0.985
 extingu+0.971
Neg logit contributions
 ever-0.483
 onwards-0.445
keeping-0.433
 it-0.421
 fright-0.396
Token he
Token ID339
Feature activation-4.14e-01

Pos logit contributions
aman+1.404
 hey+1.349
land+1.298
 exception+1.231
 extingu+1.214
Neg logit contributions
 ever-1.060
 it-1.003
 onwards-1.001
keeping-0.956
 excited-0.928
Token saw
Token ID2497
Feature activation+3.13e+00

Pos logit contributions
keeping+0.448
 ever+0.432
 onwards+0.430
 fright+0.392
 it+0.383
Neg logit contributions
aman-0.686
 hey-0.680
land-0.637
 whose-0.612
 exception-0.608
Token that
Token ID326
Feature activation+2.12e-01

Pos logit contributions
aman+6.754
 hey+6.183
land+6.135
 exception+5.958
ist+5.951
Neg logit contributions
 it-3.600
 ever-2.921
 on-2.758
 excited-2.641
 angry-2.444
Token he
Token ID339
Feature activation-6.55e-01

Pos logit contributions
aman+0.367
 hey+0.361
land+0.345
 exception+0.332
 extingu+0.326
Neg logit contributions
 ever-0.214
keeping-0.204
 onwards-0.202
 it-0.188
 fright-0.186
Token had
Token ID550
Feature activation+4.96e-01

Pos logit contributions
keeping+0.678
 onwards+0.644
 ever+0.616
 fright+0.568
 sideways+0.553
Neg logit contributions
 hey-1.109
aman-1.102
land-1.059
 whose-1.025
 exception-1.015
Token failed
Token ID4054
Feature activation-2.28e-01

Pos logit contributions
aman+0.851
 hey+0.834
land+0.793
 exception+0.761
 extingu+0.752
Neg logit contributions
 ever-0.526
keeping-0.491
 onwards-0.489
 it-0.471
 fright-0.454
Token.
Token ID13
Feature activation+4.28e-01

Pos logit contributions
 ever+0.215
keeping+0.208
 onwards+0.206
 fright+0.189
 it+0.185
Neg logit contributions
aman-0.402
 hey-0.392
land-0.382
 exception-0.364
 whose-0.361
TokenTheir
Token ID14574
Feature activation-7.41e-01

Pos logit contributions
 ever+0.022
keeping+0.021
 onwards+0.021
 it+0.019
 fright+0.019
Neg logit contributions
aman-0.047
 hey-0.046
land-0.043
 whose-0.042
 extingu-0.042
Token mom
Token ID1995
Feature activation+1.73e-01

Pos logit contributions
 onwards+0.772
 ever+0.769
keeping+0.768
 it+0.685
 sideways+0.684
Neg logit contributions
 hey-1.254
aman-1.238
land-1.147
 extingu-1.120
 exception-1.116
Token sighed
Token ID21893
Feature activation-9.36e-01

Pos logit contributions
aman+0.325
 hey+0.321
land+0.306
 exception+0.293
 extingu+0.291
Neg logit contributions
 ever-0.150
keeping-0.143
 onwards-0.141
 it-0.129
 fright-0.126
Token.
Token ID13
Feature activation+1.13e-01

Pos logit contributions
keeping+1.116
 ever+1.110
 onwards+1.038
 fright+0.977
 sideways+0.949
Neg logit contributions
aman-1.427
 hey-1.422
land-1.379
der-1.307
 whose-1.305
Token She
Token ID1375
Feature activation+6.96e-01

Pos logit contributions
aman+0.204
 hey+0.200
land+0.191
 exception+0.183
 extingu+0.181
Neg logit contributions
 ever-0.108
keeping-0.102
 onwards-0.099
 it-0.093
 fright-0.092
Token saw
Token ID2497
Feature activation+3.12e+00

Pos logit contributions
aman+1.321
 hey+1.292
land+1.247
 exception+1.208
 extingu+1.177
Neg logit contributions
 ever-0.610
 onwards-0.553
 it-0.542
keeping-0.541
 excited-0.511
Token the
Token ID262
Feature activation-4.09e-01

Pos logit contributions
aman+6.879
 lookout+6.746
 exception+6.651
 hey+6.650
land+6.547
Neg logit contributions
 it-2.904
 on-2.129
 ever-1.926
 angry-1.909
 excited-1.857
Token cut
Token ID2005
Feature activation+2.61e-01

Pos logit contributions
 ever+0.687
keeping+0.662
 onwards+0.657
 sideways+0.615
 fright+0.613
Neg logit contributions
aman-0.469
 hey-0.433
land-0.415
 exception-0.389
 extingu-0.386
Token on
Token ID319
Feature activation+7.20e-01

Pos logit contributions
aman+0.469
 hey+0.460
land+0.441
 exception+0.423
 extingu+0.418
Neg logit contributions
 ever-0.248
keeping-0.233
 onwards-0.232
 it-0.218
 fright-0.213
Token the
Token ID262
Feature activation-3.10e-01

Pos logit contributions
aman+1.341
 hey+1.331
land+1.270
 exception+1.232
lves+1.194
Neg logit contributions
 ever-0.635
keeping-0.593
 it-0.593
 onwards-0.590
 fright-0.545
Token sheep
Token ID15900
Feature activation-4.60e-01

Pos logit contributions
 ever+0.491
 onwards+0.469
keeping+0.466
 it+0.440
 fright+0.437
Neg logit contributions
aman-0.376
 hey-0.356
land-0.342
 extingu-0.317
 exception-0.311
Token her
Token ID607
Feature activation-1.12e+00

Pos logit contributions
 ever+0.274
 onwards+0.268
keeping+0.265
 sideways+0.245
 fright+0.242
Neg logit contributions
aman-0.348
 hey-0.340
land-0.321
 exception-0.307
 extingu-0.296
Token they
Token ID484
Feature activation-2.68e+00

Pos logit contributions
keeping+1.376
 ever+1.364
 onwards+1.359
 sideways+1.325
 fright+1.293
Neg logit contributions
 hey-1.606
aman-1.597
land-1.571
 whose-1.474
 exception-1.442
Token were
Token ID547
Feature activation-9.11e-01

Pos logit contributions
keeping+3.974
 onwards+3.509
 sideways+3.279
imp+3.272
 fright+3.223
Neg logit contributions
 hey-4.184
 whose-3.419
 exception-3.386
 who-3.362
aman-3.322
Token going
Token ID1016
Feature activation-8.64e-01

Pos logit contributions
keeping+1.154
 ever+1.115
 onwards+1.070
 fright+1.066
 sideways+1.049
Neg logit contributions
 hey-1.361
aman-1.329
 exception-1.233
land-1.207
 whose-1.173
Token to
Token ID284
Feature activation+2.96e-01

Pos logit contributions
 ever+0.764
keeping+0.738
 onwards+0.719
 fright+0.712
unts+0.673
Neg logit contributions
 hey-1.538
aman-1.525
land-1.500
 exception-1.457
 extingu-1.431
Token the
Token ID262
Feature activation+3.11e+00

Pos logit contributions
aman+0.511
 hey+0.501
land+0.478
 exception+0.458
 extingu+0.452
Neg logit contributions
 ever-0.305
 onwards-0.288
keeping-0.288
 it-0.271
 fright-0.264
Token park
Token ID3952
Feature activation-7.11e-01

Pos logit contributions
aman+4.635
land+4.489
 hey+4.328
ist+4.280
der+4.247
Neg logit contributions
 ever-4.165
 it-4.090
 on-3.877
 excited-3.776
 sideways-3.738
Token.
Token ID13
Feature activation+6.96e-01

Pos logit contributions
 onwards+0.725
keeping+0.722
 ever+0.722
 fright+0.687
unts+0.664
Neg logit contributions
aman-1.179
land-1.159
 hey-1.146
 exception-1.093
 whose-1.087
Token Lily
Token ID20037
Feature activation+1.34e-01

Pos logit contributions
aman+1.180
 hey+1.164
land+1.107
 exception+1.062
 extingu+1.048
Neg logit contributions
 ever-0.728
 onwards-0.709
keeping-0.695
 it-0.691
 excited-0.653
Token was
Token ID373
Feature activation-9.19e-01

Pos logit contributions
aman+0.239
 hey+0.234
land+0.223
 exception+0.214
 whose+0.212
Neg logit contributions
 ever-0.130
keeping-0.124
 onwards-0.123
 it-0.113
 fright-0.112
Token eager
Token ID11069
Feature activation-7.78e-01

Pos logit contributions
keeping+0.752
 ever+0.723
 onwards+0.664
 sideways+0.616
 fright+0.610
Neg logit contributions
 hey-1.748
aman-1.738
 exception-1.650
 whose-1.633
land-1.620
Token all
Token ID477
Feature activation+2.56e-01

Pos logit contributions
 ever+0.354
keeping+0.343
 onwards+0.331
 fright+0.314
 it+0.303
Neg logit contributions
 hey-0.463
aman-0.456
land-0.440
 whose-0.417
 exception-0.416
Token day
Token ID1110
Feature activation-1.01e+00

Pos logit contributions
aman+0.509
 hey+0.500
land+0.481
 exception+0.464
 extingu+0.457
Neg logit contributions
 ever-0.195
keeping-0.182
 onwards-0.181
 it-0.166
 fright-0.162
Token long
Token ID890
Feature activation-2.83e-01

Pos logit contributions
keeping+0.958
 ever+0.912
 fright+0.888
 onwards+0.877
 sideways+0.829
Neg logit contributions
aman-1.708
 hey-1.680
land-1.665
 exception-1.565
 whose-1.556
Token.
Token ID13
Feature activation+7.56e-01

Pos logit contributions
 ever+0.292
keeping+0.283
 onwards+0.281
 fright+0.265
 it+0.256
Neg logit contributions
aman-0.467
 hey-0.460
land-0.448
 whose-0.421
 exception-0.417
Token At
Token ID1629
Feature activation+1.99e-01

Pos logit contributions
aman+1.231
 hey+1.201
land+1.146
 exception+1.091
 extingu+1.084
Neg logit contributions
 ever-0.866
 onwards-0.834
keeping-0.808
 it-0.807
 excited-0.766
Token the
Token ID262
Feature activation+3.09e+00

Pos logit contributions
aman+0.362
 hey+0.355
land+0.339
 exception+0.326
 extingu+0.322
Neg logit contributions
 ever-0.186
keeping-0.176
 onwards-0.175
 it-0.162
 fright-0.160
Token end
Token ID886
Feature activation-3.09e-01

Pos logit contributions
 hey+5.789
 extingu+5.741
ist+5.670
der+5.656
aman+5.575
Neg logit contributions
 ever-3.260
 it-2.831
 have-2.532
 excited-2.500
 hungry-2.411
Token of
Token ID286
Feature activation-5.70e-01

Pos logit contributions
 ever+0.214
keeping+0.214
 onwards+0.194
 fright+0.179
 sideways+0.171
Neg logit contributions
aman-0.624
 hey-0.618
land-0.594
 exception-0.575
 whose-0.571
Token the
Token ID262
Feature activation+2.95e-01

Pos logit contributions
keeping+0.579
 ever+0.575
 onwards+0.570
 fright+0.515
 sideways+0.510
Neg logit contributions
aman-0.952
 hey-0.932
land-0.868
 extingu-0.863
 whose-0.848
Token day
Token ID1110
Feature activation-2.25e-01

Pos logit contributions
aman+0.574
 hey+0.564
land+0.541
 exception+0.521
 extingu+0.514
Neg logit contributions
 ever-0.242
keeping-0.225
 onwards-0.225
 it-0.209
 fright-0.202
Token,
Token ID11
Feature activation+8.47e-01

Pos logit contributions
 ever+0.211
keeping+0.209
 onwards+0.198
 fright+0.194
 sideways+0.186
Neg logit contributions
aman-0.400
 hey-0.392
land-0.376
 exception-0.355
 whose-0.351
Token batteries
Token ID13591
Feature activation+1.71e-02

Pos logit contributions
 ever+0.001
 onwards+0.001
keeping+0.001
 it+0.001
 fright+0.001
Neg logit contributions
aman-0.002
 hey-0.002
land-0.002
 exception-0.001
 extingu-0.001
Token.
Token ID13
Feature activation+5.52e-01

Pos logit contributions
aman+0.030
 hey+0.029
land+0.028
 exception+0.027
 extingu+0.026
Neg logit contributions
 ever-0.017
 onwards-0.016
keeping-0.016
 fright-0.015
 it-0.015
Token
Token ID198
Feature activation+1.46e-02

Pos logit contributions
aman+0.975
 hey+0.958
land+0.918
 exception+0.881
 extingu+0.870
Neg logit contributions
 ever-0.546
 onwards-0.521
keeping-0.512
 it-0.494
 fright-0.470
Token
Token ID198
Feature activation+4.49e-01

Pos logit contributions
aman+0.026
 hey+0.025
land+0.024
 exception+0.023
 whose+0.023
Neg logit contributions
 ever-0.014
keeping-0.014
 onwards-0.013
 it-0.013
 fright-0.012
TokenAt
Token ID2953
Feature activation+1.23e-01

Pos logit contributions
aman+0.648
 hey+0.633
land+0.601
 exception+0.570
 extingu+0.561
Neg logit contributions
 ever-0.593
 onwards-0.567
keeping-0.562
 it-0.543
 fright-0.529
Token the
Token ID262
Feature activation+3.08e+00

Pos logit contributions
aman+0.223
 hey+0.218
land+0.207
 exception+0.199
 extingu+0.197
Neg logit contributions
 ever-0.118
keeping-0.112
 onwards-0.111
 it-0.102
 fright-0.102
Token store
Token ID3650
Feature activation+9.41e-02

Pos logit contributions
ist+4.902
der+4.873
land+4.866
 hey+4.851
aman+4.803
Neg logit contributions
 it-3.680
 ever-3.673
 excited-3.446
 onwards-3.383
 on-3.354
Token,
Token ID11
Feature activation+3.15e-01

Pos logit contributions
aman+0.174
 hey+0.170
land+0.162
 exception+0.155
 extingu+0.152
Neg logit contributions
 ever-0.086
keeping-0.081
 onwards-0.080
 fright-0.074
 it-0.074
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
aman+0.515
 hey+0.505
land+0.480
 exception+0.459
 extingu+0.452
Neg logit contributions
 ever-0.351
keeping-0.333
 onwards-0.332
 it-0.316
 fright-0.308
Token store
Token ID3650
Feature activation-7.63e-01

Pos logit contributions
aman+1.618
 hey+1.517
land+1.492
 exception+1.360
 whose+1.347
Neg logit contributions
 ever-1.698
 onwards-1.629
keeping-1.603
 it-1.600
 excited-1.570
Token-
Token ID12
Feature activation+8.06e-01

Pos logit contributions
 ever+0.923
keeping+0.913
 onwards+0.886
 fright+0.874
 sideways+0.848
Neg logit contributions
aman-1.173
 hey-1.130
land-1.064
 exception-0.998
 whose-0.988
Token sun
Token ID4252
Feature activation+3.62e-01

Pos logit contributions
aman+1.672
 hey+1.644
land+1.565
 exception+1.520
 extingu+1.493
Neg logit contributions
 ever-0.802
 onwards-0.750
keeping-0.740
 it-0.724
 fright-0.686
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
aman+0.643
 hey+0.632
land+0.603
 exception+0.579
 extingu+0.571
Neg logit contributions
 ever-0.352
keeping-0.330
 onwards-0.330
 it-0.312
 fright-0.301
Token
Token ID198
Feature activation-6.88e-01

Pos logit contributions
aman+0.617
 hey+0.607
land+0.582
 exception+0.561
 extingu+0.553
Neg logit contributions
 ever-0.260
keeping-0.241
 onwards-0.240
 it-0.224
 fright-0.216
Token
Token ID198
Feature activation+2.38e-01

Pos logit contributions
 ever+0.778
keeping+0.747
 onwards+0.722
 fright+0.680
 sideways+0.673
Neg logit contributions
aman-1.119
 hey-1.084
land-1.050
 whose-1.006
 exception-0.979
TokenAt
Token ID2953
Feature activation+2.53e-01

Pos logit contributions
aman+0.366
 hey+0.358
land+0.339
 exception+0.323
 extingu+0.317
Neg logit contributions
 ever-0.292
keeping-0.278
 onwards-0.277
 it-0.264
 fright-0.259
Token the
Token ID262
Feature activation+3.07e+00

Pos logit contributions
aman+0.462
 hey+0.453
land+0.433
 exception+0.416
 extingu+0.411
Neg logit contributions
 ever-0.236
keeping-0.222
 onwards-0.221
 it-0.206
 fright-0.202
Token end
Token ID886
Feature activation-5.43e-01

Pos logit contributions
 hey+5.657
aman+5.538
 extingu+5.408
lves+5.381
der+5.357
Neg logit contributions
 ever-2.998
 it-2.964
 onwards-2.582
 on-2.575
 excited-2.573
Token of
Token ID286
Feature activation-4.80e-01

Pos logit contributions
keeping+0.411
 ever+0.391
 onwards+0.364
 fright+0.336
 sideways+0.325
Neg logit contributions
aman-1.086
 hey-1.060
land-1.037
 exception-0.990
 whose-0.990
Token the
Token ID262
Feature activation+1.82e-01

Pos logit contributions
 ever+0.495
keeping+0.483
 onwards+0.482
 sideways+0.442
 fright+0.440
Neg logit contributions
aman-0.815
 hey-0.781
land-0.736
 extingu-0.731
 whose-0.717
Token day
Token ID1110
Feature activation-2.49e-02

Pos logit contributions
aman+0.351
 hey+0.344
land+0.330
 exception+0.318
 extingu+0.313
Neg logit contributions
 ever-0.154
keeping-0.145
 onwards-0.144
 it-0.134
 fright-0.129
Token,
Token ID11
Feature activation+4.08e-01

Pos logit contributions
 ever+0.023
keeping+0.022
 onwards+0.021
 fright+0.020
 it+0.020
Neg logit contributions
aman-0.045
 hey-0.044
land-0.043
 exception-0.041
 whose-0.040
Token.
Token ID13
Feature activation-3.59e-01

Pos logit contributions
aman+4.759
 hey+4.717
land+4.460
lves+4.376
 extingu+4.349
Neg logit contributions
 ever-1.442
 it-1.400
 on-1.352
 excited-1.269
 onwards-1.169
Token She
Token ID1375
Feature activation-2.73e-01

Pos logit contributions
 ever+0.366
keeping+0.344
 onwards+0.330
 fright+0.316
 it+0.299
Neg logit contributions
aman-0.626
 hey-0.611
land-0.588
 exception-0.567
 whose-0.559
Token saw
Token ID2497
Feature activation+1.24e+00

Pos logit contributions
keeping+0.298
 ever+0.296
 onwards+0.294
 fright+0.273
 it+0.260
Neg logit contributions
 hey-0.435
aman-0.434
land-0.407
 whose-0.395
 exception-0.390
Token a
Token ID257
Feature activation-1.95e-01

Pos logit contributions
aman+2.193
 hey+2.116
land+2.056
 exception+1.969
 extingu+1.926
Neg logit contributions
 ever-1.290
 it-1.272
 onwards-1.167
 excited-1.161
keeping-1.161
Token beetle
Token ID45489
Feature activation-4.70e-01

Pos logit contributions
 ever+0.245
keeping+0.241
 onwards+0.236
 it+0.215
 fright+0.215
Neg logit contributions
aman-0.291
 hey-0.283
land-0.268
 exception-0.253
 whose-0.252
Token walking
Token ID6155
Feature activation-3.37e+00

Pos logit contributions
keeping+0.359
 ever+0.349
 onwards+0.342
 fright+0.305
unts+0.289
Neg logit contributions
aman-0.902
 hey-0.899
land-0.867
 whose-0.849
 exception-0.825
Token on
Token ID319
Feature activation-2.83e-01

Pos logit contributions
unts+4.905
 grin+4.641
keeping+4.482
 fingertips+4.398
 onwards+4.392
Neg logit contributions
 whose-4.515
 where-4.283
 living-4.164
 who-4.134
 hey-4.030
Token the
Token ID262
Feature activation+5.74e-01

Pos logit contributions
 ever+0.298
 onwards+0.290
keeping+0.278
unts+0.256
 fright+0.252
Neg logit contributions
aman-0.480
 hey-0.462
land-0.447
 whose-0.423
 extingu-0.421
Token grass
Token ID8701
Feature activation+5.08e-01

Pos logit contributions
aman+0.860
 hey+0.842
land+0.791
 exception+0.759
 extingu+0.742
Neg logit contributions
 ever-0.733
keeping-0.699
 onwards-0.689
 it-0.672
 fright-0.650
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
aman+1.009
 hey+0.997
land+0.952
 exception+0.922
 extingu+0.910
Neg logit contributions
 ever-0.394
 onwards-0.360
keeping-0.360
 it-0.339
 fright-0.317
Token She
Token ID1375
Feature activation-4.17e-01

Pos logit contributions
 ever+0.538
keeping+0.510
 onwards+0.476
 fright+0.448
unts+0.437
Neg logit contributions
aman-0.923
 hey-0.896
land-0.867
 whose-0.836
 exception-0.819
Token ground
Token ID2323
Feature activation+6.81e-01

Pos logit contributions
aman+2.196
 hey+2.186
land+2.047
 exception+1.995
 whose+1.887
Neg logit contributions
 ever-1.703
keeping-1.626
 it-1.576
 onwards-1.515
 fright-1.478
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
aman+1.262
 hey+1.243
land+1.177
 exception+1.133
 extingu+1.118
Neg logit contributions
 ever-0.638
 onwards-0.583
keeping-0.583
 it-0.567
 fright-0.530
Token He
Token ID679
Feature activation-4.20e-01

Pos logit contributions
 ever+1.682
keeping+1.594
 onwards+1.489
unts+1.351
 fright+1.330
Neg logit contributions
aman-2.443
 hey-2.307
 whose-2.292
 extingu-2.211
land-2.185
Token wanted
Token ID2227
Feature activation-8.16e-01

Pos logit contributions
keeping+0.400
 ever+0.391
 onwards+0.389
 fright+0.338
 it+0.326
Neg logit contributions
aman-0.743
 hey-0.733
 whose-0.693
land-0.691
 exception-0.674
Token to
Token ID284
Feature activation-5.87e-01

Pos logit contributions
keeping+0.748
 onwards+0.745
 ever+0.705
 grin+0.658
unts+0.656
Neg logit contributions
aman-1.428
 hey-1.424
land-1.371
 whose-1.365
 extingu-1.310
Token drive
Token ID3708
Feature activation-3.26e+00

Pos logit contributions
 onwards+0.706
keeping+0.680
 ever+0.670
unts+0.595
 onward+0.590
Neg logit contributions
 hey-0.896
aman-0.891
 exception-0.812
land-0.812
 extingu-0.810
Token his
Token ID465
Feature activation+1.02e+00

Pos logit contributions
unts+4.577
keeping+4.503
 grin+4.168
 Amen+4.032
umbs+3.878
Neg logit contributions
 hey-4.646
 whose-4.572
 where-4.549
 who-4.252
 exception-4.229
Token car
Token ID1097
Feature activation+5.02e-01

Pos logit contributions
aman+1.497
 hey+1.469
land+1.412
 exception+1.317
 extingu+1.279
Neg logit contributions
 ever-1.377
 it-1.290
 onwards-1.278
keeping-1.263
 fright-1.221
Token over
Token ID625
Feature activation-1.51e+00

Pos logit contributions
aman+0.931
 hey+0.914
land+0.873
 exception+0.837
 extingu+0.830
Neg logit contributions
 ever-0.458
 onwards-0.425
keeping-0.421
 it-0.403
 fright-0.385
Token it
Token ID340
Feature activation+2.69e-01

Pos logit contributions
 onwards+1.643
keeping+1.643
unts+1.615
 grin+1.586
 ever+1.570
Neg logit contributions
aman-2.394
 hey-2.372
 whose-2.244
land-2.204
 extingu-2.089
Token,
Token ID11
Feature activation-6.21e-01

Pos logit contributions
aman+0.500
 hey+0.490
land+0.471
 exception+0.453
 extingu+0.446
Neg logit contributions
 ever-0.240
keeping-0.226
 onwards-0.224
 it-0.209
 fright-0.204
Token,
Token ID11
Feature activation-1.82e+00

Pos logit contributions
aman+0.339
 hey+0.332
land+0.319
 exception+0.305
 extingu+0.301
Neg logit contributions
 ever-0.185
keeping-0.175
 onwards-0.175
 it-0.161
 fright-0.160
Token and
Token ID290
Feature activation-1.45e+00

Pos logit contributions
keeping+2.252
 onwards+2.117
unts+2.104
 ever+2.094
tick+1.966
Neg logit contributions
aman-2.639
 whose-2.521
 hey-2.488
land-2.448
 extingu-2.362
Token Peter
Token ID5613
Feature activation+1.07e-01

Pos logit contributions
keeping+1.381
 ever+1.288
 onwards+1.207
unts+1.163
 Go+1.108
Neg logit contributions
aman-2.530
 hey-2.464
 whose-2.430
land-2.356
lves-2.301
Token wanted
Token ID2227
Feature activation-1.31e+00

Pos logit contributions
aman+0.197
 hey+0.194
land+0.186
 exception+0.178
 whose+0.177
Neg logit contributions
 ever-0.095
keeping-0.091
 onwards-0.091
 it-0.082
 fright-0.082
Token to
Token ID284
Feature activation-9.41e-02

Pos logit contributions
 onwards+1.331
keeping+1.303
unts+1.234
 grin+1.230
 ever+1.168
Neg logit contributions
 hey-2.163
aman-2.156
land-2.151
 whose-2.106
 exception-1.994
Token ride
Token ID6594
Feature activation-3.20e+00

Pos logit contributions
 onwards+0.099
 ever+0.099
keeping+0.097
 it+0.089
 sideways+0.087
Neg logit contributions
aman-0.156
 hey-0.154
land-0.146
 exception-0.141
 extingu-0.138
Token it
Token ID340
Feature activation-1.24e-01

Pos logit contributions
unts+3.989
keeping+3.861
irk+3.505
 grin+3.306
umbs+3.145
Neg logit contributions
 whose-5.257
land-4.910
 hey-4.892
 exception-4.766
 who-4.759
Token.
Token ID13
Feature activation+4.67e-01

Pos logit contributions
 ever+0.125
keeping+0.124
 onwards+0.119
 fright+0.110
unts+0.110
Neg logit contributions
aman-0.211
 hey-0.205
land-0.200
 exception-0.193
 whose-0.187
Token He
Token ID679
Feature activation+1.05e-01

Pos logit contributions
aman+0.874
 hey+0.859
land+0.823
 exception+0.791
 extingu+0.781
Neg logit contributions
 ever-0.415
 onwards-0.389
keeping-0.388
 it-0.371
 fright-0.352
Token asked
Token ID1965
Feature activation-2.23e-01

Pos logit contributions
aman+0.190
 hey+0.187
land+0.179
 exception+0.172
 whose+0.170
Neg logit contributions
 ever-0.098
keeping-0.095
 onwards-0.094
 it-0.086
 fright-0.084
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
 ever+0.229
keeping+0.228
 onwards+0.219
 fright+0.199
unts+0.197
Neg logit contributions
aman-0.377
 hey-0.373
land-0.357
 whose-0.344
 exception-0.342
Token horn
Token ID12718
Feature activation-3.88e-01

Pos logit contributions
 ever+1.089
keeping+1.084
 onwards+1.050
 Go+0.914
 fright+0.912
Neg logit contributions
aman-1.255
 hey-1.230
land-1.187
 extingu-1.122
 whose-1.115
Token.
Token ID13
Feature activation-4.50e-02

Pos logit contributions
 ever+0.355
keeping+0.354
 onwards+0.349
 fright+0.319
 it+0.300
Neg logit contributions
aman-0.679
 hey-0.673
land-0.658
 extingu-0.624
 whose-0.623
Token They
Token ID1119
Feature activation-4.21e-02

Pos logit contributions
 ever+0.047
keeping+0.044
 onwards+0.043
 fright+0.041
 it+0.039
Neg logit contributions
aman-0.078
 hey-0.076
land-0.073
 exception-0.070
 whose-0.069
Token wanted
Token ID2227
Feature activation-9.31e-01

Pos logit contributions
 ever+0.035
keeping+0.034
 onwards+0.034
 it+0.030
 fright+0.030
Neg logit contributions
aman-0.080
 hey-0.079
land-0.075
 exception-0.073
 whose-0.072
Token to
Token ID284
Feature activation-5.25e-01

Pos logit contributions
 onwards+0.873
keeping+0.861
 ever+0.816
 grin+0.771
unts+0.769
Neg logit contributions
 hey-1.607
aman-1.604
land-1.594
 whose-1.558
 exception-1.478
Token ride
Token ID6594
Feature activation-3.19e+00

Pos logit contributions
 onwards+0.555
keeping+0.542
 ever+0.532
unts+0.469
 sideways+0.458
Neg logit contributions
aman-0.866
 hey-0.866
land-0.815
 exception-0.795
 whose-0.789
Token the
Token ID262
Feature activation+6.03e-01

Pos logit contributions
keeping+3.850
unts+3.775
 grin+3.540
irk+3.488
umbs+3.271
Neg logit contributions
 whose-5.479
land-4.960
 who-4.900
 hey-4.834
 where-4.677
Token boat
Token ID8848
Feature activation+7.35e-02

Pos logit contributions
aman+0.786
 hey+0.761
land+0.711
 exception+0.677
 extingu+0.657
Neg logit contributions
 ever-0.889
keeping-0.846
 onwards-0.844
 it-0.835
 fright-0.805
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
aman+0.134
 hey+0.132
land+0.127
 exception+0.122
 whose+0.120
Neg logit contributions
 ever-0.066
keeping-0.063
 onwards-0.062
 it-0.057
 fright-0.057
Token but
Token ID475
Feature activation-8.28e-01

Pos logit contributions
keeping+1.578
 ever+1.550
 onwards+1.534
unts+1.459
tick+1.392
Neg logit contributions
aman-1.226
 hey-1.220
land-1.189
 whose-1.149
 exception-1.107
Token there
Token ID612
Feature activation-1.46e+00

Pos logit contributions
keeping+0.844
 ever+0.826
 onwards+0.774
unts+0.757
 fright+0.751
Neg logit contributions
aman-1.417
 hey-1.403
 whose-1.332
land-1.293
 exception-1.280
Token very
Token ID845
Feature activation-7.80e-01

Pos logit contributions
keeping+0.479
 ever+0.460
 onwards+0.444
 fright+0.387
 it+0.385
Neg logit contributions
 hey-1.047
aman-1.036
land-0.975
 exception-0.974
 whose-0.966
Token happy
Token ID3772
Feature activation-2.52e-01

Pos logit contributions
keeping+0.659
 ever+0.602
 onwards+0.593
 fright+0.523
 it+0.497
Neg logit contributions
 hey-1.498
aman-1.476
 exception-1.403
lves-1.364
land-1.364
Token and
Token ID290
Feature activation-4.31e-01

Pos logit contributions
 onwards+0.208
 ever+0.206
keeping+0.200
 fright+0.184
unts+0.177
Neg logit contributions
aman-0.472
 hey-0.466
land-0.450
 exception-0.431
 whose-0.431
Token wanted
Token ID2227
Feature activation-8.02e-01

Pos logit contributions
 ever+0.389
keeping+0.382
 onwards+0.374
 fright+0.344
unts+0.325
Neg logit contributions
aman-0.778
 hey-0.771
land-0.729
 exception-0.712
 whose-0.711
Token to
Token ID284
Feature activation-4.17e-01

Pos logit contributions
 onwards+0.738
keeping+0.727
 ever+0.689
unts+0.661
 grin+0.633
Neg logit contributions
 hey-1.414
aman-1.401
land-1.382
 whose-1.325
 exception-1.299
Token ride
Token ID6594
Feature activation-3.17e+00

Pos logit contributions
 onwards+0.465
keeping+0.457
 ever+0.452
 sideways+0.402
unts+0.395
Neg logit contributions
 hey-0.664
aman-0.663
land-0.623
 exception-0.602
 whose-0.589
Token it
Token ID340
Feature activation-1.82e-01

Pos logit contributions
unts+4.311
keeping+4.260
irk+3.712
umbs+3.655
 grin+3.505
Neg logit contributions
 hey-4.649
 whose-4.600
land-4.598
 where-4.306
 exception-4.297
Token.
Token ID13
Feature activation+3.17e-01

Pos logit contributions
keeping+0.186
 ever+0.185
 onwards+0.176
 fright+0.164
unts+0.164
Neg logit contributions
aman-0.305
 hey-0.299
land-0.291
 exception-0.280
 whose-0.272
Token
Token ID198
Feature activation-6.07e-01

Pos logit contributions
aman+0.585
 hey+0.574
land+0.550
 exception+0.529
 extingu+0.521
Neg logit contributions
 ever-0.290
keeping-0.271
 onwards-0.271
 it-0.254
 fright-0.246
Token
Token ID198
Feature activation+3.62e-01

Pos logit contributions
 ever+0.671
keeping+0.668
 onwards+0.639
 fright+0.590
 sideways+0.590
Neg logit contributions
aman-0.976
 hey-0.944
land-0.918
 whose-0.873
 exception-0.866
TokenL
Token ID43
Feature activation+4.53e-01

Pos logit contributions
aman+0.417
 hey+0.405
land+0.377
 exception+0.353
 extingu+0.345
Neg logit contributions
 ever-0.587
 onwards-0.566
keeping-0.565
 it-0.547
 fright-0.537
Token front
Token ID2166
Feature activation-6.45e-01

Pos logit contributions
aman+1.646
 hey+1.631
land+1.522
 exception+1.463
 extingu+1.432
Neg logit contributions
 ever-1.421
keeping-1.339
 onwards-1.313
 it-1.286
 fright-1.247
Token.
Token ID13
Feature activation+4.41e-01

Pos logit contributions
 ever+0.767
keeping+0.751
 onwards+0.745
 it+0.701
 fright+0.696
Neg logit contributions
aman-0.992
land-0.940
 hey-0.934
 exception-0.865
 whose-0.854
Token Lily
Token ID20037
Feature activation+8.64e-01

Pos logit contributions
aman+0.777
 hey+0.765
land+0.729
 exception+0.700
 extingu+0.692
Neg logit contributions
 ever-0.440
 onwards-0.417
keeping-0.413
 it-0.398
 fright-0.380
Token liked
Token ID8288
Feature activation-1.22e+00

Pos logit contributions
aman+1.471
 hey+1.459
land+1.381
 extingu+1.323
 exception+1.315
Neg logit contributions
 ever-0.935
 onwards-0.851
 it-0.849
keeping-0.834
 excited-0.821
Token to
Token ID284
Feature activation-9.42e-01

Pos logit contributions
 onwards+1.174
keeping+1.145
 ever+1.137
unts+1.068
 grin+1.031
Neg logit contributions
aman-2.135
 hey-2.010
 whose-1.991
land-1.982
 exception-1.874
Token ride
Token ID6594
Feature activation-3.15e+00

Pos logit contributions
keeping+1.363
 onwards+1.355
 ever+1.316
unts+1.225
 sideways+1.177
Neg logit contributions
aman-1.182
 hey-1.176
land-1.097
 exception-1.090
 whose-1.053
Token it
Token ID340
Feature activation+2.75e-01

Pos logit contributions
keeping+4.482
unts+4.289
 ever+4.053
 grin+3.915
 fright+3.876
Neg logit contributions
 whose-4.248
land-4.159
aman-4.120
 who-3.835
lves-3.813
Token around
Token ID1088
Feature activation-9.48e-01

Pos logit contributions
aman+0.485
 hey+0.475
land+0.455
 exception+0.437
 extingu+0.429
Neg logit contributions
 ever-0.270
keeping-0.256
 onwards-0.253
 it-0.239
 fright-0.233
Token her
Token ID607
Feature activation+6.80e-01

Pos logit contributions
keeping+1.055
 onwards+1.022
 ever+1.006
unts+0.924
 fright+0.917
Neg logit contributions
aman-1.523
 hey-1.497
land-1.408
 whose-1.393
 exception-1.352
Token town
Token ID3240
Feature activation+7.88e-02

Pos logit contributions
aman+1.175
 hey+1.155
land+1.094
 exception+1.056
 extingu+1.042
Neg logit contributions
 ever-0.717
 onwards-0.673
keeping-0.671
 it-0.655
 excited-0.624
Token and
Token ID290
Feature activation-5.87e-01

Pos logit contributions
aman+0.144
 hey+0.141
land+0.137
 exception+0.131
 whose+0.129
Neg logit contributions
 ever-0.071
 onwards-0.067
keeping-0.067
 it-0.062
 fright-0.061
Token He
Token ID679
Feature activation-3.17e-01

Pos logit contributions
 ever+1.148
keeping+1.044
 onwards+0.967
 fright+0.944
unts+0.902
Neg logit contributions
aman-2.140
 hey-1.980
land-1.980
 whose-1.882
 exception-1.878
Token was
Token ID373
Feature activation-7.16e-01

Pos logit contributions
keeping+0.295
 ever+0.293
 onwards+0.285
 fright+0.253
 it+0.246
Neg logit contributions
aman-0.572
 hey-0.556
land-0.533
 whose-0.522
 exception-0.514
Token so
Token ID523
Feature activation-8.70e-01

Pos logit contributions
keeping+0.622
 ever+0.605
 onwards+0.575
 fright+0.514
unts+0.499
Neg logit contributions
aman-1.314
 hey-1.312
 whose-1.244
land-1.236
 exception-1.234
Token excited
Token ID6568
Feature activation+2.48e-01

Pos logit contributions
keeping+0.444
 ever+0.364
 onwards+0.357
 fright+0.302
tick+0.248
Neg logit contributions
aman-1.956
 hey-1.947
 exception-1.852
 whose-1.848
land-1.833
Token to
Token ID284
Feature activation-1.14e+00

Pos logit contributions
aman+0.469
 hey+0.460
land+0.443
 exception+0.426
 extingu+0.419
Neg logit contributions
 ever-0.213
 onwards-0.199
keeping-0.199
 it-0.184
 fright-0.180
Token ride
Token ID6594
Feature activation-3.15e+00

Pos logit contributions
keeping+1.382
 onwards+1.366
 ever+1.287
unts+1.207
 sideways+1.150
Neg logit contributions
 hey-1.698
aman-1.696
 exception-1.571
 whose-1.558
land-1.550
Token in
Token ID287
Feature activation-1.95e+00

Pos logit contributions
unts+3.265
keeping+3.116
 Amen+2.815
 ever+2.729
 fright+2.627
Neg logit contributions
 whose-5.568
land-5.448
aman-5.277
 hey-5.128
 who-5.119
Token it
Token ID340
Feature activation+3.30e-01

Pos logit contributions
keeping+1.473
 ever+1.421
unts+1.412
 onwards+1.355
 grin+1.225
Neg logit contributions
aman-3.883
 whose-3.734
land-3.670
 hey-3.616
 exception-3.450
Token.
Token ID13
Feature activation-2.79e-01

Pos logit contributions
aman+0.596
 hey+0.585
land+0.560
 exception+0.538
 extingu+0.530
Neg logit contributions
 ever-0.310
keeping-0.291
 onwards-0.291
 it-0.274
 fright-0.265
Token His
Token ID2399
Feature activation+1.35e-01

Pos logit contributions
 ever+0.281
keeping+0.261
 onwards+0.252
 fright+0.240
unts+0.229
Neg logit contributions
aman-0.497
 hey-0.477
land-0.463
 exception-0.443
 whose-0.436
Token mom
Token ID1995
Feature activation+2.62e-01

Pos logit contributions
aman+0.249
 hey+0.244
land+0.233
 exception+0.224
 extingu+0.221
Neg logit contributions
 ever-0.126
keeping-0.119
 onwards-0.118
 it-0.109
 fright-0.107
Token car
Token ID1097
Feature activation+2.36e-01

Pos logit contributions
aman+0.113
 hey+0.111
land+0.104
 exception+0.099
 extingu+0.097
Neg logit contributions
 ever-0.113
keeping-0.110
 onwards-0.109
 it-0.102
 fright-0.101
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
aman+0.451
 hey+0.445
land+0.426
 exception+0.410
 extingu+0.405
Neg logit contributions
 ever-0.196
keeping-0.184
 onwards-0.183
 it-0.169
 fright-0.164
Token He
Token ID679
Feature activation-5.77e-01

Pos logit contributions
 ever+1.851
keeping+1.818
 onwards+1.659
 fright+1.564
unts+1.482
Neg logit contributions
aman-2.339
 hey-2.294
 whose-2.236
land-2.222
 extingu-2.170
Token wanted
Token ID2227
Feature activation-7.22e-01

Pos logit contributions
keeping+0.577
 onwards+0.552
 ever+0.547
 fright+0.485
 it+0.459
Neg logit contributions
aman-1.001
 hey-0.984
 whose-0.941
land-0.939
 exception-0.907
Token to
Token ID284
Feature activation-8.11e-01

Pos logit contributions
keeping+0.657
 onwards+0.647
 ever+0.627
 grin+0.571
unts+0.558
Neg logit contributions
 hey-1.267
aman-1.252
land-1.229
 whose-1.208
 exception-1.164
Token drive
Token ID3708
Feature activation-3.12e+00

Pos logit contributions
 onwards+1.031
keeping+1.002
 ever+0.961
unts+0.868
 onward+0.866
Neg logit contributions
 hey-1.177
aman-1.151
 extingu-1.058
land-1.058
 exception-1.057
Token the
Token ID262
Feature activation+1.08e+00

Pos logit contributions
keeping+4.345
unts+4.261
 grin+4.216
 Amen+3.852
irk+3.723
Neg logit contributions
 whose-4.283
 hey-4.218
 where-4.080
land-4.025
 who-3.949
Token car
Token ID1097
Feature activation+1.02e-01

Pos logit contributions
aman+2.001
 hey+1.905
land+1.853
 exception+1.774
 extingu+1.736
Neg logit contributions
 ever-1.076
 it-1.007
 onwards-0.977
keeping-0.951
 fright-0.919
Token with
Token ID351
Feature activation-2.62e-01

Pos logit contributions
aman+0.178
 hey+0.175
land+0.168
 exception+0.162
 whose+0.158
Neg logit contributions
 ever-0.100
keeping-0.096
 onwards-0.095
 it-0.089
 fright-0.088
Token his
Token ID465
Feature activation+2.92e-01

Pos logit contributions
 ever+0.278
keeping+0.275
 onwards+0.270
unts+0.239
 fright+0.238
Neg logit contributions
aman-0.434
 hey-0.432
land-0.404
 exception-0.392
 whose-0.389
Token stick
Token ID4859
Feature activation-2.78e-02

Pos logit contributions
aman+0.494
 hey+0.484
land+0.461
 exception+0.442
 extingu+0.435
Neg logit contributions
 ever-0.314
keeping-0.297
 onwards-0.297
 it-0.281
 fright-0.273
Token's
Token ID338
Feature activation-3.95e-01

Pos logit contributions
 ever+0.027
keeping+0.027
 onwards+0.026
 it+0.024
 fright+0.024
Neg logit contributions
aman-0.030
 hey-0.030
land-0.028
 exception-0.027
 whose-0.026
Token bicycle
Token ID17026
Feature activation+9.85e-03

Pos logit contributions
 ever+0.512
keeping+0.501
 onwards+0.496
 it+0.462
unts+0.458
Neg logit contributions
aman-0.566
 hey-0.544
land-0.513
 exception-0.497
 extingu-0.483
Token and
Token ID290
Feature activation+5.79e-01

Pos logit contributions
aman+0.020
 hey+0.020
land+0.019
 exception+0.018
 whose+0.018
Neg logit contributions
 ever-0.007
keeping-0.007
 onwards-0.006
 it-0.006
 fright-0.006
Token wanted
Token ID2227
Feature activation-1.37e+00

Pos logit contributions
aman+1.107
 hey+1.086
land+1.047
 exception+0.999
 extingu+0.991
Neg logit contributions
 ever-0.497
 onwards-0.456
keeping-0.455
 it-0.451
 excited-0.418
Token to
Token ID284
Feature activation-9.21e-01

Pos logit contributions
 onwards+1.360
keeping+1.334
unts+1.238
 grin+1.223
 ever+1.198
Neg logit contributions
 hey-2.321
aman-2.296
 whose-2.240
land-2.182
 exception-2.150
Token ride
Token ID6594
Feature activation-3.10e+00

Pos logit contributions
 onwards+1.125
keeping+1.117
 ever+1.036
unts+0.987
 sideways+0.951
Neg logit contributions
 hey-1.371
aman-1.339
 exception-1.262
land-1.246
 whose-1.205
Token it
Token ID340
Feature activation+5.49e-02

Pos logit contributions
unts+3.363
keeping+3.161
 grin+2.844
umbs+2.767
 onwards+2.692
Neg logit contributions
 whose-5.244
 hey-5.053
 exception-4.884
aman-4.811
lves-4.739
Token,
Token ID11
Feature activation-8.83e-01

Pos logit contributions
aman+0.095
 hey+0.093
land+0.090
 exception+0.087
 whose+0.084
Neg logit contributions
 ever-0.054
keeping-0.052
 onwards-0.051
 it-0.047
 fright-0.047
Token but
Token ID475
Feature activation-7.52e-01

Pos logit contributions
keeping+1.383
 ever+1.365
 onwards+1.316
unts+1.245
 sideways+1.191
Neg logit contributions
aman-1.071
 hey-1.037
land-0.973
 exception-0.945
 whose-0.936
Token Tim
Token ID5045
Feature activation-3.97e-01

Pos logit contributions
keeping+0.732
 ever+0.711
 onwards+0.678
unts+0.668
 fright+0.630
Neg logit contributions
aman-1.342
 hey-1.319
 whose-1.221
 exception-1.210
land-1.182
Tokenmy
Token ID1820
Feature activation-5.64e-01

Pos logit contributions
 ever+0.525
keeping+0.515
 onwards+0.511
 fright+0.474
 it+0.474
Neg logit contributions
aman-0.574
 hey-0.548
land-0.513
 exception-0.500
 whose-0.489
Token it
Token ID340
Feature activation-4.05e-01

Pos logit contributions
 ever+0.103
 onwards+0.099
keeping+0.099
unts+0.083
 fright+0.081
Neg logit contributions
aman-0.305
 hey-0.300
land-0.288
 whose-0.280
 exception-0.278
Token closer
Token ID5699
Feature activation-4.28e-01

Pos logit contributions
keeping+0.420
 ever+0.407
 onwards+0.386
unts+0.360
 fright+0.360
Neg logit contributions
aman-0.685
 hey-0.668
land-0.657
 exception-0.635
 whose-0.628
Token,
Token ID11
Feature activation-1.78e+00

Pos logit contributions
keeping+0.388
 ever+0.385
 onwards+0.373
 fright+0.352
unts+0.347
Neg logit contributions
aman-0.763
 hey-0.738
land-0.736
 exception-0.693
 whose-0.690
Token so
Token ID523
Feature activation-7.50e-01

Pos logit contributions
keeping+2.929
 ever+2.751
 onwards+2.711
unts+2.690
ay+2.563
Neg logit contributions
aman-2.062
 hey-2.018
land-2.003
 whose-1.947
 exception-1.856
Token she
Token ID673
Feature activation-2.24e-01

Pos logit contributions
keeping+1.027
 ever+0.970
 onwards+0.941
 fright+0.923
unts+0.896
Neg logit contributions
aman-1.043
 hey-1.029
land-0.966
 whose-0.921
 exception-0.899
Token moved
Token ID3888
Feature activation-3.09e+00

Pos logit contributions
 ever+0.223
keeping+0.221
 onwards+0.218
 fright+0.197
 it+0.195
Neg logit contributions
aman-0.383
 hey-0.378
land-0.359
 whose-0.345
 exception-0.344
Token a
Token ID257
Feature activation+4.11e-01

Pos logit contributions
keeping+4.529
unts+4.455
 grin+4.103
ay+3.792
 onwards+3.764
Neg logit contributions
 whose-4.156
 hey-4.073
aman-3.790
 who-3.650
 exception-3.638
Token bit
Token ID1643
Feature activation-1.46e-01

Pos logit contributions
aman+0.595
 hey+0.583
land+0.549
 exception+0.523
 extingu+0.512
Neg logit contributions
 ever-0.535
 onwards-0.509
keeping-0.509
 it-0.491
 fright-0.480
Token closer
Token ID5699
Feature activation-7.28e-01

Pos logit contributions
 ever+0.137
keeping+0.135
 onwards+0.131
 it+0.117
 fright+0.116
Neg logit contributions
 hey-0.258
aman-0.257
land-0.244
 exception-0.238
 whose-0.234
Token.
Token ID13
Feature activation+4.27e-01

Pos logit contributions
 ever+0.707
keeping+0.705
 onwards+0.701
unts+0.650
 fright+0.641
Neg logit contributions
aman-1.221
land-1.208
 hey-1.181
 exception-1.119
 whose-1.112
Token She
Token ID1375
Feature activation+4.34e-03

Pos logit contributions
aman+0.779
 hey+0.765
land+0.731
 exception+0.705
 extingu+0.695
Neg logit contributions
 ever-0.397
 onwards-0.373
keeping-0.371
 it-0.354
 fright-0.338
Token it
Token ID340
Feature activation+7.90e-02

Pos logit contributions
 onwards+0.782
keeping+0.772
 ever+0.752
unts+0.668
 grin+0.614
Neg logit contributions
 hey-1.836
aman-1.832
 whose-1.773
land-1.698
 extingu-1.675
Token.
Token ID13
Feature activation+2.09e-01

Pos logit contributions
aman+0.141
 hey+0.138
land+0.133
 exception+0.129
 whose+0.127
Neg logit contributions
 ever-0.074
keeping-0.072
 onwards-0.070
 fright-0.064
 it-0.064
Token The
Token ID383
Feature activation-8.39e-01

Pos logit contributions
aman+0.376
 hey+0.368
land+0.352
 exception+0.338
 extingu+0.334
Neg logit contributions
 ever-0.202
keeping-0.189
 onwards-0.188
 it-0.174
 fright-0.172
Token crab
Token ID32202
Feature activation-1.25e-01

Pos logit contributions
 onwards+0.879
 ever+0.868
keeping+0.863
unts+0.758
 fright+0.758
Neg logit contributions
 hey-1.381
aman-1.380
land-1.278
 whose-1.222
 exception-1.221
Token was
Token ID373
Feature activation-7.66e-01

Pos logit contributions
keeping+0.122
 ever+0.120
 onwards+0.116
 fright+0.105
 it+0.104
Neg logit contributions
aman-0.217
 hey-0.213
land-0.204
 whose-0.197
 exception-0.196
Token walking
Token ID6155
Feature activation-3.08e+00

Pos logit contributions
keeping+0.703
 ever+0.672
 onwards+0.652
 fright+0.583
tick+0.563
Neg logit contributions
aman-1.373
 hey-1.339
land-1.290
 whose-1.286
 exception-1.278
Token sideways
Token ID35303
Feature activation-2.58e-01

Pos logit contributions
keeping+4.554
unts+4.195
 grin+3.919
tick+3.900
 Amen+3.883
Neg logit contributions
 whose-4.374
lves-4.022
 hey-3.957
aman-3.932
 exception-3.930
Token and
Token ID290
Feature activation-8.11e-02

Pos logit contributions
keeping+0.190
 ever+0.189
 onwards+0.178
 fright+0.164
unts+0.157
Neg logit contributions
aman-0.506
 hey-0.492
land-0.489
 exception-0.461
 whose-0.461
Token had
Token ID550
Feature activation-1.56e-01

Pos logit contributions
 ever+0.075
keeping+0.073
 onwards+0.071
 fright+0.065
unts+0.063
Neg logit contributions
aman-0.146
 hey-0.144
land-0.136
 exception-0.132
 whose-0.131
Token big
Token ID1263
Feature activation-1.77e+00

Pos logit contributions
 ever+0.172
 onwards+0.168
keeping+0.168
 it+0.151
 fright+0.150
Neg logit contributions
aman-0.250
 hey-0.245
land-0.237
 exception-0.223
 whose-0.222
Token claws
Token ID28421
Feature activation+9.72e-01

Pos logit contributions
keeping+2.092
 ever+2.087
 onwards+1.983
kick+1.752
tick+1.730
Neg logit contributions
aman-2.748
 hey-2.695
land-2.593
 Majesty-2.558
 whose-2.534
Token a
Token ID257
Feature activation-4.33e-01

Pos logit contributions
aman+1.743
 hey+1.687
land+1.621
 exception+1.550
 extingu+1.508
Neg logit contributions
 ever-1.064
 it-1.037
 onwards-0.979
keeping-0.961
 excited-0.953
Token boat
Token ID8848
Feature activation+2.70e-01

Pos logit contributions
keeping+0.540
 ever+0.539
 onwards+0.523
 fright+0.471
 it+0.461
Neg logit contributions
aman-0.650
 hey-0.631
land-0.606
 whose-0.577
 extingu-0.572
Token and
Token ID290
Feature activation+7.27e-01

Pos logit contributions
aman+0.507
 hey+0.498
land+0.478
 exception+0.460
 extingu+0.454
Neg logit contributions
 ever-0.235
keeping-0.221
 onwards-0.220
 it-0.205
 fright-0.199
Token wanted
Token ID2227
Feature activation-8.81e-01

Pos logit contributions
aman+1.479
 hey+1.446
land+1.393
 exception+1.338
 extingu+1.333
Neg logit contributions
 ever-0.550
 it-0.505
keeping-0.489
 onwards-0.489
 excited-0.460
Token to
Token ID284
Feature activation-3.59e-01

Pos logit contributions
keeping+0.820
 onwards+0.815
 ever+0.774
 grin+0.737
unts+0.731
Neg logit contributions
land-1.522
 hey-1.512
aman-1.493
 whose-1.473
 exception-1.409
Token ride
Token ID6594
Feature activation-3.08e+00

Pos logit contributions
 onwards+0.416
 ever+0.412
keeping+0.411
unts+0.363
 sideways+0.360
Neg logit contributions
aman-0.553
 hey-0.550
land-0.521
 exception-0.503
 extingu-0.497
Token it
Token ID340
Feature activation-1.14e-01

Pos logit contributions
keeping+3.769
unts+3.759
 grin+3.511
irk+3.414
 fright+2.979
Neg logit contributions
 whose-5.127
land-4.783
 who-4.571
 exception-4.544
 where-4.494
Token.
Token ID13
Feature activation+4.30e-01

Pos logit contributions
 ever+0.118
keeping+0.117
 onwards+0.112
 fright+0.104
unts+0.103
Neg logit contributions
aman-0.190
 hey-0.185
land-0.181
 exception-0.174
 whose-0.170
Token He
Token ID679
Feature activation+2.84e-01

Pos logit contributions
aman+0.816
 hey+0.804
land+0.770
 exception+0.740
 extingu+0.731
Neg logit contributions
 ever-0.371
 onwards-0.347
keeping-0.346
 it-0.330
 fright-0.313
Token asked
Token ID1965
Feature activation-3.91e-01

Pos logit contributions
aman+0.509
 hey+0.500
land+0.478
 exception+0.459
 extingu+0.452
Neg logit contributions
 ever-0.272
keeping-0.257
 onwards-0.256
 it-0.240
 fright-0.233
Token the
Token ID262
Feature activation+5.04e-01

Pos logit contributions
keeping+0.463
 ever+0.460
 onwards+0.441
 fright+0.409
unts+0.403
Neg logit contributions
aman-0.607
 hey-0.597
land-0.569
 whose-0.553
 exception-0.546
Token wanted
Token ID2227
Feature activation-8.81e-01

Pos logit contributions
keeping+0.369
 ever+0.356
 onwards+0.353
 fright+0.307
 sideways+0.298
Neg logit contributions
aman-0.669
 hey-0.656
land-0.625
 whose-0.615
lves-0.600
Token to
Token ID284
Feature activation-6.35e-01

Pos logit contributions
 onwards+0.818
keeping+0.817
 ever+0.760
unts+0.714
 grin+0.701
Neg logit contributions
aman-1.542
 hey-1.540
land-1.489
 whose-1.449
 exception-1.397
Token catch
Token ID4929
Feature activation-2.37e+00

Pos logit contributions
 onwards+0.714
keeping+0.693
 ever+0.656
unts+0.592
 sideways+0.590
Neg logit contributions
 hey-1.019
aman-1.013
land-0.950
 exception-0.927
 whose-0.910
Token it
Token ID340
Feature activation-3.64e-01

Pos logit contributions
keeping+2.089
 onwards+2.076
 ever+1.824
unts+1.697
 onward+1.655
Neg logit contributions
 hey-4.439
 whose-4.352
lves-4.346
aman-4.255
land-4.130
Token and
Token ID290
Feature activation-3.63e-01

Pos logit contributions
keeping+0.336
 ever+0.324
 onwards+0.315
unts+0.285
 fright+0.284
Neg logit contributions
aman-0.658
 hey-0.638
land-0.628
 exception-0.604
 whose-0.596
Token hold
Token ID1745
Feature activation-3.07e+00

Pos logit contributions
 ever+0.386
 onwards+0.384
keeping+0.380
unts+0.343
 fright+0.330
Neg logit contributions
 hey-0.594
aman-0.594
land-0.555
 exception-0.537
 whose-0.534
Token it
Token ID340
Feature activation-5.38e-02

Pos logit contributions
keeping+3.476
unts+3.104
 grin+2.804
 Amen+2.774
 Please+2.509
Neg logit contributions
 whose-5.253
 hey-5.049
aman-4.847
lves-4.708
 who-4.641
Token in
Token ID287
Feature activation-4.44e-01

Pos logit contributions
 ever+0.048
keeping+0.047
 onwards+0.045
 it+0.042
 fright+0.041
Neg logit contributions
aman-0.097
 hey-0.096
land-0.092
 exception-0.089
 whose-0.088
Token her
Token ID607
Feature activation+5.04e-01

Pos logit contributions
 ever+0.540
 onwards+0.525
keeping+0.524
 sideways+0.475
 it+0.473
Neg logit contributions
aman-0.681
 hey-0.651
land-0.622
 extingu-0.590
 whose-0.588
Token hands
Token ID2832
Feature activation+2.70e-02

Pos logit contributions
aman+1.089
 hey+1.073
land+1.033
 exception+1.002
 extingu+0.984
Neg logit contributions
 ever-0.317
 onwards-0.285
keeping-0.282
 it-0.263
 fright-0.248
Token.
Token ID13
Feature activation+5.88e-01

Pos logit contributions
aman+0.046
 hey+0.045
land+0.043
 exception+0.042
 whose+0.041
Neg logit contributions
 ever-0.027
keeping-0.026
 onwards-0.026
 it-0.024
 fright-0.024
Token to
Token ID284
Feature activation-7.75e-01

Pos logit contributions
 onwards+1.286
keeping+1.225
 ever+1.207
 grin+1.119
unts+1.114
Neg logit contributions
aman-1.717
 hey-1.706
 whose-1.654
land-1.628
 exception-1.552
Token play
Token ID711
Feature activation-8.10e-01

Pos logit contributions
 onwards+0.939
keeping+0.929
 ever+0.906
unts+0.811
 sideways+0.800
Neg logit contributions
aman-1.160
 hey-1.147
land-1.095
 exception-1.056
 whose-1.047
Token with
Token ID351
Feature activation-9.58e-01

Pos logit contributions
keeping+0.745
 ever+0.716
 onwards+0.695
 fright+0.628
unts+0.625
Neg logit contributions
 hey-1.450
aman-1.389
land-1.383
 whose-1.369
 exception-1.329
Token it
Token ID340
Feature activation+6.17e-01

Pos logit contributions
keeping+0.436
 ever+0.426
 onwards+0.417
 grin+0.316
 fright+0.306
Neg logit contributions
aman-2.139
 hey-2.117
 whose-2.068
land-2.056
lves-1.987
Token and
Token ID290
Feature activation-9.45e-01

Pos logit contributions
aman+1.151
 hey+1.129
land+1.075
 exception+1.032
 extingu+1.027
Neg logit contributions
 ever-0.577
 onwards-0.530
keeping-0.514
 it-0.510
 fright-0.480
Token hold
Token ID1745
Feature activation-3.07e+00

Pos logit contributions
 onwards+1.131
 ever+1.101
keeping+1.077
unts+1.009
 sideways+0.974
Neg logit contributions
 hey-1.431
aman-1.415
 whose-1.317
land-1.315
 exception-1.280
Token it
Token ID340
Feature activation-1.89e-01

Pos logit contributions
keeping+3.122
 grin+2.938
 Amen+2.846
unts+2.660
 fright+2.494
Neg logit contributions
 whose-5.367
 hey-5.161
 who-4.923
 where-4.809
aman-4.737
Token tight
Token ID5381
Feature activation+3.62e-01

Pos logit contributions
 ever+0.165
keeping+0.163
 onwards+0.155
 fright+0.142
 it+0.142
Neg logit contributions
aman-0.342
 hey-0.337
land-0.325
 exception-0.315
 whose-0.311
Token.
Token ID13
Feature activation+4.28e-01

Pos logit contributions
aman+0.640
 hey+0.628
land+0.600
 exception+0.576
 extingu+0.568
Neg logit contributions
 ever-0.353
 onwards-0.331
keeping-0.331
 it-0.313
 fright-0.303
Token When
Token ID1649
Feature activation-1.71e+00

Pos logit contributions
aman+0.764
 hey+0.749
land+0.717
 exception+0.687
 extingu+0.679
Neg logit contributions
 ever-0.418
 onwards-0.394
keeping-0.393
 it-0.375
 fright-0.359
Token she
Token ID673
Feature activation+6.35e-01

Pos logit contributions
keeping+2.556
 fright+2.351
 onwards+2.330
 ever+2.321
unts+2.261
Neg logit contributions
 hey-2.279
aman-2.222
 whose-1.909
land-1.877
 exception-1.824
Token She
Token ID1375
Feature activation-1.31e-01

Pos logit contributions
 ever+0.018
keeping+0.017
 onwards+0.017
 fright+0.015
 it+0.015
Neg logit contributions
aman-0.034
 hey-0.033
land-0.031
 exception-0.030
 whose-0.030
Token saw
Token ID2497
Feature activation+4.93e-01

Pos logit contributions
 ever+0.135
keeping+0.133
 onwards+0.130
 fright+0.118
 it+0.117
Neg logit contributions
aman-0.223
 hey-0.219
land-0.209
 whose-0.200
 exception-0.199
Token a
Token ID257
Feature activation-2.87e-01

Pos logit contributions
aman+0.814
 hey+0.796
land+0.759
 exception+0.726
 extingu+0.714
Neg logit contributions
 ever-0.549
keeping-0.517
 onwards-0.515
 it-0.503
 fright-0.482
Token house
Token ID2156
Feature activation-9.96e-01

Pos logit contributions
 ever+0.380
keeping+0.373
 onwards+0.367
 it+0.335
 fright+0.332
Neg logit contributions
aman-0.411
 hey-0.397
land-0.379
 exception-0.356
 whose-0.354
Token and
Token ID290
Feature activation+4.29e-01

Pos logit contributions
keeping+0.958
 ever+0.939
 onwards+0.925
 fright+0.868
unts+0.815
Neg logit contributions
aman-1.739
 hey-1.699
land-1.627
 extingu-1.591
 whose-1.588
Token knocked
Token ID13642
Feature activation-3.04e+00

Pos logit contributions
aman+0.831
 hey+0.814
land+0.782
 exception+0.753
 extingu+0.744
Neg logit contributions
 ever-0.356
keeping-0.329
 onwards-0.327
 it-0.312
 fright-0.296
Token on
Token ID319
Feature activation-8.26e-01

Pos logit contributions
 grin+3.678
unts+3.556
 Amen+3.325
keeping+3.287
 onwards+3.078
Neg logit contributions
aman-4.826
 hey-4.616
 whose-4.430
ray-4.210
land-4.196
Token the
Token ID262
Feature activation-1.26e-01

Pos logit contributions
 ever+0.906
 onwards+0.875
keeping+0.853
unts+0.840
 grin+0.788
Neg logit contributions
aman-1.352
 hey-1.307
land-1.259
 whose-1.212
 extingu-1.205
Token door
Token ID3420
Feature activation-4.58e-01

Pos logit contributions
 ever+0.142
 onwards+0.136
keeping+0.132
 it+0.126
 fright+0.123
Neg logit contributions
aman-0.205
 hey-0.203
land-0.193
 extingu-0.184
 exception-0.181
Token.
Token ID13
Feature activation-8.10e-02

Pos logit contributions
 ever+0.496
keeping+0.488
 onwards+0.474
 fright+0.444
unts+0.434
Neg logit contributions
aman-0.754
 hey-0.723
land-0.718
 exception-0.676
 whose-0.669
Token
Token ID220
Feature activation-2.29e-02

Pos logit contributions
 ever+0.082
keeping+0.077
 onwards+0.075
 fright+0.070
 it+0.068
Neg logit contributions
aman-0.142
 hey-0.138
land-0.132
 exception-0.126
 whose-0.125
Token the
Token ID262
Feature activation+8.79e-01

Pos logit contributions
aman+3.217
land+3.167
 hey+3.142
 exception+2.977
 extingu+2.879
Neg logit contributions
 it-1.194
 ever-1.059
keeping-0.939
 on-0.921
 excited-0.917
Token car
Token ID1097
Feature activation-1.58e-03

Pos logit contributions
aman+1.217
 hey+1.182
land+1.126
 exception+1.050
 extingu+1.035
Neg logit contributions
 ever-1.222
 onwards-1.156
 it-1.148
keeping-1.143
 excited-1.115
Token and
Token ID290
Feature activation+4.05e-01

Pos logit contributions
 ever+0.001
keeping+0.001
 onwards+0.001
 it+0.001
 fright+0.001
Neg logit contributions
aman-0.003
 hey-0.003
land-0.003
 exception-0.003
 whose-0.003
Token wanted
Token ID2227
Feature activation-1.25e+00

Pos logit contributions
aman+0.797
 hey+0.783
land+0.753
 exception+0.725
 extingu+0.716
Neg logit contributions
 ever-0.321
keeping-0.295
 onwards-0.295
 it-0.279
 fright-0.265
Token to
Token ID284
Feature activation-7.66e-01

Pos logit contributions
 onwards+1.214
keeping+1.210
 grin+1.161
 ever+1.127
unts+1.107
Neg logit contributions
 hey-2.101
aman-2.080
land-2.050
 whose-2.034
 exception-1.942
Token drive
Token ID3708
Feature activation-3.02e+00

Pos logit contributions
 onwards+0.968
keeping+0.938
 ever+0.914
unts+0.835
 sideways+0.831
Neg logit contributions
 hey-1.110
aman-1.108
land-1.025
 exception-1.000
 extingu-0.992
Token it
Token ID340
Feature activation+3.78e-01

Pos logit contributions
unts+3.822
 grin+3.702
keeping+3.681
 Amen+3.264
umbs+3.243
Neg logit contributions
 hey-4.442
 whose-4.420
 exception-4.297
land-4.295
 where-4.114
Token.
Token ID13
Feature activation+3.32e-02

Pos logit contributions
aman+0.693
 hey+0.681
land+0.651
 exception+0.625
 extingu+0.617
Neg logit contributions
 ever-0.349
 onwards-0.326
keeping-0.325
 it-0.308
 fright-0.296
Token He
Token ID679
Feature activation+2.48e-02

Pos logit contributions
aman+0.062
 hey+0.060
land+0.057
 exception+0.055
 whose+0.055
Neg logit contributions
 ever-0.031
keeping-0.028
 onwards-0.028
 fright-0.026
 it-0.025
Token climbed
Token ID19952
Feature activation-1.15e+00

Pos logit contributions
aman+0.043
 hey+0.043
land+0.041
 exception+0.039
 whose+0.039
Neg logit contributions
 ever-0.024
keeping-0.024
 onwards-0.023
 it-0.021
 fright-0.021
Token in
Token ID287
Feature activation-1.70e+00

Pos logit contributions
keeping+1.277
 ever+1.259
unts+1.203
 fright+1.158
 onwards+1.141
Neg logit contributions
aman-1.828
 hey-1.744
land-1.715
 extingu-1.629
 whose-1.626
Token.
Token ID13
Feature activation+9.61e-02

Pos logit contributions
 ever+0.310
keeping+0.306
 onwards+0.300
 fright+0.278
unts+0.267
Neg logit contributions
aman-0.473
 hey-0.464
land-0.453
 whose-0.431
 exception-0.431
Token Fl
Token ID1610
Feature activation+5.55e-01

Pos logit contributions
aman+0.139
 hey+0.136
land+0.130
 exception+0.123
 whose+0.121
Neg logit contributions
 ever-0.124
keeping-0.120
 onwards-0.118
 it-0.111
 fright-0.111
Tokenuffy
Token ID15352
Feature activation-8.36e-01

Pos logit contributions
aman+0.480
 hey+0.455
land+0.415
 exception+0.380
 extingu+0.366
Neg logit contributions
 ever-1.059
 onwards-1.019
keeping-1.015
 it-0.992
 fright-0.976
Token wanted
Token ID2227
Feature activation-8.00e-01

Pos logit contributions
keeping+0.868
 onwards+0.841
 ever+0.817
 fright+0.759
unts+0.707
Neg logit contributions
 hey-1.390
aman-1.379
 whose-1.322
land-1.319
lves-1.275
Token to
Token ID284
Feature activation-5.77e-01

Pos logit contributions
 onwards+0.738
keeping+0.735
 ever+0.698
unts+0.655
 grin+0.642
Neg logit contributions
 hey-1.387
aman-1.382
land-1.347
 whose-1.328
 exception-1.282
Token ride
Token ID6594
Feature activation-3.01e+00

Pos logit contributions
 onwards+0.698
keeping+0.669
 ever+0.664
unts+0.596
 sideways+0.593
Neg logit contributions
aman-0.865
 hey-0.862
land-0.802
 exception-0.791
 extingu-0.776
Token in
Token ID287
Feature activation-9.36e-01

Pos logit contributions
keeping+4.889
unts+4.726
irk+4.184
 grin+4.119
umbs+4.042
Neg logit contributions
 whose-3.889
 where-3.463
 hey-3.443
land-3.418
 exception-3.397
Token the
Token ID262
Feature activation-4.68e-02

Pos logit contributions
 ever+1.024
keeping+0.999
 onwards+0.992
unts+0.921
 grin+0.889
Neg logit contributions
aman-1.516
 hey-1.470
land-1.430
 whose-1.417
 exception-1.344
Token je
Token ID11223
Feature activation-3.16e-01

Pos logit contributions
 ever+0.067
keeping+0.065
 onwards+0.065
 it+0.061
 fright+0.060
Neg logit contributions
aman-0.061
 hey-0.060
land-0.056
 exception-0.053
lves-0.052
Tokenep
Token ID538
Feature activation-3.99e-01

Pos logit contributions
 ever+0.516
keeping+0.495
 onwards+0.492
 it+0.477
 fright+0.466
Neg logit contributions
aman-0.360
 hey-0.349
land-0.329
 exception-0.299
lves-0.299
Token.
Token ID13
Feature activation+5.02e-01

Pos logit contributions
 ever+0.411
keeping+0.403
 onwards+0.399
 fright+0.376
 it+0.364
Neg logit contributions
aman-0.651
 hey-0.651
land-0.633
 exception-0.599
 whose-0.588
Token ball
Token ID2613
Feature activation+9.02e-01

Pos logit contributions
aman+1.245
 hey+1.224
land+1.163
 exception+1.099
 extingu+1.063
Neg logit contributions
 ever-0.998
keeping-0.935
 onwards-0.934
 it-0.921
 excited-0.894
Token,
Token ID11
Feature activation+1.14e-01

Pos logit contributions
aman+1.806
 hey+1.768
land+1.693
 exception+1.619
 extingu+1.610
Neg logit contributions
 ever-0.746
 onwards-0.676
 it-0.658
keeping-0.638
 on-0.610
Token and
Token ID290
Feature activation-1.04e-01

Pos logit contributions
aman+0.171
 hey+0.167
land+0.158
 exception+0.151
 whose+0.149
Neg logit contributions
 ever-0.143
keeping-0.139
 onwards-0.138
 it-0.128
 fright-0.127
Token she
Token ID673
Feature activation+1.45e-01

Pos logit contributions
 ever+0.119
keeping+0.118
 onwards+0.116
 fright+0.107
unts+0.105
Neg logit contributions
aman-0.163
 hey-0.161
land-0.151
 exception-0.146
 whose-0.145
Token would
Token ID561
Feature activation-4.19e-01

Pos logit contributions
aman+0.242
 hey+0.239
land+0.227
 exception+0.218
 whose+0.214
Neg logit contributions
 ever-0.154
keeping-0.149
 onwards-0.147
 it-0.137
 fright-0.136
Token roll
Token ID4836
Feature activation-2.99e+00

Pos logit contributions
keeping+0.470
 ever+0.459
 onwards+0.456
 fright+0.403
 it+0.402
Neg logit contributions
aman-0.672
 hey-0.665
land-0.622
 exception-0.611
 whose-0.597
Token it
Token ID340
Feature activation+3.52e-01

Pos logit contributions
keeping+2.621
 Amen+2.419
 grin+2.411
unts+2.277
 onwards+2.125
Neg logit contributions
 hey-5.500
aman-5.366
lves-5.364
 whose-5.206
 exception-5.138
Token all
Token ID477
Feature activation+5.21e-01

Pos logit contributions
aman+0.694
 hey+0.682
land+0.655
 exception+0.631
 extingu+0.623
Neg logit contributions
 ever-0.274
 onwards-0.253
keeping-0.252
 it-0.235
 fright-0.225
Token around
Token ID1088
Feature activation-4.00e-01

Pos logit contributions
aman+1.071
 hey+1.055
land+1.013
 exception+0.974
 extingu+0.967
Neg logit contributions
 ever-0.377
 onwards-0.341
keeping-0.335
 it-0.319
 fright-0.296
Token her
Token ID607
Feature activation+1.00e+00

Pos logit contributions
 ever+0.417
 onwards+0.415
keeping+0.405
 fright+0.378
unts+0.367
Neg logit contributions
 hey-0.654
aman-0.654
land-0.613
 whose-0.589
 exception-0.586
Token house
Token ID2156
Feature activation+4.30e-01

Pos logit contributions
aman+1.792
 hey+1.750
land+1.637
 exception+1.613
 extingu+1.582
Neg logit contributions
 ever-1.032
keeping-0.973
 onwards-0.951
 it-0.949
 excited-0.901
Token He
Token ID679
Feature activation-1.68e-01

Pos logit contributions
 ever+0.997
keeping+0.944
 onwards+0.889
 fright+0.849
unts+0.817
Neg logit contributions
aman-1.343
 hey-1.316
land-1.273
 whose-1.242
 exception-1.178
Token sniff
Token ID26300
Feature activation-1.42e+00

Pos logit contributions
keeping+0.157
 ever+0.156
 onwards+0.154
 it+0.136
 fright+0.136
Neg logit contributions
aman-0.299
 hey-0.296
land-0.281
 whose-0.273
 exception-0.272
Tokened
Token ID276
Feature activation-1.79e+00

Pos logit contributions
keeping+1.684
 ever+1.607
 onwards+1.515
 fright+1.506
 grin+1.503
Neg logit contributions
land-2.081
 hey-2.066
aman-2.041
lves-2.014
 whose-2.009
Token it
Token ID340
Feature activation-1.95e-01

Pos logit contributions
keeping+1.633
 ever+1.560
 onwards+1.554
unts+1.545
tick+1.406
Neg logit contributions
 hey-3.161
 whose-3.146
aman-3.122
land-3.069
 extingu-2.961
Token and
Token ID290
Feature activation-1.34e-02

Pos logit contributions
keeping+0.164
 ever+0.163
 onwards+0.150
 fright+0.139
unts+0.136
Neg logit contributions
aman-0.364
 hey-0.355
land-0.347
 exception-0.335
 whose-0.333
Token rolled
Token ID11686
Feature activation-2.99e+00

Pos logit contributions
 ever+0.013
 onwards+0.013
keeping+0.013
 fright+0.011
 it+0.011
Neg logit contributions
aman-0.023
 hey-0.023
land-0.022
 exception-0.021
 whose-0.021
Token it
Token ID340
Feature activation-1.20e-01

Pos logit contributions
unts+4.063
 Amen+3.785
keeping+3.633
 onwards+3.618
 grin+3.613
Neg logit contributions
 hey-3.995
 extingu-3.848
 whose-3.741
 where-3.729
lves-3.665
Token with
Token ID351
Feature activation-7.76e-01

Pos logit contributions
 ever+0.108
keeping+0.107
 onwards+0.103
 it+0.095
 fright+0.093
Neg logit contributions
aman-0.210
 hey-0.209
land-0.200
 exception-0.194
 whose-0.191
Token his
Token ID465
Feature activation-1.13e+00

Pos logit contributions
keeping+0.951
 onwards+0.937
 ever+0.926
unts+0.806
tick+0.806
Neg logit contributions
 hey-1.137
aman-1.113
land-1.037
 whose-1.023
 exception-1.016
Token paw
Token ID41827
Feature activation-1.82e-01

Pos logit contributions
keeping+1.175
 onwards+1.153
 ever+1.091
unts+1.003
tick+0.984
Neg logit contributions
aman-1.859
 hey-1.841
 whose-1.749
land-1.722
 extingu-1.692
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
 ever+0.193
keeping+0.184
 onwards+0.183
 fright+0.170
 it+0.169
Neg logit contributions
aman-0.297
 hey-0.293
land-0.284
 exception-0.270
 whose-0.269
Token was
Token ID373
Feature activation-5.45e-01

Pos logit contributions
 ever+0.213
keeping+0.213
 onwards+0.210
 fright+0.191
 it+0.186
Neg logit contributions
aman-0.319
 hey-0.315
land-0.297
 whose-0.290
 exception-0.289
Token having
Token ID1719
Feature activation-2.20e+00

Pos logit contributions
keeping+0.478
 ever+0.472
 onwards+0.447
 fright+0.409
 sideways+0.392
Neg logit contributions
 hey-0.992
aman-0.974
 exception-0.924
land-0.922
 whose-0.915
Token so
Token ID523
Feature activation-4.36e-01

Pos logit contributions
keeping+3.024
 onwards+2.857
 ever+2.761
 sideways+2.751
imp+2.702
Neg logit contributions
 hey-2.825
aman-2.764
land-2.731
 exception-2.627
 whose-2.600
Token much
Token ID881
Feature activation+3.93e-01

Pos logit contributions
keeping+0.335
 ever+0.322
 onwards+0.309
 fright+0.282
 it+0.269
Neg logit contributions
aman-0.843
 hey-0.837
land-0.806
 exception-0.791
 whose-0.777
Token fun
Token ID1257
Feature activation+2.60e-01

Pos logit contributions
aman+0.634
 hey+0.621
land+0.590
 exception+0.563
 extingu+0.555
Neg logit contributions
 ever-0.452
 onwards-0.427
keeping-0.426
 it-0.409
 fright-0.397
Token driving
Token ID5059
Feature activation-2.98e+00

Pos logit contributions
aman+0.450
 hey+0.442
land+0.422
 exception+0.405
 extingu+0.398
Neg logit contributions
 ever-0.265
keeping-0.251
 onwards-0.250
 it-0.235
 fright-0.230
Token it
Token ID340
Feature activation+9.59e-01

Pos logit contributions
 grin+3.044
unts+3.005
 Amen+2.936
keeping+2.869
 onwards+2.749
Neg logit contributions
 hey-4.921
 whose-4.874
 where-4.638
aman-4.621
land-4.610
Token around
Token ID1088
Feature activation-5.29e-01

Pos logit contributions
aman+1.974
 hey+1.934
land+1.840
 extingu+1.788
 exception+1.778
Neg logit contributions
 ever-0.758
 onwards-0.670
 it-0.663
keeping-0.625
 fright-0.598
Token when
Token ID618
Feature activation-9.27e-01

Pos logit contributions
 onwards+0.515
keeping+0.513
 ever+0.512
unts+0.458
 fright+0.454
Neg logit contributions
 hey-0.895
aman-0.893
land-0.841
 whose-0.815
 exception-0.813
Token he
Token ID339
Feature activation-4.89e-02

Pos logit contributions
keeping+1.110
 ever+1.036
 onwards+1.030
unts+0.993
 fright+0.961
Neg logit contributions
 hey-1.418
aman-1.373
land-1.258
 exception-1.251
 whose-1.243
Token suddenly
Token ID6451
Feature activation+9.71e-01

Pos logit contributions
 ever+0.056
keeping+0.056
 onwards+0.054
 fright+0.051
 it+0.050
Neg logit contributions
aman-0.077
 hey-0.076
land-0.072
 exception-0.068
 whose-0.067
Token fish
Token ID5916
Feature activation+6.37e-01

Pos logit contributions
aman+0.195
 hey+0.192
land+0.182
 exception+0.174
 extingu+0.171
Neg logit contributions
 ever-0.146
keeping-0.141
 onwards-0.140
 it-0.131
 fright-0.129
Token will
Token ID481
Feature activation+3.26e-01

Pos logit contributions
aman+1.128
 hey+1.096
land+1.048
 exception+1.011
 extingu+0.995
Neg logit contributions
 ever-0.651
keeping-0.591
 onwards-0.591
 it-0.575
 fright-0.548
Token bite
Token ID13197
Feature activation-1.04e+00

Pos logit contributions
aman+0.535
 hey+0.524
land+0.499
 exception+0.477
 extingu+0.470
Neg logit contributions
 ever-0.362
keeping-0.343
 onwards-0.342
 it-0.325
 fright-0.317
Token.
Token ID13
Feature activation-2.62e-01

Pos logit contributions
keeping+1.148
 onwards+1.088
 ever+1.047
 fright+1.009
 grin+1.008
Neg logit contributions
aman-1.656
 hey-1.647
land-1.613
 whose-1.510
 exception-1.487
Token Then
Token ID3244
Feature activation-2.44e-01

Pos logit contributions
 ever+0.292
keeping+0.284
 onwards+0.280
 it+0.261
 fright+0.255
Neg logit contributions
 hey-0.423
aman-0.422
land-0.396
 extingu-0.382
 whose-0.381
Token it
Token ID340
Feature activation+1.69e+00

Pos logit contributions
keeping+0.221
 ever+0.221
 onwards+0.214
unts+0.194
 fright+0.194
Neg logit contributions
 hey-0.438
aman-0.438
land-0.413
 whose-0.394
 exception-0.393
Token can
Token ID460
Feature activation+1.95e-01

Pos logit contributions
aman+3.247
 hey+3.044
land+2.939
 exception+2.835
 extingu+2.808
Neg logit contributions
 ever-1.715
 it-1.484
 onwards-1.424
 excited-1.398
 fright-1.393
Token pull
Token ID2834
Feature activation-1.15e+00

Pos logit contributions
aman+0.329
 hey+0.325
land+0.308
 exception+0.296
 extingu+0.292
Neg logit contributions
 ever-0.205
keeping-0.197
 onwards-0.197
 it-0.183
 fright-0.178
Token me
Token ID502
Feature activation+4.01e-01

Pos logit contributions
keeping+1.317
 onwards+1.286
 ever+1.265
 grin+1.220
unts+1.176
Neg logit contributions
 hey-1.777
aman-1.771
land-1.695
 whose-1.614
 exception-1.578
Token in
Token ID287
Feature activation-5.46e-01

Pos logit contributions
aman+0.747
 hey+0.733
land+0.701
 exception+0.675
 extingu+0.665
Neg logit contributions
 ever-0.358
keeping-0.333
 onwards-0.333
 it-0.314
 fright-0.302
Token and
Token ID290
Feature activation+4.46e-01

Pos logit contributions
 ever+0.595
 onwards+0.581
keeping+0.567
 sideways+0.515
 excited+0.514
Neg logit contributions
aman-0.897
 hey-0.862
land-0.829
 extingu-0.800
 whose-0.792
Token doll
Token ID3654
Feature activation-1.11e-01

Pos logit contributions
aman+0.321
 hey+0.312
land+0.294
 exception+0.279
 extingu+0.272
Neg logit contributions
 ever-0.353
 onwards-0.340
keeping-0.339
 it-0.326
 fright-0.319
Token.
Token ID13
Feature activation-4.48e-02

Pos logit contributions
 ever+0.122
keeping+0.118
 onwards+0.117
 fright+0.108
 it+0.108
Neg logit contributions
aman-0.180
 hey-0.177
land-0.171
 exception-0.164
 whose-0.160
Token She
Token ID1375
Feature activation-2.95e-01

Pos logit contributions
 ever+0.043
keeping+0.040
 onwards+0.040
 fright+0.037
 it+0.037
Neg logit contributions
aman-0.081
 hey-0.078
land-0.075
 exception-0.072
 whose-0.071
Token started
Token ID2067
Feature activation-2.61e-01

Pos logit contributions
keeping+0.331
 ever+0.329
 onwards+0.325
 fright+0.301
 it+0.293
Neg logit contributions
aman-0.470
 hey-0.460
land-0.433
 exception-0.422
 whose-0.418
Token gathering
Token ID11228
Feature activation+6.49e-02

Pos logit contributions
 ever+0.236
keeping+0.225
 onwards+0.224
 fright+0.200
 it+0.197
Neg logit contributions
aman-0.474
 hey-0.466
land-0.445
 exception-0.432
 whose-0.427
Token the
Token ID262
Feature activation+1.79e+00

Pos logit contributions
aman+0.103
 hey+0.101
land+0.097
 exception+0.093
 extingu+0.091
Neg logit contributions
 ever-0.074
keeping-0.071
 onwards-0.070
 it-0.065
 fright-0.065
Token pieces
Token ID5207
Feature activation-5.58e-03

Pos logit contributions
aman+2.842
 hey+2.726
land+2.541
 extingu+2.386
lves+2.363
Neg logit contributions
 ever-2.328
 it-2.179
 onwards-2.160
 on-2.148
 excited-2.105
Token:
Token ID25
Feature activation+1.75e-01

Pos logit contributions
 ever+0.005
keeping+0.005
 onwards+0.004
 fright+0.004
 it+0.004
Neg logit contributions
aman-0.010
 hey-0.010
land-0.010
 exception-0.009
 extingu-0.009
Token yarn
Token ID21181
Feature activation+4.71e-01

Pos logit contributions
aman+0.275
 hey+0.269
land+0.256
 exception+0.244
 extingu+0.240
Neg logit contributions
 ever-0.206
keeping-0.198
 onwards-0.196
 it-0.184
 fright-0.182
Token,
Token ID11
Feature activation+3.94e-01

Pos logit contributions
aman+0.898
 hey+0.884
land+0.844
 exception+0.816
 extingu+0.802
Neg logit contributions
 ever-0.398
 onwards-0.366
keeping-0.366
 it-0.347
 fright-0.329
Token buttons
Token ID12163
Feature activation+9.21e-01

Pos logit contributions
aman+0.770
 hey+0.757
land+0.726
 exception+0.700
 extingu+0.690
Neg logit contributions
 ever-0.315
 onwards-0.291
keeping-0.291
 it-0.273
 fright-0.261
Token They
Token ID1119
Feature activation+1.81e-01

Pos logit contributions
 ever+0.614
keeping+0.579
 onwards+0.565
 fright+0.530
 it+0.510
Neg logit contributions
aman-0.960
 hey-0.951
land-0.903
 whose-0.883
 extingu-0.857
Token want
Token ID765
Feature activation+1.27e-01

Pos logit contributions
aman+0.325
 hey+0.321
land+0.305
 exception+0.294
 extingu+0.289
Neg logit contributions
 ever-0.172
keeping-0.164
 onwards-0.163
 it-0.152
 fright-0.146
Token to
Token ID284
Feature activation+5.14e-01

Pos logit contributions
aman+0.249
 hey+0.246
land+0.237
 exception+0.228
 extingu+0.226
Neg logit contributions
 ever-0.097
 onwards-0.092
keeping-0.091
 it-0.081
 fright-0.080
Token play
Token ID711
Feature activation+1.19e-01

Pos logit contributions
aman+0.945
 hey+0.923
land+0.887
 exception+0.850
 extingu+0.839
Neg logit contributions
 ever-0.482
keeping-0.445
 onwards-0.438
 it-0.426
 fright-0.412
Token with
Token ID351
Feature activation-3.52e-02

Pos logit contributions
aman+0.228
 hey+0.227
land+0.217
 exception+0.209
 whose+0.207
Neg logit contributions
 ever-0.097
keeping-0.091
 onwards-0.090
 it-0.081
 fright-0.080
Token something
Token ID1223
Feature activation+2.05e+00

Pos logit contributions
 ever+0.039
 onwards+0.038
keeping+0.038
 fright+0.034
 it+0.034
Neg logit contributions
aman-0.057
 hey-0.056
land-0.053
 whose-0.051
 exception-0.051
Token else
Token ID2073
Feature activation-2.36e-01

Pos logit contributions
aman+3.317
 hey+3.046
land+2.719
lves+2.687
 extingu+2.670
Neg logit contributions
 ever-3.039
 it-2.862
 on-2.695
 excited-2.646
 onwards-2.564
Token.
Token ID13
Feature activation-4.49e-01

Pos logit contributions
 ever+0.244
keeping+0.242
 onwards+0.241
 fright+0.225
unts+0.216
Neg logit contributions
aman-0.382
 hey-0.380
land-0.370
 whose-0.351
 exception-0.350
Token Something
Token ID13742
Feature activation+1.22e+00

Pos logit contributions
 ever+0.487
keeping+0.465
 onwards+0.452
 fright+0.423
 it+0.412
Neg logit contributions
aman-0.746
 hey-0.740
land-0.700
 whose-0.683
 extingu-0.667
Token that
Token ID326
Feature activation+2.53e-01

Pos logit contributions
aman+2.086
 hey+1.995
land+1.853
 exception+1.790
lves+1.756
Neg logit contributions
 ever-1.434
 it-1.359
 onwards-1.299
 excited-1.251
 on-1.244
Token does
Token ID857
Feature activation+7.93e-01

Pos logit contributions
aman+0.422
 hey+0.414
land+0.394
 exception+0.377
 extingu+0.372
Neg logit contributions
 ever-0.273
keeping-0.261
 onwards-0.260
 it-0.245
 fright-0.240
Token,
Token ID11
Feature activation+8.95e-01

Pos logit contributions
aman+1.607
 hey+1.570
land+1.522
 exception+1.466
 extingu+1.447
Neg logit contributions
 ever-0.648
 onwards-0.591
 it-0.584
keeping-0.545
 excited-0.519
Token she
Token ID673
Feature activation-1.47e-01

Pos logit contributions
aman+1.576
 hey+1.527
land+1.506
 exception+1.421
 extingu+1.406
Neg logit contributions
 ever-0.928
 it-0.907
 onwards-0.857
keeping-0.822
 excited-0.812
Token was
Token ID373
Feature activation+4.81e-01

Pos logit contributions
 ever+0.157
 onwards+0.152
keeping+0.151
 fright+0.140
 it+0.138
Neg logit contributions
aman-0.246
 hey-0.242
land-0.229
 exception-0.219
 whose-0.218
Token walking
Token ID6155
Feature activation-7.75e-01

Pos logit contributions
aman+0.881
 hey+0.860
land+0.827
 exception+0.789
 extingu+0.784
Neg logit contributions
 ever-0.455
 onwards-0.426
keeping-0.419
 it-0.403
 fright-0.387
Token in
Token ID287
Feature activation-2.96e-01

Pos logit contributions
keeping+0.811
 ever+0.777
 onwards+0.773
unts+0.749
 fright+0.741
Neg logit contributions
 hey-1.259
aman-1.240
land-1.194
 exception-1.187
 whose-1.156
Token the
Token ID262
Feature activation+1.66e+00

Pos logit contributions
 ever+0.297
 onwards+0.289
keeping+0.284
unts+0.254
 sideways+0.253
Neg logit contributions
aman-0.513
 hey-0.498
land-0.477
 exception-0.455
 extingu-0.454
Token park
Token ID3952
Feature activation+6.66e-01

Pos logit contributions
 hey+2.146
aman+2.077
land+1.958
 extingu+1.928
ist+1.870
Neg logit contributions
 it-2.405
 onwards-2.353
 ever-2.332
keeping-2.274
 excited-2.262
Token when
Token ID618
Feature activation-4.89e-01

Pos logit contributions
aman+1.070
 hey+1.044
land+0.991
 extingu+0.945
 exception+0.942
Neg logit contributions
 ever-0.788
 onwards-0.730
keeping-0.727
 it-0.715
 excited-0.692
Token she
Token ID673
Feature activation+6.23e-01

Pos logit contributions
keeping+0.605
 ever+0.594
 onwards+0.579
unts+0.548
 fright+0.547
Neg logit contributions
aman-0.720
 hey-0.719
land-0.653
 exception-0.630
 whose-0.626
Token saw
Token ID2497
Feature activation+1.46e+00

Pos logit contributions
aman+1.032
 hey+1.007
land+0.961
 exception+0.927
 extingu+0.919
Neg logit contributions
 ever-0.695
 onwards-0.652
keeping-0.641
 it-0.633
 fright-0.601
Token a
Token ID257
Feature activation-7.10e-02

Pos logit contributions
aman+2.647
 hey+2.568
land+2.523
 exception+2.399
 extingu+2.347
Neg logit contributions
 it-1.520
 ever-1.440
keeping-1.331
 excited-1.291
 onwards-1.290
Token
Token ID198
Feature activation+3.94e-01

Pos logit contributions
 ever+0.299
keeping+0.285
 onwards+0.279
 fright+0.260
 sideways+0.249
Neg logit contributions
aman-0.473
 hey-0.460
land-0.448
 whose-0.425
 exception-0.421
TokenThey
Token ID2990
Feature activation+6.08e-01

Pos logit contributions
aman+0.767
 hey+0.754
land+0.723
 exception+0.698
 extingu+0.689
Neg logit contributions
 ever-0.322
 onwards-0.298
keeping-0.297
 it-0.279
 fright-0.266
Token ask
Token ID1265
Feature activation+7.06e-02

Pos logit contributions
aman+1.063
 hey+1.034
land+0.994
 exception+0.952
 extingu+0.937
Neg logit contributions
 ever-0.622
 onwards-0.574
keeping-0.569
 it-0.552
 fright-0.533
Token Ben
Token ID3932
Feature activation+1.08e+00

Pos logit contributions
aman+0.109
 hey+0.107
land+0.101
 exception+0.096
 whose+0.095
Neg logit contributions
 ever-0.085
keeping-0.082
 onwards-0.081
 fright-0.076
 it-0.076
Token and
Token ID290
Feature activation+1.77e+00

Pos logit contributions
aman+2.254
 hey+2.211
land+2.074
 exception+2.046
 extingu+2.022
Neg logit contributions
 ever-0.761
 it-0.686
 onwards-0.685
keeping-0.639
 excited-0.627
Token Sue
Token ID26089
Feature activation+1.63e+00

Pos logit contributions
aman+3.842
land+3.647
 hey+3.605
lves+3.487
 exception+3.452
Neg logit contributions
 ever-1.382
 it-1.121
 excited-1.064
keeping-1.042
 fright-1.041
Token many
Token ID867
Feature activation+4.88e-01

Pos logit contributions
aman+2.919
 hey+2.813
land+2.588
lves+2.564
 exception+2.540
Neg logit contributions
 ever-1.700
 it-1.621
 onwards-1.572
 on-1.515
 excited-1.510
Token questions
Token ID2683
Feature activation+2.57e-01

Pos logit contributions
aman+0.857
 hey+0.840
land+0.797
 exception+0.765
 extingu+0.757
Neg logit contributions
 ever-0.491
 onwards-0.462
keeping-0.456
 it-0.441
 fright-0.424
Token.
Token ID13
Feature activation+4.37e-01

Pos logit contributions
aman+0.457
 hey+0.449
land+0.431
 exception+0.412
 extingu+0.406
Neg logit contributions
 ever-0.248
keeping-0.234
 onwards-0.233
 it-0.219
 fright-0.214
Token They
Token ID1119
Feature activation+1.14e+00

Pos logit contributions
aman+0.827
 hey+0.814
land+0.778
 exception+0.750
 extingu+0.738
Neg logit contributions
 ever-0.377
 onwards-0.353
keeping-0.351
 it-0.332
 fright-0.316
Token show
Token ID905
Feature activation+5.10e-01

Pos logit contributions
aman+2.061
 hey+2.000
land+1.928
lves+1.831
 exception+1.830
Neg logit contributions
 ever-1.111
 onwards-0.994
 it-0.991
 excited-0.957
keeping-0.948
Token mom
Token ID1995
Feature activation-3.84e-01

Pos logit contributions
 ever+0.222
keeping+0.217
 onwards+0.215
 it+0.195
 fright+0.189
Neg logit contributions
aman-0.395
 hey-0.395
land-0.371
 exception-0.358
 extingu-0.356
Token saw
Token ID2497
Feature activation+1.31e+00

Pos logit contributions
keeping+0.445
 ever+0.438
 onwards+0.430
 fright+0.390
 sideways+0.389
Neg logit contributions
aman-0.611
 hey-0.610
land-0.562
 whose-0.541
 exception-0.531
Token him
Token ID683
Feature activation-8.16e-01

Pos logit contributions
aman+2.588
land+2.519
 hey+2.514
 exception+2.411
lves+2.333
Neg logit contributions
 it-1.074
 ever-1.023
keeping-0.933
 onwards-0.903
 fright-0.884
Token and
Token ID290
Feature activation+7.45e-01

Pos logit contributions
 ever+0.769
keeping+0.751
 onwards+0.729
 fright+0.657
 sideways+0.652
Neg logit contributions
aman-1.455
 hey-1.433
land-1.376
 whose-1.351
 exception-1.329
Token helped
Token ID4193
Feature activation+6.46e-02

Pos logit contributions
aman+1.440
 hey+1.405
land+1.364
 exception+1.310
 extingu+1.285
Neg logit contributions
 ever-0.637
 it-0.581
 onwards-0.572
keeping-0.568
 excited-0.541
Token him
Token ID683
Feature activation+1.43e+00

Pos logit contributions
aman+0.124
 hey+0.122
land+0.116
 exception+0.113
 extingu+0.111
Neg logit contributions
 ever-0.054
 onwards-0.051
keeping-0.050
 it-0.045
 fright-0.045
Token fill
Token ID6070
Feature activation+2.71e-01

Pos logit contributions
aman+2.570
 hey+2.428
land+2.333
lves+2.232
 exception+2.180
Neg logit contributions
 ever-1.544
 it-1.502
keeping-1.422
 excited-1.413
 onwards-1.404
Token his
Token ID465
Feature activation-5.05e-01

Pos logit contributions
aman+0.485
 hey+0.475
land+0.454
 exception+0.437
 extingu+0.431
Neg logit contributions
 ever-0.261
 onwards-0.244
keeping-0.244
 it-0.227
 fright-0.223
Token cup
Token ID6508
Feature activation-1.38e-01

Pos logit contributions
 ever+0.671
 onwards+0.643
keeping+0.642
 fright+0.602
unts+0.588
Neg logit contributions
aman-0.706
 hey-0.694
land-0.665
 extingu-0.634
 whose-0.633
Token.
Token ID13
Feature activation-3.31e-01

Pos logit contributions
 ever+0.133
keeping+0.126
 onwards+0.125
 fright+0.118
 it+0.117
Neg logit contributions
aman-0.241
 hey-0.236
land-0.228
 exception-0.218
 extingu-0.218
Token She
Token ID1375
Feature activation-5.92e-02

Pos logit contributions
 ever+0.335
keeping+0.316
 onwards+0.307
 fright+0.286
 it+0.278
Neg logit contributions
aman-0.582
 hey-0.571
land-0.533
 exception-0.519
 extingu-0.519
Token holding
Token ID4769
Feature activation-3.23e-01

Pos logit contributions
aman+1.420
 hey+1.379
land+1.335
 exception+1.272
lves+1.240
Neg logit contributions
 ever-0.860
 onwards-0.807
 it-0.796
keeping-0.784
 excited-0.773
Token up
Token ID510
Feature activation-3.14e-01

Pos logit contributions
 ever+0.393
keeping+0.381
 onwards+0.368
 fright+0.343
unts+0.339
Neg logit contributions
aman-0.490
 hey-0.487
land-0.455
 exception-0.446
 whose-0.444
Token a
Token ID257
Feature activation+2.65e-01

Pos logit contributions
 ever+0.382
keeping+0.370
 onwards+0.364
 fright+0.335
 it+0.333
Neg logit contributions
aman-0.478
 hey-0.465
land-0.448
 exception-0.428
 extingu-0.424
Token shiny
Token ID22441
Feature activation+6.67e-01

Pos logit contributions
aman+0.423
 hey+0.414
land+0.394
 exception+0.377
 extingu+0.371
Neg logit contributions
 ever-0.304
keeping-0.290
 onwards-0.288
 it-0.273
 fright-0.267
Token red
Token ID2266
Feature activation+4.84e-01

Pos logit contributions
aman+1.001
 hey+0.986
land+0.924
 exception+0.877
 extingu+0.860
Neg logit contributions
 ever-0.835
 onwards-0.789
keeping-0.782
 it-0.781
 fright-0.748
Token car
Token ID1097
Feature activation+5.30e-01

Pos logit contributions
aman+0.667
 hey+0.653
land+0.613
 exception+0.581
 extingu+0.566
Neg logit contributions
 ever-0.665
 onwards-0.633
keeping-0.632
 it-0.616
 fright-0.599
Token.
Token ID13
Feature activation-3.47e-01

Pos logit contributions
aman+0.987
 hey+0.967
land+0.922
 exception+0.887
 extingu+0.873
Neg logit contributions
 ever-0.483
 onwards-0.445
keeping-0.442
 it-0.423
 fright-0.403
Token "
Token ID366
Feature activation-9.12e-01

Pos logit contributions
 ever+0.382
keeping+0.373
 onwards+0.353
 fright+0.330
 it+0.319
Neg logit contributions
aman-0.571
 hey-0.556
land-0.532
 whose-0.511
 exception-0.511
TokenIt
Token ID1026
Feature activation+1.64e+00

Pos logit contributions
keeping+1.115
 onwards+1.093
 ever+1.084
 fright+0.994
 grin+0.966
Neg logit contributions
 hey-1.341
aman-1.295
land-1.260
 whose-1.243
 extingu-1.187
Token is
Token ID318
Feature activation+1.10e-01

Pos logit contributions
aman+3.369
 hey+3.170
land+3.010
 exception+2.897
 extingu+2.875
Neg logit contributions
 ever-1.486
 it-1.416
 onwards-1.282
 on-1.258
 excited-1.220
Token very
Token ID845
Feature activation-6.93e-01

Pos logit contributions
aman+0.198
 hey+0.194
land+0.187
 exception+0.179
 extingu+0.176
Neg logit contributions
 ever-0.104
keeping-0.100
 onwards-0.098
 it-0.090
 fright-0.090
Token upon
Token ID2402
Feature activation-4.01e-02

Pos logit contributions
aman+0.541
 hey+0.528
land+0.496
 exception+0.468
 extingu+0.461
Neg logit contributions
 ever-0.592
 onwards-0.568
keeping-0.564
 it-0.549
 fright-0.534
Token a
Token ID257
Feature activation+1.08e+00

Pos logit contributions
 ever+0.050
keeping+0.049
 onwards+0.049
 fright+0.045
 it+0.045
Neg logit contributions
aman-0.058
 hey-0.058
land-0.055
 exception-0.052
 whose-0.052
Token time
Token ID640
Feature activation+2.67e-01

Pos logit contributions
 hey+1.870
aman+1.865
land+1.760
 extingu+1.702
lves+1.682
Neg logit contributions
 ever-1.125
 onwards-1.005
 it-0.974
 fright-0.965
 excited-0.959
Token there
Token ID612
Feature activation-3.25e-01

Pos logit contributions
aman+0.497
 hey+0.488
land+0.467
 exception+0.449
 extingu+0.443
Neg logit contributions
 ever-0.236
keeping-0.222
 onwards-0.220
 it-0.205
 fright-0.200
Token was
Token ID373
Feature activation+1.61e-01

Pos logit contributions
keeping+0.351
 ever+0.345
 onwards+0.338
 fright+0.313
 sideways+0.308
Neg logit contributions
aman-0.537
 hey-0.519
land-0.495
 whose-0.476
 exception-0.471
Token a
Token ID257
Feature activation+1.86e-01

Pos logit contributions
aman+0.244
 hey+0.239
land+0.226
 exception+0.216
 extingu+0.212
Neg logit contributions
 ever-0.197
keeping-0.190
 onwards-0.189
 it-0.178
 fright-0.176
Token girl
Token ID2576
Feature activation+1.20e-01

Pos logit contributions
aman+0.206
 hey+0.201
land+0.186
 exception+0.174
 extingu+0.170
Neg logit contributions
 ever-0.303
keeping-0.295
 onwards-0.293
 it-0.281
 fright-0.277
Token called
Token ID1444
Feature activation+5.12e-01

Pos logit contributions
aman+0.197
 hey+0.194
land+0.184
 exception+0.176
 whose+0.176
Neg logit contributions
 ever-0.131
 onwards-0.127
keeping-0.126
 it-0.118
 fright-0.116
Token Sally
Token ID25737
Feature activation+7.96e-01

Pos logit contributions
aman+0.907
 hey+0.889
land+0.850
 exception+0.818
 extingu+0.803
Neg logit contributions
 ever-0.508
keeping-0.474
 onwards-0.471
 it-0.450
 fright-0.432
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
aman+1.489
 hey+1.472
land+1.383
 exception+1.344
 extingu+1.317
Neg logit contributions
 ever-0.744
keeping-0.675
 it-0.660
 onwards-0.658
 excited-0.613
Token Sally
Token ID25737
Feature activation-1.80e-01

Pos logit contributions
 ever+1.098
keeping+1.059
 onwards+0.993
 fright+0.959
unts+0.885
Neg logit contributions
aman-2.021
 hey-1.889
land-1.832
 exception-1.794
 whose-1.722
Token.
Token ID13
Feature activation+1.57e-01

Pos logit contributions
 ever+0.071
keeping+0.069
 onwards+0.068
 fright+0.063
 it+0.063
Neg logit contributions
aman-0.124
 hey-0.121
land-0.117
 exception-0.112
 whose-0.110
Token He
Token ID679
Feature activation-3.37e-01

Pos logit contributions
aman+0.296
 hey+0.289
land+0.278
 exception+0.267
 extingu+0.263
Neg logit contributions
 ever-0.136
keeping-0.127
 onwards-0.125
 it-0.117
 fright-0.114
Token grabbed
Token ID13176
Feature activation-1.02e-01

Pos logit contributions
keeping+0.336
 ever+0.330
 onwards+0.327
 fright+0.293
 it+0.291
Neg logit contributions
aman-0.582
 hey-0.566
land-0.544
 whose-0.521
 exception-0.520
Token an
Token ID281
Feature activation-8.29e-01

Pos logit contributions
 ever+0.110
keeping+0.107
 onwards+0.104
 fright+0.095
unts+0.095
Neg logit contributions
aman-0.169
 hey-0.165
land-0.156
 exception-0.151
 extingu-0.151
Token empty
Token ID6565
Feature activation-7.91e-01

Pos logit contributions
 ever+1.081
keeping+1.076
 onwards+1.070
 it+0.983
 sideways+0.957
Neg logit contributions
aman-1.200
 hey-1.123
land-1.091
 exception-1.084
 extingu-1.061
Token soda
Token ID20584
Feature activation+3.70e-01

Pos logit contributions
 ever+1.007
 onwards+0.984
keeping+0.956
 it+0.877
unts+0.863
Neg logit contributions
aman-1.191
land-1.096
 hey-1.095
 extingu-1.047
 exception-1.011
Token bottle
Token ID9294
Feature activation-2.87e-01

Pos logit contributions
aman+0.745
 hey+0.733
land+0.705
 exception+0.680
 extingu+0.670
Neg logit contributions
 ever-0.272
keeping-0.249
 onwards-0.249
 it-0.231
 fright-0.220
Token and
Token ID290
Feature activation-2.62e-02

Pos logit contributions
 ever+0.211
keeping+0.206
 onwards+0.198
 fright+0.183
 it+0.177
Neg logit contributions
aman-0.563
 hey-0.547
land-0.533
 extingu-0.518
 exception-0.510
Token threw
Token ID9617
Feature activation-7.19e-01

Pos logit contributions
 ever+0.031
keeping+0.029
 onwards+0.029
 fright+0.027
 it+0.027
Neg logit contributions
aman-0.041
 hey-0.040
land-0.038
 exception-0.036
 extingu-0.036
Token it
Token ID340
Feature activation+5.16e-01

Pos logit contributions
 ever+0.318
keeping+0.311
 onwards+0.294
unts+0.230
 fright+0.209
Neg logit contributions
aman-1.627
 hey-1.602
land-1.538
 extingu-1.507
 exception-1.499
Token at
Token ID379
Feature activation+5.77e-01

Pos logit contributions
aman+0.998
 hey+0.979
land+0.939
 exception+0.902
 extingu+0.891
Neg logit contributions
 ever-0.434
 onwards-0.400
keeping-0.392
 it-0.376
 fright-0.358
Token talked
Token ID6619
Feature activation-7.20e-01

Pos logit contributions
 onwards+0.357
 ever+0.355
keeping+0.342
 fright+0.323
 sideways+0.315
Neg logit contributions
aman-0.557
 hey-0.553
land-0.526
 extingu-0.504
 whose-0.502
Token to
Token ID284
Feature activation+3.95e-01

Pos logit contributions
keeping+0.812
 ever+0.797
 onwards+0.775
 fright+0.737
tick+0.708
Neg logit contributions
land-1.119
aman-1.111
 hey-1.089
 whose-1.024
 exception-1.012
Token the
Token ID262
Feature activation-3.04e-01

Pos logit contributions
aman+0.704
 hey+0.690
land+0.660
 exception+0.633
 extingu+0.624
Neg logit contributions
 ever-0.388
keeping-0.363
 onwards-0.362
 it-0.346
 fright-0.334
Token man
Token ID582
Feature activation-1.13e-01

Pos logit contributions
 ever+0.356
 onwards+0.349
keeping+0.343
 it+0.317
 fright+0.315
Neg logit contributions
aman-0.472
 hey-0.468
land-0.450
 exception-0.419
 extingu-0.415
Token.
Token ID13
Feature activation+5.09e-01

Pos logit contributions
 ever+0.116
keeping+0.113
 onwards+0.112
 fright+0.104
 it+0.103
Neg logit contributions
aman-0.189
 hey-0.187
land-0.182
 exception-0.173
 whose-0.172
Token They
Token ID1119
Feature activation+1.86e-01

Pos logit contributions
aman+0.969
 hey+0.955
land+0.912
 exception+0.880
 extingu+0.865
Neg logit contributions
 ever-0.436
 onwards-0.411
keeping-0.406
 it-0.390
 fright-0.367
Token made
Token ID925
Feature activation-9.36e-02

Pos logit contributions
aman+0.345
 hey+0.340
land+0.325
 exception+0.313
 extingu+0.309
Neg logit contributions
 ever-0.163
keeping-0.155
 onwards-0.155
 it-0.143
 fright-0.139
Token him
Token ID683
Feature activation-6.24e-01

Pos logit contributions
 ever+0.106
 onwards+0.103
keeping+0.103
 fright+0.093
 it+0.092
Neg logit contributions
aman-0.149
 hey-0.147
land-0.140
 exception-0.135
 whose-0.132
Token feel
Token ID1254
Feature activation-4.52e-01

Pos logit contributions
keeping+0.586
 ever+0.582
 onwards+0.581
 it+0.504
 fright+0.502
Neg logit contributions
 hey-1.081
aman-1.079
land-1.051
 exception-1.014
 whose-1.001
Token less
Token ID1342
Feature activation-1.26e-01

Pos logit contributions
keeping+0.494
 ever+0.490
 onwards+0.470
 fright+0.430
 sideways+0.422
Neg logit contributions
 hey-0.726
aman-0.717
land-0.686
 exception-0.660
 whose-0.655
Token sad
Token ID6507
Feature activation+4.57e-01

Pos logit contributions
 ever+0.106
keeping+0.101
 onwards+0.099
 it+0.090
 fright+0.089
Neg logit contributions
aman-0.237
 hey-0.233
land-0.226
 exception-0.218
 extingu-0.213
Token Suddenly
Token ID24975
Feature activation+3.63e-01

Pos logit contributions
aman+0.969
 hey+0.950
land+0.911
 exception+0.870
 extingu+0.857
Neg logit contributions
 ever-0.553
 onwards-0.524
keeping-0.519
 it-0.502
 fright-0.476
Token,
Token ID11
Feature activation+2.24e-01

Pos logit contributions
aman+0.684
 hey+0.671
land+0.644
 exception+0.619
 extingu+0.611
Neg logit contributions
 ever-0.316
 onwards-0.294
keeping-0.294
 it-0.277
 fright-0.266
Token the
Token ID262
Feature activation+2.71e-01

Pos logit contributions
aman+0.395
 hey+0.389
land+0.369
 exception+0.355
 extingu+0.351
Neg logit contributions
 ever-0.222
keeping-0.210
 onwards-0.208
 it-0.193
 fright-0.191
Token stick
Token ID4859
Feature activation-5.07e-01

Pos logit contributions
aman+0.430
 hey+0.422
land+0.400
 exception+0.382
 extingu+0.376
Neg logit contributions
 ever-0.317
keeping-0.302
 onwards-0.301
 it-0.286
 fright-0.279
Token broke
Token ID6265
Feature activation-4.59e-01

Pos logit contributions
keeping+0.579
 ever+0.547
 onwards+0.532
 fright+0.489
 it+0.483
Neg logit contributions
aman-0.796
 hey-0.788
land-0.749
 exception-0.711
 extingu-0.702
Token and
Token ID290
Feature activation-1.65e-01

Pos logit contributions
 ever+0.412
keeping+0.401
 onwards+0.395
 fright+0.365
unts+0.350
Neg logit contributions
aman-0.827
 hey-0.806
land-0.779
 extingu-0.757
 whose-0.740
Token Tim
Token ID5045
Feature activation-6.70e-01

Pos logit contributions
 ever+0.143
keeping+0.138
 onwards+0.138
 fright+0.122
unts+0.121
Neg logit contributions
aman-0.306
 hey-0.303
land-0.282
 extingu-0.276
 exception-0.275
Tokenmy
Token ID1820
Feature activation-5.61e-01

Pos logit contributions
 ever+0.882
 onwards+0.879
keeping+0.873
 it+0.820
 fright+0.808
Neg logit contributions
aman-0.936
 hey-0.905
land-0.849
 whose-0.801
 exception-0.798
Token fell
Token ID3214
Feature activation+1.35e-01

Pos logit contributions
keeping+0.617
 ever+0.601
 onwards+0.599
 fright+0.538
 sideways+0.538
Neg logit contributions
 hey-0.894
aman-0.889
land-0.831
 whose-0.807
lves-0.796
Token down
Token ID866
Feature activation-1.06e+00

Pos logit contributions
aman+0.253
 hey+0.249
land+0.239
 exception+0.230
 extingu+0.228
Neg logit contributions
 ever-0.116
keeping-0.109
 onwards-0.108
 it-0.100
 fright-0.097
Token.
Token ID13
Feature activation-2.84e-01

Pos logit contributions
keeping+1.161
 onwards+1.119
 ever+1.109
unts+1.035
 Go+1.020
Neg logit contributions
aman-1.657
 hey-1.626
land-1.562
 extingu-1.520
 exception-1.496
Token's
Token ID338
Feature activation+3.80e-01

Pos logit contributions
aman+0.286
 hey+0.280
land+0.266
 exception+0.255
 whose+0.251
Neg logit contributions
 ever-0.188
keeping-0.180
 onwards-0.179
 it-0.168
 fright-0.165
Token mom
Token ID1995
Feature activation+4.10e-01

Pos logit contributions
aman+0.673
 hey+0.659
land+0.631
 exception+0.606
 extingu+0.596
Neg logit contributions
 ever-0.378
keeping-0.354
 onwards-0.353
 it-0.336
 fright-0.325
Token helped
Token ID4193
Feature activation-7.74e-01

Pos logit contributions
aman+0.732
 hey+0.718
land+0.688
 exception+0.661
 extingu+0.651
Neg logit contributions
 ever-0.399
 onwards-0.372
keeping-0.371
 it-0.354
 fright-0.341
Token her
Token ID607
Feature activation+1.13e+00

Pos logit contributions
 onwards+0.974
 ever+0.968
keeping+0.947
 sideways+0.879
 fright+0.872
Neg logit contributions
aman-1.145
 hey-1.132
land-1.048
 exception-1.010
 whose-0.997
Token put
Token ID1234
Feature activation-1.00e+00

Pos logit contributions
aman+1.980
 hey+1.895
land+1.839
lves+1.752
 exception+1.746
Neg logit contributions
 ever-1.209
 it-1.134
 onwards-1.091
 excited-1.088
keeping-1.082
Token the
Token ID262
Feature activation-2.35e-01

Pos logit contributions
 ever+1.114
keeping+1.073
 onwards+1.031
 fright+0.991
unts+0.975
Neg logit contributions
aman-1.609
 hey-1.575
land-1.484
 extingu-1.478
 whose-1.476
Token picture
Token ID4286
Feature activation-1.05e+00

Pos logit contributions
 ever+0.257
 onwards+0.253
keeping+0.248
 it+0.227
 fright+0.224
Neg logit contributions
aman-0.387
 hey-0.380
land-0.364
 whose-0.342
 extingu-0.341
Token in
Token ID287
Feature activation-8.02e-01

Pos logit contributions
keeping+0.958
 ever+0.926
 onwards+0.917
 fright+0.887
unts+0.843
Neg logit contributions
aman-1.832
 hey-1.771
land-1.770
 extingu-1.673
 exception-1.661
Token an
Token ID281
Feature activation-2.95e-01

Pos logit contributions
 ever+0.936
 onwards+0.926
keeping+0.916
 sideways+0.845
 fright+0.835
Neg logit contributions
aman-1.260
 hey-1.186
land-1.129
 extingu-1.102
 whose-1.093
Token envelope
Token ID22878
Feature activation+2.03e-01

Pos logit contributions
 ever+0.374
 onwards+0.368
keeping+0.364
 it+0.337
 fright+0.336
Neg logit contributions
aman-0.439
 hey-0.424
land-0.401
 exception-0.385
 whose-0.379
Token and
Token ID290
Feature activation+1.98e-01

Pos logit contributions
aman+0.404
 hey+0.395
land+0.382
 exception+0.367
 extingu+0.363
Neg logit contributions
 ever-0.154
keeping-0.143
 onwards-0.142
 it-0.131
 fright-0.128
Token a
Token ID257
Feature activation-3.06e-01

Pos logit contributions
aman+0.473
 hey+0.463
land+0.437
 exception+0.416
 extingu+0.408
Neg logit contributions
 ever-0.424
keeping-0.405
 onwards-0.404
 it-0.387
 fright-0.379
Token special
Token ID2041
Feature activation-1.27e-01

Pos logit contributions
 ever+0.399
keeping+0.393
 onwards+0.388
 it+0.360
 fright+0.350
Neg logit contributions
aman-0.442
 hey-0.428
land-0.405
 exception-0.389
 extingu-0.381
Token material
Token ID2587
Feature activation+1.13e-01

Pos logit contributions
 ever+0.168
 onwards+0.164
keeping+0.163
 it+0.153
 fright+0.150
Neg logit contributions
aman-0.180
 hey-0.172
land-0.164
 exception-0.159
 extingu-0.155
Token that
Token ID326
Feature activation+2.85e-01

Pos logit contributions
aman+0.197
 hey+0.193
land+0.186
 exception+0.178
 extingu+0.176
Neg logit contributions
 ever-0.111
 onwards-0.105
keeping-0.105
 it-0.097
 fright-0.097
Token helps
Token ID5419
Feature activation-3.82e-01

Pos logit contributions
aman+0.497
 hey+0.489
land+0.466
 exception+0.447
 extingu+0.441
Neg logit contributions
 ever-0.284
keeping-0.268
 onwards-0.268
 it-0.252
 fright-0.245
Token keep
Token ID1394
Feature activation-6.14e-01

Pos logit contributions
 ever+0.397
 onwards+0.390
keeping+0.387
 fright+0.353
 sideways+0.352
Neg logit contributions
aman-0.633
 hey-0.626
land-0.603
 exception-0.570
 whose-0.563
Token the
Token ID262
Feature activation-3.40e-02

Pos logit contributions
 ever+0.602
 onwards+0.596
keeping+0.576
 fright+0.538
 sideways+0.532
Neg logit contributions
aman-1.044
 hey-1.029
land-0.986
 exception-0.948
 whose-0.941
Token cold
Token ID4692
Feature activation-6.93e-01

Pos logit contributions
 ever+0.048
 onwards+0.046
keeping+0.046
 it+0.044
 fright+0.043
Neg logit contributions
aman-0.046
 hey-0.045
land-0.042
 exception-0.040
 extingu-0.039
Token inside
Token ID2641
Feature activation-5.49e-01

Pos logit contributions
 ever+0.847
 onwards+0.833
keeping+0.800
 sideways+0.764
unts+0.762
Neg logit contributions
aman-1.024
 hey-1.004
land-0.967
 extingu-0.923
 exception-0.918
Token.
Token ID13
Feature activation-8.90e-01

Pos logit contributions
 ever+0.537
keeping+0.528
 onwards+0.525
 fright+0.493
 sideways+0.477
Neg logit contributions
aman-0.926
 hey-0.911
land-0.899
 exception-0.841
 whose-0.840
Token Ice
Token ID6663
Feature activation-1.58e-01

Pos logit contributions
 ever+0.890
keeping+0.852
 onwards+0.814
 fright+0.764
 sideways+0.744
Neg logit contributions
aman-1.568
 hey-1.554
land-1.459
 whose-1.445
 extingu-1.413
Token like
Token ID588
Feature activation+1.20e-01

Pos logit contributions
aman+0.889
 hey+0.865
land+0.825
 exception+0.785
 extingu+0.770
Neg logit contributions
 ever-0.635
keeping-0.600
 onwards-0.598
 it-0.583
 fright-0.559
Token kids
Token ID3988
Feature activation-4.97e-02

Pos logit contributions
aman+0.209
 hey+0.205
land+0.197
 exception+0.189
 whose+0.185
Neg logit contributions
 ever-0.117
 onwards-0.112
keeping-0.111
 it-0.102
 fright-0.101
Token'.
Token ID4458
Feature activation+3.02e-01

Pos logit contributions
 ever+0.044
keeping+0.042
 onwards+0.042
 fright+0.038
 it+0.038
Neg logit contributions
aman-0.091
 hey-0.090
land-0.087
 exception-0.083
 whose-0.082
Token He
Token ID679
Feature activation+1.06e-01

Pos logit contributions
aman+0.547
 hey+0.537
land+0.513
 exception+0.493
 extingu+0.486
Neg logit contributions
 ever-0.283
keeping-0.265
 onwards-0.264
 it-0.248
 fright-0.241
Token was
Token ID373
Feature activation+4.62e-01

Pos logit contributions
aman+0.187
 hey+0.184
land+0.176
 exception+0.169
 whose+0.166
Neg logit contributions
 ever-0.103
keeping-0.099
 onwards-0.098
 it-0.091
 fright-0.089
Token so
Token ID523
Feature activation-4.23e-01

Pos logit contributions
aman+0.829
 hey+0.812
land+0.777
 exception+0.743
 extingu+0.737
Neg logit contributions
 ever-0.446
 onwards-0.417
keeping-0.413
 it-0.395
 fright-0.381
Token mean
Token ID1612
Feature activation+4.52e-01

Pos logit contributions
keeping+0.257
 ever+0.240
 onwards+0.225
 fright+0.198
 it+0.196
Neg logit contributions
aman-0.893
 hey-0.891
land-0.845
 exception-0.840
 whose-0.821
Token.
Token ID13
Feature activation-3.38e-01

Pos logit contributions
aman+0.872
 hey+0.858
land+0.820
 exception+0.791
 extingu+0.782
Neg logit contributions
 ever-0.374
 onwards-0.345
keeping-0.345
 it-0.326
 fright-0.308
Token
Token ID198
Feature activation-4.65e-01

Pos logit contributions
 ever+0.332
keeping+0.316
 onwards+0.300
 fright+0.283
 it+0.281
Neg logit contributions
aman-0.598
 hey-0.582
land-0.557
 exception-0.536
 whose-0.533
Token
Token ID198
Feature activation+5.26e-03

Pos logit contributions
 ever+0.497
keeping+0.482
 onwards+0.455
 fright+0.425
 it+0.422
Neg logit contributions
aman-0.788
 hey-0.763
land-0.730
 exception-0.699
 whose-0.695
TokenThe
Token ID464
Feature activation-4.64e-01

Pos logit contributions
aman+0.007
 hey+0.007
land+0.007
 exception+0.006
 whose+0.006
Neg logit contributions
 ever-0.007
keeping-0.007
 onwards-0.007
 it-0.007
 fright-0.007
Token Lily
Token ID20037
Feature activation-8.81e-01

Pos logit contributions
keeping+1.046
 onwards+0.986
 ever+0.969
 sideways+0.886
tick+0.863
Neg logit contributions
aman-1.644
 hey-1.588
land-1.502
 whose-1.446
 exception-1.438
Token!"
Token ID2474
Feature activation-7.41e-01

Pos logit contributions
 onwards+0.815
keeping+0.808
 ever+0.796
 fright+0.762
 sideways+0.724
Neg logit contributions
aman-1.536
 hey-1.517
land-1.487
 exception-1.395
 whose-1.390
Token From
Token ID3574
Feature activation-2.77e-01

Pos logit contributions
 ever+0.816
keeping+0.777
 onwards+0.765
 fright+0.740
 sideways+0.696
Neg logit contributions
aman-1.210
 hey-1.168
land-1.127
 whose-1.089
 extingu-1.058
Token that
Token ID326
Feature activation-9.67e-01

Pos logit contributions
 ever+0.438
keeping+0.432
 onwards+0.427
 fright+0.407
unts+0.398
Neg logit contributions
aman-0.326
 hey-0.317
land-0.287
 whose-0.272
 exception-0.268
Token day
Token ID1110
Feature activation-1.65e+00

Pos logit contributions
keeping+1.015
 onwards+0.905
 ever+0.884
 fright+0.836
 sideways+0.815
Neg logit contributions
aman-1.700
 hey-1.636
land-1.534
 whose-1.509
 exception-1.498
Token on
Token ID319
Feature activation-3.14e-01

Pos logit contributions
keeping+1.173
 fright+0.848
 ever+0.722
 sideways+0.708
unts+0.647
Neg logit contributions
 hey-3.579
aman-3.433
 whose-3.175
land-3.167
 passenger-3.138
Token,
Token ID11
Feature activation+1.83e-01

Pos logit contributions
keeping+0.307
 ever+0.303
 onwards+0.285
 fright+0.273
 sideways+0.267
Neg logit contributions
aman-0.552
 hey-0.540
land-0.507
 whose-0.486
 exception-0.486
Token Tom
Token ID4186
Feature activation-5.05e-02

Pos logit contributions
aman+0.325
 hey+0.320
land+0.304
 exception+0.293
 whose+0.288
Neg logit contributions
 ever-0.175
keeping-0.168
 onwards-0.165
 it-0.154
 fright-0.152
Token wasn
Token ID2492
Feature activation-4.41e-01

Pos logit contributions
 ever+0.044
keeping+0.043
 onwards+0.042
 fright+0.039
 it+0.038
Neg logit contributions
aman-0.093
 hey-0.092
land-0.088
 exception-0.084
 whose-0.083
Token't
Token ID470
Feature activation+5.58e-01

Pos logit contributions
keeping+0.354
 ever+0.329
 onwards+0.316
 sideways+0.286
 it+0.278
Neg logit contributions
aman-0.860
 hey-0.850
land-0.802
 exception-0.780
lves-0.764
Token so
Token ID523
Feature activation+4.24e-01

Pos logit contributions
aman+1.113
 hey+1.090
land+1.049
 exception+1.004
 extingu+0.999
Neg logit contributions
 ever-0.436
 onwards-0.396
keeping-0.396
 it-0.377
 fright-0.358
TokenWe
Token ID1135
Feature activation-9.26e-01

Pos logit contributions
 ever+0.203
 onwards+0.197
keeping+0.196
 fright+0.180
 it+0.180
Neg logit contributions
aman-0.305
 hey-0.304
land-0.282
 whose-0.268
 exception-0.268
Token love
Token ID1842
Feature activation-1.04e+00

Pos logit contributions
keeping+1.106
 onwards+1.068
 ever+1.030
 sideways+0.979
 fright+0.940
Neg logit contributions
 hey-1.515
aman-1.431
 exception-1.300
land-1.296
 whose-1.281
Token you
Token ID345
Feature activation-9.23e-01

Pos logit contributions
 onwards+1.174
keeping+1.120
 ever+1.062
 fright+1.021
 grin+1.017
Neg logit contributions
 hey-1.779
aman-1.673
land-1.534
 whose-1.523
 exception-1.468
Token,
Token ID11
Feature activation-1.47e-01

Pos logit contributions
keeping+0.920
 onwards+0.809
 ever+0.802
 sideways+0.727
 fright+0.725
Neg logit contributions
 hey-1.701
aman-1.628
land-1.556
 whose-1.536
 exception-1.499
Token but
Token ID475
Feature activation-2.13e-01

Pos logit contributions
keeping+0.148
 ever+0.147
 onwards+0.143
 fright+0.128
 sideways+0.127
Neg logit contributions
aman-0.256
 hey-0.253
land-0.237
 exception-0.229
 whose-0.226
Token you
Token ID345
Feature activation-2.11e+00

Pos logit contributions
keeping+0.217
 onwards+0.213
 ever+0.211
 fright+0.192
 sideways+0.187
Neg logit contributions
 hey-0.368
aman-0.362
land-0.334
 whose-0.328
 exception-0.325
Token have
Token ID423
Feature activation-1.08e-01

Pos logit contributions
keeping+2.039
 onwards+2.024
 onward+1.676
 sideways+1.663
 ever+1.626
Neg logit contributions
 hey-4.101
aman-3.594
lves-3.547
 whose-3.539
 least-3.500
Token to
Token ID284
Feature activation+4.49e-02

Pos logit contributions
 ever+0.084
keeping+0.081
 onwards+0.081
 it+0.071
 fright+0.071
Neg logit contributions
 hey-0.211
aman-0.211
land-0.198
 exception-0.192
 whose-0.191
Token be
Token ID307
Feature activation+1.17e-01

Pos logit contributions
aman+0.073
 hey+0.072
land+0.068
 exception+0.065
 extingu+0.064
Neg logit contributions
 ever-0.050
 onwards-0.048
keeping-0.048
 it-0.045
 fright-0.044
Token careful
Token ID8161
Feature activation-1.07e+00

Pos logit contributions
aman+0.179
 hey+0.179
land+0.167
 exception+0.161
 whose+0.158
Neg logit contributions
 ever-0.141
keeping-0.136
 onwards-0.134
 it-0.127
 fright-0.125
Token and
Token ID290
Feature activation+2.15e-01

Pos logit contributions
 onwards+0.975
keeping+0.973
 ever+0.958
 fright+0.894
 sideways+0.884
Neg logit contributions
 hey-1.874
aman-1.820
land-1.771
 whose-1.709
 exception-1.682
Token.
Token ID13
Feature activation-7.91e-01

Pos logit contributions
keeping+1.005
 ever+0.986
 onwards+0.933
 fright+0.903
 sideways+0.869
Neg logit contributions
 hey-1.257
aman-1.240
land-1.182
 exception-1.168
 whose-1.150
Token
Token ID198
Feature activation-7.32e-01

Pos logit contributions
keeping+0.898
 ever+0.892
 onwards+0.807
 fright+0.774
 sideways+0.744
Neg logit contributions
aman-1.229
 hey-1.213
land-1.156
 exception-1.115
 whose-1.115
Token
Token ID198
Feature activation-4.11e-02

Pos logit contributions
keeping+0.854
 ever+0.825
 onwards+0.763
 fright+0.729
 sideways+0.713
Neg logit contributions
aman-1.151
 hey-1.126
land-1.085
 whose-1.035
 exception-1.029
Token"
Token ID1
Feature activation+2.37e-01

Pos logit contributions
 ever+0.035
keeping+0.033
 onwards+0.031
 fright+0.029
 it+0.029
Neg logit contributions
aman-0.078
 hey-0.077
land-0.074
 exception-0.071
 whose-0.070
TokenI
Token ID40
Feature activation-3.33e-01

Pos logit contributions
aman+0.389
 hey+0.382
land+0.363
 exception+0.347
 extingu+0.342
Neg logit contributions
 ever-0.265
keeping-0.252
 onwards-0.251
 it-0.238
 fright-0.232
Token don
Token ID836
Feature activation-1.88e+00

Pos logit contributions
keeping+0.333
 onwards+0.322
 ever+0.320
 fright+0.282
 it+0.280
Neg logit contributions
aman-0.581
 hey-0.575
land-0.541
 exception-0.530
 extingu-0.516
Token't
Token ID470
Feature activation-1.35e+00

Pos logit contributions
keeping+2.318
 ever+2.003
 onwards+1.914
 sideways+1.912
 grin+1.872
Neg logit contributions
 hey-2.945
aman-2.917
 whose-2.691
land-2.683
 extingu-2.648
Token know
Token ID760
Feature activation+1.18e+00

Pos logit contributions
 onwards+1.036
keeping+1.029
 ever+0.916
 sideways+0.830
tick+0.810
Neg logit contributions
 hey-2.625
aman-2.537
 exception-2.500
land-2.495
 whose-2.378
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
aman+2.385
 hey+2.329
land+2.172
 exception+2.162
lves+2.137
Neg logit contributions
 ever-0.949
 it-0.898
 onwards-0.810
keeping-0.804
 excited-0.740
Token Maybe
Token ID6674
Feature activation+6.29e-02

Pos logit contributions
 ever+0.160
 onwards+0.153
keeping+0.152
 it+0.141
 fright+0.141
Neg logit contributions
 hey-0.215
aman-0.214
land-0.203
 whose-0.193
 extingu-0.192
Token someone
Token ID2130
Feature activation+1.16e+00

Pos logit contributions
aman+0.124
 hey+0.123
land+0.118
 exception+0.113
 whose+0.113
Neg logit contributions
 ever-0.048
keeping-0.045
 onwards-0.045
 fright-0.040
 it-0.039
Token you
Token ID345
Feature activation+3.87e-01

Pos logit contributions
aman+0.158
 hey+0.155
land+0.147
 exception+0.140
 extingu+0.138
Neg logit contributions
 ever-0.109
 onwards-0.105
keeping-0.104
 it-0.097
 fright-0.096
Token as
Token ID355
Feature activation-6.13e-01

Pos logit contributions
aman+0.700
 hey+0.687
land+0.656
 exception+0.631
 extingu+0.623
Neg logit contributions
 ever-0.365
 onwards-0.340
keeping-0.340
 it-0.322
 fright-0.310
Token long
Token ID890
Feature activation+2.97e-01

Pos logit contributions
keeping+0.695
 onwards+0.680
 ever+0.664
 fright+0.632
 sideways+0.606
Neg logit contributions
aman-0.951
 hey-0.949
land-0.907
 exception-0.883
 whose-0.866
Token as
Token ID355
Feature activation-9.70e-02

Pos logit contributions
aman+0.480
 hey+0.470
land+0.447
 exception+0.427
 extingu+0.420
Neg logit contributions
 ever-0.336
keeping-0.319
 onwards-0.319
 it-0.303
 fright-0.296
Token you
Token ID345
Feature activation-3.42e-01

Pos logit contributions
 ever+0.097
keeping+0.095
 onwards+0.094
 fright+0.087
 it+0.085
Neg logit contributions
aman-0.165
 hey-0.163
land-0.156
 exception-0.150
 whose-0.147
Token don
Token ID836
Feature activation-1.72e+00

Pos logit contributions
keeping+0.303
 onwards+0.303
 ever+0.302
 it+0.274
 fright+0.267
Neg logit contributions
 hey-0.613
aman-0.611
land-0.588
 exception-0.564
 extingu-0.553
Token't
Token ID470
Feature activation-6.38e-01

Pos logit contributions
keeping+1.742
 ever+1.639
 sideways+1.624
 onwards+1.573
 fright+1.540
Neg logit contributions
aman-2.908
 hey-2.831
land-2.703
 exception-2.646
der-2.541
Token eat
Token ID4483
Feature activation-6.01e-02

Pos logit contributions
 onwards+0.715
keeping+0.685
 ever+0.685
 it+0.625
 sideways+0.623
Neg logit contributions
aman-1.015
 hey-1.011
land-0.961
 exception-0.935
 extingu-0.889
Token them
Token ID606
Feature activation-1.54e-01

Pos logit contributions
 ever+0.062
keeping+0.059
 onwards+0.059
 fright+0.055
 it+0.053
Neg logit contributions
aman-0.102
 hey-0.100
land-0.096
 exception-0.092
 whose-0.090
Token all
Token ID477
Feature activation+4.23e-01

Pos logit contributions
 ever+0.147
keeping+0.142
 onwards+0.140
 fright+0.131
 it+0.130
Neg logit contributions
aman-0.270
 hey-0.263
land-0.256
 exception-0.245
 whose-0.238
Token."
Token ID526
Feature activation+7.23e-01

Pos logit contributions
aman+0.782
 hey+0.769
land+0.734
 exception+0.705
 extingu+0.697
Neg logit contributions
 ever-0.389
keeping-0.362
 onwards-0.361
 it-0.342
 fright-0.327
Token saw
Token ID2497
Feature activation+6.18e-01

Pos logit contributions
 ever+0.235
keeping+0.232
 onwards+0.226
 fright+0.211
 it+0.206
Neg logit contributions
aman-0.316
 hey-0.308
land-0.293
 exception-0.279
 whose-0.277
Token a
Token ID257
Feature activation-5.63e-01

Pos logit contributions
aman+0.988
 hey+0.968
land+0.920
 exception+0.882
 extingu+0.869
Neg logit contributions
 ever-0.713
keeping-0.672
 onwards-0.671
 it-0.661
 fright-0.627
Token man
Token ID582
Feature activation-4.75e-01

Pos logit contributions
 ever+0.725
keeping+0.721
 onwards+0.707
 it+0.633
 fright+0.632
Neg logit contributions
aman-0.830
 hey-0.798
land-0.764
 whose-0.717
 exception-0.716
Token with
Token ID351
Feature activation+1.98e-01

Pos logit contributions
 ever+0.540
keeping+0.536
 onwards+0.528
 fright+0.492
 it+0.481
Neg logit contributions
aman-0.744
 hey-0.731
land-0.705
 whose-0.676
 exception-0.668
Token a
Token ID257
Feature activation-4.25e-01

Pos logit contributions
aman+0.305
 hey+0.299
land+0.283
 exception+0.271
 extingu+0.267
Neg logit contributions
 ever-0.237
keeping-0.226
 onwards-0.226
 it-0.214
 fright-0.209
Token big
Token ID1263
Feature activation-1.73e+00

Pos logit contributions
 ever+0.514
keeping+0.509
 onwards+0.488
 it+0.450
 fright+0.438
Neg logit contributions
aman-0.661
 hey-0.632
land-0.609
 exception-0.582
 extingu-0.575
Token mustache
Token ID49303
Feature activation-7.58e-02

Pos logit contributions
keeping+2.338
 ever+2.253
 onwards+2.143
tick+1.880
 Go+1.876
Neg logit contributions
aman-2.705
 hey-2.416
 extingu-2.395
land-2.349
 exception-2.277
Token!
Token ID0
Feature activation-1.78e-01

Pos logit contributions
 ever+0.057
keeping+0.054
 onwards+0.053
 fright+0.049
 it+0.048
Neg logit contributions
aman-0.149
 hey-0.146
land-0.142
 exception-0.136
 extingu-0.135
Token She
Token ID1375
Feature activation-4.15e-01

Pos logit contributions
 ever+0.190
keeping+0.177
 onwards+0.173
 fright+0.161
 it+0.160
Neg logit contributions
aman-0.307
 hey-0.295
land-0.284
 exception-0.271
 extingu-0.270
Token asked
Token ID1965
Feature activation-1.56e-01

Pos logit contributions
keeping+0.419
 ever+0.409
 onwards+0.402
 fright+0.360
 it+0.348
Neg logit contributions
aman-0.719
 hey-0.694
land-0.671
 whose-0.652
 exception-0.637
Token her
Token ID607
Feature activation+1.01e+00

Pos logit contributions
 ever+0.142
keeping+0.137
 onwards+0.134
 fright+0.123
 sideways+0.119
Neg logit contributions
aman-0.287
 hey-0.280
land-0.268
 exception-0.256
 whose-0.256
Token She
Token ID1375
Feature activation+2.16e-01

Pos logit contributions
 ever+0.564
keeping+0.541
 onwards+0.506
 fright+0.480
 it+0.462
Neg logit contributions
aman-0.827
 hey-0.808
land-0.742
 whose-0.728
 exception-0.717
Token couldn
Token ID3521
Feature activation-1.15e+00

Pos logit contributions
aman+0.348
 hey+0.341
land+0.323
 exception+0.309
 extingu+0.305
Neg logit contributions
 ever-0.246
keeping-0.235
 onwards-0.234
 it-0.220
 fright-0.216
Token't
Token ID470
Feature activation+3.42e-01

Pos logit contributions
keeping+1.142
 ever+1.020
 onwards+0.999
 sideways+0.989
 fright+0.942
Neg logit contributions
aman-2.014
 hey-2.001
land-1.826
 extingu-1.788
 exception-1.742
Token find
Token ID1064
Feature activation+9.40e-01

Pos logit contributions
aman+0.608
 hey+0.596
land+0.570
 exception+0.547
 extingu+0.539
Neg logit contributions
 ever-0.334
keeping-0.314
 onwards-0.313
 it-0.296
 fright-0.287
Token Sam
Token ID3409
Feature activation+7.96e-01

Pos logit contributions
aman+1.960
 hey+1.886
land+1.867
 exception+1.769
 extingu+1.734
Neg logit contributions
 ever-0.703
 it-0.646
 onwards-0.609
keeping-0.596
 fright-0.566
Token because
Token ID780
Feature activation-1.90e+00

Pos logit contributions
aman+1.453
 hey+1.429
land+1.345
 exception+1.305
 extingu+1.289
Neg logit contributions
 ever-0.771
 onwards-0.702
keeping-0.702
 it-0.690
 fright-0.648
Token the
Token ID262
Feature activation+7.23e-01

Pos logit contributions
keeping+2.743
 ever+2.403
 Amen+2.377
unts+2.372
 fright+2.369
Neg logit contributions
aman-2.532
 hey-2.527
 whose-2.347
 exception-2.223
land-2.137
Token water
Token ID1660
Feature activation-1.94e-01

Pos logit contributions
aman+1.012
 hey+0.969
land+0.926
 exception+0.865
 whose+0.851
Neg logit contributions
 ever-1.008
keeping-0.956
 onwards-0.954
 it-0.941
 fright-0.915
Token was
Token ID373
Feature activation-2.31e-01

Pos logit contributions
 ever+0.187
keeping+0.183
 onwards+0.182
 fright+0.167
 it+0.163
Neg logit contributions
aman-0.337
 hey-0.328
land-0.315
 extingu-0.300
 exception-0.298
Token so
Token ID523
Feature activation-4.89e-01

Pos logit contributions
 ever+0.237
keeping+0.235
 onwards+0.230
 fright+0.209
 it+0.208
Neg logit contributions
aman-0.390
 hey-0.377
land-0.362
 exception-0.350
 extingu-0.346
Token thick
Token ID6546
Feature activation-1.35e-01

Pos logit contributions
keeping+0.554
 ever+0.534
 onwards+0.521
 fright+0.475
 it+0.470
Neg logit contributions
aman-0.781
 hey-0.772
land-0.719
 exception-0.715
 extingu-0.698