TOP ACTIVATIONS
MAX = 3.281

happy . At the store , they found some
happy that the fountain was no longer dry .
store . At the store , the little girl
day while she was at the stable it started to rain
next day , the owner saw some more che wed threads
shoes . At the taxi stop , her brother
store . At the store , they picked out
day , Lily was at the park with her mom .
big , brown bear named Bub ba . Bub ba loved
He gives them to Anna and Ben . He says sorry
take a look . At the house , they saw a
The man sighed . He saw their sad faces . He
store . At the store , the mummy bear
you ." Ben saw that Sara was crying and
got his results , he saw that he had failed .
Their mom sighed . She saw the cut on the sheep
her they were going to the park . Lily was eager
all day long . At the end of the day ,
batteries . At the store , the store -
sun . At the end of the day ,

MIN ACTIVATIONS
MIN = -3.366

. She saw a beetle walking on the grass . She
ground . He wanted to drive his car over it ,
, and Peter wanted to ride it . He asked the
horn . They wanted to ride the boat , but there
very happy and wanted to ride it . L
front . Lily liked to ride it around her town and
He was so excited to ride in it . His mom
car . He wanted to drive the car with his stick
's bicycle and wanted to ride it , but Tim my
it closer , so she moved a bit closer . She
it . The crab was walking sideways and had big claws
a boat and wanted to ride it . He asked the
wanted to catch it and hold it in her hands .
to play with it and hold it tight . When she
She saw a house and knocked on the door .
the car and wanted to drive it . He climbed in
. Fl uffy wanted to ride in the je ep .
ball , and she would roll it all around her house
He sniff ed it and rolled it with his paw .
was having so much fun driving it around when he suddenly

INTERVAL 1.619 - 3.281
CONTAINS 0.674%

to get Danny out of the hole . Danny was so
but sometimes he did stupid things . One day , he
. <|endoftext|> S hen was playing in the garden when he
friendly bird flew by and saw the van hanging off the
boy decided to go to the reef . When he got

INTERVAL -0.042 - 1.619
CONTAINS 52.196%

Peter hopped around , marking trees and flowers until he was
a job . She works at a big store . She
's mom saw what he had done and said , "
realized it was a giant whale ! He was scared ,
a guitar , a dog , a cat , and a

INTERVAL -1.704 - -0.042
CONTAINS 46.722%

mom to fill his brain with knowledge .
da isy and tw irled around with it . Everyday ,
, a big rock fell into the stream . The water
It is their orange balloon . It is their best friend
stay up late and play until it was time for the

INTERVAL -3.366 - -1.704
CONTAINS 0.408%

, Tim saw a big , shiny bike that was very
Susan wanted it really badly , but she didn 't have
their vacation . When they got home , they looked at
over . He told her he was looking for someone .
water . It was white and shiny and had a flag
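
NOTE ON THE INTERVALS
The four INTERVAL blocks above appear to split the range from MIN to MAX into four equal-width bins, and the four CONTAINS percentages sum to 100%. This is inferred from the printed numbers only; the source does not say how the bins are built. Below is a minimal sketch under that assumption. The function name `interval_edges` and the variable names are hypothetical and are not taken from the tool that produced this page.

```python
# Sketch: rebuild the interval boundaries from the printed MIN and MAX,
# ASSUMING four equal-width bins (an inference, not stated in the source).

def interval_edges(lo, hi, n_bins=4):
    """Return the n_bins + 1 boundaries of equal-width bins over [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX as printed in the dashboard (already rounded to 3 decimals).
edges = interval_edges(-3.366, 3.281)

# CONTAINS percentages as printed, ordered from lowest to highest interval.
shares = [0.408, 46.722, 52.196, 0.674]

# Pair each consecutive (lower, upper) boundary with its printed share.
for (lower, upper), share in zip(zip(edges, edges[1:]), shares):
    print(f"{lower:+.3f} to {upper:+.3f}: {share:.3f}%")
```

The computed boundaries agree with the printed ones to within rounding. Because MIN and MAX are themselves rounded, the last digit of a boundary can differ by one.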
Token happy
Token ID3772
Feature activation-2.46e-01

Pos logit contributions
keeping+0.281
 ever+0.259
 onwards+0.239
 fright+0.211
 it+0.196
Neg logit contributions
 hey-1.139
aman-1.139
 exception-1.078
land-1.073
 whose-1.060
Token.
Token ID13
Feature activation+4.15e-01

Pos logit contributions
 ever+0.197
 onwards+0.194
keeping+0.190
 fright+0.174
 sideways+0.166
Neg logit contributions
aman-0.472
 hey-0.461
land-0.448
 exception-0.428
 whose-0.426
Token
Token ID198
Feature activation-3.29e-01

Pos logit contributions
aman+0.774
 hey+0.762
land+0.731
 exception+0.702
 extingu+0.693
Neg logit contributions
 ever-0.367
 onwards-0.344
keeping-0.343
 it-0.324
 fright-0.309
Token
Token ID198
Feature activation+6.60e-01

Pos logit contributions
 ever+0.340
keeping+0.327
 onwards+0.318
 fright+0.295
 sideways+0.288
Neg logit contributions
aman-0.569
 hey-0.551
land-0.531
 whose-0.505
 exception-0.504
TokenAt
Token ID2953
Feature activation+4.24e-01

Pos logit contributions
aman+0.937
 hey+0.918
land+0.874
 exception+0.827
 extingu+0.822
Neg logit contributions
 ever-0.884
 onwards-0.845
keeping-0.833
 it-0.818
 fright-0.788
Token the
Token ID262
Feature activation+3.28e+00

Pos logit contributions
aman+0.792
 hey+0.779
land+0.748
 exception+0.718
 extingu+0.708
Neg logit contributions
 ever-0.381
 onwards-0.354
keeping-0.354
 it-0.336
 fright-0.320
Token store
Token ID3650
Feature activation+3.42e-01

Pos logit contributions
der+5.429
ist+5.377
 hey+5.270
aman+5.158
land+5.149
Neg logit contributions
 it-3.949
 ever-3.941
 excited-3.714
 on-3.683
 onwards-3.540
Token,
Token ID11
Feature activation+8.91e-01

Pos logit contributions
aman+0.642
 hey+0.630
land+0.604
 exception+0.581
 extingu+0.573
Neg logit contributions
 ever-0.301
keeping-0.280
 onwards-0.280
 it-0.263
 fright-0.253
Token they
Token ID484
Feature activation+1.01e-01

Pos logit contributions
aman+1.503
 hey+1.458
land+1.428
 exception+1.348
 extingu+1.333
Neg logit contributions
 ever-0.973
 it-0.934
 onwards-0.925
keeping-0.892
 excited-0.861
Token found
Token ID1043
Feature activation+9.81e-01

Pos logit contributions
aman+0.165
 hey+0.162
land+0.153
 exception+0.147
 extingu+0.144
Neg logit contributions
 ever-0.112
keeping-0.108
 onwards-0.107
 it-0.100
 fright-0.100
Token some
Token ID617
Feature activation+4.16e-01

Pos logit contributions
aman+1.701
 hey+1.662
land+1.595
 exception+1.528
 extingu+1.475
Neg logit contributions
 ever-1.034
 it-0.985
 onwards-0.970
keeping-0.961
 excited-0.909
Token happy
Token ID3772
Feature activation+5.82e-01

Pos logit contributions
keeping+0.206
 ever+0.200
 onwards+0.191
 fright+0.174
 it+0.173
Neg logit contributions
 hey-0.463
aman-0.462
land-0.439
 exception-0.422
 whose-0.417
Token that
Token ID326
Feature activation-8.44e-01

Pos logit contributions
aman+1.051
 hey+1.027
land+0.977
 exception+0.945
 extingu+0.935
Neg logit contributions
 ever-0.574
keeping-0.525
 onwards-0.523
 it-0.513
 fright-0.477
Token the
Token ID262
Feature activation+4.62e-01

Pos logit contributions
keeping+1.090
 onwards+1.009
 ever+1.009
 fright+0.985
 sideways+0.970
Neg logit contributions
 hey-1.208
aman-1.204
land-1.108
 whose-1.067
 exception-1.063
Token fountain
Token ID29420
Feature activation+2.98e-01

Pos logit contributions
aman+0.538
 hey+0.519
land+0.485
 exception+0.455
 extingu+0.445
Neg logit contributions
 ever-0.740
 onwards-0.709
keeping-0.709
 it-0.691
 fright-0.676
Token was
Token ID373
Feature activation+5.22e-01

Pos logit contributions
aman+0.535
 hey+0.525
land+0.502
 exception+0.482
 extingu+0.475
Neg logit contributions
 ever-0.282
keeping-0.266
 onwards-0.265
 it-0.249
 fright-0.242
Token no
Token ID645
Feature activation+3.28e+00

Pos logit contributions
aman+0.947
 hey+0.930
land+0.888
 exception+0.853
 extingu+0.843
Neg logit contributions
 ever-0.503
 onwards-0.465
keeping-0.459
 it-0.445
 fright-0.424
Token longer
Token ID2392
Feature activation+5.01e-01

Pos logit contributions
aman+6.659
 Majesty+6.100
land+6.075
 hey+6.059
lves+5.890
Neg logit contributions
 ever-3.968
 it-3.483
 excited-3.093
 on-2.878
 fright-2.765
Token dry
Token ID5894
Feature activation+3.89e-02

Pos logit contributions
aman+0.927
 hey+0.910
land+0.867
 exception+0.835
 extingu+0.824
Neg logit contributions
 ever-0.460
keeping-0.426
 onwards-0.426
 it-0.404
 fright-0.387
Token.
Token ID13
Feature activation+2.34e-01

Pos logit contributions
aman+0.069
 hey+0.068
land+0.065
 exception+0.062
 extingu+0.062
Neg logit contributions
 ever-0.037
 onwards-0.035
keeping-0.035
 it-0.033
 fright-0.032
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
aman+0.434
 hey+0.426
land+0.408
 exception+0.392
 extingu+0.386
Neg logit contributions
 ever-0.211
keeping-0.197
 onwards-0.196
 it-0.183
 fright-0.179
Token
Token ID198
Feature activation+2.33e-01

Pos logit contributions
keeping+0.922
 ever+0.913
 onwards+0.827
 fright+0.773
 sideways+0.767
Neg logit contributions
aman-1.912
 hey-1.877
land-1.850
 whose-1.798
 exception-1.725
Token store
Token ID3650
Feature activation+1.29e-01

Pos logit contributions
aman+3.811
ist+3.674
 hey+3.662
land+3.586
 extingu+3.479
Neg logit contributions
 it-3.487
 excited-3.305
 ever-3.301
keeping-3.214
 angry-3.180
Token.
Token ID13
Feature activation+5.44e-01

Pos logit contributions
aman+0.229
 hey+0.225
land+0.216
 exception+0.207
 extingu+0.204
Neg logit contributions
 ever-0.125
 onwards-0.118
keeping-0.117
 fright-0.109
 it-0.109
Token
Token ID198
Feature activation-4.70e-01

Pos logit contributions
aman+1.009
 hey+0.994
land+0.955
 exception+0.916
 extingu+0.902
Neg logit contributions
 ever-0.488
 onwards-0.462
keeping-0.458
 it-0.437
 fright-0.413
Token
Token ID198
Feature activation-1.07e-01

Pos logit contributions
 ever+0.493
keeping+0.478
 onwards+0.448
 fright+0.429
 sideways+0.426
Neg logit contributions
aman-0.800
 hey-0.774
land-0.744
 whose-0.710
 exception-0.705
TokenAt
Token ID2953
Feature activation+7.14e-01

Pos logit contributions
 ever+0.167
keeping+0.164
 onwards+0.160
 fright+0.154
 it+0.154
Neg logit contributions
aman-0.127
 hey-0.123
land-0.114
 exception-0.106
 whose-0.105
Token the
Token ID262
Feature activation+3.27e+00

Pos logit contributions
aman+1.360
 hey+1.346
land+1.300
 exception+1.251
 extingu+1.231
Neg logit contributions
 ever-0.613
 onwards-0.568
keeping-0.557
 it-0.556
 excited-0.499
Token store
Token ID3650
Feature activation+4.37e-01

Pos logit contributions
ist+5.027
aman+4.985
der+4.929
 hey+4.889
land+4.851
Neg logit contributions
 it-4.098
 excited-4.038
 ever-4.028
 onwards-3.830
 on-3.801
Token,
Token ID11
Feature activation+3.35e-01

Pos logit contributions
aman+0.819
 hey+0.806
land+0.773
 exception+0.744
 extingu+0.735
Neg logit contributions
 ever-0.381
 onwards-0.355
keeping-0.353
 it-0.335
 fright-0.317
Token the
Token ID262
Feature activation+1.39e+00

Pos logit contributions
aman+0.597
 hey+0.586
land+0.560
 exception+0.538
 extingu+0.530
Neg logit contributions
 ever-0.326
keeping-0.306
 onwards-0.306
 it-0.289
 fright-0.280
Token little
Token ID1310
Feature activation+6.98e-01

Pos logit contributions
aman+1.634
 hey+1.492
land+1.452
 exception+1.343
 extingu+1.310
Neg logit contributions
 ever-2.282
 excited-2.198
 onwards-2.189
 it-2.189
keeping-2.163
Token girl
Token ID2576
Feature activation+8.28e-02

Pos logit contributions
aman+0.956
 hey+0.927
land+0.885
 exception+0.823
 extingu+0.805
Neg logit contributions
 ever-0.977
keeping-0.932
 onwards-0.927
 it-0.916
 excited-0.886
Token day
Token ID1110
Feature activation+9.58e-01

Pos logit contributions
aman+1.034
 hey+1.025
land+0.977
 exception+0.942
 extingu+0.935
Neg logit contributions
 ever-0.384
 onwards-0.350
keeping-0.341
 it-0.327
 excited-0.305
Token while
Token ID981
Feature activation-8.88e-01

Pos logit contributions
aman+1.913
 hey+1.853
land+1.797
 extingu+1.737
 exception+1.730
Neg logit contributions
 ever-0.786
 onwards-0.743
 it-0.733
 excited-0.653
keeping-0.644
Token she
Token ID673
Feature activation+6.80e-01

Pos logit contributions
keeping+1.161
 ever+1.112
 fright+1.075
unts+1.038
 onwards+1.033
Neg logit contributions
 hey-1.256
aman-1.247
 whose-1.121
land-1.118
 exception-1.100
Token was
Token ID373
Feature activation+1.14e+00

Pos logit contributions
aman+1.262
 hey+1.248
land+1.181
 exception+1.142
 extingu+1.127
Neg logit contributions
 ever-0.631
 onwards-0.573
keeping-0.570
 it-0.550
 excited-0.524
Token at
Token ID379
Feature activation+1.52e+00

Pos logit contributions
aman+2.141
 hey+2.089
land+1.985
 extingu+1.926
 exception+1.897
Neg logit contributions
 ever-1.056
 onwards-1.002
 it-0.960
 excited-0.928
keeping-0.915
Token the
Token ID262
Feature activation+3.23e+00

Pos logit contributions
 hey+3.158
aman+3.102
land+2.993
 exception+2.875
ist+2.835
Neg logit contributions
 ever-1.105
 it-1.089
keeping-1.027
 onwards-0.968
 excited-0.900
Token stable
Token ID8245
Feature activation+7.15e-01

Pos logit contributions
 hey+4.484
 extingu+4.443
ist+4.240
aman+4.208
 whose+3.974
Neg logit contributions
 it-4.831
 ever-4.703
 on-4.460
 excited-4.457
 hungry-4.369
Token it
Token ID340
Feature activation+9.02e-01

Pos logit contributions
aman+1.364
 hey+1.348
land+1.285
 exception+1.246
 extingu+1.237
Neg logit contributions
 ever-0.609
 onwards-0.563
keeping-0.552
 it-0.535
 excited-0.495
Token started
Token ID2067
Feature activation+4.48e-01

Pos logit contributions
aman+1.713
 hey+1.700
land+1.609
 extingu+1.564
 exception+1.562
Neg logit contributions
 ever-0.800
 onwards-0.734
 it-0.714
keeping-0.681
 excited-0.656
Token to
Token ID284
Feature activation+5.21e-01

Pos logit contributions
aman+0.878
 hey+0.865
land+0.826
 exception+0.798
 extingu+0.788
Neg logit contributions
 ever-0.356
 onwards-0.327
keeping-0.326
 it-0.306
 fright-0.291
Token rain
Token ID6290
Feature activation+1.80e+00

Pos logit contributions
aman+0.825
 hey+0.810
land+0.766
 exception+0.730
 extingu+0.719
Neg logit contributions
 ever-0.610
keeping-0.577
 onwards-0.575
 it-0.556
 fright-0.541
Token next
Token ID1306
Feature activation-1.66e-01

Pos logit contributions
keeping+1.127
 ever+1.093
 onwards+1.090
 it+1.009
 sideways+0.993
Neg logit contributions
aman-1.254
 hey-1.238
land-1.125
 exception-1.078
 extingu-1.068
Token day
Token ID1110
Feature activation+5.20e-01

Pos logit contributions
keeping+0.139
 ever+0.137
 onwards+0.134
 it+0.122
 fright+0.120
Neg logit contributions
aman-0.316
 hey-0.314
land-0.297
 exception-0.288
 extingu-0.281
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
aman+0.994
 hey+0.976
land+0.936
 exception+0.904
 extingu+0.891
Neg logit contributions
 ever-0.448
 onwards-0.418
keeping-0.404
 it-0.394
 fright-0.368
Token the
Token ID262
Feature activation-6.11e-01

Pos logit contributions
aman+1.876
 hey+1.762
land+1.732
 exception+1.635
 extingu+1.606
Neg logit contributions
 ever-1.844
 it-1.793
 onwards-1.737
 excited-1.651
 on-1.643
Token owner
Token ID4870
Feature activation-5.03e-01

Pos logit contributions
 ever+0.823
keeping+0.822
 onwards+0.801
 fright+0.746
 it+0.737
Neg logit contributions
aman-0.872
 hey-0.842
land-0.778
 exception-0.751
 extingu-0.730
Token saw
Token ID2497
Feature activation+3.22e+00

Pos logit contributions
 onwards+0.517
 ever+0.512
keeping+0.512
 fright+0.472
 it+0.466
Neg logit contributions
 hey-0.840
aman-0.835
land-0.779
 whose-0.746
 exception-0.744
Token some
Token ID617
Feature activation+4.30e-01

Pos logit contributions
aman+7.265
 hey+7.036
ist+7.026
 exception+6.953
land+6.853
Neg logit contributions
 it-3.126
 on-2.223
 ever-2.221
 excited-1.978
 hungry-1.920
Token more
Token ID517
Feature activation-1.18e-01

Pos logit contributions
aman+0.665
 hey+0.649
land+0.616
 exception+0.587
 extingu+0.579
Neg logit contributions
 ever-0.522
 onwards-0.494
keeping-0.492
 it-0.474
 fright-0.462
Token che
Token ID1125
Feature activation+1.23e+00

Pos logit contributions
 ever+0.144
 onwards+0.140
keeping+0.139
 fright+0.129
 it+0.129
Neg logit contributions
aman-0.180
 hey-0.175
land-0.169
 exception-0.161
 whose-0.157
Tokenwed
Token ID19103
Feature activation-1.09e-01

Pos logit contributions
aman+1.654
 hey+1.594
land+1.475
 extingu+1.410
 exception+1.400
Neg logit contributions
 ever-1.809
keeping-1.741
 onwards-1.692
 it-1.638
 fright-1.566
Token threads
Token ID14390
Feature activation+2.65e-01

Pos logit contributions
 ever+0.088
keeping+0.083
 onwards+0.082
 fright+0.076
 it+0.073
Neg logit contributions
aman-0.208
 hey-0.203
land-0.197
 exception-0.189
 extingu-0.188
Token shoes
Token ID10012
Feature activation-1.36e-01

Pos logit contributions
aman+0.941
 hey+0.925
land+0.866
 exception+0.822
 extingu+0.805
Neg logit contributions
 ever-0.798
 onwards-0.757
keeping-0.752
 it-0.731
 fright-0.711
Token.
Token ID13
Feature activation+5.17e-01

Pos logit contributions
 ever+0.128
keeping+0.125
 onwards+0.122
 fright+0.115
 it+0.114
Neg logit contributions
aman-0.240
 hey-0.233
land-0.228
 exception-0.218
 extingu-0.214
Token
Token ID198
Feature activation-3.99e-01

Pos logit contributions
aman+0.949
 hey+0.936
land+0.897
 exception+0.861
 extingu+0.848
Neg logit contributions
 ever-0.474
 onwards-0.448
keeping-0.443
 it-0.425
 fright-0.402
Token
Token ID198
Feature activation-1.76e-01

Pos logit contributions
 ever+0.427
keeping+0.413
 onwards+0.394
 fright+0.373
 sideways+0.369
Neg logit contributions
aman-0.669
 hey-0.645
land-0.619
 whose-0.585
 exception-0.584
TokenAt
Token ID2953
Feature activation+4.03e-01

Pos logit contributions
 ever+0.272
keeping+0.265
 onwards+0.259
 it+0.249
 fright+0.248
Neg logit contributions
aman-0.214
 hey-0.208
land-0.191
 exception-0.179
 whose-0.177
Token the
Token ID262
Feature activation+3.21e+00

Pos logit contributions
aman+0.735
 hey+0.722
land+0.692
 exception+0.664
 extingu+0.655
Neg logit contributions
 ever-0.379
 onwards-0.354
keeping-0.354
 it-0.336
 fright-0.323
Token taxi
Token ID17536
Feature activation+1.77e+00

Pos logit contributions
ist+5.718
 hey+5.449
ware+5.444
 extingu+5.418
der+5.405
Neg logit contributions
 ever-3.542
 excited-3.498
 it-3.456
 angry-3.295
 onwards-3.163
Token stop
Token ID2245
Feature activation+3.13e-01

Pos logit contributions
 hey+3.362
aman+3.326
 extingu+3.160
 exception+3.146
land+3.137
Neg logit contributions
 ever-1.543
 onwards-1.447
 it-1.419
keeping-1.341
 excited-1.326
Token,
Token ID11
Feature activation+1.10e-03

Pos logit contributions
aman+0.598
 hey+0.587
land+0.563
 exception+0.542
 extingu+0.535
Neg logit contributions
 ever-0.264
keeping-0.245
 onwards-0.245
 it-0.228
 fright-0.221
Token her
Token ID607
Feature activation+5.70e-01

Pos logit contributions
aman+0.002
 hey+0.002
land+0.002
 exception+0.002
 whose+0.002
Neg logit contributions
 ever-0.001
keeping-0.001
 onwards-0.001
 fright-0.001
unts-0.001
Token brother
Token ID3956
Feature activation+5.67e-01

Pos logit contributions
aman+0.948
 hey+0.930
land+0.884
 exception+0.842
 extingu+0.827
Neg logit contributions
 ever-0.632
 onwards-0.595
keeping-0.590
 it-0.568
 fright-0.553
Token store
Token ID3650
Feature activation+7.15e-01

Pos logit contributions
aman+2.293
 hey+2.242
land+2.060
 exception+1.993
lves+1.977
Neg logit contributions
 ever-2.552
 it-2.408
 onwards-2.406
keeping-2.387
 excited-2.334
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
aman+1.311
 hey+1.296
land+1.216
 exception+1.178
 extingu+1.169
Neg logit contributions
 ever-0.664
keeping-0.624
 onwards-0.619
 it-0.600
 excited-0.566
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
 hey+2.156
aman+2.149
land+2.052
 exception+1.965
 extingu+1.925
Neg logit contributions
 ever-0.881
 onwards-0.871
 it-0.854
keeping-0.817
 excited-0.788
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
aman+0.227
 hey+0.221
land+0.213
 exception+0.203
 whose+0.200
Neg logit contributions
 ever-0.123
keeping-0.117
 onwards-0.114
 it-0.107
 fright-0.106
TokenAt
Token ID2953
Feature activation+6.59e-01

Pos logit contributions
aman+1.414
 hey+1.408
land+1.314
 exception+1.246
 extingu+1.243
Neg logit contributions
 ever-1.343
 onwards-1.313
 it-1.265
keeping-1.251
 excited-1.224
Token the
Token ID262
Feature activation+3.21e+00

Pos logit contributions
aman+1.245
 hey+1.231
land+1.185
 exception+1.139
 extingu+1.121
Neg logit contributions
 ever-0.574
 onwards-0.534
keeping-0.526
 it-0.519
 fright-0.474
Token store
Token ID3650
Feature activation+7.12e-01

Pos logit contributions
 hey+5.038
ist+5.020
der+4.976
lves+4.868
aman+4.816
Neg logit contributions
 it-3.982
 ever-3.969
 on-3.761
 excited-3.750
 onwards-3.717
Token,
Token ID11
Feature activation+1.09e+00

Pos logit contributions
aman+1.364
 hey+1.348
land+1.290
 exception+1.247
 extingu+1.239
Neg logit contributions
 ever-0.595
 onwards-0.556
keeping-0.540
 it-0.535
 excited-0.497
Token they
Token ID484
Feature activation+3.37e-01

Pos logit contributions
aman+1.939
 hey+1.925
land+1.853
 exception+1.769
 extingu+1.737
Neg logit contributions
 ever-1.058
 it-1.029
 onwards-1.013
keeping-0.951
 excited-0.938
Token picked
Token ID6497
Feature activation+2.80e-01

Pos logit contributions
aman+0.560
 hey+0.549
land+0.523
 exception+0.501
 extingu+0.493
Neg logit contributions
 ever-0.367
keeping-0.347
 onwards-0.347
 it-0.330
 fright-0.321
Token out
Token ID503
Feature activation+5.81e-01

Pos logit contributions
aman+0.438
 hey+0.428
land+0.407
 exception+0.388
 extingu+0.382
Neg logit contributions
 ever-0.332
keeping-0.315
 onwards-0.314
 it-0.298
 fright-0.293
Token day
Token ID1110
Feature activation+9.72e-01

Pos logit contributions
aman+0.585
 hey+0.576
land+0.553
 exception+0.534
 extingu+0.527
Neg logit contributions
 ever-0.223
keeping-0.208
 onwards-0.207
 it-0.191
 fright-0.184
Token,
Token ID11
Feature activation+1.30e+00

Pos logit contributions
aman+1.987
 hey+1.931
land+1.888
 exception+1.813
 extingu+1.798
Neg logit contributions
 ever-0.749
 onwards-0.689
 it-0.683
 excited-0.601
keeping-0.594
Token Lily
Token ID20037
Feature activation-1.51e-01

Pos logit contributions
aman+2.347
land+2.252
 hey+2.241
 extingu+2.097
 exception+2.089
Neg logit contributions
 it-1.366
 ever-1.354
 onwards-1.241
 excited-1.215
 on-1.184
Token was
Token ID373
Feature activation+1.65e+00

Pos logit contributions
 ever+0.154
 onwards+0.151
keeping+0.148
 fright+0.139
 it+0.136
Neg logit contributions
aman-0.255
 hey-0.249
land-0.239
 exception-0.229
 whose-0.227
Token at
Token ID379
Feature activation+1.26e+00

Pos logit contributions
aman+3.460
land+3.207
 hey+3.142
 extingu+3.107
 exception+2.939
Neg logit contributions
 ever-1.438
 excited-1.300
 it-1.274
 onwards-1.271
 on-1.252
Token the
Token ID262
Feature activation+3.20e+00

Pos logit contributions
aman+2.443
 hey+2.372
land+2.327
 exception+2.192
 extingu+2.164
Neg logit contributions
 ever-1.119
 it-1.017
keeping-1.015
 onwards-0.949
 excited-0.913
Token park
Token ID3952
Feature activation-5.40e-02

Pos logit contributions
aman+4.993
 extingu+4.772
 hey+4.660
lves+4.526
ist+4.475
Neg logit contributions
 ever-4.488
 it-4.229
 on-4.063
 excited-3.872
 hungry-3.865
Token with
Token ID351
Feature activation+4.34e-01

Pos logit contributions
 ever+0.066
keeping+0.065
 onwards+0.064
 fright+0.060
 it+0.060
Neg logit contributions
aman-0.080
 hey-0.079
land-0.075
 exception-0.072
 whose-0.071
Token her
Token ID607
Feature activation+1.65e+00

Pos logit contributions
aman+0.725
 hey+0.710
land+0.678
 exception+0.647
 extingu+0.637
Neg logit contributions
 ever-0.479
 onwards-0.451
keeping-0.450
 it-0.432
 fright-0.417
Token mom
Token ID1995
Feature activation+1.78e-01

Pos logit contributions
aman+3.280
 hey+3.166
land+3.071
 exception+2.921
 extingu+2.850
Neg logit contributions
 ever-1.560
 it-1.371
 onwards-1.316
 excited-1.310
 on-1.301
Token.
Token ID13
Feature activation+8.60e-02

Pos logit contributions
aman+0.309
 hey+0.303
land+0.290
 exception+0.278
 extingu+0.274
Neg logit contributions
 ever-0.180
 onwards-0.171
keeping-0.170
 it-0.160
 fright-0.158
Token big
Token ID1263
Feature activation-6.87e-01

Pos logit contributions
aman+0.386
 hey+0.375
land+0.348
 exception+0.325
 extingu+0.317
Neg logit contributions
 ever-0.565
keeping-0.544
 onwards-0.544
 it-0.526
 fright-0.517
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
keeping+0.997
 ever+0.965
 onwards+0.965
 fright+0.877
 it+0.852
Neg logit contributions
aman-0.927
 hey-0.905
land-0.821
 exception-0.797
 extingu-0.793
Token brown
Token ID7586
Feature activation+2.92e-01

Pos logit contributions
keeping+2.666
 ever+2.382
 onwards+2.261
 fright+2.157
 Go+2.058
Neg logit contributions
 hey-2.380
aman-2.351
 whose-2.211
 extingu-2.197
 exception-2.185
Token bear
Token ID6842
Feature activation+8.89e-01

Pos logit contributions
aman+0.473
 hey+0.463
land+0.441
 exception+0.421
 extingu+0.415
Neg logit contributions
 ever-0.330
 onwards-0.314
keeping-0.314
 it-0.297
 fright-0.290
Token named
Token ID3706
Feature activation+6.33e-01

Pos logit contributions
aman+1.622
 hey+1.567
land+1.497
 exception+1.439
 extingu+1.404
Neg logit contributions
 ever-0.896
keeping-0.804
 it-0.787
 onwards-0.782
 excited-0.736
Token Bub
Token ID19645
Feature activation+3.18e+00

Pos logit contributions
aman+1.032
 hey+1.009
land+0.962
 exception+0.915
 extingu+0.901
Neg logit contributions
 ever-0.725
 onwards-0.681
keeping-0.679
 it-0.675
 fright-0.636
Tokenba
Token ID7012
Feature activation+9.19e-01

Pos logit contributions
aman+4.994
land+4.450
 hey+4.449
 extingu+4.170
 exception+4.068
Neg logit contributions
 ever-4.465
 on-3.930
 it-3.843
 excited-3.798
keeping-3.783
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
aman+1.771
 hey+1.716
land+1.615
 exception+1.575
 extingu+1.546
Neg logit contributions
 ever-0.851
keeping-0.755
 onwards-0.729
 it-0.717
 excited-0.682
Token Bub
Token ID19645
Feature activation+1.91e+00

Pos logit contributions
aman+0.195
 hey+0.191
land+0.176
 exception+0.165
 extingu+0.160
Neg logit contributions
 ever-0.309
keeping-0.300
 onwards-0.298
 it-0.287
 fright-0.285
Tokenba
Token ID7012
Feature activation-3.71e-01

Pos logit contributions
aman+2.432
 hey+2.305
 extingu+2.071
land+2.045
 exception+1.956
Neg logit contributions
 ever-3.083
 onwards-2.787
keeping-2.768
 it-2.768
 excited-2.741
Token loved
Token ID6151
Feature activation-5.32e-01

Pos logit contributions
 onwards+0.490
keeping+0.487
 ever+0.483
 fright+0.464
 it+0.449
Neg logit contributions
aman-0.496
 hey-0.496
land-0.478
 whose-0.461
 exception-0.448
Token He
Token ID679
Feature activation+1.33e+00

Pos logit contributions
aman+0.799
 hey+0.785
land+0.754
 exception+0.728
 extingu+0.718
Neg logit contributions
 ever-0.287
 onwards-0.264
keeping-0.263
 it-0.245
 fright-0.233
Token gives
Token ID3607
Feature activation+6.84e-01

Pos logit contributions
aman+2.290
 hey+2.245
land+2.133
 exception+2.040
 extingu+1.987
Neg logit contributions
 ever-1.420
 it-1.342
 onwards-1.295
 excited-1.268
 fright-1.260
Token them
Token ID606
Feature activation+9.75e-01

Pos logit contributions
aman+1.126
 hey+1.095
land+1.046
 exception+1.001
 extingu+0.991
Neg logit contributions
 ever-0.778
 onwards-0.728
 it-0.727
keeping-0.726
 fright-0.677
Token to
Token ID284
Feature activation+1.93e+00

Pos logit contributions
aman+1.815
 hey+1.751
land+1.680
 exception+1.618
 extingu+1.568
Neg logit contributions
 ever-0.931
 it-0.850
 onwards-0.832
keeping-0.831
 fright-0.781
Token Anna
Token ID11735
Feature activation-1.20e-01

Pos logit contributions
aman+3.620
land+3.368
 hey+3.335
 exception+3.275
lves+3.264
Neg logit contributions
 it-2.037
 ever-1.981
 fright-1.711
 onwards-1.651
 excited-1.644
Token and
Token ID290
Feature activation+3.18e+00

Pos logit contributions
 ever+0.102
 onwards+0.095
keeping+0.095
 fright+0.086
 it+0.084
Neg logit contributions
aman-0.227
 hey-0.222
land-0.217
 whose-0.207
 exception-0.206
Token Ben
Token ID3932
Feature activation+3.29e-01

Pos logit contributions
aman+7.237
land+6.857
lves+6.855
 hey+6.757
 exception+6.653
Neg logit contributions
 it-2.766
 yawn-2.240
 excited-2.096
 ever-2.052
 on-2.009
Token.
Token ID13
Feature activation+7.40e-01

Pos logit contributions
aman+0.581
 hey+0.571
land+0.545
 exception+0.523
 extingu+0.515
Neg logit contributions
 ever-0.324
keeping-0.304
 onwards-0.304
 it-0.287
 fright-0.279
Token He
Token ID679
Feature activation+1.56e+00

Pos logit contributions
aman+1.459
 hey+1.428
land+1.359
 exception+1.320
 extingu+1.295
Neg logit contributions
 ever-0.594
 onwards-0.570
 it-0.548
keeping-0.543
 excited-0.509
Token says
Token ID1139
Feature activation-7.52e-01

Pos logit contributions
aman+2.806
 hey+2.741
land+2.618
 exception+2.501
lves+2.421
Neg logit contributions
 ever-1.569
 it-1.497
 excited-1.407
 onwards-1.402
 fright-1.372
Token sorry
Token ID7926
Feature activation-1.42e-01

Pos logit contributions
keeping+0.866
 ever+0.810
 onwards+0.759
 fright+0.748
 sideways+0.715
Neg logit contributions
 hey-1.226
aman-1.188
land-1.150
 whose-1.104
 exception-1.060
Token take
Token ID1011
Feature activation-8.31e-01

Pos logit contributions
aman+0.933
 hey+0.915
land+0.880
 exception+0.844
 extingu+0.830
Neg logit contributions
 ever-0.455
keeping-0.424
 onwards-0.418
 it-0.408
 fright-0.390
Token a
Token ID257
Feature activation+5.58e-01

Pos logit contributions
 ever+1.085
 onwards+1.051
keeping+1.050
 sideways+1.008
 fright+0.990
Neg logit contributions
aman-1.145
 hey-1.140
land-1.081
 exception-1.025
 extingu-1.021
Token look
Token ID804
Feature activation+7.20e-01

Pos logit contributions
aman+0.860
 hey+0.841
land+0.794
 exception+0.757
 extingu+0.744
Neg logit contributions
 ever-0.678
 onwards-0.640
keeping-0.635
 it-0.622
 fright-0.607
Token.
Token ID13
Feature activation+6.13e-01

Pos logit contributions
aman+1.388
 hey+1.367
land+1.283
 exception+1.240
 extingu+1.232
Neg logit contributions
 ever-0.620
 onwards-0.580
keeping-0.557
 it-0.549
 excited-0.515
Token At
Token ID1629
Feature activation+3.66e-01

Pos logit contributions
aman+1.145
 hey+1.129
land+1.075
 exception+1.037
 extingu+1.026
Neg logit contributions
 ever-0.540
 onwards-0.509
keeping-0.501
 it-0.489
 excited-0.462
Token the
Token ID262
Feature activation+3.17e+00

Pos logit contributions
aman+0.678
 hey+0.667
land+0.639
 exception+0.614
 extingu+0.605
Neg logit contributions
 ever-0.331
keeping-0.309
 onwards-0.308
 it-0.291
 fright-0.280
Token house
Token ID2156
Feature activation+7.92e-01

Pos logit contributions
ist+5.027
der+4.898
 hey+4.847
land+4.835
aman+4.766
Neg logit contributions
 it-3.820
 ever-3.771
 hungry-3.649
 on-3.644
 excited-3.616
Token,
Token ID11
Feature activation+2.28e-01

Pos logit contributions
aman+1.541
 hey+1.515
land+1.460
 exception+1.413
 extingu+1.404
Neg logit contributions
 ever-0.654
 onwards-0.595
 it-0.584
keeping-0.575
 excited-0.532
Token they
Token ID484
Feature activation-1.27e-01

Pos logit contributions
aman+0.391
 hey+0.385
land+0.366
 exception+0.351
 extingu+0.345
Neg logit contributions
 ever-0.235
keeping-0.224
 onwards-0.222
 it-0.207
 fright-0.205
Token saw
Token ID2497
Feature activation+5.77e-01

Pos logit contributions
 ever+0.151
keeping+0.149
 onwards+0.147
 fright+0.137
 it+0.135
Neg logit contributions
aman-0.195
 hey-0.193
land-0.180
 exception-0.172
 whose-0.171
Token a
Token ID257
Feature activation-1.46e-01

Pos logit contributions
aman+0.930
 hey+0.908
land+0.864
 exception+0.828
 extingu+0.809
Neg logit contributions
 ever-0.667
keeping-0.632
 onwards-0.626
 it-0.617
 fright-0.587
TokenThe
Token ID464
Feature activation-1.69e-01

Pos logit contributions
aman+0.808
 hey+0.777
land+0.742
 exception+0.711
 extingu+0.688
Neg logit contributions
 ever-0.746
 onwards-0.712
keeping-0.707
 it-0.691
 fright-0.669
Token man
Token ID582
Feature activation-1.49e-01

Pos logit contributions
 ever+0.182
keeping+0.176
 onwards+0.172
 it+0.157
 fright+0.156
Neg logit contributions
aman-0.291
 hey-0.289
land-0.272
 exception-0.261
 extingu-0.257
Token sighed
Token ID21893
Feature activation-8.84e-01

Pos logit contributions
 ever+0.142
keeping+0.141
 onwards+0.136
 it+0.123
 fright+0.121
Neg logit contributions
 hey-0.262
aman-0.262
land-0.247
 whose-0.238
 extingu-0.236
Token.
Token ID13
Feature activation+4.71e-01

Pos logit contributions
keeping+0.998
 ever+0.993
 onwards+0.928
 fright+0.857
unts+0.833
Neg logit contributions
aman-1.400
 hey-1.390
land-1.367
der-1.298
 whose-1.294
Token He
Token ID679
Feature activation+5.23e-01

Pos logit contributions
aman+0.976
 hey+0.960
land+0.922
 exception+0.891
 extingu+0.878
Neg logit contributions
 ever-0.325
 onwards-0.301
keeping-0.296
 it-0.279
 fright-0.262
Token saw
Token ID2497
Feature activation+3.17e+00

Pos logit contributions
aman+1.007
 hey+0.990
land+0.949
 exception+0.917
 extingu+0.900
Neg logit contributions
 ever-0.440
 onwards-0.402
keeping-0.396
 it-0.385
 fright-0.366
Token their
Token ID511
Feature activation-1.09e+00

Pos logit contributions
aman+7.355
land+6.824
 lookout+6.815
 hey+6.737
 exception+6.695
Neg logit contributions
 it-3.055
 on-2.254
 ever-2.114
 excited-1.980
 angry-1.960
Token sad
Token ID6507
Feature activation-7.48e-01

Pos logit contributions
 onwards+1.431
 ever+1.364
keeping+1.315
 it+1.230
 sideways+1.227
Neg logit contributions
aman-1.635
 hey-1.607
land-1.511
 extingu-1.449
 whose-1.425
Token faces
Token ID6698
Feature activation-1.71e-01

Pos logit contributions
 ever+0.881
 onwards+0.870
keeping+0.813
 fright+0.767
 it+0.766
Neg logit contributions
aman-1.179
 hey-1.159
land-1.131
 extingu-1.058
 exception-1.050
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
 ever+0.129
keeping+0.128
 onwards+0.119
 fright+0.108
 it+0.106
Neg logit contributions
aman-0.332
 hey-0.329
land-0.318
 exception-0.304
 extingu-0.303
Token He
Token ID679
Feature activation-4.53e-01

Pos logit contributions
aman+0.462
 hey+0.454
land+0.437
 exception+0.421
 extingu+0.417
Neg logit contributions
 ever-0.171
keeping-0.157
 onwards-0.156
 it-0.143
 fright-0.139
Token store
Token ID3650
Feature activation-2.67e-01

Pos logit contributions
aman+2.730
 hey+2.649
land+2.609
ist+2.478
 exception+2.385
Neg logit contributions
 ever-3.207
 it-3.104
keeping-3.003
 fright-2.960
 hungry-2.931
Token.
Token ID13
Feature activation+1.70e+00

Pos logit contributions
 ever+0.251
 onwards+0.242
keeping+0.234
 fright+0.229
 it+0.219
Neg logit contributions
aman-0.473
 hey-0.464
land-0.452
 exception-0.433
 whose-0.427
Token\n
Token ID198
Feature activation-1.19e-01

Pos logit contributions
 hey+3.683
aman+3.660
land+3.524
 exception+3.410
 extingu+3.299
Neg logit contributions
 it-1.106
 onwards-1.010
 excited-0.982
 ever-0.954
keeping-0.930
Token\n
Token ID198
Feature activation+1.28e+00

Pos logit contributions
 ever+0.121
keeping+0.116
 onwards+0.113
 fright+0.105
 sideways+0.103
Neg logit contributions
aman-0.205
 hey-0.199
land-0.192
 exception-0.183
 whose-0.181
TokenAt
Token ID2953
Feature activation+2.26e-01

Pos logit contributions
aman+1.815
 hey+1.793
land+1.673
 exception+1.593
 extingu+1.586
Neg logit contributions
 ever-1.725
 onwards-1.673
 it-1.649
keeping-1.621
 excited-1.593
Token the
Token ID262
Feature activation+3.16e+00

Pos logit contributions
aman+0.412
 hey+0.403
land+0.385
 exception+0.370
 extingu+0.366
Neg logit contributions
 ever-0.212
keeping-0.200
 onwards-0.199
 it-0.185
 fright-0.182
Token store
Token ID3650
Feature activation+4.24e-01

Pos logit contributions
ist+5.109
der+4.918
 hey+4.865
land+4.812
aman+4.764
Neg logit contributions
 it-4.067
 excited-3.837
 ever-3.719
 onwards-3.711
 on-3.649
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
aman+0.801
 hey+0.788
land+0.756
 exception+0.728
 extingu+0.718
Neg logit contributions
 ever-0.367
 onwards-0.341
keeping-0.340
 it-0.322
 fright-0.305
Token the
Token ID262
Feature activation+9.42e-01

Pos logit contributions
aman+2.130
 hey+2.056
land+2.028
 exception+1.927
 extingu+1.865
Neg logit contributions
 ever-1.001
 it-0.991
 onwards-0.935
 excited-0.885
keeping-0.868
Token mummy
Token ID38567
Feature activation-3.10e-02

Pos logit contributions
aman+1.361
 hey+1.326
land+1.282
 exception+1.185
 whose+1.156
Neg logit contributions
 ever-1.254
 onwards-1.186
keeping-1.181
 it-1.161
 excited-1.144
Token bear
Token ID6842
Feature activation+4.55e-01

Pos logit contributions
 ever+0.022
keeping+0.020
 onwards+0.020
 fright+0.018
 it+0.018
Neg logit contributions
aman-0.064
 hey-0.062
land-0.060
 exception-0.058
 extingu-0.057
Token you
Token ID345
Feature activation-3.38e-02

Pos logit contributions
aman+0.621
 hey+0.609
land+0.581
 exception+0.558
 extingu+0.549
Neg logit contributions
 ever-0.364
keeping-0.342
 onwards-0.341
 it-0.324
 fright-0.314
Token."
Token ID526
Feature activation-3.81e-01

Pos logit contributions
 ever+0.033
keeping+0.032
 onwards+0.032
 it+0.030
 fright+0.029
Neg logit contributions
aman-0.058
 hey-0.058
land-0.056
 exception-0.053
 whose-0.052
Token\n
Token ID198
Feature activation-6.91e-03

Pos logit contributions
 ever+0.411
keeping+0.401
 onwards+0.381
 fright+0.359
 sideways+0.345
Neg logit contributions
aman-0.636
 hey-0.625
land-0.591
 whose-0.564
 exception-0.555
Token\n
Token ID198
Feature activation+6.77e-01

Pos logit contributions
 ever+0.007
keeping+0.007
 onwards+0.006
 fright+0.006
 it+0.006
Neg logit contributions
aman-0.012
 hey-0.012
land-0.011
 whose-0.011
 exception-0.011
TokenBen
Token ID11696
Feature activation-3.39e-01

Pos logit contributions
aman+1.017
 hey+0.987
land+0.949
 exception+0.911
 extingu+0.891
Neg logit contributions
 ever-0.860
 onwards-0.820
keeping-0.809
 it-0.798
 fright-0.765
Token saw
Token ID2497
Feature activation+3.15e+00

Pos logit contributions
 ever+0.323
keeping+0.319
 onwards+0.314
 it+0.274
 sideways+0.272
Neg logit contributions
 hey-0.601
aman-0.600
land-0.563
 whose-0.544
 extingu-0.537
Token that
Token ID326
Feature activation+2.87e-01

Pos logit contributions
aman+7.141
land+6.718
 hey+6.579
ist+6.558
 lookout+6.431
Neg logit contributions
 it-3.094
 angry-2.192
 excited-2.114
 ever-2.044
 hungry-2.038
Token Sara
Token ID24799
Feature activation-5.55e-01

Pos logit contributions
aman+0.459
 hey+0.450
land+0.427
 exception+0.408
 extingu+0.401
Neg logit contributions
 ever-0.336
keeping-0.320
 onwards-0.319
 it-0.303
 fright-0.297
Token was
Token ID373
Feature activation+2.08e-01

Pos logit contributions
keeping+0.597
 ever+0.585
 onwards+0.575
 fright+0.527
 sideways+0.514
Neg logit contributions
 hey-0.902
aman-0.890
land-0.851
 whose-0.828
 extingu-0.796
Token crying
Token ID13774
Feature activation-2.74e-03

Pos logit contributions
aman+0.423
 hey+0.418
land+0.400
 exception+0.388
 extingu+0.383
Neg logit contributions
 ever-0.148
keeping-0.138
 onwards-0.136
 it-0.124
 fright-0.119
Token and
Token ID290
Feature activation-3.26e-01

Pos logit contributions
 ever+0.002
keeping+0.002
 onwards+0.002
 it+0.002
 fright+0.002
Neg logit contributions
aman-0.005
 hey-0.005
land-0.005
 exception-0.005
 whose-0.005
Token got
Token ID1392
Feature activation-1.44e+00

Pos logit contributions
aman+0.322
 hey+0.316
land+0.302
 exception+0.289
 extingu+0.284
Neg logit contributions
 ever-0.179
keeping-0.172
 onwards-0.169
 it-0.158
 fright-0.154
Token his
Token ID465
Feature activation+4.54e-01

Pos logit contributions
keeping+1.569
 ever+1.456
unts+1.414
 onwards+1.403
 Go+1.368
Neg logit contributions
 hey-2.405
aman-2.310
 exception-2.212
land-2.088
 extingu-2.081
Token results
Token ID2482
Feature activation+5.64e-01

Pos logit contributions
aman+0.619
 hey+0.602
land+0.567
 exception+0.535
 extingu+0.527
Neg logit contributions
 ever-0.639
 onwards-0.610
keeping-0.607
 it-0.587
 fright-0.576
Token,
Token ID11
Feature activation+8.73e-01

Pos logit contributions
aman+1.081
 hey+1.060
land+1.016
 exception+0.985
 extingu+0.971
Neg logit contributions
 ever-0.483
 onwards-0.445
keeping-0.433
 it-0.421
 fright-0.396
Token he
Token ID339
Feature activation-4.14e-01

Pos logit contributions
aman+1.404
 hey+1.349
land+1.298
 exception+1.231
 extingu+1.214
Neg logit contributions
 ever-1.060
 it-1.003
 onwards-1.001
keeping-0.956
 excited-0.928
Token saw
Token ID2497
Feature activation+3.13e+00

Pos logit contributions
keeping+0.448
 ever+0.432
 onwards+0.430
 fright+0.392
 it+0.383
Neg logit contributions
aman-0.686
 hey-0.680
land-0.637
 whose-0.612
 exception-0.608
Token that
Token ID326
Feature activation+2.12e-01

Pos logit contributions
aman+6.754
 hey+6.183
land+6.135
 exception+5.958
ist+5.951
Neg logit contributions
 it-3.600
 ever-2.921
 on-2.758
 excited-2.641
 angry-2.444
Token he
Token ID339
Feature activation-6.55e-01

Pos logit contributions
aman+0.367
 hey+0.361
land+0.345
 exception+0.332
 extingu+0.326
Neg logit contributions
 ever-0.214
keeping-0.204
 onwards-0.202
 it-0.188
 fright-0.186
Token had
Token ID550
Feature activation+4.96e-01

Pos logit contributions
keeping+0.678
 onwards+0.644
 ever+0.616
 fright+0.568
 sideways+0.553
Neg logit contributions
 hey-1.109
aman-1.102
land-1.059
 whose-1.025
 exception-1.015
Token failed
Token ID4054
Feature activation-2.28e-01

Pos logit contributions
aman+0.851
 hey+0.834
land+0.793
 exception+0.761
 extingu+0.752
Neg logit contributions
 ever-0.526
keeping-0.491
 onwards-0.489
 it-0.471
 fright-0.454
Token.
Token ID13
Feature activation+4.28e-01

Pos logit contributions
 ever+0.215
keeping+0.208
 onwards+0.206
 fright+0.189
 it+0.185
Neg logit contributions
aman-0.402
 hey-0.392
land-0.382
 exception-0.364
 whose-0.361
TokenTheir
Token ID14574
Feature activation-7.41e-01

Pos logit contributions
 ever+0.022
keeping+0.021
 onwards+0.021
 it+0.019
 fright+0.019
Neg logit contributions
aman-0.047
 hey-0.046
land-0.043
 whose-0.042
 extingu-0.042
Token mom
Token ID1995
Feature activation+1.73e-01

Pos logit contributions
 onwards+0.772
 ever+0.769
keeping+0.768
 it+0.685
 sideways+0.684
Neg logit contributions
 hey-1.254
aman-1.238
land-1.147
 extingu-1.120
 exception-1.116
Token sighed
Token ID21893
Feature activation-9.36e-01

Pos logit contributions
aman+0.325
 hey+0.321
land+0.306
 exception+0.293
 extingu+0.291
Neg logit contributions
 ever-0.150
keeping-0.143
 onwards-0.141
 it-0.129
 fright-0.126
Token.
Token ID13
Feature activation+1.13e-01

Pos logit contributions
keeping+1.116
 ever+1.110
 onwards+1.038
 fright+0.977
 sideways+0.949
Neg logit contributions
aman-1.427
 hey-1.422
land-1.379
der-1.307
 whose-1.305
Token She
Token ID1375
Feature activation+6.96e-01

Pos logit contributions
aman+0.204
 hey+0.200
land+0.191
 exception+0.183
 extingu+0.181
Neg logit contributions
 ever-0.108
keeping-0.102
 onwards-0.099
 it-0.093
 fright-0.092
Token saw
Token ID2497
Feature activation+3.12e+00

Pos logit contributions
aman+1.321
 hey+1.292
land+1.247
 exception+1.208
 extingu+1.177
Neg logit contributions
 ever-0.610
 onwards-0.553
 it-0.542
keeping-0.541
 excited-0.511
Token the
Token ID262
Feature activation-4.09e-01

Pos logit contributions
aman+6.879
 lookout+6.746
 exception+6.651
 hey+6.650
land+6.547
Neg logit contributions
 it-2.904
 on-2.129
 ever-1.926
 angry-1.909
 excited-1.857
Token cut
Token ID2005
Feature activation+2.61e-01

Pos logit contributions
 ever+0.687
keeping+0.662
 onwards+0.657
 sideways+0.615
 fright+0.613
Neg logit contributions
aman-0.469
 hey-0.433
land-0.415
 exception-0.389
 extingu-0.386
Token on
Token ID319
Feature activation+7.20e-01

Pos logit contributions
aman+0.469
 hey+0.460
land+0.441
 exception+0.423
 extingu+0.418
Neg logit contributions
 ever-0.248
keeping-0.233
 onwards-0.232
 it-0.218
 fright-0.213
Token the
Token ID262
Feature activation-3.10e-01

Pos logit contributions
aman+1.341
 hey+1.331
land+1.270
 exception+1.232
lves+1.194
Neg logit contributions
 ever-0.635
keeping-0.593
 it-0.593
 onwards-0.590
 fright-0.545
Token sheep
Token ID15900
Feature activation-4.60e-01

Pos logit contributions
 ever+0.491
 onwards+0.469
keeping+0.466
 it+0.440
 fright+0.437
Neg logit contributions
aman-0.376
 hey-0.356
land-0.342
 extingu-0.317
 exception-0.311
Token her
Token ID607
Feature activation-1.12e+00

Pos logit contributions
 ever+0.274
 onwards+0.268
keeping+0.265
 sideways+0.245
 fright+0.242
Neg logit contributions
aman-0.348
 hey-0.340
land-0.321
 exception-0.307
 extingu-0.296
Token they
Token ID484
Feature activation-2.68e+00

Pos logit contributions
keeping+1.376
 ever+1.364
 onwards+1.359
 sideways+1.325
 fright+1.293
Neg logit contributions
 hey-1.606
aman-1.597
land-1.571
 whose-1.474
 exception-1.442
Token were
Token ID547
Feature activation-9.11e-01

Pos logit contributions
keeping+3.974
 onwards+3.509
 sideways+3.279
imp+3.272
 fright+3.223
Neg logit contributions
 hey-4.184
 whose-3.419
 exception-3.386
 who-3.362
aman-3.322
Token going
Token ID1016
Feature activation-8.64e-01

Pos logit contributions
keeping+1.154
 ever+1.115
 onwards+1.070
 fright+1.066
 sideways+1.049
Neg logit contributions
 hey-1.361
aman-1.329
 exception-1.233
land-1.207
 whose-1.173
Token to
Token ID284
Feature activation+2.96e-01

Pos logit contributions
 ever+0.764
keeping+0.738
 onwards+0.719
 fright+0.712
unts+0.673
Neg logit contributions
 hey-1.538
aman-1.525
land-1.500
 exception-1.457
 extingu-1.431
Token the
Token ID262
Feature activation+3.11e+00

Pos logit contributions
aman+0.511
 hey+0.501
land+0.478
 exception+0.458
 extingu+0.452
Neg logit contributions
 ever-0.305
 onwards-0.288
keeping-0.288
 it-0.271
 fright-0.264
Token park
Token ID3952
Feature activation-7.11e-01

Pos logit contributions
aman+4.635
land+4.489
 hey+4.328
ist+4.280
der+4.247
Neg logit contributions
 ever-4.165
 it-4.090
 on-3.877
 excited-3.776
 sideways-3.738
Token.
Token ID13
Feature activation+6.96e-01

Pos logit contributions
 onwards+0.725
keeping+0.722
 ever+0.722
 fright+0.687
unts+0.664
Neg logit contributions
aman-1.179
land-1.159
 hey-1.146
 exception-1.093
 whose-1.087
Token Lily
Token ID20037
Feature activation+1.34e-01

Pos logit contributions
aman+1.180
 hey+1.164
land+1.107
 exception+1.062
 extingu+1.048
Neg logit contributions
 ever-0.728
 onwards-0.709
keeping-0.695
 it-0.691
 excited-0.653
Token was
Token ID373
Feature activation-9.19e-01

Pos logit contributions
aman+0.239
 hey+0.234
land+0.223
 exception+0.214
 whose+0.212
Neg logit contributions
 ever-0.130
keeping-0.124
 onwards-0.123
 it-0.113
 fright-0.112
Token eager
Token ID11069
Feature activation-7.78e-01

Pos logit contributions
keeping+0.752
 ever+0.723
 onwards+0.664
 sideways+0.616
 fright+0.610
Neg logit contributions
 hey-1.748
aman-1.738
 exception-1.650
 whose-1.633
land-1.620
Token all
Token ID477
Feature activation+2.56e-01

Pos logit contributions
 ever+0.354
keeping+0.343
 onwards+0.331
 fright+0.314
 it+0.303
Neg logit contributions
 hey-0.463
aman-0.456
land-0.440
 whose-0.417
 exception-0.416
Token day
Token ID1110
Feature activation-1.01e+00

Pos logit contributions
aman+0.509
 hey+0.500
land+0.481
 exception+0.464
 extingu+0.457
Neg logit contributions
 ever-0.195
keeping-0.182
 onwards-0.181
 it-0.166
 fright-0.162
Token long
Token ID890
Feature activation-2.83e-01

Pos logit contributions
keeping+0.958
 ever+0.912
 fright+0.888
 onwards+0.877
 sideways+0.829
Neg logit contributions
aman-1.708
 hey-1.680
land-1.665
 exception-1.565
 whose-1.556
Token.
Token ID13
Feature activation+7.56e-01

Pos logit contributions
 ever+0.292
keeping+0.283
 onwards+0.281
 fright+0.265
 it+0.256
Neg logit contributions
aman-0.467
 hey-0.460
land-0.448
 whose-0.421
 exception-0.417
Token At
Token ID1629
Feature activation+1.99e-01

Pos logit contributions
aman+1.231
 hey+1.201
land+1.146
 exception+1.091
 extingu+1.084
Neg logit contributions
 ever-0.866
 onwards-0.834
keeping-0.808
 it-0.807
 excited-0.766
Token the
Token ID262
Feature activation+3.09e+00

Pos logit contributions
aman+0.362
 hey+0.355
land+0.339
 exception+0.326
 extingu+0.322
Neg logit contributions
 ever-0.186
keeping-0.176
 onwards-0.175
 it-0.162
 fright-0.160
Token end
Token ID886
Feature activation-3.09e-01

Pos logit contributions
 hey+5.789
 extingu+5.741
ist+5.670
der+5.656
aman+5.575
Neg logit contributions
 ever-3.260
 it-2.831
 have-2.532
 excited-2.500
 hungry-2.411
Token of
Token ID286
Feature activation-5.70e-01

Pos logit contributions
 ever+0.214
keeping+0.214
 onwards+0.194
 fright+0.179
 sideways+0.171
Neg logit contributions
aman-0.624
 hey-0.618
land-0.594
 exception-0.575
 whose-0.571
Token the
Token ID262
Feature activation+2.95e-01

Pos logit contributions
keeping+0.579
 ever+0.575
 onwards+0.570
 fright+0.515
 sideways+0.510
Neg logit contributions
aman-0.952
 hey-0.932
land-0.868
 extingu-0.863
 whose-0.848
Token day
Token ID1110
Feature activation-2.25e-01

Pos logit contributions
aman+0.574
 hey+0.564
land+0.541
 exception+0.521
 extingu+0.514
Neg logit contributions
 ever-0.242
keeping-0.225
 onwards-0.225
 it-0.209
 fright-0.202
Token,
Token ID11
Feature activation+8.47e-01

Pos logit contributions
 ever+0.211
keeping+0.209
 onwards+0.198
 fright+0.194
 sideways+0.186
Neg logit contributions
aman-0.400
 hey-0.392
land-0.376
 exception-0.355
 whose-0.351
Token batteries
Token ID13591
Feature activation+1.71e-02

Pos logit contributions
 ever+0.001
 onwards+0.001
keeping+0.001
 it+0.001
 fright+0.001
Neg logit contributions
aman-0.002
 hey-0.002
land-0.002
 exception-0.001
 extingu-0.001
Token.
Token ID13
Feature activation+5.52e-01

Pos logit contributions
aman+0.030
 hey+0.029
land+0.028
 exception+0.027
 extingu+0.026
Neg logit contributions
 ever-0.017
 onwards-0.016
keeping-0.016
 fright-0.015
 it-0.015
Token\n
Token ID198
Feature activation+1.46e-02

Pos logit contributions
aman+0.975
 hey+0.958
land+0.918
 exception+0.881
 extingu+0.870
Neg logit contributions
 ever-0.546
 onwards-0.521
keeping-0.512
 it-0.494
 fright-0.470
Token\n
Token ID198
Feature activation+4.49e-01

Pos logit contributions
aman+0.026
 hey+0.025
land+0.024
 exception+0.023
 whose+0.023
Neg logit contributions
 ever-0.014
keeping-0.014
 onwards-0.013
 it-0.013
 fright-0.012
TokenAt
Token ID2953
Feature activation+1.23e-01

Pos logit contributions
aman+0.648
 hey+0.633
land+0.601
 exception+0.570
 extingu+0.561
Neg logit contributions
 ever-0.593
 onwards-0.567
keeping-0.562
 it-0.543
 fright-0.529
Token the
Token ID262
Feature activation+3.08e+00

Pos logit contributions
aman+0.223
 hey+0.218
land+0.207
 exception+0.199
 extingu+0.197
Neg logit contributions
 ever-0.118
keeping-0.112
 onwards-0.111
 it-0.102
 fright-0.102
Token store
Token ID3650
Feature activation+9.41e-02

Pos logit contributions
ist+4.902
der+4.873
land+4.866
 hey+4.851
aman+4.803
Neg logit contributions
 it-3.680
 ever-3.673
 excited-3.446
 onwards-3.383
 on-3.354
Token,
Token ID11
Feature activation+3.15e-01

Pos logit contributions
aman+0.174
 hey+0.170
land+0.162
 exception+0.155
 extingu+0.152
Neg logit contributions
 ever-0.086
keeping-0.081
 onwards-0.080
 fright-0.074
 it-0.074
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
aman+0.515
 hey+0.505
land+0.480
 exception+0.459
 extingu+0.452
Neg logit contributions
 ever-0.351
keeping-0.333
 onwards-0.332
 it-0.316
 fright-0.308
Token store
Token ID3650
Feature activation-7.63e-01

Pos logit contributions
aman+1.618
 hey+1.517
land+1.492
 exception+1.360
 whose+1.347
Neg logit contributions
 ever-1.698
 onwards-1.629
keeping-1.603
 it-1.600
 excited-1.570
Token-
Token ID12
Feature activation+8.06e-01

Pos logit contributions
 ever+0.923
keeping+0.913
 onwards+0.886
 fright+0.874
 sideways+0.848
Neg logit contributions
aman-1.173
 hey-1.130
land-1.064
 exception-0.998
 whose-0.988
Token sun
Token ID4252
Feature activation+3.62e-01

Pos logit contributions
aman+1.672
 hey+1.644
land+1.565
 exception+1.520
 extingu+1.493
Neg logit contributions
 ever-0.802
 onwards-0.750
keeping-0.740
 it-0.724
 fright-0.686
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
aman+0.643
 hey+0.632
land+0.603
 exception+0.579
 extingu+0.571
Neg logit contributions
 ever-0.352
keeping-0.330
 onwards-0.330
 it-0.312
 fright-0.301
Token\n
Token ID198
Feature activation-6.88e-01

Pos logit contributions
aman+0.617
 hey+0.607
land+0.582
 exception+0.561
 extingu+0.553
Neg logit contributions
 ever-0.260
keeping-0.241
 onwards-0.240
 it-0.224
 fright-0.216
Token\n
Token ID198
Feature activation+2.38e-01

Pos logit contributions
 ever+0.778
keeping+0.747
 onwards+0.722
 fright+0.680
 sideways+0.673
Neg logit contributions
aman-1.119
 hey-1.084
land-1.050
 whose-1.006
 exception-0.979
TokenAt
Token ID2953
Feature activation+2.53e-01

Pos logit contributions
aman+0.366
 hey+0.358
land+0.339
 exception+0.323
 extingu+0.317
Neg logit contributions
 ever-0.292
keeping-0.278
 onwards-0.277
 it-0.264
 fright-0.259
Token the
Token ID262
Feature activation+3.07e+00

Pos logit contributions
aman+0.462
 hey+0.453
land+0.433
 exception+0.416
 extingu+0.411
Neg logit contributions
 ever-0.236
keeping-0.222
 onwards-0.221
 it-0.206
 fright-0.202
Token end
Token ID886
Feature activation-5.43e-01

Pos logit contributions
 hey+5.657
aman+5.538
 extingu+5.408
lves+5.381
der+5.357
Neg logit contributions
 ever-2.998
 it-2.964
 onwards-2.582
 on-2.575
 excited-2.573
Token of
Token ID286
Feature activation-4.80e-01

Pos logit contributions
keeping+0.411
 ever+0.391
 onwards+0.364
 fright+0.336
 sideways+0.325
Neg logit contributions
aman-1.086
 hey-1.060
land-1.037
 exception-0.990
 whose-0.990
Token the
Token ID262
Feature activation+1.82e-01

Pos logit contributions
 ever+0.495
keeping+0.483
 onwards+0.482
 sideways+0.442
 fright+0.440
Neg logit contributions
aman-0.815
 hey-0.781
land-0.736
 extingu-0.731
 whose-0.717
Token day
Token ID1110
Feature activation-2.49e-02

Pos logit contributions
aman+0.351
 hey+0.344
land+0.330
 exception+0.318
 extingu+0.313
Neg logit contributions
 ever-0.154
keeping-0.145
 onwards-0.144
 it-0.134
 fright-0.129
Token,
Token ID11
Feature activation+4.08e-01

Pos logit contributions
 ever+0.023
keeping+0.022
 onwards+0.021
 fright+0.020
 it+0.020
Neg logit contributions
aman-0.045
 hey-0.044
land-0.043
 exception-0.041
 whose-0.040
Token.
Token ID13
Feature activation-3.59e-01

Pos logit contributions
aman+4.759
 hey+4.717
land+4.460
lves+4.376
 extingu+4.349
Neg logit contributions
 ever-1.442
 it-1.400
 on-1.352
 excited-1.269
 onwards-1.169
Token She
Token ID1375
Feature activation-2.73e-01

Pos logit contributions
 ever+0.366
keeping+0.344
 onwards+0.330
 fright+0.316
 it+0.299
Neg logit contributions
aman-0.626
 hey-0.611
land-0.588
 exception-0.567
 whose-0.559
Token saw
Token ID2497
Feature activation+1.24e+00

Pos logit contributions
keeping+0.298
 ever+0.296
 onwards+0.294
 fright+0.273
 it+0.260
Neg logit contributions
 hey-0.435
aman-0.434
land-0.407
 whose-0.395
 exception-0.390
Token a
Token ID257
Feature activation-1.95e-01

Pos logit contributions
aman+2.193
 hey+2.116
land+2.056
 exception+1.969
 extingu+1.926
Neg logit contributions
 ever-1.290
 it-1.272
 onwards-1.167
 excited-1.161
keeping-1.161
Token beetle
Token ID45489
Feature activation-4.70e-01

Pos logit contributions
 ever+0.245
keeping+0.241
 onwards+0.236
 it+0.215
 fright+0.215
Neg logit contributions
aman-0.291
 hey-0.283
land-0.268
 exception-0.253
 whose-0.252
Token walking
Token ID6155
Feature activation-3.37e+00

Pos logit contributions
keeping+0.359
 ever+0.349
 onwards+0.342
 fright+0.305
unts+0.289
Neg logit contributions
aman-0.902
 hey-0.899
land-0.867
 whose-0.849
 exception-0.825
Token on
Token ID319
Feature activation-2.83e-01

Pos logit contributions
unts+4.905
 grin+4.641
keeping+4.482
 fingertips+4.398
 onwards+4.392
Neg logit contributions
 whose-4.515
 where-4.283
 living-4.164
 who-4.134
 hey-4.030
Token the
Token ID262
Feature activation+5.74e-01

Pos logit contributions
 ever+0.298
 onwards+0.290
keeping+0.278
unts+0.256
 fright+0.252
Neg logit contributions
aman-0.480
 hey-0.462
land-0.447
 whose-0.423
 extingu-0.421
Token grass
Token ID8701
Feature activation+5.08e-01

Pos logit contributions
aman+0.860
 hey+0.842
land+0.791
 exception+0.759
 extingu+0.742
Neg logit contributions
 ever-0.733
keeping-0.699
 onwards-0.689
 it-0.672
 fright-0.650
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
aman+1.009
 hey+0.997
land+0.952
 exception+0.922
 extingu+0.910
Neg logit contributions
 ever-0.394
 onwards-0.360
keeping-0.360
 it-0.339
 fright-0.317
Token She
Token ID1375
Feature activation-4.17e-01

Pos logit contributions
 ever+0.538
keeping+0.510
 onwards+0.476
 fright+0.448
unts+0.437
Neg logit contributions
aman-0.923
 hey-0.896
land-0.867
 whose-0.836
 exception-0.819
Token ground
Token ID2323
Feature activation+6.81e-01

Pos logit contributions
aman+2.196
 hey+2.186
land+2.047
 exception+1.995
 whose+1.887
Neg logit contributions
 ever-1.703
keeping-1.626
 it-1.576
 onwards-1.515
 fright-1.478
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
aman+1.262
 hey+1.243
land+1.177
 exception+1.133
 extingu+1.118
Neg logit contributions
 ever-0.638
 onwards-0.583
keeping-0.583
 it-0.567
 fright-0.530
Token He
Token ID679
Feature activation-4.20e-01

Pos logit contributions
 ever+1.682
keeping+1.594
 onwards+1.489
unts+1.351
 fright+1.330
Neg logit contributions
aman-2.443
 hey-2.307
 whose-2.292
 extingu-2.211
land-2.185
Token wanted
Token ID2227
Feature activation-8.16e-01

Pos logit contributions
keeping+0.400
 ever+0.391
 onwards+0.389
 fright+0.338
 it+0.326
Neg logit contributions
aman-0.743
 hey-0.733
 whose-0.693
land-0.691
 exception-0.674
Token to
Token ID284
Feature activation-5.87e-01

Pos logit contributions
keeping+0.748
 onwards+0.745
 ever+0.705
 grin+0.658
unts+0.656
Neg logit contributions
aman-1.428
 hey-1.424
land-1.371
 whose-1.365
 extingu-1.310
Token drive
Token ID3708
Feature activation-3.26e+00

Pos logit contributions
 onwards+0.706
keeping+0.680
 ever+0.670
unts+0.595
 onward+0.590
Neg logit contributions
 hey-0.896
aman-0.891
 exception-0.812
land-0.812
 extingu-0.810
Token his
Token ID465
Feature activation+1.02e+00

Pos logit contributions
unts+4.577
keeping+4.503
 grin+4.168
 Amen+4.032
umbs+3.878
Neg logit contributions
 hey-4.646
 whose-4.572
 where-4.549
 who-4.252
 exception-4.229
Token car
Token ID1097
Feature activation+5.02e-01

Pos logit contributions
aman+1.497
 hey+1.469
land+1.412
 exception+1.317
 extingu+1.279
Neg logit contributions
 ever-1.377
 it-1.290
 onwards-1.278
keeping-1.263
 fright-1.221
Token over
Token ID625
Feature activation-1.51e+00

Pos logit contributions
aman+0.931
 hey+0.914
land+0.873
 exception+0.837
 extingu+0.830
Neg logit contributions
 ever-0.458
 onwards-0.425
keeping-0.421
 it-0.403
 fright-0.385
Token it
Token ID340
Feature activation+2.69e-01

Pos logit contributions
 onwards+1.643
keeping+1.643
unts+1.615
 grin+1.586
 ever+1.570
Neg logit contributions
aman-2.394
 hey-2.372
 whose-2.244
land-2.204
 extingu-2.089
Token,
Token ID11
Feature activation-6.21e-01

Pos logit contributions
aman+0.500
 hey+0.490
land+0.471
 exception+0.453
 extingu+0.446
Neg logit contributions
 ever-0.240
keeping-0.226
 onwards-0.224
 it-0.209
 fright-0.204
Token,
Token ID11
Feature activation-1.82e+00

Pos logit contributions
aman+0.339
 hey+0.332
land+0.319
 exception+0.305
 extingu+0.301
Neg logit contributions
 ever-0.185
keeping-0.175
 onwards-0.175
 it-0.161
 fright-0.160
Token and
Token ID290
Feature activation-1.45e+00

Pos logit contributions
keeping+2.252
 onwards+2.117
unts+2.104
 ever+2.094
tick+1.966
Neg logit contributions
aman-2.639
 whose-2.521
 hey-2.488
land-2.448
 extingu-2.362
Token Peter
Token ID5613
Feature activation+1.07e-01

Pos logit contributions
keeping+1.381
 ever+1.288
 onwards+1.207
unts+1.163
 Go+1.108
Neg logit contributions
aman-2.530
 hey-2.464
 whose-2.430
land-2.356
lves-2.301
Token wanted
Token ID2227
Feature activation-1.31e+00

Pos logit contributions
aman+0.197
 hey+0.194
land+0.186
 exception+0.178
 whose+0.177
Neg logit contributions
 ever-0.095
keeping-0.091
 onwards-0.091
 it-0.082
 fright-0.082
Token to
Token ID284
Feature activation-9.41e-02

Pos logit contributions
 onwards+1.331
keeping+1.303
unts+1.234
 grin+1.230
 ever+1.168
Neg logit contributions
 hey-2.163
aman-2.156
land-2.151
 whose-2.106
 exception-1.994
Token ride
Token ID6594
Feature activation-3.20e+00

Pos logit contributions
 onwards+0.099
 ever+0.099
keeping+0.097
 it+0.089
 sideways+0.087
Neg logit contributions
aman-0.156
 hey-0.154
land-0.146
 exception-0.141
 extingu-0.138
Token it
Token ID340
Feature activation-1.24e-01

Pos logit contributions
unts+3.989
keeping+3.861
irk+3.505
 grin+3.306
umbs+3.145
Neg logit contributions
 whose-5.257
land-4.910
 hey-4.892
 exception-4.766
 who-4.759
Token.
Token ID13
Feature activation+4.67e-01

Pos logit contributions
 ever+0.125
keeping+0.124
 onwards+0.119
 fright+0.110
unts+0.110
Neg logit contributions
aman-0.211
 hey-0.205
land-0.200
 exception-0.193
 whose-0.187
Token He
Token ID679
Feature activation+1.05e-01

Pos logit contributions
aman+0.874
 hey+0.859
land+0.823
 exception+0.791
 extingu+0.781
Neg logit contributions
 ever-0.415
 onwards-0.389
keeping-0.388
 it-0.371
 fright-0.352
Token asked
Token ID1965
Feature activation-2.23e-01

Pos logit contributions
aman+0.190
 hey+0.187
land+0.179
 exception+0.172
 whose+0.170
Neg logit contributions
 ever-0.098
keeping-0.095
 onwards-0.094
 it-0.086
 fright-0.084
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
 ever+0.229
keeping+0.228
 onwards+0.219
 fright+0.199
unts+0.197
Neg logit contributions
aman-0.377
 hey-0.373
land-0.357
 whose-0.344
 exception-0.342
Token horn
Token ID12718
Feature activation-3.88e-01

Pos logit contributions
 ever+1.089
keeping+1.084
 onwards+1.050
 Go+0.914
 fright+0.912
Neg logit contributions
aman-1.255
 hey-1.230
land-1.187
 extingu-1.122
 whose-1.115
Token.
Token ID13
Feature activation-4.50e-02

Pos logit contributions
 ever+0.355
keeping+0.354
 onwards+0.349
 fright+0.319
 it+0.300
Neg logit contributions
aman-0.679
 hey-0.673
land-0.658
 extingu-0.624
 whose-0.623
Token They
Token ID1119
Feature activation-4.21e-02

Pos logit contributions
 ever+0.047
keeping+0.044
 onwards+0.043
 fright+0.041
 it+0.039
Neg logit contributions
aman-0.078
 hey-0.076
land-0.073
 exception-0.070
 whose-0.069
Token wanted
Token ID2227
Feature activation-9.31e-01

Pos logit contributions
 ever+0.035
keeping+0.034
 onwards+0.034
 it+0.030
 fright+0.030
Neg logit contributions
aman-0.080
 hey-0.079
land-0.075
 exception-0.073
 whose-0.072
Token to
Token ID284
Feature activation-5.25e-01

Pos logit contributions
 onwards+0.873
keeping+0.861
 ever+0.816
 grin+0.771
unts+0.769
Neg logit contributions
 hey-1.607
aman-1.604
land-1.594
 whose-1.558
 exception-1.478
Token ride
Token ID6594
Feature activation-3.19e+00

Pos logit contributions
 onwards+0.555
keeping+0.542
 ever+0.532
unts+0.469
 sideways+0.458
Neg logit contributions
aman-0.866
 hey-0.866
land-0.815
 exception-0.795
 whose-0.789
Token the
Token ID262
Feature activation+6.03e-01

Pos logit contributions
keeping+3.850
unts+3.775
 grin+3.540
irk+3.488
umbs+3.271
Neg logit contributions
 whose-5.479
land-4.960
 who-4.900
 hey-4.834
 where-4.677
Token boat
Token ID8848
Feature activation+7.35e-02

Pos logit contributions
aman+0.786
 hey+0.761
land+0.711
 exception+0.677
 extingu+0.657
Neg logit contributions
 ever-0.889
keeping-0.846
 onwards-0.844
 it-0.835
 fright-0.805
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
aman+0.134
 hey+0.132
land+0.127
 exception+0.122
 whose+0.120
Neg logit contributions
 ever-0.066
keeping-0.063
 onwards-0.062
 it-0.057
 fright-0.057
Token but
Token ID475
Feature activation-8.28e-01

Pos logit contributions
keeping+1.578
 ever+1.550
 onwards+1.534
unts+1.459
tick+1.392
Neg logit contributions
aman-1.226
 hey-1.220
land-1.189
 whose-1.149
 exception-1.107
Token there
Token ID612
Feature activation-1.46e+00

Pos logit contributions
keeping+0.844
 ever+0.826
 onwards+0.774
unts+0.757
 fright+0.751
Neg logit contributions
aman-1.417
 hey-1.403
 whose-1.332
land-1.293
 exception-1.280
Token very
Token ID845
Feature activation-7.80e-01

Pos logit contributions
keeping+0.479
 ever+0.460
 onwards+0.444
 fright+0.387
 it+0.385
Neg logit contributions
 hey-1.047
aman-1.036
land-0.975
 exception-0.974
 whose-0.966
Token happy
Token ID3772
Feature activation-2.52e-01

Pos logit contributions
keeping+0.659
 ever+0.602
 onwards+0.593
 fright+0.523
 it+0.497
Neg logit contributions
 hey-1.498
aman-1.476
 exception-1.403
lves-1.364
land-1.364
Token and
Token ID290
Feature activation-4.31e-01

Pos logit contributions
 onwards+0.208
 ever+0.206
keeping+0.200
 fright+0.184
unts+0.177
Neg logit contributions
aman-0.472
 hey-0.466
land-0.450
 exception-0.431
 whose-0.431
Token wanted
Token ID2227
Feature activation-8.02e-01

Pos logit contributions
 ever+0.389
keeping+0.382
 onwards+0.374
 fright+0.344
unts+0.325
Neg logit contributions
aman-0.778
 hey-0.771
land-0.729
 exception-0.712
 whose-0.711
Token to
Token ID284
Feature activation-4.17e-01

Pos logit contributions
 onwards+0.738
keeping+0.727
 ever+0.689
unts+0.661
 grin+0.633
Neg logit contributions
 hey-1.414
aman-1.401
land-1.382
 whose-1.325
 exception-1.299
Token ride
Token ID6594
Feature activation-3.17e+00

Pos logit contributions
 onwards+0.465
keeping+0.457
 ever+0.452
 sideways+0.402
unts+0.395
Neg logit contributions
 hey-0.664
aman-0.663
land-0.623
 exception-0.602
 whose-0.589
Token it
Token ID340
Feature activation-1.82e-01

Pos logit contributions
unts+4.311
keeping+4.260
irk+3.712
umbs+3.655
 grin+3.505
Neg logit contributions
 hey-4.649
 whose-4.600
land-4.598
 where-4.306
 exception-4.297
Token.
Token ID13
Feature activation+3.17e-01

Pos logit contributions
keeping+0.186
 ever+0.185
 onwards+0.176
 fright+0.164
unts+0.164
Neg logit contributions
aman-0.305
 hey-0.299
land-0.291
 exception-0.280
 whose-0.272
Token
Token ID198
Feature activation-6.07e-01

Pos logit contributions
aman+0.585
 hey+0.574
land+0.550
 exception+0.529
 extingu+0.521
Neg logit contributions
 ever-0.290
keeping-0.271
 onwards-0.271
 it-0.254
 fright-0.246
Token
Token ID198
Feature activation+3.62e-01

Pos logit contributions
 ever+0.671
keeping+0.668
 onwards+0.639
 fright+0.590
 sideways+0.590
Neg logit contributions
aman-0.976
 hey-0.944
land-0.918
 whose-0.873
 exception-0.866
TokenL
Token ID43
Feature activation+4.53e-01

Pos logit contributions
aman+0.417
 hey+0.405
land+0.377
 exception+0.353
 extingu+0.345
Neg logit contributions
 ever-0.587
 onwards-0.566
keeping-0.565
 it-0.547
 fright-0.537
Token front
Token ID2166
Feature activation-6.45e-01

Pos logit contributions
aman+1.646
 hey+1.631
land+1.522
 exception+1.463
 extingu+1.432
Neg logit contributions
 ever-1.421
keeping-1.339
 onwards-1.313
 it-1.286
 fright-1.247
Token.
Token ID13
Feature activation+4.41e-01

Pos logit contributions
 ever+0.767
keeping+0.751
 onwards+0.745
 it+0.701
 fright+0.696
Neg logit contributions
aman-0.992
land-0.940
 hey-0.934
 exception-0.865
 whose-0.854
Token Lily
Token ID20037
Feature activation+8.64e-01

Pos logit contributions
aman+0.777
 hey+0.765
land+0.729
 exception+0.700
 extingu+0.692
Neg logit contributions
 ever-0.440
 onwards-0.417
keeping-0.413
 it-0.398
 fright-0.380
Token liked
Token ID8288
Feature activation-1.22e+00

Pos logit contributions
aman+1.471
 hey+1.459
land+1.381
 extingu+1.323
 exception+1.315
Neg logit contributions
 ever-0.935
 onwards-0.851
 it-0.849
keeping-0.834
 excited-0.821
Token to
Token ID284
Feature activation-9.42e-01

Pos logit contributions
 onwards+1.174
keeping+1.145
 ever+1.137
unts+1.068
 grin+1.031
Neg logit contributions
aman-2.135
 hey-2.010
 whose-1.991
land-1.982
 exception-1.874
Token ride
Token ID6594
Feature activation-3.15e+00

Pos logit contributions
keeping+1.363
 onwards+1.355
 ever+1.316
unts+1.225
 sideways+1.177
Neg logit contributions
aman-1.182
 hey-1.176
land-1.097
 exception-1.090
 whose-1.053
Token it
Token ID340
Feature activation+2.75e-01

Pos logit contributions
keeping+4.482
unts+4.289
 ever+4.053
 grin+3.915
 fright+3.876
Neg logit contributions
 whose-4.248
land-4.159
aman-4.120
 who-3.835
lves-3.813
Token around
Token ID1088
Feature activation-9.48e-01

Pos logit contributions
aman+0.485
 hey+0.475
land+0.455
 exception+0.437
 extingu+0.429
Neg logit contributions
 ever-0.270
keeping-0.256
 onwards-0.253
 it-0.239
 fright-0.233
Token her
Token ID607
Feature activation+6.80e-01

Pos logit contributions
keeping+1.055
 onwards+1.022
 ever+1.006
unts+0.924
 fright+0.917
Neg logit contributions
aman-1.523
 hey-1.497
land-1.408
 whose-1.393
 exception-1.352
Token town
Token ID3240
Feature activation+7.88e-02

Pos logit contributions
aman+1.175
 hey+1.155
land+1.094
 exception+1.056
 extingu+1.042
Neg logit contributions
 ever-0.717
 onwards-0.673
keeping-0.671
 it-0.655
 excited-0.624
Token and
Token ID290
Feature activation-5.87e-01

Pos logit contributions
aman+0.144
 hey+0.141
land+0.137
 exception+0.131
 whose+0.129
Neg logit contributions
 ever-0.071
 onwards-0.067
keeping-0.067
 it-0.062
 fright-0.061
Token He
Token ID679
Feature activation-3.17e-01

Pos logit contributions
 ever+1.148
keeping+1.044
 onwards+0.967
 fright+0.944
unts+0.902
Neg logit contributions
aman-2.140
 hey-1.980
land-1.980
 whose-1.882
 exception-1.878
Token was
Token ID373
Feature activation-7.16e-01

Pos logit contributions
keeping+0.295
 ever+0.293
 onwards+0.285
 fright+0.253
 it+0.246
Neg logit contributions
aman-0.572
 hey-0.556
land-0.533
 whose-0.522
 exception-0.514
Token so
Token ID523
Feature activation-8.70e-01

Pos logit contributions
keeping+0.622
 ever+0.605
 onwards+0.575
 fright+0.514
unts+0.499
Neg logit contributions
aman-1.314
 hey-1.312
 whose-1.244
land-1.236
 exception-1.234
Token excited
Token ID6568
Feature activation+2.48e-01

Pos logit contributions
keeping+0.444
 ever+0.364
 onwards+0.357
 fright+0.302
tick+0.248
Neg logit contributions
aman-1.956
 hey-1.947
 exception-1.852
 whose-1.848
land-1.833
Token to
Token ID284
Feature activation-1.14e+00

Pos logit contributions
aman+0.469
 hey+0.460
land+0.443
 exception+0.426
 extingu+0.419
Neg logit contributions
 ever-0.213
 onwards-0.199
keeping-0.199
 it-0.184
 fright-0.180
Token ride
Token ID6594
Feature activation-3.15e+00

Pos logit contributions
keeping+1.382
 onwards+1.366
 ever+1.287
unts+1.207
 sideways+1.150
Neg logit contributions
 hey-1.698
aman-1.696
 exception-1.571
 whose-1.558
land-1.550
Token in
Token ID287
Feature activation-1.95e+00

Pos logit contributions
unts+3.265
keeping+3.116
 Amen+2.815
 ever+2.729
 fright+2.627
Neg logit contributions
 whose-5.568
land-5.448
aman-5.277
 hey-5.128
 who-5.119
Token it
Token ID340
Feature activation+3.30e-01

Pos logit contributions
keeping+1.473
 ever+1.421
unts+1.412
 onwards+1.355
 grin+1.225
Neg logit contributions
aman-3.883
 whose-3.734
land-3.670
 hey-3.616
 exception-3.450
Token.
Token ID13
Feature activation-2.79e-01

Pos logit contributions
aman+0.596
 hey+0.585
land+0.560
 exception+0.538
 extingu+0.530
Neg logit contributions
 ever-0.310
keeping-0.291
 onwards-0.291
 it-0.274
 fright-0.265
Token His
Token ID2399
Feature activation+1.35e-01

Pos logit contributions
 ever+0.281
keeping+0.261
 onwards+0.252
 fright+0.240
unts+0.229
Neg logit contributions
aman-0.497
 hey-0.477
land-0.463
 exception-0.443
 whose-0.436
Token mom
Token ID1995
Feature activation+2.62e-01

Pos logit contributions
aman+0.249
 hey+0.244
land+0.233
 exception+0.224
 extingu+0.221
Neg logit contributions
 ever-0.126
keeping-0.119
 onwards-0.118
 it-0.109
 fright-0.107
Token car
Token ID1097
Feature activation+2.36e-01

Pos logit contributions
aman+0.113
 hey+0.111
land+0.104
 exception+0.099
 extingu+0.097
Neg logit contributions
 ever-0.113
keeping-0.110
 onwards-0.109
 it-0.102
 fright-0.101
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
aman+0.451
 hey+0.445
land+0.426
 exception+0.410
 extingu+0.405
Neg logit contributions
 ever-0.196
keeping-0.184
 onwards-0.183
 it-0.169
 fright-0.164
Token He
Token ID679
Feature activation-5.77e-01

Pos logit contributions
 ever+1.851
keeping+1.818
 onwards+1.659
 fright+1.564
unts+1.482
Neg logit contributions
aman-2.339
 hey-2.294
 whose-2.236
land-2.222
 extingu-2.170
Token wanted
Token ID2227
Feature activation-7.22e-01

Pos logit contributions
keeping+0.577
 onwards+0.552
 ever+0.547
 fright+0.485
 it+0.459
Neg logit contributions
aman-1.001
 hey-0.984
 whose-0.941
land-0.939
 exception-0.907
Token to
Token ID284
Feature activation-8.11e-01

Pos logit contributions
keeping+0.657
 onwards+0.647
 ever+0.627
 grin+0.571
unts+0.558
Neg logit contributions
 hey-1.267
aman-1.252
land-1.229
 whose-1.208
 exception-1.164
Token drive
Token ID3708
Feature activation-3.12e+00

Pos logit contributions
 onwards+1.031
keeping+1.002
 ever+0.961
unts+0.868
 onward+0.866
Neg logit contributions
 hey-1.177
aman-1.151
 extingu-1.058
land-1.058
 exception-1.057
Token the
Token ID262
Feature activation+1.08e+00

Pos logit contributions
keeping+4.345
unts+4.261
 grin+4.216
 Amen+3.852
irk+3.723
Neg logit contributions
 whose-4.283
 hey-4.218
 where-4.080
land-4.025
 who-3.949
Token car
Token ID1097
Feature activation+1.02e-01

Pos logit contributions
aman+2.001
 hey+1.905
land+1.853
 exception+1.774
 extingu+1.736
Neg logit contributions
 ever-1.076
 it-1.007
 onwards-0.977
keeping-0.951
 fright-0.919
Token with
Token ID351
Feature activation-2.62e-01

Pos logit contributions
aman+0.178
 hey+0.175
land+0.168
 exception+0.162
 whose+0.158
Neg logit contributions
 ever-0.100
keeping-0.096
 onwards-0.095
 it-0.089
 fright-0.088
Token his
Token ID465
Feature activation+2.92e-01

Pos logit contributions
 ever+0.278
keeping+0.275
 onwards+0.270
unts+0.239
 fright+0.238
Neg logit contributions
aman-0.434
 hey-0.432
land-0.404
 exception-0.392
 whose-0.389
Token stick
Token ID4859
Feature activation-2.78e-02

Pos logit contributions
aman+0.494
 hey+0.484
land+0.461
 exception+0.442
 extingu+0.435
Neg logit contributions
 ever-0.314
keeping-0.297
 onwards-0.297
 it-0.281
 fright-0.273
Token's
Token ID338
Feature activation-3.95e-01

Pos logit contributions
 ever+0.027
keeping+0.027
 onwards+0.026
 it+0.024
 fright+0.024
Neg logit contributions
aman-0.030
 hey-0.030
land-0.028
 exception-0.027
 whose-0.026
Token bicycle
Token ID17026
Feature activation+9.85e-03

Pos logit contributions
 ever+0.512
keeping+0.501
 onwards+0.496
 it+0.462
unts+0.458
Neg logit contributions
aman-0.566
 hey-0.544
land-0.513
 exception-0.497
 extingu-0.483
Token and
Token ID290
Feature activation+5.79e-01

Pos logit contributions
aman+0.020
 hey+0.020
land+0.019
 exception+0.018
 whose+0.018
Neg logit contributions
 ever-0.007
keeping-0.007
 onwards-0.006
 it-0.006
 fright-0.006
Token wanted
Token ID2227
Feature activation-1.37e+00

Pos logit contributions
aman+1.107
 hey+1.086
land+1.047
 exception+0.999
 extingu+0.991
Neg logit contributions
 ever-0.497
 onwards-0.456
keeping-0.455
 it-0.451
 excited-0.418
Token to
Token ID284
Feature activation-9.21e-01

Pos logit contributions
 onwards+1.360
keeping+1.334
unts+1.238
 grin+1.223
 ever+1.198
Neg logit contributions
 hey-2.321
aman-2.296
 whose-2.240
land-2.182
 exception-2.150
Token ride
Token ID6594
Feature activation-3.10e+00

Pos logit contributions
 onwards+1.125
keeping+1.117
 ever+1.036
unts+0.987
 sideways+0.951
Neg logit contributions
 hey-1.371
aman-1.339
 exception-1.262
land-1.246
 whose-1.205
Token it
Token ID340
Feature activation+5.49e-02

Pos logit contributions
unts+3.363
keeping+3.161
 grin+2.844
umbs+2.767
 onwards+2.692
Neg logit contributions
 whose-5.244
 hey-5.053
 exception-4.884
aman-4.811
lves-4.739
Token,
Token ID11
Feature activation-8.83e-01

Pos logit contributions
aman+0.095
 hey+0.093
land+0.090
 exception+0.087
 whose+0.084
Neg logit contributions
 ever-0.054
keeping-0.052
 onwards-0.051
 it-0.047
 fright-0.047
Token but
Token ID475
Feature activation-7.52e-01

Pos logit contributions
keeping+1.383
 ever+1.365
 onwards+1.316
unts+1.245
 sideways+1.191
Neg logit contributions
aman-1.071
 hey-1.037
land-0.973
 exception-0.945
 whose-0.936
Token Tim
Token ID5045
Feature activation-3.97e-01

Pos logit contributions
keeping+0.732
 ever+0.711
 onwards+0.678
unts+0.668
 fright+0.630
Neg logit contributions
aman-1.342
 hey-1.319
 whose-1.221
 exception-1.210
land-1.182
Tokenmy
Token ID1820
Feature activation-5.64e-01

Pos logit contributions
 ever+0.525
keeping+0.515
 onwards+0.511
 fright+0.474
 it+0.474
Neg logit contributions
aman-0.574
 hey-0.548
land-0.513
 exception-0.500
 whose-0.489
Token it
Token ID340
Feature activation-4.05e-01

Pos logit contributions
 ever+0.103
 onwards+0.099
keeping+0.099
unts+0.083
 fright+0.081
Neg logit contributions
aman-0.305
 hey-0.300
land-0.288
 whose-0.280
 exception-0.278
Token closer
Token ID5699
Feature activation-4.28e-01

Pos logit contributions
keeping+0.420
 ever+0.407
 onwards+0.386
unts+0.360
 fright+0.360
Neg logit contributions
aman-0.685
 hey-0.668
land-0.657
 exception-0.635
 whose-0.628
Token,
Token ID11
Feature activation-1.78e+00

Pos logit contributions
keeping+0.388
 ever+0.385
 onwards+0.373
 fright+0.352
unts+0.347
Neg logit contributions
aman-0.763
 hey-0.738
land-0.736
 exception-0.693
 whose-0.690
Token so
Token ID523
Feature activation-7.50e-01

Pos logit contributions
keeping+2.929
 ever+2.751
 onwards+2.711
unts+2.690
ay+2.563
Neg logit contributions
aman-2.062
 hey-2.018
land-2.003
 whose-1.947
 exception-1.856
Token she
Token ID673
Feature activation-2.24e-01

Pos logit contributions
keeping+1.027
 ever+0.970
 onwards+0.941
 fright+0.923
unts+0.896
Neg logit contributions
aman-1.043
 hey-1.029
land-0.966
 whose-0.921
 exception-0.899
Token moved
Token ID3888
Feature activation-3.09e+00

Pos logit contributions
 ever+0.223
keeping+0.221
 onwards+0.218
 fright+0.197
 it+0.195
Neg logit contributions
aman-0.383
 hey-0.378
land-0.359
 whose-0.345
 exception-0.344
Token a
Token ID257
Feature activation+4.11e-01

Pos logit contributions
keeping+4.529
unts+4.455
 grin+4.103
ay+3.792
 onwards+3.764
Neg logit contributions
 whose-4.156
 hey-4.073
aman-3.790
 who-3.650
 exception-3.638
Token bit
Token ID1643
Feature activation-1.46e-01

Pos logit contributions
aman+0.595
 hey+0.583
land+0.549
 exception+0.523
 extingu+0.512
Neg logit contributions
 ever-0.535
 onwards-0.509
keeping-0.509
 it-0.491
 fright-0.480
Token closer
Token ID5699
Feature activation-7.28e-01

Pos logit contributions
 ever+0.137
keeping+0.135
 onwards+0.131
 it+0.117
 fright+0.116
Neg logit contributions
 hey-0.258
aman-0.257
land-0.244
 exception-0.238
 whose-0.234
Token.
Token ID13
Feature activation+4.27e-01

Pos logit contributions
 ever+0.707
keeping+0.705
 onwards+0.701
unts+0.650
 fright+0.641
Neg logit contributions
aman-1.221
land-1.208
 hey-1.181
 exception-1.119
 whose-1.112
Token She
Token ID1375
Feature activation+4.34e-03

Pos logit contributions
aman+0.779
 hey+0.765
land+0.731
 exception+0.705
 extingu+0.695
Neg logit contributions
 ever-0.397
 onwards-0.373
keeping-0.371
 it-0.354
 fright-0.338
Token it
Token ID340
Feature activation+7.90e-02

Pos logit contributions
 onwards+0.782
keeping+0.772
 ever+0.752
unts+0.668
 grin+0.614
Neg logit contributions
 hey-1.836
aman-1.832
 whose-1.773
land-1.698
 extingu-1.675
Token.
Token ID13
Feature activation+2.09e-01

Pos logit contributions
aman+0.141
 hey+0.138
land+0.133
 exception+0.129
 whose+0.127
Neg logit contributions
 ever-0.074
keeping-0.072
 onwards-0.070
 fright-0.064
 it-0.064
Token The
Token ID383
Feature activation-8.39e-01

Pos logit contributions
aman+0.376
 hey+0.368
land+0.352
 exception+0.338
 extingu+0.334
Neg logit contributions
 ever-0.202
keeping-0.189
 onwards-0.188
 it-0.174
 fright-0.172
Token crab
Token ID32202
Feature activation-1.25e-01

Pos logit contributions
 onwards+0.879
 ever+0.868
keeping+0.863
unts+0.758
 fright+0.758
Neg logit contributions
 hey-1.381
aman-1.380
land-1.278
 whose-1.222
 exception-1.221
Token was
Token ID373
Feature activation-7.66e-01

Pos logit contributions
keeping+0.122
 ever+0.120
 onwards+0.116
 fright+0.105
 it+0.104
Neg logit contributions
aman-0.217
 hey-0.213
land-0.204
 whose-0.197
 exception-0.196
Token walking
Token ID6155
Feature activation-3.08e+00

Pos logit contributions
keeping+0.703
 ever+0.672
 onwards+0.652
 fright+0.583
tick+0.563
Neg logit contributions
aman-1.373
 hey-1.339
land-1.290
 whose-1.286
 exception-1.278
Token sideways
Token ID35303
Feature activation-2.58e-01

Pos logit contributions
keeping+4.554
unts+4.195
 grin+3.919
tick+3.900
 Amen+3.883
Neg logit contributions
 whose-4.374
lves-4.022
 hey-3.957
aman-3.932
 exception-3.930
Token and
Token ID290
Feature activation-8.11e-02

Pos logit contributions
keeping+0.190
 ever+0.189
 onwards+0.178
 fright+0.164
unts+0.157
Neg logit contributions
aman-0.506
 hey-0.492
land-0.489
 exception-0.461
 whose-0.461
Token had
Token ID550
Feature activation-1.56e-01

Pos logit contributions
 ever+0.075
keeping+0.073
 onwards+0.071
 fright+0.065
unts+0.063
Neg logit contributions
aman-0.146
 hey-0.144
land-0.136
 exception-0.132
 whose-0.131
Token big
Token ID1263
Feature activation-1.77e+00

Pos logit contributions
 ever+0.172
 onwards+0.168
keeping+0.168
 it+0.151
 fright+0.150
Neg logit contributions
aman-0.250
 hey-0.245
land-0.237
 exception-0.223
 whose-0.222
Token claws
Token ID28421
Feature activation+9.72e-01

Pos logit contributions
keeping+2.092
 ever+2.087
 onwards+1.983
kick+1.752
tick+1.730
Neg logit contributions
aman-2.748
 hey-2.695
land-2.593
 Majesty-2.558
 whose-2.534
Token a
Token ID257
Feature activation-4.33e-01

Pos logit contributions
aman+1.743
 hey+1.687
land+1.621
 exception+1.550
 extingu+1.508
Neg logit contributions
 ever-1.064
 it-1.037
 onwards-0.979
keeping-0.961
 excited-0.953
Token boat
Token ID8848
Feature activation+2.70e-01

Pos logit contributions
keeping+0.540
 ever+0.539
 onwards+0.523
 fright+0.471
 it+0.461
Neg logit contributions
aman-0.650
 hey-0.631
land-0.606
 whose-0.577
 extingu-0.572
Token and
Token ID290
Feature activation+7.27e-01

Pos logit contributions
aman+0.507
 hey+0.498
land+0.478
 exception+0.460
 extingu+0.454
Neg logit contributions
 ever-0.235
keeping-0.221
 onwards-0.220
 it-0.205
 fright-0.199
Token wanted
Token ID2227
Feature activation-8.81e-01

Pos logit contributions
aman+1.479
 hey+1.446
land+1.393
 exception+1.338
 extingu+1.333
Neg logit contributions
 ever-0.550
 it-0.505
keeping-0.489
 onwards-0.489
 excited-0.460
Token to
Token ID284
Feature activation-3.59e-01

Pos logit contributions
keeping+0.820
 onwards+0.815
 ever+0.774
 grin+0.737
unts+0.731
Neg logit contributions
land-1.522
 hey-1.512
aman-1.493
 whose-1.473
 exception-1.409
Token ride
Token ID6594
Feature activation-3.08e+00

Pos logit contributions
 onwards+0.416
 ever+0.412
keeping+0.411
unts+0.363
 sideways+0.360
Neg logit contributions
aman-0.553
 hey-0.550
land-0.521
 exception-0.503
 extingu-0.497
Token it
Token ID340
Feature activation-1.14e-01

Pos logit contributions
keeping+3.769
unts+3.759
 grin+3.511
irk+3.414
 fright+2.979
Neg logit contributions
 whose-5.127
land-4.783
 who-4.571
 exception-4.544
 where-4.494
Token.
Token ID13
Feature activation+4.30e-01

Pos logit contributions
 ever+0.118
keeping+0.117
 onwards+0.112
 fright+0.104
unts+0.103
Neg logit contributions
aman-0.190
 hey-0.185
land-0.181
 exception-0.174
 whose-0.170
Token He
Token ID679
Feature activation+2.84e-01

Pos logit contributions
aman+0.816
 hey+0.804
land+0.770
 exception+0.740
 extingu+0.731
Neg logit contributions
 ever-0.371
 onwards-0.347
keeping-0.346
 it-0.330
 fright-0.313
Token asked
Token ID1965
Feature activation-3.91e-01

Pos logit contributions
aman+0.509
 hey+0.500
land+0.478
 exception+0.459
 extingu+0.452
Neg logit contributions
 ever-0.272
keeping-0.257
 onwards-0.256
 it-0.240
 fright-0.233
Token the
Token ID262
Feature activation+5.04e-01

Pos logit contributions
keeping+0.463
 ever+0.460
 onwards+0.441
 fright+0.409
unts+0.403
Neg logit contributions
aman-0.607
 hey-0.597
land-0.569
 whose-0.553
 exception-0.546
Token wanted
Token ID2227
Feature activation-8.81e-01

Pos logit contributions
keeping+0.369
 ever+0.356
 onwards+0.353
 fright+0.307
 sideways+0.298
Neg logit contributions
aman-0.669
 hey-0.656
land-0.625
 whose-0.615
lves-0.600
Token to
Token ID284
Feature activation-6.35e-01

Pos logit contributions
 onwards+0.818
keeping+0.817
 ever+0.760
unts+0.714
 grin+0.701
Neg logit contributions
aman-1.542
 hey-1.540
land-1.489
 whose-1.449
 exception-1.397
Token catch
Token ID4929
Feature activation-2.37e+00

Pos logit contributions
 onwards+0.714
keeping+0.693
 ever+0.656
unts+0.592
 sideways+0.590
Neg logit contributions
 hey-1.019
aman-1.013
land-0.950
 exception-0.927
 whose-0.910
Token it
Token ID340
Feature activation-3.64e-01

Pos logit contributions
keeping+2.089
 onwards+2.076
 ever+1.824
unts+1.697
 onward+1.655
Neg logit contributions
 hey-4.439
 whose-4.352
lves-4.346
aman-4.255
land-4.130
Token and
Token ID290
Feature activation-3.63e-01

Pos logit contributions
keeping+0.336
 ever+0.324
 onwards+0.315
unts+0.285
 fright+0.284
Neg logit contributions
aman-0.658
 hey-0.638
land-0.628
 exception-0.604
 whose-0.596
Token hold
Token ID1745
Feature activation-3.07e+00

Pos logit contributions
 ever+0.386
 onwards+0.384
keeping+0.380
unts+0.343
 fright+0.330
Neg logit contributions
 hey-0.594
aman-0.594
land-0.555
 exception-0.537
 whose-0.534
Token it
Token ID340
Feature activation-5.38e-02

Pos logit contributions
keeping+3.476
unts+3.104
 grin+2.804
 Amen+2.774
 Please+2.509
Neg logit contributions
 whose-5.253
 hey-5.049
aman-4.847
lves-4.708
 who-4.641
Token in
Token ID287
Feature activation-4.44e-01

Pos logit contributions
 ever+0.048
keeping+0.047
 onwards+0.045
 it+0.042
 fright+0.041
Neg logit contributions
aman-0.097
 hey-0.096
land-0.092
 exception-0.089
 whose-0.088
Token her
Token ID607
Feature activation+5.04e-01

Pos logit contributions
 ever+0.540
 onwards+0.525
keeping+0.524
 sideways+0.475
 it+0.473
Neg logit contributions
aman-0.681
 hey-0.651
land-0.622
 extingu-0.590
 whose-0.588
Token hands
Token ID2832
Feature activation+2.70e-02

Pos logit contributions
aman+1.089
 hey+1.073
land+1.033
 exception+1.002
 extingu+0.984
Neg logit contributions
 ever-0.317
 onwards-0.285
keeping-0.282
 it-0.263
 fright-0.248
Token.
Token ID13
Feature activation+5.88e-01

Pos logit contributions
aman+0.046
 hey+0.045
land+0.043
 exception+0.042
 whose+0.041
Neg logit contributions
 ever-0.027
keeping-0.026
 onwards-0.026
 it-0.024
 fright-0.024
Token to
Token ID284
Feature activation-7.75e-01

Pos logit contributions
 onwards+1.286
keeping+1.225
 ever+1.207
 grin+1.119
unts+1.114
Neg logit contributions
aman-1.717
 hey-1.706
 whose-1.654
land-1.628
 exception-1.552
Token play
Token ID711
Feature activation-8.10e-01

Pos logit contributions
 onwards+0.939
keeping+0.929
 ever+0.906
unts+0.811
 sideways+0.800
Neg logit contributions
aman-1.160
 hey-1.147
land-1.095
 exception-1.056
 whose-1.047
Token with
Token ID351
Feature activation-9.58e-01

Pos logit contributions
keeping+0.745
 ever+0.716
 onwards+0.695
 fright+0.628
unts+0.625
Neg logit contributions
 hey-1.450
aman-1.389
land-1.383
 whose-1.369
 exception-1.329
Token it
Token ID340
Feature activation+6.17e-01

Pos logit contributions
keeping+0.436
 ever+0.426
 onwards+0.417
 grin+0.316
 fright+0.306
Neg logit contributions
aman-2.139
 hey-2.117
 whose-2.068
land-2.056
lves-1.987
Token and
Token ID290
Feature activation-9.45e-01

Pos logit contributions
aman+1.151
 hey+1.129
land+1.075
 exception+1.032
 extingu+1.027
Neg logit contributions
 ever-0.577
 onwards-0.530
keeping-0.514
 it-0.510
 fright-0.480
Token hold
Token ID1745
Feature activation-3.07e+00

Pos logit contributions
 onwards+1.131
 ever+1.101
keeping+1.077
unts+1.009
 sideways+0.974
Neg logit contributions
 hey-1.431
aman-1.415
 whose-1.317
land-1.315
 exception-1.280
Token it
Token ID340
Feature activation-1.89e-01

Pos logit contributions
keeping+3.122
 grin+2.938
 Amen+2.846
unts+2.660
 fright+2.494
Neg logit contributions
 whose-5.367
 hey-5.161
 who-4.923
 where-4.809
aman-4.737
Token tight
Token ID5381
Feature activation+3.62e-01

Pos logit contributions
 ever+0.165
keeping+0.163
 onwards+0.155
 fright+0.142
 it+0.142
Neg logit contributions
aman-0.342
 hey-0.337
land-0.325
 exception-0.315
 whose-0.311
Token.
Token ID13
Feature activation+4.28e-01

Pos logit contributions
aman+0.640
 hey+0.628
land+0.600
 exception+0.576
 extingu+0.568
Neg logit contributions
 ever-0.353
 onwards-0.331
keeping-0.331
 it-0.313
 fright-0.303
Token When
Token ID1649
Feature activation-1.71e+00

Pos logit contributions
aman+0.764
 hey+0.749
land+0.717
 exception+0.687
 extingu+0.679
Neg logit contributions
 ever-0.418
 onwards-0.394
keeping-0.393
 it-0.375
 fright-0.359
Token she
Token ID673
Feature activation+6.35e-01

Pos logit contributions
keeping+2.556
 fright+2.351
 onwards+2.330
 ever+2.321
unts+2.261
Neg logit contributions
 hey-2.279
aman-2.222
 whose-1.909
land-1.877
 exception-1.824
Token She
Token ID1375
Feature activation-1.31e-01

Pos logit contributions
 ever+0.018
keeping+0.017
 onwards+0.017
 fright+0.015
 it+0.015
Neg logit contributions
aman-0.034
 hey-0.033
land-0.031
 exception-0.030
 whose-0.030
Token saw
Token ID2497
Feature activation+4.93e-01

Pos logit contributions
 ever+0.135
keeping+0.133
 onwards+0.130
 fright+0.118
 it+0.117
Neg logit contributions
aman-0.223
 hey-0.219
land-0.209
 whose-0.200
 exception-0.199
Token a
Token ID257
Feature activation-2.87e-01

Pos logit contributions
aman+0.814
 hey+0.796
land+0.759
 exception+0.726
 extingu+0.714
Neg logit contributions
 ever-0.549
keeping-0.517
 onwards-0.515
 it-0.503
 fright-0.482
Token house
Token ID2156
Feature activation-9.96e-01

Pos logit contributions
 ever+0.380
keeping+0.373
 onwards+0.367
 it+0.335
 fright+0.332
Neg logit contributions
aman-0.411
 hey-0.397
land-0.379
 exception-0.356
 whose-0.354
Token and
Token ID290
Feature activation+4.29e-01

Pos logit contributions
keeping+0.958
 ever+0.939
 onwards+0.925
 fright+0.868
unts+0.815
Neg logit contributions
aman-1.739
 hey-1.699
land-1.627
 extingu-1.591
 whose-1.588
Token knocked
Token ID13642
Feature activation-3.04e+00

Pos logit contributions
aman+0.831
 hey+0.814
land+0.782
 exception+0.753
 extingu+0.744
Neg logit contributions
 ever-0.356
keeping-0.329
 onwards-0.327
 it-0.312
 fright-0.296
Token on
Token ID319
Feature activation-8.26e-01

Pos logit contributions
 grin+3.678
unts+3.556
 Amen+3.325
keeping+3.287
 onwards+3.078
Neg logit contributions
aman-4.826
 hey-4.616
 whose-4.430
ray-4.210
land-4.196
Token the
Token ID262
Feature activation-1.26e-01

Pos logit contributions
 ever+0.906
 onwards+0.875
keeping+0.853
unts+0.840
 grin+0.788
Neg logit contributions
aman-1.352
 hey-1.307
land-1.259
 whose-1.212
 extingu-1.205
Token door
Token ID3420
Feature activation-4.58e-01

Pos logit contributions
 ever+0.142
 onwards+0.136
keeping+0.132
 it+0.126
 fright+0.123
Neg logit contributions
aman-0.205
 hey-0.203
land-0.193
 extingu-0.184
 exception-0.181
Token.
Token ID13
Feature activation-8.10e-02

Pos logit contributions
 ever+0.496
keeping+0.488
 onwards+0.474
 fright+0.444
unts+0.434
Neg logit contributions
aman-0.754
 hey-0.723
land-0.718
 exception-0.676
 whose-0.669
Token
Token ID220
Feature activation-2.29e-02

Pos logit contributions
 ever+0.082
keeping+0.077
 onwards+0.075
 fright+0.070
 it+0.068
Neg logit contributions
aman-0.142
 hey-0.138
land-0.132
 exception-0.126
 whose-0.125
Token the
Token ID262
Feature activation+8.79e-01

Pos logit contributions
aman+3.217
land+3.167
 hey+3.142
 exception+2.977
 extingu+2.879
Neg logit contributions
 it-1.194
 ever-1.059
keeping-0.939
 on-0.921
 excited-0.917
Token car
Token ID1097
Feature activation-1.58e-03

Pos logit contributions
aman+1.217
 hey+1.182
land+1.126
 exception+1.050
 extingu+1.035
Neg logit contributions
 ever-1.222
 onwards-1.156
 it-1.148
keeping-1.143
 excited-1.115
Token and
Token ID290
Feature activation+4.05e-01

Pos logit contributions
 ever+0.001
keeping+0.001
 onwards+0.001
 it+0.001
 fright+0.001
Neg logit contributions
aman-0.003
 hey-0.003
land-0.003
 exception-0.003
 whose-0.003
Token wanted
Token ID2227
Feature activation-1.25e+00

Pos logit contributions
aman+0.797
 hey+0.783
land+0.753
 exception+0.725
 extingu+0.716
Neg logit contributions
 ever-0.321
keeping-0.295
 onwards-0.295
 it-0.279
 fright-0.265
Token to
Token ID284
Feature activation-7.66e-01

Pos logit contributions
 onwards+1.214
keeping+1.210
 grin+1.161
 ever+1.127
unts+1.107
Neg logit contributions
 hey-2.101
aman-2.080
land-2.050
 whose-2.034
 exception-1.942
Token drive
Token ID3708
Feature activation-3.02e+00

Pos logit contributions
 onwards+0.968
keeping+0.938
 ever+0.914
unts+0.835
 sideways+0.831
Neg logit contributions
 hey-1.110
aman-1.108
land-1.025
 exception-1.000
 extingu-0.992
Token it
Token ID340
Feature activation+3.78e-01

Pos logit contributions
unts+3.822
 grin+3.702
keeping+3.681
 Amen+3.264
umbs+3.243
Neg logit contributions
 hey-4.442
 whose-4.420
 exception-4.297
land-4.295
 where-4.114
Token.
Token ID13
Feature activation+3.32e-02

Pos logit contributions
aman+0.693
 hey+0.681
land+0.651
 exception+0.625
 extingu+0.617
Neg logit contributions
 ever-0.349
 onwards-0.326
keeping-0.325
 it-0.308
 fright-0.296
Token He
Token ID679
Feature activation+2.48e-02

Pos logit contributions
aman+0.062
 hey+0.060
land+0.057
 exception+0.055
 whose+0.055
Neg logit contributions
 ever-0.031
keeping-0.028
 onwards-0.028
 fright-0.026
 it-0.025
Token climbed
Token ID19952
Feature activation-1.15e+00

Pos logit contributions
aman+0.043
 hey+0.043
land+0.041
 exception+0.039
 whose+0.039
Neg logit contributions
 ever-0.024
keeping-0.024
 onwards-0.023
 it-0.021
 fright-0.021
Token in
Token ID287
Feature activation-1.70e+00

Pos logit contributions
keeping+1.277
 ever+1.259
unts+1.203
 fright+1.158
 onwards+1.141
Neg logit contributions
aman-1.828
 hey-1.744
land-1.715
 extingu-1.629
 whose-1.626
Token.
Token ID13
Feature activation+9.61e-02

Pos logit contributions
 ever+0.310
keeping+0.306
 onwards+0.300
 fright+0.278
unts+0.267
Neg logit contributions
aman-0.473
 hey-0.464
land-0.453
 whose-0.431
 exception-0.431
Token Fl
Token ID1610
Feature activation+5.55e-01

Pos logit contributions
aman+0.139
 hey+0.136
land+0.130
 exception+0.123
 whose+0.121
Neg logit contributions
 ever-0.124
keeping-0.120
 onwards-0.118
 it-0.111
 fright-0.111
Tokenuffy
Token ID15352
Feature activation-8.36e-01

Pos logit contributions
aman+0.480
 hey+0.455
land+0.415
 exception+0.380
 extingu+0.366
Neg logit contributions
 ever-1.059
 onwards-1.019
keeping-1.015
 it-0.992
 fright-0.976
Token wanted
Token ID2227
Feature activation-8.00e-01

Pos logit contributions
keeping+0.868
 onwards+0.841
 ever+0.817
 fright+0.759
unts+0.707
Neg logit contributions
 hey-1.390
aman-1.379
 whose-1.322
land-1.319
lves-1.275
Token to
Token ID284
Feature activation-5.77e-01

Pos logit contributions
 onwards+0.738
keeping+0.735
 ever+0.698
unts+0.655
 grin+0.642
Neg logit contributions
 hey-1.387
aman-1.382
land-1.347
 whose-1.328
 exception-1.282
Token ride
Token ID6594
Feature activation-3.01e+00

Pos logit contributions
 onwards+0.698
keeping+0.669
 ever+0.664
unts+0.596
 sideways+0.593
Neg logit contributions
aman-0.865
 hey-0.862
land-0.802
 exception-0.791
 extingu-0.776
Token in
Token ID287
Feature activation-9.36e-01

Pos logit contributions
keeping+4.889
unts+4.726
irk+4.184
 grin+4.119
umbs+4.042
Neg logit contributions
 whose-3.889
 where-3.463
 hey-3.443
land-3.418
 exception-3.397
Token the
Token ID262
Feature activation-4.68e-02

Pos logit contributions
 ever+1.024
keeping+0.999
 onwards+0.992
unts+0.921
 grin+0.889
Neg logit contributions
aman-1.516
 hey-1.470
land-1.430
 whose-1.417
 exception-1.344
Token je
Token ID11223
Feature activation-3.16e-01

Pos logit contributions
 ever+0.067
keeping+0.065
 onwards+0.065
 it+0.061
 fright+0.060
Neg logit contributions
aman-0.061
 hey-0.060
land-0.056
 exception-0.053
lves-0.052
Tokenep
Token ID538
Feature activation-3.99e-01

Pos logit contributions
 ever+0.516
keeping+0.495
 onwards+0.492
 it+0.477
 fright+0.466
Neg logit contributions
aman-0.360
 hey-0.349
land-0.329
 exception-0.299
lves-0.299
Token.
Token ID13
Feature activation+5.02e-01

Pos logit contributions
 ever+0.411
keeping+0.403
 onwards+0.399
 fright+0.376
 it+0.364
Neg logit contributions
aman-0.651
 hey-0.651
land-0.633
 exception-0.599
 whose-0.588
Token ball
Token ID2613
Feature activation+9.02e-01

Pos logit contributions
aman+1.245
 hey+1.224
land+1.163
 exception+1.099
 extingu+1.063
Neg logit contributions
 ever-0.998
keeping-0.935
 onwards-0.934
 it-0.921
 excited-0.894
Token,
Token ID11
Feature activation+1.14e-01

Pos logit contributions
aman+1.806
 hey+1.768
land+1.693
 exception+1.619
 extingu+1.610
Neg logit contributions
 ever-0.746
 onwards-0.676
 it-0.658
keeping-0.638
 on-0.610
Token and
Token ID290
Feature activation-1.04e-01

Pos logit contributions
aman+0.171
 hey+0.167
land+0.158
 exception+0.151
 whose+0.149
Neg logit contributions
 ever-0.143
keeping-0.139
 onwards-0.138
 it-0.128
 fright-0.127
Token she
Token ID673
Feature activation+1.45e-01

Pos logit contributions
 ever+0.119
keeping+0.118
 onwards+0.116
 fright+0.107
unts+0.105
Neg logit contributions
aman-0.163
 hey-0.161
land-0.151
 exception-0.146
 whose-0.145
Token would
Token ID561
Feature activation-4.19e-01

Pos logit contributions
aman+0.242
 hey+0.239
land+0.227
 exception+0.218
 whose+0.214
Neg logit contributions
 ever-0.154
keeping-0.149
 onwards-0.147
 it-0.137
 fright-0.136
Token roll
Token ID4836
Feature activation-2.99e+00

Pos logit contributions
keeping+0.470
 ever+0.459
 onwards+0.456
 fright+0.403
 it+0.402
Neg logit contributions
aman-0.672
 hey-0.665
land-0.622
 exception-0.611
 whose-0.597
Token it
Token ID340
Feature activation+3.52e-01

Pos logit contributions
keeping+2.621
 Amen+2.419
 grin+2.411
unts+2.277
 onwards+2.125
Neg logit contributions
 hey-5.500
aman-5.366
lves-5.364
 whose-5.206
 exception-5.138
Token all
Token ID477
Feature activation+5.21e-01

Pos logit contributions
aman+0.694
 hey+0.682
land+0.655
 exception+0.631
 extingu+0.623
Neg logit contributions
 ever-0.274
 onwards-0.253
keeping-0.252
 it-0.235
 fright-0.225
Token around
Token ID1088
Feature activation-4.00e-01

Pos logit contributions
aman+1.071
 hey+1.055
land+1.013
 exception+0.974
 extingu+0.967
Neg logit contributions
 ever-0.377
 onwards-0.341
keeping-0.335
 it-0.319
 fright-0.296
Token her
Token ID607
Feature activation+1.00e+00

Pos logit contributions
 ever+0.417
 onwards+0.415
keeping+0.405
 fright+0.378
unts+0.367
Neg logit contributions
 hey-0.654
aman-0.654
land-0.613
 whose-0.589
 exception-0.586
Token house
Token ID2156
Feature activation+4.30e-01

Pos logit contributions
aman+1.792
 hey+1.750
land+1.637
 exception+1.613
 extingu+1.582
Neg logit contributions
 ever-1.032
keeping-0.973
 onwards-0.951
 it-0.949
 excited-0.901
Token He
Token ID679
Feature activation-1.68e-01

Pos logit contributions
 ever+0.997
keeping+0.944
 onwards+0.889
 fright+0.849
unts+0.817
Neg logit contributions
aman-1.343
 hey-1.316
land-1.273
 whose-1.242
 exception-1.178
Token sniff
Token ID26300
Feature activation-1.42e+00

Pos logit contributions
keeping+0.157
 ever+0.156
 onwards+0.154
 it+0.136
 fright+0.136
Neg logit contributions
aman-0.299
 hey-0.296
land-0.281
 whose-0.273
 exception-0.272
Tokened
Token ID276
Feature activation-1.79e+00

Pos logit contributions
keeping+1.684
 ever+1.607
 onwards+1.515
 fright+1.506
 grin+1.503
Neg logit contributions
land-2.081
 hey-2.066
aman-2.041
lves-2.014
 whose-2.009
Token it
Token ID340
Feature activation-1.95e-01

Pos logit contributions
keeping+1.633
 ever+1.560
 onwards+1.554
unts+1.545
tick+1.406
Neg logit contributions
 hey-3.161
 whose-3.146
aman-3.122
land-3.069
 extingu-2.961
Token and
Token ID290
Feature activation-1.34e-02

Pos logit contributions
keeping+0.164
 ever+0.163
 onwards+0.150
 fright+0.139
unts+0.136
Neg logit contributions
aman-0.364
 hey-0.355
land-0.347
 exception-0.335
 whose-0.333
Token rolled
Token ID11686
Feature activation-2.99e+00

Pos logit contributions
 ever+0.013
 onwards+0.013
keeping+0.013
 fright+0.011
 it+0.011
Neg logit contributions
aman-0.023
 hey-0.023
land-0.022
 exception-0.021
 whose-0.021
Token it
Token ID340
Feature activation-1.20e-01

Pos logit contributions
unts+4.063
 Amen+3.785
keeping+3.633
 onwards+3.618
 grin+3.613
Neg logit contributions
 hey-3.995
 extingu-3.848
 whose-3.741
 where-3.729
lves-3.665
Token with
Token ID351
Feature activation-7.76e-01

Pos logit contributions
 ever+0.108
keeping+0.107
 onwards+0.103
 it+0.095
 fright+0.093
Neg logit contributions
aman-0.210
 hey-0.209
land-0.200
 exception-0.194
 whose-0.191
Token his
Token ID465
Feature activation-1.13e+00

Pos logit contributions
keeping+0.951
 onwards+0.937
 ever+0.926
unts+0.806
tick+0.806
Neg logit contributions
 hey-1.137
aman-1.113
land-1.037
 whose-1.023
 exception-1.016
Token paw
Token ID41827
Feature activation-1.82e-01

Pos logit contributions
keeping+1.175
 onwards+1.153
 ever+1.091
unts+1.003
tick+0.984
Neg logit contributions
aman-1.859
 hey-1.841
 whose-1.749
land-1.722
 extingu-1.692
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
 ever+0.193
keeping+0.184
 onwards+0.183
 fright+0.170
 it+0.169
Neg logit contributions
aman-0.297
 hey-0.293
land-0.284
 exception-0.270
 whose-0.269
Token was
Token ID373
Feature activation-5.45e-01

Pos logit contributions
 ever+0.213
keeping+0.213
 onwards+0.210
 fright+0.191
 it+0.186
Neg logit contributions
aman-0.319
 hey-0.315
land-0.297
 whose-0.290
 exception-0.289
Token having
Token ID1719
Feature activation-2.20e+00

Pos logit contributions
keeping+0.478
 ever+0.472
 onwards+0.447
 fright+0.409
 sideways+0.392
Neg logit contributions
 hey-0.992
aman-0.974
 exception-0.924
land-0.922
 whose-0.915
Token so
Token ID523
Feature activation-4.36e-01

Pos logit contributions
keeping+3.024
 onwards+2.857
 ever+2.761
 sideways+2.751
imp+2.702
Neg logit contributions
 hey-2.825
aman-2.764
land-2.731
 exception-2.627
 whose-2.600
Token much
Token ID881
Feature activation+3.93e-01

Pos logit contributions
keeping+0.335
 ever+0.322
 onwards+0.309
 fright+0.282
 it+0.269
Neg logit contributions
aman-0.843
 hey-0.837
land-0.806
 exception-0.791
 whose-0.777
Token fun
Token ID1257
Feature activation+2.60e-01

Pos logit contributions
aman+0.634
 hey+0.621
land+0.590
 exception+0.563
 extingu+0.555
Neg logit contributions
 ever-0.452
 onwards-0.427
keeping-0.426
 it-0.409
 fright-0.397
Token driving
Token ID5059
Feature activation-2.98e+00

Pos logit contributions
aman+0.450
 hey+0.442
land+0.422
 exception+0.405
 extingu+0.398
Neg logit contributions
 ever-0.265
keeping-0.251
 onwards-0.250
 it-0.235
 fright-0.230
Token it
Token ID340
Feature activation+9.59e-01

Pos logit contributions
 grin+3.044
unts+3.005
 Amen+2.936
keeping+2.869
 onwards+2.749
Neg logit contributions
 hey-4.921
 whose-4.874
 where-4.638
aman-4.621
land-4.610
Token around
Token ID1088
Feature activation-5.29e-01

Pos logit contributions
aman+1.974
 hey+1.934
land+1.840
 extingu+1.788
 exception+1.778
Neg logit contributions
 ever-0.758
 onwards-0.670
 it-0.663
keeping-0.625
 fright-0.598
Token when
Token ID618
Feature activation-9.27e-01

Pos logit contributions
 onwards+0.515
keeping+0.513
 ever+0.512
unts+0.458
 fright+0.454
Neg logit contributions
 hey-0.895
aman-0.893
land-0.841
 whose-0.815
 exception-0.813
Token he
Token ID339
Feature activation-4.89e-02

Pos logit contributions
keeping+1.110
 ever+1.036
 onwards+1.030
unts+0.993
 fright+0.961
Neg logit contributions
 hey-1.418
aman-1.373
land-1.258
 exception-1.251
 whose-1.243
Token suddenly
Token ID6451
Feature activation+9.71e-01

Pos logit contributions
 ever+0.056
keeping+0.056
 onwards+0.054
 fright+0.051
 it+0.050
Neg logit contributions
aman-0.077
 hey-0.076
land-0.072
 exception-0.068
 whose-0.067
Token to
Token ID284
Feature activation+1.58e+00

Pos logit contributions
aman+1.724
 hey+1.703
land+1.616
 exception+1.576
lves+1.557
Neg logit contributions
 ever-0.581
 onwards-0.517
keeping-0.497
 it-0.496
 excited-0.448
Token get
Token ID651
Feature activation+1.36e-01

Pos logit contributions
aman+2.647
land+2.525
 hey+2.500
 exception+2.369
der+2.290
Neg logit contributions
 ever-1.865
 it-1.758
keeping-1.742
 fright-1.705
 excited-1.600
Token Danny
Token ID15105
Feature activation+6.17e-01

Pos logit contributions
aman+0.236
 hey+0.234
land+0.221
 exception+0.213
 extingu+0.211
Neg logit contributions
 ever-0.136
 onwards-0.129
keeping-0.129
 it-0.119
 fright-0.118
Token out
Token ID503
Feature activation-2.26e-01

Pos logit contributions
aman+1.066
 hey+1.043
land+0.988
 exception+0.950
 extingu+0.931
Neg logit contributions
 ever-0.649
 onwards-0.609
keeping-0.606
 it-0.583
 fright-0.558
Token of
Token ID286
Feature activation+3.53e-01

Pos logit contributions
 ever+0.172
keeping+0.168
 onwards+0.160
 it+0.146
 fright+0.144
Neg logit contributions
aman-0.437
 hey-0.429
land-0.418
 exception-0.401
 extingu-0.400
Token the
Token ID262
Feature activation+1.64e+00

Pos logit contributions
aman+0.658
 hey+0.646
land+0.619
 exception+0.595
 extingu+0.586
Neg logit contributions
 ever-0.316
keeping-0.295
 onwards-0.295
 it-0.277
 fright-0.268
Token hole
Token ID7604
Feature activation+3.02e-01

Pos logit contributions
aman+3.159
land+3.093
 hey+3.089
 exception+2.886
 extingu+2.825
Neg logit contributions
 ever-1.410
 it-1.371
 onwards-1.326
 fright-1.300
keeping-1.276
Token.
Token ID13
Feature activation+7.32e-01

Pos logit contributions
aman+0.547
 hey+0.537
land+0.514
 exception+0.494
 extingu+0.487
Neg logit contributions
 ever-0.281
keeping-0.264
 onwards-0.264
 it-0.248
 fright-0.241
Token Danny
Token ID15105
Feature activation-2.62e-01

Pos logit contributions
aman+1.374
 hey+1.362
land+1.293
 exception+1.252
 extingu+1.228
Neg logit contributions
 ever-0.635
 onwards-0.610
keeping-0.601
 it-0.574
 excited-0.539
Token was
Token ID373
Feature activation+4.46e-02

Pos logit contributions
 ever+0.248
keeping+0.248
 onwards+0.241
 fright+0.222
 it+0.217
Neg logit contributions
aman-0.457
 hey-0.449
land-0.430
 whose-0.410
 extingu-0.406
Token so
Token ID523
Feature activation-7.34e-01

Pos logit contributions
aman+0.086
 hey+0.085
land+0.081
 exception+0.078
 extingu+0.077
Neg logit contributions
 ever-0.036
keeping-0.034
 onwards-0.034
 it-0.031
 fright-0.030
Token but
Token ID475
Feature activation+9.21e-02

Pos logit contributions
aman+0.750
 hey+0.732
land+0.695
 exception+0.662
 extingu+0.651
Neg logit contributions
 ever-0.570
 onwards-0.535
keeping-0.534
 it-0.519
 fright-0.503
Token sometimes
Token ID3360
Feature activation-3.87e-01

Pos logit contributions
aman+0.144
 hey+0.142
land+0.134
 exception+0.129
 whose+0.126
Neg logit contributions
 ever-0.107
keeping-0.104
 onwards-0.103
 fright-0.097
 it-0.096
Token he
Token ID339
Feature activation+2.74e-01

Pos logit contributions
keeping+0.449
 ever+0.440
 onwards+0.431
 fright+0.402
unts+0.392
Neg logit contributions
aman-0.592
 hey-0.592
land-0.558
 exception-0.542
 whose-0.536
Token did
Token ID750
Feature activation+1.50e+00

Pos logit contributions
aman+0.427
 hey+0.419
land+0.397
 exception+0.380
 extingu+0.372
Neg logit contributions
 ever-0.325
keeping-0.311
 onwards-0.311
 it-0.295
 fright-0.289
Token stupid
Token ID8531
Feature activation+2.90e-01

Pos logit contributions
aman+3.102
 hey+2.927
 extingu+2.878
land+2.836
lves+2.682
Neg logit contributions
 it-1.246
 ever-1.230
 onwards-1.104
 fright-1.032
 excited-1.010
Token things
Token ID1243
Feature activation+1.66e+00

Pos logit contributions
aman+0.503
 hey+0.494
land+0.472
 exception+0.452
 extingu+0.445
Neg logit contributions
 ever-0.297
keeping-0.280
 onwards-0.279
 it-0.263
 fright-0.257
Token.
Token ID13
Feature activation+5.49e-01

Pos logit contributions
aman+3.289
 hey+3.154
land+2.989
 Majesty+2.910
 exception+2.876
Neg logit contributions
 ever-1.567
 it-1.439
 onwards-1.336
 on-1.326
keeping-1.259
Token One
Token ID1881
Feature activation-4.24e-01

Pos logit contributions
aman+0.841
 hey+0.827
land+0.780
 exception+0.740
 extingu+0.735
Neg logit contributions
 ever-0.671
 onwards-0.642
keeping-0.637
 it-0.620
 fright-0.597
Token day
Token ID1110
Feature activation+6.01e-01

Pos logit contributions
keeping+0.389
 onwards+0.366
 ever+0.363
 fright+0.326
 it+0.317
Neg logit contributions
aman-0.774
 hey-0.760
land-0.728
 exception-0.719
 whose-0.712
Token,
Token ID11
Feature activation+7.93e-01

Pos logit contributions
aman+1.172
 hey+1.147
land+1.110
 exception+1.062
 extingu+1.048
Neg logit contributions
 ever-0.503
 onwards-0.465
 it-0.446
keeping-0.443
 fright-0.408
Token he
Token ID339
Feature activation-2.64e-01

Pos logit contributions
aman+1.511
 hey+1.468
land+1.448
 exception+1.355
 extingu+1.345
Neg logit contributions
 ever-0.712
 it-0.665
 onwards-0.647
keeping-0.618
 excited-0.597
Token.
Token ID13
Feature activation-2.50e-02

Pos logit contributions
 ever+0.081
keeping+0.078
 onwards+0.077
 fright+0.072
 it+0.071
Neg logit contributions
aman-0.132
 hey-0.129
land-0.126
 exception-0.120
 whose-0.117
Token<|endoftext|>
Token ID50256
Feature activation-8.49e-01

Pos logit contributions
 ever+0.024
keeping+0.023
 onwards+0.023
 fright+0.021
 it+0.021
Neg logit contributions
aman-0.044
 hey-0.043
land-0.041
 exception-0.039
 whose-0.039
TokenS
Token ID50
Feature activation+5.13e-01

Pos logit contributions
 ever+0.976
keeping+0.938
 onwards+0.914
 fright+0.902
 sideways+0.879
Neg logit contributions
aman-1.296
land-1.247
 hey-1.236
 whose-1.149
 extingu-1.113
Tokenhen
Token ID831
Feature activation-1.68e-01

Pos logit contributions
aman+0.711
 hey+0.705
land+0.662
 exception+0.633
 extingu+0.624
Neg logit contributions
 ever-0.690
 onwards-0.658
keeping-0.653
 it-0.634
 fright-0.616
Token was
Token ID373
Feature activation-5.11e-02

Pos logit contributions
 ever+0.163
keeping+0.154
 onwards+0.154
 fright+0.149
 sideways+0.138
Neg logit contributions
aman-0.298
 hey-0.291
land-0.280
 exception-0.267
 whose-0.266
Token playing
Token ID2712
Feature activation+1.82e+00

Pos logit contributions
 ever+0.060
keeping+0.058
 onwards+0.056
 it+0.054
 fright+0.054
Neg logit contributions
aman-0.080
 hey-0.078
land-0.074
 exception-0.072
 whose-0.070
Token in
Token ID287
Feature activation+1.07e-01

Pos logit contributions
aman+4.023
 hey+3.801
land+3.798
 exception+3.574
lves+3.572
Neg logit contributions
 it-1.219
 ever-1.215
 on-1.141
 excited-1.082
 onwards-1.072
Token the
Token ID262
Feature activation+1.55e+00

Pos logit contributions
aman+0.193
 hey+0.189
land+0.181
 exception+0.174
 extingu+0.172
Neg logit contributions
 ever-0.102
 onwards-0.096
keeping-0.095
 it-0.087
 fright-0.087
Token garden
Token ID11376
Feature activation+1.73e+00

Pos logit contributions
aman+2.001
 hey+1.985
land+1.888
 extingu+1.761
der+1.744
Neg logit contributions
 ever-2.228
 it-2.222
keeping-2.154
 onwards-2.151
 excited-2.139
Token when
Token ID618
Feature activation-7.63e-01

Pos logit contributions
aman+3.505
 hey+3.449
land+3.270
lves+3.217
 extingu+3.210
Neg logit contributions
 ever-1.443
 it-1.333
 excited-1.302
 onwards-1.278
 on-1.239
Token he
Token ID339
Feature activation+1.52e-02

Pos logit contributions
keeping+0.935
 ever+0.921
 onwards+0.885
 fright+0.857
unts+0.854
Neg logit contributions
aman-1.133
 hey-1.123
land-1.018
 exception-1.000
 whose-0.995
Token friendly
Token ID8030
Feature activation-3.77e-01

Pos logit contributions
keeping+0.747
 ever+0.739
 onwards+0.716
 it+0.651
 sideways+0.641
Neg logit contributions
aman-0.869
 hey-0.866
land-0.804
 exception-0.761
 extingu-0.761
Token bird
Token ID6512
Feature activation+6.69e-01

Pos logit contributions
 ever+0.462
 onwards+0.457
keeping+0.453
 sideways+0.405
 it+0.405
Neg logit contributions
aman-0.568
 hey-0.555
land-0.527
 extingu-0.499
 exception-0.498
Token flew
Token ID13112
Feature activation-3.80e-01

Pos logit contributions
aman+1.237
 hey+1.212
land+1.158
 exception+1.121
 extingu+1.103
Neg logit contributions
 ever-0.623
 onwards-0.563
keeping-0.562
 it-0.549
 fright-0.525
Token by
Token ID416
Feature activation-1.62e-01

Pos logit contributions
 ever+0.407
keeping+0.402
 onwards+0.385
 it+0.354
unts+0.354
Neg logit contributions
aman-0.634
 hey-0.615
land-0.584
 exception-0.560
 extingu-0.557
Token and
Token ID290
Feature activation+6.97e-01

Pos logit contributions
 ever+0.119
 onwards+0.116
keeping+0.113
 it+0.099
 fright+0.098
Neg logit contributions
aman-0.322
 hey-0.316
land-0.306
 extingu-0.292
 whose-0.291
Token saw
Token ID2497
Feature activation+1.66e+00

Pos logit contributions
aman+1.280
 hey+1.251
land+1.209
 exception+1.161
 extingu+1.136
Neg logit contributions
 ever-0.660
 onwards-0.604
keeping-0.600
 it-0.598
 fright-0.560
Token the
Token ID262
Feature activation+3.27e-01

Pos logit contributions
aman+3.578
 hey+3.482
land+3.401
 exception+3.352
 extingu+3.222
Neg logit contributions
 it-1.185
 ever-1.127
keeping-0.974
 fright-0.916
 onwards-0.911
Token van
Token ID5719
Feature activation+1.91e-01

Pos logit contributions
aman+0.533
 hey+0.522
land+0.497
 exception+0.475
 extingu+0.467
Neg logit contributions
 ever-0.367
keeping-0.347
 onwards-0.347
 it-0.330
 fright-0.322
Token hanging
Token ID10938
Feature activation-1.20e+00

Pos logit contributions
aman+0.337
 hey+0.331
land+0.317
 exception+0.303
 extingu+0.299
Neg logit contributions
 ever-0.185
keeping-0.176
 onwards-0.175
 it-0.163
 fright-0.159
Token off
Token ID572
Feature activation-6.63e-01

Pos logit contributions
keeping+1.189
 ever+1.155
 onwards+1.141
unts+1.089
 fright+1.081
Neg logit contributions
aman-2.049
 hey-1.933
land-1.933
 extingu-1.888
 whose-1.811
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
 ever+0.626
keeping+0.617
 onwards+0.604
 fright+0.554
tick+0.550
Neg logit contributions
aman-1.155
 hey-1.109
land-1.090
 whose-1.027
 extingu-1.023
Token boy
Token ID2933
Feature activation+4.29e-02

Pos logit contributions
aman+1.136
 hey+1.096
land+1.049
 exception+0.998
 extingu+0.979
Neg logit contributions
 ever-0.870
keeping-0.822
 onwards-0.817
 it-0.799
 excited-0.776
Token decided
Token ID3066
Feature activation-1.49e-01

Pos logit contributions
aman+0.079
 hey+0.078
land+0.075
 exception+0.072
 whose+0.071
Neg logit contributions
 ever-0.038
 onwards-0.036
keeping-0.036
 it-0.033
 fright-0.033
Token to
Token ID284
Feature activation-3.00e-01

Pos logit contributions
 ever+0.118
keeping+0.112
 onwards+0.111
 fright+0.099
unts+0.098
Neg logit contributions
aman-0.289
 hey-0.285
land-0.277
 exception-0.263
 whose-0.261
Token go
Token ID467
Feature activation-6.39e-01

Pos logit contributions
 ever+0.322
 onwards+0.314
keeping+0.309
 sideways+0.286
 it+0.283
Neg logit contributions
aman-0.494
 hey-0.483
land-0.460
 exception-0.442
 extingu-0.432
Token to
Token ID284
Feature activation+7.34e-01

Pos logit contributions
 ever+0.618
keeping+0.615
 onwards+0.579
unts+0.552
 fright+0.550
Neg logit contributions
aman-1.089
 hey-1.077
land-1.076
 exception-1.034
 extingu-1.002
Token the
Token ID262
Feature activation+2.03e+00

Pos logit contributions
aman+1.398
 hey+1.379
land+1.312
 exception+1.261
 extingu+1.250
Neg logit contributions
 ever-0.633
keeping-0.589
 it-0.588
 onwards-0.576
 fright-0.542
Token reef
Token ID25088
Feature activation+6.17e-01

Pos logit contributions
aman+2.522
 hey+2.445
land+2.253
 extingu+2.234
ist+2.207
Neg logit contributions
 ever-3.111
 it-3.055
keeping-2.901
 sideways-2.893
 onwards-2.850
Token.
Token ID13
Feature activation+4.87e-01

Pos logit contributions
aman+1.204
 hey+1.190
land+1.127
 exception+1.089
 extingu+1.080
Neg logit contributions
 ever-0.503
 onwards-0.458
keeping-0.457
 it-0.436
 excited-0.407
Token When
Token ID1649
Feature activation-8.39e-01

Pos logit contributions
aman+0.966
 hey+0.953
land+0.912
 exception+0.879
 extingu+0.868
Neg logit contributions
 ever-0.376
 onwards-0.351
keeping-0.349
 it-0.329
 fright-0.311
Token he
Token ID339
Feature activation+1.01e+00

Pos logit contributions
keeping+1.047
 ever+1.015
unts+0.972
 fright+0.967
 onwards+0.964
Neg logit contributions
aman-1.254
 hey-1.230
land-1.120
 exception-1.082
 whose-1.065
Token got
Token ID1392
Feature activation-1.65e+00

Pos logit contributions
aman+1.765
 hey+1.754
land+1.645
 exception+1.596
 extingu+1.573
Neg logit contributions
 ever-1.087
 onwards-0.991
 it-0.984
keeping-0.937
 excited-0.925
Token Peter
Token ID5613
Feature activation+4.21e-01

Pos logit contributions
aman+1.954
 hey+1.926
land+1.846
 exception+1.759
 extingu+1.752
Neg logit contributions
 ever-0.864
 onwards-0.848
 it-0.830
keeping-0.825
 excited-0.775
Token hopped
Token ID45230
Feature activation-1.10e+00

Pos logit contributions
aman+0.759
 hey+0.745
land+0.711
 exception+0.684
 extingu+0.675
Neg logit contributions
 ever-0.401
 onwards-0.374
keeping-0.373
 it-0.355
 fright-0.341
Token around
Token ID1088
Feature activation+3.16e-02

Pos logit contributions
keeping+1.112
 ever+1.063
unts+1.004
 onwards+0.954
 fright+0.947
Neg logit contributions
aman-1.872
 hey-1.812
land-1.785
 exception-1.722
 extingu-1.689
Token,
Token ID11
Feature activation+7.81e-01

Pos logit contributions
aman+0.058
 hey+0.058
land+0.055
 exception+0.053
 extingu+0.053
Neg logit contributions
 ever-0.027
keeping-0.026
 onwards-0.026
 it-0.023
 fright-0.023
Token marking
Token ID18730
Feature activation+5.03e-01

Pos logit contributions
aman+1.389
 hey+1.365
land+1.302
 exception+1.250
 extingu+1.242
Neg logit contributions
 ever-0.777
 onwards-0.730
 it-0.712
keeping-0.710
 excited-0.684
Token trees
Token ID7150
Feature activation+6.75e-01

Pos logit contributions
aman+0.931
 hey+0.916
land+0.872
 exception+0.840
 extingu+0.827
Neg logit contributions
 ever-0.456
 onwards-0.428
keeping-0.422
 it-0.409
 fright-0.386
Token and
Token ID290
Feature activation+6.72e-02

Pos logit contributions
aman+1.396
 hey+1.378
land+1.308
 exception+1.274
 extingu+1.250
Neg logit contributions
 ever-0.481
 onwards-0.435
keeping-0.423
 it-0.412
 excited-0.376
Token flowers
Token ID12734
Feature activation+7.50e-01

Pos logit contributions
aman+0.109
 hey+0.108
land+0.103
 exception+0.098
 extingu+0.096
Neg logit contributions
 ever-0.074
 onwards-0.070
keeping-0.070
 it-0.066
 fright-0.065
Token until
Token ID1566
Feature activation-2.96e-01

Pos logit contributions
aman+1.442
 hey+1.415
land+1.334
 exception+1.299
 extingu+1.273
Neg logit contributions
 ever-0.660
 onwards-0.605
keeping-0.591
 it-0.582
 excited-0.540
Token he
Token ID339
Feature activation+4.59e-01

Pos logit contributions
 ever+0.326
keeping+0.324
 onwards+0.309
 fright+0.297
unts+0.294
Neg logit contributions
aman-0.473
 hey-0.468
land-0.438
 exception-0.421
 whose-0.411
Token was
Token ID373
Feature activation+4.14e-01

Pos logit contributions
aman+0.844
 hey+0.829
land+0.792
 exception+0.764
 extingu+0.754
Neg logit contributions
 ever-0.429
 onwards-0.395
keeping-0.393
 it-0.376
 fright-0.359
Token a
Token ID257
Feature activation+1.42e-01

Pos logit contributions
 ever+0.440
keeping+0.437
 onwards+0.436
 fright+0.399
 sideways+0.395
Neg logit contributions
aman-0.591
 hey-0.578
land-0.551
 exception-0.524
 whose-0.515
Token job
Token ID1693
Feature activation+3.14e-01

Pos logit contributions
aman+0.226
 hey+0.221
land+0.210
 exception+0.201
 extingu+0.198
Neg logit contributions
 ever-0.163
keeping-0.158
 onwards-0.156
 it-0.146
 fright-0.143
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
aman+0.559
 hey+0.549
land+0.525
 exception+0.504
 extingu+0.497
Neg logit contributions
 ever-0.302
keeping-0.284
 onwards-0.284
 it-0.268
 fright-0.260
Token She
Token ID1375
Feature activation-1.87e-01

Pos logit contributions
 ever+0.175
keeping+0.163
 onwards+0.161
 fright+0.148
 it+0.147
Neg logit contributions
aman-0.319
 hey-0.314
land-0.298
 extingu-0.287
 exception-0.287
Token works
Token ID2499
Feature activation-4.90e-01

Pos logit contributions
keeping+0.218
 ever+0.218
 onwards+0.213
 it+0.196
 fright+0.195
Neg logit contributions
aman-0.292
 hey-0.290
land-0.272
 exception-0.262
 extingu-0.256
Token at
Token ID379
Feature activation+9.21e-01

Pos logit contributions
keeping+0.415
 ever+0.400
 onwards+0.369
 fright+0.360
 it+0.343
Neg logit contributions
aman-0.919
 hey-0.914
land-0.887
 exception-0.854
 whose-0.835
Token a
Token ID257
Feature activation+7.38e-01

Pos logit contributions
aman+1.685
 hey+1.661
land+1.598
 exception+1.528
lves+1.502
Neg logit contributions
 ever-0.877
keeping-0.819
 it-0.804
 onwards-0.784
 excited-0.730
Token big
Token ID1263
Feature activation-9.38e-01

Pos logit contributions
aman+1.099
 hey+1.080
land+1.008
 exception+0.966
 extingu+0.961
Neg logit contributions
 ever-0.934
 onwards-0.878
keeping-0.868
 it-0.868
 fright-0.836
Token store
Token ID3650
Feature activation-6.20e-01

Pos logit contributions
 onwards+1.446
 ever+1.436
keeping+1.430
 fright+1.276
unts+1.264
Neg logit contributions
aman-1.164
 hey-1.111
land-1.039
 exception-0.986
 extingu-0.981
Token.
Token ID13
Feature activation+2.72e-01

Pos logit contributions
 ever+0.643
 onwards+0.641
keeping+0.622
 fright+0.616
 it+0.572
Neg logit contributions
aman-1.022
 hey-1.001
land-0.979
 whose-0.920
 extingu-0.912
Token She
Token ID1375
Feature activation+3.08e-01

Pos logit contributions
aman+0.507
 hey+0.498
land+0.477
 exception+0.458
 extingu+0.453
Neg logit contributions
 ever-0.244
keeping-0.228
 onwards-0.227
 it-0.212
 fright-0.206
Token's
Token ID338
Feature activation-7.67e-01

Pos logit contributions
 ever+0.220
keeping+0.215
 onwards+0.213
 fright+0.195
 it+0.195
Neg logit contributions
aman-0.344
 hey-0.337
land-0.321
 exception-0.305
 whose-0.305
Token mom
Token ID1995
Feature activation-1.61e-02

Pos logit contributions
keeping+0.818
 ever+0.802
 onwards+0.802
 it+0.710
 sideways+0.704
Neg logit contributions
 hey-1.290
aman-1.264
land-1.176
 exception-1.143
 whose-1.131
Token saw
Token ID2497
Feature activation+1.08e+00

Pos logit contributions
 ever+0.017
keeping+0.017
 onwards+0.017
 it+0.015
 fright+0.015
Neg logit contributions
aman-0.027
 hey-0.027
land-0.025
 exception-0.024
 whose-0.024
Token what
Token ID644
Feature activation-6.55e-01

Pos logit contributions
aman+2.006
 hey+1.972
land+1.937
 exception+1.858
 extingu+1.792
Neg logit contributions
 ever-0.949
 it-0.949
keeping-0.880
 onwards-0.861
 fright-0.818
Token he
Token ID339
Feature activation-6.62e-01

Pos logit contributions
keeping+0.809
 ever+0.778
 onwards+0.775
 fright+0.723
 sideways+0.708
Neg logit contributions
aman-0.978
 hey-0.963
 whose-0.889
land-0.882
 exception-0.868
Token had
Token ID550
Feature activation+5.98e-01

Pos logit contributions
keeping+0.655
 onwards+0.644
 ever+0.626
 fright+0.572
 it+0.555
Neg logit contributions
aman-1.155
 hey-1.135
land-1.080
 whose-1.051
 exception-1.041
Token done
Token ID1760
Feature activation-7.05e-01

Pos logit contributions
aman+1.180
 hey+1.163
land+1.112
 exception+1.075
 extingu+1.069
Neg logit contributions
 ever-0.481
keeping-0.436
 onwards-0.433
 it-0.412
 fright-0.385
Token and
Token ID290
Feature activation+9.68e-01

Pos logit contributions
 ever+0.568
keeping+0.553
 onwards+0.548
 fright+0.537
unts+0.498
Neg logit contributions
aman-1.322
 hey-1.281
land-1.270
 exception-1.206
 whose-1.197
Token said
Token ID531
Feature activation+9.82e-03

Pos logit contributions
aman+1.885
 hey+1.832
land+1.800
 exception+1.723
 extingu+1.704
Neg logit contributions
 ever-0.816
 it-0.762
 onwards-0.729
 excited-0.706
keeping-0.706
Token,
Token ID11
Feature activation-9.41e-02

Pos logit contributions
aman+0.017
 hey+0.017
land+0.016
 exception+0.016
 whose+0.016
Neg logit contributions
 ever-0.009
keeping-0.009
 onwards-0.009
 fright-0.008
 it-0.008
Token "
Token ID366
Feature activation+3.27e-01

Pos logit contributions
 ever+0.123
keeping+0.119
 onwards+0.117
 fright+0.110
 it+0.108
Neg logit contributions
aman-0.136
 hey-0.133
land-0.125
 whose-0.118
 extingu-0.117
Token realized
Token ID6939
Feature activation+9.26e-01

Pos logit contributions
aman+3.415
land+3.223
 hey+3.191
 exception+3.068
 extingu+3.029
Neg logit contributions
 it-1.486
 ever-1.473
 sideways-1.309
 excited-1.302
 onwards-1.297
Token it
Token ID340
Feature activation+3.95e-01

Pos logit contributions
aman+1.939
 hey+1.907
land+1.823
 exception+1.771
 extingu+1.762
Neg logit contributions
 ever-0.644
 it-0.609
 onwards-0.584
keeping-0.561
 fright-0.505
Token was
Token ID373
Feature activation+8.23e-01

Pos logit contributions
aman+0.716
 hey+0.703
land+0.672
 exception+0.645
 extingu+0.637
Neg logit contributions
 ever-0.376
 onwards-0.350
keeping-0.349
 it-0.332
 fright-0.320
Token a
Token ID257
Feature activation+7.22e-01

Pos logit contributions
aman+1.468
 hey+1.435
land+1.363
 exception+1.298
 extingu+1.293
Neg logit contributions
 ever-0.854
 onwards-0.779
 it-0.773
keeping-0.756
 fright-0.725
Token giant
Token ID6175
Feature activation-4.52e-02

Pos logit contributions
aman+1.114
 hey+1.086
land+1.027
 exception+0.977
 extingu+0.958
Neg logit contributions
 ever-0.895
 onwards-0.830
keeping-0.824
 it-0.818
 fright-0.800
Token whale
Token ID22206
Feature activation+5.18e-01

Pos logit contributions
 ever+0.053
keeping+0.052
 onwards+0.052
 it+0.048
 fright+0.048
Neg logit contributions
aman-0.069
 hey-0.068
land-0.065
 exception-0.061
 extingu-0.061
Token!
Token ID0
Feature activation-1.90e-01

Pos logit contributions
aman+0.919
 hey+0.900
land+0.857
 exception+0.825
 extingu+0.812
Neg logit contributions
 ever-0.516
 onwards-0.476
keeping-0.476
 it-0.456
 fright-0.438
Token He
Token ID679
Feature activation-4.30e-01

Pos logit contributions
 ever+0.179
keeping+0.165
 onwards+0.162
 it+0.151
 fright+0.150
Neg logit contributions
aman-0.345
 hey-0.337
land-0.323
 exception-0.308
 whose-0.307
Token was
Token ID373
Feature activation+3.24e-01

Pos logit contributions
keeping+0.437
 onwards+0.421
 ever+0.421
 it+0.371
 fright+0.370
Neg logit contributions
aman-0.721
 hey-0.715
land-0.688
 whose-0.662
 exception-0.650
Token scared
Token ID12008
Feature activation+7.01e-01

Pos logit contributions
aman+0.641
 hey+0.630
land+0.605
 exception+0.583
 extingu+0.576
Neg logit contributions
 ever-0.252
keeping-0.233
 onwards-0.232
 it-0.215
 fright-0.207
Token,
Token ID11
Feature activation-8.15e-01

Pos logit contributions
aman+1.430
 hey+1.402
land+1.343
 exception+1.297
 extingu+1.291
Neg logit contributions
 ever-0.527
 onwards-0.473
 it-0.458
keeping-0.454
 fright-0.420
Token a
Token ID257
Feature activation+1.27e-01

Pos logit contributions
aman+1.040
 hey+1.023
land+0.976
 exception+0.951
 extingu+0.932
Neg logit contributions
 ever-0.358
 onwards-0.325
keeping-0.324
 it-0.305
 fright-0.284
Token guitar
Token ID10047
Feature activation+4.61e-01

Pos logit contributions
aman+0.175
 hey+0.170
land+0.162
 exception+0.152
 extingu+0.150
Neg logit contributions
 ever-0.174
keeping-0.167
 onwards-0.167
 it-0.159
 fright-0.156
Token,
Token ID11
Feature activation+6.33e-01

Pos logit contributions
aman+0.879
 hey+0.864
land+0.823
 exception+0.797
 extingu+0.783
Neg logit contributions
 ever-0.391
keeping-0.362
 onwards-0.361
 it-0.340
 fright-0.323
Token a
Token ID257
Feature activation+1.77e-01

Pos logit contributions
aman+1.320
 hey+1.299
land+1.233
 exception+1.209
lves+1.182
Neg logit contributions
 ever-0.435
 onwards-0.393
keeping-0.390
 it-0.373
 excited-0.342
Token dog
Token ID3290
Feature activation+6.99e-01

Pos logit contributions
aman+0.246
 hey+0.240
land+0.228
 exception+0.215
 extingu+0.212
Neg logit contributions
 ever-0.238
keeping-0.228
 onwards-0.227
 it-0.217
 fright-0.213
Token,
Token ID11
Feature activation+7.37e-01

Pos logit contributions
aman+1.364
 hey+1.338
land+1.266
 exception+1.235
 extingu+1.207
Neg logit contributions
 ever-0.579
keeping-0.528
 onwards-0.527
 it-0.508
 fright-0.465
Token a
Token ID257
Feature activation+2.58e-01

Pos logit contributions
aman+1.495
 hey+1.469
land+1.391
 exception+1.366
lves+1.334
Neg logit contributions
 ever-0.560
 onwards-0.502
keeping-0.499
 it-0.484
 excited-0.449
Token cat
Token ID3797
Feature activation+5.15e-01

Pos logit contributions
aman+0.359
 hey+0.350
land+0.332
 exception+0.314
 extingu+0.308
Neg logit contributions
 ever-0.349
keeping-0.334
 onwards-0.333
 it-0.319
 fright-0.313
Token,
Token ID11
Feature activation+8.85e-01

Pos logit contributions
aman+0.996
 hey+0.978
land+0.932
 exception+0.903
 extingu+0.886
Neg logit contributions
 ever-0.427
keeping-0.394
 onwards-0.393
 it-0.372
 fright-0.350
Token and
Token ID290
Feature activation+7.82e-01

Pos logit contributions
aman+1.814
 hey+1.783
land+1.682
 exception+1.663
lves+1.622
Neg logit contributions
 ever-0.665
 onwards-0.587
keeping-0.584
 it-0.574
 excited-0.530
Token a
Token ID257
Feature activation+2.06e-01

Pos logit contributions
aman+1.294
 hey+1.251
land+1.187
 exception+1.140
lves+1.112
Neg logit contributions
 ever-0.897
 onwards-0.833
keeping-0.831
 it-0.819
 excited-0.781
Token mom
Token ID1995
Feature activation+8.16e-01

Pos logit contributions
keeping+0.560
 ever+0.559
 onwards+0.553
 it+0.514
 fright+0.505
Neg logit contributions
 hey-0.666
aman-0.652
land-0.617
 exception-0.601
 extingu-0.589
Token to
Token ID284
Feature activation-5.60e-02

Pos logit contributions
aman+1.323
 hey+1.288
land+1.214
lves+1.162
 exception+1.157
Neg logit contributions
 ever-0.953
 onwards-0.890
 it-0.880
keeping-0.870
 excited-0.832
Token fill
Token ID6070
Feature activation+2.00e-01

Pos logit contributions
 ever+0.069
keeping+0.067
 onwards+0.067
 it+0.062
 fright+0.061
Neg logit contributions
aman-0.084
 hey-0.083
land-0.077
 exception-0.074
 whose-0.073
Token his
Token ID465
Feature activation-4.46e-01

Pos logit contributions
aman+0.335
 hey+0.327
land+0.312
 exception+0.301
 extingu+0.296
Neg logit contributions
 ever-0.216
 onwards-0.204
keeping-0.203
 it-0.189
 fright-0.187
Token brain
Token ID3632
Feature activation-2.03e-01

Pos logit contributions
 ever+0.599
 onwards+0.572
keeping+0.562
 it+0.538
 fright+0.533
Neg logit contributions
aman-0.618
 hey-0.602
land-0.580
 exception-0.550
 whose-0.549
Token with
Token ID351
Feature activation-2.28e-01

Pos logit contributions
 ever+0.176
keeping+0.166
 onwards+0.164
 fright+0.151
 it+0.150
Neg logit contributions
aman-0.372
 hey-0.364
land-0.356
 exception-0.345
 whose-0.338
Token knowledge
Token ID3725
Feature activation+1.50e-01

Pos logit contributions
 ever+0.286
keeping+0.275
 onwards+0.275
 fright+0.254
 it+0.250
Neg logit contributions
aman-0.333
 hey-0.325
land-0.310
 exception-0.298
 extingu-0.294
Token.
Token ID13
Feature activation-2.72e-02

Pos logit contributions
aman+0.265
 hey+0.259
land+0.251
 exception+0.240
 extingu+0.236
Neg logit contributions
 ever-0.145
keeping-0.137
 onwards-0.137
 it-0.128
 fright-0.126
Token
Token ID220
Feature activation+4.93e-01

Pos logit contributions
 ever+0.029
keeping+0.027
 onwards+0.027
 fright+0.025
 it+0.025
Neg logit contributions
aman-0.046
 hey-0.045
land-0.043
 exception-0.041
 whose-0.040
Token
Token ID198
Feature activation-4.17e-01

Pos logit contributions
aman+0.912
 hey+0.898
land+0.857
 exception+0.825
 extingu+0.816
Neg logit contributions
 ever-0.441
 onwards-0.414
keeping-0.410
 it-0.393
 fright-0.372
Token
Token ID198
Feature activation-2.85e-01

Pos logit contributions
 ever+0.446
keeping+0.430
 onwards+0.416
 fright+0.391
 sideways+0.385
Neg logit contributions
aman-0.703
 hey-0.667
land-0.656
 exception-0.618
 whose-0.616
Token da
Token ID12379
Feature activation-3.92e-01

Pos logit contributions
keeping+0.919
 onwards+0.913
 ever+0.910
unts+0.837
 sideways+0.822
Neg logit contributions
aman-1.105
 hey-1.049
land-1.013
 whose-0.962
 extingu-0.941
Tokenisy
Token ID13560
Feature activation-5.09e-02

Pos logit contributions
 ever+0.714
keeping+0.714
 onwards+0.704
 it+0.676
 sideways+0.668
Neg logit contributions
aman-0.360
 hey-0.334
land-0.323
 exception-0.278
 whose-0.267
Token and
Token ID290
Feature activation+5.36e-03

Pos logit contributions
 ever+0.044
keeping+0.042
 onwards+0.041
 it+0.038
 fright+0.037
Neg logit contributions
aman-0.095
 hey-0.092
land-0.090
 exception-0.086
 whose-0.085
Token tw
Token ID665
Feature activation-2.14e-01

Pos logit contributions
aman+0.009
 hey+0.009
land+0.008
 exception+0.008
 extingu+0.008
Neg logit contributions
 ever-0.006
keeping-0.006
 onwards-0.005
 fright-0.005
 it-0.005
Tokenirled
Token ID49376
Feature activation-1.17e+00

Pos logit contributions
 ever+0.251
keeping+0.240
 onwards+0.235
 fright+0.224
 it+0.221
Neg logit contributions
aman-0.345
 hey-0.329
land-0.315
 exception-0.302
 whose-0.298
Token around
Token ID1088
Feature activation-6.37e-01

Pos logit contributions
keeping+1.028
 ever+0.978
unts+0.912
 onwards+0.901
 fright+0.850
Neg logit contributions
aman-2.099
 hey-2.093
land-2.054
 exception-1.970
 whose-1.928
Token with
Token ID351
Feature activation-1.27e+00

Pos logit contributions
keeping+0.626
 ever+0.617
 onwards+0.609
unts+0.568
 fright+0.554
Neg logit contributions
 hey-1.065
aman-1.057
land-1.013
 whose-0.977
 extingu-0.974
Token it
Token ID340
Feature activation+7.78e-01

Pos logit contributions
keeping+1.121
 onwards+1.048
 ever+1.047
unts+0.943
tick+0.919
Neg logit contributions
aman-2.333
 hey-2.310
land-2.175
 whose-2.123
 exception-2.119
Token.
Token ID13
Feature activation+4.96e-01

Pos logit contributions
aman+1.472
 hey+1.448
land+1.363
 exception+1.318
 extingu+1.310
Neg logit contributions
 ever-0.707
 onwards-0.658
 it-0.632
keeping-0.624
 excited-0.595
Token Everyday
Token ID48154
Feature activation-5.41e-01

Pos logit contributions
aman+0.869
 hey+0.854
land+0.812
 exception+0.779
 extingu+0.771
Neg logit contributions
 ever-0.498
 onwards-0.473
keeping-0.468
 it-0.449
 fright-0.429
Token,
Token ID11
Feature activation+1.50e-01

Pos logit contributions
keeping+0.593
 ever+0.563
 fright+0.526
 onwards+0.524
unts+0.505
Neg logit contributions
aman-0.903
 hey-0.891
land-0.835
 exception-0.797
 whose-0.795
Token,
Token ID11
Feature activation+8.25e-01

Pos logit contributions
aman+1.463
 hey+1.425
land+1.374
 exception+1.324
 extingu+1.312
Neg logit contributions
 ever-0.615
 onwards-0.568
 it-0.553
keeping-0.528
 excited-0.502
Token a
Token ID257
Feature activation-5.37e-03

Pos logit contributions
aman+1.436
 hey+1.391
land+1.341
 exception+1.281
 extingu+1.268
Neg logit contributions
 ever-0.873
 it-0.815
 onwards-0.807
keeping-0.797
 excited-0.752
Token big
Token ID1263
Feature activation-1.25e+00

Pos logit contributions
 ever+0.008
keeping+0.008
 onwards+0.008
 it+0.008
 fright+0.008
Neg logit contributions
aman-0.006
 hey-0.006
land-0.006
 exception-0.005
 extingu-0.005
Token rock
Token ID3881
Feature activation+1.15e+00

Pos logit contributions
keeping+1.773
 ever+1.689
 onwards+1.669
 fright+1.468
tick+1.449
Neg logit contributions
aman-1.759
 hey-1.705
land-1.600
 exception-1.582
 extingu-1.529
Token fell
Token ID3214
Feature activation+4.57e-01

Pos logit contributions
aman+2.254
 hey+2.204
land+2.104
 exception+2.049
 extingu+2.008
Neg logit contributions
 ever-1.020
 it-0.890
 onwards-0.860
keeping-0.853
 excited-0.798
Token into
Token ID656
Feature activation-3.19e-01

Pos logit contributions
aman+0.885
 hey+0.872
land+0.834
 exception+0.804
 extingu+0.790
Neg logit contributions
 ever-0.377
 onwards-0.348
keeping-0.346
 it-0.327
 fright-0.312
Token the
Token ID262
Feature activation+2.72e-01

Pos logit contributions
 ever+0.332
 onwards+0.324
keeping+0.311
 sideways+0.284
 fright+0.282
Neg logit contributions
aman-0.538
 hey-0.524
land-0.499
 extingu-0.485
 whose-0.483
Token stream
Token ID4269
Feature activation-1.53e-02

Pos logit contributions
aman+0.399
 hey+0.390
land+0.370
 exception+0.351
 extingu+0.344
Neg logit contributions
 ever-0.351
keeping-0.335
 onwards-0.334
 it-0.320
 fright-0.313
Token.
Token ID13
Feature activation-9.93e-03

Pos logit contributions
 ever+0.014
 onwards+0.013
keeping+0.013
 it+0.012
 fright+0.012
Neg logit contributions
aman-0.028
 hey-0.027
land-0.026
 exception-0.025
 extingu-0.025
Token The
Token ID383
Feature activation+6.26e-02

Pos logit contributions
 ever+0.009
keeping+0.009
 onwards+0.008
 fright+0.008
 it+0.008
Neg logit contributions
aman-0.018
 hey-0.018
land-0.017
 exception-0.016
 whose-0.016
Token water
Token ID1660
Feature activation-7.75e-01

Pos logit contributions
aman+0.080
 hey+0.079
land+0.074
 exception+0.070
 extingu+0.068
Neg logit contributions
 ever-0.092
keeping-0.089
 onwards-0.089
 it-0.084
 fright-0.084
Token It
Token ID632
Feature activation+1.37e+00

Pos logit contributions
 ever+0.282
keeping+0.264
 onwards+0.257
 fright+0.244
 it+0.237
Neg logit contributions
aman-0.515
 hey-0.501
land-0.483
 whose-0.463
 extingu-0.459
Token is
Token ID318
Feature activation+5.97e-01

Pos logit contributions
aman+2.712
 hey+2.659
land+2.488
 exception+2.449
 extingu+2.431
Neg logit contributions
 ever-1.252
 it-1.113
 onwards-1.059
 excited-1.009
 fright-0.966
Token their
Token ID511
Feature activation+8.93e-01

Pos logit contributions
aman+1.024
 hey+1.012
land+0.956
 exception+0.916
 extingu+0.906
Neg logit contributions
 ever-0.630
 onwards-0.587
keeping-0.574
 it-0.571
 fright-0.538
Token orange
Token ID10912
Feature activation+3.62e-02

Pos logit contributions
aman+1.220
 hey+1.205
land+1.119
 exception+1.067
 extingu+1.052
Neg logit contributions
 ever-1.280
 onwards-1.178
keeping-1.166
 it-1.166
 excited-1.128
Token balloon
Token ID21190
Feature activation+1.90e-01

Pos logit contributions
aman+0.059
 hey+0.058
land+0.055
 exception+0.052
 extingu+0.052
Neg logit contributions
 ever-0.040
 onwards-0.039
keeping-0.038
 it-0.036
 fright-0.036
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
aman+0.342
 hey+0.335
land+0.322
 exception+0.309
 extingu+0.305
Neg logit contributions
 ever-0.176
keeping-0.167
 onwards-0.167
 it-0.155
 fright-0.152
Token It
Token ID632
Feature activation+1.32e+00

Pos logit contributions
 ever+0.259
keeping+0.238
 onwards+0.232
 fright+0.217
 it+0.209
Neg logit contributions
aman-0.587
 hey-0.571
land-0.553
 whose-0.530
 extingu-0.527
Token is
Token ID318
Feature activation+3.08e-01

Pos logit contributions
aman+2.599
 hey+2.552
land+2.382
 exception+2.335
 extingu+2.325
Neg logit contributions
 ever-1.210
 it-1.079
 onwards-1.023
 excited-0.971
 fright-0.935
Token their
Token ID511
Feature activation+8.37e-01

Pos logit contributions
aman+0.520
 hey+0.510
land+0.486
 exception+0.466
 extingu+0.458
Neg logit contributions
 ever-0.327
keeping-0.310
 onwards-0.309
 it-0.293
 fright-0.285
Token best
Token ID1266
Feature activation+8.89e-02

Pos logit contributions
aman+1.117
 hey+1.101
land+1.022
 exception+0.972
 extingu+0.960
Neg logit contributions
 ever-1.223
 onwards-1.131
keeping-1.120
 it-1.120
 excited-1.082
Token friend
Token ID1545
Feature activation-1.03e+00

Pos logit contributions
aman+0.112
 hey+0.108
land+0.103
 exception+0.096
 extingu+0.094
Neg logit contributions
 ever-0.131
 onwards-0.128
keeping-0.128
 it-0.122
 fright-0.121
Token stay
Token ID2652
Feature activation-1.25e+00

Pos logit contributions
 onwards+0.458
keeping+0.456
 ever+0.451
 it+0.401
 fright+0.391
Neg logit contributions
 hey-0.709
aman-0.706
land-0.664
 exception-0.641
 extingu-0.631
Token up
Token ID510
Feature activation-8.18e-01

Pos logit contributions
 ever+1.404
keeping+1.392
 onwards+1.335
 Go+1.266
tick+1.235
Neg logit contributions
 hey-1.982
aman-1.968
land-1.858
 whose-1.842
 extingu-1.830
Token late
Token ID2739
Feature activation-2.17e-01

Pos logit contributions
keeping+0.762
 ever+0.718
 onwards+0.704
 Go+0.661
 fright+0.647
Neg logit contributions
aman-1.469
 hey-1.417
land-1.363
 extingu-1.335
 exception-1.328
Token and
Token ID290
Feature activation-5.30e-01

Pos logit contributions
 ever+0.213
keeping+0.207
 onwards+0.205
 fright+0.192
 it+0.187
Neg logit contributions
aman-0.373
 hey-0.367
land-0.357
 exception-0.336
 whose-0.332
Token play
Token ID711
Feature activation-2.82e-01

Pos logit contributions
keeping+0.551
 ever+0.542
 onwards+0.540
unts+0.491
 fright+0.475
Neg logit contributions
 hey-0.882
aman-0.862
land-0.810
 exception-0.785
 whose-0.775
Token until
Token ID1566
Feature activation-8.77e-01

Pos logit contributions
 ever+0.302
keeping+0.295
 onwards+0.290
 fright+0.268
 it+0.263
Neg logit contributions
 hey-0.456
aman-0.452
land-0.436
 exception-0.413
 whose-0.412
Token it
Token ID340
Feature activation+9.65e-01

Pos logit contributions
keeping+0.854
 ever+0.802
 onwards+0.785
unts+0.756
 fright+0.738
Neg logit contributions
 hey-1.543
aman-1.542
land-1.416
 exception-1.365
 whose-1.348
Token was
Token ID373
Feature activation+2.49e-01

Pos logit contributions
aman+1.855
 hey+1.823
land+1.729
 exception+1.678
 extingu+1.666
Neg logit contributions
 ever-0.879
 onwards-0.778
 it-0.752
keeping-0.734
 fright-0.704
Token time
Token ID640
Feature activation-1.19e+00

Pos logit contributions
aman+0.400
 hey+0.391
land+0.373
 exception+0.357
 extingu+0.350
Neg logit contributions
 ever-0.283
keeping-0.270
 onwards-0.268
 it-0.255
 fright-0.249
Token for
Token ID329
Feature activation-4.22e-01

Pos logit contributions
keeping+1.228
 ever+1.160
 onwards+1.127
 fright+1.125
tick+1.098
Neg logit contributions
aman-1.977
land-1.924
 hey-1.910
 exception-1.814
 whose-1.734
Token the
Token ID262
Feature activation+4.01e-01

Pos logit contributions
 ever+0.469
 onwards+0.464
keeping+0.454
 sideways+0.426
 fright+0.414
Neg logit contributions
aman-0.674
 hey-0.667
land-0.634
 exception-0.598
 whose-0.589
Token,
Token ID11
Feature activation+3.17e-01

Pos logit contributions
aman+1.044
 hey+1.023
land+0.988
 exception+0.946
 extingu+0.933
Neg logit contributions
 ever-0.457
 onwards-0.422
keeping-0.408
 it-0.404
 fright-0.374
Token Tim
Token ID5045
Feature activation-2.81e-01

Pos logit contributions
aman+0.589
 hey+0.579
land+0.554
 exception+0.533
 extingu+0.526
Neg logit contributions
 ever-0.287
keeping-0.268
 onwards-0.268
 it-0.250
 fright-0.243
Token saw
Token ID2497
Feature activation+1.89e+00

Pos logit contributions
 ever+0.305
 onwards+0.296
keeping+0.296
 fright+0.277
 it+0.268
Neg logit contributions
aman-0.462
 hey-0.450
land-0.429
 exception-0.411
 whose-0.408
Token a
Token ID257
Feature activation+2.97e-01

Pos logit contributions
aman+3.709
 hey+3.589
land+3.574
 exception+3.327
ist+3.300
Neg logit contributions
 it-1.873
 ever-1.636
 excited-1.534
 on-1.524
keeping-1.433
Token big
Token ID1263
Feature activation-1.10e+00

Pos logit contributions
aman+0.460
 hey+0.450
land+0.427
 exception+0.407
 extingu+0.401
Neg logit contributions
 ever-0.355
keeping-0.338
 onwards-0.337
 it-0.321
 fright-0.313
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
keeping+1.595
 ever+1.570
 onwards+1.539
 fright+1.367
unts+1.345
Neg logit contributions
aman-1.531
 hey-1.426
land-1.327
 extingu-1.299
 exception-1.283
Token shiny
Token ID22441
Feature activation+8.56e-02

Pos logit contributions
keeping+2.599
 ever+2.484
 onwards+2.368
unts+2.193
 Go+2.125
Neg logit contributions
aman-2.432
land-2.334
 hey-2.320
 whose-2.311
 exception-2.300
Token bike
Token ID7161
Feature activation+1.70e-01

Pos logit contributions
aman+0.124
 hey+0.120
land+0.114
 exception+0.108
 extingu+0.106
Neg logit contributions
 ever-0.112
keeping-0.108
 onwards-0.107
 it-0.101
 fright-0.100
Token that
Token ID326
Feature activation-9.57e-01

Pos logit contributions
aman+0.313
 hey+0.307
land+0.295
 exception+0.283
 whose+0.279
Neg logit contributions
 ever-0.152
keeping-0.144
 onwards-0.143
 it-0.132
 fright-0.130
Token was
Token ID373
Feature activation-2.72e-01

Pos logit contributions
keeping+1.112
 onwards+1.057
 ever+1.055
unts+1.005
 fright+1.002
Neg logit contributions
aman-1.453
 hey-1.421
 whose-1.336
land-1.318
 exception-1.314
Token very
Token ID845
Feature activation-1.37e+00

Pos logit contributions
keeping+0.269
 ever+0.268
 onwards+0.254
 fright+0.237
 it+0.230
Neg logit contributions
aman-0.471
 hey-0.455
land-0.443
 exception-0.431
 whose-0.422
Token Susan
Token ID14919
Feature activation-8.05e-02

Pos logit contributions
 ever+0.861
keeping+0.786
 onwards+0.751
 sideways+0.723
 fright+0.723
Neg logit contributions
aman-1.502
 hey-1.422
land-1.392
 whose-1.324
 extingu-1.293
Token wanted
Token ID2227
Feature activation-8.31e-01

Pos logit contributions
 ever+0.074
keeping+0.072
 onwards+0.072
 it+0.065
 fright+0.065
Neg logit contributions
aman-0.145
 hey-0.142
land-0.137
 whose-0.130
 exception-0.130
Token it
Token ID340
Feature activation-3.01e-01

Pos logit contributions
 onwards+0.693
keeping+0.684
 ever+0.652
unts+0.632
 sideways+0.588
Neg logit contributions
aman-1.530
 hey-1.512
land-1.473
 whose-1.426
 exception-1.387
Token really
Token ID1107
Feature activation+5.10e-02

Pos logit contributions
keeping+0.261
 ever+0.254
 onwards+0.249
 fright+0.228
unts+0.226
Neg logit contributions
aman-0.552
 hey-0.539
land-0.531
 exception-0.510
 whose-0.500
Token badly
Token ID11234
Feature activation+1.39e+00

Pos logit contributions
aman+0.094
 hey+0.092
land+0.089
 exception+0.086
 whose+0.084
Neg logit contributions
 ever-0.046
keeping-0.044
 onwards-0.043
 fright-0.039
 it-0.039
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
aman+2.907
 hey+2.887
land+2.698
 exception+2.619
 extingu+2.605
Neg logit contributions
 ever-1.074
 it-0.991
keeping-0.911
 onwards-0.885
 on-0.822
Token but
Token ID475
Feature activation-1.01e+00

Pos logit contributions
keeping+2.419
 ever+2.273
unts+2.228
 onwards+2.219
tick+2.132
Neg logit contributions
aman-2.386
land-2.250
 hey-2.218
 whose-2.115
 exception-2.023
Token she
Token ID673
Feature activation-1.16e+00

Pos logit contributions
keeping+1.164
unts+1.115
 ever+1.106
 onwards+1.060
tick+1.025
Neg logit contributions
aman-1.641
 hey-1.567
 whose-1.470
land-1.438
 exception-1.394
Token didn
Token ID1422
Feature activation-1.50e+00

Pos logit contributions
keeping+1.349
 onwards+1.254
 ever+1.191
 sideways+1.177
unts+1.168
Neg logit contributions
aman-1.897
 hey-1.825
 whose-1.703
land-1.685
 exception-1.623
Token't
Token ID470
Feature activation-1.22e+00

Pos logit contributions
keeping+1.615
 ever+1.352
 sideways+1.343
 onwards+1.330
 cheered+1.309
Neg logit contributions
aman-2.536
 hey-2.480
land-2.330
 extingu-2.256
 exception-2.244
Token have
Token ID423
Feature activation+6.80e-02

Pos logit contributions
keeping+1.053
 onwards+1.029
 ever+0.920
 sideways+0.868
unts+0.847
Neg logit contributions
 hey-2.247
 exception-2.150
land-2.138
aman-2.125
 extingu-2.017
Token their
Token ID511
Feature activation+3.16e-01

Pos logit contributions
aman+0.060
 hey+0.058
land+0.056
 exception+0.053
 whose+0.053
Neg logit contributions
 ever-0.037
keeping-0.035
 onwards-0.035
 fright-0.032
 it-0.032
Token vacation
Token ID14600
Feature activation+5.78e-01

Pos logit contributions
aman+0.449
 hey+0.439
land+0.415
 exception+0.393
 extingu+0.386
Neg logit contributions
 ever-0.420
keeping-0.401
 onwards-0.401
 it-0.385
 fright-0.376
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
aman+1.057
 hey+1.042
land+0.984
 exception+0.953
 extingu+0.943
Neg logit contributions
 ever-0.548
keeping-0.505
 onwards-0.503
 it-0.480
 fright-0.455
Token When
Token ID1649
Feature activation-1.27e+00

Pos logit contributions
aman+0.295
 hey+0.290
land+0.278
 exception+0.268
 whose+0.265
Neg logit contributions
 ever-0.117
keeping-0.107
 onwards-0.106
 it-0.098
 fright-0.096
Token they
Token ID484
Feature activation+4.81e-01

Pos logit contributions
keeping+1.449
 ever+1.335
 onwards+1.290
 fright+1.289
unts+1.268
Neg logit contributions
aman-2.093
 hey-2.055
land-1.906
 whose-1.792
 camps-1.789
Token got
Token ID1392
Feature activation-1.93e+00

Pos logit contributions
aman+0.820
 hey+0.806
land+0.769
 exception+0.738
 extingu+0.728
Neg logit contributions
 ever-0.512
 onwards-0.478
keeping-0.472
 it-0.458
 fright-0.439
Token home
Token ID1363
Feature activation+6.11e-01

Pos logit contributions
keeping+2.221
 ever+2.062
unts+2.058
 fright+2.053
 Go+1.985
Neg logit contributions
aman-3.085
 hey-2.956
land-2.798
 exception-2.785
 camps-2.741
Token,
Token ID11
Feature activation-1.62e-01

Pos logit contributions
aman+1.173
 hey+1.152
land+1.103
 exception+1.067
 extingu+1.053
Neg logit contributions
 ever-0.518
 onwards-0.478
keeping-0.470
 it-0.455
 excited-0.425
Token they
Token ID484
Feature activation-2.02e-01

Pos logit contributions
 ever+0.183
keeping+0.176
 onwards+0.173
 fright+0.164
unts+0.161
Neg logit contributions
aman-0.260
 hey-0.256
land-0.240
 exception-0.231
 whose-0.230
Token looked
Token ID3114
Feature activation-2.07e-03

Pos logit contributions
 ever+0.194
keeping+0.193
 onwards+0.189
 fright+0.173
 it+0.170
Neg logit contributions
aman-0.355
 hey-0.349
land-0.332
 exception-0.318
 whose-0.316
Token at
Token ID379
Feature activation+1.00e+00

Pos logit contributions
 ever+0.002
keeping+0.002
 onwards+0.002
 it+0.002
 fright+0.002
Neg logit contributions
aman-0.004
 hey-0.003
land-0.003
 exception-0.003
 whose-0.003
Token over
Token ID625
Feature activation-1.30e+00

Pos logit contributions
 ever+0.210
 onwards+0.204
keeping+0.202
 it+0.183
 fright+0.182
Neg logit contributions
aman-0.453
 hey-0.448
land-0.434
 exception-0.419
 whose-0.411
Token.
Token ID13
Feature activation+7.43e-01

Pos logit contributions
 onwards+1.375
keeping+1.337
 ever+1.289
unts+1.270
 fright+1.226
Neg logit contributions
aman-2.145
 hey-2.095
land-2.050
 whose-1.948
 exception-1.901
Token He
Token ID679
Feature activation-6.30e-02

Pos logit contributions
aman+1.429
 hey+1.420
land+1.354
 exception+1.307
 extingu+1.283
Neg logit contributions
 ever-0.611
 onwards-0.584
keeping-0.578
 it-0.564
 excited-0.526
Token told
Token ID1297
Feature activation+7.66e-01

Pos logit contributions
 ever+0.061
keeping+0.061
 onwards+0.060
 it+0.054
 fright+0.054
Neg logit contributions
aman-0.110
 hey-0.108
land-0.104
 exception-0.100
 whose-0.099
Token her
Token ID607
Feature activation-6.71e-01

Pos logit contributions
aman+1.342
 hey+1.307
land+1.256
 exception+1.204
 extingu+1.193
Neg logit contributions
 ever-0.790
 it-0.737
keeping-0.730
 onwards-0.719
 fright-0.694
Token he
Token ID339
Feature activation-1.71e+00

Pos logit contributions
 ever+0.805
keeping+0.801
 onwards+0.782
 sideways+0.747
 fright+0.731
Neg logit contributions
 hey-0.996
aman-0.993
land-0.969
 whose-0.908
 exception-0.903
Token was
Token ID373
Feature activation-1.07e-01

Pos logit contributions
 onwards+2.069
keeping+2.066
 onward+1.783
 sideways+1.761
 fright+1.718
Neg logit contributions
 hey-2.727
aman-2.529
 exception-2.510
land-2.502
 whose-2.453
Token looking
Token ID2045
Feature activation+2.49e-01

Pos logit contributions
 ever+0.109
keeping+0.108
 onwards+0.105
 fright+0.097
 it+0.096
Neg logit contributions
 hey-0.178
aman-0.177
land-0.168
 exception-0.164
 whose-0.159
Token for
Token ID329
Feature activation+8.46e-01

Pos logit contributions
aman+0.434
 hey+0.427
land+0.408
 exception+0.392
 extingu+0.385
Neg logit contributions
 ever-0.250
keeping-0.237
 onwards-0.236
 it-0.222
 fright-0.216
Token someone
Token ID2130
Feature activation+1.46e+00

Pos logit contributions
aman+1.371
 hey+1.334
land+1.254
 exception+1.208
 extingu+1.196
Neg logit contributions
 ever-1.005
keeping-0.935
 it-0.931
 onwards-0.916
 fright-0.876
Token.
Token ID13
Feature activation+8.10e-01

Pos logit contributions
aman+3.253
 hey+3.147
land+2.939
 extingu+2.887
 exception+2.866
Neg logit contributions
 ever-1.058
 it-0.857
keeping-0.829
 onwards-0.793
 on-0.773
Token water
Token ID1660
Feature activation+3.70e-01

Pos logit contributions
aman+1.635
 hey+1.604
land+1.436
 exception+1.414
 whose+1.355
Neg logit contributions
 ever-1.957
keeping-1.863
 it-1.855
 onwards-1.816
 excited-1.753
Token.
Token ID13
Feature activation-2.37e-01

Pos logit contributions
aman+0.683
 hey+0.671
land+0.642
 exception+0.618
 extingu+0.609
Neg logit contributions
 ever-0.334
keeping-0.312
 onwards-0.311
 it-0.293
 fright-0.282
Token It
Token ID632
Feature activation-6.15e-01

Pos logit contributions
 ever+0.232
keeping+0.216
 onwards+0.203
 fright+0.195
 it+0.182
Neg logit contributions
aman-0.428
 hey-0.418
land-0.400
 exception-0.386
 whose-0.385
Token was
Token ID373
Feature activation-1.19e+00

Pos logit contributions
keeping+0.674
 onwards+0.639
 ever+0.631
 fright+0.572
unts+0.559
Neg logit contributions
aman-0.993
 hey-0.964
land-0.946
 whose-0.929
 extingu-0.905
Token white
Token ID2330
Feature activation-5.04e-01

Pos logit contributions
keeping+1.540
 ever+1.448
 onwards+1.410
 fright+1.334
tick+1.286
Neg logit contributions
aman-1.724
 hey-1.641
land-1.627
 whose-1.625
 extingu-1.597
Token and
Token ID290
Feature activation-1.99e+00

Pos logit contributions
 onwards+0.360
 ever+0.359
keeping+0.344
 fright+0.315
unts+0.296
Neg logit contributions
aman-1.001
 hey-0.970
land-0.965
 extingu-0.931
 whose-0.922
Token shiny
Token ID22441
Feature activation+2.01e-01

Pos logit contributions
keeping+2.610
 onwards+2.331
 ever+2.306
 Go+2.151
 fright+2.135
Neg logit contributions
 whose-2.795
aman-2.794
 extingu-2.687
 hey-2.675
land-2.634
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
aman+0.365
 hey+0.358
land+0.345
 exception+0.331
 extingu+0.327
Neg logit contributions
 ever-0.187
keeping-0.176
 onwards-0.176
 it-0.163
 fright-0.161
Token had
Token ID550
Feature activation+3.94e-01

Pos logit contributions
keeping+1.701
 onwards+1.533
 ever+1.490
 fright+1.430
unts+1.404
Neg logit contributions
aman-2.797
 whose-2.773
 hey-2.747
land-2.668
 extingu-2.630
Token a
Token ID257
Feature activation-1.14e+00

Pos logit contributions
aman+0.623
 hey+0.610
land+0.577
 exception+0.553
 extingu+0.543
Neg logit contributions
 ever-0.462
keeping-0.438
 onwards-0.437
 it-0.420
 fright-0.407
Token flag
Token ID6056
Feature activation-2.51e-01

Pos logit contributions
keeping+1.518
 ever+1.494
 onwards+1.446
tick+1.282
 Go+1.272
Neg logit contributions
aman-1.613
 hey-1.561
land-1.554
 extingu-1.441
 exception-1.433