TOP ACTIVATIONS
MAX = 1.000

the zoo and saw an ostr ich . It was so
new tree and make more cher ries for everyone . <|endoftext|>
an ostr ich . The ostr ich has a long neck
it . She saw many sandwiches and fruits and drinks .
, they saw a big mel on on a table .
cherry tree with lots of cher ries on it . She
Wow , look at that mel on !" Lily said .
that said : " Free mel on . Enjoy !" They
big pond with ducks and ge ese . The water was
though he was just an oct opus , he worked hard
Can we go see the oct opus ?" His mom answered
time , there was an oct opus named Tim my .
pond . They shared their sandwiches and juice . They thanked
Come here , we have sandwiches and juice for you ,"
. Lucky ate so many bananas that he felt very heavy
park . It is an ostr ich . The ostr ich
, Tim my saw an oct opus in the water .
highest branches and grab the ju ic iest fruit . One
took a big slice of mel on and bit into it
, this is the best mel on ever !" Lily said

MIN ACTIVATIONS
MIN = 1.000

loud whistle . The race begins . The bikes zoom past
stayed away from it . Days , weeks and months went
't help but smile . Days were filled with the bees
couldn 't find him . Days turned into weeks , and
they kept on walking . Days went by and the old
G wen 's popcorn . Days later , the post man
who were friends ..." Mom begins . L ily
it in the garden . Days later , a beautiful orange
and suggested that the bunny begins with small steps . He
- destruct activated . Countdown begins ." Tom and
and said , " I will scare you every night !"
Tom and said , " I will scare you every night
covers . He heard the sound again . It was coming
heard the sound again . It was coming from his closet
the sound again . It was coming from his closet !
at Tom and said , " I will scare you every
hid under the covers . He heard the sound again .
say anything either . Tom was scared . Tom
table ?" The table did not say anything either . Tom
, table ?" The table did not say anything either .

INTERVAL 1.000 - 1.000
CONTAINS 0.035%

the ostr ich . The ostr ich was standing next to
" But I like the mel on . It looks funny
girl who was crying . Gig i asked the girl ,
. One day , the rad ish 's dad said to
a big , juicy b erry on a branch . Tweet
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.283
,+1.151
.+1.056
 the+0.953
 a+0.813
Neg logit contributions
lor-7.744
arians-7.568
rol-7.513
ocol-7.477
umm-7.466
Token zoo
Token ID26626
Feature activation+1.00e+00

Pos logit contributions
 and+5.667
,+5.518
 the+5.489
.+5.413
 a+5.284
Neg logit contributions
lor-3.584
arians-3.381
umm-3.373
rol-3.346
osity-3.171
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.402
 the+1.367
,+1.278
 a+1.194
.+1.175
Neg logit contributions
lor-7.658
arians-7.451
rol-7.441
umm-7.427
osity-7.287
Token saw
Token ID2497
Feature activation+1.00e+00

Pos logit contributions
 and+2.938
,+2.797
 the+2.782
.+2.731
 a+2.626
Neg logit contributions
lor-6.186
arians-6.006
rol-5.995
umm-5.929
osity-5.901
Token an
Token ID281
Feature activation+1.00e+00

Pos logit contributions
 and+1.424
,+1.279
.+1.162
 the+1.138
 a+0.917
Neg logit contributions
lor-7.715
rol-7.553
umm-7.474
arians-7.435
isher-7.377
Token ostr
Token ID42387
Feature activation+1.00e+00

Pos logit contributions
 and+5.171
 the+5.098
,+5.050
.+4.960
 a+4.903
Neg logit contributions
lor-3.894
umm-3.809
arians-3.796
rol-3.761
ocol-3.686
Tokenich
Token ID488
Feature activation+1.00e+00

Pos logit contributions
 and+7.367
,+7.238
 the+7.236
.+7.140
 a+7.060
Neg logit contributions
lor-1.732
umm-1.584
arians-1.553
rol-1.552
ocol-1.475
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+2.169
 the+2.096
,+2.035
 a+1.923
.+1.892
Neg logit contributions
lor-6.987
rol-6.790
arians-6.771
umm-6.733
osity-6.632
Token It
Token ID632
Feature activation+1.00e+00

Pos logit contributions
 and+3.239
 the+3.110
,+3.105
.+3.037
 a+2.949
Neg logit contributions
lor-5.869
arians-5.704
umm-5.655
osity-5.609
rol-5.571
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+1.949
,+1.835
 the+1.824
.+1.734
 a+1.678
Neg logit contributions
lor-7.080
rol-6.947
arians-6.906
umm-6.808
osity-6.709
Token so
Token ID523
Feature activation+1.00e+00

Pos logit contributions
 and+2.274
,+2.217
.+2.146
 the+2.083
 a+1.810
Neg logit contributions
rol-6.654
lor-6.555
umm-6.420
arians-6.418
ocol-6.398
Token new
Token ID649
Feature activation+1.00e+00

Pos logit contributions
 and+3.758
 the+3.673
,+3.602
 a+3.501
.+3.487
Neg logit contributions
lor-5.415
umm-5.221
arians-5.188
rol-5.098
bek-5.063
Token tree
Token ID5509
Feature activation+1.00e+00

Pos logit contributions
 and+4.517
 the+4.456
,+4.375
.+4.270
 a+4.269
Neg logit contributions
lor-4.651
umm-4.467
arians-4.447
rol-4.385
ides-4.300
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+2.227
 and+2.183
 a+2.048
,+2.044
.+1.906
Neg logit contributions
lor-6.839
arians-6.625
rol-6.620
umm-6.568
ides-6.473
Token make
Token ID787
Feature activation+1.00e+00

Pos logit contributions
 and+3.874
,+3.730
.+3.660
 the+3.636
 a+3.497
Neg logit contributions
lor-5.197
umm-5.096
arians-4.999
rol-4.998
bek-4.983
Token more
Token ID517
Feature activation+1.00e+00

Pos logit contributions
 and+3.145
,+3.030
 the+2.874
.+2.839
 a+2.698
Neg logit contributions
lor-6.065
umm-5.862
rol-5.860
ocol-5.763
arians-5.747
Token cher
Token ID21382
Feature activation+1.00e+00

Pos logit contributions
 and+4.582
 the+4.515
,+4.476
 a+4.362
.+4.330
Neg logit contributions
umm-4.454
lor-4.408
rol-4.313
ocol-4.310
arians-4.223
Tokenries
Token ID1678
Feature activation+1.00e+00

Pos logit contributions
 and+7.289
 the+7.190
,+7.152
.+7.043
 a+7.016
Neg logit contributions
lor-1.937
arians-1.760
umm-1.760
rol-1.731
ocol-1.636
Token for
Token ID329
Feature activation+1.00e+00

Pos logit contributions
 and+2.019
 the+1.988
,+1.894
 a+1.826
.+1.720
Neg logit contributions
lor-7.045
umm-6.904
rol-6.884
arians-6.842
ides-6.751
Token everyone
Token ID2506
Feature activation+1.00e+00

Pos logit contributions
 and+3.065
,+2.857
.+2.673
 the+2.640
 a+2.544
Neg logit contributions
ocol-6.098
lor-6.083
arians-6.039
umm-6.009
rol-5.843
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.930
 the+1.837
,+1.792
 a+1.689
.+1.641
Neg logit contributions
lor-7.189
rol-6.981
arians-6.981
umm-6.950
osity-6.892
Token<|endoftext|>
Token ID50256
Feature activation+1.00e+00

Pos logit contributions
 and+3.668
,+3.552
 the+3.534
.+3.415
 a+3.396
Neg logit contributions
lor-5.562
rol-5.365
umm-5.354
arians-5.332
osity-5.210
Token an
Token ID281
Feature activation+1.00e+00

Pos logit contributions
 and+2.481
,+2.422
.+2.348
 the+2.346
 a+2.071
Neg logit contributions
lor-6.279
rol-6.266
umm-6.148
ocol-6.116
arians-6.112
Token ostr
Token ID42387
Feature activation+1.00e+00

Pos logit contributions
 and+5.025
 the+4.967
,+4.906
.+4.823
 a+4.768
Neg logit contributions
lor-4.064
umm-3.967
arians-3.907
rol-3.889
ocol-3.832
Tokenich
Token ID488
Feature activation+1.00e+00

Pos logit contributions
 and+7.351
,+7.227
 the+7.199
.+7.120
 a+7.029
Neg logit contributions
lor-1.738
umm-1.597
arians-1.576
rol-1.566
ocol-1.483
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.560
 the+1.541
,+1.424
 a+1.368
.+1.270
Neg logit contributions
lor-7.526
arians-7.316
rol-7.312
umm-7.254
ides-7.158
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+2.750
,+2.640
 the+2.611
.+2.532
 a+2.487
Neg logit contributions
lor-6.314
rol-6.156
umm-6.133
arians-6.126
ocol-6.000
Token ostr
Token ID42387
Feature activation+1.00e+00

Pos logit contributions
 and+5.878
 the+5.760
,+5.743
.+5.638
 a+5.578
Neg logit contributions
lor-3.315
umm-3.111
arians-3.099
rol-3.089
ocol-2.977
Tokenich
Token ID488
Feature activation+1.00e+00

Pos logit contributions
 and+7.397
,+7.259
 the+7.255
.+7.150
 a+7.073
Neg logit contributions
lor-1.734
umm-1.578
arians-1.559
rol-1.536
ocol-1.454
Token has
Token ID468
Feature activation+1.00e+00

Pos logit contributions
 and+3.734
 the+3.691
,+3.617
.+3.546
 a+3.530
Neg logit contributions
lor-5.332
arians-5.182
rol-5.132
umm-5.092
ides-5.024
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 and+3.479
,+3.421
 the+3.281
.+3.265
 a+2.962
Neg logit contributions
lor-5.662
rol-5.439
arians-5.350
umm-5.294
ocol-5.284
Token long
Token ID890
Feature activation+1.00e+00

Pos logit contributions
 and+4.260
 the+4.241
,+4.096
.+4.062
 a+3.977
Neg logit contributions
lor-4.895
arians-4.673
umm-4.661
rol-4.608
bek-4.563
Token neck
Token ID7393
Feature activation+1.00e+00

Pos logit contributions
 the+4.928
 and+4.896
,+4.740
.+4.725
 a+4.695
Neg logit contributions
lor-4.194
arians-3.946
ides-3.934
umm-3.863
rol-3.858
Token it
Token ID340
Feature activation+1.00e+00

Pos logit contributions
 and+1.293
,+1.149
 the+1.119
.+1.020
 a+0.984
Neg logit contributions
lor-7.819
rol-7.604
arians-7.598
umm-7.559
ocol-7.536
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+0.657
 and+0.566
,+0.498
 a+0.435
.+0.218
Neg logit contributions
lor-8.327
rol-8.090
arians-8.073
umm-8.061
ides-7.993
Token She
Token ID1375
Feature activation+1.00e+00

Pos logit contributions
 and+3.468
,+3.353
 the+3.343
.+3.254
 a+3.115
Neg logit contributions
lor-5.644
arians-5.449
umm-5.417
osity-5.413
rol-5.387
Token saw
Token ID2497
Feature activation+1.00e+00

Pos logit contributions
 and+3.795
 the+3.745
,+3.742
.+3.668
 a+3.558
Neg logit contributions
lor-5.285
arians-5.152
rol-5.060
isher-4.970
umm-4.960
Token many
Token ID867
Feature activation+1.00e+00

Pos logit contributions
 and+1.726
,+1.601
.+1.491
 the+1.403
 a+1.198
Neg logit contributions
lor-7.292
rol-7.206
ocol-7.122
arians-7.107
umm-7.096
Token sandwiches
Token ID29777
Feature activation+1.00e+00

Pos logit contributions
 and+4.237
 the+4.153
,+4.114
 a+4.017
.+4.014
Neg logit contributions
lor-4.749
ocol-4.666
arians-4.661
umm-4.650
rol-4.631
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+0.839
 and+0.798
 a+0.697
,+0.680
.+0.602
Neg logit contributions
lor-8.041
rol-7.847
umm-7.839
arians-7.839
ides-7.766
Token fruits
Token ID15921
Feature activation+1.00e+00

Pos logit contributions
 and+5.151
,+5.042
 the+5.002
.+4.936
 a+4.802
Neg logit contributions
lor-3.899
arians-3.794
umm-3.772
rol-3.707
ocol-3.697
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+0.581
 and+0.523
 a+0.457
,+0.448
.+0.300
Neg logit contributions
lor-8.346
arians-8.137
rol-8.132
umm-8.089
ides-8.070
Token drinks
Token ID11758
Feature activation+1.00e+00

Pos logit contributions
 and+5.060
,+4.971
 the+4.933
.+4.853
 a+4.737
Neg logit contributions
lor-3.951
arians-3.857
umm-3.838
rol-3.773
ocol-3.756
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.941
 the+0.893
,+0.835
 a+0.723
.+0.666
Neg logit contributions
lor-8.038
rol-7.861
umm-7.824
arians-7.803
ocol-7.783
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+0.376
 the+0.249
,+0.154
.+0.116
 a+0.052
Neg logit contributions
lor-8.747
arians-8.561
rol-8.559
umm-8.536
ocol-8.488
Token they
Token ID484
Feature activation+1.00e+00

Pos logit contributions
 and+2.662
,+2.504
 the+2.462
.+2.426
 a+2.263
Neg logit contributions
lor-6.410
arians-6.309
rol-6.257
umm-6.236
ocol-6.136
Token saw
Token ID2497
Feature activation+1.00e+00

Pos logit contributions
 and+2.915
 the+2.804
,+2.791
.+2.743
 a+2.630
Neg logit contributions
lor-6.147
rol-6.027
arians-5.961
umm-5.875
bek-5.832
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 and+0.880
,+0.759
 the+0.659
.+0.640
 a+0.380
Neg logit contributions
lor-8.192
rol-8.029
arians-7.964
umm-7.950
isher-7.892
Token big
Token ID1263
Feature activation+1.00e+00

Pos logit contributions
 and+3.337
 the+3.250
,+3.186
.+3.138
 a+3.043
Neg logit contributions
lor-5.763
arians-5.611
umm-5.609
rol-5.506
bek-5.503
Token mel
Token ID7758
Feature activation+1.00e+00

Pos logit contributions
 and+4.410
 the+4.329
.+4.237
,+4.228
 a+4.186
Neg logit contributions
lor-4.596
umm-4.532
arians-4.484
ocol-4.424
rol-4.395
Tokenon
Token ID261
Feature activation+1.00e+00

Pos logit contributions
 and+5.188
 the+5.090
,+5.052
.+4.947
 a+4.907
Neg logit contributions
lor-4.013
arians-3.837
umm-3.828
rol-3.800
ocol-3.702
Token on
Token ID319
Feature activation+1.00e+00

Pos logit contributions
 the+2.739
 and+2.733
,+2.629
 a+2.509
.+2.475
Neg logit contributions
lor-6.402
arians-6.173
umm-6.170
rol-6.165
ides-6.066
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 and+0.867
,+0.763
.+0.633
 the+0.598
 a+0.428
Neg logit contributions
lor-8.197
arians-8.032
umm-7.991
ocol-7.907
rol-7.903
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.774
 the+4.700
,+4.648
.+4.539
 a+4.511
Neg logit contributions
lor-4.399
arians-4.177
umm-4.169
rol-4.069
ides-4.023
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.269
 the+1.265
,+1.158
 a+1.056
.+0.981
Neg logit contributions
lor-7.827
umm-7.616
rol-7.608
arians-7.598
ides-7.492
Token cherry
Token ID23612
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
 the+3.545
,+3.463
.+3.422
 a+3.339
Neg logit contributions
lor-5.484
umm-5.341
arians-5.309
bek-5.211
rol-5.197
Token tree
Token ID5509
Feature activation+1.00e+00

Pos logit contributions
 and+3.070
 the+3.010
,+2.947
 a+2.847
.+2.836
Neg logit contributions
lor-6.028
umm-5.900
arians-5.837
rol-5.811
ocol-5.711
Token with
Token ID351
Feature activation+1.00e+00

Pos logit contributions
 and+2.036
 the+1.989
,+1.922
 a+1.810
.+1.796
Neg logit contributions
lor-7.045
umm-6.863
rol-6.851
arians-6.845
osity-6.731
Token lots
Token ID6041
Feature activation+1.00e+00

Pos logit contributions
 and+2.910
,+2.803
 the+2.738
.+2.676
 a+2.506
Neg logit contributions
lor-6.210
umm-6.098
rol-6.086
arians-5.990
ocol-5.979
Token of
Token ID286
Feature activation+1.00e+00

Pos logit contributions
 and+2.222
 the+2.190
,+2.117
 a+2.022
.+1.993
Neg logit contributions
lor-6.833
umm-6.748
rol-6.685
ocol-6.641
arians-6.636
Token cher
Token ID21382
Feature activation+1.00e+00

Pos logit contributions
 and+5.239
 the+5.132
,+5.114
.+5.000
 a+4.947
Neg logit contributions
lor-3.860
umm-3.748
rol-3.714
arians-3.700
ocol-3.636
Tokenries
Token ID1678
Feature activation+1.00e+00

Pos logit contributions
 and+7.142
 the+7.039
,+7.010
.+6.904
 a+6.839
Neg logit contributions
lor-2.023
arians-1.856
umm-1.828
rol-1.801
osity-1.725
Token on
Token ID319
Feature activation+1.00e+00

Pos logit contributions
 and+1.172
 the+1.139
,+1.058
 a+0.963
.+0.896
Neg logit contributions
lor-7.855
umm-7.722
rol-7.712
arians-7.700
ides-7.597
Token it
Token ID340
Feature activation+1.00e+00

Pos logit contributions
 and+2.893
,+2.773
 the+2.714
.+2.597
 a+2.570
Neg logit contributions
lor-6.177
umm-5.985
arians-5.969
rol-5.887
ocol-5.845
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.454
 the+0.449
,+0.329
 a+0.265
.+0.031
Neg logit contributions
lor-8.488
rol-8.288
arians-8.274
umm-8.249
ides-8.209
Token She
Token ID1375
Feature activation+1.00e+00

Pos logit contributions
 and+2.912
 the+2.807
,+2.783
.+2.684
 a+2.654
Neg logit contributions
lor-6.221
umm-6.049
arians-6.000
rol-5.998
osity-5.895
TokenWow
Token ID22017
Feature activation+1.00e+00

Pos logit contributions
 and+5.893
,+5.810
 the+5.775
.+5.666
 a+5.563
Neg logit contributions
lor-3.404
umm-3.283
arians-3.245
rol-3.191
bek-3.089
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.140
 the+1.043
,+1.031
.+0.911
 a+0.854
Neg logit contributions
lor-8.074
arians-7.898
umm-7.893
rol-7.865
ocol-7.762
Token look
Token ID804
Feature activation+1.00e+00

Pos logit contributions
 and+3.092
 the+2.966
,+2.944
.+2.854
 a+2.779
Neg logit contributions
lor-6.119
arians-5.938
umm-5.930
rol-5.898
ocol-5.822
Token at
Token ID379
Feature activation+1.00e+00

Pos logit contributions
 and+1.922
 the+1.823
,+1.780
.+1.680
 a+1.638
Neg logit contributions
lor-7.270
arians-7.094
umm-7.090
rol-7.065
ocol-6.975
Token that
Token ID326
Feature activation+1.00e+00

Pos logit contributions
 and+2.121
,+1.976
.+1.923
 the+1.887
 a+1.785
Neg logit contributions
lor-7.053
arians-6.840
umm-6.830
rol-6.807
ocol-6.753
Token mel
Token ID7758
Feature activation+1.00e+00

Pos logit contributions
 and+4.610
 the+4.501
,+4.434
.+4.358
 a+4.315
Neg logit contributions
lor-4.528
umm-4.378
rol-4.243
arians-4.242
ocol-4.189
Tokenon
Token ID261
Feature activation+1.00e+00

Pos logit contributions
 and+5.333
 the+5.243
,+5.182
.+5.078
 a+5.061
Neg logit contributions
lor-3.928
umm-3.708
arians-3.684
rol-3.656
ides-3.582
Token!"
Token ID2474
Feature activation+1.00e+00

Pos logit contributions
 and+3.478
 the+3.419
,+3.305
 a+3.239
.+3.217
Neg logit contributions
lor-5.638
umm-5.449
arians-5.440
rol-5.417
ides-5.363
Token Lily
Token ID20037
Feature activation+1.00e+00

Pos logit contributions
 and+2.695
,+2.502
 the+2.490
.+2.416
 a+2.329
Neg logit contributions
lor-6.373
arians-6.238
rol-6.197
umm-6.165
ocol-6.136
Token said
Token ID531
Feature activation+1.00e+00

Pos logit contributions
 the+3.075
 and+3.056
,+2.961
.+2.870
 a+2.863
Neg logit contributions
lor-5.911
arians-5.802
rol-5.728
umm-5.714
ides-5.667
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.588
 the+0.525
,+0.382
 a+0.319
.+0.240
Neg logit contributions
lor-8.370
arians-8.234
rol-8.199
umm-8.194
osity-8.185
Token that
Token ID326
Feature activation+1.00e+00

Pos logit contributions
 and+1.711
 the+1.643
,+1.593
.+1.464
 a+1.463
Neg logit contributions
lor-7.374
arians-7.200
umm-7.177
rol-7.170
ocol-7.061
Token said
Token ID531
Feature activation+1.00e+00

Pos logit contributions
 and+2.903
,+2.770
 the+2.677
.+2.614
 a+2.520
Neg logit contributions
lor-6.074
arians-5.988
rol-5.931
umm-5.887
ides-5.800
Token:
Token ID25
Feature activation+1.00e+00

Pos logit contributions
 and+2.418
 the+2.289
,+2.190
.+2.150
 a+2.100
Neg logit contributions
lor-6.742
arians-6.498
osity-6.420
rol-6.400
ides-6.369
Token "
Token ID366
Feature activation+1.00e+00

Pos logit contributions
 and+2.526
,+2.383
 the+2.381
.+2.263
 a+2.192
Neg logit contributions
lor-6.596
arians-6.426
osity-6.320
rol-6.254
umm-6.248
TokenFree
Token ID11146
Feature activation+1.00e+00

Pos logit contributions
 and+6.289
,+6.239
 the+6.186
.+6.046
 a+5.981
Neg logit contributions
lor-2.973
arians-2.825
rol-2.780
umm-2.761
ides-2.681
Token mel
Token ID7758
Feature activation+1.00e+00

Pos logit contributions
 and+5.555
 the+5.471
,+5.412
.+5.330
 a+5.239
Neg logit contributions
lor-3.666
arians-3.485
umm-3.427
rol-3.392
ocol-3.380
Tokenon
Token ID261
Feature activation+1.00e+00

Pos logit contributions
 and+5.966
 the+5.887
,+5.813
.+5.705
 a+5.693
Neg logit contributions
lor-3.352
arians-3.120
umm-3.114
rol-3.081
ides-3.017
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+4.784
 and+4.772
,+4.617
 a+4.597
.+4.506
Neg logit contributions
lor-4.369
arians-4.198
rol-4.154
ides-4.112
umm-4.107
Token Enjoy
Token ID18179
Feature activation+1.00e+00

Pos logit contributions
 and+5.235
 the+5.133
,+5.111
.+5.007
 a+4.922
Neg logit contributions
lor-3.969
arians-3.834
umm-3.793
rol-3.759
osity-3.690
Token!"
Token ID2474
Feature activation+1.00e+00

Pos logit contributions
 and+3.127
,+2.989
 the+2.981
.+2.894
 a+2.883
Neg logit contributions
lor-6.010
arians-5.791
ocol-5.759
umm-5.715
rol-5.700
Token They
Token ID1119
Feature activation+1.00e+00

Pos logit contributions
 and+1.295
 the+1.271
,+1.253
.+1.116
 a+1.073
Neg logit contributions
lor-7.702
arians-7.460
rol-7.410
umm-7.382
ides-7.332
Token big
Token ID1263
Feature activation+1.00e+00

Pos logit contributions
 and+3.871
 the+3.794
,+3.727
.+3.679
 a+3.587
Neg logit contributions
lor-5.198
umm-5.058
arians-5.054
bek-4.960
rol-4.944
Token pond
Token ID16723
Feature activation+1.00e+00

Pos logit contributions
 and+4.334
 the+4.280
.+4.165
,+4.153
 a+4.137
Neg logit contributions
lor-4.651
umm-4.595
arians-4.534
ocol-4.508
rol-4.424
Token with
Token ID351
Feature activation+1.00e+00

Pos logit contributions
 and+1.801
 the+1.795
,+1.681
 a+1.612
.+1.540
Neg logit contributions
lor-7.199
rol-7.040
umm-6.992
arians-6.926
ides-6.866
Token ducks
Token ID39694
Feature activation+1.00e+00

Pos logit contributions
 and+3.251
,+3.160
 the+3.017
.+3.008
 a+2.763
Neg logit contributions
lor-5.821
rol-5.805
umm-5.709
ocol-5.652
arians-5.567
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+2.464
 and+2.464
,+2.359
 a+2.287
.+2.209
Neg logit contributions
lor-6.525
rol-6.412
arians-6.323
ocol-6.241
ides-6.236
Token ge
Token ID4903
Feature activation+1.00e+00

Pos logit contributions
 and+4.973
,+4.859
 the+4.766
.+4.753
 a+4.573
Neg logit contributions
lor-4.034
rol-3.939
arians-3.916
ocol-3.844
umm-3.840
Tokenese
Token ID2771
Feature activation+1.00e+00

Pos logit contributions
 and+7.434
 the+7.355
,+7.297
.+7.192
 a+7.167
Neg logit contributions
lor-1.782
arians-1.620
umm-1.592
rol-1.583
ides-1.488
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.641
 the+1.594
,+1.522
 a+1.429
.+1.336
Neg logit contributions
lor-7.400
rol-7.301
arians-7.207
ides-7.115
ocol-7.096
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.330
,+3.209
 the+3.202
.+3.098
 a+3.028
Neg logit contributions
lor-5.800
rol-5.641
arians-5.613
umm-5.594
ocol-5.540
Token water
Token ID1660
Feature activation+1.00e+00

Pos logit contributions
 and+5.120
 the+5.025
,+5.024
.+4.903
 a+4.841
Neg logit contributions
lor-3.916
arians-3.797
umm-3.776
rol-3.735
ocol-3.710
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.472
 the+3.438
,+3.366
 a+3.305
.+3.299
Neg logit contributions
lor-5.560
rol-5.392
arians-5.373
umm-5.343
ocol-5.233
Token though
Token ID996
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.082
.+4.030
 the+4.010
 a+3.928
Neg logit contributions
lor-4.562
arians-4.530
osity-4.505
rol-4.504
umm-4.489
Token he
Token ID339
Feature activation+1.00e+00

Pos logit contributions
 and+2.350
 the+2.084
.+2.078
,+2.071
 a+1.974
Neg logit contributions
lor-6.538
arians-6.466
umm-6.415
ocol-6.407
rol-6.383
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.693
 the+3.651
,+3.576
.+3.527
 a+3.474
Neg logit contributions
lor-5.383
arians-5.238
rol-5.130
umm-5.101
ides-5.097
Token just
Token ID655
Feature activation+1.00e+00

Pos logit contributions
 and+3.478
,+3.330
.+3.259
 the+3.223
 a+2.964
Neg logit contributions
lor-5.412
arians-5.341
rol-5.341
osity-5.293
beh-5.273
Token an
Token ID281
Feature activation+1.00e+00

Pos logit contributions
 and+3.494
,+3.382
 the+3.289
.+3.249
 a+3.050
Neg logit contributions
osity-5.442
rol-5.415
ocol-5.399
umm-5.388
lor-5.380
Token oct
Token ID19318
Feature activation+1.00e+00

Pos logit contributions
 and+4.709
 the+4.627
,+4.588
.+4.498
 a+4.457
Neg logit contributions
lor-4.351
umm-4.268
arians-4.228
rol-4.169
ocol-4.160
Tokenopus
Token ID25790
Feature activation+1.00e+00

Pos logit contributions
 and+8.015
 the+7.900
,+7.885
.+7.782
 a+7.722
Neg logit contributions
lor-1.141
arians-0.980
umm-0.965
rol-0.951
ocol-0.849
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.734
 the+1.702
 a+1.558
,+1.530
.+1.455
Neg logit contributions
lor-7.286
arians-7.167
rol-7.048
umm-7.012
isher-6.976
Token he
Token ID339
Feature activation+1.00e+00

Pos logit contributions
 and+2.452
,+2.302
.+2.262
 the+2.223
 a+2.047
Neg logit contributions
lor-6.420
arians-6.372
umm-6.349
rol-6.339
ocol-6.271
Token worked
Token ID3111
Feature activation+1.00e+00

Pos logit contributions
 and+3.624
 the+3.583
,+3.526
.+3.443
 a+3.397
Neg logit contributions
lor-5.468
arians-5.346
rol-5.260
umm-5.173
ides-5.154
Token hard
Token ID1327
Feature activation+1.00e+00

Pos logit contributions
 and+2.336
 the+2.263
,+2.211
.+2.080
 a+2.055
Neg logit contributions
lor-6.613
rol-6.484
arians-6.484
umm-6.369
ides-6.368
TokenCan
Token ID6090
Feature activation+1.00e+00

Pos logit contributions
 and+4.748
 the+4.653
,+4.623
.+4.518
 a+4.468
Neg logit contributions
lor-4.478
arians-4.299
umm-4.292
rol-4.260
ocol-4.159
Token we
Token ID356
Feature activation+1.00e+00

Pos logit contributions
 and+3.640
 the+3.515
,+3.489
.+3.402
 a+3.345
Neg logit contributions
lor-5.564
arians-5.397
umm-5.365
rol-5.326
ocol-5.247
Token go
Token ID467
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
 the+2.979
,+2.936
.+2.872
 a+2.821
Neg logit contributions
lor-5.986
arians-5.855
umm-5.829
rol-5.796
osity-5.731
Token see
Token ID766
Feature activation+1.00e+00

Pos logit contributions
 and+2.814
 the+2.737
,+2.680
.+2.598
 a+2.552
Neg logit contributions
lor-6.282
arians-6.119
rol-6.098
umm-6.070
ocol-5.982
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+2.466
,+2.317
 the+2.278
.+2.240
 a+2.126
Neg logit contributions
lor-6.562
arians-6.367
rol-6.353
ocol-6.334
umm-6.331
Token oct
Token ID19318
Feature activation+1.00e+00

Pos logit contributions
 and+5.432
 the+5.342
,+5.311
.+5.193
 a+5.151
Neg logit contributions
lor-3.748
umm-3.646
arians-3.549
rol-3.535
ocol-3.420
Tokenopus
Token ID25790
Feature activation+1.00e+00

Pos logit contributions
 and+8.052
 the+7.947
,+7.917
.+7.814
 a+7.766
Neg logit contributions
lor-1.125
arians-0.954
umm-0.942
rol-0.920
ocol-0.822
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+3.350
 the+3.303
,+3.176
.+3.134
 a+3.128
Neg logit contributions
lor-5.686
rol-5.504
arians-5.492
umm-5.454
osity-5.362
Token His
Token ID2399
Feature activation+1.00e+00

Pos logit contributions
 and+4.405
 the+4.310
,+4.251
 a+4.161
.+4.158
Neg logit contributions
lor-4.697
umm-4.515
arians-4.487
rol-4.448
osity-4.389
Token mom
Token ID1995
Feature activation+1.00e+00

Pos logit contributions
 and+2.564
 the+2.504
,+2.404
.+2.322
 a+2.291
Neg logit contributions
lor-6.563
umm-6.463
arians-6.452
rol-6.277
ides-6.266
Token answered
Token ID9373
Feature activation+1.00e+00

Pos logit contributions
 and+3.647
 the+3.627
,+3.503
 a+3.468
.+3.456
Neg logit contributions
lor-5.390
arians-5.284
rol-5.214
umm-5.184
ides-5.136
Token time
Token ID640
Feature activation+1.00e+00

Pos logit contributions
 and+2.482
 the+2.448
,+2.351
.+2.252
 a+2.220
Neg logit contributions
lor-6.635
umm-6.438
arians-6.422
rol-6.397
ides-6.350
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+0.262
 the+0.165
.+0.030
,+0.020
 a-0.019
Neg logit contributions
lor-8.832
rol-8.647
arians-8.618
umm-8.613
ocol-8.551
Token there
Token ID612
Feature activation+1.00e+00

Pos logit contributions
 and+1.920
 the+1.773
,+1.725
.+1.689
 a+1.554
Neg logit contributions
lor-7.073
rol-6.955
umm-6.854
arians-6.851
ocol-6.833
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+1.559
 the+1.405
,+1.339
.+1.338
 a+1.276
Neg logit contributions
lor-7.462
rol-7.297
umm-7.241
arians-7.163
ocol-7.093
Token an
Token ID281
Feature activation+1.00e+00

Pos logit contributions
 and+0.524
,+0.439
 the+0.375
.+0.334
 a+0.047
Neg logit contributions
lor-8.477
rol-8.340
umm-8.328
arians-8.256
ocol-8.240
Token oct
Token ID19318
Feature activation+1.00e+00

Pos logit contributions
 and+4.069
 the+3.996
,+3.960
.+3.870
 a+3.808
Neg logit contributions
lor-4.978
umm-4.903
arians-4.873
rol-4.818
ocol-4.785
Tokenopus
Token ID25790
Feature activation+1.00e+00

Pos logit contributions
 and+8.012
 the+7.900
,+7.879
.+7.777
 a+7.721
Neg logit contributions
lor-1.142
arians-0.979
umm-0.963
rol-0.941
ocol-0.844
Token named
Token ID3706
Feature activation+1.00e+00

Pos logit contributions
 and+3.242
 the+3.139
,+3.106
.+3.007
 a+2.956
Neg logit contributions
lor-5.948
arians-5.779
umm-5.770
rol-5.737
ocol-5.643
Token Tim
Token ID5045
Feature activation+1.00e+00

Pos logit contributions
 and+4.747
,+4.542
 the+4.503
.+4.443
 a+4.334
Neg logit contributions
lor-4.449
arians-4.301
rol-4.254
umm-4.253
ocol-4.166
Tokenmy
Token ID1820
Feature activation+1.00e+00

Pos logit contributions
 and+3.435
 the+3.360
,+3.317
 a+3.180
.+3.172
Neg logit contributions
lor-5.700
rol-5.500
arians-5.490
umm-5.469
ides-5.391
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
 the+0.730
,+0.695
.+0.596
 a+0.547
Neg logit contributions
lor-8.378
arians-8.206
umm-8.198
rol-8.167
ocol-8.070
Token pond
Token ID16723
Feature activation+1.00e+00

Pos logit contributions
 and+4.536
 the+4.429
,+4.426
.+4.282
 a+4.280
Neg logit contributions
lor-4.570
umm-4.455
arians-4.435
rol-4.363
ocol-4.299
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+0.481
 and+0.397
 a+0.336
,+0.301
.+0.069
Neg logit contributions
lor-8.433
rol-8.221
arians-8.179
umm-8.162
ides-8.140
Token They
Token ID1119
Feature activation+1.00e+00

Pos logit contributions
 and+3.158
,+3.065
 the+3.019
.+2.922
 a+2.814
Neg logit contributions
lor-5.967
arians-5.779
umm-5.736
rol-5.703
osity-5.635
Token shared
Token ID4888
Feature activation+1.00e+00

Pos logit contributions
 and+4.209
,+4.134
 the+4.124
.+4.016
 a+3.950
Neg logit contributions
lor-4.807
arians-4.665
rol-4.645
umm-4.557
ides-4.500
Token their
Token ID511
Feature activation+1.00e+00

Pos logit contributions
 and+2.564
,+2.481
 the+2.423
.+2.353
 a+2.256
Neg logit contributions
lor-6.392
rol-6.189
ocol-6.136
arians-6.132
umm-6.110
Token sandwiches
Token ID29777
Feature activation+1.00e+00

Pos logit contributions
 and+4.961
 the+4.872
,+4.862
.+4.748
 a+4.658
Neg logit contributions
lor-4.100
umm-3.967
arians-3.922
ocol-3.915
rol-3.889
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+0.460
 and+0.374
,+0.293
 a+0.274
.+0.186
Neg logit contributions
lor-8.501
umm-8.283
arians-8.279
rol-8.259
ides-8.196
Token juice
Token ID13135
Feature activation+1.00e+00

Pos logit contributions
 and+4.795
,+4.692
 the+4.631
.+4.602
 a+4.461
Neg logit contributions
lor-4.157
umm-4.049
arians-4.041
rol-3.977
ocol-3.960
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+1.457
 and+1.411
,+1.321
 a+1.282
.+1.141
Neg logit contributions
lor-7.519
umm-7.356
rol-7.350
arians-7.301
ides-7.234
Token They
Token ID1119
Feature activation+1.00e+00

Pos logit contributions
 and+3.088
,+2.988
 the+2.956
.+2.838
 a+2.751
Neg logit contributions
lor-6.066
arians-5.881
umm-5.839
rol-5.795
osity-5.734
Token thanked
Token ID26280
Feature activation+1.00e+00

Pos logit contributions
 and+4.115
,+4.024
 the+4.022
.+3.900
 a+3.839
Neg logit contributions
lor-4.933
arians-4.782
rol-4.768
umm-4.708
ides-4.624
Token Come
Token ID7911
Feature activation+1.00e+00

Pos logit contributions
 and+5.209
 the+5.106
,+5.077
.+4.982
 a+4.912
Neg logit contributions
lor-4.038
umm-3.861
arians-3.859
rol-3.803
ocol-3.727
Token here
Token ID994
Feature activation+1.00e+00

Pos logit contributions
 and+1.902
 the+1.806
,+1.766
.+1.663
 a+1.622
Neg logit contributions
lor-7.302
arians-7.128
umm-7.115
rol-7.090
ocol-6.990
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.565
 the+1.515
,+1.393
 a+1.349
.+1.332
Neg logit contributions
lor-7.397
arians-7.228
rol-7.166
umm-7.165
ides-7.088
Token we
Token ID356
Feature activation+1.00e+00

Pos logit contributions
 and+2.989
 the+2.874
,+2.855
.+2.785
 a+2.716
Neg logit contributions
lor-6.130
arians-5.953
rol-5.919
umm-5.878
ocol-5.808
Token have
Token ID423
Feature activation+1.00e+00

Pos logit contributions
 and+4.071
 the+3.991
,+3.922
.+3.845
 a+3.794
Neg logit contributions
lor-5.044
rol-4.902
arians-4.858
umm-4.821
ides-4.794
Token sandwiches
Token ID29777
Feature activation+1.00e+00

Pos logit contributions
 and+2.145
,+2.013
 the+1.995
.+1.899
 a+1.783
Neg logit contributions
lor-7.026
arians-6.796
rol-6.786
umm-6.782
ocol-6.681
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+1.698
 and+1.665
,+1.553
 a+1.487
.+1.446
Neg logit contributions
lor-7.320
arians-7.129
rol-7.112
umm-7.087
ides-7.032
Token juice
Token ID13135
Feature activation+1.00e+00

Pos logit contributions
 and+4.914
,+4.778
 the+4.772
.+4.693
 a+4.567
Neg logit contributions
lor-4.187
umm-4.031
arians-4.023
rol-3.967
ocol-3.945
Token for
Token ID329
Feature activation+1.00e+00

Pos logit contributions
 and+2.469
 the+2.459
,+2.359
 a+2.258
.+2.223
Neg logit contributions
lor-6.606
rol-6.417
arians-6.413
umm-6.403
ides-6.311
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+4.017
,+3.814
 the+3.707
.+3.673
 a+3.514
Neg logit contributions
lor-5.112
ocol-5.072
arians-5.028
umm-4.921
ides-4.899
Token,"
Token ID553
Feature activation+1.00e+00

Pos logit contributions
 the+2.486
 and+2.482
,+2.328
 a+2.253
.+2.165
Neg logit contributions
lor-6.499
arians-6.342
rol-6.305
ides-6.294
umm-6.223
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.815
 the+1.787
,+1.697
 a+1.595
.+1.530
Neg logit contributions
lor-7.296
rol-7.111
arians-7.070
umm-7.063
ides-6.950
Token Lucky
Token ID27963
Feature activation+1.00e+00

Pos logit contributions
 and+4.021
 the+3.924
,+3.906
.+3.809
 a+3.753
Neg logit contributions
lor-5.123
umm-4.888
rol-4.874
arians-4.869
osity-4.777
Token ate
Token ID15063
Feature activation+1.00e+00

Pos logit contributions
 and+3.065
 the+3.005
,+2.936
.+2.868
 a+2.828
Neg logit contributions
lor-6.026
arians-5.872
umm-5.748
rol-5.748
ides-5.680
Token so
Token ID523
Feature activation+1.00e+00

Pos logit contributions
 and+1.621
.+1.500
,+1.497
 the+1.477
 a+1.350
Neg logit contributions
lor-7.195
rol-7.106
umm-7.092
ocol-7.036
arians-6.953
Token many
Token ID867
Feature activation+1.00e+00

Pos logit contributions
 and+2.870
 the+2.720
,+2.696
.+2.589
 a+2.513
Neg logit contributions
lor-6.275
umm-6.095
rol-6.072
arians-5.986
ocol-5.960
Token bananas
Token ID35484
Feature activation+1.00e+00

Pos logit contributions
 and+5.226
 the+5.134
,+5.086
.+5.043
 a+5.004
Neg logit contributions
lor-3.706
umm-3.641
ocol-3.612
rol-3.580
arians-3.553
Token that
Token ID326
Feature activation+1.00e+00

Pos logit contributions
 the+0.954
 and+0.926
,+0.789
 a+0.763
.+0.722
Neg logit contributions
lor-8.006
rol-7.888
umm-7.840
arians-7.784
ides-7.747
Token he
Token ID339
Feature activation+1.00e+00

Pos logit contributions
 and+2.902
,+2.719
.+2.706
 the+2.594
 a+2.547
Neg logit contributions
umm-6.116
lor-6.065
rol-5.960
arians-5.872
ocol-5.845
Token felt
Token ID2936
Feature activation+1.00e+00

Pos logit contributions
 and+3.979
 the+3.889
,+3.882
.+3.777
 a+3.707
Neg logit contributions
lor-5.148
arians-5.004
rol-4.951
umm-4.941
isher-4.854
Token very
Token ID845
Feature activation+1.00e+00

Pos logit contributions
 and+2.660
,+2.510
 the+2.507
.+2.382
 a+2.323
Neg logit contributions
lor-6.349
rol-6.261
umm-6.244
ocol-6.165
arians-6.091
Token heavy
Token ID4334
Feature activation+1.00e+00

Pos logit contributions
 and+3.606
 the+3.485
,+3.403
 a+3.325
.+3.287
Neg logit contributions
umm-5.430
lor-5.388
ocol-5.299
rol-5.275
ides-5.212
Token park
Token ID3952
Feature activation+1.00e+00

Pos logit contributions
 and+3.828
,+3.708
 the+3.690
.+3.607
 a+3.489
Neg logit contributions
lor-5.415
umm-5.166
rol-5.113
arians-5.068
isher-4.963
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.373
 the+0.333
,+0.249
 a+0.183
.+0.025
Neg logit contributions
lor-8.653
rol-8.455
arians-8.432
umm-8.383
ides-8.358
Token It
Token ID632
Feature activation+1.00e+00

Pos logit contributions
 and+2.869
,+2.758
 the+2.735
.+2.660
 a+2.608
Neg logit contributions
lor-6.156
umm-6.024
rol-6.010
arians-5.995
osity-5.900
Token is
Token ID318
Feature activation+1.00e+00

Pos logit contributions
 and+2.961
 the+2.866
,+2.836
.+2.740
 a+2.710
Neg logit contributions
lor-6.148
arians-5.955
rol-5.952
umm-5.903
ocol-5.811
Token an
Token ID281
Feature activation+1.00e+00

Pos logit contributions
 and+2.481
,+2.422
.+2.348
 the+2.346
 a+2.071
Neg logit contributions
lor-6.279
rol-6.266
umm-6.148
ocol-6.116
arians-6.112
Token ostr
Token ID42387
Feature activation+1.00e+00

Pos logit contributions
 and+5.025
 the+4.967
,+4.906
.+4.823
 a+4.768
Neg logit contributions
lor-4.064
umm-3.967
arians-3.907
rol-3.889
ocol-3.832
Tokenich
Token ID488
Feature activation+1.00e+00

Pos logit contributions
 and+7.351
,+7.227
 the+7.199
.+7.120
 a+7.029
Neg logit contributions
lor-1.738
umm-1.597
arians-1.576
rol-1.566
ocol-1.483
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.560
 the+1.541
,+1.424
 a+1.368
.+1.270
Neg logit contributions
lor-7.526
arians-7.316
rol-7.312
umm-7.254
ides-7.158
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+2.750
,+2.640
 the+2.611
.+2.532
 a+2.487
Neg logit contributions
lor-6.314
rol-6.156
umm-6.133
arians-6.126
ocol-6.000
Token ostr
Token ID42387
Feature activation+1.00e+00

Pos logit contributions
 and+5.878
 the+5.760
,+5.743
.+5.638
 a+5.578
Neg logit contributions
lor-3.315
umm-3.111
arians-3.099
rol-3.089
ocol-2.977
Tokenich
Token ID488
Feature activation+1.00e+00

Pos logit contributions
 and+7.397
,+7.259
 the+7.255
.+7.150
 a+7.073
Neg logit contributions
lor-1.734
umm-1.578
arians-1.559
rol-1.536
ocol-1.454
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+0.121
 the+0.022
,+0.009
.-0.119
 a-0.162
Neg logit contributions
lor-9.116
arians-8.943
umm-8.918
rol-8.892
ides-8.789
Token Tim
Token ID5045
Feature activation+1.00e+00

Pos logit contributions
 and+2.466
 the+2.315
,+2.301
.+2.233
 a+2.140
Neg logit contributions
lor-6.575
umm-6.479
arians-6.455
rol-6.439
ocol-6.368
Tokenmy
Token ID1820
Feature activation+1.00e+00

Pos logit contributions
 and+3.619
 the+3.572
,+3.518
.+3.415
 a+3.393
Neg logit contributions
lor-5.507
umm-5.297
arians-5.289
rol-5.289
isher-5.190
Token saw
Token ID2497
Feature activation+1.00e+00

Pos logit contributions
 and+2.290
 the+2.247
,+2.178
.+2.104
 a+2.070
Neg logit contributions
lor-6.795
arians-6.591
umm-6.572
rol-6.565
ides-6.461
Token an
Token ID281
Feature activation+1.00e+00

Pos logit contributions
 and+1.061
,+0.921
 the+0.845
.+0.843
 a+0.627
Neg logit contributions
lor-8.053
rol-7.861
umm-7.834
isher-7.775
arians-7.768
Token oct
Token ID19318
Feature activation+1.00e+00

Pos logit contributions
 and+4.669
 the+4.578
,+4.538
.+4.446
 a+4.396
Neg logit contributions
lor-4.451
umm-4.326
arians-4.291
rol-4.274
ocol-4.197
Tokenopus
Token ID25790
Feature activation+1.00e+00

Pos logit contributions
 and+7.916
,+7.786
 the+7.767
.+7.682
 a+7.602
Neg logit contributions
lor-1.199
arians-1.059
rol-1.023
umm-1.020
ocol-0.929
Token in
Token ID287
Feature activation+1.00e+00

Pos logit contributions
 and+2.515
 the+2.426
,+2.382
.+2.272
 a+2.243
Neg logit contributions
lor-6.673
arians-6.490
umm-6.477
rol-6.463
ocol-6.358
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+0.567
,+0.480
.+0.314
 the+0.232
 a+0.108
Neg logit contributions
lor-8.532
ocol-8.291
umm-8.291
rol-8.234
arians-8.225
Token water
Token ID1660
Feature activation+1.00e+00

Pos logit contributions
 and+4.492
,+4.378
 the+4.364
.+4.260
 a+4.180
Neg logit contributions
lor-4.719
umm-4.546
rol-4.486
arians-4.406
isher-4.392
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.137
 the+1.130
,+1.053
 a+0.986
.+0.836
Neg logit contributions
lor-7.864
rol-7.726
umm-7.612
arians-7.544
isher-7.517
Token highest
Token ID4511
Feature activation+1.00e+00

Pos logit contributions
 and+5.241
,+5.125
 the+5.093
.+4.989
 a+4.866
Neg logit contributions
lor-3.926
umm-3.903
arians-3.793
rol-3.739
osity-3.626
Token branches
Token ID13737
Feature activation+1.00e+00

Pos logit contributions
 and+4.225
 the+4.160
,+4.079
 a+4.022
.+3.983
Neg logit contributions
lor-4.761
umm-4.747
arians-4.645
rol-4.567
ocol-4.511
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.897
 the+1.886
 a+1.792
,+1.771
.+1.696
Neg logit contributions
lor-7.015
rol-6.929
umm-6.806
arians-6.789
osity-6.782
Token grab
Token ID5552
Feature activation+1.00e+00

Pos logit contributions
 and+3.795
,+3.670
 the+3.624
.+3.604
 a+3.483
Neg logit contributions
lor-5.226
arians-5.137
umm-5.126
rol-5.064
osity-5.029
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+2.642
,+2.559
 the+2.472
.+2.422
 a+2.272
Neg logit contributions
lor-6.335
rol-6.301
umm-6.230
ocol-6.211
arians-6.135
Token ju
Token ID7544
Feature activation+1.00e+00

Pos logit contributions
 and+5.139
 the+5.050
,+5.035
.+4.904
 a+4.835
Neg logit contributions
lor-3.935
umm-3.930
arians-3.851
rol-3.837
osity-3.672
Tokenic
Token ID291
Feature activation+1.00e+00

Pos logit contributions
 and+7.655
 the+7.557
,+7.519
.+7.415
 a+7.373
Neg logit contributions
lor-1.541
arians-1.366
umm-1.356
rol-1.328
ocol-1.229
Tokeniest
Token ID6386
Feature activation+1.00e+00

Pos logit contributions
 and+6.809
 the+6.715
,+6.658
 a+6.549
.+6.533
Neg logit contributions
lor-2.400
umm-2.191
arians-2.178
rol-2.174
ocol-2.111
Token fruit
Token ID8234
Feature activation+1.00e+00

Pos logit contributions
 and+4.621
 the+4.574
,+4.486
 a+4.388
.+4.381
Neg logit contributions
lor-4.436
umm-4.382
rol-4.300
arians-4.296
ocol-4.245
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+2.084
 the+2.033
,+1.948
 a+1.874
.+1.792
Neg logit contributions
lor-7.008
umm-6.871
rol-6.835
arians-6.802
ides-6.680
Token One
Token ID1881
Feature activation+1.00e+00

Pos logit contributions
 and+2.523
 the+2.434
,+2.402
.+2.270
 a+2.269
Neg logit contributions
lor-6.629
umm-6.460
rol-6.447
arians-6.421
osity-6.318
Token took
Token ID1718
Feature activation+1.00e+00

Pos logit contributions
 and+3.164
,+3.154
 the+3.111
.+2.987
 a+2.842
Neg logit contributions
lor-5.687
bek-5.575
rol-5.527
arians-5.524
ides-5.485
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 and+2.781
,+2.631
 the+2.541
.+2.515
 a+2.349
Neg logit contributions
lor-6.259
umm-6.169
rol-6.128
ocol-6.101
isher-6.094
Token big
Token ID1263
Feature activation+1.00e+00

Pos logit contributions
 and+4.833
 the+4.749
,+4.684
.+4.614
 a+4.555
Neg logit contributions
lor-4.312
umm-4.100
arians-4.069
rol-4.068
ocol-4.039
Token slice
Token ID16416
Feature activation+1.00e+00

Pos logit contributions
 and+4.810
 the+4.716
,+4.667
.+4.575
 a+4.533
Neg logit contributions
lor-4.362
umm-4.186
arians-4.182
rol-4.148
ocol-4.072
Token of
Token ID286
Feature activation+1.00e+00

Pos logit contributions
 the+1.966
 and+1.904
,+1.836
 a+1.772
.+1.667
Neg logit contributions
lor-7.008
umm-6.910
rol-6.860
isher-6.750
arians-6.737
Token mel
Token ID7758
Feature activation+1.00e+00

Pos logit contributions
 and+5.172
,+5.039
 the+4.970
.+4.937
 a+4.826
Neg logit contributions
lor-3.881
umm-3.784
ocol-3.751
arians-3.683
rol-3.669
Tokenon
Token ID261
Feature activation+1.00e+00

Pos logit contributions
 and+5.166
 the+5.078
,+5.031
.+4.922
 a+4.894
Neg logit contributions
lor-4.052
umm-3.862
arians-3.860
rol-3.827
ides-3.734
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+2.008
 and+1.918
,+1.850
 a+1.808
.+1.677
Neg logit contributions
lor-7.099
umm-6.915
rol-6.883
arians-6.865
ides-6.826
Token bit
Token ID1643
Feature activation+1.00e+00

Pos logit contributions
 and+4.343
,+4.244
 the+4.191
.+4.138
 a+3.989
Neg logit contributions
lor-4.651
arians-4.513
umm-4.501
rol-4.425
ocol-4.398
Token into
Token ID656
Feature activation+1.00e+00

Pos logit contributions
 and+2.133
,+2.006
 the+1.962
.+1.869
 a+1.823
Neg logit contributions
lor-6.959
umm-6.864
rol-6.846
arians-6.827
ocol-6.791
Token it
Token ID340
Feature activation+1.00e+00

Pos logit contributions
 and+2.042
,+1.951
.+1.724
 the+1.720
 a+1.577
Neg logit contributions
lor-6.940
ocol-6.886
arians-6.830
umm-6.798
isher-6.727
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.657
 the+2.568
,+2.510
.+2.413
 a+2.385
Neg logit contributions
lor-6.538
arians-6.359
umm-6.356
rol-6.323
ocol-6.235
Token this
Token ID428
Feature activation+1.00e+00

Pos logit contributions
 and+4.140
,+4.009
.+3.975
 the+3.966
 a+3.777
Neg logit contributions
lor-4.956
umm-4.836
ocol-4.804
arians-4.798
rol-4.703
Token is
Token ID318
Feature activation+1.00e+00

Pos logit contributions
 and+4.756
 the+4.683
,+4.617
.+4.520
 a+4.483
Neg logit contributions
lor-4.430
umm-4.227
arians-4.164
rol-4.139
ocol-4.088
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+3.486
,+3.301
 the+3.259
.+3.217
 a+3.082
Neg logit contributions
lor-5.572
umm-5.501
arians-5.427
ocol-5.398
rol-5.386
Token best
Token ID1266
Feature activation+1.00e+00

Pos logit contributions
 and+4.952
 the+4.856
,+4.823
.+4.700
 a+4.637
Neg logit contributions
lor-4.213
umm-4.149
arians-4.043
rol-3.973
ocol-3.918
Token mel
Token ID7758
Feature activation+1.00e+00

Pos logit contributions
 and+5.012
 the+4.916
,+4.875
.+4.770
 a+4.731
Neg logit contributions
lor-4.199
arians-4.022
umm-4.018
rol-3.985
ocol-3.889
Tokenon
Token ID261
Feature activation+1.00e+00

Pos logit contributions
 and+5.188
 the+5.102
,+5.052
.+4.934
 a+4.917
Neg logit contributions
lor-4.035
umm-3.846
arians-3.829
rol-3.800
ocol-3.716
Token ever
Token ID1683
Feature activation+1.00e+00

Pos logit contributions
 and+3.864
 the+3.789
,+3.725
.+3.610
 a+3.601
Neg logit contributions
lor-5.289
umm-5.106
arians-5.103
rol-5.072
ides-4.982
Token!"
Token ID2474
Feature activation+1.00e+00

Pos logit contributions
 and+3.303
 the+3.232
,+3.153
 a+3.064
.+3.020
Neg logit contributions
lor-5.809
umm-5.629
arians-5.607
rol-5.605
ides-5.524
Token Lily
Token ID20037
Feature activation+1.00e+00

Pos logit contributions
 and+2.296
,+2.130
 the+2.105
.+2.044
 a+1.945
Neg logit contributions
lor-6.760
arians-6.595
rol-6.582
umm-6.576
ocol-6.537
Token said
Token ID531
Feature activation+1.00e+00

Pos logit contributions
 the+2.709
 and+2.708
,+2.631
.+2.531
 a+2.511
Neg logit contributions
lor-6.308
arians-6.158
rol-6.085
umm-6.076
ides-6.006
Token loud
Token ID7812
Feature activation+1.00e+00

Pos logit contributions
 and+3.857
 the+3.747
,+3.703
.+3.600
 a+3.598
Neg logit contributions
lor-5.258
arians-5.131
umm-5.074
rol-4.993
bek-4.992
Token whistle
Token ID16121
Feature activation+1.00e+00

Pos logit contributions
 and+4.268
 the+4.183
,+4.127
.+4.034
 a+4.023
Neg logit contributions
lor-4.813
arians-4.646
umm-4.605
rol-4.591
ocol-4.547
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+1.310
 and+1.288
,+1.211
 a+1.115
.+1.008
Neg logit contributions
lor-7.682
rol-7.463
arians-7.448
ides-7.411
umm-7.349
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.434
,+3.349
 the+3.287
.+3.193
 a+3.072
Neg logit contributions
lor-5.600
arians-5.461
umm-5.454
rol-5.400
osity-5.344
Token race
Token ID3234
Feature activation+1.00e+00

Pos logit contributions
 and+4.249
,+4.171
 the+4.121
.+4.025
 a+3.938
Neg logit contributions
lor-4.811
arians-4.670
umm-4.633
isher-4.522
rol-4.518
Token begins
Token ID6140
Feature activation+1.00e+00

Pos logit contributions
 and+4.568
 the+4.466
,+4.428
.+4.319
 a+4.300
Neg logit contributions
lor-4.580
arians-4.430
rol-4.380
umm-4.363
ides-4.278
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.777
 the+1.747
,+1.661
 a+1.549
.+1.474
Neg logit contributions
lor-7.218
arians-7.056
rol-6.994
ides-6.955
osity-6.926
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.500
,+3.414
 the+3.317
.+3.243
 a+3.102
Neg logit contributions
lor-5.585
umm-5.437
arians-5.427
rol-5.420
ocol-5.332
Token bikes
Token ID16715
Feature activation+1.00e+00

Pos logit contributions
 and+4.436
,+4.346
 the+4.313
.+4.209
 a+4.123
Neg logit contributions
lor-4.598
arians-4.541
umm-4.495
isher-4.380
rol-4.372
Token zoom
Token ID19792
Feature activation+1.00e+00

Pos logit contributions
 and+3.877
 the+3.795
,+3.770
.+3.659
 a+3.649
Neg logit contributions
lor-5.146
arians-5.052
rol-5.021
umm-4.950
ides-4.929
Token past
Token ID1613
Feature activation+1.00e+00

Pos logit contributions
 and+4.736
 the+4.675
,+4.629
.+4.512
 a+4.503
Neg logit contributions
lor-4.327
arians-4.187
rol-4.183
ides-4.060
umm-4.035
Token stayed
Token ID9658
Feature activation+1.00e+00

Pos logit contributions
 and+3.907
,+3.815
.+3.737
 the+3.736
 a+3.554
Neg logit contributions
lor-5.117
arians-4.978
rol-4.881
bek-4.874
isher-4.863
Token away
Token ID1497
Feature activation+1.00e+00

Pos logit contributions
 and+3.087
,+2.999
 the+2.961
.+2.837
 a+2.773
Neg logit contributions
lor-5.979
arians-5.788
rol-5.765
umm-5.728
ocol-5.726
Token from
Token ID422
Feature activation+1.00e+00

Pos logit contributions
 and+2.317
 the+2.152
,+2.147
 a+1.994
.+1.920
Neg logit contributions
lor-6.689
rol-6.561
umm-6.553
arians-6.532
osity-6.503
Token it
Token ID340
Feature activation+1.00e+00

Pos logit contributions
 and+2.845
,+2.722
.+2.471
 the+2.437
 a+2.356
Neg logit contributions
arians-6.165
lor-6.164
ocol-6.134
umm-6.070
rol-6.025
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.241
 the+1.193
,+1.090
 a+1.024
.+0.807
Neg logit contributions
lor-7.761
arians-7.570
rol-7.553
umm-7.547
ides-7.476
Token Days
Token ID12579
Feature activation+1.00e+00

Pos logit contributions
 and+4.288
,+4.186
 the+4.162
.+4.056
 a+3.990
Neg logit contributions
lor-4.895
umm-4.807
rol-4.753
arians-4.751
osity-4.657
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+3.960
 the+3.872
,+3.837
.+3.784
 a+3.759
Neg logit contributions
lor-5.154
arians-4.969
rol-4.922
umm-4.899
isher-4.825
Token weeks
Token ID2745
Feature activation+1.00e+00

Pos logit contributions
 and+2.633
,+2.612
.+2.580
 the+2.485
 a+2.390
Neg logit contributions
lor-6.239
arians-6.181
umm-6.174
rol-6.068
ocol-6.061
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+2.753
 the+2.648
,+2.575
 a+2.569
.+2.565
Neg logit contributions
lor-6.274
arians-6.131
rol-6.118
umm-6.053
ocol-6.001
Token months
Token ID1933
Feature activation+1.00e+00

Pos logit contributions
 and+4.216
.+4.110
,+4.107
 the+3.915
 a+3.814
Neg logit contributions
umm-4.819
lor-4.802
arians-4.756
bek-4.723
rol-4.642
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+3.379
 the+3.262
,+3.202
 a+3.155
.+3.155
Neg logit contributions
lor-5.666
rol-5.554
arians-5.519
umm-5.454
ocol-5.410
Token't
Token ID470
Feature activation+1.00e+00

Pos logit contributions
 and+4.196
 the+4.089
,+4.061
.+3.920
 a+3.899
Neg logit contributions
lor-4.989
arians-4.754
rol-4.742
umm-4.709
ides-4.656
Token help
Token ID1037
Feature activation+1.00e+00

Pos logit contributions
 and+5.316
 the+5.228
,+5.173
 a+5.047
.+5.028
Neg logit contributions
lor-3.805
arians-3.653
umm-3.635
ides-3.583
rol-3.564
Token but
Token ID475
Feature activation+1.00e+00

Pos logit contributions
 and+2.664
 the+2.562
,+2.485
 a+2.441
.+2.375
Neg logit contributions
lor-6.328
arians-6.267
rol-6.145
ocol-6.136
umm-6.136
Token smile
Token ID8212
Feature activation+1.00e+00

Pos logit contributions
 and+4.365
 the+4.199
,+4.198
.+4.154
 a+4.086
Neg logit contributions
lor-4.656
osity-4.542
arians-4.506
umm-4.479
rol-4.472
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+2.799
 the+2.750
,+2.675
 a+2.607
.+2.544
Neg logit contributions
lor-6.205
arians-6.054
osity-6.007
rol-5.951
umm-5.944
Token Days
Token ID12579
Feature activation+1.00e+00

Pos logit contributions
 and+4.617
,+4.526
 the+4.497
.+4.414
 a+4.306
Neg logit contributions
lor-4.624
arians-4.490
rol-4.412
umm-4.406
osity-4.347
Token were
Token ID547
Feature activation+1.00e+00

Pos logit contributions
 and+4.505
 the+4.398
,+4.380
.+4.344
 a+4.293
Neg logit contributions
lor-4.620
arians-4.454
rol-4.420
umm-4.372
isher-4.302
Token filled
Token ID5901
Feature activation+1.00e+00

Pos logit contributions
 and+4.013
,+3.878
 the+3.867
.+3.818
 a+3.686
Neg logit contributions
lor-5.010
rol-4.953
umm-4.903
arians-4.864
osity-4.817
Token with
Token ID351
Feature activation+1.00e+00

Pos logit contributions
 and+1.376
,+1.269
 the+1.260
.+1.174
 a+1.119
Neg logit contributions
lor-7.635
umm-7.450
rol-7.445
arians-7.391
bek-7.314
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+4.312
,+4.220
.+4.123
 the+4.107
 a+3.909
Neg logit contributions
rol-4.632
lor-4.615
umm-4.607
ocol-4.574
arians-4.459
Token bees
Token ID17002
Feature activation+1.00e+00

Pos logit contributions
 and+4.949
 the+4.862
,+4.841
.+4.734
 a+4.680
Neg logit contributions
lor-4.153
umm-4.065
arians-3.996
rol-3.989
ocol-3.928
Token couldn
Token ID3521
Feature activation+1.00e+00

Pos logit contributions
 and+3.956
 the+3.916
,+3.829
.+3.787
 a+3.739
Neg logit contributions
lor-5.140
arians-4.981
rol-4.927
umm-4.808
ides-4.781
Token't
Token ID470
Feature activation+1.00e+00

Pos logit contributions
 and+4.199
 the+4.079
,+4.046
.+3.942
 a+3.900
Neg logit contributions
lor-4.996
arians-4.770
rol-4.732
umm-4.723
ides-4.618
Token find
Token ID1064
Feature activation+1.00e+00

Pos logit contributions
 and+3.163
 the+3.133
,+3.035
 a+2.966
.+2.882
Neg logit contributions
lor-5.863
arians-5.770
ides-5.658
umm-5.615
rol-5.567
Token him
Token ID683
Feature activation+1.00e+00

Pos logit contributions
 and+3.804
,+3.586
 the+3.532
.+3.466
 a+3.349
Neg logit contributions
lor-5.405
rol-5.259
umm-5.196
arians-5.187
ocol-5.180
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+2.553
 the+2.492
,+2.386
 a+2.302
.+2.214
Neg logit contributions
lor-6.477
arians-6.267
rol-6.251
ides-6.136
umm-6.122
Token Days
Token ID12579
Feature activation+1.00e+00

Pos logit contributions
 and+4.671
 the+4.575
,+4.499
.+4.420
 a+4.364
Neg logit contributions
lor-4.556
rol-4.435
umm-4.418
arians-4.338
ocol-4.206
Token turned
Token ID2900
Feature activation+1.00e+00

Pos logit contributions
 and+4.297
 the+4.188
,+4.120
.+4.096
 a+4.065
Neg logit contributions
lor-4.827
rol-4.674
arians-4.657
umm-4.607
isher-4.529
Token into
Token ID656
Feature activation+1.00e+00

Pos logit contributions
 and+3.411
 the+3.332
,+3.246
.+3.214
 a+3.189
Neg logit contributions
rol-5.499
lor-5.489
umm-5.431
arians-5.406
osity-5.347
Token weeks
Token ID2745
Feature activation+1.00e+00

Pos logit contributions
 and+4.061
,+3.910
 the+3.868
.+3.833
 a+3.673
Neg logit contributions
lor-4.873
arians-4.844
ocol-4.825
umm-4.713
rol-4.684
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 the+0.467
 and+0.449
 a+0.393
,+0.260
.+0.237
Neg logit contributions
lor-8.433
arians-8.334
rol-8.280
umm-8.196
ides-8.154
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+2.622
,+2.571
 the+2.570
.+2.553
 a+2.455
Neg logit contributions
lor-6.272
rol-6.197
arians-6.136
umm-6.078
ocol-5.972
Token they
Token ID484
Feature activation+1.00e+00

Pos logit contributions
 and+2.999
,+2.875
.+2.839
 the+2.836
 a+2.699
Neg logit contributions
arians-5.842
osity-5.818
bek-5.733
lor-5.719
beh-5.702
Token kept
Token ID4030
Feature activation+1.00e+00

Pos logit contributions
 and+3.645
 the+3.572
,+3.520
.+3.441
 a+3.396
Neg logit contributions
lor-5.408
arians-5.319
rol-5.249
umm-5.167
ides-5.162
Token on
Token ID319
Feature activation+1.00e+00

Pos logit contributions
 and+4.303
,+4.209
 the+4.169
.+4.072
 a+4.004
Neg logit contributions
lor-4.733
arians-4.648
rol-4.569
umm-4.511
beh-4.505
Token walking
Token ID6155
Feature activation+1.00e+00

Pos logit contributions
 and+4.758
 the+4.655
,+4.643
.+4.521
 a+4.472
Neg logit contributions
lor-4.350
arians-4.242
umm-4.162
rol-4.138
osity-4.087
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.808
 the+1.729
,+1.675
 a+1.608
.+1.510
Neg logit contributions
lor-7.005
arians-6.996
rol-6.953
umm-6.839
bek-6.824
Token Days
Token ID12579
Feature activation+1.00e+00

Pos logit contributions
 and+3.868
 the+3.778
,+3.765
.+3.671
 a+3.587
Neg logit contributions
arians-5.172
lor-5.140
umm-5.119
rol-5.057
osity-5.010
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+3.677
 the+3.589
,+3.528
.+3.474
 a+3.466
Neg logit contributions
lor-5.399
arians-5.276
rol-5.205
umm-5.195
isher-5.119
Token by
Token ID416
Feature activation+1.00e+00

Pos logit contributions
 and+2.412
 the+2.290
,+2.223
 a+2.171
.+2.143
Neg logit contributions
lor-6.623
arians-6.567
rol-6.477
umm-6.448
osity-6.378
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.222
 the+1.199
,+1.112
.+1.029
 a+0.983
Neg logit contributions
lor-7.463
arians-7.457
ides-7.333
rol-7.265
osity-7.246
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+3.354
,+3.227
.+3.187
 the+3.082
 a+2.969
Neg logit contributions
arians-5.636
lor-5.610
umm-5.568
rol-5.515
bek-5.507
Token old
Token ID1468
Feature activation+1.00e+00

Pos logit contributions
 and+3.192
,+3.085
 the+3.042
.+2.945
 a+2.891
Neg logit contributions
umm-5.887
lor-5.808
arians-5.770
osity-5.676
ocol-5.628
Token G
Token ID402
Feature activation+1.00e+00

Pos logit contributions
 and+2.035
,+1.889
 the+1.800
.+1.733
 a+1.630
Neg logit contributions
lor-7.100
rol-6.922
arians-6.843
umm-6.783
ocol-6.774
Tokenwen
Token ID21006
Feature activation+1.00e+00

Pos logit contributions
 and+8.384
 the+8.298
,+8.252
.+8.136
 a+8.110
Neg logit contributions
lor-0.851
arians-0.657
umm-0.634
rol-0.616
ides-0.533
Token's
Token ID338
Feature activation+1.00e+00

Pos logit contributions
 and+2.469
 the+2.359
,+2.350
.+2.193
 a+2.153
Neg logit contributions
lor-6.684
arians-6.460
rol-6.455
umm-6.370
osity-6.335
Token popcorn
Token ID38914
Feature activation+1.00e+00

Pos logit contributions
 and+5.194
 the+5.122
,+5.073
.+4.924
 a+4.855
Neg logit contributions
lor-3.999
arians-3.808
umm-3.781
ocol-3.684
isher-3.662
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.462
 the+1.442
,+1.357
 a+1.235
.+1.172
Neg logit contributions
lor-7.670
arians-7.494
umm-7.481
rol-7.415
ides-7.342
Token Days
Token ID12579
Feature activation+1.00e+00

Pos logit contributions
 and+4.084
 the+3.997
,+3.964
.+3.868
 a+3.824
Neg logit contributions
lor-5.116
rol-4.889
umm-4.886
arians-4.871
osity-4.784
Token later
Token ID1568
Feature activation+1.00e+00

Pos logit contributions
 and+3.984
 the+3.916
,+3.848
.+3.789
 a+3.787
Neg logit contributions
lor-5.160
arians-4.959
rol-4.924
umm-4.906
ides-4.820
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+0.887
 the+0.866
,+0.727
 a+0.709
.+0.656
Neg logit contributions
lor-8.183
umm-7.992
arians-7.958
rol-7.915
ides-7.873
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+4.502
,+4.423
.+4.370
 the+4.368
 a+4.247
Neg logit contributions
lor-4.553
rol-4.352
umm-4.334
arians-4.313
osity-4.236
Token post
Token ID1281
Feature activation+1.00e+00

Pos logit contributions
 and+5.170
 the+5.075
,+5.039
.+4.933
 a+4.882
Neg logit contributions
lor-4.068
arians-3.890
umm-3.886
rol-3.818
ides-3.752
Tokenman
Token ID805
Feature activation+1.00e+00

Pos logit contributions
 and+4.405
 the+4.361
,+4.284
.+4.186
 a+4.177
Neg logit contributions
lor-4.828
arians-4.572
umm-4.529
rol-4.501
isher-4.420
Token who
Token ID508
Feature activation+1.00e+00

Pos logit contributions
 and+2.605
 the+2.579
,+2.478
 a+2.387
.+2.322
Neg logit contributions
lor-6.508
arians-6.296
rol-6.290
umm-6.223
ocol-6.195
Token were
Token ID547
Feature activation+1.00e+00

Pos logit contributions
 and+4.404
,+4.308
 the+4.294
.+4.178
 a+4.126
Neg logit contributions
lor-4.666
arians-4.516
rol-4.499
ocol-4.405
umm-4.400
Token friends
Token ID2460
Feature activation+1.00e+00

Pos logit contributions
 and+4.149
,+4.007
 the+3.972
.+3.854
 a+3.765
Neg logit contributions
lor-4.924
rol-4.819
arians-4.788
ocol-4.726
umm-4.657
Token..."
Token ID9313
Feature activation+1.00e+00

Pos logit contributions
 and+2.150
 the+2.101
,+1.998
 a+1.917
.+1.863
Neg logit contributions
lor-6.953
arians-6.770
rol-6.731
umm-6.699
ides-6.625
Token Mom
Token ID11254
Feature activation+1.00e+00

Pos logit contributions
 and+2.593
,+2.435
 the+2.414
.+2.298
 a+2.261
Neg logit contributions
lor-6.345
arians-6.240
ocol-6.233
umm-6.197
rol-6.189
Token begins
Token ID6140
Feature activation+1.00e+00

Pos logit contributions
 and+4.070
 the+4.009
,+3.974
.+3.868
 a+3.835
Neg logit contributions
lor-5.037
arians-4.885
rol-4.859
umm-4.822
ocol-4.749
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.942
 the+0.915
,+0.837
 a+0.720
.+0.642
Neg logit contributions
lor-7.934
rol-7.781
ocol-7.746
arians-7.734
ides-7.695
Token\n
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.818
,+1.744
 the+1.720
.+1.603
 a+1.560
Neg logit contributions
lor-7.158
umm-7.008
rol-6.987
arians-6.947
osity-6.891
Token\n
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.415
,+1.328
 the+1.327
.+1.187
 a+1.170
Neg logit contributions
lor-7.695
rol-7.471
arians-7.462
umm-7.405
ocol-7.337
TokenL
Token ID43
Feature activation+1.00e+00

Pos logit contributions
 and+3.637
 the+3.557
,+3.529
.+3.420
 a+3.379
Neg logit contributions
lor-5.569
arians-5.398
umm-5.344
rol-5.333
bek-5.259
Tokenily
Token ID813
Feature activation+1.00e+00

Pos logit contributions
 and+4.457
 the+4.422
,+4.363
.+4.233
 a+4.200
Neg logit contributions
lor-4.765
arians-4.562
umm-4.555
rol-4.554
ocol-4.483
Token it
Token ID340
Feature activation+1.00e+00

Pos logit contributions
 and+1.673
,+1.586
 the+1.412
.+1.401
 a+1.265
Neg logit contributions
lor-7.223
umm-7.162
ocol-7.083
rol-7.080
isher-6.978
Token in
Token ID287
Feature activation+1.00e+00

Pos logit contributions
 and+2.388
 the+2.327
,+2.250
 a+2.139
.+2.087
Neg logit contributions
lor-6.547
umm-6.412
rol-6.387
arians-6.310
ides-6.270
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.705
,+1.590
 the+1.424
.+1.420
 a+1.269
Neg logit contributions
lor-7.131
umm-7.104
ocol-7.053
arians-6.937
rol-6.930
Token garden
Token ID11376
Feature activation+1.00e+00

Pos logit contributions
 and+4.948
 the+4.837
,+4.827
.+4.662
 a+4.626
Neg logit contributions
lor-4.216
umm-4.196
rol-3.957
arians-3.922
ocol-3.918
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+1.115
 and+1.106
,+0.971
 a+0.962
.+0.796
Neg logit contributions
lor-7.886
umm-7.706
rol-7.684
arians-7.664
ides-7.518
Token Days
Token ID12579
Feature activation+1.00e+00

Pos logit contributions
 and+4.423
 the+4.335
,+4.310
.+4.216
 a+4.111
Neg logit contributions
lor-4.823
umm-4.720
osity-4.594
rol-4.556
arians-4.539
Token later
Token ID1568
Feature activation+1.00e+00

Pos logit contributions
 and+3.802
 the+3.719
,+3.659
.+3.623
 a+3.607
Neg logit contributions
lor-5.290
umm-5.115
arians-5.114
rol-5.055
isher-4.981
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.351
 the+1.318
,+1.168
 a+1.159
.+1.119
Neg logit contributions
lor-7.708
umm-7.577
arians-7.521
rol-7.457
ides-7.421
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 and+3.083
,+3.047
.+3.035
 the+2.946
 a+2.857
Neg logit contributions
lor-5.810
umm-5.796
arians-5.699
rol-5.633
osity-5.577
Token beautiful
Token ID4950
Feature activation+1.00e+00

Pos logit contributions
 and+4.346
 the+4.267
,+4.163
.+4.097
 a+4.048
Neg logit contributions
lor-4.819
umm-4.771
arians-4.633
bek-4.548
ocol-4.531
Token orange
Token ID10912
Feature activation+1.00e+00

Pos logit contributions
 and+4.659
 the+4.607
,+4.520
.+4.453
 a+4.433
Neg logit contributions
lor-4.385
umm-4.360
arians-4.156
ocol-4.153
rol-4.119
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+1.184
 and+1.128
,+1.021
 a+0.943
.+0.909
Neg logit contributions
lor-7.756
umm-7.602
rol-7.542
arians-7.529
osity-7.523
Token suggested
Token ID5220
Feature activation+1.00e+00

Pos logit contributions
 and+3.350
,+3.214
.+3.180
 the+3.148
 a+3.009
Neg logit contributions
lor-5.761
arians-5.560
rol-5.527
bek-5.496
umm-5.490
Token that
Token ID326
Feature activation+1.00e+00

Pos logit contributions
 and+2.324
 the+2.158
,+2.121
.+2.027
 a+1.992
Neg logit contributions
lor-6.790
arians-6.615
umm-6.547
rol-6.544
ocol-6.508
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+3.762
,+3.608
.+3.500
 the+3.477
 a+3.402
Neg logit contributions
lor-5.381
arians-5.200
rol-5.172
umm-5.118
ocol-5.081
Token bunny
Token ID44915
Feature activation+1.00e+00

Pos logit contributions
 and+4.902
 the+4.794
,+4.752
.+4.649
 a+4.544
Neg logit contributions
lor-4.369
arians-4.236
umm-4.203
ocol-3.971
rol-3.956
Token begins
Token ID6140
Feature activation+1.00e+00

Pos logit contributions
 and+3.902
 the+3.838
,+3.757
.+3.646
 a+3.626
Neg logit contributions
lor-5.211
arians-5.004
rol-4.986
umm-4.894
ides-4.885
Token with
Token ID351
Feature activation+1.00e+00

Pos logit contributions
 and+2.431
 the+2.358
,+2.281
 a+2.156
.+2.113
Neg logit contributions
lor-6.575
arians-6.415
rol-6.371
ocol-6.356
osity-6.338
Token small
Token ID1402
Feature activation+1.00e+00

Pos logit contributions
 and+3.132
,+2.922
.+2.796
 the+2.721
 a+2.494
Neg logit contributions
lor-6.029
ocol-5.920
rol-5.831
arians-5.804
umm-5.737
Token steps
Token ID4831
Feature activation+1.00e+00

Pos logit contributions
 and+4.908
 the+4.863
,+4.723
.+4.677
 a+4.666
Neg logit contributions
lor-4.113
arians-4.040
umm-4.018
ocol-4.005
ides-3.909
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+2.655
 the+2.606
,+2.524
 a+2.464
.+2.362
Neg logit contributions
lor-6.389
rol-6.242
arians-6.224
umm-6.135
ocol-6.133
Token He
Token ID679
Feature activation+1.00e+00

Pos logit contributions
 and+4.044
,+3.947
 the+3.942
.+3.813
 a+3.771
Neg logit contributions
lor-5.153
arians-4.964
umm-4.952
rol-4.938
osity-4.818
Token-
Token ID12
Feature activation+1.00e+00

Pos logit contributions
 and+6.388
 the+6.294
,+6.238
.+6.129
 a+6.105
Neg logit contributions
lor-2.840
arians-2.654
umm-2.623
rol-2.619
ocol-2.556
Tokendestruct
Token ID35678
Feature activation+1.00e+00

Pos logit contributions
 and+6.647
,+6.526
 the+6.521
.+6.393
 a+6.336
Neg logit contributions
lor-2.604
arians-2.375
rol-2.355
umm-2.314
osity-2.261
Token activated
Token ID13906
Feature activation+1.00e+00

Pos logit contributions
 and+6.572
 the+6.507
,+6.451
.+6.333
 a+6.303
Neg logit contributions
lor-2.653
arians-2.487
umm-2.401
rol-2.380
ides-2.354
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+3.819
 the+3.734
,+3.690
 a+3.584
.+3.547
Neg logit contributions
lor-5.219
arians-5.059
ocol-4.936
ides-4.927
rol-4.922
Token Countdown
Token ID47555
Feature activation+1.00e+00

Pos logit contributions
 and+5.040
,+4.897
 the+4.872
.+4.773
 a+4.636
Neg logit contributions
lor-4.257
arians-4.035
umm-3.975
rol-3.974
ocol-3.924
Token begins
Token ID6140
Feature activation+1.00e+00

Pos logit contributions
 and+6.458
 the+6.407
,+6.310
 a+6.207
.+6.201
Neg logit contributions
lor-2.832
arians-2.621
ides-2.519
rol-2.507
umm-2.498
Token."
Token ID526
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
 the+3.046
,+2.948
 a+2.838
.+2.791
Neg logit contributions
lor-5.992
arians-5.837
ides-5.727
osity-5.699
rol-5.696
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.749
 the+1.671
,+1.667
.+1.522
 a+1.452
Neg logit contributions
lor-7.267
arians-7.000
umm-6.949
rol-6.947
ides-6.898
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.330
,+1.219
 the+1.201
.+1.053
 a+1.030
Neg logit contributions
lor-7.792
arians-7.493
rol-7.459
umm-7.403
ides-7.373
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.779
 the+3.675
,+3.653
.+3.535
 a+3.489
Neg logit contributions
lor-5.427
arians-5.209
umm-5.148
rol-5.142
bek-5.078
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+3.243
 and+3.201
,+3.129
.+3.072
 a+3.041
Neg logit contributions
lor-5.833
arians-5.677
rol-5.568
ides-5.516
umm-5.493
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+0.607
 and+0.539
,+0.475
 a+0.421
.+0.330
Neg logit contributions
lor-8.418
arians-8.222
rol-8.187
ides-8.163
umm-8.088
Token said
Token ID531
Feature activation+1.00e+00

Pos logit contributions
 and+3.659
,+3.541
.+3.458
 the+3.412
 a+3.277
Neg logit contributions
lor-5.277
arians-5.194
umm-5.123
bek-5.119
ocol-5.099
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.020
 the+0.900
,+0.815
.+0.758
 a+0.723
Neg logit contributions
lor-8.166
arians-7.944
rol-7.934
umm-7.926
ocol-7.868
Token "
Token ID366
Feature activation+1.00e+00

Pos logit contributions
 and+2.302
 the+2.231
,+2.174
.+2.087
 a+2.080
Neg logit contributions
lor-6.700
arians-6.545
osity-6.473
rol-6.437
umm-6.405
TokenI
Token ID40
Feature activation+1.00e+00

Pos logit contributions
 and+5.982
 the+5.873
,+5.856
.+5.740
 a+5.690
Neg logit contributions
lor-3.274
arians-3.090
umm-3.083
rol-3.057
ocol-2.954
Token will
Token ID481
Feature activation+1.00e+00

Pos logit contributions
 and+4.513
 the+4.463
,+4.347
.+4.287
 a+4.235
Neg logit contributions
lor-4.625
rol-4.437
arians-4.406
umm-4.356
ides-4.350
Token scare
Token ID19437
Feature activation+1.00e+00

Pos logit contributions
 and+3.803
 the+3.693
,+3.606
.+3.505
 a+3.486
Neg logit contributions
lor-5.307
arians-5.177
ides-5.036
rol-5.020
umm-4.983
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+3.132
,+2.992
 the+2.922
.+2.877
 a+2.808
Neg logit contributions
lor-5.956
arians-5.788
rol-5.771
umm-5.733
ocol-5.708
Token every
Token ID790
Feature activation+1.00e+00

Pos logit contributions
 the+2.926
 and+2.925
,+2.778
 a+2.701
.+2.649
Neg logit contributions
lor-6.136
rol-5.938
arians-5.918
ides-5.821
umm-5.788
Token night
Token ID1755
Feature activation+1.00e+00

Pos logit contributions
 and+3.592
 the+3.509
,+3.439
.+3.314
 a+3.283
Neg logit contributions
lor-5.548
umm-5.320
rol-5.284
arians-5.265
ides-5.197
Token!"
Token ID2474
Feature activation+1.00e+00

Pos logit contributions
 the+2.334
 and+2.328
,+2.171
 a+2.160
.+1.997
Neg logit contributions
lor-6.716
arians-6.531
umm-6.528
ides-6.440
rol-6.408
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.612
,+2.529
.+2.395
 the+2.368
 a+2.291
Neg logit contributions
lor-6.420
arians-6.248
umm-6.212
rol-6.205
isher-6.174
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+0.607
 and+0.539
,+0.475
 a+0.421
.+0.330
Neg logit contributions
lor-8.418
arians-8.222
rol-8.187
ides-8.163
umm-8.088
Token said
Token ID531
Feature activation+1.00e+00

Pos logit contributions
 and+3.659
,+3.541
.+3.458
 the+3.412
 a+3.277
Neg logit contributions
lor-5.277
arians-5.194
umm-5.123
bek-5.119
ocol-5.099
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.020
 the+0.900
,+0.815
.+0.758
 a+0.723
Neg logit contributions
lor-8.166
arians-7.944
rol-7.934
umm-7.926
ocol-7.868
Token "
Token ID366
Feature activation+1.00e+00

Pos logit contributions
 and+2.302
 the+2.231
,+2.174
.+2.087
 a+2.080
Neg logit contributions
lor-6.700
arians-6.545
osity-6.473
rol-6.437
umm-6.405
TokenI
Token ID40
Feature activation+1.00e+00

Pos logit contributions
 and+5.982
 the+5.873
,+5.856
.+5.740
 a+5.690
Neg logit contributions
lor-3.274
arians-3.090
umm-3.083
rol-3.057
ocol-2.954
Token will
Token ID481
Feature activation+1.00e+00

Pos logit contributions
 and+4.513
 the+4.463
,+4.347
.+4.287
 a+4.235
Neg logit contributions
lor-4.625
rol-4.437
arians-4.406
umm-4.356
ides-4.350
Token scare
Token ID19437
Feature activation+1.00e+00

Pos logit contributions
 and+3.803
 the+3.693
,+3.606
.+3.505
 a+3.486
Neg logit contributions
lor-5.307
arians-5.177
ides-5.036
rol-5.020
umm-4.983
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+3.132
,+2.992
 the+2.922
.+2.877
 a+2.808
Neg logit contributions
lor-5.956
arians-5.788
rol-5.771
umm-5.733
ocol-5.708
Token every
Token ID790
Feature activation+1.00e+00

Pos logit contributions
 the+2.926
 and+2.925
,+2.778
 a+2.701
.+2.649
Neg logit contributions
lor-6.136
rol-5.938
arians-5.918
ides-5.821
umm-5.788
Token night
Token ID1755
Feature activation+1.00e+00

Pos logit contributions
 and+3.592
 the+3.509
,+3.439
.+3.314
 a+3.283
Neg logit contributions
lor-5.548
umm-5.320
rol-5.284
arians-5.265
ides-5.197
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.528
 the+4.466
,+4.422
 a+4.301
.+4.273
Neg logit contributions
lor-4.569
umm-4.393
arians-4.348
rol-4.328
ocol-4.305
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.924
 the+0.874
,+0.820
 a+0.766
.+0.534
Neg logit contributions
lor-8.122
rol-7.901
umm-7.830
arians-7.766
isher-7.752
Token He
Token ID679
Feature activation+1.00e+00

Pos logit contributions
 and+3.490
,+3.383
 the+3.354
.+3.232
 a+3.107
Neg logit contributions
lor-5.614
umm-5.432
arians-5.393
rol-5.370
osity-5.355
Token heard
Token ID2982
Feature activation+1.00e+00

Pos logit contributions
 and+3.679
,+3.618
 the+3.601
.+3.505
 a+3.412
Neg logit contributions
lor-5.401
arians-5.216
rol-5.124
ides-5.039
bek-5.035
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+2.246
,+2.166
.+2.014
 the+1.978
 a+1.760
Neg logit contributions
lor-6.759
rol-6.612
ocol-6.544
umm-6.527
arians-6.511
Token sound
Token ID2128
Feature activation+1.00e+00

Pos logit contributions
 and+4.225
 the+4.164
,+4.153
.+3.958
 a+3.921
Neg logit contributions
lor-4.872
umm-4.751
arians-4.731
rol-4.658
ocol-4.611
Token again
Token ID757
Feature activation+1.00e+00

Pos logit contributions
 the+1.041
 and+1.007
,+0.948
 a+0.916
.+0.800
Neg logit contributions
lor-7.716
rol-7.581
arians-7.518
umm-7.509
ides-7.490
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+0.452
 and+0.361
 a+0.359
,+0.267
.+0.074
Neg logit contributions
lor-8.264
ides-8.142
ocol-8.138
arians-8.137
umm-8.086
Token It
Token ID632
Feature activation+1.00e+00

Pos logit contributions
 and+3.388
,+3.273
 the+3.244
.+3.124
 a+3.010
Neg logit contributions
lor-5.795
umm-5.582
arians-5.536
rol-5.500
osity-5.471
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.554
,+3.458
 the+3.428
.+3.330
 a+3.242
Neg logit contributions
lor-5.526
rol-5.332
arians-5.300
umm-5.269
ides-5.158
Token coming
Token ID2406
Feature activation+1.00e+00

Pos logit contributions
 and+2.682
,+2.578
 the+2.407
.+2.377
 a+2.186
Neg logit contributions
ocol-6.143
lor-6.141
rol-6.102
umm-6.031
arians-6.017
Token heard
Token ID2982
Feature activation+1.00e+00

Pos logit contributions
 and+3.679
,+3.618
 the+3.601
.+3.505
 a+3.412
Neg logit contributions
lor-5.401
arians-5.216
rol-5.124
ides-5.039
bek-5.035
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+2.246
,+2.166
.+2.014
 the+1.978
 a+1.760
Neg logit contributions
lor-6.759
rol-6.612
ocol-6.544
umm-6.527
arians-6.511
Token sound
Token ID2128
Feature activation+1.00e+00

Pos logit contributions
 and+4.225
 the+4.164
,+4.153
.+3.958
 a+3.921
Neg logit contributions
lor-4.872
umm-4.751
arians-4.731
rol-4.658
ocol-4.611
Token again
Token ID757
Feature activation+1.00e+00

Pos logit contributions
 the+1.041
 and+1.007
,+0.948
 a+0.916
.+0.800
Neg logit contributions
lor-7.716
rol-7.581
arians-7.518
umm-7.509
ides-7.490
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+0.452
 and+0.361
 a+0.359
,+0.267
.+0.074
Neg logit contributions
lor-8.264
ides-8.142
ocol-8.138
arians-8.137
umm-8.086
Token It
Token ID632
Feature activation+1.00e+00

Pos logit contributions
 and+3.388
,+3.273
 the+3.244
.+3.124
 a+3.010
Neg logit contributions
lor-5.795
umm-5.582
arians-5.536
rol-5.500
osity-5.471
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.554
,+3.458
 the+3.428
.+3.330
 a+3.242
Neg logit contributions
lor-5.526
rol-5.332
arians-5.300
umm-5.269
ides-5.158
Token coming
Token ID2406
Feature activation+1.00e+00

Pos logit contributions
 and+2.682
,+2.578
 the+2.407
.+2.377
 a+2.186
Neg logit contributions
ocol-6.143
lor-6.141
rol-6.102
umm-6.031
arians-6.017
Token from
Token ID422
Feature activation+1.00e+00

Pos logit contributions
 and+2.280
 the+2.239
,+2.183
 a+2.032
.+2.032
Neg logit contributions
lor-6.652
rol-6.575
umm-6.566
arians-6.513
ides-6.420
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.546
,+2.479
 the+2.333
.+2.268
 a+2.166
Neg logit contributions
lor-6.320
ocol-6.259
arians-6.236
umm-6.201
isher-6.170
Token closet
Token ID21615
Feature activation+1.00e+00

Pos logit contributions
 and+4.344
 the+4.311
,+4.198
 a+4.079
.+4.065
Neg logit contributions
lor-4.701
arians-4.631
umm-4.589
ocol-4.435
rol-4.411
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+2.246
,+2.166
.+2.014
 the+1.978
 a+1.760
Neg logit contributions
lor-6.759
rol-6.612
ocol-6.544
umm-6.527
arians-6.511
Token sound
Token ID2128
Feature activation+1.00e+00

Pos logit contributions
 and+4.225
 the+4.164
,+4.153
.+3.958
 a+3.921
Neg logit contributions
lor-4.872
umm-4.751
arians-4.731
rol-4.658
ocol-4.611
Token again
Token ID757
Feature activation+1.00e+00

Pos logit contributions
 the+1.041
 and+1.007
,+0.948
 a+0.916
.+0.800
Neg logit contributions
lor-7.716
rol-7.581
arians-7.518
umm-7.509
ides-7.490
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+0.452
 and+0.361
 a+0.359
,+0.267
.+0.074
Neg logit contributions
lor-8.264
ides-8.142
ocol-8.138
arians-8.137
umm-8.086
Token It
Token ID632
Feature activation+1.00e+00

Pos logit contributions
 and+3.388
,+3.273
 the+3.244
.+3.124
 a+3.010
Neg logit contributions
lor-5.795
umm-5.582
arians-5.536
rol-5.500
osity-5.471
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.554
,+3.458
 the+3.428
.+3.330
 a+3.242
Neg logit contributions
lor-5.526
rol-5.332
arians-5.300
umm-5.269
ides-5.158
Token coming
Token ID2406
Feature activation+1.00e+00

Pos logit contributions
 and+2.682
,+2.578
 the+2.407
.+2.377
 a+2.186
Neg logit contributions
ocol-6.143
lor-6.141
rol-6.102
umm-6.031
arians-6.017
Token from
Token ID422
Feature activation+1.00e+00

Pos logit contributions
 and+2.280
 the+2.239
,+2.183
 a+2.032
.+2.032
Neg logit contributions
lor-6.652
rol-6.575
umm-6.566
arians-6.513
ides-6.420
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.546
,+2.479
 the+2.333
.+2.268
 a+2.166
Neg logit contributions
lor-6.320
ocol-6.259
arians-6.236
umm-6.201
isher-6.170
Token closet
Token ID21615
Feature activation+1.00e+00

Pos logit contributions
 and+4.344
 the+4.311
,+4.198
 a+4.079
.+4.065
Neg logit contributions
lor-4.701
arians-4.631
umm-4.589
ocol-4.435
rol-4.411
Token!
Token ID0
Feature activation+1.00e+00

Pos logit contributions
 the+2.105
 and+1.991
 a+1.892
,+1.871
.+1.685
Neg logit contributions
lor-7.116
arians-6.867
umm-6.806
rol-6.766
ides-6.709
Token at
Token ID379
Feature activation+1.00e+00

Pos logit contributions
 and+3.255
 the+3.162
,+3.139
.+3.047
 a+2.956
Neg logit contributions
lor-5.645
rol-5.633
ocol-5.557
arians-5.511
umm-5.504
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.612
,+2.529
.+2.395
 the+2.368
 a+2.291
Neg logit contributions
lor-6.420
arians-6.248
umm-6.212
rol-6.205
isher-6.174
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 the+0.607
 and+0.539
,+0.475
 a+0.421
.+0.330
Neg logit contributions
lor-8.418
arians-8.222
rol-8.187
ides-8.163
umm-8.088
Token said
Token ID531
Feature activation+1.00e+00

Pos logit contributions
 and+3.659
,+3.541
.+3.458
 the+3.412
 a+3.277
Neg logit contributions
lor-5.277
arians-5.194
umm-5.123
bek-5.119
ocol-5.099
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.020
 the+0.900
,+0.815
.+0.758
 a+0.723
Neg logit contributions
lor-8.166
arians-7.944
rol-7.934
umm-7.926
ocol-7.868
Token "
Token ID366
Feature activation+1.00e+00

Pos logit contributions
 and+2.302
 the+2.231
,+2.174
.+2.087
 a+2.080
Neg logit contributions
lor-6.700
arians-6.545
osity-6.473
rol-6.437
umm-6.405
TokenI
Token ID40
Feature activation+1.00e+00

Pos logit contributions
 and+5.982
 the+5.873
,+5.856
.+5.740
 a+5.690
Neg logit contributions
lor-3.274
arians-3.090
umm-3.083
rol-3.057
ocol-2.954
Token will
Token ID481
Feature activation+1.00e+00

Pos logit contributions
 and+4.513
 the+4.463
,+4.347
.+4.287
 a+4.235
Neg logit contributions
lor-4.625
rol-4.437
arians-4.406
umm-4.356
ides-4.350
Token scare
Token ID19437
Feature activation+1.00e+00

Pos logit contributions
 and+3.803
 the+3.693
,+3.606
.+3.505
 a+3.486
Neg logit contributions
lor-5.307
arians-5.177
ides-5.036
rol-5.020
umm-4.983
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+3.132
,+2.992
 the+2.922
.+2.877
 a+2.808
Neg logit contributions
lor-5.956
arians-5.788
rol-5.771
umm-5.733
ocol-5.708
Token every
Token ID790
Feature activation+1.00e+00

Pos logit contributions
 the+2.926
 and+2.925
,+2.778
 a+2.701
.+2.649
Neg logit contributions
lor-6.136
rol-5.938
arians-5.918
ides-5.821
umm-5.788
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.829
,+3.720
 the+3.665
.+3.615
 a+3.486
Neg logit contributions
lor-5.230
arians-5.029
rol-4.975
bek-4.919
isher-4.871
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+3.087
,+2.983
 the+2.921
 a+2.792
.+2.753
Neg logit contributions
lor-5.899
arians-5.776
ocol-5.766
rol-5.764
isher-5.761
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.547
,+1.415
.+1.240
 the+1.213
 a+1.045
Neg logit contributions
lor-7.464
ocol-7.331
umm-7.258
arians-7.234
rol-7.185
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.528
 the+4.466
,+4.422
 a+4.301
.+4.273
Neg logit contributions
lor-4.569
umm-4.393
arians-4.348
rol-4.328
ocol-4.305
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.924
 the+0.874
,+0.820
 a+0.766
.+0.534
Neg logit contributions
lor-8.122
rol-7.901
umm-7.830
arians-7.766
isher-7.752
Token He
Token ID679
Feature activation+1.00e+00

Pos logit contributions
 and+3.490
,+3.383
 the+3.354
.+3.232
 a+3.107
Neg logit contributions
lor-5.614
umm-5.432
arians-5.393
rol-5.370
osity-5.355
Token heard
Token ID2982
Feature activation+1.00e+00

Pos logit contributions
 and+3.679
,+3.618
 the+3.601
.+3.505
 a+3.412
Neg logit contributions
lor-5.401
arians-5.216
rol-5.124
ides-5.039
bek-5.035
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+2.246
,+2.166
.+2.014
 the+1.978
 a+1.760
Neg logit contributions
lor-6.759
rol-6.612
ocol-6.544
umm-6.527
arians-6.511
Token sound
Token ID2128
Feature activation+1.00e+00

Pos logit contributions
 and+4.225
 the+4.164
,+4.153
.+3.958
 a+3.921
Neg logit contributions
lor-4.872
umm-4.751
arians-4.731
rol-4.658
ocol-4.611
Token again
Token ID757
Feature activation+1.00e+00

Pos logit contributions
 the+1.041
 and+1.007
,+0.948
 a+0.916
.+0.800
Neg logit contributions
lor-7.716
rol-7.581
arians-7.518
umm-7.509
ides-7.490
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+0.452
 and+0.361
 a+0.359
,+0.267
.+0.074
Neg logit contributions
lor-8.264
ides-8.142
ocol-8.138
arians-8.137
umm-8.086
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+4.025
 the+3.967
,+3.894
.+3.755
 a+3.746
Neg logit contributions
lor-4.921
ides-4.875
arians-4.843
umm-4.828
ocol-4.797
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.523
 the+4.299
,+4.230
.+4.156
 a+4.102
Neg logit contributions
lor-4.699
arians-4.379
ocol-4.348
osity-4.330
rol-4.326
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+2.034
 the+2.004
,+1.888
 a+1.854
.+1.729
Neg logit contributions
lor-6.922
arians-6.773
rol-6.700
umm-6.662
ides-6.642
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+1.020
 and+0.995
 a+0.863
,+0.827
.+0.678
Neg logit contributions
lor-7.927
ides-7.706
arians-7.688
rol-7.687
umm-7.677
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+3.143
,+3.028
 the+2.994
.+2.860
 a+2.802
Neg logit contributions
lor-6.068
arians-5.816
rol-5.790
umm-5.758
osity-5.698
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 the+3.202
 and+3.176
,+3.111
.+3.038
 a+3.012
Neg logit contributions
lor-5.814
arians-5.669
rol-5.578
ides-5.524
umm-5.517
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.734
 the+3.662
.+3.583
 a+3.379
Neg logit contributions
lor-5.109
rol-5.055
arians-4.984
ocol-4.900
beh-4.897
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+0.567
 and+0.528
 a+0.437
,+0.405
.+0.262
Neg logit contributions
lor-8.226
umm-8.190
rol-8.119
arians-8.109
ocol-8.082
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.285
,+3.152
 the+3.123
.+2.967
 a+2.924
Neg logit contributions
lor-5.868
umm-5.676
arians-5.652
rol-5.628
osity-5.540
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.398
,+1.265
 the+1.249
.+1.096
 a+1.085
Neg logit contributions
lor-7.789
rol-7.520
arians-7.471
umm-7.400
ides-7.392
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.915
 the+3.803
,+3.773
.+3.652
 a+3.618
Neg logit contributions
lor-5.366
arians-5.138
rol-5.096
umm-5.070
bek-5.050
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.877
 the+3.825
,+3.797
.+3.738
 a+3.664
Neg logit contributions
lor-5.072
arians-4.996
umm-4.929
rol-4.873
osity-4.853
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 the+2.418
 and+2.400
,+2.267
 a+2.210
.+2.162
Neg logit contributions
lor-6.640
umm-6.442
arians-6.439
rol-6.383
ides-6.305
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.238
,+3.084
 the+3.069
.+2.977
 a+2.929
Neg logit contributions
lor-5.794
arians-5.628
rol-5.581
ocol-5.555
umm-5.545
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.448
 the+4.340
,+4.332
.+4.194
 a+4.133
Neg logit contributions
lor-4.612
umm-4.527
arians-4.524
ocol-4.451
isher-4.411
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.991
 the+3.975
,+3.864
 a+3.772
.+3.769
Neg logit contributions
lor-5.076
rol-4.849
arians-4.843
umm-4.838
ides-4.770
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.502
 the+2.409
,+2.348
.+2.230
 a+2.200
Neg logit contributions
lor-6.549
arians-6.345
ides-6.288
rol-6.270
umm-6.241
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+4.025
 the+3.967
,+3.894
.+3.755
 a+3.746
Neg logit contributions
lor-4.921
ides-4.875
arians-4.843
umm-4.828
ocol-4.797
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.523
 the+4.299
,+4.230
.+4.156
 a+4.102
Neg logit contributions
lor-4.699
arians-4.379
ocol-4.348
osity-4.330
rol-4.326
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+2.034
 the+2.004
,+1.888
 a+1.854
.+1.729
Neg logit contributions
lor-6.922
arians-6.773
rol-6.700
umm-6.662
ides-6.642
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+1.020
 and+0.995
 a+0.863
,+0.827
.+0.678
Neg logit contributions
lor-7.927
ides-7.706
arians-7.688
rol-7.687
umm-7.677
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+3.143
,+3.028
 the+2.994
.+2.860
 a+2.802
Neg logit contributions
lor-6.068
arians-5.816
rol-5.790
umm-5.758
osity-5.698
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+3.074
 the+3.049
,+2.967
 a+2.836
.+2.825
Neg logit contributions
lor-6.009
arians-5.820
rol-5.787
umm-5.756
osity-5.702
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.877
 the+3.825
,+3.797
.+3.738
 a+3.664
Neg logit contributions
lor-5.072
arians-4.996
umm-4.929
rol-4.873
osity-4.853
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 the+2.418
 and+2.400
,+2.267
 a+2.210
.+2.162
Neg logit contributions
lor-6.640
umm-6.442
arians-6.439
rol-6.383
ides-6.305
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.238
,+3.084
 the+3.069
.+2.977
 a+2.929
Neg logit contributions
lor-5.794
arians-5.628
rol-5.581
ocol-5.555
umm-5.545
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.448
 the+4.340
,+4.332
.+4.194
 a+4.133
Neg logit contributions
lor-4.612
umm-4.527
arians-4.524
ocol-4.451
isher-4.411
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.991
 the+3.975
,+3.864
 a+3.772
.+3.769
Neg logit contributions
lor-5.076
rol-4.849
arians-4.843
umm-4.838
ides-4.770
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.502
 the+2.409
,+2.348
.+2.230
 a+2.200
Neg logit contributions
lor-6.549
arians-6.345
ides-6.288
rol-6.270
umm-6.241
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+4.025
 the+3.967
,+3.894
.+3.755
 a+3.746
Neg logit contributions
lor-4.921
ides-4.875
arians-4.843
umm-4.828
ocol-4.797
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.523
 the+4.299
,+4.230
.+4.156
 a+4.102
Neg logit contributions
lor-4.699
arians-4.379
ocol-4.348
osity-4.330
rol-4.326
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+2.034
 the+2.004
,+1.888
 a+1.854
.+1.729
Neg logit contributions
lor-6.922
arians-6.773
rol-6.700
umm-6.662
ides-6.642
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+1.020
 and+0.995
 a+0.863
,+0.827
.+0.678
Neg logit contributions
lor-7.927
ides-7.706
arians-7.688
rol-7.687
umm-7.677
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.401
,+1.354
.+1.169
 the+1.096
 a+0.907
Neg logit contributions
lor-7.511
arians-7.390
rol-7.289
ocol-7.288
umm-7.238
Token ostr
Token ID42387
Feature activation+1.00e+00

Pos logit contributions
 and+4.635
 the+4.555
,+4.553
.+4.402
 a+4.336
Neg logit contributions
lor-4.532
arians-4.388
umm-4.336
rol-4.261
ocol-4.244
Tokenich
Token ID488
Feature activation+1.00e+00

Pos logit contributions
 and+7.453
 the+7.328
,+7.315
.+7.213
 a+7.150
Neg logit contributions
lor-1.705
umm-1.550
arians-1.529
rol-1.518
ocol-1.421
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+2.061
 and+2.042
,+1.943
 a+1.908
.+1.832
Neg logit contributions
lor-6.968
arians-6.795
rol-6.728
umm-6.653
osity-6.651
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+2.502
,+2.393
 the+2.386
.+2.311
 a+2.212
Neg logit contributions
lor-6.492
arians-6.276
osity-6.252
rol-6.238
umm-6.196
Token ostr
Token ID42387
Feature activation+1.00e+00

Pos logit contributions
 and+6.004
,+5.894
 the+5.885
.+5.756
 a+5.688
Neg logit contributions
lor-3.149
arians-2.984
umm-2.953
ocol-2.912
rol-2.895
Tokenich
Token ID488
Feature activation+1.00e+00

Pos logit contributions
 and+7.471
 the+7.351
,+7.332
.+7.227
 a+7.169
Neg logit contributions
lor-1.699
umm-1.530
arians-1.518
rol-1.501
ocol-1.401
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
 the+3.799
,+3.733
.+3.659
 a+3.629
Neg logit contributions
lor-5.204
arians-5.059
rol-4.989
umm-4.916
ides-4.872
Token standing
Token ID5055
Feature activation+1.00e+00

Pos logit contributions
 and+3.919
,+3.820
 the+3.742
.+3.709
 a+3.511
Neg logit contributions
rol-5.022
lor-5.010
arians-4.884
ocol-4.832
osity-4.806
Token next
Token ID1306
Feature activation+1.00e+00

Pos logit contributions
 and+2.715
 the+2.657
,+2.622
.+2.490
 a+2.481
Neg logit contributions
lor-6.332
rol-6.181
arians-6.101
umm-6.080
ocol-6.034
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.530
 the+1.490
,+1.396
 a+1.319
.+1.276
Neg logit contributions
lor-7.574
arians-7.368
umm-7.365
rol-7.320
ides-7.266
Token "
Token ID366
Feature activation+1.00e+00

Pos logit contributions
 and+2.355
 the+2.308
,+2.216
.+2.146
 a+2.119
Neg logit contributions
lor-6.618
arians-6.491
osity-6.404
umm-6.363
rol-6.330
TokenBut
Token ID1537
Feature activation+1.00e+00

Pos logit contributions
 and+5.719
 the+5.628
,+5.604
.+5.474
 a+5.432
Neg logit contributions
lor-3.550
umm-3.382
arians-3.364
rol-3.317
ides-3.221
Token I
Token ID314
Feature activation+1.00e+00

Pos logit contributions
 and+3.435
 the+3.225
,+3.214
.+3.207
 a+3.091
Neg logit contributions
lor-5.640
umm-5.509
arians-5.507
ocol-5.436
isher-5.406
Token like
Token ID588
Feature activation+1.00e+00

Pos logit contributions
 and+3.874
 the+3.827
,+3.743
.+3.652
 a+3.623
Neg logit contributions
lor-5.220
arians-5.066
umm-5.056
rol-5.021
ides-4.985
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+3.929
,+3.806
 the+3.734
.+3.693
 a+3.601
Neg logit contributions
lor-5.068
umm-4.978
ocol-4.968
arians-4.919
rol-4.868
Token mel
Token ID7758
Feature activation+1.00e+00

Pos logit contributions
 and+5.113
 the+5.028
,+4.990
.+4.864
 a+4.828
Neg logit contributions
lor-3.969
umm-3.949
arians-3.784
rol-3.760
ocol-3.749
Tokenon
Token ID261
Feature activation+1.00e+00

Pos logit contributions
 and+5.368
 the+5.304
,+5.227
 a+5.134
.+5.111
Neg logit contributions
lor-3.858
umm-3.695
arians-3.624
rol-3.596
ocol-3.546
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+1.566
 and+1.497
 a+1.340
,+1.340
.+1.154
Neg logit contributions
lor-7.414
umm-7.323
arians-7.285
rol-7.235
ides-7.210
Token It
Token ID632
Feature activation+1.00e+00

Pos logit contributions
 and+4.367
 the+4.289
,+4.247
.+4.125
 a+4.045
Neg logit contributions
lor-4.783
umm-4.725
arians-4.634
ocol-4.570
rol-4.570
Token looks
Token ID3073
Feature activation+1.00e+00

Pos logit contributions
 and+3.603
 the+3.551
,+3.487
.+3.379
 a+3.359
Neg logit contributions
lor-5.499
arians-5.335
umm-5.335
rol-5.296
ides-5.182
Token funny
Token ID8258
Feature activation+1.00e+00

Pos logit contributions
 and+3.065
 the+2.933
,+2.928
.+2.811
 a+2.710
Neg logit contributions
umm-5.772
ocol-5.768
lor-5.768
rol-5.752
arians-5.676
Token girl
Token ID2576
Feature activation+1.00e+00

Pos logit contributions
 and+4.107
 the+4.027
,+3.969
.+3.887
 a+3.846
Neg logit contributions
lor-5.005
umm-4.879
arians-4.866
rol-4.809
ocol-4.751
Token who
Token ID508
Feature activation+1.00e+00

Pos logit contributions
 and+3.453
 the+3.402
,+3.321
 a+3.220
.+3.194
Neg logit contributions
lor-5.650
arians-5.469
rol-5.450
umm-5.405
ocol-5.357
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.132
 the+3.018
,+3.007
.+2.915
 a+2.883
Neg logit contributions
lor-5.912
arians-5.790
rol-5.765
umm-5.650
bek-5.645
Token crying
Token ID13774
Feature activation+1.00e+00

Pos logit contributions
 and+4.180
 the+4.071
,+4.053
.+3.898
 a+3.835
Neg logit contributions
lor-4.865
arians-4.727
rol-4.711
umm-4.586
ocol-4.565
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+0.791
 and+0.753
,+0.642
 a+0.534
.+0.400
Neg logit contributions
lor-8.014
arians-7.946
umm-7.941
rol-7.920
osity-7.813
Token Gig
Token ID19525
Feature activation+1.00e+00

Pos logit contributions
 and+5.485
 the+5.371
,+5.326
.+5.287
 a+5.233
Neg logit contributions
lor-3.660
umm-3.505
arians-3.482
rol-3.433
osity-3.376
Tokeni
Token ID72
Feature activation+1.00e+00

Pos logit contributions
 and+6.240
 the+6.167
,+6.105
.+6.010
 a+6.010
Neg logit contributions
lor-2.984
arians-2.782
umm-2.767
rol-2.745
ocol-2.666
Token asked
Token ID1965
Feature activation+1.00e+00

Pos logit contributions
 the+2.814
 and+2.798
,+2.700
 a+2.653
.+2.641
Neg logit contributions
lor-6.214
arians-6.094
rol-5.935
ides-5.884
osity-5.878
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+2.518
 the+2.320
.+2.269
,+2.253
 a+2.220
Neg logit contributions
lor-6.540
rol-6.348
arians-6.327
ocol-6.307
umm-6.241
Token girl
Token ID2576
Feature activation+1.00e+00

Pos logit contributions
 and+3.735
 the+3.647
,+3.618
.+3.482
 a+3.421
Neg logit contributions
lor-5.506
umm-5.354
arians-5.319
rol-5.216
isher-5.172
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.756
 the+2.649
,+2.549
.+2.530
 a+2.497
Neg logit contributions
lor-6.277
arians-6.108
rol-6.092
osity-6.033
umm-6.018
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.571
 the+0.513
,+0.436
 a+0.337
.+0.238
Neg logit contributions
lor-8.470
arians-8.316
rol-8.270
umm-8.189
ides-8.175
Token One
Token ID1881
Feature activation+1.00e+00

Pos logit contributions
 and+3.566
,+3.426
 the+3.415
.+3.309
 a+3.234
Neg logit contributions
lor-5.547
umm-5.451
arians-5.428
rol-5.371
osity-5.304
Token day
Token ID1110
Feature activation+1.00e+00

Pos logit contributions
 and+2.935
 the+2.839
,+2.800
.+2.737
 a+2.640
Neg logit contributions
lor-5.973
umm-5.960
rol-5.795
osity-5.729
bek-5.729
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+0.145
 the+0.045
,+0.002
.-0.095
 a-0.139
Neg logit contributions
lor-9.047
arians-8.874
umm-8.863
rol-8.837
ocol-8.742
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.394
,+1.191
.+1.157
 the+1.144
 a+0.961
Neg logit contributions
lor-7.597
arians-7.533
umm-7.488
rol-7.459
ocol-7.433
Token rad
Token ID2511
Feature activation+1.00e+00

Pos logit contributions
 and+5.111
 the+5.005
,+4.979
.+4.892
 a+4.842
Neg logit contributions
umm-4.030
lor-3.975
arians-3.849
rol-3.794
ocol-3.775
Tokenish
Token ID680
Feature activation+1.00e+00

Pos logit contributions
 and+6.884
 the+6.786
,+6.748
.+6.644
 a+6.602
Neg logit contributions
lor-2.306
arians-2.132
umm-2.121
rol-2.094
ocol-1.996
Token's
Token ID338
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
 the+2.961
,+2.841
.+2.807
 a+2.790
Neg logit contributions
lor-6.097
arians-5.817
rol-5.803
umm-5.800
ides-5.697
Token dad
Token ID9955
Feature activation+1.00e+00

Pos logit contributions
 and+4.198
 the+4.186
,+4.054
.+3.948
 a+3.916
Neg logit contributions
umm-4.877
lor-4.876
arians-4.742
rol-4.728
ocol-4.649
Token said
Token ID531
Feature activation+1.00e+00

Pos logit contributions
 and+3.357
 the+3.325
,+3.239
.+3.194
 a+3.168
Neg logit contributions
lor-5.652
arians-5.525
rol-5.441
umm-5.407
ides-5.349
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.834
 the+1.719
,+1.661
.+1.596
 a+1.536
Neg logit contributions
lor-7.356
arians-7.171
umm-7.149
rol-7.115
osity-7.048
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 and+1.249
,+1.076
.+0.988
 the+0.969
 a+0.703
Neg logit contributions
lor-7.911
rol-7.699
ocol-7.591
arians-7.560
osity-7.555
Token big
Token ID1263
Feature activation+1.00e+00

Pos logit contributions
 and+3.823
 the+3.743
,+3.638
.+3.627
 a+3.521
Neg logit contributions
lor-5.302
umm-5.129
arians-5.120
bek-5.016
rol-5.004
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+4.354
 the+4.289
.+4.194
,+4.153
 a+4.143
Neg logit contributions
lor-4.635
umm-4.580
arians-4.516
ocol-4.473
rol-4.422
Token juicy
Token ID35075
Feature activation+1.00e+00

Pos logit contributions
 and+3.424
,+3.355
 the+3.318
.+3.276
 a+3.087
Neg logit contributions
lor-5.458
umm-5.416
ocol-5.359
rol-5.334
arians-5.315
Token b
Token ID275
Feature activation+1.00e+00

Pos logit contributions
 and+4.747
 the+4.695
,+4.591
.+4.541
 a+4.513
Neg logit contributions
lor-4.344
umm-4.252
arians-4.141
rol-4.110
ocol-4.098
Tokenerry
Token ID6996
Feature activation+1.00e+00

Pos logit contributions
 and+7.162
 the+7.075
,+7.024
.+6.919
 a+6.888
Neg logit contributions
lor-2.054
umm-1.884
arians-1.876
rol-1.850
ocol-1.761
Token on
Token ID319
Feature activation+1.00e+00

Pos logit contributions
 and+3.823
 the+3.792
,+3.693
 a+3.602
.+3.558
Neg logit contributions
lor-5.321
umm-5.158
arians-5.103
rol-5.098
ocol-4.991
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 and+0.974
,+0.863
.+0.739
 the+0.664
 a+0.491
Neg logit contributions
lor-8.042
umm-7.885
ocol-7.875
arians-7.862
rol-7.800
Token branch
Token ID8478
Feature activation+1.00e+00

Pos logit contributions
 and+4.521
 the+4.489
,+4.371
 a+4.278
.+4.269
Neg logit contributions
lor-4.658
umm-4.441
arians-4.394
rol-4.318
ocol-4.270
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 the+1.734
 and+1.707
,+1.616
 a+1.526
.+1.415
Neg logit contributions
lor-7.386
rol-7.210
umm-7.094
arians-7.073
ides-7.035
Token Tweet
Token ID18752
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
 the+3.525
,+3.470
.+3.391
 a+3.330
Neg logit contributions
lor-5.553
umm-5.351
arians-5.347
rol-5.343
osity-5.209