TOP ACTIVATIONS
MAX = 3.162

his friend . Y oga had an idea .
The string breaks . The k ite flies away . They
rolled out . Spot and Fl uffy ran after the ball
Ben slides down , his rh ino falls out of his
After a while , the k ang aroo woke up and
the boy and soon the k ite was ready to fly
was finally finished , the k ite was so proud !
cl ap the jar . Cl ap , cl ap ,
back and forth and the k ite went up and down
the ball . Spot and Fl uffy were very sad .
They look at the k ite . They see a
and smiled . She held Qu ack in her arms and
He flew the k ite for a long time
As the rabbit and the k ite said goodbye , the
up on the blanket . Fl uffy was very happy in
, their bike and their k ite . They are happy
to run , and the k ite went up , up
outside . Soon , the k ite was high in the
After a while , the k ite flew away . John
, and even her little k itty . Everywhere she went

MIN ACTIVATIONS
MIN = -3.115

the night ?" The rabbit said , " I
for me !" The surprise turned out to be
a jacket . The p anda asked , "
go see some birds !" The little orange bear was excited
too ?" The other child was a bit
shouted Bob . The elf said , " G
over there !" The two children carefully climbed down
scared ." But the first child said , "
seen it ?" The p anda shook his head
me ?" The crab replied , " I
like me ?" The crab looked at T una
luggage . The elephant asked , " What
with him . The little girl was so happy
the jungle ?" The child said , " I
he could play too . The babys itter said it was
it looks fun !" The second child said , "
there . The elf said that he had
you doing ?" The bear replied , " I
one day ?" The elf smiled and said ,
love to play ," replied the crab . L

INTERVAL 1.593 - 3.162
CONTAINS 0.293%

said , " It 's okay . We can buy another
family . The moral value of the story is
, " Yes , it sure is sweet heart ." <|endoftext|>
am sorry . I will help clean up . I will
you okay ? Let me help you . We need to

INTERVAL 0.024 - 1.593
CONTAINS 47.791%

busy boy . He was packing for a trip . He
Once upon a time , there was a little boy named
L ily and her friends played on the swings and slides
. Mom asked , " Can Tim ride in your red
boy got to play outside . <|endoftext|> Once upon a time

INTERVAL -1.545 - 0.024
CONTAINS 51.677%

to get a coat . Then she went back outside .
because there was more space now . Bob o learned that
Then , Sammy saw a huge rat and
and be part of something bigger . One night ,
stayed with Sun flower until the sun set and it was

INTERVAL -3.115 - -1.545
CONTAINS 0.238%

to give them back . The squirrel felt bad and said
Tim my . The next day , Sarah wore
head . He asks Tim and Sam if he can play
was very pinch y . The brown dog wanted to play
for being good . She said that sharing was caring and

PER-TOKEN DETAILS (TOP ACTIVATIONS)

Token " his" | Token ID 465 | Feature activation -3.33e-01
  Pos logit contributions: "ucks" +1.848, "ap" +1.783, " upon" +1.771, "apping" +1.758, "itting" +1.684
  Neg logit contributions: " gn" -2.107, "col" -2.090, " disl" -1.920, " dis" -1.816, "riger" -1.720
Token " friend" | Token ID 1545 | Feature activation -3.27e-01
  Pos logit contributions: "col" +0.890, " gn" +0.863, " disl" +0.836, "riger" +0.785, "growth" +0.780
  Neg logit contributions: "ap" -0.125, " upon" -0.093, "ucks" -0.091, "apping" -0.086, "aps" -0.066
Token "." | Token ID 13 | Feature activation +2.95e-01
  Pos logit contributions: "col" +0.545, " gn" +0.523, " disl" +0.494, "riger" +0.441, "growth" +0.441
  Neg logit contributions: "ap" -0.441, " upon" -0.416, "apping" -0.404, "ucks" -0.404, "aps" -0.382
Token "\n" | Token ID 198 | Feature activation -5.03e-01
  Pos logit contributions: "ap" +0.403, " upon" +0.381, "ucks" +0.379, "apping" +0.372, "angles" +0.359
  Neg logit contributions: "col" -0.480, " gn" -0.463, " disl" -0.433, "riger" -0.384, "growth" -0.383
Token "\n" | Token ID 198 | Feature activation +2.79e-02
  Pos logit contributions: "col" +0.849, " gn" +0.825, " disl" +0.781, "riger" +0.697, " dis" +0.685
  Neg logit contributions: "ap" -0.654, " upon" -0.634, "ucks" -0.618, "apping" -0.608, "aps" -0.583
Token "Y" | Token ID 56 | Feature activation +3.16e+00
  Pos logit contributions: "ap" +0.037, " upon" +0.035, "ucks" +0.035, "apping" +0.034, "angles" +0.032
  Neg logit contributions: "col" -0.047, " gn" -0.045, " disl" -0.043, "riger" -0.038, " dis" -0.038
Token "oga" | Token ID 10949 | Feature activation +8.87e-01
  Pos logit contributions: " upon" +4.358, "ooks" +4.150, "ucks" +4.012, "ook" +3.722, "ap" +3.719
  Neg logit contributions: "col" -5.405, " gn" -4.789, " disl" -4.239, "riger" -4.231, "growth" -3.963
Token " had" | Token ID 550 | Feature activation +5.24e-01
  Pos logit contributions: "ucks" +1.203, "ap" +1.191, " upon" +1.130, "apping" +1.125, "ooks" +1.072
  Neg logit contributions: "col" -1.448, " gn" -1.378, " disl" -1.278, " dis" -1.179, "growth" -1.145
Token " an" | Token ID 281 | Feature activation -1.87e-01
  Pos logit contributions: "ap" +0.701, "ucks" +0.674, "apping" +0.654, " upon" +0.651, "angles" +0.617
  Neg logit contributions: "col" -0.875, " gn" -0.844, " disl" -0.796, " dis" -0.709, "growth" -0.701
Token " idea" | Token ID 2126 | Feature activation +1.31e-01
  Pos logit contributions: "col" +0.302, " gn" +0.291, " disl" +0.273, "riger" +0.243, "growth" +0.241
  Neg logit contributions: "ap" -0.260, " upon" -0.245, "ucks" -0.244, "apping" -0.240, "angles" -0.228
Token "." | Token ID 13 | Feature activation +6.49e-01
  Pos logit contributions: "ap" +0.165, "ucks" +0.156, " upon" +0.155, "apping" +0.151, "angles" +0.144
  Neg logit contributions: "col" -0.227, " gn" -0.219, " disl" -0.206, " dis" -0.185, "riger" -0.184
Token " The" | Token ID 383 | Feature activation +9.51e-01
  Pos logit contributions: "ap" +0.049, " upon" +0.046, "ucks" +0.046, "apping" +0.045, "angles" +0.043
  Neg logit contributions: "col" -0.066, " gn" -0.063, " disl" -0.060, " dis" -0.053, "riger" -0.053
Token " string" | Token ID 4731 | Feature activation +1.34e+00
  Pos logit contributions: "ap" +1.358, "ucks" +1.337, " upon" +1.298, "apping" +1.291, "ooks" +1.260
  Neg logit contributions: "col" -1.459, " gn" -1.435, " disl" -1.305, " rooft" -1.212, " dis" -1.204
Token " breaks" | Token ID 9457 | Feature activation +7.65e-01
  Pos logit contributions: "ap" +1.955, "ucks" +1.936, "ooks" +1.886, "apping" +1.882, " upon" +1.861
  Neg logit contributions: "col" -2.036, " gn" -1.973, " disl" -1.756, " dis" -1.600, "growth" -1.583
Token "." | Token ID 13 | Feature activation +3.66e-01
  Pos logit contributions: "ap" +1.020, "ucks" +0.971, " upon" +0.949, "apping" +0.936, "angles" +0.897
  Neg logit contributions: "col" -1.282, " gn" -1.221, " disl" -1.095, " dis" -1.034, " rooft" -1.029
Token " The" | Token ID 383 | Feature activation +1.36e+00
  Pos logit contributions: "ap" +0.483, " upon" +0.458, "ucks" +0.454, "apping" +0.445, "angles" +0.429
  Neg logit contributions: "col" -0.608, " gn" -0.591, " disl" -0.555, "growth" -0.501, " dis" -0.497
Token " k" | Token ID 479 | Feature activation +3.12e+00
  Pos logit contributions: "ucks" +1.995, "ap" +1.986, "apping" +1.914, " upon" +1.909, "ooks" +1.872
  Neg logit contributions: "col" -2.018, " gn" -2.013, " disl" -1.797, " rooft" -1.710, " dis" -1.685
Token "ite" | Token ID 578 | Feature activation +9.65e-01
  Pos logit contributions: " upon" +4.790, "ooks" +4.657, "ap" +4.457, "angles" +4.327, "ucks" +4.206
  Neg logit contributions: "col" -4.695, " gn" -4.218, " rooft" -3.916, " pel" -3.831, "riger" -3.746
Token " flies" | Token ID 17607 | Feature activation +6.94e-01
  Pos logit contributions: "ap" +1.478, "ucks" +1.441, "apping" +1.430, " upon" +1.405, "ooks" +1.394
  Neg logit contributions: "col" -1.402, " gn" -1.359, " disl" -1.205, " dis" -1.090, "growth" -1.078
Token " away" | Token ID 1497 | Feature activation +2.72e-01
  Pos logit contributions: "ap" +1.003, "ucks" +0.966, "apping" +0.940, " upon" +0.919, "ooks" +0.898
  Neg logit contributions: "col" -1.094, " gn" -1.040, " disl" -0.936, " dis" -0.856, " rooft" -0.853
Token "." | Token ID 13 | Feature activation +4.61e-01
  Pos logit contributions: "ap" +0.324, "ucks" +0.306, " upon" +0.301, "apping" +0.295, "angles" +0.283
  Neg logit contributions: "col" -0.492, " gn" -0.478, " disl" -0.444, " dis" -0.405, "growth" -0.401
Token " They" | Token ID 1119 | Feature activation -8.00e-02
  Pos logit contributions: "ap" +0.605, " upon" +0.572, "ucks" +0.567, "apping" +0.555, "angles" +0.538
  Neg logit contributions: "col" -0.772, " gn" -0.750, " disl" -0.706, "growth" -0.637, " dis" -0.634
Token " rolled" | Token ID 11686 | Feature activation +8.01e-02
  Pos logit contributions: "ap" +0.633, "ucks" +0.610, " upon" +0.596, "apping" +0.594, "ooks" +0.568
  Neg logit contributions: "col" -0.605, " gn" -0.586, " disl" -0.528, " dis" -0.475, "growth" -0.466
Token " out" | Token ID 503 | Feature activation +7.57e-02
  Pos logit contributions: "ap" +0.111, "ucks" +0.105, " upon" +0.104, "apping" +0.103, "angles" +0.097
  Neg logit contributions: "col" -0.130, " gn" -0.125, " disl" -0.117, " dis" -0.104, "riger" -0.104
Token "." | Token ID 13 | Feature activation +2.49e-01
  Pos logit contributions: "ap" +0.074, "ucks" +0.068, " upon" +0.068, "apping" +0.066, "angles" +0.062
  Neg logit contributions: "col" -0.153, " gn" -0.149, " disl" -0.141, " dis" -0.128, "riger" -0.128
Token " Spot" | Token ID 15899 | Feature activation +1.12e-01
  Pos logit contributions: "ap" +0.334, " upon" +0.314, "ucks" +0.314, "apping" +0.308, "angles" +0.296
  Neg logit contributions: "col" -0.412, " gn" -0.401, " disl" -0.375, " dis" -0.335, "growth" -0.334
Token " and" | Token ID 290 | Feature activation -1.06e-01
  Pos logit contributions: "ap" +0.159, "ucks" +0.151, " upon" +0.150, "apping" +0.148, "angles" +0.141
  Neg logit contributions: "col" -0.176, " gn" -0.171, " disl" -0.158, " dis" -0.141, "riger" -0.140
Token " Fl" | Token ID 1610 | Feature activation +3.08e+00
  Pos logit contributions: "col" +0.183, " gn" +0.176, " disl" +0.166, "riger" +0.149, " dis" +0.148
  Neg logit contributions: "ap" -0.136, " upon" -0.129, "ucks" -0.128, "apping" -0.125, "angles" -0.118
Token "uffy" | Token ID 15352 | Feature activation +1.23e-01
  Pos logit contributions: " upon" +3.796, "angles" +3.787, "ucks" +3.686, "ooks" +3.515, " yours" +3.396
  Neg logit contributions: "col" -4.971, " gn" -4.779, " disl" -4.599, "cious" -4.172, "j" -4.166
Token " ran" | Token ID 4966 | Feature activation -1.78e-01
  Pos logit contributions: "ap" +0.177, "ucks" +0.170, " upon" +0.169, "apping" +0.166, "angles" +0.158
  Neg logit contributions: "col" -0.189, " gn" -0.182, " disl" -0.169, " dis" -0.150, "riger" -0.150
Token " after" | Token ID 706 | Feature activation +1.36e-02
  Pos logit contributions: "col" +0.296, " gn" +0.287, " disl" +0.272, "riger" +0.241, "growth" +0.240
  Neg logit contributions: "ap" -0.234, " upon" -0.223, "ucks" -0.220, "apping" -0.217, "aps" -0.205
Token " the" | Token ID 262 | Feature activation -2.24e-01
  Pos logit contributions: "ap" +0.016, " upon" +0.015, "ucks" +0.015, "apping" +0.015, "angles" +0.014
  Neg logit contributions: "col" -0.025, " gn" -0.024, " disl" -0.023, "riger" -0.020, " dis" -0.020
Token " ball" | Token ID 2613 | Feature activation +5.10e-01
  Pos logit contributions: "col" +0.391, " gn" +0.378, " disl" +0.359, "riger" +0.321, "growth" +0.319
  Neg logit contributions: "ap" -0.281, " upon" -0.266, "ucks" -0.263, "apping" -0.258, "aps" -0.244
Token " Ben" | Token ID 3932 | Feature activation -7.89e-02
  Pos logit contributions: "ap" +0.733, "ucks" +0.702, "apping" +0.688, " upon" +0.681, "aps" +0.652
  Neg logit contributions: "col" -0.723, " gn" -0.718, " disl" -0.646, " dis" -0.584, "growth" -0.570
Token " slides" | Token ID 19392 | Feature activation +5.69e-01
  Pos logit contributions: "col" +0.125, " gn" +0.120, " disl" +0.113, "riger" +0.099, " dis" +0.099
  Neg logit contributions: "ap" -0.111, " upon" -0.106, "ucks" -0.105, "apping" -0.103, "aps" -0.098
Token " down" | Token ID 866 | Feature activation +2.21e-01
  Pos logit contributions: "ap" +0.812, "ucks" +0.770, "apping" +0.765, " upon" +0.756, "ooks" +0.737
  Neg logit contributions: "col" -0.885, " gn" -0.862, " disl" -0.789, " dis" -0.705, "riger" -0.699
Token "," | Token ID 11 | Feature activation -1.03e-01
  Pos logit contributions: "ap" +0.320, "ucks" +0.304, " upon" +0.301, "apping" +0.298, "angles" +0.285
  Neg logit contributions: "col" -0.342, " gn" -0.331, " disl" -0.304, " dis" -0.272, "riger" -0.269
Token " his" | Token ID 465 | Feature activation +6.79e-01
  Pos logit contributions: "col" +0.164, " gn" +0.157, " disl" +0.148, "riger" +0.131, " dis" +0.130
  Neg logit contributions: "ap" -0.144, " upon" -0.137, "ucks" -0.136, "apping" -0.134, "aps" -0.127
Token " rh" | Token ID 9529 | Feature activation +3.06e+00
  Pos logit contributions: "ap" +0.807, "ucks" +0.774, " upon" +0.758, "apping" +0.753, "angles" +0.713
  Neg logit contributions: "col" -1.197, " gn" -1.193, " disl" -1.094, " rooft" -1.009, " dis" -1.002
Token "ino" | Token ID 2879 | Feature activation +2.13e-01
  Pos logit contributions: "ap" +4.212, " upon" +4.135, "ooks" +4.006, "ucks" +3.892, "apping" +3.785
  Neg logit contributions: "col" -4.714, " gn" -4.702, " disl" -3.875, "growth" -3.860, "handed" -3.794
Token " falls" | Token ID 8953 | Feature activation -1.69e-01
  Pos logit contributions: "ap" +0.291, "ucks" +0.279, " upon" +0.276, "apping" +0.272, "angles" +0.258
  Neg logit contributions: "col" -0.342, " gn" -0.333, " disl" -0.310, " dis" -0.276, "riger" -0.274
Token " out" | Token ID 503 | Feature activation -3.20e-01
  Pos logit contributions: "col" +0.292, " gn" +0.283, " disl" +0.269, "riger" +0.240, " dis" +0.238
  Neg logit contributions: "ap" -0.211, " upon" -0.201, "ucks" -0.199, "apping" -0.194, "aps" -0.184
Token " of" | Token ID 286 | Feature activation -3.39e-01
  Pos logit contributions: "col" +0.677, " gn" +0.660, " disl" +0.640, "riger" +0.586, " dis" +0.575
  Neg logit contributions: "ap" -0.278, " upon" -0.261, "ucks" -0.256, "apping" -0.247, "aps" -0.230
Token " his" | Token ID 465 | Feature activation +1.10e-01
  Pos logit contributions: "col" +0.632, " gn" +0.607, " disl" +0.581, "riger" +0.528, " dis" +0.516
  Neg logit contributions: "ap" -0.378, " upon" -0.363, "ucks" -0.359, "apping" -0.341, "aps" -0.327
Token " After" | Token ID 2293 | Feature activation -6.41e-01
  Pos logit contributions: "col" +0.834, " gn" +0.805, " disl" +0.760, "riger" +0.693, " dis" +0.686
  Neg logit contributions: "ap" -0.533, " upon" -0.508, "ucks" -0.497, "apping" -0.490, "aps" -0.462
Token " a" | Token ID 257 | Feature activation -2.49e-01
  Pos logit contributions: "col" +1.159, " gn" +1.102, " disl" +1.049, "riger" +0.956, "growth" +0.939
  Neg logit contributions: "ap" -0.782, " upon" -0.730, "apping" -0.704, "ucks" -0.697, "aps" -0.666
Token " while" | Token ID 981 | Feature activation +3.97e-01
  Pos logit contributions: "col" +0.449, " gn" +0.434, " disl" +0.412, "growth" +0.368, "riger" +0.367
  Neg logit contributions: "ap" -0.298, " upon" -0.282, "apping" -0.272, "ucks" -0.271, "angles" -0.254
Token "," | Token ID 11 | Feature activation +2.23e-01
  Pos logit contributions: "ap" +0.616, "ucks" +0.607, " upon" +0.589, "apping" +0.581, "angles" +0.570
  Neg logit contributions: "col" -0.565, " gn" -0.550, " disl" -0.496, " dis" -0.447, "riger" -0.439
Token " the" | Token ID 262 | Feature activation -2.86e-01
  Pos logit contributions: "ap" +0.300, "ucks" +0.287, " upon" +0.281, "apping" +0.279, "angles" +0.265
  Neg logit contributions: "col" -0.365, " gn" -0.355, " disl" -0.332, " dis" -0.296, "riger" -0.295
Token " k" | Token ID 479 | Feature activation +3.05e+00
  Pos logit contributions: "col" +0.472, " gn" +0.453, " disl" +0.430, "riger" +0.383, "growth" +0.380
  Neg logit contributions: "ap" -0.381, " upon" -0.362, "ucks" -0.356, "apping" -0.351, "aps" -0.332
Token "ang" | Token ID 648 | Feature activation +4.46e-01
  Pos logit contributions: "angles" +4.436, "ap" +4.293, "ooks" +4.292, " upon" +4.201, "ucks" +4.073
  Neg logit contributions: "col" -4.935, " gn" -4.565, " disl" -3.876, "handed" -3.773, " pel" -3.762
Token "aroo" | Token ID 38049 | Feature activation +8.12e-03
  Pos logit contributions: "ap" +0.800, "ucks" +0.776, " upon" +0.766, "apping" +0.754, "angles" +0.739
  Neg logit contributions: "col" -0.533, " gn" -0.503, " disl" -0.451, " dis" -0.385, "growth" -0.379
Token " woke" | Token ID 19092 | Feature activation +6.64e-01
  Pos logit contributions: "ap" +0.011, " upon" +0.011, "ucks" +0.011, "apping" +0.010, "angles" +0.010
  Neg logit contributions: "col" -0.013, " gn" -0.013, " disl" -0.012, "riger" -0.011, " dis" -0.010
Token " up" | Token ID 510 | Feature activation +6.24e-01
  Pos logit contributions: "ap" +0.972, "ucks" +0.925, "apping" +0.920, " upon" +0.919, "aps" +0.876
  Neg logit contributions: "col" -1.004, " gn" -0.984, " disl" -0.910, " rooft" -0.806, " dis" -0.803
Token " and" | Token ID 290 | Feature activation +5.14e-01
  Pos logit contributions: "ap" +0.755, "ucks" +0.727, "angles" +0.691, "apping" +0.690, " upon" +0.681
  Neg logit contributions: "col" -1.113, " gn" -1.096, " disl" -1.000, " dis" -0.918, " rooft" -0.905
Token " the" | Token ID 262 | Feature activation +1.42e-01
  Pos logit contributions: "ap" +0.256, "ucks" +0.241, " upon" +0.241, "apping" +0.237, "aps" +0.223
  Neg logit contributions: "col" -0.364, " gn" -0.353, " disl" -0.331, " dis" -0.299, "growth" -0.297
Token " boy" | Token ID 2933 | Feature activation +5.40e-02
  Pos logit contributions: "ap" +0.198, "ucks" +0.187, " upon" +0.187, "apping" +0.183, "angles" +0.176
  Neg logit contributions: "col" -0.228, " gn" -0.222, " disl" -0.207, " dis" -0.184, "riger" -0.182
Token " and" | Token ID 290 | Feature activation +2.70e-02
  Pos logit contributions: "ap" +0.066, "ucks" +0.063, " upon" +0.063, "apping" +0.061, "angles" +0.058
  Neg logit contributions: "col" -0.095, " gn" -0.092, " disl" -0.086, "riger" -0.077, " dis" -0.077
Token " soon" | Token ID 2582 | Feature activation -6.34e-01
  Pos logit contributions: "ap" +0.037, " upon" +0.035, "ucks" +0.035, "apping" +0.034, "angles" +0.033
  Neg logit contributions: "col" -0.044, " gn" -0.042, " disl" -0.040, " dis" -0.035, "riger" -0.035
Token " the" | Token ID 262 | Feature activation +1.14e+00
  Pos logit contributions: "col" +1.087, " gn" +1.030, " disl" +0.998, "riger" +0.887, " rooft" +0.883
  Neg logit contributions: "ap" -0.811, " upon" -0.785, "apping" -0.749, "ucks" -0.747, "angles" -0.713
Token " k" | Token ID 479 | Feature activation +3.05e+00
  Pos logit contributions: "ap" +1.655, "ucks" +1.622, "apping" +1.553, "ooks" +1.551, " upon" +1.549
  Neg logit contributions: " gn" -1.735, "col" -1.717, " disl" -1.557, " rooft" -1.443, " dis" -1.441
Token "ite" | Token ID 578 | Feature activation +9.83e-01
  Pos logit contributions: "ooks" +4.584, " upon" +4.529, "ap" +4.370, "angles" +4.344, "ucks" +4.073
  Neg logit contributions: "col" -4.620, " gn" -4.404, "riger" -3.901, " rooft" -3.755, " disl" -3.712
Token " was" | Token ID 373 | Feature activation +6.39e-01
  Pos logit contributions: "ap" +1.436, "ucks" +1.418, " upon" +1.358, "apping" +1.358, "ooks" +1.328
  Neg logit contributions: "col" -1.514, " gn" -1.470, " disl" -1.310, " dis" -1.195, " rooft" -1.160
Token " ready" | Token ID 3492 | Feature activation +8.97e-01
  Pos logit contributions: "ap" +0.836, "ucks" +0.822, " upon" +0.785, "apping" +0.765, "angles" +0.751
  Neg logit contributions: "col" -1.075, " gn" -1.043, " disl" -0.957, " dis" -0.882, " rooft" -0.860
Token " to" | Token ID 284 | Feature activation +2.04e-01
  Pos logit contributions: "ap" +1.175, "ucks" +1.155, "apping" +1.086, " upon" +1.064, "angles" +1.048
  Neg logit contributions: "col" -1.531, " gn" -1.485, " disl" -1.353, " dis" -1.246, " rooft" -1.223
Token " fly" | Token ID 6129 | Feature activation +3.98e-01
  Pos logit contributions: "ap" +0.230, "ucks" +0.219, " upon" +0.219, "apping" +0.213, "angles" +0.199
  Neg logit contributions: "col" -0.378, " gn" -0.367, " disl" -0.347, "riger" -0.313, " dis" -0.313
Token " was" | Token ID 373 | Feature activation +1.47e-01
  Pos logit contributions: "col" +0.268, " gn" +0.259, " disl" +0.246, "riger" +0.220, " dis" +0.219
  Neg logit contributions: "ap" -0.182, " upon" -0.172, "ucks" -0.170, "apping" -0.166, "angles" -0.156
Token " finally" | Token ID 3443 | Feature activation +6.74e-01
  Pos logit contributions: "ap" +0.213, "ucks" +0.204, " upon" +0.201, "apping" +0.199, "aps" +0.190
  Neg logit contributions: "col" -0.227, " gn" -0.218, " disl" -0.203, " dis" -0.181, "riger" -0.180
Token " finished" | Token ID 5201 | Feature activation +8.53e-02
  Pos logit contributions: "ap" +1.023, "ucks" +1.016, "apping" +0.959, " upon" +0.948, "aps" +0.930
  Neg logit contributions: "col" -0.998, " gn" -0.956, " disl" -0.870, " dis" -0.789, "riger" -0.780
Token "," | Token ID 11 | Feature activation +8.72e-01
  Pos logit contributions: "ap" +0.127, "ucks" +0.121, " upon" +0.121, "apping" +0.118, "angles" +0.113
  Neg logit contributions: "col" -0.128, " gn" -0.124, " disl" -0.115, " dis" -0.101, "riger" -0.101
Token " the" | Token ID 262 | Feature activation +7.76e-01
  Pos logit contributions: "ap" +1.220, "ucks" +1.215, "apping" +1.161, " upon" +1.139, "angles" +1.102
  Neg logit contributions: "col" -1.375, " gn" -1.342, " disl" -1.239, " dis" -1.122, " rooft" -1.105
Token " k" | Token ID 479 | Feature activation +3.05e+00
  Pos logit contributions: "ap" +1.088, "ucks" +1.069, " upon" +1.038, "apping" +1.024, "ooks" +0.994
  Neg logit contributions: "col" -1.198, " gn" -1.191, " disl" -1.084, " dis" -0.994, " rooft" -0.986
Token "ite" | Token ID 578 | Feature activation +8.37e-01
  Pos logit contributions: " upon" +4.420, "angles" +4.387, "ooks" +4.320, "ap" +4.305, "ucks" +4.142
  Neg logit contributions: "col" -4.677, " gn" -4.346, " pel" -3.861, " disl" -3.684, " rooft" -3.670
Token " was" | Token ID 373 | Feature activation +3.04e-01
  Pos logit contributions: "ap" +1.216, "ucks" +1.193, "apping" +1.143, " upon" +1.135, "ooks" +1.109
  Neg logit contributions: "col" -1.299, " gn" -1.269, " disl" -1.134, " dis" -1.026, "growth" -1.009
Token " so" | Token ID 523 | Feature activation +7.46e-02
  Pos logit contributions: "ap" +0.453, "ucks" +0.438, " upon" +0.427, "apping" +0.422, "angles" +0.407
  Neg logit contributions: "col" -0.456, " gn" -0.439, " disl" -0.405, " dis" -0.363, " rooft" -0.355
Token " proud" | Token ID 6613 | Feature activation +5.47e-01
  Pos logit contributions: "ap" +0.104, "ucks" +0.099, " upon" +0.099, "apping" +0.097, "angles" +0.092
  Neg logit contributions: "col" -0.120, " gn" -0.115, " disl" -0.108, " dis" -0.096, "riger" -0.095
Token "!" | Token ID 0 | Feature activation +5.48e-01
  Pos logit contributions: "ap" +0.513, "ucks" +0.474, " upon" +0.464, "apping" +0.443, "angles" +0.435
  Neg logit contributions: "col" -1.141, " gn" -1.104, " disl" -1.042, " dis" -0.954, " rooft" -0.944
Token " cl" | Token ID 537 | Feature activation +1.36e+00
  Pos logit contributions: "col" +1.076, " gn" +1.034, " disl" +0.977, "growth" +0.891, "riger" +0.891
  Neg logit contributions: "ap" -0.710, " upon" -0.647, "ucks" -0.627, "apping" -0.622, "angles" -0.590
Token "ap" | Token ID 499 | Feature activation +2.96e-01
  Pos logit contributions: " upon" +0.269, "ucks" +0.089, "angles" +0.085, "ap" +0.032, "ooks" +0.003
  Neg logit contributions: "col" -3.820, " gn" -3.651, " disl" -3.625, "riger" -3.324, "growth" -3.315
Token " the" | Token ID 262 | Feature activation +2.77e-01
  Pos logit contributions: "ap" +0.321, "ucks" +0.306, " upon" +0.302, "apping" +0.293, "angles" +0.279
  Neg logit contributions: "col" -0.559, " gn" -0.542, " disl" -0.513, " dis" -0.464, "growth" -0.460
Token " jar" | Token ID 17379 | Feature activation +4.31e-01
  Pos logit contributions: "ap" +0.379, "ucks" +0.364, " upon" +0.361, "apping" +0.354, "angles" +0.339
  Neg logit contributions: "col" -0.451, " gn" -0.436, " disl" -0.406, " dis" -0.362, "riger" -0.361
Token "." | Token ID 13 | Feature activation -2.04e-01
  Pos logit contributions: "ap" +0.450, "ucks" +0.440, " upon" +0.429, "apping" +0.407, "angles" +0.395
  Neg logit contributions: "col" -0.828, " gn" -0.800, " disl" -0.743, " dis" -0.681, "riger" -0.676
Token " Cl" | Token ID 1012 | Feature activation +3.01e+00
  Pos logit contributions: "col" +0.376, " gn" +0.365, " disl" +0.346, "riger" +0.313, " dis" +0.311
  Neg logit contributions: "ap" -0.233, " upon" -0.218, "ucks" -0.218, "apping" -0.213, "aps" -0.200
Token "ap" | Token ID 499 | Feature activation +4.82e-01
  Pos logit contributions: " upon" +3.280, "angles" +3.023, " for" +3.001, "ucks" +2.748, "ies" +2.672
  Neg logit contributions: "col" -5.864, " gn" -5.320, " disl" -5.230, "j" -4.963, "riger" -4.851
Token "," | Token ID 11 | Feature activation +1.29e-03
  Pos logit contributions: "ap" +0.602, "ucks" +0.582, " upon" +0.572, "apping" +0.557, "angles" +0.534
  Neg logit contributions: "col" -0.830, " gn" -0.795, " disl" -0.754, " dis" -0.675, "riger" -0.663
Token " cl" | Token ID 537 | Feature activation +1.14e+00
  Pos logit contributions: "ap" +0.002, " upon" +0.002, "ucks" +0.002, "apping" +0.002, "angles" +0.002
  Neg logit contributions: "col" -0.002, " gn" -0.002, " disl" -0.002, "riger" -0.002, " dis" -0.002
Token "ap" | Token ID 499 | Feature activation +5.86e-01
  Pos logit contributions: " upon" +0.309, "ucks" +0.200, "ap" +0.196, "angles" +0.186, "ooks" +0.104
  Neg logit contributions: "col" -3.089, " gn" -2.964, " disl" -2.939, "riger" -2.692, " CRE" -2.662
Token "," | Token ID 11 | Feature activation -2.47e-01
  Pos logit contributions: "ap" +0.688, "ucks" +0.674, " upon" +0.656, "apping" +0.636, "angles" +0.616
  Neg logit contributions: "col" -1.050, " gn" -1.010, " disl" -0.956, " dis" -0.857, "riger" -0.842
Token " back" | Token ID 736 | Feature activation -3.67e-01
  Pos logit contributions: "ap" +0.347, "ucks" +0.328, " upon" +0.324, "apping" +0.321, "aps" +0.303
  Neg logit contributions: "col" -0.467, " gn" -0.452, " disl" -0.422, " dis" -0.381, " rooft" -0.379
Token " and" | Token ID 290 | Feature activation -7.34e-02
  Pos logit contributions: "col" +0.696, " gn" +0.679, " disl" +0.653, "riger" +0.590, " dis" +0.586
  Neg logit contributions: "ap" -0.400, " upon" -0.379, "ucks" -0.368, "apping" -0.361, "aps" -0.340
Token " forth" | Token ID 6071 | Feature activation -1.32e-02
  Pos logit contributions: "col" +0.147, " gn" +0.143, " disl" +0.136, "riger" +0.124, " dis" +0.124
  Neg logit contributions: "ap" -0.072, " upon" -0.067, "ucks" -0.067, "apping" -0.065, "angles" -0.060
Token " and" | Token ID 290 | Feature activation -4.22e-01
  Pos logit contributions: "col" +0.023, " gn" +0.022, " disl" +0.021, "riger" +0.018, " dis" +0.018
  Neg logit contributions: "ap" -0.017, " upon" -0.016, "ucks" -0.016, "apping" -0.016, "angles" -0.015
Token " the" | Token ID 262 | Feature activation +4.89e-01
  Pos logit contributions: "col" +0.758, " gn" +0.730, " disl" +0.698, "riger" +0.626, "growth" +0.626
  Neg logit contributions: "ap" -0.503, " upon" -0.478, "ucks" -0.463, "apping" -0.458, "angles" -0.434
Token " k" | Token ID 479 | Feature activation +2.98e+00
  Pos logit contributions: "ap" +0.683, "ucks" +0.661, " upon" +0.645, "apping" +0.641, "ooks" +0.611
  Neg logit contributions: "col" -0.778, " gn" -0.755, " disl" -0.696, " rooft" -0.630, " dis" -0.626
Token "ite" | Token ID 578 | Feature activation +4.46e-01
  Pos logit contributions: " upon" +4.374, "ooks" +4.350, "ap" +4.256, "angles" +4.175, "ucks" +4.099
  Neg logit contributions: "col" -4.791, " gn" -4.198, " disl" -3.846, " rooft" -3.650, " pel" -3.594
Token " went" | Token ID 1816 | Feature activation +6.38e-01
  Pos logit contributions: "ap" +0.623, "ucks" +0.607, " upon" +0.592, "apping" +0.584, "ooks" +0.559
  Neg logit contributions: "col" -0.708, " gn" -0.684, " disl" -0.627, " dis" -0.563, "riger" -0.556
Token " up" | Token ID 510 | Feature activation -1.30e-01
  Pos logit contributions: "ap" +0.892, "ucks" +0.871, " upon" +0.831, "apping" +0.830, "angles" +0.805
  Neg logit contributions: "col" -1.031, " gn" -0.998, " disl" -0.899, " rooft" -0.820, " dis" -0.809
Token " and" | Token ID 290 | Feature activation -1.45e-01
  Pos logit contributions: "col" +0.235, " gn" +0.227, " disl" +0.216, "riger" +0.194, " dis" +0.193
  Neg logit contributions: "ap" -0.156, " upon" -0.147, "ucks" -0.145, "apping" -0.143, "aps" -0.134
Token " down" | Token ID 866 | Feature activation +7.68e-01
  Pos logit contributions: "col" +0.262, " gn" +0.254, " disl" +0.241, "riger" +0.216, "growth" +0.216
  Neg logit contributions: "ap" -0.173, " upon" -0.163, "ucks" -0.161, "apping" -0.158, "angles" -0.149
Token " the" | Token ID 262 | Feature activation +4.16e-01
  Pos logit contributions: "ap" +0.830, "ucks" +0.784, "apping" +0.784, " upon" +0.781, "aps" +0.736
  Neg logit contributions: "col" -0.981, " gn" -0.948, " disl" -0.877, " dis" -0.806, "growth" -0.805
Token " ball" | Token ID 2613 | Feature activation +3.82e-01
  Pos logit contributions: "ap" +0.533, "ucks" +0.505, " upon" +0.500, "apping" +0.493, "angles" +0.469
  Neg logit contributions: "col" -0.711, " gn" -0.692, " disl" -0.640, " dis" -0.583, " rooft" -0.583
Token "." | Token ID 13 | Feature activation +1.32e-02
  Pos logit contributions: "ap" +0.501, "ucks" +0.484, " upon" +0.472, "apping" +0.466, "ooks" +0.444
  Neg logit contributions: "col" -0.639, " gn" -0.617, " disl" -0.565, " dis" -0.515, "riger" -0.509
Token " Spot" | Token ID 15899 | Feature activation +4.76e-02
  Pos logit contributions: "ap" +0.017, " upon" +0.016, "ucks" +0.016, "apping" +0.016, "angles" +0.015
  Neg logit contributions: "col" -0.022, " gn" -0.021, " disl" -0.020, "riger" -0.018, " dis" -0.018
Token " and" | Token ID 290 | Feature activation +2.01e-02
  Pos logit contributions: "ap" +0.063, "ucks" +0.060, " upon" +0.060, "apping" +0.058, "angles" +0.055
  Neg logit contributions: "col" -0.079, " gn" -0.077, " disl" -0.072, " dis" -0.064, "riger" -0.064
Token " Fl" | Token ID 1610 | Feature activation +2.97e+00
  Pos logit contributions: "ap" +0.030, " upon" +0.028, "ucks" +0.028, "apping" +0.028, "angles" +0.026
  Neg logit contributions: "col" -0.031, " gn" -0.030, " disl" -0.028, "riger" -0.024, " dis" -0.024
Token "uffy" | Token ID 15352 | Feature activation -8.65e-03
  Pos logit contributions: " upon" +3.673, "angles" +3.641, "ucks" +3.519, "ooks" +3.441, "ap" +3.240
  Neg logit contributions: "col" -4.739, " gn" -4.504, " disl" -4.354, " pel" -4.072, " dis" -3.977
Token " were" | Token ID 547 | Feature activation +6.35e-01
  Pos logit contributions: "col" +0.014, " gn" +0.013, " disl" +0.012, "riger" +0.011, " dis" +0.011
  Neg logit contributions: "ap" -0.012, " upon" -0.012, "ucks" -0.012, "apping" -0.011, "angles" -0.011
Token " very" | Token ID 845 | Feature activation +2.57e-01
  Pos logit contributions: "ap" +1.017, "ucks" +0.999, " upon" +0.971, "apping" +0.946, "angles" +0.944
  Neg logit contributions: "col" -0.863, " gn" -0.847, " disl" -0.767, " dis" -0.707, " rooft" -0.673
Token " sad" | Token ID 6507 | Feature activation +4.23e-01
  Pos logit contributions: "ap" +0.396, "ucks" +0.380, " upon" +0.378, "apping" +0.369, "angles" +0.361
  Neg logit contributions: "col" -0.373, " gn" -0.361, " disl" -0.337, " dis" -0.297, "riger" -0.290
Token "." | Token ID 13 | Feature activation -4.34e-02
  Pos logit contributions: "ap" +0.552, "ucks" +0.526, " upon" +0.515, "apping" +0.505, "angles" +0.496
  Neg logit contributions: "col" -0.710, " gn" -0.690, " disl" -0.642, " dis" -0.583, " rooft" -0.573
Token "\n" | Token ID 198 | Feature activation +1.34e+00
  Pos logit contributions: "ap" +1.410, " upon" +1.311, "apping" +1.310, "ucks" +1.306, "angles" +1.277
  Neg logit contributions: "col" -1.256, " gn" -1.204, " disl" -1.127, " dis" -0.978, "growth" -0.972
Token "They" | Token ID 2990 | Feature activation -5.44e-01
  Pos logit contributions: "ap" +1.711, " upon" +1.648, "ucks" +1.607, "apping" +1.583, "angles" +1.550
  Neg logit contributions: "col" -2.294, " gn" -2.231, " disl" -2.119, "riger" -1.848, " CRE" -1.847
Token " look" | Token ID 804 | Feature activation +3.54e-01
  Pos logit contributions: "col" +1.002, " gn" +0.966, " disl" +0.931, "riger" +0.824, " dis" +0.823
  Neg logit contributions: "ap" -0.637, " upon" -0.589, "ucks" -0.561, "apping" -0.554, "angles" -0.539
Token " at" | Token ID 379 | Feature activation +3.27e-01
  Pos logit contributions: "ap" +0.410, "ucks" +0.389, " upon" +0.380, "apping" +0.377, "aps" +0.354
  Neg logit contributions: "col" -0.647, " gn" -0.623, " disl" -0.588, " dis" -0.534, " rooft" -0.530
Token " the" | Token ID 262 | Feature activation +2.02e+00
  Pos logit contributions: "ap" +0.375, " upon" +0.351, "ucks" +0.351, "apping" +0.349, "aps" +0.325
  Neg logit contributions: "col" -0.601, " gn" -0.579, " disl" -0.553, "growth" -0.502, " dis" -0.500
Token " k" | Token ID 479 | Feature activation +2.97e+00
  Pos logit contributions: "ucks" +3.341, "ap" +3.311, "apping" +3.229, "ooks" +3.159, " upon" +3.090
  Neg logit contributions: "col" -2.684, " gn" -2.649, " disl" -2.348, " rooft" -2.348, " dis" -2.152
Token "ite" | Token ID 578 | Feature activation +7.79e-01
  Pos logit contributions: " upon" +4.412, "ooks" +4.237, "ap" +4.138, "angles" +3.978, "ucks" +3.953
  Neg logit contributions: "col" -4.529, " gn" -3.884, " rooft" -3.635, " pel" -3.513, "riger" -3.497
Token "." | Token ID 13 | Feature activation +2.01e-01
  Pos logit contributions: "ucks" +0.999, "ap" +0.987, " upon" +0.962, "apping" +0.923, "ooks" +0.912
  Neg logit contributions: "col" -1.301, " gn" -1.259, " disl" -1.139, " rooft" -1.045, " dis" -1.037
Token " They" | Token ID 1119 | Feature activation +4.86e-03
  Pos logit contributions: "ap" +0.227, " upon" +0.214, "ucks" +0.212, "apping" +0.207, "angles" +0.196
  Neg logit contributions: "col" -0.374, " gn" -0.363, " disl" -0.344, "growth" -0.311, " dis" -0.310
Token " see" | Token ID 766 | Feature activation -1.95e-01
  Pos logit contributions: "ap" +0.006, " upon" +0.006, "ucks" +0.006, "apping" +0.006, "angles" +0.005
  Neg logit contributions: "col" -0.008, " gn" -0.008, " disl" -0.008, "riger" -0.007, " dis" -0.007
Token " a" | Token ID 257 | Feature activation +1.73e+00
  Pos logit contributions: "col" +0.339, " gn" +0.328, " disl" +0.309, "riger" +0.279, " dis" +0.275
  Neg logit contributions: "ap" -0.247, " upon" -0.232, "ucks" -0.231, "apping" -0.226, "angles" -0.214
Token " and" | Token ID 290 | Feature activation -3.51e-01
  Pos logit contributions: "col" +1.687, " gn" +1.626, " disl" +1.574, "riger" +1.454, "growth" +1.451
  Neg logit contributions: "ap" -0.839, " upon" -0.793, "apping" -0.762, "ucks" -0.733, "aps" -0.701
Token " smiled" | Token ID 13541 | Feature activation +1.37e-01
  Pos logit contributions: "col" +0.567, " gn" +0.546, " disl" +0.520, "growth" +0.461, "riger" +0.458
  Neg logit contributions: "ap" -0.485, " upon" -0.455, "ucks" -0.452, "apping" -0.448, "angles" -0.423
Token "." | Token ID 13 | Feature activation +8.66e-02
  Pos logit contributions: "ap" +0.168, "ucks" +0.159, " upon" +0.158, "apping" +0.154, "angles" +0.148
  Neg logit contributions: "col" -0.241, " gn" -0.235, " disl" -0.220, " dis" -0.198, "riger" -0.197
Token " She" | Token ID 1375 | Feature activation -5.14e-01
  Pos logit contributions: "ap" +0.092, " upon" +0.085, "ucks" +0.085, "apping" +0.083, "angles" +0.078
  Neg logit contributions: "col" -0.167, " gn" -0.163, " disl" -0.154, " dis" -0.140, "riger" -0.139
Token " held" | Token ID 2714 | Feature activation +1.35e-01
  Pos logit contributions: "col" +0.837, " gn" +0.804, " disl" +0.770, "riger" +0.688, "growth" +0.679
  Neg logit contributions: "ap" -0.693, " upon" -0.661, "ucks" -0.654, "apping" -0.638, "angles" -0.609
Token " Qu" | Token ID 2264 | Feature activation +2.96e+00
  Pos logit contributions: "ap" +0.165, " upon" +0.156, "ucks" +0.155, "apping" +0.153, "angles" +0.143
  Neg logit contributions: "col" -0.237, " gn" -0.229, " disl" -0.215, " dis" -0.193, "riger" -0.193
Token "ack" | Token ID 441 | Feature activation -2.86e-01
  Pos logit contributions: " upon" +3.095, "ucks" +3.084, "ooks" +3.040, "angles" +2.997, " for" +2.928
  Neg logit contributions: "col" -5.253, " disl" -4.832, " gn" -4.645, "j" -4.163, "riger" -4.139
Token " in" | Token ID 287 | Feature activation +2.61e-02
  Pos logit contributions: "col" +0.556, " gn" +0.542, " disl" +0.517, "riger" +0.470, " dis" +0.467
  Neg logit contributions: "ap" -0.301, " upon" -0.280, "ucks" -0.276, "apping" -0.271, "aps" -0.252
Token " her" | Token ID 607 | Feature activation +3.24e-02
  Pos logit contributions: "ap" +0.027, " upon" +0.025, "ucks" +0.025, "apping" +0.025, "angles" +0.023
  Neg logit contributions: "col" -0.051, " gn" -0.050, " disl" -0.047, " dis" -0.043, "riger" -0.043
Token " arms" | Token ID 5101 | Feature activation +1.53e-01
  Pos logit contributions: "ap" +0.033, "ucks" +0.031, " upon" +0.031, "apping" +0.030, "angles" +0.028
  Neg logit contributions: "col" -0.064, " gn" -0.062, " disl" -0.059, "riger" -0.053, " dis" -0.053
Token " and" | Token ID 290 | Feature activation -9.04e-01
  Pos logit contributions: "ap" +0.169, "ucks" +0.160, " upon" +0.157, "apping" +0.153, "angles" +0.146
  Neg logit contributions: "col" -0.286, " gn" -0.277, " disl" -0.261, " dis" -0.237, "growth" -0.236
Token "\n" | Token ID 198 | Feature activation -7.33e-01
  Pos logit contributions: "ap" +1.088, "ucks" +1.030, " upon" +1.016, "apping" +1.004, "angles" +0.991
  Neg logit contributions: "col" -1.046, " gn" -0.998, " disl" -0.933, " dis" -0.817, "growth" -0.808
Token "\n" | Token ID 198 | Feature activation -8.51e-02
  Pos logit contributions: "col" +1.266, " gn" +1.221, " disl" +1.162, "riger" +1.044, " rooft" +1.025
  Neg logit contributions: "ap" -0.924, " upon" -0.908, "ucks" -0.881, "apping" -0.869, "aps" -0.835
Token "He" | Token ID 1544 | Feature activation +1.12e-01
  Pos logit contributions: "col" +0.149, " gn" +0.144, " disl" +0.136, "riger" +0.122, " dis" +0.122
  Neg logit contributions: "ap" -0.106, " upon" -0.100, "ucks" -0.100, "apping" -0.098, "angles" -0.092
Token " flew" | Token ID 13112 | Feature activation +5.21e-01
  Pos logit contributions: "ap" +0.160, "ucks" +0.153, " upon" +0.152, "apping" +0.149, "aps" +0.142
  Neg logit contributions: "col" -0.176, " gn" -0.170, " disl" -0.158, " dis" -0.140, "riger" -0.139
Token " the" | Token ID 262 | Feature activation +4.87e-01
  Pos logit contributions: "ap" +0.766, "ucks" +0.738, "apping" +0.713, " upon" +0.706, "angles" +0.684
  Neg logit contributions: "col" -0.795, " gn" -0.766, " disl" -0.693, " rooft" -0.621, " dis" -0.618
Token " k" | Token ID 479 | Feature activation +2.96e+00
  Pos logit contributions: "ap" +0.713, "ucks" +0.691, "apping" +0.669, " upon" +0.667, "ooks" +0.639
  Neg logit contributions: "col" -0.755, " gn" -0.732, " disl" -0.665, " rooft" -0.608, "riger" -0.606
Token "ite" | Token ID 578 | Feature activation +8.25e-01
  Pos logit contributions: "ooks" +4.265, " upon" +4.206, "ap" +4.186, "angles" +4.172, "ucks" +3.942
  Neg logit contributions: "col" -4.676, " gn" -4.111, " pel" -3.558, "riger" -3.548, " rooft" -3.548
Token " for" | Token ID 329 | Feature activation -6.30e-01
  Pos logit contributions: "ucks" +1.208, "ap" +1.180, " upon" +1.131, "apping" +1.115, "ooks" +1.105
  Neg logit contributions: "col" -1.246, " gn" -1.205, " disl" -1.050, " rooft" -0.962, "growth" -0.956
Token " a" | Token ID 257 | Feature activation -3.94e-01
  Pos logit contributions: "col" +1.096, " gn" +1.039, " disl" +0.999, "riger" +0.898, " dis" +0.868
  Neg logit contributions: "ap" -0.792, " upon" -0.750, "ucks" -0.738, "apping" -0.716, "aps" -0.691
Token " long" | Token ID 890 | Feature activation -1.04e+00
  Pos logit contributions: "col" +0.638, " gn" +0.612, " disl" +0.582, "growth" +0.512, " dis" +0.508
  Neg logit contributions: "ap" -0.540, " upon" -0.521, "apping" -0.499, "ucks" -0.489, "angles" -0.466
Token " time" | Token ID 640 | Feature activation -4.09e-01
  Pos logit contributions: "col" +1.856, " gn" +1.797, " disl" +1.733, "riger" +1.532, " dis" +1.515
  Neg logit contributions: "ap" -1.294, " upon" -1.204, "apping" -1.147, "ucks" -1.068, "angles" -1.067
Token As
Token ID1081
Feature activation+3.86e-02

Pos logit contributions
ap+0.575
ucks+0.546
 upon+0.540
apping+0.530
angles+0.515
Neg logit contributions
col-0.683
 gn-0.659
 disl-0.620
growth-0.550
 dis-0.549
Token the
Token ID262
Feature activation+1.55e-01

Pos logit contributions
ap+0.053
ucks+0.050
 upon+0.050
apping+0.049
angles+0.047
Neg logit contributions
col-0.062
 gn-0.060
 disl-0.057
 dis-0.050
riger-0.050
Token rabbit
Token ID22746
Feature activation-3.14e-01

Pos logit contributions
ap+0.212
ucks+0.202
 upon+0.201
apping+0.197
angles+0.188
Neg logit contributions
col-0.252
 gn-0.244
 disl-0.228
 dis-0.204
riger-0.203
Token and
Token ID290
Feature activation+5.70e-01

Pos logit contributions
col+0.509
 gn+0.489
 disl+0.466
riger+0.411
 dis+0.407
Neg logit contributions
ap-0.428
 upon-0.409
ucks-0.400
apping-0.394
aps-0.373
Token the
Token ID262
Feature activation+3.92e-01

Pos logit contributions
ap+0.750
ucks+0.715
apping+0.707
 upon+0.700
ooks+0.666
Neg logit contributions
col-0.955
 gn-0.943
 disl-0.874
 dis-0.783
growth-0.781
Token k
Token ID479
Feature activation+2.96e+00

Pos logit contributions
ap+0.545
ucks+0.522
 upon+0.516
apping+0.509
angles+0.487
Neg logit contributions
col-0.628
 gn-0.610
 disl-0.563
 dis-0.508
 rooft-0.508
Tokenite
Token ID578
Feature activation+5.05e-01

Pos logit contributions
ooks+4.215
angles+4.140
 upon+4.096
ap+4.048
ucks+3.951
Neg logit contributions
col-4.748
 gn-4.161
 pel-3.726
 disl-3.674
 rooft-3.674
Token said
Token ID531
Feature activation+1.07e-01

Pos logit contributions
ap+0.712
ucks+0.694
 upon+0.672
apping+0.671
ooks+0.644
Neg logit contributions
col-0.788
 gn-0.762
 disl-0.698
 dis-0.630
riger-0.621
Token goodbye
Token ID24829
Feature activation+1.30e-01

Pos logit contributions
ap+0.180
ucks+0.173
 upon+0.172
apping+0.169
angles+0.163
Neg logit contributions
col-0.140
 gn-0.134
 disl-0.123
 dis-0.106
riger-0.105
Token,
Token ID11
Feature activation+8.40e-01

Pos logit contributions
ap+0.161
ucks+0.153
 upon+0.152
apping+0.148
angles+0.141
Neg logit contributions
col-0.227
 gn-0.220
 disl-0.207
 dis-0.185
riger-0.185
Token the
Token ID262
Feature activation+3.51e-01

Pos logit contributions
ap+1.223
ucks+1.202
 upon+1.144
apping+1.140
angles+1.117
Neg logit contributions
col-1.286
 gn-1.264
 disl-1.144
 dis-1.053
 rooft-1.023
Token up
Token ID510
Feature activation-2.80e-01

Pos logit contributions
col+0.460
 gn+0.444
 disl+0.422
riger+0.376
 dis+0.373
Neg logit contributions
ap-0.355
 upon-0.338
ucks-0.337
apping-0.326
aps-0.311
Token on
Token ID319
Feature activation-2.05e-01

Pos logit contributions
col+0.509
 gn+0.491
 disl+0.474
riger+0.425
growth+0.421
Neg logit contributions
ap-0.329
 upon-0.315
ucks-0.306
apping-0.301
aps-0.285
Token the
Token ID262
Feature activation-8.80e-01

Pos logit contributions
col+0.360
 gn+0.346
 disl+0.331
riger+0.295
 dis+0.293
Neg logit contributions
ap-0.254
 upon-0.242
ucks-0.239
apping-0.232
aps-0.222
Token blanket
Token ID18447
Feature activation-8.04e-01

Pos logit contributions
col+1.393
 gn+1.337
 disl+1.326
riger+1.138
growth+1.120
Neg logit contributions
ap-1.197
 upon-1.189
ucks-1.136
apping-1.095
aps-1.066
Token.
Token ID13
Feature activation+6.50e-02

Pos logit contributions
col+1.515
 gn+1.477
 disl+1.462
riger+1.295
 dis+1.276
Neg logit contributions
ap-0.896
 upon-0.842
apping-0.818
ucks-0.783
aps-0.758
Token Fl
Token ID1610
Feature activation+2.95e+00

Pos logit contributions
ap+0.078
 upon+0.073
ucks+0.073
apping+0.072
angles+0.068
Neg logit contributions
col-0.116
 gn-0.113
 disl-0.107
riger-0.096
 dis-0.095
Tokenuffy
Token ID15352
Feature activation-2.98e-01

Pos logit contributions
angles+3.882
 upon+3.655
ucks+3.526
 yours+3.458
ooks+3.453
Neg logit contributions
 gn-4.569
col-4.511
 disl-4.181
riger-3.812
j-3.755
Token was
Token ID373
Feature activation+5.71e-02

Pos logit contributions
col+0.483
 gn+0.463
 disl+0.443
riger+0.390
growth+0.386
Neg logit contributions
ap-0.406
 upon-0.388
ucks-0.383
apping-0.375
aps-0.358
Token very
Token ID845
Feature activation-1.14e-01

Pos logit contributions
ap+0.088
ucks+0.084
 upon+0.083
apping+0.082
angles+0.079
Neg logit contributions
col-0.083
 gn-0.080
 disl-0.075
 dis-0.065
riger-0.065
Token happy
Token ID3772
Feature activation+5.67e-01

Pos logit contributions
col+0.195
 gn+0.188
 disl+0.177
growth+0.158
riger+0.158
Neg logit contributions
ap-0.149
 upon-0.141
ucks-0.140
apping-0.138
aps-0.131
Token in
Token ID287
Feature activation+4.89e-01

Pos logit contributions
ap+0.710
ucks+0.673
 upon+0.654
apping+0.643
angles+0.637
Neg logit contributions
col-0.991
 gn-0.974
 disl-0.898
 dis-0.810
 rooft-0.800
Token,
Token ID11
Feature activation+8.67e-01

Pos logit contributions
ap+0.180
 upon+0.170
ucks+0.169
apping+0.165
angles+0.156
Neg logit contributions
col-0.285
 gn-0.276
 disl-0.259
 dis-0.234
growth-0.233
Token their
Token ID511
Feature activation+4.52e-01

Pos logit contributions
ap+1.141
ucks+1.087
apping+1.058
 upon+1.032
angles+1.013
Neg logit contributions
col-1.468
 gn-1.443
 disl-1.343
 dis-1.225
 rooft-1.176
Token bike
Token ID7161
Feature activation+8.07e-01

Pos logit contributions
ap+0.596
ucks+0.572
 upon+0.564
apping+0.561
ooks+0.529
Neg logit contributions
col-0.749
 gn-0.727
 disl-0.686
 dis-0.611
 rooft-0.610
Token and
Token ID290
Feature activation-1.42e-01

Pos logit contributions
ap+1.156
ucks+1.126
 upon+1.111
apping+1.093
ooks+1.059
Neg logit contributions
col-1.230
 gn-1.220
 disl-1.084
 dis-0.971
riger-0.969
Token their
Token ID511
Feature activation+5.52e-01

Pos logit contributions
col+0.251
 gn+0.243
 disl+0.229
riger+0.206
 dis+0.205
Neg logit contributions
ap-0.176
 upon-0.165
ucks-0.164
apping-0.160
angles-0.152
Token k
Token ID479
Feature activation+2.95e+00

Pos logit contributions
ap+0.742
ucks+0.713
apping+0.702
 upon+0.702
ooks+0.662
Neg logit contributions
col-0.902
 gn-0.873
 disl-0.828
 dis-0.733
 rooft-0.730
Tokenite
Token ID578
Feature activation+9.47e-01

Pos logit contributions
 upon+3.981
ap+3.968
ooks+3.913
angles+3.788
apping+3.651
Neg logit contributions
col-4.825
 gn-4.334
 disl-3.987
 rooft-3.820
riger-3.677
Token.
Token ID13
Feature activation+3.07e-02

Pos logit contributions
ap+1.271
ucks+1.265
 upon+1.236
apping+1.211
ooks+1.188
Neg logit contributions
col-1.524
 gn-1.486
 disl-1.348
 dis-1.203
 rooft-1.194
Token They
Token ID1119
Feature activation-3.00e-01

Pos logit contributions
ap+0.030
 upon+0.028
ucks+0.027
apping+0.027
angles+0.025
Neg logit contributions
col-0.062
 gn-0.061
 disl-0.058
riger-0.052
 dis-0.052
Token are
Token ID389
Feature activation+4.73e-01

Pos logit contributions
col+0.556
 gn+0.537
 disl+0.512
riger+0.460
 dis+0.459
Neg logit contributions
ap-0.343
 upon-0.322
ucks-0.315
apping-0.306
angles-0.294
Token happy
Token ID3772
Feature activation+1.00e+00

Pos logit contributions
ap+0.656
ucks+0.634
 upon+0.619
apping+0.607
angles+0.584
Neg logit contributions
col-0.754
 gn-0.734
 disl-0.679
 dis-0.614
 rooft-0.598
Token to
Token ID284
Feature activation-7.19e-01

Pos logit contributions
col+0.320
 gn+0.310
 disl+0.295
riger+0.266
growth+0.265
Neg logit contributions
ap-0.196
 upon-0.184
ucks-0.183
apping-0.179
aps-0.168
Token run
Token ID1057
Feature activation-9.50e-01

Pos logit contributions
col+1.315
 gn+1.243
 disl+1.189
growth+1.077
riger+1.073
Neg logit contributions
ap-0.861
 upon-0.786
ucks-0.759
apping-0.741
angles-0.722
Token,
Token ID11
Feature activation-4.62e-01

Pos logit contributions
col+1.798
 gn+1.766
 disl+1.732
growth+1.533
 dis+1.525
Neg logit contributions
ap-1.052
 upon-0.997
ucks-0.948
apping-0.941
aps-0.885
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
col+0.792
 gn+0.757
 disl+0.728
growth+0.651
riger+0.642
Neg logit contributions
ap-0.593
 upon-0.566
apping-0.545
ucks-0.545
angles-0.512
Token the
Token ID262
Feature activation+9.96e-01

Pos logit contributions
col+1.864
 gn+1.765
 disl+1.738
growth+1.556
riger+1.533
Neg logit contributions
ap-1.302
 upon-1.245
apping-1.166
ucks-1.138
aps-1.108
Token k
Token ID479
Feature activation+2.95e+00

Pos logit contributions
ap+1.487
ucks+1.448
apping+1.411
 upon+1.389
ooks+1.368
Neg logit contributions
col-1.497
 gn-1.477
 disl-1.322
 rooft-1.244
 dis-1.231
Tokenite
Token ID578
Feature activation+6.91e-01

Pos logit contributions
 upon+4.405
ooks+4.335
ap+4.228
angles+4.183
ucks+3.848
Neg logit contributions
col-4.525
 gn-4.075
 rooft-3.652
 pel-3.650
 disl-3.588
Token went
Token ID1816
Feature activation+5.12e-01

Pos logit contributions
ap+0.971
ucks+0.954
 upon+0.928
apping+0.917
ooks+0.881
Neg logit contributions
col-1.094
 gn-1.062
 disl-0.953
 dis-0.866
riger-0.847
Token up
Token ID510
Feature activation-1.68e-01

Pos logit contributions
ap+0.708
ucks+0.679
apping+0.658
 upon+0.657
angles+0.627
Neg logit contributions
col-0.837
 gn-0.810
 disl-0.735
 rooft-0.669
riger-0.663
Token,
Token ID11
Feature activation+3.86e-01

Pos logit contributions
col+0.295
 gn+0.285
 disl+0.272
riger+0.243
 dis+0.241
Neg logit contributions
ap-0.208
 upon-0.197
ucks-0.194
apping-0.191
aps-0.180
Token up
Token ID510
Feature activation+5.97e-01

Pos logit contributions
ap+0.503
ucks+0.483
 upon+0.472
apping+0.467
ooks+0.444
Neg logit contributions
col-0.647
 gn-0.630
 disl-0.583
 dis-0.524
riger-0.522
Token outside
Token ID2354
Feature activation+8.87e-02

Pos logit contributions
col+0.021
 gn+0.020
 disl+0.019
riger+0.017
 dis+0.017
Neg logit contributions
ap-0.019
 upon-0.018
ucks-0.018
apping-0.017
angles-0.016
Token.
Token ID13
Feature activation-7.62e-04

Pos logit contributions
ap+0.107
ucks+0.101
 upon+0.100
apping+0.098
angles+0.093
Neg logit contributions
col-0.158
 gn-0.153
 disl-0.144
 dis-0.130
riger-0.130
Token Soon
Token ID15894
Feature activation-2.11e-01

Pos logit contributions
col+0.001
 gn+0.001
 disl+0.001
riger+0.001
 dis+0.001
Neg logit contributions
ap-0.001
 upon-0.001
ucks-0.001
apping-0.001
angles-0.001
Token,
Token ID11
Feature activation+6.60e-02

Pos logit contributions
col+0.330
 gn+0.318
 disl+0.300
riger+0.262
growth+0.261
Neg logit contributions
ap-0.303
 upon-0.287
ucks-0.284
apping-0.280
angles-0.267
Token the
Token ID262
Feature activation+9.50e-01

Pos logit contributions
ap+0.088
ucks+0.083
 upon+0.083
apping+0.081
aps+0.077
Neg logit contributions
col-0.110
 gn-0.106
 disl-0.100
riger-0.089
 dis-0.089
Token k
Token ID479
Feature activation+2.94e+00

Pos logit contributions
ap+1.468
ucks+1.442
apping+1.406
 upon+1.389
ooks+1.364
Neg logit contributions
col-1.359
 gn-1.349
 disl-1.193
 rooft-1.110
 dis-1.099
Tokenite
Token ID578
Feature activation+1.09e+00

Pos logit contributions
ap+4.215
 upon+4.050
angles+3.963
ooks+3.829
ags+3.656
Neg logit contributions
col-5.035
 gn-4.712
riger-3.928
 rooft-3.875
 pel-3.844
Token was
Token ID373
Feature activation+4.82e-01

Pos logit contributions
ap+1.633
ucks+1.630
apping+1.540
ooks+1.534
 upon+1.533
Neg logit contributions
col-1.644
 gn-1.573
 disl-1.374
 dis-1.271
 pel-1.248
Token high
Token ID1029
Feature activation-4.61e-01

Pos logit contributions
ap+0.650
ucks+0.632
 upon+0.606
apping+0.601
angles+0.577
Neg logit contributions
col-0.790
 gn-0.761
 disl-0.703
 dis-0.642
 rooft-0.632
Token in
Token ID287
Feature activation-3.98e-01

Pos logit contributions
col+0.858
 gn+0.828
 disl+0.807
riger+0.718
 dis+0.715
Neg logit contributions
ap-0.521
 upon-0.489
ucks-0.478
apping-0.470
angles-0.443
Token the
Token ID262
Feature activation+5.93e-01

Pos logit contributions
col+0.704
 gn+0.685
 disl+0.654
riger+0.588
 dis+0.576
Neg logit contributions
ap-0.481
 upon-0.459
ucks-0.451
apping-0.436
angles-0.417
TokenAfter
Token ID3260
Feature activation-4.15e-01

Pos logit contributions
col+0.318
 gn+0.309
 disl+0.290
riger+0.260
growth+0.258
Neg logit contributions
ap-0.232
 upon-0.220
ucks-0.218
apping-0.214
aps-0.201
Token a
Token ID257
Feature activation-3.44e-01

Pos logit contributions
col+0.734
 gn+0.697
 disl+0.671
riger+0.599
 rooft+0.588
Neg logit contributions
ap-0.511
 upon-0.485
ucks-0.470
apping-0.468
aps-0.442
Token while
Token ID981
Feature activation+1.36e-01

Pos logit contributions
col+0.626
 gn+0.600
 disl+0.575
growth+0.514
riger+0.509
Neg logit contributions
ap-0.405
 upon-0.386
apping-0.369
ucks-0.366
angles-0.343
Token,
Token ID11
Feature activation+2.18e-01

Pos logit contributions
ap+0.207
ucks+0.199
 upon+0.197
apping+0.194
angles+0.186
Neg logit contributions
col-0.201
 gn-0.194
 disl-0.179
 dis-0.158
riger-0.157
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
ap+0.310
ucks+0.297
 upon+0.291
apping+0.289
aps+0.276
Neg logit contributions
col-0.340
 gn-0.328
 disl-0.307
 dis-0.274
growth-0.272
Token k
Token ID479
Feature activation+2.94e+00

Pos logit contributions
ap+1.536
ucks+1.513
 upon+1.486
apping+1.461
ooks+1.459
Neg logit contributions
col-1.570
 gn-1.536
 disl-1.369
 rooft-1.317
 dis-1.295
Tokenite
Token ID578
Feature activation+8.55e-01

Pos logit contributions
ap+4.325
ooks+4.325
 upon+4.325
angles+4.300
ucks+3.997
Neg logit contributions
col-4.580
 gn-4.189
 rooft-3.689
 pel-3.664
riger-3.603
Token flew
Token ID13112
Feature activation+1.04e+00

Pos logit contributions
ap+1.257
ucks+1.245
apping+1.186
 upon+1.173
ooks+1.160
Neg logit contributions
col-1.306
 gn-1.267
 disl-1.119
 dis-1.033
growth-1.016
Token away
Token ID1497
Feature activation+3.01e-01

Pos logit contributions
ap+1.602
ucks+1.576
apping+1.500
aps+1.454
ooks+1.451
Neg logit contributions
col-1.560
 gn-1.463
 disl-1.266
 rooft-1.211
 dis-1.171
Token.
Token ID13
Feature activation+3.77e-01

Pos logit contributions
ap+0.384
ucks+0.367
 upon+0.357
apping+0.352
angles+0.342
Neg logit contributions
col-0.517
 gn-0.503
 disl-0.464
 dis-0.422
 rooft-0.416
Token John
Token ID1757
Feature activation+1.25e-01

Pos logit contributions
ap+0.547
ucks+0.519
 upon+0.515
apping+0.509
angles+0.486
Neg logit contributions
col-0.583
 gn-0.561
 disl-0.531
 dis-0.471
growth-0.468
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
ap+0.526
ucks+0.517
 upon+0.499
apping+0.487
angles+0.479
Neg logit contributions
col-0.706
 gn-0.683
 disl-0.637
 dis-0.577
riger-0.571
Token and
Token ID290
Feature activation+1.18e-01

Pos logit contributions
ap+1.624
ucks+1.573
apping+1.504
ooks+1.469
angles+1.445
Neg logit contributions
col-2.094
 gn-2.059
 disl-1.896
 dis-1.752
 rooft-1.680
Token even
Token ID772
Feature activation-5.35e-01

Pos logit contributions
ap+0.162
ucks+0.154
 upon+0.153
apping+0.150
aps+0.143
Neg logit contributions
col-0.190
 gn-0.183
 disl-0.172
 dis-0.152
riger-0.151
Token her
Token ID607
Feature activation-1.53e-01

Pos logit contributions
col+0.929
 gn+0.895
 disl+0.853
riger+0.771
growth+0.765
Neg logit contributions
ap-0.673
 upon-0.643
ucks-0.620
apping-0.616
angles-0.579
Token little
Token ID1310
Feature activation+1.05e+00

Pos logit contributions
col+0.309
 gn+0.299
 disl+0.285
riger+0.260
growth+0.259
Neg logit contributions
ap-0.149
 upon-0.138
ucks-0.136
apping-0.134
aps-0.124
Token k
Token ID479
Feature activation+2.93e+00

Pos logit contributions
ucks+1.059
ap+1.056
 upon+1.026
angles+1.004
apping+0.990
Neg logit contributions
col-2.014
 gn-2.006
 disl-1.856
 rooft-1.696
 dis-1.689
Tokenitty
Token ID9760
Feature activation+9.30e-01

Pos logit contributions
 upon+3.986
angles+3.963
ap+3.931
ooks+3.891
ucks+3.684
Neg logit contributions
col-4.966
 gn-4.472
 disl-3.953
 rooft-3.670
j-3.665
Token.
Token ID13
Feature activation+4.64e-01

Pos logit contributions
ucks+1.252
ap+1.244
 upon+1.205
apping+1.185
angles+1.164
Neg logit contributions
col-1.496
 gn-1.465
 disl-1.353
 dis-1.206
 rooft-1.171
Token Everywhere
Token ID48201
Feature activation+5.96e-01

Pos logit contributions
ap+0.536
 upon+0.503
ucks+0.499
apping+0.488
angles+0.468
Neg logit contributions
col-0.853
 gn-0.822
 disl-0.784
growth-0.705
 rooft-0.698
Token she
Token ID673
Feature activation-6.26e-01

Pos logit contributions
ap+0.882
ucks+0.861
apping+0.834
 upon+0.825
aps+0.799
Neg logit contributions
col-0.894
 gn-0.864
 disl-0.806
 dis-0.718
riger-0.704
Token went
Token ID1816
Feature activation+3.27e-01

Pos logit contributions
col+0.995
 gn+0.954
 disl+0.919
riger+0.804
 dis+0.792
Neg logit contributions
ap-0.870
 upon-0.829
ucks-0.818
apping-0.793
angles-0.765
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
col+2.488
 gn+2.364
 disl+2.302
riger+2.132
 CRE+2.090
Neg logit contributions
ap-1.273
 upon-1.228
ucks-1.168
apping-1.074
aps-1.062
Token night
Token ID1755
Feature activation-9.36e-01

Pos logit contributions
col+4.103
 disl+3.932
 gn+3.812
riger+3.555
 CRE+3.494
Neg logit contributions
 upon-1.661
ap-1.590
ucks-1.319
 friend-1.299
apping-1.270
Token?"
Token ID1701
Feature activation+3.94e-01

Pos logit contributions
col+1.676
 gn+1.596
 disl+1.555
riger+1.389
 dis+1.385
Neg logit contributions
ap-1.159
 upon-1.118
apping-1.060
ucks-1.044
aps-1.005
Token\n
Token ID198
Feature activation+3.67e-02

Pos logit contributions
ap+0.579
ucks+0.556
 upon+0.553
apping+0.545
angles+0.527
Neg logit contributions
col-0.601
 gn-0.585
 disl-0.547
growth-0.480
 dis-0.478
Token\n
Token ID198
Feature activation+1.88e-02

Pos logit contributions
ap+0.052
 upon+0.049
ucks+0.049
apping+0.048
angles+0.046
Neg logit contributions
col-0.058
 gn-0.056
 disl-0.053
riger-0.046
 dis-0.046
TokenThe
Token ID464
Feature activation-3.11e+00

Pos logit contributions
ap+0.025
 upon+0.023
ucks+0.023
apping+0.023
angles+0.022
Neg logit contributions
col-0.032
 gn-0.031
 disl-0.029
riger-0.026
 dis-0.026
Token rabbit
Token ID22746
Feature activation+2.27e-01

Pos logit contributions
col+6.435
 disl+5.865
 CRE+5.801
riger+5.692
growth+5.681
Neg logit contributions
 friend-3.594
 upon-3.506
 friends-3.288
ucks-2.986
ap-2.936
Token said
Token ID531
Feature activation-3.90e-01

Pos logit contributions
ap+0.315
ucks+0.302
 upon+0.296
apping+0.294
angles+0.280
Neg logit contributions
col-0.363
 gn-0.354
 disl-0.329
 dis-0.292
riger-0.289
Token,
Token ID11
Feature activation-2.08e-01

Pos logit contributions
col+0.617
 gn+0.587
 disl+0.556
growth+0.496
riger+0.493
Neg logit contributions
ap-0.564
 upon-0.531
apping-0.523
ucks-0.516
aps-0.493
Token "
Token ID366
Feature activation-1.01e+00

Pos logit contributions
col+0.396
 gn+0.382
 disl+0.364
riger+0.329
growth+0.329
Neg logit contributions
ap-0.227
 upon-0.214
ucks-0.209
apping-0.207
aps-0.192
TokenI
Token ID40
Feature activation-4.54e-01

Pos logit contributions
col+2.008
 gn+1.958
 disl+1.863
riger+1.756
 dis+1.720
Neg logit contributions
 upon-0.965
ap-0.951
ucks-0.927
apping-0.887
aps-0.822
Token for
Token ID329
Feature activation+1.07e+00

Pos logit contributions
ap+1.073
ucks+1.048
 upon+1.012
apping+1.007
angles+0.971
Neg logit contributions
col-1.277
 gn-1.232
 disl-1.121
riger-1.009
 dis-1.005
Token me
Token ID502
Feature activation+1.17e-01

Pos logit contributions
ucks+1.040
ap+1.037
 upon+0.996
apping+0.985
aps+0.905
Neg logit contributions
col-2.133
 gn-2.069
 disl-1.963
 dis-1.857
riger-1.796
Token!"
Token ID2474
Feature activation+2.89e-01

Pos logit contributions
ap+0.162
ucks+0.155
 upon+0.154
apping+0.151
angles+0.145
Neg logit contributions
col-0.186
 gn-0.180
 disl-0.168
riger-0.149
 dis-0.149
Token\n
Token ID198
Feature activation-2.83e-01

Pos logit contributions
ap+0.484
ucks+0.464
 upon+0.463
apping+0.454
angles+0.441
Neg logit contributions
col-0.382
 gn-0.362
 disl-0.334
growth-0.286
 dis-0.285
Token\n
Token ID198
Feature activation+2.26e-01

Pos logit contributions
col+0.468
 gn+0.454
 disl+0.427
riger+0.382
growth+0.377
Neg logit contributions
ap-0.380
ucks-0.360
 upon-0.359
apping-0.353
aps-0.337
TokenThe
Token ID464
Feature activation-3.10e+00

Pos logit contributions
ap+0.303
 upon+0.288
ucks+0.287
apping+0.279
angles+0.268
Neg logit contributions
col-0.378
 gn-0.364
 disl-0.342
 dis-0.305
riger-0.303
Token surprise
Token ID5975
Feature activation+1.12e+00

Pos logit contributions
col+6.696
 disl+6.198
 gn+5.989
keeping+5.783
growth+5.646
Neg logit contributions
 upon-3.362
 friend-3.116
ap-3.056
 friends-2.908
apping-2.679
Token turned
Token ID2900
Feature activation+2.41e-01

Pos logit contributions
ap+1.658
ucks+1.626
 upon+1.562
apping+1.548
ooks+1.497
Neg logit contributions
col-1.735
 gn-1.687
 disl-1.450
 dis-1.370
growth-1.298
Token out
Token ID503
Feature activation+3.53e-01

Pos logit contributions
ap+0.328
ucks+0.313
 upon+0.308
apping+0.303
angles+0.290
Neg logit contributions
col-0.394
 gn-0.379
 disl-0.352
 dis-0.316
riger-0.314
Token to
Token ID284
Feature activation+8.72e-01

Pos logit contributions
ap+0.446
ucks+0.427
 upon+0.417
apping+0.409
angles+0.397
Neg logit contributions
col-0.612
 gn-0.593
 disl-0.548
 dis-0.500
 rooft-0.493
Token be
Token ID307
Feature activation-1.76e-01

Pos logit contributions
ap+1.153
ucks+1.118
apping+1.115
 upon+1.077
aps+1.060
Neg logit contributions
col-1.459
 gn-1.414
 disl-1.308
 dis-1.208
growth-1.205
Token a
Token ID257
Feature activation+2.42e-01

Pos logit contributions
col+0.213
 gn+0.206
 disl+0.195
riger+0.174
 dis+0.173
Neg logit contributions
ap-0.155
 upon-0.147
ucks-0.146
apping-0.143
aps-0.135
Token jacket
Token ID15224
Feature activation-1.17e-01

Pos logit contributions
ap+0.357
ucks+0.344
 upon+0.338
apping+0.334
angles+0.322
Neg logit contributions
col-0.364
 gn-0.354
 disl-0.327
 dis-0.287
riger-0.287
Token.
Token ID13
Feature activation-3.35e-01

Pos logit contributions
col+0.207
 gn+0.200
 disl+0.190
riger+0.170
 dis+0.170
Neg logit contributions
ap-0.141
 upon-0.133
ucks-0.132
apping-0.130
aps-0.122
Token\n
Token ID198
Feature activation-4.90e-01

Pos logit contributions
col+0.566
 gn+0.541
 disl+0.514
riger+0.460
 dis+0.456
Neg logit contributions
ap-0.434
 upon-0.418
ucks-0.413
apping-0.403
aps-0.384
Token\n
Token ID198
Feature activation-7.01e-01

Pos logit contributions
col+0.827
 gn+0.802
 disl+0.765
riger+0.677
 dis+0.670
Neg logit contributions
ap-0.631
 upon-0.614
ucks-0.603
apping-0.587
aps-0.564
TokenThe
Token ID464
Feature activation-3.09e+00

Pos logit contributions
col+1.298
 gn+1.246
 disl+1.198
riger+1.073
 rooft+1.069
Neg logit contributions
ap-0.785
 upon-0.757
ucks-0.737
apping-0.725
aps-0.677
Token p
Token ID279
Feature activation+4.11e-01

Pos logit contributions
col+6.257
 disl+5.867
 CRE+5.675
riger+5.546
growth+5.533
Neg logit contributions
 upon-3.678
 friend-3.660
 friends-3.234
ucks-3.087
ap-2.960
Tokenanda
Token ID5282
Feature activation-5.88e-02

Pos logit contributions
ap+0.585
ucks+0.550
 upon+0.548
apping+0.538
angles+0.519
Neg logit contributions
col-0.652
 gn-0.627
 disl-0.584
riger-0.508
 dis-0.508
Token asked
Token ID1965
Feature activation+3.44e-01

Pos logit contributions
col+0.091
 gn+0.087
 disl+0.082
riger+0.072
 dis+0.072
Neg logit contributions
ap-0.085
 upon-0.081
ucks-0.080
apping-0.079
angles-0.075
Token,
Token ID11
Feature activation-2.46e-02

Pos logit contributions
ap+0.480
ucks+0.460
apping+0.451
 upon+0.450
angles+0.428
Neg logit contributions
col-0.548
 gn-0.538
 disl-0.497
 dis-0.446
growth-0.439
Token "
Token ID366
Feature activation-1.28e+00

Pos logit contributions
col+0.045
 gn+0.044
 disl+0.042
riger+0.038
 dis+0.038
Neg logit contributions
ap-0.028
 upon-0.026
ucks-0.026
apping-0.026
angles-0.024
Token go
Token ID467
Feature activation+4.84e-01

Pos logit contributions
col+1.678
 gn+1.609
 disl+1.548
riger+1.407
 dis+1.400
Neg logit contributions
ap-0.887
 upon-0.782
ucks-0.740
apping-0.723
angles-0.695
Token see
Token ID766
Feature activation+1.87e-01

Pos logit contributions
ap+0.566
ucks+0.534
apping+0.530
 upon+0.522
angles+0.502
Neg logit contributions
col-0.874
 gn-0.855
 disl-0.786
riger-0.725
 dis-0.715
Token some
Token ID617
Feature activation+2.46e-01

Pos logit contributions
ap+0.226
ucks+0.211
 upon+0.210
apping+0.208
aps+0.195
Neg logit contributions
col-0.333
 gn-0.322
 disl-0.304
 dis-0.273
riger-0.273
Token birds
Token ID10087
Feature activation+8.60e-02

Pos logit contributions
ap+0.340
ucks+0.323
 upon+0.322
apping+0.317
angles+0.301
Neg logit contributions
col-0.396
 gn-0.382
 disl-0.355
 dis-0.316
 rooft-0.314
Token!"
Token ID2474
Feature activation+2.28e-01

Pos logit contributions
ap+0.104
ucks+0.099
 upon+0.098
apping+0.096
angles+0.091
Neg logit contributions
col-0.152
 gn-0.147
 disl-0.139
riger-0.125
 dis-0.124
Token The
Token ID383
Feature activation-3.08e+00

Pos logit contributions
ap+0.325
 upon+0.311
ucks+0.310
apping+0.304
angles+0.291
Neg logit contributions
col-0.358
 gn-0.344
 disl-0.324
growth-0.284
 dis-0.284
Token little
Token ID1310
Feature activation-9.22e-01

Pos logit contributions
col+6.919
 disl+6.440
riger+6.038
 gn+6.023
 CRE+6.021
Neg logit contributions
 upon-3.145
 friend-3.065
 eyes-2.612
ap-2.598
 friends-2.547
Token orange
Token ID10912
Feature activation+5.86e-01

Pos logit contributions
col+1.938
 gn+1.819
 disl+1.813
riger+1.657
growth+1.647
Neg logit contributions
ap-0.840
 upon-0.821
ucks-0.774
apping-0.721
aps-0.668
Token bear
Token ID6842
Feature activation-8.45e-02

Pos logit contributions
ap+0.828
ucks+0.804
 upon+0.782
apping+0.775
ooks+0.749
Neg logit contributions
col-0.921
 gn-0.896
 disl-0.813
 dis-0.726
growth-0.722
Token was
Token ID373
Feature activation-2.15e-01

Pos logit contributions
col+0.135
 gn+0.130
 disl+0.122
riger+0.108
 dis+0.108
Neg logit contributions
ap-0.117
 upon-0.112
ucks-0.111
apping-0.109
angles-0.103
Token excited
Token ID6568
Feature activation+6.42e-01

Pos logit contributions
col+0.306
 gn+0.292
 disl+0.274
growth+0.238
riger+0.238
Neg logit contributions
ap-0.341
 upon-0.325
ucks-0.322
apping-0.320
aps-0.305
Token too
Token ID1165
Feature activation-7.84e-02

Pos logit contributions
col+0.782
 gn+0.761
 disl+0.725
growth+0.666
 rooft+0.655
Neg logit contributions
ap-0.414
 upon-0.386
ucks-0.373
apping-0.370
aps-0.342
Token?"
Token ID1701
Feature activation-2.64e-01

Pos logit contributions
col+0.132
 gn+0.128
 disl+0.121
riger+0.107
growth+0.107
Neg logit contributions
ap-0.102
 upon-0.097
ucks-0.096
apping-0.094
aps-0.089
Token
Token ID220
Feature activation-1.63e-01

Pos logit contributions
col+0.391
 gn+0.377
 disl+0.351
riger+0.310
 dis+0.307
Neg logit contributions
ap-0.398
 upon-0.377
ucks-0.375
apping-0.371
aps-0.354
Token\n
Token ID198
Feature activation-4.61e-01

Pos logit contributions
col+0.263
 gn+0.255
 disl+0.239
riger+0.212
 dis+0.211
Neg logit contributions
ap-0.225
 upon-0.214
ucks-0.213
apping-0.209
aps-0.199
Token\n
Token ID198
Feature activation-2.38e-01

Pos logit contributions
col+0.764
 gn+0.744
 disl+0.701
riger+0.629
growth+0.620
Neg logit contributions
ap-0.615
 upon-0.580
ucks-0.578
apping-0.569
aps-0.543
TokenThe
Token ID464
Feature activation-3.02e+00

Pos logit contributions
col+0.401
 gn+0.389
 disl+0.366
riger+0.327
growth+0.326
Neg logit contributions
ap-0.313
 upon-0.294
ucks-0.293
apping-0.290
aps-0.273
Token other
Token ID584
Feature activation-3.83e-01

Pos logit contributions
col+6.207
 disl+5.769
 gn+5.735
growth+5.561
riger+5.507
Neg logit contributions
 upon-3.051
 friend-3.025
ap-3.023
apping-2.591
 friends-2.563
Token child
Token ID1200
Feature activation+2.67e-01

Pos logit contributions
col+0.679
 gn+0.661
 disl+0.629
riger+0.567
growth+0.565
Neg logit contributions
ap-0.469
 upon-0.444
ucks-0.436
apping-0.427
angles-0.401
Token was
Token ID373
Feature activation-3.34e-01

Pos logit contributions
ap+0.372
ucks+0.360
 upon+0.354
apping+0.348
aps+0.332
Neg logit contributions
col-0.423
 gn-0.408
 disl-0.380
 dis-0.339
riger-0.334
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
col+0.527
 gn+0.504
 disl+0.476
riger+0.425
growth+0.423
Neg logit contributions
ap-0.478
 upon-0.452
apping-0.445
ucks-0.444
aps-0.422
Token bit
Token ID1643
Feature activation-8.88e-01

Pos logit contributions
col+1.987
 gn+1.807
 disl+1.770
growth+1.613
keeping+1.582
Neg logit contributions
ap-1.835
 upon-1.754
apping-1.678
ucks-1.554
 friend-1.540
Token shouted
Token ID17293
Feature activation-1.16e+00

Pos logit contributions
col+0.696
 gn+0.666
 disl+0.629
riger+0.557
growth+0.551
Neg logit contributions
ap-0.625
 upon-0.602
ucks-0.590
apping-0.583
aps-0.556
Token Bob
Token ID5811
Feature activation-3.12e-01

Pos logit contributions
col+2.009
 gn+1.901
 disl+1.848
riger+1.643
growth+1.625
Neg logit contributions
ap-1.485
 upon-1.385
apping-1.352
ucks-1.327
aps-1.267
Token.
Token ID13
Feature activation-9.42e-02

Pos logit contributions
col+0.556
 gn+0.532
 disl+0.511
growth+0.458
riger+0.457
Neg logit contributions
ap-0.378
 upon-0.361
ucks-0.350
apping-0.349
aps-0.328
Token\n
Token ID198
Feature activation-8.46e-01

Pos logit contributions
col+0.155
 gn+0.150
 disl+0.141
riger+0.125
growth+0.125
Neg logit contributions
ap-0.126
 upon-0.120
ucks-0.119
apping-0.117
aps-0.111
Token\n
Token ID198
Feature activation-7.35e-01

Pos logit contributions
col+1.480
 gn+1.441
 disl+1.378
riger+1.221
growth+1.207
Neg logit contributions
ap-1.048
 upon-1.013
ucks-0.997
apping-0.974
aps-0.933
TokenThe
Token ID464
Feature activation-2.99e+00

Pos logit contributions
col+1.286
 gn+1.241
 disl+1.176
growth+1.051
riger+1.043
Neg logit contributions
ap-0.912
 upon-0.864
ucks-0.856
apping-0.843
aps-0.790
Token elf
Token ID23878
Feature activation+5.30e-01

Pos logit contributions
col+5.274
 disl+5.216
 CRE+4.656
 gn+4.631
riger+4.617
Neg logit contributions
 upon-3.836
 friend-3.750
 friends-3.531
 eyes-3.474
ap-3.436
Token said
Token ID531
Feature activation-4.96e-01

Pos logit contributions
ap+0.778
ucks+0.756
 upon+0.731
apping+0.724
angles+0.707
Neg logit contributions
col-0.814
 gn-0.797
 disl-0.707
 dis-0.648
growth-0.631
Token,
Token ID11
Feature activation-3.22e-01

Pos logit contributions
col+0.792
 gn+0.753
 disl+0.716
growth+0.642
riger+0.637
Neg logit contributions
ap-0.705
 upon-0.666
apping-0.652
ucks-0.645
aps-0.616
Token "
Token ID366
Feature activation-4.73e-01

Pos logit contributions
col+0.619
 gn+0.594
 disl+0.569
growth+0.517
riger+0.516
Neg logit contributions
ap-0.345
 upon-0.326
ucks-0.317
apping-0.315
aps-0.292
TokenG
Token ID38
Feature activation+4.93e-01

Pos logit contributions
col+0.805
 gn+0.783
 disl+0.739
riger+0.667
 dis+0.660
Neg logit contributions
ap-0.588
 upon-0.577
ucks-0.570
apping-0.548
aps-0.519
Token over
Token ID625
Feature activation-3.53e-01

Pos logit contributions
col+1.168
 gn+1.137
 disl+1.095
riger+0.975
 dis+0.967
Neg logit contributions
ap-0.789
 upon-0.758
ucks-0.723
apping-0.714
angles-0.686
Token there
Token ID612
Feature activation+9.94e-02

Pos logit contributions
col+0.599
 gn+0.585
 disl+0.553
riger+0.494
growth+0.486
Neg logit contributions
ap-0.454
 upon-0.436
ucks-0.425
apping-0.417
angles-0.398
Token!"
Token ID2474
Feature activation+1.50e-01

Pos logit contributions
ap+0.139
ucks+0.133
 upon+0.132
apping+0.129
angles+0.123
Neg logit contributions
col-0.157
 gn-0.151
 disl-0.141
riger-0.125
 dis-0.125
Token\n
Token ID198
Feature activation-1.69e-01

Pos logit contributions
ap+0.254
ucks+0.244
 upon+0.244
apping+0.240
angles+0.232
Neg logit contributions
col-0.194
 gn-0.185
 disl-0.170
 dis-0.145
growth-0.145
Token
Token ID198
Feature activation+4.19e-01

Pos logit contributions
col+0.274
 gn+0.265
 disl+0.249
riger+0.221
 dis+0.219
Neg logit contributions
ap-0.232
 upon-0.220
ucks-0.220
apping-0.216
aps-0.205
TokenThe
Token ID464
Feature activation-2.95e+00

Pos logit contributions
ap+0.580
 upon+0.555
ucks+0.553
apping+0.534
angles+0.521
Neg logit contributions
col-0.682
 gn-0.654
 disl-0.615
 dis-0.547
riger-0.541
Token two
Token ID734
Feature activation+8.79e-01

Pos logit contributions
col+6.320
 disl+5.828
 gn+5.646
keeping+5.379
growth+5.308
Neg logit contributions
 upon-3.261
 friend-2.968
ap-2.958
 friends-2.734
apping-2.624
Token children
Token ID1751
Feature activation+6.44e-01

Pos logit contributions
ap+0.914
ucks+0.905
 upon+0.849
apping+0.824
ooks+0.804
Neg logit contributions
col-1.695
 gn-1.643
 disl-1.511
 dis-1.449
riger-1.402
Token carefully
Token ID7773
Feature activation-4.19e-01

Pos logit contributions
ap+0.916
ucks+0.915
 upon+0.874
apping+0.858
ooks+0.834
Neg logit contributions
col-1.005
 gn-0.956
 disl-0.865
 dis-0.802
growth-0.776
Token climbed
Token ID19952
Feature activation+6.22e-01

Pos logit contributions
col+0.666
 gn+0.641
 disl+0.619
riger+0.544
 rooft+0.535
Neg logit contributions
ap-0.576
 upon-0.547
ucks-0.539
apping-0.530
angles-0.508
Token down
Token ID866
Feature activation-2.51e-01

Pos logit contributions
ap+0.888
ucks+0.847
apping+0.835
 upon+0.818
ooks+0.784
Neg logit contributions
col-0.984
 gn-0.949
 disl-0.843
 rooft-0.792
growth-0.776
Token scared
Token ID12008
Feature activation-5.50e-01

Pos logit contributions
col+0.754
 gn+0.711
 disl+0.679
growth+0.611
riger+0.595
Neg logit contributions
ap-0.650
 upon-0.624
apping-0.603
ucks-0.598
aps-0.570
Token."
Token ID526
Feature activation+5.21e-01

Pos logit contributions
col+1.001
 gn+0.972
 disl+0.928
growth+0.840
riger+0.834
Neg logit contributions
ap-0.656
 upon-0.624
apping-0.608
ucks-0.596
aps-0.560
Token
Token ID220
Feature activation+2.54e-01

Pos logit contributions
ap+0.814
ucks+0.783
 upon+0.778
apping+0.763
angles+0.746
Neg logit contributions
col-0.743
 gn-0.710
 disl-0.656
growth-0.571
 dis-0.570
Token
Token ID198
Feature activation-3.26e-02

Pos logit contributions
ap+0.364
ucks+0.345
 upon+0.344
apping+0.338
angles+0.325
Neg logit contributions
col-0.390
 gn-0.376
 disl-0.352
 dis-0.310
riger-0.310
TokenBut
Token ID1537
Feature activation+6.47e-01

Pos logit contributions
col+0.052
 gn+0.051
 disl+0.047
riger+0.042
 dis+0.042
Neg logit contributions
ap-0.045
 upon-0.043
ucks-0.043
apping-0.042
angles-0.040
Token the
Token ID262
Feature activation-2.90e+00

Pos logit contributions
ap+0.954
ucks+0.953
 upon+0.904
apping+0.899
angles+0.877
Neg logit contributions
col-0.958
 gn-0.949
 disl-0.859
 dis-0.782
 rooft-0.752
Token first
Token ID717
Feature activation-1.19e+00

Pos logit contributions
col+5.807
 disl+5.316
 gn+4.935
growth+4.726
keeping+4.698
Neg logit contributions
 upon-3.724
 friend-3.470
ap-3.373
 friends-3.176
apping-3.051
Token child
Token ID1200
Feature activation-4.99e-02

Pos logit contributions
col+2.303
 gn+2.176
 disl+2.173
riger+1.954
growth+1.933
Neg logit contributions
ap-1.334
 upon-1.267
apping-1.197
ucks-1.137
aps-1.073
Token said
Token ID531
Feature activation-4.18e-01

Pos logit contributions
col+0.084
 gn+0.081
 disl+0.077
riger+0.068
 dis+0.068
Neg logit contributions
ap-0.065
 upon-0.061
ucks-0.061
apping-0.060
angles-0.056
Token,
Token ID11
Feature activation-1.43e-01

Pos logit contributions
col+0.663
 gn+0.631
 disl+0.598
growth+0.532
riger+0.531
Neg logit contributions
ap-0.601
 upon-0.565
apping-0.556
ucks-0.548
aps-0.524
Token "
Token ID366
Feature activation+8.37e-02

Pos logit contributions
col+0.269
 gn+0.260
 disl+0.248
riger+0.223
growth+0.223
Neg logit contributions
ap-0.159
 upon-0.149
ucks-0.147
apping-0.145
aps-0.135
Token seen
Token ID1775
Feature activation+8.87e-02

Pos logit contributions
ap+0.194
ucks+0.184
 upon+0.181
apping+0.179
angles+0.169
Neg logit contributions
col-0.275
 gn-0.266
 disl-0.251
 dis-0.224
riger-0.224
Token it
Token ID340
Feature activation-2.89e-01

Pos logit contributions
ap+0.088
ucks+0.081
 upon+0.081
apping+0.079
aps+0.073
Neg logit contributions
col-0.178
 gn-0.173
 disl-0.164
 dis-0.150
riger-0.149
Token?"
Token ID1701
Feature activation-6.71e-02

Pos logit contributions
col+0.494
 gn+0.476
 disl+0.452
riger+0.402
growth+0.402
Neg logit contributions
ap-0.370
 upon-0.358
ucks-0.345
apping-0.340
angles-0.322
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
col+0.108
 gn+0.104
 disl+0.098
riger+0.086
 dis+0.086
Neg logit contributions
ap-0.094
 upon-0.089
ucks-0.089
apping-0.087
aps-0.083
Token
Token ID198
Feature activation-5.73e-02

Pos logit contributions
col+0.242
 gn+0.234
 disl+0.220
riger+0.195
 dis+0.194
Neg logit contributions
ap-0.205
 upon-0.195
ucks-0.194
apping-0.190
aps-0.181
TokenThe
Token ID464
Feature activation-2.87e+00

Pos logit contributions
col+0.096
 gn+0.093
 disl+0.087
riger+0.078
 dis+0.078
Neg logit contributions
ap-0.076
 upon-0.072
ucks-0.072
apping-0.071
angles-0.067
Token p
Token ID279
Feature activation+1.98e-01

Pos logit contributions
col+5.443
 disl+5.097
 CRE+4.956
growth+4.869
riger+4.862
Neg logit contributions
 upon-3.640
 friend-3.551
 friends-3.285
ucks-3.190
ap-3.095
Tokenanda
Token ID5282
Feature activation-1.52e-01

Pos logit contributions
ap+0.277
 upon+0.261
ucks+0.261
apping+0.256
angles+0.245
Neg logit contributions
col-0.319
 gn-0.307
 disl-0.288
riger-0.252
 dis-0.252
Token shook
Token ID14682
Feature activation+2.41e-01

Pos logit contributions
col+0.229
 gn+0.220
 disl+0.206
riger+0.182
growth+0.180
Neg logit contributions
ap-0.224
 upon-0.214
ucks-0.211
apping-0.208
angles-0.198
Token his
Token ID465
Feature activation+5.73e-01

Pos logit contributions
ap+0.242
ucks+0.228
 upon+0.223
apping+0.220
angles+0.207
Neg logit contributions
col-0.478
 gn-0.467
 disl-0.442
 dis-0.404
growth-0.401
Token head
Token ID1182
Feature activation-9.88e-01

Pos logit contributions
ap+0.662
ucks+0.641
 upon+0.623
apping+0.618
angles+0.595
Neg logit contributions
 gn-1.030
col-1.030
 disl-0.944
 rooft-0.864
 dis-0.862
Token me
Token ID502
Feature activation-7.87e-02

Pos logit contributions
col+1.472
 gn+1.448
 disl+1.380
riger+1.290
growth+1.278
Neg logit contributions
ap-0.406
 upon-0.367
ucks-0.351
apping-0.324
aps-0.296
Token?"
Token ID1701
Feature activation+2.33e-03

Pos logit contributions
col+0.136
 gn+0.131
 disl+0.124
riger+0.111
 dis+0.110
Neg logit contributions
ap-0.100
 upon-0.094
ucks-0.094
apping-0.092
aps-0.087
Token
Token ID220
Feature activation-2.75e-01

Pos logit contributions
ap+0.003
 upon+0.003
ucks+0.003
apping+0.003
angles+0.003
Neg logit contributions
col-0.004
 gn-0.004
 disl-0.003
riger-0.003
 dis-0.003
Token
Token ID198
Feature activation-5.71e-01

Pos logit contributions
col+0.452
 gn+0.437
 disl+0.411
riger+0.365
 dis+0.362
Neg logit contributions
ap-0.372
 upon-0.355
ucks-0.351
apping-0.345
aps-0.329
Token
Token ID198
Feature activation-3.38e-01

Pos logit contributions
col+0.971
 gn+0.944
 disl+0.892
riger+0.794
 dis+0.780
Neg logit contributions
ap-0.736
 upon-0.699
ucks-0.690
apping-0.679
aps-0.647
TokenThe
Token ID464
Feature activation-2.87e+00

Pos logit contributions
col+0.579
 gn+0.560
 disl+0.527
riger+0.471
 dis+0.467
Neg logit contributions
ap-0.432
 upon-0.407
ucks-0.403
apping-0.399
aps-0.375
Token crab
Token ID32202
Feature activation+3.43e-01

Pos logit contributions
col+5.082
riger+4.843
 disl+4.613
 CRE+4.589
 gn+4.573
Neg logit contributions
 upon-3.925
ap-3.612
 friend-3.599
apping-3.443
 friends-3.290
Token replied
Token ID8712
Feature activation-5.93e-01

Pos logit contributions
ap+0.473
ucks+0.457
 upon+0.445
apping+0.441
angles+0.424
Neg logit contributions
col-0.550
 gn-0.535
 disl-0.496
 dis-0.441
riger-0.433
Token,
Token ID11
Feature activation-3.49e-01

Pos logit contributions
col+0.959
 gn+0.910
 disl+0.874
riger+0.781
growth+0.779
Neg logit contributions
ap-0.843
apping-0.783
 upon-0.783
ucks-0.759
aps-0.733
Token "
Token ID366
Feature activation-8.55e-01

Pos logit contributions
col+0.673
 gn+0.646
 disl+0.620
growth+0.563
riger+0.562
Neg logit contributions
ap-0.371
 upon-0.351
apping-0.339
ucks-0.339
aps-0.312
TokenI
Token ID40
Feature activation-7.39e-01

Pos logit contributions
col+1.623
 gn+1.590
 disl+1.506
riger+1.411
 dis+1.397
Neg logit contributions
ap-0.884
 upon-0.862
ucks-0.845
apping-0.820
aps-0.769
Token like
Token ID588
Feature activation+1.21e-01

Pos logit contributions
col+0.694
 gn+0.674
 disl+0.641
growth+0.575
riger+0.572
Neg logit contributions
ap-0.477
 upon-0.456
ucks-0.439
apping-0.434
aps-0.409
Token me
Token ID502
Feature activation+7.66e-02

Pos logit contributions
ap+0.134
ucks+0.125
 upon+0.124
apping+0.122
angles+0.114
Neg logit contributions
col-0.228
 gn-0.221
 disl-0.209
 dis-0.189
riger-0.189
Token?"
Token ID1701
Feature activation-4.70e-01

Pos logit contributions
ap+0.099
ucks+0.094
 upon+0.093
apping+0.091
angles+0.087
Neg logit contributions
col-0.129
 gn-0.125
 disl-0.118
riger-0.105
 dis-0.105
Token
Token ID198
Feature activation-2.84e-02

Pos logit contributions
col+0.784
 gn+0.748
 disl+0.708
riger+0.632
 rooft+0.627
Neg logit contributions
ap-0.628
 upon-0.592
ucks-0.589
apping-0.575
aps-0.549
Token
Token ID198
Feature activation-4.30e-01

Pos logit contributions
col+0.045
 gn+0.044
 disl+0.041
riger+0.036
 dis+0.036
Neg logit contributions
ap-0.040
 upon-0.038
ucks-0.037
apping-0.037
angles-0.035
TokenThe
Token ID464
Feature activation-2.87e+00

Pos logit contributions
col+0.781
 gn+0.746
 disl+0.713
 dis+0.638
riger+0.636
Neg logit contributions
ap-0.512
 upon-0.481
ucks-0.477
apping-0.467
aps-0.440
Token crab
Token ID32202
Feature activation-7.21e-02

Pos logit contributions
col+5.307
riger+4.952
 disl+4.817
 gn+4.723
 CRE+4.611
Neg logit contributions
 upon-3.789
 friend-3.573
ap-3.534
apping-3.341
 friends-3.216
Token looked
Token ID3114
Feature activation+6.63e-01

Pos logit contributions
col+0.113
 gn+0.109
 disl+0.103
riger+0.091
 dis+0.090
Neg logit contributions
ap-0.102
 upon-0.097
ucks-0.097
apping-0.095
aps-0.090
Token at
Token ID379
Feature activation+2.02e-01

Pos logit contributions
ap+0.813
ucks+0.781
apping+0.753
 upon+0.726
angles+0.721
Neg logit contributions
col-1.165
 gn-1.128
 disl-1.055
 dis-0.957
 rooft-0.939
Token T
Token ID309
Feature activation+1.28e+00

Pos logit contributions
ap+0.271
 upon+0.255
ucks+0.254
apping+0.253
aps+0.238
Neg logit contributions
col-0.334
 gn-0.324
 disl-0.303
growth-0.272
 dis-0.270
Tokenuna
Token ID9613
Feature activation-1.26e-01

Pos logit contributions
ap+1.576
 upon+1.433
ucks+1.417
apping+1.410
ooks+1.371
Neg logit contributions
col-2.212
 gn-2.183
 disl-1.991
growth-1.860
yah-1.770
Token luggage
Token ID31488
Feature activation+7.42e-01

Pos logit contributions
col+0.238
 gn+0.230
 disl+0.218
riger+0.196
 dis+0.194
Neg logit contributions
ap-0.174
 upon-0.165
ucks-0.163
apping-0.160
aps-0.151
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
ap+0.978
ucks+0.958
 upon+0.910
apping+0.895
angles+0.872
Neg logit contributions
col-1.242
 gn-1.198
 disl-1.092
 rooft-0.985
 dis-0.981
Token
Token ID220
Feature activation+4.90e-01

Pos logit contributions
ap+0.030
 upon+0.029
ucks+0.029
apping+0.028
angles+0.027
Neg logit contributions
col-0.032
 gn-0.031
 disl-0.029
riger-0.026
 dis-0.026
Token
Token ID198
Feature activation-5.89e-01

Pos logit contributions
ap+0.731
ucks+0.689
 upon+0.678
apping+0.676
angles+0.652
Neg logit contributions
col-0.727
 gn-0.694
 disl-0.650
 dis-0.570
growth-0.564
Token
Token ID198
Feature activation-4.51e-01

Pos logit contributions
col+1.002
 gn+0.974
 disl+0.933
riger+0.824
growth+0.818
Neg logit contributions
ap-0.746
 upon-0.737
ucks-0.717
apping-0.699
aps-0.669
TokenThe
Token ID464
Feature activation-2.80e+00

Pos logit contributions
col+0.801
 gn+0.773
 disl+0.734
growth+0.660
riger+0.657
Neg logit contributions
ap-0.542
 upon-0.520
ucks-0.511
apping-0.503
aps-0.470
Token elephant
Token ID20950
Feature activation+5.80e-02

Pos logit contributions
col+5.692
 disl+5.331
 gn+5.282
riger+5.149
 CRE+5.022
Neg logit contributions
 upon-3.007
 friend-2.858
ap-2.644
 for-2.487
 friends-2.378
Token asked
Token ID1965
Feature activation-1.95e-02

Pos logit contributions
ap+0.082
ucks+0.078
 upon+0.077
apping+0.076
angles+0.072
Neg logit contributions
col-0.091
 gn-0.088
 disl-0.082
 dis-0.073
riger-0.073
Token,
Token ID11
Feature activation-1.66e-02

Pos logit contributions
col+0.033
 gn+0.032
 disl+0.030
riger+0.027
 dis+0.027
Neg logit contributions
ap-0.026
 upon-0.024
ucks-0.024
apping-0.024
angles-0.022
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
col+0.030
 gn+0.030
 disl+0.028
riger+0.025
 dis+0.025
Neg logit contributions
ap-0.019
 upon-0.018
ucks-0.018
apping-0.017
angles-0.016
TokenWhat
Token ID2061
Feature activation+7.38e-01

Pos logit contributions
col+1.758
 gn+1.664
 disl+1.659
 dis+1.453
riger+1.452
Neg logit contributions
 upon-1.206
ap-1.169
ucks-1.160
apping-1.069
angles-1.022
Token with
Token ID351
Feature activation-1.20e-01

Pos logit contributions
ap+1.196
ucks+1.138
apping+1.101
angles+1.096
 upon+1.065
Neg logit contributions
col-1.554
 gn-1.504
 disl-1.362
 dis-1.242
 rooft-1.216
Token him
Token ID683
Feature activation+4.48e-01

Pos logit contributions
col+0.219
 gn+0.212
 disl+0.201
riger+0.181
 dis+0.180
Neg logit contributions
ap-0.139
 upon-0.131
ucks-0.130
apping-0.126
angles-0.119
Token.
Token ID13
Feature activation+3.22e-01

Pos logit contributions
ap+0.586
ucks+0.568
apping+0.547
 upon+0.544
angles+0.523
Neg logit contributions
col-0.742
 gn-0.721
 disl-0.673
 dis-0.608
riger-0.598
Token
Token ID198
Feature activation-2.88e-01

Pos logit contributions
ap+0.459
ucks+0.434
 upon+0.431
apping+0.425
angles+0.413
Neg logit contributions
col-0.506
 gn-0.488
 disl-0.456
 dis-0.403
growth-0.400
Token
Token ID198
Feature activation+1.07e-01

Pos logit contributions
col+0.473
 gn+0.460
 disl+0.434
riger+0.385
growth+0.382
Neg logit contributions
ap-0.385
 upon-0.371
ucks-0.367
apping-0.359
aps-0.342
TokenThe
Token ID464
Feature activation-2.77e+00

Pos logit contributions
ap+0.146
ucks+0.138
 upon+0.138
apping+0.135
angles+0.129
Neg logit contributions
col-0.177
 gn-0.171
 disl-0.160
 dis-0.143
riger-0.142
Token little
Token ID1310
Feature activation-3.44e-01

Pos logit contributions
col+5.873
 disl+5.700
 gn+5.439
riger+5.383
growth+5.218
Neg logit contributions
 upon-2.885
ap-2.581
 friend-2.568
 for-2.260
 friends-2.212
Token girl
Token ID2576
Feature activation+4.17e-01

Pos logit contributions
col+0.724
 gn+0.700
 disl+0.676
riger+0.619
growth+0.616
Neg logit contributions
ap-0.311
 upon-0.293
ucks-0.285
apping-0.274
aps-0.250
Token was
Token ID373
Feature activation-2.76e-01

Pos logit contributions
ap+0.597
ucks+0.578
 upon+0.568
apping+0.561
angles+0.540
Neg logit contributions
col-0.645
 gn-0.628
 disl-0.577
 dis-0.516
growth-0.504
Token so
Token ID523
Feature activation-5.23e-01

Pos logit contributions
col+0.424
 gn+0.405
 disl+0.383
riger+0.339
growth+0.339
Neg logit contributions
ap-0.407
 upon-0.387
ucks-0.381
apping-0.381
aps-0.361
Token happy
Token ID3772
Feature activation+7.82e-01

Pos logit contributions
col+0.911
 gn+0.877
 disl+0.825
growth+0.764
riger+0.756
Neg logit contributions
ap-0.676
 upon-0.641
apping-0.631
ucks-0.622
aps-0.593
Token the
Token ID262
Feature activation-1.61e+00

Pos logit contributions
col+2.700
 gn+2.549
 disl+2.476
riger+2.266
 CRE+2.211
Neg logit contributions
ap-1.580
 upon-1.548
ucks-1.465
angles-1.344
 for-1.343
Token jungle
Token ID20712
Feature activation-1.02e-01

Pos logit contributions
col+2.844
 disl+2.700
 gn+2.674
 dis+2.319
growth+2.285
Neg logit contributions
ap-1.951
 upon-1.914
ucks-1.815
apping-1.689
 for-1.661
Token?"
Token ID1701
Feature activation+1.42e-01

Pos logit contributions
col+0.176
 gn+0.170
 disl+0.161
riger+0.144
 dis+0.143
Neg logit contributions
ap-0.129
 upon-0.121
ucks-0.120
apping-0.118
aps-0.111
Token
Token ID198
Feature activation-2.54e-01

Pos logit contributions
ap+0.219
ucks+0.210
 upon+0.209
apping+0.206
angles+0.198
Neg logit contributions
col-0.205
 gn-0.197
 disl-0.184
 dis-0.159
riger-0.159
Token
Token ID198
Feature activation-2.06e-01

Pos logit contributions
col+0.416
 gn+0.403
 disl+0.380
riger+0.337
 dis+0.335
Neg logit contributions
ap-0.344
 upon-0.328
ucks-0.326
apping-0.319
aps-0.304
TokenThe
Token ID464
Feature activation-2.77e+00

Pos logit contributions
col+0.356
 gn+0.344
 disl+0.324
riger+0.290
growth+0.290
Neg logit contributions
ap-0.260
 upon-0.245
ucks-0.244
apping-0.239
aps-0.226
Token child
Token ID1200
Feature activation+3.39e-01

Pos logit contributions
col+5.529
 disl+5.206
 gn+5.131
riger+5.097
 CRE+4.921
Neg logit contributions
 upon-2.946
 friend-2.853
ap-2.656
 friends-2.436
 for-2.413
Token said
Token ID531
Feature activation-8.87e-01

Pos logit contributions
ap+0.473
ucks+0.458
 upon+0.446
apping+0.442
angles+0.423
Neg logit contributions
col-0.536
 gn-0.521
 disl-0.483
 dis-0.431
riger-0.423
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
col+1.487
 gn+1.405
 disl+1.356
growth+1.239
 rooft+1.219
Neg logit contributions
ap-1.228
 upon-1.134
apping-1.128
ucks-1.058
aps-1.043
Token "
Token ID366
Feature activation-8.21e-01

Pos logit contributions
col+0.368
 gn+0.356
 disl+0.339
riger+0.307
growth+0.306
Neg logit contributions
ap-0.213
 upon-0.200
ucks-0.196
apping-0.194
aps-0.180
TokenI
Token ID40
Feature activation-8.75e-01

Pos logit contributions
col+1.610
 gn+1.578
 disl+1.507
riger+1.395
 dis+1.379
Neg logit contributions
ap-0.800
 upon-0.794
ucks-0.774
apping-0.750
aps-0.692
Token he
Token ID339
Feature activation-1.41e-01

Pos logit contributions
ap+0.193
ucks+0.183
 upon+0.181
apping+0.179
aps+0.170
Neg logit contributions
col-0.218
 gn-0.212
 disl-0.198
 dis-0.176
riger-0.175
Token could
Token ID714
Feature activation+6.19e-01

Pos logit contributions
col+0.224
 gn+0.215
 disl+0.202
riger+0.179
growth+0.178
Neg logit contributions
ap-0.199
 upon-0.190
ucks-0.189
apping-0.185
aps-0.176
Token play
Token ID711
Feature activation+5.16e-01

Pos logit contributions
ap+0.790
ucks+0.771
 upon+0.756
apping+0.751
angles+0.726
Neg logit contributions
col-1.047
 gn-1.028
 disl-0.944
 dis-0.855
riger-0.838
Token too
Token ID1165
Feature activation+6.73e-05

Pos logit contributions
ap+0.537
 upon+0.510
ucks+0.507
apping+0.493
angles+0.477
Neg logit contributions
col-1.000
 gn-0.977
 disl-0.923
 dis-0.838
riger-0.832
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
ap+0.000
 upon+0.000
ucks+0.000
apping+0.000
angles+0.000
Neg logit contributions
col-0.000
 gn-0.000
 disl-0.000
riger-0.000
 dis-0.000
Token The
Token ID383
Feature activation-2.73e+00

Pos logit contributions
col+0.201
 gn+0.194
 disl+0.183
riger+0.165
growth+0.165
Neg logit contributions
ap-0.131
 upon-0.124
ucks-0.123
apping-0.120
aps-0.113
Token babys
Token ID46711
Feature activation+3.37e-01

Pos logit contributions
col+5.240
 disl+5.121
 gn+4.890
growth+4.843
riger+4.779
Neg logit contributions
 friend-2.995
 upon-2.881
ap-2.813
 friends-2.714
ucks-2.691
Tokenitter
Token ID1967
Feature activation-2.05e-01

Pos logit contributions
ap+0.371
 upon+0.348
ucks+0.344
apping+0.329
angles+0.323
Neg logit contributions
col-0.636
 gn-0.611
 disl-0.580
 dis-0.525
riger-0.521
Token said
Token ID531
Feature activation-8.47e-01

Pos logit contributions
col+0.327
 gn+0.316
 disl+0.297
riger+0.264
growth+0.264
Neg logit contributions
ap-0.284
 upon-0.270
ucks-0.267
apping-0.264
aps-0.249
Token it
Token ID340
Feature activation-4.82e-01

Pos logit contributions
col+1.379
 gn+1.285
 disl+1.226
growth+1.125
 rooft+1.113
Neg logit contributions
ap-1.186
 upon-1.141
apping-1.105
ucks-1.056
aps-1.017
Token was
Token ID373
Feature activation+1.12e+00

Pos logit contributions
col+0.889
 gn+0.857
 disl+0.824
growth+0.746
riger+0.743
Neg logit contributions
ap-0.547
 upon-0.522
ucks-0.501
apping-0.495
aps-0.461
Token it
Token ID340
Feature activation+5.05e-01

Pos logit contributions
ap+0.187
ucks+0.177
 upon+0.175
apping+0.172
aps+0.162
Neg logit contributions
col-0.275
 gn-0.267
 disl-0.251
riger-0.226
 dis-0.226
Token looks
Token ID3073
Feature activation+4.08e-01

Pos logit contributions
ap+0.645
ucks+0.623
apping+0.609
 upon+0.607
ooks+0.575
Neg logit contributions
col-0.855
 gn-0.830
 disl-0.774
riger-0.698
 dis-0.689
Token fun
Token ID1257
Feature activation-7.65e-01

Pos logit contributions
ap+0.635
ucks+0.618
apping+0.601
 upon+0.593
aps+0.574
Neg logit contributions
col-0.586
 gn-0.564
 disl-0.522
 dis-0.458
 rooft-0.455
Token!"
Token ID2474
Feature activation+1.89e-01

Pos logit contributions
col+1.306
 gn+1.286
 disl+1.208
growth+1.093
riger+1.074
Neg logit contributions
ap-0.989
 upon-0.953
apping-0.907
ucks-0.901
angles-0.859
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
ap+0.316
ucks+0.304
 upon+0.304
apping+0.298
angles+0.288
Neg logit contributions
col-0.249
 gn-0.237
 disl-0.219
 dis-0.187
riger-0.187
TokenThe
Token ID464
Feature activation-2.72e+00

Pos logit contributions
ap+0.182
 upon+0.172
ucks+0.172
apping+0.169
angles+0.162
Neg logit contributions
col-0.200
 gn-0.192
 disl-0.180
 dis-0.159
riger-0.158
Token second
Token ID1218
Feature activation-9.80e-01

Pos logit contributions
col+5.794
 disl+5.227
 gn+5.118
growth+4.925
keeping+4.800
Neg logit contributions
 upon-2.887
ap-2.800
 friend-2.703
apping-2.480
 friends-2.453
Token child
Token ID1200
Feature activation-1.82e-01

Pos logit contributions
col+1.805
 gn+1.754
 disl+1.693
riger+1.535
growth+1.514
Neg logit contributions
ap-1.174
 upon-1.113
apping-1.075
ucks-1.055
angles-0.977
Token said
Token ID531
Feature activation-2.43e-01

Pos logit contributions
col+0.299
 gn+0.288
 disl+0.273
riger+0.242
growth+0.241
Neg logit contributions
ap-0.244
 upon-0.232
ucks-0.229
apping-0.226
angles-0.214
Token,
Token ID11
Feature activation-3.78e-01

Pos logit contributions
col+0.376
 gn+0.362
 disl+0.341
riger+0.301
growth+0.300
Neg logit contributions
ap-0.355
 upon-0.334
apping-0.330
ucks-0.329
aps-0.312
Token "
Token ID366
Feature activation-2.78e-01

Pos logit contributions
col+0.728
 gn+0.704
 disl+0.675
riger+0.610
growth+0.609
Neg logit contributions
ap-0.406
 upon-0.380
apping-0.371
ucks-0.367
aps-0.339
Token there
Token ID612
Feature activation+2.32e-01

Pos logit contributions
col+0.523
 gn+0.505
 disl+0.479
growth+0.433
riger+0.431
Neg logit contributions
ap-0.339
 upon-0.321
ucks-0.312
apping-0.311
aps-0.291
Token.
Token ID13
Feature activation+5.89e-02

Pos logit contributions
ap+0.294
ucks+0.281
 upon+0.273
apping+0.271
angles+0.258
Neg logit contributions
col-0.396
 gn-0.384
 disl-0.360
 dis-0.324
riger-0.322
Token
Token ID220
Feature activation+3.00e-01

Pos logit contributions
ap+0.086
ucks+0.081
 upon+0.081
apping+0.080
angles+0.076
Neg logit contributions
col-0.091
 gn-0.088
 disl-0.082
riger-0.072
 dis-0.072
Token
Token ID198
Feature activation-6.85e-01

Pos logit contributions
ap+0.436
ucks+0.411
 upon+0.407
apping+0.403
angles+0.387
Neg logit contributions
col-0.456
 gn-0.439
 disl-0.410
 dis-0.362
riger-0.360
Token
Token ID198
Feature activation-3.34e-01

Pos logit contributions
col+1.181
 gn+1.145
 disl+1.098
riger+0.965
growth+0.964
Neg logit contributions
ap-0.863
 upon-0.847
ucks-0.822
apping-0.805
aps-0.773
TokenThe
Token ID464
Feature activation-2.71e+00

Pos logit contributions
col+0.603
 gn+0.582
 disl+0.551
growth+0.496
riger+0.493
Neg logit contributions
ap-0.396
 upon-0.375
ucks-0.370
apping-0.364
aps-0.341
Token elf
Token ID23878
Feature activation+1.10e-01

Pos logit contributions
col+5.528
 disl+5.378
 gn+4.824
 CRE+4.692
growth+4.611
Neg logit contributions
 upon-3.052
 friend-2.657
ap-2.571
 for-2.450
 friends-2.401
Token said
Token ID531
Feature activation-1.43e+00

Pos logit contributions
ap+0.157
ucks+0.150
 upon+0.149
apping+0.147
angles+0.140
Neg logit contributions
col-0.170
 gn-0.164
 disl-0.153
 dis-0.135
riger-0.134
Token that
Token ID326
Feature activation-5.48e-01

Pos logit contributions
col+2.500
 gn+2.301
 disl+2.245
growth+2.116
 rooft+2.068
Neg logit contributions
ap-1.845
 upon-1.748
apping-1.720
 for-1.626
aps-1.556
Token he
Token ID339
Feature activation+4.69e-01

Pos logit contributions
col+0.927
 gn+0.881
 disl+0.845
riger+0.759
growth+0.756
Neg logit contributions
ap-0.706
 upon-0.685
ucks-0.654
apping-0.648
angles-0.616
Token had
Token ID550
Feature activation+4.36e-01

Pos logit contributions
ap+0.711
ucks+0.679
apping+0.664
 upon+0.655
aps+0.633
Neg logit contributions
col-0.702
 gn-0.687
 disl-0.629
 dis-0.561
riger-0.548
Token you
Token ID345
Feature activation-1.09e+00

Pos logit contributions
ap+0.727
ucks+0.704
apping+0.665
 upon+0.652
aps+0.615
Neg logit contributions
col-1.613
 gn-1.581
 disl-1.493
 dis-1.388
riger-1.383
Token doing
Token ID1804
Feature activation-7.48e-01

Pos logit contributions
col+2.067
 gn+1.920
 disl+1.878
growth+1.741
 rooft+1.675
Neg logit contributions
ap-1.325
 upon-1.287
apping-1.189
ucks-1.105
aps-1.071
Token?"
Token ID1701
Feature activation-1.16e-01

Pos logit contributions
col+1.401
 gn+1.330
 disl+1.275
growth+1.160
 rooft+1.138
Neg logit contributions
ap-0.894
 upon-0.827
apping-0.808
ucks-0.788
aps-0.747
Token
Token ID198
Feature activation-7.22e-01

Pos logit contributions
col+0.188
 gn+0.181
 disl+0.171
riger+0.151
 dis+0.151
Neg logit contributions
ap-0.160
 upon-0.152
ucks-0.151
apping-0.148
aps-0.141
Token
Token ID198
Feature activation-6.54e-01

Pos logit contributions
col+1.252
 gn+1.212
 disl+1.164
riger+1.032
 dis+1.018
Neg logit contributions
ap-0.907
 upon-0.871
ucks-0.861
apping-0.834
aps-0.808
TokenThe
Token ID464
Feature activation-2.71e+00

Pos logit contributions
col+1.167
 gn+1.119
 disl+1.068
growth+0.958
riger+0.955
Neg logit contributions
ap-0.790
 upon-0.741
ucks-0.735
apping-0.725
aps-0.685
Token bear
Token ID6842
Feature activation+2.99e-01

Pos logit contributions
col+5.350
 disl+4.674
 CRE+4.634
growth+4.416
riger+4.410
Neg logit contributions
 upon-3.248
 friend-2.977
ucks-2.843
ap-2.806
 friends-2.757
Token replied
Token ID8712
Feature activation-5.32e-01

Pos logit contributions
ap+0.421
ucks+0.405
 upon+0.397
apping+0.393
angles+0.376
Neg logit contributions
col-0.470
 gn-0.458
 disl-0.423
 dis-0.378
riger-0.371
Token,
Token ID11
Feature activation-5.74e-02

Pos logit contributions
col+0.857
 gn+0.814
 disl+0.779
growth+0.696
riger+0.694
Neg logit contributions
ap-0.758
 upon-0.707
apping-0.704
ucks-0.686
aps-0.659
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
col+0.107
 gn+0.104
 disl+0.098
riger+0.089
 dis+0.088
Neg logit contributions
ap-0.064
 upon-0.060
ucks-0.060
apping-0.059
aps-0.055
TokenI
Token ID40
Feature activation-7.24e-01

Pos logit contributions
col+1.974
 gn+1.935
 disl+1.847
riger+1.718
growth+1.695
Neg logit contributions
 upon-0.957
ap-0.929
ucks-0.915
apping-0.878
aps-0.810
Token one
Token ID530
Feature activation+1.14e-01

Pos logit contributions
ap+0.731
ucks+0.709
 upon+0.686
apping+0.682
ooks+0.652
Neg logit contributions
col-0.928
 gn-0.900
 disl-0.840
riger-0.758
 dis-0.750
Token day
Token ID1110
Feature activation+2.77e-01

Pos logit contributions
ap+0.146
ucks+0.139
 upon+0.138
apping+0.135
angles+0.129
Neg logit contributions
col-0.194
 gn-0.188
 disl-0.176
riger-0.158
 dis-0.157
Token?"
Token ID1701
Feature activation-2.83e-03

Pos logit contributions
ap+0.369
ucks+0.355
 upon+0.347
apping+0.343
angles+0.330
Neg logit contributions
col-0.451
 gn-0.436
 disl-0.406
 dis-0.364
riger-0.364
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
col+0.004
 gn+0.004
 disl+0.004
riger+0.004
 dis+0.004
Neg logit contributions
ap-0.004
 upon-0.004
ucks-0.004
apping-0.004
angles-0.004
Token
Token ID198
Feature activation-9.05e-02

Pos logit contributions
col+0.546
 gn+0.528
 disl+0.499
riger+0.444
 dis+0.439
Neg logit contributions
ap-0.446
 upon-0.423
ucks-0.421
apping-0.414
aps-0.394
TokenThe
Token ID464
Feature activation-2.71e+00

Pos logit contributions
col+0.153
 gn+0.148
 disl+0.140
riger+0.124
 dis+0.124
Neg logit contributions
ap-0.118
 upon-0.111
ucks-0.111
apping-0.109
aps-0.103
Token elf
Token ID23878
Feature activation+4.28e-01

Pos logit contributions
col+3.820
 disl+3.738
 gn+3.172
 CRE+3.101
growth+3.098
Neg logit contributions
 upon-4.566
ap-4.143
 friend-4.040
 for-3.879
 friends-3.866
Token smiled
Token ID13541
Feature activation+2.56e-01

Pos logit contributions
ap+0.606
ucks+0.588
 upon+0.572
apping+0.563
angles+0.543
Neg logit contributions
col-0.669
 gn-0.649
 disl-0.596
 dis-0.536
riger-0.526
Token and
Token ID290
Feature activation-4.01e-01

Pos logit contributions
ap+0.295
ucks+0.279
 upon+0.273
apping+0.268
angles+0.259
Neg logit contributions
col-0.467
 gn-0.456
 disl-0.426
 dis-0.386
 rooft-0.383
Token said
Token ID531
Feature activation-7.89e-01

Pos logit contributions
col+0.649
 gn+0.623
 disl+0.591
growth+0.525
riger+0.524
Neg logit contributions
ap-0.548
 upon-0.522
ucks-0.514
apping-0.507
angles-0.480
Token,
Token ID11
Feature activation-7.26e-01

Pos logit contributions
col+1.292
 gn+1.229
 disl+1.188
growth+1.065
riger+1.050
Neg logit contributions
ap-1.109
 upon-1.032
apping-1.030
ucks-0.980
aps-0.960
Token love
Token ID1842
Feature activation-1.81e-01

Pos logit contributions
ap+0.269
ucks+0.257
 upon+0.251
apping+0.250
angles+0.236
Neg logit contributions
col-0.394
 gn-0.382
 disl-0.361
riger-0.323
 dis-0.323
Token to
Token ID284
Feature activation+3.38e-03

Pos logit contributions
col+0.338
 gn+0.328
 disl+0.311
riger+0.282
growth+0.281
Neg logit contributions
ap-0.203
 upon-0.192
ucks-0.190
apping-0.185
aps-0.174
Token play
Token ID711
Feature activation-6.92e-01

Pos logit contributions
ap+0.004
 upon+0.003
ucks+0.003
apping+0.003
angles+0.003
Neg logit contributions
col-0.006
 gn-0.006
 disl-0.006
riger-0.005
 dis-0.005
Token,"
Token ID553
Feature activation+6.38e-01

Pos logit contributions
col+1.320
 gn+1.273
 disl+1.216
growth+1.122
riger+1.112
Neg logit contributions
ap-0.762
 upon-0.702
ucks-0.692
apping-0.677
aps-0.648
Token replied
Token ID8712
Feature activation-5.99e-02

Pos logit contributions
ap+0.956
ucks+0.920
 upon+0.919
apping+0.917
ooks+0.875
Neg logit contributions
col-0.956
 gn-0.938
 disl-0.872
 dis-0.768
riger-0.745
Token the
Token ID262
Feature activation-2.70e+00

Pos logit contributions
col+0.103
 gn+0.100
 disl+0.094
riger+0.084
 dis+0.084
Neg logit contributions
ap-0.076
 upon-0.072
ucks-0.072
apping-0.070
aps-0.066
Token crab
Token ID32202
Feature activation+7.82e-02

Pos logit contributions
col+4.649
riger+4.474
 disl+4.261
 gn+4.186
 CRE+4.079
Neg logit contributions
 upon-3.848
ap-3.569
 friend-3.473
apping-3.429
aps-3.244
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
ap+0.097
ucks+0.092
 upon+0.092
apping+0.090
angles+0.085
Neg logit contributions
col-0.136
 gn-0.132
 disl-0.124
 dis-0.111
riger-0.111
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
col+0.346
 gn+0.332
 disl+0.314
riger+0.280
growth+0.279
Neg logit contributions
ap-0.279
 upon-0.266
ucks-0.265
apping-0.259
aps-0.247
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
ap+1.631
apping+1.507
ucks+1.503
angles+1.483
 upon+1.479
Neg logit contributions
col-1.411
 gn-1.325
 disl-1.209
 dis-1.053
 rooft-1.034
TokenL
Token ID43
Feature activation+5.62e-01

Pos logit contributions
ap+1.319
ucks+1.232
 upon+1.225
apping+1.209
angles+1.158
Neg logit contributions
col-1.975
 gn-1.924
 disl-1.783
riger-1.598
 dis-1.588
Token said
Token ID531
Feature activation-1.03e+00

Pos logit contributions
col+0.051
 gn+0.049
 disl+0.046
riger+0.041
 dis+0.040
Neg logit contributions
ap-0.049
 upon-0.046
ucks-0.046
apping-0.045
angles-0.043
Token,
Token ID11
Feature activation-4.18e-01

Pos logit contributions
col+1.814
 gn+1.741
 disl+1.609
growth+1.534
riger+1.502
Neg logit contributions
ap-1.325
 upon-1.234
apping-1.226
ucks-1.153
aps-1.127
Token "
Token ID366
Feature activation+9.86e-01

Pos logit contributions
col+0.810
 gn+0.781
 disl+0.744
growth+0.680
riger+0.675
Neg logit contributions
ap-0.439
 upon-0.416
apping-0.403
ucks-0.403
aps-0.368
TokenIt
Token ID1026
Feature activation-5.60e-02

Pos logit contributions
ap+1.447
ucks+1.299
apping+1.298
 upon+1.290
aps+1.246
Neg logit contributions
col-1.619
 gn-1.520
 disl-1.453
 dis-1.276
 rooft-1.258
Token's
Token ID338
Feature activation+1.02e+00

Pos logit contributions
col+0.095
 gn+0.092
 disl+0.086
riger+0.077
 dis+0.077
Neg logit contributions
ap-0.073
 upon-0.069
ucks-0.069
apping-0.067
angles-0.064
Token okay
Token ID8788
Feature activation+1.68e+00

Pos logit contributions
ucks+1.295
ap+1.262
apping+1.221
aps+1.172
 upon+1.172
Neg logit contributions
col-1.768
 gn-1.726
 disl-1.647
 dis-1.536
riger-1.445
Token.
Token ID13
Feature activation+3.11e-01

Pos logit contributions
ucks+3.023
ap+2.921
angles+2.887
apping+2.860
 upon+2.759
Neg logit contributions
 gn-2.012
col-1.993
 disl-1.826
 dis-1.621
 rooft-1.423
Token We
Token ID775
Feature activation+5.97e-01

Pos logit contributions
ap+0.352
 upon+0.336
ucks+0.331
apping+0.322
angles+0.308
Neg logit contributions
col-0.572
 gn-0.554
 disl-0.531
 dis-0.474
riger-0.472
Token can
Token ID460
Feature activation+2.53e-01

Pos logit contributions
ap+0.816
ucks+0.800
apping+0.776
 upon+0.767
aps+0.738
Neg logit contributions
col-0.958
 gn-0.910
 disl-0.868
 dis-0.775
riger-0.761
Token buy
Token ID2822
Feature activation+3.12e-01

Pos logit contributions
ap+0.285
 upon+0.273
ucks+0.272
apping+0.265
angles+0.248
Neg logit contributions
col-0.465
 gn-0.452
 disl-0.428
 dis-0.386
riger-0.384
Token another
Token ID1194
Feature activation-4.36e-01

Pos logit contributions
ap+0.352
 upon+0.334
ucks+0.329
apping+0.327
aps+0.302
Neg logit contributions
col-0.582
 gn-0.560
 disl-0.536
 dis-0.482
riger-0.479
Token family
Token ID1641
Feature activation-1.65e-02

Pos logit contributions
col+1.591
 gn+1.541
 disl+1.423
growth+1.347
riger+1.330
Neg logit contributions
ap-0.954
 upon-0.895
ucks-0.853
apping-0.835
angles-0.797
Token.
Token ID13
Feature activation-7.50e-01

Pos logit contributions
col+0.029
 gn+0.028
 disl+0.027
riger+0.024
 dis+0.024
Neg logit contributions
ap-0.020
 upon-0.019
ucks-0.019
apping-0.019
angles-0.018
Token\n
Token ID198
Feature activation-4.47e-01

Pos logit contributions
col+1.447
 gn+1.401
 disl+1.317
riger+1.230
 dis+1.208
Neg logit contributions
ap-0.808
ucks-0.776
 upon-0.763
apping-0.736
aps-0.708
Token\n
Token ID198
Feature activation-1.22e-01

Pos logit contributions
col+0.785
 gn+0.760
 disl+0.716
riger+0.652
growth+0.639
Neg logit contributions
ap-0.549
 upon-0.527
ucks-0.524
apping-0.506
aps-0.486
TokenThe
Token ID464
Feature activation-2.68e-01

Pos logit contributions
col+0.232
 gn+0.225
 disl+0.214
riger+0.193
 dis+0.193
Neg logit contributions
ap-0.134
 upon-0.126
ucks-0.125
apping-0.122
aps-0.114
Token moral
Token ID6573
Feature activation+1.71e+00

Pos logit contributions
col+0.481
 gn+0.463
 disl+0.440
growth+0.395
riger+0.394
Neg logit contributions
ap-0.326
 upon-0.309
ucks-0.304
apping-0.296
angles-0.280
Token value
Token ID1988
Feature activation+1.55e+00

Pos logit contributions
ap+1.566
 upon+1.557
angles+1.452
ucks+1.444
apping+1.386
Neg logit contributions
col-3.623
 gn-3.353
 disl-3.139
 dis-2.963
growth-2.927
Token of
Token ID286
Feature activation+1.21e-01

Pos logit contributions
ap+1.649
ucks+1.537
 upon+1.499
apping+1.486
angles+1.415
Neg logit contributions
col-2.958
 gn-2.730
 disl-2.644
 dis-2.409
growth-2.332
Token the
Token ID262
Feature activation-8.42e-01

Pos logit contributions
ap+0.160
 upon+0.151
ucks+0.151
apping+0.148
aps+0.140
Neg logit contributions
col-0.204
 gn-0.197
 disl-0.185
 dis-0.166
riger-0.165
Token story
Token ID1621
Feature activation+7.13e-01

Pos logit contributions
col+1.411
 gn+1.349
 disl+1.277
growth+1.177
 rooft+1.138
Neg logit contributions
ap-1.136
 upon-1.076
ucks-1.036
apping-1.018
angles-0.969
Token is
Token ID318
Feature activation+9.38e-01

Pos logit contributions
ap+0.734
ucks+0.709
 upon+0.696
apping+0.672
ooks+0.639
Neg logit contributions
col-1.371
 gn-1.303
 disl-1.254
 dis-1.128
riger-1.115
Token,
Token ID11
Feature activation-6.32e-01

Pos logit contributions
col+1.864
 gn+1.790
 disl+1.699
growth+1.576
riger+1.532
Neg logit contributions
ap-1.533
apping-1.401
 upon-1.380
ucks-1.302
aps-1.301
Token "
Token ID366
Feature activation+3.60e-01

Pos logit contributions
col+1.243
 gn+1.200
 disl+1.157
growth+1.061
riger+1.045
Neg logit contributions
ap-0.650
 upon-0.609
apping-0.596
ucks-0.584
aps-0.539
TokenYes
Token ID5297
Feature activation+2.69e-01

Pos logit contributions
ap+0.509
 upon+0.479
ucks+0.479
apping+0.469
aps+0.448
Neg logit contributions
col-0.579
 gn-0.552
 disl-0.518
riger-0.457
 rooft-0.456
Token,
Token ID11
Feature activation+6.92e-02

Pos logit contributions
ap+0.408
ucks+0.396
 upon+0.390
apping+0.384
aps+0.369
Neg logit contributions
col-0.394
 gn-0.378
 disl-0.353
 dis-0.311
riger-0.308
Token it
Token ID340
Feature activation-1.45e-01

Pos logit contributions
ap+0.087
ucks+0.082
 upon+0.081
apping+0.080
aps+0.075
Neg logit contributions
col-0.120
 gn-0.116
 disl-0.110
 dis-0.098
riger-0.098
Token sure
Token ID1654
Feature activation+2.13e+00

Pos logit contributions
col+0.273
 gn+0.264
 disl+0.251
riger+0.226
growth+0.226
Neg logit contributions
ap-0.162
 upon-0.150
ucks-0.150
apping-0.146
aps-0.137
Token is
Token ID318
Feature activation+1.48e-02

Pos logit contributions
ucks+3.231
apping+3.017
ap+2.963
 upon+2.931
ooks+2.930
Neg logit contributions
col-3.250
 gn-3.060
 disl-3.054
 dis-2.545
riger-2.484
Token sweet
Token ID6029
Feature activation-2.92e-01

Pos logit contributions
ap+0.019
ucks+0.018
 upon+0.018
apping+0.017
angles+0.017
Neg logit contributions
col-0.025
 gn-0.024
 disl-0.023
riger-0.021
 dis-0.021
Tokenheart
Token ID11499
Feature activation-6.29e-01

Pos logit contributions
col+0.563
 gn+0.548
 disl+0.522
riger+0.475
growth+0.473
Neg logit contributions
ap-0.313
 upon-0.291
ucks-0.290
apping-0.283
aps-0.263
Token."
Token ID526
Feature activation+2.53e-02

Pos logit contributions
col+1.124
 gn+1.085
 disl+1.034
growth+0.941
riger+0.924
Neg logit contributions
ap-0.770
 upon-0.724
apping-0.706
ucks-0.695
aps-0.659
Token<|endoftext|>
Token ID50256
Feature activation-4.80e-02

Pos logit contributions
ap+0.039
 upon+0.037
ucks+0.037
apping+0.037
angles+0.035
Neg logit contributions
col-0.037
 gn-0.035
 disl-0.033
riger-0.029
 dis-0.029
Token am
Token ID716
Feature activation+1.70e-01

Pos logit contributions
col+0.790
 gn+0.763
 disl+0.719
growth+0.664
riger+0.654
Neg logit contributions
ap-0.475
 upon-0.445
ucks-0.435
apping-0.423
aps-0.397
Token sorry
Token ID7926
Feature activation+1.24e+00

Pos logit contributions
ap+0.266
ucks+0.256
 upon+0.256
apping+0.250
angles+0.242
Neg logit contributions
col-0.243
 gn-0.235
 disl-0.219
 dis-0.191
riger-0.190
Token.
Token ID13
Feature activation-4.73e-01

Pos logit contributions
ap+1.845
ucks+1.815
angles+1.770
 upon+1.769
apping+1.748
Neg logit contributions
 gn-1.722
col-1.712
 disl-1.577
 dis-1.422
riger-1.346
Token I
Token ID314
Feature activation-3.46e-01

Pos logit contributions
col+0.970
 gn+0.936
 disl+0.884
riger+0.815
 dis+0.814
Neg logit contributions
ap-0.456
ucks-0.420
 upon-0.411
apping-0.400
aps-0.378
Token will
Token ID481
Feature activation-7.57e-01

Pos logit contributions
col+0.613
 gn+0.594
 disl+0.556
growth+0.508
riger+0.507
Neg logit contributions
ap-0.426
 upon-0.402
ucks-0.396
apping-0.386
aps-0.365
Token help
Token ID1037
Feature activation+2.01e+00

Pos logit contributions
col+1.516
 gn+1.443
 disl+1.365
growth+1.282
 rooft+1.269
Neg logit contributions
ap-0.802
 upon-0.704
ucks-0.690
apping-0.678
aps-0.645
Token clean
Token ID3424
Feature activation-8.49e-01

Pos logit contributions
ap+2.258
itting+2.252
 upon+2.209
apping+2.195
ooks+2.111
Neg logit contributions
col-3.595
 gn-3.483
 dis-3.249
 disl-3.243
riger-2.945
Token up
Token ID510
Feature activation-2.94e-01

Pos logit contributions
col+1.623
 gn+1.564
 disl+1.487
growth+1.367
riger+1.356
Neg logit contributions
ap-0.955
ucks-0.917
 upon-0.871
apping-0.837
angles-0.821
Token.
Token ID13
Feature activation-5.42e-01

Pos logit contributions
col+0.534
 gn+0.516
 disl+0.488
riger+0.440
growth+0.439
Neg logit contributions
ap-0.350
 upon-0.329
ucks-0.327
apping-0.319
aps-0.302
Token I
Token ID314
Feature activation-1.20e-01

Pos logit contributions
col+1.065
 gn+1.025
 disl+0.967
riger+0.886
 dis+0.884
Neg logit contributions
ap-0.571
ucks-0.530
 upon-0.516
apping-0.504
aps-0.480
Token will
Token ID481
Feature activation-5.16e-01

Pos logit contributions
col+0.222
 gn+0.215
 disl+0.203
riger+0.184
growth+0.184
Neg logit contributions
ap-0.138
 upon-0.129
ucks-0.129
apping-0.125
aps-0.118
Token you
Token ID345
Feature activation+4.20e-01

Pos logit contributions
col+0.222
 gn+0.216
 disl+0.206
riger+0.190
growth+0.190
Neg logit contributions
ap-0.075
 upon-0.068
ucks-0.067
apping-0.065
angles-0.059
Token okay
Token ID8788
Feature activation+5.30e-01

Pos logit contributions
ap+0.478
ucks+0.459
 upon+0.449
apping+0.439
angles+0.430
Neg logit contributions
col-0.766
 gn-0.744
 disl-0.709
 dis-0.648
riger-0.631
Token?
Token ID30
Feature activation-7.32e-01

Pos logit contributions
ap+0.680
ucks+0.656
 upon+0.647
apping+0.632
angles+0.628
Neg logit contributions
col-0.871
 gn-0.853
 disl-0.804
 dis-0.728
riger-0.706
Token Let
Token ID3914
Feature activation-2.52e-01

Pos logit contributions
col+1.378
 gn+1.324
 disl+1.227
 rooft+1.139
growth+1.138
Neg logit contributions
ap-0.838
ucks-0.777
 upon-0.764
apping-0.758
aps-0.728
Token me
Token ID502
Feature activation+6.59e-01

Pos logit contributions
col+0.450
 gn+0.438
 disl+0.413
growth+0.370
riger+0.370
Neg logit contributions
ap-0.308
 upon-0.288
ucks-0.287
apping-0.278
angles-0.265
Token help
Token ID1037
Feature activation+1.96e+00

Pos logit contributions
ap+0.841
ucks+0.819
 upon+0.808
apping+0.804
angles+0.778
Neg logit contributions
col-1.104
 gn-1.064
 disl-1.001
 dis-0.893
riger-0.885
Token you
Token ID345
Feature activation-4.37e-02

Pos logit contributions
apping+2.204
itting+2.202
 upon+2.176
ap+2.176
ooks+2.124
Neg logit contributions
col-3.595
 gn-3.346
 disl-3.275
 dis-3.193
riger-2.946
Token.
Token ID13
Feature activation-2.75e-01

Pos logit contributions
col+0.074
 gn+0.071
 disl+0.067
riger+0.060
 dis+0.059
Neg logit contributions
ap-0.058
 upon-0.055
ucks-0.054
apping-0.053
angles-0.050
Token We
Token ID775
Feature activation+6.06e-01

Pos logit contributions
col+0.551
 gn+0.536
 disl+0.506
riger+0.462
 dis+0.462
Neg logit contributions
ap-0.276
ucks-0.254
 upon-0.250
apping-0.245
aps-0.229
Token need
Token ID761
Feature activation+8.20e-01

Pos logit contributions
ap+0.829
ucks+0.813
apping+0.796
 upon+0.783
aps+0.750
Neg logit contributions
col-0.974
 gn-0.923
 disl-0.879
 dis-0.781
riger-0.768
Token to
Token ID284
Feature activation-2.37e-01

Pos logit contributions
ap+1.100
apping+1.051
 upon+1.040
ucks+1.025
angles+0.979
Neg logit contributions
col-1.377
 gn-1.308
 disl-1.235
 dis-1.132
growth-1.108
Token busy
Token ID8179
Feature activation-6.72e-01

Pos logit contributions
col+1.635
 gn+1.534
 disl+1.458
growth+1.396
riger+1.333
Neg logit contributions
ap-1.347
 upon-1.291
apping-1.240
ucks-1.180
aps-1.147
Token boy
Token ID2933
Feature activation+1.91e-01

Pos logit contributions
col+1.191
 gn+1.136
 disl+1.101
riger+0.997
growth+0.990
Neg logit contributions
ap-0.834
 upon-0.780
apping-0.757
ucks-0.743
aps-0.696
Token.
Token ID13
Feature activation-2.34e-02

Pos logit contributions
ap+0.234
ucks+0.224
 upon+0.222
apping+0.215
angles+0.208
Neg logit contributions
col-0.335
 gn-0.324
 disl-0.303
 dis-0.273
riger-0.272
Token He
Token ID679
Feature activation+4.92e-01

Pos logit contributions
col+0.043
 gn+0.042
 disl+0.040
riger+0.036
 dis+0.036
Neg logit contributions
ap-0.027
 upon-0.025
ucks-0.025
apping-0.024
angles-0.023
Token was
Token ID373
Feature activation+5.35e-01

Pos logit contributions
ap+0.651
ucks+0.631
apping+0.610
 upon+0.607
angles+0.576
Neg logit contributions
col-0.821
 gn-0.801
 disl-0.746
 dis-0.671
growth-0.656
Token packing
Token ID24157
Feature activation+4.65e-01

Pos logit contributions
ap+0.679
ucks+0.671
 upon+0.642
apping+0.628
angles+0.620
Neg logit contributions
col-0.905
 gn-0.890
 disl-0.817
 dis-0.752
riger-0.733
Token for
Token ID329
Feature activation+1.68e-01

Pos logit contributions
ap+0.541
ucks+0.515
apping+0.506
 upon+0.498
aps+0.468
Neg logit contributions
col-0.847
 gn-0.815
 disl-0.770
 dis-0.700
riger-0.690
Token a
Token ID257
Feature activation-3.67e-02

Pos logit contributions
ap+0.200
ucks+0.188
 upon+0.187
apping+0.185
angles+0.173
Neg logit contributions
col-0.303
 gn-0.294
 disl-0.277
 dis-0.251
riger-0.250
Token trip
Token ID5296
Feature activation+5.75e-01

Pos logit contributions
col+0.057
 gn+0.055
 disl+0.052
riger+0.045
 dis+0.045
Neg logit contributions
ap-0.052
 upon-0.050
ucks-0.050
apping-0.049
angles-0.046
Token.
Token ID13
Feature activation+2.90e-01

Pos logit contributions
ap+0.709
ucks+0.689
 upon+0.662
apping+0.652
angles+0.626
Neg logit contributions
col-1.004
 gn-0.961
 disl-0.891
 dis-0.815
riger-0.814
Token He
Token ID679
Feature activation+4.96e-01

Pos logit contributions
ap+0.386
ucks+0.365
 upon+0.362
apping+0.358
angles+0.342
Neg logit contributions
col-0.478
 gn-0.463
 disl-0.435
growth-0.388
 dis-0.386
TokenOnce
Token ID7454
Feature activation+4.22e-01

Pos logit contributions
col+1.282
 gn+1.245
 disl+1.209
riger+1.069
 dis+1.049
Neg logit contributions
ap-0.906
 upon-0.898
ucks-0.865
apping-0.837
aps-0.821
Token upon
Token ID2402
Feature activation+7.00e-01

Pos logit contributions
ap+0.086
ucks+0.062
apping+0.041
 upon+0.028
angles+0.015
Neg logit contributions
col-1.162
 gn-1.137
 disl-1.097
 dis-1.037
riger-1.026
Token a
Token ID257
Feature activation-2.89e-01

Pos logit contributions
ap+1.010
ucks+0.964
apping+0.937
angles+0.921
 upon+0.912
Neg logit contributions
col-1.077
 gn-1.056
 disl-0.977
growth-0.877
 dis-0.867
Token time
Token ID640
Feature activation-7.76e-01

Pos logit contributions
col+0.479
 gn+0.465
 disl+0.436
riger+0.387
growth+0.385
Neg logit contributions
ap-0.392
 upon-0.370
ucks-0.360
apping-0.360
aps-0.338
Token,
Token ID11
Feature activation-1.76e-01

Pos logit contributions
col+1.298
 gn+1.237
 disl+1.177
growth+1.062
 rooft+1.053
Neg logit contributions
ap-1.084
 upon-1.061
apping-1.003
ucks-0.953
aps-0.942
Token there
Token ID612
Feature activation+3.68e-01

Pos logit contributions
col+0.275
 gn+0.265
 disl+0.250
riger+0.219
growth+0.218
Neg logit contributions
ap-0.254
 upon-0.244
ucks-0.238
apping-0.237
aps-0.224
Token was
Token ID373
Feature activation-4.41e-01

Pos logit contributions
ap+0.476
ucks+0.456
apping+0.443
 upon+0.440
angles+0.421
Neg logit contributions
col-0.620
 gn-0.601
 disl-0.562
 dis-0.509
growth-0.501
Token a
Token ID257
Feature activation+1.87e-01

Pos logit contributions
col+0.791
 gn+0.767
 disl+0.731
riger+0.662
 dis+0.648
Neg logit contributions
ap-0.535
 upon-0.507
ucks-0.491
apping-0.488
aps-0.459
Token little
Token ID1310
Feature activation+5.02e-01

Pos logit contributions
ap+0.271
ucks+0.260
 upon+0.258
apping+0.253
angles+0.244
Neg logit contributions
col-0.286
 gn-0.279
 disl-0.258
 dis-0.228
riger-0.227
Token boy
Token ID2933
Feature activation-2.01e-01

Pos logit contributions
ap+0.580
ucks+0.556
 upon+0.554
apping+0.534
angles+0.517
Neg logit contributions
col-0.910
 gn-0.894
 disl-0.826
 dis-0.749
riger-0.747
Token named
Token ID3706
Feature activation-6.44e-01

Pos logit contributions
col+0.318
 gn+0.306
 disl+0.289
riger+0.256
growth+0.255
Neg logit contributions
ap-0.286
 upon-0.268
ucks-0.267
apping-0.264
aps-0.250
TokenL
Token ID43
Feature activation+8.94e-01

Pos logit contributions
ap+0.196
ucks+0.184
 upon+0.182
apping+0.178
angles+0.167
Neg logit contributions
col-0.357
 gn-0.345
 disl-0.329
 dis-0.298
riger-0.297
Tokenily
Token ID813
Feature activation-5.33e-01

Pos logit contributions
ap+1.426
ucks+1.342
 upon+1.335
apping+1.308
angles+1.273
Neg logit contributions
col-1.258
 gn-1.171
 disl-1.100
 dis-0.958
growth-0.933
Token and
Token ID290
Feature activation+2.54e-01

Pos logit contributions
col+1.009
 gn+0.975
 disl+0.937
riger+0.855
growth+0.846
Neg logit contributions
ap-0.587
 upon-0.554
ucks-0.534
apping-0.532
aps-0.489
Token her
Token ID607
Feature activation-9.62e-02

Pos logit contributions
ap+0.264
ucks+0.248
 upon+0.246
apping+0.243
aps+0.225
Neg logit contributions
col-0.499
 gn-0.487
 disl-0.465
 dis-0.422
riger-0.418
Token friends
Token ID2460
Feature activation+1.78e-01

Pos logit contributions
col+0.208
 gn+0.203
 disl+0.194
riger+0.178
growth+0.177
Neg logit contributions
ap-0.081
 upon-0.074
ucks-0.074
apping-0.071
aps-0.065
Token played
Token ID2826
Feature activation+8.27e-01

Pos logit contributions
ap+0.260
ucks+0.251
 upon+0.248
apping+0.244
aps+0.234
Neg logit contributions
col-0.271
 gn-0.260
 disl-0.242
 dis-0.216
riger-0.214
Token on
Token ID319
Feature activation-7.84e-01

Pos logit contributions
ucks+0.924
ap+0.918
 upon+0.915
apping+0.863
angles+0.860
Neg logit contributions
col-1.495
 gn-1.450
 disl-1.375
 dis-1.262
riger-1.238
Token the
Token ID262
Feature activation-3.76e-01

Pos logit contributions
col+1.457
 gn+1.408
 disl+1.332
riger+1.207
growth+1.179
Neg logit contributions
ap-0.904
 upon-0.848
ucks-0.819
apping-0.788
aps-0.758
Token swings
Token ID26728
Feature activation+2.15e-01

Pos logit contributions
col+0.570
 gn+0.549
 disl+0.514
growth+0.450
riger+0.447
Neg logit contributions
ap-0.557
 upon-0.526
ucks-0.524
apping-0.513
angles-0.496
Token and
Token ID290
Feature activation-9.44e-02

Pos logit contributions
ap+0.262
ucks+0.252
 upon+0.249
apping+0.243
aps+0.230
Neg logit contributions
col-0.375
 gn-0.362
 disl-0.339
 dis-0.306
growth-0.304
Token slides
Token ID19392
Feature activation-5.10e-01

Pos logit contributions
col+0.146
 gn+0.141
 disl+0.132
riger+0.116
growth+0.116
Neg logit contributions
ap-0.136
 upon-0.129
ucks-0.128
apping-0.126
angles-0.120
Token.
Token ID13
Feature activation-4.72e-02

Pos logit contributions
col+0.031
 gn+0.030
 disl+0.028
riger+0.025
 dis+0.025
Neg logit contributions
ap-0.020
 upon-0.019
ucks-0.019
apping-0.018
angles-0.017
Token Mom
Token ID11254
Feature activation+2.61e-01

Pos logit contributions
col+0.085
 gn+0.082
 disl+0.078
riger+0.070
 dis+0.070
Neg logit contributions
ap-0.056
 upon-0.053
ucks-0.053
apping-0.052
angles-0.049
Token asked
Token ID1965
Feature activation-1.99e-01

Pos logit contributions
ap+0.378
ucks+0.356
 upon+0.354
apping+0.351
aps+0.333
Neg logit contributions
col-0.406
 gn-0.393
 disl-0.361
 dis-0.323
riger-0.320
Token,
Token ID11
Feature activation-1.83e-01

Pos logit contributions
col+0.414
 gn+0.402
 disl+0.384
riger+0.351
growth+0.350
Neg logit contributions
ap-0.184
 upon-0.171
ucks-0.167
apping-0.162
angles-0.150
Token "
Token ID366
Feature activation+1.35e-01

Pos logit contributions
col+0.347
 gn+0.335
 disl+0.319
growth+0.289
riger+0.288
Neg logit contributions
ap-0.202
 upon-0.191
ucks-0.188
apping-0.185
aps-0.171
TokenCan
Token ID6090
Feature activation+8.40e-01

Pos logit contributions
ap+0.190
 upon+0.178
ucks+0.178
apping+0.177
aps+0.168
Neg logit contributions
col-0.217
 gn-0.210
 disl-0.196
riger-0.174
 dis-0.173
Token Tim
Token ID5045
Feature activation+7.66e-01

Pos logit contributions
ap+0.931
ucks+0.894
apping+0.879
 upon+0.840
aps+0.819
Neg logit contributions
col-1.573
 gn-1.550
 disl-1.474
 dis-1.352
riger-1.329
Token ride
Token ID6594
Feature activation+5.63e-01

Pos logit contributions
ap+0.955
ucks+0.923
apping+0.902
 upon+0.894
ooks+0.865
Neg logit contributions
col-1.315
 gn-1.301
 disl-1.220
 dis-1.087
riger-1.071
Token in
Token ID287
Feature activation-1.80e-01

Pos logit contributions
ap+0.727
ucks+0.694
apping+0.688
 upon+0.682
ooks+0.651
Neg logit contributions
col-0.947
 gn-0.914
 disl-0.860
 dis-0.774
riger-0.768
Token your
Token ID534
Feature activation-4.65e-01

Pos logit contributions
col+0.326
 gn+0.316
 disl+0.300
riger+0.269
growth+0.267
Neg logit contributions
ap-0.213
 upon-0.201
ucks-0.200
apping-0.194
angles-0.184
Token red
Token ID2266
Feature activation+8.04e-01

Pos logit contributions
col+0.849
 gn+0.815
 disl+0.777
growth+0.701
riger+0.700
Neg logit contributions
ap-0.549
 upon-0.514
ucks-0.505
apping-0.489
aps-0.465
Token boy
Token ID2933
Feature activation+2.85e-01

Pos logit contributions
ap+0.250
ucks+0.235
 upon+0.232
apping+0.228
angles+0.215
Neg logit contributions
col-0.429
 gn-0.420
 disl-0.396
 dis-0.358
 rooft-0.356
Token got
Token ID1392
Feature activation+2.65e-01

Pos logit contributions
ap+0.404
ucks+0.388
 upon+0.385
apping+0.377
angles+0.362
Neg logit contributions
col-0.446
 gn-0.433
 disl-0.399
 dis-0.357
riger-0.351
Token to
Token ID284
Feature activation+1.38e-01

Pos logit contributions
ap+0.351
ucks+0.334
 upon+0.331
apping+0.326
angles+0.313
Neg logit contributions
col-0.445
 gn-0.429
 disl-0.401
 dis-0.361
riger-0.354
Token play
Token ID711
Feature activation+5.11e-01

Pos logit contributions
ap+0.169
ucks+0.161
 upon+0.160
apping+0.157
angles+0.148
Neg logit contributions
col-0.243
 gn-0.236
 disl-0.223
 dis-0.199
riger-0.198
Token outside
Token ID2354
Feature activation-3.13e-01

Pos logit contributions
ap+0.585
ucks+0.573
 upon+0.564
apping+0.543
angles+0.526
Neg logit contributions
col-0.926
 gn-0.901
 disl-0.842
 dis-0.765
riger-0.758
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
col+0.578
 gn+0.558
 disl+0.534
riger+0.481
growth+0.478
Neg logit contributions
ap-0.361
 upon-0.339
ucks-0.329
apping-0.329
aps-0.307
Token<|endoftext|>
Token ID50256
Feature activation-4.55e-01

Pos logit contributions
ap+0.291
ucks+0.273
 upon+0.273
apping+0.268
angles+0.256
Neg logit contributions
col-0.377
 gn-0.365
 disl-0.344
growth-0.306
 dis-0.305
TokenOnce
Token ID7454
Feature activation+2.65e-01

Pos logit contributions
col+0.764
 gn+0.738
 disl+0.712
riger+0.625
 dis+0.622
Neg logit contributions
ap-0.593
 upon-0.578
ucks-0.564
apping-0.549
aps-0.531
Token upon
Token ID2402
Feature activation+5.65e-01

Pos logit contributions
ap+0.053
ucks+0.037
apping+0.025
 upon+0.022
angles+0.009
Neg logit contributions
col-0.733
 gn-0.719
 disl-0.693
 dis-0.652
riger-0.648
Token a
Token ID257
Feature activation-4.47e-01

Pos logit contributions
ap+0.801
ucks+0.763
apping+0.744
 upon+0.728
angles+0.725
Neg logit contributions
col-0.888
 gn-0.872
 disl-0.805
growth-0.724
 dis-0.716
Token time
Token ID640
Feature activation-9.98e-01

Pos logit contributions
col+0.754
 gn+0.731
 disl+0.689
riger+0.611
 dis+0.608
Neg logit contributions
ap-0.595
 upon-0.563
apping-0.543
ucks-0.538
aps-0.509
Token to
Token ID284
Feature activation+1.95e-01

Pos logit contributions
ap+2.561
angles+2.550
ucks+2.527
apping+2.348
ooks+2.305
Neg logit contributions
col-2.537
 gn-2.467
 disl-2.235
 dis-2.150
 vac-1.885
Token get
Token ID651
Feature activation-1.72e-03

Pos logit contributions
ap+0.243
ucks+0.231
 upon+0.229
apping+0.227
angles+0.214
Neg logit contributions
col-0.341
 gn-0.332
 disl-0.312
 dis-0.280
growth-0.278
Token a
Token ID257
Feature activation+5.62e-01

Pos logit contributions
col+0.003
 gn+0.003
 disl+0.003
riger+0.002
 dis+0.002
Neg logit contributions
ap-0.002
 upon-0.002
ucks-0.002
apping-0.002
angles-0.002
Token coat
Token ID13209
Feature activation-6.04e-01

Pos logit contributions
ap+0.830
ucks+0.803
apping+0.783
 upon+0.776
angles+0.764
Neg logit contributions
col-0.849
 gn-0.829
 disl-0.751
 dis-0.666
riger-0.663
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
col+1.118
 gn+1.089
 disl+1.063
growth+0.948
riger+0.940
Neg logit contributions
ap-0.683
 upon-0.647
apping-0.624
ucks-0.624
aps-0.575
Token Then
Token ID3244
Feature activation-4.08e-02

Pos logit contributions
col+0.326
 gn+0.317
 disl+0.301
riger+0.270
 dis+0.269
Neg logit contributions
ap-0.211
 upon-0.200
ucks-0.198
apping-0.193
aps-0.182
Token she
Token ID673
Feature activation+2.54e-01

Pos logit contributions
col+0.062
 gn+0.059
 disl+0.055
riger+0.049
 dis+0.048
Neg logit contributions
ap-0.060
 upon-0.058
ucks-0.057
apping-0.056
angles-0.054
Token went
Token ID1816
Feature activation+2.66e-01

Pos logit contributions
ap+0.345
ucks+0.328
 upon+0.321
apping+0.321
angles+0.305
Neg logit contributions
col-0.416
 gn-0.405
 disl-0.372
 dis-0.336
riger-0.332
Token back
Token ID736
Feature activation-2.46e-01

Pos logit contributions
ap+0.366
ucks+0.345
 upon+0.340
apping+0.339
angles+0.327
Neg logit contributions
col-0.435
 gn-0.422
 disl-0.387
 dis-0.347
riger-0.347
Token outside
Token ID2354
Feature activation+7.41e-01

Pos logit contributions
col+0.447
 gn+0.433
 disl+0.414
riger+0.371
growth+0.369
Neg logit contributions
ap-0.288
 upon-0.275
ucks-0.271
apping-0.263
aps-0.249
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
ap+0.909
ucks+0.886
angles+0.862
 upon+0.831
apping+0.829
Neg logit contributions
col-1.289
 gn-1.258
 disl-1.154
 dis-1.066
 rooft-1.033
Token because
Token ID780
Feature activation+3.14e-01

Pos logit contributions
ap+0.408
ucks+0.387
 upon+0.381
apping+0.372
angles+0.359
Neg logit contributions
col-0.590
 gn-0.572
 disl-0.538
 dis-0.482
 rooft-0.478
Token there
Token ID612
Feature activation+1.26e+00

Pos logit contributions
ap+0.441
ucks+0.417
 upon+0.411
apping+0.408
aps+0.388
Neg logit contributions
col-0.502
 gn-0.487
 disl-0.451
 dis-0.404
growth-0.398
Token was
Token ID373
Feature activation-2.70e-01

Pos logit contributions
ucks+1.912
ap+1.871
apping+1.782
 upon+1.728
aps+1.696
Neg logit contributions
col-1.932
 gn-1.880
 disl-1.717
 dis-1.606
growth-1.464
Token more
Token ID517
Feature activation-1.36e-01

Pos logit contributions
col+0.470
 gn+0.452
 disl+0.430
riger+0.388
growth+0.381
Neg logit contributions
ap-0.336
 upon-0.320
ucks-0.313
apping-0.307
angles-0.292
Token space
Token ID2272
Feature activation-3.72e-01

Pos logit contributions
col+0.223
 gn+0.215
 disl+0.202
riger+0.181
growth+0.179
Neg logit contributions
ap-0.184
 upon-0.175
ucks-0.174
apping-0.171
aps-0.162
Token now
Token ID783
Feature activation-3.40e-01

Pos logit contributions
col+0.675
 gn+0.658
 disl+0.624
riger+0.565
growth+0.561
Neg logit contributions
ap-0.432
 upon-0.407
ucks-0.396
apping-0.395
aps-0.369
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
col+0.613
 gn+0.595
 disl+0.562
riger+0.508
growth+0.503
Neg logit contributions
ap-0.406
 upon-0.381
ucks-0.371
apping-0.369
angles-0.348
Token Bob
Token ID5811
Feature activation+5.16e-01

Pos logit contributions
col+0.218
 gn+0.210
 disl+0.196
riger+0.171
 dis+0.171
Neg logit contributions
ap-0.224
 upon-0.214
ucks-0.212
apping-0.209
aps-0.199
Tokeno
Token ID78
Feature activation-2.05e-01

Pos logit contributions
ap+0.645
ucks+0.603
 upon+0.591
apping+0.587
angles+0.568
Neg logit contributions
col-0.904
 gn-0.874
 disl-0.808
 dis-0.727
 rooft-0.724
Token learned
Token ID4499
Feature activation-9.21e-01

Pos logit contributions
col+0.366
 gn+0.354
 disl+0.336
riger+0.303
growth+0.301
Neg logit contributions
ap-0.246
 upon-0.231
ucks-0.230
apping-0.225
aps-0.212
Token that
Token ID326
Feature activation+3.80e-01

Pos logit contributions
col+1.363
 gn+1.293
 disl+1.211
riger+1.130
growth+1.079
Neg logit contributions
ap-1.379
 upon-1.358
ucks-1.319
apping-1.292
aps-1.259
Token
Token ID220
Feature activation+6.09e-01

Pos logit contributions
ap+0.167
ucks+0.158
 upon+0.157
apping+0.154
angles+0.148
Neg logit contributions
col-0.209
 gn-0.203
 disl-0.190
 dis-0.169
riger-0.168
Token\n
Token ID198
Feature activation-5.64e-01

Pos logit contributions
ap+0.918
ucks+0.865
apping+0.848
 upon+0.848
angles+0.822
Neg logit contributions
col-0.892
 gn-0.852
 disl-0.794
 dis-0.703
growth-0.694
Token\n
Token ID198
Feature activation-5.10e-01

Pos logit contributions
col+0.956
 gn+0.926
 disl+0.886
riger+0.790
 rooft+0.779
Neg logit contributions
ap-0.727
 upon-0.714
ucks-0.692
apping-0.678
aps-0.649
TokenThen
Token ID6423
Feature activation-4.75e-02

Pos logit contributions
col+0.943
 gn+0.911
 disl+0.866
 rooft+0.786
riger+0.784
Neg logit contributions
ap-0.589
 upon-0.560
ucks-0.541
apping-0.540
ooks-0.502
Token,
Token ID11
Feature activation+9.66e-02

Pos logit contributions
col+0.072
 gn+0.069
 disl+0.065
riger+0.057
 dis+0.056
Neg logit contributions
ap-0.070
 upon-0.067
ucks-0.067
apping-0.066
angles-0.062
Token Sammy
Token ID44067
Feature activation-1.04e+00

Pos logit contributions
ap+0.133
ucks+0.127
 upon+0.125
apping+0.124
angles+0.118
Neg logit contributions
col-0.156
 gn-0.152
 disl-0.141
 dis-0.126
riger-0.125
Token saw
Token ID2497
Feature activation-1.61e+00

Pos logit contributions
col+1.975
 disl+1.887
 gn+1.826
riger+1.657
 rooft+1.639
Neg logit contributions
 upon-1.161
ap-1.129
apping-1.008
ucks-0.986
aps-0.924
Token a
Token ID257
Feature activation+5.24e-01

Pos logit contributions
col+3.308
 gn+3.048
riger+2.995
 disl+2.993
 rooft+2.814
Neg logit contributions
 upon-1.549
ap-1.488
ucks-1.384
apping-1.303
 for-1.284
Token huge
Token ID3236
Feature activation+5.69e-01

Pos logit contributions
ap+0.811
ucks+0.788
apping+0.757
 upon+0.755
angles+0.746
Neg logit contributions
 gn-0.756
col-0.754
 disl-0.675
 dis-0.602
 rooft-0.590
Token rat
Token ID4227
Feature activation+4.18e-01

Pos logit contributions
ap+0.873
ucks+0.853
 upon+0.841
apping+0.825
angles+0.804
Neg logit contributions
 gn-0.820
col-0.815
 disl-0.729
 dis-0.647
riger-0.638
Token and
Token ID290
Feature activation+2.35e-01

Pos logit contributions
ap+0.539
ucks+0.519
 upon+0.508
apping+0.498
angles+0.487
Neg logit contributions
col-0.710
 gn-0.698
 disl-0.637
 dis-0.575
riger-0.571
Token and
Token ID290
Feature activation+1.90e-01

Pos logit contributions
ap+1.043
ucks+1.031
 upon+0.978
angles+0.975
apping+0.964
Neg logit contributions
col-1.456
 gn-1.421
 disl-1.346
 dis-1.178
riger-1.153
Token be
Token ID307
Feature activation+4.60e-01

Pos logit contributions
ap+0.200
ucks+0.189
 upon+0.187
apping+0.184
angles+0.171
Neg logit contributions
col-0.366
 gn-0.357
 disl-0.339
 dis-0.306
growth-0.303
Token part
Token ID636
Feature activation+6.85e-01

Pos logit contributions
ap+0.600
ucks+0.581
 upon+0.568
apping+0.560
angles+0.541
Neg logit contributions
col-0.775
 gn-0.750
 disl-0.704
 dis-0.631
riger-0.615
Token of
Token ID286
Feature activation+1.13e-01

Pos logit contributions
ap+0.503
ucks+0.469
 upon+0.465
apping+0.429
angles+0.415
Neg logit contributions
col-1.554
 gn-1.493
 disl-1.417
 dis-1.300
 rooft-1.294
Token something
Token ID1223
Feature activation+8.89e-02

Pos logit contributions
ap+0.135
ucks+0.127
 upon+0.127
apping+0.124
aps+0.117
Neg logit contributions
col-0.204
 gn-0.198
 disl-0.187
 dis-0.168
growth-0.168
Token bigger
Token ID5749
Feature activation-9.52e-01

Pos logit contributions
ap+0.116
ucks+0.110
 upon+0.110
apping+0.107
angles+0.102
Neg logit contributions
col-0.150
 gn-0.145
 disl-0.136
 dis-0.121
riger-0.121
Token.
Token ID13
Feature activation+7.12e-01

Pos logit contributions
col+2.090
 gn+2.050
 disl+1.956
riger+1.828
 dis+1.812
Neg logit contributions
ap-0.787
 upon-0.677
ucks-0.676
apping-0.675
aps-0.624
Token\n
Token ID198
Feature activation-6.66e-02

Pos logit contributions
ap+1.098
ucks+1.059
 upon+1.050
apping+1.041
angles+1.008
Neg logit contributions
col-1.027
 gn-0.975
 disl-0.911
growth-0.808
 rooft-0.804
TokenOne
Token ID3198
Feature activation-3.47e-01

Pos logit contributions
col+0.107
 gn+0.103
 disl+0.097
riger+0.085
 dis+0.085
Neg logit contributions
ap-0.092
 upon-0.088
ucks-0.087
apping-0.086
aps-0.081
Token night
Token ID1755
Feature activation+2.66e-01

Pos logit contributions
col+0.494
 gn+0.475
 disl+0.445
riger+0.391
growth+0.384
Neg logit contributions
ap-0.554
 upon-0.532
apping-0.518
ucks-0.516
aps-0.490
Token,
Token ID11
Feature activation+1.57e+00

Pos logit contributions
ap+0.409
ucks+0.396
 upon+0.387
apping+0.384
angles+0.373
Neg logit contributions
col-0.384
 gn-0.372
 disl-0.342
 dis-0.299
growth-0.299
Token stayed
Token ID9658
Feature activation+1.60e+00

Pos logit contributions
col+0.033
 gn+0.032
 disl+0.030
riger+0.026
 dis+0.026
Neg logit contributions
ap-0.028
 upon-0.026
ucks-0.026
apping-0.026
angles-0.024
Token with
Token ID351
Feature activation-7.14e-01

Pos logit contributions
ucks+2.038
ap+1.925
angles+1.896
apping+1.860
 upon+1.854
Neg logit contributions
col-2.658
 gn-2.573
 disl-2.319
 dis-2.207
 rooft-2.160
Token Sun
Token ID3825
Feature activation+3.94e-01

Pos logit contributions
col+1.446
 gn+1.422
 disl+1.350
riger+1.238
 dis+1.205
Neg logit contributions
ap-0.706
 upon-0.647
ucks-0.625
apping-0.602
angles-0.574
Tokenflower
Token ID25547
Feature activation+5.44e-01

Pos logit contributions
ap+0.617
ucks+0.596
 upon+0.590
apping+0.578
angles+0.562
Neg logit contributions
col-0.554
 gn-0.529
 disl-0.487
riger-0.422
 rooft-0.422
Token until
Token ID1566
Feature activation-4.16e-01

Pos logit contributions
ap+0.606
ucks+0.603
 upon+0.583
apping+0.564
angles+0.538
Neg logit contributions
col-0.994
 gn-0.961
 disl-0.896
riger-0.812
 dis-0.808
Token the
Token ID262
Feature activation-4.31e-01

Pos logit contributions
col+0.740
 gn+0.717
 disl+0.685
riger+0.606
 dis+0.601
Neg logit contributions
ap-0.511
 upon-0.484
ucks-0.467
apping-0.460
angles-0.434
Token sun
Token ID4252
Feature activation+8.00e-02

Pos logit contributions
col+0.641
 gn+0.618
 disl+0.584
riger+0.512
growth+0.508
Neg logit contributions
ap-0.650
 upon-0.619
ucks-0.603
apping-0.598
aps-0.570
Token set
Token ID900
Feature activation+8.36e-01

Pos logit contributions
ap+0.116
ucks+0.110
 upon+0.110
apping+0.108
angles+0.103
Neg logit contributions
col-0.123
 gn-0.118
 disl-0.110
riger-0.097
 dis-0.097
Token and
Token ID290
Feature activation-4.63e-02

Pos logit contributions
ucks+1.107
ap+1.100
 upon+1.066
apping+1.044
aps+1.003
Neg logit contributions
col-1.383
 gn-1.299
 disl-1.192
 rooft-1.095
growth-1.094
Token it
Token ID340
Feature activation-1.36e-01

Pos logit contributions
col+0.078
 gn+0.075
 disl+0.071
riger+0.063
 dis+0.063
Neg logit contributions
ap-0.061
 upon-0.057
ucks-0.057
apping-0.056
angles-0.053
Token was
Token ID373
Feature activation+3.29e-01

Pos logit contributions
col+0.237
 gn+0.229
 disl+0.217
riger+0.194
 dis+0.193
Neg logit contributions
ap-0.170
 upon-0.161
ucks-0.160
apping-0.156
angles-0.148
Token to
Token ID284
Feature activation+1.02e+00

Pos logit contributions
ap+0.230
ucks+0.218
 upon+0.214
apping+0.212
angles+0.201
Neg logit contributions
col-0.309
 gn-0.300
 disl-0.281
 dis-0.250
riger-0.249
Token give
Token ID1577
Feature activation+3.96e-01

Pos logit contributions
ucks+1.666
ap+1.638
apping+1.626
 upon+1.599
angles+1.556
Neg logit contributions
col-1.396
 gn-1.395
 disl-1.244
 dis-1.105
growth-1.036
Token them
Token ID606
Feature activation+4.46e-01

Pos logit contributions
ap+0.452
apping+0.424
ucks+0.420
 upon+0.419
aps+0.388
Neg logit contributions
col-0.740
 gn-0.719
 disl-0.674
 dis-0.611
growth-0.608
Token back
Token ID736
Feature activation+4.28e-01

Pos logit contributions
ap+0.595
ucks+0.565
apping+0.562
 upon+0.554
angles+0.526
Neg logit contributions
col-0.743
 gn-0.710
 disl-0.671
 dis-0.594
growth-0.588
Token.
Token ID13
Feature activation+4.11e-01

Pos logit contributions
ap+0.551
ucks+0.523
apping+0.518
 upon+0.510
angles+0.482
Neg logit contributions
col-0.733
 gn-0.704
 disl-0.660
 dis-0.587
growth-0.585
Token The
Token ID383
Feature activation-2.18e+00

Pos logit contributions
ap+0.607
 upon+0.581
ucks+0.581
apping+0.570
angles+0.550
Neg logit contributions
col-0.621
 gn-0.605
 disl-0.565
 dis-0.495
riger-0.494
Token squirrel
Token ID33039
Feature activation+8.48e-02

Pos logit contributions
col+4.417
 disl+4.040
 gn+3.873
riger+3.745
 CRE+3.740
Neg logit contributions
 upon-2.319
 friend-2.249
ap-2.219
ucks-2.159
 friends-2.037
Token felt
Token ID2936
Feature activation-5.51e-01

Pos logit contributions
ap+0.125
ucks+0.119
 upon+0.118
apping+0.116
aps+0.111
Neg logit contributions
col-0.129
 gn-0.124
 disl-0.116
 dis-0.102
riger-0.101
Token bad
Token ID2089
Feature activation+4.44e-01

Pos logit contributions
col+0.902
 gn+0.857
 disl+0.801
riger+0.735
 rooft+0.724
Neg logit contributions
ap-0.763
 upon-0.722
ucks-0.701
apping-0.697
aps-0.665
Token and
Token ID290
Feature activation+1.94e-01

Pos logit contributions
ap+0.451
ucks+0.427
 upon+0.417
apping+0.403
angles+0.392
Neg logit contributions
col-0.868
 gn-0.851
 disl-0.799
 dis-0.733
riger-0.720
Token said
Token ID531
Feature activation-6.16e-02

Pos logit contributions
ap+0.291
ucks+0.278
 upon+0.276
apping+0.273
aps+0.261
Neg logit contributions
col-0.290
 gn-0.282
 disl-0.263
 dis-0.231
riger-0.226
Token Tim
Token ID5045
Feature activation+2.75e-01

Pos logit contributions
ap+0.117
ucks+0.108
 upon+0.107
apping+0.104
aps+0.095
Neg logit contributions
col-0.299
 gn-0.291
 disl-0.279
 dis-0.256
riger-0.255
Tokenmy
Token ID1820
Feature activation-7.46e-01

Pos logit contributions
ap+0.240
ucks+0.223
 upon+0.216
apping+0.213
angles+0.197
Neg logit contributions
col-0.579
 gn-0.565
 disl-0.538
 dis-0.493
riger-0.491
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
col+1.401
 gn+1.351
 disl+1.278
growth+1.176
riger+1.157
Neg logit contributions
ap-0.860
 upon-0.800
apping-0.776
ucks-0.749
aps-0.716
Token\n
Token ID198
Feature activation-5.63e-01

Pos logit contributions
col+0.337
 gn+0.326
 disl+0.305
riger+0.275
growth+0.274
Neg logit contributions
ap-0.258
 upon-0.246
ucks-0.244
apping-0.238
aps-0.225
Token\n
Token ID198
Feature activation-8.11e-02

Pos logit contributions
col+0.978
 gn+0.957
 disl+0.898
riger+0.815
growth+0.799
Neg logit contributions
ap-0.696
 upon-0.677
ucks-0.660
apping-0.645
aps-0.614
TokenThe
Token ID464
Feature activation-1.66e+00

Pos logit contributions
col+0.138
 gn+0.134
 disl+0.126
riger+0.112
 dis+0.112
Neg logit contributions
ap-0.105
 upon-0.099
ucks-0.098
apping-0.097
angles-0.091
Token next
Token ID1306
Feature activation+6.09e-02

Pos logit contributions
col+3.263
 gn+3.051
 disl+2.945
growth+2.788
riger+2.742
Neg logit contributions
ap-1.890
 upon-1.842
apping-1.670
ucks-1.608
 friend-1.563
Token day
Token ID1110
Feature activation+8.77e-01

Pos logit contributions
ap+0.103
 upon+0.099
ucks+0.099
apping+0.097
angles+0.094
Neg logit contributions
col-0.080
 gn-0.076
 disl-0.071
riger-0.060
 dis-0.060
Token,
Token ID11
Feature activation+8.45e-01

Pos logit contributions
ucks+1.428
ap+1.424
 upon+1.369
apping+1.358
angles+1.340
Neg logit contributions
col-1.156
 gn-1.095
 disl-0.981
 dis-0.892
riger-0.854
Token Sarah
Token ID10490
Feature activation-7.40e-01

Pos logit contributions
ap+1.181
ucks+1.163
apping+1.116
aps+1.086
 upon+1.077
Neg logit contributions
col-1.363
 gn-1.292
 disl-1.241
 dis-1.122
riger-1.056
Token wore
Token ID12408
Feature activation-2.46e-01

Pos logit contributions
col+1.354
 gn+1.295
 disl+1.256
riger+1.139
growth+1.134
Neg logit contributions
ap-0.843
 upon-0.813
ucks-0.780
apping-0.761
angles-0.718
Token head
Token ID1182
Feature activation-4.25e-01

Pos logit contributions
ap+0.197
ucks+0.187
 upon+0.187
apping+0.182
angles+0.172
Neg logit contributions
col-0.305
 gn-0.297
 disl-0.280
 dis-0.252
riger-0.251
Token.
Token ID13
Feature activation-3.97e-01

Pos logit contributions
col+0.774
 gn+0.750
 disl+0.717
riger+0.647
growth+0.642
Neg logit contributions
ap-0.496
 upon-0.470
ucks-0.458
apping-0.455
aps-0.426
Token He
Token ID679
Feature activation+3.36e-01

Pos logit contributions
col+0.793
 gn+0.763
 disl+0.730
riger+0.669
 dis+0.661
Neg logit contributions
ap-0.402
ucks-0.378
 upon-0.377
apping-0.366
aps-0.346
Token asks
Token ID7893
Feature activation+3.98e-01

Pos logit contributions
ap+0.431
ucks+0.407
apping+0.405
 upon+0.404
ooks+0.376
Neg logit contributions
col-0.572
 gn-0.558
 disl-0.523
 dis-0.465
riger-0.463
Token Tim
Token ID5045
Feature activation+4.82e-03

Pos logit contributions
ap+0.462
ucks+0.433
apping+0.426
 upon+0.425
angles+0.396
Neg logit contributions
col-0.730
 gn-0.717
 disl-0.666
 dis-0.610
growth-0.602
Token and
Token ID290
Feature activation-1.74e+00

Pos logit contributions
ap+0.005
 upon+0.004
ucks+0.004
apping+0.004
angles+0.004
Neg logit contributions
col-0.010
 gn-0.010
 disl-0.009
riger-0.008
 dis-0.008
Token Sam
Token ID3409
Feature activation-2.73e-01

Pos logit contributions
col+3.584
 gn+3.236
 disl+3.209
 rooft+3.086
growth+3.082
Neg logit contributions
ap-1.681
 upon-1.638
ucks-1.588
 for-1.464
aps-1.454
Token if
Token ID611
Feature activation-1.04e-01

Pos logit contributions
col+0.471
 gn+0.453
 disl+0.429
growth+0.386
riger+0.383
Neg logit contributions
ap-0.350
 upon-0.333
ucks-0.323
apping-0.320
aps-0.302
Token he
Token ID339
Feature activation+5.00e-01

Pos logit contributions
col+0.162
 gn+0.155
 disl+0.146
riger+0.129
growth+0.128
Neg logit contributions
ap-0.149
 upon-0.142
ucks-0.141
apping-0.138
angles-0.132
Token can
Token ID460
Feature activation+3.73e-01

Pos logit contributions
ap+0.654
apping+0.618
ucks+0.616
 upon+0.614
ooks+0.578
Neg logit contributions
col-0.843
 gn-0.825
 disl-0.770
riger-0.684
 dis-0.683
Token play
Token ID711
Feature activation+3.46e-01

Pos logit contributions
ap+0.443
ucks+0.425
 upon+0.421
apping+0.414
angles+0.394
Neg logit contributions
col-0.663
 gn-0.646
 disl-0.600
 dis-0.542
riger-0.540
Token was
Token ID373
Feature activation+6.32e-01

Pos logit contributions
col+0.219
 gn+0.211
 disl+0.200
riger+0.179
growth+0.178
Neg logit contributions
ap-0.168
 upon-0.160
ucks-0.158
apping-0.155
angles-0.147
Token very
Token ID845
Feature activation+5.89e-01

Pos logit contributions
ap+0.963
ucks+0.950
 upon+0.912
apping+0.896
angles+0.895
Neg logit contributions
col-0.910
 gn-0.906
 disl-0.807
 dis-0.743
 rooft-0.703
Token pinch
Token ID25394
Feature activation+5.34e-01

Pos logit contributions
ap+0.891
ucks+0.866
 upon+0.839
apping+0.827
angles+0.818
Neg logit contributions
col-0.860
 gn-0.845
 disl-0.784
 dis-0.696
riger-0.662
Tokeny
Token ID88
Feature activation-9.05e-01

Pos logit contributions
ap+0.626
ucks+0.588
 upon+0.581
apping+0.573
angles+0.551
Neg logit contributions
col-0.976
 gn-0.935
 disl-0.879
 dis-0.807
growth-0.793
Token.
Token ID13
Feature activation+3.97e-01

Pos logit contributions
col+1.730
 gn+1.677
 disl+1.628
riger+1.505
growth+1.458
Neg logit contributions
ap-0.988
 upon-0.921
apping-0.889
ucks-0.882
aps-0.838
Token The
Token ID383
Feature activation-1.82e+00

Pos logit contributions
ap+0.561
 upon+0.536
ucks+0.535
apping+0.523
angles+0.505
Neg logit contributions
col-0.624
 gn-0.610
 disl-0.569
 dis-0.506
growth-0.503
Token brown
Token ID7586
Feature activation-3.36e-01

Pos logit contributions
col+3.530
 gn+3.184
 disl+3.121
riger+3.071
 CRE+2.959
Neg logit contributions
 upon-2.135
ap-2.085
 friend-1.882
ucks-1.849
apping-1.848
Token dog
Token ID3290
Feature activation+2.40e-01

Pos logit contributions
col+0.485
 gn+0.463
 disl+0.433
riger+0.384
 dis+0.376
Neg logit contributions
ap-0.527
 upon-0.506
ucks-0.500
apping-0.492
aps-0.470
Token wanted
Token ID2227
Feature activation+7.54e-01

Pos logit contributions
ap+0.339
ucks+0.326
 upon+0.321
apping+0.317
angles+0.302
Neg logit contributions
col-0.378
 gn-0.367
 disl-0.340
 dis-0.304
riger-0.300
Token to
Token ID284
Feature activation+5.76e-01

Pos logit contributions
ap+1.025
ucks+0.974
apping+0.969
 upon+0.945
angles+0.905
Neg logit contributions
col-1.259
 gn-1.235
 disl-1.146
 dis-1.052
growth-1.019
Token play
Token ID711
Feature activation+6.18e-01

Pos logit contributions
ap+0.859
 upon+0.835
ucks+0.834
apping+0.831
angles+0.790
Neg logit contributions
col-0.864
 gn-0.843
 disl-0.792
 dis-0.687
growth-0.676
Token for
Token ID329
Feature activation+4.37e-01

Pos logit contributions
ap+0.254
ucks+0.237
 upon+0.231
apping+0.224
angles+0.209
Neg logit contributions
col-0.697
 gn-0.680
 disl-0.645
 dis-0.595
riger-0.593
Token being
Token ID852
Feature activation-3.57e-01

Pos logit contributions
ap+0.443
ucks+0.420
 upon+0.412
apping+0.403
angles+0.377
Neg logit contributions
col-0.862
 gn-0.843
 disl-0.805
 dis-0.744
riger-0.727
Token good
Token ID922
Feature activation+5.96e-01

Pos logit contributions
col+0.554
 gn+0.537
 disl+0.501
growth+0.449
riger+0.443
Neg logit contributions
ap-0.520
 upon-0.490
ucks-0.486
apping-0.486
aps-0.459
Token.
Token ID13
Feature activation-5.41e-01

Pos logit contributions
ap+0.680
ucks+0.649
 upon+0.644
apping+0.624
angles+0.603
Neg logit contributions
col-1.094
 gn-1.054
 disl-1.000
 dis-0.916
riger-0.893
Token She
Token ID1375
Feature activation-9.56e-02

Pos logit contributions
col+1.096
 gn+1.063
 disl+1.015
riger+0.943
growth+0.928
Neg logit contributions
ap-0.516
 upon-0.494
ucks-0.487
apping-0.476
aps-0.444
Token said
Token ID531
Feature activation-1.77e+00

Pos logit contributions
col+0.150
 gn+0.145
 disl+0.136
riger+0.120
growth+0.120
Neg logit contributions
ap-0.135
 upon-0.129
ucks-0.128
apping-0.126
angles-0.120
Token that
Token ID326
Feature activation-5.84e-01

Pos logit contributions
col+3.369
 gn+3.185
 disl+3.007
growth+2.997
riger+2.861
Neg logit contributions
ap-2.087
 upon-1.987
apping-1.947
 for-1.813
ucks-1.784
Token sharing
Token ID7373
Feature activation+4.87e-02

Pos logit contributions
col+1.006
 gn+0.975
 disl+0.918
riger+0.839
growth+0.835
Neg logit contributions
ap-0.731
 upon-0.701
ucks-0.677
apping-0.670
angles-0.633
Token was
Token ID373
Feature activation+3.22e-01

Pos logit contributions
ap+0.055
ucks+0.052
 upon+0.052
apping+0.051
aps+0.047
Neg logit contributions
col-0.090
 gn-0.087
 disl-0.083
 dis-0.075
riger-0.075
Token caring
Token ID18088
Feature activation+2.69e-01

Pos logit contributions
ap+0.434
ucks+0.419
 upon+0.412
apping+0.405
aps+0.386
Neg logit contributions
col-0.528
 gn-0.510
 disl-0.484
 dis-0.432
riger-0.422
Token and
Token ID290
Feature activation-7.53e-02

Pos logit contributions
ap+0.316
ucks+0.302
 upon+0.298
apping+0.290
angles+0.276
Neg logit contributions
col-0.484
 gn-0.468
 disl-0.443
 dis-0.400
riger-0.396