TOP ACTIVATIONS
MAX = 3.162

his friend . Y oga had an idea .
The string breaks . The k ite flies away . They
rolled out . Spot and Fl uffy ran after the ball
Ben slides down , his rh ino falls out of his
After a while , the k ang aroo woke up and
the boy and soon the k ite was ready to fly
was finally finished , the k ite was so proud !
cl ap the jar . Cl ap , cl ap ,
back and forth and the k ite went up and down
the ball . Spot and Fl uffy were very sad .
They look at the k ite . They see a
and smiled . She held Qu ack in her arms and
He flew the k ite for a long time
As the rabbit and the k ite said goodbye , the
up on the blanket . Fl uffy was very happy in
, their bike and their k ite . They are happy
to run , and the k ite went up , up
outside . Soon , the k ite was high in the
After a while , the k ite flew away . John
, and even her little k itty . Everywhere she went

MIN ACTIVATIONS
MIN = -3.115

the night ?" The rabbit said , " I
for me !" The surprise turned out to be
a jacket . The p anda asked , "
go see some birds !" The little orange bear was excited
too ?" The other child was a bit
shouted Bob . The elf said , " G
over there !" The two children carefully climbed down
scared ." But the first child said , "
seen it ?" The p anda shook his head
me ?" The crab replied , " I
like me ?" The crab looked at T una
luggage . The elephant asked , " What
with him . The little girl was so happy
the jungle ?" The child said , " I
he could play too . The babys itter said it was
it looks fun !" The second child said , "
there . The elf said that he had
you doing ?" The bear replied , " I
one day ?" The elf smiled and said ,
love to play ," replied the crab . L

INTERVAL 1.593 - 3.162
CONTAINS 0.293%

lesson . He learned that not all plants are herbs .
They play with the blue ball . They forget about
. They took Spark y to their room and
. You love w affles , right ?" Z
did not work . The day did not end well for

INTERVAL 0.024 - 1.593
CONTAINS 47.791%

out . He pointed at it and shouted " Where does
cow . First , we need to make you big .
person that had tried to encourage the boy to change his
excited with his new toy ! He ran around the house
, but eventually the old lady succeeded . She kept it

INTERVAL -1.545 - 0.024
CONTAINS 51.677%

wanted to help Tim fix his toy , but she didn
. They had fun . They were happy . <|endoftext|> Anna
came up to him and said , " Do you want
you ?" Tim asked . The yacht did not talk ,
vous rabbit came and started to eat the flowers ! The

INTERVAL -3.115 - -1.545
CONTAINS 0.238%

the dirt and look for bugs . Sometimes they found worms
decided to help him . The giant used his strong legs
. " Look , a story !" Lily says
should go . The elephant thanked the scientist and
went back inside and told their parents what they observed in
Token his
Token ID465
Feature activation-3.33e-01

Pos logit contributions
ucks+1.848
ap+1.783
 upon+1.771
apping+1.758
itting+1.684
Neg logit contributions
 gn-2.107
col-2.090
 disl-1.920
 dis-1.816
riger-1.720
Token friend
Token ID1545
Feature activation-3.27e-01

Pos logit contributions
col+0.890
 gn+0.863
 disl+0.836
riger+0.785
growth+0.780
Neg logit contributions
ap-0.125
 upon-0.093
ucks-0.091
apping-0.086
aps-0.066
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
col+0.545
 gn+0.523
 disl+0.494
riger+0.441
growth+0.441
Neg logit contributions
ap-0.441
 upon-0.416
apping-0.404
ucks-0.404
aps-0.382
Token
Token ID198
Feature activation-5.03e-01

Pos logit contributions
ap+0.403
 upon+0.381
ucks+0.379
apping+0.372
angles+0.359
Neg logit contributions
col-0.480
 gn-0.463
 disl-0.433
riger-0.384
growth-0.383
Token
Token ID198
Feature activation+2.79e-02

Pos logit contributions
col+0.849
 gn+0.825
 disl+0.781
riger+0.697
 dis+0.685
Neg logit contributions
ap-0.654
 upon-0.634
ucks-0.618
apping-0.608
aps-0.583
TokenY
Token ID56
Feature activation+3.16e+00

Pos logit contributions
ap+0.037
 upon+0.035
ucks+0.035
apping+0.034
angles+0.032
Neg logit contributions
col-0.047
 gn-0.045
 disl-0.043
riger-0.038
 dis-0.038
Tokenoga
Token ID10949
Feature activation+8.87e-01

Pos logit contributions
 upon+4.358
ooks+4.150
ucks+4.012
ook+3.722
ap+3.719
Neg logit contributions
col-5.405
 gn-4.789
 disl-4.239
riger-4.231
growth-3.963
Token had
Token ID550
Feature activation+5.24e-01

Pos logit contributions
ucks+1.203
ap+1.191
 upon+1.130
apping+1.125
ooks+1.072
Neg logit contributions
col-1.448
 gn-1.378
 disl-1.278
 dis-1.179
growth-1.145
Token an
Token ID281
Feature activation-1.87e-01

Pos logit contributions
ap+0.701
ucks+0.674
apping+0.654
 upon+0.651
angles+0.617
Neg logit contributions
col-0.875
 gn-0.844
 disl-0.796
 dis-0.709
growth-0.701
Token idea
Token ID2126
Feature activation+1.31e-01

Pos logit contributions
col+0.302
 gn+0.291
 disl+0.273
riger+0.243
growth+0.241
Neg logit contributions
ap-0.260
 upon-0.245
ucks-0.244
apping-0.240
angles-0.228
Token.
Token ID13
Feature activation+6.49e-01

Pos logit contributions
ap+0.165
ucks+0.156
 upon+0.155
apping+0.151
angles+0.144
Neg logit contributions
col-0.227
 gn-0.219
 disl-0.206
 dis-0.185
riger-0.184
Token The
Token ID383
Feature activation+9.51e-01

Pos logit contributions
ap+0.049
 upon+0.046
ucks+0.046
apping+0.045
angles+0.043
Neg logit contributions
col-0.066
 gn-0.063
 disl-0.060
 dis-0.053
riger-0.053
Token string
Token ID4731
Feature activation+1.34e+00

Pos logit contributions
ap+1.358
ucks+1.337
 upon+1.298
apping+1.291
ooks+1.260
Neg logit contributions
col-1.459
 gn-1.435
 disl-1.305
 rooft-1.212
 dis-1.204
Token breaks
Token ID9457
Feature activation+7.65e-01

Pos logit contributions
ap+1.955
ucks+1.936
ooks+1.886
apping+1.882
 upon+1.861
Neg logit contributions
col-2.036
 gn-1.973
 disl-1.756
 dis-1.600
growth-1.583
Token.
Token ID13
Feature activation+3.66e-01

Pos logit contributions
ap+1.020
ucks+0.971
 upon+0.949
apping+0.936
angles+0.897
Neg logit contributions
col-1.282
 gn-1.221
 disl-1.095
 dis-1.034
 rooft-1.029
Token The
Token ID383
Feature activation+1.36e+00

Pos logit contributions
ap+0.483
 upon+0.458
ucks+0.454
apping+0.445
angles+0.429
Neg logit contributions
col-0.608
 gn-0.591
 disl-0.555
growth-0.501
 dis-0.497
Token k
Token ID479
Feature activation+3.12e+00

Pos logit contributions
ucks+1.995
ap+1.986
apping+1.914
 upon+1.909
ooks+1.872
Neg logit contributions
col-2.018
 gn-2.013
 disl-1.797
 rooft-1.710
 dis-1.685
Tokenite
Token ID578
Feature activation+9.65e-01

Pos logit contributions
 upon+4.790
ooks+4.657
ap+4.457
angles+4.327
ucks+4.206
Neg logit contributions
col-4.695
 gn-4.218
 rooft-3.916
 pel-3.831
riger-3.746
Token flies
Token ID17607
Feature activation+6.94e-01

Pos logit contributions
ap+1.478
ucks+1.441
apping+1.430
 upon+1.405
ooks+1.394
Neg logit contributions
col-1.402
 gn-1.359
 disl-1.205
 dis-1.090
growth-1.078
Token away
Token ID1497
Feature activation+2.72e-01

Pos logit contributions
ap+1.003
ucks+0.966
apping+0.940
 upon+0.919
ooks+0.898
Neg logit contributions
col-1.094
 gn-1.040
 disl-0.936
 dis-0.856
 rooft-0.853
Token.
Token ID13
Feature activation+4.61e-01

Pos logit contributions
ap+0.324
ucks+0.306
 upon+0.301
apping+0.295
angles+0.283
Neg logit contributions
col-0.492
 gn-0.478
 disl-0.444
 dis-0.405
growth-0.401
Token They
Token ID1119
Feature activation-8.00e-02

Pos logit contributions
ap+0.605
 upon+0.572
ucks+0.567
apping+0.555
angles+0.538
Neg logit contributions
col-0.772
 gn-0.750
 disl-0.706
growth-0.637
 dis-0.634
Token rolled
Token ID11686
Feature activation+8.01e-02

Pos logit contributions
ap+0.633
ucks+0.610
 upon+0.596
apping+0.594
ooks+0.568
Neg logit contributions
col-0.605
 gn-0.586
 disl-0.528
 dis-0.475
growth-0.466
Token out
Token ID503
Feature activation+7.57e-02

Pos logit contributions
ap+0.111
ucks+0.105
 upon+0.104
apping+0.103
angles+0.097
Neg logit contributions
col-0.130
 gn-0.125
 disl-0.117
 dis-0.104
riger-0.104
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
ap+0.074
ucks+0.068
 upon+0.068
apping+0.066
angles+0.062
Neg logit contributions
col-0.153
 gn-0.149
 disl-0.141
 dis-0.128
riger-0.128
Token Spot
Token ID15899
Feature activation+1.12e-01

Pos logit contributions
ap+0.334
 upon+0.314
ucks+0.314
apping+0.308
angles+0.296
Neg logit contributions
col-0.412
 gn-0.401
 disl-0.375
 dis-0.335
growth-0.334
Token and
Token ID290
Feature activation-1.06e-01

Pos logit contributions
ap+0.159
ucks+0.151
 upon+0.150
apping+0.148
angles+0.141
Neg logit contributions
col-0.176
 gn-0.171
 disl-0.158
 dis-0.141
riger-0.140
Token Fl
Token ID1610
Feature activation+3.08e+00

Pos logit contributions
col+0.183
 gn+0.176
 disl+0.166
riger+0.149
 dis+0.148
Neg logit contributions
ap-0.136
 upon-0.129
ucks-0.128
apping-0.125
angles-0.118
Tokenuffy
Token ID15352
Feature activation+1.23e-01

Pos logit contributions
 upon+3.796
angles+3.787
ucks+3.686
ooks+3.515
 yours+3.396
Neg logit contributions
col-4.971
 gn-4.779
 disl-4.599
cious-4.172
j-4.166
Token ran
Token ID4966
Feature activation-1.78e-01

Pos logit contributions
ap+0.177
ucks+0.170
 upon+0.169
apping+0.166
angles+0.158
Neg logit contributions
col-0.189
 gn-0.182
 disl-0.169
 dis-0.150
riger-0.150
Token after
Token ID706
Feature activation+1.36e-02

Pos logit contributions
col+0.296
 gn+0.287
 disl+0.272
riger+0.241
growth+0.240
Neg logit contributions
ap-0.234
 upon-0.223
ucks-0.220
apping-0.217
aps-0.205
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
ap+0.016
 upon+0.015
ucks+0.015
apping+0.015
angles+0.014
Neg logit contributions
col-0.025
 gn-0.024
 disl-0.023
riger-0.020
 dis-0.020
Token ball
Token ID2613
Feature activation+5.10e-01

Pos logit contributions
col+0.391
 gn+0.378
 disl+0.359
riger+0.321
growth+0.319
Neg logit contributions
ap-0.281
 upon-0.266
ucks-0.263
apping-0.258
aps-0.244
Token Ben
Token ID3932
Feature activation-7.89e-02

Pos logit contributions
ap+0.733
ucks+0.702
apping+0.688
 upon+0.681
aps+0.652
Neg logit contributions
col-0.723
 gn-0.718
 disl-0.646
 dis-0.584
growth-0.570
Token slides
Token ID19392
Feature activation+5.69e-01

Pos logit contributions
col+0.125
 gn+0.120
 disl+0.113
riger+0.099
 dis+0.099
Neg logit contributions
ap-0.111
 upon-0.106
ucks-0.105
apping-0.103
aps-0.098
Token down
Token ID866
Feature activation+2.21e-01

Pos logit contributions
ap+0.812
ucks+0.770
apping+0.765
 upon+0.756
ooks+0.737
Neg logit contributions
col-0.885
 gn-0.862
 disl-0.789
 dis-0.705
riger-0.699
Token,
Token ID11
Feature activation-1.03e-01

Pos logit contributions
ap+0.320
ucks+0.304
 upon+0.301
apping+0.298
angles+0.285
Neg logit contributions
col-0.342
 gn-0.331
 disl-0.304
 dis-0.272
riger-0.269
Token his
Token ID465
Feature activation+6.79e-01

Pos logit contributions
col+0.164
 gn+0.157
 disl+0.148
riger+0.131
 dis+0.130
Neg logit contributions
ap-0.144
 upon-0.137
ucks-0.136
apping-0.134
aps-0.127
Token rh
Token ID9529
Feature activation+3.06e+00

Pos logit contributions
ap+0.807
ucks+0.774
 upon+0.758
apping+0.753
angles+0.713
Neg logit contributions
col-1.197
 gn-1.193
 disl-1.094
 rooft-1.009
 dis-1.002
Tokenino
Token ID2879
Feature activation+2.13e-01

Pos logit contributions
ap+4.212
 upon+4.135
ooks+4.006
ucks+3.892
apping+3.785
Neg logit contributions
col-4.714
 gn-4.702
 disl-3.875
growth-3.860
handed-3.794
Token falls
Token ID8953
Feature activation-1.69e-01

Pos logit contributions
ap+0.291
ucks+0.279
 upon+0.276
apping+0.272
angles+0.258
Neg logit contributions
col-0.342
 gn-0.333
 disl-0.310
 dis-0.276
riger-0.274
Token out
Token ID503
Feature activation-3.20e-01

Pos logit contributions
col+0.292
 gn+0.283
 disl+0.269
riger+0.240
 dis+0.238
Neg logit contributions
ap-0.211
 upon-0.201
ucks-0.199
apping-0.194
aps-0.184
Token of
Token ID286
Feature activation-3.39e-01

Pos logit contributions
col+0.677
 gn+0.660
 disl+0.640
riger+0.586
 dis+0.575
Neg logit contributions
ap-0.278
 upon-0.261
ucks-0.256
apping-0.247
aps-0.230
Token his
Token ID465
Feature activation+1.10e-01

Pos logit contributions
col+0.632
 gn+0.607
 disl+0.581
riger+0.528
 dis+0.516
Neg logit contributions
ap-0.378
 upon-0.363
ucks-0.359
apping-0.341
aps-0.327
Token After
Token ID2293
Feature activation-6.41e-01

Pos logit contributions
col+0.834
 gn+0.805
 disl+0.760
riger+0.693
 dis+0.686
Neg logit contributions
ap-0.533
 upon-0.508
ucks-0.497
apping-0.490
aps-0.462
Token a
Token ID257
Feature activation-2.49e-01

Pos logit contributions
col+1.159
 gn+1.102
 disl+1.049
riger+0.956
growth+0.939
Neg logit contributions
ap-0.782
 upon-0.730
apping-0.704
ucks-0.697
aps-0.666
Token while
Token ID981
Feature activation+3.97e-01

Pos logit contributions
col+0.449
 gn+0.434
 disl+0.412
growth+0.368
riger+0.367
Neg logit contributions
ap-0.298
 upon-0.282
apping-0.272
ucks-0.271
angles-0.254
Token,
Token ID11
Feature activation+2.23e-01

Pos logit contributions
ap+0.616
ucks+0.607
 upon+0.589
apping+0.581
angles+0.570
Neg logit contributions
col-0.565
 gn-0.550
 disl-0.496
 dis-0.447
riger-0.439
Token the
Token ID262
Feature activation-2.86e-01

Pos logit contributions
ap+0.300
ucks+0.287
 upon+0.281
apping+0.279
angles+0.265
Neg logit contributions
col-0.365
 gn-0.355
 disl-0.332
 dis-0.296
riger-0.295
Token k
Token ID479
Feature activation+3.05e+00

Pos logit contributions
col+0.472
 gn+0.453
 disl+0.430
riger+0.383
growth+0.380
Neg logit contributions
ap-0.381
 upon-0.362
ucks-0.356
apping-0.351
aps-0.332
Tokenang
Token ID648
Feature activation+4.46e-01

Pos logit contributions
angles+4.436
ap+4.293
ooks+4.292
 upon+4.201
ucks+4.073
Neg logit contributions
col-4.935
 gn-4.565
 disl-3.876
handed-3.773
 pel-3.762
Tokenaroo
Token ID38049
Feature activation+8.12e-03

Pos logit contributions
ap+0.800
ucks+0.776
 upon+0.766
apping+0.754
angles+0.739
Neg logit contributions
col-0.533
 gn-0.503
 disl-0.451
 dis-0.385
growth-0.379
Token woke
Token ID19092
Feature activation+6.64e-01

Pos logit contributions
ap+0.011
 upon+0.011
ucks+0.011
apping+0.010
angles+0.010
Neg logit contributions
col-0.013
 gn-0.013
 disl-0.012
riger-0.011
 dis-0.010
Token up
Token ID510
Feature activation+6.24e-01

Pos logit contributions
ap+0.972
ucks+0.925
apping+0.920
 upon+0.919
aps+0.876
Neg logit contributions
col-1.004
 gn-0.984
 disl-0.910
 rooft-0.806
 dis-0.803
Token and
Token ID290
Feature activation+5.14e-01

Pos logit contributions
ap+0.755
ucks+0.727
angles+0.691
apping+0.690
 upon+0.681
Neg logit contributions
col-1.113
 gn-1.096
 disl-1.000
 dis-0.918
 rooft-0.905
Token the
Token ID262
Feature activation+1.42e-01

Pos logit contributions
ap+0.256
ucks+0.241
 upon+0.241
apping+0.237
aps+0.223
Neg logit contributions
col-0.364
 gn-0.353
 disl-0.331
 dis-0.299
growth-0.297
Token boy
Token ID2933
Feature activation+5.40e-02

Pos logit contributions
ap+0.198
ucks+0.187
 upon+0.187
apping+0.183
angles+0.176
Neg logit contributions
col-0.228
 gn-0.222
 disl-0.207
 dis-0.184
riger-0.182
Token and
Token ID290
Feature activation+2.70e-02

Pos logit contributions
ap+0.066
ucks+0.063
 upon+0.063
apping+0.061
angles+0.058
Neg logit contributions
col-0.095
 gn-0.092
 disl-0.086
riger-0.077
 dis-0.077
Token soon
Token ID2582
Feature activation-6.34e-01

Pos logit contributions
ap+0.037
 upon+0.035
ucks+0.035
apping+0.034
angles+0.033
Neg logit contributions
col-0.044
 gn-0.042
 disl-0.040
 dis-0.035
riger-0.035
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
col+1.087
 gn+1.030
 disl+0.998
riger+0.887
 rooft+0.883
Neg logit contributions
ap-0.811
 upon-0.785
apping-0.749
ucks-0.747
angles-0.713
Token k
Token ID479
Feature activation+3.05e+00

Pos logit contributions
ap+1.655
ucks+1.622
apping+1.553
ooks+1.551
 upon+1.549
Neg logit contributions
 gn-1.735
col-1.717
 disl-1.557
 rooft-1.443
 dis-1.441
Tokenite
Token ID578
Feature activation+9.83e-01

Pos logit contributions
ooks+4.584
 upon+4.529
ap+4.370
angles+4.344
ucks+4.073
Neg logit contributions
col-4.620
 gn-4.404
riger-3.901
 rooft-3.755
 disl-3.712
Token was
Token ID373
Feature activation+6.39e-01

Pos logit contributions
ap+1.436
ucks+1.418
 upon+1.358
apping+1.358
ooks+1.328
Neg logit contributions
col-1.514
 gn-1.470
 disl-1.310
 dis-1.195
 rooft-1.160
Token ready
Token ID3492
Feature activation+8.97e-01

Pos logit contributions
ap+0.836
ucks+0.822
 upon+0.785
apping+0.765
angles+0.751
Neg logit contributions
col-1.075
 gn-1.043
 disl-0.957
 dis-0.882
 rooft-0.860
Token to
Token ID284
Feature activation+2.04e-01

Pos logit contributions
ap+1.175
ucks+1.155
apping+1.086
 upon+1.064
angles+1.048
Neg logit contributions
col-1.531
 gn-1.485
 disl-1.353
 dis-1.246
 rooft-1.223
Token fly
Token ID6129
Feature activation+3.98e-01

Pos logit contributions
ap+0.230
ucks+0.219
 upon+0.219
apping+0.213
angles+0.199
Neg logit contributions
col-0.378
 gn-0.367
 disl-0.347
riger-0.313
 dis-0.313
Token was
Token ID373
Feature activation+1.47e-01

Pos logit contributions
col+0.268
 gn+0.259
 disl+0.246
riger+0.220
 dis+0.219
Neg logit contributions
ap-0.182
 upon-0.172
ucks-0.170
apping-0.166
angles-0.156
Token finally
Token ID3443
Feature activation+6.74e-01

Pos logit contributions
ap+0.213
ucks+0.204
 upon+0.201
apping+0.199
aps+0.190
Neg logit contributions
col-0.227
 gn-0.218
 disl-0.203
 dis-0.181
riger-0.180
Token finished
Token ID5201
Feature activation+8.53e-02

Pos logit contributions
ap+1.023
ucks+1.016
apping+0.959
 upon+0.948
aps+0.930
Neg logit contributions
col-0.998
 gn-0.956
 disl-0.870
 dis-0.789
riger-0.780
Token,
Token ID11
Feature activation+8.72e-01

Pos logit contributions
ap+0.127
ucks+0.121
 upon+0.121
apping+0.118
angles+0.113
Neg logit contributions
col-0.128
 gn-0.124
 disl-0.115
 dis-0.101
riger-0.101
Token the
Token ID262
Feature activation+7.76e-01

Pos logit contributions
ap+1.220
ucks+1.215
apping+1.161
 upon+1.139
angles+1.102
Neg logit contributions
col-1.375
 gn-1.342
 disl-1.239
 dis-1.122
 rooft-1.105
Token k
Token ID479
Feature activation+3.05e+00

Pos logit contributions
ap+1.088
ucks+1.069
 upon+1.038
apping+1.024
ooks+0.994
Neg logit contributions
col-1.198
 gn-1.191
 disl-1.084
 dis-0.994
 rooft-0.986
Tokenite
Token ID578
Feature activation+8.37e-01

Pos logit contributions
 upon+4.420
angles+4.387
ooks+4.320
ap+4.305
ucks+4.142
Neg logit contributions
col-4.677
 gn-4.346
 pel-3.861
 disl-3.684
 rooft-3.670
Token was
Token ID373
Feature activation+3.04e-01

Pos logit contributions
ap+1.216
ucks+1.193
apping+1.143
 upon+1.135
ooks+1.109
Neg logit contributions
col-1.299
 gn-1.269
 disl-1.134
 dis-1.026
growth-1.009
Token so
Token ID523
Feature activation+7.46e-02

Pos logit contributions
ap+0.453
ucks+0.438
 upon+0.427
apping+0.422
angles+0.407
Neg logit contributions
col-0.456
 gn-0.439
 disl-0.405
 dis-0.363
 rooft-0.355
Token proud
Token ID6613
Feature activation+5.47e-01

Pos logit contributions
ap+0.104
ucks+0.099
 upon+0.099
apping+0.097
angles+0.092
Neg logit contributions
col-0.120
 gn-0.115
 disl-0.108
 dis-0.096
riger-0.095
Token!
Token ID0
Feature activation+5.48e-01

Pos logit contributions
ap+0.513
ucks+0.474
 upon+0.464
apping+0.443
angles+0.435
Neg logit contributions
col-1.141
 gn-1.104
 disl-1.042
 dis-0.954
 rooft-0.944
Token cl
Token ID537
Feature activation+1.36e+00

Pos logit contributions
col+1.076
 gn+1.034
 disl+0.977
growth+0.891
riger+0.891
Neg logit contributions
ap-0.710
 upon-0.647
ucks-0.627
apping-0.622
angles-0.590
Tokenap
Token ID499
Feature activation+2.96e-01

Pos logit contributions
 upon+0.269
ucks+0.089
angles+0.085
ap+0.032
ooks+0.003
Neg logit contributions
col-3.820
 gn-3.651
 disl-3.625
riger-3.324
growth-3.315
Token the
Token ID262
Feature activation+2.77e-01

Pos logit contributions
ap+0.321
ucks+0.306
 upon+0.302
apping+0.293
angles+0.279
Neg logit contributions
col-0.559
 gn-0.542
 disl-0.513
 dis-0.464
growth-0.460
Token jar
Token ID17379
Feature activation+4.31e-01

Pos logit contributions
ap+0.379
ucks+0.364
 upon+0.361
apping+0.354
angles+0.339
Neg logit contributions
col-0.451
 gn-0.436
 disl-0.406
 dis-0.362
riger-0.361
Token.
Token ID13
Feature activation-2.04e-01

Pos logit contributions
ap+0.450
ucks+0.440
 upon+0.429
apping+0.407
angles+0.395
Neg logit contributions
col-0.828
 gn-0.800
 disl-0.743
 dis-0.681
riger-0.676
Token Cl
Token ID1012
Feature activation+3.01e+00

Pos logit contributions
col+0.376
 gn+0.365
 disl+0.346
riger+0.313
 dis+0.311
Neg logit contributions
ap-0.233
 upon-0.218
ucks-0.218
apping-0.213
aps-0.200
Tokenap
Token ID499
Feature activation+4.82e-01

Pos logit contributions
 upon+3.280
angles+3.023
 for+3.001
ucks+2.748
ies+2.672
Neg logit contributions
col-5.864
 gn-5.320
 disl-5.230
j-4.963
riger-4.851
Token,
Token ID11
Feature activation+1.29e-03

Pos logit contributions
ap+0.602
ucks+0.582
 upon+0.572
apping+0.557
angles+0.534
Neg logit contributions
col-0.830
 gn-0.795
 disl-0.754
 dis-0.675
riger-0.663
Token cl
Token ID537
Feature activation+1.14e+00

Pos logit contributions
ap+0.002
 upon+0.002
ucks+0.002
apping+0.002
angles+0.002
Neg logit contributions
col-0.002
 gn-0.002
 disl-0.002
riger-0.002
 dis-0.002
Tokenap
Token ID499
Feature activation+5.86e-01

Pos logit contributions
 upon+0.309
ucks+0.200
ap+0.196
angles+0.186
ooks+0.104
Neg logit contributions
col-3.089
 gn-2.964
 disl-2.939
riger-2.692
 CRE-2.662
Token,
Token ID11
Feature activation-2.47e-01

Pos logit contributions
ap+0.688
ucks+0.674
 upon+0.656
apping+0.636
angles+0.616
Neg logit contributions
col-1.050
 gn-1.010
 disl-0.956
 dis-0.857
riger-0.842
Token back
Token ID736
Feature activation-3.67e-01

Pos logit contributions
ap+0.347
ucks+0.328
 upon+0.324
apping+0.321
aps+0.303
Neg logit contributions
col-0.467
 gn-0.452
 disl-0.422
 dis-0.381
 rooft-0.379
Token and
Token ID290
Feature activation-7.34e-02

Pos logit contributions
col+0.696
 gn+0.679
 disl+0.653
riger+0.590
 dis+0.586
Neg logit contributions
ap-0.400
 upon-0.379
ucks-0.368
apping-0.361
aps-0.340
Token forth
Token ID6071
Feature activation-1.32e-02

Pos logit contributions
col+0.147
 gn+0.143
 disl+0.136
riger+0.124
 dis+0.124
Neg logit contributions
ap-0.072
 upon-0.067
ucks-0.067
apping-0.065
angles-0.060
Token and
Token ID290
Feature activation-4.22e-01

Pos logit contributions
col+0.023
 gn+0.022
 disl+0.021
riger+0.018
 dis+0.018
Neg logit contributions
ap-0.017
 upon-0.016
ucks-0.016
apping-0.016
angles-0.015
Token the
Token ID262
Feature activation+4.89e-01

Pos logit contributions
col+0.758
 gn+0.730
 disl+0.698
riger+0.626
growth+0.626
Neg logit contributions
ap-0.503
 upon-0.478
ucks-0.463
apping-0.458
angles-0.434
Token k
Token ID479
Feature activation+2.98e+00

Pos logit contributions
ap+0.683
ucks+0.661
 upon+0.645
apping+0.641
ooks+0.611
Neg logit contributions
col-0.778
 gn-0.755
 disl-0.696
 rooft-0.630
 dis-0.626
Tokenite
Token ID578
Feature activation+4.46e-01

Pos logit contributions
 upon+4.374
ooks+4.350
ap+4.256
angles+4.175
ucks+4.099
Neg logit contributions
col-4.791
 gn-4.198
 disl-3.846
 rooft-3.650
 pel-3.594
Token went
Token ID1816
Feature activation+6.38e-01

Pos logit contributions
ap+0.623
ucks+0.607
 upon+0.592
apping+0.584
ooks+0.559
Neg logit contributions
col-0.708
 gn-0.684
 disl-0.627
 dis-0.563
riger-0.556
Token up
Token ID510
Feature activation-1.30e-01

Pos logit contributions
ap+0.892
ucks+0.871
 upon+0.831
apping+0.830
angles+0.805
Neg logit contributions
col-1.031
 gn-0.998
 disl-0.899
 rooft-0.820
 dis-0.809
Token and
Token ID290
Feature activation-1.45e-01

Pos logit contributions
col+0.235
 gn+0.227
 disl+0.216
riger+0.194
 dis+0.193
Neg logit contributions
ap-0.156
 upon-0.147
ucks-0.145
apping-0.143
aps-0.134
Token down
Token ID866
Feature activation+7.68e-01

Pos logit contributions
col+0.262
 gn+0.254
 disl+0.241
riger+0.216
growth+0.216
Neg logit contributions
ap-0.173
 upon-0.163
ucks-0.161
apping-0.158
angles-0.149
Token the
Token ID262
Feature activation+4.16e-01

Pos logit contributions
ap+0.830
ucks+0.784
apping+0.784
 upon+0.781
aps+0.736
Neg logit contributions
col-0.981
 gn-0.948
 disl-0.877
 dis-0.806
growth-0.805
Token ball
Token ID2613
Feature activation+3.82e-01

Pos logit contributions
ap+0.533
ucks+0.505
 upon+0.500
apping+0.493
angles+0.469
Neg logit contributions
col-0.711
 gn-0.692
 disl-0.640
 dis-0.583
 rooft-0.583
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
ap+0.501
ucks+0.484
 upon+0.472
apping+0.466
ooks+0.444
Neg logit contributions
col-0.639
 gn-0.617
 disl-0.565
 dis-0.515
riger-0.509
Token Spot
Token ID15899
Feature activation+4.76e-02

Pos logit contributions
ap+0.017
 upon+0.016
ucks+0.016
apping+0.016
angles+0.015
Neg logit contributions
col-0.022
 gn-0.021
 disl-0.020
riger-0.018
 dis-0.018
Token and
Token ID290
Feature activation+2.01e-02

Pos logit contributions
ap+0.063
ucks+0.060
 upon+0.060
apping+0.058
angles+0.055
Neg logit contributions
col-0.079
 gn-0.077
 disl-0.072
 dis-0.064
riger-0.064
Token Fl
Token ID1610
Feature activation+2.97e+00

Pos logit contributions
ap+0.030
 upon+0.028
ucks+0.028
apping+0.028
angles+0.026
Neg logit contributions
col-0.031
 gn-0.030
 disl-0.028
riger-0.024
 dis-0.024
Tokenuffy
Token ID15352
Feature activation-8.65e-03

Pos logit contributions
 upon+3.673
angles+3.641
ucks+3.519
ooks+3.441
ap+3.240
Neg logit contributions
col-4.739
 gn-4.504
 disl-4.354
 pel-4.072
 dis-3.977
Token were
Token ID547
Feature activation+6.35e-01

Pos logit contributions
col+0.014
 gn+0.013
 disl+0.012
riger+0.011
 dis+0.011
Neg logit contributions
ap-0.012
 upon-0.012
ucks-0.012
apping-0.011
angles-0.011
Token very
Token ID845
Feature activation+2.57e-01

Pos logit contributions
ap+1.017
ucks+0.999
 upon+0.971
apping+0.946
angles+0.944
Neg logit contributions
col-0.863
 gn-0.847
 disl-0.767
 dis-0.707
 rooft-0.673
Token sad
Token ID6507
Feature activation+4.23e-01

Pos logit contributions
ap+0.396
ucks+0.380
 upon+0.378
apping+0.369
angles+0.361
Neg logit contributions
col-0.373
 gn-0.361
 disl-0.337
 dis-0.297
riger-0.290
Token.
Token ID13
Feature activation-4.34e-02

Pos logit contributions
ap+0.552
ucks+0.526
 upon+0.515
apping+0.505
angles+0.496
Neg logit contributions
col-0.710
 gn-0.690
 disl-0.642
 dis-0.583
 rooft-0.573
Token
Token ID198
Feature activation+1.34e+00

Pos logit contributions
ap+1.410
 upon+1.311
apping+1.310
ucks+1.306
angles+1.277
Neg logit contributions
col-1.256
 gn-1.204
 disl-1.127
 dis-0.978
growth-0.972
TokenThey
Token ID2990
Feature activation-5.44e-01

Pos logit contributions
ap+1.711
 upon+1.648
ucks+1.607
apping+1.583
angles+1.550
Neg logit contributions
col-2.294
 gn-2.231
 disl-2.119
riger-1.848
 CRE-1.847
Token look
Token ID804
Feature activation+3.54e-01

Pos logit contributions
col+1.002
 gn+0.966
 disl+0.931
riger+0.824
 dis+0.823
Neg logit contributions
ap-0.637
 upon-0.589
ucks-0.561
apping-0.554
angles-0.539
Token at
Token ID379
Feature activation+3.27e-01

Pos logit contributions
ap+0.410
ucks+0.389
 upon+0.380
apping+0.377
aps+0.354
Neg logit contributions
col-0.647
 gn-0.623
 disl-0.588
 dis-0.534
 rooft-0.530
Token the
Token ID262
Feature activation+2.02e+00

Pos logit contributions
ap+0.375
 upon+0.351
ucks+0.351
apping+0.349
aps+0.325
Neg logit contributions
col-0.601
 gn-0.579
 disl-0.553
growth-0.502
 dis-0.500
Token k
Token ID479
Feature activation+2.97e+00

Pos logit contributions
ucks+3.341
ap+3.311
apping+3.229
ooks+3.159
 upon+3.090
Neg logit contributions
col-2.684
 gn-2.649
 disl-2.348
 rooft-2.348
 dis-2.152
Tokenite
Token ID578
Feature activation+7.79e-01

Pos logit contributions
 upon+4.412
ooks+4.237
ap+4.138
angles+3.978
ucks+3.953
Neg logit contributions
col-4.529
 gn-3.884
 rooft-3.635
 pel-3.513
riger-3.497
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
ucks+0.999
ap+0.987
 upon+0.962
apping+0.923
ooks+0.912
Neg logit contributions
col-1.301
 gn-1.259
 disl-1.139
 rooft-1.045
 dis-1.037
Token They
Token ID1119
Feature activation+4.86e-03

Pos logit contributions
ap+0.227
 upon+0.214
ucks+0.212
apping+0.207
angles+0.196
Neg logit contributions
col-0.374
 gn-0.363
 disl-0.344
growth-0.311
 dis-0.310
Token see
Token ID766
Feature activation-1.95e-01

Pos logit contributions
ap+0.006
 upon+0.006
ucks+0.006
apping+0.006
angles+0.005
Neg logit contributions
col-0.008
 gn-0.008
 disl-0.008
riger-0.007
 dis-0.007
Token a
Token ID257
Feature activation+1.73e+00

Pos logit contributions
col+0.339
 gn+0.328
 disl+0.309
riger+0.279
 dis+0.275
Neg logit contributions
ap-0.247
 upon-0.232
ucks-0.231
apping-0.226
angles-0.214
Token and
Token ID290
Feature activation-3.51e-01

Pos logit contributions
col+1.687
 gn+1.626
 disl+1.574
riger+1.454
growth+1.451
Neg logit contributions
ap-0.839
 upon-0.793
apping-0.762
ucks-0.733
aps-0.701
Token smiled
Token ID13541
Feature activation+1.37e-01

Pos logit contributions
col+0.567
 gn+0.546
 disl+0.520
growth+0.461
riger+0.458
Neg logit contributions
ap-0.485
 upon-0.455
ucks-0.452
apping-0.448
angles-0.423
Token.
Token ID13
Feature activation+8.66e-02

Pos logit contributions
ap+0.168
ucks+0.159
 upon+0.158
apping+0.154
angles+0.148
Neg logit contributions
col-0.241
 gn-0.235
 disl-0.220
 dis-0.198
riger-0.197
Token She
Token ID1375
Feature activation-5.14e-01

Pos logit contributions
ap+0.092
 upon+0.085
ucks+0.085
apping+0.083
angles+0.078
Neg logit contributions
col-0.167
 gn-0.163
 disl-0.154
 dis-0.140
riger-0.139
Token held
Token ID2714
Feature activation+1.35e-01

Pos logit contributions
col+0.837
 gn+0.804
 disl+0.770
riger+0.688
growth+0.679
Neg logit contributions
ap-0.693
 upon-0.661
ucks-0.654
apping-0.638
angles-0.609
Token Qu
Token ID2264
Feature activation+2.96e+00

Pos logit contributions
ap+0.165
 upon+0.156
ucks+0.155
apping+0.153
angles+0.143
Neg logit contributions
col-0.237
 gn-0.229
 disl-0.215
 dis-0.193
riger-0.193
Tokenack
Token ID441
Feature activation-2.86e-01

Pos logit contributions
 upon+3.095
ucks+3.084
ooks+3.040
angles+2.997
 for+2.928
Neg logit contributions
col-5.253
 disl-4.832
 gn-4.645
j-4.163
riger-4.139
Token in
Token ID287
Feature activation+2.61e-02

Pos logit contributions
col+0.556
 gn+0.542
 disl+0.517
riger+0.470
 dis+0.467
Neg logit contributions
ap-0.301
 upon-0.280
ucks-0.276
apping-0.271
aps-0.252
Token her
Token ID607
Feature activation+3.24e-02

Pos logit contributions
ap+0.027
 upon+0.025
ucks+0.025
apping+0.025
angles+0.023
Neg logit contributions
col-0.051
 gn-0.050
 disl-0.047
 dis-0.043
riger-0.043
Token arms
Token ID5101
Feature activation+1.53e-01

Pos logit contributions
ap+0.033
ucks+0.031
 upon+0.031
apping+0.030
angles+0.028
Neg logit contributions
col-0.064
 gn-0.062
 disl-0.059
riger-0.053
 dis-0.053
Token and
Token ID290
Feature activation-9.04e-01

Pos logit contributions
ap+0.169
ucks+0.160
 upon+0.157
apping+0.153
angles+0.146
Neg logit contributions
col-0.286
 gn-0.277
 disl-0.261
 dis-0.237
growth-0.236
Token
Token ID198
Feature activation-7.33e-01

Pos logit contributions
ap+1.088
ucks+1.030
 upon+1.016
apping+1.004
angles+0.991
Neg logit contributions
col-1.046
 gn-0.998
 disl-0.933
 dis-0.817
growth-0.808
Token
Token ID198
Feature activation-8.51e-02

Pos logit contributions
col+1.266
 gn+1.221
 disl+1.162
riger+1.044
 rooft+1.025
Neg logit contributions
ap-0.924
 upon-0.908
ucks-0.881
apping-0.869
aps-0.835
TokenHe
Token ID1544
Feature activation+1.12e-01

Pos logit contributions
col+0.149
 gn+0.144
 disl+0.136
riger+0.122
 dis+0.122
Neg logit contributions
ap-0.106
 upon-0.100
ucks-0.100
apping-0.098
angles-0.092
Token flew
Token ID13112
Feature activation+5.21e-01

Pos logit contributions
ap+0.160
ucks+0.153
 upon+0.152
apping+0.149
aps+0.142
Neg logit contributions
col-0.176
 gn-0.170
 disl-0.158
 dis-0.140
riger-0.139
Token the
Token ID262
Feature activation+4.87e-01

Pos logit contributions
ap+0.766
ucks+0.738
apping+0.713
 upon+0.706
angles+0.684
Neg logit contributions
col-0.795
 gn-0.766
 disl-0.693
 rooft-0.621
 dis-0.618
Token k
Token ID479
Feature activation+2.96e+00

Pos logit contributions
ap+0.713
ucks+0.691
apping+0.669
 upon+0.667
ooks+0.639
Neg logit contributions
col-0.755
 gn-0.732
 disl-0.665
 rooft-0.608
riger-0.606
Tokenite
Token ID578
Feature activation+8.25e-01

Pos logit contributions
ooks+4.265
 upon+4.206
ap+4.186
angles+4.172
ucks+3.942
Neg logit contributions
col-4.676
 gn-4.111
 pel-3.558
riger-3.548
 rooft-3.548
Token for
Token ID329
Feature activation-6.30e-01

Pos logit contributions
ucks+1.208
ap+1.180
 upon+1.131
apping+1.115
ooks+1.105
Neg logit contributions
col-1.246
 gn-1.205
 disl-1.050
 rooft-0.962
growth-0.956
Token a
Token ID257
Feature activation-3.94e-01

Pos logit contributions
col+1.096
 gn+1.039
 disl+0.999
riger+0.898
 dis+0.868
Neg logit contributions
ap-0.792
 upon-0.750
ucks-0.738
apping-0.716
aps-0.691
Token long
Token ID890
Feature activation-1.04e+00

Pos logit contributions
col+0.638
 gn+0.612
 disl+0.582
growth+0.512
 dis+0.508
Neg logit contributions
ap-0.540
 upon-0.521
apping-0.499
ucks-0.489
angles-0.466
Token time
Token ID640
Feature activation-4.09e-01

Pos logit contributions
col+1.856
 gn+1.797
 disl+1.733
riger+1.532
 dis+1.515
Neg logit contributions
ap-1.294
 upon-1.204
apping-1.147
ucks-1.068
angles-1.067
Token As
Token ID1081
Feature activation+3.86e-02

Pos logit contributions
ap+0.575
ucks+0.546
 upon+0.540
apping+0.530
angles+0.515
Neg logit contributions
col-0.683
 gn-0.659
 disl-0.620
growth-0.550
 dis-0.549
Token the
Token ID262
Feature activation+1.55e-01

Pos logit contributions
ap+0.053
ucks+0.050
 upon+0.050
apping+0.049
angles+0.047
Neg logit contributions
col-0.062
 gn-0.060
 disl-0.057
 dis-0.050
riger-0.050
Token rabbit
Token ID22746
Feature activation-3.14e-01

Pos logit contributions
ap+0.212
ucks+0.202
 upon+0.201
apping+0.197
angles+0.188
Neg logit contributions
col-0.252
 gn-0.244
 disl-0.228
 dis-0.204
riger-0.203
Token and
Token ID290
Feature activation+5.70e-01

Pos logit contributions
col+0.509
 gn+0.489
 disl+0.466
riger+0.411
 dis+0.407
Neg logit contributions
ap-0.428
 upon-0.409
ucks-0.400
apping-0.394
aps-0.373
Token the
Token ID262
Feature activation+3.92e-01

Pos logit contributions
ap+0.750
ucks+0.715
apping+0.707
 upon+0.700
ooks+0.666
Neg logit contributions
col-0.955
 gn-0.943
 disl-0.874
 dis-0.783
growth-0.781
Token k
Token ID479
Feature activation+2.96e+00

Pos logit contributions
ap+0.545
ucks+0.522
 upon+0.516
apping+0.509
angles+0.487
Neg logit contributions
col-0.628
 gn-0.610
 disl-0.563
 dis-0.508
 rooft-0.508
Tokenite
Token ID578
Feature activation+5.05e-01

Pos logit contributions
ooks+4.215
angles+4.140
 upon+4.096
ap+4.048
ucks+3.951
Neg logit contributions
col-4.748
 gn-4.161
 pel-3.726
 disl-3.674
 rooft-3.674
Token said
Token ID531
Feature activation+1.07e-01

Pos logit contributions
ap+0.712
ucks+0.694
 upon+0.672
apping+0.671
ooks+0.644
Neg logit contributions
col-0.788
 gn-0.762
 disl-0.698
 dis-0.630
riger-0.621
Token goodbye
Token ID24829
Feature activation+1.30e-01

Pos logit contributions
ap+0.180
ucks+0.173
 upon+0.172
apping+0.169
angles+0.163
Neg logit contributions
col-0.140
 gn-0.134
 disl-0.123
 dis-0.106
riger-0.105
Token,
Token ID11
Feature activation+8.40e-01

Pos logit contributions
ap+0.161
ucks+0.153
 upon+0.152
apping+0.148
angles+0.141
Neg logit contributions
col-0.227
 gn-0.220
 disl-0.207
 dis-0.185
riger-0.185
Token the
Token ID262
Feature activation+3.51e-01

Pos logit contributions
ap+1.223
ucks+1.202
 upon+1.144
apping+1.140
angles+1.117
Neg logit contributions
col-1.286
 gn-1.264
 disl-1.144
 dis-1.053
 rooft-1.023
Token up
Token ID510
Feature activation-2.80e-01

Pos logit contributions
col+0.460
 gn+0.444
 disl+0.422
riger+0.376
 dis+0.373
Neg logit contributions
ap-0.355
 upon-0.338
ucks-0.337
apping-0.326
aps-0.311
Token on
Token ID319
Feature activation-2.05e-01

Pos logit contributions
col+0.509
 gn+0.491
 disl+0.474
riger+0.425
growth+0.421
Neg logit contributions
ap-0.329
 upon-0.315
ucks-0.306
apping-0.301
aps-0.285
Token the
Token ID262
Feature activation-8.80e-01

Pos logit contributions
col+0.360
 gn+0.346
 disl+0.331
riger+0.295
 dis+0.293
Neg logit contributions
ap-0.254
 upon-0.242
ucks-0.239
apping-0.232
aps-0.222
Token blanket
Token ID18447
Feature activation-8.04e-01

Pos logit contributions
col+1.393
 gn+1.337
 disl+1.326
riger+1.138
growth+1.120
Neg logit contributions
ap-1.197
 upon-1.189
ucks-1.136
apping-1.095
aps-1.066
Token.
Token ID13
Feature activation+6.50e-02

Pos logit contributions
col+1.515
 gn+1.477
 disl+1.462
riger+1.295
 dis+1.276
Neg logit contributions
ap-0.896
 upon-0.842
apping-0.818
ucks-0.783
aps-0.758
Token Fl
Token ID1610
Feature activation+2.95e+00

Pos logit contributions
ap+0.078
 upon+0.073
ucks+0.073
apping+0.072
angles+0.068
Neg logit contributions
col-0.116
 gn-0.113
 disl-0.107
riger-0.096
 dis-0.095
Tokenuffy
Token ID15352
Feature activation-2.98e-01

Pos logit contributions
angles+3.882
 upon+3.655
ucks+3.526
 yours+3.458
ooks+3.453
Neg logit contributions
 gn-4.569
col-4.511
 disl-4.181
riger-3.812
j-3.755
Token was
Token ID373
Feature activation+5.71e-02

Pos logit contributions
col+0.483
 gn+0.463
 disl+0.443
riger+0.390
growth+0.386
Neg logit contributions
ap-0.406
 upon-0.388
ucks-0.383
apping-0.375
aps-0.358
Token very
Token ID845
Feature activation-1.14e-01

Pos logit contributions
ap+0.088
ucks+0.084
 upon+0.083
apping+0.082
angles+0.079
Neg logit contributions
col-0.083
 gn-0.080
 disl-0.075
 dis-0.065
riger-0.065
Token happy
Token ID3772
Feature activation+5.67e-01

Pos logit contributions
col+0.195
 gn+0.188
 disl+0.177
growth+0.158
riger+0.158
Neg logit contributions
ap-0.149
 upon-0.141
ucks-0.140
apping-0.138
aps-0.131
Token in
Token ID287
Feature activation+4.89e-01

Pos logit contributions
ap+0.710
ucks+0.673
 upon+0.654
apping+0.643
angles+0.637
Neg logit contributions
col-0.991
 gn-0.974
 disl-0.898
 dis-0.810
 rooft-0.800
Token,
Token ID11
Feature activation+8.67e-01

Pos logit contributions
ap+0.180
 upon+0.170
ucks+0.169
apping+0.165
angles+0.156
Neg logit contributions
col-0.285
 gn-0.276
 disl-0.259
 dis-0.234
growth-0.233
Token their
Token ID511
Feature activation+4.52e-01

Pos logit contributions
ap+1.141
ucks+1.087
apping+1.058
 upon+1.032
angles+1.013
Neg logit contributions
col-1.468
 gn-1.443
 disl-1.343
 dis-1.225
 rooft-1.176
Token bike
Token ID7161
Feature activation+8.07e-01

Pos logit contributions
ap+0.596
ucks+0.572
 upon+0.564
apping+0.561
ooks+0.529
Neg logit contributions
col-0.749
 gn-0.727
 disl-0.686
 dis-0.611
 rooft-0.610
Token and
Token ID290
Feature activation-1.42e-01

Pos logit contributions
ap+1.156
ucks+1.126
 upon+1.111
apping+1.093
ooks+1.059
Neg logit contributions
col-1.230
 gn-1.220
 disl-1.084
 dis-0.971
riger-0.969
Token their
Token ID511
Feature activation+5.52e-01

Pos logit contributions
col+0.251
 gn+0.243
 disl+0.229
riger+0.206
 dis+0.205
Neg logit contributions
ap-0.176
 upon-0.165
ucks-0.164
apping-0.160
angles-0.152
Token k
Token ID479
Feature activation+2.95e+00

Pos logit contributions
ap+0.742
ucks+0.713
apping+0.702
 upon+0.702
ooks+0.662
Neg logit contributions
col-0.902
 gn-0.873
 disl-0.828
 dis-0.733
 rooft-0.730
Tokenite
Token ID578
Feature activation+9.47e-01

Pos logit contributions
 upon+3.981
ap+3.968
ooks+3.913
angles+3.788
apping+3.651
Neg logit contributions
col-4.825
 gn-4.334
 disl-3.987
 rooft-3.820
riger-3.677
Token.
Token ID13
Feature activation+3.07e-02

Pos logit contributions
ap+1.271
ucks+1.265
 upon+1.236
apping+1.211
ooks+1.188
Neg logit contributions
col-1.524
 gn-1.486
 disl-1.348
 dis-1.203
 rooft-1.194
Token They
Token ID1119
Feature activation-3.00e-01

Pos logit contributions
ap+0.030
 upon+0.028
ucks+0.027
apping+0.027
angles+0.025
Neg logit contributions
col-0.062
 gn-0.061
 disl-0.058
riger-0.052
 dis-0.052
Token are
Token ID389
Feature activation+4.73e-01

Pos logit contributions
col+0.556
 gn+0.537
 disl+0.512
riger+0.460
 dis+0.459
Neg logit contributions
ap-0.343
 upon-0.322
ucks-0.315
apping-0.306
angles-0.294
Token happy
Token ID3772
Feature activation+1.00e+00

Pos logit contributions
ap+0.656
ucks+0.634
 upon+0.619
apping+0.607
angles+0.584
Neg logit contributions
col-0.754
 gn-0.734
 disl-0.679
 dis-0.614
 rooft-0.598
Token to
Token ID284
Feature activation-7.19e-01

Pos logit contributions
col+0.320
 gn+0.310
 disl+0.295
riger+0.266
growth+0.265
Neg logit contributions
ap-0.196
 upon-0.184
ucks-0.183
apping-0.179
aps-0.168
Token run
Token ID1057
Feature activation-9.50e-01

Pos logit contributions
col+1.315
 gn+1.243
 disl+1.189
growth+1.077
riger+1.073
Neg logit contributions
ap-0.861
 upon-0.786
ucks-0.759
apping-0.741
angles-0.722
Token,
Token ID11
Feature activation-4.62e-01

Pos logit contributions
col+1.798
 gn+1.766
 disl+1.732
growth+1.533
 dis+1.525
Neg logit contributions
ap-1.052
 upon-0.997
ucks-0.948
apping-0.941
aps-0.885
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
col+0.792
 gn+0.757
 disl+0.728
growth+0.651
riger+0.642
Neg logit contributions
ap-0.593
 upon-0.566
apping-0.545
ucks-0.545
angles-0.512
Token the
Token ID262
Feature activation+9.96e-01

Pos logit contributions
col+1.864
 gn+1.765
 disl+1.738
growth+1.556
riger+1.533
Neg logit contributions
ap-1.302
 upon-1.245
apping-1.166
ucks-1.138
aps-1.108
Token k
Token ID479
Feature activation+2.95e+00

Pos logit contributions
ap+1.487
ucks+1.448
apping+1.411
 upon+1.389
ooks+1.368
Neg logit contributions
col-1.497
 gn-1.477
 disl-1.322
 rooft-1.244
 dis-1.231
Tokenite
Token ID578
Feature activation+6.91e-01

Pos logit contributions
 upon+4.405
ooks+4.335
ap+4.228
angles+4.183
ucks+3.848
Neg logit contributions
col-4.525
 gn-4.075
 rooft-3.652
 pel-3.650
 disl-3.588
Token went
Token ID1816
Feature activation+5.12e-01

Pos logit contributions
ap+0.971
ucks+0.954
 upon+0.928
apping+0.917
ooks+0.881
Neg logit contributions
col-1.094
 gn-1.062
 disl-0.953
 dis-0.866
riger-0.847
Token up
Token ID510
Feature activation-1.68e-01

Pos logit contributions
ap+0.708
ucks+0.679
apping+0.658
 upon+0.657
angles+0.627
Neg logit contributions
col-0.837
 gn-0.810
 disl-0.735
 rooft-0.669
riger-0.663
Token,
Token ID11
Feature activation+3.86e-01

Pos logit contributions
col+0.295
 gn+0.285
 disl+0.272
riger+0.243
 dis+0.241
Neg logit contributions
ap-0.208
 upon-0.197
ucks-0.194
apping-0.191
aps-0.180
Token up
Token ID510
Feature activation+5.97e-01

Pos logit contributions
ap+0.503
ucks+0.483
 upon+0.472
apping+0.467
ooks+0.444
Neg logit contributions
col-0.647
 gn-0.630
 disl-0.583
 dis-0.524
riger-0.522
Token outside
Token ID2354
Feature activation+8.87e-02

Pos logit contributions
col+0.021
 gn+0.020
 disl+0.019
riger+0.017
 dis+0.017
Neg logit contributions
ap-0.019
 upon-0.018
ucks-0.018
apping-0.017
angles-0.016
Token.
Token ID13
Feature activation-7.62e-04

Pos logit contributions
ap+0.107
ucks+0.101
 upon+0.100
apping+0.098
angles+0.093
Neg logit contributions
col-0.158
 gn-0.153
 disl-0.144
 dis-0.130
riger-0.130
Token Soon
Token ID15894
Feature activation-2.11e-01

Pos logit contributions
col+0.001
 gn+0.001
 disl+0.001
riger+0.001
 dis+0.001
Neg logit contributions
ap-0.001
 upon-0.001
ucks-0.001
apping-0.001
angles-0.001
Token,
Token ID11
Feature activation+6.60e-02

Pos logit contributions
col+0.330
 gn+0.318
 disl+0.300
riger+0.262
growth+0.261
Neg logit contributions
ap-0.303
 upon-0.287
ucks-0.284
apping-0.280
angles-0.267
Token the
Token ID262
Feature activation+9.50e-01

Pos logit contributions
ap+0.088
ucks+0.083
 upon+0.083
apping+0.081
aps+0.077
Neg logit contributions
col-0.110
 gn-0.106
 disl-0.100
riger-0.089
 dis-0.089
Token k
Token ID479
Feature activation+2.94e+00

Pos logit contributions
ap+1.468
ucks+1.442
apping+1.406
 upon+1.389
ooks+1.364
Neg logit contributions
col-1.359
 gn-1.349
 disl-1.193
 rooft-1.110
 dis-1.099
Tokenite
Token ID578
Feature activation+1.09e+00

Pos logit contributions
ap+4.215
 upon+4.050
angles+3.963
ooks+3.829
ags+3.656
Neg logit contributions
col-5.035
 gn-4.712
riger-3.928
 rooft-3.875
 pel-3.844
Token was
Token ID373
Feature activation+4.82e-01

Pos logit contributions
ap+1.633
ucks+1.630
apping+1.540
ooks+1.534
 upon+1.533
Neg logit contributions
col-1.644
 gn-1.573
 disl-1.374
 dis-1.271
 pel-1.248
Token high
Token ID1029
Feature activation-4.61e-01

Pos logit contributions
ap+0.650
ucks+0.632
 upon+0.606
apping+0.601
angles+0.577
Neg logit contributions
col-0.790
 gn-0.761
 disl-0.703
 dis-0.642
 rooft-0.632
Token in
Token ID287
Feature activation-3.98e-01

Pos logit contributions
col+0.858
 gn+0.828
 disl+0.807
riger+0.718
 dis+0.715
Neg logit contributions
ap-0.521
 upon-0.489
ucks-0.478
apping-0.470
angles-0.443
Token the
Token ID262
Feature activation+5.93e-01

Pos logit contributions
col+0.704
 gn+0.685
 disl+0.654
riger+0.588
 dis+0.576
Neg logit contributions
ap-0.481
 upon-0.459
ucks-0.451
apping-0.436
angles-0.417
TokenAfter
Token ID3260
Feature activation-4.15e-01

Pos logit contributions
col+0.318
 gn+0.309
 disl+0.290
riger+0.260
growth+0.258
Neg logit contributions
ap-0.232
 upon-0.220
ucks-0.218
apping-0.214
aps-0.201
Token a
Token ID257
Feature activation-3.44e-01

Pos logit contributions
col+0.734
 gn+0.697
 disl+0.671
riger+0.599
 rooft+0.588
Neg logit contributions
ap-0.511
 upon-0.485
ucks-0.470
apping-0.468
aps-0.442
Token while
Token ID981
Feature activation+1.36e-01

Pos logit contributions
col+0.626
 gn+0.600
 disl+0.575
growth+0.514
riger+0.509
Neg logit contributions
ap-0.405
 upon-0.386
apping-0.369
ucks-0.366
angles-0.343
Token,
Token ID11
Feature activation+2.18e-01

Pos logit contributions
ap+0.207
ucks+0.199
 upon+0.197
apping+0.194
angles+0.186
Neg logit contributions
col-0.201
 gn-0.194
 disl-0.179
 dis-0.158
riger-0.157
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
ap+0.310
ucks+0.297
 upon+0.291
apping+0.289
aps+0.276
Neg logit contributions
col-0.340
 gn-0.328
 disl-0.307
 dis-0.274
growth-0.272
Token k
Token ID479
Feature activation+2.94e+00

Pos logit contributions
ap+1.536
ucks+1.513
 upon+1.486
apping+1.461
ooks+1.459
Neg logit contributions
col-1.570
 gn-1.536
 disl-1.369
 rooft-1.317
 dis-1.295
Tokenite
Token ID578
Feature activation+8.55e-01

Pos logit contributions
ap+4.325
ooks+4.325
 upon+4.325
angles+4.300
ucks+3.997
Neg logit contributions
col-4.580
 gn-4.189
 rooft-3.689
 pel-3.664
riger-3.603
Token flew
Token ID13112
Feature activation+1.04e+00

Pos logit contributions
ap+1.257
ucks+1.245
apping+1.186
 upon+1.173
ooks+1.160
Neg logit contributions
col-1.306
 gn-1.267
 disl-1.119
 dis-1.033
growth-1.016
Token away
Token ID1497
Feature activation+3.01e-01

Pos logit contributions
ap+1.602
ucks+1.576
apping+1.500
aps+1.454
ooks+1.451
Neg logit contributions
col-1.560
 gn-1.463
 disl-1.266
 rooft-1.211
 dis-1.171
Token.
Token ID13
Feature activation+3.77e-01

Pos logit contributions
ap+0.384
ucks+0.367
 upon+0.357
apping+0.352
angles+0.342
Neg logit contributions
col-0.517
 gn-0.503
 disl-0.464
 dis-0.422
 rooft-0.416
Token John
Token ID1757
Feature activation+1.25e-01

Pos logit contributions
ap+0.547
ucks+0.519
 upon+0.515
apping+0.509
angles+0.486
Neg logit contributions
col-0.583
 gn-0.561
 disl-0.531
 dis-0.471
growth-0.468
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
ap+0.526
ucks+0.517
 upon+0.499
apping+0.487
angles+0.479
Neg logit contributions
col-0.706
 gn-0.683
 disl-0.637
 dis-0.577
riger-0.571
Token and
Token ID290
Feature activation+1.18e-01

Pos logit contributions
ap+1.624
ucks+1.573
apping+1.504
ooks+1.469
angles+1.445
Neg logit contributions
col-2.094
 gn-2.059
 disl-1.896
 dis-1.752
 rooft-1.680
Token even
Token ID772
Feature activation-5.35e-01

Pos logit contributions
ap+0.162
ucks+0.154
 upon+0.153
apping+0.150
aps+0.143
Neg logit contributions
col-0.190
 gn-0.183
 disl-0.172
 dis-0.152
riger-0.151
Token her
Token ID607
Feature activation-1.53e-01

Pos logit contributions
col+0.929
 gn+0.895
 disl+0.853
riger+0.771
growth+0.765
Neg logit contributions
ap-0.673
 upon-0.643
ucks-0.620
apping-0.616
angles-0.579
Token little
Token ID1310
Feature activation+1.05e+00

Pos logit contributions
col+0.309
 gn+0.299
 disl+0.285
riger+0.260
growth+0.259
Neg logit contributions
ap-0.149
 upon-0.138
ucks-0.136
apping-0.134
aps-0.124
Token k
Token ID479
Feature activation+2.93e+00

Pos logit contributions
ucks+1.059
ap+1.056
 upon+1.026
angles+1.004
apping+0.990
Neg logit contributions
col-2.014
 gn-2.006
 disl-1.856
 rooft-1.696
 dis-1.689
Tokenitty
Token ID9760
Feature activation+9.30e-01

Pos logit contributions
 upon+3.986
angles+3.963
ap+3.931
ooks+3.891
ucks+3.684
Neg logit contributions
col-4.966
 gn-4.472
 disl-3.953
 rooft-3.670
j-3.665
Token.
Token ID13
Feature activation+4.64e-01

Pos logit contributions
ucks+1.252
ap+1.244
 upon+1.205
apping+1.185
angles+1.164
Neg logit contributions
col-1.496
 gn-1.465
 disl-1.353
 dis-1.206
 rooft-1.171
Token Everywhere
Token ID48201
Feature activation+5.96e-01

Pos logit contributions
ap+0.536
 upon+0.503
ucks+0.499
apping+0.488
angles+0.468
Neg logit contributions
col-0.853
 gn-0.822
 disl-0.784
growth-0.705
 rooft-0.698
Token she
Token ID673
Feature activation-6.26e-01

Pos logit contributions
ap+0.882
ucks+0.861
apping+0.834
 upon+0.825
aps+0.799
Neg logit contributions
col-0.894
 gn-0.864
 disl-0.806
 dis-0.718
riger-0.704
Token went
Token ID1816
Feature activation+3.27e-01

Pos logit contributions
col+0.995
 gn+0.954
 disl+0.919
riger+0.804
 dis+0.792
Neg logit contributions
ap-0.870
 upon-0.829
ucks-0.818
apping-0.793
angles-0.765
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
col+2.488
 gn+2.364
 disl+2.302
riger+2.132
 CRE+2.090
Neg logit contributions
ap-1.273
 upon-1.228
ucks-1.168
apping-1.074
aps-1.062
Token night
Token ID1755
Feature activation-9.36e-01

Pos logit contributions
col+4.103
 disl+3.932
 gn+3.812
riger+3.555
 CRE+3.494
Neg logit contributions
 upon-1.661
ap-1.590
ucks-1.319
 friend-1.299
apping-1.270
Token?"
Token ID1701
Feature activation+3.94e-01

Pos logit contributions
col+1.676
 gn+1.596
 disl+1.555
riger+1.389
 dis+1.385
Neg logit contributions
ap-1.159
 upon-1.118
apping-1.060
ucks-1.044
aps-1.005
Token
Token ID198
Feature activation+3.67e-02

Pos logit contributions
ap+0.579
ucks+0.556
 upon+0.553
apping+0.545
angles+0.527
Neg logit contributions
col-0.601
 gn-0.585
 disl-0.547
growth-0.480
 dis-0.478
Token
Token ID198
Feature activation+1.88e-02

Pos logit contributions
ap+0.052
 upon+0.049
ucks+0.049
apping+0.048
angles+0.046
Neg logit contributions
col-0.058
 gn-0.056
 disl-0.053
riger-0.046
 dis-0.046
TokenThe
Token ID464
Feature activation-3.11e+00

Pos logit contributions
ap+0.025
 upon+0.023
ucks+0.023
apping+0.023
angles+0.022
Neg logit contributions
col-0.032
 gn-0.031
 disl-0.029
riger-0.026
 dis-0.026
Token rabbit
Token ID22746
Feature activation+2.27e-01

Pos logit contributions
col+6.435
 disl+5.865
 CRE+5.801
riger+5.692
growth+5.681
Neg logit contributions
 friend-3.594
 upon-3.506
 friends-3.288
ucks-2.986
ap-2.936
Token said
Token ID531
Feature activation-3.90e-01

Pos logit contributions
ap+0.315
ucks+0.302
 upon+0.296
apping+0.294
angles+0.280
Neg logit contributions
col-0.363
 gn-0.354
 disl-0.329
 dis-0.292
riger-0.289
Token,
Token ID11
Feature activation-2.08e-01

Pos logit contributions
col+0.617
 gn+0.587
 disl+0.556
growth+0.496
riger+0.493
Neg logit contributions
ap-0.564
 upon-0.531
apping-0.523
ucks-0.516
aps-0.493
Token "
Token ID366
Feature activation-1.01e+00

Pos logit contributions
col+0.396
 gn+0.382
 disl+0.364
riger+0.329
growth+0.329
Neg logit contributions
ap-0.227
 upon-0.214
ucks-0.209
apping-0.207
aps-0.192
TokenI
Token ID40
Feature activation-4.54e-01

Pos logit contributions
col+2.008
 gn+1.958
 disl+1.863
riger+1.756
 dis+1.720
Neg logit contributions
 upon-0.965
ap-0.951
ucks-0.927
apping-0.887
aps-0.822
Token for
Token ID329
Feature activation+1.07e+00

Pos logit contributions
ap+1.073
ucks+1.048
 upon+1.012
apping+1.007
angles+0.971
Neg logit contributions
col-1.277
 gn-1.232
 disl-1.121
riger-1.009
 dis-1.005
Token me
Token ID502
Feature activation+1.17e-01

Pos logit contributions
ucks+1.040
ap+1.037
 upon+0.996
apping+0.985
aps+0.905
Neg logit contributions
col-2.133
 gn-2.069
 disl-1.963
 dis-1.857
riger-1.796
Token!"
Token ID2474
Feature activation+2.89e-01

Pos logit contributions
ap+0.162
ucks+0.155
 upon+0.154
apping+0.151
angles+0.145
Neg logit contributions
col-0.186
 gn-0.180
 disl-0.168
riger-0.149
 dis-0.149
Token
Token ID198
Feature activation-2.83e-01

Pos logit contributions
ap+0.484
ucks+0.464
 upon+0.463
apping+0.454
angles+0.441
Neg logit contributions
col-0.382
 gn-0.362
 disl-0.334
growth-0.286
 dis-0.285
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
col+0.468
 gn+0.454
 disl+0.427
riger+0.382
growth+0.377
Neg logit contributions
ap-0.380
ucks-0.360
 upon-0.359
apping-0.353
aps-0.337
TokenThe
Token ID464
Feature activation-3.10e+00

Pos logit contributions
ap+0.303
 upon+0.288
ucks+0.287
apping+0.279
angles+0.268
Neg logit contributions
col-0.378
 gn-0.364
 disl-0.342
 dis-0.305
riger-0.303
Token surprise
Token ID5975
Feature activation+1.12e+00

Pos logit contributions
col+6.696
 disl+6.198
 gn+5.989
keeping+5.783
growth+5.646
Neg logit contributions
 upon-3.362
 friend-3.116
ap-3.056
 friends-2.908
apping-2.679
Token turned
Token ID2900
Feature activation+2.41e-01

Pos logit contributions
ap+1.658
ucks+1.626
 upon+1.562
apping+1.548
ooks+1.497
Neg logit contributions
col-1.735
 gn-1.687
 disl-1.450
 dis-1.370
growth-1.298
Token out
Token ID503
Feature activation+3.53e-01

Pos logit contributions
ap+0.328
ucks+0.313
 upon+0.308
apping+0.303
angles+0.290
Neg logit contributions
col-0.394
 gn-0.379
 disl-0.352
 dis-0.316
riger-0.314
Token to
Token ID284
Feature activation+8.72e-01

Pos logit contributions
ap+0.446
ucks+0.427
 upon+0.417
apping+0.409
angles+0.397
Neg logit contributions
col-0.612
 gn-0.593
 disl-0.548
 dis-0.500
 rooft-0.493
Token be
Token ID307
Feature activation-1.76e-01

Pos logit contributions
ap+1.153
ucks+1.118
apping+1.115
 upon+1.077
aps+1.060
Neg logit contributions
col-1.459
 gn-1.414
 disl-1.308
 dis-1.208
growth-1.205
Token a
Token ID257
Feature activation+2.42e-01

Pos logit contributions
col+0.213
 gn+0.206
 disl+0.195
riger+0.174
 dis+0.173
Neg logit contributions
ap-0.155
 upon-0.147
ucks-0.146
apping-0.143
aps-0.135
Token jacket
Token ID15224
Feature activation-1.17e-01

Pos logit contributions
ap+0.357
ucks+0.344
 upon+0.338
apping+0.334
angles+0.322
Neg logit contributions
col-0.364
 gn-0.354
 disl-0.327
 dis-0.287
riger-0.287
Token.
Token ID13
Feature activation-3.35e-01

Pos logit contributions
col+0.207
 gn+0.200
 disl+0.190
riger+0.170
 dis+0.170
Neg logit contributions
ap-0.141
 upon-0.133
ucks-0.132
apping-0.130
aps-0.122
Token
Token ID198
Feature activation-4.90e-01

Pos logit contributions
col+0.566
 gn+0.541
 disl+0.514
riger+0.460
 dis+0.456
Neg logit contributions
ap-0.434
 upon-0.418
ucks-0.413
apping-0.403
aps-0.384
Token
Token ID198
Feature activation-7.01e-01

Pos logit contributions
col+0.827
 gn+0.802
 disl+0.765
riger+0.677
 dis+0.670
Neg logit contributions
ap-0.631
 upon-0.614
ucks-0.603
apping-0.587
aps-0.564
TokenThe
Token ID464
Feature activation-3.09e+00

Pos logit contributions
col+1.298
 gn+1.246
 disl+1.198
riger+1.073
 rooft+1.069
Neg logit contributions
ap-0.785
 upon-0.757
ucks-0.737
apping-0.725
aps-0.677
Token p
Token ID279
Feature activation+4.11e-01

Pos logit contributions
col+6.257
 disl+5.867
 CRE+5.675
riger+5.546
growth+5.533
Neg logit contributions
 upon-3.678
 friend-3.660
 friends-3.234
ucks-3.087
ap-2.960
Tokenanda
Token ID5282
Feature activation-5.88e-02

Pos logit contributions
ap+0.585
ucks+0.550
 upon+0.548
apping+0.538
angles+0.519
Neg logit contributions
col-0.652
 gn-0.627
 disl-0.584
riger-0.508
 dis-0.508
Token asked
Token ID1965
Feature activation+3.44e-01

Pos logit contributions
col+0.091
 gn+0.087
 disl+0.082
riger+0.072
 dis+0.072
Neg logit contributions
ap-0.085
 upon-0.081
ucks-0.080
apping-0.079
angles-0.075
Token,
Token ID11
Feature activation-2.46e-02

Pos logit contributions
ap+0.480
ucks+0.460
apping+0.451
 upon+0.450
angles+0.428
Neg logit contributions
col-0.548
 gn-0.538
 disl-0.497
 dis-0.446
growth-0.439
Token "
Token ID366
Feature activation-1.28e+00

Pos logit contributions
col+0.045
 gn+0.044
 disl+0.042
riger+0.038
 dis+0.038
Neg logit contributions
ap-0.028
 upon-0.026
ucks-0.026
apping-0.026
angles-0.024
Token go
Token ID467
Feature activation+4.84e-01

Pos logit contributions
col+1.678
 gn+1.609
 disl+1.548
riger+1.407
 dis+1.400
Neg logit contributions
ap-0.887
 upon-0.782
ucks-0.740
apping-0.723
angles-0.695
Token see
Token ID766
Feature activation+1.87e-01

Pos logit contributions
ap+0.566
ucks+0.534
apping+0.530
 upon+0.522
angles+0.502
Neg logit contributions
col-0.874
 gn-0.855
 disl-0.786
riger-0.725
 dis-0.715
Token some
Token ID617
Feature activation+2.46e-01

Pos logit contributions
ap+0.226
ucks+0.211
 upon+0.210
apping+0.208
aps+0.195
Neg logit contributions
col-0.333
 gn-0.322
 disl-0.304
 dis-0.273
riger-0.273
Token birds
Token ID10087
Feature activation+8.60e-02

Pos logit contributions
ap+0.340
ucks+0.323
 upon+0.322
apping+0.317
angles+0.301
Neg logit contributions
col-0.396
 gn-0.382
 disl-0.355
 dis-0.316
 rooft-0.314
Token!"
Token ID2474
Feature activation+2.28e-01

Pos logit contributions
ap+0.104
ucks+0.099
 upon+0.098
apping+0.096
angles+0.091
Neg logit contributions
col-0.152
 gn-0.147
 disl-0.139
riger-0.125
 dis-0.124
Token The
Token ID383
Feature activation-3.08e+00

Pos logit contributions
ap+0.325
 upon+0.311
ucks+0.310
apping+0.304
angles+0.291
Neg logit contributions
col-0.358
 gn-0.344
 disl-0.324
growth-0.284
 dis-0.284
Token little
Token ID1310
Feature activation-9.22e-01

Pos logit contributions
col+6.919
 disl+6.440
riger+6.038
 gn+6.023
 CRE+6.021
Neg logit contributions
 upon-3.145
 friend-3.065
 eyes-2.612
ap-2.598
 friends-2.547
Token orange
Token ID10912
Feature activation+5.86e-01

Pos logit contributions
col+1.938
 gn+1.819
 disl+1.813
riger+1.657
growth+1.647
Neg logit contributions
ap-0.840
 upon-0.821
ucks-0.774
apping-0.721
aps-0.668
Token bear
Token ID6842
Feature activation-8.45e-02

Pos logit contributions
ap+0.828
ucks+0.804
 upon+0.782
apping+0.775
ooks+0.749
Neg logit contributions
col-0.921
 gn-0.896
 disl-0.813
 dis-0.726
growth-0.722
Token was
Token ID373
Feature activation-2.15e-01

Pos logit contributions
col+0.135
 gn+0.130
 disl+0.122
riger+0.108
 dis+0.108
Neg logit contributions
ap-0.117
 upon-0.112
ucks-0.111
apping-0.109
angles-0.103
Token excited
Token ID6568
Feature activation+6.42e-01

Pos logit contributions
col+0.306
 gn+0.292
 disl+0.274
growth+0.238
riger+0.238
Neg logit contributions
ap-0.341
 upon-0.325
ucks-0.322
apping-0.320
aps-0.305
Token too
Token ID1165
Feature activation-7.84e-02

Pos logit contributions
col+0.782
 gn+0.761
 disl+0.725
growth+0.666
 rooft+0.655
Neg logit contributions
ap-0.414
 upon-0.386
ucks-0.373
apping-0.370
aps-0.342
Token?"
Token ID1701
Feature activation-2.64e-01

Pos logit contributions
col+0.132
 gn+0.128
 disl+0.121
riger+0.107
growth+0.107
Neg logit contributions
ap-0.102
 upon-0.097
ucks-0.096
apping-0.094
aps-0.089
Token
Token ID220
Feature activation-1.63e-01

Pos logit contributions
col+0.391
 gn+0.377
 disl+0.351
riger+0.310
 dis+0.307
Neg logit contributions
ap-0.398
 upon-0.377
ucks-0.375
apping-0.371
aps-0.354
Token
Token ID198
Feature activation-4.61e-01

Pos logit contributions
col+0.263
 gn+0.255
 disl+0.239
riger+0.212
 dis+0.211
Neg logit contributions
ap-0.225
 upon-0.214
ucks-0.213
apping-0.209
aps-0.199
Token
Token ID198
Feature activation-2.38e-01

Pos logit contributions
col+0.764
 gn+0.744
 disl+0.701
riger+0.629
growth+0.620
Neg logit contributions
ap-0.615
 upon-0.580
ucks-0.578
apping-0.569
aps-0.543
TokenThe
Token ID464
Feature activation-3.02e+00

Pos logit contributions
col+0.401
 gn+0.389
 disl+0.366
riger+0.327
growth+0.326
Neg logit contributions
ap-0.313
 upon-0.294
ucks-0.293
apping-0.290
aps-0.273
Token other
Token ID584
Feature activation-3.83e-01

Pos logit contributions
col+6.207
 disl+5.769
 gn+5.735
growth+5.561
riger+5.507
Neg logit contributions
 upon-3.051
 friend-3.025
ap-3.023
apping-2.591
 friends-2.563
Token child
Token ID1200
Feature activation+2.67e-01

Pos logit contributions
col+0.679
 gn+0.661
 disl+0.629
riger+0.567
growth+0.565
Neg logit contributions
ap-0.469
 upon-0.444
ucks-0.436
apping-0.427
angles-0.401
Token was
Token ID373
Feature activation-3.34e-01

Pos logit contributions
ap+0.372
ucks+0.360
 upon+0.354
apping+0.348
aps+0.332
Neg logit contributions
col-0.423
 gn-0.408
 disl-0.380
 dis-0.339
riger-0.334
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
col+0.527
 gn+0.504
 disl+0.476
riger+0.425
growth+0.423
Neg logit contributions
ap-0.478
 upon-0.452
apping-0.445
ucks-0.444
aps-0.422
Token bit
Token ID1643
Feature activation-8.88e-01

Pos logit contributions
col+1.987
 gn+1.807
 disl+1.770
growth+1.613
keeping+1.582
Neg logit contributions
ap-1.835
 upon-1.754
apping-1.678
ucks-1.554
 friend-1.540
Token shouted
Token ID17293
Feature activation-1.16e+00

Pos logit contributions
col+0.696
 gn+0.666
 disl+0.629
riger+0.557
growth+0.551
Neg logit contributions
ap-0.625
 upon-0.602
ucks-0.590
apping-0.583
aps-0.556
Token Bob
Token ID5811
Feature activation-3.12e-01

Pos logit contributions
col+2.009
 gn+1.901
 disl+1.848
riger+1.643
growth+1.625
Neg logit contributions
ap-1.485
 upon-1.385
apping-1.352
ucks-1.327
aps-1.267
Token.
Token ID13
Feature activation-9.42e-02

Pos logit contributions
col+0.556
 gn+0.532
 disl+0.511
growth+0.458
riger+0.457
Neg logit contributions
ap-0.378
 upon-0.361
ucks-0.350
apping-0.349
aps-0.328
Token
Token ID198
Feature activation-8.46e-01

Pos logit contributions
col+0.155
 gn+0.150
 disl+0.141
riger+0.125
growth+0.125
Neg logit contributions
ap-0.126
 upon-0.120
ucks-0.119
apping-0.117
aps-0.111
Token
Token ID198
Feature activation-7.35e-01

Pos logit contributions
col+1.480
 gn+1.441
 disl+1.378
riger+1.221
growth+1.207
Neg logit contributions
ap-1.048
 upon-1.013
ucks-0.997
apping-0.974
aps-0.933
TokenThe
Token ID464
Feature activation-2.99e+00

Pos logit contributions
col+1.286
 gn+1.241
 disl+1.176
growth+1.051
riger+1.043
Neg logit contributions
ap-0.912
 upon-0.864
ucks-0.856
apping-0.843
aps-0.790
Token elf
Token ID23878
Feature activation+5.30e-01

Pos logit contributions
col+5.274
 disl+5.216
 CRE+4.656
 gn+4.631
riger+4.617
Neg logit contributions
 upon-3.836
 friend-3.750
 friends-3.531
 eyes-3.474
ap-3.436
Token said
Token ID531
Feature activation-4.96e-01

Pos logit contributions
ap+0.778
ucks+0.756
 upon+0.731
apping+0.724
angles+0.707
Neg logit contributions
col-0.814
 gn-0.797
 disl-0.707
 dis-0.648
growth-0.631
Token,
Token ID11
Feature activation-3.22e-01

Pos logit contributions
col+0.792
 gn+0.753
 disl+0.716
growth+0.642
riger+0.637
Neg logit contributions
ap-0.705
 upon-0.666
apping-0.652
ucks-0.645
aps-0.616
Token "
Token ID366
Feature activation-4.73e-01

Pos logit contributions
col+0.619
 gn+0.594
 disl+0.569
growth+0.517
riger+0.516
Neg logit contributions
ap-0.345
 upon-0.326
ucks-0.317
apping-0.315
aps-0.292
TokenG
Token ID38
Feature activation+4.93e-01

Pos logit contributions
col+0.805
 gn+0.783
 disl+0.739
riger+0.667
 dis+0.660
Neg logit contributions
ap-0.588
 upon-0.577
ucks-0.570
apping-0.548
aps-0.519
Token over
Token ID625
Feature activation-3.53e-01

Pos logit contributions
col+1.168
 gn+1.137
 disl+1.095
riger+0.975
 dis+0.967
Neg logit contributions
ap-0.789
 upon-0.758
ucks-0.723
apping-0.714
angles-0.686
Token there
Token ID612
Feature activation+9.94e-02

Pos logit contributions
col+0.599
 gn+0.585
 disl+0.553
riger+0.494
growth+0.486
Neg logit contributions
ap-0.454
 upon-0.436
ucks-0.425
apping-0.417
angles-0.398
Token!"
Token ID2474
Feature activation+1.50e-01

Pos logit contributions
ap+0.139
ucks+0.133
 upon+0.132
apping+0.129
angles+0.123
Neg logit contributions
col-0.157
 gn-0.151
 disl-0.141
riger-0.125
 dis-0.125
Token
Token ID198
Feature activation-1.69e-01

Pos logit contributions
ap+0.254
ucks+0.244
 upon+0.244
apping+0.240
angles+0.232
Neg logit contributions
col-0.194
 gn-0.185
 disl-0.170
 dis-0.145
growth-0.145
Token
Token ID198
Feature activation+4.19e-01

Pos logit contributions
col+0.274
 gn+0.265
 disl+0.249
riger+0.221
 dis+0.219
Neg logit contributions
ap-0.232
 upon-0.220
ucks-0.220
apping-0.216
aps-0.205
TokenThe
Token ID464
Feature activation-2.95e+00

Pos logit contributions
ap+0.580
 upon+0.555
ucks+0.553
apping+0.534
angles+0.521
Neg logit contributions
col-0.682
 gn-0.654
 disl-0.615
 dis-0.547
riger-0.541
Token two
Token ID734
Feature activation+8.79e-01

Pos logit contributions
col+6.320
 disl+5.828
 gn+5.646
keeping+5.379
growth+5.308
Neg logit contributions
 upon-3.261
 friend-2.968
ap-2.958
 friends-2.734
apping-2.624
Token children
Token ID1751
Feature activation+6.44e-01

Pos logit contributions
ap+0.914
ucks+0.905
 upon+0.849
apping+0.824
ooks+0.804
Neg logit contributions
col-1.695
 gn-1.643
 disl-1.511
 dis-1.449
riger-1.402
Token carefully
Token ID7773
Feature activation-4.19e-01

Pos logit contributions
ap+0.916
ucks+0.915
 upon+0.874
apping+0.858
ooks+0.834
Neg logit contributions
col-1.005
 gn-0.956
 disl-0.865
 dis-0.802
growth-0.776
Token climbed
Token ID19952
Feature activation+6.22e-01

Pos logit contributions
col+0.666
 gn+0.641
 disl+0.619
riger+0.544
 rooft+0.535
Neg logit contributions
ap-0.576
 upon-0.547
ucks-0.539
apping-0.530
angles-0.508
Token down
Token ID866
Feature activation-2.51e-01

Pos logit contributions
ap+0.888
ucks+0.847
apping+0.835
 upon+0.818
ooks+0.784
Neg logit contributions
col-0.984
 gn-0.949
 disl-0.843
 rooft-0.792
growth-0.776
Token scared
Token ID12008
Feature activation-5.50e-01

Pos logit contributions
col+0.754
 gn+0.711
 disl+0.679
growth+0.611
riger+0.595
Neg logit contributions
ap-0.650
 upon-0.624
apping-0.603
ucks-0.598
aps-0.570
Token."
Token ID526
Feature activation+5.21e-01

Pos logit contributions
col+1.001
 gn+0.972
 disl+0.928
growth+0.840
riger+0.834
Neg logit contributions
ap-0.656
 upon-0.624
apping-0.608
ucks-0.596
aps-0.560
Token
Token ID220
Feature activation+2.54e-01

Pos logit contributions
ap+0.814
ucks+0.783
 upon+0.778
apping+0.763
angles+0.746
Neg logit contributions
col-0.743
 gn-0.710
 disl-0.656
growth-0.571
 dis-0.570
Token
Token ID198
Feature activation-3.26e-02

Pos logit contributions
ap+0.364
ucks+0.345
 upon+0.344
apping+0.338
angles+0.325
Neg logit contributions
col-0.390
 gn-0.376
 disl-0.352
 dis-0.310
riger-0.310
TokenBut
Token ID1537
Feature activation+6.47e-01

Pos logit contributions
col+0.052
 gn+0.051
 disl+0.047
riger+0.042
 dis+0.042
Neg logit contributions
ap-0.045
 upon-0.043
ucks-0.043
apping-0.042
angles-0.040
Token the
Token ID262
Feature activation-2.90e+00

Pos logit contributions
ap+0.954
ucks+0.953
 upon+0.904
apping+0.899
angles+0.877
Neg logit contributions
col-0.958
 gn-0.949
 disl-0.859
 dis-0.782
 rooft-0.752
Token first
Token ID717
Feature activation-1.19e+00

Pos logit contributions
col+5.807
 disl+5.316
 gn+4.935
growth+4.726
keeping+4.698
Neg logit contributions
 upon-3.724
 friend-3.470
ap-3.373
 friends-3.176
apping-3.051
Token child
Token ID1200
Feature activation-4.99e-02

Pos logit contributions
col+2.303
 gn+2.176
 disl+2.173
riger+1.954
growth+1.933
Neg logit contributions
ap-1.334
 upon-1.267
apping-1.197
ucks-1.137
aps-1.073
Token said
Token ID531
Feature activation-4.18e-01

Pos logit contributions
col+0.084
 gn+0.081
 disl+0.077
riger+0.068
 dis+0.068
Neg logit contributions
ap-0.065
 upon-0.061
ucks-0.061
apping-0.060
angles-0.056
Token,
Token ID11
Feature activation-1.43e-01

Pos logit contributions
col+0.663
 gn+0.631
 disl+0.598
growth+0.532
riger+0.531
Neg logit contributions
ap-0.601
 upon-0.565
apping-0.556
ucks-0.548
aps-0.524
Token "
Token ID366
Feature activation+8.37e-02

Pos logit contributions
col+0.269
 gn+0.260
 disl+0.248
riger+0.223
growth+0.223
Neg logit contributions
ap-0.159
 upon-0.149
ucks-0.147
apping-0.145
aps-0.135
Token seen
Token ID1775
Feature activation+8.87e-02

Pos logit contributions
ap+0.194
ucks+0.184
 upon+0.181
apping+0.179
angles+0.169
Neg logit contributions
col-0.275
 gn-0.266
 disl-0.251
 dis-0.224
riger-0.224
Token it
Token ID340
Feature activation-2.89e-01

Pos logit contributions
ap+0.088
ucks+0.081
 upon+0.081
apping+0.079
aps+0.073
Neg logit contributions
col-0.178
 gn-0.173
 disl-0.164
 dis-0.150
riger-0.149
Token?"
Token ID1701
Feature activation-6.71e-02

Pos logit contributions
col+0.494
 gn+0.476
 disl+0.452
riger+0.402
growth+0.402
Neg logit contributions
ap-0.370
 upon-0.358
ucks-0.345
apping-0.340
angles-0.322
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
col+0.108
 gn+0.104
 disl+0.098
riger+0.086
 dis+0.086
Neg logit contributions
ap-0.094
 upon-0.089
ucks-0.089
apping-0.087
aps-0.083
Token
Token ID198
Feature activation-5.73e-02

Pos logit contributions
col+0.242
 gn+0.234
 disl+0.220
riger+0.195
 dis+0.194
Neg logit contributions
ap-0.205
 upon-0.195
ucks-0.194
apping-0.190
aps-0.181
TokenThe
Token ID464
Feature activation-2.87e+00

Pos logit contributions
col+0.096
 gn+0.093
 disl+0.087
riger+0.078
 dis+0.078
Neg logit contributions
ap-0.076
 upon-0.072
ucks-0.072
apping-0.071
angles-0.067
Token p
Token ID279
Feature activation+1.98e-01

Pos logit contributions
col+5.443
 disl+5.097
 CRE+4.956
growth+4.869
riger+4.862
Neg logit contributions
 upon-3.640
 friend-3.551
 friends-3.285
ucks-3.190
ap-3.095
Tokenanda
Token ID5282
Feature activation-1.52e-01

Pos logit contributions
ap+0.277
 upon+0.261
ucks+0.261
apping+0.256
angles+0.245
Neg logit contributions
col-0.319
 gn-0.307
 disl-0.288
riger-0.252
 dis-0.252
Token shook
Token ID14682
Feature activation+2.41e-01

Pos logit contributions
col+0.229
 gn+0.220
 disl+0.206
riger+0.182
growth+0.180
Neg logit contributions
ap-0.224
 upon-0.214
ucks-0.211
apping-0.208
angles-0.198
Token his
Token ID465
Feature activation+5.73e-01

Pos logit contributions
ap+0.242
ucks+0.228
 upon+0.223
apping+0.220
angles+0.207
Neg logit contributions
col-0.478
 gn-0.467
 disl-0.442
 dis-0.404
growth-0.401
Token head
Token ID1182
Feature activation-9.88e-01

Pos logit contributions
ap+0.662
ucks+0.641
 upon+0.623
apping+0.618
angles+0.595
Neg logit contributions
 gn-1.030
col-1.030
 disl-0.944
 rooft-0.864
 dis-0.862
Token me
Token ID502
Feature activation-7.87e-02

Pos logit contributions
col+1.472
 gn+1.448
 disl+1.380
riger+1.290
growth+1.278
Neg logit contributions
ap-0.406
 upon-0.367
ucks-0.351
apping-0.324
aps-0.296
Token?"
Token ID1701
Feature activation+2.33e-03

Pos logit contributions
col+0.136
 gn+0.131
 disl+0.124
riger+0.111
 dis+0.110
Neg logit contributions
ap-0.100
 upon-0.094
ucks-0.094
apping-0.092
aps-0.087
Token
Token ID220
Feature activation-2.75e-01

Pos logit contributions
ap+0.003
 upon+0.003
ucks+0.003
apping+0.003
angles+0.003
Neg logit contributions
col-0.004
 gn-0.004
 disl-0.003
riger-0.003
 dis-0.003
Token
Token ID198
Feature activation-5.71e-01

Pos logit contributions
col+0.452
 gn+0.437
 disl+0.411
riger+0.365
 dis+0.362
Neg logit contributions
ap-0.372
 upon-0.355
ucks-0.351
apping-0.345
aps-0.329
Token
Token ID198
Feature activation-3.38e-01

Pos logit contributions
col+0.971
 gn+0.944
 disl+0.892
riger+0.794
 dis+0.780
Neg logit contributions
ap-0.736
 upon-0.699
ucks-0.690
apping-0.679
aps-0.647
TokenThe
Token ID464
Feature activation-2.87e+00

Pos logit contributions
col+0.579
 gn+0.560
 disl+0.527
riger+0.471
 dis+0.467
Neg logit contributions
ap-0.432
 upon-0.407
ucks-0.403
apping-0.399
aps-0.375
Token crab
Token ID32202
Feature activation+3.43e-01

Pos logit contributions
col+5.082
riger+4.843
 disl+4.613
 CRE+4.589
 gn+4.573
Neg logit contributions
 upon-3.925
ap-3.612
 friend-3.599
apping-3.443
 friends-3.290
Token replied
Token ID8712
Feature activation-5.93e-01

Pos logit contributions
ap+0.473
ucks+0.457
 upon+0.445
apping+0.441
angles+0.424
Neg logit contributions
col-0.550
 gn-0.535
 disl-0.496
 dis-0.441
riger-0.433
Token,
Token ID11
Feature activation-3.49e-01

Pos logit contributions
col+0.959
 gn+0.910
 disl+0.874
riger+0.781
growth+0.779
Neg logit contributions
ap-0.843
apping-0.783
 upon-0.783
ucks-0.759
aps-0.733
Token "
Token ID366
Feature activation-8.55e-01

Pos logit contributions
col+0.673
 gn+0.646
 disl+0.620
growth+0.563
riger+0.562
Neg logit contributions
ap-0.371
 upon-0.351
apping-0.339
ucks-0.339
aps-0.312
TokenI
Token ID40
Feature activation-7.39e-01

Pos logit contributions
col+1.623
 gn+1.590
 disl+1.506
riger+1.411
 dis+1.397
Neg logit contributions
ap-0.884
 upon-0.862
ucks-0.845
apping-0.820
aps-0.769
Token like
Token ID588
Feature activation+1.21e-01

Pos logit contributions
col+0.694
 gn+0.674
 disl+0.641
growth+0.575
riger+0.572
Neg logit contributions
ap-0.477
 upon-0.456
ucks-0.439
apping-0.434
aps-0.409
Token me
Token ID502
Feature activation+7.66e-02

Pos logit contributions
ap+0.134
ucks+0.125
 upon+0.124
apping+0.122
angles+0.114
Neg logit contributions
col-0.228
 gn-0.221
 disl-0.209
 dis-0.189
riger-0.189
Token?"
Token ID1701
Feature activation-4.70e-01

Pos logit contributions
ap+0.099
ucks+0.094
 upon+0.093
apping+0.091
angles+0.087
Neg logit contributions
col-0.129
 gn-0.125
 disl-0.118
riger-0.105
 dis-0.105
Token
Token ID198
Feature activation-2.84e-02

Pos logit contributions
col+0.784
 gn+0.748
 disl+0.708
riger+0.632
 rooft+0.627
Neg logit contributions
ap-0.628
 upon-0.592
ucks-0.589
apping-0.575
aps-0.549
Token
Token ID198
Feature activation-4.30e-01

Pos logit contributions
col+0.045
 gn+0.044
 disl+0.041
riger+0.036
 dis+0.036
Neg logit contributions
ap-0.040
 upon-0.038
ucks-0.037
apping-0.037
angles-0.035
TokenThe
Token ID464
Feature activation-2.87e+00

Pos logit contributions
col+0.781
 gn+0.746
 disl+0.713
 dis+0.638
riger+0.636
Neg logit contributions
ap-0.512
 upon-0.481
ucks-0.477
apping-0.467
aps-0.440
Token crab
Token ID32202
Feature activation-7.21e-02

Pos logit contributions
col+5.307
riger+4.952
 disl+4.817
 gn+4.723
 CRE+4.611
Neg logit contributions
 upon-3.789
 friend-3.573
ap-3.534
apping-3.341
 friends-3.216
Token looked
Token ID3114
Feature activation+6.63e-01

Pos logit contributions
col+0.113
 gn+0.109
 disl+0.103
riger+0.091
 dis+0.090
Neg logit contributions
ap-0.102
 upon-0.097
ucks-0.097
apping-0.095
aps-0.090
Token at
Token ID379
Feature activation+2.02e-01

Pos logit contributions
ap+0.813
ucks+0.781
apping+0.753
 upon+0.726
angles+0.721
Neg logit contributions
col-1.165
 gn-1.128
 disl-1.055
 dis-0.957
 rooft-0.939
Token T
Token ID309
Feature activation+1.28e+00

Pos logit contributions
ap+0.271
 upon+0.255
ucks+0.254
apping+0.253
aps+0.238
Neg logit contributions
col-0.334
 gn-0.324
 disl-0.303
growth-0.272
 dis-0.270
Tokenuna
Token ID9613
Feature activation-1.26e-01

Pos logit contributions
ap+1.576
 upon+1.433
ucks+1.417
apping+1.410
ooks+1.371
Neg logit contributions
col-2.212
 gn-2.183
 disl-1.991
growth-1.860
yah-1.770
Token luggage
Token ID31488
Feature activation+7.42e-01

Pos logit contributions
col+0.238
 gn+0.230
 disl+0.218
riger+0.196
 dis+0.194
Neg logit contributions
ap-0.174
 upon-0.165
ucks-0.163
apping-0.160
aps-0.151
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
ap+0.978
ucks+0.958
 upon+0.910
apping+0.895
angles+0.872
Neg logit contributions
col-1.242
 gn-1.198
 disl-1.092
 rooft-0.985
 dis-0.981
Token
Token ID220
Feature activation+4.90e-01

Pos logit contributions
ap+0.030
 upon+0.029
ucks+0.029
apping+0.028
angles+0.027
Neg logit contributions
col-0.032
 gn-0.031
 disl-0.029
riger-0.026
 dis-0.026
Token
Token ID198
Feature activation-5.89e-01

Pos logit contributions
ap+0.731
ucks+0.689
 upon+0.678
apping+0.676
angles+0.652
Neg logit contributions
col-0.727
 gn-0.694
 disl-0.650
 dis-0.570
growth-0.564
Token
Token ID198
Feature activation-4.51e-01

Pos logit contributions
col+1.002
 gn+0.974
 disl+0.933
riger+0.824
growth+0.818
Neg logit contributions
ap-0.746
 upon-0.737
ucks-0.717
apping-0.699
aps-0.669
TokenThe
Token ID464
Feature activation-2.80e+00

Pos logit contributions
col+0.801
 gn+0.773
 disl+0.734
growth+0.660
riger+0.657
Neg logit contributions
ap-0.542
 upon-0.520
ucks-0.511
apping-0.503
aps-0.470
Token elephant
Token ID20950
Feature activation+5.80e-02

Pos logit contributions
col+5.692
 disl+5.331
 gn+5.282
riger+5.149
 CRE+5.022
Neg logit contributions
 upon-3.007
 friend-2.858
ap-2.644
 for-2.487
 friends-2.378
Token asked
Token ID1965
Feature activation-1.95e-02

Pos logit contributions
ap+0.082
ucks+0.078
 upon+0.077
apping+0.076
angles+0.072
Neg logit contributions
col-0.091
 gn-0.088
 disl-0.082
 dis-0.073
riger-0.073
Token,
Token ID11
Feature activation-1.66e-02

Pos logit contributions
col+0.033
 gn+0.032
 disl+0.030
riger+0.027
 dis+0.027
Neg logit contributions
ap-0.026
 upon-0.024
ucks-0.024
apping-0.024
angles-0.022
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
col+0.030
 gn+0.030
 disl+0.028
riger+0.025
 dis+0.025
Neg logit contributions
ap-0.019
 upon-0.018
ucks-0.018
apping-0.017
angles-0.016
TokenWhat
Token ID2061
Feature activation+7.38e-01

Pos logit contributions
col+1.758
 gn+1.664
 disl+1.659
 dis+1.453
riger+1.452
Neg logit contributions
 upon-1.206
ap-1.169
ucks-1.160
apping-1.069
angles-1.022
Token with
Token ID351
Feature activation-1.20e-01

Pos logit contributions
ap+1.196
ucks+1.138
apping+1.101
angles+1.096
 upon+1.065
Neg logit contributions
col-1.554
 gn-1.504
 disl-1.362
 dis-1.242
 rooft-1.216
Token him
Token ID683
Feature activation+4.48e-01

Pos logit contributions
col+0.219
 gn+0.212
 disl+0.201
riger+0.181
 dis+0.180
Neg logit contributions
ap-0.139
 upon-0.131
ucks-0.130
apping-0.126
angles-0.119
Token.
Token ID13
Feature activation+3.22e-01

Pos logit contributions
ap+0.586
ucks+0.568
apping+0.547
 upon+0.544
angles+0.523
Neg logit contributions
col-0.742
 gn-0.721
 disl-0.673
 dis-0.608
riger-0.598
Token
Token ID198
Feature activation-2.88e-01

Pos logit contributions
ap+0.459
ucks+0.434
 upon+0.431
apping+0.425
angles+0.413
Neg logit contributions
col-0.506
 gn-0.488
 disl-0.456
 dis-0.403
growth-0.400
Token
Token ID198
Feature activation+1.07e-01

Pos logit contributions
col+0.473
 gn+0.460
 disl+0.434
riger+0.385
growth+0.382
Neg logit contributions
ap-0.385
 upon-0.371
ucks-0.367
apping-0.359
aps-0.342
TokenThe
Token ID464
Feature activation-2.77e+00

Pos logit contributions
ap+0.146
ucks+0.138
 upon+0.138
apping+0.135
angles+0.129
Neg logit contributions
col-0.177
 gn-0.171
 disl-0.160
 dis-0.143
riger-0.142
Token little
Token ID1310
Feature activation-3.44e-01

Pos logit contributions
col+5.873
 disl+5.700
 gn+5.439
riger+5.383
growth+5.218
Neg logit contributions
 upon-2.885
ap-2.581
 friend-2.568
 for-2.260
 friends-2.212
Token girl
Token ID2576
Feature activation+4.17e-01

Pos logit contributions
col+0.724
 gn+0.700
 disl+0.676
riger+0.619
growth+0.616
Neg logit contributions
ap-0.311
 upon-0.293
ucks-0.285
apping-0.274
aps-0.250
Token was
Token ID373
Feature activation-2.76e-01

Pos logit contributions
ap+0.597
ucks+0.578
 upon+0.568
apping+0.561
angles+0.540
Neg logit contributions
col-0.645
 gn-0.628
 disl-0.577
 dis-0.516
growth-0.504
Token so
Token ID523
Feature activation-5.23e-01

Pos logit contributions
col+0.424
 gn+0.405
 disl+0.383
riger+0.339
growth+0.339
Neg logit contributions
ap-0.407
 upon-0.387
ucks-0.381
apping-0.381
aps-0.361
Token happy
Token ID3772
Feature activation+7.82e-01

Pos logit contributions
col+0.911
 gn+0.877
 disl+0.825
growth+0.764
riger+0.756
Neg logit contributions
ap-0.676
 upon-0.641
apping-0.631
ucks-0.622
aps-0.593
Token the
Token ID262
Feature activation-1.61e+00

Pos logit contributions
col+2.700
 gn+2.549
 disl+2.476
riger+2.266
 CRE+2.211
Neg logit contributions
ap-1.580
 upon-1.548
ucks-1.465
angles-1.344
 for-1.343
Token jungle
Token ID20712
Feature activation-1.02e-01

Pos logit contributions
col+2.844
 disl+2.700
 gn+2.674
 dis+2.319
growth+2.285
Neg logit contributions
ap-1.951
 upon-1.914
ucks-1.815
apping-1.689
 for-1.661
Token?"
Token ID1701
Feature activation+1.42e-01

Pos logit contributions
col+0.176
 gn+0.170
 disl+0.161
riger+0.144
 dis+0.143
Neg logit contributions
ap-0.129
 upon-0.121
ucks-0.120
apping-0.118
aps-0.111
Token
Token ID198
Feature activation-2.54e-01

Pos logit contributions
ap+0.219
ucks+0.210
 upon+0.209
apping+0.206
angles+0.198
Neg logit contributions
col-0.205
 gn-0.197
 disl-0.184
 dis-0.159
riger-0.159
Token
Token ID198
Feature activation-2.06e-01

Pos logit contributions
col+0.416
 gn+0.403
 disl+0.380
riger+0.337
 dis+0.335
Neg logit contributions
ap-0.344
 upon-0.328
ucks-0.326
apping-0.319
aps-0.304
TokenThe
Token ID464
Feature activation-2.77e+00

Pos logit contributions
col+0.356
 gn+0.344
 disl+0.324
riger+0.290
growth+0.290
Neg logit contributions
ap-0.260
 upon-0.245
ucks-0.244
apping-0.239
aps-0.226
Token child
Token ID1200
Feature activation+3.39e-01

Pos logit contributions
col+5.529
 disl+5.206
 gn+5.131
riger+5.097
 CRE+4.921
Neg logit contributions
 upon-2.946
 friend-2.853
ap-2.656
 friends-2.436
 for-2.413
Token said
Token ID531
Feature activation-8.87e-01

Pos logit contributions
ap+0.473
ucks+0.458
 upon+0.446
apping+0.442
angles+0.423
Neg logit contributions
col-0.536
 gn-0.521
 disl-0.483
 dis-0.431
riger-0.423
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
col+1.487
 gn+1.405
 disl+1.356
growth+1.239
 rooft+1.219
Neg logit contributions
ap-1.228
 upon-1.134
apping-1.128
ucks-1.058
aps-1.043
Token "
Token ID366
Feature activation-8.21e-01

Pos logit contributions
col+0.368
 gn+0.356
 disl+0.339
riger+0.307
growth+0.306
Neg logit contributions
ap-0.213
 upon-0.200
ucks-0.196
apping-0.194
aps-0.180
TokenI
Token ID40
Feature activation-8.75e-01

Pos logit contributions
col+1.610
 gn+1.578
 disl+1.507
riger+1.395
 dis+1.379
Neg logit contributions
ap-0.800
 upon-0.794
ucks-0.774
apping-0.750
aps-0.692
Token he
Token ID339
Feature activation-1.41e-01

Pos logit contributions
ap+0.193
ucks+0.183
 upon+0.181
apping+0.179
aps+0.170
Neg logit contributions
col-0.218
 gn-0.212
 disl-0.198
 dis-0.176
riger-0.175
Token could
Token ID714
Feature activation+6.19e-01

Pos logit contributions
col+0.224
 gn+0.215
 disl+0.202
riger+0.179
growth+0.178
Neg logit contributions
ap-0.199
 upon-0.190
ucks-0.189
apping-0.185
aps-0.176
Token play
Token ID711
Feature activation+5.16e-01

Pos logit contributions
ap+0.790
ucks+0.771
 upon+0.756
apping+0.751
angles+0.726
Neg logit contributions
col-1.047
 gn-1.028
 disl-0.944
 dis-0.855
riger-0.838
Token too
Token ID1165
Feature activation+6.73e-05

Pos logit contributions
ap+0.537
 upon+0.510
ucks+0.507
apping+0.493
angles+0.477
Neg logit contributions
col-1.000
 gn-0.977
 disl-0.923
 dis-0.838
riger-0.832
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
ap+0.000
 upon+0.000
ucks+0.000
apping+0.000
angles+0.000
Neg logit contributions
col-0.000
 gn-0.000
 disl-0.000
riger-0.000
 dis-0.000
Token The
Token ID383
Feature activation-2.73e+00

Pos logit contributions
col+0.201
 gn+0.194
 disl+0.183
riger+0.165
growth+0.165
Neg logit contributions
ap-0.131
 upon-0.124
ucks-0.123
apping-0.120
aps-0.113
Token babys
Token ID46711
Feature activation+3.37e-01

Pos logit contributions
col+5.240
 disl+5.121
 gn+4.890
growth+4.843
riger+4.779
Neg logit contributions
 friend-2.995
 upon-2.881
ap-2.813
 friends-2.714
ucks-2.691
Tokenitter
Token ID1967
Feature activation-2.05e-01

Pos logit contributions
ap+0.371
 upon+0.348
ucks+0.344
apping+0.329
angles+0.323
Neg logit contributions
col-0.636
 gn-0.611
 disl-0.580
 dis-0.525
riger-0.521
Token said
Token ID531
Feature activation-8.47e-01

Pos logit contributions
col+0.327
 gn+0.316
 disl+0.297
riger+0.264
growth+0.264
Neg logit contributions
ap-0.284
 upon-0.270
ucks-0.267
apping-0.264
aps-0.249
Token it
Token ID340
Feature activation-4.82e-01

Pos logit contributions
col+1.379
 gn+1.285
 disl+1.226
growth+1.125
 rooft+1.113
Neg logit contributions
ap-1.186
 upon-1.141
apping-1.105
ucks-1.056
aps-1.017
Token was
Token ID373
Feature activation+1.12e+00

Pos logit contributions
col+0.889
 gn+0.857
 disl+0.824
growth+0.746
riger+0.743
Neg logit contributions
ap-0.547
 upon-0.522
ucks-0.501
apping-0.495
aps-0.461
Token it
Token ID340
Feature activation+5.05e-01

Pos logit contributions
ap+0.187
ucks+0.177
 upon+0.175
apping+0.172
aps+0.162
Neg logit contributions
col-0.275
 gn-0.267
 disl-0.251
riger-0.226
 dis-0.226
Token looks
Token ID3073
Feature activation+4.08e-01

Pos logit contributions
ap+0.645
ucks+0.623
apping+0.609
 upon+0.607
ooks+0.575
Neg logit contributions
col-0.855
 gn-0.830
 disl-0.774
riger-0.698
 dis-0.689
Token fun
Token ID1257
Feature activation-7.65e-01

Pos logit contributions
ap+0.635
ucks+0.618
apping+0.601
 upon+0.593
aps+0.574
Neg logit contributions
col-0.586
 gn-0.564
 disl-0.522
 dis-0.458
 rooft-0.455
Token!"
Token ID2474
Feature activation+1.89e-01

Pos logit contributions
col+1.306
 gn+1.286
 disl+1.208
growth+1.093
riger+1.074
Neg logit contributions
ap-0.989
 upon-0.953
apping-0.907
ucks-0.901
angles-0.859
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
ap+0.316
ucks+0.304
 upon+0.304
apping+0.298
angles+0.288
Neg logit contributions
col-0.249
 gn-0.237
 disl-0.219
 dis-0.187
riger-0.187
TokenThe
Token ID464
Feature activation-2.72e+00

Pos logit contributions
ap+0.182
 upon+0.172
ucks+0.172
apping+0.169
angles+0.162
Neg logit contributions
col-0.200
 gn-0.192
 disl-0.180
 dis-0.159
riger-0.158
Token second
Token ID1218
Feature activation-9.80e-01

Pos logit contributions
col+5.794
 disl+5.227
 gn+5.118
growth+4.925
keeping+4.800
Neg logit contributions
 upon-2.887
ap-2.800
 friend-2.703
apping-2.480
 friends-2.453
Token child
Token ID1200
Feature activation-1.82e-01

Pos logit contributions
col+1.805
 gn+1.754
 disl+1.693
riger+1.535
growth+1.514
Neg logit contributions
ap-1.174
 upon-1.113
apping-1.075
ucks-1.055
angles-0.977
Token said
Token ID531
Feature activation-2.43e-01

Pos logit contributions
col+0.299
 gn+0.288
 disl+0.273
riger+0.242
growth+0.241
Neg logit contributions
ap-0.244
 upon-0.232
ucks-0.229
apping-0.226
angles-0.214
Token,
Token ID11
Feature activation-3.78e-01

Pos logit contributions
col+0.376
 gn+0.362
 disl+0.341
riger+0.301
growth+0.300
Neg logit contributions
ap-0.355
 upon-0.334
apping-0.330
ucks-0.329
aps-0.312
Token "
Token ID366
Feature activation-2.78e-01

Pos logit contributions
col+0.728
 gn+0.704
 disl+0.675
riger+0.610
growth+0.609
Neg logit contributions
ap-0.406
 upon-0.380
apping-0.371
ucks-0.367
aps-0.339
Token there
Token ID612
Feature activation+2.32e-01

Pos logit contributions
col+0.523
 gn+0.505
 disl+0.479
growth+0.433
riger+0.431
Neg logit contributions
ap-0.339
 upon-0.321
ucks-0.312
apping-0.311
aps-0.291
Token.
Token ID13
Feature activation+5.89e-02

Pos logit contributions
ap+0.294
ucks+0.281
 upon+0.273
apping+0.271
angles+0.258
Neg logit contributions
col-0.396
 gn-0.384
 disl-0.360
 dis-0.324
riger-0.322
Token
Token ID220
Feature activation+3.00e-01

Pos logit contributions
ap+0.086
ucks+0.081
 upon+0.081
apping+0.080
angles+0.076
Neg logit contributions
col-0.091
 gn-0.088
 disl-0.082
riger-0.072
 dis-0.072
Token
Token ID198
Feature activation-6.85e-01

Pos logit contributions
ap+0.436
ucks+0.411
 upon+0.407
apping+0.403
angles+0.387
Neg logit contributions
col-0.456
 gn-0.439
 disl-0.410
 dis-0.362
riger-0.360
Token
Token ID198
Feature activation-3.34e-01

Pos logit contributions
col+1.181
 gn+1.145
 disl+1.098
riger+0.965
growth+0.964
Neg logit contributions
ap-0.863
 upon-0.847
ucks-0.822
apping-0.805
aps-0.773
TokenThe
Token ID464
Feature activation-2.71e+00

Pos logit contributions
col+0.603
 gn+0.582
 disl+0.551
growth+0.496
riger+0.493
Neg logit contributions
ap-0.396
 upon-0.375
ucks-0.370
apping-0.364
aps-0.341
Token elf
Token ID23878
Feature activation+1.10e-01

Pos logit contributions
col+5.528
 disl+5.378
 gn+4.824
 CRE+4.692
growth+4.611
Neg logit contributions
 upon-3.052
 friend-2.657
ap-2.571
 for-2.450
 friends-2.401
Token said
Token ID531
Feature activation-1.43e+00

Pos logit contributions
ap+0.157
ucks+0.150
 upon+0.149
apping+0.147
angles+0.140
Neg logit contributions
col-0.170
 gn-0.164
 disl-0.153
 dis-0.135
riger-0.134
Token that
Token ID326
Feature activation-5.48e-01

Pos logit contributions
col+2.500
 gn+2.301
 disl+2.245
growth+2.116
 rooft+2.068
Neg logit contributions
ap-1.845
 upon-1.748
apping-1.720
 for-1.626
aps-1.556
Token he
Token ID339
Feature activation+4.69e-01

Pos logit contributions
col+0.927
 gn+0.881
 disl+0.845
riger+0.759
growth+0.756
Neg logit contributions
ap-0.706
 upon-0.685
ucks-0.654
apping-0.648
angles-0.616
Token had
Token ID550
Feature activation+4.36e-01

Pos logit contributions
ap+0.711
ucks+0.679
apping+0.664
 upon+0.655
aps+0.633
Neg logit contributions
col-0.702
 gn-0.687
 disl-0.629
 dis-0.561
riger-0.548
Token you
Token ID345
Feature activation-1.09e+00

Pos logit contributions
ap+0.727
ucks+0.704
apping+0.665
 upon+0.652
aps+0.615
Neg logit contributions
col-1.613
 gn-1.581
 disl-1.493
 dis-1.388
riger-1.383
Token doing
Token ID1804
Feature activation-7.48e-01

Pos logit contributions
col+2.067
 gn+1.920
 disl+1.878
growth+1.741
 rooft+1.675
Neg logit contributions
ap-1.325
 upon-1.287
apping-1.189
ucks-1.105
aps-1.071
Token?"
Token ID1701
Feature activation-1.16e-01

Pos logit contributions
col+1.401
 gn+1.330
 disl+1.275
growth+1.160
 rooft+1.138
Neg logit contributions
ap-0.894
 upon-0.827
apping-0.808
ucks-0.788
aps-0.747
Token
Token ID198
Feature activation-7.22e-01

Pos logit contributions
col+0.188
 gn+0.181
 disl+0.171
riger+0.151
 dis+0.151
Neg logit contributions
ap-0.160
 upon-0.152
ucks-0.151
apping-0.148
aps-0.141
Token
Token ID198
Feature activation-6.54e-01

Pos logit contributions
col+1.252
 gn+1.212
 disl+1.164
riger+1.032
 dis+1.018
Neg logit contributions
ap-0.907
 upon-0.871
ucks-0.861
apping-0.834
aps-0.808
TokenThe
Token ID464
Feature activation-2.71e+00

Pos logit contributions
col+1.167
 gn+1.119
 disl+1.068
growth+0.958
riger+0.955
Neg logit contributions
ap-0.790
 upon-0.741
ucks-0.735
apping-0.725
aps-0.685
Token bear
Token ID6842
Feature activation+2.99e-01

Pos logit contributions
col+5.350
 disl+4.674
 CRE+4.634
growth+4.416
riger+4.410
Neg logit contributions
 upon-3.248
 friend-2.977
ucks-2.843
ap-2.806
 friends-2.757
Token replied
Token ID8712
Feature activation-5.32e-01

Pos logit contributions
ap+0.421
ucks+0.405
 upon+0.397
apping+0.393
angles+0.376
Neg logit contributions
col-0.470
 gn-0.458
 disl-0.423
 dis-0.378
riger-0.371
Token,
Token ID11
Feature activation-5.74e-02

Pos logit contributions
col+0.857
 gn+0.814
 disl+0.779
growth+0.696
riger+0.694
Neg logit contributions
ap-0.758
 upon-0.707
apping-0.704
ucks-0.686
aps-0.659
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
col+0.107
 gn+0.104
 disl+0.098
riger+0.089
 dis+0.088
Neg logit contributions
ap-0.064
 upon-0.060
ucks-0.060
apping-0.059
aps-0.055
TokenI
Token ID40
Feature activation-7.24e-01

Pos logit contributions
col+1.974
 gn+1.935
 disl+1.847
riger+1.718
growth+1.695
Neg logit contributions
 upon-0.957
ap-0.929
ucks-0.915
apping-0.878
aps-0.810
Token one
Token ID530
Feature activation+1.14e-01

Pos logit contributions
ap+0.731
ucks+0.709
 upon+0.686
apping+0.682
ooks+0.652
Neg logit contributions
col-0.928
 gn-0.900
 disl-0.840
riger-0.758
 dis-0.750
Token day
Token ID1110
Feature activation+2.77e-01

Pos logit contributions
ap+0.146
ucks+0.139
 upon+0.138
apping+0.135
angles+0.129
Neg logit contributions
col-0.194
 gn-0.188
 disl-0.176
riger-0.158
 dis-0.157
Token?"
Token ID1701
Feature activation-2.83e-03

Pos logit contributions
ap+0.369
ucks+0.355
 upon+0.347
apping+0.343
angles+0.330
Neg logit contributions
col-0.451
 gn-0.436
 disl-0.406
 dis-0.364
riger-0.364
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
col+0.004
 gn+0.004
 disl+0.004
riger+0.004
 dis+0.004
Neg logit contributions
ap-0.004
 upon-0.004
ucks-0.004
apping-0.004
angles-0.004
Token
Token ID198
Feature activation-9.05e-02

Pos logit contributions
col+0.546
 gn+0.528
 disl+0.499
riger+0.444
 dis+0.439
Neg logit contributions
ap-0.446
 upon-0.423
ucks-0.421
apping-0.414
aps-0.394
TokenThe
Token ID464
Feature activation-2.71e+00

Pos logit contributions
col+0.153
 gn+0.148
 disl+0.140
riger+0.124
 dis+0.124
Neg logit contributions
ap-0.118
 upon-0.111
ucks-0.111
apping-0.109
aps-0.103
Token elf
Token ID23878
Feature activation+4.28e-01

Pos logit contributions
col+3.820
 disl+3.738
 gn+3.172
 CRE+3.101
growth+3.098
Neg logit contributions
 upon-4.566
ap-4.143
 friend-4.040
 for-3.879
 friends-3.866
Token smiled
Token ID13541
Feature activation+2.56e-01

Pos logit contributions
ap+0.606
ucks+0.588
 upon+0.572
apping+0.563
angles+0.543
Neg logit contributions
col-0.669
 gn-0.649
 disl-0.596
 dis-0.536
riger-0.526
Token and
Token ID290
Feature activation-4.01e-01

Pos logit contributions
ap+0.295
ucks+0.279
 upon+0.273
apping+0.268
angles+0.259
Neg logit contributions
col-0.467
 gn-0.456
 disl-0.426
 dis-0.386
 rooft-0.383
Token said
Token ID531
Feature activation-7.89e-01

Pos logit contributions
col+0.649
 gn+0.623
 disl+0.591
growth+0.525
riger+0.524
Neg logit contributions
ap-0.548
 upon-0.522
ucks-0.514
apping-0.507
angles-0.480
Token,
Token ID11
Feature activation-7.26e-01

Pos logit contributions
col+1.292
 gn+1.229
 disl+1.188
growth+1.065
riger+1.050
Neg logit contributions
ap-1.109
 upon-1.032
apping-1.030
ucks-0.980
aps-0.960
Token love
Token ID1842
Feature activation-1.81e-01

Pos logit contributions
ap+0.269
ucks+0.257
 upon+0.251
apping+0.250
angles+0.236
Neg logit contributions
col-0.394
 gn-0.382
 disl-0.361
riger-0.323
 dis-0.323
Token to
Token ID284
Feature activation+3.38e-03

Pos logit contributions
col+0.338
 gn+0.328
 disl+0.311
riger+0.282
growth+0.281
Neg logit contributions
ap-0.203
 upon-0.192
ucks-0.190
apping-0.185
aps-0.174
Token play
Token ID711
Feature activation-6.92e-01

Pos logit contributions
ap+0.004
 upon+0.003
ucks+0.003
apping+0.003
angles+0.003
Neg logit contributions
col-0.006
 gn-0.006
 disl-0.006
riger-0.005
 dis-0.005
Token,"
Token ID553
Feature activation+6.38e-01

Pos logit contributions
col+1.320
 gn+1.273
 disl+1.216
growth+1.122
riger+1.112
Neg logit contributions
ap-0.762
 upon-0.702
ucks-0.692
apping-0.677
aps-0.648
Token replied
Token ID8712
Feature activation-5.99e-02

Pos logit contributions
ap+0.956
ucks+0.920
 upon+0.919
apping+0.917
ooks+0.875
Neg logit contributions
col-0.956
 gn-0.938
 disl-0.872
 dis-0.768
riger-0.745
Token the
Token ID262
Feature activation-2.70e+00

Pos logit contributions
col+0.103
 gn+0.100
 disl+0.094
riger+0.084
 dis+0.084
Neg logit contributions
ap-0.076
 upon-0.072
ucks-0.072
apping-0.070
aps-0.066
Token crab
Token ID32202
Feature activation+7.82e-02

Pos logit contributions
col+4.649
riger+4.474
 disl+4.261
 gn+4.186
 CRE+4.079
Neg logit contributions
 upon-3.848
ap-3.569
 friend-3.473
apping-3.429
aps-3.244
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
ap+0.097
ucks+0.092
 upon+0.092
apping+0.090
angles+0.085
Neg logit contributions
col-0.136
 gn-0.132
 disl-0.124
 dis-0.111
riger-0.111
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
col+0.346
 gn+0.332
 disl+0.314
riger+0.280
growth+0.279
Neg logit contributions
ap-0.279
 upon-0.266
ucks-0.265
apping-0.259
aps-0.247
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
ap+1.631
apping+1.507
ucks+1.503
angles+1.483
 upon+1.479
Neg logit contributions
col-1.411
 gn-1.325
 disl-1.209
 dis-1.053
 rooft-1.034
TokenL
Token ID43
Feature activation+5.62e-01

Pos logit contributions
ap+1.319
ucks+1.232
 upon+1.225
apping+1.209
angles+1.158
Neg logit contributions
col-1.975
 gn-1.924
 disl-1.783
riger-1.598
 dis-1.588
Token lesson
Token ID11483
Feature activation+5.59e-02

Pos logit contributions
ap+0.501
ucks+0.482
 upon+0.475
apping+0.471
angles+0.456
Neg logit contributions
col-0.365
 gn-0.351
 disl-0.323
 dis-0.273
riger-0.271
Token.
Token ID13
Feature activation-7.79e-02

Pos logit contributions
ap+0.083
 upon+0.079
ucks+0.079
apping+0.077
angles+0.074
Neg logit contributions
col-0.084
 gn-0.081
 disl-0.076
 dis-0.067
riger-0.066
Token He
Token ID679
Feature activation-5.18e-01

Pos logit contributions
col+0.149
 gn+0.144
 disl+0.137
riger+0.124
 dis+0.124
Neg logit contributions
ap-0.084
 upon-0.079
ucks-0.079
apping-0.077
aps-0.072
Token learned
Token ID4499
Feature activation-1.97e-01

Pos logit contributions
col+0.803
 gn+0.764
 disl+0.729
riger+0.651
growth+0.634
Neg logit contributions
ap-0.736
 upon-0.709
ucks-0.703
apping-0.676
angles-0.656
Token that
Token ID326
Feature activation+4.49e-01

Pos logit contributions
col+0.316
 gn+0.305
 disl+0.286
riger+0.257
growth+0.253
Neg logit contributions
ap-0.274
 upon-0.262
ucks-0.261
apping-0.254
aps-0.243
Token not
Token ID407
Feature activation+1.92e+00

Pos logit contributions
ap+0.631
ucks+0.597
 upon+0.589
apping+0.588
angles+0.560
Neg logit contributions
col-0.716
 gn-0.704
 disl-0.653
 dis-0.586
 rooft-0.565
Token all
Token ID477
Feature activation+2.10e-01

Pos logit contributions
ucks+2.905
ap+2.846
apping+2.737
aps+2.614
angles+2.581
Neg logit contributions
col-2.981
 gn-2.925
 disl-2.770
 dis-2.517
 rooft-2.293
Token plants
Token ID6134
Feature activation+6.47e-02

Pos logit contributions
ap+0.238
ucks+0.223
 upon+0.222
apping+0.218
aps+0.204
Neg logit contributions
col-0.391
 gn-0.378
 disl-0.360
 dis-0.324
riger-0.322
Token are
Token ID389
Feature activation+4.13e-01

Pos logit contributions
ap+0.077
ucks+0.072
 upon+0.072
apping+0.071
angles+0.066
Neg logit contributions
col-0.116
 gn-0.112
 disl-0.106
 dis-0.095
riger-0.095
Token herbs
Token ID25411
Feature activation-9.87e-01

Pos logit contributions
ap+0.596
ucks+0.574
apping+0.558
 upon+0.558
aps+0.531
Neg logit contributions
col-0.639
 gn-0.619
 disl-0.583
 dis-0.520
 rooft-0.501
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
col+1.844
 gn+1.798
 disl+1.700
growth+1.556
 rooft+1.539
Neg logit contributions
ap-1.130
 upon-1.054
apping-1.020
ucks-0.983
aps-0.951
Token
Token ID198
Feature activation-2.11e-01

Pos logit contributions
col+0.292
 gn+0.283
 disl+0.265
riger+0.237
 dis+0.234
Neg logit contributions
ap-0.245
ucks-0.233
 upon-0.233
apping-0.227
aps-0.217
TokenThey
Token ID2990
Feature activation-4.21e-01

Pos logit contributions
col+0.441
 gn+0.430
 disl+0.409
riger+0.374
growth+0.373
Neg logit contributions
ap-0.195
ucks-0.179
 upon-0.179
apping-0.174
aps-0.160
Token play
Token ID711
Feature activation+1.48e+00

Pos logit contributions
col+0.764
 gn+0.735
 disl+0.706
riger+0.629
 dis+0.629
Neg logit contributions
ap-0.499
 upon-0.465
ucks-0.453
apping-0.442
angles-0.425
Token with
Token ID351
Feature activation+7.70e-01

Pos logit contributions
ucks+1.805
 upon+1.792
ap+1.730
apping+1.681
ooks+1.681
Neg logit contributions
col-2.461
 gn-2.458
 disl-2.265
 dis-2.070
growth-2.043
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
ap+1.012
apping+0.965
 upon+0.960
ucks+0.953
aps+0.884
Neg logit contributions
col-1.282
 gn-1.240
 disl-1.172
 dis-1.064
growth-1.056
Token blue
Token ID4171
Feature activation+1.67e+00

Pos logit contributions
ap+1.678
ucks+1.609
apping+1.571
ooks+1.551
 upon+1.482
Neg logit contributions
col-1.718
 gn-1.707
 disl-1.565
 dis-1.428
 rooft-1.381
Token ball
Token ID2613
Feature activation+7.99e-01

Pos logit contributions
ap+2.471
ooks+2.436
apping+2.378
ucks+2.375
ook+2.290
Neg logit contributions
col-2.417
 gn-2.315
 disl-2.073
growth-1.928
 rooft-1.823
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
ap+0.961
ucks+0.960
 upon+0.933
apping+0.891
ooks+0.875
Neg logit contributions
col-1.381
 gn-1.342
 disl-1.206
growth-1.116
 dis-1.115
Token They
Token ID1119
Feature activation+1.36e-02

Pos logit contributions
col+0.466
 gn+0.455
 disl+0.431
riger+0.396
 dis+0.394
Neg logit contributions
ap-0.217
ucks-0.200
 upon-0.199
apping-0.194
aps-0.180
Token forget
Token ID6044
Feature activation-5.89e-02

Pos logit contributions
ap+0.016
 upon+0.015
ucks+0.015
apping+0.015
angles+0.014
Neg logit contributions
col-0.025
 gn-0.024
 disl-0.023
riger-0.020
 dis-0.020
Token about
Token ID546
Feature activation+5.08e-01

Pos logit contributions
col+0.131
 gn+0.128
 disl+0.122
riger+0.113
 dis+0.112
Neg logit contributions
ap-0.045
 upon-0.041
ucks-0.041
apping-0.040
angles-0.036
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
ap+0.595
ucks+0.567
 upon+0.557
apping+0.552
angles+0.525
Neg logit contributions
col-0.795
 gn-0.773
 disl-0.718
growth-0.640
 dis-0.633
Token
Token ID198
Feature activation-9.68e-02

Pos logit contributions
ap+0.436
 upon+0.412
ucks+0.408
apping+0.403
angles+0.386
Neg logit contributions
col-0.489
 gn-0.473
 disl-0.442
growth-0.390
 dis-0.388
Token
Token ID198
Feature activation+4.66e-01

Pos logit contributions
col+0.156
 gn+0.150
 disl+0.141
riger+0.125
 dis+0.125
Neg logit contributions
ap-0.133
 upon-0.127
ucks-0.127
apping-0.124
aps-0.118
TokenThey
Token ID2990
Feature activation-1.77e-01

Pos logit contributions
ap+0.544
ucks+0.508
 upon+0.503
apping+0.493
aps+0.466
Neg logit contributions
col-0.863
 gn-0.835
 disl-0.787
riger-0.712
 dis-0.709
Token took
Token ID1718
Feature activation-3.98e-01

Pos logit contributions
col+0.294
 gn+0.284
 disl+0.270
riger+0.239
 dis+0.237
Neg logit contributions
ap-0.234
 upon-0.222
ucks-0.219
apping-0.215
angles-0.205
Token Spark
Token ID17732
Feature activation+1.88e+00

Pos logit contributions
col+0.673
 gn+0.652
 disl+0.619
riger+0.548
 dis+0.546
Neg logit contributions
ap-0.522
ucks-0.490
 upon-0.487
apping-0.471
angles-0.454
Tokeny
Token ID88
Feature activation-5.84e-01

Pos logit contributions
ap+2.384
ooks+2.188
ucks+2.186
apping+2.182
aps+2.070
Neg logit contributions
col-3.365
 gn-3.134
 disl-2.893
growth-2.656
j-2.574
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
col+1.102
 gn+1.084
 disl+1.034
riger+0.947
 dis+0.937
Neg logit contributions
ap-0.633
 upon-0.598
ucks-0.577
apping-0.559
aps-0.531
Token their
Token ID511
Feature activation-1.13e-01

Pos logit contributions
ap+1.584
apping+1.548
ucks+1.522
 upon+1.468
itting+1.436
Neg logit contributions
 gn-1.732
col-1.696
 disl-1.582
 dis-1.435
growth-1.433
Token room
Token ID2119
Feature activation+2.58e-01

Pos logit contributions
col+0.201
 gn+0.194
 disl+0.184
riger+0.165
 dis+0.164
Neg logit contributions
ap-0.137
 upon-0.129
ucks-0.128
apping-0.125
aps-0.118
Token and
Token ID290
Feature activation-3.41e-01

Pos logit contributions
ap+0.288
ucks+0.273
 upon+0.268
apping+0.262
angles+0.249
Neg logit contributions
col-0.479
 gn-0.466
 disl-0.435
 dis-0.396
growth-0.393
Token.
Token ID13
Feature activation-2.84e-01

Pos logit contributions
col+1.702
 gn+1.658
 disl+1.568
growth+1.461
 rooft+1.422
Neg logit contributions
ap-0.999
 upon-0.924
ucks-0.900
apping-0.893
aps-0.845
Token You
Token ID921
Feature activation+3.47e-01

Pos logit contributions
col+0.552
 gn+0.535
 disl+0.508
 dis+0.462
riger+0.460
Neg logit contributions
ap-0.297
ucks-0.278
 upon-0.272
apping-0.267
aps-0.251
Token love
Token ID1842
Feature activation+5.62e-01

Pos logit contributions
ap+0.425
ucks+0.408
apping+0.399
 upon+0.398
aps+0.372
Neg logit contributions
col-0.611
 gn-0.593
 disl-0.558
riger-0.507
 dis-0.506
Token w
Token ID266
Feature activation+8.96e-01

Pos logit contributions
ap+0.512
apping+0.469
ucks+0.463
 upon+0.460
aps+0.416
Neg logit contributions
col-1.178
 gn-1.146
 disl-1.090
 dis-0.999
 rooft-0.989
Tokenaffles
Token ID48501
Feature activation-4.73e-01

Pos logit contributions
ap+1.507
 upon+1.485
ucks+1.442
apping+1.421
aps+1.354
Neg logit contributions
col-1.149
 gn-1.057
 disl-1.029
 dis-0.866
riger-0.861
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
col+0.831
 gn+0.803
 disl+0.757
growth+0.689
 rooft+0.675
Neg logit contributions
ap-0.596
 upon-0.551
ucks-0.549
apping-0.544
aps-0.515
Token right
Token ID826
Feature activation-1.19e+00

Pos logit contributions
ap+2.228
ucks+2.158
apping+2.117
ooks+2.055
aps+1.985
Neg logit contributions
col-2.657
 gn-2.603
 disl-2.427
 dis-2.197
 rooft-2.159
Token?"
Token ID1701
Feature activation+5.24e-03

Pos logit contributions
col+2.231
 gn+2.156
 disl+2.044
growth+1.903
riger+1.880
Neg logit contributions
ap-1.425
 upon-1.382
ucks-1.315
apping-1.243
angles-1.207
Token
Token ID198
Feature activation-1.53e-01

Pos logit contributions
ap+0.007
 upon+0.007
ucks+0.007
apping+0.007
angles+0.006
Neg logit contributions
col-0.008
 gn-0.008
 disl-0.008
riger-0.007
 dis-0.007
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
col+0.248
 gn+0.239
 disl+0.225
riger+0.200
 dis+0.198
Neg logit contributions
ap-0.210
 upon-0.200
ucks-0.199
apping-0.195
aps-0.186
TokenZ
Token ID57
Feature activation+1.29e+00

Pos logit contributions
ap+0.317
 upon+0.299
ucks+0.298
apping+0.293
angles+0.282
Neg logit contributions
col-0.367
 gn-0.355
 disl-0.331
 dis-0.294
riger-0.293
Token did
Token ID750
Feature activation+1.23e+00

Pos logit contributions
col+0.204
 gn+0.198
 disl+0.188
riger+0.168
growth+0.167
Neg logit contributions
ap-0.140
 upon-0.132
ucks-0.131
apping-0.128
angles-0.121
Token not
Token ID407
Feature activation+1.03e+00

Pos logit contributions
ucks+1.772
ap+1.750
apping+1.726
 upon+1.612
aps+1.581
Neg logit contributions
 gn-1.854
col-1.850
 disl-1.779
 dis-1.602
riger-1.489
Token work
Token ID670
Feature activation-6.28e-01

Pos logit contributions
ucks+1.461
ap+1.458
apping+1.411
 upon+1.396
ooks+1.359
Neg logit contributions
col-1.580
 gn-1.564
 disl-1.464
 dis-1.347
riger-1.300
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
col+1.180
 gn+1.150
 disl+1.095
growth+0.988
riger+0.973
Neg logit contributions
ap-0.713
 upon-0.668
apping-0.652
ucks-0.647
angles-0.607
Token The
Token ID383
Feature activation-4.31e-01

Pos logit contributions
col+0.052
 gn+0.050
 disl+0.048
riger+0.043
 dis+0.043
Neg logit contributions
ap-0.028
 upon-0.026
ucks-0.026
apping-0.025
angles-0.024
Token day
Token ID1110
Feature activation+1.61e+00

Pos logit contributions
col+0.701
 gn+0.670
 disl+0.639
riger+0.561
growth+0.559
Neg logit contributions
ap-0.598
 upon-0.573
ucks-0.565
apping-0.549
aps-0.527
Token did
Token ID750
Feature activation+1.13e+00

Pos logit contributions
ucks+2.336
ap+2.246
 upon+2.194
apping+2.139
aps+2.126
Neg logit contributions
col-2.479
 gn-2.348
 dis-2.058
 disl-2.050
growth-1.971
Token not
Token ID407
Feature activation+5.65e-01

Pos logit contributions
ucks+1.584
ap+1.550
apping+1.532
 upon+1.468
aps+1.436
Neg logit contributions
 gn-1.746
col-1.740
 disl-1.693
 dis-1.504
riger-1.429
Token end
Token ID886
Feature activation-6.18e-01

Pos logit contributions
ap+0.734
ucks+0.726
 upon+0.712
apping+0.697
ooks+0.665
Neg logit contributions
col-0.936
 gn-0.916
 disl-0.862
 dis-0.775
riger-0.765
Token well
Token ID880
Feature activation-1.20e+00

Pos logit contributions
col+1.168
 gn+1.130
 disl+1.067
riger+0.978
growth+0.976
Neg logit contributions
ap-0.699
 upon-0.643
ucks-0.639
apping-0.623
aps-0.594
Token for
Token ID329
Feature activation+4.67e-01

Pos logit contributions
col+2.372
 gn+2.306
 disl+2.181
growth+2.016
riger+1.974
Neg logit contributions
ap-1.296
apping-1.188
 upon-1.185
ucks-1.111
aps-1.066
Token out
Token ID503
Feature activation-3.19e-01

Pos logit contributions
col+0.113
 gn+0.110
 disl+0.104
riger+0.093
 dis+0.093
Neg logit contributions
ap-0.078
 upon-0.073
ucks-0.073
apping-0.071
angles-0.067
Token.
Token ID13
Feature activation-1.79e-01

Pos logit contributions
col+0.578
 gn+0.561
 disl+0.535
riger+0.481
growth+0.479
Neg logit contributions
ap-0.375
 upon-0.357
ucks-0.349
apping-0.344
aps-0.324
Token He
Token ID679
Feature activation+2.32e-01

Pos logit contributions
col+0.288
 gn+0.278
 disl+0.260
riger+0.231
 dis+0.230
Neg logit contributions
ap-0.250
 upon-0.239
ucks-0.237
apping-0.232
aps-0.221
Token pointed
Token ID6235
Feature activation+3.24e-01

Pos logit contributions
ap+0.352
ucks+0.335
 upon+0.331
apping+0.329
aps+0.315
Neg logit contributions
col-0.345
 gn-0.333
 disl-0.305
 dis-0.272
riger-0.269
Token at
Token ID379
Feature activation-3.65e-01

Pos logit contributions
ap+0.369
ucks+0.346
apping+0.338
 upon+0.337
aps+0.315
Neg logit contributions
col-0.604
 gn-0.581
 disl-0.548
 dis-0.497
 rooft-0.494
Token it
Token ID340
Feature activation+2.49e-02

Pos logit contributions
col+0.672
 gn+0.656
 disl+0.621
riger+0.565
 dis+0.552
Neg logit contributions
ap-0.415
 upon-0.398
ucks-0.390
apping-0.375
angles-0.364
Token and
Token ID290
Feature activation-6.15e-01

Pos logit contributions
ap+0.028
ucks+0.026
 upon+0.026
apping+0.025
angles+0.024
Neg logit contributions
col-0.047
 gn-0.045
 disl-0.043
riger-0.039
 dis-0.039
Token shouted
Token ID17293
Feature activation-1.53e-01

Pos logit contributions
col+0.999
 gn+0.960
 disl+0.907
growth+0.815
riger+0.810
Neg logit contributions
ap-0.831
 upon-0.810
ucks-0.778
apping-0.768
angles-0.736
Token "
Token ID366
Feature activation-3.49e-01

Pos logit contributions
col+0.254
 gn+0.245
 disl+0.231
riger+0.206
growth+0.205
Neg logit contributions
ap-0.205
 upon-0.193
ucks-0.192
apping-0.190
aps-0.179
TokenWhere
Token ID8496
Feature activation+3.38e-01

Pos logit contributions
col+0.540
 gn+0.527
 disl+0.497
 dis+0.436
riger+0.433
Neg logit contributions
ap-0.494
 upon-0.475
ucks-0.471
apping-0.459
angles-0.438
Token does
Token ID857
Feature activation-6.59e-01

Pos logit contributions
ap+0.385
ucks+0.371
 upon+0.360
apping+0.356
angles+0.332
Neg logit contributions
col-0.619
 gn-0.602
 disl-0.569
 dis-0.513
riger-0.512
Token cow
Token ID9875
Feature activation-5.08e-01

Pos logit contributions
col+0.306
 gn+0.294
 disl+0.278
growth+0.248
riger+0.247
Neg logit contributions
ap-0.239
 upon-0.225
ucks-0.224
apping-0.220
aps-0.208
Token.
Token ID13
Feature activation+9.86e-02

Pos logit contributions
col+0.920
 gn+0.881
 disl+0.852
growth+0.763
 dis+0.761
Neg logit contributions
ap-0.610
 upon-0.560
ucks-0.557
apping-0.554
aps-0.518
Token First
Token ID3274
Feature activation+4.09e-01

Pos logit contributions
ap+0.121
 upon+0.115
ucks+0.114
apping+0.112
angles+0.106
Neg logit contributions
col-0.173
 gn-0.168
 disl-0.158
riger-0.142
growth-0.141
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
ap+0.631
ucks+0.613
 upon+0.607
apping+0.604
aps+0.572
Neg logit contributions
col-0.578
 gn-0.562
 disl-0.516
 dis-0.455
riger-0.454
Token we
Token ID356
Feature activation-3.62e-01

Pos logit contributions
col+0.207
 gn+0.201
 disl+0.191
riger+0.172
 dis+0.172
Neg logit contributions
ap-0.125
 upon-0.118
ucks-0.117
apping-0.114
angles-0.107
Token need
Token ID761
Feature activation+6.69e-01

Pos logit contributions
col+0.649
 gn+0.629
 disl+0.599
growth+0.533
 dis+0.531
Neg logit contributions
ap-0.438
 upon-0.409
ucks-0.400
apping-0.390
angles-0.375
Token to
Token ID284
Feature activation-1.55e-01

Pos logit contributions
ap+0.872
apping+0.829
ucks+0.815
 upon+0.811
angles+0.771
Neg logit contributions
col-1.149
 gn-1.099
 disl-1.025
growth-0.937
 dis-0.934
Token make
Token ID787
Feature activation-4.26e-01

Pos logit contributions
col+0.283
 gn+0.273
 disl+0.259
riger+0.233
 dis+0.232
Neg logit contributions
ap-0.181
 upon-0.169
ucks-0.168
apping-0.164
angles-0.154
Token you
Token ID345
Feature activation-1.73e-01

Pos logit contributions
col+0.774
 gn+0.750
 disl+0.709
riger+0.639
 dis+0.636
Neg logit contributions
ap-0.503
ucks-0.472
 upon-0.465
apping-0.450
angles-0.431
Token big
Token ID1263
Feature activation-3.36e-01

Pos logit contributions
col+0.296
 gn+0.286
 disl+0.269
riger+0.241
 dis+0.241
Neg logit contributions
ap-0.222
 upon-0.209
ucks-0.207
apping-0.203
aps-0.192
Token.
Token ID13
Feature activation+3.32e-02

Pos logit contributions
col+0.604
 gn+0.581
 disl+0.551
growth+0.497
riger+0.497
Neg logit contributions
ap-0.407
ucks-0.380
 upon-0.379
apping-0.367
aps-0.349
Token person
Token ID1048
Feature activation-9.22e-02

Pos logit contributions
ap+0.760
ucks+0.729
 upon+0.718
apping+0.700
angles+0.682
Neg logit contributions
col-1.051
 gn-1.032
 disl-0.957
 dis-0.855
 rooft-0.847
Token that
Token ID326
Feature activation+1.40e-02

Pos logit contributions
col+0.161
 gn+0.156
 disl+0.148
riger+0.132
 dis+0.132
Neg logit contributions
ap-0.114
 upon-0.107
ucks-0.106
apping-0.104
angles-0.098
Token had
Token ID550
Feature activation+8.41e-01

Pos logit contributions
ap+0.019
ucks+0.018
 upon+0.018
apping+0.018
angles+0.017
Neg logit contributions
col-0.023
 gn-0.022
 disl-0.020
riger-0.018
 dis-0.018
Token tried
Token ID3088
Feature activation+6.99e-01

Pos logit contributions
ap+1.166
ucks+1.146
apping+1.092
 upon+1.085
angles+1.055
Neg logit contributions
col-1.370
 gn-1.317
 disl-1.244
 dis-1.134
growth-1.083
Token to
Token ID284
Feature activation+6.49e-01

Pos logit contributions
ap+0.869
ucks+0.850
 upon+0.809
apping+0.808
angles+0.773
Neg logit contributions
col-1.223
 gn-1.181
 disl-1.122
 dis-1.008
growth-0.980
Token encourage
Token ID7898
Feature activation+2.43e-01

Pos logit contributions
ap+0.816
ucks+0.801
 upon+0.786
apping+0.779
angles+0.737
Neg logit contributions
col-1.120
 gn-1.092
 disl-1.027
 dis-0.924
growth-0.905
Token the
Token ID262
Feature activation-4.51e-01

Pos logit contributions
ap+0.235
ucks+0.221
 upon+0.219
apping+0.214
aps+0.197
Neg logit contributions
col-0.491
 gn-0.476
 disl-0.453
 dis-0.414
growth-0.413
Token boy
Token ID2933
Feature activation-2.52e-01

Pos logit contributions
col+0.811
 gn+0.776
 disl+0.741
riger+0.670
growth+0.668
Neg logit contributions
ap-0.542
 upon-0.514
ucks-0.499
apping-0.494
aps-0.465
Token to
Token ID284
Feature activation+2.84e-02

Pos logit contributions
col+0.467
 gn+0.453
 disl+0.430
riger+0.389
growth+0.387
Neg logit contributions
ap-0.287
 upon-0.268
ucks-0.261
apping-0.260
aps-0.243
Token change
Token ID1487
Feature activation+1.52e-01

Pos logit contributions
ap+0.031
ucks+0.029
 upon+0.029
apping+0.029
angles+0.027
Neg logit contributions
col-0.054
 gn-0.052
 disl-0.049
 dis-0.044
riger-0.044
Token his
Token ID465
Feature activation-2.73e-01

Pos logit contributions
ap+0.171
ucks+0.161
 upon+0.160
apping+0.156
angles+0.147
Neg logit contributions
col-0.281
 gn-0.272
 disl-0.259
 dis-0.233
riger-0.231
Token excited
Token ID6568
Feature activation+6.29e-01

Pos logit contributions
col+1.457
 gn+1.398
 disl+1.315
growth+1.234
riger+1.200
Neg logit contributions
ap-1.195
 upon-1.138
apping-1.126
ucks-1.068
aps-1.051
Token with
Token ID351
Feature activation-1.26e+00

Pos logit contributions
ap+0.921
ucks+0.895
 upon+0.852
apping+0.850
angles+0.827
Neg logit contributions
col-0.969
 gn-0.931
 disl-0.869
 dis-0.764
riger-0.752
Token his
Token ID465
Feature activation-1.04e+00

Pos logit contributions
col+2.715
 gn+2.601
 disl+2.498
riger+2.369
 CRE+2.303
Neg logit contributions
ap-1.104
 upon-1.055
ucks-0.957
apping-0.936
angles-0.910
Token new
Token ID649
Feature activation-1.01e-01

Pos logit contributions
col+2.090
 gn+1.993
 disl+1.883
riger+1.776
 rooft+1.747
Neg logit contributions
ap-1.054
 upon-1.011
ucks-0.916
apping-0.912
aps-0.878
Token toy
Token ID13373
Feature activation+7.23e-01

Pos logit contributions
col+0.153
 gn+0.147
 disl+0.137
riger+0.121
 dis+0.120
Neg logit contributions
ap-0.149
 upon-0.142
ucks-0.141
apping-0.138
aps-0.132
Token!
Token ID0
Feature activation+1.23e-01

Pos logit contributions
ap+0.991
ucks+0.970
 upon+0.939
apping+0.928
angles+0.889
Neg logit contributions
col-1.167
 gn-1.132
 disl-1.033
 dis-0.940
growth-0.922
Token He
Token ID679
Feature activation+3.06e-01

Pos logit contributions
ap+0.138
ucks+0.129
 upon+0.128
apping+0.125
angles+0.118
Neg logit contributions
col-0.230
 gn-0.223
 disl-0.211
 dis-0.191
growth-0.190
Token ran
Token ID4966
Feature activation-3.69e-01

Pos logit contributions
ap+0.461
ucks+0.442
 upon+0.435
apping+0.432
aps+0.413
Neg logit contributions
col-0.460
 gn-0.444
 disl-0.408
 dis-0.363
riger-0.357
Token around
Token ID1088
Feature activation+2.70e-01

Pos logit contributions
col+0.625
 gn+0.606
 disl+0.580
growth+0.511
riger+0.510
Neg logit contributions
ap-0.477
 upon-0.458
ucks-0.447
apping-0.441
aps-0.416
Token the
Token ID262
Feature activation+6.96e-01

Pos logit contributions
ap+0.337
ucks+0.323
 upon+0.315
apping+0.311
angles+0.297
Neg logit contributions
col-0.469
 gn-0.458
 disl-0.427
 dis-0.387
riger-0.384
Token house
Token ID2156
Feature activation-2.73e-01

Pos logit contributions
ap+1.052
ucks+1.011
apping+0.993
 upon+0.985
angles+0.959
Neg logit contributions
col-1.039
 gn-1.016
 disl-0.922
 dis-0.823
riger-0.819
Token,
Token ID11
Feature activation+5.83e-01

Pos logit contributions
ap+0.146
ucks+0.137
 upon+0.136
apping+0.134
angles+0.127
Neg logit contributions
col-0.228
 gn-0.221
 disl-0.209
 dis-0.188
riger-0.187
Token but
Token ID475
Feature activation-2.92e-01

Pos logit contributions
ap+0.752
ucks+0.731
apping+0.699
 upon+0.693
angles+0.675
Neg logit contributions
col-0.984
 gn-0.963
 disl-0.898
 dis-0.807
 rooft-0.791
Token eventually
Token ID4191
Feature activation-4.76e-01

Pos logit contributions
col+0.494
 gn+0.475
 disl+0.449
riger+0.402
growth+0.401
Neg logit contributions
ap-0.380
 upon-0.362
ucks-0.352
apping-0.348
angles-0.330
Token the
Token ID262
Feature activation-5.22e-01

Pos logit contributions
col+0.787
 gn+0.749
 disl+0.712
riger+0.634
growth+0.632
Neg logit contributions
ap-0.641
 upon-0.614
ucks-0.585
apping-0.584
angles-0.554
Token old
Token ID1468
Feature activation+2.57e-01

Pos logit contributions
col+0.868
 gn+0.829
 disl+0.795
riger+0.712
growth+0.704
Neg logit contributions
ap-0.688
 upon-0.658
ucks-0.633
apping-0.630
aps-0.595
Token lady
Token ID10846
Feature activation+2.00e-01

Pos logit contributions
ap+0.376
ucks+0.360
 upon+0.356
apping+0.351
angles+0.337
Neg logit contributions
col-0.391
 gn-0.380
 disl-0.353
 dis-0.309
riger-0.306
Token succeeded
Token ID14131
Feature activation+4.02e-01

Pos logit contributions
ap+0.275
ucks+0.263
 upon+0.260
apping+0.256
angles+0.244
Neg logit contributions
col-0.321
 gn-0.310
 disl-0.289
 dis-0.256
riger-0.255
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
ap+0.495
ucks+0.476
 upon+0.460
apping+0.454
angles+0.437
Neg logit contributions
col-0.707
 gn-0.681
 disl-0.638
 dis-0.578
riger-0.572
Token She
Token ID1375
Feature activation-8.95e-02

Pos logit contributions
ap+0.342
ucks+0.324
 upon+0.321
apping+0.316
angles+0.304
Neg logit contributions
col-0.412
 gn-0.398
 disl-0.374
 dis-0.330
growth-0.330
Token kept
Token ID4030
Feature activation-1.04e-01

Pos logit contributions
col+0.147
 gn+0.142
 disl+0.134
riger+0.119
 dis+0.119
Neg logit contributions
ap-0.120
 upon-0.113
ucks-0.113
apping-0.111
angles-0.105
Token it
Token ID340
Feature activation+5.35e-01

Pos logit contributions
col+0.181
 gn+0.175
 disl+0.165
riger+0.148
growth+0.147
Neg logit contributions
ap-0.130
 upon-0.123
ucks-0.122
apping-0.120
aps-0.113
Token wanted
Token ID2227
Feature activation+3.82e-01

Pos logit contributions
ap+0.691
ucks+0.667
apping+0.650
 upon+0.643
aps+0.623
Neg logit contributions
col-0.707
 gn-0.691
 disl-0.636
 dis-0.574
riger-0.559
Token to
Token ID284
Feature activation+3.30e-01

Pos logit contributions
ap+0.484
ucks+0.452
apping+0.449
 upon+0.447
aps+0.418
Neg logit contributions
col-0.677
 gn-0.654
 disl-0.615
 dis-0.558
growth-0.546
Token help
Token ID1037
Feature activation+1.56e+00

Pos logit contributions
ap+0.464
 upon+0.444
ucks+0.442
apping+0.440
angles+0.416
Neg logit contributions
col-0.526
 gn-0.509
 disl-0.479
 dis-0.422
riger-0.415
Token Tim
Token ID5045
Feature activation-1.24e-01

Pos logit contributions
ucks+2.147
ap+2.145
 upon+2.125
apping+2.125
itting+2.030
Neg logit contributions
col-2.519
 gn-2.466
 disl-2.239
 dis-2.124
riger-1.904
Token fix
Token ID4259
Feature activation+9.91e-02

Pos logit contributions
col+0.231
 gn+0.224
 disl+0.212
growth+0.192
riger+0.192
Neg logit contributions
ap-0.141
 upon-0.132
ucks-0.131
apping-0.128
aps-0.120
Token his
Token ID465
Feature activation-2.27e-01

Pos logit contributions
ap+0.120
 upon+0.113
ucks+0.113
apping+0.111
aps+0.104
Neg logit contributions
col-0.177
 gn-0.171
 disl-0.162
 dis-0.145
riger-0.144
Token toy
Token ID13373
Feature activation+2.19e-01

Pos logit contributions
col+0.433
 gn+0.417
 disl+0.396
riger+0.360
growth+0.359
Neg logit contributions
ap-0.253
 upon-0.236
ucks-0.234
apping-0.228
aps-0.215
Token,
Token ID11
Feature activation+3.81e-01

Pos logit contributions
ap+0.246
ucks+0.233
 upon+0.232
apping+0.227
angles+0.212
Neg logit contributions
col-0.405
 gn-0.394
 disl-0.370
 dis-0.336
riger-0.335
Token but
Token ID475
Feature activation+7.56e-01

Pos logit contributions
ap+0.567
ucks+0.543
apping+0.528
 upon+0.526
aps+0.505
Neg logit contributions
col-0.582
 gn-0.560
 disl-0.522
 dis-0.468
 rooft-0.453
Token she
Token ID673
Feature activation+1.11e+00

Pos logit contributions
ap+1.100
ucks+1.060
apping+1.030
aps+0.993
 upon+0.977
Neg logit contributions
col-1.182
 gn-1.161
 disl-1.082
 dis-0.983
riger-0.926
Token didn
Token ID1422
Feature activation-3.82e-01

Pos logit contributions
ap+1.800
ucks+1.762
apping+1.695
 upon+1.658
aps+1.646
Neg logit contributions
col-1.521
 gn-1.520
 disl-1.307
 dis-1.268
yah-1.170
Token.
Token ID13
Feature activation-4.99e-01

Pos logit contributions
ap+0.228
ucks+0.218
 upon+0.217
apping+0.211
angles+0.200
Neg logit contributions
col-0.317
 gn-0.307
 disl-0.287
 dis-0.257
riger-0.257
Token They
Token ID1119
Feature activation-3.99e-01

Pos logit contributions
col+1.076
 gn+1.048
 disl+0.996
riger+0.924
 dis+0.915
Neg logit contributions
ap-0.434
 upon-0.399
ucks-0.397
apping-0.386
aps-0.362
Token had
Token ID550
Feature activation-2.19e-01

Pos logit contributions
col+0.657
 gn+0.630
 disl+0.604
riger+0.533
growth+0.532
Neg logit contributions
ap-0.535
 upon-0.516
ucks-0.498
apping-0.492
angles-0.472
Token fun
Token ID1257
Feature activation+6.16e-01

Pos logit contributions
col+0.375
 gn+0.362
 disl+0.341
riger+0.306
growth+0.304
Neg logit contributions
ap-0.283
 upon-0.268
ucks-0.265
apping-0.260
aps-0.247
Token.
Token ID13
Feature activation-6.13e-01

Pos logit contributions
ap+0.803
ucks+0.786
 upon+0.777
angles+0.746
apping+0.740
Neg logit contributions
col-1.014
 gn-0.986
 disl-0.927
 dis-0.832
 rooft-0.818
Token They
Token ID1119
Feature activation-2.21e-01

Pos logit contributions
col+1.281
 gn+1.249
 disl+1.178
riger+1.091
 dis+1.077
Neg logit contributions
ap-0.581
 upon-0.539
ucks-0.534
apping-0.519
aps-0.493
Token were
Token ID547
Feature activation+9.01e-01

Pos logit contributions
col+0.363
 gn+0.350
 disl+0.331
riger+0.294
growth+0.293
Neg logit contributions
ap-0.296
 upon-0.282
ucks-0.277
apping-0.273
angles-0.260
Token happy
Token ID3772
Feature activation+1.09e+00

Pos logit contributions
ap+1.222
ucks+1.215
 upon+1.189
apping+1.136
angles+1.132
Neg logit contributions
col-1.416
 gn-1.402
 disl-1.302
 dis-1.189
riger-1.127
Token.
Token ID13
Feature activation-4.67e-01

Pos logit contributions
ap+1.552
angles+1.492
ucks+1.487
 upon+1.459
apping+1.414
Neg logit contributions
col-1.679
 gn-1.658
 disl-1.536
 dis-1.375
 rooft-1.330
Token<|endoftext|>
Token ID50256
Feature activation+1.23e-01

Pos logit contributions
col+0.865
 gn+0.840
 disl+0.787
riger+0.720
 dis+0.710
Neg logit contributions
ap-0.547
 upon-0.513
ucks-0.511
apping-0.500
aps-0.477
TokenAnna
Token ID31160
Feature activation+7.76e-02

Pos logit contributions
ap+0.173
ucks+0.163
 upon+0.163
apping+0.160
angles+0.154
Neg logit contributions
col-0.194
 gn-0.187
 disl-0.174
riger-0.155
growth-0.154
Token came
Token ID1625
Feature activation+7.46e-01

Pos logit contributions
ap+0.279
ucks+0.267
 upon+0.264
apping+0.259
aps+0.248
Neg logit contributions
col-0.316
 gn-0.305
 disl-0.282
 dis-0.253
riger-0.251
Token up
Token ID510
Feature activation+2.94e-01

Pos logit contributions
ap+1.102
ucks+1.054
angles+1.031
 upon+1.017
apping+1.014
Neg logit contributions
col-1.138
 gn-1.083
 disl-0.957
 dis-0.901
 rooft-0.884
Token to
Token ID284
Feature activation+1.69e-01

Pos logit contributions
ap+0.322
ucks+0.305
 upon+0.297
apping+0.290
angles+0.284
Neg logit contributions
col-0.556
 gn-0.542
 disl-0.505
 dis-0.463
riger-0.458
Token him
Token ID683
Feature activation-4.79e-01

Pos logit contributions
ap+0.181
ucks+0.168
 upon+0.167
apping+0.165
angles+0.154
Neg logit contributions
col-0.325
 gn-0.317
 disl-0.300
 dis-0.273
riger-0.272
Token and
Token ID290
Feature activation+1.67e-02

Pos logit contributions
col+0.954
 gn+0.922
 disl+0.889
growth+0.806
riger+0.805
Neg logit contributions
ap-0.486
 upon-0.452
apping-0.439
ucks-0.435
aps-0.404
Token said
Token ID531
Feature activation-1.02e+00

Pos logit contributions
ap+0.024
 upon+0.023
ucks+0.023
apping+0.023
angles+0.022
Neg logit contributions
col-0.026
 gn-0.025
 disl-0.023
riger-0.020
 dis-0.020
Token,
Token ID11
Feature activation-7.76e-01

Pos logit contributions
col+1.905
 gn+1.803
 disl+1.735
growth+1.634
riger+1.594
Neg logit contributions
ap-1.193
 upon-1.134
apping-1.099
ucks-1.038
aps-1.017
Token "
Token ID366
Feature activation+1.36e-01

Pos logit contributions
col+1.559
 gn+1.486
 disl+1.444
growth+1.336
riger+1.300
Neg logit contributions
ap-0.767
 upon-0.742
apping-0.705
ucks-0.678
aps-0.633
TokenDo
Token ID5211
Feature activation+5.04e-01

Pos logit contributions
ap+0.189
ucks+0.177
 upon+0.177
apping+0.174
angles+0.166
Neg logit contributions
col-0.222
 gn-0.213
 disl-0.200
riger-0.177
 dis-0.177
Token you
Token ID345
Feature activation+7.12e-02

Pos logit contributions
ap+0.440
ucks+0.432
apping+0.401
 upon+0.400
aps+0.369
Neg logit contributions
col-1.053
 gn-1.030
 disl-0.998
riger-0.904
 dis-0.903
Token want
Token ID765
Feature activation+4.59e-01

Pos logit contributions
ap+0.087
ucks+0.083
 upon+0.082
apping+0.080
angles+0.076
Neg logit contributions
col-0.126
 gn-0.122
 disl-0.115
 dis-0.103
riger-0.103
Token you
Token ID345
Feature activation+9.22e-01

Pos logit contributions
ap+0.675
apping+0.621
ucks+0.618
 upon+0.610
ooks+0.569
Neg logit contributions
col-1.493
 gn-1.418
 disl-1.362
 dis-1.245
growth-1.230
Token?"
Token ID1701
Feature activation+2.96e-02

Pos logit contributions
ap+1.313
ucks+1.300
apping+1.253
ooks+1.240
 upon+1.216
Neg logit contributions
col-1.378
 gn-1.359
 disl-1.229
 dis-1.112
riger-1.103
Token Tim
Token ID5045
Feature activation-5.48e-01

Pos logit contributions
ap+0.038
 upon+0.035
ucks+0.035
apping+0.035
angles+0.033
Neg logit contributions
col-0.051
 gn-0.049
 disl-0.046
 dis-0.041
riger-0.041
Token asked
Token ID1965
Feature activation+6.45e-01

Pos logit contributions
col+0.977
 gn+0.935
 disl+0.909
riger+0.814
growth+0.811
Neg logit contributions
ap-0.664
 upon-0.642
ucks-0.621
apping-0.607
aps-0.573
Token.
Token ID13
Feature activation+4.98e-02

Pos logit contributions
ap+0.838
ucks+0.816
 upon+0.790
angles+0.774
apping+0.774
Neg logit contributions
col-1.073
 gn-1.049
 disl-0.950
 dis-0.885
growth-0.856
Token The
Token ID383
Feature activation-9.81e-01

Pos logit contributions
ap+0.069
 upon+0.065
ucks+0.065
apping+0.064
angles+0.061
Neg logit contributions
col-0.081
 gn-0.078
 disl-0.073
 dis-0.065
riger-0.065
Token yacht
Token ID43207
Feature activation-6.30e-03

Pos logit contributions
col+1.409
 gn+1.376
 disl+1.265
riger+1.143
growth+1.140
Neg logit contributions
ap-1.552
 upon-1.483
ucks-1.439
apping-1.438
angles-1.368
Token did
Token ID750
Feature activation+1.06e+00

Pos logit contributions
col+0.010
 gn+0.010
 disl+0.009
riger+0.008
 dis+0.008
Neg logit contributions
ap-0.009
 upon-0.008
ucks-0.008
apping-0.008
angles-0.008
Token not
Token ID407
Feature activation+9.79e-01

Pos logit contributions
ucks+1.470
ap+1.449
apping+1.395
 upon+1.369
angles+1.341
Neg logit contributions
 gn-1.711
col-1.683
 disl-1.606
 dis-1.409
 CRE-1.326
Token talk
Token ID1561
Feature activation+5.05e-01

Pos logit contributions
ap+1.380
ucks+1.371
apping+1.317
 upon+1.297
ooks+1.281
Neg logit contributions
col-1.527
 gn-1.514
 disl-1.414
 dis-1.252
riger-1.215
Token,
Token ID11
Feature activation-2.83e-02

Pos logit contributions
ap+0.626
ucks+0.606
 upon+0.582
apping+0.577
angles+0.553
Neg logit contributions
col-0.881
 gn-0.848
 disl-0.803
 dis-0.720
riger-0.705
Tokenvous
Token ID31222
Feature activation+9.86e-02

Pos logit contributions
 gn+1.244
col+1.239
 disl+1.132
riger+1.013
growth+0.990
Neg logit contributions
ap-1.629
 upon-1.575
apping-1.551
ucks-1.508
aps-1.434
Token rabbit
Token ID22746
Feature activation-5.56e-02

Pos logit contributions
ap+0.134
ucks+0.128
 upon+0.128
apping+0.124
angles+0.119
Neg logit contributions
col-0.160
 gn-0.155
 disl-0.145
 dis-0.129
riger-0.128
Token came
Token ID1625
Feature activation+5.45e-01

Pos logit contributions
col+0.095
 gn+0.092
 disl+0.087
riger+0.078
 dis+0.077
Neg logit contributions
ap-0.071
 upon-0.067
ucks-0.066
apping-0.065
angles-0.061
Token and
Token ID290
Feature activation-4.04e-01

Pos logit contributions
ap+0.699
ucks+0.669
 upon+0.642
apping+0.642
angles+0.642
Neg logit contributions
col-0.940
 gn-0.910
 disl-0.824
 dis-0.753
 rooft-0.744
Token started
Token ID2067
Feature activation-2.54e-01

Pos logit contributions
col+0.641
 gn+0.611
 disl+0.585
growth+0.517
riger+0.516
Neg logit contributions
ap-0.554
 upon-0.541
ucks-0.524
apping-0.512
angles-0.489
Token to
Token ID284
Feature activation-5.97e-01

Pos logit contributions
col+0.466
 gn+0.448
 disl+0.428
riger+0.388
growth+0.386
Neg logit contributions
ap-0.292
 upon-0.278
ucks-0.271
apping-0.267
aps-0.252
Token eat
Token ID4483
Feature activation-3.11e-02

Pos logit contributions
col+0.937
 gn+0.875
 disl+0.830
riger+0.749
growth+0.746
Neg logit contributions
ap-0.847
 upon-0.812
ucks-0.780
apping-0.762
aps-0.732
Token the
Token ID262
Feature activation-7.35e-02

Pos logit contributions
col+0.055
 gn+0.053
 disl+0.050
riger+0.045
 dis+0.045
Neg logit contributions
ap-0.038
 upon-0.036
ucks-0.036
apping-0.035
angles-0.033
Token flowers
Token ID12734
Feature activation-4.40e-01

Pos logit contributions
col+0.121
 gn+0.117
 disl+0.110
riger+0.098
 dis+0.097
Neg logit contributions
ap-0.099
 upon-0.094
ucks-0.093
apping-0.092
angles-0.087
Token!
Token ID0
Feature activation-1.50e-02

Pos logit contributions
col+0.799
 gn+0.776
 disl+0.744
riger+0.669
growth+0.667
Neg logit contributions
ap-0.513
 upon-0.485
apping-0.468
ucks-0.465
aps-0.437
Token The
Token ID383
Feature activation-3.24e-01

Pos logit contributions
col+0.025
 gn+0.025
 disl+0.023
riger+0.021
 dis+0.021
Neg logit contributions
ap-0.020
 upon-0.019
ucks-0.018
apping-0.018
angles-0.017
Token the
Token ID262
Feature activation+6.45e-02

Pos logit contributions
col+0.194
 gn+0.187
 disl+0.177
riger+0.158
 dis+0.157
Neg logit contributions
ap-0.143
 upon-0.136
ucks-0.135
apping-0.132
angles-0.125
Token dirt
Token ID13647
Feature activation-1.10e+00

Pos logit contributions
ap+0.101
ucks+0.096
 upon+0.096
apping+0.095
angles+0.090
Neg logit contributions
col-0.093
 gn-0.089
 disl-0.083
 dis-0.072
riger-0.072
Token and
Token ID290
Feature activation-4.39e-01

Pos logit contributions
col+2.213
 disl+2.169
 gn+2.148
riger+1.938
 dis+1.900
Neg logit contributions
ap-1.088
 upon-1.005
apping-0.953
ucks-0.940
angles-0.908
Token look
Token ID804
Feature activation+6.79e-01

Pos logit contributions
col+0.746
 gn+0.714
 disl+0.680
riger+0.610
growth+0.601
Neg logit contributions
ap-0.567
 upon-0.545
ucks-0.523
apping-0.510
angles-0.494
Token for
Token ID329
Feature activation+2.51e-01

Pos logit contributions
ap+0.408
ucks+0.376
apping+0.352
 upon+0.330
aps+0.305
Neg logit contributions
col-1.611
 gn-1.561
 disl-1.493
 dis-1.398
 rooft-1.389
Token bugs
Token ID11316
Feature activation-1.61e+00

Pos logit contributions
ap+0.364
ucks+0.346
 upon+0.342
apping+0.341
angles+0.322
Neg logit contributions
col-0.389
 gn-0.378
 disl-0.351
 dis-0.310
growth-0.308
Token.
Token ID13
Feature activation-2.64e-01

Pos logit contributions
col+3.214
 disl+3.129
 gn+3.100
growth+2.827
 dis+2.824
Neg logit contributions
ap-1.575
 upon-1.570
apping-1.442
 for-1.410
ucks-1.374
Token Sometimes
Token ID8975
Feature activation-7.44e-02

Pos logit contributions
col+0.467
 gn+0.452
 disl+0.427
riger+0.384
 dis+0.381
Neg logit contributions
ap-0.327
 upon-0.311
ucks-0.306
apping-0.299
aps-0.282
Token they
Token ID484
Feature activation-6.30e-01

Pos logit contributions
col+0.121
 gn+0.117
 disl+0.110
riger+0.098
 dis+0.097
Neg logit contributions
ap-0.102
 upon-0.096
ucks-0.096
apping-0.094
angles-0.089
Token found
Token ID1043
Feature activation+3.93e-02

Pos logit contributions
col+1.125
 gn+1.075
 disl+1.045
riger+0.932
 rooft+0.920
Neg logit contributions
ap-0.755
 upon-0.743
ucks-0.679
apping-0.675
angles-0.666
Token worms
Token ID29174
Feature activation-7.44e-01

Pos logit contributions
ap+0.049
ucks+0.046
 upon+0.046
apping+0.045
angles+0.042
Neg logit contributions
col-0.069
 gn-0.067
 disl-0.063
 dis-0.056
riger-0.056
Token decided
Token ID3066
Feature activation-3.97e-01

Pos logit contributions
ap+0.524
ucks+0.502
 upon+0.495
apping+0.491
aps+0.472
Neg logit contributions
col-0.489
 gn-0.475
 disl-0.433
 dis-0.385
riger-0.377
Token to
Token ID284
Feature activation+1.87e-01

Pos logit contributions
col+0.759
 gn+0.730
 disl+0.695
growth+0.640
riger+0.636
Neg logit contributions
ap-0.425
 upon-0.410
ucks-0.395
apping-0.388
aps-0.360
Token help
Token ID1037
Feature activation+1.23e+00

Pos logit contributions
ap+0.228
ucks+0.218
 upon+0.218
apping+0.213
angles+0.201
Neg logit contributions
col-0.328
 gn-0.318
 disl-0.299
 dis-0.268
riger-0.266
Token him
Token ID683
Feature activation-2.28e-01

Pos logit contributions
ap+1.598
ucks+1.560
 upon+1.554
apping+1.540
itting+1.471
Neg logit contributions
col-2.090
 gn-2.053
 disl-1.857
 dis-1.776
growth-1.674
Token.
Token ID13
Feature activation-9.00e-02

Pos logit contributions
col+0.411
 gn+0.397
 disl+0.377
riger+0.341
growth+0.338
Neg logit contributions
ap-0.273
 upon-0.255
ucks-0.252
apping-0.249
aps-0.235
Token The
Token ID383
Feature activation-1.56e+00

Pos logit contributions
col+0.164
 gn+0.159
 disl+0.150
riger+0.135
 dis+0.135
Neg logit contributions
ap-0.106
 upon-0.100
ucks-0.100
apping-0.097
aps-0.091
Token giant
Token ID6175
Feature activation-3.88e-01

Pos logit contributions
col+3.469
 gn+3.268
 disl+3.230
riger+3.066
 CRE+2.978
Neg logit contributions
 upon-1.364
ap-1.305
apping-1.133
ucks-1.118
 friend-1.047
Token used
Token ID973
Feature activation+1.96e-01

Pos logit contributions
col+0.635
 gn+0.608
 disl+0.582
riger+0.518
growth+0.513
Neg logit contributions
ap-0.524
 upon-0.498
ucks-0.487
apping-0.484
angles-0.456
Token his
Token ID465
Feature activation-2.51e-01

Pos logit contributions
ap+0.223
ucks+0.209
 upon+0.208
apping+0.204
angles+0.191
Neg logit contributions
col-0.365
 gn-0.354
 disl-0.334
 dis-0.303
growth-0.301
Token strong
Token ID1913
Feature activation+1.08e+00

Pos logit contributions
col+0.430
 gn+0.414
 disl+0.389
riger+0.349
growth+0.346
Neg logit contributions
ap-0.326
 upon-0.309
ucks-0.302
apping-0.299
aps-0.284
Token legs
Token ID7405
Feature activation-4.09e-01

Pos logit contributions
ucks+1.388
ap+1.356
apping+1.327
 upon+1.322
angles+1.269
Neg logit contributions
 gn-1.787
col-1.783
 disl-1.676
 dis-1.503
growth-1.467
Token.
Token ID13
Feature activation-7.83e-02

Pos logit contributions
col+0.155
 gn+0.150
 disl+0.141
riger+0.127
 dis+0.126
Neg logit contributions
ap-0.111
 upon-0.104
ucks-0.104
apping-0.102
angles-0.096
Token
Token ID198
Feature activation+2.98e-01

Pos logit contributions
col+0.134
 gn+0.130
 disl+0.122
riger+0.109
 dis+0.109
Neg logit contributions
ap-0.101
 upon-0.096
ucks-0.095
apping-0.093
aps-0.088
Token
Token ID198
Feature activation+1.60e-01

Pos logit contributions
ap+0.436
ucks+0.409
 upon+0.406
apping+0.404
angles+0.387
Neg logit contributions
col-0.455
 gn-0.437
 disl-0.406
 dis-0.360
riger-0.356
Token"
Token ID1
Feature activation-4.05e-01

Pos logit contributions
ap+0.185
ucks+0.173
 upon+0.173
apping+0.169
angles+0.159
Neg logit contributions
col-0.297
 gn-0.287
 disl-0.272
riger-0.246
 dis-0.245
TokenLook
Token ID8567
Feature activation-1.17e-01

Pos logit contributions
col+0.653
 gn+0.635
 disl+0.603
riger+0.528
 dis+0.526
Neg logit contributions
ap-0.556
ucks-0.532
 upon-0.531
apping-0.518
aps-0.491
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
col+0.192
 gn+0.186
 disl+0.175
growth+0.155
riger+0.154
Neg logit contributions
ap-0.158
 upon-0.150
ucks-0.148
apping-0.146
angles-0.139
Token a
Token ID257
Feature activation-2.55e-01

Pos logit contributions
col+3.466
 gn+3.211
 disl+3.129
growth+2.924
riger+2.864
Neg logit contributions
 upon-1.555
ap-1.532
ucks-1.305
 for-1.256
apping-1.253
Token story
Token ID1621
Feature activation-5.34e-02

Pos logit contributions
col+0.450
 gn+0.432
 disl+0.413
 dis+0.369
growth+0.369
Neg logit contributions
ap-0.311
 upon-0.294
ucks-0.289
apping-0.284
aps-0.269
Token!"
Token ID2474
Feature activation+8.82e-02

Pos logit contributions
col+0.095
 gn+0.092
 disl+0.087
riger+0.078
 dis+0.078
Neg logit contributions
ap-0.065
 upon-0.061
ucks-0.061
apping-0.059
angles-0.056
Token Lily
Token ID20037
Feature activation-1.54e-01

Pos logit contributions
ap+0.102
ucks+0.095
 upon+0.095
apping+0.093
angles+0.088
Neg logit contributions
col-0.163
 gn-0.158
 disl-0.150
 dis-0.135
riger-0.134
Token says
Token ID1139
Feature activation+4.19e-01

Pos logit contributions
col+0.270
 gn+0.261
 disl+0.247
riger+0.221
growth+0.220
Neg logit contributions
ap-0.190
 upon-0.180
ucks-0.179
apping-0.175
aps-0.165
Token should
Token ID815
Feature activation+3.04e-01

Pos logit contributions
ap+0.285
ucks+0.271
 upon+0.267
apping+0.266
aps+0.251
Neg logit contributions
col-0.341
 gn-0.332
 disl-0.308
 dis-0.275
riger-0.274
Token go
Token ID467
Feature activation+3.98e-01

Pos logit contributions
ap+0.383
ucks+0.369
 upon+0.360
apping+0.358
aps+0.335
Neg logit contributions
col-0.525
 gn-0.512
 disl-0.482
 dis-0.431
growth-0.428
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
ap+0.501
ucks+0.479
 upon+0.466
apping+0.464
angles+0.442
Neg logit contributions
col-0.695
 gn-0.676
 disl-0.624
 dis-0.562
riger-0.560
Token
Token ID198
Feature activation-6.35e-01

Pos logit contributions
col+0.035
 gn+0.034
 disl+0.032
riger+0.029
 dis+0.029
Neg logit contributions
ap-0.027
 upon-0.025
ucks-0.025
apping-0.024
angles-0.023
Token
Token ID198
Feature activation+8.87e-02

Pos logit contributions
col+1.079
 gn+1.044
 disl+0.993
riger+0.889
growth+0.882
Neg logit contributions
ap-0.810
 upon-0.791
ucks-0.783
apping-0.760
aps-0.731
TokenThe
Token ID464
Feature activation-1.69e+00

Pos logit contributions
ap+0.116
 upon+0.110
ucks+0.109
apping+0.107
angles+0.102
Neg logit contributions
col-0.150
 gn-0.145
 disl-0.137
riger-0.122
 dis-0.122
Token elephant
Token ID20950
Feature activation+1.26e-01

Pos logit contributions
col+3.064
 gn+2.803
 disl+2.779
growth+2.572
riger+2.557
Neg logit contributions
ap-2.031
 upon-1.961
apping-1.795
ucks-1.777
 for-1.742
Token thanked
Token ID26280
Feature activation+4.76e-01

Pos logit contributions
ap+0.175
ucks+0.167
 upon+0.166
apping+0.163
angles+0.155
Neg logit contributions
col-0.200
 gn-0.194
 disl-0.180
 dis-0.160
riger-0.159
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
ap+0.645
 upon+0.612
ucks+0.610
apping+0.605
aps+0.571
Neg logit contributions
col-0.775
 gn-0.760
 disl-0.705
 dis-0.636
growth-0.630
Token scientist
Token ID11444
Feature activation+1.47e-01

Pos logit contributions
col+2.291
 gn+2.114
 disl+2.079
growth+1.905
riger+1.901
Neg logit contributions
ap-1.792
 upon-1.738
ucks-1.624
apping-1.618
aps-1.570
Token and
Token ID290
Feature activation+2.72e-02

Pos logit contributions
ap+0.152
ucks+0.143
 upon+0.141
apping+0.137
angles+0.129
Neg logit contributions
col-0.287
 gn-0.279
 disl-0.263
 dis-0.239
riger-0.239
Token went
Token ID1816
Feature activation-1.52e-01

Pos logit contributions
col+0.655
 gn+0.632
 disl+0.603
riger+0.535
growth+0.531
Neg logit contributions
ap-0.527
 upon-0.506
ucks-0.488
apping-0.482
angles-0.463
Token back
Token ID736
Feature activation-6.98e-01

Pos logit contributions
col+0.267
 gn+0.258
 disl+0.246
growth+0.219
riger+0.219
Neg logit contributions
ap-0.189
 upon-0.179
ucks-0.177
apping-0.173
aps-0.164
Token inside
Token ID2641
Feature activation+8.71e-01

Pos logit contributions
col+1.328
 gn+1.299
 disl+1.255
riger+1.126
growth+1.117
Neg logit contributions
ap-0.764
 upon-0.741
ucks-0.709
apping-0.684
aps-0.654
Token and
Token ID290
Feature activation-9.45e-02

Pos logit contributions
ap+1.119
ucks+1.104
angles+1.043
apping+1.026
 upon+1.007
Neg logit contributions
col-1.468
 gn-1.425
 disl-1.319
 dis-1.234
 rooft-1.167
Token told
Token ID1297
Feature activation-1.58e-01

Pos logit contributions
col+0.154
 gn+0.149
 disl+0.140
riger+0.124
growth+0.124
Neg logit contributions
ap-0.129
 upon-0.122
ucks-0.121
apping-0.119
angles-0.113
Token their
Token ID511
Feature activation-1.60e+00

Pos logit contributions
col+0.311
 gn+0.303
 disl+0.287
riger+0.262
growth+0.260
Neg logit contributions
ap-0.162
 upon-0.149
ucks-0.149
apping-0.145
angles-0.135
Token parents
Token ID3397
Feature activation+3.44e-01

Pos logit contributions
col+3.301
 gn+3.125
 disl+2.967
riger+2.882
growth+2.869
Neg logit contributions
ap-1.569
 upon-1.487
apping-1.362
ucks-1.346
aps-1.322
Token what
Token ID644
Feature activation+2.48e-02

Pos logit contributions
ap+0.356
ucks+0.338
 upon+0.322
apping+0.322
aps+0.302
Neg logit contributions
col-0.668
 gn-0.647
 disl-0.616
 dis-0.561
riger-0.553
Token they
Token ID484
Feature activation+5.71e-01

Pos logit contributions
ap+0.035
ucks+0.033
 upon+0.033
apping+0.032
angles+0.031
Neg logit contributions
col-0.040
 gn-0.038
 disl-0.036
 dis-0.032
riger-0.032
Token observed
Token ID6515
Feature activation-8.97e-02

Pos logit contributions
ap+0.860
ucks+0.825
apping+0.808
 upon+0.782
aps+0.775
Neg logit contributions
col-0.862
 gn-0.828
 disl-0.779
 dis-0.699
riger-0.683
Token in
Token ID287
Feature activation-5.90e-01

Pos logit contributions
col+0.160
 gn+0.155
 disl+0.146
riger+0.131
 dis+0.131
Neg logit contributions
ap-0.108
 upon-0.102
ucks-0.101
apping-0.099
angles-0.093