TOP ACTIVATIONS
MAX = 34.419

day and the trees were giving a pleasant shade . Mom
that was very nice and giving . Later that
and they said goodbye by giving each other a wink and
, pushing her away and giving Lily all the sunshine .
around Fre ck les , giving her hugs and pet ting
king 's tum my by giving him too many ch ocol
me , thank you for giving me such good advice !
hungry . Would you mind giving me some of your ol
that he was serious about giving the toy to Jim .
way of thanking yoga for giving them a fun time .
Lisa !" Dad said , giving her the doll and the
these ." He said , giving them two life jackets .
store with the perfume , giving it a little hug .
proud of himself for not giving up . Tiny knew that
Bob was so nice for giving them the signs .
The strong tall tree had given them the space they needed
brave , and strong without giving up . B inky realized
the right thing instead of giving in to his greed .
, even if it means giving up something that you have
person thanked the sun for giving the person light . The

MIN ACTIVATIONS
MIN = -30.817

of the pr une . Betsy liked this idea and agreed
and suggested that she and Betsy each draw a picture of
little girl named Betsy . Betsy was so excited , because
was a little girl named Betsy . Betsy was so excited
. When Betsy met up with her friends
Betsy to the beach . Betsy was so excited she could
Mom my and Daddy took Betsy to the beach . Betsy
light out she would take Betsy to the beach .
of y ummy veggies . Betsy helped her mom spread the
From that day on , Betsy always made sure to remember
Together , Betsy and Lily enjoyed eating the
on the other side . Betsy was grateful to Lily for
, " Thank you , Betsy . You are such a
her the way out and Betsy was so happy to be
One day , Betsy was feeling a bit hungry
beef on the table . Betsy was so happy ! She
iflower ?" asked Lily . Betsy nodded her head again and
From that day on , Betsy and the thief became good
From that day on , Betsy and the fox played together
with her because they trusted Betsy . One day

INTERVAL 18.110 - 34.419
CONTAINS 4.074%

felt sad and wanted to help the bird .
very proud of Molly for using her imagination . She encouraged
on , Peter ! We dare you to do it !"
told her mom my she wanted a cone of strawberry ice
to find another way to help the fish , â €

INTERVAL 1.801 - 18.110
CONTAINS 43.537%

!" She ran to him and tried to grab her doll
They were so happy and full of love .
the toy car go with speed . They threw the ball
. They loved each other very much . <|endoftext|> Tom was
and kindness in a very special way . <|endoftext|> Once upon

INTERVAL -14.508 - 1.801
CONTAINS 40.027%

and play with him . They played games and had a
mom smiled and gave him a big hug . <|endoftext|> One
he realized it was too tall . He could not jump
nut !" Nut ty 's friends came to help and they
her friends on Sunday . Lu cy went to

INTERVAL -30.817 - -14.508
CONTAINS 12.362%

to clean it up . K iki begged her mom to
When she got home she unloaded everything from the basket
les . The bride and groom saw the
It hurt a lot and she started to cry .
playing . " L ily , please come out
Token day
Token ID1110
Feature activation+7.66e+00

Pos logit contributions
 teaching+0.213
 showing+0.208
 diving+0.202
 giving+0.198
 example+0.197
Neg logit contributions
Max-0.259
ens-0.241
Brown-0.240
Snow-0.234
Mic-0.227
Token and
Token ID290
Feature activation+9.82e+00

Pos logit contributions
 teaching+0.109
 showing+0.105
 example+0.102
 diving+0.102
 giving+0.099
Neg logit contributions
Max-0.190
ens-0.176
Brown-0.176
Snow-0.174
Mic-0.166
Token the
Token ID262
Feature activation-6.69e+00

Pos logit contributions
 teaching+0.177
 showing+0.176
 diving+0.170
 giving+0.169
 show+0.161
Neg logit contributions
Max-0.213
Brown-0.198
ens-0.194
Mic-0.191
Snow-0.191
Token trees
Token ID7150
Feature activation-3.14e+00

Pos logit contributions
Max+0.138
lves+0.129
ens+0.129
Brown+0.127
Mic+0.125
Neg logit contributions
 teaching-0.126
 showing-0.126
 diving-0.123
 giving-0.121
 show-0.121
Token were
Token ID547
Feature activation+1.35e+01

Pos logit contributions
Max+0.069
ens+0.065
Brown+0.064
Snow+0.063
Mic+0.060
Neg logit contributions
 teaching-0.054
 showing-0.053
 diving-0.051
 example-0.051
 giving-0.050
Token giving
Token ID3501
Feature activation+3.16e+01

Pos logit contributions
 teaching+0.177
 showing+0.172
 diving+0.165
 example+0.163
 giving+0.161
Neg logit contributions
Max-0.355
Brown-0.332
ens-0.327
Snow-0.325
Mic-0.317
Token a
Token ID257
Feature activation-1.49e+00

Pos logit contributions
 giving+0.367
 teaching+0.363
 showing+0.362
 diving+0.341
 helping+0.337
Neg logit contributions
Max-0.868
Mic-0.834
Brown-0.826
lves-0.805
issy-0.800
Token pleasant
Token ID15497
Feature activation+7.69e+00

Pos logit contributions
Max+0.034
Brown+0.032
Snow+0.032
ens+0.031
Blue+0.030
Neg logit contributions
 teaching-0.024
 showing-0.024
 example-0.023
 diving-0.022
 giving-0.022
Token shade
Token ID17979
Feature activation+1.55e+00

Pos logit contributions
 teaching+0.131
 showing+0.127
 winning+0.122
 example+0.121
 diving+0.120
Neg logit contributions
Max-0.161
Snow-0.153
ens-0.153
Brown-0.152
 Bun-0.143
Token.
Token ID13
Feature activation+6.23e-01

Pos logit contributions
 teaching+0.019
 showing+0.019
 diving+0.018
 giving+0.017
 example+0.017
Neg logit contributions
Max-0.043
Brown-0.040
ens-0.040
Snow-0.039
Mic-0.039
Token Mom
Token ID11254
Feature activation-1.52e+01

Pos logit contributions
 teaching+0.008
 showing+0.008
 giving+0.008
 diving+0.008
 winning+0.007
Neg logit contributions
lves-0.015
ens-0.015
mi-0.015
Max-0.015
Brown-0.014
Token that
Token ID326
Feature activation+3.54e+00

Pos logit contributions
 teaching+0.020
 showing+0.019
 diving+0.018
 giving+0.018
 example+0.018
Neg logit contributions
Max-0.027
ens-0.025
Brown-0.025
Snow-0.024
Mic-0.023
Token was
Token ID373
Feature activation+1.45e+01

Pos logit contributions
 showing+0.062
 teaching+0.061
 diving+0.059
 giving+0.059
 show+0.057
Neg logit contributions
Max-0.079
Brown-0.075
Mic-0.072
Snow-0.072
lves-0.071
Token very
Token ID845
Feature activation+1.45e+01

Pos logit contributions
 teaching+0.192
 showing+0.184
 diving+0.182
 example+0.181
 liking+0.173
Neg logit contributions
Max-0.362
ens-0.336
Brown-0.335
Snow-0.331
Mic-0.318
Token nice
Token ID3621
Feature activation+9.96e+00

Pos logit contributions
 teaching+0.235
 showing+0.227
 diving+0.216
 giving+0.214
 example+0.210
Neg logit contributions
Max-0.332
ens-0.312
Brown-0.305
Snow-0.303
lves-0.296
Token and
Token ID290
Feature activation+8.42e+00

Pos logit contributions
 teaching+0.151
 showing+0.146
 diving+0.142
 example+0.137
 giving+0.135
Neg logit contributions
Max-0.233
ens-0.225
Brown-0.215
Snow-0.211
lves-0.204
Token giving
Token ID3501
Feature activation+3.13e+01

Pos logit contributions
 teaching+0.133
 diving+0.127
 showing+0.127
 example+0.126
 giving+0.120
Neg logit contributions
Max-0.192
ens-0.180
Brown-0.176
Snow-0.175
lves-0.168
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
 teaching+0.361
 showing+0.356
 giving+0.346
 show+0.334
 example+0.333
Neg logit contributions
Max-0.850
Brown-0.846
Mic-0.828
lves-0.790
Snow-0.785
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
 teaching+0.017
 showing+0.016
 giving+0.016
 winning+0.015
 example+0.015
Neg logit contributions
lves-0.027
Max-0.027
ens-0.027
Brown-0.026
Snow-0.026
Token
Token ID198
Feature activation-1.68e+00

Pos logit contributions
lves+0.040
ens+0.040
Brown+0.038
Max+0.038
ley+0.038
Neg logit contributions
 teaching-0.022
 diving-0.020
 showing-0.020
 example-0.019
aps-0.019
TokenLater
Token ID18602
Feature activation+3.64e+00

Pos logit contributions
ens+0.035
lves+0.032
ley+0.032
uffy+0.031
mi+0.030
Neg logit contributions
 teaching-0.032
 example-0.030
 showing-0.030
 liking-0.029
 diving-0.029
Token that
Token ID326
Feature activation+1.08e+00

Pos logit contributions
 teaching+0.043
 showing+0.042
 diving+0.042
 giving+0.040
 show+0.039
Neg logit contributions
Max-0.103
Brown-0.099
Mic-0.097
lves-0.096
Snow-0.096
Token and
Token ID290
Feature activation+1.04e+01

Pos logit contributions
 teaching+0.094
 showing+0.091
 diving+0.089
 example+0.088
 giving+0.084
Neg logit contributions
Max-0.147
ens-0.139
Brown-0.139
Snow-0.136
K-0.129
Token they
Token ID484
Feature activation-9.45e+00

Pos logit contributions
 teaching+0.164
 showing+0.164
 diving+0.157
 giving+0.157
 show+0.149
Neg logit contributions
Max-0.249
Brown-0.232
ens-0.226
Mic-0.225
Snow-0.225
Token said
Token ID531
Feature activation+1.18e+01

Pos logit contributions
Max+0.231
Brown+0.217
Mic+0.214
Snow+0.208
Nut+0.207
Neg logit contributions
 show-0.150
 showing-0.145
 teaching-0.142
 diving-0.137
 giving-0.135
Token goodbye
Token ID24829
Feature activation+1.43e+01

Pos logit contributions
 teaching+0.173
 showing+0.169
 diving+0.163
 giving+0.160
 example+0.157
Neg logit contributions
Max-0.293
Brown-0.272
ens-0.271
Snow-0.266
Mic-0.261
Token by
Token ID416
Feature activation+1.03e+01

Pos logit contributions
 teaching+0.145
 showing+0.144
 giving+0.134
 diving+0.131
 example+0.125
Neg logit contributions
Max-0.427
Mic-0.401
ens-0.400
Brown-0.397
lves-0.394
Token giving
Token ID3501
Feature activation+3.12e+01

Pos logit contributions
 teaching+0.151
 showing+0.144
 diving+0.144
 example+0.140
 liking+0.137
Neg logit contributions
Max-0.241
Brown-0.228
ens-0.222
Snow-0.222
K-0.213
Token each
Token ID1123
Feature activation-1.84e+00

Pos logit contributions
 teaching+0.351
 show+0.346
 showing+0.343
 example+0.321
 giving+0.314
Neg logit contributions
Mic-0.849
Brown-0.847
Max-0.831
issy-0.826
lves-0.822
Token other
Token ID584
Feature activation-2.22e+00

Pos logit contributions
Max+0.050
Brown+0.047
Snow+0.045
Mic+0.045
ens+0.045
Neg logit contributions
 teaching-0.024
 showing-0.023
 diving-0.023
 giving-0.022
 example-0.022
Token a
Token ID257
Feature activation-1.71e+00

Pos logit contributions
Max+0.065
Mic+0.064
Brown+0.062
Nut+0.061
lves+0.061
Neg logit contributions
 teaching-0.024
 giving-0.022
 diving-0.022
 showing-0.022
 show-0.021
Token wink
Token ID41955
Feature activation+1.04e+01

Pos logit contributions
Max+0.043
Brown+0.041
ens+0.040
Snow+0.040
Mic+0.039
Neg logit contributions
 teaching-0.024
 showing-0.024
 diving-0.023
 giving-0.022
 example-0.022
Token and
Token ID290
Feature activation+7.13e+00

Pos logit contributions
 teaching+0.142
 showing+0.139
 diving+0.134
 giving+0.131
 example+0.127
Neg logit contributions
Max-0.270
Brown-0.252
ens-0.251
Snow-0.245
Mic-0.243
Token,
Token ID11
Feature activation+1.00e+01

Pos logit contributions
Max+0.419
Brown+0.397
Snow+0.385
Mic+0.385
ens+0.380
Neg logit contributions
 teaching-0.249
 showing-0.245
 giving-0.232
 diving-0.228
 liking-0.221
Token pushing
Token ID7796
Feature activation+2.60e+01

Pos logit contributions
 teaching+0.159
 showing+0.157
 giving+0.150
 diving+0.149
 liking+0.142
Neg logit contributions
Max-0.240
Brown-0.225
ens-0.221
Snow-0.219
Mic-0.217
Token her
Token ID607
Feature activation-1.01e+01

Pos logit contributions
 example+0.375
 showing+0.375
 teaching+0.367
 show+0.360
 diving+0.338
Neg logit contributions
Brown-0.642
Snow-0.612
Bug-0.594
Nut-0.594
lves-0.588
Token away
Token ID1497
Feature activation+9.42e+00

Pos logit contributions
Max+0.254
Brown+0.238
ens+0.235
Snow+0.233
Mic+0.229
Neg logit contributions
 teaching-0.147
 showing-0.146
 diving-0.140
 giving-0.137
 example-0.133
Token and
Token ID290
Feature activation+8.26e+00

Pos logit contributions
 teaching+0.129
 showing+0.123
 diving+0.121
 example+0.116
 giving+0.115
Neg logit contributions
Max-0.241
ens-0.225
Brown-0.223
Snow-0.216
Mic-0.211
Token giving
Token ID3501
Feature activation+3.10e+01

Pos logit contributions
 teaching+0.121
 diving+0.116
 showing+0.115
 example+0.114
 giving+0.110
Neg logit contributions
Max-0.202
ens-0.188
Brown-0.183
Snow-0.179
lves-0.175
Token Lily
Token ID20037
Feature activation-2.19e+01

Pos logit contributions
 teaching+0.477
 showing+0.470
 show+0.447
 giving+0.444
 example+0.442
Neg logit contributions
Brown-0.730
Max-0.726
Mic-0.693
Snow-0.688
lves-0.674
Token all
Token ID477
Feature activation+5.75e+00

Pos logit contributions
Brown+0.662
Max+0.652
Mic+0.632
Snow+0.621
Bug+0.613
Neg logit contributions
 teaching-0.217
 showing-0.193
 giving-0.189
 sharing-0.182
 show-0.182
Token the
Token ID262
Feature activation-8.35e+00

Pos logit contributions
 teaching+0.086
 showing+0.083
 diving+0.080
 giving+0.079
 example+0.077
Neg logit contributions
Max-0.143
Brown-0.135
Snow-0.131
ens-0.131
Mic-0.131
Token sunshine
Token ID34488
Feature activation-2.99e+00

Pos logit contributions
Max+0.205
Brown+0.191
ens+0.189
Snow+0.186
Mic+0.183
Neg logit contributions
 teaching-0.127
 showing-0.124
 diving-0.120
 giving-0.118
 example-0.115
Token.
Token ID13
Feature activation+5.39e-01

Pos logit contributions
Max+0.084
Brown+0.080
Mic+0.078
lves+0.077
Snow+0.077
Neg logit contributions
 showing-0.036
 teaching-0.036
 giving-0.034
 diving-0.034
 show-0.031
Token around
Token ID1088
Feature activation+1.42e+01

Pos logit contributions
 teaching+0.200
 showing+0.194
 diving+0.184
 giving+0.180
 example+0.173
Neg logit contributions
Max-0.567
ens-0.532
Brown-0.531
Snow-0.520
Mic-0.514
Token Fre
Token ID4848
Feature activation-4.08e+00

Pos logit contributions
 teaching+0.202
 showing+0.197
 diving+0.192
 giving+0.189
 example+0.186
Neg logit contributions
Max-0.357
ens-0.328
Brown-0.327
Snow-0.317
Mic-0.314
Tokenck
Token ID694
Feature activation+1.15e+01

Pos logit contributions
Max+0.071
Brown+0.063
ens+0.063
Snow+0.061
Mic+0.059
Neg logit contributions
 teaching-0.093
 showing-0.092
 diving-0.091
 example-0.088
 giving-0.088
Tokenles
Token ID829
Feature activation+1.91e+00

Pos logit contributions
 diving+0.170
 teaching+0.170
 showing+0.169
 giving+0.155
 of+0.155
Neg logit contributions
Max-0.279
Brown-0.277
Mic-0.270
Snow-0.263
Nut-0.257
Token,
Token ID11
Feature activation+8.83e+00

Pos logit contributions
 teaching+0.026
 showing+0.025
 diving+0.024
 giving+0.024
 example+0.023
Neg logit contributions
Max-0.050
Brown-0.047
ens-0.047
Snow-0.046
Mic-0.045
Token giving
Token ID3501
Feature activation+3.09e+01

Pos logit contributions
 teaching+0.108
 showing+0.106
 diving+0.101
 giving+0.099
 example+0.094
Neg logit contributions
Max-0.243
Brown-0.229
ens-0.228
Snow-0.223
Mic-0.222
Token her
Token ID607
Feature activation-1.10e+01

Pos logit contributions
 showing+0.525
 diving+0.523
 teaching+0.516
 giving+0.505
 show+0.497
Neg logit contributions
Max-0.703
Brown-0.674
Mic-0.640
Snow-0.627
issy-0.627
Token hugs
Token ID40657
Feature activation+1.95e+01

Pos logit contributions
Max+0.278
Brown+0.261
Snow+0.254
Mic+0.253
ens+0.253
Neg logit contributions
 teaching-0.163
 showing-0.162
 diving-0.156
 giving-0.153
 show-0.148
Token and
Token ID290
Feature activation+7.28e+00

Pos logit contributions
 teaching+0.230
 showing+0.228
 giving+0.220
 diving+0.203
 show+0.199
Neg logit contributions
Max-0.545
lves-0.520
Brown-0.518
Mic-0.515
ens-0.496
Token pet
Token ID4273
Feature activation+1.19e+01

Pos logit contributions
 teaching+0.114
 showing+0.111
 diving+0.108
 giving+0.106
 example+0.104
Neg logit contributions
Max-0.175
Brown-0.162
ens-0.162
Snow-0.158
Mic-0.156
Tokenting
Token ID889
Feature activation+8.05e+00

Pos logit contributions
 showing+0.198
 teaching+0.191
 diving+0.191
 giving+0.191
 show+0.189
Neg logit contributions
Max-0.271
Brown-0.264
Mic-0.262
issy-0.241
Snow-0.238
Token king
Token ID5822
Feature activation-7.69e+00

Pos logit contributions
Max+0.163
Brown+0.149
ens+0.149
Snow+0.145
Mic+0.143
Neg logit contributions
 teaching-0.131
 showing-0.130
 diving-0.126
 giving-0.124
 example-0.121
Token's
Token ID338
Feature activation+2.83e+00

Pos logit contributions
Max+0.211
Brown+0.197
Mic+0.193
Snow+0.192
ens+0.191
Neg logit contributions
 teaching-0.098
 showing-0.097
 giving-0.093
 diving-0.092
 liking-0.087
Token tum
Token ID11814
Feature activation+1.22e+01

Pos logit contributions
 teaching+0.045
 showing+0.044
 diving+0.043
 giving+0.042
 example+0.041
Neg logit contributions
Max-0.064
Brown-0.061
ens-0.060
Snow-0.059
Mic-0.058
Tokenmy
Token ID1820
Feature activation-1.16e+01

Pos logit contributions
 teaching+0.322
 showing+0.315
 giving+0.312
 diving+0.309
 serving+0.302
Neg logit contributions
Max-0.160
Brown-0.139
lves-0.137
Mic-0.133
Snow-0.130
Token by
Token ID416
Feature activation+9.60e+00

Pos logit contributions
Max+0.310
Brown+0.289
ens+0.287
Snow+0.281
Mic+0.281
Neg logit contributions
 teaching-0.150
 showing-0.148
 diving-0.142
 giving-0.140
 example-0.133
Token giving
Token ID3501
Feature activation+3.09e+01

Pos logit contributions
 teaching+0.113
 diving+0.106
 showing+0.103
 example+0.100
 guessing+0.096
Neg logit contributions
Max-0.250
Brown-0.240
Snow-0.237
ens-0.236
Mic-0.229
Token him
Token ID683
Feature activation-1.09e+01

Pos logit contributions
 show+0.375
 teaching+0.375
 showing+0.372
 example+0.368
 giving+0.365
Neg logit contributions
Max-0.806
Brown-0.771
Mic-0.744
K-0.743
lves-0.739
Token too
Token ID1165
Feature activation+7.55e+00

Pos logit contributions
Max+0.309
Brown+0.287
Mic+0.284
ens+0.277
lves+0.274
Neg logit contributions
 teaching-0.131
 showing-0.127
 diving-0.126
 giving-0.124
 example-0.121
Token many
Token ID867
Feature activation+8.34e+00

Pos logit contributions
 teaching+0.130
 showing+0.129
 diving+0.122
 giving+0.117
 designing+0.116
Neg logit contributions
Max-0.168
ens-0.158
Snow-0.156
Brown-0.156
K-0.148
Token ch
Token ID442
Feature activation+1.68e+01

Pos logit contributions
 teaching+0.133
 showing+0.130
 diving+0.126
 giving+0.121
 example+0.121
Neg logit contributions
Max-0.188
ens-0.179
Brown-0.179
Snow-0.175
Mic-0.168
Tokenocol
Token ID4668
Feature activation+5.34e+00

Pos logit contributions
 teaching+0.267
 showing+0.265
 giving+0.246
 diving+0.244
 guessing+0.237
Neg logit contributions
Max-0.389
ens-0.381
Brown-0.366
Mic-0.358
Snow-0.358
Token me
Token ID502
Feature activation-4.90e+00

Pos logit contributions
 showing+0.380
 teaching+0.379
 giving+0.366
 helping+0.354
 show+0.351
Neg logit contributions
Max-0.674
lves-0.642
Brown-0.631
Snow-0.630
ens-0.621
Token,
Token ID11
Feature activation+9.51e+00

Pos logit contributions
Max+0.116
Brown+0.111
ens+0.110
Snow+0.107
Mic+0.104
Neg logit contributions
 teaching-0.067
 diving-0.066
 showing-0.065
 example-0.065
 humor-0.064
Token thank
Token ID5875
Feature activation+2.49e+01

Pos logit contributions
 showing+0.148
 teaching+0.148
 giving+0.144
 diving+0.137
 helping+0.135
Neg logit contributions
Max-0.232
Brown-0.218
lves-0.216
ens-0.215
Snow-0.213
Token you
Token ID345
Feature activation-7.68e+00

Pos logit contributions
 teaching+0.396
 showing+0.395
 giving+0.376
 diving+0.372
 show+0.370
Neg logit contributions
Max-0.592
Brown-0.576
lves-0.570
Snow-0.552
issy-0.550
Token for
Token ID329
Feature activation+1.92e+01

Pos logit contributions
Max+0.194
ens+0.181
Brown+0.180
Snow+0.176
Mic+0.173
Neg logit contributions
 teaching-0.109
 showing-0.106
 diving-0.102
 giving-0.100
 example-0.098
Token giving
Token ID3501
Feature activation+3.08e+01

Pos logit contributions
aps+0.224
apping+0.210
 contain+0.205
 juggling+0.198
 diving+0.197
Neg logit contributions
Max-0.486
Brown-0.448
Mic-0.448
Snow-0.445
Nut-0.439
Token me
Token ID502
Feature activation-6.33e+00

Pos logit contributions
 teaching+0.382
 show+0.382
 showing+0.377
 giving+0.361
 example+0.359
Neg logit contributions
Max-0.796
Brown-0.796
lves-0.768
K-0.755
Snow-0.751
Token such
Token ID884
Feature activation+9.47e+00

Pos logit contributions
Max+0.176
Brown+0.166
Mic+0.162
ens+0.159
lves+0.159
Neg logit contributions
 teaching-0.079
 showing-0.075
 diving-0.073
 giving-0.072
 example-0.070
Token good
Token ID922
Feature activation+1.34e+01

Pos logit contributions
 teaching+0.141
 showing+0.138
 diving+0.134
 giving+0.131
 example+0.127
Neg logit contributions
Max-0.235
Brown-0.219
ens-0.217
Snow-0.213
Mic-0.211
Token advice
Token ID5608
Feature activation+1.36e+01

Pos logit contributions
 teaching+0.216
 showing+0.208
 diving+0.204
 guessing+0.196
 liking+0.196
Neg logit contributions
ens-0.291
Max-0.288
Brown-0.268
Snow-0.264
Spot-0.255
Token!
Token ID0
Feature activation+1.24e+00

Pos logit contributions
 teaching+0.193
 showing+0.189
 diving+0.182
 giving+0.178
 example+0.174
Neg logit contributions
Max-0.344
Brown-0.321
ens-0.320
Snow-0.312
Mic-0.308
Token hungry
Token ID14720
Feature activation+1.49e+01

Pos logit contributions
 teaching+0.236
 showing+0.227
 example+0.223
 diving+0.207
 joining+0.206
Neg logit contributions
Max-0.324
uffy-0.313
ens-0.313
Snow-0.307
Brown-0.306
Token.
Token ID13
Feature activation+1.70e+00

Pos logit contributions
 teaching+0.234
 showing+0.230
 diving+0.220
 giving+0.215
 example+0.212
Neg logit contributions
Max-0.356
ens-0.333
Brown-0.328
Snow-0.322
Mic-0.314
Token Would
Token ID10928
Feature activation+1.99e+01

Pos logit contributions
 teaching+0.027
 showing+0.026
 giving+0.025
 diving+0.025
 example+0.025
Neg logit contributions
Max-0.040
ens-0.038
Brown-0.038
lves-0.037
Mic-0.037
Token you
Token ID345
Feature activation-8.01e+00

Pos logit contributions
 teaching+0.388
 showing+0.380
 diving+0.373
 giving+0.367
 serving+0.356
Neg logit contributions
Max-0.411
Brown-0.380
ens-0.376
lves-0.370
Mic-0.364
Token mind
Token ID2000
Feature activation+1.49e+01

Pos logit contributions
ens+0.143
ppy+0.133
ety+0.131
Max+0.130
Snow+0.127
Neg logit contributions
 showing-0.156
 teaching-0.152
 designing-0.146
 succeeding-0.144
 testing-0.144
Token giving
Token ID3501
Feature activation+3.05e+01

Pos logit contributions
 teaching+0.229
 showing+0.223
 example+0.218
 accomplishment+0.213
 diving+0.210
Neg logit contributions
Max-0.340
ens-0.325
Brown-0.318
Snow-0.315
ety-0.305
Token me
Token ID502
Feature activation-6.23e+00

Pos logit contributions
 teaching+0.408
 showing+0.401
 diving+0.393
 show+0.392
 giving+0.388
Neg logit contributions
Max-0.776
Brown-0.755
lves-0.719
K-0.710
issy-0.704
Token some
Token ID617
Feature activation+5.84e+00

Pos logit contributions
Max+0.155
ens+0.146
Brown+0.143
Snow+0.142
Mic+0.137
Neg logit contributions
 teaching-0.089
 showing-0.088
 diving-0.084
 giving-0.083
 example-0.081
Token of
Token ID286
Feature activation+1.59e+01

Pos logit contributions
 teaching+0.106
 showing+0.104
 succeeding+0.098
 diving+0.096
 guessing+0.096
Neg logit contributions
ens-0.114
Max-0.112
Snow-0.108
Brown-0.105
Bug-0.103
Token your
Token ID534
Feature activation-7.20e+00

Pos logit contributions
 teaching+0.241
 showing+0.236
 diving+0.228
 giving+0.225
 example+0.218
Neg logit contributions
Max-0.390
Brown-0.363
ens-0.361
Snow-0.353
Mic-0.349
Token ol
Token ID25776
Feature activation-5.11e+00

Pos logit contributions
Max+0.166
Brown+0.154
ens+0.154
Snow+0.150
Mic+0.148
Neg logit contributions
 teaching-0.118
 showing-0.116
 diving-0.112
 giving-0.111
 example-0.108
Token that
Token ID326
Feature activation+2.76e+00

Pos logit contributions
 teaching+0.291
 showing+0.277
 giving+0.270
 making+0.251
 diving+0.250
Neg logit contributions
Max-0.516
Brown-0.509
Snow-0.496
Blue-0.481
lves-0.481
Token he
Token ID339
Feature activation-1.31e+01

Pos logit contributions
 teaching+0.049
 showing+0.048
 giving+0.047
 diving+0.045
 sharing+0.044
Neg logit contributions
Max-0.061
Brown-0.060
lves-0.060
Mic-0.059
Snow-0.057
Token was
Token ID373
Feature activation+1.38e+01

Pos logit contributions
Max+0.280
ens+0.264
Brown+0.263
Mic+0.258
Snow+0.258
Neg logit contributions
 teaching-0.241
 showing-0.237
 giving-0.227
 diving-0.220
 liking-0.217
Token serious
Token ID2726
Feature activation+1.20e+01

Pos logit contributions
 diving+0.206
 example+0.201
 humor+0.198
 accomplishment+0.196
 liking+0.190
Neg logit contributions
Max-0.310
Brown-0.284
Mic-0.273
Snow-0.267
lves-0.266
Token about
Token ID546
Feature activation+2.07e+01

Pos logit contributions
 diving+0.193
 teaching+0.191
 showing+0.188
 example+0.180
 giving+0.175
Neg logit contributions
Max-0.268
ens-0.255
Brown-0.244
Snow-0.235
Mic-0.226
Token giving
Token ID3501
Feature activation+3.04e+01

Pos logit contributions
 teaching+0.281
 showing+0.276
 diving+0.263
 giving+0.261
 liking+0.247
Neg logit contributions
Max-0.546
Brown-0.513
ens-0.509
Snow-0.501
Mic-0.496
Token the
Token ID262
Feature activation-8.78e+00

Pos logit contributions
 teaching+0.410
 showing+0.393
 giving+0.384
 counting+0.373
 sharing+0.372
Neg logit contributions
Brown-0.758
Snow-0.729
lves-0.721
Mic-0.717
Max-0.717
Token toy
Token ID13373
Feature activation-5.51e+00

Pos logit contributions
Max+0.197
Brown+0.182
ens+0.180
Snow+0.177
Mic+0.175
Neg logit contributions
 teaching-0.152
 showing-0.149
 diving-0.145
 giving-0.143
 example-0.139
Token to
Token ID284
Feature activation+1.40e+01

Pos logit contributions
Max+0.144
Mic+0.136
Brown+0.136
Snow+0.134
lves+0.131
Neg logit contributions
 teaching-0.078
 showing-0.077
 giving-0.072
 show-0.071
 diving-0.070
Token Jim
Token ID5395
Feature activation-2.48e+01

Pos logit contributions
 teaching+0.230
 showing+0.226
 giving+0.217
 diving+0.210
 show+0.209
Neg logit contributions
Max-0.330
Brown-0.312
ens-0.306
lves-0.305
Snow-0.302
Token.
Token ID13
Feature activation+9.05e-02

Pos logit contributions
Max+0.650
Brown+0.609
Snow+0.596
ens+0.592
Mic+0.591
Neg logit contributions
 teaching-0.347
 showing-0.339
 giving-0.324
 diving-0.320
 serving-0.303
Token way
Token ID835
Feature activation+1.39e+01

Pos logit contributions
Max+0.200
Brown+0.184
ens+0.184
Snow+0.179
Mic+0.177
Neg logit contributions
 teaching-0.138
 showing-0.135
 diving-0.130
 giving-0.129
 example-0.125
Token of
Token ID286
Feature activation+1.70e+01

Pos logit contributions
 teaching+0.143
 showing+0.139
 diving+0.132
 giving+0.129
 example+0.123
Neg logit contributions
Max-0.408
Brown-0.384
ens-0.383
Snow-0.376
Mic-0.372
Token thanking
Token ID41892
Feature activation+2.30e+01

Pos logit contributions
 teaching+0.241
 showing+0.234
 diving+0.225
 giving+0.221
 example+0.220
Neg logit contributions
Max-0.422
ens-0.393
Brown-0.380
Snow-0.376
Mic-0.369
Token yoga
Token ID20351
Feature activation+2.03e+00

Pos logit contributions
 teaching+0.372
 showing+0.370
 giving+0.349
 diving+0.346
 example+0.346
Neg logit contributions
Max-0.549
lves-0.537
Brown-0.534
Mic-0.517
ens-0.516
Token for
Token ID329
Feature activation+1.86e+01

Pos logit contributions
 teaching+0.028
 showing+0.028
 diving+0.026
 giving+0.026
 example+0.025
Neg logit contributions
Max-0.053
Brown-0.050
ens-0.050
Mic-0.049
Snow-0.049
Token giving
Token ID3501
Feature activation+3.03e+01

Pos logit contributions
 teaching+0.266
 diving+0.262
 showing+0.262
 liking+0.252
 giving+0.249
Neg logit contributions
Max-0.436
ens-0.395
Snow-0.388
Brown-0.385
Spot-0.379
Token them
Token ID606
Feature activation-1.24e+01

Pos logit contributions
 showing+0.392
 teaching+0.391
 example+0.378
 show+0.370
 giving+0.368
Neg logit contributions
Brown-0.786
Mic-0.777
Max-0.762
lves-0.755
Snow-0.733
Token a
Token ID257
Feature activation-2.40e+00

Pos logit contributions
Max+0.368
Brown+0.360
Mic+0.356
lves+0.345
Snow+0.340
Neg logit contributions
 teaching-0.131
 diving-0.123
 showing-0.121
 show-0.117
 example-0.116
Token fun
Token ID1257
Feature activation+1.18e+01

Pos logit contributions
Max+0.060
ens+0.055
Brown+0.055
Snow+0.055
Mic+0.053
Neg logit contributions
 teaching-0.034
 showing-0.034
 diving-0.031
 giving-0.031
 designing-0.030
Token time
Token ID640
Feature activation+8.03e+00

Pos logit contributions
 teaching+0.168
 showing+0.163
 diving+0.156
 giving+0.154
 example+0.149
Neg logit contributions
Max-0.299
ens-0.279
Brown-0.276
Snow-0.270
Mic-0.264
Token.
Token ID13
Feature activation-3.23e-01

Pos logit contributions
 teaching+0.115
 showing+0.113
 diving+0.109
 example+0.106
 giving+0.106
Neg logit contributions
Max-0.199
ens-0.185
Brown-0.184
Snow-0.180
Mic-0.175
Token Lisa
Token ID15378
Feature activation-2.35e+01

Pos logit contributions
 teaching+0.161
 showing+0.159
 giving+0.154
 show+0.153
 diving+0.146
Neg logit contributions
Max-0.209
lves-0.201
Brown-0.200
Mic-0.195
ens-0.193
Token!"
Token ID2474
Feature activation+4.72e+00

Pos logit contributions
Max+0.623
Brown+0.584
Mic+0.573
ens+0.571
Snow+0.566
Neg logit contributions
 teaching-0.325
 showing-0.320
 giving-0.304
 diving-0.303
 example-0.285
Token Dad
Token ID17415
Feature activation-1.52e+01

Pos logit contributions
 teaching+0.090
 showing+0.089
 show+0.085
 giving+0.083
 diving+0.079
Neg logit contributions
lves-0.101
Max-0.098
Brown-0.097
Snow-0.095
Nut-0.093
Token said
Token ID531
Feature activation+9.35e+00

Pos logit contributions
Max+0.278
ens+0.257
Brown+0.243
issy+0.236
mi+0.229
Neg logit contributions
 example-0.306
 teaching-0.302
 liking-0.298
 diving-0.298
 showing-0.298
Token,
Token ID11
Feature activation+7.84e+00

Pos logit contributions
 teaching+0.141
 diving+0.141
 showing+0.138
 example+0.134
 liking+0.131
Neg logit contributions
Max-0.222
ens-0.204
Brown-0.202
Snow-0.196
Mic-0.192
Token giving
Token ID3501
Feature activation+3.03e+01

Pos logit contributions
 example+0.088
 diving+0.083
 humor+0.081
 teaching+0.079
 guessing+0.075
Neg logit contributions
Max-0.211
Brown-0.196
Snow-0.190
Mic-0.188
ens-0.182
Token her
Token ID607
Feature activation-1.19e+01

Pos logit contributions
 teaching+0.504
 showing+0.484
 show+0.471
 example+0.455
 diving+0.451
Neg logit contributions
lves-0.707
Brown-0.701
Nut-0.684
Snow-0.678
Mic-0.664
Token the
Token ID262
Feature activation-8.71e+00

Pos logit contributions
Max+0.338
Brown+0.328
Snow+0.318
Mic+0.317
lves+0.316
Neg logit contributions
 teaching-0.144
 showing-0.136
 giving-0.128
 diving-0.128
 show-0.127
Token doll
Token ID3654
Feature activation-1.50e+01

Pos logit contributions
Max+0.196
Brown+0.181
ens+0.181
Snow+0.176
Mic+0.174
Neg logit contributions
 teaching-0.148
 showing-0.145
 diving-0.141
 giving-0.139
 example-0.136
Token and
Token ID290
Feature activation+6.42e+00

Pos logit contributions
Max+0.427
Brown+0.403
Mic+0.399
lves+0.395
Snow+0.388
Neg logit contributions
 teaching-0.182
 showing-0.180
 giving-0.168
 show-0.167
 diving-0.163
Token the
Token ID262
Feature activation-8.99e+00

Pos logit contributions
 teaching+0.113
 showing+0.111
 diving+0.107
 giving+0.107
 example+0.102
Neg logit contributions
Max-0.143
Brown-0.133
ens-0.131
Snow-0.128
Mic-0.127
Token these
Token ID777
Feature activation-5.55e-01

Pos logit contributions
 teaching+0.287
 showing+0.281
 diving+0.271
 giving+0.266
 example+0.259
Neg logit contributions
Max-0.487
Brown-0.455
ens-0.454
Mic-0.441
Snow-0.440
Token."
Token ID526
Feature activation+2.78e+00

Pos logit contributions
Max+0.011
ens+0.010
Snow+0.010
Brown+0.010
lves+0.009
Neg logit contributions
 teaching-0.010
 showing-0.010
 giving-0.010
 diving-0.010
 serving-0.010
Token He
Token ID679
Feature activation-1.21e+01

Pos logit contributions
 teaching+0.038
 showing+0.038
 giving+0.036
 show+0.036
 diving+0.033
Neg logit contributions
lves-0.069
Brown-0.066
Max-0.065
Snow-0.065
issy-0.063
Token said
Token ID531
Feature activation+1.02e+01

Pos logit contributions
Max+0.267
ens+0.248
Brown+0.243
Snow+0.238
Mic+0.232
Neg logit contributions
 teaching-0.209
 showing-0.204
 diving-0.200
 giving-0.196
 example-0.193
Token,
Token ID11
Feature activation+8.03e+00

Pos logit contributions
 diving+0.178
 teaching+0.177
 guessing+0.176
 example+0.175
 showing+0.172
Neg logit contributions
Max-0.204
ens-0.187
Snow-0.182
Brown-0.181
 Bun-0.175
Token giving
Token ID3501
Feature activation+3.02e+01

Pos logit contributions
 example+0.135
 diving+0.132
 teaching+0.131
 humor+0.128
 showing+0.125
Neg logit contributions
Max-0.176
Brown-0.159
Snow-0.156
ens-0.152
Mic-0.152
Token them
Token ID606
Feature activation-1.19e+01

Pos logit contributions
 teaching+0.344
 showing+0.339
 diving+0.336
 example+0.319
 show+0.315
Neg logit contributions
Brown-0.833
Max-0.813
lves-0.795
Mic-0.788
Snow-0.779
Token two
Token ID734
Feature activation-5.24e+00

Pos logit contributions
Brown+0.369
Mic+0.364
Max+0.363
lves+0.345
Blue+0.345
Neg logit contributions
 teaching-0.107
 show-0.105
 diving-0.099
 join-0.097
 showing-0.095
Token life
Token ID1204
Feature activation+2.00e+00

Pos logit contributions
Max+0.135
Brown+0.127
ens+0.123
Mic+0.123
Snow+0.123
Neg logit contributions
 teaching-0.074
 showing-0.073
 diving-0.071
 giving-0.069
 example-0.067
Token jackets
Token ID35129
Feature activation-3.08e+00

Pos logit contributions
 teaching+0.023
 showing+0.023
 giving+0.022
 diving+0.021
 example+0.020
Neg logit contributions
Max-0.057
Brown-0.054
Mic-0.054
lves-0.052
Snow-0.052
Token.
Token ID13
Feature activation+6.61e-01

Pos logit contributions
Max+0.086
Brown+0.081
Mic+0.080
Snow+0.079
ens+0.079
Neg logit contributions
 teaching-0.037
 showing-0.036
 giving-0.034
 diving-0.034
 liking-0.031
Token store
Token ID3650
Feature activation+9.39e+00

Pos logit contributions
Max+0.178
ens+0.174
Brown+0.171
issy+0.168
Mic+0.167
Neg logit contributions
 diving-0.109
 showing-0.109
 teaching-0.107
 show-0.103
 example-0.100
Token with
Token ID351
Feature activation+2.14e+01

Pos logit contributions
 teaching+0.132
 showing+0.129
 diving+0.128
 giving+0.123
 example+0.122
Neg logit contributions
Max-0.233
ens-0.217
Brown-0.214
Snow-0.209
lves-0.206
Token the
Token ID262
Feature activation-7.87e+00

Pos logit contributions
 showing+0.301
 teaching+0.300
 diving+0.281
 show+0.280
 giving+0.278
Neg logit contributions
Brown-0.556
Max-0.549
Snow-0.524
Mic-0.521
lves-0.516
Token perfume
Token ID40094
Feature activation-5.31e-02

Pos logit contributions
Max+0.172
Brown+0.159
ens+0.156
Snow+0.154
Mic+0.152
Neg logit contributions
 teaching-0.143
 showing-0.141
 diving-0.135
 giving-0.135
 show-0.131
Token,
Token ID11
Feature activation+7.80e+00

Pos logit contributions
Max+0.001
Brown+0.001
Mic+0.001
lves+0.001
ens+0.001
Neg logit contributions
 showing-0.001
 teaching-0.001
 diving-0.001
 giving-0.001
 show-0.001
Token giving
Token ID3501
Feature activation+3.01e+01

Pos logit contributions
 example+0.091
 humor+0.084
 teaching+0.083
 diving+0.081
aps+0.081
Neg logit contributions
Max-0.208
ens-0.192
Snow-0.187
Brown-0.185
lves-0.185
Token it
Token ID340
Feature activation-1.05e+01

Pos logit contributions
 teaching+0.356
 showing+0.355
 giving+0.344
 show+0.323
 example+0.320
Neg logit contributions
Max-0.831
Brown-0.818
Mic-0.792
uffy-0.763
lves-0.762
Token a
Token ID257
Feature activation-2.50e+00

Pos logit contributions
Max+0.311
Brown+0.296
Mic+0.289
ens+0.287
Snow+0.283
Neg logit contributions
 teaching-0.108
 showing-0.106
 diving-0.102
 giving-0.099
 example-0.095
Token little
Token ID1310
Feature activation-3.75e+00

Pos logit contributions
Max+0.052
Snow+0.048
ball+0.046
Blue+0.046
Brown+0.046
Neg logit contributions
 teaching-0.045
 succeeding-0.044
 showing-0.043
 example-0.043
 serving-0.042
Token hug
Token ID16225
Feature activation+1.71e+01

Pos logit contributions
Max+0.078
ens+0.075
Brown+0.071
Snow+0.070
Blue+0.068
Neg logit contributions
 teaching-0.065
 showing-0.063
 diving-0.062
 succeeding-0.060
 liking-0.059
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
 teaching+0.222
 showing+0.218
 giving+0.207
 diving+0.202
 example+0.191
Neg logit contributions
Max-0.459
Brown-0.434
Mic-0.426
ens-0.424
lves-0.421
Token proud
Token ID6613
Feature activation+1.76e+01

Pos logit contributions
 teaching+0.161
 example+0.154
 showing+0.153
 giving+0.146
 diving+0.145
Neg logit contributions
Max-0.328
Brown-0.310
Snow-0.301
ens-0.299
lves-0.295
Token of
Token ID286
Feature activation+1.62e+01

Pos logit contributions
 teaching+0.167
 showing+0.161
 example+0.156
 diving+0.153
 liking+0.145
Neg logit contributions
Max-0.499
ens-0.476
Brown-0.469
Snow-0.459
Spot-0.441
Token himself
Token ID2241
Feature activation-1.07e+01

Pos logit contributions
 teaching+0.268
 example+0.256
 showing+0.255
 liking+0.251
 giving+0.248
Neg logit contributions
Max-0.322
ens-0.289
Brown-0.284
 Bun-0.279
K-0.276
Token for
Token ID329
Feature activation+1.86e+01

Pos logit contributions
Max+0.292
Brown+0.274
ens+0.272
Snow+0.267
Mic+0.265
Neg logit contributions
 teaching-0.134
 showing-0.131
 diving-0.125
 giving-0.123
 example-0.118
Token not
Token ID407
Feature activation+1.18e+01

Pos logit contributions
aps+0.235
apping+0.196
iddle+0.196
apped+0.195
ulp+0.190
Neg logit contributions
Max-0.437
 Bun-0.394
 Rusty-0.390
Brown-0.389
 Margaret-0.384
Token giving
Token ID3501
Feature activation+3.01e+01

Pos logit contributions
 example+0.104
 teaching+0.101
aps+0.093
 diving+0.091
 showing+0.091
Neg logit contributions
Max-0.314
ens-0.308
Brown-0.298
lves-0.294
ety-0.293
Token up
Token ID510
Feature activation+9.40e+00

Pos logit contributions
 diving+0.222
 showing+0.218
 teaching+0.210
 giving+0.199
 counting+0.196
Neg logit contributions
Max-0.967
Brown-0.940
Mic-0.918
issy-0.918
Snow-0.913
Token.
Token ID13
Feature activation+1.88e-01

Pos logit contributions
 teaching+0.130
 showing+0.128
 diving+0.123
 giving+0.121
 example+0.117
Neg logit contributions
Max-0.244
Brown-0.228
ens-0.225
Snow-0.222
Mic-0.220
Token Tiny
Token ID20443
Feature activation-2.50e+01

Pos logit contributions
 teaching+0.003
 showing+0.003
 giving+0.002
 diving+0.002
 winning+0.002
Neg logit contributions
Max-0.004
ens-0.004
lves-0.004
Snow-0.004
Brown-0.004
Token knew
Token ID2993
Feature activation+1.62e+01

Pos logit contributions
Max+0.612
Mic+0.610
Brown+0.591
ens+0.581
Nut+0.565
Neg logit contributions
 showing-0.379
 teaching-0.361
 giving-0.354
 show-0.340
 diving-0.335
Token that
Token ID326
Feature activation+2.60e-01

Pos logit contributions
 teaching+0.269
 showing+0.264
 diving+0.256
 giving+0.254
 example+0.244
Neg logit contributions
Max-0.378
Brown-0.352
ens-0.350
Snow-0.341
Mic-0.341
Token Bob
Token ID5811
Feature activation-2.26e+01

Pos logit contributions
 teaching+0.038
 showing+0.037
 diving+0.036
 giving+0.036
 example+0.035
Neg logit contributions
Max-0.047
Brown-0.044
ens-0.044
Snow-0.042
Mic-0.042
Token was
Token ID373
Feature activation+1.42e+01

Pos logit contributions
Mic+0.566
Max+0.564
Brown+0.542
Snow+0.531
K+0.518
Neg logit contributions
 teaching-0.340
 showing-0.328
 giving-0.320
 diving-0.307
 serving-0.306
Token so
Token ID523
Feature activation+1.21e+01

Pos logit contributions
 teaching+0.202
 diving+0.201
 showing+0.200
 liking+0.193
 example+0.193
Neg logit contributions
Max-0.335
Brown-0.309
ens-0.302
Snow-0.301
lves-0.295
Token nice
Token ID3621
Feature activation+9.10e+00

Pos logit contributions
 teaching+0.162
 showing+0.160
 diving+0.153
 giving+0.148
 example+0.147
Neg logit contributions
Max-0.313
Brown-0.294
ens-0.291
Snow-0.287
lves-0.283
Token for
Token ID329
Feature activation+1.87e+01

Pos logit contributions
 teaching+0.127
 showing+0.124
 diving+0.121
 giving+0.117
 example+0.116
Neg logit contributions
Max-0.229
ens-0.215
Brown-0.212
Snow-0.207
Mic-0.202
Token giving
Token ID3501
Feature activation+3.00e+01

Pos logit contributions
 teaching+0.344
 showing+0.339
 diving+0.329
 giving+0.325
 example+0.316
Neg logit contributions
Max-0.401
Brown-0.369
ens-0.368
Snow-0.357
Mic-0.354
Token them
Token ID606
Feature activation-1.24e+01

Pos logit contributions
 showing+0.428
 teaching+0.428
 giving+0.414
 diving+0.410
 show+0.404
Neg logit contributions
Max-0.721
Brown-0.708
issy-0.705
Mic-0.699
lves-0.691
Token the
Token ID262
Feature activation-9.01e+00

Pos logit contributions
Max+0.369
Brown+0.349
Mic+0.344
ens+0.338
Snow+0.337
Neg logit contributions
 teaching-0.131
 showing-0.126
 diving-0.120
 giving-0.119
 show-0.113
Token signs
Token ID5895
Feature activation+8.47e+00

Pos logit contributions
Max+0.223
Brown+0.208
ens+0.207
Snow+0.202
Mic+0.200
Neg logit contributions
 teaching-0.133
 showing-0.130
 diving-0.125
 giving-0.123
 example-0.120
Token.
Token ID13
Feature activation+1.82e-01

Pos logit contributions
 teaching+0.104
 showing+0.101
 giving+0.097
 diving+0.094
 show+0.088
Neg logit contributions
Max-0.234
Brown-0.221
Mic-0.218
ens-0.216
Snow-0.215
Token
Token ID198
Feature activation-2.26e+00

Pos logit contributions
 teaching+0.003
 showing+0.003
 diving+0.003
 giving+0.003
 counting+0.003
Neg logit contributions
Max-0.004
lves-0.004
Brown-0.004
ens-0.004
Snow-0.004
Token The
Token ID383
Feature activation-5.03e+00

Pos logit contributions
Max+0.130
ens+0.127
lves+0.125
Brown+0.125
Mic+0.122
Neg logit contributions
 teaching-0.055
 showing-0.054
 diving-0.050
 giving-0.050
 liking-0.048
Token strong
Token ID1913
Feature activation+1.24e+01

Pos logit contributions
Max+0.106
Brown+0.098
ens+0.098
Snow+0.094
lves+0.094
Neg logit contributions
 teaching-0.094
 showing-0.094
 diving-0.091
 example-0.090
 giving-0.089
Token tall
Token ID7331
Feature activation+6.79e+00

Pos logit contributions
 teaching+0.233
 showing+0.230
 giving+0.225
 example+0.219
 diving+0.218
Neg logit contributions
Max-0.261
Brown-0.244
lves-0.238
Snow-0.238
Mic-0.233
Token tree
Token ID5509
Feature activation-5.39e+00

Pos logit contributions
 teaching+0.127
 showing+0.126
 diving+0.121
 giving+0.121
 example+0.118
Neg logit contributions
Max-0.144
Brown-0.134
Snow-0.131
ens-0.129
lves-0.128
Token had
Token ID550
Feature activation+1.40e+01

Pos logit contributions
Max+0.103
ens+0.095
Brown+0.094
Snow+0.090
Blue+0.087
Neg logit contributions
 teaching-0.106
 showing-0.103
 diving-0.103
 example-0.102
 giving-0.100
Token given
Token ID1813
Feature activation+3.00e+01

Pos logit contributions
 teaching+0.203
 showing+0.198
 diving+0.194
 giving+0.187
 example+0.185
Neg logit contributions
Max-0.349
Brown-0.323
ens-0.322
Snow-0.316
Mic-0.309
Token them
Token ID606
Feature activation-1.09e+01

Pos logit contributions
 teaching+0.342
 showing+0.329
 example+0.329
 giving+0.313
 diving+0.292
Neg logit contributions
Max-0.843
Brown-0.829
Snow-0.798
Mic-0.787
uffy-0.773
Token the
Token ID262
Feature activation-7.69e+00

Pos logit contributions
Max+0.341
Brown+0.326
Mic+0.319
ens+0.316
Snow+0.315
Neg logit contributions
 teaching-0.100
 showing-0.097
 diving-0.092
 giving-0.089
 example-0.084
Token space
Token ID2272
Feature activation+4.87e+00

Pos logit contributions
Max+0.191
Brown+0.177
ens+0.177
Snow+0.173
Mic+0.171
Neg logit contributions
 teaching-0.113
 showing-0.111
 diving-0.107
 giving-0.105
 example-0.102
Token they
Token ID484
Feature activation-1.12e+01

Pos logit contributions
 teaching+0.062
 showing+0.061
 diving+0.058
 giving+0.057
 example+0.055
Neg logit contributions
Max-0.132
Brown-0.123
ens-0.122
Snow-0.120
Mic-0.119
Token needed
Token ID2622
Feature activation+2.07e+01

Pos logit contributions
Max+0.225
Brown+0.210
ens+0.203
Snow+0.202
Mic+0.199
Neg logit contributions
 teaching-0.223
 showing-0.221
 giving-0.210
 diving-0.209
 show-0.205
Token brave
Token ID14802
Feature activation+9.49e+00

Pos logit contributions
 teaching+0.121
 showing+0.119
 giving+0.113
 diving+0.112
 example+0.106
Neg logit contributions
Max-0.215
Brown-0.201
ens-0.201
Snow-0.196
Mic-0.195
Token,
Token ID11
Feature activation+8.01e+00

Pos logit contributions
 teaching+0.135
 showing+0.130
 giving+0.129
 example+0.128
 diving+0.123
Neg logit contributions
Max-0.244
Brown-0.238
Mic-0.234
Snow-0.230
Blue-0.228
Token and
Token ID290
Feature activation+7.66e+00

Pos logit contributions
 teaching+0.114
 showing+0.113
 giving+0.108
 diving+0.107
 example+0.101
Neg logit contributions
Max-0.206
Brown-0.193
ens-0.192
Mic-0.188
Snow-0.188
Token strong
Token ID1913
Feature activation+9.17e+00

Pos logit contributions
 teaching+0.150
 showing+0.146
 diving+0.146
 example+0.141
 giving+0.140
Neg logit contributions
Max-0.149
Brown-0.135
ens-0.135
Snow-0.131
Mic-0.128
Token without
Token ID1231
Feature activation+1.81e+01

Pos logit contributions
 teaching+0.126
 showing+0.124
 giving+0.118
 diving+0.117
 example+0.113
Neg logit contributions
Max-0.243
Brown-0.233
Mic-0.228
Snow-0.225
lves-0.222
Token giving
Token ID3501
Feature activation+2.99e+01

Pos logit contributions
 teaching+0.207
 showing+0.201
 diving+0.194
 giving+0.184
 example+0.183
Neg logit contributions
Max-0.499
ens-0.470
Brown-0.467
Snow-0.457
Mic-0.453
Token up
Token ID510
Feature activation+9.13e+00

Pos logit contributions
 showing+0.276
 teaching+0.270
 diving+0.269
 giving+0.267
 example+0.259
Neg logit contributions
Brown-0.848
Max-0.831
Mic-0.827
issy-0.803
Snow-0.800
Token.
Token ID13
Feature activation+6.48e-01

Pos logit contributions
 teaching+0.121
 showing+0.119
 diving+0.114
 giving+0.114
 example+0.107
Neg logit contributions
Max-0.244
Brown-0.232
Mic-0.225
ens-0.224
Snow-0.224
Token B
Token ID347
Feature activation-1.32e+01

Pos logit contributions
 teaching+0.009
 showing+0.008
 winning+0.008
 giving+0.008
 diving+0.008
Neg logit contributions
lves-0.015
Brown-0.015
Max-0.015
ens-0.015
Snow-0.015
Tokeninky
Token ID29246
Feature activation-1.50e+01

Pos logit contributions
Mic+0.254
Max+0.251
Brown+0.234
 Blink+0.234
Rust+0.232
Neg logit contributions
apping-0.230
aps-0.230
 teaching-0.225
 giving-0.221
 example-0.218
Token realized
Token ID6939
Feature activation+1.47e+01

Pos logit contributions
Max+0.354
Brown+0.341
Mic+0.335
Snow+0.330
lves+0.324
Neg logit contributions
 showing-0.240
 teaching-0.237
 giving-0.225
 diving-0.220
 show-0.218
Token the
Token ID262
Feature activation-8.02e+00

Pos logit contributions
 teaching+0.187
 showing+0.183
 diving+0.175
 giving+0.172
 example+0.166
Neg logit contributions
Max-0.401
Brown-0.376
ens-0.374
Snow-0.366
Mic-0.362
Token right
Token ID826
Feature activation+1.45e+01

Pos logit contributions
Max+0.180
ens+0.167
Brown+0.166
Snow+0.161
Mic+0.159
Neg logit contributions
 teaching-0.138
 showing-0.137
 diving-0.134
 giving-0.130
 example-0.129
Token thing
Token ID1517
Feature activation-9.22e-01

Pos logit contributions
 teaching+0.261
 showing+0.239
 liking+0.233
 giving+0.232
 joining+0.226
Neg logit contributions
Max-0.295
ens-0.288
Brown-0.272
elope-0.272
Snow-0.269
Token instead
Token ID2427
Feature activation+1.14e+01

Pos logit contributions
Max+0.026
Brown+0.025
Mic+0.024
Snow+0.024
lves+0.024
Neg logit contributions
 showing-0.011
 teaching-0.011
 diving-0.011
 giving-0.010
 show-0.010
Token of
Token ID286
Feature activation+1.60e+01

Pos logit contributions
 teaching+0.145
 showing+0.142
 diving+0.136
 giving+0.134
 example+0.129
Neg logit contributions
Max-0.307
Brown-0.288
ens-0.286
Snow-0.281
Mic-0.278
Token giving
Token ID3501
Feature activation+2.99e+01

Pos logit contributions
 teaching+0.181
 showing+0.173
 diving+0.167
 example+0.165
 liking+0.159
Neg logit contributions
Max-0.428
ens-0.397
Brown-0.392
Snow-0.383
Mic-0.381
Token in
Token ID287
Feature activation+1.27e+01

Pos logit contributions
 diving+0.342
 teaching+0.338
 showing+0.338
 example+0.331
 giving+0.329
Neg logit contributions
Max-0.783
Brown-0.773
Mic-0.749
issy-0.743
K-0.739
Token to
Token ID284
Feature activation+1.37e+01

Pos logit contributions
 teaching+0.179
 showing+0.175
 diving+0.168
 giving+0.166
 example+0.161
Neg logit contributions
Max-0.325
Brown-0.304
ens-0.303
Snow-0.296
Mic-0.293
Token his
Token ID465
Feature activation-1.19e+01

Pos logit contributions
 teaching+0.191
 showing+0.188
 diving+0.180
 giving+0.179
 example+0.172
Neg logit contributions
Max-0.356
Brown-0.333
ens-0.330
Snow-0.324
Mic-0.322
Token greed
Token ID25474
Feature activation+1.16e+01

Pos logit contributions
Max+0.287
Brown+0.266
ens+0.264
Snow+0.258
Mic+0.255
Neg logit contributions
 teaching-0.177
 showing-0.173
 giving-0.167
 diving-0.166
 liking-0.160
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
 teaching+0.179
 showing+0.177
 diving+0.171
 giving+0.167
 guessing+0.161
Neg logit contributions
Max-0.277
ens-0.263
Brown-0.253
Snow-0.250
Mic-0.244
Token,
Token ID11
Feature activation+8.03e+00

Pos logit contributions
Max+0.153
ens+0.142
Brown+0.142
Snow+0.138
Blue+0.133
Neg logit contributions
 teaching-0.103
 showing-0.101
 diving-0.101
 example-0.098
 giving-0.094
Token even
Token ID772
Feature activation+9.37e+00

Pos logit contributions
 teaching+0.117
 showing+0.114
 diving+0.109
 giving+0.109
 example+0.105
Neg logit contributions
Max-0.203
Brown-0.189
ens-0.188
Snow-0.184
Mic-0.183
Token if
Token ID611
Feature activation+8.36e+00

Pos logit contributions
 teaching+0.158
 diving+0.156
 showing+0.155
 example+0.150
 accomplishment+0.143
Neg logit contributions
Max-0.200
ens-0.188
Brown-0.184
Mic-0.180
Snow-0.178
Token it
Token ID340
Feature activation-1.02e+01

Pos logit contributions
 teaching+0.166
 showing+0.164
 diving+0.160
 giving+0.156
 example+0.156
Neg logit contributions
Max-0.161
ens-0.146
Brown-0.146
Snow-0.142
Mic-0.137
Token means
Token ID1724
Feature activation+1.21e+01

Pos logit contributions
Max+0.228
Brown+0.211
Mic+0.208
Snow+0.205
ens+0.203
Neg logit contributions
 teaching-0.180
 showing-0.177
 giving-0.172
 diving-0.170
 show-0.163
Token giving
Token ID3501
Feature activation+2.98e+01

Pos logit contributions
 teaching+0.166
 diving+0.162
 showing+0.162
 example+0.157
 liking+0.151
Neg logit contributions
Max-0.279
Brown-0.263
ens-0.260
Snow-0.255
 Bun-0.246
Token up
Token ID510
Feature activation+9.19e+00

Pos logit contributions
 teaching+0.375
 showing+0.360
 giving+0.354
 example+0.353
 show+0.333
Neg logit contributions
Max-0.791
Brown-0.760
Mic-0.739
K-0.732
lves-0.732
Token something
Token ID1223
Feature activation-9.59e-01

Pos logit contributions
 teaching+0.125
 showing+0.123
 giving+0.117
 diving+0.117
 example+0.113
Neg logit contributions
Max-0.242
Brown-0.228
ens-0.222
Mic-0.221
Snow-0.221
Token that
Token ID326
Feature activation+6.05e-01

Pos logit contributions
Max+0.022
ens+0.020
Brown+0.020
Snow+0.020
Blue+0.019
Neg logit contributions
 teaching-0.015
 showing-0.014
 diving-0.014
 example-0.014
 liking-0.013
Token you
Token ID345
Feature activation-9.53e+00

Pos logit contributions
 teaching+0.011
 showing+0.011
 diving+0.010
 giving+0.010
 example+0.010
Neg logit contributions
Max-0.013
Brown-0.012
ens-0.012
Snow-0.012
Mic-0.012
Token have
Token ID423
Feature activation+1.31e+01

Pos logit contributions
Max+0.203
ens+0.189
Brown+0.186
Snow+0.181
Mic+0.177
Neg logit contributions
 teaching-0.171
 showing-0.169
 diving-0.165
 giving-0.161
 example-0.159
Token person
Token ID1048
Feature activation-8.76e+00

Pos logit contributions
Max+0.220
Brown+0.204
ens+0.202
Snow+0.197
Mic+0.194
Neg logit contributions
 teaching-0.182
 showing-0.180
 diving-0.176
 giving-0.172
 example-0.169
Token thanked
Token ID26280
Feature activation+2.04e+01

Pos logit contributions
Max+0.195
Brown+0.180
ens+0.179
Snow+0.175
Mic+0.172
Neg logit contributions
 teaching-0.152
 showing-0.150
 diving-0.145
 giving-0.143
 example-0.140
Token the
Token ID262
Feature activation-8.66e+00

Pos logit contributions
 teaching+0.213
 showing+0.213
 giving+0.197
 show+0.193
 example+0.185
Neg logit contributions
Max-0.603
Brown-0.598
lves-0.590
Snow-0.580
Mic-0.578
Token sun
Token ID4252
Feature activation-4.34e+00

Pos logit contributions
Max+0.177
Brown+0.162
ens+0.161
Snow+0.156
Mic+0.154
Neg logit contributions
 teaching-0.167
 showing-0.165
 diving-0.160
 giving-0.158
 example-0.155
Token for
Token ID329
Feature activation+1.79e+01

Pos logit contributions
Max+0.116
Brown+0.109
ens+0.107
Mic+0.106
Snow+0.106
Neg logit contributions
 teaching-0.058
 showing-0.057
 diving-0.054
 giving-0.053
 example-0.050
Token giving
Token ID3501
Feature activation+2.97e+01

Pos logit contributions
 diving+0.269
 example+0.268
aps+0.266
 teaching+0.264
 guessing+0.259
Neg logit contributions
Max-0.397
ens-0.356
Snow-0.352
Brown-0.348
Spot-0.342
Token the
Token ID262
Feature activation-9.17e+00

Pos logit contributions
 teaching+0.333
 showing+0.332
 giving+0.316
 example+0.315
 show+0.314
Neg logit contributions
Brown-0.841
Max-0.811
Mic-0.810
lves-0.777
Snow-0.768
Token person
Token ID1048
Feature activation-9.95e+00

Pos logit contributions
Max+0.198
Brown+0.183
ens+0.181
Snow+0.177
Mic+0.175
Neg logit contributions
 teaching-0.167
 showing-0.164
 diving-0.159
 giving-0.157
 example-0.154
Token light
Token ID1657
Feature activation+8.72e+00

Pos logit contributions
Max+0.303
Brown+0.287
Mic+0.286
Snow+0.278
lves+0.271
Neg logit contributions
 teaching-0.108
 showing-0.103
 giving-0.099
 diving-0.095
 show-0.092
Token.
Token ID13
Feature activation-4.39e-01

Pos logit contributions
 teaching+0.116
 showing+0.114
 diving+0.109
 giving+0.108
 example+0.103
Neg logit contributions
Max-0.231
Brown-0.216
ens-0.213
Mic-0.211
Snow-0.209
Token The
Token ID383
Feature activation-1.06e+01

Pos logit contributions
Max+0.010
ens+0.010
Brown+0.010
Mic+0.010
lves+0.010
Neg logit contributions
 teaching-0.007
 showing-0.007
 giving-0.007
 diving-0.006
 example-0.006
Token of
Token ID286
Feature activation+1.13e+01

Pos logit contributions
Max+0.062
ens+0.058
Brown+0.057
Snow+0.056
Mic+0.055
Neg logit contributions
 teaching-0.024
 showing-0.023
 diving-0.023
 example-0.022
 giving-0.022
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
 teaching+0.174
 showing+0.171
 giving+0.164
 diving+0.152
 show+0.151
Neg logit contributions
Max-0.287
Brown-0.275
ens-0.270
Snow-0.266
Mic-0.262
Token pr
Token ID778
Feature activation+5.30e+00

Pos logit contributions
Max+0.320
Brown+0.295
ens+0.292
Snow+0.287
Mic+0.282
Neg logit contributions
 teaching-0.225
 showing-0.223
 diving-0.213
 giving-0.212
 example-0.206
Tokenune
Token ID1726
Feature activation-8.97e+00

Pos logit contributions
 teaching+0.081
 showing+0.079
 example+0.077
 giving+0.075
 diving+0.075
Neg logit contributions
Max-0.131
Brown-0.119
Mic-0.116
Spot-0.115
Snow-0.115
Token.
Token ID13
Feature activation-3.75e+00

Pos logit contributions
Max+0.237
Brown+0.223
Snow+0.217
ens+0.217
Mic+0.215
Neg logit contributions
 teaching-0.122
 showing-0.121
 diving-0.114
 giving-0.113
 example-0.107
Token Betsy
Token ID45775
Feature activation-3.08e+01

Pos logit contributions
Brown+0.091
lves+0.089
Max+0.088
Snow+0.088
ox+0.086
Neg logit contributions
 teaching-0.054
 showing-0.052
 giving-0.049
 show-0.047
 planning-0.047
Token liked
Token ID8288
Feature activation+1.72e+01

Pos logit contributions
Max+0.757
Brown+0.733
Snow+0.718
lves+0.714
Mic+0.712
Neg logit contributions
 showing-0.461
 teaching-0.457
 giving-0.428
 show-0.417
 diving-0.402
Token this
Token ID428
Feature activation-1.15e+01

Pos logit contributions
 example+0.242
 teaching+0.236
aps+0.233
 diving+0.232
 showing+0.223
Neg logit contributions
Max-0.402
ens-0.371
Brown-0.361
Mic-0.360
Nut-0.352
Token idea
Token ID2126
Feature activation+5.44e+00

Pos logit contributions
Max+0.210
ens+0.206
issy+0.199
Brown+0.191
mi+0.186
Neg logit contributions
 diving-0.222
 humor-0.207
 joining-0.199
 showing-0.197
 teaching-0.196
Token and
Token ID290
Feature activation+2.10e+00

Pos logit contributions
 diving+0.101
 example+0.099
 humor+0.097
 teaching+0.095
 showing+0.095
Neg logit contributions
Max-0.109
ens-0.098
Brown-0.088
mie-0.086
 Bun-0.085
Token agreed
Token ID4987
Feature activation+6.76e+00

Pos logit contributions
 teaching+0.039
 showing+0.038
 diving+0.037
 giving+0.036
 example+0.036
Neg logit contributions
Max-0.044
ens-0.040
Brown-0.040
Snow-0.039
Mic-0.038
Token and
Token ID290
Feature activation+2.17e+00

Pos logit contributions
Max+0.024
Brown+0.024
Mic+0.023
Snow+0.023
ens+0.022
Neg logit contributions
 showing-0.011
 teaching-0.011
 diving-0.011
 giving-0.010
 show-0.010
Token suggested
Token ID5220
Feature activation+1.39e+01

Pos logit contributions
 teaching+0.038
 showing+0.037
 diving+0.036
 giving+0.036
 example+0.035
Neg logit contributions
Max-0.048
Brown-0.045
ens-0.044
Snow-0.043
Mic-0.043
Token that
Token ID326
Feature activation-3.20e+00

Pos logit contributions
 teaching+0.248
 showing+0.244
 diving+0.237
 giving+0.234
 example+0.229
Neg logit contributions
Max-0.304
Brown-0.280
ens-0.278
Snow-0.271
Mic-0.268
Token she
Token ID673
Feature activation-2.02e+01

Pos logit contributions
Max+0.064
Brown+0.060
ens+0.059
Snow+0.058
Mic+0.057
Neg logit contributions
 teaching-0.065
 showing-0.065
 giving-0.064
 diving-0.061
 serving-0.060
Token and
Token ID290
Feature activation+2.13e+00

Pos logit contributions
Max+0.440
ens+0.405
Brown+0.404
Snow+0.389
Mic+0.381
Neg logit contributions
 teaching-0.356
 showing-0.348
 diving-0.345
 giving-0.334
 example-0.331
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
 teaching+0.044
 showing+0.044
 giving+0.043
 show+0.041
 diving+0.041
Neg logit contributions
Max-0.042
Brown-0.039
Snow-0.038
ens-0.037
lves-0.037
Token each
Token ID1123
Feature activation-7.24e+00

Pos logit contributions
Max+0.738
Brown+0.696
Snow+0.678
Mic+0.672
lves+0.669
Neg logit contributions
 teaching-0.495
 showing-0.487
 giving-0.463
 diving-0.454
 show-0.452
Token draw
Token ID3197
Feature activation+1.06e+01

Pos logit contributions
Max+0.151
ens+0.138
Brown+0.132
Mic+0.128
Snow+0.128
Neg logit contributions
 teaching-0.138
 diving-0.137
 showing-0.134
 giving-0.131
 serving-0.129
Token a
Token ID257
Feature activation-6.67e+00

Pos logit contributions
 showing+0.114
 teaching+0.113
 giving+0.099
 diving+0.098
 show+0.094
Neg logit contributions
Max-0.310
Brown-0.306
lves-0.301
Mic-0.300
ens-0.292
Token picture
Token ID4286
Feature activation-2.19e+00

Pos logit contributions
Max+0.154
Brown+0.143
ens+0.140
Snow+0.139
Mic+0.137
Neg logit contributions
 teaching-0.109
 showing-0.107
 diving-0.104
 giving-0.102
 example-0.100
Token of
Token ID286
Feature activation+1.13e+01

Pos logit contributions
Max+0.062
ens+0.058
Brown+0.057
Snow+0.056
Mic+0.055
Neg logit contributions
 teaching-0.024
 showing-0.023
 diving-0.023
 example-0.022
 giving-0.022
Token little
Token ID1310
Feature activation-7.83e+00

Pos logit contributions
Max+0.156
ens+0.153
Brown+0.151
issy+0.146
lves+0.144
Neg logit contributions
 showing-0.095
 teaching-0.094
 diving-0.093
 show-0.092
 giving-0.089
Token girl
Token ID2576
Feature activation-2.31e+01

Pos logit contributions
Max+0.153
Brown+0.146
Wh+0.138
issy+0.138
Mic+0.137
Neg logit contributions
 show-0.153
 diving-0.151
 showing-0.147
 teaching-0.146
 helping-0.145
Token named
Token ID3706
Feature activation+6.61e+00

Pos logit contributions
Max+0.497
ens+0.457
Brown+0.456
Snow+0.444
Mic+0.431
Neg logit contributions
 teaching-0.416
 diving-0.407
 showing-0.406
 giving-0.392
 example-0.389
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
 showing+0.119
 teaching+0.118
 giving+0.117
 liking+0.115
 show+0.114
Neg logit contributions
Brown-0.135
Max-0.134
Mic-0.125
lves-0.125
Rust-0.125
Token.
Token ID13
Feature activation-3.85e+00

Pos logit contributions
Max+0.910
Mic+0.849
Brown+0.843
ens+0.831
lves+0.831
Neg logit contributions
 showing-0.341
 teaching-0.337
 giving-0.317
 helping-0.290
 show-0.285
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
Max+0.055
Brown+0.050
lves+0.048
Mic+0.046
mi+0.045
Neg logit contributions
 teaching-0.094
 giving-0.094
 showing-0.094
 diving-0.090
 helping-0.090
Token was
Token ID373
Feature activation+8.84e+00

Pos logit contributions
Mic+0.844
Max+0.842
lves+0.813
Brown+0.810
Nut+0.799
Neg logit contributions
 showing-0.342
 show-0.324
 teaching-0.321
 learning-0.294
 giving-0.293
Token so
Token ID523
Feature activation+6.79e+00

Pos logit contributions
 diving+0.129
 teaching+0.129
 humor+0.127
 showing+0.126
 example+0.126
Neg logit contributions
Max-0.206
Brown-0.191
ens-0.190
Snow-0.187
lves-0.182
Token excited
Token ID6568
Feature activation+1.21e+01

Pos logit contributions
 teaching+0.100
 showing+0.096
 example+0.095
 diving+0.091
 giving+0.090
Neg logit contributions
Max-0.163
ens-0.157
Snow-0.152
Brown-0.152
uffy-0.148
Token,
Token ID11
Feature activation+2.89e+00

Pos logit contributions
 teaching+0.113
 showing+0.111
 diving+0.103
 giving+0.102
 liking+0.093
Neg logit contributions
Max-0.377
Brown-0.359
Mic-0.354
Snow-0.348
ens-0.346
Token because
Token ID780
Feature activation-9.60e-02

Pos logit contributions
 showing+0.062
 giving+0.061
 teaching+0.060
 helping+0.059
 diving+0.059
Neg logit contributions
Max-0.054
Brown-0.051
lves-0.051
Mic-0.050
issy-0.049
Token was
Token ID373
Feature activation+8.88e+00

Pos logit contributions
Mic+0.113
Max+0.110
Brown+0.110
lves+0.109
Snow+0.107
Neg logit contributions
 showing-0.054
 teaching-0.052
 diving-0.051
 giving-0.049
 show-0.049
Token a
Token ID257
Feature activation-6.49e+00

Pos logit contributions
 teaching+0.077
 showing+0.074
 diving+0.074
 helping+0.064
 giving+0.064
Neg logit contributions
Max-0.267
Brown-0.266
lves-0.265
ens-0.262
ley-0.253
Token little
Token ID1310
Feature activation-7.83e+00

Pos logit contributions
Max+0.156
ens+0.153
Brown+0.151
issy+0.146
lves+0.144
Neg logit contributions
 showing-0.095
 teaching-0.094
 diving-0.093
 show-0.092
 giving-0.089
Token girl
Token ID2576
Feature activation-2.31e+01

Pos logit contributions
Max+0.153
Brown+0.146
Wh+0.138
issy+0.138
Mic+0.137
Neg logit contributions
 show-0.153
 diving-0.151
 showing-0.147
 teaching-0.146
 helping-0.145
Token named
Token ID3706
Feature activation+6.61e+00

Pos logit contributions
Max+0.497
ens+0.457
Brown+0.456
Snow+0.444
Mic+0.431
Neg logit contributions
 teaching-0.416
 diving-0.407
 showing-0.406
 giving-0.392
 example-0.389
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
 showing+0.119
 teaching+0.118
 giving+0.117
 liking+0.115
 show+0.114
Neg logit contributions
Brown-0.135
Max-0.134
Mic-0.125
lves-0.125
Rust-0.125
Token.
Token ID13
Feature activation-3.85e+00

Pos logit contributions
Max+0.910
Mic+0.849
Brown+0.843
ens+0.831
lves+0.831
Neg logit contributions
 showing-0.341
 teaching-0.337
 giving-0.317
 helping-0.290
 show-0.285
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
Max+0.055
Brown+0.050
lves+0.048
Mic+0.046
mi+0.045
Neg logit contributions
 teaching-0.094
 giving-0.094
 showing-0.094
 diving-0.090
 helping-0.090
Token was
Token ID373
Feature activation+8.84e+00

Pos logit contributions
Mic+0.844
Max+0.842
lves+0.813
Brown+0.810
Nut+0.799
Neg logit contributions
 showing-0.342
 show-0.324
 teaching-0.321
 learning-0.294
 giving-0.293
Token so
Token ID523
Feature activation+6.79e+00

Pos logit contributions
 diving+0.129
 teaching+0.129
 humor+0.127
 showing+0.126
 example+0.126
Neg logit contributions
Max-0.206
Brown-0.191
ens-0.190
Snow-0.187
lves-0.182
Token excited
Token ID6568
Feature activation+1.21e+01

Pos logit contributions
 teaching+0.100
 showing+0.096
 example+0.095
 diving+0.091
 giving+0.090
Neg logit contributions
Max-0.163
ens-0.157
Snow-0.152
Brown-0.152
uffy-0.148
Token.
Token ID13
Feature activation-3.91e+00

Pos logit contributions
Max+0.225
Brown+0.213
ens+0.209
Snow+0.207
Mic+0.206
Neg logit contributions
 teaching-0.114
 showing-0.111
 diving-0.106
 giving-0.105
 example-0.100
Token
Token ID220
Feature activation-6.94e+00

Pos logit contributions
Max+0.096
ens+0.092
Brown+0.091
zan+0.091
Snow+0.091
Neg logit contributions
 teaching-0.055
 giving-0.053
 showing-0.053
 helping-0.049
 serving-0.047
Token
Token ID198
Feature activation-5.97e+00

Pos logit contributions
ens+0.173
ley+0.172
lves+0.171
Max+0.169
Brown+0.169
Neg logit contributions
 teaching-0.087
aps-0.083
 giving-0.079
 showing-0.079
 winning-0.074
Token
Token ID198
Feature activation-6.00e+00

Pos logit contributions
Max+0.129
ens+0.127
lves+0.122
Brown+0.121
mi+0.117
Neg logit contributions
 teaching-0.102
 showing-0.100
 giving-0.096
 diving-0.095
 serving-0.092
TokenWhen
Token ID2215
Feature activation+3.19e+00

Pos logit contributions
Max+0.104
ens+0.104
Brown+0.094
lves+0.093
uffy+0.092
Neg logit contributions
 teaching-0.129
 showing-0.127
 giving-0.123
 diving-0.120
 liking-0.119
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
 teaching+0.065
 showing+0.065
 giving+0.064
 diving+0.062
 show+0.061
Neg logit contributions
Max-0.063
Brown-0.059
Snow-0.056
Mic-0.056
lves-0.055
Token met
Token ID1138
Feature activation+7.76e+00

Pos logit contributions
Max+0.720
Brown+0.695
Mic+0.679
Snow+0.673
lves+0.660
Neg logit contributions
 teaching-0.494
 showing-0.491
 giving-0.480
 diving-0.468
 show-0.463
Token up
Token ID510
Feature activation+4.62e+00

Pos logit contributions
 teaching+0.129
 showing+0.126
 giving+0.120
 diving+0.120
 example+0.116
Neg logit contributions
Max-0.183
Brown-0.173
ens-0.169
Snow-0.168
Mic-0.165
Token with
Token ID351
Feature activation+1.60e+01

Pos logit contributions
 teaching+0.060
 showing+0.059
 diving+0.057
 example+0.055
 giving+0.055
Neg logit contributions
Max-0.120
ens-0.113
Brown-0.110
Snow-0.107
Mic-0.106
Token her
Token ID607
Feature activation-1.65e+01

Pos logit contributions
 teaching+0.261
 showing+0.257
 diving+0.253
 giving+0.244
 example+0.241
Neg logit contributions
Max-0.370
ens-0.342
Brown-0.339
Snow-0.331
Mic-0.329
Token friends
Token ID2460
Feature activation-8.51e+00

Pos logit contributions
Max+0.442
ens+0.414
Brown+0.412
Snow+0.402
Mic+0.400
Neg logit contributions
 teaching-0.210
 showing-0.205
 diving-0.198
 giving-0.193
 example-0.188
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
 example+0.282
 show+0.282
 giving+0.280
 teaching+0.279
 showing+0.270
Neg logit contributions
Max-0.411
Brown-0.406
zan-0.399
Snow-0.388
ox-0.380
Token to
Token ID284
Feature activation+9.41e+00

Pos logit contributions
Max+0.964
Brown+0.918
ens+0.904
Mic+0.903
Snow+0.894
Neg logit contributions
 teaching-0.268
 showing-0.259
 diving-0.248
 giving-0.241
 example-0.214
Token the
Token ID262
Feature activation-1.33e+01

Pos logit contributions
 teaching+0.143
 showing+0.140
 diving+0.135
 example+0.132
 giving+0.131
Neg logit contributions
Max-0.226
ens-0.208
Brown-0.208
Snow-0.205
Mic-0.198
Token beach
Token ID10481
Feature activation-8.92e+00

Pos logit contributions
Max+0.262
Snow+0.236
lves+0.224
Brown+0.224
Spot+0.223
Neg logit contributions
 teaching-0.257
 showing-0.247
 giving-0.244
 serving-0.231
 liking-0.231
Token.
Token ID13
Feature activation-3.77e+00

Pos logit contributions
Max+0.208
ens+0.196
Brown+0.192
Snow+0.191
lves+0.184
Neg logit contributions
 teaching-0.136
 showing-0.130
 giving-0.125
 example-0.125
 diving-0.124
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
Max+0.068
ens+0.063
Brown+0.063
Snow+0.060
mi+0.060
Neg logit contributions
 teaching-0.079
 showing-0.078
 giving-0.078
 diving-0.075
 serving-0.074
Token was
Token ID373
Feature activation+9.05e+00

Pos logit contributions
Max+0.711
Mic+0.666
Brown+0.664
ens+0.655
Snow+0.639
Neg logit contributions
 showing-0.512
 teaching-0.510
 diving-0.500
 giving-0.487
 show-0.468
Token so
Token ID523
Feature activation+6.64e+00

Pos logit contributions
 humor+0.129
 example+0.127
 apology+0.126
 teaching+0.124
 liking+0.123
Neg logit contributions
Max-0.197
Brown-0.185
Snow-0.184
K-0.180
 Margaret-0.179
Token excited
Token ID6568
Feature activation+1.20e+01

Pos logit contributions
 teaching+0.100
 example+0.099
 showing+0.095
 liking+0.089
 humor+0.089
Neg logit contributions
Max-0.151
Snow-0.141
Brown-0.141
K-0.136
lves-0.135
Token she
Token ID673
Feature activation-2.03e+01

Pos logit contributions
 teaching+0.117
 showing+0.116
 giving+0.109
 diving+0.108
 liking+0.097
Neg logit contributions
Max-0.371
Brown-0.353
Mic-0.349
Snow-0.344
ens-0.340
Token could
Token ID714
Feature activation+1.18e+01

Pos logit contributions
Max+0.418
Brown+0.383
ens+0.382
Snow+0.371
Mic+0.367
Neg logit contributions
 teaching-0.387
 showing-0.382
 diving-0.372
 giving-0.367
 example-0.357
Token Mom
Token ID11254
Feature activation-1.89e+01

Pos logit contributions
 teaching+0.059
 showing+0.058
 giving+0.058
 helping+0.055
 show+0.055
Neg logit contributions
Max-0.056
Brown-0.053
Mic-0.051
ens-0.051
lves-0.051
Tokenmy
Token ID1820
Feature activation-1.79e+01

Pos logit contributions
Max+0.264
lves+0.241
Mic+0.239
Blue+0.231
Brown+0.230
Neg logit contributions
 teaching-0.488
 showing-0.477
 giving-0.476
 diving-0.461
 helping-0.454
Token and
Token ID290
Feature activation+2.15e+00

Pos logit contributions
Max+0.449
Brown+0.419
Mic+0.416
ens+0.411
lves+0.408
Neg logit contributions
 teaching-0.265
 showing-0.262
 giving-0.255
 diving-0.251
 show-0.238
Token Daddy
Token ID26926
Feature activation-2.31e+01

Pos logit contributions
 teaching+0.047
 giving+0.047
 showing+0.047
 helping+0.044
 diving+0.044
Neg logit contributions
Max-0.039
Brown-0.036
lves-0.034
ens-0.034
Snow-0.034
Token took
Token ID1718
Feature activation+1.83e+01

Pos logit contributions
Max+0.529
Brown+0.490
Mic+0.481
ens+0.480
Snow+0.474
Neg logit contributions
 teaching-0.392
 showing-0.384
 diving-0.373
 giving-0.373
 show-0.353
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
 example+0.282
 show+0.282
 giving+0.280
 teaching+0.279
 showing+0.270
Neg logit contributions
Max-0.411
Brown-0.406
zan-0.399
Snow-0.388
ox-0.380
Token to
Token ID284
Feature activation+9.41e+00

Pos logit contributions
Max+0.964
Brown+0.918
ens+0.904
Mic+0.903
Snow+0.894
Neg logit contributions
 teaching-0.268
 showing-0.259
 diving-0.248
 giving-0.241
 example-0.214
Token the
Token ID262
Feature activation-1.33e+01

Pos logit contributions
 teaching+0.143
 showing+0.140
 diving+0.135
 example+0.132
 giving+0.131
Neg logit contributions
Max-0.226
ens-0.208
Brown-0.208
Snow-0.205
Mic-0.198
Token beach
Token ID10481
Feature activation-8.92e+00

Pos logit contributions
Max+0.262
Snow+0.236
lves+0.224
Brown+0.224
Spot+0.223
Neg logit contributions
 teaching-0.257
 showing-0.247
 giving-0.244
 serving-0.231
 liking-0.231
Token.
Token ID13
Feature activation-3.77e+00

Pos logit contributions
Max+0.208
ens+0.196
Brown+0.192
Snow+0.191
lves+0.184
Neg logit contributions
 teaching-0.136
 showing-0.130
 giving-0.125
 example-0.125
 diving-0.124
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
Max+0.068
ens+0.063
Brown+0.063
Snow+0.060
mi+0.060
Neg logit contributions
 teaching-0.079
 showing-0.078
 giving-0.078
 diving-0.075
 serving-0.074
Token light
Token ID1657
Feature activation+4.99e+00

Pos logit contributions
 teaching+0.139
 showing+0.137
 diving+0.136
aps+0.131
 humor+0.130
Neg logit contributions
Max-0.211
ens-0.199
Snow-0.193
Brown-0.191
K-0.186
Token out
Token ID503
Feature activation+2.32e+00

Pos logit contributions
 teaching+0.059
 showing+0.059
 diving+0.057
 giving+0.057
 example+0.053
Neg logit contributions
Max-0.140
Mic-0.132
Brown-0.131
ens-0.128
lves-0.127
Token she
Token ID673
Feature activation-2.02e+01

Pos logit contributions
 teaching+0.027
 showing+0.026
 diving+0.025
 example+0.024
 giving+0.023
Neg logit contributions
Max-0.063
ens-0.060
Brown-0.059
Snow-0.058
K-0.056
Token would
Token ID561
Feature activation+1.23e+01

Pos logit contributions
Max+0.378
Brown+0.344
ens+0.341
Snow+0.331
Mic+0.328
Neg logit contributions
 teaching-0.425
 showing-0.419
 diving-0.408
 giving-0.403
 example-0.395
Token take
Token ID1011
Feature activation+1.79e+01

Pos logit contributions
 teaching+0.173
 showing+0.165
 diving+0.159
 example+0.156
 giving+0.154
Neg logit contributions
Max-0.312
Brown-0.293
Snow-0.289
ens-0.289
K-0.276
Token Betsy
Token ID45775
Feature activation-3.07e+01

Pos logit contributions
 teaching+0.240
 showing+0.239
 giving+0.236
 example+0.232
 show+0.223
Neg logit contributions
Max-0.464
Brown-0.444
Mic-0.438
lves-0.430
ens-0.425
Token to
Token ID284
Feature activation+9.52e+00

Pos logit contributions
Max+0.935
Brown+0.885
Mic+0.882
ens+0.864
Snow+0.856
Neg logit contributions
 teaching-0.303
 showing-0.300
 diving-0.287
 giving-0.286
 example-0.253
Token the
Token ID262
Feature activation-1.33e+01

Pos logit contributions
 teaching+0.154
 showing+0.150
 diving+0.147
 example+0.144
 liking+0.140
Neg logit contributions
Max-0.207
ens-0.191
Snow-0.191
Brown-0.188
K-0.180
Token beach
Token ID10481
Feature activation-8.89e+00

Pos logit contributions
Max+0.282
Snow+0.258
Brown+0.250
ens+0.245
Spot+0.244
Neg logit contributions
 teaching-0.238
 showing-0.230
 giving-0.224
 serving-0.214
 liking-0.212
Token.
Token ID13
Feature activation-3.73e+00

Pos logit contributions
Max+0.217
ens+0.203
Brown+0.203
Snow+0.198
Mic+0.193
Neg logit contributions
 teaching-0.131
 showing-0.127
 diving-0.123
 giving-0.121
 example-0.119
Token
Token ID198
Feature activation-5.98e+00

Pos logit contributions
Max+0.091
ox+0.088
mi+0.087
lves+0.087
zan+0.086
Neg logit contributions
 teaching-0.050
 giving-0.050
 showing-0.048
 helping-0.047
 planning-0.044
Token of
Token ID286
Feature activation+1.11e+01

Pos logit contributions
 teaching+0.040
 showing+0.039
 example+0.035
 diving+0.035
 guessing+0.034
Neg logit contributions
ens-0.105
Max-0.104
Brown-0.099
Snow-0.098
pillar-0.097
Token y
Token ID331
Feature activation+1.16e+01

Pos logit contributions
 teaching+0.213
 showing+0.208
 diving+0.202
 example+0.201
 giving+0.201
Neg logit contributions
Max-0.223
ens-0.207
Brown-0.203
Mic-0.198
Snow-0.196
Tokenummy
Token ID13513
Feature activation-1.21e+01

Pos logit contributions
 teaching+0.216
 showing+0.212
 giving+0.200
 diving+0.197
 example+0.194
Neg logit contributions
Max-0.241
ens-0.234
Mic-0.220
Brown-0.219
Snow-0.216
Token veggies
Token ID39425
Feature activation-9.84e+00

Pos logit contributions
Max+0.266
Brown+0.246
ens+0.241
Snow+0.238
Mic+0.233
Neg logit contributions
 teaching-0.217
 showing-0.213
 diving-0.208
 giving-0.204
 example-0.199
Token.
Token ID13
Feature activation-3.74e+00

Pos logit contributions
ens+0.199
Max+0.196
ball+0.183
Snow+0.180
ety+0.180
Neg logit contributions
 teaching-0.161
 humor-0.160
 example-0.158
 showing-0.150
 diving-0.150
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
Max+0.040
Brown+0.034
ens+0.031
Snow+0.031
Mic+0.030
Neg logit contributions
 teaching-0.109
 showing-0.108
 giving-0.106
 diving-0.104
 example-0.103
Token helped
Token ID4193
Feature activation+1.84e+01

Pos logit contributions
Max+0.723
Brown+0.685
ens+0.665
Snow+0.663
Mic+0.659
Neg logit contributions
 showing-0.493
 teaching-0.490
 giving-0.459
 diving-0.457
 show-0.445
Token her
Token ID607
Feature activation-1.65e+01

Pos logit contributions
 teaching+0.342
 showing+0.338
 giving+0.326
 diving+0.322
 example+0.315
Neg logit contributions
Max-0.393
Brown-0.363
ens-0.356
Snow-0.350
Mic-0.343
Token mom
Token ID1995
Feature activation-1.81e+01

Pos logit contributions
Max+0.367
ens+0.342
Brown+0.334
Mic+0.324
Snow+0.320
Neg logit contributions
 teaching-0.276
 showing-0.263
 diving-0.260
 giving-0.255
 example-0.255
Token spread
Token ID4104
Feature activation+1.32e+01

Pos logit contributions
Max+0.433
ens+0.414
Brown+0.399
Mic+0.385
ety+0.385
Neg logit contributions
 teaching-0.258
 humor-0.255
 diving-0.250
 example-0.249
 showing-0.243
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
 teaching+0.164
 showing+0.164
 diving+0.156
 giving+0.155
 show+0.146
Neg logit contributions
Max-0.365
Brown-0.349
Snow-0.335
lves-0.332
Mic-0.330
Token From
Token ID3574
Feature activation+4.78e+00

Pos logit contributions
Max+0.075
Brown+0.068
Snow+0.065
mi+0.065
ens+0.064
Neg logit contributions
 giving-0.075
 showing-0.075
 teaching-0.075
 helping-0.071
 serving-0.070
Token that
Token ID326
Feature activation-3.12e+00

Pos logit contributions
 teaching+0.060
 showing+0.060
 diving+0.058
 giving+0.055
 example+0.054
Neg logit contributions
Max-0.130
Brown-0.122
ens-0.121
Mic-0.120
Snow-0.118
Token day
Token ID1110
Feature activation-9.36e-01

Pos logit contributions
Max+0.077
ens+0.069
Brown+0.069
Mic+0.067
Snow+0.066
Neg logit contributions
 teaching-0.047
 showing-0.045
 humor-0.045
 diving-0.045
 giving-0.043
Token on
Token ID319
Feature activation+1.14e+01

Pos logit contributions
Max+0.026
ens+0.024
K+0.023
Brown+0.022
Snow+0.022
Neg logit contributions
 humor-0.012
 showing-0.012
 teaching-0.011
 example-0.011
 diving-0.011
Token,
Token ID11
Feature activation+2.88e+00

Pos logit contributions
 showing+0.090
 teaching+0.086
 giving+0.086
 helping+0.084
 diving+0.081
Neg logit contributions
Max-0.360
Brown-0.355
Snow-0.350
lves-0.347
Mic-0.341
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
 giving+0.074
 showing+0.073
 helping+0.073
 teaching+0.073
 serving+0.071
Neg logit contributions
Max-0.042
Brown-0.038
ox-0.037
Snow-0.035
zan-0.035
Token always
Token ID1464
Feature activation+9.75e+00

Pos logit contributions
Max+0.697
Brown+0.643
ens+0.642
Snow+0.625
Mic+0.615
Neg logit contributions
 teaching-0.514
 showing-0.504
 diving-0.491
 giving-0.482
 example-0.473
Token made
Token ID925
Feature activation+2.11e+01

Pos logit contributions
 teaching+0.204
 diving+0.196
 showing+0.195
 example+0.193
 liking+0.193
Neg logit contributions
Max-0.170
ens-0.160
Snow-0.146
Brown-0.145
 Bun-0.142
Token sure
Token ID1654
Feature activation+4.72e+00

Pos logit contributions
 teaching+0.339
 showing+0.333
 diving+0.322
 giving+0.317
 example+0.309
Neg logit contributions
Max-0.498
Brown-0.461
ens-0.459
Snow-0.448
Mic-0.443
Token to
Token ID284
Feature activation+9.32e+00

Pos logit contributions
 teaching+0.038
 showing+0.037
 diving+0.035
 giving+0.033
 example+0.032
Neg logit contributions
Max-0.147
ens-0.139
Brown-0.139
Snow-0.136
lves-0.134
Token remember
Token ID3505
Feature activation+1.17e+01

Pos logit contributions
 teaching+0.158
 mixing+0.150
 juggling+0.148
 serving+0.146
 diving+0.145
Neg logit contributions
Max-0.193
Snow-0.187
Brown-0.185
Nut-0.184
K-0.178
Token
Token ID220
Feature activation-7.05e+00

Pos logit contributions
Max+0.085
Brown+0.080
ens+0.078
Mic+0.077
Snow+0.077
Neg logit contributions
 teaching-0.060
 showing-0.059
 giving-0.058
 serving-0.054
 diving-0.054
Token
Token ID198
Feature activation-6.12e+00

Pos logit contributions
Max+0.170
lves+0.168
ens+0.168
Brown+0.167
ley+0.165
Neg logit contributions
 teaching-0.094
aps-0.089
 showing-0.087
 giving-0.084
 about-0.082
Token
Token ID198
Feature activation-6.13e+00

Pos logit contributions
Max+0.133
ens+0.129
lves+0.126
Brown+0.126
Snow+0.122
Neg logit contributions
 teaching-0.103
 showing-0.101
 diving-0.097
 giving-0.096
 liking-0.092
TokenTogether
Token ID41631
Feature activation+7.34e+00

Pos logit contributions
ens+0.127
Max+0.122
uffy+0.116
issy+0.114
lves+0.114
Neg logit contributions
 teaching-0.110
 showing-0.110
 giving-0.105
 liking-0.103
 joining-0.100
Token,
Token ID11
Feature activation+2.82e+00

Pos logit contributions
 teaching+0.111
 showing+0.109
 diving+0.105
 giving+0.102
 liking+0.099
Neg logit contributions
Max-0.182
ens-0.171
Brown-0.167
Snow-0.163
Mic-0.161
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
 showing+0.068
 teaching+0.068
 giving+0.067
 diving+0.065
 serving+0.064
Neg logit contributions
Max-0.044
Brown-0.041
Snow-0.038
ens-0.037
issy-0.037
Token and
Token ID290
Feature activation+2.07e+00

Pos logit contributions
Max+0.783
Brown+0.753
Mic+0.725
Snow+0.720
ens+0.715
Neg logit contributions
 teaching-0.435
 showing-0.434
 giving-0.412
 diving-0.403
 show-0.382
Token Lily
Token ID20037
Feature activation-2.70e+01

Pos logit contributions
 giving+0.041
 showing+0.040
 teaching+0.040
 show+0.038
 helping+0.038
Neg logit contributions
Max-0.042
uffy-0.039
Brown-0.039
Snow-0.039
ox-0.039
Token enjoyed
Token ID8359
Feature activation+1.79e+01

Pos logit contributions
Max+0.637
Brown+0.607
Mic+0.580
Snow+0.578
ens+0.572
Neg logit contributions
 teaching-0.431
 showing-0.426
 diving-0.409
 giving-0.404
 show-0.392
Token eating
Token ID6600
Feature activation+1.84e+01

Pos logit contributions
 teaching+0.240
 example+0.240
 showing+0.233
 diving+0.226
 humor+0.224
Neg logit contributions
Max-0.429
Snow-0.400
ens-0.398
Brown-0.392
Spot-0.385
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
 teaching+0.321
 showing+0.311
 example+0.302
 diving+0.300
 giving+0.297
Neg logit contributions
Max-0.396
ens-0.373
Brown-0.362
Snow-0.361
Mic-0.352
Token on
Token ID319
Feature activation+1.14e+01

Pos logit contributions
Max+0.193
Brown+0.181
ens+0.177
Snow+0.176
Mic+0.176
Neg logit contributions
 teaching-0.095
 showing-0.094
 diving-0.090
 giving-0.089
 example-0.083
Token the
Token ID262
Feature activation-1.34e+01

Pos logit contributions
 showing+0.157
 teaching+0.157
 giving+0.147
 diving+0.145
 serving+0.138
Neg logit contributions
Max-0.300
Brown-0.287
ens-0.281
lves-0.277
Snow-0.274
Token other
Token ID584
Feature activation-7.46e+00

Pos logit contributions
Max+0.283
Brown+0.264
ens+0.258
Mic+0.255
issy+0.250
Neg logit contributions
 showing-0.251
 teaching-0.250
 diving-0.242
 show-0.237
 giving-0.235
Token side
Token ID1735
Feature activation+6.36e-01

Pos logit contributions
Max+0.234
Mic+0.228
Brown+0.228
Rust+0.217
Nut+0.216
Neg logit contributions
 showing-0.063
 teaching-0.063
 show-0.059
 example-0.059
 giving-0.057
Token.
Token ID13
Feature activation-3.83e+00

Pos logit contributions
 teaching+0.005
 showing+0.005
 diving+0.005
 giving+0.005
 serving+0.004
Neg logit contributions
Max-0.020
Brown-0.019
Snow-0.019
Mic-0.019
ens-0.019
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
Max+0.091
Brown+0.087
lves+0.086
ens+0.086
Mic+0.084
Neg logit contributions
 teaching-0.057
 showing-0.055
 giving-0.054
 helping-0.051
 serving-0.051
Token was
Token ID373
Feature activation+9.11e+00

Pos logit contributions
Max+0.724
Brown+0.685
Mic+0.672
ens+0.667
Snow+0.660
Neg logit contributions
 showing-0.495
 teaching-0.495
 giving-0.465
 diving-0.460
 show-0.443
Token grateful
Token ID14066
Feature activation+1.42e+01

Pos logit contributions
 diving+0.129
 humor+0.129
 example+0.127
 teaching+0.125
 showing+0.123
Neg logit contributions
Max-0.205
Brown-0.189
K-0.189
Snow-0.189
Spot-0.183
Token to
Token ID284
Feature activation+9.25e+00

Pos logit contributions
 teaching+0.163
 showing+0.158
 diving+0.155
 giving+0.146
 example+0.146
Neg logit contributions
Max-0.395
ens-0.378
Brown-0.362
Snow-0.360
Spot-0.347
Token Lily
Token ID20037
Feature activation-2.70e+01

Pos logit contributions
 diving+0.177
 guessing+0.165
 juggling+0.163
 teaching+0.160
aps+0.157
Neg logit contributions
Max-0.167
 Tess-0.160
 Bun-0.159
ens-0.154
Brown-0.151
Token for
Token ID329
Feature activation+1.34e+01

Pos logit contributions
Max+0.744
Brown+0.704
ens+0.688
Snow+0.686
Mic+0.678
Neg logit contributions
 teaching-0.344
 showing-0.334
 giving-0.315
 diving-0.314
 example-0.299
Token,
Token ID11
Feature activation+2.88e+00

Pos logit contributions
 teaching+0.070
 showing+0.068
 diving+0.068
 example+0.065
 guessing+0.064
Neg logit contributions
Max-0.096
ens-0.090
Brown-0.088
Snow-0.086
Blue-0.085
Token "
Token ID366
Feature activation-4.79e+00

Pos logit contributions
 diving+0.049
 example+0.049
 teaching+0.048
 humor+0.048
 showing+0.047
Neg logit contributions
Max-0.062
Brown-0.057
Snow-0.056
ens-0.055
Mic-0.054
TokenThank
Token ID10449
Feature activation+1.68e+01

Pos logit contributions
uffy+0.112
ens+0.109
lves+0.104
ox+0.103
Max+0.102
Neg logit contributions
 teaching-0.073
 showing-0.071
 giving-0.068
 joining-0.064
 example-0.063
Token you
Token ID345
Feature activation-1.34e+01

Pos logit contributions
 teaching+0.281
 showing+0.270
 giving+0.259
 helping+0.257
 show+0.249
Neg logit contributions
Max-0.373
lves-0.364
Brown-0.359
ens-0.354
issy-0.347
Token,
Token ID11
Feature activation+2.93e+00

Pos logit contributions
Max+0.422
Brown+0.400
ens+0.395
lves+0.392
Snow+0.390
Neg logit contributions
 teaching-0.116
 showing-0.108
 giving-0.103
 example-0.100
 diving-0.098
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
 teaching+0.083
 showing+0.083
 giving+0.083
 show+0.079
 diving+0.079
Neg logit contributions
Max-0.034
Brown-0.029
ens-0.027
Snow-0.027
lves-0.026
Token.
Token ID13
Feature activation-3.74e+00

Pos logit contributions
Max+0.886
Brown+0.833
Mic+0.826
lves+0.817
Snow+0.808
Neg logit contributions
 teaching-0.365
 showing-0.350
 giving-0.337
 helping-0.313
 example-0.308
Token You
Token ID921
Feature activation-9.34e+00

Pos logit contributions
Max+0.082
Brown+0.076
ens+0.076
Snow+0.074
Mic+0.073
Neg logit contributions
 teaching-0.066
 showing-0.065
 giving-0.063
 diving-0.062
 example-0.061
Token are
Token ID389
Feature activation+6.82e+00

Pos logit contributions
Max+0.175
ens+0.158
Brown+0.158
Snow+0.152
Mic+0.150
Neg logit contributions
 teaching-0.195
 showing-0.193
 diving-0.189
 giving-0.186
 example-0.184
Token such
Token ID884
Feature activation+4.08e+00

Pos logit contributions
 teaching+0.086
 showing+0.083
 diving+0.083
 example+0.078
 giving+0.078
Neg logit contributions
Max-0.181
Brown-0.169
ens-0.167
Snow-0.165
Mic-0.163
Token a
Token ID257
Feature activation-6.66e+00

Pos logit contributions
 teaching+0.061
 showing+0.060
 diving+0.059
 giving+0.057
 example+0.056
Neg logit contributions
Max-0.099
Brown-0.092
ens-0.091
Snow-0.090
Mic-0.089
Token her
Token ID607
Feature activation-1.63e+01

Pos logit contributions
 giving+0.302
 showing+0.302
 teaching+0.298
 show+0.297
 helping+0.278
Neg logit contributions
Max-0.510
Brown-0.476
uffy-0.465
Snow-0.462
zan-0.459
Token the
Token ID262
Feature activation-1.31e+01

Pos logit contributions
Max+0.435
Brown+0.409
ens+0.408
Snow+0.399
lves+0.391
Neg logit contributions
 teaching-0.207
 showing-0.201
 diving-0.193
 giving-0.188
 example-0.186
Token way
Token ID835
Feature activation+7.73e+00

Pos logit contributions
Max+0.296
Brown+0.279
Snow+0.270
lves+0.270
ens+0.266
Neg logit contributions
 teaching-0.209
 showing-0.197
 serving-0.196
 diving-0.194
 giving-0.190
Token out
Token ID503
Feature activation+2.15e+00

Pos logit contributions
 teaching+0.068
 showing+0.068
 diving+0.064
 giving+0.061
 liking+0.056
Neg logit contributions
Max-0.239
Brown-0.227
ens-0.225
Mic-0.223
Snow-0.222
Token and
Token ID290
Feature activation+2.12e+00

Pos logit contributions
 teaching+0.024
 showing+0.022
 example+0.022
 diving+0.022
 giving+0.021
Neg logit contributions
Max-0.059
ens-0.056
Brown-0.055
Snow-0.054
issy-0.053
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
 showing+0.039
 giving+0.038
 teaching+0.038
 diving+0.035
 helping+0.035
Neg logit contributions
Max-0.047
Brown-0.043
ens-0.042
Mic-0.041
Snow-0.041
Token was
Token ID373
Feature activation+9.02e+00

Pos logit contributions
Max+0.722
Mic+0.682
Brown+0.680
ens+0.671
Snow+0.641
Neg logit contributions
 showing-0.517
 teaching-0.502
 giving-0.490
 diving-0.479
 show-0.448
Token so
Token ID523
Feature activation+6.69e+00

Pos logit contributions
 humor+0.143
 example+0.142
 teaching+0.135
 accomplishment+0.133
 apology+0.130
Neg logit contributions
Max-0.188
Snow-0.178
Brown-0.177
K-0.174
 Bun-0.173
Token happy
Token ID3772
Feature activation+6.97e+00

Pos logit contributions
 teaching+0.104
 example+0.103
 showing+0.096
 humor+0.093
 serving+0.093
Neg logit contributions
Max-0.155
Brown-0.145
Snow-0.145
Blue-0.143
lves-0.140
Token to
Token ID284
Feature activation+9.37e+00

Pos logit contributions
 teaching+0.090
 showing+0.089
 diving+0.085
 giving+0.084
 liking+0.079
Neg logit contributions
Max-0.190
Brown-0.179
ens-0.176
Mic-0.175
Snow-0.173
Token be
Token ID307
Feature activation+8.61e+00

Pos logit contributions
 teaching+0.161
 diving+0.151
 showing+0.149
 serving+0.148
 mixing+0.148
Neg logit contributions
Max-0.205
ens-0.192
Brown-0.186
Snow-0.184
Blue-0.183
Token
Token ID198
Feature activation-5.92e+00

Pos logit contributions
Max+0.095
Brown+0.093
ox+0.092
lves+0.091
issy+0.091
Neg logit contributions
 teaching-0.044
 showing-0.043
 giving-0.042
 show-0.041
 helping-0.039
Token
Token ID198
Feature activation-5.90e+00

Pos logit contributions
Max+0.133
ens+0.130
Brown+0.129
lves+0.128
Mic+0.126
Neg logit contributions
 teaching-0.093
 showing-0.089
 diving-0.087
 show-0.084
 giving-0.084
TokenOne
Token ID3198
Feature activation-1.96e+00

Pos logit contributions
ens+0.113
Max+0.109
issy+0.107
uffy+0.106
ox+0.104
Neg logit contributions
 teaching-0.110
 showing-0.109
 giving-0.105
 liking-0.104
 example-0.101
Token day
Token ID1110
Feature activation-8.13e-01

Pos logit contributions
Max+0.057
Brown+0.054
Mic+0.053
ens+0.053
Snow+0.053
Neg logit contributions
 teaching-0.021
 showing-0.020
 diving-0.019
 giving-0.019
 example-0.018
Token,
Token ID11
Feature activation+2.92e+00

Pos logit contributions
Max+0.024
Brown+0.023
Snow+0.022
lves+0.022
Mic+0.022
Neg logit contributions
 teaching-0.008
 showing-0.008
 giving-0.008
 diving-0.007
 helping-0.007
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
 showing+0.068
 teaching+0.068
 giving+0.068
 helping+0.067
 making+0.066
Neg logit contributions
jo-0.044
ox-0.044
Brown-0.043
Max-0.042
zan-0.042
Token was
Token ID373
Feature activation+9.01e+00

Pos logit contributions
Max+0.709
Brown+0.696
Mic+0.667
Snow+0.654
Blue+0.645
Neg logit contributions
 showing-0.499
 teaching-0.491
 giving-0.475
 diving-0.468
 show-0.462
Token feeling
Token ID4203
Feature activation+1.16e+01

Pos logit contributions
 humor+0.138
 apology+0.130
 accomplishment+0.128
 example+0.126
 contain+0.115
Neg logit contributions
Max-0.205
lves-0.185
 Bun-0.183
 Margaret-0.182
Brown-0.182
Token a
Token ID257
Feature activation-6.53e+00

Pos logit contributions
 showing+0.182
 teaching+0.177
 example+0.174
 diving+0.172
 humor+0.171
Neg logit contributions
Max-0.263
ens-0.248
Snow-0.241
Brown-0.236
lves-0.232
Token bit
Token ID1643
Feature activation+1.81e+01

Pos logit contributions
Max+0.122
Brown+0.112
Snow+0.110
ens+0.109
Mic+0.108
Neg logit contributions
 teaching-0.135
 showing-0.131
 diving-0.128
 humor-0.125
 example-0.125
Token hungry
Token ID14720
Feature activation+7.41e+00

Pos logit contributions
 showing+0.380
 teaching+0.371
 example+0.347
 diving+0.346
 counting+0.343
Neg logit contributions
ens-0.358
Max-0.309
lves-0.306
ety-0.292
 Bun-0.281
Token beef
Token ID12023
Feature activation-1.49e+01

Pos logit contributions
 teaching+0.236
 showing+0.233
 diving+0.230
 example+0.225
 giving+0.224
Neg logit contributions
Max-0.202
ens-0.184
Brown-0.184
Snow-0.176
Mic-0.175
Token on
Token ID319
Feature activation+1.14e+01

Pos logit contributions
Max+0.397
ens+0.373
Brown+0.370
Snow+0.362
Mic+0.356
Neg logit contributions
 teaching-0.184
 showing-0.176
 diving-0.172
 example-0.171
 giving-0.167
Token the
Token ID262
Feature activation-1.34e+01

Pos logit contributions
 showing+0.125
 teaching+0.122
 giving+0.117
 show+0.108
 diving+0.106
Neg logit contributions
Max-0.335
Brown-0.319
ens-0.316
uffy-0.313
issy-0.312
Token table
Token ID3084
Feature activation-4.04e+00

Pos logit contributions
Max+0.312
Brown+0.289
ens+0.288
Snow+0.280
Mic+0.277
Neg logit contributions
 teaching-0.222
 showing-0.218
 diving-0.211
 giving-0.208
 example-0.203
Token.
Token ID13
Feature activation-3.74e+00

Pos logit contributions
Max+0.087
ens+0.082
Snow+0.078
Brown+0.077
Spot+0.074
Neg logit contributions
 teaching-0.067
 example-0.065
 showing-0.064
 diving-0.064
 giving-0.062
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
Max+0.069
Brown+0.063
ens+0.061
Snow+0.061
lves+0.060
Neg logit contributions
 teaching-0.078
 showing-0.077
 giving-0.077
 helping-0.074
 serving-0.072
Token was
Token ID373
Feature activation+8.92e+00

Pos logit contributions
Max+0.750
ens+0.723
Brown+0.715
Snow+0.702
issy+0.696
Neg logit contributions
 showing-0.468
 teaching-0.453
 giving-0.434
 diving-0.408
 show-0.400
Token so
Token ID523
Feature activation+6.68e+00

Pos logit contributions
 humor+0.164
 example+0.160
 accomplishment+0.158
 diving+0.154
ulp+0.147
Neg logit contributions
Max-0.154
Brown-0.146
Spot-0.139
 Kitty-0.139
K-0.139
Token happy
Token ID3772
Feature activation+6.88e+00

Pos logit contributions
 example+0.112
 teaching+0.108
 diving+0.100
 showing+0.099
 joining+0.098
Neg logit contributions
Max-0.145
Spot-0.133
ens-0.133
Blue-0.132
Snow-0.132
Token!
Token ID0
Feature activation-2.62e+00

Pos logit contributions
 teaching+0.085
 showing+0.085
 giving+0.079
 diving+0.079
 liking+0.073
Neg logit contributions
Max-0.195
Brown-0.186
Mic-0.183
ens-0.179
Snow-0.179
Token She
Token ID1375
Feature activation-1.91e+01

Pos logit contributions
Max+0.049
Brown+0.047
Snow+0.045
lves+0.045
mi+0.045
Neg logit contributions
 showing-0.051
 giving-0.051
 teaching-0.051
 helping-0.048
 planning-0.048
Tokeniflower
Token ID42642
Feature activation-1.68e+01

Pos logit contributions
Max+0.053
Brown+0.048
Snow+0.047
Mic+0.046
ens+0.044
Neg logit contributions
 teaching-0.114
 showing-0.113
 example-0.111
 giving-0.110
 diving-0.109
Token?"
Token ID1701
Feature activation-4.79e-01

Pos logit contributions
Max+0.397
ens+0.385
Brown+0.368
Snow+0.361
Mic+0.356
Neg logit contributions
 teaching-0.248
 showing-0.243
 example-0.233
 diving-0.232
 giving-0.228
Token asked
Token ID1965
Feature activation+1.14e+01

Pos logit contributions
Max+0.012
uffy+0.012
Brown+0.012
lves+0.011
Snow+0.011
Neg logit contributions
 showing-0.007
 teaching-0.007
 giving-0.007
 show-0.006
 helping-0.006
Token Lily
Token ID20037
Feature activation-2.69e+01

Pos logit contributions
 teaching+0.234
 giving+0.234
 showing+0.229
 helping+0.225
 show+0.211
Neg logit contributions
Brown-0.207
Max-0.206
ox-0.202
Snow-0.200
lves-0.199
Token.
Token ID13
Feature activation-3.61e+00

Pos logit contributions
Max+0.604
ens+0.577
Brown+0.547
ety+0.535
Snow+0.530
Neg logit contributions
 teaching-0.427
 diving-0.423
 example-0.418
 showing-0.417
 humor-0.396
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
Max+0.090
Brown+0.087
ens+0.086
lves+0.085
Snow+0.084
Neg logit contributions
 teaching-0.052
 showing-0.052
 giving-0.050
 serving-0.046
 helping-0.045
Token nodded
Token ID14464
Feature activation+7.81e+00

Pos logit contributions
Max+0.727
Brown+0.703
Snow+0.676
Mic+0.672
ens+0.666
Neg logit contributions
 showing-0.484
 teaching-0.482
 giving-0.456
 diving-0.441
 show-0.438
Token her
Token ID607
Feature activation-1.63e+01

Pos logit contributions
 teaching+0.145
 diving+0.142
 example+0.141
 showing+0.139
 giving+0.135
Neg logit contributions
Max-0.163
ens-0.149
Brown-0.145
Snow-0.143
Mic-0.139
Token head
Token ID1182
Feature activation+7.74e+00

Pos logit contributions
Max+0.378
ens+0.351
Brown+0.341
Snow+0.340
Mic+0.337
Neg logit contributions
 teaching-0.268
 designing-0.257
 diving-0.250
 planning-0.248
 serving-0.247
Token again
Token ID757
Feature activation+1.66e+00

Pos logit contributions
 diving+0.141
 humor+0.139
 example+0.139
 teaching+0.136
 planning+0.132
Neg logit contributions
Max-0.159
Brown-0.146
ens-0.144
Snow-0.144
ety-0.138
Token and
Token ID290
Feature activation+2.15e+00

Pos logit contributions
 humor+0.036
 diving+0.035
 teaching+0.034
 designing+0.034
 humour+0.034
Neg logit contributions
Max-0.024
ens-0.023
ety-0.021
Snow-0.020
elope-0.020
Token From
Token ID3574
Feature activation+4.86e+00

Pos logit contributions
Max+0.068
lves+0.067
Brown+0.066
mi+0.065
iko+0.064
Neg logit contributions
 teaching-0.038
 showing-0.037
 giving-0.037
 helping-0.036
 serving-0.034
Token that
Token ID326
Feature activation-3.20e+00

Pos logit contributions
 showing+0.038
 teaching+0.038
 diving+0.036
 show+0.035
 giving+0.033
Neg logit contributions
Max-0.152
Brown-0.147
Mic-0.147
issy-0.144
lves-0.142
Token day
Token ID1110
Feature activation-8.60e-01

Pos logit contributions
Max+0.082
ens+0.076
Brown+0.075
Mic+0.073
Snow+0.072
Neg logit contributions
 teaching-0.044
 showing-0.044
 diving-0.042
 giving-0.041
 example-0.041
Token on
Token ID319
Feature activation+1.16e+01

Pos logit contributions
Max+0.025
ens+0.023
Brown+0.023
Snow+0.022
K+0.022
Neg logit contributions
 teaching-0.009
 showing-0.009
 diving-0.009
 example-0.008
 liking-0.008
Token,
Token ID11
Feature activation+3.00e+00

Pos logit contributions
 showing+0.119
 teaching+0.119
 giving+0.113
 diving+0.111
 helping+0.107
Neg logit contributions
Max-0.344
Brown-0.331
Snow-0.328
lves-0.323
Mic-0.318
Token Betsy
Token ID45775
Feature activation-3.06e+01

Pos logit contributions
 giving+0.065
 showing+0.065
 helping+0.065
 teaching+0.065
 serving+0.062
Neg logit contributions
Max-0.054
ox-0.052
Brown-0.051
Snow-0.050
issy-0.049
Token and
Token ID290
Feature activation+2.51e+00

Pos logit contributions
Max+0.765
Brown+0.721
ens+0.704
Snow+0.701
Mic+0.696
Neg logit contributions
 teaching-0.457
 showing-0.450
 giving-0.425
 diving-0.418
 example-0.403
Token the
Token ID262
Feature activation-1.32e+01

Pos logit contributions
 giving+0.032
 teaching+0.032
 showing+0.032
 helping+0.030
 show+0.028
Neg logit contributions
Max-0.071
Snow-0.067
Brown-0.067
uffy-0.066
issy-0.065
Token thief
Token ID25906
Feature activation-1.57e+01

Pos logit contributions
Max+0.229
Brown+0.213
ens+0.207
Snow+0.205
issy+0.195
Neg logit contributions
 teaching-0.304
 showing-0.298
 giving-0.287
 diving-0.287
 example-0.283
Token became
Token ID2627
Feature activation+1.12e+01

Pos logit contributions
Max+0.319
Brown+0.292
ens+0.287
Snow+0.284
Mic+0.280
Neg logit contributions
 teaching-0.305
 showing-0.302
 diving-0.291
 giving-0.288
 example-0.279
Token good
Token ID922
Feature activation+8.29e+00

Pos logit contributions
 teaching+0.108
 showing+0.104
 diving+0.099
 giving+0.096
 example+0.095
Neg logit contributions
Max-0.340
Brown-0.327
ens-0.318
Snow-0.318
Mic-0.309
Token From
Token ID3574
Feature activation+4.77e+00

Pos logit contributions
Max+0.081
zan+0.075
ox+0.074
Brown+0.074
iko+0.073
Neg logit contributions
 giving-0.062
 teaching-0.060
 showing-0.060
 serving-0.058
 helping-0.057
Token that
Token ID326
Feature activation-3.33e+00

Pos logit contributions
 teaching+0.058
 showing+0.057
 diving+0.056
 giving+0.053
 example+0.052
Neg logit contributions
Max-0.131
Brown-0.124
ens-0.122
Mic-0.121
Snow-0.120
Token day
Token ID1110
Feature activation-9.88e-01

Pos logit contributions
Max+0.084
ens+0.078
Brown+0.076
Mic+0.074
Snow+0.074
Neg logit contributions
 teaching-0.047
 showing-0.047
 diving-0.045
 giving-0.044
 liking-0.044
Token on
Token ID319
Feature activation+1.13e+01

Pos logit contributions
Max+0.028
ens+0.026
Brown+0.025
K+0.025
Spot+0.025
Neg logit contributions
 showing-0.012
 humor-0.011
 teaching-0.011
 diving-0.011
 example-0.011
Token,
Token ID11
Feature activation+2.90e+00

Pos logit contributions
 showing+0.097
 teaching+0.096
 giving+0.092
 helping+0.091
 diving+0.090
Neg logit contributions
Max-0.349
Brown-0.339
Snow-0.336
lves-0.333
zan-0.324
Token Betsy
Token ID45775
Feature activation-3.05e+01

Pos logit contributions
 giving+0.075
 showing+0.074
 teaching+0.074
 helping+0.074
 serving+0.073
Neg logit contributions
Max-0.041
ox-0.037
Brown-0.037
Snow-0.035
zan-0.034
Token and
Token ID290
Feature activation+2.25e+00

Pos logit contributions
Max+0.726
Brown+0.678
ens+0.670
Snow+0.659
Mic+0.652
Neg logit contributions
 teaching-0.491
 showing-0.483
 giving-0.458
 diving-0.458
 example-0.440
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
 teaching+0.025
 giving+0.025
 showing+0.025
 helping+0.023
 show+0.022
Neg logit contributions
Max-0.067
Brown-0.064
Snow-0.064
uffy-0.062
issy-0.061
Token fox
Token ID21831
Feature activation-1.59e+01

Pos logit contributions
Max+0.205
Brown+0.181
ens+0.180
Snow+0.173
Mic+0.169
Neg logit contributions
 teaching-0.335
 showing-0.331
 diving-0.324
 giving-0.320
 example-0.316
Token played
Token ID2826
Feature activation+1.65e+01

Pos logit contributions
Max+0.328
Brown+0.301
ens+0.296
Snow+0.292
Mic+0.287
Neg logit contributions
 teaching-0.307
 showing-0.303
 diving-0.294
 giving-0.290
 example-0.282
Token together
Token ID1978
Feature activation+2.53e+00

Pos logit contributions
 giving+0.229
 teaching+0.228
 showing+0.220
 diving+0.216
 liking+0.213
Neg logit contributions
Max-0.418
ens-0.376
Mic-0.368
Spot-0.365
Blue-0.364
Token with
Token ID351
Feature activation+1.60e+01

Pos logit contributions
 apology+0.062
 example+0.060
ulp+0.057
 accomplishment+0.056
 humor+0.056
Neg logit contributions
Max-0.099
Spot-0.084
Brown-0.084
Snow-0.083
Blue-0.082
Token her
Token ID607
Feature activation-1.63e+01

Pos logit contributions
 giving+0.367
 showing+0.365
 teaching+0.364
 helping+0.353
 show+0.343
Neg logit contributions
Max-0.278
Brown-0.254
issy-0.244
Snow-0.242
zan-0.237
Token because
Token ID780
Feature activation+1.77e-01

Pos logit contributions
Max+0.417
Brown+0.390
ens+0.386
Snow+0.381
Mic+0.378
Neg logit contributions
 teaching-0.236
 showing-0.231
 diving-0.220
 giving-0.218
 example-0.208
Token they
Token ID484
Feature activation-1.62e+01

Pos logit contributions
 teaching+0.004
 showing+0.004
 giving+0.004
 diving+0.004
 serving+0.004
Neg logit contributions
Max-0.003
Brown-0.003
ens-0.003
Mic-0.003
Snow-0.003
Token trusted
Token ID13467
Feature activation+1.66e+01

Pos logit contributions
Max+0.356
Brown+0.331
Snow+0.325
Mic+0.325
ens+0.317
Neg logit contributions
 teaching-0.292
 showing-0.291
 show-0.280
 giving-0.273
 diving-0.267
Token Betsy
Token ID45775
Feature activation-3.05e+01

Pos logit contributions
 teaching+0.303
 showing+0.301
 giving+0.293
 diving+0.283
 liking+0.279
Neg logit contributions
Max-0.359
Brown-0.334
ens-0.322
Snow-0.321
Mic-0.316
Token.
Token ID13
Feature activation-3.73e+00

Pos logit contributions
Max+0.862
Brown+0.811
Mic+0.796
Snow+0.785
ens+0.784
Neg logit contributions
 teaching-0.376
 showing-0.368
 giving-0.347
 diving-0.340
 serving-0.320
Token
Token ID198
Feature activation-5.92e+00

Pos logit contributions
Max+0.095
Brown+0.093
ox+0.092
lves+0.091
issy+0.091
Neg logit contributions
 teaching-0.044
 showing-0.043
 giving-0.042
 show-0.041
 helping-0.039
Token
Token ID198
Feature activation-5.90e+00

Pos logit contributions
Max+0.133
ens+0.130
Brown+0.129
lves+0.128
Mic+0.126
Neg logit contributions
 teaching-0.093
 showing-0.089
 diving-0.087
 show-0.084
 giving-0.084
TokenOne
Token ID3198
Feature activation-1.96e+00

Pos logit contributions
ens+0.113
Max+0.109
issy+0.107
uffy+0.106
ox+0.104
Neg logit contributions
 teaching-0.110
 showing-0.109
 giving-0.105
 liking-0.104
 example-0.101
Token day
Token ID1110
Feature activation-8.13e-01

Pos logit contributions
Max+0.057
Brown+0.054
Mic+0.053
ens+0.053
Snow+0.053
Neg logit contributions
 teaching-0.021
 showing-0.020
 diving-0.019
 giving-0.019
 example-0.018
Token felt
Token ID2936
Feature activation+1.07e+01

Pos logit contributions
Max+0.451
ens+0.427
Brown+0.420
Mic+0.414
Snow+0.410
Neg logit contributions
 showing-0.279
 teaching-0.270
 giving-0.262
 show-0.247
 liking-0.243
Token sad
Token ID6507
Feature activation+6.47e+00

Pos logit contributions
 teaching+0.167
 diving+0.162
 showing+0.162
 example+0.161
 guessing+0.154
Neg logit contributions
Max-0.241
ens-0.233
lves-0.226
Brown-0.225
Snow-0.225
Token and
Token ID290
Feature activation+2.63e+00

Pos logit contributions
 humor+0.137
 teaching+0.134
 showing+0.132
 diving+0.128
 example+0.127
Neg logit contributions
ens-0.114
Max-0.109
 Bun-0.102
Snow-0.098
ety-0.097
Token wanted
Token ID2227
Feature activation+1.97e+01

Pos logit contributions
 teaching+0.050
 showing+0.049
 diving+0.048
 giving+0.047
 example+0.047
Neg logit contributions
Max-0.054
ens-0.050
Brown-0.050
Snow-0.048
Mic-0.047
Token to
Token ID284
Feature activation+9.82e+00

Pos logit contributions
 teaching+0.146
 showing+0.141
 diving+0.132
 giving+0.125
 example+0.121
Neg logit contributions
Max-0.636
ens-0.601
Brown-0.601
Snow-0.590
Mic-0.583
Token help
Token ID1037
Feature activation+1.88e+01

Pos logit contributions
 diving+0.153
 teaching+0.151
 example+0.150
 planning+0.145
 designing+0.140
Neg logit contributions
Max-0.223
Brown-0.216
ens-0.209
Blue-0.209
Snow-0.206
Token the
Token ID262
Feature activation-1.29e+01

Pos logit contributions
 teaching+0.277
 showing+0.273
 giving+0.262
 diving+0.255
 show+0.245
Neg logit contributions
Max-0.479
Brown-0.446
ens-0.439
Snow-0.434
Mic-0.431
Token bird
Token ID6512
Feature activation-1.91e+01

Pos logit contributions
Max+0.208
ens+0.187
Brown+0.183
Snow+0.175
Mic+0.175
Neg logit contributions
 teaching-0.306
 showing-0.305
 giving-0.295
 diving-0.295
 example-0.288
Token.
Token ID13
Feature activation-3.11e+00

Pos logit contributions
Max+0.415
ens+0.410
Brown+0.393
ety+0.380
lves+0.378
Neg logit contributions
 teaching-0.318
 example-0.308
 showing-0.305
 diving-0.304
 humor-0.290
Token
Token ID220
Feature activation-6.54e+00

Pos logit contributions
Brown+0.085
Rust+0.082
Mic+0.082
Snow+0.082
Max+0.081
Neg logit contributions
 showing-0.032
 show-0.031
 teaching-0.030
 to-0.029
 giving-0.027
Token
Token ID198
Feature activation-5.60e+00

Pos logit contributions
Max+0.155
Brown+0.153
lves+0.152
ens+0.151
Snow+0.150
Neg logit contributions
 teaching-0.092
 showing-0.088
 giving-0.083
aps-0.082
 diving-0.081
Token very
Token ID845
Feature activation+1.25e+01

Pos logit contributions
 diving+0.197
 humor+0.192
ulp+0.189
 apology+0.188
 accomplishment+0.184
Neg logit contributions
Max-0.263
mie-0.246
Brown-0.241
K-0.238
 Tess-0.236
Token proud
Token ID6613
Feature activation+1.59e+01

Pos logit contributions
 diving+0.188
 example+0.171
 teaching+0.165
ags+0.164
 showing+0.164
Neg logit contributions
Max-0.266
lves-0.246
mie-0.246
Brown-0.245
K-0.240
Token of
Token ID286
Feature activation+1.51e+01

Pos logit contributions
 diving+0.165
 teaching+0.154
 showing+0.153
 example+0.152
 succeeding+0.143
Neg logit contributions
Max-0.431
ens-0.417
Brown-0.397
Snow-0.388
issy-0.386
Token Molly
Token ID30236
Feature activation-2.39e+01

Pos logit contributions
 teaching+0.230
 showing+0.230
 giving+0.220
 liking+0.209
 helping+0.208
Neg logit contributions
Max-0.380
Brown-0.374
Snow-0.368
lves-0.358
Mic-0.354
Token for
Token ID329
Feature activation+1.73e+01

Pos logit contributions
Max+0.667
Brown+0.632
Mic+0.617
Snow+0.616
ens+0.611
Neg logit contributions
 teaching-0.303
 showing-0.295
 giving-0.281
 diving-0.271
 example-0.261
Token using
Token ID1262
Feature activation+1.93e+01

Pos logit contributions
aps+0.208
oops+0.186
 diving+0.184
iddle+0.183
ulp+0.180
Neg logit contributions
Max-0.435
 Tess-0.396
Brown-0.390
 Bun-0.387
 Margaret-0.382
Token her
Token ID607
Feature activation-1.31e+01

Pos logit contributions
 teaching+0.322
 showing+0.316
 diving+0.304
 giving+0.302
 example+0.293
Neg logit contributions
Max-0.448
Brown-0.417
ens-0.411
Snow-0.404
Mic-0.399
Token imagination
Token ID13843
Feature activation+2.75e+00

Pos logit contributions
Max+0.320
ens+0.303
Brown+0.297
Snow+0.286
Mic+0.283
Neg logit contributions
 teaching-0.178
 showing-0.174
 diving-0.173
 giving-0.167
 example-0.161
Token.
Token ID13
Feature activation-8.23e-01

Pos logit contributions
 teaching+0.037
 showing+0.036
 giving+0.034
 diving+0.034
 example+0.032
Neg logit contributions
Max-0.073
Brown-0.068
ens-0.067
Snow-0.067
Mic-0.066
Token She
Token ID1375
Feature activation-1.60e+01

Pos logit contributions
Max+0.015
Brown+0.014
ens+0.013
Snow+0.013
Mic+0.013
Neg logit contributions
 teaching-0.018
 showing-0.018
 giving-0.017
 diving-0.017
 example-0.017
Token encouraged
Token ID10085
Feature activation+2.21e+01

Pos logit contributions
Max+0.324
Brown+0.294
ens+0.291
Snow+0.283
issy+0.273
Neg logit contributions
 teaching-0.298
 diving-0.295
 showing-0.291
 liking-0.284
 example-0.283
Token on
Token ID319
Feature activation+1.32e+01

Pos logit contributions
 teaching+0.117
 showing+0.114
 diving+0.108
 giving+0.107
 example+0.106
Neg logit contributions
Max-0.314
Brown-0.289
ens-0.286
Snow-0.283
Mic-0.275
Token,
Token ID11
Feature activation+4.49e+00

Pos logit contributions
 teaching+0.193
 showing+0.188
 diving+0.182
 giving+0.181
 example+0.173
Neg logit contributions
Max-0.335
Brown-0.317
ens-0.314
lves-0.312
Mic-0.311
Token Peter
Token ID5613
Feature activation-2.74e+01

Pos logit contributions
 teaching+0.096
 showing+0.095
 giving+0.093
 show+0.093
 helping+0.091
Neg logit contributions
Max-0.079
ens-0.079
lves-0.077
Snow-0.076
ety-0.075
Token!
Token ID0
Feature activation-1.22e+00

Pos logit contributions
Max+0.776
Brown+0.730
Mic+0.728
Snow+0.714
lves+0.704
Neg logit contributions
 teaching-0.330
 showing-0.325
 giving-0.313
 diving-0.299
 example-0.290
Token We
Token ID775
Feature activation-5.03e+00

Pos logit contributions
Max+0.028
ens+0.027
Brown+0.027
Mic+0.027
lves+0.026
Neg logit contributions
 teaching-0.019
 showing-0.018
 example-0.018
 giving-0.018
 diving-0.017
Token dare
Token ID16498
Feature activation+1.84e+01

Pos logit contributions
Max+0.115
Brown+0.107
ens+0.104
Snow+0.104
Mic+0.103
Neg logit contributions
 teaching-0.085
 showing-0.083
 diving-0.081
 giving-0.080
 show-0.078
Token you
Token ID345
Feature activation-1.21e+01

Pos logit contributions
 teaching+0.346
 showing+0.341
 diving+0.332
 giving+0.327
 example+0.321
Neg logit contributions
Max-0.383
Brown-0.351
ens-0.348
Snow-0.340
Mic-0.332
Token to
Token ID284
Feature activation+1.09e+01

Pos logit contributions
Max+0.360
Brown+0.337
ens+0.337
Snow+0.333
K+0.323
Neg logit contributions
 teaching-0.118
 showing-0.115
 diving-0.103
 giving-0.103
 example-0.100
Token do
Token ID466
Feature activation+1.82e+01

Pos logit contributions
 teaching+0.143
 showing+0.138
 diving+0.129
 giving+0.128
 example+0.127
Neg logit contributions
Max-0.288
Brown-0.269
ens-0.267
Snow-0.264
Mic-0.256
Token it
Token ID340
Feature activation-1.38e+01

Pos logit contributions
 teaching+0.286
 showing+0.280
 diving+0.271
 giving+0.266
 example+0.260
Neg logit contributions
Max-0.436
Brown-0.404
ens-0.403
Snow-0.393
Mic-0.389
Token!"
Token ID2474
Feature activation+4.48e-01

Pos logit contributions
Max+0.345
Brown+0.321
ens+0.320
Snow+0.313
Mic+0.307
Neg logit contributions
 teaching-0.201
 showing-0.197
 diving-0.189
 giving-0.186
 example-0.182
Token told
Token ID1297
Feature activation+1.45e+01

Pos logit contributions
Max+0.443
ens+0.414
Brown+0.412
Snow+0.401
Mic+0.396
Neg logit contributions
 teaching-0.306
 showing-0.304
 diving-0.291
 giving-0.288
 example-0.277
Token her
Token ID607
Feature activation-1.61e+01

Pos logit contributions
 teaching+0.237
 showing+0.235
 giving+0.224
 example+0.221
 show+0.218
Neg logit contributions
Max-0.354
Brown-0.335
ens-0.325
Mic-0.324
Snow-0.319
Token mom
Token ID1995
Feature activation-1.79e+01

Pos logit contributions
Max+0.339
Brown+0.311
ens+0.310
Snow+0.301
Mic+0.298
Neg logit contributions
 teaching-0.306
 showing-0.301
 diving-0.292
 giving-0.289
 example-0.283
Tokenmy
Token ID1820
Feature activation-1.78e+01

Pos logit contributions
Max+0.301
Mic+0.277
lves+0.274
Brown+0.272
Snow+0.268
Neg logit contributions
 teaching-0.438
 giving-0.424
 showing-0.423
 sharing-0.416
 about-0.406
Token she
Token ID673
Feature activation-2.00e+01

Pos logit contributions
Max+0.535
Mic+0.510
Brown+0.507
Snow+0.495
lves+0.494
Neg logit contributions
 teaching-0.206
 giving-0.194
 showing-0.194
 sharing-0.177
 helping-0.174
Token wanted
Token ID2227
Feature activation+1.95e+01

Pos logit contributions
Max+0.461
Brown+0.447
Mic+0.441
Snow+0.436
ens+0.427
Neg logit contributions
 teaching-0.340
 showing-0.334
 giving-0.315
 show-0.309
 learning-0.295
Token a
Token ID257
Feature activation-6.49e+00

Pos logit contributions
 teaching+0.215
 showing+0.208
 diving+0.199
 giving+0.190
 example+0.189
Neg logit contributions
Max-0.547
ens-0.521
Brown-0.508
Snow-0.503
Mic-0.490
Token cone
Token ID27763
Feature activation-8.06e+00

Pos logit contributions
Max+0.129
Snow+0.119
Brown+0.118
Spot+0.115
ens+0.114
Neg logit contributions
 teaching-0.121
 showing-0.116
 succeeding-0.111
 designing-0.111
 example-0.109
Token of
Token ID286
Feature activation+1.14e+01

Pos logit contributions
Max+0.192
ens+0.187
Brown+0.176
Snow+0.176
mi+0.171
Neg logit contributions
 teaching-0.113
 showing-0.107
 example-0.105
 humor-0.104
 diving-0.104
Token strawberry
Token ID41236
Feature activation-1.07e+01

Pos logit contributions
 teaching+0.224
aps+0.219
 example+0.216
 showing+0.214
apping+0.208
Neg logit contributions
ens-0.190
Max-0.186
Spot-0.175
Snow-0.174
Brown-0.170
Token ice
Token ID4771
Feature activation+1.84e-02

Pos logit contributions
Max+0.228
ens+0.227
Snow+0.211
Brown+0.210
mi+0.206
Neg logit contributions
 teaching-0.182
 showing-0.172
 example-0.169
 giving-0.161
 diving-0.160
Token to
Token ID284
Feature activation+9.52e+00

Pos logit contributions
 teaching+0.178
 showing+0.173
 succeeding+0.158
 diving+0.156
 humor+0.154
Neg logit contributions
Max-0.345
ens-0.326
Snow-0.321
Brown-0.313
 Bun-0.310
Token find
Token ID1064
Feature activation+1.33e+01

Pos logit contributions
 teaching+0.115
 showing+0.112
 diving+0.107
 giving+0.104
 example+0.102
Neg logit contributions
Max-0.261
Brown-0.245
ens-0.244
Snow-0.240
Mic-0.235
Token another
Token ID1194
Feature activation-8.79e+00

Pos logit contributions
 teaching+0.198
 showing+0.194
 diving+0.189
 giving+0.185
 example+0.181
Neg logit contributions
Max-0.333
Brown-0.311
ens-0.308
Snow-0.301
Mic-0.299
Token way
Token ID835
Feature activation+7.93e+00

Pos logit contributions
Max+0.187
ens+0.172
Brown+0.166
ball+0.165
Snow+0.165
Neg logit contributions
 teaching-0.153
 showing-0.144
 giving-0.139
 liking-0.136
 designing-0.135
Token to
Token ID284
Feature activation+9.49e+00

Pos logit contributions
 teaching+0.069
 showing+0.066
 diving+0.061
 giving+0.060
 example+0.057
Neg logit contributions
Max-0.246
ens-0.232
Brown-0.232
Snow-0.227
Mic-0.224
Token help
Token ID1037
Feature activation+1.87e+01

Pos logit contributions
 teaching+0.123
 showing+0.118
 diving+0.111
 giving+0.110
 example+0.109
Neg logit contributions
Max-0.251
Brown-0.233
ens-0.233
Snow-0.230
Mic-0.225
Token the
Token ID262
Feature activation-1.34e+01

Pos logit contributions
 teaching+0.301
 showing+0.299
 diving+0.294
 show+0.290
 giving+0.290
Neg logit contributions
Max-0.434
Brown-0.417
Mic-0.405
ens-0.404
lves-0.398
Token fish
Token ID5916
Feature activation-1.55e+01

Pos logit contributions
Max+0.286
ens+0.255
Snow+0.252
Mic+0.250
Spot+0.249
Neg logit contributions
 teaching-0.237
 showing-0.230
 giving-0.223
 liking-0.213
 example-0.211
Token,
Token ID11
Feature activation+2.80e+00

Pos logit contributions
Max+0.388
ens+0.364
Brown+0.361
Snow+0.352
Mic+0.346
Neg logit contributions
 teaching-0.223
 showing-0.216
 diving-0.206
 giving-0.204
 example-0.202
Tokenâ
Token ID22940
Feature activation-7.83e+00

Pos logit contributions
 teaching+0.041
 showing+0.040
 diving+0.039
 giving+0.039
 example+0.037
Neg logit contributions
Max-0.071
Brown-0.067
Mic-0.066
ens-0.065
lves-0.064
Token€
Token ID26391
Feature activation-1.75e+01

Pos logit contributions
Max+0.128
ens+0.115
Brown+0.113
Snow+0.110
Mic+0.106
Neg logit contributions
 teaching-0.184
 showing-0.182
 diving-0.178
 giving-0.176
 example-0.173
Token!"
Token ID2474
Feature activation+4.85e-01

Pos logit contributions
 teaching+0.032
 showing+0.032
 diving+0.031
 giving+0.030
 example+0.029
Neg logit contributions
Max-0.050
ens-0.046
Brown-0.046
Snow-0.045
Mic-0.044
Token She
Token ID1375
Feature activation-1.80e+01

Pos logit contributions
 teaching+0.010
 showing+0.010
 giving+0.010
 diving+0.009
 liking+0.009
Neg logit contributions
Max-0.009
Brown-0.009
Snow-0.008
lves-0.008
Mic-0.008
Token ran
Token ID4966
Feature activation+1.40e+01

Pos logit contributions
Max+0.389
ens+0.353
Brown+0.353
Snow+0.344
Mic+0.338
Neg logit contributions
 teaching-0.328
 showing-0.320
 diving-0.317
 giving-0.308
 example-0.306
Token to
Token ID284
Feature activation+1.04e+01

Pos logit contributions
 teaching+0.133
 showing+0.128
 diving+0.126
 giving+0.119
 serving+0.113
Neg logit contributions
Max-0.428
ens-0.403
Brown-0.395
Snow-0.389
lves-0.382
Token him
Token ID683
Feature activation-1.53e+01

Pos logit contributions
 teaching+0.185
 showing+0.181
 diving+0.180
 example+0.176
 giving+0.173
Neg logit contributions
Max-0.230
ens-0.213
Brown-0.204
Snow-0.199
Mic-0.197
Token and
Token ID290
Feature activation+3.05e+00

Pos logit contributions
Max+0.341
ens+0.317
Brown+0.308
Snow+0.297
Mic+0.292
Neg logit contributions
 teaching-0.261
 diving-0.257
 showing-0.254
 example-0.250
 humor-0.243
Token tried
Token ID3088
Feature activation+2.01e+01

Pos logit contributions
 teaching+0.048
 showing+0.048
 giving+0.046
 diving+0.045
 show+0.043
Neg logit contributions
Max-0.073
Brown-0.069
ens-0.067
Snow-0.066
Mic-0.066
Token to
Token ID284
Feature activation+1.04e+01

Pos logit contributions
 teaching+0.182
 showing+0.179
 diving+0.177
 example+0.169
 giving+0.158
Neg logit contributions
Max-0.611
ens-0.571
Brown-0.554
Snow-0.546
Spot-0.537
Token grab
Token ID5552
Feature activation+1.85e+01

Pos logit contributions
 teaching+0.136
 diving+0.130
 showing+0.129
 example+0.125
 giving+0.122
Neg logit contributions
Max-0.281
ens-0.259
Brown-0.258
Snow-0.252
Mic-0.249
Token her
Token ID607
Feature activation-1.57e+01

Pos logit contributions
 showing+0.201
 show+0.200
 teaching+0.192
 example+0.182
 giving+0.181
Neg logit contributions
Brown-0.528
Snow-0.494
Mic-0.489
Max-0.488
lves-0.481
Token doll
Token ID3654
Feature activation-1.84e+01

Pos logit contributions
Max+0.255
Brown+0.227
ens+0.223
Mic+0.221
Snow+0.220
Neg logit contributions
 teaching-0.375
 showing-0.373
 giving-0.359
 diving-0.356
 show-0.352
Token They
Token ID1119
Feature activation-1.27e+01

Pos logit contributions
Brown+0.060
Max+0.059
lves+0.058
ens+0.058
Mic+0.057
Neg logit contributions
 teaching-0.044
 showing-0.043
 diving-0.043
 giving-0.042
 example-0.042
Token were
Token ID547
Feature activation+7.45e+00

Pos logit contributions
Max+0.272
Brown+0.257
Mic+0.246
ens+0.246
Snow+0.242
Neg logit contributions
 teaching-0.233
 showing-0.233
 diving-0.227
 giving-0.222
 show-0.218
Token so
Token ID523
Feature activation+7.59e+00

Pos logit contributions
 teaching+0.092
 example+0.089
 showing+0.087
 liking+0.085
 diving+0.083
Neg logit contributions
Max-0.188
Brown-0.176
Snow-0.175
ens-0.172
lves-0.170
Token happy
Token ID3772
Feature activation+7.77e+00

Pos logit contributions
 teaching+0.105
 showing+0.100
 example+0.097
 giving+0.092
 diving+0.090
Neg logit contributions
Max-0.190
Snow-0.180
Brown-0.178
ens-0.176
lves-0.174
Token and
Token ID290
Feature activation+3.15e+00

Pos logit contributions
 teaching+0.112
 showing+0.109
 example+0.106
 diving+0.104
 giving+0.103
Neg logit contributions
Max-0.188
ens-0.175
Brown-0.172
Snow-0.171
lves-0.164
Token full
Token ID1336
Feature activation+1.36e+01

Pos logit contributions
 teaching+0.047
 example+0.047
 showing+0.043
 diving+0.043
 liking+0.042
Neg logit contributions
Max-0.074
ens-0.070
Snow-0.069
Brown-0.067
lves-0.066
Token of
Token ID286
Feature activation+1.21e+01

Pos logit contributions
 teaching+0.199
 showing+0.193
 diving+0.186
 example+0.184
 giving+0.180
Neg logit contributions
Max-0.333
ens-0.313
Brown-0.304
Snow-0.304
Spot-0.292
Token love
Token ID1842
Feature activation+1.75e+01

Pos logit contributions
 teaching+0.190
 showing+0.186
 liking+0.175
 giving+0.173
 example+0.172
Neg logit contributions
ens-0.238
Max-0.235
Snow-0.224
Spot-0.223
 Snow-0.216
Token.
Token ID13
Feature activation-2.89e+00

Pos logit contributions
 teaching+0.259
 showing+0.251
 diving+0.241
 example+0.238
 giving+0.237
Neg logit contributions
Max-0.422
ens-0.399
Brown-0.390
Snow-0.384
Spot-0.368
Token
Token ID220
Feature activation-6.14e+00

Pos logit contributions
Brown+0.074
Mic+0.073
Max+0.071
lves+0.071
Snow+0.070
Neg logit contributions
 teaching-0.038
 showing-0.035
 diving-0.034
 giving-0.034
aps-0.034
Token
Token ID198
Feature activation-5.14e+00

Pos logit contributions
ens+0.152
lves+0.150
Max+0.149
Mic+0.149
Brown+0.148
Neg logit contributions
 teaching-0.083
aps-0.081
 diving-0.076
 showing-0.076
 giving-0.075
Token the
Token ID262
Feature activation-1.32e+01

Pos logit contributions
 teaching+0.293
 showing+0.286
 diving+0.276
 giving+0.271
 example+0.263
Neg logit contributions
Max-0.536
Brown-0.499
ens-0.498
Snow-0.487
Mic-0.481
Token toy
Token ID13373
Feature activation-1.04e+01

Pos logit contributions
Max+0.300
ens+0.273
Brown+0.270
Snow+0.263
Mic+0.262
Neg logit contributions
 teaching-0.225
 showing-0.222
 diving-0.217
 giving-0.216
 example-0.211
Token car
Token ID1097
Feature activation-1.48e+01

Pos logit contributions
Max+0.263
Brown+0.245
Mic+0.245
Snow+0.244
ens+0.238
Neg logit contributions
 teaching-0.160
 showing-0.159
 diving-0.150
 giving-0.148
 show-0.147
Token go
Token ID467
Feature activation+1.52e+01

Pos logit contributions
Max+0.398
Brown+0.372
ens+0.372
Snow+0.363
Mic+0.358
Neg logit contributions
 teaching-0.189
 showing-0.184
 diving-0.177
 giving-0.173
 example-0.168
Token with
Token ID351
Feature activation+1.60e+01

Pos logit contributions
 teaching+0.178
 showing+0.174
 diving+0.166
 giving+0.162
 example+0.156
Neg logit contributions
Max-0.429
Brown-0.403
ens-0.401
Snow-0.393
Mic-0.390
Token speed
Token ID2866
Feature activation+9.51e+00

Pos logit contributions
 teaching+0.239
 showing+0.230
 diving+0.214
 show+0.212
 example+0.210
Neg logit contributions
Brown-0.407
Max-0.392
lves-0.385
Snow-0.385
ens-0.382
Token.
Token ID13
Feature activation-3.54e+00

Pos logit contributions
 teaching+0.099
 showing+0.098
 diving+0.093
 giving+0.089
 example+0.088
Neg logit contributions
Max-0.271
Mic-0.269
Brown-0.267
Snow-0.259
Nut-0.255
Token They
Token ID1119
Feature activation-1.34e+01

Pos logit contributions
Max+0.065
Brown+0.062
ens+0.062
lves+0.060
Snow+0.060
Neg logit contributions
 teaching-0.074
 showing-0.071
 giving-0.068
 diving-0.068
 example-0.068
Token threw
Token ID9617
Feature activation+1.51e+01

Pos logit contributions
Max+0.270
Brown+0.249
ens+0.246
Snow+0.240
Mic+0.238
Neg logit contributions
 teaching-0.262
 showing-0.259
 diving-0.251
 giving-0.247
 example-0.241
Token the
Token ID262
Feature activation-1.33e+01

Pos logit contributions
 teaching+0.206
 showing+0.206
 diving+0.198
 example+0.190
 giving+0.189
Neg logit contributions
Max-0.387
Brown-0.379
Snow-0.367
ens-0.363
Mic-0.363
Token ball
Token ID2613
Feature activation-1.06e+01

Pos logit contributions
Max+0.306
Brown+0.285
ens+0.282
Snow+0.279
Mic+0.275
Neg logit contributions
 teaching-0.226
 showing-0.224
 diving-0.218
 giving-0.213
 show-0.204
Token.
Token ID13
Feature activation-3.01e+00

Pos logit contributions
 teaching+0.130
 showing+0.125
 example+0.124
 diving+0.123
 humor+0.121
Neg logit contributions
Max-0.157
ens-0.147
Snow-0.144
Brown-0.144
Spot-0.135
Token They
Token ID1119
Feature activation-1.29e+01

Pos logit contributions
Max+0.066
ens+0.065
lves+0.065
Brown+0.064
Mic+0.063
Neg logit contributions
 teaching-0.050
 showing-0.049
 giving-0.047
 diving-0.047
 example-0.045
Token loved
Token ID6151
Feature activation+1.96e+01

Pos logit contributions
Max+0.292
Brown+0.276
Mic+0.268
lves+0.262
ens+0.258
Neg logit contributions
 showing-0.218
 teaching-0.215
 show-0.210
 giving-0.208
 diving-0.203
Token each
Token ID1123
Feature activation-6.67e+00

Pos logit contributions
 teaching+0.321
 showing+0.315
 diving+0.305
 giving+0.300
 example+0.293
Neg logit contributions
Max-0.460
Brown-0.426
ens-0.425
Snow-0.414
Mic-0.409
Token other
Token ID584
Feature activation-7.17e+00

Pos logit contributions
Max+0.172
Brown+0.161
ens+0.159
Snow+0.156
Mic+0.156
Neg logit contributions
 teaching-0.095
 showing-0.093
 diving-0.088
 giving-0.088
 example-0.085
Token very
Token ID845
Feature activation+8.75e+00

Pos logit contributions
Max+0.189
Brown+0.177
ens+0.177
Snow+0.173
Mic+0.171
Neg logit contributions
 teaching-0.095
 showing-0.093
 diving-0.089
 giving-0.088
 example-0.085
Token much
Token ID881
Feature activation+2.74e+00

Pos logit contributions
 teaching+0.112
 showing+0.109
 diving+0.107
 giving+0.104
 example+0.101
Neg logit contributions
Max-0.236
Brown-0.219
ens-0.218
Mic-0.216
Snow-0.212
Token.
Token ID13
Feature activation-3.11e+00

Pos logit contributions
 teaching+0.039
 showing+0.038
 diving+0.037
 giving+0.036
 example+0.035
Neg logit contributions
Max-0.070
Brown-0.065
ens-0.065
Snow-0.063
Mic-0.063
Token<|endoftext|>
Token ID50256
Feature activation-4.48e+00

Pos logit contributions
lves+0.070
Max+0.070
ens+0.069
Brown+0.068
Mic+0.067
Neg logit contributions
 teaching-0.050
 showing-0.048
 giving-0.046
 diving-0.045
 sharing-0.044
TokenTom
Token ID13787
Feature activation-2.13e+01

Pos logit contributions
Max+0.074
Brown+0.069
ens+0.068
lves+0.066
Snow+0.066
Neg logit contributions
 teaching-0.101
 showing-0.100
 diving-0.099
 giving-0.096
 serving-0.094
Token was
Token ID373
Feature activation+9.40e+00

Pos logit contributions
Max+0.538
Brown+0.534
lves+0.530
Mic+0.523
Nut+0.509
Neg logit contributions
 teaching-0.283
 showing-0.280
 giving-0.269
 diving-0.257
 show-0.257
Token and
Token ID290
Feature activation+2.50e+00

Pos logit contributions
 teaching+0.224
 showing+0.219
 giving+0.208
 diving+0.201
 example+0.197
Neg logit contributions
Max-0.454
Brown-0.431
Mic-0.423
Snow-0.420
lves-0.419
Token kindness
Token ID23887
Feature activation+3.72e+00

Pos logit contributions
 diving+0.039
 teaching+0.038
 showing+0.037
 example+0.036
 liking+0.035
Neg logit contributions
Max-0.057
ens-0.053
Brown-0.052
Snow-0.050
Mic-0.049
Token in
Token ID287
Feature activation+8.29e+00

Pos logit contributions
 teaching+0.053
 showing+0.051
 giving+0.050
 diving+0.048
 show+0.046
Neg logit contributions
Max-0.096
Brown-0.091
Mic-0.089
Snow-0.088
lves-0.088
Token a
Token ID257
Feature activation-6.60e+00

Pos logit contributions
 teaching+0.134
 diving+0.133
 showing+0.133
 giving+0.126
 example+0.123
Neg logit contributions
Max-0.189
ens-0.174
Brown-0.173
Snow-0.168
Mic-0.166
Token very
Token ID845
Feature activation+8.46e+00

Pos logit contributions
Max+0.170
Brown+0.158
ens+0.154
Snow+0.154
Mic+0.153
Neg logit contributions
 teaching-0.088
 showing-0.087
 diving-0.085
 giving-0.081
 example-0.080
Token special
Token ID2041
Feature activation+1.82e+00

Pos logit contributions
 teaching+0.126
 showing+0.121
 diving+0.115
 giving+0.114
 joining+0.111
Neg logit contributions
Max-0.203
ens-0.192
Brown-0.190
Snow-0.187
lves-0.183
Token way
Token ID835
Feature activation+7.92e+00

Pos logit contributions
 teaching+0.026
 showing+0.025
 diving+0.025
 giving+0.023
 liking+0.023
Neg logit contributions
Max-0.044
Brown-0.041
ens-0.041
Snow-0.040
Mic-0.039
Token.
Token ID13
Feature activation-3.75e+00

Pos logit contributions
 teaching+0.094
 showing+0.091
 giving+0.087
 diving+0.086
 example+0.081
Neg logit contributions
Max-0.224
Brown-0.211
Mic-0.208
ens-0.207
Snow-0.206
Token<|endoftext|>
Token ID50256
Feature activation-4.72e+00

Pos logit contributions
Max+0.091
lves+0.089
Brown+0.088
Snow+0.088
ens+0.088
Neg logit contributions
 teaching-0.059
 showing-0.053
 giving-0.052
 serving-0.050
 learning-0.050
TokenOnce
Token ID7454
Feature activation+3.39e+00

Pos logit contributions
lves+0.108
Brown+0.107
Max+0.103
ens+0.103
Mic+0.103
Neg logit contributions
 teaching-0.072
 diving-0.071
 showing-0.070
 giving-0.066
 planning-0.065
Token upon
Token ID2402
Feature activation-4.31e+00

Pos logit contributions
 teaching+0.049
 diving+0.049
 showing+0.048
 giving+0.046
 show+0.045
Neg logit contributions
Max-0.084
Brown-0.082
Snow-0.081
Mic-0.079
Nut-0.076
Token and
Token ID290
Feature activation+5.61e+00

Pos logit contributions
 teaching+0.219
 showing+0.212
 example+0.208
 giving+0.205
 humor+0.204
Neg logit contributions
Max-0.469
Brown-0.424
ens-0.422
Snow-0.419
ety-0.406
Token play
Token ID711
Feature activation+1.52e+01

Pos logit contributions
 example+0.079
 teaching+0.075
 liking+0.072
 showing+0.071
 giving+0.070
Neg logit contributions
Max-0.138
Brown-0.123
ens-0.122
Blue-0.121
Mic-0.118
Token with
Token ID351
Feature activation+1.93e+01

Pos logit contributions
 teaching+0.210
 giving+0.202
 showing+0.201
 diving+0.195
 example+0.193
Neg logit contributions
Max-0.392
ens-0.357
Brown-0.349
Snow-0.348
Spot-0.343
Token him
Token ID683
Feature activation-1.31e+01

Pos logit contributions
 teaching+0.260
 showing+0.260
 diving+0.251
 show+0.240
 giving+0.235
Neg logit contributions
Max-0.514
Brown-0.499
Snow-0.478
Mic-0.478
lves-0.474
Token.
Token ID13
Feature activation-6.00e-01

Pos logit contributions
Max+0.319
ens+0.298
Brown+0.294
Snow+0.286
Mic+0.281
Neg logit contributions
 teaching-0.194
 showing-0.189
 diving-0.183
 giving-0.181
 example-0.179
Token They
Token ID1119
Feature activation-1.09e+01

Pos logit contributions
Brown+0.015
Snow+0.014
Max+0.014
mi+0.014
lves+0.014
Neg logit contributions
 teaching-0.009
 showing-0.008
 diving-0.008
 show-0.008
 giving-0.008
Token played
Token ID2826
Feature activation+1.92e+01

Pos logit contributions
Max+0.293
Brown+0.283
Snow+0.275
ens+0.270
Mic+0.269
Neg logit contributions
 teaching-0.142
 showing-0.141
 diving-0.132
 giving-0.130
 show-0.127
Token games
Token ID1830
Feature activation+2.39e+00

Pos logit contributions
 teaching+0.263
 giving+0.259
 showing+0.251
 liking+0.245
 example+0.241
Neg logit contributions
Max-0.498
ens-0.445
Brown-0.436
Spot-0.435
Snow-0.432
Token and
Token ID290
Feature activation+5.15e+00

Pos logit contributions
 teaching+0.036
 showing+0.035
 diving+0.034
 giving+0.033
 example+0.033
Neg logit contributions
Max-0.059
ens-0.054
Brown-0.053
Snow-0.052
Mic-0.051
Token had
Token ID550
Feature activation+9.56e+00

Pos logit contributions
 showing+0.099
 teaching+0.099
 diving+0.096
 giving+0.094
 guessing+0.090
Neg logit contributions
Max-0.105
Brown-0.099
ens-0.096
Snow-0.096
Mic-0.095
Token a
Token ID257
Feature activation-4.09e+00

Pos logit contributions
 teaching+0.094
 showing+0.094
 diving+0.092
 giving+0.082
 guessing+0.082
Neg logit contributions
Max-0.283
Brown-0.278
ens-0.277
uffy-0.269
lves-0.267
Token mom
Token ID1995
Feature activation-1.81e+01

Pos logit contributions
Max+0.246
ens+0.220
Brown+0.213
Snow+0.205
Mic+0.201
Neg logit contributions
 teaching-0.313
 showing-0.309
 diving-0.303
 giving-0.301
 example-0.297
Token smiled
Token ID13541
Feature activation+8.41e+00

Pos logit contributions
Max+0.385
Brown+0.355
ens+0.353
Snow+0.344
Mic+0.340
Neg logit contributions
 teaching-0.333
 showing-0.327
 diving-0.317
 giving-0.314
 example-0.306
Token and
Token ID290
Feature activation+2.32e+00

Pos logit contributions
 teaching+0.136
 showing+0.134
 diving+0.132
 giving+0.127
 example+0.126
Neg logit contributions
Max-0.197
ens-0.181
Brown-0.179
Snow-0.176
Mic-0.172
Token gave
Token ID2921
Feature activation+2.20e+01

Pos logit contributions
 teaching+0.043
 showing+0.043
 giving+0.042
 diving+0.041
 show+0.039
Neg logit contributions
Max-0.049
Brown-0.046
ens-0.045
Mic-0.044
Snow-0.044
Token him
Token ID683
Feature activation-1.64e+01

Pos logit contributions
 for+0.220
 to+0.215
 about+0.213
 helping+0.212
 show+0.208
Neg logit contributions
Brown-0.635
Snow-0.619
Max-0.606
pillar-0.594
uffy-0.574
Token a
Token ID257
Feature activation-6.70e+00

Pos logit contributions
Max+0.482
Brown+0.472
Mic+0.454
lves+0.451
Snow+0.443
Neg logit contributions
 teaching-0.175
 showing-0.158
 diving-0.156
 example-0.155
 giving-0.150
Token big
Token ID1263
Feature activation-2.02e+00

Pos logit contributions
Max+0.147
ens+0.136
Snow+0.130
Brown+0.130
Mic+0.129
Neg logit contributions
 showing-0.116
 teaching-0.114
 example-0.114
 succeeding-0.111
 diving-0.109
Token hug
Token ID16225
Feature activation+1.28e+01

Pos logit contributions
ens+0.048
Max+0.047
Snow+0.042
Brown+0.042
Mic+0.042
Neg logit contributions
 showing-0.030
 teaching-0.030
 succeeding-0.029
 diving-0.029
 winning-0.029
Token.
Token ID13
Feature activation-3.63e+00

Pos logit contributions
 teaching+0.182
 showing+0.178
 diving+0.172
 giving+0.169
 example+0.164
Neg logit contributions
Max-0.325
Brown-0.302
ens-0.301
Snow-0.295
Mic-0.291
Token<|endoftext|>
Token ID50256
Feature activation-4.63e+00

Pos logit contributions
Max+0.076
Brown+0.071
lves+0.070
Snow+0.069
Mic+0.069
Neg logit contributions
 teaching-0.067
 showing-0.066
 giving-0.064
 diving-0.062
 liking-0.061
TokenOne
Token ID3198
Feature activation-2.17e+00

Pos logit contributions
Max+0.094
Brown+0.088
ens+0.087
Snow+0.085
Mic+0.084
Neg logit contributions
 teaching-0.088
 showing-0.086
 diving-0.084
 giving-0.082
 example-0.080
Token he
Token ID339
Feature activation-1.70e+01

Pos logit contributions
 teaching+0.076
 showing+0.075
 diving+0.074
 giving+0.072
 example+0.069
Neg logit contributions
Max-0.099
Brown-0.094
ens-0.092
Mic-0.091
Snow-0.091
Token realized
Token ID6939
Feature activation+1.30e+01

Pos logit contributions
Max+0.371
Brown+0.350
ens+0.342
Snow+0.337
Mic+0.336
Neg logit contributions
 teaching-0.302
 showing-0.299
 diving-0.293
 giving-0.285
 example-0.276
Token it
Token ID340
Feature activation-1.31e+01

Pos logit contributions
 teaching+0.183
 showing+0.181
 diving+0.179
 giving+0.175
 helping+0.160
Neg logit contributions
Max-0.340
Brown-0.322
Mic-0.317
ens-0.313
Snow-0.309
Token was
Token ID373
Feature activation+1.11e+01

Pos logit contributions
Max+0.320
Brown+0.320
Mic+0.315
Snow+0.312
Blue+0.297
Neg logit contributions
 showing-0.195
 teaching-0.193
 diving-0.186
 giving-0.185
 winning-0.176
Token too
Token ID1165
Feature activation+4.27e+00

Pos logit contributions
 teaching+0.146
 showing+0.141
 example+0.134
 diving+0.134
aps+0.131
Neg logit contributions
Max-0.280
ens-0.262
Brown-0.253
Snow-0.253
Spot-0.246
Token tall
Token ID7331
Feature activation+1.06e+00

Pos logit contributions
 teaching+0.049
 diving+0.048
 showing+0.048
 giving+0.046
 example+0.044
Neg logit contributions
Max-0.119
Brown-0.114
ens-0.111
Mic-0.109
Snow-0.109
Token.
Token ID13
Feature activation-2.27e+00

Pos logit contributions
 teaching+0.016
 showing+0.016
 diving+0.015
 giving+0.015
 example+0.015
Neg logit contributions
Max-0.025
ens-0.024
Brown-0.023
Snow-0.023
Mic-0.022
Token He
Token ID679
Feature activation-1.62e+01

Pos logit contributions
Brown+0.054
Snow+0.053
Max+0.053
ens+0.053
lves+0.053
Neg logit contributions
 teaching-0.031
 example-0.029
 winning-0.029
 diving-0.029
 showing-0.029
Token could
Token ID714
Feature activation+1.34e+01

Pos logit contributions
Max+0.373
Brown+0.350
ens+0.348
Snow+0.341
Mic+0.339
Neg logit contributions
 teaching-0.266
 showing-0.265
 diving-0.256
 giving-0.250
 example-0.245
Token not
Token ID407
Feature activation+8.79e+00

Pos logit contributions
 teaching+0.159
 showing+0.153
 diving+0.147
 giving+0.145
 example+0.144
Neg logit contributions
Max-0.371
ens-0.347
Brown-0.344
Snow-0.336
Mic-0.330
Token jump
Token ID4391
Feature activation+1.25e+01

Pos logit contributions
 teaching+0.116
 showing+0.111
 diving+0.108
 giving+0.105
 example+0.104
Neg logit contributions
Max-0.231
ens-0.217
Brown-0.215
Snow-0.210
Mic-0.208
Token nut
Token ID6701
Feature activation-8.99e+00

Pos logit contributions
 teaching+0.171
 showing+0.165
 diving+0.162
 joining+0.159
 giving+0.159
Neg logit contributions
Max-0.108
ens-0.097
Brown-0.096
Mic-0.093
Snow-0.092
Token!"
Token ID2474
Feature activation+3.94e-03

Pos logit contributions
Max+0.223
ens+0.208
Brown+0.208
Snow+0.202
Mic+0.200
Neg logit contributions
 teaching-0.133
 showing-0.129
 diving-0.125
 giving-0.123
 example-0.120
Token Nut
Token ID11959
Feature activation-2.41e+01

Pos logit contributions
 teaching+0.000
 showing+0.000
 giving+0.000
 diving+0.000
 example+0.000
Neg logit contributions
Max-0.000
Brown-0.000
ens-0.000
Mic-0.000
Snow-0.000
Tokenty
Token ID774
Feature activation-2.06e+01

Pos logit contributions
Max+0.349
lves+0.330
Brown+0.324
ens+0.316
Snow+0.314
Neg logit contributions
 showing-0.565
 teaching-0.563
 giving-0.542
 helping-0.521
 diving-0.519
Token's
Token ID338
Feature activation-3.09e+00

Pos logit contributions
Max+0.439
Brown+0.408
ens+0.404
Snow+0.398
Mic+0.392
Neg logit contributions
 teaching-0.383
 showing-0.378
 giving-0.361
 diving-0.356
 example-0.353
Token friends
Token ID2460
Feature activation-7.82e+00

Pos logit contributions
Max+0.067
ens+0.062
Brown+0.060
Mic+0.060
Snow+0.059
Neg logit contributions
 teaching-0.054
 showing-0.053
 diving-0.051
 giving-0.050
 serving-0.050
Token came
Token ID1625
Feature activation+1.35e+01

Pos logit contributions
Max+0.167
ens+0.153
Brown+0.153
Snow+0.148
Mic+0.145
Neg logit contributions
 teaching-0.142
 showing-0.139
 diving-0.138
 example-0.134
 giving-0.133
Token to
Token ID284
Feature activation+1.01e+01

Pos logit contributions
 humor+0.196
 accomplishment+0.192
 succeeding+0.185
 teaching+0.184
 diving+0.179
Neg logit contributions
Max-0.336
Brown-0.299
ens-0.289
 Bun-0.289
K-0.286
Token help
Token ID1037
Feature activation+1.90e+01

Pos logit contributions
 teaching+0.095
 showing+0.092
 diving+0.086
 giving+0.085
 example+0.080
Neg logit contributions
Max-0.306
Brown-0.290
ens-0.288
Snow-0.283
Mic-0.281
Token and
Token ID290
Feature activation+2.81e+00

Pos logit contributions
 teaching+0.334
 showing+0.332
 giving+0.325
 helping+0.308
 diving+0.306
Neg logit contributions
Max-0.423
Brown-0.401
Snow-0.383
Mic-0.375
lves-0.374
Token they
Token ID484
Feature activation-1.59e+01

Pos logit contributions
 teaching+0.055
 showing+0.054
 diving+0.054
 giving+0.052
 example+0.052
Neg logit contributions
Max-0.056
ens-0.051
Brown-0.051
Snow-0.049
Mic-0.048
Token her
Token ID607
Feature activation-1.25e+01

Pos logit contributions
 showing+0.291
 teaching+0.286
 show+0.269
 giving+0.269
 helping+0.263
Neg logit contributions
Max-0.523
Brown-0.496
lves-0.495
Mic-0.493
issy-0.484
Token friends
Token ID2460
Feature activation-4.43e+00

Pos logit contributions
Max+0.304
Brown+0.283
ens+0.282
Snow+0.278
Mic+0.276
Neg logit contributions
 teaching-0.195
 showing-0.193
 diving-0.182
 giving-0.181
 show-0.175
Token on
Token ID319
Feature activation+1.57e+01

Pos logit contributions
Max+0.118
Brown+0.112
ens+0.111
Snow+0.110
Mic+0.109
Neg logit contributions
 teaching-0.059
 showing-0.057
 giving-0.054
 diving-0.053
 example-0.050
Token Sunday
Token ID3502
Feature activation-9.00e+00

Pos logit contributions
 teaching+0.201
 showing+0.200
 diving+0.192
 giving+0.184
 show+0.178
Neg logit contributions
Max-0.422
Brown-0.402
ens-0.400
lves-0.394
Mic-0.391
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
Max+0.245
Brown+0.232
ens+0.227
Mic+0.227
Snow+0.226
Neg logit contributions
 teaching-0.115
 showing-0.113
 diving-0.107
 giving-0.105
 serving-0.100
Token
Token ID198
Feature activation-2.53e+00

Pos logit contributions
 teaching+0.002
 showing+0.002
 giving+0.002
 about+0.002
 sharing+0.002
Neg logit contributions
lves-0.005
Brown-0.005
Max-0.005
mi-0.005
Mic-0.005
Token
Token ID198
Feature activation-2.59e+00

Pos logit contributions
Max+0.062
ens+0.060
Brown+0.060
lves+0.059
Mic+0.058
Neg logit contributions
 teaching-0.036
 showing-0.035
 diving-0.034
 giving-0.032
 guessing-0.032
TokenLu
Token ID25596
Feature activation-2.17e+01

Pos logit contributions
ens+0.053
lves+0.050
Max+0.048
ety+0.048
uffy+0.048
Neg logit contributions
 teaching-0.048
 showing-0.047
 diving-0.044
 example-0.044
 guessing-0.044
Tokency
Token ID948
Feature activation-1.54e+01

Pos logit contributions
Nut+0.415
Brown+0.414
Snow+0.410
Mic+0.404
Max+0.404
Neg logit contributions
 teaching-0.404
 example-0.384
aps-0.380
 showing-0.379
 planning-0.377
Token went
Token ID1816
Feature activation+1.75e+01

Pos logit contributions
Max+0.342
Brown+0.324
Mic+0.318
Snow+0.311
lves+0.308
Neg logit contributions
 teaching-0.271
 showing-0.268
 giving-0.254
 diving-0.253
 example-0.249
Token to
Token ID284
Feature activation+1.28e+01

Pos logit contributions
 humor+0.292
 succeeding+0.282
 accomplishment+0.280
 example+0.269
ulp+0.268
Neg logit contributions
Max-0.394
ens-0.371
Snow-0.341
Spot-0.338
K-0.332
Token to
Token ID284
Feature activation+9.91e+00

Pos logit contributions
Max+0.456
Brown+0.426
Mic+0.421
ens+0.417
Snow+0.415
Neg logit contributions
 teaching-0.190
 showing-0.184
 giving-0.176
 diving-0.173
 example-0.166
Token clean
Token ID3424
Feature activation+1.28e+01

Pos logit contributions
 diving+0.142
 teaching+0.139
 showing+0.132
 example+0.131
 serving+0.128
Neg logit contributions
Max-0.240
ens-0.224
Brown-0.219
Snow-0.216
K-0.210
Token it
Token ID340
Feature activation-1.45e+01

Pos logit contributions
 teaching+0.141
 showing+0.137
 diving+0.131
 giving+0.127
 example+0.123
Neg logit contributions
Max-0.370
Brown-0.348
ens-0.346
Snow-0.340
Mic-0.337
Token up
Token ID510
Feature activation+5.04e+00

Pos logit contributions
Max+0.365
ens+0.343
Brown+0.331
Snow+0.326
Fl+0.316
Neg logit contributions
 teaching-0.213
 showing-0.198
 diving-0.197
 humor-0.191
 giving-0.190
Token.
Token ID13
Feature activation-3.29e+00

Pos logit contributions
 teaching+0.083
 diving+0.081
 showing+0.080
 giving+0.076
 joining+0.076
Neg logit contributions
Max-0.114
ens-0.106
Brown-0.102
Snow-0.101
K-0.098
Token K
Token ID509
Feature activation-1.83e+01

Pos logit contributions
Max+0.055
Mic+0.050
ens+0.048
Brown+0.046
Snow+0.045
Neg logit contributions
 showing-0.077
 teaching-0.077
 giving-0.075
 diving-0.073
 example-0.073
Tokeniki
Token ID5580
Feature activation-2.22e+01

Pos logit contributions
Max+0.317
Brown+0.276
Mic+0.275
Snow+0.259
Blue+0.256
Neg logit contributions
 teaching-0.404
 diving-0.402
 example-0.402
 showing-0.402
 giving-0.395
Token begged
Token ID32946
Feature activation+1.52e+01

Pos logit contributions
Max+0.490
Brown+0.456
ens+0.446
Mic+0.445
Snow+0.444
Neg logit contributions
 teaching-0.394
 showing-0.387
 giving-0.368
 diving-0.366
 example-0.362
Token her
Token ID607
Feature activation-1.60e+01

Pos logit contributions
 teaching+0.293
 showing+0.288
 diving+0.287
 giving+0.273
 liking+0.271
Neg logit contributions
Max-0.306
ens-0.284
Brown-0.275
Snow-0.270
Mic-0.258
Token mom
Token ID1995
Feature activation-1.80e+01

Pos logit contributions
Max+0.355
ens+0.326
Brown+0.326
Snow+0.316
Mic+0.310
Neg logit contributions
 teaching-0.287
 showing-0.283
 diving-0.275
 giving-0.270
 example-0.265
Token to
Token ID284
Feature activation+9.79e+00

Pos logit contributions
Max+0.572
ens+0.542
Brown+0.539
Snow+0.528
Mic+0.522
Neg logit contributions
 teaching-0.141
 showing-0.137
 diving-0.129
 giving-0.123
 example-0.117
Token
Token ID220
Feature activation-5.27e+00

Pos logit contributions
Max+0.053
Brown+0.052
Snow+0.049
Mic+0.049
lves+0.049
Neg logit contributions
 teaching-0.033
 giving-0.031
 showing-0.031
 example-0.029
 planning-0.029
Token When
Token ID1649
Feature activation+4.66e+00

Pos logit contributions
lves+0.129
Brown+0.128
Max+0.128
ens+0.127
Mic+0.127
Neg logit contributions
aps-0.068
 teaching-0.068
 giving-0.061
 showing-0.060
 planning-0.059
Token she
Token ID673
Feature activation-1.87e+01

Pos logit contributions
 teaching+0.101
 showing+0.100
 diving+0.099
 giving+0.098
 show+0.095
Neg logit contributions
Max-0.086
Brown-0.081
Mic-0.079
lves-0.076
ens-0.075
Token got
Token ID1392
Feature activation+1.87e+01

Pos logit contributions
Max+0.332
ens+0.302
Brown+0.290
Snow+0.281
lves+0.276
Neg logit contributions
 teaching-0.405
 showing-0.393
 diving-0.387
 example-0.386
 giving-0.377
Token home
Token ID1363
Feature activation-9.54e-01

Pos logit contributions
 teaching+0.210
 showing+0.205
 diving+0.194
 giving+0.189
 example+0.185
Neg logit contributions
Max-0.535
ens-0.505
Brown-0.500
Snow-0.490
Mic-0.484
Token she
Token ID673
Feature activation-1.88e+01

Pos logit contributions
Max+0.022
ens+0.021
Brown+0.020
Snow+0.020
lves+0.019
Neg logit contributions
 teaching-0.015
 showing-0.015
 diving-0.014
 example-0.014
 liking-0.014
Token unloaded
Token ID49975
Feature activation+1.78e+01

Pos logit contributions
Max+0.373
ens+0.340
Brown+0.338
Snow+0.327
Mic+0.321
Neg logit contributions
 teaching-0.368
 showing-0.361
 diving-0.354
 giving-0.347
 example-0.346
Token everything
Token ID2279
Feature activation-8.94e+00

Pos logit contributions
 teaching+0.251
 showing+0.249
 diving+0.238
 giving+0.236
 example+0.226
Neg logit contributions
Max-0.460
Brown-0.431
ens-0.421
Mic-0.418
Snow-0.417
Token from
Token ID422
Feature activation+1.40e+01

Pos logit contributions
Max+0.199
Brown+0.186
Mic+0.184
Snow+0.180
ens+0.176
Neg logit contributions
 showing-0.160
 teaching-0.160
 giving-0.155
 diving-0.154
 show-0.148
Token the
Token ID262
Feature activation-1.18e+01

Pos logit contributions
 teaching+0.221
 showing+0.213
 diving+0.211
 giving+0.204
 example+0.202
Neg logit contributions
Max-0.328
ens-0.306
Brown-0.302
Snow-0.294
Mic-0.288
Token basket
Token ID7988
Feature activation-5.25e+00

Pos logit contributions
Max+0.276
Brown+0.261
ens+0.256
Mic+0.250
Snow+0.249
Neg logit contributions
 teaching-0.196
 showing-0.194
 show-0.185
 giving-0.184
 diving-0.183
Tokenles
Token ID829
Feature activation-4.26e+00

Pos logit contributions
 teaching+0.127
 showing+0.126
 diving+0.124
 giving+0.122
 example+0.120
Neg logit contributions
Max-0.099
Brown-0.089
ens-0.085
Snow-0.085
Mic-0.084
Token.
Token ID13
Feature activation-3.35e+00

Pos logit contributions
Max+0.100
ens+0.094
Brown+0.092
Snow+0.091
Mic+0.087
Neg logit contributions
 teaching-0.066
 showing-0.064
 example-0.061
 diving-0.061
 giving-0.061
Token
Token ID220
Feature activation-6.58e+00

Pos logit contributions
Max+0.074
Brown+0.071
ens+0.070
lves+0.070
Mic+0.070
Neg logit contributions
 teaching-0.057
 showing-0.056
 diving-0.055
 giving-0.054
 show-0.052
Token
Token ID198
Feature activation-5.62e+00

Pos logit contributions
Max+0.158
Brown+0.157
ens+0.157
lves+0.157
Mic+0.155
Neg logit contributions
 teaching-0.090
aps-0.086
 showing-0.084
 diving-0.083
 about-0.081
Token
Token ID198
Feature activation-5.63e+00

Pos logit contributions
Max+0.119
ens+0.117
Brown+0.113
lves+0.112
Mic+0.109
Neg logit contributions
 teaching-0.097
 showing-0.095
 diving-0.093
 giving-0.091
 example-0.089
TokenThe
Token ID464
Feature activation-1.52e+01

Pos logit contributions
ens+0.118
uffy+0.108
lves+0.108
mi+0.106
ley+0.102
Neg logit contributions
 teaching-0.102
 showing-0.100
 example-0.097
 liking-0.096
 giving-0.095
Token bride
Token ID26619
Feature activation-1.12e+01

Pos logit contributions
Max+0.240
Snow+0.225
ens+0.215
Brown+0.212
Spot+0.207
Neg logit contributions
 teaching-0.336
 succeeding-0.329
 showing-0.324
 diving-0.322
 counting-0.320
Token and
Token ID290
Feature activation+2.59e+00

Pos logit contributions
Max+0.295
Mic+0.288
Brown+0.286
Snow+0.275
Nut+0.274
Neg logit contributions
 showing-0.156
 teaching-0.153
 giving-0.148
 show-0.140
 diving-0.132
Token groom
Token ID32443
Feature activation-1.98e+00

Pos logit contributions
 showing+0.036
 teaching+0.035
 giving+0.035
 diving+0.033
 show+0.033
Neg logit contributions
Max-0.069
Brown-0.066
uffy-0.063
Snow-0.063
elope-0.062
Token saw
Token ID2497
Feature activation+1.44e+01

Pos logit contributions
Max+0.044
Brown+0.042
Mic+0.041
ens+0.040
lves+0.039
Neg logit contributions
 showing-0.035
 teaching-0.035
 giving-0.033
 diving-0.032
 show-0.031
Token the
Token ID262
Feature activation-1.30e+01

Pos logit contributions
 teaching+0.176
 showing+0.174
 giving+0.168
 diving+0.163
 helping+0.153
Neg logit contributions
Max-0.417
Brown-0.405
Mic-0.395
Snow-0.388
Rust-0.379
Token It
Token ID632
Feature activation-1.37e+01

Pos logit contributions
Brown+0.080
Max+0.079
lves+0.078
Snow+0.076
issy+0.075
Neg logit contributions
 showing-0.058
 teaching-0.057
 giving-0.055
 show-0.053
 diving-0.053
Token hurt
Token ID5938
Feature activation+8.99e+00

Pos logit contributions
Max+0.315
ens+0.298
Mic+0.295
Brown+0.293
lves+0.291
Neg logit contributions
 showing-0.221
 teaching-0.213
 giving-0.208
 diving-0.203
 show-0.201
Token a
Token ID257
Feature activation-6.66e+00

Pos logit contributions
 teaching+0.096
 showing+0.095
 giving+0.094
 liking+0.089
 helping+0.089
Neg logit contributions
lves-0.244
Max-0.242
Brown-0.242
Mic-0.240
uffy-0.239
Token lot
Token ID1256
Feature activation+6.78e+00

Pos logit contributions
Max+0.133
Brown+0.122
Snow+0.121
Mic+0.118
ens+0.114
Neg logit contributions
 teaching-0.133
 diving-0.128
 showing-0.127
 example-0.125
 serving-0.123
Token and
Token ID290
Feature activation+2.15e+00

Pos logit contributions
 teaching+0.097
 showing+0.095
 diving+0.091
 giving+0.090
 example+0.087
Neg logit contributions
Max-0.173
Brown-0.161
ens-0.160
Snow-0.157
Mic-0.155
Token she
Token ID673
Feature activation-2.03e+01

Pos logit contributions
 showing+0.042
 teaching+0.041
 giving+0.040
 diving+0.040
 show+0.039
Neg logit contributions
Max-0.043
Brown-0.041
Mic-0.040
Snow-0.039
lves-0.038
Token started
Token ID2067
Feature activation+1.62e+01

Pos logit contributions
Max+0.420
Brown+0.385
ens+0.384
Snow+0.372
Mic+0.370
Neg logit contributions
 teaching-0.384
 showing-0.382
 diving-0.367
 giving-0.364
 example-0.353
Token to
Token ID284
Feature activation+9.30e+00

Pos logit contributions
 example+0.232
 contain+0.215
 welcome+0.213
ags+0.209
 humor+0.208
Neg logit contributions
Max-0.372
ens-0.340
Nut-0.338
 Bun-0.334
Brown-0.333
Token cry
Token ID3960
Feature activation+7.17e+00

Pos logit contributions
 teaching+0.162
 showing+0.156
 example+0.155
 diving+0.154
 giving+0.148
Neg logit contributions
Max-0.209
ens-0.195
Brown-0.192
Snow-0.186
Mic-0.184
Token.
Token ID13
Feature activation-3.62e+00

Pos logit contributions
 teaching+0.086
 showing+0.085
 giving+0.082
 diving+0.079
 liking+0.076
Neg logit contributions
Max-0.199
Brown-0.187
Mic-0.185
ens-0.185
lves-0.182
Token
Token ID198
Feature activation-5.98e+00

Pos logit contributions
Brown+0.087
Max+0.086
lves+0.085
Snow+0.084
issy+0.083
Neg logit contributions
 showing-0.051
 teaching-0.050
 giving-0.047
 show-0.046
 liking-0.045
Token playing
Token ID2712
Feature activation+1.22e+01

Pos logit contributions
 teaching+0.196
 showing+0.190
 diving+0.182
 giving+0.177
 example+0.173
Neg logit contributions
Max-0.484
Brown-0.454
ens-0.454
Snow-0.444
Mic-0.439
Token.
Token ID13
Feature activation-3.14e+00

Pos logit contributions
 teaching+0.168
 showing+0.159
 diving+0.154
 giving+0.154
 example+0.153
Neg logit contributions
Max-0.314
ens-0.295
Brown-0.284
Snow-0.282
Spot-0.277
Token
Token ID198
Feature activation-5.81e+00

Pos logit contributions
Max+0.072
Brown+0.070
lves+0.070
Snow+0.069
ens+0.068
Neg logit contributions
 teaching-0.046
 showing-0.046
 diving-0.044
 giving-0.044
 liking-0.042
Token
Token ID198
Feature activation-5.83e+00

Pos logit contributions
Max+0.125
ens+0.120
Brown+0.119
lves+0.118
Snow+0.116
Neg logit contributions
 teaching-0.098
 showing-0.096
 diving-0.094
 giving-0.091
 liking-0.089
Token"
Token ID1
Feature activation-6.84e+00

Pos logit contributions
ens+0.109
Max+0.104
lves+0.103
issy+0.103
mi+0.100
Neg logit contributions
 showing-0.108
 teaching-0.108
 liking-0.105
 diving-0.104
 giving-0.103
TokenL
Token ID43
Feature activation-1.74e+01

Pos logit contributions
lves+0.104
issy+0.104
ley+0.100
uffy+0.098
iper+0.098
Neg logit contributions
 showing-0.147
 teaching-0.143
 liking-0.138
 giving-0.138
 joining-0.132
Tokenily
Token ID813
Feature activation-2.20e+01

Pos logit contributions
Max+0.337
ens+0.315
Brown+0.298
Snow+0.292
Mic+0.288
Neg logit contributions
 teaching-0.355
 showing-0.351
 diving-0.341
 giving-0.337
 serving-0.325
Token,
Token ID11
Feature activation+3.12e+00

Pos logit contributions
Max+0.704
Brown+0.683
Mic+0.683
Snow+0.670
lves+0.660
Neg logit contributions
 showing-0.180
 teaching-0.177
 giving-0.175
 show-0.158
 helping-0.150
Token please
Token ID3387
Feature activation+5.11e+00

Pos logit contributions
 showing+0.038
 show+0.038
 diving+0.038
 giving+0.037
 teaching+0.037
Neg logit contributions
Mic-0.084
Max-0.083
Brown-0.082
lves-0.082
Red-0.078
Token come
Token ID1282
Feature activation+1.43e+01

Pos logit contributions
 teaching+0.074
 showing+0.074
 diving+0.071
 giving+0.071
 show+0.068
Neg logit contributions
Max-0.130
Brown-0.121
Mic-0.119
ens-0.119
Snow-0.117
Token out
Token ID503
Feature activation+2.27e+00

Pos logit contributions
 teaching+0.192
 showing+0.186
 diving+0.178
 giving+0.175
 example+0.171
Neg logit contributions
Max-0.378
Brown-0.354
ens-0.351
Snow-0.344
Mic-0.337