TOP ACTIVATIONS
MAX = 39.733

Let 's pretend they are going to a party ," Mia
and they said goodbye by giving each other a wink and
. Ben felt relaxed and energ ized after doing the moves
to know where he was going . She put on her
me , thank you for giving me such good advice !
the rough path but kept going , just like Lucy .
write her story and kept going . This time she wrote
, pushing her away and giving Lily all the sunshine .
around Fre ck les , giving her hugs and pet ting
bike around all day , going up and down her street
my pretended his car was going really fast and he said
They had been trying to wake her up but Molly had
special polish he couldn 't contain his happiness . He began
<|endoftext|> Tom and Lily were going to visit their grandma for
very excited because he was going on a special saf ari
!" But the children kept going . The sw an grew
" We are going to the zoo ," Lily
They wait for mom to wake up . They are quiet
Yes , and we can give him a scarf too ,"
be safe , instead of going ahead and touching it ."

MIN ACTIVATIONS
MIN = -29.253

. Su zie smiled . She and her
So Bob and Jill left the broken pick behind
being fair . <|endoftext|> Su zie loved to swim . She
all along . <|endoftext|> Su zie was a 3 year old
” But Daisy was determined . She opened
a very lovely dream , Amy . Now go to sleep
” Man uel shook his head . He
meet you . ” Daisy was surprised and smiled .
also very happy to meet Daisy , and it recognized her
. So , Amy and her mum continued walking
touched his huge body . Daisy and the monster soon became
the flexible band . Little Judy was so proud that they
ended ! Su zie was sad that she no
s the story of Sam and her travels
year - old girl named Lisa decided to take the challenge
. Little Daisy was disappointed but her mom
Oh wow , ” Jill said ,
the sweet cher ries . Daisy and Tom had the best
. Su zie was so disappointed and tried
s okay , Jim . I can help you

INTERVAL 22.487 - 39.733
CONTAINS 0.390%

what the machine wanted to teach her . The voice replied
nail out . Lib by watched as the clear water ran
swing . Thank you for helping me . You are a
life jackets . They also gave them a flashlight and a
waking up every morning to keep the house clean and everyone

INTERVAL 5.240 - 22.487
CONTAINS 33.586%

man asks . " You should not go near the rh
there was a man who wanted to build something . He
, but her mom my explained that the doctors were going
say sorry to Lily and give her back the bow .
she could keep the scissors safe . <|endoftext|> Once upon a

INTERVAL -12.006 - 5.240
CONTAINS 49.172%

were happy and lived together forever . <|endoftext|> Once upon a
They all played on the swing together and had a
the swamp ?". The duck lings replied ,
. And they both went on their way ,
It was deep and dark , and Lily was scared .

INTERVAL -29.253 - -12.006
CONTAINS 16.852%

much he weighed . Tim my said , " Wow ,
to run and play with his dog , Rex . They
I am so sorry , Anna . I ruined your purse
soft , and the mom my loved it very much .
to eat it , so she took it to the table
TokenLet
Token ID5756
Feature activation+1.87e+01

Pos logit contributions
Max+0.600
ens+0.573
Mic+0.553
Snow+0.533
Brown+0.532
Neg logit contributions
 teaching-0.282
 showing-0.269
 giving-0.255
 diving-0.249
 example-0.249
Token's
Token ID338
Feature activation+8.54e+00

Pos logit contributions
 teaching+0.418
 showing+0.407
 show+0.401
 example+0.400
 joining+0.398
Neg logit contributions
Max-0.366
ens-0.335
Mic-0.327
Brown-0.307
Snow-0.306
Token pretend
Token ID16614
Feature activation+2.19e+01

Pos logit contributions
 teaching+0.121
 example+0.116
 diving+0.113
 showing+0.113
 joining+0.111
Neg logit contributions
Max-0.238
ens-0.223
Mic-0.212
Brown-0.206
Snow-0.204
Token they
Token ID484
Feature activation-5.92e+00

Pos logit contributions
 teaching+0.266
 showing+0.249
 example+0.248
 diving+0.247
 joining+0.240
Neg logit contributions
Max-0.656
ens-0.624
Mic-0.598
Brown-0.574
Snow-0.573
Token are
Token ID389
Feature activation+1.58e+01

Pos logit contributions
Max+0.160
ens+0.151
Mic+0.149
Snow+0.139
Spot+0.139
Neg logit contributions
 teaching-0.088
 showing-0.084
 show-0.083
 diving-0.083
 joining-0.082
Token going
Token ID1016
Feature activation+3.09e+01

Pos logit contributions
 teaching+0.215
 showing+0.203
 diving+0.202
 example+0.200
 joining+0.196
Neg logit contributions
Max-0.456
ens-0.435
Mic-0.416
Brown-0.397
Snow-0.396
Token to
Token ID284
Feature activation+1.57e+01

Pos logit contributions
 teaching+0.327
 example+0.307
 humor+0.306
 succeeding+0.301
 accomplishment+0.300
Neg logit contributions
Max-0.964
ens-0.910
Mic-0.853
Snow-0.846
Brown-0.844
Token a
Token ID257
Feature activation-3.54e-01

Pos logit contributions
 teaching+0.248
 example+0.237
 showing+0.235
 diving+0.234
 joining+0.229
Neg logit contributions
Max-0.416
ens-0.396
Mic-0.375
Brown-0.357
Snow-0.356
Token party
Token ID2151
Feature activation+5.59e+00

Pos logit contributions
Max+0.009
ens+0.009
Mic+0.008
Brown+0.008
ety+0.008
Neg logit contributions
 teaching-0.006
 diving-0.005
 example-0.005
 showing-0.005
 joining-0.005
Token,"
Token ID553
Feature activation+6.36e+00

Pos logit contributions
 teaching+0.081
 example+0.077
 showing+0.077
 diving+0.076
 joining+0.075
Neg logit contributions
Max-0.154
ens-0.146
Mic-0.138
Brown-0.133
Snow-0.133
Token Mia
Token ID32189
Feature activation-2.01e+01

Pos logit contributions
 teaching+0.128
 showing+0.125
 show+0.118
 giving+0.115
 example+0.113
Neg logit contributions
Max-0.136
lves-0.134
Snow-0.133
Mic-0.132
ens-0.131
Token and
Token ID290
Feature activation+9.96e+00

Pos logit contributions
 teaching+0.148
 example+0.145
 humor+0.142
 diving+0.140
 joining+0.138
Neg logit contributions
Max-0.222
ens-0.214
Mic-0.194
Snow-0.193
Brown-0.192
Token they
Token ID484
Feature activation-7.54e+00

Pos logit contributions
 teaching+0.168
 showing+0.163
 diving+0.160
 giving+0.158
 example+0.157
Neg logit contributions
Max-0.255
ens-0.235
Mic-0.231
Brown-0.218
Snow-0.217
Token said
Token ID531
Feature activation+1.17e+01

Pos logit contributions
Max+0.195
Mic+0.180
ens+0.172
Brown+0.168
Nut+0.166
Neg logit contributions
 show-0.127
 teaching-0.122
 showing-0.120
 diving-0.117
 join-0.116
Token goodbye
Token ID24829
Feature activation+1.65e+01

Pos logit contributions
 teaching+0.183
 example+0.176
 showing+0.173
 diving+0.173
 joining+0.169
Neg logit contributions
Max-0.305
ens-0.289
Mic-0.268
Snow-0.263
Brown-0.261
Token by
Token ID416
Feature activation+1.12e+01

Pos logit contributions
 teaching+0.195
 showing+0.184
 example+0.183
 diving+0.177
 giving+0.176
Neg logit contributions
Max-0.508
ens-0.482
Mic-0.473
issy-0.442
Brown-0.440
Token giving
Token ID3501
Feature activation+3.09e+01

Pos logit contributions
 teaching+0.183
 example+0.178
 diving+0.173
 succeeding+0.172
 humor+0.169
Neg logit contributions
Max-0.269
ens-0.253
Brown-0.233
Snow-0.232
Mic-0.230
Token each
Token ID1123
Feature activation-1.79e-02

Pos logit contributions
 teaching+0.373
 show+0.368
 example+0.364
 showing+0.349
 giving+0.325
Neg logit contributions
Mic-0.887
Max-0.871
issy-0.848
Brown-0.827
ens-0.812
Token other
Token ID584
Feature activation-1.52e+00

Pos logit contributions
Max+0.001
ens+0.000
Mic+0.000
Brown+0.000
ety+0.000
Neg logit contributions
 teaching-0.000
 example-0.000
 showing-0.000
 diving-0.000
 joining-0.000
Token a
Token ID257
Feature activation-2.16e+00

Pos logit contributions
Max+0.047
Mic+0.046
Bet+0.042
Brown+0.042
Nut+0.042
Neg logit contributions
 teaching-0.018
 example-0.017
 joining-0.016
 join-0.016
 diving-0.016
Token wink
Token ID41955
Feature activation+9.46e+00

Pos logit contributions
Max+0.058
ens+0.054
Mic+0.051
Snow+0.050
Brown+0.050
Neg logit contributions
 teaching-0.033
 example-0.032
 showing-0.032
 succeeding-0.031
 diving-0.031
Token and
Token ID290
Feature activation+4.34e+00

Pos logit contributions
 teaching+0.140
 example+0.133
 showing+0.133
 diving+0.132
 joining+0.129
Neg logit contributions
Max-0.259
ens-0.245
Mic-0.234
Brown-0.224
Snow-0.223
Token.
Token ID13
Feature activation+4.53e+00

Pos logit contributions
 teaching+0.158
 diving+0.155
 showing+0.151
 joining+0.146
 giving+0.145
Neg logit contributions
Max-0.352
ens-0.329
Mic-0.329
Brown-0.312
lves-0.310
Token Ben
Token ID3932
Feature activation-1.67e+01

Pos logit contributions
 teaching+0.075
 showing+0.072
 join+0.072
 winning+0.071
 giving+0.070
Neg logit contributions
ens-0.107
lves-0.105
Max-0.102
Mic-0.099
mi-0.098
Token felt
Token ID2936
Feature activation+1.66e+01

Pos logit contributions
Max+0.428
Mic+0.413
ens+0.391
Brown+0.388
Snow+0.378
Neg logit contributions
 teaching-0.271
 showing-0.263
 diving-0.257
 show-0.255
 giving-0.251
Token relaxed
Token ID18397
Feature activation+1.90e+01

Pos logit contributions
 example+0.281
 teaching+0.276
 showing+0.264
 humor+0.262
aps+0.257
Neg logit contributions
Max-0.395
ens-0.378
Snow-0.341
 Midnight-0.340
 Bun-0.339
Token and
Token ID290
Feature activation+5.88e+00

Pos logit contributions
 humor+0.370
 teaching+0.364
 example+0.362
 diving+0.346
 succeeding+0.342
Neg logit contributions
Max-0.420
ens-0.400
Snow-0.356
ball-0.354
Mic-0.348
Token energ
Token ID19647
Feature activation+3.06e+01

Pos logit contributions
 example+0.113
 teaching+0.108
 humor+0.104
 joining+0.100
 guessing+0.100
Neg logit contributions
Max-0.131
ens-0.125
Snow-0.114
 Bun-0.109
 Margaret-0.109
Tokenized
Token ID1143
Feature activation+8.95e+00

Pos logit contributions
 teaching+0.702
 example+0.683
 joining+0.673
 showing+0.663
 speech+0.662
Neg logit contributions
Max-0.578
ens-0.548
ety-0.492
Mic-0.484
Snow-0.484
Token after
Token ID706
Feature activation+1.60e+01

Pos logit contributions
 teaching+0.122
 example+0.117
 showing+0.116
 diving+0.115
 joining+0.113
Neg logit contributions
Max-0.256
ens-0.239
Mic-0.237
Brown-0.225
issy-0.223
Token doing
Token ID1804
Feature activation+2.20e+01

Pos logit contributions
 teaching+0.264
 example+0.257
 showing+0.247
 humor+0.244
 diving+0.240
Neg logit contributions
Max-0.394
ens-0.372
Mic-0.339
Snow-0.339
Spot-0.333
Token the
Token ID262
Feature activation-9.53e+00

Pos logit contributions
 teaching+0.346
 example+0.333
 showing+0.327
 diving+0.323
 joining+0.318
Neg logit contributions
Max-0.575
ens-0.543
Mic-0.508
Snow-0.492
Brown-0.490
Token moves
Token ID6100
Feature activation+1.55e+01

Pos logit contributions
Max+0.262
ens+0.247
Mic+0.234
Snow+0.226
Brown+0.226
Neg logit contributions
 teaching-0.137
 example-0.130
 showing-0.129
 diving-0.127
 joining-0.125
Token to
Token ID284
Feature activation+2.05e+01

Pos logit contributions
 example+0.068
 teaching+0.067
 diving+0.063
 joining+0.062
 succeeding+0.062
Neg logit contributions
Max-0.086
ens-0.085
Snow-0.074
Mic-0.074
Spot-0.074
Token know
Token ID760
Feature activation+2.12e+01

Pos logit contributions
 teaching+0.273
 showing+0.258
 example+0.256
 diving+0.255
 joining+0.249
Neg logit contributions
Max-0.590
ens-0.560
Mic-0.535
Brown-0.514
Snow-0.514
Token where
Token ID810
Feature activation+2.10e+01

Pos logit contributions
 teaching+0.294
 showing+0.276
 example+0.275
 diving+0.273
 joining+0.270
Neg logit contributions
Max-0.603
ens-0.571
Mic-0.548
Brown-0.523
Snow-0.521
Token he
Token ID339
Feature activation-9.15e+00

Pos logit contributions
 teaching+0.300
 example+0.285
 showing+0.283
 diving+0.281
 joining+0.275
Neg logit contributions
Max-0.586
ens-0.555
Mic-0.528
Brown-0.507
Snow-0.505
Token was
Token ID373
Feature activation+1.34e+01

Pos logit contributions
Max+0.201
ens+0.185
Mic+0.178
Brown+0.167
Snow+0.166
Neg logit contributions
 teaching-0.186
 showing-0.180
 example-0.179
 diving-0.178
 joining-0.175
Token going
Token ID1016
Feature activation+3.06e+01

Pos logit contributions
 example+0.192
 humor+0.188
 teaching+0.183
 accomplishment+0.178
 diving+0.173
Neg logit contributions
Max-0.345
ens-0.333
 Margaret-0.306
issy-0.300
Mic-0.300
Token.
Token ID13
Feature activation+3.06e+00

Pos logit contributions
 teaching+0.454
 example+0.434
 showing+0.427
 diving+0.425
 joining+0.418
Neg logit contributions
Max-0.830
ens-0.788
Mic-0.742
Brown-0.716
Snow-0.715
Token She
Token ID1375
Feature activation-1.12e+01

Pos logit contributions
 teaching+0.043
 example+0.041
 counting+0.040
 join+0.040
 winning+0.040
Neg logit contributions
ens-0.078
Max-0.075
ball-0.074
lves-0.074
Mic-0.073
Token put
Token ID1234
Feature activation+2.34e+01

Pos logit contributions
Max+0.276
ens+0.261
Mic+0.251
Brown+0.236
Spot+0.235
Neg logit contributions
 teaching-0.199
 showing-0.192
 example-0.191
 diving-0.189
 joining-0.184
Token on
Token ID319
Feature activation+1.72e+01

Pos logit contributions
 teaching+0.185
 showing+0.181
 diving+0.180
 show+0.168
 counting+0.161
Neg logit contributions
Max-0.771
Mic-0.756
ety-0.717
Brown-0.716
ens-0.713
Token her
Token ID607
Feature activation-1.14e+01

Pos logit contributions
 teaching+0.280
 showing+0.275
 example+0.266
 diving+0.264
 counting+0.263
Neg logit contributions
Max-0.452
ens-0.437
Mic-0.427
ety-0.415
issy-0.402
Token me
Token ID502
Feature activation-2.11e+00

Pos logit contributions
 teaching+0.462
 showing+0.448
 giving+0.438
 joining+0.433
 show+0.432
Neg logit contributions
Max-0.778
ens-0.728
Mic-0.712
lves-0.695
ety-0.692
Token,
Token ID11
Feature activation+1.06e+01

Pos logit contributions
Max+0.053
ens+0.051
Mic+0.048
Brown+0.046
ball+0.046
Neg logit contributions
 humor-0.032
 example-0.032
 teaching-0.031
 diving-0.030
 accomplishment-0.029
Token thank
Token ID5875
Feature activation+2.66e+01

Pos logit contributions
 teaching+0.181
 showing+0.176
 giving+0.173
 joining+0.169
 diving+0.168
Neg logit contributions
Max-0.269
ens-0.254
Mic-0.246
lves-0.234
uffy-0.234
Token you
Token ID345
Feature activation-4.56e+00

Pos logit contributions
 teaching+0.427
 showing+0.412
 show+0.399
 joining+0.398
 diving+0.398
Neg logit contributions
Max-0.693
issy-0.633
Mic-0.633
lves-0.627
ens-0.626
Token for
Token ID329
Feature activation+1.91e+01

Pos logit contributions
Max+0.121
ens+0.115
Mic+0.107
Snow+0.104
Brown+0.103
Neg logit contributions
 teaching-0.070
 example-0.067
 showing-0.067
 diving-0.066
 joining-0.065
Token giving
Token ID3501
Feature activation+3.03e+01

Pos logit contributions
aps+0.259
 contain+0.248
apping+0.244
oops+0.239
 phrase+0.236
Neg logit contributions
Max-0.508
Mic-0.468
 Midnight-0.455
 Mickey-0.453
 Tiny-0.452
Token me
Token ID502
Feature activation-5.26e+00

Pos logit contributions
 teaching+0.405
 show+0.404
 example+0.403
 humor+0.390
 showing+0.384
Neg logit contributions
Max-0.829
Mic-0.775
Brown-0.770
issy-0.761
K-0.751
Token such
Token ID884
Feature activation+9.85e+00

Pos logit contributions
Max+0.151
ens+0.140
Mic+0.138
Brown+0.132
Snow+0.129
Neg logit contributions
 teaching-0.073
 example-0.070
 diving-0.068
 showing-0.067
 joining-0.067
Token good
Token ID922
Feature activation+1.33e+01

Pos logit contributions
 teaching+0.166
 example+0.158
 diving+0.158
 showing+0.158
 joining+0.155
Neg logit contributions
Max-0.249
ens-0.233
Mic-0.222
Brown-0.212
Snow-0.212
Token advice
Token ID5608
Feature activation+1.50e+01

Pos logit contributions
 teaching+0.226
 diving+0.213
 joining+0.211
 showing+0.211
 succeeding+0.210
Neg logit contributions
ens-0.316
Max-0.307
ety-0.273
ley-0.272
Mic-0.270
Token!
Token ID0
Feature activation+1.54e+00

Pos logit contributions
 teaching+0.228
 example+0.217
 showing+0.215
 diving+0.214
 joining+0.210
Neg logit contributions
Max-0.404
ens-0.382
Mic-0.362
Brown-0.348
Snow-0.347
Token the
Token ID262
Feature activation-5.16e+00

Pos logit contributions
 humor+0.299
 giving+0.281
 teaching+0.280
 example+0.279
 succeeding+0.269
Neg logit contributions
Max-0.519
ens-0.468
Spot-0.444
 Blink-0.429
ball-0.426
Token rough
Token ID5210
Feature activation+1.07e+01

Pos logit contributions
Max+0.125
ens+0.117
Mic+0.110
Brown+0.106
issy+0.105
Neg logit contributions
 teaching-0.094
 example-0.092
 diving-0.092
 showing-0.091
 giving-0.087
Token path
Token ID3108
Feature activation+7.80e+00

Pos logit contributions
 teaching+0.185
 example+0.180
 showing+0.179
 diving+0.178
 giving+0.173
Neg logit contributions
Max-0.268
ens-0.248
Mic-0.242
Brown-0.229
Snow-0.226
Token but
Token ID475
Feature activation+9.49e+00

Pos logit contributions
 teaching+0.121
 example+0.115
 showing+0.114
 diving+0.114
 joining+0.112
Neg logit contributions
Max-0.207
ens-0.196
Mic-0.185
Brown-0.178
Snow-0.177
Token kept
Token ID4030
Feature activation+2.67e+01

Pos logit contributions
 diving+0.162
 showing+0.161
 teaching+0.161
 giving+0.158
 joining+0.151
Neg logit contributions
Max-0.236
Mic-0.232
ens-0.217
Brown-0.210
Snow-0.208
Token going
Token ID1016
Feature activation+3.02e+01

Pos logit contributions
 example+0.338
 teaching+0.335
 humor+0.324
 liking+0.308
aps+0.304
Neg logit contributions
Max-0.747
ens-0.706
Mic-0.649
 Margaret-0.645
Snow-0.643
Token,
Token ID11
Feature activation+8.46e+00

Pos logit contributions
 teaching+0.448
 diving+0.429
 example+0.428
 showing+0.426
 joining+0.413
Neg logit contributions
Max-0.829
ens-0.782
Mic-0.756
Brown-0.719
Snow-0.712
Token just
Token ID655
Feature activation+8.75e+00

Pos logit contributions
 teaching+0.130
 showing+0.127
 diving+0.125
 giving+0.124
 joining+0.122
Neg logit contributions
Max-0.231
ens-0.219
Mic-0.215
issy-0.205
Brown-0.202
Token like
Token ID588
Feature activation+1.98e+01

Pos logit contributions
 teaching+0.138
 example+0.134
 humor+0.128
 showing+0.126
 diving+0.126
Neg logit contributions
Max-0.219
ens-0.215
ety-0.193
Mic-0.189
Snow-0.188
Token Lucy
Token ID22162
Feature activation-2.12e+01

Pos logit contributions
 teaching+0.303
 showing+0.289
 diving+0.286
 example+0.286
 joining+0.281
Neg logit contributions
Max-0.536
ens-0.507
Mic-0.487
Brown-0.463
Snow-0.461
Token.
Token ID13
Feature activation+4.99e-02

Pos logit contributions
Max+0.596
Mic+0.575
ens+0.530
Brown+0.529
Snow+0.523
Neg logit contributions
 teaching-0.316
 showing-0.301
 giving-0.299
 joining-0.291
 diving-0.288
Token write
Token ID3551
Feature activation+2.31e+01

Pos logit contributions
 teaching+0.273
 showing+0.258
 example+0.256
 diving+0.255
 joining+0.249
Neg logit contributions
Max-0.590
ens-0.560
Mic-0.535
Brown-0.514
Snow-0.514
Token her
Token ID607
Feature activation-6.62e+00

Pos logit contributions
 teaching+0.324
 showing+0.306
 example+0.294
 diving+0.289
 giving+0.287
Neg logit contributions
Max-0.656
ens-0.626
Mic-0.615
ety-0.589
lves-0.583
Token story
Token ID1621
Feature activation+1.04e+01

Pos logit contributions
Max+0.170
ens+0.161
Mic+0.150
Brown+0.146
Snow+0.145
Neg logit contributions
 teaching-0.103
 diving-0.100
 example-0.100
 joining-0.099
 showing-0.097
Token and
Token ID290
Feature activation+8.49e+00

Pos logit contributions
 teaching+0.140
 diving+0.138
 example+0.136
 showing+0.131
 joining+0.131
Neg logit contributions
Max-0.294
ens-0.278
Mic-0.259
Brown-0.254
Snow-0.253
Token kept
Token ID4030
Feature activation+2.68e+01

Pos logit contributions
 teaching+0.143
 diving+0.142
 example+0.142
 joining+0.138
 showing+0.134
Neg logit contributions
Max-0.207
ens-0.195
Mic-0.180
Brown-0.175
 Margaret-0.174
Token going
Token ID1016
Feature activation+3.02e+01

Pos logit contributions
 contain+0.376
 example+0.337
 humor+0.335
 diving+0.335
ulp+0.319
Neg logit contributions
Max-0.686
 Margaret-0.652
 Tess-0.615
 Tiny-0.607
ens-0.606
Token.
Token ID13
Feature activation+2.72e+00

Pos logit contributions
 humor+0.482
 teaching+0.442
 example+0.441
 diving+0.441
 joining+0.437
Neg logit contributions
Max-0.745
ens-0.707
 Margaret-0.677
 Tess-0.658
 Bun-0.649
Token This
Token ID770
Feature activation+1.13e+00

Pos logit contributions
 teaching+0.039
 about+0.038
 example+0.037
 join+0.036
 winning+0.036
Neg logit contributions
Max-0.067
ens-0.067
ball-0.066
lves-0.066
ley-0.065
Token time
Token ID640
Feature activation+8.24e+00

Pos logit contributions
 teaching+0.017
 example+0.016
 diving+0.016
 showing+0.016
 joining+0.016
Neg logit contributions
Max-0.031
ens-0.029
Mic-0.027
Brown-0.026
Snow-0.026
Token she
Token ID673
Feature activation-1.49e+01

Pos logit contributions
 teaching+0.101
 giving+0.099
 showing+0.097
 counting+0.096
 diving+0.094
Neg logit contributions
Mic-0.250
Max-0.246
Brown-0.227
Nut-0.226
ens-0.226
Token wrote
Token ID2630
Feature activation+1.60e+01

Pos logit contributions
Max+0.316
ens+0.289
Mic+0.267
Brown+0.258
Snow+0.257
Neg logit contributions
 teaching-0.307
 example-0.300
 diving-0.298
 joining-0.293
 showing-0.292
Token,
Token ID11
Feature activation+9.56e+00

Pos logit contributions
Max+0.419
ens+0.388
Mic+0.382
Brown+0.364
Snow+0.361
Neg logit contributions
 teaching-0.265
 showing-0.253
 example-0.249
 diving-0.245
 joining-0.244
Token pushing
Token ID7796
Feature activation+2.72e+01

Pos logit contributions
 teaching+0.164
 showing+0.158
 example+0.155
 diving+0.154
 joining+0.153
Neg logit contributions
Max-0.240
ens-0.225
Mic-0.217
Brown-0.207
Snow-0.206
Token her
Token ID607
Feature activation-8.58e+00

Pos logit contributions
 example+0.454
 teaching+0.420
 show+0.418
 showing+0.417
 skills+0.392
Neg logit contributions
Brown-0.652
Mic-0.641
Max-0.634
Snow-0.634
ety-0.625
Token away
Token ID1497
Feature activation+9.59e+00

Pos logit contributions
Max+0.231
ens+0.218
Mic+0.208
Snow+0.201
Brown+0.200
Neg logit contributions
 teaching-0.131
 showing-0.126
 example-0.125
 diving-0.124
 joining-0.120
Token and
Token ID290
Feature activation+6.27e+00

Pos logit contributions
 teaching+0.143
 example+0.136
 diving+0.134
 showing+0.132
 joining+0.130
Neg logit contributions
Max-0.258
ens-0.245
Mic-0.227
Brown-0.220
Snow-0.219
Token giving
Token ID3501
Feature activation+3.01e+01

Pos logit contributions
 teaching+0.098
 example+0.096
 diving+0.093
 joining+0.091
 showing+0.090
Neg logit contributions
Max-0.164
ens-0.155
Mic-0.142
Snow-0.137
Spot-0.137
Token Lily
Token ID20037
Feature activation-1.98e+01

Pos logit contributions
 teaching+0.512
 example+0.500
 showing+0.490
 show+0.489
 humor+0.479
Neg logit contributions
Max-0.727
Mic-0.700
Brown-0.683
Snow-0.654
ens-0.650
Token all
Token ID477
Feature activation+5.86e+00

Pos logit contributions
Max+0.613
Mic+0.593
Brown+0.584
Snow+0.556
issy+0.547
Neg logit contributions
 teaching-0.224
 join-0.208
 example-0.205
 about-0.203
 joining-0.200
Token the
Token ID262
Feature activation-8.77e+00

Pos logit contributions
 teaching+0.100
 example+0.094
 showing+0.094
 joining+0.093
 diving+0.092
Neg logit contributions
Max-0.148
ens-0.138
Mic-0.135
Brown-0.128
Snow-0.128
Token sunshine
Token ID34488
Feature activation-1.81e+00

Pos logit contributions
Max+0.230
ens+0.217
Mic+0.206
Brown+0.197
Snow+0.197
Neg logit contributions
 teaching-0.139
 example-0.133
 showing-0.132
 diving-0.131
 joining-0.129
Token.
Token ID13
Feature activation-5.80e-01

Pos logit contributions
Max+0.053
Mic+0.050
ens+0.049
Brown+0.047
Snow+0.046
Neg logit contributions
 teaching-0.024
 showing-0.023
 diving-0.022
 giving-0.022
 joining-0.022
Token around
Token ID1088
Feature activation+1.55e+01

Pos logit contributions
 teaching+0.225
 example+0.210
 showing+0.208
 diving+0.206
 joining+0.200
Neg logit contributions
Max-0.644
ens-0.614
Mic-0.588
Brown-0.568
Snow-0.567
Token Fre
Token ID4848
Feature activation-3.98e-01

Pos logit contributions
 teaching+0.248
 example+0.239
 diving+0.235
 showing+0.235
 joining+0.230
Neg logit contributions
Max-0.405
ens-0.381
Mic-0.359
Brown-0.343
Spot-0.342
Tokenck
Token ID694
Feature activation+1.29e+01

Pos logit contributions
Max+0.008
ens+0.007
Mic+0.007
Brown+0.006
Snow+0.006
Neg logit contributions
 teaching-0.009
 example-0.009
 diving-0.009
 showing-0.009
 joining-0.009
Tokenles
Token ID829
Feature activation+1.09e+00

Pos logit contributions
 teaching+0.192
 diving+0.191
 showing+0.184
 example+0.184
 joining+0.183
Neg logit contributions
Max-0.342
Mic-0.331
Brown-0.315
ens-0.311
Snow-0.306
Token,
Token ID11
Feature activation+7.66e+00

Pos logit contributions
 teaching+0.016
 showing+0.015
 example+0.015
 diving+0.015
 joining+0.015
Neg logit contributions
Max-0.030
ens-0.028
Mic-0.027
Brown-0.026
Snow-0.026
Token giving
Token ID3501
Feature activation+3.00e+01

Pos logit contributions
 teaching+0.102
 showing+0.097
 diving+0.095
 example+0.094
 joining+0.093
Neg logit contributions
Max-0.222
ens-0.212
Mic-0.202
Brown-0.194
Snow-0.193
Token her
Token ID607
Feature activation-1.06e+01

Pos logit contributions
 diving+0.526
 teaching+0.519
 example+0.519
 showing+0.514
 joining+0.509
Neg logit contributions
Max-0.740
Mic-0.675
ens-0.660
Brown-0.654
issy-0.649
Token hugs
Token ID40657
Feature activation+1.86e+01

Pos logit contributions
Max+0.283
ens+0.262
Mic+0.257
Brown+0.244
Snow+0.243
Neg logit contributions
 teaching-0.170
 showing-0.163
 diving-0.162
 example-0.162
 giving-0.157
Token and
Token ID290
Feature activation+5.15e+00

Pos logit contributions
 teaching+0.241
 showing+0.231
 giving+0.228
 joining+0.223
 example+0.213
Neg logit contributions
Max-0.541
Mic-0.513
ens-0.499
lves-0.490
Brown-0.479
Token pet
Token ID4273
Feature activation+1.02e+01

Pos logit contributions
 teaching+0.083
 example+0.079
 showing+0.079
 diving+0.078
 joining+0.077
Neg logit contributions
Max-0.134
ens-0.127
Mic-0.120
Brown-0.115
Snow-0.115
Tokenting
Token ID889
Feature activation+9.71e+00

Pos logit contributions
 joining+0.169
 showing+0.168
 teaching+0.167
 diving+0.166
 show+0.165
Neg logit contributions
Max-0.252
Mic-0.245
Brown-0.227
issy-0.220
uffy-0.212
Token bike
Token ID7161
Feature activation-1.72e+00

Pos logit contributions
 humor+0.154
 example+0.140
 apology+0.135
oken+0.133
ags+0.131
Neg logit contributions
Max-0.151
ens-0.145
 Bun-0.141
 Margaret-0.137
 Snow-0.135
Token around
Token ID1088
Feature activation+1.66e+01

Pos logit contributions
Max+0.047
ens+0.044
Mic+0.042
Brown+0.040
Spot+0.040
Neg logit contributions
 teaching-0.025
 example-0.024
 diving-0.024
 showing-0.024
 joining-0.024
Token all
Token ID477
Feature activation+9.00e+00

Pos logit contributions
 teaching+0.253
 showing+0.243
 diving+0.237
 example+0.236
 joining+0.231
Neg logit contributions
Max-0.450
ens-0.430
Mic-0.413
Snow-0.395
Brown-0.394
Token day
Token ID1110
Feature activation+8.11e+00

Pos logit contributions
 teaching+0.130
 example+0.124
 showing+0.122
 diving+0.122
 joining+0.119
Neg logit contributions
Max-0.249
ens-0.236
Mic-0.223
Brown-0.215
Snow-0.214
Token,
Token ID11
Feature activation+8.92e+00

Pos logit contributions
 teaching+0.130
 example+0.129
 humor+0.124
 showing+0.122
 diving+0.122
Neg logit contributions
Max-0.207
ens-0.193
Mic-0.181
Snow-0.174
Brown-0.174
Token going
Token ID1016
Feature activation+3.00e+01

Pos logit contributions
 showing+0.134
 teaching+0.132
 giving+0.127
 joining+0.126
 diving+0.125
Neg logit contributions
Max-0.244
ens-0.238
Mic-0.238
issy-0.231
uffy-0.221
Token up
Token ID510
Feature activation+1.18e+01

Pos logit contributions
 humor+0.411
 teaching+0.408
 example+0.393
 showing+0.386
 liking+0.382
Neg logit contributions
Max-0.835
ens-0.786
Spot-0.718
 Bun-0.707
Mic-0.707
Token and
Token ID290
Feature activation+6.27e+00

Pos logit contributions
 teaching+0.176
 example+0.167
 showing+0.165
 diving+0.163
 joining+0.161
Neg logit contributions
Max-0.319
ens-0.302
Mic-0.284
Brown-0.272
Snow-0.272
Token down
Token ID866
Feature activation+9.60e+00

Pos logit contributions
 teaching+0.089
 example+0.084
 showing+0.084
 diving+0.083
 joining+0.081
Neg logit contributions
Max-0.176
ens-0.167
Mic-0.158
Brown-0.152
Snow-0.152
Token her
Token ID607
Feature activation-1.18e+01

Pos logit contributions
 teaching+0.147
 example+0.141
 showing+0.139
 diving+0.138
 joining+0.136
Neg logit contributions
Max-0.255
ens-0.240
Mic-0.224
Spot-0.217
Snow-0.216
Token street
Token ID4675
Feature activation+4.54e+00

Pos logit contributions
Max+0.253
ens+0.242
Mic+0.229
issy+0.216
Bet+0.213
Neg logit contributions
 teaching-0.244
 showing-0.241
 diving-0.239
 show-0.232
 example-0.231
Tokenmy
Token ID1820
Feature activation-9.94e+00

Pos logit contributions
Max+0.340
Mic+0.316
Snow+0.303
Brown+0.299
lves+0.289
Neg logit contributions
 teaching-0.389
 showing-0.376
 show-0.370
 giving-0.369
 diving-0.362
Token pretended
Token ID36766
Feature activation+2.20e+01

Pos logit contributions
Max+0.244
ens+0.224
Mic+0.224
Snow+0.213
Brown+0.211
Neg logit contributions
 teaching-0.177
 showing-0.169
 giving-0.164
 diving-0.162
 joining-0.161
Token his
Token ID465
Feature activation-7.97e+00

Pos logit contributions
 teaching+0.194
 showing+0.182
 diving+0.177
 joining+0.168
 giving+0.168
Neg logit contributions
Max-0.743
ens-0.707
Mic-0.700
Brown-0.663
Snow-0.660
Token car
Token ID1097
Feature activation-4.29e+00

Pos logit contributions
Max+0.178
ens+0.166
Mic+0.157
Brown+0.148
Snow+0.147
Neg logit contributions
 teaching-0.159
 showing-0.154
 diving-0.153
 example-0.153
 joining-0.149
Token was
Token ID373
Feature activation+1.27e+01

Pos logit contributions
Max+0.094
ens+0.088
Mic+0.083
Brown+0.078
Snow+0.078
Neg logit contributions
 teaching-0.087
 showing-0.083
 example-0.083
 diving-0.082
 joining-0.081
Token going
Token ID1016
Feature activation+2.99e+01

Pos logit contributions
 teaching+0.188
 showing+0.178
 example+0.178
 diving+0.177
 joining+0.173
Neg logit contributions
Max-0.348
ens-0.331
Mic-0.314
Brown-0.301
Snow-0.299
Token really
Token ID1107
Feature activation+1.40e+01

Pos logit contributions
 teaching+0.268
 example+0.246
 showing+0.244
 diving+0.242
 joining+0.232
Neg logit contributions
Max-0.994
ens-0.951
Mic-0.914
Brown-0.883
Snow-0.881
Token fast
Token ID3049
Feature activation+7.48e+00

Pos logit contributions
 teaching+0.204
 showing+0.191
 example+0.191
 diving+0.187
 joining+0.186
Neg logit contributions
Max-0.389
ens-0.367
Mic-0.347
Snow-0.337
Brown-0.335
Token and
Token ID290
Feature activation+5.67e+00

Pos logit contributions
 teaching+0.122
 example+0.116
 showing+0.115
 diving+0.114
 joining+0.113
Neg logit contributions
Max-0.193
ens-0.183
Mic-0.170
Brown-0.164
Snow-0.164
Token he
Token ID339
Feature activation-1.27e+01

Pos logit contributions
 teaching+0.118
 showing+0.114
 diving+0.113
 example+0.112
 joining+0.111
Neg logit contributions
Max-0.121
ens-0.114
Mic-0.108
Brown-0.100
Snow-0.100
Token said
Token ID531
Feature activation+9.04e+00

Pos logit contributions
Max+0.293
ens+0.279
Mic+0.273
Brown+0.249
Bet+0.248
Neg logit contributions
 teaching-0.238
 showing-0.233
 diving-0.230
 example-0.225
 joining-0.224
Token They
Token ID1119
Feature activation-4.11e+00

Pos logit contributions
 contain+0.195
aps+0.195
ulp+0.194
 phrase+0.180
 speech+0.171
Neg logit contributions
Max-0.468
Mic-0.411
 Bun-0.399
 Tiny-0.396
Billy-0.396
Token had
Token ID550
Feature activation+1.50e+01

Pos logit contributions
Max+0.104
ens+0.097
Mic+0.095
Brown+0.090
Snow+0.089
Neg logit contributions
 teaching-0.068
 showing-0.066
 diving-0.065
 example-0.065
 show-0.064
Token been
Token ID587
Feature activation+1.51e+01

Pos logit contributions
 teaching+0.252
 diving+0.249
 example+0.248
 showing+0.243
 succeeding+0.242
Neg logit contributions
Max-0.367
ens-0.336
Mic-0.314
Spot-0.310
ety-0.306
Token trying
Token ID2111
Feature activation+2.75e+01

Pos logit contributions
 accomplishment+0.200
 teaching+0.195
 diving+0.195
 example+0.193
 humor+0.191
Neg logit contributions
Max-0.402
ens-0.370
ety-0.354
Mic-0.354
Spot-0.351
Token to
Token ID284
Feature activation+1.65e+01

Pos logit contributions
 example+0.330
 teaching+0.329
 humor+0.329
 showing+0.311
aps+0.308
Neg logit contributions
Max-0.761
ens-0.723
Mic-0.652
ball-0.652
ety-0.645
Token wake
Token ID7765
Feature activation+2.98e+01

Pos logit contributions
 teaching+0.210
 example+0.203
 diving+0.195
 showing+0.191
 joining+0.186
Neg logit contributions
Max-0.478
ens-0.445
Mic-0.425
Spot-0.413
Snow-0.409
Token her
Token ID607
Feature activation-9.07e+00

Pos logit contributions
 teaching+0.255
 showing+0.246
 diving+0.241
 example+0.239
 show+0.232
Neg logit contributions
Max-0.985
Mic-0.969
ens-0.949
Brown-0.924
issy-0.917
Token up
Token ID510
Feature activation+1.08e+01

Pos logit contributions
Max+0.265
ens+0.252
Mic+0.239
Brown+0.231
Snow+0.230
Neg logit contributions
 teaching-0.117
 example-0.111
 showing-0.109
 diving-0.108
 joining-0.106
Token but
Token ID475
Feature activation+8.08e+00

Pos logit contributions
 teaching+0.179
 humor+0.172
 example+0.171
 showing+0.167
 diving+0.167
Neg logit contributions
Max-0.265
ens-0.251
Mic-0.227
ball-0.225
Snow-0.224
Token Molly
Token ID30236
Feature activation-2.04e+01

Pos logit contributions
 teaching+0.157
 showing+0.153
 giving+0.149
 diving+0.147
 example+0.146
Neg logit contributions
Max-0.186
Mic-0.175
ens-0.173
Brown-0.163
Snow-0.161
Token had
Token ID550
Feature activation+1.07e+01

Pos logit contributions
Max+0.505
Mic+0.495
ens+0.459
Brown+0.458
Snow+0.445
Neg logit contributions
 teaching-0.356
 showing-0.349
 giving-0.337
 diving-0.326
 joining-0.324
Token special
Token ID2041
Feature activation+7.54e+00

Pos logit contributions
Max+0.072
ens+0.062
Mic+0.061
Brown+0.055
Snow+0.053
Neg logit contributions
 teaching-0.127
 example-0.125
 diving-0.123
 showing-0.123
 giving-0.122
Token polish
Token ID25245
Feature activation+1.79e+01

Pos logit contributions
 teaching+0.123
 example+0.116
 diving+0.116
 joining+0.115
 succeeding+0.115
Neg logit contributions
Max-0.186
ens-0.179
Mic-0.165
Brown-0.160
Snow-0.159
Token he
Token ID339
Feature activation-8.81e+00

Pos logit contributions
 teaching+0.240
 showing+0.235
 diving+0.227
 giving+0.227
 example+0.223
Neg logit contributions
Max-0.514
Mic-0.480
ens-0.465
Bet-0.446
Brown-0.445
Token couldn
Token ID3521
Feature activation+2.14e+01

Pos logit contributions
Max+0.180
ens+0.167
Mic+0.155
Brown+0.147
Snow+0.146
Neg logit contributions
 teaching-0.190
 example-0.184
 diving-0.182
 showing-0.182
 joining-0.180
Token't
Token ID470
Feature activation+1.18e+01

Pos logit contributions
 teaching+0.316
 showing+0.299
 diving+0.295
 example+0.294
 giving+0.291
Neg logit contributions
Max-0.583
ens-0.554
Mic-0.535
ety-0.505
Spot-0.505
Token contain
Token ID3994
Feature activation+2.98e+01

Pos logit contributions
 teaching+0.173
 diving+0.166
 example+0.162
 joining+0.160
 humor+0.160
Neg logit contributions
Max-0.313
ens-0.298
issy-0.280
Brown-0.275
Snow-0.274
Token his
Token ID465
Feature activation-9.54e+00

Pos logit contributions
 teaching+0.433
 example+0.412
 showing+0.406
 diving+0.406
 joining+0.398
Neg logit contributions
Max-0.823
ens-0.778
Mic-0.739
Brown-0.709
Snow-0.708
Token happiness
Token ID12157
Feature activation+8.68e+00

Pos logit contributions
Max+0.274
ens+0.259
Mic+0.247
Brown+0.238
Snow+0.237
Neg logit contributions
 teaching-0.128
 example-0.120
 showing-0.120
 diving-0.119
 joining-0.117
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
 example+0.154
 teaching+0.151
 humor+0.148
 diving+0.146
 succeeding+0.141
Neg logit contributions
Max-0.200
ens-0.189
Mic-0.172
ball-0.172
Brown-0.172
Token He
Token ID679
Feature activation-1.28e+01

Pos logit contributions
 teaching+0.002
 example+0.002
 winning+0.002
 showing+0.002
 giving+0.002
Neg logit contributions
ens-0.003
Max-0.003
ball-0.003
lves-0.003
Snow-0.003
Token began
Token ID2540
Feature activation+1.84e+01

Pos logit contributions
Max+0.285
ens+0.264
Mic+0.245
Brown+0.236
Snow+0.234
Neg logit contributions
 teaching-0.251
 example-0.241
 diving-0.240
 showing-0.237
 joining-0.237
Token<|endoftext|>
Token ID50256
Feature activation+1.15e+01

Pos logit contributions
 teaching+0.050
 example+0.048
 diving+0.048
 showing+0.047
 joining+0.046
Neg logit contributions
Max-0.099
ens-0.094
Mic-0.088
Snow-0.086
Spot-0.085
TokenTom
Token ID13787
Feature activation-9.17e+00

Pos logit contributions
 teaching+0.230
 diving+0.222
 showing+0.218
 joining+0.217
aps+0.213
Neg logit contributions
Max-0.240
ens-0.239
Mic-0.223
lves-0.214
Brown-0.212
Token and
Token ID290
Feature activation+9.58e+00

Pos logit contributions
Max+0.201
Mic+0.191
Snow+0.179
lves+0.178
Brown+0.178
Neg logit contributions
 teaching-0.181
 showing-0.173
 giving-0.171
 diving-0.165
 joining-0.165
Token Lily
Token ID20037
Feature activation-1.65e+01

Pos logit contributions
 join+0.152
 show+0.148
 teaching+0.145
 showing+0.145
 giving+0.144
Neg logit contributions
Max-0.239
lves-0.235
issy-0.230
Mic-0.227
zan-0.224
Token were
Token ID547
Feature activation+1.34e+01

Pos logit contributions
Mic+0.451
Max+0.440
Nut+0.420
Brown+0.415
 Blink+0.400
Neg logit contributions
 show-0.234
 cl-0.225
 join-0.225
 teaching-0.221
 showing-0.216
Token going
Token ID1016
Feature activation+2.98e+01

Pos logit contributions
 teaching+0.153
 example+0.146
 showing+0.144
 diving+0.142
 joining+0.139
Neg logit contributions
Max-0.406
ens-0.388
Mic-0.369
Brown-0.354
Spot-0.353
Token to
Token ID284
Feature activation+1.48e+01

Pos logit contributions
 humor+0.547
ulp+0.522
 accomplishment+0.498
 succeeding+0.467
 liking+0.458
Neg logit contributions
Max-0.686
ens-0.648
ety-0.620
gy-0.581
uffy-0.580
Token visit
Token ID3187
Feature activation+2.08e+01

Pos logit contributions
 teaching+0.232
 example+0.228
 diving+0.219
 showing+0.218
 humor+0.216
Neg logit contributions
Max-0.384
ens-0.357
Mic-0.338
Brown-0.329
Snow-0.326
Token their
Token ID511
Feature activation-7.84e+00

Pos logit contributions
 teaching+0.343
 showing+0.323
 example+0.319
 diving+0.318
 joining+0.312
Neg logit contributions
Max-0.547
ens-0.519
Mic-0.502
Brown-0.477
Snow-0.475
Token grandma
Token ID49890
Feature activation-1.39e+01

Pos logit contributions
Max+0.171
ens+0.164
Mic+0.153
Snow+0.143
issy+0.143
Neg logit contributions
 teaching-0.163
 example-0.158
 showing-0.157
 diving-0.156
 show-0.151
Token for
Token ID329
Feature activation+1.73e+01

Pos logit contributions
Max+0.305
ens+0.288
Mic+0.265
ety+0.254
Brown+0.252
Neg logit contributions
 teaching-0.269
 example-0.268
 diving-0.263
 humor-0.262
 showing-0.258
Token very
Token ID845
Feature activation+1.76e+01

Pos logit contributions
Max+0.564
Mic+0.547
Nut+0.496
Snow+0.491
Brown+0.489
Neg logit contributions
 show-0.329
 showing-0.314
 teaching-0.313
 giving-0.303
 mixing-0.296
Token excited
Token ID6568
Feature activation+2.20e+01

Pos logit contributions
 teaching+0.291
 joining+0.271
 example+0.271
 showing+0.262
 succeeding+0.256
Neg logit contributions
Max-0.425
ens-0.409
Mic-0.376
Spot-0.376
Snow-0.375
Token because
Token ID780
Feature activation+9.08e+00

Pos logit contributions
 teaching+0.322
 example+0.306
 showing+0.303
 diving+0.302
 joining+0.295
Neg logit contributions
Max-0.606
ens-0.574
Mic-0.544
Brown-0.523
Snow-0.522
Token he
Token ID339
Feature activation-1.06e+01

Pos logit contributions
 teaching+0.177
 showing+0.170
 diving+0.170
 example+0.169
 joining+0.166
Neg logit contributions
Max-0.208
ens-0.195
Mic-0.186
Brown-0.174
Snow-0.173
Token was
Token ID373
Feature activation+1.21e+01

Pos logit contributions
Max+0.232
ens+0.222
Mic+0.210
Brown+0.196
ety+0.195
Neg logit contributions
 teaching-0.214
 showing-0.210
 diving-0.208
 example-0.204
 joining-0.201
Token going
Token ID1016
Feature activation+2.97e+01

Pos logit contributions
 example+0.158
 humor+0.153
aps+0.149
 apology+0.147
ulp+0.144
Neg logit contributions
Max-0.318
ens-0.293
 Margaret-0.292
 Rusty-0.280
 Tess-0.280
Token on
Token ID319
Feature activation+1.81e+01

Pos logit contributions
 humor+0.400
ulp+0.381
 accomplishment+0.377
 example+0.362
 teaching+0.352
Neg logit contributions
Max-0.815
ens-0.761
 Margaret-0.710
Snow-0.704
Spot-0.696
Token a
Token ID257
Feature activation-2.01e+00

Pos logit contributions
 teaching+0.301
 example+0.289
 showing+0.286
 diving+0.283
 joining+0.280
Neg logit contributions
Max-0.459
ens-0.428
Mic-0.405
Snow-0.389
Brown-0.389
Token special
Token ID2041
Feature activation+5.72e+00

Pos logit contributions
Max+0.055
ens+0.055
Mic+0.049
ety+0.048
issy+0.048
Neg logit contributions
 diving-0.030
 teaching-0.029
 guessing-0.029
 show-0.028
 showing-0.028
Token saf
Token ID1932
Feature activation+2.31e+00

Pos logit contributions
 teaching+0.105
 succeeding+0.100
 humor+0.099
 joining+0.098
aps+0.098
Neg logit contributions
Max-0.131
ens-0.120
Mic-0.113
Snow-0.112
Brown-0.110
Tokenari
Token ID2743
Feature activation-4.19e+00

Pos logit contributions
 teaching+0.055
 diving+0.054
 example+0.054
 showing+0.053
 joining+0.052
Neg logit contributions
Max-0.042
ens-0.040
Mic-0.036
Brown-0.034
Snow-0.033
Token!"
Token ID2474
Feature activation+9.85e+00

Pos logit contributions
 example+0.438
 teaching+0.434
 diving+0.426
 accomplishment+0.420
 humor+0.413
Neg logit contributions
Max-0.641
ens-0.592
ball-0.538
ety-0.531
Mic-0.521
Token But
Token ID887
Feature activation+9.41e+00

Pos logit contributions
 teaching+0.179
 showing+0.175
 show+0.171
 humor+0.165
 giving+0.164
Neg logit contributions
Max-0.215
Mic-0.213
lves-0.211
ens-0.207
ety-0.206
Token the
Token ID262
Feature activation-5.01e+00

Pos logit contributions
 teaching+0.175
 showing+0.174
 giving+0.167
 counting+0.162
 diving+0.162
Neg logit contributions
Mic-0.220
Max-0.211
lves-0.205
Snow-0.203
issy-0.201
Token children
Token ID1751
Feature activation-5.63e+00

Pos logit contributions
Max+0.107
ens+0.100
Mic+0.094
Brown+0.089
Snow+0.088
Neg logit contributions
 teaching-0.105
 example-0.102
 diving-0.102
 showing-0.101
 giving-0.098
Token kept
Token ID4030
Feature activation+2.65e+01

Pos logit contributions
Max+0.136
ens+0.127
Mic+0.123
Brown+0.117
Snow+0.117
Neg logit contributions
 teaching-0.101
 showing-0.097
 diving-0.096
 example-0.096
 joining-0.094
Token going
Token ID1016
Feature activation+2.96e+01

Pos logit contributions
 example+0.357
ulp+0.345
aps+0.343
 humor+0.338
 contain+0.334
Neg logit contributions
Max-0.715
ens-0.661
Mic-0.608
 Kathy-0.605
 Tess-0.601
Token.
Token ID13
Feature activation+2.69e+00

Pos logit contributions
 teaching+0.478
 example+0.458
 showing+0.452
 diving+0.451
 joining+0.446
Neg logit contributions
Max-0.767
ens-0.728
Mic-0.674
Spot-0.649
Brown-0.649
Token The
Token ID383
Feature activation-7.44e+00

Pos logit contributions
 teaching+0.043
 example+0.040
 winning+0.040
 showing+0.039
 mixing+0.038
Neg logit contributions
ens-0.064
Brown-0.063
Mic-0.063
Max-0.063
ley-0.063
Token sw
Token ID1509
Feature activation+9.40e+00

Pos logit contributions
Max+0.171
ens+0.159
Mic+0.153
Brown+0.146
Snow+0.145
Neg logit contributions
 teaching-0.145
 example-0.140
 showing-0.139
 diving-0.138
 joining-0.134
Tokenan
Token ID272
Feature activation-1.11e+01

Pos logit contributions
 teaching+0.222
 joining+0.212
 giving+0.212
 showing+0.212
 succeeding+0.211
Neg logit contributions
Max-0.169
ens-0.159
Mic-0.142
ety-0.140
uffy-0.136
Token grew
Token ID6348
Feature activation+2.36e+01

Pos logit contributions
Max+0.268
Mic+0.252
ens+0.251
Brown+0.234
Snow+0.233
Neg logit contributions
 teaching-0.195
 showing-0.192
 diving-0.185
 joining-0.182
 giving-0.182
Token
Token ID198
Feature activation+4.69e-01

Pos logit contributions
Max+0.009
Mic+0.009
lves+0.009
ley+0.009
ens+0.009
Neg logit contributions
 teaching-0.006
 about-0.006
 giving-0.006
 serving-0.006
 joining-0.006
Token
Token ID198
Feature activation+8.96e-01

Pos logit contributions
 teaching+0.008
 example+0.008
aps+0.008
 showing+0.008
 diving+0.008
Neg logit contributions
ens-0.012
ley-0.010
lves-0.010
issy-0.010
mi-0.010
Token"
Token ID1
Feature activation+7.50e-01

Pos logit contributions
 teaching+0.017
 example+0.017
 joining+0.016
 showing+0.016
aps+0.016
Neg logit contributions
ens-0.021
ley-0.018
uffy-0.017
lves-0.017
issy-0.017
TokenWe
Token ID1135
Feature activation+3.37e+00

Pos logit contributions
 teaching+0.016
 example+0.015
 showing+0.015
 joining+0.015
 humor+0.014
Neg logit contributions
ens-0.016
Max-0.014
uffy-0.014
issy-0.014
ley-0.014
Token are
Token ID389
Feature activation+1.37e+01

Pos logit contributions
 teaching+0.059
 example+0.057
 diving+0.056
 showing+0.056
 joining+0.055
Neg logit contributions
Max-0.083
ens-0.077
Mic-0.075
Brown-0.072
Snow-0.071
Token going
Token ID1016
Feature activation+2.96e+01

Pos logit contributions
 example+0.152
 teaching+0.147
 diving+0.141
 accomplishment+0.139
 humor+0.138
Neg logit contributions
Max-0.399
ens-0.377
Mic-0.360
Spot-0.350
 Midnight-0.347
Token to
Token ID284
Feature activation+1.49e+01

Pos logit contributions
 humor+0.357
 teaching+0.352
 succeeding+0.345
 accomplishment+0.335
ulp+0.333
Neg logit contributions
Max-0.856
ens-0.803
Spot-0.755
Brown-0.746
Snow-0.745
Token the
Token ID262
Feature activation-6.81e+00

Pos logit contributions
 teaching+0.210
 example+0.200
 diving+0.199
 showing+0.198
 joining+0.195
Neg logit contributions
Max-0.420
ens-0.403
Mic-0.383
ety-0.366
Brown-0.362
Token zoo
Token ID26626
Feature activation+2.08e+00

Pos logit contributions
Max+0.165
ens+0.162
Mic+0.150
issy+0.143
ety+0.143
Neg logit contributions
 teaching-0.121
 diving-0.119
 example-0.119
 showing-0.116
 guessing-0.113
Token,"
Token ID553
Feature activation+6.07e+00

Pos logit contributions
 teaching+0.030
 example+0.029
 showing+0.028
 diving+0.028
 joining+0.028
Neg logit contributions
Max-0.058
ens-0.055
Mic-0.052
Brown-0.050
Snow-0.050
Token Lily
Token ID20037
Feature activation-2.02e+01

Pos logit contributions
 teaching+0.110
 showing+0.110
 show+0.109
 giving+0.105
 example+0.099
Neg logit contributions
Max-0.143
Mic-0.137
lves-0.132
Brown-0.130
ens-0.130
Token They
Token ID1119
Feature activation-4.11e+00

Pos logit contributions
Max+0.631
ens+0.589
Mic+0.587
Brown+0.559
ety+0.555
Neg logit contributions
 teaching-0.294
 show-0.288
 showing-0.280
 diving-0.276
 joining-0.270
Token wait
Token ID4043
Feature activation+2.64e+01

Pos logit contributions
Max+0.104
ens+0.097
Mic+0.095
Brown+0.090
Snow+0.089
Neg logit contributions
 teaching-0.068
 showing-0.066
 diving-0.065
 example-0.065
 show-0.064
Token for
Token ID329
Feature activation+2.20e+01

Pos logit contributions
 teaching+0.408
 example+0.394
 humor+0.380
 succeeding+0.368
 humour+0.354
Neg logit contributions
ens-0.639
Max-0.623
ety-0.543
 Margaret-0.536
Snow-0.533
Token mom
Token ID1995
Feature activation-9.02e+00

Pos logit contributions
 teaching+0.363
 diving+0.351
 example+0.348
 showing+0.345
 joining+0.338
Neg logit contributions
Max-0.571
ens-0.543
Mic-0.521
ety-0.498
Brown-0.497
Token to
Token ID284
Feature activation+1.58e+01

Pos logit contributions
Max+0.291
Mic+0.270
ens+0.270
Brown+0.260
Snow+0.258
Neg logit contributions
 teaching-0.094
 example-0.084
 showing-0.084
 giving-0.083
 diving-0.082
Token wake
Token ID7765
Feature activation+2.96e+01

Pos logit contributions
 teaching+0.211
 example+0.204
 diving+0.195
 showing+0.194
 succeeding+0.190
Neg logit contributions
Max-0.450
ens-0.425
Mic-0.404
Snow-0.394
Spot-0.390
Token up
Token ID510
Feature activation+1.19e+01

Pos logit contributions
 teaching+0.281
 showing+0.276
 show+0.267
 example+0.261
 giving+0.250
Neg logit contributions
Max-0.967
Mic-0.924
ens-0.906
Brown-0.883
issy-0.874
Token.
Token ID13
Feature activation+2.94e+00

Pos logit contributions
 teaching+0.180
 example+0.173
 showing+0.169
 diving+0.168
 joining+0.165
Neg logit contributions
Max-0.317
ens-0.303
Mic-0.282
Snow-0.274
Brown-0.272
Token They
Token ID1119
Feature activation-6.76e+00

Pos logit contributions
 teaching+0.054
 showing+0.052
 giving+0.052
 example+0.052
 diving+0.050
Neg logit contributions
Max-0.065
ens-0.063
Mic-0.062
lves-0.061
ley-0.060
Token are
Token ID389
Feature activation+1.20e+01

Pos logit contributions
Max+0.209
Mic+0.194
ens+0.188
Brown+0.186
ety+0.181
Neg logit contributions
 show-0.079
 teaching-0.076
 showing-0.072
 diving-0.072
 join-0.069
Token quiet
Token ID5897
Feature activation+1.20e+01

Pos logit contributions
 example+0.181
 teaching+0.172
 humor+0.170
 succeeding+0.161
 diving+0.160
Neg logit contributions
Max-0.303
ens-0.294
Spot-0.268
Snow-0.267
 Margaret-0.265
TokenYes
Token ID5297
Feature activation+1.78e+01

Pos logit contributions
Max+0.176
ens+0.165
Mic+0.154
issy+0.149
lves+0.148
Neg logit contributions
 teaching-0.208
 showing-0.206
 diving-0.203
 example-0.199
 show-0.198
Token,
Token ID11
Feature activation+1.36e+01

Pos logit contributions
 teaching+0.261
 example+0.249
 showing+0.246
 diving+0.244
 joining+0.240
Neg logit contributions
Max-0.496
ens-0.465
Mic-0.452
Brown-0.431
Snow-0.429
Token and
Token ID290
Feature activation+1.02e+01

Pos logit contributions
 teaching+0.248
 showing+0.246
 show+0.245
 diving+0.244
 joining+0.243
Neg logit contributions
Max-0.321
ens-0.312
Mic-0.303
lves-0.296
ley-0.292
Token we
Token ID356
Feature activation+1.74e+00

Pos logit contributions
 teaching+0.198
 showing+0.191
 example+0.189
 diving+0.188
 joining+0.185
Neg logit contributions
Max-0.237
ens-0.222
Mic-0.212
Brown-0.201
issy-0.199
Token can
Token ID460
Feature activation+2.08e+01

Pos logit contributions
 teaching+0.030
 example+0.029
 showing+0.029
 diving+0.029
 show+0.028
Neg logit contributions
Max-0.043
ens-0.040
Mic-0.039
Brown-0.037
Snow-0.036
Token give
Token ID1577
Feature activation+2.96e+01

Pos logit contributions
 teaching+0.285
 example+0.277
 diving+0.269
 showing+0.267
 joining+0.261
Neg logit contributions
Max-0.595
ens-0.555
Mic-0.523
Brown-0.517
Snow-0.515
Token him
Token ID683
Feature activation-7.45e+00

Pos logit contributions
 teaching+0.404
 show+0.391
 example+0.386
 showing+0.378
 diving+0.376
Neg logit contributions
Max-0.820
issy-0.772
Mic-0.765
ens-0.752
Bet-0.745
Token a
Token ID257
Feature activation-2.43e-01

Pos logit contributions
Max+0.223
Mic+0.209
ens+0.204
issy+0.197
Brown+0.196
Neg logit contributions
 teaching-0.092
 example-0.090
 diving-0.089
 showing-0.085
 show-0.084
Token scarf
Token ID36953
Feature activation-4.87e+00

Pos logit contributions
Max+0.006
ens+0.006
Mic+0.005
Snow+0.005
Spot+0.005
Neg logit contributions
 teaching-0.004
 succeeding-0.004
 showing-0.004
 example-0.004
 joining-0.004
Token too
Token ID1165
Feature activation+9.22e+00

Pos logit contributions
Max+0.150
ens+0.141
Mic+0.139
lves+0.131
Brown+0.130
Neg logit contributions
 teaching-0.055
 showing-0.052
 example-0.050
 diving-0.050
 giving-0.049
Token,"
Token ID553
Feature activation+4.82e+00

Pos logit contributions
 teaching+0.164
 example+0.156
 joining+0.155
 showing+0.154
 succeeding+0.154
Neg logit contributions
Max-0.221
ens-0.214
Snow-0.198
Spot-0.191
Mic-0.191
Token be
Token ID307
Feature activation+1.76e+01

Pos logit contributions
 humor+0.305
 example+0.294
 teaching+0.287
 succeeding+0.278
ulp+0.273
Neg logit contributions
Max-0.400
ens-0.373
Snow-0.338
 Bun-0.335
 Margaret-0.331
Token safe
Token ID3338
Feature activation+1.35e+01

Pos logit contributions
 teaching+0.270
 example+0.261
 diving+0.259
 succeeding+0.255
 accomplishment+0.255
Neg logit contributions
Max-0.428
Spot-0.380
ens-0.379
ety-0.368
Mic-0.362
Token,
Token ID11
Feature activation+1.06e+01

Pos logit contributions
 teaching+0.203
 example+0.192
 showing+0.189
 diving+0.188
 joining+0.185
Neg logit contributions
Max-0.361
ens-0.345
Mic-0.320
Snow-0.311
Spot-0.311
Token instead
Token ID2427
Feature activation+1.44e+01

Pos logit contributions
 teaching+0.169
 showing+0.165
 diving+0.162
 giving+0.160
 joining+0.159
Neg logit contributions
Max-0.278
ens-0.262
Mic-0.255
issy-0.242
Brown-0.240
Token of
Token ID286
Feature activation+1.70e+01

Pos logit contributions
 teaching+0.195
 example+0.186
 showing+0.183
 diving+0.180
 joining+0.176
Neg logit contributions
Max-0.411
ens-0.393
Mic-0.367
Snow-0.358
Brown-0.357
Token going
Token ID1016
Feature activation+2.96e+01

Pos logit contributions
 teaching+0.216
 example+0.210
aps+0.198
 showing+0.196
 diving+0.194
Neg logit contributions
Max-0.466
ens-0.442
Spot-0.408
Mic-0.408
Snow-0.404
Token ahead
Token ID4058
Feature activation+1.30e+01

Pos logit contributions
 teaching+0.416
 example+0.395
 showing+0.391
 diving+0.387
 joining+0.379
Neg logit contributions
Max-0.831
ens-0.789
Mic-0.746
Brown-0.720
Snow-0.719
Token and
Token ID290
Feature activation+6.38e+00

Pos logit contributions
 teaching+0.184
 showing+0.174
 example+0.174
 diving+0.173
 joining+0.169
Neg logit contributions
Max-0.365
ens-0.347
Mic-0.336
Brown-0.319
Snow-0.317
Token touching
Token ID15241
Feature activation+2.29e+01

Pos logit contributions
 teaching+0.092
 example+0.090
 diving+0.086
 showing+0.085
 joining+0.084
Neg logit contributions
Max-0.171
ens-0.162
Mic-0.149
Snow-0.147
Brown-0.146
Token it
Token ID340
Feature activation-8.08e+00

Pos logit contributions
 teaching+0.312
 example+0.303
 diving+0.302
 showing+0.300
 joining+0.293
Neg logit contributions
Max-0.651
Mic-0.611
ens-0.606
issy-0.578
Brown-0.575
Token."
Token ID526
Feature activation+1.09e+00

Pos logit contributions
Max+0.213
ens+0.202
Mic+0.188
Brown+0.182
Snow+0.182
Neg logit contributions
 teaching-0.126
 example-0.120
 showing-0.117
 diving-0.117
 joining-0.115
Token.
Token ID13
Feature activation-4.80e+00

Pos logit contributions
Max+0.497
ens+0.466
Mic+0.452
Brown+0.429
Snow+0.429
Neg logit contributions
 teaching-0.268
 showing-0.253
 example-0.247
 giving-0.245
 joining-0.245
Token
Token ID220
Feature activation-3.84e+00

Pos logit contributions
Max+0.110
ens+0.104
Mic+0.098
Snow+0.092
Brown+0.092
Neg logit contributions
 teaching-0.093
 showing-0.089
 example-0.088
 diving-0.088
 giving-0.087
Token
Token ID198
Feature activation-5.24e+00

Pos logit contributions
Max+0.103
ens+0.101
Mic+0.100
ley+0.097
lves+0.096
Neg logit contributions
aps-0.054
 teaching-0.051
 about-0.050
 giving-0.047
 joining-0.046
Token
Token ID198
Feature activation-5.35e+00

Pos logit contributions
Max+0.118
ens+0.116
Mic+0.108
lves+0.101
ley+0.101
Neg logit contributions
 teaching-0.099
 showing-0.094
 example-0.094
 diving-0.093
 joining-0.092
TokenSu
Token ID5606
Feature activation-1.76e+01

Pos logit contributions
Max+0.099
ens+0.096
Mic+0.087
ety+0.079
issy+0.079
Neg logit contributions
 teaching-0.125
 example-0.122
 showing-0.121
 diving-0.119
 joining-0.119
Tokenzie
Token ID49746
Feature activation-2.93e+01

Pos logit contributions
Max+0.302
Mic+0.264
Spot+0.253
Snow+0.250
ens+0.249
Neg logit contributions
 teaching-0.434
 showing-0.419
 accomplishment-0.418
 example-0.416
 giving-0.411
Token smiled
Token ID13541
Feature activation+6.51e+00

Pos logit contributions
Max+0.747
ens+0.705
Mic+0.691
Brown+0.642
Snow+0.637
Neg logit contributions
 teaching-0.488
 showing-0.472
 diving-0.460
 example-0.457
 giving-0.456
Token.
Token ID13
Feature activation-5.02e+00

Pos logit contributions
 teaching+0.111
 example+0.107
 diving+0.105
 showing+0.105
 humor+0.103
Neg logit contributions
Max-0.163
ens-0.153
Mic-0.144
Snow-0.139
Brown-0.138
Token She
Token ID1375
Feature activation-1.89e+01

Pos logit contributions
Max+0.109
ens+0.103
Mic+0.098
Spot+0.092
Snow+0.092
Neg logit contributions
 teaching-0.102
 showing-0.098
 example-0.097
 diving-0.097
 giving-0.096
Token and
Token ID290
Feature activation-9.61e-01

Pos logit contributions
Max+0.431
ens+0.405
Mic+0.381
Brown+0.361
Snow+0.359
Neg logit contributions
 teaching-0.365
 showing-0.350
 example-0.350
 diving-0.347
 joining-0.342
Token her
Token ID607
Feature activation-1.71e+01

Pos logit contributions
Max+0.023
Mic+0.021
ens+0.020
Spot+0.020
Snow+0.020
Neg logit contributions
 teaching-0.019
 showing-0.018
 giving-0.018
 joining-0.017
 diving-0.017
Token
Token ID198
Feature activation-6.78e+00

Pos logit contributions
ens+0.135
Mic+0.134
Max+0.133
ley+0.131
lves+0.126
Neg logit contributions
aps-0.083
 teaching-0.078
 winning-0.074
 joining-0.074
 join-0.074
Token
Token ID198
Feature activation-6.79e+00

Pos logit contributions
ens+0.148
Max+0.148
Mic+0.135
ety+0.132
ley+0.131
Neg logit contributions
 teaching-0.129
 diving-0.125
 joining-0.124
 example-0.123
 showing-0.122
TokenSo
Token ID2396
Feature activation-1.46e+00

Pos logit contributions
ens+0.137
Max+0.124
ley+0.123
ety+0.120
issy+0.120
Neg logit contributions
 teaching-0.145
 joining-0.144
 example-0.140
 diving-0.140
 showing-0.135
Token Bob
Token ID5811
Feature activation-2.71e+01

Pos logit contributions
Max+0.037
Mic+0.035
ens+0.034
Brown+0.033
Snow+0.032
Neg logit contributions
 example-0.025
 teaching-0.024
 diving-0.024
 showing-0.024
 show-0.024
Token and
Token ID290
Feature activation-2.36e+00

Pos logit contributions
Max+0.733
Mic+0.687
ens+0.665
Brown+0.646
Snow+0.635
Neg logit contributions
 teaching-0.411
 showing-0.395
 diving-0.392
 example-0.391
 giving-0.388
Token Jill
Token ID22772
Feature activation-2.91e+01

Pos logit contributions
Max+0.054
Mic+0.050
ens+0.046
Snow+0.046
ball+0.046
Neg logit contributions
 teaching-0.046
 giving-0.045
 example-0.045
 joining-0.045
 showing-0.045
Token left
Token ID1364
Feature activation+1.05e+01

Pos logit contributions
Max+0.769
Mic+0.716
Brown+0.691
ens+0.689
Snow+0.685
Neg logit contributions
 teaching-0.447
 show-0.436
 showing-0.435
 example-0.417
 joining-0.412
Token the
Token ID262
Feature activation-1.51e+01

Pos logit contributions
 teaching+0.181
 example+0.175
 showing+0.170
 diving+0.170
 joining+0.169
Neg logit contributions
Max-0.258
ens-0.248
Mic-0.220
Spot-0.219
Snow-0.217
Token broken
Token ID5445
Feature activation+4.38e+00

Pos logit contributions
Max+0.330
ens+0.307
Mic+0.286
Snow+0.273
Spot+0.273
Neg logit contributions
 teaching-0.306
 example-0.294
 showing-0.291
 joining-0.291
 diving-0.289
Token pick
Token ID2298
Feature activation+1.34e+01

Pos logit contributions
 teaching+0.093
 joining+0.089
 example+0.088
 diving+0.088
 showing+0.087
Neg logit contributions
Max-0.090
ens-0.086
Mic-0.078
ety-0.075
Snow-0.075
Token behind
Token ID2157
Feature activation+5.83e+00

Pos logit contributions
 teaching+0.164
 showing+0.158
 diving+0.157
 example+0.156
 joining+0.153
Neg logit contributions
Max-0.401
Mic-0.383
Brown-0.370
ens-0.359
Snow-0.355
Token being
Token ID852
Feature activation+6.85e+00

Pos logit contributions
aps+0.115
oops+0.113
ulp+0.108
 contain+0.107
apping+0.104
Neg logit contributions
Max-0.206
ens-0.186
elope-0.185
 Bun-0.181
 Blink-0.180
Token fair
Token ID3148
Feature activation+3.67e+00

Pos logit contributions
 accomplishment+0.113
 diving+0.112
ulp+0.111
 humor+0.109
 teaching+0.108
Neg logit contributions
Max-0.157
ens-0.139
 Margaret-0.137
Snow-0.132
 Bun-0.132
Token.
Token ID13
Feature activation-4.61e+00

Pos logit contributions
 teaching+0.058
 example+0.055
 showing+0.055
 diving+0.055
 joining+0.054
Neg logit contributions
Max-0.094
ens-0.090
Mic-0.083
Snow-0.081
ety-0.081
Token<|endoftext|>
Token ID50256
Feature activation-1.48e+00

Pos logit contributions
Max+0.123
ens+0.122
ball+0.116
Mic+0.115
Snow+0.113
Neg logit contributions
 teaching-0.068
 example-0.065
aps-0.062
 join-0.060
 giving-0.059
TokenSu
Token ID5606
Feature activation-1.71e+01

Pos logit contributions
ens+0.035
Mic+0.035
Max+0.034
lves+0.034
Brown+0.033
Neg logit contributions
 diving-0.024
aps-0.024
 teaching-0.024
 showing-0.023
 joining-0.023
Tokenzie
Token ID49746
Feature activation-2.90e+01

Pos logit contributions
Max+0.300
Mic+0.267
Snow+0.253
Brown+0.250
Spot+0.249
Neg logit contributions
 teaching-0.418
 accomplishment-0.400
 showing-0.400
 example-0.397
 show-0.396
Token loved
Token ID6151
Feature activation+1.66e+01

Pos logit contributions
Max+0.791
Mic+0.753
ens+0.725
Brown+0.694
Snow+0.692
Neg logit contributions
 teaching-0.435
 showing-0.423
 diving-0.406
 giving-0.406
 show-0.403
Token to
Token ID284
Feature activation+7.25e+00

Pos logit contributions
 teaching+0.208
 example+0.202
 diving+0.193
 showing+0.192
 joining+0.188
Neg logit contributions
Max-0.481
ens-0.459
Mic-0.438
Brown-0.420
Snow-0.419
Token swim
Token ID9422
Feature activation+5.47e+00

Pos logit contributions
 teaching+0.105
 example+0.104
 diving+0.097
 succeeding+0.096
 showing+0.096
Neg logit contributions
Max-0.194
ens-0.185
Mic-0.173
 Margaret-0.167
Snow-0.166
Token.
Token ID13
Feature activation-4.55e+00

Pos logit contributions
 teaching+0.067
 example+0.063
 showing+0.062
 diving+0.061
 joining+0.060
Neg logit contributions
Max-0.165
ens-0.157
Mic-0.148
Snow-0.144
Brown-0.143
Token She
Token ID1375
Feature activation-1.85e+01

Pos logit contributions
Max+0.100
Mic+0.093
ens+0.089
Spot+0.088
lves+0.087
Neg logit contributions
 example-0.089
 teaching-0.088
 join-0.087
 designing-0.086
 mixing-0.085
Token all
Token ID477
Feature activation-4.53e-01

Pos logit contributions
Max+0.539
ens+0.509
Mic+0.486
Snow+0.463
Brown+0.462
Neg logit contributions
 teaching-0.336
 showing-0.319
 example-0.316
 diving-0.311
 joining-0.310
Token along
Token ID1863
Feature activation+8.51e+00

Pos logit contributions
Max+0.012
ens+0.011
Mic+0.010
Spot+0.010
Brown+0.010
Neg logit contributions
 example-0.007
 teaching-0.006
 diving-0.006
 humor-0.006
aps-0.006
Token.
Token ID13
Feature activation-5.24e+00

Pos logit contributions
 teaching+0.130
 example+0.124
 diving+0.124
 showing+0.122
 joining+0.120
Neg logit contributions
Max-0.226
ens-0.213
Mic-0.200
Brown-0.193
Spot-0.192
Token<|endoftext|>
Token ID50256
Feature activation-1.54e+00

Pos logit contributions
Max+0.134
ens+0.132
ball+0.125
Mic+0.123
Snow+0.121
Neg logit contributions
 teaching-0.080
 example-0.077
 giving-0.076
aps-0.076
 showing-0.073
TokenSu
Token ID5606
Feature activation-1.72e+01

Pos logit contributions
Mic+0.035
ens+0.035
Max+0.035
lves+0.034
Brown+0.033
Neg logit contributions
 diving-0.026
 teaching-0.026
aps-0.026
 joining-0.025
 showing-0.024
Tokenzie
Token ID49746
Feature activation-2.90e+01

Pos logit contributions
Max+0.304
Mic+0.279
Snow+0.265
Blue+0.258
Brown+0.255
Neg logit contributions
 teaching-0.407
 accomplishment-0.394
 show-0.394
 showing-0.391
 giving-0.385
Token was
Token ID373
Feature activation+5.15e+00

Pos logit contributions
Max+0.754
Mic+0.701
ens+0.691
Brown+0.652
Snow+0.652
Neg logit contributions
 teaching-0.469
 showing-0.457
 diving-0.444
 giving-0.441
 show-0.433
Token a
Token ID257
Feature activation-7.31e+00

Pos logit contributions
 humor+0.073
 example+0.073
 teaching+0.072
 diving+0.070
 liking+0.069
Neg logit contributions
Max-0.138
ens-0.124
Mic-0.121
Spot-0.117
Brown-0.117
Token 3
Token ID513
Feature activation-9.88e-01

Pos logit contributions
ens+0.205
Max+0.205
Mic+0.187
issy+0.186
Nut+0.183
Neg logit contributions
 teaching-0.099
 example-0.098
 show-0.096
 showing-0.096
 diving-0.095
Token year
Token ID614
Feature activation+3.20e+00

Pos logit contributions
Max+0.027
ens+0.026
Mic+0.024
Brown+0.023
Snow+0.023
Neg logit contributions
 teaching-0.015
 example-0.014
 showing-0.014
 diving-0.014
 joining-0.013
Token old
Token ID1468
Feature activation-7.88e+00

Pos logit contributions
 teaching+0.042
 example+0.040
 showing+0.039
 diving+0.039
 joining+0.038
Neg logit contributions
Max-0.093
ens-0.088
Mic-0.085
Brown-0.081
Snow-0.081
Tokenâ
Token ID22940
Feature activation-8.77e+00

Pos logit contributions
Max+0.135
ens+0.132
Mic+0.125
ball+0.123
ley+0.120
Neg logit contributions
 teaching-0.093
 example-0.093
 giving-0.088
 joining-0.087
 humor-0.086
Token
Token ID26391
Feature activation-1.47e+01

Pos logit contributions
Max+0.169
ens+0.153
Mic+0.148
Brown+0.138
Snow+0.137
Neg logit contributions
 teaching-0.199
 example-0.194
 showing-0.190
 diving-0.188
 joining-0.187
Token
Token ID198
Feature activation-6.19e+00

Pos logit contributions
Mic+0.326
Max+0.316
ley+0.316
ens+0.315
issy+0.295
Neg logit contributions
 teaching-0.266
 diving-0.261
aps-0.259
 planning-0.253
 counting-0.252
Token
Token ID198
Feature activation-6.28e+00

Pos logit contributions
ens+0.149
Max+0.142
ley+0.137
lves+0.133
ety+0.133
Neg logit contributions
 teaching-0.104
 diving-0.099
 joining-0.099
 example-0.099
 giving-0.096
TokenBut
Token ID1537
Feature activation+9.02e-01

Pos logit contributions
ens+0.126
ley+0.119
issy+0.110
 Blink+0.107
gy+0.107
Neg logit contributions
 joining-0.131
 teaching-0.131
 example-0.128
 liking-0.124
 humor-0.122
Token Daisy
Token ID40355
Feature activation-2.90e+01

Pos logit contributions
 teaching+0.015
 showing+0.015
 giving+0.015
 example+0.015
 diving+0.015
Neg logit contributions
Max-0.023
Mic-0.022
ens-0.021
Brown-0.020
issy-0.020
Token was
Token ID373
Feature activation+4.09e+00

Pos logit contributions
Max+0.735
Mic+0.713
Brown+0.661
ens+0.647
Snow+0.640
Neg logit contributions
 teaching-0.475
 showing-0.461
 giving-0.455
 diving-0.447
 show-0.446
Token determined
Token ID5295
Feature activation+1.54e+01

Pos logit contributions
 example+0.060
 teaching+0.060
 humor+0.059
 showing+0.059
 accomplishment+0.059
Neg logit contributions
Max-0.105
ens-0.097
Spot-0.091
Snow-0.089
K-0.088
Token.
Token ID13
Feature activation-6.09e+00

Pos logit contributions
 teaching+0.209
 example+0.199
 showing+0.196
 diving+0.195
 joining+0.192
Neg logit contributions
Max-0.430
ens-0.415
Mic-0.376
Spot-0.372
Snow-0.372
Token She
Token ID1375
Feature activation-1.95e+01

Pos logit contributions
Max+0.113
ens+0.103
Mic+0.101
Brown+0.095
ball+0.094
Neg logit contributions
 example-0.140
 teaching-0.140
 giving-0.135
 showing-0.132
 joining-0.131
Token opened
Token ID4721
Feature activation+1.27e+01

Pos logit contributions
Max+0.497
ens+0.465
Mic+0.457
Brown+0.431
Snow+0.425
Neg logit contributions
 teaching-0.321
 example-0.310
 showing-0.309
 diving-0.303
 giving-0.297
Token a
Token ID257
Feature activation-8.81e+00

Pos logit contributions
Max+0.166
ens+0.157
Mic+0.156
ety+0.145
Brown+0.144
Neg logit contributions
 teaching-0.088
 example-0.085
 showing-0.083
 diving-0.081
 giving-0.081
Token very
Token ID845
Feature activation+6.04e+00

Pos logit contributions
Max+0.230
ens+0.210
Mic+0.203
Snow+0.198
Brown+0.195
Neg logit contributions
 teaching-0.136
 diving-0.128
 example-0.127
 showing-0.127
 succeeding-0.127
Token lovely
Token ID14081
Feature activation-1.75e+00

Pos logit contributions
 teaching+0.098
 joining+0.092
 showing+0.090
 succeeding+0.089
 giving+0.087
Neg logit contributions
Max-0.148
ens-0.143
Mic-0.129
Snow-0.128
Spot-0.128
Token dream
Token ID4320
Feature activation+4.27e+00

Pos logit contributions
Max+0.044
ens+0.042
Mic+0.038
Snow+0.037
 Margaret+0.037
Neg logit contributions
 teaching-0.029
 showing-0.027
 joining-0.027
 example-0.027
 diving-0.027
Token,
Token ID11
Feature activation-7.33e-01

Pos logit contributions
 teaching+0.063
 example+0.060
 showing+0.060
 diving+0.059
 joining+0.058
Neg logit contributions
Max-0.117
ens-0.110
Mic-0.105
Brown-0.101
Snow-0.100
Token Amy
Token ID14235
Feature activation-2.89e+01

Pos logit contributions
Max+0.017
Mic+0.016
ens+0.016
Brown+0.015
ety+0.015
Neg logit contributions
 teaching-0.013
 showing-0.013
 example-0.013
 joining-0.013
 giving-0.013
Token.
Token ID13
Feature activation-6.34e+00

Pos logit contributions
Max+0.758
ens+0.711
Mic+0.701
Brown+0.647
Spot+0.646
Neg logit contributions
 teaching-0.469
 showing-0.449
 example-0.442
 diving-0.434
 giving-0.432
Token Now
Token ID2735
Feature activation-8.60e-01

Pos logit contributions
Max+0.139
ens+0.137
Mic+0.129
lves+0.125
ball+0.124
Neg logit contributions
 teaching-0.121
 example-0.117
 joining-0.114
 giving-0.114
 showing-0.113
Token go
Token ID467
Feature activation+1.28e+01

Pos logit contributions
Max+0.022
Mic+0.021
ens+0.019
lves+0.019
Brown+0.019
Neg logit contributions
 show-0.014
 teaching-0.014
 giving-0.014
 showing-0.014
 diving-0.014
Token to
Token ID284
Feature activation+6.12e+00

Pos logit contributions
 teaching+0.161
 example+0.150
 succeeding+0.149
 showing+0.148
 humor+0.148
Neg logit contributions
Max-0.369
ens-0.364
Spot-0.322
Snow-0.322
 Margaret-0.322
Token sleep
Token ID3993
Feature activation+6.03e+00

Pos logit contributions
 teaching+0.106
 example+0.102
 showing+0.100
 succeeding+0.098
 diving+0.098
Neg logit contributions
Max-0.146
ens-0.139
Mic-0.125
Snow-0.125
 Margaret-0.125
Tokenâ
Token ID22940
Feature activation-9.07e+00

Pos logit contributions
ens+0.121
Max+0.120
Mic+0.113
Snow+0.110
ball+0.110
Neg logit contributions
 teaching-0.092
 example-0.086
 giving-0.085
 showing-0.084
 join-0.084
Token
Token ID26391
Feature activation-1.53e+01

Pos logit contributions
Max+0.178
ens+0.164
Mic+0.157
Brown+0.147
Snow+0.146
Neg logit contributions
 teaching-0.202
 example-0.197
 showing-0.193
 diving-0.189
 joining-0.188
Token
Token ID198
Feature activation-6.35e+00

Pos logit contributions
ens+0.351
ley+0.340
Mic+0.324
lves+0.312
 chick+0.310
Neg logit contributions
 teaching-0.279
aps-0.259
 learning-0.253
 showing-0.250
 testing-0.248
Token
Token ID198
Feature activation-6.39e+00

Pos logit contributions
Max+0.142
ens+0.141
Mic+0.129
ety+0.127
ley+0.125
Neg logit contributions
 teaching-0.119
 diving-0.113
 example-0.112
 joining-0.111
 showing-0.111
TokenMan
Token ID5124
Feature activation-1.42e+01

Pos logit contributions
ens+0.117
ety+0.104
ley+0.104
issy+0.102
Max+0.101
Neg logit contributions
 teaching-0.148
 joining-0.144
 example-0.142
 diving-0.139
 humor-0.138
Tokenuel
Token ID2731
Feature activation-2.88e+01

Pos logit contributions
Max+0.366
ens+0.357
Mic+0.356
lves+0.335
Spot+0.328
Neg logit contributions
 teaching-0.229
 example-0.214
 showing-0.207
 giving-0.203
 joining-0.196
Token shook
Token ID14682
Feature activation+1.17e+01

Pos logit contributions
Max+0.699
ens+0.666
Mic+0.655
Brown+0.617
Snow+0.610
Neg logit contributions
 teaching-0.510
 showing-0.489
 example-0.480
 giving-0.473
 joining-0.469
Token his
Token ID465
Feature activation-1.74e+01

Pos logit contributions
 teaching+0.156
 showing+0.151
 example+0.147
 diving+0.145
 giving+0.143
Neg logit contributions
Max-0.339
ens-0.323
Mic-0.319
issy-0.304
Brown-0.304
Token head
Token ID1182
Feature activation+4.04e+00

Pos logit contributions
Max+0.428
ens+0.399
Mic+0.371
Spot+0.363
uffy+0.362
Neg logit contributions
 teaching-0.296
 joining-0.288
 serving-0.278
 diving-0.276
 succeeding-0.275
Token.
Token ID13
Feature activation-5.99e+00

Pos logit contributions
 teaching+0.067
 example+0.064
 diving+0.064
 showing+0.063
 joining+0.062
Neg logit contributions
Max-0.103
ens-0.097
Mic-0.091
Snow-0.087
Brown-0.087
Token He
Token ID679
Feature activation-1.87e+01

Pos logit contributions
Max+0.132
ens+0.126
Mic+0.118
Brown+0.112
Snow+0.112
Neg logit contributions
 teaching-0.119
 example-0.116
 showing-0.113
 diving-0.112
 giving-0.112
Token meet
Token ID1826
Feature activation+1.13e+01

Pos logit contributions
 teaching+0.096
 example+0.091
 showing+0.090
 diving+0.090
 joining+0.088
Neg logit contributions
Max-0.155
ens-0.147
Mic-0.138
Snow-0.133
Brown-0.133
Token you
Token ID345
Feature activation-1.37e+01

Pos logit contributions
 teaching+0.185
 joining+0.180
 show+0.177
 showing+0.176
 giving+0.172
Neg logit contributions
Max-0.287
Mic-0.270
ens-0.267
Brown-0.261
issy-0.258
Token.
Token ID13
Feature activation-6.26e+00

Pos logit contributions
Max+0.388
ens+0.365
Mic+0.356
Brown+0.339
Snow+0.335
Neg logit contributions
 teaching-0.196
 diving-0.184
 showing-0.184
 example-0.183
 joining-0.181
Tokenâ
Token ID22940
Feature activation-9.21e+00

Pos logit contributions
Max+0.152
ens+0.150
Mic+0.138
ball+0.137
issy+0.136
Neg logit contributions
 teaching-0.105
 joining-0.100
 giving-0.099
 example-0.098
 showing-0.097
Token
Token ID26391
Feature activation-1.53e+01

Pos logit contributions
Max+0.177
ens+0.162
Mic+0.155
Brown+0.145
Snow+0.143
Neg logit contributions
 teaching-0.210
 example-0.203
 showing-0.201
 diving-0.199
 joining-0.198
Token Daisy
Token ID40355
Feature activation-2.88e+01

Pos logit contributions
ley+0.385
ens+0.378
Mic+0.371
issy+0.367
Max+0.362
Neg logit contributions
 teaching-0.230
 diving-0.229
aps-0.228
 showing-0.215
 planning-0.215
Token was
Token ID373
Feature activation+3.93e+00

Pos logit contributions
Max+0.735
Mic+0.696
ens+0.687
Brown+0.638
issy+0.631
Neg logit contributions
 teaching-0.482
 showing-0.476
 show-0.459
 giving-0.455
 diving-0.452
Token surprised
Token ID6655
Feature activation+1.12e+01

Pos logit contributions
 teaching+0.051
 example+0.051
 diving+0.049
 accomplishment+0.048
 showing+0.048
Neg logit contributions
Max-0.109
ens-0.102
Mic-0.096
 Margaret-0.095
Spot-0.095
Token and
Token ID290
Feature activation-1.90e+00

Pos logit contributions
 teaching+0.174
 example+0.166
 showing+0.164
 diving+0.164
 joining+0.159
Neg logit contributions
Max-0.301
ens-0.286
Mic-0.267
Snow-0.257
Brown-0.257
Token smiled
Token ID13541
Feature activation+5.94e+00

Pos logit contributions
Max+0.040
ens+0.039
Mic+0.034
Snow+0.033
Spot+0.033
Neg logit contributions
 example-0.038
 teaching-0.038
 diving-0.036
 showing-0.035
 joining-0.035
Token.
Token ID13
Feature activation-5.75e+00

Pos logit contributions
 teaching+0.099
 example+0.095
 diving+0.094
 showing+0.093
 humor+0.093
Neg logit contributions
Max-0.149
ens-0.140
Mic-0.130
Snow-0.128
Brown-0.126
Token also
Token ID635
Feature activation-2.17e+00

Pos logit contributions
 teaching+0.064
 example+0.060
 showing+0.060
 diving+0.060
 joining+0.058
Neg logit contributions
Max-0.132
ens-0.125
Mic-0.119
Brown-0.114
Snow-0.114
Token very
Token ID845
Feature activation+7.31e+00

Pos logit contributions
Max+0.060
ens+0.057
Mic+0.055
Brown+0.052
Snow+0.052
Neg logit contributions
 teaching-0.031
 example-0.029
 showing-0.029
 diving-0.029
 joining-0.029
Token happy
Token ID3772
Feature activation+6.97e+00

Pos logit contributions
 teaching+0.110
 example+0.105
 showing+0.103
 diving+0.102
 joining+0.100
Neg logit contributions
Max-0.197
ens-0.187
Mic-0.175
Snow-0.170
Brown-0.170
Token to
Token ID284
Feature activation+6.71e+00

Pos logit contributions
 teaching+0.108
 example+0.104
 diving+0.102
 showing+0.101
 joining+0.099
Neg logit contributions
Max-0.182
ens-0.174
Mic-0.160
Snow-0.155
Brown-0.155
Token meet
Token ID1826
Feature activation+1.18e+01

Pos logit contributions
 teaching+0.107
 example+0.104
 diving+0.101
 succeeding+0.099
 accomplishment+0.099
Neg logit contributions
Max-0.166
ens-0.162
 Tess-0.144
 Margaret-0.144
Bet-0.143
Token Daisy
Token ID40355
Feature activation-2.88e+01

Pos logit contributions
 teaching+0.189
 joining+0.183
 showing+0.182
 giving+0.177
 show+0.173
Neg logit contributions
Max-0.315
Mic-0.298
ens-0.291
Brown-0.283
Snow-0.278
Token,
Token ID11
Feature activation+1.86e-01

Pos logit contributions
Max+0.866
Mic+0.809
ens+0.803
Brown+0.766
Snow+0.759
Neg logit contributions
 teaching-0.387
 showing-0.369
 giving-0.363
 joining-0.363
 diving-0.351
Token and
Token ID290
Feature activation-1.18e+00

Pos logit contributions
 teaching+0.003
 showing+0.003
 giving+0.003
 joining+0.003
 diving+0.003
Neg logit contributions
Max-0.005
ens-0.005
Mic-0.005
issy-0.004
Brown-0.004
Token it
Token ID340
Feature activation-1.51e+01

Pos logit contributions
Max+0.023
ens+0.021
Mic+0.019
Brown+0.018
Snow+0.018
Neg logit contributions
 teaching-0.028
 showing-0.027
 example-0.027
 diving-0.026
 joining-0.026
Token recognized
Token ID8018
Feature activation+1.37e+01

Pos logit contributions
Max+0.378
Mic+0.359
ens+0.357
Brown+0.332
ety+0.329
Neg logit contributions
 teaching-0.264
 showing-0.262
 joining-0.254
 giving-0.250
 show-0.250
Token her
Token ID607
Feature activation-1.75e+01

Pos logit contributions
 teaching+0.192
 showing+0.185
 joining+0.183
 example+0.180
 giving+0.180
Neg logit contributions
Max-0.392
Mic-0.369
ens-0.361
Brown-0.351
ety-0.346
Token.
Token ID13
Feature activation-6.33e+00

Pos logit contributions
Max+0.596
Mic+0.579
Snow+0.535
ens+0.533
lves+0.531
Neg logit contributions
 teaching-0.237
 giving-0.215
 showing-0.214
 about-0.214
 counting-0.208
Token
Token ID198
Feature activation-6.49e+00

Pos logit contributions
Max+0.142
Mic+0.139
ens+0.139
lves+0.133
issy+0.132
Neg logit contributions
 teaching-0.111
 joining-0.108
 giving-0.106
 join-0.106
 example-0.105
Token
Token ID198
Feature activation-6.54e+00

Pos logit contributions
Max+0.147
ens+0.146
Mic+0.135
ety+0.133
ley+0.131
Neg logit contributions
 teaching-0.118
 joining-0.113
 diving-0.113
 example-0.111
 showing-0.111
TokenSo
Token ID2396
Feature activation-1.07e+00

Pos logit contributions
ens+0.132
ley+0.124
ety+0.122
gy+0.118
issy+0.118
Neg logit contributions
 teaching-0.135
 joining-0.134
 example-0.127
 diving-0.127
 showing-0.124
Token,
Token ID11
Feature activation-4.53e-01

Pos logit contributions
Max+0.026
Mic+0.026
Brown+0.024
Snow+0.024
lves+0.024
Neg logit contributions
 example-0.019
 show-0.018
 teaching-0.018
 accomplishment-0.018
 showing-0.018
Token Amy
Token ID14235
Feature activation-2.88e+01

Pos logit contributions
Mic+0.011
Max+0.010
Brown+0.010
lves+0.010
Nut+0.010
Neg logit contributions
 teaching-0.008
 joining-0.008
 show-0.008
 showing-0.008
 diving-0.008
Token and
Token ID290
Feature activation-1.78e+00

Pos logit contributions
Max+0.737
ens+0.691
Mic+0.674
Brown+0.632
Snow+0.627
Neg logit contributions
 teaching-0.480
 showing-0.462
 example-0.458
 diving-0.455
 joining-0.449
Token her
Token ID607
Feature activation-1.79e+01

Pos logit contributions
Max+0.044
Mic+0.040
Brown+0.038
ens+0.038
Snow+0.037
Neg logit contributions
 teaching-0.032
 giving-0.031
 joining-0.031
 showing-0.031
 example-0.031
Token mum
Token ID25682
Feature activation-1.91e+01

Pos logit contributions
Max+0.391
ens+0.384
Mic+0.346
uffy+0.330
ety+0.329
Neg logit contributions
 teaching-0.337
 example-0.320
 showing-0.315
 joining-0.315
 succeeding-0.309
Token continued
Token ID3767
Feature activation+1.18e+01

Pos logit contributions
Max+0.474
ens+0.441
Mic+0.429
Brown+0.407
Snow+0.406
Neg logit contributions
 teaching-0.337
 showing-0.321
 diving-0.319
 example-0.317
 joining-0.316
Token walking
Token ID6155
Feature activation+9.45e+00

Pos logit contributions
 humor+0.170
 accomplishment+0.156
 example+0.156
 apology+0.155
 teaching+0.155
Neg logit contributions
Max-0.305
ens-0.295
 Margaret-0.279
 Tess-0.269
 Tiny-0.267
Token touched
Token ID12615
Feature activation+1.20e+01

Pos logit contributions
Max+0.437
ens+0.413
Mic+0.393
Brown+0.366
ety+0.362
Neg logit contributions
 teaching-0.445
 showing-0.433
 joining-0.419
 diving-0.419
 giving-0.418
Token his
Token ID465
Feature activation-1.70e+01

Pos logit contributions
 teaching+0.165
 showing+0.157
 example+0.151
 joining+0.150
 giving+0.150
Neg logit contributions
Max-0.344
Mic-0.322
ens-0.316
Brown-0.312
Snow-0.311
Token huge
Token ID3236
Feature activation-2.78e+00

Pos logit contributions
Max+0.414
ens+0.386
Mic+0.361
ball+0.355
Spot+0.349
Neg logit contributions
 teaching-0.267
 joining-0.263
 example-0.255
 succeeding-0.251
 diving-0.251
Token body
Token ID1767
Feature activation-1.09e+00

Pos logit contributions
Max+0.060
ens+0.058
Mic+0.052
ball+0.050
ley+0.050
Neg logit contributions
 teaching-0.052
 joining-0.050
 example-0.049
 diving-0.049
 succeeding-0.049
Token.
Token ID13
Feature activation-5.81e+00

Pos logit contributions
Max+0.029
ens+0.027
Mic+0.026
Brown+0.025
Snow+0.025
Neg logit contributions
 teaching-0.017
 example-0.016
 showing-0.016
 diving-0.016
 joining-0.015
Token Daisy
Token ID40355
Feature activation-2.87e+01

Pos logit contributions
Max+0.146
ens+0.143
Snow+0.137
lves+0.135
ety+0.134
Neg logit contributions
 teaching-0.094
 showing-0.088
 giving-0.085
 joining-0.085
 example-0.084
Token and
Token ID290
Feature activation-1.51e+00

Pos logit contributions
Max+0.671
ens+0.632
Mic+0.597
Brown+0.563
Snow+0.562
Neg logit contributions
 teaching-0.541
 showing-0.520
 example-0.517
 diving-0.512
 joining-0.506
Token the
Token ID262
Feature activation-1.45e+01

Pos logit contributions
Max+0.040
Snow+0.037
ens+0.037
issy+0.036
Brown+0.036
Neg logit contributions
 teaching-0.026
 showing-0.025
 giving-0.024
 joining-0.023
 show-0.023
Token monster
Token ID9234
Feature activation-1.76e+01

Pos logit contributions
Max+0.317
ens+0.301
Mic+0.281
ety+0.271
Snow+0.270
Neg logit contributions
 teaching-0.301
 showing-0.287
 example-0.287
 diving-0.287
 joining-0.280
Token soon
Token ID2582
Feature activation-2.24e+00

Pos logit contributions
Max+0.414
ens+0.387
Mic+0.369
Snow+0.348
Brown+0.347
Neg logit contributions
 teaching-0.330
 showing-0.316
 example-0.312
 diving-0.312
 joining-0.309
Token became
Token ID2627
Feature activation+7.76e+00

Pos logit contributions
Max+0.060
Mic+0.057
Brown+0.056
ens+0.053
Snow+0.053
Neg logit contributions
 diving-0.035
 giving-0.035
 teaching-0.035
 joining-0.034
 showing-0.034
Token the
Token ID262
Feature activation-1.34e+01

Pos logit contributions
 teaching+0.106
 example+0.101
 showing+0.097
 diving+0.093
 joining+0.093
Neg logit contributions
Max-0.317
Mic-0.300
ens-0.296
Brown-0.288
ety-0.287
Token flexible
Token ID12846
Feature activation+4.06e+00

Pos logit contributions
Max+0.330
ens+0.309
Mic+0.289
Snow+0.282
Brown+0.280
Neg logit contributions
 teaching-0.234
 example-0.226
 showing-0.222
 diving-0.220
 joining-0.218
Token band
Token ID4097
Feature activation-1.75e+00

Pos logit contributions
 teaching+0.082
 example+0.079
 showing+0.077
 diving+0.077
 joining+0.077
Neg logit contributions
Max-0.088
ens-0.083
Mic-0.076
Snow-0.074
Brown-0.073
Token.
Token ID13
Feature activation-5.13e+00

Pos logit contributions
Max+0.048
ens+0.045
Mic+0.044
Snow+0.041
Spot+0.041
Neg logit contributions
 teaching-0.028
 showing-0.027
 giving-0.025
 joining-0.025
 diving-0.025
Token Little
Token ID7703
Feature activation-1.68e+01

Pos logit contributions
Max+0.115
ens+0.108
Mic+0.102
Snow+0.098
Brown+0.098
Neg logit contributions
 teaching-0.104
 example-0.099
 showing-0.098
 giving-0.096
 diving-0.095
Token Judy
Token ID26395
Feature activation-2.87e+01

Pos logit contributions
Mic+0.432
Max+0.422
Mid+0.419
Snow+0.412
Fl+0.408
Neg logit contributions
 counting-0.272
 teaching-0.270
 example-0.262
 showing-0.261
 adding-0.258
Token was
Token ID373
Feature activation+4.95e+00

Pos logit contributions
Max+0.715
ens+0.683
Mic+0.650
Snow+0.616
Brown+0.612
Neg logit contributions
 teaching-0.495
 showing-0.483
 example-0.465
 diving-0.465
 joining-0.461
Token so
Token ID523
Feature activation+6.33e+00

Pos logit contributions
 apology+0.085
 humor+0.081
ulp+0.078
 accomplishment+0.077
 manners+0.077
Neg logit contributions
Max-0.110
 Margaret-0.096
 Kathy-0.094
mie-0.093
 Tess-0.093
Token proud
Token ID6613
Feature activation+1.06e+01

Pos logit contributions
 example+0.116
 teaching+0.107
ags+0.105
 humor+0.103
ulp+0.101
Neg logit contributions
Max-0.134
ens-0.112
Spot-0.110
ety-0.107
K-0.107
Token that
Token ID326
Feature activation-5.96e+00

Pos logit contributions
 example+0.108
 teaching+0.108
 diving+0.101
 succeeding+0.100
 showing+0.099
Neg logit contributions
Max-0.325
ens-0.314
Mic-0.287
Spot-0.281
Snow-0.281
Token they
Token ID484
Feature activation-1.67e+01

Pos logit contributions
Max+0.129
Mic+0.123
lves+0.117
ens+0.116
Brown+0.115
Neg logit contributions
 teaching-0.132
 showing-0.129
 giving-0.127
 show-0.122
 diving-0.121
Token ended
Token ID4444
Feature activation+1.23e+01

Pos logit contributions
 example+0.135
 teaching+0.135
 humor+0.130
 diving+0.126
aps+0.126
Neg logit contributions
ens-0.149
Max-0.147
ety-0.132
 Bun-0.123
Snow-0.121
Token!
Token ID0
Feature activation-3.02e+00

Pos logit contributions
 showing+0.143
 teaching+0.141
 example+0.141
 diving+0.136
 giving+0.135
Neg logit contributions
Max-0.385
ens-0.366
Mic-0.363
issy-0.345
Brown-0.340
Token
Token ID198
Feature activation-4.58e+00

Pos logit contributions
Max+0.076
ens+0.072
Mic+0.072
ball+0.066
ley+0.066
Neg logit contributions
 teaching-0.051
 example-0.049
 showing-0.047
 giving-0.047
 joining-0.045
Token
Token ID198
Feature activation-4.75e+00

Pos logit contributions
Max+0.124
ens+0.122
Mic+0.115
ety+0.110
ley+0.109
Neg logit contributions
 teaching-0.067
 diving-0.062
 showing-0.062
 example-0.061
 joining-0.061
TokenSu
Token ID5606
Feature activation-1.69e+01

Pos logit contributions
Max+0.091
ens+0.090
Mic+0.081
ley+0.075
ety+0.075
Neg logit contributions
 teaching-0.107
 example-0.104
 showing-0.103
 joining-0.102
 diving-0.102
Tokenzie
Token ID49746
Feature activation-2.86e+01

Pos logit contributions
Max+0.295
Mic+0.258
ens+0.242
Spot+0.239
Snow+0.238
Neg logit contributions
 teaching-0.419
 example-0.406
 accomplishment-0.404
 showing-0.400
 giving-0.397
Token was
Token ID373
Feature activation+5.62e+00

Pos logit contributions
Max+0.685
ens+0.646
Mic+0.622
Brown+0.577
Snow+0.577
Neg logit contributions
 teaching-0.528
 showing-0.508
 example-0.502
 diving-0.493
 giving-0.492
Token sad
Token ID6507
Feature activation+6.28e+00

Pos logit contributions
 humor+0.087
 diving+0.084
 teaching+0.083
 example+0.082
 liking+0.080
Neg logit contributions
Max-0.139
ens-0.124
Snow-0.120
Spot-0.120
Brown-0.120
Token that
Token ID326
Feature activation-5.19e+00

Pos logit contributions
 teaching+0.104
 example+0.099
 showing+0.098
 diving+0.098
 joining+0.096
Neg logit contributions
Max-0.159
ens-0.153
Mic-0.139
ety-0.135
Snow-0.135
Token she
Token ID673
Feature activation-1.98e+01

Pos logit contributions
Max+0.107
Mic+0.099
ens+0.095
Brown+0.091
Snow+0.089
Neg logit contributions
 teaching-0.118
 showing-0.114
 giving-0.113
 example-0.112
 diving-0.111
Token no
Token ID645
Feature activation+2.21e+00

Pos logit contributions
Max+0.425
ens+0.399
Mic+0.382
Brown+0.355
Snow+0.354
Neg logit contributions
 teaching-0.408
 showing-0.397
 example-0.384
 giving-0.382
 diving-0.382
Token
Token ID8151
Feature activation-2.52e+00

Pos logit contributions
Mic+0.327
ley+0.321
ens+0.310
Max+0.300
 chick+0.295
Neg logit contributions
 teaching-0.311
 learning-0.288
 planning-0.287
 showing-0.286
 diving-0.282
Tokens
Token ID82
Feature activation-5.70e+00

Pos logit contributions
Mic+0.063
Max+0.062
Brown+0.058
lves+0.058
Snow+0.058
Neg logit contributions
 of-0.041
 about-0.038
 teaching-0.035
aps-0.035
 diving-0.033
Token the
Token ID262
Feature activation-1.51e+01

Pos logit contributions
Max+0.172
ens+0.162
Mic+0.160
Brown+0.150
Spot+0.149
Neg logit contributions
 teaching-0.073
 example-0.069
 showing-0.068
 giving-0.067
 diving-0.066
Token story
Token ID1621
Feature activation-1.22e+00

Pos logit contributions
Max+0.434
ens+0.411
Mic+0.391
Brown+0.377
Snow+0.377
Neg logit contributions
 teaching-0.203
 example-0.191
 showing-0.190
 diving-0.189
 joining-0.185
Token of
Token ID286
Feature activation+7.39e+00

Pos logit contributions
Max+0.043
ens+0.041
Mic+0.040
Brown+0.038
ety+0.038
Neg logit contributions
 teaching-0.009
 showing-0.008
 example-0.008
 joining-0.007
 diving-0.007
Token Sam
Token ID3409
Feature activation-2.86e+01

Pos logit contributions
 teaching+0.122
 example+0.118
 showing+0.117
 diving+0.116
 joining+0.113
Neg logit contributions
Max-0.188
ens-0.178
Mic-0.166
Snow-0.160
Spot-0.159
Token and
Token ID290
Feature activation-2.04e+00

Pos logit contributions
Max+0.695
Mic+0.664
ens+0.638
Brown+0.604
Snow+0.596
Neg logit contributions
 teaching-0.524
 showing-0.502
 giving-0.492
 joining-0.489
 show-0.486
Token her
Token ID607
Feature activation-1.81e+01

Pos logit contributions
Max+0.050
ens+0.047
Mic+0.045
Brown+0.043
Snow+0.043
Neg logit contributions
 teaching-0.036
 example-0.035
 showing-0.035
 diving-0.034
 joining-0.034
Token travels
Token ID17781
Feature activation+3.93e+00

Pos logit contributions
Max+0.444
ens+0.430
Mic+0.395
ety+0.382
Snow+0.382
Neg logit contributions
 teaching-0.290
 example-0.273
 joining-0.273
 showing-0.267
 diving-0.265
Token
Token ID6184
Feature activation-1.03e+00

Pos logit contributions
 teaching+0.049
 example+0.046
 showing+0.046
 diving+0.045
 joining+0.044
Neg logit contributions
Max-0.117
ens-0.111
Mic-0.108
Brown-0.103
Snow-0.102
Token
Token ID95
Feature activation-8.06e+00

Pos logit contributions
Max+0.024
ens+0.022
Snow+0.021
Mic+0.021
Brown+0.020
Neg logit contributions
 teaching-0.019
 showing-0.019
 diving-0.018
 example-0.018
 giving-0.018
Tokenyear
Token ID1941
Feature activation+1.75e+01

Pos logit contributions
 teaching+0.014
 example+0.013
 showing+0.013
 diving+0.012
 joining+0.012
Neg logit contributions
Max-0.053
ens-0.051
Mic-0.049
Brown-0.047
Snow-0.047
Token-
Token ID12
Feature activation+1.47e+00

Pos logit contributions
 teaching+0.376
 joining+0.366
 diving+0.361
 showing+0.358
 guessing+0.358
Neg logit contributions
ens-0.340
Max-0.324
Mic-0.297
ley-0.295
Snow-0.286
Tokenold
Token ID727
Feature activation-6.16e+00

Pos logit contributions
 teaching+0.024
 example+0.023
 showing+0.022
 humor+0.022
 joining+0.021
Neg logit contributions
Max-0.038
ens-0.036
Mic-0.034
Brown-0.033
uffy-0.033
Token girl
Token ID2576
Feature activation-2.38e+01

Pos logit contributions
Max+0.122
Mic+0.110
ens+0.107
Snow+0.102
Brown+0.102
Neg logit contributions
 teaching-0.144
 example-0.139
 showing-0.138
 giving-0.134
 show-0.134
Token named
Token ID3706
Feature activation+3.88e+00

Pos logit contributions
Max+0.620
ens+0.577
Mic+0.563
Brown+0.534
Snow+0.532
Neg logit contributions
 teaching-0.395
 showing-0.377
 example-0.373
 diving-0.364
 joining-0.362
Token Lisa
Token ID15378
Feature activation-2.86e+01

Pos logit contributions
 teaching+0.072
 counting+0.072
 giving+0.072
 joining+0.071
 example+0.071
Neg logit contributions
Max-0.087
Brown-0.078
Snow-0.077
Mic-0.077
ball-0.076
Token decided
Token ID3066
Feature activation+7.19e+00

Pos logit contributions
Max+0.855
Mic+0.801
ens+0.790
Brown+0.753
Spot+0.749
Neg logit contributions
 teaching-0.369
 showing-0.354
 giving-0.338
 joining-0.334
 diving-0.328
Token to
Token ID284
Feature activation+6.62e+00

Pos logit contributions
 teaching+0.072
 example+0.069
 diving+0.068
 showing+0.065
 joining+0.065
Neg logit contributions
Max-0.228
ens-0.218
Mic-0.204
Snow-0.200
Brown-0.199
Token take
Token ID1011
Feature activation+1.57e+01

Pos logit contributions
 teaching+0.081
 diving+0.080
 example+0.079
 succeeding+0.075
 joining+0.075
Neg logit contributions
Max-0.191
ens-0.179
Mic-0.173
Brown-0.166
Snow-0.165
Token the
Token ID262
Feature activation-1.40e+01

Pos logit contributions
 example+0.229
 teaching+0.221
 show+0.219
 showing+0.204
 diving+0.204
Neg logit contributions
Max-0.433
Mic-0.405
Brown-0.403
ety-0.402
ens-0.399
Token challenge
Token ID4427
Feature activation+1.28e+01

Pos logit contributions
Max+0.352
ens+0.329
Mic+0.318
Brown+0.302
Snow+0.299
Neg logit contributions
 teaching-0.225
 diving-0.217
 example-0.216
 joining-0.215
 showing-0.212
Token.
Token ID13
Feature activation-5.13e+00

Pos logit contributions
Max+0.109
ens+0.103
Brown+0.093
ball+0.093
Snow+0.093
Neg logit contributions
 example-0.077
 teaching-0.076
 humor-0.074
 diving-0.072
 succeeding-0.071
Token
Token ID220
Feature activation-3.96e+00

Pos logit contributions
Max+0.110
Brown+0.106
Snow+0.104
ens+0.104
issy+0.100
Neg logit contributions
 teaching-0.099
 showing-0.095
 example-0.094
 giving-0.092
 mixing-0.092
Token
Token ID198
Feature activation-5.48e+00

Pos logit contributions
ens+0.108
Mic+0.108
ley+0.108
Max+0.106
lves+0.104
Neg logit contributions
aps-0.054
 about-0.051
 teaching-0.048
 of-0.044
 join-0.044
Token
Token ID198
Feature activation-5.55e+00

Pos logit contributions
ens+0.149
ley+0.139
lves+0.139
Max+0.134
issy+0.134
Neg logit contributions
 teaching-0.074
aps-0.068
 joining-0.068
 example-0.066
 join-0.066
TokenLittle
Token ID22253
Feature activation-1.62e+01

Pos logit contributions
ens+0.099
issy+0.085
ley+0.084
Max+0.084
lves+0.080
Neg logit contributions
 teaching-0.130
 example-0.130
 joining-0.126
 liking-0.124
 showing-0.124
Token Daisy
Token ID40355
Feature activation-2.86e+01

Pos logit contributions
Brown+0.343
Mic+0.333
Max+0.330
Snow+0.318
issy+0.312
Neg logit contributions
 show-0.304
 example-0.300
 showing-0.293
 tasting-0.285
 teaching-0.285
Token was
Token ID373
Feature activation+4.50e+00

Pos logit contributions
Max+0.755
ens+0.752
Mic+0.743
Brown+0.692
Nut+0.686
Neg logit contributions
 showing-0.425
 teaching-0.403
 show-0.399
 giving-0.386
 diving-0.376
Token disappointed
Token ID11679
Feature activation+8.21e+00

Pos logit contributions
 humor+0.073
 accomplishment+0.071
 apology+0.069
 example+0.068
ulp+0.068
Neg logit contributions
Max-0.107
 Margaret-0.095
ens-0.093
 Kathy-0.093
 Midnight-0.091
Token but
Token ID475
Feature activation-2.83e-02

Pos logit contributions
 teaching+0.139
 example+0.133
 showing+0.132
 diving+0.132
 joining+0.129
Neg logit contributions
Max-0.206
ens-0.196
Mic-0.181
Snow-0.175
Brown-0.175
Token her
Token ID607
Feature activation-1.72e+01

Pos logit contributions
Max+0.001
Mic+0.001
ens+0.001
Brown+0.001
Snow+0.001
Neg logit contributions
 teaching-0.001
 showing-0.001
 giving-0.000
 diving-0.000
 example-0.000
Token mom
Token ID1995
Feature activation-1.85e+01

Pos logit contributions
Max+0.311
ens+0.277
Mic+0.251
uffy+0.241
Brown+0.241
Neg logit contributions
 teaching-0.411
 joining-0.404
 diving-0.397
 showing-0.392
 example-0.391
TokenOh
Token ID5812
Feature activation-3.76e+00

Pos logit contributions
 teaching+0.005
 example+0.005
 showing+0.005
 diving+0.005
 joining+0.005
Neg logit contributions
Max-0.005
ens-0.005
Mic-0.004
issy-0.004
Brown-0.004
Token wow
Token ID28796
Feature activation-4.16e+00

Pos logit contributions
Max+0.105
ens+0.098
Mic+0.098
Brown+0.091
ety+0.090
Neg logit contributions
 teaching-0.055
 example-0.053
 diving-0.053
 showing-0.052
 joining-0.051
Token,
Token ID11
Feature activation-2.80e-01

Pos logit contributions
Max+0.117
Mic+0.112
ens+0.108
Brown+0.106
Nut+0.102
Neg logit contributions
 diving-0.058
 teaching-0.057
 joining-0.056
 example-0.056
 showing-0.055
Tokenâ
Token ID22940
Feature activation-9.50e+00

Pos logit contributions
Max+0.007
Mic+0.007
ens+0.007
Brown+0.006
issy+0.006
Neg logit contributions
 teaching-0.005
 diving-0.004
 example-0.004
 joining-0.004
 showing-0.004
Token
Token ID26391
Feature activation-1.53e+01

Pos logit contributions
Max+0.185
ens+0.169
Mic+0.163
Brown+0.152
ety+0.149
Neg logit contributions
 teaching-0.214
 example-0.207
 showing-0.206
 diving-0.204
 joining-0.202
Token Jill
Token ID22772
Feature activation-2.86e+01

Pos logit contributions
Mic+0.329
Max+0.316
ens+0.312
ley+0.305
Brown+0.291
Neg logit contributions
 diving-0.299
 teaching-0.290
aps-0.281
 planning-0.281
 showing-0.279
Token said
Token ID531
Feature activation+1.24e+00

Pos logit contributions
Max+0.830
Mic+0.795
ens+0.765
Brown+0.735
Snow+0.733
Neg logit contributions
 teaching-0.384
 showing-0.382
 show-0.372
 diving-0.368
 giving-0.355
Token,
Token ID11
Feature activation-3.26e-01

Pos logit contributions
 teaching+0.017
 diving+0.016
 example+0.016
 showing+0.016
 joining+0.016
Neg logit contributions
Max-0.036
ens-0.034
Mic-0.033
Brown-0.031
ety-0.031
Token
Token ID6184
Feature activation-9.81e-01

Pos logit contributions
Max+0.009
ens+0.008
Mic+0.008
ety+0.008
Brown+0.008
Neg logit contributions
 teaching-0.005
 showing-0.005
 diving-0.005
 giving-0.005
 example-0.005
Token
Token ID95
Feature activation-7.94e+00

Pos logit contributions
Max+0.021
ens+0.020
Mic+0.018
Brown+0.017
Snow+0.017
Neg logit contributions
 teaching-0.020
 example-0.020
 showing-0.019
 diving-0.019
 joining-0.019
Token
Token ID26391
Feature activation-1.54e+01

Pos logit contributions
Max+0.167
ens+0.153
Mic+0.142
Snow+0.136
Brown+0.136
Neg logit contributions
 teaching-0.169
 diving-0.163
 showing-0.163
 example-0.163
 joining-0.160
Token the
Token ID262
Feature activation-1.40e+01

Pos logit contributions
 diving+0.142
 showing+0.141
 teaching+0.141
 example+0.136
 giving+0.134
Neg logit contributions
Max-0.300
Mic-0.282
ens-0.280
issy-0.271
Brown-0.263
Token sweet
Token ID6029
Feature activation-1.77e+00

Pos logit contributions
Max+0.334
ens+0.314
Mic+0.294
issy+0.287
 Margaret+0.279
Neg logit contributions
 teaching-0.255
 diving-0.251
 showing-0.251
 example-0.248
 giving-0.241
Token cher
Token ID21382
Feature activation-5.14e-01

Pos logit contributions
Max+0.039
ens+0.038
Mic+0.034
ety+0.033
Spot+0.033
Neg logit contributions
 teaching-0.035
 example-0.033
 showing-0.033
 diving-0.033
 succeeding-0.032
Tokenries
Token ID1678
Feature activation-8.36e+00

Pos logit contributions
Max+0.014
Mic+0.013
 Margaret+0.013
ens+0.013
Mid+0.012
Neg logit contributions
 diving-0.007
 mixing-0.007
 teaching-0.007
 counting-0.007
 adding-0.007
Token.
Token ID13
Feature activation-5.56e+00

Pos logit contributions
Max+0.233
ens+0.218
Mic+0.211
Brown+0.202
Snow+0.200
Neg logit contributions
 teaching-0.121
 showing-0.115
 diving-0.115
 example-0.112
 joining-0.111
Token Daisy
Token ID40355
Feature activation-2.86e+01

Pos logit contributions
Max+0.133
Mic+0.128
ens+0.126
Brown+0.119
ley+0.117
Neg logit contributions
 teaching-0.094
 example-0.093
 joining-0.091
 giving-0.091
 showing-0.089
Token and
Token ID290
Feature activation-1.39e+00

Pos logit contributions
Max+0.714
ens+0.678
Mic+0.667
Brown+0.617
issy+0.613
Neg logit contributions
 teaching-0.494
 showing-0.485
 giving-0.466
 diving-0.461
 joining-0.459
Token Tom
Token ID4186
Feature activation-2.56e+01

Pos logit contributions
Max+0.036
Mic+0.035
Brown+0.033
Red+0.033
Snow+0.033
Neg logit contributions
 teaching-0.024
 showing-0.023
 giving-0.023
 joining-0.022
 show-0.022
Token had
Token ID550
Feature activation+3.86e+00

Pos logit contributions
Max+0.670
Mic+0.656
Nut+0.597
Brown+0.595
Snow+0.594
Neg logit contributions
 teaching-0.388
 showing-0.375
 show-0.375
 joining-0.373
 giving-0.368
Token the
Token ID262
Feature activation-1.42e+01

Pos logit contributions
 teaching+0.051
 example+0.048
 showing+0.048
 diving+0.047
 joining+0.046
Neg logit contributions
Max-0.113
ens-0.108
Mic-0.103
Brown-0.099
Snow-0.098
Token best
Token ID1266
Feature activation-1.02e+00

Pos logit contributions
Max+0.395
ens+0.372
Mic+0.354
Brown+0.341
Snow+0.340
Neg logit contributions
 teaching-0.208
 example-0.200
 showing-0.197
 diving-0.195
 joining-0.190
Token.
Token ID13
Feature activation-4.68e+00

Pos logit contributions
 teaching+0.004
 showing+0.004
 example+0.004
 giving+0.004
 diving+0.004
Neg logit contributions
Max-0.009
Mic-0.009
ens-0.009
Brown-0.008
lves-0.008
Token
Token ID220
Feature activation-3.35e+00

Pos logit contributions
Max+0.109
Mic+0.102
ens+0.102
Snow+0.093
ball+0.093
Neg logit contributions
 teaching-0.088
 example-0.086
 showing-0.084
 giving-0.082
 joining-0.081
Token
Token ID198
Feature activation-5.07e+00

Pos logit contributions
Max+0.089
Mic+0.089
ens+0.088
ley+0.085
lves+0.081
Neg logit contributions
aps-0.048
 teaching-0.046
 about-0.045
 planning-0.042
 join-0.042
Token
Token ID198
Feature activation-5.11e+00

Pos logit contributions
Max+0.116
ens+0.112
Mic+0.106
ley+0.100
ety+0.099
Neg logit contributions
 teaching-0.094
 showing-0.089
 diving-0.089
 example-0.088
 joining-0.088
TokenSu
Token ID5606
Feature activation-1.70e+01

Pos logit contributions
Max+0.094
ens+0.091
Mic+0.083
ley+0.076
ety+0.076
Neg logit contributions
 teaching-0.119
 example-0.116
 showing-0.115
 joining-0.114
 diving-0.114
Tokenzie
Token ID49746
Feature activation-2.86e+01

Pos logit contributions
Max+0.299
Mic+0.259
ens+0.241
Spot+0.239
Snow+0.237
Neg logit contributions
 teaching-0.421
 example-0.412
 accomplishment-0.409
 showing-0.405
 giving-0.400
Token was
Token ID373
Feature activation+5.55e+00

Pos logit contributions
Max+0.677
ens+0.637
Mic+0.631
Snow+0.574
Brown+0.574
Neg logit contributions
 teaching-0.530
 showing-0.512
 example-0.505
 giving-0.495
 diving-0.494
Token so
Token ID523
Feature activation+6.81e+00

Pos logit contributions
 humor+0.085
 example+0.082
 diving+0.081
 teaching+0.080
 liking+0.079
Neg logit contributions
Max-0.137
ens-0.124
 Margaret-0.122
 Kathy-0.119
Spot-0.118
Token disappointed
Token ID11679
Feature activation+9.24e+00

Pos logit contributions
 example+0.108
 teaching+0.108
 humor+0.105
 diving+0.105
 showing+0.105
Neg logit contributions
Max-0.164
ens-0.155
Spot-0.142
 Margaret-0.142
Snow-0.138
Token and
Token ID290
Feature activation-5.61e-01

Pos logit contributions
 teaching+0.135
 example+0.128
 showing+0.128
 diving+0.126
 giving+0.125
Neg logit contributions
Max-0.258
ens-0.240
Mic-0.240
Brown-0.225
Snow-0.223
Token tried
Token ID3088
Feature activation+1.73e+01

Pos logit contributions
Max+0.014
ens+0.013
Mic+0.013
Brown+0.012
Snow+0.012
Neg logit contributions
 teaching-0.010
 showing-0.010
 example-0.009
 diving-0.009
 giving-0.009
Token
Token ID26391
Feature activation-1.56e+01

Pos logit contributions
Max+0.192
ens+0.178
Mic+0.167
Brown+0.158
Snow+0.157
Neg logit contributions
 teaching-0.202
 example-0.195
 showing-0.194
 diving-0.193
 joining-0.190
Token
Token ID8151
Feature activation-2.78e+00

Pos logit contributions
Mic+0.316
Max+0.315
ens+0.307
ley+0.295
Brown+0.284
Neg logit contributions
 teaching-0.321
 diving-0.318
aps-0.309
 counting-0.301
 showing-0.300
Tokens
Token ID82
Feature activation-6.11e+00

Pos logit contributions
Max+0.064
Mic+0.063
ens+0.059
Snow+0.059
Brown+0.059
Neg logit contributions
 teaching-0.049
 diving-0.046
 joining-0.046
 winning-0.045
 giving-0.045
Token okay
Token ID8788
Feature activation+2.18e+00

Pos logit contributions
Max+0.165
ens+0.156
Mic+0.150
Brown+0.142
Snow+0.142
Neg logit contributions
 teaching-0.094
 example-0.089
 showing-0.088
 diving-0.088
 joining-0.086
Token,
Token ID11
Feature activation-7.77e-01

Pos logit contributions
 teaching+0.032
 example+0.031
 diving+0.030
 showing+0.030
 giving+0.030
Neg logit contributions
Max-0.061
ens-0.057
Mic-0.057
Brown-0.053
lves-0.053
Token Jim
Token ID5395
Feature activation-2.86e+01

Pos logit contributions
Max+0.019
ens+0.018
Mic+0.018
Brown+0.016
issy+0.016
Neg logit contributions
 teaching-0.014
 diving-0.013
 example-0.013
 showing-0.013
 joining-0.013
Token.
Token ID13
Feature activation-6.28e+00

Pos logit contributions
Max+0.816
Mic+0.771
ens+0.744
Brown+0.709
lves+0.708
Neg logit contributions
 teaching-0.402
 showing-0.382
 giving-0.377
 example-0.374
 diving-0.373
Token I
Token ID314
Feature activation-9.45e+00

Pos logit contributions
Max+0.148
ens+0.143
Mic+0.136
ball+0.128
issy+0.127
Neg logit contributions
 teaching-0.114
 example-0.110
 joining-0.107
 diving-0.106
 giving-0.106
Token can
Token ID460
Feature activation+1.00e+01

Pos logit contributions
Max+0.210
ens+0.201
issy+0.177
ety+0.177
uffy+0.172
Neg logit contributions
 teaching-0.182
 diving-0.180
 showing-0.178
 example-0.174
 joining-0.174
Token help
Token ID1037
Feature activation+1.53e+01

Pos logit contributions
 diving+0.167
 example+0.163
 humor+0.155
ulp+0.154
 teaching+0.154
Neg logit contributions
Max-0.248
ens-0.222
issy-0.210
Brown-0.208
 Tess-0.205
Token you
Token ID345
Feature activation-1.35e+01

Pos logit contributions
 teaching+0.260
 diving+0.255
 show+0.254
 joining+0.253
 showing+0.251
Neg logit contributions
Max-0.378
Mic-0.357
ens-0.352
ety-0.331
issy-0.329
Token what
Token ID644
Feature activation+8.50e+00

Pos logit contributions
 teaching+0.309
 example+0.294
 showing+0.289
 giving+0.286
 joining+0.281
Neg logit contributions
Max-0.496
Mic-0.452
ens-0.450
Brown-0.441
Snow-0.436
Token the
Token ID262
Feature activation-6.99e+00

Pos logit contributions
 teaching+0.150
 showing+0.142
 diving+0.137
 giving+0.136
 show+0.136
Neg logit contributions
Max-0.217
ens-0.207
Mic-0.205
ety-0.195
lves-0.194
Token machine
Token ID4572
Feature activation-2.38e+00

Pos logit contributions
Max+0.164
ens+0.154
Mic+0.145
Brown+0.138
Snow+0.137
Neg logit contributions
 teaching-0.133
 showing-0.128
 example-0.127
 diving-0.127
 joining-0.124
Token wanted
Token ID2227
Feature activation+2.33e+01

Pos logit contributions
Max+0.054
ens+0.050
Mic+0.049
Brown+0.046
Snow+0.045
Neg logit contributions
 teaching-0.047
 showing-0.046
 diving-0.045
 giving-0.044
 joining-0.044
Token to
Token ID284
Feature activation+1.38e+01

Pos logit contributions
 teaching+0.265
 showing+0.246
 example+0.244
 diving+0.241
 joining+0.235
Neg logit contributions
Max-0.720
ens-0.682
Mic-0.664
Brown-0.637
Snow-0.634
Token teach
Token ID4545
Feature activation+2.30e+01

Pos logit contributions
 example+0.220
 teaching+0.215
 succeeding+0.211
 diving+0.209
 humor+0.205
Neg logit contributions
Max-0.341
ens-0.313
ball-0.294
Snow-0.291
Brown-0.289
Token her
Token ID607
Feature activation-1.16e+01

Pos logit contributions
 teaching+0.380
 showing+0.357
 joining+0.352
 giving+0.350
 example+0.343
Neg logit contributions
Max-0.602
Mic-0.568
ens-0.561
lves-0.537
ety-0.534
Token.
Token ID13
Feature activation+5.31e-01

Pos logit contributions
Max+0.294
ens+0.279
Mic+0.253
Brown+0.252
ball+0.250
Neg logit contributions
 teaching-0.175
 example-0.174
 diving-0.171
 showing-0.168
 humor-0.166
Token The
Token ID383
Feature activation-9.96e+00

Pos logit contributions
 teaching+0.008
 showing+0.007
 giving+0.007
 example+0.007
 join+0.007
Neg logit contributions
ens-0.013
lves-0.013
Max-0.013
Mic-0.013
ball-0.012
Token voice
Token ID3809
Feature activation+2.78e+00

Pos logit contributions
Max+0.221
ens+0.205
Mic+0.190
Snow+0.182
Brown+0.182
Neg logit contributions
 teaching-0.197
 example-0.190
 diving-0.189
 showing-0.189
 joining-0.187
Token replied
Token ID8712
Feature activation+6.45e+00

Pos logit contributions
 teaching+0.055
 diving+0.054
 example+0.054
 showing+0.053
 joining+0.052
Neg logit contributions
Max-0.061
ens-0.057
Mic-0.053
Brown-0.051
Snow-0.050
Token nail
Token ID17864
Feature activation-1.81e+00

Pos logit contributions
Max+0.205
ens+0.193
Mic+0.183
Brown+0.174
Snow+0.173
Neg logit contributions
 teaching-0.146
 diving-0.141
 example-0.141
 showing-0.141
 joining-0.137
Token out
Token ID503
Feature activation+7.51e+00

Pos logit contributions
Max+0.055
ens+0.052
Mic+0.051
Brown+0.049
Snow+0.048
Neg logit contributions
 teaching-0.021
 showing-0.020
 example-0.020
 diving-0.020
 joining-0.019
Token.
Token ID13
Feature activation-1.28e-01

Pos logit contributions
 teaching+0.092
 example+0.089
 humor+0.084
 diving+0.084
 showing+0.082
Neg logit contributions
Max-0.218
ens-0.208
Mic-0.189
Snow-0.188
Brown-0.187
Token Lib
Token ID7980
Feature activation-1.13e+01

Pos logit contributions
Max+0.003
ens+0.003
Mic+0.003
lves+0.003
ley+0.003
Neg logit contributions
 teaching-0.002
 giving-0.002
 showing-0.002
aps-0.002
 example-0.002
Tokenby
Token ID1525
Feature activation-1.48e+01

Pos logit contributions
Max+0.229
ens+0.204
Mic+0.204
Snow+0.191
Nut+0.188
Neg logit contributions
 teaching-0.241
 example-0.241
 showing-0.231
 mixing-0.230
 show-0.229
Token watched
Token ID7342
Feature activation+2.32e+01

Pos logit contributions
Max+0.339
ens+0.319
Mic+0.300
Brown+0.284
Snow+0.283
Neg logit contributions
 teaching-0.286
 example-0.275
 showing-0.274
 diving-0.272
 joining-0.268
Token as
Token ID355
Feature activation+1.07e+01

Pos logit contributions
 teaching+0.352
 example+0.341
 diving+0.332
 showing+0.327
 humor+0.323
Neg logit contributions
Max-0.614
ens-0.582
Mic-0.544
Snow-0.524
Brown-0.524
Token the
Token ID262
Feature activation-9.77e+00

Pos logit contributions
 teaching+0.200
 showing+0.193
 diving+0.190
 example+0.190
 giving+0.188
Neg logit contributions
Max-0.256
ens-0.240
Mic-0.228
Brown-0.219
Snow-0.217
Token clear
Token ID1598
Feature activation+1.00e+01

Pos logit contributions
Max+0.224
ens+0.212
Mic+0.200
Brown+0.190
Snow+0.189
Neg logit contributions
 teaching-0.189
 example-0.183
 showing-0.182
 diving-0.180
 joining-0.175
Token water
Token ID1660
Feature activation-1.13e+00

Pos logit contributions
 teaching+0.165
 showing+0.161
 example+0.159
 diving+0.158
 giving+0.157
Neg logit contributions
Max-0.258
Mic-0.239
ens-0.235
Brown-0.223
Snow-0.220
Token ran
Token ID4966
Feature activation+1.42e+01

Pos logit contributions
Max+0.027
ens+0.026
Mic+0.023
Snow+0.023
Brown+0.022
Neg logit contributions
 teaching-0.021
 example-0.020
 showing-0.019
 diving-0.019
 humor-0.019
Token swing
Token ID9628
Feature activation+5.30e+00

Pos logit contributions
 teaching+0.145
 example+0.139
 showing+0.135
 diving+0.135
 joining+0.132
Neg logit contributions
Max-0.310
ens-0.294
Mic-0.278
Snow-0.270
Brown-0.270
Token.
Token ID13
Feature activation-6.65e-01

Pos logit contributions
 teaching+0.061
 diving+0.059
 showing+0.059
 example+0.057
 joining+0.056
Neg logit contributions
Max-0.158
Mic-0.152
ens-0.151
issy-0.145
lves-0.143
Token Thank
Token ID6952
Feature activation+1.56e+01

Pos logit contributions
Max+0.016
ens+0.015
Mic+0.014
ety+0.014
issy+0.013
Neg logit contributions
 teaching-0.012
 example-0.012
 showing-0.012
 diving-0.011
 joining-0.011
Token you
Token ID345
Feature activation-9.23e+00

Pos logit contributions
 teaching+0.335
 example+0.324
 showing+0.322
 diving+0.321
 joining+0.316
Neg logit contributions
Max-0.324
ens-0.301
Mic-0.280
Snow-0.264
Brown-0.264
Token for
Token ID329
Feature activation+1.51e+01

Pos logit contributions
Max+0.305
ens+0.286
Mic+0.281
Brown+0.273
lves+0.269
Neg logit contributions
 teaching-0.091
 example-0.087
 diving-0.083
 showing-0.082
 joining-0.079
Token helping
Token ID5742
Feature activation+2.30e+01

Pos logit contributions
 contain+0.201
aps+0.195
asured+0.189
oops+0.187
iddle+0.185
Neg logit contributions
Max-0.405
Spot-0.354
Snow-0.352
 Midnight-0.349
 Max-0.349
Token me
Token ID502
Feature activation-8.10e+00

Pos logit contributions
 teaching+0.384
 example+0.371
 show+0.369
 showing+0.369
 diving+0.368
Neg logit contributions
Max-0.577
Mic-0.539
ens-0.537
lves-0.524
Brown-0.511
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
Max+0.162
ens+0.160
Snow+0.144
ety+0.143
ball+0.142
Neg logit contributions
 humor-0.154
 accomplishment-0.152
 teaching-0.152
 example-0.151
 succeeding-0.149
Token You
Token ID921
Feature activation-5.67e+00

Pos logit contributions
Max+0.036
ens+0.035
Mic+0.033
ety+0.031
lves+0.031
Neg logit contributions
 teaching-0.026
 example-0.025
 showing-0.025
 diving-0.024
 joining-0.024
Token are
Token ID389
Feature activation+8.20e+00

Pos logit contributions
Max+0.120
ens+0.112
Mic+0.103
Snow+0.098
Brown+0.097
Neg logit contributions
 teaching-0.119
 example-0.117
 diving-0.115
 showing-0.115
 joining-0.113
Token a
Token ID257
Feature activation-4.85e+00

Pos logit contributions
 teaching+0.123
 example+0.116
 showing+0.116
 diving+0.115
 joining+0.113
Neg logit contributions
Max-0.225
ens-0.214
Mic-0.203
Brown-0.195
Snow-0.194
Token life
Token ID1204
Feature activation+2.21e+00

Pos logit contributions
 teaching+0.239
 example+0.229
 diving+0.229
 showing+0.228
 joining+0.223
Neg logit contributions
Max-0.376
ens-0.355
Mic-0.348
ety-0.325
Brown-0.325
Token jackets
Token ID35129
Feature activation-1.70e+00

Pos logit contributions
 teaching+0.032
 example+0.030
 diving+0.030
 showing+0.030
 joining+0.029
Neg logit contributions
Max-0.062
ens-0.058
Mic-0.056
Brown-0.053
ety-0.053
Token.
Token ID13
Feature activation-8.46e-01

Pos logit contributions
Max+0.051
Mic+0.049
ens+0.048
Brown+0.046
Bet+0.045
Neg logit contributions
 teaching-0.021
 showing-0.019
 example-0.019
 diving-0.019
 giving-0.018
Token They
Token ID1119
Feature activation-9.56e+00

Pos logit contributions
Max+0.015
ens+0.014
Mic+0.012
Brown+0.012
Snow+0.012
Neg logit contributions
 teaching-0.021
 example-0.020
 showing-0.020
 diving-0.020
 joining-0.020
Token also
Token ID635
Feature activation+1.94e+00

Pos logit contributions
Max+0.231
ens+0.216
Mic+0.207
Brown+0.197
Snow+0.195
Neg logit contributions
 teaching-0.173
 showing-0.165
 diving-0.164
 example-0.162
 joining-0.160
Token gave
Token ID2921
Feature activation+2.31e+01

Pos logit contributions
 example+0.041
 succeeding+0.041
 liking+0.040
 teaching+0.040
 diving+0.040
Neg logit contributions
Max-0.039
ens-0.037
Snow-0.033
ball-0.032
ety-0.032
Token them
Token ID606
Feature activation-1.19e+01

Pos logit contributions
 example+0.282
 teaching+0.276
 diving+0.268
 showing+0.262
 joining+0.261
Neg logit contributions
Max-0.674
Brown-0.649
Mic-0.648
Snow-0.614
Nut-0.611
Token a
Token ID257
Feature activation-3.28e+00

Pos logit contributions
Max+0.350
Mic+0.341
ens+0.322
Brown+0.318
issy+0.313
Neg logit contributions
 teaching-0.148
 example-0.145
 diving-0.138
 join-0.136
 show-0.136
Token flashlight
Token ID38371
Feature activation-3.30e+00

Pos logit contributions
Max+0.068
ens+0.060
Snow+0.059
ball+0.057
Mic+0.056
Neg logit contributions
 succeeding-0.070
 teaching-0.068
 example-0.067
 showing-0.066
 joining-0.065
Token and
Token ID290
Feature activation+3.63e+00

Pos logit contributions
Max+0.082
ens+0.078
ball+0.071
Snow+0.071
Brown+0.070
Neg logit contributions
 teaching-0.053
 succeeding-0.053
 example-0.052
 humor-0.052
 diving-0.051
Token a
Token ID257
Feature activation-3.15e+00

Pos logit contributions
 teaching+0.061
 example+0.058
 showing+0.058
 diving+0.058
 joining+0.056
Neg logit contributions
Max-0.092
ens-0.087
Mic-0.083
Brown-0.079
Snow-0.078
Token waking
Token ID23137
Feature activation+3.36e+01

Pos logit contributions
 teaching+0.351
 example+0.339
 showing+0.320
 diving+0.319
 humor+0.317
Neg logit contributions
Max-0.744
ens-0.706
Mic-0.662
Snow-0.647
Brown-0.643
Token up
Token ID510
Feature activation+1.13e+01

Pos logit contributions
 teaching+0.441
 showing+0.439
 diving+0.428
 joining+0.421
 giving+0.421
Neg logit contributions
Max-0.982
Mic-0.928
ens-0.913
Brown-0.861
Spot-0.857
Token every
Token ID790
Feature activation+1.16e+01

Pos logit contributions
 teaching+0.153
 example+0.145
 showing+0.143
 diving+0.142
 joining+0.139
Neg logit contributions
Max-0.322
ens-0.307
Mic-0.290
Brown-0.279
Snow-0.279
Token morning
Token ID3329
Feature activation+4.41e+00

Pos logit contributions
 teaching+0.190
 example+0.177
 showing+0.176
 joining+0.173
 succeeding+0.173
Neg logit contributions
Max-0.285
ens-0.280
Snow-0.244
Spot-0.243
 Margaret-0.243
Token to
Token ID284
Feature activation+1.34e+01

Pos logit contributions
 teaching+0.067
 example+0.064
 showing+0.063
 diving+0.063
 joining+0.062
Neg logit contributions
Max-0.118
ens-0.112
Mic-0.105
Snow-0.101
Brown-0.101
Token keep
Token ID1394
Feature activation+2.30e+01

Pos logit contributions
 teaching+0.179
 example+0.173
 diving+0.165
 showing+0.165
 joining+0.160
Neg logit contributions
Max-0.378
ens-0.360
Mic-0.337
Snow-0.331
Brown-0.329
Token the
Token ID262
Feature activation-8.23e+00

Pos logit contributions
 teaching+0.339
 example+0.330
 diving+0.317
 showing+0.316
 humor+0.312
Neg logit contributions
Max-0.617
ens-0.584
Mic-0.543
Snow-0.530
Brown-0.527
Token house
Token ID2156
Feature activation+2.73e+00

Pos logit contributions
Max+0.211
ens+0.198
Mic+0.189
Brown+0.180
Snow+0.179
Neg logit contributions
 teaching-0.138
 example-0.132
 showing-0.131
 diving-0.130
 joining-0.127
Token clean
Token ID3424
Feature activation+1.67e+01

Pos logit contributions
 teaching+0.038
 example+0.036
 showing+0.036
 diving+0.036
 joining+0.035
Neg logit contributions
Max-0.077
ens-0.073
Mic-0.070
Brown-0.066
Snow-0.066
Token and
Token ID290
Feature activation+4.20e+00

Pos logit contributions
 teaching+0.230
 showing+0.223
 example+0.219
 diving+0.216
 giving+0.214
Neg logit contributions
Max-0.476
Mic-0.446
ens-0.440
Brown-0.414
Spot-0.413
Token everyone
Token ID2506
Feature activation-1.03e+01

Pos logit contributions
 teaching+0.069
 example+0.069
 humor+0.067
 diving+0.066
 joining+0.065
Neg logit contributions
Max-0.104
ens-0.097
Snow-0.091
Brown-0.089
 Margaret-0.087
Token man
Token ID582
Feature activation-1.83e+01

Pos logit contributions
Max+0.211
ens+0.202
Mic+0.179
ety+0.168
issy+0.167
Neg logit contributions
 teaching-0.300
 example-0.289
 showing-0.289
 giving-0.284
 diving-0.280
Token asks
Token ID7893
Feature activation+1.14e+01

Pos logit contributions
Max+0.444
ens+0.431
Mic+0.381
 Blink+0.375
ety+0.374
Neg logit contributions
 example-0.319
 accomplishment-0.309
 teaching-0.306
 liking-0.292
 succeeding-0.291
Token.
Token ID13
Feature activation-2.53e+00

Pos logit contributions
 teaching+0.146
 diving+0.140
 showing+0.138
 joining+0.135
 example+0.133
Neg logit contributions
Max-0.336
Mic-0.316
ens-0.310
Brown-0.310
Snow-0.304
Token "
Token ID366
Feature activation-1.25e+00

Pos logit contributions
Max+0.056
ens+0.051
Mic+0.047
Spot+0.045
Snow+0.045
Neg logit contributions
 teaching-0.051
 example-0.050
 diving-0.049
 showing-0.049
 joining-0.049
TokenYou
Token ID1639
Feature activation-6.14e+00

Pos logit contributions
Max+0.022
ens+0.020
Mic+0.018
Brown+0.017
issy+0.017
Neg logit contributions
 teaching-0.031
 showing-0.030
 example-0.030
 diving-0.030
 joining-0.030
Token should
Token ID815
Feature activation+1.57e+01

Pos logit contributions
Max+0.157
ens+0.142
Mic+0.142
Brown+0.133
Snow+0.133
Neg logit contributions
 teaching-0.103
 show-0.099
 showing-0.097
 giving-0.095
 diving-0.095
Token not
Token ID407
Feature activation+6.77e+00

Pos logit contributions
 teaching+0.158
 showing+0.149
 example+0.144
 show+0.144
 giving+0.144
Neg logit contributions
Max-0.507
ens-0.475
Mic-0.466
issy-0.446
Brown-0.445
Token go
Token ID467
Feature activation+1.60e+01

Pos logit contributions
 teaching+0.106
 example+0.103
 accomplishment+0.101
 showing+0.100
 diving+0.099
Neg logit contributions
Max-0.181
ens-0.176
ety-0.161
Mic-0.159
Spot-0.158
Token near
Token ID1474
Feature activation+1.40e+01

Pos logit contributions
 accomplishment+0.223
 teaching+0.220
 succeeding+0.214
 example+0.211
 humor+0.207
Neg logit contributions
Max-0.446
ens-0.435
ety-0.409
Spot-0.403
Snow-0.397
Token the
Token ID262
Feature activation-1.15e+01

Pos logit contributions
 teaching+0.202
 showing+0.191
 diving+0.190
 giving+0.189
 example+0.189
Neg logit contributions
Max-0.401
ens-0.380
Mic-0.364
issy-0.356
ety-0.347
Token rh
Token ID9529
Feature activation-3.91e+00

Pos logit contributions
Max+0.174
ens+0.158
Mic+0.142
Brown+0.130
Snow+0.130
Neg logit contributions
 teaching-0.313
 example-0.305
 showing-0.303
 diving-0.302
 joining-0.299
Token there
Token ID612
Feature activation-1.68e+00

Pos logit contributions
 diving+0.026
 showing+0.024
 about+0.023
 mixing+0.023
 teaching+0.023
Neg logit contributions
ens-0.087
Mic-0.084
Max-0.083
issy-0.082
Nut-0.080
Token was
Token ID373
Feature activation+6.59e+00

Pos logit contributions
Max+0.043
Mic+0.042
ens+0.041
Brown+0.039
Snow+0.038
Neg logit contributions
 teaching-0.027
 showing-0.026
 diving-0.026
 giving-0.025
 joining-0.025
Token a
Token ID257
Feature activation-5.90e+00

Pos logit contributions
 teaching+0.059
 diving+0.056
 showing+0.053
 mixing+0.052
 example+0.052
Neg logit contributions
ens-0.213
ley-0.208
Max-0.205
lves-0.200
issy-0.199
Token man
Token ID582
Feature activation-1.88e+01

Pos logit contributions
ens+0.152
Max+0.150
issy+0.139
Mic+0.136
ley+0.134
Neg logit contributions
 teaching-0.092
 show-0.091
 showing-0.090
 diving-0.090
 liking-0.086
Token who
Token ID508
Feature activation-5.43e+00

Pos logit contributions
Max+0.327
ens+0.293
ety+0.271
Brown+0.271
Mic+0.267
Neg logit contributions
 accomplishment-0.462
 example-0.443
 humor-0.438
 succeeding-0.437
 diving-0.425
Token wanted
Token ID2227
Feature activation+1.81e+01

Pos logit contributions
Max+0.140
ens+0.136
Mic+0.132
Bet+0.124
issy+0.123
Neg logit contributions
 teaching-0.087
 showing-0.084
 giving-0.080
 serving-0.079
 show-0.079
Token to
Token ID284
Feature activation+8.57e+00

Pos logit contributions
 accomplishment+0.220
 succeeding+0.213
 humor+0.213
 teaching+0.210
 example+0.209
Neg logit contributions
Max-0.522
ens-0.490
Snow-0.458
Mic-0.457
Spot-0.454
Token build
Token ID1382
Feature activation+1.43e+01

Pos logit contributions
 example+0.133
 teaching+0.131
 succeeding+0.130
 diving+0.128
 accomplishment+0.128
Neg logit contributions
Max-0.221
ens-0.203
Mic-0.195
Snow-0.191
Brown-0.191
Token something
Token ID1223
Feature activation-3.00e+00

Pos logit contributions
 teaching+0.245
 example+0.236
 showing+0.233
 diving+0.231
 joining+0.229
Neg logit contributions
Max-0.357
ens-0.335
Mic-0.316
Snow-0.301
Brown-0.301
Token.
Token ID13
Feature activation-3.31e+00

Pos logit contributions
Max+0.064
ens+0.059
ball+0.057
 Bun+0.054
Mic+0.054
Neg logit contributions
 example-0.056
 humor-0.056
 teaching-0.053
 humour-0.050
 accomplishment-0.050
Token He
Token ID679
Feature activation-1.62e+01

Pos logit contributions
Max+0.069
ens+0.064
Mic+0.061
Brown+0.057
Snow+0.057
Neg logit contributions
 teaching-0.071
 example-0.069
 showing-0.068
 diving-0.067
 giving-0.067
Token,
Token ID11
Feature activation+1.55e+00

Pos logit contributions
 teaching+0.132
 example+0.126
 showing+0.125
 diving+0.124
 joining+0.121
Neg logit contributions
Max-0.235
ens-0.223
Mic-0.209
Brown-0.202
Snow-0.201
Token but
Token ID475
Feature activation+1.41e+00

Pos logit contributions
 showing+0.023
 teaching+0.022
 giving+0.022
 joining+0.020
 diving+0.020
Neg logit contributions
Max-0.043
ens-0.040
Mic-0.040
uffy-0.039
issy-0.039
Token her
Token ID607
Feature activation-1.61e+01

Pos logit contributions
 teaching+0.030
 showing+0.030
 giving+0.029
 example+0.028
 diving+0.028
Neg logit contributions
Max-0.029
Mic-0.028
ens-0.027
Brown-0.026
ball-0.025
Token mom
Token ID1995
Feature activation-1.68e+01

Pos logit contributions
Max+0.319
ens+0.296
Mic+0.268
Snow+0.259
Brown+0.258
Neg logit contributions
 teaching-0.361
 example-0.349
 showing-0.349
 diving-0.348
 joining-0.347
Tokenmy
Token ID1820
Feature activation-1.69e+01

Pos logit contributions
Max+0.215
ens+0.191
Mic+0.185
Snow+0.165
Spot+0.161
Neg logit contributions
 teaching-0.495
 showing-0.474
 giving-0.468
 joining-0.460
 example-0.457
Token explained
Token ID4893
Feature activation+9.03e+00

Pos logit contributions
Max+0.455
ens+0.432
Mic+0.428
Snow+0.401
Fl+0.401
Neg logit contributions
 teaching-0.266
 showing-0.246
 giving-0.239
 example-0.230
 joining-0.229
Token that
Token ID326
Feature activation-5.02e+00

Pos logit contributions
 teaching+0.120
 showing+0.112
 example+0.112
 diving+0.109
 joining+0.108
Neg logit contributions
Max-0.265
ens-0.249
Mic-0.242
Brown-0.231
Spot-0.230
Token the
Token ID262
Feature activation-1.30e+01

Pos logit contributions
Max+0.119
ens+0.111
Mic+0.108
Brown+0.101
ety+0.100
Neg logit contributions
 teaching-0.095
 showing-0.091
 diving-0.089
 giving-0.089
 joining-0.088
Token doctors
Token ID7519
Feature activation-9.59e+00

Pos logit contributions
Max+0.279
ens+0.252
Brown+0.239
Snow+0.238
Spot+0.226
Neg logit contributions
 diving-0.267
ulp-0.263
 teaching-0.261
 humor-0.261
 showing-0.261
Token were
Token ID547
Feature activation+5.18e+00

Pos logit contributions
Max+0.197
ens+0.181
Mic+0.173
Brown+0.162
Snow+0.162
Neg logit contributions
 teaching-0.208
 showing-0.200
 example-0.198
 diving-0.197
 joining-0.195
Token going
Token ID1016
Feature activation+2.15e+01

Pos logit contributions
 humor+0.067
 example+0.067
 diving+0.066
aps+0.066
ulp+0.063
Neg logit contributions
Max-0.138
ens-0.137
 Margaret-0.125
Snow-0.122
Mic-0.121
Token say
Token ID910
Feature activation+8.59e+00

Pos logit contributions
 teaching+0.173
 example+0.161
 showing+0.160
 diving+0.158
 joining+0.154
Neg logit contributions
Max-0.455
ens-0.432
Mic-0.415
Brown-0.399
Snow-0.398
Token sorry
Token ID7926
Feature activation+7.69e+00

Pos logit contributions
 teaching+0.098
 example+0.094
 diving+0.093
 showing+0.092
 joining+0.089
Neg logit contributions
Max-0.260
ens-0.248
Mic-0.233
Snow-0.227
Brown-0.227
Token to
Token ID284
Feature activation+8.54e+00

Pos logit contributions
 teaching+0.107
 example+0.102
 showing+0.101
 diving+0.100
 joining+0.098
Neg logit contributions
Max-0.218
ens-0.207
Mic-0.196
Brown-0.189
Snow-0.188
Token Lily
Token ID20037
Feature activation-2.47e+01

Pos logit contributions
 teaching+0.161
 showing+0.159
 example+0.157
 show+0.156
 humor+0.155
Neg logit contributions
Max-0.200
ens-0.180
Mic-0.176
Brown-0.170
Snow-0.169
Token and
Token ID290
Feature activation+4.25e-01

Pos logit contributions
Max+0.710
Mic+0.665
ens+0.634
Brown+0.629
Snow+0.627
Neg logit contributions
 teaching-0.356
 showing-0.326
 giving-0.326
 example-0.321
 joining-0.305
Token give
Token ID1577
Feature activation+2.16e+01

Pos logit contributions
 teaching+0.007
 example+0.007
 diving+0.007
 showing+0.007
 joining+0.007
Neg logit contributions
Max-0.011
ens-0.010
Mic-0.009
ety-0.009
Spot-0.009
Token her
Token ID607
Feature activation-1.54e+01

Pos logit contributions
 show+0.283
 teaching+0.278
 showing+0.267
 humor+0.265
 example+0.261
Neg logit contributions
Max-0.581
Brown-0.576
Mic-0.572
K-0.555
Snow-0.548
Token back
Token ID736
Feature activation+8.06e-01

Pos logit contributions
Max+0.458
Mic+0.434
ens+0.421
Bet+0.413
Brown+0.407
Neg logit contributions
 teaching-0.194
 example-0.182
 show-0.177
 humor-0.175
 showing-0.175
Token the
Token ID262
Feature activation-1.24e+01

Pos logit contributions
 teaching+0.010
 example+0.010
 showing+0.009
 show+0.009
 humor+0.009
Neg logit contributions
Max-0.024
Mic-0.023
ens-0.022
Bet-0.021
Brown-0.021
Token bow
Token ID9563
Feature activation+8.72e-01

Pos logit contributions
Max+0.268
ens+0.252
Mic+0.235
Brown+0.223
Snow+0.222
Neg logit contributions
 teaching-0.255
 showing-0.245
 example-0.244
 diving-0.241
 giving-0.238
Token.
Token ID13
Feature activation-3.20e+00

Pos logit contributions
 teaching+0.009
 showing+0.009
 show+0.008
 giving+0.008
 example+0.008
Neg logit contributions
Max-0.027
Mic-0.026
ens-0.025
Bet-0.025
lves-0.025
Token she
Token ID673
Feature activation-1.82e+01

Pos logit contributions
 teaching+0.134
 diving+0.131
 example+0.130
 succeeding+0.129
 showing+0.129
Neg logit contributions
Max-0.212
ens-0.200
Mic-0.199
issy-0.186
Brown-0.186
Token could
Token ID714
Feature activation+1.22e+01

Pos logit contributions
Max+0.403
ens+0.372
Mic+0.362
Brown+0.338
Snow+0.334
Neg logit contributions
 teaching-0.365
 showing-0.353
 diving-0.351
 example-0.349
 joining-0.344
Token keep
Token ID1394
Feature activation+1.87e+01

Pos logit contributions
 teaching+0.229
 example+0.228
 showing+0.213
 humor+0.212
 serving+0.211
Neg logit contributions
Max-0.279
ens-0.265
ety-0.236
K-0.234
Spot-0.233
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
 teaching+0.236
 example+0.226
 showing+0.220
 diving+0.219
 joining+0.214
Neg logit contributions
Max-0.549
ens-0.522
Mic-0.495
Brown-0.476
Snow-0.476
Token scissors
Token ID41786
Feature activation-2.73e+00

Pos logit contributions
Max+0.295
ens+0.277
Mic+0.267
Brown+0.253
Snow+0.251
Neg logit contributions
 teaching-0.204
 example-0.197
 showing-0.195
 diving-0.193
 accomplishment-0.186
Token safe
Token ID3338
Feature activation+6.51e+00

Pos logit contributions
Max+0.079
ens+0.074
Mic+0.071
Brown+0.068
Snow+0.068
Neg logit contributions
 teaching-0.036
 showing-0.034
 example-0.034
 diving-0.034
 joining-0.033
Token.
Token ID13
Feature activation-2.85e+00

Pos logit contributions
 teaching+0.092
 showing+0.089
 example+0.088
 diving+0.088
 joining+0.085
Neg logit contributions
Max-0.183
ens-0.172
Mic-0.167
Brown-0.159
Snow-0.158
Token<|endoftext|>
Token ID50256
Feature activation+1.03e+00

Pos logit contributions
Max+0.074
Mic+0.070
ens+0.070
ley+0.069
Brown+0.067
Neg logit contributions
 teaching-0.043
aps-0.040
 example-0.040
 giving-0.039
 about-0.039
TokenOnce
Token ID7454
Feature activation+7.06e+00

Pos logit contributions
 teaching+0.018
 diving+0.018
aps+0.017
 joining+0.017
 showing+0.017
Neg logit contributions
ens-0.023
Max-0.023
Mic-0.023
lves-0.023
Brown-0.022
Token upon
Token ID2402
Feature activation-1.44e+00

Pos logit contributions
 teaching+0.107
 diving+0.106
 about+0.104
 showing+0.100
 show+0.100
Neg logit contributions
Max-0.186
Mic-0.178
Snow-0.172
Brown-0.170
Nut-0.164
Token a
Token ID257
Feature activation-4.85e+00

Pos logit contributions
Max+0.041
ens+0.041
Mic+0.039
ley+0.039
lves+0.038
Neg logit contributions
 teaching-0.019
 showing-0.017
 diving-0.017
 liking-0.017
 joining-0.017
Token were
Token ID547
Feature activation+7.88e+00

Pos logit contributions
Max+0.258
ens+0.242
Mic+0.235
Brown+0.231
uffy+0.227
Neg logit contributions
 teaching-0.172
 showing-0.167
 show-0.163
 diving-0.162
 joining-0.161
Token happy
Token ID3772
Feature activation+1.09e+01

Pos logit contributions
 teaching+0.106
 example+0.101
 showing+0.099
 diving+0.099
 joining+0.096
Neg logit contributions
Max-0.226
ens-0.214
Mic-0.203
Brown-0.196
Snow-0.196
Token and
Token ID290
Feature activation+2.64e+00

Pos logit contributions
 example+0.172
 teaching+0.171
 humor+0.171
 showing+0.161
 succeeding+0.161
Neg logit contributions
Max-0.273
ens-0.254
Spot-0.231
ball-0.229
ety-0.229
Token lived
Token ID5615
Feature activation+7.05e+00

Pos logit contributions
 example+0.049
 teaching+0.049
 diving+0.047
 humor+0.046
 joining+0.046
Neg logit contributions
Max-0.061
ens-0.057
Mic-0.052
Spot-0.051
Snow-0.051
Token together
Token ID1978
Feature activation+6.74e+00

Pos logit contributions
 teaching+0.115
 example+0.113
 humor+0.113
 giving+0.109
 designing+0.109
Neg logit contributions
Max-0.177
ens-0.156
Spot-0.154
Mic-0.151
ball-0.148
Token forever
Token ID8097
Feature activation+4.47e+00

Pos logit contributions
 humor+0.131
 apology+0.126
 example+0.124
 speech+0.116
 Listen+0.116
Neg logit contributions
Max-0.136
ens-0.127
ball-0.124
Spot-0.119
 Bun-0.110
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 humor+0.086
 teaching+0.084
 example+0.083
 showing+0.079
 joining+0.079
Neg logit contributions
Max-0.095
ens-0.091
Spot-0.081
ball-0.081
ety-0.079
Token<|endoftext|>
Token ID50256
Feature activation+2.97e+00

Pos logit contributions
Mic+0.026
Max+0.026
ens+0.025
Brown+0.024
ley+0.024
Neg logit contributions
 teaching-0.016
 showing-0.014
 about-0.014
 giving-0.014
 show-0.014
TokenOnce
Token ID7454
Feature activation+8.70e+00

Pos logit contributions
 teaching+0.052
 diving+0.049
 showing+0.049
 joining+0.048
 giving+0.047
Neg logit contributions
ens-0.070
Max-0.069
Mic-0.066
lves-0.064
Brown-0.063
Token upon
Token ID2402
Feature activation+7.94e-02

Pos logit contributions
 teaching+0.145
 diving+0.143
 showing+0.136
 giving+0.132
 joining+0.132
Neg logit contributions
Max-0.218
Mic-0.204
ens-0.198
Snow-0.197
Brown-0.197
Token a
Token ID257
Feature activation-3.23e+00

Pos logit contributions
 teaching+0.001
 showing+0.001
 diving+0.001
 joining+0.001
 liking+0.001
Neg logit contributions
ens-0.002
Max-0.002
Mic-0.002
ley-0.002
issy-0.002
Token\n
Token ID198
Feature activation-3.33e+00

Pos logit contributions
Max+0.077
ens+0.076
Mic+0.072
ety+0.069
issy+0.069
Neg logit contributions
 teaching-0.060
 diving-0.056
 showing-0.056
 example-0.055
 joining-0.055
TokenThey
Token ID2990
Feature activation-9.95e+00

Pos logit contributions
ens+0.073
ley+0.067
issy+0.067
lves+0.065
 Margaret+0.064
Neg logit contributions
 teaching-0.067
 showing-0.063
aps-0.062
 diving-0.062
 example-0.062
Token all
Token ID477
Feature activation+1.96e+00

Pos logit contributions
Max+0.244
ens+0.229
Mic+0.219
Brown+0.211
Snow+0.209
Neg logit contributions
 teaching-0.175
 showing-0.167
 example-0.165
 diving-0.165
 joining-0.162
Token played
Token ID2826
Feature activation+1.72e+01

Pos logit contributions
 teaching+0.033
 showing+0.031
 joining+0.031
 diving+0.031
 show+0.031
Neg logit contributions
Max-0.049
Mic-0.045
ens-0.044
Brown-0.043
Snow-0.042
Token on
Token ID319
Feature activation+1.27e+01

Pos logit contributions
 humor+0.275
 giving+0.270
 example+0.262
 teaching+0.262
 planning+0.260
Neg logit contributions
Max-0.445
ens-0.406
Spot-0.386
ball-0.368
Fl-0.347
Token the
Token ID262
Feature activation-1.15e+01

Pos logit contributions
 teaching+0.182
 example+0.174
 diving+0.174
 showing+0.174
 joining+0.165
Neg logit contributions
Max-0.354
ens-0.340
Mic-0.330
Brown-0.319
issy-0.317
Token swing
Token ID9628
Feature activation+3.28e+00

Pos logit contributions
Max+0.257
ens+0.240
Mic+0.222
Spot+0.212
Snow+0.211
Neg logit contributions
 teaching-0.230
 example-0.223
 showing-0.219
 diving-0.218
 joining-0.217
Token together
Token ID1978
Feature activation+5.27e+00

Pos logit contributions
 teaching+0.049
 example+0.047
 showing+0.047
 diving+0.046
 joining+0.045
Neg logit contributions
Max-0.089
ens-0.084
Mic-0.081
Brown-0.077
Snow-0.077
Token and
Token ID290
Feature activation+1.56e+00

Pos logit contributions
 humor+0.116
 planning+0.110
 teaching+0.109
 example+0.109
 succeeding+0.108
Neg logit contributions
Max-0.101
ens-0.092
ball-0.085
Spot-0.079
Fl-0.075
Token had
Token ID550
Feature activation+7.25e+00

Pos logit contributions
 teaching+0.032
 example+0.031
 showing+0.031
 diving+0.031
 joining+0.030
Neg logit contributions
Max-0.034
ens-0.032
Mic-0.030
Brown-0.028
Snow-0.028
Token a
Token ID257
Feature activation-4.92e+00

Pos logit contributions
 teaching+0.090
 example+0.086
 diving+0.086
 showing+0.084
 joining+0.082
Neg logit contributions
Max-0.214
ens-0.208
Mic-0.199
Brown-0.194
issy-0.194
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
 accomplishment+0.158
 succeeding+0.157
 humor+0.155
 teaching+0.149
aps+0.149
Neg logit contributions
Max-0.176
ens-0.170
 Tiny-0.160
 Margaret-0.159
 Bun-0.156
Token swamp
Token ID26837
Feature activation-6.60e+00

Pos logit contributions
Max+0.244
ens+0.238
Spot+0.209
Fl+0.209
Bet+0.207
Neg logit contributions
 teaching-0.235
 succeeding-0.220
 joining-0.219
 showing-0.218
 giving-0.218
Token?".
Token ID43634
Feature activation+8.30e-02

Pos logit contributions
ens+0.151
Max+0.150
ety+0.133
issy+0.131
ball+0.131
Neg logit contributions
 humor-0.112
 accomplishment-0.107
 teaching-0.106
 example-0.105
 succeeding-0.104
Token
Token ID220
Feature activation-1.41e+00

Pos logit contributions
 teaching+0.002
 showing+0.002
 example+0.001
 giving+0.001
 joining+0.001
Neg logit contributions
Max-0.002
ens-0.002
Mic-0.002
ety-0.002
Snow-0.002
Token\n
Token ID198
Feature activation-2.96e+00

Pos logit contributions
Max+0.036
ens+0.035
Mic+0.034
Brown+0.033
ety+0.033
Neg logit contributions
 teaching-0.021
aps-0.021
 diving-0.020
 showing-0.020
 joining-0.020
Token\n
Token ID198
Feature activation-3.00e+00

Pos logit contributions
ens+0.066
Max+0.065
Mic+0.058
ety+0.058
lves+0.057
Neg logit contributions
 teaching-0.056
 showing-0.053
 example-0.053
 diving-0.053
 joining-0.052
TokenThe
Token ID464
Feature activation-1.23e+01

Pos logit contributions
ens+0.061
Max+0.059
ety+0.052
uffy+0.052
Mic+0.052
Neg logit contributions
 teaching-0.065
 example-0.063
 showing-0.062
 joining-0.061
 diving-0.060
Token duck
Token ID22045
Feature activation-1.56e+01

Pos logit contributions
Max+0.194
ens+0.176
Spot+0.168
ball+0.158
Fl+0.158
Neg logit contributions
 teaching-0.307
 succeeding-0.305
 designing-0.300
 accomplishment-0.297
 creating-0.292
Tokenlings
Token ID17783
Feature activation-1.50e+01

Pos logit contributions
Max+0.383
ens+0.371
Mic+0.348
Snow+0.330
Spot+0.327
Neg logit contributions
 teaching-0.279
 showing-0.273
 giving-0.260
 diving-0.260
 joining-0.260
Token replied
Token ID8712
Feature activation+4.88e+00

Pos logit contributions
Max+0.341
ens+0.319
Mic+0.296
Brown+0.282
Snow+0.281
Neg logit contributions
 teaching-0.287
 example-0.283
 diving-0.273
 showing-0.272
 humor-0.270
Token,
Token ID11
Feature activation+2.81e+00

Pos logit contributions
 humor+0.086
 teaching+0.084
 example+0.082
 diving+0.082
 liking+0.080
Neg logit contributions
Max-0.113
ens-0.106
Mic-0.096
ball-0.096
 Bun-0.095
Token.
Token ID13
Feature activation-4.67e+00

Pos logit contributions
 humor+0.057
 example+0.057
 diving+0.057
 teaching+0.056
 guessing+0.054
Neg logit contributions
Max-0.077
ens-0.072
 Margaret-0.065
Mic-0.064
Snow-0.063
Token\n
Token ID198
Feature activation-5.14e+00

Pos logit contributions
Max+0.113
ens+0.113
Mic+0.106
lves+0.102
Spot+0.101
Neg logit contributions
 teaching-0.083
 showing-0.078
 giving-0.077
 example-0.076
 joining-0.075
Token\n
Token ID198
Feature activation-5.24e+00

Pos logit contributions
ens+0.132
lves+0.121
Max+0.121
ety+0.121
ley+0.118
Neg logit contributions
 teaching-0.085
 showing-0.077
 joining-0.076
aps-0.076
 example-0.075
TokenAnd
Token ID1870
Feature activation-5.23e-01

Pos logit contributions
ens+0.119
ety+0.103
lves+0.099
Max+0.099
ley+0.098
Neg logit contributions
 teaching-0.112
 example-0.105
 showing-0.103
 joining-0.103
 guessing-0.099
Token they
Token ID484
Feature activation-1.65e+01

Pos logit contributions
Max+0.014
ens+0.013
Mic+0.012
Brown+0.012
Snow+0.012
Neg logit contributions
 teaching-0.008
 example-0.008
 diving-0.008
 showing-0.007
 joining-0.007
Token both
Token ID1111
Feature activation-6.27e+00

Pos logit contributions
Max+0.446
ens+0.419
Mic+0.411
Brown+0.394
Spot+0.390
Neg logit contributions
 teaching-0.245
 show-0.240
 showing-0.237
 example-0.230
 diving-0.230
Token went
Token ID1816
Feature activation+1.25e+01

Pos logit contributions
Max+0.161
Mic+0.149
ens+0.148
Brown+0.140
Spot+0.139
Neg logit contributions
 teaching-0.103
 showing-0.099
 diving-0.099
 example-0.099
 show-0.099
Token on
Token ID319
Feature activation+1.05e+01

Pos logit contributions
 teaching+0.157
 example+0.148
 showing+0.145
 humor+0.144
 succeeding+0.143
Neg logit contributions
Max-0.367
ens-0.349
Mic-0.324
Snow-0.319
Spot-0.318
Token their
Token ID511
Feature activation-1.38e+01

Pos logit contributions
 teaching+0.179
 example+0.172
 showing+0.171
 diving+0.170
 joining+0.167
Neg logit contributions
Max-0.265
ens-0.250
Mic-0.236
Brown-0.226
Snow-0.225
Token way
Token ID835
Feature activation+5.58e+00

Pos logit contributions
Max+0.350
ens+0.329
Mic+0.310
Brown+0.298
Snow+0.296
Neg logit contributions
 teaching-0.233
 example-0.222
 showing-0.219
 diving-0.218
 joining-0.217
Token,
Token ID11
Feature activation+8.24e-01

Pos logit contributions
 teaching+0.069
 showing+0.065
 diving+0.065
 example+0.064
 joining+0.062
Neg logit contributions
Max-0.168
ens-0.160
Mic-0.155
Brown-0.148
Snow-0.147
Token It
Token ID632
Feature activation-1.08e+01

Pos logit contributions
Max+0.056
ens+0.053
Mic+0.052
ley+0.052
lves+0.051
Neg logit contributions
 teaching-0.065
 showing-0.064
 giving-0.063
 show-0.062
 mixing-0.061
Token was
Token ID373
Feature activation+7.18e+00

Pos logit contributions
Max+0.235
ens+0.222
Mic+0.210
Brown+0.196
Snow+0.196
Neg logit contributions
 teaching-0.217
 showing-0.210
 diving-0.209
 example-0.207
 joining-0.203
Token deep
Token ID2769
Feature activation+1.80e+00

Pos logit contributions
 humor+0.139
 example+0.134
 teaching+0.131
aps+0.129
 joining+0.128
Neg logit contributions
Max-0.164
ens-0.148
Snow-0.138
Mic-0.137
Spot-0.136
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 teaching+0.029
 example+0.028
 showing+0.028
 diving+0.028
 joining+0.027
Neg logit contributions
Max-0.047
ens-0.044
Mic-0.042
Brown-0.040
Snow-0.040
Token dark
Token ID3223
Feature activation+2.12e+00

Pos logit contributions
 teaching+0.021
 showing+0.020
 diving+0.020
 giving+0.020
 succeeding+0.019
Neg logit contributions
Max-0.025
ens-0.023
Mic-0.023
Brown-0.021
issy-0.021
Token,
Token ID11
Feature activation+2.87e+00

Pos logit contributions
 teaching+0.034
 example+0.032
 showing+0.032
 diving+0.032
 joining+0.031
Neg logit contributions
Max-0.056
ens-0.053
Mic-0.049
Snow-0.048
Brown-0.048
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
 teaching+0.057
 example+0.057
 humor+0.055
 joining+0.053
 diving+0.053
Neg logit contributions
Max-0.063
ens-0.059
Mic-0.053
Snow-0.052
Spot-0.052
Token Lily
Token ID20037
Feature activation-2.36e+01

Pos logit contributions
 showing+0.027
 teaching+0.026
 giving+0.025
 show+0.025
 diving+0.025
Neg logit contributions
Mic-0.027
Max-0.027
Brown-0.025
ens-0.025
issy-0.024
Token was
Token ID373
Feature activation+7.50e+00

Pos logit contributions
Max+0.594
ens+0.575
Mic+0.559
Brown+0.522
issy+0.522
Neg logit contributions
 teaching-0.386
 showing-0.383
 diving-0.358
 giving-0.355
 show-0.349
Token scared
Token ID12008
Feature activation+1.37e+01

Pos logit contributions
 humor+0.182
 example+0.171
 accomplishment+0.170
 apology+0.166
ulp+0.161
Neg logit contributions
Max-0.127
ens-0.114
 Bun-0.110
lves-0.106
ety-0.106
Token.
Token ID13
Feature activation-2.46e+00

Pos logit contributions
 teaching+0.184
 diving+0.175
 showing+0.173
 example+0.172
 giving+0.170
Neg logit contributions
Max-0.397
ens-0.370
Mic-0.369
Brown-0.349
issy-0.344
Token much
Token ID881
Feature activation+5.43e+00

Pos logit contributions
 teaching+0.128
 example+0.120
 showing+0.119
 diving+0.116
 joining+0.116
Neg logit contributions
Max-0.335
ens-0.320
Mic-0.302
Snow-0.294
Spot-0.293
Token he
Token ID339
Feature activation-1.72e+01

Pos logit contributions
 teaching+0.098
 example+0.094
 showing+0.093
 diving+0.092
 joining+0.091
Neg logit contributions
Max-0.131
ens-0.124
Mic-0.115
Brown-0.110
Snow-0.110
Token weighed
Token ID20261
Feature activation+1.54e+01

Pos logit contributions
Max+0.420
ens+0.393
Mic+0.379
Brown+0.363
Snow+0.360
Neg logit contributions
 teaching-0.300
 showing-0.289
 example-0.287
 diving-0.281
 giving-0.277
Token.
Token ID13
Feature activation-2.35e+00

Pos logit contributions
 teaching+0.237
 example+0.226
 showing+0.224
 diving+0.223
 joining+0.219
Neg logit contributions
Max-0.412
ens-0.390
Mic-0.369
Brown-0.354
Snow-0.354
Token Tim
Token ID5045
Feature activation-2.24e+01

Pos logit contributions
Max+0.041
ens+0.037
Mic+0.035
Snow+0.033
Brown+0.032
Neg logit contributions
 teaching-0.058
 showing-0.057
 example-0.056
 diving-0.055
 giving-0.055
Tokenmy
Token ID1820
Feature activation-1.57e+01

Pos logit contributions
Max+0.312
ens+0.279
Snow+0.268
Mic+0.265
Spot+0.245
Neg logit contributions
 teaching-0.615
 example-0.590
 showing-0.588
 giving-0.575
 show-0.567
Token said
Token ID531
Feature activation+5.19e+00

Pos logit contributions
Max+0.320
ens+0.288
Mic+0.269
Brown+0.254
Spot+0.250
Neg logit contributions
 teaching-0.339
 diving-0.335
 example-0.335
 joining-0.332
 humor-0.331
Token,
Token ID11
Feature activation+2.82e+00

Pos logit contributions
 teaching+0.102
 example+0.101
 humor+0.100
 accomplishment+0.097
 diving+0.097
Neg logit contributions
Max-0.108
ens-0.102
Mic-0.090
 Bun-0.089
 Blink-0.088
Token "
Token ID366
Feature activation-1.35e+00

Pos logit contributions
 humor+0.061
 example+0.060
 accomplishment+0.059
 diving+0.057
aps+0.057
Neg logit contributions
Max-0.057
ens-0.050
Mic-0.047
Snow-0.046
Brown-0.045
TokenWow
Token ID22017
Feature activation+8.23e-01

Pos logit contributions
ens+0.029
uffy+0.026
Max+0.025
 Margaret+0.024
ley+0.024
Neg logit contributions
 teaching-0.027
 example-0.027
 showing-0.026
 joining-0.025
 humor-0.025
Token,
Token ID11
Feature activation+3.11e+00

Pos logit contributions
 diving+0.007
 show+0.007
 teaching+0.007
 showing+0.007
 joining+0.006
Neg logit contributions
Max-0.027
Mic-0.026
ens-0.026
Bet-0.025
Brown-0.025
Token to
Token ID284
Feature activation+7.56e+00

Pos logit contributions
 example+0.157
 teaching+0.156
 accomplishment+0.143
 succeeding+0.143
 showing+0.141
Neg logit contributions
Max-0.464
ens-0.444
Mic-0.422
Spot-0.407
ety-0.406
Token run
Token ID1057
Feature activation+1.18e+01

Pos logit contributions
 teaching+0.108
 example+0.104
 diving+0.100
 showing+0.100
 joining+0.098
Neg logit contributions
Max-0.208
ens-0.199
Mic-0.186
Brown-0.178
Snow-0.178
Token and
Token ID290
Feature activation-2.09e-01

Pos logit contributions
 teaching+0.198
 example+0.187
 showing+0.186
 diving+0.184
 succeeding+0.184
Neg logit contributions
Max-0.302
ens-0.288
Mic-0.262
Spot-0.258
Snow-0.255
Token play
Token ID711
Feature activation+1.16e+01

Pos logit contributions
Max+0.006
ens+0.005
Mic+0.005
Spot+0.005
Brown+0.005
Neg logit contributions
 example-0.003
 teaching-0.003
 accomplishment-0.003
 succeeding-0.003
 showing-0.003
Token with
Token ID351
Feature activation+1.35e+01

Pos logit contributions
 teaching+0.167
 example+0.162
 humor+0.160
 giving+0.159
 diving+0.155
Neg logit contributions
Max-0.326
ens-0.296
Mic-0.283
Spot-0.278
ball-0.277
Token his
Token ID465
Feature activation-1.59e+01

Pos logit contributions
 teaching+0.157
 showing+0.152
 diving+0.148
 show+0.145
 humor+0.144
Neg logit contributions
Max-0.414
ens-0.393
Mic-0.381
issy-0.376
Snow-0.371
Token dog
Token ID3290
Feature activation-1.72e+01

Pos logit contributions
Max+0.432
ens+0.423
Mic+0.405
issy+0.402
Snow+0.389
Neg logit contributions
 teaching-0.241
 showing-0.235
 diving-0.233
 example-0.224
 show-0.223
Token,
Token ID11
Feature activation+1.41e+00

Pos logit contributions
Max+0.503
Mic+0.477
ens+0.470
Brown+0.462
Snow+0.456
Neg logit contributions
 teaching-0.218
 showing-0.204
 diving-0.199
 joining-0.193
 show-0.192
Token Rex
Token ID17853
Feature activation-2.41e+01

Pos logit contributions
 join+0.027
 showing+0.027
 diving+0.027
 teaching+0.027
 joining+0.027
Neg logit contributions
ley-0.030
issy-0.030
Max-0.030
Mic-0.029
lves-0.029
Token.
Token ID13
Feature activation-3.98e+00

Pos logit contributions
Max+0.735
Mic+0.725
ens+0.698
issy+0.685
Brown+0.682
Neg logit contributions
 teaching-0.287
 showing-0.277
 joining-0.267
 diving-0.263
 giving-0.259
Token They
Token ID1119
Feature activation-1.30e+01

Pos logit contributions
Mic+0.079
Max+0.078
lves+0.076
Brown+0.073
mi+0.073
Neg logit contributions
 diving-0.080
 teaching-0.079
 example-0.079
 join-0.077
 winning-0.076
Token I
Token ID314
Feature activation-8.03e+00

Pos logit contributions
Max+0.067
Mic+0.065
ens+0.064
Brown+0.060
Nut+0.059
Neg logit contributions
 example-0.045
 teaching-0.044
 giving-0.042
 showing-0.042
 mixing-0.042
Token am
Token ID716
Feature activation+7.54e+00

Pos logit contributions
Max+0.161
ens+0.158
ety+0.137
ball+0.134
uffy+0.130
Neg logit contributions
 example-0.182
 teaching-0.180
 accomplishment-0.175
 succeeding-0.173
 showing-0.173
Token so
Token ID523
Feature activation+6.94e+00

Pos logit contributions
 example+0.117
 humor+0.115
 teaching+0.110
 accomplishment+0.109
 phrase+0.108
Neg logit contributions
Max-0.185
ens-0.174
 Midnight-0.169
 Bun-0.163
Mic-0.162
Token sorry
Token ID7926
Feature activation+6.29e+00

Pos logit contributions
 example+0.125
 teaching+0.125
 joining+0.121
 showing+0.117
 diving+0.114
Neg logit contributions
Max-0.163
ens-0.157
uffy-0.144
ety-0.142
Mic-0.140
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
 about+0.063
 show+0.061
 teaching+0.061
 diving+0.059
 example+0.058
Neg logit contributions
Max-0.199
Mic-0.196
ens-0.187
Brown-0.184
lves-0.184
Token Anna
Token ID11735
Feature activation-2.73e+01

Pos logit contributions
 show+0.022
 giving+0.021
 showing+0.020
 sharing+0.020
 liking+0.020
Neg logit contributions
Max-0.026
Mic-0.025
ley-0.024
elope-0.024
Brown-0.024
Token.
Token ID13
Feature activation-3.94e+00

Pos logit contributions
Max+0.858
Mic+0.822
ens+0.801
Brown+0.765
lves+0.761
Neg logit contributions
 teaching-0.299
 showing-0.288
 giving-0.281
 show-0.272
 diving-0.266
Token I
Token ID314
Feature activation-7.98e+00

Pos logit contributions
Max+0.103
Mic+0.098
ens+0.097
ball+0.093
lves+0.092
Neg logit contributions
 teaching-0.059
 example-0.058
 showing-0.056
 giving-0.055
 humor-0.055
Token ruined
Token ID20484
Feature activation+9.64e+00

Pos logit contributions
Max+0.171
ens+0.164
ety+0.141
ball+0.141
Snow+0.141
Neg logit contributions
 teaching-0.169
 example-0.165
 diving-0.162
 joining-0.162
 showing-0.161
Token your
Token ID534
Feature activation-1.17e+01

Pos logit contributions
 teaching+0.102
 showing+0.100
 giving+0.097
 example+0.094
 diving+0.093
Neg logit contributions
Max-0.309
ens-0.294
Mic-0.292
issy-0.291
Brown-0.281
Token purse
Token ID27517
Feature activation-9.99e+00

Pos logit contributions
Max+0.232
ens+0.220
Mic+0.199
Spot+0.191
ety+0.189
Neg logit contributions
 teaching-0.250
 joining-0.245
 example-0.243
 diving-0.241
 succeeding-0.237
Token soft
Token ID2705
Feature activation+5.95e-01

Pos logit contributions
Max+0.016
ens+0.015
Mic+0.015
lves+0.014
ety+0.014
Neg logit contributions
 teaching-0.009
 showing-0.009
 giving-0.009
 mixing-0.008
 show-0.008
Token,
Token ID11
Feature activation+1.08e+00

Pos logit contributions
 teaching+0.008
 showing+0.008
 example+0.008
 giving+0.008
 diving+0.008
Neg logit contributions
Max-0.017
ens-0.016
Mic-0.016
Brown-0.015
issy-0.015
Token and
Token ID290
Feature activation-5.66e-01

Pos logit contributions
 teaching+0.017
 showing+0.017
 diving+0.016
 example+0.016
 giving+0.016
Neg logit contributions
Max-0.029
ens-0.027
Mic-0.026
issy-0.025
Brown-0.025
Token the
Token ID262
Feature activation-1.31e+01

Pos logit contributions
Max+0.015
ens+0.015
Mic+0.014
ety+0.014
lves+0.014
Neg logit contributions
 teaching-0.009
 showing-0.009
 giving-0.008
 helping-0.008
 mixing-0.008
Token mom
Token ID1995
Feature activation-1.77e+01

Pos logit contributions
Max+0.309
ens+0.277
Mic+0.264
Brown+0.258
Spot+0.255
Neg logit contributions
 teaching-0.230
 joining-0.230
 diving-0.226
 showing-0.218
 example-0.217
Tokenmy
Token ID1820
Feature activation-1.77e+01

Pos logit contributions
Max+0.250
Mic+0.215
ens+0.207
Snow+0.184
Spot+0.182
Neg logit contributions
 teaching-0.512
 showing-0.494
 giving-0.490
 joining-0.480
 sharing-0.477
Token loved
Token ID6151
Feature activation+1.63e+01

Pos logit contributions
Max+0.483
Mic+0.459
ens+0.440
lves+0.411
Bet+0.409
Neg logit contributions
 teaching-0.279
 showing-0.266
 giving-0.261
 joining-0.245
 show-0.243
Token it
Token ID340
Feature activation-1.47e+01

Pos logit contributions
 teaching+0.193
 showing+0.182
 example+0.175
 diving+0.173
 giving+0.171
Neg logit contributions
Max-0.507
ens-0.480
Mic-0.462
issy-0.445
Brown-0.442
Token very
Token ID845
Feature activation+7.80e+00

Pos logit contributions
Max+0.382
ens+0.363
Mic+0.336
Snow+0.327
Brown+0.327
Neg logit contributions
 teaching-0.230
 example-0.223
 humor-0.221
 diving-0.220
 showing-0.217
Token much
Token ID881
Feature activation+3.26e+00

Pos logit contributions
 teaching+0.094
 example+0.091
 showing+0.088
 diving+0.088
 show+0.085
Neg logit contributions
Max-0.234
ens-0.222
Mic-0.215
Bet-0.202
Brown-0.202
Token.
Token ID13
Feature activation-4.68e+00

Pos logit contributions
 teaching+0.055
 diving+0.053
 showing+0.053
 example+0.053
 joining+0.052
Neg logit contributions
Max-0.082
ens-0.078
Mic-0.072
Snow-0.070
Brown-0.069
Token to
Token ID284
Feature activation+9.08e+00

Pos logit contributions
 teaching+0.197
 example+0.188
 showing+0.182
 diving+0.180
 joining+0.179
Neg logit contributions
Max-0.607
ens-0.582
Mic-0.547
ety-0.532
Spot-0.529
Token eat
Token ID4483
Feature activation+1.77e+01

Pos logit contributions
 example+0.165
 teaching+0.158
 succeeding+0.154
 joining+0.153
 accomplishment+0.152
Neg logit contributions
Max-0.220
ens-0.211
Mic-0.193
Blue-0.181
Spot-0.181
Token it
Token ID340
Feature activation-1.25e+01

Pos logit contributions
 teaching+0.311
 example+0.299
 showing+0.295
 diving+0.294
 joining+0.290
Neg logit contributions
Max-0.434
ens-0.411
Mic-0.384
Snow-0.367
Brown-0.365
Token,
Token ID11
Feature activation+3.04e+00

Pos logit contributions
Max+0.302
ens+0.293
Mic+0.258
ball+0.258
ety+0.252
Neg logit contributions
 humor-0.220
 teaching-0.217
 example-0.210
 joining-0.205
 diving-0.203
Token so
Token ID523
Feature activation+8.74e+00

Pos logit contributions
 example+0.068
 teaching+0.068
 diving+0.065
 accomplishment+0.065
 succeeding+0.065
Neg logit contributions
Max-0.058
ens-0.055
Mic-0.048
ety-0.046
Spot-0.046
Token she
Token ID673
Feature activation-1.81e+01

Pos logit contributions
 teaching+0.196
 showing+0.194
 diving+0.189
 giving+0.188
 counting+0.186
Neg logit contributions
Max-0.165
Mic-0.161
ens-0.150
lves-0.146
Brown-0.145
Token took
Token ID1718
Feature activation+1.81e+01

Pos logit contributions
Max+0.475
ens+0.456
Mic+0.433
Brown+0.426
 Margaret+0.418
Neg logit contributions
 teaching-0.259
 showing-0.257
 example-0.248
 diving-0.248
 show-0.243
Token it
Token ID340
Feature activation-1.21e+01

Pos logit contributions
 teaching+0.313
 example+0.300
 showing+0.298
 diving+0.297
 joining+0.291
Neg logit contributions
Max-0.453
ens-0.427
Mic-0.403
Brown-0.385
Snow-0.384
Token to
Token ID284
Feature activation+9.51e+00

Pos logit contributions
Max+0.340
ens+0.332
Mic+0.303
ball+0.297
ety+0.296
Neg logit contributions
 teaching-0.167
 humor-0.165
 joining-0.159
 example-0.158
 diving-0.155
Token the
Token ID262
Feature activation-1.12e+01

Pos logit contributions
 example+0.210
 diving+0.209
 teaching+0.208
 joining+0.204
 succeeding+0.202
Neg logit contributions
Max-0.179
ens-0.167
Mic-0.147
 Midnight-0.142
 Blink-0.140
Token table
Token ID3084
Feature activation-2.63e-01

Pos logit contributions
Max+0.247
ens+0.228
Mic+0.213
Spot+0.203
Snow+0.203
Neg logit contributions
 teaching-0.216
 example-0.210
 joining-0.210
 succeeding-0.209
 diving-0.209