TOP ACTIVATIONS
MAX = 2.504

and saw a magical fair yt ale creature called a n
ashamed and sorry . " I wanted to see what it
. Nora smiled and looked out into the me adow ,
sorry and said , " I 'm sorry , mom .
The child smiled and looked out the window again . In
. He said , " I am sorry , Lily .
. She said , " I did not know , Mama
bad and said , " I know , mom . I
thing . She said " I 'm sorry , I 'm
. She says , " I 'm sorry , Mom .
cry and said , " I 'm sorry , Mom my
alone ." " I 'm sorry , Mom my
felt bad and said ," I 'm sorry , let me
computer . She said " I 'm sorry , dad .
aches ." " I 'm sorry , Tom ,"
. I was mad . I wanted you to play with
The little girl was su pr ised . She had never
to share and wait . How about we clean up this
Lily . They were very close , but they were also
down and the butterfly slowly unw ound the thread .

MIN ACTIVATIONS
MIN = -2.612

clue . The next clue said : " Look
wrong . The next day , the store owner
lonely . The next day , the sun came
unfair . The next day , they decided to
say . " Me too . I wish I
later . The next day , Ben and Lily
. They hoped that the next time they had a picnic
owner . The next day , Tim my went
sad . The next day , Lucy was very
them . The next day , Lily went to
, but they don 't see any mud . They only
soon . The next day , Anna checked on
. The next day , Lucy took the
sad and angry . The next day , Tim wanted to
happy . The next day , they went back
hurt . The next day , Tom and Lisa
every night . She hugs it and squee zes it tight
. It wasn 't scary after all .
er . The next day , she was very
later . The next day , they went back

INTERVAL 1.225 - 2.504
CONTAINS 0.590%

scissors and cut his pocket open . He carefully pulled out
to his field to pick some corn . He saw a
asks . " I don 't know . Maybe
, you are a hero ! We love you !" <|endoftext|>
other and said , " I forgive you . I love

INTERVAL -0.054 - 1.225
CONTAINS 53.436%

for his success . Bob learned that with hard
day , Lily 's mom asked her to lend her a
in a sweet voice . Suddenly one of the birds hopped
a little light . It was very pretty and bright .
sea noticed the sand castle . She wanted to carry it

INTERVAL -1.333 - -0.054
CONTAINS 45.659%

surprise . Together , they decided to pick some strawberries and
and he wondered . Finally , he had an idea !
had all of the colors , and they put them in
to yawn . It seemed like the song and the warm
. You are welcome to play in the p udd les

INTERVAL -2.612 - -1.333
CONTAINS 0.315%

was important . She did not know the money was for
and wet . They did not know how to swim .
please ," Tom said . " We live on Green Street
brought it home . The next day , Lucy and her
and tried but couldn 't get the ball in the basket
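
NOTE ON THE INTERVALS
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX, and the four CONTAINS shares appear to cover every sampled token. The sketch below checks both readings against the reported numbers. It is only a consistency check; the function name `equal_width_edges` is hypothetical, and the choice of four bins is inferred from the listing, not documented by it.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]

# MIN and MAX as reported in the listing above.
edges = equal_width_edges(-2.612, 2.504, 4)

# Shares reported for the four intervals, highest interval first.
shares = [0.590, 53.436, 45.659, 0.315]
total_share = round(sum(shares), 3)

print(edges)        # bin edges, lowest first
print(total_share)  # combined share of the four intervals, in percent
```

Under these assumptions each bin is 1.279 wide, and the computed edges match the interval boundaries listed above.
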
Token and
Token ID290
Feature activation-2.84e-01

Pos logit contributions
ocol+0.649
 rib+0.560
 microsc+0.538
ifying+0.527
hy+0.519
Neg logit contributions
 someone-0.214
 everything-0.207
Maybe-0.195
 dad-0.193
 something-0.188
Token saw
Token ID2497
Feature activation+1.11e-01

Pos logit contributions
 someone+0.206
 everything+0.203
 dad+0.195
Maybe+0.192
How+0.188
Neg logit contributions
ocol-0.527
 rib-0.466
 microsc-0.444
ifying-0.421
hy-0.419
Token a
Token ID257
Feature activation-2.22e-02

Pos logit contributions
ocol+0.248
 rib+0.218
 microsc+0.213
hy+0.204
ifying+0.203
Neg logit contributions
 someone-0.048
 everything-0.047
Maybe-0.045
 dad-0.043
They-0.041
Token magical
Token ID10883
Feature activation+3.21e-01

Pos logit contributions
 everything+0.017
 someone+0.017
Maybe+0.016
 dad+0.016
 something+0.016
Neg logit contributions
ocol-0.041
 rib-0.036
 microsc-0.035
ifying-0.033
hy-0.033
Token fair
Token ID3148
Feature activation-6.27e-02

Pos logit contributions
ocol+0.590
 rib+0.499
 microsc+0.477
ifying+0.462
hy+0.459
Neg logit contributions
 someone-0.273
 everything-0.271
Maybe-0.258
 dad-0.257
 something-0.251
Tokenyt
Token ID20760
Feature activation+2.50e+00

Pos logit contributions
 someone+0.050
 everything+0.050
Maybe+0.049
How+0.047
 something+0.046
Neg logit contributions
ocol-0.118
 rib-0.101
 microsc-0.096
ifying-0.093
hy-0.093
Tokenale
Token ID1000
Feature activation+6.37e-01

Pos logit contributions
ocol+4.111
 microsc+3.620
 rib+3.500
 former+3.397
 topp+3.387
Neg logit contributions
 one-2.552
 someone-2.537
 everything-2.497
 where-2.488
 dad-2.481
Token creature
Token ID7185
Feature activation+2.74e-01

Pos logit contributions
ocol+1.279
 rib+1.094
 microsc+1.057
ifying+1.029
hy+1.020
Neg logit contributions
 someone-0.448
 everything-0.437
 dad-0.411
 something-0.407
Maybe-0.402
Token called
Token ID1444
Feature activation+6.88e-01

Pos logit contributions
ocol+0.551
 rib+0.474
 microsc+0.456
ifying+0.447
hy+0.444
Neg logit contributions
 someone-0.185
 everything-0.183
Maybe-0.171
 dad-0.171
 something-0.166
Token a
Token ID257
Feature activation+3.35e-01

Pos logit contributions
ocol+1.415
 rib+1.214
 microsc+1.174
ifying+1.144
hy+1.134
Neg logit contributions
 someone-0.452
 everything-0.439
 dad-0.411
 something-0.408
 one-0.398
Token n
Token ID299
Feature activation+9.10e-01

Pos logit contributions
ocol+0.612
 rib+0.520
 microsc+0.500
ifying+0.485
hy+0.480
Neg logit contributions
 someone-0.285
 everything-0.284
Maybe-0.267
 dad-0.267
 something-0.264
Token ashamed
Token ID22461
Feature activation-8.12e-01

Pos logit contributions
ocol+2.332
 rib+2.048
 microsc+1.984
ifying+1.943
 former+1.919
Neg logit contributions
 someone-0.373
 everything-0.345
 dad-0.320
 something-0.314
 one-0.289
Token and
Token ID290
Feature activation+5.05e-01

Pos logit contributions
Maybe+0.620
 someone+0.613
How+0.608
 everything+0.599
 looked+0.592
Neg logit contributions
ocol-1.454
 rib-1.220
 microsc-1.177
hy-1.133
ifying-1.133
Token sorry
Token ID7926
Feature activation-7.69e-01

Pos logit contributions
ocol+1.018
 rib+0.873
 microsc+0.843
ifying+0.820
hy+0.818
Neg logit contributions
 someone-0.350
 everything-0.343
 dad-0.319
 something-0.317
Maybe-0.316
Token.
Token ID13
Feature activation-5.33e-01

Pos logit contributions
 someone+0.558
Maybe+0.550
How+0.543
 everything+0.532
 dad+0.529
Neg logit contributions
ocol-1.372
 rib-1.229
 microsc-1.159
onna-1.125
rou-1.104
Token "
Token ID366
Feature activation+1.32e-01

Pos logit contributions
 everything+0.451
 someone+0.445
 looked+0.437
 one+0.432
 something+0.431
Neg logit contributions
ocol-0.927
 microsc-0.777
 rib-0.770
hy-0.725
onna-0.722
TokenI
Token ID40
Feature activation+2.45e+00

Pos logit contributions
ocol+0.276
 rib+0.238
 microsc+0.231
ifying+0.227
hy+0.225
Neg logit contributions
 everything-0.078
 someone-0.078
 dad-0.072
 something-0.071
 one-0.070
Token wanted
Token ID2227
Feature activation-4.34e-01

Pos logit contributions
ocol+5.660
 rib+4.660
ifying+4.517
 microsc+4.411
hy+4.242
Neg logit contributions
 looked-1.837
Maybe-1.803
 someone-1.702
 turned-1.699
 they-1.672
Token to
Token ID284
Feature activation+5.89e-01

Pos logit contributions
Maybe+0.528
 looked+0.509
 everything+0.506
They+0.505
 someone+0.502
Neg logit contributions
ocol-0.613
 rib-0.512
 microsc-0.483
 topp-0.438
ifying-0.437
Token see
Token ID766
Feature activation-4.27e-01

Pos logit contributions
ocol+1.177
 rib+1.007
 microsc+0.972
ifying+0.945
hy+0.939
Neg logit contributions
 someone-0.421
 everything-0.412
 dad-0.387
 something-0.384
Maybe-0.380
Token what
Token ID644
Feature activation-2.25e-01

Pos logit contributions
Maybe+0.297
 looked+0.295
 someone+0.278
 everything+0.277
They+0.271
Neg logit contributions
ocol-0.836
 rib-0.717
 microsc-0.680
ifying-0.649
rou-0.640
Token it
Token ID340
Feature activation+9.26e-02

Pos logit contributions
 someone+0.179
 everything+0.179
 dad+0.175
Maybe+0.174
 something+0.170
Neg logit contributions
ocol-0.399
 rib-0.348
 microsc-0.333
hy-0.317
 topp-0.315
Token.
Token ID13
Feature activation+5.75e-02

Pos logit contributions
ocol+0.497
 rib+0.432
 microsc+0.418
ifying+0.407
hy+0.405
Neg logit contributions
 someone-0.137
 everything-0.132
Maybe-0.125
 dad-0.123
 something-0.120
Token Nora
Token ID32561
Feature activation+2.56e-01

Pos logit contributions
ocol+0.095
 rib+0.079
 microsc+0.075
ifying+0.074
hy+0.072
Neg logit contributions
 someone-0.059
 everything-0.058
 dad-0.056
 something-0.055
Maybe-0.055
Token smiled
Token ID13541
Feature activation-2.18e-01

Pos logit contributions
ocol+0.475
 rib+0.412
 microsc+0.394
hy+0.381
ifying+0.380
Neg logit contributions
 someone-0.207
 everything-0.202
 something-0.191
 dad-0.190
Maybe-0.188
Token and
Token ID290
Feature activation+6.99e-03

Pos logit contributions
 someone+0.139
 everything+0.134
Maybe+0.126
How+0.121
 dad+0.120
Neg logit contributions
ocol-0.428
 rib-0.384
 microsc-0.361
ifying-0.360
rou-0.356
Token looked
Token ID3114
Feature activation+4.32e-01

Pos logit contributions
ocol+0.012
 rib+0.011
 microsc+0.010
hy+0.010
ifying+0.010
Neg logit contributions
 someone-0.006
 everything-0.006
 something-0.005
Maybe-0.005
 dad-0.005
Token out
Token ID503
Feature activation+2.42e+00

Pos logit contributions
ocol+0.781
 rib+0.657
 microsc+0.631
ifying+0.614
hy+0.604
Neg logit contributions
 someone-0.390
 everything-0.378
 dad-0.364
Maybe-0.363
 something-0.359
Token into
Token ID656
Feature activation+1.57e+00

Pos logit contributions
ocol+5.242
 former+4.602
 microsc+4.492
 topp+4.447
 rib+4.404
Neg logit contributions
 everything-1.765
 one-1.648
 someone-1.606
 something-1.554
 very-1.523
Token the
Token ID262
Feature activation+7.87e-02

Pos logit contributions
ocol+3.423
 rib+2.915
hy+2.888
ifying+2.859
rou+2.847
Neg logit contributions
 everything-0.936
 someone-0.927
 something-0.873
 one-0.851
 dad-0.829
Token me
Token ID502
Feature activation-1.56e-01

Pos logit contributions
ocol+0.157
 rib+0.136
 microsc+0.131
ifying+0.129
 former+0.126
Neg logit contributions
 someone-0.054
 everything-0.052
Maybe-0.050
 dad-0.050
 something-0.049
Tokenadow
Token ID4584
Feature activation+6.01e-01

Pos logit contributions
 someone+0.221
 everything+0.218
Maybe+0.216
 dad+0.208
How+0.208
Neg logit contributions
ocol-0.203
 rib-0.152
 microsc-0.149
ifying-0.142
rou-0.141
Token,
Token ID11
Feature activation-4.25e-02

Pos logit contributions
ocol+1.288
 rib+1.114
 microsc+1.079
ifying+1.053
hy+1.045
Neg logit contributions
 someone-0.338
 everything-0.328
 dad-0.304
 something-0.299
Maybe-0.296
Token sorry
Token ID7926
Feature activation-5.06e-02

Pos logit contributions
 everything+0.178
 someone+0.175
Maybe+0.171
 dad+0.170
 looked+0.165
Neg logit contributions
ocol-0.500
 rib-0.421
 microsc-0.405
hy-0.400
ifying-0.393
Token and
Token ID290
Feature activation-1.09e-03

Pos logit contributions
 someone+0.046
 everything+0.044
Maybe+0.044
 dad+0.043
 looked+0.043
Neg logit contributions
ocol-0.088
 rib-0.074
 microsc-0.071
rou-0.068
ifying-0.068
Token said
Token ID531
Feature activation+7.62e-01

Pos logit contributions
 someone+0.001
 everything+0.001
Maybe+0.001
 dad+0.001
 something+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
hy-0.002
rou-0.002
Token,
Token ID11
Feature activation-7.64e-02

Pos logit contributions
ocol+1.638
 rib+1.411
 microsc+1.368
ifying+1.339
hy+1.320
Neg logit contributions
 someone-0.450
 everything-0.437
 dad-0.406
 something-0.402
 one-0.389
Token "
Token ID366
Feature activation-3.04e-01

Pos logit contributions
 someone+0.068
 everything+0.066
 something+0.063
Maybe+0.062
 dad+0.062
Neg logit contributions
ocol-0.132
 rib-0.115
 microsc-0.108
hy-0.106
ifying-0.105
TokenI
Token ID40
Feature activation+2.42e+00

Pos logit contributions
 everything+0.203
 someone+0.198
 something+0.186
 dad+0.184
 where+0.184
Neg logit contributions
ocol-0.601
 rib-0.532
 microsc-0.499
hy-0.495
ifying-0.494
Token'm
Token ID1101
Feature activation+1.06e+00

Pos logit contributions
ocol+5.083
 rib+4.051
ifying+3.979
 microsc+3.821
hy+3.675
Neg logit contributions
Maybe-2.355
 someone-2.319
 looked-2.275
 something-2.245
 one-2.220
Token sorry
Token ID7926
Feature activation-2.75e-01

Pos logit contributions
ocol+2.355
 rib+2.062
 microsc+1.992
ifying+1.960
hy+1.923
Neg logit contributions
 someone-0.575
 everything-0.543
 dad-0.511
 something-0.509
 one-0.486
Token,
Token ID11
Feature activation+5.59e-01

Pos logit contributions
Maybe+0.195
 someone+0.192
 everything+0.188
 looked+0.185
How+0.178
Neg logit contributions
ocol-0.533
 rib-0.459
 microsc-0.443
rou-0.422
hy-0.421
Token mom
Token ID1995
Feature activation+1.17e-01

Pos logit contributions
ocol+1.182
 rib+1.021
 microsc+0.987
ifying+0.961
hy+0.954
Neg logit contributions
 someone-0.336
 everything-0.327
 dad-0.301
 something-0.301
Maybe-0.298
Token.
Token ID13
Feature activation+4.98e-01

Pos logit contributions
ocol+0.242
 rib+0.210
 microsc+0.203
hy+0.198
rou+0.196
Neg logit contributions
 someone-0.071
 everything-0.068
Maybe-0.063
 something-0.063
 dad-0.062
TokenThe
Token ID464
Feature activation-7.72e-01

Pos logit contributions
 someone+0.368
 dad+0.358
 everything+0.358
 something+0.347
 one+0.346
Neg logit contributions
ocol-0.583
 rib-0.507
 microsc-0.466
ifying-0.464
hy-0.450
Token child
Token ID1200
Feature activation+2.25e-02

Pos logit contributions
 someone+0.782
 everything+0.764
They+0.744
 something+0.728
Maybe+0.717
Neg logit contributions
ocol-1.250
 rib-1.114
 microsc-1.039
hy-0.990
rou-0.975
Token smiled
Token ID13541
Feature activation-2.11e-01

Pos logit contributions
ocol+0.039
 rib+0.034
 microsc+0.032
hy+0.031
 former+0.031
Neg logit contributions
 someone-0.021
 everything-0.020
 something-0.019
 dad-0.019
Maybe-0.019
Token and
Token ID290
Feature activation-3.95e-01

Pos logit contributions
 someone+0.133
 everything+0.129
Maybe+0.122
How+0.119
 dad+0.115
Neg logit contributions
ocol-0.419
 rib-0.374
ifying-0.349
 microsc-0.349
 former-0.346
Token looked
Token ID3114
Feature activation+3.69e-01

Pos logit contributions
 someone+0.330
 everything+0.320
How+0.308
Maybe+0.306
 something+0.303
Neg logit contributions
ocol-0.687
 rib-0.622
 microsc-0.571
hy-0.561
 former-0.550
Token out
Token ID503
Feature activation+2.41e+00

Pos logit contributions
ocol+0.669
 rib+0.566
 microsc+0.540
ifying+0.527
hy+0.518
Neg logit contributions
 someone-0.329
 everything-0.318
Maybe-0.305
 dad-0.305
 something-0.301
Token the
Token ID262
Feature activation+2.88e-02

Pos logit contributions
ocol+5.531
 microsc+4.787
 former+4.781
pill+4.724
 topp+4.686
Neg logit contributions
 everything-1.427
 one-1.376
 someone-1.305
 they-1.239
 something-1.230
Token window
Token ID4324
Feature activation+9.68e-01

Pos logit contributions
ocol+0.052
 rib+0.045
 microsc+0.042
ifying+0.042
hy+0.041
Neg logit contributions
 someone-0.026
 everything-0.025
Maybe-0.025
 something-0.024
 they-0.024
Token again
Token ID757
Feature activation+1.11e+00

Pos logit contributions
ocol+2.121
 rib+1.820
 microsc+1.782
ifying+1.719
hy+1.701
Neg logit contributions
 someone-0.530
 everything-0.513
 dad-0.474
 something-0.473
 one-0.473
Token.
Token ID13
Feature activation-7.29e-01

Pos logit contributions
ocol+2.509
 rib+2.176
 microsc+2.120
hy+2.066
ifying+2.058
Neg logit contributions
 someone-0.540
 everything-0.515
 something-0.478
 one-0.477
 dad-0.474
Token In
Token ID554
Feature activation-5.54e-01

Pos logit contributions
 someone+0.770
 everything+0.768
 something+0.749
 dad+0.734
 somewhere+0.732
Neg logit contributions
ocol-1.103
 rib-0.931
 microsc-0.892
ifying-0.891
 former-0.870
Token.
Token ID13
Feature activation-2.70e-01

Pos logit contributions
 someone+0.097
 everything+0.096
Maybe+0.092
 dad+0.090
 something+0.087
Neg logit contributions
ocol-0.309
 rib-0.271
 microsc-0.261
hy-0.258
ifying-0.256
Token He
Token ID679
Feature activation+6.31e-01

Pos logit contributions
 everything+0.252
 someone+0.246
 dad+0.242
 looked+0.241
 one+0.238
Neg logit contributions
ocol-0.459
 rib-0.377
 microsc-0.376
hy-0.361
ifying-0.354
Token said
Token ID531
Feature activation+6.87e-01

Pos logit contributions
ocol+1.198
 rib+1.014
 microsc+0.977
ifying+0.950
hy+0.941
Neg logit contributions
 someone-0.515
 everything-0.504
 dad-0.478
 something-0.474
Maybe-0.469
Token,
Token ID11
Feature activation+3.32e-01

Pos logit contributions
ocol+1.448
 rib+1.248
 microsc+1.207
ifying+1.180
hy+1.168
Neg logit contributions
 someone-0.422
 everything-0.410
 dad-0.382
 something-0.378
Maybe-0.368
Token "
Token ID366
Feature activation+1.71e-01

Pos logit contributions
ocol+0.620
 rib+0.527
 microsc+0.506
ifying+0.489
hy+0.488
Neg logit contributions
 someone-0.272
 everything-0.268
Maybe-0.254
 dad-0.253
 something-0.253
TokenI
Token ID40
Feature activation+2.40e+00

Pos logit contributions
ocol+0.327
 rib+0.279
 microsc+0.267
ifying+0.262
hy+0.261
Neg logit contributions
 everything-0.131
 someone-0.131
 dad-0.126
 something-0.123
 where-0.120
Token am
Token ID716
Feature activation+2.69e-01

Pos logit contributions
ocol+4.938
 rib+4.056
ifying+3.964
 microsc+3.909
 former+3.753
Neg logit contributions
 someone-2.181
Maybe-2.110
 something-2.094
 everything-2.092
 one-2.089
Token sorry
Token ID7926
Feature activation-5.75e-01

Pos logit contributions
ocol+0.565
 rib+0.483
 microsc+0.468
hy+0.456
ifying+0.454
Neg logit contributions
 someone-0.160
 everything-0.158
Maybe-0.155
 dad-0.145
 looked-0.144
Token,
Token ID11
Feature activation+6.35e-01

Pos logit contributions
Maybe+0.442
 looked+0.417
 someone+0.408
 everything+0.405
How+0.403
Neg logit contributions
ocol-1.082
 rib-0.923
 microsc-0.894
rou-0.853
hy-0.850
Token Lily
Token ID20037
Feature activation+6.11e-01

Pos logit contributions
ocol+1.368
 rib+1.182
 microsc+1.144
ifying+1.117
hy+1.108
Neg logit contributions
 someone-0.363
 everything-0.352
 dad-0.326
 something-0.322
Maybe-0.317
Token.
Token ID13
Feature activation+9.63e-01

Pos logit contributions
ocol+1.336
 rib+1.158
 microsc+1.122
ifying+1.096
hy+1.088
Neg logit contributions
 someone-0.323
 everything-0.313
 dad-0.288
 something-0.284
Maybe-0.280
Token.
Token ID13
Feature activation-7.77e-01

Pos logit contributions
 someone+0.213
 everything+0.204
Maybe+0.203
 dad+0.197
 looked+0.196
Neg logit contributions
ocol-0.571
 rib-0.502
 microsc-0.481
rou-0.463
hy-0.458
Token She
Token ID1375
Feature activation+5.11e-01

Pos logit contributions
 everything+0.712
 someone+0.677
 looked+0.669
 somewhere+0.668
 one+0.665
Neg logit contributions
ocol-1.314
 microsc-1.121
 rib-1.119
hy-1.032
ifying-1.027
Token said
Token ID531
Feature activation+1.03e+00

Pos logit contributions
ocol+0.960
 rib+0.818
 microsc+0.787
ifying+0.762
hy+0.759
Neg logit contributions
 someone-0.420
 everything-0.413
 dad-0.389
 something-0.387
Maybe-0.382
Token,
Token ID11
Feature activation+2.62e-01

Pos logit contributions
ocol+2.252
 rib+1.938
 microsc+1.881
ifying+1.866
 former+1.841
Neg logit contributions
 someone-0.584
 everything-0.568
 dad-0.534
 something-0.523
 one-0.501
Token "
Token ID366
Feature activation+4.59e-02

Pos logit contributions
ocol+0.482
 rib+0.411
 microsc+0.393
ifying+0.382
hy+0.378
Neg logit contributions
 someone-0.219
 everything-0.215
 something-0.204
Maybe-0.203
 dad-0.202
TokenI
Token ID40
Feature activation+2.39e+00

Pos logit contributions
ocol+0.094
 rib+0.082
 microsc+0.078
ifying+0.076
hy+0.076
Neg logit contributions
 everything-0.030
 someone-0.029
 something-0.027
 dad-0.026
 where-0.026
Token did
Token ID750
Feature activation+4.66e-01

Pos logit contributions
ocol+4.969
 rib+3.997
ifying+3.889
 microsc+3.752
 former+3.649
Neg logit contributions
Maybe-2.305
 someone-2.305
 looked-2.222
 dad-2.206
 everything-2.201
Token not
Token ID407
Feature activation+2.73e-02

Pos logit contributions
ocol+1.147
 rib+1.013
 microsc+0.986
ifying+0.962
hy+0.960
Neg logit contributions
 someone-0.112
 everything-0.106
 dad-0.088
Maybe-0.087
 something-0.079
Token know
Token ID760
Feature activation+1.08e-01

Pos logit contributions
ocol+0.052
 rib+0.045
 microsc+0.043
hy+0.042
ifying+0.041
Neg logit contributions
 everything-0.020
 someone-0.020
 dad-0.019
Maybe-0.019
 they-0.019
Token,
Token ID11
Feature activation+5.57e-01

Pos logit contributions
ocol+0.230
 rib+0.197
 microsc+0.191
ifying+0.184
hy+0.183
Neg logit contributions
 someone-0.061
 everything-0.060
Maybe-0.059
 dad-0.057
 looked-0.056
Token Mama
Token ID40084
Feature activation+3.94e-01

Pos logit contributions
ocol+1.131
 rib+0.969
 microsc+0.937
ifying+0.910
hy+0.902
Neg logit contributions
 someone-0.380
 everything-0.372
 dad-0.346
 something-0.345
Maybe-0.343
Token bad
Token ID2089
Feature activation+2.56e-01

Pos logit contributions
 everything+0.314
 someone+0.308
Maybe+0.303
 dad+0.299
How+0.292
Neg logit contributions
ocol-0.843
 rib-0.712
 microsc-0.682
hy-0.680
ifying-0.664
Token and
Token ID290
Feature activation+3.81e-01

Pos logit contributions
ocol+0.529
 rib+0.458
 microsc+0.443
hy+0.430
ifying+0.428
Neg logit contributions
 someone-0.155
 everything-0.151
Maybe-0.141
 dad-0.141
 something-0.137
Token said
Token ID531
Feature activation+2.61e-01

Pos logit contributions
ocol+0.731
 rib+0.626
 microsc+0.601
hy+0.584
ifying+0.581
Neg logit contributions
 someone-0.291
 everything-0.287
 dad-0.269
 something-0.266
Maybe-0.265
Token,
Token ID11
Feature activation-1.55e-01

Pos logit contributions
ocol+0.533
 rib+0.460
 microsc+0.442
hy+0.431
ifying+0.429
Neg logit contributions
 someone-0.168
 everything-0.163
Maybe-0.157
 dad-0.152
 something-0.149
Token "
Token ID366
Feature activation-2.51e-01

Pos logit contributions
 someone+0.138
 everything+0.136
 something+0.130
Maybe+0.128
 dad+0.128
Neg logit contributions
ocol-0.265
 rib-0.231
 microsc-0.217
hy-0.214
ifying-0.210
TokenI
Token ID40
Feature activation+2.39e+00

Pos logit contributions
 everything+0.177
 someone+0.169
 something+0.160
 dad+0.160
 where+0.157
Neg logit contributions
ocol-0.493
 rib-0.433
 microsc-0.405
hy-0.402
ifying-0.401
Token know
Token ID760
Feature activation-5.63e-02

Pos logit contributions
ocol+4.967
 rib+3.989
ifying+3.891
 microsc+3.829
pill+3.650
Neg logit contributions
 someone-2.247
Maybe-2.215
 something-2.184
 one-2.175
 dad-2.166
Token,
Token ID11
Feature activation+9.38e-02

Pos logit contributions
 someone+0.035
Maybe+0.035
 everything+0.035
 dad+0.033
How+0.032
Neg logit contributions
ocol-0.115
 rib-0.099
 microsc-0.095
ifying-0.091
hy-0.091
Token mom
Token ID1995
Feature activation-2.42e-01

Pos logit contributions
ocol+0.224
 rib+0.199
 microsc+0.192
ifying+0.185
hy+0.184
Neg logit contributions
 someone-0.027
 everything-0.026
Maybe-0.023
 looked-0.022
 something-0.021
Token.
Token ID13
Feature activation+6.57e-01

Pos logit contributions
 someone+0.158
 everything+0.153
Maybe+0.147
 something+0.140
 dad+0.139
Neg logit contributions
ocol-0.482
 rib-0.418
 microsc-0.403
hy-0.397
rou-0.394
Token I
Token ID314
Feature activation+1.71e+00

Pos logit contributions
ocol+1.353
 rib+1.161
 microsc+1.123
ifying+1.094
hy+1.084
Neg logit contributions
 someone-0.433
 everything-0.421
 dad-0.395
 something-0.391
Maybe-0.385
Token thing
Token ID1517
Feature activation+1.20e+00

Pos logit contributions
 everything+0.251
 someone+0.246
Maybe+0.236
 dad+0.235
 looked+0.231
Neg logit contributions
ocol-0.651
 rib-0.560
hy-0.533
 microsc-0.532
 gol-0.517
Token.
Token ID13
Feature activation-7.84e-01

Pos logit contributions
ocol+2.781
 rib+2.397
 microsc+2.335
 former+2.311
ifying+2.289
Neg logit contributions
 someone-0.562
 everything-0.525
 dad-0.488
 one-0.479
 something-0.478
Token She
Token ID1375
Feature activation+3.84e-01

Pos logit contributions
 everything+0.732
 someone+0.705
 somewhere+0.686
 one+0.685
 looked+0.681
Neg logit contributions
ocol-1.300
 microsc-1.102
 rib-1.094
hy-1.079
ifying-1.017
Token said
Token ID531
Feature activation+8.21e-01

Pos logit contributions
ocol+0.734
 rib+0.634
 microsc+0.607
hy+0.588
ifying+0.588
Neg logit contributions
 someone-0.295
 everything-0.291
 something-0.271
 dad-0.271
 one-0.266
Token "
Token ID366
Feature activation-2.42e-01

Pos logit contributions
ocol+1.682
 rib+1.439
 microsc+1.393
ifying+1.366
 former+1.345
Neg logit contributions
 someone-0.569
 everything-0.553
 dad-0.523
 something-0.515
 one-0.500
TokenI
Token ID40
Feature activation+2.39e+00

Pos logit contributions
 everything+0.200
 someone+0.196
 where+0.191
 dad+0.190
 something+0.188
Neg logit contributions
ocol-0.441
 rib-0.378
hy-0.354
 microsc-0.354
ifying-0.349
Token'm
Token ID1101
Feature activation+4.67e-01

Pos logit contributions
ocol+4.841
 rib+3.933
ifying+3.840
 microsc+3.762
 former+3.623
Neg logit contributions
 someone-2.264
Maybe-2.258
 looked-2.251
 something-2.171
 everything-2.159
Token sorry
Token ID7926
Feature activation-6.38e-01

Pos logit contributions
ocol+0.990
 rib+0.849
 microsc+0.822
ifying+0.803
hy+0.800
Neg logit contributions
 someone-0.279
 everything-0.273
Maybe-0.255
 dad-0.252
 something-0.248
Token,
Token ID11
Feature activation+7.58e-01

Pos logit contributions
Maybe+0.506
 looked+0.480
How+0.464
 someone+0.463
 everything+0.461
Neg logit contributions
ocol-1.194
 rib-1.016
 microsc-0.979
hy-0.943
rou-0.930
Token I
Token ID314
Feature activation+1.82e+00

Pos logit contributions
ocol+1.451
 rib+1.227
 microsc+1.186
ifying+1.156
hy+1.139
Neg logit contributions
 someone-0.622
 everything-0.607
 dad-0.581
 something-0.571
 one-0.562
Token'm
Token ID1101
Feature activation+2.49e-01

Pos logit contributions
ocol+3.705
 rib+3.039
 microsc+2.900
ifying+2.848
 former+2.747
Neg logit contributions
 someone-1.601
Maybe-1.566
 looked-1.564
 everything-1.535
 something-1.515
Token.
Token ID13
Feature activation-5.34e-01

Pos logit contributions
 someone+0.433
Maybe+0.415
 dad+0.414
 everything+0.412
 looked+0.398
Neg logit contributions
ocol-1.207
 rib-1.050
 microsc-0.999
onna-0.969
rou-0.966
Token She
Token ID1375
Feature activation+8.55e-01

Pos logit contributions
 everything+0.447
 someone+0.435
 one+0.427
 something+0.423
 somewhere+0.421
Neg logit contributions
ocol-0.924
 rib-0.790
 microsc-0.790
hy-0.747
ifying-0.743
Token says
Token ID1139
Feature activation+8.70e-01

Pos logit contributions
ocol+1.557
 rib+1.291
 microsc+1.243
ifying+1.219
hy+1.197
Neg logit contributions
 someone-0.786
 everything-0.774
 dad-0.740
 something-0.735
Maybe-0.729
Token,
Token ID11
Feature activation+2.35e-01

Pos logit contributions
ocol+1.887
 rib+1.626
 microsc+1.576
ifying+1.551
 former+1.533
Neg logit contributions
 someone-0.508
 everything-0.494
 dad-0.461
 something-0.453
 one-0.435
Token "
Token ID366
Feature activation-8.01e-02

Pos logit contributions
ocol+0.431
 rib+0.368
 microsc+0.351
ifying+0.341
hy+0.340
Neg logit contributions
 someone-0.196
 everything-0.192
 something-0.182
Maybe-0.182
 dad-0.182
TokenI
Token ID40
Feature activation+2.38e+00

Pos logit contributions
 everything+0.042
 someone+0.041
 something+0.038
 dad+0.037
 where+0.037
Neg logit contributions
ocol-0.170
 rib-0.150
 microsc-0.143
ifying-0.141
hy-0.140
Token'm
Token ID1101
Feature activation+8.08e-01

Pos logit contributions
ocol+4.895
 rib+3.928
ifying+3.792
 microsc+3.681
 former+3.587
Neg logit contributions
 someone-2.387
Maybe-2.358
 everything-2.299
 something-2.286
 dad-2.280
Token sorry
Token ID7926
Feature activation-7.23e-01

Pos logit contributions
ocol+1.777
 rib+1.545
 microsc+1.494
ifying+1.459
hy+1.443
Neg logit contributions
 someone-0.448
 everything-0.432
 dad-0.402
 something-0.396
 one-0.379
Token,
Token ID11
Feature activation+5.24e-01

Pos logit contributions
Maybe+0.586
How+0.547
 looked+0.544
 someone+0.539
 everything+0.529
Neg logit contributions
ocol-1.325
 rib-1.135
 microsc-1.094
rou-1.043
hy-1.028
Token Mom
Token ID11254
Feature activation-1.21e-01

Pos logit contributions
ocol+1.254
 rib+1.106
 microsc+1.073
ifying+1.048
hy+1.041
Neg logit contributions
 someone-0.165
 everything-0.156
 something-0.132
 dad-0.132
Maybe-0.131
Token.
Token ID13
Feature activation+5.67e-01

Pos logit contributions
 someone+0.077
 everything+0.074
Maybe+0.070
 something+0.069
 dad+0.067
Neg logit contributions
ocol-0.242
 rib-0.210
 microsc-0.202
hy-0.199
rou-0.199
Token cry
Token ID3960
Feature activation-9.90e-01

Pos logit contributions
ocol+0.781
 rib+0.656
 microsc+0.633
ifying+0.606
hy+0.605
Neg logit contributions
 someone-0.425
 everything-0.415
 dad-0.397
 something-0.395
Maybe-0.395
Token and
Token ID290
Feature activation-3.76e-02

Pos logit contributions
Maybe+0.684
How+0.681
 someone+0.672
 looked+0.638
 everything+0.633
Neg logit contributions
ocol-1.803
 rib-1.621
rou-1.512
 microsc-1.512
ifying-1.509
Token said
Token ID531
Feature activation+1.14e+00

Pos logit contributions
 someone+0.027
 everything+0.027
Maybe+0.026
 something+0.025
 dad+0.025
Neg logit contributions
ocol-0.071
 rib-0.062
 microsc-0.060
hy-0.057
 former-0.057
Token,
Token ID11
Feature activation+1.90e-01

Pos logit contributions
ocol+2.536
 rib+2.177
 microsc+2.119
ifying+2.102
 former+2.079
Neg logit contributions
 someone-0.649
 everything-0.631
 something-0.582
 dad-0.575
 one-0.547
Token "
Token ID366
Feature activation+4.83e-01

Pos logit contributions
ocol+0.343
 rib+0.294
 microsc+0.281
hy+0.271
ifying+0.271
Neg logit contributions
 someone-0.159
 everything-0.157
Maybe-0.149
 something-0.149
 dad-0.148
TokenI
Token ID40
Feature activation+2.37e+00

Pos logit contributions
ocol+1.078
 rib+0.941
 microsc+0.909
ifying+0.892
hy+0.883
Neg logit contributions
 someone-0.229
 everything-0.223
 dad-0.204
 something-0.200
 one-0.192
Token'm
Token ID1101
Feature activation+6.27e-01

Pos logit contributions
ocol+4.786
 rib+3.896
ifying+3.731
 microsc+3.693
hy+3.614
Neg logit contributions
 looked-2.224
 someone-2.203
Maybe-2.195
 turned-2.126
 one-2.123
Token sorry
Token ID7926
Feature activation-4.92e-01

Pos logit contributions
ocol+1.353
 rib+1.170
 microsc+1.133
ifying+1.106
hy+1.097
Neg logit contributions
 someone-0.358
 everything-0.347
 dad-0.321
 something-0.317
Maybe-0.312
Token,
Token ID11
Feature activation+3.86e-01

Pos logit contributions
Maybe+0.360
 someone+0.345
 looked+0.340
How+0.330
 everything+0.330
Neg logit contributions
ocol-0.930
 rib-0.815
 microsc-0.776
rou-0.750
hy-0.736
Token Mom
Token ID11254
Feature activation-2.73e-03

Pos logit contributions
ocol+0.917
 rib+0.811
 microsc+0.785
ifying+0.766
hy+0.761
Neg logit contributions
 someone-0.123
 everything-0.117
Maybe-0.102
 something-0.100
 dad-0.097
Tokenmy
Token ID1820
Feature activation-1.45e-02

Pos logit contributions
 someone+0.002
 everything+0.002
Maybe+0.001
 something+0.001
 dad+0.001
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
rou-0.005
hy-0.005
Token alone
Token ID3436
Feature activation-4.07e-01

Pos logit contributions
 someone+0.251
 everything+0.246
Maybe+0.245
 dad+0.242
 looked+0.232
Neg logit contributions
ocol-0.564
 rib-0.484
 microsc-0.459
ifying-0.455
hy-0.445
Token."
Token ID526
Feature activation-4.49e-01

Pos logit contributions
 everything+0.317
 someone+0.315
Maybe+0.305
 dad+0.303
 looked+0.298
Neg logit contributions
ocol-0.745
 rib-0.639
 microsc-0.609
ifying-0.598
hy-0.587
Token\n
Token ID198
Feature activation-2.78e-01

Pos logit contributions
 everything+0.301
 someone+0.288
 looked+0.286
Maybe+0.271
 one+0.268
Neg logit contributions
ocol-0.867
 rib-0.743
 microsc-0.727
onna-0.683
ifying-0.683
Token\n
Token ID198
Feature activation+4.80e-02

Pos logit contributions
 someone+0.178
 everything+0.177
 one+0.168
 looked+0.168
 something+0.165
Neg logit contributions
ocol-0.549
 rib-0.472
 microsc-0.461
rou-0.441
hy-0.439
Token"
Token ID1
Feature activation+7.71e-02

Pos logit contributions
ocol+0.089
 rib+0.075
 microsc+0.073
ifying+0.070
rou+0.069
Neg logit contributions
 someone-0.041
 everything-0.040
 looked-0.038
 something-0.037
 one-0.037
TokenI
Token ID40
Feature activation+2.36e+00

Pos logit contributions
ocol+0.146
 rib+0.126
 microsc+0.119
ifying+0.117
hy+0.114
Neg logit contributions
 everything-0.062
 someone-0.061
 something-0.057
 dad-0.057
 one-0.056
Token'm
Token ID1101
Feature activation+8.15e-01

Pos logit contributions
ocol+5.101
 rib+4.156
ifying+4.002
 microsc+3.976
pill+3.825
Neg logit contributions
Maybe-2.096
 someone-2.056
 everything-1.996
 dad-1.977
 looked-1.962
Token sorry
Token ID7926
Feature activation-3.76e-01

Pos logit contributions
ocol+1.774
 rib+1.539
 microsc+1.490
ifying+1.455
hy+1.436
Neg logit contributions
 someone-0.466
 everything-0.448
 dad-0.419
 something-0.415
 one-0.398
Token,
Token ID11
Feature activation+2.83e-01

Pos logit contributions
Maybe+0.280
 someone+0.277
 everything+0.274
How+0.270
 looked+0.261
Neg logit contributions
ocol-0.713
 rib-0.613
 microsc-0.583
hy-0.562
onna-0.554
Token Mom
Token ID11254
Feature activation-5.93e-02

Pos logit contributions
ocol+0.670
 rib+0.592
 microsc+0.573
ifying+0.557
hy+0.552
Neg logit contributions
 someone-0.092
 everything-0.088
Maybe-0.078
 something-0.074
 looked-0.071
Tokenmy
Token ID1820
Feature activation+1.78e-01

Pos logit contributions
 someone+0.037
 everything+0.035
Maybe+0.033
 something+0.033
 dad+0.032
Neg logit contributions
ocol-0.121
 rib-0.105
 microsc-0.101
rou-0.099
hy-0.099
Token felt
Token ID2936
Feature activation-3.52e-01

Pos logit contributions
ocol+0.145
 rib+0.125
 microsc+0.119
hy+0.116
ifying+0.115
Neg logit contributions
 someone-0.065
 everything-0.064
 something-0.060
 dad-0.060
Maybe-0.059
Token bad
Token ID2089
Feature activation-1.62e-01

Pos logit contributions
 everything+0.230
 dad+0.229
 someone+0.229
Maybe+0.221
 one+0.210
Neg logit contributions
ocol-0.687
 rib-0.581
hy-0.564
 microsc-0.562
ifying-0.556
Token and
Token ID290
Feature activation+5.43e-01

Pos logit contributions
 someone+0.117
 everything+0.113
 dad+0.110
Maybe+0.110
How+0.105
Neg logit contributions
ocol-0.306
 rib-0.267
 microsc-0.256
hy-0.251
ifying-0.246
Token said
Token ID531
Feature activation+4.63e-01

Pos logit contributions
ocol+1.072
 rib+0.916
 microsc+0.884
ifying+0.858
hy+0.854
Neg logit contributions
 someone-0.397
 everything-0.389
 dad-0.366
 something-0.363
Maybe-0.360
Token,"
Token ID553
Feature activation+3.10e-02

Pos logit contributions
ocol+0.969
 rib+0.834
 microsc+0.807
ifying+0.784
hy+0.782
Neg logit contributions
 someone-0.282
 everything-0.274
 dad-0.257
Maybe-0.256
 something-0.251
TokenI
Token ID40
Feature activation+2.36e+00

Pos logit contributions
ocol+0.056
 rib+0.048
 microsc+0.045
ifying+0.044
hy+0.043
Neg logit contributions
 someone-0.027
 everything-0.026
 dad-0.026
 something-0.025
 one-0.024
Token'm
Token ID1101
Feature activation+2.54e-01

Pos logit contributions
ocol+4.968
 rib+4.101
 microsc+4.065
ifying+3.985
hy+3.703
Neg logit contributions
 looked-1.935
Maybe-1.913
 someone-1.891
 one-1.812
 turned-1.802
Token sorry
Token ID7926
Feature activation-4.07e-01

Pos logit contributions
ocol+0.527
 rib+0.450
 microsc+0.434
hy+0.426
ifying+0.425
Neg logit contributions
 someone-0.155
 everything-0.155
Maybe-0.148
 dad-0.143
 something-0.139
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
 someone+0.373
 everything+0.368
Maybe+0.365
 looked+0.362
 dad+0.353
Neg logit contributions
ocol-0.689
 rib-0.584
 microsc-0.547
rou-0.530
ifying-0.528
Token let
Token ID1309
Feature activation+1.09e-01

Pos logit contributions
 someone+0.069
 everything+0.068
 dad+0.064
Maybe+0.064
 something+0.063
Neg logit contributions
ocol-0.197
 rib-0.173
 microsc-0.162
hy-0.158
ifying-0.157
Token me
Token ID502
Feature activation+3.30e-02

Pos logit contributions
ocol+0.214
 rib+0.183
 microsc+0.175
ifying+0.170
hy+0.167
Neg logit contributions
 someone-0.075
 everything-0.072
 dad-0.071
Maybe-0.070
 something-0.069
Token computer
Token ID3644
Feature activation-7.91e-01

Pos logit contributions
ocol+1.356
 rib+1.122
 microsc+1.074
ifying+1.049
hy+1.036
Neg logit contributions
 someone-0.747
 everything-0.735
 dad-0.707
 something-0.697
 one-0.689
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
 someone+0.565
 dad+0.550
 everything+0.545
Maybe+0.534
How+0.524
Neg logit contributions
ocol-1.447
 rib-1.284
hy-1.226
 microsc-1.226
ifying-1.188
Token She
Token ID1375
Feature activation+6.14e-01

Pos logit contributions
 everything+0.446
 someone+0.437
 looked+0.423
 one+0.419
 somewhere+0.414
Neg logit contributions
ocol-0.920
 microsc-0.783
 rib-0.776
ifying-0.745
hy-0.736
Token said
Token ID531
Feature activation+7.11e-01

Pos logit contributions
ocol+1.166
 rib+0.988
 microsc+0.952
ifying+0.925
hy+0.917
Neg logit contributions
 someone-0.500
 everything-0.490
 dad-0.464
 something-0.461
Maybe-0.455
Token "
Token ID366
Feature activation+1.24e-01

Pos logit contributions
ocol+1.449
 rib+1.240
 microsc+1.198
ifying+1.172
hy+1.158
Neg logit contributions
 someone-0.490
 everything-0.478
 dad-0.450
 something-0.444
Maybe-0.433
TokenI
Token ID40
Feature activation+2.35e+00

Pos logit contributions
ocol+0.241
 rib+0.207
 microsc+0.197
ifying+0.195
hy+0.192
Neg logit contributions
 everything-0.092
 someone-0.091
 dad-0.086
 something-0.085
 where-0.084
Token'm
Token ID1101
Feature activation+3.35e-01

Pos logit contributions
ocol+4.838
 rib+3.889
ifying+3.793
 microsc+3.725
 former+3.559
Neg logit contributions
 someone-2.193
Maybe-2.165
 everything-2.097
 something-2.094
 one-2.086
Token sorry
Token ID7926
Feature activation-8.34e-01

Pos logit contributions
ocol+0.701
 rib+0.599
 microsc+0.580
ifying+0.565
hy+0.563
Neg logit contributions
 someone-0.205
 everything-0.202
Maybe-0.194
 dad-0.184
 something-0.183
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
Maybe+0.684
 looked+0.649
How+0.629
 someone+0.621
 everything+0.609
Neg logit contributions
ocol-1.508
 rib-1.297
 microsc-1.252
rou-1.185
hy-1.173
Token dad
Token ID9955
Feature activation-7.23e-02

Pos logit contributions
ocol+0.368
 rib+0.323
 microsc+0.312
ifying+0.305
hy+0.301
Neg logit contributions
 someone-0.066
 everything-0.063
Maybe-0.059
 something-0.055
 looked-0.055
Token.
Token ID13
Feature activation+7.74e-01

Pos logit contributions
 someone+0.045
 everything+0.044
Maybe+0.042
 something+0.040
 dad+0.040
Neg logit contributions
ocol-0.147
 rib-0.127
 microsc-0.122
hy-0.121
ifying-0.119
Tokenaches
Token ID3694
Feature activation+5.71e-01

Pos logit contributions
ocol+0.707
 rib+0.605
 microsc+0.581
hy+0.566
ifying+0.563
Neg logit contributions
 someone-0.252
 everything-0.246
 dad-0.237
Maybe-0.235
 something-0.230
Token."
Token ID526
Feature activation-2.73e-01

Pos logit contributions
ocol+1.186
 rib+1.021
 microsc+0.987
ifying+0.964
hy+0.955
Neg logit contributions
 someone-0.356
 everything-0.346
 dad-0.324
 something-0.319
Maybe-0.316
Token
Token ID198
Feature activation-1.45e-01

Pos logit contributions
 everything+0.204
 someone+0.198
Maybe+0.191
 looked+0.190
 something+0.188
Neg logit contributions
ocol-0.512
 rib-0.433
 microsc-0.427
hy-0.415
onna-0.406
Token
Token ID198
Feature activation-3.58e-02

Pos logit contributions
 someone+0.090
 everything+0.090
 one+0.086
 something+0.085
 looked+0.084
Neg logit contributions
ocol-0.289
 rib-0.248
 microsc-0.242
hy-0.236
rou-0.232
Token"
Token ID1
Feature activation-2.68e-02

Pos logit contributions
 someone+0.037
 everything+0.037
 dad+0.035
 something+0.035
 one+0.035
Neg logit contributions
ocol-0.059
 rib-0.048
 microsc-0.047
hy-0.045
ifying-0.045
TokenI
Token ID40
Feature activation+2.34e+00

Pos logit contributions
 everything+0.022
 someone+0.022
 dad+0.020
 something+0.020
 looked+0.020
Neg logit contributions
ocol-0.050
 rib-0.043
 microsc-0.040
ifying-0.039
hy-0.039
Token'm
Token ID1101
Feature activation+6.42e-01

Pos logit contributions
ocol+4.841
 rib+3.923
ifying+3.775
 microsc+3.772
 former+3.548
Neg logit contributions
 someone-2.231
Maybe-2.220
 everything-2.152
 looked-2.130
 they-2.116
Token sorry
Token ID7926
Feature activation-1.57e-01

Pos logit contributions
ocol+1.382
 rib+1.194
 microsc+1.156
ifying+1.128
hy+1.119
Neg logit contributions
 someone-0.371
 everything-0.360
 dad-0.334
 something-0.330
Maybe-0.323
Token,
Token ID11
Feature activation+8.63e-01

Pos logit contributions
Maybe+0.110
 someone+0.110
 everything+0.107
How+0.104
 looked+0.104
Neg logit contributions
ocol-0.308
 rib-0.262
 microsc-0.253
hy-0.245
rou-0.240
Token Tom
Token ID4186
Feature activation-1.81e-01

Pos logit contributions
ocol+1.620
 rib+1.361
 microsc+1.316
ifying+1.288
hy+1.262
Neg logit contributions
 someone-0.754
 everything-0.735
 dad-0.710
 something-0.693
 one-0.684
Token,"
Token ID553
Feature activation+3.23e-01

Pos logit contributions
 someone+0.159
 everything+0.158
Maybe+0.150
 something+0.149
 dad+0.147
Neg logit contributions
ocol-0.316
 rib-0.267
hy-0.253
 microsc-0.252
rou-0.245
Token.
Token ID13
Feature activation+9.48e-01

Pos logit contributions
ocol+1.827
 rib+1.576
 microsc+1.533
ifying+1.504
 former+1.476
Neg logit contributions
 someone-0.483
 everything-0.471
 dad-0.439
 something-0.428
 one-0.425
Token I
Token ID314
Feature activation+1.98e+00

Pos logit contributions
ocol+1.999
 rib+1.722
 microsc+1.663
ifying+1.622
 former+1.608
Neg logit contributions
 someone-0.601
 everything-0.571
 dad-0.543
 something-0.532
 one-0.521
Token was
Token ID373
Feature activation+1.03e+00

Pos logit contributions
ocol+4.347
 rib+3.607
ifying+3.438
 microsc+3.435
hy+3.338
Neg logit contributions
Maybe-1.461
 looked-1.459
 someone-1.448
 everything-1.418
 something-1.374
Token mad
Token ID8805
Feature activation-4.07e-01

Pos logit contributions
ocol+2.192
 rib+1.889
 microsc+1.824
ifying+1.799
hy+1.765
Neg logit contributions
 someone-0.680
 everything-0.652
 something-0.621
 dad-0.606
 one-0.602
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
 someone+0.397
 everything+0.397
Maybe+0.390
 looked+0.385
How+0.377
Neg logit contributions
ocol-0.665
 rib-0.552
 microsc-0.522
rou-0.514
hy-0.507
Token I
Token ID314
Feature activation+2.34e+00

Pos logit contributions
ocol+2.488
 rib+2.149
 microsc+2.073
ifying+2.035
 former+2.028
Neg logit contributions
 someone-0.713
 everything-0.656
 dad-0.635
 something-0.617
 one-0.603
Token wanted
Token ID2227
Feature activation-3.02e-01

Pos logit contributions
ocol+5.266
 rib+4.333
ifying+4.183
 microsc+4.154
hy+3.987
Neg logit contributions
 looked-1.756
Maybe-1.734
 someone-1.664
 turned-1.626
 everything-1.621
Token you
Token ID345
Feature activation+6.94e-01

Pos logit contributions
Maybe+0.356
 everything+0.349
 someone+0.347
 looked+0.344
They+0.339
Neg logit contributions
ocol-0.436
 rib-0.366
 microsc-0.344
hy-0.319
ifying-0.312
Token to
Token ID284
Feature activation+7.92e-01

Pos logit contributions
ocol+1.137
 rib+0.931
 microsc+0.892
ifying+0.860
hy+0.850
Neg logit contributions
 someone-0.755
 everything-0.744
 dad-0.715
 something-0.713
Maybe-0.703
Token play
Token ID711
Feature activation+1.35e-01

Pos logit contributions
ocol+1.653
 rib+1.413
 microsc+1.373
ifying+1.343
hy+1.317
Neg logit contributions
 someone-0.520
 everything-0.505
 dad-0.472
 something-0.467
 one-0.457
Token with
Token ID351
Feature activation-1.46e-01

Pos logit contributions
ocol+0.283
 rib+0.244
 microsc+0.233
hy+0.227
ifying+0.226
Neg logit contributions
 someone-0.077
 everything-0.077
 dad-0.074
Maybe-0.071
 looked-0.070
TokenThe
Token ID464
Feature activation-5.65e-01

Pos logit contributions
ocol+0.318
 rib+0.269
 microsc+0.256
ifying+0.248
hy+0.246
Neg logit contributions
 someone-0.175
 everything-0.173
 dad-0.167
 something-0.164
 looked-0.162
Token little
Token ID1310
Feature activation-2.83e-01

Pos logit contributions
 everything+0.508
 someone+0.498
Maybe+0.475
They+0.471
 dad+0.465
Neg logit contributions
ocol-0.978
 rib-0.877
 microsc-0.764
hy-0.763
 gol-0.747
Token girl
Token ID2576
Feature activation+2.69e-01

Pos logit contributions
 everything+0.187
 someone+0.186
 dad+0.174
They+0.171
Maybe+0.170
Neg logit contributions
ocol-0.565
 rib-0.507
 microsc-0.456
hy-0.455
 gol-0.446
Token was
Token ID373
Feature activation-3.22e-01

Pos logit contributions
ocol+0.488
 rib+0.420
 microsc+0.398
hy+0.386
ifying+0.385
Neg logit contributions
 someone-0.232
 everything-0.227
 dad-0.214
 something-0.214
Maybe-0.211
Token su
Token ID424
Feature activation+3.07e-01

Pos logit contributions
 someone+0.219
 everything+0.213
Maybe+0.209
 dad+0.199
How+0.193
Neg logit contributions
ocol-0.622
 rib-0.537
hy-0.516
 microsc-0.512
rou-0.511
Tokenpr
Token ID1050
Feature activation+2.34e+00

Pos logit contributions
ocol+0.432
 rib+0.340
 microsc+0.320
ifying+0.311
hy+0.304
Neg logit contributions
 someone-0.407
 everything-0.399
 dad-0.391
 something-0.385
Maybe-0.381
Tokenised
Token ID1417
Feature activation+6.25e-01

Pos logit contributions
ocol+4.013
 rib+3.493
 microsc+3.405
hy+3.113
pri+3.069
Neg logit contributions
They-2.206
 one-2.191
 someone-2.182
 everything-2.177
 looked-2.173
Token.
Token ID13
Feature activation-3.19e-01

Pos logit contributions
ocol+1.178
 rib+0.996
 microsc+0.959
ifying+0.933
hy+0.924
Neg logit contributions
 someone-0.519
 everything-0.508
 dad-0.482
 something-0.478
Maybe-0.474
Token She
Token ID1375
Feature activation+2.38e-01

Pos logit contributions
 everything+0.283
 someone+0.281
 something+0.267
 dad+0.265
Maybe+0.263
Neg logit contributions
ocol-0.559
 rib-0.482
 microsc-0.457
ifying-0.448
hy-0.446
Token had
Token ID550
Feature activation+5.59e-01

Pos logit contributions
ocol+0.444
 rib+0.385
 microsc+0.364
ifying+0.353
hy+0.352
Neg logit contributions
 someone-0.193
 everything-0.189
 dad-0.178
 something-0.178
Maybe-0.173
Token never
Token ID1239
Feature activation+4.83e-01

Pos logit contributions
ocol+1.166
 rib+1.008
 microsc+0.973
ifying+0.948
hy+0.941
Neg logit contributions
 someone-0.346
 everything-0.336
 dad-0.315
 something-0.309
Maybe-0.307
Token to
Token ID284
Feature activation+4.59e-01

Pos logit contributions
ocol+1.108
 rib+0.924
 microsc+0.887
ifying+0.860
hy+0.851
Neg logit contributions
 someone-0.607
 everything-0.596
 dad-0.570
 something-0.566
Maybe-0.561
Token share
Token ID2648
Feature activation+5.41e-01

Pos logit contributions
ocol+0.895
 rib+0.764
 microsc+0.736
hy+0.711
ifying+0.710
Neg logit contributions
 someone-0.346
 everything-0.338
 dad-0.321
 something-0.317
Maybe-0.317
Token and
Token ID290
Feature activation+8.45e-01

Pos logit contributions
ocol+1.139
 rib+0.984
 microsc+0.951
ifying+0.925
hy+0.917
Neg logit contributions
 someone-0.325
 everything-0.315
 dad-0.294
 something-0.289
Maybe-0.288
Token wait
Token ID4043
Feature activation+6.18e-01

Pos logit contributions
ocol+1.738
 rib+1.477
 microsc+1.433
ifying+1.410
hy+1.387
Neg logit contributions
 someone-0.577
 everything-0.561
 dad-0.526
 something-0.523
 one-0.510
Token.
Token ID13
Feature activation+5.69e-01

Pos logit contributions
ocol+1.095
 rib+0.915
 microsc+0.879
ifying+0.852
hy+0.843
Neg logit contributions
 someone-0.579
 everything-0.569
 dad-0.543
 something-0.539
Maybe-0.535
Token How
Token ID1374
Feature activation+2.33e+00

Pos logit contributions
ocol+0.937
 rib+0.771
 microsc+0.739
ifying+0.715
hy+0.706
Neg logit contributions
 someone-0.602
 everything-0.594
 dad-0.570
 something-0.566
Maybe-0.561
Token about
Token ID546
Feature activation+1.78e-01

Pos logit contributions
ocol+4.391
 rib+3.447
ifying+3.307
 former+3.239
hy+3.190
Neg logit contributions
 everything-2.677
 someone-2.653
 something-2.554
 they-2.478
 dad-2.461
Token we
Token ID356
Feature activation+5.28e-01

Pos logit contributions
ocol+0.369
 rib+0.320
 microsc+0.310
ifying+0.297
 former+0.294
Neg logit contributions
 someone-0.108
 everything-0.107
Maybe-0.104
 dad-0.099
 looked-0.098
Token clean
Token ID3424
Feature activation+3.70e-01

Pos logit contributions
ocol+0.919
 rib+0.768
 microsc+0.737
ifying+0.714
hy+0.706
Neg logit contributions
 someone-0.510
 everything-0.501
 dad-0.478
 something-0.476
Maybe-0.474
Token up
Token ID510
Feature activation+5.21e-01

Pos logit contributions
ocol+0.780
 rib+0.676
 microsc+0.653
ifying+0.633
 former+0.623
Neg logit contributions
 someone-0.222
 everything-0.212
Maybe-0.202
 dad-0.199
 looked-0.193
Token this
Token ID428
Feature activation-2.72e-01

Pos logit contributions
ocol+1.048
 rib+0.898
 microsc+0.866
ifying+0.843
hy+0.833
Neg logit contributions
 someone-0.359
 everything-0.351
 dad-0.329
Maybe-0.326
 something-0.324
Token Lily
Token ID20037
Feature activation-1.75e-01

Pos logit contributions
ocol+0.078
 rib+0.066
 microsc+0.064
hy+0.061
ifying+0.060
Neg logit contributions
 someone-0.039
 everything-0.038
Maybe-0.037
 something-0.036
 somewhere-0.035
Token.
Token ID13
Feature activation+2.82e-01

Pos logit contributions
 someone+0.112
 everything+0.105
Maybe+0.101
 something+0.100
How+0.100
Neg logit contributions
ocol-0.348
 rib-0.307
 microsc-0.294
hy-0.292
ifying-0.284
Token They
Token ID1119
Feature activation+1.15e-01

Pos logit contributions
ocol+0.572
 rib+0.497
 microsc+0.480
ifying+0.469
hy+0.460
Neg logit contributions
 someone-0.184
 everything-0.178
 dad-0.169
 something-0.166
Maybe-0.164
Token were
Token ID547
Feature activation+2.82e-01

Pos logit contributions
ocol+0.199
 rib+0.170
 microsc+0.162
ifying+0.158
hy+0.157
Neg logit contributions
 someone-0.110
 everything-0.105
 something-0.102
 dad-0.101
Maybe-0.100
Token very
Token ID845
Feature activation+1.00e+00

Pos logit contributions
ocol+0.605
 rib+0.525
 microsc+0.508
ifying+0.495
hy+0.495
Neg logit contributions
 someone-0.155
 everything-0.147
Maybe-0.142
 something-0.137
 dad-0.134
Token close
Token ID1969
Feature activation+2.32e+00

Pos logit contributions
ocol+2.115
 rib+1.830
 microsc+1.765
ifying+1.737
hy+1.702
Neg logit contributions
 someone-0.615
 everything-0.602
 dad-0.586
 one-0.545
 something-0.543
Token,
Token ID11
Feature activation-3.99e-01

Pos logit contributions
ocol+4.612
 rib+3.666
 microsc+3.599
ifying+3.514
 former+3.465
Neg logit contributions
 everything-2.150
 dad-2.113
 one-2.096
 someone-2.016
 they-1.986
Token but
Token ID475
Feature activation+2.79e-01

Pos logit contributions
 someone+0.330
 everything+0.320
Maybe+0.313
How+0.305
 something+0.304
Neg logit contributions
ocol-0.691
 rib-0.626
hy-0.589
 microsc-0.587
ifying-0.563
Token they
Token ID484
Feature activation+1.03e-01

Pos logit contributions
ocol+0.655
 rib+0.582
 microsc+0.563
ifying+0.549
hy+0.544
Neg logit contributions
 someone-0.088
 everything-0.082
 dad-0.073
Maybe-0.070
 something-0.069
Token were
Token ID547
Feature activation+3.91e-02

Pos logit contributions
ocol+0.171
 rib+0.145
 microsc+0.139
ifying+0.134
hy+0.133
Neg logit contributions
 someone-0.102
 everything-0.098
 something-0.095
 dad-0.094
Maybe-0.093
Token also
Token ID635
Feature activation+1.26e-02

Pos logit contributions
ocol+0.083
 rib+0.073
 microsc+0.070
ifying+0.068
hy+0.068
Neg logit contributions
 someone-0.021
Maybe-0.020
 everything-0.020
 something-0.018
 dad-0.018
Token down
Token ID866
Feature activation+2.77e-01

Pos logit contributions
ocol+0.224
 rib+0.195
 microsc+0.186
ifying+0.183
hy+0.178
Neg logit contributions
 someone-0.070
 everything-0.067
Maybe-0.065
 dad-0.063
 something-0.061
Token and
Token ID290
Feature activation-8.12e-02

Pos logit contributions
ocol+0.545
 rib+0.468
 microsc+0.449
ifying+0.440
hy+0.432
Neg logit contributions
 someone-0.198
 everything-0.193
 dad-0.183
Maybe-0.182
 something-0.179
Token the
Token ID262
Feature activation-4.57e-01

Pos logit contributions
 someone+0.062
 everything+0.059
 dad+0.057
Maybe+0.057
 something+0.056
Neg logit contributions
ocol-0.153
 rib-0.135
 microsc-0.127
ifying-0.123
 former-0.123
Token butterfly
Token ID35113
Feature activation-2.46e-01

Pos logit contributions
 everything+0.437
 someone+0.434
Maybe+0.416
They+0.415
 dad+0.415
Neg logit contributions
ocol-0.762
 rib-0.684
 microsc-0.617
hy-0.602
 former-0.597
Token slowly
Token ID6364
Feature activation-2.33e-01

Pos logit contributions
 someone+0.196
 everything+0.191
 dad+0.187
 something+0.180
Maybe+0.178
Neg logit contributions
ocol-0.447
 rib-0.402
 microsc-0.368
hy-0.364
ifying-0.360
Token unw
Token ID7379
Feature activation+2.32e+00

Pos logit contributions
 someone+0.177
 everything+0.171
 dad+0.166
Maybe+0.163
 something+0.161
Neg logit contributions
ocol-0.444
 rib-0.395
hy-0.361
 microsc-0.359
ifying-0.356
Tokenound
Token ID633
Feature activation+4.54e-01

Pos logit contributions
ocol+2.918
 microsc+2.266
 rib+2.164
hy+2.095
pill+2.071
Neg logit contributions
 someone-3.369
 everything-3.311
 dad-3.304
Mom-3.299
 next-3.298
Token the
Token ID262
Feature activation+4.99e-01

Pos logit contributions
ocol+0.942
 rib+0.814
 microsc+0.783
ifying+0.764
hy+0.757
Neg logit contributions
 someone-0.283
 everything-0.276
 dad-0.260
Maybe-0.256
 something-0.253
Token thread
Token ID4704
Feature activation+5.63e-01

Pos logit contributions
ocol+0.870
 rib+0.732
 microsc+0.697
ifying+0.677
hy+0.670
Neg logit contributions
 someone-0.479
 everything-0.472
 dad-0.451
 something-0.447
Maybe-0.445
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
ocol+1.191
 rib+1.030
 microsc+0.995
ifying+0.973
hy+0.965
Neg logit contributions
 someone-0.330
 everything-0.321
 dad-0.298
 something-0.294
Maybe-0.291
Token
Token ID220
Feature activation-1.93e-02

Pos logit contributions
 someone+0.294
 everything+0.285
 dad+0.279
 something+0.279
Maybe+0.277
Neg logit contributions
ocol-0.432
 rib-0.363
 microsc-0.351
ifying-0.336
hy-0.331
Token clue
Token ID18437
Feature activation-3.00e-01

Pos logit contributions
Maybe+1.824
They+1.812
 everything+1.809
 someone+1.734
 something+1.703
Neg logit contributions
ocol-3.131
 rib-2.882
 gol-2.628
hy-2.616
 topp-2.600
Token.
Token ID13
Feature activation-3.23e-02

Pos logit contributions
 everything+0.187
 someone+0.184
Maybe+0.176
How+0.170
 dad+0.168
Neg logit contributions
ocol-0.598
 rib-0.528
 microsc-0.505
hy-0.490
ifying-0.488
Token
Token ID198
Feature activation-5.96e-01

Pos logit contributions
 everything+0.022
 someone+0.022
 dad+0.020
 something+0.020
 looked+0.020
Neg logit contributions
ocol-0.063
 rib-0.055
 microsc-0.053
ifying-0.052
hy-0.052
Token
Token ID198
Feature activation-3.80e-01

Pos logit contributions
 someone+0.411
 everything+0.397
 looked+0.397
 one+0.395
 something+0.387
Neg logit contributions
ocol-1.111
 rib-0.957
 microsc-0.940
onna-0.902
hy-0.893
TokenThe
Token ID464
Feature activation-9.94e-01

Pos logit contributions
 someone+0.209
 everything+0.204
 dad+0.201
 looked+0.197
 one+0.191
Neg logit contributions
ocol-0.779
 rib-0.689
 microsc-0.662
ifying-0.654
hy-0.642
Token next
Token ID1306
Feature activation-2.61e+00

Pos logit contributions
 everything+0.914
 someone+0.893
Maybe+0.880
They+0.863
 looked+0.847
Neg logit contributions
ocol-1.679
 rib-1.508
 microsc-1.439
hy-1.357
ifying-1.317
Token clue
Token ID18437
Feature activation-1.48e-01

Pos logit contributions
Maybe+2.716
 someone+2.576
 everything+2.559
 what+2.526
They+2.504
Neg logit contributions
ocol-3.674
 gol-3.418
hy-3.376
 rib-3.335
pri-3.163
Token said
Token ID531
Feature activation+2.72e-01

Pos logit contributions
 everything+0.110
 someone+0.108
Maybe+0.104
How+0.103
 dad+0.102
Neg logit contributions
ocol-0.278
 rib-0.242
 microsc-0.230
hy-0.225
 topp-0.217
Token:
Token ID25
Feature activation-2.61e-01

Pos logit contributions
ocol+0.512
 rib+0.434
 microsc+0.417
ifying+0.405
hy+0.404
Neg logit contributions
 someone-0.217
 everything-0.217
Maybe-0.207
 dad-0.204
 looked-0.199
Token "
Token ID366
Feature activation-1.80e-01

Pos logit contributions
 someone+0.226
 everything+0.226
Maybe+0.215
 dad+0.213
 looked+0.211
Neg logit contributions
ocol-0.458
 rib-0.393
 microsc-0.378
ifying-0.365
hy-0.360
TokenLook
Token ID8567
Feature activation-6.06e-01

Pos logit contributions
 everything+0.173
 someone+0.169
 dad+0.166
 something+0.162
 where+0.159
Neg logit contributions
ocol-0.311
 rib-0.268
 microsc-0.248
ifying-0.245
hy-0.244
Token wrong
Token ID2642
Feature activation-4.33e-01

Pos logit contributions
ocol+2.123
 rib+1.776
 microsc+1.698
ifying+1.691
 former+1.662
Neg logit contributions
 someone-0.917
 everything-0.903
 something-0.861
 dad-0.829
 one-0.824
Token.
Token ID13
Feature activation-4.46e-01

Pos logit contributions
 everything+0.276
 someone+0.275
Maybe+0.266
 dad+0.260
How+0.257
Neg logit contributions
ocol-0.829
 rib-0.727
 microsc-0.705
hy-0.678
ifying-0.676
Token
Token ID198
Feature activation-1.24e-01

Pos logit contributions
 someone+0.317
 everything+0.316
 something+0.304
 dad+0.297
 one+0.297
Neg logit contributions
ocol-0.828
 microsc-0.716
 rib-0.707
 former-0.671
hy-0.669
Token
Token ID198
Feature activation-4.43e-01

Pos logit contributions
 someone+0.078
 everything+0.076
 one+0.073
 something+0.073
 dad+0.072
Neg logit contributions
ocol-0.247
 rib-0.211
 microsc-0.210
hy-0.200
rou-0.197
TokenThe
Token ID464
Feature activation-8.95e-01

Pos logit contributions
 someone+0.412
 dad+0.406
 everything+0.404
 one+0.394
 looked+0.391
Neg logit contributions
ocol-0.749
 rib-0.631
 microsc-0.623
ifying-0.596
 former-0.589
Token next
Token ID1306
Feature activation-2.59e+00

Pos logit contributions
 everything+0.648
 someone+0.635
They+0.623
 something+0.621
Maybe+0.605
Neg logit contributions
ocol-1.675
 rib-1.512
 microsc-1.400
hy-1.392
 former-1.364
Token day
Token ID1110
Feature activation+5.51e-01

Pos logit contributions
 what+2.707
 everything+2.685
Maybe+2.654
They+2.638
 something+2.603
Neg logit contributions
ocol-3.731
 gol-3.714
hy-3.374
 rib-3.144
pri-3.123
Token,
Token ID11
Feature activation-3.33e-01

Pos logit contributions
ocol+1.161
 rib+1.001
 microsc+0.968
ifying+0.944
hy+0.936
Neg logit contributions
 someone-0.334
 everything-0.325
 dad-0.304
 something-0.299
Maybe-0.298
Token the
Token ID262
Feature activation-6.81e-01

Pos logit contributions
Maybe+0.216
 someone+0.211
They+0.207
 everything+0.203
 dad+0.201
Neg logit contributions
ocol-0.647
 rib-0.576
 microsc-0.545
ifying-0.519
 former-0.518
Token store
Token ID3650
Feature activation-2.30e-01

Pos logit contributions
 everything+0.637
 someone+0.634
They+0.625
 something+0.613
Maybe+0.611
Neg logit contributions
ocol-1.125
 rib-1.012
 microsc-0.915
hy-0.905
 former-0.883
Token owner
Token ID4870
Feature activation-5.21e-01

Pos logit contributions
 everything+0.171
 someone+0.168
 something+0.160
How+0.159
Maybe+0.157
Neg logit contributions
ocol-0.418
 rib-0.371
hy-0.356
 microsc-0.350
 gol-0.339
Token lonely
Token ID21757
Feature activation-5.38e-01

Pos logit contributions
Maybe+0.081
 someone+0.079
 everything+0.078
 looked+0.074
 dad+0.072
Neg logit contributions
ocol-0.260
 rib-0.220
 microsc-0.212
ifying-0.208
 former-0.206
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
 everything+0.366
 someone+0.365
Maybe+0.361
Mom+0.343
How+0.342
Neg logit contributions
ocol-1.017
 rib-0.880
 microsc-0.845
ifying-0.833
hy-0.823
Token
Token ID198
Feature activation-3.11e-01

Pos logit contributions
ocol+0.408
 rib+0.351
 microsc+0.344
ifying+0.337
hy+0.333
Neg logit contributions
 everything-0.114
 someone-0.114
 something-0.104
 dad-0.104
 one-0.102
Token
Token ID198
Feature activation+7.87e-02

Pos logit contributions
 someone+0.203
 everything+0.202
 looked+0.191
 something+0.190
 dad+0.189
Neg logit contributions
ocol-0.601
 rib-0.514
 microsc-0.504
hy-0.489
ifying-0.480
TokenThe
Token ID464
Feature activation-5.78e-01

Pos logit contributions
ocol+0.160
 rib+0.138
 microsc+0.133
ifying+0.132
hy+0.131
Neg logit contributions
 someone-0.050
 everything-0.049
 dad-0.046
 something-0.045
 looked-0.045
Token next
Token ID1306
Feature activation-2.57e+00

Pos logit contributions
 someone+0.280
 everything+0.276
 something+0.248
 looked+0.243
 where+0.243
Neg logit contributions
ocol-1.235
 rib-1.108
 microsc-1.052
 former-1.030
hy-1.023
Token day
Token ID1110
Feature activation+2.39e-01

Pos logit contributions
 what+2.670
 everything+2.646
Maybe+2.641
 someone+2.577
 something+2.539
Neg logit contributions
ocol-3.667
 rib-3.112
hy-3.096
 gol-3.055
 topp-2.993
Token,
Token ID11
Feature activation+3.02e-02

Pos logit contributions
ocol+0.487
 rib+0.418
 microsc+0.403
ifying+0.395
hy+0.391
Neg logit contributions
 someone-0.150
 everything-0.147
 dad-0.140
Maybe-0.140
 something-0.135
Token the
Token ID262
Feature activation-2.97e-01

Pos logit contributions
ocol+0.058
 rib+0.051
 microsc+0.049
ifying+0.047
hy+0.047
Neg logit contributions
 someone-0.020
 everything-0.020
Maybe-0.019
 dad-0.019
 looked-0.019
Token sun
Token ID4252
Feature activation+7.98e-02

Pos logit contributions
 someone+0.199
 everything+0.197
Maybe+0.182
 something+0.182
 looked+0.179
Neg logit contributions
ocol-0.585
 rib-0.517
 microsc-0.488
 former-0.474
hy-0.473
Token came
Token ID1625
Feature activation+6.72e-02

Pos logit contributions
ocol+0.160
 rib+0.140
ifying+0.132
 microsc+0.132
hy+0.130
Neg logit contributions
 someone-0.052
 everything-0.050
 something-0.047
 dad-0.047
Maybe-0.045
Token unfair
Token ID11675
Feature activation-5.45e-01

Pos logit contributions
ocol+2.003
 rib+1.692
 microsc+1.649
ifying+1.607
 former+1.579
Neg logit contributions
 someone-0.671
 everything-0.654
 dad-0.616
 something-0.607
 looked-0.596
Token.
Token ID13
Feature activation-1.12e-01

Pos logit contributions
 someone+0.341
Maybe+0.326
 everything+0.325
How+0.313
Mom+0.308
Neg logit contributions
ocol-1.054
 rib-0.929
 microsc-0.907
hy-0.889
ifying-0.872
Token
Token ID198
Feature activation-3.49e-01

Pos logit contributions
 someone+0.072
 everything+0.071
 something+0.068
 dad+0.066
 looked+0.064
Neg logit contributions
ocol-0.221
 rib-0.190
 microsc-0.188
hy-0.181
ifying-0.180
Token
Token ID198
Feature activation-4.18e-01

Pos logit contributions
 someone+0.228
 everything+0.223
 something+0.215
 dad+0.214
 looked+0.210
Neg logit contributions
ocol-0.671
 rib-0.574
 microsc-0.566
hy-0.548
ifying-0.540
TokenThe
Token ID464
Feature activation-1.11e-01

Pos logit contributions
 someone+0.336
 everything+0.326
 dad+0.323
 something+0.311
 looked+0.311
Neg logit contributions
ocol-0.766
 rib-0.646
 microsc-0.627
ifying-0.613
hy-0.608
Token next
Token ID1306
Feature activation-2.50e+00

Pos logit contributions
 someone+0.088
 everything+0.085
 something+0.083
 looked+0.080
Maybe+0.080
Neg logit contributions
ocol-0.207
 rib-0.184
hy-0.167
 microsc-0.167
ifying-0.164
Token day
Token ID1110
Feature activation+2.53e-01

Pos logit contributions
Maybe+2.631
 what+2.594
 someone+2.591
 something+2.558
 everything+2.521
Neg logit contributions
ocol-3.526
 gol-3.337
hy-3.087
 rib-3.053
spe-2.986
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
ocol+0.517
 rib+0.444
 microsc+0.427
ifying+0.417
hy+0.413
Neg logit contributions
 someone-0.161
 everything-0.156
Maybe-0.152
 dad-0.149
 something-0.144
Token they
Token ID484
Feature activation-3.89e-01

Pos logit contributions
 someone+0.102
Maybe+0.099
 everything+0.097
 dad+0.097
 looked+0.094
Neg logit contributions
ocol-0.305
 rib-0.266
 microsc-0.254
ifying-0.245
 topp-0.241
Token decided
Token ID3066
Feature activation+9.55e-02

Pos logit contributions
 someone+0.252
 everything+0.239
 something+0.233
Maybe+0.222
How+0.220
Neg logit contributions
ocol-0.757
 rib-0.680
 microsc-0.650
ifying-0.642
hy-0.632
Token to
Token ID284
Feature activation+1.16e-02

Pos logit contributions
ocol+0.140
 rib+0.118
 microsc+0.111
hy+0.106
ifying+0.105
Neg logit contributions
 someone-0.109
 everything-0.108
Maybe-0.106
 dad-0.104
 looked-0.103
Token say
Token ID910
Feature activation-1.64e-01

Pos logit contributions
 someone+0.510
 everything+0.475
Maybe+0.469
 something+0.468
How+0.465
Neg logit contributions
ocol-0.890
 rib-0.783
 microsc-0.707
hy-0.696
ifying-0.677
Token.
Token ID13
Feature activation-3.44e-01

Pos logit contributions
 someone+0.101
 everything+0.100
Maybe+0.095
 dad+0.092
How+0.091
Neg logit contributions
ocol-0.335
 rib-0.295
hy-0.276
 microsc-0.275
ifying-0.270
Token
Token ID198
Feature activation-1.40e-01

Pos logit contributions
 everything+0.252
 someone+0.249
 dad+0.235
 something+0.234
 looked+0.230
Neg logit contributions
ocol-0.643
 rib-0.575
 microsc-0.532
ifying-0.530
hy-0.525
Token
Token ID198
Feature activation-2.30e-01

Pos logit contributions
 someone+0.088
 everything+0.085
 looked+0.083
 one+0.082
 something+0.082
Neg logit contributions
ocol-0.279
 rib-0.244
 microsc-0.235
hy-0.224
rou-0.224
Token"
Token ID1
Feature activation-1.09e+00

Pos logit contributions
 someone+0.156
 everything+0.151
 looked+0.145
 dad+0.144
 something+0.142
Neg logit contributions
ocol-0.453
 rib-0.400
 microsc-0.377
ifying-0.369
hy-0.361
TokenMe
Token ID5308
Feature activation-2.44e+00

Pos logit contributions
 everything+1.214
 someone+1.186
 dad+1.160
 where+1.149
 something+1.132
Neg logit contributions
ocol-1.689
 rib-1.609
leaf-1.403
hy-1.390
 gol-1.382
Token too
Token ID1165
Feature activation-4.70e-02

Pos logit contributions
Maybe+1.814
 someone+1.766
 everything+1.660
Days+1.631
They+1.588
Neg logit contributions
ocol-4.759
hy-4.237
 rib-4.134
rou-3.981
ifying-3.976
Token.
Token ID13
Feature activation+9.29e-02

Pos logit contributions
 someone+0.036
 everything+0.035
Maybe+0.034
 something+0.033
How+0.033
Neg logit contributions
ocol-0.089
 rib-0.077
 microsc-0.074
hy-0.073
ifying-0.072
Token I
Token ID314
Feature activation+5.89e-01

Pos logit contributions
ocol+0.140
 rib+0.115
 microsc+0.108
ifying+0.106
hy+0.106
Neg logit contributions
 everything-0.110
 someone-0.109
 dad-0.105
 something-0.104
 looked-0.103
Token wish
Token ID4601
Feature activation+4.41e-02

Pos logit contributions
ocol+1.007
 rib+0.836
 microsc+0.801
ifying+0.775
hy+0.768
Neg logit contributions
 someone-0.592
 everything-0.582
 dad-0.558
 something-0.553
Maybe-0.548
Token I
Token ID314
Feature activation+1.52e-01

Pos logit contributions
ocol+0.089
 rib+0.078
 microsc+0.074
hy+0.072
ifying+0.071
Neg logit contributions
 someone-0.028
Maybe-0.027
 everything-0.027
 dad-0.026
 looked-0.025
Token later
Token ID1568
Feature activation+7.13e-02

Pos logit contributions
 someone+0.126
 everything+0.120
 dad+0.118
Maybe+0.118
How+0.113
Neg logit contributions
ocol-0.326
 rib-0.286
 microsc-0.271
ifying-0.266
 former-0.263
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
ocol+0.146
 rib+0.128
 microsc+0.123
ifying+0.121
hy+0.120
Neg logit contributions
 someone-0.042
 everything-0.041
Maybe-0.039
 dad-0.038
 something-0.037
Token
Token ID198
Feature activation-3.36e-01

Pos logit contributions
 everything+0.075
 someone+0.074
 looked+0.070
 dad+0.069
 one+0.069
Neg logit contributions
ocol-0.197
 rib-0.168
 microsc-0.166
ifying-0.160
 former-0.158
Token
Token ID198
Feature activation-3.16e-01

Pos logit contributions
 someone+0.233
 everything+0.227
 one+0.223
 looked+0.217
 something+0.215
Neg logit contributions
ocol-0.647
 rib-0.551
 microsc-0.545
hy-0.521
rou-0.517
TokenThe
Token ID464
Feature activation-3.88e-01

Pos logit contributions
 someone+0.241
 everything+0.234
 looked+0.225
 one+0.225
 dad+0.219
Neg logit contributions
ocol-0.595
 rib-0.509
 microsc-0.500
ifying-0.481
rou-0.473
Token next
Token ID1306
Feature activation-2.40e+00

Pos logit contributions
 someone+0.221
 everything+0.217
Maybe+0.202
 something+0.198
 looked+0.195
Neg logit contributions
ocol-0.817
 rib-0.722
 microsc-0.682
 former-0.662
ifying-0.652
Token day
Token ID1110
Feature activation+9.38e-02

Pos logit contributions
Maybe+2.667
They+2.523
 someone+2.509
 everything+2.506
 what+2.495
Neg logit contributions
ocol-3.493
 gol-2.885
 rib-2.864
 topp-2.785
pri-2.705
Token,
Token ID11
Feature activation-3.07e-01

Pos logit contributions
ocol+0.189
 rib+0.162
 microsc+0.156
ifying+0.152
hy+0.149
Neg logit contributions
 someone-0.060
 everything-0.059
Maybe-0.058
 dad-0.056
How-0.054
Token Ben
Token ID3932
Feature activation-6.20e-02

Pos logit contributions
 someone+0.148
Maybe+0.147
 looked+0.144
 everything+0.144
How+0.126
Neg logit contributions
ocol-0.643
 rib-0.571
 microsc-0.546
ifying-0.533
 former-0.518
Token and
Token ID290
Feature activation+5.48e-01

Pos logit contributions
 someone+0.037
 everything+0.037
 something+0.033
 dad+0.032
Maybe+0.032
Neg logit contributions
ocol-0.126
 rib-0.111
 microsc-0.107
hy-0.104
ifying-0.103
Token Lily
Token ID20037
Feature activation-3.42e-01

Pos logit contributions
ocol+1.173
 rib+1.016
 microsc+0.983
ifying+0.957
hy+0.951
Neg logit contributions
 someone-0.315
 everything-0.306
 dad-0.283
 something-0.281
Maybe-0.278
Token.
Token ID13
Feature activation-6.37e-02

Pos logit contributions
ocol+1.678
 rib+1.453
 microsc+1.408
ifying+1.380
hy+1.363
Neg logit contributions
 someone-0.390
 everything-0.378
 dad-0.348
 something-0.342
 one-0.331
Token They
Token ID1119
Feature activation+5.69e-01

Pos logit contributions
 everything+0.043
 someone+0.042
 looked+0.040
 dad+0.039
 something+0.039
Neg logit contributions
ocol-0.125
 rib-0.107
 microsc-0.106
ifying-0.103
 former-0.102
Token hoped
Token ID10719
Feature activation+2.35e-01

Pos logit contributions
ocol+1.137
 rib+0.975
 microsc+0.942
ifying+0.916
hy+0.910
Neg logit contributions
 someone-0.403
 everything-0.393
 dad-0.369
 something-0.367
Maybe-0.361
Token that
Token ID326
Feature activation+1.90e-01

Pos logit contributions
ocol+0.493
 rib+0.429
 microsc+0.414
hy+0.401
ifying+0.399
Neg logit contributions
 someone-0.132
 everything-0.129
Maybe-0.126
 dad-0.121
 looked-0.119
Token the
Token ID262
Feature activation-7.84e-01

Pos logit contributions
ocol+0.402
 rib+0.353
 microsc+0.341
ifying+0.328
hy+0.328
Neg logit contributions
 someone-0.099
 everything-0.097
Maybe-0.091
 dad-0.090
 something-0.089
Token next
Token ID1306
Feature activation-2.40e+00

Pos logit contributions
 someone+0.620
 everything+0.610
They+0.594
 they+0.591
 looked+0.584
Neg logit contributions
ocol-1.418
 rib-1.268
 microsc-1.153
hy-1.150
 former-1.144
Token time
Token ID640
Feature activation-8.49e-02

Pos logit contributions
They+2.788
 what+2.725
 everything+2.641
Nothing+2.561
 dad+2.530
Neg logit contributions
ocol-3.253
 rib-3.012
 topp-2.806
 spe-2.721
 gol-2.671
Token they
Token ID484
Feature activation-1.12e-01

Pos logit contributions
 someone+0.028
 everything+0.027
Maybe+0.027
 dad+0.024
How+0.024
Neg logit contributions
ocol-0.195
 rib-0.173
 microsc-0.167
hy-0.163
ifying-0.163
Token had
Token ID550
Feature activation-4.39e-01

Pos logit contributions
 someone+0.098
 everything+0.094
 something+0.092
 dad+0.090
 one+0.089
Neg logit contributions
ocol-0.201
 rib-0.172
 microsc-0.164
ifying-0.161
hy-0.160
Token a
Token ID257
Feature activation-7.99e-01

Pos logit contributions
Maybe+0.336
 someone+0.323
 everything+0.317
How+0.315
They+0.309
Neg logit contributions
ocol-0.811
 rib-0.712
 microsc-0.669
 topp-0.654
hy-0.648
Token picnic
Token ID35715
Feature activation-5.15e-01

Pos logit contributions
 someone+0.793
 everything+0.779
They+0.768
Maybe+0.757
How+0.750
Neg logit contributions
ocol-1.271
 rib-1.112
 microsc-1.044
 former-1.013
hy-1.005
Token owner
Token ID4870
Feature activation-5.59e-02

Pos logit contributions
 someone+0.149
 everything+0.149
Maybe+0.142
 something+0.141
 looked+0.141
Neg logit contributions
ocol-0.270
 rib-0.235
 microsc-0.225
hy-0.214
ifying-0.213
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
 someone+0.038
 everything+0.037
Maybe+0.036
 dad+0.035
 something+0.034
Neg logit contributions
ocol-0.111
 rib-0.095
 microsc-0.092
hy-0.091
ifying-0.090
Token
Token ID198
Feature activation+3.09e-01

Pos logit contributions
ocol+0.234
 rib+0.201
 microsc+0.196
hy+0.190
ifying+0.190
Neg logit contributions
 someone-0.093
 everything-0.090
 something-0.086
Maybe-0.085
 dad-0.085
Token
Token ID198
Feature activation+6.43e-01

Pos logit contributions
ocol+0.647
 rib+0.561
 microsc+0.545
hy+0.529
ifying+0.528
Neg logit contributions
 someone-0.176
 everything-0.169
 dad-0.159
 something-0.158
 one-0.154
TokenThe
Token ID464
Feature activation-5.31e-01

Pos logit contributions
ocol+1.301
 rib+1.112
 microsc+1.075
ifying+1.047
hy+1.037
Neg logit contributions
 someone-0.450
 everything-0.439
 dad-0.412
 something-0.408
Maybe-0.403
Token next
Token ID1306
Feature activation-2.40e+00

Pos logit contributions
 everything+0.302
 someone+0.283
 something+0.280
Maybe+0.268
 looked+0.265
Neg logit contributions
ocol-1.075
 rib-0.960
 microsc-0.934
hy-0.925
 former-0.908
Token day
Token ID1110
Feature activation+7.55e-01

Pos logit contributions
Maybe+2.510
 everything+2.409
 someone+2.386
 what+2.363
 something+2.325
Neg logit contributions
 gol-3.395
ocol-3.317
hy-2.933
pri-2.876
 rib-2.827
Token,
Token ID11
Feature activation+8.66e-02

Pos logit contributions
ocol+1.619
 rib+1.398
 microsc+1.355
ifying+1.320
hy+1.308
Neg logit contributions
 someone-0.446
 everything-0.433
 dad-0.399
 something-0.397
 one-0.387
Token Tim
Token ID5045
Feature activation-4.63e-01

Pos logit contributions
ocol+0.151
 rib+0.128
 microsc+0.122
ifying+0.120
hy+0.117
Neg logit contributions
 someone-0.076
 everything-0.074
Maybe-0.074
 dad-0.072
 looked-0.071
Tokenmy
Token ID1820
Feature activation-4.92e-01

Pos logit contributions
 someone+0.326
 everything+0.308
 something+0.301
Maybe+0.291
 they+0.285
Neg logit contributions
ocol-0.868
 rib-0.761
 microsc-0.755
rou-0.745
hy-0.728
Token went
Token ID1816
Feature activation-1.86e-01

Pos logit contributions
 someone+0.355
 everything+0.353
Maybe+0.329
 something+0.327
How+0.325
Neg logit contributions
ocol-0.896
 rib-0.803
 microsc-0.785
hy-0.743
 gol-0.741
Token sad
Token ID6507
Feature activation-2.73e-01

Pos logit contributions
 someone+0.174
 everything+0.174
Maybe+0.167
How+0.164
 something+0.162
Neg logit contributions
ocol-0.483
 rib-0.401
 microsc-0.396
hy-0.387
ifying-0.382
Token.
Token ID13
Feature activation-1.97e-01

Pos logit contributions
 someone+0.167
Maybe+0.163
 everything+0.161
 dad+0.158
Mom+0.158
Neg logit contributions
ocol-0.542
 rib-0.473
 microsc-0.459
ifying-0.440
rou-0.440
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
 everything+0.186
 someone+0.185
 dad+0.182
 something+0.178
Maybe+0.175
Neg logit contributions
ocol-0.330
 rib-0.276
 microsc-0.270
hy-0.258
 former-0.257
Token
Token ID198
Feature activation-3.61e-01

Pos logit contributions
 someone+0.181
 everything+0.179
 dad+0.173
 something+0.172
 one+0.171
Neg logit contributions
ocol-0.543
 rib-0.464
 microsc-0.463
hy-0.439
ifying-0.434
TokenThe
Token ID464
Feature activation-7.54e-01

Pos logit contributions
 dad+0.307
 someone+0.305
 everything+0.298
 one+0.287
 something+0.286
Neg logit contributions
ocol-0.642
 rib-0.541
 microsc-0.529
ifying-0.513
hy-0.508
Token next
Token ID1306
Feature activation-2.39e+00

Pos logit contributions
 everything+0.266
 someone+0.263
They+0.254
Maybe+0.245
 they+0.245
Neg logit contributions
ocol-1.719
 rib-1.532
 microsc-1.493
 former-1.426
 gol-1.424
Token day
Token ID1110
Feature activation+2.53e-01

Pos logit contributions
Maybe+2.425
They+2.385
 everything+2.337
 what+2.310
 someone+2.268
Neg logit contributions
ocol-3.538
 gol-3.356
hy-3.004
 rib-2.925
pri-2.846
Token,
Token ID11
Feature activation-4.36e-01

Pos logit contributions
ocol+0.516
 rib+0.446
 microsc+0.427
ifying+0.417
hy+0.415
Neg logit contributions
 someone-0.158
 everything-0.154
 dad-0.149
Maybe-0.147
 something-0.142
Token Lucy
Token ID22162
Feature activation-2.95e-01

Pos logit contributions
 dad+0.388
 someone+0.385
Maybe+0.377
 everything+0.372
They+0.366
Neg logit contributions
ocol-0.718
 rib-0.628
 microsc-0.580
ifying-0.565
hy-0.553
Token was
Token ID373
Feature activation-6.84e-01

Pos logit contributions
 everything+0.213
 someone+0.213
 something+0.196
 dad+0.192
Maybe+0.192
Neg logit contributions
ocol-0.549
 rib-0.485
 microsc-0.470
hy-0.452
ifying-0.449
Token very
Token ID845
Feature activation-1.67e-01

Pos logit contributions
Maybe+0.494
They+0.464
 someone+0.457
How+0.452
 everything+0.446
Neg logit contributions
ocol-1.297
 rib-1.104
rou-1.073
hy-1.072
 microsc-1.066
Token them
Token ID606
Feature activation-1.57e-01

Pos logit contributions
ocol+0.443
 rib+0.384
 microsc+0.371
hy+0.353
ifying+0.352
Neg logit contributions
 someone-0.136
 everything-0.131
Maybe-0.129
 dad-0.127
 something-0.122
Token.
Token ID13
Feature activation-4.09e-01

Pos logit contributions
 someone+0.096
 everything+0.092
Maybe+0.091
 dad+0.089
How+0.085
Neg logit contributions
ocol-0.319
 rib-0.279
 microsc-0.270
hy-0.260
ifying-0.258
Token
Token ID198
Feature activation-3.36e-01

Pos logit contributions
 everything+0.345
 someone+0.336
 looked+0.329
 something+0.323
 dad+0.313
Neg logit contributions
ocol-0.723
 rib-0.626
 microsc-0.607
hy-0.584
ifying-0.577
Token
Token ID198
Feature activation-3.50e-01

Pos logit contributions
 someone+0.228
 everything+0.220
 looked+0.214
 something+0.213
 one+0.210
Neg logit contributions
ocol-0.652
 rib-0.558
 microsc-0.550
hy-0.527
rou-0.518
TokenThe
Token ID464
Feature activation-5.37e-01

Pos logit contributions
 someone+0.223
 everything+0.215
 looked+0.205
 dad+0.202
 something+0.201
Neg logit contributions
ocol-0.703
 rib-0.613
 microsc-0.596
hy-0.577
ifying-0.573
Token next
Token ID1306
Feature activation-2.39e+00

Pos logit contributions
 someone+0.438
 everything+0.428
 something+0.419
Maybe+0.411
 dad+0.393
Neg logit contributions
ocol-0.986
 rib-0.862
 microsc-0.827
hy-0.800
 former-0.776
Token day
Token ID1110
Feature activation+3.03e-01

Pos logit contributions
Maybe+2.510
 someone+2.447
 something+2.413
 everything+2.346
 what+2.328
Neg logit contributions
ocol-3.477
hy-3.141
 rib-3.028
 gol-3.023
spe-2.889
Token,
Token ID11
Feature activation-1.78e-01

Pos logit contributions
ocol+0.623
 rib+0.539
 microsc+0.518
ifying+0.503
hy+0.500
Neg logit contributions
 someone-0.189
 everything-0.186
Maybe-0.178
 dad-0.177
 something-0.170
Token Lily
Token ID20037
Feature activation-2.92e-01

Pos logit contributions
Maybe+0.105
 someone+0.105
 everything+0.102
 looked+0.100
 dad+0.100
Neg logit contributions
ocol-0.357
 rib-0.320
 microsc-0.301
ifying-0.287
 former-0.285
Token went
Token ID1816
Feature activation-2.62e-01

Pos logit contributions
 someone+0.197
 everything+0.195
 something+0.182
Maybe+0.172
How+0.170
Neg logit contributions
ocol-0.558
 rib-0.501
 microsc-0.481
hy-0.463
ifying-0.456
Token to
Token ID284
Feature activation+6.18e-01

Pos logit contributions
 someone+0.254
Maybe+0.253
 everything+0.249
 dad+0.241
 looked+0.241
Neg logit contributions
ocol-0.433
 rib-0.368
 microsc-0.352
hy-0.340
ifying-0.337
Token,
Token ID11
Feature activation-8.60e-01

Pos logit contributions
 someone+0.635
 dad+0.594
How+0.587
 everything+0.582
Maybe+0.571
Neg logit contributions
ocol-1.526
 rib-1.358
hy-1.302
 microsc-1.291
onna-1.273
Token but
Token ID475
Feature activation-2.12e-01

Pos logit contributions
 someone+0.746
Maybe+0.694
They+0.690
Mom+0.679
 everything+0.674
Neg logit contributions
ocol-1.430
 rib-1.308
 microsc-1.209
hy-1.170
 gymn-1.138
Token they
Token ID484
Feature activation+1.51e-01

Pos logit contributions
 someone+0.055
Maybe+0.049
 everything+0.048
They+0.043
How+0.043
Neg logit contributions
ocol-0.495
 rib-0.444
 microsc-0.433
ifying-0.419
hy-0.415
Token don
Token ID836
Feature activation-3.48e-01

Pos logit contributions
ocol+0.297
 rib+0.255
 microsc+0.248
ifying+0.241
hy+0.239
Neg logit contributions
 someone-0.108
 everything-0.102
 something-0.096
 dad-0.096
Maybe-0.095
Token't
Token ID470
Feature activation-1.85e+00

Pos logit contributions
 someone+0.365
 everything+0.357
 dad+0.353
 looked+0.346
 something+0.346
Neg logit contributions
ocol-0.548
 rib-0.453
 microsc-0.430
ifying-0.423
hy-0.415
Token see
Token ID766
Feature activation-2.38e+00

Pos logit contributions
How+1.552
Maybe+1.466
 everything+1.452
 someone+1.432
They+1.392
Neg logit contributions
ocol-3.308
onna-2.836
 microsc-2.717
 rib-2.713
hy-2.562
Token any
Token ID597
Feature activation-3.45e-02

Pos logit contributions
They+2.227
Turn+2.180
Mom+2.169
Maybe+2.159
Nothing+2.112
Neg logit contributions
ocol-4.027
 rib-3.610
 microsc-3.401
ifying-3.302
 spe-3.261
Token mud
Token ID17492
Feature activation-6.03e-01

Pos logit contributions
 someone+0.026
 everything+0.026
 dad+0.025
Maybe+0.025
 looked+0.024
Neg logit contributions
ocol-0.067
 rib-0.057
 microsc-0.055
ifying-0.052
hy-0.051
Token.
Token ID13
Feature activation-6.46e-02

Pos logit contributions
 someone+0.454
 dad+0.427
Maybe+0.424
 everything+0.411
Mom+0.403
Neg logit contributions
ocol-1.161
 rib-1.000
 microsc-0.941
ifying-0.914
 topp-0.897
Token They
Token ID1119
Feature activation+2.94e-01

Pos logit contributions
 everything+0.048
 someone+0.047
 dad+0.045
 looked+0.044
 one+0.044
Neg logit contributions
ocol-0.123
 rib-0.106
 microsc-0.103
ifying-0.101
hy-0.099
Token only
Token ID691
Feature activation-4.12e-01

Pos logit contributions
ocol+0.606
 rib+0.525
 microsc+0.506
ifying+0.494
hy+0.491
Neg logit contributions
 someone-0.188
 everything-0.180
 something-0.167
 dad-0.167
Maybe-0.165
Token soon
Token ID2582
Feature activation+3.86e-01

Pos logit contributions
ocol+0.145
 rib+0.127
 microsc+0.122
ifying+0.119
 former+0.118
Neg logit contributions
 someone-0.044
 everything-0.044
 dad-0.041
Maybe-0.041
 looked-0.039
Token.
Token ID13
Feature activation-4.46e-01

Pos logit contributions
ocol+0.823
 rib+0.715
 microsc+0.688
ifying+0.674
hy+0.670
Neg logit contributions
 someone-0.214
 everything-0.208
 dad-0.193
Maybe-0.193
 something-0.189
Token
Token ID198
Feature activation-6.15e-02

Pos logit contributions
 someone+0.315
 everything+0.303
 something+0.297
 dad+0.296
 looked+0.282
Neg logit contributions
ocol-0.843
 rib-0.730
 microsc-0.711
ifying-0.685
hy-0.680
Token
Token ID198
Feature activation+5.62e-02

Pos logit contributions
 someone+0.038
 everything+0.037
 dad+0.036
 something+0.036
 one+0.035
Neg logit contributions
ocol-0.123
 rib-0.106
 microsc-0.104
hy-0.100
onna-0.099
TokenThe
Token ID464
Feature activation-8.57e-01

Pos logit contributions
ocol+0.104
 rib+0.089
 microsc+0.085
ifying+0.084
hy+0.082
Neg logit contributions
 someone-0.046
 everything-0.044
 dad-0.044
 something-0.042
 looked-0.042
Token next
Token ID1306
Feature activation-2.37e+00

Pos logit contributions
 everything+0.673
 someone+0.655
They+0.647
 something+0.617
Maybe+0.608
Neg logit contributions
ocol-1.548
 rib-1.406
 microsc-1.273
 former-1.242
hy-1.226
Token day
Token ID1110
Feature activation+2.04e-01

Pos logit contributions
Maybe+2.474
 someone+2.438
 what+2.433
 everything+2.433
They+2.431
Neg logit contributions
ocol-3.461
 rib-2.964
 gol-2.916
pri-2.736
 topp-2.710
Token,
Token ID11
Feature activation-7.70e-02

Pos logit contributions
ocol+0.414
 rib+0.358
 microsc+0.343
ifying+0.334
hy+0.331
Neg logit contributions
 someone-0.129
 everything-0.127
Maybe-0.122
 dad-0.121
 something-0.115
Token Anna
Token ID11735
Feature activation-4.05e-01

Pos logit contributions
 someone+0.055
 everything+0.053
 dad+0.052
Maybe+0.052
 looked+0.050
Neg logit contributions
ocol-0.147
 rib-0.128
 microsc-0.121
ifying-0.119
hy-0.115
Token checked
Token ID10667
Feature activation-1.82e-01

Pos logit contributions
 someone+0.272
 everything+0.265
 something+0.245
 dad+0.237
How+0.232
Neg logit contributions
ocol-0.788
 rib-0.707
 microsc-0.675
ifying-0.647
 former-0.645
Token on
Token ID319
Feature activation-2.18e-02

Pos logit contributions
 someone+0.143
Maybe+0.143
 looked+0.137
 everything+0.137
They+0.136
Neg logit contributions
ocol-0.339
 rib-0.292
 microsc-0.276
ifying-0.263
rou-0.257
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
 someone+0.401
 everything+0.396
Maybe+0.393
How+0.378
 dad+0.378
Neg logit contributions
ocol-0.997
 rib-0.878
 microsc-0.850
hy-0.820
pill-0.788
Token
Token ID220
Feature activation-4.71e-01

Pos logit contributions
 someone+0.135
 everything+0.132
 something+0.128
 dad+0.127
Maybe+0.127
Neg logit contributions
ocol-0.236
 rib-0.200
 microsc-0.193
hy-0.186
ifying-0.185
Token
Token ID198
Feature activation-8.84e-02

Pos logit contributions
 someone+0.309
 everything+0.306
 one+0.300
 looked+0.298
 dad+0.295
Neg logit contributions
ocol-0.913
 rib-0.779
 microsc-0.762
hy-0.742
ifying-0.740
Token
Token ID198
Feature activation+3.08e-01

Pos logit contributions
 someone+0.057
 everything+0.053
 dad+0.053
 something+0.052
 one+0.052
Neg logit contributions
ocol-0.177
 rib-0.154
 microsc-0.150
hy-0.145
ifying-0.143
TokenThe
Token ID464
Feature activation-5.19e-01

Pos logit contributions
ocol+0.523
 rib+0.439
 microsc+0.418
ifying+0.407
hy+0.403
Neg logit contributions
 someone-0.305
 everything-0.297
 dad-0.291
 something-0.286
 one-0.283
Token next
Token ID1306
Feature activation-2.37e+00

Pos logit contributions
 someone+0.277
 everything+0.275
Maybe+0.255
 something+0.252
They+0.246
Neg logit contributions
ocol-1.091
 rib-0.954
 microsc-0.948
hy-0.908
 former-0.891
Token day
Token ID1110
Feature activation+2.68e-01

Pos logit contributions
Maybe+2.519
 someone+2.419
 everything+2.395
They+2.370
 somewhere+2.354
Neg logit contributions
ocol-3.410
 gol-3.227
hy-2.983
 rib-2.807
pri-2.790
Token,
Token ID11
Feature activation+1.29e-02

Pos logit contributions
ocol+0.540
 rib+0.467
 microsc+0.447
ifying+0.437
hy+0.433
Neg logit contributions
 someone-0.174
 everything-0.168
 dad-0.162
Maybe-0.160
 something-0.156
Token Lucy
Token ID22162
Feature activation-1.18e-01

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
ifying+0.014
 former+0.014
Neg logit contributions
 someone-0.015
 dad-0.014
 everything-0.014
Maybe-0.014
 looked-0.014
Token took
Token ID1718
Feature activation-7.76e-01

Pos logit contributions
 someone+0.079
 everything+0.077
 something+0.072
Maybe+0.069
 dad+0.069
Neg logit contributions
ocol-0.232
 rib-0.205
 microsc-0.198
ifying-0.191
hy-0.191
Token the
Token ID262
Feature activation+1.98e-01

Pos logit contributions
Maybe+0.573
They+0.554
How+0.546
Nothing+0.498
 someone+0.496
Neg logit contributions
ocol-1.495
 rib-1.317
 microsc-1.217
hy-1.164
leaf-1.145
Token sad
Token ID6507
Feature activation-2.73e-01

Pos logit contributions
 someone+0.196
Maybe+0.195
 everything+0.191
 dad+0.186
How+0.186
Neg logit contributions
ocol-0.654
 rib-0.557
hy-0.541
ifying-0.538
 microsc-0.538
Token and
Token ID290
Feature activation-1.76e-01

Pos logit contributions
 someone+0.188
 everything+0.179
Maybe+0.177
 dad+0.174
How+0.174
Neg logit contributions
ocol-0.526
 rib-0.459
 microsc-0.439
 former-0.424
ifying-0.424
Token angry
Token ID7954
Feature activation-5.92e-01

Pos logit contributions
 someone+0.129
 everything+0.127
Maybe+0.123
 one+0.118
 something+0.118
Neg logit contributions
ocol-0.333
 rib-0.285
 microsc-0.279
hy-0.277
ifying-0.271
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
Maybe+0.404
 someone+0.395
 everything+0.389
How+0.386
Mom+0.380
Neg logit contributions
ocol-1.132
 rib-0.995
 microsc-0.945
hy-0.941
ifying-0.922
Token The
Token ID383
Feature activation-8.91e-01

Pos logit contributions
ocol+0.210
 rib+0.173
 microsc+0.170
hy+0.164
ifying+0.164
Neg logit contributions
 someone-0.130
 everything-0.129
 dad-0.124
 something-0.124
Maybe-0.123
Token next
Token ID1306
Feature activation-2.36e+00

Pos logit contributions
 everything+0.448
 someone+0.439
Maybe+0.419
How+0.419
They+0.405
Neg logit contributions
ocol-1.883
 rib-1.671
 microsc-1.608
 former-1.557
hy-1.545
Token day
Token ID1110
Feature activation+3.61e-01

Pos logit contributions
Maybe+2.396
 everything+2.364
They+2.354
 where+2.324
 what+2.295
Neg logit contributions
ocol-3.505
 gol-3.240
 topp-2.931
pri-2.868
 rib-2.855
Token,
Token ID11
Feature activation-6.55e-02

Pos logit contributions
ocol+0.741
 rib+0.637
 microsc+0.616
ifying+0.602
hy+0.597
Neg logit contributions
 someone-0.224
 everything-0.220
Maybe-0.209
 dad-0.207
 something-0.202
Token Tim
Token ID5045
Feature activation-3.75e-01

Pos logit contributions
 someone+0.036
 everything+0.034
Maybe+0.034
 dad+0.033
 looked+0.033
Neg logit contributions
ocol-0.134
 rib-0.117
 microsc-0.113
ifying-0.111
hy-0.109
Token wanted
Token ID2227
Feature activation-1.03e-01

Pos logit contributions
 someone+0.286
 everything+0.285
 something+0.269
Maybe+0.264
How+0.258
Neg logit contributions
ocol-0.669
 rib-0.596
 microsc-0.573
hy-0.563
rou-0.554
Token to
Token ID284
Feature activation+4.17e-01

Pos logit contributions
Maybe+0.116
 everything+0.115
 someone+0.115
They+0.113
 looked+0.112
Neg logit contributions
ocol-0.152
 rib-0.127
 microsc-0.123
 topp-0.112
hy-0.112
Token happy
Token ID3772
Feature activation-8.53e-01

Pos logit contributions
Maybe+0.471
 someone+0.465
 everything+0.456
 dad+0.438
How+0.430
Neg logit contributions
ocol-1.353
 rib-1.186
 microsc-1.091
hy-1.082
 former-1.075
Token.
Token ID13
Feature activation-3.03e-01

Pos logit contributions
 someone+0.616
Maybe+0.605
Mom+0.603
 everything+0.601
Turn+0.582
Neg logit contributions
ocol-1.553
 rib-1.389
 microsc-1.313
 former-1.266
onna-1.249
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
 someone+0.217
 everything+0.217
 dad+0.204
 looked+0.203
 something+0.202
Neg logit contributions
ocol-0.572
 rib-0.496
 microsc-0.487
ifying-0.472
 former-0.469
Token
Token ID198
Feature activation-3.62e-01

Pos logit contributions
 someone+0.348
 everything+0.335
 looked+0.331
 one+0.330
 something+0.325
Neg logit contributions
ocol-0.879
 rib-0.754
 microsc-0.741
rou-0.705
hy-0.701
TokenThe
Token ID464
Feature activation-6.45e-02

Pos logit contributions
 someone+0.204
 everything+0.196
 dad+0.188
 looked+0.187
 one+0.181
Neg logit contributions
ocol-0.748
 rib-0.659
 microsc-0.642
ifying-0.618
rou-0.615
Token next
Token ID1306
Feature activation-2.36e+00

Pos logit contributions
 everything+0.058
 someone+0.058
Maybe+0.055
 dad+0.055
 looked+0.054
Neg logit contributions
ocol-0.115
 rib-0.097
 microsc-0.093
hy-0.088
 former-0.088
Token day
Token ID1110
Feature activation+2.79e-01

Pos logit contributions
Maybe+2.499
 someone+2.445
 what+2.395
 everything+2.360
 something+2.351
Neg logit contributions
ocol-3.446
 rib-2.924
 gol-2.905
hy-2.854
 gymn-2.658
Token,
Token ID11
Feature activation-2.89e-01

Pos logit contributions
ocol+0.572
 rib+0.493
 microsc+0.476
ifying+0.463
hy+0.458
Neg logit contributions
 someone-0.176
 everything-0.172
Maybe-0.164
 dad-0.163
 something-0.157
Token they
Token ID484
Feature activation-4.14e-01

Pos logit contributions
 someone+0.180
Maybe+0.176
 everything+0.174
 looked+0.168
 dad+0.166
Neg logit contributions
ocol-0.570
 rib-0.507
 microsc-0.483
ifying-0.465
rou-0.451
Token went
Token ID1816
Feature activation-3.72e-01

Pos logit contributions
 someone+0.285
 everything+0.271
 something+0.265
How+0.249
Maybe+0.247
Neg logit contributions
ocol-0.792
 rib-0.712
 microsc-0.679
ifying-0.665
hy-0.656
Token back
Token ID736
Feature activation+3.41e-01

Pos logit contributions
 someone+0.308
Maybe+0.307
 everything+0.297
 dad+0.287
 looked+0.283
Neg logit contributions
ocol-0.661
 rib-0.575
 microsc-0.549
hy-0.538
ifying-0.536
Token hurt
Token ID5938
Feature activation-8.28e-01

Pos logit contributions
ocol+0.177
 rib+0.153
 microsc+0.147
hy+0.145
ifying+0.142
Neg logit contributions
 someone-0.065
 everything-0.063
Maybe-0.060
 dad-0.059
 something-0.059
Token.
Token ID13
Feature activation-7.52e-01

Pos logit contributions
 dad+0.560
 someone+0.545
 everything+0.544
Maybe+0.542
 looked+0.532
Neg logit contributions
ocol-1.568
 rib-1.328
 microsc-1.290
hy-1.270
ifying-1.259
Token
Token ID198
Feature activation-5.54e-01

Pos logit contributions
 everything+0.624
 someone+0.596
 dad+0.582
 one+0.581
 something+0.580
Neg logit contributions
ocol-1.286
 microsc-1.126
 rib-1.110
hy-1.066
ifying-1.032
Token
Token ID198
Feature activation-6.91e-01

Pos logit contributions
 everything+0.373
 someone+0.370
 looked+0.364
 one+0.359
 something+0.356
Neg logit contributions
ocol-1.064
 rib-0.901
 microsc-0.900
hy-0.856
onna-0.842
TokenThe
Token ID464
Feature activation-1.25e+00

Pos logit contributions
 everything+0.449
 someone+0.449
 looked+0.417
 something+0.414
 one+0.412
Neg logit contributions
ocol-1.364
 microsc-1.167
 rib-1.164
hy-1.108
ifying-1.100
Token next
Token ID1306
Feature activation-2.35e+00

Pos logit contributions
 everything+1.019
How+0.997
 someone+0.990
They+0.982
 where+0.969
Neg logit contributions
ocol-2.194
 rib-1.980
 microsc-1.921
 gol-1.823
hy-1.804
Token day
Token ID1110
Feature activation-4.19e-01

Pos logit contributions
Maybe+2.315
 everything+2.223
They+2.181
How+2.157
 someone+2.139
Neg logit contributions
ocol-3.714
 gol-3.305
hy-3.172
 rib-3.127
pri-2.972
Token,
Token ID11
Feature activation-9.79e-01

Pos logit contributions
Maybe+0.297
 everything+0.294
 someone+0.292
How+0.284
 dad+0.277
Neg logit contributions
ocol-0.786
 rib-0.670
 microsc-0.653
ifying-0.633
hy-0.633
Token Tom
Token ID4186
Feature activation-1.01e+00

Pos logit contributions
 someone+0.629
 everything+0.622
Maybe+0.614
 looked+0.610
 dad+0.592
Neg logit contributions
ocol-1.748
 rib-1.587
 microsc-1.536
ifying-1.441
hy-1.438
Token and
Token ID290
Feature activation+4.36e-01

Pos logit contributions
 someone+0.871
 everything+0.834
Mom+0.811
Maybe+0.808
 something+0.800
Neg logit contributions
ocol-1.737
 rib-1.516
 microsc-1.491
hy-1.416
rou-1.416
Token Lisa
Token ID15378
Feature activation-1.93e-01

Pos logit contributions
ocol+0.846
 rib+0.727
 microsc+0.701
hy+0.675
ifying+0.675
Neg logit contributions
 someone-0.329
 everything-0.321
 dad-0.303
 something-0.302
Maybe-0.301
Token every
Token ID790
Feature activation-1.22e+00

Pos logit contributions
Maybe+0.304
 someone+0.291
How+0.284
 everything+0.283
 looked+0.280
Neg logit contributions
ocol-0.807
 rib-0.695
 microsc-0.673
hy-0.635
ifying-0.634
Token night
Token ID1755
Feature activation+2.13e-01

Pos logit contributions
Maybe+1.039
 everything+1.007
They+0.997
 someone+0.977
How+0.944
Neg logit contributions
ocol-2.201
 rib-1.896
ifying-1.796
 microsc-1.720
 topp-1.682
Token.
Token ID13
Feature activation+6.19e-01

Pos logit contributions
ocol+0.450
 rib+0.391
 microsc+0.375
ifying+0.371
hy+0.369
Neg logit contributions
 someone-0.120
 everything-0.118
Maybe-0.112
 dad-0.108
 something-0.107
Token She
Token ID1375
Feature activation-3.22e-01

Pos logit contributions
ocol+1.190
 rib+1.009
 microsc+0.972
ifying+0.945
hy+0.937
Neg logit contributions
 someone-0.497
 everything-0.486
 dad-0.461
 something-0.457
Maybe-0.452
Token hugs
Token ID40657
Feature activation-3.78e-01

Pos logit contributions
 someone+0.351
 everything+0.338
 something+0.325
 dad+0.323
Maybe+0.320
Neg logit contributions
ocol-0.514
 rib-0.448
 microsc-0.412
hy-0.390
ifying-0.384
Token it
Token ID340
Feature activation-2.35e+00

Pos logit contributions
Maybe+0.246
 someone+0.244
 everything+0.234
 looked+0.229
How+0.226
Neg logit contributions
ocol-0.735
 rib-0.662
 microsc-0.612
hy-0.589
ifying-0.588
Token and
Token ID290
Feature activation-1.95e-01

Pos logit contributions
Maybe+2.270
How+2.249
Nothing+2.248
 someone+2.210
What+2.107
Neg logit contributions
 rib-3.595
ocol-3.168
 stuffed-3.151
hy-3.125
rou-3.058
Token squee
Token ID13170
Feature activation+5.36e-02

Pos logit contributions
 someone+0.183
 everything+0.177
Maybe+0.173
 somewhere+0.168
How+0.167
Neg logit contributions
ocol-0.330
 rib-0.293
 microsc-0.274
hy-0.257
 former-0.256
Tokenzes
Token ID12271
Feature activation+1.68e-01

Pos logit contributions
ocol+0.091
 rib+0.076
 microsc+0.071
hy+0.068
ifying+0.068
Neg logit contributions
 someone-0.055
 everything-0.052
Maybe-0.051
 something-0.051
 dad-0.050
Token it
Token ID340
Feature activation-1.56e+00

Pos logit contributions
ocol+0.372
 rib+0.330
 microsc+0.316
ifying+0.305
hy+0.303
Neg logit contributions
 someone-0.077
 everything-0.072
Maybe-0.069
 dad-0.069
 looked-0.065
Token tight
Token ID5381
Feature activation+5.59e-01

Pos logit contributions
 someone+1.362
 dad+1.344
Maybe+1.344
How+1.305
Nothing+1.287
Neg logit contributions
ocol-2.502
 rib-2.481
hy-2.196
ifying-2.164
 microsc-2.155
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
 dad+0.329
 someone+0.326
 everything+0.322
Maybe+0.318
 looked+0.297
Neg logit contributions
ocol-0.947
 rib-0.842
 microsc-0.802
ifying-0.778
hy-0.769
Token It
Token ID632
Feature activation+1.24e-01

Pos logit contributions
ocol+0.512
 rib+0.436
 microsc+0.418
ifying+0.406
hy+0.406
Neg logit contributions
 someone-0.222
 everything-0.221
 dad-0.214
Maybe-0.208
 something-0.207
Token wasn
Token ID2492
Feature activation+9.85e-02

Pos logit contributions
ocol+0.235
 rib+0.206
 microsc+0.196
hy+0.190
ifying+0.189
Neg logit contributions
 someone-0.094
 everything-0.092
 dad-0.090
 something-0.086
Maybe-0.084
Token't
Token ID470
Feature activation+1.82e-01

Pos logit contributions
ocol+0.165
 rib+0.138
 microsc+0.130
ifying+0.129
hy+0.127
Neg logit contributions
 someone-0.099
 everything-0.097
 dad-0.095
 something-0.092
Maybe-0.092
Token scary
Token ID14343
Feature activation+8.80e-01

Pos logit contributions
ocol+0.355
 rib+0.305
 microsc+0.293
hy+0.284
ifying+0.280
Neg logit contributions
 someone-0.132
 everything-0.129
Maybe-0.127
 dad-0.124
 something-0.120
Token after
Token ID706
Feature activation-2.35e+00

Pos logit contributions
ocol+1.598
 rib+1.340
 microsc+1.287
ifying+1.261
 former+1.237
Neg logit contributions
 someone-0.808
 everything-0.793
 something-0.752
 one-0.748
 dad-0.745
Token all
Token ID477
Feature activation+4.66e-01

Pos logit contributions
Maybe+2.375
How+2.089
Mom+1.994
They+1.886
Dad+1.740
Neg logit contributions
ocol-4.538
 rib-3.789
 microsc-3.716
ous-3.326
ifying-3.291
Token.
Token ID13
Feature activation+5.58e-02

Pos logit contributions
ocol+0.993
 rib+0.857
 microsc+0.833
ifying+0.808
hy+0.804
Neg logit contributions
 someone-0.268
 everything-0.259
 dad-0.244
Maybe-0.241
 something-0.239
Token
Token ID220
Feature activation-3.95e-01

Pos logit contributions
ocol+0.109
 rib+0.093
 microsc+0.090
ifying+0.087
hy+0.087
Neg logit contributions
 everything-0.040
 someone-0.040
 dad-0.039
 something-0.037
 one-0.037
Token
Token ID198
Feature activation+1.06e-02

Pos logit contributions
 someone+0.252
 looked+0.251
 everything+0.247
 dad+0.245
 one+0.240
Neg logit contributions
ocol-0.766
 rib-0.654
 microsc-0.647
rou-0.618
ifying-0.617
Token
Token ID198
Feature activation-2.32e-01

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.018
hy+0.017
ifying+0.017
Neg logit contributions
 someone-0.007
 everything-0.006
 dad-0.006
 something-0.006
 one-0.006
Tokener
Token ID263
Feature activation+6.28e-01

Pos logit contributions
ocol+1.522
 rib+1.316
 microsc+1.278
ifying+1.247
hy+1.235
Neg logit contributions
 someone-0.361
 everything-0.349
 dad-0.320
 something-0.317
 one-0.309
Token.
Token ID13
Feature activation-3.18e-01

Pos logit contributions
ocol+1.360
 rib+1.177
 microsc+1.140
ifying+1.114
hy+1.104
Neg logit contributions
 someone-0.344
 everything-0.333
 dad-0.307
 something-0.303
Maybe-0.298
Token
Token ID198
Feature activation+2.88e-02

Pos logit contributions
 someone+0.291
 everything+0.289
 dad+0.284
 something+0.281
Maybe+0.274
Neg logit contributions
ocol-0.533
 rib-0.459
 microsc-0.437
hy-0.425
ifying-0.415
Token
Token ID198
Feature activation+5.05e-02

Pos logit contributions
ocol+0.058
 rib+0.050
 microsc+0.049
hy+0.047
ifying+0.047
Neg logit contributions
 someone-0.018
 everything-0.017
 dad-0.016
 something-0.016
 one-0.016
TokenThe
Token ID464
Feature activation-6.76e-01

Pos logit contributions
ocol+0.084
 rib+0.071
 microsc+0.067
ifying+0.066
hy+0.065
Neg logit contributions
 someone-0.050
 everything-0.049
 dad-0.049
 something-0.048
 one-0.047
Token next
Token ID1306
Feature activation-2.34e+00

Pos logit contributions
 someone+0.451
They+0.440
 they+0.435
 everything+0.434
Maybe+0.421
Neg logit contributions
ocol-1.322
 rib-1.182
 microsc-1.117
 gol-1.070
hy-1.059
Token day
Token ID1110
Feature activation+8.18e-02

Pos logit contributions
Maybe+2.409
 someone+2.368
 what+2.350
They+2.336
 something+2.330
Neg logit contributions
ocol-3.378
 gol-3.226
 rib-2.985
pri-2.842
hy-2.837
Token,
Token ID11
Feature activation-3.61e-01

Pos logit contributions
ocol+0.163
 rib+0.140
 microsc+0.134
ifying+0.131
hy+0.130
Neg logit contributions
 someone-0.054
 everything-0.052
Maybe-0.051
 dad-0.051
How-0.049
Token she
Token ID673
Feature activation-6.94e-01

Pos logit contributions
 someone+0.433
 dad+0.430
Maybe+0.427
 everything+0.420
 looked+0.413
Neg logit contributions
ocol-0.488
 rib-0.411
 microsc-0.376
ifying-0.361
hy-0.349
Token was
Token ID373
Feature activation-5.33e-01

Pos logit contributions
 someone+0.463
 everything+0.459
 something+0.420
Maybe+0.409
 dad+0.408
Neg logit contributions
ocol-1.315
 rib-1.207
 microsc-1.151
rou-1.089
 topp-1.078
Token very
Token ID845
Feature activation+1.00e-01

Pos logit contributions
Maybe+0.463
 someone+0.428
They+0.421
 everything+0.417
How+0.417
Neg logit contributions
ocol-0.953
 rib-0.810
 microsc-0.779
rou-0.775
hy-0.770
Token later
Token ID1568
Feature activation+1.60e-02

Pos logit contributions
ocol+0.813
 rib+0.706
 microsc+0.685
ifying+0.664
hy+0.657
Neg logit contributions
 someone-0.205
 everything-0.200
 dad-0.185
Maybe-0.183
 something-0.179
Token.
Token ID13
Feature activation-4.05e-02

Pos logit contributions
ocol+0.033
 rib+0.028
 microsc+0.028
hy+0.027
ifying+0.027
Neg logit contributions
 someone-0.009
 everything-0.009
Maybe-0.009
 dad-0.009
How-0.008
Token
Token ID198
Feature activation-4.58e-01

Pos logit contributions
 everything+0.030
 someone+0.030
 dad+0.028
 looked+0.028
 something+0.028
Neg logit contributions
ocol-0.077
 rib-0.066
 microsc-0.065
ifying-0.063
 former-0.062
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
 someone+0.310
 everything+0.304
 one+0.296
 something+0.294
 looked+0.294
Neg logit contributions
ocol-0.874
 rib-0.748
 microsc-0.739
onna-0.707
hy-0.702
TokenThe
Token ID464
Feature activation-9.52e-01

Pos logit contributions
 someone+0.221
 everything+0.216
 looked+0.208
 dad+0.206
 one+0.202
Neg logit contributions
ocol-0.655
 rib-0.565
 microsc-0.552
ifying-0.535
hy-0.525
Token next
Token ID1306
Feature activation-2.34e+00

Pos logit contributions
They+0.697
 everything+0.694
 someone+0.675
Maybe+0.655
How+0.634
Neg logit contributions
ocol-1.782
 rib-1.609
 microsc-1.569
hy-1.448
 former-1.429
Token day
Token ID1110
Feature activation+3.93e-01

Pos logit contributions
Maybe+2.402
 everything+2.343
They+2.318
 someone+2.292
How+2.245
Neg logit contributions
ocol-3.519
 gol-3.001
hy-2.989
 rib-2.928
onna-2.854
Token,
Token ID11
Feature activation-3.57e-03

Pos logit contributions
ocol+0.814
 rib+0.702
 microsc+0.677
ifying+0.660
hy+0.654
Neg logit contributions
 someone-0.241
 everything-0.237
 dad-0.222
Maybe-0.222
 something-0.217
Token they
Token ID484
Feature activation-3.56e-01

Pos logit contributions
 someone+0.001
 everything+0.001
Maybe+0.001
 looked+0.001
 dad+0.001
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
ifying-0.007
 former-0.007
Token went
Token ID1816
Feature activation-2.59e-01

Pos logit contributions
 someone+0.259
 everything+0.248
 something+0.239
How+0.229
 one+0.227
Neg logit contributions
ocol-0.672
 rib-0.600
 microsc-0.581
ifying-0.557
hy-0.546
Token back
Token ID736
Feature activation+2.77e-01

Pos logit contributions
 someone+0.261
Maybe+0.259
 everything+0.253
 dad+0.245
How+0.241
Neg logit contributions
ocol-0.419
 rib-0.356
 microsc-0.344
hy-0.327
ifying-0.325
Token scissors
Token ID41786
Feature activation+5.59e-01

Pos logit contributions
ocol+2.177
 rib+1.796
 microsc+1.781
ifying+1.745
hy+1.716
Neg logit contributions
 someone-1.054
 everything-1.027
 something-0.996
 one-0.993
 dad-0.987
Token and
Token ID290
Feature activation+1.66e-01

Pos logit contributions
ocol+1.170
 rib+1.010
 microsc+0.975
ifying+0.953
hy+0.945
Neg logit contributions
 someone-0.338
 everything-0.328
 dad-0.306
 something-0.301
Maybe-0.299
Token cut
Token ID2005
Feature activation+1.76e-01

Pos logit contributions
ocol+0.326
 rib+0.285
 microsc+0.270
ifying+0.263
hy+0.260
Neg logit contributions
 someone-0.118
 everything-0.116
Maybe-0.110
 dad-0.108
 something-0.106
Token his
Token ID465
Feature activation+5.96e-01

Pos logit contributions
ocol+0.373
 rib+0.325
 microsc+0.309
ifying+0.303
hy+0.299
Neg logit contributions
 someone-0.100
 everything-0.096
 dad-0.093
Maybe-0.092
 something-0.087
Token pocket
Token ID10000
Feature activation+6.55e-01

Pos logit contributions
ocol+1.091
 rib+0.919
 microsc+0.883
ifying+0.858
hy+0.849
Neg logit contributions
 someone-0.523
 everything-0.513
 dad-0.487
 something-0.484
Maybe-0.481
Token open
Token ID1280
Feature activation+1.31e+00

Pos logit contributions
ocol+1.369
 rib+1.178
 microsc+1.141
ifying+1.112
hy+1.102
Neg logit contributions
 someone-0.407
 everything-0.396
 dad-0.369
 something-0.365
Maybe-0.359
Token.
Token ID13
Feature activation-1.86e-01

Pos logit contributions
ocol+2.919
 rib+2.509
 microsc+2.476
hy+2.404
 former+2.386
Neg logit contributions
 someone-0.659
 everything-0.657
 one-0.616
 something-0.606
 dad-0.583
Token He
Token ID679
Feature activation+2.29e-01

Pos logit contributions
 everything+0.172
 someone+0.171
Maybe+0.163
 something+0.162
 dad+0.162
Neg logit contributions
ocol-0.323
 rib-0.269
 microsc-0.261
ifying-0.253
hy-0.250
Token carefully
Token ID7773
Feature activation+1.49e-02

Pos logit contributions
ocol+0.433
 rib+0.375
 microsc+0.355
ifying+0.345
hy+0.343
Neg logit contributions
 someone-0.182
 everything-0.177
 dad-0.167
 something-0.166
Maybe-0.162
Token pulled
Token ID5954
Feature activation-3.44e-01

Pos logit contributions
ocol+0.026
 rib+0.023
 microsc+0.021
hy+0.020
ifying+0.020
Neg logit contributions
 someone-0.014
 everything-0.014
 dad-0.013
Maybe-0.013
 something-0.013
Token out
Token ID503
Feature activation+1.64e-01

Pos logit contributions
 someone+0.219
Maybe+0.214
How+0.203
 everything+0.202
 dad+0.201
Neg logit contributions
ocol-0.705
 rib-0.616
 microsc-0.574
ifying-0.560
hy-0.551
Token to
Token ID284
Feature activation+5.93e-02

Pos logit contributions
ocol+0.006
 rib+0.005
 microsc+0.005
ifying+0.005
hy+0.005
Neg logit contributions
 someone-0.004
Maybe-0.004
 everything-0.004
 dad-0.004
 looked-0.004
Token his
Token ID465
Feature activation+3.29e-01

Pos logit contributions
ocol+0.115
 rib+0.100
 microsc+0.095
hy+0.091
ifying+0.091
Neg logit contributions
 someone-0.041
 everything-0.040
Maybe-0.040
 dad-0.039
 looked-0.037
Token field
Token ID2214
Feature activation-2.70e-01

Pos logit contributions
ocol+0.609
 rib+0.519
 microsc+0.500
ifying+0.481
 former+0.480
Neg logit contributions
 someone-0.274
 everything-0.267
Maybe-0.256
 something-0.252
 dad-0.248
Token to
Token ID284
Feature activation+7.47e-02

Pos logit contributions
 someone+0.226
Maybe+0.225
 everything+0.225
How+0.214
 dad+0.214
Neg logit contributions
ocol-0.470
 rib-0.400
 microsc-0.383
hy-0.383
 former-0.370
Token pick
Token ID2298
Feature activation+3.79e-01

Pos logit contributions
ocol+0.144
 rib+0.123
 microsc+0.117
ifying+0.113
hy+0.113
Neg logit contributions
 someone-0.056
 everything-0.055
Maybe-0.053
 dad-0.052
 something-0.051
Token some
Token ID617
Feature activation+1.32e+00

Pos logit contributions
ocol+0.863
 rib+0.756
 microsc+0.731
ifying+0.711
hy+0.706
Neg logit contributions
 someone-0.156
 everything-0.149
 dad-0.141
Maybe-0.136
 something-0.129
Token corn
Token ID11676
Feature activation+4.80e-01

Pos logit contributions
ocol+2.675
 rib+2.268
ifying+2.223
hy+2.216
 microsc+2.216
Neg logit contributions
 someone-0.919
 everything-0.902
 something-0.857
 one-0.828
 dad-0.827
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
ocol+1.034
 rib+0.896
 microsc+0.867
ifying+0.846
hy+0.838
Neg logit contributions
 someone-0.266
 everything-0.255
 dad-0.235
Maybe-0.233
 something-0.232
Token He
Token ID679
Feature activation-3.01e-01

Pos logit contributions
 someone+0.327
 everything+0.326
Maybe+0.320
 dad+0.320
 something+0.312
Neg logit contributions
ocol-0.438
 rib-0.361
hy-0.342
 microsc-0.338
ifying-0.330
Token saw
Token ID2497
Feature activation-7.73e-01

Pos logit contributions
 someone+0.256
 everything+0.250
 something+0.238
 dad+0.230
 one+0.228
Neg logit contributions
ocol-0.528
 rib-0.470
 microsc-0.436
hy-0.427
onna-0.420
Token a
Token ID257
Feature activation+1.28e-01

Pos logit contributions
Maybe+0.338
 someone+0.321
They+0.313
How+0.312
 everything+0.302
Neg logit contributions
ocol-1.632
 rib-1.471
 microsc-1.393
 topp-1.369
hy-1.324
Token asks
Token ID7893
Feature activation+3.23e-01

Pos logit contributions
ocol+0.479
 rib+0.409
 microsc+0.394
ifying+0.378
hy+0.377
Neg logit contributions
 someone-0.212
 everything-0.207
 something-0.195
 dad-0.195
Maybe-0.194
Token.
Token ID13
Feature activation-8.32e-01

Pos logit contributions
ocol+0.689
 rib+0.594
 microsc+0.574
hy+0.556
ifying+0.556
Neg logit contributions
 someone-0.181
 everything-0.178
Maybe-0.169
 dad-0.162
 something-0.160
Token
Token ID198
Feature activation-1.53e-01

Pos logit contributions
 everything+0.678
 someone+0.643
 something+0.633
 somewhere+0.613
Maybe+0.605
Neg logit contributions
ocol-1.490
 rib-1.297
 microsc-1.243
hy-1.228
onna-1.210
Token
Token ID198
Feature activation-5.60e-01

Pos logit contributions
 someone+0.095
 everything+0.094
 something+0.089
 looked+0.089
 one+0.087
Neg logit contributions
ocol-0.310
 rib-0.266
 microsc-0.261
hy-0.251
ifying-0.246
Token"
Token ID1
Feature activation-5.68e-01

Pos logit contributions
 everything+0.353
 someone+0.349
 something+0.327
 looked+0.324
 one+0.316
Neg logit contributions
ocol-1.136
 rib-0.981
 microsc-0.958
hy-0.933
ifying-0.922
TokenI
Token ID40
Feature activation+1.67e+00

Pos logit contributions
 everything+0.523
 someone+0.498
 something+0.466
 dad+0.464
 where+0.459
Neg logit contributions
ocol-0.991
 rib-0.890
ifying-0.818
hy-0.814
 microsc-0.810
Token don
Token ID836
Feature activation-1.20e-01

Pos logit contributions
ocol+3.172
 rib+2.642
 microsc+2.553
ifying+2.524
hy+2.413
Neg logit contributions
 someone-1.535
 everything-1.488
 something-1.475
 looked-1.471
 one-1.467
Token't
Token ID470
Feature activation-1.38e-01

Pos logit contributions
 someone+0.126
 everything+0.124
 dad+0.120
Maybe+0.120
 something+0.119
Neg logit contributions
ocol-0.197
 rib-0.163
 microsc-0.153
ifying-0.151
hy-0.151
Token know
Token ID760
Feature activation+3.15e-01

Pos logit contributions
 someone+0.125
 everything+0.123
 dad+0.120
Maybe+0.120
 they+0.115
Neg logit contributions
ocol-0.245
 rib-0.205
hy-0.194
 microsc-0.194
ifying-0.184
Token.
Token ID13
Feature activation+2.20e-01

Pos logit contributions
ocol+0.651
 rib+0.563
 microsc+0.541
ifying+0.522
hy+0.522
Neg logit contributions
 someone-0.196
 everything-0.191
Maybe-0.182
 dad-0.181
 something-0.174
Token Maybe
Token ID6674
Feature activation+1.07e-01

Pos logit contributions
ocol+0.444
 rib+0.383
 microsc+0.370
ifying+0.362
hy+0.359
Neg logit contributions
 everything-0.149
 someone-0.146
 dad-0.134
 something-0.134
Maybe-0.132
Token,
Token ID11
Feature activation-1.96e-01

Pos logit contributions
 someone+0.407
 everything+0.387
Maybe+0.378
 dad+0.368
 something+0.358
Neg logit contributions
ocol-0.963
 rib-0.836
hy-0.798
 microsc-0.775
ifying-0.766
Token you
Token ID345
Feature activation-7.68e-01

Pos logit contributions
 someone+0.148
 everything+0.143
Maybe+0.141
 looked+0.137
 something+0.136
Neg logit contributions
ocol-0.375
 rib-0.325
 microsc-0.304
hy-0.299
ifying-0.294
Token are
Token ID389
Feature activation-2.09e-01

Pos logit contributions
 someone+0.948
How+0.927
Maybe+0.927
 dad+0.917
 everything+0.916
Neg logit contributions
ocol-1.079
 rib-0.892
hy-0.838
 microsc-0.832
ifying-0.826
Token a
Token ID257
Feature activation-1.17e-01

Pos logit contributions
Maybe+0.123
 someone+0.118
 everything+0.115
 looked+0.110
 dad+0.105
Neg logit contributions
ocol-0.442
 rib-0.376
 microsc-0.366
hy-0.356
ifying-0.355
Token hero
Token ID4293
Feature activation+7.55e-01

Pos logit contributions
 everything+0.083
 someone+0.083
Maybe+0.075
 something+0.075
 looked+0.074
Neg logit contributions
ocol-0.229
 rib-0.199
 microsc-0.190
hy-0.185
ifying-0.184
Token!
Token ID0
Feature activation+1.26e+00

Pos logit contributions
ocol+1.578
 rib+1.357
 microsc+1.316
ifying+1.280
hy+1.267
Neg logit contributions
 someone-0.474
 everything-0.459
 dad-0.430
 something-0.426
 one-0.419
Token We
Token ID775
Feature activation+5.46e-01

Pos logit contributions
ocol+2.183
 rib+1.822
 microsc+1.783
 former+1.704
ifying+1.673
Neg logit contributions
 someone-1.282
 everything-1.217
 dad-1.185
Maybe-1.183
 one-1.173
Token love
Token ID1842
Feature activation-1.19e-01

Pos logit contributions
ocol+0.937
 rib+0.779
 microsc+0.745
ifying+0.723
hy+0.716
Neg logit contributions
 someone-0.545
 everything-0.535
 dad-0.513
 something-0.509
Maybe-0.505
Token you
Token ID345
Feature activation-2.54e-01

Pos logit contributions
 someone+0.088
Maybe+0.085
 everything+0.085
 looked+0.083
 something+0.080
Neg logit contributions
ocol-0.228
 rib-0.197
 microsc-0.188
ifying-0.180
 topp-0.178
Token!"
Token ID2474
Feature activation-4.16e-01

Pos logit contributions
 someone+0.203
Maybe+0.194
 everything+0.192
 dad+0.183
How+0.182
Neg logit contributions
ocol-0.468
 rib-0.406
 microsc-0.387
ifying-0.387
hy-0.380
Token<|endoftext|>
Token ID50256
Feature activation+4.54e-01

Pos logit contributions
 someone+0.292
 everything+0.292
Maybe+0.275
 something+0.274
 one+0.267
Neg logit contributions
ocol-0.780
 rib-0.689
hy-0.659
 microsc-0.653
ifying-0.636
Token other
Token ID584
Feature activation-7.75e-01

Pos logit contributions
ocol+1.162
 rib+0.966
 microsc+0.929
ifying+0.901
hy+0.890
Neg logit contributions
 someone-0.645
 everything-0.634
 dad-0.607
 something-0.602
Maybe-0.596
Token and
Token ID290
Feature activation+4.85e-01

Pos logit contributions
 someone+0.542
How+0.520
Maybe+0.512
 everything+0.483
 something+0.465
Neg logit contributions
ocol-1.439
 rib-1.325
 microsc-1.248
hy-1.232
ifying-1.191
Token said
Token ID531
Feature activation-9.09e-02

Pos logit contributions
ocol+0.923
 rib+0.792
 microsc+0.759
ifying+0.736
hy+0.734
Neg logit contributions
 someone-0.383
 everything-0.375
 something-0.352
 dad-0.352
Maybe-0.349
Token,
Token ID11
Feature activation-2.45e-01

Pos logit contributions
 someone+0.062
Maybe+0.060
 everything+0.060
How+0.059
 dad+0.057
Neg logit contributions
ocol-0.179
 rib-0.156
 microsc-0.148
hy-0.145
ifying-0.141
Token "
Token ID366
Feature activation-1.39e-01

Pos logit contributions
 someone+0.226
 everything+0.219
 something+0.213
Maybe+0.207
 one+0.205
Neg logit contributions
ocol-0.416
 rib-0.363
 microsc-0.341
hy-0.331
ifying-0.330
TokenI
Token ID40
Feature activation+1.60e+00

Pos logit contributions
 someone+0.084
 everything+0.084
 dad+0.079
 something+0.077
 where+0.075
Neg logit contributions
ocol-0.282
 rib-0.252
ifying-0.236
hy-0.235
 microsc-0.235
Token forgive
Token ID20927
Feature activation-5.09e-01

Pos logit contributions
ocol+3.071
 rib+2.440
 microsc+2.359
ifying+2.348
hy+2.235
Neg logit contributions
 someone-1.579
 everything-1.570
Maybe-1.553
 something-1.528
 looked-1.515
Token you
Token ID345
Feature activation+1.97e-01

Pos logit contributions
Maybe+0.374
 looked+0.372
 everything+0.365
 someone+0.360
 turned+0.347
Neg logit contributions
ocol-0.971
 rib-0.835
 microsc-0.793
ifying-0.742
hy-0.740
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
ocol+0.403
 rib+0.347
 microsc+0.337
ifying+0.327
hy+0.324
Neg logit contributions
 someone-0.125
 everything-0.122
Maybe-0.118
 looked-0.112
 something-0.111
Token I
Token ID314
Feature activation+8.54e-01

Pos logit contributions
ocol+1.681
 rib+1.375
 microsc+1.330
ifying+1.272
hy+1.256
Neg logit contributions
 someone-1.081
 everything-1.051
 dad-1.015
Maybe-1.012
 something-1.012
Token love
Token ID1842
Feature activation-7.54e-02

Pos logit contributions
ocol+1.459
 rib+1.192
 microsc+1.146
ifying+1.111
hy+1.091
Neg logit contributions
 someone-0.891
 everything-0.880
Maybe-0.843
 something-0.843
 dad-0.841
Token for
Token ID329
Feature activation-1.22e-01

Pos logit contributions
ocol+0.583
 rib+0.497
 microsc+0.479
ifying+0.464
hy+0.462
Neg logit contributions
 someone-0.210
 everything-0.205
Maybe-0.196
 dad-0.193
 something-0.188
Token his
Token ID465
Feature activation-1.18e-01

Pos logit contributions
Maybe+0.109
 someone+0.108
 everything+0.106
 looked+0.102
 dad+0.101
Neg logit contributions
ocol-0.220
 rib-0.183
 microsc-0.177
 topp-0.166
ifying-0.166
Token success
Token ID1943
Feature activation+2.91e-02

Pos logit contributions
 someone+0.107
 everything+0.107
Maybe+0.103
They+0.100
 something+0.098
Neg logit contributions
ocol-0.209
 rib-0.179
 microsc-0.171
 former-0.163
hy-0.162
Token.
Token ID13
Feature activation-2.04e-01

Pos logit contributions
ocol+0.060
 rib+0.052
 microsc+0.050
ifying+0.049
hy+0.049
Neg logit contributions
 someone-0.017
 everything-0.017
Maybe-0.016
 dad-0.016
 something-0.015
Token
Token ID198
Feature activation-1.44e-01

Pos logit contributions
 someone+0.181
 everything+0.179
 dad+0.171
 something+0.170
Maybe+0.167
Neg logit contributions
ocol-0.361
 rib-0.300
 microsc-0.296
hy-0.284
ifying-0.282
Token
Token ID198
Feature activation+4.60e-01

Pos logit contributions
 someone+0.105
 everything+0.100
 dad+0.097
 something+0.097
 one+0.097
Neg logit contributions
ocol-0.278
 rib-0.235
 microsc-0.230
hy-0.222
ifying-0.220
TokenBob
Token ID18861
Feature activation-2.63e-01

Pos logit contributions
ocol+0.826
 rib+0.692
 microsc+0.665
ifying+0.647
hy+0.640
Neg logit contributions
 someone-0.423
 everything-0.414
 dad-0.400
 something-0.394
 one-0.387
Token learned
Token ID4499
Feature activation+6.44e-01

Pos logit contributions
 someone+0.226
 everything+0.219
 something+0.207
 somewhere+0.202
Maybe+0.202
Neg logit contributions
ocol-0.465
 rib-0.410
 microsc-0.389
rou-0.377
hy-0.377
Token that
Token ID326
Feature activation-4.16e-01

Pos logit contributions
ocol+1.501
 rib+1.311
 microsc+1.273
ifying+1.246
hy+1.236
Neg logit contributions
 someone-0.258
 everything-0.247
 dad-0.220
 something-0.217
Maybe-0.210
Token with
Token ID351
Feature activation+2.57e-01

Pos logit contributions
 someone+0.257
Maybe+0.245
 dad+0.245
 everything+0.243
 looked+0.237
Neg logit contributions
ocol-0.827
 rib-0.719
 microsc-0.708
 topp-0.665
 former-0.663
Token hard
Token ID1327
Feature activation-4.73e-01

Pos logit contributions
ocol+0.510
 rib+0.437
 microsc+0.422
ifying+0.404
hy+0.402
Neg logit contributions
 someone-0.182
 everything-0.177
Maybe-0.173
 dad-0.166
 looked-0.165
Token day
Token ID1110
Feature activation+5.02e-01

Pos logit contributions
ocol+0.160
 rib+0.139
 microsc+0.136
ifying+0.131
hy+0.131
Neg logit contributions
 someone-0.051
 everything-0.048
 dad-0.046
Maybe-0.046
 something-0.046
Token,
Token ID11
Feature activation-2.01e-01

Pos logit contributions
ocol+1.050
 rib+0.904
 microsc+0.873
ifying+0.854
hy+0.846
Neg logit contributions
 someone-0.309
 everything-0.299
 dad-0.282
Maybe-0.279
 something-0.277
Token Lily
Token ID20037
Feature activation-1.70e-02

Pos logit contributions
Maybe+0.131
 someone+0.131
 dad+0.128
 everything+0.127
 looked+0.121
Neg logit contributions
ocol-0.388
 rib-0.339
 microsc-0.324
ifying-0.319
hy-0.308
Token's
Token ID338
Feature activation-4.35e-01

Pos logit contributions
 someone+0.012
 everything+0.012
 something+0.011
Maybe+0.011
 dad+0.010
Neg logit contributions
ocol-0.033
 rib-0.029
 microsc-0.028
ifying-0.026
hy-0.026
Token mom
Token ID1995
Feature activation-3.72e-01

Pos logit contributions
 everything+0.236
 someone+0.236
Maybe+0.229
How+0.226
They+0.214
Neg logit contributions
ocol-0.899
 rib-0.792
 microsc-0.766
 former-0.751
rou-0.746
Token asked
Token ID1965
Feature activation+1.80e-01

Pos logit contributions
 someone+0.275
 everything+0.262
 something+0.259
Maybe+0.253
 somewhere+0.245
Neg logit contributions
ocol-0.697
 rib-0.609
 microsc-0.580
rou-0.572
hy-0.564
Token her
Token ID607
Feature activation-5.62e-01

Pos logit contributions
ocol+0.363
 rib+0.311
 microsc+0.300
ifying+0.287
hy+0.286
Neg logit contributions
 someone-0.120
 everything-0.117
Maybe-0.115
 dad-0.111
 looked-0.110
Token to
Token ID284
Feature activation-2.46e-01

Pos logit contributions
Maybe+0.558
 someone+0.548
 everything+0.540
They+0.534
 looked+0.534
Neg logit contributions
ocol-0.890
 rib-0.763
 microsc-0.749
rou-0.687
hy-0.670
Token lend
Token ID22096
Feature activation-8.98e-02

Pos logit contributions
 everything+0.185
Maybe+0.182
 someone+0.182
 something+0.167
How+0.167
Neg logit contributions
ocol-0.462
 rib-0.394
 microsc-0.376
ifying-0.368
hy-0.363
Token her
Token ID607
Feature activation-1.57e-01

Pos logit contributions
Maybe+0.053
 everything+0.052
 someone+0.052
 dad+0.049
 looked+0.049
Neg logit contributions
ocol-0.185
 rib-0.163
 microsc-0.154
hy-0.147
ifying-0.147
Token a
Token ID257
Feature activation+1.25e-01

Pos logit contributions
 someone+0.060
 everything+0.057
Maybe+0.057
 looked+0.051
They+0.051
Neg logit contributions
ocol-0.355
 rib-0.317
 microsc-0.304
ifying-0.296
hy-0.290
Token in
Token ID287
Feature activation+1.54e-01

Pos logit contributions
ocol+1.126
 rib+0.973
 microsc+0.940
ifying+0.919
hy+0.912
Neg logit contributions
 someone-0.316
 everything-0.306
 dad-0.285
 something-0.281
Maybe-0.280
Token a
Token ID257
Feature activation+6.15e-01

Pos logit contributions
ocol+0.303
 rib+0.259
 microsc+0.248
ifying+0.244
hy+0.241
Neg logit contributions
 someone-0.113
 everything-0.111
Maybe-0.107
 dad-0.105
 something-0.103
Token sweet
Token ID6029
Feature activation+1.07e-01

Pos logit contributions
ocol+1.204
 rib+1.025
 microsc+0.989
ifying+0.963
hy+0.954
Neg logit contributions
 someone-0.466
 everything-0.455
 dad-0.430
 something-0.426
Maybe-0.421
Token voice
Token ID3809
Feature activation+4.45e-01

Pos logit contributions
ocol+0.210
 rib+0.180
 microsc+0.170
hy+0.167
ifying+0.164
Neg logit contributions
 someone-0.078
Maybe-0.077
 everything-0.076
 dad-0.072
 something-0.071
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
ocol+0.942
 rib+0.814
 microsc+0.786
ifying+0.767
hy+0.766
Neg logit contributions
 someone-0.260
 everything-0.254
 dad-0.235
Maybe-0.234
 something-0.231
Token Suddenly
Token ID24975
Feature activation+2.43e-01

Pos logit contributions
ocol+0.035
 rib+0.030
 microsc+0.028
ifying+0.028
hy+0.028
Neg logit contributions
 someone-0.017
 everything-0.017
 dad-0.016
 something-0.016
Maybe-0.016
Token one
Token ID530
Feature activation+1.59e-01

Pos logit contributions
ocol+0.494
 rib+0.429
 microsc+0.410
hy+0.398
ifying+0.396
Neg logit contributions
 someone-0.153
 everything-0.149
Maybe-0.144
 dad-0.143
How-0.137
Token of
Token ID286
Feature activation+5.95e-01

Pos logit contributions
ocol+0.219
 rib+0.176
 microsc+0.162
hy+0.159
ifying+0.159
Neg logit contributions
 someone-0.206
 everything-0.206
Maybe-0.199
 dad-0.198
 something-0.197
Token the
Token ID262
Feature activation+9.78e-02

Pos logit contributions
ocol+1.235
 rib+1.061
 microsc+1.026
ifying+1.000
hy+0.991
Neg logit contributions
 someone-0.381
 everything-0.371
 dad-0.347
 something-0.343
Maybe-0.340
Token birds
Token ID10087
Feature activation+4.10e-01

Pos logit contributions
ocol+0.190
 rib+0.165
 microsc+0.156
hy+0.154
ifying+0.151
Neg logit contributions
 someone-0.072
 everything-0.071
Maybe-0.068
 something-0.067
 dad-0.065
Token hopped
Token ID45230
Feature activation+1.10e+00

Pos logit contributions
ocol+0.787
 rib+0.672
 microsc+0.644
ifying+0.631
hy+0.624
Neg logit contributions
 someone-0.318
 everything-0.309
 dad-0.296
 something-0.292
Maybe-0.291
Token a
Token ID257
Feature activation+3.48e-01

Pos logit contributions
Maybe+0.172
They+0.165
How+0.163
 everything+0.160
 someone+0.160
Neg logit contributions
ocol-1.082
 rib-0.971
 microsc-0.941
hy-0.908
 topp-0.894
Token little
Token ID1310
Feature activation+5.83e-02

Pos logit contributions
ocol+0.680
 rib+0.587
 microsc+0.567
ifying+0.548
hy+0.544
Neg logit contributions
 someone-0.249
 everything-0.245
Maybe-0.229
 dad-0.228
 something-0.225
Token light
Token ID1657
Feature activation-2.23e-02

Pos logit contributions
ocol+0.110
 rib+0.095
 microsc+0.091
ifying+0.087
hy+0.087
Neg logit contributions
 someone-0.046
 everything-0.046
Maybe-0.043
 dad-0.043
They-0.042
Token.
Token ID13
Feature activation+2.90e-01

Pos logit contributions
 everything+0.016
 someone+0.016
Maybe+0.016
 dad+0.015
 something+0.015
Neg logit contributions
ocol-0.044
 rib-0.038
 microsc-0.036
ifying-0.036
hy-0.035
Token It
Token ID632
Feature activation-6.74e-01

Pos logit contributions
ocol+0.480
 rib+0.400
 microsc+0.385
hy+0.376
ifying+0.370
Neg logit contributions
 someone-0.296
 everything-0.292
Maybe-0.281
 dad-0.280
 something-0.278
Token was
Token ID373
Feature activation+2.17e-01

Pos logit contributions
 someone+0.625
 dad+0.617
 everything+0.608
 something+0.586
 somewhere+0.562
Neg logit contributions
ocol-1.080
 rib-1.007
 microsc-0.943
hy-0.888
onna-0.866
Token very
Token ID845
Feature activation+6.17e-01

Pos logit contributions
ocol+0.488
 rib+0.430
 microsc+0.418
ifying+0.406
hy+0.405
Neg logit contributions
 someone-0.090
 everything-0.086
Maybe-0.083
 dad-0.078
 something-0.074
Token pretty
Token ID2495
Feature activation-1.84e-02

Pos logit contributions
ocol+1.285
 rib+1.104
 microsc+1.068
ifying+1.041
hy+1.032
Neg logit contributions
 someone-0.397
 everything-0.387
 dad-0.361
 something-0.358
Maybe-0.353
Token and
Token ID290
Feature activation-7.87e-01

Pos logit contributions
 someone+0.011
 everything+0.011
Maybe+0.011
 dad+0.010
How+0.010
Neg logit contributions
ocol-0.038
 rib-0.033
 microsc-0.032
ifying-0.031
hy-0.031
Token bright
Token ID6016
Feature activation-1.04e-01

Pos logit contributions
 someone+0.599
Maybe+0.589
 everything+0.575
They+0.572
How+0.568
Neg logit contributions
ocol-1.376
 rib-1.273
 microsc-1.231
hy-1.167
 topp-1.124
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
 someone+0.065
 everything+0.062
Maybe+0.061
 dad+0.060
How+0.056
Neg logit contributions
ocol-0.213
 rib-0.186
 microsc-0.178
ifying-0.175
hy-0.174
Token sea
Token ID5417
Feature activation-5.44e-01

Pos logit contributions
ocol+0.553
 rib+0.468
 microsc+0.452
ifying+0.440
hy+0.431
Neg logit contributions
 someone-0.225
 everything-0.217
Maybe-0.211
 dad-0.207
 something-0.204
Token noticed
Token ID6810
Feature activation-2.42e-01

Pos logit contributions
 someone+0.366
Maybe+0.356
 everything+0.350
 dad+0.334
How+0.329
Neg logit contributions
ocol-1.058
 rib-0.931
 microsc-0.901
rou-0.876
ifying-0.874
Token the
Token ID262
Feature activation-1.52e-01

Pos logit contributions
Maybe+0.110
 someone+0.107
 everything+0.107
 dad+0.105
How+0.098
Neg logit contributions
ocol-0.523
 rib-0.462
 microsc-0.451
ifying-0.430
rou-0.426
Token sand
Token ID6450
Feature activation-7.05e-01

Pos logit contributions
 someone+0.119
 everything+0.116
Maybe+0.114
They+0.109
 something+0.109
Neg logit contributions
ocol-0.284
 rib-0.245
 microsc-0.238
ifying-0.228
 former-0.225
Token castle
Token ID16669
Feature activation-3.19e-01

Pos logit contributions
 someone+0.577
Maybe+0.547
 everything+0.534
 dad+0.530
How+0.529
Neg logit contributions
ocol-1.275
 rib-1.124
 microsc-1.073
ifying-1.049
hy-1.027
Token.
Token ID13
Feature activation+6.76e-02

Pos logit contributions
 someone+0.209
 dad+0.202
 everything+0.198
Maybe+0.195
How+0.187
Neg logit contributions
ocol-0.627
 rib-0.556
 microsc-0.531
hy-0.525
ifying-0.521
Token She
Token ID1375
Feature activation-6.71e-01

Pos logit contributions
ocol+0.115
 rib+0.098
 microsc+0.095
hy+0.091
ifying+0.089
Neg logit contributions
 someone-0.064
 everything-0.064
 dad-0.061
Maybe-0.061
 something-0.061
Token wanted
Token ID2227
Feature activation-3.25e-01

Pos logit contributions
 someone+0.580
 everything+0.556
 something+0.541
 somewhere+0.533
 dad+0.528
Neg logit contributions
ocol-1.152
 rib-1.045
 microsc-0.992
rou-0.924
hy-0.915
Token to
Token ID284
Feature activation+4.14e-01

Pos logit contributions
Maybe+0.405
 someone+0.396
They+0.395
How+0.392
Mom+0.390
Neg logit contributions
ocol-0.436
 rib-0.372
 microsc-0.357
 topp-0.307
onna-0.307
Token carry
Token ID3283
Feature activation-6.23e-01

Pos logit contributions
ocol+0.838
 rib+0.722
 microsc+0.695
ifying+0.673
hy+0.671
Neg logit contributions
 someone-0.278
 everything-0.272
 dad-0.255
 something-0.253
Maybe-0.253
Token it
Token ID340
Feature activation-1.23e+00

Pos logit contributions
Maybe+0.442
 dad+0.421
How+0.420
They+0.416
 someone+0.411
Neg logit contributions
ocol-1.240
 rib-1.113
 microsc-1.032
ifying-0.940
hy-0.933
Token surprise
Token ID5975
Feature activation+3.23e-01

Pos logit contributions
 someone+0.123
 everything+0.123
Maybe+0.116
They+0.115
 something+0.114
Neg logit contributions
ocol-0.215
 rib-0.186
 microsc-0.173
hy-0.166
 topp-0.165
Token.
Token ID13
Feature activation-1.48e-01

Pos logit contributions
ocol+0.681
 rib+0.591
 microsc+0.570
hy+0.556
ifying+0.555
Neg logit contributions
 someone-0.188
 everything-0.185
Maybe-0.172
 something-0.167
 dad-0.166
Token Together
Token ID17083
Feature activation+1.36e-01

Pos logit contributions
 someone+0.144
 everything+0.143
 something+0.138
 dad+0.134
Maybe+0.134
Neg logit contributions
ocol-0.247
 rib-0.207
 microsc-0.200
ifying-0.195
hy-0.193
Token,
Token ID11
Feature activation+1.15e-01

Pos logit contributions
ocol+0.296
 rib+0.260
 microsc+0.250
hy+0.243
ifying+0.243
Neg logit contributions
 someone-0.068
Maybe-0.063
 everything-0.063
How-0.059
 something-0.059
Token they
Token ID484
Feature activation+2.83e-01

Pos logit contributions
ocol+0.283
 rib+0.252
 microsc+0.244
ifying+0.238
hy+0.237
Neg logit contributions
 someone-0.026
 everything-0.022
Maybe-0.020
 dad-0.018
 something-0.018
Token decided
Token ID3066
Feature activation-5.94e-02

Pos logit contributions
ocol+0.558
 rib+0.485
 microsc+0.463
ifying+0.453
hy+0.447
Neg logit contributions
 someone-0.206
 everything-0.195
 something-0.187
 dad-0.184
Maybe-0.183
Token to
Token ID284
Feature activation+3.50e-01

Pos logit contributions
 someone+0.069
 everything+0.067
Maybe+0.067
 dad+0.066
How+0.065
Neg logit contributions
ocol-0.085
 rib-0.073
 microsc-0.068
hy-0.065
ifying-0.065
Token pick
Token ID2298
Feature activation-2.45e-01

Pos logit contributions
ocol+0.658
 rib+0.561
 microsc+0.538
ifying+0.524
hy+0.518
Neg logit contributions
 someone-0.286
 everything-0.278
 dad-0.264
 something-0.263
Maybe-0.261
Token some
Token ID617
Feature activation+4.71e-01

Pos logit contributions
 someone+0.127
 dad+0.119
Maybe+0.117
 everything+0.114
 looked+0.108
Neg logit contributions
ocol-0.525
 rib-0.465
 microsc-0.439
ifying-0.426
 topp-0.423
Token strawberries
Token ID36973
Feature activation-3.42e-02

Pos logit contributions
ocol+0.817
 rib+0.683
 microsc+0.650
ifying+0.630
hy+0.619
Neg logit contributions
 someone-0.458
 everything-0.447
 dad-0.432
Maybe-0.426
 something-0.424
Token and
Token ID290
Feature activation+8.92e-01

Pos logit contributions
 someone+0.025
 everything+0.024
 dad+0.023
Maybe+0.023
 something+0.022
Neg logit contributions
ocol-0.067
 rib-0.058
 microsc-0.055
ifying-0.054
hy-0.052
Token and
Token ID290
Feature activation-5.62e-01

Pos logit contributions
Maybe+0.198
 someone+0.191
Mom+0.185
How+0.174
They+0.174
Neg logit contributions
ocol-0.994
 rib-0.891
 microsc-0.865
rou-0.813
ifying-0.812
Token he
Token ID339
Feature activation-1.87e-01

Pos logit contributions
 someone+0.454
Maybe+0.449
 everything+0.435
Mom+0.431
They+0.430
Neg logit contributions
ocol-1.027
 rib-0.882
 microsc-0.861
 former-0.809
hy-0.802
Token wondered
Token ID14028
Feature activation-3.79e-01

Pos logit contributions
 someone+0.211
 everything+0.205
 something+0.198
 dad+0.196
They+0.190
Neg logit contributions
ocol-0.283
 rib-0.244
 microsc-0.231
hy-0.218
ifying-0.216
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
Maybe+0.140
 someone+0.136
Mom+0.125
 everything+0.122
They+0.119
Neg logit contributions
ocol-0.851
 rib-0.759
 microsc-0.741
ifying-0.707
rou-0.701
Token Finally
Token ID9461
Feature activation-8.71e-01

Pos logit contributions
 someone+0.208
 everything+0.208
 dad+0.201
Maybe+0.194
 something+0.194
Neg logit contributions
ocol-0.542
 rib-0.464
 microsc-0.455
hy-0.436
 former-0.436
Token,
Token ID11
Feature activation-3.56e-01

Pos logit contributions
Maybe+0.781
How+0.725
They+0.725
 someone+0.692
 everything+0.671
Neg logit contributions
ocol-1.601
 rib-1.307
 microsc-1.236
hy-1.219
ifying-1.176
Token he
Token ID339
Feature activation-5.75e-02

Pos logit contributions
Maybe+0.198
 someone+0.193
 everything+0.192
They+0.191
 dad+0.186
Neg logit contributions
ocol-0.728
 rib-0.636
 microsc-0.611
ifying-0.599
hy-0.589
Token had
Token ID550
Feature activation+8.53e-01

Pos logit contributions
 someone+0.043
 everything+0.043
 something+0.040
 dad+0.039
Maybe+0.038
Neg logit contributions
ocol-0.108
 rib-0.095
 microsc-0.091
ifying-0.088
hy-0.087
Token an
Token ID281
Feature activation+2.64e-01

Pos logit contributions
ocol+1.765
 rib+1.502
 microsc+1.452
ifying+1.429
hy+1.414
Neg logit contributions
 someone-0.587
 everything-0.571
 something-0.539
 dad-0.535
 one-0.526
Token idea
Token ID2126
Feature activation+3.82e-01

Pos logit contributions
ocol+0.428
 rib+0.350
 microsc+0.334
hy+0.322
 former+0.318
Neg logit contributions
 someone-0.285
 everything-0.282
Maybe-0.272
 something-0.267
 dad-0.265
Token!
Token ID0
Feature activation+4.95e-02

Pos logit contributions
ocol+0.806
 rib+0.698
 microsc+0.676
hy+0.655
ifying+0.654
Neg logit contributions
 someone-0.224
 everything-0.220
 dad-0.203
Maybe-0.201
 something-0.199
Token had
Token ID550
Feature activation+3.61e-01

Pos logit contributions
ocol+0.376
 rib+0.324
 microsc+0.312
ifying+0.304
hy+0.300
Neg logit contributions
 someone-0.147
 everything-0.140
 something-0.134
 dad-0.133
Maybe-0.131
Token all
Token ID477
Feature activation+4.94e-01

Pos logit contributions
ocol+0.670
 rib+0.571
 microsc+0.548
ifying+0.529
hy+0.524
Neg logit contributions
 someone-0.301
 everything-0.292
 dad-0.281
Maybe-0.281
 something-0.274
Token of
Token ID286
Feature activation+4.84e-01

Pos logit contributions
ocol+0.908
 rib+0.767
 microsc+0.737
ifying+0.712
hy+0.708
Neg logit contributions
 someone-0.433
 everything-0.421
 dad-0.403
Maybe-0.399
 something-0.398
Token the
Token ID262
Feature activation+7.28e-01

Pos logit contributions
ocol+0.995
 rib+0.857
 microsc+0.829
ifying+0.800
hy+0.794
Neg logit contributions
 someone-0.317
 everything-0.308
Maybe-0.289
 dad-0.289
 something-0.284
Token colors
Token ID7577
Feature activation-3.22e-01

Pos logit contributions
ocol+1.342
 rib+1.125
 microsc+1.084
ifying+1.059
hy+1.045
Neg logit contributions
 someone-0.639
 everything-0.628
 dad-0.598
 something-0.594
 one-0.587
Token,
Token ID11
Feature activation-2.24e-01

Pos logit contributions
 someone+0.262
 everything+0.246
Maybe+0.243
How+0.242
 dad+0.242
Neg logit contributions
ocol-0.595
 rib-0.525
 microsc-0.485
ifying-0.483
hy-0.481
Token and
Token ID290
Feature activation+4.81e-01

Pos logit contributions
 someone+0.139
 everything+0.132
Maybe+0.130
 dad+0.126
How+0.122
Neg logit contributions
ocol-0.455
 rib-0.406
 microsc-0.382
ifying-0.371
hy-0.369
Token they
Token ID484
Feature activation+1.01e-01

Pos logit contributions
ocol+1.056
 rib+0.923
 microsc+0.892
ifying+0.869
hy+0.863
Neg logit contributions
 someone-0.240
 everything-0.230
 dad-0.211
 something-0.209
Maybe-0.208
Token put
Token ID1234
Feature activation-8.29e-01

Pos logit contributions
ocol+0.200
 rib+0.175
 microsc+0.167
ifying+0.163
hy+0.162
Neg logit contributions
 someone-0.073
 everything-0.068
 something-0.066
Maybe-0.064
 dad-0.064
Token them
Token ID606
Feature activation-6.34e-01

Pos logit contributions
 someone+0.586
Maybe+0.582
How+0.545
They+0.537
Nothing+0.535
Neg logit contributions
ocol-1.635
 rib-1.475
 microsc-1.326
ifying-1.317
 topp-1.284
Token in
Token ID287
Feature activation+2.76e-01

Pos logit contributions
 someone+0.496
Maybe+0.473
How+0.452
 everything+0.439
 dad+0.435
Neg logit contributions
ocol-1.198
 rib-1.086
 microsc-0.994
ifying-0.993
hy-0.957
Token to
Token ID284
Feature activation+1.76e-01

Pos logit contributions
ocol+1.246
 rib+1.016
 microsc+0.973
ifying+0.945
hy+0.932
Neg logit contributions
 someone-0.834
 everything-0.822
 dad-0.789
 something-0.787
 one-0.777
Token yawn
Token ID40908
Feature activation+3.55e-01

Pos logit contributions
ocol+0.336
 rib+0.291
 microsc+0.278
 former+0.269
ifying+0.269
Neg logit contributions
 someone-0.135
 everything-0.131
 dad-0.126
Maybe-0.125
 something-0.122
Token.
Token ID13
Feature activation+7.44e-02

Pos logit contributions
ocol+0.733
 rib+0.634
 microsc+0.610
ifying+0.603
hy+0.597
Neg logit contributions
 someone-0.217
 everything-0.212
 dad-0.201
Maybe-0.194
 something-0.194
Token It
Token ID632
Feature activation+2.12e-01

Pos logit contributions
ocol+0.122
 rib+0.103
 microsc+0.099
ifying+0.095
hy+0.094
Neg logit contributions
 someone-0.076
 everything-0.075
 dad-0.073
 something-0.072
Maybe-0.072
Token seemed
Token ID3947
Feature activation+6.67e-01

Pos logit contributions
ocol+0.406
 rib+0.354
 microsc+0.337
ifying+0.330
hy+0.328
Neg logit contributions
 someone-0.156
 everything-0.151
 dad-0.149
 something-0.143
Maybe-0.139
Token like
Token ID588
Feature activation-4.78e-01

Pos logit contributions
ocol+1.237
 rib+1.041
 microsc+1.003
ifying+0.974
hy+0.964
Neg logit contributions
 someone-0.574
 everything-0.562
 dad-0.534
 something-0.532
Maybe-0.524
Token the
Token ID262
Feature activation-2.33e-01

Pos logit contributions
Maybe+0.231
 someone+0.227
 everything+0.220
They+0.215
How+0.214
Neg logit contributions
ocol-1.017
 rib-0.914
 microsc-0.867
hy-0.831
rou-0.824
Token song
Token ID3496
Feature activation-5.70e-02

Pos logit contributions
 someone+0.196
 everything+0.192
Maybe+0.184
 dad+0.183
They+0.182
Neg logit contributions
ocol-0.419
 rib-0.368
 microsc-0.342
hy-0.335
ifying-0.335
Token and
Token ID290
Feature activation-5.65e-02

Pos logit contributions
 someone+0.042
 everything+0.040
 dad+0.040
Maybe+0.039
 something+0.038
Neg logit contributions
ocol-0.109
 rib-0.096
 microsc-0.090
hy-0.089
ifying-0.089
Token the
Token ID262
Feature activation-3.38e-01

Pos logit contributions
 someone+0.039
 everything+0.037
Maybe+0.036
 dad+0.036
 something+0.035
Neg logit contributions
ocol-0.111
 rib-0.098
 microsc-0.093
hy-0.089
ifying-0.089
Token warm
Token ID5814
Feature activation-7.09e-02

Pos logit contributions
 someone+0.274
 everything+0.273
Maybe+0.263
They+0.258
 something+0.252
Neg logit contributions
ocol-0.614
 rib-0.545
 microsc-0.504
ifying-0.492
 former-0.491
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
 someone+0.178
 everything+0.177
Maybe+0.171
How+0.164
 looked+0.163
Neg logit contributions
ocol-0.492
 rib-0.434
 microsc-0.407
hy-0.403
ifying-0.403
Token You
Token ID921
Feature activation+7.72e-02

Pos logit contributions
ocol+0.519
 rib+0.430
 microsc+0.412
ifying+0.403
hy+0.398
Neg logit contributions
 everything-0.298
 someone-0.296
 dad-0.280
 something-0.279
Maybe-0.276
Token are
Token ID389
Feature activation-8.01e-02

Pos logit contributions
ocol+0.123
 rib+0.104
 microsc+0.097
ifying+0.096
hy+0.095
Neg logit contributions
 someone-0.083
 everything-0.081
 dad-0.079
 something-0.078
Maybe-0.077
Token welcome
Token ID7062
Feature activation+2.84e-02

Pos logit contributions
Maybe+0.055
 someone+0.055
 everything+0.054
 looked+0.050
 dad+0.049
Neg logit contributions
ocol-0.159
 rib-0.136
 microsc-0.131
ifying-0.128
hy-0.127
Token to
Token ID284
Feature activation-2.52e-01

Pos logit contributions
ocol+0.052
 rib+0.045
 microsc+0.042
hy+0.041
ifying+0.041
Neg logit contributions
 everything-0.023
 someone-0.023
Maybe-0.023
 looked-0.022
 something-0.021
Token play
Token ID711
Feature activation-7.00e-02

Pos logit contributions
Maybe+0.189
 someone+0.180
 everything+0.178
 looked+0.173
 something+0.171
Neg logit contributions
ocol-0.490
 rib-0.416
 microsc-0.397
 former-0.387
hy-0.387
Token in
Token ID287
Feature activation+7.18e-03

Pos logit contributions
 everything+0.042
 someone+0.041
 dad+0.040
Maybe+0.039
 looked+0.039
Neg logit contributions
ocol-0.144
 rib-0.125
 microsc-0.119
ifying-0.116
hy-0.115
Token the
Token ID262
Feature activation-1.09e-01

Pos logit contributions
ocol+0.013
 rib+0.011
 microsc+0.010
ifying+0.010
 former+0.010
Neg logit contributions
 someone-0.006
 everything-0.006
Maybe-0.006
 looked-0.006
 dad-0.006
Token p
Token ID279
Feature activation-1.95e-01

Pos logit contributions
 someone+0.080
 everything+0.079
Maybe+0.077
 dad+0.074
 looked+0.074
Neg logit contributions
ocol-0.209
 rib-0.180
 microsc-0.174
ifying-0.167
 former-0.167
Tokenudd
Token ID4185
Feature activation+1.62e-01

Pos logit contributions
 someone+0.262
 everything+0.251
Maybe+0.249
 dad+0.247
 something+0.245
Neg logit contributions
ocol-0.276
 rib-0.212
 microsc-0.207
rou-0.202
 former-0.199
Tokenles
Token ID829
Feature activation+1.30e-01

Pos logit contributions
ocol+0.271
 rib+0.221
 microsc+0.210
ifying+0.201
hy+0.198
Neg logit contributions
 someone-0.174
 everything-0.167
Maybe-0.166
 dad-0.166
 something-0.164
Token was
Token ID373
Feature activation+4.76e-02

Pos logit contributions
 everything+0.357
 someone+0.356
 dad+0.342
Maybe+0.338
They+0.336
Neg logit contributions
ocol-0.811
 rib-0.723
hy-0.677
 microsc-0.677
ifying-0.645
Token important
Token ID1593
Feature activation+6.29e-01

Pos logit contributions
ocol+0.090
 rib+0.077
 microsc+0.074
ifying+0.072
hy+0.071
Neg logit contributions
 someone-0.036
Maybe-0.036
 everything-0.036
They-0.034
 dad-0.034
Token.
Token ID13
Feature activation-1.89e-01

Pos logit contributions
ocol+1.361
 rib+1.178
 microsc+1.141
ifying+1.114
hy+1.105
Neg logit contributions
 someone-0.345
 everything-0.334
 dad-0.309
 something-0.305
Maybe-0.300
Token She
Token ID1375
Feature activation+5.88e-01

Pos logit contributions
 everything+0.149
 someone+0.146
 dad+0.141
 one+0.140
 looked+0.139
Neg logit contributions
ocol-0.343
 rib-0.294
 microsc-0.288
ifying-0.278
hy-0.278
Token did
Token ID750
Feature activation-8.14e-01

Pos logit contributions
ocol+1.057
 rib+0.888
 microsc+0.852
ifying+0.826
hy+0.819
Neg logit contributions
 someone-0.538
 everything-0.529
 dad-0.504
 something-0.500
Maybe-0.495
Token not
Token ID407
Feature activation-1.54e+00

Pos logit contributions
Maybe+0.303
 looked+0.274
Mom+0.262
 dad+0.262
 someone+0.262
Neg logit contributions
ocol-1.764
 rib-1.588
 microsc-1.543
hy-1.494
rou-1.489
Token know
Token ID760
Feature activation-4.61e-01

Pos logit contributions
How+1.665
Maybe+1.641
 everything+1.629
 someone+1.621
 something+1.566
Neg logit contributions
ocol-2.298
hy-1.975
 rib-1.918
 microsc-1.813
 former-1.756
Token the
Token ID262
Feature activation-2.68e-01

Pos logit contributions
Maybe+0.231
 someone+0.219
 looked+0.214
They+0.209
 everything+0.207
Neg logit contributions
ocol-0.983
 rib-0.863
 microsc-0.837
ifying-0.791
rou-0.790
Token money
Token ID1637
Feature activation-3.18e-01

Pos logit contributions
 everything+0.239
 someone+0.233
They+0.223
Maybe+0.221
 looked+0.221
Neg logit contributions
ocol-0.460
 rib-0.404
 microsc-0.377
hy-0.368
 topp-0.364
Token was
Token ID373
Feature activation-4.51e-02

Pos logit contributions
 someone+0.261
 everything+0.257
Maybe+0.248
How+0.246
 dad+0.245
Neg logit contributions
ocol-0.563
 rib-0.495
hy-0.468
 microsc-0.458
ifying-0.446
Token for
Token ID329
Feature activation-8.28e-01

Pos logit contributions
 someone+0.034
Maybe+0.034
 everything+0.034
They+0.032
 dad+0.032
Neg logit contributions
ocol-0.085
 rib-0.073
 microsc-0.070
ifying-0.068
hy-0.067
Token and
Token ID290
Feature activation-4.69e-01

Pos logit contributions
ocol+1.036
 rib+0.898
 microsc+0.866
ifying+0.850
hy+0.840
Neg logit contributions
 someone-0.265
 everything-0.256
 dad-0.239
Maybe-0.232
 something-0.232
Token wet
Token ID9583
Feature activation-2.51e-01

Pos logit contributions
 someone+0.362
Maybe+0.341
 dad+0.341
 everything+0.339
They+0.325
Neg logit contributions
ocol-0.876
 rib-0.766
 microsc-0.721
ifying-0.720
hy-0.694
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
 someone+0.158
 dad+0.151
 everything+0.150
Maybe+0.149
How+0.140
Neg logit contributions
ocol-0.507
 rib-0.440
ifying-0.418
 microsc-0.415
hy-0.408
Token They
Token ID1119
Feature activation+2.25e-01

Pos logit contributions
ocol+0.038
 rib+0.033
 microsc+0.032
ifying+0.031
hy+0.031
Neg logit contributions
 everything-0.014
 someone-0.014
 dad-0.013
 something-0.013
 looked-0.013
Token did
Token ID750
Feature activation-1.20e-01

Pos logit contributions
ocol+0.424
 rib+0.366
 microsc+0.350
ifying+0.343
hy+0.338
Neg logit contributions
 someone-0.180
 everything-0.174
 something-0.165
 dad-0.163
Maybe-0.163
Token not
Token ID407
Feature activation-1.98e+00

Pos logit contributions
 someone+0.034
 dad+0.030
Maybe+0.029
 everything+0.028
 looked+0.025
Neg logit contributions
ocol-0.281
 rib-0.252
 microsc-0.244
ifying-0.238
rou-0.235
Token know
Token ID760
Feature activation-5.55e-01

Pos logit contributions
How+2.288
Maybe+2.236
Mom+2.193
 someone+2.185
Nothing+2.149
Neg logit contributions
ocol-2.976
 rib-2.418
onna-2.367
hy-2.335
leaf-2.219
Token how
Token ID703
Feature activation+2.16e-01

Pos logit contributions
Maybe+0.219
 someone+0.204
How+0.196
 looked+0.193
They+0.185
Neg logit contributions
ocol-1.238
 rib-1.105
 microsc-1.068
ifying-1.025
rou-1.007
Token to
Token ID284
Feature activation-6.51e-01

Pos logit contributions
ocol+0.358
 rib+0.299
 microsc+0.286
ifying+0.276
hy+0.270
Neg logit contributions
 someone-0.220
 everything-0.215
Maybe-0.209
 dad-0.208
 something-0.206
Token swim
Token ID9422
Feature activation-6.31e-01

Pos logit contributions
 someone+0.501
 everything+0.487
Maybe+0.480
 something+0.466
How+0.465
Neg logit contributions
ocol-1.218
 rib-1.062
hy-0.980
 microsc-0.967
ifying-0.960
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
 someone+0.475
 dad+0.461
 everything+0.446
Maybe+0.440
Mom+0.440
Neg logit contributions
ocol-1.142
 rib-1.011
ifying-0.981
 microsc-0.962
hy-0.936
Token please
Token ID3387
Feature activation-4.90e-01

Pos logit contributions
ocol+1.703
 rib+1.467
 microsc+1.429
ifying+1.401
hy+1.383
Neg logit contributions
 someone-0.416
 everything-0.399
 dad-0.371
 something-0.364
 one-0.354
Token,"
Token ID553
Feature activation+5.73e-01

Pos logit contributions
 someone+0.527
 everything+0.527
Maybe+0.512
How+0.500
 dad+0.498
Neg logit contributions
ocol-0.764
 rib-0.656
hy-0.598
 microsc-0.589
 topp-0.561
Token Tom
Token ID4186
Feature activation+1.29e-01

Pos logit contributions
ocol+1.288
 rib+1.124
 microsc+1.088
ifying+1.062
hy+1.057
Neg logit contributions
 someone-0.267
 everything-0.258
 dad-0.233
 something-0.230
Maybe-0.227
Token said
Token ID531
Feature activation+3.52e-01

Pos logit contributions
ocol+0.228
 rib+0.198
 microsc+0.187
hy+0.185
ifying+0.177
Neg logit contributions
 everything-0.112
 someone-0.112
 something-0.104
Maybe-0.103
 dad-0.103
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
ocol+0.748
 rib+0.651
 microsc+0.624
hy+0.613
ifying+0.608
Neg logit contributions
 someone-0.195
 everything-0.194
Maybe-0.177
 dad-0.175
 something-0.174
Token "
Token ID366
Feature activation-1.33e+00

Pos logit contributions
 everything+0.985
 something+0.944
 someone+0.941
 looked+0.917
 dad+0.904
Neg logit contributions
ocol-1.511
 rib-1.373
hy-1.275
onna-1.252
 microsc-1.246
TokenWe
Token ID1135
Feature activation+1.97e-01

Pos logit contributions
 everything+1.166
 dad+1.133
 someone+1.110
 looked+1.092
 something+1.066
Neg logit contributions
ocol-2.231
 rib-2.090
ifying-1.907
leaf-1.879
hy-1.873
Token live
Token ID2107
Feature activation+1.48e-01

Pos logit contributions
ocol+0.330
 rib+0.276
 microsc+0.261
ifying+0.254
hy+0.252
Neg logit contributions
 someone-0.203
 everything-0.197
 dad-0.190
 something-0.189
Maybe-0.186
Token on
Token ID319
Feature activation+1.01e+00

Pos logit contributions
ocol+0.290
 rib+0.251
 microsc+0.239
ifying+0.234
 former+0.229
Neg logit contributions
 someone-0.107
 everything-0.105
 dad-0.099
Maybe-0.099
 looked-0.097
Token Green
Token ID3469
Feature activation+3.75e-02

Pos logit contributions
ocol+2.179
 rib+1.862
 microsc+1.807
ifying+1.778
hy+1.766
Neg logit contributions
 someone-0.601
 everything-0.583
 something-0.541
 dad-0.532
 one-0.526
Token Street
Token ID3530
Feature activation-2.39e-01

Pos logit contributions
ocol+0.065
 rib+0.055
 microsc+0.052
ifying+0.050
hy+0.049
Neg logit contributions
 someone-0.037
 everything-0.035
Maybe-0.035
 dad-0.034
 something-0.034
Token brought
Token ID3181
Feature activation-7.84e-01

Pos logit contributions
 someone+0.213
 everything+0.208
Maybe+0.197
 something+0.191
 dad+0.190
Neg logit contributions
ocol-0.556
 rib-0.503
 microsc-0.464
 topp-0.447
hy-0.445
Token it
Token ID340
Feature activation-4.21e-01

Pos logit contributions
Maybe+0.455
How+0.452
They+0.434
 someone+0.427
Nothing+0.416
Neg logit contributions
ocol-1.617
 rib-1.469
 microsc-1.317
 topp-1.267
ifying-1.265
Token home
Token ID1363
Feature activation+6.23e-01

Pos logit contributions
 someone+0.402
Maybe+0.397
 everything+0.389
How+0.384
They+0.381
Neg logit contributions
ocol-0.690
 rib-0.607
 microsc-0.569
ifying-0.542
hy-0.542
Token.
Token ID13
Feature activation+7.19e-01

Pos logit contributions
ocol+1.365
 rib+1.184
 microsc+1.147
ifying+1.121
hy+1.112
Neg logit contributions
 someone-0.326
 everything-0.315
 dad-0.290
 something-0.286
Maybe-0.281
Token The
Token ID383
Feature activation-5.42e-01

Pos logit contributions
ocol+1.108
 rib+0.894
 microsc+0.853
ifying+0.824
hy+0.811
Neg logit contributions
 someone-0.857
 everything-0.844
 dad-0.816
 something-0.809
Maybe-0.802
Token next
Token ID1306
Feature activation-2.00e+00

Pos logit contributions
 everything+0.587
 someone+0.580
Maybe+0.558
 looked+0.556
 where+0.552
Neg logit contributions
ocol-0.787
 rib-0.714
ifying-0.629
 microsc-0.623
hy-0.621
Token day
Token ID1110
Feature activation+3.75e-01

Pos logit contributions
 everything+1.909
 what+1.865
They+1.859
 someone+1.854
Maybe+1.837
Neg logit contributions
ocol-3.011
 rib-2.775
 gol-2.749
 topp-2.616
pri-2.601
Token,
Token ID11
Feature activation-2.19e-01

Pos logit contributions
ocol+0.771
 rib+0.666
 microsc+0.641
ifying+0.625
hy+0.620
Neg logit contributions
 someone-0.235
 everything-0.228
 dad-0.216
Maybe-0.216
 something-0.210
Token Lucy
Token ID22162
Feature activation-1.19e-01

Pos logit contributions
 someone+0.246
Maybe+0.240
 dad+0.239
 everything+0.238
How+0.232
Neg logit contributions
ocol-0.319
 rib-0.271
 microsc-0.251
ifying-0.241
hy-0.234
Token and
Token ID290
Feature activation+1.89e-01

Pos logit contributions
 someone+0.083
 everything+0.081
 something+0.075
 dad+0.073
Maybe+0.072
Neg logit contributions
ocol-0.231
 rib-0.204
 microsc-0.196
hy-0.189
ifying-0.188
Token her
Token ID607
Feature activation-2.85e-01

Pos logit contributions
ocol+0.375
 rib+0.327
 microsc+0.314
hy+0.303
ifying+0.297
Neg logit contributions
 someone-0.127
 everything-0.121
Maybe-0.119
 something-0.115
 dad-0.113
Token and
Token ID290
Feature activation-5.42e-01

Pos logit contributions
 someone+0.478
Maybe+0.475
 dad+0.462
How+0.452
 looked+0.451
Neg logit contributions
ocol-0.633
 rib-0.552
 microsc-0.523
rou-0.480
onna-0.479
Token tried
Token ID3088
Feature activation-2.10e-01

Pos logit contributions
 someone+0.447
 everything+0.425
 dad+0.420
Maybe+0.415
 something+0.405
Neg logit contributions
ocol-0.993
 rib-0.866
 microsc-0.825
 topp-0.796
 gol-0.787
Token but
Token ID475
Feature activation+4.59e-01

Pos logit contributions
 someone+0.170
Maybe+0.167
 dad+0.164
 everything+0.159
 looked+0.158
Neg logit contributions
ocol-0.378
 rib-0.332
 microsc-0.319
rou-0.298
hy-0.297
Token couldn
Token ID3521
Feature activation+4.35e-02

Pos logit contributions
ocol+1.023
 rib+0.895
 microsc+0.868
ifying+0.845
hy+0.839
Neg logit contributions
 someone-0.216
 everything-0.207
 dad-0.190
Maybe-0.187
 something-0.186
Token't
Token ID470
Feature activation-4.27e-01

Pos logit contributions
ocol+0.071
 rib+0.060
 microsc+0.056
hy+0.056
ifying+0.056
Neg logit contributions
 someone-0.043
 everything-0.043
 dad-0.043
 something-0.041
Maybe-0.041
Token get
Token ID651
Feature activation-1.40e+00

Pos logit contributions
 everything+0.286
 someone+0.278
Maybe+0.263
 something+0.258
 dad+0.257
Neg logit contributions
ocol-0.825
 rib-0.722
 microsc-0.688
hy-0.670
 gol-0.666
Token the
Token ID262
Feature activation-4.42e-02

Pos logit contributions
Maybe+1.233
They+1.172
Mom+1.100
How+1.083
 someone+1.070
Neg logit contributions
ocol-2.532
 rib-2.200
 microsc-2.036
hy-1.963
rou-1.953
Token ball
Token ID2613
Feature activation-7.51e-01

Pos logit contributions
 someone+0.041
 everything+0.040
Maybe+0.038
 dad+0.038
They+0.038
Neg logit contributions
ocol-0.077
 rib-0.068
 microsc-0.062
ifying-0.062
hy-0.061
Token in
Token ID287
Feature activation+1.69e-01

Pos logit contributions
 someone+0.551
 dad+0.547
Maybe+0.538
 everything+0.537
How+0.530
Neg logit contributions
ocol-1.354
 rib-1.227
hy-1.182
ifying-1.127
 microsc-1.120
Token the
Token ID262
Feature activation-2.68e-01

Pos logit contributions
ocol+0.341
 rib+0.297
 microsc+0.286
ifying+0.277
hy+0.274
Neg logit contributions
 someone-0.107
 everything-0.104
 dad-0.100
Maybe-0.098
 looked-0.096
Token basket
Token ID7988
Feature activation+2.61e-01

Pos logit contributions
 someone+0.215
 everything+0.205
Maybe+0.205
 dad+0.204
 they+0.198
Neg logit contributions
ocol-0.496
 rib-0.431
ifying-0.418
 microsc-0.413
hy-0.411