TOP ACTIVATIONS
MAX = 2.504

and saw a magical fair yt ale creature called a n
ashamed and sorry . " I wanted to see what it
. Nora smiled and looked out into the me adow ,
sorry and said , " I 'm sorry , mom .
The child smiled and looked out the window again . In
. He said , " I am sorry , Lily .
. She said , " I did not know , Mama
bad and said , " I know , mom . I
thing . She said " I 'm sorry , I 'm
. She says , " I 'm sorry , Mom .
cry and said , " I 'm sorry , Mom my
alone ." " I 'm sorry , Mom my
felt bad and said ," I 'm sorry , let me
computer . She said " I 'm sorry , dad .
aches ." " I 'm sorry , Tom ,"
. I was mad . I wanted you to play with
The little girl was su pr ised . She had never
to share and wait . How about we clean up this
Lily . They were very close , but they were also
down and the butterfly slowly unw ound the thread .

MIN ACTIVATIONS
MIN = -2.612

clue . The next clue said : " Look
wrong . The next day , the store owner
lonely . The next day , the sun came
unfair . The next day , they decided to
say . " Me too . I wish I
later . The next day , Ben and Lily
. They hoped that the next time they had a picnic
owner . The next day , Tim my went
sad . The next day , Lucy was very
them . The next day , Lily went to
, but they don 't see any mud . They only
soon . The next day , Anna checked on
. The next day , Lucy took the
sad and angry . The next day , Tim wanted to
happy . The next day , they went back
hurt . The next day , Tom and Lisa
every night . She hugs it and squee zes it tight
. It wasn 't scary after all .
er . The next day , she was very
later . The next day , they went back

INTERVAL 1.225 - 2.504
CONTAINS 0.590%

of scissors and carefully cut the tape . Now , Lucy
bird was happy and flew close to Lily . They both
Anna . I was curious . I won 't do it
The bird replied with a tweet and flew away .
I was nos y . I love you ."

INTERVAL -0.054 - 1.225
CONTAINS 53.436%

mom . While she was there , she saw a big
person you are lending it to . <|endoftext|> Once upon a
and climbed the swings one last time . Her Dad said
." With her help , her little brother soon stopped crying
the sky , and the rad ish floated up and back

INTERVAL -1.333 - -0.054
CONTAINS 45.659%

loved to dress up in her princess costume and dance around
has floppy ears ." S ara 's mom nodded
place . But to do that , the boat needed to
to mom and hug her . Mom forg ives
. <|endoftext|> One day , Jack and Jill went out to

INTERVAL -2.612 - -1.333
CONTAINS 0.315%

the tank . They do not see the tank . They
One day , Lily didn 't feel good . She had
up ." The next day , the sun came
soon . But they did not . They lied . They
they love dad and his laptop . They hug dad and
Token and
Token ID290
Feature activation-2.84e-01

Pos logit contributions
ocol+0.649
 rib+0.560
 microsc+0.538
ifying+0.527
hy+0.519
Neg logit contributions
 someone-0.214
 everything-0.207
Maybe-0.195
 dad-0.193
 something-0.188
Token saw
Token ID2497
Feature activation+1.11e-01

Pos logit contributions
 someone+0.206
 everything+0.203
 dad+0.195
Maybe+0.192
How+0.188
Neg logit contributions
ocol-0.527
 rib-0.466
 microsc-0.444
ifying-0.421
hy-0.419
Token a
Token ID257
Feature activation-2.22e-02

Pos logit contributions
ocol+0.248
 rib+0.218
 microsc+0.213
hy+0.204
ifying+0.203
Neg logit contributions
 someone-0.048
 everything-0.047
Maybe-0.045
 dad-0.043
They-0.041
Token magical
Token ID10883
Feature activation+3.21e-01

Pos logit contributions
 everything+0.017
 someone+0.017
Maybe+0.016
 dad+0.016
 something+0.016
Neg logit contributions
ocol-0.041
 rib-0.036
 microsc-0.035
ifying-0.033
hy-0.033
Token fair
Token ID3148
Feature activation-6.27e-02

Pos logit contributions
ocol+0.590
 rib+0.499
 microsc+0.477
ifying+0.462
hy+0.459
Neg logit contributions
 someone-0.273
 everything-0.271
Maybe-0.258
 dad-0.257
 something-0.251
Tokenyt
Token ID20760
Feature activation+2.50e+00

Pos logit contributions
 someone+0.050
 everything+0.050
Maybe+0.049
How+0.047
 something+0.046
Neg logit contributions
ocol-0.118
 rib-0.101
 microsc-0.096
ifying-0.093
hy-0.093
Tokenale
Token ID1000
Feature activation+6.37e-01

Pos logit contributions
ocol+4.111
 microsc+3.620
 rib+3.500
 former+3.397
 topp+3.387
Neg logit contributions
 one-2.552
 someone-2.537
 everything-2.497
 where-2.488
 dad-2.481
Token creature
Token ID7185
Feature activation+2.74e-01

Pos logit contributions
ocol+1.279
 rib+1.094
 microsc+1.057
ifying+1.029
hy+1.020
Neg logit contributions
 someone-0.448
 everything-0.437
 dad-0.411
 something-0.407
Maybe-0.402
Token called
Token ID1444
Feature activation+6.88e-01

Pos logit contributions
ocol+0.551
 rib+0.474
 microsc+0.456
ifying+0.447
hy+0.444
Neg logit contributions
 someone-0.185
 everything-0.183
Maybe-0.171
 dad-0.171
 something-0.166
Token a
Token ID257
Feature activation+3.35e-01

Pos logit contributions
ocol+1.415
 rib+1.214
 microsc+1.174
ifying+1.144
hy+1.134
Neg logit contributions
 someone-0.452
 everything-0.439
 dad-0.411
 something-0.408
 one-0.398
Token n
Token ID299
Feature activation+9.10e-01

Pos logit contributions
ocol+0.612
 rib+0.520
 microsc+0.500
ifying+0.485
hy+0.480
Neg logit contributions
 someone-0.285
 everything-0.284
Maybe-0.267
 dad-0.267
 something-0.264
Token ashamed
Token ID22461
Feature activation-8.12e-01

Pos logit contributions
ocol+2.332
 rib+2.048
 microsc+1.984
ifying+1.943
 former+1.919
Neg logit contributions
 someone-0.373
 everything-0.345
 dad-0.320
 something-0.314
 one-0.289
Token and
Token ID290
Feature activation+5.05e-01

Pos logit contributions
Maybe+0.620
 someone+0.613
How+0.608
 everything+0.599
 looked+0.592
Neg logit contributions
ocol-1.454
 rib-1.220
 microsc-1.177
hy-1.133
ifying-1.133
Token sorry
Token ID7926
Feature activation-7.69e-01

Pos logit contributions
ocol+1.018
 rib+0.873
 microsc+0.843
ifying+0.820
hy+0.818
Neg logit contributions
 someone-0.350
 everything-0.343
 dad-0.319
 something-0.317
Maybe-0.316
Token.
Token ID13
Feature activation-5.33e-01

Pos logit contributions
 someone+0.558
Maybe+0.550
How+0.543
 everything+0.532
 dad+0.529
Neg logit contributions
ocol-1.372
 rib-1.229
 microsc-1.159
onna-1.125
rou-1.104
Token "
Token ID366
Feature activation+1.32e-01

Pos logit contributions
 everything+0.451
 someone+0.445
 looked+0.437
 one+0.432
 something+0.431
Neg logit contributions
ocol-0.927
 microsc-0.777
 rib-0.770
hy-0.725
onna-0.722
TokenI
Token ID40
Feature activation+2.45e+00

Pos logit contributions
ocol+0.276
 rib+0.238
 microsc+0.231
ifying+0.227
hy+0.225
Neg logit contributions
 everything-0.078
 someone-0.078
 dad-0.072
 something-0.071
 one-0.070
Token wanted
Token ID2227
Feature activation-4.34e-01

Pos logit contributions
ocol+5.660
 rib+4.660
ifying+4.517
 microsc+4.411
hy+4.242
Neg logit contributions
 looked-1.837
Maybe-1.803
 someone-1.702
 turned-1.699
 they-1.672
Token to
Token ID284
Feature activation+5.89e-01

Pos logit contributions
Maybe+0.528
 looked+0.509
 everything+0.506
They+0.505
 someone+0.502
Neg logit contributions
ocol-0.613
 rib-0.512
 microsc-0.483
 topp-0.438
ifying-0.437
Token see
Token ID766
Feature activation-4.27e-01

Pos logit contributions
ocol+1.177
 rib+1.007
 microsc+0.972
ifying+0.945
hy+0.939
Neg logit contributions
 someone-0.421
 everything-0.412
 dad-0.387
 something-0.384
Maybe-0.380
Token what
Token ID644
Feature activation-2.25e-01

Pos logit contributions
Maybe+0.297
 looked+0.295
 someone+0.278
 everything+0.277
They+0.271
Neg logit contributions
ocol-0.836
 rib-0.717
 microsc-0.680
ifying-0.649
rou-0.640
Token it
Token ID340
Feature activation+9.26e-02

Pos logit contributions
 someone+0.179
 everything+0.179
 dad+0.175
Maybe+0.174
 something+0.170
Neg logit contributions
ocol-0.399
 rib-0.348
 microsc-0.333
hy-0.317
 topp-0.315
Token.
Token ID13
Feature activation+5.75e-02

Pos logit contributions
ocol+0.497
 rib+0.432
 microsc+0.418
ifying+0.407
hy+0.405
Neg logit contributions
 someone-0.137
 everything-0.132
Maybe-0.125
 dad-0.123
 something-0.120
Token Nora
Token ID32561
Feature activation+2.56e-01

Pos logit contributions
ocol+0.095
 rib+0.079
 microsc+0.075
ifying+0.074
hy+0.072
Neg logit contributions
 someone-0.059
 everything-0.058
 dad-0.056
 something-0.055
Maybe-0.055
Token smiled
Token ID13541
Feature activation-2.18e-01

Pos logit contributions
ocol+0.475
 rib+0.412
 microsc+0.394
hy+0.381
ifying+0.380
Neg logit contributions
 someone-0.207
 everything-0.202
 something-0.191
 dad-0.190
Maybe-0.188
Token and
Token ID290
Feature activation+6.99e-03

Pos logit contributions
 someone+0.139
 everything+0.134
Maybe+0.126
How+0.121
 dad+0.120
Neg logit contributions
ocol-0.428
 rib-0.384
 microsc-0.361
ifying-0.360
rou-0.356
Token looked
Token ID3114
Feature activation+4.32e-01

Pos logit contributions
ocol+0.012
 rib+0.011
 microsc+0.010
hy+0.010
ifying+0.010
Neg logit contributions
 someone-0.006
 everything-0.006
 something-0.005
Maybe-0.005
 dad-0.005
Token out
Token ID503
Feature activation+2.42e+00

Pos logit contributions
ocol+0.781
 rib+0.657
 microsc+0.631
ifying+0.614
hy+0.604
Neg logit contributions
 someone-0.390
 everything-0.378
 dad-0.364
Maybe-0.363
 something-0.359
Token into
Token ID656
Feature activation+1.57e+00

Pos logit contributions
ocol+5.242
 former+4.602
 microsc+4.492
 topp+4.447
 rib+4.404
Neg logit contributions
 everything-1.765
 one-1.648
 someone-1.606
 something-1.554
 very-1.523
Token the
Token ID262
Feature activation+7.87e-02

Pos logit contributions
ocol+3.423
 rib+2.915
hy+2.888
ifying+2.859
rou+2.847
Neg logit contributions
 everything-0.936
 someone-0.927
 something-0.873
 one-0.851
 dad-0.829
Token me
Token ID502
Feature activation-1.56e-01

Pos logit contributions
ocol+0.157
 rib+0.136
 microsc+0.131
ifying+0.129
 former+0.126
Neg logit contributions
 someone-0.054
 everything-0.052
Maybe-0.050
 dad-0.050
 something-0.049
Tokenadow
Token ID4584
Feature activation+6.01e-01

Pos logit contributions
 someone+0.221
 everything+0.218
Maybe+0.216
 dad+0.208
How+0.208
Neg logit contributions
ocol-0.203
 rib-0.152
 microsc-0.149
ifying-0.142
rou-0.141
Token,
Token ID11
Feature activation-4.25e-02

Pos logit contributions
ocol+1.288
 rib+1.114
 microsc+1.079
ifying+1.053
hy+1.045
Neg logit contributions
 someone-0.338
 everything-0.328
 dad-0.304
 something-0.299
Maybe-0.296
Token sorry
Token ID7926
Feature activation-5.06e-02

Pos logit contributions
 everything+0.178
 someone+0.175
Maybe+0.171
 dad+0.170
 looked+0.165
Neg logit contributions
ocol-0.500
 rib-0.421
 microsc-0.405
hy-0.400
ifying-0.393
Token and
Token ID290
Feature activation-1.09e-03

Pos logit contributions
 someone+0.046
 everything+0.044
Maybe+0.044
 dad+0.043
 looked+0.043
Neg logit contributions
ocol-0.088
 rib-0.074
 microsc-0.071
rou-0.068
ifying-0.068
Token said
Token ID531
Feature activation+7.62e-01

Pos logit contributions
 someone+0.001
 everything+0.001
Maybe+0.001
 dad+0.001
 something+0.001
Neg logit contributions
ocol-0.002
 rib-0.002
 microsc-0.002
hy-0.002
rou-0.002
Token,
Token ID11
Feature activation-7.64e-02

Pos logit contributions
ocol+1.638
 rib+1.411
 microsc+1.368
ifying+1.339
hy+1.320
Neg logit contributions
 someone-0.450
 everything-0.437
 dad-0.406
 something-0.402
 one-0.389
Token "
Token ID366
Feature activation-3.04e-01

Pos logit contributions
 someone+0.068
 everything+0.066
 something+0.063
Maybe+0.062
 dad+0.062
Neg logit contributions
ocol-0.132
 rib-0.115
 microsc-0.108
hy-0.106
ifying-0.105
TokenI
Token ID40
Feature activation+2.42e+00

Pos logit contributions
 everything+0.203
 someone+0.198
 something+0.186
 dad+0.184
 where+0.184
Neg logit contributions
ocol-0.601
 rib-0.532
 microsc-0.499
hy-0.495
ifying-0.494
Token'm
Token ID1101
Feature activation+1.06e+00

Pos logit contributions
ocol+5.083
 rib+4.051
ifying+3.979
 microsc+3.821
hy+3.675
Neg logit contributions
Maybe-2.355
 someone-2.319
 looked-2.275
 something-2.245
 one-2.220
Token sorry
Token ID7926
Feature activation-2.75e-01

Pos logit contributions
ocol+2.355
 rib+2.062
 microsc+1.992
ifying+1.960
hy+1.923
Neg logit contributions
 someone-0.575
 everything-0.543
 dad-0.511
 something-0.509
 one-0.486
Token,
Token ID11
Feature activation+5.59e-01

Pos logit contributions
Maybe+0.195
 someone+0.192
 everything+0.188
 looked+0.185
How+0.178
Neg logit contributions
ocol-0.533
 rib-0.459
 microsc-0.443
rou-0.422
hy-0.421
Token mom
Token ID1995
Feature activation+1.17e-01

Pos logit contributions
ocol+1.182
 rib+1.021
 microsc+0.987
ifying+0.961
hy+0.954
Neg logit contributions
 someone-0.336
 everything-0.327
 dad-0.301
 something-0.301
Maybe-0.298
Token.
Token ID13
Feature activation+4.98e-01

Pos logit contributions
ocol+0.242
 rib+0.210
 microsc+0.203
hy+0.198
rou+0.196
Neg logit contributions
 someone-0.071
 everything-0.068
Maybe-0.063
 something-0.063
 dad-0.062
TokenThe
Token ID464
Feature activation-7.72e-01

Pos logit contributions
 someone+0.368
 dad+0.358
 everything+0.358
 something+0.347
 one+0.346
Neg logit contributions
ocol-0.583
 rib-0.507
 microsc-0.466
ifying-0.464
hy-0.450
Token child
Token ID1200
Feature activation+2.25e-02

Pos logit contributions
 someone+0.782
 everything+0.764
They+0.744
 something+0.728
Maybe+0.717
Neg logit contributions
ocol-1.250
 rib-1.114
 microsc-1.039
hy-0.990
rou-0.975
Token smiled
Token ID13541
Feature activation-2.11e-01

Pos logit contributions
ocol+0.039
 rib+0.034
 microsc+0.032
hy+0.031
 former+0.031
Neg logit contributions
 someone-0.021
 everything-0.020
 something-0.019
 dad-0.019
Maybe-0.019
Token and
Token ID290
Feature activation-3.95e-01

Pos logit contributions
 someone+0.133
 everything+0.129
Maybe+0.122
How+0.119
 dad+0.115
Neg logit contributions
ocol-0.419
 rib-0.374
ifying-0.349
 microsc-0.349
 former-0.346
Token looked
Token ID3114
Feature activation+3.69e-01

Pos logit contributions
 someone+0.330
 everything+0.320
How+0.308
Maybe+0.306
 something+0.303
Neg logit contributions
ocol-0.687
 rib-0.622
 microsc-0.571
hy-0.561
 former-0.550
Token out
Token ID503
Feature activation+2.41e+00

Pos logit contributions
ocol+0.669
 rib+0.566
 microsc+0.540
ifying+0.527
hy+0.518
Neg logit contributions
 someone-0.329
 everything-0.318
Maybe-0.305
 dad-0.305
 something-0.301
Token the
Token ID262
Feature activation+2.88e-02

Pos logit contributions
ocol+5.531
 microsc+4.787
 former+4.781
pill+4.724
 topp+4.686
Neg logit contributions
 everything-1.427
 one-1.376
 someone-1.305
 they-1.239
 something-1.230
Token window
Token ID4324
Feature activation+9.68e-01

Pos logit contributions
ocol+0.052
 rib+0.045
 microsc+0.042
ifying+0.042
hy+0.041
Neg logit contributions
 someone-0.026
 everything-0.025
Maybe-0.025
 something-0.024
 they-0.024
Token again
Token ID757
Feature activation+1.11e+00

Pos logit contributions
ocol+2.121
 rib+1.820
 microsc+1.782
ifying+1.719
hy+1.701
Neg logit contributions
 someone-0.530
 everything-0.513
 dad-0.474
 something-0.473
 one-0.473
Token.
Token ID13
Feature activation-7.29e-01

Pos logit contributions
ocol+2.509
 rib+2.176
 microsc+2.120
hy+2.066
ifying+2.058
Neg logit contributions
 someone-0.540
 everything-0.515
 something-0.478
 one-0.477
 dad-0.474
Token In
Token ID554
Feature activation-5.54e-01

Pos logit contributions
 someone+0.770
 everything+0.768
 something+0.749
 dad+0.734
 somewhere+0.732
Neg logit contributions
ocol-1.103
 rib-0.931
 microsc-0.892
ifying-0.891
 former-0.870
Token.
Token ID13
Feature activation-2.70e-01

Pos logit contributions
 someone+0.097
 everything+0.096
Maybe+0.092
 dad+0.090
 something+0.087
Neg logit contributions
ocol-0.309
 rib-0.271
 microsc-0.261
hy-0.258
ifying-0.256
Token He
Token ID679
Feature activation+6.31e-01

Pos logit contributions
 everything+0.252
 someone+0.246
 dad+0.242
 looked+0.241
 one+0.238
Neg logit contributions
ocol-0.459
 rib-0.377
 microsc-0.376
hy-0.361
ifying-0.354
Token said
Token ID531
Feature activation+6.87e-01

Pos logit contributions
ocol+1.198
 rib+1.014
 microsc+0.977
ifying+0.950
hy+0.941
Neg logit contributions
 someone-0.515
 everything-0.504
 dad-0.478
 something-0.474
Maybe-0.469
Token,
Token ID11
Feature activation+3.32e-01

Pos logit contributions
ocol+1.448
 rib+1.248
 microsc+1.207
ifying+1.180
hy+1.168
Neg logit contributions
 someone-0.422
 everything-0.410
 dad-0.382
 something-0.378
Maybe-0.368
Token "
Token ID366
Feature activation+1.71e-01

Pos logit contributions
ocol+0.620
 rib+0.527
 microsc+0.506
ifying+0.489
hy+0.488
Neg logit contributions
 someone-0.272
 everything-0.268
Maybe-0.254
 dad-0.253
 something-0.253
TokenI
Token ID40
Feature activation+2.40e+00

Pos logit contributions
ocol+0.327
 rib+0.279
 microsc+0.267
ifying+0.262
hy+0.261
Neg logit contributions
 everything-0.131
 someone-0.131
 dad-0.126
 something-0.123
 where-0.120
Token am
Token ID716
Feature activation+2.69e-01

Pos logit contributions
ocol+4.938
 rib+4.056
ifying+3.964
 microsc+3.909
 former+3.753
Neg logit contributions
 someone-2.181
Maybe-2.110
 something-2.094
 everything-2.092
 one-2.089
Token sorry
Token ID7926
Feature activation-5.75e-01

Pos logit contributions
ocol+0.565
 rib+0.483
 microsc+0.468
hy+0.456
ifying+0.454
Neg logit contributions
 someone-0.160
 everything-0.158
Maybe-0.155
 dad-0.145
 looked-0.144
Token,
Token ID11
Feature activation+6.35e-01

Pos logit contributions
Maybe+0.442
 looked+0.417
 someone+0.408
 everything+0.405
How+0.403
Neg logit contributions
ocol-1.082
 rib-0.923
 microsc-0.894
rou-0.853
hy-0.850
Token Lily
Token ID20037
Feature activation+6.11e-01

Pos logit contributions
ocol+1.368
 rib+1.182
 microsc+1.144
ifying+1.117
hy+1.108
Neg logit contributions
 someone-0.363
 everything-0.352
 dad-0.326
 something-0.322
Maybe-0.317
Token.
Token ID13
Feature activation+9.63e-01

Pos logit contributions
ocol+1.336
 rib+1.158
 microsc+1.122
ifying+1.096
hy+1.088
Neg logit contributions
 someone-0.323
 everything-0.313
 dad-0.288
 something-0.284
Maybe-0.280
Token.
Token ID13
Feature activation-7.77e-01

Pos logit contributions
 someone+0.213
 everything+0.204
Maybe+0.203
 dad+0.197
 looked+0.196
Neg logit contributions
ocol-0.571
 rib-0.502
 microsc-0.481
rou-0.463
hy-0.458
Token She
Token ID1375
Feature activation+5.11e-01

Pos logit contributions
 everything+0.712
 someone+0.677
 looked+0.669
 somewhere+0.668
 one+0.665
Neg logit contributions
ocol-1.314
 microsc-1.121
 rib-1.119
hy-1.032
ifying-1.027
Token said
Token ID531
Feature activation+1.03e+00

Pos logit contributions
ocol+0.960
 rib+0.818
 microsc+0.787
ifying+0.762
hy+0.759
Neg logit contributions
 someone-0.420
 everything-0.413
 dad-0.389
 something-0.387
Maybe-0.382
Token,
Token ID11
Feature activation+2.62e-01

Pos logit contributions
ocol+2.252
 rib+1.938
 microsc+1.881
ifying+1.866
 former+1.841
Neg logit contributions
 someone-0.584
 everything-0.568
 dad-0.534
 something-0.523
 one-0.501
Token "
Token ID366
Feature activation+4.59e-02

Pos logit contributions
ocol+0.482
 rib+0.411
 microsc+0.393
ifying+0.382
hy+0.378
Neg logit contributions
 someone-0.219
 everything-0.215
 something-0.204
Maybe-0.203
 dad-0.202
TokenI
Token ID40
Feature activation+2.39e+00

Pos logit contributions
ocol+0.094
 rib+0.082
 microsc+0.078
ifying+0.076
hy+0.076
Neg logit contributions
 everything-0.030
 someone-0.029
 something-0.027
 dad-0.026
 where-0.026
Token did
Token ID750
Feature activation+4.66e-01

Pos logit contributions
ocol+4.969
 rib+3.997
ifying+3.889
 microsc+3.752
 former+3.649
Neg logit contributions
Maybe-2.305
 someone-2.305
 looked-2.222
 dad-2.206
 everything-2.201
Token not
Token ID407
Feature activation+2.73e-02

Pos logit contributions
ocol+1.147
 rib+1.013
 microsc+0.986
ifying+0.962
hy+0.960
Neg logit contributions
 someone-0.112
 everything-0.106
 dad-0.088
Maybe-0.087
 something-0.079
Token know
Token ID760
Feature activation+1.08e-01

Pos logit contributions
ocol+0.052
 rib+0.045
 microsc+0.043
hy+0.042
ifying+0.041
Neg logit contributions
 everything-0.020
 someone-0.020
 dad-0.019
Maybe-0.019
 they-0.019
Token,
Token ID11
Feature activation+5.57e-01

Pos logit contributions
ocol+0.230
 rib+0.197
 microsc+0.191
ifying+0.184
hy+0.183
Neg logit contributions
 someone-0.061
 everything-0.060
Maybe-0.059
 dad-0.057
 looked-0.056
Token Mama
Token ID40084
Feature activation+3.94e-01

Pos logit contributions
ocol+1.131
 rib+0.969
 microsc+0.937
ifying+0.910
hy+0.902
Neg logit contributions
 someone-0.380
 everything-0.372
 dad-0.346
 something-0.345
Maybe-0.343
Token bad
Token ID2089
Feature activation+2.56e-01

Pos logit contributions
 everything+0.314
 someone+0.308
Maybe+0.303
 dad+0.299
How+0.292
Neg logit contributions
ocol-0.843
 rib-0.712
 microsc-0.682
hy-0.680
ifying-0.664
Token and
Token ID290
Feature activation+3.81e-01

Pos logit contributions
ocol+0.529
 rib+0.458
 microsc+0.443
hy+0.430
ifying+0.428
Neg logit contributions
 someone-0.155
 everything-0.151
Maybe-0.141
 dad-0.141
 something-0.137
Token said
Token ID531
Feature activation+2.61e-01

Pos logit contributions
ocol+0.731
 rib+0.626
 microsc+0.601
hy+0.584
ifying+0.581
Neg logit contributions
 someone-0.291
 everything-0.287
 dad-0.269
 something-0.266
Maybe-0.265
Token,
Token ID11
Feature activation-1.55e-01

Pos logit contributions
ocol+0.533
 rib+0.460
 microsc+0.442
hy+0.431
ifying+0.429
Neg logit contributions
 someone-0.168
 everything-0.163
Maybe-0.157
 dad-0.152
 something-0.149
Token "
Token ID366
Feature activation-2.51e-01

Pos logit contributions
 someone+0.138
 everything+0.136
 something+0.130
Maybe+0.128
 dad+0.128
Neg logit contributions
ocol-0.265
 rib-0.231
 microsc-0.217
hy-0.214
ifying-0.210
TokenI
Token ID40
Feature activation+2.39e+00

Pos logit contributions
 everything+0.177
 someone+0.169
 something+0.160
 dad+0.160
 where+0.157
Neg logit contributions
ocol-0.493
 rib-0.433
 microsc-0.405
hy-0.402
ifying-0.401
Token know
Token ID760
Feature activation-5.63e-02

Pos logit contributions
ocol+4.967
 rib+3.989
ifying+3.891
 microsc+3.829
pill+3.650
Neg logit contributions
 someone-2.247
Maybe-2.215
 something-2.184
 one-2.175
 dad-2.166
Token,
Token ID11
Feature activation+9.38e-02

Pos logit contributions
 someone+0.035
Maybe+0.035
 everything+0.035
 dad+0.033
How+0.032
Neg logit contributions
ocol-0.115
 rib-0.099
 microsc-0.095
ifying-0.091
hy-0.091
Token mom
Token ID1995
Feature activation-2.42e-01

Pos logit contributions
ocol+0.224
 rib+0.199
 microsc+0.192
ifying+0.185
hy+0.184
Neg logit contributions
 someone-0.027
 everything-0.026
Maybe-0.023
 looked-0.022
 something-0.021
Token.
Token ID13
Feature activation+6.57e-01

Pos logit contributions
 someone+0.158
 everything+0.153
Maybe+0.147
 something+0.140
 dad+0.139
Neg logit contributions
ocol-0.482
 rib-0.418
 microsc-0.403
hy-0.397
rou-0.394
Token I
Token ID314
Feature activation+1.71e+00

Pos logit contributions
ocol+1.353
 rib+1.161
 microsc+1.123
ifying+1.094
hy+1.084
Neg logit contributions
 someone-0.433
 everything-0.421
 dad-0.395
 something-0.391
Maybe-0.385
Token thing
Token ID1517
Feature activation+1.20e+00

Pos logit contributions
 everything+0.251
 someone+0.246
Maybe+0.236
 dad+0.235
 looked+0.231
Neg logit contributions
ocol-0.651
 rib-0.560
hy-0.533
 microsc-0.532
 gol-0.517
Token.
Token ID13
Feature activation-7.84e-01

Pos logit contributions
ocol+2.781
 rib+2.397
 microsc+2.335
 former+2.311
ifying+2.289
Neg logit contributions
 someone-0.562
 everything-0.525
 dad-0.488
 one-0.479
 something-0.478
Token She
Token ID1375
Feature activation+3.84e-01

Pos logit contributions
 everything+0.732
 someone+0.705
 somewhere+0.686
 one+0.685
 looked+0.681
Neg logit contributions
ocol-1.300
 microsc-1.102
 rib-1.094
hy-1.079
ifying-1.017
Token said
Token ID531
Feature activation+8.21e-01

Pos logit contributions
ocol+0.734
 rib+0.634
 microsc+0.607
hy+0.588
ifying+0.588
Neg logit contributions
 someone-0.295
 everything-0.291
 something-0.271
 dad-0.271
 one-0.266
Token "
Token ID366
Feature activation-2.42e-01

Pos logit contributions
ocol+1.682
 rib+1.439
 microsc+1.393
ifying+1.366
 former+1.345
Neg logit contributions
 someone-0.569
 everything-0.553
 dad-0.523
 something-0.515
 one-0.500
TokenI
Token ID40
Feature activation+2.39e+00

Pos logit contributions
 everything+0.200
 someone+0.196
 where+0.191
 dad+0.190
 something+0.188
Neg logit contributions
ocol-0.441
 rib-0.378
hy-0.354
 microsc-0.354
ifying-0.349
Token'm
Token ID1101
Feature activation+4.67e-01

Pos logit contributions
ocol+4.841
 rib+3.933
ifying+3.840
 microsc+3.762
 former+3.623
Neg logit contributions
 someone-2.264
Maybe-2.258
 looked-2.251
 something-2.171
 everything-2.159
Token sorry
Token ID7926
Feature activation-6.38e-01

Pos logit contributions
ocol+0.990
 rib+0.849
 microsc+0.822
ifying+0.803
hy+0.800
Neg logit contributions
 someone-0.279
 everything-0.273
Maybe-0.255
 dad-0.252
 something-0.248
Token,
Token ID11
Feature activation+7.58e-01

Pos logit contributions
Maybe+0.506
 looked+0.480
How+0.464
 someone+0.463
 everything+0.461
Neg logit contributions
ocol-1.194
 rib-1.016
 microsc-0.979
hy-0.943
rou-0.930
Token I
Token ID314
Feature activation+1.82e+00

Pos logit contributions
ocol+1.451
 rib+1.227
 microsc+1.186
ifying+1.156
hy+1.139
Neg logit contributions
 someone-0.622
 everything-0.607
 dad-0.581
 something-0.571
 one-0.562
Token'm
Token ID1101
Feature activation+2.49e-01

Pos logit contributions
ocol+3.705
 rib+3.039
 microsc+2.900
ifying+2.848
 former+2.747
Neg logit contributions
 someone-1.601
Maybe-1.566
 looked-1.564
 everything-1.535
 something-1.515
Token.
Token ID13
Feature activation-5.34e-01

Pos logit contributions
 someone+0.433
Maybe+0.415
 dad+0.414
 everything+0.412
 looked+0.398
Neg logit contributions
ocol-1.207
 rib-1.050
 microsc-0.999
onna-0.969
rou-0.966
Token She
Token ID1375
Feature activation+8.55e-01

Pos logit contributions
 everything+0.447
 someone+0.435
 one+0.427
 something+0.423
 somewhere+0.421
Neg logit contributions
ocol-0.924
 rib-0.790
 microsc-0.790
hy-0.747
ifying-0.743
Token says
Token ID1139
Feature activation+8.70e-01

Pos logit contributions
ocol+1.557
 rib+1.291
 microsc+1.243
ifying+1.219
hy+1.197
Neg logit contributions
 someone-0.786
 everything-0.774
 dad-0.740
 something-0.735
Maybe-0.729
Token,
Token ID11
Feature activation+2.35e-01

Pos logit contributions
ocol+1.887
 rib+1.626
 microsc+1.576
ifying+1.551
 former+1.533
Neg logit contributions
 someone-0.508
 everything-0.494
 dad-0.461
 something-0.453
 one-0.435
Token "
Token ID366
Feature activation-8.01e-02

Pos logit contributions
ocol+0.431
 rib+0.368
 microsc+0.351
ifying+0.341
hy+0.340
Neg logit contributions
 someone-0.196
 everything-0.192
 something-0.182
Maybe-0.182
 dad-0.182
TokenI
Token ID40
Feature activation+2.38e+00

Pos logit contributions
 everything+0.042
 someone+0.041
 something+0.038
 dad+0.037
 where+0.037
Neg logit contributions
ocol-0.170
 rib-0.150
 microsc-0.143
ifying-0.141
hy-0.140
Token'm
Token ID1101
Feature activation+8.08e-01

Pos logit contributions
ocol+4.895
 rib+3.928
ifying+3.792
 microsc+3.681
 former+3.587
Neg logit contributions
 someone-2.387
Maybe-2.358
 everything-2.299
 something-2.286
 dad-2.280
Token sorry
Token ID7926
Feature activation-7.23e-01

Pos logit contributions
ocol+1.777
 rib+1.545
 microsc+1.494
ifying+1.459
hy+1.443
Neg logit contributions
 someone-0.448
 everything-0.432
 dad-0.402
 something-0.396
 one-0.379
Token,
Token ID11
Feature activation+5.24e-01

Pos logit contributions
Maybe+0.586
How+0.547
 looked+0.544
 someone+0.539
 everything+0.529
Neg logit contributions
ocol-1.325
 rib-1.135
 microsc-1.094
rou-1.043
hy-1.028
Token Mom
Token ID11254
Feature activation-1.21e-01

Pos logit contributions
ocol+1.254
 rib+1.106
 microsc+1.073
ifying+1.048
hy+1.041
Neg logit contributions
 someone-0.165
 everything-0.156
 something-0.132
 dad-0.132
Maybe-0.131
Token.
Token ID13
Feature activation+5.67e-01

Pos logit contributions
 someone+0.077
 everything+0.074
Maybe+0.070
 something+0.069
 dad+0.067
Neg logit contributions
ocol-0.242
 rib-0.210
 microsc-0.202
hy-0.199
rou-0.199
Token cry
Token ID3960
Feature activation-9.90e-01

Pos logit contributions
ocol+0.781
 rib+0.656
 microsc+0.633
ifying+0.606
hy+0.605
Neg logit contributions
 someone-0.425
 everything-0.415
 dad-0.397
 something-0.395
Maybe-0.395
Token and
Token ID290
Feature activation-3.76e-02

Pos logit contributions
Maybe+0.684
How+0.681
 someone+0.672
 looked+0.638
 everything+0.633
Neg logit contributions
ocol-1.803
 rib-1.621
rou-1.512
 microsc-1.512
ifying-1.509
Token said
Token ID531
Feature activation+1.14e+00

Pos logit contributions
 someone+0.027
 everything+0.027
Maybe+0.026
 something+0.025
 dad+0.025
Neg logit contributions
ocol-0.071
 rib-0.062
 microsc-0.060
hy-0.057
 former-0.057
Token,
Token ID11
Feature activation+1.90e-01

Pos logit contributions
ocol+2.536
 rib+2.177
 microsc+2.119
ifying+2.102
 former+2.079
Neg logit contributions
 someone-0.649
 everything-0.631
 something-0.582
 dad-0.575
 one-0.547
Token "
Token ID366
Feature activation+4.83e-01

Pos logit contributions
ocol+0.343
 rib+0.294
 microsc+0.281
hy+0.271
ifying+0.271
Neg logit contributions
 someone-0.159
 everything-0.157
Maybe-0.149
 something-0.149
 dad-0.148
TokenI
Token ID40
Feature activation+2.37e+00

Pos logit contributions
ocol+1.078
 rib+0.941
 microsc+0.909
ifying+0.892
hy+0.883
Neg logit contributions
 someone-0.229
 everything-0.223
 dad-0.204
 something-0.200
 one-0.192
Token'm
Token ID1101
Feature activation+6.27e-01

Pos logit contributions
ocol+4.786
 rib+3.896
ifying+3.731
 microsc+3.693
hy+3.614
Neg logit contributions
 looked-2.224
 someone-2.203
Maybe-2.195
 turned-2.126
 one-2.123
Token sorry
Token ID7926
Feature activation-4.92e-01

Pos logit contributions
ocol+1.353
 rib+1.170
 microsc+1.133
ifying+1.106
hy+1.097
Neg logit contributions
 someone-0.358
 everything-0.347
 dad-0.321
 something-0.317
Maybe-0.312
Token,
Token ID11
Feature activation+3.86e-01

Pos logit contributions
Maybe+0.360
 someone+0.345
 looked+0.340
How+0.330
 everything+0.330
Neg logit contributions
ocol-0.930
 rib-0.815
 microsc-0.776
rou-0.750
hy-0.736
Token Mom
Token ID11254
Feature activation-2.73e-03

Pos logit contributions
ocol+0.917
 rib+0.811
 microsc+0.785
ifying+0.766
hy+0.761
Neg logit contributions
 someone-0.123
 everything-0.117
Maybe-0.102
 something-0.100
 dad-0.097
Tokenmy
Token ID1820
Feature activation-1.45e-02

Pos logit contributions
 someone+0.002
 everything+0.002
Maybe+0.001
 something+0.001
 dad+0.001
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
rou-0.005
hy-0.005
Token alone
Token ID3436
Feature activation-4.07e-01

Pos logit contributions
 someone+0.251
 everything+0.246
Maybe+0.245
 dad+0.242
 looked+0.232
Neg logit contributions
ocol-0.564
 rib-0.484
 microsc-0.459
ifying-0.455
hy-0.445
Token."
Token ID526
Feature activation-4.49e-01

Pos logit contributions
 everything+0.317
 someone+0.315
Maybe+0.305
 dad+0.303
 looked+0.298
Neg logit contributions
ocol-0.745
 rib-0.639
 microsc-0.609
ifying-0.598
hy-0.587
Token
Token ID198
Feature activation-2.78e-01

Pos logit contributions
 everything+0.301
 someone+0.288
 looked+0.286
Maybe+0.271
 one+0.268
Neg logit contributions
ocol-0.867
 rib-0.743
 microsc-0.727
onna-0.683
ifying-0.683
Token
Token ID198
Feature activation+4.80e-02

Pos logit contributions
 someone+0.178
 everything+0.177
 one+0.168
 looked+0.168
 something+0.165
Neg logit contributions
ocol-0.549
 rib-0.472
 microsc-0.461
rou-0.441
hy-0.439
Token"
Token ID1
Feature activation+7.71e-02

Pos logit contributions
ocol+0.089
 rib+0.075
 microsc+0.073
ifying+0.070
rou+0.069
Neg logit contributions
 someone-0.041
 everything-0.040
 looked-0.038
 something-0.037
 one-0.037
TokenI
Token ID40
Feature activation+2.36e+00

Pos logit contributions
ocol+0.146
 rib+0.126
 microsc+0.119
ifying+0.117
hy+0.114
Neg logit contributions
 everything-0.062
 someone-0.061
 something-0.057
 dad-0.057
 one-0.056
Token'm
Token ID1101
Feature activation+8.15e-01

Pos logit contributions
ocol+5.101
 rib+4.156
ifying+4.002
 microsc+3.976
pill+3.825
Neg logit contributions
Maybe-2.096
 someone-2.056
 everything-1.996
 dad-1.977
 looked-1.962
Token sorry
Token ID7926
Feature activation-3.76e-01

Pos logit contributions
ocol+1.774
 rib+1.539
 microsc+1.490
ifying+1.455
hy+1.436
Neg logit contributions
 someone-0.466
 everything-0.448
 dad-0.419
 something-0.415
 one-0.398
Token,
Token ID11
Feature activation+2.83e-01

Pos logit contributions
Maybe+0.280
 someone+0.277
 everything+0.274
How+0.270
 looked+0.261
Neg logit contributions
ocol-0.713
 rib-0.613
 microsc-0.583
hy-0.562
onna-0.554
Token Mom
Token ID11254
Feature activation-5.93e-02

Pos logit contributions
ocol+0.670
 rib+0.592
 microsc+0.573
ifying+0.557
hy+0.552
Neg logit contributions
 someone-0.092
 everything-0.088
Maybe-0.078
 something-0.074
 looked-0.071
Tokenmy
Token ID1820
Feature activation+1.78e-01

Pos logit contributions
 someone+0.037
 everything+0.035
Maybe+0.033
 something+0.033
 dad+0.032
Neg logit contributions
ocol-0.121
 rib-0.105
 microsc-0.101
rou-0.099
hy-0.099
Token felt
Token ID2936
Feature activation-3.52e-01

Pos logit contributions
ocol+0.145
 rib+0.125
 microsc+0.119
hy+0.116
ifying+0.115
Neg logit contributions
 someone-0.065
 everything-0.064
 something-0.060
 dad-0.060
Maybe-0.059
Token bad
Token ID2089
Feature activation-1.62e-01

Pos logit contributions
 everything+0.230
 dad+0.229
 someone+0.229
Maybe+0.221
 one+0.210
Neg logit contributions
ocol-0.687
 rib-0.581
hy-0.564
 microsc-0.562
ifying-0.556
Token and
Token ID290
Feature activation+5.43e-01

Pos logit contributions
 someone+0.117
 everything+0.113
 dad+0.110
Maybe+0.110
How+0.105
Neg logit contributions
ocol-0.306
 rib-0.267
 microsc-0.256
hy-0.251
ifying-0.246
Token said
Token ID531
Feature activation+4.63e-01

Pos logit contributions
ocol+1.072
 rib+0.916
 microsc+0.884
ifying+0.858
hy+0.854
Neg logit contributions
 someone-0.397
 everything-0.389
 dad-0.366
 something-0.363
Maybe-0.360
Token,"
Token ID553
Feature activation+3.10e-02

Pos logit contributions
ocol+0.969
 rib+0.834
 microsc+0.807
ifying+0.784
hy+0.782
Neg logit contributions
 someone-0.282
 everything-0.274
 dad-0.257
Maybe-0.256
 something-0.251
TokenI
Token ID40
Feature activation+2.36e+00

Pos logit contributions
ocol+0.056
 rib+0.048
 microsc+0.045
ifying+0.044
hy+0.043
Neg logit contributions
 someone-0.027
 everything-0.026
 dad-0.026
 something-0.025
 one-0.024
Token'm
Token ID1101
Feature activation+2.54e-01

Pos logit contributions
ocol+4.968
 rib+4.101
 microsc+4.065
ifying+3.985
hy+3.703
Neg logit contributions
 looked-1.935
Maybe-1.913
 someone-1.891
 one-1.812
 turned-1.802
Token sorry
Token ID7926
Feature activation-4.07e-01

Pos logit contributions
ocol+0.527
 rib+0.450
 microsc+0.434
hy+0.426
ifying+0.425
Neg logit contributions
 someone-0.155
 everything-0.155
Maybe-0.148
 dad-0.143
 something-0.139
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
 someone+0.373
 everything+0.368
Maybe+0.365
 looked+0.362
 dad+0.353
Neg logit contributions
ocol-0.689
 rib-0.584
 microsc-0.547
rou-0.530
ifying-0.528
Token let
Token ID1309
Feature activation+1.09e-01

Pos logit contributions
 someone+0.069
 everything+0.068
 dad+0.064
Maybe+0.064
 something+0.063
Neg logit contributions
ocol-0.197
 rib-0.173
 microsc-0.162
hy-0.158
ifying-0.157
Token me
Token ID502
Feature activation+3.30e-02

Pos logit contributions
ocol+0.214
 rib+0.183
 microsc+0.175
ifying+0.170
hy+0.167
Neg logit contributions
 someone-0.075
 everything-0.072
 dad-0.071
Maybe-0.070
 something-0.069
Token computer
Token ID3644
Feature activation-7.91e-01

Pos logit contributions
ocol+1.356
 rib+1.122
 microsc+1.074
ifying+1.049
hy+1.036
Neg logit contributions
 someone-0.747
 everything-0.735
 dad-0.707
 something-0.697
 one-0.689
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
 someone+0.565
 dad+0.550
 everything+0.545
Maybe+0.534
How+0.524
Neg logit contributions
ocol-1.447
 rib-1.284
hy-1.226
 microsc-1.226
ifying-1.188
Token She
Token ID1375
Feature activation+6.14e-01

Pos logit contributions
 everything+0.446
 someone+0.437
 looked+0.423
 one+0.419
 somewhere+0.414
Neg logit contributions
ocol-0.920
 microsc-0.783
 rib-0.776
ifying-0.745
hy-0.736
Token said
Token ID531
Feature activation+7.11e-01

Pos logit contributions
ocol+1.166
 rib+0.988
 microsc+0.952
ifying+0.925
hy+0.917
Neg logit contributions
 someone-0.500
 everything-0.490
 dad-0.464
 something-0.461
Maybe-0.455
Token "
Token ID366
Feature activation+1.24e-01

Pos logit contributions
ocol+1.449
 rib+1.240
 microsc+1.198
ifying+1.172
hy+1.158
Neg logit contributions
 someone-0.490
 everything-0.478
 dad-0.450
 something-0.444
Maybe-0.433
TokenI
Token ID40
Feature activation+2.35e+00

Pos logit contributions
ocol+0.241
 rib+0.207
 microsc+0.197
ifying+0.195
hy+0.192
Neg logit contributions
 everything-0.092
 someone-0.091
 dad-0.086
 something-0.085
 where-0.084
Token'm
Token ID1101
Feature activation+3.35e-01

Pos logit contributions
ocol+4.838
 rib+3.889
ifying+3.793
 microsc+3.725
 former+3.559
Neg logit contributions
 someone-2.193
Maybe-2.165
 everything-2.097
 something-2.094
 one-2.086
Token sorry
Token ID7926
Feature activation-8.34e-01

Pos logit contributions
ocol+0.701
 rib+0.599
 microsc+0.580
ifying+0.565
hy+0.563
Neg logit contributions
 someone-0.205
 everything-0.202
Maybe-0.194
 dad-0.184
 something-0.183
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
Maybe+0.684
 looked+0.649
How+0.629
 someone+0.621
 everything+0.609
Neg logit contributions
ocol-1.508
 rib-1.297
 microsc-1.252
rou-1.185
hy-1.173
Token dad
Token ID9955
Feature activation-7.23e-02

Pos logit contributions
ocol+0.368
 rib+0.323
 microsc+0.312
ifying+0.305
hy+0.301
Neg logit contributions
 someone-0.066
 everything-0.063
Maybe-0.059
 something-0.055
 looked-0.055
Token.
Token ID13
Feature activation+7.74e-01

Pos logit contributions
 someone+0.045
 everything+0.044
Maybe+0.042
 something+0.040
 dad+0.040
Neg logit contributions
ocol-0.147
 rib-0.127
 microsc-0.122
hy-0.121
ifying-0.119
Tokenaches
Token ID3694
Feature activation+5.71e-01

Pos logit contributions
ocol+0.707
 rib+0.605
 microsc+0.581
hy+0.566
ifying+0.563
Neg logit contributions
 someone-0.252
 everything-0.246
 dad-0.237
Maybe-0.235
 something-0.230
Token."
Token ID526
Feature activation-2.73e-01

Pos logit contributions
ocol+1.186
 rib+1.021
 microsc+0.987
ifying+0.964
hy+0.955
Neg logit contributions
 someone-0.356
 everything-0.346
 dad-0.324
 something-0.319
Maybe-0.316
Token
Token ID198
Feature activation-1.45e-01

Pos logit contributions
 everything+0.204
 someone+0.198
Maybe+0.191
 looked+0.190
 something+0.188
Neg logit contributions
ocol-0.512
 rib-0.433
 microsc-0.427
hy-0.415
onna-0.406
Token
Token ID198
Feature activation-3.58e-02

Pos logit contributions
 someone+0.090
 everything+0.090
 one+0.086
 something+0.085
 looked+0.084
Neg logit contributions
ocol-0.289
 rib-0.248
 microsc-0.242
hy-0.236
rou-0.232
Token"
Token ID1
Feature activation-2.68e-02

Pos logit contributions
 someone+0.037
 everything+0.037
 dad+0.035
 something+0.035
 one+0.035
Neg logit contributions
ocol-0.059
 rib-0.048
 microsc-0.047
hy-0.045
ifying-0.045
TokenI
Token ID40
Feature activation+2.34e+00

Pos logit contributions
 everything+0.022
 someone+0.022
 dad+0.020
 something+0.020
 looked+0.020
Neg logit contributions
ocol-0.050
 rib-0.043
 microsc-0.040
ifying-0.039
hy-0.039
Token'm
Token ID1101
Feature activation+6.42e-01

Pos logit contributions
ocol+4.841
 rib+3.923
ifying+3.775
 microsc+3.772
 former+3.548
Neg logit contributions
 someone-2.231
Maybe-2.220
 everything-2.152
 looked-2.130
 they-2.116
Token sorry
Token ID7926
Feature activation-1.57e-01

Pos logit contributions
ocol+1.382
 rib+1.194
 microsc+1.156
ifying+1.128
hy+1.119
Neg logit contributions
 someone-0.371
 everything-0.360
 dad-0.334
 something-0.330
Maybe-0.323
Token,
Token ID11
Feature activation+8.63e-01

Pos logit contributions
Maybe+0.110
 someone+0.110
 everything+0.107
How+0.104
 looked+0.104
Neg logit contributions
ocol-0.308
 rib-0.262
 microsc-0.253
hy-0.245
rou-0.240
Token Tom
Token ID4186
Feature activation-1.81e-01

Pos logit contributions
ocol+1.620
 rib+1.361
 microsc+1.316
ifying+1.288
hy+1.262
Neg logit contributions
 someone-0.754
 everything-0.735
 dad-0.710
 something-0.693
 one-0.684
Token,"
Token ID553
Feature activation+3.23e-01

Pos logit contributions
 someone+0.159
 everything+0.158
Maybe+0.150
 something+0.149
 dad+0.147
Neg logit contributions
ocol-0.316
 rib-0.267
hy-0.253
 microsc-0.252
rou-0.245
Token.
Token ID13
Feature activation+9.48e-01

Pos logit contributions
ocol+1.827
 rib+1.576
 microsc+1.533
ifying+1.504
 former+1.476
Neg logit contributions
 someone-0.483
 everything-0.471
 dad-0.439
 something-0.428
 one-0.425
Token I
Token ID314
Feature activation+1.98e+00

Pos logit contributions
ocol+1.999
 rib+1.722
 microsc+1.663
ifying+1.622
 former+1.608
Neg logit contributions
 someone-0.601
 everything-0.571
 dad-0.543
 something-0.532
 one-0.521
Token was
Token ID373
Feature activation+1.03e+00

Pos logit contributions
ocol+4.347
 rib+3.607
ifying+3.438
 microsc+3.435
hy+3.338
Neg logit contributions
Maybe-1.461
 looked-1.459
 someone-1.448
 everything-1.418
 something-1.374
Token mad
Token ID8805
Feature activation-4.07e-01

Pos logit contributions
ocol+2.192
 rib+1.889
 microsc+1.824
ifying+1.799
hy+1.765
Neg logit contributions
 someone-0.680
 everything-0.652
 something-0.621
 dad-0.606
 one-0.602
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
 someone+0.397
 everything+0.397
Maybe+0.390
 looked+0.385
How+0.377
Neg logit contributions
ocol-0.665
 rib-0.552
 microsc-0.522
rou-0.514
hy-0.507
Token I
Token ID314
Feature activation+2.34e+00

Pos logit contributions
ocol+2.488
 rib+2.149
 microsc+2.073
ifying+2.035
 former+2.028
Neg logit contributions
 someone-0.713
 everything-0.656
 dad-0.635
 something-0.617
 one-0.603
Token wanted
Token ID2227
Feature activation-3.02e-01

Pos logit contributions
ocol+5.266
 rib+4.333
ifying+4.183
 microsc+4.154
hy+3.987
Neg logit contributions
 looked-1.756
Maybe-1.734
 someone-1.664
 turned-1.626
 everything-1.621
Token you
Token ID345
Feature activation+6.94e-01

Pos logit contributions
Maybe+0.356
 everything+0.349
 someone+0.347
 looked+0.344
They+0.339
Neg logit contributions
ocol-0.436
 rib-0.366
 microsc-0.344
hy-0.319
ifying-0.312
Token to
Token ID284
Feature activation+7.92e-01

Pos logit contributions
ocol+1.137
 rib+0.931
 microsc+0.892
ifying+0.860
hy+0.850
Neg logit contributions
 someone-0.755
 everything-0.744
 dad-0.715
 something-0.713
Maybe-0.703
Token play
Token ID711
Feature activation+1.35e-01

Pos logit contributions
ocol+1.653
 rib+1.413
 microsc+1.373
ifying+1.343
hy+1.317
Neg logit contributions
 someone-0.520
 everything-0.505
 dad-0.472
 something-0.467
 one-0.457
Token with
Token ID351
Feature activation-1.46e-01

Pos logit contributions
ocol+0.283
 rib+0.244
 microsc+0.233
hy+0.227
ifying+0.226
Neg logit contributions
 someone-0.077
 everything-0.077
 dad-0.074
Maybe-0.071
 looked-0.070
TokenThe
Token ID464
Feature activation-5.65e-01

Pos logit contributions
ocol+0.318
 rib+0.269
 microsc+0.256
ifying+0.248
hy+0.246
Neg logit contributions
 someone-0.175
 everything-0.173
 dad-0.167
 something-0.164
 looked-0.162
Token little
Token ID1310
Feature activation-2.83e-01

Pos logit contributions
 everything+0.508
 someone+0.498
Maybe+0.475
They+0.471
 dad+0.465
Neg logit contributions
ocol-0.978
 rib-0.877
 microsc-0.764
hy-0.763
 gol-0.747
Token girl
Token ID2576
Feature activation+2.69e-01

Pos logit contributions
 everything+0.187
 someone+0.186
 dad+0.174
They+0.171
Maybe+0.170
Neg logit contributions
ocol-0.565
 rib-0.507
 microsc-0.456
hy-0.455
 gol-0.446
Token was
Token ID373
Feature activation-3.22e-01

Pos logit contributions
ocol+0.488
 rib+0.420
 microsc+0.398
hy+0.386
ifying+0.385
Neg logit contributions
 someone-0.232
 everything-0.227
 dad-0.214
 something-0.214
Maybe-0.211
Token su
Token ID424
Feature activation+3.07e-01

Pos logit contributions
 someone+0.219
 everything+0.213
Maybe+0.209
 dad+0.199
How+0.193
Neg logit contributions
ocol-0.622
 rib-0.537
hy-0.516
 microsc-0.512
rou-0.511
Tokenpr
Token ID1050
Feature activation+2.34e+00

Pos logit contributions
ocol+0.432
 rib+0.340
 microsc+0.320
ifying+0.311
hy+0.304
Neg logit contributions
 someone-0.407
 everything-0.399
 dad-0.391
 something-0.385
Maybe-0.381
Tokenised
Token ID1417
Feature activation+6.25e-01

Pos logit contributions
ocol+4.013
 rib+3.493
 microsc+3.405
hy+3.113
pri+3.069
Neg logit contributions
They-2.206
 one-2.191
 someone-2.182
 everything-2.177
 looked-2.173
Token.
Token ID13
Feature activation-3.19e-01

Pos logit contributions
ocol+1.178
 rib+0.996
 microsc+0.959
ifying+0.933
hy+0.924
Neg logit contributions
 someone-0.519
 everything-0.508
 dad-0.482
 something-0.478
Maybe-0.474
Token She
Token ID1375
Feature activation+2.38e-01

Pos logit contributions
 everything+0.283
 someone+0.281
 something+0.267
 dad+0.265
Maybe+0.263
Neg logit contributions
ocol-0.559
 rib-0.482
 microsc-0.457
ifying-0.448
hy-0.446
Token had
Token ID550
Feature activation+5.59e-01

Pos logit contributions
ocol+0.444
 rib+0.385
 microsc+0.364
ifying+0.353
hy+0.352
Neg logit contributions
 someone-0.193
 everything-0.189
 dad-0.178
 something-0.178
Maybe-0.173
Token never
Token ID1239
Feature activation+4.83e-01

Pos logit contributions
ocol+1.166
 rib+1.008
 microsc+0.973
ifying+0.948
hy+0.941
Neg logit contributions
 someone-0.346
 everything-0.336
 dad-0.315
 something-0.309
Maybe-0.307
Token to
Token ID284
Feature activation+4.59e-01

Pos logit contributions
ocol+1.108
 rib+0.924
 microsc+0.887
ifying+0.860
hy+0.851
Neg logit contributions
 someone-0.607
 everything-0.596
 dad-0.570
 something-0.566
Maybe-0.561
Token share
Token ID2648
Feature activation+5.41e-01

Pos logit contributions
ocol+0.895
 rib+0.764
 microsc+0.736
hy+0.711
ifying+0.710
Neg logit contributions
 someone-0.346
 everything-0.338
 dad-0.321
 something-0.317
Maybe-0.317
Token and
Token ID290
Feature activation+8.45e-01

Pos logit contributions
ocol+1.139
 rib+0.984
 microsc+0.951
ifying+0.925
hy+0.917
Neg logit contributions
 someone-0.325
 everything-0.315
 dad-0.294
 something-0.289
Maybe-0.288
Token wait
Token ID4043
Feature activation+6.18e-01

Pos logit contributions
ocol+1.738
 rib+1.477
 microsc+1.433
ifying+1.410
hy+1.387
Neg logit contributions
 someone-0.577
 everything-0.561
 dad-0.526
 something-0.523
 one-0.510
Token.
Token ID13
Feature activation+5.69e-01

Pos logit contributions
ocol+1.095
 rib+0.915
 microsc+0.879
ifying+0.852
hy+0.843
Neg logit contributions
 someone-0.579
 everything-0.569
 dad-0.543
 something-0.539
Maybe-0.535
Token How
Token ID1374
Feature activation+2.33e+00

Pos logit contributions
ocol+0.937
 rib+0.771
 microsc+0.739
ifying+0.715
hy+0.706
Neg logit contributions
 someone-0.602
 everything-0.594
 dad-0.570
 something-0.566
Maybe-0.561
Token about
Token ID546
Feature activation+1.78e-01

Pos logit contributions
ocol+4.391
 rib+3.447
ifying+3.307
 former+3.239
hy+3.190
Neg logit contributions
 everything-2.677
 someone-2.653
 something-2.554
 they-2.478
 dad-2.461
Token we
Token ID356
Feature activation+5.28e-01

Pos logit contributions
ocol+0.369
 rib+0.320
 microsc+0.310
ifying+0.297
 former+0.294
Neg logit contributions
 someone-0.108
 everything-0.107
Maybe-0.104
 dad-0.099
 looked-0.098
Token clean
Token ID3424
Feature activation+3.70e-01

Pos logit contributions
ocol+0.919
 rib+0.768
 microsc+0.737
ifying+0.714
hy+0.706
Neg logit contributions
 someone-0.510
 everything-0.501
 dad-0.478
 something-0.476
Maybe-0.474
Token up
Token ID510
Feature activation+5.21e-01

Pos logit contributions
ocol+0.780
 rib+0.676
 microsc+0.653
ifying+0.633
 former+0.623
Neg logit contributions
 someone-0.222
 everything-0.212
Maybe-0.202
 dad-0.199
 looked-0.193
Token this
Token ID428
Feature activation-2.72e-01

Pos logit contributions
ocol+1.048
 rib+0.898
 microsc+0.866
ifying+0.843
hy+0.833
Neg logit contributions
 someone-0.359
 everything-0.351
 dad-0.329
Maybe-0.326
 something-0.324
Token Lily
Token ID20037
Feature activation-1.75e-01

Pos logit contributions
ocol+0.078
 rib+0.066
 microsc+0.064
hy+0.061
ifying+0.060
Neg logit contributions
 someone-0.039
 everything-0.038
Maybe-0.037
 something-0.036
 somewhere-0.035
Token.
Token ID13
Feature activation+2.82e-01

Pos logit contributions
 someone+0.112
 everything+0.105
Maybe+0.101
 something+0.100
How+0.100
Neg logit contributions
ocol-0.348
 rib-0.307
 microsc-0.294
hy-0.292
ifying-0.284
Token They
Token ID1119
Feature activation+1.15e-01

Pos logit contributions
ocol+0.572
 rib+0.497
 microsc+0.480
ifying+0.469
hy+0.460
Neg logit contributions
 someone-0.184
 everything-0.178
 dad-0.169
 something-0.166
Maybe-0.164
Token were
Token ID547
Feature activation+2.82e-01

Pos logit contributions
ocol+0.199
 rib+0.170
 microsc+0.162
ifying+0.158
hy+0.157
Neg logit contributions
 someone-0.110
 everything-0.105
 something-0.102
 dad-0.101
Maybe-0.100
Token very
Token ID845
Feature activation+1.00e+00

Pos logit contributions
ocol+0.605
 rib+0.525
 microsc+0.508
ifying+0.495
hy+0.495
Neg logit contributions
 someone-0.155
 everything-0.147
Maybe-0.142
 something-0.137
 dad-0.134
Token close
Token ID1969
Feature activation+2.32e+00

Pos logit contributions
ocol+2.115
 rib+1.830
 microsc+1.765
ifying+1.737
hy+1.702
Neg logit contributions
 someone-0.615
 everything-0.602
 dad-0.586
 one-0.545
 something-0.543
Token,
Token ID11
Feature activation-3.99e-01

Pos logit contributions
ocol+4.612
 rib+3.666
 microsc+3.599
ifying+3.514
 former+3.465
Neg logit contributions
 everything-2.150
 dad-2.113
 one-2.096
 someone-2.016
 they-1.986
Token but
Token ID475
Feature activation+2.79e-01

Pos logit contributions
 someone+0.330
 everything+0.320
Maybe+0.313
How+0.305
 something+0.304
Neg logit contributions
ocol-0.691
 rib-0.626
hy-0.589
 microsc-0.587
ifying-0.563
Token they
Token ID484
Feature activation+1.03e-01

Pos logit contributions
ocol+0.655
 rib+0.582
 microsc+0.563
ifying+0.549
hy+0.544
Neg logit contributions
 someone-0.088
 everything-0.082
 dad-0.073
Maybe-0.070
 something-0.069
Token were
Token ID547
Feature activation+3.91e-02

Pos logit contributions
ocol+0.171
 rib+0.145
 microsc+0.139
ifying+0.134
hy+0.133
Neg logit contributions
 someone-0.102
 everything-0.098
 something-0.095
 dad-0.094
Maybe-0.093
Token also
Token ID635
Feature activation+1.26e-02

Pos logit contributions
ocol+0.083
 rib+0.073
 microsc+0.070
ifying+0.068
hy+0.068
Neg logit contributions
 someone-0.021
Maybe-0.020
 everything-0.020
 something-0.018
 dad-0.018
Token down
Token ID866
Feature activation+2.77e-01

Pos logit contributions
ocol+0.224
 rib+0.195
 microsc+0.186
ifying+0.183
hy+0.178
Neg logit contributions
 someone-0.070
 everything-0.067
Maybe-0.065
 dad-0.063
 something-0.061
Token and
Token ID290
Feature activation-8.12e-02

Pos logit contributions
ocol+0.545
 rib+0.468
 microsc+0.449
ifying+0.440
hy+0.432
Neg logit contributions
 someone-0.198
 everything-0.193
 dad-0.183
Maybe-0.182
 something-0.179
Token the
Token ID262
Feature activation-4.57e-01

Pos logit contributions
 someone+0.062
 everything+0.059
 dad+0.057
Maybe+0.057
 something+0.056
Neg logit contributions
ocol-0.153
 rib-0.135
 microsc-0.127
ifying-0.123
 former-0.123
Token butterfly
Token ID35113
Feature activation-2.46e-01

Pos logit contributions
 everything+0.437
 someone+0.434
Maybe+0.416
They+0.415
 dad+0.415
Neg logit contributions
ocol-0.762
 rib-0.684
 microsc-0.617
hy-0.602
 former-0.597
Token slowly
Token ID6364
Feature activation-2.33e-01

Pos logit contributions
 someone+0.196
 everything+0.191
 dad+0.187
 something+0.180
Maybe+0.178
Neg logit contributions
ocol-0.447
 rib-0.402
 microsc-0.368
hy-0.364
ifying-0.360
Token unw
Token ID7379
Feature activation+2.32e+00

Pos logit contributions
 someone+0.177
 everything+0.171
 dad+0.166
Maybe+0.163
 something+0.161
Neg logit contributions
ocol-0.444
 rib-0.395
hy-0.361
 microsc-0.359
ifying-0.356
Tokenound
Token ID633
Feature activation+4.54e-01

Pos logit contributions
ocol+2.918
 microsc+2.266
 rib+2.164
hy+2.095
pill+2.071
Neg logit contributions
 someone-3.369
 everything-3.311
 dad-3.304
Mom-3.299
 next-3.298
Token the
Token ID262
Feature activation+4.99e-01

Pos logit contributions
ocol+0.942
 rib+0.814
 microsc+0.783
ifying+0.764
hy+0.757
Neg logit contributions
 someone-0.283
 everything-0.276
 dad-0.260
Maybe-0.256
 something-0.253
Token thread
Token ID4704
Feature activation+5.63e-01

Pos logit contributions
ocol+0.870
 rib+0.732
 microsc+0.697
ifying+0.677
hy+0.670
Neg logit contributions
 someone-0.479
 everything-0.472
 dad-0.451
 something-0.447
Maybe-0.445
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
ocol+1.191
 rib+1.030
 microsc+0.995
ifying+0.973
hy+0.965
Neg logit contributions
 someone-0.330
 everything-0.321
 dad-0.298
 something-0.294
Maybe-0.291
Token
Token ID220
Feature activation-1.93e-02

Pos logit contributions
 someone+0.294
 everything+0.285
 dad+0.279
 something+0.279
Maybe+0.277
Neg logit contributions
ocol-0.432
 rib-0.363
 microsc-0.351
ifying-0.336
hy-0.331
Token clue
Token ID18437
Feature activation-3.00e-01

Pos logit contributions
Maybe+1.824
They+1.812
 everything+1.809
 someone+1.734
 something+1.703
Neg logit contributions
ocol-3.131
 rib-2.882
 gol-2.628
hy-2.616
 topp-2.600
Token.
Token ID13
Feature activation-3.23e-02

Pos logit contributions
 everything+0.187
 someone+0.184
Maybe+0.176
How+0.170
 dad+0.168
Neg logit contributions
ocol-0.598
 rib-0.528
 microsc-0.505
hy-0.490
ifying-0.488
Token
Token ID198
Feature activation-5.96e-01

Pos logit contributions
 everything+0.022
 someone+0.022
 dad+0.020
 something+0.020
 looked+0.020
Neg logit contributions
ocol-0.063
 rib-0.055
 microsc-0.053
ifying-0.052
hy-0.052
Token
Token ID198
Feature activation-3.80e-01

Pos logit contributions
 someone+0.411
 everything+0.397
 looked+0.397
 one+0.395
 something+0.387
Neg logit contributions
ocol-1.111
 rib-0.957
 microsc-0.940
onna-0.902
hy-0.893
TokenThe
Token ID464
Feature activation-9.94e-01

Pos logit contributions
 someone+0.209
 everything+0.204
 dad+0.201
 looked+0.197
 one+0.191
Neg logit contributions
ocol-0.779
 rib-0.689
 microsc-0.662
ifying-0.654
hy-0.642
Token next
Token ID1306
Feature activation-2.61e+00

Pos logit contributions
 everything+0.914
 someone+0.893
Maybe+0.880
They+0.863
 looked+0.847
Neg logit contributions
ocol-1.679
 rib-1.508
 microsc-1.439
hy-1.357
ifying-1.317
Token clue
Token ID18437
Feature activation-1.48e-01

Pos logit contributions
Maybe+2.716
 someone+2.576
 everything+2.559
 what+2.526
They+2.504
Neg logit contributions
ocol-3.674
 gol-3.418
hy-3.376
 rib-3.335
pri-3.163
Token said
Token ID531
Feature activation+2.72e-01

Pos logit contributions
 everything+0.110
 someone+0.108
Maybe+0.104
How+0.103
 dad+0.102
Neg logit contributions
ocol-0.278
 rib-0.242
 microsc-0.230
hy-0.225
 topp-0.217
Token:
Token ID25
Feature activation-2.61e-01

Pos logit contributions
ocol+0.512
 rib+0.434
 microsc+0.417
ifying+0.405
hy+0.404
Neg logit contributions
 someone-0.217
 everything-0.217
Maybe-0.207
 dad-0.204
 looked-0.199
Token "
Token ID366
Feature activation-1.80e-01

Pos logit contributions
 someone+0.226
 everything+0.226
Maybe+0.215
 dad+0.213
 looked+0.211
Neg logit contributions
ocol-0.458
 rib-0.393
 microsc-0.378
ifying-0.365
hy-0.360
TokenLook
Token ID8567
Feature activation-6.06e-01

Pos logit contributions
 everything+0.173
 someone+0.169
 dad+0.166
 something+0.162
 where+0.159
Neg logit contributions
ocol-0.311
 rib-0.268
 microsc-0.248
ifying-0.245
hy-0.244
Token wrong
Token ID2642
Feature activation-4.33e-01

Pos logit contributions
ocol+2.123
 rib+1.776
 microsc+1.698
ifying+1.691
 former+1.662
Neg logit contributions
 someone-0.917
 everything-0.903
 something-0.861
 dad-0.829
 one-0.824
Token.
Token ID13
Feature activation-4.46e-01

Pos logit contributions
 everything+0.276
 someone+0.275
Maybe+0.266
 dad+0.260
How+0.257
Neg logit contributions
ocol-0.829
 rib-0.727
 microsc-0.705
hy-0.678
ifying-0.676
Token
Token ID198
Feature activation-1.24e-01

Pos logit contributions
 someone+0.317
 everything+0.316
 something+0.304
 dad+0.297
 one+0.297
Neg logit contributions
ocol-0.828
 microsc-0.716
 rib-0.707
 former-0.671
hy-0.669
Token
Token ID198
Feature activation-4.43e-01

Pos logit contributions
 someone+0.078
 everything+0.076
 one+0.073
 something+0.073
 dad+0.072
Neg logit contributions
ocol-0.247
 rib-0.211
 microsc-0.210
hy-0.200
rou-0.197
TokenThe
Token ID464
Feature activation-8.95e-01

Pos logit contributions
 someone+0.412
 dad+0.406
 everything+0.404
 one+0.394
 looked+0.391
Neg logit contributions
ocol-0.749
 rib-0.631
 microsc-0.623
ifying-0.596
 former-0.589
Token next
Token ID1306
Feature activation-2.59e+00

Pos logit contributions
 everything+0.648
 someone+0.635
They+0.623
 something+0.621
Maybe+0.605
Neg logit contributions
ocol-1.675
 rib-1.512
 microsc-1.400
hy-1.392
 former-1.364
Token day
Token ID1110
Feature activation+5.51e-01

Pos logit contributions
 what+2.707
 everything+2.685
Maybe+2.654
They+2.638
 something+2.603
Neg logit contributions
ocol-3.731
 gol-3.714
hy-3.374
 rib-3.144
pri-3.123
Token,
Token ID11
Feature activation-3.33e-01

Pos logit contributions
ocol+1.161
 rib+1.001
 microsc+0.968
ifying+0.944
hy+0.936
Neg logit contributions
 someone-0.334
 everything-0.325
 dad-0.304
 something-0.299
Maybe-0.298
Token the
Token ID262
Feature activation-6.81e-01

Pos logit contributions
Maybe+0.216
 someone+0.211
They+0.207
 everything+0.203
 dad+0.201
Neg logit contributions
ocol-0.647
 rib-0.576
 microsc-0.545
ifying-0.519
 former-0.518
Token store
Token ID3650
Feature activation-2.30e-01

Pos logit contributions
 everything+0.637
 someone+0.634
They+0.625
 something+0.613
Maybe+0.611
Neg logit contributions
ocol-1.125
 rib-1.012
 microsc-0.915
hy-0.905
 former-0.883
Token owner
Token ID4870
Feature activation-5.21e-01

Pos logit contributions
 everything+0.171
 someone+0.168
 something+0.160
How+0.159
Maybe+0.157
Neg logit contributions
ocol-0.418
 rib-0.371
hy-0.356
 microsc-0.350
 gol-0.339
Token lonely
Token ID21757
Feature activation-5.38e-01

Pos logit contributions
Maybe+0.081
 someone+0.079
 everything+0.078
 looked+0.074
 dad+0.072
Neg logit contributions
ocol-0.260
 rib-0.220
 microsc-0.212
ifying-0.208
 former-0.206
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
 everything+0.366
 someone+0.365
Maybe+0.361
Mom+0.343
How+0.342
Neg logit contributions
ocol-1.017
 rib-0.880
 microsc-0.845
ifying-0.833
hy-0.823
Token
Token ID198
Feature activation-3.11e-01

Pos logit contributions
ocol+0.408
 rib+0.351
 microsc+0.344
ifying+0.337
hy+0.333
Neg logit contributions
 everything-0.114
 someone-0.114
 something-0.104
 dad-0.104
 one-0.102
Token
Token ID198
Feature activation+7.87e-02

Pos logit contributions
 someone+0.203
 everything+0.202
 looked+0.191
 something+0.190
 dad+0.189
Neg logit contributions
ocol-0.601
 rib-0.514
 microsc-0.504
hy-0.489
ifying-0.480
TokenThe
Token ID464
Feature activation-5.78e-01

Pos logit contributions
ocol+0.160
 rib+0.138
 microsc+0.133
ifying+0.132
hy+0.131
Neg logit contributions
 someone-0.050
 everything-0.049
 dad-0.046
 something-0.045
 looked-0.045
Token next
Token ID1306
Feature activation-2.57e+00

Pos logit contributions
 someone+0.280
 everything+0.276
 something+0.248
 looked+0.243
 where+0.243
Neg logit contributions
ocol-1.235
 rib-1.108
 microsc-1.052
 former-1.030
hy-1.023
Token day
Token ID1110
Feature activation+2.39e-01

Pos logit contributions
 what+2.670
 everything+2.646
Maybe+2.641
 someone+2.577
 something+2.539
Neg logit contributions
ocol-3.667
 rib-3.112
hy-3.096
 gol-3.055
 topp-2.993
Token,
Token ID11
Feature activation+3.02e-02

Pos logit contributions
ocol+0.487
 rib+0.418
 microsc+0.403
ifying+0.395
hy+0.391
Neg logit contributions
 someone-0.150
 everything-0.147
 dad-0.140
Maybe-0.140
 something-0.135
Token the
Token ID262
Feature activation-2.97e-01

Pos logit contributions
ocol+0.058
 rib+0.051
 microsc+0.049
ifying+0.047
hy+0.047
Neg logit contributions
 someone-0.020
 everything-0.020
Maybe-0.019
 dad-0.019
 looked-0.019
Token sun
Token ID4252
Feature activation+7.98e-02

Pos logit contributions
 someone+0.199
 everything+0.197
Maybe+0.182
 something+0.182
 looked+0.179
Neg logit contributions
ocol-0.585
 rib-0.517
 microsc-0.488
 former-0.474
hy-0.473
Token came
Token ID1625
Feature activation+6.72e-02

Pos logit contributions
ocol+0.160
 rib+0.140
ifying+0.132
 microsc+0.132
hy+0.130
Neg logit contributions
 someone-0.052
 everything-0.050
 something-0.047
 dad-0.047
Maybe-0.045
Token unfair
Token ID11675
Feature activation-5.45e-01

Pos logit contributions
ocol+2.003
 rib+1.692
 microsc+1.649
ifying+1.607
 former+1.579
Neg logit contributions
 someone-0.671
 everything-0.654
 dad-0.616
 something-0.607
 looked-0.596
Token.
Token ID13
Feature activation-1.12e-01

Pos logit contributions
 someone+0.341
Maybe+0.326
 everything+0.325
How+0.313
Mom+0.308
Neg logit contributions
ocol-1.054
 rib-0.929
 microsc-0.907
hy-0.889
ifying-0.872
Token
Token ID198
Feature activation-3.49e-01

Pos logit contributions
 someone+0.072
 everything+0.071
 something+0.068
 dad+0.066
 looked+0.064
Neg logit contributions
ocol-0.221
 rib-0.190
 microsc-0.188
hy-0.181
ifying-0.180
Token
Token ID198
Feature activation-4.18e-01

Pos logit contributions
 someone+0.228
 everything+0.223
 something+0.215
 dad+0.214
 looked+0.210
Neg logit contributions
ocol-0.671
 rib-0.574
 microsc-0.566
hy-0.548
ifying-0.540
TokenThe
Token ID464
Feature activation-1.11e-01

Pos logit contributions
 someone+0.336
 everything+0.326
 dad+0.323
 something+0.311
 looked+0.311
Neg logit contributions
ocol-0.766
 rib-0.646
 microsc-0.627
ifying-0.613
hy-0.608
Token next
Token ID1306
Feature activation-2.50e+00

Pos logit contributions
 someone+0.088
 everything+0.085
 something+0.083
 looked+0.080
Maybe+0.080
Neg logit contributions
ocol-0.207
 rib-0.184
hy-0.167
 microsc-0.167
ifying-0.164
Token day
Token ID1110
Feature activation+2.53e-01

Pos logit contributions
Maybe+2.631
 what+2.594
 someone+2.591
 something+2.558
 everything+2.521
Neg logit contributions
ocol-3.526
 gol-3.337
hy-3.087
 rib-3.053
spe-2.986
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
ocol+0.517
 rib+0.444
 microsc+0.427
ifying+0.417
hy+0.413
Neg logit contributions
 someone-0.161
 everything-0.156
Maybe-0.152
 dad-0.149
 something-0.144
Token they
Token ID484
Feature activation-3.89e-01

Pos logit contributions
 someone+0.102
Maybe+0.099
 everything+0.097
 dad+0.097
 looked+0.094
Neg logit contributions
ocol-0.305
 rib-0.266
 microsc-0.254
ifying-0.245
 topp-0.241
Token decided
Token ID3066
Feature activation+9.55e-02

Pos logit contributions
 someone+0.252
 everything+0.239
 something+0.233
Maybe+0.222
How+0.220
Neg logit contributions
ocol-0.757
 rib-0.680
 microsc-0.650
ifying-0.642
hy-0.632
Token to
Token ID284
Feature activation+1.16e-02

Pos logit contributions
ocol+0.140
 rib+0.118
 microsc+0.111
hy+0.106
ifying+0.105
Neg logit contributions
 someone-0.109
 everything-0.108
Maybe-0.106
 dad-0.104
 looked-0.103
Token say
Token ID910
Feature activation-1.64e-01

Pos logit contributions
 someone+0.510
 everything+0.475
Maybe+0.469
 something+0.468
How+0.465
Neg logit contributions
ocol-0.890
 rib-0.783
 microsc-0.707
hy-0.696
ifying-0.677
Token.
Token ID13
Feature activation-3.44e-01

Pos logit contributions
 someone+0.101
 everything+0.100
Maybe+0.095
 dad+0.092
How+0.091
Neg logit contributions
ocol-0.335
 rib-0.295
hy-0.276
 microsc-0.275
ifying-0.270
Token
Token ID198
Feature activation-1.40e-01

Pos logit contributions
 everything+0.252
 someone+0.249
 dad+0.235
 something+0.234
 looked+0.230
Neg logit contributions
ocol-0.643
 rib-0.575
 microsc-0.532
ifying-0.530
hy-0.525
Token
Token ID198
Feature activation-2.30e-01

Pos logit contributions
 someone+0.088
 everything+0.085
 looked+0.083
 one+0.082
 something+0.082
Neg logit contributions
ocol-0.279
 rib-0.244
 microsc-0.235
hy-0.224
rou-0.224
Token"
Token ID1
Feature activation-1.09e+00

Pos logit contributions
 someone+0.156
 everything+0.151
 looked+0.145
 dad+0.144
 something+0.142
Neg logit contributions
ocol-0.453
 rib-0.400
 microsc-0.377
ifying-0.369
hy-0.361
TokenMe
Token ID5308
Feature activation-2.44e+00

Pos logit contributions
 everything+1.214
 someone+1.186
 dad+1.160
 where+1.149
 something+1.132
Neg logit contributions
ocol-1.689
 rib-1.609
leaf-1.403
hy-1.390
 gol-1.382
Token too
Token ID1165
Feature activation-4.70e-02

Pos logit contributions
Maybe+1.814
 someone+1.766
 everything+1.660
Days+1.631
They+1.588
Neg logit contributions
ocol-4.759
hy-4.237
 rib-4.134
rou-3.981
ifying-3.976
Token.
Token ID13
Feature activation+9.29e-02

Pos logit contributions
 someone+0.036
 everything+0.035
Maybe+0.034
 something+0.033
How+0.033
Neg logit contributions
ocol-0.089
 rib-0.077
 microsc-0.074
hy-0.073
ifying-0.072
Token I
Token ID314
Feature activation+5.89e-01

Pos logit contributions
ocol+0.140
 rib+0.115
 microsc+0.108
ifying+0.106
hy+0.106
Neg logit contributions
 everything-0.110
 someone-0.109
 dad-0.105
 something-0.104
 looked-0.103
Token wish
Token ID4601
Feature activation+4.41e-02

Pos logit contributions
ocol+1.007
 rib+0.836
 microsc+0.801
ifying+0.775
hy+0.768
Neg logit contributions
 someone-0.592
 everything-0.582
 dad-0.558
 something-0.553
Maybe-0.548
Token I
Token ID314
Feature activation+1.52e-01

Pos logit contributions
ocol+0.089
 rib+0.078
 microsc+0.074
hy+0.072
ifying+0.071
Neg logit contributions
 someone-0.028
Maybe-0.027
 everything-0.027
 dad-0.026
 looked-0.025
Token later
Token ID1568
Feature activation+7.13e-02

Pos logit contributions
 someone+0.126
 everything+0.120
 dad+0.118
Maybe+0.118
How+0.113
Neg logit contributions
ocol-0.326
 rib-0.286
 microsc-0.271
ifying-0.266
 former-0.263
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
ocol+0.146
 rib+0.128
 microsc+0.123
ifying+0.121
hy+0.120
Neg logit contributions
 someone-0.042
 everything-0.041
Maybe-0.039
 dad-0.038
 something-0.037
Token
Token ID198
Feature activation-3.36e-01

Pos logit contributions
 everything+0.075
 someone+0.074
 looked+0.070
 dad+0.069
 one+0.069
Neg logit contributions
ocol-0.197
 rib-0.168
 microsc-0.166
ifying-0.160
 former-0.158
Token
Token ID198
Feature activation-3.16e-01

Pos logit contributions
 someone+0.233
 everything+0.227
 one+0.223
 looked+0.217
 something+0.215
Neg logit contributions
ocol-0.647
 rib-0.551
 microsc-0.545
hy-0.521
rou-0.517
TokenThe
Token ID464
Feature activation-3.88e-01

Pos logit contributions
 someone+0.241
 everything+0.234
 looked+0.225
 one+0.225
 dad+0.219
Neg logit contributions
ocol-0.595
 rib-0.509
 microsc-0.500
ifying-0.481
rou-0.473
Token next
Token ID1306
Feature activation-2.40e+00

Pos logit contributions
 someone+0.221
 everything+0.217
Maybe+0.202
 something+0.198
 looked+0.195
Neg logit contributions
ocol-0.817
 rib-0.722
 microsc-0.682
 former-0.662
ifying-0.652
Token day
Token ID1110
Feature activation+9.38e-02

Pos logit contributions
Maybe+2.667
They+2.523
 someone+2.509
 everything+2.506
 what+2.495
Neg logit contributions
ocol-3.493
 gol-2.885
 rib-2.864
 topp-2.785
pri-2.705
Token,
Token ID11
Feature activation-3.07e-01

Pos logit contributions
ocol+0.189
 rib+0.162
 microsc+0.156
ifying+0.152
hy+0.149
Neg logit contributions
 someone-0.060
 everything-0.059
Maybe-0.058
 dad-0.056
How-0.054
Token Ben
Token ID3932
Feature activation-6.20e-02

Pos logit contributions
 someone+0.148
Maybe+0.147
 looked+0.144
 everything+0.144
How+0.126
Neg logit contributions
ocol-0.643
 rib-0.571
 microsc-0.546
ifying-0.533
 former-0.518
Token and
Token ID290
Feature activation+5.48e-01

Pos logit contributions
 someone+0.037
 everything+0.037
 something+0.033
 dad+0.032
Maybe+0.032
Neg logit contributions
ocol-0.126
 rib-0.111
 microsc-0.107
hy-0.104
ifying-0.103
Token Lily
Token ID20037
Feature activation-3.42e-01

Pos logit contributions
ocol+1.173
 rib+1.016
 microsc+0.983
ifying+0.957
hy+0.951
Neg logit contributions
 someone-0.315
 everything-0.306
 dad-0.283
 something-0.281
Maybe-0.278
Token.
Token ID13
Feature activation-6.37e-02

Pos logit contributions
ocol+1.678
 rib+1.453
 microsc+1.408
ifying+1.380
hy+1.363
Neg logit contributions
 someone-0.390
 everything-0.378
 dad-0.348
 something-0.342
 one-0.331
Token They
Token ID1119
Feature activation+5.69e-01

Pos logit contributions
 everything+0.043
 someone+0.042
 looked+0.040
 dad+0.039
 something+0.039
Neg logit contributions
ocol-0.125
 rib-0.107
 microsc-0.106
ifying-0.103
 former-0.102
Token hoped
Token ID10719
Feature activation+2.35e-01

Pos logit contributions
ocol+1.137
 rib+0.975
 microsc+0.942
ifying+0.916
hy+0.910
Neg logit contributions
 someone-0.403
 everything-0.393
 dad-0.369
 something-0.367
Maybe-0.361
Token that
Token ID326
Feature activation+1.90e-01

Pos logit contributions
ocol+0.493
 rib+0.429
 microsc+0.414
hy+0.401
ifying+0.399
Neg logit contributions
 someone-0.132
 everything-0.129
Maybe-0.126
 dad-0.121
 looked-0.119
Token the
Token ID262
Feature activation-7.84e-01

Pos logit contributions
ocol+0.402
 rib+0.353
 microsc+0.341
ifying+0.328
hy+0.328
Neg logit contributions
 someone-0.099
 everything-0.097
Maybe-0.091
 dad-0.090
 something-0.089
Token next
Token ID1306
Feature activation-2.40e+00

Pos logit contributions
 someone+0.620
 everything+0.610
They+0.594
 they+0.591
 looked+0.584
Neg logit contributions
ocol-1.418
 rib-1.268
 microsc-1.153
hy-1.150
 former-1.144
Token time
Token ID640
Feature activation-8.49e-02

Pos logit contributions
They+2.788
 what+2.725
 everything+2.641
Nothing+2.561
 dad+2.530
Neg logit contributions
ocol-3.253
 rib-3.012
 topp-2.806
 spe-2.721
 gol-2.671
Token they
Token ID484
Feature activation-1.12e-01

Pos logit contributions
 someone+0.028
 everything+0.027
Maybe+0.027
 dad+0.024
How+0.024
Neg logit contributions
ocol-0.195
 rib-0.173
 microsc-0.167
hy-0.163
ifying-0.163
Token had
Token ID550
Feature activation-4.39e-01

Pos logit contributions
 someone+0.098
 everything+0.094
 something+0.092
 dad+0.090
 one+0.089
Neg logit contributions
ocol-0.201
 rib-0.172
 microsc-0.164
ifying-0.161
hy-0.160
Token a
Token ID257
Feature activation-7.99e-01

Pos logit contributions
Maybe+0.336
 someone+0.323
 everything+0.317
How+0.315
They+0.309
Neg logit contributions
ocol-0.811
 rib-0.712
 microsc-0.669
 topp-0.654
hy-0.648
Token picnic
Token ID35715
Feature activation-5.15e-01

Pos logit contributions
 someone+0.793
 everything+0.779
They+0.768
Maybe+0.757
How+0.750
Neg logit contributions
ocol-1.271
 rib-1.112
 microsc-1.044
 former-1.013
hy-1.005
Token owner
Token ID4870
Feature activation-5.59e-02

Pos logit contributions
 someone+0.149
 everything+0.149
Maybe+0.142
 something+0.141
 looked+0.141
Neg logit contributions
ocol-0.270
 rib-0.235
 microsc-0.225
hy-0.214
ifying-0.213
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
 someone+0.038
 everything+0.037
Maybe+0.036
 dad+0.035
 something+0.034
Neg logit contributions
ocol-0.111
 rib-0.095
 microsc-0.092
hy-0.091
ifying-0.090
Token
Token ID198
Feature activation+3.09e-01

Pos logit contributions
ocol+0.234
 rib+0.201
 microsc+0.196
hy+0.190
ifying+0.190
Neg logit contributions
 someone-0.093
 everything-0.090
 something-0.086
Maybe-0.085
 dad-0.085
Token
Token ID198
Feature activation+6.43e-01

Pos logit contributions
ocol+0.647
 rib+0.561
 microsc+0.545
hy+0.529
ifying+0.528
Neg logit contributions
 someone-0.176
 everything-0.169
 dad-0.159
 something-0.158
 one-0.154
TokenThe
Token ID464
Feature activation-5.31e-01

Pos logit contributions
ocol+1.301
 rib+1.112
 microsc+1.075
ifying+1.047
hy+1.037
Neg logit contributions
 someone-0.450
 everything-0.439
 dad-0.412
 something-0.408
Maybe-0.403
Token next
Token ID1306
Feature activation-2.40e+00

Pos logit contributions
 everything+0.302
 someone+0.283
 something+0.280
Maybe+0.268
 looked+0.265
Neg logit contributions
ocol-1.075
 rib-0.960
 microsc-0.934
hy-0.925
 former-0.908
Token day
Token ID1110
Feature activation+7.55e-01

Pos logit contributions
Maybe+2.510
 everything+2.409
 someone+2.386
 what+2.363
 something+2.325
Neg logit contributions
 gol-3.395
ocol-3.317
hy-2.933
pri-2.876
 rib-2.827
Token,
Token ID11
Feature activation+8.66e-02

Pos logit contributions
ocol+1.619
 rib+1.398
 microsc+1.355
ifying+1.320
hy+1.308
Neg logit contributions
 someone-0.446
 everything-0.433
 dad-0.399
 something-0.397
 one-0.387
Token Tim
Token ID5045
Feature activation-4.63e-01

Pos logit contributions
ocol+0.151
 rib+0.128
 microsc+0.122
ifying+0.120
hy+0.117
Neg logit contributions
 someone-0.076
 everything-0.074
Maybe-0.074
 dad-0.072
 looked-0.071
Tokenmy
Token ID1820
Feature activation-4.92e-01

Pos logit contributions
 someone+0.326
 everything+0.308
 something+0.301
Maybe+0.291
 they+0.285
Neg logit contributions
ocol-0.868
 rib-0.761
 microsc-0.755
rou-0.745
hy-0.728
Token went
Token ID1816
Feature activation-1.86e-01

Pos logit contributions
 someone+0.355
 everything+0.353
Maybe+0.329
 something+0.327
How+0.325
Neg logit contributions
ocol-0.896
 rib-0.803
 microsc-0.785
hy-0.743
 gol-0.741
Token sad
Token ID6507
Feature activation-2.73e-01

Pos logit contributions
 someone+0.174
 everything+0.174
Maybe+0.167
How+0.164
 something+0.162
Neg logit contributions
ocol-0.483
 rib-0.401
 microsc-0.396
hy-0.387
ifying-0.382
Token.
Token ID13
Feature activation-1.97e-01

Pos logit contributions
 someone+0.167
Maybe+0.163
 everything+0.161
 dad+0.158
Mom+0.158
Neg logit contributions
ocol-0.542
 rib-0.473
 microsc-0.459
ifying-0.440
rou-0.440
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
 everything+0.186
 someone+0.185
 dad+0.182
 something+0.178
Maybe+0.175
Neg logit contributions
ocol-0.330
 rib-0.276
 microsc-0.270
hy-0.258
 former-0.257
Token
Token ID198
Feature activation-3.61e-01

Pos logit contributions
 someone+0.181
 everything+0.179
 dad+0.173
 something+0.172
 one+0.171
Neg logit contributions
ocol-0.543
 rib-0.464
 microsc-0.463
hy-0.439
ifying-0.434
TokenThe
Token ID464
Feature activation-7.54e-01

Pos logit contributions
 dad+0.307
 someone+0.305
 everything+0.298
 one+0.287
 something+0.286
Neg logit contributions
ocol-0.642
 rib-0.541
 microsc-0.529
ifying-0.513
hy-0.508
Token next
Token ID1306
Feature activation-2.39e+00

Pos logit contributions
 everything+0.266
 someone+0.263
They+0.254
Maybe+0.245
 they+0.245
Neg logit contributions
ocol-1.719
 rib-1.532
 microsc-1.493
 former-1.426
 gol-1.424
Token day
Token ID1110
Feature activation+2.53e-01

Pos logit contributions
Maybe+2.425
They+2.385
 everything+2.337
 what+2.310
 someone+2.268
Neg logit contributions
ocol-3.538
 gol-3.356
hy-3.004
 rib-2.925
pri-2.846
Token,
Token ID11
Feature activation-4.36e-01

Pos logit contributions
ocol+0.516
 rib+0.446
 microsc+0.427
ifying+0.417
hy+0.415
Neg logit contributions
 someone-0.158
 everything-0.154
 dad-0.149
Maybe-0.147
 something-0.142
Token Lucy
Token ID22162
Feature activation-2.95e-01

Pos logit contributions
 dad+0.388
 someone+0.385
Maybe+0.377
 everything+0.372
They+0.366
Neg logit contributions
ocol-0.718
 rib-0.628
 microsc-0.580
ifying-0.565
hy-0.553
Token was
Token ID373
Feature activation-6.84e-01

Pos logit contributions
 everything+0.213
 someone+0.213
 something+0.196
 dad+0.192
Maybe+0.192
Neg logit contributions
ocol-0.549
 rib-0.485
 microsc-0.470
hy-0.452
ifying-0.449
Token very
Token ID845
Feature activation-1.67e-01

Pos logit contributions
Maybe+0.494
They+0.464
 someone+0.457
How+0.452
 everything+0.446
Neg logit contributions
ocol-1.297
 rib-1.104
rou-1.073
hy-1.072
 microsc-1.066
Token them
Token ID606
Feature activation-1.57e-01

Pos logit contributions
ocol+0.443
 rib+0.384
 microsc+0.371
hy+0.353
ifying+0.352
Neg logit contributions
 someone-0.136
 everything-0.131
Maybe-0.129
 dad-0.127
 something-0.122
Token.
Token ID13
Feature activation-4.09e-01

Pos logit contributions
 someone+0.096
 everything+0.092
Maybe+0.091
 dad+0.089
How+0.085
Neg logit contributions
ocol-0.319
 rib-0.279
 microsc-0.270
hy-0.260
ifying-0.258
Token
Token ID198
Feature activation-3.36e-01

Pos logit contributions
 everything+0.345
 someone+0.336
 looked+0.329
 something+0.323
 dad+0.313
Neg logit contributions
ocol-0.723
 rib-0.626
 microsc-0.607
hy-0.584
ifying-0.577
Token
Token ID198
Feature activation-3.50e-01

Pos logit contributions
 someone+0.228
 everything+0.220
 looked+0.214
 something+0.213
 one+0.210
Neg logit contributions
ocol-0.652
 rib-0.558
 microsc-0.550
hy-0.527
rou-0.518
TokenThe
Token ID464
Feature activation-5.37e-01

Pos logit contributions
 someone+0.223
 everything+0.215
 looked+0.205
 dad+0.202
 something+0.201
Neg logit contributions
ocol-0.703
 rib-0.613
 microsc-0.596
hy-0.577
ifying-0.573
Token next
Token ID1306
Feature activation-2.39e+00

Pos logit contributions
 someone+0.438
 everything+0.428
 something+0.419
Maybe+0.411
 dad+0.393
Neg logit contributions
ocol-0.986
 rib-0.862
 microsc-0.827
hy-0.800
 former-0.776
Token day
Token ID1110
Feature activation+3.03e-01

Pos logit contributions
Maybe+2.510
 someone+2.447
 something+2.413
 everything+2.346
 what+2.328
Neg logit contributions
ocol-3.477
hy-3.141
 rib-3.028
 gol-3.023
spe-2.889
Token,
Token ID11
Feature activation-1.78e-01

Pos logit contributions
ocol+0.623
 rib+0.539
 microsc+0.518
ifying+0.503
hy+0.500
Neg logit contributions
 someone-0.189
 everything-0.186
Maybe-0.178
 dad-0.177
 something-0.170
Token Lily
Token ID20037
Feature activation-2.92e-01

Pos logit contributions
Maybe+0.105
 someone+0.105
 everything+0.102
 looked+0.100
 dad+0.100
Neg logit contributions
ocol-0.357
 rib-0.320
 microsc-0.301
ifying-0.287
 former-0.285
Token went
Token ID1816
Feature activation-2.62e-01

Pos logit contributions
 someone+0.197
 everything+0.195
 something+0.182
Maybe+0.172
How+0.170
Neg logit contributions
ocol-0.558
 rib-0.501
 microsc-0.481
hy-0.463
ifying-0.456
Token to
Token ID284
Feature activation+6.18e-01

Pos logit contributions
 someone+0.254
Maybe+0.253
 everything+0.249
 dad+0.241
 looked+0.241
Neg logit contributions
ocol-0.433
 rib-0.368
 microsc-0.352
hy-0.340
ifying-0.337
Token,
Token ID11
Feature activation-8.60e-01

Pos logit contributions
 someone+0.635
 dad+0.594
How+0.587
 everything+0.582
Maybe+0.571
Neg logit contributions
ocol-1.526
 rib-1.358
hy-1.302
 microsc-1.291
onna-1.273
Token but
Token ID475
Feature activation-2.12e-01

Pos logit contributions
 someone+0.746
Maybe+0.694
They+0.690
Mom+0.679
 everything+0.674
Neg logit contributions
ocol-1.430
 rib-1.308
 microsc-1.209
hy-1.170
 gymn-1.138
Token they
Token ID484
Feature activation+1.51e-01

Pos logit contributions
 someone+0.055
Maybe+0.049
 everything+0.048
They+0.043
How+0.043
Neg logit contributions
ocol-0.495
 rib-0.444
 microsc-0.433
ifying-0.419
hy-0.415
Token don
Token ID836
Feature activation-3.48e-01

Pos logit contributions
ocol+0.297
 rib+0.255
 microsc+0.248
ifying+0.241
hy+0.239
Neg logit contributions
 someone-0.108
 everything-0.102
 something-0.096
 dad-0.096
Maybe-0.095
Token't
Token ID470
Feature activation-1.85e+00

Pos logit contributions
 someone+0.365
 everything+0.357
 dad+0.353
 looked+0.346
 something+0.346
Neg logit contributions
ocol-0.548
 rib-0.453
 microsc-0.430
ifying-0.423
hy-0.415
Token see
Token ID766
Feature activation-2.38e+00

Pos logit contributions
How+1.552
Maybe+1.466
 everything+1.452
 someone+1.432
They+1.392
Neg logit contributions
ocol-3.308
onna-2.836
 microsc-2.717
 rib-2.713
hy-2.562
Token any
Token ID597
Feature activation-3.45e-02

Pos logit contributions
They+2.227
Turn+2.180
Mom+2.169
Maybe+2.159
Nothing+2.112
Neg logit contributions
ocol-4.027
 rib-3.610
 microsc-3.401
ifying-3.302
 spe-3.261
Token mud
Token ID17492
Feature activation-6.03e-01

Pos logit contributions
 someone+0.026
 everything+0.026
 dad+0.025
Maybe+0.025
 looked+0.024
Neg logit contributions
ocol-0.067
 rib-0.057
 microsc-0.055
ifying-0.052
hy-0.051
Token.
Token ID13
Feature activation-6.46e-02

Pos logit contributions
 someone+0.454
 dad+0.427
Maybe+0.424
 everything+0.411
Mom+0.403
Neg logit contributions
ocol-1.161
 rib-1.000
 microsc-0.941
ifying-0.914
 topp-0.897
Token They
Token ID1119
Feature activation+2.94e-01

Pos logit contributions
 everything+0.048
 someone+0.047
 dad+0.045
 looked+0.044
 one+0.044
Neg logit contributions
ocol-0.123
 rib-0.106
 microsc-0.103
ifying-0.101
hy-0.099
Token only
Token ID691
Feature activation-4.12e-01

Pos logit contributions
ocol+0.606
 rib+0.525
 microsc+0.506
ifying+0.494
hy+0.491
Neg logit contributions
 someone-0.188
 everything-0.180
 something-0.167
 dad-0.167
Maybe-0.165
Token soon
Token ID2582
Feature activation+3.86e-01

Pos logit contributions
ocol+0.145
 rib+0.127
 microsc+0.122
ifying+0.119
 former+0.118
Neg logit contributions
 someone-0.044
 everything-0.044
 dad-0.041
Maybe-0.041
 looked-0.039
Token.
Token ID13
Feature activation-4.46e-01

Pos logit contributions
ocol+0.823
 rib+0.715
 microsc+0.688
ifying+0.674
hy+0.670
Neg logit contributions
 someone-0.214
 everything-0.208
 dad-0.193
Maybe-0.193
 something-0.189
Token
Token ID198
Feature activation-6.15e-02

Pos logit contributions
 someone+0.315
 everything+0.303
 something+0.297
 dad+0.296
 looked+0.282
Neg logit contributions
ocol-0.843
 rib-0.730
 microsc-0.711
ifying-0.685
hy-0.680
Token
Token ID198
Feature activation+5.62e-02

Pos logit contributions
 someone+0.038
 everything+0.037
 dad+0.036
 something+0.036
 one+0.035
Neg logit contributions
ocol-0.123
 rib-0.106
 microsc-0.104
hy-0.100
onna-0.099
TokenThe
Token ID464
Feature activation-8.57e-01

Pos logit contributions
ocol+0.104
 rib+0.089
 microsc+0.085
ifying+0.084
hy+0.082
Neg logit contributions
 someone-0.046
 everything-0.044
 dad-0.044
 something-0.042
 looked-0.042
Token next
Token ID1306
Feature activation-2.37e+00

Pos logit contributions
 everything+0.673
 someone+0.655
They+0.647
 something+0.617
Maybe+0.608
Neg logit contributions
ocol-1.548
 rib-1.406
 microsc-1.273
 former-1.242
hy-1.226
Token day
Token ID1110
Feature activation+2.04e-01

Pos logit contributions
Maybe+2.474
 someone+2.438
 what+2.433
 everything+2.433
They+2.431
Neg logit contributions
ocol-3.461
 rib-2.964
 gol-2.916
pri-2.736
 topp-2.710
Token,
Token ID11
Feature activation-7.70e-02

Pos logit contributions
ocol+0.414
 rib+0.358
 microsc+0.343
ifying+0.334
hy+0.331
Neg logit contributions
 someone-0.129
 everything-0.127
Maybe-0.122
 dad-0.121
 something-0.115
Token Anna
Token ID11735
Feature activation-4.05e-01

Pos logit contributions
 someone+0.055
 everything+0.053
 dad+0.052
Maybe+0.052
 looked+0.050
Neg logit contributions
ocol-0.147
 rib-0.128
 microsc-0.121
ifying-0.119
hy-0.115
Token checked
Token ID10667
Feature activation-1.82e-01

Pos logit contributions
 someone+0.272
 everything+0.265
 something+0.245
 dad+0.237
How+0.232
Neg logit contributions
ocol-0.788
 rib-0.707
 microsc-0.675
ifying-0.647
 former-0.645
Token on
Token ID319
Feature activation-2.18e-02

Pos logit contributions
 someone+0.143
Maybe+0.143
 looked+0.137
 everything+0.137
They+0.136
Neg logit contributions
ocol-0.339
 rib-0.292
 microsc-0.276
ifying-0.263
rou-0.257
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
 someone+0.401
 everything+0.396
Maybe+0.393
How+0.378
 dad+0.378
Neg logit contributions
ocol-0.997
 rib-0.878
 microsc-0.850
hy-0.820
pill-0.788
Token
Token ID220
Feature activation-4.71e-01

Pos logit contributions
 someone+0.135
 everything+0.132
 something+0.128
 dad+0.127
Maybe+0.127
Neg logit contributions
ocol-0.236
 rib-0.200
 microsc-0.193
hy-0.186
ifying-0.185
Token
Token ID198
Feature activation-8.84e-02

Pos logit contributions
 someone+0.309
 everything+0.306
 one+0.300
 looked+0.298
 dad+0.295
Neg logit contributions
ocol-0.913
 rib-0.779
 microsc-0.762
hy-0.742
ifying-0.740
Token
Token ID198
Feature activation+3.08e-01

Pos logit contributions
 someone+0.057
 everything+0.053
 dad+0.053
 something+0.052
 one+0.052
Neg logit contributions
ocol-0.177
 rib-0.154
 microsc-0.150
hy-0.145
ifying-0.143
TokenThe
Token ID464
Feature activation-5.19e-01

Pos logit contributions
ocol+0.523
 rib+0.439
 microsc+0.418
ifying+0.407
hy+0.403
Neg logit contributions
 someone-0.305
 everything-0.297
 dad-0.291
 something-0.286
 one-0.283
Token next
Token ID1306
Feature activation-2.37e+00

Pos logit contributions
 someone+0.277
 everything+0.275
Maybe+0.255
 something+0.252
They+0.246
Neg logit contributions
ocol-1.091
 rib-0.954
 microsc-0.948
hy-0.908
 former-0.891
Token day
Token ID1110
Feature activation+2.68e-01

Pos logit contributions
Maybe+2.519
 someone+2.419
 everything+2.395
They+2.370
 somewhere+2.354
Neg logit contributions
ocol-3.410
 gol-3.227
hy-2.983
 rib-2.807
pri-2.790
Token,
Token ID11
Feature activation+1.29e-02

Pos logit contributions
ocol+0.540
 rib+0.467
 microsc+0.447
ifying+0.437
hy+0.433
Neg logit contributions
 someone-0.174
 everything-0.168
 dad-0.162
Maybe-0.160
 something-0.156
Token Lucy
Token ID22162
Feature activation-1.18e-01

Pos logit contributions
ocol+0.019
 rib+0.016
 microsc+0.015
ifying+0.014
 former+0.014
Neg logit contributions
 someone-0.015
 dad-0.014
 everything-0.014
Maybe-0.014
 looked-0.014
Token took
Token ID1718
Feature activation-7.76e-01

Pos logit contributions
 someone+0.079
 everything+0.077
 something+0.072
Maybe+0.069
 dad+0.069
Neg logit contributions
ocol-0.232
 rib-0.205
 microsc-0.198
ifying-0.191
hy-0.191
Token the
Token ID262
Feature activation+1.98e-01

Pos logit contributions
Maybe+0.573
They+0.554
How+0.546
Nothing+0.498
 someone+0.496
Neg logit contributions
ocol-1.495
 rib-1.317
 microsc-1.217
hy-1.164
leaf-1.145
Token sad
Token ID6507
Feature activation-2.73e-01

Pos logit contributions
 someone+0.196
Maybe+0.195
 everything+0.191
 dad+0.186
How+0.186
Neg logit contributions
ocol-0.654
 rib-0.557
hy-0.541
ifying-0.538
 microsc-0.538
Token and
Token ID290
Feature activation-1.76e-01

Pos logit contributions
 someone+0.188
 everything+0.179
Maybe+0.177
 dad+0.174
How+0.174
Neg logit contributions
ocol-0.526
 rib-0.459
 microsc-0.439
 former-0.424
ifying-0.424
Token angry
Token ID7954
Feature activation-5.92e-01

Pos logit contributions
 someone+0.129
 everything+0.127
Maybe+0.123
 one+0.118
 something+0.118
Neg logit contributions
ocol-0.333
 rib-0.285
 microsc-0.279
hy-0.277
ifying-0.271
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
Maybe+0.404
 someone+0.395
 everything+0.389
How+0.386
Mom+0.380
Neg logit contributions
ocol-1.132
 rib-0.995
 microsc-0.945
hy-0.941
ifying-0.922
Token The
Token ID383
Feature activation-8.91e-01

Pos logit contributions
ocol+0.210
 rib+0.173
 microsc+0.170
hy+0.164
ifying+0.164
Neg logit contributions
 someone-0.130
 everything-0.129
 dad-0.124
 something-0.124
Maybe-0.123
Token next
Token ID1306
Feature activation-2.36e+00

Pos logit contributions
 everything+0.448
 someone+0.439
Maybe+0.419
How+0.419
They+0.405
Neg logit contributions
ocol-1.883
 rib-1.671
 microsc-1.608
 former-1.557
hy-1.545
Token day
Token ID1110
Feature activation+3.61e-01

Pos logit contributions
Maybe+2.396
 everything+2.364
They+2.354
 where+2.324
 what+2.295
Neg logit contributions
ocol-3.505
 gol-3.240
 topp-2.931
pri-2.868
 rib-2.855
Token,
Token ID11
Feature activation-6.55e-02

Pos logit contributions
ocol+0.741
 rib+0.637
 microsc+0.616
ifying+0.602
hy+0.597
Neg logit contributions
 someone-0.224
 everything-0.220
Maybe-0.209
 dad-0.207
 something-0.202
Token Tim
Token ID5045
Feature activation-3.75e-01

Pos logit contributions
 someone+0.036
 everything+0.034
Maybe+0.034
 dad+0.033
 looked+0.033
Neg logit contributions
ocol-0.134
 rib-0.117
 microsc-0.113
ifying-0.111
hy-0.109
Token wanted
Token ID2227
Feature activation-1.03e-01

Pos logit contributions
 someone+0.286
 everything+0.285
 something+0.269
Maybe+0.264
How+0.258
Neg logit contributions
ocol-0.669
 rib-0.596
 microsc-0.573
hy-0.563
rou-0.554
Token to
Token ID284
Feature activation+4.17e-01

Pos logit contributions
Maybe+0.116
 everything+0.115
 someone+0.115
They+0.113
 looked+0.112
Neg logit contributions
ocol-0.152
 rib-0.127
 microsc-0.123
 topp-0.112
hy-0.112
Token happy
Token ID3772
Feature activation-8.53e-01

Pos logit contributions
Maybe+0.471
 someone+0.465
 everything+0.456
 dad+0.438
How+0.430
Neg logit contributions
ocol-1.353
 rib-1.186
 microsc-1.091
hy-1.082
 former-1.075
Token.
Token ID13
Feature activation-3.03e-01

Pos logit contributions
 someone+0.616
Maybe+0.605
Mom+0.603
 everything+0.601
Turn+0.582
Neg logit contributions
ocol-1.553
 rib-1.389
 microsc-1.313
 former-1.266
onna-1.249
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
 someone+0.217
 everything+0.217
 dad+0.204
 looked+0.203
 something+0.202
Neg logit contributions
ocol-0.572
 rib-0.496
 microsc-0.487
ifying-0.472
 former-0.469
Token
Token ID198
Feature activation-3.62e-01

Pos logit contributions
 someone+0.348
 everything+0.335
 looked+0.331
 one+0.330
 something+0.325
Neg logit contributions
ocol-0.879
 rib-0.754
 microsc-0.741
rou-0.705
hy-0.701
TokenThe
Token ID464
Feature activation-6.45e-02

Pos logit contributions
 someone+0.204
 everything+0.196
 dad+0.188
 looked+0.187
 one+0.181
Neg logit contributions
ocol-0.748
 rib-0.659
 microsc-0.642
ifying-0.618
rou-0.615
Token next
Token ID1306
Feature activation-2.36e+00

Pos logit contributions
 everything+0.058
 someone+0.058
Maybe+0.055
 dad+0.055
 looked+0.054
Neg logit contributions
ocol-0.115
 rib-0.097
 microsc-0.093
hy-0.088
 former-0.088
Token day
Token ID1110
Feature activation+2.79e-01

Pos logit contributions
Maybe+2.499
 someone+2.445
 what+2.395
 everything+2.360
 something+2.351
Neg logit contributions
ocol-3.446
 rib-2.924
 gol-2.905
hy-2.854
 gymn-2.658
Token,
Token ID11
Feature activation-2.89e-01

Pos logit contributions
ocol+0.572
 rib+0.493
 microsc+0.476
ifying+0.463
hy+0.458
Neg logit contributions
 someone-0.176
 everything-0.172
Maybe-0.164
 dad-0.163
 something-0.157
Token they
Token ID484
Feature activation-4.14e-01

Pos logit contributions
 someone+0.180
Maybe+0.176
 everything+0.174
 looked+0.168
 dad+0.166
Neg logit contributions
ocol-0.570
 rib-0.507
 microsc-0.483
ifying-0.465
rou-0.451
Token went
Token ID1816
Feature activation-3.72e-01

Pos logit contributions
 someone+0.285
 everything+0.271
 something+0.265
How+0.249
Maybe+0.247
Neg logit contributions
ocol-0.792
 rib-0.712
 microsc-0.679
ifying-0.665
hy-0.656
Token back
Token ID736
Feature activation+3.41e-01

Pos logit contributions
 someone+0.308
Maybe+0.307
 everything+0.297
 dad+0.287
 looked+0.283
Neg logit contributions
ocol-0.661
 rib-0.575
 microsc-0.549
hy-0.538
ifying-0.536
Token hurt
Token ID5938
Feature activation-8.28e-01

Pos logit contributions
ocol+0.177
 rib+0.153
 microsc+0.147
hy+0.145
ifying+0.142
Neg logit contributions
 someone-0.065
 everything-0.063
Maybe-0.060
 dad-0.059
 something-0.059
Token.
Token ID13
Feature activation-7.52e-01

Pos logit contributions
 dad+0.560
 someone+0.545
 everything+0.544
Maybe+0.542
 looked+0.532
Neg logit contributions
ocol-1.568
 rib-1.328
 microsc-1.290
hy-1.270
ifying-1.259
Token
Token ID198
Feature activation-5.54e-01

Pos logit contributions
 everything+0.624
 someone+0.596
 dad+0.582
 one+0.581
 something+0.580
Neg logit contributions
ocol-1.286
 microsc-1.126
 rib-1.110
hy-1.066
ifying-1.032
Token
Token ID198
Feature activation-6.91e-01

Pos logit contributions
 everything+0.373
 someone+0.370
 looked+0.364
 one+0.359
 something+0.356
Neg logit contributions
ocol-1.064
 rib-0.901
 microsc-0.900
hy-0.856
onna-0.842
TokenThe
Token ID464
Feature activation-1.25e+00

Pos logit contributions
 everything+0.449
 someone+0.449
 looked+0.417
 something+0.414
 one+0.412
Neg logit contributions
ocol-1.364
 microsc-1.167
 rib-1.164
hy-1.108
ifying-1.100
Token next
Token ID1306
Feature activation-2.35e+00

Pos logit contributions
 everything+1.019
How+0.997
 someone+0.990
They+0.982
 where+0.969
Neg logit contributions
ocol-2.194
 rib-1.980
 microsc-1.921
 gol-1.823
hy-1.804
Token day
Token ID1110
Feature activation-4.19e-01

Pos logit contributions
Maybe+2.315
 everything+2.223
They+2.181
How+2.157
 someone+2.139
Neg logit contributions
ocol-3.714
 gol-3.305
hy-3.172
 rib-3.127
pri-2.972
Token,
Token ID11
Feature activation-9.79e-01

Pos logit contributions
Maybe+0.297
 everything+0.294
 someone+0.292
How+0.284
 dad+0.277
Neg logit contributions
ocol-0.786
 rib-0.670
 microsc-0.653
ifying-0.633
hy-0.633
Token Tom
Token ID4186
Feature activation-1.01e+00

Pos logit contributions
 someone+0.629
 everything+0.622
Maybe+0.614
 looked+0.610
 dad+0.592
Neg logit contributions
ocol-1.748
 rib-1.587
 microsc-1.536
ifying-1.441
hy-1.438
Token and
Token ID290
Feature activation+4.36e-01

Pos logit contributions
 someone+0.871
 everything+0.834
Mom+0.811
Maybe+0.808
 something+0.800
Neg logit contributions
ocol-1.737
 rib-1.516
 microsc-1.491
hy-1.416
rou-1.416
Token Lisa
Token ID15378
Feature activation-1.93e-01

Pos logit contributions
ocol+0.846
 rib+0.727
 microsc+0.701
hy+0.675
ifying+0.675
Neg logit contributions
 someone-0.329
 everything-0.321
 dad-0.303
 something-0.302
Maybe-0.301
Token every
Token ID790
Feature activation-1.22e+00

Pos logit contributions
Maybe+0.304
 someone+0.291
How+0.284
 everything+0.283
 looked+0.280
Neg logit contributions
ocol-0.807
 rib-0.695
 microsc-0.673
hy-0.635
ifying-0.634
Token night
Token ID1755
Feature activation+2.13e-01

Pos logit contributions
Maybe+1.039
 everything+1.007
They+0.997
 someone+0.977
How+0.944
Neg logit contributions
ocol-2.201
 rib-1.896
ifying-1.796
 microsc-1.720
 topp-1.682
Token.
Token ID13
Feature activation+6.19e-01

Pos logit contributions
ocol+0.450
 rib+0.391
 microsc+0.375
ifying+0.371
hy+0.369
Neg logit contributions
 someone-0.120
 everything-0.118
Maybe-0.112
 dad-0.108
 something-0.107
Token She
Token ID1375
Feature activation-3.22e-01

Pos logit contributions
ocol+1.190
 rib+1.009
 microsc+0.972
ifying+0.945
hy+0.937
Neg logit contributions
 someone-0.497
 everything-0.486
 dad-0.461
 something-0.457
Maybe-0.452
Token hugs
Token ID40657
Feature activation-3.78e-01

Pos logit contributions
 someone+0.351
 everything+0.338
 something+0.325
 dad+0.323
Maybe+0.320
Neg logit contributions
ocol-0.514
 rib-0.448
 microsc-0.412
hy-0.390
ifying-0.384
Token it
Token ID340
Feature activation-2.35e+00

Pos logit contributions
Maybe+0.246
 someone+0.244
 everything+0.234
 looked+0.229
How+0.226
Neg logit contributions
ocol-0.735
 rib-0.662
 microsc-0.612
hy-0.589
ifying-0.588
Token and
Token ID290
Feature activation-1.95e-01

Pos logit contributions
Maybe+2.270
How+2.249
Nothing+2.248
 someone+2.210
What+2.107
Neg logit contributions
 rib-3.595
ocol-3.168
 stuffed-3.151
hy-3.125
rou-3.058
Token squee
Token ID13170
Feature activation+5.36e-02

Pos logit contributions
 someone+0.183
 everything+0.177
Maybe+0.173
 somewhere+0.168
How+0.167
Neg logit contributions
ocol-0.330
 rib-0.293
 microsc-0.274
hy-0.257
 former-0.256
Tokenzes
Token ID12271
Feature activation+1.68e-01

Pos logit contributions
ocol+0.091
 rib+0.076
 microsc+0.071
hy+0.068
ifying+0.068
Neg logit contributions
 someone-0.055
 everything-0.052
Maybe-0.051
 something-0.051
 dad-0.050
Token it
Token ID340
Feature activation-1.56e+00

Pos logit contributions
ocol+0.372
 rib+0.330
 microsc+0.316
ifying+0.305
hy+0.303
Neg logit contributions
 someone-0.077
 everything-0.072
Maybe-0.069
 dad-0.069
 looked-0.065
Token tight
Token ID5381
Feature activation+5.59e-01

Pos logit contributions
 someone+1.362
 dad+1.344
Maybe+1.344
How+1.305
Nothing+1.287
Neg logit contributions
ocol-2.502
 rib-2.481
hy-2.196
ifying-2.164
 microsc-2.155
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
 dad+0.329
 someone+0.326
 everything+0.322
Maybe+0.318
 looked+0.297
Neg logit contributions
ocol-0.947
 rib-0.842
 microsc-0.802
ifying-0.778
hy-0.769
Token It
Token ID632
Feature activation+1.24e-01

Pos logit contributions
ocol+0.512
 rib+0.436
 microsc+0.418
ifying+0.406
hy+0.406
Neg logit contributions
 someone-0.222
 everything-0.221
 dad-0.214
Maybe-0.208
 something-0.207
Token wasn
Token ID2492
Feature activation+9.85e-02

Pos logit contributions
ocol+0.235
 rib+0.206
 microsc+0.196
hy+0.190
ifying+0.189
Neg logit contributions
 someone-0.094
 everything-0.092
 dad-0.090
 something-0.086
Maybe-0.084
Token't
Token ID470
Feature activation+1.82e-01

Pos logit contributions
ocol+0.165
 rib+0.138
 microsc+0.130
ifying+0.129
hy+0.127
Neg logit contributions
 someone-0.099
 everything-0.097
 dad-0.095
 something-0.092
Maybe-0.092
Token scary
Token ID14343
Feature activation+8.80e-01

Pos logit contributions
ocol+0.355
 rib+0.305
 microsc+0.293
hy+0.284
ifying+0.280
Neg logit contributions
 someone-0.132
 everything-0.129
Maybe-0.127
 dad-0.124
 something-0.120
Token after
Token ID706
Feature activation-2.35e+00

Pos logit contributions
ocol+1.598
 rib+1.340
 microsc+1.287
ifying+1.261
 former+1.237
Neg logit contributions
 someone-0.808
 everything-0.793
 something-0.752
 one-0.748
 dad-0.745
Token all
Token ID477
Feature activation+4.66e-01

Pos logit contributions
Maybe+2.375
How+2.089
Mom+1.994
They+1.886
Dad+1.740
Neg logit contributions
ocol-4.538
 rib-3.789
 microsc-3.716
ous-3.326
ifying-3.291
Token.
Token ID13
Feature activation+5.58e-02

Pos logit contributions
ocol+0.993
 rib+0.857
 microsc+0.833
ifying+0.808
hy+0.804
Neg logit contributions
 someone-0.268
 everything-0.259
 dad-0.244
Maybe-0.241
 something-0.239
Token
Token ID220
Feature activation-3.95e-01

Pos logit contributions
ocol+0.109
 rib+0.093
 microsc+0.090
ifying+0.087
hy+0.087
Neg logit contributions
 everything-0.040
 someone-0.040
 dad-0.039
 something-0.037
 one-0.037
Token
Token ID198
Feature activation+1.06e-02

Pos logit contributions
 someone+0.252
 looked+0.251
 everything+0.247
 dad+0.245
 one+0.240
Neg logit contributions
ocol-0.766
 rib-0.654
 microsc-0.647
rou-0.618
ifying-0.617
Token
Token ID198
Feature activation-2.32e-01

Pos logit contributions
ocol+0.021
 rib+0.018
 microsc+0.018
hy+0.017
ifying+0.017
Neg logit contributions
 someone-0.007
 everything-0.006
 dad-0.006
 something-0.006
 one-0.006
Tokener
Token ID263
Feature activation+6.28e-01

Pos logit contributions
ocol+1.522
 rib+1.316
 microsc+1.278
ifying+1.247
hy+1.235
Neg logit contributions
 someone-0.361
 everything-0.349
 dad-0.320
 something-0.317
 one-0.309
Token.
Token ID13
Feature activation-3.18e-01

Pos logit contributions
ocol+1.360
 rib+1.177
 microsc+1.140
ifying+1.114
hy+1.104
Neg logit contributions
 someone-0.344
 everything-0.333
 dad-0.307
 something-0.303
Maybe-0.298
Token
Token ID198
Feature activation+2.88e-02

Pos logit contributions
 someone+0.291
 everything+0.289
 dad+0.284
 something+0.281
Maybe+0.274
Neg logit contributions
ocol-0.533
 rib-0.459
 microsc-0.437
hy-0.425
ifying-0.415
Token
Token ID198
Feature activation+5.05e-02

Pos logit contributions
ocol+0.058
 rib+0.050
 microsc+0.049
hy+0.047
ifying+0.047
Neg logit contributions
 someone-0.018
 everything-0.017
 dad-0.016
 something-0.016
 one-0.016
TokenThe
Token ID464
Feature activation-6.76e-01

Pos logit contributions
ocol+0.084
 rib+0.071
 microsc+0.067
ifying+0.066
hy+0.065
Neg logit contributions
 someone-0.050
 everything-0.049
 dad-0.049
 something-0.048
 one-0.047
Token next
Token ID1306
Feature activation-2.34e+00

Pos logit contributions
 someone+0.451
They+0.440
 they+0.435
 everything+0.434
Maybe+0.421
Neg logit contributions
ocol-1.322
 rib-1.182
 microsc-1.117
 gol-1.070
hy-1.059
Token day
Token ID1110
Feature activation+8.18e-02

Pos logit contributions
Maybe+2.409
 someone+2.368
 what+2.350
They+2.336
 something+2.330
Neg logit contributions
ocol-3.378
 gol-3.226
 rib-2.985
pri-2.842
hy-2.837
Token,
Token ID11
Feature activation-3.61e-01

Pos logit contributions
ocol+0.163
 rib+0.140
 microsc+0.134
ifying+0.131
hy+0.130
Neg logit contributions
 someone-0.054
 everything-0.052
Maybe-0.051
 dad-0.051
How-0.049
Token she
Token ID673
Feature activation-6.94e-01

Pos logit contributions
 someone+0.433
 dad+0.430
Maybe+0.427
 everything+0.420
 looked+0.413
Neg logit contributions
ocol-0.488
 rib-0.411
 microsc-0.376
ifying-0.361
hy-0.349
Token was
Token ID373
Feature activation-5.33e-01

Pos logit contributions
 someone+0.463
 everything+0.459
 something+0.420
Maybe+0.409
 dad+0.408
Neg logit contributions
ocol-1.315
 rib-1.207
 microsc-1.151
rou-1.089
 topp-1.078
Token very
Token ID845
Feature activation+1.00e-01

Pos logit contributions
Maybe+0.463
 someone+0.428
They+0.421
 everything+0.417
How+0.417
Neg logit contributions
ocol-0.953
 rib-0.810
 microsc-0.779
rou-0.775
hy-0.770
Token later
Token ID1568
Feature activation+1.60e-02

Pos logit contributions
ocol+0.813
 rib+0.706
 microsc+0.685
ifying+0.664
hy+0.657
Neg logit contributions
 someone-0.205
 everything-0.200
 dad-0.185
Maybe-0.183
 something-0.179
Token.
Token ID13
Feature activation-4.05e-02

Pos logit contributions
ocol+0.033
 rib+0.028
 microsc+0.028
hy+0.027
ifying+0.027
Neg logit contributions
 someone-0.009
 everything-0.009
Maybe-0.009
 dad-0.009
How-0.008
Token
Token ID198
Feature activation-4.58e-01

Pos logit contributions
 everything+0.030
 someone+0.030
 dad+0.028
 looked+0.028
 something+0.028
Neg logit contributions
ocol-0.077
 rib-0.066
 microsc-0.065
ifying-0.063
 former-0.062
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
 someone+0.310
 everything+0.304
 one+0.296
 something+0.294
 looked+0.294
Neg logit contributions
ocol-0.874
 rib-0.748
 microsc-0.739
onna-0.707
hy-0.702
TokenThe
Token ID464
Feature activation-9.52e-01

Pos logit contributions
 someone+0.221
 everything+0.216
 looked+0.208
 dad+0.206
 one+0.202
Neg logit contributions
ocol-0.655
 rib-0.565
 microsc-0.552
ifying-0.535
hy-0.525
Token next
Token ID1306
Feature activation-2.34e+00

Pos logit contributions
They+0.697
 everything+0.694
 someone+0.675
Maybe+0.655
How+0.634
Neg logit contributions
ocol-1.782
 rib-1.609
 microsc-1.569
hy-1.448
 former-1.429
Token day
Token ID1110
Feature activation+3.93e-01

Pos logit contributions
Maybe+2.402
 everything+2.343
They+2.318
 someone+2.292
How+2.245
Neg logit contributions
ocol-3.519
 gol-3.001
hy-2.989
 rib-2.928
onna-2.854
Token,
Token ID11
Feature activation-3.57e-03

Pos logit contributions
ocol+0.814
 rib+0.702
 microsc+0.677
ifying+0.660
hy+0.654
Neg logit contributions
 someone-0.241
 everything-0.237
 dad-0.222
Maybe-0.222
 something-0.217
Token they
Token ID484
Feature activation-3.56e-01

Pos logit contributions
 someone+0.001
 everything+0.001
Maybe+0.001
 looked+0.001
 dad+0.001
Neg logit contributions
ocol-0.008
 rib-0.007
 microsc-0.007
ifying-0.007
 former-0.007
Token went
Token ID1816
Feature activation-2.59e-01

Pos logit contributions
 someone+0.259
 everything+0.248
 something+0.239
How+0.229
 one+0.227
Neg logit contributions
ocol-0.672
 rib-0.600
 microsc-0.581
ifying-0.557
hy-0.546
Token back
Token ID736
Feature activation+2.77e-01

Pos logit contributions
 someone+0.261
Maybe+0.259
 everything+0.253
 dad+0.245
How+0.241
Neg logit contributions
ocol-0.419
 rib-0.356
 microsc-0.344
hy-0.327
ifying-0.325
Token of
Token ID286
Feature activation+1.46e+00

Pos logit contributions
ocol+2.132
 rib+1.725
 microsc+1.692
 former+1.660
ifying+1.602
Neg logit contributions
 someone-1.540
 everything-1.509
 one-1.482
 something-1.470
 looked-1.451
Token scissors
Token ID41786
Feature activation+8.91e-01

Pos logit contributions
ocol+2.563
 microsc+2.106
ifying+2.103
 rib+2.077
 former+2.057
Neg logit contributions
 someone-1.408
 everything-1.388
 something-1.348
 one-1.346
 dad-1.328
Token and
Token ID290
Feature activation+1.44e-01

Pos logit contributions
ocol+1.969
 rib+1.699
 microsc+1.659
ifying+1.612
 former+1.603
Neg logit contributions
 someone-0.458
 everything-0.446
 something-0.410
 dad-0.409
 one-0.403
Token carefully
Token ID7773
Feature activation-1.58e-01

Pos logit contributions
ocol+0.284
 rib+0.251
 microsc+0.237
ifying+0.229
hy+0.228
Neg logit contributions
 someone-0.097
 everything-0.097
 dad-0.090
Maybe-0.089
 something-0.087
Token cut
Token ID2005
Feature activation-1.64e-02

Pos logit contributions
 someone+0.146
 everything+0.144
Maybe+0.139
 dad+0.138
 something+0.133
Neg logit contributions
ocol-0.282
 rib-0.241
 microsc-0.222
hy-0.216
 topp-0.214
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
 someone+0.010
 dad+0.010
Maybe+0.010
 everything+0.010
 looked+0.009
Neg logit contributions
ocol-0.034
 rib-0.029
 microsc-0.028
ifying-0.027
hy-0.027
Token tape
Token ID9154
Feature activation+4.37e-01

Pos logit contributions
ocol+2.538
 microsc+2.078
 rib+2.025
ifying+1.974
 former+1.962
Neg logit contributions
 someone-1.339
 everything-1.303
 one-1.269
 something-1.263
 dad-1.240
Token.
Token ID13
Feature activation-5.52e-02

Pos logit contributions
ocol+0.909
 rib+0.788
 microsc+0.758
ifying+0.742
hy+0.735
Neg logit contributions
 someone-0.268
 everything-0.259
 dad-0.243
Maybe-0.239
 something-0.238
Token Now
Token ID2735
Feature activation-5.60e-01

Pos logit contributions
 someone+0.049
 everything+0.048
 something+0.046
 dad+0.046
Maybe+0.045
Neg logit contributions
ocol-0.097
 rib-0.082
 microsc-0.079
hy-0.076
ifying-0.076
Token,
Token ID11
Feature activation+4.46e-01

Pos logit contributions
Maybe+0.384
 someone+0.381
How+0.376
Mom+0.352
They+0.352
Neg logit contributions
ocol-1.068
 rib-0.946
 microsc-0.893
hy-0.866
ifying-0.845
Token Lucy
Token ID22162
Feature activation+4.48e-02

Pos logit contributions
ocol+0.943
 rib+0.818
 microsc+0.790
ifying+0.771
hy+0.765
Neg logit contributions
 someone-0.254
 everything-0.246
 dad-0.230
Maybe-0.228
 something-0.226
Token bird
Token ID6512
Feature activation-1.11e-01

Pos logit contributions
 everything+0.620
 someone+0.611
They+0.606
 something+0.582
Maybe+0.579
Neg logit contributions
ocol-1.287
 rib-1.160
 microsc-1.056
hy-1.052
 spe-1.018
Token was
Token ID373
Feature activation-4.28e-01

Pos logit contributions
 someone+0.096
 everything+0.092
 dad+0.090
 something+0.087
Maybe+0.085
Neg logit contributions
ocol-0.196
 rib-0.173
 microsc-0.160
hy-0.155
 former-0.155
Token happy
Token ID3772
Feature activation-9.09e-01

Pos logit contributions
 someone+0.283
Maybe+0.275
 everything+0.275
How+0.255
 dad+0.252
Neg logit contributions
ocol-0.835
 rib-0.722
rou-0.688
 microsc-0.686
hy-0.682
Token and
Token ID290
Feature activation-3.36e-01

Pos logit contributions
 someone+0.790
Maybe+0.774
Mom+0.772
 everything+0.747
How+0.739
Neg logit contributions
ocol-1.516
 rib-1.345
 microsc-1.296
rou-1.238
onna-1.231
Token flew
Token ID13112
Feature activation+2.94e-01

Pos logit contributions
 someone+0.305
 everything+0.295
 something+0.283
Maybe+0.283
 dad+0.280
Neg logit contributions
ocol-0.560
 rib-0.503
 microsc-0.466
hy-0.459
rou-0.449
Token close
Token ID1969
Feature activation+1.30e+00

Pos logit contributions
ocol+0.565
 rib+0.489
 microsc+0.464
ifying+0.455
hy+0.453
Neg logit contributions
 someone-0.225
 everything-0.216
 dad-0.212
Maybe-0.206
 something-0.203
Token to
Token ID284
Feature activation-3.23e-03

Pos logit contributions
ocol+2.332
 rib+1.910
 microsc+1.860
ifying+1.788
 former+1.783
Neg logit contributions
 someone-1.288
 everything-1.286
 one-1.240
 dad-1.222
 something-1.221
Token Lily
Token ID20037
Feature activation+1.62e-01

Pos logit contributions
 someone+0.002
 everything+0.002
Maybe+0.002
 dad+0.002
They+0.002
Neg logit contributions
ocol-0.006
 rib-0.005
 microsc-0.005
hy-0.005
 former-0.005
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
ocol+0.327
 rib+0.284
 microsc+0.274
hy+0.267
ifying+0.266
Neg logit contributions
 someone-0.106
 everything-0.103
Maybe-0.098
 dad-0.095
 something-0.094
Token They
Token ID1119
Feature activation+2.12e-01

Pos logit contributions
ocol+0.387
 rib+0.334
 microsc+0.321
hy+0.311
ifying+0.310
Neg logit contributions
 someone-0.144
 everything-0.140
 dad-0.135
 something-0.134
Maybe-0.132
Token both
Token ID1111
Feature activation+2.78e-01

Pos logit contributions
ocol+0.399
 rib+0.348
 microsc+0.331
ifying+0.320
hy+0.320
Neg logit contributions
 someone-0.170
 everything-0.161
 something-0.155
 dad-0.152
Maybe-0.151
Token Anna
Token ID11735
Feature activation+3.82e-01

Pos logit contributions
ocol+0.395
 rib+0.339
 microsc+0.323
ifying+0.316
hy+0.312
Neg logit contributions
 someone-0.146
 everything-0.143
Maybe-0.137
 something-0.133
 dad-0.132
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
ocol+0.803
 rib+0.695
 microsc+0.671
ifying+0.656
hy+0.654
Neg logit contributions
 someone-0.226
 everything-0.220
 dad-0.201
Maybe-0.201
 something-0.201
Token I
Token ID314
Feature activation+1.60e+00

Pos logit contributions
ocol+2.932
 rib+2.539
 microsc+2.445
 former+2.393
rou+2.363
Neg logit contributions
 someone-0.828
 everything-0.758
 dad-0.740
 something-0.713
Maybe-0.701
Token was
Token ID373
Feature activation+8.68e-01

Pos logit contributions
ocol+3.296
 rib+2.743
 microsc+2.637
ifying+2.596
hy+2.521
Neg logit contributions
 someone-1.277
 looked-1.259
Maybe-1.251
 everything-1.249
 something-1.205
Token curious
Token ID11040
Feature activation-1.45e-01

Pos logit contributions
ocol+1.743
 rib+1.492
 microsc+1.440
ifying+1.407
hy+1.387
Neg logit contributions
 someone-0.644
 everything-0.624
 something-0.593
 dad-0.587
 one-0.579
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
 someone+0.089
 everything+0.087
Maybe+0.084
 dad+0.082
How+0.080
Neg logit contributions
ocol-0.291
 rib-0.255
 microsc-0.243
hy-0.239
ifying-0.236
Token I
Token ID314
Feature activation+1.58e+00

Pos logit contributions
ocol+3.169
 rib+2.704
 microsc+2.616
 former+2.553
ifying+2.525
Neg logit contributions
 someone-1.053
 dad-0.938
 everything-0.936
Maybe-0.922
 something-0.914
Token won
Token ID1839
Feature activation+1.60e-01

Pos logit contributions
ocol+3.152
 rib+2.613
 microsc+2.530
ifying+2.480
hy+2.400
Neg logit contributions
 looked-1.281
 someone-1.280
Maybe-1.257
 everything-1.256
 something-1.207
Token't
Token ID470
Feature activation+2.27e-01

Pos logit contributions
ocol+0.271
 rib+0.227
 microsc+0.217
ifying+0.209
hy+0.208
Neg logit contributions
 someone-0.154
 everything-0.153
 dad-0.149
Maybe-0.148
 looked-0.145
Token do
Token ID466
Feature activation-5.85e-01

Pos logit contributions
ocol+0.410
 rib+0.349
 microsc+0.333
hy+0.326
ifying+0.321
Neg logit contributions
 someone-0.197
 everything-0.194
 dad-0.186
 something-0.184
Maybe-0.182
Token it
Token ID340
Feature activation+1.02e-01

Pos logit contributions
Maybe+0.327
 looked+0.293
Mom+0.285
They+0.282
 someone+0.281
Neg logit contributions
ocol-1.230
 rib-1.074
 microsc-1.033
hy-0.998
ifying-0.989
Token The
Token ID383
Feature activation-2.05e-01

Pos logit contributions
ocol+0.072
 rib+0.062
 microsc+0.060
ifying+0.058
hy+0.058
Neg logit contributions
 someone-0.026
 everything-0.025
 dad-0.024
 something-0.024
Maybe-0.023
Token bird
Token ID6512
Feature activation+2.30e-01

Pos logit contributions
 everything+0.156
 someone+0.154
 something+0.147
Maybe+0.145
 dad+0.144
Neg logit contributions
ocol-0.380
 rib-0.337
 microsc-0.311
hy-0.310
ifying-0.300
Token replied
Token ID8712
Feature activation+7.55e-01

Pos logit contributions
ocol+0.446
 rib+0.389
 microsc+0.369
hy+0.359
ifying+0.357
Neg logit contributions
 someone-0.169
 everything-0.162
 dad-0.158
 something-0.154
Maybe-0.150
Token with
Token ID351
Feature activation-2.68e-01

Pos logit contributions
ocol+1.615
 rib+1.392
 microsc+1.351
ifying+1.322
hy+1.306
Neg logit contributions
 someone-0.445
 everything-0.434
 dad-0.399
 something-0.399
 one-0.387
Token a
Token ID257
Feature activation+3.64e-02

Pos logit contributions
 someone+0.121
 everything+0.115
Maybe+0.108
 dad+0.107
 they+0.101
Neg logit contributions
ocol-0.593
 rib-0.527
 microsc-0.497
hy-0.493
ifying-0.489
Token tweet
Token ID6126
Feature activation+1.27e+00

Pos logit contributions
ocol+0.070
 rib+0.061
 microsc+0.058
hy+0.057
ifying+0.056
Neg logit contributions
 someone-0.027
 everything-0.026
Maybe-0.024
 dad-0.024
 something-0.024
Token and
Token ID290
Feature activation-5.75e-02

Pos logit contributions
ocol+2.920
 rib+2.499
 microsc+2.455
ifying+2.391
 former+2.390
Neg logit contributions
 someone-0.602
 everything-0.584
 something-0.529
 one-0.528
 looked-0.507
Token flew
Token ID13112
Feature activation+8.64e-01

Pos logit contributions
 someone+0.041
 everything+0.041
 dad+0.040
Maybe+0.039
 something+0.038
Neg logit contributions
ocol-0.108
 rib-0.097
 microsc-0.091
hy-0.087
 former-0.086
Token away
Token ID1497
Feature activation+9.02e-01

Pos logit contributions
ocol+1.781
 rib+1.507
 microsc+1.469
ifying+1.427
hy+1.409
Neg logit contributions
 someone-0.609
 everything-0.602
 something-0.559
 one-0.552
 dad-0.550
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
ocol+2.013
 rib+1.733
 microsc+1.687
ifying+1.649
hy+1.627
Neg logit contributions
 someone-0.477
 everything-0.463
 something-0.425
 one-0.416
 dad-0.410
Token
Token ID220
Feature activation-7.82e-02

Pos logit contributions
ocol+0.229
 rib+0.197
 microsc+0.190
ifying+0.184
hy+0.183
Neg logit contributions
 someone-0.090
 everything-0.087
 dad-0.084
 something-0.083
Maybe-0.082
Token I
Token ID314
Feature activation+1.63e+00

Pos logit contributions
Maybe+0.200
 someone+0.198
 looked+0.194
 everything+0.192
How+0.182
Neg logit contributions
ocol-0.609
 rib-0.531
 microsc-0.508
hy-0.487
rou-0.485
Token was
Token ID373
Feature activation+6.17e-01

Pos logit contributions
ocol+3.427
 rib+2.823
 microsc+2.742
ifying+2.699
 former+2.616
Neg logit contributions
 someone-1.207
 looked-1.188
 everything-1.152
Maybe-1.139
 turned-1.111
Token nos
Token ID43630
Feature activation+1.21e+00

Pos logit contributions
ocol+1.242
 rib+1.062
 microsc+1.026
ifying+0.999
hy+0.991
Neg logit contributions
 someone-0.434
 everything-0.424
 dad-0.398
 something-0.394
Maybe-0.390
Tokeny
Token ID88
Feature activation+5.05e-01

Pos logit contributions
ocol+2.208
 rib+1.863
 microsc+1.783
 former+1.751
ifying+1.706
Neg logit contributions
 someone-1.113
 everything-1.099
 dad-1.048
 one-1.043
 something-1.030
Token.
Token ID13
Feature activation+9.93e-01

Pos logit contributions
ocol+1.077
 rib+0.932
 microsc+0.901
ifying+0.881
hy+0.875
Neg logit contributions
 someone-0.286
 everything-0.278
 dad-0.257
 something-0.254
Maybe-0.252
Token I
Token ID314
Feature activation+1.60e+00

Pos logit contributions
ocol+1.951
 rib+1.647
 microsc+1.590
ifying+1.540
 former+1.526
Neg logit contributions
 someone-0.781
 everything-0.743
 dad-0.722
Maybe-0.708
 something-0.707
Token love
Token ID1842
Feature activation+2.74e-02

Pos logit contributions
ocol+3.211
 rib+2.614
 microsc+2.552
ifying+2.503
hy+2.377
Neg logit contributions
 someone-1.330
 looked-1.323
Maybe-1.316
 everything-1.293
 something-1.265
Token you
Token ID345
Feature activation-2.52e-01

Pos logit contributions
ocol+0.055
 rib+0.048
 microsc+0.046
hy+0.044
ifying+0.044
Neg logit contributions
 someone-0.018
Maybe-0.018
 everything-0.018
 looked-0.017
 something-0.017
Token."
Token ID526
Feature activation-8.64e-01

Pos logit contributions
 someone+0.174
Maybe+0.167
 everything+0.166
 dad+0.157
How+0.157
Neg logit contributions
ocol-0.482
 rib-0.424
 microsc-0.406
ifying-0.402
hy-0.401
Token
Token ID198
Feature activation-3.39e-01

Pos logit contributions
 everything+0.665
 someone+0.646
 something+0.627
 somewhere+0.626
Maybe+0.607
Neg logit contributions
ocol-1.552
 rib-1.363
 microsc-1.312
hy-1.306
onna-1.282
Token
Token ID198
Feature activation-7.06e-01

Pos logit contributions
 someone+0.223
 everything+0.222
 one+0.214
 something+0.214
 looked+0.212
Neg logit contributions
ocol-0.664
 rib-0.569
 microsc-0.555
hy-0.540
onna-0.529
Token mom
Token ID1995
Feature activation-4.95e-02

Pos logit contributions
ocol+0.820
 rib+0.732
 microsc+0.711
ifying+0.690
hy+0.689
Neg logit contributions
 someone-0.088
 everything-0.082
Maybe-0.068
 something-0.068
 looked-0.059
Token.
Token ID13
Feature activation-5.68e-02

Pos logit contributions
 someone+0.032
 everything+0.030
Maybe+0.029
 something+0.028
 dad+0.027
Neg logit contributions
ocol-0.099
 rib-0.087
 microsc-0.084
ifying-0.082
hy-0.081
Token While
Token ID2893
Feature activation-2.73e-01

Pos logit contributions
 someone+0.051
 everything+0.050
Maybe+0.048
 something+0.048
 dad+0.047
Neg logit contributions
ocol-0.098
 rib-0.085
 microsc-0.081
ifying-0.080
hy-0.078
Token she
Token ID673
Feature activation+8.42e-01

Pos logit contributions
Maybe+0.173
 someone+0.170
 everything+0.164
 dad+0.156
How+0.155
Neg logit contributions
ocol-0.546
 rib-0.476
 microsc-0.456
ifying-0.445
hy-0.432
Token was
Token ID373
Feature activation+2.95e-02

Pos logit contributions
ocol+1.734
 rib+1.477
 microsc+1.430
ifying+1.396
hy+1.376
Neg logit contributions
 someone-0.568
 everything-0.556
 dad-0.517
 something-0.514
 looked-0.514
Token there
Token ID612
Feature activation+1.56e-02

Pos logit contributions
ocol+0.051
 rib+0.043
 microsc+0.041
hy+0.040
ifying+0.040
Neg logit contributions
 someone-0.027
Maybe-0.027
 everything-0.026
They-0.025
 dad-0.025
Token,
Token ID11
Feature activation-4.17e-01

Pos logit contributions
ocol+0.031
 rib+0.027
 microsc+0.026
hy+0.025
ifying+0.024
Neg logit contributions
 someone-0.010
Maybe-0.010
 everything-0.010
 dad-0.010
How-0.010
Token she
Token ID673
Feature activation-9.99e-02

Pos logit contributions
Maybe+0.281
They+0.270
 someone+0.269
 everything+0.266
 dad+0.265
Neg logit contributions
ocol-0.780
 rib-0.692
 microsc-0.661
ifying-0.649
hy-0.621
Token saw
Token ID2497
Feature activation-5.51e-01

Pos logit contributions
 someone+0.077
 everything+0.076
Maybe+0.072
 dad+0.071
They+0.071
Neg logit contributions
ocol-0.182
 rib-0.159
 microsc-0.154
hy-0.147
rou-0.146
Token a
Token ID257
Feature activation-1.85e-01

Pos logit contributions
Maybe+0.179
 everything+0.175
How+0.172
 someone+0.172
They+0.166
Neg logit contributions
ocol-1.201
 rib-1.106
 microsc-1.075
hy-1.028
ifying-1.005
Token big
Token ID1263
Feature activation-1.49e-01

Pos logit contributions
 someone+0.134
 everything+0.131
Maybe+0.125
 one+0.121
 something+0.121
Neg logit contributions
ocol-0.353
 rib-0.309
 microsc-0.298
ifying-0.288
hy-0.287
Token person
Token ID1048
Feature activation-1.90e-01

Pos logit contributions
ocol+1.387
 rib+1.160
 microsc+1.117
ifying+1.090
hy+1.077
Neg logit contributions
 someone-0.690
 everything-0.677
 dad-0.648
 something-0.641
 one-0.635
Token you
Token ID345
Feature activation-2.97e-02

Pos logit contributions
 someone+0.131
 everything+0.129
Maybe+0.126
 dad+0.124
 looked+0.119
Neg logit contributions
ocol-0.371
 rib-0.320
 microsc-0.309
ifying-0.303
hy-0.299
Token are
Token ID389
Feature activation-3.37e-01

Pos logit contributions
 someone+0.021
 everything+0.021
Maybe+0.020
 dad+0.020
 something+0.019
Neg logit contributions
ocol-0.057
 rib-0.050
 microsc-0.048
ifying-0.047
hy-0.046
Token lending
Token ID20983
Feature activation-1.25e-01

Pos logit contributions
Maybe+0.251
 everything+0.247
 someone+0.246
 looked+0.236
 dad+0.235
Neg logit contributions
ocol-0.638
 rib-0.538
 microsc-0.518
ifying-0.508
 former-0.505
Token it
Token ID340
Feature activation-8.48e-01

Pos logit contributions
 someone+0.079
 everything+0.079
Maybe+0.077
 dad+0.077
 looked+0.075
Neg logit contributions
ocol-0.249
 rib-0.216
 microsc-0.207
ifying-0.201
hy-0.198
Token to
Token ID284
Feature activation-2.42e-02

Pos logit contributions
 looked+0.729
Maybe+0.715
 dad+0.692
 someone+0.692
 everything+0.681
Neg logit contributions
ocol-1.481
 rib-1.299
 microsc-1.236
ifying-1.188
 former-1.181
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
Maybe+0.015
 everything+0.015
 someone+0.015
 looked+0.015
 dad+0.014
Neg logit contributions
ocol-0.049
 rib-0.042
 microsc-0.041
 former-0.039
ifying-0.039
Token<|endoftext|>
Token ID50256
Feature activation+5.28e-01

Pos logit contributions
ocol+0.029
 rib+0.025
 microsc+0.024
ifying+0.024
 former+0.023
Neg logit contributions
 someone-0.009
 everything-0.009
 dad-0.008
 looked-0.008
 something-0.008
TokenOnce
Token ID7454
Feature activation+9.86e-01

Pos logit contributions
ocol+1.126
 rib+0.974
 microsc+0.944
ifying+0.922
hy+0.914
Neg logit contributions
 someone-0.297
 everything-0.288
 dad-0.269
 something-0.264
Maybe-0.260
Token upon
Token ID2402
Feature activation+2.73e-01

Pos logit contributions
ocol+1.285
 rib+1.003
 microsc+0.928
ifying+0.903
 former+0.888
Neg logit contributions
 everything-1.418
 someone-1.417
 dad-1.374
 something-1.350
 one-1.345
Token a
Token ID257
Feature activation+6.14e-02

Pos logit contributions
ocol+0.651
 rib+0.579
 microsc+0.560
ifying+0.549
hy+0.548
Neg logit contributions
 someone-0.071
 everything-0.069
 dad-0.062
Maybe-0.056
 looked-0.052
Token and
Token ID290
Feature activation+2.74e-01

Pos logit contributions
 someone+0.396
 everything+0.379
How+0.374
Maybe+0.374
 dad+0.360
Neg logit contributions
ocol-1.146
 rib-1.041
hy-0.981
 microsc-0.959
ifying-0.948
Token climbed
Token ID19952
Feature activation+7.23e-01

Pos logit contributions
ocol+0.567
 rib+0.499
 microsc+0.479
hy+0.463
ifying+0.463
Neg logit contributions
 someone-0.162
 everything-0.155
 dad-0.145
Maybe-0.145
 something-0.144
Token the
Token ID262
Feature activation-1.80e-01

Pos logit contributions
ocol+1.580
 rib+1.362
 microsc+1.325
ifying+1.294
hy+1.283
Neg logit contributions
 someone-0.392
 everything-0.383
 dad-0.349
 something-0.349
 one-0.341
Token swings
Token ID26728
Feature activation+1.88e-01

Pos logit contributions
 someone+0.215
 everything+0.205
 something+0.203
 dad+0.201
Maybe+0.201
Neg logit contributions
ocol-0.267
 rib-0.229
 microsc-0.218
hy-0.210
ifying-0.209
Token one
Token ID530
Feature activation-2.40e-01

Pos logit contributions
ocol+0.389
 rib+0.342
 microsc+0.325
hy+0.322
ifying+0.321
Neg logit contributions
 someone-0.112
 everything-0.106
 dad-0.102
Maybe-0.101
 something-0.098
Token last
Token ID938
Feature activation-4.76e-02

Pos logit contributions
 someone+0.185
 everything+0.177
Maybe+0.171
 dad+0.170
 something+0.167
Neg logit contributions
ocol-0.448
 rib-0.392
ifying-0.370
hy-0.367
 microsc-0.366
Token time
Token ID640
Feature activation+5.58e-01

Pos logit contributions
 someone+0.042
 everything+0.041
 dad+0.040
Maybe+0.040
 something+0.039
Neg logit contributions
ocol-0.085
 rib-0.072
 microsc-0.068
ifying-0.068
hy-0.068
Token.
Token ID13
Feature activation-6.47e-01

Pos logit contributions
ocol+1.179
 rib+1.019
 microsc+0.985
ifying+0.961
hy+0.955
Neg logit contributions
 someone-0.330
 everything-0.320
 dad-0.297
 something-0.294
Maybe-0.292
Token Her
Token ID2332
Feature activation+2.49e-01

Pos logit contributions
 everything+0.661
 someone+0.660
 dad+0.647
 something+0.641
 somewhere+0.623
Neg logit contributions
ocol-1.008
 rib-0.854
 microsc-0.835
 former-0.801
ifying-0.789
Token Dad
Token ID17415
Feature activation+7.13e-02

Pos logit contributions
ocol+0.500
 rib+0.436
 microsc+0.419
ifying+0.406
hy+0.404
Neg logit contributions
 someone-0.168
 everything-0.163
Maybe-0.155
 something-0.151
 dad-0.147
Token said
Token ID531
Feature activation+6.64e-01

Pos logit contributions
ocol+0.129
 rib+0.112
 microsc+0.106
hy+0.104
ifying+0.103
Neg logit contributions
 someone-0.061
 everything-0.059
 something-0.057
 dad-0.056
Maybe-0.055
Token."
Token ID526
Feature activation-6.84e-01

Pos logit contributions
ocol+0.679
 rib+0.576
 microsc+0.551
ifying+0.538
 former+0.529
Neg logit contributions
 someone-0.295
 everything-0.289
 dad-0.277
Maybe-0.275
 something-0.269
Token With
Token ID2080
Feature activation-6.34e-02

Pos logit contributions
 someone+0.612
 everything+0.597
 something+0.567
 dad+0.567
Maybe+0.560
Neg logit contributions
ocol-1.190
 rib-1.008
 microsc-0.990
onna-0.936
ifying-0.935
Token her
Token ID607
Feature activation+7.25e-01

Pos logit contributions
 someone+0.029
 everything+0.028
Maybe+0.026
How+0.024
They+0.023
Neg logit contributions
ocol-0.142
 rib-0.125
 microsc-0.119
ifying-0.116
hy-0.116
Token help
Token ID1037
Feature activation+7.85e-01

Pos logit contributions
ocol+1.446
 rib+1.230
 microsc+1.189
ifying+1.159
hy+1.149
Neg logit contributions
 someone-0.541
 everything-0.527
 dad-0.502
 something-0.495
 one-0.487
Token,
Token ID11
Feature activation-4.72e-01

Pos logit contributions
ocol+1.650
 rib+1.418
 microsc+1.377
ifying+1.345
 former+1.324
Neg logit contributions
 someone-0.486
 everything-0.472
 dad-0.440
 something-0.435
 one-0.427
Token her
Token ID607
Feature activation+5.77e-02

Pos logit contributions
 someone+0.506
 everything+0.486
Maybe+0.480
How+0.473
 dad+0.472
Neg logit contributions
ocol-0.734
 rib-0.611
 microsc-0.578
ifying-0.554
onna-0.530
Token little
Token ID1310
Feature activation+3.18e-01

Pos logit contributions
ocol+0.107
 rib+0.091
 microsc+0.088
ifying+0.084
hy+0.084
Neg logit contributions
 someone-0.048
 everything-0.048
Maybe-0.046
They-0.044
How-0.044
Token brother
Token ID3956
Feature activation-6.55e-01

Pos logit contributions
ocol+0.595
 rib+0.507
 microsc+0.484
ifying+0.467
hy+0.466
Neg logit contributions
 someone-0.262
 everything-0.259
Maybe-0.245
 something-0.240
 dad-0.240
Token soon
Token ID2582
Feature activation-5.50e-01

Pos logit contributions
 someone+0.577
 everything+0.567
How+0.556
Maybe+0.534
 something+0.533
Neg logit contributions
ocol-1.111
 rib-1.010
hy-0.939
 microsc-0.922
rou-0.891
Token stopped
Token ID5025
Feature activation+4.48e-01

Pos logit contributions
 someone+0.424
How+0.409
Maybe+0.406
 everything+0.404
Mom+0.393
Neg logit contributions
ocol-1.008
 rib-0.891
 microsc-0.841
hy-0.813
rou-0.797
Token crying
Token ID13774
Feature activation+4.41e-01

Pos logit contributions
ocol+0.839
 rib+0.708
 microsc+0.681
ifying+0.659
hy+0.652
Neg logit contributions
 someone-0.379
 everything-0.370
 dad-0.351
Maybe-0.351
 something-0.348
Token the
Token ID262
Feature activation+1.95e-02

Pos logit contributions
 someone+0.038
 everything+0.037
Maybe+0.037
 dad+0.036
 looked+0.034
Neg logit contributions
ocol-0.095
 rib-0.081
 microsc-0.078
ifying-0.076
hy-0.073
Token sky
Token ID6766
Feature activation+1.96e-01

Pos logit contributions
ocol+0.038
 rib+0.033
 microsc+0.032
ifying+0.032
hy+0.031
Neg logit contributions
 someone-0.014
 everything-0.014
 dad-0.013
Maybe-0.013
 they-0.013
Token,
Token ID11
Feature activation+4.43e-02

Pos logit contributions
ocol+0.399
 rib+0.345
 microsc+0.331
ifying+0.328
hy+0.323
Neg logit contributions
 someone-0.126
 everything-0.122
 dad-0.116
Maybe-0.115
 something-0.110
Token and
Token ID290
Feature activation-6.75e-01

Pos logit contributions
ocol+0.085
 rib+0.074
 microsc+0.070
ifying+0.069
hy+0.068
Neg logit contributions
 someone-0.033
 everything-0.032
 dad-0.031
Maybe-0.030
 something-0.029
Token the
Token ID262
Feature activation+2.37e-04

Pos logit contributions
Maybe+0.352
 dad+0.351
 someone+0.349
They+0.345
Mom+0.345
Neg logit contributions
ocol-1.360
 rib-1.235
 microsc-1.175
ifying-1.138
 former-1.106
Token rad
Token ID2511
Feature activation+1.63e-01

Pos logit contributions
ocol+0.000
 rib+0.000
 microsc+0.000
ifying+0.000
 former+0.000
Neg logit contributions
 someone-0.000
 everything-0.000
Maybe-0.000
 dad-0.000
 something-0.000
Tokenish
Token ID680
Feature activation-2.34e-01

Pos logit contributions
ocol+0.254
 rib+0.206
 microsc+0.195
ifying+0.190
rou+0.183
Neg logit contributions
 someone-0.193
 everything-0.187
 dad-0.183
 something-0.181
Maybe-0.179
Token floated
Token ID29742
Feature activation+4.67e-01

Pos logit contributions
 someone+0.191
 everything+0.187
 dad+0.180
Maybe+0.178
 something+0.175
Neg logit contributions
ocol-0.427
 rib-0.370
 microsc-0.352
ifying-0.344
hy-0.338
Token up
Token ID510
Feature activation+7.88e-01

Pos logit contributions
ocol+0.948
 rib+0.818
 microsc+0.787
ifying+0.772
hy+0.762
Neg logit contributions
 someone-0.314
 everything-0.303
 dad-0.288
Maybe-0.281
 something-0.281
Token and
Token ID290
Feature activation-2.74e-01

Pos logit contributions
ocol+1.545
 rib+1.306
 microsc+1.265
ifying+1.226
hy+1.220
Neg logit contributions
 someone-0.609
 everything-0.599
 something-0.564
 dad-0.560
 one-0.556
Token back
Token ID736
Feature activation+2.11e-01

Pos logit contributions
 someone+0.156
 dad+0.146
 everything+0.144
Maybe+0.142
They+0.138
Neg logit contributions
ocol-0.560
 rib-0.504
 microsc-0.477
ifying-0.470
 former-0.457
Token loved
Token ID6151
Feature activation+8.63e-01

Pos logit contributions
ocol+0.450
 rib+0.387
 microsc+0.371
hy+0.363
ifying+0.357
Neg logit contributions
 someone-0.207
 everything-0.202
 something-0.193
 dad-0.193
Maybe-0.189
Token to
Token ID284
Feature activation-5.28e-01

Pos logit contributions
ocol+1.430
 rib+1.162
 microsc+1.114
ifying+1.099
hy+1.077
Neg logit contributions
 someone-0.930
 everything-0.927
 dad-0.890
 something-0.877
 one-0.868
Token dress
Token ID6576
Feature activation+1.47e-01

Pos logit contributions
 someone+0.479
Maybe+0.475
 everything+0.468
How+0.446
 something+0.440
Neg logit contributions
ocol-0.924
 rib-0.813
 microsc-0.757
hy-0.739
 topp-0.710
Token up
Token ID510
Feature activation+7.06e-01

Pos logit contributions
ocol+0.305
 rib+0.267
 microsc+0.255
ifying+0.246
hy+0.245
Neg logit contributions
 someone-0.088
Maybe-0.084
 everything-0.082
 dad-0.080
 looked-0.077
Token in
Token ID287
Feature activation-3.32e-01

Pos logit contributions
ocol+1.454
 rib+1.246
 microsc+1.205
ifying+1.175
hy+1.166
Neg logit contributions
 someone-0.470
 everything-0.461
 dad-0.429
 something-0.426
 one-0.417
Token her
Token ID607
Feature activation-4.32e-01

Pos logit contributions
Maybe+0.240
 someone+0.234
 everything+0.220
 looked+0.216
 dad+0.213
Neg logit contributions
ocol-0.643
 rib-0.566
 microsc-0.539
ifying-0.508
hy-0.504
Token princess
Token ID21752
Feature activation+1.37e-01

Pos logit contributions
 someone+0.345
Maybe+0.341
 everything+0.328
They+0.319
How+0.315
Neg logit contributions
ocol-0.792
 rib-0.703
 microsc-0.661
ifying-0.640
hy-0.627
Token costume
Token ID16569
Feature activation-2.82e-01

Pos logit contributions
ocol+0.232
 rib+0.200
 microsc+0.188
hy+0.180
ifying+0.180
Neg logit contributions
 someone-0.134
 everything-0.130
Maybe-0.127
 something-0.124
 dad-0.122
Token and
Token ID290
Feature activation+4.14e-01

Pos logit contributions
Maybe+0.189
 someone+0.187
 everything+0.184
How+0.177
 dad+0.175
Neg logit contributions
ocol-0.538
 rib-0.479
 microsc-0.454
hy-0.450
 former-0.444
Token dance
Token ID9280
Feature activation+5.14e-01

Pos logit contributions
ocol+0.785
 rib+0.676
 microsc+0.645
ifying+0.624
hy+0.622
Neg logit contributions
 someone-0.329
 everything-0.320
Maybe-0.303
 something-0.303
 dad-0.302
Token around
Token ID1088
Feature activation+1.13e+00

Pos logit contributions
ocol+1.036
 rib+0.892
 microsc+0.857
ifying+0.835
hy+0.829
Neg logit contributions
 someone-0.359
 everything-0.348
 dad-0.329
Maybe-0.325
 something-0.324
Token has
Token ID468
Feature activation+4.57e-01

Pos logit contributions
 someone+0.088
 everything+0.086
 dad+0.086
 something+0.082
Maybe+0.081
Neg logit contributions
ocol-0.159
 rib-0.138
 microsc-0.130
hy-0.125
ifying-0.125
Token floppy
Token ID49415
Feature activation+7.84e-01

Pos logit contributions
ocol+0.987
 rib+0.862
 microsc+0.832
ifying+0.811
hy+0.808
Neg logit contributions
 someone-0.240
 everything-0.235
 dad-0.219
Maybe-0.215
 something-0.211
Token ears
Token ID11368
Feature activation+4.24e-01

Pos logit contributions
ocol+1.334
 rib+1.096
 microsc+1.056
ifying+1.025
hy+1.013
Neg logit contributions
 someone-0.800
 everything-0.786
 dad-0.756
 something-0.753
 one-0.745
Token."
Token ID526
Feature activation-7.74e-01

Pos logit contributions
ocol+0.906
 rib+0.786
 microsc+0.759
ifying+0.745
hy+0.738
Neg logit contributions
 someone-0.238
 everything-0.230
 dad-0.215
Maybe-0.212
 something-0.209
Token
Token ID198
Feature activation-3.47e-01

Pos logit contributions
 everything+0.563
 someone+0.549
 looked+0.532
 something+0.510
 dad+0.504
Neg logit contributions
ocol-1.438
 rib-1.264
 microsc-1.179
onna-1.166
hy-1.154
Token
Token ID198
Feature activation-2.64e-01

Pos logit contributions
 someone+0.230
 everything+0.224
 looked+0.216
 one+0.216
 something+0.215
Neg logit contributions
ocol-0.677
 rib-0.588
 microsc-0.561
hy-0.548
onna-0.542
TokenS
Token ID50
Feature activation+8.24e-01

Pos logit contributions
 someone+0.187
 everything+0.183
 looked+0.171
 something+0.171
 one+0.169
Neg logit contributions
ocol-0.513
 rib-0.452
 microsc-0.422
hy-0.420
ifying-0.415
Tokenara
Token ID3301
Feature activation-5.43e-01

Pos logit contributions
ocol+1.425
 rib+1.186
 microsc+1.145
ifying+1.104
hy+1.090
Neg logit contributions
 someone-0.817
 everything-0.808
 dad-0.770
 something-0.769
Maybe-0.760
Token's
Token ID338
Feature activation-9.70e-01

Pos logit contributions
 someone+0.568
 everything+0.553
 something+0.524
 somewhere+0.518
 dad+0.499
Neg logit contributions
ocol-0.829
 rib-0.765
 microsc-0.699
rou-0.678
hy-0.677
Token mom
Token ID1995
Feature activation-5.74e-01

Pos logit contributions
Maybe+0.500
How+0.497
They+0.496
 someone+0.487
 everything+0.484
Neg logit contributions
ocol-2.039
 rib-1.887
hy-1.766
 microsc-1.759
rou-1.704
Token nodded
Token ID14464
Feature activation+1.51e-02

Pos logit contributions
 someone+0.633
 everything+0.607
 something+0.588
 somewhere+0.574
 dad+0.568
Neg logit contributions
ocol-0.876
 rib-0.785
 microsc-0.709
rou-0.708
hy-0.705
Token place
Token ID1295
Feature activation+3.37e-01

Pos logit contributions
 someone+0.392
 everything+0.386
Maybe+0.380
 dad+0.363
 looked+0.363
Neg logit contributions
ocol-1.113
 rib-0.943
ifying-0.920
 former-0.897
 microsc-0.894
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
ocol+0.652
 rib+0.558
 microsc+0.536
ifying+0.525
hy+0.519
Neg logit contributions
 someone-0.251
 everything-0.244
 dad-0.233
Maybe-0.231
 something-0.227
Token But
Token ID887
Feature activation+5.53e-01

Pos logit contributions
ocol+0.194
 rib+0.164
 microsc+0.158
ifying+0.154
hy+0.153
Neg logit contributions
 someone-0.083
 everything-0.082
 something-0.077
 dad-0.077
Maybe-0.076
Token to
Token ID284
Feature activation+2.44e-02

Pos logit contributions
ocol+1.181
 rib+1.021
 microsc+0.987
ifying+0.964
hy+0.955
Neg logit contributions
 someone-0.316
 everything-0.307
 dad-0.284
 something-0.280
Maybe-0.279
Token do
Token ID466
Feature activation-4.51e-01

Pos logit contributions
ocol+0.050
 rib+0.043
 microsc+0.041
 former+0.040
hy+0.040
Neg logit contributions
 someone-0.015
 everything-0.015
Maybe-0.014
 looked-0.014
 dad-0.014
Token that
Token ID326
Feature activation-3.47e-01

Pos logit contributions
Maybe+0.226
 someone+0.218
 dad+0.216
Mom+0.208
 everything+0.204
Neg logit contributions
ocol-0.965
 rib-0.834
 microsc-0.817
ifying-0.782
 former-0.781
Token,
Token ID11
Feature activation+1.29e-01

Pos logit contributions
 someone+0.234
 everything+0.230
Maybe+0.228
 dad+0.222
Mom+0.217
Neg logit contributions
ocol-0.671
 rib-0.581
 microsc-0.560
 former-0.550
ifying-0.541
Token the
Token ID262
Feature activation-9.62e-01

Pos logit contributions
ocol+0.263
 rib+0.228
 microsc+0.219
ifying+0.214
 former+0.211
Neg logit contributions
 someone-0.078
 everything-0.076
Maybe-0.073
 dad-0.073
 looked-0.070
Token boat
Token ID8848
Feature activation+3.43e-01

Pos logit contributions
 everything+0.704
 someone+0.702
Maybe+0.688
How+0.676
They+0.665
Neg logit contributions
ocol-1.799
 rib-1.609
ifying-1.463
 microsc-1.462
 gol-1.458
Token needed
Token ID2622
Feature activation+4.87e-01

Pos logit contributions
ocol+0.624
 rib+0.530
 microsc+0.506
ifying+0.495
hy+0.492
Neg logit contributions
 someone-0.293
 everything-0.288
 dad-0.276
 something-0.272
Maybe-0.270
Token to
Token ID284
Feature activation-2.14e-01

Pos logit contributions
ocol+0.895
 rib+0.757
 microsc+0.725
ifying+0.702
hy+0.697
Neg logit contributions
 someone-0.417
 everything-0.412
 dad-0.394
Maybe-0.389
 something-0.386
Token to
Token ID284
Feature activation-1.51e-01

Pos logit contributions
 someone+0.918
Maybe+0.896
How+0.883
 dad+0.881
 looked+0.877
Neg logit contributions
ocol-0.877
 rib-0.761
 microsc-0.713
rou-0.672
onna-0.663
Token mom
Token ID1995
Feature activation-6.35e-01

Pos logit contributions
 someone+0.090
 everything+0.087
Maybe+0.087
 looked+0.085
 something+0.084
Neg logit contributions
ocol-0.311
 rib-0.273
 microsc-0.255
ifying-0.248
 former-0.248
Token and
Token ID290
Feature activation+5.07e-01

Pos logit contributions
Maybe+0.466
 someone+0.464
How+0.451
Turn+0.420
 everything+0.417
Neg logit contributions
ocol-1.186
 rib-1.037
 microsc-0.995
hy-0.968
rou-0.958
Token hug
Token ID16225
Feature activation+1.20e-01

Pos logit contributions
ocol+1.002
 rib+0.860
 microsc+0.829
ifying+0.803
hy+0.800
Neg logit contributions
 someone-0.368
 everything-0.359
 something-0.336
Maybe-0.333
 dad-0.333
Token her
Token ID607
Feature activation-2.50e-01

Pos logit contributions
ocol+0.250
 rib+0.218
 microsc+0.208
ifying+0.203
hy+0.199
Neg logit contributions
 someone-0.070
 everything-0.067
Maybe-0.064
 looked-0.064
 something-0.063
Token.
Token ID13
Feature activation-1.00e-01

Pos logit contributions
 someone+0.159
Maybe+0.147
 everything+0.146
 looked+0.141
 something+0.137
Neg logit contributions
ocol-0.498
 rib-0.447
 microsc-0.423
ifying-0.412
hy-0.409
Token
Token ID198
Feature activation-1.85e-01

Pos logit contributions
 everything+0.063
 someone+0.063
 something+0.059
 looked+0.058
 one+0.057
Neg logit contributions
ocol-0.198
 rib-0.174
 microsc-0.168
ifying-0.165
hy-0.163
Token
Token ID198
Feature activation-5.24e-01

Pos logit contributions
 someone+0.120
 everything+0.117
 something+0.112
 looked+0.112
 one+0.112
Neg logit contributions
ocol-0.367
 rib-0.316
 microsc-0.309
hy-0.298
ifying-0.294
TokenMom
Token ID29252
Feature activation-9.37e-01

Pos logit contributions
 someone+0.285
 everything+0.277
 looked+0.263
 something+0.261
 one+0.250
Neg logit contributions
ocol-1.098
 rib-0.962
 microsc-0.928
ifying-0.922
hy-0.902
Token forg
Token ID5784
Feature activation+5.71e-01

Pos logit contributions
 someone+1.154
 something+1.054
 everything+1.044
 somewhere+1.038
Maybe+1.022
Neg logit contributions
ocol-1.371
 rib-1.192
rou-1.113
 microsc-1.078
 topp-1.050
Tokenives
Token ID1083
Feature activation-5.52e-01

Pos logit contributions
ocol+0.627
 rib+0.462
 microsc+0.427
ifying+0.403
hy+0.397
Neg logit contributions
 someone-0.926
 everything-0.916
 dad-0.894
 something-0.890
Maybe-0.884
Token.
Token ID13
Feature activation-8.66e-02

Pos logit contributions
ocol+1.010
 rib+0.867
 microsc+0.837
ifying+0.818
hy+0.811
Neg logit contributions
 someone-0.337
 everything-0.329
 dad-0.308
Maybe-0.305
 something-0.305
Token<|endoftext|>
Token ID50256
Feature activation+2.02e-01

Pos logit contributions
 everything+0.072
 someone+0.071
 something+0.068
 dad+0.068
 looked+0.066
Neg logit contributions
ocol-0.159
 rib-0.133
 microsc-0.130
ifying-0.126
hy-0.125
TokenOne
Token ID3198
Feature activation-3.70e-01

Pos logit contributions
ocol+0.416
 rib+0.359
 microsc+0.349
ifying+0.341
hy+0.338
Neg logit contributions
 someone-0.121
 everything-0.117
 dad-0.114
 something-0.109
Maybe-0.108
Token day
Token ID1110
Feature activation+1.68e-01

Pos logit contributions
 someone+0.260
Maybe+0.258
 everything+0.245
How+0.240
 dad+0.239
Neg logit contributions
ocol-0.715
 rib-0.607
hy-0.587
 microsc-0.586
ifying-0.566
Token,
Token ID11
Feature activation-4.00e-01

Pos logit contributions
ocol+0.340
 rib+0.290
 microsc+0.281
ifying+0.275
hy+0.272
Neg logit contributions
 someone-0.110
 everything-0.106
Maybe-0.105
 dad-0.100
 something-0.098
Token Jack
Token ID3619
Feature activation-1.46e-01

Pos logit contributions
 someone+0.140
 everything+0.129
Maybe+0.122
 dad+0.121
How+0.108
Neg logit contributions
ocol-0.900
 rib-0.797
ifying-0.774
 microsc-0.763
hy-0.756
Token and
Token ID290
Feature activation-4.38e-01

Pos logit contributions
 someone+0.099
 everything+0.095
 something+0.089
Maybe+0.089
 dad+0.089
Neg logit contributions
ocol-0.288
 rib-0.247
 microsc-0.240
hy-0.232
ifying-0.230
Token Jill
Token ID22772
Feature activation-2.28e-01

Pos logit contributions
 someone+0.390
Maybe+0.386
 everything+0.381
How+0.375
 something+0.372
Neg logit contributions
ocol-0.734
 rib-0.631
 microsc-0.614
hy-0.589
ifying-0.565
Token went
Token ID1816
Feature activation+2.18e-02

Pos logit contributions
 someone+0.218
 everything+0.207
 something+0.203
How+0.202
 dad+0.201
Neg logit contributions
ocol-0.388
 rib-0.330
 microsc-0.309
hy-0.308
ifying-0.302
Token out
Token ID503
Feature activation+9.91e-01

Pos logit contributions
ocol+0.033
 rib+0.028
 microsc+0.026
hy+0.025
ifying+0.025
Neg logit contributions
 someone-0.025
Maybe-0.024
 everything-0.024
 dad-0.024
 something-0.023
Token to
Token ID284
Feature activation+5.13e-01

Pos logit contributions
ocol+1.696
 rib+1.399
 microsc+1.344
ifying+1.314
hy+1.298
Neg logit contributions
 someone-1.020
 everything-1.005
 dad-0.966
 something-0.962
 one-0.957
Token the
Token ID262
Feature activation-1.64e-01

Pos logit contributions
Maybe+0.331
How+0.319
They+0.313
 someone+0.312
 everything+0.310
Neg logit contributions
ocol-0.808
 rib-0.692
 microsc-0.658
hy-0.614
 topp-0.612
Token tank
Token ID6873
Feature activation-9.47e-01

Pos logit contributions
 someone+0.136
 everything+0.134
Maybe+0.132
 looked+0.126
 something+0.126
Neg logit contributions
ocol-0.300
 rib-0.256
 microsc-0.243
ifying-0.238
 topp-0.236
Token.
Token ID13
Feature activation-3.49e-02

Pos logit contributions
 someone+0.703
How+0.690
 everything+0.686
Maybe+0.667
Mom+0.651
Neg logit contributions
ocol-1.693
 rib-1.512
 microsc-1.458
hy-1.450
ifying-1.424
Token They
Token ID1119
Feature activation+2.81e-01

Pos logit contributions
 everything+0.025
 someone+0.025
 dad+0.024
 something+0.023
 one+0.023
Neg logit contributions
ocol-0.067
 rib-0.058
 microsc-0.056
hy-0.055
ifying-0.054
Token do
Token ID466
Feature activation-9.08e-01

Pos logit contributions
ocol+0.606
 rib+0.532
 microsc+0.514
hy+0.499
ifying+0.499
Neg logit contributions
 someone-0.152
 everything-0.144
 something-0.132
 dad-0.129
Maybe-0.128
Token not
Token ID407
Feature activation-1.77e+00

Pos logit contributions
Maybe+0.347
 someone+0.343
Mom+0.335
 dad+0.315
 looked+0.292
Neg logit contributions
ocol-1.960
 rib-1.760
 microsc-1.711
rou-1.691
 topp-1.661
Token see
Token ID766
Feature activation-1.82e+00

Pos logit contributions
How+1.423
 everything+1.414
Maybe+1.373
 someone+1.351
ths+1.348
Neg logit contributions
ocol-3.097
hy-2.697
 rib-2.667
onna-2.606
 microsc-2.577
Token the
Token ID262
Feature activation+1.65e-02

Pos logit contributions
They+1.845
Maybe+1.844
How+1.788
Turn+1.785
Nothing+1.781
Neg logit contributions
ocol-3.023
 rib-2.502
 microsc-2.383
hy-2.298
 spe-2.188
Token tank
Token ID6873
Feature activation-7.58e-01

Pos logit contributions
ocol+0.031
 rib+0.027
 microsc+0.026
ifying+0.025
hy+0.025
Neg logit contributions
 someone-0.013
 everything-0.013
Maybe-0.012
 dad-0.012
 something-0.012
Token.
Token ID13
Feature activation-1.68e-01

Pos logit contributions
 someone+0.558
Maybe+0.554
 dad+0.537
 everything+0.534
Mom+0.522
Neg logit contributions
ocol-1.406
 rib-1.191
ifying-1.166
 microsc-1.159
hy-1.153
Token They
Token ID1119
Feature activation+2.08e-01

Pos logit contributions
 everything+0.126
 someone+0.124
 dad+0.120
 one+0.116
 something+0.116
Neg logit contributions
ocol-0.317
 rib-0.272
 microsc-0.265
hy-0.258
ifying-0.256
Token One
Token ID1881
Feature activation-2.47e-01

Pos logit contributions
ocol+0.825
 rib+0.708
 microsc+0.685
ifying+0.664
hy+0.662
Neg logit contributions
 someone-0.301
 everything-0.293
 dad-0.280
 something-0.276
Maybe-0.272
Token day
Token ID1110
Feature activation+2.43e-01

Pos logit contributions
 someone+0.166
 everything+0.165
Maybe+0.157
 something+0.152
How+0.152
Neg logit contributions
ocol-0.475
 rib-0.407
 microsc-0.399
hy-0.396
ifying-0.387
Token,
Token ID11
Feature activation-3.40e-01

Pos logit contributions
ocol+0.494
 rib+0.423
 microsc+0.409
ifying+0.399
hy+0.397
Neg logit contributions
 someone-0.159
 everything-0.152
Maybe-0.148
 dad-0.146
 something-0.141
Token Lily
Token ID20037
Feature activation-1.65e-01

Pos logit contributions
 someone+0.170
Maybe+0.165
 everything+0.164
 dad+0.156
They+0.150
Neg logit contributions
ocol-0.708
 rib-0.634
 microsc-0.598
hy-0.583
ifying-0.583
Token didn
Token ID1422
Feature activation-1.10e-01

Pos logit contributions
 someone+0.094
 everything+0.092
 something+0.084
 dad+0.084
Maybe+0.080
Neg logit contributions
ocol-0.333
 rib-0.295
 microsc-0.287
hy-0.277
ifying-0.274
Token't
Token ID470
Feature activation-1.55e+00

Pos logit contributions
 everything+0.113
 someone+0.112
 dad+0.110
Maybe+0.107
 looked+0.106
Neg logit contributions
ocol-0.178
 rib-0.148
 microsc-0.142
ifying-0.140
hy-0.139
Token feel
Token ID1254
Feature activation-6.68e-01

Pos logit contributions
Maybe+1.647
 dad+1.637
 someone+1.628
How+1.626
 everything+1.612
Neg logit contributions
ocol-2.359
hy-1.934
 rib-1.906
 microsc-1.788
ifying-1.777
Token good
Token ID922
Feature activation+1.77e+00

Pos logit contributions
Maybe+0.499
Mom+0.483
They+0.483
 someone+0.466
 dad+0.451
Neg logit contributions
ocol-1.283
 rib-1.084
 microsc-1.045
rou-1.007
hy-1.000
Token.
Token ID13
Feature activation-3.46e-01

Pos logit contributions
ocol+3.962
 rib+3.399
 microsc+3.257
ifying+3.255
hy+3.146
Neg logit contributions
 someone-1.075
 everything-1.063
 one-1.009
 something-1.001
 dad-0.972
Token She
Token ID1375
Feature activation+3.62e-01

Pos logit contributions
 someone+0.290
 everything+0.286
 dad+0.279
Maybe+0.277
 they+0.276
Neg logit contributions
ocol-0.599
 rib-0.518
 microsc-0.510
hy-0.493
ifying-0.478
Token had
Token ID550
Feature activation+5.90e-01

Pos logit contributions
ocol+0.690
 rib+0.594
 microsc+0.572
ifying+0.552
hy+0.551
Neg logit contributions
 someone-0.278
 everything-0.273
 dad-0.257
 something-0.256
Maybe-0.253
Token up
Token ID510
Feature activation+4.08e-01

Pos logit contributions
 someone+0.743
 dad+0.738
Maybe+0.722
 everything+0.702
Mom+0.688
Neg logit contributions
ocol-1.427
 rib-1.255
ifying-1.175
 microsc-1.140
rou-1.138
Token."
Token ID526
Feature activation+6.01e-01

Pos logit contributions
ocol+0.791
 rib+0.675
 microsc+0.649
ifying+0.636
hy+0.624
Neg logit contributions
 someone-0.309
 everything-0.300
 dad-0.288
Maybe-0.285
 something-0.281
Token
Token ID198
Feature activation+6.30e-01

Pos logit contributions
ocol+1.283
 rib+1.109
 microsc+1.073
ifying+1.046
hy+1.039
Neg logit contributions
 someone-0.351
 everything-0.341
 dad-0.317
 something-0.313
Maybe-0.308
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
ocol+1.382
 rib+1.198
 microsc+1.161
ifying+1.134
hy+1.125
Neg logit contributions
 someone-0.329
 everything-0.318
 dad-0.292
 something-0.289
Maybe-0.283
TokenThe
Token ID464
Feature activation-4.70e-01

Pos logit contributions
ocol+0.208
 rib+0.180
 microsc+0.172
ifying+0.166
hy+0.165
Neg logit contributions
 someone-0.102
 everything-0.100
 dad-0.099
 something-0.095
 one-0.093
Token next
Token ID1306
Feature activation-1.82e+00

Pos logit contributions
 someone+0.318
 everything+0.313
Maybe+0.304
 something+0.297
 looked+0.289
Neg logit contributions
ocol-0.934
 rib-0.817
 microsc-0.770
 former-0.737
 topp-0.732
Token day
Token ID1110
Feature activation+4.77e-01

Pos logit contributions
 someone+1.712
Maybe+1.701
 everything+1.655
 what+1.640
 something+1.618
Neg logit contributions
ocol-2.858
 gol-2.562
 rib-2.447
pri-2.319
hy-2.310
Token,
Token ID11
Feature activation-1.90e-01

Pos logit contributions
ocol+0.995
 rib+0.857
 microsc+0.827
ifying+0.807
hy+0.801
Neg logit contributions
 someone-0.294
 everything-0.287
 dad-0.270
Maybe-0.267
 something-0.263
Token the
Token ID262
Feature activation-2.41e-01

Pos logit contributions
 someone+0.118
 dad+0.116
Maybe+0.116
 everything+0.114
 looked+0.107
Neg logit contributions
ocol-0.374
 rib-0.333
 microsc-0.312
ifying-0.303
 topp-0.298
Token sun
Token ID4252
Feature activation+7.48e-03

Pos logit contributions
 someone+0.169
 everything+0.166
Maybe+0.161
 something+0.156
 dad+0.155
Neg logit contributions
ocol-0.476
 rib-0.414
 microsc-0.390
 former-0.376
 topp-0.371
Token came
Token ID1625
Feature activation+1.32e-01

Pos logit contributions
ocol+0.015
 rib+0.013
 microsc+0.012
ifying+0.012
hy+0.011
Neg logit contributions
 someone-0.006
 everything-0.005
 dad-0.005
 something-0.005
Maybe-0.005
Token soon
Token ID2582
Feature activation-4.59e-02

Pos logit contributions
 someone+0.416
Maybe+0.398
How+0.392
 everything+0.383
 dad+0.371
Neg logit contributions
ocol-1.056
 rib-0.951
ifying-0.888
 microsc-0.884
hy-0.874
Token.
Token ID13
Feature activation+3.21e-01

Pos logit contributions
 someone+0.028
 everything+0.027
Maybe+0.026
 dad+0.025
 something+0.024
Neg logit contributions
ocol-0.095
 rib-0.083
 microsc-0.078
ifying-0.077
hy-0.077
Token But
Token ID887
Feature activation+8.30e-01

Pos logit contributions
ocol+0.645
 rib+0.554
 microsc+0.536
ifying+0.524
hy+0.521
Neg logit contributions
 someone-0.216
 everything-0.214
 dad-0.199
 something-0.198
 looked-0.195
Token they
Token ID484
Feature activation+8.98e-01

Pos logit contributions
ocol+2.108
 rib+1.860
 microsc+1.814
ifying+1.779
hy+1.769
Neg logit contributions
 someone-0.177
 everything-0.164
 dad-0.129
 something-0.126
 one-0.115
Token did
Token ID750
Feature activation-9.30e-01

Pos logit contributions
ocol+1.740
 rib+1.459
 microsc+1.415
ifying+1.374
 former+1.353
Neg logit contributions
 someone-0.730
 everything-0.721
 dad-0.691
 looked-0.677
Maybe-0.677
Token not
Token ID407
Feature activation-1.71e+00

Pos logit contributions
Maybe+0.388
 someone+0.349
 looked+0.335
 dad+0.316
Mom+0.306
Neg logit contributions
ocol-2.035
 rib-1.775
 microsc-1.741
onna-1.704
ifying-1.699
Token.
Token ID13
Feature activation+4.23e-01

Pos logit contributions
Maybe+1.776
How+1.765
 someone+1.683
 everything+1.666
 something+1.613
Neg logit contributions
ocol-2.837
 rib-2.274
hy-2.253
onna-2.167
 microsc-2.142
Token They
Token ID1119
Feature activation+7.15e-01

Pos logit contributions
ocol+0.874
 rib+0.752
 microsc+0.730
ifying+0.715
hy+0.705
Neg logit contributions
 someone-0.265
 everything-0.261
 dad-0.242
 something-0.240
Maybe-0.236
Token lied
Token ID19837
Feature activation+7.12e-02

Pos logit contributions
ocol+1.377
 rib+1.162
 microsc+1.123
ifying+1.092
hy+1.077
Neg logit contributions
 someone-0.574
 everything-0.562
 dad-0.534
 something-0.527
Maybe-0.524
Token.
Token ID13
Feature activation+2.28e-01

Pos logit contributions
ocol+0.128
 rib+0.112
 microsc+0.107
ifying+0.105
hy+0.104
Neg logit contributions
 someone-0.058
 everything-0.056
 dad-0.053
Maybe-0.053
 something-0.052
Token They
Token ID1119
Feature activation+8.58e-01

Pos logit contributions
ocol+0.467
 rib+0.403
 microsc+0.392
ifying+0.384
hy+0.378
Neg logit contributions
 everything-0.141
 someone-0.141
 dad-0.130
 something-0.129
 looked-0.128
Token they
Token ID484
Feature activation+1.59e-01

Pos logit contributions
ocol+0.147
 rib+0.131
 microsc+0.125
hy+0.122
ifying+0.120
Neg logit contributions
 someone-0.023
 everything-0.022
Maybe-0.021
 dad-0.019
 looked-0.019
Token love
Token ID1842
Feature activation-2.69e-01

Pos logit contributions
ocol+0.264
 rib+0.223
 microsc+0.210
ifying+0.204
hy+0.202
Neg logit contributions
 someone-0.165
 everything-0.158
 dad-0.153
 something-0.152
Maybe-0.150
Token dad
Token ID9955
Feature activation-6.40e-01

Pos logit contributions
 someone+0.197
Maybe+0.196
 everything+0.184
How+0.179
 something+0.177
Neg logit contributions
ocol-0.522
 rib-0.453
 microsc-0.442
 topp-0.410
hy-0.402
Token and
Token ID290
Feature activation+5.21e-01

Pos logit contributions
 someone+0.434
Maybe+0.406
 everything+0.395
How+0.391
Mom+0.378
Neg logit contributions
ocol-1.229
 rib-1.092
hy-1.040
 microsc-1.033
 topp-1.014
Token his
Token ID465
Feature activation+4.72e-01

Pos logit contributions
ocol+1.027
 rib+0.880
 microsc+0.849
ifying+0.823
hy+0.819
Neg logit contributions
 someone-0.381
 everything-0.373
 dad-0.349
 something-0.347
Maybe-0.346
Token laptop
Token ID13224
Feature activation-1.35e+00

Pos logit contributions
ocol+0.849
 rib+0.716
 microsc+0.685
ifying+0.661
hy+0.657
Neg logit contributions
 someone-0.429
 everything-0.420
 something-0.399
 dad-0.399
Maybe-0.399
Token.
Token ID13
Feature activation+3.59e-01

Pos logit contributions
 someone+0.989
How+0.929
Turn+0.912
Maybe+0.912
 everything+0.905
Neg logit contributions
ocol-2.471
 rib-2.273
hy-2.154
 topp-2.079
 microsc-2.040
Token They
Token ID1119
Feature activation+7.00e-01

Pos logit contributions
ocol+0.743
 rib+0.641
 microsc+0.624
ifying+0.605
hy+0.601
Neg logit contributions
 someone-0.224
 everything-0.221
 something-0.203
 dad-0.201
Maybe-0.198
Token hug
Token ID16225
Feature activation-1.21e-01

Pos logit contributions
ocol+1.314
 rib+1.105
 microsc+1.066
ifying+1.038
hy+1.025
Neg logit contributions
 someone-0.588
 everything-0.579
 dad-0.552
 something-0.544
Maybe-0.540
Token dad
Token ID9955
Feature activation-7.54e-01

Pos logit contributions
 someone+0.088
Maybe+0.086
 everything+0.084
How+0.080
 looked+0.080
Neg logit contributions
ocol-0.232
 rib-0.203
 microsc-0.194
ifying-0.184
hy-0.182
Token and
Token ID290
Feature activation+7.69e-01

Pos logit contributions
 someone+0.556
Maybe+0.515
How+0.489
 something+0.475
 everything+0.475
Neg logit contributions
ocol-1.448
 rib-1.292
hy-1.222
 microsc-1.202
rou-1.191