TOP ACTIVATIONS
MAX = 3.392

help each other . Lily let Ben use some paper and
her toy bird . She let it fly high above the
to the grass . They let the balloon fly high in
and their switch . They let each other use the switch
says . They let go of the jet and
by the swings . She let a little bit of sand
Ben were scared . They let go of the lamp and
. L ila and Sam let them have a turn ,
and Ben cried . They let go of the nest and
She opened the door and let the bird out . The
inside it , so she let herself in . She stepped
to the grass . They let the balloon fly high in
After a while , Lily let go of the cow and
open the box . They let the rabbit go . The
, the king decided to let people come to visit .
other . They let go of the spring and
bird was very tired and let out a big yawn .
gate . She smiled and let them in . They saw
sun ball . They also let other kids join them and
!" They both let go of the dress and

MIN ACTIVATIONS
MIN = -3.238

dance around ! Em ma was over joy ed
new symbol ! Em ma 's friends cl apped
the surface . Em ma 's friends cheered as
he was . Em ma loved the present and
his friends dropped her lip bal m on the ground .
birds . Em ma was very proud and
you help ?" Em ma nodded and got to
here . Em ma had a great time
come out ! Em ma was so sad .
down safely ." Em ma understood what they meant
the terrible mess . His mar bles were everywhere ! He
with me ?" Mel ody was very excited and
diamond ! Em ma was so happy and
place . Em ma was so proud of
the night . Em ma was so proud of
away . Em ma and her friends all
Emma ! '. Em ma was so happy she
L ily was scared at first , but then she
away to hear her . Ros y watched sadly as her
ie . They both had birth days coming up , and

INTERVAL 1.735 - 3.392
CONTAINS 0.272%

They have fun with the wheat . But they do
holds the rail . She feels better . She climbs up
the children play . He feels happy and warm . He
was now happy . She showed the pie a spark ly
friends . <|endoftext|> Tom and Lily liked to play in the

INTERVAL 0.077 - 1.735
CONTAINS 44.343%

thick soil . On one summer day , his mom came
new , so it might take a few times for it
friends . " It 's okay , Tom ,"
He learned that sharing is good . So , he shared
One day he decided to go on an adventure .

INTERVAL -1.581 - 0.077
CONTAINS 54.991%

" Let 's get hard and have a fire work show
Tim my 's mom told him they needed to clean his
other and hopped around together . At the end
noticed the little girl and went over to her . She
and count how many he had r aked . <|endoftext|> Once

INTERVAL -3.238 - -1.581
CONTAINS 0.394%

. But then they heard their mom 's voice
Mom asked . " L ily and Max pointed
says slapping is not nice and it hurts . She asks
they both had their own brace lets and they were both
so excited ! She had never seen anything so beautiful .
Token help
Token ID1037
Feature activation+1.18e+00

Pos logit contributions
ifying+1.397
igsaw+1.246
run+1.090
chie+1.068
 brief+1.017
Neg logit contributions
 idea-0.606
 morning-0.496
 day-0.494
 people-0.488
 prevent-0.469
Token each
Token ID1123
Feature activation+6.08e-01

Pos logit contributions
 idea+1.858
 morning+1.634
cut+1.615
 ostr+1.604
 paramedics+1.597
Neg logit contributions
ifying-2.232
igsaw-1.911
run-1.658
chie-1.545
 brief-1.481
Token other
Token ID584
Feature activation-9.69e-03

Pos logit contributions
 idea+0.420
 morning+0.290
 paramedics+0.269
 prevent+0.261
 doctors+0.260
Neg logit contributions
ifying-1.687
igsaw-1.535
chie-1.361
run-1.360
 brief-1.304
Token.
Token ID13
Feature activation+2.86e-01

Pos logit contributions
ifying+0.024
igsaw+0.022
run+0.019
chie+0.019
 brief+0.018
Neg logit contributions
 idea-0.010
 morning-0.008
 day-0.008
 people-0.007
 doctors-0.007
Token Lily
Token ID20037
Feature activation+9.75e-01

Pos logit contributions
 idea+0.400
 prevent+0.338
 day+0.335
 people+0.335
 morning+0.334
Neg logit contributions
ifying-0.626
igsaw-0.549
run-0.469
chie-0.433
jamin-0.428
Token let
Token ID1309
Feature activation+3.39e+00

Pos logit contributions
 idea+1.365
 morning+1.232
 day+1.190
 prevent+1.173
 people+1.163
Neg logit contributions
ifying-1.926
igsaw-1.717
run-1.473
jamin-1.445
 hips-1.390
Token Ben
Token ID3932
Feature activation+1.79e-01

Pos logit contributions
cut+4.730
 idea+4.315
 ostr+4.058
bye+4.006
crow+3.772
Neg logit contributions
ifying-6.718
igsaw-6.120
jamin-5.376
run-4.971
 hips-4.901
Token use
Token ID779
Feature activation-1.81e-02

Pos logit contributions
 idea+0.205
 morning+0.173
 day+0.171
 people+0.170
 doctors+0.168
Neg logit contributions
ifying-0.421
igsaw-0.377
run-0.331
chie-0.318
 brief-0.305
Token some
Token ID617
Feature activation+1.58e-01

Pos logit contributions
ifying+0.046
igsaw+0.042
run+0.037
chie+0.036
 brief+0.034
Neg logit contributions
 idea-0.017
 morning-0.014
 day-0.014
 people-0.013
 doctors-0.013
Token paper
Token ID3348
Feature activation+1.33e-01

Pos logit contributions
 idea+0.208
 morning+0.179
 day+0.177
 people+0.174
 prevent+0.173
Neg logit contributions
ifying-0.344
igsaw-0.305
run-0.263
chie-0.250
 brief-0.243
Token and
Token ID290
Feature activation-3.26e-02

Pos logit contributions
 idea+0.156
 morning+0.133
 day+0.131
 prevent+0.126
 people+0.125
Neg logit contributions
ifying-0.312
igsaw-0.276
run-0.241
chie-0.233
 brief-0.226
Token her
Token ID607
Feature activation+3.15e-01

Pos logit contributions
 idea+0.118
 morning+0.090
 day+0.089
 prevent+0.087
 people+0.087
Neg logit contributions
ifying-0.321
igsaw-0.289
run-0.253
chie-0.245
 brief-0.240
Token toy
Token ID13373
Feature activation+1.12e+00

Pos logit contributions
 idea+0.299
 morning+0.243
 prevent+0.237
 people+0.229
 day+0.228
Neg logit contributions
ifying-0.797
igsaw-0.714
run-0.628
chie-0.615
 brief-0.607
Token bird
Token ID6512
Feature activation+5.27e-01

Pos logit contributions
 idea+1.313
 prevent+1.030
 morning+1.016
 day+0.971
 paramedics+0.951
Neg logit contributions
ifying-2.659
igsaw-2.231
chie-1.996
 brief-1.974
run-1.974
Token.
Token ID13
Feature activation+2.64e-01

Pos logit contributions
 idea+0.584
 morning+0.463
 prevent+0.461
 paramedics+0.460
 people+0.450
Neg logit contributions
ifying-1.262
igsaw-1.093
run-0.994
chie-0.941
 brief-0.931
Token She
Token ID1375
Feature activation+4.80e-01

Pos logit contributions
 idea+0.350
 day+0.294
 morning+0.292
 people+0.291
 prevent+0.291
Neg logit contributions
ifying-0.580
igsaw-0.518
run-0.445
chie-0.419
jamin-0.406
Token let
Token ID1309
Feature activation+3.38e+00

Pos logit contributions
 idea+0.679
 morning+0.569
 paramedics+0.553
 prevent+0.552
 people+0.552
Neg logit contributions
ifying-0.994
igsaw-0.841
run-0.764
chie-0.724
jamin-0.710
Token it
Token ID340
Feature activation+5.76e-01

Pos logit contributions
 idea+5.114
cut+5.063
 paramedics+4.301
 choice+4.132
bye+4.016
Neg logit contributions
ifying-6.159
igsaw-5.498
jamin-4.602
run-4.529
of-4.408
Token fly
Token ID6129
Feature activation+1.52e-02

Pos logit contributions
 idea+0.499
 doctors+0.399
 morning+0.397
 paramedics+0.395
 day+0.383
Neg logit contributions
ifying-1.467
igsaw-1.321
run-1.205
chie-1.174
 brief-1.125
Token high
Token ID1029
Feature activation-7.25e-01

Pos logit contributions
 idea+0.018
 morning+0.015
 doctors+0.015
 people+0.014
 day+0.014
Neg logit contributions
ifying-0.036
igsaw-0.032
run-0.028
chie-0.027
 brief-0.026
Token above
Token ID2029
Feature activation+7.88e-01

Pos logit contributions
ifying+1.776
igsaw+1.589
run+1.387
chie+1.355
 brief+1.291
Neg logit contributions
 idea-0.800
 morning-0.668
 day-0.666
 people-0.648
 prevent-0.616
Token the
Token ID262
Feature activation-3.95e-02

Pos logit contributions
 idea+0.860
 paramedics+0.704
cut+0.679
 doctors+0.674
 prevent+0.666
Neg logit contributions
ifying-1.912
igsaw-1.696
run-1.469
chie-1.435
jamin-1.396
Token to
Token ID284
Feature activation-3.08e-01

Pos logit contributions
ifying+2.587
igsaw+2.268
chie+1.957
run+1.936
jamin+1.849
Neg logit contributions
 idea-1.309
 day-1.166
 morning-1.156
 people-1.128
 hospital-1.061
Token the
Token ID262
Feature activation+6.60e-03

Pos logit contributions
ifying+0.730
igsaw+0.650
run+0.564
chie+0.548
 brief+0.525
Neg logit contributions
 idea-0.365
 morning-0.303
 day-0.301
 people-0.297
 doctors-0.288
Token grass
Token ID8701
Feature activation-4.19e-01

Pos logit contributions
 idea+0.007
 morning+0.006
 day+0.005
 people+0.005
 prevent+0.005
Neg logit contributions
ifying-0.016
igsaw-0.015
run-0.013
chie-0.012
 brief-0.012
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
ifying+1.081
igsaw+0.969
run+0.853
chie+0.835
 brief+0.800
Neg logit contributions
 idea-0.404
 morning-0.323
 day-0.320
 people-0.315
 prevent-0.302
Token They
Token ID1119
Feature activation-1.06e-02

Pos logit contributions
 idea+0.234
 day+0.195
 morning+0.194
 prevent+0.194
 people+0.193
Neg logit contributions
ifying-0.370
igsaw-0.325
run-0.280
chie-0.261
jamin-0.256
Token let
Token ID1309
Feature activation+3.32e+00

Pos logit contributions
ifying+0.022
igsaw+0.019
run+0.016
chie+0.015
 brief+0.015
Neg logit contributions
 idea-0.016
 morning-0.014
 day-0.013
 people-0.013
 doctors-0.013
Token the
Token ID262
Feature activation-4.64e-01

Pos logit contributions
 idea+4.626
cut+4.616
opl+3.687
 exc+3.621
 ad+3.606
Neg logit contributions
ifying-6.658
igsaw-5.640
jamin-4.944
run-4.804
 nowhere-4.622
Token balloon
Token ID21190
Feature activation+8.46e-01

Pos logit contributions
ifying+1.130
igsaw+1.007
run+0.877
chie+0.858
 brief+0.818
Neg logit contributions
 idea-0.524
 morning-0.433
 day-0.430
 people-0.424
 prevent-0.408
Token fly
Token ID6129
Feature activation+4.09e-01

Pos logit contributions
 idea+0.902
 doctors+0.753
 morning+0.749
 people+0.721
 day+0.720
Neg logit contributions
ifying-2.012
igsaw-1.783
run-1.583
chie-1.533
 brief-1.488
Token high
Token ID1029
Feature activation-8.57e-01

Pos logit contributions
 idea+0.564
 doctors+0.482
 morning+0.475
 paramedics+0.472
 people+0.470
Neg logit contributions
ifying-0.879
igsaw-0.761
run-0.663
chie-0.643
 brief-0.604
Token in
Token ID287
Feature activation-6.51e-01

Pos logit contributions
ifying+2.173
igsaw+1.955
run+1.718
chie+1.682
jamin+1.604
Neg logit contributions
 idea-0.879
 morning-0.724
 day-0.723
 people-0.699
 prevent-0.655
Token and
Token ID290
Feature activation+2.25e-01

Pos logit contributions
 idea+1.218
 prevent+1.038
 doctors+1.028
 paramedics+1.024
 morning+1.003
Neg logit contributions
ifying-2.074
igsaw-1.889
run-1.601
chie-1.510
 brief-1.499
Token their
Token ID511
Feature activation+7.41e-01

Pos logit contributions
 idea+0.390
 morning+0.339
 prevent+0.332
 day+0.331
 hospital+0.331
Neg logit contributions
ifying-0.400
igsaw-0.337
run-0.281
chie-0.268
 brief-0.260
Token switch
Token ID5078
Feature activation+7.46e-02

Pos logit contributions
 idea+0.952
 morning+0.789
 prevent+0.776
crow+0.772
 hospital+0.750
Neg logit contributions
ifying-1.649
igsaw-1.433
run-1.252
chie-1.221
 brief-1.206
Token.
Token ID13
Feature activation+8.79e-01

Pos logit contributions
 idea+0.080
 morning+0.066
 day+0.064
 people+0.064
 prevent+0.063
Neg logit contributions
ifying-0.182
igsaw-0.161
run-0.142
chie-0.138
 brief-0.133
Token They
Token ID1119
Feature activation+4.85e-01

Pos logit contributions
 idea+1.331
 prevent+1.149
cut+1.108
 people+1.106
 hospital+1.098
Neg logit contributions
ifying-1.869
igsaw-1.560
run-1.346
jamin-1.212
lic-1.205
Token let
Token ID1309
Feature activation+3.32e+00

Pos logit contributions
 idea+0.865
 morning+0.773
 people+0.750
 day+0.750
 prevent+0.748
Neg logit contributions
ifying-0.834
igsaw-0.672
run-0.556
chie-0.544
 brief-0.524
Token each
Token ID1123
Feature activation+6.59e-01

Pos logit contributions
cut+4.579
 idea+4.468
thank+3.848
bye+3.737
 gard+3.717
Neg logit contributions
ifying-6.854
igsaw-5.522
run-5.028
jamin-4.858
chie-4.598
Token other
Token ID584
Feature activation+9.16e-01

Pos logit contributions
 idea+0.644
 morning+0.491
 doctors+0.469
 day+0.464
 paramedics+0.460
Neg logit contributions
ifying-1.672
igsaw-1.449
run-1.326
chie-1.302
 brief-1.253
Token use
Token ID779
Feature activation+5.56e-01

Pos logit contributions
 idea+0.940
 morning+0.730
 hospital+0.708
crow+0.708
cut+0.705
Neg logit contributions
ifying-2.248
igsaw-2.005
run-1.817
chie-1.737
 brief-1.711
Token the
Token ID262
Feature activation+8.79e-01

Pos logit contributions
 idea+0.589
 morning+0.478
 day+0.471
 prevent+0.456
bye+0.455
Neg logit contributions
ifying-1.367
igsaw-1.211
run-1.084
chie-1.026
 brief-1.004
Token switch
Token ID5078
Feature activation-1.56e-01

Pos logit contributions
 idea+1.039
 morning+0.907
 prevent+0.895
 doctors+0.889
crow+0.886
Neg logit contributions
ifying-1.975
igsaw-1.738
run-1.501
chie-1.483
 brief-1.454
Token says
Token ID1139
Feature activation+4.33e-01

Pos logit contributions
 idea+1.199
 day+1.115
 morning+1.088
 prevent+1.078
cut+1.053
Neg logit contributions
ifying-2.900
igsaw-2.684
run-2.290
jamin-2.262
 brief-2.228
Token.
Token ID13
Feature activation-3.17e-02

Pos logit contributions
 idea+0.452
 morning+0.382
 day+0.378
 prevent+0.364
 hospital+0.364
Neg logit contributions
ifying-1.053
igsaw-0.932
run-0.851
chie-0.805
 hips-0.785
Token
Token ID198
Feature activation+1.03e-02

Pos logit contributions
ifying+0.073
igsaw+0.065
run+0.057
chie+0.054
 brief+0.053
Neg logit contributions
 idea-0.039
 morning-0.032
 day-0.032
 people-0.031
 prevent-0.031
Token
Token ID198
Feature activation-6.93e-01

Pos logit contributions
 idea+0.012
 morning+0.010
 day+0.010
 people+0.010
 prevent+0.010
Neg logit contributions
ifying-0.024
igsaw-0.022
run-0.019
chie-0.018
jamin-0.017
TokenThey
Token ID2990
Feature activation+2.66e-01

Pos logit contributions
ifying+1.607
igsaw+1.424
run+1.222
chie+1.196
 brief+1.136
Neg logit contributions
 idea-0.865
 morning-0.736
 day-0.734
 people-0.733
 doctors-0.705
Token let
Token ID1309
Feature activation+3.31e+00

Pos logit contributions
 idea+0.322
 morning+0.273
 day+0.270
 doctors+0.270
 people+0.265
Neg logit contributions
ifying-0.618
igsaw-0.540
run-0.463
chie-0.462
 brief-0.442
Token go
Token ID467
Feature activation+3.72e-01

Pos logit contributions
cut+3.838
 idea+3.401
thank+3.386
 ad+3.226
 bal+3.221
Neg logit contributions
ifying-7.552
igsaw-6.577
jamin-5.557
run-5.500
|-5.304
Token of
Token ID286
Feature activation-5.56e-01

Pos logit contributions
 idea+0.402
 doctors+0.328
 morning+0.320
 people+0.316
 hospital+0.313
Neg logit contributions
ifying-0.898
igsaw-0.802
run-0.716
chie-0.698
 brief-0.648
Token the
Token ID262
Feature activation+6.47e-01

Pos logit contributions
ifying+1.470
igsaw+1.321
run+1.167
chie+1.146
 brief+1.095
Neg logit contributions
 idea-0.511
 morning-0.402
 day-0.400
 people-0.396
 prevent-0.374
Token jet
Token ID12644
Feature activation+8.23e-01

Pos logit contributions
 idea+0.792
 morning+0.658
 prevent+0.653
 doctors+0.648
 paramedics+0.634
Neg logit contributions
ifying-1.469
igsaw-1.319
chie-1.097
run-1.088
 brief-1.063
Token and
Token ID290
Feature activation+6.23e-01

Pos logit contributions
 idea+0.868
 morning+0.728
 doctors+0.719
 paramedics+0.709
 people+0.697
Neg logit contributions
ifying-1.953
igsaw-1.754
run-1.536
chie-1.522
 brief-1.460
Token by
Token ID416
Feature activation-6.61e-01

Pos logit contributions
ifying+2.255
igsaw+2.022
run+1.765
chie+1.745
jamin+1.662
Neg logit contributions
 idea-0.907
 day-0.729
 morning-0.721
 people-0.720
 doctors-0.677
Token the
Token ID262
Feature activation+7.37e-01

Pos logit contributions
ifying+1.690
igsaw+1.507
run+1.319
chie+1.291
 brief+1.242
Neg logit contributions
 idea-0.685
 day-0.556
 morning-0.555
 people-0.547
 prevent-0.520
Token swings
Token ID26728
Feature activation+5.49e-01

Pos logit contributions
 idea+0.819
 morning+0.635
 prevent+0.620
 doctors+0.609
 day+0.606
Neg logit contributions
ifying-1.814
igsaw-1.566
chie-1.358
run-1.324
 brief-1.304
Token.
Token ID13
Feature activation+8.71e-01

Pos logit contributions
 idea+0.588
 doctors+0.473
 paramedics+0.465
 morning+0.465
 people+0.464
Neg logit contributions
ifying-1.347
igsaw-1.166
run-1.051
chie-1.002
 brief-0.968
Token She
Token ID1375
Feature activation+4.99e-01

Pos logit contributions
 idea+1.195
 prevent+1.023
 day+1.015
cut+1.013
 doctors+1.011
Neg logit contributions
ifying-1.790
igsaw-1.612
run-1.404
chie-1.307
 hips-1.280
Token let
Token ID1309
Feature activation+3.27e+00

Pos logit contributions
 idea+0.691
 morning+0.589
 people+0.580
 day+0.578
 prevent+0.576
Neg logit contributions
ifying-1.033
igsaw-0.897
run-0.805
chie-0.766
jamin-0.748
Token a
Token ID257
Feature activation-1.17e+00

Pos logit contributions
cut+4.433
 idea+4.289
bye+3.575
 paramedics+3.556
 ot+3.448
Neg logit contributions
ifying-6.468
igsaw-5.702
run-5.179
jamin-4.926
chie-4.890
Token little
Token ID1310
Feature activation+1.71e-01

Pos logit contributions
ifying+2.759
igsaw+2.426
run+2.086
chie+2.041
jamin+1.968
Neg logit contributions
 idea-1.491
 day-1.258
 morning-1.250
 people-1.228
 prevent-1.183
Token bit
Token ID1643
Feature activation+6.87e-01

Pos logit contributions
 idea+0.228
 morning+0.196
 day+0.193
 people+0.192
 doctors+0.190
Neg logit contributions
ifying-0.368
igsaw-0.328
run-0.282
chie-0.275
 brief-0.262
Token of
Token ID286
Feature activation+4.80e-01

Pos logit contributions
 idea+0.783
 day+0.630
 morning+0.628
 doctors+0.611
 hospital+0.610
Neg logit contributions
ifying-1.623
igsaw-1.451
run-1.313
chie-1.244
 brief-1.168
Token sand
Token ID6450
Feature activation-4.46e-01

Pos logit contributions
 idea+0.493
 morning+0.411
 day+0.406
 doctors+0.399
 hospital+0.391
Neg logit contributions
ifying-1.154
igsaw-1.048
run-0.922
chie-0.892
 brief-0.859
Token Ben
Token ID3932
Feature activation-3.99e-01

Pos logit contributions
 idea+0.677
 day+0.552
 morning+0.550
 doctors+0.536
 hospital+0.528
Neg logit contributions
ifying-1.249
igsaw-1.123
run-0.960
chie-0.936
 brief-0.918
Token were
Token ID547
Feature activation+4.13e-01

Pos logit contributions
ifying+0.840
igsaw+0.733
run+0.623
chie+0.607
 brief+0.574
Neg logit contributions
 idea-0.572
 morning-0.496
 day-0.492
 people-0.486
 prevent-0.474
Token scared
Token ID12008
Feature activation+1.09e-01

Pos logit contributions
 idea+0.576
 morning+0.506
 day+0.496
 prevent+0.474
 people+0.469
Neg logit contributions
ifying-0.875
igsaw-0.745
run-0.654
chie-0.641
 brief-0.600
Token.
Token ID13
Feature activation+5.58e-01

Pos logit contributions
 idea+0.113
 morning+0.090
 day+0.088
 prevent+0.087
 people+0.085
Neg logit contributions
ifying-0.274
igsaw-0.244
run-0.219
chie-0.210
 brief-0.200
Token They
Token ID1119
Feature activation-7.44e-01

Pos logit contributions
 idea+0.778
 day+0.691
 morning+0.689
 doctors+0.676
 prevent+0.670
Neg logit contributions
ifying-1.163
igsaw-1.018
run-0.874
chie-0.847
 brief-0.785
Token let
Token ID1309
Feature activation+3.26e+00

Pos logit contributions
ifying+1.637
igsaw+1.467
run+1.237
chie+1.204
 brief+1.135
Neg logit contributions
 idea-1.013
 morning-0.860
 day-0.857
 people-0.857
 prevent-0.832
Token go
Token ID467
Feature activation-4.53e-01

Pos logit contributions
cut+4.279
 idea+3.787
thank+3.720
 paramedics+3.367
 doctors+3.251
Neg logit contributions
ifying-7.022
igsaw-6.155
run-5.551
chie-5.335
jamin-5.074
Token of
Token ID286
Feature activation-4.13e-01

Pos logit contributions
ifying+1.174
igsaw+1.053
run+0.927
chie+0.908
 brief+0.870
Neg logit contributions
 idea-0.436
 morning-0.348
 day-0.345
 people-0.339
 prevent-0.324
Token the
Token ID262
Feature activation+6.26e-01

Pos logit contributions
ifying+1.080
igsaw+0.971
run+0.855
chie+0.838
 brief+0.803
Neg logit contributions
 idea-0.389
 morning-0.309
 day-0.305
 people-0.299
 prevent-0.286
Token lamp
Token ID20450
Feature activation-4.58e-01

Pos logit contributions
 idea+0.920
crow+0.805
 morning+0.793
 paramedics+0.792
 prevent+0.784
Neg logit contributions
ifying-1.254
igsaw-1.093
run-0.896
chie-0.884
 brief-0.868
Token and
Token ID290
Feature activation+3.26e-01

Pos logit contributions
ifying+1.172
igsaw+1.051
run+0.924
chie+0.905
 brief+0.866
Neg logit contributions
 idea-0.451
 morning-0.362
 day-0.359
 people-0.353
 prevent-0.338
Token.
Token ID13
Feature activation+2.73e-02

Pos logit contributions
 idea+0.071
 morning+0.057
 people+0.056
 prevent+0.056
 day+0.055
Neg logit contributions
ifying-0.179
igsaw-0.160
run-0.141
chie-0.136
 brief-0.131
Token L
Token ID406
Feature activation-4.55e-01

Pos logit contributions
 idea+0.035
 day+0.029
 morning+0.029
 people+0.028
 prevent+0.028
Neg logit contributions
ifying-0.062
igsaw-0.055
run-0.048
chie-0.046
jamin-0.044
Tokenila
Token ID10102
Feature activation+4.12e-01

Pos logit contributions
ifying+1.133
igsaw+1.012
run+0.885
chie+0.867
 brief+0.828
Neg logit contributions
 idea-0.484
 morning-0.396
 day-0.393
 people-0.387
 prevent-0.372
Token and
Token ID290
Feature activation+8.31e-01

Pos logit contributions
 idea+0.506
 morning+0.422
 day+0.410
 prevent+0.407
 doctors+0.400
Neg logit contributions
ifying-0.934
igsaw-0.817
run-0.733
chie-0.698
jamin-0.689
Token Sam
Token ID3409
Feature activation+4.17e-01

Pos logit contributions
 idea+1.024
 morning+0.862
 day+0.852
 prevent+0.821
cut+0.809
Neg logit contributions
ifying-1.846
igsaw-1.670
run-1.439
 brief-1.386
chie-1.383
Token let
Token ID1309
Feature activation+3.26e+00

Pos logit contributions
 idea+0.637
 morning+0.559
 doctors+0.548
 day+0.547
 people+0.538
Neg logit contributions
ifying-0.827
igsaw-0.687
run-0.601
chie-0.586
jamin-0.559
Token them
Token ID606
Feature activation-2.17e-01

Pos logit contributions
cut+4.617
 idea+4.020
thank+3.641
 doctors+3.517
 paramedics+3.487
Neg logit contributions
ifying-6.739
igsaw-6.023
run-5.322
jamin-5.066
chie-4.896
Token have
Token ID423
Feature activation-1.01e-01

Pos logit contributions
ifying+0.532
igsaw+0.473
run+0.416
chie+0.405
 brief+0.387
Neg logit contributions
 idea-0.235
 morning-0.193
 day-0.190
 people-0.188
 doctors-0.184
Token a
Token ID257
Feature activation-1.28e+00

Pos logit contributions
ifying+0.244
igsaw+0.218
run+0.189
chie+0.184
 brief+0.178
Neg logit contributions
 idea-0.111
 morning-0.090
 day-0.089
 people-0.088
 prevent-0.087
Token turn
Token ID1210
Feature activation+1.21e-01

Pos logit contributions
ifying+3.016
igsaw+2.616
run+2.294
chie+2.293
years+2.147
Neg logit contributions
 idea-1.640
 day-1.417
 morning-1.401
 people-1.272
 prevent-1.262
Token,
Token ID11
Feature activation-2.66e-04

Pos logit contributions
 idea+0.126
 morning+0.102
 day+0.100
 people+0.099
 prevent+0.099
Neg logit contributions
ifying-0.300
igsaw-0.268
run-0.238
chie-0.229
 brief-0.220
Token and
Token ID290
Feature activation-9.91e-01

Pos logit contributions
ifying+0.043
igsaw+0.038
run+0.033
chie+0.032
jamin+0.031
Neg logit contributions
 idea-0.025
 morning-0.022
 day-0.021
 people-0.021
 doctors-0.020
Token Ben
Token ID3932
Feature activation-1.95e-01

Pos logit contributions
ifying+2.630
igsaw+2.359
run+2.057
chie+2.020
 brief+1.925
Neg logit contributions
 idea-0.940
 people-0.765
 morning-0.758
 day-0.741
 doctors-0.721
Token cried
Token ID16896
Feature activation+1.47e-01

Pos logit contributions
ifying+0.400
igsaw+0.342
run+0.295
chie+0.286
 brief+0.270
Neg logit contributions
 idea-0.287
 morning-0.252
 day-0.251
 people-0.244
 doctors-0.240
Token.
Token ID13
Feature activation-1.87e-01

Pos logit contributions
 idea+0.149
 morning+0.122
 day+0.121
 people+0.115
 prevent+0.114
Neg logit contributions
ifying-0.367
igsaw-0.326
run-0.291
chie-0.281
 brief-0.271
Token They
Token ID1119
Feature activation-1.08e+00

Pos logit contributions
ifying+0.440
igsaw+0.387
run+0.338
chie+0.328
 brief+0.314
Neg logit contributions
 idea-0.225
 morning-0.188
 day-0.187
 people-0.183
 prevent-0.180
Token let
Token ID1309
Feature activation+3.25e+00

Pos logit contributions
ifying+2.395
igsaw+2.222
run+1.829
chie+1.784
 brief+1.670
Neg logit contributions
 idea-1.462
 people-1.254
 prevent-1.210
 morning-1.208
 day-1.206
Token go
Token ID467
Feature activation+8.82e-02

Pos logit contributions
cut+3.797
 idea+3.417
thank+3.112
 artist+2.661
bye+2.642
Neg logit contributions
ifying-7.544
igsaw-6.403
chie-5.714
run-5.700
jamin-5.541
Token of
Token ID286
Feature activation-5.14e-01

Pos logit contributions
 idea+0.092
 morning+0.074
 doctors+0.072
 people+0.072
 day+0.072
Neg logit contributions
ifying-0.218
igsaw-0.195
run-0.174
chie-0.168
 brief-0.159
Token the
Token ID262
Feature activation+3.37e-01

Pos logit contributions
ifying+1.354
igsaw+1.218
run+1.074
chie+1.052
 brief+1.008
Neg logit contributions
 idea-0.475
 morning-0.375
 day-0.372
 people-0.368
 prevent-0.350
Token nest
Token ID16343
Feature activation-1.88e-01

Pos logit contributions
 idea+0.363
 morning+0.294
 day+0.285
 people+0.279
 prevent+0.279
Neg logit contributions
ifying-0.818
igsaw-0.726
run-0.632
chie-0.632
 brief-0.611
Token and
Token ID290
Feature activation+2.21e-01

Pos logit contributions
ifying+0.479
igsaw+0.429
run+0.379
chie+0.370
 brief+0.354
Neg logit contributions
 idea-0.183
 morning-0.147
 day-0.145
 people-0.143
 doctors-0.139
Token She
Token ID1375
Feature activation-8.45e-02

Pos logit contributions
ifying+1.089
igsaw+0.968
run+0.840
chie+0.821
 brief+0.782
Neg logit contributions
 idea-0.537
 morning-0.448
 day-0.445
 people-0.439
 prevent-0.424
Token opened
Token ID4721
Feature activation+5.11e-01

Pos logit contributions
ifying+0.189
igsaw+0.165
run+0.144
chie+0.140
jamin+0.134
Neg logit contributions
 idea-0.108
 morning-0.092
 day-0.090
 people-0.089
 prevent-0.087
Token the
Token ID262
Feature activation-5.24e-01

Pos logit contributions
 idea+0.563
 day+0.447
 doctors+0.444
 morning+0.440
 paramedics+0.437
Neg logit contributions
ifying-1.259
igsaw-1.115
run-0.963
chie-0.934
 brief-0.903
Token door
Token ID3420
Feature activation-3.50e-01

Pos logit contributions
ifying+1.088
igsaw+0.952
run+0.804
chie+0.781
jamin+0.735
Neg logit contributions
 idea-0.778
 morning-0.676
 day-0.672
 people-0.666
 prevent-0.647
Token and
Token ID290
Feature activation+2.54e-01

Pos logit contributions
ifying+0.884
igsaw+0.791
run+0.696
chie+0.678
 brief+0.650
Neg logit contributions
 idea-0.355
 morning-0.286
 day-0.284
 people-0.280
 doctors-0.270
Token let
Token ID1309
Feature activation+3.24e+00

Pos logit contributions
 idea+0.341
 morning+0.281
 day+0.281
 people+0.270
 doctors+0.268
Neg logit contributions
ifying-0.558
igsaw-0.482
run-0.418
chie-0.405
 brief-0.393
Token the
Token ID262
Feature activation+2.70e-01

Pos logit contributions
 idea+4.471
cut+4.135
 paramedics+4.084
 bal+3.898
 day+3.770
Neg logit contributions
ifying-6.309
igsaw-5.371
chie-4.571
run-4.547
 brief-4.405
Token bird
Token ID6512
Feature activation+2.91e-01

Pos logit contributions
 idea+0.273
 morning+0.227
 day+0.220
 prevent+0.216
 doctors+0.212
Neg logit contributions
ifying-0.657
igsaw-0.585
run-0.512
chie-0.511
 brief-0.494
Token out
Token ID503
Feature activation-5.94e-01

Pos logit contributions
 idea+0.259
 morning+0.209
 day+0.204
 doctors+0.200
 people+0.200
Neg logit contributions
ifying-0.748
igsaw-0.666
run-0.602
chie-0.589
 brief-0.560
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
ifying+1.565
igsaw+1.410
run+1.237
chie+1.218
jamin+1.168
Neg logit contributions
 idea-0.551
 morning-0.434
 day-0.431
 people-0.424
 prevent-0.401
Token The
Token ID383
Feature activation+6.07e-01

Pos logit contributions
 idea+0.381
 day+0.319
 morning+0.313
 people+0.309
 prevent+0.307
Neg logit contributions
ifying-0.662
igsaw-0.589
run-0.507
chie-0.487
jamin-0.463
Token inside
Token ID2641
Feature activation+5.09e-01

Pos logit contributions
 idea+1.187
 paramedics+1.036
 morning+1.025
 prevent+0.977
 people+0.975
Neg logit contributions
ifying-2.327
igsaw-2.079
run-1.815
chie-1.787
 brief-1.743
Token it
Token ID340
Feature activation+3.18e-02

Pos logit contributions
 idea+0.539
 morning+0.424
 paramedics+0.415
 doctors+0.411
 day+0.411
Neg logit contributions
ifying-1.265
igsaw-1.109
run-0.984
chie-0.939
jamin-0.900
Token,
Token ID11
Feature activation+4.83e-01

Pos logit contributions
 idea+0.031
 morning+0.025
 people+0.024
 day+0.024
 doctors+0.024
Neg logit contributions
ifying-0.081
igsaw-0.072
run-0.064
chie-0.061
 brief-0.059
Token so
Token ID523
Feature activation+7.75e-01

Pos logit contributions
 idea+0.770
 morning+0.685
 day+0.677
crow+0.656
 people+0.653
Neg logit contributions
ifying-0.912
igsaw-0.756
run-0.669
chie-0.622
 brief-0.599
Token she
Token ID673
Feature activation-4.33e-01

Pos logit contributions
 idea+0.899
crow+0.700
 morning+0.692
 prevent+0.692
 paramedics+0.691
Neg logit contributions
ifying-1.831
igsaw-1.586
run-1.425
 brief-1.341
chie-1.330
Token let
Token ID1309
Feature activation+3.23e+00

Pos logit contributions
ifying+1.042
igsaw+0.927
run+0.807
chie+0.789
 brief+0.752
Neg logit contributions
 idea-0.495
 morning-0.411
 day-0.408
 people-0.402
 prevent-0.388
Token herself
Token ID5223
Feature activation-2.35e-01

Pos logit contributions
 idea+3.982
cut+3.939
 ot+3.642
 paramedics+3.592
 ostr+3.513
Neg logit contributions
ifying-6.685
igsaw-5.770
chie-5.146
run-5.033
 brief-5.002
Token in
Token ID287
Feature activation-5.08e-01

Pos logit contributions
ifying+0.525
igsaw+0.464
run+0.401
chie+0.388
 brief+0.371
Neg logit contributions
 idea-0.302
 morning-0.261
 day-0.258
 people-0.255
 doctors-0.248
Token.
Token ID13
Feature activation+4.14e-01

Pos logit contributions
ifying+1.322
igsaw+1.186
run+1.042
chie+1.024
 brief+0.978
Neg logit contributions
 idea-0.493
 morning-0.392
 day-0.389
 people-0.383
 prevent-0.365
Token She
Token ID1375
Feature activation-4.85e-01

Pos logit contributions
 idea+0.556
 doctors+0.461
 day+0.457
 prevent+0.454
 morning+0.454
Neg logit contributions
ifying-0.905
igsaw-0.805
run-0.702
chie-0.658
jamin-0.641
Token stepped
Token ID10764
Feature activation-1.06e-01

Pos logit contributions
ifying+1.040
igsaw+0.912
run+0.775
chie+0.755
 brief+0.713
Neg logit contributions
 idea-0.684
 morning-0.590
 day-0.587
 people-0.581
 prevent-0.564
Token to
Token ID284
Feature activation+2.77e-02

Pos logit contributions
ifying+1.820
igsaw+1.619
run+1.404
chie+1.390
jamin+1.321
Neg logit contributions
 idea-0.808
 morning-0.677
 day-0.676
 people-0.657
 prevent-0.623
Token the
Token ID262
Feature activation+7.63e-02

Pos logit contributions
 idea+0.029
 morning+0.024
 day+0.023
 people+0.023
 doctors+0.023
Neg logit contributions
ifying-0.068
igsaw-0.062
run-0.054
chie-0.052
 brief-0.050
Token grass
Token ID8701
Feature activation-3.44e-01

Pos logit contributions
 idea+0.087
 morning+0.070
 day+0.069
 people+0.068
 prevent+0.068
Neg logit contributions
ifying-0.184
igsaw-0.163
run-0.139
chie-0.138
 brief-0.133
Token.
Token ID13
Feature activation+1.58e-01

Pos logit contributions
ifying+0.873
igsaw+0.781
run+0.687
chie+0.671
 brief+0.643
Neg logit contributions
 idea-0.344
 morning-0.277
 day-0.274
 people-0.271
 prevent-0.261
Token They
Token ID1119
Feature activation-1.11e-01

Pos logit contributions
 idea+0.202
 day+0.166
 people+0.166
 morning+0.166
 prevent+0.165
Neg logit contributions
ifying-0.361
igsaw-0.319
run-0.279
chie-0.261
jamin-0.255
Token let
Token ID1309
Feature activation+3.19e+00

Pos logit contributions
ifying+0.231
igsaw+0.199
run+0.170
chie+0.165
 brief+0.158
Neg logit contributions
 idea-0.162
 morning-0.140
 day-0.138
 people-0.136
 doctors-0.134
Token the
Token ID262
Feature activation-2.78e-02

Pos logit contributions
 idea+4.342
cut+4.293
thank+3.573
 exc+3.570
 bal+3.406
Neg logit contributions
ifying-6.764
igsaw-5.750
run-5.069
jamin-4.895
chie-4.612
Token balloon
Token ID21190
Feature activation+5.01e-01

Pos logit contributions
ifying+0.067
igsaw+0.059
run+0.052
chie+0.051
 brief+0.049
Neg logit contributions
 idea-0.031
 morning-0.025
 day-0.025
 people-0.024
 prevent-0.024
Token fly
Token ID6129
Feature activation-3.55e-02

Pos logit contributions
 idea+0.530
 morning+0.439
 doctors+0.436
 day+0.428
 people+0.427
Neg logit contributions
ifying-1.213
igsaw-1.072
run-0.952
chie-0.927
 brief-0.890
Token high
Token ID1029
Feature activation-8.80e-01

Pos logit contributions
ifying+0.081
igsaw+0.071
run+0.062
chie+0.060
 brief+0.057
Neg logit contributions
 idea-0.045
 morning-0.037
 people-0.037
 doctors-0.037
 day-0.037
Token in
Token ID287
Feature activation-5.55e-01

Pos logit contributions
ifying+2.229
igsaw+2.008
run+1.766
chie+1.725
 brief+1.648
Neg logit contributions
 idea-0.898
 morning-0.739
 day-0.736
 people-0.712
 prevent-0.671
TokenAfter
Token ID3260
Feature activation-8.91e-01

Pos logit contributions
 idea+0.310
 morning+0.261
 people+0.257
 day+0.256
 doctors+0.252
Neg logit contributions
ifying-0.459
igsaw-0.400
run-0.338
chie-0.330
jamin-0.314
Token a
Token ID257
Feature activation-9.99e-01

Pos logit contributions
ifying+2.197
igsaw+1.957
run+1.710
chie+1.697
jamin+1.602
Neg logit contributions
 idea-0.979
 morning-0.847
 day-0.840
 people-0.824
 doctors-0.789
Token while
Token ID981
Feature activation+5.77e-01

Pos logit contributions
ifying+2.431
igsaw+2.142
run+1.890
chie+1.844
jamin+1.759
Neg logit contributions
 idea-1.151
 day-1.001
 morning-0.990
 people-0.936
 prevent-0.903
Token,
Token ID11
Feature activation+3.98e-01

Pos logit contributions
 idea+0.604
 prevent+0.448
 people+0.437
 doctors+0.435
cut+0.435
Neg logit contributions
ifying-1.454
igsaw-1.290
run-1.144
chie-1.089
 brief-1.068
Token Lily
Token ID20037
Feature activation-1.84e-01

Pos logit contributions
 idea+0.525
 day+0.416
 prevent+0.415
 morning+0.414
 people+0.411
Neg logit contributions
ifying-0.891
igsaw-0.773
run-0.647
chie-0.626
 brief-0.612
Token let
Token ID1309
Feature activation+3.19e+00

Pos logit contributions
ifying+0.409
igsaw+0.358
run+0.309
chie+0.300
jamin+0.289
Neg logit contributions
 idea-0.243
 morning-0.206
 day-0.204
 people-0.201
 prevent-0.196
Token go
Token ID467
Feature activation-6.31e-01

Pos logit contributions
 idea+4.447
cut+4.156
 ostr+3.634
 bal+3.613
 exc+3.540
Neg logit contributions
ifying-6.387
igsaw-5.414
jamin-4.713
run-4.495
 nowhere-4.463
Token of
Token ID286
Feature activation+4.62e-01

Pos logit contributions
ifying+1.659
igsaw+1.489
run+1.307
chie+1.288
 brief+1.234
Neg logit contributions
 idea-0.588
 morning-0.472
 day-0.469
 people-0.455
 prevent-0.434
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
 idea+0.491
 morning+0.394
 day+0.389
 people+0.365
bye+0.364
Neg logit contributions
ifying-1.122
igsaw-0.989
run-0.868
chie-0.837
 brief-0.826
Token cow
Token ID9875
Feature activation+3.64e-01

Pos logit contributions
 idea+0.998
 bal+0.727
cut+0.714
 if+0.708
crow+0.703
Neg logit contributions
ifying-3.095
igsaw-2.686
run-2.351
 brief-2.350
chie-2.343
Token and
Token ID290
Feature activation+5.53e-01

Pos logit contributions
 idea+0.425
 morning+0.350
 day+0.344
 people+0.344
 doctors+0.336
Neg logit contributions
ifying-0.851
igsaw-0.756
run-0.660
chie-0.629
 brief-0.615
Token open
Token ID1280
Feature activation+1.74e-01

Pos logit contributions
 idea+0.197
 morning+0.170
 day+0.170
 people+0.163
 doctors+0.162
Neg logit contributions
ifying-0.386
igsaw-0.337
run-0.291
chie-0.289
 brief-0.278
Token the
Token ID262
Feature activation-1.76e+00

Pos logit contributions
 idea+0.182
 morning+0.144
 day+0.144
 doctors+0.140
 people+0.138
Neg logit contributions
ifying-0.438
igsaw-0.388
run-0.341
chie-0.330
 brief-0.318
Token box
Token ID3091
Feature activation-6.25e-02

Pos logit contributions
ifying+3.616
igsaw+3.186
run+2.791
chie+2.741
years+2.573
Neg logit contributions
 idea-2.751
 people-2.409
 day-2.360
 morning-2.343
 hospital-2.266
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
ifying+0.156
igsaw+0.139
run+0.124
chie+0.119
 brief+0.115
Neg logit contributions
 idea-0.063
 morning-0.052
 day-0.051
 people-0.050
 doctors-0.050
Token They
Token ID1119
Feature activation+2.08e-01

Pos logit contributions
ifying+0.400
igsaw+0.351
run+0.304
chie+0.292
 brief+0.281
Neg logit contributions
 idea-0.224
 morning-0.189
 day-0.189
 people-0.185
 prevent-0.181
Token let
Token ID1309
Feature activation+3.18e+00

Pos logit contributions
 idea+0.247
 morning+0.215
 day+0.214
 doctors+0.202
 people+0.201
Neg logit contributions
ifying-0.485
igsaw-0.421
run-0.366
chie-0.359
 brief-0.349
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
cut+4.199
 idea+4.063
 bal+3.774
thank+3.526
opl+3.432
Neg logit contributions
ifying-6.765
igsaw-5.395
run-4.917
 nowhere-4.778
jamin-4.661
Token rabbit
Token ID22746
Feature activation+7.89e-01

Pos logit contributions
ifying+2.978
igsaw+2.651
run+2.326
chie+2.257
jamin+2.112
Neg logit contributions
 idea-1.578
 people-1.312
 day-1.307
 morning-1.291
 doctors-1.233
Token go
Token ID467
Feature activation+4.72e-01

Pos logit contributions
 idea+0.909
 morning+0.806
 day+0.784
 doctors+0.775
 people+0.756
Neg logit contributions
ifying-1.807
igsaw-1.599
run-1.440
chie-1.363
 brief-1.310
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
 idea+0.529
 morning+0.431
 doctors+0.422
 people+0.411
 hospital+0.410
Neg logit contributions
ifying-1.123
igsaw-0.996
run-0.898
chie-0.868
 brief-0.816
Token The
Token ID383
Feature activation-4.17e-01

Pos logit contributions
 idea+0.412
 day+0.347
 morning+0.344
 doctors+0.337
 people+0.337
Neg logit contributions
ifying-0.675
igsaw-0.581
run-0.511
chie-0.473
 brief-0.465
Token,
Token ID11
Feature activation+7.10e-01

Pos logit contributions
 idea+1.758
cut+1.403
 hospital+1.387
 paramedics+1.385
 prevent+1.338
Neg logit contributions
ifying-3.240
igsaw-2.699
run-2.479
chie-2.305
 brief-2.286
Token the
Token ID262
Feature activation+6.76e-01

Pos logit contributions
 idea+0.838
 paramedics+0.649
 morning+0.646
 day+0.636
 hospital+0.632
Neg logit contributions
ifying-1.681
igsaw-1.437
run-1.267
chie-1.235
 brief-1.197
Token king
Token ID5822
Feature activation+9.39e-01

Pos logit contributions
 idea+0.868
 prevent+0.743
 morning+0.727
 bal+0.702
 paramedics+0.699
Neg logit contributions
ifying-1.471
igsaw-1.262
chie-1.106
 brief-1.048
run-1.039
Token decided
Token ID3066
Feature activation+6.60e-01

Pos logit contributions
 idea+1.196
 morning+1.007
 prevent+1.001
 paramedics+0.969
 doctors+0.956
Neg logit contributions
ifying-2.050
igsaw-1.752
run-1.598
chie-1.520
jamin-1.481
Token to
Token ID284
Feature activation+8.14e-01

Pos logit contributions
 idea+1.068
 morning+0.913
 day+0.891
 paramedics+0.865
crow+0.861
Neg logit contributions
ifying-1.240
igsaw-1.057
run-0.925
chie-0.881
 brief-0.868
Token let
Token ID1309
Feature activation+3.18e+00

Pos logit contributions
 idea+0.812
 morning+0.691
 doctors+0.668
 day+0.665
 paramedics+0.648
Neg logit contributions
ifying-1.980
igsaw-1.764
chie-1.529
run-1.516
 brief-1.450
Token people
Token ID661
Feature activation-1.60e-01

Pos logit contributions
 idea+4.094
cut+3.890
 paramedics+3.760
thank+3.538
 bal+3.494
Neg logit contributions
ifying-6.234
igsaw-5.562
run-5.129
chie-5.097
 brief-4.602
Token come
Token ID1282
Feature activation-3.68e-01

Pos logit contributions
ifying+0.400
igsaw+0.356
run+0.314
chie+0.308
 brief+0.294
Neg logit contributions
 idea-0.163
 morning-0.133
 day-0.131
 people-0.127
 doctors-0.125
Token to
Token ID284
Feature activation+2.64e-01

Pos logit contributions
ifying+0.882
igsaw+0.783
run+0.683
chie+0.667
 brief+0.636
Neg logit contributions
 idea-0.422
 morning-0.351
 day-0.348
 people-0.342
 doctors-0.331
Token visit
Token ID3187
Feature activation+1.73e-01

Pos logit contributions
 idea+0.283
 morning+0.227
 day+0.224
 paramedics+0.216
 doctors+0.215
Neg logit contributions
ifying-0.639
igsaw-0.572
run-0.496
chie-0.488
 brief-0.474
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
 idea+0.177
 morning+0.139
 day+0.136
 paramedics+0.133
 doctors+0.131
Neg logit contributions
ifying-0.432
igsaw-0.386
run-0.341
chie-0.331
 brief-0.320
Token other
Token ID584
Feature activation-1.86e-02

Pos logit contributions
 idea+0.421
 morning+0.305
 day+0.275
 doctors+0.274
 prevent+0.273
Neg logit contributions
ifying-1.682
igsaw-1.494
run-1.380
chie-1.338
 brief-1.298
Token.
Token ID13
Feature activation-9.66e-02

Pos logit contributions
ifying+0.047
igsaw+0.042
run+0.037
chie+0.036
 brief+0.035
Neg logit contributions
 idea-0.018
 morning-0.015
 day-0.014
 prevent-0.014
 people-0.014
Token
Token ID198
Feature activation+5.56e-01

Pos logit contributions
ifying+0.226
igsaw+0.199
run+0.172
chie+0.166
jamin+0.160
Neg logit contributions
 idea-0.119
 morning-0.099
 day-0.098
 people-0.096
 prevent-0.095
Token
Token ID198
Feature activation-3.02e-01

Pos logit contributions
 idea+0.685
 morning+0.556
 day+0.538
 prevent+0.532
 doctors+0.532
Neg logit contributions
ifying-1.276
igsaw-1.132
run-1.008
chie-0.938
 brief-0.912
TokenThey
Token ID2990
Feature activation-2.56e-01

Pos logit contributions
ifying+0.655
igsaw+0.572
run+0.488
chie+0.476
 brief+0.451
Neg logit contributions
 idea-0.424
 morning-0.363
 day-0.359
 people-0.355
 prevent-0.346
Token let
Token ID1309
Feature activation+3.16e+00

Pos logit contributions
ifying+0.570
igsaw+0.497
run+0.430
chie+0.421
 brief+0.400
Neg logit contributions
 idea-0.334
 morning-0.286
 day-0.285
 people-0.279
 doctors-0.270
Token go
Token ID467
Feature activation-3.83e-01

Pos logit contributions
cut+4.093
 idea+3.680
thank+3.368
 clim+3.238
 ostr+3.217
Neg logit contributions
ifying-6.903
igsaw-5.877
jamin-5.006
run-4.856
|-4.753
Token of
Token ID286
Feature activation-7.76e-01

Pos logit contributions
ifying+0.983
igsaw+0.881
run+0.777
chie+0.759
 brief+0.727
Neg logit contributions
 idea-0.373
 morning-0.298
 day-0.295
 people-0.291
 prevent-0.279
Token the
Token ID262
Feature activation+8.72e-01

Pos logit contributions
ifying+2.052
igsaw+1.838
run+1.626
chie+1.599
jamin+1.527
Neg logit contributions
 idea-0.731
 people-0.581
 day-0.578
 morning-0.578
 prevent-0.544
Token spring
Token ID6076
Feature activation+4.83e-01

Pos logit contributions
 idea+1.162
 whenever+0.994
 morning+0.968
crow+0.939
 day+0.936
Neg logit contributions
ifying-1.854
igsaw-1.644
 brief-1.377
run-1.353
chie-1.351
Token and
Token ID290
Feature activation-7.67e-02

Pos logit contributions
 idea+0.543
 people+0.442
 doctors+0.437
 prevent+0.436
 morning+0.435
Neg logit contributions
ifying-1.146
igsaw-1.025
run-0.885
chie-0.857
 brief-0.828
Token bird
Token ID6512
Feature activation+3.89e-01

Pos logit contributions
ifying+0.602
igsaw+0.541
run+0.472
chie+0.464
 brief+0.443
Neg logit contributions
 idea-0.251
 morning-0.204
 day-0.201
 people-0.198
 prevent-0.194
Token was
Token ID373
Feature activation+1.08e-01

Pos logit contributions
 idea+0.533
 morning+0.452
 paramedics+0.448
 doctors+0.445
 people+0.439
Neg logit contributions
ifying-0.817
igsaw-0.697
run-0.626
chie-0.599
 brief-0.570
Token very
Token ID845
Feature activation+3.20e-01

Pos logit contributions
 idea+0.133
 morning+0.114
 day+0.112
 people+0.110
 prevent+0.108
Neg logit contributions
ifying-0.243
igsaw-0.212
run-0.187
chie-0.182
 brief-0.174
Token tired
Token ID10032
Feature activation+7.27e-02

Pos logit contributions
 idea+0.411
 morning+0.357
 doctors+0.346
 day+0.345
 hospital+0.344
Neg logit contributions
ifying-0.698
igsaw-0.615
run-0.531
chie-0.530
 brief-0.492
Token and
Token ID290
Feature activation+4.18e-01

Pos logit contributions
 idea+0.076
 morning+0.060
 prevent+0.059
 day+0.059
 people+0.059
Neg logit contributions
ifying-0.180
igsaw-0.160
run-0.140
chie-0.137
 brief-0.131
Token let
Token ID1309
Feature activation+3.16e+00

Pos logit contributions
 idea+0.561
 morning+0.471
 day+0.462
 paramedics+0.461
 people+0.456
Neg logit contributions
ifying-0.892
igsaw-0.754
run-0.670
chie-0.652
 brief-0.621
Token out
Token ID503
Feature activation-1.14e+00

Pos logit contributions
 idea+4.199
cut+4.006
 bal+3.984
 paramedics+3.971
thank+3.634
Neg logit contributions
ifying-6.114
igsaw-4.959
chie-4.670
run-4.599
 nowhere-4.403
Token a
Token ID257
Feature activation-2.50e-01

Pos logit contributions
ifying+2.772
igsaw+2.463
chie+2.106
run+2.094
jamin+2.013
Neg logit contributions
 idea-1.376
 morning-1.142
 day-1.139
 people-1.131
 prevent-1.060
Token big
Token ID1263
Feature activation-2.67e-01

Pos logit contributions
ifying+0.567
igsaw+0.502
run+0.432
chie+0.422
 brief+0.404
Neg logit contributions
 idea-0.312
 morning-0.265
 day-0.262
 people-0.260
 doctors-0.254
Token yawn
Token ID40908
Feature activation+3.19e-01

Pos logit contributions
ifying+0.594
igsaw+0.528
run+0.451
chie+0.441
 brief+0.421
Neg logit contributions
 idea-0.343
 morning-0.293
 day-0.291
 people-0.290
 doctors-0.283
Token.
Token ID13
Feature activation+4.61e-01

Pos logit contributions
 idea+0.328
 morning+0.261
 people+0.256
 day+0.254
 prevent+0.253
Neg logit contributions
ifying-0.795
igsaw-0.694
run-0.623
chie-0.601
lic-0.581
Token gate
Token ID8946
Feature activation+2.30e-01

Pos logit contributions
ifying+0.110
igsaw+0.096
run+0.080
chie+0.078
 brief+0.075
Neg logit contributions
 idea-0.074
 morning-0.064
 day-0.063
 people-0.062
 prevent-0.062
Token.
Token ID13
Feature activation+6.71e-01

Pos logit contributions
 idea+0.236
 morning+0.192
 day+0.186
 people+0.183
 doctors+0.181
Neg logit contributions
ifying-0.579
igsaw-0.511
run-0.457
chie-0.437
 brief-0.423
Token She
Token ID1375
Feature activation-9.17e-01

Pos logit contributions
 idea+0.932
 day+0.794
 morning+0.779
 prevent+0.770
 people+0.761
Neg logit contributions
ifying-1.462
igsaw-1.311
run-1.119
chie-1.010
 brief-1.007
Token smiled
Token ID13541
Feature activation+2.46e-01

Pos logit contributions
ifying+2.180
igsaw+1.953
run+1.648
chie+1.641
 brief+1.532
Neg logit contributions
 idea-1.103
 people-0.927
 day-0.913
 morning-0.908
 doctors-0.894
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
 idea+0.270
 morning+0.222
 day+0.222
 prevent+0.217
 people+0.212
Neg logit contributions
ifying-0.603
igsaw-0.523
run-0.466
chie-0.447
 brief-0.440
Token let
Token ID1309
Feature activation+3.16e+00

Pos logit contributions
ifying+2.498
igsaw+2.234
chie+1.886
run+1.886
 brief+1.744
Neg logit contributions
 idea-1.289
 people-1.099
 morning-1.088
 day-1.080
 doctors-1.049
Token them
Token ID606
Feature activation-7.11e-02

Pos logit contributions
 idea+3.618
cut+3.447
 day+3.034
bye+3.031
 bal+3.020
Neg logit contributions
ifying-7.020
igsaw-6.263
run-5.688
 brief-5.141
jamin-5.044
Token in
Token ID287
Feature activation-1.03e-01

Pos logit contributions
ifying+0.186
igsaw+0.166
run+0.148
chie+0.144
 brief+0.139
Neg logit contributions
 idea-0.065
 morning-0.052
 day-0.051
 people-0.049
 doctors-0.048
Token.
Token ID13
Feature activation+5.29e-01

Pos logit contributions
ifying+0.254
igsaw+0.226
run+0.199
chie+0.192
 brief+0.186
Neg logit contributions
 idea-0.110
 morning-0.092
 day-0.090
 people-0.089
 doctors-0.087
Token They
Token ID1119
Feature activation-4.33e-01

Pos logit contributions
 idea+0.713
 day+0.596
 morning+0.589
 prevent+0.584
 people+0.576
Neg logit contributions
ifying-1.177
igsaw-1.044
run-0.903
chie-0.820
jamin-0.818
Token saw
Token ID2497
Feature activation+1.10e+00

Pos logit contributions
ifying+0.967
igsaw+0.852
run+0.732
chie+0.714
 brief+0.678
Neg logit contributions
 idea-0.567
 morning-0.484
 day-0.480
 people-0.474
 prevent-0.460
Token sun
Token ID4252
Feature activation+2.23e-01

Pos logit contributions
ifying+0.703
igsaw+0.624
run+0.539
chie+0.525
 brief+0.502
Neg logit contributions
 idea-0.371
 morning-0.314
 day-0.309
 people-0.306
 doctors-0.299
Token ball
Token ID2613
Feature activation+3.01e-01

Pos logit contributions
 idea+0.232
 prevent+0.187
 morning+0.186
 people+0.185
 doctors+0.183
Neg logit contributions
ifying-0.547
igsaw-0.486
run-0.426
chie-0.409
 brief-0.401
Token.
Token ID13
Feature activation+8.81e-01

Pos logit contributions
 idea+0.311
 morning+0.252
 prevent+0.248
 people+0.245
 doctors+0.242
Neg logit contributions
ifying-0.742
igsaw-0.661
run-0.586
chie-0.561
 brief-0.543
Token They
Token ID1119
Feature activation+2.75e-01

Pos logit contributions
 idea+1.254
 prevent+1.062
cut+1.056
 doctors+1.032
 morning+1.032
Neg logit contributions
ifying-1.928
igsaw-1.617
run-1.443
chie-1.306
lic-1.290
Token also
Token ID635
Feature activation-1.74e-01

Pos logit contributions
 idea+0.430
 morning+0.381
 day+0.371
 people+0.369
 doctors+0.369
Neg logit contributions
ifying-0.533
igsaw-0.446
run-0.376
chie-0.370
jamin-0.355
Token let
Token ID1309
Feature activation+3.15e+00

Pos logit contributions
ifying+0.372
igsaw+0.323
run+0.277
chie+0.270
 brief+0.257
Neg logit contributions
 idea-0.242
 morning-0.209
 day-0.206
 people-0.204
 doctors-0.200
Token other
Token ID584
Feature activation-1.12e-03

Pos logit contributions
cut+4.221
 idea+3.974
thank+3.441
bye+3.385
 paramedics+3.381
Neg logit contributions
ifying-6.450
igsaw-5.444
run-5.062
jamin-4.796
 nowhere-4.572
Token kids
Token ID3988
Feature activation+3.62e-03

Pos logit contributions
ifying+0.003
igsaw+0.003
run+0.002
chie+0.002
 brief+0.002
Neg logit contributions
 idea-0.001
 morning-0.001
 day-0.001
 prevent-0.001
 hospital-0.001
Token join
Token ID4654
Feature activation-5.22e-01

Pos logit contributions
 idea+0.004
 morning+0.003
 day+0.003
 people+0.003
 doctors+0.003
Neg logit contributions
ifying-0.009
igsaw-0.008
run-0.007
chie-0.006
 brief-0.006
Token them
Token ID606
Feature activation+2.37e-01

Pos logit contributions
ifying+1.392
igsaw+1.255
run+1.106
chie+1.086
 brief+1.042
Neg logit contributions
 idea-0.465
 morning-0.365
 day-0.363
 people-0.357
 prevent-0.336
Token and
Token ID290
Feature activation-1.47e-01

Pos logit contributions
 idea+0.250
 morning+0.205
 doctors+0.199
 day+0.196
 people+0.195
Neg logit contributions
ifying-0.580
igsaw-0.523
run-0.458
chie-0.439
 brief-0.428
Token!"
Token ID2474
Feature activation+2.90e-01

Pos logit contributions
ifying+0.546
igsaw+0.486
run+0.421
chie+0.412
 brief+0.392
Neg logit contributions
 idea-0.268
 morning-0.226
 day-0.223
 people-0.219
 prevent-0.214
Token
Token ID198
Feature activation+3.19e-01

Pos logit contributions
 idea+0.359
 day+0.295
 morning+0.294
 prevent+0.291
 doctors+0.288
Neg logit contributions
ifying-0.662
igsaw-0.590
run-0.518
chie-0.486
jamin-0.471
Token
Token ID198
Feature activation-2.65e-01

Pos logit contributions
 idea+0.400
 morning+0.334
 day+0.327
 prevent+0.318
 doctors+0.317
Neg logit contributions
ifying-0.719
igsaw-0.641
run-0.569
chie-0.536
jamin-0.514
TokenThey
Token ID2990
Feature activation-7.61e-01

Pos logit contributions
ifying+0.580
igsaw+0.509
run+0.435
chie+0.426
 brief+0.403
Neg logit contributions
 idea-0.366
 morning-0.313
 day-0.310
 people-0.303
 prevent-0.297
Token both
Token ID1111
Feature activation-2.66e-01

Pos logit contributions
ifying+1.634
igsaw+1.459
run+1.223
chie+1.189
 brief+1.123
Neg logit contributions
 idea-1.070
 morning-0.915
 people-0.915
 day-0.912
 prevent-0.889
Token let
Token ID1309
Feature activation+3.15e+00

Pos logit contributions
ifying+0.565
igsaw+0.491
run+0.423
chie+0.411
 brief+0.387
Neg logit contributions
 idea-0.377
 morning-0.326
 day-0.324
 people-0.320
 doctors-0.313
Token go
Token ID467
Feature activation+8.24e-02

Pos logit contributions
cut+3.692
 idea+3.181
 ostr+2.969
thank+2.873
 paramedics+2.741
Neg logit contributions
ifying-7.333
igsaw-6.448
run-5.432
jamin-5.405
chie-5.269
Token of
Token ID286
Feature activation-9.10e-01

Pos logit contributions
 idea+0.086
 morning+0.069
 doctors+0.068
 people+0.068
 day+0.067
Neg logit contributions
ifying-0.203
igsaw-0.183
run-0.163
chie-0.157
 brief-0.149
Token the
Token ID262
Feature activation+1.06e-01

Pos logit contributions
ifying+2.482
igsaw+2.235
run+1.987
chie+1.957
jamin+1.876
Neg logit contributions
 idea-0.782
 people-0.618
 day-0.603
 morning-0.601
 prevent-0.562
Token dress
Token ID6576
Feature activation+2.65e-01

Pos logit contributions
 idea+0.125
 morning+0.105
 day+0.103
 prevent+0.101
 people+0.101
Neg logit contributions
ifying-0.245
igsaw-0.219
run-0.187
chie-0.185
 brief-0.179
Token and
Token ID290
Feature activation-1.61e-02

Pos logit contributions
 idea+0.276
 morning+0.230
 day+0.225
 people+0.221
 doctors+0.218
Neg logit contributions
ifying-0.646
igsaw-0.581
run-0.511
chie-0.494
 brief-0.478
Token dance
Token ID9280
Feature activation-1.64e-01

Pos logit contributions
ifying+0.357
igsaw+0.316
run+0.275
chie+0.268
 brief+0.254
Neg logit contributions
 idea-0.186
 morning-0.155
 day-0.153
 people-0.151
 doctors-0.148
Token around
Token ID1088
Feature activation-1.53e-01

Pos logit contributions
ifying+0.410
igsaw+0.367
run+0.321
chie+0.315
 brief+0.300
Neg logit contributions
 idea-0.170
 morning-0.138
 day-0.137
 people-0.135
 doctors-0.132
Token!
Token ID0
Feature activation-7.82e-01

Pos logit contributions
ifying+0.388
igsaw+0.348
run+0.304
chie+0.298
 brief+0.285
Neg logit contributions
 idea-0.153
 morning-0.122
 day-0.120
 people-0.117
 prevent-0.116
Token
Token ID198
Feature activation-6.38e-01

Pos logit contributions
ifying+1.790
igsaw+1.577
run+1.362
chie+1.350
 brief+1.284
Neg logit contributions
 idea-0.976
 morning-0.843
 day-0.831
 people-0.819
 doctors-0.782
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
ifying+1.594
igsaw+1.421
run+1.242
chie+1.224
 brief+1.171
Neg logit contributions
 idea-0.671
 morning-0.552
 day-0.548
 people-0.540
 prevent-0.513
TokenEm
Token ID10161
Feature activation-3.24e+00

Pos logit contributions
ifying+1.847
igsaw+1.566
run+1.280
chie+1.244
 brief+1.169
Neg logit contributions
 idea-1.854
 morning-1.683
 people-1.677
 day-1.675
 doctors-1.629
Tokenma
Token ID2611
Feature activation-3.67e-01

Pos logit contributions
ifying+8.335
igsaw+7.306
run+6.387
chie+6.101
lic+6.086
Neg logit contributions
 idea-3.477
 day-3.418
 morning-3.325
 night-3.009
 people-2.950
Token was
Token ID373
Feature activation+4.18e-01

Pos logit contributions
ifying+0.759
igsaw+0.661
run+0.561
chie+0.545
 brief+0.514
Neg logit contributions
 idea-0.541
 morning-0.469
 day-0.466
 people-0.461
 prevent-0.451
Token over
Token ID625
Feature activation-1.02e-01

Pos logit contributions
 idea+0.718
 morning+0.632
 day+0.626
 people+0.620
 doctors+0.619
Neg logit contributions
ifying-0.763
igsaw-0.637
run-0.539
chie-0.523
jamin-0.487
Tokenjoy
Token ID2633
Feature activation-4.72e-01

Pos logit contributions
ifying+0.156
igsaw+0.128
run+0.100
chie+0.095
 brief+0.088
Neg logit contributions
 idea-0.206
 morning-0.184
 day-0.184
 people-0.182
 prevent-0.181
Tokened
Token ID276
Feature activation-6.97e-02

Pos logit contributions
ifying+0.907
igsaw+0.782
run+0.650
chie+0.631
 brief+0.591
Neg logit contributions
 idea-0.769
 morning-0.678
 day-0.675
 people-0.669
 prevent-0.652
Token new
Token ID649
Feature activation+5.94e-01

Pos logit contributions
ifying+1.736
igsaw+1.537
run+1.345
chie+1.316
jamin+1.262
Neg logit contributions
 idea-0.801
 morning-0.660
 day-0.657
 people-0.639
 prevent-0.623
Token symbol
Token ID6194
Feature activation+2.06e-01

Pos logit contributions
 idea+0.689
 doctors+0.608
 people+0.596
 paramedics+0.595
 morning+0.592
Neg logit contributions
ifying-1.333
igsaw-1.219
run-1.023
chie-1.016
 brief-0.977
Token!
Token ID0
Feature activation-1.30e+00

Pos logit contributions
 idea+0.245
 morning+0.201
 people+0.198
 day+0.196
 prevent+0.194
Neg logit contributions
ifying-0.486
igsaw-0.428
run-0.375
chie-0.363
 brief-0.347
Token
Token ID198
Feature activation-6.71e-01

Pos logit contributions
ifying+2.997
igsaw+2.635
chie+2.304
run+2.293
 brief+2.215
Neg logit contributions
 idea-1.597
 morning-1.394
 day-1.367
 people-1.344
 hospital-1.264
Token
Token ID198
Feature activation-1.13e+00

Pos logit contributions
ifying+1.759
igsaw+1.578
run+1.387
chie+1.371
 brief+1.313
Neg logit contributions
 idea-0.627
 morning-0.501
 day-0.498
 people-0.489
 doctors-0.460
TokenEm
Token ID10161
Feature activation-3.16e+00

Pos logit contributions
ifying+2.053
igsaw+1.755
run+1.438
chie+1.406
 brief+1.322
Neg logit contributions
 idea-1.940
 day-1.752
 morning-1.750
 people-1.748
 doctors-1.703
Tokenma
Token ID2611
Feature activation-7.24e-01

Pos logit contributions
ifying+8.149
igsaw+7.115
run+6.315
lic+5.919
chie+5.863
Neg logit contributions
 idea-3.545
 day-3.381
 people-3.239
 morning-3.195
 night-2.883
Token's
Token ID338
Feature activation+4.50e-02

Pos logit contributions
ifying+1.561
igsaw+1.377
run+1.162
chie+1.132
 brief+1.076
Neg logit contributions
 idea-1.006
 morning-0.872
 day-0.870
 people-0.861
 prevent-0.830
Token friends
Token ID2460
Feature activation-9.73e-01

Pos logit contributions
 idea+0.049
 morning+0.041
 day+0.040
 prevent+0.039
 people+0.039
Neg logit contributions
ifying-0.108
igsaw-0.098
run-0.085
chie-0.083
 brief-0.079
Token cl
Token ID537
Feature activation-1.17e+00

Pos logit contributions
ifying+2.072
igsaw+1.853
run+1.545
chie+1.500
 brief+1.433
Neg logit contributions
 idea-1.378
 day-1.201
 morning-1.198
 people-1.190
 prevent-1.142
Tokenapped
Token ID6320
Feature activation+1.70e-01

Pos logit contributions
ifying+2.539
igsaw+2.266
run+1.887
chie+1.850
 brief+1.826
Neg logit contributions
 idea-1.580
 morning-1.342
 day-1.323
 people-1.304
 prevent-1.276
Token the
Token ID262
Feature activation-7.74e-01

Pos logit contributions
ifying+0.472
igsaw+0.424
run+0.374
chie+0.366
 brief+0.351
Neg logit contributions
 idea-0.156
 morning-0.119
 day-0.118
 people-0.116
 doctors-0.113
Token surface
Token ID4417
Feature activation-2.64e-02

Pos logit contributions
ifying+1.933
igsaw+1.730
run+1.526
chie+1.476
 brief+1.420
Neg logit contributions
 idea-0.829
 day-0.681
 morning-0.681
 people-0.677
 prevent-0.635
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
ifying+0.067
igsaw+0.060
run+0.053
chie+0.052
 brief+0.049
Neg logit contributions
 idea-0.026
 morning-0.021
 day-0.021
 people-0.020
 doctors-0.020
Token
Token ID198
Feature activation-3.91e-01

Pos logit contributions
ifying+2.868
igsaw+2.515
chie+2.198
run+2.176
 brief+2.101
Neg logit contributions
 idea-1.545
 morning-1.360
 day-1.331
 people-1.310
 hospital-1.235
Token
Token ID198
Feature activation-8.99e-01

Pos logit contributions
ifying+0.967
igsaw+0.864
run+0.755
chie+0.737
 brief+0.704
Neg logit contributions
 idea-0.421
 morning-0.344
 day-0.341
 people-0.336
 prevent-0.324
TokenEm
Token ID10161
Feature activation-3.07e+00

Pos logit contributions
ifying+1.827
igsaw+1.593
run+1.346
chie+1.310
 brief+1.239
Neg logit contributions
 idea-1.348
 morning-1.193
 day-1.188
 people-1.185
 prevent-1.150
Tokenma
Token ID2611
Feature activation-1.22e-01

Pos logit contributions
ifying+7.910
igsaw+6.949
run+6.115
lic+5.707
chie+5.680
Neg logit contributions
 idea-3.375
 day-3.250
 morning-3.082
 people-3.048
 prevent-2.814
Token's
Token ID338
Feature activation+5.28e-02

Pos logit contributions
ifying+0.263
igsaw+0.228
run+0.199
chie+0.193
jamin+0.183
Neg logit contributions
 idea-0.168
 morning-0.143
 day-0.141
 people-0.139
 doctors-0.137
Token friends
Token ID2460
Feature activation-3.86e-01

Pos logit contributions
 idea+0.061
 morning+0.052
 day+0.050
 prevent+0.049
 people+0.049
Neg logit contributions
ifying-0.124
igsaw-0.112
run-0.097
chie-0.095
 brief-0.091
Token cheered
Token ID34767
Feature activation-1.10e-01

Pos logit contributions
ifying+0.817
igsaw+0.713
run+0.609
chie+0.593
 brief+0.559
Neg logit contributions
 idea-0.548
 morning-0.473
 day-0.470
 people-0.464
 doctors-0.452
Token as
Token ID355
Feature activation-1.46e-01

Pos logit contributions
ifying+0.280
igsaw+0.250
run+0.220
chie+0.216
jamin+0.207
Neg logit contributions
 idea-0.108
 morning-0.087
 day-0.086
 people-0.084
 doctors-0.082
Token he
Token ID339
Feature activation-8.40e-01

Pos logit contributions
ifying+0.426
igsaw+0.382
run+0.335
chie+0.325
jamin+0.315
Neg logit contributions
 idea-0.170
 morning-0.137
 day-0.135
 people-0.132
 prevent-0.130
Token was
Token ID373
Feature activation+2.43e-01

Pos logit contributions
ifying+1.961
igsaw+1.737
run+1.479
chie+1.457
 brief+1.381
Neg logit contributions
 idea-1.054
 day-0.893
 people-0.887
 morning-0.884
 prevent-0.848
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 idea+0.263
 morning+0.217
 day+0.209
 people+0.207
 doctors+0.204
Neg logit contributions
ifying-0.589
igsaw-0.525
run-0.467
chie-0.448
 brief-0.431
Token
Token ID198
Feature activation-6.45e-02

Pos logit contributions
ifying+2.796
igsaw+2.465
chie+2.200
run+2.150
 brief+2.082
Neg logit contributions
 idea-1.129
 morning-0.951
 day-0.937
 people-0.914
 hospital-0.863
Token
Token ID198
Feature activation-4.08e-01

Pos logit contributions
ifying+0.153
igsaw+0.138
run+0.121
chie+0.115
jamin+0.112
Neg logit contributions
 idea-0.075
 morning-0.061
 day-0.060
 people-0.060
 doctors-0.059
TokenEm
Token ID10161
Feature activation-3.03e+00

Pos logit contributions
ifying+0.867
igsaw+0.759
run+0.645
chie+0.627
 brief+0.593
Neg logit contributions
 idea-0.584
 morning-0.504
 day-0.500
 people-0.495
 prevent-0.482
Tokenma
Token ID2611
Feature activation-2.84e-01

Pos logit contributions
ifying+7.990
igsaw+6.871
run+6.034
chie+5.978
lic+5.737
Neg logit contributions
 idea-3.331
 day-3.179
 morning-3.055
 people-2.874
 night-2.675
Token loved
Token ID6151
Feature activation-6.24e-01

Pos logit contributions
ifying+0.596
igsaw+0.521
run+0.444
chie+0.432
jamin+0.409
Neg logit contributions
 idea-0.407
 morning-0.351
 day-0.349
 people-0.344
 prevent-0.336
Token the
Token ID262
Feature activation+1.21e-01

Pos logit contributions
ifying+1.486
igsaw+1.313
run+1.137
chie+1.119
 brief+1.060
Neg logit contributions
 idea-0.739
 morning-0.621
 day-0.620
 people-0.612
 doctors-0.587
Token present
Token ID1944
Feature activation-1.10e-01

Pos logit contributions
 idea+0.150
 morning+0.130
 people+0.128
 prevent+0.128
 doctors+0.128
Neg logit contributions
ifying-0.270
igsaw-0.245
run-0.204
chie-0.202
jamin-0.194
Token and
Token ID290
Feature activation-3.99e-01

Pos logit contributions
ifying+0.276
igsaw+0.248
run+0.217
chie+0.211
 brief+0.203
Neg logit contributions
 idea-0.112
 morning-0.091
 day-0.089
 people-0.088
 prevent-0.086
Token his
Token ID465
Feature activation-8.89e-02

Pos logit contributions
ifying+0.940
igsaw+0.843
run+0.744
chie+0.728
 brief+0.696
Neg logit contributions
 idea-0.347
 morning-0.275
 day-0.272
 people-0.267
 prevent-0.258
Token friends
Token ID2460
Feature activation+9.68e-01

Pos logit contributions
ifying+0.219
igsaw+0.195
run+0.172
chie+0.168
 brief+0.161
Neg logit contributions
 idea-0.094
 morning-0.077
 day-0.076
 prevent-0.074
 people-0.074
Token dropped
Token ID5710
Feature activation-2.66e-01

Pos logit contributions
 idea+1.331
 morning+1.115
 day+1.080
 prevent+1.057
 doctors+1.054
Neg logit contributions
ifying-2.103
igsaw-1.747
run-1.596
chie-1.527
jamin-1.479
Token her
Token ID607
Feature activation-7.21e-01

Pos logit contributions
ifying+0.638
igsaw+0.564
run+0.493
chie+0.480
 brief+0.459
Neg logit contributions
 idea-0.307
 morning-0.254
 day-0.252
 people-0.248
 prevent-0.242
Token lip
Token ID10645
Feature activation-4.90e-01

Pos logit contributions
ifying+1.664
igsaw+1.484
run+1.276
chie+1.254
jamin+1.185
Neg logit contributions
 idea-0.898
 day-0.757
 morning-0.756
 people-0.742
 prevent-0.712
Token bal
Token ID3652
Feature activation-3.01e+00

Pos logit contributions
ifying+1.243
igsaw+1.114
run+0.977
chie+0.958
 brief+0.915
Neg logit contributions
 idea-0.494
 morning-0.400
 day-0.397
 people-0.390
 prevent-0.373
Tokenm
Token ID76
Feature activation-7.23e-01

Pos logit contributions
igsaw+5.185
ifying+5.063
 brief+3.625
chie+3.559
jamin+3.387
Neg logit contributions
 day-4.910
 if-4.901
 idea-4.850
 morning-4.825
 whenever-4.675
Token on
Token ID319
Feature activation-4.46e-02

Pos logit contributions
ifying+1.446
igsaw+1.262
run+1.048
chie+1.022
 brief+0.964
Neg logit contributions
 idea-1.132
 morning-0.984
 day-0.983
 people-0.973
 prevent-0.940
Token the
Token ID262
Feature activation-5.69e-02

Pos logit contributions
ifying+0.113
igsaw+0.101
run+0.088
chie+0.086
 brief+0.083
Neg logit contributions
 idea-0.045
 morning-0.036
 day-0.035
 people-0.035
 prevent-0.034
Token ground
Token ID2323
Feature activation-3.75e-01

Pos logit contributions
ifying+0.149
igsaw+0.133
run+0.117
chie+0.116
 brief+0.111
Neg logit contributions
 idea-0.052
 morning-0.040
 day-0.040
 people-0.039
 prevent-0.038
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
ifying+0.976
igsaw+0.875
run+0.773
chie+0.756
 brief+0.725
Neg logit contributions
 idea-0.351
 morning-0.278
 day-0.275
 people-0.271
 prevent-0.259
Token birds
Token ID10087
Feature activation-4.55e-01

Pos logit contributions
 idea+0.511
 morning+0.404
 day+0.402
 prevent+0.392
 people+0.380
Neg logit contributions
ifying-1.058
igsaw-0.908
run-0.799
chie-0.795
 brief-0.753
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
ifying+1.127
igsaw+1.007
run+0.880
chie+0.862
 brief+0.823
Neg logit contributions
 idea-0.483
 morning-0.395
 day-0.392
 people-0.386
 prevent-0.371
Token
Token ID220
Feature activation-8.59e-02

Pos logit contributions
ifying+2.212
igsaw+1.919
run+1.650
chie+1.646
 brief+1.564
Neg logit contributions
 idea-1.423
 morning-1.255
 day-1.235
 people-1.228
 doctors-1.166
Token
Token ID198
Feature activation-8.65e-01

Pos logit contributions
ifying+0.205
igsaw+0.184
run+0.161
chie+0.155
jamin+0.148
Neg logit contributions
 idea-0.099
 morning-0.082
 day-0.081
 people-0.079
 doctors-0.078
Token
Token ID198
Feature activation-1.12e+00

Pos logit contributions
ifying+2.216
igsaw+1.968
run+1.729
chie+1.717
 brief+1.646
Neg logit contributions
 idea-0.858
 morning-0.707
 day-0.703
 people-0.693
 prevent-0.646
TokenEm
Token ID10161
Feature activation-2.99e+00

Pos logit contributions
ifying+1.835
igsaw+1.517
run+1.233
chie+1.194
 brief+1.111
Neg logit contributions
 idea-2.111
 morning-1.937
 day-1.932
 people-1.928
 doctors-1.876
Tokenma
Token ID2611
Feature activation-5.74e-01

Pos logit contributions
ifying+7.569
igsaw+6.583
run+5.816
chie+5.458
lic+5.449
Neg logit contributions
 idea-3.421
 day-3.308
 morning-3.216
 people-2.989
 night-2.895
Token was
Token ID373
Feature activation+2.77e-01

Pos logit contributions
ifying+1.290
igsaw+1.140
run+0.976
chie+0.953
 brief+0.906
Neg logit contributions
 idea-0.743
 morning-0.634
 day-0.630
 people-0.624
 prevent-0.603
Token very
Token ID845
Feature activation+3.03e-01

Pos logit contributions
 idea+0.471
 morning+0.417
 day+0.413
 people+0.408
 doctors+0.403
Neg logit contributions
ifying-0.510
igsaw-0.427
run-0.361
chie-0.350
jamin-0.325
Token proud
Token ID6613
Feature activation+6.99e-02

Pos logit contributions
 idea+0.343
 morning+0.296
 day+0.287
 doctors+0.280
 hospital+0.276
Neg logit contributions
ifying-0.718
igsaw-0.639
run-0.562
chie-0.555
 brief-0.516
Token and
Token ID290
Feature activation+1.04e-01

Pos logit contributions
 idea+0.074
 morning+0.060
 day+0.058
 prevent+0.058
 people+0.057
Neg logit contributions
ifying-0.173
igsaw-0.154
run-0.137
chie-0.131
 brief-0.127
Token you
Token ID345
Feature activation-8.46e-01

Pos logit contributions
ifying+0.580
igsaw+0.508
run+0.438
chie+0.428
 brief+0.404
Neg logit contributions
 idea-0.347
 morning-0.295
 day-0.292
 people-0.288
 prevent-0.282
Token help
Token ID1037
Feature activation-3.05e-01

Pos logit contributions
ifying+1.948
igsaw+1.746
run+1.492
chie+1.449
 brief+1.390
Neg logit contributions
 idea-1.063
 morning-0.889
 day-0.887
 people-0.881
 prevent-0.856
Token?"
Token ID1701
Feature activation-9.64e-02

Pos logit contributions
ifying+0.676
igsaw+0.595
run+0.511
chie+0.500
jamin+0.473
Neg logit contributions
 idea-0.402
 morning-0.343
 day-0.340
 people-0.334
 doctors-0.326
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
ifying+0.226
igsaw+0.202
run+0.176
chie+0.169
jamin+0.163
Neg logit contributions
 idea-0.114
 day-0.094
 morning-0.094
 people-0.092
 prevent-0.091
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
ifying+0.806
igsaw+0.721
run+0.629
chie+0.613
jamin+0.585
Neg logit contributions
 idea-0.366
 morning-0.300
 day-0.298
 people-0.293
 prevent-0.285
TokenEm
Token ID10161
Feature activation-2.96e+00

Pos logit contributions
ifying+2.378
igsaw+1.986
run+1.654
chie+1.585
 brief+1.484
Neg logit contributions
 idea-2.301
 people-2.115
 morning-2.110
 day-2.095
 doctors-2.055
Tokenma
Token ID2611
Feature activation-1.19e+00

Pos logit contributions
ifying+7.569
igsaw+6.683
run+5.756
chie+5.547
lic+5.412
Neg logit contributions
 idea-3.379
 day-3.106
 morning-3.045
 people-2.811
 night-2.724
Token nodded
Token ID14464
Feature activation-5.90e-01

Pos logit contributions
ifying+2.483
igsaw+2.163
run+1.780
chie+1.755
 brief+1.646
Neg logit contributions
 idea-1.789
 morning-1.556
 day-1.556
 people-1.544
 prevent-1.490
Token and
Token ID290
Feature activation-2.75e-01

Pos logit contributions
ifying+1.502
igsaw+1.348
run+1.179
chie+1.157
 brief+1.105
Neg logit contributions
 idea-0.597
 morning-0.481
 day-0.477
 people-0.471
 prevent-0.449
Token got
Token ID1392
Feature activation+1.12e+00

Pos logit contributions
ifying+0.615
igsaw+0.542
run+0.469
chie+0.455
 brief+0.433
Neg logit contributions
 idea-0.356
 morning-0.301
 day-0.300
 people-0.295
 doctors-0.288
Token to
Token ID284
Feature activation+3.18e-01

Pos logit contributions
 idea+1.437
 day+1.174
 morning+1.154
 hospital+1.136
 doctors+1.105
Neg logit contributions
ifying-2.533
igsaw-2.189
run-2.007
chie-1.878
years-1.803
Token here
Token ID994
Feature activation-4.65e-02

Pos logit contributions
ifying+0.036
igsaw+0.032
run+0.028
chie+0.027
 brief+0.026
Neg logit contributions
 idea-0.020
 morning-0.017
 day-0.016
 people-0.016
 doctors-0.016
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
ifying+0.116
igsaw+0.103
run+0.091
chie+0.088
 brief+0.085
Neg logit contributions
 idea-0.048
 morning-0.039
 people-0.038
 day-0.038
 doctors-0.038
Token
Token ID220
Feature activation-1.84e-01

Pos logit contributions
ifying+2.875
igsaw+2.514
chie+2.211
run+2.180
 brief+2.095
Neg logit contributions
 idea-1.639
 morning-1.455
 day-1.425
 people-1.395
 whenever-1.313
Token
Token ID198
Feature activation-8.49e-01

Pos logit contributions
ifying+0.444
igsaw+0.396
run+0.347
chie+0.335
jamin+0.321
Neg logit contributions
 idea-0.207
 morning-0.171
 day-0.170
 people-0.166
 prevent-0.162
Token
Token ID198
Feature activation-8.90e-01

Pos logit contributions
ifying+2.169
igsaw+1.933
run+1.695
chie+1.685
 brief+1.608
Neg logit contributions
 idea-0.848
 morning-0.701
 day-0.698
 people-0.684
 prevent-0.641
TokenEm
Token ID10161
Feature activation-2.95e+00

Pos logit contributions
ifying+1.626
igsaw+1.392
run+1.154
chie+1.121
 brief+1.042
Neg logit contributions
 idea-1.522
 morning-1.372
 day-1.369
 people-1.360
 doctors-1.324
Tokenma
Token ID2611
Feature activation+5.83e-02

Pos logit contributions
ifying+7.314
igsaw+6.348
run+5.666
chie+5.313
lic+5.239
Neg logit contributions
 idea-3.515
 day-3.429
 morning-3.345
 people-3.089
 night-3.029
Token had
Token ID550
Feature activation-6.15e-01

Pos logit contributions
 idea+0.080
 morning+0.068
 day+0.067
 people+0.066
 doctors+0.066
Neg logit contributions
ifying-0.125
igsaw-0.109
run-0.095
chie-0.092
jamin-0.088
Token a
Token ID257
Feature activation-5.90e-01

Pos logit contributions
ifying+1.382
igsaw+1.222
run+1.045
chie+1.020
 brief+0.966
Neg logit contributions
 idea-0.806
 morning-0.690
 day-0.686
 people-0.679
 prevent-0.656
Token great
Token ID1049
Feature activation+7.21e-01

Pos logit contributions
ifying+1.430
igsaw+1.272
run+1.112
chie+1.087
 brief+1.030
Neg logit contributions
 idea-0.666
 morning-0.552
 day-0.549
 people-0.536
 prevent-0.517
Token time
Token ID640
Feature activation+3.29e-01

Pos logit contributions
 idea+0.368
 doctors+0.306
 people+0.298
 prevent+0.286
cut+0.283
Neg logit contributions
ifying-2.044
igsaw-1.950
run-1.689
 brief-1.649
chie-1.629
Token come
Token ID1282
Feature activation-4.17e-01

Pos logit contributions
ifying+0.019
igsaw+0.016
run+0.014
chie+0.014
 brief+0.013
Neg logit contributions
 idea-0.009
 morning-0.007
 day-0.007
 people-0.007
 doctors-0.007
Token out
Token ID503
Feature activation-6.33e-01

Pos logit contributions
ifying+0.995
igsaw+0.884
run+0.768
chie+0.751
 brief+0.715
Neg logit contributions
 idea-0.483
 morning-0.402
 day-0.399
 people-0.394
 prevent-0.380
Token!
Token ID0
Feature activation-1.25e+00

Pos logit contributions
ifying+1.643
igsaw+1.477
run+1.295
chie+1.275
 brief+1.221
Neg logit contributions
 idea-0.603
 morning-0.481
 day-0.479
 people-0.471
 prevent-0.447
Token
Token ID198
Feature activation-5.33e-01

Pos logit contributions
ifying+3.113
igsaw+2.741
chie+2.432
run+2.401
 brief+2.335
Neg logit contributions
 idea-1.321
 morning-1.128
 day-1.104
 people-1.081
 hospital-0.998
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
ifying+1.328
igsaw+1.184
run+1.035
chie+1.017
 brief+0.971
Neg logit contributions
 idea-0.564
 morning-0.462
 day-0.459
 people-0.452
 prevent-0.432
TokenEm
Token ID10161
Feature activation-2.94e+00

Pos logit contributions
ifying+2.206
igsaw+1.881
run+1.574
chie+1.536
 brief+1.447
Neg logit contributions
 idea-1.885
 people-1.689
 morning-1.689
 day-1.688
 prevent-1.635
Tokenma
Token ID2611
Feature activation-3.38e-01

Pos logit contributions
ifying+7.496
igsaw+6.495
run+5.709
chie+5.394
lic+5.368
Neg logit contributions
 idea-3.346
 day-3.153
 morning-3.037
 people-2.914
 night-2.732
Token was
Token ID373
Feature activation-7.36e-02

Pos logit contributions
ifying+0.724
igsaw+0.633
run+0.542
chie+0.527
jamin+0.499
Neg logit contributions
 idea-0.472
 morning-0.406
 day-0.403
 people-0.399
 prevent-0.388
Token so
Token ID523
Feature activation-3.69e-01

Pos logit contributions
ifying+0.141
igsaw+0.121
run+0.102
chie+0.098
jamin+0.092
Neg logit contributions
 idea-0.120
 morning-0.105
 day-0.105
 people-0.103
 prevent-0.101
Token sad
Token ID6507
Feature activation-3.41e-01

Pos logit contributions
ifying+0.930
igsaw+0.831
run+0.730
chie+0.714
 brief+0.682
Neg logit contributions
 idea-0.380
 morning-0.309
 day-0.306
 people-0.301
 doctors-0.290
Token.
Token ID13
Feature activation-6.51e-01

Pos logit contributions
ifying+0.862
igsaw+0.772
run+0.678
chie+0.662
 brief+0.634
Neg logit contributions
 idea-0.346
 morning-0.280
 day-0.277
 people-0.273
 prevent-0.263
Token down
Token ID866
Feature activation-4.09e-01

Pos logit contributions
ifying+0.876
igsaw+0.781
run+0.683
chie+0.668
 brief+0.637
Neg logit contributions
 idea-0.382
 morning-0.312
 day-0.310
 people-0.305
 doctors-0.294
Token safely
Token ID11512
Feature activation-5.34e-01

Pos logit contributions
ifying+1.013
igsaw+0.905
run+0.792
chie+0.775
 brief+0.740
Neg logit contributions
 idea-0.433
 morning-0.353
 day-0.351
 people-0.345
 prevent-0.332
Token."
Token ID526
Feature activation-3.14e-01

Pos logit contributions
ifying+1.321
igsaw+1.181
run+1.033
chie+1.010
 brief+0.967
Neg logit contributions
 idea-0.567
 morning-0.465
 day-0.462
 people-0.454
 prevent-0.435
Token
Token ID198
Feature activation-4.33e-01

Pos logit contributions
ifying+0.734
igsaw+0.654
run+0.567
chie+0.551
jamin+0.525
Neg logit contributions
 idea-0.377
 morning-0.314
 day-0.313
 people-0.308
 prevent-0.300
Token
Token ID198
Feature activation-1.22e+00

Pos logit contributions
ifying+1.067
igsaw+0.953
run+0.832
chie+0.814
 brief+0.777
Neg logit contributions
 idea-0.470
 morning-0.385
 day-0.382
 people-0.377
 prevent-0.363
TokenEm
Token ID10161
Feature activation-2.93e+00

Pos logit contributions
ifying+2.401
igsaw+2.024
run+1.703
chie+1.649
 brief+1.547
Neg logit contributions
 idea-1.978
 people-1.758
 day-1.755
 morning-1.753
 prevent-1.718
Tokenma
Token ID2611
Feature activation-2.90e-01

Pos logit contributions
ifying+7.518
igsaw+6.524
run+5.782
chie+5.457
lic+5.336
Neg logit contributions
 idea-3.434
 day-3.142
 morning-3.029
 people-2.969
 prevent-2.772
Token understood
Token ID7247
Feature activation-5.05e-01

Pos logit contributions
ifying+0.582
igsaw+0.507
run+0.429
chie+0.415
jamin+0.392
Neg logit contributions
 idea-0.439
 morning-0.383
 day-0.380
 people-0.378
 doctors-0.369
Token what
Token ID644
Feature activation-4.94e-01

Pos logit contributions
ifying+1.270
igsaw+1.135
run+0.993
chie+0.975
 brief+0.930
Neg logit contributions
 idea-0.528
 morning-0.429
 day-0.426
 people-0.420
 prevent-0.403
Token they
Token ID484
Feature activation-5.01e-01

Pos logit contributions
ifying+1.229
igsaw+1.097
run+0.959
chie+0.940
 brief+0.897
Neg logit contributions
 idea-0.529
 morning-0.433
 day-0.430
 people-0.423
 prevent-0.406
Token meant
Token ID4001
Feature activation+9.27e-02

Pos logit contributions
ifying+1.129
igsaw+0.996
run+0.855
chie+0.834
 brief+0.791
Neg logit contributions
 idea-0.657
 morning-0.558
 day-0.556
 people-0.548
 prevent-0.532
Token the
Token ID262
Feature activation-3.00e-01

Pos logit contributions
 idea+0.170
 morning+0.144
 day+0.143
 prevent+0.139
 people+0.137
Neg logit contributions
ifying-0.275
igsaw-0.240
run-0.208
chie-0.199
 brief-0.192
Token terrible
Token ID7818
Feature activation-4.39e-01

Pos logit contributions
ifying+0.634
igsaw+0.554
run+0.468
chie+0.458
 brief+0.433
Neg logit contributions
 idea-0.430
 morning-0.372
 day-0.369
 people-0.363
 prevent-0.358
Token mess
Token ID2085
Feature activation-6.04e-01

Pos logit contributions
ifying+0.962
igsaw+0.846
run+0.724
chie+0.706
 brief+0.669
Neg logit contributions
 idea-0.593
 morning-0.508
 day-0.505
 people-0.499
 prevent-0.485
Token.
Token ID13
Feature activation-5.00e-01

Pos logit contributions
ifying+1.507
igsaw+1.348
run+1.178
chie+1.159
 brief+1.105
Neg logit contributions
 idea-0.631
 morning-0.516
 day-0.514
 people-0.507
 doctors-0.482
Token His
Token ID2399
Feature activation-5.39e-01

Pos logit contributions
ifying+1.123
igsaw+0.989
run+0.850
chie+0.831
 brief+0.788
Neg logit contributions
 idea-0.653
 morning-0.557
 day-0.553
 people-0.547
 prevent-0.529
Token mar
Token ID1667
Feature activation-2.92e+00

Pos logit contributions
ifying+1.352
igsaw+1.207
run+1.056
chie+1.036
 brief+0.987
Neg logit contributions
 idea-0.570
 morning-0.465
 day-0.463
 people-0.455
 prevent-0.434
Tokenbles
Token ID7689
Feature activation-1.13e+00

Pos logit contributions
ifying+5.485
igsaw+4.815
run+3.720
jamin+3.576
 brief+3.506
Neg logit contributions
 idea-5.094
 people-4.724
 doctors-4.292
cut-4.289
 day-4.251
Token were
Token ID547
Feature activation-9.42e-01

Pos logit contributions
ifying+2.524
igsaw+2.257
run+1.886
chie+1.860
 brief+1.758
Neg logit contributions
 idea-1.533
 day-1.316
 people-1.312
 morning-1.296
 prevent-1.253
Token everywhere
Token ID8347
Feature activation-4.27e-01

Pos logit contributions
ifying+2.030
igsaw+1.814
run+1.520
chie+1.499
jamin+1.417
Neg logit contributions
 idea-1.302
 day-1.122
 people-1.119
 morning-1.115
 doctors-1.085
Token!
Token ID0
Feature activation-3.73e-01

Pos logit contributions
ifying+1.044
igsaw+0.931
run+0.813
chie+0.795
 brief+0.759
Neg logit contributions
 idea-0.468
 morning-0.386
 day-0.382
 people-0.377
 prevent-0.363
Token He
Token ID679
Feature activation-1.13e+00

Pos logit contributions
ifying+0.828
igsaw+0.730
run+0.626
chie+0.609
 brief+0.578
Neg logit contributions
 idea-0.498
 morning-0.425
 day-0.423
 people-0.417
 prevent-0.406
Token with
Token ID351
Feature activation-5.70e-03

Pos logit contributions
ifying+0.467
igsaw+0.415
run+0.362
chie+0.353
 brief+0.337
Neg logit contributions
 idea-0.218
 morning-0.178
 day-0.177
 people-0.173
 prevent-0.169
Token me
Token ID502
Feature activation-1.10e+00

Pos logit contributions
ifying+0.012
igsaw+0.011
run+0.009
chie+0.009
 brief+0.008
Neg logit contributions
 idea-0.008
 morning-0.007
 day-0.007
 people-0.007
 prevent-0.007
Token?"
Token ID1701
Feature activation-4.07e-01

Pos logit contributions
ifying+2.783
igsaw+2.524
run+2.171
chie+2.152
 brief+2.041
Neg logit contributions
 idea-1.134
 morning-0.939
 day-0.938
 people-0.915
 prevent-0.855
Token
Token ID198
Feature activation-2.85e-01

Pos logit contributions
ifying+0.989
igsaw+0.882
run+0.768
chie+0.751
 brief+0.717
Neg logit contributions
 idea-0.451
 morning-0.373
 day-0.370
 people-0.364
 prevent-0.351
Token
Token ID198
Feature activation-9.77e-01

Pos logit contributions
ifying+0.688
igsaw+0.616
run+0.536
chie+0.524
 brief+0.500
Neg logit contributions
 idea-0.317
 morning-0.262
 day-0.259
 people-0.254
 doctors-0.247
TokenMel
Token ID21102
Feature activation-2.92e+00

Pos logit contributions
ifying+2.096
igsaw+1.815
run+1.566
chie+1.502
 brief+1.419
Neg logit contributions
 idea-1.384
 people-1.209
 day-1.189
 morning-1.188
 prevent-1.152
Tokenody
Token ID1118
Feature activation-1.20e+00

Pos logit contributions
ifying+6.651
igsaw+5.621
run+5.014
chie+4.522
 brief+4.413
Neg logit contributions
 idea-4.256
 people-3.902
 day-3.739
 morning-3.557
crow-3.498
Token was
Token ID373
Feature activation+5.51e-01

Pos logit contributions
ifying+2.654
igsaw+2.338
run+1.963
chie+1.923
 brief+1.811
Neg logit contributions
 idea-1.678
 day-1.421
 morning-1.416
 people-1.404
 prevent-1.339
Token very
Token ID845
Feature activation+5.84e-01

Pos logit contributions
 idea+0.837
 morning+0.753
 day+0.744
 prevent+0.733
 doctors+0.729
Neg logit contributions
ifying-1.080
igsaw-0.914
run-0.806
chie-0.766
jamin-0.743
Token excited
Token ID6568
Feature activation-4.02e-01

Pos logit contributions
 idea+0.607
 morning+0.527
 doctors+0.514
 hospital+0.507
 day+0.503
Neg logit contributions
ifying-1.406
igsaw-1.260
run-1.118
chie-1.093
jamin-1.042
Token and
Token ID290
Feature activation-2.95e-02

Pos logit contributions
ifying+1.028
igsaw+0.922
run+0.811
chie+0.793
 brief+0.760
Neg logit contributions
 idea-0.395
 morning-0.317
 day-0.314
 people-0.309
 prevent-0.297
Token diamond
Token ID15291
Feature activation-2.83e-01

Pos logit contributions
 idea+0.764
 paramedics+0.691
 doctors+0.688
 people+0.678
 morning+0.665
Neg logit contributions
ifying-1.335
igsaw-1.228
run-1.013
chie-0.993
 brief-0.974
Token!
Token ID0
Feature activation-1.32e+00

Pos logit contributions
ifying+0.682
igsaw+0.606
run+0.528
chie+0.516
 brief+0.493
Neg logit contributions
 idea-0.320
 morning-0.265
 day-0.262
 people-0.260
 prevent-0.251
Token
Token ID220
Feature activation-5.61e-02

Pos logit contributions
ifying+3.126
igsaw+2.733
chie+2.421
run+2.376
 brief+2.323
Neg logit contributions
 idea-1.548
 morning-1.352
 day-1.319
 people-1.299
 hospital-1.213
Token
Token ID198
Feature activation-9.37e-01

Pos logit contributions
ifying+0.134
igsaw+0.120
run+0.105
chie+0.101
jamin+0.097
Neg logit contributions
 idea-0.064
 morning-0.054
 day-0.053
 people-0.052
 doctors-0.051
Token
Token ID198
Feature activation-1.04e+00

Pos logit contributions
ifying+2.414
igsaw+2.143
run+1.877
chie+1.876
 brief+1.789
Neg logit contributions
 idea-0.925
 morning-0.759
 day-0.758
 people-0.748
 prevent-0.697
TokenEm
Token ID10161
Feature activation-2.92e+00

Pos logit contributions
ifying+1.990
igsaw+1.700
run+1.423
chie+1.392
 brief+1.304
Neg logit contributions
 idea-1.683
 morning-1.504
 day-1.503
 people-1.498
 doctors-1.457
Tokenma
Token ID2611
Feature activation-2.47e-01

Pos logit contributions
ifying+7.415
igsaw+6.415
run+5.680
lic+5.337
 brief+5.304
Neg logit contributions
 idea-3.432
 day-3.255
 morning-3.124
 people-3.027
 prevent-2.888
Token was
Token ID373
Feature activation+1.04e-01

Pos logit contributions
ifying+0.547
igsaw+0.481
run+0.416
chie+0.405
jamin+0.384
Neg logit contributions
 idea-0.327
 morning-0.278
 day-0.276
 people-0.272
 doctors-0.266
Token so
Token ID523
Feature activation+9.46e-02

Pos logit contributions
 idea+0.173
 morning+0.154
 day+0.152
 people+0.151
 doctors+0.148
Neg logit contributions
ifying-0.193
igsaw-0.163
run-0.138
chie-0.134
jamin-0.124
Token happy
Token ID3772
Feature activation-1.17e-01

Pos logit contributions
 idea+0.098
 morning+0.082
 day+0.080
 doctors+0.078
 people+0.078
Neg logit contributions
ifying-0.235
igsaw-0.211
run-0.187
chie-0.182
jamin-0.172
Token and
Token ID290
Feature activation+1.16e-01

Pos logit contributions
ifying+0.277
igsaw+0.247
run+0.215
chie+0.208
 brief+0.200
Neg logit contributions
 idea-0.138
 morning-0.114
 day-0.112
 people-0.111
 prevent-0.109
Token place
Token ID1295
Feature activation-5.16e-01

Pos logit contributions
 idea+0.764
 paramedics+0.626
 doctors+0.623
 people+0.611
 prevent+0.608
Neg logit contributions
ifying-2.244
igsaw-2.074
run-1.814
chie-1.769
 brief-1.739
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
ifying+1.236
igsaw+1.101
run+0.956
chie+0.937
 brief+0.893
Neg logit contributions
 idea-0.586
 morning-0.487
 day-0.485
 people-0.477
 prevent-0.460
Token
Token ID220
Feature activation-4.87e-02

Pos logit contributions
ifying+3.320
igsaw+2.951
chie+2.631
run+2.576
 brief+2.535
Neg logit contributions
 idea-1.432
 morning-1.265
 day-1.243
 people-1.175
 whenever-1.121
Token
Token ID198
Feature activation-4.73e-01

Pos logit contributions
ifying+0.117
igsaw+0.104
run+0.092
chie+0.088
jamin+0.084
Neg logit contributions
 idea-0.056
 morning-0.046
 day-0.046
 people-0.045
 doctors-0.044
Token
Token ID198
Feature activation-8.27e-01

Pos logit contributions
ifying+1.188
igsaw+1.062
run+0.930
chie+0.911
 brief+0.871
Neg logit contributions
 idea-0.491
 morning-0.400
 day-0.397
 people-0.391
 prevent-0.375
TokenEm
Token ID10161
Feature activation-2.92e+00

Pos logit contributions
ifying+1.816
igsaw+1.596
run+1.369
chie+1.335
 brief+1.270
Neg logit contributions
 idea-1.112
 day-0.966
 morning-0.966
 people-0.954
 prevent-0.925
Tokenma
Token ID2611
Feature activation-2.84e-01

Pos logit contributions
ifying+7.323
igsaw+6.395
run+5.604
lic+5.268
 brief+5.219
Neg logit contributions
 idea-3.438
 day-3.243
 morning-3.043
 people-3.016
 if-2.749
Token was
Token ID373
Feature activation+8.72e-02

Pos logit contributions
ifying+0.653
igsaw+0.578
run+0.501
chie+0.490
jamin+0.467
Neg logit contributions
 idea-0.348
 morning-0.292
 day-0.289
 people-0.286
 prevent-0.278
Token so
Token ID523
Feature activation-2.06e-01

Pos logit contributions
 idea+0.135
 morning+0.117
 day+0.116
 people+0.116
 doctors+0.113
Neg logit contributions
ifying-0.173
igsaw-0.148
run-0.127
chie-0.124
jamin-0.115
Token proud
Token ID6613
Feature activation+1.84e-01

Pos logit contributions
ifying+0.525
igsaw+0.470
run+0.415
chie+0.407
jamin+0.386
Neg logit contributions
 idea-0.205
 morning-0.166
 day-0.163
 people-0.161
 doctors-0.157
Token of
Token ID286
Feature activation-1.42e+00

Pos logit contributions
 idea+0.202
 morning+0.163
 prevent+0.160
 day+0.158
 people+0.158
Neg logit contributions
ifying-0.444
igsaw-0.398
run-0.355
chie-0.335
 brief-0.324
Token the
Token ID262
Feature activation+6.04e-02

Pos logit contributions
ifying+4.252
igsaw+3.773
run+3.412
chie+3.372
lic+3.249
Neg logit contributions
 idea-1.247
 people-1.049
 day-0.997
 morning-0.985
 hospital-0.913
Token night
Token ID1755
Feature activation+1.59e-01

Pos logit contributions
 idea+0.067
 prevent+0.052
 morning+0.052
 day+0.052
 people+0.052
Neg logit contributions
ifying-0.148
igsaw-0.131
run-0.112
chie-0.111
 brief-0.108
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
 idea+0.170
 prevent+0.131
 morning+0.131
 day+0.131
 people+0.128
Neg logit contributions
ifying-0.397
igsaw-0.354
run-0.305
chie-0.298
 brief-0.289
Token
Token ID198
Feature activation-4.34e-01

Pos logit contributions
ifying+3.318
igsaw+2.910
chie+2.600
run+2.563
 brief+2.469
Neg logit contributions
 idea-1.443
 morning-1.252
 day-1.219
 people-1.190
 hospital-1.128
Token
Token ID198
Feature activation-9.61e-01

Pos logit contributions
ifying+1.084
igsaw+0.970
run+0.849
chie+0.831
 brief+0.794
Neg logit contributions
 idea-0.455
 morning-0.370
 day-0.367
 people-0.362
 prevent-0.348
TokenEm
Token ID10161
Feature activation-2.89e+00

Pos logit contributions
ifying+1.950
igsaw+1.693
run+1.433
chie+1.400
 brief+1.314
Neg logit contributions
 idea-1.455
 morning-1.297
 day-1.294
 people-1.283
 doctors-1.248
Tokenma
Token ID2611
Feature activation-1.95e-01

Pos logit contributions
ifying+7.295
igsaw+6.389
run+5.577
chie+5.246
lic+5.141
Neg logit contributions
 idea-3.319
 day-3.233
 morning-3.120
 people-3.041
 night-2.841
Token was
Token ID373
Feature activation-4.47e-01

Pos logit contributions
ifying+0.439
igsaw+0.384
run+0.335
chie+0.326
jamin+0.310
Neg logit contributions
 idea-0.252
 morning-0.212
 day-0.210
 people-0.207
 doctors-0.202
Token so
Token ID523
Feature activation-8.70e-01

Pos logit contributions
ifying+0.950
igsaw+0.831
run+0.707
chie+0.688
 brief+0.650
Neg logit contributions
 idea-0.638
 morning-0.552
 day-0.549
 people-0.543
 prevent-0.528
Token proud
Token ID6613
Feature activation+4.19e-01

Pos logit contributions
ifying+2.110
igsaw+1.885
run+1.622
chie+1.584
 brief+1.520
Neg logit contributions
 idea-1.005
 day-0.837
 morning-0.836
 people-0.833
 prevent-0.797
Token of
Token ID286
Feature activation-1.12e+00

Pos logit contributions
 idea+0.481
 morning+0.387
 prevent+0.383
 day+0.373
 people+0.369
Neg logit contributions
ifying-0.992
igsaw-0.888
run-0.798
chie-0.739
 brief-0.724
Token away
Token ID1497
Feature activation-5.78e-01

Pos logit contributions
ifying+0.174
igsaw+0.150
run+0.128
chie+0.125
jamin+0.117
Neg logit contributions
 idea-0.119
 morning-0.102
 day-0.101
 people-0.101
 doctors-0.100
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
ifying+1.498
igsaw+1.347
run+1.183
chie+1.160
 brief+1.111
Neg logit contributions
 idea-0.555
 morning-0.441
 day-0.439
 people-0.431
 prevent-0.412
Token
Token ID220
Feature activation+1.08e-01

Pos logit contributions
ifying+2.701
igsaw+2.356
chie+2.071
run+2.043
 brief+1.967
Neg logit contributions
 idea-1.539
 morning-1.356
 day-1.332
 people-1.297
 hospital-1.248
Token
Token ID198
Feature activation-6.97e-01

Pos logit contributions
 idea+0.128
 morning+0.106
 day+0.105
 people+0.103
 doctors+0.102
Neg logit contributions
ifying-0.255
igsaw-0.227
run-0.201
chie-0.191
jamin-0.183
Token
Token ID198
Feature activation-1.06e+00

Pos logit contributions
ifying+1.756
igsaw+1.566
run+1.368
chie+1.354
 brief+1.295
Neg logit contributions
 idea-0.719
 morning-0.591
 day-0.588
 people-0.577
 prevent-0.547
TokenEm
Token ID10161
Feature activation-2.88e+00

Pos logit contributions
ifying+2.204
igsaw+1.920
run+1.636
chie+1.599
 brief+1.520
Neg logit contributions
 idea-1.540
 morning-1.365
 day-1.365
 people-1.351
 doctors-1.307
Tokenma
Token ID2611
Feature activation-4.25e-01

Pos logit contributions
ifying+7.149
igsaw+6.248
run+5.411
 brief+5.171
lic+5.170
Neg logit contributions
 idea-3.423
 day-3.260
 morning-3.077
 people-2.929
 if-2.772
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
ifying+0.942
igsaw+0.829
run+0.712
chie+0.695
 brief+0.659
Neg logit contributions
 idea-0.560
 morning-0.478
 day-0.475
 people-0.469
 prevent-0.456
Token her
Token ID607
Feature activation-2.12e-01

Pos logit contributions
ifying+2.507
igsaw+2.175
run+1.923
chie+1.901
 brief+1.807
Neg logit contributions
 idea-1.154
 day-0.964
 morning-0.957
 people-0.953
 doctors-0.891
Token friends
Token ID2460
Feature activation-2.98e-01

Pos logit contributions
ifying+0.532
igsaw+0.478
run+0.418
chie+0.410
 brief+0.393
Neg logit contributions
 idea-0.217
 morning-0.177
 day-0.174
 people-0.170
 prevent-0.167
Token all
Token ID477
Feature activation-4.18e-01

Pos logit contributions
ifying+0.615
igsaw+0.533
run+0.453
chie+0.442
 brief+0.415
Neg logit contributions
 idea-0.439
 morning-0.381
 day-0.378
 people-0.374
 doctors-0.367
Token Emma
Token ID18966
Feature activation-1.06e+00

Pos logit contributions
 idea+0.470
 morning+0.402
 day+0.400
 prevent+0.397
 people+0.397
Neg logit contributions
ifying-0.632
igsaw-0.551
run-0.463
chie-0.445
jamin-0.426
Token!
Token ID0
Feature activation-2.81e-01

Pos logit contributions
ifying+2.611
igsaw+2.341
run+2.050
chie+2.006
 brief+1.933
Neg logit contributions
 idea-1.148
 morning-0.953
 day-0.952
 people-0.927
 doctors-0.864
Token'.
Token ID4458
Feature activation-9.48e-01

Pos logit contributions
ifying+0.586
igsaw+0.510
run+0.434
chie+0.420
jamin+0.397
Neg logit contributions
 idea-0.411
 morning-0.355
 day-0.354
 people-0.350
 prevent-0.342
Token
Token ID198
Feature activation-9.79e-01

Pos logit contributions
ifying+2.118
igsaw+1.857
chie+1.594
run+1.593
 brief+1.512
Neg logit contributions
 idea-1.244
 morning-1.076
 day-1.059
 people-1.050
 doctors-1.001
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
ifying+2.525
igsaw+2.242
chie+1.973
run+1.973
 brief+1.882
Neg logit contributions
 idea-0.966
 morning-0.789
 day-0.787
 people-0.769
 prevent-0.712
TokenEm
Token ID10161
Feature activation-2.88e+00

Pos logit contributions
ifying+2.425
igsaw+2.015
run+1.643
chie+1.624
 brief+1.495
Neg logit contributions
 idea-2.686
 people-2.454
 day-2.453
 morning-2.445
 doctors-2.377
Tokenma
Token ID2611
Feature activation-5.92e-01

Pos logit contributions
ifying+7.317
igsaw+6.211
run+5.601
chie+5.385
lic+5.266
Neg logit contributions
 idea-3.415
 day-3.143
 morning-3.059
 night-2.765
 people-2.693
Token was
Token ID373
Feature activation+6.59e-01

Pos logit contributions
ifying+1.241
igsaw+1.087
run+0.917
chie+0.894
 brief+0.845
Neg logit contributions
 idea-0.856
 morning-0.743
 day-0.740
 people-0.732
 prevent-0.710
Token so
Token ID523
Feature activation+2.55e-01

Pos logit contributions
 idea+1.111
 morning+0.981
 day+0.974
 people+0.962
 prevent+0.957
Neg logit contributions
ifying-1.217
igsaw-1.012
run-0.880
chie-0.827
jamin-0.795
Token happy
Token ID3772
Feature activation-4.74e-01

Pos logit contributions
 idea+0.246
 morning+0.201
 doctors+0.197
 day+0.196
 people+0.194
Neg logit contributions
ifying-0.651
igsaw-0.584
run-0.519
chie-0.504
jamin-0.481
Token she
Token ID673
Feature activation-6.71e-01

Pos logit contributions
ifying+1.172
igsaw+1.045
run+0.913
chie+0.894
 brief+0.854
Neg logit contributions
 idea-0.512
 morning-0.420
 day-0.417
 people-0.411
 prevent-0.395
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
ifying+0.221
igsaw+0.196
run+0.174
chie+0.167
jamin+0.159
Neg logit contributions
 idea-0.108
 morning-0.089
 day-0.088
 people-0.087
 doctors-0.085
TokenL
Token ID43
Feature activation-9.08e-01

Pos logit contributions
ifying+0.498
igsaw+0.440
run+0.383
chie+0.372
jamin+0.354
Neg logit contributions
 idea-0.272
 morning-0.229
 day-0.226
 people-0.222
 hospital-0.215
Tokenily
Token ID813
Feature activation-9.89e-01

Pos logit contributions
ifying+2.616
igsaw+2.323
run+2.078
chie+2.049
 brief+1.988
Neg logit contributions
 idea-0.685
 day-0.486
 morning-0.482
 people-0.467
 doctors-0.439
Token was
Token ID373
Feature activation+3.74e-02

Pos logit contributions
ifying+2.181
igsaw+1.930
run+1.607
chie+1.591
 brief+1.480
Neg logit contributions
 idea-1.372
 people-1.170
 day-1.164
 morning-1.162
 prevent-1.125
Token scared
Token ID12008
Feature activation-3.35e-01

Pos logit contributions
 idea+0.046
 morning+0.040
 day+0.039
 people+0.038
 prevent+0.037
Neg logit contributions
ifying-0.085
igsaw-0.075
run-0.066
chie-0.064
jamin-0.061
Token at
Token ID379
Feature activation-2.87e+00

Pos logit contributions
ifying+0.861
igsaw+0.772
run+0.681
chie+0.664
 brief+0.636
Neg logit contributions
 idea-0.330
 morning-0.264
 day-0.261
 people-0.256
 prevent-0.248
Token first
Token ID717
Feature activation+2.62e-01

Pos logit contributions
ifying+7.227
igsaw+6.117
run+5.384
lic+5.077
chie+4.989
Neg logit contributions
 idea-3.777
 day-3.645
 people-3.578
 morning-3.483
 if-3.373
Token,
Token ID11
Feature activation+1.64e-01

Pos logit contributions
 idea+0.247
 morning+0.202
 day+0.193
 prevent+0.192
 doctors+0.190
Neg logit contributions
ifying-0.666
igsaw-0.605
run-0.540
chie-0.513
 brief-0.499
Token but
Token ID475
Feature activation+4.40e-01

Pos logit contributions
 idea+0.228
 morning+0.197
 day+0.194
 prevent+0.189
 people+0.189
Neg logit contributions
ifying-0.345
igsaw-0.299
run-0.262
chie-0.252
jamin-0.241
Token then
Token ID788
Feature activation+5.48e-01

Pos logit contributions
 idea+0.511
 morning+0.404
 day+0.393
 people+0.393
 prevent+0.386
Neg logit contributions
ifying-1.036
igsaw-0.915
run-0.822
chie-0.778
jamin-0.766
Token she
Token ID673
Feature activation-1.18e+00

Pos logit contributions
 idea+0.615
 morning+0.487
 day+0.485
 prevent+0.468
cut+0.460
Neg logit contributions
ifying-1.301
igsaw-1.163
run-1.033
 brief-0.978
chie-0.973
Token away
Token ID1497
Feature activation-4.28e-01

Pos logit contributions
 idea+0.689
 people+0.590
 doctors+0.586
 morning+0.586
 day+0.584
Neg logit contributions
ifying-0.962
igsaw-0.819
run-0.711
chie-0.684
 brief-0.650
Token to
Token ID284
Feature activation+5.23e-01

Pos logit contributions
ifying+1.103
igsaw+0.990
run+0.871
chie+0.853
 brief+0.817
Neg logit contributions
 idea-0.413
 morning-0.330
 day-0.327
 people-0.322
 prevent-0.307
Token hear
Token ID3285
Feature activation-4.68e-02

Pos logit contributions
 idea+0.581
 morning+0.495
 day+0.486
cut+0.453
 doctors+0.453
Neg logit contributions
ifying-1.219
igsaw-1.106
run-0.976
chie-0.946
 brief-0.908
Token her
Token ID607
Feature activation-2.04e-01

Pos logit contributions
ifying+0.119
igsaw+0.107
run+0.094
chie+0.091
 brief+0.088
Neg logit contributions
 idea-0.047
 morning-0.037
 day-0.037
 people-0.035
 prevent-0.035
Token.
Token ID13
Feature activation-4.40e-01

Pos logit contributions
ifying+0.507
igsaw+0.453
run+0.399
chie+0.387
 brief+0.372
Neg logit contributions
 idea-0.214
 morning-0.177
 day-0.174
 people-0.172
 doctors-0.166
Token Ros
Token ID10018
Feature activation-2.85e+00

Pos logit contributions
ifying+1.018
igsaw+0.901
run+0.778
chie+0.760
 brief+0.723
Neg logit contributions
 idea-0.545
 morning-0.459
 day-0.457
 people-0.451
 prevent-0.436
Tokeny
Token ID88
Feature activation-2.42e-01

Pos logit contributions
ifying+7.646
igsaw+6.886
 brief+5.859
run+5.841
chie+5.681
Neg logit contributions
 idea-2.999
 day-2.421
 morning-2.235
 people-2.202
 night-2.062
Token watched
Token ID7342
Feature activation+6.15e-01

Pos logit contributions
ifying+0.515
igsaw+0.450
run+0.387
chie+0.376
 brief+0.357
Neg logit contributions
 idea-0.336
 morning-0.290
 day-0.288
 people-0.283
 prevent-0.278
Token sadly
Token ID21098
Feature activation+3.82e-02

Pos logit contributions
 idea+0.668
 morning+0.549
 day+0.545
 hospital+0.525
 paramedics+0.515
Neg logit contributions
ifying-1.493
igsaw-1.339
run-1.178
chie-1.139
 brief-1.076
Token as
Token ID355
Feature activation+1.68e-02

Pos logit contributions
 idea+0.038
 morning+0.030
 day+0.030
 people+0.029
 doctors+0.029
Neg logit contributions
ifying-0.097
igsaw-0.087
run-0.077
chie-0.074
 brief-0.072
Token her
Token ID607
Feature activation-5.21e-01

Pos logit contributions
 idea+0.019
 morning+0.015
 day+0.015
 people+0.015
 prevent+0.015
Neg logit contributions
ifying-0.040
igsaw-0.036
run-0.031
chie-0.031
 brief-0.029
Tokenie
Token ID494
Feature activation-3.19e-01

Pos logit contributions
ifying+0.122
igsaw+0.107
run+0.092
chie+0.091
jamin+0.085
Neg logit contributions
 idea-0.080
 morning-0.070
 day-0.068
 people-0.068
 prevent-0.066
Token.
Token ID13
Feature activation-1.64e-01

Pos logit contributions
ifying+0.816
igsaw+0.731
run+0.645
chie+0.631
 brief+0.604
Neg logit contributions
 idea-0.311
 morning-0.250
 day-0.245
 people-0.242
 prevent-0.235
Token They
Token ID1119
Feature activation+8.45e-02

Pos logit contributions
ifying+0.373
igsaw+0.331
run+0.286
chie+0.275
jamin+0.262
Neg logit contributions
 idea-0.209
 morning-0.177
 day-0.175
 people-0.174
 doctors-0.170
Token both
Token ID1111
Feature activation-1.31e-01

Pos logit contributions
 idea+0.111
 morning+0.095
 day+0.092
 people+0.092
 hospital+0.091
Neg logit contributions
ifying-0.187
igsaw-0.161
run-0.140
chie-0.138
jamin-0.131
Token had
Token ID550
Feature activation-1.01e+00

Pos logit contributions
ifying+0.310
igsaw+0.272
run+0.239
chie+0.234
jamin+0.223
Neg logit contributions
 idea-0.151
 morning-0.125
 day-0.123
 people-0.122
 prevent-0.120
Token birth
Token ID4082
Feature activation-2.85e+00

Pos logit contributions
ifying+2.387
igsaw+2.160
run+1.856
chie+1.813
 brief+1.707
Neg logit contributions
 idea-1.207
 day-1.040
 people-1.039
 morning-1.036
 doctors-0.990
Tokendays
Token ID12545
Feature activation-7.76e-01

Pos logit contributions
ifying+4.718
igsaw+4.361
chie+3.165
jamin+3.162
 brief+2.970
Neg logit contributions
 idea-5.363
 day-5.070
 if-5.055
 whenever-4.912
 morning-4.862
Token coming
Token ID2406
Feature activation-5.17e-01

Pos logit contributions
ifying+1.949
igsaw+1.742
run+1.517
chie+1.501
jamin+1.428
Neg logit contributions
 idea-0.817
 day-0.676
 morning-0.672
 people-0.653
 prevent-0.618
Token up
Token ID510
Feature activation-6.69e-01

Pos logit contributions
ifying+1.165
igsaw+1.029
run+0.884
chie+0.862
 brief+0.818
Neg logit contributions
 idea-0.672
 morning-0.573
 day-0.571
 people-0.564
 prevent-0.546
Token,
Token ID11
Feature activation+9.59e-02

Pos logit contributions
ifying+1.598
igsaw+1.416
run+1.234
chie+1.210
jamin+1.148
Neg logit contributions
 idea-0.786
 day-0.661
 morning-0.660
 people-0.646
 prevent-0.618
Token and
Token ID290
Feature activation-2.99e-01

Pos logit contributions
 idea+0.122
 morning+0.102
 day+0.100
 people+0.099
 prevent+0.098
Neg logit contributions
ifying-0.216
igsaw-0.189
run-0.166
chie-0.158
 brief-0.153
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
 idea+0.782
 doctors+0.653
 morning+0.651
 people+0.637
 day+0.634
Neg logit contributions
ifying-1.302
igsaw-1.162
run-1.043
chie-0.949
lic-0.946
TokenThey
Token ID2990
Feature activation+1.01e-01

Pos logit contributions
 idea+0.305
 morning+0.258
 day+0.253
 people+0.247
 hospital+0.245
Neg logit contributions
ifying-0.479
igsaw-0.415
run-0.360
chie-0.351
jamin-0.333
Token have
Token ID423
Feature activation-2.26e-01

Pos logit contributions
 idea+0.124
 morning+0.105
 day+0.104
 people+0.102
 doctors+0.102
Neg logit contributions
ifying-0.234
igsaw-0.205
run-0.174
chie-0.173
 brief-0.165
Token fun
Token ID1257
Feature activation+5.95e-01

Pos logit contributions
ifying+0.551
igsaw+0.491
run+0.429
chie+0.419
 brief+0.403
Neg logit contributions
 idea-0.248
 morning-0.205
 day-0.203
 people-0.200
 prevent-0.193
Token with
Token ID351
Feature activation+6.69e-01

Pos logit contributions
 idea+0.764
 morning+0.651
 paramedics+0.637
 hospital+0.625
crow+0.623
Neg logit contributions
ifying-1.306
igsaw-1.152
run-1.017
chie-0.958
 brief-0.944
Token the
Token ID262
Feature activation+2.10e+00

Pos logit contributions
 idea+0.751
 morning+0.622
 day+0.598
bye+0.576
crow+0.571
Neg logit contributions
ifying-1.604
igsaw-1.411
run-1.250
chie-1.167
 brief-1.162
Token wheat
Token ID17135
Feature activation+4.96e-01

Pos logit contributions
 bal+1.765
 idea+1.709
 hospital+1.652
crow+1.646
bye+1.639
Neg logit contributions
ifying-5.057
igsaw-4.470
chie-4.028
 brief-3.976
run-3.817
Token.
Token ID13
Feature activation+6.84e-01

Pos logit contributions
 idea+0.525
 morning+0.433
 prevent+0.423
 doctors+0.419
 paramedics+0.417
Neg logit contributions
ifying-1.196
igsaw-1.061
run-0.927
chie-0.897
 brief-0.874
Token But
Token ID887
Feature activation+9.34e-01

Pos logit contributions
 idea+0.927
cut+0.787
 prevent+0.778
 people+0.762
 morning+0.761
Neg logit contributions
ifying-1.531
igsaw-1.306
run-1.138
lic-1.029
chie-1.022
Token they
Token ID484
Feature activation-4.39e-01

Pos logit contributions
 idea+1.126
 paramedics+0.843
 morning+0.839
 prevent+0.838
crow+0.831
Neg logit contributions
ifying-2.190
igsaw-1.863
run-1.664
chie-1.581
 nowhere-1.565
Token do
Token ID466
Feature activation-7.96e-01

Pos logit contributions
ifying+1.077
igsaw+0.960
run+0.838
chie+0.820
 brief+0.783
Neg logit contributions
 idea-0.480
 morning-0.395
 day-0.392
 people-0.387
 prevent-0.372
Token holds
Token ID6622
Feature activation+3.61e-01

Pos logit contributions
 idea+0.141
 morning+0.123
 day+0.123
 people+0.121
 prevent+0.119
Neg logit contributions
ifying-0.201
igsaw-0.176
run-0.150
chie-0.144
jamin-0.139
Token the
Token ID262
Feature activation+9.06e-01

Pos logit contributions
 idea+0.387
 day+0.311
 morning+0.310
 prevent+0.304
 doctors+0.303
Neg logit contributions
ifying-0.887
igsaw-0.789
run-0.704
chie-0.671
jamin-0.646
Token rail
Token ID6787
Feature activation+3.71e-01

Pos logit contributions
 idea+0.898
 morning+0.776
 whenever+0.741
 doctors+0.740
 day+0.724
Neg logit contributions
ifying-2.202
igsaw-1.949
chie-1.775
run-1.752
 brief-1.674
Token.
Token ID13
Feature activation+6.82e-01

Pos logit contributions
 idea+0.412
 morning+0.345
 people+0.335
 day+0.335
 doctors+0.334
Neg logit contributions
ifying-0.876
igsaw-0.784
run-0.695
chie-0.669
 brief-0.641
Token She
Token ID1375
Feature activation+4.76e-01

Pos logit contributions
 idea+0.944
 day+0.822
 morning+0.802
 people+0.801
 prevent+0.796
Neg logit contributions
ifying-1.454
igsaw-1.287
run-1.125
chie-1.014
jamin-1.007
Token feels
Token ID5300
Feature activation+2.46e+00

Pos logit contributions
 idea+0.673
 day+0.599
 morning+0.595
 people+0.570
 doctors+0.561
Neg logit contributions
ifying-0.986
igsaw-0.852
run-0.749
chie-0.695
 brief-0.687
Token better
Token ID1365
Feature activation-5.53e-02

Pos logit contributions
 idea+3.446
bye+3.324
 day+3.145
 morning+3.124
 ostr+3.038
Neg logit contributions
ifying-5.145
igsaw-4.029
run-3.993
 nowhere-3.419
 brief-3.346
Token.
Token ID13
Feature activation+6.42e-01

Pos logit contributions
ifying+0.141
igsaw+0.126
run+0.112
chie+0.109
 brief+0.105
Neg logit contributions
 idea-0.055
 morning-0.044
 day-0.042
 people-0.042
 doctors-0.041
Token She
Token ID1375
Feature activation+7.46e-01

Pos logit contributions
 idea+0.907
 day+0.772
 prevent+0.761
 people+0.756
 morning+0.755
Neg logit contributions
ifying-1.361
igsaw-1.200
run-1.044
chie-0.944
jamin-0.940
Token climbs
Token ID34332
Feature activation+8.85e-01

Pos logit contributions
 idea+1.060
 day+0.948
 morning+0.940
 people+0.895
crow+0.884
Neg logit contributions
ifying-1.528
igsaw-1.307
run-1.155
 brief-1.074
chie-1.063
Token up
Token ID510
Feature activation+2.74e-01

Pos logit contributions
 idea+1.074
 doctors+0.823
 prevent+0.817
 paramedics+0.812
 people+0.810
Neg logit contributions
ifying-2.049
igsaw-1.791
run-1.583
chie-1.535
 brief-1.479
Token the
Token ID262
Feature activation-2.70e-02

Pos logit contributions
 idea+0.539
 morning+0.440
 day+0.437
 prevent+0.428
 paramedics+0.421
Neg logit contributions
ifying-1.099
igsaw-0.973
run-0.856
chie-0.815
 brief-0.796
Token children
Token ID1751
Feature activation+1.85e-01

Pos logit contributions
ifying+0.062
igsaw+0.054
run+0.046
chie+0.046
 brief+0.044
Neg logit contributions
 idea-0.033
 morning-0.028
 day-0.027
 prevent-0.027
 doctors-0.027
Token play
Token ID711
Feature activation+5.49e-01

Pos logit contributions
 idea+0.218
 morning+0.180
 day+0.176
 doctors+0.171
 people+0.171
Neg logit contributions
ifying-0.435
igsaw-0.382
run-0.337
chie-0.324
 brief-0.313
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
 idea+0.611
 morning+0.506
 prevent+0.491
 paramedics+0.489
 day+0.489
Neg logit contributions
ifying-1.295
igsaw-1.158
run-1.024
chie-0.974
 brief-0.957
Token He
Token ID679
Feature activation+2.59e-01

Pos logit contributions
 idea+0.178
 day+0.154
 morning+0.154
 prevent+0.151
 people+0.150
Neg logit contributions
ifying-0.258
igsaw-0.225
run-0.195
chie-0.180
jamin-0.175
Token feels
Token ID5300
Feature activation+1.78e+00

Pos logit contributions
 idea+0.346
 morning+0.298
 day+0.298
 people+0.287
 prevent+0.284
Neg logit contributions
ifying-0.560
igsaw-0.484
run-0.422
chie-0.403
 brief-0.395
Token happy
Token ID3772
Feature activation-1.42e-01

Pos logit contributions
 idea+2.386
 morning+2.015
 day+1.999
bye+1.956
 paramedics+1.911
Neg logit contributions
ifying-3.972
igsaw-3.176
run-3.022
jamin-2.705
 brief-2.685
Token and
Token ID290
Feature activation-3.65e-01

Pos logit contributions
ifying+0.351
igsaw+0.313
run+0.276
chie+0.266
 brief+0.258
Neg logit contributions
 idea-0.152
 morning-0.123
 day-0.121
 prevent-0.118
 people-0.118
Token warm
Token ID5814
Feature activation-6.27e-01

Pos logit contributions
ifying+0.862
igsaw+0.764
run+0.663
chie+0.648
 brief+0.618
Neg logit contributions
 idea-0.432
 morning-0.361
 day-0.358
 people-0.353
 doctors-0.342
Token.
Token ID13
Feature activation-2.46e-02

Pos logit contributions
ifying+1.665
igsaw+1.506
run+1.327
chie+1.306
 brief+1.241
Neg logit contributions
 idea-0.569
 morning-0.453
 day-0.448
 people-0.441
 doctors-0.416
Token He
Token ID679
Feature activation+4.39e-01

Pos logit contributions
ifying+0.054
igsaw+0.048
run+0.042
chie+0.039
jamin+0.038
Neg logit contributions
 idea-0.033
 morning-0.028
 day-0.028
 prevent-0.027
 people-0.027
Token was
Token ID373
Feature activation+6.77e-01

Pos logit contributions
 idea+0.914
 morning+0.803
 paramedics+0.786
 prevent+0.778
 doctors+0.777
Neg logit contributions
ifying-1.331
igsaw-1.167
run-1.008
chie-0.958
jamin-0.922
Token now
Token ID783
Feature activation+9.69e-02

Pos logit contributions
 idea+0.873
crow+0.753
 morning+0.752
 paramedics+0.745
 people+0.738
Neg logit contributions
ifying-1.468
igsaw-1.279
run-1.114
chie-1.072
 brief-1.029
Token happy
Token ID3772
Feature activation+1.32e-01

Pos logit contributions
 idea+0.122
 morning+0.103
 people+0.101
 day+0.101
 prevent+0.098
Neg logit contributions
ifying-0.219
igsaw-0.193
run-0.167
chie-0.161
 brief-0.156
Token.
Token ID13
Feature activation+4.20e-01

Pos logit contributions
 idea+0.155
 morning+0.128
 paramedics+0.124
 day+0.124
 prevent+0.124
Neg logit contributions
ifying-0.306
igsaw-0.271
run-0.238
chie-0.228
 brief-0.220
Token She
Token ID1375
Feature activation+2.37e-01

Pos logit contributions
 idea+0.584
 prevent+0.488
 day+0.484
 people+0.480
 doctors+0.479
Neg logit contributions
ifying-0.909
igsaw-0.810
run-0.693
chie-0.652
jamin-0.642
Token showed
Token ID3751
Feature activation+1.93e+00

Pos logit contributions
 idea+0.317
 morning+0.272
 day+0.268
 people+0.265
 prevent+0.258
Neg logit contributions
ifying-0.513
igsaw-0.445
run-0.391
chie-0.374
jamin-0.366
Token the
Token ID262
Feature activation+1.71e-01

Pos logit contributions
 idea+2.438
 paramedics+2.044
cut+2.015
 day+1.958
 morning+1.942
Neg logit contributions
ifying-4.007
igsaw-3.784
run-3.271
chie-3.148
 nowhere-3.006
Token pie
Token ID2508
Feature activation+1.62e-01

Pos logit contributions
 idea+0.224
 morning+0.197
 day+0.192
 prevent+0.190
 people+0.188
Neg logit contributions
ifying-0.363
igsaw-0.323
run-0.270
chie-0.269
 brief-0.254
Token a
Token ID257
Feature activation-2.63e-01

Pos logit contributions
 idea+0.218
 morning+0.188
 people+0.184
 prevent+0.183
 paramedics+0.183
Neg logit contributions
ifying-0.344
igsaw-0.307
run-0.263
chie-0.250
 brief-0.241
Token spark
Token ID9009
Feature activation+5.75e-03

Pos logit contributions
ifying+0.606
igsaw+0.537
run+0.464
chie+0.452
 brief+0.433
Neg logit contributions
 idea-0.322
 morning-0.273
 day-0.269
 people-0.268
 doctors-0.262
Tokenly
Token ID306
Feature activation+6.32e-02

Pos logit contributions
 idea+0.009
 morning+0.008
 people+0.008
 day+0.008
 doctors+0.008
Neg logit contributions
ifying-0.011
igsaw-0.009
run-0.008
chie-0.007
 brief-0.007
Token friends
Token ID2460
Feature activation-5.20e-01

Pos logit contributions
 idea+0.595
 morning+0.502
 prevent+0.501
 day+0.489
cut+0.482
Neg logit contributions
ifying-1.128
igsaw-0.998
run-0.868
chie-0.834
 brief-0.823
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
ifying+1.369
igsaw+1.233
run+1.086
chie+1.067
 brief+1.021
Neg logit contributions
 idea-0.476
 morning-0.376
 day-0.373
 people-0.367
 prevent-0.347
Token<|endoftext|>
Token ID50256
Feature activation-1.02e+00

Pos logit contributions
ifying+0.273
igsaw+0.242
run+0.210
chie+0.202
jamin+0.195
Neg logit contributions
 idea-0.136
 morning-0.112
 day-0.110
 people-0.110
 prevent-0.109
TokenTom
Token ID13787
Feature activation+4.64e-01

Pos logit contributions
ifying+2.329
igsaw+2.067
run+1.791
chie+1.774
 brief+1.664
Neg logit contributions
 idea-1.272
 morning-1.122
 day-1.119
 people-1.095
 doctors-1.022
Token and
Token ID290
Feature activation-4.52e-01

Pos logit contributions
 idea+0.583
 paramedics+0.490
 doctors+0.486
 prevent+0.483
 people+0.480
Neg logit contributions
ifying-1.024
igsaw-0.910
run-0.787
jamin-0.763
chie-0.752
Token Lily
Token ID20037
Feature activation+1.81e+00

Pos logit contributions
ifying+1.096
igsaw+0.976
run+0.849
chie+0.831
 brief+0.792
Neg logit contributions
 idea-0.513
 morning-0.425
 day-0.422
 people-0.416
 prevent-0.401
Token liked
Token ID8288
Feature activation-1.61e-01

Pos logit contributions
 idea+2.525
 paramedics+2.240
 doctors+2.221
cut+2.146
 morning+2.071
Neg logit contributions
ifying-3.723
igsaw-3.238
jamin-2.777
run-2.579
 brief-2.557
Token to
Token ID284
Feature activation-5.00e-01

Pos logit contributions
ifying+0.339
igsaw+0.298
run+0.253
chie+0.244
 brief+0.234
Neg logit contributions
 idea-0.230
 morning-0.196
 day-0.193
 people-0.192
 prevent-0.190
Token play
Token ID711
Feature activation+4.02e-01

Pos logit contributions
ifying+1.219
igsaw+1.086
run+0.947
chie+0.927
 brief+0.883
Neg logit contributions
 idea-0.555
 morning-0.458
 day-0.455
 people-0.448
 prevent-0.432
Token in
Token ID287
Feature activation-7.95e-01

Pos logit contributions
 idea+0.522
 morning+0.451
 day+0.443
 paramedics+0.440
 prevent+0.429
Neg logit contributions
ifying-0.873
igsaw-0.774
run-0.653
chie-0.628
 brief-0.623
Token the
Token ID262
Feature activation+1.50e-01

Pos logit contributions
ifying+2.167
igsaw+1.932
run+1.730
chie+1.696
 brief+1.617
Neg logit contributions
 idea-0.687
 day-0.549
 morning-0.544
 people-0.535
 prevent-0.491
Token thick
Token ID6546
Feature activation-1.78e-01

Pos logit contributions
 idea+0.319
 morning+0.260
 day+0.252
 prevent+0.252
 hospital+0.247
Neg logit contributions
ifying-0.781
igsaw-0.686
chie-0.593
run-0.592
 brief-0.569
Token soil
Token ID9260
Feature activation-6.47e-01

Pos logit contributions
ifying+0.450
igsaw+0.404
run+0.352
chie+0.346
 brief+0.333
Neg logit contributions
 idea-0.175
 morning-0.140
 day-0.138
 people-0.136
 prevent-0.135
Token.
Token ID13
Feature activation+8.70e-01

Pos logit contributions
ifying+1.694
igsaw+1.528
run+1.344
chie+1.324
 brief+1.265
Neg logit contributions
 idea-0.600
 day-0.476
 morning-0.475
 people-0.463
 doctors-0.435
Token On
Token ID1550
Feature activation-1.37e-01

Pos logit contributions
 idea+1.149
 doctors+0.998
 prevent+0.989
 people+0.975
 paramedics+0.956
Neg logit contributions
ifying-1.871
igsaw-1.673
run-1.481
jamin-1.337
chie-1.292
Token one
Token ID530
Feature activation+8.79e-02

Pos logit contributions
ifying+0.342
igsaw+0.307
run+0.267
chie+0.262
jamin+0.251
Neg logit contributions
 idea-0.141
 people-0.111
 morning-0.111
 day-0.110
 prevent-0.108
Token summer
Token ID3931
Feature activation+5.06e-01

Pos logit contributions
 idea+0.084
 people+0.065
 doctors+0.065
 paramedics+0.065
 morning+0.064
Neg logit contributions
ifying-0.225
igsaw-0.202
run-0.179
chie-0.173
 brief-0.166
Token day
Token ID1110
Feature activation+8.00e-01

Pos logit contributions
 idea+0.267
 prevent+0.164
 paramedics+0.162
 doctors+0.160
 hospital+0.151
Neg logit contributions
ifying-1.480
igsaw-1.391
run-1.231
chie-1.182
jamin-1.167
Token,
Token ID11
Feature activation+7.60e-01

Pos logit contributions
 idea+0.845
 doctors+0.697
 prevent+0.683
 hospital+0.681
 paramedics+0.676
Neg logit contributions
ifying-1.963
igsaw-1.771
run-1.555
chie-1.445
lic-1.423
Token his
Token ID465
Feature activation+4.20e-01

Pos logit contributions
 idea+0.905
 paramedics+0.712
 morning+0.711
crow+0.708
 prevent+0.705
Neg logit contributions
ifying-1.784
igsaw-1.547
run-1.370
 brief-1.271
chie-1.241
Token mom
Token ID1995
Feature activation+1.60e+00

Pos logit contributions
 idea+0.393
 prevent+0.320
 morning+0.319
crow+0.306
 people+0.305
Neg logit contributions
ifying-1.067
igsaw-0.969
run-0.839
chie-0.814
 brief-0.804
Token came
Token ID1625
Feature activation-5.58e-01

Pos logit contributions
 idea+2.723
crow+2.375
 morning+2.356
 hospital+2.352
 prevent+2.333
Neg logit contributions
ifying-2.761
igsaw-2.232
run-2.073
jamin-2.011
lic-1.925
Token new
Token ID649
Feature activation+1.68e-01

Pos logit contributions
ifying+1.174
igsaw+1.046
run+0.912
chie+0.893
 brief+0.852
Neg logit contributions
 idea-0.534
 morning-0.441
 day-0.438
 people-0.431
 prevent-0.416
Token,
Token ID11
Feature activation+1.28e-01

Pos logit contributions
 idea+0.225
 morning+0.192
 day+0.189
 people+0.188
 prevent+0.187
Neg logit contributions
ifying-0.361
igsaw-0.318
run-0.277
chie-0.265
 brief-0.255
Token so
Token ID523
Feature activation+2.40e-01

Pos logit contributions
 idea+0.183
 morning+0.161
 day+0.159
 people+0.154
 prevent+0.153
Neg logit contributions
ifying-0.264
igsaw-0.233
run-0.199
chie-0.191
jamin-0.180
Token it
Token ID340
Feature activation+4.29e-02

Pos logit contributions
 idea+0.339
 morning+0.285
 day+0.281
 hospital+0.278
 people+0.276
Neg logit contributions
ifying-0.512
igsaw-0.443
run-0.387
chie-0.369
 brief-0.347
Token might
Token ID1244
Feature activation+2.96e-02

Pos logit contributions
 idea+0.056
 morning+0.048
 day+0.047
 people+0.046
 doctors+0.045
Neg logit contributions
ifying-0.095
igsaw-0.083
run-0.072
chie-0.071
jamin-0.067
Token take
Token ID1011
Feature activation+2.59e-01

Pos logit contributions
 idea+0.039
 morning+0.033
 day+0.033
 people+0.032
 doctors+0.032
Neg logit contributions
ifying-0.065
igsaw-0.057
run-0.049
chie-0.049
 brief-0.046
Token a
Token ID257
Feature activation-7.23e-01

Pos logit contributions
 idea+0.305
 morning+0.246
 day+0.245
 paramedics+0.242
 doctors+0.240
Neg logit contributions
ifying-0.607
igsaw-0.540
run-0.470
chie-0.460
 brief-0.434
Token few
Token ID1178
Feature activation+3.21e-01

Pos logit contributions
ifying+1.456
igsaw+1.258
run+1.056
chie+1.031
jamin+0.975
Neg logit contributions
 idea-1.121
 morning-0.989
 day-0.986
 people-0.962
 prevent-0.943
Token times
Token ID1661
Feature activation+1.44e-01

Pos logit contributions
 idea+0.420
 morning+0.350
 day+0.348
 prevent+0.343
 people+0.342
Neg logit contributions
ifying-0.705
igsaw-0.624
run-0.548
chie-0.520
 brief-0.503
Token for
Token ID329
Feature activation-9.10e-01

Pos logit contributions
 idea+0.162
 morning+0.130
 day+0.128
 people+0.127
 doctors+0.125
Neg logit contributions
ifying-0.348
igsaw-0.307
run-0.271
chie-0.263
 brief-0.249
Token it
Token ID340
Feature activation-1.93e-01

Pos logit contributions
ifying+2.212
igsaw+1.959
run+1.707
chie+1.665
jamin+1.599
Neg logit contributions
 idea-1.031
 people-0.880
 morning-0.877
 day-0.873
 doctors-0.832
Token friends
Token ID2460
Feature activation-3.02e-02

Pos logit contributions
 idea+0.643
 morning+0.559
 day+0.544
 hospital+0.504
 people+0.504
Neg logit contributions
ifying-1.228
igsaw-1.084
run-0.977
chie-0.940
 brief-0.905
Token.
Token ID13
Feature activation+8.58e-01

Pos logit contributions
ifying+0.070
igsaw+0.062
run+0.055
chie+0.053
 brief+0.051
Neg logit contributions
 idea-0.036
 morning-0.030
 day-0.030
 people-0.029
 prevent-0.029
Token
Token ID198
Feature activation+8.69e-01

Pos logit contributions
 idea+1.238
 day+1.033
 morning+1.031
 prevent+1.030
 doctors+1.012
Neg logit contributions
ifying-1.831
igsaw-1.600
run-1.395
jamin-1.270
chie-1.257
Token
Token ID198
Feature activation+5.68e-01

Pos logit contributions
 idea+1.188
 morning+0.988
 doctors+0.977
 day+0.953
 prevent+0.946
Neg logit contributions
ifying-1.841
igsaw-1.668
run-1.474
chie-1.362
jamin-1.328
Token"
Token ID1
Feature activation+2.48e-01

Pos logit contributions
 idea+0.800
 morning+0.664
 day+0.644
 people+0.635
 doctors+0.635
Neg logit contributions
ifying-1.231
igsaw-1.085
run-0.938
chie-0.904
 brief-0.865
TokenIt
Token ID1026
Feature activation+8.18e-02

Pos logit contributions
 idea+0.332
 morning+0.280
 day+0.275
 people+0.265
 hospital+0.265
Neg logit contributions
ifying-0.550
igsaw-0.481
run-0.416
chie-0.408
 brief-0.393
Token's
Token ID338
Feature activation+6.18e-01

Pos logit contributions
 idea+0.109
 morning+0.095
 day+0.094
 people+0.093
 doctors+0.091
Neg logit contributions
ifying-0.175
igsaw-0.154
run-0.134
chie-0.130
jamin-0.124
Token okay
Token ID8788
Feature activation-4.48e-01

Pos logit contributions
 idea+0.675
 morning+0.574
crow+0.557
 people+0.554
cut+0.554
Neg logit contributions
ifying-1.467
igsaw-1.293
run-1.161
chie-1.112
 brief-1.070
Token,
Token ID11
Feature activation+4.23e-01

Pos logit contributions
ifying+1.218
igsaw+1.099
run+0.973
chie+0.955
 brief+0.917
Neg logit contributions
 idea-0.379
 morning-0.292
 day-0.289
 people-0.283
 prevent-0.268
Token Tom
Token ID4186
Feature activation+1.35e+00

Pos logit contributions
 idea+0.479
 morning+0.403
 day+0.393
 prevent+0.376
 people+0.367
Neg logit contributions
ifying-0.997
igsaw-0.894
run-0.785
chie-0.753
 brief-0.732
Token,"
Token ID553
Feature activation+4.73e-01

Pos logit contributions
 idea+1.600
 morning+1.410
 paramedics+1.363
 hospital+1.343
 day+1.326
Neg logit contributions
ifying-2.888
igsaw-2.557
run-2.377
chie-2.228
jamin-2.181
Token He
Token ID679
Feature activation+1.74e-01

Pos logit contributions
ifying+0.854
igsaw+0.753
run+0.648
chie+0.631
 brief+0.598
Neg logit contributions
 idea-0.499
 morning-0.424
 day-0.422
 people-0.417
 prevent-0.406
Token learned
Token ID4499
Feature activation-1.32e+00

Pos logit contributions
 idea+0.229
 morning+0.197
 day+0.195
 people+0.191
 prevent+0.185
Neg logit contributions
ifying-0.379
igsaw-0.327
run-0.288
chie-0.279
jamin-0.268
Token that
Token ID326
Feature activation+1.48e+00

Pos logit contributions
ifying+3.401
igsaw+3.005
chie+2.611
run+2.605
jamin+2.486
Neg logit contributions
 idea-1.394
 people-1.170
 day-1.150
 morning-1.142
 doctors-1.136
Token sharing
Token ID7373
Feature activation-7.66e-01

Pos logit contributions
 idea+2.043
 paramedics+1.698
crow+1.679
 morning+1.663
bye+1.638
Neg logit contributions
ifying-2.943
igsaw-2.702
run-2.333
 nowhere-2.207
chie-2.123
Token is
Token ID318
Feature activation+4.36e-01

Pos logit contributions
ifying+1.900
igsaw+1.692
run+1.472
chie+1.455
 brief+1.380
Neg logit contributions
 idea-0.839
 day-0.692
 people-0.691
 morning-0.685
 prevent-0.650
Token good
Token ID922
Feature activation+2.24e-01

Pos logit contributions
 idea+0.504
 morning+0.431
 day+0.408
crow+0.396
 paramedics+0.393
Neg logit contributions
ifying-1.001
igsaw-0.887
run-0.794
chie-0.766
 brief-0.727
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
 idea+0.239
 morning+0.202
 prevent+0.194
 day+0.193
 paramedics+0.192
Neg logit contributions
ifying-0.539
igsaw-0.486
run-0.431
chie-0.410
 brief-0.395
Token So
Token ID1406
Feature activation+7.79e-01

Pos logit contributions
 idea+0.170
 morning+0.143
 day+0.142
 prevent+0.141
 people+0.140
Neg logit contributions
ifying-0.270
igsaw-0.240
run-0.208
chie-0.197
jamin-0.189
Token,
Token ID11
Feature activation+8.12e-01

Pos logit contributions
 idea+0.840
 paramedics+0.671
 doctors+0.650
 cows+0.635
 prevent+0.634
Neg logit contributions
ifying-1.879
igsaw-1.687
run-1.480
chie-1.410
 brief-1.376
Token he
Token ID339
Feature activation+6.45e-01

Pos logit contributions
 idea+1.043
 morning+0.829
 day+0.804
 paramedics+0.804
 prevent+0.796
Neg logit contributions
ifying-1.812
igsaw-1.592
run-1.390
chie-1.312
 brief-1.283
Token shared
Token ID4888
Feature activation-4.59e-01

Pos logit contributions
 idea+0.922
 morning+0.812
 day+0.796
 people+0.764
 prevent+0.748
Neg logit contributions
ifying-1.315
igsaw-1.122
run-0.982
chie-0.939
jamin-0.928
Token
Token ID198
Feature activation-8.02e-02

Pos logit contributions
 idea+0.328
 morning+0.276
 day+0.270
 prevent+0.267
 doctors+0.267
Neg logit contributions
ifying-0.619
igsaw-0.548
run-0.493
chie-0.460
jamin-0.443
TokenOne
Token ID3198
Feature activation+2.10e-01

Pos logit contributions
ifying+0.189
igsaw+0.169
run+0.148
chie+0.143
jamin+0.137
Neg logit contributions
 idea-0.092
 morning-0.076
 people-0.074
 day-0.074
 doctors-0.073
Token day
Token ID1110
Feature activation+8.62e-01

Pos logit contributions
 idea+0.073
 paramedics+0.029
 prevent+0.028
 people+0.027
 doctors+0.027
Neg logit contributions
ifying-0.664
igsaw-0.611
run-0.549
chie-0.537
 brief-0.525
Token he
Token ID339
Feature activation+3.93e-01

Pos logit contributions
 idea+0.980
 prevent+0.739
 paramedics+0.739
 hospital+0.727
 decide+0.706
Neg logit contributions
ifying-2.112
igsaw-1.816
run-1.608
 brief-1.539
chie-1.509
Token decided
Token ID3066
Feature activation+2.42e-01

Pos logit contributions
 idea+0.479
 morning+0.395
 day+0.385
 paramedics+0.383
 people+0.382
Neg logit contributions
ifying-0.893
igsaw-0.773
run-0.678
chie-0.657
jamin-0.652
Token to
Token ID284
Feature activation+9.54e-01

Pos logit contributions
 idea+0.370
 morning+0.314
 day+0.309
 people+0.300
 paramedics+0.300
Neg logit contributions
ifying-0.482
igsaw-0.419
run-0.359
chie-0.344
 brief-0.335
Token go
Token ID467
Feature activation-7.92e-01

Pos logit contributions
 idea+1.016
 morning+0.875
 paramedics+0.864
 day+0.849
 doctors+0.847
Neg logit contributions
ifying-2.224
igsaw-2.020
chie-1.699
run-1.686
jamin-1.626
Token on
Token ID319
Feature activation-1.47e+00

Pos logit contributions
ifying+2.057
igsaw+1.840
run+1.608
chie+1.587
jamin+1.512
Neg logit contributions
 idea-0.785
 morning-0.642
 day-0.640
 people-0.622
 prevent-0.596
Token an
Token ID281
Feature activation+9.04e-01

Pos logit contributions
ifying+3.787
igsaw+3.320
run+2.986
chie+2.980
lic+2.774
Neg logit contributions
 idea-1.575
 morning-1.392
 day-1.381
 people-1.307
 hospital-1.266
Token adventure
Token ID8855
Feature activation+2.78e-01

Pos logit contributions
 idea+0.930
 people+0.788
 morning+0.763
crow+0.761
 paramedics+0.760
Neg logit contributions
ifying-2.085
igsaw-1.974
chie-1.648
run-1.647
 brief-1.592
Token.
Token ID13
Feature activation-2.84e-01

Pos logit contributions
 idea+0.319
 morning+0.257
 paramedics+0.256
 people+0.253
 prevent+0.252
Neg logit contributions
ifying-0.652
igsaw-0.579
run-0.511
chie-0.490
 brief-0.477
Token "
Token ID366
Feature activation-2.62e-01

Pos logit contributions
ifying+1.569
igsaw+1.406
run+1.225
chie+1.204
 brief+1.150
Neg logit contributions
 idea-0.645
 morning-0.527
 day-0.523
 people-0.517
 prevent-0.492
TokenLet
Token ID5756
Feature activation+7.93e-01

Pos logit contributions
ifying+0.534
igsaw+0.461
run+0.389
chie+0.381
jamin+0.357
Neg logit contributions
 idea-0.398
 morning-0.344
 day-0.342
 people-0.340
 doctors-0.332
Token's
Token ID338
Feature activation+2.12e-01

Pos logit contributions
 idea+1.193
cut+1.009
 morning+1.008
 doctors+1.005
 day+1.005
Neg logit contributions
ifying-1.567
igsaw-1.358
chie-1.148
run-1.131
jamin-1.114
Token get
Token ID651
Feature activation-6.49e-01

Pos logit contributions
 idea+0.212
 morning+0.175
 day+0.172
 doctors+0.165
 people+0.165
Neg logit contributions
ifying-0.534
igsaw-0.471
run-0.413
chie-0.409
 brief-0.387
Token hard
Token ID1327
Feature activation-3.30e-01

Pos logit contributions
ifying+1.570
igsaw+1.408
run+1.220
chie+1.191
jamin+1.142
Neg logit contributions
 idea-0.724
 morning-0.610
 day-0.605
 people-0.598
 prevent-0.573
Token and
Token ID290
Feature activation-3.84e-01

Pos logit contributions
ifying+0.803
igsaw+0.716
run+0.624
chie+0.610
 brief+0.583
Neg logit contributions
 idea-0.363
 morning-0.299
 day-0.297
 people-0.293
 prevent-0.284
Token have
Token ID423
Feature activation-1.15e+00

Pos logit contributions
ifying+0.940
igsaw+0.837
run+0.732
chie+0.716
 brief+0.683
Neg logit contributions
 idea-0.418
 morning-0.343
 day-0.341
 people-0.335
 doctors-0.322
Token a
Token ID257
Feature activation-5.24e-01

Pos logit contributions
ifying+2.795
igsaw+2.518
run+2.166
chie+2.128
jamin+2.048
Neg logit contributions
 idea-1.276
 morning-1.100
 day-1.087
 people-1.082
 doctors-1.031
Token fire
Token ID2046
Feature activation-1.29e-01

Pos logit contributions
ifying+1.198
igsaw+1.059
run+0.915
chie+0.893
 brief+0.846
Neg logit contributions
 idea-0.660
 morning-0.558
 day-0.555
 people-0.545
 prevent-0.529
Tokenwork
Token ID1818
Feature activation+1.89e-01

Pos logit contributions
ifying+0.230
igsaw+0.194
run+0.159
chie+0.154
 brief+0.144
Neg logit contributions
 idea-0.225
 morning-0.200
 day-0.199
 people-0.198
 prevent-0.194
Token show
Token ID905
Feature activation+2.63e-01

Pos logit contributions
 idea+0.240
 morning+0.202
 people+0.202
 doctors+0.199
 day+0.198
Neg logit contributions
ifying-0.425
igsaw-0.376
run-0.321
chie-0.313
 brief-0.300
Token Tim
Token ID5045
Feature activation-3.38e-01

Pos logit contributions
 idea+0.335
 morning+0.264
 people+0.260
 hospital+0.259
 day+0.258
Neg logit contributions
ifying-0.733
igsaw-0.645
run-0.565
chie-0.544
 brief-0.525
Tokenmy
Token ID1820
Feature activation+1.25e+00

Pos logit contributions
ifying+0.622
igsaw+0.533
run+0.441
chie+0.425
jamin+0.399
Neg logit contributions
 idea-0.574
 morning-0.507
 day-0.503
 people-0.502
 prevent-0.492
Token's
Token ID338
Feature activation+4.32e-01

Pos logit contributions
 idea+1.868
 prevent+1.578
cut+1.549
 morning+1.529
 paramedics+1.515
Neg logit contributions
ifying-2.480
igsaw-2.123
run-1.854
jamin-1.853
 brief-1.720
Token mom
Token ID1995
Feature activation+9.84e-01

Pos logit contributions
 idea+0.391
 prevent+0.314
 morning+0.307
crow+0.295
 people+0.292
Neg logit contributions
ifying-1.127
igsaw-1.010
run-0.884
chie-0.846
 brief-0.844
Token told
Token ID1297
Feature activation+1.23e+00

Pos logit contributions
 idea+1.485
 morning+1.286
crow+1.281
 people+1.267
 hospital+1.245
Neg logit contributions
ifying-1.908
igsaw-1.607
run-1.443
jamin-1.377
lic-1.339
Token him
Token ID683
Feature activation-6.36e-01

Pos logit contributions
 idea+1.301
cut+1.082
 day+1.026
 morning+1.001
 bal+0.987
Neg logit contributions
ifying-2.806
igsaw-2.662
run-2.272
chie-2.204
jamin-2.122
Token they
Token ID484
Feature activation-3.96e-01

Pos logit contributions
ifying+1.559
igsaw+1.386
run+1.209
chie+1.188
 brief+1.128
Neg logit contributions
 idea-0.703
 morning-0.582
 day-0.580
 people-0.575
 doctors-0.550
Token needed
Token ID2622
Feature activation-2.03e-01

Pos logit contributions
ifying+0.821
igsaw+0.714
run+0.604
chie+0.588
 brief+0.554
Neg logit contributions
 idea-0.587
 morning-0.510
 day-0.506
 people-0.501
 prevent-0.489
Token to
Token ID284
Feature activation+4.91e-01

Pos logit contributions
ifying+0.435
igsaw+0.381
run+0.327
chie+0.316
 brief+0.301
Neg logit contributions
 idea-0.285
 morning-0.242
 day-0.240
 people-0.238
 prevent-0.233
Token clean
Token ID3424
Feature activation+2.26e-06

Pos logit contributions
 idea+0.484
 morning+0.397
 day+0.388
crow+0.375
 people+0.370
Neg logit contributions
ifying-1.216
igsaw-1.086
run-0.940
chie-0.932
 brief-0.899
Token his
Token ID465
Feature activation-1.43e-01

Feature not active on prev token;
no predictions were affected.
Token other
Token ID584
Feature activation+1.08e-01

Pos logit contributions
 idea+0.297
 doctors+0.194
 paramedics+0.190
 morning+0.186
 prevent+0.186
Neg logit contributions
ifying-1.235
igsaw-1.118
run-1.000
chie-0.986
 brief-0.943
Token and
Token ID290
Feature activation+2.26e-02

Pos logit contributions
 idea+0.107
 morning+0.085
 doctors+0.083
 prevent+0.082
 day+0.082
Neg logit contributions
ifying-0.273
igsaw-0.245
run-0.216
chie-0.211
 brief-0.202
Token hopped
Token ID45230
Feature activation+1.07e-01

Pos logit contributions
 idea+0.038
 morning+0.033
 day+0.032
 people+0.032
 doctors+0.032
Neg logit contributions
ifying-0.042
igsaw-0.036
run-0.030
chie-0.029
jamin-0.027
Token around
Token ID1088
Feature activation+2.13e-01

Pos logit contributions
 idea+0.112
 morning+0.091
 doctors+0.091
 day+0.089
 people+0.089
Neg logit contributions
ifying-0.265
igsaw-0.237
run-0.207
chie-0.202
 brief-0.193
Token together
Token ID1978
Feature activation+2.92e-01

Pos logit contributions
 idea+0.223
 morning+0.179
 paramedics+0.175
 doctors+0.175
 day+0.173
Neg logit contributions
ifying-0.525
igsaw-0.472
run-0.412
chie-0.402
 brief-0.387
Token.
Token ID13
Feature activation-2.79e-01

Pos logit contributions
 idea+0.275
 doctors+0.218
 paramedics+0.216
 morning+0.216
 prevent+0.212
Neg logit contributions
ifying-0.750
igsaw-0.671
run-0.594
chie-0.578
 brief-0.553
Token
Token ID198
Feature activation+1.47e-01

Pos logit contributions
ifying+0.663
igsaw+0.588
run+0.512
chie+0.497
jamin+0.474
Neg logit contributions
 idea-0.331
 morning-0.273
 day-0.271
 people-0.270
 doctors-0.262
Token
Token ID198
Feature activation+1.86e-01

Pos logit contributions
 idea+0.176
 morning+0.145
 people+0.142
 doctors+0.142
 day+0.141
Neg logit contributions
ifying-0.341
igsaw-0.303
run-0.269
chie-0.256
jamin-0.246
TokenAt
Token ID2953
Feature activation-4.08e-01

Pos logit contributions
 idea+0.252
 morning+0.209
 day+0.205
 people+0.203
 doctors+0.200
Neg logit contributions
ifying-0.416
igsaw-0.362
run-0.313
chie-0.308
jamin-0.291
Token the
Token ID262
Feature activation+4.59e-01

Pos logit contributions
ifying+1.064
igsaw+0.957
run+0.842
chie+0.825
 brief+0.791
Neg logit contributions
 idea-0.386
 morning-0.305
 day-0.302
 people-0.299
 prevent-0.285
Token end
Token ID886
Feature activation-2.53e-01

Pos logit contributions
 idea+0.307
 doctors+0.203
 paramedics+0.203
 people+0.202
 prevent+0.193
Neg logit contributions
ifying-1.316
igsaw-1.204
chie-1.031
run-1.026
jamin-1.008
Token noticed
Token ID6810
Feature activation-7.68e-01

Pos logit contributions
ifying+0.106
igsaw+0.092
run+0.081
chie+0.078
 brief+0.075
Neg logit contributions
 idea-0.062
 morning-0.051
 day-0.051
 people-0.050
 prevent-0.049
Token the
Token ID262
Feature activation+9.33e-01

Pos logit contributions
ifying+2.005
igsaw+1.800
run+1.576
chie+1.561
 brief+1.484
Neg logit contributions
 idea-0.726
 morning-0.593
 people-0.592
 day-0.588
 doctors-0.560
Token little
Token ID1310
Feature activation-2.70e-02

Pos logit contributions
 idea+1.069
 morning+0.929
 prevent+0.901
 day+0.880
 bal+0.864
Neg logit contributions
ifying-2.139
igsaw-1.887
chie-1.606
 brief-1.588
run-1.570
Token girl
Token ID2576
Feature activation-4.94e-01

Pos logit contributions
ifying+0.063
igsaw+0.056
run+0.048
chie+0.047
 brief+0.045
Neg logit contributions
 idea-0.032
 morning-0.027
 day-0.027
 prevent-0.026
 people-0.026
Token and
Token ID290
Feature activation-1.93e-01

Pos logit contributions
ifying+1.170
igsaw+1.041
run+0.901
chie+0.883
 brief+0.840
Neg logit contributions
 idea-0.578
 morning-0.483
 day-0.480
 people-0.474
 prevent-0.457
Token went
Token ID1816
Feature activation-1.11e-01

Pos logit contributions
ifying+0.482
igsaw+0.430
run+0.381
chie+0.373
 brief+0.356
Neg logit contributions
 idea-0.199
 morning-0.161
 day-0.159
 people-0.153
 prevent-0.150
Token over
Token ID625
Feature activation-5.85e-01

Pos logit contributions
ifying+0.253
igsaw+0.224
run+0.195
chie+0.189
 brief+0.182
Neg logit contributions
 idea-0.138
 morning-0.115
 day-0.114
 people-0.112
 prevent-0.109
Token to
Token ID284
Feature activation+6.66e-01

Pos logit contributions
ifying+1.296
igsaw+1.139
run+0.972
chie+0.954
jamin+0.900
Neg logit contributions
 idea-0.783
 morning-0.675
 day-0.672
 people-0.663
 prevent-0.642
Token her
Token ID607
Feature activation-2.01e-01

Pos logit contributions
 idea+0.724
 morning+0.586
 day+0.579
 doctors+0.567
 bal+0.561
Neg logit contributions
ifying-1.577
igsaw-1.421
run-1.258
chie-1.222
 brief-1.177
Token.
Token ID13
Feature activation-2.28e-01

Pos logit contributions
ifying+0.518
igsaw+0.465
run+0.410
chie+0.399
 brief+0.386
Neg logit contributions
 idea-0.194
 morning-0.157
 day-0.154
 people-0.150
 prevent-0.146
Token She
Token ID1375
Feature activation-3.08e-01

Pos logit contributions
ifying+0.525
igsaw+0.466
run+0.404
chie+0.393
jamin+0.373
Neg logit contributions
 idea-0.280
 morning-0.234
 day-0.234
 people-0.229
 prevent-0.225
Token and
Token ID290
Feature activation+1.71e-01

Pos logit contributions
 idea+0.045
 morning+0.036
 people+0.036
 day+0.035
 doctors+0.035
Neg logit contributions
ifying-0.101
igsaw-0.090
run-0.079
chie-0.077
 brief-0.073
Token count
Token ID954
Feature activation-5.20e-01

Pos logit contributions
 idea+0.185
 morning+0.149
 day+0.146
 people+0.142
 paramedics+0.141
Neg logit contributions
ifying-0.414
igsaw-0.368
run-0.326
chie-0.316
 brief-0.303
Token how
Token ID703
Feature activation-9.18e-01

Pos logit contributions
ifying+1.264
igsaw+1.128
run+0.980
chie+0.961
 brief+0.917
Neg logit contributions
 idea-0.578
 morning-0.479
 day-0.476
 people-0.471
 prevent-0.452
Token many
Token ID867
Feature activation-1.89e-01

Pos logit contributions
ifying+2.239
igsaw+1.999
run+1.717
chie+1.683
jamin+1.634
Neg logit contributions
 idea-1.019
 people-0.873
 day-0.872
 morning-0.869
 prevent-0.823
Token he
Token ID339
Feature activation-1.24e+00

Pos logit contributions
ifying+0.447
igsaw+0.398
run+0.350
chie+0.339
 brief+0.323
Neg logit contributions
 idea-0.218
 morning-0.182
 day-0.179
 people-0.174
 prevent-0.172
Token had
Token ID550
Feature activation-1.97e-01

Pos logit contributions
ifying+2.916
igsaw+2.560
run+2.142
chie+2.123
 brief+2.019
Neg logit contributions
 idea-1.592
 day-1.374
 people-1.362
 morning-1.348
 doctors-1.294
Token r
Token ID374
Feature activation-4.86e-01

Pos logit contributions
ifying+0.476
igsaw+0.424
run+0.372
chie+0.362
 brief+0.346
Neg logit contributions
 idea-0.221
 morning-0.183
 day-0.180
 people-0.177
 doctors-0.172
Tokenaked
Token ID4335
Feature activation-9.08e-02

Pos logit contributions
ifying+0.988
igsaw+0.858
run+0.722
chie+0.702
 brief+0.662
Neg logit contributions
 idea-0.739
 morning-0.644
 day-0.641
 people-0.635
 prevent-0.619
Token.
Token ID13
Feature activation+6.07e-01

Pos logit contributions
ifying+0.215
igsaw+0.191
run+0.168
chie+0.163
 brief+0.155
Neg logit contributions
 idea-0.105
 morning-0.088
 day-0.087
 people-0.084
 prevent-0.083
Token<|endoftext|>
Token ID50256
Feature activation-5.94e-01

Pos logit contributions
 idea+0.778
 people+0.635
 doctors+0.634
 prevent+0.629
 paramedics+0.624
Neg logit contributions
ifying-1.355
igsaw-1.233
run-1.076
chie-0.987
jamin-0.966
TokenOnce
Token ID7454
Feature activation-9.26e-03

Pos logit contributions
ifying+1.482
igsaw+1.324
run+1.158
chie+1.142
 brief+1.087
Neg logit contributions
 idea-0.622
 morning-0.513
 day-0.510
 people-0.499
 prevent-0.473
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
ifying+1.343
igsaw+1.207
run+1.061
chie+1.041
 brief+0.997
Neg logit contributions
 idea-0.496
 morning-0.396
 day-0.394
 people-0.387
 prevent-0.368
Token
Token ID198
Feature activation+4.43e-01

Pos logit contributions
ifying+0.527
igsaw+0.468
run+0.407
chie+0.392
jamin+0.377
Neg logit contributions
 idea-0.259
 morning-0.213
 day-0.212
 people-0.210
 prevent-0.206
Token
Token ID198
Feature activation+1.64e-01

Pos logit contributions
 idea+0.570
 morning+0.465
 doctors+0.456
 day+0.453
 people+0.453
Neg logit contributions
ifying-0.994
igsaw-0.889
run-0.785
chie-0.733
jamin-0.720
TokenBut
Token ID1537
Feature activation+3.62e-01

Pos logit contributions
 idea+0.215
 morning+0.176
 day+0.173
 people+0.170
 doctors+0.166
Neg logit contributions
ifying-0.375
igsaw-0.329
run-0.284
chie-0.277
jamin-0.266
Token then
Token ID788
Feature activation+1.36e-01

Pos logit contributions
 idea+0.419
 morning+0.324
 day+0.320
 people+0.319
 prevent+0.316
Neg logit contributions
ifying-0.876
igsaw-0.766
run-0.672
chie-0.644
jamin-0.627
Token they
Token ID484
Feature activation-1.68e+00

Pos logit contributions
 idea+0.136
 morning+0.107
 day+0.107
 prevent+0.102
 people+0.100
Neg logit contributions
ifying-0.345
igsaw-0.310
run-0.270
chie-0.263
 brief-0.257
Token heard
Token ID2982
Feature activation-7.35e-01

Pos logit contributions
ifying+3.701
igsaw+3.559
run+2.937
chie+2.860
 brief+2.628
Neg logit contributions
 idea-2.202
 people-1.974
 morning-1.899
 day-1.898
 doctors-1.877
Token their
Token ID511
Feature activation-4.02e-01

Pos logit contributions
ifying+1.711
igsaw+1.514
run+1.308
chie+1.292
jamin+1.214
Neg logit contributions
 idea-0.904
 morning-0.765
 people-0.762
 day-0.757
 doctors-0.734
Token mom
Token ID1995
Feature activation-9.12e-01

Pos logit contributions
ifying+1.101
igsaw+0.994
run+0.882
chie+0.865
 brief+0.832
Neg logit contributions
 idea-0.328
 morning-0.250
 day-0.247
 people-0.241
 prevent-0.230
Token's
Token ID338
Feature activation-3.90e-01

Pos logit contributions
ifying+2.279
igsaw+2.058
run+1.757
chie+1.750
 brief+1.662
Neg logit contributions
 idea-0.973
 morning-0.794
 people-0.793
 day-0.790
 prevent-0.764
Token voice
Token ID3809
Feature activation-3.99e-01

Pos logit contributions
ifying+0.812
igsaw+0.710
run+0.601
chie+0.585
 brief+0.553
Neg logit contributions
 idea-0.572
 morning-0.497
 day-0.494
 people-0.488
 prevent-0.476
Token Mom
Token ID11254
Feature activation-1.80e-01

Pos logit contributions
 idea+0.766
cut+0.614
 prevent+0.595
 morning+0.588
 day+0.585
Neg logit contributions
ifying-1.705
igsaw-1.526
run-1.355
chie-1.271
 brief-1.264
Token asked
Token ID1965
Feature activation-5.44e-01

Pos logit contributions
ifying+0.468
igsaw+0.420
run+0.375
chie+0.364
jamin+0.351
Neg logit contributions
 idea-0.167
 morning-0.131
 day-0.129
 people-0.126
 prevent-0.123
Token.
Token ID13
Feature activation+9.91e-02

Pos logit contributions
ifying+1.442
igsaw+1.298
run+1.142
chie+1.122
 brief+1.077
Neg logit contributions
 idea-0.490
 morning-0.385
 day-0.381
 people-0.378
 doctors-0.357
Token
Token ID198
Feature activation+2.55e-01

Pos logit contributions
 idea+0.119
 morning+0.098
 day+0.098
 prevent+0.096
 people+0.095
Neg logit contributions
ifying-0.231
igsaw-0.205
run-0.180
chie-0.171
jamin-0.166
Token
Token ID198
Feature activation-5.21e-01

Pos logit contributions
 idea+0.318
 morning+0.262
 day+0.257
 people+0.249
 prevent+0.249
Neg logit contributions
ifying-0.586
igsaw-0.526
run-0.460
chie-0.436
jamin-0.422
Token"
Token ID1
Feature activation-1.65e+00

Pos logit contributions
ifying+1.259
igsaw+1.120
run+0.974
chie+0.953
 brief+0.908
Neg logit contributions
 idea-0.595
 morning-0.496
 day-0.493
 people-0.488
 prevent-0.469
TokenL
Token ID43
Feature activation-6.55e-01

Pos logit contributions
ifying+3.720
igsaw+3.345
run+2.852
chie+2.760
jamin+2.610
Neg logit contributions
 idea-2.066
 people-1.948
 doctors-1.893
 day-1.876
 morning-1.846
Tokenily
Token ID813
Feature activation+5.75e-01

Pos logit contributions
ifying+1.837
igsaw+1.648
run+1.466
chie+1.443
 brief+1.391
Neg logit contributions
 idea-0.511
 morning-0.378
 day-0.376
 people-0.368
 doctors-0.346
Token and
Token ID290
Feature activation-6.59e-01

Pos logit contributions
 idea+0.697
 morning+0.587
 day+0.576
 prevent+0.569
 people+0.565
Neg logit contributions
ifying-1.300
igsaw-1.151
run-1.004
jamin-0.962
chie-0.957
Token Max
Token ID5436
Feature activation+6.25e-01

Pos logit contributions
ifying+1.569
igsaw+1.388
run+1.204
chie+1.184
 brief+1.122
Neg logit contributions
 idea-0.794
 morning-0.673
 day-0.670
 people-0.660
 doctors-0.632
Token pointed
Token ID6235
Feature activation+5.52e-01

Pos logit contributions
 idea+0.817
 morning+0.707
 day+0.694
 doctors+0.685
 paramedics+0.680
Neg logit contributions
ifying-1.351
igsaw-1.167
run-1.012
chie-0.987
jamin-0.962
Token says
Token ID1139
Feature activation+9.52e-03

Pos logit contributions
ifying+0.028
igsaw+0.025
run+0.022
chie+0.021
 brief+0.020
Neg logit contributions
 idea-0.012
 morning-0.010
 day-0.010
 people-0.009
 prevent-0.009
Token slapping
Token ID48966
Feature activation-7.84e-01

Pos logit contributions
 idea+0.011
 morning+0.009
 day+0.009
 prevent+0.009
 people+0.009
Neg logit contributions
ifying-0.022
igsaw-0.020
run-0.017
chie-0.017
 brief-0.016
Token is
Token ID318
Feature activation+6.34e-01

Pos logit contributions
ifying+2.021
igsaw+1.821
run+1.585
chie+1.567
 brief+1.495
Neg logit contributions
 idea-0.774
 morning-0.628
 people-0.627
 day-0.622
 prevent-0.583
Token not
Token ID407
Feature activation-5.15e-01

Pos logit contributions
 idea+0.812
 morning+0.698
 day+0.678
crow+0.658
 people+0.649
Neg logit contributions
ifying-1.429
igsaw-1.222
run-1.099
chie-1.033
 brief-1.008
Token nice
Token ID3621
Feature activation+6.68e-02

Pos logit contributions
ifying+1.226
igsaw+1.089
run+0.942
chie+0.922
 brief+0.878
Neg logit contributions
 idea-0.608
 morning-0.506
 day-0.504
 people-0.499
 doctors-0.480
Token and
Token ID290
Feature activation-1.59e+00

Pos logit contributions
 idea+0.072
 morning+0.060
 day+0.058
 prevent+0.057
 people+0.057
Neg logit contributions
ifying-0.163
igsaw-0.145
run-0.129
chie-0.123
 brief-0.119
Token it
Token ID340
Feature activation-4.75e-01

Pos logit contributions
ifying+3.929
igsaw+3.552
chie+2.956
run+2.949
jamin+2.865
Neg logit contributions
 idea-1.831
 people-1.603
 day-1.586
 morning-1.538
 doctors-1.509
Token hurts
Token ID20406
Feature activation-4.32e-01

Pos logit contributions
ifying+1.116
igsaw+0.991
run+0.858
chie+0.839
 brief+0.799
Neg logit contributions
 idea-0.568
 morning-0.476
 day-0.473
 people-0.467
 prevent-0.451
Token.
Token ID13
Feature activation+2.75e-02

Pos logit contributions
ifying+1.111
igsaw+0.996
run+0.877
chie+0.859
 brief+0.822
Neg logit contributions
 idea-0.422
 morning-0.338
 day-0.334
 people-0.328
 prevent-0.315
Token She
Token ID1375
Feature activation+1.96e-01

Pos logit contributions
 idea+0.034
 day+0.028
 morning+0.028
 prevent+0.027
 people+0.027
Neg logit contributions
ifying-0.064
igsaw-0.057
run-0.049
chie-0.047
jamin-0.046
Token asks
Token ID7893
Feature activation+6.32e-02

Pos logit contributions
 idea+0.193
 morning+0.162
 day+0.161
 people+0.151
 prevent+0.145
Neg logit contributions
ifying-0.492
igsaw-0.436
run-0.388
chie-0.372
 brief-0.365
Token they
Token ID484
Feature activation-3.77e-01

Pos logit contributions
 idea+1.376
cut+1.160
 paramedics+1.132
 doctors+1.109
 hospital+1.080
Neg logit contributions
ifying-2.771
igsaw-2.537
run-2.174
jamin-2.054
chie-2.044
Token both
Token ID1111
Feature activation+4.94e-03

Pos logit contributions
ifying+0.807
igsaw+0.705
run+0.600
chie+0.585
 brief+0.552
Neg logit contributions
 idea-0.534
 morning-0.461
 day-0.457
 people-0.453
 doctors-0.442
Token had
Token ID550
Feature activation-1.04e+00

Pos logit contributions
 idea+0.007
 morning+0.006
 people+0.006
 doctors+0.006
 day+0.006
Neg logit contributions
ifying-0.011
igsaw-0.009
run-0.008
chie-0.008
jamin-0.007
Token their
Token ID511
Feature activation-3.53e-01

Pos logit contributions
ifying+2.614
igsaw+2.331
run+2.037
chie+2.003
 brief+1.893
Neg logit contributions
 idea-1.120
 morning-0.924
 day-0.922
 people-0.919
 doctors-0.857
Token own
Token ID898
Feature activation-2.75e-01

Pos logit contributions
ifying+0.880
igsaw+0.788
run+0.689
chie+0.674
 brief+0.646
Neg logit contributions
 idea-0.366
 morning-0.298
 day-0.294
 people-0.291
 prevent-0.281
Token brace
Token ID22581
Feature activation-1.66e+00

Pos logit contributions
ifying+0.658
igsaw+0.587
run+0.510
chie+0.497
 brief+0.476
Neg logit contributions
 idea-0.311
 morning-0.259
 day-0.255
 people-0.253
 doctors-0.247
Tokenlets
Token ID5289
Feature activation-7.42e-01

Pos logit contributions
ifying+2.889
igsaw+2.520
run+1.976
chie+1.895
jamin+1.845
Neg logit contributions
 idea-3.006
 people-2.621
 day-2.619
 morning-2.577
 whenever-2.549
Token and
Token ID290
Feature activation-8.62e-01

Pos logit contributions
ifying+1.901
igsaw+1.710
run+1.497
chie+1.470
 brief+1.407
Neg logit contributions
 idea-0.730
 day-0.589
 morning-0.586
 people-0.574
 prevent-0.540
Token they
Token ID484
Feature activation+4.30e-01

Pos logit contributions
ifying+1.814
igsaw+1.585
run+1.339
chie+1.305
 brief+1.225
Neg logit contributions
 idea-1.265
 day-1.104
 people-1.100
 morning-1.096
 doctors-1.051
Token were
Token ID547
Feature activation+5.00e-01

Pos logit contributions
 idea+0.646
 morning+0.565
 doctors+0.559
 people+0.553
 paramedics+0.552
Neg logit contributions
ifying-0.871
igsaw-0.729
run-0.623
chie-0.620
jamin-0.574
Token both
Token ID1111
Feature activation-4.30e-01

Pos logit contributions
 idea+0.636
 morning+0.545
 day+0.527
 prevent+0.526
 people+0.523
Neg logit contributions
ifying-1.113
igsaw-0.971
run-0.857
chie-0.835
jamin-0.784
Token so
Token ID523
Feature activation+5.30e-01

Pos logit contributions
 idea+0.023
 morning+0.020
 day+0.020
 people+0.020
 prevent+0.020
Neg logit contributions
ifying-0.029
igsaw-0.025
run-0.021
chie-0.021
 brief-0.019
Token excited
Token ID6568
Feature activation-8.13e-02

Pos logit contributions
 idea+0.514
 morning+0.443
 doctors+0.426
 day+0.424
 hospital+0.423
Neg logit contributions
ifying-1.328
igsaw-1.185
run-1.064
chie-1.032
 brief-0.981
Token!
Token ID0
Feature activation+4.01e-01

Pos logit contributions
ifying+0.192
igsaw+0.172
run+0.150
chie+0.145
 brief+0.139
Neg logit contributions
 idea-0.094
 morning-0.077
 day-0.076
 people-0.075
 prevent-0.074
Token She
Token ID1375
Feature activation-5.59e-01

Pos logit contributions
 idea+0.559
 day+0.467
 prevent+0.466
 people+0.461
 morning+0.460
Neg logit contributions
ifying-0.871
igsaw-0.769
run-0.672
chie-0.623
jamin-0.612
Token had
Token ID550
Feature activation-5.29e-01

Pos logit contributions
ifying+1.205
igsaw+1.059
run+0.898
chie+0.877
 brief+0.828
Neg logit contributions
 idea-0.780
 morning-0.670
 day-0.667
 people-0.662
 prevent-0.642
Token never
Token ID1239
Feature activation-1.64e+00

Pos logit contributions
ifying+1.244
igsaw+1.105
run+0.955
chie+0.933
 brief+0.886
Neg logit contributions
 idea-0.636
 morning-0.535
 day-0.531
 people-0.526
 prevent-0.507
Token seen
Token ID1775
Feature activation+2.02e-01

Pos logit contributions
ifying+3.894
igsaw+3.450
run+2.877
chie+2.858
jamin+2.702
Neg logit contributions
 idea-2.131
 day-1.879
 morning-1.826
 people-1.819
 doctors-1.734
Token anything
Token ID1997
Feature activation+2.52e-01

Pos logit contributions
 idea+0.253
 morning+0.210
 day+0.209
 prevent+0.204
 people+0.200
Neg logit contributions
ifying-0.461
igsaw-0.407
run-0.352
chie-0.338
 brief-0.325
Token so
Token ID523
Feature activation+2.68e-01

Pos logit contributions
 idea+0.227
 morning+0.178
 paramedics+0.175
 people+0.172
 day+0.170
Neg logit contributions
ifying-0.660
igsaw-0.593
run-0.525
chie-0.507
 brief-0.493
Token beautiful
Token ID4950
Feature activation+4.54e-01

Pos logit contributions
 idea+0.329
 morning+0.276
 day+0.272
 paramedics+0.270
 doctors+0.267
Neg logit contributions
ifying-0.618
igsaw-0.548
run-0.472
chie-0.452
 brief-0.438
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
 idea+0.532
 paramedics+0.430
 morning+0.426
 prevent+0.420
 day+0.415
Neg logit contributions
ifying-1.078
igsaw-0.954
run-0.819
chie-0.786
 brief-0.765