TOP ACTIVATIONS
MAX = 3.392

help each other . Lily let Ben use some paper and
her toy bird . She let it fly high above the
to the grass . They let the balloon fly high in
and their switch . They let each other use the switch
says . They let go of the jet and
by the swings . She let a little bit of sand
Ben were scared . They let go of the lamp and
. L ila and Sam let them have a turn ,
and Ben cried . They let go of the nest and
She opened the door and let the bird out . The
inside it , so she let herself in . She stepped
to the grass . They let the balloon fly high in
After a while , Lily let go of the cow and
open the box . They let the rabbit go . The
, the king decided to let people come to visit .
other . They let go of the spring and
bird was very tired and let out a big yawn .
gate . She smiled and let them in . They saw
sun ball . They also let other kids join them and
!" They both let go of the dress and

MIN ACTIVATIONS
MIN = -3.238

dance around ! Em ma was over joy ed
new symbol ! Em ma 's friends cl apped
the surface . Em ma 's friends cheered as
he was . Em ma loved the present and
his friends dropped her lip bal m on the ground .
birds . Em ma was very proud and
you help ?" Em ma nodded and got to
here . Em ma had a great time
come out ! Em ma was so sad .
down safely ." Em ma understood what they meant
the terrible mess . His mar bles were everywhere ! He
with me ?" Mel ody was very excited and
diamond ! Em ma was so happy and
place . Em ma was so proud of
the night . Em ma was so proud of
away . Em ma and her friends all
Emma ! '. Em ma was so happy she
L ily was scared at first , but then she
away to hear her . Ros y watched sadly as her
ie . They both had birth days coming up , and

INTERVAL 1.735 - 3.392
CONTAINS 0.272%

She blew some bubbles and watched them float in the air
was very tired . Every day he worked in the hot
big and bright . Every day the truck would drive down
. " You are a smart girl ." But
girl opened the wrap and showed it to her mom and

INTERVAL 0.077 - 1.735
CONTAINS 44.343%

" I saw it first , so I should eat it
went inside . Maybe he 'll try again later . <|endoftext|>
uzz . After some time , they found him safe and
little girl named Lily . She loved to draw and color
Ben and Lily and wanted to eat them . He hid

INTERVAL -1.581 - 0.077
CONTAINS 54.991%

laughing . After a while , it was time
and buried the gold coin . After they finished , Mrs
I want to make my shoes shiny too . Can you
he had to help someone . He didn 't want to
decided to make a beautiful hair b raid with it .

INTERVAL -3.238 - -1.581
CONTAINS 0.394%

to nap . They thought n apping was boring and silly
She asked , " Why are you sad ?" The boy
what it was . As he got closer , he realized
bear coming towards them . They quickly forgot about the rock
ies do ?" " Well ," her mom said
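
The four INTERVAL boundaries above look like equal-width bins spanning MIN = -3.238 to MAX = 3.392. The dump does not state this, so it is an inference from the reported numbers. A minimal sketch that recomputes the edges under that assumption, and checks that the CONTAINS percentages cover all activations, could look like this (the function name `equal_width_edges` is hypothetical, not part of any dashboard tool):

```python
# Sketch: recompute the interval edges, ASSUMING equal-width binning
# between the reported MIN and MAX activations (an inference, not
# something the dump itself states).

def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# Values copied from the MIN / MAX lines of the dump.
edges = equal_width_edges(-3.238, 3.392, 4)

# CONTAINS percentages copied from the four INTERVAL blocks,
# ordered from the lowest interval to the highest.
shares = [0.394, 54.991, 44.343, 0.272]
total_share = sum(shares)

print(edges)
print(total_share)
```

If the assumption holds, the computed edges should agree with the printed boundaries (-1.581, 0.077, 1.735) up to display rounding, and the shares should sum to roughly 100%.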
Token help
Token ID1037
Feature activation+1.18e+00

Pos logit contributions
ifying+1.397
igsaw+1.246
run+1.090
chie+1.068
 brief+1.017
Neg logit contributions
 idea-0.606
 morning-0.496
 day-0.494
 people-0.488
 prevent-0.469
Token each
Token ID1123
Feature activation+6.08e-01

Pos logit contributions
 idea+1.858
 morning+1.634
cut+1.615
 ostr+1.604
 paramedics+1.597
Neg logit contributions
ifying-2.232
igsaw-1.911
run-1.658
chie-1.545
 brief-1.481
Token other
Token ID584
Feature activation-9.69e-03

Pos logit contributions
 idea+0.420
 morning+0.290
 paramedics+0.269
 prevent+0.261
 doctors+0.260
Neg logit contributions
ifying-1.687
igsaw-1.535
chie-1.361
run-1.360
 brief-1.304
Token.
Token ID13
Feature activation+2.86e-01

Pos logit contributions
ifying+0.024
igsaw+0.022
run+0.019
chie+0.019
 brief+0.018
Neg logit contributions
 idea-0.010
 morning-0.008
 day-0.008
 people-0.007
 doctors-0.007
Token Lily
Token ID20037
Feature activation+9.75e-01

Pos logit contributions
 idea+0.400
 prevent+0.338
 day+0.335
 people+0.335
 morning+0.334
Neg logit contributions
ifying-0.626
igsaw-0.549
run-0.469
chie-0.433
jamin-0.428
Token let
Token ID1309
Feature activation+3.39e+00

Pos logit contributions
 idea+1.365
 morning+1.232
 day+1.190
 prevent+1.173
 people+1.163
Neg logit contributions
ifying-1.926
igsaw-1.717
run-1.473
jamin-1.445
 hips-1.390
Token Ben
Token ID3932
Feature activation+1.79e-01

Pos logit contributions
cut+4.730
 idea+4.315
 ostr+4.058
bye+4.006
crow+3.772
Neg logit contributions
ifying-6.718
igsaw-6.120
jamin-5.376
run-4.971
 hips-4.901
Token use
Token ID779
Feature activation-1.81e-02

Pos logit contributions
 idea+0.205
 morning+0.173
 day+0.171
 people+0.170
 doctors+0.168
Neg logit contributions
ifying-0.421
igsaw-0.377
run-0.331
chie-0.318
 brief-0.305
Token some
Token ID617
Feature activation+1.58e-01

Pos logit contributions
ifying+0.046
igsaw+0.042
run+0.037
chie+0.036
 brief+0.034
Neg logit contributions
 idea-0.017
 morning-0.014
 day-0.014
 people-0.013
 doctors-0.013
Token paper
Token ID3348
Feature activation+1.33e-01

Pos logit contributions
 idea+0.208
 morning+0.179
 day+0.177
 people+0.174
 prevent+0.173
Neg logit contributions
ifying-0.344
igsaw-0.305
run-0.263
chie-0.250
 brief-0.243
Token and
Token ID290
Feature activation-3.26e-02

Pos logit contributions
 idea+0.156
 morning+0.133
 day+0.131
 prevent+0.126
 people+0.125
Neg logit contributions
ifying-0.312
igsaw-0.276
run-0.241
chie-0.233
 brief-0.226
Token her
Token ID607
Feature activation+3.15e-01

Pos logit contributions
 idea+0.118
 morning+0.090
 day+0.089
 prevent+0.087
 people+0.087
Neg logit contributions
ifying-0.321
igsaw-0.289
run-0.253
chie-0.245
 brief-0.240
Token toy
Token ID13373
Feature activation+1.12e+00

Pos logit contributions
 idea+0.299
 morning+0.243
 prevent+0.237
 people+0.229
 day+0.228
Neg logit contributions
ifying-0.797
igsaw-0.714
run-0.628
chie-0.615
 brief-0.607
Token bird
Token ID6512
Feature activation+5.27e-01

Pos logit contributions
 idea+1.313
 prevent+1.030
 morning+1.016
 day+0.971
 paramedics+0.951
Neg logit contributions
ifying-2.659
igsaw-2.231
chie-1.996
 brief-1.974
run-1.974
Token.
Token ID13
Feature activation+2.64e-01

Pos logit contributions
 idea+0.584
 morning+0.463
 prevent+0.461
 paramedics+0.460
 people+0.450
Neg logit contributions
ifying-1.262
igsaw-1.093
run-0.994
chie-0.941
 brief-0.931
Token She
Token ID1375
Feature activation+4.80e-01

Pos logit contributions
 idea+0.350
 day+0.294
 morning+0.292
 people+0.291
 prevent+0.291
Neg logit contributions
ifying-0.580
igsaw-0.518
run-0.445
chie-0.419
jamin-0.406
Token let
Token ID1309
Feature activation+3.38e+00

Pos logit contributions
 idea+0.679
 morning+0.569
 paramedics+0.553
 prevent+0.552
 people+0.552
Neg logit contributions
ifying-0.994
igsaw-0.841
run-0.764
chie-0.724
jamin-0.710
Token it
Token ID340
Feature activation+5.76e-01

Pos logit contributions
 idea+5.114
cut+5.063
 paramedics+4.301
 choice+4.132
bye+4.016
Neg logit contributions
ifying-6.159
igsaw-5.498
jamin-4.602
run-4.529
of-4.408
Token fly
Token ID6129
Feature activation+1.52e-02

Pos logit contributions
 idea+0.499
 doctors+0.399
 morning+0.397
 paramedics+0.395
 day+0.383
Neg logit contributions
ifying-1.467
igsaw-1.321
run-1.205
chie-1.174
 brief-1.125
Token high
Token ID1029
Feature activation-7.25e-01

Pos logit contributions
 idea+0.018
 morning+0.015
 doctors+0.015
 people+0.014
 day+0.014
Neg logit contributions
ifying-0.036
igsaw-0.032
run-0.028
chie-0.027
 brief-0.026
Token above
Token ID2029
Feature activation+7.88e-01

Pos logit contributions
ifying+1.776
igsaw+1.589
run+1.387
chie+1.355
 brief+1.291
Neg logit contributions
 idea-0.800
 morning-0.668
 day-0.666
 people-0.648
 prevent-0.616
Token the
Token ID262
Feature activation-3.95e-02

Pos logit contributions
 idea+0.860
 paramedics+0.704
cut+0.679
 doctors+0.674
 prevent+0.666
Neg logit contributions
ifying-1.912
igsaw-1.696
run-1.469
chie-1.435
jamin-1.396
Token to
Token ID284
Feature activation-3.08e-01

Pos logit contributions
ifying+2.587
igsaw+2.268
chie+1.957
run+1.936
jamin+1.849
Neg logit contributions
 idea-1.309
 day-1.166
 morning-1.156
 people-1.128
 hospital-1.061
Token the
Token ID262
Feature activation+6.60e-03

Pos logit contributions
ifying+0.730
igsaw+0.650
run+0.564
chie+0.548
 brief+0.525
Neg logit contributions
 idea-0.365
 morning-0.303
 day-0.301
 people-0.297
 doctors-0.288
Token grass
Token ID8701
Feature activation-4.19e-01

Pos logit contributions
 idea+0.007
 morning+0.006
 day+0.005
 people+0.005
 prevent+0.005
Neg logit contributions
ifying-0.016
igsaw-0.015
run-0.013
chie-0.012
 brief-0.012
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
ifying+1.081
igsaw+0.969
run+0.853
chie+0.835
 brief+0.800
Neg logit contributions
 idea-0.404
 morning-0.323
 day-0.320
 people-0.315
 prevent-0.302
Token They
Token ID1119
Feature activation-1.06e-02

Pos logit contributions
 idea+0.234
 day+0.195
 morning+0.194
 prevent+0.194
 people+0.193
Neg logit contributions
ifying-0.370
igsaw-0.325
run-0.280
chie-0.261
jamin-0.256
Token let
Token ID1309
Feature activation+3.32e+00

Pos logit contributions
ifying+0.022
igsaw+0.019
run+0.016
chie+0.015
 brief+0.015
Neg logit contributions
 idea-0.016
 morning-0.014
 day-0.013
 people-0.013
 doctors-0.013
Token the
Token ID262
Feature activation-4.64e-01

Pos logit contributions
 idea+4.626
cut+4.616
opl+3.687
 exc+3.621
 ad+3.606
Neg logit contributions
ifying-6.658
igsaw-5.640
jamin-4.944
run-4.804
 nowhere-4.622
Token balloon
Token ID21190
Feature activation+8.46e-01

Pos logit contributions
ifying+1.130
igsaw+1.007
run+0.877
chie+0.858
 brief+0.818
Neg logit contributions
 idea-0.524
 morning-0.433
 day-0.430
 people-0.424
 prevent-0.408
Token fly
Token ID6129
Feature activation+4.09e-01

Pos logit contributions
 idea+0.902
 doctors+0.753
 morning+0.749
 people+0.721
 day+0.720
Neg logit contributions
ifying-2.012
igsaw-1.783
run-1.583
chie-1.533
 brief-1.488
Token high
Token ID1029
Feature activation-8.57e-01

Pos logit contributions
 idea+0.564
 doctors+0.482
 morning+0.475
 paramedics+0.472
 people+0.470
Neg logit contributions
ifying-0.879
igsaw-0.761
run-0.663
chie-0.643
 brief-0.604
Token in
Token ID287
Feature activation-6.51e-01

Pos logit contributions
ifying+2.173
igsaw+1.955
run+1.718
chie+1.682
jamin+1.604
Neg logit contributions
 idea-0.879
 morning-0.724
 day-0.723
 people-0.699
 prevent-0.655
Token and
Token ID290
Feature activation+2.25e-01

Pos logit contributions
 idea+1.218
 prevent+1.038
 doctors+1.028
 paramedics+1.024
 morning+1.003
Neg logit contributions
ifying-2.074
igsaw-1.889
run-1.601
chie-1.510
 brief-1.499
Token their
Token ID511
Feature activation+7.41e-01

Pos logit contributions
 idea+0.390
 morning+0.339
 prevent+0.332
 day+0.331
 hospital+0.331
Neg logit contributions
ifying-0.400
igsaw-0.337
run-0.281
chie-0.268
 brief-0.260
Token switch
Token ID5078
Feature activation+7.46e-02

Pos logit contributions
 idea+0.952
 morning+0.789
 prevent+0.776
crow+0.772
 hospital+0.750
Neg logit contributions
ifying-1.649
igsaw-1.433
run-1.252
chie-1.221
 brief-1.206
Token.
Token ID13
Feature activation+8.79e-01

Pos logit contributions
 idea+0.080
 morning+0.066
 day+0.064
 people+0.064
 prevent+0.063
Neg logit contributions
ifying-0.182
igsaw-0.161
run-0.142
chie-0.138
 brief-0.133
Token They
Token ID1119
Feature activation+4.85e-01

Pos logit contributions
 idea+1.331
 prevent+1.149
cut+1.108
 people+1.106
 hospital+1.098
Neg logit contributions
ifying-1.869
igsaw-1.560
run-1.346
jamin-1.212
lic-1.205
Token let
Token ID1309
Feature activation+3.32e+00

Pos logit contributions
 idea+0.865
 morning+0.773
 people+0.750
 day+0.750
 prevent+0.748
Neg logit contributions
ifying-0.834
igsaw-0.672
run-0.556
chie-0.544
 brief-0.524
Token each
Token ID1123
Feature activation+6.59e-01

Pos logit contributions
cut+4.579
 idea+4.468
thank+3.848
bye+3.737
 gard+3.717
Neg logit contributions
ifying-6.854
igsaw-5.522
run-5.028
jamin-4.858
chie-4.598
Token other
Token ID584
Feature activation+9.16e-01

Pos logit contributions
 idea+0.644
 morning+0.491
 doctors+0.469
 day+0.464
 paramedics+0.460
Neg logit contributions
ifying-1.672
igsaw-1.449
run-1.326
chie-1.302
 brief-1.253
Token use
Token ID779
Feature activation+5.56e-01

Pos logit contributions
 idea+0.940
 morning+0.730
 hospital+0.708
crow+0.708
cut+0.705
Neg logit contributions
ifying-2.248
igsaw-2.005
run-1.817
chie-1.737
 brief-1.711
Token the
Token ID262
Feature activation+8.79e-01

Pos logit contributions
 idea+0.589
 morning+0.478
 day+0.471
 prevent+0.456
bye+0.455
Neg logit contributions
ifying-1.367
igsaw-1.211
run-1.084
chie-1.026
 brief-1.004
Token switch
Token ID5078
Feature activation-1.56e-01

Pos logit contributions
 idea+1.039
 morning+0.907
 prevent+0.895
 doctors+0.889
crow+0.886
Neg logit contributions
ifying-1.975
igsaw-1.738
run-1.501
chie-1.483
 brief-1.454
Token says
Token ID1139
Feature activation+4.33e-01

Pos logit contributions
 idea+1.199
 day+1.115
 morning+1.088
 prevent+1.078
cut+1.053
Neg logit contributions
ifying-2.900
igsaw-2.684
run-2.290
jamin-2.262
 brief-2.228
Token.
Token ID13
Feature activation-3.17e-02

Pos logit contributions
 idea+0.452
 morning+0.382
 day+0.378
 prevent+0.364
 hospital+0.364
Neg logit contributions
ifying-1.053
igsaw-0.932
run-0.851
chie-0.805
 hips-0.785
Token
Token ID198
Feature activation+1.03e-02

Pos logit contributions
ifying+0.073
igsaw+0.065
run+0.057
chie+0.054
 brief+0.053
Neg logit contributions
 idea-0.039
 morning-0.032
 day-0.032
 people-0.031
 prevent-0.031
Token
Token ID198
Feature activation-6.93e-01

Pos logit contributions
 idea+0.012
 morning+0.010
 day+0.010
 people+0.010
 prevent+0.010
Neg logit contributions
ifying-0.024
igsaw-0.022
run-0.019
chie-0.018
jamin-0.017
TokenThey
Token ID2990
Feature activation+2.66e-01

Pos logit contributions
ifying+1.607
igsaw+1.424
run+1.222
chie+1.196
 brief+1.136
Neg logit contributions
 idea-0.865
 morning-0.736
 day-0.734
 people-0.733
 doctors-0.705
Token let
Token ID1309
Feature activation+3.31e+00

Pos logit contributions
 idea+0.322
 morning+0.273
 day+0.270
 doctors+0.270
 people+0.265
Neg logit contributions
ifying-0.618
igsaw-0.540
run-0.463
chie-0.462
 brief-0.442
Token go
Token ID467
Feature activation+3.72e-01

Pos logit contributions
cut+3.838
 idea+3.401
thank+3.386
 ad+3.226
 bal+3.221
Neg logit contributions
ifying-7.552
igsaw-6.577
jamin-5.557
run-5.500
|-5.304
Token of
Token ID286
Feature activation-5.56e-01

Pos logit contributions
 idea+0.402
 doctors+0.328
 morning+0.320
 people+0.316
 hospital+0.313
Neg logit contributions
ifying-0.898
igsaw-0.802
run-0.716
chie-0.698
 brief-0.648
Token the
Token ID262
Feature activation+6.47e-01

Pos logit contributions
ifying+1.470
igsaw+1.321
run+1.167
chie+1.146
 brief+1.095
Neg logit contributions
 idea-0.511
 morning-0.402
 day-0.400
 people-0.396
 prevent-0.374
Token jet
Token ID12644
Feature activation+8.23e-01

Pos logit contributions
 idea+0.792
 morning+0.658
 prevent+0.653
 doctors+0.648
 paramedics+0.634
Neg logit contributions
ifying-1.469
igsaw-1.319
chie-1.097
run-1.088
 brief-1.063
Token and
Token ID290
Feature activation+6.23e-01

Pos logit contributions
 idea+0.868
 morning+0.728
 doctors+0.719
 paramedics+0.709
 people+0.697
Neg logit contributions
ifying-1.953
igsaw-1.754
run-1.536
chie-1.522
 brief-1.460
Token by
Token ID416
Feature activation-6.61e-01

Pos logit contributions
ifying+2.255
igsaw+2.022
run+1.765
chie+1.745
jamin+1.662
Neg logit contributions
 idea-0.907
 day-0.729
 morning-0.721
 people-0.720
 doctors-0.677
Token the
Token ID262
Feature activation+7.37e-01

Pos logit contributions
ifying+1.690
igsaw+1.507
run+1.319
chie+1.291
 brief+1.242
Neg logit contributions
 idea-0.685
 day-0.556
 morning-0.555
 people-0.547
 prevent-0.520
Token swings
Token ID26728
Feature activation+5.49e-01

Pos logit contributions
 idea+0.819
 morning+0.635
 prevent+0.620
 doctors+0.609
 day+0.606
Neg logit contributions
ifying-1.814
igsaw-1.566
chie-1.358
run-1.324
 brief-1.304
Token.
Token ID13
Feature activation+8.71e-01

Pos logit contributions
 idea+0.588
 doctors+0.473
 paramedics+0.465
 morning+0.465
 people+0.464
Neg logit contributions
ifying-1.347
igsaw-1.166
run-1.051
chie-1.002
 brief-0.968
Token She
Token ID1375
Feature activation+4.99e-01

Pos logit contributions
 idea+1.195
 prevent+1.023
 day+1.015
cut+1.013
 doctors+1.011
Neg logit contributions
ifying-1.790
igsaw-1.612
run-1.404
chie-1.307
 hips-1.280
Token let
Token ID1309
Feature activation+3.27e+00

Pos logit contributions
 idea+0.691
 morning+0.589
 people+0.580
 day+0.578
 prevent+0.576
Neg logit contributions
ifying-1.033
igsaw-0.897
run-0.805
chie-0.766
jamin-0.748
Token a
Token ID257
Feature activation-1.17e+00

Pos logit contributions
cut+4.433
 idea+4.289
bye+3.575
 paramedics+3.556
 ot+3.448
Neg logit contributions
ifying-6.468
igsaw-5.702
run-5.179
jamin-4.926
chie-4.890
Token little
Token ID1310
Feature activation+1.71e-01

Pos logit contributions
ifying+2.759
igsaw+2.426
run+2.086
chie+2.041
jamin+1.968
Neg logit contributions
 idea-1.491
 day-1.258
 morning-1.250
 people-1.228
 prevent-1.183
Token bit
Token ID1643
Feature activation+6.87e-01

Pos logit contributions
 idea+0.228
 morning+0.196
 day+0.193
 people+0.192
 doctors+0.190
Neg logit contributions
ifying-0.368
igsaw-0.328
run-0.282
chie-0.275
 brief-0.262
Token of
Token ID286
Feature activation+4.80e-01

Pos logit contributions
 idea+0.783
 day+0.630
 morning+0.628
 doctors+0.611
 hospital+0.610
Neg logit contributions
ifying-1.623
igsaw-1.451
run-1.313
chie-1.244
 brief-1.168
Token sand
Token ID6450
Feature activation-4.46e-01

Pos logit contributions
 idea+0.493
 morning+0.411
 day+0.406
 doctors+0.399
 hospital+0.391
Neg logit contributions
ifying-1.154
igsaw-1.048
run-0.922
chie-0.892
 brief-0.859
Token Ben
Token ID3932
Feature activation-3.99e-01

Pos logit contributions
 idea+0.677
 day+0.552
 morning+0.550
 doctors+0.536
 hospital+0.528
Neg logit contributions
ifying-1.249
igsaw-1.123
run-0.960
chie-0.936
 brief-0.918
Token were
Token ID547
Feature activation+4.13e-01

Pos logit contributions
ifying+0.840
igsaw+0.733
run+0.623
chie+0.607
 brief+0.574
Neg logit contributions
 idea-0.572
 morning-0.496
 day-0.492
 people-0.486
 prevent-0.474
Token scared
Token ID12008
Feature activation+1.09e-01

Pos logit contributions
 idea+0.576
 morning+0.506
 day+0.496
 prevent+0.474
 people+0.469
Neg logit contributions
ifying-0.875
igsaw-0.745
run-0.654
chie-0.641
 brief-0.600
Token.
Token ID13
Feature activation+5.58e-01

Pos logit contributions
 idea+0.113
 morning+0.090
 day+0.088
 prevent+0.087
 people+0.085
Neg logit contributions
ifying-0.274
igsaw-0.244
run-0.219
chie-0.210
 brief-0.200
Token They
Token ID1119
Feature activation-7.44e-01

Pos logit contributions
 idea+0.778
 day+0.691
 morning+0.689
 doctors+0.676
 prevent+0.670
Neg logit contributions
ifying-1.163
igsaw-1.018
run-0.874
chie-0.847
 brief-0.785
Token let
Token ID1309
Feature activation+3.26e+00

Pos logit contributions
ifying+1.637
igsaw+1.467
run+1.237
chie+1.204
 brief+1.135
Neg logit contributions
 idea-1.013
 morning-0.860
 day-0.857
 people-0.857
 prevent-0.832
Token go
Token ID467
Feature activation-4.53e-01

Pos logit contributions
cut+4.279
 idea+3.787
thank+3.720
 paramedics+3.367
 doctors+3.251
Neg logit contributions
ifying-7.022
igsaw-6.155
run-5.551
chie-5.335
jamin-5.074
Token of
Token ID286
Feature activation-4.13e-01

Pos logit contributions
ifying+1.174
igsaw+1.053
run+0.927
chie+0.908
 brief+0.870
Neg logit contributions
 idea-0.436
 morning-0.348
 day-0.345
 people-0.339
 prevent-0.324
Token the
Token ID262
Feature activation+6.26e-01

Pos logit contributions
ifying+1.080
igsaw+0.971
run+0.855
chie+0.838
 brief+0.803
Neg logit contributions
 idea-0.389
 morning-0.309
 day-0.305
 people-0.299
 prevent-0.286
Token lamp
Token ID20450
Feature activation-4.58e-01

Pos logit contributions
 idea+0.920
crow+0.805
 morning+0.793
 paramedics+0.792
 prevent+0.784
Neg logit contributions
ifying-1.254
igsaw-1.093
run-0.896
chie-0.884
 brief-0.868
Token and
Token ID290
Feature activation+3.26e-01

Pos logit contributions
ifying+1.172
igsaw+1.051
run+0.924
chie+0.905
 brief+0.866
Neg logit contributions
 idea-0.451
 morning-0.362
 day-0.359
 people-0.353
 prevent-0.338
Token.
Token ID13
Feature activation+2.73e-02

Pos logit contributions
 idea+0.071
 morning+0.057
 people+0.056
 prevent+0.056
 day+0.055
Neg logit contributions
ifying-0.179
igsaw-0.160
run-0.141
chie-0.136
 brief-0.131
Token L
Token ID406
Feature activation-4.55e-01

Pos logit contributions
 idea+0.035
 day+0.029
 morning+0.029
 people+0.028
 prevent+0.028
Neg logit contributions
ifying-0.062
igsaw-0.055
run-0.048
chie-0.046
jamin-0.044
Tokenila
Token ID10102
Feature activation+4.12e-01

Pos logit contributions
ifying+1.133
igsaw+1.012
run+0.885
chie+0.867
 brief+0.828
Neg logit contributions
 idea-0.484
 morning-0.396
 day-0.393
 people-0.387
 prevent-0.372
Token and
Token ID290
Feature activation+8.31e-01

Pos logit contributions
 idea+0.506
 morning+0.422
 day+0.410
 prevent+0.407
 doctors+0.400
Neg logit contributions
ifying-0.934
igsaw-0.817
run-0.733
chie-0.698
jamin-0.689
Token Sam
Token ID3409
Feature activation+4.17e-01

Pos logit contributions
 idea+1.024
 morning+0.862
 day+0.852
 prevent+0.821
cut+0.809
Neg logit contributions
ifying-1.846
igsaw-1.670
run-1.439
 brief-1.386
chie-1.383
Token let
Token ID1309
Feature activation+3.26e+00

Pos logit contributions
 idea+0.637
 morning+0.559
 doctors+0.548
 day+0.547
 people+0.538
Neg logit contributions
ifying-0.827
igsaw-0.687
run-0.601
chie-0.586
jamin-0.559
Token them
Token ID606
Feature activation-2.17e-01

Pos logit contributions
cut+4.617
 idea+4.020
thank+3.641
 doctors+3.517
 paramedics+3.487
Neg logit contributions
ifying-6.739
igsaw-6.023
run-5.322
jamin-5.066
chie-4.896
Token have
Token ID423
Feature activation-1.01e-01

Pos logit contributions
ifying+0.532
igsaw+0.473
run+0.416
chie+0.405
 brief+0.387
Neg logit contributions
 idea-0.235
 morning-0.193
 day-0.190
 people-0.188
 doctors-0.184
Token a
Token ID257
Feature activation-1.28e+00

Pos logit contributions
ifying+0.244
igsaw+0.218
run+0.189
chie+0.184
 brief+0.178
Neg logit contributions
 idea-0.111
 morning-0.090
 day-0.089
 people-0.088
 prevent-0.087
Token turn
Token ID1210
Feature activation+1.21e-01

Pos logit contributions
ifying+3.016
igsaw+2.616
run+2.294
chie+2.293
years+2.147
Neg logit contributions
 idea-1.640
 day-1.417
 morning-1.401
 people-1.272
 prevent-1.262
Token,
Token ID11
Feature activation-2.66e-04

Pos logit contributions
 idea+0.126
 morning+0.102
 day+0.100
 people+0.099
 prevent+0.099
Neg logit contributions
ifying-0.300
igsaw-0.268
run-0.238
chie-0.229
 brief-0.220
Token and
Token ID290
Feature activation-9.91e-01

Pos logit contributions
ifying+0.043
igsaw+0.038
run+0.033
chie+0.032
jamin+0.031
Neg logit contributions
 idea-0.025
 morning-0.022
 day-0.021
 people-0.021
 doctors-0.020
Token Ben
Token ID3932
Feature activation-1.95e-01

Pos logit contributions
ifying+2.630
igsaw+2.359
run+2.057
chie+2.020
 brief+1.925
Neg logit contributions
 idea-0.940
 people-0.765
 morning-0.758
 day-0.741
 doctors-0.721
Token cried
Token ID16896
Feature activation+1.47e-01

Pos logit contributions
ifying+0.400
igsaw+0.342
run+0.295
chie+0.286
 brief+0.270
Neg logit contributions
 idea-0.287
 morning-0.252
 day-0.251
 people-0.244
 doctors-0.240
Token.
Token ID13
Feature activation-1.87e-01

Pos logit contributions
 idea+0.149
 morning+0.122
 day+0.121
 people+0.115
 prevent+0.114
Neg logit contributions
ifying-0.367
igsaw-0.326
run-0.291
chie-0.281
 brief-0.271
Token They
Token ID1119
Feature activation-1.08e+00

Pos logit contributions
ifying+0.440
igsaw+0.387
run+0.338
chie+0.328
 brief+0.314
Neg logit contributions
 idea-0.225
 morning-0.188
 day-0.187
 people-0.183
 prevent-0.180
Token let
Token ID1309
Feature activation+3.25e+00

Pos logit contributions
ifying+2.395
igsaw+2.222
run+1.829
chie+1.784
 brief+1.670
Neg logit contributions
 idea-1.462
 people-1.254
 prevent-1.210
 morning-1.208
 day-1.206
Token go
Token ID467
Feature activation+8.82e-02

Pos logit contributions
cut+3.797
 idea+3.417
thank+3.112
 artist+2.661
bye+2.642
Neg logit contributions
ifying-7.544
igsaw-6.403
chie-5.714
run-5.700
jamin-5.541
Token of
Token ID286
Feature activation-5.14e-01

Pos logit contributions
 idea+0.092
 morning+0.074
 doctors+0.072
 people+0.072
 day+0.072
Neg logit contributions
ifying-0.218
igsaw-0.195
run-0.174
chie-0.168
 brief-0.159
Token the
Token ID262
Feature activation+3.37e-01

Pos logit contributions
ifying+1.354
igsaw+1.218
run+1.074
chie+1.052
 brief+1.008
Neg logit contributions
 idea-0.475
 morning-0.375
 day-0.372
 people-0.368
 prevent-0.350
Token nest
Token ID16343
Feature activation-1.88e-01

Pos logit contributions
 idea+0.363
 morning+0.294
 day+0.285
 people+0.279
 prevent+0.279
Neg logit contributions
ifying-0.818
igsaw-0.726
run-0.632
chie-0.632
 brief-0.611
Token and
Token ID290
Feature activation+2.21e-01

Pos logit contributions
ifying+0.479
igsaw+0.429
run+0.379
chie+0.370
 brief+0.354
Neg logit contributions
 idea-0.183
 morning-0.147
 day-0.145
 people-0.143
 doctors-0.139
Token She
Token ID1375
Feature activation-8.45e-02

Pos logit contributions
ifying+1.089
igsaw+0.968
run+0.840
chie+0.821
 brief+0.782
Neg logit contributions
 idea-0.537
 morning-0.448
 day-0.445
 people-0.439
 prevent-0.424
Token opened
Token ID4721
Feature activation+5.11e-01

Pos logit contributions
ifying+0.189
igsaw+0.165
run+0.144
chie+0.140
jamin+0.134
Neg logit contributions
 idea-0.108
 morning-0.092
 day-0.090
 people-0.089
 prevent-0.087
Token the
Token ID262
Feature activation-5.24e-01

Pos logit contributions
 idea+0.563
 day+0.447
 doctors+0.444
 morning+0.440
 paramedics+0.437
Neg logit contributions
ifying-1.259
igsaw-1.115
run-0.963
chie-0.934
 brief-0.903
Token door
Token ID3420
Feature activation-3.50e-01

Pos logit contributions
ifying+1.088
igsaw+0.952
run+0.804
chie+0.781
jamin+0.735
Neg logit contributions
 idea-0.778
 morning-0.676
 day-0.672
 people-0.666
 prevent-0.647
Token and
Token ID290
Feature activation+2.54e-01

Pos logit contributions
ifying+0.884
igsaw+0.791
run+0.696
chie+0.678
 brief+0.650
Neg logit contributions
 idea-0.355
 morning-0.286
 day-0.284
 people-0.280
 doctors-0.270
Token let
Token ID1309
Feature activation+3.24e+00

Pos logit contributions
 idea+0.341
 morning+0.281
 day+0.281
 people+0.270
 doctors+0.268
Neg logit contributions
ifying-0.558
igsaw-0.482
run-0.418
chie-0.405
 brief-0.393
Token the
Token ID262
Feature activation+2.70e-01

Pos logit contributions
 idea+4.471
cut+4.135
 paramedics+4.084
 bal+3.898
 day+3.770
Neg logit contributions
ifying-6.309
igsaw-5.371
chie-4.571
run-4.547
 brief-4.405
Token bird
Token ID6512
Feature activation+2.91e-01

Pos logit contributions
 idea+0.273
 morning+0.227
 day+0.220
 prevent+0.216
 doctors+0.212
Neg logit contributions
ifying-0.657
igsaw-0.585
run-0.512
chie-0.511
 brief-0.494
Token out
Token ID503
Feature activation-5.94e-01

Pos logit contributions
 idea+0.259
 morning+0.209
 day+0.204
 doctors+0.200
 people+0.200
Neg logit contributions
ifying-0.748
igsaw-0.666
run-0.602
chie-0.589
 brief-0.560
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
ifying+1.565
igsaw+1.410
run+1.237
chie+1.218
jamin+1.168
Neg logit contributions
 idea-0.551
 morning-0.434
 day-0.431
 people-0.424
 prevent-0.401
Token The
Token ID383
Feature activation+6.07e-01

Pos logit contributions
 idea+0.381
 day+0.319
 morning+0.313
 people+0.309
 prevent+0.307
Neg logit contributions
ifying-0.662
igsaw-0.589
run-0.507
chie-0.487
jamin-0.463
Token inside
Token ID2641
Feature activation+5.09e-01

Pos logit contributions
 idea+1.187
 paramedics+1.036
 morning+1.025
 prevent+0.977
 people+0.975
Neg logit contributions
ifying-2.327
igsaw-2.079
run-1.815
chie-1.787
 brief-1.743
Token it
Token ID340
Feature activation+3.18e-02

Pos logit contributions
 idea+0.539
 morning+0.424
 paramedics+0.415
 doctors+0.411
 day+0.411
Neg logit contributions
ifying-1.265
igsaw-1.109
run-0.984
chie-0.939
jamin-0.900
Token,
Token ID11
Feature activation+4.83e-01

Pos logit contributions
 idea+0.031
 morning+0.025
 people+0.024
 day+0.024
 doctors+0.024
Neg logit contributions
ifying-0.081
igsaw-0.072
run-0.064
chie-0.061
 brief-0.059
Token so
Token ID523
Feature activation+7.75e-01

Pos logit contributions
 idea+0.770
 morning+0.685
 day+0.677
crow+0.656
 people+0.653
Neg logit contributions
ifying-0.912
igsaw-0.756
run-0.669
chie-0.622
 brief-0.599
Token she
Token ID673
Feature activation-4.33e-01

Pos logit contributions
 idea+0.899
crow+0.700
 morning+0.692
 prevent+0.692
 paramedics+0.691
Neg logit contributions
ifying-1.831
igsaw-1.586
run-1.425
 brief-1.341
chie-1.330
Token let
Token ID1309
Feature activation+3.23e+00

Pos logit contributions
ifying+1.042
igsaw+0.927
run+0.807
chie+0.789
 brief+0.752
Neg logit contributions
 idea-0.495
 morning-0.411
 day-0.408
 people-0.402
 prevent-0.388
Token herself
Token ID5223
Feature activation-2.35e-01

Pos logit contributions
 idea+3.982
cut+3.939
 ot+3.642
 paramedics+3.592
 ostr+3.513
Neg logit contributions
ifying-6.685
igsaw-5.770
chie-5.146
run-5.033
 brief-5.002
Token in
Token ID287
Feature activation-5.08e-01

Pos logit contributions
ifying+0.525
igsaw+0.464
run+0.401
chie+0.388
 brief+0.371
Neg logit contributions
 idea-0.302
 morning-0.261
 day-0.258
 people-0.255
 doctors-0.248
Token.
Token ID13
Feature activation+4.14e-01

Pos logit contributions
ifying+1.322
igsaw+1.186
run+1.042
chie+1.024
 brief+0.978
Neg logit contributions
 idea-0.493
 morning-0.392
 day-0.389
 people-0.383
 prevent-0.365
Token She
Token ID1375
Feature activation-4.85e-01

Pos logit contributions
 idea+0.556
 doctors+0.461
 day+0.457
 prevent+0.454
 morning+0.454
Neg logit contributions
ifying-0.905
igsaw-0.805
run-0.702
chie-0.658
jamin-0.641
Token stepped
Token ID10764
Feature activation-1.06e-01

Pos logit contributions
ifying+1.040
igsaw+0.912
run+0.775
chie+0.755
 brief+0.713
Neg logit contributions
 idea-0.684
 morning-0.590
 day-0.587
 people-0.581
 prevent-0.564
Token to
Token ID284
Feature activation+2.77e-02

Pos logit contributions
ifying+1.820
igsaw+1.619
run+1.404
chie+1.390
jamin+1.321
Neg logit contributions
 idea-0.808
 morning-0.677
 day-0.676
 people-0.657
 prevent-0.623
Token the
Token ID262
Feature activation+7.63e-02

Pos logit contributions
 idea+0.029
 morning+0.024
 day+0.023
 people+0.023
 doctors+0.023
Neg logit contributions
ifying-0.068
igsaw-0.062
run-0.054
chie-0.052
 brief-0.050
Token grass
Token ID8701
Feature activation-3.44e-01

Pos logit contributions
 idea+0.087
 morning+0.070
 day+0.069
 people+0.068
 prevent+0.068
Neg logit contributions
ifying-0.184
igsaw-0.163
run-0.139
chie-0.138
 brief-0.133
Token.
Token ID13
Feature activation+1.58e-01

Pos logit contributions
ifying+0.873
igsaw+0.781
run+0.687
chie+0.671
 brief+0.643
Neg logit contributions
 idea-0.344
 morning-0.277
 day-0.274
 people-0.271
 prevent-0.261
Token They
Token ID1119
Feature activation-1.11e-01

Pos logit contributions
 idea+0.202
 day+0.166
 people+0.166
 morning+0.166
 prevent+0.165
Neg logit contributions
ifying-0.361
igsaw-0.319
run-0.279
chie-0.261
jamin-0.255
Token let
Token ID1309
Feature activation+3.19e+00

Pos logit contributions
ifying+0.231
igsaw+0.199
run+0.170
chie+0.165
 brief+0.158
Neg logit contributions
 idea-0.162
 morning-0.140
 day-0.138
 people-0.136
 doctors-0.134
Token the
Token ID262
Feature activation-2.78e-02

Pos logit contributions
 idea+4.342
cut+4.293
thank+3.573
 exc+3.570
 bal+3.406
Neg logit contributions
ifying-6.764
igsaw-5.750
run-5.069
jamin-4.895
chie-4.612
Token balloon
Token ID21190
Feature activation+5.01e-01

Pos logit contributions
ifying+0.067
igsaw+0.059
run+0.052
chie+0.051
 brief+0.049
Neg logit contributions
 idea-0.031
 morning-0.025
 day-0.025
 people-0.024
 prevent-0.024
Token fly
Token ID6129
Feature activation-3.55e-02

Pos logit contributions
 idea+0.530
 morning+0.439
 doctors+0.436
 day+0.428
 people+0.427
Neg logit contributions
ifying-1.213
igsaw-1.072
run-0.952
chie-0.927
 brief-0.890
Token high
Token ID1029
Feature activation-8.80e-01

Pos logit contributions
ifying+0.081
igsaw+0.071
run+0.062
chie+0.060
 brief+0.057
Neg logit contributions
 idea-0.045
 morning-0.037
 people-0.037
 doctors-0.037
 day-0.037
Token in
Token ID287
Feature activation-5.55e-01

Pos logit contributions
ifying+2.229
igsaw+2.008
run+1.766
chie+1.725
 brief+1.648
Neg logit contributions
 idea-0.898
 morning-0.739
 day-0.736
 people-0.712
 prevent-0.671
TokenAfter
Token ID3260
Feature activation-8.91e-01

Pos logit contributions
 idea+0.310
 morning+0.261
 people+0.257
 day+0.256
 doctors+0.252
Neg logit contributions
ifying-0.459
igsaw-0.400
run-0.338
chie-0.330
jamin-0.314
Token a
Token ID257
Feature activation-9.99e-01

Pos logit contributions
ifying+2.197
igsaw+1.957
run+1.710
chie+1.697
jamin+1.602
Neg logit contributions
 idea-0.979
 morning-0.847
 day-0.840
 people-0.824
 doctors-0.789
Token while
Token ID981
Feature activation+5.77e-01

Pos logit contributions
ifying+2.431
igsaw+2.142
run+1.890
chie+1.844
jamin+1.759
Neg logit contributions
 idea-1.151
 day-1.001
 morning-0.990
 people-0.936
 prevent-0.903
Token,
Token ID11
Feature activation+3.98e-01

Pos logit contributions
 idea+0.604
 prevent+0.448
 people+0.437
 doctors+0.435
cut+0.435
Neg logit contributions
ifying-1.454
igsaw-1.290
run-1.144
chie-1.089
 brief-1.068
Token Lily
Token ID20037
Feature activation-1.84e-01

Pos logit contributions
 idea+0.525
 day+0.416
 prevent+0.415
 morning+0.414
 people+0.411
Neg logit contributions
ifying-0.891
igsaw-0.773
run-0.647
chie-0.626
 brief-0.612
Token let
Token ID1309
Feature activation+3.19e+00

Pos logit contributions
ifying+0.409
igsaw+0.358
run+0.309
chie+0.300
jamin+0.289
Neg logit contributions
 idea-0.243
 morning-0.206
 day-0.204
 people-0.201
 prevent-0.196
Token go
Token ID467
Feature activation-6.31e-01

Pos logit contributions
 idea+4.447
cut+4.156
 ostr+3.634
 bal+3.613
 exc+3.540
Neg logit contributions
ifying-6.387
igsaw-5.414
jamin-4.713
run-4.495
 nowhere-4.463
Token of
Token ID286
Feature activation+4.62e-01

Pos logit contributions
ifying+1.659
igsaw+1.489
run+1.307
chie+1.288
 brief+1.234
Neg logit contributions
 idea-0.588
 morning-0.472
 day-0.469
 people-0.455
 prevent-0.434
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
 idea+0.491
 morning+0.394
 day+0.389
 people+0.365
bye+0.364
Neg logit contributions
ifying-1.122
igsaw-0.989
run-0.868
chie-0.837
 brief-0.826
Token cow
Token ID9875
Feature activation+3.64e-01

Pos logit contributions
 idea+0.998
 bal+0.727
cut+0.714
 if+0.708
crow+0.703
Neg logit contributions
ifying-3.095
igsaw-2.686
run-2.351
 brief-2.350
chie-2.343
Token and
Token ID290
Feature activation+5.53e-01

Pos logit contributions
 idea+0.425
 morning+0.350
 day+0.344
 people+0.344
 doctors+0.336
Neg logit contributions
ifying-0.851
igsaw-0.756
run-0.660
chie-0.629
 brief-0.615
Token open
Token ID1280
Feature activation+1.74e-01

Pos logit contributions
 idea+0.197
 morning+0.170
 day+0.170
 people+0.163
 doctors+0.162
Neg logit contributions
ifying-0.386
igsaw-0.337
run-0.291
chie-0.289
 brief-0.278
Token the
Token ID262
Feature activation-1.76e+00

Pos logit contributions
 idea+0.182
 morning+0.144
 day+0.144
 doctors+0.140
 people+0.138
Neg logit contributions
ifying-0.438
igsaw-0.388
run-0.341
chie-0.330
 brief-0.318
Token box
Token ID3091
Feature activation-6.25e-02

Pos logit contributions
ifying+3.616
igsaw+3.186
run+2.791
chie+2.741
years+2.573
Neg logit contributions
 idea-2.751
 people-2.409
 day-2.360
 morning-2.343
 hospital-2.266
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
ifying+0.156
igsaw+0.139
run+0.124
chie+0.119
 brief+0.115
Neg logit contributions
 idea-0.063
 morning-0.052
 day-0.051
 people-0.050
 doctors-0.050
Token They
Token ID1119
Feature activation+2.08e-01

Pos logit contributions
ifying+0.400
igsaw+0.351
run+0.304
chie+0.292
 brief+0.281
Neg logit contributions
 idea-0.224
 morning-0.189
 day-0.189
 people-0.185
 prevent-0.181
Token let
Token ID1309
Feature activation+3.18e+00

Pos logit contributions
 idea+0.247
 morning+0.215
 day+0.214
 doctors+0.202
 people+0.201
Neg logit contributions
ifying-0.485
igsaw-0.421
run-0.366
chie-0.359
 brief-0.349
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
cut+4.199
 idea+4.063
 bal+3.774
thank+3.526
opl+3.432
Neg logit contributions
ifying-6.765
igsaw-5.395
run-4.917
 nowhere-4.778
jamin-4.661
Token rabbit
Token ID22746
Feature activation+7.89e-01

Pos logit contributions
ifying+2.978
igsaw+2.651
run+2.326
chie+2.257
jamin+2.112
Neg logit contributions
 idea-1.578
 people-1.312
 day-1.307
 morning-1.291
 doctors-1.233
Token go
Token ID467
Feature activation+4.72e-01

Pos logit contributions
 idea+0.909
 morning+0.806
 day+0.784
 doctors+0.775
 people+0.756
Neg logit contributions
ifying-1.807
igsaw-1.599
run-1.440
chie-1.363
 brief-1.310
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
 idea+0.529
 morning+0.431
 doctors+0.422
 people+0.411
 hospital+0.410
Neg logit contributions
ifying-1.123
igsaw-0.996
run-0.898
chie-0.868
 brief-0.816
Token The
Token ID383
Feature activation-4.17e-01

Pos logit contributions
 idea+0.412
 day+0.347
 morning+0.344
 doctors+0.337
 people+0.337
Neg logit contributions
ifying-0.675
igsaw-0.581
run-0.511
chie-0.473
 brief-0.465
Token,
Token ID11
Feature activation+7.10e-01

Pos logit contributions
 idea+1.758
cut+1.403
 hospital+1.387
 paramedics+1.385
 prevent+1.338
Neg logit contributions
ifying-3.240
igsaw-2.699
run-2.479
chie-2.305
 brief-2.286
Token the
Token ID262
Feature activation+6.76e-01

Pos logit contributions
 idea+0.838
 paramedics+0.649
 morning+0.646
 day+0.636
 hospital+0.632
Neg logit contributions
ifying-1.681
igsaw-1.437
run-1.267
chie-1.235
 brief-1.197
Token king
Token ID5822
Feature activation+9.39e-01

Pos logit contributions
 idea+0.868
 prevent+0.743
 morning+0.727
 bal+0.702
 paramedics+0.699
Neg logit contributions
ifying-1.471
igsaw-1.262
chie-1.106
 brief-1.048
run-1.039
Token decided
Token ID3066
Feature activation+6.60e-01

Pos logit contributions
 idea+1.196
 morning+1.007
 prevent+1.001
 paramedics+0.969
 doctors+0.956
Neg logit contributions
ifying-2.050
igsaw-1.752
run-1.598
chie-1.520
jamin-1.481
Token to
Token ID284
Feature activation+8.14e-01

Pos logit contributions
 idea+1.068
 morning+0.913
 day+0.891
 paramedics+0.865
crow+0.861
Neg logit contributions
ifying-1.240
igsaw-1.057
run-0.925
chie-0.881
 brief-0.868
Token let
Token ID1309
Feature activation+3.18e+00

Pos logit contributions
 idea+0.812
 morning+0.691
 doctors+0.668
 day+0.665
 paramedics+0.648
Neg logit contributions
ifying-1.980
igsaw-1.764
chie-1.529
run-1.516
 brief-1.450
Token people
Token ID661
Feature activation-1.60e-01

Pos logit contributions
 idea+4.094
cut+3.890
 paramedics+3.760
thank+3.538
 bal+3.494
Neg logit contributions
ifying-6.234
igsaw-5.562
run-5.129
chie-5.097
 brief-4.602
Token come
Token ID1282
Feature activation-3.68e-01

Pos logit contributions
ifying+0.400
igsaw+0.356
run+0.314
chie+0.308
 brief+0.294
Neg logit contributions
 idea-0.163
 morning-0.133
 day-0.131
 people-0.127
 doctors-0.125
Token to
Token ID284
Feature activation+2.64e-01

Pos logit contributions
ifying+0.882
igsaw+0.783
run+0.683
chie+0.667
 brief+0.636
Neg logit contributions
 idea-0.422
 morning-0.351
 day-0.348
 people-0.342
 doctors-0.331
Token visit
Token ID3187
Feature activation+1.73e-01

Pos logit contributions
 idea+0.283
 morning+0.227
 day+0.224
 paramedics+0.216
 doctors+0.215
Neg logit contributions
ifying-0.639
igsaw-0.572
run-0.496
chie-0.488
 brief-0.474
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
 idea+0.177
 morning+0.139
 day+0.136
 paramedics+0.133
 doctors+0.131
Neg logit contributions
ifying-0.432
igsaw-0.386
run-0.341
chie-0.331
 brief-0.320
Token other
Token ID584
Feature activation-1.86e-02

Pos logit contributions
 idea+0.421
 morning+0.305
 day+0.275
 doctors+0.274
 prevent+0.273
Neg logit contributions
ifying-1.682
igsaw-1.494
run-1.380
chie-1.338
 brief-1.298
Token.
Token ID13
Feature activation-9.66e-02

Pos logit contributions
ifying+0.047
igsaw+0.042
run+0.037
chie+0.036
 brief+0.035
Neg logit contributions
 idea-0.018
 morning-0.015
 day-0.014
 prevent-0.014
 people-0.014
Token \n
Token ID198
Feature activation+5.56e-01

Pos logit contributions
ifying+0.226
igsaw+0.199
run+0.172
chie+0.166
jamin+0.160
Neg logit contributions
 idea-0.119
 morning-0.099
 day-0.098
 people-0.096
 prevent-0.095
Token \n
Token ID198
Feature activation-3.02e-01

Pos logit contributions
 idea+0.685
 morning+0.556
 day+0.538
 prevent+0.532
 doctors+0.532
Neg logit contributions
ifying-1.276
igsaw-1.132
run-1.008
chie-0.938
 brief-0.912
TokenThey
Token ID2990
Feature activation-2.56e-01

Pos logit contributions
ifying+0.655
igsaw+0.572
run+0.488
chie+0.476
 brief+0.451
Neg logit contributions
 idea-0.424
 morning-0.363
 day-0.359
 people-0.355
 prevent-0.346
Token let
Token ID1309
Feature activation+3.16e+00

Pos logit contributions
ifying+0.570
igsaw+0.497
run+0.430
chie+0.421
 brief+0.400
Neg logit contributions
 idea-0.334
 morning-0.286
 day-0.285
 people-0.279
 doctors-0.270
Token go
Token ID467
Feature activation-3.83e-01

Pos logit contributions
cut+4.093
 idea+3.680
thank+3.368
 clim+3.238
 ostr+3.217
Neg logit contributions
ifying-6.903
igsaw-5.877
jamin-5.006
run-4.856
|-4.753
Token of
Token ID286
Feature activation-7.76e-01

Pos logit contributions
ifying+0.983
igsaw+0.881
run+0.777
chie+0.759
 brief+0.727
Neg logit contributions
 idea-0.373
 morning-0.298
 day-0.295
 people-0.291
 prevent-0.279
Token the
Token ID262
Feature activation+8.72e-01

Pos logit contributions
ifying+2.052
igsaw+1.838
run+1.626
chie+1.599
jamin+1.527
Neg logit contributions
 idea-0.731
 people-0.581
 day-0.578
 morning-0.578
 prevent-0.544
Token spring
Token ID6076
Feature activation+4.83e-01

Pos logit contributions
 idea+1.162
 whenever+0.994
 morning+0.968
crow+0.939
 day+0.936
Neg logit contributions
ifying-1.854
igsaw-1.644
 brief-1.377
run-1.353
chie-1.351
Token and
Token ID290
Feature activation-7.67e-02

Pos logit contributions
 idea+0.543
 people+0.442
 doctors+0.437
 prevent+0.436
 morning+0.435
Neg logit contributions
ifying-1.146
igsaw-1.025
run-0.885
chie-0.857
 brief-0.828
Token bird
Token ID6512
Feature activation+3.89e-01

Pos logit contributions
ifying+0.602
igsaw+0.541
run+0.472
chie+0.464
 brief+0.443
Neg logit contributions
 idea-0.251
 morning-0.204
 day-0.201
 people-0.198
 prevent-0.194
Token was
Token ID373
Feature activation+1.08e-01

Pos logit contributions
 idea+0.533
 morning+0.452
 paramedics+0.448
 doctors+0.445
 people+0.439
Neg logit contributions
ifying-0.817
igsaw-0.697
run-0.626
chie-0.599
 brief-0.570
Token very
Token ID845
Feature activation+3.20e-01

Pos logit contributions
 idea+0.133
 morning+0.114
 day+0.112
 people+0.110
 prevent+0.108
Neg logit contributions
ifying-0.243
igsaw-0.212
run-0.187
chie-0.182
 brief-0.174
Token tired
Token ID10032
Feature activation+7.27e-02

Pos logit contributions
 idea+0.411
 morning+0.357
 doctors+0.346
 day+0.345
 hospital+0.344
Neg logit contributions
ifying-0.698
igsaw-0.615
run-0.531
chie-0.530
 brief-0.492
Token and
Token ID290
Feature activation+4.18e-01

Pos logit contributions
 idea+0.076
 morning+0.060
 prevent+0.059
 day+0.059
 people+0.059
Neg logit contributions
ifying-0.180
igsaw-0.160
run-0.140
chie-0.137
 brief-0.131
Token let
Token ID1309
Feature activation+3.16e+00

Pos logit contributions
 idea+0.561
 morning+0.471
 day+0.462
 paramedics+0.461
 people+0.456
Neg logit contributions
ifying-0.892
igsaw-0.754
run-0.670
chie-0.652
 brief-0.621
Token out
Token ID503
Feature activation-1.14e+00

Pos logit contributions
 idea+4.199
cut+4.006
 bal+3.984
 paramedics+3.971
thank+3.634
Neg logit contributions
ifying-6.114
igsaw-4.959
chie-4.670
run-4.599
 nowhere-4.403
Token a
Token ID257
Feature activation-2.50e-01

Pos logit contributions
ifying+2.772
igsaw+2.463
chie+2.106
run+2.094
jamin+2.013
Neg logit contributions
 idea-1.376
 morning-1.142
 day-1.139
 people-1.131
 prevent-1.060
Token big
Token ID1263
Feature activation-2.67e-01

Pos logit contributions
ifying+0.567
igsaw+0.502
run+0.432
chie+0.422
 brief+0.404
Neg logit contributions
 idea-0.312
 morning-0.265
 day-0.262
 people-0.260
 doctors-0.254
Token yawn
Token ID40908
Feature activation+3.19e-01

Pos logit contributions
ifying+0.594
igsaw+0.528
run+0.451
chie+0.441
 brief+0.421
Neg logit contributions
 idea-0.343
 morning-0.293
 day-0.291
 people-0.290
 doctors-0.283
Token.
Token ID13
Feature activation+4.61e-01

Pos logit contributions
 idea+0.328
 morning+0.261
 people+0.256
 day+0.254
 prevent+0.253
Neg logit contributions
ifying-0.795
igsaw-0.694
run-0.623
chie-0.601
lic-0.581
Token gate
Token ID8946
Feature activation+2.30e-01

Pos logit contributions
ifying+0.110
igsaw+0.096
run+0.080
chie+0.078
 brief+0.075
Neg logit contributions
 idea-0.074
 morning-0.064
 day-0.063
 people-0.062
 prevent-0.062
Token.
Token ID13
Feature activation+6.71e-01

Pos logit contributions
 idea+0.236
 morning+0.192
 day+0.186
 people+0.183
 doctors+0.181
Neg logit contributions
ifying-0.579
igsaw-0.511
run-0.457
chie-0.437
 brief-0.423
Token She
Token ID1375
Feature activation-9.17e-01

Pos logit contributions
 idea+0.932
 day+0.794
 morning+0.779
 prevent+0.770
 people+0.761
Neg logit contributions
ifying-1.462
igsaw-1.311
run-1.119
chie-1.010
 brief-1.007
Token smiled
Token ID13541
Feature activation+2.46e-01

Pos logit contributions
ifying+2.180
igsaw+1.953
run+1.648
chie+1.641
 brief+1.532
Neg logit contributions
 idea-1.103
 people-0.927
 day-0.913
 morning-0.908
 doctors-0.894
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
 idea+0.270
 morning+0.222
 day+0.222
 prevent+0.217
 people+0.212
Neg logit contributions
ifying-0.603
igsaw-0.523
run-0.466
chie-0.447
 brief-0.440
Token let
Token ID1309
Feature activation+3.16e+00

Pos logit contributions
ifying+2.498
igsaw+2.234
chie+1.886
run+1.886
 brief+1.744
Neg logit contributions
 idea-1.289
 people-1.099
 morning-1.088
 day-1.080
 doctors-1.049
Token them
Token ID606
Feature activation-7.11e-02

Pos logit contributions
 idea+3.618
cut+3.447
 day+3.034
bye+3.031
 bal+3.020
Neg logit contributions
ifying-7.020
igsaw-6.263
run-5.688
 brief-5.141
jamin-5.044
Token in
Token ID287
Feature activation-1.03e-01

Pos logit contributions
ifying+0.186
igsaw+0.166
run+0.148
chie+0.144
 brief+0.139
Neg logit contributions
 idea-0.065
 morning-0.052
 day-0.051
 people-0.049
 doctors-0.048
Token.
Token ID13
Feature activation+5.29e-01

Pos logit contributions
ifying+0.254
igsaw+0.226
run+0.199
chie+0.192
 brief+0.186
Neg logit contributions
 idea-0.110
 morning-0.092
 day-0.090
 people-0.089
 doctors-0.087
Token They
Token ID1119
Feature activation-4.33e-01

Pos logit contributions
 idea+0.713
 day+0.596
 morning+0.589
 prevent+0.584
 people+0.576
Neg logit contributions
ifying-1.177
igsaw-1.044
run-0.903
chie-0.820
jamin-0.818
Token saw
Token ID2497
Feature activation+1.10e+00

Pos logit contributions
ifying+0.967
igsaw+0.852
run+0.732
chie+0.714
 brief+0.678
Neg logit contributions
 idea-0.567
 morning-0.484
 day-0.480
 people-0.474
 prevent-0.460
Token sun
Token ID4252
Feature activation+2.23e-01

Pos logit contributions
ifying+0.703
igsaw+0.624
run+0.539
chie+0.525
 brief+0.502
Neg logit contributions
 idea-0.371
 morning-0.314
 day-0.309
 people-0.306
 doctors-0.299
Token ball
Token ID2613
Feature activation+3.01e-01

Pos logit contributions
 idea+0.232
 prevent+0.187
 morning+0.186
 people+0.185
 doctors+0.183
Neg logit contributions
ifying-0.547
igsaw-0.486
run-0.426
chie-0.409
 brief-0.401
Token.
Token ID13
Feature activation+8.81e-01

Pos logit contributions
 idea+0.311
 morning+0.252
 prevent+0.248
 people+0.245
 doctors+0.242
Neg logit contributions
ifying-0.742
igsaw-0.661
run-0.586
chie-0.561
 brief-0.543
Token They
Token ID1119
Feature activation+2.75e-01

Pos logit contributions
 idea+1.254
 prevent+1.062
cut+1.056
 doctors+1.032
 morning+1.032
Neg logit contributions
ifying-1.928
igsaw-1.617
run-1.443
chie-1.306
lic-1.290
Token also
Token ID635
Feature activation-1.74e-01

Pos logit contributions
 idea+0.430
 morning+0.381
 day+0.371
 people+0.369
 doctors+0.369
Neg logit contributions
ifying-0.533
igsaw-0.446
run-0.376
chie-0.370
jamin-0.355
Token let
Token ID1309
Feature activation+3.15e+00

Pos logit contributions
ifying+0.372
igsaw+0.323
run+0.277
chie+0.270
 brief+0.257
Neg logit contributions
 idea-0.242
 morning-0.209
 day-0.206
 people-0.204
 doctors-0.200
Token other
Token ID584
Feature activation-1.12e-03

Pos logit contributions
cut+4.221
 idea+3.974
thank+3.441
bye+3.385
 paramedics+3.381
Neg logit contributions
ifying-6.450
igsaw-5.444
run-5.062
jamin-4.796
 nowhere-4.572
Token kids
Token ID3988
Feature activation+3.62e-03

Pos logit contributions
ifying+0.003
igsaw+0.003
run+0.002
chie+0.002
 brief+0.002
Neg logit contributions
 idea-0.001
 morning-0.001
 day-0.001
 prevent-0.001
 hospital-0.001
Token join
Token ID4654
Feature activation-5.22e-01

Pos logit contributions
 idea+0.004
 morning+0.003
 day+0.003
 people+0.003
 doctors+0.003
Neg logit contributions
ifying-0.009
igsaw-0.008
run-0.007
chie-0.006
 brief-0.006
Token them
Token ID606
Feature activation+2.37e-01

Pos logit contributions
ifying+1.392
igsaw+1.255
run+1.106
chie+1.086
 brief+1.042
Neg logit contributions
 idea-0.465
 morning-0.365
 day-0.363
 people-0.357
 prevent-0.336
Token and
Token ID290
Feature activation-1.47e-01

Pos logit contributions
 idea+0.250
 morning+0.205
 doctors+0.199
 day+0.196
 people+0.195
Neg logit contributions
ifying-0.580
igsaw-0.523
run-0.458
chie-0.439
 brief-0.428
Token!"
Token ID2474
Feature activation+2.90e-01

Pos logit contributions
ifying+0.546
igsaw+0.486
run+0.421
chie+0.412
 brief+0.392
Neg logit contributions
 idea-0.268
 morning-0.226
 day-0.223
 people-0.219
 prevent-0.214
Token \n
Token ID198
Feature activation+3.19e-01

Pos logit contributions
 idea+0.359
 day+0.295
 morning+0.294
 prevent+0.291
 doctors+0.288
Neg logit contributions
ifying-0.662
igsaw-0.590
run-0.518
chie-0.486
jamin-0.471
Token \n
Token ID198
Feature activation-2.65e-01

Pos logit contributions
 idea+0.400
 morning+0.334
 day+0.327
 prevent+0.318
 doctors+0.317
Neg logit contributions
ifying-0.719
igsaw-0.641
run-0.569
chie-0.536
jamin-0.514
TokenThey
Token ID2990
Feature activation-7.61e-01

Pos logit contributions
ifying+0.580
igsaw+0.509
run+0.435
chie+0.426
 brief+0.403
Neg logit contributions
 idea-0.366
 morning-0.313
 day-0.310
 people-0.303
 prevent-0.297
Token both
Token ID1111
Feature activation-2.66e-01

Pos logit contributions
ifying+1.634
igsaw+1.459
run+1.223
chie+1.189
 brief+1.123
Neg logit contributions
 idea-1.070
 morning-0.915
 people-0.915
 day-0.912
 prevent-0.889
Token let
Token ID1309
Feature activation+3.15e+00

Pos logit contributions
ifying+0.565
igsaw+0.491
run+0.423
chie+0.411
 brief+0.387
Neg logit contributions
 idea-0.377
 morning-0.326
 day-0.324
 people-0.320
 doctors-0.313
Token go
Token ID467
Feature activation+8.24e-02

Pos logit contributions
cut+3.692
 idea+3.181
 ostr+2.969
thank+2.873
 paramedics+2.741
Neg logit contributions
ifying-7.333
igsaw-6.448
run-5.432
jamin-5.405
chie-5.269
Token of
Token ID286
Feature activation-9.10e-01

Pos logit contributions
 idea+0.086
 morning+0.069
 doctors+0.068
 people+0.068
 day+0.067
Neg logit contributions
ifying-0.203
igsaw-0.183
run-0.163
chie-0.157
 brief-0.149
Token the
Token ID262
Feature activation+1.06e-01

Pos logit contributions
ifying+2.482
igsaw+2.235
run+1.987
chie+1.957
jamin+1.876
Neg logit contributions
 idea-0.782
 people-0.618
 day-0.603
 morning-0.601
 prevent-0.562
Token dress
Token ID6576
Feature activation+2.65e-01

Pos logit contributions
 idea+0.125
 morning+0.105
 day+0.103
 prevent+0.101
 people+0.101
Neg logit contributions
ifying-0.245
igsaw-0.219
run-0.187
chie-0.185
 brief-0.179
Token and
Token ID290
Feature activation-1.61e-02

Pos logit contributions
 idea+0.276
 morning+0.230
 day+0.225
 people+0.221
 doctors+0.218
Neg logit contributions
ifying-0.646
igsaw-0.581
run-0.511
chie-0.494
 brief-0.478
Token dance
Token ID9280
Feature activation-1.64e-01

Pos logit contributions
ifying+0.357
igsaw+0.316
run+0.275
chie+0.268
 brief+0.254
Neg logit contributions
 idea-0.186
 morning-0.155
 day-0.153
 people-0.151
 doctors-0.148
Token around
Token ID1088
Feature activation-1.53e-01

Pos logit contributions
ifying+0.410
igsaw+0.367
run+0.321
chie+0.315
 brief+0.300
Neg logit contributions
 idea-0.170
 morning-0.138
 day-0.137
 people-0.135
 doctors-0.132
Token!
Token ID0
Feature activation-7.82e-01

Pos logit contributions
ifying+0.388
igsaw+0.348
run+0.304
chie+0.298
 brief+0.285
Neg logit contributions
 idea-0.153
 morning-0.122
 day-0.120
 people-0.117
 prevent-0.116
Token \n
Token ID198
Feature activation-6.38e-01

Pos logit contributions
ifying+1.790
igsaw+1.577
run+1.362
chie+1.350
 brief+1.284
Neg logit contributions
 idea-0.976
 morning-0.843
 day-0.831
 people-0.819
 doctors-0.782
Token \n
Token ID198
Feature activation-1.05e+00

Pos logit contributions
ifying+1.594
igsaw+1.421
run+1.242
chie+1.224
 brief+1.171
Neg logit contributions
 idea-0.671
 morning-0.552
 day-0.548
 people-0.540
 prevent-0.513
TokenEm
Token ID10161
Feature activation-3.24e+00

Pos logit contributions
ifying+1.847
igsaw+1.566
run+1.280
chie+1.244
 brief+1.169
Neg logit contributions
 idea-1.854
 morning-1.683
 people-1.677
 day-1.675
 doctors-1.629
Tokenma
Token ID2611
Feature activation-3.67e-01

Pos logit contributions
ifying+8.335
igsaw+7.306
run+6.387
chie+6.101
lic+6.086
Neg logit contributions
 idea-3.477
 day-3.418
 morning-3.325
 night-3.009
 people-2.950
Token was
Token ID373
Feature activation+4.18e-01

Pos logit contributions
ifying+0.759
igsaw+0.661
run+0.561
chie+0.545
 brief+0.514
Neg logit contributions
 idea-0.541
 morning-0.469
 day-0.466
 people-0.461
 prevent-0.451
Token over
Token ID625
Feature activation-1.02e-01

Pos logit contributions
 idea+0.718
 morning+0.632
 day+0.626
 people+0.620
 doctors+0.619
Neg logit contributions
ifying-0.763
igsaw-0.637
run-0.539
chie-0.523
jamin-0.487
Tokenjoy
Token ID2633
Feature activation-4.72e-01

Pos logit contributions
ifying+0.156
igsaw+0.128
run+0.100
chie+0.095
 brief+0.088
Neg logit contributions
 idea-0.206
 morning-0.184
 day-0.184
 people-0.182
 prevent-0.181
Tokened
Token ID276
Feature activation-6.97e-02

Pos logit contributions
ifying+0.907
igsaw+0.782
run+0.650
chie+0.631
 brief+0.591
Neg logit contributions
 idea-0.769
 morning-0.678
 day-0.675
 people-0.669
 prevent-0.652
Token new
Token ID649
Feature activation+5.94e-01

Pos logit contributions
ifying+1.736
igsaw+1.537
run+1.345
chie+1.316
jamin+1.262
Neg logit contributions
 idea-0.801
 morning-0.660
 day-0.657
 people-0.639
 prevent-0.623
Token symbol
Token ID6194
Feature activation+2.06e-01

Pos logit contributions
 idea+0.689
 doctors+0.608
 people+0.596
 paramedics+0.595
 morning+0.592
Neg logit contributions
ifying-1.333
igsaw-1.219
run-1.023
chie-1.016
 brief-0.977
Token!
Token ID0
Feature activation-1.30e+00

Pos logit contributions
 idea+0.245
 morning+0.201
 people+0.198
 day+0.196
 prevent+0.194
Neg logit contributions
ifying-0.486
igsaw-0.428
run-0.375
chie-0.363
 brief-0.347
Token \n
Token ID198
Feature activation-6.71e-01

Pos logit contributions
ifying+2.997
igsaw+2.635
chie+2.304
run+2.293
 brief+2.215
Neg logit contributions
 idea-1.597
 morning-1.394
 day-1.367
 people-1.344
 hospital-1.264
Token \n
Token ID198
Feature activation-1.13e+00

Pos logit contributions
ifying+1.759
igsaw+1.578
run+1.387
chie+1.371
 brief+1.313
Neg logit contributions
 idea-0.627
 morning-0.501
 day-0.498
 people-0.489
 doctors-0.460
TokenEm
Token ID10161
Feature activation-3.16e+00

Pos logit contributions
ifying+2.053
igsaw+1.755
run+1.438
chie+1.406
 brief+1.322
Neg logit contributions
 idea-1.940
 day-1.752
 morning-1.750
 people-1.748
 doctors-1.703
Tokenma
Token ID2611
Feature activation-7.24e-01

Pos logit contributions
ifying+8.149
igsaw+7.115
run+6.315
lic+5.919
chie+5.863
Neg logit contributions
 idea-3.545
 day-3.381
 people-3.239
 morning-3.195
 night-2.883
Token's
Token ID338
Feature activation+4.50e-02

Pos logit contributions
ifying+1.561
igsaw+1.377
run+1.162
chie+1.132
 brief+1.076
Neg logit contributions
 idea-1.006
 morning-0.872
 day-0.870
 people-0.861
 prevent-0.830
Token friends
Token ID2460
Feature activation-9.73e-01

Pos logit contributions
 idea+0.049
 morning+0.041
 day+0.040
 prevent+0.039
 people+0.039
Neg logit contributions
ifying-0.108
igsaw-0.098
run-0.085
chie-0.083
 brief-0.079
Token cl
Token ID537
Feature activation-1.17e+00

Pos logit contributions
ifying+2.072
igsaw+1.853
run+1.545
chie+1.500
 brief+1.433
Neg logit contributions
 idea-1.378
 day-1.201
 morning-1.198
 people-1.190
 prevent-1.142
Tokenapped
Token ID6320
Feature activation+1.70e-01

Pos logit contributions
ifying+2.539
igsaw+2.266
run+1.887
chie+1.850
 brief+1.826
Neg logit contributions
 idea-1.580
 morning-1.342
 day-1.323
 people-1.304
 prevent-1.276
Token the
Token ID262
Feature activation-7.74e-01

Pos logit contributions
ifying+0.472
igsaw+0.424
run+0.374
chie+0.366
 brief+0.351
Neg logit contributions
 idea-0.156
 morning-0.119
 day-0.118
 people-0.116
 doctors-0.113
Token surface
Token ID4417
Feature activation-2.64e-02

Pos logit contributions
ifying+1.933
igsaw+1.730
run+1.526
chie+1.476
 brief+1.420
Neg logit contributions
 idea-0.829
 day-0.681
 morning-0.681
 people-0.677
 prevent-0.635
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
ifying+0.067
igsaw+0.060
run+0.053
chie+0.052
 brief+0.049
Neg logit contributions
 idea-0.026
 morning-0.021
 day-0.021
 people-0.020
 doctors-0.020
Token \n
Token ID198
Feature activation-3.91e-01

Pos logit contributions
ifying+2.868
igsaw+2.515
chie+2.198
run+2.176
 brief+2.101
Neg logit contributions
 idea-1.545
 morning-1.360
 day-1.331
 people-1.310
 hospital-1.235
Token \n
Token ID198
Feature activation-8.99e-01

Pos logit contributions
ifying+0.967
igsaw+0.864
run+0.755
chie+0.737
 brief+0.704
Neg logit contributions
 idea-0.421
 morning-0.344
 day-0.341
 people-0.336
 prevent-0.324
TokenEm
Token ID10161
Feature activation-3.07e+00

Pos logit contributions
ifying+1.827
igsaw+1.593
run+1.346
chie+1.310
 brief+1.239
Neg logit contributions
 idea-1.348
 morning-1.193
 day-1.188
 people-1.185
 prevent-1.150
Tokenma
Token ID2611
Feature activation-1.22e-01

Pos logit contributions
ifying+7.910
igsaw+6.949
run+6.115
lic+5.707
chie+5.680
Neg logit contributions
 idea-3.375
 day-3.250
 morning-3.082
 people-3.048
 prevent-2.814
Token's
Token ID338
Feature activation+5.28e-02

Pos logit contributions
ifying+0.263
igsaw+0.228
run+0.199
chie+0.193
jamin+0.183
Neg logit contributions
 idea-0.168
 morning-0.143
 day-0.141
 people-0.139
 doctors-0.137
Token friends
Token ID2460
Feature activation-3.86e-01

Pos logit contributions
 idea+0.061
 morning+0.052
 day+0.050
 prevent+0.049
 people+0.049
Neg logit contributions
ifying-0.124
igsaw-0.112
run-0.097
chie-0.095
 brief-0.091
Token cheered
Token ID34767
Feature activation-1.10e-01

Pos logit contributions
ifying+0.817
igsaw+0.713
run+0.609
chie+0.593
 brief+0.559
Neg logit contributions
 idea-0.548
 morning-0.473
 day-0.470
 people-0.464
 doctors-0.452
Token as
Token ID355
Feature activation-1.46e-01

Pos logit contributions
ifying+0.280
igsaw+0.250
run+0.220
chie+0.216
jamin+0.207
Neg logit contributions
 idea-0.108
 morning-0.087
 day-0.086
 people-0.084
 doctors-0.082
Token he
Token ID339
Feature activation-8.40e-01

Pos logit contributions
ifying+0.426
igsaw+0.382
run+0.335
chie+0.325
jamin+0.315
Neg logit contributions
 idea-0.170
 morning-0.137
 day-0.135
 people-0.132
 prevent-0.130
Token was
Token ID373
Feature activation+2.43e-01

Pos logit contributions
ifying+1.961
igsaw+1.737
run+1.479
chie+1.457
 brief+1.381
Neg logit contributions
 idea-1.054
 day-0.893
 people-0.887
 morning-0.884
 prevent-0.848
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 idea+0.263
 morning+0.217
 day+0.209
 people+0.207
 doctors+0.204
Neg logit contributions
ifying-0.589
igsaw-0.525
run-0.467
chie-0.448
 brief-0.431
Token \n
Token ID198
Feature activation-6.45e-02

Pos logit contributions
ifying+2.796
igsaw+2.465
chie+2.200
run+2.150
 brief+2.082
Neg logit contributions
 idea-1.129
 morning-0.951
 day-0.937
 people-0.914
 hospital-0.863
Token \n
Token ID198
Feature activation-4.08e-01

Pos logit contributions
ifying+0.153
igsaw+0.138
run+0.121
chie+0.115
jamin+0.112
Neg logit contributions
 idea-0.075
 morning-0.061
 day-0.060
 people-0.060
 doctors-0.059
TokenEm
Token ID10161
Feature activation-3.03e+00

Pos logit contributions
ifying+0.867
igsaw+0.759
run+0.645
chie+0.627
 brief+0.593
Neg logit contributions
 idea-0.584
 morning-0.504
 day-0.500
 people-0.495
 prevent-0.482
Tokenma
Token ID2611
Feature activation-2.84e-01

Pos logit contributions
ifying+7.990
igsaw+6.871
run+6.034
chie+5.978
lic+5.737
Neg logit contributions
 idea-3.331
 day-3.179
 morning-3.055
 people-2.874
 night-2.675
Token loved
Token ID6151
Feature activation-6.24e-01

Pos logit contributions
ifying+0.596
igsaw+0.521
run+0.444
chie+0.432
jamin+0.409
Neg logit contributions
 idea-0.407
 morning-0.351
 day-0.349
 people-0.344
 prevent-0.336
Token the
Token ID262
Feature activation+1.21e-01

Pos logit contributions
ifying+1.486
igsaw+1.313
run+1.137
chie+1.119
 brief+1.060
Neg logit contributions
 idea-0.739
 morning-0.621
 day-0.620
 people-0.612
 doctors-0.587
Token present
Token ID1944
Feature activation-1.10e-01

Pos logit contributions
 idea+0.150
 morning+0.130
 people+0.128
 prevent+0.128
 doctors+0.128
Neg logit contributions
ifying-0.270
igsaw-0.245
run-0.204
chie-0.202
jamin-0.194
Token and
Token ID290
Feature activation-3.99e-01

Pos logit contributions
ifying+0.276
igsaw+0.248
run+0.217
chie+0.211
 brief+0.203
Neg logit contributions
 idea-0.112
 morning-0.091
 day-0.089
 people-0.088
 prevent-0.086
Token his
Token ID465
Feature activation-8.89e-02

Pos logit contributions
ifying+0.940
igsaw+0.843
run+0.744
chie+0.728
 brief+0.696
Neg logit contributions
 idea-0.347
 morning-0.275
 day-0.272
 people-0.267
 prevent-0.258
Token friends
Token ID2460
Feature activation+9.68e-01

Pos logit contributions
ifying+0.219
igsaw+0.195
run+0.172
chie+0.168
 brief+0.161
Neg logit contributions
 idea-0.094
 morning-0.077
 day-0.076
 prevent-0.074
 people-0.074
Token dropped
Token ID5710
Feature activation-2.66e-01

Pos logit contributions
 idea+1.331
 morning+1.115
 day+1.080
 prevent+1.057
 doctors+1.054
Neg logit contributions
ifying-2.103
igsaw-1.747
run-1.596
chie-1.527
jamin-1.479
Token her
Token ID607
Feature activation-7.21e-01

Pos logit contributions
ifying+0.638
igsaw+0.564
run+0.493
chie+0.480
 brief+0.459
Neg logit contributions
 idea-0.307
 morning-0.254
 day-0.252
 people-0.248
 prevent-0.242
Token lip
Token ID10645
Feature activation-4.90e-01

Pos logit contributions
ifying+1.664
igsaw+1.484
run+1.276
chie+1.254
jamin+1.185
Neg logit contributions
 idea-0.898
 day-0.757
 morning-0.756
 people-0.742
 prevent-0.712
Token bal
Token ID3652
Feature activation-3.01e+00

Pos logit contributions
ifying+1.243
igsaw+1.114
run+0.977
chie+0.958
 brief+0.915
Neg logit contributions
 idea-0.494
 morning-0.400
 day-0.397
 people-0.390
 prevent-0.373
Tokenm
Token ID76
Feature activation-7.23e-01

Pos logit contributions
igsaw+5.185
ifying+5.063
 brief+3.625
chie+3.559
jamin+3.387
Neg logit contributions
 day-4.910
 if-4.901
 idea-4.850
 morning-4.825
 whenever-4.675
Token on
Token ID319
Feature activation-4.46e-02

Pos logit contributions
ifying+1.446
igsaw+1.262
run+1.048
chie+1.022
 brief+0.964
Neg logit contributions
 idea-1.132
 morning-0.984
 day-0.983
 people-0.973
 prevent-0.940
Token the
Token ID262
Feature activation-5.69e-02

Pos logit contributions
ifying+0.113
igsaw+0.101
run+0.088
chie+0.086
 brief+0.083
Neg logit contributions
 idea-0.045
 morning-0.036
 day-0.035
 people-0.035
 prevent-0.034
Token ground
Token ID2323
Feature activation-3.75e-01

Pos logit contributions
ifying+0.149
igsaw+0.133
run+0.117
chie+0.116
 brief+0.111
Neg logit contributions
 idea-0.052
 morning-0.040
 day-0.040
 people-0.039
 prevent-0.038
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
ifying+0.976
igsaw+0.875
run+0.773
chie+0.756
 brief+0.725
Neg logit contributions
 idea-0.351
 morning-0.278
 day-0.275
 people-0.271
 prevent-0.259
Token birds
Token ID10087
Feature activation-4.55e-01

Pos logit contributions
 idea+0.511
 morning+0.404
 day+0.402
 prevent+0.392
 people+0.380
Neg logit contributions
ifying-1.058
igsaw-0.908
run-0.799
chie-0.795
 brief-0.753
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
ifying+1.127
igsaw+1.007
run+0.880
chie+0.862
 brief+0.823
Neg logit contributions
 idea-0.483
 morning-0.395
 day-0.392
 people-0.386
 prevent-0.371
Token
Token ID220
Feature activation-8.59e-02

Pos logit contributions
ifying+2.212
igsaw+1.919
run+1.650
chie+1.646
 brief+1.564
Neg logit contributions
 idea-1.423
 morning-1.255
 day-1.235
 people-1.228
 doctors-1.166
Token
Token ID198
Feature activation-8.65e-01

Pos logit contributions
ifying+0.205
igsaw+0.184
run+0.161
chie+0.155
jamin+0.148
Neg logit contributions
 idea-0.099
 morning-0.082
 day-0.081
 people-0.079
 doctors-0.078
Token
Token ID198
Feature activation-1.12e+00

Pos logit contributions
ifying+2.216
igsaw+1.968
run+1.729
chie+1.717
 brief+1.646
Neg logit contributions
 idea-0.858
 morning-0.707
 day-0.703
 people-0.693
 prevent-0.646
TokenEm
Token ID10161
Feature activation-2.99e+00

Pos logit contributions
ifying+1.835
igsaw+1.517
run+1.233
chie+1.194
 brief+1.111
Neg logit contributions
 idea-2.111
 morning-1.937
 day-1.932
 people-1.928
 doctors-1.876
Tokenma
Token ID2611
Feature activation-5.74e-01

Pos logit contributions
ifying+7.569
igsaw+6.583
run+5.816
chie+5.458
lic+5.449
Neg logit contributions
 idea-3.421
 day-3.308
 morning-3.216
 people-2.989
 night-2.895
Token was
Token ID373
Feature activation+2.77e-01

Pos logit contributions
ifying+1.290
igsaw+1.140
run+0.976
chie+0.953
 brief+0.906
Neg logit contributions
 idea-0.743
 morning-0.634
 day-0.630
 people-0.624
 prevent-0.603
Token very
Token ID845
Feature activation+3.03e-01

Pos logit contributions
 idea+0.471
 morning+0.417
 day+0.413
 people+0.408
 doctors+0.403
Neg logit contributions
ifying-0.510
igsaw-0.427
run-0.361
chie-0.350
jamin-0.325
Token proud
Token ID6613
Feature activation+6.99e-02

Pos logit contributions
 idea+0.343
 morning+0.296
 day+0.287
 doctors+0.280
 hospital+0.276
Neg logit contributions
ifying-0.718
igsaw-0.639
run-0.562
chie-0.555
 brief-0.516
Token and
Token ID290
Feature activation+1.04e-01

Pos logit contributions
 idea+0.074
 morning+0.060
 day+0.058
 prevent+0.058
 people+0.057
Neg logit contributions
ifying-0.173
igsaw-0.154
run-0.137
chie-0.131
 brief-0.127
Token you
Token ID345
Feature activation-8.46e-01

Pos logit contributions
ifying+0.580
igsaw+0.508
run+0.438
chie+0.428
 brief+0.404
Neg logit contributions
 idea-0.347
 morning-0.295
 day-0.292
 people-0.288
 prevent-0.282
Token help
Token ID1037
Feature activation-3.05e-01

Pos logit contributions
ifying+1.948
igsaw+1.746
run+1.492
chie+1.449
 brief+1.390
Neg logit contributions
 idea-1.063
 morning-0.889
 day-0.887
 people-0.881
 prevent-0.856
Token?"
Token ID1701
Feature activation-9.64e-02

Pos logit contributions
ifying+0.676
igsaw+0.595
run+0.511
chie+0.500
jamin+0.473
Neg logit contributions
 idea-0.402
 morning-0.343
 day-0.340
 people-0.334
 doctors-0.326
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
ifying+0.226
igsaw+0.202
run+0.176
chie+0.169
jamin+0.163
Neg logit contributions
 idea-0.114
 day-0.094
 morning-0.094
 people-0.092
 prevent-0.091
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
ifying+0.806
igsaw+0.721
run+0.629
chie+0.613
jamin+0.585
Neg logit contributions
 idea-0.366
 morning-0.300
 day-0.298
 people-0.293
 prevent-0.285
TokenEm
Token ID10161
Feature activation-2.96e+00

Pos logit contributions
ifying+2.378
igsaw+1.986
run+1.654
chie+1.585
 brief+1.484
Neg logit contributions
 idea-2.301
 people-2.115
 morning-2.110
 day-2.095
 doctors-2.055
Tokenma
Token ID2611
Feature activation-1.19e+00

Pos logit contributions
ifying+7.569
igsaw+6.683
run+5.756
chie+5.547
lic+5.412
Neg logit contributions
 idea-3.379
 day-3.106
 morning-3.045
 people-2.811
 night-2.724
Token nodded
Token ID14464
Feature activation-5.90e-01

Pos logit contributions
ifying+2.483
igsaw+2.163
run+1.780
chie+1.755
 brief+1.646
Neg logit contributions
 idea-1.789
 morning-1.556
 day-1.556
 people-1.544
 prevent-1.490
Token and
Token ID290
Feature activation-2.75e-01

Pos logit contributions
ifying+1.502
igsaw+1.348
run+1.179
chie+1.157
 brief+1.105
Neg logit contributions
 idea-0.597
 morning-0.481
 day-0.477
 people-0.471
 prevent-0.449
Token got
Token ID1392
Feature activation+1.12e+00

Pos logit contributions
ifying+0.615
igsaw+0.542
run+0.469
chie+0.455
 brief+0.433
Neg logit contributions
 idea-0.356
 morning-0.301
 day-0.300
 people-0.295
 doctors-0.288
Token to
Token ID284
Feature activation+3.18e-01

Pos logit contributions
 idea+1.437
 day+1.174
 morning+1.154
 hospital+1.136
 doctors+1.105
Neg logit contributions
ifying-2.533
igsaw-2.189
run-2.007
chie-1.878
years-1.803
Token here
Token ID994
Feature activation-4.65e-02

Pos logit contributions
ifying+0.036
igsaw+0.032
run+0.028
chie+0.027
 brief+0.026
Neg logit contributions
 idea-0.020
 morning-0.017
 day-0.016
 people-0.016
 doctors-0.016
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
ifying+0.116
igsaw+0.103
run+0.091
chie+0.088
 brief+0.085
Neg logit contributions
 idea-0.048
 morning-0.039
 people-0.038
 day-0.038
 doctors-0.038
Token
Token ID220
Feature activation-1.84e-01

Pos logit contributions
ifying+2.875
igsaw+2.514
chie+2.211
run+2.180
 brief+2.095
Neg logit contributions
 idea-1.639
 morning-1.455
 day-1.425
 people-1.395
 whenever-1.313
Token
Token ID198
Feature activation-8.49e-01

Pos logit contributions
ifying+0.444
igsaw+0.396
run+0.347
chie+0.335
jamin+0.321
Neg logit contributions
 idea-0.207
 morning-0.171
 day-0.170
 people-0.166
 prevent-0.162
Token
Token ID198
Feature activation-8.90e-01

Pos logit contributions
ifying+2.169
igsaw+1.933
run+1.695
chie+1.685
 brief+1.608
Neg logit contributions
 idea-0.848
 morning-0.701
 day-0.698
 people-0.684
 prevent-0.641
TokenEm
Token ID10161
Feature activation-2.95e+00

Pos logit contributions
ifying+1.626
igsaw+1.392
run+1.154
chie+1.121
 brief+1.042
Neg logit contributions
 idea-1.522
 morning-1.372
 day-1.369
 people-1.360
 doctors-1.324
Tokenma
Token ID2611
Feature activation+5.83e-02

Pos logit contributions
ifying+7.314
igsaw+6.348
run+5.666
chie+5.313
lic+5.239
Neg logit contributions
 idea-3.515
 day-3.429
 morning-3.345
 people-3.089
 night-3.029
Token had
Token ID550
Feature activation-6.15e-01

Pos logit contributions
 idea+0.080
 morning+0.068
 day+0.067
 people+0.066
 doctors+0.066
Neg logit contributions
ifying-0.125
igsaw-0.109
run-0.095
chie-0.092
jamin-0.088
Token a
Token ID257
Feature activation-5.90e-01

Pos logit contributions
ifying+1.382
igsaw+1.222
run+1.045
chie+1.020
 brief+0.966
Neg logit contributions
 idea-0.806
 morning-0.690
 day-0.686
 people-0.679
 prevent-0.656
Token great
Token ID1049
Feature activation+7.21e-01

Pos logit contributions
ifying+1.430
igsaw+1.272
run+1.112
chie+1.087
 brief+1.030
Neg logit contributions
 idea-0.666
 morning-0.552
 day-0.549
 people-0.536
 prevent-0.517
Token time
Token ID640
Feature activation+3.29e-01

Pos logit contributions
 idea+0.368
 doctors+0.306
 people+0.298
 prevent+0.286
cut+0.283
Neg logit contributions
ifying-2.044
igsaw-1.950
run-1.689
 brief-1.649
chie-1.629
Token come
Token ID1282
Feature activation-4.17e-01

Pos logit contributions
ifying+0.019
igsaw+0.016
run+0.014
chie+0.014
 brief+0.013
Neg logit contributions
 idea-0.009
 morning-0.007
 day-0.007
 people-0.007
 doctors-0.007
Token out
Token ID503
Feature activation-6.33e-01

Pos logit contributions
ifying+0.995
igsaw+0.884
run+0.768
chie+0.751
 brief+0.715
Neg logit contributions
 idea-0.483
 morning-0.402
 day-0.399
 people-0.394
 prevent-0.380
Token!
Token ID0
Feature activation-1.25e+00

Pos logit contributions
ifying+1.643
igsaw+1.477
run+1.295
chie+1.275
 brief+1.221
Neg logit contributions
 idea-0.603
 morning-0.481
 day-0.479
 people-0.471
 prevent-0.447
Token
Token ID198
Feature activation-5.33e-01

Pos logit contributions
ifying+3.113
igsaw+2.741
chie+2.432
run+2.401
 brief+2.335
Neg logit contributions
 idea-1.321
 morning-1.128
 day-1.104
 people-1.081
 hospital-0.998
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
ifying+1.328
igsaw+1.184
run+1.035
chie+1.017
 brief+0.971
Neg logit contributions
 idea-0.564
 morning-0.462
 day-0.459
 people-0.452
 prevent-0.432
TokenEm
Token ID10161
Feature activation-2.94e+00

Pos logit contributions
ifying+2.206
igsaw+1.881
run+1.574
chie+1.536
 brief+1.447
Neg logit contributions
 idea-1.885
 people-1.689
 morning-1.689
 day-1.688
 prevent-1.635
Tokenma
Token ID2611
Feature activation-3.38e-01

Pos logit contributions
ifying+7.496
igsaw+6.495
run+5.709
chie+5.394
lic+5.368
Neg logit contributions
 idea-3.346
 day-3.153
 morning-3.037
 people-2.914
 night-2.732
Token was
Token ID373
Feature activation-7.36e-02

Pos logit contributions
ifying+0.724
igsaw+0.633
run+0.542
chie+0.527
jamin+0.499
Neg logit contributions
 idea-0.472
 morning-0.406
 day-0.403
 people-0.399
 prevent-0.388
Token so
Token ID523
Feature activation-3.69e-01

Pos logit contributions
ifying+0.141
igsaw+0.121
run+0.102
chie+0.098
jamin+0.092
Neg logit contributions
 idea-0.120
 morning-0.105
 day-0.105
 people-0.103
 prevent-0.101
Token sad
Token ID6507
Feature activation-3.41e-01

Pos logit contributions
ifying+0.930
igsaw+0.831
run+0.730
chie+0.714
 brief+0.682
Neg logit contributions
 idea-0.380
 morning-0.309
 day-0.306
 people-0.301
 doctors-0.290
Token.
Token ID13
Feature activation-6.51e-01

Pos logit contributions
ifying+0.862
igsaw+0.772
run+0.678
chie+0.662
 brief+0.634
Neg logit contributions
 idea-0.346
 morning-0.280
 day-0.277
 people-0.273
 prevent-0.263
Token down
Token ID866
Feature activation-4.09e-01

Pos logit contributions
ifying+0.876
igsaw+0.781
run+0.683
chie+0.668
 brief+0.637
Neg logit contributions
 idea-0.382
 morning-0.312
 day-0.310
 people-0.305
 doctors-0.294
Token safely
Token ID11512
Feature activation-5.34e-01

Pos logit contributions
ifying+1.013
igsaw+0.905
run+0.792
chie+0.775
 brief+0.740
Neg logit contributions
 idea-0.433
 morning-0.353
 day-0.351
 people-0.345
 prevent-0.332
Token."
Token ID526
Feature activation-3.14e-01

Pos logit contributions
ifying+1.321
igsaw+1.181
run+1.033
chie+1.010
 brief+0.967
Neg logit contributions
 idea-0.567
 morning-0.465
 day-0.462
 people-0.454
 prevent-0.435
Token
Token ID198
Feature activation-4.33e-01

Pos logit contributions
ifying+0.734
igsaw+0.654
run+0.567
chie+0.551
jamin+0.525
Neg logit contributions
 idea-0.377
 morning-0.314
 day-0.313
 people-0.308
 prevent-0.300
Token
Token ID198
Feature activation-1.22e+00

Pos logit contributions
ifying+1.067
igsaw+0.953
run+0.832
chie+0.814
 brief+0.777
Neg logit contributions
 idea-0.470
 morning-0.385
 day-0.382
 people-0.377
 prevent-0.363
TokenEm
Token ID10161
Feature activation-2.93e+00

Pos logit contributions
ifying+2.401
igsaw+2.024
run+1.703
chie+1.649
 brief+1.547
Neg logit contributions
 idea-1.978
 people-1.758
 day-1.755
 morning-1.753
 prevent-1.718
Tokenma
Token ID2611
Feature activation-2.90e-01

Pos logit contributions
ifying+7.518
igsaw+6.524
run+5.782
chie+5.457
lic+5.336
Neg logit contributions
 idea-3.434
 day-3.142
 morning-3.029
 people-2.969
 prevent-2.772
Token understood
Token ID7247
Feature activation-5.05e-01

Pos logit contributions
ifying+0.582
igsaw+0.507
run+0.429
chie+0.415
jamin+0.392
Neg logit contributions
 idea-0.439
 morning-0.383
 day-0.380
 people-0.378
 doctors-0.369
Token what
Token ID644
Feature activation-4.94e-01

Pos logit contributions
ifying+1.270
igsaw+1.135
run+0.993
chie+0.975
 brief+0.930
Neg logit contributions
 idea-0.528
 morning-0.429
 day-0.426
 people-0.420
 prevent-0.403
Token they
Token ID484
Feature activation-5.01e-01

Pos logit contributions
ifying+1.229
igsaw+1.097
run+0.959
chie+0.940
 brief+0.897
Neg logit contributions
 idea-0.529
 morning-0.433
 day-0.430
 people-0.423
 prevent-0.406
Token meant
Token ID4001
Feature activation+9.27e-02

Pos logit contributions
ifying+1.129
igsaw+0.996
run+0.855
chie+0.834
 brief+0.791
Neg logit contributions
 idea-0.657
 morning-0.558
 day-0.556
 people-0.548
 prevent-0.532
Token the
Token ID262
Feature activation-3.00e-01

Pos logit contributions
 idea+0.170
 morning+0.144
 day+0.143
 prevent+0.139
 people+0.137
Neg logit contributions
ifying-0.275
igsaw-0.240
run-0.208
chie-0.199
 brief-0.192
Token terrible
Token ID7818
Feature activation-4.39e-01

Pos logit contributions
ifying+0.634
igsaw+0.554
run+0.468
chie+0.458
 brief+0.433
Neg logit contributions
 idea-0.430
 morning-0.372
 day-0.369
 people-0.363
 prevent-0.358
Token mess
Token ID2085
Feature activation-6.04e-01

Pos logit contributions
ifying+0.962
igsaw+0.846
run+0.724
chie+0.706
 brief+0.669
Neg logit contributions
 idea-0.593
 morning-0.508
 day-0.505
 people-0.499
 prevent-0.485
Token.
Token ID13
Feature activation-5.00e-01

Pos logit contributions
ifying+1.507
igsaw+1.348
run+1.178
chie+1.159
 brief+1.105
Neg logit contributions
 idea-0.631
 morning-0.516
 day-0.514
 people-0.507
 doctors-0.482
Token His
Token ID2399
Feature activation-5.39e-01

Pos logit contributions
ifying+1.123
igsaw+0.989
run+0.850
chie+0.831
 brief+0.788
Neg logit contributions
 idea-0.653
 morning-0.557
 day-0.553
 people-0.547
 prevent-0.529
Token mar
Token ID1667
Feature activation-2.92e+00

Pos logit contributions
ifying+1.352
igsaw+1.207
run+1.056
chie+1.036
 brief+0.987
Neg logit contributions
 idea-0.570
 morning-0.465
 day-0.463
 people-0.455
 prevent-0.434
Tokenbles
Token ID7689
Feature activation-1.13e+00

Pos logit contributions
ifying+5.485
igsaw+4.815
run+3.720
jamin+3.576
 brief+3.506
Neg logit contributions
 idea-5.094
 people-4.724
 doctors-4.292
cut-4.289
 day-4.251
Token were
Token ID547
Feature activation-9.42e-01

Pos logit contributions
ifying+2.524
igsaw+2.257
run+1.886
chie+1.860
 brief+1.758
Neg logit contributions
 idea-1.533
 day-1.316
 people-1.312
 morning-1.296
 prevent-1.253
Token everywhere
Token ID8347
Feature activation-4.27e-01

Pos logit contributions
ifying+2.030
igsaw+1.814
run+1.520
chie+1.499
jamin+1.417
Neg logit contributions
 idea-1.302
 day-1.122
 people-1.119
 morning-1.115
 doctors-1.085
Token!
Token ID0
Feature activation-3.73e-01

Pos logit contributions
ifying+1.044
igsaw+0.931
run+0.813
chie+0.795
 brief+0.759
Neg logit contributions
 idea-0.468
 morning-0.386
 day-0.382
 people-0.377
 prevent-0.363
Token He
Token ID679
Feature activation-1.13e+00

Pos logit contributions
ifying+0.828
igsaw+0.730
run+0.626
chie+0.609
 brief+0.578
Neg logit contributions
 idea-0.498
 morning-0.425
 day-0.423
 people-0.417
 prevent-0.406
Token with
Token ID351
Feature activation-5.70e-03

Pos logit contributions
ifying+0.467
igsaw+0.415
run+0.362
chie+0.353
 brief+0.337
Neg logit contributions
 idea-0.218
 morning-0.178
 day-0.177
 people-0.173
 prevent-0.169
Token me
Token ID502
Feature activation-1.10e+00

Pos logit contributions
ifying+0.012
igsaw+0.011
run+0.009
chie+0.009
 brief+0.008
Neg logit contributions
 idea-0.008
 morning-0.007
 day-0.007
 people-0.007
 prevent-0.007
Token?"
Token ID1701
Feature activation-4.07e-01

Pos logit contributions
ifying+2.783
igsaw+2.524
run+2.171
chie+2.152
 brief+2.041
Neg logit contributions
 idea-1.134
 morning-0.939
 day-0.938
 people-0.915
 prevent-0.855
Token
Token ID198
Feature activation-2.85e-01

Pos logit contributions
ifying+0.989
igsaw+0.882
run+0.768
chie+0.751
 brief+0.717
Neg logit contributions
 idea-0.451
 morning-0.373
 day-0.370
 people-0.364
 prevent-0.351
Token
Token ID198
Feature activation-9.77e-01

Pos logit contributions
ifying+0.688
igsaw+0.616
run+0.536
chie+0.524
 brief+0.500
Neg logit contributions
 idea-0.317
 morning-0.262
 day-0.259
 people-0.254
 doctors-0.247
TokenMel
Token ID21102
Feature activation-2.92e+00

Pos logit contributions
ifying+2.096
igsaw+1.815
run+1.566
chie+1.502
 brief+1.419
Neg logit contributions
 idea-1.384
 people-1.209
 day-1.189
 morning-1.188
 prevent-1.152
Tokenody
Token ID1118
Feature activation-1.20e+00

Pos logit contributions
ifying+6.651
igsaw+5.621
run+5.014
chie+4.522
 brief+4.413
Neg logit contributions
 idea-4.256
 people-3.902
 day-3.739
 morning-3.557
crow-3.498
Token was
Token ID373
Feature activation+5.51e-01

Pos logit contributions
ifying+2.654
igsaw+2.338
run+1.963
chie+1.923
 brief+1.811
Neg logit contributions
 idea-1.678
 day-1.421
 morning-1.416
 people-1.404
 prevent-1.339
Token very
Token ID845
Feature activation+5.84e-01

Pos logit contributions
 idea+0.837
 morning+0.753
 day+0.744
 prevent+0.733
 doctors+0.729
Neg logit contributions
ifying-1.080
igsaw-0.914
run-0.806
chie-0.766
jamin-0.743
Token excited
Token ID6568
Feature activation-4.02e-01

Pos logit contributions
 idea+0.607
 morning+0.527
 doctors+0.514
 hospital+0.507
 day+0.503
Neg logit contributions
ifying-1.406
igsaw-1.260
run-1.118
chie-1.093
jamin-1.042
Token and
Token ID290
Feature activation-2.95e-02

Pos logit contributions
ifying+1.028
igsaw+0.922
run+0.811
chie+0.793
 brief+0.760
Neg logit contributions
 idea-0.395
 morning-0.317
 day-0.314
 people-0.309
 prevent-0.297
Token diamond
Token ID15291
Feature activation-2.83e-01

Pos logit contributions
 idea+0.764
 paramedics+0.691
 doctors+0.688
 people+0.678
 morning+0.665
Neg logit contributions
ifying-1.335
igsaw-1.228
run-1.013
chie-0.993
 brief-0.974
Token!
Token ID0
Feature activation-1.32e+00

Pos logit contributions
ifying+0.682
igsaw+0.606
run+0.528
chie+0.516
 brief+0.493
Neg logit contributions
 idea-0.320
 morning-0.265
 day-0.262
 people-0.260
 prevent-0.251
Token
Token ID220
Feature activation-5.61e-02

Pos logit contributions
ifying+3.126
igsaw+2.733
chie+2.421
run+2.376
 brief+2.323
Neg logit contributions
 idea-1.548
 morning-1.352
 day-1.319
 people-1.299
 hospital-1.213
Token
Token ID198
Feature activation-9.37e-01

Pos logit contributions
ifying+0.134
igsaw+0.120
run+0.105
chie+0.101
jamin+0.097
Neg logit contributions
 idea-0.064
 morning-0.054
 day-0.053
 people-0.052
 doctors-0.051
Token
Token ID198
Feature activation-1.04e+00

Pos logit contributions
ifying+2.414
igsaw+2.143
run+1.877
chie+1.876
 brief+1.789
Neg logit contributions
 idea-0.925
 morning-0.759
 day-0.758
 people-0.748
 prevent-0.697
TokenEm
Token ID10161
Feature activation-2.92e+00

Pos logit contributions
ifying+1.990
igsaw+1.700
run+1.423
chie+1.392
 brief+1.304
Neg logit contributions
 idea-1.683
 morning-1.504
 day-1.503
 people-1.498
 doctors-1.457
Tokenma
Token ID2611
Feature activation-2.47e-01

Pos logit contributions
ifying+7.415
igsaw+6.415
run+5.680
lic+5.337
 brief+5.304
Neg logit contributions
 idea-3.432
 day-3.255
 morning-3.124
 people-3.027
 prevent-2.888
Token was
Token ID373
Feature activation+1.04e-01

Pos logit contributions
ifying+0.547
igsaw+0.481
run+0.416
chie+0.405
jamin+0.384
Neg logit contributions
 idea-0.327
 morning-0.278
 day-0.276
 people-0.272
 doctors-0.266
Token so
Token ID523
Feature activation+9.46e-02

Pos logit contributions
 idea+0.173
 morning+0.154
 day+0.152
 people+0.151
 doctors+0.148
Neg logit contributions
ifying-0.193
igsaw-0.163
run-0.138
chie-0.134
jamin-0.124
Token happy
Token ID3772
Feature activation-1.17e-01

Pos logit contributions
 idea+0.098
 morning+0.082
 day+0.080
 doctors+0.078
 people+0.078
Neg logit contributions
ifying-0.235
igsaw-0.211
run-0.187
chie-0.182
jamin-0.172
Token and
Token ID290
Feature activation+1.16e-01

Pos logit contributions
ifying+0.277
igsaw+0.247
run+0.215
chie+0.208
 brief+0.200
Neg logit contributions
 idea-0.138
 morning-0.114
 day-0.112
 people-0.111
 prevent-0.109
Token place
Token ID1295
Feature activation-5.16e-01

Pos logit contributions
 idea+0.764
 paramedics+0.626
 doctors+0.623
 people+0.611
 prevent+0.608
Neg logit contributions
ifying-2.244
igsaw-2.074
run-1.814
chie-1.769
 brief-1.739
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
ifying+1.236
igsaw+1.101
run+0.956
chie+0.937
 brief+0.893
Neg logit contributions
 idea-0.586
 morning-0.487
 day-0.485
 people-0.477
 prevent-0.460
Token
Token ID220
Feature activation-4.87e-02

Pos logit contributions
ifying+3.320
igsaw+2.951
chie+2.631
run+2.576
 brief+2.535
Neg logit contributions
 idea-1.432
 morning-1.265
 day-1.243
 people-1.175
 whenever-1.121
Token
Token ID198
Feature activation-4.73e-01

Pos logit contributions
ifying+0.117
igsaw+0.104
run+0.092
chie+0.088
jamin+0.084
Neg logit contributions
 idea-0.056
 morning-0.046
 day-0.046
 people-0.045
 doctors-0.044
Token
Token ID198
Feature activation-8.27e-01

Pos logit contributions
ifying+1.188
igsaw+1.062
run+0.930
chie+0.911
 brief+0.871
Neg logit contributions
 idea-0.491
 morning-0.400
 day-0.397
 people-0.391
 prevent-0.375
TokenEm
Token ID10161
Feature activation-2.92e+00

Pos logit contributions
ifying+1.816
igsaw+1.596
run+1.369
chie+1.335
 brief+1.270
Neg logit contributions
 idea-1.112
 day-0.966
 morning-0.966
 people-0.954
 prevent-0.925
Tokenma
Token ID2611
Feature activation-2.84e-01

Pos logit contributions
ifying+7.323
igsaw+6.395
run+5.604
lic+5.268
 brief+5.219
Neg logit contributions
 idea-3.438
 day-3.243
 morning-3.043
 people-3.016
 if-2.749
Token was
Token ID373
Feature activation+8.72e-02

Pos logit contributions
ifying+0.653
igsaw+0.578
run+0.501
chie+0.490
jamin+0.467
Neg logit contributions
 idea-0.348
 morning-0.292
 day-0.289
 people-0.286
 prevent-0.278
Token so
Token ID523
Feature activation-2.06e-01

Pos logit contributions
 idea+0.135
 morning+0.117
 day+0.116
 people+0.116
 doctors+0.113
Neg logit contributions
ifying-0.173
igsaw-0.148
run-0.127
chie-0.124
jamin-0.115
Token proud
Token ID6613
Feature activation+1.84e-01

Pos logit contributions
ifying+0.525
igsaw+0.470
run+0.415
chie+0.407
jamin+0.386
Neg logit contributions
 idea-0.205
 morning-0.166
 day-0.163
 people-0.161
 doctors-0.157
Token of
Token ID286
Feature activation-1.42e+00

Pos logit contributions
 idea+0.202
 morning+0.163
 prevent+0.160
 day+0.158
 people+0.158
Neg logit contributions
ifying-0.444
igsaw-0.398
run-0.355
chie-0.335
 brief-0.324
Token the
Token ID262
Feature activation+6.04e-02

Pos logit contributions
ifying+4.252
igsaw+3.773
run+3.412
chie+3.372
lic+3.249
Neg logit contributions
 idea-1.247
 people-1.049
 day-0.997
 morning-0.985
 hospital-0.913
Token night
Token ID1755
Feature activation+1.59e-01

Pos logit contributions
 idea+0.067
 prevent+0.052
 morning+0.052
 day+0.052
 people+0.052
Neg logit contributions
ifying-0.148
igsaw-0.131
run-0.112
chie-0.111
 brief-0.108
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
 idea+0.170
 prevent+0.131
 morning+0.131
 day+0.131
 people+0.128
Neg logit contributions
ifying-0.397
igsaw-0.354
run-0.305
chie-0.298
 brief-0.289
Token
Token ID198
Feature activation-4.34e-01

Pos logit contributions
ifying+3.318
igsaw+2.910
chie+2.600
run+2.563
 brief+2.469
Neg logit contributions
 idea-1.443
 morning-1.252
 day-1.219
 people-1.190
 hospital-1.128
Token
Token ID198
Feature activation-9.61e-01

Pos logit contributions
ifying+1.084
igsaw+0.970
run+0.849
chie+0.831
 brief+0.794
Neg logit contributions
 idea-0.455
 morning-0.370
 day-0.367
 people-0.362
 prevent-0.348
TokenEm
Token ID10161
Feature activation-2.89e+00

Pos logit contributions
ifying+1.950
igsaw+1.693
run+1.433
chie+1.400
 brief+1.314
Neg logit contributions
 idea-1.455
 morning-1.297
 day-1.294
 people-1.283
 doctors-1.248
Tokenma
Token ID2611
Feature activation-1.95e-01

Pos logit contributions
ifying+7.295
igsaw+6.389
run+5.577
chie+5.246
lic+5.141
Neg logit contributions
 idea-3.319
 day-3.233
 morning-3.120
 people-3.041
 night-2.841
Token was
Token ID373
Feature activation-4.47e-01

Pos logit contributions
ifying+0.439
igsaw+0.384
run+0.335
chie+0.326
jamin+0.310
Neg logit contributions
 idea-0.252
 morning-0.212
 day-0.210
 people-0.207
 doctors-0.202
Token so
Token ID523
Feature activation-8.70e-01

Pos logit contributions
ifying+0.950
igsaw+0.831
run+0.707
chie+0.688
 brief+0.650
Neg logit contributions
 idea-0.638
 morning-0.552
 day-0.549
 people-0.543
 prevent-0.528
Token proud
Token ID6613
Feature activation+4.19e-01

Pos logit contributions
ifying+2.110
igsaw+1.885
run+1.622
chie+1.584
 brief+1.520
Neg logit contributions
 idea-1.005
 day-0.837
 morning-0.836
 people-0.833
 prevent-0.797
Token of
Token ID286
Feature activation-1.12e+00

Pos logit contributions
 idea+0.481
 morning+0.387
 prevent+0.383
 day+0.373
 people+0.369
Neg logit contributions
ifying-0.992
igsaw-0.888
run-0.798
chie-0.739
 brief-0.724
Token away
Token ID1497
Feature activation-5.78e-01

Pos logit contributions
ifying+0.174
igsaw+0.150
run+0.128
chie+0.125
jamin+0.117
Neg logit contributions
 idea-0.119
 morning-0.102
 day-0.101
 people-0.101
 doctors-0.100
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
ifying+1.498
igsaw+1.347
run+1.183
chie+1.160
 brief+1.111
Neg logit contributions
 idea-0.555
 morning-0.441
 day-0.439
 people-0.431
 prevent-0.412
Token
Token ID220
Feature activation+1.08e-01

Pos logit contributions
ifying+2.701
igsaw+2.356
chie+2.071
run+2.043
 brief+1.967
Neg logit contributions
 idea-1.539
 morning-1.356
 day-1.332
 people-1.297
 hospital-1.248
Token
Token ID198
Feature activation-6.97e-01

Pos logit contributions
 idea+0.128
 morning+0.106
 day+0.105
 people+0.103
 doctors+0.102
Neg logit contributions
ifying-0.255
igsaw-0.227
run-0.201
chie-0.191
jamin-0.183
Token
Token ID198
Feature activation-1.06e+00

Pos logit contributions
ifying+1.756
igsaw+1.566
run+1.368
chie+1.354
 brief+1.295
Neg logit contributions
 idea-0.719
 morning-0.591
 day-0.588
 people-0.577
 prevent-0.547
TokenEm
Token ID10161
Feature activation-2.88e+00

Pos logit contributions
ifying+2.204
igsaw+1.920
run+1.636
chie+1.599
 brief+1.520
Neg logit contributions
 idea-1.540
 morning-1.365
 day-1.365
 people-1.351
 doctors-1.307
Tokenma
Token ID2611
Feature activation-4.25e-01

Pos logit contributions
ifying+7.149
igsaw+6.248
run+5.411
 brief+5.171
lic+5.170
Neg logit contributions
 idea-3.423
 day-3.260
 morning-3.077
 people-2.929
 if-2.772
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
ifying+0.942
igsaw+0.829
run+0.712
chie+0.695
 brief+0.659
Neg logit contributions
 idea-0.560
 morning-0.478
 day-0.475
 people-0.469
 prevent-0.456
Token her
Token ID607
Feature activation-2.12e-01

Pos logit contributions
ifying+2.507
igsaw+2.175
run+1.923
chie+1.901
 brief+1.807
Neg logit contributions
 idea-1.154
 day-0.964
 morning-0.957
 people-0.953
 doctors-0.891
Token friends
Token ID2460
Feature activation-2.98e-01

Pos logit contributions
ifying+0.532
igsaw+0.478
run+0.418
chie+0.410
 brief+0.393
Neg logit contributions
 idea-0.217
 morning-0.177
 day-0.174
 people-0.170
 prevent-0.167
Token all
Token ID477
Feature activation-4.18e-01

Pos logit contributions
ifying+0.615
igsaw+0.533
run+0.453
chie+0.442
 brief+0.415
Neg logit contributions
 idea-0.439
 morning-0.381
 day-0.378
 people-0.374
 doctors-0.367
Token Emma
Token ID18966
Feature activation-1.06e+00

Pos logit contributions
 idea+0.470
 morning+0.402
 day+0.400
 prevent+0.397
 people+0.397
Neg logit contributions
ifying-0.632
igsaw-0.551
run-0.463
chie-0.445
jamin-0.426
Token!
Token ID0
Feature activation-2.81e-01

Pos logit contributions
ifying+2.611
igsaw+2.341
run+2.050
chie+2.006
 brief+1.933
Neg logit contributions
 idea-1.148
 morning-0.953
 day-0.952
 people-0.927
 doctors-0.864
Token'.
Token ID4458
Feature activation-9.48e-01

Pos logit contributions
ifying+0.586
igsaw+0.510
run+0.434
chie+0.420
jamin+0.397
Neg logit contributions
 idea-0.411
 morning-0.355
 day-0.354
 people-0.350
 prevent-0.342
Token
Token ID198
Feature activation-9.79e-01

Pos logit contributions
ifying+2.118
igsaw+1.857
chie+1.594
run+1.593
 brief+1.512
Neg logit contributions
 idea-1.244
 morning-1.076
 day-1.059
 people-1.050
 doctors-1.001
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
ifying+2.525
igsaw+2.242
chie+1.973
run+1.973
 brief+1.882
Neg logit contributions
 idea-0.966
 morning-0.789
 day-0.787
 people-0.769
 prevent-0.712
TokenEm
Token ID10161
Feature activation-2.88e+00

Pos logit contributions
ifying+2.425
igsaw+2.015
run+1.643
chie+1.624
 brief+1.495
Neg logit contributions
 idea-2.686
 people-2.454
 day-2.453
 morning-2.445
 doctors-2.377
Tokenma
Token ID2611
Feature activation-5.92e-01

Pos logit contributions
ifying+7.317
igsaw+6.211
run+5.601
chie+5.385
lic+5.266
Neg logit contributions
 idea-3.415
 day-3.143
 morning-3.059
 night-2.765
 people-2.693
Token was
Token ID373
Feature activation+6.59e-01

Pos logit contributions
ifying+1.241
igsaw+1.087
run+0.917
chie+0.894
 brief+0.845
Neg logit contributions
 idea-0.856
 morning-0.743
 day-0.740
 people-0.732
 prevent-0.710
Token so
Token ID523
Feature activation+2.55e-01

Pos logit contributions
 idea+1.111
 morning+0.981
 day+0.974
 people+0.962
 prevent+0.957
Neg logit contributions
ifying-1.217
igsaw-1.012
run-0.880
chie-0.827
jamin-0.795
Token happy
Token ID3772
Feature activation-4.74e-01

Pos logit contributions
 idea+0.246
 morning+0.201
 doctors+0.197
 day+0.196
 people+0.194
Neg logit contributions
ifying-0.651
igsaw-0.584
run-0.519
chie-0.504
jamin-0.481
Token she
Token ID673
Feature activation-6.71e-01

Pos logit contributions
ifying+1.172
igsaw+1.045
run+0.913
chie+0.894
 brief+0.854
Neg logit contributions
 idea-0.512
 morning-0.420
 day-0.417
 people-0.411
 prevent-0.395
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
ifying+0.221
igsaw+0.196
run+0.174
chie+0.167
jamin+0.159
Neg logit contributions
 idea-0.108
 morning-0.089
 day-0.088
 people-0.087
 doctors-0.085
TokenL
Token ID43
Feature activation-9.08e-01

Pos logit contributions
ifying+0.498
igsaw+0.440
run+0.383
chie+0.372
jamin+0.354
Neg logit contributions
 idea-0.272
 morning-0.229
 day-0.226
 people-0.222
 hospital-0.215
Tokenily
Token ID813
Feature activation-9.89e-01

Pos logit contributions
ifying+2.616
igsaw+2.323
run+2.078
chie+2.049
 brief+1.988
Neg logit contributions
 idea-0.685
 day-0.486
 morning-0.482
 people-0.467
 doctors-0.439
Token was
Token ID373
Feature activation+3.74e-02

Pos logit contributions
ifying+2.181
igsaw+1.930
run+1.607
chie+1.591
 brief+1.480
Neg logit contributions
 idea-1.372
 people-1.170
 day-1.164
 morning-1.162
 prevent-1.125
Token scared
Token ID12008
Feature activation-3.35e-01

Pos logit contributions
 idea+0.046
 morning+0.040
 day+0.039
 people+0.038
 prevent+0.037
Neg logit contributions
ifying-0.085
igsaw-0.075
run-0.066
chie-0.064
jamin-0.061
Token at
Token ID379
Feature activation-2.87e+00

Pos logit contributions
ifying+0.861
igsaw+0.772
run+0.681
chie+0.664
 brief+0.636
Neg logit contributions
 idea-0.330
 morning-0.264
 day-0.261
 people-0.256
 prevent-0.248
Token first
Token ID717
Feature activation+2.62e-01

Pos logit contributions
ifying+7.227
igsaw+6.117
run+5.384
lic+5.077
chie+4.989
Neg logit contributions
 idea-3.777
 day-3.645
 people-3.578
 morning-3.483
 if-3.373
Token,
Token ID11
Feature activation+1.64e-01

Pos logit contributions
 idea+0.247
 morning+0.202
 day+0.193
 prevent+0.192
 doctors+0.190
Neg logit contributions
ifying-0.666
igsaw-0.605
run-0.540
chie-0.513
 brief-0.499
Token but
Token ID475
Feature activation+4.40e-01

Pos logit contributions
 idea+0.228
 morning+0.197
 day+0.194
 prevent+0.189
 people+0.189
Neg logit contributions
ifying-0.345
igsaw-0.299
run-0.262
chie-0.252
jamin-0.241
Token then
Token ID788
Feature activation+5.48e-01

Pos logit contributions
 idea+0.511
 morning+0.404
 day+0.393
 people+0.393
 prevent+0.386
Neg logit contributions
ifying-1.036
igsaw-0.915
run-0.822
chie-0.778
jamin-0.766
Token she
Token ID673
Feature activation-1.18e+00

Pos logit contributions
 idea+0.615
 morning+0.487
 day+0.485
 prevent+0.468
cut+0.460
Neg logit contributions
ifying-1.301
igsaw-1.163
run-1.033
 brief-0.978
chie-0.973
Token away
Token ID1497
Feature activation-4.28e-01

Pos logit contributions
 idea+0.689
 people+0.590
 doctors+0.586
 morning+0.586
 day+0.584
Neg logit contributions
ifying-0.962
igsaw-0.819
run-0.711
chie-0.684
 brief-0.650
Token to
Token ID284
Feature activation+5.23e-01

Pos logit contributions
ifying+1.103
igsaw+0.990
run+0.871
chie+0.853
 brief+0.817
Neg logit contributions
 idea-0.413
 morning-0.330
 day-0.327
 people-0.322
 prevent-0.307
Token hear
Token ID3285
Feature activation-4.68e-02

Pos logit contributions
 idea+0.581
 morning+0.495
 day+0.486
cut+0.453
 doctors+0.453
Neg logit contributions
ifying-1.219
igsaw-1.106
run-0.976
chie-0.946
 brief-0.908
Token her
Token ID607
Feature activation-2.04e-01

Pos logit contributions
ifying+0.119
igsaw+0.107
run+0.094
chie+0.091
 brief+0.088
Neg logit contributions
 idea-0.047
 morning-0.037
 day-0.037
 people-0.035
 prevent-0.035
Token.
Token ID13
Feature activation-4.40e-01

Pos logit contributions
ifying+0.507
igsaw+0.453
run+0.399
chie+0.387
 brief+0.372
Neg logit contributions
 idea-0.214
 morning-0.177
 day-0.174
 people-0.172
 doctors-0.166
Token Ros
Token ID10018
Feature activation-2.85e+00

Pos logit contributions
ifying+1.018
igsaw+0.901
run+0.778
chie+0.760
 brief+0.723
Neg logit contributions
 idea-0.545
 morning-0.459
 day-0.457
 people-0.451
 prevent-0.436
Tokeny
Token ID88
Feature activation-2.42e-01

Pos logit contributions
ifying+7.646
igsaw+6.886
 brief+5.859
run+5.841
chie+5.681
Neg logit contributions
 idea-2.999
 day-2.421
 morning-2.235
 people-2.202
 night-2.062
Token watched
Token ID7342
Feature activation+6.15e-01

Pos logit contributions
ifying+0.515
igsaw+0.450
run+0.387
chie+0.376
 brief+0.357
Neg logit contributions
 idea-0.336
 morning-0.290
 day-0.288
 people-0.283
 prevent-0.278
Token sadly
Token ID21098
Feature activation+3.82e-02

Pos logit contributions
 idea+0.668
 morning+0.549
 day+0.545
 hospital+0.525
 paramedics+0.515
Neg logit contributions
ifying-1.493
igsaw-1.339
run-1.178
chie-1.139
 brief-1.076
Token as
Token ID355
Feature activation+1.68e-02

Pos logit contributions
 idea+0.038
 morning+0.030
 day+0.030
 people+0.029
 doctors+0.029
Neg logit contributions
ifying-0.097
igsaw-0.087
run-0.077
chie-0.074
 brief-0.072
Token her
Token ID607
Feature activation-5.21e-01

Pos logit contributions
 idea+0.019
 morning+0.015
 day+0.015
 people+0.015
 prevent+0.015
Neg logit contributions
ifying-0.040
igsaw-0.036
run-0.031
chie-0.031
 brief-0.029
Tokenie
Token ID494
Feature activation-3.19e-01

Pos logit contributions
ifying+0.122
igsaw+0.107
run+0.092
chie+0.091
jamin+0.085
Neg logit contributions
 idea-0.080
 morning-0.070
 day-0.068
 people-0.068
 prevent-0.066
Token.
Token ID13
Feature activation-1.64e-01

Pos logit contributions
ifying+0.816
igsaw+0.731
run+0.645
chie+0.631
 brief+0.604
Neg logit contributions
 idea-0.311
 morning-0.250
 day-0.245
 people-0.242
 prevent-0.235
Token They
Token ID1119
Feature activation+8.45e-02

Pos logit contributions
ifying+0.373
igsaw+0.331
run+0.286
chie+0.275
jamin+0.262
Neg logit contributions
 idea-0.209
 morning-0.177
 day-0.175
 people-0.174
 doctors-0.170
Token both
Token ID1111
Feature activation-1.31e-01

Pos logit contributions
 idea+0.111
 morning+0.095
 day+0.092
 people+0.092
 hospital+0.091
Neg logit contributions
ifying-0.187
igsaw-0.161
run-0.140
chie-0.138
jamin-0.131
Token had
Token ID550
Feature activation-1.01e+00

Pos logit contributions
ifying+0.310
igsaw+0.272
run+0.239
chie+0.234
jamin+0.223
Neg logit contributions
 idea-0.151
 morning-0.125
 day-0.123
 people-0.122
 prevent-0.120
Token birth
Token ID4082
Feature activation-2.85e+00

Pos logit contributions
ifying+2.387
igsaw+2.160
run+1.856
chie+1.813
 brief+1.707
Neg logit contributions
 idea-1.207
 day-1.040
 people-1.039
 morning-1.036
 doctors-0.990
Tokendays
Token ID12545
Feature activation-7.76e-01

Pos logit contributions
ifying+4.718
igsaw+4.361
chie+3.165
jamin+3.162
 brief+2.970
Neg logit contributions
 idea-5.363
 day-5.070
 if-5.055
 whenever-4.912
 morning-4.862
Token coming
Token ID2406
Feature activation-5.17e-01

Pos logit contributions
ifying+1.949
igsaw+1.742
run+1.517
chie+1.501
jamin+1.428
Neg logit contributions
 idea-0.817
 day-0.676
 morning-0.672
 people-0.653
 prevent-0.618
Token up
Token ID510
Feature activation-6.69e-01

Pos logit contributions
ifying+1.165
igsaw+1.029
run+0.884
chie+0.862
 brief+0.818
Neg logit contributions
 idea-0.672
 morning-0.573
 day-0.571
 people-0.564
 prevent-0.546
Token,
Token ID11
Feature activation+9.59e-02

Pos logit contributions
ifying+1.598
igsaw+1.416
run+1.234
chie+1.210
jamin+1.148
Neg logit contributions
 idea-0.786
 day-0.661
 morning-0.660
 people-0.646
 prevent-0.618
Token and
Token ID290
Feature activation-2.99e-01

Pos logit contributions
 idea+0.122
 morning+0.102
 day+0.100
 people+0.099
 prevent+0.098
Neg logit contributions
ifying-0.216
igsaw-0.189
run-0.166
chie-0.158
 brief-0.153
Token She
Token ID1375
Feature activation+3.23e-01

Pos logit contributions
 idea+1.094
 prevent+0.946
 doctors+0.919
 people+0.917
cut+0.917
Neg logit contributions
ifying-1.657
igsaw-1.504
run-1.285
jamin-1.198
chie-1.171
Token blew
Token ID17948
Feature activation+1.29e+00

Pos logit contributions
 idea+0.456
 morning+0.401
 day+0.392
 people+0.388
crow+0.379
Neg logit contributions
ifying-0.664
igsaw-0.583
run-0.514
chie-0.487
jamin-0.472
Token some
Token ID617
Feature activation+7.12e-01

Pos logit contributions
 idea+1.508
 paramedics+1.318
 ostr+1.270
 hospital+1.255
 doctors+1.248
Neg logit contributions
ifying-2.979
igsaw-2.731
run-2.341
chie-2.210
 brief-2.142
Token bubbles
Token ID25037
Feature activation+8.62e-01

Pos logit contributions
 idea+0.929
 paramedics+0.795
 morning+0.789
 day+0.780
 hospital+0.779
Neg logit contributions
ifying-1.573
igsaw-1.375
run-1.205
chie-1.121
 brief-1.103
Token and
Token ID290
Feature activation+7.96e-01

Pos logit contributions
 idea+1.093
 doctors+0.915
 morning+0.914
 paramedics+0.906
 people+0.902
Neg logit contributions
ifying-1.927
igsaw-1.703
run-1.491
chie-1.415
 brief-1.362
Token watched
Token ID7342
Feature activation+1.78e+00

Pos logit contributions
 idea+1.062
 morning+0.869
crow+0.855
 day+0.853
 hospital+0.845
Neg logit contributions
ifying-1.735
igsaw-1.517
run-1.343
chie-1.251
 brief-1.242
Token them
Token ID606
Feature activation+9.12e-01

Pos logit contributions
 idea+2.064
crow+1.792
bye+1.766
 paramedics+1.712
 day+1.703
Neg logit contributions
ifying-4.244
igsaw-3.722
run-3.373
 hips-3.073
chie-3.071
Token float
Token ID12178
Feature activation-3.13e-01

Pos logit contributions
 idea+1.088
crow+0.961
 paramedics+0.955
 morning+0.949
 doctors+0.934
Neg logit contributions
ifying-2.074
igsaw-1.828
run-1.605
chie-1.534
 brief-1.479
Token in
Token ID287
Feature activation-3.23e-01

Pos logit contributions
ifying+0.690
igsaw+0.605
run+0.519
chie+0.507
 brief+0.479
Neg logit contributions
 idea-0.424
 morning-0.363
 day-0.360
 people-0.356
 doctors-0.348
Token the
Token ID262
Feature activation-2.32e-01

Pos logit contributions
ifying+0.835
igsaw+0.750
run+0.658
chie+0.644
 brief+0.618
Neg logit contributions
 idea-0.309
 morning-0.246
 day-0.243
 people-0.240
 doctors-0.231
Token air
Token ID1633
Feature activation+6.46e-01

Pos logit contributions
ifying+0.613
igsaw+0.551
run+0.481
chie+0.473
 brief+0.457
Neg logit contributions
 idea-0.213
 day-0.163
 morning-0.163
 people-0.162
 prevent-0.156
Token was
Token ID373
Feature activation-7.81e-01

Pos logit contributions
 idea+0.250
 morning+0.214
 people+0.210
 day+0.207
 prevent+0.206
Neg logit contributions
ifying-0.381
igsaw-0.330
run-0.287
chie-0.279
jamin-0.267
Token very
Token ID845
Feature activation-6.93e-01

Pos logit contributions
ifying+1.796
igsaw+1.595
run+1.361
chie+1.322
 brief+1.261
Neg logit contributions
 idea-0.996
 day-0.853
 morning-0.853
 people-0.841
 doctors-0.823
Token tired
Token ID10032
Feature activation+1.39e-01

Pos logit contributions
ifying+1.552
igsaw+1.366
run+1.166
chie+1.134
jamin+1.081
Neg logit contributions
 idea-0.927
 day-0.793
 morning-0.793
 people-0.783
 prevent-0.755
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
 idea+0.149
 morning+0.120
 day+0.116
 prevent+0.116
 people+0.116
Neg logit contributions
ifying-0.337
igsaw-0.300
run-0.263
chie-0.254
 brief-0.245
Token Every
Token ID3887
Feature activation+2.57e-01

Pos logit contributions
ifying+0.468
igsaw+0.411
run+0.352
chie+0.340
jamin+0.324
Neg logit contributions
 idea-0.302
 morning-0.258
 day-0.256
 people-0.253
 prevent-0.251
Token day
Token ID1110
Feature activation+1.74e+00

Pos logit contributions
 idea+0.085
 prevent+0.036
 people+0.036
 paramedics+0.032
 doctors+0.031
Neg logit contributions
ifying-0.809
igsaw-0.747
run-0.667
chie-0.653
 brief-0.643
Token he
Token ID339
Feature activation+2.44e-01

Pos logit contributions
 idea+2.291
 prevent+1.787
 paramedics+1.776
 decide+1.739
 hospital+1.722
Neg logit contributions
ifying-3.980
igsaw-3.310
run-2.944
 brief-2.815
lic-2.695
Token worked
Token ID3111
Feature activation-9.13e-01

Pos logit contributions
 idea+0.297
 morning+0.246
 day+0.243
 people+0.243
 prevent+0.236
Neg logit contributions
ifying-0.556
igsaw-0.480
run-0.423
chie-0.409
 brief-0.403
Token in
Token ID287
Feature activation-5.34e-01

Pos logit contributions
ifying+2.213
igsaw+1.956
run+1.688
chie+1.674
jamin+1.578
Neg logit contributions
 idea-1.084
 day-0.907
 morning-0.904
 people-0.880
 prevent-0.843
Token the
Token ID262
Feature activation+2.94e-01

Pos logit contributions
ifying+1.310
igsaw+1.166
run+1.017
chie+0.997
 brief+0.949
Neg logit contributions
 idea-0.592
 morning-0.488
 day-0.486
 people-0.478
 prevent-0.459
Token hot
Token ID3024
Feature activation-1.93e-01

Pos logit contributions
 idea+0.320
 prevent+0.253
 paramedics+0.249
 people+0.249
 day+0.248
Neg logit contributions
ifying-0.721
igsaw-0.638
run-0.536
chie-0.531
 brief-0.527
Token big
Token ID1263
Feature activation+6.37e-01

Pos logit contributions
 idea+1.786
crow+1.655
 hospital+1.579
 prevent+1.577
 decide+1.576
Neg logit contributions
ifying-2.369
igsaw-1.892
run-1.715
chie-1.588
 brief-1.585
Token and
Token ID290
Feature activation-6.07e-01

Pos logit contributions
 idea+0.712
 prevent+0.606
 morning+0.582
 paramedics+0.576
 day+0.566
Neg logit contributions
ifying-1.496
igsaw-1.324
run-1.145
 brief-1.103
chie-1.085
Token bright
Token ID6016
Feature activation+2.62e-01

Pos logit contributions
ifying+1.399
igsaw+1.245
run+1.069
chie+1.046
jamin+0.991
Neg logit contributions
 idea-0.756
 morning-0.642
 day-0.641
 people-0.631
 prevent-0.606
Token.
Token ID13
Feature activation+6.14e-01

Pos logit contributions
 idea+0.271
 prevent+0.222
 morning+0.214
 hospital+0.212
 paramedics+0.211
Neg logit contributions
ifying-0.658
igsaw-0.579
run-0.502
chie-0.492
 brief-0.475
Token Every
Token ID3887
Feature activation+8.16e-01

Pos logit contributions
 idea+0.788
 prevent+0.667
 hospital+0.640
 day+0.640
 doctors+0.639
Neg logit contributions
ifying-1.377
igsaw-1.221
run-1.048
chie-0.998
jamin-0.966
Token day
Token ID1110
Feature activation+1.90e+00

Pos logit contributions
 idea+0.273
 prevent+0.160
 people+0.159
 doctors+0.154
 paramedics+0.140
Neg logit contributions
ifying-2.499
igsaw-2.295
run-2.083
chie-2.017
 brief-2.003
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
 idea+2.391
cut+1.987
 paramedics+1.915
 hospital+1.903
 prevent+1.827
Neg logit contributions
ifying-4.567
igsaw-3.750
run-3.236
 brief-3.179
chie-3.156
Token truck
Token ID7779
Feature activation+1.89e+00

Pos logit contributions
 idea+1.450
 prevent+1.226
crow+1.209
 bal+1.189
 whenever+1.182
Neg logit contributions
ifying-2.644
igsaw-2.235
run-1.934
 brief-1.915
chie-1.897
Token would
Token ID561
Feature activation+1.09e+00

Pos logit contributions
 idea+2.727
 prevent+2.241
 paramedics+2.179
 choice+2.169
 morning+2.101
Neg logit contributions
ifying-3.902
igsaw-2.992
run-2.908
 brief-2.787
chie-2.678
Token drive
Token ID3708
Feature activation+6.14e-01

Pos logit contributions
 idea+1.272
 morning+1.098
crow+1.070
 doctors+1.066
 day+1.055
Neg logit contributions
ifying-2.505
igsaw-2.099
run-1.908
chie-1.874
 brief-1.827
Token down
Token ID866
Feature activation+7.14e-01

Pos logit contributions
 idea+0.762
 paramedics+0.634
 hospital+0.621
 doctors+0.617
 prevent+0.611
Neg logit contributions
ifying-1.397
igsaw-1.207
run-1.070
chie-1.039
 brief-1.004
Token.
Token ID13
Feature activation-3.63e-02

Pos logit contributions
 idea+0.474
 morning+0.389
 day+0.387
 hospital+0.373
 people+0.371
Neg logit contributions
ifying-1.100
igsaw-0.988
run-0.883
chie-0.846
 hips-0.820
Token "
Token ID366
Feature activation-7.19e-01

Pos logit contributions
ifying+0.085
igsaw+0.076
run+0.067
chie+0.064
jamin+0.061
Neg logit contributions
 idea-0.042
 morning-0.035
 day-0.035
 people-0.034
 prevent-0.034
TokenYou
Token ID1639
Feature activation-6.02e-03

Pos logit contributions
ifying+1.638
igsaw+1.451
run+1.251
chie+1.214
 brief+1.160
Neg logit contributions
 idea-0.907
 day-0.779
 morning-0.779
 people-0.772
 doctors-0.739
Token are
Token ID389
Feature activation+4.96e-02

Pos logit contributions
ifying+0.013
igsaw+0.011
run+0.010
chie+0.010
 brief+0.009
Neg logit contributions
 idea-0.008
 morning-0.007
 day-0.007
 people-0.007
 doctors-0.007
Token a
Token ID257
Feature activation+1.16e+00

Pos logit contributions
 idea+0.065
 morning+0.056
 day+0.055
 people+0.053
 prevent+0.053
Neg logit contributions
ifying-0.110
igsaw-0.096
run-0.082
chie-0.082
 brief-0.077
Token smart
Token ID4451
Feature activation+2.60e+00

Pos logit contributions
 idea+1.440
 morning+1.232
 paramedics+1.221
crow+1.182
 doctors+1.178
Neg logit contributions
ifying-2.585
igsaw-2.242
chie-1.988
 brief-1.901
run-1.834
Token girl
Token ID2576
Feature activation+3.38e-01

Pos logit contributions
 idea+2.614
 prevent+2.502
 decide+2.472
 morning+2.459
 whenever+2.413
Neg logit contributions
ifying-5.610
chie-5.043
igsaw-4.890
 brief-4.431
run-4.417
Token."
Token ID526
Feature activation-2.68e-01

Pos logit contributions
 idea+0.367
 morning+0.299
 day+0.289
 prevent+0.288
 hospital+0.284
Neg logit contributions
ifying-0.823
igsaw-0.722
run-0.647
chie-0.629
 brief-0.600
Token\n
Token ID198
Feature activation+2.66e-01

Pos logit contributions
ifying+0.637
igsaw+0.567
run+0.495
chie+0.479
 brief+0.457
Neg logit contributions
 idea-0.313
 morning-0.260
 day-0.258
 people-0.255
 prevent-0.249
Token\n
Token ID198
Feature activation-9.48e-02

Pos logit contributions
 idea+0.319
 morning+0.262
 doctors+0.259
 people+0.258
 prevent+0.257
Neg logit contributions
ifying-0.610
igsaw-0.545
run-0.479
chie-0.453
lic-0.443
TokenBut
Token ID1537
Feature activation+1.24e-01

Pos logit contributions
ifying+0.178
igsaw+0.153
run+0.127
chie+0.123
jamin+0.115
Neg logit contributions
 idea-0.158
 morning-0.139
 day-0.137
 people-0.136
 doctors-0.134
Token girl
Token ID2576
Feature activation+6.40e-01

Pos logit contributions
 idea+1.027
 prevent+0.899
 morning+0.887
 day+0.864
cut+0.854
Neg logit contributions
ifying-1.700
igsaw-1.529
run-1.273
chie-1.273
jamin-1.226
Token opened
Token ID4721
Feature activation+6.29e-01

Pos logit contributions
 idea+0.929
 morning+0.800
 day+0.791
 people+0.776
 doctors+0.768
Neg logit contributions
ifying-1.317
igsaw-1.121
run-0.989
chie-0.938
jamin-0.918
Token the
Token ID262
Feature activation-2.76e-01

Pos logit contributions
 idea+0.678
 doctors+0.548
 day+0.545
 paramedics+0.542
 morning+0.536
Neg logit contributions
ifying-1.544
igsaw-1.388
run-1.196
chie-1.149
 brief-1.111
Token wrap
Token ID14441
Feature activation-8.00e-02

Pos logit contributions
ifying+0.595
igsaw+0.520
run+0.442
chie+0.432
 brief+0.410
Neg logit contributions
 idea-0.384
 morning-0.330
 day-0.329
 people-0.324
 prevent-0.317
Token and
Token ID290
Feature activation-1.88e-01

Pos logit contributions
ifying+0.196
igsaw+0.174
run+0.152
chie+0.148
 brief+0.142
Neg logit contributions
 idea-0.087
 morning-0.071
 day-0.070
 people-0.069
 doctors-0.068
Token showed
Token ID3751
Feature activation+2.06e+00

Pos logit contributions
ifying+0.421
igsaw+0.369
run+0.318
chie+0.309
 brief+0.295
Neg logit contributions
 idea-0.248
 morning-0.210
 day-0.208
 people-0.204
 doctors-0.200
Token it
Token ID340
Feature activation-1.16e-01

Pos logit contributions
 idea+2.378
cut+2.025
 paramedics+2.011
 day+1.993
 morning+1.993
Neg logit contributions
ifying-4.596
igsaw-4.335
run-3.889
chie-3.594
years-3.353
Token to
Token ID284
Feature activation+5.73e-01

Pos logit contributions
ifying+0.249
igsaw+0.220
run+0.189
chie+0.182
 brief+0.173
Neg logit contributions
 idea-0.158
 morning-0.136
 day-0.135
 people-0.133
 doctors-0.131
Token her
Token ID607
Feature activation+3.31e-01

Pos logit contributions
 idea+0.568
 morning+0.464
 day+0.456
 paramedics+0.436
 doctors+0.431
Neg logit contributions
ifying-1.422
igsaw-1.293
run-1.129
chie-1.103
 brief-1.051
Token mom
Token ID1995
Feature activation-1.69e-01

Pos logit contributions
 idea+0.333
 morning+0.278
 day+0.266
 prevent+0.262
 people+0.259
Neg logit contributions
ifying-0.824
igsaw-0.742
run-0.652
chie-0.626
 brief-0.613
Token and
Token ID290
Feature activation+8.15e-01

Pos logit contributions
ifying+0.429
igsaw+0.384
run+0.339
chie+0.328
jamin+0.316
Neg logit contributions
 idea-0.170
 morning-0.137
 day-0.134
 people-0.133
 prevent-0.129
Token "
Token ID366
Feature activation+1.00e+00

Pos logit contributions
ifying+0.114
igsaw+0.101
run+0.089
chie+0.086
 brief+0.083
Neg logit contributions
 idea-0.056
 morning-0.046
 day-0.045
 people-0.044
 prevent-0.044
TokenI
Token ID40
Feature activation+1.57e+00

Pos logit contributions
 idea+1.651
 morning+1.331
 hospital+1.317
 day+1.315
 people+1.285
Neg logit contributions
ifying-2.032
igsaw-1.648
chie-1.459
run-1.429
jamin-1.359
Token saw
Token ID2497
Feature activation+5.81e-01

Pos logit contributions
 idea+2.047
 paramedics+1.911
 people+1.907
 doctors+1.897
 morning+1.892
Neg logit contributions
ifying-3.285
igsaw-2.777
run-2.513
chie-2.324
 brief-2.318
Token it
Token ID340
Feature activation-7.09e-01

Pos logit contributions
 idea+0.739
 morning+0.588
 day+0.585
 prevent+0.569
 paramedics+0.564
Neg logit contributions
ifying-1.341
igsaw-1.157
run-1.025
chie-0.977
 brief-0.940
Token first
Token ID717
Feature activation-1.16e-01

Pos logit contributions
ifying+1.608
igsaw+1.424
run+1.223
chie+1.195
 brief+1.137
Neg logit contributions
 idea-0.919
 day-0.781
 morning-0.781
 people-0.767
 prevent-0.743
Token,
Token ID11
Feature activation+4.92e-01

Pos logit contributions
ifying+0.278
igsaw+0.248
run+0.217
chie+0.210
 brief+0.201
Neg logit contributions
 idea-0.130
 morning-0.108
 day-0.106
 people-0.106
 doctors-0.103
Token so
Token ID523
Feature activation+5.03e-01

Pos logit contributions
 idea+0.646
 morning+0.564
 day+0.551
 prevent+0.521
 people+0.520
Neg logit contributions
ifying-1.077
igsaw-0.931
run-0.827
chie-0.784
 brief-0.756
Token I
Token ID314
Feature activation+6.43e-01

Pos logit contributions
 idea+0.793
 morning+0.683
 hospital+0.669
 day+0.667
 paramedics+0.667
Neg logit contributions
ifying-0.991
igsaw-0.825
run-0.724
chie-0.677
 brief-0.640
Token should
Token ID815
Feature activation+5.40e-01

Pos logit contributions
 idea+0.818
 morning+0.734
 people+0.731
 doctors+0.720
 day+0.716
Neg logit contributions
ifying-1.415
igsaw-1.216
run-1.074
chie-1.033
 brief-0.978
Token eat
Token ID4483
Feature activation+3.01e-01

Pos logit contributions
 idea+0.662
 morning+0.579
 day+0.567
 people+0.538
 doctors+0.535
Neg logit contributions
ifying-1.212
igsaw-1.057
run-0.915
chie-0.897
 brief-0.854
Token it
Token ID340
Feature activation-1.04e-01

Pos logit contributions
 idea+0.262
 morning+0.194
 day+0.188
 paramedics+0.186
 doctors+0.181
Neg logit contributions
ifying-0.806
igsaw-0.718
run-0.644
chie-0.620
 brief-0.599
Token went
Token ID1816
Feature activation-8.87e-01

Pos logit contributions
ifying+0.133
igsaw+0.118
run+0.103
chie+0.100
jamin+0.096
Neg logit contributions
 idea-0.064
 morning-0.053
 day-0.052
 people-0.052
 doctors-0.050
Token inside
Token ID2641
Feature activation-1.14e-01

Pos logit contributions
ifying+2.245
igsaw+2.004
run+1.745
chie+1.732
jamin+1.641
Neg logit contributions
 idea-0.915
 morning-0.770
 day-0.761
 people-0.742
 hospital-0.718
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
ifying+0.277
igsaw+0.248
run+0.219
chie+0.210
 brief+0.202
Neg logit contributions
 idea-0.124
 morning-0.101
 day-0.100
 people-0.099
 doctors-0.097
Token Maybe
Token ID6674
Feature activation+3.24e-01

Pos logit contributions
ifying+2.491
igsaw+2.175
chie+1.913
run+1.894
 brief+1.824
Neg logit contributions
 idea-1.192
 morning-1.021
 day-1.005
 people-0.970
 hospital-0.930
Token he
Token ID339
Feature activation+8.76e-02

Pos logit contributions
 idea+0.394
 morning+0.316
 day+0.308
 paramedics+0.305
 prevent+0.305
Neg logit contributions
ifying-0.747
igsaw-0.668
run-0.583
chie-0.550
 brief-0.537
Token'll
Token ID1183
Feature activation+5.59e-01

Pos logit contributions
 idea+0.120
 morning+0.103
 day+0.102
 people+0.101
 doctors+0.099
Neg logit contributions
ifying-0.189
igsaw-0.163
run-0.143
chie-0.138
jamin-0.131
Token try
Token ID1949
Feature activation-8.28e-01

Pos logit contributions
 idea+0.652
 morning+0.565
 day+0.550
 doctors+0.537
 people+0.535
Neg logit contributions
ifying-1.287
igsaw-1.136
run-0.996
chie-0.986
 brief-0.930
Token again
Token ID757
Feature activation-7.18e-01

Pos logit contributions
ifying+2.050
igsaw+1.824
run+1.581
chie+1.568
 brief+1.482
Neg logit contributions
 idea-0.906
 morning-0.755
 day-0.753
 people-0.748
 prevent-0.716
Token later
Token ID1568
Feature activation-5.75e-02

Pos logit contributions
ifying+1.876
igsaw+1.685
run+1.477
chie+1.455
 brief+1.391
Neg logit contributions
 idea-0.683
 morning-0.546
 day-0.545
 people-0.536
 prevent-0.509
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
ifying+0.142
igsaw+0.127
run+0.111
chie+0.108
 brief+0.104
Neg logit contributions
 idea-0.060
 morning-0.049
 day-0.048
 people-0.047
 doctors-0.046
Token<|endoftext|>
Token ID50256
Feature activation-7.30e-01

Pos logit contributions
ifying+2.683
igsaw+2.370
chie+2.097
run+2.074
 brief+2.009
Neg logit contributions
 idea-1.087
 morning-0.918
 day-0.903
 people-0.861
 hospital-0.821
Tokenuzz
Token ID4715
Feature activation-1.48e+00

Pos logit contributions
ifying+2.431
igsaw+2.077
run+1.758
chie+1.712
 brief+1.670
Neg logit contributions
 idea-1.677
 day-1.470
 morning-1.459
 people-1.422
 hospital-1.375
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
ifying+3.662
igsaw+3.323
run+2.853
chie+2.825
 brief+2.701
Neg logit contributions
 idea-1.618
 day-1.384
 morning-1.363
 people-1.346
 doctors-1.218
Token After
Token ID2293
Feature activation-1.24e+00

Pos logit contributions
ifying+0.261
igsaw+0.230
run+0.201
chie+0.193
jamin+0.184
Neg logit contributions
 idea-0.142
 morning-0.118
 day-0.118
 people-0.117
 prevent-0.115
Token some
Token ID617
Feature activation-3.35e-01

Pos logit contributions
ifying+3.039
igsaw+2.702
run+2.342
chie+2.319
jamin+2.217
Neg logit contributions
 idea-1.401
 day-1.219
 morning-1.213
 people-1.210
 doctors-1.147
Token time
Token ID640
Feature activation+9.34e-01

Pos logit contributions
ifying+0.964
igsaw+0.877
run+0.784
chie+0.769
 brief+0.742
Neg logit contributions
 idea-0.224
 morning-0.159
 day-0.156
 people-0.153
 prevent-0.143
Token,
Token ID11
Feature activation+2.12e-01

Pos logit contributions
 idea+1.047
 morning+0.790
 prevent+0.776
 doctors+0.774
 hospital+0.773
Neg logit contributions
ifying-2.300
igsaw-2.004
run-1.846
chie-1.739
 brief-1.654
Token they
Token ID484
Feature activation-5.15e-01

Pos logit contributions
 idea+0.267
 morning+0.215
 day+0.213
 doctors+0.209
 prevent+0.208
Neg logit contributions
ifying-0.490
igsaw-0.425
run-0.373
chie-0.361
 brief-0.342
Token found
Token ID1043
Feature activation+9.67e-01

Pos logit contributions
ifying+1.176
igsaw+1.041
run+0.897
chie+0.875
 brief+0.831
Neg logit contributions
 idea-0.652
 morning-0.552
 day-0.550
 people-0.543
 prevent-0.526
Token him
Token ID683
Feature activation-3.77e-01

Pos logit contributions
 idea+1.353
 morning+1.128
 day+1.120
 prevent+1.077
 doctors+1.076
Neg logit contributions
ifying-2.108
igsaw-1.772
run-1.606
chie-1.513
 brief-1.417
Token safe
Token ID3338
Feature activation-4.24e-01

Pos logit contributions
ifying+0.909
igsaw+0.809
run+0.705
chie+0.688
 brief+0.657
Neg logit contributions
 idea-0.425
 morning-0.352
 day-0.349
 people-0.345
 prevent-0.332
Token and
Token ID290
Feature activation-1.12e-01

Pos logit contributions
ifying+1.056
igsaw+0.943
run+0.825
chie+0.808
 brief+0.772
Neg logit contributions
 idea-0.446
 morning-0.364
 day-0.361
 people-0.356
 prevent-0.342
Token little
Token ID1310
Feature activation-2.32e-01

Pos logit contributions
 idea+0.256
 morning+0.220
 day+0.214
 prevent+0.214
 people+0.213
Neg logit contributions
ifying-0.383
igsaw-0.338
run-0.281
chie-0.277
 brief-0.270
Token girl
Token ID2576
Feature activation+9.31e-01

Pos logit contributions
ifying+0.555
igsaw+0.496
run+0.428
chie+0.421
 brief+0.403
Neg logit contributions
 idea-0.262
 morning-0.215
 day-0.213
 people-0.211
 prevent-0.208
Token named
Token ID3706
Feature activation-7.55e-01

Pos logit contributions
 idea+1.106
 prevent+0.833
 paramedics+0.832
 morning+0.827
 people+0.821
Neg logit contributions
ifying-2.211
igsaw-1.926
run-1.745
chie-1.647
 brief-1.613
Token Lily
Token ID20037
Feature activation+1.07e+00

Pos logit contributions
ifying+1.816
igsaw+1.610
run+1.399
chie+1.374
 brief+1.315
Neg logit contributions
 idea-0.882
 morning-0.749
 day-0.748
 people-0.727
 prevent-0.694
Token.
Token ID13
Feature activation-3.51e-01

Pos logit contributions
 idea+1.361
 paramedics+1.144
 doctors+1.073
 prevent+1.072
 morning+1.072
Neg logit contributions
ifying-2.416
igsaw-2.085
run-1.891
chie-1.759
jamin-1.747
Token She
Token ID1375
Feature activation+4.94e-01

Pos logit contributions
ifying+0.813
igsaw+0.721
run+0.622
chie+0.605
jamin+0.576
Neg logit contributions
 idea-0.433
 morning-0.364
 day-0.361
 people-0.359
 prevent-0.349
Token loved
Token ID6151
Feature activation-5.57e-02

Pos logit contributions
 idea+0.588
 paramedics+0.484
 prevent+0.472
 people+0.466
 morning+0.464
Neg logit contributions
ifying-1.134
igsaw-0.994
run-0.864
chie-0.843
jamin-0.839
Token to
Token ID284
Feature activation-3.11e-01

Pos logit contributions
ifying+0.118
igsaw+0.104
run+0.088
chie+0.085
 brief+0.082
Neg logit contributions
 idea-0.079
 morning-0.066
 day-0.065
 people-0.065
 prevent-0.064
Token draw
Token ID3197
Feature activation+6.52e-01

Pos logit contributions
ifying+0.751
igsaw+0.671
run+0.583
chie+0.570
 brief+0.546
Neg logit contributions
 idea-0.346
 morning-0.286
 day-0.283
 people-0.280
 doctors-0.270
Token and
Token ID290
Feature activation-1.18e-01

Pos logit contributions
 idea+0.746
 morning+0.638
 paramedics+0.629
 prevent+0.622
 day+0.619
Neg logit contributions
ifying-1.497
igsaw-1.351
run-1.156
 brief-1.118
lic-1.112
Token color
Token ID3124
Feature activation+4.26e-01

Pos logit contributions
ifying+0.283
igsaw+0.252
run+0.218
chie+0.214
 brief+0.206
Neg logit contributions
 idea-0.131
 morning-0.108
 day-0.106
 people-0.104
 doctors-0.102
Token Ben
Token ID3932
Feature activation-1.94e-01

Pos logit contributions
ifying+0.635
igsaw+0.564
run+0.493
chie+0.479
 brief+0.458
Neg logit contributions
 idea-0.301
 morning-0.249
 day-0.247
 people-0.240
 prevent-0.235
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
ifying+0.481
igsaw+0.427
run+0.378
chie+0.366
 brief+0.352
Neg logit contributions
 idea-0.205
 morning-0.167
 day-0.164
 people-0.161
 doctors-0.156
Token Lily
Token ID20037
Feature activation-1.19e-01

Pos logit contributions
ifying+2.632
igsaw+2.355
chie+1.993
run+1.991
 brief+1.881
Neg logit contributions
 idea-1.212
 day-1.010
 people-1.007
 morning-1.006
 doctors-0.959
Token and
Token ID290
Feature activation-5.05e-01

Pos logit contributions
ifying+0.272
igsaw+0.238
run+0.209
chie+0.201
 brief+0.193
Neg logit contributions
 idea-0.150
 morning-0.127
 day-0.125
 people-0.123
 doctors-0.121
Token wanted
Token ID2227
Feature activation+3.69e-01

Pos logit contributions
ifying+1.213
igsaw+1.079
run+0.937
chie+0.917
 brief+0.873
Neg logit contributions
 idea-0.582
 morning-0.484
 day-0.481
 people-0.475
 prevent-0.457
Token to
Token ID284
Feature activation+7.06e-01

Pos logit contributions
 idea+0.560
 morning+0.469
 day+0.463
 prevent+0.447
 people+0.447
Neg logit contributions
ifying-0.745
igsaw-0.637
run-0.560
chie-0.532
 brief-0.514
Token eat
Token ID4483
Feature activation-2.74e-02

Pos logit contributions
 idea+0.897
 morning+0.779
 day+0.758
 doctors+0.743
 people+0.725
Neg logit contributions
ifying-1.543
igsaw-1.340
run-1.193
chie-1.182
 brief-1.103
Token them
Token ID606
Feature activation+2.39e-02

Pos logit contributions
ifying+0.067
igsaw+0.060
run+0.053
chie+0.051
 brief+0.049
Neg logit contributions
 idea-0.029
 morning-0.023
 day-0.023
 people-0.022
 doctors-0.022
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
 idea+0.024
 morning+0.020
 day+0.019
 people+0.019
 doctors+0.019
Neg logit contributions
ifying-0.059
igsaw-0.053
run-0.047
chie-0.045
 brief-0.043
Token He
Token ID679
Feature activation-4.01e-01

Pos logit contributions
 idea+0.181
 day+0.157
 morning+0.156
 prevent+0.153
 people+0.152
Neg logit contributions
ifying-0.266
igsaw-0.234
run-0.200
chie-0.190
jamin-0.183
Token hid
Token ID24519
Feature activation-7.09e-01

Pos logit contributions
ifying+0.919
igsaw+0.811
run+0.702
chie+0.685
 brief+0.651
Neg logit contributions
 idea-0.503
 morning-0.426
 day-0.423
 people-0.416
 prevent-0.404
Token laughing
Token ID14376
Feature activation+1.27e-01

Pos logit contributions
ifying+0.619
igsaw+0.537
run+0.451
chie+0.437
 brief+0.412
Neg logit contributions
 idea-0.490
 morning-0.427
 day-0.423
 people-0.421
 doctors-0.412
Token.
Token ID13
Feature activation+6.70e-01

Pos logit contributions
 idea+0.131
 morning+0.107
 day+0.105
 prevent+0.105
 paramedics+0.102
Neg logit contributions
ifying-0.314
igsaw-0.279
run-0.244
chie-0.237
 brief-0.231
Token\n
Token ID198
Feature activation+4.94e-01

Pos logit contributions
 idea+0.878
 prevent+0.737
 morning+0.719
 doctors+0.706
 day+0.703
Neg logit contributions
ifying-1.473
igsaw-1.303
run-1.143
jamin-1.057
chie-1.035
Token\n
Token ID198
Feature activation+7.15e-01

Pos logit contributions
 idea+0.623
 doctors+0.517
 morning+0.515
 prevent+0.514
 people+0.509
Neg logit contributions
ifying-1.093
igsaw-0.973
run-0.857
chie-0.794
lic-0.788
TokenAfter
Token ID3260
Feature activation-1.24e+00

Pos logit contributions
 idea+1.099
 morning+0.931
 day+0.904
 prevent+0.902
 people+0.891
Neg logit contributions
ifying-1.440
igsaw-1.225
run-1.014
chie-0.997
jamin-0.978
Token a
Token ID257
Feature activation-7.04e-01

Pos logit contributions
ifying+3.046
igsaw+2.721
chie+2.384
run+2.366
jamin+2.219
Neg logit contributions
 idea-1.362
 morning-1.227
 day-1.219
 people-1.198
 doctors-1.154
Token while
Token ID981
Feature activation+1.73e-01

Pos logit contributions
ifying+1.719
igsaw+1.525
run+1.337
chie+1.304
jamin+1.242
Neg logit contributions
 idea-0.789
 day-0.664
 morning-0.662
 people-0.639
 prevent-0.614
Token,
Token ID11
Feature activation+5.00e-01

Pos logit contributions
 idea+0.167
 prevent+0.125
 people+0.123
 morning+0.123
 day+0.121
Neg logit contributions
ifying-0.448
igsaw-0.400
run-0.355
chie-0.341
 brief-0.333
Token it
Token ID340
Feature activation+4.18e-01

Pos logit contributions
 idea+0.672
 morning+0.547
 day+0.545
 prevent+0.539
 people+0.527
Neg logit contributions
ifying-1.100
igsaw-0.946
run-0.791
chie-0.770
 brief-0.757
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
 idea+0.543
 morning+0.458
 day+0.452
 people+0.450
 prevent+0.442
Neg logit contributions
ifying-0.913
igsaw-0.787
run-0.676
chie-0.660
jamin-0.650
Token time
Token ID640
Feature activation-5.85e-01

Pos logit contributions
 idea+0.831
crow+0.705
cut+0.696
 people+0.656
 prevent+0.654
Neg logit contributions
ifying-3.247
igsaw-2.867
run-2.567
 nowhere-2.516
jamin-2.504
Token and
Token ID290
Feature activation-3.97e-01

Pos logit contributions
ifying+1.077
igsaw+0.964
run+0.845
chie+0.828
 brief+0.791
Neg logit contributions
 idea-0.437
 morning-0.354
 day-0.351
 people-0.346
 prevent-0.332
Token buried
Token ID11694
Feature activation-2.34e-01

Pos logit contributions
ifying+0.904
igsaw+0.798
run+0.688
chie+0.671
 brief+0.638
Neg logit contributions
 idea-0.502
 morning-0.425
 day-0.422
 people-0.416
 prevent-0.404
Token the
Token ID262
Feature activation+8.29e-02

Pos logit contributions
ifying+0.609
igsaw+0.546
run+0.482
chie+0.470
 brief+0.451
Neg logit contributions
 idea-0.220
 morning-0.176
 day-0.173
 people-0.168
 doctors-0.163
Token gold
Token ID3869
Feature activation+6.13e-03

Pos logit contributions
 idea+0.120
 morning+0.104
 day+0.102
 prevent+0.101
 people+0.101
Neg logit contributions
ifying-0.172
igsaw-0.150
run-0.125
chie-0.124
 brief-0.117
Token coin
Token ID10752
Feature activation-6.24e-01

Pos logit contributions
 idea+0.009
 morning+0.008
 day+0.007
 people+0.007
 doctors+0.007
Neg logit contributions
ifying-0.013
igsaw-0.011
run-0.010
chie-0.009
 brief-0.009
Token.
Token ID13
Feature activation-4.22e-01

Pos logit contributions
ifying+1.524
igsaw+1.361
run+1.185
chie+1.162
 brief+1.110
Neg logit contributions
 idea-0.688
 day-0.567
 morning-0.567
 people-0.557
 prevent-0.533
Token After
Token ID2293
Feature activation-1.26e+00

Pos logit contributions
ifying+0.940
igsaw+0.828
run+0.710
chie+0.691
 brief+0.655
Neg logit contributions
 idea-0.562
 morning-0.480
 day-0.477
 people-0.472
 prevent-0.458
Token they
Token ID484
Feature activation-9.80e-01

Pos logit contributions
ifying+3.151
igsaw+2.778
run+2.452
chie+2.451
jamin+2.310
Neg logit contributions
 idea-1.375
 day-1.185
 morning-1.175
 people-1.163
 doctors-1.091
Token finished
Token ID5201
Feature activation-4.76e-01

Pos logit contributions
ifying+2.128
igsaw+1.881
run+1.604
chie+1.573
 brief+1.489
Neg logit contributions
 idea-1.369
 day-1.195
 morning-1.188
 people-1.154
 prevent-1.134
Token,
Token ID11
Feature activation+1.58e-03

Pos logit contributions
ifying+1.150
igsaw+1.023
run+0.891
chie+0.872
 brief+0.831
Neg logit contributions
 idea-0.542
 morning-0.450
 day-0.447
 people-0.440
 prevent-0.425
Token Mrs
Token ID9074
Feature activation-8.57e-01

Pos logit contributions
 idea+0.002
 morning+0.002
 day+0.002
 people+0.002
 doctors+0.002
Neg logit contributions
ifying-0.004
igsaw-0.003
run-0.003
chie-0.003
jamin-0.003
TokenI
Token ID40
Feature activation+6.96e-01

Pos logit contributions
 idea+0.289
 morning+0.248
 people+0.247
 hospital+0.247
 day+0.245
Neg logit contributions
ifying-0.364
igsaw-0.312
run-0.261
chie-0.257
jamin-0.242
Token want
Token ID765
Feature activation-4.69e-01

Pos logit contributions
 idea+0.794
 people+0.736
 cows+0.725
 paramedics+0.723
 doctors+0.723
Neg logit contributions
ifying-1.597
igsaw-1.406
run-1.234
chie-1.197
 brief-1.152
Token to
Token ID284
Feature activation+4.36e-01

Pos logit contributions
ifying+1.038
igsaw+0.914
run+0.782
chie+0.764
 brief+0.723
Neg logit contributions
 idea-0.629
 morning-0.539
 day-0.535
 people-0.529
 prevent-0.514
Token make
Token ID787
Feature activation+7.36e-01

Pos logit contributions
 idea+0.497
 morning+0.426
 day+0.418
 paramedics+0.414
 doctors+0.412
Neg logit contributions
ifying-1.023
igsaw-0.913
run-0.786
chie-0.776
 brief-0.734
Token my
Token ID616
Feature activation+9.24e-02

Pos logit contributions
 idea+0.996
 day+0.885
 morning+0.883
crow+0.828
 paramedics+0.820
Neg logit contributions
ifying-1.536
igsaw-1.394
run-1.199
chie-1.147
 brief-1.074
Token shoes
Token ID10012
Feature activation-1.13e+00

Pos logit contributions
 idea+0.139
 morning+0.122
 day+0.121
 prevent+0.120
 people+0.120
Neg logit contributions
ifying-0.181
igsaw-0.160
run-0.134
chie-0.129
 brief-0.125
Token shiny
Token ID22441
Feature activation-4.61e-01

Pos logit contributions
ifying+2.752
igsaw+2.462
run+2.160
chie+2.084
jamin+2.021
Neg logit contributions
 idea-1.289
 day-1.046
 morning-1.037
 people-1.029
 prevent-1.006
Token too
Token ID1165
Feature activation-4.41e-01

Pos logit contributions
ifying+1.131
igsaw+1.009
run+0.881
chie+0.862
 brief+0.823
Neg logit contributions
 idea-0.505
 morning-0.415
 day-0.412
 people-0.406
 prevent-0.391
Token.
Token ID13
Feature activation-6.80e-02

Pos logit contributions
ifying+1.058
igsaw+0.942
run+0.819
chie+0.801
 brief+0.763
Neg logit contributions
 idea-0.504
 morning-0.419
 day-0.416
 people-0.410
 prevent-0.396
Token Can
Token ID1680
Feature activation-1.36e+00

Pos logit contributions
ifying+0.138
igsaw+0.119
run+0.102
chie+0.097
jamin+0.092
Neg logit contributions
 idea-0.104
 day-0.089
 morning-0.089
 people-0.088
 prevent-0.087
Token you
Token ID345
Feature activation-3.75e-01

Pos logit contributions
ifying+2.794
igsaw+2.458
run+2.026
chie+1.963
 brief+1.913
Neg logit contributions
 idea-2.008
 people-1.858
 morning-1.841
 day-1.838
 prevent-1.790
Token he
Token ID339
Feature activation-2.31e-01

Pos logit contributions
 idea+1.053
 hospital+0.806
 prevent+0.803
 day+0.800
 paramedics+0.796
Neg logit contributions
ifying-2.337
igsaw-2.071
run-1.804
chie-1.725
 brief-1.691
Token had
Token ID550
Feature activation-3.95e-01

Pos logit contributions
ifying+0.566
igsaw+0.503
run+0.441
chie+0.431
jamin+0.414
Neg logit contributions
 idea-0.250
 morning-0.205
 day-0.203
 people-0.200
 prevent-0.194
Token to
Token ID284
Feature activation+5.78e-01

Pos logit contributions
ifying+0.900
igsaw+0.795
run+0.685
chie+0.669
 brief+0.636
Neg logit contributions
 idea-0.501
 morning-0.424
 day-0.421
 people-0.415
 prevent-0.403
Token help
Token ID1037
Feature activation+1.01e+00

Pos logit contributions
 idea+0.568
 morning+0.484
 day+0.474
 doctors+0.457
 paramedics+0.451
Neg logit contributions
ifying-1.406
igsaw-1.283
chie-1.087
run-1.079
jamin-1.049
Token someone
Token ID2130
Feature activation-1.54e-01

Pos logit contributions
 idea+1.197
 morning+1.044
 day+0.996
 paramedics+0.989
bye+0.969
Neg logit contributions
ifying-2.253
igsaw-2.048
run-1.824
chie-1.720
jamin-1.636
Token.
Token ID13
Feature activation-5.44e-01

Pos logit contributions
ifying+0.361
igsaw+0.320
run+0.280
chie+0.271
 brief+0.258
Neg logit contributions
 idea-0.183
 morning-0.154
 day-0.152
 people-0.149
 doctors-0.145
Token He
Token ID679
Feature activation-4.48e-02

Pos logit contributions
ifying+1.203
igsaw+1.056
run+0.904
chie+0.886
 brief+0.839
Neg logit contributions
 idea-0.732
 morning-0.626
 day-0.622
 people-0.616
 prevent-0.596
Token didn
Token ID1422
Feature activation-4.64e-01

Pos logit contributions
ifying+0.100
igsaw+0.087
run+0.076
chie+0.074
jamin+0.071
Neg logit contributions
 idea-0.058
 morning-0.050
 day-0.049
 people-0.048
 prevent-0.047
Token't
Token ID470
Feature activation-9.31e-02

Pos logit contributions
ifying+0.906
igsaw+0.783
run+0.655
chie+0.636
 brief+0.597
Neg logit contributions
 idea-0.737
 morning-0.647
 day-0.644
 people-0.638
 prevent-0.623
Token want
Token ID765
Feature activation-3.01e-02

Pos logit contributions
ifying+0.236
igsaw+0.213
run+0.187
chie+0.185
 brief+0.176
Neg logit contributions
 idea-0.089
 morning-0.074
 day-0.072
 people-0.071
 doctors-0.069
Token to
Token ID284
Feature activation+4.70e-02

Pos logit contributions
ifying+0.064
igsaw+0.057
run+0.048
chie+0.047
 brief+0.044
Neg logit contributions
 idea-0.042
 morning-0.036
 day-0.035
 people-0.035
 prevent-0.034
Token decided
Token ID3066
Feature activation+1.04e-02

Pos logit contributions
 idea+0.238
 morning+0.200
 day+0.199
 people+0.193
 prevent+0.190
Neg logit contributions
ifying-0.401
igsaw-0.354
run-0.309
chie-0.296
jamin-0.286
Token to
Token ID284
Feature activation+3.02e-01

Pos logit contributions
 idea+0.015
 morning+0.013
 day+0.013
 people+0.012
 doctors+0.012
Neg logit contributions
ifying-0.022
igsaw-0.019
run-0.016
chie-0.016
 brief-0.015
Token make
Token ID787
Feature activation+1.08e+00

Pos logit contributions
 idea+0.355
 morning+0.304
 day+0.298
 doctors+0.295
 people+0.293
Neg logit contributions
ifying-0.700
igsaw-0.624
run-0.535
chie-0.525
jamin-0.501
Token a
Token ID257
Feature activation+3.75e-01

Pos logit contributions
 idea+1.333
 day+1.196
 morning+1.184
cut+1.183
bye+1.163
Neg logit contributions
ifying-2.285
igsaw-2.122
run-1.816
chie-1.739
 brief-1.672
Token beautiful
Token ID4950
Feature activation+6.43e-01

Pos logit contributions
 idea+0.449
 morning+0.392
 people+0.385
 day+0.385
 doctors+0.383
Neg logit contributions
ifying-0.841
igsaw-0.758
run-0.637
chie-0.629
 brief-0.616
Token hair
Token ID4190
Feature activation+7.14e-02

Pos logit contributions
 idea+0.775
 paramedics+0.692
 doctors+0.691
 people+0.677
crow+0.668
Neg logit contributions
ifying-1.395
igsaw-1.282
run-1.051
chie-1.037
 brief-1.029
Token b
Token ID275
Feature activation-3.56e-01

Pos logit contributions
 idea+0.080
 morning+0.067
 day+0.066
 people+0.065
 doctors+0.063
Neg logit contributions
ifying-0.172
igsaw-0.152
run-0.132
chie-0.129
 brief-0.124
Tokenraid
Token ID7086
Feature activation-3.74e-01

Pos logit contributions
ifying+0.741
igsaw+0.648
run+0.549
chie+0.535
 brief+0.502
Neg logit contributions
 idea-0.527
 morning-0.458
 day-0.455
 people-0.451
 doctors-0.439
Token with
Token ID351
Feature activation+5.71e-01

Pos logit contributions
ifying+0.956
igsaw+0.856
run+0.754
chie+0.738
 brief+0.706
Neg logit contributions
 idea-0.368
 morning-0.296
 day-0.293
 people-0.288
 doctors-0.276
Token it
Token ID340
Feature activation-1.64e-01

Pos logit contributions
 idea+0.655
 morning+0.538
 day+0.530
 hospital+0.529
 doctors+0.527
Neg logit contributions
ifying-1.374
igsaw-1.199
run-1.065
chie-1.018
 brief-0.972
Token.
Token ID13
Feature activation-2.02e-01

Pos logit contributions
ifying+0.418
igsaw+0.372
run+0.329
chie+0.319
 brief+0.307
Neg logit contributions
 idea-0.163
 morning-0.131
 day-0.129
 people-0.128
 doctors-0.124
Token to
Token ID284
Feature activation+4.28e-01

Pos logit contributions
 idea+0.134
 morning+0.113
 day+0.111
 prevent+0.110
 paramedics+0.108
Neg logit contributions
ifying-0.209
igsaw-0.185
run-0.159
chie-0.152
 brief-0.147
Token nap
Token ID25422
Feature activation-8.65e-01

Pos logit contributions
 idea+0.508
 morning+0.441
 day+0.427
 paramedics+0.427
 doctors+0.421
Neg logit contributions
ifying-0.962
igsaw-0.866
chie-0.739
run-0.728
 brief-0.698
Token.
Token ID13
Feature activation-7.60e-01

Pos logit contributions
ifying+2.273
igsaw+2.049
run+1.813
chie+1.782
 brief+1.689
Neg logit contributions
 idea-0.824
 day-0.647
 people-0.646
 morning-0.645
 doctors-0.603
Token They
Token ID1119
Feature activation+8.82e-02

Pos logit contributions
ifying+1.871
igsaw+1.679
run+1.462
chie+1.451
 brief+1.383
Neg logit contributions
 idea-0.823
 morning-0.686
 day-0.676
 people-0.664
 hospital-0.629
Token thought
Token ID1807
Feature activation-1.51e-01

Pos logit contributions
 idea+0.121
 morning+0.104
 day+0.103
 people+0.102
 doctors+0.101
Neg logit contributions
ifying-0.190
igsaw-0.162
run-0.140
chie-0.138
 brief-0.130
Token n
Token ID299
Feature activation-1.99e+00

Pos logit contributions
ifying+0.395
igsaw+0.356
run+0.315
chie+0.305
 brief+0.295
Neg logit contributions
 idea-0.139
 morning-0.109
 day-0.107
 people-0.104
 prevent-0.103
Tokenapping
Token ID5912
Feature activation-3.82e-01

Pos logit contributions
ifying+4.382
igsaw+4.003
chie+3.385
 brief+3.280
run+3.226
Neg logit contributions
 idea-2.675
 day-2.269
 morning-2.223
 people-2.163
 whenever-2.142
Token was
Token ID373
Feature activation-2.79e-01

Pos logit contributions
ifying+0.926
igsaw+0.824
run+0.719
chie+0.702
 brief+0.671
Neg logit contributions
 idea-0.425
 morning-0.351
 day-0.348
 people-0.343
 prevent-0.332
Token boring
Token ID14262
Feature activation+2.78e-02

Pos logit contributions
ifying+0.658
igsaw+0.584
run+0.508
chie+0.494
 brief+0.473
Neg logit contributions
 idea-0.325
 morning-0.272
 day-0.269
 people-0.266
 prevent-0.258
Token and
Token ID290
Feature activation-8.92e-01

Pos logit contributions
 idea+0.027
 morning+0.022
 prevent+0.021
 day+0.021
 people+0.021
Neg logit contributions
ifying-0.070
igsaw-0.063
run-0.055
chie-0.053
 brief-0.052
Token silly
Token ID14397
Feature activation+2.93e-01

Pos logit contributions
ifying+2.155
igsaw+1.933
run+1.665
chie+1.641
jamin+1.561
Neg logit contributions
 idea-1.032
 day-0.872
 morning-0.865
 people-0.849
 doctors-0.808
Token She
Token ID1375
Feature activation-1.45e+00

Pos logit contributions
ifying+2.552
igsaw+2.244
run+1.929
chie+1.928
 brief+1.829
Neg logit contributions
 idea-1.306
 morning-1.116
 day-1.096
 people-1.090
 hospital-1.028
Token asked
Token ID1965
Feature activation+1.36e+00

Pos logit contributions
ifying+3.578
igsaw+3.267
run+2.730
chie+2.701
 brief+2.540
Neg logit contributions
 idea-1.670
 day-1.417
 people-1.415
 morning-1.403
 doctors-1.330
Token,
Token ID11
Feature activation-5.43e-02

Pos logit contributions
 idea+1.606
 paramedics+1.283
 morning+1.282
 hospital+1.274
 day+1.242
Neg logit contributions
ifying-3.035
igsaw-2.743
run-2.482
jamin-2.353
chie-2.285
Token "
Token ID366
Feature activation-1.14e+00

Pos logit contributions
ifying+0.130
igsaw+0.114
run+0.101
chie+0.098
jamin+0.094
Neg logit contributions
 idea-0.063
 morning-0.052
 day-0.051
 people-0.051
 doctors-0.049
TokenWhy
Token ID5195
Feature activation-1.16e-01

Pos logit contributions
ifying+2.476
igsaw+2.249
run+1.922
chie+1.850
 brief+1.737
Neg logit contributions
 idea-1.490
 people-1.341
 morning-1.340
 day-1.325
 doctors-1.299
Token are
Token ID389
Feature activation-2.04e+00

Pos logit contributions
ifying+0.254
igsaw+0.222
run+0.191
chie+0.185
 brief+0.177
Neg logit contributions
 idea-0.155
 morning-0.135
 day-0.133
 people-0.130
 doctors-0.128
Token you
Token ID345
Feature activation-1.58e+00

Pos logit contributions
ifying+4.918
igsaw+4.433
run+3.903
chie+3.662
jamin+3.627
Neg logit contributions
 people-2.347
 idea-2.346
 doctors-2.176
 morning-2.175
 day-2.168
Token sad
Token ID6507
Feature activation-2.92e-01

Pos logit contributions
ifying+3.862
igsaw+3.483
run+2.967
chie+2.892
 brief+2.798
Neg logit contributions
 idea-1.885
 day-1.581
 people-1.561
 morning-1.552
 hospital-1.495
Token?"
Token ID1701
Feature activation-4.38e-01

Pos logit contributions
ifying+0.728
igsaw+0.652
run+0.570
chie+0.557
 brief+0.534
Neg logit contributions
 idea-0.305
 morning-0.250
 day-0.247
 people-0.242
 prevent-0.235
Token The
Token ID383
Feature activation-1.31e+00

Pos logit contributions
ifying+1.026
igsaw+0.910
run+0.788
chie+0.770
 brief+0.733
Neg logit contributions
 idea-0.528
 morning-0.443
 day-0.440
 people-0.434
 prevent-0.420
Token boy
Token ID2933
Feature activation-5.45e-01

Pos logit contributions
ifying+3.601
igsaw+3.207
run+2.867
chie+2.820
jamin+2.671
Neg logit contributions
 idea-1.197
 people-0.981
 day-0.975
 morning-0.951
 hospital-0.879
Token what
Token ID644
Feature activation+2.04e-01

Pos logit contributions
 idea+0.262
 morning+0.234
 day+0.231
 paramedics+0.224
 people+0.224
Neg logit contributions
ifying-0.306
igsaw-0.258
run-0.222
chie-0.209
 brief-0.200
Token it
Token ID340
Feature activation-3.86e-01

Pos logit contributions
 idea+0.205
 morning+0.160
 paramedics+0.156
 prevent+0.154
 day+0.153
Neg logit contributions
ifying-0.510
igsaw-0.456
run-0.406
chie-0.392
 brief-0.384
Token was
Token ID373
Feature activation+1.10e+00

Pos logit contributions
ifying+0.866
igsaw+0.763
run+0.657
chie+0.640
 brief+0.608
Neg logit contributions
 idea-0.502
 morning-0.428
 day-0.424
 people-0.419
 prevent-0.407
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
 idea+1.219
 paramedics+1.076
 morning+1.027
 doctors+1.011
crow+1.011
Neg logit contributions
ifying-2.578
igsaw-2.176
run-1.957
chie-1.911
 brief-1.859
Token As
Token ID1081
Feature activation+1.07e+00

Pos logit contributions
ifying+0.670
igsaw+0.595
run+0.517
chie+0.501
 brief+0.478
Neg logit contributions
 idea-0.347
 morning-0.289
 day-0.288
 people-0.284
 prevent-0.278
Token he
Token ID339
Feature activation-1.84e+00

Pos logit contributions
 idea+1.552
 paramedics+1.231
 prevent+1.220
crow+1.202
 hospital+1.188
Neg logit contributions
ifying-2.270
igsaw-1.942
run-1.641
chie-1.578
 nowhere-1.573
Token got
Token ID1392
Feature activation+6.93e-01

Pos logit contributions
ifying+4.274
igsaw+3.984
chie+3.208
run+3.169
jamin+3.011
Neg logit contributions
 idea-2.310
 day-2.106
 morning-2.092
 people-2.036
 prevent-1.956
Token closer
Token ID5699
Feature activation-1.55e-01

Pos logit contributions
 idea+1.144
 morning+0.984
 day+0.980
 hospital+0.964
 paramedics+0.961
Neg logit contributions
ifying-1.292
igsaw-1.090
run-0.944
chie-0.907
 brief-0.869
Token,
Token ID11
Feature activation+4.47e-01

Pos logit contributions
ifying+0.394
igsaw+0.353
run+0.312
chie+0.304
 brief+0.292
Neg logit contributions
 idea-0.153
 morning-0.121
 day-0.120
 people-0.118
 doctors-0.116
Token he
Token ID339
Feature activation-4.00e-01

Pos logit contributions
 idea+0.569
 day+0.455
 morning+0.455
 paramedics+0.445
 prevent+0.442
Neg logit contributions
ifying-1.013
igsaw-0.882
run-0.761
chie-0.738
 brief-0.719
Token realized
Token ID6939
Feature activation+4.23e-01

Pos logit contributions
ifying+0.895
igsaw+0.787
run+0.678
chie+0.661
 brief+0.628
Neg logit contributions
 idea-0.525
 morning-0.446
 day-0.444
 people-0.438
 prevent-0.425
Token bear
Token ID6842
Feature activation-3.30e-01

Pos logit contributions
ifying+1.526
igsaw+1.358
run+1.193
chie+1.165
jamin+1.107
Neg logit contributions
 idea-0.661
 morning-0.539
 day-0.536
 people-0.530
 prevent-0.506
Token coming
Token ID2406
Feature activation+2.91e-01

Pos logit contributions
ifying+0.759
igsaw+0.669
run+0.581
chie+0.565
 brief+0.539
Neg logit contributions
 idea-0.409
 morning-0.345
 day-0.342
 people-0.337
 doctors-0.326
Token towards
Token ID3371
Feature activation+4.76e-01

Pos logit contributions
 idea+0.423
 morning+0.364
 day+0.357
 hospital+0.351
 doctors+0.350
Neg logit contributions
ifying-0.611
igsaw-0.524
run-0.460
chie-0.454
 brief-0.415
Token them
Token ID606
Feature activation-2.61e-01

Pos logit contributions
 idea+0.498
 morning+0.388
 day+0.377
 doctors+0.377
 bal+0.376
Neg logit contributions
ifying-1.175
igsaw-1.048
run-0.927
chie-0.911
 brief-0.856
Token.
Token ID13
Feature activation-2.56e-01

Pos logit contributions
ifying+0.672
igsaw+0.602
run+0.533
chie+0.519
 brief+0.496
Neg logit contributions
 idea-0.251
 morning-0.202
 day-0.197
 people-0.194
 doctors-0.188
Token They
Token ID1119
Feature activation-1.74e+00

Pos logit contributions
ifying+0.620
igsaw+0.553
run+0.483
chie+0.469
jamin+0.449
Neg logit contributions
 idea-0.286
 morning-0.236
 day-0.234
 people-0.229
 prevent-0.223
Token quickly
Token ID2952
Feature activation-3.66e-01

Pos logit contributions
ifying+4.161
igsaw+3.924
run+3.174
chie+3.104
 brief+2.983
Neg logit contributions
 idea-2.100
 people-1.897
 day-1.755
 prevent-1.753
 morning-1.699
Token forgot
Token ID16453
Feature activation-7.07e-01

Pos logit contributions
ifying+0.798
igsaw+0.701
run+0.600
chie+0.585
 brief+0.555
Neg logit contributions
 idea-0.499
 morning-0.429
 day-0.426
 people-0.420
 prevent-0.408
Token about
Token ID546
Feature activation-1.03e+00

Pos logit contributions
ifying+1.792
igsaw+1.608
run+1.400
chie+1.385
jamin+1.314
Neg logit contributions
 idea-0.723
 people-0.586
 morning-0.584
 day-0.583
 prevent-0.555
Token the
Token ID262
Feature activation+2.47e-01

Pos logit contributions
ifying+2.714
igsaw+2.432
run+2.118
chie+2.107
jamin+2.010
Neg logit contributions
 idea-0.993
 people-0.809
 day-0.806
 morning-0.788
 prevent-0.766
Token rock
Token ID3881
Feature activation-9.13e-01

Pos logit contributions
 idea+0.362
 morning+0.321
 prevent+0.313
 day+0.313
 people+0.313
Neg logit contributions
ifying-0.491
igsaw-0.435
run-0.361
chie-0.353
 brief-0.344
Tokenies
Token ID444
Feature activation-3.77e-01

Pos logit contributions
ifying+1.359
igsaw+1.185
run+1.003
chie+0.978
 brief+0.927
Neg logit contributions
 idea-0.934
 morning-0.807
 day-0.806
 people-0.797
 doctors-0.769
Token do
Token ID466
Feature activation+7.56e-01

Pos logit contributions
ifying+0.931
igsaw+0.831
run+0.727
chie+0.712
 brief+0.680
Neg logit contributions
 idea-0.398
 morning-0.326
 day-0.322
 people-0.317
 doctors-0.305
Token?"
Token ID1701
Feature activation+4.49e-01

Pos logit contributions
 idea+0.924
 paramedics+0.803
 morning+0.785
 ostr+0.773
 hospital+0.769
Neg logit contributions
ifying-1.648
igsaw-1.478
run-1.277
chie-1.236
 brief-1.174
Token
Token ID198
Feature activation+8.51e-01

Pos logit contributions
 idea+0.642
 people+0.528
 day+0.528
 prevent+0.527
 doctors+0.520
Neg logit contributions
ifying-0.948
igsaw-0.834
run-0.721
jamin-0.672
chie-0.665
Token
Token ID198
Feature activation+2.44e-01

Pos logit contributions
 idea+1.172
 doctors+0.945
 morning+0.933
 people+0.930
 day+0.925
Neg logit contributions
ifying-1.863
igsaw-1.657
run-1.458
chie-1.350
jamin-1.345
Token"
Token ID1
Feature activation-1.66e+00

Pos logit contributions
 idea+0.350
 morning+0.293
 day+0.290
 people+0.287
 doctors+0.281
Neg logit contributions
ifying-0.527
igsaw-0.462
run-0.390
chie-0.383
jamin-0.366
TokenWell
Token ID5779
Feature activation+9.13e-01

Pos logit contributions
ifying+3.619
igsaw+3.313
run+2.887
chie+2.711
 brief+2.616
Neg logit contributions
 idea-2.247
 day-2.039
 morning-2.008
 people-1.998
 doctors-1.928
Token,"
Token ID553
Feature activation+6.31e-01

Pos logit contributions
 idea+0.901
 prevent+0.746
 people+0.746
 doctors+0.744
 paramedics+0.728
Neg logit contributions
ifying-2.203
igsaw-2.036
run-1.775
chie-1.743
jamin-1.655
Token her
Token ID607
Feature activation-2.32e-01

Pos logit contributions
 idea+0.740
 morning+0.615
 day+0.611
 prevent+0.601
 hospital+0.592
Neg logit contributions
ifying-1.429
igsaw-1.289
run-1.117
chie-1.065
jamin-1.051
Token mom
Token ID1995
Feature activation-1.57e-01

Pos logit contributions
ifying+0.647
igsaw+0.586
run+0.523
chie+0.511
 brief+0.494
Neg logit contributions
 idea-0.174
 morning-0.132
 day-0.129
 people-0.125
 prevent-0.122
Token said
Token ID531
Feature activation+2.31e-01

Pos logit contributions
ifying+0.359
igsaw+0.319
run+0.279
chie+0.269
jamin+0.259
Neg logit contributions
 idea-0.191
 morning-0.162
 day-0.160
 people-0.160
 doctors-0.155