TOP ACTIVATIONS
MAX = 27.799

a black cat named Wh isk ers . Wh isk ers
Wh isk ers . Wh isk ers loved to walk around
a lazy cat named Wh isk ers . Wh isk ers
, fluffy cat named Wh isk ers , who liked to
a friendly cat named Wh isk ers . Wh isk ers
Wh isk ers . Wh isk ers liked to imagine that
Wh isk ers . Wh isk ers loved to sleep all
a naughty cat named Wh isk ers jumped in front of
's pet cat , Wh isk ers , jumped up and
happier . Mr . Wh isk ers went outside and when
a quiet mouse named Wh isk ers . Wh isk ers
One day , Wh isk ers ' owner was drinking
an adorable cat named Wh isk ers . Lily and Wh
, while walking , Wh isk ers met a bird named
nos y cat , Wh isk ers , saw her searching
ers . Lily and Wh isk ers loved to play with
. One day , Wh isk ers received a letter in
of the camera . Wh isk ers wanted to be in
a lazy cat named Wh isk ers . Wh isk ers
a quiet cat named Wh isk ers who liked to examine

MIN ACTIVATIONS
MIN = -26.885

still stubborn but he was determined to figure it all out
But even though Jack was determined , he hit the ball
sunny day , he was determined to reach the top of
she was brave and was determined to reach the other side
scared , but she was determined to try it out .
on her head and was determined to succeed . She went
game , but he was determined to still play fair .
small towel . He was determined to succeed in iron ing
But she was determined to learn how to read
and started building again , determined to make an even longer
kept cleaning . She was determined to finish the job .
quite ignorant but she was determined to take care of the
T eddy was determined . He knew he could
a promise . She was determined to find Sarah and deliver
The cats were still determined to get the mouse .
Lucy kept trying and was determined to increase her knowledge .
But Ted was determined to try it . He
frustrated , but she was determined to find a way to
. Lily was cheerful and determined . She opened the back
. Sam was determined . He threw on more

INTERVAL 14.128 - 27.799
CONTAINS 0.966%

to go . The veterinarian is a nice person who helps
The crab won 't hurt us . It 's just looking
mom very much . <|endoftext|> L ily and Tom are friends
Mom my . I have my stick to protect me ."
. He said " Ar r !" and wandered to the

INTERVAL 0.457 - 14.128
CONTAINS 43.768%

it . " Oh no !" Lily cried
yet . She suggested they keep walking and perhaps they would
, but the best things in life are free . Spending
for Bob . She wondered how it would be to not
too , my dear . You have a big heart and

INTERVAL -13.214 - 0.457
CONTAINS 53.969%

L ila and Ben thought about what their mom said .
. He wanted to help a lot but he didn â
he walked to the pond . He wanted to see what
hugged her mum tightly and said goodbye with a big kiss
The bunny knew he had to find a way to solve

INTERVAL -26.885 - -13.214
CONTAINS 1.297%

When Sue reached the garden she took a sip of her
Amy was amazed . She had an easy
Sarah was so happy she had been saved ! She
S ally was determined to get her balloon back
outside . One day , she decided to adventure around her
Token a
Token ID257
Feature activation-9.25e-01

Pos logit contributions
ara+0.530
ucks+0.518
eds+0.498
 is+0.475
icer+0.471
Neg logit contributions
 determined-0.446
 unsure-0.408
 confused-0.386
 nervous-0.381
 hungry-0.374
Token black
Token ID2042
Feature activation-3.15e+00

Pos logit contributions
 determined+0.047
 unsure+0.045
 confused+0.041
dream+0.040
 thirsty+0.040
Neg logit contributions
ara-0.051
ucks-0.047
eds-0.046
 is-0.044
icer-0.043
Token cat
Token ID3797
Feature activation+3.05e+00

Pos logit contributions
 determined+0.161
 hungry+0.158
 thirsty+0.154
 nervous+0.152
 curious+0.152
Neg logit contributions
ucks-0.166
eds-0.163
ara-0.151
ac-0.142
 occasions-0.141
Token named
Token ID3706
Feature activation-1.74e+00

Pos logit contributions
ucks+0.173
eds+0.171
ara+0.170
of+0.152
icer+0.151
Neg logit contributions
 determined-0.151
 unsure-0.137
 hungry-0.135
 thirsty-0.134
 nervous-0.133
Token Wh
Token ID854
Feature activation+9.24e+00

Pos logit contributions
 determined+0.079
 hungry+0.074
 thirsty+0.073
 unsure+0.071
 nervous+0.071
Neg logit contributions
ucks-0.106
eds-0.104
ara-0.101
icer-0.092
ent-0.092
Tokenisk
Token ID1984
Feature activation+2.78e+01

Pos logit contributions
ucks+0.353
 occasions+0.290
eds+0.272
ac+0.270
ara+0.258
Neg logit contributions
 determined-0.547
 hungry-0.535
 curious-0.531
dream-0.523
 thirsty-0.519
Tokeners
Token ID364
Feature activation+1.27e+01

Pos logit contributions
ucks+1.058
ara+0.986
eds+0.943
icer+0.889
 occasions+0.821
Neg logit contributions
 determined-1.804
 he-1.713
 curious-1.669
 hungry-1.663
 she-1.654
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
ucks+0.677
eds+0.661
ara+0.653
icer+0.580
of+0.576
Neg logit contributions
 determined-0.678
 hungry-0.626
 unsure-0.619
 thirsty-0.613
 curious-0.611
Token Wh
Token ID854
Feature activation+9.18e+00

Pos logit contributions
ucks+0.011
eds+0.011
ara+0.011
of+0.010
icer+0.010
Neg logit contributions
 determined-0.012
 unsure-0.011
 hungry-0.011
 thirsty-0.011
 nervous-0.011
Tokenisk
Token ID1984
Feature activation+2.77e+01

Pos logit contributions
ucks+0.342
 occasions+0.318
eds+0.286
ara+0.281
ee+0.276
Neg logit contributions
 hungry-0.544
 determined-0.536
 thirsty-0.533
 nervous-0.519
 curious-0.515
Tokeners
Token ID364
Feature activation+1.27e+01

Pos logit contributions
ucks+1.071
ara+1.034
eds+0.934
icer+0.896
 occasions+0.854
Neg logit contributions
 determined-1.782
 curious-1.666
 he-1.663
 excited-1.637
 hungry-1.635
Token Wh
Token ID854
Feature activation+9.24e+00

Pos logit contributions
 determined+0.079
 hungry+0.074
 thirsty+0.073
 unsure+0.071
 nervous+0.071
Neg logit contributions
ucks-0.106
eds-0.104
ara-0.101
icer-0.092
ent-0.092
Tokenisk
Token ID1984
Feature activation+2.78e+01

Pos logit contributions
ucks+0.353
 occasions+0.290
eds+0.272
ac+0.270
ara+0.258
Neg logit contributions
 determined-0.547
 hungry-0.535
 curious-0.531
dream-0.523
 thirsty-0.519
Tokeners
Token ID364
Feature activation+1.27e+01

Pos logit contributions
ucks+1.058
ara+0.986
eds+0.943
icer+0.889
 occasions+0.821
Neg logit contributions
 determined-1.804
 he-1.713
 curious-1.669
 hungry-1.663
 she-1.654
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
ucks+0.677
eds+0.661
ara+0.653
icer+0.580
of+0.576
Neg logit contributions
 determined-0.678
 hungry-0.626
 unsure-0.619
 thirsty-0.613
 curious-0.611
Token Wh
Token ID854
Feature activation+9.18e+00

Pos logit contributions
ucks+0.011
eds+0.011
ara+0.011
of+0.010
icer+0.010
Neg logit contributions
 determined-0.012
 unsure-0.011
 hungry-0.011
 thirsty-0.011
 nervous-0.011
Tokenisk
Token ID1984
Feature activation+2.77e+01

Pos logit contributions
ucks+0.342
 occasions+0.318
eds+0.286
ara+0.281
ee+0.276
Neg logit contributions
 hungry-0.544
 determined-0.536
 thirsty-0.533
 nervous-0.519
 curious-0.515
Tokeners
Token ID364
Feature activation+1.27e+01

Pos logit contributions
ucks+1.071
ara+1.034
eds+0.934
icer+0.896
 occasions+0.854
Neg logit contributions
 determined-1.782
 curious-1.666
 he-1.663
 excited-1.637
 hungry-1.635
Token loved
Token ID6151
Feature activation+1.02e+00

Pos logit contributions
ucks+0.778
eds+0.749
ara+0.713
icer+0.656
 occasions+0.651
Neg logit contributions
 determined-0.616
 curious-0.567
 hungry-0.564
 thirsty-0.554
 nervous-0.543
Token to
Token ID284
Feature activation-1.12e+00

Pos logit contributions
eds+0.056
ucks+0.055
ara+0.053
 us+0.050
 is+0.050
Neg logit contributions
 determined-0.052
 unsure-0.049
 confused-0.046
 nervous-0.045
 confident-0.045
Token walk
Token ID2513
Feature activation+2.52e+00

Pos logit contributions
 determined+0.072
 hungry+0.067
 unsure+0.066
 thirsty+0.066
 nervous+0.066
Neg logit contributions
ucks-0.047
eds-0.046
ara-0.046
icer-0.040
of-0.039
Token around
Token ID1088
Feature activation-3.22e+00

Pos logit contributions
ucks+0.134
eds+0.134
ara+0.133
 is+0.121
of+0.120
Neg logit contributions
 determined-0.133
 unsure-0.121
 thirsty-0.117
 confused-0.117
 hungry-0.116
Token a
Token ID257
Feature activation-1.19e+00

Pos logit contributions
ara+0.512
ucks+0.497
eds+0.477
 is+0.456
icer+0.455
Neg logit contributions
 determined-0.443
 unsure-0.407
 confused-0.386
 nervous-0.379
 hungry-0.372
Token lazy
Token ID16931
Feature activation-7.54e+00

Pos logit contributions
 determined+0.063
 unsure+0.062
 discouraged+0.055
dream+0.055
 confused+0.055
Neg logit contributions
ara-0.063
ucks-0.054
eds-0.054
 is-0.052
ee-0.051
Token cat
Token ID3797
Feature activation+2.84e+00

Pos logit contributions
 determined+0.488
 unsure+0.458
 desperate+0.426
 confused+0.425
 thirsty+0.425
Neg logit contributions
ara-0.331
eds-0.286
ucks-0.282
 is-0.280
 us-0.256
Token named
Token ID3706
Feature activation-1.93e+00

Pos logit contributions
ucks+0.162
eds+0.160
ara+0.160
of+0.143
 is+0.142
Neg logit contributions
 determined-0.140
 unsure-0.126
 hungry-0.125
 thirsty-0.124
 nervous-0.123
Token Wh
Token ID854
Feature activation+9.09e+00

Pos logit contributions
 determined+0.091
 unsure+0.081
 hungry+0.081
 thirsty+0.080
 nervous+0.079
Neg logit contributions
ucks-0.115
eds-0.114
ara-0.114
of-0.102
icer-0.102
Tokenisk
Token ID1984
Feature activation+2.77e+01

Pos logit contributions
ucks+0.358
 occasions+0.295
eds+0.278
ac+0.270
ara+0.267
Neg logit contributions
 determined-0.544
 hungry-0.515
dream-0.514
 curious-0.509
 unsure-0.504
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+1.034
ara+0.962
eds+0.909
icer+0.859
 occasions+0.787
Neg logit contributions
 determined-1.858
 he-1.740
 hungry-1.703
 curious-1.698
 she-1.692
Token.
Token ID13
Feature activation+7.68e-02

Pos logit contributions
ucks+0.640
eds+0.626
ara+0.621
icer+0.548
of+0.544
Neg logit contributions
 determined-0.697
 hungry-0.640
 unsure-0.639
 thirsty-0.631
 nervous-0.626
Token Wh
Token ID854
Feature activation+9.04e+00

Pos logit contributions
ucks+0.004
ara+0.004
eds+0.004
icer+0.003
of+0.003
Neg logit contributions
 determined-0.004
 unsure-0.004
 hungry-0.004
 thirsty-0.004
 nervous-0.004
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.351
 occasions+0.325
ara+0.293
eds+0.291
ee+0.286
Neg logit contributions
 determined-0.522
 hungry-0.520
 thirsty-0.508
 nervous-0.496
 confused-0.490
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+1.082
ara+1.043
eds+0.924
icer+0.894
 occasions+0.854
Neg logit contributions
 determined-1.785
 he-1.655
 curious-1.646
 excited-1.629
 hungry-1.624
Token,
Token ID11
Feature activation-2.56e+00

Pos logit contributions
ara+0.008
ucks+0.008
eds+0.008
 is+0.007
icer+0.007
Neg logit contributions
 determined-0.009
 unsure-0.009
 thirsty-0.008
 hungry-0.008
 nervous-0.008
Token fluffy
Token ID39145
Feature activation-3.12e+00

Pos logit contributions
 determined+0.112
 unsure+0.109
 discouraged+0.102
 desperate+0.099
 barely+0.098
Neg logit contributions
ara-0.133
 is-0.129
 us-0.115
ucks-0.113
ac-0.111
Token cat
Token ID3797
Feature activation+2.83e+00

Pos logit contributions
 determined+0.177
 unsure+0.163
 hungry+0.160
 thirsty+0.159
 nervous+0.157
Neg logit contributions
ucks-0.154
ara-0.153
eds-0.151
 is-0.134
icer-0.133
Token named
Token ID3706
Feature activation-1.83e+00

Pos logit contributions
eds+0.146
ara+0.146
ucks+0.144
 is+0.131
of+0.129
Neg logit contributions
 determined-0.152
 unsure-0.140
 hungry-0.136
 thirsty-0.136
 nervous-0.135
Token Wh
Token ID854
Feature activation+9.11e+00

Pos logit contributions
 determined+0.087
 hungry+0.079
 unsure+0.079
 thirsty+0.078
 nervous+0.078
Neg logit contributions
ucks-0.107
eds-0.106
ara-0.103
icer-0.094
ac-0.092
Tokenisk
Token ID1984
Feature activation+2.77e+01

Pos logit contributions
ucks+0.327
eds+0.288
ac+0.279
ases+0.264
angles+0.252
Neg logit contributions
 determined-0.575
dream-0.550
 curious-0.545
 nervous-0.540
 unsure-0.536
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+0.972
ara+0.921
eds+0.879
icer+0.838
atch+0.724
Neg logit contributions
 determined-1.923
 curious-1.761
 she-1.749
 hungry-1.743
 excited-1.740
Token,
Token ID11
Feature activation-2.58e+00

Pos logit contributions
ara+0.521
eds+0.514
ucks+0.512
 is+0.462
of+0.455
Neg logit contributions
 determined-0.776
 unsure-0.724
 thirsty-0.715
 hungry-0.708
 nervous-0.702
Token who
Token ID508
Feature activation+1.27e+00

Pos logit contributions
 determined+0.136
 unsure+0.125
 thirsty+0.118
 nervous+0.118
 confused+0.118
Neg logit contributions
ara-0.134
eds-0.128
ucks-0.128
 is-0.123
of-0.116
Token liked
Token ID8288
Feature activation+9.74e-01

Pos logit contributions
ucks+0.061
ara+0.061
eds+0.060
 is+0.054
of+0.053
Neg logit contributions
 determined-0.073
 unsure-0.067
 hungry-0.066
 thirsty-0.066
 nervous-0.065
Token to
Token ID284
Feature activation-1.28e+00

Pos logit contributions
ucks+0.058
eds+0.058
ara+0.058
 is+0.052
of+0.052
Neg logit contributions
 determined-0.044
 unsure-0.040
 thirsty-0.039
 hungry-0.039
 confused-0.039
Token a
Token ID257
Feature activation-1.19e+00

Pos logit contributions
ara+0.505
ucks+0.487
eds+0.468
 is+0.450
 us+0.447
Neg logit contributions
 determined-0.449
 unsure-0.414
 confused-0.391
 nervous-0.384
 confident-0.376
Token friendly
Token ID8030
Feature activation-3.58e+00

Pos logit contributions
 determined+0.063
 unsure+0.061
 discouraged+0.055
dream+0.055
 confused+0.054
Neg logit contributions
ara-0.063
ucks-0.055
eds-0.054
 is-0.053
ee-0.051
Token cat
Token ID3797
Feature activation+2.76e+00

Pos logit contributions
 determined+0.216
 unsure+0.199
 thirsty+0.190
 hungry+0.189
 confused+0.189
Neg logit contributions
ara-0.169
ucks-0.164
eds-0.159
 is-0.143
of-0.142
Token named
Token ID3706
Feature activation-1.95e+00

Pos logit contributions
ucks+0.163
eds+0.160
ara+0.160
icer+0.143
of+0.143
Neg logit contributions
 determined-0.130
 unsure-0.117
 hungry-0.117
 thirsty-0.116
 nervous-0.114
Token Wh
Token ID854
Feature activation+9.05e+00

Pos logit contributions
 determined+0.085
 hungry+0.078
 thirsty+0.076
 unsure+0.076
 nervous+0.075
Neg logit contributions
ucks-0.124
eds-0.121
ara-0.119
icer-0.109
ent-0.108
Tokenisk
Token ID1984
Feature activation+2.77e+01

Pos logit contributions
ucks+0.363
 occasions+0.302
ac+0.281
eds+0.278
ara+0.269
Neg logit contributions
 determined-0.518
 hungry-0.503
dream-0.496
 curious-0.495
 thirsty-0.490
Tokeners
Token ID364
Feature activation+1.26e+01

Pos logit contributions
ucks+1.071
ara+0.987
eds+0.935
icer+0.905
 occasions+0.818
Neg logit contributions
 determined-1.782
 he-1.683
 she-1.656
 hungry-1.646
 curious-1.639
Token.
Token ID13
Feature activation+6.34e-03

Pos logit contributions
ucks+0.689
eds+0.667
ara+0.661
icer+0.590
of+0.582
Neg logit contributions
 determined-0.655
 hungry-0.607
 unsure-0.595
 curious-0.593
 thirsty-0.592
Token Wh
Token ID854
Feature activation+9.04e+00

Pos logit contributions
ucks+0.000
ara+0.000
eds+0.000
 is+0.000
icer+0.000
Neg logit contributions
 determined-0.000
 unsure-0.000
 thirsty-0.000
 hungry-0.000
 confused-0.000
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.354
 occasions+0.329
ara+0.296
ee+0.295
eds+0.295
Neg logit contributions
 hungry-0.515
 determined-0.506
 thirsty-0.503
 nervous-0.487
 confused-0.477
Tokeners
Token ID364
Feature activation+1.26e+01

Pos logit contributions
ucks+1.099
ara+1.053
eds+0.938
icer+0.918
 occasions+0.860
Neg logit contributions
 determined-1.740
 he-1.628
 curious-1.617
 excited-1.607
 hungry-1.597
Token Wh
Token ID854
Feature activation+9.05e+00

Pos logit contributions
 determined+0.085
 hungry+0.078
 thirsty+0.076
 unsure+0.076
 nervous+0.075
Neg logit contributions
ucks-0.124
eds-0.121
ara-0.119
icer-0.109
ent-0.108
Tokenisk
Token ID1984
Feature activation+2.77e+01

Pos logit contributions
ucks+0.363
 occasions+0.302
ac+0.281
eds+0.278
ara+0.269
Neg logit contributions
 determined-0.518
 hungry-0.503
dream-0.496
 curious-0.495
 thirsty-0.490
Tokeners
Token ID364
Feature activation+1.26e+01

Pos logit contributions
ucks+1.071
ara+0.987
eds+0.935
icer+0.905
 occasions+0.818
Neg logit contributions
 determined-1.782
 he-1.683
 she-1.656
 hungry-1.646
 curious-1.639
Token.
Token ID13
Feature activation+6.34e-03

Pos logit contributions
ucks+0.689
eds+0.667
ara+0.661
icer+0.590
of+0.582
Neg logit contributions
 determined-0.655
 hungry-0.607
 unsure-0.595
 curious-0.593
 thirsty-0.592
Token Wh
Token ID854
Feature activation+9.04e+00

Pos logit contributions
ucks+0.000
ara+0.000
eds+0.000
 is+0.000
icer+0.000
Neg logit contributions
 determined-0.000
 unsure-0.000
 thirsty-0.000
 hungry-0.000
 confused-0.000
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.354
 occasions+0.329
ara+0.296
ee+0.295
eds+0.295
Neg logit contributions
 hungry-0.515
 determined-0.506
 thirsty-0.503
 nervous-0.487
 confused-0.477
Tokeners
Token ID364
Feature activation+1.26e+01

Pos logit contributions
ucks+1.099
ara+1.053
eds+0.938
icer+0.918
 occasions+0.860
Neg logit contributions
 determined-1.740
 he-1.628
 curious-1.617
 excited-1.607
 hungry-1.597
Token liked
Token ID8288
Feature activation+9.29e-01

Pos logit contributions
ucks+0.799
eds+0.765
ara+0.737
icer+0.679
ases+0.673
Neg logit contributions
 determined-0.589
 hungry-0.536
 curious-0.535
 thirsty-0.524
 nervous-0.513
Token to
Token ID284
Feature activation-1.15e+00

Pos logit contributions
eds+0.047
ucks+0.046
ara+0.044
 us+0.043
 is+0.042
Neg logit contributions
 determined-0.050
 unsure-0.048
 confused-0.045
 confident-0.044
 nervous-0.043
Token imagine
Token ID5967
Feature activation+3.77e-01

Pos logit contributions
 determined+0.073
 hungry+0.068
 thirsty+0.067
 unsure+0.067
 nervous+0.066
Neg logit contributions
ucks-0.050
eds-0.049
ara-0.049
icer-0.042
of-0.041
Token that
Token ID326
Feature activation-6.81e+00

Pos logit contributions
eds+0.020
 is+0.019
ucks+0.019
 us+0.018
ara+0.018
Neg logit contributions
 determined-0.020
 unsure-0.018
 thirsty-0.018
 confused-0.017
 nervous-0.017
Token Wh
Token ID854
Feature activation+9.09e+00

Pos logit contributions
 determined+0.091
 unsure+0.081
 hungry+0.081
 thirsty+0.080
 nervous+0.079
Neg logit contributions
ucks-0.115
eds-0.114
ara-0.114
of-0.102
icer-0.102
Tokenisk
Token ID1984
Feature activation+2.77e+01

Pos logit contributions
ucks+0.358
 occasions+0.295
eds+0.278
ac+0.270
ara+0.267
Neg logit contributions
 determined-0.544
 hungry-0.515
dream-0.514
 curious-0.509
 unsure-0.504
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+1.034
ara+0.962
eds+0.909
icer+0.859
 occasions+0.787
Neg logit contributions
 determined-1.858
 he-1.740
 hungry-1.703
 curious-1.698
 she-1.692
Token.
Token ID13
Feature activation+7.68e-02

Pos logit contributions
ucks+0.640
eds+0.626
ara+0.621
icer+0.548
of+0.544
Neg logit contributions
 determined-0.697
 hungry-0.640
 unsure-0.639
 thirsty-0.631
 nervous-0.626
Token Wh
Token ID854
Feature activation+9.04e+00

Pos logit contributions
ucks+0.004
ara+0.004
eds+0.004
icer+0.003
of+0.003
Neg logit contributions
 determined-0.004
 unsure-0.004
 hungry-0.004
 thirsty-0.004
 nervous-0.004
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.351
 occasions+0.325
ara+0.293
eds+0.291
ee+0.286
Neg logit contributions
 determined-0.522
 hungry-0.520
 thirsty-0.508
 nervous-0.496
 confused-0.490
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+1.082
ara+1.043
eds+0.924
icer+0.894
 occasions+0.854
Neg logit contributions
 determined-1.785
 he-1.655
 curious-1.646
 excited-1.629
 hungry-1.624
Token loved
Token ID6151
Feature activation+8.56e-01

Pos logit contributions
ucks+0.718
eds+0.689
ara+0.671
icer+0.604
of+0.602
Neg logit contributions
 determined-0.650
 hungry-0.592
 thirsty-0.583
 unsure-0.583
 curious-0.581
Token to
Token ID284
Feature activation-1.28e+00

Pos logit contributions
eds+0.046
ucks+0.043
ara+0.041
 us+0.041
 is+0.040
Neg logit contributions
 determined-0.045
 unsure-0.043
 confused-0.040
 nervous-0.039
 delighted-0.039
Token sleep
Token ID3993
Feature activation-1.91e+00

Pos logit contributions
 determined+0.080
 unsure+0.075
 hungry+0.074
 thirsty+0.074
 nervous+0.073
Neg logit contributions
ucks-0.055
eds-0.054
ara-0.054
of-0.046
icer-0.046
Token all
Token ID477
Feature activation-2.08e+00

Pos logit contributions
 determined+0.103
 unsure+0.097
 thirsty+0.093
 confused+0.092
 nervous+0.090
Neg logit contributions
eds-0.093
ara-0.087
 is-0.086
ucks-0.086
 us-0.084
Token a
Token ID257
Feature activation-1.03e+00

Pos logit contributions
 determined+0.095
 unsure+0.085
 confident+0.083
 confused+0.082
 nervous+0.080
Neg logit contributions
ara-0.160
 is-0.155
 us-0.148
of-0.144
ucks-0.142
Token naughty
Token ID45128
Feature activation-5.15e+00

Pos logit contributions
 determined+0.049
 unsure+0.046
 thirsty+0.043
 confident+0.041
 hungry+0.041
Neg logit contributions
ara-0.057
eds-0.055
 is-0.053
ucks-0.052
of-0.050
Token cat
Token ID3797
Feature activation+2.93e+00

Pos logit contributions
 determined+0.324
 unsure+0.300
 thirsty+0.292
 hungry+0.288
 confused+0.285
Neg logit contributions
ara-0.230
ucks-0.214
eds-0.209
 is-0.198
of-0.190
Token named
Token ID3706
Feature activation-1.64e+00

Pos logit contributions
ucks+0.158
eds+0.157
ara+0.157
of+0.140
 is+0.139
Neg logit contributions
 determined-0.151
 unsure-0.138
 hungry-0.136
 thirsty-0.136
 nervous-0.134
Token Wh
Token ID854
Feature activation+9.19e+00

Pos logit contributions
 determined+0.073
 hungry+0.066
 unsure+0.065
 thirsty+0.065
 nervous+0.064
Neg logit contributions
ucks-0.102
eds-0.101
ara-0.099
icer-0.091
ent-0.089
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.332
eds+0.290
ac+0.265
icer+0.263
 occasions+0.262
Neg logit contributions
 determined-0.591
 hungry-0.557
 curious-0.555
 unsure-0.551
 nervous-0.544
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+0.994
ara+0.904
eds+0.901
icer+0.856
 occasions+0.731
Neg logit contributions
 determined-1.917
 he-1.757
 hungry-1.738
 curious-1.735
 she-1.734
Token jumped
Token ID11687
Feature activation-4.15e+00

Pos logit contributions
ara+0.637
eds+0.632
ucks+0.611
 is+0.583
of+0.574
Neg logit contributions
 determined-0.657
 unsure-0.604
 thirsty-0.591
 hungry-0.585
 nervous-0.581
Token in
Token ID287
Feature activation+1.04e+00

Pos logit contributions
 determined+0.232
 unsure+0.208
 thirsty+0.206
 confident+0.203
 hungry+0.201
Neg logit contributions
eds-0.205
ucks-0.203
ara-0.201
 us-0.185
 is-0.185
Token front
Token ID2166
Feature activation+4.18e-01

Pos logit contributions
ucks+0.051
eds+0.051
ara+0.050
 is+0.045
of+0.044
Neg logit contributions
 determined-0.059
 unsure-0.054
 thirsty-0.054
 hungry-0.054
 nervous-0.053
Token of
Token ID286
Feature activation+5.27e+00

Pos logit contributions
ucks+0.020
eds+0.020
ara+0.019
of+0.017
 is+0.017
Neg logit contributions
 determined-0.025
 unsure-0.023
 hungry-0.023
 thirsty-0.022
 nervous-0.022
Token's
Token ID338
Feature activation+9.30e+00

Pos logit contributions
eds+0.032
ara+0.032
ucks+0.031
of+0.028
 is+0.028
Neg logit contributions
 determined-0.038
 unsure-0.035
 hungry-0.034
 thirsty-0.034
 nervous-0.034
Token pet
Token ID4273
Feature activation+1.22e+00

Pos logit contributions
ara+0.474
ucks+0.466
eds+0.461
 is+0.430
icer+0.411
Neg logit contributions
 determined-0.492
 unsure-0.457
 thirsty-0.438
 hungry-0.435
 nervous-0.430
Token cat
Token ID3797
Feature activation+2.86e+00

Pos logit contributions
ucks+0.068
eds+0.066
ara+0.066
icer+0.059
of+0.058
Neg logit contributions
 determined-0.062
 hungry-0.057
 unsure-0.056
 thirsty-0.056
 nervous-0.055
Token,
Token ID11
Feature activation-2.58e+00

Pos logit contributions
ara+0.148
eds+0.148
ucks+0.144
 is+0.134
of+0.132
Neg logit contributions
 determined-0.152
 unsure-0.140
 thirsty-0.136
 nervous-0.134
 hungry-0.134
Token Wh
Token ID854
Feature activation+9.13e+00

Pos logit contributions
 determined+0.122
 unsure+0.109
 thirsty+0.106
 confident+0.105
 hungry+0.103
Neg logit contributions
ara-0.145
eds-0.141
ucks-0.138
 is-0.136
of-0.130
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.345
eds+0.281
 occasions+0.263
icer+0.258
ac+0.255
Neg logit contributions
 determined-0.563
 curious-0.535
 hungry-0.531
 unsure-0.529
dream-0.527
Tokeners
Token ID364
Feature activation+1.24e+01

Pos logit contributions
ucks+0.949
ara+0.877
eds+0.837
icer+0.786
atch+0.668
Neg logit contributions
 determined-1.969
 curious-1.800
 hungry-1.798
 unsure-1.782
 he-1.775
Token,
Token ID11
Feature activation-2.61e+00

Pos logit contributions
eds+0.645
ara+0.627
ucks+0.609
 is+0.585
of+0.583
Neg logit contributions
 determined-0.641
 unsure-0.591
 thirsty-0.575
 hungry-0.563
 confident-0.563
Token jumped
Token ID11687
Feature activation-4.21e+00

Pos logit contributions
 determined+0.131
 unsure+0.120
 thirsty+0.116
 nervous+0.114
 hungry+0.114
Neg logit contributions
eds-0.141
ara-0.141
ucks-0.138
 is-0.130
 us-0.126
Token up
Token ID510
Feature activation+2.86e+00

Pos logit contributions
 determined+0.240
 unsure+0.218
 thirsty+0.214
 confident+0.212
 hungry+0.206
Neg logit contributions
eds-0.199
ara-0.195
ucks-0.194
 us-0.182
 is-0.181
Token and
Token ID290
Feature activation+1.44e+00

Pos logit contributions
ara+0.105
ucks+0.105
eds+0.104
 is+0.102
 us+0.093
Neg logit contributions
 determined-0.181
 unsure-0.164
 thirsty-0.164
 confident-0.162
 curious-0.159
Token happier
Token ID23030
Feature activation-6.09e+00

Pos logit contributions
 determined+0.412
 unsure+0.373
chard+0.368
 barely+0.365
 curious+0.360
Neg logit contributions
ucks-0.298
ara-0.297
 is-0.288
 us-0.267
eds-0.260
Token.
Token ID13
Feature activation+9.72e-02

Pos logit contributions
 determined+0.364
 unsure+0.334
 thirsty+0.329
 hungry+0.329
 nervous+0.326
Neg logit contributions
ucks-0.280
ara-0.279
eds-0.278
 is-0.246
 us-0.240
Token Mr
Token ID1770
Feature activation+5.30e+00

Pos logit contributions
ara+0.006
ucks+0.006
eds+0.006
 is+0.005
of+0.005
Neg logit contributions
 determined-0.004
 unsure-0.004
 thirsty-0.004
 hungry-0.004
 confused-0.004
Token.
Token ID13
Feature activation+1.58e-01

Pos logit contributions
ucks+0.294
eds+0.291
ara+0.276
 occasions+0.264
of+0.256
Neg logit contributions
 determined-0.255
 hungry-0.250
 nervous-0.241
 thirsty-0.240
 unsure-0.238
Token Wh
Token ID854
Feature activation+9.03e+00

Pos logit contributions
ucks+0.008
eds+0.008
ara+0.008
of+0.007
ent+0.007
Neg logit contributions
 determined-0.008
 hungry-0.008
 thirsty-0.008
 unsure-0.008
 nervous-0.008
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.299
eds+0.271
ara+0.253
 occasions+0.246
ate+0.235
Neg logit contributions
 determined-0.618
 hungry-0.589
 nervous-0.582
 unsure-0.580
 thirsty-0.578
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+1.039
ara+0.954
eds+0.906
icer+0.868
atch+0.771
Neg logit contributions
 determined-1.892
 he-1.744
 confident-1.712
 hungry-1.687
 excited-1.687
Token went
Token ID1816
Feature activation-8.15e-01

Pos logit contributions
ara+0.673
eds+0.670
ucks+0.669
of+0.607
 is+0.602
Neg logit contributions
 determined-0.628
 unsure-0.577
 hungry-0.569
 thirsty-0.567
 nervous-0.560
Token outside
Token ID2354
Feature activation-4.21e+00

Pos logit contributions
 determined+0.050
 unsure+0.048
 thirsty+0.046
 confused+0.045
 nervous+0.044
Neg logit contributions
eds-0.036
ucks-0.035
ara-0.034
of-0.031
 is-0.030
Token and
Token ID290
Feature activation+1.37e+00

Pos logit contributions
 determined+0.253
 thirsty+0.236
 unsure+0.233
 confused+0.227
 hungry+0.226
Neg logit contributions
ucks-0.171
eds-0.171
ara-0.165
 is-0.161
of-0.153
Token when
Token ID618
Feature activation-4.42e+00

Pos logit contributions
eds+0.071
ara+0.068
ucks+0.066
 is+0.066
of+0.065
Neg logit contributions
 determined-0.067
 unsure-0.062
 nervous-0.059
 confused-0.058
dream-0.057
Token a
Token ID257
Feature activation-8.83e-01

Pos logit contributions
ucks+0.358
ara+0.353
eds+0.353
icer+0.322
of+0.320
Neg logit contributions
 determined-0.213
 unsure-0.188
 hungry-0.188
 thirsty-0.184
 nervous-0.183
Token quiet
Token ID5897
Feature activation-3.99e+00

Pos logit contributions
 determined+0.045
 unsure+0.044
dream+0.040
 confused+0.039
 thirsty+0.039
Neg logit contributions
ara-0.046
 is-0.042
eds-0.041
ucks-0.040
of-0.038
Token mouse
Token ID10211
Feature activation-3.11e+00

Pos logit contributions
 determined+0.245
 unsure+0.227
 hungry+0.220
 thirsty+0.220
 confused+0.216
Neg logit contributions
ara-0.181
eds-0.175
ucks-0.174
 is-0.156
of-0.152
Token named
Token ID3706
Feature activation-1.66e+00

Pos logit contributions
 determined+0.168
 unsure+0.154
 hungry+0.151
 thirsty+0.151
 nervous+0.150
Neg logit contributions
ucks-0.161
eds-0.160
ara-0.158
 is-0.141
of-0.140
Token Wh
Token ID854
Feature activation+9.13e+00

Pos logit contributions
 determined+0.077
 unsure+0.069
 hungry+0.068
 thirsty+0.068
 nervous+0.067
Neg logit contributions
ucks-0.100
eds-0.098
ara-0.098
of-0.088
icer-0.088
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.361
 occasions+0.297
ara+0.295
ac+0.268
 forms+0.266
Neg logit contributions
 determined-0.511
 curious-0.498
 hungry-0.497
 nervous-0.492
 searching-0.489
Tokeners
Token ID364
Feature activation+1.24e+01

Pos logit contributions
ucks+0.987
ara+0.948
icer+0.863
eds+0.863
 occasions+0.727
Neg logit contributions
 determined-1.884
 he-1.744
 curious-1.741
 hungry-1.729
 she-1.728
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
ucks+0.540
eds+0.537
ara+0.531
of+0.467
 is+0.467
Neg logit contributions
 determined-0.765
 unsure-0.706
 thirsty-0.695
 hungry-0.693
 nervous-0.689
Token Wh
Token ID854
Feature activation+9.08e+00

Pos logit contributions
ucks+0.013
ara+0.013
eds+0.013
of+0.012
 is+0.012
Neg logit contributions
 determined-0.013
 unsure-0.012
 hungry-0.012
 thirsty-0.012
 nervous-0.012
Tokenisk
Token ID1984
Feature activation+2.75e+01

Pos logit contributions
ucks+0.358
ara+0.328
 occasions+0.322
'."+0.270
 ends+0.269
Neg logit contributions
 determined-0.486
 afraid-0.471
 nervous-0.468
 fearful-0.468
 hungry-0.468
Tokeners
Token ID364
Feature activation+1.24e+01

Pos logit contributions
ucks+1.106
ara+1.086
icer+0.944
eds+0.925
atch+0.853
Neg logit contributions
 determined-1.734
 curious-1.606
 he-1.603
 excited-1.579
 she-1.564
Token
Token ID198
Feature activation+3.85e+00

Pos logit contributions
ucks+0.236
eds+0.235
ara+0.223
ases+0.214
ese+0.211
Neg logit contributions
 determined-0.172
 nervous-0.152
 unsure-0.152
 hungry-0.151
 excited-0.151
TokenOne
Token ID3198
Feature activation+6.85e+00

Pos logit contributions
ucks+0.185
eds+0.179
ara+0.170
ases+0.152
icer+0.151
Neg logit contributions
 determined-0.237
 hungry-0.210
 nervous-0.210
 unsure-0.209
 curious-0.208
Token day
Token ID1110
Feature activation-1.36e+00

Pos logit contributions
ucks+0.347
ara+0.343
eds+0.342
of+0.302
icer+0.299
Neg logit contributions
 determined-0.382
 unsure-0.352
 hungry-0.346
 thirsty-0.344
 nervous-0.340
Token,
Token ID11
Feature activation-2.59e+00

Pos logit contributions
 determined+0.070
 unsure+0.064
 thirsty+0.062
 hungry+0.061
 confused+0.060
Neg logit contributions
ara-0.075
ucks-0.074
eds-0.072
 is-0.068
of-0.067
Token Wh
Token ID854
Feature activation+9.03e+00

Pos logit contributions
 determined+0.165
 unsure+0.153
 thirsty+0.146
 delighted+0.146
 confused+0.145
Neg logit contributions
ara-0.108
ucks-0.100
 is-0.099
of-0.092
eds-0.092
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.356
 occasions+0.333
eds+0.304
ee+0.294
ators+0.291
Neg logit contributions
 hungry-0.515
 determined-0.500
 thirsty-0.497
 nervous-0.483
 confused-0.471
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+1.113
ara+1.035
eds+0.942
icer+0.937
 occasions+0.826
Neg logit contributions
 determined-1.782
 he-1.625
 curious-1.611
 she-1.593
 hungry-1.591
Token'
Token ID6
Feature activation+7.56e+00

Pos logit contributions
ucks+0.672
ara+0.666
eds+0.664
of+0.590
 is+0.588
Neg logit contributions
 determined-0.650
 unsure-0.593
 hungry-0.586
 thirsty-0.582
 nervous-0.576
Token owner
Token ID4870
Feature activation+1.51e+00

Pos logit contributions
ucks+0.380
eds+0.377
ara+0.370
icer+0.345
 is+0.330
Neg logit contributions
 determined-0.416
 hungry-0.392
 thirsty-0.380
 nervous-0.380
 curious-0.380
Token was
Token ID373
Feature activation+9.13e+00

Pos logit contributions
ucks+0.090
eds+0.089
ara+0.086
icer+0.080
of+0.078
Neg logit contributions
 determined-0.072
 hungry-0.066
 unsure-0.065
 thirsty-0.064
 nervous-0.064
Token drinking
Token ID7722
Feature activation-8.15e+00

Pos logit contributions
eds+0.503
ara+0.482
 us+0.466
ucks+0.463
 is+0.461
Neg logit contributions
 determined-0.382
 unsure-0.360
 desperate-0.325
 confident-0.322
 he-0.320
Token an
Token ID281
Feature activation+1.42e+00

Pos logit contributions
 determined+0.001
 unsure+0.001
 confused+0.001
 discouraged+0.001
 confident+0.001
Neg logit contributions
ara-0.001
eds-0.001
ucks-0.001
 is-0.001
 us-0.001
Token adorable
Token ID29012
Feature activation-2.09e+00

Pos logit contributions
ara+0.080
ucks+0.080
eds+0.077
 is+0.074
icer+0.074
Neg logit contributions
 determined-0.064
 unsure-0.058
 thirsty-0.056
dream-0.056
 hungry-0.056
Token cat
Token ID3797
Feature activation+2.86e+00

Pos logit contributions
 determined+0.124
 unsure+0.115
 thirsty+0.111
 hungry+0.110
 confused+0.110
Neg logit contributions
ara-0.098
ucks-0.095
eds-0.093
 is-0.086
icer-0.082
Token named
Token ID3706
Feature activation-1.80e+00

Pos logit contributions
eds+0.157
ara+0.157
ucks+0.156
 is+0.141
of+0.139
Neg logit contributions
 determined-0.146
 unsure-0.133
 hungry-0.130
 thirsty-0.129
 nervous-0.128
Token Wh
Token ID854
Feature activation+9.13e+00

Pos logit contributions
 determined+0.084
 hungry+0.077
 unsure+0.076
 thirsty+0.076
 nervous+0.075
Neg logit contributions
ucks-0.107
eds-0.105
ara-0.104
icer-0.094
of-0.093
Tokenisk
Token ID1984
Feature activation+2.76e+01

Pos logit contributions
ucks+0.375
ac+0.303
eds+0.298
 occasions+0.294
ara+0.283
Neg logit contributions
 determined-0.511
 hungry-0.494
 curious-0.491
dream-0.485
 thirsty-0.484
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+0.969
ara+0.900
eds+0.867
icer+0.808
 occasions+0.693
Neg logit contributions
 determined-1.907
 hungry-1.759
 she-1.758
 curious-1.757
 he-1.751
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
ucks+0.597
ara+0.587
eds+0.587
of+0.512
icer+0.511
Neg logit contributions
 determined-0.724
 unsure-0.666
 hungry-0.661
 thirsty-0.657
 nervous-0.650
Token Lily
Token ID20037
Feature activation+1.76e+00

Pos logit contributions
ucks+0.006
eds+0.006
ara+0.006
of+0.005
ent+0.005
Neg logit contributions
 determined-0.005
 hungry-0.004
 unsure-0.004
 thirsty-0.004
 nervous-0.004
Token and
Token ID290
Feature activation+1.52e+00

Pos logit contributions
ucks+0.114
eds+0.109
ara+0.106
icer+0.100
ases+0.099
Neg logit contributions
 determined-0.079
 hungry-0.071
 curious-0.071
 thirsty-0.070
 nervous-0.069
Token Wh
Token ID854
Feature activation+9.12e+00

Pos logit contributions
ucks+0.079
eds+0.078
ara+0.077
icer+0.068
of+0.068
Neg logit contributions
 determined-0.083
 hungry-0.076
 unsure-0.076
 thirsty-0.075
 nervous-0.074
Token,
Token ID11
Feature activation-2.42e+00

Pos logit contributions
 determined+0.064
 unsure+0.058
 hungry+0.056
 thirsty+0.056
 nervous+0.055
Neg logit contributions
ucks-0.082
ara-0.081
eds-0.081
 is-0.073
of-0.073
Token while
Token ID981
Feature activation-7.75e+00

Pos logit contributions
 determined+0.130
 unsure+0.120
 delighted+0.113
 confused+0.111
 confident+0.111
Neg logit contributions
ara-0.127
ucks-0.122
 is-0.121
eds-0.115
of-0.114
Token walking
Token ID6155
Feature activation-7.86e+00

Pos logit contributions
 determined+0.395
 unsure+0.375
 delighted+0.345
 confused+0.343
 confident+0.341
Neg logit contributions
ara-0.405
ucks-0.404
eds-0.402
 is-0.385
 us-0.362
Token,
Token ID11
Feature activation-2.46e+00

Pos logit contributions
 determined+0.394
 unsure+0.357
 thirsty+0.345
 hungry+0.344
 confused+0.343
Neg logit contributions
ucks-0.445
eds-0.436
ara-0.435
 is-0.401
of-0.396
Token Wh
Token ID854
Feature activation+9.05e+00

Pos logit contributions
 determined+0.133
 unsure+0.124
 delighted+0.119
 thrilled+0.114
 pleased+0.114
Neg logit contributions
ara-0.124
 is-0.123
ucks-0.112
of-0.110
icer-0.109
Tokenisk
Token ID1984
Feature activation+2.75e+01

Pos logit contributions
ucks+0.332
 occasions+0.328
ators+0.276
eds+0.276
ac+0.272
Neg logit contributions
 hungry-0.537
 thirsty-0.523
 determined-0.516
 nervous-0.510
 tired-0.498
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+1.013
ara+0.961
eds+0.894
icer+0.871
 occasions+0.793
Neg logit contributions
 determined-1.878
 curious-1.729
 he-1.720
 hungry-1.713
 excited-1.685
Token met
Token ID1138
Feature activation+5.13e+00

Pos logit contributions
ara+0.636
ucks+0.626
eds+0.625
 is+0.565
of+0.559
Neg logit contributions
 determined-0.678
 unsure-0.624
 hungry-0.607
 thirsty-0.603
 nervous-0.597
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
ucks+0.361
eds+0.360
ara+0.356
icer+0.330
 occasions+0.329
Neg logit contributions
 determined-0.182
 hungry-0.166
 unsure-0.162
 curious-0.159
 nervous-0.159
Token bird
Token ID6512
Feature activation+5.91e-01

Pos logit contributions
 determined+0.056
 unsure+0.054
dream+0.049
 confused+0.049
 thirsty+0.048
Neg logit contributions
ara-0.054
 is-0.048
eds-0.048
ucks-0.048
of-0.044
Token named
Token ID3706
Feature activation-1.96e+00

Pos logit contributions
ucks+0.035
ara+0.034
eds+0.034
icer+0.031
of+0.030
Neg logit contributions
 determined-0.028
 hungry-0.026
 unsure-0.026
 thirsty-0.025
 nervous-0.025
Token nos
Token ID43630
Feature activation-4.40e+00

Pos logit contributions
ara+0.447
ucks+0.418
eds+0.416
 is+0.393
of+0.376
Neg logit contributions
 determined-0.523
 unsure-0.491
 thirsty-0.478
 hungry-0.473
 confident-0.460
Tokeny
Token ID88
Feature activation+1.28e+01

Pos logit contributions
 determined+0.299
 nervous+0.298
 hungry+0.297
 unsure+0.295
 confused+0.295
Neg logit contributions
ucks-0.135
eds-0.130
ara-0.128
 occasions-0.118
icer-0.116
Token cat
Token ID3797
Feature activation+2.84e+00

Pos logit contributions
ara+0.583
eds+0.571
ucks+0.565
 is+0.512
of+0.494
Neg logit contributions
 determined-0.770
 unsure-0.708
 thirsty-0.698
 hungry-0.694
 nervous-0.679
Token,
Token ID11
Feature activation-2.42e+00

Pos logit contributions
ara+0.148
eds+0.146
 is+0.141
ucks+0.139
of+0.131
Neg logit contributions
 determined-0.149
 unsure-0.136
 thirsty-0.136
 hungry-0.132
 nervous-0.132
Token Wh
Token ID854
Feature activation+9.21e+00

Pos logit contributions
 determined+0.133
 confident+0.120
 unsure+0.118
 thirsty+0.116
 hungry+0.114
Neg logit contributions
ara-0.113
 is-0.109
eds-0.109
ucks-0.107
 us-0.106
Tokenisk
Token ID1984
Feature activation+2.75e+01

Pos logit contributions
ucks+0.345
eds+0.286
ara+0.261
 occasions+0.260
icer+0.252
Neg logit contributions
 determined-0.554
 unsure-0.533
 hungry-0.528
 curious-0.527
 nervous-0.520
Tokeners
Token ID364
Feature activation+1.24e+01

Pos logit contributions
ucks+0.913
ara+0.861
eds+0.820
icer+0.757
of+0.648
Neg logit contributions
 determined-2.011
 curious-1.837
 hungry-1.834
 unsure-1.830
 he-1.806
Token,
Token ID11
Feature activation-2.43e+00

Pos logit contributions
ara+0.601
eds+0.595
 is+0.580
ucks+0.569
of+0.554
Neg logit contributions
 determined-0.661
 unsure-0.605
 thirsty-0.603
 hungry-0.589
 confident-0.583
Token saw
Token ID2497
Feature activation+2.28e+00

Pos logit contributions
 determined+0.136
 unsure+0.126
 confident+0.123
 thirsty+0.123
 nervous+0.121
Neg logit contributions
eds-0.115
ara-0.114
ucks-0.112
 is-0.108
 us-0.105
Token her
Token ID607
Feature activation+1.62e+00

Pos logit contributions
ucks+0.128
eds+0.127
ara+0.126
 is+0.115
 us+0.113
Neg logit contributions
 determined-0.118
 unsure-0.105
 thirsty-0.103
 hungry-0.102
 nervous-0.101
Token searching
Token ID10342
Feature activation-1.68e+01

Pos logit contributions
ara+0.074
eds+0.073
ucks+0.072
 is+0.066
of+0.063
Neg logit contributions
 determined-0.096
 unsure-0.088
 thirsty-0.087
 hungry-0.087
 nervous-0.084
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+0.969
ara+0.900
eds+0.867
icer+0.808
 occasions+0.693
Neg logit contributions
 determined-1.907
 hungry-1.759
 she-1.758
 curious-1.757
 he-1.751
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
ucks+0.597
ara+0.587
eds+0.587
of+0.512
icer+0.511
Neg logit contributions
 determined-0.724
 unsure-0.666
 hungry-0.661
 thirsty-0.657
 nervous-0.650
Token Lily
Token ID20037
Feature activation+1.76e+00

Pos logit contributions
ucks+0.006
eds+0.006
ara+0.006
of+0.005
ent+0.005
Neg logit contributions
 determined-0.005
 hungry-0.004
 unsure-0.004
 thirsty-0.004
 nervous-0.004
Token and
Token ID290
Feature activation+1.52e+00

Pos logit contributions
ucks+0.114
eds+0.109
ara+0.106
icer+0.100
ases+0.099
Neg logit contributions
 determined-0.079
 hungry-0.071
 curious-0.071
 thirsty-0.070
 nervous-0.069
Token Wh
Token ID854
Feature activation+9.12e+00

Pos logit contributions
ucks+0.079
eds+0.078
ara+0.077
icer+0.068
of+0.068
Neg logit contributions
 determined-0.083
 hungry-0.076
 unsure-0.076
 thirsty-0.075
 nervous-0.074
Tokenisk
Token ID1984
Feature activation+2.75e+01

Pos logit contributions
ucks+0.350
ara+0.314
eds+0.300
 occasions+0.286
ac+0.276
Neg logit contributions
 determined-0.529
 hungry-0.527
 thirsty-0.522
 nervous-0.518
 unsure-0.503
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ara+1.008
ucks+1.004
eds+0.878
icer+0.855
atch+0.754
Neg logit contributions
 determined-1.856
 hungry-1.723
 he-1.714
 curious-1.712
 excited-1.710
Token loved
Token ID6151
Feature activation+9.51e-01

Pos logit contributions
ara+0.589
eds+0.581
ucks+0.580
 is+0.534
of+0.529
Neg logit contributions
 determined-0.699
 unsure-0.654
 hungry-0.635
 thirsty-0.631
 nervous-0.627
Token to
Token ID284
Feature activation-1.18e+00

Pos logit contributions
eds+0.051
ucks+0.049
ara+0.047
 is+0.046
 us+0.045
Neg logit contributions
 determined-0.050
 unsure-0.048
 confused-0.044
 thirsty-0.043
 nervous-0.043
Token play
Token ID711
Feature activation+4.00e+00

Pos logit contributions
 determined+0.076
 hungry+0.072
 thirsty+0.071
 nervous+0.070
 unsure+0.070
Neg logit contributions
ucks-0.049
eds-0.048
ara-0.048
icer-0.042
ac-0.040
Token with
Token ID351
Feature activation+3.76e+00

Pos logit contributions
ucks+0.233
eds+0.229
ara+0.229
of+0.207
icer+0.206
Neg logit contributions
 determined-0.192
 hungry-0.177
 unsure-0.172
 nervous-0.172
 thirsty-0.172
Token.
Token ID13
Feature activation+3.88e-02

Pos logit contributions
 determined+0.063
 unsure+0.058
 thirsty+0.057
 hungry+0.056
 confused+0.056
Neg logit contributions
eds-0.053
ucks-0.053
ara-0.052
 is-0.049
 us-0.047
Token One
Token ID1881
Feature activation-1.12e+00

Pos logit contributions
ucks+0.002
eds+0.002
ara+0.002
ent+0.002
of+0.002
Neg logit contributions
 determined-0.002
 hungry-0.002
 unsure-0.002
 thirsty-0.002
 nervous-0.002
Token day
Token ID1110
Feature activation-1.32e+00

Pos logit contributions
 determined+0.063
 unsure+0.057
 hungry+0.057
 thirsty+0.057
 nervous+0.056
Neg logit contributions
ucks-0.057
eds-0.056
ara-0.055
icer-0.049
of-0.048
Token,
Token ID11
Feature activation-2.51e+00

Pos logit contributions
 determined+0.064
 unsure+0.058
 thirsty+0.057
 hungry+0.056
 confused+0.056
Neg logit contributions
ucks-0.075
ara-0.075
eds-0.074
 is-0.068
of-0.067
Token Wh
Token ID854
Feature activation+9.08e+00

Pos logit contributions
 determined+0.161
 unsure+0.149
 delighted+0.146
 confident+0.141
 thrilled+0.141
Neg logit contributions
ara-0.102
 is-0.097
ucks-0.095
eds-0.088
w-0.088
Tokenisk
Token ID1984
Feature activation+2.75e+01

Pos logit contributions
ucks+0.379
 occasions+0.361
ators+0.309
ara+0.305
eds+0.301
Neg logit contributions
 hungry-0.482
 thirsty-0.466
 determined-0.463
 nervous-0.445
dream-0.443
Tokeners
Token ID364
Feature activation+1.25e+01

Pos logit contributions
ucks+1.122
ara+1.055
icer+0.947
eds+0.939
of+0.844
Neg logit contributions
 determined-1.775
 he-1.631
 curious-1.600
 excited-1.579
 hungry-1.574
Token received
Token ID2722
Feature activation+5.17e+00

Pos logit contributions
ucks+0.645
ara+0.642
eds+0.638
of+0.564
 is+0.563
Neg logit contributions
 determined-0.672
 unsure-0.615
 hungry-0.608
 thirsty-0.604
 nervous-0.598
Token a
Token ID257
Feature activation-1.11e+00

Pos logit contributions
ucks+0.307
ara+0.302
eds+0.299
 is+0.273
 us+0.273
Neg logit contributions
 determined-0.247
 unsure-0.218
 thirsty-0.213
 hungry-0.209
 nervous-0.208
Token letter
Token ID3850
Feature activation+4.29e-01

Pos logit contributions
 determined+0.063
 unsure+0.059
 thirsty+0.057
 hungry+0.056
 confused+0.055
Neg logit contributions
ara-0.051
eds-0.049
ucks-0.048
 is-0.048
of-0.044
Token in
Token ID287
Feature activation+9.79e-01

Pos logit contributions
ucks+0.019
ara+0.019
eds+0.019
 is+0.018
of+0.017
Neg logit contributions
 determined-0.026
 thirsty-0.024
 unsure-0.023
 hungry-0.023
 confused-0.022
Token of
Token ID286
Feature activation+5.27e+00

Pos logit contributions
ucks+0.020
eds+0.020
ara+0.019
of+0.017
 is+0.017
Neg logit contributions
 determined-0.025
 unsure-0.023
 hungry-0.023
 thirsty-0.022
 nervous-0.022
Token the
Token ID262
Feature activation+3.66e+00

Pos logit contributions
ucks+0.243
eds+0.240
ara+0.240
 us+0.212
of+0.211
Neg logit contributions
 determined-0.314
 unsure-0.289
 thirsty-0.285
 hungry-0.284
 nervous-0.282
Token camera
Token ID4676
Feature activation-9.51e-01

Pos logit contributions
ucks+0.192
eds+0.183
ara+0.181
icer+0.161
ac+0.161
Neg logit contributions
 determined-0.208
 hungry-0.198
 thirsty-0.194
 nervous-0.192
 curious-0.190
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
 determined+0.060
 unsure+0.055
 thirsty+0.055
 hungry+0.054
 confused+0.053
Neg logit contributions
ara-0.040
eds-0.039
ucks-0.039
 is-0.036
of-0.035
Token Wh
Token ID854
Feature activation+9.12e+00

Pos logit contributions
ara+0.008
ucks+0.008
 is+0.008
eds+0.007
 us+0.007
Neg logit contributions
 determined-0.010
 unsure-0.009
 confident-0.009
 thirsty-0.009
dream-0.009
Tokenisk
Token ID1984
Feature activation+2.75e+01

Pos logit contributions
ucks+0.361
 occasions+0.324
eds+0.308
ac+0.282
ate+0.281
Neg logit contributions
 determined-0.528
 unsure-0.494
 hungry-0.493
 thirsty-0.482
 nervous-0.477
Tokeners
Token ID364
Feature activation+1.24e+01

Pos logit contributions
ucks+1.120
ara+1.035
eds+0.959
icer+0.955
 occasions+0.849
Neg logit contributions
 determined-1.787
 he-1.598
 curious-1.585
 excited-1.569
 hungry-1.562
Token wanted
Token ID2227
Feature activation-4.78e+00

Pos logit contributions
ara+0.643
eds+0.635
ucks+0.626
 is+0.582
of+0.578
Neg logit contributions
 determined-0.640
 unsure-0.587
 thirsty-0.585
 hungry-0.584
 nervous-0.572
Token to
Token ID284
Feature activation-1.20e+00

Pos logit contributions
 determined+0.234
 unsure+0.214
 thirsty+0.207
 hungry+0.204
 nervous+0.203
Neg logit contributions
ara-0.266
ucks-0.266
eds-0.265
 is-0.244
 us-0.244
Token be
Token ID307
Feature activation+1.29e+01

Pos logit contributions
 determined+0.064
 nervous+0.059
 hungry+0.059
 thirsty+0.058
 unsure+0.058
Neg logit contributions
ucks-0.065
eds-0.064
ara-0.062
icer-0.060
ac-0.056
Token in
Token ID287
Feature activation+9.83e-01

Pos logit contributions
eds+0.731
ucks+0.709
 us+0.688
ac+0.681
 is+0.679
Neg logit contributions
 determined-0.512
dream-0.463
 unsure-0.463
 desperate-0.445
 thirsty-0.423
Token a
Token ID257
Feature activation-1.44e+00

Pos logit contributions
ara+0.492
ucks+0.472
eds+0.456
 us+0.438
 is+0.437
Neg logit contributions
 determined-0.442
 unsure-0.410
 confused-0.388
 nervous-0.379
 confident-0.372
Token lazy
Token ID16931
Feature activation-7.74e+00

Pos logit contributions
 determined+0.076
 unsure+0.074
dream+0.067
 discouraged+0.066
 confused+0.066
Neg logit contributions
ara-0.077
ucks-0.067
eds-0.067
 is-0.065
icer-0.062
Token cat
Token ID3797
Feature activation+2.77e+00

Pos logit contributions
 determined+0.498
 unsure+0.466
 confused+0.436
 thirsty+0.436
 desperate+0.433
Neg logit contributions
ara-0.340
eds-0.306
ucks-0.302
 is-0.289
 us-0.269
Token named
Token ID3706
Feature activation-1.95e+00

Pos logit contributions
ucks+0.165
ara+0.162
eds+0.161
icer+0.145
of+0.145
Neg logit contributions
 determined-0.130
 hungry-0.117
 unsure-0.117
 thirsty-0.116
 nervous-0.114
Token Wh
Token ID854
Feature activation+9.00e+00

Pos logit contributions
 determined+0.087
 hungry+0.079
 thirsty+0.078
 unsure+0.078
 nervous+0.077
Neg logit contributions
ucks-0.122
eds-0.119
ara-0.118
icer-0.107
ent-0.106
Tokenisk
Token ID1984
Feature activation+2.75e+01

Pos logit contributions
ucks+0.366
 occasions+0.297
ara+0.277
ac+0.276
eds+0.272
Neg logit contributions
 determined-0.520
 hungry-0.497
dream-0.495
 curious-0.489
 thirsty-0.487
Tokeners
Token ID364
Feature activation+1.24e+01

Pos logit contributions
ucks+1.056
ara+0.978
eds+0.909
icer+0.865
 occasions+0.790
Neg logit contributions
 determined-1.826
 he-1.713
 hungry-1.669
 curious-1.666
 she-1.661
Token.
Token ID13
Feature activation-5.71e-02

Pos logit contributions
ucks+0.680
eds+0.657
ara+0.650
icer+0.580
of+0.573
Neg logit contributions
 determined-0.651
 hungry-0.602
 unsure-0.593
 thirsty-0.589
 curious-0.587
Token Wh
Token ID854
Feature activation+9.01e+00

Pos logit contributions
 determined+0.003
 unsure+0.003
 hungry+0.003
 thirsty+0.003
 nervous+0.003
Neg logit contributions
ucks-0.003
ara-0.003
eds-0.003
icer-0.002
of-0.002
Tokenisk
Token ID1984
Feature activation+2.75e+01

Pos logit contributions
ucks+0.366
 occasions+0.335
ara+0.310
eds+0.291
of+0.287
Neg logit contributions
 determined-0.490
 hungry-0.490
 thirsty-0.479
 nervous-0.465
 confused-0.457
Tokeners
Token ID364
Feature activation+1.24e+01

Pos logit contributions
ucks+1.086
ara+1.049
eds+0.912
icer+0.888
 occasions+0.836
Neg logit contributions
 determined-1.772
 he-1.647
 curious-1.635
 excited-1.619
 hungry-1.611
Token a
Token ID257
Feature activation-1.12e+00

Pos logit contributions
ara+0.478
eds+0.466
 is+0.463
ucks+0.461
 us+0.432
Neg logit contributions
 determined-0.446
 unsure-0.418
 confused-0.391
 thirsty-0.390
 hungry-0.389
Token quiet
Token ID5897
Feature activation-4.09e+00

Pos logit contributions
 determined+0.056
 unsure+0.055
 confused+0.049
dream+0.049
 thirsty+0.048
Neg logit contributions
ara-0.061
eds-0.057
 is-0.055
ucks-0.055
 us-0.051
Token cat
Token ID3797
Feature activation+2.87e+00

Pos logit contributions
 determined+0.237
 unsure+0.218
 hungry+0.216
 thirsty+0.214
 nervous+0.212
Neg logit contributions
ucks-0.197
ara-0.195
eds-0.194
of-0.169
 is-0.169
Token named
Token ID3706
Feature activation-1.85e+00

Pos logit contributions
ucks+0.166
eds+0.163
ara+0.162
icer+0.145
of+0.145
Neg logit contributions
 determined-0.140
 unsure-0.127
 hungry-0.127
 thirsty-0.125
 nervous-0.124
Token Wh
Token ID854
Feature activation+9.08e+00

Pos logit contributions
 determined+0.088
 hungry+0.080
 unsure+0.080
 thirsty+0.079
 nervous+0.078
Neg logit contributions
ucks-0.109
eds-0.107
ara-0.106
icer-0.096
of-0.095
Tokenisk
Token ID1984
Feature activation+2.75e+01

Pos logit contributions
ucks+0.341
 occasions+0.286
ac+0.271
ara+0.271
eds+0.258
Neg logit contributions
 determined-0.548
 curious-0.533
 hungry-0.528
 nervous-0.519
dream-0.516
Tokeners
Token ID364
Feature activation+1.24e+01

Pos logit contributions
ucks+0.980
ara+0.914
eds+0.864
icer+0.836
 occasions+0.733
Neg logit contributions
 determined-1.917
 curious-1.765
 hungry-1.764
 he-1.760
 she-1.732
Token who
Token ID508
Feature activation+1.22e+00

Pos logit contributions
ucks+0.593
eds+0.585
ara+0.582
of+0.508
icer+0.506
Neg logit contributions
 determined-0.720
 unsure-0.663
 hungry-0.657
 thirsty-0.654
 nervous-0.647
Token liked
Token ID8288
Feature activation+9.87e-01

Pos logit contributions
ara+0.057
eds+0.057
ucks+0.057
 is+0.051
of+0.050
Neg logit contributions
 determined-0.071
 unsure-0.066
 hungry-0.065
 thirsty-0.065
 nervous-0.064
Token to
Token ID284
Feature activation-1.30e+00

Pos logit contributions
eds+0.056
ucks+0.055
ara+0.054
 us+0.051
 is+0.051
Neg logit contributions
 determined-0.048
 unsure-0.045
 thirsty-0.042
 confused-0.042
 hungry-0.042
Token examine
Token ID10716
Feature activation+6.38e+00

Pos logit contributions
 determined+0.073
 hungry+0.068
 thirsty+0.067
 nervous+0.067
 unsure+0.066
Neg logit contributions
ucks-0.067
ara-0.064
eds-0.064
icer-0.059
ac-0.056
Token still
Token ID991
Feature activation-1.07e+01

Pos logit contributions
 determined+0.104
 unsure+0.095
 hungry+0.095
 thirsty+0.094
 nervous+0.092
Neg logit contributions
ara-0.099
eds-0.099
ucks-0.099
of-0.088
 is-0.087
Token stubborn
Token ID24444
Feature activation-1.39e+01

Pos logit contributions
 determined+0.530
 unsure+0.480
 hungry+0.476
 thirsty+0.473
 nervous+0.467
Neg logit contributions
ucks-0.601
eds-0.596
ara-0.592
 is-0.530
of-0.528
Token but
Token ID475
Feature activation-9.44e+00

Pos logit contributions
 determined+0.707
 hungry+0.647
 unsure+0.644
 thirsty+0.634
 nervous+0.633
Neg logit contributions
ucks-0.770
eds-0.749
ara-0.745
icer-0.680
of-0.662
Token he
Token ID339
Feature activation-1.60e+01

Pos logit contributions
 determined+0.359
 unsure+0.319
 hungry+0.314
 thirsty+0.312
 nervous+0.311
Neg logit contributions
ara-0.617
ucks-0.602
eds-0.602
 is-0.575
of-0.563
Token was
Token ID373
Feature activation+6.00e+00

Pos logit contributions
 determined+0.842
 unsure+0.763
 hungry+0.759
 thirsty+0.750
 nervous+0.744
Neg logit contributions
ucks-0.871
eds-0.852
ara-0.843
icer-0.755
of-0.747
Token determined
Token ID5295
Feature activation-2.45e+01

Pos logit contributions
 us+0.355
 is+0.352
eds+0.350
ac+0.332
ara+0.326
Neg logit contributions
dream-0.186
 thirsty-0.168
 determined-0.166
 unsure-0.161
 hungry-0.159
Token to
Token ID284
Feature activation-3.79e+00

Pos logit contributions
 determined+1.148
 unsure+1.037
 hungry+1.033
 thirsty+1.025
 nervous+1.008
Neg logit contributions
ucks-1.439
eds-1.426
ara-1.422
of-1.279
 is-1.275
Token figure
Token ID3785
Feature activation-8.57e+00

Pos logit contributions
 determined+0.229
 unsure+0.209
 hungry+0.208
 thirsty+0.206
 nervous+0.205
Neg logit contributions
ucks-0.181
eds-0.175
ara-0.175
icer-0.155
of-0.150
Token it
Token ID340
Feature activation-7.77e+00

Pos logit contributions
 determined+0.388
 unsure+0.350
 hungry+0.343
 nervous+0.338
 thirsty+0.337
Neg logit contributions
ucks-0.525
ara-0.518
eds-0.515
icer-0.468
of-0.459
Token all
Token ID477
Feature activation-4.24e+00

Pos logit contributions
 determined+0.426
 thirsty+0.393
 hungry+0.393
 unsure+0.391
 nervous+0.383
Neg logit contributions
eds-0.384
ucks-0.383
ara-0.378
 is-0.344
of-0.340
Token out
Token ID503
Feature activation-1.83e+00

Pos logit contributions
 determined+0.228
 unsure+0.208
 hungry+0.204
 nervous+0.202
 thirsty+0.201
Neg logit contributions
ucks-0.234
ara-0.228
eds-0.221
icer-0.202
of-0.193
TokenBut
Token ID1537
Feature activation-6.85e+00

Pos logit contributions
 determined+0.207
 unsure+0.191
 thirsty+0.186
 hungry+0.186
 nervous+0.183
Neg logit contributions
ucks-0.226
ara-0.225
eds-0.225
 is-0.204
 us-0.202
Token even
Token ID772
Feature activation-8.32e+00

Pos logit contributions
 determined+0.291
 unsure+0.259
 hungry+0.257
 thirsty+0.254
 nervous+0.251
Neg logit contributions
ucks-0.431
ara-0.428
eds-0.426
of-0.387
 is-0.385
Token though
Token ID996
Feature activation-9.66e+00

Pos logit contributions
 determined+0.361
 thirsty+0.321
 hungry+0.320
 unsure+0.317
 nervous+0.314
Neg logit contributions
ara-0.486
eds-0.473
ucks-0.466
 is-0.454
of-0.445
Token Jack
Token ID3619
Feature activation-9.34e+00

Pos logit contributions
 determined+0.338
 unsure+0.294
 hungry+0.290
 thirsty+0.287
 nervous+0.283
Neg logit contributions
ucks-0.678
ara-0.675
eds-0.672
 is-0.619
of-0.616
Token was
Token ID373
Feature activation+5.68e+00

Pos logit contributions
 determined+0.527
 unsure+0.480
 hungry+0.476
 confused+0.471
 nervous+0.469
Neg logit contributions
ucks-0.508
eds-0.474
ara-0.449
 occasions-0.410
icer-0.409
Token determined
Token ID5295
Feature activation-2.44e+01

Pos logit contributions
 is+0.325
eds+0.324
 us+0.318
ara+0.303
ac+0.294
Neg logit contributions
 determined-0.197
 unsure-0.186
dream-0.185
 thirsty-0.177
 nervous-0.171
Token,
Token ID11
Feature activation-5.41e+00

Pos logit contributions
 determined+1.197
 unsure+1.083
 hungry+1.077
 thirsty+1.067
 nervous+1.054
Neg logit contributions
ucks-1.395
eds-1.373
ara-1.370
of-1.223
icer-1.222
Token he
Token ID339
Feature activation-1.53e+01

Pos logit contributions
 determined+0.143
 thirsty+0.128
 nervous+0.122
 hungry+0.120
 confused+0.120
Neg logit contributions
 is-0.378
ara-0.371
 us-0.369
eds-0.350
 occasions-0.349
Token hit
Token ID2277
Feature activation-2.10e+00

Pos logit contributions
 determined+0.745
 thirsty+0.693
 hungry+0.690
 unsure+0.685
 nervous+0.672
Neg logit contributions
ara-0.830
eds-0.811
ucks-0.798
 is-0.762
of-0.742
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
 determined+0.097
 unsure+0.087
 hungry+0.086
 thirsty+0.085
 nervous+0.084
Neg logit contributions
ucks-0.126
eds-0.124
ara-0.124
of-0.111
icer-0.111
Token ball
Token ID2613
Feature activation+7.02e-01

Pos logit contributions
ucks+0.066
eds+0.064
ara+0.064
of+0.057
icer+0.057
Neg logit contributions
 determined-0.066
 unsure-0.060
 hungry-0.060
 thirsty-0.059
 nervous-0.059
Token sunny
Token ID27737
Feature activation-3.11e+00

Pos logit contributions
eds+0.013
ucks+0.012
 us+0.012
ara+0.011
 is+0.011
Neg logit contributions
 determined-0.019
 unsure-0.017
 thirsty-0.017
 confident-0.016
dream-0.016
Token day
Token ID1110
Feature activation-5.48e+00

Pos logit contributions
 determined+0.155
 hungry+0.146
 thirsty+0.146
 nervous+0.139
 curious+0.139
Neg logit contributions
ucks-0.174
eds-0.165
ara-0.165
of-0.150
icer-0.150
Token,
Token ID11
Feature activation-6.52e+00

Pos logit contributions
 determined+0.285
 unsure+0.262
 hungry+0.253
 thirsty+0.252
 nervous+0.252
Neg logit contributions
ara-0.290
eds-0.288
ucks-0.286
 is-0.258
 us-0.258
Token he
Token ID339
Feature activation-1.61e+01

Pos logit contributions
 determined+0.284
 unsure+0.257
 hungry+0.248
 thirsty+0.246
 nervous+0.245
Neg logit contributions
ara-0.401
ucks-0.392
eds-0.390
of-0.361
 is-0.359
Token was
Token ID373
Feature activation+5.95e+00

Pos logit contributions
 determined+0.866
 hungry+0.786
 unsure+0.781
 thirsty+0.777
 nervous+0.769
Neg logit contributions
ucks-0.863
eds-0.836
ara-0.826
icer-0.743
of-0.731
Token determined
Token ID5295
Feature activation-2.43e+01

Pos logit contributions
eds+0.382
 us+0.359
 is+0.358
ara+0.353
ucks+0.348
Neg logit contributions
 determined-0.164
 unsure-0.158
dream-0.148
 confident-0.139
 confused-0.135
Token to
Token ID284
Feature activation-3.78e+00

Pos logit contributions
 determined+1.107
 unsure+1.009
 hungry+1.001
 thirsty+0.997
 nervous+0.977
Neg logit contributions
eds-1.431
ucks-1.414
ara-1.413
 is-1.305
of-1.287
Token reach
Token ID3151
Feature activation-1.85e+00

Pos logit contributions
 determined+0.229
 unsure+0.210
 hungry+0.210
 thirsty+0.208
 nervous+0.206
Neg logit contributions
ucks-0.176
eds-0.171
ara-0.171
icer-0.149
of-0.147
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
 determined+0.090
 unsure+0.081
 hungry+0.081
 thirsty+0.080
 nervous+0.079
Neg logit contributions
ucks-0.107
ara-0.105
eds-0.105
icer-0.094
of-0.094
Token top
Token ID1353
Feature activation-4.74e+00

Pos logit contributions
ucks+0.061
eds+0.060
ara+0.060
of+0.053
icer+0.053
Neg logit contributions
 determined-0.059
 hungry-0.053
 unsure-0.053
 thirsty-0.053
 nervous-0.052
Token of
Token ID286
Feature activation+3.25e+00

Pos logit contributions
 determined+0.267
 unsure+0.245
 hungry+0.244
 thirsty+0.242
 nervous+0.239
Neg logit contributions
ucks-0.233
eds-0.230
ara-0.230
of-0.204
 is-0.202
Token she
Token ID673
Feature activation-1.73e+01

Pos logit contributions
 determined+1.044
 unsure+0.964
 thirsty+0.954
 hungry+0.945
 confused+0.926
Neg logit contributions
eds-0.802
ucks-0.787
ara-0.784
 is-0.714
 us-0.701
Token was
Token ID373
Feature activation+4.45e+00

Pos logit contributions
 determined+0.864
 hungry+0.785
 unsure+0.778
 thirsty+0.774
 nervous+0.764
Neg logit contributions
ucks-0.995
eds-0.972
ara-0.953
icer-0.873
of-0.852
Token brave
Token ID14802
Feature activation-1.50e+01

Pos logit contributions
eds+0.281
 us+0.271
ucks+0.266
ara+0.265
 is+0.264
Neg logit contributions
 determined-0.146
 unsure-0.132
dream-0.125
 confident-0.123
 nervous-0.121
Token and
Token ID290
Feature activation-2.07e+00

Pos logit contributions
 determined+0.901
 thirsty+0.849
 unsure+0.825
 hungry+0.809
 confused+0.801
Neg logit contributions
ara-0.646
ucks-0.614
eds-0.608
 is-0.598
 us-0.586
Token was
Token ID373
Feature activation+5.75e+00

Pos logit contributions
dream+0.068
 determined+0.061
 unsure+0.061
 thirsty+0.058
 discouraged+0.055
Neg logit contributions
 is-0.128
 will-0.116
 us-0.114
eds-0.114
ara-0.112
Token determined
Token ID5295
Feature activation-2.43e+01

Pos logit contributions
eds+0.341
 is+0.334
 us+0.327
ara+0.327
ucks+0.321
Neg logit contributions
 determined-0.194
 unsure-0.185
 thirsty-0.184
dream-0.180
 confused-0.178
Token to
Token ID284
Feature activation-3.86e+00

Pos logit contributions
 determined+1.082
 unsure+0.975
 hungry+0.970
 thirsty+0.964
 nervous+0.945
Neg logit contributions
ucks-1.472
eds-1.466
ara-1.460
 is-1.327
of-1.324
Token reach
Token ID3151
Feature activation-1.76e+00

Pos logit contributions
 determined+0.234
 unsure+0.215
 hungry+0.214
 thirsty+0.212
 nervous+0.211
Neg logit contributions
ucks-0.180
ara-0.175
eds-0.175
icer-0.152
of-0.150
Token the
Token ID262
Feature activation+1.01e+00

Pos logit contributions
 determined+0.087
 unsure+0.079
 hungry+0.078
 thirsty+0.077
 nervous+0.076
Neg logit contributions
ucks-0.101
ara-0.099
eds-0.099
icer-0.088
of-0.088
Token other
Token ID584
Feature activation+3.52e+00

Pos logit contributions
ucks+0.054
eds+0.052
ara+0.052
of+0.046
icer+0.046
Neg logit contributions
 determined-0.055
 hungry-0.050
 unsure-0.050
 thirsty-0.049
 nervous-0.049
Token side
Token ID1735
Feature activation+5.09e+00

Pos logit contributions
ucks+0.174
eds+0.163
ara+0.160
icer+0.138
ate+0.137
Neg logit contributions
 determined-0.211
 hungry-0.198
 curious-0.195
 nervous-0.194
 unsure-0.194
Token scared
Token ID12008
Feature activation-2.04e+01

Pos logit contributions
 determined+0.291
 unsure+0.256
 thirsty+0.249
 hungry+0.249
 curious+0.248
Neg logit contributions
ucks-0.327
eds-0.324
ara-0.322
 us-0.299
 is-0.294
Token,
Token ID11
Feature activation-5.90e+00

Pos logit contributions
 determined+1.127
 unsure+1.026
 hungry+1.015
 thirsty+1.011
 nervous+0.997
Neg logit contributions
ucks-1.042
eds-1.029
ara-1.025
of-0.908
 is-0.902
Token but
Token ID475
Feature activation-8.78e+00

Pos logit contributions
 determined+0.224
 unsure+0.196
 confident+0.188
 thirsty+0.188
 hungry+0.183
Neg logit contributions
ara-0.357
 is-0.355
 us-0.351
eds-0.348
ucks-0.335
Token she
Token ID673
Feature activation-1.56e+01

Pos logit contributions
 determined+0.331
 unsure+0.292
 confident+0.287
 thirsty+0.287
 nervous+0.286
Neg logit contributions
ara-0.551
 is-0.542
 us-0.532
eds-0.528
ucks-0.514
Token was
Token ID373
Feature activation+6.32e+00

Pos logit contributions
 determined+0.786
 unsure+0.713
 hungry+0.709
 thirsty+0.702
 nervous+0.694
Neg logit contributions
ucks-0.875
eds-0.860
ara-0.857
icer-0.764
of-0.763
Token determined
Token ID5295
Feature activation-2.42e+01

Pos logit contributions
eds+0.370
 us+0.360
 is+0.349
e+0.344
ac+0.343
Neg logit contributions
dream-0.204
 he-0.181
 determined-0.180
 unsure-0.177
 thirsty-0.176
Token to
Token ID284
Feature activation-3.54e+00

Pos logit contributions
 determined+1.150
 unsure+1.052
 thirsty+1.051
 hungry+1.050
 nervous+1.024
Neg logit contributions
eds-1.371
ara-1.358
ucks-1.354
 is-1.253
of-1.234
Token try
Token ID1949
Feature activation-4.63e+00

Pos logit contributions
 determined+0.213
 unsure+0.196
 hungry+0.195
 thirsty+0.194
 nervous+0.192
Neg logit contributions
ucks-0.164
eds-0.160
ara-0.160
icer-0.139
of-0.138
Token it
Token ID340
Feature activation-7.52e+00

Pos logit contributions
 determined+0.234
 unsure+0.213
 hungry+0.212
 thirsty+0.211
 nervous+0.208
Neg logit contributions
ucks-0.256
eds-0.254
ara-0.252
of-0.225
 is-0.224
Token out
Token ID503
Feature activation-1.69e+00

Pos logit contributions
 determined+0.417
 unsure+0.383
 hungry+0.380
 thirsty+0.376
 nervous+0.374
Neg logit contributions
ucks-0.385
ara-0.377
eds-0.376
icer-0.332
of-0.328
Token.
Token ID13
Feature activation-1.82e+00

Pos logit contributions
 determined+0.092
 unsure+0.084
 hungry+0.084
 thirsty+0.083
 nervous+0.082
Neg logit contributions
ucks-0.087
eds-0.086
ara-0.085
of-0.076
 is-0.075
Token on
Token ID319
Feature activation-3.03e+00

Pos logit contributions
 determined+0.288
 unsure+0.263
 thirsty+0.260
 hungry+0.260
 nervous+0.255
Neg logit contributions
ara-0.237
ucks-0.236
eds-0.235
 is-0.218
of-0.209
Token her
Token ID607
Feature activation-1.42e+00

Pos logit contributions
 determined+0.132
 unsure+0.119
 hungry+0.118
 nervous+0.116
 thirsty+0.114
Neg logit contributions
ucks-0.191
eds-0.187
ara-0.183
icer-0.165
of-0.165
Token head
Token ID1182
Feature activation-1.01e+00

Pos logit contributions
 determined+0.079
 unsure+0.072
 hungry+0.072
 thirsty+0.071
 nervous+0.071
Neg logit contributions
ucks-0.072
eds-0.070
ara-0.070
icer-0.062
of-0.062
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
 determined+0.059
 unsure+0.054
 hungry+0.054
 thirsty+0.054
 nervous+0.053
Neg logit contributions
ucks-0.048
ara-0.048
eds-0.048
of-0.042
 is-0.041
Token was
Token ID373
Feature activation+6.68e+00

Pos logit contributions
 determined+0.048
 unsure+0.043
 thirsty+0.042
 hungry+0.041
 nervous+0.041
Neg logit contributions
ara-0.059
 is-0.058
ucks-0.058
eds-0.057
of-0.053
Token determined
Token ID5295
Feature activation-2.40e+01

Pos logit contributions
eds+0.412
ucks+0.398
ara+0.396
 is+0.394
 us+0.385
Neg logit contributions
 determined-0.223
 unsure-0.208
 thirsty-0.198
dream-0.195
 hungry-0.192
Token to
Token ID284
Feature activation-3.23e+00

Pos logit contributions
 determined+1.100
 unsure+0.998
 hungry+0.993
 thirsty+0.989
 nervous+0.968
Neg logit contributions
eds-1.405
ucks-1.401
ara-1.397
 is-1.277
of-1.269
Token succeed
Token ID6758
Feature activation-5.78e+00

Pos logit contributions
 determined+0.192
 unsure+0.174
 hungry+0.171
 nervous+0.171
 thirsty+0.169
Neg logit contributions
ucks-0.160
eds-0.156
ara-0.156
icer-0.139
ac-0.131
Token.
Token ID13
Feature activation-2.04e+00

Pos logit contributions
 determined+0.316
 unsure+0.289
 hungry+0.287
 thirsty+0.284
 nervous+0.282
Neg logit contributions
ucks-0.301
ara-0.294
eds-0.293
icer-0.259
of-0.258
Token She
Token ID1375
Feature activation-1.03e+01

Pos logit contributions
 determined+0.101
 hungry+0.091
 unsure+0.091
 thirsty+0.090
 nervous+0.088
Neg logit contributions
ucks-0.115
ara-0.114
eds-0.112
of-0.102
icer-0.101
Token went
Token ID1816
Feature activation-2.36e+00

Pos logit contributions
 determined+0.517
 unsure+0.464
 hungry+0.456
 nervous+0.450
 thirsty+0.449
Neg logit contributions
ucks-0.591
eds-0.579
ara-0.572
icer-0.516
of-0.506
Token game
Token ID983
Feature activation-8.24e-02

Pos logit contributions
 determined+0.001
 unsure+0.001
 hungry+0.001
 thirsty+0.001
 nervous+0.001
Neg logit contributions
ucks-0.001
eds-0.001
ara-0.001
of-0.001
icer-0.001
Token,
Token ID11
Feature activation-5.33e+00

Pos logit contributions
 determined+0.004
 unsure+0.004
 hungry+0.004
 thirsty+0.004
 nervous+0.004
Neg logit contributions
ucks-0.004
eds-0.004
ara-0.004
icer-0.004
of-0.004
Token but
Token ID475
Feature activation-8.34e+00

Pos logit contributions
 determined+0.221
 unsure+0.198
 hungry+0.198
 thirsty+0.197
 nervous+0.191
Neg logit contributions
ara-0.323
ucks-0.319
eds-0.317
 is-0.304
 us-0.299
Token he
Token ID339
Feature activation-1.52e+01

Pos logit contributions
 determined+0.335
 hungry+0.297
 thirsty+0.297
 unsure+0.297
 nervous+0.292
Neg logit contributions
ara-0.525
eds-0.510
ucks-0.509
 is-0.492
 us-0.483
Token was
Token ID373
Feature activation+6.81e+00

Pos logit contributions
 determined+0.829
 unsure+0.755
 hungry+0.748
 thirsty+0.740
 nervous+0.736
Neg logit contributions
ucks-0.799
eds-0.781
ara-0.773
icer-0.689
of-0.681
Token determined
Token ID5295
Feature activation-2.39e+01

Pos logit contributions
 is+0.407
 us+0.392
eds+0.391
ac+0.383
ara+0.379
Neg logit contributions
dream-0.206
 thirsty-0.204
 determined-0.195
 hungry-0.192
 unsure-0.186
Token to
Token ID284
Feature activation-3.12e+00

Pos logit contributions
 determined+1.159
 hungry+1.055
 unsure+1.054
 thirsty+1.052
 nervous+1.028
Neg logit contributions
ucks-1.343
eds-1.341
ara-1.338
 is-1.213
of-1.205
Token still
Token ID991
Feature activation-7.63e+00

Pos logit contributions
 determined+0.180
 unsure+0.162
 nervous+0.159
 hungry+0.157
 curious+0.157
Neg logit contributions
ucks-0.162
eds-0.157
ara-0.156
icer-0.144
ac-0.132
Token play
Token ID711
Feature activation+2.10e+00

Pos logit contributions
 determined+0.466
 unsure+0.427
 hungry+0.419
 nervous+0.417
 thirsty+0.414
Neg logit contributions
ucks-0.361
ara-0.349
eds-0.349
icer-0.312
of-0.294
Token fair
Token ID3148
Feature activation-3.06e+00

Pos logit contributions
ucks+0.112
eds+0.111
ara+0.110
of+0.098
icer+0.098
Neg logit contributions
 determined-0.110
 unsure-0.100
 hungry-0.100
 thirsty-0.099
 nervous-0.098
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
 determined+0.163
 unsure+0.149
 hungry+0.148
 nervous+0.147
 curious+0.145
Neg logit contributions
ucks-0.163
eds-0.158
ara-0.158
icer-0.145
of-0.137
Token small
Token ID1402
Feature activation-6.94e+00

Pos logit contributions
 determined+0.261
 unsure+0.237
 hungry+0.236
 thirsty+0.233
 nervous+0.231
Neg logit contributions
ucks-0.281
eds-0.276
ara-0.275
icer-0.244
of-0.244
Token towel
Token ID24808
Feature activation-4.41e+00

Pos logit contributions
 determined+0.405
 unsure+0.370
 hungry+0.362
 nervous+0.361
 curious+0.356
Neg logit contributions
ucks-0.355
ara-0.333
eds-0.332
icer-0.285
ent-0.284
Token.
Token ID13
Feature activation-2.72e+00

Pos logit contributions
 determined+0.222
 unsure+0.202
 hungry+0.200
 thirsty+0.198
 nervous+0.197
Neg logit contributions
ucks-0.245
eds-0.241
ara-0.240
icer-0.214
of-0.213
Token He
Token ID679
Feature activation-9.90e+00

Pos logit contributions
 determined+0.131
 unsure+0.118
 hungry+0.117
 thirsty+0.116
 nervous+0.114
Neg logit contributions
ucks-0.158
ara-0.156
eds-0.154
of-0.139
icer-0.139
Token was
Token ID373
Feature activation+6.66e+00

Pos logit contributions
 determined+0.512
 unsure+0.460
 hungry+0.453
 nervous+0.448
 thirsty+0.447
Neg logit contributions
ucks-0.562
eds-0.546
ara-0.534
icer-0.491
ent-0.476
Token determined
Token ID5295
Feature activation-2.39e+01

Pos logit contributions
 is+0.370
eds+0.369
ac+0.367
e+0.359
 us+0.356
Neg logit contributions
dream-0.215
 thirsty-0.193
 unsure-0.188
 determined-0.188
 desperate-0.182
Token to
Token ID284
Feature activation-3.30e+00

Pos logit contributions
 determined+1.073
 unsure+0.966
 hungry+0.965
 thirsty+0.962
 nervous+0.937
Neg logit contributions
ucks-1.433
eds-1.432
ara-1.427
 is-1.302
of-1.292
Token succeed
Token ID6758
Feature activation-5.67e+00

Pos logit contributions
 determined+0.196
 unsure+0.178
 hungry+0.176
 nervous+0.176
 thirsty+0.175
Neg logit contributions
ucks-0.160
eds-0.155
ara-0.155
icer-0.138
of-0.132
Token in
Token ID287
Feature activation-8.97e-01

Pos logit contributions
 determined+0.311
 hungry+0.285
 unsure+0.285
 thirsty+0.284
 nervous+0.278
Neg logit contributions
ucks-0.283
eds-0.283
ara-0.280
 is-0.250
of-0.248
Token iron
Token ID6953
Feature activation-3.45e+00

Pos logit contributions
 determined+0.043
 unsure+0.039
 hungry+0.039
 thirsty+0.039
 nervous+0.038
Neg logit contributions
ucks-0.051
eds-0.051
ara-0.051
icer-0.045
of-0.045
Tokening
Token ID278
Feature activation+8.70e+00

Pos logit contributions
 determined+0.187
 unsure+0.171
 hungry+0.170
 thirsty+0.169
 nervous+0.167
Neg logit contributions
ucks-0.179
eds-0.176
ara-0.176
of-0.155
icer-0.155
Token
Token ID198
Feature activation+7.49e-01

Pos logit contributions
 determined+0.179
 unsure+0.160
 hungry+0.160
 thirsty+0.158
 nervous+0.156
Neg logit contributions
ucks-0.224
ara-0.221
eds-0.220
of-0.198
icer-0.197
Token
Token ID198
Feature activation+9.83e-01

Pos logit contributions
eds+0.043
ucks+0.043
ara+0.041
ent+0.040
 occasions+0.039
Neg logit contributions
 determined-0.035
 hungry-0.031
 curious-0.031
 unsure-0.031
 excited-0.030
TokenBut
Token ID1537
Feature activation-5.08e+00

Pos logit contributions
ucks+0.044
eds+0.044
ara+0.040
icer+0.037
 is+0.037
Neg logit contributions
 determined-0.062
 unsure-0.055
 hungry-0.055
 curious-0.054
 thirsty-0.054
Token she
Token ID673
Feature activation-1.55e+01

Pos logit contributions
 determined+0.240
 unsure+0.214
 thirsty+0.213
 nervous+0.212
 hungry+0.211
Neg logit contributions
ara-0.294
eds-0.278
ucks-0.277
 us-0.264
 is-0.264
Token was
Token ID373
Feature activation+6.59e+00

Pos logit contributions
 determined+0.748
 unsure+0.672
 hungry+0.670
 thirsty+0.662
 confused+0.653
Neg logit contributions
ucks-0.924
eds-0.901
ara-0.878
icer-0.805
of-0.795
Token determined
Token ID5295
Feature activation-2.38e+01

Pos logit contributions
eds+0.355
 us+0.355
 is+0.346
ac+0.344
ara+0.341
Neg logit contributions
dream-0.213
 determined-0.197
 he-0.196
 unsure-0.190
 wishing-0.184
Token to
Token ID284
Feature activation-3.29e+00

Pos logit contributions
 determined+1.144
 unsure+1.036
 hungry+1.032
 thirsty+1.026
 nervous+1.007
Neg logit contributions
ucks-1.366
ara-1.359
eds-1.358
 is-1.224
of-1.220
Token learn
Token ID2193
Feature activation+1.14e+00

Pos logit contributions
 determined+0.200
 unsure+0.183
 hungry+0.182
 thirsty+0.181
 nervous+0.180
Neg logit contributions
ucks-0.153
eds-0.149
ara-0.149
icer-0.130
of-0.128
Token how
Token ID703
Feature activation+1.57e+00

Pos logit contributions
ucks+0.067
ara+0.066
eds+0.066
icer+0.060
of+0.058
Neg logit contributions
 determined-0.055
 unsure-0.050
 hungry-0.049
 nervous-0.048
 thirsty-0.048
Token to
Token ID284
Feature activation-2.88e+00

Pos logit contributions
eds+0.083
ucks+0.078
 is+0.077
ara+0.077
 us+0.074
Neg logit contributions
 determined-0.077
 unsure-0.074
 thirsty-0.073
 hungry-0.070
 confused-0.070
Token read
Token ID1100
Feature activation-8.04e-01

Pos logit contributions
 determined+0.168
 unsure+0.153
 hungry+0.150
 curious+0.150
 nervous+0.150
Neg logit contributions
ucks-0.147
eds-0.141
ara-0.139
icer-0.127
ac-0.119
Token and
Token ID290
Feature activation-2.28e+00

Pos logit contributions
 determined+0.056
 unsure+0.051
 thirsty+0.050
 nervous+0.049
 hungry+0.049
Neg logit contributions
eds-0.051
ucks-0.050
ara-0.050
 is-0.047
 us-0.045
Token started
Token ID2067
Feature activation-7.77e+00

Pos logit contributions
 determined+0.096
 unsure+0.085
 nervous+0.083
 thirsty+0.082
 hungry+0.081
Neg logit contributions
ara-0.135
eds-0.134
ucks-0.133
 is-0.129
 us-0.123
Token building
Token ID2615
Feature activation-7.95e+00

Pos logit contributions
 determined+0.373
 unsure+0.360
 thirsty+0.345
 nervous+0.340
 confident+0.337
Neg logit contributions
eds-0.399
ucks-0.376
 is-0.365
ara-0.351
 us-0.348
Token again
Token ID757
Feature activation+2.75e+00

Pos logit contributions
 determined+0.435
 unsure+0.397
 thirsty+0.397
 hungry+0.392
 nervous+0.383
Neg logit contributions
ucks-0.395
eds-0.392
ara-0.384
 is-0.365
 us-0.350
Token,
Token ID11
Feature activation-5.16e+00

Pos logit contributions
eds+0.128
ucks+0.127
ara+0.125
 is+0.112
 us+0.109
Neg logit contributions
 determined-0.161
 unsure-0.148
 thirsty-0.146
 hungry-0.146
 nervous-0.144
Token determined
Token ID5295
Feature activation-2.38e+01

Pos logit contributions
 determined+0.197
 thirsty+0.188
 hungry+0.185
 nervous+0.178
 unsure+0.178
Neg logit contributions
 is-0.290
eds-0.286
ara-0.279
ucks-0.275
 us-0.274
Token to
Token ID284
Feature activation-3.23e+00

Pos logit contributions
 determined+1.078
 unsure+0.973
 hungry+0.969
 thirsty+0.966
 nervous+0.945
Neg logit contributions
eds-1.427
ucks-1.422
ara-1.421
 is-1.296
of-1.284
Token make
Token ID787
Feature activation+5.17e+00

Pos logit contributions
 determined+0.200
 unsure+0.184
 hungry+0.183
 thirsty+0.182
 nervous+0.181
Neg logit contributions
ucks-0.145
ara-0.141
eds-0.141
icer-0.122
of-0.121
Token an
Token ID281
Feature activation-5.14e-01

Pos logit contributions
ucks+0.291
eds+0.288
ara+0.284
 is+0.262
 us+0.261
Neg logit contributions
 determined-0.251
 thirsty-0.227
 unsure-0.226
 hungry-0.226
 nervous-0.219
Token even
Token ID772
Feature activation-5.67e+00

Pos logit contributions
 determined+0.027
 hungry+0.025
 thirsty+0.025
 unsure+0.025
 nervous+0.024
Neg logit contributions
ucks-0.027
eds-0.026
ara-0.026
icer-0.023
 us-0.023
Token longer
Token ID2392
Feature activation-6.31e+00

Pos logit contributions
 determined+0.279
 thirsty+0.253
dream+0.252
 hungry+0.251
 unsure+0.249
Neg logit contributions
ucks-0.284
eds-0.274
ara-0.273
 is-0.268
ac-0.265
Token kept
Token ID4030
Feature activation-2.93e+00

Pos logit contributions
 determined+0.005
 unsure+0.004
 hungry+0.004
 confused+0.004
 nervous+0.004
Neg logit contributions
ucks-0.005
eds-0.005
ara-0.005
icer-0.004
 occasions-0.004
Token cleaning
Token ID12724
Feature activation-6.53e+00

Pos logit contributions
 determined+0.120
 thirsty+0.116
 unsure+0.113
 nervous+0.109
 hungry+0.109
Neg logit contributions
eds-0.163
 us-0.155
ucks-0.153
 is-0.152
ara-0.149
Token.
Token ID13
Feature activation-2.29e+00

Pos logit contributions
 determined+0.320
 unsure+0.290
 hungry+0.289
 thirsty+0.286
 nervous+0.283
Neg logit contributions
ucks-0.373
eds-0.367
ara-0.366
of-0.327
icer-0.326
Token She
Token ID1375
Feature activation-1.05e+01

Pos logit contributions
 determined+0.088
 unsure+0.077
 hungry+0.076
 thirsty+0.076
 nervous+0.073
Neg logit contributions
ucks-0.156
ara-0.155
eds-0.152
icer-0.140
 is-0.140
Token was
Token ID373
Feature activation+6.85e+00

Pos logit contributions
 determined+0.590
 unsure+0.546
 hungry+0.545
 thirsty+0.539
 nervous+0.531
Neg logit contributions
ara-0.513
ucks-0.508
eds-0.505
 is-0.454
of-0.444
Token determined
Token ID5295
Feature activation-2.38e+01

Pos logit contributions
eds+0.359
 us+0.358
ac+0.336
 is+0.336
ma+0.324
Neg logit contributions
 he-0.230
dream-0.228
 determined-0.220
 unsure-0.212
chard-0.210
Token to
Token ID284
Feature activation-3.27e+00

Pos logit contributions
 determined+1.066
 unsure+0.957
 hungry+0.953
 thirsty+0.946
 nervous+0.928
Neg logit contributions
ucks-1.450
eds-1.439
ara-1.438
 is-1.296
of-1.296
Token finish
Token ID5461
Feature activation-2.96e+00

Pos logit contributions
 determined+0.186
 unsure+0.169
 hungry+0.168
 thirsty+0.167
 nervous+0.167
Neg logit contributions
ucks-0.165
ara-0.160
eds-0.160
icer-0.142
of-0.139
Token the
Token ID262
Feature activation+1.60e+00

Pos logit contributions
 determined+0.146
 unsure+0.132
 hungry+0.131
 thirsty+0.131
 nervous+0.129
Neg logit contributions
ucks-0.166
eds-0.165
ara-0.164
 is-0.148
icer-0.147
Token job
Token ID1693
Feature activation-6.25e-02

Pos logit contributions
ucks+0.073
eds+0.072
ara+0.072
icer+0.063
of+0.062
Neg logit contributions
 determined-0.095
 unsure-0.088
 hungry-0.087
 thirsty-0.087
 nervous-0.086
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
 determined+0.003
 hungry+0.003
 thirsty+0.003
 unsure+0.003
 confused+0.003
Neg logit contributions
eds-0.003
ara-0.003
ucks-0.003
 is-0.003
 occasions-0.003
Token quite
Token ID2407
Feature activation-9.25e+00

Pos logit contributions
 determined+0.001
dream+0.001
 unsure+0.001
chard+0.001
 thirsty+0.001
Neg logit contributions
 is-0.003
 us-0.003
eds-0.002
 yourself-0.002
ucks-0.002
Token ignorant
Token ID20109
Feature activation-1.21e+01

Pos logit contributions
 determined+0.209
 unsure+0.162
dream+0.161
 barely+0.161
chard+0.149
Neg logit contributions
 is-0.650
 us-0.629
ara-0.613
eds-0.610
ucks-0.610
Token but
Token ID475
Feature activation-8.06e+00

Pos logit contributions
 determined+0.737
 thirsty+0.689
 hungry+0.671
 unsure+0.669
 nervous+0.644
Neg logit contributions
eds-0.525
ucks-0.517
ara-0.513
 is-0.496
ent-0.468
Token she
Token ID673
Feature activation-1.50e+01

Pos logit contributions
 determined+0.269
dream+0.263
 thirsty+0.259
 hungry+0.253
 nervous+0.253
Neg logit contributions
 is-0.480
ara-0.463
 us-0.447
eds-0.439
of-0.431
Token was
Token ID373
Feature activation+6.89e+00

Pos logit contributions
 determined+0.765
 hungry+0.710
 thirsty+0.706
 unsure+0.704
 nervous+0.689
Neg logit contributions
ara-0.796
eds-0.780
ucks-0.776
 is-0.717
of-0.699
Token determined
Token ID5295
Feature activation-2.38e+01

Pos logit contributions
 is+0.377
 us+0.377
ac+0.359
eds+0.354
e+0.348
Neg logit contributions
dream-0.208
 thirsty-0.190
 he-0.188
 hungry-0.178
chard-0.174
Token to
Token ID284
Feature activation-3.22e+00

Pos logit contributions
 determined+1.143
 unsure+1.040
 hungry+1.039
 thirsty+1.038
 nervous+1.010
Neg logit contributions
eds-1.339
ara-1.337
ucks-1.335
 is-1.218
of-1.205
Token take
Token ID1011
Feature activation+7.28e+00

Pos logit contributions
 determined+0.196
 unsure+0.180
 hungry+0.179
 thirsty+0.177
 nervous+0.177
Neg logit contributions
ucks-0.150
eds-0.145
ara-0.145
icer-0.128
of-0.124
Token care
Token ID1337
Feature activation-1.31e+00

Pos logit contributions
ucks+0.402
eds+0.401
ara+0.400
 is+0.367
of+0.365
Neg logit contributions
 determined-0.353
 thirsty-0.323
 unsure-0.321
 hungry-0.320
 nervous-0.310
Token of
Token ID286
Feature activation+3.69e+00

Pos logit contributions
 determined+0.076
 unsure+0.070
 hungry+0.070
 thirsty+0.070
 nervous+0.068
Neg logit contributions
ucks-0.063
eds-0.062
ara-0.061
of-0.054
 is-0.054
Token the
Token ID262
Feature activation+1.97e+00

Pos logit contributions
ucks+0.207
eds+0.204
ara+0.203
of+0.182
icer+0.181
Neg logit contributions
 determined-0.183
 unsure-0.166
 hungry-0.165
 thirsty-0.164
 nervous-0.162
Token
Token ID198
Feature activation+4.35e-01

Pos logit contributions
ara+0.189
ucks+0.189
eds+0.182
of+0.165
 is+0.165
Neg logit contributions
 determined-0.198
 unsure-0.179
 hungry-0.176
 thirsty-0.175
 nervous-0.174
Token
Token ID198
Feature activation+6.06e-01

Pos logit contributions
ucks+0.025
eds+0.025
ara+0.024
ent+0.024
ators+0.023
Neg logit contributions
 determined-0.020
 hungry-0.018
 curious-0.017
 excited-0.017
 unsure-0.017
TokenT
Token ID51
Feature activation+2.57e+00

Pos logit contributions
ucks+0.027
eds+0.027
ara+0.025
icer+0.023
 is+0.023
Neg logit contributions
 determined-0.038
 curious-0.034
 unsure-0.034
 hungry-0.034
 thirsty-0.033
Tokeneddy
Token ID21874
Feature activation+1.27e+01

Pos logit contributions
eds+0.082
ucks+0.078
ases+0.069
 occasions+0.064
ara+0.063
Neg logit contributions
 determined-0.174
 unsure-0.169
 nervous-0.168
 hungry-0.167
 confused-0.167
Token was
Token ID373
Feature activation+6.40e+00

Pos logit contributions
ucks+0.750
eds+0.719
ara+0.675
 occasions+0.666
ases+0.657
Neg logit contributions
 determined-0.586
 hungry-0.551
 confused-0.543
 unsure-0.539
 curious-0.532
Token determined
Token ID5295
Feature activation-2.38e+01

Pos logit contributions
eds+0.388
 is+0.378
 us+0.369
ac+0.358
ucks+0.357
Neg logit contributions
dream-0.186
 determined-0.178
 unsure-0.166
 thirsty-0.160
chard-0.156
Token.
Token ID13
Feature activation-2.20e+00

Pos logit contributions
 determined+1.156
 unsure+1.054
 thirsty+1.052
 hungry+1.047
 nervous+1.022
Neg logit contributions
ara-1.333
eds-1.330
ucks-1.310
 is-1.229
of-1.205
Token He
Token ID679
Feature activation-9.66e+00

Pos logit contributions
 determined+0.088
 thirsty+0.081
 hungry+0.079
 unsure+0.075
 confused+0.072
Neg logit contributions
ucks-0.144
ara-0.143
eds-0.131
 is-0.129
of-0.128
Token knew
Token ID2993
Feature activation-7.06e+00

Pos logit contributions
 determined+0.461
 hungry+0.423
 thirsty+0.423
 unsure+0.423
 nervous+0.413
Neg logit contributions
ara-0.545
ucks-0.543
eds-0.538
 is-0.492
of-0.490
Token he
Token ID339
Feature activation-1.45e+01

Pos logit contributions
 determined+0.276
 thirsty+0.261
 hungry+0.254
 nervous+0.250
 unsure+0.248
Neg logit contributions
ara-0.445
eds-0.436
ucks-0.433
 is-0.430
 us-0.419
Token could
Token ID714
Feature activation-7.02e+00

Pos logit contributions
 determined+0.800
 thirsty+0.746
 hungry+0.745
 unsure+0.743
 nervous+0.730
Neg logit contributions
ara-0.705
eds-0.704
ucks-0.700
 is-0.645
of-0.628
Token a
Token ID257
Feature activation-4.37e+00

Pos logit contributions
 determined+0.200
 unsure+0.180
 thirsty+0.179
 hungry+0.179
 nervous+0.175
Neg logit contributions
ucks-0.259
eds-0.257
ara-0.256
 us-0.233
 is-0.232
Token promise
Token ID6991
Feature activation+4.33e+00

Pos logit contributions
 determined+0.215
 unsure+0.199
 thirsty+0.193
 hungry+0.193
 nervous+0.190
Neg logit contributions
eds-0.234
ara-0.234
ucks-0.230
 is-0.213
of-0.209
Token.
Token ID13
Feature activation-2.32e+00

Pos logit contributions
eds+0.241
ucks+0.239
ara+0.239
 is+0.225
of+0.217
Neg logit contributions
 determined-0.206
 unsure-0.190
 hungry-0.189
 thirsty-0.189
 confused-0.185
Token She
Token ID1375
Feature activation-1.06e+01

Pos logit contributions
 determined+0.109
 unsure+0.097
 hungry+0.097
 thirsty+0.096
 nervous+0.095
Neg logit contributions
ucks-0.137
ara-0.136
eds-0.134
of-0.121
 is-0.121
Token was
Token ID373
Feature activation+6.93e+00

Pos logit contributions
 determined+0.555
 unsure+0.500
 hungry+0.495
 thirsty+0.490
 nervous+0.486
Neg logit contributions
ucks-0.583
eds-0.573
ara-0.558
icer-0.513
ent-0.497
Token determined
Token ID5295
Feature activation-2.38e+01

Pos logit contributions
eds+0.433
 us+0.424
ac+0.420
ara+0.418
 is+0.416
Neg logit contributions
dream-0.195
 determined-0.190
 unsure-0.182
 he-0.172
 thirsty-0.171
Token to
Token ID284
Feature activation-3.15e+00

Pos logit contributions
 determined+1.072
 unsure+0.969
 hungry+0.966
 thirsty+0.964
 nervous+0.941
Neg logit contributions
eds-1.417
ara-1.415
ucks-1.413
 is-1.291
of-1.281
Token find
Token ID1064
Feature activation+2.07e+00

Pos logit contributions
 determined+0.189
 unsure+0.172
 hungry+0.171
 thirsty+0.169
 nervous+0.169
Neg logit contributions
ucks-0.152
ara-0.147
eds-0.146
icer-0.130
of-0.125
Token Sarah
Token ID10490
Feature activation-1.28e+00

Pos logit contributions
ucks+0.122
eds+0.121
ara+0.120
of+0.108
icer+0.107
Neg logit contributions
 determined-0.098
 unsure-0.088
 hungry-0.087
 thirsty-0.087
 nervous-0.085
Token and
Token ID290
Feature activation-3.16e-01

Pos logit contributions
 determined+0.077
 unsure+0.072
 hungry+0.071
 thirsty+0.071
 nervous+0.070
Neg logit contributions
ucks-0.057
eds-0.056
ara-0.056
of-0.049
 is-0.049
Token deliver
Token ID5203
Feature activation+2.95e+00

Pos logit contributions
 determined+0.014
 thirsty+0.013
 hungry+0.013
 unsure+0.013
 nervous+0.013
Neg logit contributions
ara-0.017
eds-0.017
ucks-0.017
 is-0.017
of-0.016
Token
Token ID198
Feature activation-4.16e-01

Pos logit contributions
 determined+0.040
 unsure+0.036
 hungry+0.036
 thirsty+0.035
 curious+0.035
Neg logit contributions
ucks-0.036
eds-0.035
ara-0.034
icer-0.031
ent-0.031
TokenThe
Token ID464
Feature activation+5.02e+00

Pos logit contributions
 determined+0.025
 unsure+0.022
 hungry+0.022
 curious+0.022
 thirsty+0.022
Neg logit contributions
ucks-0.020
eds-0.019
ara-0.018
icer-0.017
 is-0.017
Token cats
Token ID11875
Feature activation+3.22e+00

Pos logit contributions
ucks+0.262
ara+0.259
eds+0.258
icer+0.228
of+0.228
Neg logit contributions
 determined-0.267
 unsure-0.244
 hungry-0.242
 thirsty-0.240
 nervous-0.238
Token were
Token ID547
Feature activation+2.08e+00

Pos logit contributions
ucks+0.184
eds+0.181
ara+0.179
icer+0.161
of+0.160
Neg logit contributions
 determined-0.159
 unsure-0.144
 hungry-0.143
 thirsty-0.141
 nervous-0.140
Token still
Token ID991
Feature activation-8.38e+00

Pos logit contributions
eds+0.135
 us+0.132
ucks+0.125
 is+0.124
ara+0.123
Neg logit contributions
 determined-0.065
 unsure-0.061
dream-0.060
 he-0.059
 confident-0.056
Token determined
Token ID5295
Feature activation-2.38e+01

Pos logit contributions
 determined+0.313
 unsure+0.280
 confident+0.263
 thirsty+0.256
dream+0.256
Neg logit contributions
eds-0.543
ucks-0.518
ara-0.507
 us-0.492
 is-0.491
Token to
Token ID284
Feature activation-3.30e+00

Pos logit contributions
 determined+1.119
 unsure+1.019
 thirsty+1.012
 hungry+1.011
 nervous+0.993
Neg logit contributions
eds-1.363
ara-1.350
ucks-1.348
 is-1.245
 us-1.225
Token get
Token ID651
Feature activation+2.50e+00

Pos logit contributions
 determined+0.197
 unsure+0.179
 hungry+0.179
 thirsty+0.177
 nervous+0.175
Neg logit contributions
ucks-0.161
ara-0.155
eds-0.155
icer-0.140
of-0.131
Token the
Token ID262
Feature activation+1.44e+00

Pos logit contributions
eds+0.126
ucks+0.123
 us+0.119
ara+0.117
 is+0.113
Neg logit contributions
 determined-0.128
 unsure-0.119
 nervous-0.114
 thirsty-0.113
 confused-0.112
Token mouse
Token ID10211
Feature activation-4.96e+00

Pos logit contributions
ucks+0.071
eds+0.070
ara+0.070
icer+0.061
of+0.061
Neg logit contributions
 determined-0.081
 unsure-0.075
 hungry-0.074
 thirsty-0.074
 nervous-0.073
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
 determined+0.252
 unsure+0.230
 hungry+0.227
 thirsty+0.226
 nervous+0.224
Neg logit contributions
eds-0.264
ucks-0.259
ara-0.257
 is-0.238
of-0.236
Token Lucy
Token ID22162
Feature activation-2.42e+00

Pos logit contributions
 determined+0.262
 unsure+0.219
 nervous+0.213
 thirsty+0.210
 hungry+0.210
Neg logit contributions
ara-0.709
ucks-0.670
 is-0.669
eds-0.665
 us-0.653
Token kept
Token ID4030
Feature activation-2.45e+00

Pos logit contributions
 determined+0.135
 unsure+0.123
 hungry+0.123
 thirsty+0.122
 nervous+0.120
Neg logit contributions
ucks-0.123
eds-0.121
ara-0.120
icer-0.106
of-0.105
Token trying
Token ID2111
Feature activation-1.73e+01

Pos logit contributions
 determined+0.100
 unsure+0.094
 thirsty+0.094
 nervous+0.091
 hungry+0.089
Neg logit contributions
eds-0.141
ucks-0.136
 us-0.134
ara-0.133
 is-0.129
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
 determined+0.881
 unsure+0.802
 hungry+0.798
 thirsty+0.795
 nervous+0.783
Neg logit contributions
ucks-0.940
eds-0.934
ara-0.926
of-0.829
 is-0.827
Token was
Token ID373
Feature activation+6.71e+00

Pos logit contributions
 determined+0.046
 thirsty+0.045
 hungry+0.045
 nervous+0.044
 curious+0.043
Neg logit contributions
ara-0.063
eds-0.062
 is-0.062
ucks-0.060
of-0.059
Token determined
Token ID5295
Feature activation-2.37e+01

Pos logit contributions
 is+0.394
eds+0.391
ara+0.380
ac+0.375
 us+0.373
Neg logit contributions
 thirsty-0.201
dream-0.194
 hungry-0.194
 determined-0.190
 unsure-0.185
Token to
Token ID284
Feature activation-3.22e+00

Pos logit contributions
 determined+1.198
 unsure+1.090
 hungry+1.088
 thirsty+1.080
 nervous+1.063
Neg logit contributions
ucks-1.305
eds-1.292
ara-1.292
of-1.153
 is-1.150
Token increase
Token ID2620
Feature activation-1.19e+00

Pos logit contributions
 determined+0.194
 unsure+0.178
 hungry+0.177
 thirsty+0.175
 nervous+0.174
Neg logit contributions
ucks-0.151
eds-0.148
ara-0.147
icer-0.129
of-0.127
Token her
Token ID607
Feature activation-5.82e-01

Pos logit contributions
 determined+0.060
 unsure+0.054
 hungry+0.054
 thirsty+0.053
 nervous+0.053
Neg logit contributions
ucks-0.067
eds-0.066
ara-0.066
icer-0.059
of-0.058
Token knowledge
Token ID3725
Feature activation+3.49e+00

Pos logit contributions
 determined+0.032
 unsure+0.030
 hungry+0.030
 thirsty+0.029
 nervous+0.029
Neg logit contributions
ucks-0.029
eds-0.029
ara-0.028
 is-0.025
of-0.025
Token.
Token ID13
Feature activation-1.57e+00

Pos logit contributions
ucks+0.171
eds+0.170
ara+0.170
of+0.152
 is+0.152
Neg logit contributions
 determined-0.193
 hungry-0.178
 thirsty-0.177
 unsure-0.177
 nervous-0.173
Token
Token ID198
Feature activation+8.81e-01

Pos logit contributions
ara+0.439
ucks+0.428
eds+0.411
 is+0.389
of+0.387
Neg logit contributions
 determined-0.407
 unsure-0.361
 thirsty-0.357
 hungry-0.355
 nervous-0.348
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
ucks+0.050
eds+0.049
ara+0.048
icer+0.045
angles+0.044
Neg logit contributions
 determined-0.044
 hungry-0.039
 curious-0.039
 unsure-0.038
 nervous-0.038
TokenBut
Token ID1537
Feature activation-4.75e+00

Pos logit contributions
ucks+0.050
eds+0.049
ara+0.046
icer+0.043
 is+0.042
Neg logit contributions
 determined-0.066
 unsure-0.059
 curious-0.059
 hungry-0.058
 thirsty-0.058
Token Ted
Token ID11396
Feature activation-9.04e+00

Pos logit contributions
 determined+0.229
 unsure+0.206
 thirsty+0.205
 nervous+0.204
 hungry+0.204
Neg logit contributions
ara-0.270
eds-0.255
ucks-0.252
 us-0.242
 is-0.238
Token was
Token ID373
Feature activation+6.39e+00

Pos logit contributions
 determined+0.461
 unsure+0.418
 hungry+0.416
 thirsty+0.410
 nervous+0.407
Neg logit contributions
ucks-0.510
eds-0.497
ara-0.491
icer-0.443
of-0.436
Token determined
Token ID5295
Feature activation-2.37e+01

Pos logit contributions
eds+0.357
 us+0.354
 is+0.351
e+0.330
ara+0.326
Neg logit contributions
dream-0.193
 determined-0.175
 thirsty-0.169
 unsure-0.167
 she-0.166
Token to
Token ID284
Feature activation-3.30e+00

Pos logit contributions
 determined+1.167
 thirsty+1.064
 unsure+1.064
 hungry+1.063
 nervous+1.035
Neg logit contributions
ara-1.310
eds-1.307
ucks-1.298
 is-1.199
of-1.179
Token try
Token ID1949
Feature activation-4.36e+00

Pos logit contributions
 determined+0.198
 unsure+0.182
 hungry+0.181
 thirsty+0.180
 nervous+0.178
Neg logit contributions
ucks-0.154
eds-0.151
ara-0.151
icer-0.131
of-0.130
Token it
Token ID340
Feature activation-7.28e+00

Pos logit contributions
 determined+0.221
 unsure+0.202
 thirsty+0.202
 hungry+0.200
 nervous+0.197
Neg logit contributions
eds-0.239
ucks-0.237
ara-0.232
 is-0.211
of-0.209
Token.
Token ID13
Feature activation-1.82e+00

Pos logit contributions
 determined+0.415
 thirsty+0.380
 hungry+0.380
 unsure+0.379
 confused+0.370
Neg logit contributions
ucks-0.351
eds-0.349
ara-0.344
 is-0.311
of-0.308
Token He
Token ID679
Feature activation-9.23e+00

Pos logit contributions
 determined+0.071
 thirsty+0.065
 hungry+0.064
 unsure+0.061
 nervous+0.060
Neg logit contributions
ucks-0.121
ara-0.120
eds-0.115
 is-0.110
of-0.109
Token frustrated
Token ID14718
Feature activation-1.67e+01

Pos logit contributions
 determined+0.138
 barely+0.118
chard+0.115
dream+0.111
 thirsty+0.107
Neg logit contributions
ucks-0.260
ara-0.258
 is-0.250
 us-0.242
ac-0.242
Token,
Token ID11
Feature activation-5.17e+00

Pos logit contributions
 determined+0.973
 thirsty+0.904
 hungry+0.889
 unsure+0.865
 confused+0.856
Neg logit contributions
ara-0.760
ucks-0.727
 is-0.717
eds-0.714
 us-0.670
Token but
Token ID475
Feature activation-8.13e+00

Pos logit contributions
 determined+0.213
 thirsty+0.202
 hungry+0.197
 sorry+0.189
 unsure+0.185
Neg logit contributions
 is-0.262
ara-0.259
 us-0.258
eds-0.243
of-0.243
Token she
Token ID673
Feature activation-1.50e+01

Pos logit contributions
 determined+0.287
 thirsty+0.265
 hungry+0.264
 nervous+0.261
 confident+0.255
Neg logit contributions
ara-0.504
 is-0.485
 us-0.470
of-0.460
ac-0.452
Token was
Token ID373
Feature activation+6.74e+00

Pos logit contributions
 determined+0.768
 hungry+0.716
 thirsty+0.710
 unsure+0.709
 nervous+0.697
Neg logit contributions
ara-0.788
eds-0.753
ucks-0.748
 is-0.700
of-0.691
Token determined
Token ID5295
Feature activation-2.37e+01

Pos logit contributions
ac+0.341
 us+0.341
 is+0.338
e+0.329
eds+0.324
Neg logit contributions
dream-0.223
 he-0.197
 thirsty-0.196
 determined-0.195
 hungry-0.193
Token to
Token ID284
Feature activation-3.17e+00

Pos logit contributions
 determined+1.206
 thirsty+1.123
 hungry+1.116
 unsure+1.109
 nervous+1.088
Neg logit contributions
ara-1.237
eds-1.223
ucks-1.202
 is-1.141
of-1.113
Token find
Token ID1064
Feature activation+2.10e+00

Pos logit contributions
 determined+0.193
 unsure+0.178
 hungry+0.177
 thirsty+0.175
 nervous+0.174
Neg logit contributions
ucks-0.146
eds-0.143
ara-0.142
icer-0.124
of-0.122
Token a
Token ID257
Feature activation-3.19e+00

Pos logit contributions
ucks+0.120
eds+0.118
ara+0.118
 is+0.107
of+0.107
Neg logit contributions
 determined-0.102
 hungry-0.091
 unsure-0.091
 thirsty-0.091
 nervous-0.089
Token way
Token ID835
Feature activation-1.17e+00

Pos logit contributions
 determined+0.186
 unsure+0.176
 thirsty+0.172
 hungry+0.171
 confused+0.170
Neg logit contributions
eds-0.140
ara-0.137
ucks-0.133
 is-0.125
of-0.122
Token to
Token ID284
Feature activation-2.87e+00

Pos logit contributions
 determined+0.063
 hungry+0.059
 unsure+0.059
 thirsty+0.058
 confused+0.057
Neg logit contributions
eds-0.058
ucks-0.053
 us-0.052
of-0.051
 is-0.051
Token.
Token ID13
Feature activation-2.31e+00

Pos logit contributions
 determined+0.034
 thirsty+0.032
 hungry+0.032
 unsure+0.031
 curious+0.030
Neg logit contributions
eds-0.026
ara-0.026
ucks-0.026
 is-0.024
of-0.024
Token Lily
Token ID20037
Feature activation-7.90e-02

Pos logit contributions
 determined+0.102
 unsure+0.091
 hungry+0.090
 thirsty+0.090
 nervous+0.088
Neg logit contributions
ucks-0.144
ara-0.142
eds-0.139
of-0.127
 is-0.127
Token was
Token ID373
Feature activation+6.86e+00

Pos logit contributions
 determined+0.004
 unsure+0.004
 confused+0.004
 hungry+0.004
 nervous+0.004
Neg logit contributions
ucks-0.004
eds-0.004
ara-0.004
icer-0.004
of-0.003
Token cheerful
Token ID37999
Feature activation-6.31e+00

Pos logit contributions
 is+0.402
e+0.401
ac+0.391
ate+0.391
eds+0.379
Neg logit contributions
dream-0.205
 he-0.179
chard-0.173
 determined-0.173
 unsure-0.164
Token and
Token ID290
Feature activation-3.90e-01

Pos logit contributions
 determined+0.423
 thirsty+0.400
chard+0.382
 desperate+0.381
 unsure+0.376
Neg logit contributions
ara-0.234
 is-0.221
ucks-0.221
en-0.206
eds-0.200
Token determined
Token ID5295
Feature activation-2.36e+01

Pos logit contributions
dream+0.014
 unsure+0.014
 determined+0.014
 desperate+0.013
 discouraged+0.013
Neg logit contributions
 is-0.022
ate-0.020
ucks-0.020
ara-0.020
eds-0.020
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
 determined+1.069
 unsure+0.958
 hungry+0.952
 thirsty+0.939
 nervous+0.932
Neg logit contributions
ucks-1.456
eds-1.431
ara-1.425
icer-1.292
of-1.280
Token She
Token ID1375
Feature activation-1.03e+01

Pos logit contributions
 determined+0.081
 unsure+0.071
 hungry+0.071
 thirsty+0.071
 nervous+0.069
Neg logit contributions
ucks-0.112
ara-0.112
eds-0.108
 is-0.100
of-0.099
Token opened
Token ID4721
Feature activation-8.93e+00

Pos logit contributions
 determined+0.499
 unsure+0.456
 hungry+0.455
 thirsty+0.451
 nervous+0.443
Neg logit contributions
ucks-0.581
ara-0.579
eds-0.573
 is-0.518
of-0.515
Token the
Token ID262
Feature activation+2.04e+00

Pos logit contributions
 determined+0.469
 thirsty+0.424
 hungry+0.423
 unsure+0.423
 nervous+0.413
Neg logit contributions
ucks-0.484
ara-0.477
eds-0.472
of-0.422
icer-0.422
Token back
Token ID736
Feature activation+1.03e+00

Pos logit contributions
ucks+0.109
ara+0.107
eds+0.106
of+0.094
ases+0.093
Neg logit contributions
 determined-0.109
 unsure-0.101
 confused-0.101
 nervous-0.101
 hungry-0.100
Token.
Token ID13
Feature activation-2.89e+00

Pos logit contributions
 determined+0.510
 unsure+0.465
 thirsty+0.462
 hungry+0.462
 nervous+0.454
Neg logit contributions
ucks-0.471
eds-0.466
ara-0.465
 is-0.420
 us-0.408
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
 determined+0.136
 unsure+0.121
 hungry+0.121
 thirsty+0.120
 nervous+0.117
Neg logit contributions
ara-0.170
ucks-0.170
eds-0.163
 is-0.151
of-0.150
Token
Token ID198
Feature activation+1.66e+00

Pos logit contributions
eds+0.081
ucks+0.081
ara+0.076
icer+0.073
ent+0.073
Neg logit contributions
 determined-0.070
 unsure-0.061
 curious-0.061
 hungry-0.061
 excited-0.060
TokenSam
Token ID16305
Feature activation+3.89e+00

Pos logit contributions
ucks+0.077
eds+0.076
ara+0.072
icer+0.066
 is+0.064
Neg logit contributions
 determined-0.103
 unsure-0.092
 hungry-0.091
 curious-0.090
 thirsty-0.090
Token was
Token ID373
Feature activation+6.98e+00

Pos logit contributions
ucks+0.233
eds+0.221
ara+0.213
icer+0.197
 occasions+0.195
Neg logit contributions
 determined-0.195
 unsure-0.175
 confused-0.173
 hungry-0.173
 nervous-0.170
Token determined
Token ID5295
Feature activation-2.36e+01

Pos logit contributions
 is+0.418
e+0.413
eds+0.405
ac+0.399
 us+0.392
Neg logit contributions
dream-0.191
 he-0.168
 she-0.165
 determined-0.158
 thirsty-0.155
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
 determined+1.147
 thirsty+1.055
 hungry+1.049
 unsure+1.045
 nervous+1.019
Neg logit contributions
ara-1.306
eds-1.302
ucks-1.282
 is-1.212
 us-1.179
Token He
Token ID679
Feature activation-9.34e+00

Pos logit contributions
 determined+0.066
 thirsty+0.061
 hungry+0.060
 unsure+0.056
 nervous+0.055
Neg logit contributions
ucks-0.117
ara-0.115
eds-0.110
 is-0.108
of-0.104
Token threw
Token ID9617
Feature activation-1.84e+00

Pos logit contributions
 determined+0.438
 hungry+0.402
 thirsty+0.401
 unsure+0.400
 nervous+0.392
Neg logit contributions
ara-0.531
ucks-0.528
eds-0.526
 is-0.483
of-0.477
Token on
Token ID319
Feature activation-1.70e+00

Pos logit contributions
 determined+0.090
 unsure+0.081
 hungry+0.081
 thirsty+0.080
 nervous+0.079
Neg logit contributions
ucks-0.107
eds-0.105
ara-0.105
icer-0.093
of-0.093
Token more
Token ID517
Feature activation-1.39e+00

Pos logit contributions
 determined+0.091
 unsure+0.084
 thirsty+0.083
 hungry+0.083
 nervous+0.082
Neg logit contributions
ucks-0.088
eds-0.087
ara-0.087
of-0.078
icer-0.078
Token to
Token ID284
Feature activation-1.06e+00

Pos logit contributions
ucks+0.400
eds+0.398
ara+0.397
 is+0.360
of+0.357
Neg logit contributions
 determined-0.291
 unsure-0.263
 hungry-0.258
 thirsty-0.255
 nervous-0.253
Token go
Token ID467
Feature activation+5.71e+00

Pos logit contributions
 determined+0.065
 thirsty+0.059
 hungry+0.059
 unsure+0.059
 nervous+0.059
Neg logit contributions
ucks-0.051
eds-0.049
ara-0.047
icer-0.044
icy-0.040
Token.
Token ID13
Feature activation+3.08e-01

Pos logit contributions
eds+0.304
ucks+0.303
ara+0.296
 is+0.270
 us+0.269
Neg logit contributions
 determined-0.301
 unsure-0.275
 hungry-0.266
 thirsty-0.263
 confused-0.262
Token The
Token ID383
Feature activation+3.82e+00

Pos logit contributions
ucks+0.015
eds+0.014
ara+0.014
of+0.013
icer+0.013
Neg logit contributions
 determined-0.018
 unsure-0.017
 hungry-0.016
 thirsty-0.016
 nervous-0.016
Token veterinarian
Token ID41404
Feature activation-4.62e-01

Pos logit contributions
ucks+0.196
ara+0.195
eds+0.191
 is+0.178
 us+0.173
Neg logit contributions
 determined-0.202
 unsure-0.186
 thirsty-0.182
 hungry-0.180
 confused-0.178
Token is
Token ID318
Feature activation+1.64e+01

Pos logit contributions
 determined+0.036
 hungry+0.034
 unsure+0.034
 curious+0.034
 nervous+0.034
Neg logit contributions
ucks-0.013
eds-0.013
ara-0.011
icer-0.011
of-0.009
Token a
Token ID257
Feature activation-9.84e-01

Pos logit contributions
eds+0.948
 is+0.930
ara+0.929
 us+0.928
ucks+0.904
Neg logit contributions
 determined-0.628
 unsure-0.601
dream-0.568
 thirsty-0.554
 delighted-0.552
Token nice
Token ID3621
Feature activation-1.35e-01

Pos logit contributions
 determined+0.046
 unsure+0.043
 thirsty+0.042
 confused+0.041
 hungry+0.040
Neg logit contributions
ara-0.055
eds-0.054
ucks-0.053
 is-0.052
 us-0.049
Token person
Token ID1048
Feature activation+1.71e+00

Pos logit contributions
 determined+0.007
 unsure+0.007
 thirsty+0.007
 hungry+0.007
 confused+0.007
Neg logit contributions
ara-0.007
ucks-0.007
eds-0.006
 is-0.006
 occasions-0.006
Token who
Token ID508
Feature activation+1.41e+00

Pos logit contributions
ucks+0.091
eds+0.088
ara+0.088
icer+0.079
of+0.078
Neg logit contributions
 determined-0.091
 unsure-0.083
 hungry-0.083
 thirsty-0.082
 nervous-0.082
Token helps
Token ID5419
Feature activation+1.53e+00

Pos logit contributions
ucks+0.053
eds+0.051
ara+0.051
icer+0.044
of+0.041
Neg logit contributions
 determined-0.098
 hungry-0.092
 unsure-0.091
 thirsty-0.091
 nervous-0.090
TokenThe
Token ID464
Feature activation+8.29e+00

Pos logit contributions
 determined+0.324
 unsure+0.298
 hungry+0.294
 thirsty+0.293
 nervous+0.291
Neg logit contributions
ucks-0.250
eds-0.250
ara-0.244
icer-0.214
 is-0.213
Token crab
Token ID32202
Feature activation-5.88e+00

Pos logit contributions
ara+0.460
eds+0.448
ucks+0.447
 is+0.420
 us+0.412
Neg logit contributions
 determined-0.406
 unsure-0.377
 nervous-0.355
 thirsty-0.353
 hungry-0.352
Token won
Token ID1839
Feature activation+2.53e+00

Pos logit contributions
 determined+0.455
 hungry+0.429
 unsure+0.427
 nervous+0.422
 thirsty+0.421
Neg logit contributions
ucks-0.187
eds-0.172
ara-0.170
icer-0.137
 occasions-0.135
Token't
Token ID470
Feature activation-2.42e+00

Pos logit contributions
ucks+0.170
ara+0.168
eds+0.163
 occasions+0.153
ac+0.152
Neg logit contributions
 determined-0.102
 hungry-0.094
 nervous-0.093
 unsure-0.091
 confused-0.090
Token hurt
Token ID5938
Feature activation-1.49e+00

Pos logit contributions
 determined+0.126
 unsure+0.114
 hungry+0.114
 thirsty+0.113
 nervous+0.112
Neg logit contributions
ucks-0.134
eds-0.131
ara-0.131
icer-0.118
of-0.115
Token us
Token ID514
Feature activation+1.91e+01

Pos logit contributions
 determined+0.103
 hungry+0.097
 unsure+0.097
 nervous+0.095
 thirsty+0.095
Neg logit contributions
ucks-0.057
ara-0.055
eds-0.054
icer-0.046
of-0.045
Token.
Token ID13
Feature activation-1.24e-01

Pos logit contributions
eds+0.647
ucks+0.642
ara+0.610
 us+0.537
of+0.534
Neg logit contributions
 determined-1.364
 unsure-1.262
 thirsty-1.242
 hungry-1.238
 confused-1.224
Token It
Token ID632
Feature activation-4.22e+00

Pos logit contributions
 determined+0.007
 unsure+0.006
 hungry+0.006
 thirsty+0.006
 nervous+0.006
Neg logit contributions
ucks-0.006
ara-0.006
eds-0.006
of-0.005
icer-0.005
Token's
Token ID338
Feature activation+8.99e+00

Pos logit contributions
 determined+0.295
 unsure+0.278
 hungry+0.274
 nervous+0.274
 confused+0.273
Neg logit contributions
ucks-0.164
eds-0.157
ara-0.157
icer-0.132
 occasions-0.132
Token just
Token ID655
Feature activation-3.20e+00

Pos logit contributions
eds+0.529
ucks+0.512
ara+0.505
 us+0.493
 is+0.491
Neg logit contributions
 determined-0.396
 unsure-0.352
 thirsty-0.344
dream-0.343
 hungry-0.341
Token looking
Token ID2045
Feature activation-1.39e+01

Pos logit contributions
 determined+0.146
 unsure+0.129
 thirsty+0.126
 hungry+0.126
 nervous+0.124
Neg logit contributions
ucks-0.186
eds-0.186
ara-0.184
 is-0.175
 us-0.172
Token mom
Token ID1995
Feature activation-9.68e-01

Pos logit contributions
ucks+0.091
ara+0.090
eds+0.088
 is+0.083
 us+0.079
Neg logit contributions
 determined-0.089
 unsure-0.082
 thirsty-0.079
 hungry-0.078
 confused-0.077
Token very
Token ID845
Feature activation-1.11e+01

Pos logit contributions
 determined+0.046
 hungry+0.042
 unsure+0.042
 thirsty+0.041
 nervous+0.041
Neg logit contributions
ucks-0.057
eds-0.056
ara-0.056
icer-0.050
of-0.050
Token much
Token ID881
Feature activation-7.17e+00

Pos logit contributions
 determined+0.302
 unsure+0.253
 hungry+0.242
 thirsty+0.239
 confused+0.232
Neg logit contributions
ucks-0.863
ara-0.863
eds-0.851
of-0.800
 is-0.798
Token.
Token ID13
Feature activation+4.33e-01

Pos logit contributions
 determined+0.479
 unsure+0.444
 thirsty+0.443
 confused+0.426
 hungry+0.426
Neg logit contributions
ara-0.280
ucks-0.264
 is-0.258
eds-0.249
of-0.233
Token<|endoftext|>
Token ID50256
Feature activation+3.28e+00

Pos logit contributions
eds+0.024
ucks+0.024
ara+0.023
ent+0.021
icer+0.021
Neg logit contributions
 determined-0.022
 hungry-0.021
 thirsty-0.020
 nervous-0.020
 unsure-0.020
TokenL
Token ID43
Feature activation+1.54e+01

Pos logit contributions
eds+0.219
angles+0.213
ucks+0.212
ac+0.205
 occasions+0.205
Neg logit contributions
 hungry-0.119
 he-0.111
 excited-0.111
 nervous-0.108
 thirsty-0.107
Tokenily
Token ID813
Feature activation+3.36e-01

Pos logit contributions
ucks+0.826
eds+0.811
ators+0.747
atch+0.747
icy+0.732
Neg logit contributions
 determined-0.708
 she-0.702
 hungry-0.695
 thirsty-0.676
 excited-0.670
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
ucks+0.021
eds+0.020
ara+0.018
icer+0.018
 occasions+0.018
Neg logit contributions
 determined-0.016
 hungry-0.015
 thirsty-0.014
 curious-0.014
 nervous-0.014
Token Tom
Token ID4186
Feature activation+1.36e+00

Pos logit contributions
ara+0.090
ucks+0.089
eds+0.086
 is+0.081
 us+0.080
Neg logit contributions
 determined-0.084
 unsure-0.076
 hungry-0.073
 thirsty-0.073
 nervous-0.073
Token are
Token ID389
Feature activation+1.39e+01

Pos logit contributions
ucks+0.081
eds+0.080
 occasions+0.072
ara+0.069
icer+0.069
Neg logit contributions
 hungry-0.065
 determined-0.065
 thirsty-0.063
 excited-0.061
 curious-0.060
Token friends
Token ID2460
Feature activation+3.77e+00

Pos logit contributions
ucks+0.764
ara+0.762
eds+0.728
 is+0.694
 us+0.693
Neg logit contributions
 determined-0.658
 unsure-0.624
 confident-0.578
 confused-0.572
 desperate-0.566
Token Mom
Token ID11254
Feature activation-1.92e+00

Pos logit contributions
 determined+0.133
 unsure+0.121
 thirsty+0.119
 hungry+0.119
 nervous+0.116
Neg logit contributions
ara-0.145
ucks-0.142
eds-0.139
 is-0.131
 us-0.128
Tokenmy
Token ID1820
Feature activation+1.31e+01

Pos logit contributions
 determined+0.112
 hungry+0.106
 unsure+0.104
 nervous+0.103
 thirsty+0.103
Neg logit contributions
ucks-0.091
eds-0.089
ara-0.087
icer-0.078
of-0.077
Token.
Token ID13
Feature activation+7.67e-02

Pos logit contributions
ucks+0.592
ara+0.586
eds+0.584
 is+0.507
of+0.504
Neg logit contributions
 determined-0.794
 unsure-0.734
 hungry-0.726
 thirsty-0.725
 nervous-0.715
Token I
Token ID314
Feature activation+3.62e+00

Pos logit contributions
ucks+0.004
ara+0.004
eds+0.004
 is+0.003
of+0.003
Neg logit contributions
 determined-0.004
 unsure-0.004
 hungry-0.004
 thirsty-0.004
 nervous-0.004
Token have
Token ID423
Feature activation+6.41e+00

Pos logit contributions
ucks+0.118
eds+0.118
ara+0.111
icer+0.096
 occasions+0.089
Neg logit contributions
 determined-0.279
 hungry-0.257
 unsure-0.256
 nervous-0.256
 thirsty-0.255
Token my
Token ID616
Feature activation+1.58e+01

Pos logit contributions
ucks+0.397
eds+0.394
ara+0.392
 is+0.353
of+0.353
Neg logit contributions
 determined-0.282
 unsure-0.253
 hungry-0.249
 thirsty-0.247
 nervous-0.244
Token stick
Token ID4859
Feature activation-5.18e-01

Pos logit contributions
ucks+0.734
eds+0.733
ara+0.728
 is+0.668
 us+0.649
Neg logit contributions
 determined-0.897
 unsure-0.837
 hungry-0.815
 thirsty-0.815
 nervous-0.808
Token to
Token ID284
Feature activation-1.13e+00

Pos logit contributions
 determined+0.027
 hungry+0.024
 unsure+0.024
 thirsty+0.024
 nervous+0.024
Neg logit contributions
ucks-0.028
ara-0.028
eds-0.028
icer-0.025
of-0.024
Token protect
Token ID1805
Feature activation+5.96e+00

Pos logit contributions
 determined+0.062
 hungry+0.059
 thirsty+0.058
 curious+0.057
 nervous+0.057
Neg logit contributions
ucks-0.060
ara-0.058
eds-0.056
icer-0.051
ac-0.049
Token me
Token ID502
Feature activation+1.39e+01

Pos logit contributions
ucks+0.275
ara+0.273
eds+0.242
icer+0.240
 occasions+0.216
Neg logit contributions
 determined-0.346
 hungry-0.339
 curious-0.332
 nervous-0.330
 unsure-0.327
Token."
Token ID526
Feature activation+7.19e+00

Pos logit contributions
ara+0.661
ucks+0.659
eds+0.613
icer+0.565
of+0.545
Neg logit contributions
 determined-0.831
 hungry-0.790
 thirsty-0.775
 nervous-0.769
 unsure-0.766
Token.
Token ID13
Feature activation+4.21e-01

Pos logit contributions
 determined+0.170
 hungry+0.157
 thirsty+0.156
 unsure+0.156
 nervous+0.152
Neg logit contributions
ucks-0.110
eds-0.109
ara-0.109
 is-0.098
of-0.096
Token He
Token ID679
Feature activation-7.38e+00

Pos logit contributions
ucks+0.028
ara+0.027
 is+0.025
eds+0.024
ate+0.024
Neg logit contributions
 determined-0.019
 unsure-0.016
 thirsty-0.016
 hungry-0.015
 desperate-0.015
Token said
Token ID531
Feature activation-1.27e+00

Pos logit contributions
 determined+0.359
 unsure+0.325
 hungry+0.323
 thirsty+0.320
 nervous+0.316
Neg logit contributions
ucks-0.425
ara-0.418
eds-0.418
of-0.373
icer-0.373
Token "
Token ID366
Feature activation-5.11e+00

Pos logit contributions
 determined+0.041
 unsure+0.036
 thirsty+0.034
 desperate+0.034
 confident+0.032
Neg logit contributions
 is-0.089
ara-0.088
ucks-0.085
 us-0.083
of-0.079
TokenAr
Token ID3163
Feature activation+7.58e+00

Pos logit contributions
 determined+0.269
 unsure+0.250
dream+0.246
 nervous+0.245
 he+0.243
Neg logit contributions
eds-0.276
ucks-0.244
ara-0.234
rap-0.227
 is-0.227
Tokenr
Token ID81
Feature activation+1.51e+01

Pos logit contributions
ucks+0.235
ara+0.225
eds+0.216
icer+0.194
 is+0.183
Neg logit contributions
 determined-0.549
dream-0.505
 unsure-0.497
 curious-0.480
 sorry-0.480
Token!"
Token ID2474
Feature activation+1.21e+01

Pos logit contributions
ucks+0.757
eds+0.713
ara+0.699
 occasions+0.672
icer+0.623
Neg logit contributions
 determined-0.814
 hungry-0.768
 nervous-0.747
 confused-0.739
 curious-0.738
Token and
Token ID290
Feature activation+1.60e+00

Pos logit contributions
ucks+0.604
ara+0.594
 is+0.514
of+0.477
ate+0.473
Neg logit contributions
 determined-0.729
 unsure-0.623
 thirsty-0.616
 desperate-0.606
 curious-0.603
Token wandered
Token ID36036
Feature activation-2.18e+00

Pos logit contributions
ucks+0.077
ara+0.076
 is+0.071
eds+0.068
ac+0.063
Neg logit contributions
 determined-0.088
 unsure-0.080
dream-0.078
 desperate-0.078
 thirsty-0.077
Token to
Token ID284
Feature activation-1.22e+00

Pos logit contributions
 determined+0.149
 unsure+0.138
 thirsty+0.134
 hungry+0.131
 desperate+0.131
Neg logit contributions
ucks-0.085
eds-0.079
 us-0.077
ara-0.077
 is-0.074
Token the
Token ID262
Feature activation+3.74e+00

Pos logit contributions
 determined+0.057
 hungry+0.052
 nervous+0.051
 unsure+0.051
 thirsty+0.050
Neg logit contributions
ucks-0.071
eds-0.071
ara-0.071
icer-0.064
w-0.064
Token it
Token ID340
Feature activation-5.44e+00

Pos logit contributions
 determined+0.036
 unsure+0.033
 hungry+0.032
 nervous+0.032
 thirsty+0.032
Neg logit contributions
ucks-0.057
eds-0.056
ara-0.056
of-0.050
icer-0.050
Token.
Token ID13
Feature activation-5.54e-03

Pos logit contributions
 determined+0.295
 unsure+0.269
 hungry+0.268
 thirsty+0.266
 nervous+0.261
Neg logit contributions
ucks-0.280
eds-0.278
ara-0.275
of-0.245
 is-0.245
Token
Token ID220
Feature activation-4.74e+00

Pos logit contributions
 determined+0.000
 unsure+0.000
 thirsty+0.000
 confident+0.000
 hungry+0.000
Neg logit contributions
ara-0.000
ucks-0.000
 is-0.000
eds-0.000
of-0.000
Token
Token ID198
Feature activation+3.71e+00

Pos logit contributions
 determined+0.211
 nervous+0.199
 unsure+0.196
 hungry+0.194
 confused+0.193
Neg logit contributions
eds-0.277
ucks-0.271
ara-0.268
 occasions-0.259
ent-0.251
Token
Token ID198
Feature activation+3.70e+00

Pos logit contributions
eds+0.212
ucks+0.211
ara+0.201
ases+0.192
icer+0.191
Neg logit contributions
 determined-0.178
 nervous-0.163
 hungry-0.162
 unsure-0.161
 curious-0.160
Token"
Token ID1
Feature activation+6.65e+00

Pos logit contributions
ucks+0.114
eds+0.110
ara+0.108
icer+0.088
of+0.086
Neg logit contributions
 determined-0.283
 unsure-0.264
 hungry-0.264
 thirsty-0.263
 nervous-0.261
TokenOh
Token ID5812
Feature activation+3.46e+00

Pos logit contributions
ucks+0.324
eds+0.318
ara+0.296
of+0.273
 is+0.268
Neg logit contributions
 determined-0.388
 thirsty-0.362
 hungry-0.362
 nervous-0.356
 unsure-0.356
Token no
Token ID645
Feature activation-3.92e+00

Pos logit contributions
ara+0.225
eds+0.215
ucks+0.214
 is+0.209
of+0.202
Neg logit contributions
 determined-0.140
 unsure-0.123
 thirsty-0.120
 confident-0.118
 confused-0.116
Token!"
Token ID2474
Feature activation+1.21e+01

Pos logit contributions
 determined+0.271
 hungry+0.256
 unsure+0.253
 nervous+0.252
 thirsty+0.250
Neg logit contributions
ucks-0.150
eds-0.143
ara-0.140
icer-0.120
of-0.118
Token Lily
Token ID20037
Feature activation+1.51e+00

Pos logit contributions
ara+0.596
 is+0.566
ucks+0.536
 us+0.525
 occasions+0.506
Neg logit contributions
 determined-0.666
 unsure-0.561
 confident-0.559
 thirsty-0.559
 desperate-0.558
Token cried
Token ID16896
Feature activation-5.09e+00

Pos logit contributions
ara+0.082
eds+0.082
ucks+0.079
 is+0.076
 us+0.074
Neg logit contributions
 determined-0.073
 thirsty-0.064
 hungry-0.064
 confident-0.063
 unsure-0.063
Token yet
Token ID1865
Feature activation-1.32e+00

Pos logit contributions
ara+0.078
ucks+0.076
eds+0.075
 is+0.070
of+0.069
Neg logit contributions
 determined-0.088
 thirsty-0.081
 unsure-0.080
 hungry-0.080
 confused-0.078
Token.
Token ID13
Feature activation-1.87e-01

Pos logit contributions
 determined+0.075
 thirsty+0.069
 unsure+0.069
 hungry+0.068
 nervous+0.067
Neg logit contributions
ara-0.063
ucks-0.062
eds-0.061
 is-0.056
of-0.055
Token She
Token ID1375
Feature activation-8.61e+00

Pos logit contributions
 determined+0.008
 hungry+0.007
 thirsty+0.007
 unsure+0.007
 confused+0.007
Neg logit contributions
ucks-0.012
ara-0.012
eds-0.011
of-0.011
 is-0.011
Token suggested
Token ID5220
Feature activation-5.75e+00

Pos logit contributions
 determined+0.409
 hungry+0.376
 thirsty+0.374
 unsure+0.372
 nervous+0.364
Neg logit contributions
ara-0.490
ucks-0.476
eds-0.473
 is-0.439
of-0.437
Token they
Token ID484
Feature activation-7.49e+00

Pos logit contributions
 determined+0.267
 thirsty+0.258
 unsure+0.254
 hungry+0.248
 desperate+0.243
Neg logit contributions
 us-0.292
ara-0.280
 is-0.277
of-0.257
 occasions-0.253
Token keep
Token ID1394
Feature activation+1.00e+01

Pos logit contributions
 determined+0.468
 unsure+0.433
 hungry+0.432
 thirsty+0.428
 nervous+0.425
Neg logit contributions
ucks-0.327
eds-0.320
ara-0.319
icer-0.274
of-0.273
Token walking
Token ID6155
Feature activation-7.75e+00

Pos logit contributions
eds+0.554
ucks+0.553
ara+0.539
 us+0.502
 is+0.497
Neg logit contributions
 determined-0.492
 unsure-0.446
 thirsty-0.434
 hungry-0.432
 nervous-0.431
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
 determined+0.406
 unsure+0.370
 hungry+0.368
 thirsty+0.365
 nervous+0.360
Neg logit contributions
ucks-0.415
eds-0.408
ara-0.407
of-0.362
icer-0.360
Token perhaps
Token ID3737
Feature activation-3.05e+00

Pos logit contributions
ucks+0.051
eds+0.051
ara+0.050
 is+0.049
of+0.048
Neg logit contributions
 determined-0.066
 unsure-0.061
 hungry-0.060
 thirsty-0.060
 nervous-0.060
Token they
Token ID484
Feature activation-7.45e+00

Pos logit contributions
 determined+0.138
 unsure+0.125
 hungry+0.123
 thirsty+0.122
 nervous+0.120
Neg logit contributions
ara-0.178
ucks-0.177
eds-0.176
of-0.165
 is-0.165
Token would
Token ID561
Feature activation+2.78e+00

Pos logit contributions
 determined+0.416
 unsure+0.384
 hungry+0.383
 thirsty+0.379
 nervous+0.374
Neg logit contributions
eds-0.356
ucks-0.355
ara-0.354
of-0.322
 is-0.320
Token,
Token ID11
Feature activation-3.14e+00

Pos logit contributions
 determined+0.063
 unsure+0.057
 hungry+0.057
 nervous+0.057
 curious+0.056
Neg logit contributions
ucks-0.076
ara-0.073
eds-0.072
icer-0.068
of-0.064
Token but
Token ID475
Feature activation-6.30e+00

Pos logit contributions
 determined+0.157
 unsure+0.143
 hungry+0.142
 thirsty+0.141
 nervous+0.139
Neg logit contributions
ucks-0.176
eds-0.173
ara-0.172
of-0.154
icer-0.154
Token the
Token ID262
Feature activation+3.20e+00

Pos logit contributions
 determined+0.338
 unsure+0.308
 hungry+0.307
 thirsty+0.304
 nervous+0.301
Neg logit contributions
ucks-0.329
eds-0.323
ara-0.323
of-0.286
 is-0.286
Token best
Token ID1266
Feature activation+3.22e-01

Pos logit contributions
ucks+0.170
eds+0.166
ara+0.163
of+0.145
icer+0.144
Neg logit contributions
 determined-0.176
 unsure-0.160
 hungry-0.160
 thirsty-0.159
 nervous-0.157
Token things
Token ID1243
Feature activation+6.52e+00

Pos logit contributions
ucks+0.016
eds+0.016
ara+0.015
icer+0.014
of+0.014
Neg logit contributions
 determined-0.019
 unsure-0.017
 hungry-0.017
 thirsty-0.017
 nervous-0.017
Token in
Token ID287
Feature activation+5.97e-01

Pos logit contributions
ucks+0.315
eds+0.307
ara+0.307
icer+0.271
of+0.265
Neg logit contributions
 determined-0.379
 unsure-0.348
 hungry-0.347
 thirsty-0.345
 nervous-0.342
Token life
Token ID1204
Feature activation+6.90e+00

Pos logit contributions
ucks+0.034
ara+0.033
eds+0.032
icer+0.030
of+0.029
Neg logit contributions
 determined-0.030
 hungry-0.028
 thirsty-0.027
 unsure-0.027
 nervous-0.027
Token are
Token ID389
Feature activation+1.33e+01

Pos logit contributions
ucks+0.323
ara+0.306
eds+0.296
icer+0.278
of+0.253
Neg logit contributions
 determined-0.423
 hungry-0.398
 thirsty-0.392
 nervous-0.391
 unsure-0.388
Token free
Token ID1479
Feature activation-2.95e+00

Pos logit contributions
eds+0.813
ucks+0.803
ara+0.801
 is+0.751
 us+0.741
Neg logit contributions
 determined-0.565
 unsure-0.510
 hungry-0.500
 thirsty-0.497
 nervous-0.491
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
 determined+0.190
 hungry+0.179
 unsure+0.177
 thirsty+0.176
 nervous+0.175
Neg logit contributions
ucks-0.125
ara-0.121
eds-0.119
icer-0.105
of-0.102
Token Spending
Token ID43902
Feature activation+1.24e+00

Pos logit contributions
 determined+0.005
 unsure+0.005
 hungry+0.005
 nervous+0.005
 thirsty+0.005
Neg logit contributions
eds-0.006
ucks-0.006
ara-0.006
ent-0.005
icer-0.005
Token for
Token ID329
Feature activation+9.15e+00

Pos logit contributions
 determined+0.987
 thirsty+0.928
 hungry+0.885
 unsure+0.878
 delighted+0.857
Neg logit contributions
ucks-0.752
ara-0.738
 is-0.722
eds-0.716
of-0.658
Token Bob
Token ID5811
Feature activation-1.35e+00

Pos logit contributions
 us+0.355
ara+0.351
eds+0.330
ucks+0.321
 is+0.320
Neg logit contributions
 determined-0.575
 thirsty-0.525
 confident-0.519
 unsure-0.517
 delighted-0.514
Token.
Token ID13
Feature activation+3.15e-01

Pos logit contributions
 determined+0.088
 thirsty+0.083
 unsure+0.081
 hungry+0.080
 nervous+0.080
Neg logit contributions
eds-0.046
 is-0.045
ara-0.045
of-0.044
ucks-0.042
Token She
Token ID1375
Feature activation-8.37e+00

Pos logit contributions
ara+0.018
ucks+0.017
 is+0.016
eds+0.015
of+0.015
Neg logit contributions
 determined-0.016
 thirsty-0.014
 confident-0.014
 satisfied-0.013
 hungry-0.013
Token wondered
Token ID14028
Feature activation-1.17e+01

Pos logit contributions
 determined+0.419
 thirsty+0.399
 hungry+0.392
 unsure+0.390
dream+0.379
Neg logit contributions
ara-0.449
 is-0.418
ucks-0.411
eds-0.404
of-0.384
Token how
Token ID703
Feature activation+3.79e+00

Pos logit contributions
 determined+0.671
 thirsty+0.619
 hungry+0.615
 unsure+0.614
 nervous+0.607
Neg logit contributions
ara-0.541
ucks-0.534
eds-0.531
 is-0.507
 us-0.488
Token it
Token ID340
Feature activation-5.04e+00

Pos logit contributions
ara+0.278
 is+0.278
eds+0.276
 us+0.272
ucks+0.269
Neg logit contributions
 determined-0.095
 unsure-0.088
 thirsty-0.075
dream-0.075
 delighted-0.073
Token would
Token ID561
Feature activation+2.72e+00

Pos logit contributions
 determined+0.318
 unsure+0.295
 hungry+0.292
 thirsty+0.289
 nervous+0.289
Neg logit contributions
ucks-0.225
eds-0.219
ara-0.218
icer-0.189
of-0.185
Token be
Token ID307
Feature activation+1.30e+01

Pos logit contributions
ucks+0.128
ara+0.124
eds+0.124
icer+0.110
of+0.104
Neg logit contributions
 determined-0.166
 hungry-0.153
 unsure-0.153
 thirsty-0.151
 nervous-0.150
Token to
Token ID284
Feature activation-9.43e-01

Pos logit contributions
eds+0.699
ucks+0.688
ara+0.674
 is+0.659
 us+0.643
Neg logit contributions
 determined-0.631
 unsure-0.569
 thirsty-0.556
 hungry-0.545
 nervous-0.538
Token not
Token ID407
Feature activation-7.09e+00

Pos logit contributions
 determined+0.058
 hungry+0.054
 unsure+0.053
 nervous+0.053
 thirsty+0.052
Neg logit contributions
ucks-0.044
ara-0.043
eds-0.043
icer-0.039
ac-0.036
Token too
Token ID1165
Feature activation-4.39e+00

Pos logit contributions
ucks+0.653
eds+0.643
ara+0.642
of+0.571
icer+0.570
Neg logit contributions
 determined-0.606
 unsure-0.551
 hungry-0.547
 thirsty-0.542
 nervous-0.536
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
 determined+0.307
 unsure+0.288
 thirsty+0.283
 hungry+0.279
 confused+0.278
Neg logit contributions
ucks-0.152
ara-0.152
eds-0.149
 is-0.132
 us-0.127
Token my
Token ID616
Feature activation+1.59e+01

Pos logit contributions
 determined+0.132
 thirsty+0.121
 unsure+0.121
 hungry+0.121
 nervous+0.117
Neg logit contributions
ara-0.107
ucks-0.106
eds-0.102
 is-0.094
 us-0.093
Token dear
Token ID13674
Feature activation+5.01e+00

Pos logit contributions
ucks+0.899
eds+0.889
ara+0.866
of+0.784
icer+0.779
Neg logit contributions
 determined-0.810
 hungry-0.742
 unsure-0.736
 nervous-0.729
 thirsty-0.727
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
eds+0.250
ucks+0.245
ara+0.229
icer+0.217
of+0.212
Neg logit contributions
 determined-0.281
 hungry-0.268
 nervous-0.264
 unsure-0.262
 thirsty-0.258
Token You
Token ID921
Feature activation+5.98e+00

Pos logit contributions
ucks+0.027
eds+0.027
ara+0.026
icer+0.023
of+0.023
Neg logit contributions
 determined-0.026
 unsure-0.024
 hungry-0.023
 thirsty-0.023
 nervous-0.023
Token have
Token ID423
Feature activation+6.56e+00

Pos logit contributions
ucks+0.233
eds+0.227
ara+0.222
icer+0.187
 occasions+0.180
Neg logit contributions
 determined-0.412
 unsure-0.385
 hungry-0.385
 thirsty-0.382
 nervous-0.381
Token a
Token ID257
Feature activation-8.62e-01

Pos logit contributions
ucks+0.430
eds+0.416
ara+0.414
icer+0.382
of+0.377
Neg logit contributions
 determined-0.280
 hungry-0.252
 unsure-0.247
 thirsty-0.247
 nervous-0.243
Token big
Token ID1263
Feature activation+4.75e-01

Pos logit contributions
 determined+0.047
 unsure+0.043
 hungry+0.042
 thirsty+0.042
 nervous+0.041
Neg logit contributions
ucks-0.044
ara-0.044
eds-0.044
 is-0.039
of-0.039
Token heart
Token ID2612
Feature activation-6.79e-01

Pos logit contributions
ucks+0.025
eds+0.025
ara+0.025
of+0.022
icer+0.022
Neg logit contributions
 determined-0.025
 unsure-0.023
 hungry-0.023
 thirsty-0.023
 nervous-0.023
Token and
Token ID290
Feature activation+1.80e+00

Pos logit contributions
 determined+0.037
 unsure+0.034
 hungry+0.034
 thirsty+0.033
 nervous+0.033
Neg logit contributions
ucks-0.035
eds-0.035
ara-0.034
of-0.030
icer-0.030
TokenL
Token ID43
Feature activation+1.47e+01

Pos logit contributions
eds+0.152
ucks+0.141
ators+0.140
 occasions+0.135
ent+0.135
Neg logit contributions
 determined-0.218
 he-0.200
 she-0.198
 curious-0.196
 hungry-0.196
Tokenila
Token ID10102
Feature activation+7.78e+00

Pos logit contributions
eds+0.738
 occasions+0.705
ucks+0.679
ators+0.674
icer+0.670
Neg logit contributions
 determined-0.667
 nervous-0.649
 curious-0.618
 confused-0.617
 excited-0.616
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
ucks+0.414
eds+0.407
ara+0.404
icer+0.360
of+0.358
Neg logit contributions
 determined-0.414
 unsure-0.378
 hungry-0.375
 thirsty-0.372
 nervous-0.369
Token Ben
Token ID3932
Feature activation-4.22e+00

Pos logit contributions
ucks+0.067
ara+0.066
eds+0.066
of+0.059
icer+0.058
Neg logit contributions
 determined-0.061
 unsure-0.055
 hungry-0.054
 thirsty-0.054
 nervous-0.053
Token thought
Token ID1807
Feature activation-7.54e+00

Pos logit contributions
 determined+0.186
 unsure+0.165
 hungry+0.165
 thirsty+0.163
 nervous+0.160
Neg logit contributions
ucks-0.260
ara-0.258
eds-0.254
of-0.234
 is-0.232
Token about
Token ID546
Feature activation-2.06e+00

Pos logit contributions
 determined+0.357
 thirsty+0.338
 unsure+0.331
 hungry+0.327
 nervous+0.313
Neg logit contributions
 is-0.416
ara-0.409
ucks-0.409
eds-0.399
 us-0.394
Token what
Token ID644
Feature activation+2.62e+00

Pos logit contributions
 determined+0.092
 hungry+0.084
 unsure+0.082
 curious+0.082
 thirsty+0.080
Neg logit contributions
ucks-0.127
eds-0.127
ara-0.123
icer-0.116
ac-0.115
Token their
Token ID511
Feature activation+4.96e+00

Pos logit contributions
ara+0.135
ucks+0.133
eds+0.129
 is+0.125
 us+0.117
Neg logit contributions
 determined-0.138
 unsure-0.129
 thirsty-0.126
 hungry-0.124
 nervous-0.123
Token mom
Token ID1995
Feature activation-1.49e+00

Pos logit contributions
ucks+0.298
eds+0.294
ara+0.293
of+0.264
icer+0.263
Neg logit contributions
 determined-0.231
 unsure-0.208
 hungry-0.208
 thirsty-0.205
 nervous-0.203
Token said
Token ID531
Feature activation-1.86e+00

Pos logit contributions
 determined+0.075
 unsure+0.068
 hungry+0.067
 thirsty+0.067
 nervous+0.066
Neg logit contributions
ucks-0.083
ara-0.083
eds-0.082
 is-0.074
of-0.073
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
 determined+0.121
 thirsty+0.115
 hungry+0.114
 unsure+0.113
 nervous+0.109
Neg logit contributions
 is-0.068
ara-0.065
eds-0.063
ucks-0.062
ent-0.061
Token.
Token ID13
Feature activation-4.84e-02

Pos logit contributions
 determined+0.058
 hungry+0.052
 unsure+0.052
 thirsty+0.051
 nervous+0.051
Neg logit contributions
ucks-0.076
ara-0.075
eds-0.074
icer-0.067
of-0.067
Token He
Token ID679
Feature activation-7.63e+00

Pos logit contributions
 determined+0.002
 unsure+0.002
 thirsty+0.002
 hungry+0.002
 confused+0.002
Neg logit contributions
ara-0.003
ucks-0.003
eds-0.003
icer-0.003
 is-0.003
Token wanted
Token ID2227
Feature activation-4.93e+00

Pos logit contributions
 determined+0.424
 hungry+0.387
 unsure+0.387
 thirsty+0.383
 nervous+0.379
Neg logit contributions
ucks-0.392
eds-0.382
ara-0.382
of-0.336
icer-0.336
Token to
Token ID284
Feature activation-1.42e+00

Pos logit contributions
 determined+0.265
 unsure+0.248
 confused+0.231
 confident+0.230
 thirsty+0.229
Neg logit contributions
eds-0.251
ucks-0.244
ara-0.235
 is-0.232
 us-0.230
Token help
Token ID1037
Feature activation-2.10e+00

Pos logit contributions
 determined+0.079
 hungry+0.074
 thirsty+0.074
 nervous+0.073
 curious+0.072
Neg logit contributions
ucks-0.074
ara-0.073
eds-0.073
icer-0.065
ac-0.062
Token a
Token ID257
Feature activation-1.43e+00

Pos logit contributions
 determined+0.096
 hungry+0.087
 unsure+0.087
 thirsty+0.085
 nervous+0.085
Neg logit contributions
ucks-0.129
ara-0.128
eds-0.125
icer-0.114
of-0.113
Token lot
Token ID1256
Feature activation+5.76e+00

Pos logit contributions
 determined+0.077
 unsure+0.071
 thirsty+0.068
 hungry+0.068
 confused+0.068
Neg logit contributions
eds-0.073
ucks-0.073
ara-0.072
of-0.064
 is-0.064
Token but
Token ID475
Feature activation-6.18e+00

Pos logit contributions
ucks+0.303
ara+0.300
eds+0.297
icer+0.264
of+0.258
Neg logit contributions
 determined-0.305
 unsure-0.282
 hungry-0.281
 thirsty-0.277
 nervous-0.276
Token he
Token ID339
Feature activation-1.30e+01

Pos logit contributions
 determined+0.201
 unsure+0.188
dream+0.185
 confident+0.182
 hungry+0.176
Neg logit contributions
 is-0.414
ara-0.369
eds-0.367
 us-0.366
ac-0.362
Token didn
Token ID1422
Feature activation-1.86e+00

Pos logit contributions
 determined+0.734
 unsure+0.686
 hungry+0.674
 thirsty+0.672
 nervous+0.666
Neg logit contributions
ara-0.623
eds-0.619
ucks-0.610
 is-0.556
of-0.541
Tokenâ
Token ID22940
Feature activation+8.24e-01

Pos logit contributions
 determined+0.088
 unsure+0.079
 hungry+0.078
 nervous+0.078
 thirsty+0.077
Neg logit contributions
ucks-0.111
eds-0.109
ara-0.108
icer-0.096
of-0.096
Token he
Token ID339
Feature activation-1.40e+01

Pos logit contributions
 determined+0.169
 unsure+0.155
 thirsty+0.144
dream+0.139
 confused+0.136
Neg logit contributions
ara-0.366
 is-0.359
 us-0.350
eds-0.350
ucks-0.348
Token walked
Token ID6807
Feature activation-3.17e+00

Pos logit contributions
 determined+0.655
 unsure+0.585
 hungry+0.584
 thirsty+0.577
 nervous+0.569
Neg logit contributions
ucks-0.843
eds-0.826
ara-0.820
icer-0.742
of-0.732
Token to
Token ID284
Feature activation-2.19e+00

Pos logit contributions
 determined+0.188
 thirsty+0.176
 unsure+0.170
 hungry+0.170
 confused+0.165
Neg logit contributions
ucks-0.144
eds-0.138
ara-0.131
of-0.131
 us-0.127
Token the
Token ID262
Feature activation+2.58e+00

Pos logit contributions
 determined+0.103
 hungry+0.094
 unsure+0.093
 nervous+0.092
 thirsty+0.091
Neg logit contributions
ucks-0.131
eds-0.131
ara-0.130
icer-0.118
ac-0.116
Token pond
Token ID16723
Feature activation-2.50e+00

Pos logit contributions
ucks+0.126
eds+0.124
ara+0.124
icer+0.108
of+0.108
Neg logit contributions
 determined-0.148
 unsure-0.136
 hungry-0.135
 thirsty-0.134
 nervous-0.133
Token.
Token ID13
Feature activation-6.77e-01

Pos logit contributions
 determined+0.149
 unsure+0.135
 nervous+0.133
 hungry+0.132
 confused+0.131
Neg logit contributions
ucks-0.110
eds-0.110
ara-0.107
 is-0.100
of-0.098
Token He
Token ID679
Feature activation-8.27e+00

Pos logit contributions
 determined+0.030
 thirsty+0.027
 hungry+0.026
 unsure+0.026
 confident+0.025
Neg logit contributions
ucks-0.041
ara-0.041
 is-0.037
eds-0.036
of-0.036
Token wanted
Token ID2227
Feature activation-5.48e+00

Pos logit contributions
 determined+0.406
 unsure+0.375
 hungry+0.371
 thirsty+0.367
 nervous+0.364
Neg logit contributions
ara-0.455
ucks-0.452
eds-0.451
of-0.408
 is-0.407
Token to
Token ID284
Feature activation-1.92e+00

Pos logit contributions
 determined+0.264
 unsure+0.241
 thirsty+0.233
 hungry+0.231
 nervous+0.231
Neg logit contributions
eds-0.313
ucks-0.313
ara-0.309
 us-0.286
 is-0.285
Token see
Token ID766
Feature activation-1.11e+00

Pos logit contributions
 determined+0.108
 hungry+0.099
 thirsty+0.099
 nervous+0.098
 curious+0.097
Neg logit contributions
ucks-0.100
ara-0.097
eds-0.097
icer-0.090
ac-0.084
Token what
Token ID644
Feature activation+2.27e+00

Pos logit contributions
 determined+0.055
 unsure+0.050
 hungry+0.050
 thirsty+0.049
 nervous+0.049
Neg logit contributions
ucks-0.064
ara-0.063
eds-0.063
icer-0.056
of-0.056
Token hugged
Token ID41130
Feature activation-1.74e+00

Pos logit contributions
ucks+0.183
eds+0.169
ara+0.160
icer+0.159
ases+0.155
Neg logit contributions
 determined-0.149
 hungry-0.132
 excited-0.130
 thirsty-0.130
 nervous-0.129
Token her
Token ID607
Feature activation+1.43e+00

Pos logit contributions
 determined+0.082
 unsure+0.074
 hungry+0.074
 thirsty+0.073
 nervous+0.072
Neg logit contributions
ucks-0.103
eds-0.102
ara-0.101
icer-0.091
of-0.091
Token mum
Token ID25682
Feature activation-5.18e+00

Pos logit contributions
ara+0.087
ucks+0.087
eds+0.085
 is+0.080
ac+0.077
Neg logit contributions
 determined-0.062
 unsure-0.057
 thirsty-0.055
 hungry-0.054
 nervous-0.054
Token tightly
Token ID17707
Feature activation-5.53e+00

Pos logit contributions
 determined+0.272
 unsure+0.248
 hungry+0.247
 thirsty+0.245
 nervous+0.242
Neg logit contributions
ucks-0.277
ara-0.273
eds-0.272
of-0.242
icer-0.240
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 determined+0.339
 unsure+0.316
 thirsty+0.315
 hungry+0.312
 nervous+0.308
Neg logit contributions
ara-0.225
eds-0.222
ucks-0.215
 is-0.211
of-0.200
Token said
Token ID531
Feature activation-1.81e+00

Pos logit contributions
ara+0.059
ucks+0.059
eds+0.058
 is+0.054
of+0.054
Neg logit contributions
 determined-0.054
 unsure-0.049
 hungry-0.048
 thirsty-0.048
 nervous-0.048
Token goodbye
Token ID24829
Feature activation-4.72e+00

Pos logit contributions
 thirsty+0.080
 determined+0.079
 unsure+0.076
 hungry+0.075
 confused+0.072
Neg logit contributions
 is-0.101
ara-0.099
of-0.093
ucks-0.092
 us-0.090
Token with
Token ID351
Feature activation+3.47e+00

Pos logit contributions
 determined+0.302
 thirsty+0.284
 unsure+0.282
 hungry+0.279
 desperate+0.273
Neg logit contributions
ara-0.176
eds-0.169
ucks-0.167
 is-0.161
of-0.156
Token a
Token ID257
Feature activation-1.46e+00

Pos logit contributions
ara+0.176
ucks+0.175
eds+0.168
 us+0.167
 is+0.163
Neg logit contributions
 determined-0.174
 unsure-0.159
 nervous-0.154
 hungry-0.154
 thirsty-0.154
Token big
Token ID1263
Feature activation-1.39e-01

Pos logit contributions
 determined+0.068
 unsure+0.065
 thirsty+0.060
 hungry+0.060
 confused+0.059
Neg logit contributions
ara-0.080
eds-0.077
ucks-0.076
of-0.075
 is-0.073
Token kiss
Token ID9245
Feature activation+6.21e+00

Pos logit contributions
 determined+0.007
 unsure+0.007
 thirsty+0.007
 hungry+0.007
 nervous+0.006
Neg logit contributions
ara-0.007
ucks-0.007
eds-0.007
 is-0.007
of-0.006
TokenThe
Token ID464
Feature activation+8.13e+00

Pos logit contributions
ucks+0.188
eds+0.185
ara+0.184
of+0.163
icer+0.163
Neg logit contributions
 determined-0.192
 unsure-0.175
 hungry-0.174
 thirsty-0.173
 nervous-0.171
Token bunny
Token ID44915
Feature activation-3.21e+00

Pos logit contributions
ara+0.371
eds+0.359
ucks+0.358
 is+0.343
 us+0.334
Neg logit contributions
 determined-0.450
 unsure-0.425
dream-0.403
 thirsty-0.401
 confused-0.400
Token knew
Token ID2993
Feature activation-5.59e+00

Pos logit contributions
 determined+0.176
 unsure+0.163
 thirsty+0.162
 hungry+0.162
 nervous+0.158
Neg logit contributions
ara-0.156
eds-0.154
ucks-0.150
 is-0.142
of-0.138
Token he
Token ID339
Feature activation-1.32e+01

Pos logit contributions
 determined+0.205
 thirsty+0.193
 unsure+0.189
 hungry+0.189
 nervous+0.187
Neg logit contributions
 is-0.359
eds-0.352
 us-0.344
ara-0.344
ucks-0.333
Token had
Token ID550
Feature activation-3.00e-01

Pos logit contributions
 determined+0.684
 unsure+0.623
 hungry+0.620
 thirsty+0.614
 nervous+0.607
Neg logit contributions
ucks-0.715
eds-0.703
ara-0.702
of-0.622
icer-0.622
Token to
Token ID284
Feature activation-1.53e+00

Pos logit contributions
 determined+0.014
 unsure+0.013
 thirsty+0.012
 hungry+0.012
 nervous+0.012
Neg logit contributions
eds-0.017
ucks-0.017
ara-0.017
 is-0.016
 us-0.016
Token find
Token ID1064
Feature activation+3.32e+00

Pos logit contributions
 determined+0.091
 hungry+0.082
 thirsty+0.082
 unsure+0.082
 nervous+0.082
Neg logit contributions
ucks-0.076
eds-0.073
ara-0.072
icer-0.065
of-0.061
Token a
Token ID257
Feature activation-1.46e+00

Pos logit contributions
ucks+0.192
eds+0.191
ara+0.186
 is+0.172
 us+0.171
Neg logit contributions
 determined-0.161
 unsure-0.144
 thirsty-0.142
 hungry-0.142
 nervous-0.139
Token way
Token ID835
Feature activation-2.23e-01

Pos logit contributions
 determined+0.088
 unsure+0.086
 confused+0.081
 thirsty+0.081
 hungry+0.080
Neg logit contributions
eds-0.060
ara-0.057
ucks-0.055
 is-0.054
 us-0.051
Token to
Token ID284
Feature activation-1.58e+00

Pos logit contributions
 determined+0.012
 unsure+0.011
 hungry+0.011
 thirsty+0.011
 confused+0.010
Neg logit contributions
eds-0.012
ucks-0.011
 is-0.011
ara-0.011
 us-0.011
Token solve
Token ID8494
Feature activation+4.88e+00

Pos logit contributions
 determined+0.091
 hungry+0.083
 unsure+0.083
 thirsty+0.083
 nervous+0.082
Neg logit contributions
ucks-0.078
ara-0.077
eds-0.076
icer-0.067
of-0.065
TokenWhen
Token ID2215
Feature activation-6.02e+00

Pos logit contributions
ucks+0.019
eds+0.019
ara+0.018
icer+0.016
 is+0.016
Neg logit contributions
 determined-0.025
 unsure-0.022
 hungry-0.022
 curious-0.022
 thirsty-0.022
Token Sue
Token ID26089
Feature activation-5.34e-01

Pos logit contributions
 determined+0.257
 unsure+0.235
 confused+0.223
 nervous+0.221
 thirsty+0.221
Neg logit contributions
ara-0.380
ucks-0.355
eds-0.352
 us-0.350
 is-0.349
Token reached
Token ID4251
Feature activation-9.23e+00

Pos logit contributions
 determined+0.024
 hungry+0.022
 confused+0.021
 thirsty+0.021
 nervous+0.021
Neg logit contributions
ucks-0.035
eds-0.033
 occasions-0.030
icer-0.030
ate-0.029
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
 determined+0.499
 unsure+0.456
 hungry+0.448
 thirsty+0.445
 nervous+0.441
Neg logit contributions
ucks-0.481
ara-0.480
eds-0.477
 is-0.423
of-0.423
Token garden
Token ID11376
Feature activation-7.06e+00

Pos logit contributions
ucks+0.070
eds+0.067
ara+0.066
ent+0.058
of+0.058
Neg logit contributions
 determined-0.080
 hungry-0.074
 thirsty-0.073
 unsure-0.073
 nervous-0.072
Token she
Token ID673
Feature activation-1.46e+01

Pos logit contributions
 determined+0.361
 unsure+0.320
 hungry+0.312
 thirsty+0.310
 nervous+0.309
Neg logit contributions
ara-0.389
ucks-0.379
eds-0.374
 is-0.360
of-0.349
Token took
Token ID1718
Feature activation+2.40e+00

Pos logit contributions
 determined+0.694
 hungry+0.628
 thirsty+0.624
 unsure+0.619
 confused+0.610
Neg logit contributions
ucks-0.894
eds-0.860
ara-0.845
icer-0.776
ent-0.774
Token a
Token ID257
Feature activation-3.04e+00

Pos logit contributions
ucks+0.134
ara+0.133
eds+0.131
of+0.122
 us+0.121
Neg logit contributions
 determined-0.118
 unsure-0.107
 thirsty-0.106
 hungry-0.104
 confused-0.103
Token sip
Token ID31145
Feature activation-3.05e+00

Pos logit contributions
 determined+0.155
 unsure+0.146
 hungry+0.138
 thirsty+0.138
 confused+0.138
Neg logit contributions
eds-0.160
ara-0.160
ucks-0.153
of-0.151
 is-0.144
Token of
Token ID286
Feature activation+3.61e+00

Pos logit contributions
 determined+0.179
 unsure+0.163
 thirsty+0.160
 hungry+0.160
 confused+0.158
Neg logit contributions
ucks-0.144
ara-0.144
eds-0.142
 is-0.128
of-0.128
Token her
Token ID607
Feature activation-1.78e-01

Pos logit contributions
ucks+0.187
ara+0.185
eds+0.185
 is+0.167
 us+0.167
Neg logit contributions
 determined-0.193
 unsure-0.175
 nervous-0.170
 hungry-0.169
 thirsty-0.169
Token
Token ID220
Feature activation-4.61e+00

Pos logit contributions
ucks+0.279
ara+0.279
eds+0.266
 is+0.250
of+0.246
Neg logit contributions
 determined-0.283
 thirsty-0.259
 hungry-0.256
 unsure-0.255
 nervous-0.248
Token
Token ID198
Feature activation+3.74e+00

Pos logit contributions
 determined+0.215
 unsure+0.197
 nervous+0.196
 hungry+0.196
 curious+0.195
Neg logit contributions
eds-0.268
ucks-0.261
ara-0.253
 occasions-0.244
ent-0.240
Token
Token ID198
Feature activation+3.74e+00

Pos logit contributions
ucks+0.197
eds+0.194
ara+0.190
icer+0.175
ent+0.169
Neg logit contributions
 determined-0.204
 unsure-0.184
 hungry-0.183
 nervous-0.180
 curious-0.180
TokenAmy
Token ID40797
Feature activation+5.96e+00

Pos logit contributions
ucks+0.172
eds+0.168
ara+0.164
icer+0.148
ent+0.141
Neg logit contributions
 determined-0.233
 unsure-0.211
 hungry-0.210
 thirsty-0.206
 nervous-0.206
Token was
Token ID373
Feature activation+9.04e+00

Pos logit contributions
ucks+0.251
ara+0.249
eds+0.248
of+0.215
 is+0.215
Neg logit contributions
 determined-0.376
 unsure-0.350
 hungry-0.349
 thirsty-0.348
 nervous-0.342
Token amazed
Token ID24562
Feature activation-1.47e+01

Pos logit contributions
 is+0.526
eds+0.510
e+0.510
 us+0.482
ate+0.479
Neg logit contributions
dream-0.276
 determined-0.260
 he-0.242
 unsure-0.241
chard-0.229
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
 determined+0.929
 thirsty+0.858
 hungry+0.834
 unsure+0.825
 nervous+0.810
Neg logit contributions
ara-0.646
ucks-0.613
eds-0.589
 us-0.569
of-0.562
Token She
Token ID1375
Feature activation-8.41e+00

Pos logit contributions
ara+0.006
ucks+0.006
 is+0.006
eds+0.006
of+0.006
Neg logit contributions
 determined-0.004
 thirsty-0.004
 hungry-0.004
 unsure-0.004
 nervous-0.004
Token had
Token ID550
Feature activation+3.25e-02

Pos logit contributions
 determined+0.415
 unsure+0.382
 hungry+0.381
 thirsty+0.381
 nervous+0.371
Neg logit contributions
ara-0.461
eds-0.460
ucks-0.459
 is-0.418
of-0.415
Token an
Token ID281
Feature activation+1.54e+00

Pos logit contributions
eds+0.002
ara+0.002
ucks+0.002
 is+0.001
 us+0.001
Neg logit contributions
 determined-0.002
 unsure-0.002
 thirsty-0.002
 hungry-0.002
 nervous-0.002
Token easy
Token ID2562
Feature activation-5.17e+00

Pos logit contributions
ucks+0.083
ara+0.082
eds+0.082
 is+0.079
ac+0.079
Neg logit contributions
 determined-0.068
 thirsty-0.065
 hungry-0.065
 unsure-0.063
 nervous-0.062
Token
Token ID198
Feature activation+3.18e+00

Pos logit contributions
ucks+0.190
eds+0.183
ara+0.182
ac+0.172
ese+0.169
Neg logit contributions
 determined-0.151
 excited-0.134
 hungry-0.134
 curious-0.132
 nervous-0.130
TokenSarah
Token ID29284
Feature activation+4.25e+00

Pos logit contributions
ucks+0.182
eds+0.173
ara+0.165
icer+0.156
ases+0.154
Neg logit contributions
 determined-0.174
 unsure-0.146
 confident-0.145
 curious-0.145
dream-0.145
Token was
Token ID373
Feature activation+8.45e+00

Pos logit contributions
ucks+0.221
eds+0.209
ara+0.201
icer+0.183
ases+0.181
Neg logit contributions
 determined-0.247
 unsure-0.220
 hungry-0.219
 confused-0.217
 nervous-0.216
Token so
Token ID523
Feature activation-1.11e+01

Pos logit contributions
ac+0.479
e+0.478
 is+0.476
eds+0.454
ma+0.444
Neg logit contributions
dream-0.264
 unsure-0.235
 he-0.233
 determined-0.226
 thirsty-0.223
Token happy
Token ID3772
Feature activation-1.05e+01

Pos logit contributions
 determined+0.299
 unsure+0.295
dream+0.274
 wondering+0.265
 searching+0.258
Neg logit contributions
ac-0.740
ara-0.726
ucks-0.721
 is-0.710
ee-0.698
Token she
Token ID673
Feature activation-1.35e+01

Pos logit contributions
 determined+0.563
 thirsty+0.515
 unsure+0.511
 confused+0.503
 hungry+0.498
Neg logit contributions
ara-0.546
ucks-0.531
eds-0.529
 is-0.515
 us-0.484
Token had
Token ID550
Feature activation-7.10e-01

Pos logit contributions
 determined+0.675
 unsure+0.630
 thirsty+0.619
 hungry+0.618
 nervous+0.599
Neg logit contributions
ara-0.723
eds-0.722
ucks-0.710
 is-0.666
of-0.644
Token been
Token ID587
Feature activation+8.75e+00

Pos logit contributions
 determined+0.038
 unsure+0.035
 hungry+0.035
 thirsty+0.035
 nervous+0.034
Neg logit contributions
ucks-0.036
ara-0.036
eds-0.036
 is-0.032
of-0.032
Token saved
Token ID7448
Feature activation-2.89e+00

Pos logit contributions
eds+0.579
ucks+0.552
ac+0.547
ara+0.545
 is+0.532
Neg logit contributions
 determined-0.270
 unsure-0.255
dream-0.238
 thirsty-0.228
 confused-0.228
Token!
Token ID0
Feature activation+4.54e+00

Pos logit contributions
 determined+0.151
 unsure+0.138
 hungry+0.138
 nervous+0.136
 thirsty+0.135
Neg logit contributions
ucks-0.158
eds-0.154
ara-0.153
icer-0.136
of-0.135
Token She
Token ID1375
Feature activation-9.24e+00

Pos logit contributions
ucks+0.315
eds+0.311
ara+0.310
icer+0.283
of+0.283
Neg logit contributions
 determined-0.169
 unsure-0.147
 hungry-0.146
 thirsty-0.144
 nervous-0.142
Token
Token ID198
Feature activation+3.33e+00

Pos logit contributions
 determined+0.248
 nervous+0.227
 unsure+0.226
 hungry+0.225
 thirsty+0.223
Neg logit contributions
eds-0.293
ucks-0.286
ara-0.279
 occasions-0.263
icer-0.262
Token
Token ID198
Feature activation+3.30e+00

Pos logit contributions
ucks+0.178
eds+0.177
ara+0.171
icer+0.159
ent+0.155
Neg logit contributions
 determined-0.176
 hungry-0.161
 thirsty-0.160
 nervous-0.158
 unsure-0.158
TokenS
Token ID50
Feature activation+1.22e+01

Pos logit contributions
ucks+0.136
eds+0.133
ara+0.129
icer+0.114
 is+0.109
Neg logit contributions
 determined-0.219
 hungry-0.202
 unsure-0.201
 thirsty-0.201
 nervous-0.198
Tokenally
Token ID453
Feature activation+1.08e+01

Pos logit contributions
eds+0.612
ucks+0.606
icer+0.532
ases+0.526
 occasions+0.511
Neg logit contributions
 determined-0.644
 hungry-0.606
 thirsty-0.602
 she-0.589
dream-0.582
Token was
Token ID373
Feature activation+8.55e+00

Pos logit contributions
ucks+0.482
eds+0.473
ara+0.472
of+0.406
icer+0.406
Neg logit contributions
 determined-0.669
 unsure-0.619
 hungry-0.616
 thirsty-0.612
 nervous-0.606
Token determined
Token ID5295
Feature activation-2.26e+01

Pos logit contributions
 is+0.476
e+0.455
eds+0.444
 us+0.433
ate+0.413
Neg logit contributions
dream-0.271
chard-0.231
 he-0.228
 determined-0.225
 she-0.225
Token to
Token ID284
Feature activation-1.74e+00

Pos logit contributions
 determined+1.145
 thirsty+1.078
 unsure+1.063
 hungry+1.062
 confused+1.034
Neg logit contributions
eds-1.181
ara-1.166
 is-1.124
ucks-1.118
 us-1.078
Token get
Token ID651
Feature activation+3.82e+00

Pos logit contributions
 determined+0.100
 unsure+0.091
 hungry+0.091
 thirsty+0.090
 nervous+0.090
Neg logit contributions
ucks-0.087
ara-0.085
eds-0.085
icer-0.076
of-0.073
Token her
Token ID607
Feature activation+9.89e-01

Pos logit contributions
ucks+0.167
eds+0.166
 us+0.158
 is+0.153
ara+0.146
Neg logit contributions
 determined-0.213
 unsure-0.201
dream-0.201
 delighted-0.193
 confident-0.189
Token balloon
Token ID21190
Feature activation+6.80e-01

Pos logit contributions
ucks+0.046
eds+0.045
ara+0.045
of+0.039
 is+0.039
Neg logit contributions
 determined-0.059
 unsure-0.055
 hungry-0.054
 thirsty-0.054
 nervous-0.053
Token back
Token ID736
Feature activation+1.72e+00

Pos logit contributions
eds+0.029
ara+0.028
ucks+0.027
 is+0.027
of+0.026
Neg logit contributions
 determined-0.041
 unsure-0.038
 hungry-0.037
 thirsty-0.037
 confused-0.036
Token outside
Token ID2354
Feature activation-4.99e+00

Pos logit contributions
ucks+0.245
ara+0.238
eds+0.235
ac+0.217
icer+0.216
Neg logit contributions
 determined-0.167
 hungry-0.157
 curious-0.154
 nervous-0.150
 thirsty-0.148
Token.
Token ID13
Feature activation-7.31e-01

Pos logit contributions
 determined+0.276
 unsure+0.253
 hungry+0.252
 thirsty+0.250
 nervous+0.247
Neg logit contributions
ucks-0.255
eds-0.251
ara-0.250
of-0.220
icer-0.220
Token One
Token ID1881
Feature activation-1.83e+00

Pos logit contributions
 determined+0.029
 hungry+0.026
 unsure+0.026
 thirsty+0.026
 nervous+0.025
Neg logit contributions
eds-0.049
ucks-0.048
ara-0.046
ent-0.043
ac-0.043
Token day
Token ID1110
Feature activation-1.87e+00

Pos logit contributions
 determined+0.104
 unsure+0.096
 hungry+0.095
 thirsty+0.094
 nervous+0.093
Neg logit contributions
ucks-0.091
ara-0.090
eds-0.090
of-0.079
icer-0.078
Token,
Token ID11
Feature activation-3.23e+00

Pos logit contributions
 determined+0.098
 unsure+0.090
 thirsty+0.086
 confused+0.086
 hungry+0.085
Neg logit contributions
ara-0.103
ucks-0.098
eds-0.097
 is-0.092
of-0.090
Token she
Token ID673
Feature activation-1.34e+01

Pos logit contributions
 determined+0.131
 unsure+0.114
 confident+0.107
 desperate+0.105
 confused+0.103
Neg logit contributions
ara-0.215
 is-0.199
 us-0.193
of-0.191
ucks-0.191
Token decided
Token ID3066
Feature activation-1.07e+01

Pos logit contributions
 determined+0.630
 hungry+0.588
 thirsty+0.585
 nervous+0.569
 curious+0.563
Neg logit contributions
ucks-0.807
eds-0.783
ara-0.756
icer-0.718
 occasions-0.707
Token to
Token ID284
Feature activation-1.79e+00

Pos logit contributions
 determined+0.615
 unsure+0.607
 thirsty+0.577
 confused+0.567
 desperate+0.566
Neg logit contributions
 is-0.456
 us-0.443
ara-0.443
eds-0.429
of-0.394
Token adventure
Token ID8855
Feature activation-1.74e-01

Pos logit contributions
 determined+0.105
 hungry+0.096
 curious+0.095
 thirsty+0.095
 nervous+0.095
Neg logit contributions
ucks-0.089
eds-0.087
ara-0.085
icer-0.078
of-0.072
Token around
Token ID1088
Feature activation-3.75e+00

Pos logit contributions
 determined+0.009
 unsure+0.008
 hungry+0.008
 thirsty+0.008
 confused+0.008
Neg logit contributions
eds-0.009
ucks-0.009
ara-0.009
 is-0.009
of-0.009
Token her
Token ID607
Feature activation+9.68e-01

Pos logit contributions
 determined+0.175
 hungry+0.158
 unsure+0.157
 thirsty+0.155
 nervous+0.155
Neg logit contributions
ucks-0.230
ara-0.224
eds-0.224
icer-0.202
of-0.200