TOP ACTIVATIONS
MAX = 0.192

Sof ia ," said Jo Jo . " Why are you
uncomfortable . Jo an saw that it was
Jo jo . Jo jo looked very sad .
, " Be ep be ep !" and drive away .
room and be ep be ep . One day
They say , " Be ep be ep !" and drive
â Jo hopped on the bike and
every day ." Jo jo smiled and said :
room v room and be ep be ep .
what was inside , El isa opened the ig loo and
makes them zoom and be ep on the floor . Sometimes
much !" Jo anne smiled and thanked the
Before he left , Jo Jo hugged his mom and said
. The therm ometer be ep ed and the doctor said
of the night . <|endoftext|> Jo Jo was walking outside on
the night . <|endoftext|> Jo Jo was walking outside on a
the door ." Jo thought for a moment .
princess . Jo anne had so much fun
some ?" Jo was so excited . She
be hurt . Jo anna smiled and said to

MIN ACTIVATIONS
MIN = -0.125

he liked a big soft swing
other family a dad , a
special thing wings . Every day
a normal penny â it was special .
, one dull â and he had to
Thank you ! I like to
two penn ies â one shiny , one
rare silence that added to the
Thank you for supporting me ,
Thank you . â
special one a scary story !
very sad it was her favourite
for him it was a big
something special a helpful cabinet .
Thank you , my dear .
so excited she couldn 't wait
perfect coat it was soft and
special day the boat had arrived
Thank you . I will be
Thank you for touching me .

INTERVAL 0.113 - 0.192
CONTAINS 0.063%

their mom and dad , who were waiting for them .
happy boy . <|endoftext|> Jo Jo was sad . He wanted
They hope to find someone who knows how to open the
!" A man who was walking by heard Tom
She hears a loud th ud . She opens the lid

INTERVAL 0.034 - 0.113
CONTAINS 9.756%

now !" Ben says . " I don 't want to
that day on Sarah was very careful when she saw the
v ase on her mom my 's table . Lily wanted
They felt the grass under their feet and the wind in
pond , the goose started swimming in the clear , cool

INTERVAL -0.045 - 0.034
CONTAINS 86.669%

the mud . It was so much fun ! Tim my
but this popcorn was disgusting ! It tasted y ucky and
peanut rolled closer and closer and it saw a fish !
ocean with some other fishes . They all jumped into the
me ?" Max asked . " Yes , I

INTERVAL -0.125 - -0.045
CONTAINS 3.512%

a big table with many bottles . The bottles have different
. Betty be amed with happiness and skipped all the way
she was so excited . She asked the fairy what it
curious and brave Min nie promised to stay safe and take
they played together happily . From then on , they built
Token Sof
Token ID41427
Feature activation-1.73e-02

Pos logit contributions
ep+0.031
we+0.031
ophone+0.030
ician+0.030
utes+0.030
Neg logit contributions
,-0.012
 Daisy-0.011
 her-0.010
 Sally-0.010
 she-0.010
Tokenia
Token ID544
Feature activation+2.69e-02

Pos logit contributions
 Daisy+0.034
 Sally+0.032
 berries+0.031
 she+0.030
,+0.030
Neg logit contributions
ep-0.031
ophone-0.028
we-0.028
ician-0.027
ber-0.027
Token,"
Token ID553
Feature activation-6.92e-03

Pos logit contributions
ep+0.068
ophone+0.064
we+0.063
anas+0.062
ician+0.062
Neg logit contributions
 Daisy-0.033
 Sally-0.031
 berries-0.028
,-0.027
 her-0.027
Token said
Token ID531
Feature activation-7.34e-03

Pos logit contributions
 Daisy+0.006
 berries+0.006
,+0.006
 Sally+0.005
 taste+0.005
Neg logit contributions
ep-0.018
we-0.017
ician-0.017
ophone-0.017
anas-0.017
Token Jo
Token ID5302
Feature activation+1.15e-01

Pos logit contributions
 berries+0.005
 Daisy+0.005
,+0.004
 She+0.004
 Sally+0.004
Neg logit contributions
ep-0.021
ophone-0.020
ician-0.020
we-0.020
utes-0.019
TokenJo
Token ID9908
Feature activation+1.78e-01

Pos logit contributions
ep+0.311
ophone+0.300
ician+0.298
we+0.295
utes+0.294
Neg logit contributions
 Daisy-0.120
,-0.118
 Sally-0.107
 she-0.102
 her-0.102
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
ep+0.469
ophone+0.452
ician+0.449
utes+0.448
we+0.447
Neg logit contributions
,-0.204
 Daisy-0.201
 Sally-0.180
 her-0.174
 she-0.173
Token "
Token ID366
Feature activation+4.66e-02

Pos logit contributions
ep+0.071
ophone+0.069
ician+0.068
we+0.068
utes+0.067
Neg logit contributions
 Daisy-0.028
,-0.028
 Sally-0.025
 she-0.024
 her-0.024
TokenWhy
Token ID5195
Feature activation-8.05e-04

Pos logit contributions
ep+0.101
ophone+0.098
utes+0.095
anas+0.094
ician+0.093
Neg logit contributions
 Daisy-0.075
 She-0.072
 she-0.071
 Sally-0.070
 her-0.068
Token are
Token ID389
Feature activation-1.65e-03

Pos logit contributions
 Daisy+0.001
 Sally+0.001
 berries+0.001
,+0.001
 her+0.001
Neg logit contributions
ep-0.002
ophone-0.002
anas-0.002
we-0.002
ician-0.002
Token you
Token ID345
Feature activation-8.28e-03

Pos logit contributions
,+0.002
 Daisy+0.002
 Sally+0.002
 her+0.002
 she+0.002
Neg logit contributions
ep-0.004
ophone-0.004
we-0.004
ician-0.004
utes-0.004
Token uncomfortable
Token ID12916
Feature activation-2.36e-02

Pos logit contributions
ep+0.156
aker+0.143
we+0.142
utes+0.138
washer+0.136
Neg logit contributions
 Daisy-0.136
 Sally-0.127
 berries-0.126
 Polly-0.118
 Betty-0.113
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
 Daisy+0.030
 Sally+0.027
 berries+0.026
 Polly+0.024
,+0.024
Neg logit contributions
ep-0.057
we-0.057
ophone-0.054
ician-0.053
aker-0.053
Token
Token ID220
Feature activation+7.93e-03

Pos logit contributions
ep+0.042
ophone+0.042
ician+0.042
we+0.042
anas+0.042
Neg logit contributions
,-0.015
 berries-0.015
 Daisy-0.014
 end-0.013
 taste-0.013
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
ep+0.019
ophone+0.018
utes+0.018
we+0.018
ician+0.017
Neg logit contributions
 Daisy-0.010
,-0.010
 berries-0.010
 Sally-0.009
 she-0.008
Token
Token ID198
Feature activation+8.69e-03

Pos logit contributions
ep+0.027
ophone+0.026
ician+0.026
we+0.025
utes+0.025
Neg logit contributions
 Daisy-0.012
,-0.012
 Sally-0.011
 her-0.011
 she-0.011
TokenJo
Token ID9908
Feature activation+1.67e-01

Pos logit contributions
ep+0.017
ophone+0.017
ician+0.016
utes+0.016
we+0.016
Neg logit contributions
 Daisy-0.015
,-0.015
 Sally-0.014
 she-0.014
 her-0.014
Tokenan
Token ID272
Feature activation+2.93e-02

Pos logit contributions
ep+0.356
ophone+0.344
utes+0.341
ician+0.339
we+0.335
Neg logit contributions
,-0.271
 Daisy-0.266
 Sally-0.245
 her-0.243
 she-0.242
Token saw
Token ID2497
Feature activation+6.17e-02

Pos logit contributions
ep+0.073
ophone+0.069
ician+0.068
we+0.068
utes+0.066
Neg logit contributions
 Daisy-0.037
,-0.035
 Sally-0.035
 her-0.032
 she-0.032
Token that
Token ID326
Feature activation+1.71e-03

Pos logit contributions
ep+0.173
we+0.169
ophone+0.162
ician+0.161
etr+0.159
Neg logit contributions
 Daisy-0.046
,-0.043
 Sally-0.040
 berries-0.038
 She-0.037
Token it
Token ID340
Feature activation-6.43e-03

Pos logit contributions
ep+0.005
we+0.005
ophone+0.005
ician+0.004
ber+0.004
Neg logit contributions
 Daisy-0.001
 berries-0.001
,-0.001
 Sally-0.001
 Freddy-0.001
Token was
Token ID373
Feature activation-9.72e-03

Pos logit contributions
 Sally+0.009
 Daisy+0.009
 Gig+0.008
 Sophie+0.008
 berries+0.008
Neg logit contributions
ep-0.015
we-0.014
ophone-0.014
ician-0.013
etr-0.013
Token Jo
Token ID5302
Feature activation+1.08e-01

Pos logit contributions
 Daisy+0.001
,+0.001
 Sally+0.001
 her+0.001
 she+0.001
Neg logit contributions
ep-0.004
ophone-0.004
ician-0.004
utes-0.004
we-0.004
Tokenjo
Token ID7639
Feature activation+1.28e-01

Pos logit contributions
plug+0.153
utes+0.153
ep+0.150
anas+0.147
ician+0.143
Neg logit contributions
,-0.272
 Daisy-0.248
 Sally-0.242
 she-0.241
 her-0.237
Token.
Token ID13
Feature activation+3.28e-03

Pos logit contributions
ep+0.363
ophone+0.353
ician+0.347
we+0.346
anas+0.342
Neg logit contributions
 Daisy-0.110
,-0.099
 Sally-0.097
 her-0.089
 she-0.088
Token
Token ID198
Feature activation+4.70e-03

Pos logit contributions
ep+0.010
ician+0.009
ophone+0.009
aker+0.009
aghetti+0.009
Neg logit contributions
,-0.004
 Daisy-0.003
 She-0.003
 Sally-0.003
 she-0.003
Token
Token ID198
Feature activation+5.14e-03

Pos logit contributions
ophone+0.016
aghetti+0.015
ician+0.015
aker+0.015
anas+0.015
Neg logit contributions
,-0.003
 the-0.002
 Daisy-0.002
 She-0.002
 she-0.002
TokenJo
Token ID9908
Feature activation+1.66e-01

Pos logit contributions
ophone+0.011
utes+0.011
ician+0.011
anas+0.011
etr+0.010
Neg logit contributions
,-0.009
 she-0.008
 Daisy-0.008
 She-0.008
 her-0.007
Tokenjo
Token ID7639
Feature activation+1.29e-01

Pos logit contributions
aghetti+0.213
utes+0.209
anas+0.199
 keyboard+0.187
plug+0.186
Neg logit contributions
,-0.445
 Daisy-0.390
 her-0.379
 Sally-0.375
 the-0.371
Token looked
Token ID3114
Feature activation+5.41e-02

Pos logit contributions
ep+0.330
ophone+0.315
ician+0.311
we+0.307
anas+0.302
Neg logit contributions
 Daisy-0.147
,-0.137
 Sally-0.133
 her-0.127
 she-0.125
Token very
Token ID845
Feature activation+7.06e-02

Pos logit contributions
ep+0.122
ophone+0.121
utes+0.112
ician+0.110
washer+0.106
Neg logit contributions
 Daisy-0.070
 Sally-0.066
 berries-0.064
 she-0.064
,-0.061
Token sad
Token ID6507
Feature activation-2.89e-02

Pos logit contributions
ep+0.139
utes+0.130
ophone+0.125
ike+0.125
ician+0.124
Neg logit contributions
 Daisy-0.111
 berries-0.107
 Sally-0.101
,-0.099
 she-0.097
Token.
Token ID13
Feature activation+6.48e-03

Pos logit contributions
 Daisy+0.031
 Sally+0.028
,+0.026
 berries+0.026
 her+0.025
Neg logit contributions
ep-0.075
ophone-0.072
we-0.072
utes-0.071
ician-0.071
Token,
Token ID11
Feature activation+2.39e-02

Pos logit contributions
ep+0.062
ophone+0.060
ician+0.059
we+0.059
utes+0.059
Neg logit contributions
 Daisy-0.016
 berries-0.013
 Sally-0.013
,-0.013
 her-0.011
Token "
Token ID366
Feature activation+5.74e-02

Pos logit contributions
ep+0.056
ophone+0.054
ician+0.053
utes+0.053
we+0.053
Neg logit contributions
 Daisy-0.032
,-0.031
 Sally-0.029
 berries-0.029
 her-0.028
TokenBe
Token ID3856
Feature activation+5.93e-02

Pos logit contributions
ophone+0.120
ep+0.120
utes+0.117
ician+0.112
anas+0.111
Neg logit contributions
 Daisy-0.093
,-0.088
 Sally-0.086
 she-0.085
 her-0.083
Tokenep
Token ID538
Feature activation+1.64e-01

Pos logit contributions
ophone+0.080
utes+0.074
aghetti+0.072
we+0.069
ician+0.069
Neg logit contributions
,-0.150
 her-0.142
 the-0.140
 Daisy-0.136
 Sally-0.133
Token be
Token ID307
Feature activation+2.75e-02

Pos logit contributions
ep+0.505
ophone+0.495
ician+0.494
utes+0.492
we+0.489
Neg logit contributions
,-0.115
 Daisy-0.095
 Sally-0.080
 her-0.077
 she-0.073
Tokenep
Token ID538
Feature activation+1.66e-01

Pos logit contributions
ophone+0.011
we+0.011
ician+0.010
utes+0.009
aghetti+0.009
Neg logit contributions
,-0.093
 Daisy-0.089
 Sally-0.087
 her-0.087
 the-0.086
Token!"
Token ID2474
Feature activation+9.79e-03

Pos logit contributions
ep+0.454
ophone+0.447
ician+0.446
utes+0.444
we+0.442
Neg logit contributions
,-0.176
 Daisy-0.139
 Sally-0.126
 her-0.124
 she-0.119
Token and
Token ID290
Feature activation+1.60e-02

Pos logit contributions
ep+0.027
ophone+0.026
ician+0.025
we+0.025
utes+0.025
Neg logit contributions
 Daisy-0.010
,-0.010
 Sally-0.009
 her-0.009
 she-0.009
Token drive
Token ID3708
Feature activation+1.13e-01

Pos logit contributions
ep+0.041
ophone+0.039
we+0.038
utes+0.038
ician+0.038
Neg logit contributions
 Daisy-0.019
,-0.018
 Sally-0.017
 berries-0.016
 her-0.016
Token away
Token ID1497
Feature activation-4.49e-03

Pos logit contributions
ep+0.310
ophone+0.300
ician+0.299
we+0.296
utes+0.296
Neg logit contributions
,-0.115
 Daisy-0.114
 Sally-0.101
 her-0.098
 she-0.097
Token.
Token ID13
Feature activation+1.79e-02

Pos logit contributions
 Daisy+0.003
 berries+0.003
 Sally+0.002
 She+0.002
 her+0.002
Neg logit contributions
ep-0.014
utes-0.013
ophone-0.013
ician-0.013
we-0.013
Tokenroom
Token ID3823
Feature activation+1.12e-01

Pos logit contributions
ician+0.053
anas+0.049
ep+0.049
ophone+0.047
aghetti+0.046
Neg logit contributions
,-0.083
 Sally-0.074
 her-0.074
 from-0.073
 she-0.073
Token and
Token ID290
Feature activation+1.69e-02

Pos logit contributions
ep+0.297
ophone+0.284
we+0.280
ician+0.278
utes+0.275
Neg logit contributions
 Daisy-0.129
,-0.116
 Sally-0.115
 she-0.106
 her-0.106
Token be
Token ID307
Feature activation+2.93e-02

Pos logit contributions
ep+0.039
ophone+0.037
we+0.036
ician+0.036
utes+0.036
Neg logit contributions
 Daisy-0.025
,-0.024
 Sally-0.023
 her-0.022
 berries-0.022
Tokenep
Token ID538
Feature activation+1.63e-01

Pos logit contributions
ophone+0.031
utes+0.030
ician+0.030
aghetti+0.029
we+0.029
Neg logit contributions
,-0.077
 Daisy-0.076
 her-0.075
 Sally-0.074
 the-0.073
Token be
Token ID307
Feature activation+2.65e-02

Pos logit contributions
ep+0.429
ician+0.416
ophone+0.416
utes+0.412
we+0.410
Neg logit contributions
,-0.186
 Daisy-0.173
 Sally-0.158
 her-0.156
 she-0.154
Tokenep
Token ID538
Feature activation+1.64e-01

Pos logit contributions
ophone+0.008
ician+0.008
we+0.007
utes+0.007
aghetti+0.007
Neg logit contributions
,-0.090
 Daisy-0.088
 her-0.087
 Sally-0.087
 the-0.085
Token.
Token ID13
Feature activation+2.66e-02

Pos logit contributions
ep+0.408
ician+0.402
aghetti+0.398
ophone+0.398
utes+0.397
Neg logit contributions
,-0.214
 Daisy-0.176
 her-0.169
 she-0.167
 Sally-0.167
Token
Token ID198
Feature activation+2.15e-02

Pos logit contributions
ep+0.072
ophone+0.070
ician+0.069
utes+0.068
we+0.068
Neg logit contributions
 Daisy-0.027
,-0.027
 Sally-0.024
 her-0.023
 she-0.023
Token
Token ID198
Feature activation+1.80e-02

Pos logit contributions
ep+0.061
ophone+0.060
ician+0.059
we+0.059
utes+0.058
Neg logit contributions
,-0.020
 Daisy-0.020
 Sally-0.018
 her-0.017
 she-0.017
TokenOne
Token ID3198
Feature activation+1.91e-02

Pos logit contributions
ophone+0.036
ep+0.035
we+0.035
ician+0.035
anas+0.034
Neg logit contributions
 Daisy-0.031
,-0.030
 she-0.029
 her-0.029
 She-0.029
Token day
Token ID1110
Feature activation+1.63e-02

Pos logit contributions
we+0.057
ician+0.057
anas+0.057
ep+0.056
ophone+0.056
Neg logit contributions
,-0.019
 the-0.014
 she-0.013
 her-0.013
 Sally-0.012
Token They
Token ID1119
Feature activation+2.31e-02

Pos logit contributions
ep+0.047
ophone+0.044
we+0.044
ician+0.043
ber+0.042
Neg logit contributions
 Daisy-0.014
 berries-0.013
 She-0.013
 taste-0.011
 Everyday-0.011
Token say
Token ID910
Feature activation+2.12e-02

Pos logit contributions
ep+0.055
ophone+0.054
ician+0.050
anas+0.049
etr+0.049
Neg logit contributions
 Daisy-0.031
 Sally-0.029
,-0.027
 berries-0.027
 she-0.026
Token,
Token ID11
Feature activation+2.39e-02

Pos logit contributions
ep+0.062
ophone+0.060
ician+0.059
we+0.059
utes+0.059
Neg logit contributions
 Daisy-0.016
 berries-0.013
 Sally-0.013
,-0.013
 her-0.011
Token "
Token ID366
Feature activation+5.74e-02

Pos logit contributions
ep+0.056
ophone+0.054
ician+0.053
utes+0.053
we+0.053
Neg logit contributions
 Daisy-0.032
,-0.031
 Sally-0.029
 berries-0.029
 her-0.028
TokenBe
Token ID3856
Feature activation+5.93e-02

Pos logit contributions
ophone+0.120
ep+0.120
utes+0.117
ician+0.112
anas+0.111
Neg logit contributions
 Daisy-0.093
,-0.088
 Sally-0.086
 she-0.085
 her-0.083
Tokenep
Token ID538
Feature activation+1.64e-01

Pos logit contributions
ophone+0.080
utes+0.074
aghetti+0.072
we+0.069
ician+0.069
Neg logit contributions
,-0.150
 her-0.142
 the-0.140
 Daisy-0.136
 Sally-0.133
Token be
Token ID307
Feature activation+2.75e-02

Pos logit contributions
ep+0.505
ophone+0.495
ician+0.494
utes+0.492
we+0.489
Neg logit contributions
,-0.115
 Daisy-0.095
 Sally-0.080
 her-0.077
 she-0.073
Tokenep
Token ID538
Feature activation+1.66e-01

Pos logit contributions
ophone+0.011
we+0.011
ician+0.010
utes+0.009
aghetti+0.009
Neg logit contributions
,-0.093
 Daisy-0.089
 Sally-0.087
 her-0.087
 the-0.086
Token!"
Token ID2474
Feature activation+9.79e-03

Pos logit contributions
ep+0.454
ophone+0.447
ician+0.446
utes+0.444
we+0.442
Neg logit contributions
,-0.176
 Daisy-0.139
 Sally-0.126
 her-0.124
 she-0.119
Token and
Token ID290
Feature activation+1.60e-02

Pos logit contributions
ep+0.027
ophone+0.026
ician+0.025
we+0.025
utes+0.025
Neg logit contributions
 Daisy-0.010
,-0.010
 Sally-0.009
 her-0.009
 she-0.009
Token drive
Token ID3708
Feature activation+1.13e-01

Pos logit contributions
ep+0.041
ophone+0.039
we+0.038
utes+0.038
ician+0.038
Neg logit contributions
 Daisy-0.019
,-0.018
 Sally-0.017
 berries-0.016
 her-0.016
Tokenâ
Token ID22940
Feature activation+5.24e-02

Pos logit contributions
ep+0.070
ophone+0.069
ician+0.068
utes+0.067
we+0.067
Neg logit contributions
,-0.032
 Daisy-0.032
 her-0.029
 Sally-0.028
 she-0.028
Token
Token ID26391
Feature activation+5.14e-02

Pos logit contributions
utes+0.099
we+0.098
ep+0.098
anas+0.096
ophone+0.094
Neg logit contributions
,-0.096
 Daisy-0.092
 Sally-0.087
 She-0.084
 her-0.084
Token
Token ID220
Feature activation+6.92e-03

Pos logit contributions
ep+0.131
ophone+0.126
ician+0.125
aghetti+0.125
utes+0.124
Neg logit contributions
,-0.066
 Daisy-0.059
 she-0.055
 Sally-0.055
 She-0.053
Token
Token ID198
Feature activation+9.14e-03

Pos logit contributions
ep+0.018
ophone+0.018
utes+0.017
ician+0.017
we+0.017
Neg logit contributions
 Daisy-0.008
,-0.008
 Sally-0.007
 her-0.007
 she-0.007
Token
Token ID198
Feature activation+7.95e-03

Pos logit contributions
ep+0.026
ophone+0.026
ician+0.026
we+0.025
utes+0.025
Neg logit contributions
 Daisy-0.008
,-0.008
 Sally-0.007
 she-0.007
 She-0.007
TokenJo
Token ID9908
Feature activation+1.64e-01

Pos logit contributions
ophone+0.016
ician+0.015
ep+0.015
utes+0.014
anas+0.014
Neg logit contributions
 Daisy-0.014
,-0.014
 she-0.014
 She-0.014
 her-0.013
Token hopped
Token ID45230
Feature activation-1.88e-02

Pos logit contributions
ep+0.301
utes+0.297
aghetti+0.293
ophone+0.292
ician+0.288
Neg logit contributions
,-0.319
 Daisy-0.298
 her-0.284
 she-0.282
 Sally-0.279
Token on
Token ID319
Feature activation+1.45e-02

Pos logit contributions
 Daisy+0.023
 berries+0.022
 Sally+0.021
,+0.020
 taste+0.020
Neg logit contributions
ep-0.044
ophone-0.042
utes-0.042
we-0.041
ician-0.040
Token the
Token ID262
Feature activation-2.70e-04

Pos logit contributions
ep+0.040
we+0.038
utes+0.038
ophone+0.038
ician+0.037
Neg logit contributions
 Daisy-0.012
,-0.011
 Sally-0.010
 berries-0.010
 she-0.009
Token bike
Token ID7161
Feature activation+5.47e-02

Pos logit contributions
,+0.000
 Daisy+0.000
 her+0.000
 she+0.000
 Sally+0.000
Neg logit contributions
ep-0.001
ophone-0.001
ician-0.001
we-0.001
utes-0.001
Token and
Token ID290
Feature activation+1.57e-03

Pos logit contributions
ep+0.160
ophone+0.156
ician+0.153
utes+0.152
we+0.152
Neg logit contributions
 Daisy-0.045
,-0.041
 Sally-0.038
 her-0.035
 she-0.035
Token every
Token ID790
Feature activation-9.80e-03

Pos logit contributions
ep+0.137
ophone+0.132
we+0.131
utes+0.130
ician+0.129
Neg logit contributions
 Daisy-0.053
 Sally-0.047
,-0.046
 berries-0.043
 she-0.043
Token day
Token ID1110
Feature activation+1.27e-02

Pos logit contributions
 Daisy+0.014
,+0.013
 Sally+0.013
 her+0.012
 She+0.012
Neg logit contributions
ep-0.022
ophone-0.021
ician-0.021
we-0.021
utes-0.021
Token."
Token ID526
Feature activation+2.38e-03

Pos logit contributions
ep+0.031
ophone+0.030
we+0.030
ician+0.030
utes+0.030
Neg logit contributions
 Daisy-0.014
 berries-0.013
 Sally-0.013
,-0.012
 her-0.012
Token
Token ID198
Feature activation+6.93e-03

Pos logit contributions
ep+0.006
ophone+0.006
ician+0.006
aghetti+0.006
anas+0.006
Neg logit contributions
,-0.003
 Daisy-0.003
 Sally-0.003
 she-0.003
 her-0.003
Token
Token ID198
Feature activation+7.00e-03

Pos logit contributions
ophone+0.021
aghetti+0.021
ep+0.021
ician+0.021
anas+0.021
Neg logit contributions
,-0.006
 Daisy-0.005
 the-0.004
 she-0.004
 her-0.004
TokenJo
Token ID9908
Feature activation+1.63e-01

Pos logit contributions
ophone+0.011
aghetti+0.010
anas+0.010
ician+0.010
utes+0.010
Neg logit contributions
,-0.017
 she-0.015
 the-0.015
 Daisy-0.015
 her-0.015
Tokenjo
Token ID7639
Feature activation+1.26e-01

Pos logit contributions
aghetti+0.195
utes+0.195
anas+0.175
 keyboard+0.168
ophone+0.167
Neg logit contributions
,-0.437
 Daisy-0.395
 her-0.392
 the-0.381
 Sally-0.374
Token smiled
Token ID13541
Feature activation-3.26e-03

Pos logit contributions
ep+0.324
ophone+0.310
ician+0.308
we+0.305
utes+0.302
Neg logit contributions
 Daisy-0.147
,-0.142
 Sally-0.133
 she-0.127
 her-0.127
Token and
Token ID290
Feature activation+7.20e-03

Pos logit contributions
 Daisy+0.004
 berries+0.003
 Sally+0.003
 are+0.003
 she+0.003
Neg logit contributions
ep-0.008
ician-0.008
utes-0.008
we-0.008
ophone-0.008
Token said
Token ID531
Feature activation-1.36e-02

Pos logit contributions
ep+0.018
ophone+0.017
ician+0.017
we+0.016
utes+0.016
Neg logit contributions
 Daisy-0.009
,-0.009
 berries-0.008
 Sally-0.008
 her-0.007
Token:
Token ID25
Feature activation+2.17e-02

Pos logit contributions
 Daisy+0.014
 Sally+0.013
 berries+0.013
,+0.012
 her+0.011
Neg logit contributions
ep-0.035
ophone-0.034
ician-0.034
we-0.034
utes-0.034
Tokenroom
Token ID3823
Feature activation+1.15e-01

Pos logit contributions
ep+0.064
ophone+0.062
ician+0.062
utes+0.061
we+0.061
Neg logit contributions
 Daisy-0.030
,-0.030
 Sally-0.027
 berries-0.026
 her-0.026
Token v
Token ID410
Feature activation+3.65e-02

Pos logit contributions
ep+0.327
ophone+0.317
ician+0.316
utes+0.313
we+0.312
Neg logit contributions
,-0.102
 Daisy-0.099
 Sally-0.086
 her-0.084
 she-0.083
Tokenroom
Token ID3823
Feature activation+1.12e-01

Pos logit contributions
ician+0.053
anas+0.049
ep+0.049
ophone+0.047
aghetti+0.046
Neg logit contributions
,-0.083
 Sally-0.074
 her-0.074
 from-0.073
 she-0.073
Token and
Token ID290
Feature activation+1.69e-02

Pos logit contributions
ep+0.297
ophone+0.284
we+0.280
ician+0.278
utes+0.275
Neg logit contributions
 Daisy-0.129
,-0.116
 Sally-0.115
 she-0.106
 her-0.106
Token be
Token ID307
Feature activation+2.93e-02

Pos logit contributions
ep+0.039
ophone+0.037
we+0.036
ician+0.036
utes+0.036
Neg logit contributions
 Daisy-0.025
,-0.024
 Sally-0.023
 her-0.022
 berries-0.022
Tokenep
Token ID538
Feature activation+1.63e-01

Pos logit contributions
ophone+0.031
utes+0.030
ician+0.030
aghetti+0.029
we+0.029
Neg logit contributions
,-0.077
 Daisy-0.076
 her-0.075
 Sally-0.074
 the-0.073
Token be
Token ID307
Feature activation+2.65e-02

Pos logit contributions
ep+0.429
ician+0.416
ophone+0.416
utes+0.412
we+0.410
Neg logit contributions
,-0.186
 Daisy-0.173
 Sally-0.158
 her-0.156
 she-0.154
Tokenep
Token ID538
Feature activation+1.64e-01

Pos logit contributions
ophone+0.008
ician+0.008
we+0.007
utes+0.007
aghetti+0.007
Neg logit contributions
,-0.090
 Daisy-0.088
 her-0.087
 Sally-0.087
 the-0.085
Token.
Token ID13
Feature activation+2.66e-02

Pos logit contributions
ep+0.408
ician+0.402
aghetti+0.398
ophone+0.398
utes+0.397
Neg logit contributions
,-0.214
 Daisy-0.176
 her-0.169
 she-0.167
 Sally-0.167
Token
Token ID198
Feature activation+2.15e-02

Pos logit contributions
ep+0.072
ophone+0.070
ician+0.069
utes+0.068
we+0.068
Neg logit contributions
 Daisy-0.027
,-0.027
 Sally-0.024
 her-0.023
 she-0.023
Token
Token ID198
Feature activation+1.80e-02

Pos logit contributions
ep+0.061
ophone+0.060
ician+0.059
we+0.059
utes+0.058
Neg logit contributions
,-0.020
 Daisy-0.020
 Sally-0.018
 her-0.017
 she-0.017
Token what
Token ID644
Feature activation+4.89e-02

Pos logit contributions
ep+0.081
ophone+0.078
we+0.076
utes+0.076
ician+0.075
Neg logit contributions
 Daisy-0.044
 Sally-0.040
,-0.039
 berries-0.037
 She-0.036
Token was
Token ID373
Feature activation+6.80e-03

Pos logit contributions
ep+0.119
we+0.116
ophone+0.115
utes+0.112
ician+0.111
Neg logit contributions
 Daisy-0.058
,-0.054
 Sally-0.053
 berries-0.049
 her-0.047
Token inside
Token ID2641
Feature activation-1.96e-04

Pos logit contributions
ep+0.014
ophone+0.013
we+0.013
ician+0.012
utes+0.012
Neg logit contributions
 Daisy-0.011
 berries-0.010
,-0.010
 Sally-0.010
 She-0.010
Token,
Token ID11
Feature activation+1.65e-02

Pos logit contributions
 Daisy+0.000
 berries+0.000
 Sally+0.000
 Freddy+0.000
 Gig+0.000
Neg logit contributions
ep-0.000
we-0.000
ophone-0.000
aker-0.000
washer-0.000
Token El
Token ID2574
Feature activation+8.59e-02

Pos logit contributions
ep+0.039
we+0.037
ophone+0.037
utes+0.036
anas+0.036
Neg logit contributions
 Daisy-0.019
,-0.018
 Sally-0.018
 berries-0.017
 Freddy-0.016
Tokenisa
Token ID9160
Feature activation+1.63e-01

Pos logit contributions
ep+0.151
ophone+0.144
ician+0.142
utes+0.140
anas+0.140
Neg logit contributions
,-0.174
 Daisy-0.169
 her-0.159
 Sally-0.159
 she-0.157
Token opened
Token ID4721
Feature activation+3.36e-03

Pos logit contributions
ep+0.402
ophone+0.381
we+0.379
ician+0.374
anas+0.369
Neg logit contributions
 Daisy-0.208
 Sally-0.194
 berries-0.181
,-0.171
 her-0.169
Token the
Token ID262
Feature activation+7.98e-03

Pos logit contributions
ep+0.009
utes+0.009
ician+0.008
ophone+0.008
we+0.008
Neg logit contributions
 Daisy-0.002
 berries-0.002
 Sally-0.002
 Freddy-0.002
 taste-0.002
Token ig
Token ID45329
Feature activation+9.70e-02

Pos logit contributions
ep+0.015
ophone+0.015
ician+0.014
utes+0.014
we+0.014
Neg logit contributions
 Daisy-0.014
 Sally-0.014
,-0.013
 she-0.013
 her-0.013
Tokenloo
Token ID29680
Feature activation+7.05e-02

Pos logit contributions
ician+0.178
ophone+0.163
anas+0.162
ep+0.158
etr+0.157
Neg logit contributions
,-0.200
 Sally-0.168
 she-0.167
 berries-0.163
 the-0.161
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
ep+0.180
we+0.171
ophone+0.169
washer+0.166
aker+0.163
Neg logit contributions
 Daisy-0.084
 Sally-0.075
 berries-0.072
 Gig-0.068
 Freddy-0.067
Token makes
Token ID1838
Feature activation+2.76e-02

Pos logit contributions
 Daisy+0.048
 Sally+0.045
,+0.043
 berries+0.042
 Freddy+0.039
Neg logit contributions
ep-0.077
ophone-0.072
ician-0.069
we-0.068
ber-0.068
Token them
Token ID606
Feature activation+2.57e-02

Pos logit contributions
ep+0.074
ophone+0.070
we+0.070
ician+0.069
utes+0.069
Neg logit contributions
 Daisy-0.030
,-0.029
 Sally-0.027
 she-0.025
 berries-0.024
Token zoom
Token ID19792
Feature activation+7.64e-02

Pos logit contributions
ep+0.062
ophone+0.059
ician+0.057
utes+0.057
we+0.056
Neg logit contributions
 Daisy-0.036
,-0.033
 Sally-0.033
 she-0.031
 She-0.030
Token and
Token ID290
Feature activation+2.80e-02

Pos logit contributions
ep+0.213
ophone+0.206
ician+0.205
we+0.203
utes+0.202
Neg logit contributions
,-0.074
 Daisy-0.073
 Sally-0.065
 her-0.062
 she-0.062
Token be
Token ID307
Feature activation+3.43e-02

Pos logit contributions
ep+0.058
ophone+0.054
we+0.054
ician+0.053
utes+0.052
Neg logit contributions
 Daisy-0.049
,-0.046
 Sally-0.046
 her-0.043
 she-0.042
Tokenep
Token ID538
Feature activation+1.63e-01

Pos logit contributions
aghetti+0.040
ophone+0.040
ician+0.040
utes+0.038
ber+0.038
Neg logit contributions
,-0.090
 her-0.083
 Daisy-0.083
 the-0.083
 Sally-0.081
Token on
Token ID319
Feature activation+3.12e-02

Pos logit contributions
ep+0.451
ophone+0.438
ician+0.437
utes+0.432
we+0.431
Neg logit contributions
,-0.159
 Daisy-0.151
 Sally-0.134
 her-0.131
 she-0.130
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
ep+0.096
ician+0.091
we+0.090
utes+0.090
ophone+0.090
Neg logit contributions
 Daisy-0.020
,-0.017
 Sally-0.017
 she-0.016
 berries-0.015
Token floor
Token ID4314
Feature activation+2.47e-02

Pos logit contributions
we+0.024
ep+0.024
ophone+0.024
ician+0.023
utes+0.023
Neg logit contributions
,-0.017
 Daisy-0.015
 the-0.015
 her-0.015
 Sally-0.014
Token.
Token ID13
Feature activation+2.81e-02

Pos logit contributions
ep+0.067
ophone+0.064
ician+0.063
we+0.063
utes+0.063
Neg logit contributions
 Daisy-0.027
,-0.024
 Sally-0.024
 she-0.022
 berries-0.022
Token Sometimes
Token ID8975
Feature activation-1.34e-02

Pos logit contributions
ep+0.082
utes+0.081
ophone+0.081
we+0.080
ician+0.080
Neg logit contributions
,-0.019
 Daisy-0.018
 berries-0.016
 Sally-0.015
 her-0.014
Token much
Token ID881
Feature activation+3.17e-03

Pos logit contributions
 Daisy+0.041
 Sally+0.038
 her+0.037
,+0.037
 berries+0.036
Neg logit contributions
ep-0.058
ician-0.053
utes-0.053
ophone-0.053
we-0.051
Token!"
Token ID2474
Feature activation-5.48e-03

Pos logit contributions
ep+0.008
anas+0.007
ician+0.007
ophone+0.007
utes+0.007
Neg logit contributions
 Daisy-0.004
 Sally-0.004
 her-0.003
 She-0.003
 Sophie-0.003
Token
Token ID220
Feature activation+1.06e-02

Pos logit contributions
 Daisy+0.006
,+0.006
 Sally+0.005
 she+0.005
 her+0.005
Neg logit contributions
ep-0.015
ophone-0.014
ician-0.014
utes-0.014
we-0.014
Token
Token ID198
Feature activation+7.58e-03

Pos logit contributions
ep+0.027
ophone+0.025
utes+0.025
we+0.025
ician+0.025
Neg logit contributions
 Daisy-0.013
,-0.012
 berries-0.011
 Sally-0.011
 her-0.011
Token
Token ID198
Feature activation+7.67e-03

Pos logit contributions
ophone+0.023
ep+0.023
aghetti+0.022
ician+0.022
utes+0.022
Neg logit contributions
,-0.006
 Daisy-0.006
 she-0.005
 her-0.005
 Sally-0.005
TokenJo
Token ID9908
Feature activation+1.62e-01

Pos logit contributions
ophone+0.014
aghetti+0.013
utes+0.012
anas+0.012
aker+0.012
Neg logit contributions
,-0.016
 she-0.016
 her-0.015
 She-0.015
 Daisy-0.015
Tokenanne
Token ID21952
Feature activation+8.26e-02

Pos logit contributions
ep+0.268
utes+0.257
ophone+0.257
ician+0.249
aghetti+0.248
Neg logit contributions
,-0.346
 Daisy-0.339
 Sally-0.319
 her-0.315
 she-0.312
Token smiled
Token ID13541
Feature activation-4.88e-03

Pos logit contributions
ep+0.206
ophone+0.197
ician+0.195
we+0.193
anas+0.189
Neg logit contributions
 Daisy-0.104
,-0.098
 Sally-0.095
 her-0.090
 she-0.089
Token and
Token ID290
Feature activation+5.24e-03

Pos logit contributions
 berries+0.006
 Daisy+0.006
 Sally+0.005
 are+0.005
 Freddy+0.004
Neg logit contributions
ep-0.011
ician-0.011
we-0.011
utes-0.010
ber-0.010
Token thanked
Token ID26280
Feature activation-4.52e-02

Pos logit contributions
ep+0.012
ophone+0.011
ician+0.011
we+0.011
utes+0.011
Neg logit contributions
 Daisy-0.007
,-0.007
 berries-0.007
 Sally-0.006
 She-0.006
Token the
Token ID262
Feature activation+2.77e-03

Pos logit contributions
 berries+0.019
 Daisy+0.019
,+0.019
 Sally+0.015
 She+0.012
Neg logit contributions
ep-0.137
we-0.135
ician-0.133
utes-0.128
etr-0.127
TokenBefore
Token ID8421
Feature activation-3.97e-02

Pos logit contributions
ophone+0.030
anas+0.028
ician+0.028
aghetti+0.028
utes+0.027
Neg logit contributions
,-0.036
 she-0.034
 her-0.034
 the-0.033
 She-0.033
Token he
Token ID339
Feature activation+3.71e-02

Pos logit contributions
 Daisy+0.043
,+0.041
 Sally+0.038
 her+0.036
 she+0.036
Neg logit contributions
ep-0.105
ophone-0.100
we-0.100
ician-0.100
utes-0.099
Token left
Token ID1364
Feature activation-1.49e-02

Pos logit contributions
ep+0.089
ophone+0.086
ician+0.082
ber+0.081
washer+0.080
Neg logit contributions
 Daisy-0.047
 berries-0.044
 She-0.041
,-0.041
 Sally-0.041
Token,
Token ID11
Feature activation+4.22e-03

Pos logit contributions
 Daisy+0.004
 berries+0.003
 Sally+0.002
,+0.002
 She+0.001
Neg logit contributions
ep-0.050
ophone-0.049
we-0.048
ician-0.048
utes-0.048
Token Jo
Token ID5302
Feature activation+1.04e-01

Pos logit contributions
ep+0.011
ophone+0.011
ician+0.011
anas+0.011
aghetti+0.011
Neg logit contributions
,-0.005
 Daisy-0.005
 she-0.005
 her-0.005
 the-0.004
TokenJo
Token ID9908
Feature activation+1.61e-01

Pos logit contributions
utes+0.237
aghetti+0.227
ep+0.220
ophone+0.217
anas+0.215
Neg logit contributions
,-0.176
 Daisy-0.155
 she-0.153
 her-0.150
 the-0.146
Token hugged
Token ID41130
Feature activation-1.02e-02

Pos logit contributions
ep+0.417
utes+0.417
ophone+0.412
aghetti+0.408
ician+0.405
Neg logit contributions
,-0.193
 Daisy-0.177
 she-0.159
 Sally-0.159
 her-0.158
Token his
Token ID465
Feature activation+4.92e-02

Pos logit contributions
,+0.012
 her+0.011
 Daisy+0.010
 Sally+0.010
 she+0.009
Neg logit contributions
ep-0.028
ophone-0.028
ician-0.027
utes-0.027
aghetti-0.027
Token mom
Token ID1995
Feature activation-2.53e-03

Pos logit contributions
ep+0.117
ophone+0.112
ician+0.111
utes+0.110
we+0.110
Neg logit contributions
 Daisy-0.069
,-0.068
 Sally-0.063
 she-0.061
 her-0.061
Token and
Token ID290
Feature activation+1.36e-02

Pos logit contributions
 Daisy+0.002
 Sally+0.002
,+0.002
 berries+0.002
 she+0.002
Neg logit contributions
ep-0.007
ophone-0.007
ician-0.007
we-0.006
anas-0.006
Token said
Token ID531
Feature activation-4.45e-03

Pos logit contributions
ep+0.032
ician+0.030
ophone+0.030
we+0.029
anas+0.029
Neg logit contributions
 Daisy-0.018
,-0.017
 berries-0.016
 Sally-0.016
 She-0.016
Token.
Token ID13
Feature activation+1.84e-02

Pos logit contributions
ophone+0.077
utes+0.072
ician+0.071
we+0.069
aghetti+0.068
Neg logit contributions
,-0.092
 her-0.086
 Daisy-0.085
 the-0.083
 Sally-0.082
Token The
Token ID383
Feature activation+1.05e-02

Pos logit contributions
ep+0.046
ophone+0.046
ician+0.044
utes+0.044
we+0.044
Neg logit contributions
,-0.019
 Daisy-0.019
 berries-0.018
 her-0.017
 Sally-0.017
Token therm
Token ID21969
Feature activation+7.19e-02

Pos logit contributions
ep+0.023
ophone+0.022
ician+0.021
utes+0.021
we+0.021
Neg logit contributions
 Daisy-0.016
,-0.016
 Sally-0.015
 her-0.015
 she-0.014
Tokenometer
Token ID15635
Feature activation+6.59e-02

Pos logit contributions
ep+0.083
ician+0.077
ophone+0.076
utes+0.074
we+0.073
Neg logit contributions
,-0.186
 Daisy-0.178
 she-0.173
 Sally-0.171
 She-0.170
Token be
Token ID307
Feature activation+3.16e-02

Pos logit contributions
ep+0.166
ophone+0.154
we+0.153
ician+0.148
anas+0.146
Neg logit contributions
 Daisy-0.088
 Sally-0.082
,-0.074
 berries-0.071
 Sophie-0.071
Tokenep
Token ID538
Feature activation+1.60e-01

Pos logit contributions
ep+0.046
ician+0.037
ophone+0.037
we+0.037
aker+0.036
Neg logit contributions
 Daisy-0.078
,-0.075
 Sally-0.074
 she-0.072
 She-0.072
Tokened
Token ID276
Feature activation-1.90e-02

Pos logit contributions
ep+0.439
ophone+0.421
ician+0.413
we+0.413
utes+0.408
Neg logit contributions
 Daisy-0.171
,-0.153
 Sally-0.149
 her-0.139
 she-0.139
Token and
Token ID290
Feature activation+1.38e-02

Pos logit contributions
 Daisy+0.022
 Sally+0.020
 berries+0.019
 she+0.019
 Sophie+0.018
Neg logit contributions
ep-0.048
ician-0.043
ophone-0.043
we-0.043
aker-0.043
Token the
Token ID262
Feature activation-1.42e-03

Pos logit contributions
ep+0.034
we+0.030
ophone+0.030
ician+0.030
anas+0.029
Neg logit contributions
 Daisy-0.018
,-0.017
 Sally-0.017
 She-0.016
 berries-0.015
Token doctor
Token ID6253
Feature activation+2.90e-02

Pos logit contributions
 Daisy+0.002
,+0.002
 Sally+0.002
 her+0.002
 she+0.002
Neg logit contributions
ep-0.003
ophone-0.003
ician-0.003
we-0.003
utes-0.003
Token said
Token ID531
Feature activation-1.56e-02

Pos logit contributions
ep+0.073
ophone+0.070
ician+0.069
we+0.068
utes+0.068
Neg logit contributions
 Daisy-0.036
,-0.035
 Sally-0.033
 her-0.032
 she-0.031
Token of
Token ID286
Feature activation+9.83e-03

Pos logit contributions
 Daisy+0.021
,+0.021
 Sally+0.019
 her+0.018
 she+0.018
Neg logit contributions
ep-0.056
ophone-0.055
we-0.054
ician-0.054
utes-0.054
Token the
Token ID262
Feature activation+2.77e-03

Pos logit contributions
ep+0.029
we+0.027
utes+0.027
ician+0.027
ophone+0.027
Neg logit contributions
 Daisy-0.007
,-0.007
 berries-0.006
 Sally-0.006
 she-0.006
Token night
Token ID1755
Feature activation+2.12e-02

Pos logit contributions
ep+0.007
ophone+0.006
ician+0.006
we+0.006
utes+0.006
Neg logit contributions
 Daisy-0.004
,-0.004
 Sally-0.003
 her-0.003
 she-0.003
Token.
Token ID13
Feature activation+2.00e-02

Pos logit contributions
ep+0.053
we+0.052
ophone+0.051
ician+0.050
utes+0.049
Neg logit contributions
 Daisy-0.026
 Sally-0.023
 berries-0.023
,-0.021
 her-0.021
Token<|endoftext|>
Token ID50256
Feature activation-5.81e-03

Pos logit contributions
ep+0.051
ophone+0.050
ician+0.050
utes+0.049
we+0.049
Neg logit contributions
,-0.022
 Daisy-0.021
 her-0.019
 Sally-0.019
 berries-0.019
TokenJo
Token ID9908
Feature activation+1.59e-01

Pos logit contributions
 Daisy+0.007
,+0.007
 Sally+0.006
 berries+0.006
 she+0.006
Neg logit contributions
ep-0.014
we-0.014
ophone-0.014
ician-0.013
utes-0.013
TokenJo
Token ID9908
Feature activation+1.59e-01

Pos logit contributions
ep+0.350
ophone+0.334
ician+0.330
we+0.326
utes+0.323
Neg logit contributions
 Daisy-0.250
,-0.242
 Sally-0.231
 she-0.223
 berries-0.221
Token was
Token ID373
Feature activation-1.53e-02

Pos logit contributions
ep+0.369
ophone+0.352
ician+0.348
we+0.343
utes+0.337
Neg logit contributions
 Daisy-0.233
,-0.217
 Sally-0.212
 berries-0.209
 she-0.203
Token walking
Token ID6155
Feature activation+5.40e-02

Pos logit contributions
 Daisy+0.021
,+0.020
 Sally+0.019
 berries+0.019
 she+0.018
Neg logit contributions
ep-0.037
ophone-0.034
ician-0.034
utes-0.034
we-0.034
Token outside
Token ID2354
Feature activation+4.05e-02

Pos logit contributions
ep+0.139
ophone+0.134
ician+0.132
we+0.132
utes+0.132
Neg logit contributions
 Daisy-0.064
,-0.061
 Sally-0.057
 she-0.054
 her-0.053
Token on
Token ID319
Feature activation+1.05e-02

Pos logit contributions
ep+0.106
we+0.102
utes+0.100
ophone+0.098
ician+0.098
Neg logit contributions
 Daisy-0.043
 berries-0.040
 Sally-0.039
 Polly-0.034
,-0.034
Token the
Token ID262
Feature activation+2.77e-03

Pos logit contributions
ep+0.029
we+0.027
utes+0.027
ician+0.027
ophone+0.027
Neg logit contributions
 Daisy-0.007
,-0.007
 berries-0.006
 Sally-0.006
 she-0.006
Token night
Token ID1755
Feature activation+2.12e-02

Pos logit contributions
ep+0.007
ophone+0.006
ician+0.006
we+0.006
utes+0.006
Neg logit contributions
 Daisy-0.004
,-0.004
 Sally-0.003
 her-0.003
 she-0.003
Token.
Token ID13
Feature activation+2.00e-02

Pos logit contributions
ep+0.053
we+0.052
ophone+0.051
ician+0.050
utes+0.049
Neg logit contributions
 Daisy-0.026
 Sally-0.023
 berries-0.023
,-0.021
 her-0.021
Token<|endoftext|>
Token ID50256
Feature activation-5.81e-03

Pos logit contributions
ep+0.051
ophone+0.050
ician+0.050
utes+0.049
we+0.049
Neg logit contributions
,-0.022
 Daisy-0.021
 her-0.019
 Sally-0.019
 berries-0.019
TokenJo
Token ID9908
Feature activation+1.59e-01

Pos logit contributions
 Daisy+0.007
,+0.007
 Sally+0.006
 berries+0.006
 she+0.006
Neg logit contributions
ep-0.014
we-0.014
ophone-0.014
ician-0.013
utes-0.013
TokenJo
Token ID9908
Feature activation+1.59e-01

Pos logit contributions
ep+0.350
ophone+0.334
ician+0.330
we+0.326
utes+0.323
Neg logit contributions
 Daisy-0.250
,-0.242
 Sally-0.231
 she-0.223
 berries-0.221
Token was
Token ID373
Feature activation-1.53e-02

Pos logit contributions
ep+0.369
ophone+0.352
ician+0.348
we+0.343
utes+0.337
Neg logit contributions
 Daisy-0.233
,-0.217
 Sally-0.212
 berries-0.209
 she-0.203
Token walking
Token ID6155
Feature activation+5.40e-02

Pos logit contributions
 Daisy+0.021
,+0.020
 Sally+0.019
 berries+0.019
 she+0.018
Neg logit contributions
ep-0.037
ophone-0.034
ician-0.034
utes-0.034
we-0.034
Token outside
Token ID2354
Feature activation+4.05e-02

Pos logit contributions
ep+0.139
ophone+0.134
ician+0.132
we+0.132
utes+0.132
Neg logit contributions
 Daisy-0.064
,-0.061
 Sally-0.057
 she-0.054
 her-0.053
Token on
Token ID319
Feature activation+1.05e-02

Pos logit contributions
ep+0.106
we+0.102
utes+0.100
ophone+0.098
ician+0.098
Neg logit contributions
 Daisy-0.043
 berries-0.040
 Sally-0.039
 Polly-0.034
,-0.034
Token a
Token ID257
Feature activation-6.96e-03

Pos logit contributions
ep+0.025
utes+0.024
 motorcycle+0.024
etr+0.023
we+0.023
Neg logit contributions
 berries-0.009
 Daisy-0.009
 Sally-0.008
're-0.008
 taste-0.008
Token the
Token ID262
Feature activation+6.55e-03

Pos logit contributions
 Daisy+0.002
 Sally+0.002
 Freddy+0.002
 She+0.002
 berries+0.002
Neg logit contributions
ep-0.006
ophone-0.006
we-0.006
ician-0.006
utes-0.006
Token door
Token ID3420
Feature activation+2.05e-02

Pos logit contributions
ep+0.011
utes+0.010
ician+0.010
ophone+0.010
ber+0.010
Neg logit contributions
 Daisy-0.013
 Sally-0.012
 Freddy-0.012
 her-0.011
 Polly-0.011
Token."
Token ID526
Feature activation+7.40e-03

Pos logit contributions
ep+0.048
ophone+0.046
ician+0.044
we+0.044
ber+0.043
Neg logit contributions
 Daisy-0.029
 berries-0.027
 Sally-0.026
 Freddy-0.025
 Polly-0.024
Token
Token ID198
Feature activation+1.40e-02

Pos logit contributions
ep+0.019
ophone+0.018
ician+0.018
utes+0.018
aghetti+0.018
Neg logit contributions
,-0.010
 Daisy-0.009
 she-0.008
 Sally-0.008
 her-0.008
Token
Token ID198
Feature activation+1.42e-02

Pos logit contributions
ophone+0.037
ician+0.036
aghetti+0.036
anas+0.036
ep+0.036
Neg logit contributions
,-0.017
 Daisy-0.015
 her-0.015
 she-0.014
 She-0.014
TokenJo
Token ID9908
Feature activation+1.59e-01

Pos logit contributions
ophone+0.024
anas+0.022
aghetti+0.021
utes+0.021
aker+0.021
Neg logit contributions
,-0.032
 she-0.030
 She-0.030
 her-0.030
 Daisy-0.029
Token thought
Token ID1807
Feature activation+2.56e-02

Pos logit contributions
ep+0.410
utes+0.407
ophone+0.402
aghetti+0.398
ician+0.394
Neg logit contributions
,-0.194
 Daisy-0.182
 her-0.162
 Sally-0.161
 she-0.160
Token for
Token ID329
Feature activation-2.49e-02

Pos logit contributions
ep+0.071
ophone+0.068
ician+0.068
we+0.067
utes+0.067
Neg logit contributions
 Daisy-0.025
,-0.024
 Sally-0.022
 she-0.021
 her-0.021
Token a
Token ID257
Feature activation+1.87e-02

Pos logit contributions
 Daisy+0.022
,+0.022
 Sally+0.020
 she+0.019
 her+0.018
Neg logit contributions
ep-0.071
ophone-0.068
ician-0.068
we-0.067
utes-0.066
Token moment
Token ID2589
Feature activation+4.79e-02

Pos logit contributions
ep+0.050
we+0.049
utes+0.048
ophone+0.047
cles+0.047
Neg logit contributions
,-0.024
 Daisy-0.023
 she-0.021
 Sally-0.020
 her-0.019
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
ep+0.130
we+0.125
ophone+0.124
ician+0.124
anas+0.121
Neg logit contributions
 Daisy-0.045
 berries-0.041
 Sally-0.040
,-0.035
 her-0.034
Token princess
Token ID21752
Feature activation+5.95e-03

Pos logit contributions
ep+0.005
ophone+0.005
ician+0.005
we+0.005
utes+0.005
Neg logit contributions
 Daisy-0.003
,-0.003
 Sally-0.003
 her-0.003
 she-0.003
Token.
Token ID13
Feature activation+5.12e-03

Pos logit contributions
ep+0.015
ophone+0.015
we+0.015
ician+0.015
anas+0.014
Neg logit contributions
 Daisy-0.007
 Sally-0.006
,-0.006
 berries-0.006
 her-0.006
Token
Token ID220
Feature activation+5.51e-03

Pos logit contributions
ep+0.013
ophone+0.013
ician+0.013
we+0.013
utes+0.013
Neg logit contributions
 Daisy-0.005
,-0.005
 berries-0.005
 Sally-0.005
 her-0.005
Token
Token ID198
Feature activation+3.38e-03

Pos logit contributions
ep+0.014
ophone+0.013
utes+0.013
we+0.013
ician+0.013
Neg logit contributions
 Daisy-0.006
,-0.006
 berries-0.006
 Sally-0.006
 her-0.005
Token
Token ID198
Feature activation+2.90e-03

Pos logit contributions
ep+0.010
ophone+0.010
ician+0.010
aghetti+0.009
utes+0.009
Neg logit contributions
,-0.003
 Daisy-0.003
 her-0.003
 she-0.003
 Sally-0.002
TokenJo
Token ID9908
Feature activation+1.59e-01

Pos logit contributions
ophone+0.005
ep+0.005
anas+0.005
aghetti+0.005
ician+0.005
Neg logit contributions
,-0.005
 Daisy-0.005
 her-0.005
 she-0.005
 She-0.005
Tokenanne
Token ID21952
Feature activation+7.72e-02

Pos logit contributions
ep+0.228
ophone+0.214
utes+0.211
ician+0.210
we+0.207
Neg logit contributions
,-0.369
 Daisy-0.368
 Sally-0.348
 her-0.344
 she-0.343
Token had
Token ID550
Feature activation+1.80e-03

Pos logit contributions
ep+0.181
ophone+0.170
ician+0.168
we+0.165
ber+0.161
Neg logit contributions
 Daisy-0.110
 Sally-0.103
 berries-0.099
,-0.092
 her-0.091
Token so
Token ID523
Feature activation+8.01e-03

Pos logit contributions
ep+0.004
ophone+0.004
etr+0.004
ician+0.004
we+0.004
Neg logit contributions
 Daisy-0.002
,-0.002
 berries-0.002
 Sally-0.002
 she-0.002
Token much
Token ID881
Feature activation-8.24e-05

Pos logit contributions
ep+0.014
ophone+0.012
etr+0.012
and+0.012
aker+0.012
Neg logit contributions
 Daisy-0.016
 berries-0.014
 Sally-0.014
,-0.014
 Polly-0.014
Token fun
Token ID1257
Feature activation-3.84e-03

Pos logit contributions
 Daisy+0.000
 Sally+0.000
 She+0.000
,+0.000
 Polly+0.000
Neg logit contributions
ep-0.000
we-0.000
etr-0.000
utes-0.000
and-0.000
Token some
Token ID617
Feature activation-4.22e-02

Pos logit contributions
 Daisy+0.004
 Sally+0.004
,+0.004
 she+0.004
 She+0.004
Neg logit contributions
ep-0.011
we-0.010
ophone-0.010
ician-0.010
aghetti-0.010
Token?"
Token ID1701
Feature activation+3.74e-03

Pos logit contributions
 Daisy+0.065
 Sally+0.064
 She+0.061
 Pam+0.060
 she+0.060
Neg logit contributions
ep-0.083
aghetti-0.080
utes-0.076
aker-0.075
cles-0.075
Token
Token ID220
Feature activation+1.66e-02

Pos logit contributions
ep+0.009
ophone+0.009
ician+0.009
aghetti+0.009
utes+0.009
Neg logit contributions
,-0.005
 Daisy-0.005
 she-0.005
 Sally-0.005
 her-0.005
Token
Token ID198
Feature activation+1.09e-02

Pos logit contributions
ep+0.046
ophone+0.044
ician+0.044
we+0.043
utes+0.043
Neg logit contributions
,-0.017
 Daisy-0.016
 Sally-0.015
 she-0.014
 her-0.014
Token
Token ID198
Feature activation+1.11e-02

Pos logit contributions
ophone+0.033
aghetti+0.033
ep+0.032
ician+0.032
anas+0.032
Neg logit contributions
,-0.010
 Daisy-0.008
 her-0.007
 she-0.007
 the-0.007
TokenJo
Token ID9908
Feature activation+1.58e-01

Pos logit contributions
ophone+0.017
aghetti+0.016
anas+0.016
ician+0.016
aker+0.016
Neg logit contributions
,-0.026
 she-0.025
 her-0.024
 the-0.024
 Daisy-0.024
Token was
Token ID373
Feature activation-2.85e-03

Pos logit contributions
utes+0.407
aghetti+0.396
ep+0.394
ophone+0.394
we+0.387
Neg logit contributions
,-0.208
 Daisy-0.181
 her-0.165
 Sally-0.164
 the-0.160
Token so
Token ID523
Feature activation+2.02e-02

Pos logit contributions
 berries+0.004
 Daisy+0.004
,+0.004
 Sally+0.004
 she+0.004
Neg logit contributions
ep-0.006
ician-0.006
ophone-0.006
ber-0.005
anas-0.005
Token excited
Token ID6568
Feature activation+2.02e-02

Pos logit contributions
ep+0.033
ber+0.031
washer+0.029
and+0.029
utes+0.029
Neg logit contributions
 berries-0.040
 Daisy-0.038
 Polly-0.034
 Sally-0.034
 end-0.033
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
ep+0.052
we+0.052
ophone+0.050
ician+0.050
anas+0.049
Neg logit contributions
 Daisy-0.020
 berries-0.018
 Sally-0.017
,-0.016
 her-0.016
Token She
Token ID1375
Feature activation-3.50e-02

Pos logit contributions
ep+0.031
ophone+0.030
ician+0.029
we+0.029
utes+0.029
Neg logit contributions
 Daisy-0.012
,-0.012
 Sally-0.011
 she-0.011
 her-0.011
Token be
Token ID307
Feature activation+1.45e-02

Pos logit contributions
ep+0.018
ician+0.016
etr+0.016
 sax+0.015
eping+0.015
Neg logit contributions
 berries-0.013
 Daisy-0.012
 Sophie-0.012
 Sally-0.012
 She-0.011
Token hurt
Token ID5938
Feature activation-3.78e-02

Pos logit contributions
ep+0.033
ician+0.029
ophone+0.028
aker+0.028
we+0.028
Neg logit contributions
 Daisy-0.023
,-0.021
 Sally-0.021
 she-0.021
 berries-0.021
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
 berries+0.023
 Daisy+0.022
 taste+0.020
 Sally+0.018
 smell+0.017
Neg logit contributions
we-0.106
ep-0.105
and-0.102
ophone-0.100
ician-0.099
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
ep+0.026
ophone+0.025
ician+0.025
we+0.025
utes+0.025
Neg logit contributions
 Daisy-0.020
,-0.020
 Sally-0.019
 her-0.018
 she-0.018
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
ep+0.036
ophone+0.036
ician+0.035
aghetti+0.035
aker+0.035
Neg logit contributions
,-0.011
 Daisy-0.010
 she-0.009
 her-0.009
 She-0.009
TokenJo
Token ID9908
Feature activation+1.58e-01

Pos logit contributions
ophone+0.023
ep+0.023
utes+0.022
ician+0.022
anas+0.022
Neg logit contributions
,-0.024
 Daisy-0.023
 she-0.022
 her-0.022
 She-0.021
Tokenanna
Token ID7697
Feature activation+5.83e-02

Pos logit contributions
ep+0.213
ophone+0.211
aghetti+0.210
utes+0.210
anas+0.200
Neg logit contributions
,-0.383
 Daisy-0.366
 her-0.351
 she-0.349
 Sally-0.344
Token smiled
Token ID13541
Feature activation-6.41e-04

Pos logit contributions
ep+0.139
ophone+0.131
ician+0.129
we+0.127
anas+0.126
Neg logit contributions
 Daisy-0.078
 Sally-0.071
 berries-0.071
,-0.066
 her-0.066
Token and
Token ID290
Feature activation+9.26e-03

Pos logit contributions
 Daisy+0.001
 Sally+0.001
 berries+0.001
 are+0.001
 taste+0.001
Neg logit contributions
ep-0.002
ician-0.001
utes-0.001
ophone-0.001
we-0.001
Token said
Token ID531
Feature activation-9.57e-03

Pos logit contributions
ep+0.021
ician+0.020
we+0.020
ophone+0.020
anas+0.019
Neg logit contributions
 Daisy-0.011
,-0.011
 berries-0.011
 Sally-0.010
 end-0.010
Token to
Token ID284
Feature activation+1.21e-02

Pos logit contributions
 Daisy+0.004
 Sally+0.004
 berries+0.003
 Sophie+0.002
 taste+0.002
Neg logit contributions
ep-0.028
we-0.028
ophone-0.028
ber-0.028
ician-0.028
Token he
Token ID339
Feature activation+4.25e-03

Pos logit contributions
 Daisy+0.035
 Sally+0.031
,+0.031
 She+0.030
 her+0.029
Neg logit contributions
ep-0.051
we-0.048
ophone-0.048
utes-0.047
aker-0.045
Token liked
Token ID8288
Feature activation-1.69e-03

Pos logit contributions
ep+0.011
ophone+0.010
we+0.009
ician+0.009
anas+0.009
Neg logit contributions
 Daisy-0.006
,-0.005
 Sally-0.005
 berries-0.005
 she-0.005
Token
Token ID6184
Feature activation+1.34e-02

Pos logit contributions
 Daisy+0.002
 Sally+0.002
,+0.002
 she+0.002
 berries+0.002
Neg logit contributions
ep-0.004
we-0.004
ophone-0.004
ician-0.004
utes-0.004
Token
Token ID95
Feature activation+3.03e-02

Pos logit contributions
ep+0.029
ician+0.027
ophone+0.027
we+0.027
utes+0.026
Neg logit contributions
,-0.024
 Daisy-0.022
 Sally-0.021
 She-0.021
 she-0.021
Token
Token ID26391
Feature activation+1.41e-02

Pos logit contributions
ep+0.062
utes+0.061
we+0.059
ophone+0.058
ician+0.058
Neg logit contributions
,-0.053
 Daisy-0.052
 Sally-0.050
 She-0.049
 her-0.046
Token
Token ID447
Feature activation-1.25e-01

Pos logit contributions
ep+0.028
ician+0.027
ophone+0.026
utes+0.026
we+0.026
Neg logit contributions
,-0.027
 Daisy-0.024
 she-0.023
 Sally-0.023
 She-0.022
Token
Token ID250
Feature activation+1.61e-02

Pos logit contributions
,+0.163
 Daisy+0.150
 Sally+0.138
 she+0.137
 She+0.134
Neg logit contributions
ep-0.310
ophone-0.306
ician-0.304
we-0.300
utes-0.296
Token a
Token ID257
Feature activation-1.41e-02

Pos logit contributions
ep+0.031
ician+0.029
ophone+0.029
utes+0.029
anas+0.029
Neg logit contributions
 Daisy-0.029
,-0.028
 Sally-0.028
 her-0.027
 She-0.027
Token big
Token ID1263
Feature activation-2.43e-02

Pos logit contributions
,+0.020
 Daisy+0.020
 Sally+0.018
 she+0.018
 her+0.017
Neg logit contributions
ep-0.033
ophone-0.032
ician-0.032
we-0.032
utes-0.031
Token soft
Token ID2705
Feature activation-4.23e-02

Pos logit contributions
 Daisy+0.030
 she+0.028
 her+0.028
 She+0.027
 Sally+0.026
Neg logit contributions
ep-0.056
ophone-0.052
ician-0.052
 sax-0.052
anas-0.051
Token swing
Token ID9628
Feature activation-3.53e-04

Pos logit contributions
 Daisy+0.058
 she+0.054
 Sally+0.053
 her+0.052
 She+0.052
Neg logit contributions
ep-0.091
ician-0.085
ophone-0.085
aker-0.084
anas-0.084
Token other
Token ID584
Feature activation+2.81e-02

Pos logit contributions
 Daisy+0.015
,+0.015
 Sally+0.014
 her+0.013
 she+0.013
Neg logit contributions
ep-0.024
ophone-0.023
ician-0.022
we-0.022
utes-0.022
Token family
Token ID1641
Feature activation-1.14e-02

Pos logit contributions
ep+0.070
ophone+0.068
ician+0.067
utes+0.067
we+0.065
Neg logit contributions
,-0.038
 Daisy-0.036
 Sally-0.033
 her-0.033
 she-0.032
Token
Token ID6184
Feature activation+1.45e-02

Pos logit contributions
 Daisy+0.012
,+0.012
 Sally+0.011
 her+0.010
 she+0.010
Neg logit contributions
ep-0.030
ophone-0.029
ician-0.029
we-0.029
utes-0.029
Token
Token ID95
Feature activation+3.18e-02

Pos logit contributions
ep+0.032
ician+0.029
ophone+0.029
we+0.028
aker+0.027
Neg logit contributions
,-0.026
 Daisy-0.024
 Sally-0.024
 she-0.023
 She-0.022
Token
Token ID26391
Feature activation+1.53e-02

Pos logit contributions
ep+0.057
utes+0.055
we+0.052
ophone+0.051
ber+0.051
Neg logit contributions
,-0.062
 Sally-0.059
 Daisy-0.058
 She-0.057
 the-0.054
Token
Token ID447
Feature activation-1.23e-01

Pos logit contributions
ep+0.027
ophone+0.026
etr+0.026
aghetti+0.026
ician+0.026
Neg logit contributions
,-0.032
 Daisy-0.028
 Sally-0.027
 she-0.026
 the-0.026
Token
Token ID250
Feature activation+2.00e-02

Pos logit contributions
,+0.184
 Daisy+0.153
 Sally+0.145
 She+0.140
 the+0.139
Neg logit contributions
ophone-0.295
ician-0.285
ep-0.285
we-0.284
utes-0.279
Token a
Token ID257
Feature activation-1.57e-02

Pos logit contributions
ophone+0.040
ep+0.039
anas+0.038
ician+0.038
utes+0.038
Neg logit contributions
 Daisy-0.032
 Sally-0.032
 She-0.032
 she-0.031
,-0.031
Token dad
Token ID9955
Feature activation+7.44e-03

Pos logit contributions
,+0.019
 Daisy+0.017
 Sally+0.015
 the+0.015
 she+0.014
Neg logit contributions
we-0.043
ep-0.043
ophone-0.042
aghetti-0.041
ician-0.040
Token,
Token ID11
Feature activation-2.77e-02

Pos logit contributions
ep+0.025
ophone+0.024
ician+0.024
we+0.023
utes+0.023
Neg logit contributions
 Daisy-0.003
,-0.003
 Sally-0.002
 her-0.002
 she-0.002
Token a
Token ID257
Feature activation-1.57e-02

Pos logit contributions
,+0.031
 Daisy+0.030
 Sally+0.028
 she+0.027
 her+0.027
Neg logit contributions
ep-0.074
ician-0.072
ophone-0.071
we-0.071
utes-0.071
Token special
Token ID2041
Feature activation-3.14e-02

Pos logit contributions
ep+0.118
we+0.116
ophone+0.115
ician+0.115
utes+0.111
Neg logit contributions
,-0.070
 Daisy-0.068
 Sally-0.063
 she-0.062
 her-0.061
Token thing
Token ID1517
Feature activation-3.59e-02

Pos logit contributions
 Daisy+0.051
,+0.049
 Sally+0.047
 she+0.046
 her+0.046
Neg logit contributions
ep-0.067
ophone-0.063
ician-0.063
utes-0.062
we-0.061
Token
Token ID6184
Feature activation+1.39e-02

Pos logit contributions
,+0.040
 Daisy+0.040
 Sally+0.036
 she+0.035
 her+0.034
Neg logit contributions
ep-0.095
ophone-0.091
ician-0.091
utes-0.090
we-0.090
Token
Token ID95
Feature activation+3.13e-02

Pos logit contributions
ep+0.035
ophone+0.034
ician+0.033
we+0.033
utes+0.032
Neg logit contributions
,-0.019
 Daisy-0.018
 Sally-0.017
 she-0.016
 She-0.015
Token
Token ID26391
Feature activation+1.53e-02

Pos logit contributions
utes+0.057
ep+0.057
ophone+0.056
ber+0.055
we+0.054
Neg logit contributions
,-0.059
 Daisy-0.058
 Sally-0.056
 She-0.055
 her-0.053
Token
Token ID447
Feature activation-1.23e-01

Pos logit contributions
ep+0.029
ophone+0.028
ician+0.028
utes+0.028
we+0.027
Neg logit contributions
,-0.030
 Daisy-0.027
 Sally-0.026
 she-0.026
 She-0.025
Token
Token ID250
Feature activation+1.85e-02

Pos logit contributions
,+0.172
 Daisy+0.156
 Sally+0.143
 she+0.142
 She+0.139
Neg logit contributions
ophone-0.294
ep-0.293
ician-0.288
we-0.284
utes-0.283
Token wings
Token ID12098
Feature activation-5.91e-02

Pos logit contributions
ep+0.037
ophone+0.036
ician+0.036
utes+0.036
anas+0.035
Neg logit contributions
 Daisy-0.031
,-0.030
 Sally-0.029
 her-0.029
 She-0.029
Token.
Token ID13
Feature activation-2.18e-02

Pos logit contributions
 Daisy+0.068
 Sally+0.061
 berries+0.058
 Sophie+0.056
 Polly+0.053
Neg logit contributions
ep-0.159
ophone-0.149
aker-0.145
etr-0.144
we-0.142
Token Every
Token ID3887
Feature activation-6.83e-02

Pos logit contributions
,+0.030
 Daisy+0.028
 her+0.027
 she+0.027
 berries+0.026
Neg logit contributions
ophone-0.049
ep-0.049
ician-0.048
anas-0.047
utes-0.047
Token day
Token ID1110
Feature activation+3.60e-04

Pos logit contributions
 Daisy+0.080
,+0.078
 Sally+0.073
 her+0.069
 she+0.069
Neg logit contributions
ep-0.175
ophone-0.169
ician-0.167
utes-0.165
we-0.165
Token a
Token ID257
Feature activation-1.15e-02

Pos logit contributions
 Daisy+0.045
,+0.038
 She+0.038
 Sally+0.037
 berries+0.035
Neg logit contributions
ep-0.093
we-0.091
utes-0.090
 sax-0.089
 motorcycle-0.088
Token normal
Token ID3487
Feature activation-4.69e-04

Pos logit contributions
 Daisy+0.018
,+0.017
 Sally+0.016
 her+0.016
 she+0.016
Neg logit contributions
ep-0.025
ician-0.024
ophone-0.024
we-0.024
utes-0.024
Token penny
Token ID26369
Feature activation-4.18e-02

Pos logit contributions
 Daisy+0.001
 Sally+0.001
 She+0.001
 she+0.001
 Polly+0.001
Neg logit contributions
ep-0.001
ician-0.001
we-0.001
ber-0.001
aker-0.001
Tokenâ
Token ID22940
Feature activation+2.05e-02

Pos logit contributions
 Daisy+0.047
 Sally+0.042
,+0.040
 her+0.038
 berries+0.038
Neg logit contributions
ep-0.108
ophone-0.106
we-0.104
ician-0.102
utes-0.101
Token
Token ID26391
Feature activation+2.18e-02

Pos logit contributions
ep+0.039
utes+0.037
ophone+0.036
ician+0.036
we+0.036
Neg logit contributions
,-0.038
 Daisy-0.038
 Sally-0.037
 her-0.035
 she-0.035
Token
Token ID447
Feature activation-1.22e-01

Pos logit contributions
ep+0.033
ophone+0.032
ician+0.031
utes+0.031
we+0.030
Neg logit contributions
,-0.049
 Daisy-0.048
 Sally-0.046
 she-0.045
 her-0.044
Token
Token ID251
Feature activation-2.65e-03

Pos logit contributions
,+0.124
 Daisy+0.114
 Sally+0.101
 she+0.098
 She+0.094
Neg logit contributions
ep-0.341
ophone-0.333
ician-0.332
we-0.323
utes-0.323
Tokenit
Token ID270
Feature activation-7.67e-04

Pos logit contributions
,+0.002
 Daisy+0.002
 Sally+0.002
 her+0.002
 she+0.002
Neg logit contributions
ep-0.008
ophone-0.008
ician-0.008
utes-0.008
we-0.007
Token was
Token ID373
Feature activation-3.00e-02

Pos logit contributions
,+0.001
 Daisy+0.001
 her+0.001
 she+0.001
 Sally+0.001
Neg logit contributions
ep-0.002
utes-0.002
ophone-0.002
ician-0.002
aghetti-0.002
Token special
Token ID2041
Feature activation-3.05e-02

Pos logit contributions
 Daisy+0.044
 berries+0.041
,+0.040
 Sally+0.039
 she+0.039
Neg logit contributions
ep-0.066
ophone-0.058
ician-0.056
aker-0.056
utes-0.056
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
 Daisy+0.025
,+0.022
 Sally+0.022
 her+0.020
 she+0.020
Neg logit contributions
ep-0.087
we-0.084
ophone-0.084
ician-0.084
utes-0.083
Token,
Token ID11
Feature activation-2.71e-02

Pos logit contributions
 Daisy+0.062
 Sally+0.057
,+0.054
 she+0.052
 She+0.052
Neg logit contributions
ep-0.092
ophone-0.087
we-0.086
utes-0.085
ician-0.084
Token one
Token ID530
Feature activation-2.49e-02

Pos logit contributions
 Daisy+0.036
,+0.036
 Sally+0.033
 her+0.032
 she+0.032
Neg logit contributions
ep-0.066
ophone-0.064
ician-0.064
utes-0.063
we-0.063
Token dull
Token ID19222
Feature activation-4.57e-02

Pos logit contributions
,+0.031
 Daisy+0.029
 she+0.027
 Sally+0.026
 her+0.026
Neg logit contributions
ep-0.064
ophone-0.062
ician-0.062
utes-0.061
we-0.061
Tokenâ
Token ID22940
Feature activation+1.68e-02

Pos logit contributions
 Daisy+0.069
 Sally+0.063
,+0.060
 she+0.058
 her+0.058
Neg logit contributions
ep-0.102
ophone-0.097
we-0.097
ician-0.095
utes-0.094
Token
Token ID26391
Feature activation+1.32e-02

Pos logit contributions
utes+0.033
ep+0.033
we+0.031
ophone+0.031
ades+0.031
Neg logit contributions
,-0.030
 Daisy-0.029
 Sally-0.028
 her-0.027
 She-0.027
Token
Token ID447
Feature activation-1.22e-01

Pos logit contributions
ep+0.031
ophone+0.029
utes+0.029
ician+0.029
we+0.028
Neg logit contributions
,-0.021
 Daisy-0.018
 she-0.018
 Sally-0.017
 She-0.017
Token
Token ID251
Feature activation-9.89e-03

Pos logit contributions
,+0.144
 Daisy+0.144
 Sally+0.130
 she+0.126
 her+0.125
Neg logit contributions
ep-0.314
ophone-0.303
ician-0.301
we-0.298
utes-0.297
Tokenand
Token ID392
Feature activation+1.05e-01

Pos logit contributions
,+0.010
 Daisy+0.010
 Sally+0.009
 she+0.009
 her+0.009
Neg logit contributions
ep-0.028
ophone-0.027
ician-0.027
utes-0.026
we-0.026
Token he
Token ID339
Feature activation+6.04e-03

Pos logit contributions
ep+0.219
utes+0.211
ician+0.208
ophone+0.205
aghetti+0.203
Neg logit contributions
,-0.185
 Daisy-0.175
 her-0.171
 she-0.170
 Sally-0.167
Token had
Token ID550
Feature activation-1.50e-02

Pos logit contributions
ep+0.015
ophone+0.014
ician+0.014
we+0.014
utes+0.014
Neg logit contributions
 Daisy-0.008
,-0.008
 Sally-0.007
 she-0.007
 her-0.007
Token to
Token ID284
Feature activation-1.37e-02

Pos logit contributions
 Daisy+0.019
,+0.017
 Sally+0.016
 berries+0.016
 she+0.015
Neg logit contributions
ep-0.038
ophone-0.036
ician-0.035
we-0.035
anas-0.034
Token
Token ID6184
Feature activation+1.80e-02

Pos logit contributions
 Daisy+0.025
,+0.024
 Sally+0.023
 her+0.022
 she+0.022
Neg logit contributions
ep-0.046
ophone-0.044
ician-0.044
we-0.044
utes-0.044
Token
Token ID95
Feature activation+3.95e-02

Pos logit contributions
ep+0.036
utes+0.034
ophone+0.034
we+0.034
ician+0.034
Neg logit contributions
,-0.036
 Daisy-0.030
 She-0.030
 Sally-0.029
 the-0.028
Token
Token ID26391
Feature activation+2.05e-02

Pos logit contributions
utes+0.082
ep+0.074
we+0.074
ophone+0.071
ades+0.069
Neg logit contributions
,-0.072
 She-0.066
 Daisy-0.066
 Sally-0.064
 the-0.062
Token
Token ID129
Feature activation+2.60e-02

Pos logit contributions
ep+0.038
utes+0.037
ophone+0.037
ician+0.037
aghetti+0.036
Neg logit contributions
,-0.041
 Daisy-0.037
 she-0.036
 Sally-0.035
 She-0.035
Token
Token ID241
Feature activation+3.35e-02

Pos logit contributions
we+0.050
ophone+0.050
ep+0.050
utes+0.049
ician+0.048
Neg logit contributions
,-0.047
 Daisy-0.044
 Sally-0.043
 her-0.042
 She-0.041
TokenThank
Token ID10449
Feature activation-1.22e-01

Pos logit contributions
ep+0.067
ophone+0.064
ician+0.062
utes+0.062
anas+0.060
Neg logit contributions
,-0.059
 Daisy-0.059
 she-0.057
 Sally-0.057
 her-0.056
Token you
Token ID345
Feature activation-2.47e-02

Pos logit contributions
 Daisy+0.166
,+0.166
 Sally+0.152
 she+0.148
 her+0.148
Neg logit contributions
ep-0.293
ophone-0.282
ician-0.279
utes-0.276
we-0.275
Token!
Token ID0
Feature activation+8.04e-03

Pos logit contributions
 Daisy+0.020
,+0.019
 Sally+0.017
 she+0.017
 her+0.016
Neg logit contributions
ep-0.072
ophone-0.069
ician-0.069
we-0.068
utes-0.068
Token I
Token ID314
Feature activation-3.06e-02

Pos logit contributions
ep+0.018
ophone+0.018
ician+0.017
we+0.017
utes+0.017
Neg logit contributions
,-0.011
 Daisy-0.011
 her-0.010
 Sally-0.010
 berries-0.010
Token like
Token ID588
Feature activation-4.30e-02

Pos logit contributions
 Daisy+0.045
 berries+0.042
 Sally+0.040
,+0.039
 her+0.039
Neg logit contributions
ep-0.070
ophone-0.068
ician-0.066
aker-0.064
anas-0.064
Token to
Token ID284
Feature activation-9.41e-03

Pos logit contributions
 Daisy+0.048
,+0.048
 Sally+0.043
 she+0.041
 her+0.041
Neg logit contributions
ep-0.112
ophone-0.108
ician-0.107
we-0.106
utes-0.106
Token two
Token ID734
Feature activation-1.86e-02

Pos logit contributions
 Daisy+0.014
,+0.014
 Sally+0.013
 she+0.012
 her+0.012
Neg logit contributions
ep-0.039
ophone-0.037
ician-0.037
we-0.037
utes-0.037
Token penn
Token ID22429
Feature activation+8.07e-04

Pos logit contributions
 Daisy+0.031
 Sally+0.029
,+0.028
 she+0.027
 her+0.027
Neg logit contributions
ep-0.039
we-0.036
anas-0.035
ophone-0.035
utes-0.035
Tokenies
Token ID444
Feature activation-4.06e-02

Pos logit contributions
ep+0.002
ophone+0.002
ician+0.002
we+0.002
utes+0.002
Neg logit contributions
 Daisy-0.001
,-0.001
 Sally-0.001
 she-0.001
 her-0.001
Tokenâ
Token ID22940
Feature activation+1.83e-02

Pos logit contributions
 Daisy+0.043
 Sally+0.039
 berries+0.035
 Sophie+0.034
 her+0.032
Neg logit contributions
ep-0.108
ophone-0.105
we-0.103
anas-0.100
utes-0.100
Token
Token ID26391
Feature activation+1.40e-02

Pos logit contributions
ep+0.034
utes+0.033
we+0.031
ophone+0.031
ician+0.031
Neg logit contributions
,-0.035
 Daisy-0.035
 Sally-0.034
 her-0.033
 She-0.032
Token
Token ID447
Feature activation-1.21e-01

Pos logit contributions
ep+0.022
ician+0.021
ophone+0.020
utes+0.020
aghetti+0.019
Neg logit contributions
,-0.033
 Daisy-0.030
 she-0.029
 Sally-0.028
 She-0.028
Token
Token ID251
Feature activation-1.01e-02

Pos logit contributions
,+0.119
 Daisy+0.104
 Sally+0.090
 she+0.090
 her+0.088
Neg logit contributions
ophone-0.342
ep-0.340
ician-0.339
we-0.328
utes-0.325
Tokenone
Token ID505
Feature activation+5.01e-02

Pos logit contributions
,+0.011
 Daisy+0.009
 her+0.009
 Sally+0.009
 she+0.008
Neg logit contributions
ep-0.029
ician-0.029
ophone-0.028
we-0.028
utes-0.028
Token shiny
Token ID22441
Feature activation-4.07e-02

Pos logit contributions
ep+0.115
ician+0.113
utes+0.112
we+0.112
ophone+0.111
Neg logit contributions
,-0.077
 Daisy-0.069
 she-0.067
 her-0.067
 Sally-0.064
Token,
Token ID11
Feature activation-2.71e-02

Pos logit contributions
 Daisy+0.062
 Sally+0.057
,+0.054
 she+0.052
 She+0.052
Neg logit contributions
ep-0.092
ophone-0.087
we-0.086
utes-0.085
ician-0.084
Token one
Token ID530
Feature activation-2.49e-02

Pos logit contributions
 Daisy+0.036
,+0.036
 Sally+0.033
 her+0.032
 she+0.032
Neg logit contributions
ep-0.066
ophone-0.064
ician-0.064
utes-0.063
we-0.063
Token rare
Token ID4071
Feature activation-4.24e-02

Pos logit contributions
,+0.009
 Daisy+0.008
 Sally+0.008
 she+0.008
 her+0.007
Neg logit contributions
ep-0.014
ophone-0.013
we-0.013
ician-0.013
utes-0.013
Token silence
Token ID9550
Feature activation-1.33e-02

Pos logit contributions
 Daisy+0.063
 Sally+0.058
,+0.058
 her+0.056
 She+0.055
Neg logit contributions
ep-0.090
ophone-0.087
ician-0.086
utes-0.085
ber-0.083
Token
Token ID6184
Feature activation+1.99e-02

Pos logit contributions
 Daisy+0.015
 Sally+0.014
 berries+0.013
 her+0.013
 Freddy+0.012
Neg logit contributions
ep-0.033
ophone-0.032
we-0.032
utes-0.031
ician-0.031
Token
Token ID95
Feature activation+4.41e-02

Pos logit contributions
ep+0.040
ophone+0.038
ician+0.037
we+0.037
utes+0.036
Neg logit contributions
,-0.035
 Daisy-0.035
 Sally-0.033
 she-0.032
 her-0.032
Token
Token ID26391
Feature activation+3.10e-02

Pos logit contributions
ep+0.088
utes+0.082
we+0.082
ophone+0.082
ician+0.080
Neg logit contributions
,-0.078
 Daisy-0.078
 She-0.074
 Sally-0.074
 she-0.072
Token
Token ID447
Feature activation-1.21e-01

Pos logit contributions
ep+0.064
ophone+0.061
ician+0.060
we+0.059
utes+0.059
Neg logit contributions
 Daisy-0.052
,-0.052
 Sally-0.049
 her-0.048
 she-0.048
Token
Token ID250
Feature activation+2.74e-02

Pos logit contributions
 Daisy+0.166
,+0.164
 Sally+0.152
 her+0.147
 she+0.146
Neg logit contributions
ep-0.289
ophone-0.278
ician-0.275
we-0.272
utes-0.272
Token that
Token ID326
Feature activation-1.81e-02

Pos logit contributions
ep+0.055
ophone+0.053
ician+0.052
anas+0.052
utes+0.051
Neg logit contributions
 Daisy-0.047
,-0.045
 she-0.045
 Sally-0.044
 her-0.044
Token added
Token ID2087
Feature activation+1.71e-03

Pos logit contributions
 Daisy+0.020
 Sally+0.019
,+0.019
 her+0.017
 berries+0.017
Neg logit contributions
ep-0.046
ophone-0.045
ician-0.044
we-0.043
utes-0.043
Token to
Token ID284
Feature activation-9.10e-03

Pos logit contributions
ep+0.004
ophone+0.004
we+0.004
utes+0.004
ician+0.004
Neg logit contributions
 Daisy-0.002
 Sally-0.002
 She-0.002
 she-0.002
,-0.002
Token the
Token ID262
Feature activation-1.53e-02

Pos logit contributions
 Daisy+0.011
,+0.010
 Sally+0.010
 berries+0.010
 She+0.009
Neg logit contributions
ep-0.022
ician-0.021
ophone-0.021
utes-0.021
we-0.021
Token
Token ID6184
Feature activation+1.98e-02

Pos logit contributions
,+0.006
 Daisy+0.006
 berries+0.005
 her+0.005
 Sally+0.004
Neg logit contributions
ophone-0.020
ep-0.020
ician-0.020
utes-0.020
we-0.020
Token
Token ID95
Feature activation+4.11e-02

Pos logit contributions
ep+0.040
ician+0.038
we+0.037
ophone+0.037
aghetti+0.036
Neg logit contributions
,-0.038
 She-0.034
 Daisy-0.033
 Sally-0.033
 she-0.032
Token
Token ID26391
Feature activation+2.18e-02

Pos logit contributions
utes+0.091
we+0.085
ep+0.084
ymes+0.082
aghetti+0.082
Neg logit contributions
,-0.067
 She-0.063
 Sally-0.057
 Daisy-0.055
-0.054
Token
Token ID129
Feature activation+2.59e-02

Pos logit contributions
ep+0.042
ophone+0.041
ician+0.040
utes+0.040
we+0.040
Neg logit contributions
,-0.041
 Daisy-0.038
 she-0.036
 Sally-0.036
 She-0.036
Token
Token ID241
Feature activation+3.43e-02

Pos logit contributions
we+0.051
ep+0.051
ophone+0.050
utes+0.049
ician+0.048
Neg logit contributions
,-0.046
 Daisy-0.044
 her-0.042
 Sally-0.042
 She-0.041
TokenThank
Token ID10449
Feature activation-1.21e-01

Pos logit contributions
ep+0.071
ophone+0.068
ician+0.066
utes+0.066
we+0.064
Neg logit contributions
,-0.058
 Daisy-0.058
 Sally-0.056
 she-0.056
 her-0.055
Token you
Token ID345
Feature activation-2.52e-02

Pos logit contributions
,+0.159
 Daisy+0.157
 she+0.142
 Sally+0.142
 her+0.141
Neg logit contributions
ep-0.302
ophone-0.293
ician-0.290
utes-0.285
anas-0.281
Token for
Token ID329
Feature activation-5.13e-02

Pos logit contributions
 Daisy+0.023
,+0.022
 Sally+0.021
 she+0.020
 her+0.020
Neg logit contributions
ep-0.070
ophone-0.068
ician-0.067
utes-0.066
we-0.066
Token supporting
Token ID6493
Feature activation+4.62e-02

Pos logit contributions
 Daisy+0.060
,+0.059
 Sally+0.054
 she+0.052
 her+0.051
Neg logit contributions
ep-0.133
ophone-0.127
ician-0.126
we-0.125
utes-0.125
Token me
Token ID502
Feature activation-4.70e-02

Pos logit contributions
ep+0.119
ician+0.113
we+0.113
ophone+0.112
utes+0.111
Neg logit contributions
 Daisy-0.051
,-0.048
 Sally-0.046
 she-0.045
 berries-0.044
Token,
Token ID11
Feature activation-1.99e-02

Pos logit contributions
 Daisy+0.044
,+0.041
 Sally+0.039
 she+0.036
 her+0.036
Neg logit contributions
ep-0.132
ophone-0.128
ician-0.126
utes-0.125
we-0.125
Token
Token ID6184
Feature activation+1.47e-02

Pos logit contributions
 Daisy+0.031
,+0.030
 Sally+0.028
 her+0.028
 she+0.027
Neg logit contributions
ep-0.055
ophone-0.053
ician-0.053
we-0.052
utes-0.052
Token
Token ID95
Feature activation+3.65e-02

Pos logit contributions
ep+0.029
ophone+0.027
ician+0.027
we+0.027
and+0.026
Neg logit contributions
,-0.029
 Daisy-0.026
 Sally-0.026
 she-0.026
 She-0.025
Token
Token ID26391
Feature activation+1.75e-02

Pos logit contributions
utes+0.074
ep+0.072
we+0.069
ophone+0.067
ades+0.067
Neg logit contributions
,-0.062
 Daisy-0.062
 She-0.061
 Sally-0.061
 her-0.053
Token
Token ID129
Feature activation+2.12e-02

Pos logit contributions
ep+0.029
ophone+0.028
ician+0.027
utes+0.027
we+0.027
Neg logit contributions
,-0.038
 Daisy-0.036
 Sally-0.035
 she-0.035
 She-0.033
Token
Token ID241
Feature activation+3.34e-02

Pos logit contributions
ep+0.039
we+0.039
ophone+0.038
ician+0.037
utes+0.037
Neg logit contributions
,-0.040
 Daisy-0.039
 Sally-0.037
 her-0.037
 She-0.036
TokenThank
Token ID10449
Feature activation-1.21e-01

Pos logit contributions
ep+0.072
ophone+0.069
ician+0.066
utes+0.065
anas+0.064
Neg logit contributions
,-0.055
 she-0.055
 Daisy-0.054
 Sally-0.053
 She-0.053
Token you
Token ID345
Feature activation-2.55e-02

Pos logit contributions
,+0.161
 Daisy+0.158
 Sally+0.144
 she+0.143
 her+0.143
Neg logit contributions
ep-0.298
ophone-0.289
ician-0.286
utes-0.282
anas-0.277
Token.
Token ID13
Feature activation-8.96e-03

Pos logit contributions
 Daisy+0.018
,+0.017
 Sally+0.015
 she+0.014
 her+0.014
Neg logit contributions
ep-0.077
ophone-0.074
ician-0.074
utes-0.073
we-0.073
Tokenâ
Token ID22940
Feature activation+1.56e-02

Pos logit contributions
,+0.012
 Daisy+0.012
 her+0.011
 Sally+0.011
 she+0.011
Neg logit contributions
ep-0.021
ophone-0.020
ician-0.020
utes-0.020
we-0.020
Token
Token ID26391
Feature activation+1.84e-02

Pos logit contributions
ep+0.029
utes+0.028
we+0.027
ician+0.027
ophone+0.027
Neg logit contributions
,-0.030
 Daisy-0.029
 Sally-0.028
 she-0.027
 her-0.027
Token
Token ID220
Feature activation-1.04e-02

Pos logit contributions
ep+0.036
ophone+0.034
ician+0.034
we+0.033
utes+0.033
Neg logit contributions
,-0.033
 Daisy-0.033
 Sally-0.031
 she-0.030
 her-0.030
Token special
Token ID2041
Feature activation-2.92e-02

Pos logit contributions
,+0.014
 Daisy+0.014
 she+0.012
 Sally+0.012
 her+0.012
Neg logit contributions
ep-0.019
we-0.019
ophone-0.018
ician-0.018
utes-0.018
Token one
Token ID530
Feature activation-2.26e-02

Pos logit contributions
,+0.052
 Daisy+0.052
 Sally+0.049
 her+0.048
 she+0.048
Neg logit contributions
ep-0.057
ophone-0.054
ician-0.054
we-0.053
utes-0.053
Token
Token ID6184
Feature activation+1.85e-02

Pos logit contributions
 Daisy+0.018
 Sally+0.016
 her+0.014
 berries+0.014
,+0.014
Neg logit contributions
ep-0.063
ophone-0.063
ician-0.061
we-0.060
ber-0.060
Token
Token ID95
Feature activation+3.73e-02

Pos logit contributions
ep+0.047
ician+0.042
ophone+0.042
we+0.042
utes+0.042
Neg logit contributions
,-0.028
 Daisy-0.025
 Sally-0.023
 She-0.023
 she-0.023
Token
Token ID26391
Feature activation+2.00e-02

Pos logit contributions
utes+0.075
ep+0.073
we+0.069
ber+0.068
ophone+0.068
Neg logit contributions
,-0.064
 She-0.063
 Daisy-0.062
 Sally-0.061
 her-0.055
Token
Token ID447
Feature activation-1.21e-01

Pos logit contributions
ep+0.037
ophone+0.036
utes+0.035
ician+0.035
etr+0.035
Neg logit contributions
,-0.041
 Daisy-0.036
 she-0.035
 Sally-0.035
 She-0.034
Token
Token ID250
Feature activation+2.29e-02

Pos logit contributions
,+0.148
 Daisy+0.128
 she+0.117
 She+0.115
 Sally+0.115
Neg logit contributions
ophone-0.314
ep-0.313
ician-0.310
we-0.306
utes-0.302
Token a
Token ID257
Feature activation-1.06e-02

Pos logit contributions
ep+0.043
ophone+0.042
ician+0.042
utes+0.041
anas+0.040
Neg logit contributions
 Daisy-0.041
,-0.040
 Sally-0.039
 her-0.039
 she-0.039
Token scary
Token ID14343
Feature activation-4.25e-02

Pos logit contributions
,+0.016
 Daisy+0.015
 Sally+0.014
 she+0.013
 her+0.013
Neg logit contributions
ep-0.025
we-0.025
ophone-0.024
utes-0.024
ician-0.024
Token story
Token ID1621
Feature activation-1.79e-02

Pos logit contributions
,+0.073
 Daisy+0.070
 Sally+0.066
 her+0.064
 she+0.064
Neg logit contributions
ep-0.090
ophone-0.087
utes-0.086
we-0.086
ician-0.085
Token!
Token ID0
Feature activation-6.52e-03

Pos logit contributions
 Daisy+0.024
,+0.023
 Sally+0.022
 she+0.021
 her+0.021
Neg logit contributions
ep-0.044
ophone-0.042
ician-0.041
utes-0.041
we-0.040
Token very
Token ID845
Feature activation+5.18e-02

Pos logit contributions
 Daisy+0.044
 Sally+0.041
,+0.040
 berries+0.039
 she+0.038
Neg logit contributions
ep-0.062
ophone-0.056
we-0.056
ician-0.055
utes-0.054
Token sad
Token ID6507
Feature activation-4.78e-02

Pos logit contributions
ep+0.113
utes+0.103
we+0.101
ophone+0.100
aker+0.100
Neg logit contributions
 Daisy-0.083
 Sally-0.075
 berries-0.072
,-0.072
 Sophie-0.070
Token
Token ID6184
Feature activation+1.78e-02

Pos logit contributions
 Daisy+0.056
 Sally+0.049
 berries+0.048
 Freddy+0.043
 Polly+0.043
Neg logit contributions
ep-0.117
we-0.114
ophone-0.111
anas-0.111
utes-0.110
Token
Token ID95
Feature activation+3.72e-02

Pos logit contributions
ep+0.045
ophone+0.041
ician+0.041
we+0.040
utes+0.040
Neg logit contributions
,-0.026
 Daisy-0.024
 Sally-0.023
 she-0.023
 She-0.022
Token
Token ID26391
Feature activation+1.64e-02

Pos logit contributions
utes+0.067
ep+0.067
ophone+0.063
ber+0.061
ades+0.061
Neg logit contributions
 She-0.071
 Daisy-0.071
,-0.071
 Sally-0.068
 her-0.067
Token
Token ID447
Feature activation-1.20e-01

Pos logit contributions
ep+0.030
ophone+0.029
utes+0.028
ician+0.028
aghetti+0.028
Neg logit contributions
,-0.033
 Daisy-0.031
 she-0.030
 Sally-0.029
 She-0.029
Token
Token ID250
Feature activation+2.33e-02

Pos logit contributions
,+0.164
 Daisy+0.150
 she+0.141
 Sally+0.137
 She+0.135
Neg logit contributions
ophone-0.291
ep-0.291
ician-0.286
utes-0.283
we-0.280
Token it
Token ID340
Feature activation-2.36e-02

Pos logit contributions
ep+0.045
ophone+0.043
ician+0.043
utes+0.042
we+0.042
Neg logit contributions
 Daisy-0.042
,-0.042
 Sally-0.040
 her-0.039
 she-0.039
Token was
Token ID373
Feature activation-2.80e-02

Pos logit contributions
 Daisy+0.033
,+0.031
 Sally+0.031
 her+0.029
 berries+0.028
Neg logit contributions
ep-0.055
ophone-0.054
we-0.052
ician-0.052
etr-0.050
Token her
Token ID607
Feature activation-2.47e-02

Pos logit contributions
,+0.041
 Daisy+0.041
 Sally+0.037
 She+0.037
 she+0.036
Neg logit contributions
ep-0.061
ophone-0.057
we-0.054
ician-0.054
plug-0.053
Token favourite
Token ID12507
Feature activation+1.11e-02

Pos logit contributions
 Daisy+0.043
 Sally+0.040
,+0.040
 her+0.039
 she+0.038
Neg logit contributions
ophone-0.046
ep-0.046
ician-0.045
utes-0.044
ber-0.043
Token for
Token ID329
Feature activation-4.95e-02

Pos logit contributions
 Daisy+0.010
,+0.010
 Sally+0.009
 her+0.009
 she+0.009
Neg logit contributions
ep-0.030
ophone-0.029
ician-0.029
we-0.028
utes-0.028
Token him
Token ID683
Feature activation-1.47e-03

Pos logit contributions
 Daisy+0.054
,+0.054
 Sally+0.049
 she+0.047
 her+0.046
Neg logit contributions
ep-0.130
ophone-0.124
ician-0.124
we-0.124
utes-0.123
Token
Token ID6184
Feature activation+1.59e-02

Pos logit contributions
 Daisy+0.002
 Sally+0.002
,+0.001
 berries+0.001
 she+0.001
Neg logit contributions
ep-0.004
ophone-0.004
anas-0.004
we-0.004
ician-0.004
Token
Token ID95
Feature activation+3.45e-02

Pos logit contributions
ep+0.045
we+0.042
ophone+0.042
ician+0.041
utes+0.041
Neg logit contributions
,-0.018
 Daisy-0.017
 She-0.015
 Sally-0.015
 she-0.014
Token
Token ID26391
Feature activation+1.66e-02

Pos logit contributions
utes+0.079
ep+0.076
we+0.072
ophone+0.071
ades+0.070
Neg logit contributions
 She-0.055
,-0.052
 Daisy-0.051
 Sally-0.049
 her-0.046
Token
Token ID447
Feature activation-1.20e-01

Pos logit contributions
ep+0.030
utes+0.029
ophone+0.029
ician+0.028
we+0.028
Neg logit contributions
,-0.034
 Daisy-0.030
 she-0.030
 She-0.029
 Sally-0.029
Token
Token ID250
Feature activation+2.15e-02

Pos logit contributions
,+0.142
 Daisy+0.128
 she+0.117
 She+0.114
 Sally+0.113
Neg logit contributions
ophone-0.315
ep-0.313
ician-0.310
utes-0.307
we-0.306
Token it
Token ID340
Feature activation-2.37e-02

Pos logit contributions
ep+0.041
ophone+0.041
utes+0.040
ician+0.040
anas+0.038
Neg logit contributions
 Daisy-0.038
,-0.036
 Sally-0.036
 she-0.036
 her-0.036
Token was
Token ID373
Feature activation-2.70e-02

Pos logit contributions
 Daisy+0.025
 Sally+0.023
,+0.023
 her+0.021
 berries+0.021
Neg logit contributions
ep-0.063
ophone-0.061
we-0.060
ician-0.059
anas-0.058
Token a
Token ID257
Feature activation-1.05e-02

Pos logit contributions
 Daisy+0.034
,+0.034
 Sally+0.031
 she+0.031
 She+0.030
Neg logit contributions
ep-0.063
aghetti-0.058
ophone-0.058
aker-0.057
anas-0.057
Token big
Token ID1263
Feature activation-2.05e-02

Pos logit contributions
,+0.014
 Daisy+0.014
 Sally+0.013
 she+0.013
 her+0.013
Neg logit contributions
ep-0.025
ophone-0.024
ician-0.024
we-0.024
utes-0.024
Token something
Token ID1223
Feature activation-2.23e-02

Pos logit contributions
ep+0.108
we+0.103
ophone+0.099
etr+0.098
ician+0.096
Neg logit contributions
,-0.044
 Daisy-0.042
 taste-0.040
 Everyday-0.040
 are-0.040
Token special
Token ID2041
Feature activation-3.08e-02

Pos logit contributions
 Daisy+0.033
,+0.031
 Sally+0.030
 She+0.029
 berries+0.028
Neg logit contributions
ep-0.049
we-0.047
ophone-0.046
utes-0.046
ician-0.045
Token
Token ID6184
Feature activation+1.31e-02

Pos logit contributions
 Daisy+0.039
 Sally+0.037
 She+0.034
 berries+0.033
 Gig+0.032
Neg logit contributions
we-0.072
ep-0.070
ophone-0.068
utes-0.065
aker-0.065
Token
Token ID95
Feature activation+3.34e-02

Pos logit contributions
ep+0.030
ophone+0.028
ician+0.028
utes+0.027
we+0.027
Neg logit contributions
,-0.021
 Daisy-0.020
 Sally-0.019
 she-0.018
 her-0.018
Token
Token ID26391
Feature activation+1.63e-02

Pos logit contributions
utes+0.061
ep+0.060
ophone+0.059
ber+0.057
we+0.056
Neg logit contributions
 Daisy-0.065
,-0.065
 Sally-0.062
 She-0.060
 her-0.058
Token
Token ID447
Feature activation-1.20e-01

Pos logit contributions
ep+0.031
ophone+0.030
ician+0.030
utes+0.030
we+0.029
Neg logit contributions
,-0.030
 Daisy-0.029
 Sally-0.027
 she-0.027
 her-0.026
Token
Token ID251
Feature activation-8.94e-04

Pos logit contributions
,+0.166
 Daisy+0.160
 Sally+0.146
 she+0.145
 her+0.140
Neg logit contributions
ep-0.288
ophone-0.282
ician-0.278
we-0.274
utes-0.273
Token a
Token ID257
Feature activation-9.67e-03

Pos logit contributions
 Daisy+0.001
,+0.001
 Sally+0.001
 her+0.001
 she+0.001
Neg logit contributions
ep-0.002
ophone-0.002
ician-0.002
we-0.002
utes-0.002
Token helpful
Token ID7613
Feature activation-2.75e-02

Pos logit contributions
,+0.014
 Daisy+0.013
 Sally+0.012
 she+0.012
 her+0.011
Neg logit contributions
ep-0.023
ophone-0.023
we-0.023
ician-0.022
aghetti-0.022
Token cabinet
Token ID13447
Feature activation-1.51e-02

Pos logit contributions
 Daisy+0.039
,+0.038
 Sally+0.036
 she+0.035
 her+0.035
Neg logit contributions
ep-0.062
ician-0.059
ophone-0.059
utes-0.058
we-0.057
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
 Daisy+0.019
 Sally+0.017
,+0.016
 her+0.016
 berries+0.015
Neg logit contributions
ep-0.038
we-0.036
ophone-0.036
ician-0.035
ber-0.035
Token
Token ID6184
Feature activation+2.16e-02

Pos logit contributions
 Daisy+0.020
,+0.020
 Sally+0.019
 she+0.018
 her+0.018
Neg logit contributions
ep-0.040
ophone-0.039
ician-0.039
we-0.038
utes-0.038
Token
Token ID95
Feature activation+4.34e-02

Pos logit contributions
ep+0.042
ophone+0.040
we+0.039
ician+0.038
utes+0.037
Neg logit contributions
,-0.043
 Daisy-0.038
 She-0.038
 she-0.038
 Sally-0.038
Token
Token ID26391
Feature activation+2.41e-02

Pos logit contributions
utes+0.090
ep+0.087
we+0.084
ophone+0.082
ades+0.079
Neg logit contributions
,-0.073
 She-0.070
 Daisy-0.068
 Sally-0.067
-0.065
Token
Token ID129
Feature activation+3.03e-02

Pos logit contributions
ep+0.052
ophone+0.050
ician+0.050
utes+0.049
we+0.049
Neg logit contributions
,-0.040
 Daisy-0.037
 she-0.035
 Sally-0.035
 She-0.034
Token
Token ID241
Feature activation+3.60e-02

Pos logit contributions
we+0.057
ep+0.056
ophone+0.056
utes+0.054
ician+0.053
Neg logit contributions
,-0.056
 Daisy-0.054
 Sally-0.052
 her-0.052
 She-0.050
TokenThank
Token ID10449
Feature activation-1.20e-01

Pos logit contributions
ep+0.075
ophone+0.072
ician+0.069
utes+0.069
anas+0.067
Neg logit contributions
,-0.062
 Daisy-0.061
 she-0.060
 Sally-0.059
 her-0.058
Token you
Token ID345
Feature activation-2.65e-02

Pos logit contributions
 Daisy+0.165
,+0.164
 Sally+0.151
 her+0.146
 she+0.146
Neg logit contributions
ep-0.287
ophone-0.276
ician-0.274
utes-0.271
we-0.270
Token,
Token ID11
Feature activation-1.94e-02

Pos logit contributions
 Daisy+0.020
,+0.019
 Sally+0.017
 she+0.016
 her+0.016
Neg logit contributions
ep-0.078
ophone-0.076
ician-0.075
utes-0.074
we-0.074
Token my
Token ID616
Feature activation-2.17e-02

Pos logit contributions
 Daisy+0.021
,+0.021
 berries+0.019
 Sally+0.019
 her+0.018
Neg logit contributions
ep-0.049
ophone-0.047
ician-0.047
utes-0.046
we-0.046
Token dear
Token ID13674
Feature activation-5.29e-02

Pos logit contributions
,+0.033
 Daisy+0.031
 Sally+0.029
 she+0.028
 her+0.028
Neg logit contributions
ep-0.051
ophone-0.048
ician-0.048
we-0.048
utes-0.047
Token.
Token ID13
Feature activation-5.21e-03

Pos logit contributions
 Daisy+0.060
,+0.053
 Sally+0.052
 berries+0.051
 her+0.050
Neg logit contributions
ep-0.133
ician-0.128
ophone-0.127
we-0.127
ber-0.124
Token so
Token ID523
Feature activation-3.62e-03

Pos logit contributions
 Daisy+0.036
 berries+0.034
,+0.033
 Sally+0.033
 she+0.032
Neg logit contributions
ep-0.060
ician-0.055
ophone-0.055
we-0.053
utes-0.053
Token excited
Token ID6568
Feature activation-9.01e-04

Pos logit contributions
 berries+0.007
 Daisy+0.006
 Sally+0.006
,+0.006
 Polly+0.006
Neg logit contributions
ep-0.007
ber-0.006
ician-0.006
utes-0.006
ophone-0.006
Token
Token ID6184
Feature activation+1.74e-02

Pos logit contributions
 Daisy+0.001
 berries+0.001
 are+0.001
 Sally+0.001
 Everyday+0.001
Neg logit contributions
we-0.002
ep-0.002
ician-0.002
anas-0.002
ophone-0.002
Token
Token ID95
Feature activation+3.81e-02

Pos logit contributions
ep+0.035
ophone+0.032
ician+0.031
we+0.031
utes+0.031
Neg logit contributions
,-0.033
 Daisy-0.032
 Sally-0.031
 she-0.031
 She-0.030
Token
Token ID26391
Feature activation+1.89e-02

Pos logit contributions
ep+0.070
utes+0.069
ophone+0.066
ber+0.063
ades+0.063
Neg logit contributions
 Daisy-0.073
,-0.071
 She-0.071
 Sally-0.069
 her-0.066
Token
Token ID447
Feature activation-1.20e-01

Pos logit contributions
ep+0.035
ophone+0.033
ician+0.032
utes+0.032
we+0.032
Neg logit contributions
,-0.037
 Daisy-0.036
 Sally-0.034
 she-0.034
 her-0.033
Token
Token ID250
Feature activation+2.34e-02

Pos logit contributions
,+0.189
 Daisy+0.179
 she+0.168
 Sally+0.166
 She+0.162
Neg logit contributions
ep-0.265
ophone-0.263
ician-0.258
utes-0.252
we-0.251
Token she
Token ID673
Feature activation-3.17e-02

Pos logit contributions
ep+0.048
ophone+0.046
ician+0.044
utes+0.044
anas+0.044
Neg logit contributions
 Daisy-0.040
,-0.039
 she-0.038
 Sally-0.038
 her-0.037
Token couldn
Token ID3521
Feature activation-3.31e-02

Pos logit contributions
 Daisy+0.044
 Sally+0.042
,+0.041
 berries+0.040
 her+0.039
Neg logit contributions
ep-0.073
ophone-0.073
ician-0.069
anas-0.067
we-0.067
Token't
Token ID470
Feature activation-1.28e-02

Pos logit contributions
 berries+0.047
 Daisy+0.046
 Sally+0.044
 Gig+0.040
 Sophie+0.040
Neg logit contributions
ep-0.069
we-0.068
etr-0.067
ophone-0.067
washer-0.062
Token wait
Token ID4043
Feature activation+2.58e-02

Pos logit contributions
 Daisy+0.019
,+0.018
 Sally+0.017
 berries+0.017
 her+0.016
Neg logit contributions
ep-0.029
ician-0.027
ophone-0.027
we-0.027
utes-0.026
Token perfect
Token ID2818
Feature activation-9.21e-03

Pos logit contributions
 Daisy+0.033
,+0.031
 Sally+0.031
 she+0.029
 her+0.029
Neg logit contributions
ep-0.034
ophone-0.031
ician-0.030
utes-0.030
we-0.029
Token coat
Token ID13209
Feature activation-1.85e-02

Pos logit contributions
 Daisy+0.019
,+0.017
 Sally+0.017
 she+0.017
 Freddy+0.016
Neg logit contributions
ep-0.017
ician-0.015
ophone-0.015
we-0.014
utes-0.014
Token
Token ID6184
Feature activation+1.80e-02

Pos logit contributions
 Daisy+0.021
 Sally+0.018
 berries+0.018
,+0.017
 her+0.017
Neg logit contributions
ep-0.047
ophone-0.046
we-0.046
anas-0.044
ician-0.044
Token
Token ID95
Feature activation+3.79e-02

Pos logit contributions
ep+0.043
ician+0.040
ophone+0.040
we+0.040
utes+0.039
Neg logit contributions
,-0.028
 Daisy-0.025
 Sally-0.024
 She-0.024
 she-0.024
Token
Token ID26391
Feature activation+1.86e-02

Pos logit contributions
utes+0.084
we+0.080
ep+0.080
ophone+0.076
ber+0.075
Neg logit contributions
,-0.060
 She-0.059
 Sally-0.057
 Daisy-0.056
 her-0.052
Token
Token ID447
Feature activation-1.20e-01

Pos logit contributions
utes+0.033
ep+0.033
ophone+0.033
ician+0.033
aghetti+0.032
Neg logit contributions
,-0.038
 she-0.033
 Daisy-0.032
 She-0.032
 Sally-0.032
Token
Token ID250
Feature activation+2.40e-02

Pos logit contributions
,+0.156
 Daisy+0.143
 she+0.132
 Sally+0.131
 her+0.127
Neg logit contributions
ep-0.299
ophone-0.297
ician-0.294
utes-0.287
we-0.286
Token it
Token ID340
Feature activation-2.32e-02

Pos logit contributions
ep+0.049
ophone+0.048
ician+0.047
utes+0.047
anas+0.046
Neg logit contributions
 Daisy-0.039
,-0.039
 Sally-0.038
 she-0.038
 her-0.037
Token was
Token ID373
Feature activation-2.74e-02

Pos logit contributions
 Daisy+0.028
 Sally+0.026
,+0.025
 berries+0.024
 her+0.023
Neg logit contributions
ep-0.058
ophone-0.057
we-0.055
ician-0.054
etr-0.054
Token soft
Token ID2705
Feature activation-4.10e-02

Pos logit contributions
 Daisy+0.036
,+0.034
 berries+0.032
 She+0.032
 she+0.032
Neg logit contributions
ep-0.064
ophone-0.060
ician-0.057
aker-0.056
anas-0.056
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
 Daisy+0.043
 Sally+0.040
 berries+0.039
 Pam+0.038
 she+0.038
Neg logit contributions
ep-0.098
aker-0.093
ician-0.092
aghetti-0.091
we-0.090
Token special
Token ID2041
Feature activation-2.98e-02

Pos logit contributions
,+0.014
 Daisy+0.012
 Sally+0.011
 she+0.011
 the+0.011
Neg logit contributions
we-0.026
ep-0.026
ophone-0.025
utes-0.025
ician-0.025
Token day
Token ID1110
Feature activation-6.08e-04

Pos logit contributions
,+0.047
 Daisy+0.045
 Sally+0.042
 she+0.041
 her+0.040
Neg logit contributions
ep-0.067
ophone-0.064
we-0.064
ician-0.063
utes-0.063
Token
Token ID6184
Feature activation+1.48e-02

Pos logit contributions
 Daisy+0.000
,+0.000
 Sally+0.000
 berries+0.000
 her+0.000
Neg logit contributions
ep-0.002
ophone-0.002
we-0.002
ician-0.002
utes-0.002
Token
Token ID95
Feature activation+3.26e-02

Pos logit contributions
ep+0.032
we+0.028
ophone+0.028
ician+0.028
aker+0.027
Neg logit contributions
,-0.028
 Daisy-0.026
 Sally-0.025
 she-0.024
 She-0.024
Token
Token ID26391
Feature activation+1.54e-02

Pos logit contributions
ep+0.064
utes+0.064
ber+0.061
we+0.060
ymes+0.059
Neg logit contributions
,-0.057
 She-0.054
 Sally-0.053
 Daisy-0.053
 she-0.049
Token
Token ID447
Feature activation-1.20e-01

Pos logit contributions
ep+0.031
ophone+0.030
ician+0.030
aghetti+0.030
utes+0.029
Neg logit contributions
,-0.028
 Daisy-0.024
 she-0.024
 Sally-0.024
 She-0.023
Token
Token ID250
Feature activation+2.00e-02

Pos logit contributions
,+0.160
 Daisy+0.142
 she+0.134
 Sally+0.131
 She+0.130
Neg logit contributions
ophone-0.294
ep-0.291
ician-0.290
we-0.287
utes-0.283
Token the
Token ID262
Feature activation-2.08e-02

Pos logit contributions
ep+0.039
ophone+0.038
utes+0.038
ician+0.037
anas+0.037
Neg logit contributions
 Daisy-0.034
 Sally-0.033
,-0.033
 she-0.032
 her-0.032
Token boat
Token ID8848
Feature activation+3.21e-02

Pos logit contributions
,+0.032
 Daisy+0.031
 Sally+0.029
 she+0.028
 her+0.028
Neg logit contributions
ep-0.047
ophone-0.045
ician-0.044
we-0.044
utes-0.044
Token had
Token ID550
Feature activation-1.40e-02

Pos logit contributions
ep+0.081
ophone+0.078
ician+0.075
we+0.074
anas+0.072
Neg logit contributions
 Daisy-0.040
 Sally-0.037
 berries-0.036
 her-0.035
,-0.034
Token arrived
Token ID5284
Feature activation+4.91e-02

Pos logit contributions
 Daisy+0.016
,+0.015
 Sally+0.014
 she+0.014
 her+0.013
Neg logit contributions
ep-0.037
ophone-0.035
ician-0.035
utes-0.035
we-0.035
Token
Token ID6184
Feature activation+1.94e-02

Pos logit contributions
 Daisy+0.019
,+0.019
 Sally+0.017
 her+0.017
 she+0.017
Neg logit contributions
ep-0.037
ophone-0.036
ician-0.036
we-0.035
utes-0.035
Token
Token ID95
Feature activation+4.17e-02

Pos logit contributions
ep+0.039
ophone+0.037
ician+0.035
we+0.035
and+0.034
Neg logit contributions
,-0.038
 Daisy-0.034
 She-0.033
 she-0.033
 Sally-0.033
Token
Token ID26391
Feature activation+2.28e-02

Pos logit contributions
ep+0.080
utes+0.079
ophone+0.076
we+0.074
aghetti+0.073
Neg logit contributions
,-0.076
 Daisy-0.075
 She-0.074
 Sally-0.072
 her-0.067
Token
Token ID129
Feature activation+2.81e-02

Pos logit contributions
ep+0.040
ophone+0.038
ician+0.037
utes+0.037
we+0.036
Neg logit contributions
,-0.047
 Daisy-0.045
 Sally-0.043
 she-0.043
 She-0.042
Token
Token ID241
Feature activation+3.59e-02

Pos logit contributions
ep+0.052
ophone+0.050
we+0.050
ician+0.049
utes+0.049
Neg logit contributions
,-0.053
 Daisy-0.052
 Sally-0.050
 her-0.049
 She-0.047
TokenThank
Token ID10449
Feature activation-1.20e-01

Pos logit contributions
ep+0.072
ophone+0.069
ician+0.067
utes+0.066
anas+0.065
Neg logit contributions
,-0.063
 Daisy-0.063
 she-0.061
 Sally-0.061
 her-0.060
Token you
Token ID345
Feature activation-2.42e-02

Pos logit contributions
,+0.160
 Daisy+0.157
 Sally+0.143
 she+0.142
 her+0.141
Neg logit contributions
ep-0.296
ophone-0.287
ician-0.284
utes-0.279
anas-0.275
Token.
Token ID13
Feature activation-5.19e-03

Pos logit contributions
 Daisy+0.021
,+0.020
 Sally+0.018
 berries+0.018
 she+0.017
Neg logit contributions
ep-0.068
ophone-0.066
ician-0.065
we-0.065
utes-0.064
Token I
Token ID314
Feature activation-2.97e-02

Pos logit contributions
,+0.007
 Daisy+0.007
 her+0.007
 berries+0.007
 Sally+0.007
Neg logit contributions
ep-0.011
ophone-0.011
ician-0.011
utes-0.011
we-0.011
Token will
Token ID481
Feature activation+2.30e-03

Pos logit contributions
 Daisy+0.042
 Sally+0.038
 berries+0.038
,+0.038
 her+0.037
Neg logit contributions
ep-0.070
ophone-0.067
ician-0.066
anas-0.064
aker-0.064
Token be
Token ID307
Feature activation-4.09e-03

Pos logit contributions
ep+0.005
ician+0.005
ophone+0.005
anas+0.005
utes+0.005
Neg logit contributions
 Daisy-0.003
 berries-0.003
 Sally-0.003
,-0.003
 her-0.002
Token
Token ID6184
Feature activation+1.99e-02

Pos logit contributions
 Daisy+0.024
,+0.023
 Sally+0.021
 her+0.021
 she+0.021
Neg logit contributions
ep-0.045
ophone-0.043
ician-0.043
we-0.042
utes-0.042
Token
Token ID95
Feature activation+4.13e-02

Pos logit contributions
ep+0.039
ophone+0.037
ician+0.036
we+0.035
utes+0.035
Neg logit contributions
,-0.039
 Daisy-0.036
 Sally-0.035
 She-0.035
 she-0.035
Token
Token ID26391
Feature activation+2.23e-02

Pos logit contributions
utes+0.083
ep+0.081
ophone+0.078
we+0.078
ician+0.074
Neg logit contributions
,-0.072
 Daisy-0.070
 She-0.069
 Sally-0.068
 she-0.063
Token
Token ID129
Feature activation+2.63e-02

Pos logit contributions
ep+0.045
ophone+0.043
ician+0.043
utes+0.043
we+0.042
Neg logit contributions
,-0.040
 Daisy-0.038
 Sally-0.036
 she-0.036
 She-0.035
Token
Token ID241
Feature activation+3.65e-02

Pos logit contributions
ep+0.048
ophone+0.047
we+0.046
ician+0.045
utes+0.045
Neg logit contributions
,-0.050
 Daisy-0.049
 Sally-0.047
 her-0.046
 she-0.045
TokenThank
Token ID10449
Feature activation-1.20e-01

Pos logit contributions
ep+0.076
ophone+0.073
ician+0.071
utes+0.070
anas+0.068
Neg logit contributions
,-0.062
 Daisy-0.062
 Sally-0.059
 she-0.059
 her-0.058
Token you
Token ID345
Feature activation-2.38e-02

Pos logit contributions
,+0.160
 Daisy+0.159
 Sally+0.144
 she+0.142
 her+0.142
Neg logit contributions
ep-0.294
ophone-0.284
ician-0.281
utes-0.277
anas-0.273
Token for
Token ID329
Feature activation-5.11e-02

Pos logit contributions
 Daisy+0.021
,+0.020
 Sally+0.019
 she+0.018
 her+0.018
Neg logit contributions
ep-0.067
ophone-0.065
ician-0.064
utes-0.063
we-0.063
Token touching
Token ID15241
Feature activation-4.09e-03

Pos logit contributions
 Daisy+0.063
,+0.061
 Sally+0.057
 she+0.054
 berries+0.054
Neg logit contributions
ep-0.128
ophone-0.121
ician-0.120
we-0.120
utes-0.119
Token me
Token ID502
Feature activation-4.84e-02

Pos logit contributions
 Daisy+0.004
 Sally+0.003
 She+0.003
,+0.003
 she+0.003
Neg logit contributions
ep-0.011
ophone-0.011
we-0.011
utes-0.010
ician-0.010
Token.
Token ID13
Feature activation-4.14e-03

Pos logit contributions
 Daisy+0.050
 Sally+0.045
 berries+0.042
,+0.041
 she+0.041
Neg logit contributions
ep-0.130
ophone-0.125
ician-0.124
we-0.123
utes-0.122
Token their
Token ID511
Feature activation+5.07e-02

Pos logit contributions
ep+0.029
ician+0.029
utes+0.028
ophone+0.028
we+0.028
Neg logit contributions
,-0.010
 Daisy-0.009
 berries-0.009
 she-0.009
 Sally-0.008
Token mom
Token ID1995
Feature activation-2.63e-03

Pos logit contributions
ep+0.119
ophone+0.116
ician+0.112
we+0.111
anas+0.111
Neg logit contributions
,-0.074
 Daisy-0.071
 her-0.065
 Sally-0.065
 she-0.064
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
 Daisy+0.003
 Sally+0.003
 berries+0.002
 Freddy+0.002
 she+0.002
Neg logit contributions
ep-0.007
ophone-0.007
we-0.006
ician-0.006
anas-0.006
Token dad
Token ID9955
Feature activation+3.37e-02

Pos logit contributions
ep+0.032
ophone+0.030
ician+0.030
we+0.030
utes+0.030
Neg logit contributions
 Daisy-0.023
,-0.022
 Sally-0.021
 berries-0.020
 she-0.020
Token,
Token ID11
Feature activation-1.28e-03

Pos logit contributions
ep+0.095
ophone+0.092
ician+0.091
we+0.091
utes+0.090
Neg logit contributions
 Daisy-0.031
,-0.028
 Sally-0.027
 she-0.024
 her-0.024
Token who
Token ID508
Feature activation+1.28e-01

Pos logit contributions
,+0.003
 Daisy+0.003
 Sally+0.003
 her+0.003
 she+0.003
Neg logit contributions
ep-0.002
ophone-0.002
ician-0.002
utes-0.002
we-0.002
Token were
Token ID547
Feature activation+3.26e-03

Pos logit contributions
ep+0.320
ophone+0.300
we+0.296
ber+0.290
washer+0.287
Neg logit contributions
 Daisy-0.153
,-0.135
 Sally-0.134
 berries-0.131
 She-0.120
Token waiting
Token ID4953
Feature activation+5.54e-02

Pos logit contributions
ep+0.008
ophone+0.008
ician+0.008
utes+0.007
we+0.007
Neg logit contributions
,-0.005
 Daisy-0.004
 her-0.004
 Sally-0.004
 she-0.004
Token for
Token ID329
Feature activation-2.49e-02

Pos logit contributions
ep+0.157
ophone+0.151
we+0.150
ician+0.150
utes+0.149
Neg logit contributions
 Daisy-0.051
,-0.048
 Sally-0.044
 she-0.041
 her-0.041
Token them
Token ID606
Feature activation+6.66e-03

Pos logit contributions
 Daisy+0.032
,+0.030
 Sally+0.029
 She+0.029
 berries+0.029
Neg logit contributions
ep-0.057
we-0.054
utes-0.052
ician-0.051
ophone-0.051
Token.
Token ID13
Feature activation+2.25e-02

Pos logit contributions
ep+0.016
ophone+0.016
we+0.016
ician+0.015
anas+0.015
Neg logit contributions
 Daisy-0.008
 Sally-0.007
 berries-0.007
,-0.007
 She-0.007
Token happy
Token ID3772
Feature activation+1.35e-02

Pos logit contributions
ep+0.014
ician+0.013
ophone+0.013
utes+0.013
we+0.013
Neg logit contributions
 Daisy-0.011
,-0.011
 Sally-0.011
 her-0.010
 she-0.010
Token boy
Token ID2933
Feature activation+4.76e-03

Pos logit contributions
ep+0.029
ician+0.028
ophone+0.027
utes+0.027
we+0.027
Neg logit contributions
 Daisy-0.020
,-0.020
 Sally-0.019
 her-0.018
 she-0.018
Token.
Token ID13
Feature activation+5.05e-03

Pos logit contributions
ep+0.013
ophone+0.013
ician+0.012
we+0.012
utes+0.012
Neg logit contributions
 Daisy-0.005
,-0.005
 Sally-0.004
 her-0.004
 she-0.004
Token<|endoftext|>
Token ID50256
Feature activation-2.34e-02

Pos logit contributions
ep+0.013
ophone+0.013
ician+0.012
utes+0.012
we+0.012
Neg logit contributions
,-0.006
 Daisy-0.006
 her-0.005
 Sally-0.005
 she-0.005
TokenJo
Token ID9908
Feature activation+1.44e-01

Pos logit contributions
 Daisy+0.029
,+0.027
 Sally+0.027
 berries+0.027
 she+0.026
Neg logit contributions
ep-0.055
we-0.054
ophone-0.053
utes-0.051
ician-0.051
TokenJo
Token ID9908
Feature activation+1.45e-01

Pos logit contributions
ep+0.356
ophone+0.342
ician+0.340
utes+0.337
we+0.336
Neg logit contributions
 Daisy-0.185
,-0.185
 Sally-0.169
 her-0.164
 she-0.163
Token was
Token ID373
Feature activation-2.62e-02

Pos logit contributions
ep+0.368
ophone+0.354
ician+0.351
we+0.347
utes+0.345
Neg logit contributions
 Daisy-0.175
,-0.170
 Sally-0.158
 she-0.152
 her-0.151
Token sad
Token ID6507
Feature activation-4.56e-02

Pos logit contributions
 Daisy+0.036
,+0.034
 Sally+0.032
 berries+0.032
 she+0.031
Neg logit contributions
ep-0.063
ophone-0.060
ician-0.059
utes-0.058
we-0.058
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
 Daisy+0.045
,+0.042
 Sally+0.040
 her+0.037
 she+0.037
Neg logit contributions
ep-0.125
ophone-0.120
ician-0.119
we-0.119
utes-0.119
Token He
Token ID679
Feature activation-1.25e-02

Pos logit contributions
,+0.016
 Daisy+0.016
 Sally+0.014
 her+0.014
 she+0.014
Neg logit contributions
ep-0.041
ophone-0.039
ician-0.039
we-0.038
utes-0.038
Token wanted
Token ID2227
Feature activation-3.77e-02

Pos logit contributions
 Daisy+0.017
,+0.016
 Sally+0.016
 berries+0.015
 she+0.014
Neg logit contributions
ep-0.030
ophone-0.029
ician-0.028
we-0.028
anas-0.027
Token They
Token ID1119
Feature activation-6.54e-03

Pos logit contributions
ep+0.031
ophone+0.030
ician+0.030
we+0.029
utes+0.029
Neg logit contributions
 Daisy-0.013
,-0.013
 Sally-0.011
 her-0.011
 she-0.011
Token hope
Token ID2911
Feature activation+2.54e-03

Pos logit contributions
 Daisy+0.009
 Sally+0.009
 berries+0.008
,+0.007
 Sophie+0.007
Neg logit contributions
ep-0.015
ophone-0.015
anas-0.014
etr-0.014
ber-0.014
Token to
Token ID284
Feature activation+1.75e-03

Pos logit contributions
ep+0.007
we+0.006
ician+0.006
ophone+0.006
utes+0.006
Neg logit contributions
 Daisy-0.002
,-0.002
 Sally-0.002
 berries-0.002
 She-0.002
Token find
Token ID1064
Feature activation+2.71e-02

Pos logit contributions
ep+0.005
ician+0.004
ophone+0.004
utes+0.004
we+0.004
Neg logit contributions
 Daisy-0.002
,-0.002
 Sally-0.002
 berries-0.002
 she-0.001
Token someone
Token ID2130
Feature activation-7.40e-04

Pos logit contributions
ep+0.077
ophone+0.073
ician+0.073
we+0.072
utes+0.072
Neg logit contributions
 Daisy-0.024
,-0.023
 Sally-0.021
 she-0.020
 She-0.019
Token who
Token ID508
Feature activation+1.18e-01

Pos logit contributions
 Daisy+0.001
,+0.001
 Sally+0.001
 her+0.001
 she+0.001
Neg logit contributions
ep-0.001
ophone-0.001
ician-0.001
we-0.001
utes-0.001
Token knows
Token ID4206
Feature activation+1.88e-02

Pos logit contributions
ep+0.280
ophone+0.271
we+0.257
ber+0.257
ician+0.256
Neg logit contributions
 Daisy-0.164
 Sally-0.150
,-0.150
 berries-0.137
 She-0.133
Token how
Token ID703
Feature activation+2.77e-02

Pos logit contributions
ep+0.044
ophone+0.043
we+0.042
ician+0.042
utes+0.042
Neg logit contributions
 Daisy-0.025
,-0.023
 Sally-0.023
 She-0.022
 she-0.021
Token to
Token ID284
Feature activation+1.52e-03

Pos logit contributions
ep+0.072
ophone+0.069
ician+0.069
utes+0.068
we+0.068
Neg logit contributions
,-0.033
 Daisy-0.033
 Sally-0.029
 she-0.029
 her-0.029
Token open
Token ID1280
Feature activation-9.65e-03

Pos logit contributions
ep+0.004
ician+0.003
anas+0.003
ophone+0.003
etr+0.003
Neg logit contributions
 Daisy-0.002
 Sally-0.002
 berries-0.002
,-0.002
 she-0.002
Token the
Token ID262
Feature activation+1.15e-03

Pos logit contributions
 Daisy+0.008
 Sally+0.007
 She+0.007
 berries+0.006
 she+0.006
Neg logit contributions
ep-0.027
ophone-0.025
we-0.025
utes-0.025
ician-0.025
Token!"
Token ID2474
Feature activation-4.61e-03

Pos logit contributions
ep+0.028
ophone+0.027
ician+0.027
we+0.026
anas+0.026
Neg logit contributions
 Daisy-0.012
,-0.011
 Sally-0.011
 her-0.010
 she-0.010
Token
Token ID198
Feature activation+9.72e-03

Pos logit contributions
,+0.004
 Daisy+0.004
 Sally+0.004
 she+0.004
 her+0.004
Neg logit contributions
ep-0.013
ophone-0.013
ician-0.013
utes-0.013
we-0.013
Token
Token ID198
Feature activation+9.09e-03

Pos logit contributions
ophone+0.031
ep+0.030
ician+0.030
aghetti+0.030
cles+0.030
Neg logit contributions
,-0.007
 Daisy-0.006
 the-0.005
 her-0.005
 she-0.005
TokenA
Token ID32
Feature activation-6.58e-03

Pos logit contributions
ophone+0.021
utes+0.019
aghetti+0.019
etr+0.019
ician+0.019
Neg logit contributions
,-0.015
 she-0.014
 the-0.014
 She-0.014
 her-0.013
Token man
Token ID582
Feature activation-7.99e-04

Pos logit contributions
,+0.011
 Daisy+0.010
 she+0.009
 Sally+0.009
 her+0.009
Neg logit contributions
ophone-0.015
we-0.015
ician-0.015
ep-0.015
anas-0.014
Token who
Token ID508
Feature activation+1.14e-01

Pos logit contributions
,+0.001
 Daisy+0.001
 she+0.001
 her+0.001
 the+0.001
Neg logit contributions
utes-0.002
ician-0.002
ophone-0.002
ep-0.002
aghetti-0.002
Token was
Token ID373
Feature activation-1.36e-02

Pos logit contributions
ep+0.293
ophone+0.280
ician+0.276
we+0.275
utes+0.271
Neg logit contributions
 Daisy-0.137
,-0.134
 Sally-0.123
 she-0.117
 her-0.117
Token walking
Token ID6155
Feature activation+5.20e-02

Pos logit contributions
,+0.020
 Daisy+0.019
 her+0.018
 Sally+0.017
 she+0.017
Neg logit contributions
ophone-0.032
utes-0.031
ician-0.031
ep-0.031
aghetti-0.031
Token by
Token ID416
Feature activation-5.05e-03

Pos logit contributions
ep+0.123
we+0.115
ophone+0.115
ician+0.112
utes+0.111
Neg logit contributions
 Daisy-0.072
 Sally-0.066
,-0.065
 berries-0.062
 she-0.062
Token heard
Token ID2982
Feature activation+5.02e-02

Pos logit contributions
 Daisy+0.005
,+0.004
 Sally+0.004
 she+0.004
 berries+0.004
Neg logit contributions
ep-0.014
we-0.014
ician-0.014
ophone-0.013
utes-0.013
Token Tom
Token ID4186
Feature activation+3.35e-02

Pos logit contributions
ep+0.143
we+0.133
ophone+0.132
ician+0.132
utes+0.130
Neg logit contributions
 Daisy-0.042
,-0.038
 berries-0.036
 Sally-0.035
 she-0.033
Token She
Token ID1375
Feature activation-3.36e-02

Pos logit contributions
ep+0.066
ophone+0.064
utes+0.063
we+0.063
ician+0.063
Neg logit contributions
 Daisy-0.013
,-0.013
 Sally-0.010
 berries-0.010
 her-0.009
Token hears
Token ID23180
Feature activation+4.66e-02

Pos logit contributions
 Daisy+0.054
 Sally+0.051
,+0.048
 she+0.044
 her+0.044
Neg logit contributions
ep-0.076
ophone-0.072
we-0.068
ber-0.068
ician-0.067
Token a
Token ID257
Feature activation+1.73e-02

Pos logit contributions
ep+0.133
we+0.126
ophone+0.125
utes+0.124
ician+0.124
Neg logit contributions
 Daisy-0.036
,-0.034
 Sally-0.031
 berries-0.030
 she-0.027
Token loud
Token ID7812
Feature activation+6.54e-03

Pos logit contributions
ophone+0.043
ician+0.041
ep+0.041
utes+0.041
we+0.040
Neg logit contributions
,-0.025
 Daisy-0.023
 she-0.022
 her-0.022
 She-0.021
Token th
Token ID294
Feature activation-4.81e-02

Pos logit contributions
ophone+0.016
utes+0.015
ep+0.015
ician+0.015
we+0.015
Neg logit contributions
,-0.010
 Daisy-0.009
 her-0.008
 she-0.008
 Sally-0.008
Tokenud
Token ID463
Feature activation+1.16e-01

Pos logit contributions
,+0.117
 Daisy+0.115
 Sally+0.110
 her+0.110
 she+0.108
Neg logit contributions
ep-0.062
ophone-0.062
ician-0.060
utes-0.059
we-0.057
Token.
Token ID13
Feature activation+2.19e-02

Pos logit contributions
ep+0.336
ophone+0.330
ician+0.327
utes+0.326
anas+0.321
Neg logit contributions
,-0.107
 Daisy-0.095
 she-0.085
 her-0.085
 Sally-0.084
Token She
Token ID1375
Feature activation-3.49e-02

Pos logit contributions
ep+0.066
ophone+0.064
we+0.064
utes+0.064
ician+0.064
Neg logit contributions
,-0.013
 Daisy-0.013
 Sally-0.010
 berries-0.010
 her-0.009
Token opens
Token ID9808
Feature activation+2.89e-03

Pos logit contributions
 Daisy+0.054
 Sally+0.052
,+0.048
 she+0.046
 Sophie+0.046
Neg logit contributions
ep-0.081
ophone-0.074
we-0.071
utes-0.070
ician-0.069
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
ep+0.008
we+0.007
ician+0.007
plug+0.007
utes+0.007
Neg logit contributions
 Daisy-0.002
 berries-0.002
 Sally-0.002
 Freddy-0.002
,-0.002
Token lid
Token ID19789
Feature activation-2.93e-02

Pos logit contributions
ep+0.022
ophone+0.021
utes+0.020
ician+0.020
washer+0.020
Neg logit contributions
 Daisy-0.022
 Sally-0.021
,-0.020
 she-0.020
 her-0.020
Token now
Token ID783
Feature activation-3.13e-02

Pos logit contributions
 Daisy+0.009
 Sally+0.008
 She+0.007
 she+0.007
 Polly+0.007
Neg logit contributions
ep-0.015
we-0.014
aghetti-0.014
ber-0.014
ophone-0.014
Token!"
Token ID2474
Feature activation-1.31e-02

Pos logit contributions
 Daisy+0.048
 Sally+0.045
 berries+0.042
 Madd+0.041
 Pam+0.040
Neg logit contributions
ep-0.065
we-0.059
anas-0.059
ician-0.059
ber-0.058
Token Ben
Token ID3932
Feature activation+2.85e-02

Pos logit contributions
 berries+0.015
 Daisy+0.015
 Sally+0.014
,+0.014
 taste+0.014
Neg logit contributions
ep-0.030
ophone-0.028
ician-0.028
utes-0.028
we-0.028
Token says
Token ID1139
Feature activation-2.00e-02

Pos logit contributions
ep+0.065
ophone+0.060
ician+0.059
etr+0.058
anas+0.058
Neg logit contributions
 Daisy-0.040
 Sally-0.039
 she-0.034
 berries-0.034
,-0.034
Token.
Token ID13
Feature activation+8.78e-03

Pos logit contributions
,+0.019
 Daisy+0.018
 Sally+0.016
 her+0.015
 she+0.015
Neg logit contributions
ep-0.058
ophone-0.056
ician-0.056
utes-0.055
we-0.055
Token "
Token ID366
Feature activation+4.18e-02

Pos logit contributions
ep+0.023
ophone+0.022
ician+0.022
utes+0.022
we+0.022
Neg logit contributions
,-0.010
 Daisy-0.010
 Sally-0.009
 she-0.009
 her-0.009
TokenI
Token ID40
Feature activation-3.17e-02

Pos logit contributions
ophone+0.094
ician+0.087
utes+0.087
ep+0.086
cles+0.085
Neg logit contributions
 she-0.063
 She-0.063
 her-0.061
 the-0.057
,-0.056
Token don
Token ID836
Feature activation-3.10e-02

Pos logit contributions
 Daisy+0.045
 Sally+0.039
 berries+0.038
 her+0.037
 Polly+0.036
Neg logit contributions
ep-0.076
ophone-0.072
anas-0.070
aker-0.070
ber-0.070
Token't
Token ID470
Feature activation-4.35e-03

Pos logit contributions
 Daisy+0.043
,+0.041
 Sally+0.040
 her+0.038
 berries+0.038
Neg logit contributions
ep-0.073
we-0.069
ophone-0.069
utes-0.069
ician-0.068
Token want
Token ID765
Feature activation-3.26e-02

Pos logit contributions
 Daisy+0.005
,+0.004
 berries+0.004
 she+0.004
 She+0.004
Neg logit contributions
ep-0.011
ophone-0.011
ician-0.011
aker-0.011
anas-0.010
Token to
Token ID284
Feature activation-2.60e-03

Pos logit contributions
 Daisy+0.042
,+0.040
 Sally+0.039
 She+0.038
 she+0.037
Neg logit contributions
ep-0.074
we-0.071
ophone-0.071
ician-0.069
utes-0.069
Token that
Token ID326
Feature activation-8.89e-03

Pos logit contributions
 Daisy+0.091
 berries+0.087
,+0.085
 are+0.084
 She+0.084
Neg logit contributions
ep-0.180
ician-0.169
utes-0.168
ophone-0.166
we-0.165
Token day
Token ID1110
Feature activation+2.82e-03

Pos logit contributions
 Daisy+0.012
 berries+0.012
 Sally+0.012
,+0.011
 her+0.011
Neg logit contributions
ep-0.019
ophone-0.019
ician-0.019
utes-0.018
we-0.018
Token on
Token ID319
Feature activation+7.43e-03

Pos logit contributions
ep+0.007
ophone+0.006
utes+0.006
ician+0.006
we+0.006
Neg logit contributions
 Daisy-0.004
 berries-0.004
,-0.004
 Sally-0.004
 she-0.003
Token Sarah
Token ID10490
Feature activation+2.96e-02

Pos logit contributions
ep+0.017
we+0.017
utes+0.017
James+0.017
ophone+0.017
Neg logit contributions
 berries-0.006
 taste-0.005
 Gig-0.004
 smell-0.004
 bloss-0.003
Token was
Token ID373
Feature activation-2.07e-02

Pos logit contributions
ep+0.061
ophone+0.059
ician+0.057
we+0.057
anas+0.056
Neg logit contributions
 Daisy-0.047
 Sally-0.044
 berries-0.044
 Freddy-0.040
,-0.039
Token very
Token ID845
Feature activation+5.65e-02

Pos logit contributions
 Daisy+0.033
,+0.032
 berries+0.031
 Sally+0.030
 She+0.029
Neg logit contributions
ep-0.044
ophone-0.040
ician-0.040
we-0.039
aker-0.038
Token careful
Token ID8161
Feature activation+3.53e-02

Pos logit contributions
ep+0.091
ophone+0.085
ician+0.085
ber+0.084
utes+0.083
Neg logit contributions
 Daisy-0.115
 berries-0.110
,-0.109
 Sally-0.107
 her-0.101
Token when
Token ID618
Feature activation-1.58e-02

Pos logit contributions
ep+0.091
ophone+0.087
we+0.087
ician+0.087
utes+0.086
Neg logit contributions
 Daisy-0.041
,-0.037
 Sally-0.036
 berries-0.035
 her-0.033
Token she
Token ID673
Feature activation-2.33e-02

Pos logit contributions
 Daisy+0.020
 Sally+0.019
 berries+0.019
 Everyday+0.018
 She+0.018
Neg logit contributions
we-0.035
ep-0.032
utes-0.031
ber-0.030
ician-0.030
Token saw
Token ID2497
Feature activation+5.53e-02

Pos logit contributions
 Daisy+0.034
 Sally+0.032
,+0.030
 berries+0.029
 her+0.029
Neg logit contributions
ep-0.054
ophone-0.051
we-0.050
ician-0.050
anas-0.049
Token the
Token ID262
Feature activation-1.05e-02

Pos logit contributions
ep+0.138
we+0.133
ophone+0.131
ician+0.129
utes+0.127
Neg logit contributions
 Daisy-0.054
,-0.051
 She-0.049
 Sally-0.048
 Freddy-0.045
Token v
Token ID410
Feature activation+2.02e-02

Pos logit contributions
 Daisy+0.043
 Sally+0.040
 She+0.038
,+0.037
 she+0.037
Neg logit contributions
ep-0.054
ophone-0.050
ician-0.049
we-0.048
aker-0.048
Tokenase
Token ID589
Feature activation+3.87e-02

Pos logit contributions
ep+0.027
ician+0.027
anas+0.025
ophone+0.025
aghetti+0.024
Neg logit contributions
,-0.048
 Daisy-0.044
 Sally-0.044
 her-0.043
 She-0.043
Token on
Token ID319
Feature activation+1.87e-02

Pos logit contributions
ep+0.099
ophone+0.094
we+0.093
aker+0.088
washer+0.088
Neg logit contributions
 Daisy-0.049
 Sally-0.043
 berries-0.042
 Sophie-0.038
 Gig-0.038
Token her
Token ID607
Feature activation-1.44e-02

Pos logit contributions
ep+0.054
ician+0.050
ophone+0.050
utes+0.049
we+0.049
Neg logit contributions
 Daisy-0.014
 berries-0.012
,-0.011
 Sally-0.011
 she-0.010
Token mom
Token ID1995
Feature activation-1.67e-03

Pos logit contributions
 Daisy+0.027
,+0.026
 Sally+0.025
 she+0.024
 her+0.024
Neg logit contributions
ep-0.028
ophone-0.026
ician-0.026
we-0.025
utes-0.025
Tokenmy
Token ID1820
Feature activation+3.61e-02

Pos logit contributions
 Daisy+0.003
 Sally+0.003
 berries+0.002
,+0.002
 her+0.002
Neg logit contributions
ep-0.004
ophone-0.003
we-0.003
ician-0.003
anas-0.003
Token's
Token ID338
Feature activation+4.65e-02

Pos logit contributions
ep+0.082
ophone+0.078
we+0.076
anas+0.075
ician+0.075
Neg logit contributions
 Daisy-0.053
 Sally-0.050
 berries-0.048
,-0.046
 Freddy-0.044
Token table
Token ID3084
Feature activation+1.96e-02

Pos logit contributions
ep+0.093
ician+0.087
ophone+0.086
we+0.086
utes+0.086
Neg logit contributions
 Daisy-0.083
,-0.080
 Sally-0.078
 she-0.075
 her-0.073
Token.
Token ID13
Feature activation+3.85e-03

Pos logit contributions
ep+0.048
ophone+0.045
we+0.045
aker+0.044
anas+0.044
Neg logit contributions
 Daisy-0.026
 Sally-0.023
 berries-0.021
 Gig-0.020
 Freddy-0.020
Token Lily
Token ID20037
Feature activation-3.06e-02

Pos logit contributions
ep+0.011
ophone+0.011
ician+0.011
utes+0.011
we+0.011
Neg logit contributions
,-0.003
 Daisy-0.002
 berries-0.002
 her-0.002
 Sally-0.002
Token wanted
Token ID2227
Feature activation-3.08e-02

Pos logit contributions
 Daisy+0.046
 Sally+0.043
,+0.042
 berries+0.040
 she+0.040
Neg logit contributions
ep-0.071
ophone-0.067
ician-0.065
anas-0.064
we-0.063
Token They
Token ID1119
Feature activation-8.75e-03

Pos logit contributions
ep+0.035
ophone+0.034
ician+0.033
utes+0.033
anas+0.033
Neg logit contributions
,-0.015
 Daisy-0.015
 her-0.013
 Sally-0.013
 berries-0.013
Token felt
Token ID2936
Feature activation-8.94e-03

Pos logit contributions
 Daisy+0.013
 Sally+0.012
,+0.012
 She+0.011
 berries+0.011
Neg logit contributions
ep-0.021
ophone-0.019
ician-0.018
anas-0.018
ber-0.018
Token the
Token ID262
Feature activation-1.43e-03

Pos logit contributions
 Daisy+0.013
 Sally+0.013
 berries+0.013
berries+0.012
 Betty+0.012
Neg logit contributions
ep-0.016
ophone-0.014
eping-0.014
etr-0.014
aker-0.014
Token grass
Token ID8701
Feature activation-2.97e-02

Pos logit contributions
 Daisy+0.002
,+0.002
 Sally+0.002
 she+0.002
 her+0.002
Neg logit contributions
ep-0.003
ophone-0.003
ician-0.003
utes-0.003
we-0.003
Token under
Token ID739
Feature activation-1.07e-02

Pos logit contributions
 Daisy+0.041
 Sally+0.039
 Sophie+0.035
 her+0.032
 Polly+0.032
Neg logit contributions
ep-0.071
we-0.067
ophone-0.064
utes-0.063
aker-0.062
Token their
Token ID511
Feature activation+4.11e-02

Pos logit contributions
 Daisy+0.011
 berries+0.010
,+0.010
 she+0.010
 Sally+0.010
Neg logit contributions
ep-0.027
ician-0.025
we-0.024
anas-0.024
utes-0.024
Token feet
Token ID3625
Feature activation-5.59e-03

Pos logit contributions
ep+0.088
ician+0.083
ophone+0.083
we+0.082
utes+0.082
Neg logit contributions
 Daisy-0.066
,-0.063
 Sally-0.062
 she-0.059
 her-0.058
Token and
Token ID290
Feature activation+3.55e-03

Pos logit contributions
 Daisy+0.007
 Sally+0.006
 berries+0.006
 Freddy+0.005
 her+0.005
Neg logit contributions
ep-0.014
ophone-0.014
we-0.013
anas-0.013
ician-0.013
Token the
Token ID262
Feature activation-1.52e-03

Pos logit contributions
ep+0.008
we+0.007
ophone+0.007
glass+0.007
anas+0.007
Neg logit contributions
 Daisy-0.005
 Everyday-0.004
 Sally-0.004
 She-0.004
 Sophie-0.004
Token wind
Token ID2344
Feature activation-2.79e-02

Pos logit contributions
 Daisy+0.002
,+0.002
 Sally+0.002
 her+0.002
 she+0.002
Neg logit contributions
ep-0.004
we-0.003
ician-0.003
ophone-0.003
utes-0.003
Token in
Token ID287
Feature activation+3.28e-02

Pos logit contributions
 Daisy+0.034
 Sally+0.030
,+0.029
 berries+0.028
 her+0.027
Neg logit contributions
ep-0.073
we-0.070
ophone-0.069
ician-0.068
utes-0.067
Token pond
Token ID16723
Feature activation-1.55e-02

Pos logit contributions
 Daisy+0.003
,+0.003
 Sally+0.003
 she+0.002
 her+0.002
Neg logit contributions
ep-0.004
ician-0.004
ophone-0.004
we-0.004
utes-0.004
Token,
Token ID11
Feature activation-6.15e-03

Pos logit contributions
 Daisy+0.008
 berries+0.007
 Cherry+0.006
 Sally+0.006
 She+0.005
Neg logit contributions
ep-0.044
washer-0.042
ophone-0.042
ber-0.041
we-0.041
Token the
Token ID262
Feature activation-1.49e-03

Pos logit contributions
 Daisy+0.005
 She+0.005
,+0.005
 Sally+0.005
 berries+0.005
Neg logit contributions
ep-0.016
we-0.015
ber-0.015
ophone-0.015
utes-0.014
Token goose
Token ID37246
Feature activation-7.52e-03

Pos logit contributions
 Daisy+0.002
,+0.002
 Sally+0.002
 she+0.002
 her+0.002
Neg logit contributions
ep-0.003
ician-0.003
ophone-0.003
we-0.003
utes-0.003
Token started
Token ID2067
Feature activation+8.08e-03

Pos logit contributions
 Daisy+0.010
 Sally+0.009
 her+0.009
,+0.009
 she+0.009
Neg logit contributions
ep-0.019
ophone-0.017
we-0.017
ician-0.017
anas-0.017
Token swimming
Token ID14899
Feature activation+3.76e-02

Pos logit contributions
ep+0.021
ophone+0.020
ician+0.020
utes+0.019
we+0.019
Neg logit contributions
,-0.010
 Daisy-0.010
 Sally-0.009
 her-0.009
 she-0.009
Token in
Token ID287
Feature activation+3.80e-02

Pos logit contributions
ep+0.096
ophone+0.091
we+0.090
ician+0.089
utes+0.089
Neg logit contributions
 Daisy-0.043
 Sally-0.039
 she-0.038
,-0.038
 her-0.037
Token the
Token ID262
Feature activation-1.60e-04

Pos logit contributions
ep+0.102
ician+0.091
we+0.090
etr+0.090
and+0.089
Neg logit contributions
 Daisy-0.028
 she-0.027
 berries-0.025
 Sally-0.024
 She-0.024
Token clear
Token ID1598
Feature activation-2.16e-02

Pos logit contributions
 Daisy+0.000
,+0.000
 Sally+0.000
 she+0.000
 her+0.000
Neg logit contributions
ep-0.000
ophone-0.000
ician-0.000
we-0.000
utes-0.000
Token,
Token ID11
Feature activation-5.64e-03

Pos logit contributions
 Daisy+0.033
 she+0.032
 Cherry+0.031
 her+0.030
 Sally+0.029
Neg logit contributions
ep-0.044
aker-0.039
utes-0.038
cles-0.036
ades-0.035
Token cool
Token ID3608
Feature activation+7.80e-03

Pos logit contributions
 Daisy+0.008
,+0.007
 Sally+0.007
 her+0.007
 she+0.007
Neg logit contributions
ep-0.013
ophone-0.012
ician-0.012
we-0.012
utes-0.012
Token the
Token ID262
Feature activation-6.07e-03

Pos logit contributions
ep+0.082
ician+0.073
we+0.073
etr+0.073
aker+0.072
Neg logit contributions
 Daisy-0.027
 berries-0.024
 she-0.023
 She-0.022
 Sally-0.022
Token mud
Token ID17492
Feature activation-3.10e-02

Pos logit contributions
 Daisy+0.009
,+0.008
 she+0.008
 her+0.007
 Sally+0.007
Neg logit contributions
ep-0.014
ophone-0.013
we-0.012
ician-0.012
utes-0.012
Token.
Token ID13
Feature activation+3.97e-03

Pos logit contributions
 Daisy+0.036
 Sally+0.031
,+0.030
 her+0.029
 she+0.029
Neg logit contributions
ep-0.081
we-0.077
ophone-0.076
utes-0.075
ician-0.074
Token It
Token ID632
Feature activation-1.74e-02

Pos logit contributions
ep+0.009
ophone+0.009
ician+0.009
anas+0.009
utes+0.008
Neg logit contributions
 Daisy-0.005
,-0.005
 berries-0.005
 her-0.005
 she-0.005
Token was
Token ID373
Feature activation-1.43e-02

Pos logit contributions
 Daisy+0.021
,+0.020
 Sally+0.020
 berries+0.018
 her+0.018
Neg logit contributions
ep-0.045
ophone-0.042
we-0.042
ician-0.040
etr-0.040
Token so
Token ID523
Feature activation+9.77e-03

Pos logit contributions
 Daisy+0.021
 berries+0.020
 She+0.020
 she+0.020
als+0.019
Neg logit contributions
ep-0.032
eping-0.026
aker-0.024
ophone-0.024
ician-0.023
Token much
Token ID881
Feature activation-1.50e-03

Pos logit contributions
ep+0.020
ophone+0.018
utes+0.018
aghetti+0.017
aker+0.017
Neg logit contributions
 Daisy-0.017
,-0.016
 berries-0.016
 Sally-0.015
 her-0.015
Token fun
Token ID1257
Feature activation-2.53e-03

Pos logit contributions
 Daisy+0.003
 She+0.003
 her+0.003
 berries+0.003
 Abby+0.003
Neg logit contributions
ep-0.002
aghetti-0.002
utes-0.002
we-0.002
aker-0.002
Token!
Token ID0
Feature activation+1.54e-02

Pos logit contributions
 Daisy+0.002
,+0.002
 Sally+0.002
 her+0.002
 berries+0.002
Neg logit contributions
ep-0.007
we-0.007
ophone-0.007
ician-0.007
utes-0.007
Token Tim
Token ID5045
Feature activation+8.83e-03

Pos logit contributions
ep+0.037
ophone+0.037
ician+0.036
anas+0.036
we+0.036
Neg logit contributions
 Daisy-0.018
,-0.017
 berries-0.016
 her-0.016
 she-0.015
Tokenmy
Token ID1820
Feature activation+3.72e-02

Pos logit contributions
ep+0.022
ophone+0.021
ician+0.021
utes+0.021
we+0.021
Neg logit contributions
,-0.011
 Daisy-0.011
 Sally-0.010
 her-0.010
 she-0.010
Token but
Token ID475
Feature activation+4.47e-03

Pos logit contributions
 Daisy+0.034
,+0.033
 Sally+0.030
 her+0.029
 She+0.028
Neg logit contributions
ep-0.058
we-0.055
ophone-0.054
anas-0.054
ician-0.053
Token this
Token ID428
Feature activation-2.37e-02

Pos logit contributions
ep+0.011
ber+0.010
ophone+0.010
we+0.010
ician+0.009
Neg logit contributions
 taste-0.004
 berries-0.004
 Everyday-0.004
 Daisy-0.004
 smell-0.004
Token popcorn
Token ID38914
Feature activation+2.51e-02

Pos logit contributions
,+0.025
 Daisy+0.025
 Sally+0.022
 her+0.021
 she+0.021
Neg logit contributions
ep-0.064
ophone-0.062
ician-0.062
utes-0.061
we-0.061
Token was
Token ID373
Feature activation-2.55e-02

Pos logit contributions
ep+0.062
we+0.060
ophone+0.059
anas+0.058
aker+0.057
Neg logit contributions
 Daisy-0.031
 Sally-0.028
,-0.028
 her-0.027
 She-0.027
Token disgusting
Token ID23374
Feature activation-5.37e-02

Pos logit contributions
 Daisy+0.039
,+0.037
 She+0.036
 Sally+0.035
 Sophie+0.033
Neg logit contributions
ep-0.055
aker-0.047
ophone-0.047
we-0.046
aghetti-0.046
Token!
Token ID0
Feature activation+4.68e-03

Pos logit contributions
 Daisy+0.069
 Sally+0.061
 Everyday+0.057
 She+0.056
 Gig+0.055
Neg logit contributions
we-0.127
ep-0.123
aghetti-0.119
aker-0.117
ophone-0.115
Token It
Token ID632
Feature activation-2.57e-02

Pos logit contributions
ophone+0.011
utes+0.011
ep+0.011
ician+0.011
we+0.011
Neg logit contributions
,-0.005
 Daisy-0.005
 berries-0.005
 her-0.004
 from-0.004
Token tasted
Token ID29187
Feature activation-6.39e-02

Pos logit contributions
 Daisy+0.035
,+0.033
 Sally+0.032
 Sophie+0.030
 She+0.030
Neg logit contributions
ep-0.061
we-0.057
ophone-0.055
ician-0.053
etr-0.053
Token y
Token ID331
Feature activation-1.64e-02

Pos logit contributions
 Daisy+0.103
 Sally+0.093
 Sophie+0.090
 She+0.090
 Cherry+0.089
Neg logit contributions
ep-0.121
aker-0.109
we-0.107
aghetti-0.104
eping-0.101
Tokenucky
Token ID5309
Feature activation-2.24e-02

Pos logit contributions
 Daisy+0.023
 she+0.022
,+0.021
 her+0.021
 Sally+0.020
Neg logit contributions
we-0.035
aghetti-0.035
aker-0.035
ep-0.034
anas-0.034
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
 Daisy+0.025
 Sally+0.022
 She+0.021
 Everyday+0.020
 her+0.019
Neg logit contributions
we-0.056
ep-0.056
anas-0.055
aker-0.054
aghetti-0.053
Token peanut
Token ID26636
Feature activation-5.70e-02

Pos logit contributions
ep+0.001
we+0.001
ician+0.001
aker+0.001
anas+0.001
Neg logit contributions
 Daisy-0.001
 her-0.001
 She-0.001
,-0.001
 she-0.001
Token rolled
Token ID11686
Feature activation-1.63e-02

Pos logit contributions
 Daisy+0.073
 Sally+0.070
 She+0.067
 her+0.064
 Sophie+0.063
Neg logit contributions
ep-0.145
we-0.132
aker-0.124
ophone-0.123
anas-0.123
Token closer
Token ID5699
Feature activation-9.66e-03

Pos logit contributions
 Daisy+0.021
 she+0.020
 Everyday+0.019
 Sophie+0.019
 taste+0.019
Neg logit contributions
ep-0.035
ike-0.031
we-0.031
plug-0.031
and-0.031
Token and
Token ID290
Feature activation+5.02e-03

Pos logit contributions
 Daisy+0.011
 Sally+0.010
 berries+0.009
 Sophie+0.009
 Polly+0.008
Neg logit contributions
ep-0.023
we-0.023
and-0.022
ophone-0.022
ician-0.021
Token closer
Token ID5699
Feature activation-8.86e-03

Pos logit contributions
ep+0.013
ophone+0.013
we+0.013
ician+0.012
utes+0.012
Neg logit contributions
 Daisy-0.005
,-0.005
 Sally-0.005
 her-0.005
 She-0.005
Token and
Token ID290
Feature activation+5.40e-03

Pos logit contributions
 Daisy+0.009
 Sally+0.008
 berries+0.008
,+0.007
 her+0.007
Neg logit contributions
ep-0.023
we-0.023
ophone-0.022
ician-0.022
utes-0.021
Token it
Token ID340
Feature activation-4.52e-03

Pos logit contributions
ep+0.014
ophone+0.014
we+0.014
ician+0.013
utes+0.013
Neg logit contributions
 Daisy-0.006
,-0.006
 Sally-0.005
 berries-0.005
 her-0.005
Token saw
Token ID2497
Feature activation+6.81e-02

Pos logit contributions
 Daisy+0.005
,+0.005
 Sally+0.005
 her+0.004
 she+0.004
Neg logit contributions
ep-0.012
ophone-0.011
ician-0.011
we-0.011
utes-0.011
Token a
Token ID257
Feature activation+1.25e-02

Pos logit contributions
ep+0.167
we+0.159
ophone+0.154
ician+0.149
utes+0.149
Neg logit contributions
 Daisy-0.081
,-0.073
 She-0.070
 Sally-0.069
 she-0.069
Token fish
Token ID5916
Feature activation+9.30e-03

Pos logit contributions
ep+0.030
ician+0.028
ophone+0.028
utes+0.028
we+0.028
Neg logit contributions
 Daisy-0.017
,-0.017
 Sally-0.016
 her-0.016
 she-0.015
Token!
Token ID0
Feature activation+1.86e-02

Pos logit contributions
ep+0.022
we+0.021
ophone+0.021
ician+0.021
utes+0.020
Neg logit contributions
 Daisy-0.013
 Sally-0.012
,-0.011
 her-0.011
 She-0.011
Token ocean
Token ID9151
Feature activation+3.86e-02

Pos logit contributions
 Daisy+0.013
,+0.013
 Sally+0.012
 she+0.012
 her+0.012
Neg logit contributions
ep-0.022
ophone-0.021
ician-0.021
utes-0.021
we-0.021
Token with
Token ID351
Feature activation-1.78e-02

Pos logit contributions
ep+0.114
ician+0.111
ophone+0.111
utes+0.110
we+0.109
Neg logit contributions
,-0.036
 Daisy-0.029
 she-0.026
 her-0.026
 Sally-0.026
Token some
Token ID617
Feature activation-5.17e-02

Pos logit contributions
 Daisy+0.028
,+0.028
 she+0.026
 She+0.026
 berries+0.025
Neg logit contributions
ep-0.037
ophone-0.033
ician-0.033
etr-0.032
we-0.032
Token other
Token ID584
Feature activation+4.45e-02

Pos logit contributions
 Daisy+0.069
 Sally+0.064
 She+0.061
,+0.061
 she+0.059
Neg logit contributions
ep-0.122
utes-0.117
we-0.114
ophone-0.113
aker-0.111
Token fishes
Token ID49765
Feature activation+5.00e-03

Pos logit contributions
ep+0.088
ophone+0.083
we+0.083
anas+0.081
utes+0.081
Neg logit contributions
 Daisy-0.075
 she-0.071
 Sally-0.070
,-0.070
 She-0.069
Token.
Token ID13
Feature activation-2.81e-03

Pos logit contributions
ep+0.013
ophone+0.012
we+0.012
utes+0.012
ician+0.012
Neg logit contributions
 Daisy-0.006
 Sally-0.006
,-0.005
 her-0.005
 She-0.005
Token They
Token ID1119
Feature activation-1.65e-02

Pos logit contributions
 Daisy+0.003
,+0.003
 Sally+0.003
 her+0.003
 she+0.003
Neg logit contributions
ep-0.007
ophone-0.007
ician-0.007
utes-0.007
anas-0.007
Token all
Token ID477
Feature activation+2.46e-02

Pos logit contributions
 Daisy+0.022
,+0.021
 Sally+0.020
 she+0.019
 her+0.019
Neg logit contributions
ep-0.041
ophone-0.039
ician-0.038
utes-0.038
we-0.038
Token jumped
Token ID11687
Feature activation-1.06e-02

Pos logit contributions
ep+0.060
ophone+0.057
ician+0.057
we+0.057
utes+0.057
Neg logit contributions
 Daisy-0.033
,-0.033
 Sally-0.030
 her-0.029
 she-0.029
Token into
Token ID656
Feature activation-1.53e-02

Pos logit contributions
 Daisy+0.012
 Sally+0.011
 she+0.011
 her+0.011
 She+0.010
Neg logit contributions
ep-0.027
utes-0.026
ophone-0.026
we-0.025
ician-0.024
Token the
Token ID262
Feature activation-4.36e-03

Pos logit contributions
 Daisy+0.009
 she+0.008
,+0.008
 She+0.008
 Sally+0.008
Neg logit contributions
ep-0.046
utes-0.043
we-0.043
ician-0.043
etr-0.043
Token me
Token ID502
Feature activation-3.89e-02

Pos logit contributions
 Daisy+0.025
,+0.024
 Sally+0.023
 she+0.023
 berries+0.022
Neg logit contributions
ep-0.049
ophone-0.047
ician-0.046
we-0.046
utes-0.045
Token?"
Token ID1701
Feature activation-2.26e-02

Pos logit contributions
 Daisy+0.053
,+0.053
 Sally+0.049
 her+0.047
 she+0.047
Neg logit contributions
ep-0.093
ophone-0.090
ician-0.089
we-0.088
utes-0.088
Token Max
Token ID5436
Feature activation+3.92e-02

Pos logit contributions
 Daisy+0.028
,+0.027
 berries+0.026
 Sally+0.025
 she+0.023
Neg logit contributions
ep-0.055
ophone-0.052
ician-0.052
we-0.051
utes-0.050
Token asked
Token ID1965
Feature activation-1.57e-02

Pos logit contributions
ep+0.106
ophone+0.102
ician+0.101
we+0.100
utes+0.099
Neg logit contributions
 Daisy-0.041
,-0.040
 Sally-0.036
 she-0.035
 her-0.035
Token.
Token ID13
Feature activation-4.53e-03

Pos logit contributions
 Daisy+0.016
 berries+0.015
,+0.014
 Sally+0.014
 she+0.013
Neg logit contributions
ep-0.041
ophone-0.039
ician-0.038
we-0.038
etr-0.037
Token
Token ID198
Feature activation-3.31e-03

Pos logit contributions
 Daisy+0.004
,+0.004
 Sally+0.004
 her+0.004
 she+0.004
Neg logit contributions
ep-0.012
ophone-0.012
ician-0.012
we-0.012
utes-0.012
Token
Token ID198
Feature activation-2.94e-03

Pos logit contributions
,+0.003
 Daisy+0.003
 Sally+0.003
 her+0.003
 she+0.003
Neg logit contributions
ep-0.009
ophone-0.009
ician-0.009
we-0.009
utes-0.009
Token"
Token ID1
Feature activation-1.33e-02

Pos logit contributions
,+0.004
 Daisy+0.004
 she+0.003
 her+0.003
 Sally+0.003
Neg logit contributions
ep-0.007
ophone-0.007
utes-0.007
we-0.007
ician-0.007
TokenYes
Token ID5297
Feature activation-3.92e-03

Pos logit contributions
 Daisy+0.023
 she+0.023
,+0.022
 She+0.022
 her+0.022
Neg logit contributions
ep-0.029
ophone-0.027
ician-0.027
anas-0.026
aker-0.026
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
 Daisy+0.001
 berries+0.001
 her+0.001
 Sally+0.001
 She+0.000
Neg logit contributions
ep-0.013
ophone-0.012
ician-0.012
anas-0.012
etr-0.012
Token I
Token ID314
Feature activation-2.11e-02

Pos logit contributions
 Daisy+0.017
,+0.017
 Sally+0.015
 her+0.015
 she+0.015
Neg logit contributions
ep-0.034
ophone-0.033
ician-0.033
utes-0.032
we-0.032
Token a
Token ID257
Feature activation-1.21e-03

Pos logit contributions
ep+0.094
we+0.088
utes+0.088
ophone+0.087
ber+0.086
Neg logit contributions
 Daisy-0.041
,-0.035
 Sally-0.034
 berries-0.030
 Freddy-0.028
Token big
Token ID1263
Feature activation-1.14e-02

Pos logit contributions
,+0.002
 Daisy+0.002
 Sally+0.002
 she+0.001
 her+0.001
Neg logit contributions
ep-0.003
ophone-0.003
ician-0.003
we-0.003
utes-0.003
Token table
Token ID3084
Feature activation+1.68e-02

Pos logit contributions
 Daisy+0.021
 Sally+0.019
 her+0.019
 she+0.019
 She+0.019
Neg logit contributions
ep-0.022
ophone-0.019
ician-0.019
utes-0.019
anas-0.019
Token with
Token ID351
Feature activation-2.28e-02

Pos logit contributions
ep+0.041
we+0.038
ophone+0.038
ician+0.038
utes+0.038
Neg logit contributions
 Daisy-0.023
 Sally-0.021
,-0.020
 berries-0.019
 her-0.019
Token many
Token ID867
Feature activation+3.91e-02

Pos logit contributions
 Daisy+0.031
,+0.030
 Sally+0.028
 she+0.026
 She+0.026
Neg logit contributions
ep-0.053
ophone-0.049
we-0.048
utes-0.048
ician-0.047
Token bottles
Token ID14096
Feature activation-8.63e-02

Pos logit contributions
ep+0.076
we+0.069
ophone+0.069
utes+0.068
anas+0.068
Neg logit contributions
 Daisy-0.073
 Sally-0.068
,-0.066
 she-0.064
 her-0.063
Token.
Token ID13
Feature activation+3.98e-03

Pos logit contributions
 Daisy+0.123
 Sally+0.104
 her+0.095
 Sophie+0.093
 Abby+0.093
Neg logit contributions
ep-0.201
washer-0.193
we-0.190
utes-0.190
aker-0.190
Token The
Token ID383
Feature activation-9.04e-04

Pos logit contributions
utes+0.011
ep+0.011
ber+0.011
anas+0.011
ophone+0.011
Neg logit contributions
 Daisy-0.002
,-0.002
 Sally-0.002
 end-0.002
 berries-0.002
Token bottles
Token ID14096
Feature activation-8.58e-02

Pos logit contributions
 Daisy+0.002
,+0.002
 Sally+0.002
 her+0.001
 she+0.001
Neg logit contributions
ep-0.002
utes-0.002
ophone-0.002
we-0.002
ician-0.002
Token have
Token ID423
Feature activation+1.73e-03

Pos logit contributions
 Daisy+0.098
 Gig+0.081
 Sally+0.080
 Sophie+0.071
 Ginny+0.071
Neg logit contributions
ep-0.225
washer-0.215
utes-0.209
ades-0.205
we-0.200
Token different
Token ID1180
Feature activation+2.37e-02

Pos logit contributions
ep+0.004
ophone+0.004
utes+0.004
we+0.004
ician+0.004
Neg logit contributions
 Daisy-0.002
,-0.002
 Sally-0.002
 she-0.002
 her-0.002
Token.
Token ID13
Feature activation-7.95e-05

Pos logit contributions
 Daisy+0.035
 Sally+0.032
 berries+0.031
,+0.029
 She+0.029
Neg logit contributions
ep-0.074
ophone-0.072
we-0.071
ician-0.068
utes-0.068
Token Betty
Token ID29504
Feature activation-6.93e-03

Pos logit contributions
,+0.000
 berries+0.000
 smell+0.000
 her+0.000
 taste+0.000
Neg logit contributions
anas-0.000
utes-0.000
ician-0.000
ophone-0.000
we-0.000
Token be
Token ID307
Feature activation+2.18e-03

Pos logit contributions
 Daisy+0.010
 Sally+0.009
,+0.009
 berries+0.008
 she+0.008
Neg logit contributions
ep-0.017
ophone-0.015
ician-0.015
we-0.015
anas-0.015
Tokenamed
Token ID2434
Feature activation+6.09e-03

Pos logit contributions
ep+0.004
ician+0.003
ophone+0.003
we+0.003
utes+0.003
Neg logit contributions
 Daisy-0.005
,-0.005
 Sally-0.005
 she-0.004
 berries-0.004
Token with
Token ID351
Feature activation-2.33e-02

Pos logit contributions
ep+0.014
ician+0.013
we+0.013
ber+0.012
ophone+0.012
Neg logit contributions
 Daisy-0.008
 berries-0.008
 Sally-0.008
 are-0.007
,-0.007
Token happiness
Token ID12157
Feature activation-5.91e-02

Pos logit contributions
,+0.035
 berries+0.033
 she+0.032
 Sally+0.031
 Daisy+0.030
Neg logit contributions
ep-0.037
etr-0.036
ician-0.034
James-0.034
and-0.033
Token and
Token ID290
Feature activation-3.95e-03

Pos logit contributions
 Daisy+0.052
 Sally+0.050
 berries+0.046
 She+0.040
 Freddy+0.039
Neg logit contributions
ep-0.156
we-0.154
anas-0.151
ician-0.150
ophone-0.150
Token skipped
Token ID26684
Feature activation-3.49e-02

Pos logit contributions
 berries+0.004
,+0.004
 Daisy+0.004
 She+0.004
 end+0.004
Neg logit contributions
ep-0.009
we-0.008
ophone-0.008
ician-0.008
ber-0.008
Token all
Token ID477
Feature activation+1.34e-02

Pos logit contributions
 berries+0.042
 Daisy+0.041
 Sally+0.039
,+0.037
 she+0.036
Neg logit contributions
ep-0.082
utes-0.080
ophone-0.079
ician-0.076
we-0.076
Token the
Token ID262
Feature activation-8.09e-03

Pos logit contributions
ep+0.036
ophone+0.035
we+0.034
utes+0.034
ician+0.033
Neg logit contributions
 berries-0.013
 Daisy-0.012
 Sally-0.010
,-0.010
 smell-0.009
Token way
Token ID835
Feature activation+2.25e-02

Pos logit contributions
 Daisy+0.010
,+0.010
 Sally+0.009
 she+0.009
 her+0.009
Neg logit contributions
ep-0.020
ophone-0.019
ician-0.019
we-0.019
utes-0.019
Token she
Token ID673
Feature activation-2.34e-02

Pos logit contributions
 Daisy+0.005
 Freddy+0.004
 berries+0.004
 end+0.004
 Sally+0.004
Neg logit contributions
we-0.015
ep-0.014
ophone-0.014
ber-0.014
ician-0.013
Token was
Token ID373
Feature activation-1.69e-02

Pos logit contributions
 Daisy+0.034
 Sally+0.032
,+0.031
 berries+0.029
 her+0.028
Neg logit contributions
ep-0.055
ophone-0.051
ician-0.050
we-0.050
ber-0.049
Token so
Token ID523
Feature activation+2.74e-03

Pos logit contributions
 berries+0.026
 Daisy+0.026
 Sally+0.025
,+0.025
 she+0.024
Neg logit contributions
ep-0.035
ician-0.032
ophone-0.032
we-0.030
ber-0.030
Token excited
Token ID6568
Feature activation+4.12e-03

Pos logit contributions
ep+0.005
ber+0.004
ician+0.004
utes+0.004
we+0.004
Neg logit contributions
 berries-0.005
 Daisy-0.005
 Sally-0.005
,-0.005
 end-0.005
Token.
Token ID13
Feature activation+2.93e-03

Pos logit contributions
we+0.009
ep+0.008
and+0.008
ophone+0.008
ician+0.008
Neg logit contributions
 Daisy-0.005
 berries-0.005
 are-0.005
 Sally-0.005
 Everyday-0.005
Token She
Token ID1375
Feature activation-5.18e-02

Pos logit contributions
ophone+0.008
utes+0.008
ep+0.008
we+0.008
ician+0.008
Neg logit contributions
 Daisy-0.002
,-0.002
 berries-0.002
 her-0.002
 Sally-0.001
Token asked
Token ID1965
Feature activation-1.56e-02

Pos logit contributions
 Daisy+0.074
,+0.070
 Sally+0.069
 berries+0.065
 she+0.064
Neg logit contributions
ep-0.122
ophone-0.112
ician-0.111
anas-0.109
we-0.108
Token the
Token ID262
Feature activation-7.50e-03

Pos logit contributions
 Daisy+0.008
 berries+0.006
,+0.006
 Sally+0.006
 taste+0.005
Neg logit contributions
ep-0.046
we-0.046
ophone-0.045
ician-0.045
utes-0.044
Token fairy
Token ID25607
Feature activation-4.07e-02

Pos logit contributions
 Daisy+0.010
 Sally+0.010
,+0.010
 her+0.009
 She+0.008
Neg logit contributions
ep-0.017
we-0.017
ician-0.017
ophone-0.016
utes-0.016
Token what
Token ID644
Feature activation+3.44e-02

Pos logit contributions
 Daisy+0.042
 Sally+0.042
 taste+0.039
 smell+0.038
 berries+0.037
Neg logit contributions
ophone-0.099
ber-0.094
ep-0.092
ician-0.091
we-0.089
Token it
Token ID340
Feature activation-1.53e-02

Pos logit contributions
ep+0.094
we+0.093
ophone+0.091
ber+0.090
ician+0.088
Neg logit contributions
 Daisy-0.022
,-0.021
 berries-0.020
 Sally-0.019
 She-0.017
Token curious
Token ID11040
Feature activation+1.64e-02

Pos logit contributions
ep+0.025
ophone+0.024
ician+0.024
utes+0.024
we+0.024
Neg logit contributions
 Daisy-0.016
,-0.015
 Sally-0.015
 her-0.014
 she-0.014
Token and
Token ID290
Feature activation-4.07e-03

Pos logit contributions
ep+0.036
ophone+0.035
ician+0.034
utes+0.034
we+0.034
Neg logit contributions
 Daisy-0.025
,-0.025
 Sally-0.023
 she-0.023
 her-0.023
Token brave
Token ID14802
Feature activation+1.79e-02

Pos logit contributions
,+0.004
 Daisy+0.004
 berries+0.004
 Sally+0.004
 She+0.004
Neg logit contributions
ep-0.010
ician-0.010
ophone-0.009
ber-0.009
we-0.009
Token Min
Token ID1855
Feature activation+9.22e-02

Pos logit contributions
ep+0.040
ophone+0.038
ician+0.038
utes+0.037
we+0.037
Neg logit contributions
 Daisy-0.027
,-0.027
 Sally-0.025
 her-0.025
 she-0.024
Tokennie
Token ID11952
Feature activation+1.27e-01

Pos logit contributions
utes+0.139
ep+0.134
ophone+0.131
ician+0.125
aghetti+0.123
Neg logit contributions
,-0.218
 Daisy-0.204
 her-0.199
 she-0.196
 the-0.191
Token promised
Token ID8072
Feature activation-5.12e-02

Pos logit contributions
ep+0.312
ophone+0.291
ician+0.289
we+0.286
anas+0.283
Neg logit contributions
 Daisy-0.168
 Sally-0.157
,-0.149
 berries-0.145
 she-0.143
Token to
Token ID284
Feature activation-1.01e-03

Pos logit contributions
 Daisy+0.063
,+0.059
 berries+0.057
 Sally+0.056
 her+0.052
Neg logit contributions
ep-0.123
we-0.119
ician-0.118
ophone-0.117
utes-0.116
Token stay
Token ID2652
Feature activation+5.90e-04

Pos logit contributions
 Daisy+0.001
 berries+0.001
,+0.001
 Sally+0.001
 she+0.001
Neg logit contributions
ep-0.002
ician-0.002
anas-0.002
ophone-0.002
utes-0.002
Token safe
Token ID3338
Feature activation+6.22e-03

Pos logit contributions
ep+0.001
ician+0.001
ophone+0.001
utes+0.001
we+0.001
Neg logit contributions
 Daisy-0.001
,-0.001
 Sally-0.001
 she-0.001
 berries-0.001
Token and
Token ID290
Feature activation-3.46e-03

Pos logit contributions
ep+0.016
we+0.015
ician+0.015
utes+0.014
anas+0.014
Neg logit contributions
 Daisy-0.007
 Sally-0.007
 berries-0.006
 Sophie-0.006
 Freddy-0.006
Token take
Token ID1011
Feature activation-1.45e-02

Pos logit contributions
 Daisy+0.005
,+0.005
 berries+0.005
 Sally+0.005
 She+0.004
Neg logit contributions
ep-0.008
ician-0.007
ophone-0.007
we-0.007
anas-0.007
Token they
Token ID484
Feature activation+8.54e-03

Pos logit contributions
 Daisy+0.006
 berries+0.006
 She+0.006
 Sally+0.006
,+0.006
Neg logit contributions
ep-0.008
we-0.008
ician-0.008
ber-0.008
ophone-0.008
Token played
Token ID2826
Feature activation-1.06e-02

Pos logit contributions
ep+0.019
ophone+0.018
ician+0.017
washer+0.017
etr+0.017
Neg logit contributions
 Daisy-0.013
 Sally-0.012
,-0.011
 berries-0.011
 She-0.011
Token together
Token ID1978
Feature activation+5.35e-02

Pos logit contributions
 Daisy+0.019
,+0.018
 Sally+0.017
 she+0.017
 her+0.017
Neg logit contributions
ep-0.021
ophone-0.020
ician-0.020
utes-0.020
we-0.020
Token happily
Token ID18177
Feature activation+1.97e-02

Pos logit contributions
ep+0.128
ophone+0.122
we+0.122
ician+0.121
utes+0.121
Neg logit contributions
 Daisy-0.067
 Sally-0.064
 berries-0.060
 She-0.057
 she-0.056
Token.
Token ID13
Feature activation+6.29e-03

Pos logit contributions
ep+0.048
ophone+0.047
we+0.046
ician+0.046
utes+0.046
Neg logit contributions
 Daisy-0.026
,-0.024
 Sally-0.023
 her-0.022
 she-0.022
Token From
Token ID3574
Feature activation-6.65e-02

Pos logit contributions
ophone+0.014
anas+0.014
ician+0.013
ep+0.013
utes+0.013
Neg logit contributions
,-0.008
 Daisy-0.008
 berries-0.008
 her-0.007
 smell-0.007
Token then
Token ID788
Feature activation-4.64e-02

Pos logit contributions
 Daisy+0.091
 She+0.083
 berries+0.083
 Sally+0.081
,+0.080
Neg logit contributions
ep-0.153
utes-0.143
ician-0.140
ophone-0.137
ber-0.136
Token on
Token ID319
Feature activation+8.47e-03

Pos logit contributions
 Daisy+0.056
,+0.055
 Sally+0.051
 she+0.049
 her+0.049
Neg logit contributions
ep-0.118
ophone-0.113
ician-0.112
we-0.111
utes-0.111
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
ep+0.021
we+0.020
ber+0.020
anas+0.020
ggie+0.020
Neg logit contributions
 taste-0.006
 berries-0.005
 smell-0.005
 bloom-0.005
 hollow-0.005
Token they
Token ID484
Feature activation+9.16e-03

Pos logit contributions
 berries+0.014
 Daisy+0.014
,+0.013
 hollow+0.013
 aback+0.012
Neg logit contributions
we-0.030
ber-0.029
ep-0.028
anas-0.028
ician-0.028
Token built
Token ID3170
Feature activation+3.30e-02

Pos logit contributions
ep+0.020
ophone+0.019
ician+0.018
anas+0.018
etr+0.017
Neg logit contributions
 Daisy-0.014
 berries-0.013
 Sally-0.013
,-0.012
 she-0.012