TOP ACTIVATIONS
MAX = 4.693

The town spe ople had to find a
! The town spe ople were sad and scared
she bowed , the town spe ople stopped to admire her
," Be good , Holly ,. I 'll be back soon
. The town spe ople decided to ask their
Bra very and safety go together
. He asked the town spe ople what was wrong and
says they are thoughtful . Thought ful means they think a
and visit our animals . Free entry ." Mom and dad
. The town spe ople soon discovered where the
. The town spe ople were sad because they
. No matter how much encouraging his owner
happy and the nation pro spe red . <|endoftext|> Once upon
and icky . No matter how much he tried to
So , the town spe ople looked for Old Tom
had to help the town spe ople . He quickly formulated
The town spe ople thought and thought until
bell stopped . The town spe ople cheered and hugged each
But no matter how hard Lily tried ,
other villagers : " No matter what , together we can

MIN ACTIVATIONS
MIN = -5.104

He wrote z ig z ags on walls , floors ,
enjoyed writing z ig z ags . He wrote z ig
lines to z ig z ags . One day
named Lucy who loved r ags . She would always let
She loved to bring r ags from the kitchen around the
But the z ig z ags weren 't as easy as
it was difficult finding r ags and water . But Bob
Jack was turning the r ags , he looked out the
grabbed one of the r ags and started to spin it
around in z ig z ags . When Zig
and showed her the r ags . She said that she
mom bought some new r ags . When they got back
him to turn the r ags she kept in the closet
make many z ig z ags and circles and lines and
He is happy and w ags his tail . But Tim
icks her face and w ags his tail . He loves
is happy . He w ags his tail . He sits
them . The dog w ags its tail and makes a
dog . The dog w ags its tail . It looks
and warm ." Max w ags his tail and l icks

INTERVAL 2.244 - 4.693
CONTAINS 0.644%

The boy replied , " Yes ! Let 's play together
open up the luggage and spoil himself with candy . One
. The bird said " t weet tweet " and flew
When they reached the checkout , Alice 's mom picked
a little bird who didn 't save enough food for the

INTERVAL -0.205 - 2.244
CONTAINS 64.440%

one ." But Sue wanted the knife very
, L ila , you are so brave !" Ben said
She ordered them to make a pile on her bed .
. She and Tom chase after Spot . They
who is the prett iest girl ?" The mirror showed a

INTERVAL -2.655 - -0.205
CONTAINS 34.844%

cream . She picked up a piece of broccoli
. Tim my loved to brush his teeth every day before
watch ! The brave bear then decided to jump
They wished they had not locked the door .
. Lily lived in a small house on the side of

INTERVAL -5.104 - -2.655
CONTAINS 0.072%

But always with me . OK ?" " OK
mother . We love g aj ar ka hal wa .
sailed away in the kay ak , never to see the
see rings , neck l aces , brace lets and crown
very dark and the oy ster started to glow . Suddenly
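
The four INTERVAL ranges above look like equal-width bins between MIN and MAX. The following is a minimal sketch under that assumption, not the dashboard's own code: the helper name `equal_width_edges` is hypothetical, and the inputs are the rounded MIN and MAX values printed in the listing, so the computed edges can differ from the printed ones in the last decimal.

```python
def equal_width_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return n_bins + 1 evenly spaced bin edges from lo to hi (hypothetical helper)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed above (already rounded to three decimals).
edges = equal_width_edges(-5.104, 4.693, 4)

# Shares printed on the CONTAINS lines, lowest interval first.
shares = [0.072, 34.844, 64.440, 0.644]
total_share = sum(shares)

for lower, upper, share in zip(edges[:-1], edges[1:], shares):
    print(f"INTERVAL {lower:.3f} - {upper:.3f}  CONTAINS {share:.3f}%")
```

Under this reading, almost all sampled tokens (about 99.3%) fall in the two middle bins, and the outer bins hold only the extreme activations shown in the TOP and MIN lists.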
Token
Token ID220
Feature activation+1.24e+00

Pos logit contributions
aj+0.049
 Deal+0.043
oll+0.043
obb+0.042
arl+0.041
Neg logit contributions
 Frankie-0.028
 Abby-0.028
 Susan-0.027
 Martha-0.026
 Mill-0.025
Token
Token ID198
Feature activation-9.66e-02

Pos logit contributions
 it+0.078
 Frankie+0.063
 Abby+0.060
 she+0.057
 Susan+0.055
Neg logit contributions
aj-0.172
irms-0.133
ccoli-0.130
udd-0.129
ses-0.129
Token
Token ID198
Feature activation-1.88e-01

Pos logit contributions
aj+0.013
irms+0.010
ccoli+0.010
udd+0.010
ibr+0.010
Neg logit contributions
 it-0.006
 Frankie-0.005
 Abby-0.005
 she-0.005
 Susan-0.004
TokenThe
Token ID464
Feature activation+1.14e+00

Pos logit contributions
aj+0.025
irms+0.018
ibr+0.018
ccoli+0.018
udd+0.018
Neg logit contributions
 it-0.013
 Frankie-0.011
 Abby-0.010
 she-0.010
 happen-0.010
Token town
Token ID3240
Feature activation+8.92e-01

Pos logit contributions
 it+0.072
 Frankie+0.057
 Abby+0.055
 she+0.054
 the+0.050
Neg logit contributions
aj-0.160
irms-0.126
ccoli-0.122
udd-0.121
ses-0.121
Tokenspe
Token ID4125
Feature activation+4.69e+00

Pos logit contributions
 it+0.057
 Frankie+0.051
 Abby+0.048
 Susan+0.043
 she+0.042
Neg logit contributions
aj-0.115
udd-0.088
irms-0.085
ibr-0.085
ses-0.085
Tokenople
Token ID643
Feature activation+6.93e-02

Pos logit contributions
 it+0.503
 Frankie+0.447
 Abby+0.436
 she+0.431
 Susan+0.409
Neg logit contributions
aj-0.441
irms-0.301
udd-0.290
ses-0.286
ibr-0.280
Token had
Token ID550
Feature activation-3.37e-01

Pos logit contributions
 it+0.004
 Frankie+0.004
 Abby+0.003
 she+0.003
 Susan+0.003
Neg logit contributions
aj-0.009
irms-0.007
udd-0.007
ibr-0.007
ccoli-0.007
Token to
Token ID284
Feature activation+5.77e-02

Pos logit contributions
aj+0.043
udd+0.032
irms+0.032
 cares+0.032
aked+0.032
Neg logit contributions
 it-0.021
 Frankie-0.019
 Abby-0.018
 she-0.017
 Mill-0.016
Token find
Token ID1064
Feature activation-8.00e-01

Pos logit contributions
 it+0.004
 Frankie+0.004
 Abby+0.003
 she+0.003
 Lib+0.003
Neg logit contributions
aj-0.006
irms-0.005
 imag-0.005
 cares-0.005
ibr-0.005
Token a
Token ID257
Feature activation+1.87e-02

Pos logit contributions
aj+0.116
irms+0.094
ibr+0.091
ses+0.091
udd+0.090
Neg logit contributions
 it-0.037
 Frankie-0.030
 Abby-0.029
 she-0.026
 Susan-0.024
Token!
Token ID0
Feature activation-4.13e-01

Pos logit contributions
aj+0.035
irms+0.026
udd+0.026
arl+0.026
ccoli+0.025
Neg logit contributions
 it-0.016
 Frankie-0.014
 Abby-0.013
 Susan-0.013
 Frank-0.012
Token
Token ID198
Feature activation-2.55e-01

Pos logit contributions
aj+0.060
irms+0.047
ccoli+0.046
ses+0.046
udd+0.046
Neg logit contributions
 it-0.025
 Frankie-0.019
 Abby-0.019
 she-0.018
 Susan-0.017
Token
Token ID198
Feature activation-2.33e-01

Pos logit contributions
aj+0.035
irms+0.027
ccoli+0.026
udd+0.026
arl+0.026
Neg logit contributions
 it-0.016
 Frankie-0.014
 Abby-0.013
 she-0.012
 Susan-0.012
TokenThe
Token ID464
Feature activation+1.06e+00

Pos logit contributions
aj+0.032
irms+0.025
ccoli+0.024
udd+0.024
ibr+0.024
Neg logit contributions
 it-0.015
 Frankie-0.012
 Abby-0.012
 she-0.011
 Susan-0.011
Token town
Token ID3240
Feature activation+7.82e-01

Pos logit contributions
 it+0.070
 Frankie+0.055
 Abby+0.053
 she+0.052
 Susan+0.049
Neg logit contributions
aj-0.146
irms-0.113
ccoli-0.111
ses-0.110
udd-0.110
Tokenspe
Token ID4125
Feature activation+4.22e+00

Pos logit contributions
 it+0.047
 Frankie+0.046
 Abby+0.043
 Frank+0.039
 Susan+0.039
Neg logit contributions
aj-0.099
udd-0.077
irms-0.074
aked-0.074
ses-0.073
Tokenople
Token ID643
Feature activation+1.80e-02

Pos logit contributions
 it+0.458
 Frankie+0.407
 Abby+0.398
 she+0.392
 Susan+0.378
Neg logit contributions
aj-0.396
irms-0.271
ses-0.256
udd-0.256
ccoli-0.253
Token were
Token ID547
Feature activation+1.35e+00

Pos logit contributions
 it+0.001
 Frankie+0.001
 Abby+0.001
 Susan+0.001
 Frank+0.001
Neg logit contributions
aj-0.002
udd-0.002
irms-0.002
ibr-0.002
aked-0.002
Token sad
Token ID6507
Feature activation-1.70e-01

Pos logit contributions
 it+0.101
 Frankie+0.092
 Abby+0.088
 she+0.085
 Susan+0.080
Neg logit contributions
aj-0.154
irms-0.116
ses-0.112
udd-0.111
aked-0.108
Token and
Token ID290
Feature activation-3.19e-01

Pos logit contributions
aj+0.025
irms+0.020
ccoli+0.019
ses+0.019
udd+0.019
Neg logit contributions
 it-0.009
 Frankie-0.007
 Abby-0.007
 she-0.007
 Susan-0.006
Token scared
Token ID12008
Feature activation-3.93e-01

Pos logit contributions
aj+0.036
irms+0.027
ses+0.027
 cares+0.026
ccoli+0.026
Neg logit contributions
 it-0.024
 Frankie-0.022
 Abby-0.021
 Lib-0.020
 Susan-0.019
Token she
Token ID673
Feature activation+2.07e-01

Pos logit contributions
 it+0.061
 Frankie+0.048
 Abby+0.047
 Susan+0.038
 Lib+0.036
Neg logit contributions
aj-0.265
irms-0.214
ccoli-0.213
 cares-0.209
ibr-0.209
Token bowed
Token ID35737
Feature activation+8.89e-01

Pos logit contributions
 it+0.015
 she+0.011
 the+0.011
 her+0.010
 Frankie+0.010
Neg logit contributions
aj-0.029
arl-0.024
irms-0.023
 Deal-0.022
ses-0.022
Token,
Token ID11
Feature activation-3.72e-02

Pos logit contributions
 it+0.047
 Frankie+0.039
 Abby+0.037
 Susan+0.033
 she+0.032
Neg logit contributions
aj-0.129
irms-0.101
ccoli-0.100
ses-0.098
udd-0.098
Token the
Token ID262
Feature activation-2.81e-01

Pos logit contributions
aj+0.006
irms+0.005
ibr+0.005
ems+0.005
ccoli+0.005
Neg logit contributions
 it-0.001
 Frankie-0.001
 Abby-0.001
 Frank-0.000
 Susan-0.000
Token town
Token ID3240
Feature activation+7.15e-01

Pos logit contributions
aj+0.040
irms+0.031
ccoli+0.030
udd+0.030
arl+0.030
Neg logit contributions
 it-0.018
 Frankie-0.014
 she-0.013
 Abby-0.013
 the-0.013
Tokenspe
Token ID4125
Feature activation+4.18e+00

Pos logit contributions
 it+0.050
 Frankie+0.044
 Abby+0.042
 Susan+0.038
 Frank+0.038
Neg logit contributions
aj-0.091
irms-0.068
udd-0.068
ccoli-0.068
ses-0.067
Tokenople
Token ID643
Feature activation-4.73e-02

Pos logit contributions
 it+0.441
 Frankie+0.387
 Abby+0.380
 she+0.373
 Susan+0.360
Neg logit contributions
aj-0.408
irms-0.281
ses-0.268
ccoli-0.267
udd-0.265
Token stopped
Token ID5025
Feature activation-1.24e-01

Pos logit contributions
aj+0.006
irms+0.005
ccoli+0.004
udd+0.004
ses+0.004
Neg logit contributions
 it-0.003
 Frankie-0.003
 Abby-0.003
 she-0.003
 Susan-0.003
Token to
Token ID284
Feature activation-3.17e-02

Pos logit contributions
aj+0.016
irms+0.013
ccoli+0.012
ses+0.012
udd+0.012
Neg logit contributions
 it-0.008
 Frankie-0.007
 Abby-0.007
 Susan-0.006
 Lib-0.006
Token admire
Token ID21099
Feature activation+6.94e-01

Pos logit contributions
aj+0.004
irms+0.003
ses+0.003
ems+0.003
 imag+0.003
Neg logit contributions
 it-0.002
 Frankie-0.002
 Abby-0.002
 Susan-0.002
 she-0.002
Token her
Token ID607
Feature activation+4.08e-02

Pos logit contributions
 it+0.020
 Frankie+0.017
 Abby+0.014
 Susan+0.011
 Frank+0.010
Neg logit contributions
aj-0.111
irms-0.090
ccoli-0.088
ibr-0.088
udd-0.087
Token,"
Token ID553
Feature activation-4.31e-01

Pos logit contributions
aj+0.296
irms+0.238
ses+0.232
ccoli+0.232
udd+0.230
Neg logit contributions
 it-0.106
 Frankie-0.076
 Abby-0.073
 she-0.073
 the-0.068
TokenBe
Token ID3856
Feature activation+1.20e+00

Pos logit contributions
aj+0.057
irms+0.044
ccoli+0.042
ses+0.042
udd+0.042
Neg logit contributions
 it-0.031
 Frankie-0.025
 Abby-0.025
 she-0.024
 Susan-0.023
Token good
Token ID922
Feature activation-1.19e+00

Pos logit contributions
 it+0.103
 her+0.082
 Frankie+0.082
 Abby+0.082
 she+0.080
Neg logit contributions
aj-0.146
irms-0.108
ccoli-0.108
ses-0.107
arl-0.107
Token,
Token ID11
Feature activation+2.82e-01

Pos logit contributions
aj+0.174
irms+0.143
ccoli+0.141
ses+0.137
esty+0.135
Neg logit contributions
 it-0.068
 she-0.055
 her-0.050
 the-0.046
 Abby-0.044
Token Holly
Token ID7764
Feature activation+1.22e+00

Pos logit contributions
 it+0.017
 Frankie+0.014
 Abby+0.013
 she+0.012
 Susan+0.012
Neg logit contributions
aj-0.040
irms-0.031
ccoli-0.030
ses-0.030
udd-0.030
Token,.
Token ID38508
Feature activation+4.08e+00

Pos logit contributions
 it+0.076
 Frankie+0.063
 Abby+0.060
 Susan+0.055
 she+0.055
Neg logit contributions
aj-0.171
ccoli-0.130
irms-0.128
udd-0.127
ses-0.127
Token I
Token ID314
Feature activation-1.85e-01

Pos logit contributions
 it+0.332
 Abby+0.286
 Frankie+0.280
 she+0.276
 Susan+0.258
Neg logit contributions
aj-0.494
irms-0.371
ccoli-0.368
ses-0.361
udd-0.351
Token'll
Token ID1183
Feature activation-4.96e-01

Pos logit contributions
aj+0.024
irms+0.021
ses+0.020
ccoli+0.019
sed+0.019
Neg logit contributions
 it-0.013
 she-0.010
 the-0.009
 Abby-0.009
't-0.009
Token be
Token ID307
Feature activation-3.08e-01

Pos logit contributions
aj+0.062
ibr+0.046
arl+0.046
ems+0.045
udd+0.045
Neg logit contributions
 it-0.033
 Frankie-0.029
 Abby-0.029
 Susan-0.026
 Frank-0.025
Token back
Token ID736
Feature activation-1.95e+00

Pos logit contributions
aj+0.040
irms+0.031
ses+0.031
ccoli+0.031
udd+0.030
Neg logit contributions
 it-0.023
 Frankie-0.018
 she-0.018
 Abby-0.018
 the-0.018
Token soon
Token ID2582
Feature activation-3.54e-01

Pos logit contributions
aj+0.277
irms+0.216
ccoli+0.212
ses+0.210
udd+0.209
Neg logit contributions
 it-0.119
 Frankie-0.094
 Abby-0.090
 she-0.086
 Susan-0.082
Token.
Token ID13
Feature activation-1.49e-01

Pos logit contributions
aj+0.140
udd+0.108
irms+0.108
ccoli+0.106
ibr+0.105
Neg logit contributions
 it-0.060
 Frankie-0.058
 Abby-0.055
 Susan-0.051
 Martha-0.049
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
aj+0.022
irms+0.018
ccoli+0.017
ses+0.017
udd+0.017
Neg logit contributions
 it-0.009
 Frankie-0.006
 Abby-0.006
 she-0.006
 Susan-0.006
Token
Token ID198
Feature activation-2.81e-01

Pos logit contributions
aj+0.039
irms+0.030
udd+0.029
ccoli+0.029
ibr+0.028
Neg logit contributions
 it-0.019
 Frankie-0.017
 Abby-0.016
 she-0.015
 Susan-0.015
TokenThe
Token ID464
Feature activation+1.11e+00

Pos logit contributions
aj+0.036
irms+0.027
udd+0.027
ibr+0.026
ccoli+0.026
Neg logit contributions
 it-0.020
 Frankie-0.017
 Abby-0.016
 she-0.016
 Susan-0.015
Token town
Token ID3240
Feature activation+7.70e-01

Pos logit contributions
 it+0.093
 Frankie+0.079
 Abby+0.077
 she+0.074
 Susan+0.072
Neg logit contributions
aj-0.134
irms-0.099
ses-0.096
ccoli-0.096
udd-0.095
Tokenspe
Token ID4125
Feature activation+4.06e+00

Pos logit contributions
 it+0.048
 Frankie+0.044
 Abby+0.042
 Susan+0.038
 Martha+0.037
Neg logit contributions
aj-0.099
udd-0.075
irms-0.074
ses-0.074
aked-0.073
Tokenople
Token ID643
Feature activation-6.76e-02

Pos logit contributions
 it+0.437
 Frankie+0.389
 Abby+0.380
 she+0.373
 Susan+0.362
Neg logit contributions
aj-0.383
irms-0.263
ses-0.249
udd-0.248
ccoli-0.247
Token decided
Token ID3066
Feature activation+6.80e-01

Pos logit contributions
aj+0.008
udd+0.006
irms+0.006
ibr+0.006
ses+0.006
Neg logit contributions
 it-0.005
 Frankie-0.004
 Abby-0.004
 Susan-0.004
 Lib-0.004
Token to
Token ID284
Feature activation+9.26e-02

Pos logit contributions
 Frankie+0.045
 Abby+0.043
 it+0.041
 Martha+0.038
 Susan+0.038
Neg logit contributions
aj-0.075
 cares-0.055
ses-0.055
ccoli-0.055
udd-0.054
Token ask
Token ID1265
Feature activation-1.57e-01

Pos logit contributions
 it+0.006
 Frankie+0.005
 Abby+0.005
 Lib+0.005
 she+0.005
Neg logit contributions
aj-0.010
 cares-0.008
ses-0.008
 imag-0.008
irms-0.008
Token their
Token ID511
Feature activation-8.71e-01

Pos logit contributions
aj+0.021
irms+0.017
ccoli+0.017
ibr+0.017
ses+0.017
Neg logit contributions
 it-0.008
 Frankie-0.007
 Abby-0.007
 she-0.006
 Susan-0.006
Token
Token ID6184
Feature activation+2.64e+00

Pos logit contributions
 it+0.019
 Frankie+0.017
 Abby+0.016
 Susan+0.015
 happen+0.014
Neg logit contributions
aj-0.038
ccoli-0.029
arl-0.028
ses-0.028
 Deal-0.027
Token
Token ID95
Feature activation+8.93e-01

Pos logit contributions
 it+0.272
 Frankie+0.231
 Abby+0.229
 she+0.224
 Frank+0.218
Neg logit contributions
aj-0.262
irms-0.187
ses-0.180
ccoli-0.172
ems-0.171
Token
Token ID26391
Feature activation-1.40e-01

Pos logit contributions
 it+0.080
 Frankie+0.068
 Susan+0.065
 John+0.064
't+0.064
Neg logit contributions
aj-0.100
irms-0.071
ses-0.071
ccoli-0.067
ems-0.067
Token
Token ID129
Feature activation-9.43e-01

Pos logit contributions
aj+0.016
irms+0.012
ses+0.011
arl+0.011
ccoli+0.011
Neg logit contributions
 it-0.013
 Frankie-0.011
 she-0.010
 Abby-0.010
 Susan-0.010
Token
Token ID241
Feature activation+7.56e-01

Pos logit contributions
aj+0.106
irms+0.077
ses+0.075
ccoli+0.074
udd+0.074
Neg logit contributions
 it-0.085
 Frankie-0.073
 Abby-0.072
 she-0.071
 Susan-0.068
Token Bra
Token ID9718
Feature activation+4.06e+00

Pos logit contributions
 it+0.052
 Frankie+0.051
 Ellie+0.047
 Susan+0.046
 happen+0.045
Neg logit contributions
aj-0.085
irms-0.062
udd-0.061
ses-0.061
ccoli-0.061
Tokenvery
Token ID548
Feature activation+1.04e+00

Pos logit contributions
 it+0.370
 Frankie+0.315
 Abby+0.308
 she+0.300
 Susan+0.290
Neg logit contributions
aj-0.458
irms-0.334
ses-0.325
ccoli-0.323
udd-0.317
Token and
Token ID290
Feature activation-2.38e-01

Pos logit contributions
 it+0.069
 Frankie+0.059
 Abby+0.057
 she+0.054
 Susan+0.053
Neg logit contributions
aj-0.136
ccoli-0.104
irms-0.101
ses-0.101
udd-0.099
Token safety
Token ID3747
Feature activation+1.19e-01

Pos logit contributions
aj+0.032
irms+0.024
ccoli+0.024
ses+0.024
arl+0.024
Neg logit contributions
 it-0.016
 Frankie-0.013
 Abby-0.013
 she-0.012
 Susan-0.012
Token go
Token ID467
Feature activation+1.04e+00

Pos logit contributions
 it+0.008
 Frankie+0.007
 Abby+0.007
 she+0.006
 Susan+0.006
Neg logit contributions
aj-0.016
irms-0.012
ccoli-0.012
ses-0.012
udd-0.012
Token together
Token ID1978
Feature activation-5.06e-01

Pos logit contributions
 it+0.069
 Frankie+0.058
 Abby+0.056
 she+0.053
 Susan+0.052
Neg logit contributions
aj-0.139
irms-0.105
udd-0.104
arl-0.103
ccoli-0.102
Token.
Token ID13
Feature activation-2.23e-01

Pos logit contributions
aj+0.061
irms+0.048
udd+0.047
arl+0.045
ccoli+0.045
Neg logit contributions
 it-0.028
 Frankie-0.026
 Abby-0.025
 Lib-0.023
 Susan-0.022
Token He
Token ID679
Feature activation-6.94e-01

Pos logit contributions
aj+0.032
irms+0.025
udd+0.024
ems+0.024
arl+0.024
Neg logit contributions
 it-0.015
 Frankie-0.012
 Abby-0.012
 she-0.011
 Frank-0.011
Token asked
Token ID1965
Feature activation+6.49e-01

Pos logit contributions
aj+0.097
irms+0.076
arl+0.075
ccoli+0.074
udd+0.073
Neg logit contributions
 it-0.047
 Frankie-0.036
 Abby-0.035
 she-0.034
 the-0.033
Token the
Token ID262
Feature activation-1.51e-01

Pos logit contributions
 it+0.028
 Frankie+0.020
 Abby+0.019
 she+0.018
 Susan+0.016
Neg logit contributions
aj-0.101
irms-0.082
ccoli-0.080
ses-0.079
ibr-0.079
Token town
Token ID3240
Feature activation+7.39e-01

Pos logit contributions
aj+0.020
irms+0.016
ccoli+0.015
ses+0.015
udd+0.015
Neg logit contributions
 it-0.010
 Frankie-0.008
 Abby-0.008
 she-0.008
 the-0.007
Tokenspe
Token ID4125
Feature activation+4.06e+00

Pos logit contributions
 it+0.047
 Frankie+0.041
 Abby+0.039
 Susan+0.035
 Samantha+0.034
Neg logit contributions
aj-0.093
ccoli-0.070
 cares-0.068
ibr-0.068
udd-0.067
Tokenople
Token ID643
Feature activation-6.84e-02

Pos logit contributions
 it+0.448
 Frankie+0.395
 Abby+0.390
 she+0.384
 Susan+0.367
Neg logit contributions
aj-0.376
irms-0.257
ses-0.241
ccoli-0.241
udd-0.239
Token what
Token ID644
Feature activation-1.16e+00

Pos logit contributions
aj+0.008
ccoli+0.006
irms+0.006
udd+0.006
ems+0.006
Neg logit contributions
 it-0.005
 Frankie-0.004
 Abby-0.004
 Susan-0.004
 Frank-0.004
Token was
Token ID373
Feature activation+1.24e-01

Pos logit contributions
aj+0.150
 cares+0.125
ccoli+0.124
ibr+0.124
ses+0.120
Neg logit contributions
 it-0.050
 Frankie-0.049
 Abby-0.049
 Lib-0.040
 ringing-0.040
Token wrong
Token ID2642
Feature activation-1.92e+00

Pos logit contributions
 it+0.010
 Abby+0.009
 Frankie+0.009
 she+0.008
 Susan+0.008
Neg logit contributions
aj-0.014
irms-0.010
ses-0.010
ccoli-0.010
ibr-0.010
Token and
Token ID290
Feature activation-3.43e-01

Pos logit contributions
aj+0.268
ccoli+0.205
irms+0.204
ses+0.201
udd+0.201
Neg logit contributions
 it-0.108
 Frankie-0.091
 Abby-0.087
 Susan-0.078
 she-0.076
Token says
Token ID1139
Feature activation-7.24e-02

Pos logit contributions
 it+0.040
 she+0.034
 the+0.033
 her+0.032
 Abby+0.030
Neg logit contributions
aj-0.048
arl-0.036
irms-0.035
ccoli-0.033
udd-0.033
Token they
Token ID484
Feature activation-1.33e-01

Pos logit contributions
aj+0.011
ccoli+0.009
irms+0.009
ibr+0.009
ses+0.008
Neg logit contributions
 it-0.003
 Frankie-0.003
 Abby-0.002
 Susan-0.002
 Frank-0.002
Token are
Token ID389
Feature activation+1.13e-01

Pos logit contributions
aj+0.016
irms+0.012
ses+0.011
arl+0.011
ccoli+0.011
Neg logit contributions
 it-0.012
 she-0.009
 the-0.009
 Frankie-0.009
 Abby-0.009
Token thoughtful
Token ID22677
Feature activation-6.69e-01

Pos logit contributions
 it+0.008
 Frankie+0.006
 Abby+0.006
 she+0.006
 the+0.006
Neg logit contributions
aj-0.016
irms-0.012
udd-0.012
ses-0.012
ccoli-0.012
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
aj+0.099
irms+0.078
ccoli+0.076
ses+0.076
udd+0.075
Neg logit contributions
 it-0.037
 Frankie-0.029
 Abby-0.028
 she-0.026
 Susan-0.025
Token Thought
Token ID27522
Feature activation+4.04e+00

Pos logit contributions
 it+0.011
 she+0.009
 Abby+0.008
 Frankie+0.008
 the+0.008
Neg logit contributions
aj-0.028
ses-0.022
ccoli-0.022
udd-0.022
irms-0.022
Tokenful
Token ID913
Feature activation-3.09e-02

Pos logit contributions
 it+0.309
 Frankie+0.274
 Abby+0.259
 Susan+0.245
 she+0.241
Neg logit contributions
aj-0.500
ccoli-0.371
ses-0.367
arl-0.362
irms-0.361
Token means
Token ID1724
Feature activation+8.56e-01

Pos logit contributions
aj+0.004
irms+0.003
ses+0.003
ccoli+0.003
ibr+0.003
Neg logit contributions
 it-0.002
 she-0.002
 the-0.002
 Abby-0.002
 her-0.002
Token they
Token ID484
Feature activation-1.34e-01

Pos logit contributions
 Frankie+0.039
 it+0.037
 Abby+0.032
 Susan+0.029
 Frank+0.029
Neg logit contributions
aj-0.116
ibr-0.093
 cares-0.093
ses-0.091
ccoli-0.090
Token think
Token ID892
Feature activation-1.89e-01

Pos logit contributions
aj+0.018
irms+0.013
ses+0.013
udd+0.013
arl+0.013
Neg logit contributions
 it-0.010
 she-0.008
 Frankie-0.008
 the-0.008
 Abby-0.008
Token a
Token ID257
Feature activation+3.02e-01

Pos logit contributions
aj+0.027
irms+0.022
ccoli+0.021
ibr+0.021
ses+0.021
Neg logit contributions
 it-0.009
 Frankie-0.008
 Abby-0.008
 Susan-0.007
 Frank-0.006
Token and
Token ID290
Feature activation-9.78e-02

Pos logit contributions
aj+0.147
irms+0.112
ccoli+0.110
ses+0.109
arl+0.109
Neg logit contributions
 it-0.075
 Frankie-0.062
 Abby-0.060
 she-0.057
 Susan-0.056
Token visit
Token ID3187
Feature activation+7.53e-01

Pos logit contributions
aj+0.013
irms+0.010
ccoli+0.010
ses+0.010
udd+0.010
Neg logit contributions
 it-0.006
 Frankie-0.005
 Abby-0.005
 she-0.005
 Susan-0.005
Token our
Token ID674
Feature activation-1.66e+00

Pos logit contributions
 it+0.032
 Frankie+0.026
 Abby+0.024
 she+0.023
 Susan+0.021
Neg logit contributions
aj-0.116
irms-0.092
ccoli-0.092
udd-0.090
ses-0.090
Token animals
Token ID4695
Feature activation+4.27e-02

Pos logit contributions
aj+0.207
irms+0.168
ccoli+0.165
oll+0.161
ems+0.160
Neg logit contributions
 it-0.127
 the-0.102
 baby-0.097
 Abby-0.090
 ringing-0.089
Token.
Token ID13
Feature activation+3.12e-01

Pos logit contributions
 it+0.003
 Frankie+0.002
 Abby+0.002
 she+0.002
 Susan+0.002
Neg logit contributions
aj-0.006
irms-0.005
ccoli-0.005
ses-0.005
udd-0.005
Token Free
Token ID3232
Feature activation+4.03e+00

Pos logit contributions
 it+0.026
 Frankie+0.021
 Abby+0.021
 she+0.020
 Susan+0.019
Neg logit contributions
aj-0.039
irms-0.029
udd-0.029
ccoli-0.028
ses-0.028
Token entry
Token ID5726
Feature activation-1.71e-01

Pos logit contributions
 it+0.364
 Frankie+0.305
 Abby+0.297
 she+0.288
 the+0.282
Neg logit contributions
aj-0.458
irms-0.339
ccoli-0.323
ses-0.322
udd-0.320
Token."
Token ID526
Feature activation-1.31e-01

Pos logit contributions
aj+0.023
irms+0.018
ccoli+0.018
ses+0.018
udd+0.018
Neg logit contributions
 it-0.011
 Frankie-0.009
 Abby-0.009
 she-0.008
 Susan-0.008
Token Mom
Token ID11254
Feature activation-1.77e-01

Pos logit contributions
aj+0.018
irms+0.015
ccoli+0.014
ses+0.014
udd+0.014
Neg logit contributions
 it-0.009
 Frankie-0.007
 she-0.007
 Abby-0.007
 the-0.006
Token and
Token ID290
Feature activation-8.85e-02

Pos logit contributions
aj+0.026
irms+0.020
ccoli+0.020
ses+0.020
udd+0.020
Neg logit contributions
 it-0.010
 Frankie-0.008
 Abby-0.008
 she-0.007
 Susan-0.007
Token dad
Token ID9955
Feature activation-4.90e-01

Pos logit contributions
aj+0.011
irms+0.008
ccoli+0.008
ses+0.008
udd+0.008
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.006
 she-0.006
 the-0.005
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
 it+0.036
 Frankie+0.033
 Abby+0.031
 Susan+0.029
 Frank+0.027
Neg logit contributions
aj-0.082
ccoli-0.064
irms-0.062
udd-0.062
ows-0.062
Token
Token ID198
Feature activation-2.98e-01

Pos logit contributions
aj+0.019
irms+0.015
ccoli+0.015
ses+0.015
udd+0.014
Neg logit contributions
 it-0.009
 Frankie-0.007
 Abby-0.007
 she-0.007
 Susan-0.006
Token
Token ID198
Feature activation-2.86e-01

Pos logit contributions
aj+0.035
ibr+0.026
udd+0.026
irms+0.026
aked+0.025
Neg logit contributions
 it-0.021
 Frankie-0.020
 Abby-0.019
 Susan-0.018
 ringing-0.018
TokenThe
Token ID464
Feature activation+1.17e+00

Pos logit contributions
aj+0.031
ibr+0.024
udd+0.023
aked+0.023
irms+0.023
Neg logit contributions
 it-0.022
 Frankie-0.021
 Abby-0.020
 Susan-0.019
 Mill-0.018
Token town
Token ID3240
Feature activation+7.61e-01

Pos logit contributions
 it+0.068
 Frankie+0.052
 Abby+0.050
 she+0.048
 Susan+0.045
Neg logit contributions
aj-0.170
irms-0.134
ccoli-0.132
ses-0.131
udd-0.130
Tokenspe
Token ID4125
Feature activation+4.02e+00

Pos logit contributions
 it+0.047
 Frankie+0.044
 Abby+0.041
 Susan+0.037
 Frank+0.037
Neg logit contributions
aj-0.097
udd-0.076
aked-0.072
ibr-0.072
 cares-0.072
Tokenople
Token ID643
Feature activation-9.63e-02

Pos logit contributions
 it+0.425
 Frankie+0.373
 Abby+0.366
 she+0.359
 Susan+0.348
Neg logit contributions
aj-0.392
irms-0.269
ccoli-0.258
ses-0.257
udd-0.254
Token soon
Token ID2582
Feature activation-3.19e-01

Pos logit contributions
aj+0.012
udd+0.009
irms+0.009
ccoli+0.009
ibr+0.009
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.006
 Susan-0.005
 she-0.005
Token discovered
Token ID5071
Feature activation+2.15e-01

Pos logit contributions
aj+0.038
udd+0.029
irms+0.028
ccoli+0.028
ibr+0.028
Neg logit contributions
 it-0.024
 Frankie-0.022
 Abby-0.020
 Susan-0.019
 she-0.019
Token where
Token ID810
Feature activation-2.17e+00

Pos logit contributions
 it+0.011
 Frankie+0.011
 Abby+0.010
't+0.009
 Susan+0.009
Neg logit contributions
aj-0.027
udd-0.022
ibr-0.022
irms-0.021
ses-0.021
Token the
Token ID262
Feature activation-2.84e-02

Pos logit contributions
aj+0.297
 cares+0.241
ibr+0.238
irms+0.237
 imag+0.236
Neg logit contributions
 Frankie-0.081
 Abby-0.067
 it-0.066
 Lib-0.061
 Susan-0.056
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
 it+0.046
 Frankie+0.040
 Abby+0.038
 Susan+0.034
 she+0.033
Neg logit contributions
aj-0.123
irms-0.096
ccoli-0.096
ses-0.094
udd-0.094
Token
Token ID198
Feature activation-2.83e-01

Pos logit contributions
aj+0.021
irms+0.016
ccoli+0.016
ses+0.016
udd+0.016
Neg logit contributions
 it-0.009
 Frankie-0.007
 Abby-0.007
 she-0.006
 the-0.006
Token
Token ID198
Feature activation-2.69e-01

Pos logit contributions
aj+0.037
irms+0.029
udd+0.028
ccoli+0.028
ibr+0.027
Neg logit contributions
 it-0.018
 Frankie-0.016
 Abby-0.015
 she-0.014
 Susan-0.014
TokenThe
Token ID464
Feature activation+1.14e+00

Pos logit contributions
aj+0.035
irms+0.028
udd+0.027
ibr+0.027
ccoli+0.027
Neg logit contributions
 it-0.018
 Frankie-0.015
 Abby-0.015
 she-0.013
 Susan-0.013
Token town
Token ID3240
Feature activation+7.67e-01

Pos logit contributions
 it+0.075
 Frankie+0.056
 she+0.055
 the+0.054
 Abby+0.052
Neg logit contributions
aj-0.161
irms-0.127
ccoli-0.123
udd-0.122
ses-0.122
Tokenspe
Token ID4125
Feature activation+4.02e+00

Pos logit contributions
 it+0.047
 Frankie+0.044
 Abby+0.042
 Susan+0.037
 Frank+0.037
Neg logit contributions
aj-0.098
udd-0.076
irms-0.074
aked-0.074
ses-0.074
Tokenople
Token ID643
Feature activation-9.65e-02

Pos logit contributions
 it+0.412
 Frankie+0.355
 Abby+0.348
 she+0.342
 the+0.333
Neg logit contributions
aj-0.409
irms-0.282
ccoli-0.275
ses-0.272
arl-0.268
Token were
Token ID547
Feature activation+1.38e+00

Pos logit contributions
aj+0.012
irms+0.010
udd+0.009
ccoli+0.009
ses+0.009
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.005
 Susan-0.005
 she-0.005
Token sad
Token ID6507
Feature activation-1.09e-01

Pos logit contributions
 it+0.109
 Frankie+0.098
 Abby+0.094
 she+0.090
 Mill+0.087
Neg logit contributions
aj-0.157
irms-0.120
ses-0.118
udd-0.113
ibr-0.112
Token because
Token ID780
Feature activation-1.44e+00

Pos logit contributions
aj+0.017
irms+0.014
ccoli+0.014
ibr+0.014
ses+0.013
Neg logit contributions
 it-0.006
 she-0.004
 the-0.004
 Frankie-0.003
 her-0.003
Token they
Token ID484
Feature activation-3.69e-01

Pos logit contributions
aj+0.146
ses+0.124
 cares+0.123
ibr+0.121
 imag+0.120
Neg logit contributions
 Frankie-0.074
 Abby-0.064
quet-0.060
Ab-0.057
 Lib-0.057
Token.
Token ID13
Feature activation+5.15e-01

Pos logit contributions
 it+0.013
 Frankie+0.011
 Abby+0.011
 Susan+0.010
 she+0.010
Neg logit contributions
aj-0.027
irms-0.021
ccoli-0.020
udd-0.020
ses-0.020
Token
Token ID220
Feature activation+1.23e+00

Pos logit contributions
 it+0.035
 Frankie+0.029
 Abby+0.028
 she+0.027
 Susan+0.025
Neg logit contributions
aj-0.069
irms-0.053
ccoli-0.052
ses-0.051
udd-0.051
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
 it+0.074
 Frankie+0.057
 Abby+0.055
 she+0.053
 Susan+0.050
Neg logit contributions
aj-0.176
irms-0.139
ccoli-0.135
ses-0.135
udd-0.134
Token
Token ID198
Feature activation-1.83e-01

Pos logit contributions
aj+0.014
irms+0.011
ccoli+0.011
udd+0.011
arl+0.010
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.005
 she-0.005
 Susan-0.005
TokenNo
Token ID2949
Feature activation+2.00e+00

Pos logit contributions
aj+0.024
irms+0.018
ibr+0.018
ccoli+0.018
arl+0.018
Neg logit contributions
 it-0.013
 Frankie-0.010
 Abby-0.010
 happen-0.009
 she-0.009
Token matter
Token ID2300
Feature activation+4.01e+00

Pos logit contributions
 it+0.135
 Frankie+0.111
 Abby+0.109
 she+0.103
 Susan+0.099
Neg logit contributions
aj-0.267
irms-0.205
ccoli-0.201
ses-0.201
udd-0.197
Token how
Token ID703
Feature activation-9.50e-01

Pos logit contributions
 it+0.265
 Abby+0.254
 Frankie+0.253
 Lib+0.226
 she+0.224
Neg logit contributions
aj-0.481
ccoli-0.352
irms-0.351
 cares-0.348
ses-0.348
Token much
Token ID881
Feature activation+4.59e-01

Pos logit contributions
aj+0.123
udd+0.093
irms+0.092
ccoli+0.091
arl+0.089
Neg logit contributions
 it-0.063
 Frankie-0.056
 Abby-0.056
 Lib-0.052
 Susan-0.051
Token encouraging
Token ID12577
Feature activation+1.81e+00

Pos logit contributions
 it+0.020
 Frankie+0.018
 Abby+0.017
 Lib+0.015
 Martha+0.015
Neg logit contributions
aj-0.063
ccoli-0.050
 cares-0.050
irms-0.050
 imag-0.049
Token his
Token ID465
Feature activation-5.38e-01

Pos logit contributions
 it+0.093
 Frankie+0.081
 Abby+0.079
 Lib+0.071
 Susan+0.070
Neg logit contributions
aj-0.248
ccoli-0.195
irms-0.194
ems-0.191
 cares-0.189
Token owner
Token ID4870
Feature activation-4.92e-01

Pos logit contributions
aj+0.065
irms+0.047
ccoli+0.047
ses+0.047
udd+0.046
Neg logit contributions
 it-0.044
 Frankie-0.038
 Abby-0.037
 she-0.035
 Susan-0.034
Token happy
Token ID3772
Feature activation-3.57e-01

Pos logit contributions
 it+0.001
 Frankie+0.001
 Abby+0.001
 happen+0.001
 she+0.001
Neg logit contributions
aj-0.002
udd-0.001
aked-0.001
ibr-0.001
ccoli-0.001
Token and
Token ID290
Feature activation-3.97e-01

Pos logit contributions
aj+0.047
udd+0.037
irms+0.036
ccoli+0.036
ses+0.036
Neg logit contributions
 it-0.021
 Frankie-0.019
 Abby-0.018
 Susan-0.017
 Martha-0.016
Token the
Token ID262
Feature activation-1.01e-01

Pos logit contributions
aj+0.050
udd+0.038
irms+0.038
ccoli+0.038
ses+0.038
Neg logit contributions
 it-0.024
 Frankie-0.022
 Abby-0.021
 Susan-0.019
 Lib-0.019
Token nation
Token ID3277
Feature activation-5.01e-01

Pos logit contributions
aj+0.011
irms+0.008
ses+0.008
ibr+0.008
 cares+0.008
Neg logit contributions
 it-0.008
 Frankie-0.008
 Abby-0.007
 she-0.007
 Susan-0.007
Token pro
Token ID386
Feature activation+2.57e+00

Pos logit contributions
aj+0.062
udd+0.048
ibr+0.048
 cares+0.048
ccoli+0.047
Neg logit contributions
 it-0.032
 Frankie-0.029
 Abby-0.028
 Susan-0.026
 Martha-0.025
Tokenspe
Token ID4125
Feature activation+3.98e+00

Pos logit contributions
 it+0.224
 Frankie+0.200
 Abby+0.194
 Susan+0.184
 she+0.184
Neg logit contributions
aj-0.288
irms-0.208
udd-0.206
ccoli-0.203
ibr-0.200
Tokenred
Token ID445
Feature activation+9.01e-02

Pos logit contributions
 it+0.435
 Frankie+0.390
 Abby+0.381
 she+0.374
 Susan+0.360
Neg logit contributions
aj-0.363
irms-0.248
ses-0.236
udd-0.236
ccoli-0.233
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
 it+0.005
 Frankie+0.005
 Abby+0.004
 Susan+0.004
 Lib+0.004
Neg logit contributions
aj-0.012
udd-0.009
ccoli-0.009
irms-0.009
ibr-0.009
Token<|endoftext|>
Token ID50256
Feature activation-6.19e-01

Pos logit contributions
aj+0.018
irms+0.014
ccoli+0.014
ses+0.013
ibr+0.013
Neg logit contributions
 it-0.010
 Frankie-0.008
 Abby-0.008
 she-0.008
 ringing-0.008
TokenOnce
Token ID7454
Feature activation+9.03e-01

Pos logit contributions
aj+0.072
irms+0.053
ccoli+0.052
ses+0.052
udd+0.051
Neg logit contributions
 it-0.053
 Frankie-0.045
 Abby-0.044
 she-0.042
 Susan-0.041
Token upon
Token ID2402
Feature activation-7.45e-01

Pos logit contributions
 it+0.091
 Frankie+0.082
 Abby+0.080
 Susan+0.076
 she+0.075
Neg logit contributions
aj-0.089
irms-0.059
udd-0.058
ccoli-0.058
ses-0.057
Token and
Token ID290
Feature activation+4.76e-01

Pos logit contributions
 it+0.009
 Frankie+0.009
 Abby+0.007
 ringing+0.007
 Susan+0.007
Neg logit contributions
aj-0.020
udd-0.015
irms-0.015
ibr-0.015
aked-0.014
Token (space)
Token ID220
Feature activation+1.13e+00

Pos logit contributions
 it+0.030
 Frankie+0.024
 Abby+0.023
 she+0.022
 Susan+0.021
Neg logit contributions
aj-0.066
irms-0.052
ccoli-0.051
ses-0.050
udd-0.050
Tokenicky
Token ID17479
Feature activation+3.29e-01

Pos logit contributions
 it+0.069
 Frankie+0.054
 Abby+0.052
 she+0.050
 Susan+0.047
Neg logit contributions
aj-0.160
irms-0.126
ccoli-0.123
ses-0.122
udd-0.121
Token.
Token ID13
Feature activation+1.65e-03

Pos logit contributions
 it+0.020
 Frankie+0.016
 Abby+0.015
 she+0.014
 Susan+0.014
Neg logit contributions
aj-0.047
irms-0.037
ccoli-0.036
ses-0.036
udd-0.035
Token No
Token ID1400
Feature activation+1.26e+00

Pos logit contributions
 it+0.000
 Frankie+0.000
 Abby+0.000
 she+0.000
 Susan+0.000
Neg logit contributions
aj-0.000
irms-0.000
ccoli-0.000
ses-0.000
udd-0.000
Token matter
Token ID2300
Feature activation+3.98e+00

Pos logit contributions
 it+0.076
 Frankie+0.064
 Abby+0.062
 she+0.056
 Susan+0.055
Neg logit contributions
aj-0.175
irms-0.138
ccoli-0.135
ses-0.133
udd-0.131
Token how
Token ID703
Feature activation-9.86e-01

Pos logit contributions
 it+0.275
 Frankie+0.259
 Abby+0.258
 Lib+0.234
 she+0.229
Neg logit contributions
aj-0.472
ccoli-0.348
irms-0.345
 cares-0.338
ses-0.338
Token much
Token ID881
Feature activation+4.38e-01

Pos logit contributions
aj+0.123
irms+0.092
udd+0.092
ccoli+0.090
arl+0.088
Neg logit contributions
 it-0.066
 Frankie-0.062
 Abby-0.061
 Lib-0.058
 Susan-0.056
Token he
Token ID339
Feature activation+2.80e-01

Pos logit contributions
 it+0.020
 Frankie+0.018
 Abby+0.017
 Lib+0.015
 Martha+0.015
Neg logit contributions
aj-0.062
irms-0.050
ccoli-0.050
ses-0.049
 cares-0.049
Token tried
Token ID3088
Feature activation-4.03e-01

Pos logit contributions
 it+0.024
 she+0.019
 Frankie+0.018
 the+0.018
 Abby+0.018
Neg logit contributions
aj-0.035
irms-0.026
arl-0.026
ses-0.025
ccoli-0.025
Token to
Token ID284
Feature activation+1.34e-02

Pos logit contributions
aj+0.053
ccoli+0.041
irms+0.040
udd+0.040
ses+0.039
Neg logit contributions
 Frankie-0.020
 it-0.020
 Abby-0.020
 Martha-0.018
 Susan-0.018
Token (newline)
Token ID198
Feature activation-2.26e-01

Pos logit contributions
aj+0.032
irms+0.024
ccoli+0.024
udd+0.024
arl+0.023
Neg logit contributions
 it-0.015
 Frankie-0.012
 Abby-0.012
 she-0.011
 Susan-0.011
TokenSo
Token ID2396
Feature activation-1.96e-01

Pos logit contributions
aj+0.029
irms+0.022
udd+0.022
ccoli+0.022
arl+0.021
Neg logit contributions
 it-0.017
 Frankie-0.014
 Abby-0.014
 she-0.013
 Susan-0.012
Token,
Token ID11
Feature activation+2.87e-01

Pos logit contributions
aj+0.028
irms+0.022
ccoli+0.022
ses+0.022
ibr+0.021
Neg logit contributions
 it-0.010
 Frankie-0.008
 Abby-0.008
 she-0.007
 Susan-0.007
Token the
Token ID262
Feature activation-1.12e-02

Pos logit contributions
 it+0.015
 Frankie+0.013
 Abby+0.013
 Mill+0.011
 happen+0.011
Neg logit contributions
aj-0.038
irms-0.030
ccoli-0.030
ibr-0.029
ems-0.029
Token town
Token ID3240
Feature activation+8.26e-01

Pos logit contributions
aj+0.001
irms+0.001
ses+0.001
ibr+0.001
arl+0.001
Neg logit contributions
 it-0.001
 Frankie-0.001
 Abby-0.001
 Susan-0.001
 Martha-0.001
Tokenspe
Token ID4125
Feature activation+3.97e+00

Pos logit contributions
 it+0.059
 Frankie+0.053
 Abby+0.052
 happen+0.046
 Martha+0.045
Neg logit contributions
aj-0.096
aked-0.072
udd-0.072
ibr-0.071
ses-0.069
Tokenople
Token ID643
Feature activation-1.00e-01

Pos logit contributions
 it+0.427
 Frankie+0.377
 Abby+0.370
 she+0.362
 Susan+0.351
Neg logit contributions
aj-0.379
irms-0.260
ses-0.246
ccoli-0.245
udd-0.244
Token looked
Token ID3114
Feature activation-1.74e+00

Pos logit contributions
aj+0.012
udd+0.009
ibr+0.009
irms+0.009
ccoli+0.009
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.006
 Susan-0.006
 she-0.006
Token for
Token ID329
Feature activation-4.72e-01

Pos logit contributions
aj+0.233
ses+0.178
irms+0.177
ccoli+0.176
arl+0.174
Neg logit contributions
 it-0.132
 Frankie-0.101
 the-0.100
 she-0.099
 Abby-0.097
Token Old
Token ID5706
Feature activation+2.21e+00

Pos logit contributions
aj+0.064
irms+0.053
ibr+0.051
ses+0.051
 cares+0.050
Neg logit contributions
 it-0.020
 Frankie-0.017
 Abby-0.017
 she-0.016
't-0.015
Token Tom
Token ID4186
Feature activation+4.08e-01

Pos logit contributions
 it+0.175
 Frankie+0.147
 Abby+0.143
 she+0.139
 Susan+0.133
Neg logit contributions
aj-0.270
irms-0.202
ccoli-0.198
ses-0.196
udd-0.195
Token had
Token ID550
Feature activation-4.66e-01

Pos logit contributions
 it+0.015
 Frankie+0.013
 Abby+0.013
 she+0.012
 Susan+0.011
Neg logit contributions
aj-0.027
ccoli-0.021
ses-0.021
irms-0.020
udd-0.020
Token to
Token ID284
Feature activation+7.04e-02

Pos logit contributions
aj+0.066
irms+0.051
ccoli+0.050
ses+0.050
udd+0.050
Neg logit contributions
 it-0.029
 Frankie-0.023
 Abby-0.022
 she-0.021
 Susan-0.020
Token help
Token ID1037
Feature activation+4.01e-01

Pos logit contributions
 it+0.005
 Abby+0.004
 Frankie+0.004
 she+0.004
 Lib+0.004
Neg logit contributions
aj-0.008
irms-0.006
arl-0.006
ses-0.006
 cares-0.006
Token the
Token ID262
Feature activation-7.61e-02

Pos logit contributions
 Abby+0.016
 Lib+0.015
 it+0.014
Ab+0.014
 Frankie+0.013
Neg logit contributions
aj-0.050
irms-0.042
ccoli-0.041
ibr-0.040
ows-0.040
Token town
Token ID3240
Feature activation+7.67e-01

Pos logit contributions
aj+0.010
irms+0.008
ses+0.007
ibr+0.007
arl+0.007
Neg logit contributions
 it-0.005
 Frankie-0.004
 Abby-0.004
 she-0.004
 Susan-0.004
Tokenspe
Token ID4125
Feature activation+3.97e+00

Pos logit contributions
 it+0.047
 Abby+0.043
 Frankie+0.042
 Samantha+0.037
 poppy+0.037
Neg logit contributions
aj-0.094
ows-0.072
udd-0.071
ibr-0.070
ccoli-0.069
Tokenople
Token ID643
Feature activation-1.08e-01

Pos logit contributions
 it+0.456
 Frankie+0.406
 Abby+0.404
 she+0.397
 Susan+0.378
Neg logit contributions
aj-0.344
irms-0.232
udd-0.216
ses-0.215
ccoli-0.210
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
aj+0.013
ows+0.010
udd+0.010
ccoli+0.010
irms+0.010
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.006
 Lib-0.006
 Susan-0.006
Token He
Token ID679
Feature activation-6.33e-01

Pos logit contributions
aj+0.014
irms+0.011
ccoli+0.011
ses+0.011
udd+0.011
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.006
 she-0.006
 Susan-0.005
Token quickly
Token ID2952
Feature activation-3.95e-01

Pos logit contributions
aj+0.087
irms+0.068
arl+0.066
ccoli+0.066
ses+0.066
Neg logit contributions
 it-0.043
 Frankie-0.034
 Abby-0.033
 she-0.032
 the-0.030
Token formulated
Token ID34391
Feature activation+6.97e-01

Pos logit contributions
aj+0.051
irms+0.039
ccoli+0.038
ses+0.037
udd+0.037
Neg logit contributions
 it-0.029
 Frankie-0.024
 Abby-0.024
 she-0.023
 Susan-0.022
Token (space)
Token ID220
Feature activation+8.67e-01

Pos logit contributions
aj+0.016
irms+0.012
arl+0.012
ccoli+0.012
udd+0.012
Neg logit contributions
 it-0.008
 Frankie-0.006
 Abby-0.006
 she-0.006
 Susan-0.006
Token (newline)
Token ID198
Feature activation-2.58e-01

Pos logit contributions
 it+0.049
 Frankie+0.037
 Abby+0.035
 she+0.034
 the+0.032
Neg logit contributions
aj-0.128
irms-0.102
ses-0.100
ccoli-0.099
udd-0.098
Token (newline)
Token ID198
Feature activation-2.50e-01

Pos logit contributions
aj+0.034
irms+0.026
udd+0.026
ccoli+0.026
arl+0.025
Neg logit contributions
 it-0.017
 Frankie-0.014
 Abby-0.014
 she-0.013
 Susan-0.013
TokenThe
Token ID464
Feature activation+1.17e+00

Pos logit contributions
aj+0.031
irms+0.023
udd+0.023
ibr+0.023
ccoli+0.023
Neg logit contributions
 it-0.018
 Frankie-0.016
 Abby-0.016
 she-0.014
 Susan-0.014
Token town
Token ID3240
Feature activation+7.95e-01

Pos logit contributions
 it+0.083
 Frankie+0.068
 Abby+0.066
 she+0.063
 Susan+0.061
Neg logit contributions
aj-0.155
irms-0.119
ccoli-0.116
ses-0.116
udd-0.114
Tokenspe
Token ID4125
Feature activation+3.97e+00

Pos logit contributions
 it+0.053
 Frankie+0.048
 Abby+0.046
 Susan+0.041
 she+0.041
Neg logit contributions
aj-0.100
udd-0.075
irms-0.074
ccoli-0.074
ses-0.074
Tokenople
Token ID643
Feature activation-1.14e-01

Pos logit contributions
 it+0.402
 Frankie+0.344
 Abby+0.337
 she+0.331
 the+0.325
Neg logit contributions
aj-0.411
irms-0.283
ccoli-0.279
ses-0.275
arl-0.273
Token thought
Token ID1807
Feature activation-4.44e-01

Pos logit contributions
aj+0.014
irms+0.011
udd+0.011
ccoli+0.010
ibr+0.010
Neg logit contributions
 it-0.008
 Frankie-0.007
 Abby-0.007
 she-0.006
 Susan-0.006
Token and
Token ID290
Feature activation-2.72e-01

Pos logit contributions
aj+0.061
ccoli+0.048
irms+0.048
udd+0.048
ibr+0.047
Neg logit contributions
 Abby-0.021
 it-0.020
 Frankie-0.020
 Lib-0.018
 Susan-0.017
Token thought
Token ID1807
Feature activation-4.42e-01

Pos logit contributions
aj+0.031
irms+0.023
ses+0.023
ccoli+0.023
udd+0.023
Neg logit contributions
 it-0.021
 Frankie-0.019
 Abby-0.018
 she-0.017
 Susan-0.017
Token until
Token ID1566
Feature activation+1.24e-01

Pos logit contributions
aj+0.057
ccoli+0.044
ibr+0.043
ows+0.043
irms+0.043
Neg logit contributions
 Abby-0.025
 Frankie-0.025
 it-0.024
 Lib-0.022
 Susan-0.021
Token bell
Token ID8966
Feature activation+5.53e-01

Pos logit contributions
aj+0.017
irms+0.013
ses+0.013
ccoli+0.013
ibr+0.012
Neg logit contributions
 it-0.008
 Frankie-0.007
 Abby-0.007
 she-0.007
 Susan-0.006
Token stopped
Token ID5025
Feature activation-6.50e-02

Pos logit contributions
 it+0.039
 Frankie+0.037
 Abby+0.036
 Susan+0.033
 Frank+0.033
Neg logit contributions
aj-0.068
udd-0.050
ccoli-0.049
ses-0.049
ems-0.049
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
aj+0.008
irms+0.006
udd+0.006
ccoli+0.006
ses+0.006
Neg logit contributions
 it-0.004
 Frankie-0.004
 Abby-0.004
 Lib-0.003
 Susan-0.003
Token The
Token ID383
Feature activation-2.65e-01

Pos logit contributions
aj+0.027
irms+0.021
ccoli+0.020
ses+0.020
udd+0.020
Neg logit contributions
 it-0.013
 Frankie-0.010
 Abby-0.010
 she-0.010
 Susan-0.009
Token town
Token ID3240
Feature activation+7.51e-01

Pos logit contributions
aj+0.036
irms+0.028
ses+0.027
ccoli+0.027
udd+0.026
Neg logit contributions
 it-0.017
 Frankie-0.014
 Abby-0.014
 she-0.013
 Susan-0.013
Tokenspe
Token ID4125
Feature activation+3.96e+00

Pos logit contributions
 it+0.048
 Frankie+0.048
 Abby+0.045
 Susan+0.041
 Frank+0.040
Neg logit contributions
aj-0.090
udd-0.069
ibr-0.066
 cares-0.066
aked-0.066
Tokenople
Token ID643
Feature activation-1.07e-01

Pos logit contributions
 it+0.408
 Frankie+0.355
 Abby+0.348
 she+0.340
 Susan+0.331
Neg logit contributions
aj-0.399
irms-0.275
ccoli-0.267
ses-0.265
udd-0.260
Token cheered
Token ID34767
Feature activation+1.93e+00

Pos logit contributions
aj+0.013
udd+0.010
ibr+0.010
ems+0.010
 cares+0.010
Neg logit contributions
 it-0.007
 Frankie-0.007
 Abby-0.006
 Susan-0.006
 Frank-0.006
Token and
Token ID290
Feature activation-3.77e-01

Pos logit contributions
 Frankie+0.127
 Abby+0.124
 it+0.123
 Lib+0.116
 Susan+0.112
Neg logit contributions
aj-0.221
ccoli-0.164
udd-0.163
ems-0.161
irms-0.160
Token hugged
Token ID41130
Feature activation-4.54e-01

Pos logit contributions
aj+0.047
irms+0.037
ses+0.037
ibr+0.036
udd+0.036
Neg logit contributions
 it-0.022
 Frankie-0.021
 Abby-0.019
 Susan-0.018
 Frank-0.017
Token each
Token ID1123
Feature activation+5.06e-02

Pos logit contributions
aj+0.061
irms+0.051
ses+0.049
ccoli+0.048
ibr+0.048
Neg logit contributions
 it-0.020
 Frankie-0.019
 Abby-0.017
 she-0.016
't-0.016
Token (space)
Token ID220
Feature activation+9.51e-01

Pos logit contributions
aj+0.011
irms+0.009
ccoli+0.008
ses+0.008
udd+0.008
Neg logit contributions
 it-0.006
 Frankie-0.005
 Abby-0.005
 she-0.004
 Susan-0.004
Token (newline)
Token ID198
Feature activation-3.11e-01

Pos logit contributions
 it+0.061
 Frankie+0.050
 Abby+0.048
 she+0.045
 Susan+0.043
Neg logit contributions
aj-0.130
irms-0.100
ccoli-0.098
udd-0.097
ses-0.097
Token (newline)
Token ID198
Feature activation-3.01e-01

Pos logit contributions
aj+0.040
irms+0.030
ibr+0.029
udd+0.029
ccoli+0.029
Neg logit contributions
 it-0.021
 Frankie-0.018
 Abby-0.017
 happen-0.017
 ringing-0.016
TokenBut
Token ID1537
Feature activation+6.69e-01

Pos logit contributions
aj+0.039
irms+0.029
ibr+0.029
ccoli+0.028
udd+0.028
Neg logit contributions
 it-0.021
 Frankie-0.018
 Abby-0.017
 happen-0.016
 she-0.016
Token no
Token ID645
Feature activation+1.07e+00

Pos logit contributions
 it+0.038
 Frankie+0.031
 Abby+0.030
 she+0.027
 Susan+0.027
Neg logit contributions
aj-0.092
ccoli-0.072
irms-0.071
ses-0.070
ems-0.068
Token matter
Token ID2300
Feature activation+3.96e+00

Pos logit contributions
 it+0.049
 Frankie+0.038
 Abby+0.036
 Susan+0.032
 she+0.031
Neg logit contributions
aj-0.162
irms-0.130
ccoli-0.128
ses-0.125
ems-0.124
Token how
Token ID703
Feature activation-9.16e-01

Pos logit contributions
 it+0.275
 Frankie+0.267
 Abby+0.267
 Lib+0.244
 poppy+0.242
Neg logit contributions
aj-0.438
ccoli-0.318
 cares-0.313
ems-0.310
irms-0.309
Token hard
Token ID1327
Feature activation-3.27e-01

Pos logit contributions
aj+0.116
irms+0.087
udd+0.086
ccoli+0.085
arl+0.084
Neg logit contributions
 it-0.062
 Frankie-0.055
 Abby-0.054
 Lib-0.051
 Susan-0.050
Token Lily
Token ID20037
Feature activation-5.20e-01

Pos logit contributions
aj+0.049
irms+0.041
ccoli+0.040
ems+0.040
 cares+0.040
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.006
Susan-0.005
Mill-0.005
Token tried
Token ID3088
Feature activation-3.32e-01

Pos logit contributions
aj+0.071
irms+0.055
arl+0.054
ses+0.053
ccoli+0.052
Neg logit contributions
 it-0.037
 she-0.028
 Frankie-0.028
 Abby-0.028
 the-0.026
Token,
Token ID11
Feature activation+1.89e-01

Pos logit contributions
aj+0.043
ccoli+0.033
irms+0.033
udd+0.033
ems+0.032
Neg logit contributions
 Frankie-0.017
 it-0.017
 Abby-0.017
 Lib-0.015
 Martha-0.015
Token other
Token ID584
Feature activation+8.08e-01

Pos logit contributions
 it+0.091
 Frankie+0.087
 Abby+0.087
 Martha+0.076
 Frank+0.076
Neg logit contributions
aj-0.161
udd-0.120
ccoli-0.118
ibr-0.118
ems-0.117
Token villagers
Token ID25263
Feature activation+2.52e-01

Pos logit contributions
 it+0.056
 Frankie+0.046
 Abby+0.045
 she+0.042
 Susan+0.041
Neg logit contributions
aj-0.106
irms-0.080
ses-0.079
ccoli-0.078
udd-0.078
Token:
Token ID25
Feature activation-4.85e-02

Pos logit contributions
 it+0.017
 Frankie+0.014
 Abby+0.014
 she+0.013
 Susan+0.013
Neg logit contributions
aj-0.033
irms-0.025
ccoli-0.025
ses-0.025
udd-0.025
Token "
Token ID366
Feature activation+3.96e-01

Pos logit contributions
aj+0.006
ccoli+0.005
irms+0.005
ses+0.005
udd+0.005
Neg logit contributions
 it-0.003
 Frankie-0.003
 Abby-0.002
 she-0.002
 Susan-0.002
TokenNo
Token ID2949
Feature activation+1.92e+00

Pos logit contributions
 it+0.030
 Frankie+0.025
 Abby+0.024
 she+0.023
 Susan+0.022
Neg logit contributions
aj-0.050
irms-0.038
ccoli-0.037
ses-0.036
udd-0.036
Token matter
Token ID2300
Feature activation+3.95e+00

Pos logit contributions
 it+0.110
 Frankie+0.102
 Abby+0.099
 Lib+0.088
Mill+0.086
Neg logit contributions
aj-0.248
ses-0.185
ccoli-0.183
 cares-0.183
irms-0.181
Token what
Token ID644
Feature activation-1.33e+00

Pos logit contributions
 it+0.270
 Frankie+0.241
 Abby+0.238
 she+0.223
 Lib+0.209
Neg logit contributions
aj-0.496
ccoli-0.372
irms-0.371
ses-0.366
udd-0.362
Token,
Token ID11
Feature activation+1.53e-01

Pos logit contributions
aj+0.174
ccoli+0.134
irms+0.132
ems+0.129
udd+0.129
Neg logit contributions
 it-0.079
 Frankie-0.073
 Abby-0.071
 Susan-0.064
 Lib-0.063
Token together
Token ID1978
Feature activation-4.74e-01

Pos logit contributions
 it+0.009
 Frankie+0.008
 Abby+0.007
 Lib+0.007
 Susan+0.007
Neg logit contributions
aj-0.020
ccoli-0.015
irms-0.015
ems-0.015
ibr-0.015
Token we
Token ID356
Feature activation-1.15e+00

Pos logit contributions
aj+0.060
ccoli+0.047
 cares+0.046
ems+0.046
irms+0.045
Neg logit contributions
 it-0.029
 Frankie-0.027
 Abby-0.025
 Lib-0.023
 Susan-0.023
Token can
Token ID460
Feature activation-1.24e+00

Pos logit contributions
aj+0.155
irms+0.118
ses+0.114
ccoli+0.113
udd+0.113
Neg logit contributions
 it-0.084
 Frankie-0.067
 Abby-0.064
 she-0.063
 the-0.062
Token He
Token ID679
Feature activation-6.81e-01

Pos logit contributions
aj+0.003
irms+0.002
ccoli+0.002
ses+0.002
udd+0.002
Neg logit contributions
 it-0.002
 Frankie-0.001
 Abby-0.001
 she-0.001
 Susan-0.001
Token wrote
Token ID2630
Feature activation-2.01e-01

Pos logit contributions
aj+0.095
arl+0.075
irms+0.075
ses+0.073
ccoli+0.072
Neg logit contributions
 it-0.047
 she-0.035
 the-0.034
 Frankie-0.033
 her-0.032
Token z
Token ID1976
Feature activation+8.40e-02

Pos logit contributions
aj+0.028
irms+0.022
ccoli+0.021
udd+0.021
ses+0.021
Neg logit contributions
 it-0.013
 Frankie-0.010
 Abby-0.010
 she-0.009
 Susan-0.009
Tokenig
Token ID328
Feature activation-2.96e+00

Pos logit contributions
 it+0.008
 the+0.006
 her+0.006
 she+0.006
't+0.006
Neg logit contributions
aj-0.008
ccoli-0.007
irms-0.006
ses-0.006
 Deal-0.006
Tokenz
Token ID89
Feature activation+4.54e-01

Pos logit contributions
aj+0.350
ccoli+0.248
udd+0.248
ibr+0.242
irms+0.241
Neg logit contributions
 it-0.244
 Frankie-0.222
 Abby-0.219
 Susan-0.203
 she-0.196
Tokenags
Token ID3775
Feature activation-5.10e+00

Pos logit contributions
 it+0.057
 her+0.049
 the+0.049
 she+0.048
 Abby+0.047
Neg logit contributions
aj-0.035
ccoli-0.025
ses-0.023
irms-0.022
ibr-0.019
Token on
Token ID319
Feature activation+1.06e+00

Pos logit contributions
aj+0.698
ses+0.527
ccoli+0.526
irms+0.522
udd+0.521
Neg logit contributions
 Frankie-0.253
 Abby-0.248
 it-0.239
 Susan-0.203
 Samantha-0.200
Token walls
Token ID7714
Feature activation-9.06e-02

Pos logit contributions
 Frankie+0.040
 it+0.037
 Abby+0.037
 Susan+0.033
 she+0.030
Neg logit contributions
aj-0.155
irms-0.128
ibr-0.124
udd-0.122
arl-0.121
Token,
Token ID11
Feature activation+2.18e-01

Pos logit contributions
aj+0.012
udd+0.009
irms+0.008
ows+0.008
ets+0.008
Neg logit contributions
 it-0.006
 Frankie-0.006
 Abby-0.005
 Susan-0.005
 Mill-0.005
Token floors
Token ID18570
Feature activation+3.57e-01

Pos logit contributions
 it+0.013
 Frankie+0.011
 Abby+0.010
 she+0.009
 Susan+0.009
Neg logit contributions
aj-0.031
irms-0.024
ccoli-0.024
udd-0.023
ses-0.023
Token,
Token ID11
Feature activation+2.15e-01

Pos logit contributions
 it+0.021
 Frankie+0.019
 Abby+0.018
 Susan+0.017
 Frank+0.016
Neg logit contributions
aj-0.050
udd-0.038
irms-0.038
ses-0.037
ibr-0.037
Token enjoyed
Token ID8359
Feature activation+1.24e+00

Pos logit contributions
aj+0.087
irms+0.067
arl+0.067
ses+0.065
ccoli+0.064
Neg logit contributions
 it-0.051
 she-0.040
 Frankie-0.039
 the-0.038
 Abby-0.038
Token writing
Token ID3597
Feature activation-1.82e-01

Pos logit contributions
 it+0.086
 Abby+0.084
 Frankie+0.084
 Susan+0.077
 Lib+0.074
Neg logit contributions
aj-0.147
irms-0.108
ses-0.104
ccoli-0.103
ibr-0.103
Token z
Token ID1976
Feature activation+9.53e-02

Pos logit contributions
aj+0.026
irms+0.020
ses+0.020
ccoli+0.020
udd+0.019
Neg logit contributions
 it-0.012
 Frankie-0.009
 Abby-0.009
 she-0.008
 the-0.008
Tokenig
Token ID328
Feature activation-2.95e+00

Pos logit contributions
 it+0.010
 she+0.008
 the+0.008
 her+0.008
't+0.008
Neg logit contributions
aj-0.008
ccoli-0.007
ses-0.007
irms-0.006
arl-0.006
Tokenz
Token ID89
Feature activation+4.69e-01

Pos logit contributions
aj+0.352
udd+0.259
ibr+0.247
ccoli+0.246
ems+0.242
Neg logit contributions
 it-0.237
 Abby-0.224
 Frankie-0.221
 Susan-0.203
 Samantha-0.198
Tokenags
Token ID3775
Feature activation-5.10e+00

Pos logit contributions
 it+0.056
 Frankie+0.048
 Abby+0.047
 she+0.047
 her+0.047
Neg logit contributions
aj-0.040
ccoli-0.026
ses-0.026
irms-0.026
udd-0.024
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
aj+0.634
udd+0.469
irms+0.466
ccoli+0.464
ems+0.463
Neg logit contributions
 Abby-0.306
 Frankie-0.303
 it-0.282
 Susan-0.260
 Samantha-0.257
Token He
Token ID679
Feature activation-6.81e-01

Pos logit contributions
aj+0.003
irms+0.002
ccoli+0.002
ses+0.002
udd+0.002
Neg logit contributions
 it-0.002
 Frankie-0.001
 Abby-0.001
 she-0.001
 Susan-0.001
Token wrote
Token ID2630
Feature activation-2.01e-01

Pos logit contributions
aj+0.095
arl+0.075
irms+0.075
ses+0.073
ccoli+0.072
Neg logit contributions
 it-0.047
 she-0.035
 the-0.034
 Frankie-0.033
 her-0.032
Token z
Token ID1976
Feature activation+8.40e-02

Pos logit contributions
aj+0.028
irms+0.022
ccoli+0.021
udd+0.021
ses+0.021
Neg logit contributions
 it-0.013
 Frankie-0.010
 Abby-0.010
 she-0.009
 Susan-0.009
Tokenig
Token ID328
Feature activation-2.96e+00

Pos logit contributions
 it+0.008
 the+0.006
 her+0.006
 she+0.006
't+0.006
Neg logit contributions
aj-0.008
ccoli-0.007
irms-0.006
ses-0.006
 Deal-0.006
Token lines
Token ID3951
Feature activation+4.55e-01

Pos logit contributions
 it+0.076
 the+0.056
 her+0.051
 she+0.051
 Frankie+0.051
Neg logit contributions
aj-0.156
ccoli-0.124
ses-0.123
irms-0.120
udd-0.119
Token to
Token ID284
Feature activation+1.20e-01

Pos logit contributions
 it+0.027
 Frankie+0.026
 Abby+0.025
 Susan+0.023
 Martha+0.022
Neg logit contributions
aj-0.060
udd-0.046
ibr-0.045
ows-0.045
irms-0.045
Token z
Token ID1976
Feature activation+1.09e-01

Pos logit contributions
 it+0.007
 Frankie+0.007
 Abby+0.006
 Frank+0.006
 Susan+0.006
Neg logit contributions
aj-0.015
irms-0.012
ccoli-0.012
ibr-0.012
ems-0.012
Tokenig
Token ID328
Feature activation-2.94e+00

Pos logit contributions
 it+0.012
 the+0.010
 her+0.010
 she+0.010
't+0.010
Neg logit contributions
aj-0.010
ses-0.007
ccoli-0.007
irms-0.006
sed-0.006
Tokenz
Token ID89
Feature activation+5.06e-01

Pos logit contributions
aj+0.331
udd+0.236
ibr+0.233
ccoli+0.230
ems+0.227
Neg logit contributions
 it-0.246
 Abby-0.234
 Frankie-0.233
 Susan-0.213
 Frank-0.209
Tokenags
Token ID3775
Feature activation-5.09e+00

Pos logit contributions
 it+0.068
 her+0.058
 the+0.058
 Frankie+0.058
 she+0.058
Neg logit contributions
aj-0.036
ccoli-0.022
ses-0.022
irms-0.021
udd-0.018
Token.
Token ID13
Feature activation-2.82e-02

Pos logit contributions
aj+0.639
ccoli+0.488
udd+0.485
ems+0.483
ows+0.481
Neg logit contributions
 Abby-0.291
 Frankie-0.290
 it-0.274
 Samantha-0.244
 Martha-0.243
Token (newline)
Token ID198
Feature activation-2.00e-01

Pos logit contributions
aj+0.004
irms+0.003
ses+0.003
ccoli+0.003
arl+0.003
Neg logit contributions
 it-0.002
 Frankie-0.001
 Abby-0.001
 she-0.001
 the-0.001
Token (newline)
Token ID198
Feature activation-1.91e-01

Pos logit contributions
aj+0.026
irms+0.019
udd+0.019
ccoli+0.019
ibr+0.019
Neg logit contributions
 it-0.013
 Frankie-0.012
 Abby-0.012
 she-0.011
 Susan-0.011
TokenOne
Token ID3198
Feature activation-1.31e-01

Pos logit contributions
aj+0.023
irms+0.017
ccoli+0.017
udd+0.017
ibr+0.017
Neg logit contributions
 it-0.015
 Frankie-0.013
 Abby-0.013
 Susan-0.012
 she-0.012
Token day
Token ID1110
Feature activation+1.19e-01

Pos logit contributions
aj+0.019
irms+0.015
ccoli+0.015
udd+0.015
ses+0.015
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.006
 she-0.005
 Susan-0.005
Token named
Token ID3706
Feature activation-4.69e-01

Pos logit contributions
 it+0.022
 Frankie+0.019
 Abby+0.019
 she+0.018
 Susan+0.018
Neg logit contributions
aj-0.024
irms-0.017
ccoli-0.016
ses-0.016
udd-0.016
Token Lucy
Token ID22162
Feature activation+1.57e+00

Pos logit contributions
aj+0.059
irms+0.044
ccoli+0.043
ses+0.043
udd+0.043
Neg logit contributions
 it-0.037
 Frankie-0.031
 Abby-0.030
 she-0.029
 Susan-0.028
Token who
Token ID508
Feature activation-1.25e+00

Pos logit contributions
 it+0.100
 Frankie+0.092
 Abby+0.089
 Frank+0.082
 Susan+0.081
Neg logit contributions
aj-0.200
udd-0.152
ccoli-0.151
irms-0.150
ses-0.148
Token loved
Token ID6151
Feature activation+1.06e-01

Pos logit contributions
aj+0.151
irms+0.112
ccoli+0.110
udd+0.109
ses+0.109
Neg logit contributions
 it-0.102
 Frankie-0.087
 Abby-0.085
 she-0.080
 Susan-0.079
Token r
Token ID374
Feature activation-9.23e-01

Pos logit contributions
 it+0.007
 Frankie+0.007
 Abby+0.006
 Susan+0.006
 Frank+0.006
Neg logit contributions
aj-0.013
irms-0.010
 imag-0.009
udd-0.009
ccoli-0.009
Tokenags
Token ID3775
Feature activation-5.04e+00

Pos logit contributions
aj+0.087
irms+0.059
ccoli+0.059
ses+0.058
ems+0.054
Neg logit contributions
 it-0.099
 Abby-0.087
 Frankie-0.087
 she-0.085
 Susan-0.082
Token.
Token ID13
Feature activation-2.56e-02

Pos logit contributions
aj+0.655
udd+0.523
irms+0.490
ags+0.485
ems+0.482
Neg logit contributions
 Frankie-0.238
 Frank-0.227
Frank-0.208
 Abby-0.206
't-0.194
Token She
Token ID1375
Feature activation+1.73e-01

Pos logit contributions
aj+0.003
irms+0.002
arl+0.002
udd+0.002
ccoli+0.002
Neg logit contributions
 it-0.002
 Frankie-0.002
 ringing-0.002
 Frank-0.001
 Abby-0.001
Token would
Token ID561
Feature activation-8.37e-02

Pos logit contributions
 it+0.012
 Frankie+0.010
 Abby+0.010
 she+0.010
 the+0.009
Neg logit contributions
aj-0.023
irms-0.018
ses-0.017
ccoli-0.017
arl-0.017
Token always
Token ID1464
Feature activation-1.47e-01

Pos logit contributions
aj+0.010
ses+0.008
arl+0.008
udd+0.008
ccoli+0.008
Neg logit contributions
 it-0.006
 Frankie-0.005
 Susan-0.005
 Frank-0.005
 Abby-0.005
Token let
Token ID1309
Feature activation-3.65e-02

Pos logit contributions
aj+0.019
irms+0.015
ccoli+0.015
ses+0.014
udd+0.014
Neg logit contributions
 it-0.010
 Frankie-0.008
 Abby-0.008
 she-0.008
 Susan-0.008
Token She
Token ID1375
Feature activation+1.94e-01

Pos logit contributions
 it+0.004
 she+0.003
 Frankie+0.003
 her+0.003
 the+0.003
Neg logit contributions
aj-0.011
ccoli-0.009
irms-0.009
ses-0.009
udd-0.009
Token loved
Token ID6151
Feature activation+1.44e-01

Pos logit contributions
 it+0.015
 Frankie+0.013
 Abby+0.013
 she+0.012
 Susan+0.012
Neg logit contributions
aj-0.024
irms-0.018
ccoli-0.017
ses-0.017
udd-0.017
Token to
Token ID284
Feature activation+9.76e-02

Pos logit contributions
 it+0.009
 Frankie+0.009
 Abby+0.009
 Susan+0.008
 Frank+0.008
Neg logit contributions
aj-0.018
irms-0.013
udd-0.013
 imag-0.013
ccoli-0.013
Token bring
Token ID2222
Feature activation+8.86e-01

Pos logit contributions
 it+0.008
 Frankie+0.007
 Abby+0.007
 Lib+0.007
 Frank+0.007
Neg logit contributions
aj-0.010
ccoli-0.007
 imag-0.007
irms-0.007
ems-0.007
Token r
Token ID374
Feature activation-9.33e-01

Pos logit contributions
 it+0.030
 Frankie+0.023
 Abby+0.020
 Susan+0.017
 she+0.017
Neg logit contributions
aj-0.145
irms-0.118
udd-0.114
ccoli-0.113
ibr-0.112
Tokenags
Token ID3775
Feature activation-5.04e+00

Pos logit contributions
aj+0.080
ccoli+0.061
ses+0.060
irms+0.056
sed+0.055
Neg logit contributions
 it-0.098
 Abby-0.088
 her-0.087
 she-0.085
 Frankie-0.085
Token from
Token ID422
Feature activation-8.55e-01

Pos logit contributions
aj+0.720
udd+0.559
aked+0.554
ows+0.528
ags+0.526
Neg logit contributions
 Frankie-0.213
 happen-0.190
 Abby-0.176
 Frank-0.170
 it-0.161
Token the
Token ID262
Feature activation-7.92e-02

Pos logit contributions
aj+0.110
irms+0.096
ses+0.090
 imag+0.090
oken+0.089
Neg logit contributions
 Frankie-0.032
Frank-0.029
John-0.028
 Frank-0.027
Susan-0.027
Token kitchen
Token ID9592
Feature activation-7.38e-01

Pos logit contributions
aj+0.010
irms+0.007
ses+0.007
udd+0.007
ccoli+0.007
Neg logit contributions
 it-0.006
 Frankie-0.005
 Abby-0.005
 she-0.005
 Susan-0.005
Token around
Token ID1088
Feature activation+4.59e-01

Pos logit contributions
aj+0.105
irms+0.081
udd+0.080
ccoli+0.080
ses+0.079
Neg logit contributions
 it-0.045
 Frankie-0.036
 Abby-0.034
 she-0.032
 Susan-0.032
Token the
Token ID262
Feature activation-7.34e-02

Pos logit contributions
 Frankie+0.021
 Abby+0.018
 Frank+0.017
 Lib+0.017
 Susan+0.017
Neg logit contributions
aj-0.061
irms-0.050
ibr-0.049
 imag-0.047
udd-0.047
Token But
Token ID887
Feature activation-6.94e-01

Pos logit contributions
aj+0.000
irms+0.000
ses+0.000
arl+0.000
ems+0.000
Neg logit contributions
 it-0.000
 Frankie-0.000
 she-0.000
 the-0.000
 John-0.000
Token the
Token ID262
Feature activation-5.26e-02

Pos logit contributions
aj+0.105
irms+0.083
arl+0.081
udd+0.081
ses+0.081
Neg logit contributions
 it-0.042
 Frankie-0.030
 she-0.029
 Abby-0.028
 the-0.027
Token z
Token ID1976
Feature activation+1.43e-01

Pos logit contributions
aj+0.008
irms+0.006
ccoli+0.006
ses+0.006
udd+0.006
Neg logit contributions
 it-0.003
 Frankie-0.002
 Abby-0.002
 she-0.002
 the-0.002
Tokenig
Token ID328
Feature activation-2.90e+00

Pos logit contributions
 it+0.013
 of+0.011
't+0.011
 she+0.010
 one+0.010
Neg logit contributions
aj-0.013
ccoli-0.011
irms-0.010
ems-0.009
 Deal-0.009
Tokenz
Token ID89
Feature activation+5.14e-01

Pos logit contributions
aj+0.409
irms+0.313
ccoli+0.311
ses+0.309
udd+0.309
Neg logit contributions
 it-0.176
 Frankie-0.143
 Abby-0.139
 she-0.126
 Susan-0.124
Tokenags
Token ID3775
Feature activation-5.02e+00

Pos logit contributions
 it+0.050
 the+0.041
 her+0.041
 she+0.039
 Frankie+0.039
Neg logit contributions
aj-0.057
ccoli-0.043
ses-0.039
irms-0.039
ibr-0.037
Token weren
Token ID6304
Feature activation+2.62e+00

Pos logit contributions
aj+0.640
ses+0.485
irms+0.475
ccoli+0.474
udd+0.474
Neg logit contributions
 it-0.334
 Frankie-0.303
 Abby-0.295
 Susan-0.264
 she-0.262
Token't
Token ID470
Feature activation+2.49e+00

Pos logit contributions
 Abby+0.144
 Frankie+0.125
 Susan+0.119
 Mill+0.119
 Martha+0.118
Neg logit contributions
aj-0.317
arl-0.283
 Deal-0.271
oll-0.265
obb-0.261
Token as
Token ID355
Feature activation+4.20e-01

Pos logit contributions
 Abby+0.142
 it+0.141
 Frankie+0.138
 Amelia+0.132
 Lib+0.129
Neg logit contributions
aj-0.286
udd-0.216
arl-0.209
ccoli-0.209
irms-0.208
Token easy
Token ID2562
Feature activation+3.25e-01

Pos logit contributions
 it+0.026
 Frankie+0.022
 Abby+0.022
 she+0.020
 Susan+0.019
Neg logit contributions
aj-0.057
ccoli-0.043
udd-0.042
irms-0.042
ibr-0.042
Token as
Token ID355
Feature activation+4.24e-01

Pos logit contributions
 it+0.018
 Frankie+0.015
 Abby+0.015
 Susan+0.014
 she+0.013
Neg logit contributions
aj-0.046
irms-0.036
ses-0.035
ccoli-0.035
udd-0.035
Token it
Token ID340
Feature activation+1.05e+00

Pos logit contributions
quet+0.014
 happen+0.014
reating+0.014
 ringing+0.013
 it+0.013
Neg logit contributions
aj-0.030
ses-0.025
irms-0.025
ccoli-0.025
sed-0.024
Token was
Token ID373
Feature activation+1.39e-01

Pos logit contributions
 it+0.060
 Frankie+0.043
 she+0.041
 Abby+0.040
 the+0.039
Neg logit contributions
aj-0.158
irms-0.127
arl-0.124
ses-0.123
ccoli-0.122
Token difficult
Token ID2408
Feature activation-1.87e-01

Pos logit contributions
 it+0.009
 Frankie+0.008
 Abby+0.007
 she+0.007
 Susan+0.007
Neg logit contributions
aj-0.019
irms-0.014
ccoli-0.014
ses-0.014
udd-0.014
Token finding
Token ID4917
Feature activation-1.10e+00

Pos logit contributions
aj+0.025
irms+0.019
udd+0.019
ccoli+0.018
ses+0.018
Neg logit contributions
 it-0.012
 Frankie-0.011
 Abby-0.011
 Susan-0.010
 she-0.009
Token r
Token ID374
Feature activation-9.11e-01

Pos logit contributions
aj+0.157
irms+0.124
udd+0.123
ccoli+0.121
ses+0.118
Neg logit contributions
 it-0.056
 Frankie-0.048
 Abby-0.047
 she-0.044
 Susan-0.042
Tokenags
Token ID3775
Feature activation-5.01e+00

Pos logit contributions
aj+0.084
ccoli+0.063
irms+0.060
ses+0.059
ems+0.056
Neg logit contributions
 it-0.089
 Frankie-0.078
 Susan-0.076
 Abby-0.076
 the-0.075
Token and
Token ID290
Feature activation-2.44e-01

Pos logit contributions
aj+0.659
udd+0.503
ccoli+0.500
ows+0.489
irms+0.486
Neg logit contributions
 it-0.264
 Frankie-0.263
 Abby-0.248
 Susan-0.220
't-0.219
Token water
Token ID1660
Feature activation-6.16e-02

Pos logit contributions
aj+0.035
irms+0.028
ses+0.028
ccoli+0.028
udd+0.027
Neg logit contributions
 it-0.010
 Frankie-0.009
 Abby-0.008
 Susan-0.007
 Lib-0.007
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
aj+0.008
udd+0.006
irms+0.006
ses+0.006
ccoli+0.006
Neg logit contributions
 it-0.004
 Frankie-0.003
 Abby-0.003
 Susan-0.003
 Frank-0.003
Token But
Token ID887
Feature activation-6.84e-01

Pos logit contributions
 it+0.001
 she+0.001
 Frankie+0.001
 happen+0.001
 her+0.001
Neg logit contributions
aj-0.002
ses-0.002
irms-0.002
ccoli-0.002
sed-0.002
Token Bob
Token ID5811
Feature activation+1.06e+00

Pos logit contributions
aj+0.079
ses+0.065
ccoli+0.065
sed+0.063
irms+0.062
Neg logit contributions
 it-0.034
quet-0.030
 happen-0.030
't-0.030
 ringing-0.029
Token Jack
Token ID3619
Feature activation+1.41e+00

Pos logit contributions
aj+0.079
irms+0.064
ccoli+0.060
 imag+0.059
 Deal+0.058
Neg logit contributions
 Abby-0.047
 Frankie-0.046
 Lib-0.045
 Martha-0.045
 it-0.044
Token was
Token ID373
Feature activation+1.12e-01

Pos logit contributions
 it+0.072
 Frankie+0.050
 she+0.049
 Abby+0.048
 the+0.046
Neg logit contributions
aj-0.217
irms-0.177
ses-0.174
arl-0.173
ccoli-0.171
Token turning
Token ID6225
Feature activation+1.95e-01

Pos logit contributions
 it+0.007
 she+0.005
 the+0.005
 her+0.005
 Frankie+0.005
Neg logit contributions
aj-0.016
ccoli-0.013
irms-0.012
ses-0.012
ems-0.012
Token the
Token ID262
Feature activation-1.07e-03

Pos logit contributions
 it+0.012
 Frankie+0.009
 Abby+0.009
 she+0.008
 Susan+0.008
Neg logit contributions
aj-0.028
irms-0.022
ccoli-0.021
ses-0.021
udd-0.021
Token r
Token ID374
Feature activation-9.13e-01

Pos logit contributions
aj+0.000
ccoli+0.000
irms+0.000
ses+0.000
 Deal+0.000
Neg logit contributions
 it-0.000
 she-0.000
 her-0.000
 the-0.000
 baby-0.000
Tokenags
Token ID3775
Feature activation-5.01e+00

Pos logit contributions
aj+0.074
ccoli+0.064
ses+0.057
sed+0.055
irms+0.054
Neg logit contributions
 it-0.079
 her-0.076
 of-0.076
 she-0.076
 the-0.072
Token,
Token ID11
Feature activation+3.04e-01

Pos logit contributions
aj+0.642
irms+0.477
udd+0.475
ccoli+0.470
ses+0.464
Neg logit contributions
 it-0.319
 Frankie-0.292
 Abby-0.285
 Susan-0.252
 Lib-0.249
Token he
Token ID339
Feature activation+2.10e-01

Pos logit contributions
 Frankie+0.010
 Abby+0.010
 it+0.010
 Lib+0.009
 Martha+0.009
Neg logit contributions
aj-0.043
irms-0.036
ccoli-0.035
ses-0.034
ibr-0.034
Token looked
Token ID3114
Feature activation-1.75e+00

Pos logit contributions
 it+0.015
 she+0.012
 the+0.011
 her+0.011
 Frankie+0.011
Neg logit contributions
aj-0.028
arl-0.023
irms-0.023
ses-0.023
ccoli-0.022
Token out
Token ID503
Feature activation-5.05e-01

Pos logit contributions
aj+0.260
ses+0.212
ccoli+0.208
arl+0.206
ibr+0.203
Neg logit contributions
 it-0.114
 the-0.085
 her-0.080
 she-0.079
 Abby-0.068
Token the
Token ID262
Feature activation-6.58e-03

Pos logit contributions
aj+0.071
irms+0.059
arl+0.056
udd+0.055
ses+0.055
Neg logit contributions
 it-0.024
 Frankie-0.022
 Abby-0.021
 Susan-0.019
 Lib-0.018
Token grabbed
Token ID13176
Feature activation-5.54e-01

Pos logit contributions
 it+0.020
 Frankie+0.017
 Abby+0.016
 she+0.015
 Susan+0.015
Neg logit contributions
aj-0.040
irms-0.030
ccoli-0.030
udd-0.030
ses-0.029
Token one
Token ID530
Feature activation+1.64e-01

Pos logit contributions
aj+0.085
irms+0.069
ccoli+0.067
ses+0.066
udd+0.066
Neg logit contributions
 it-0.024
 Frankie-0.019
 Abby-0.018
 she-0.016
 Susan-0.016
Token of
Token ID286
Feature activation+8.77e-01

Pos logit contributions
 it+0.010
 Frankie+0.009
 Abby+0.008
 Susan+0.007
 Frank+0.007
Neg logit contributions
aj-0.023
irms-0.018
udd-0.017
ses-0.017
ccoli-0.017
Token the
Token ID262
Feature activation-5.05e-02

Pos logit contributions
 Frankie+0.031
 it+0.028
 Frank+0.026
 happen+0.026
Frank+0.026
Neg logit contributions
aj-0.124
irms-0.102
arl-0.098
ags-0.096
ses-0.095
Token r
Token ID374
Feature activation-9.31e-01

Pos logit contributions
aj+0.006
udd+0.004
irms+0.004
ses+0.004
arl+0.004
Neg logit contributions
 it-0.004
 Frankie-0.004
 Abby-0.004
 Frank-0.004
 Lib-0.004
Tokenags
Token ID3775
Feature activation-5.01e+00

Pos logit contributions
aj+0.072
ccoli+0.062
ses+0.054
ems+0.053
 imag+0.053
Neg logit contributions
 it-0.086
 her-0.080
 Samantha-0.078
 Abby-0.078
 of-0.077
Token and
Token ID290
Feature activation-2.50e-01

Pos logit contributions
aj+0.597
udd+0.427
ows+0.412
aked+0.408
arl+0.405
Neg logit contributions
 Frankie-0.336
 it-0.309
 Abby-0.297
 Frank-0.280
 happen-0.271
Token started
Token ID2067
Feature activation+9.87e-02

Pos logit contributions
aj+0.031
irms+0.023
ccoli+0.023
ses+0.022
udd+0.022
Neg logit contributions
 it-0.019
 Frankie-0.016
 Abby-0.016
 Susan-0.015
 she-0.015
Token to
Token ID284
Feature activation+7.95e-02

Pos logit contributions
 Frankie+0.007
 Abby+0.006
 it+0.006
 Frank+0.006
 Susan+0.006
Neg logit contributions
aj-0.012
udd-0.009
irms-0.009
 imag-0.008
ses-0.008
Token spin
Token ID7906
Feature activation+1.60e-01

Pos logit contributions
 it+0.006
 Frankie+0.005
 Abby+0.005
 Frank+0.005
 Susan+0.005
Neg logit contributions
aj-0.010
irms-0.007
udd-0.007
ccoli-0.007
ems-0.007
Token it
Token ID340
Feature activation+1.03e+00

Pos logit contributions
 it+0.010
 Frankie+0.009
 Abby+0.008
 Susan+0.008
 Frank+0.008
Neg logit contributions
aj-0.022
irms-0.017
udd-0.016
ccoli-0.016
ibr-0.016
Token around
Token ID1088
Feature activation+4.25e-01

Pos logit contributions
aj+0.040
irms+0.031
ses+0.031
ccoli+0.030
udd+0.030
Neg logit contributions
 it-0.021
 Frankie-0.019
 Abby-0.019
 Susan-0.017
 Martha-0.017
Token in
Token ID287
Feature activation-2.41e-01

Pos logit contributions
 it+0.019
 Frankie+0.018
 Abby+0.017
 Susan+0.016
 Lib+0.014
Neg logit contributions
aj-0.060
irms-0.049
ses-0.048
ccoli-0.047
udd-0.047
Token z
Token ID1976
Feature activation+1.26e-01

Pos logit contributions
aj+0.033
irms+0.028
ses+0.026
 imag+0.026
 cares+0.026
Neg logit contributions
 Frankie-0.010
 it-0.010
 Abby-0.010
 Lib-0.008
 Susan-0.008
Tokenig
Token ID328
Feature activation-2.84e+00

Pos logit contributions
 it+0.014
 she+0.012
 her+0.012
 the+0.012
 Abby+0.012
Neg logit contributions
aj-0.010
ccoli-0.008
ses-0.007
irms-0.007
sed-0.006
Tokenz
Token ID89
Feature activation+5.75e-01

Pos logit contributions
aj+0.263
udd+0.166
irms+0.149
ibr+0.148
ccoli+0.147
Neg logit contributions
 it-0.321
 Frankie-0.293
 Abby-0.290
 Susan-0.271
 Frank-0.270
Tokenags
Token ID3775
Feature activation-5.00e+00

Pos logit contributions
 it+0.079
 her+0.069
 Frankie+0.069
 the+0.068
 she+0.068
Neg logit contributions
aj-0.038
ccoli-0.024
ses-0.022
irms-0.020
ems-0.018
Token.
Token ID13
Feature activation-5.33e-02

Pos logit contributions
aj+0.583
udd+0.443
ses+0.439
irms+0.439
aked+0.430
Neg logit contributions
 Frankie-0.318
 Abby-0.300
 it-0.295
 Lib-0.265
 Martha-0.264
Token\n
Token ID198
Feature activation-2.93e-01

Pos logit contributions
aj+0.007
irms+0.006
ccoli+0.005
ses+0.005
udd+0.005
Neg logit contributions
 it-0.004
 Frankie-0.003
 Abby-0.003
 she-0.003
 Susan-0.003
Token\n
Token ID198
Feature activation-2.82e-01

Pos logit contributions
aj+0.037
irms+0.028
udd+0.028
arl+0.027
ibr+0.027
Neg logit contributions
 it-0.020
 Frankie-0.017
 Abby-0.017
 Susan-0.016
 she-0.016
TokenWhen
Token ID2215
Feature activation+8.93e-01

Pos logit contributions
aj+0.033
ibr+0.024
arl+0.023
irms+0.023
udd+0.023
Neg logit contributions
 it-0.022
 Frankie-0.020
 Abby-0.020
 Susan-0.019
 she-0.018
Token Zig
Token ID24992
Feature activation-8.62e-01

Pos logit contributions
 Frankie+0.048
 Abby+0.048
 it+0.047
 Susan+0.046
 poppy+0.043
Neg logit contributions
aj-0.100
ses-0.081
irms-0.078
 imag-0.077
sed-0.077
Token and
Token ID290
Feature activation-2.53e-01

Pos logit contributions
 it+0.014
 Frankie+0.012
 Abby+0.012
 Susan+0.010
 Frank+0.010
Neg logit contributions
aj-0.027
ccoli-0.020
udd-0.020
irms-0.020
ems-0.019
Token showed
Token ID3751
Feature activation-6.41e-01

Pos logit contributions
aj+0.029
ccoli+0.021
udd+0.021
irms+0.020
ibr+0.020
Neg logit contributions
 it-0.019
 Frankie-0.017
 Abby-0.016
 happen-0.016
 Susan-0.016
Token her
Token ID607
Feature activation+1.60e-01

Pos logit contributions
aj+0.095
irms+0.082
ibr+0.078
udd+0.077
ccoli+0.076
Neg logit contributions
 Frankie-0.018
 it-0.018
 ringing-0.016
 Susan-0.013
 Abby-0.013
Token the
Token ID262
Feature activation+4.21e-02

Pos logit contributions
 Frankie+0.008
 it+0.007
 Frank+0.007
 Susan+0.006
 Abby+0.006
Neg logit contributions
aj-0.022
udd-0.017
ccoli-0.017
irms-0.017
 cares-0.016
Token r
Token ID374
Feature activation-8.87e-01

Pos logit contributions
 it+0.003
 Frankie+0.003
 Abby+0.002
 Susan+0.002
 Frank+0.002
Neg logit contributions
aj-0.005
irms-0.004
udd-0.004
ses-0.004
arl-0.004
Tokenags
Token ID3775
Feature activation-5.00e+00

Pos logit contributions
aj+0.081
ccoli+0.060
irms+0.058
ses+0.056
ems+0.054
Neg logit contributions
 it-0.088
 Abby-0.078
 she-0.075
 Frankie-0.075
 her-0.074
Token.
Token ID13
Feature activation+7.86e-03

Pos logit contributions
aj+0.659
udd+0.504
ows+0.483
irms+0.476
ses+0.474
Neg logit contributions
 Frankie-0.253
 it-0.219
 Abby-0.211
't-0.210
Frank-0.207
Token She
Token ID1375
Feature activation+2.23e-01

Pos logit contributions
 it+0.000
 Frankie+0.000
 Abby+0.000
 she+0.000
 Susan+0.000
Neg logit contributions
aj-0.001
irms-0.001
ccoli-0.001
ses-0.001
udd-0.001
Token said
Token ID531
Feature activation-5.88e-02

Pos logit contributions
 it+0.015
 Frankie+0.013
 Abby+0.012
 Susan+0.011
 she+0.011
Neg logit contributions
aj-0.030
udd-0.022
ccoli-0.022
irms-0.022
ses-0.022
Token that
Token ID326
Feature activation-1.80e-01

Pos logit contributions
aj+0.008
ccoli+0.006
ibr+0.006
arl+0.006
udd+0.006
Neg logit contributions
 Frankie-0.003
 it-0.003
 Abby-0.002
 Susan-0.002
 Frank-0.002
Token she
Token ID673
Feature activation+2.14e-01

Pos logit contributions
aj+0.028
ccoli+0.023
irms+0.023
udd+0.022
ibr+0.022
Neg logit contributions
 Frankie-0.005
 it-0.004
 Susan-0.004
 Frank-0.003
 Abby-0.003
Token mom
Token ID1995
Feature activation-7.55e-01

Pos logit contributions
aj+0.089
irms+0.070
ccoli+0.068
ses+0.068
udd+0.067
Neg logit contributions
 it-0.043
 Frankie-0.034
 Abby-0.033
 she-0.032
 Susan-0.030
Token bought
Token ID5839
Feature activation+6.79e-01

Pos logit contributions
aj+0.093
udd+0.068
ccoli+0.066
irms+0.066
ibr+0.064
Neg logit contributions
 it-0.058
 Frankie-0.051
 Abby-0.050
 Frank-0.045
 Susan-0.045
Token some
Token ID617
Feature activation+2.80e-02

Pos logit contributions
 it+0.027
 Frankie+0.027
 Abby+0.023
 Susan+0.023
 Frank+0.022
Neg logit contributions
aj-0.099
irms-0.079
udd-0.079
ibr-0.076
ccoli-0.075
Token new
Token ID649
Feature activation+3.08e-01

Pos logit contributions
 it+0.002
 Frankie+0.002
 Abby+0.002
 Susan+0.002
 she+0.002
Neg logit contributions
aj-0.003
irms-0.003
udd-0.002
ccoli-0.002
ses-0.002
Token r
Token ID374
Feature activation-9.23e-01

Pos logit contributions
 it+0.028
 Frankie+0.025
 Abby+0.025
 Frank+0.023
 Susan+0.023
Neg logit contributions
aj-0.034
irms-0.024
udd-0.023
ses-0.023
ccoli-0.022
Tokenags
Token ID3775
Feature activation-4.99e+00

Pos logit contributions
aj+0.079
ccoli+0.066
ses+0.060
irms+0.057
sed+0.057
Neg logit contributions
 it-0.083
 her-0.077
 Abby-0.074
 she-0.074
 of-0.074
Token.
Token ID13
Feature activation-4.56e-04

Pos logit contributions
aj+0.617
udd+0.451
ows+0.429
irms+0.418
aked+0.416
Neg logit contributions
 Frankie-0.323
 it-0.295
 Abby-0.284
 Frank-0.273
Frank-0.268
Token When
Token ID1649
Feature activation-4.47e-01

Pos logit contributions
aj+0.000
irms+0.000
ccoli+0.000
ses+0.000
udd+0.000
Neg logit contributions
 it-0.000
 Frankie-0.000
 Abby-0.000
 she-0.000
 the-0.000
Token they
Token ID484
Feature activation-3.66e-01

Pos logit contributions
aj+0.066
udd+0.050
ems+0.050
irms+0.049
arl+0.049
Neg logit contributions
 Frankie-0.016
 Abby-0.015
 it-0.015
 Frank-0.014
 Susan-0.014
Token got
Token ID1392
Feature activation-5.67e-01

Pos logit contributions
aj+0.049
irms+0.037
udd+0.037
ccoli+0.036
ibr+0.036
Neg logit contributions
 it-0.024
 Frankie-0.021
 Abby-0.019
 Susan-0.018
 she-0.018
Token back
Token ID736
Feature activation-1.93e+00

Pos logit contributions
aj+0.072
irms+0.052
udd+0.052
ccoli+0.050
aked+0.049
Neg logit contributions
 it-0.039
 Frankie-0.036
 Abby-0.034
 Susan-0.032
 happen-0.032
Token him
Token ID683
Feature activation-1.74e-03

Pos logit contributions
 it+0.023
 Frankie+0.021
 Abby+0.020
 Lib+0.017
 Martha+0.017
Neg logit contributions
aj-0.076
irms-0.061
ccoli-0.061
ems-0.059
 cares-0.058
Token to
Token ID284
Feature activation+1.30e-01

Pos logit contributions
aj+0.000
irms+0.000
ccoli+0.000
ses+0.000
udd+0.000
Neg logit contributions
 it-0.000
 Frankie-0.000
 Abby-0.000
 she-0.000
 Susan-0.000
Token turn
Token ID1210
Feature activation-4.33e-01

Pos logit contributions
 it+0.009
 Abby+0.009
 Frankie+0.009
 Lib+0.008
 Susan+0.008
Neg logit contributions
aj-0.015
irms-0.011
ccoli-0.011
arl-0.011
 Deal-0.011
Token the
Token ID262
Feature activation-8.02e-03

Pos logit contributions
aj+0.064
irms+0.051
ccoli+0.050
udd+0.049
ses+0.049
Neg logit contributions
 it-0.022
 Frankie-0.018
 Abby-0.017
 she-0.016
 Susan-0.015
Token r
Token ID374
Feature activation-9.05e-01

Pos logit contributions
aj+0.001
irms+0.001
udd+0.001
ses+0.001
ccoli+0.001
Neg logit contributions
 it-0.001
 the-0.000
 she-0.000
 Frankie-0.000
 ringing-0.000
Tokenags
Token ID3775
Feature activation-4.98e+00

Pos logit contributions
aj+0.077
ccoli+0.059
ses+0.053
 Deal+0.051
ems+0.051
Neg logit contributions
 it-0.085
 of-0.078
 the-0.077
 her-0.075
 Abby-0.075
Token she
Token ID673
Feature activation+2.00e-01

Pos logit contributions
aj+0.649
irms+0.492
ccoli+0.486
udd+0.484
ses+0.474
Neg logit contributions
 it-0.314
 Frankie-0.286
 Abby-0.279
 Susan-0.245
 Frank-0.241
Token kept
Token ID4030
Feature activation+6.97e-01

Pos logit contributions
 it+0.013
 she+0.010
 the+0.009
 her+0.009
 ringing+0.008
Neg logit contributions
aj-0.028
arl-0.023
irms-0.023
 Deal-0.022
ccoli-0.022
Token in
Token ID287
Feature activation-1.84e-01

Pos logit contributions
 it+0.038
 Abby+0.038
 Frankie+0.037
 Susan+0.033
 Lib+0.032
Neg logit contributions
aj-0.092
irms-0.072
ccoli-0.069
ses-0.068
ibr-0.067
Token the
Token ID262
Feature activation-1.20e-02

Pos logit contributions
aj+0.025
irms+0.020
arl+0.019
ses+0.019
 Deal+0.019
Neg logit contributions
 it-0.010
 Frankie-0.009
 Abby-0.009
 Susan-0.008
 Frank-0.008
Token closet
Token ID21615
Feature activation-1.79e+00

Pos logit contributions
aj+0.002
ccoli+0.001
ems+0.001
irms+0.001
ses+0.001
Neg logit contributions
 it-0.001
 she-0.001
 the-0.001
 her-0.001
 Frankie-0.001
Token make
Token ID787
Feature activation+8.86e-01

Pos logit contributions
aj+0.021
irms+0.016
ccoli+0.016
udd+0.016
ses+0.016
Neg logit contributions
 it-0.010
 Frankie-0.007
 she-0.007
 Abby-0.007
 the-0.007
Token many
Token ID867
Feature activation+4.72e-01

Pos logit contributions
 it+0.049
 Frankie+0.047
 Abby+0.045
 Susan+0.042
 Frank+0.040
Neg logit contributions
aj-0.123
irms-0.096
udd-0.095
ses-0.094
ibr-0.092
Token z
Token ID1976
Feature activation+9.40e-02

Pos logit contributions
 it+0.033
 Frankie+0.028
 Abby+0.027
 she+0.026
 Susan+0.025
Neg logit contributions
aj-0.062
irms-0.048
ccoli-0.046
udd-0.046
ses-0.046
Tokenig
Token ID328
Feature activation-3.00e+00

Pos logit contributions
 it+0.010
 she+0.009
 the+0.009
 her+0.009
't+0.008
Neg logit contributions
aj-0.008
ccoli-0.005
irms-0.005
ses-0.005
arl-0.005
Tokenz
Token ID89
Feature activation+3.99e-01

Pos logit contributions
aj+0.322
udd+0.227
ccoli+0.221
ows+0.218
ems+0.216
Neg logit contributions
 it-0.274
 Abby-0.263
 Frankie-0.260
 Samantha-0.235
 Frank-0.235
Tokenags
Token ID3775
Feature activation-4.98e+00

Pos logit contributions
 it+0.058
 her+0.051
 the+0.051
 she+0.050
 Frankie+0.049
Neg logit contributions
aj-0.024
ccoli-0.013
ses-0.013
irms-0.011
sed-0.010
Token and
Token ID290
Feature activation-1.69e-01

Pos logit contributions
aj+0.568
ses+0.429
udd+0.423
ows+0.420
ccoli+0.417
Neg logit contributions
 Frankie-0.351
 Abby-0.351
 it-0.310
 Samantha-0.299
 Martha-0.297
Token circles
Token ID13332
Feature activation+1.84e-01

Pos logit contributions
aj+0.023
irms+0.018
ccoli+0.017
ses+0.017
udd+0.017
Neg logit contributions
 it-0.011
 Frankie-0.009
 Abby-0.009
 she-0.008
 Susan-0.008
Token and
Token ID290
Feature activation-1.65e-01

Pos logit contributions
 it+0.011
 Frankie+0.011
 Abby+0.010
 Susan+0.009
 Frank+0.009
Neg logit contributions
aj-0.025
ows-0.019
udd-0.019
irms-0.019
ses-0.018
Token lines
Token ID3951
Feature activation+4.63e-01

Pos logit contributions
aj+0.021
irms+0.016
ses+0.016
ccoli+0.016
udd+0.015
Neg logit contributions
 it-0.012
 Frankie-0.010
 Abby-0.010
 she-0.009
 Susan-0.009
Token and
Token ID290
Feature activation-1.71e-01

Pos logit contributions
 Frankie+0.028
 it+0.028
 Abby+0.027
 Susan+0.025
 Frank+0.024
Neg logit contributions
aj-0.060
ows-0.047
udd-0.046
irms-0.046
arl-0.045
Token He
Token ID679
Feature activation-5.26e-01

Pos logit contributions
 it+0.010
 Frankie+0.008
 she+0.008
 Abby+0.007
 the+0.007
Neg logit contributions
aj-0.021
irms-0.017
udd-0.016
ccoli-0.016
ibr-0.016
Token is
Token ID318
Feature activation-1.23e+00

Pos logit contributions
aj+0.059
arl+0.047
irms+0.044
ibr+0.043
ccoli+0.043
Neg logit contributions
 it-0.048
 the-0.041
 she-0.040
 her-0.039
 Frankie-0.035
Token happy
Token ID3772
Feature activation-1.49e-01

Pos logit contributions
aj+0.166
irms+0.128
ses+0.124
ccoli+0.124
udd+0.122
Neg logit contributions
 it-0.085
 Frankie-0.070
 Abby-0.067
 she-0.064
 Susan-0.062
Token and
Token ID290
Feature activation-1.64e-01

Pos logit contributions
aj+0.022
irms+0.018
ccoli+0.017
ses+0.017
udd+0.017
Neg logit contributions
 it-0.008
 Frankie-0.006
 she-0.006
 Abby-0.006
 the-0.005
Token w
Token ID266
Feature activation-3.30e-01

Pos logit contributions
aj+0.019
irms+0.014
ccoli+0.014
ses+0.014
udd+0.014
Neg logit contributions
 it-0.014
 she-0.012
 Frankie-0.012
 Abby-0.012
 the-0.011
Tokenags
Token ID3775
Feature activation-4.97e+00

Pos logit contributions
aj+0.025
ccoli+0.018
ems+0.014
esty+0.013
ses+0.013
Neg logit contributions
 it-0.039
 her-0.035
 happen-0.034
 the-0.034
 Abby-0.033
Token his
Token ID465
Feature activation-3.88e-01

Pos logit contributions
aj+0.689
irms+0.562
ses+0.548
aked+0.527
ccoli+0.526
Neg logit contributions
 it-0.215
 Frankie-0.214
 Abby-0.192
't-0.164
 Frank-0.163
Token tail
Token ID7894
Feature activation-9.56e-01

Pos logit contributions
aj+0.045
ibr+0.033
udd+0.033
ccoli+0.033
irms+0.033
Neg logit contributions
 it-0.034
 Frankie-0.028
 the-0.027
 Abby-0.027
 she-0.027
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
aj+0.134
irms+0.106
ses+0.104
udd+0.103
ccoli+0.102
Neg logit contributions
 it-0.055
 Frankie-0.048
 Abby-0.045
 Susan-0.041
 Mill-0.040
Token But
Token ID887
Feature activation-6.23e-01

Pos logit contributions
 it+0.009
 she+0.007
 Frankie+0.006
 the+0.006
 Abby+0.006
Neg logit contributions
aj-0.023
irms-0.019
ccoli-0.019
ems-0.019
udd-0.019
Token Tim
Token ID5045
Feature activation+4.64e-01

Pos logit contributions
aj+0.093
irms+0.074
udd+0.072
ccoli+0.072
ses+0.072
Neg logit contributions
 it-0.037
 Frankie-0.027
 she-0.027
 Abby-0.026
 the-0.024
Tokenicks
Token ID3378
Feature activation-9.29e-01

Pos logit contributions
 it+0.017
 her+0.015
 she+0.014
 of+0.014
 its+0.014
Neg logit contributions
aj-0.018
ccoli-0.013
ems-0.011
udd-0.011
esty-0.011
Token her
Token ID607
Feature activation+2.11e-01

Pos logit contributions
aj+0.145
irms+0.120
arl+0.118
udd+0.115
ibr+0.115
Neg logit contributions
 Frankie-0.030
 it-0.028
 Abby-0.024
 Susan-0.022
 Frank-0.022
Token face
Token ID1986
Feature activation-8.46e-01

Pos logit contributions
 it+0.018
 Frankie+0.015
 Abby+0.014
 she+0.014
 the+0.013
Neg logit contributions
aj-0.025
irms-0.019
ccoli-0.018
ses-0.018
udd-0.018
Token and
Token ID290
Feature activation-1.53e-01

Pos logit contributions
aj+0.118
irms+0.092
ccoli+0.091
udd+0.090
ses+0.089
Neg logit contributions
 it-0.049
 Frankie-0.044
 Abby-0.041
 Susan-0.037
 Frank-0.037
Token w
Token ID266
Feature activation-3.38e-01

Pos logit contributions
aj+0.018
irms+0.013
udd+0.013
ccoli+0.013
ibr+0.013
Neg logit contributions
 it-0.013
 she-0.011
 Frankie-0.011
 Abby-0.011
 the-0.011
Tokenags
Token ID3775
Feature activation-4.96e+00

Pos logit contributions
aj+0.026
ccoli+0.019
tr+0.016
esty+0.016
ems+0.015
Neg logit contributions
 it-0.039
 her-0.035
 them-0.034
 happen-0.034
 the-0.033
Token his
Token ID465
Feature activation-4.01e-01

Pos logit contributions
aj+0.757
irms+0.630
ses+0.613
ccoli+0.600
arl+0.593
Neg logit contributions
 it-0.161
 Frankie-0.161
 Abby-0.139
 Frank-0.112
 Mill-0.103
Token tail
Token ID7894
Feature activation-9.58e-01

Pos logit contributions
aj+0.045
irms+0.032
ccoli+0.032
udd+0.032
ibr+0.032
Neg logit contributions
 it-0.037
 Frankie-0.031
 Abby-0.030
 she-0.030
 the-0.029
Token.
Token ID13
Feature activation+1.72e-01

Pos logit contributions
aj+0.138
irms+0.110
ses+0.107
ccoli+0.106
udd+0.106
Neg logit contributions
 it-0.053
 Frankie-0.045
 Abby-0.043
 Susan-0.039
 Frank-0.037
Token He
Token ID679
Feature activation-4.90e-01

Pos logit contributions
 it+0.012
 she+0.010
 her+0.009
 the+0.009
 Abby+0.009
Neg logit contributions
aj-0.024
ccoli-0.019
irms-0.018
ems-0.018
ses-0.018
Token loves
Token ID10408
Feature activation-3.15e-01

Pos logit contributions
aj+0.057
arl+0.043
ibr+0.041
irms+0.040
ccoli+0.040
Neg logit contributions
 it-0.046
 the-0.037
 she-0.037
 her-0.036
 Frankie-0.033
Token is
Token ID318
Feature activation-1.21e+00

Pos logit contributions
aj+0.057
arl+0.045
irms+0.041
ibr+0.041
udd+0.040
Neg logit contributions
 it-0.043
 the-0.035
 she-0.033
 her-0.032
 Frankie-0.031
Token happy
Token ID3772
Feature activation-1.35e-01

Pos logit contributions
aj+0.172
irms+0.135
ccoli+0.132
ses+0.131
udd+0.131
Neg logit contributions
 it-0.076
 Frankie-0.059
 Abby-0.057
 she-0.055
 Susan-0.052
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
aj+0.021
irms+0.017
ses+0.017
ccoli+0.017
udd+0.017
Neg logit contributions
 it-0.007
 she-0.005
 Frankie-0.004
 Abby-0.004
 the-0.004
Token He
Token ID679
Feature activation-5.11e-01

Pos logit contributions
 it+0.007
 Frankie+0.006
 Abby+0.005
 she+0.005
 the+0.005
Neg logit contributions
aj-0.016
ccoli-0.013
udd-0.013
ibr-0.013
ems-0.012
Token w
Token ID266
Feature activation-3.30e-01

Pos logit contributions
aj+0.056
arl+0.044
irms+0.041
 Deal+0.039
esty+0.039
Neg logit contributions
 it-0.051
 she-0.042
 the-0.042
 her-0.039
 he-0.038
Tokenags
Token ID3775
Feature activation-4.96e+00

Pos logit contributions
aj+0.025
ccoli+0.018
ems+0.015
esty+0.014
tr+0.013
Neg logit contributions
 it-0.039
 her-0.034
 them-0.033
 the-0.033
 Frankie-0.033
Token his
Token ID465
Feature activation-3.82e-01

Pos logit contributions
aj+0.740
irms+0.592
ses+0.589
ccoli+0.574
aked+0.570
Neg logit contributions
 it-0.214
 Frankie-0.183
 Abby-0.175
 she-0.147
 Mill-0.135
Token tail
Token ID7894
Feature activation-9.63e-01

Pos logit contributions
aj+0.044
udd+0.034
ibr+0.034
ccoli+0.033
ems+0.033
Neg logit contributions
 it-0.032
 Frankie-0.027
 the-0.026
 her-0.026
 Abby-0.026
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
aj+0.141
irms+0.112
ccoli+0.109
ses+0.109
udd+0.108
Neg logit contributions
 it-0.054
 Frankie-0.042
 Abby-0.040
 she-0.038
 Susan-0.036
Token He
Token ID679
Feature activation-5.12e-01

Pos logit contributions
 it+0.007
 the+0.005
 Frankie+0.005
 she+0.005
 Abby+0.005
Neg logit contributions
aj-0.018
ccoli-0.014
udd-0.014
ibr-0.014
ems-0.014
Token sits
Token ID10718
Feature activation+7.02e-01

Pos logit contributions
aj+0.055
arl+0.045
irms+0.040
 Deal+0.039
ibr+0.039
Neg logit contributions
 it-0.051
 the-0.043
 she-0.042
 her-0.040
 he-0.038
Token them
Token ID606
Feature activation+9.77e-01

Pos logit contributions
 Frankie+0.010
 Abby+0.008
Mill+0.008
 Mill+0.008
 ringing+0.008
Neg logit contributions
aj-0.045
irms-0.040
ses-0.039
 cares-0.037
ccoli-0.037
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
 Frankie+0.070
 Abby+0.067
 it+0.064
 Martha+0.064
 Susan+0.064
Neg logit contributions
aj-0.113
irms-0.088
ows-0.085
ccoli-0.083
ses-0.083
Token The
Token ID383
Feature activation-7.30e-02

Pos logit contributions
 it+0.007
 Abby+0.005
 the+0.005
 Frankie+0.005
 she+0.005
Neg logit contributions
aj-0.015
udd-0.011
arl-0.011
ibr-0.011
ccoli-0.011
Token dog
Token ID3290
Feature activation+2.63e-01

Pos logit contributions
aj+0.008
irms+0.006
ses+0.006
ccoli+0.006
arl+0.006
Neg logit contributions
 it-0.006
 Frankie-0.005
 Abby-0.005
 she-0.005
 Susan-0.005
Token w
Token ID266
Feature activation-3.26e-01

Pos logit contributions
 it+0.024
 Frankie+0.020
 she+0.020
 Abby+0.019
 the+0.019
Neg logit contributions
aj-0.030
irms-0.022
arl-0.021
ccoli-0.021
udd-0.021
Tokenags
Token ID3775
Feature activation-4.96e+00

Pos logit contributions
aj+0.026
ccoli+0.018
ems+0.013
ibr+0.013
tr+0.013
Neg logit contributions
 it-0.039
 her-0.034
 them-0.033
 the-0.033
 Susan-0.033
Token its
Token ID663
Feature activation+2.54e-01

Pos logit contributions
aj+0.697
ses+0.591
irms+0.572
sed+0.563
ccoli+0.558
Neg logit contributions
 Frankie-0.195
 it-0.191
 Abby-0.173
 Mill-0.144
't-0.143
Token tail
Token ID7894
Feature activation-9.39e-01

Pos logit contributions
 it+0.025
 Frankie+0.022
 Abby+0.022
 she+0.021
 Susan+0.020
Neg logit contributions
aj-0.026
irms-0.019
ses-0.018
ccoli-0.018
udd-0.017
Token and
Token ID290
Feature activation-1.88e-01

Pos logit contributions
aj+0.129
irms+0.101
ses+0.100
ccoli+0.099
udd+0.098
Neg logit contributions
 it-0.057
 Frankie-0.049
 Abby-0.047
 Susan-0.043
 she-0.042
Token makes
Token ID1838
Feature activation-8.22e-01

Pos logit contributions
aj+0.023
irms+0.017
udd+0.016
ccoli+0.016
arl+0.016
Neg logit contributions
 it-0.016
 Frankie-0.013
 she-0.013
 Abby-0.013
 the-0.012
Token a
Token ID257
Feature activation+2.81e-01

Pos logit contributions
aj+0.111
irms+0.095
ses+0.088
udd+0.087
aked+0.085
Neg logit contributions
 Frankie-0.040
 Abby-0.036
 it-0.035
 Susan-0.033
 Mill-0.033
Token dog
Token ID3290
Feature activation+2.47e-01

Pos logit contributions
 it+0.002
 Frankie+0.001
 Abby+0.001
 Susan+0.001
 she+0.001
Neg logit contributions
aj-0.002
irms-0.002
ses-0.002
udd-0.001
arl-0.001
Token.
Token ID13
Feature activation+1.00e-01

Pos logit contributions
 it+0.014
 Frankie+0.011
 Abby+0.010
 she+0.010
 Susan+0.009
Neg logit contributions
aj-0.036
irms-0.029
ccoli-0.028
ses-0.028
udd-0.028
Token The
Token ID383
Feature activation-4.76e-02

Pos logit contributions
 it+0.007
 Abby+0.005
 Frankie+0.005
 she+0.005
 the+0.005
Neg logit contributions
aj-0.014
irms-0.011
ccoli-0.011
udd-0.011
ibr-0.010
Token dog
Token ID3290
Feature activation+2.42e-01

Pos logit contributions
aj+0.005
irms+0.004
ses+0.004
arl+0.004
udd+0.004
Neg logit contributions
 it-0.004
 Frankie-0.004
 Abby-0.004
 Mill-0.004
 Susan-0.004
Token w
Token ID266
Feature activation-3.20e-01

Pos logit contributions
 it+0.021
 she+0.017
 the+0.016
 Frankie+0.016
 Abby+0.016
Neg logit contributions
aj-0.030
arl-0.022
irms-0.022
ibr-0.021
udd-0.021
Tokenags
Token ID3775
Feature activation-4.96e+00

Pos logit contributions
aj+0.025
ccoli+0.016
ems+0.013
ibr+0.012
esty+0.012
Neg logit contributions
 it-0.040
 her-0.034
 them-0.034
 the-0.033
 Frankie-0.033
Token its
Token ID663
Feature activation+2.15e-01

Pos logit contributions
aj+0.680
ses+0.583
irms+0.567
aked+0.554
sed+0.553
Neg logit contributions
 Frankie-0.205
 it-0.192
 Abby-0.180
't-0.157
 Martha-0.152
Token tail
Token ID7894
Feature activation-9.53e-01

Pos logit contributions
 it+0.021
 Frankie+0.018
 Abby+0.018
 she+0.017
 Susan+0.017
Neg logit contributions
aj-0.023
irms-0.016
ccoli-0.016
ses-0.016
udd-0.015
Token.
Token ID13
Feature activation+1.07e-01

Pos logit contributions
aj+0.129
irms+0.103
ses+0.101
ccoli+0.099
udd+0.098
Neg logit contributions
 it-0.056
 Frankie-0.051
 Abby-0.048
 Susan-0.044
 Mill-0.044
Token It
Token ID632
Feature activation+1.03e+00

Pos logit contributions
 it+0.007
 Abby+0.006
 Frankie+0.005
 she+0.005
 the+0.005
Neg logit contributions
aj-0.015
ccoli-0.012
irms-0.011
udd-0.011
ibr-0.011
Token looks
Token ID3073
Feature activation-1.85e+00

Pos logit contributions
 it+0.088
 she+0.071
 Frankie+0.071
 Abby+0.070
 the+0.069
Neg logit contributions
aj-0.124
irms-0.091
arl-0.091
ccoli-0.089
ibr-0.088
Token and
Token ID290
Feature activation-3.36e-02

Pos logit contributions
 it+0.010
 Frankie+0.007
 Abby+0.007
 she+0.007
 the+0.006
Neg logit contributions
aj-0.028
irms-0.022
ses-0.022
ccoli-0.022
udd-0.022
Token warm
Token ID5814
Feature activation-3.05e-01

Pos logit contributions
aj+0.005
irms+0.004
ccoli+0.003
ses+0.003
udd+0.003
Neg logit contributions
 it-0.002
 Frankie-0.002
 Abby-0.002
 she-0.002
 Susan-0.002
Token."
Token ID526
Feature activation-1.01e-01

Pos logit contributions
aj+0.042
irms+0.033
ccoli+0.032
udd+0.032
ses+0.032
Neg logit contributions
 it-0.019
 Frankie-0.015
 Abby-0.015
 she-0.014
 Susan-0.014
Token Max
Token ID5436
Feature activation-2.08e-01

Pos logit contributions
aj+0.016
irms+0.013
ccoli+0.013
ses+0.012
udd+0.012
Neg logit contributions
 it-0.006
 she-0.004
 the-0.004
 Frankie-0.004
 her-0.003
Token w
Token ID266
Feature activation-2.93e-01

Pos logit contributions
aj+0.024
arl+0.019
ibr+0.018
irms+0.017
ider+0.017
Neg logit contributions
 it-0.019
 the-0.015
 she-0.015
 her-0.015
 Abby-0.014
Tokenags
Token ID3775
Feature activation-4.95e+00

Pos logit contributions
aj+0.021
ccoli+0.013
ems+0.010
esty+0.010
irms+0.010
Neg logit contributions
 it-0.037
 her-0.032
 them-0.031
 Abby-0.031
 the-0.031
Token his
Token ID465
Feature activation-3.48e-01

Pos logit contributions
aj+0.678
irms+0.544
ses+0.539
ccoli+0.522
aked+0.508
Neg logit contributions
 Frankie-0.234
 it-0.226
 Abby-0.214
 Mill-0.180
 Frank-0.175
Token tail
Token ID7894
Feature activation-9.28e-01

Pos logit contributions
aj+0.040
udd+0.030
ibr+0.030
ccoli+0.030
irms+0.029
Neg logit contributions
 it-0.030
 Frankie-0.025
 the-0.024
 Abby-0.024
 her-0.024
Token and
Token ID290
Feature activation-4.40e-02

Pos logit contributions
aj+0.130
irms+0.102
ses+0.100
ccoli+0.099
udd+0.099
Neg logit contributions
 it-0.056
 Frankie-0.047
 Abby-0.045
 Susan-0.041
 she-0.040
Token l
Token ID300
Feature activation+2.04e-01

Pos logit contributions
aj+0.005
irms+0.004
arl+0.004
ibr+0.004
udd+0.004
Neg logit contributions
 it-0.004
 she-0.003
 Frankie-0.003
 the-0.003
 Abby-0.003
Tokenicks
Token ID3378
Feature activation-9.44e-01

Pos logit contributions
 it+0.023
 her+0.020
 she+0.020
 Abby+0.020
 the+0.019
Neg logit contributions
aj-0.019
ccoli-0.012
ems-0.011
udd-0.011
irms-0.010
TokenThe
Token ID464
Feature activation+1.14e+00

Pos logit contributions
aj+0.034
ibr+0.027
irms+0.026
udd+0.026
ccoli+0.026
Neg logit contributions
 it-0.016
 Frankie-0.013
 Abby-0.013
 happen-0.012
 ringing-0.011
Token boy
Token ID2933
Feature activation+2.01e-01

Pos logit contributions
 it+0.083
 Frankie+0.069
 Abby+0.067
 she+0.065
 Susan+0.062
Neg logit contributions
aj-0.148
irms-0.113
ccoli-0.110
ses-0.110
udd-0.109
Token replied
Token ID8712
Feature activation-2.89e-01

Pos logit contributions
 it+0.014
 Frankie+0.011
 Abby+0.011
 she+0.011
 the+0.010
Neg logit contributions
aj-0.027
irms-0.021
arl-0.020
ses-0.020
ccoli-0.020
Token,
Token ID11
Feature activation+3.22e-01

Pos logit contributions
aj+0.036
ccoli+0.028
ibr+0.027
irms+0.027
ses+0.027
Neg logit contributions
 it-0.017
 Frankie-0.017
 Abby-0.015
 Susan-0.014
 Martha-0.014
Token "
Token ID366
Feature activation+2.60e-01

Pos logit contributions
 it+0.021
 Frankie+0.020
 Abby+0.018
 happen+0.018
 Susan+0.018
Neg logit contributions
aj-0.039
ccoli-0.030
ses-0.029
arl-0.028
sed-0.028
TokenYes
Token ID5297
Feature activation+2.31e+00

Pos logit contributions
 it+0.018
 Frankie+0.015
 Abby+0.015
 Susan+0.014
 she+0.013
Neg logit contributions
aj-0.034
irms-0.026
ccoli-0.025
ses-0.025
udd-0.025
Token!
Token ID0
Feature activation-3.53e-01

Pos logit contributions
 it+0.134
 Frankie+0.127
 Abby+0.117
 Susan+0.111
Susan+0.108
Neg logit contributions
aj-0.297
arl-0.227
ccoli-0.224
udd-0.218
ems-0.216
Token Let
Token ID3914
Feature activation-6.07e-01

Pos logit contributions
aj+0.050
udd+0.040
irms+0.039
obb+0.038
ses+0.037
Neg logit contributions
 it-0.023
 Frankie-0.017
 she-0.017
 Abby-0.016
 the-0.015
Token's
Token ID338
Feature activation-6.03e-01

Pos logit contributions
aj+0.058
udd+0.045
irms+0.042
ems+0.042
 cares+0.040
Neg logit contributions
 it-0.049
 Abby-0.048
 Frankie-0.048
 Lib-0.046
 ringing-0.044
Token play
Token ID711
Feature activation+9.97e-01

Pos logit contributions
aj+0.079
irms+0.062
ccoli+0.060
ses+0.059
udd+0.059
Neg logit contributions
 it-0.041
 Frankie-0.034
 Abby-0.034
 she-0.032
 Susan-0.031
Token together
Token ID1978
Feature activation-4.79e-01

Pos logit contributions
 it+0.075
 the+0.055
 one+0.044
 her+0.043
 she+0.042
Neg logit contributions
aj-0.145
ses-0.113
arl-0.112
ccoli-0.109
irms-0.107
Token open
Token ID1280
Feature activation-9.70e-02

Pos logit contributions
 it+0.017
 Frankie+0.016
 Abby+0.016
 Susan+0.014
 Martha+0.014
Neg logit contributions
aj-0.035
irms-0.027
ccoli-0.027
ses-0.027
udd-0.026
Token up
Token ID510
Feature activation-2.13e-01

Pos logit contributions
aj+0.013
irms+0.009
udd+0.009
ccoli+0.009
ses+0.009
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.006
 she-0.006
 Susan-0.005
Token the
Token ID262
Feature activation-2.87e-02

Pos logit contributions
aj+0.032
irms+0.026
ccoli+0.025
ses+0.025
udd+0.025
Neg logit contributions
 it-0.011
 Frankie-0.008
 Abby-0.008
 she-0.008
 the-0.007
Token luggage
Token ID31488
Feature activation+3.95e-01

Pos logit contributions
aj+0.003
irms+0.002
ccoli+0.002
ses+0.002
udd+0.002
Neg logit contributions
 it-0.003
 Frankie-0.002
 Abby-0.002
 she-0.002
 the-0.002
Token and
Token ID290
Feature activation-2.95e-01

Pos logit contributions
 it+0.024
 Frankie+0.023
 Abby+0.022
 Susan+0.019
 Samantha+0.019
Neg logit contributions
aj-0.052
udd-0.040
ows-0.040
ses-0.039
irms-0.039
Token spoil
Token ID20851
Feature activation+2.49e+00

Pos logit contributions
aj+0.043
irms+0.034
ccoli+0.033
ses+0.033
udd+0.033
Neg logit contributions
 it-0.018
 Frankie-0.014
 Abby-0.013
 she-0.013
 Susan-0.012
Token himself
Token ID2241
Feature activation-6.17e-01

Pos logit contributions
 it+0.102
 Frankie+0.096
 Abby+0.087
 Susan+0.082
 she+0.082
Neg logit contributions
aj-0.367
irms-0.297
ccoli-0.285
udd-0.283
ibr-0.279
Token with
Token ID351
Feature activation+4.43e-01

Pos logit contributions
aj+0.086
udd+0.065
ccoli+0.064
irms+0.063
ows+0.063
Neg logit contributions
 it-0.037
 Frankie-0.033
 Abby-0.031
 Susan-0.030
 Frank-0.029
Token candy
Token ID18550
Feature activation+3.22e-02

Pos logit contributions
 it+0.018
 Frankie+0.017
 Abby+0.017
 Susan+0.015
 Lib+0.015
Neg logit contributions
aj-0.065
irms-0.052
ccoli-0.049
ses-0.048
udd-0.048
Token.
Token ID13
Feature activation-5.41e-02

Pos logit contributions
 it+0.002
 Frankie+0.001
 Abby+0.001
 Susan+0.001
 she+0.001
Neg logit contributions
aj-0.005
irms-0.004
ccoli-0.004
udd-0.004
ses-0.004
Token One
Token ID1881
Feature activation+1.07e-01

Pos logit contributions
aj+0.008
irms+0.006
ses+0.006
ccoli+0.006
arl+0.006
Neg logit contributions
 it-0.003
 Frankie-0.002
 Abby-0.002
 she-0.002
 Susan-0.002
Token.
Token ID13
Feature activation+2.78e-02

Pos logit contributions
aj+0.180
irms+0.139
ses+0.139
udd+0.138
ccoli+0.135
Neg logit contributions
 it-0.080
 Abby-0.072
 Frankie-0.072
 Susan-0.063
 Martha-0.062
Token The
Token ID383
Feature activation-1.50e-01

Pos logit contributions
 it+0.002
 Frankie+0.002
 Abby+0.002
 she+0.001
 Susan+0.001
Neg logit contributions
aj-0.004
irms-0.003
ccoli-0.003
ses-0.003
udd-0.003
Token bird
Token ID6512
Feature activation+1.39e-01

Pos logit contributions
aj+0.021
ses+0.017
irms+0.016
udd+0.016
arl+0.016
Neg logit contributions
 it-0.008
 Frankie-0.007
 Abby-0.007
 Susan-0.006
 she-0.006
Token said
Token ID531
Feature activation-2.55e-02

Pos logit contributions
 it+0.009
 Abby+0.008
 Frankie+0.008
 Lib+0.008
 Samantha+0.007
Neg logit contributions
aj-0.018
udd-0.013
ccoli-0.013
irms-0.013
ppy-0.013
Token "
Token ID366
Feature activation+2.06e-01

Pos logit contributions
aj+0.003
ccoli+0.003
ses+0.003
ibr+0.003
irms+0.003
Neg logit contributions
 it-0.001
 Frankie-0.001
 Abby-0.001
 Susan-0.001
 Martha-0.001
Tokent
Token ID83
Feature activation+2.42e+00

Pos logit contributions
 it+0.017
 Frankie+0.015
 Abby+0.014
 Susan+0.014
 Frank+0.013
Neg logit contributions
aj-0.022
irms-0.017
ems-0.017
udd-0.017
aked-0.016
Tokenweet
Token ID7277
Feature activation+2.33e+00

Pos logit contributions
 Abby+0.165
 Frankie+0.165
 it+0.159
 Lib+0.159
 Martha+0.148
Neg logit contributions
aj-0.280
udd-0.206
ccoli-0.204
ems-0.201
irms-0.201
Token tweet
Token ID6126
Feature activation-1.17e-01

Pos logit contributions
 it+0.174
 Abby+0.158
 Frankie+0.158
 Lib+0.147
 Susan+0.144
Neg logit contributions
aj-0.271
ccoli-0.209
arl-0.208
ems-0.207
ses-0.206
Token"
Token ID1
Feature activation-6.47e-01

Pos logit contributions
aj+0.016
irms+0.013
udd+0.013
ccoli+0.013
arl+0.012
Neg logit contributions
 it-0.007
 Frankie-0.006
 Abby-0.006
 she-0.005
 Susan-0.005
Token and
Token ID290
Feature activation-2.23e-01

Pos logit contributions
aj+0.090
irms+0.070
ccoli+0.069
ses+0.069
udd+0.068
Neg logit contributions
 it-0.040
 Frankie-0.032
 Abby-0.031
 she-0.029
 Susan-0.028
Token flew
Token ID13112
Feature activation-3.25e-01

Pos logit contributions
aj+0.030
ccoli+0.024
ses+0.024
irms+0.023
ems+0.023
Neg logit contributions
 it-0.012
 Frankie-0.011
 Abby-0.011
 Susan-0.010
 Frank-0.009
Token
Token ID198
Feature activation-2.81e-01

Pos logit contributions
aj+0.036
udd+0.026
ibr+0.026
arl+0.026
irms+0.025
Neg logit contributions
 it-0.022
 Frankie-0.019
 Abby-0.018
 happen-0.018
 ringing-0.017
TokenWhen
Token ID2215
Feature activation+9.31e-01

Pos logit contributions
aj+0.033
ibr+0.023
 Deal+0.023
aked+0.023
udd+0.022
Neg logit contributions
 it-0.023
 Frankie-0.020
 Abby-0.019
 Mill-0.019
 Frank-0.018
Token they
Token ID484
Feature activation-4.41e-01

Pos logit contributions
 Frankie+0.034
tle+0.032
 ringing+0.031
Mill+0.031
 it+0.030
Neg logit contributions
aj-0.115
 cares-0.091
ccoli-0.089
 Deal-0.088
 seeker-0.087
Token reached
Token ID4251
Feature activation+3.57e-01

Pos logit contributions
aj+0.059
irms+0.045
ccoli+0.044
udd+0.044
ibr+0.044
Neg logit contributions
 it-0.029
 Frankie-0.024
 Abby-0.023
 she-0.022
 Susan-0.021
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
 it+0.011
 Frankie+0.009
 Abby+0.008
 Susan+0.007
 Frank+0.007
Neg logit contributions
aj-0.058
irms-0.046
ses-0.045
ccoli-0.045
udd-0.045
Token checkout
Token ID28006
Feature activation+2.48e+00

Pos logit contributions
aj+0.015
irms+0.011
ccoli+0.011
ses+0.011
udd+0.011
Neg logit contributions
 it-0.009
 Frankie-0.007
 Abby-0.007
 Susan-0.007
 she-0.007
Token,
Token ID11
Feature activation+2.45e-01

Pos logit contributions
 Frankie+0.181
 Susan+0.164
 it+0.163
 Mill+0.162
 Abby+0.160
Neg logit contributions
aj-0.302
ows-0.219
ider-0.212
udd-0.211
irms-0.205
Token Alice
Token ID14862
Feature activation+2.23e+00

Pos logit contributions
 it+0.010
 Frankie+0.010
 happen+0.009
 ringing+0.009
 Abby+0.009
Neg logit contributions
aj-0.032
irms-0.024
ccoli-0.024
 cares-0.024
ems-0.024
Token's
Token ID338
Feature activation-6.76e-01

Pos logit contributions
 it+0.157
 Frankie+0.127
 Abby+0.123
 she+0.119
 Susan+0.114
Neg logit contributions
aj-0.297
irms-0.228
ccoli-0.223
ses-0.222
udd-0.220
Token mom
Token ID1995
Feature activation-7.42e-01

Pos logit contributions
aj+0.093
irms+0.072
ccoli+0.071
ses+0.070
udd+0.070
Neg logit contributions
 it-0.045
 Frankie-0.035
 Abby-0.034
 she-0.033
 Susan-0.031
Token picked
Token ID6497
Feature activation-5.29e-01

Pos logit contributions
aj+0.086
udd+0.062
ccoli+0.061
irms+0.059
ibr+0.059
Neg logit contributions
 it-0.061
 Frankie-0.053
 Abby-0.052
 happen-0.048
 Frank-0.048
Token a
Token ID257
Feature activation+2.06e-01

Pos logit contributions
aj+0.036
ccoli+0.029
irms+0.028
ses+0.028
ibr+0.027
Neg logit contributions
 it-0.014
 Frankie-0.012
 Abby-0.011
 Susan-0.010
 she-0.010
Token little
Token ID1310
Feature activation-3.62e-01

Pos logit contributions
 it+0.013
 Frankie+0.011
 Abby+0.011
 she+0.010
 Susan+0.010
Neg logit contributions
aj-0.028
irms-0.022
ccoli-0.021
ses-0.021
udd-0.021
Token bird
Token ID6512
Feature activation+1.51e-01

Pos logit contributions
aj+0.048
irms+0.037
ccoli+0.036
ses+0.036
udd+0.035
Neg logit contributions
 it-0.026
 Frankie-0.021
 Abby-0.020
 she-0.020
 Susan-0.019
Token who
Token ID508
Feature activation-1.24e+00

Pos logit contributions
 it+0.012
 Frankie+0.010
 Abby+0.010
 she+0.009
 Susan+0.009
Neg logit contributions
aj-0.019
irms-0.014
ccoli-0.014
ses-0.014
udd-0.014
Token didn
Token ID1422
Feature activation-4.14e-01

Pos logit contributions
aj+0.142
ccoli+0.109
irms+0.106
udd+0.104
ses+0.104
Neg logit contributions
 it-0.098
 Frankie-0.087
 Abby-0.086
 Susan-0.080
 she-0.078
Token't
Token ID470
Feature activation+2.40e+00

Pos logit contributions
aj+0.042
obb+0.037
oll+0.036
ibr+0.036
arl+0.035
Neg logit contributions
 Abby-0.026
 Susan-0.025
 Samantha-0.025
 Frankie-0.024
 Martha-0.024
Token save
Token ID3613
Feature activation+2.17e-01

Pos logit contributions
 it+0.203
 Frankie+0.189
 Abby+0.179
 Susan+0.175
 Martha+0.173
Neg logit contributions
aj-0.237
arl-0.186
irms-0.183
ems-0.179
ppy-0.179
Token enough
Token ID1576
Feature activation+8.64e-01

Pos logit contributions
 Frankie+0.007
 it+0.007
 Abby+0.007
 Susan+0.006
 Frank+0.006
Neg logit contributions
aj-0.032
irms-0.026
ccoli-0.025
ses-0.025
ems-0.024
Token food
Token ID2057
Feature activation-3.99e-01

Pos logit contributions
 it+0.061
 Frankie+0.053
 Abby+0.051
 Susan+0.050
 she+0.048
Neg logit contributions
aj-0.110
irms-0.084
ccoli-0.081
arl-0.081
ses-0.080
Token for
Token ID329
Feature activation-5.08e-01

Pos logit contributions
aj+0.060
irms+0.047
ses+0.046
ccoli+0.046
udd+0.046
Neg logit contributions
 it-0.022
 Frankie-0.016
 Abby-0.015
 she-0.015
 the-0.014
Token the
Token ID262
Feature activation-3.56e-02

Pos logit contributions
aj+0.065
irms+0.049
ses+0.048
ccoli+0.048
ibr+0.047
Neg logit contributions
 Frankie-0.026
 it-0.026
 Abby-0.025
 Susan-0.024
 Lib-0.024
Token one
Token ID530
Feature activation+1.68e-01

Pos logit contributions
 it+0.037
 Frankie+0.034
 Abby+0.033
 she+0.030
 Susan+0.030
Neg logit contributions
aj-0.103
irms-0.081
ses-0.081
ccoli-0.081
 imag-0.078
Token."
Token ID526
Feature activation-2.97e-01

Pos logit contributions
 it+0.011
 Frankie+0.009
 Abby+0.008
 she+0.008
 Susan+0.007
Neg logit contributions
aj-0.023
irms-0.018
ccoli-0.018
ses-0.018
udd-0.018
Token
Token ID220
Feature activation+9.03e-01

Pos logit contributions
aj+0.039
irms+0.030
ses+0.030
udd+0.029
aked+0.028
Neg logit contributions
 it-0.018
 Frankie-0.016
 Abby-0.014
 ringing-0.014
 happen-0.014
Token
Token ID198
Feature activation-2.93e-01

Pos logit contributions
 it+0.056
 Frankie+0.045
 Abby+0.043
 she+0.041
 Susan+0.039
Neg logit contributions
aj-0.126
irms-0.098
ccoli-0.096
ses-0.095
udd-0.095
Token
Token ID198
Feature activation-2.83e-01

Pos logit contributions
aj+0.034
udd+0.025
irms+0.024
arl+0.024
aked+0.023
Neg logit contributions
 it-0.022
 Frankie-0.020
 ringing-0.019
 Abby-0.019
 happen-0.018
TokenBut
Token ID1537
Feature activation+6.88e-01

Pos logit contributions
aj+0.026
udd+0.018
irms+0.018
aked+0.018
arl+0.017
Neg logit contributions
 it-0.029
 Frankie-0.026
 Abby-0.025
 ringing-0.024
 happen-0.024
Token Sue
Token ID26089
Feature activation+9.71e-01

Pos logit contributions
 Frankie+0.040
 it+0.039
 ringing+0.034
 happen+0.034
Frank+0.033
Neg logit contributions
aj-0.078
ses-0.064
sed-0.061
irms-0.061
 cares-0.059
Token wanted
Token ID2227
Feature activation+5.20e-01

Pos logit contributions
 it+0.081
 she+0.066
 Frankie+0.063
 the+0.062
 her+0.062
Neg logit contributions
aj-0.121
arl-0.093
irms-0.092
ses-0.089
ccoli-0.087
Token the
Token ID262
Feature activation-3.08e-03

Pos logit contributions
 it+0.030
 Frankie+0.028
 Abby+0.026
 Susan+0.024
 Frank+0.023
Neg logit contributions
aj-0.069
irms-0.053
ccoli-0.053
ses-0.052
udd-0.051
Token knife
Token ID9845
Feature activation-8.11e-01

Pos logit contributions
aj+0.000
irms+0.000
ccoli+0.000
ses+0.000
udd+0.000
Neg logit contributions
 it-0.000
 Frankie-0.000
 Abby-0.000
 she-0.000
 Susan-0.000
Token very
Token ID845
Feature activation+3.13e-01

Pos logit contributions
aj+0.109
irms+0.084
ccoli+0.083
udd+0.082
ses+0.082
Neg logit contributions
 it-0.050
 Frankie-0.044
 Abby-0.041
 Susan-0.037
 Frank-0.037
Token,
Token ID11
Feature activation+3.76e-01

Pos logit contributions
 it+0.101
 Frankie+0.082
 Abby+0.078
 Susan+0.071
 she+0.071
Neg logit contributions
aj-0.268
irms-0.210
ccoli-0.207
udd-0.205
ses-0.204
Token L
Token ID406
Feature activation+2.24e-02

Pos logit contributions
 it+0.024
 she+0.018
 Frankie+0.018
 Abby+0.018
 the+0.017
Neg logit contributions
aj-0.054
irms-0.043
ccoli-0.041
ses-0.041
udd-0.041
Tokenila
Token ID10102
Feature activation-5.58e-01

Pos logit contributions
 it+0.002
 she+0.002
 Abby+0.002
 Frankie+0.002
 the+0.002
Neg logit contributions
aj-0.002
irms-0.001
ccoli-0.001
ses-0.001
udd-0.001
Token,
Token ID11
Feature activation+3.85e-01

Pos logit contributions
aj+0.087
irms+0.070
ccoli+0.068
ses+0.068
udd+0.067
Neg logit contributions
 it-0.027
 Frankie-0.019
 Abby-0.018
 she-0.018
 Susan-0.016
Token you
Token ID345
Feature activation-6.54e-01

Pos logit contributions
 it+0.022
 Frankie+0.019
 Abby+0.018
 Susan+0.016
 she+0.016
Neg logit contributions
aj-0.053
irms-0.042
ccoli-0.041
ses-0.041
udd-0.040
Token are
Token ID389
Feature activation+1.35e-01

Pos logit contributions
aj+0.094
irms+0.073
arl+0.070
ccoli+0.069
ses+0.069
Neg logit contributions
 it-0.049
 the-0.036
 she-0.033
't-0.032
 one-0.031
Token so
Token ID523
Feature activation+2.48e-01

Pos logit contributions
 it+0.007
 Frankie+0.005
 Abby+0.005
 she+0.005
 Susan+0.005
Neg logit contributions
aj-0.020
irms-0.016
ccoli-0.016
ses-0.016
udd-0.016
Token brave
Token ID14802
Feature activation-1.13e+00

Pos logit contributions
 it+0.019
 Frankie+0.017
 Abby+0.016
 Susan+0.015
 Frank+0.015
Neg logit contributions
aj-0.030
udd-0.022
irms-0.022
ibr-0.022
arl-0.021
Token!"
Token ID2474
Feature activation+4.09e-02

Pos logit contributions
aj+0.152
irms+0.119
ccoli+0.114
ses+0.114
udd+0.114
Neg logit contributions
 it-0.071
 Frankie-0.061
 Abby-0.058
 she-0.053
 Susan-0.053
Token Ben
Token ID3932
Feature activation-2.91e-01

Pos logit contributions
 it+0.005
 Frankie+0.004
 Abby+0.004
 Susan+0.004
 she+0.004
Neg logit contributions
aj-0.004
irms-0.002
ses-0.002
ccoli-0.002
udd-0.002
Token said
Token ID531
Feature activation-3.10e-02

Pos logit contributions
aj+0.035
irms+0.026
ccoli+0.026
ses+0.026
udd+0.025
Neg logit contributions
 it-0.023
 Frankie-0.020
 Abby-0.019
 she-0.018
 Susan-0.018
Token She
Token ID1375
Feature activation+1.71e-01

Pos logit contributions
aj+0.005
irms+0.004
ccoli+0.004
ses+0.004
udd+0.004
Neg logit contributions
 it-0.002
 Frankie-0.002
 she-0.002
 Abby-0.001
 the-0.001
Token ordered
Token ID6149
Feature activation+1.61e+00

Pos logit contributions
 it+0.011
 Frankie+0.009
 Abby+0.009
 she+0.008
 Susan+0.008
Neg logit contributions
aj-0.023
irms-0.017
ccoli-0.017
ses-0.017
udd-0.017
Token them
Token ID606
Feature activation+8.70e-01

Pos logit contributions
 Frankie+0.089
 Susan+0.079
 Frank+0.079
 Abby+0.077
 it+0.077
Neg logit contributions
aj-0.218
ccoli-0.164
irms-0.162
ibr-0.162
ows-0.161
Token to
Token ID284
Feature activation+1.24e-01

Pos logit contributions
 it+0.061
 Frankie+0.053
 Abby+0.051
 Susan+0.047
 Frank+0.046
Neg logit contributions
aj-0.113
ccoli-0.085
irms-0.083
udd-0.083
ibr-0.082
Token make
Token ID787
Feature activation+9.13e-01

Pos logit contributions
 it+0.008
 Frankie+0.008
 Abby+0.007
 Susan+0.007
 Lib+0.007
Neg logit contributions
aj-0.016
irms-0.011
ccoli-0.011
arl-0.011
ses-0.011
Token a
Token ID257
Feature activation+2.01e-01

Pos logit contributions
 it+0.045
 Frankie+0.044
 Abby+0.042
 Susan+0.039
 Frank+0.039
Neg logit contributions
aj-0.131
irms-0.101
udd-0.101
ibr-0.099
ccoli-0.099
Token pile
Token ID14540
Feature activation-4.66e-01

Pos logit contributions
 it+0.015
 Frankie+0.012
 she+0.012
 Abby+0.011
 the+0.011
Neg logit contributions
aj-0.026
irms-0.020
ses-0.019
ccoli-0.019
arl-0.019
Token on
Token ID319
Feature activation+1.09e+00

Pos logit contributions
aj+0.061
ows+0.049
udd+0.047
oll+0.046
irms+0.046
Neg logit contributions
 Frankie-0.026
 it-0.025
 Abby-0.024
 Frank-0.023
 Susan-0.022
Token her
Token ID607
Feature activation+1.49e-01

Pos logit contributions
 Frankie+0.044
 Susan+0.042
 Abby+0.041
Susan+0.041
 Frank+0.041
Neg logit contributions
aj-0.143
ccoli-0.115
irms-0.113
ses-0.112
ws-0.111
Token bed
Token ID3996
Feature activation-1.61e-01

Pos logit contributions
 it+0.011
 she+0.009
 her+0.009
 the+0.008
 Frankie+0.008
Neg logit contributions
aj-0.019
irms-0.015
arl-0.015
ccoli-0.015
ems-0.015
Token.
Token ID13
Feature activation-2.46e-02

Pos logit contributions
aj+0.024
irms+0.019
ccoli+0.019
ses+0.019
udd+0.019
Neg logit contributions
 it-0.009
 Frankie-0.007
 Abby-0.006
 she-0.006
 Susan-0.006
Token.
Token ID13
Feature activation+3.32e-01

Pos logit contributions
 it+0.068
 Frankie+0.058
 Abby+0.055
 Susan+0.049
 she+0.048
Neg logit contributions
aj-0.177
irms-0.142
ses-0.138
udd-0.136
ccoli-0.136
Token
Token ID198
Feature activation-4.49e-03

Pos logit contributions
 it+0.021
 Frankie+0.018
 Abby+0.017
 Susan+0.015
 she+0.015
Neg logit contributions
aj-0.044
irms-0.035
ses-0.035
arl-0.034
ccoli-0.033
Token
Token ID198
Feature activation+3.29e-03

Pos logit contributions
aj+0.001
irms+0.000
ses+0.000
arl+0.000
udd+0.000
Neg logit contributions
 it-0.000
 Frankie-0.000
 Abby-0.000
 she-0.000
 Susan-0.000
TokenShe
Token ID3347
Feature activation+7.07e-01

Pos logit contributions
 it+0.000
 Frankie+0.000
 Abby+0.000
 Susan+0.000
 she+0.000
Neg logit contributions
aj-0.000
irms-0.000
arl-0.000
udd-0.000
ses-0.000
Token and
Token ID290
Feature activation+3.61e-02

Pos logit contributions
 it+0.066
 the+0.053
 she+0.051
 her+0.050
't+0.045
Neg logit contributions
aj-0.084
ccoli-0.063
arl-0.062
ibr-0.059
irms-0.058
Token Tom
Token ID4186
Feature activation+6.24e-01

Pos logit contributions
 it+0.003
 the+0.003
 she+0.003
 her+0.003
 Frankie+0.002
Neg logit contributions
aj-0.004
ccoli-0.003
udd-0.003
irms-0.003
ibr-0.003
Token chase
Token ID15505
Feature activation+1.04e+00

Pos logit contributions
 it+0.039
 Frankie+0.028
 she+0.027
 the+0.027
 Abby+0.027
Neg logit contributions
aj-0.091
ccoli-0.070
irms-0.069
ses-0.068
ibr-0.068
Token after
Token ID706
Feature activation+5.24e-01

Pos logit contributions
 it+0.045
 Frankie+0.036
 Abby+0.033
 she+0.030
 Susan+0.029
Neg logit contributions
aj-0.160
irms-0.133
ses-0.127
udd-0.126
ccoli-0.125
Token Spot
Token ID15899
Feature activation+1.82e-01

Pos logit contributions
 Frankie+0.038
 Abby+0.038
 Mill+0.037
Mill+0.035
 Susan+0.035
Neg logit contributions
aj-0.048
irms-0.043
ses-0.043
arl-0.038
sed-0.038
Token.
Token ID13
Feature activation+3.60e-01

Pos logit contributions
 it+0.011
 Frankie+0.009
 Abby+0.009
 Susan+0.008
 she+0.008
Neg logit contributions
aj-0.025
irms-0.020
ses-0.020
ccoli-0.019
udd-0.019
Token They
Token ID1119
Feature activation+1.06e-01

Pos logit contributions
 it+0.024
 the+0.018
 she+0.018
 Frankie+0.017
 Abby+0.017
Neg logit contributions
aj-0.052
ccoli-0.040
udd-0.039
ibr-0.039
irms-0.039
Token who
Token ID508
Feature activation-4.61e-01

Pos logit contributions
aj+0.268
irms+0.207
udd+0.204
ccoli+0.203
ses+0.201
Neg logit contributions
 it-0.102
 Frankie-0.083
 Abby-0.079
 Susan-0.071
 she-0.070
Token is
Token ID318
Feature activation-8.64e-01

Pos logit contributions
aj+0.058
irms+0.044
ccoli+0.043
ses+0.043
udd+0.042
Neg logit contributions
 it-0.034
 Frankie-0.029
 Abby-0.028
 she-0.026
 Susan-0.026
Token the
Token ID262
Feature activation-4.86e-02

Pos logit contributions
aj+0.117
irms+0.091
ccoli+0.089
ses+0.089
udd+0.087
Neg logit contributions
 it-0.057
 Frankie-0.046
 Abby-0.045
 she-0.043
 Susan-0.041
Token prett
Token ID46442
Feature activation-2.54e+00

Pos logit contributions
aj+0.006
irms+0.005
ccoli+0.005
udd+0.004
ses+0.004
Neg logit contributions
 it-0.004
 Frankie-0.003
 Abby-0.003
 she-0.003
 the-0.003
Tokeniest
Token ID6386
Feature activation+2.71e+00

Pos logit contributions
aj+0.366
udd+0.285
irms+0.285
ccoli+0.284
arl+0.282
Neg logit contributions
 it-0.137
 Frankie-0.116
 Abby-0.105
 Susan-0.101
 Frank-0.095
Token girl
Token ID2576
Feature activation+4.21e-01

Pos logit contributions
 it+0.163
 Frankie+0.155
 Susan+0.141
 Abby+0.134
 Frank+0.134
Neg logit contributions
aj-0.346
ses-0.257
ccoli-0.255
udd-0.255
 cares-0.254
Token?"
Token ID1701
Feature activation-1.05e+00

Pos logit contributions
 it+0.024
 Frankie+0.022
 Abby+0.020
 Susan+0.019
 Frank+0.018
Neg logit contributions
aj-0.056
ccoli-0.044
ses-0.043
irms-0.042
 cares-0.042
Token The
Token ID383
Feature activation-1.37e-01

Pos logit contributions
aj+0.136
irms+0.105
ccoli+0.102
ses+0.099
ibr+0.099
Neg logit contributions
 it-0.065
 Frankie-0.056
 Abby-0.051
 happen-0.051
 ringing-0.050
Token mirror
Token ID10162
Feature activation+5.82e-01

Pos logit contributions
aj+0.019
irms+0.014
ccoli+0.014
ses+0.014
udd+0.014
Neg logit contributions
 it-0.009
 Frankie-0.007
 Abby-0.007
 she-0.007
 Susan-0.007
Token showed
Token ID3751
Feature activation-8.09e-01

Pos logit contributions
 it+0.044
 Frankie+0.036
 Abby+0.035
 she+0.035
 Susan+0.033
Neg logit contributions
aj-0.074
irms-0.057
ccoli-0.055
ses-0.055
udd-0.054
Token a
Token ID257
Feature activation-1.73e-02

Pos logit contributions
aj+0.124
irms+0.102
ibr+0.099
udd+0.099
ccoli+0.099
Neg logit contributions
 it-0.028
 Frankie-0.024
 Abby-0.020
 she-0.017
 ringing-0.016
Token cream
Token ID8566
Feature activation-1.16e-01

Pos logit contributions
aj+0.019
irms+0.015
ses+0.014
ibr+0.014
ccoli+0.014
Neg logit contributions
 it-0.011
 she-0.009
 Abby-0.009
 Frankie-0.009
 the-0.008
Token.
Token ID13
Feature activation+2.19e-01

Pos logit contributions
aj+0.017
irms+0.014
ccoli+0.013
ses+0.013
udd+0.013
Neg logit contributions
 it-0.006
 Frankie-0.005
 Abby-0.005
 she-0.004
 Susan-0.004
Token
Token ID198
Feature activation-1.44e-01

Pos logit contributions
 it+0.014
 Frankie+0.011
 she+0.011
 Abby+0.010
 Susan+0.010
Neg logit contributions
aj-0.032
udd-0.025
irms-0.025
ccoli-0.024
ibr-0.024
Token
Token ID198
Feature activation-1.31e-01

Pos logit contributions
aj+0.019
irms+0.015
udd+0.014
ccoli+0.014
arl+0.014
Neg logit contributions
 it-0.009
 Frankie-0.008
 Abby-0.008
 she-0.007
 Susan-0.007
TokenShe
Token ID3347
Feature activation+5.39e-01

Pos logit contributions
aj+0.014
irms+0.010
arl+0.010
udd+0.010
ccoli+0.009
Neg logit contributions
 it-0.012
 Frankie-0.011
 Abby-0.011
 Mill-0.010
 ringing-0.010
Token picked
Token ID6497
Feature activation-5.10e-01

Pos logit contributions
 it+0.037
 she+0.028
 the+0.027
 Abby+0.027
 Frankie+0.026
Neg logit contributions
aj-0.075
arl-0.060
irms-0.060
ses-0.057
 Deal-0.056
Token up
Token ID510
Feature activation-2.02e-01

Pos logit contributions
aj+0.056
irms+0.052
ses+0.046
ccoli+0.046
ws+0.045
Neg logit contributions
 Frankie-0.028
 ringing-0.027
Mill-0.027
 happen-0.027
 Susan-0.026
Token a
Token ID257
Feature activation+2.20e-01

Pos logit contributions
aj+0.029
irms+0.024
ccoli+0.023
ses+0.022
udd+0.022
Neg logit contributions
 it-0.008
 Frankie-0.008
 happen-0.007
 ringing-0.007
 Abby-0.007
Token piece
Token ID3704
Feature activation+3.27e-01

Pos logit contributions
 it+0.018
 Frankie+0.016
 Abby+0.016
 Susan+0.015
 Frank+0.015
Neg logit contributions
aj-0.026
ccoli-0.019
irms-0.018
arl-0.018
ses-0.017
Token of
Token ID286
Feature activation+9.61e-01

Pos logit contributions
 it+0.014
 Frankie+0.014
 Abby+0.012
 Susan+0.011
 Martha+0.011
Neg logit contributions
aj-0.047
irms-0.038
ses-0.038
oll-0.038
arl-0.038
Token broccoli
Token ID44653
Feature activation+5.86e-01

Pos logit contributions
 Frankie+0.076
 Abby+0.074
 it+0.074
 Martha+0.073
 Karen+0.073
Neg logit contributions
aj-0.094
irms-0.074
ccoli-0.069
ses-0.065
ags-0.064
Token.
Token ID13
Feature activation+7.90e-03

Pos logit contributions
aj+0.036
irms+0.029
ccoli+0.028
udd+0.028
ses+0.028
Neg logit contributions
 it-0.016
 Abby-0.014
 Frankie-0.014
 Susan-0.012
 she-0.012
Token Tim
Token ID5045
Feature activation+4.10e-01

Pos logit contributions
 it+0.000
 Frankie+0.000
 Abby+0.000
 she+0.000
 Susan+0.000
Neg logit contributions
aj-0.001
irms-0.001
ccoli-0.001
ses-0.001
udd-0.001
Tokenmy
Token ID1820
Feature activation-2.67e-01

Pos logit contributions
 it+0.029
 Frankie+0.023
 Abby+0.022
 the+0.022
 she+0.022
Neg logit contributions
aj-0.055
irms-0.042
ses-0.042
arl-0.041
ibr-0.041
Token loved
Token ID6151
Feature activation+9.07e-02

Pos logit contributions
aj+0.033
irms+0.025
ccoli+0.025
ses+0.024
udd+0.024
Neg logit contributions
 it-0.021
 Frankie-0.017
 Abby-0.017
 she-0.016
 Susan-0.016
Token to
Token ID284
Feature activation+1.07e-01

Pos logit contributions
 it+0.006
 Abby+0.005
 Frankie+0.005
 Susan+0.005
 she+0.004
Neg logit contributions
aj-0.012
irms-0.009
ccoli-0.009
udd-0.009
ses-0.009
Token brush
Token ID14093
Feature activation-6.19e-01

Pos logit contributions
 it+0.009
 Frankie+0.008
 Abby+0.008
 Lib+0.007
 Susan+0.007
Neg logit contributions
aj-0.012
ccoli-0.009
irms-0.009
ses-0.008
 imag-0.008
Token his
Token ID465
Feature activation-4.29e-01

Pos logit contributions
aj+0.089
udd+0.069
irms+0.068
ccoli+0.067
ows+0.064
Neg logit contributions
 Abby-0.030
 Frankie-0.028
 Susan-0.027
 Frank-0.025
Susan-0.025
Token teeth
Token ID9941
Feature activation-3.42e-01

Pos logit contributions
aj+0.047
irms+0.036
arl+0.035
ccoli+0.034
udd+0.034
Neg logit contributions
 it-0.039
 Frankie-0.033
 her-0.032
 she-0.032
 the-0.031
Token every
Token ID790
Feature activation+2.90e-01

Pos logit contributions
aj+0.040
ows+0.031
udd+0.031
oll+0.030
obb+0.029
Neg logit contributions
 Frankie-0.024
 Abby-0.023
 Frank-0.023
 Susan-0.023
 Martha-0.022
Token day
Token ID1110
Feature activation+1.27e-01

Pos logit contributions
 it+0.018
 Frankie+0.015
 Abby+0.015
 Susan+0.014
 Frank+0.013
Neg logit contributions
aj-0.039
irms-0.030
ccoli-0.030
ses-0.030
arl-0.029
Token before
Token ID878
Feature activation+1.12e+00

Pos logit contributions
 Frankie+0.008
 Abby+0.008
 Susan+0.008
 it+0.008
Susan+0.007
Neg logit contributions
aj-0.014
oll-0.011
ccoli-0.010
udd-0.010
ets-0.010
Token watch
Token ID2342
Feature activation-2.04e-01

Pos logit contributions
 it+0.001
 Frankie+0.001
 Abby+0.001
 she+0.001
 Susan+0.001
Neg logit contributions
aj-0.002
irms-0.002
ccoli-0.001
ses-0.001
arl-0.001
Token!
Token ID0
Feature activation-5.73e-01

Pos logit contributions
aj+0.031
irms+0.025
ccoli+0.025
ses+0.024
ibr+0.024
Neg logit contributions
 it-0.008
 Frankie-0.007
 Abby-0.006
 she-0.006
 Susan-0.006
Token
Token ID198
Feature activation-3.19e-01

Pos logit contributions
aj+0.078
irms+0.060
ccoli+0.059
ses+0.059
udd+0.058
Neg logit contributions
 it-0.039
 Frankie-0.031
 Abby-0.030
 she-0.029
 Susan-0.028
Token
Token ID198
Feature activation-2.75e-01

Pos logit contributions
aj+0.042
irms+0.032
ccoli+0.031
udd+0.031
ibr+0.030
Neg logit contributions
 it-0.022
 Frankie-0.018
 Abby-0.018
 she-0.017
 Susan-0.016
TokenThe
Token ID464
Feature activation+1.05e+00

Pos logit contributions
aj+0.037
irms+0.028
ibr+0.028
ccoli+0.028
udd+0.027
Neg logit contributions
 it-0.018
 Frankie-0.015
 Abby-0.014
 she-0.013
 ringing-0.013
Token brave
Token ID14802
Feature activation-1.33e+00

Pos logit contributions
 it+0.069
 Frankie+0.054
 Abby+0.052
 she+0.051
 the+0.048
Neg logit contributions
aj-0.146
irms-0.114
ccoli-0.111
ses-0.111
udd-0.110
Token bear
Token ID6842
Feature activation-5.93e-01

Pos logit contributions
aj+0.173
irms+0.133
ccoli+0.130
ses+0.129
udd+0.127
Neg logit contributions
 it-0.093
 Frankie-0.076
 Abby-0.074
 she-0.071
 Susan-0.069
Token then
Token ID788
Feature activation-7.84e-01

Pos logit contributions
aj+0.081
irms+0.062
ccoli+0.061
ses+0.061
udd+0.060
Neg logit contributions
 it-0.039
 Frankie-0.032
 Abby-0.030
 she-0.029
 Susan-0.028
Token decided
Token ID3066
Feature activation+6.14e-01

Pos logit contributions
aj+0.101
irms+0.077
ccoli+0.075
udd+0.074
ses+0.074
Neg logit contributions
 it-0.056
 Frankie-0.047
 Abby-0.045
 she-0.043
 Susan-0.042
Token to
Token ID284
Feature activation+2.35e-02

Pos logit contributions
 Frankie+0.035
 it+0.034
 Abby+0.033
 Susan+0.030
 Mill+0.029
Neg logit contributions
aj-0.077
ccoli-0.057
ems-0.056
udd-0.056
irms-0.055
Token jump
Token ID4391
Feature activation-2.29e-01

Pos logit contributions
 it+0.002
 Frankie+0.001
 Abby+0.001
 she+0.001
 Lib+0.001
Neg logit contributions
aj-0.003
irms-0.002
ccoli-0.002
ibr-0.002
 cares-0.002
Token They
Token ID1119
Feature activation+9.71e-03

Pos logit contributions
 it+0.022
 the+0.014
 she+0.013
 Abby+0.013
 her+0.012
Neg logit contributions
aj-0.063
oll-0.050
udd-0.049
ccoli-0.049
irms-0.049
Token wished
Token ID16555
Feature activation-1.69e-02

Pos logit contributions
 it+0.001
 she+0.001
 Frankie+0.001
 Abby+0.001
 the+0.001
Neg logit contributions
aj-0.001
irms-0.001
ccoli-0.001
arl-0.001
udd-0.001
Token they
Token ID484
Feature activation-1.16e-01

Pos logit contributions
aj+0.003
irms+0.002
ses+0.002
ccoli+0.002
udd+0.002
Neg logit contributions
 it-0.001
 Frankie-0.000
 Abby-0.000
 Susan-0.000
 she-0.000
Token had
Token ID550
Feature activation-4.11e-01

Pos logit contributions
aj+0.016
irms+0.012
ccoli+0.012
ses+0.012
udd+0.012
Neg logit contributions
 it-0.008
 Frankie-0.006
 Abby-0.006
 she-0.006
 Susan-0.005
Token not
Token ID407
Feature activation-4.30e-01

Pos logit contributions
aj+0.056
irms+0.044
arl+0.042
udd+0.042
ccoli+0.042
Neg logit contributions
 it-0.029
 Frankie-0.022
 the-0.022
 Abby-0.022
 she-0.021
Token locked
Token ID8970
Feature activation-2.08e+00

Pos logit contributions
aj+0.054
irms+0.041
ccoli+0.040
arl+0.039
ses+0.039
Neg logit contributions
 it-0.035
 Frankie-0.028
 she-0.027
 the-0.027
 Abby-0.027
Token the
Token ID262
Feature activation+3.64e-02

Pos logit contributions
aj+0.343
irms+0.284
ses+0.277
ccoli+0.273
arl+0.267
Neg logit contributions
 it-0.053
 Frankie-0.042
 Abby-0.037
 she-0.036
 Susan-0.027
Token door
Token ID3420
Feature activation-8.41e-02

Pos logit contributions
 it+0.003
 Frankie+0.002
 she+0.002
 Abby+0.002
 the+0.002
Neg logit contributions
aj-0.005
irms-0.004
ccoli-0.004
udd-0.004
ibr-0.004
Token.
Token ID13
Feature activation+3.76e-01

Pos logit contributions
aj+0.012
irms+0.009
ccoli+0.009
udd+0.009
arl+0.009
Neg logit contributions
 it-0.005
 Frankie-0.004
 Abby-0.004
 Frank-0.003
 Susan-0.003
Token
Token ID198
Feature activation+1.77e-03

Pos logit contributions
 it+0.021
 the+0.013
 she+0.013
 Abby+0.012
 her+0.012
Neg logit contributions
aj-0.061
oll-0.048
udd-0.048
irms-0.048
ccoli-0.048
Token
Token ID198
Feature activation+7.35e-03

Pos logit contributions
 it+0.000
 Frankie+0.000
 Abby+0.000
 she+0.000
 the+0.000
Neg logit contributions
aj-0.000
irms-0.000
ccoli-0.000
ses-0.000
udd-0.000
Token.
Token ID13
Feature activation+8.13e-02

Pos logit contributions
aj+0.078
udd+0.061
irms+0.060
ccoli+0.060
ses+0.058
Neg logit contributions
 it-0.037
 Frankie-0.034
 Abby-0.034
 Susan-0.030
 Frank-0.029
Token Lily
Token ID20037
Feature activation-6.01e-01

Pos logit contributions
 it+0.005
 Frankie+0.003
 Abby+0.003
 she+0.003
 Susan+0.003
Neg logit contributions
aj-0.012
irms-0.010
ccoli-0.009
ses-0.009
udd-0.009
Token lived
Token ID5615
Feature activation-7.39e-01

Pos logit contributions
aj+0.076
irms+0.057
ccoli+0.056
ses+0.056
udd+0.055
Neg logit contributions
 it-0.046
 Frankie-0.038
 Abby-0.037
 she-0.036
 Susan-0.035
Token in
Token ID287
Feature activation-1.98e-01

Pos logit contributions
aj+0.114
ccoli+0.093
ses+0.091
ems+0.091
irms+0.091
Neg logit contributions
 it-0.041
 her-0.032
 the-0.032
 she-0.030
 Abby-0.025
Token a
Token ID257
Feature activation+2.16e-01

Pos logit contributions
aj+0.025
irms+0.021
ses+0.020
ibr+0.020
arl+0.019
Neg logit contributions
 it-0.011
 Frankie-0.010
 Abby-0.010
 Frank-0.009
 Susan-0.009
Token small
Token ID1402
Feature activation-3.08e-01

Pos logit contributions
 it+0.013
 Frankie+0.011
 Abby+0.010
 she+0.010
 Susan+0.009
Neg logit contributions
aj-0.031
irms-0.024
ccoli-0.023
ses-0.023
udd-0.023
Token house
Token ID2156
Feature activation+3.59e-01

Pos logit contributions
aj+0.036
irms+0.026
ccoli+0.025
udd+0.025
ses+0.025
Neg logit contributions
 it-0.027
 Frankie-0.023
 Abby-0.022
 she-0.021
 Susan-0.021
Token on
Token ID319
Feature activation+1.10e+00

Pos logit contributions
 it+0.022
 Frankie+0.019
 Abby+0.018
 Susan+0.016
 Frank+0.016
Neg logit contributions
aj-0.050
irms-0.039
udd-0.038
ses-0.038
ccoli-0.037
Token the
Token ID262
Feature activation+1.31e-02

Pos logit contributions
 it+0.055
 Frankie+0.052
 Abby+0.050
 Susan+0.044
 Frank+0.044
Neg logit contributions
aj-0.146
irms-0.113
ibr-0.112
udd-0.109
ses-0.108
Token side
Token ID1735
Feature activation-1.01e-01

Pos logit contributions
 it+0.001
 Frankie+0.001
 Abby+0.001
 Susan+0.001
 she+0.001
Neg logit contributions
aj-0.001
udd-0.001
ses-0.001
irms-0.001
arl-0.001
Token of
Token ID286
Feature activation+9.33e-01

Pos logit contributions
aj+0.014
udd+0.011
aked+0.011
irms+0.011
ibr+0.011
Neg logit contributions
 it-0.005
 Frankie-0.005
 Abby-0.004
 happen-0.004
 Frank-0.004
Token But
Token ID887
Feature activation-6.42e-01

Pos logit contributions
 it+0.007
 Frankie+0.006
 Abby+0.005
 she+0.005
 Frank+0.005
Neg logit contributions
aj-0.011
udd-0.009
obb-0.008
irms-0.008
uddle-0.008
Token always
Token ID1464
Feature activation-2.00e-01

Pos logit contributions
aj+0.092
irms+0.072
udd+0.070
ses+0.070
ccoli+0.070
Neg logit contributions
 it-0.041
 Frankie-0.031
 Abby-0.030
 she-0.029
 the-0.027
Token with
Token ID351
Feature activation+4.16e-01

Pos logit contributions
aj+0.027
irms+0.020
udd+0.020
ses+0.020
arl+0.020
Neg logit contributions
 it-0.015
 Frankie-0.012
 Abby-0.011
 she-0.011
 the-0.011
Token me
Token ID502
Feature activation-1.18e+00

Pos logit contributions
 it+0.024
 Frankie+0.017
 Abby+0.016
 she+0.016
 the+0.015
Neg logit contributions
aj-0.063
irms-0.050
udd-0.049
ccoli-0.049
ses-0.049
Token.
Token ID13
Feature activation+8.72e-02

Pos logit contributions
aj+0.172
irms+0.139
ses+0.135
udd+0.133
ccoli+0.132
Neg logit contributions
 it-0.070
 Frankie-0.051
 she-0.049
 Abby-0.048
 the-0.047
Token OK
Token ID7477
Feature activation-2.71e+00

Pos logit contributions
 it+0.007
 Frankie+0.006
 Abby+0.005
 she+0.005
 Susan+0.005
Neg logit contributions
aj-0.011
udd-0.009
irms-0.009
ses-0.009
obb-0.008
Token?"
Token ID1701
Feature activation-9.53e-01

Pos logit contributions
aj+0.307
ccoli+0.220
irms+0.215
ses+0.211
arl+0.208
Neg logit contributions
 it-0.235
 Frankie-0.204
 Abby-0.201
 she-0.193
 Susan-0.189
Token
Token ID198
Feature activation-1.59e-01

Pos logit contributions
aj+0.121
irms+0.092
ccoli+0.090
ses+0.088
arl+0.087
Neg logit contributions
 it-0.063
 Frankie-0.054
 Abby-0.052
 happen-0.049
 ringing-0.049
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
aj+0.022
irms+0.017
ccoli+0.016
udd+0.016
arl+0.016
Neg logit contributions
 it-0.010
 Frankie-0.008
 Abby-0.008
 she-0.008
 Susan-0.007
Token"
Token ID1
Feature activation-6.38e-01

Pos logit contributions
aj+0.016
irms+0.012
ccoli+0.011
udd+0.011
ses+0.011
Neg logit contributions
 it-0.014
 Frankie-0.012
 Abby-0.012
 she-0.012
 Susan-0.011
TokenOK
Token ID11380
Feature activation+1.78e-01

Pos logit contributions
aj+0.074
irms+0.054
udd+0.052
ses+0.052
ccoli+0.052
Neg logit contributions
 it-0.058
 she-0.051
 Frankie-0.048
 Abby-0.048
 her-0.045
Token mother
Token ID2802
Feature activation-2.31e-01

Pos logit contributions
 it+0.039
 Frankie+0.033
 Abby+0.032
 she+0.031
 Susan+0.030
Neg logit contributions
aj-0.053
irms-0.040
ccoli-0.038
ses-0.038
udd-0.038
Token.
Token ID13
Feature activation+2.94e-01

Pos logit contributions
aj+0.031
ccoli+0.023
udd+0.023
arl+0.023
ems+0.023
Neg logit contributions
 it-0.014
 Frankie-0.013
 Abby-0.012
 Susan-0.011
 Frank-0.011
Token We
Token ID775
Feature activation-5.21e-01

Pos logit contributions
 it+0.025
 Frankie+0.021
 Abby+0.021
 she+0.020
 the+0.019
Neg logit contributions
aj-0.036
irms-0.027
udd-0.026
ccoli-0.025
ibr-0.025
Token love
Token ID1842
Feature activation+9.18e-01

Pos logit contributions
aj+0.072
irms+0.055
arl+0.053
ses+0.053
ccoli+0.052
Neg logit contributions
 it-0.037
 Abby-0.028
 Frankie-0.028
 she-0.027
 the-0.026
Token g
Token ID308
Feature activation+8.94e-01

Pos logit contributions
 Frankie+0.061
 Susan+0.058
 it+0.058
 Abby+0.055
Susan+0.053
Neg logit contributions
aj-0.107
ccoli-0.083
irms-0.077
 cares-0.076
ses-0.075
Tokenaj
Token ID1228
Feature activation-4.05e+00

Pos logit contributions
 it+0.064
 Frankie+0.053
 Abby+0.051
 she+0.050
 Susan+0.047
Neg logit contributions
aj-0.119
irms-0.091
ccoli-0.088
udd-0.088
ses-0.088
Tokenar
Token ID283
Feature activation-1.24e+00

Pos logit contributions
aj+0.400
irms+0.275
ccoli+0.265
ses+0.263
udd+0.260
Neg logit contributions
 it-0.423
 Frankie-0.369
 Abby-0.362
 she-0.355
 Susan-0.345
Token ka
Token ID38387
Feature activation+1.13e+00

Pos logit contributions
aj+0.162
udd+0.122
ccoli+0.117
irms+0.117
arl+0.117
Neg logit contributions
 it-0.085
 Frankie-0.082
 Susan-0.075
 Abby-0.074
 Frank-0.070
Token hal
Token ID10284
Feature activation-4.99e-02

Pos logit contributions
 it+0.106
 Frankie+0.091
 Abby+0.089
 she+0.087
 Susan+0.084
Neg logit contributions
aj-0.123
irms-0.089
ccoli-0.086
ses-0.086
udd-0.084
Tokenwa
Token ID10247
Feature activation+1.34e+00

Pos logit contributions
aj+0.005
arl+0.004
ccoli+0.004
ppy+0.004
ems+0.004
Neg logit contributions
 it-0.005
 Frankie-0.004
 Abby-0.004
 Susan-0.004
 she-0.004
Token.
Token ID13
Feature activation+2.66e-01

Pos logit contributions
 it+0.088
 Frankie+0.077
 Abby+0.074
 she+0.072
 Susan+0.071
Neg logit contributions
aj-0.182
ccoli-0.139
irms-0.137
udd-0.136
arl-0.134
Token sailed
Token ID33835
Feature activation-2.55e-01

Pos logit contributions
aj+0.044
udd+0.033
irms+0.033
 cares+0.031
oll+0.031
Neg logit contributions
 it-0.032
 Frankie-0.031
 Susan-0.029
 Abby-0.028
 Martha-0.028
Token away
Token ID1497
Feature activation-8.79e-01

Pos logit contributions
aj+0.028
irms+0.021
udd+0.020
ses+0.020
aked+0.020
Neg logit contributions
 it-0.021
 Frankie-0.019
 Abby-0.018
 she-0.017
 Susan-0.017
Token in
Token ID287
Feature activation-1.92e-01

Pos logit contributions
aj+0.116
irms+0.090
ses+0.089
udd+0.089
ccoli+0.086
Neg logit contributions
 it-0.053
 Frankie-0.046
 Abby-0.042
 Susan-0.040
 she-0.038
Token the
Token ID262
Feature activation-5.47e-02

Pos logit contributions
aj+0.028
irms+0.024
ses+0.024
 imag+0.022
arl+0.022
Neg logit contributions
 it-0.007
 Frankie-0.006
 she-0.005
 Abby-0.005
 Susan-0.005
Token kay
Token ID34681
Feature activation-6.13e-01

Pos logit contributions
aj+0.008
ccoli+0.006
irms+0.006
ses+0.006
ibr+0.006
Neg logit contributions
 it-0.004
 Frankie-0.003
 Abby-0.003
 the-0.003
 she-0.003
Tokenak
Token ID461
Feature activation-2.68e+00

Pos logit contributions
aj+0.057
ses+0.051
ccoli+0.048
sed+0.046
ems+0.045
Neg logit contributions
 it-0.057
John-0.046
 of-0.046
 the-0.046
 Abby-0.045
Token,
Token ID11
Feature activation+2.93e-01

Pos logit contributions
aj+0.373
udd+0.288
irms+0.285
ccoli+0.280
ses+0.280
Neg logit contributions
 it-0.154
 Frankie-0.128
 Abby-0.124
 Susan-0.112
 she-0.108
Token never
Token ID1239
Feature activation+5.34e-01

Pos logit contributions
 it+0.019
 Frankie+0.016
 Abby+0.016
 Susan+0.015
 she+0.015
Neg logit contributions
aj-0.038
irms-0.029
ses-0.029
ccoli-0.029
udd-0.028
Token to
Token ID284
Feature activation+1.28e-01

Pos logit contributions
 it+0.035
 Frankie+0.030
 Abby+0.028
 Susan+0.026
 she+0.026
Neg logit contributions
aj-0.070
irms-0.054
ses-0.053
ccoli-0.053
udd-0.052
Token see
Token ID766
Feature activation+6.19e-01

Pos logit contributions
 it+0.009
 Frankie+0.008
 she+0.007
 Susan+0.007
 Abby+0.007
Neg logit contributions
aj-0.015
irms-0.012
arl-0.011
udd-0.011
 imag-0.011
Token the
Token ID262
Feature activation-3.63e-02

Pos logit contributions
 it+0.023
 Frankie+0.019
 Abby+0.017
 Susan+0.017
 she+0.016
Neg logit contributions
aj-0.093
irms-0.075
ccoli-0.075
ses-0.074
udd-0.073
Token see
Token ID766
Feature activation+7.48e-01

Pos logit contributions
aj+0.018
irms+0.014
ses+0.013
ccoli+0.013
udd+0.013
Neg logit contributions
 it-0.010
 Frankie-0.008
 Abby-0.008
 she-0.008
 Susan-0.008
Token rings
Token ID13917
Feature activation-2.95e-01

Pos logit contributions
 it+0.045
 Frankie+0.045
 Abby+0.041
 Susan+0.038
't+0.038
Neg logit contributions
aj-0.094
ses-0.073
udd-0.073
irms-0.073
 cares-0.069
Token,
Token ID11
Feature activation+3.92e-01

Pos logit contributions
aj+0.034
ows+0.028
udd+0.025
ems+0.025
ccoli+0.024
Neg logit contributions
 Frankie-0.021
 Abby-0.020
 it-0.018
 Susan-0.018
 Martha-0.018
Token neck
Token ID7393
Feature activation-1.59e+00

Pos logit contributions
 it+0.029
 Frankie+0.027
 Abby+0.025
 Frank+0.024
 Susan+0.023
Neg logit contributions
aj-0.048
irms-0.036
ems-0.035
ses-0.035
ccoli-0.035
Tokenl
Token ID75
Feature activation+9.40e-01

Pos logit contributions
aj+0.184
ows+0.146
udd+0.137
ccoli+0.127
ems+0.126
Neg logit contributions
 Frankie-0.118
 Abby-0.111
 Susan-0.109
 it-0.104
 Frank-0.103
Tokenaces
Token ID2114
Feature activation-2.74e+00

Pos logit contributions
 it+0.086
 Frankie+0.074
 Abby+0.072
 she+0.071
 Susan+0.068
Neg logit contributions
aj-0.105
irms-0.076
ccoli-0.073
ses-0.072
udd-0.072
Token,
Token ID11
Feature activation+3.87e-01

Pos logit contributions
aj+0.326
ows+0.262
udd+0.250
ets+0.243
ems+0.243
Neg logit contributions
 Frankie-0.190
 Abby-0.183
 it-0.183
 Martha-0.170
 Susan-0.167
Token brace
Token ID22581
Feature activation+5.49e-01

Pos logit contributions
 it+0.029
 Frankie+0.028
 Abby+0.026
 Susan+0.025
 Frank+0.025
Neg logit contributions
aj-0.047
irms-0.034
ems-0.034
ccoli-0.033
ses-0.033
Tokenlets
Token ID5289
Feature activation-2.88e-01

Pos logit contributions
 it+0.060
 Frankie+0.053
 Abby+0.052
 she+0.051
 Susan+0.049
Neg logit contributions
aj-0.052
irms-0.035
ccoli-0.033
ses-0.033
udd-0.033
Token and
Token ID290
Feature activation-1.62e-01

Pos logit contributions
aj+0.032
ows+0.026
ems+0.024
udd+0.024
ses+0.023
Neg logit contributions
 Frankie-0.020
 Abby-0.019
 it-0.019
 Susan-0.018
 Samantha-0.018
Token crown
Token ID12389
Feature activation-1.25e+00

Pos logit contributions
aj+0.020
irms+0.015
ses+0.014
ccoli+0.014
ems+0.014
Neg logit contributions
 it-0.013
 Frankie-0.011
 Abby-0.011
 Susan-0.010
 Frank-0.010
Token very
Token ID845
Feature activation+3.10e-01

Pos logit contributions
 it+0.007
 Frankie+0.006
 Abby+0.005
 she+0.005
 the+0.005
Neg logit contributions
aj-0.016
irms-0.013
ccoli-0.012
ses-0.012
arl-0.012
Token dark
Token ID3223
Feature activation-6.89e-01

Pos logit contributions
 it+0.022
 she+0.017
 the+0.016
 Abby+0.016
 Frankie+0.016
Neg logit contributions
aj-0.042
irms-0.033
ccoli-0.033
ses-0.032
ems-0.032
Token and
Token ID290
Feature activation-1.48e-01

Pos logit contributions
aj+0.090
udd+0.069
irms+0.069
ses+0.067
ccoli+0.067
Neg logit contributions
 it-0.044
 Frankie-0.039
 Abby-0.039
 Susan-0.035
 Frank-0.033
Token the
Token ID262
Feature activation+2.64e-02

Pos logit contributions
aj+0.020
irms+0.015
ccoli+0.015
ses+0.015
udd+0.014
Neg logit contributions
 it-0.009
 Frankie-0.008
 Abby-0.008
 Frank-0.007
 Susan-0.007
Token oy
Token ID35104
Feature activation-5.85e-01

Pos logit contributions
 it+0.002
 Frankie+0.002
 Abby+0.002
 Susan+0.002
 she+0.002
Neg logit contributions
aj-0.003
irms-0.002
ses-0.002
ccoli-0.002
udd-0.002
Tokenster
Token ID1706
Feature activation-2.80e+00

Pos logit contributions
aj+0.055
irms+0.042
arl+0.038
ses+0.037
 Deal+0.036
Neg logit contributions
 it-0.065
 she-0.056
 Frankie-0.054
 Abby-0.053
 he-0.051
Token started
Token ID2067
Feature activation+1.70e-01

Pos logit contributions
aj+0.388
irms+0.298
ccoli+0.294
ses+0.293
udd+0.292
Neg logit contributions
 it-0.176
 Frankie-0.144
 Abby-0.141
 she-0.129
 Susan-0.127
Token to
Token ID284
Feature activation+1.23e-01

Pos logit contributions
 it+0.011
 Frankie+0.010
 Abby+0.010
 Susan+0.009
 Frank+0.009
Neg logit contributions
aj-0.022
irms-0.017
ses-0.017
udd-0.017
ccoli-0.016
Token glow
Token ID19634
Feature activation+1.90e-01

Pos logit contributions
 it+0.009
 Abby+0.008
 Frankie+0.008
 Lib+0.008
 Susan+0.007
Neg logit contributions
aj-0.014
ses-0.011
ccoli-0.011
irms-0.011
 imag-0.010
Token.
Token ID13
Feature activation+2.12e-01

Pos logit contributions
 it+0.011
 Frankie+0.010
 Abby+0.010
 Susan+0.009
 Lib+0.009
Neg logit contributions
aj-0.025
udd-0.020
irms-0.019
ccoli-0.019
ows-0.019
Token Suddenly
Token ID24975
Feature activation+1.01e-01

Pos logit contributions
 it+0.014
 she+0.011
 Abby+0.010
 Frankie+0.010
 her+0.010
Neg logit contributions
aj-0.030
irms-0.024
ccoli-0.024
ses-0.024
ems-0.023