TOP ACTIVATIONS
MAX = 7.336

's mom made food without onions . Tim was very happy
to find out all about therm ometers and other things she
was finally able to get caul iflower after he didn 't
to drink it to stay hyd rated !" Billy drank the
Everyone was ready to start living in the new house .
<|endoftext|> Anna did not like caul iflower . She thought it
an eye out for little onions in the garden . <|endoftext|>
was very happy . <|endoftext|> Far away in the sky was
. They are for wood cut ters ." They took Lily
, so she went to Farmer Brown 's house . When
my was teaching Sally about therm ometers . Sally watched as
because they couldn 't have caul iflower for dinner . Lily
" Hello , caul iflower . I am Lily
then scoop ed it into pans and put it in the
. She said it was caul iflower mash and it was
. " Bro ccoli is not our friend ,
had saved him . <|endoftext|> Charl otte and her mom were
and to only have mass ages from people she trusted .
mom made them cocoa with marsh m allows . The boy
It 's important to stay hyd rated and healthy ," mom

MIN ACTIVATIONS
MIN = -6.590

and tried to wrap the other lace around it . But
Sometimes they repeat what the other one says , like a
, and then tie the other end around his ant ler
sad . She thought the other pengu ins would never want
did not care what the other cats and mice said .
shine as bright as the other lamps . The lamp felt
who can repeat what the other person says the best ,"
would never do what the other people wanted him to do
did not care what the other kids said . She knew
He punched and kicked the other boy . Suddenly , the
mad . They think the other one took the cell .
it could do what the other cube could do . So
, and he tie the other end of the rope to
The girl tied the other end of the rope to
loud noise that scared the other students . They all stopped
shine as bright as the other lamps . The lamp was
confused . They think the other kids are silly . They
and sun n ier the other side was !
embarrassed . She thought the other kids would laugh at her
wanted to do what the other one wanted . They got

INTERVAL 3.854 - 7.336
CONTAINS 0.114%

was a little bug named Bug gy . Bug gy wanted
" Don 't worry , Mickey . I will help you
<|endoftext|> Once there was a fire man named Jeffrey . He
a flower , you won â € ™ t poke it
Ben like to play with bricks . They have many bricks

INTERVAL 0.373 - 3.854
CONTAINS 34.564%

it was broken again . She was so disappointed , she
are happy and safe . <|endoftext|> Tom and Mia liked to
He had a high er aser . Bob loved to add
dirty . His mom was very sad that he didn 't
's owner . He was a man with a hat and

INTERVAL -3.109 - 0.373
CONTAINS 64.881%

. He concentrated very hard and cast his first spell .
hit her . L ily 's mom rushed
decided to yield and let the truck pass . After the
named Lily . She had a big heart and was always
asked her mom for help . " Mom ,

INTERVAL -6.590 - -3.109
CONTAINS 0.441%

even more lively than the first one . She held out
brought out a big pot of spaghetti . Tim my was
Joe . It will unlock more exciting days for you ."
't let me wear her dress !" Molly shook her head
the mess and ate some of the sandwiches . They agreed
Token's
Token ID338
Feature activation+1.61e+00

Pos logit contributions
 Jill+0.003
 tad+0.003
ill+0.003
 Sophia+0.003
 Pam+0.003
Neg logit contributions
cel-0.004
 Junior-0.004
 ambul-0.003
uf-0.003
 operating-0.003
Token mom
Token ID1995
Feature activation+1.92e+00

Pos logit contributions
cel+0.059
 Junior+0.057
 ambul+0.054
uf+0.054
 operating+0.053
Neg logit contributions
 Jill-0.047
 tad-0.046
ill-0.046
 Sophia-0.045
 Pam-0.041
Token made
Token ID925
Feature activation+1.58e+00

Pos logit contributions
cel+0.072
 Junior+0.069
 ambul+0.067
uf+0.066
Brown+0.066
Neg logit contributions
 Jill-0.055
ill-0.053
 tad-0.052
 Sophia-0.052
 Pam-0.047
Token food
Token ID2057
Feature activation+3.12e+00

Pos logit contributions
 Junior+0.035
 tid+0.032
 ambul+0.031
hers+0.030
 Thomas+0.029
Neg logit contributions
ill-0.068
 tad-0.068
 Sophia-0.064
Meg-0.063
ulu-0.062
Token without
Token ID1231
Feature activation+9.24e-01

Pos logit contributions
cel+0.081
 Junior+0.081
 tid+0.078
 ambul+0.077
 operating+0.076
Neg logit contributions
 Jill-0.116
 tad-0.116
 Sophia-0.113
ill-0.112
 Pam-0.106
Token onions
Token ID24681
Feature activation+7.08e+00

Pos logit contributions
cel+0.030
Brown+0.027
 Junior+0.027
 operating+0.026
uf+0.026
Neg logit contributions
 Jill-0.033
 Sophia-0.031
 tad-0.031
ill-0.030
 Pam-0.029
Token.
Token ID13
Feature activation-7.73e-02

Pos logit contributions
cel+0.237
 Junior+0.225
 ambul+0.212
Brown+0.209
uf+0.209
Neg logit contributions
 Jill-0.232
ill-0.227
 tad-0.222
 Sophia-0.219
 Pam-0.204
Token Tim
Token ID5045
Feature activation-6.57e-01

Pos logit contributions
 tad+0.002
ill+0.002
 Sophia+0.002
ulu+0.002
 Jill+0.002
Neg logit contributions
 Junior-0.002
cel-0.002
 ambul-0.002
 operating-0.002
uf-0.002
Token was
Token ID373
Feature activation+8.91e-03

Pos logit contributions
 Jill+0.018
ill+0.018
 tad+0.017
 Sophia+0.017
 grape+0.015
Neg logit contributions
cel-0.025
 Junior-0.024
 ambul-0.023
Thomas-0.023
 operating-0.023
Token very
Token ID845
Feature activation+1.38e+00

Pos logit contributions
cel+0.000
 Junior+0.000
 tid+0.000
 ambul+0.000
 operating+0.000
Neg logit contributions
ill-0.000
 Jill-0.000
 tad-0.000
 Sophia-0.000
 Pam-0.000
Token happy
Token ID3772
Feature activation-2.36e-01

Pos logit contributions
 Junior+0.046
 tid+0.046
uf+0.044
cel+0.044
 operating+0.044
Neg logit contributions
ill-0.043
 Jill-0.040
 Sophia-0.039
 tad-0.038
 Meg-0.036
Token to
Token ID284
Feature activation+2.98e-01

Pos logit contributions
cel+0.027
 Junior+0.027
 tid+0.026
 ambul+0.025
 operating+0.025
Neg logit contributions
 tad-0.021
 Jill-0.020
ill-0.020
 Sophia-0.019
 Pam-0.018
Token find
Token ID1064
Feature activation+8.59e-02

Pos logit contributions
cel+0.012
 Junior+0.012
 Thomas+0.011
uf+0.011
Thomas+0.011
Neg logit contributions
 Jill-0.007
ill-0.007
 tad-0.007
 Sophia-0.007
 flower-0.006
Token out
Token ID503
Feature activation+1.25e+00

Pos logit contributions
cel+0.004
Brown+0.003
 Junior+0.003
uf+0.003
hers+0.003
Neg logit contributions
 Jill-0.002
 tad-0.002
 Sophia-0.002
 Pam-0.002
ill-0.002
Token all
Token ID477
Feature activation+6.18e-01

Pos logit contributions
cel+0.030
 Junior+0.028
 ambul+0.026
uf+0.025
 operating+0.025
Neg logit contributions
 Jill-0.052
 tad-0.051
ill-0.051
 Sophia-0.050
 Pam-0.048
Token about
Token ID546
Feature activation+1.45e+00

Pos logit contributions
cel+0.020
 Junior+0.020
 ambul+0.019
 tid+0.019
 operating+0.019
Neg logit contributions
 tad-0.019
 Jill-0.019
ill-0.019
 Sophia-0.018
 Pam-0.017
Token therm
Token ID21969
Feature activation+6.87e+00

Pos logit contributions
cel+0.057
 Junior+0.054
Brown+0.054
uf+0.053
Thomas+0.052
Neg logit contributions
 Jill-0.038
 tad-0.037
 Sophia-0.036
ill-0.033
 Pam-0.033
Tokenometers
Token ID40077
Feature activation+6.22e+00

Pos logit contributions
cel+0.207
brate+0.205
Michael+0.194
 Junior+0.193
Charl+0.193
Neg logit contributions
ill-0.223
uttered-0.202
 Jill-0.198
ib-0.192
 tad-0.192
Token and
Token ID290
Feature activation+5.81e-02

Pos logit contributions
cel+0.205
 Junior+0.197
 ambul+0.186
uf+0.185
 operating+0.181
Neg logit contributions
 Jill-0.205
ill-0.200
 tad-0.199
 Sophia-0.195
 Pam-0.181
Token other
Token ID584
Feature activation-2.65e+00

Pos logit contributions
cel+0.002
 Junior+0.002
 ambul+0.002
 tid+0.002
 operating+0.002
Neg logit contributions
 Jill-0.002
ill-0.002
 tad-0.002
 Sophia-0.002
 Pam-0.002
Token things
Token ID1243
Feature activation+1.75e+00

Pos logit contributions
 tad+0.054
 new+0.047
 chance+0.047
 Jill+0.046
 flower+0.041
Neg logit contributions
cel-0.112
Brown-0.110
Rust-0.108
Charl-0.107
riger-0.107
Token she
Token ID673
Feature activation-1.92e-01

Pos logit contributions
cel+0.061
 Junior+0.059
uf+0.056
 ambul+0.055
 operating+0.054
Neg logit contributions
 Jill-0.055
 tad-0.053
ill-0.053
 Sophia-0.052
 Pam-0.049
Token was
Token ID373
Feature activation+4.09e-01

Pos logit contributions
cel+0.043
 Junior+0.042
 tid+0.039
uf+0.038
 Thomas+0.038
Neg logit contributions
 Jill-0.042
ill-0.042
 tad-0.041
 Sophia-0.040
 Pam-0.038
Token finally
Token ID3443
Feature activation-9.76e-01

Pos logit contributions
 Junior+0.012
cel+0.012
 tid+0.012
hers+0.011
 operating+0.011
Neg logit contributions
ill-0.014
 tad-0.014
 Jill-0.013
alk-0.013
 Sophia-0.013
Token able
Token ID1498
Feature activation+1.66e+00

Pos logit contributions
ill+0.035
 Jill+0.035
 tad+0.035
 Sophia+0.034
 Pam+0.032
Neg logit contributions
cel-0.028
 Junior-0.027
 tid-0.026
 ambul-0.025
hers-0.025
Token to
Token ID284
Feature activation+2.79e-01

Pos logit contributions
 Junior+0.053
 tid+0.051
 ambul+0.049
 Thomas+0.048
cel+0.048
Neg logit contributions
 tad-0.052
ill-0.052
 Jill-0.050
 Sophia-0.048
ulu-0.047
Token get
Token ID651
Feature activation+7.94e-01

Pos logit contributions
cel+0.010
 Junior+0.010
 ambul+0.010
uf+0.009
 operating+0.009
Neg logit contributions
 Jill-0.008
 tad-0.008
ill-0.008
 Sophia-0.008
 Pam-0.007
Token caul
Token ID46087
Feature activation+6.85e+00

Pos logit contributions
 Junior+0.025
cel+0.024
 tid+0.023
 ambul+0.023
uf+0.023
Neg logit contributions
ill-0.026
 tad-0.026
 Jill-0.025
 Sophia-0.025
 Meg-0.023
Tokeniflower
Token ID42642
Feature activation+5.67e+00

Pos logit contributions
 ambul+0.215
 Michael+0.214
cel+0.211
 Thomas+0.208
Michael+0.201
Neg logit contributions
 tad-0.229
ill-0.216
ectar-0.209
 bloom-0.205
 flower-0.204
Token after
Token ID706
Feature activation+2.29e-01

Pos logit contributions
cel+0.175
 Junior+0.171
 ambul+0.159
uf+0.157
 operating+0.156
Neg logit contributions
 Jill-0.195
 tad-0.192
ill-0.189
 Sophia-0.186
 Pam-0.176
Token he
Token ID339
Feature activation+1.67e+00

Pos logit contributions
 Junior+0.006
cel+0.006
 Thomas+0.006
uf+0.006
 ambul+0.006
Neg logit contributions
 tad-0.009
ill-0.009
 Jill-0.008
 Sophia-0.008
 Pam-0.008
Token didn
Token ID1422
Feature activation-5.31e-01

Pos logit contributions
cel+0.055
 Junior+0.052
 tid+0.050
uf+0.049
hers+0.048
Neg logit contributions
 Jill-0.055
 tad-0.054
ill-0.054
 Sophia-0.053
 Pam-0.051
Token't
Token ID470
Feature activation-1.79e-01

Pos logit contributions
 Jill+0.018
 tad+0.017
ill+0.017
 Sophia+0.017
 Pam+0.016
Neg logit contributions
cel-0.017
 Junior-0.017
 ambul-0.016
uf-0.016
 operating-0.015
Token to
Token ID284
Feature activation+7.09e-01

Pos logit contributions
cel+0.046
 Junior+0.045
 ambul+0.043
uf+0.042
 operating+0.042
Neg logit contributions
 Jill-0.030
 tad-0.030
ill-0.029
 Sophia-0.029
 Pam-0.026
Token drink
Token ID4144
Feature activation+2.20e-01

Pos logit contributions
cel+0.029
 Junior+0.027
 ambul+0.026
Brown+0.026
 Thomas+0.026
Neg logit contributions
 Jill-0.018
 tad-0.018
ill-0.018
 Sophia-0.017
 Pam-0.016
Token it
Token ID340
Feature activation-3.89e-01

Pos logit contributions
cel+0.008
 Junior+0.008
 ambul+0.007
uf+0.007
 operating+0.007
Neg logit contributions
 Jill-0.006
 tad-0.006
ill-0.006
 Sophia-0.006
 Pam-0.006
Token to
Token ID284
Feature activation+1.60e-01

Pos logit contributions
 tad+0.014
 Jill+0.014
 Sophia+0.014
ill+0.014
 Pam+0.013
Neg logit contributions
cel-0.011
 Junior-0.011
 tid-0.010
 ambul-0.010
hers-0.010
Token stay
Token ID2652
Feature activation+1.05e+00

Pos logit contributions
cel+0.006
 Junior+0.005
 ambul+0.005
uf+0.005
 operating+0.005
Neg logit contributions
 Jill-0.005
 tad-0.005
ill-0.005
 Sophia-0.005
 Pam-0.004
Token hyd
Token ID7409
Feature activation+6.77e+00

Pos logit contributions
cel+0.038
 Junior+0.036
 ambul+0.036
uf+0.035
Brown+0.035
Neg logit contributions
 Jill-0.031
 tad-0.030
ill-0.029
 Sophia-0.028
 Pam-0.026
Tokenrated
Token ID4111
Feature activation+2.13e+00

Pos logit contributions
cel+0.187
 Junior+0.178
 ambul+0.167
uf+0.164
 operating+0.161
Neg logit contributions
 Jill-0.260
ill-0.255
 tad-0.254
 Sophia-0.248
 Pam-0.235
Token!"
Token ID2474
Feature activation-4.77e-01

Pos logit contributions
cel+0.068
 Junior+0.065
 ambul+0.061
uf+0.061
 operating+0.060
Neg logit contributions
 Jill-0.072
ill-0.070
 tad-0.070
 Sophia-0.068
 Pam-0.064
Token Billy
Token ID15890
Feature activation+6.25e-01

Pos logit contributions
 Jill+0.012
 tad+0.012
ill+0.012
 Sophia+0.011
 Pam+0.010
Neg logit contributions
cel-0.019
 Junior-0.019
 ambul-0.018
uf-0.018
 Thomas-0.018
Token drank
Token ID24070
Feature activation+2.24e-01

Pos logit contributions
cel+0.025
 Junior+0.024
 ambul+0.023
uf+0.023
 operating+0.023
Neg logit contributions
 Jill-0.016
 tad-0.016
ill-0.016
 Sophia-0.015
 Pam-0.014
Token the
Token ID262
Feature activation-6.62e-01

Pos logit contributions
 Junior+0.007
cel+0.007
 tid+0.007
 ambul+0.006
 Thomas+0.006
Neg logit contributions
ill-0.007
 tad-0.007
 Jill-0.007
 Sophia-0.007
 Meg-0.007
Token Everyone
Token ID11075
Feature activation-1.05e+00

Pos logit contributions
 bead+0.005
arf+0.005
 bracelet+0.004
 butterfly+0.004
 stone+0.004
Neg logit contributions
 Junior-0.004
 tid-0.004
 why-0.004
 Little-0.004
 Thomas-0.004
Token was
Token ID373
Feature activation+3.23e-01

Pos logit contributions
 Jill+0.032
 tad+0.030
ill+0.030
 Sophia+0.029
 Pam+0.027
Neg logit contributions
cel-0.040
 Junior-0.037
uf-0.037
 ambul-0.035
Brown-0.034
Token ready
Token ID3492
Feature activation+2.11e+00

Pos logit contributions
cel+0.013
 Junior+0.013
uf+0.012
Brown+0.012
brate+0.012
Neg logit contributions
 Jill-0.008
 tad-0.008
 Sophia-0.008
ill-0.008
 Pam-0.007
Token to
Token ID284
Feature activation+1.47e+00

Pos logit contributions
cel+0.074
 Junior+0.072
 ambul+0.069
 tid+0.068
 Thomas+0.068
Neg logit contributions
 tad-0.064
 Jill-0.062
ill-0.061
 Sophia-0.060
 Pam-0.057
Token start
Token ID923
Feature activation+6.69e-01

Pos logit contributions
cel+0.064
 Junior+0.059
 ambul+0.058
uf+0.057
Brown+0.057
Neg logit contributions
 Jill-0.035
ill-0.033
 tad-0.032
 Sophia-0.032
 Pam-0.029
Token living
Token ID2877
Feature activation+6.77e+00

Pos logit contributions
cel+0.022
 Junior+0.021
uf+0.020
 ambul+0.020
Brown+0.020
Neg logit contributions
 Jill-0.022
ill-0.021
 tad-0.021
 Sophia-0.021
 Pam-0.019
Token in
Token ID287
Feature activation+2.93e+00

Pos logit contributions
cel+0.245
 Junior+0.231
Brown+0.226
uf+0.225
 ambul+0.223
Neg logit contributions
 Jill-0.216
 tad-0.200
 Sophia-0.199
ill-0.197
 Pam-0.176
Token the
Token ID262
Feature activation+2.71e+00

Pos logit contributions
cel+0.098
 Junior+0.095
 ambul+0.089
uf+0.089
 operating+0.087
Neg logit contributions
 Jill-0.095
 tad-0.093
ill-0.092
 Sophia-0.090
 Pam-0.084
Token new
Token ID649
Feature activation-1.58e+00

Pos logit contributions
cel+0.105
 Junior+0.101
uf+0.096
 ambul+0.094
Brown+0.093
Neg logit contributions
 Jill-0.076
 tad-0.073
ill-0.070
 Sophia-0.069
 Pam-0.065
Token house
Token ID2156
Feature activation+5.03e-01

Pos logit contributions
 Jill+0.047
 tad+0.045
ill+0.045
 Sophia+0.044
 Pam+0.041
Neg logit contributions
cel-0.058
 Junior-0.057
uf-0.053
 ambul-0.053
 operating-0.052
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
cel+0.015
 Junior+0.015
 ambul+0.014
uf+0.014
 operating+0.013
Neg logit contributions
 Jill-0.018
 tad-0.017
ill-0.017
 Sophia-0.017
 Pam-0.016
Token<|endoftext|>
Token ID50256
Feature activation+9.95e-01

Pos logit contributions
cel+0.023
 Junior+0.023
uf+0.021
 Thomas+0.021
 operating+0.020
Neg logit contributions
 tad-0.020
ill-0.019
 Jill-0.019
 Sophia-0.019
 bou-0.018
TokenAnna
Token ID31160
Feature activation+5.68e-02

Pos logit contributions
cel+0.025
 Junior+0.024
 Thomas+0.022
 operating+0.022
uf+0.021
Neg logit contributions
 Jill-0.040
 tad-0.039
 Sophia-0.038
ill-0.038
 Pam-0.037
Token did
Token ID750
Feature activation-3.81e-01

Pos logit contributions
cel+0.002
 Junior+0.002
 ambul+0.002
uf+0.002
 operating+0.002
Neg logit contributions
 Jill-0.002
ill-0.002
 tad-0.002
 Sophia-0.002
 Pam-0.001
Token not
Token ID407
Feature activation+1.38e+00

Pos logit contributions
 tad+0.016
ill+0.016
 Jill+0.016
 Sophia+0.015
 Pam+0.015
Neg logit contributions
cel-0.009
 Junior-0.008
uf-0.007
 tid-0.007
 ambul-0.007
Token like
Token ID588
Feature activation+1.30e+00

Pos logit contributions
cel+0.052
 Junior+0.051
 ambul+0.048
 tid+0.048
 operating+0.048
Neg logit contributions
 tad-0.039
 Jill-0.038
ill-0.037
 Sophia-0.036
 Pam-0.033
Token caul
Token ID46087
Feature activation+6.76e+00

Pos logit contributions
 Junior+0.045
cel+0.045
 ambul+0.044
 tid+0.044
 Thomas+0.043
Neg logit contributions
 tad-0.038
ill-0.037
 Jill-0.036
 Sophia-0.036
 Pam-0.033
Tokeniflower
Token ID42642
Feature activation+5.44e+00

Pos logit contributions
cel+0.220
 Thomas+0.217
 Junior+0.217
 Michael+0.211
 ambul+0.209
Neg logit contributions
 bloom-0.200
 flower-0.198
 grape-0.195
 tad-0.194
ectar-0.187
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
cel+0.159
 Junior+0.153
 ambul+0.144
uf+0.142
 operating+0.140
Neg logit contributions
 Jill-0.197
 tad-0.194
ill-0.192
 Sophia-0.189
 Pam-0.178
Token She
Token ID1375
Feature activation+1.48e-01

Pos logit contributions
 Junior+0.006
cel+0.006
 Thomas+0.006
 operating+0.006
uf+0.005
Neg logit contributions
 tad-0.004
ill-0.004
 Jill-0.003
 Sophia-0.003
 chance-0.003
Token thought
Token ID1807
Feature activation-2.31e+00

Pos logit contributions
cel+0.005
 Junior+0.005
 ambul+0.005
uf+0.005
 operating+0.005
Neg logit contributions
 Jill-0.005
 tad-0.004
ill-0.004
 Sophia-0.004
 Pam-0.004
Token it
Token ID340
Feature activation-3.00e+00

Pos logit contributions
 Jill+0.061
 tad+0.059
ill+0.058
 Sophia+0.057
 Pam+0.052
Neg logit contributions
cel-0.092
 Junior-0.088
 ambul-0.084
uf-0.083
 operating-0.083
Token an
Token ID281
Feature activation+2.14e+00

Pos logit contributions
cel+0.044
 Junior+0.043
 ambul+0.041
 tid+0.040
 operating+0.040
Neg logit contributions
 Jill-0.034
ill-0.034
 tad-0.034
 Sophia-0.032
 Pam-0.030
Token eye
Token ID4151
Feature activation-1.62e+00

Pos logit contributions
cel+0.087
 Junior+0.079
 Thomas+0.077
Thomas+0.077
Brown+0.076
Neg logit contributions
 tad-0.058
 Jill-0.057
ill-0.055
 Sophia-0.052
 chance-0.050
Token out
Token ID503
Feature activation+3.06e-01

Pos logit contributions
 Jill+0.050
ill+0.049
 tad+0.049
 Sophia+0.048
 Pam+0.045
Neg logit contributions
cel-0.055
 Junior-0.053
uf-0.051
 ambul-0.051
 Thomas-0.049
Token for
Token ID329
Feature activation+1.57e+00

Pos logit contributions
cel+0.009
 Junior+0.009
 ambul+0.008
 operating+0.008
 tid+0.008
Neg logit contributions
 Jill-0.011
 tad-0.011
ill-0.011
 Sophia-0.010
 Pam-0.010
Token little
Token ID1310
Feature activation+7.09e-01

Pos logit contributions
cel+0.053
 Junior+0.052
 ambul+0.049
uf+0.048
 tid+0.048
Neg logit contributions
 Jill-0.050
ill-0.050
 tad-0.050
 Sophia-0.048
 Pam-0.045
Token onions
Token ID24681
Feature activation+6.75e+00

Pos logit contributions
cel+0.026
 Junior+0.025
uf+0.024
 ambul+0.023
hers+0.023
Neg logit contributions
 Jill-0.021
 tad-0.021
ill-0.020
 Sophia-0.020
 Pam-0.019
Token in
Token ID287
Feature activation+9.79e-01

Pos logit contributions
cel+0.216
 Junior+0.207
 ambul+0.195
uf+0.194
Brown+0.189
Neg logit contributions
 Jill-0.227
ill-0.222
 tad-0.221
 Sophia-0.216
 Pam-0.202
Token the
Token ID262
Feature activation+1.66e+00

Pos logit contributions
 Junior+0.034
cel+0.033
 tid+0.032
 Thomas+0.032
 ambul+0.032
Neg logit contributions
 tad-0.029
ill-0.028
 Sophia-0.027
 Jill-0.027
ashes-0.026
Token garden
Token ID11376
Feature activation+8.67e-01

Pos logit contributions
cel+0.074
 Junior+0.072
 ambul+0.070
uf+0.069
Brown+0.068
Neg logit contributions
 Jill-0.036
 tad-0.034
 Sophia-0.032
ill-0.032
 Pam-0.029
Token.
Token ID13
Feature activation+7.11e-01

Pos logit contributions
cel+0.027
 Junior+0.026
 ambul+0.024
uf+0.024
 operating+0.023
Neg logit contributions
 Jill-0.030
 tad-0.030
ill-0.029
 Sophia-0.029
 Pam-0.027
Token<|endoftext|>
Token ID50256
Feature activation+1.21e+00

Pos logit contributions
cel+0.027
 Junior+0.027
uf+0.025
 ambul+0.025
 Thomas+0.025
Neg logit contributions
 tad-0.020
 Jill-0.019
ill-0.019
 Sophia-0.018
 Pam-0.017
Token was
Token ID373
Feature activation+1.33e+00

Pos logit contributions
cel+0.016
 Junior+0.015
uf+0.014
 ambul+0.014
Brown+0.014
Neg logit contributions
 Jill-0.013
 tad-0.012
ill-0.012
 Sophia-0.012
ulu-0.011
Token very
Token ID845
Feature activation+1.59e+00

Pos logit contributions
cel+0.042
 Junior+0.040
 ambul+0.038
uf+0.037
 operating+0.037
Neg logit contributions
 Jill-0.045
 tad-0.045
ill-0.044
 Sophia-0.043
 Pam-0.041
Token happy
Token ID3772
Feature activation-5.33e-01

Pos logit contributions
 Junior+0.058
cel+0.057
 ambul+0.055
 tid+0.054
 operating+0.054
Neg logit contributions
ill-0.045
 Jill-0.043
 tad-0.043
 Sophia-0.042
 Meg-0.039
Token.
Token ID13
Feature activation+9.54e-01

Pos logit contributions
 Jill+0.019
 tad+0.018
ill+0.018
 Sophia+0.018
 Pam+0.017
Neg logit contributions
cel-0.017
 Junior-0.016
 ambul-0.015
uf-0.015
 operating-0.015
Token<|endoftext|>
Token ID50256
Feature activation+9.07e-01

Pos logit contributions
cel+0.034
 Junior+0.034
 Thomas+0.032
uf+0.032
 ambul+0.031
Neg logit contributions
 tad-0.028
ill-0.027
 Jill-0.027
 Sophia-0.026
 Pam-0.024
TokenFar
Token ID21428
Feature activation+6.70e+00

Pos logit contributions
cel+0.026
 Junior+0.025
 Thomas+0.023
uf+0.023
 operating+0.023
Neg logit contributions
 tad-0.033
 Jill-0.033
ill-0.032
 Sophia-0.032
 Pam-0.030
Token away
Token ID1497
Feature activation-1.93e+00

Pos logit contributions
cel+0.221
 Junior+0.205
 ambul+0.201
 tid+0.196
uf+0.196
Neg logit contributions
 Jill-0.217
 Sophia-0.212
 tad-0.211
ill-0.210
 Pam-0.194
Token in
Token ID287
Feature activation+1.09e+00

Pos logit contributions
 Jill+0.073
 tad+0.072
ill+0.071
 Sophia+0.070
 Pam+0.066
Neg logit contributions
cel-0.053
 Junior-0.051
 ambul-0.048
uf-0.047
 operating-0.047
Token the
Token ID262
Feature activation+1.95e+00

Pos logit contributions
cel+0.041
 Junior+0.038
 ambul+0.037
uf+0.037
hers+0.037
Neg logit contributions
 Jill-0.032
 tad-0.030
 Sophia-0.030
ill-0.030
 Pam-0.028
Token sky
Token ID6766
Feature activation+9.30e-01

Pos logit contributions
 Junior+0.072
cel+0.072
 operating+0.068
 ambul+0.068
 tid+0.067
Neg logit contributions
 Jill-0.054
ill-0.054
 tad-0.053
 Sophia-0.051
 Pam-0.048
Token was
Token ID373
Feature activation+1.07e+00

Pos logit contributions
cel+0.024
 Junior+0.023
 ambul+0.021
 tid+0.021
 operating+0.021
Neg logit contributions
 tad-0.036
 Jill-0.036
 Sophia-0.035
ill-0.035
 Pam-0.033
Token.
Token ID13
Feature activation+8.90e-01

Pos logit contributions
cel+0.010
 Junior+0.010
 ambul+0.009
uf+0.009
 operating+0.009
Neg logit contributions
 Jill-0.012
 tad-0.011
ill-0.011
 Sophia-0.011
 Pam-0.011
Token They
Token ID1119
Feature activation+8.63e-01

Pos logit contributions
cel+0.037
 Junior+0.035
 tid+0.034
hers+0.034
 ambul+0.034
Neg logit contributions
 Jill-0.022
 tad-0.021
ill-0.021
 Sophia-0.020
 Pam-0.019
Token are
Token ID389
Feature activation+2.60e-01

Pos logit contributions
cel+0.034
 Junior+0.033
uf+0.031
Thomas+0.031
 ambul+0.031
Neg logit contributions
 tad-0.023
 Jill-0.022
ill-0.022
 Sophia-0.020
 Pam-0.020
Token for
Token ID329
Feature activation+2.04e+00

Pos logit contributions
cel+0.009
 Junior+0.008
uf+0.008
 ambul+0.008
Brown+0.007
Neg logit contributions
 Jill-0.009
 tad-0.008
 Sophia-0.008
ill-0.008
 Pam-0.008
Token wood
Token ID4898
Feature activation+4.61e+00

Pos logit contributions
cel+0.065
 Junior+0.061
Brown+0.057
Thomas+0.056
uf+0.055
Neg logit contributions
 Jill-0.073
 tad-0.071
 Sophia-0.066
 Pam-0.065
ill-0.064
Tokencut
Token ID8968
Feature activation+6.70e+00

Pos logit contributions
cel+0.169
 Junior+0.165
uf+0.155
 ambul+0.152
 Thomas+0.151
Neg logit contributions
 Jill-0.135
 tad-0.132
ill-0.131
 Sophia-0.125
 Pam-0.117
Tokenters
Token ID1010
Feature activation+2.64e+00

Pos logit contributions
cel+0.242
 operating+0.231
 Junior+0.230
 tid+0.225
Charl+0.222
Neg logit contributions
 Jill-0.205
ill-0.202
 tad-0.190
 Sophia-0.188
 Pam-0.173
Token."
Token ID526
Feature activation-2.93e-01

Pos logit contributions
cel+0.095
 Junior+0.092
uf+0.085
Brown+0.085
Thomas+0.085
Neg logit contributions
 Jill-0.079
ill-0.077
 tad-0.077
 Sophia-0.074
 Pam-0.068
Token They
Token ID1119
Feature activation+7.34e-01

Pos logit contributions
 tad+0.008
ill+0.008
 Sophia+0.008
 Jill+0.008
ulu+0.007
Neg logit contributions
 Junior-0.011
cel-0.010
 Thomas-0.010
 ambul-0.010
uf-0.010
Token took
Token ID1718
Feature activation+1.02e-02

Pos logit contributions
cel+0.025
 Junior+0.024
 ambul+0.023
uf+0.023
 operating+0.023
Neg logit contributions
 Jill-0.023
 tad-0.023
ill-0.022
 Sophia-0.022
 Pam-0.020
Token Lily
Token ID20037
Feature activation-1.17e+00

Pos logit contributions
cel+0.000
 Junior+0.000
 ambul+0.000
uf+0.000
 operating+0.000
Neg logit contributions
 Jill-0.000
 tad-0.000
ill-0.000
 Sophia-0.000
 Pam-0.000
Token,
Token ID11
Feature activation-8.60e-01

Pos logit contributions
 Jill+0.070
ill+0.067
 Sophia+0.064
 tad+0.062
 Meg+0.059
Neg logit contributions
cel-0.088
 Junior-0.085
uf-0.081
ind-0.080
 ambul-0.079
Token so
Token ID523
Feature activation-7.86e-01

Pos logit contributions
ill+0.033
 tad+0.033
 Jill+0.032
 Sophia+0.031
ulu+0.030
Neg logit contributions
cel-0.023
 Junior-0.023
 ambul-0.022
 Thomas-0.021
 tid-0.021
Token she
Token ID673
Feature activation-3.46e-01

Pos logit contributions
 Jill+0.020
 tad+0.018
 Sophia+0.018
ill+0.018
 Pam+0.017
Neg logit contributions
cel-0.034
 Junior-0.032
uf-0.031
hers-0.031
 ambul-0.030
Token went
Token ID1816
Feature activation-1.63e-01

Pos logit contributions
 Jill+0.011
ill+0.011
 tad+0.011
 Sophia+0.010
 Pam+0.009
Neg logit contributions
cel-0.012
 Junior-0.012
 tid-0.011
 Thomas-0.011
hers-0.010
Token to
Token ID284
Feature activation+1.32e+00

Pos logit contributions
 tad+0.006
ill+0.006
 Jill+0.005
 Sophia+0.005
 Pam+0.005
Neg logit contributions
cel-0.005
 Junior-0.005
 tid-0.005
 operating-0.005
 Thomas-0.005
Token Farmer
Token ID37755
Feature activation+6.68e+00

Pos logit contributions
cel+0.047
 Junior+0.045
 ambul+0.043
uf+0.042
 operating+0.042
Neg logit contributions
 Jill-0.040
 tad-0.039
ill-0.038
 Sophia-0.038
 Pam-0.036
Token Brown
Token ID4373
Feature activation+4.74e+00

Pos logit contributions
brate+0.212
cel+0.206
ind+0.202
uf+0.198
rast+0.196
Neg logit contributions
 Jill-0.234
 Sophia-0.217
ill-0.212
 Sue-0.208
 Meg-0.201
Token's
Token ID338
Feature activation+3.13e+00

Pos logit contributions
cel+0.152
 Junior+0.140
 ambul+0.139
uf+0.135
Brown+0.134
Neg logit contributions
ill-0.158
 Jill-0.147
 Sophia-0.140
 tad-0.139
 flower-0.131
Token house
Token ID2156
Feature activation+2.34e+00

Pos logit contributions
cel+0.106
 Junior+0.102
 ambul+0.095
uf+0.095
 operating+0.093
Neg logit contributions
 Jill-0.101
 tad-0.098
ill-0.096
 Sophia-0.096
 Pam-0.089
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
cel+0.079
 Junior+0.076
 ambul+0.072
uf+0.071
 operating+0.070
Neg logit contributions
 Jill-0.075
 tad-0.074
ill-0.073
 Sophia-0.072
 Pam-0.067
Token When
Token ID1649
Feature activation-1.49e-01

Pos logit contributions
cel+0.010
hers+0.010
ind+0.010
brate+0.010
 Junior+0.010
Neg logit contributions
 Jill-0.006
 Sophia-0.006
ill-0.006
 tad-0.005
 Sue-0.005
Tokenmy
Token ID1820
Feature activation+2.06e+00

Pos logit contributions
cel+0.067
 ambul+0.062
 Junior+0.062
 operating+0.061
uf+0.061
Neg logit contributions
 Jill-0.061
ill-0.060
 Sophia-0.056
 tad-0.053
 Pam-0.049
Token was
Token ID373
Feature activation+1.90e+00

Pos logit contributions
cel+0.083
 ambul+0.079
 Junior+0.078
uf+0.077
Brown+0.076
Neg logit contributions
 Jill-0.052
 Sophia-0.048
ill-0.047
 tad-0.046
 Pam-0.042
Token teaching
Token ID7743
Feature activation+1.13e+00

Pos logit contributions
cel+0.057
 Junior+0.054
uf+0.050
 ambul+0.050
Brown+0.049
Neg logit contributions
 Jill-0.069
 tad-0.066
 Sophia-0.065
ill-0.065
 Pam-0.060
Token Sally
Token ID25737
Feature activation-9.23e-01

Pos logit contributions
cel+0.038
 Junior+0.035
uf+0.033
 operating+0.033
 ambul+0.033
Neg logit contributions
 Jill-0.039
ill-0.037
 Sophia-0.037
 tad-0.036
 Pam-0.034
Token about
Token ID546
Feature activation+1.74e+00

Pos logit contributions
 Jill+0.029
ill+0.028
 Sophia+0.028
 tad+0.027
 Pam+0.025
Neg logit contributions
cel-0.033
 Junior-0.031
 ambul-0.030
 operating-0.029
uf-0.029
Token therm
Token ID21969
Feature activation+6.64e+00

Pos logit contributions
cel+0.058
 Junior+0.057
 ambul+0.054
 Thomas+0.052
 tid+0.052
Neg logit contributions
ill-0.055
 tad-0.055
 Jill-0.054
 Sophia-0.052
 Pam-0.049
Tokenometers
Token ID40077
Feature activation+5.95e+00

Pos logit contributions
cel+0.218
 Junior+0.206
 ambul+0.201
brate+0.196
 operating+0.195
Neg logit contributions
ill-0.219
 Jill-0.213
 Sophia-0.198
uttered-0.191
 tad-0.187
Token.
Token ID13
Feature activation+3.62e-01

Pos logit contributions
cel+0.204
 Junior+0.195
uf+0.184
 ambul+0.180
Brown+0.180
Neg logit contributions
 Jill-0.192
ill-0.186
 tad-0.182
 Sophia-0.180
 Pam-0.166
Token Sally
Token ID25737
Feature activation-1.40e+00

Pos logit contributions
 Junior+0.012
cel+0.011
 ambul+0.011
 why+0.011
 tid+0.011
Neg logit contributions
 tad-0.010
 bead-0.010
ilies-0.010
ill-0.010
ived-0.009
Token watched
Token ID7342
Feature activation+1.29e+00

Pos logit contributions
 Jill+0.036
 Sophia+0.032
ill+0.032
 tad+0.030
 Sue+0.029
Neg logit contributions
cel-0.056
 ambul-0.053
uf-0.053
 Junior-0.052
 operating-0.051
Token as
Token ID355
Feature activation+1.91e-01

Pos logit contributions
cel+0.042
 Junior+0.039
uf+0.037
 ambul+0.037
 operating+0.037
Neg logit contributions
 Jill-0.044
 tad-0.042
ill-0.041
 Sophia-0.041
 Pam-0.039
Token because
Token ID780
Feature activation+2.32e+00

Pos logit contributions
cel+0.036
 Junior+0.035
uf+0.033
Brown+0.033
riger+0.033
Neg logit contributions
 Jill-0.028
ill-0.026
 Sophia-0.025
 tad-0.023
 Pam-0.023
Token they
Token ID484
Feature activation+5.24e-01

Pos logit contributions
 Junior+0.095
 ambul+0.094
 tid+0.093
 operating+0.092
cel+0.092
Neg logit contributions
 tad-0.057
ill-0.054
ashes-0.050
 Sophia-0.048
 Jill-0.047
Token couldn
Token ID3521
Feature activation-5.82e-01

Pos logit contributions
cel+0.021
uf+0.020
 Junior+0.020
 ambul+0.019
 operating+0.019
Neg logit contributions
 Jill-0.014
ill-0.013
 tad-0.012
 Sophia-0.012
 Pam-0.011
Token't
Token ID470
Feature activation-6.31e-01

Pos logit contributions
 tad+0.023
 Jill+0.022
 Sophia+0.021
ill+0.021
 Pam+0.020
Neg logit contributions
cel-0.016
 Junior-0.016
 operating-0.014
 ambul-0.014
 Thomas-0.014
Token have
Token ID423
Feature activation-7.26e-01

Pos logit contributions
 Jill+0.017
ill+0.016
 tad+0.016
 Sophia+0.016
 Pam+0.015
Neg logit contributions
cel-0.025
 Junior-0.024
 ambul-0.023
 Thomas-0.022
uf-0.022
Token caul
Token ID46087
Feature activation+6.60e+00

Pos logit contributions
 tad+0.023
ill+0.023
 Jill+0.023
 Sophia+0.022
 Pam+0.021
Neg logit contributions
 Junior-0.024
cel-0.023
 tid-0.022
uf-0.022
 ambul-0.022
Tokeniflower
Token ID42642
Feature activation+4.91e+00

Pos logit contributions
 Michael+0.200
 ambul+0.198
cel+0.196
 Thomas+0.193
Michael+0.187
Neg logit contributions
 tad-0.218
ill-0.214
ectar-0.203
ib-0.202
 bead-0.202
Token for
Token ID329
Feature activation+8.03e-01

Pos logit contributions
cel+0.156
 Junior+0.150
 ambul+0.142
uf+0.139
 operating+0.138
Neg logit contributions
 Jill-0.167
 tad-0.163
ill-0.162
 Sophia-0.159
 Pam-0.149
Token dinner
Token ID8073
Feature activation+2.61e+00

Pos logit contributions
cel+0.029
 Junior+0.028
 ambul+0.026
uf+0.026
 Thomas+0.026
Neg logit contributions
 Jill-0.024
 tad-0.024
ill-0.024
 Sophia-0.023
 Pam-0.021
Token.
Token ID13
Feature activation+5.60e-01

Pos logit contributions
cel+0.086
 Junior+0.082
 ambul+0.078
uf+0.077
Brown+0.076
Neg logit contributions
 Jill-0.087
ill-0.085
 Sophia-0.083
 tad-0.081
 Pam-0.075
Token Lily
Token ID20037
Feature activation-1.62e+00

Pos logit contributions
cel+0.022
 Junior+0.021
 Thomas+0.020
 operating+0.020
 tid+0.020
Neg logit contributions
 tad-0.015
ill-0.015
 Jill-0.014
 Sophia-0.014
ulu-0.013
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
cel+0.007
 Junior+0.006
 Thomas+0.006
 operating+0.006
uf+0.006
Neg logit contributions
 tad-0.007
ill-0.006
 crab-0.006
 chance-0.006
 shell-0.006
Token
Token ID198
Feature activation-2.06e-01

Pos logit contributions
 tad+0.007
 Jill+0.007
ill+0.006
 Sophia+0.006
 Pam+0.006
Neg logit contributions
 Junior-0.007
cel-0.007
uf-0.007
 Thomas-0.007
Brown-0.007
Token"
Token ID1
Feature activation+3.15e-02

Pos logit contributions
 Jill+0.006
 tad+0.006
ill+0.006
 Sophia+0.006
 Pam+0.006
Neg logit contributions
cel-0.007
 Junior-0.007
 ambul-0.007
uf-0.006
 operating-0.006
TokenHello
Token ID15496
Feature activation+6.66e-01

Pos logit contributions
cel+0.001
 Junior+0.001
Brown+0.001
 ambul+0.001
uf+0.001
Neg logit contributions
 Jill-0.001
ill-0.001
 tad-0.001
 Sophia-0.001
 Pam-0.001
Token,
Token ID11
Feature activation+1.76e+00

Pos logit contributions
cel+0.024
 ambul+0.023
hers+0.022
 Junior+0.022
Brown+0.022
Neg logit contributions
 Jill-0.021
 Sophia-0.021
ill-0.020
 tad-0.020
 Sue-0.019
Token caul
Token ID46087
Feature activation+6.59e+00

Pos logit contributions
cel+0.062
 Junior+0.058
 ambul+0.058
hers+0.056
 operating+0.056
Neg logit contributions
 Jill-0.055
 tad-0.054
 Sophia-0.053
ill-0.052
 Pam-0.049
Tokeniflower
Token ID42642
Feature activation+5.63e+00

Pos logit contributions
 ambul+0.222
 Michael+0.210
 Thomas+0.206
 Junior+0.204
cel+0.200
Neg logit contributions
 tad-0.218
 flower-0.208
ill-0.206
 bloom-0.205
ectar-0.201
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
cel+0.185
 Junior+0.180
 ambul+0.175
Thomas+0.168
Brown+0.165
Neg logit contributions
 Jill-0.193
 tad-0.187
ill-0.187
 Sophia-0.181
 flower-0.171
Token I
Token ID314
Feature activation-5.38e-01

Pos logit contributions
cel+0.042
 Junior+0.040
 ambul+0.038
uf+0.037
 operating+0.037
Neg logit contributions
 Jill-0.035
 tad-0.035
ill-0.035
 Sophia-0.034
 Pam-0.031
Token am
Token ID716
Feature activation+3.39e-01

Pos logit contributions
 Jill+0.017
 Sophia+0.017
 tad+0.017
ill+0.017
 Meg+0.016
Neg logit contributions
cel-0.018
 Junior-0.017
 tid-0.016
uf-0.016
 operating-0.016
Token Lily
Token ID20037
Feature activation-4.26e-01

Pos logit contributions
cel+0.012
 Junior+0.011
 ambul+0.011
uf+0.011
 tid+0.011
Neg logit contributions
ill-0.010
 Jill-0.010
 tad-0.010
 Sophia-0.010
 Pam-0.009
Token then
Token ID788
Feature activation-4.03e-02

Pos logit contributions
cel+0.049
 Junior+0.047
 ambul+0.047
uf+0.046
 operating+0.046
Neg logit contributions
 Jill-0.032
ill-0.031
 tad-0.030
 Sophia-0.030
 Pam-0.026
Token scoop
Token ID23529
Feature activation-1.05e+00

Pos logit contributions
 Jill+0.001
 tad+0.001
ill+0.001
 Sophia+0.001
 Pam+0.001
Neg logit contributions
cel-0.001
 Junior-0.001
 ambul-0.001
uf-0.001
 operating-0.001
Tokened
Token ID276
Feature activation-3.14e-01

Pos logit contributions
 tad+0.032
 Jill+0.031
 Sophia+0.030
ill+0.030
 Pam+0.029
Neg logit contributions
 Junior-0.035
cel-0.034
 tid-0.032
hers-0.032
 Thomas-0.031
Token it
Token ID340
Feature activation-7.06e-01

Pos logit contributions
 Jill+0.009
 tad+0.009
ill+0.008
 Sophia+0.008
 Pam+0.008
Neg logit contributions
cel-0.012
 Junior-0.011
uf-0.011
 ambul-0.011
 operating-0.011
Token into
Token ID656
Feature activation+1.10e+00

Pos logit contributions
 tad+0.025
 Jill+0.025
 Sophia+0.024
ill+0.024
 Pam+0.023
Neg logit contributions
cel-0.021
 Junior-0.021
 tid-0.019
 operating-0.019
hers-0.019
Token pans
Token ID36209
Feature activation+6.58e+00

Pos logit contributions
 Junior+0.034
cel+0.033
 tid+0.031
 ambul+0.031
 Thomas+0.031
Neg logit contributions
 tad-0.036
ill-0.036
 Jill-0.035
 Sophia-0.035
 Pam-0.033
Token and
Token ID290
Feature activation+5.31e-01

Pos logit contributions
cel+0.275
 Junior+0.259
 ambul+0.256
uf+0.256
Brown+0.252
Neg logit contributions
ill-0.172
 Jill-0.168
 tad-0.154
 Sophia-0.153
 Pam-0.131
Token put
Token ID1234
Feature activation+7.05e-01

Pos logit contributions
cel+0.019
 Junior+0.018
uf+0.018
 ambul+0.018
 operating+0.017
Neg logit contributions
 Jill-0.016
 tad-0.016
ill-0.015
 Sophia-0.015
 Pam-0.014
Token it
Token ID340
Feature activation-4.38e-01

Pos logit contributions
cel+0.027
 Junior+0.027
 ambul+0.025
 tid+0.025
 Thomas+0.025
Neg logit contributions
 Jill-0.019
 tad-0.019
ill-0.018
 Sophia-0.018
 Pam-0.016
Token in
Token ID287
Feature activation-6.90e-02

Pos logit contributions
 tad+0.018
 Jill+0.018
 Sophia+0.017
ill+0.017
 Pam+0.016
Neg logit contributions
cel-0.011
 Junior-0.011
 tid-0.010
 ambul-0.010
hers-0.010
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
 tad+0.002
 Jill+0.002
ill+0.002
 Sophia+0.002
 Pam+0.002
Neg logit contributions
 Junior-0.002
cel-0.002
 ambul-0.002
 tid-0.002
 Thomas-0.002
Token.
Token ID13
Feature activation+9.00e-04

Pos logit contributions
cel+0.050
 Junior+0.047
 tid+0.044
 operating+0.043
 ambul+0.043
Neg logit contributions
 tad-0.068
 Jill-0.068
 Sophia-0.066
ill-0.065
 Pam-0.063
Token She
Token ID1375
Feature activation+3.86e-01

Pos logit contributions
cel+0.000
 Junior+0.000
 Thomas+0.000
 operating+0.000
uf+0.000
Neg logit contributions
 tad-0.000
ill-0.000
 Jill-0.000
 Sophia-0.000
ulu-0.000
Token said
Token ID531
Feature activation-3.49e-01

Pos logit contributions
cel+0.013
 Junior+0.013
 ambul+0.012
uf+0.012
 operating+0.012
Neg logit contributions
 Jill-0.012
 tad-0.012
ill-0.012
 Sophia-0.012
 Pam-0.011
Token it
Token ID340
Feature activation-1.68e+00

Pos logit contributions
 tad+0.011
 Jill+0.010
ill+0.010
 Sophia+0.010
 Pam+0.009
Neg logit contributions
cel-0.012
 Junior-0.012
 ambul-0.011
uf-0.011
 operating-0.011
Token was
Token ID373
Feature activation-9.05e-01

Pos logit contributions
 Jill+0.050
 tad+0.049
ill+0.048
 Sophia+0.047
 Pam+0.043
Neg logit contributions
cel-0.060
 Junior-0.059
uf-0.056
 ambul-0.056
 operating-0.055
Token caul
Token ID46087
Feature activation+6.58e+00

Pos logit contributions
 Jill+0.027
 tad+0.025
 Sophia+0.025
ill+0.024
 Pam+0.023
Neg logit contributions
cel-0.035
 Junior-0.032
 ambul-0.032
 operating-0.031
Brown-0.031
Tokeniflower
Token ID42642
Feature activation+5.51e+00

Pos logit contributions
 ambul+0.218
 Michael+0.206
 Thomas+0.205
 Junior+0.204
cel+0.203
Neg logit contributions
 tad-0.219
ill-0.210
ectar-0.199
 flower-0.197
ulu-0.197
Token mash
Token ID30407
Feature activation+1.17e+00

Pos logit contributions
cel+0.178
 Junior+0.170
 ambul+0.162
uf+0.159
 operating+0.157
Neg logit contributions
 Jill-0.187
 tad-0.182
ill-0.181
 Sophia-0.177
 Pam-0.165
Token and
Token ID290
Feature activation-9.18e-01

Pos logit contributions
cel+0.037
 Junior+0.036
 ambul+0.034
uf+0.033
 tid+0.033
Neg logit contributions
 Jill-0.039
 tad-0.039
ill-0.038
 Sophia-0.038
 Pam-0.036
Token it
Token ID340
Feature activation-9.75e-01

Pos logit contributions
 Jill+0.029
ill+0.029
 tad+0.029
 Sophia+0.028
 Pam+0.026
Neg logit contributions
cel-0.030
 Junior-0.030
 ambul-0.028
 tid-0.028
 operating-0.027
Token was
Token ID373
Feature activation-9.85e-02

Pos logit contributions
 Jill+0.032
 tad+0.031
ill+0.031
 Sophia+0.031
 Pam+0.029
Neg logit contributions
cel-0.032
 Junior-0.031
 ambul-0.029
uf-0.029
 operating-0.028
Token.
Token ID13
Feature activation+4.38e-01

Pos logit contributions
cel+0.076
 Junior+0.075
 ambul+0.069
 tid+0.068
uf+0.068
Neg logit contributions
 tad-0.112
 Jill-0.111
ill-0.108
 Sophia-0.106
 Pam-0.103
Token
Token ID198
Feature activation-2.24e-02

Pos logit contributions
 Junior+0.014
cel+0.014
 Thomas+0.013
uf+0.013
 ambul+0.013
Neg logit contributions
 tad-0.014
ill-0.014
 Jill-0.013
 Sophia-0.013
 bou-0.012
Token
Token ID198
Feature activation-4.92e-02

Pos logit contributions
 Jill+0.001
 tad+0.001
ill+0.001
 Sophia+0.001
 Pam+0.001
Neg logit contributions
cel-0.001
 Junior-0.001
 ambul-0.001
uf-0.001
 operating-0.001
Token"
Token ID1
Feature activation+1.99e-01

Pos logit contributions
 Jill+0.002
 tad+0.002
ill+0.002
 Sophia+0.002
 Pam+0.001
Neg logit contributions
cel-0.002
 Junior-0.002
 ambul-0.002
uf-0.001
 operating-0.001
TokenBro
Token ID15783
Feature activation+4.31e+00

Pos logit contributions
cel+0.006
 Junior+0.006
Brown+0.006
 ambul+0.006
hers+0.006
Neg logit contributions
 Jill-0.007
ill-0.007
 tad-0.007
 Sophia-0.006
 Pam-0.006
Tokenccoli
Token ID34544
Feature activation+6.53e+00

Pos logit contributions
cel+0.092
 Junior+0.088
 ambul+0.080
 operating+0.080
 Thomas+0.078
Neg logit contributions
 Jill-0.195
ill-0.190
 tad-0.186
 Sophia-0.183
 Pam-0.176
Token is
Token ID318
Feature activation+7.43e-01

Pos logit contributions
cel+0.269
 Junior+0.248
 ambul+0.245
uf+0.244
Brown+0.240
Neg logit contributions
 Jill-0.167
ill-0.166
 tad-0.159
 Sophia-0.153
 frog-0.145
Token not
Token ID407
Feature activation+9.00e-01

Pos logit contributions
cel+0.029
 Junior+0.027
Thomas+0.026
Brown+0.026
 ambul+0.026
Neg logit contributions
 Jill-0.022
 tad-0.021
 Sophia-0.020
 Pam-0.019
 grape-0.019
Token our
Token ID674
Feature activation+4.98e-01

Pos logit contributions
cel+0.036
 Junior+0.033
 ambul+0.032
 operating+0.032
uf+0.032
Neg logit contributions
 Jill-0.025
 tad-0.024
 Sophia-0.023
ill-0.023
 Pam-0.021
Token friend
Token ID1545
Feature activation-1.68e+00

Pos logit contributions
 Junior+0.018
cel+0.018
 ambul+0.017
uf+0.017
 tid+0.017
Neg logit contributions
 Jill-0.014
ill-0.014
 tad-0.014
 Sophia-0.013
 Pam-0.012
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
 Jill+0.065
 tad+0.063
ill+0.063
 Sophia+0.062
 Pam+0.059
Neg logit contributions
cel-0.046
 Junior-0.044
 ambul-0.041
uf-0.040
 operating-0.040
Token had
Token ID550
Feature activation-1.10e-01

Pos logit contributions
 Jill+0.063
 tad+0.062
ill+0.061
 Sophia+0.060
 Pam+0.057
Neg logit contributions
cel-0.055
 Junior-0.053
 tid-0.049
 ambul-0.049
 Thomas-0.049
Token saved
Token ID7448
Feature activation+1.66e+00

Pos logit contributions
 tad+0.004
 Jill+0.004
ill+0.004
 Sophia+0.004
 Pam+0.003
Neg logit contributions
cel-0.003
 Junior-0.003
 tid-0.003
 ambul-0.003
 Thomas-0.003
Token him
Token ID683
Feature activation+1.45e+00

Pos logit contributions
cel+0.053
 Junior+0.050
 ambul+0.047
uf+0.047
 operating+0.046
Neg logit contributions
 Jill-0.058
 tad-0.056
ill-0.056
 Sophia-0.055
 Pam-0.052
Token.
Token ID13
Feature activation+6.18e-01

Pos logit contributions
cel+0.042
 Junior+0.041
 ambul+0.038
uf+0.038
 operating+0.038
Neg logit contributions
 Jill-0.052
 tad-0.052
ill-0.051
 Sophia-0.050
 Pam-0.047
Token<|endoftext|>
Token ID50256
Feature activation+9.72e-01

Pos logit contributions
 Junior+0.022
cel+0.022
 Thomas+0.020
uf+0.020
 ambul+0.020
Neg logit contributions
 tad-0.018
ill-0.018
 Jill-0.018
 Sophia-0.017
 Pam-0.016
TokenCharl
Token ID24453
Feature activation+6.52e+00

Pos logit contributions
cel+0.029
 Junior+0.028
 Thomas+0.027
Thomas+0.026
 operating+0.025
Neg logit contributions
 tad-0.034
 Jill-0.034
 Sophia-0.033
ill-0.032
 Pam-0.032
Tokenotte
Token ID11404
Feature activation+2.52e+00

Pos logit contributions
cel+0.195
 ambul+0.187
 gau+0.183
 kil+0.183
 why+0.180
Neg logit contributions
ulu-0.189
 Sophia-0.187
ill-0.187
 tad-0.185
aney-0.173
Token and
Token ID290
Feature activation+9.99e-01

Pos logit contributions
cel+0.097
 ambul+0.093
 Junior+0.093
hers+0.090
 operating+0.089
Neg logit contributions
 Jill-0.068
 tad-0.064
 Sophia-0.064
ill-0.064
 Pam-0.056
Token her
Token ID607
Feature activation+1.70e-01

Pos logit contributions
cel+0.036
 Junior+0.035
 ambul+0.033
uf+0.033
 Thomas+0.032
Neg logit contributions
 Jill-0.030
 tad-0.029
ill-0.029
 Sophia-0.028
 Pam-0.026
Token mom
Token ID1995
Feature activation+8.50e-01

Pos logit contributions
cel+0.007
 ambul+0.006
 Junior+0.006
 operating+0.006
hers+0.006
Neg logit contributions
 Jill-0.005
 tad-0.005
 Sophia-0.004
ill-0.004
 Pam-0.004
Token were
Token ID547
Feature activation+3.08e+00

Pos logit contributions
cel+0.031
 Junior+0.029
 operating+0.028
 ambul+0.028
uf+0.028
Neg logit contributions
 Jill-0.026
ill-0.025
 Sophia-0.024
 tad-0.022
 Pam-0.021
Token and
Token ID290
Feature activation+1.66e-01

Pos logit contributions
 tad+0.003
ill+0.003
 Jill+0.003
 Sophia+0.003
ulu+0.003
Neg logit contributions
cel-0.003
 Junior-0.003
 ambul-0.003
 tid-0.003
 operating-0.003
Token to
Token ID284
Feature activation+4.15e-03

Pos logit contributions
 Junior+0.006
cel+0.006
 tid+0.005
 ambul+0.005
 operating+0.005
Neg logit contributions
ill-0.005
 tad-0.005
 Jill-0.005
 Sophia-0.005
ulu-0.004
Token only
Token ID691
Feature activation+1.39e-01

Pos logit contributions
cel+0.000
 Junior+0.000
 ambul+0.000
uf+0.000
 operating+0.000
Neg logit contributions
 Jill-0.000
 tad-0.000
ill-0.000
 Sophia-0.000
 Pam-0.000
Token have
Token ID423
Feature activation+5.81e-01

Pos logit contributions
cel+0.005
 Junior+0.005
 ambul+0.005
 operating+0.005
uf+0.005
Neg logit contributions
 Jill-0.004
 tad-0.004
ill-0.004
 Sophia-0.004
 Pam-0.004
Token mass
Token ID2347
Feature activation+4.00e+00

Pos logit contributions
 Junior+0.019
cel+0.019
 ambul+0.018
 tid+0.018
 operating+0.017
Neg logit contributions
 tad-0.019
ill-0.018
 Jill-0.018
 Sophia-0.018
 Pam-0.016
Tokenages
Token ID1095
Feature activation+6.52e+00

Pos logit contributions
cel+0.120
 ambul+0.112
 Junior+0.111
brate+0.108
uf+0.105
Neg logit contributions
ashes-0.125
ill-0.123
 tad-0.123
ived-0.122
 Jill-0.121
Token from
Token ID422
Feature activation+2.07e+00

Pos logit contributions
cel+0.206
 Junior+0.199
 ambul+0.187
uf+0.184
 operating+0.182
Neg logit contributions
 Jill-0.222
 tad-0.217
ill-0.216
 Sophia-0.212
 Pam-0.198
Token people
Token ID661
Feature activation+2.24e+00

Pos logit contributions
cel+0.078
 Junior+0.076
 ambul+0.073
uf+0.072
 operating+0.071
Neg logit contributions
 Jill-0.058
 tad-0.057
ill-0.056
 Sophia-0.055
 Pam-0.050
Token she
Token ID673
Feature activation+1.34e+00

Pos logit contributions
cel+0.071
 Junior+0.067
 ambul+0.064
uf+0.063
Brown+0.061
Neg logit contributions
 Jill-0.076
ill-0.074
 Sophia-0.072
 tad-0.071
 Pam-0.066
Token trusted
Token ID13467
Feature activation+6.69e-01

Pos logit contributions
cel+0.045
 Junior+0.043
 ambul+0.041
uf+0.040
 operating+0.040
Neg logit contributions
 Jill-0.044
 tad-0.043
ill-0.042
 Sophia-0.042
 Pam-0.039
Token.
Token ID13
Feature activation+6.63e-01

Pos logit contributions
cel+0.022
 Junior+0.022
 ambul+0.020
uf+0.020
 operating+0.020
Neg logit contributions
 tad-0.021
 Jill-0.021
ill-0.021
 Sophia-0.020
 Pam-0.019
Token mom
Token ID1995
Feature activation+1.20e+00

Pos logit contributions
 Jill+0.016
 tad+0.015
ill+0.015
 Sophia+0.015
 Pam+0.014
Neg logit contributions
cel-0.021
 Junior-0.020
 ambul-0.019
uf-0.019
 operating-0.019
Token made
Token ID925
Feature activation+1.56e+00

Pos logit contributions
cel+0.043
 Junior+0.041
uf+0.040
 ambul+0.040
 operating+0.039
Neg logit contributions
 Jill-0.036
ill-0.035
 tad-0.034
 Sophia-0.034
 Pam-0.031
Token them
Token ID606
Feature activation-3.34e-01

Pos logit contributions
 Junior+0.043
 tid+0.042
 Thomas+0.040
hers+0.038
 operating+0.038
Neg logit contributions
ill-0.059
 tad-0.058
 Sophia-0.053
alk-0.053
Meg-0.053
Token cocoa
Token ID35845
Feature activation+3.79e+00

Pos logit contributions
 tad+0.011
 Sophia+0.011
 Jill+0.011
ill+0.011
 Pam+0.010
Neg logit contributions
 tid-0.010
cel-0.010
 Junior-0.010
 ambul-0.010
 operating-0.010
Token with
Token ID351
Feature activation-7.80e-01

Pos logit contributions
cel+0.108
 Junior+0.103
 tid+0.101
 ambul+0.099
hers+0.097
Neg logit contributions
 tad-0.140
 Jill-0.137
 Sophia-0.133
ill-0.131
 Pam-0.128
Token marsh
Token ID22397
Feature activation+6.52e+00

Pos logit contributions
 Jill+0.022
 tad+0.022
ill+0.021
 Sophia+0.021
 Pam+0.019
Neg logit contributions
cel-0.029
 Junior-0.028
 ambul-0.027
uf-0.027
 operating-0.026
Tokenm
Token ID76
Feature activation+3.55e-01

Pos logit contributions
cel+0.226
 Junior+0.221
 ambul+0.209
uf+0.207
 operating+0.202
Neg logit contributions
 Jill-0.200
 tad-0.198
 Sophia-0.190
ill-0.190
 Pam-0.179
Tokenallows
Token ID47205
Feature activation+3.25e+00

Pos logit contributions
cel+0.012
 Junior+0.011
 ambul+0.011
uf+0.011
 Thomas+0.010
Neg logit contributions
 Jill-0.012
ill-0.012
 tad-0.012
 Sophia-0.011
 Pam-0.011
Token.
Token ID13
Feature activation+9.39e-02

Pos logit contributions
 Junior+0.076
cel+0.076
 tid+0.072
 ambul+0.072
hers+0.070
Neg logit contributions
 tad-0.135
 Jill-0.130
 Sophia-0.128
 Pam-0.124
ill-0.123
Token The
Token ID383
Feature activation-3.72e-01

Pos logit contributions
cel+0.003
 Junior+0.003
 tid+0.003
 Thomas+0.003
 operating+0.003
Neg logit contributions
ill-0.003
 tad-0.003
ulu-0.003
 crab-0.003
ila-0.003
Token boy
Token ID2933
Feature activation-1.57e+00

Pos logit contributions
ill+0.010
 Jill+0.010
 tad+0.010
 Sophia+0.010
 Pam+0.009
Neg logit contributions
 Junior-0.014
cel-0.014
 ambul-0.014
 tid-0.014
 operating-0.014
Token It
Token ID632
Feature activation-1.18e+00

Pos logit contributions
cel+0.004
 Junior+0.004
 ambul+0.003
uf+0.003
 operating+0.003
Neg logit contributions
 Jill-0.003
 tad-0.003
ill-0.003
 Sophia-0.003
 Pam-0.003
Token's
Token ID338
Feature activation+3.42e-01

Pos logit contributions
 Jill+0.043
 tad+0.043
ill+0.042
 Sophia+0.042
 Pam+0.039
Neg logit contributions
cel-0.034
 Junior-0.033
 ambul-0.031
uf-0.030
 operating-0.030
Token important
Token ID1593
Feature activation+1.53e-01

Pos logit contributions
cel+0.012
 Junior+0.011
uf+0.011
Thomas+0.011
 ambul+0.011
Neg logit contributions
 Jill-0.011
 tad-0.010
 Sophia-0.010
ill-0.010
 Pam-0.009
Token to
Token ID284
Feature activation-5.30e-01

Pos logit contributions
cel+0.005
 Junior+0.005
 ambul+0.005
uf+0.005
 operating+0.005
Neg logit contributions
 Jill-0.005
 tad-0.005
ill-0.004
 Sophia-0.004
 Pam-0.004
Token stay
Token ID2652
Feature activation+6.95e-01

Pos logit contributions
 Jill+0.016
 tad+0.016
ill+0.015
 Sophia+0.015
 Pam+0.014
Neg logit contributions
cel-0.019
 Junior-0.018
 ambul-0.017
uf-0.017
 operating-0.017
Token hyd
Token ID7409
Feature activation+6.50e+00

Pos logit contributions
cel+0.025
 Junior+0.024
 ambul+0.024
uf+0.023
Michael+0.023
Neg logit contributions
 Jill-0.021
 tad-0.019
 Sophia-0.019
ill-0.018
 Pam-0.018
Tokenrated
Token ID4111
Feature activation+1.40e+00

Pos logit contributions
cel+0.177
 Junior+0.170
 ambul+0.158
uf+0.155
 operating+0.154
Neg logit contributions
 Jill-0.252
 tad-0.248
ill-0.245
 Sophia-0.242
 Pam-0.229
Token and
Token ID290
Feature activation+5.13e-01

Pos logit contributions
cel+0.042
 Junior+0.040
 ambul+0.037
uf+0.037
 operating+0.036
Neg logit contributions
 Jill-0.050
 tad-0.049
ill-0.049
 Sophia-0.048
 Pam-0.045
Token healthy
Token ID5448
Feature activation+2.35e+00

Pos logit contributions
cel+0.017
 Junior+0.016
 tid+0.016
 operating+0.016
 ambul+0.016
Neg logit contributions
ill-0.017
 Jill-0.016
 tad-0.016
 Sophia-0.016
 Pam-0.015
Token,"
Token ID553
Feature activation-2.61e-01

Pos logit contributions
cel+0.087
 Junior+0.082
uf+0.078
Brown+0.078
Thomas+0.078
Neg logit contributions
 Jill-0.071
 tad-0.068
 Sophia-0.066
ill-0.065
ashes-0.059
Token mom
Token ID1995
Feature activation-2.19e-01

Pos logit contributions
ill+0.009
 tad+0.008
ashes+0.007
 Sophia+0.007
alk+0.007
Neg logit contributions
 Junior-0.009
 ambul-0.008
cel-0.008
 tid-0.008
 Thomas-0.008
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
 tad+0.054
 Sophia+0.051
 Jill+0.051
ill+0.048
Meg+0.048
Neg logit contributions
 tid-0.040
cel-0.039
hers-0.037
 Thomas-0.037
 operating-0.037
Token tried
Token ID3088
Feature activation-1.19e+00

Pos logit contributions
ill+0.037
 tad+0.037
ulu+0.034
 Jill+0.034
 Sophia+0.034
Neg logit contributions
cel-0.034
 tid-0.034
 Junior-0.033
 Thomas-0.032
 ambul-0.032
Token to
Token ID284
Feature activation-5.09e-01

Pos logit contributions
 tad+0.039
ill+0.036
 Jill+0.036
 Sophia+0.034
ulu+0.034
Neg logit contributions
cel-0.039
 Junior-0.038
 tid-0.038
 ambul-0.037
 Thomas-0.037
Token wrap
Token ID14441
Feature activation+1.76e-01

Pos logit contributions
 Jill+0.014
 tad+0.014
ill+0.013
 Sophia+0.013
 Pam+0.012
Neg logit contributions
cel-0.020
 Junior-0.019
 ambul-0.018
uf-0.018
 operating-0.018
Token the
Token ID262
Feature activation-1.76e+00

Pos logit contributions
 Junior+0.006
cel+0.006
 Thomas+0.006
 tid+0.005
hers+0.005
Neg logit contributions
 tad-0.005
ill-0.005
 Jill-0.005
 Sophia-0.005
 Goal-0.005
Token other
Token ID584
Feature activation-6.59e+00

Pos logit contributions
 Jill+0.052
ill+0.051
 Sophia+0.050
 tad+0.048
 Meg+0.047
Neg logit contributions
cel-0.063
 tid-0.060
 ambul-0.058
 operating-0.057
 Junior-0.057
Token lace
Token ID35938
Feature activation-3.08e+00

Pos logit contributions
 tad+0.206
 Jill+0.196
 Sophia+0.192
ill+0.187
 Pam+0.183
Neg logit contributions
cel-0.220
 tid-0.206
 Thomas-0.196
 ambul-0.195
 years-0.194
Token around
Token ID1088
Feature activation-1.74e+00

Pos logit contributions
 tad+0.135
Meg+0.125
 Sophia+0.125
 Jill+0.124
 Goal+0.123
Neg logit contributions
cel-0.063
 tid-0.060
 Thomas-0.056
 operating-0.055
 Junior-0.055
Token it
Token ID340
Feature activation-1.63e+00

Pos logit contributions
 tad+0.058
ill+0.052
 Jill+0.050
 Sophia+0.050
 Goal+0.050
Neg logit contributions
 Junior-0.055
cel-0.055
 Thomas-0.052
 tid-0.051
 ambul-0.051
Token.
Token ID13
Feature activation-7.86e-02

Pos logit contributions
 tad+0.070
 Jill+0.066
 Sophia+0.065
ill+0.064
 Pam+0.063
Neg logit contributions
cel-0.038
 Junior-0.035
 tid-0.035
 Thomas-0.033
 operating-0.032
Token But
Token ID887
Feature activation+2.98e-01

Pos logit contributions
 tad+0.003
ill+0.002
 Jill+0.002
ulu+0.002
 bou+0.002
Neg logit contributions
 Junior-0.003
cel-0.002
 Michael-0.002
uf-0.002
 ambul-0.002
Token Sometimes
Token ID8975
Feature activation-1.36e+00

Pos logit contributions
 tad+0.001
ill+0.001
 Jill+0.001
ulu+0.001
 Pam+0.001
Neg logit contributions
 Junior-0.001
 ambul-0.001
cel-0.001
 Michael-0.001
 operating-0.001
Token they
Token ID484
Feature activation-8.17e-01

Pos logit contributions
 Jill+0.029
 Sophia+0.026
 tad+0.025
ill+0.024
 Sue+0.024
Neg logit contributions
cel-0.063
uf-0.060
 Junior-0.059
hers-0.057
ind-0.057
Token repeat
Token ID9585
Feature activation-2.05e+00

Pos logit contributions
 Jill+0.022
ill+0.021
 tad+0.020
 Sophia+0.020
 Pam+0.019
Neg logit contributions
cel-0.032
 Junior-0.031
 ambul-0.030
uf-0.030
Brown-0.029
Token what
Token ID644
Feature activation+6.54e-01

Pos logit contributions
 Jill+0.070
 tad+0.069
ill+0.068
 Sophia+0.067
 Pam+0.063
Neg logit contributions
cel-0.064
 Junior-0.062
 ambul-0.059
uf-0.058
 operating-0.057
Token the
Token ID262
Feature activation-1.96e+00

Pos logit contributions
cel+0.023
 Junior+0.023
 ambul+0.022
 Thomas+0.022
 operating+0.022
Neg logit contributions
 tad-0.019
ill-0.019
 Jill-0.019
 Sophia-0.018
 Pam-0.017
Token other
Token ID584
Feature activation-6.45e+00

Pos logit contributions
ill+0.060
 Jill+0.059
 tad+0.058
 Sophia+0.057
 Pam+0.053
Neg logit contributions
cel-0.069
 ambul-0.066
 Junior-0.066
 tid-0.063
 Thomas-0.063
Token one
Token ID530
Feature activation-2.89e+00

Pos logit contributions
 Jill+0.196
 tad+0.186
ill+0.185
 Sophia+0.184
 Pam+0.168
Neg logit contributions
cel-0.232
 Junior-0.228
uf-0.214
 operating-0.209
 ambul-0.208
Token says
Token ID1139
Feature activation-2.28e+00

Pos logit contributions
 tad+0.105
ill+0.103
 Jill+0.102
 Sophia+0.098
 Pam+0.095
Neg logit contributions
cel-0.080
 Junior-0.079
 ambul-0.076
 tid-0.075
 operating-0.074
Token,
Token ID11
Feature activation-1.52e+00

Pos logit contributions
 tad+0.091
 Jill+0.086
ill+0.085
 Sophia+0.083
 Pam+0.081
Neg logit contributions
cel-0.060
 Junior-0.056
 ambul-0.054
 Thomas-0.053
 operating-0.051
Token like
Token ID588
Feature activation-2.98e-01

Pos logit contributions
ill+0.048
 Jill+0.048
 tad+0.047
 Sophia+0.046
 Pam+0.043
Neg logit contributions
cel-0.051
 Junior-0.049
 ambul-0.048
 Thomas-0.045
 operating-0.045
Token a
Token ID257
Feature activation-6.01e-01

Pos logit contributions
 tad+0.010
ill+0.010
 Jill+0.009
 Sophia+0.009
 Pam+0.009
Neg logit contributions
 ambul-0.009
 Junior-0.009
cel-0.009
 Thomas-0.009
 tid-0.009
Token,
Token ID11
Feature activation-8.30e-01

Pos logit contributions
 Jill+0.024
 tad+0.024
 Sophia+0.024
ill+0.023
 Pam+0.023
Neg logit contributions
cel-0.018
 Junior-0.016
 tid-0.016
 ambul-0.016
 operating-0.016
Token and
Token ID290
Feature activation-6.79e-01

Pos logit contributions
ill+0.029
 tad+0.029
 Jill+0.029
 Sophia+0.028
 Pam+0.026
Neg logit contributions
 Junior-0.025
cel-0.024
 ambul-0.023
 Thomas-0.022
 tid-0.022
Token then
Token ID788
Feature activation-8.59e-01

Pos logit contributions
ill+0.023
 tad+0.022
 Jill+0.022
 Sophia+0.021
ulu+0.020
Neg logit contributions
 Junior-0.021
cel-0.021
 Thomas-0.020
 ambul-0.020
 tid-0.020
Token tie
Token ID9839
Feature activation-1.74e+00

Pos logit contributions
 tad+0.025
ill+0.025
 Jill+0.025
 Sophia+0.024
 Pam+0.022
Neg logit contributions
 Junior-0.030
cel-0.030
 Thomas-0.028
 ambul-0.028
 tid-0.027
Token the
Token ID262
Feature activation-2.45e+00

Pos logit contributions
ill+0.060
 tad+0.059
 Jill+0.055
 Sophia+0.055
ulu+0.053
Neg logit contributions
 Junior-0.053
 Thomas-0.051
cel-0.050
 tid-0.047
hers-0.047
Token other
Token ID584
Feature activation-6.35e+00

Pos logit contributions
ill+0.073
 Jill+0.071
 Sophia+0.067
 tad+0.064
 Pam+0.060
Neg logit contributions
cel-0.089
 Junior-0.088
 ambul-0.085
 operating-0.083
hers-0.083
Token end
Token ID886
Feature activation-2.12e+00

Pos logit contributions
 Jill+0.162
 tad+0.156
ill+0.156
 Sophia+0.151
 Pam+0.137
Neg logit contributions
cel-0.250
 Junior-0.245
 ambul-0.236
 tid-0.233
 operating-0.231
Token around
Token ID1088
Feature activation-4.17e-01

Pos logit contributions
 tad+0.067
 Sophia+0.065
ill+0.064
 Jill+0.064
 Goal+0.060
Neg logit contributions
 tid-0.064
cel-0.064
 Junior-0.063
 operating-0.062
 years-0.061
Token his
Token ID465
Feature activation-3.07e-01

Pos logit contributions
 tad+0.015
ill+0.014
 Jill+0.014
 Sophia+0.014
 Pam+0.013
Neg logit contributions
cel-0.012
 Junior-0.012
 Thomas-0.011
 ambul-0.011
 tid-0.011
Token ant
Token ID1885
Feature activation+1.44e+00

Pos logit contributions
 Jill+0.009
ill+0.008
 tad+0.008
 Sophia+0.008
 Pam+0.008
Neg logit contributions
cel-0.012
 Junior-0.011
 ambul-0.011
uf-0.011
 tid-0.011
Tokenler
Token ID1754
Feature activation+2.28e+00

Pos logit contributions
cel+0.040
 Junior+0.038
 ambul+0.036
 operating+0.035
 tid+0.035
Neg logit contributions
 Jill-0.054
 tad-0.053
ill-0.053
 Sophia-0.052
 Pam-0.049
Token sad
Token ID6507
Feature activation-8.31e-01

Pos logit contributions
 tid+0.003
 Junior+0.003
 why+0.003
 disturbing+0.003
cel+0.003
Neg logit contributions
ill-0.005
 tad-0.004
aney-0.004
 Jill-0.004
 Sophia-0.004
Token.
Token ID13
Feature activation-4.11e-02

Pos logit contributions
 Jill+0.028
ill+0.026
 tad+0.026
 Sophia+0.026
 Pam+0.024
Neg logit contributions
cel-0.028
 Junior-0.028
 ambul-0.026
uf-0.025
hers-0.025
Token She
Token ID1375
Feature activation+2.79e-01

Pos logit contributions
 tad+0.001
ill+0.001
ulu+0.001
 Jill+0.001
 Sophia+0.001
Neg logit contributions
cel-0.002
 Junior-0.002
 Thomas-0.002
uf-0.002
 operating-0.001
Token thought
Token ID1807
Feature activation-1.77e+00

Pos logit contributions
cel+0.009
 Junior+0.009
 ambul+0.009
uf+0.009
 operating+0.008
Neg logit contributions
 Jill-0.009
 tad-0.009
ill-0.009
 Sophia-0.008
 Pam-0.008
Token the
Token ID262
Feature activation-1.92e+00

Pos logit contributions
 tad+0.057
ill+0.056
 Jill+0.053
 Sophia+0.052
Meg+0.049
Neg logit contributions
 Junior-0.057
cel-0.057
 Thomas-0.055
 ambul-0.055
 operating-0.054
Token other
Token ID584
Feature activation-6.34e+00

Pos logit contributions
 Jill+0.069
ill+0.068
 Sophia+0.066
 tad+0.065
 Pam+0.062
Neg logit contributions
cel-0.057
 Junior-0.054
 ambul-0.053
uf-0.052
 operating-0.051
Token pengu
Token ID38373
Feature activation-2.28e-02

Pos logit contributions
 Jill+0.236
ill+0.230
 tad+0.228
 Sophia+0.225
 Pam+0.214
Neg logit contributions
cel-0.179
 Junior-0.169
 ambul-0.164
uf-0.159
 operating-0.157
Tokenins
Token ID1040
Feature activation-5.29e-01

Pos logit contributions
 Jill+0.001
 Sophia+0.001
ill+0.001
 tad+0.001
 Pam+0.001
Neg logit contributions
cel-0.001
 Junior-0.001
 ambul-0.001
hers-0.001
uf-0.001
Token would
Token ID561
Feature activation-1.58e+00

Pos logit contributions
 Jill+0.019
 Sophia+0.019
 tad+0.019
ill+0.018
 Pam+0.017
Neg logit contributions
cel-0.014
 Junior-0.014
 ambul-0.014
 tid-0.014
 Thomas-0.013
Token never
Token ID1239
Feature activation-1.57e+00

Pos logit contributions
 Jill+0.050
 tad+0.049
ill+0.048
 Sophia+0.048
 Pam+0.043
Neg logit contributions
cel-0.053
 Junior-0.052
 ambul-0.050
 operating-0.049
 tid-0.049
Token want
Token ID765
Feature activation-8.59e-01

Pos logit contributions
 tad+0.047
 Jill+0.047
 Sophia+0.046
ill+0.045
 Pam+0.042
Neg logit contributions
cel-0.053
 Junior-0.050
 tid-0.050
 operating-0.049
 ambul-0.049
Token did
Token ID750
Feature activation-1.67e+00

Pos logit contributions
cel+0.025
 Junior+0.024
 tid+0.023
 ambul+0.023
uf+0.022
Neg logit contributions
 Jill-0.025
 tad-0.025
ill-0.025
 Sophia-0.024
 Pam-0.023
Token not
Token ID407
Feature activation+2.33e-01

Pos logit contributions
 tad+0.070
ill+0.070
 Jill+0.069
 Sophia+0.065
 Pam+0.065
Neg logit contributions
cel-0.038
 Junior-0.036
 tid-0.034
hers-0.034
 ambul-0.034
Token care
Token ID1337
Feature activation+8.36e-01

Pos logit contributions
cel+0.009
 Junior+0.009
 ambul+0.009
 operating+0.008
uf+0.008
Neg logit contributions
 Jill-0.006
ill-0.006
 tad-0.006
 Sophia-0.006
 Pam-0.005
Token what
Token ID644
Feature activation+8.67e-01

Pos logit contributions
cel+0.023
 Junior+0.023
uf+0.021
hers+0.020
Brown+0.020
Neg logit contributions
 Jill-0.033
ill-0.031
 tad-0.030
 Sophia-0.030
 Pam-0.029
Token the
Token ID262
Feature activation-2.24e+00

Pos logit contributions
cel+0.035
 Junior+0.034
 ambul+0.032
uf+0.032
 operating+0.032
Neg logit contributions
 Jill-0.022
 tad-0.022
ill-0.022
 Sophia-0.021
 Pam-0.019
Token other
Token ID584
Feature activation-6.32e+00

Pos logit contributions
 Jill+0.072
 tad+0.071
ill+0.068
 Sophia+0.068
 Pam+0.064
Neg logit contributions
cel-0.076
 Junior-0.073
uf-0.069
 ambul-0.068
 operating-0.067
Token cats
Token ID11875
Feature activation-8.16e-01

Pos logit contributions
 Jill+0.214
 tad+0.212
ill+0.206
 Sophia+0.205
 Pam+0.193
Neg logit contributions
cel-0.200
 Junior-0.194
uf-0.181
 operating-0.178
 ambul-0.178
Token and
Token ID290
Feature activation-1.35e+00

Pos logit contributions
 Jill+0.027
 tad+0.027
ill+0.026
 Sophia+0.026
 Pam+0.024
Neg logit contributions
cel-0.026
 Junior-0.026
 ambul-0.024
 tid-0.024
 operating-0.024
Token mice
Token ID10693
Feature activation+1.89e+00

Pos logit contributions
 Jill+0.045
ill+0.044
 tad+0.044
 Sophia+0.043
 Pam+0.040
Neg logit contributions
cel-0.043
 Junior-0.042
 ambul-0.040
uf-0.039
 operating-0.039
Token said
Token ID531
Feature activation+4.58e-01

Pos logit contributions
cel+0.051
 Junior+0.048
 ambul+0.045
 tid+0.045
 operating+0.045
Neg logit contributions
 Jill-0.073
 tad-0.072
ill-0.070
 Sophia-0.069
 Pam-0.067
Token.
Token ID13
Feature activation+3.90e-01

Pos logit contributions
cel+0.014
 Junior+0.013
 ambul+0.013
 operating+0.012
 Thomas+0.012
Neg logit contributions
 tad-0.016
 Jill-0.016
ill-0.016
 Sophia-0.015
 Pam-0.015
Token shine
Token ID18340
Feature activation+6.50e-02

Pos logit contributions
 Jill+0.019
ill+0.019
 tad+0.019
 Sophia+0.018
 Pam+0.017
Neg logit contributions
cel-0.028
 Junior-0.027
 ambul-0.026
uf-0.025
 operating-0.025
Token as
Token ID355
Feature activation-7.87e-01

Pos logit contributions
cel+0.002
 Junior+0.002
uf+0.002
 ambul+0.002
 operating+0.002
Neg logit contributions
 Jill-0.002
ill-0.002
 tad-0.002
 Sophia-0.002
 Pam-0.002
Token bright
Token ID6016
Feature activation-2.15e-01

Pos logit contributions
 tad+0.026
ill+0.026
 Sophia+0.026
Meg+0.025
 Jill+0.024
Neg logit contributions
 Junior-0.023
 tid-0.022
cel-0.022
 Thomas-0.022
hers-0.022
Token as
Token ID355
Feature activation-1.84e+00

Pos logit contributions
 Jill+0.007
ill+0.007
 tad+0.007
 Sophia+0.006
 Pam+0.006
Neg logit contributions
cel-0.007
 Junior-0.007
uf-0.007
 ambul-0.007
Brown-0.007
Token the
Token ID262
Feature activation-2.62e+00

Pos logit contributions
 tad+0.062
ill+0.060
 Sophia+0.058
Meg+0.057
aney+0.056
Neg logit contributions
 Junior-0.055
 tid-0.052
 Thomas-0.052
cel-0.052
hers-0.050
Token other
Token ID584
Feature activation-6.31e+00

Pos logit contributions
 Jill+0.064
 flower+0.060
 chance+0.060
 tad+0.058
ill+0.056
Neg logit contributions
 Junior-0.108
uf-0.108
cel-0.104
Matthew-0.098
brate-0.097
Token lamps
Token ID32209
Feature activation+5.80e-01

Pos logit contributions
 Jill+0.189
ill+0.181
 tad+0.161
 flower+0.160
 chance+0.156
Neg logit contributions
 Junior-0.238
uf-0.232
cel-0.222
brate-0.218
hers-0.216
Token.
Token ID13
Feature activation-3.76e-01

Pos logit contributions
cel+0.016
 Junior+0.015
 tid+0.014
 ambul+0.014
hers+0.014
Neg logit contributions
 tad-0.022
 Jill-0.022
 Sophia-0.021
ill-0.021
 Pam-0.020
Token The
Token ID383
Feature activation-8.99e-01

Pos logit contributions
 tad+0.013
ill+0.012
 Sophia+0.012
 Jill+0.012
ulu+0.011
Neg logit contributions
 Junior-0.011
cel-0.011
 Thomas-0.011
 ambul-0.011
Brown-0.010
Token lamp
Token ID20450
Feature activation+4.76e-01

Pos logit contributions
 flower+0.020
 chance+0.019
 Jill+0.019
 tad+0.018
 lady+0.017
Neg logit contributions
 Junior-0.039
uf-0.038
cel-0.036
riger-0.035
Matthew-0.035
Token felt
Token ID2936
Feature activation-9.44e-01

Pos logit contributions
cel+0.015
 Junior+0.015
 ambul+0.014
 operating+0.014
 tid+0.014
Neg logit contributions
 Jill-0.016
 tad-0.016
ill-0.015
 Sophia-0.015
 Pam-0.014
Token who
Token ID508
Feature activation+1.16e+00

Pos logit contributions
 Jill+0.011
 Sophia+0.011
ill+0.010
 tad+0.010
 Meg+0.010
Neg logit contributions
cel-0.011
 Junior-0.009
Brown-0.009
uf-0.009
 operating-0.009
Token can
Token ID460
Feature activation-1.03e+00

Pos logit contributions
cel+0.039
 Junior+0.038
 ambul+0.035
 Thomas+0.035
 operating+0.035
Neg logit contributions
 Jill-0.037
 tad-0.037
ill-0.036
 Sophia-0.035
 Pam-0.033
Token repeat
Token ID9585
Feature activation-1.16e+00

Pos logit contributions
 Jill+0.027
ill+0.026
 tad+0.026
 Sophia+0.025
 Pam+0.023
Neg logit contributions
cel-0.041
 Junior-0.040
uf-0.038
 ambul-0.038
 operating-0.036
Token what
Token ID644
Feature activation+8.65e-01

Pos logit contributions
 Jill+0.037
 Sophia+0.035
 tad+0.034
ill+0.034
 Meg+0.032
Neg logit contributions
cel-0.040
 Junior-0.038
uf-0.036
 operating-0.036
 ambul-0.036
Token the
Token ID262
Feature activation-2.16e+00

Pos logit contributions
cel+0.028
 Junior+0.027
 ambul+0.026
 Thomas+0.025
 operating+0.025
Neg logit contributions
 Jill-0.028
 tad-0.028
ill-0.028
 Sophia-0.027
 Pam-0.026
Token other
Token ID584
Feature activation-6.30e+00

Pos logit contributions
ill+0.072
 Jill+0.065
aney+0.065
 tad+0.062
 Sophia+0.062
Neg logit contributions
cel-0.063
 ambul-0.063
 Thomas-0.062
 disturbing-0.061
 tid-0.060
Token person
Token ID1048
Feature activation-2.35e+00

Pos logit contributions
 tad+0.194
ill+0.194
 Jill+0.193
 Sophia+0.186
 Pam+0.178
Neg logit contributions
cel-0.205
 Junior-0.195
 ambul-0.194
 tid-0.187
 Thomas-0.186
Token says
Token ID1139
Feature activation-1.40e+00

Pos logit contributions
 Jill+0.075
 tad+0.074
ill+0.073
 Sophia+0.072
 Pam+0.067
Neg logit contributions
cel-0.079
 Junior-0.076
 ambul-0.072
uf-0.071
 operating-0.070
Token the
Token ID262
Feature activation-8.42e-01

Pos logit contributions
 Jill+0.052
 tad+0.052
ill+0.051
 Sophia+0.050
 Pam+0.048
Neg logit contributions
cel-0.039
 Junior-0.038
 ambul-0.035
uf-0.035
 operating-0.034
Token best
Token ID1266
Feature activation-2.03e+00

Pos logit contributions
ill+0.026
 Jill+0.025
 Sophia+0.024
 tad+0.024
aney+0.023
Neg logit contributions
 ambul-0.026
cel-0.025
 disturbing-0.025
 operating-0.025
 tid-0.025
Token,"
Token ID553
Feature activation-1.18e+00

Pos logit contributions
 tad+0.073
ill+0.072
 Jill+0.071
 Sophia+0.070
 Pam+0.066
Neg logit contributions
cel-0.055
 Junior-0.053
 ambul-0.052
 tid-0.050
 Thomas-0.050
Token would
Token ID561
Feature activation-4.72e-01

Pos logit contributions
cel+0.004
 Junior+0.004
 ambul+0.004
uf+0.004
 operating+0.004
Neg logit contributions
 Jill-0.005
 tad-0.004
ill-0.004
 Sophia-0.004
 Pam-0.004
Token never
Token ID1239
Feature activation-1.60e+00

Pos logit contributions
 Jill+0.016
 tad+0.016
ill+0.016
 Sophia+0.016
 Pam+0.015
Neg logit contributions
cel-0.015
 Junior-0.014
uf-0.013
 ambul-0.013
 operating-0.013
Token do
Token ID466
Feature activation-1.28e+00

Pos logit contributions
 Jill+0.046
 tad+0.045
ill+0.044
 Sophia+0.043
 Pam+0.040
Neg logit contributions
cel-0.059
 Junior-0.057
 ambul-0.055
 operating-0.054
uf-0.054
Token what
Token ID644
Feature activation+9.18e-01

Pos logit contributions
 Jill+0.045
 tad+0.045
ill+0.044
 Sophia+0.043
 Pam+0.041
Neg logit contributions
cel-0.039
 Junior-0.037
 ambul-0.035
 operating-0.034
uf-0.034
Token the
Token ID262
Feature activation-2.06e+00

Pos logit contributions
cel+0.035
 Junior+0.034
uf+0.032
 ambul+0.032
 operating+0.031
Neg logit contributions
 Jill-0.026
 tad-0.025
ill-0.024
 Sophia-0.024
 Pam-0.022
Token other
Token ID584
Feature activation-6.28e+00

Pos logit contributions
ill+0.061
 Jill+0.060
 tad+0.058
 Sophia+0.057
 Pam+0.053
Neg logit contributions
cel-0.075
 Junior-0.073
 ambul-0.070
uf-0.069
 operating-0.068
Token people
Token ID661
Feature activation-1.38e+00

Pos logit contributions
 Jill+0.165
 tad+0.162
 Sophia+0.154
ill+0.146
 Pam+0.138
Neg logit contributions
cel-0.249
 Junior-0.240
Brown-0.232
hers-0.229
 operating-0.227
Token wanted
Token ID2227
Feature activation-1.48e+00

Pos logit contributions
 Jill+0.046
 tad+0.045
ill+0.044
 Sophia+0.044
 Pam+0.041
Neg logit contributions
cel-0.045
 Junior-0.043
 ambul-0.041
uf-0.040
 operating-0.040
Token him
Token ID683
Feature activation-1.87e-01

Pos logit contributions
 tad+0.051
 Jill+0.049
ill+0.048
 Sophia+0.048
 Pam+0.045
Neg logit contributions
cel-0.045
 Junior-0.044
 ambul-0.043
 tid-0.042
 operating-0.042
Token to
Token ID284
Feature activation+8.59e-01

Pos logit contributions
 tad+0.007
 Jill+0.007
 Sophia+0.007
ill+0.007
 Pam+0.007
Neg logit contributions
cel-0.005
 Junior-0.005
 ambul-0.005
 tid-0.005
 operating-0.005
Token do
Token ID466
Feature activation-5.25e-01

Pos logit contributions
cel+0.028
 Junior+0.027
 ambul+0.026
 operating+0.026
uf+0.026
Neg logit contributions
 tad-0.028
 Jill-0.027
ill-0.027
 Sophia-0.027
 Pam-0.025
Token did
Token ID750
Feature activation-1.78e+00

Pos logit contributions
cel+0.016
 Junior+0.015
 ambul+0.015
uf+0.014
 operating+0.014
Neg logit contributions
 Jill-0.015
 tad-0.014
ill-0.014
 Sophia-0.014
 Pam-0.013
Token not
Token ID407
Feature activation+1.37e-01

Pos logit contributions
 tad+0.075
ill+0.074
 Jill+0.072
 Sophia+0.069
 Pam+0.068
Neg logit contributions
cel-0.042
 Junior-0.039
 tid-0.037
 ambul-0.037
 operating-0.037
Token care
Token ID1337
Feature activation+7.98e-01

Pos logit contributions
cel+0.005
 Junior+0.005
 ambul+0.005
uf+0.005
 operating+0.005
Neg logit contributions
 Jill-0.004
ill-0.004
 tad-0.004
 Sophia-0.003
 Pam-0.003
Token what
Token ID644
Feature activation+7.97e-01

Pos logit contributions
cel+0.021
 Junior+0.021
uf+0.019
 ambul+0.019
hers+0.018
Neg logit contributions
 Jill-0.032
ill-0.030
 tad-0.030
 Sophia-0.030
 Pam-0.028
Token the
Token ID262
Feature activation-2.19e+00

Pos logit contributions
cel+0.035
 Junior+0.035
 ambul+0.033
 operating+0.033
 Thomas+0.033
Neg logit contributions
 tad-0.017
 Jill-0.017
ill-0.016
 Sophia-0.015
 Pam-0.014
Token other
Token ID584
Feature activation-6.28e+00

Pos logit contributions
ill+0.074
 Jill+0.069
 tad+0.065
 Sophia+0.063
 Pam+0.062
Neg logit contributions
cel-0.072
 ambul-0.070
 Junior-0.068
 operating-0.068
 tid-0.067
Token kids
Token ID3988
Feature activation-2.38e+00

Pos logit contributions
 Jill+0.214
 tad+0.213
ill+0.211
 Sophia+0.200
 Pam+0.197
Neg logit contributions
cel-0.192
 Junior-0.183
 ambul-0.181
 tid-0.175
 operating-0.174
Token said
Token ID531
Feature activation-1.50e-02

Pos logit contributions
 tad+0.079
 Jill+0.078
ill+0.077
 Sophia+0.075
 Pam+0.071
Neg logit contributions
cel-0.076
 Junior-0.074
 tid-0.070
 ambul-0.069
 operating-0.068
Token.
Token ID13
Feature activation+2.16e-01

Pos logit contributions
 tad+0.001
 Jill+0.001
ill+0.001
 Sophia+0.001
 Pam+0.001
Neg logit contributions
cel-0.000
 Junior-0.000
 ambul-0.000
 tid-0.000
 operating-0.000
Token She
Token ID1375
Feature activation+3.69e-01

Pos logit contributions
cel+0.008
 Junior+0.008
 Thomas+0.008
 operating+0.008
uf+0.008
Neg logit contributions
 tad-0.006
ill-0.005
 Jill-0.005
 Sophia-0.005
ulu-0.005
Token knew
Token ID2993
Feature activation-4.12e-01

Pos logit contributions
cel+0.012
 Junior+0.012
 ambul+0.011
uf+0.011
 operating+0.011
Neg logit contributions
 Jill-0.012
 tad-0.012
ill-0.011
 Sophia-0.011
 Pam-0.010
Token He
Token ID679
Feature activation+5.57e-01

Pos logit contributions
 Jill+0.005
 tad+0.005
ill+0.005
 Sophia+0.005
 Pam+0.005
Neg logit contributions
cel-0.007
 Junior-0.007
 ambul-0.006
uf-0.006
 Thomas-0.006
Token punched
Token ID25436
Feature activation+6.21e-01

Pos logit contributions
cel+0.021
 Junior+0.020
uf+0.019
 ambul+0.019
 operating+0.019
Neg logit contributions
 Jill-0.016
 tad-0.015
ill-0.015
 Sophia-0.015
 Pam-0.014
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
cel+0.023
 Junior+0.022
 ambul+0.020
uf+0.020
 operating+0.020
Neg logit contributions
 Jill-0.019
 tad-0.018
ill-0.018
 Sophia-0.018
 Pam-0.017
Token kicked
Token ID12165
Feature activation-9.96e-01

Pos logit contributions
ill+0.052
 tad+0.051
 Jill+0.051
 Sophia+0.050
 Pam+0.046
Neg logit contributions
cel-0.049
 Junior-0.048
 ambul-0.045
 tid-0.044
 Thomas-0.044
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
 Jill+0.031
 tad+0.031
ill+0.031
 Sophia+0.030
 Pam+0.028
Neg logit contributions
cel-0.034
 Junior-0.033
 ambul-0.031
uf-0.031
 operating-0.030
Token other
Token ID584
Feature activation-6.25e+00

Pos logit contributions
 Jill+0.040
ill+0.039
 tad+0.039
 Sophia+0.038
 Pam+0.036
Neg logit contributions
cel-0.047
 Junior-0.045
 ambul-0.045
 operating-0.043
 tid-0.043
Token boy
Token ID2933
Feature activation-2.54e+00

Pos logit contributions
 tad+0.186
 Jill+0.183
 Sophia+0.180
ill+0.178
 Pam+0.167
Neg logit contributions
cel-0.216
 Junior-0.206
 ambul-0.203
 tid-0.197
 Thomas-0.196
Token.
Token ID13
Feature activation-2.80e-01

Pos logit contributions
 Jill+0.090
 tad+0.088
ill+0.088
 Sophia+0.086
 Pam+0.081
Neg logit contributions
cel-0.077
 Junior-0.074
 ambul-0.069
uf-0.068
 operating-0.067
Token Suddenly
Token ID24975
Feature activation-3.28e-01

Pos logit contributions
 tad+0.008
 Jill+0.008
ill+0.008
 Sophia+0.008
 Pam+0.007
Neg logit contributions
cel-0.010
 Junior-0.010
 ambul-0.009
 Thomas-0.009
uf-0.009
Token,
Token ID11
Feature activation-7.26e-01

Pos logit contributions
 Jill+0.011
 tad+0.010
 Sophia+0.010
ill+0.010
 Pam+0.010
Neg logit contributions
cel-0.012
uf-0.011
 Junior-0.011
ind-0.011
hers-0.010
Token the
Token ID262
Feature activation-9.01e-01

Pos logit contributions
ill+0.024
 tad+0.023
 Jill+0.023
 Sophia+0.023
 Pam+0.021
Neg logit contributions
 Junior-0.023
cel-0.022
 ambul-0.022
 Thomas-0.021
 operating-0.021
Token mad
Token ID8805
Feature activation-4.10e-02

Pos logit contributions
cel+0.018
 Junior+0.018
 ambul+0.017
 tid+0.017
 operating+0.017
Neg logit contributions
ill-0.016
 tad-0.016
 Jill-0.016
 Sophia-0.015
 Pam-0.014
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
 Jill+0.001
ill+0.001
 Sophia+0.001
 tad+0.001
 Pam+0.001
Neg logit contributions
cel-0.001
 Junior-0.001
 ambul-0.001
uf-0.001
Brown-0.001
Token They
Token ID1119
Feature activation+4.11e-01

Pos logit contributions
 Junior+0.005
cel+0.005
uf+0.005
 Thomas+0.005
 ambul+0.005
Neg logit contributions
ill-0.004
 tad-0.004
 Jill-0.004
 Sophia-0.004
 bou-0.004
Token think
Token ID892
Feature activation-1.25e+00

Pos logit contributions
cel+0.018
 Junior+0.017
 ambul+0.017
Brown+0.017
uf+0.016
Neg logit contributions
 Jill-0.010
 tad-0.009
 Sophia-0.009
ill-0.009
 Pam-0.009
Token the
Token ID262
Feature activation-1.68e+00

Pos logit contributions
 tad+0.038
ill+0.037
 Jill+0.036
 Sophia+0.036
 Pam+0.033
Neg logit contributions
cel-0.043
 Junior-0.043
 ambul-0.041
 Thomas-0.041
 operating-0.040
Token other
Token ID584
Feature activation-6.23e+00

Pos logit contributions
 Jill+0.052
 tad+0.052
ill+0.049
 Sophia+0.048
 Pam+0.046
Neg logit contributions
cel-0.058
 Junior-0.058
uf-0.054
 ambul-0.053
 Thomas-0.052
Token one
Token ID530
Feature activation-2.68e+00

Pos logit contributions
 Jill+0.192
 tad+0.185
ill+0.177
 Pam+0.167
 Sophia+0.165
Neg logit contributions
 Junior-0.222
cel-0.216
Charl-0.206
uf-0.206
Thomas-0.204
Token took
Token ID1718
Feature activation-1.10e+00

Pos logit contributions
 Jill+0.081
 tad+0.079
ill+0.079
 Sophia+0.077
 Pam+0.071
Neg logit contributions
cel-0.095
 Junior-0.092
 ambul-0.088
 operating-0.086
uf-0.086
Token the
Token ID262
Feature activation-1.65e-01

Pos logit contributions
ill+0.034
 tad+0.033
 Jill+0.033
 Sophia+0.032
ulu+0.030
Neg logit contributions
 Junior-0.037
cel-0.037
 Thomas-0.035
 ambul-0.035
 tid-0.035
Token cell
Token ID2685
Feature activation-3.61e-02

Pos logit contributions
 Jill+0.005
 tad+0.005
ill+0.005
 Sophia+0.005
 Pam+0.005
Neg logit contributions
cel-0.006
 Junior-0.006
 ambul-0.005
uf-0.005
 operating-0.005
Token.
Token ID13
Feature activation-6.22e-02

Pos logit contributions
 tad+0.001
 Jill+0.001
 Sophia+0.001
ill+0.001
 Pam+0.001
Neg logit contributions
cel-0.001
 Junior-0.001
 ambul-0.001
 operating-0.001
 tid-0.001
Token it
Token ID340
Feature activation-2.99e+00

Pos logit contributions
 tad+0.015
ill+0.014
 Jill+0.014
 Sophia+0.014
 Pam+0.013
Neg logit contributions
cel-0.017
 Junior-0.017
 ambul-0.016
 Thomas-0.016
 tid-0.015
Token could
Token ID714
Feature activation-2.46e+00

Pos logit contributions
 tad+0.108
ill+0.106
 Jill+0.104
 Sophia+0.102
 Pam+0.096
Neg logit contributions
cel-0.089
 Junior-0.083
 tid-0.081
 ambul-0.080
 Thomas-0.079
Token do
Token ID466
Feature activation-2.62e+00

Pos logit contributions
 Jill+0.078
 tad+0.077
 Sophia+0.077
ill+0.076
 Meg+0.069
Neg logit contributions
cel-0.082
 Junior-0.079
 tid-0.076
 operating-0.076
 ambul-0.075
Token what
Token ID644
Feature activation+4.31e-01

Pos logit contributions
ill+0.093
 tad+0.092
 Sophia+0.089
 Jill+0.088
 Pam+0.083
Neg logit contributions
cel-0.076
 Junior-0.072
 tid-0.072
 ambul-0.071
 Thomas-0.069
Token the
Token ID262
Feature activation-1.80e+00

Pos logit contributions
 Junior+0.015
cel+0.015
 ambul+0.014
 tid+0.014
 Thomas+0.014
Neg logit contributions
ill-0.013
 tad-0.013
 Jill-0.012
 Sophia-0.012
 Pam-0.011
Token other
Token ID584
Feature activation-6.22e+00

Pos logit contributions
ill+0.050
 Jill+0.050
 Sophia+0.049
 tad+0.049
 Pam+0.045
Neg logit contributions
cel-0.066
 Junior-0.064
 ambul-0.063
 tid-0.062
 operating-0.060
Token cube
Token ID23441
Feature activation-9.74e-01

Pos logit contributions
 Jill+0.180
 tad+0.175
ill+0.173
 Sophia+0.169
 Pam+0.157
Neg logit contributions
cel-0.231
 Junior-0.224
 ambul-0.212
uf-0.210
 operating-0.209
Token could
Token ID714
Feature activation-2.22e+00

Pos logit contributions
 tad+0.035
 Sophia+0.034
ill+0.034
 Jill+0.033
 Pam+0.032
Neg logit contributions
cel-0.028
 Junior-0.027
 tid-0.026
 ambul-0.026
 Thomas-0.025
Token do
Token ID466
Feature activation-2.49e+00

Pos logit contributions
 tad+0.081
 Sophia+0.078
 Jill+0.077
ill+0.073
 Pam+0.072
Neg logit contributions
cel-0.062
 tid-0.060
 operating-0.059
 Junior-0.058
 ambul-0.057
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 tad+0.094
 Jill+0.092
ill+0.091
 Sophia+0.090
 Pam+0.086
Neg logit contributions
cel-0.068
 Junior-0.065
 tid-0.061
 ambul-0.061
 operating-0.060
Token So
Token ID1406
Feature activation+2.04e-01

Pos logit contributions
ill+0.005
 tad+0.005
 Jill+0.005
 Sophia+0.005
ulu+0.004
Neg logit contributions
 Junior-0.004
cel-0.004
 Thomas-0.004
 ambul-0.004
 operating-0.004
Token,
Token ID11
Feature activation-3.69e-01

Pos logit contributions
 Jill+0.016
ill+0.016
 tad+0.015
 Sophia+0.015
 Pam+0.014
Neg logit contributions
cel-0.019
 Junior-0.018
 ambul-0.017
uf-0.017
 operating-0.017
Token and
Token ID290
Feature activation-3.59e-01

Pos logit contributions
ill+0.012
 tad+0.012
 Jill+0.012
 Sophia+0.011
 Pam+0.011
Neg logit contributions
 Junior-0.012
cel-0.012
 ambul-0.011
 tid-0.011
 Thomas-0.011
Token he
Token ID339
Feature activation+1.26e+00

Pos logit contributions
ill+0.011
 tad+0.011
 Jill+0.010
 Sophia+0.010
alk+0.010
Neg logit contributions
 Junior-0.012
cel-0.012
 tid-0.012
 Thomas-0.011
 ambul-0.011
Token tie
Token ID9839
Feature activation-1.08e+00

Pos logit contributions
cel+0.042
 Junior+0.041
 tid+0.038
 ambul+0.038
 Thomas+0.038
Neg logit contributions
 Jill-0.040
 tad-0.040
ill-0.039
 Sophia-0.039
 Pam-0.036
Token the
Token ID262
Feature activation-2.00e+00

Pos logit contributions
 tad+0.038
ill+0.038
 Sophia+0.036
 Jill+0.036
Meg+0.034
Neg logit contributions
 Junior-0.031
cel-0.031
 Thomas-0.030
 tid-0.029
 ambul-0.029
Token other
Token ID584
Feature activation-6.21e+00

Pos logit contributions
 Jill+0.060
ill+0.059
 tad+0.058
 Sophia+0.057
 Pam+0.053
Neg logit contributions
cel-0.072
 Junior-0.070
 ambul-0.067
 operating-0.065
uf-0.065
Token end
Token ID886
Feature activation-2.69e+00

Pos logit contributions
 Jill+0.170
ill+0.167
 tad+0.159
 Sophia+0.155
 Sue+0.141
Neg logit contributions
cel-0.247
 Junior-0.235
uf-0.225
Thomas-0.221
 Thomas-0.221
Token of
Token ID286
Feature activation-2.62e+00

Pos logit contributions
 tad+0.088
 Sophia+0.084
 Jill+0.083
ill+0.082
 Goal+0.077
Neg logit contributions
 Junior-0.083
cel-0.082
 tid-0.080
 operating-0.078
 Thomas-0.077
Token the
Token ID262
Feature activation-8.36e-01

Pos logit contributions
ill+0.090
 tad+0.089
 Jill+0.089
 Sophia+0.087
 Meg+0.081
Neg logit contributions
 Junior-0.081
cel-0.080
 ambul-0.076
 tid-0.074
 operating-0.073
Token rope
Token ID17182
Feature activation-3.73e-01

Pos logit contributions
 Jill+0.024
 tad+0.024
ill+0.023
 Sophia+0.022
 Pam+0.021
Neg logit contributions
cel-0.031
 Junior-0.030
uf-0.028
 ambul-0.028
 Thomas-0.028
Token to
Token ID284
Feature activation-5.21e-01

Pos logit contributions
 tad+0.015
 Sophia+0.014
 Jill+0.014
ill+0.014
Meg+0.013
Neg logit contributions
cel-0.009
 tid-0.009
 Junior-0.009
 operating-0.008
hers-0.008
Token
Token ID198
Feature activation-3.10e-01

Pos logit contributions
 tad+0.015
 Jill+0.015
ill+0.014
 Sophia+0.014
 Pam+0.013
Neg logit contributions
 Junior-0.015
cel-0.015
 Thomas-0.014
Thomas-0.014
uf-0.014
TokenThe
Token ID464
Feature activation-2.51e-01

Pos logit contributions
 Jill+0.011
 tad+0.011
ill+0.010
 Sophia+0.010
 Pam+0.010
Neg logit contributions
cel-0.010
 Junior-0.009
 ambul-0.009
uf-0.009
 operating-0.009
Token girl
Token ID2576
Feature activation-2.37e+00

Pos logit contributions
ill+0.008
 Jill+0.008
 Sophia+0.007
 tad+0.007
alk+0.007
Neg logit contributions
 Junior-0.009
cel-0.009
 ambul-0.008
 operating-0.008
 nurses-0.008
Token tied
Token ID8165
Feature activation-1.11e+00

Pos logit contributions
 Jill+0.076
 tad+0.074
ill+0.073
 Sophia+0.071
 Pam+0.067
Neg logit contributions
cel-0.080
 Junior-0.077
 ambul-0.073
uf-0.072
 tid-0.072
Token the
Token ID262
Feature activation-1.82e+00

Pos logit contributions
ill+0.045
 tad+0.045
 Jill+0.042
aney+0.042
 Sophia+0.042
Neg logit contributions
 Junior-0.026
 Thomas-0.025
 tid-0.024
cel-0.024
 ambul-0.024
Token other
Token ID584
Feature activation-6.20e+00

Pos logit contributions
 Jill+0.061
ill+0.061
 tad+0.055
 Sophia+0.054
ulu+0.052
Neg logit contributions
 Junior-0.058
cel-0.058
 ambul-0.056
 operating-0.056
 tid-0.055
Token end
Token ID886
Feature activation-2.92e+00

Pos logit contributions
 Jill+0.155
 tad+0.148
ill+0.148
 Sophia+0.139
 Pam+0.128
Neg logit contributions
cel-0.250
 Junior-0.243
 ambul-0.236
 tid-0.232
 operating-0.231
Token of
Token ID286
Feature activation-2.29e+00

Pos logit contributions
 tad+0.087
 Sophia+0.082
 Jill+0.081
ill+0.079
 bou+0.077
Neg logit contributions
 tid-0.093
 Junior-0.092
cel-0.090
 operating-0.090
 Thomas-0.089
Token the
Token ID262
Feature activation-6.48e-01

Pos logit contributions
ill+0.093
 tad+0.090
 Jill+0.087
aney+0.086
 Sophia+0.085
Neg logit contributions
 Junior-0.060
 Thomas-0.055
 ambul-0.054
 tid-0.052
 Michael-0.052
Token rope
Token ID17182
Feature activation-3.90e-01

Pos logit contributions
ill+0.021
 Jill+0.020
 tad+0.019
 Sophia+0.018
aney+0.017
Neg logit contributions
cel-0.022
 Junior-0.022
 ambul-0.021
 tid-0.021
 operating-0.021
Token to
Token ID284
Feature activation-1.60e-01

Pos logit contributions
 tad+0.015
 Jill+0.015
 Sophia+0.015
ill+0.014
Meg+0.014
Neg logit contributions
cel-0.010
 tid-0.010
 Junior-0.009
 operating-0.009
 Thomas-0.009
Token loud
Token ID7812
Feature activation-1.82e+00

Pos logit contributions
ill+0.001
 Jill+0.001
 Sophia+0.001
 tad+0.001
 Pam+0.001
Neg logit contributions
cel-0.001
 Junior-0.001
 ambul-0.001
 tid-0.001
 operating-0.001
Token noise
Token ID7838
Feature activation+3.81e-01

Pos logit contributions
 Jill+0.063
 tad+0.062
ill+0.061
 Sophia+0.060
 Pam+0.056
Neg logit contributions
cel-0.057
 Junior-0.055
uf-0.051
 ambul-0.051
 operating-0.050
Token that
Token ID326
Feature activation+5.60e-01

Pos logit contributions
cel+0.015
 Junior+0.015
uf+0.014
Brown+0.014
 ambul+0.014
Neg logit contributions
 Jill-0.010
ill-0.010
 tad-0.009
 Sophia-0.009
 Pam-0.008
Token scared
Token ID12008
Feature activation-1.17e+00

Pos logit contributions
cel+0.018
 Junior+0.018
 ambul+0.017
 Thomas+0.017
 tid+0.016
Neg logit contributions
 tad-0.018
ill-0.018
 Jill-0.018
 Sophia-0.017
 Pam-0.016
Token the
Token ID262
Feature activation-2.18e+00

Pos logit contributions
 tad+0.036
 Jill+0.036
ill+0.035
 Sophia+0.034
 Pam+0.032
Neg logit contributions
cel-0.039
 Junior-0.039
 ambul-0.037
 Thomas-0.037
uf-0.036
Token other
Token ID584
Feature activation-6.20e+00

Pos logit contributions
 Jill+0.072
ill+0.072
 tad+0.071
 Sophia+0.069
 Pam+0.065
Neg logit contributions
cel-0.071
 Junior-0.068
 ambul-0.066
 operating-0.064
 tid-0.064
Token students
Token ID2444
Feature activation-2.56e+00

Pos logit contributions
 tad+0.205
 Jill+0.203
ill+0.200
 Sophia+0.195
 Pam+0.187
Neg logit contributions
cel-0.199
 Junior-0.186
 ambul-0.185
 tid-0.179
 Thomas-0.178
Token.
Token ID13
Feature activation+1.48e-03

Pos logit contributions
 Jill+0.089
 tad+0.089
ill+0.086
 Sophia+0.086
 Pam+0.081
Neg logit contributions
cel-0.078
 Junior-0.075
 ambul-0.071
 operating-0.070
 tid-0.070
Token They
Token ID1119
Feature activation+4.50e-01

Pos logit contributions
cel+0.000
 Junior+0.000
 Thomas+0.000
uf+0.000
 ambul+0.000
Neg logit contributions
 tad-0.000
ill-0.000
 Jill-0.000
 Sophia-0.000
ulu-0.000
Token all
Token ID477
Feature activation-3.12e-01

Pos logit contributions
cel+0.018
 Junior+0.017
uf+0.017
Brown+0.016
 operating+0.016
Neg logit contributions
 Jill-0.013
 tad-0.012
ill-0.012
 Sophia-0.011
 Pam-0.011
Token stopped
Token ID5025
Feature activation+9.92e-01

Pos logit contributions
 Jill+0.009
 tad+0.009
ill+0.009
 Sophia+0.008
 Pam+0.008
Neg logit contributions
cel-0.012
 Junior-0.011
uf-0.011
 ambul-0.011
 operating-0.011
Token shine
Token ID18340
Feature activation+3.75e-01

Pos logit contributions
cel+0.017
 Junior+0.016
 ambul+0.015
uf+0.015
 operating+0.015
Neg logit contributions
 Jill-0.011
 tad-0.011
ill-0.011
 Sophia-0.011
 Pam-0.010
Token as
Token ID355
Feature activation-5.67e-01

Pos logit contributions
cel+0.012
 Junior+0.012
 ambul+0.011
 operating+0.011
uf+0.011
Neg logit contributions
 Jill-0.012
 tad-0.012
ill-0.012
 Sophia-0.012
 Pam-0.011
Token bright
Token ID6016
Feature activation-2.60e-01

Pos logit contributions
 tad+0.020
 Sophia+0.019
ill+0.019
Meg+0.019
 Jill+0.018
Neg logit contributions
 Junior-0.016
 tid-0.015
 Thomas-0.015
 ambul-0.015
hers-0.015
Token as
Token ID355
Feature activation-1.65e+00

Pos logit contributions
 Jill+0.008
 tad+0.008
ill+0.008
 Sophia+0.008
 Pam+0.007
Neg logit contributions
cel-0.009
 Junior-0.009
 ambul-0.008
uf-0.008
 operating-0.008
Token the
Token ID262
Feature activation-2.40e+00

Pos logit contributions
 tad+0.059
ill+0.058
 Sophia+0.057
aney+0.055
Meg+0.055
Neg logit contributions
 Junior-0.045
 tid-0.043
 Thomas-0.042
cel-0.042
 ambul-0.041
Token other
Token ID584
Feature activation-6.19e+00

Pos logit contributions
 Jill+0.067
 tad+0.064
ill+0.062
 Sophia+0.060
 chance+0.057
Neg logit contributions
cel-0.091
 Junior-0.090
uf-0.087
hers-0.082
 ambul-0.081
Token lamps
Token ID32209
Feature activation+5.97e-01

Pos logit contributions
 Jill+0.180
ill+0.175
 tad+0.159
 flower+0.154
 chance+0.152
Neg logit contributions
 Junior-0.236
uf-0.229
cel-0.225
hers-0.218
brate-0.216
Token.
Token ID13
Feature activation-2.60e-01

Pos logit contributions
cel+0.018
 Junior+0.017
 tid+0.017
 ambul+0.016
 operating+0.016
Neg logit contributions
 tad-0.021
 Jill-0.021
 Sophia-0.020
ill-0.020
 Pam-0.019
Token The
Token ID383
Feature activation-5.24e-01

Pos logit contributions
ill+0.009
 tad+0.009
ulu+0.008
 Jill+0.008
 Sophia+0.008
Neg logit contributions
cel-0.008
 Junior-0.008
 Thomas-0.007
 ambul-0.007
uf-0.007
Token lamp
Token ID20450
Feature activation+7.44e-01

Pos logit contributions
 Jill+0.015
 tad+0.015
ill+0.015
 Sophia+0.014
 Pam+0.013
Neg logit contributions
cel-0.019
 Junior-0.019
uf-0.017
 ambul-0.017
 Thomas-0.017
Token was
Token ID373
Feature activation-6.51e-01

Pos logit contributions
cel+0.024
 Junior+0.023
 tid+0.022
 ambul+0.022
 operating+0.021
Neg logit contributions
 Jill-0.025
 tad-0.025
 Sophia-0.024
ill-0.024
 Pam-0.023
Token confused
Token ID10416
Feature activation+1.36e+00

Pos logit contributions
cel+0.014
 Junior+0.014
uf+0.013
 ambul+0.013
brate+0.013
Neg logit contributions
 Jill-0.009
 tad-0.008
 Sophia-0.008
ill-0.007
 Pam-0.007
Token.
Token ID13
Feature activation+2.86e-01

Pos logit contributions
cel+0.057
 Junior+0.054
Brown+0.050
riger+0.050
uf+0.050
Neg logit contributions
 Jill-0.039
ill-0.037
 Sophia-0.034
 tad-0.033
 Pam-0.031
Token They
Token ID1119
Feature activation+2.91e-01

Pos logit contributions
 Junior+0.010
 ambul+0.009
cel+0.009
 Thomas+0.009
 Michael+0.009
Neg logit contributions
 tad-0.009
ill-0.008
 bou-0.008
ulu-0.008
 Jill-0.008
Token think
Token ID892
Feature activation-1.37e+00

Pos logit contributions
cel+0.012
 Junior+0.012
 ambul+0.012
Brown+0.011
uf+0.011
Neg logit contributions
 Jill-0.007
 tad-0.007
 Sophia-0.006
ill-0.006
 Pam-0.006
Token the
Token ID262
Feature activation-1.75e+00

Pos logit contributions
 tad+0.048
ill+0.046
 Sophia+0.043
 Jill+0.043
 bou+0.042
Neg logit contributions
 Junior-0.041
 ambul-0.041
 Thomas-0.039
 operating-0.039
cel-0.038
Token other
Token ID584
Feature activation-6.16e+00

Pos logit contributions
ill+0.073
 Jill+0.068
 Sophia+0.066
 Pam+0.065
 tad+0.065
Neg logit contributions
 praying-0.042
 ambul-0.042
 disturbing-0.042
 operating-0.041
 Junior-0.040
Token kids
Token ID3988
Feature activation-2.56e+00

Pos logit contributions
 tad+0.216
 Jill+0.215
ill+0.213
 Sophia+0.211
 Pam+0.204
Neg logit contributions
cel-0.169
 ambul-0.167
 Junior-0.164
 operating-0.159
 years-0.157
Token are
Token ID389
Feature activation-3.24e-02

Pos logit contributions
 Jill+0.073
 tad+0.070
ill+0.070
 Sophia+0.069
 Pam+0.064
Neg logit contributions
cel-0.097
 Junior-0.093
uf-0.087
 ambul-0.087
Brown-0.087
Token silly
Token ID14397
Feature activation+2.01e+00

Pos logit contributions
 Jill+0.001
 tad+0.001
 Sophia+0.001
ill+0.001
 Pam+0.001
Neg logit contributions
cel-0.001
 Junior-0.001
uf-0.001
Brown-0.001
 ambul-0.001
Token.
Token ID13
Feature activation+3.63e-01

Pos logit contributions
cel+0.062
 Junior+0.060
 ambul+0.057
 operating+0.056
uf+0.056
Neg logit contributions
 Jill-0.069
 tad-0.068
ill-0.067
 Sophia-0.066
 Pam-0.062
Token They
Token ID1119
Feature activation+3.86e-01

Pos logit contributions
 Junior+0.014
cel+0.014
 ambul+0.013
 Thomas+0.013
uf+0.013
Neg logit contributions
 tad-0.010
ill-0.010
 Jill-0.009
 Sophia-0.009
ulu-0.008
Token and
Token ID290
Feature activation-8.40e-01

Pos logit contributions
 Jill+0.074
 tad+0.067
 Sophia+0.066
ill+0.065
 Pam+0.061
Neg logit contributions
cel-0.120
 Junior-0.115
uf-0.113
 ambul-0.112
 operating-0.111
Token sun
Token ID4252
Feature activation-1.51e-01

Pos logit contributions
 Jill+0.024
 tad+0.023
ill+0.022
 Sophia+0.022
 Pam+0.020
Neg logit contributions
cel-0.032
 Junior-0.031
uf-0.030
 ambul-0.030
 operating-0.029
Tokenn
Token ID77
Feature activation-1.67e+00

Pos logit contributions
 Jill+0.005
ill+0.004
 tad+0.004
 Sophia+0.004
 Pam+0.004
Neg logit contributions
cel-0.005
uf-0.005
 Junior-0.005
 ambul-0.005
 operating-0.005
Tokenier
Token ID959
Feature activation-1.23e+00

Pos logit contributions
 tad+0.055
 Jill+0.052
 Sophia+0.051
 Pam+0.047
 bou+0.047
Neg logit contributions
cel-0.056
 Junior-0.055
hers-0.053
 Thomas-0.051
Brown-0.050
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
 Jill+0.044
 tad+0.043
ill+0.043
 Sophia+0.042
 Pam+0.040
Neg logit contributions
cel-0.037
 Junior-0.036
 ambul-0.033
uf-0.033
 operating-0.033
Token other
Token ID584
Feature activation-6.13e+00

Pos logit contributions
 Jill+0.055
 tad+0.054
ill+0.052
 Sophia+0.051
 Pam+0.047
Neg logit contributions
cel-0.092
 Junior-0.090
 ambul-0.085
uf-0.085
 operating-0.084
Token side
Token ID1735
Feature activation-3.41e+00

Pos logit contributions
 tad+0.171
 Jill+0.171
ill+0.164
 Sophia+0.160
 Pam+0.153
Neg logit contributions
cel-0.233
 Junior-0.229
uf-0.215
 ambul-0.215
 operating-0.212
Token was
Token ID373
Feature activation+2.18e-01

Pos logit contributions
 tad+0.121
 Sophia+0.115
 Jill+0.112
 bou+0.109
ill+0.108
Neg logit contributions
 Junior-0.096
cel-0.095
 tid-0.092
 Thomas-0.091
 ambul-0.091
Token!
Token ID0
Feature activation+1.22e-01

Pos logit contributions
cel+0.007
 Junior+0.007
uf+0.006
 ambul+0.006
Brown+0.006
Neg logit contributions
 Jill-0.008
ill-0.007
 tad-0.007
 Sophia-0.007
 Pam-0.007
Token
Token ID220
Feature activation+5.25e-01

Pos logit contributions
 Junior+0.004
cel+0.004
 Thomas+0.004
uf+0.004
 Michael+0.004
Neg logit contributions
 tad-0.004
ill-0.003
 Jill-0.003
 bou-0.003
 Sophia-0.003
Token
Token ID198
Feature activation+2.49e-01

Pos logit contributions
cel+0.022
 Junior+0.021
 ambul+0.020
uf+0.020
 operating+0.020
Neg logit contributions
 Jill-0.013
ill-0.012
 tad-0.012
 Sophia-0.012
 Pam-0.011
Token embarrassed
Token ID21100
Feature activation+9.52e-01

Pos logit contributions
cel+0.008
 Junior+0.008
 operating+0.007
uf+0.007
 tid+0.007
Neg logit contributions
ill-0.009
 Jill-0.009
 Sophia-0.008
 tad-0.008
 Meg-0.008
Token.
Token ID13
Feature activation+5.99e-01

Pos logit contributions
cel+0.041
 Junior+0.040
uf+0.038
riger+0.038
Brown+0.037
Neg logit contributions
 Jill-0.024
ill-0.023
 Sophia-0.021
 tad-0.020
 hers-0.019
Token She
Token ID1375
Feature activation+7.89e-01

Pos logit contributions
cel+0.026
 Junior+0.025
uf+0.024
 Thomas+0.024
 ambul+0.023
Neg logit contributions
 tad-0.014
ill-0.013
 Jill-0.013
 Sophia-0.013
 bou-0.011
Token thought
Token ID1807
Feature activation-1.62e+00

Pos logit contributions
cel+0.028
 Junior+0.028
 ambul+0.027
 operating+0.026
uf+0.026
Neg logit contributions
 Jill-0.023
 tad-0.023
 Sophia-0.021
ill-0.021
 Pam-0.020
Token the
Token ID262
Feature activation-1.75e+00

Pos logit contributions
 tad+0.049
ill+0.046
 Jill+0.044
 Sophia+0.043
 bou+0.040
Neg logit contributions
 Junior-0.058
cel-0.057
 ambul-0.055
 Thomas-0.055
 operating-0.054
Token other
Token ID584
Feature activation-6.13e+00

Pos logit contributions
 Jill+0.056
ill+0.055
 tad+0.054
 Sophia+0.053
 Pam+0.049
Neg logit contributions
cel-0.060
 Junior-0.058
 ambul-0.055
uf-0.054
 operating-0.054
Token kids
Token ID3988
Feature activation-2.75e+00

Pos logit contributions
 Jill+0.180
 tad+0.176
ill+0.175
 Sophia+0.171
 Pam+0.157
Neg logit contributions
cel-0.224
 Junior-0.217
 ambul-0.210
uf-0.204
 operating-0.203
Token would
Token ID561
Feature activation-1.34e+00

Pos logit contributions
 Jill+0.086
 tad+0.083
ill+0.083
 Sophia+0.081
 Pam+0.076
Neg logit contributions
cel-0.096
 Junior-0.092
 ambul-0.087
uf-0.087
 operating-0.085
Token laugh
Token ID6487
Feature activation-1.12e+00

Pos logit contributions
 Jill+0.045
 tad+0.044
ill+0.044
 Sophia+0.043
 Pam+0.040
Neg logit contributions
cel-0.042
 Junior-0.041
 ambul-0.039
 operating-0.038
 tid-0.038
Token at
Token ID379
Feature activation+5.10e-01

Pos logit contributions
 tad+0.042
 Jill+0.041
 Sophia+0.039
ill+0.039
 Pam+0.037
Neg logit contributions
cel-0.032
 Junior-0.031
 operating-0.029
 ambul-0.029
 tid-0.029
Token her
Token ID607
Feature activation+8.09e-03

Pos logit contributions
 Junior+0.014
cel+0.014
 Thomas+0.013
uf+0.012
 ambul+0.012
Neg logit contributions
 tad-0.020
ill-0.019
 Jill-0.019
 Sophia-0.018
 Goal-0.018
Token wanted
Token ID2227
Feature activation-1.94e+00

Pos logit contributions
 Jill+0.090
 tad+0.088
ill+0.087
 Sophia+0.085
 Pam+0.080
Neg logit contributions
cel-0.098
 Junior-0.095
 ambul-0.090
uf-0.088
 operating-0.088
Token to
Token ID284
Feature activation+3.18e-01

Pos logit contributions
 tad+0.065
 Jill+0.064
ill+0.063
 Sophia+0.061
 Pam+0.058
Neg logit contributions
cel-0.061
 Junior-0.060
 ambul-0.057
 tid-0.056
uf-0.056
Token do
Token ID466
Feature activation-6.39e-01

Pos logit contributions
cel+0.011
 Junior+0.011
 ambul+0.010
uf+0.010
 operating+0.010
Neg logit contributions
 Jill-0.009
 tad-0.009
ill-0.009
 Sophia-0.009
 Pam-0.008
Token what
Token ID644
Feature activation+1.18e+00

Pos logit contributions
 Jill+0.022
 tad+0.021
ill+0.021
 Sophia+0.021
 Pam+0.019
Neg logit contributions
cel-0.020
 Junior-0.020
 ambul-0.018
uf-0.018
Brown-0.018
Token the
Token ID262
Feature activation-1.53e+00

Pos logit contributions
cel+0.042
 Junior+0.042
 ambul+0.040
 tid+0.039
 Thomas+0.039
Neg logit contributions
 tad-0.034
ill-0.034
 Jill-0.033
 Sophia-0.032
 Pam-0.030
Token other
Token ID584
Feature activation-6.11e+00

Pos logit contributions
 Jill+0.045
ill+0.044
 tad+0.043
 Sophia+0.042
 Pam+0.039
Neg logit contributions
cel-0.056
 Junior-0.054
 ambul-0.052
 operating-0.051
 tid-0.050
Token one
Token ID530
Feature activation-2.30e+00

Pos logit contributions
 Jill+0.199
ill+0.197
 tad+0.196
 Sophia+0.191
 Pam+0.179
Neg logit contributions
cel-0.192
 Junior-0.188
 ambul-0.183
 tid-0.177
 operating-0.176
Token wanted
Token ID2227
Feature activation-3.61e+00

Pos logit contributions
 tad+0.087
ill+0.082
 Jill+0.080
 bou+0.080
 Sophia+0.078
Neg logit contributions
 Junior-0.058
 why-0.057
 tid-0.057
 years-0.056
cel-0.054
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
 tad+0.157
 Jill+0.148
 Sophia+0.146
ill+0.143
 Pam+0.141
Neg logit contributions
 Junior-0.074
cel-0.073
 tid-0.069
 ambul-0.068
 years-0.067
Token They
Token ID1119
Feature activation+7.83e-02

Pos logit contributions
 tad+0.007
 bou+0.007
ill+0.006
 butterfly+0.006
ulu+0.006
Neg logit contributions
 Junior-0.006
uf-0.006
cel-0.006
 Thomas-0.006
 operating-0.005
Token got
Token ID1392
Feature activation-3.95e-01

Pos logit contributions
cel+0.003
 Junior+0.003
 ambul+0.003
uf+0.003
 operating+0.003
Neg logit contributions
 Jill-0.002
 tad-0.002
ill-0.002
 Sophia-0.002
 Pam-0.002
Token was
Token ID373
Feature activation+3.18e-01

Pos logit contributions
 Jill+0.025
ill+0.024
 tad+0.024
 Sophia+0.023
 Pam+0.022
Neg logit contributions
cel-0.029
 Junior-0.028
 ambul-0.027
uf-0.026
 operating-0.026
Token a
Token ID257
Feature activation+1.18e+00

Pos logit contributions
 Junior+0.008
cel+0.007
 Thomas+0.007
 tid+0.007
 ambul+0.007
Neg logit contributions
 tad-0.012
ill-0.012
 Jill-0.012
 Sophia-0.012
 Pam-0.011
Token little
Token ID1310
Feature activation-2.89e-01

Pos logit contributions
cel+0.047
 Junior+0.046
 ambul+0.043
uf+0.043
 operating+0.043
Neg logit contributions
 Jill-0.031
ill-0.030
 tad-0.030
 Sophia-0.029
 Pam-0.027
Token bug
Token ID5434
Feature activation+1.25e+00

Pos logit contributions
 Jill+0.006
ill+0.005
 tad+0.005
 Sophia+0.005
 Pam+0.004
Neg logit contributions
cel-0.013
 Junior-0.013
 ambul-0.013
uf-0.012
 operating-0.012
Token named
Token ID3706
Feature activation-2.14e+00

Pos logit contributions
 Junior+0.047
cel+0.044
riger+0.043
 ambul+0.043
 Thomas+0.043
Neg logit contributions
 Jill-0.036
 tad-0.035
ill-0.034
 flower-0.034
 grape-0.032
Token Bug
Token ID15217
Feature activation+4.16e+00

Pos logit contributions
 tad+0.060
ill+0.057
 Sophia+0.054
 Jill+0.052
aney+0.050
Neg logit contributions
cel-0.080
 Junior-0.075
 Thomas-0.072
uf-0.072
 tid-0.069
Tokengy
Token ID1360
Feature activation+1.12e+00

Pos logit contributions
cel+0.133
 Junior+0.129
brate+0.127
 ambul+0.125
 operating+0.125
Neg logit contributions
 Jill-0.137
ill-0.133
 tad-0.132
 Pam-0.126
 Sue-0.124
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
 Junior+0.039
cel+0.038
Thomas+0.036
riger+0.036
Michael+0.035
Neg logit contributions
 Jill-0.034
 tad-0.034
ill-0.033
 Sophia-0.032
 Pam-0.031
Token Bug
Token ID15217
Feature activation+3.03e+00

Pos logit contributions
cel+0.005
hers+0.005
issa+0.005
ators+0.005
ise+0.005
Neg logit contributions
 tad-0.004
 Sue-0.004
 Pam-0.004
 frog-0.004
 Jill-0.004
Tokengy
Token ID1360
Feature activation+1.74e-01

Pos logit contributions
cel+0.095
 ambul+0.092
 Junior+0.090
 operating+0.090
riger+0.089
Neg logit contributions
 Jill-0.100
ill-0.098
 tad-0.098
 Pam-0.094
 Sue-0.093
Token wanted
Token ID2227
Feature activation-1.80e+00

Pos logit contributions
 Junior+0.006
cel+0.006
 operating+0.006
 ambul+0.006
Thomas+0.006
Neg logit contributions
 Jill-0.005
 tad-0.005
 Sophia-0.005
ill-0.005
 Sue-0.004
Token "
Token ID366
Feature activation+5.20e-01

Pos logit contributions
 tad+0.003
ill+0.003
 Jill+0.003
 Sophia+0.003
 Pam+0.002
Neg logit contributions
cel-0.003
 Junior-0.003
 Thomas-0.003
 ambul-0.003
 tid-0.003
TokenDon
Token ID3987
Feature activation+7.48e-01

Pos logit contributions
cel+0.018
 Junior+0.018
 ambul+0.016
Brown+0.016
 operating+0.016
Neg logit contributions
 Jill-0.016
 tad-0.016
ill-0.016
 Sophia-0.016
 Pam-0.014
Token't
Token ID470
Feature activation-2.28e-01

Pos logit contributions
cel+0.027
 Junior+0.026
 ambul+0.025
uf+0.025
 operating+0.024
Neg logit contributions
 Jill-0.022
 tad-0.021
ill-0.021
 Sophia-0.021
 Pam-0.020
Token worry
Token ID5490
Feature activation+1.66e+00

Pos logit contributions
 Jill+0.007
 tad+0.007
ill+0.007
 Sophia+0.007
 Pam+0.007
Neg logit contributions
cel-0.008
 Junior-0.008
hers-0.007
 ambul-0.007
uf-0.007
Token,
Token ID11
Feature activation+6.55e-01

Pos logit contributions
cel+0.057
 Junior+0.053
hers+0.052
 ambul+0.051
uf+0.050
Neg logit contributions
 Jill-0.058
ill-0.053
 Sophia-0.052
 tad-0.051
 Pam-0.051
Token Mickey
Token ID27232
Feature activation+4.21e+00

Pos logit contributions
cel+0.023
 Junior+0.022
 ambul+0.021
 Thomas+0.021
 tid+0.020
Neg logit contributions
ill-0.020
 tad-0.020
 Jill-0.020
 Sophia-0.019
 Pam-0.017
Token.
Token ID13
Feature activation+4.17e-01

Pos logit contributions
cel+0.152
 ambul+0.146
 Junior+0.146
hers+0.141
 operating+0.140
Neg logit contributions
 Jill-0.127
ill-0.121
 tad-0.117
 Sophia-0.115
 Pam-0.110
Token I
Token ID314
Feature activation-5.18e-01

Pos logit contributions
cel+0.016
 Junior+0.015
 ambul+0.015
uf+0.014
 operating+0.014
Neg logit contributions
 Jill-0.012
 tad-0.011
ill-0.011
 Sophia-0.011
 Pam-0.010
Token will
Token ID481
Feature activation-2.01e-01

Pos logit contributions
 Jill+0.018
 Sophia+0.018
 tad+0.017
ill+0.017
 Meg+0.017
Neg logit contributions
cel-0.016
 Junior-0.015
 tid-0.015
 operating-0.014
 ambul-0.014
Token help
Token ID1037
Feature activation+1.13e+00

Pos logit contributions
 Jill+0.005
 tad+0.005
ill+0.005
 Sophia+0.005
 Pam+0.005
Neg logit contributions
cel-0.008
 Junior-0.007
 ambul-0.007
Michael-0.007
uf-0.007
Token you
Token ID345
Feature activation-8.28e-01

Pos logit contributions
cel+0.044
 Junior+0.042
hers+0.042
 ambul+0.041
Michael+0.040
Neg logit contributions
 Jill-0.032
 Pam-0.028
 tad-0.027
 Sophia-0.026
 Sue-0.026
Token<|endoftext|>
Token ID50256
Feature activation+9.99e-01

Pos logit contributions
 Junior+0.039
cel+0.039
 Thomas+0.036
uf+0.036
 ambul+0.035
Neg logit contributions
 tad-0.031
 Jill-0.031
ill-0.030
 Sophia-0.029
 Pam-0.027
TokenOnce
Token ID7454
Feature activation+1.04e+00

Pos logit contributions
cel+0.026
 Junior+0.025
 Thomas+0.023
Thomas+0.022
 operating+0.022
Neg logit contributions
 tad-0.039
 Jill-0.039
 Sophia-0.038
ill-0.037
 Pam-0.037
Token there
Token ID612
Feature activation-2.84e-01

Pos logit contributions
 Junior+0.040
 Thomas+0.040
 Michael+0.038
 why+0.038
 tid+0.037
Neg logit contributions
 tad-0.028
 Sophia-0.022
ill-0.022
 bou-0.021
 Jill-0.021
Token was
Token ID373
Feature activation+7.40e-01

Pos logit contributions
 tad+0.010
 Jill+0.010
ill+0.009
 Sophia+0.009
 Pam+0.009
Neg logit contributions
cel-0.009
 Junior-0.009
 ambul-0.008
 Thomas-0.008
 operating-0.008
Token a
Token ID257
Feature activation+1.67e+00

Pos logit contributions
 Junior+0.020
 ambul+0.019
 Thomas+0.018
 tid+0.018
cel+0.018
Neg logit contributions
 tad-0.027
ill-0.027
 Sophia-0.027
 Jill-0.026
 Pam-0.025
Token fire
Token ID2046
Feature activation+3.90e+00

Pos logit contributions
cel+0.070
 Junior+0.068
 ambul+0.066
 operating+0.064
uf+0.064
Neg logit contributions
 Jill-0.040
 tad-0.039
ill-0.037
 Sophia-0.037
 Pam-0.034
Tokenman
Token ID805
Feature activation+2.70e+00

Pos logit contributions
cel+0.148
 Junior+0.144
riger+0.139
Thomas+0.138
uf+0.138
Neg logit contributions
 Jill-0.111
 frog-0.103
 tad-0.102
 crab-0.102
ill-0.101
Token named
Token ID3706
Feature activation-2.04e+00

Pos logit contributions
 Junior+0.108
cel+0.104
riger+0.103
Thomas+0.103
Michael+0.103
Neg logit contributions
 Jill-0.068
ill-0.061
 flower-0.059
 Sue-0.056
 crab-0.055
Token Jeffrey
Token ID19627
Feature activation+1.35e+00

Pos logit contributions
 Jill+0.053
 Sue+0.042
ill+0.042
 Jen+0.039
 Sophia+0.038
Neg logit contributions
cel-0.091
 operating-0.083
Charl-0.083
ises-0.082
ise-0.082
Token.
Token ID13
Feature activation+5.94e-01

Pos logit contributions
cel+0.053
 Junior+0.050
 ambul+0.050
riger+0.049
ths+0.049
Neg logit contributions
 Jill-0.034
ill-0.032
 tad-0.032
 Sophia-0.031
 crab-0.029
Token He
Token ID679
Feature activation+7.37e-01

Pos logit contributions
cel+0.024
 tid+0.022
 Junior+0.022
 ambul+0.022
 operating+0.022
Neg logit contributions
 Sophia-0.016
 Jill-0.016
ill-0.015
 tad-0.014
 Meg-0.014
Token a
Token ID257
Feature activation+3.79e-01

Pos logit contributions
cel+0.003
 Junior+0.003
 ambul+0.003
 operating+0.003
Brown+0.003
Neg logit contributions
 Jill-0.004
 tad-0.004
 Sophia-0.004
ill-0.004
 Pam-0.003
Token flower
Token ID15061
Feature activation-4.18e-01

Pos logit contributions
cel+0.014
 Junior+0.013
 tid+0.013
 ambul+0.013
 operating+0.013
Neg logit contributions
ill-0.011
 Jill-0.011
 Sophia-0.010
 tad-0.010
 Pam-0.009
Token,
Token ID11
Feature activation-4.70e-01

Pos logit contributions
 Jill+0.016
 tad+0.016
ill+0.015
 Sophia+0.015
 Pam+0.015
Neg logit contributions
cel-0.011
 Junior-0.010
 operating-0.010
 tid-0.010
 ambul-0.010
Token you
Token ID345
Feature activation-3.21e-01

Pos logit contributions
ill+0.017
 tad+0.017
 Jill+0.016
 Sophia+0.015
alk+0.015
Neg logit contributions
 Junior-0.013
cel-0.013
 tid-0.012
 Thomas-0.012
 ambul-0.012
Token won
Token ID1839
Feature activation+1.27e-01

Pos logit contributions
 Jill+0.010
 tad+0.010
ill+0.010
 Sophia+0.010
 Pam+0.009
Neg logit contributions
cel-0.011
 Junior-0.010
 ambul-0.010
uf-0.010
 operating-0.010
Tokenâ
Token ID22940
Feature activation+4.32e+00

Pos logit contributions
cel+0.005
 Junior+0.005
uf+0.005
 ambul+0.004
ind+0.004
Neg logit contributions
 Jill-0.003
ill-0.003
 Sophia-0.003
 tad-0.003
 Pam-0.003
Token€
Token ID26391
Feature activation+8.68e-01

Pos logit contributions
cel+0.138
 Junior+0.133
 ambul+0.126
 operating+0.122
uf+0.122
Neg logit contributions
 Jill-0.146
ill-0.142
 tad-0.142
 Sophia-0.140
 Pam-0.131
Tokenâ„¢
Token ID8151
Feature activation+2.65e+00

Pos logit contributions
cel+0.027
 Junior+0.026
 ambul+0.025
uf+0.024
 operating+0.024
Neg logit contributions
 Jill-0.030
 tad-0.029
ill-0.029
 Sophia-0.028
 Pam-0.027
Tokent
Token ID83
Feature activation+8.18e-01

Pos logit contributions
cel+0.094
 Junior+0.092
uf+0.088
 ambul+0.087
 Thomas+0.085
Neg logit contributions
 tad-0.080
 Jill-0.076
 Sophia-0.072
ill-0.069
 Pam-0.068
Token poke
Token ID22620
Feature activation+1.54e+00

Pos logit contributions
cel+0.033
 Junior+0.032
 ambul+0.030
 Thomas+0.030
uf+0.030
Neg logit contributions
 Jill-0.022
ill-0.021
 tad-0.021
 Sophia-0.020
 Pam-0.019
Token it
Token ID340
Feature activation-4.50e-01

Pos logit contributions
cel+0.054
 Junior+0.053
 ambul+0.050
uf+0.049
 operating+0.049
Neg logit contributions
 Jill-0.046
 tad-0.046
ill-0.045
 Sophia-0.044
 Pam-0.041
Token Ben
Token ID3932
Feature activation+1.02e+00

Pos logit contributions
 Junior+0.037
cel+0.035
 Thomas+0.034
 ambul+0.034
uf+0.034
Neg logit contributions
 tad-0.025
ill-0.024
 Jill-0.024
 Sophia-0.022
ulu-0.021
Token like
Token ID588
Feature activation+7.84e-01

Pos logit contributions
cel+0.041
 Junior+0.039
 ambul+0.038
 operating+0.038
uf+0.037
Neg logit contributions
 Jill-0.027
ill-0.026
 tad-0.025
 Sophia-0.025
 Pam-0.022
Token to
Token ID284
Feature activation+3.02e-01

Pos logit contributions
 Junior+0.028
cel+0.027
 ambul+0.027
 Thomas+0.026
 tid+0.026
Neg logit contributions
ill-0.023
 tad-0.023
 Jill-0.022
 Sophia-0.022
 Pam-0.020
Token play
Token ID711
Feature activation-2.33e+00

Pos logit contributions
 Junior+0.010
cel+0.009
 tid+0.009
 Thomas+0.009
 ambul+0.009
Neg logit contributions
 tad-0.010
ill-0.009
 Sophia-0.009
 Jill-0.009
ilies-0.009
Token with
Token ID351
Feature activation+1.56e-01

Pos logit contributions
 Jill+0.079
 tad+0.077
ill+0.073
 Sophia+0.072
 Pam+0.070
Neg logit contributions
cel-0.075
 Junior-0.074
 ambul-0.070
uf-0.069
Brown-0.067
Token bricks
Token ID28902
Feature activation+4.07e+00

Pos logit contributions
cel+0.007
 Junior+0.006
uf+0.006
 operating+0.006
riger+0.006
Neg logit contributions
 Jill-0.004
 tad-0.003
 Sophia-0.003
ill-0.003
 Pam-0.003
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
cel+0.141
 Junior+0.135
uf+0.127
 ambul+0.125
 operating+0.123
Neg logit contributions
 Jill-0.133
ill-0.132
 tad-0.126
 Sophia-0.124
 Pam-0.115
Token They
Token ID1119
Feature activation-1.27e-02

Pos logit contributions
 Junior+0.000
 Thomas+0.000
cel+0.000
 operating+0.000
 tid+0.000
Neg logit contributions
 tad-0.000
 Sophia-0.000
ill-0.000
Meg-0.000
 bou-0.000
Token have
Token ID423
Feature activation-1.74e+00

Pos logit contributions
 Jill+0.000
ill+0.000
 tad+0.000
 Pam+0.000
 Sophia+0.000
Neg logit contributions
cel-0.001
 Junior-0.000
uf-0.000
 ambul-0.000
 operating-0.000
Token many
Token ID867
Feature activation-8.76e-01

Pos logit contributions
 Jill+0.052
 tad+0.052
ill+0.051
 Sophia+0.050
 Pam+0.046
Neg logit contributions
cel-0.062
 Junior-0.060
 ambul-0.057
uf-0.056
 operating-0.056
Token bricks
Token ID28902
Feature activation+3.22e+00

Pos logit contributions
 Jill+0.024
 tad+0.024
ill+0.023
 Sophia+0.023
 Pam+0.021
Neg logit contributions
cel-0.033
 Junior-0.033
 ambul-0.031
uf-0.031
 operating-0.030
Token it
Token ID340
Feature activation-7.37e-01

Pos logit contributions
 tad+0.037
 Jill+0.036
ill+0.036
 Sophia+0.034
 Pam+0.031
Neg logit contributions
cel-0.067
 Junior-0.066
 ambul-0.063
 tid-0.062
 operating-0.062
Token was
Token ID373
Feature activation-4.08e-01

Pos logit contributions
 Jill+0.024
ill+0.023
 tad+0.023
 Sophia+0.022
 Pam+0.021
Neg logit contributions
cel-0.025
 Junior-0.024
uf-0.023
 ambul-0.023
 operating-0.022
Token broken
Token ID5445
Feature activation+1.17e+00

Pos logit contributions
 Jill+0.014
ill+0.013
 tad+0.013
 Sophia+0.013
 Pam+0.012
Neg logit contributions
cel-0.013
 Junior-0.013
uf-0.012
 ambul-0.012
 operating-0.012
Token again
Token ID757
Feature activation-1.01e+00

Pos logit contributions
cel+0.040
 Junior+0.038
uf+0.038
 ambul+0.036
Brown+0.035
Neg logit contributions
 Jill-0.038
ill-0.037
 tad-0.035
 Sophia-0.035
 Pam-0.032
Token.
Token ID13
Feature activation+7.50e-01

Pos logit contributions
 Jill+0.036
ill+0.034
 tad+0.034
 Sophia+0.033
 Pam+0.031
Neg logit contributions
cel-0.031
 Junior-0.030
uf-0.028
 ambul-0.028
 operating-0.027
Token She
Token ID1375
Feature activation+8.79e-01

Pos logit contributions
cel+0.027
 Junior+0.026
 Thomas+0.025
uf+0.025
 operating+0.024
Neg logit contributions
 tad-0.023
ill-0.021
 Jill-0.021
 Sophia-0.020
ulu-0.019
Token was
Token ID373
Feature activation+5.26e-01

Pos logit contributions
cel+0.030
 Junior+0.029
 ambul+0.027
uf+0.027
 operating+0.027
Neg logit contributions
 Jill-0.028
 tad-0.027
ill-0.027
 Sophia-0.026
 Pam-0.025
Token so
Token ID523
Feature activation+1.03e+00

Pos logit contributions
cel+0.017
 Junior+0.017
 ambul+0.016
uf+0.016
 operating+0.015
Neg logit contributions
 Jill-0.018
 tad-0.017
ill-0.017
 Sophia-0.017
 Pam-0.016
Token disappointed
Token ID11679
Feature activation+8.90e-01

Pos logit contributions
cel+0.037
 Junior+0.037
 tid+0.036
 operating+0.035
 ambul+0.035
Neg logit contributions
ill-0.029
 Jill-0.029
 tad-0.029
 Sophia-0.028
 Pam-0.026
Token,
Token ID11
Feature activation+9.85e-02

Pos logit contributions
cel+0.031
 Junior+0.029
hers+0.028
uf+0.028
 ambul+0.028
Neg logit contributions
 Jill-0.030
ill-0.028
 Sophia-0.028
 Pam-0.026
 tad-0.026
Token she
Token ID673
Feature activation+6.11e-01

Pos logit contributions
cel+0.003
 Junior+0.003
 ambul+0.003
 tid+0.003
 Thomas+0.003
Neg logit contributions
 tad-0.003
ill-0.003
 Jill-0.003
 Sophia-0.003
ulu-0.003
Token are
Token ID389
Feature activation+2.47e-01

Pos logit contributions
cel+0.015
 Junior+0.014
 ambul+0.014
uf+0.014
 tid+0.013
Neg logit contributions
 Jill-0.013
 tad-0.013
ill-0.012
 Sophia-0.012
 Pam-0.011
Token happy
Token ID3772
Feature activation-3.39e-01

Pos logit contributions
cel+0.010
 Junior+0.009
 ambul+0.009
 tid+0.009
hers+0.009
Neg logit contributions
 Jill-0.006
 tad-0.006
ill-0.006
 Sophia-0.006
 Pam-0.006
Token and
Token ID290
Feature activation-5.17e-03

Pos logit contributions
 Jill+0.011
 tad+0.011
ill+0.011
 Sophia+0.011
 Pam+0.010
Neg logit contributions
cel-0.011
 Junior-0.011
 ambul-0.010
uf-0.010
 operating-0.010
Token safe
Token ID3338
Feature activation+7.82e-01

Pos logit contributions
ill+0.000
 Jill+0.000
 tad+0.000
 Sophia+0.000
 Pam+0.000
Neg logit contributions
cel-0.000
 Junior-0.000
 tid-0.000
 ambul-0.000
 Thomas-0.000
Token.
Token ID13
Feature activation+7.05e-01

Pos logit contributions
cel+0.021
 Junior+0.020
 ambul+0.019
 tid+0.019
 operating+0.019
Neg logit contributions
 Jill-0.030
 tad-0.030
ill-0.029
 Sophia-0.028
 Pam-0.027
Token<|endoftext|>
Token ID50256
Feature activation+1.07e+00

Pos logit contributions
cel+0.032
 Junior+0.031
hers+0.030
 ambul+0.030
 operating+0.030
Neg logit contributions
 Jill-0.016
 Sophia-0.014
 tad-0.014
ill-0.014
 Pam-0.013
TokenTom
Token ID13787
Feature activation+3.86e-01

Pos logit contributions
cel+0.028
 Junior+0.027
 Thomas+0.024
uf+0.024
 operating+0.024
Neg logit contributions
 Jill-0.042
 tad-0.041
ill-0.041
 Sophia-0.040
 Pam-0.039
Token and
Token ID290
Feature activation+8.46e-01

Pos logit contributions
cel+0.013
 Junior+0.013
 ambul+0.012
 operating+0.012
uf+0.012
Neg logit contributions
 Jill-0.012
ill-0.012
 tad-0.012
 Sophia-0.012
 Pam-0.011
Token Mia
Token ID32189
Feature activation-2.02e-01

Pos logit contributions
 Junior+0.028
 Thomas+0.025
cel+0.023
 Michael+0.023
 ambul+0.023
Neg logit contributions
ill-0.029
 tad-0.028
 bou-0.026
 Sophia-0.025
ilies-0.025
Token liked
Token ID8288
Feature activation+3.59e-01

Pos logit contributions
 Jill+0.005
ill+0.005
 Sophia+0.005
 tad+0.005
 Pam+0.004
Neg logit contributions
cel-0.008
 Junior-0.008
 ambul-0.008
Brown-0.008
hers-0.008
Token to
Token ID284
Feature activation+7.06e-01

Pos logit contributions
 Junior+0.012
 ambul+0.011
 tid+0.011
 operating+0.011
 Thomas+0.011
Neg logit contributions
ill-0.012
 tad-0.012
 Jill-0.011
 Sophia-0.011
ulu-0.010
Token He
Token ID679
Feature activation+1.01e+00

Pos logit contributions
cel+0.024
 Junior+0.022
 ambul+0.021
 operating+0.021
uf+0.021
Neg logit contributions
 Jill-0.018
ill-0.018
 tad-0.017
 Sophia-0.017
 Pam-0.016
Token had
Token ID550
Feature activation-3.16e-01

Pos logit contributions
cel+0.036
 Junior+0.035
 ambul+0.035
 operating+0.033
uf+0.033
Neg logit contributions
 Jill-0.030
 tad-0.029
 Sophia-0.027
ill-0.027
 Pam-0.026
Token a
Token ID257
Feature activation-4.22e-01

Pos logit contributions
 Jill+0.010
 tad+0.010
ill+0.010
 Sophia+0.010
 Pam+0.009
Neg logit contributions
cel-0.011
 Junior-0.011
 ambul-0.010
uf-0.010
 operating-0.010
Token high
Token ID1029
Feature activation-1.10e+00

Pos logit contributions
ill+0.012
 Jill+0.012
 tad+0.011
 Sophia+0.011
 Pam+0.010
Neg logit contributions
cel-0.016
 Junior-0.016
 ambul-0.015
uf-0.015
 tid-0.015
Token er
Token ID1931
Feature activation+7.22e-01

Pos logit contributions
 Jill+0.032
ill+0.031
 tad+0.031
 Sophia+0.030
 Pam+0.028
Neg logit contributions
cel-0.040
 Junior-0.039
 ambul-0.037
 tid-0.037
uf-0.037
Tokenaser
Token ID6005
Feature activation+4.44e-01

Pos logit contributions
cel+0.030
 Junior+0.029
 ambul+0.028
uf+0.027
 operating+0.027
Neg logit contributions
 Jill-0.018
 tad-0.017
ill-0.017
 Sophia-0.017
 Pam-0.015
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
 Junior+0.014
cel+0.013
 ambul+0.012
uf+0.012
 Thomas+0.012
Neg logit contributions
 Jill-0.015
 tad-0.015
ill-0.015
 Sophia-0.014
 Pam-0.014
Token Bob
Token ID5811
Feature activation+1.43e-02

Pos logit contributions
 tad+0.005
 Jill+0.005
ill+0.005
 Sophia+0.004
 Pam+0.004
Neg logit contributions
 Junior-0.006
cel-0.006
 ambul-0.006
uf-0.006
 Thomas-0.005
Token loved
Token ID6151
Feature activation+5.36e-01

Pos logit contributions
cel+0.001
 ambul+0.001
 Junior+0.001
riger+0.001
uf+0.001
Neg logit contributions
 Jill-0.000
ill-0.000
 tad-0.000
 Sophia-0.000
 flower-0.000
Token to
Token ID284
Feature activation+7.65e-01

Pos logit contributions
cel+0.021
 Junior+0.020
 ambul+0.019
 tid+0.019
 operating+0.019
Neg logit contributions
ill-0.014
 Jill-0.014
 tad-0.014
 Sophia-0.014
 Pam-0.012
Token add
Token ID751
Feature activation+7.45e-01

Pos logit contributions
cel+0.026
 Junior+0.025
 tid+0.025
 operating+0.024
 ambul+0.024
Neg logit contributions
 tad-0.024
ill-0.023
 Sophia-0.023
 Jill-0.022
ulu-0.020
Token dirty
Token ID11841
Feature activation+1.34e+00

Pos logit contributions
cel+0.018
 Junior+0.017
uf+0.017
 ambul+0.016
 operating+0.016
Neg logit contributions
 Jill-0.016
 tad-0.015
ill-0.015
 Sophia-0.015
 Pam-0.014
Token.
Token ID13
Feature activation+5.35e-01

Pos logit contributions
cel+0.045
 Junior+0.043
uf+0.042
 ambul+0.040
Brown+0.040
Neg logit contributions
 Jill-0.044
ill-0.042
 tad-0.042
 Sophia-0.041
 Pam-0.038
Token His
Token ID2399
Feature activation+1.44e+00

Pos logit contributions
cel+0.020
 Junior+0.020
 ambul+0.019
uf+0.019
 operating+0.019
Neg logit contributions
 tad-0.015
 Jill-0.014
ill-0.014
 Sophia-0.014
 Pam-0.013
Token mom
Token ID1995
Feature activation+1.21e+00

Pos logit contributions
 Junior+0.048
cel+0.047
 ambul+0.045
 tid+0.044
 operating+0.044
Neg logit contributions
ill-0.046
 Jill-0.046
 tad-0.045
 Sophia-0.044
 Pam-0.040
Token was
Token ID373
Feature activation+1.29e+00

Pos logit contributions
cel+0.044
 Junior+0.043
uf+0.042
 ambul+0.041
 operating+0.040
Neg logit contributions
 Jill-0.035
ill-0.035
 Sophia-0.033
 tad-0.032
 Pam-0.030
Token very
Token ID845
Feature activation+1.84e+00

Pos logit contributions
cel+0.048
 Junior+0.046
uf+0.043
 ambul+0.043
Brown+0.042
Neg logit contributions
 Jill-0.038
 Sophia-0.036
 tad-0.036
ill-0.035
 Pam-0.032
Token sad
Token ID6507
Feature activation+1.88e-01

Pos logit contributions
cel+0.072
 Junior+0.070
 ambul+0.066
uf+0.066
 operating+0.065
Neg logit contributions
 Jill-0.050
 tad-0.048
ill-0.047
 Sophia-0.047
 Pam-0.043
Token that
Token ID326
Feature activation+1.76e+00

Pos logit contributions
cel+0.008
 Junior+0.008
uf+0.007
Brown+0.007
hers+0.007
Neg logit contributions
 Jill-0.005
ill-0.004
 Sophia-0.004
 tad-0.004
 Pam-0.004
Token he
Token ID339
Feature activation+1.22e+00

Pos logit contributions
cel+0.065
 Junior+0.060
uf+0.059
 ambul+0.056
 operating+0.056
Neg logit contributions
 Jill-0.054
ill-0.051
 Sophia-0.051
 tad-0.050
 Pam-0.046
Token didn
Token ID1422
Feature activation-4.87e-01

Pos logit contributions
cel+0.045
 Junior+0.043
uf+0.042
 ambul+0.042
 operating+0.040
Neg logit contributions
 Jill-0.034
 tad-0.032
ill-0.032
 Sophia-0.031
 Pam-0.029
Token't
Token ID470
Feature activation-4.18e-01

Pos logit contributions
 Jill+0.014
ill+0.014
 tad+0.014
 Sophia+0.013
 Pam+0.013
Neg logit contributions
cel-0.018
 Junior-0.017
uf-0.016
 ambul-0.016
 tid-0.016
Token's
Token ID338
Feature activation+1.08e+00

Pos logit contributions
 Jill+0.007
 tad+0.007
ill+0.007
 Sophia+0.007
 Pam+0.006
Neg logit contributions
cel-0.006
 Junior-0.006
 ambul-0.005
uf-0.005
 operating-0.005
Token owner
Token ID4870
Feature activation+6.34e-01

Pos logit contributions
 Junior+0.032
uf+0.031
 operating+0.030
 ambul+0.030
 Thomas+0.029
Neg logit contributions
ill-0.036
 Jill-0.036
 tad-0.036
 Sophia-0.034
ulu-0.033
Token.
Token ID13
Feature activation+1.77e-01

Pos logit contributions
cel+0.017
 Junior+0.016
 ambul+0.015
uf+0.015
 operating+0.015
Neg logit contributions
 tad-0.024
 Jill-0.024
ill-0.023
 Sophia-0.023
 Pam-0.022
Token He
Token ID679
Feature activation+1.32e+00

Pos logit contributions
cel+0.006
 Junior+0.006
 Thomas+0.006
uf+0.006
 operating+0.006
Neg logit contributions
ill-0.005
 tad-0.005
 Sophia-0.005
ilies-0.005
ulu-0.005
Token was
Token ID373
Feature activation+1.93e-01

Pos logit contributions
cel+0.048
 Junior+0.046
 ambul+0.044
uf+0.043
 operating+0.043
Neg logit contributions
 Jill-0.040
 tad-0.039
ill-0.038
 Sophia-0.037
 Pam-0.035
Token a
Token ID257
Feature activation+5.72e-01

Pos logit contributions
cel+0.006
 Junior+0.006
 ambul+0.006
uf+0.006
 operating+0.006
Neg logit contributions
 Jill-0.006
 tad-0.006
ill-0.006
 Sophia-0.006
 Pam-0.006
Token man
Token ID582
Feature activation+2.11e+00

Pos logit contributions
cel+0.020
 Junior+0.020
 ambul+0.019
 operating+0.019
uf+0.019
Neg logit contributions
ill-0.018
 Jill-0.017
 Sophia-0.016
 tad-0.016
ilies-0.015
Token with
Token ID351
Feature activation-5.81e-01

Pos logit contributions
cel+0.064
 Junior+0.063
 ambul+0.059
brate+0.057
hers+0.057
Neg logit contributions
 Jill-0.073
ill-0.073
 Sophia-0.069
 tad-0.069
 Sue-0.064
Token a
Token ID257
Feature activation-4.25e-01

Pos logit contributions
ill+0.019
 Jill+0.018
 tad+0.018
 Sophia+0.018
 Pam+0.016
Neg logit contributions
cel-0.019
 Junior-0.019
 ambul-0.018
 Thomas-0.018
uf-0.017
Token hat
Token ID6877
Feature activation-6.88e-01

Pos logit contributions
ill+0.015
 Jill+0.014
 tad+0.013
 Sophia+0.013
ilies+0.012
Neg logit contributions
 Junior-0.014
cel-0.014
 ambul-0.013
 operating-0.013
 Thomas-0.013
Token and
Token ID290
Feature activation-1.43e+00

Pos logit contributions
 Jill+0.021
ill+0.021
 tad+0.020
 Sophia+0.020
 Pam+0.018
Neg logit contributions
cel-0.024
 Junior-0.023
 ambul-0.022
uf-0.022
 operating-0.022
Token.
Token ID13
Feature activation+2.24e-01

Pos logit contributions
cel+0.006
 tid+0.006
 operating+0.005
 ambul+0.005
 Junior+0.005
Neg logit contributions
 tad-0.009
 Sophia-0.008
 Jill-0.008
ill-0.008
 Pam-0.008
Token He
Token ID679
Feature activation+1.08e+00

Pos logit contributions
cel+0.010
 ambul+0.010
ind+0.010
hers+0.010
 Junior+0.009
Neg logit contributions
 Jill-0.005
 Sophia-0.004
ill-0.004
 Sue-0.004
 tad-0.004
Token concentrated
Token ID17298
Feature activation+7.71e-01

Pos logit contributions
cel+0.039
 Junior+0.038
 ambul+0.036
uf+0.036
 operating+0.035
Neg logit contributions
 Jill-0.033
 tad-0.032
ill-0.031
 Sophia-0.030
 Pam-0.028
Token very
Token ID845
Feature activation-4.03e-01

Pos logit contributions
cel+0.027
 Junior+0.026
uf+0.024
 ambul+0.024
 operating+0.024
Neg logit contributions
 Jill-0.024
 tad-0.023
ill-0.023
 Sophia-0.023
 Pam-0.021
Token hard
Token ID1327
Feature activation+1.18e+00

Pos logit contributions
 Jill+0.011
 tad+0.010
ill+0.010
 Sophia+0.010
 Pam+0.009
Neg logit contributions
cel-0.016
 Junior-0.015
 ambul-0.015
uf-0.014
Brown-0.014
Token and
Token ID290
Feature activation-4.81e-01

Pos logit contributions
cel+0.048
 Junior+0.047
uf+0.045
Brown+0.045
 ambul+0.044
Neg logit contributions
 Jill-0.029
ill-0.028
 Sophia-0.026
 tad-0.025
 chance-0.025
Token cast
Token ID3350
Feature activation-9.41e-01

Pos logit contributions
ill+0.017
 tad+0.017
 Jill+0.016
 Sophia+0.016
aney+0.015
Neg logit contributions
cel-0.015
 tid-0.014
 Junior-0.014
 ambul-0.013
 operating-0.013
Token his
Token ID465
Feature activation-6.77e-01

Pos logit contributions
 Jill+0.028
 tad+0.027
ill+0.026
 Sophia+0.026
 Pam+0.024
Neg logit contributions
cel-0.034
 Junior-0.033
 ambul-0.032
uf-0.032
 operating-0.031
Token first
Token ID717
Feature activation-3.18e+00

Pos logit contributions
ill+0.023
 Jill+0.023
 tad+0.023
 Sophia+0.023
ulu+0.021
Neg logit contributions
cel-0.022
 tid-0.021
 ambul-0.020
 operating-0.020
 Junior-0.020
Token spell
Token ID4822
Feature activation-3.00e-01

Pos logit contributions
 tad+0.106
 Sophia+0.102
 Jill+0.101
ill+0.098
 Pam+0.096
Neg logit contributions
cel-0.104
 tid-0.099
 operating-0.095
 Junior-0.093
 ambul-0.091
Token.
Token ID13
Feature activation+2.27e-01

Pos logit contributions
 Jill+0.010
 tad+0.010
ill+0.010
 Sophia+0.010
 Pam+0.009
Neg logit contributions
cel-0.010
 Junior-0.009
 ambul-0.009
 operating-0.009
 tid-0.009
Token hit
Token ID2277
Feature activation-7.63e-01

Pos logit contributions
ill+0.012
 tad+0.012
 Sophia+0.011
 Jill+0.011
aney+0.011
Neg logit contributions
 Junior-0.013
cel-0.013
 Thomas-0.012
 tid-0.012
 ambul-0.012
Token her
Token ID607
Feature activation-1.67e+00

Pos logit contributions
 tad+0.027
ill+0.026
 Sophia+0.024
aney+0.024
 crab+0.024
Neg logit contributions
cel-0.023
 Junior-0.022
 Thomas-0.022
 tid-0.021
 ambul-0.020
Token.
Token ID13
Feature activation-3.14e-01

Pos logit contributions
 Jill+0.056
 tad+0.056
ill+0.054
 Sophia+0.054
ulu+0.050
Neg logit contributions
cel-0.053
 Junior-0.051
 Thomas-0.048
 tid-0.048
 operating-0.048
Token
Token ID220
Feature activation-4.32e-01

Pos logit contributions
 tad+0.011
ill+0.010
ilies+0.009
 crab+0.009
ulu+0.009
Neg logit contributions
cel-0.010
 Junior-0.010
 Thomas-0.010
Brown-0.009
 Min-0.009
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
 tad+0.016
 crab+0.014
 Jill+0.014
 Sophia+0.014
ill+0.014
Neg logit contributions
cel-0.012
 Junior-0.012
 Thomas-0.011
Brown-0.011
uf-0.011
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
 tad+0.004
 Jill+0.004
ill+0.004
 crab+0.004
 bou+0.004
Neg logit contributions
 Junior-0.003
cel-0.003
Brown-0.003
Thomas-0.003
 Thomas-0.003
TokenL
Token ID43
Feature activation-2.01e+00

Pos logit contributions
cel+0.004
 Junior+0.003
Brown+0.003
Thomas+0.003
 operating+0.003
Neg logit contributions
 tad-0.004
 Jill-0.004
ill-0.004
 Sophia-0.004
 Pam-0.003
Tokenily
Token ID813
Feature activation-1.72e+00

Pos logit contributions
 tad+0.061
 bou+0.053
 net+0.051
 low+0.051
 crab+0.051
Neg logit contributions
ise-0.067
ice-0.066
 years-0.065
uf-0.064
innie-0.063
Token's
Token ID338
Feature activation+1.68e+00

Pos logit contributions
 Jill+0.045
 tad+0.044
ill+0.043
 Sophia+0.042
 Pam+0.039
Neg logit contributions
 Junior-0.066
cel-0.066
 ambul-0.063
uf-0.062
 operating-0.062
Token mom
Token ID1995
Feature activation+1.70e+00

Pos logit contributions
 Junior+0.050
cel+0.049
 tid+0.049
 Thomas+0.048
 operating+0.046
Neg logit contributions
ill-0.060
 Jill-0.058
 tad-0.057
 Sophia-0.057
ulu-0.053
Token rushed
Token ID13999
Feature activation+7.69e-01

Pos logit contributions
cel+0.061
 Junior+0.060
uf+0.058
 ambul+0.057
 operating+0.056
Neg logit contributions
 Jill-0.050
ill-0.049
 tad-0.047
 Sophia-0.046
 Pam-0.043
Token decided
Token ID3066
Feature activation-1.68e+00

Pos logit contributions
 Jill+0.002
 tad+0.002
ill+0.002
 Sophia+0.002
 Pam+0.002
Neg logit contributions
cel-0.003
 Junior-0.003
 operating-0.003
 ambul-0.003
uf-0.003
Token to
Token ID284
Feature activation+1.30e-01

Pos logit contributions
 Jill+0.043
 tad+0.041
ill+0.041
 Sophia+0.040
 Pam+0.037
Neg logit contributions
cel-0.068
 Junior-0.066
 ambul-0.063
uf-0.062
 operating-0.062
Token yield
Token ID7800
Feature activation+5.66e-01

Pos logit contributions
cel+0.005
 Junior+0.005
 ambul+0.005
uf+0.005
 operating+0.005
Neg logit contributions
 Jill-0.003
 tad-0.003
ill-0.003
 Sophia-0.003
 Pam-0.003
Token and
Token ID290
Feature activation-1.86e-01

Pos logit contributions
cel+0.022
 Junior+0.021
uf+0.020
 ambul+0.020
 operating+0.019
Neg logit contributions
 Jill-0.016
 tad-0.015
ill-0.015
 Sophia-0.015
 Pam-0.014
Token let
Token ID1309
Feature activation-9.06e-01

Pos logit contributions
ill+0.006
 Jill+0.006
 tad+0.006
 Sophia+0.006
 Pam+0.005
Neg logit contributions
cel-0.006
 Junior-0.006
 ambul-0.006
 tid-0.006
 operating-0.006
Token the
Token ID262
Feature activation-1.82e-01

Pos logit contributions
 Jill+0.023
 tad+0.021
 Pam+0.021
 hers+0.021
 Sophia+0.021
Neg logit contributions
cel-0.035
uf-0.032
 years-0.031
 Junior-0.031
ise-0.030
Token truck
Token ID7779
Feature activation-9.67e-02

Pos logit contributions
 Jill+0.006
 tad+0.006
ill+0.005
 Sophia+0.005
 Pam+0.005
Neg logit contributions
cel-0.006
 Junior-0.006
uf-0.006
 operating-0.006
 ambul-0.006
Token pass
Token ID1208
Feature activation-1.69e+00

Pos logit contributions
 tad+0.003
 Sophia+0.003
 Jill+0.003
ill+0.003
 Meg+0.003
Neg logit contributions
 ambul-0.003
cel-0.003
 operating-0.003
 tid-0.003
 Junior-0.003
Token.
Token ID13
Feature activation-2.60e-01

Pos logit contributions
 tad+0.060
ill+0.058
 Sophia+0.058
 Jill+0.058
Meg+0.054
Neg logit contributions
 Junior-0.048
cel-0.047
 ambul-0.046
 Thomas-0.046
hers-0.044
Token After
Token ID2293
Feature activation-8.55e-01

Pos logit contributions
 Jill+0.006
 tad+0.006
 Sophia+0.006
ill+0.006
 Pam+0.005
Neg logit contributions
cel-0.010
 Junior-0.010
uf-0.010
ise-0.010
 ambul-0.010
Token the
Token ID262
Feature activation-3.46e-01

Pos logit contributions
 Jill+0.027
 tad+0.027
 Sophia+0.026
ill+0.025
 Pam+0.024
Neg logit contributions
cel-0.029
 Junior-0.028
uf-0.026
 operating-0.026
 ambul-0.026
Token named
Token ID3706
Feature activation-2.07e+00

Pos logit contributions
 Jill+0.019
ill+0.017
 Sophia+0.017
 tad+0.017
 Pam+0.016
Neg logit contributions
 Junior-0.021
cel-0.020
hers-0.019
Thomas-0.019
 ambul-0.019
Token Lily
Token ID20037
Feature activation-1.43e+00

Pos logit contributions
 tad+0.057
ill+0.048
alk+0.047
 crab+0.046
 shell+0.045
Neg logit contributions
cel-0.082
 tid-0.075
 Junior-0.075
 Thomas-0.073
uf-0.071
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
 Jill+0.035
ill+0.032
 Sue+0.030
 flower+0.029
 Sophia+0.028
Neg logit contributions
cel-0.061
 Junior-0.061
 ambul-0.059
riger-0.059
brate-0.059
Token She
Token ID1375
Feature activation+1.33e-01

Pos logit contributions
cel+0.005
 tid+0.005
 Thomas+0.005
 Junior+0.005
 operating+0.005
Neg logit contributions
 tad-0.005
 crab-0.004
 Sophia-0.004
ill-0.004
ilies-0.004
Token had
Token ID550
Feature activation-6.25e-01

Pos logit contributions
 Junior+0.004
cel+0.004
 ambul+0.004
uf+0.004
 operating+0.004
Neg logit contributions
 Jill-0.004
 tad-0.004
 Sophia-0.004
ill-0.004
 Pam-0.004
Token a
Token ID257
Feature activation-6.56e-01

Pos logit contributions
ill+0.021
 tad+0.021
 Jill+0.021
 Sophia+0.020
 Pam+0.019
Neg logit contributions
cel-0.020
 Junior-0.019
 ambul-0.018
 tid-0.018
 Thomas-0.018
Token big
Token ID1263
Feature activation-2.91e-01

Pos logit contributions
ill+0.021
 Jill+0.020
 Sophia+0.019
 tad+0.019
 Meg+0.017
Neg logit contributions
cel-0.023
 Junior-0.023
 tid-0.022
uf-0.021
 ambul-0.021
Token heart
Token ID2612
Feature activation-1.83e+00

Pos logit contributions
ill+0.009
 Jill+0.009
 tad+0.009
 Sophia+0.008
 Pam+0.008
Neg logit contributions
cel-0.010
 Junior-0.010
 ambul-0.009
 tid-0.009
uf-0.009
Token and
Token ID290
Feature activation-9.77e-01

Pos logit contributions
 Jill+0.067
ill+0.065
 tad+0.065
 Sophia+0.064
 Pam+0.060
Neg logit contributions
cel-0.053
 Junior-0.052
 ambul-0.048
uf-0.048
 Thomas-0.047
Token was
Token ID373
Feature activation-2.64e-01

Pos logit contributions
ill+0.044
 tad+0.043
 Jill+0.041
 Sophia+0.040
 Pam+0.039
Neg logit contributions
cel-0.022
 Junior-0.021
 tid-0.020
 ambul-0.019
 Thomas-0.019
Token always
Token ID1464
Feature activation+1.13e+00

Pos logit contributions
 Jill+0.009
 tad+0.009
 Sophia+0.009
ill+0.008
 Pam+0.008
Neg logit contributions
cel-0.008
 Junior-0.008
uf-0.008
 ambul-0.008
 operating-0.007
Token asked
Token ID1965
Feature activation+5.42e-01

Pos logit contributions
cel+0.009
 Junior+0.009
 ambul+0.008
 operating+0.008
uf+0.008
Neg logit contributions
 Jill-0.007
 tad-0.007
ill-0.007
 Sophia-0.006
 Pam-0.006
Token her
Token ID607
Feature activation-2.18e-01

Pos logit contributions
cel+0.024
 Junior+0.024
 ambul+0.023
 operating+0.022
uf+0.022
Neg logit contributions
 tad-0.012
 Jill-0.011
ill-0.011
 Sophia-0.011
 Pam-0.009
Token mom
Token ID1995
Feature activation+1.16e+00

Pos logit contributions
 tad+0.006
 Jill+0.006
ill+0.006
 Sophia+0.006
 Pam+0.006
Neg logit contributions
cel-0.008
 Junior-0.007
 ambul-0.007
 tid-0.007
 operating-0.007
Token for
Token ID329
Feature activation+1.23e+00

Pos logit contributions
cel+0.032
 Junior+0.030
 ambul+0.028
uf+0.028
 operating+0.028
Neg logit contributions
 Jill-0.045
 tad-0.044
ill-0.043
 Sophia-0.043
 Pam-0.040
Token help
Token ID1037
Feature activation+1.24e+00

Pos logit contributions
cel+0.039
 Junior+0.038
 ambul+0.035
uf+0.035
 operating+0.035
Neg logit contributions
 Jill-0.043
 tad-0.042
ill-0.041
 Sophia-0.040
 Pam-0.038
Token.
Token ID13
Feature activation-1.75e-02

Pos logit contributions
cel+0.045
 Junior+0.043
 ambul+0.041
uf+0.041
Brown+0.041
Neg logit contributions
 Jill-0.038
ill-0.037
 Sophia-0.035
 tad-0.034
 Pam-0.032
Token
Token ID198
Feature activation-6.44e-01

Pos logit contributions
 tad+0.001
ill+0.001
ilies+0.001
 crab+0.001
ulu+0.001
Neg logit contributions
 Junior-0.000
cel-0.000
 Thomas-0.000
uf-0.000
 operating-0.000
Token
Token ID198
Feature activation-6.40e-01

Pos logit contributions
 tad+0.021
 Jill+0.020
ill+0.020
 Sophia+0.019
 Pam+0.018
Neg logit contributions
 Junior-0.021
cel-0.020
 Thomas-0.019
uf-0.019
Brown-0.019
Token"
Token ID1
Feature activation-3.28e-01

Pos logit contributions
 Jill+0.017
 tad+0.016
ill+0.016
 Sophia+0.016
 Pam+0.014
Neg logit contributions
cel-0.026
 Junior-0.025
 ambul-0.024
uf-0.023
 operating-0.023
TokenMom
Token ID29252
Feature activation+7.98e-01

Pos logit contributions
 Jill+0.011
 tad+0.011
ill+0.011
 Sophia+0.011
 Pam+0.010
Neg logit contributions
cel-0.010
 Junior-0.010
 ambul-0.009
uf-0.009
 operating-0.009
Token,
Token ID11
Feature activation-2.46e-01

Pos logit contributions
cel+0.029
 Junior+0.028
 ambul+0.028
ind+0.027
hers+0.027
Neg logit contributions
 Jill-0.025
ill-0.024
 Sophia-0.023
 Sue-0.021
 Meg-0.021
Token even
Token ID772
Feature activation+1.62e+00

Pos logit contributions
 tad+0.028
 Jill+0.028
ill+0.028
 Sophia+0.027
 Pam+0.025
Neg logit contributions
cel-0.026
 Junior-0.026
 ambul-0.024
hers-0.024
 tid-0.024
Token more
Token ID517
Feature activation-2.32e+00

Pos logit contributions
cel+0.053
 Junior+0.052
 tid+0.051
 ambul+0.051
 operating+0.050
Neg logit contributions
ill-0.050
 Jill-0.049
 Sophia-0.048
 tad-0.048
 Pam-0.044
Token lively
Token ID29696
Feature activation-4.26e-01

Pos logit contributions
ill+0.072
 Jill+0.071
 tad+0.069
 Sophia+0.069
 Pam+0.065
Neg logit contributions
 Junior-0.076
cel-0.075
 tid-0.073
 ambul-0.072
 operating-0.071
Token than
Token ID621
Feature activation+1.35e+00

Pos logit contributions
 Jill+0.016
 tad+0.016
ill+0.015
 Sophia+0.015
 Pam+0.014
Neg logit contributions
cel-0.012
 Junior-0.012
 ambul-0.011
uf-0.011
 operating-0.011
Token the
Token ID262
Feature activation-1.09e+00

Pos logit contributions
 Thomas+0.033
 disturbing+0.032
 Junior+0.031
 ambul+0.031
 years+0.031
Neg logit contributions
 tad-0.050
ashes-0.048
 pitcher-0.046
 Goal-0.046
ill-0.046
Token first
Token ID717
Feature activation-4.38e+00

Pos logit contributions
ill+0.037
aney+0.035
uttered+0.034
 Jill+0.033
alk+0.033
Neg logit contributions
 disturbing-0.029
 Thomas-0.029
 operating-0.028
 years-0.028
 Junior-0.028
Token one
Token ID530
Feature activation-1.91e+00

Pos logit contributions
ill+0.147
 Jill+0.142
 tad+0.142
 Pam+0.136
 bou+0.136
Neg logit contributions
 Junior-0.117
ise-0.117
 years-0.115
cel-0.112
uf-0.110
Token.
Token ID13
Feature activation+1.72e-01

Pos logit contributions
 tad+0.083
 Goal+0.078
 bou+0.078
ill+0.077
 Jill+0.076
Neg logit contributions
 years-0.034
 why-0.034
 Junior-0.033
 kil-0.031
 Thomas-0.031
Token She
Token ID1375
Feature activation+3.69e-01

Pos logit contributions
cel+0.006
 Junior+0.006
 Thomas+0.006
uf+0.006
 Michael+0.005
Neg logit contributions
 tad-0.005
ill-0.005
ulu-0.005
 chance-0.004
 Jill-0.004
Token held
Token ID2714
Feature activation+5.97e-01

Pos logit contributions
cel+0.012
 Junior+0.011
 tid+0.011
 Thomas+0.011
 ambul+0.010
Neg logit contributions
 Jill-0.012
ill-0.012
 Sophia-0.012
 tad-0.012
 Pam-0.011
Token out
Token ID503
Feature activation+5.03e-01

Pos logit contributions
 Junior+0.016
 Thomas+0.016
cel+0.016
 Michael+0.015
 Char+0.014
Neg logit contributions
ill-0.022
 tad-0.022
aney-0.020
ashes-0.020
 Jill-0.020
Token brought
Token ID3181
Feature activation+9.91e-01

Pos logit contributions
cel+0.072
 Junior+0.069
 ambul+0.066
uf+0.065
 operating+0.065
Neg logit contributions
 Jill-0.064
ill-0.062
 tad-0.061
 Sophia-0.060
 Pam-0.056
Token out
Token ID503
Feature activation+2.10e+00

Pos logit contributions
 Junior+0.033
cel+0.032
 tid+0.031
 ambul+0.030
 Thomas+0.030
Neg logit contributions
 tad-0.032
ill-0.031
 Jill-0.031
 Sophia-0.030
 Pam-0.028
Token a
Token ID257
Feature activation+9.70e-01

Pos logit contributions
 Junior+0.063
cel+0.061
 ambul+0.059
 tid+0.059
 operating+0.057
Neg logit contributions
 tad-0.072
ill-0.071
 Jill-0.070
 Sophia-0.069
 Pam-0.066
Token big
Token ID1263
Feature activation+9.16e-01

Pos logit contributions
 Junior+0.033
cel+0.033
 tid+0.032
 operating+0.031
 ambul+0.031
Neg logit contributions
ill-0.031
 Jill-0.029
 Sophia-0.028
 tad-0.027
 Lia-0.026
Token pot
Token ID1787
Feature activation+2.73e+00

Pos logit contributions
cel+0.035
 Junior+0.034
 tid+0.032
 ambul+0.032
 operating+0.032
Neg logit contributions
ill-0.025
 Jill-0.025
 tad-0.024
 Sophia-0.024
 Pam-0.021
Token of
Token ID286
Feature activation-3.13e+00

Pos logit contributions
cel+0.102
 Junior+0.100
 tid+0.096
 operating+0.095
 ambul+0.094
Neg logit contributions
 tad-0.074
 Jill-0.074
 Sophia-0.071
ill-0.069
 Pam-0.066
Token spaghetti
Token ID48241
Feature activation+3.49e+00

Pos logit contributions
 Jill+0.095
ill+0.093
 tad+0.093
 Sophia+0.091
 Pam+0.084
Neg logit contributions
cel-0.109
 Junior-0.107
 ambul-0.101
uf-0.099
 operating-0.099
Token.
Token ID13
Feature activation-6.95e-02

Pos logit contributions
cel+0.104
 Junior+0.100
 ambul+0.093
 tid+0.092
 operating+0.092
Neg logit contributions
 Jill-0.124
 tad-0.122
ill-0.120
 Sophia-0.119
 Pam-0.112
Token Tim
Token ID5045
Feature activation-8.92e-01

Pos logit contributions
ill+0.002
 tad+0.002
 Sophia+0.002
ulu+0.002
 Jill+0.002
Neg logit contributions
 Junior-0.002
cel-0.002
 ambul-0.002
 operating-0.002
 Thomas-0.002
Tokenmy
Token ID1820
Feature activation+6.49e-01

Pos logit contributions
 Jill+0.030
ill+0.030
 tad+0.029
 Sophia+0.028
 Pam+0.027
Neg logit contributions
cel-0.028
 Junior-0.027
 ambul-0.026
 operating-0.026
uf-0.025
Token was
Token ID373
Feature activation+1.03e-01

Pos logit contributions
cel+0.025
 Junior+0.024
 ambul+0.023
 operating+0.023
uf+0.022
Neg logit contributions
 Jill-0.018
 tad-0.018
ill-0.018
 Sophia-0.017
 Pam-0.016
Token Joe
Token ID5689
Feature activation+1.32e+00

Pos logit contributions
cel+0.009
 Junior+0.009
 ambul+0.009
uf+0.009
 Thomas+0.009
Neg logit contributions
 Jill-0.009
 tad-0.009
ill-0.009
 Sophia-0.008
 Pam-0.008
Token.
Token ID13
Feature activation+7.10e-01

Pos logit contributions
cel+0.044
 Junior+0.042
 ambul+0.041
uf+0.041
 operating+0.040
Neg logit contributions
 Jill-0.044
ill-0.042
 Sophia-0.040
 tad-0.038
 Pam-0.036
Token It
Token ID632
Feature activation-9.38e-01

Pos logit contributions
cel+0.026
 Junior+0.025
 ambul+0.024
uf+0.024
 operating+0.023
Neg logit contributions
 Jill-0.021
 tad-0.020
ill-0.020
 Sophia-0.019
 Pam-0.018
Token will
Token ID481
Feature activation-1.54e+00

Pos logit contributions
 Jill+0.029
 tad+0.028
ill+0.028
 Sophia+0.027
 Pam+0.025
Neg logit contributions
cel-0.031
 Junior-0.031
uf-0.031
 ambul-0.030
ind-0.029
Token unlock
Token ID12116
Feature activation-6.62e-02

Pos logit contributions
 Jill+0.041
 tad+0.039
ill+0.039
 Sophia+0.037
 Pam+0.036
Neg logit contributions
cel-0.061
 Junior-0.059
uf-0.057
 ambul-0.056
 operating-0.055
Token more
Token ID517
Feature activation-3.54e+00

Pos logit contributions
 Jill+0.002
 tad+0.002
ill+0.002
 Sophia+0.002
 Pam+0.002
Neg logit contributions
cel-0.002
 Junior-0.002
 ambul-0.002
uf-0.002
 operating-0.002
Token exciting
Token ID7895
Feature activation-1.49e+00

Pos logit contributions
 Jill+0.110
 tad+0.106
ill+0.105
 Sophia+0.103
 Pam+0.095
Neg logit contributions
cel-0.124
 Junior-0.121
uf-0.113
 ambul-0.113
 operating-0.110
Token days
Token ID1528
Feature activation-8.29e-01

Pos logit contributions
 Jill+0.045
 tad+0.042
ill+0.041
 Sophia+0.040
 Meg+0.036
Neg logit contributions
cel-0.055
 Junior-0.055
uf-0.053
Brown-0.049
 ambul-0.049
Token for
Token ID329
Feature activation+3.88e-01

Pos logit contributions
 Jill+0.027
ill+0.027
 tad+0.027
 Sophia+0.026
 Pam+0.024
Neg logit contributions
cel-0.027
 Junior-0.026
 ambul-0.025
uf-0.024
 operating-0.024
Token you
Token ID345
Feature activation-1.03e+00

Pos logit contributions
cel+0.013
 Junior+0.012
 ambul+0.012
 operating+0.012
uf+0.011
Neg logit contributions
 Jill-0.013
 tad-0.012
 Sophia-0.012
ill-0.012
 Pam-0.011
Token."
Token ID526
Feature activation-3.58e-01

Pos logit contributions
 tad+0.040
 Jill+0.039
 Sophia+0.038
ill+0.038
 Pam+0.037
Neg logit contributions
cel-0.027
 Junior-0.026
 tid-0.024
 ambul-0.024
 Thomas-0.024
Token't
Token ID470
Feature activation+3.30e-01

Pos logit contributions
 Jill+0.011
 Sophia+0.010
ill+0.010
 tad+0.009
 Meg+0.009
Neg logit contributions
cel-0.016
 Junior-0.016
uf-0.015
 ambul-0.015
ind-0.015
Token let
Token ID1309
Feature activation-5.41e-01

Pos logit contributions
cel+0.013
 Junior+0.013
 ambul+0.012
uf+0.012
hers+0.012
Neg logit contributions
 Jill-0.009
ill-0.009
 tad-0.009
 Sophia-0.008
 Pam-0.008
Token me
Token ID502
Feature activation-8.44e-01

Pos logit contributions
 Jill+0.017
 Sophia+0.016
ill+0.016
 tad+0.016
 Pam+0.015
Neg logit contributions
cel-0.019
 Junior-0.018
 ambul-0.017
uf-0.017
hers-0.017
Token wear
Token ID5806
Feature activation-5.13e-01

Pos logit contributions
 tad+0.028
 Sophia+0.026
ill+0.026
 Jill+0.026
 Lia+0.024
Neg logit contributions
cel-0.027
 tid-0.026
 Junior-0.026
 operating-0.025
uf-0.025
Token her
Token ID607
Feature activation-8.80e-01

Pos logit contributions
 Jill+0.016
 tad+0.015
ill+0.015
 Sophia+0.015
 Pam+0.014
Neg logit contributions
cel-0.018
 Junior-0.017
 ambul-0.017
uf-0.016
 operating-0.016
Token dress
Token ID6576
Feature activation-3.28e+00

Pos logit contributions
 tad+0.028
ill+0.028
 Jill+0.026
 Sophia+0.026
 Pam+0.025
Neg logit contributions
cel-0.032
 tid-0.030
 years-0.027
 Thomas-0.027
 Junior-0.027
Token!"
Token ID2474
Feature activation-1.06e+00

Pos logit contributions
 tad+0.137
 Jill+0.125
 Sophia+0.123
 Pam+0.121
ill+0.120
Neg logit contributions
cel-0.088
 Junior-0.079
 tid-0.076
 Thomas-0.074
hers-0.074
Token Molly
Token ID30236
Feature activation+1.27e-01

Pos logit contributions
 Jill+0.029
 tad+0.028
ill+0.028
 Sophia+0.027
 Pam+0.025
Neg logit contributions
cel-0.042
 Junior-0.040
 ambul-0.039
uf-0.038
 operating-0.038
Token shook
Token ID14682
Feature activation+1.52e+00

Pos logit contributions
cel+0.005
 Junior+0.005
 ambul+0.004
 operating+0.004
uf+0.004
Neg logit contributions
 Jill-0.004
 tad-0.004
ill-0.004
 Sophia-0.003
 Pam-0.003
Token her
Token ID607
Feature activation-1.12e+00

Pos logit contributions
cel+0.068
 Junior+0.066
 ambul+0.064
uf+0.063
 operating+0.063
Neg logit contributions
 Jill-0.032
 tad-0.031
ill-0.031
 Sophia-0.030
 Pam-0.027
Token head
Token ID1182
Feature activation-1.61e+00

Pos logit contributions
 Jill+0.035
 tad+0.035
ill+0.034
 Sophia+0.034
 Pam+0.031
Neg logit contributions
cel-0.038
 Junior-0.037
 ambul-0.035
uf-0.035
 Thomas-0.034
Token the
Token ID262
Feature activation-6.97e-01

Pos logit contributions
 Jill+0.024
 tad+0.024
ill+0.024
 Sophia+0.023
 Pam+0.022
Neg logit contributions
cel-0.027
 Junior-0.026
 ambul-0.025
uf-0.024
 operating-0.024
Token mess
Token ID2085
Feature activation+5.81e-01

Pos logit contributions
 Jill+0.023
 tad+0.022
ill+0.022
 Sophia+0.021
 Pam+0.020
Neg logit contributions
cel-0.024
 Junior-0.023
 ambul-0.022
 operating-0.021
uf-0.021
Token and
Token ID290
Feature activation+3.56e-01

Pos logit contributions
cel+0.020
 Junior+0.019
 ambul+0.018
 tid+0.018
 operating+0.018
Neg logit contributions
 Jill-0.018
 tad-0.018
 Sophia-0.018
ill-0.018
 Pam-0.017
Token ate
Token ID15063
Feature activation-2.04e-01

Pos logit contributions
cel+0.012
 Junior+0.012
 ambul+0.011
uf+0.011
 operating+0.011
Neg logit contributions
 Jill-0.012
 tad-0.011
ill-0.011
 Sophia-0.011
 Pam-0.010
Token some
Token ID617
Feature activation-1.12e+00

Pos logit contributions
 tad+0.006
 Jill+0.006
ill+0.006
 Sophia+0.006
 Pam+0.006
Neg logit contributions
cel-0.007
 Junior-0.007
 tid-0.006
 ambul-0.006
 operating-0.006
Token of
Token ID286
Feature activation-3.11e+00

Pos logit contributions
 Jill+0.033
ill+0.032
 tad+0.032
 Sophia+0.031
 Pam+0.029
Neg logit contributions
cel-0.040
 Junior-0.039
 ambul-0.037
 tid-0.037
 operating-0.037
Token the
Token ID262
Feature activation-4.63e-01

Pos logit contributions
ill+0.107
 tad+0.106
 Jill+0.104
 Sophia+0.102
 Pam+0.097
Neg logit contributions
 Junior-0.093
cel-0.091
 ambul-0.086
 tid-0.085
uf-0.084
Token sandwiches
Token ID29777
Feature activation-1.36e-01

Pos logit contributions
 Jill+0.015
ill+0.015
 Sophia+0.014
 tad+0.014
 Pam+0.014
Neg logit contributions
cel-0.015
 Junior-0.014
 ambul-0.014
 tid-0.014
 operating-0.014
Token.
Token ID13
Feature activation+6.07e-01

Pos logit contributions
 tad+0.006
 Jill+0.006
 Pam+0.006
 Sophia+0.006
Meg+0.006
Neg logit contributions
 tid-0.003
 Junior-0.002
cel-0.002
hers-0.002
 operating-0.002
Token They
Token ID1119
Feature activation+7.35e-01

Pos logit contributions
cel+0.023
 Junior+0.023
 Thomas+0.022
uf+0.022
 ambul+0.022
Neg logit contributions
 tad-0.016
ill-0.016
 Jill-0.016
 Sophia-0.015
 Pam-0.014
Token agreed
Token ID4987
Feature activation-6.29e-01

Pos logit contributions
cel+0.027
 Junior+0.026
 ambul+0.025
 operating+0.024
uf+0.024
Neg logit contributions
 Jill-0.022
 tad-0.021
ill-0.021
 Sophia-0.021
 Pam-0.019