TOP ACTIVATIONS
MAX = 0.166

and nig ella and f enn el and an ise and
car , and the fire hyd rant are wet . Lily
The car hits a fire hyd rant . The fire hyd
hyd rant . The fire hyd rant breaks . Water spr
the car and the fire hyd rant . They take the
" ch oo cho o cho o ". They
spray from a nearby fire hyd rant to keep her dry
and the lightning struck the hyd rant . Suddenly water was
see the corn and the hus ks on the ground .
was connected to a k enn el full of fierce dogs
a pepper and now my tum my hurts ."
on his legs and his tum my . He felt cold
hurts my mouth and my tum my ." Sam was scared
, oh no , oh no !" Mom said .
, oh no , oh no !" Mom said .
sick . He felt his tum my ac he . He
of the cake . His tum my r umbled . He
a bad cough and a hot head . She did not
" O uch , my tum my hurts !" She repeats
ate them up and his tum my was full .

MIN ACTIVATIONS
MIN = -0.154

who liked to collect or nam ents . Everywhere she went
for me ! Aren â t we both so
to help sort the or nam ents . She picked up
at the calendar to remind herself of the day .
ily liked to keep cater pill ars in a jar .
did something special . On Wed nes days she liked to
went she searched for or nam ents , and the more
she could carry the or nam ents to the place .
better now . She reminded herself of the promise her mom
Wed nes days because on Wed nes days she did something
take one of the or nam ents outside for a walk
she could carry the or nam ents back with her .
They give Spot more or nam ents . Spot leads them
you know you shouldn â t speed ? â
Jimmy was always excited for Wed nes days because it was
to look at the or nam ents on the tree .
! Tom enjoyed imagining so much , he decided
, " I love cater pill ars ! You can be
time playing with her or nam ents and making her special
here and I won â t let anything bad

INTERVAL 0.086 - 0.166
CONTAINS 0.164%

bird . Suddenly , his mom my came outside and saw
counted five , six , seven , eight . He even
: " Law is a rule that people make to keep
. " Oh no ," Ben said . "
too much . Mom says no . Lily is sad .

INTERVAL 0.006 - 0.086
CONTAINS 39.411%

mom smiled and said , " You are so creative and
he decided to climb up the tallest tree in the forest
and the truck was happy to be safe .
Her mom put a plate of hot food on the table
felt a little dizz y from all the excitement , he

INTERVAL -0.074 - 0.006
CONTAINS 59.553%

named Lily in the park . L ily
The little girl , named Emma , loved Lily very much
dad . They wished they could play and have fun .
thrilled that she could make up this story with only a
ak by themselves . They thought mom would not notice .

INTERVAL -0.154 - -0.074
CONTAINS 0.873%

down the street and people wearing bright costumes . It was
can . But it â s getting late ,
nervous he couldn 't bring himself to speak .
One day , Lily was playing with her toys when she
, Sara and Tom are playing with yarn in Sara 's
Token and
Token ID290
Feature activation-2.95e-03

Pos logit contributions
idd+0.009
 cho+0.008
etchup+0.008
 hyd+0.008
 canoe+0.008
Neg logit contributions
 herself-0.007
isy-0.006
 Her-0.006
 solve-0.006
 seem-0.006
Token nig
Token ID44873
Feature activation+1.08e-01

Pos logit contributions
 herself+0.004
 read+0.004
 solve+0.004
 Her+0.004
isy+0.004
Neg logit contributions
idd-0.006
 cho-0.006
etchup-0.005
mer-0.005
 hyd-0.005
Tokenella
Token ID12627
Feature activation+7.00e-03

Pos logit contributions
 cho+0.174
idd+0.169
etchup+0.164
 hyd+0.163
 ge+0.158
Neg logit contributions
 herself-0.188
 Her-0.174
 Whatever-0.171
isy-0.167
 aloud-0.166
Token and
Token ID290
Feature activation+3.13e-02

Pos logit contributions
idd+0.014
 cho+0.014
etchup+0.013
 hyd+0.013
 canoe+0.013
Neg logit contributions
 herself-0.010
 Her-0.009
isy-0.009
 about-0.008
 Whatever-0.008
Token f
Token ID277
Feature activation+8.89e-02

Pos logit contributions
idd+0.065
 cho+0.063
mer+0.060
etchup+0.059
 hyd+0.059
Neg logit contributions
 herself-0.042
 read-0.039
 solve-0.037
 wear-0.036
 seem-0.036
Tokenenn
Token ID1697
Feature activation+1.59e-01

Pos logit contributions
 cho+0.136
 hyd+0.132
etchup+0.121
 canoe+0.120
idd+0.119
Neg logit contributions
 herself-0.162
 Her-0.149
 aloud-0.143
 about-0.142
 Whatever-0.142
Tokenel
Token ID417
Feature activation+7.17e-02

Pos logit contributions
 hyd+0.273
 cho+0.270
idd+0.248
 canoe+0.241
etchup+0.236
Neg logit contributions
 herself-0.283
 Her-0.251
isy-0.239
 how-0.233
 about-0.232
Token and
Token ID290
Feature activation+3.35e-02

Pos logit contributions
idd+0.134
 cho+0.131
etchup+0.128
 hyd+0.127
 canoe+0.123
Neg logit contributions
 herself-0.111
 Her-0.098
isy-0.097
 Whatever-0.093
 about-0.092
Token an
Token ID281
Feature activation+6.85e-02

Pos logit contributions
idd+0.071
 cho+0.069
mer+0.066
etchup+0.065
 hyd+0.064
Neg logit contributions
 herself-0.044
 read-0.040
 solve-0.038
 wear-0.038
 seem-0.038
Tokenise
Token ID786
Feature activation+5.18e-02

Pos logit contributions
 hyd+0.127
idd+0.127
 cho+0.125
etchup+0.124
mer+0.115
Neg logit contributions
 herself-0.110
isy-0.099
 Her-0.097
 about-0.095
 Whatever-0.092
Token and
Token ID290
Feature activation+3.57e-02

Pos logit contributions
idd+0.104
 hyd+0.101
etchup+0.099
 cho+0.099
 canoe+0.096
Neg logit contributions
 herself-0.076
 Her-0.066
 how-0.061
 about-0.061
 sure-0.060
Token car
Token ID1097
Feature activation+5.60e-02

Pos logit contributions
idd+0.081
etchup+0.077
mer+0.077
enn+0.076
ang+0.071
Neg logit contributions
 herself-0.063
 solve-0.062
 read-0.059
 write-0.057
 wear-0.053
Token,
Token ID11
Feature activation+2.82e-02

Pos logit contributions
idd+0.111
 cho+0.101
etchup+0.101
 hyd+0.100
 canoe+0.099
Neg logit contributions
 herself-0.079
 solve-0.076
 read-0.074
 seem-0.073
 about-0.071
Token and
Token ID290
Feature activation-8.06e-03

Pos logit contributions
idd+0.075
mer+0.070
enn+0.066
rim+0.065
etchup+0.063
Neg logit contributions
 herself-0.031
 how-0.028
 her-0.025
 about-0.024
 read-0.021
Token the
Token ID262
Feature activation+4.71e-02

Pos logit contributions
 herself+0.009
 how+0.007
 read+0.007
 her+0.007
 solve+0.007
Neg logit contributions
idd-0.021
mer-0.019
enn-0.018
etchup-0.018
 cho-0.018
Token fire
Token ID2046
Feature activation+8.37e-02

Pos logit contributions
idd+0.087
mer+0.083
etchup+0.082
enn+0.082
ang+0.076
Neg logit contributions
 herself-0.067
 solve-0.066
 read-0.063
 write-0.060
 understand-0.057
Token hyd
Token ID7409
Feature activation+1.55e-01

Pos logit contributions
idd+0.161
joy+0.150
etchup+0.147
 canoe+0.145
enn+0.142
Neg logit contributions
 herself-0.107
 seem-0.106
 solve-0.103
 how-0.102
 about-0.102
Tokenrant
Token ID5250
Feature activation-3.30e-02

Pos logit contributions
idd+0.316
 cho+0.304
etchup+0.299
 canoe+0.287
 ge+0.285
Neg logit contributions
 herself-0.206
isy-0.182
 about-0.179
 Whatever-0.179
 seem-0.179
Token are
Token ID389
Feature activation+3.99e-02

Pos logit contributions
 herself+0.042
 seem+0.035
 about+0.035
 solve+0.034
 how+0.033
Neg logit contributions
idd-0.076
 cho-0.070
etchup-0.068
mer-0.067
 canoe-0.066
Token wet
Token ID9583
Feature activation+4.46e-02

Pos logit contributions
idd+0.090
mer+0.085
etchup+0.083
 cho+0.082
crow+0.081
Neg logit contributions
 herself-0.051
 about-0.045
 sure-0.042
 solve-0.042
 how-0.041
Token.
Token ID13
Feature activation+7.05e-03

Pos logit contributions
idd+0.121
etchup+0.119
enn+0.115
 canoe+0.114
 cho+0.113
Neg logit contributions
 herself-0.038
 about-0.026
 how-0.019
 all-0.019
 both-0.019
Token Lily
Token ID20037
Feature activation-3.23e-02

Pos logit contributions
 cho+0.016
idd+0.016
 hyd+0.015
mer+0.015
etchup+0.015
Neg logit contributions
 herself-0.008
 Her-0.007
 Whatever-0.007
 seem-0.006
 solve-0.006
Token The
Token ID383
Feature activation+2.44e-02

Pos logit contributions
 cho+0.003
idd+0.003
etchup+0.003
 hyd+0.003
 canoe+0.003
Neg logit contributions
 herself-0.002
isy-0.002
 solve-0.002
 Whatever-0.002
 Her-0.002
Token car
Token ID1097
Feature activation+4.58e-02

Pos logit contributions
idd+0.043
etchup+0.042
mer+0.041
 hyd+0.039
enn+0.039
Neg logit contributions
 solve-0.037
 herself-0.037
 read-0.035
 write-0.033
 wear-0.033
Token hits
Token ID7127
Feature activation+8.20e-03

Pos logit contributions
idd+0.085
etchup+0.080
 cho+0.080
 hyd+0.079
mer+0.076
Neg logit contributions
 herself-0.070
 solve-0.065
 read-0.065
 seem-0.063
 about-0.063
Token a
Token ID257
Feature activation-1.13e-02

Pos logit contributions
idd+0.018
 hyd+0.017
 cho+0.017
etchup+0.017
mer+0.017
Neg logit contributions
 herself-0.011
 Her-0.009
 Whatever-0.009
 solve-0.009
 aloud-0.009
Token fire
Token ID2046
Feature activation+7.70e-02

Pos logit contributions
 solve+0.014
 herself+0.014
isy+0.014
 read+0.013
 Whatever+0.013
Neg logit contributions
idd-0.024
etchup-0.022
 hyd-0.022
 cho-0.021
mer-0.021
Token hyd
Token ID7409
Feature activation+1.54e-01

Pos logit contributions
idd+0.167
etchup+0.158
 ge+0.150
rim+0.149
joy+0.148
Neg logit contributions
 about-0.090
 how-0.087
 herself-0.086
 solve-0.085
 read-0.084
Tokenrant
Token ID5250
Feature activation-3.79e-02

Pos logit contributions
idd+0.292
 cho+0.289
etchup+0.276
 hyd+0.270
 canoe+0.261
Neg logit contributions
 herself-0.227
isy-0.204
 solve-0.201
 Whatever-0.201
 Her-0.201
Token.
Token ID13
Feature activation+6.36e-03

Pos logit contributions
 herself+0.043
 about+0.038
 closely+0.035
 solve+0.035
 aloud+0.035
Neg logit contributions
idd-0.092
etchup-0.085
 cho-0.083
enn-0.079
 canoe-0.078
Token The
Token ID383
Feature activation+2.93e-02

Pos logit contributions
 cho+0.014
idd+0.014
 hyd+0.013
etchup+0.013
mer+0.013
Neg logit contributions
 herself-0.008
 Her-0.007
 Whatever-0.007
 solve-0.006
 seem-0.006
Token fire
Token ID2046
Feature activation+7.70e-02

Pos logit contributions
idd+0.054
mer+0.053
etchup+0.052
enn+0.051
boats+0.048
Neg logit contributions
 solve-0.041
 herself-0.040
 read-0.037
 write-0.036
 wear-0.035
Token hyd
Token ID7409
Feature activation+1.51e-01

Pos logit contributions
idd+0.144
joy+0.143
etchup+0.134
 ge+0.132
 canoe+0.131
Neg logit contributions
 how-0.095
 about-0.094
 seem-0.091
 solve-0.090
 read-0.089
Token hyd
Token ID7409
Feature activation+1.54e-01

Pos logit contributions
idd+0.167
etchup+0.158
 ge+0.150
rim+0.149
joy+0.148
Neg logit contributions
 about-0.090
 how-0.087
 herself-0.086
 solve-0.085
 read-0.084
Tokenrant
Token ID5250
Feature activation-3.79e-02

Pos logit contributions
idd+0.292
 cho+0.289
etchup+0.276
 hyd+0.270
 canoe+0.261
Neg logit contributions
 herself-0.227
isy-0.204
 solve-0.201
 Whatever-0.201
 Her-0.201
Token.
Token ID13
Feature activation+6.36e-03

Pos logit contributions
 herself+0.043
 about+0.038
 closely+0.035
 solve+0.035
 aloud+0.035
Neg logit contributions
idd-0.092
etchup-0.085
 cho-0.083
enn-0.079
 canoe-0.078
Token The
Token ID383
Feature activation+2.93e-02

Pos logit contributions
 cho+0.014
idd+0.014
 hyd+0.013
etchup+0.013
mer+0.013
Neg logit contributions
 herself-0.008
 Her-0.007
 Whatever-0.007
 solve-0.006
 seem-0.006
Token fire
Token ID2046
Feature activation+7.70e-02

Pos logit contributions
idd+0.054
mer+0.053
etchup+0.052
enn+0.051
boats+0.048
Neg logit contributions
 solve-0.041
 herself-0.040
 read-0.037
 write-0.036
 wear-0.035
Token hyd
Token ID7409
Feature activation+1.51e-01

Pos logit contributions
idd+0.144
joy+0.143
etchup+0.134
 ge+0.132
 canoe+0.131
Neg logit contributions
 how-0.095
 about-0.094
 seem-0.091
 solve-0.090
 read-0.089
Tokenrant
Token ID5250
Feature activation-3.99e-02

Pos logit contributions
idd+0.295
 cho+0.286
etchup+0.279
 hyd+0.266
 canoe+0.266
Neg logit contributions
 herself-0.215
isy-0.191
 Whatever-0.189
 aloud-0.188
 solve-0.188
Token breaks
Token ID9457
Feature activation+4.25e-02

Pos logit contributions
 herself+0.053
 about+0.046
 seem+0.046
 how+0.044
 closely+0.043
Neg logit contributions
idd-0.088
 cho-0.080
etchup-0.079
mer-0.077
 canoe-0.076
Token.
Token ID13
Feature activation+5.40e-03

Pos logit contributions
idd+0.100
etchup+0.097
 canoe+0.093
 ge+0.093
 cho+0.093
Neg logit contributions
 herself-0.048
 about-0.039
 closely-0.035
 how-0.035
 aloud-0.034
Token Water
Token ID5638
Feature activation+3.53e-02

Pos logit contributions
 cho+0.012
idd+0.012
 hyd+0.011
mer+0.011
etchup+0.011
Neg logit contributions
 herself-0.006
 Her-0.006
 Whatever-0.005
 seem-0.005
 solve-0.005
Token spr
Token ID7500
Feature activation+5.66e-02

Pos logit contributions
idd+0.066
 canoe+0.059
mer+0.059
etchup+0.058
boats+0.058
Neg logit contributions
 herself-0.055
 seem-0.050
 about-0.049
 closely-0.048
 how-0.048
Token the
Token ID262
Feature activation-1.78e-03

Pos logit contributions
 herself+0.017
 solve+0.014
 Her+0.013
 about+0.013
 how+0.013
Neg logit contributions
idd-0.029
 cho-0.027
 hyd-0.027
etchup-0.026
mer-0.026
Token car
Token ID1097
Feature activation+2.08e-02

Pos logit contributions
 solve+0.003
 herself+0.002
 read+0.002
 write+0.002
 wear+0.002
Neg logit contributions
idd-0.003
etchup-0.003
mer-0.003
enn-0.003
boats-0.003
Token and
Token ID290
Feature activation-1.08e-02

Pos logit contributions
idd+0.042
 cho+0.042
etchup+0.040
 hyd+0.039
 ge+0.038
Neg logit contributions
 herself-0.029
 solve-0.026
 Her-0.025
isy-0.025
 seem-0.025
Token the
Token ID262
Feature activation+3.37e-02

Pos logit contributions
 herself+0.011
 solve+0.011
 read+0.011
 wear+0.010
 how+0.010
Neg logit contributions
idd-0.028
mer-0.025
 cho-0.025
etchup-0.024
enn-0.024
Token fire
Token ID2046
Feature activation+7.08e-02

Pos logit contributions
idd+0.062
mer+0.058
etchup+0.058
enn+0.056
ang+0.053
Neg logit contributions
 solve-0.050
 herself-0.048
 read-0.045
 write-0.044
 wear-0.043
Token hyd
Token ID7409
Feature activation+1.49e-01

Pos logit contributions
idd+0.166
etchup+0.157
Uh+0.153
 ge+0.153
rim+0.151
Neg logit contributions
 herself-0.067
 about-0.061
 solve-0.059
 how-0.058
 seem-0.058
Tokenrant
Token ID5250
Feature activation-3.90e-02

Pos logit contributions
idd+0.295
 cho+0.292
etchup+0.279
 hyd+0.272
 canoe+0.267
Neg logit contributions
 herself-0.208
isy-0.184
 Whatever-0.183
 Her-0.183
 solve-0.183
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
 herself+0.047
 about+0.039
 seem+0.038
 solve+0.038
 Her+0.037
Neg logit contributions
idd-0.091
 cho-0.086
etchup-0.083
mer-0.081
 canoe-0.080
Token They
Token ID1119
Feature activation+2.48e-02

Pos logit contributions
idd+0.034
 cho+0.034
etchup+0.032
 hyd+0.031
boats+0.031
Neg logit contributions
 herself-0.020
 Her-0.018
 Whatever-0.017
 about-0.016
 solve-0.016
Token take
Token ID1011
Feature activation-5.23e-03

Pos logit contributions
 cho+0.055
idd+0.055
etchup+0.052
 hyd+0.052
mer+0.050
Neg logit contributions
 herself-0.029
 solve-0.025
isy-0.025
 Her-0.025
 Whatever-0.025
Token the
Token ID262
Feature activation-7.03e-03

Pos logit contributions
 herself+0.006
 sure+0.005
 solve+0.005
 Her+0.005
 about+0.005
Neg logit contributions
idd-0.012
 hyd-0.011
etchup-0.011
mer-0.011
 cho-0.011
Token "
Token ID366
Feature activation+3.95e-02

Pos logit contributions
idd+0.091
etchup+0.087
 cho+0.086
 hyd+0.085
mer+0.085
Neg logit contributions
 herself-0.057
 about-0.048
 wear-0.048
 read-0.045
 closely-0.045
Tokench
Token ID354
Feature activation+3.42e-02

Pos logit contributions
 cho+0.068
idd+0.064
 hyd+0.062
etchup+0.061
 canoe+0.061
Neg logit contributions
 herself-0.067
 Whatever-0.061
 read-0.061
 solve-0.059
 seem-0.059
Tokenoo
Token ID2238
Feature activation+6.74e-02

Pos logit contributions
idd+0.056
 hyd+0.054
etchup+0.053
 cho+0.052
 canoe+0.049
Neg logit contributions
 herself-0.061
isy-0.055
 solve-0.054
 seem-0.053
 about-0.053
Token cho
Token ID1727
Feature activation+1.34e-01

Pos logit contributions
idd+0.117
etchup+0.111
 hyd+0.106
boats+0.104
 canoe+0.103
Neg logit contributions
 herself-0.115
isy-0.104
 sure-0.098
 about-0.098
 Her-0.097
Tokeno
Token ID78
Feature activation+5.39e-02

Pos logit contributions
idd+0.193
 cho+0.192
 hyd+0.183
etchup+0.180
 canoe+0.174
Neg logit contributions
 herself-0.253
isy-0.234
 Whatever-0.230
 Her-0.230
 solve-0.228
Token cho
Token ID1727
Feature activation+1.45e-01

Pos logit contributions
idd+0.090
etchup+0.083
 cho+0.080
 hyd+0.080
 canoe+0.078
Neg logit contributions
 herself-0.091
isy-0.085
 solve-0.081
 seem-0.080
 Whatever-0.080
Tokeno
Token ID78
Feature activation+5.78e-02

Pos logit contributions
idd+0.206
 cho+0.204
 hyd+0.198
etchup+0.193
 canoe+0.189
Neg logit contributions
 herself-0.271
isy-0.252
 Her-0.246
 Whatever-0.245
 solve-0.243
Token".
Token ID1911
Feature activation-2.86e-02

Pos logit contributions
idd+0.097
etchup+0.090
 hyd+0.085
 cho+0.084
 canoe+0.084
Neg logit contributions
 herself-0.096
isy-0.091
 seem-0.085
 solve-0.085
 sure-0.084
Token
Token ID198
Feature activation-1.91e-03

Pos logit contributions
 herself+0.031
 Her+0.030
 Whatever+0.026
 aloud+0.024
 seem+0.024
Neg logit contributions
 cho-0.067
idd-0.067
 hyd-0.064
mer-0.064
etchup-0.063
Token
Token ID198
Feature activation-4.68e-03

Pos logit contributions
 herself+0.002
 Her+0.002
isy+0.002
 Whatever+0.002
 solve+0.002
Neg logit contributions
 cho-0.004
idd-0.004
 hyd-0.004
etchup-0.004
mer-0.004
TokenThey
Token ID2990
Feature activation+9.86e-03

Pos logit contributions
 herself+0.007
 Her+0.006
 solve+0.006
 Whatever+0.006
isy+0.006
Neg logit contributions
 cho-0.009
idd-0.008
 hyd-0.008
etchup-0.008
mer-0.008
Token spray
Token ID11662
Feature activation+9.71e-03

Pos logit contributions
 herself+0.009
 seem+0.008
 solve+0.008
 about+0.008
 wear+0.007
Neg logit contributions
idd-0.013
etchup-0.013
mer-0.012
 cho-0.012
enn-0.012
Token from
Token ID422
Feature activation+2.09e-02

Pos logit contributions
idd+0.020
etchup+0.019
enn+0.017
 cho+0.017
joy+0.017
Neg logit contributions
 herself-0.014
 about-0.012
 how-0.012
 Her-0.012
 her-0.012
Token a
Token ID257
Feature activation-8.13e-03

Pos logit contributions
 cho+0.047
idd+0.047
 hyd+0.045
etchup+0.045
mer+0.043
Neg logit contributions
 herself-0.024
isy-0.020
 Her-0.020
 solve-0.020
 Whatever-0.020
Token nearby
Token ID6716
Feature activation+4.08e-02

Pos logit contributions
 solve+0.010
 read+0.009
 math+0.009
 bracelet+0.009
isy+0.009
Neg logit contributions
idd-0.017
etchup-0.016
ucks-0.015
mer-0.015
boats-0.015
Token fire
Token ID2046
Feature activation+6.38e-02

Pos logit contributions
idd+0.074
etchup+0.073
boats+0.066
 cho+0.065
 hyd+0.064
Neg logit contributions
 read-0.064
 solve-0.062
 about-0.058
 herself-0.058
 math-0.056
Token hyd
Token ID7409
Feature activation+1.42e-01

Pos logit contributions
idd+0.150
etchup+0.147
Uh+0.142
 cho+0.140
 ge+0.138
Neg logit contributions
 about-0.060
 read-0.053
 herself-0.052
 how-0.052
 solve-0.049
Tokenrant
Token ID5250
Feature activation-4.02e-02

Pos logit contributions
 cho+0.261
idd+0.253
 hyd+0.247
mer+0.235
etchup+0.233
Neg logit contributions
 herself-0.225
 solve-0.203
 Her-0.201
 Whatever-0.200
isy-0.200
Token to
Token ID284
Feature activation+7.09e-03

Pos logit contributions
 herself+0.048
 about+0.044
 how+0.040
 solve+0.040
 seem+0.039
Neg logit contributions
idd-0.093
etchup-0.087
 cho-0.083
enn-0.080
Uh-0.080
Token keep
Token ID1394
Feature activation-4.59e-03

Pos logit contributions
idd+0.016
 cho+0.016
etchup+0.015
 hyd+0.015
mer+0.015
Neg logit contributions
 herself-0.008
 solve-0.007
 Her-0.007
isy-0.007
 seem-0.007
Token her
Token ID607
Feature activation-5.90e-02

Pos logit contributions
 herself+0.006
 solve+0.005
 seem+0.004
 Her+0.004
 aloud+0.004
Neg logit contributions
idd-0.010
 cho-0.010
etchup-0.010
mer-0.010
 hyd-0.010
Token dry
Token ID5894
Feature activation+2.80e-02

Pos logit contributions
 herself+0.076
 solve+0.066
 about+0.063
 sure+0.062
 seem+0.062
Neg logit contributions
idd-0.135
 cho-0.127
etchup-0.124
iren-0.119
boats-0.116
Token and
Token ID290
Feature activation-2.27e-02

Pos logit contributions
 herself+0.012
 solve+0.009
 read+0.008
 her+0.008
 Her+0.008
Neg logit contributions
idd-0.027
mer-0.025
etchup-0.025
 cho-0.025
boats-0.024
Token the
Token ID262
Feature activation+1.98e-02

Pos logit contributions
 herself+0.027
 read+0.023
 seem+0.022
 about+0.022
 how+0.022
Neg logit contributions
idd-0.053
etchup-0.049
mer-0.048
 hyd-0.047
 cho-0.047
Token lightning
Token ID14357
Feature activation+1.65e-02

Pos logit contributions
idd+0.039
etchup+0.036
enn+0.036
 cho+0.033
mer+0.032
Neg logit contributions
 herself-0.026
 solve-0.026
 about-0.025
isy-0.025
 read-0.024
Token struck
Token ID7425
Feature activation+2.41e-02

Pos logit contributions
idd+0.033
etchup+0.031
 hyd+0.030
 canoe+0.030
 cho+0.029
Neg logit contributions
 herself-0.021
 seem-0.021
isy-0.020
 read-0.020
 about-0.020
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
idd+0.058
etchup+0.056
 cho+0.055
 hyd+0.053
mer+0.053
Neg logit contributions
 herself-0.028
 aloud-0.022
 solve-0.021
 about-0.020
 Her-0.020
Token hyd
Token ID7409
Feature activation+1.37e-01

Pos logit contributions
idd+0.028
etchup+0.028
enn+0.026
 cho+0.025
boats+0.025
Neg logit contributions
 solve-0.020
 herself-0.020
isy-0.019
 read-0.018
 closely-0.018
Tokenrant
Token ID5250
Feature activation-5.65e-02

Pos logit contributions
idd+0.290
 cho+0.279
etchup+0.278
 ge+0.262
 canoe+0.260
Neg logit contributions
 herself-0.169
isy-0.157
 aloud-0.151
 Her-0.150
 seem-0.150
Token.
Token ID13
Feature activation-1.19e-03

Pos logit contributions
 herself+0.067
 about+0.062
 solve+0.060
 seem+0.060
 Her+0.059
Neg logit contributions
idd-0.127
etchup-0.120
 cho-0.114
enn-0.110
boats-0.108
Token Suddenly
Token ID24975
Feature activation-7.46e-03

Pos logit contributions
 herself+0.002
isy+0.002
 solve+0.001
 seem+0.001
 Whatever+0.001
Neg logit contributions
 cho-0.002
idd-0.002
 hyd-0.002
etchup-0.002
 canoe-0.002
Token water
Token ID1660
Feature activation+4.09e-02

Pos logit contributions
 herself+0.009
 about+0.008
 solve+0.008
 seem+0.008
 read+0.008
Neg logit contributions
idd-0.017
mer-0.016
 hyd-0.016
etchup-0.016
rim-0.016
Token was
Token ID373
Feature activation+2.89e-02

Pos logit contributions
idd+0.094
etchup+0.091
enn+0.086
 hyd+0.085
iw+0.082
Neg logit contributions
 about-0.045
 herself-0.043
 how-0.040
 seem-0.039
 read-0.038
Token see
Token ID766
Feature activation-2.60e-02

Pos logit contributions
 cho+0.049
idd+0.049
 hyd+0.046
etchup+0.046
mer+0.045
Neg logit contributions
 herself-0.025
 solve-0.023
 read-0.022
 seem-0.022
 Her-0.021
Token the
Token ID262
Feature activation+2.64e-02

Pos logit contributions
 herself+0.035
 Her+0.030
 Whatever+0.029
 seem+0.029
 solve+0.029
Neg logit contributions
idd-0.055
 cho-0.054
 hyd-0.053
mer-0.051
etchup-0.050
Token corn
Token ID11676
Feature activation+3.42e-02

Pos logit contributions
idd+0.052
 cho+0.048
etchup+0.046
mer+0.045
 hyd+0.044
Neg logit contributions
 herself-0.041
isy-0.037
 closely-0.036
 read-0.035
 solve-0.035
Token and
Token ID290
Feature activation+5.74e-03

Pos logit contributions
idd+0.076
 cho+0.069
crow+0.064
mer+0.063
enn+0.062
Neg logit contributions
 herself-0.048
 closely-0.042
 solve-0.041
 read-0.041
 about-0.040
Token the
Token ID262
Feature activation+5.01e-02

Pos logit contributions
idd+0.014
 cho+0.013
mer+0.013
 hyd+0.012
enn+0.012
Neg logit contributions
 herself-0.007
 read-0.006
 solve-0.006
 how-0.006
 seem-0.005
Token hus
Token ID4791
Feature activation+1.36e-01

Pos logit contributions
idd+0.099
 cho+0.088
etchup+0.085
enn+0.083
mer+0.083
Neg logit contributions
 herself-0.079
isy-0.068
 solve-0.066
 read-0.066
 understand-0.065
Tokenks
Token ID591
Feature activation+2.50e-02

Pos logit contributions
 cho+0.235
idd+0.214
boats+0.209
 canoe+0.205
rim+0.199
Neg logit contributions
 herself-0.214
 read-0.192
 Her-0.186
 aloud-0.181
 seem-0.178
Token on
Token ID319
Feature activation+6.02e-02

Pos logit contributions
idd+0.054
 cho+0.052
etchup+0.051
 hyd+0.051
 canoe+0.049
Neg logit contributions
 herself-0.033
 about-0.029
 closely-0.028
 Her-0.028
 aloud-0.027
Token the
Token ID262
Feature activation+3.80e-02

Pos logit contributions
idd+0.119
 cho+0.117
etchup+0.112
 hyd+0.111
mer+0.108
Neg logit contributions
 herself-0.088
 Her-0.077
 solve-0.076
 Whatever-0.076
isy-0.075
Token ground
Token ID2323
Feature activation+2.78e-02

Pos logit contributions
idd+0.083
etchup+0.074
 cho+0.072
enn+0.068
mer+0.068
Neg logit contributions
 herself-0.058
isy-0.051
 closely-0.046
 solve-0.046
 seem-0.045
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
idd+0.066
 cho+0.064
etchup+0.061
 hyd+0.060
 ge+0.059
Neg logit contributions
 herself-0.031
 about-0.027
 closely-0.026
 aloud-0.025
 Her-0.025
Token was
Token ID373
Feature activation+3.72e-02

Pos logit contributions
idd+0.067
 cho+0.064
etchup+0.063
 hyd+0.060
 ge+0.059
Neg logit contributions
 herself-0.041
 aloud-0.036
 about-0.036
 solve-0.035
 seem-0.035
Token connected
Token ID5884
Feature activation+4.42e-02

Pos logit contributions
idd+0.081
 cho+0.077
etchup+0.076
mer+0.075
 hyd+0.072
Neg logit contributions
 herself-0.050
 about-0.045
 aloud-0.042
 sure-0.041
 read-0.040
Token to
Token ID284
Feature activation+2.83e-02

Pos logit contributions
idd+0.108
etchup+0.103
 cho+0.102
 ge+0.099
 hyd+0.098
Neg logit contributions
 herself-0.050
 about-0.045
 closely-0.041
 aloud-0.041
 how-0.039
Token a
Token ID257
Feature activation-1.05e-02

Pos logit contributions
idd+0.065
 cho+0.063
etchup+0.062
mer+0.059
 hyd+0.059
Neg logit contributions
 herself-0.035
 solve-0.029
 Whatever-0.028
 seem-0.028
 aloud-0.028
Token k
Token ID479
Feature activation+5.38e-02

Pos logit contributions
 solve+0.013
isy+0.012
 read+0.012
 closely+0.011
 seem+0.011
Neg logit contributions
idd-0.024
etchup-0.022
 cho-0.021
 hyd-0.021
boats-0.020
Tokenenn
Token ID1697
Feature activation+1.31e-01

Pos logit contributions
 ge+0.072
 hyd+0.066
 men+0.063
 mir+0.063
 cho+0.062
Neg logit contributions
 solve-0.096
isy-0.094
 herself-0.093
 Whatever-0.091
 read-0.091
Tokenel
Token ID417
Feature activation+5.64e-02

Pos logit contributions
 cho+0.239
 hyd+0.230
 ge+0.209
idd+0.207
etchup+0.205
Neg logit contributions
 herself-0.222
 Her-0.195
isy-0.186
 closely-0.178
 aloud-0.176
Token full
Token ID1336
Feature activation+1.10e-01

Pos logit contributions
 cho+0.115
idd+0.115
etchup+0.111
 hyd+0.110
 canoe+0.106
Neg logit contributions
 herself-0.077
isy-0.069
 Her-0.066
 aloud-0.066
 about-0.064
Token of
Token ID286
Feature activation+4.37e-02

Pos logit contributions
idd+0.262
etchup+0.243
 hyd+0.238
 cho+0.235
enn+0.230
Neg logit contributions
 about-0.139
 herself-0.135
 Her-0.127
isy-0.118
 aloud-0.118
Token fierce
Token ID14800
Feature activation+2.43e-02

Pos logit contributions
idd+0.087
 cho+0.082
 hyd+0.080
mer+0.080
etchup+0.079
Neg logit contributions
 herself-0.066
 solve-0.058
 aloud-0.056
 read-0.056
 about-0.056
Token dogs
Token ID6844
Feature activation+1.84e-02

Pos logit contributions
idd+0.048
etchup+0.046
 hyd+0.045
boats+0.043
 cho+0.043
Neg logit contributions
 herself-0.035
 about-0.032
 seem-0.032
 solve-0.032
 Her-0.031
Token a
Token ID257
Feature activation-2.40e-02

Pos logit contributions
 herself+0.003
 Her+0.002
isy+0.002
 aloud+0.002
 about+0.002
Neg logit contributions
idd-0.004
 cho-0.004
 hyd-0.004
etchup-0.004
mer-0.004
Token pepper
Token ID13385
Feature activation+2.62e-02

Pos logit contributions
 herself+0.033
 Her+0.029
 Whatever+0.029
 solve+0.028
 seem+0.028
Neg logit contributions
 cho-0.049
idd-0.047
etchup-0.046
 hyd-0.046
 canoe-0.044
Token and
Token ID290
Feature activation+1.88e-03

Pos logit contributions
idd+0.059
 cho+0.053
 canoe+0.052
etchup+0.051
Uh+0.050
Neg logit contributions
 herself-0.033
isy-0.031
 about-0.029
 aloud-0.027
 how-0.027
Token now
Token ID783
Feature activation+3.12e-02

Pos logit contributions
idd+0.004
 cho+0.004
 hyd+0.004
etchup+0.004
mer+0.004
Neg logit contributions
 herself-0.003
 read-0.002
isy-0.002
 Her-0.002
 solve-0.002
Token my
Token ID616
Feature activation+1.95e-02

Pos logit contributions
 cho+0.058
idd+0.056
 hyd+0.054
etchup+0.054
 ge+0.052
Neg logit contributions
 herself-0.046
isy-0.043
 Whatever-0.042
 solve-0.042
 Her-0.042
Token tum
Token ID11814
Feature activation+1.27e-01

Pos logit contributions
 cho+0.039
idd+0.039
 hyd+0.037
etchup+0.036
mer+0.035
Neg logit contributions
 herself-0.027
isy-0.024
 solve-0.024
 Her-0.023
 Whatever-0.023
Tokenmy
Token ID1820
Feature activation-1.96e-03

Pos logit contributions
 cho+0.221
idd+0.218
etchup+0.208
 hyd+0.203
boats+0.192
Neg logit contributions
 herself-0.200
 solve-0.196
 Whatever-0.191
 aloud-0.189
isy-0.187
Token hurts
Token ID20406
Feature activation+3.86e-02

Pos logit contributions
 herself+0.003
 Whatever+0.003
 Her+0.003
 solve+0.003
isy+0.003
Neg logit contributions
 cho-0.004
idd-0.003
 hyd-0.003
etchup-0.003
mer-0.003
Token."
Token ID526
Feature activation-3.38e-02

Pos logit contributions
idd+0.085
 cho+0.082
etchup+0.080
 hyd+0.077
 canoe+0.077
Neg logit contributions
 herself-0.051
 Her-0.043
isy-0.042
 solve-0.041
 about-0.041
Token
Token ID220
Feature activation-9.64e-03

Pos logit contributions
 herself+0.039
 Her+0.035
 Whatever+0.033
isy+0.032
 solve+0.032
Neg logit contributions
 cho-0.077
idd-0.076
 hyd-0.073
etchup-0.072
mer-0.071
Token
Token ID198
Feature activation-5.30e-03

Pos logit contributions
isy+0.015
 herself+0.014
 solve+0.014
 read+0.014
 seem+0.014
Neg logit contributions
idd-0.017
 cho-0.017
etchup-0.016
enn-0.015
 hyd-0.015
Token on
Token ID319
Feature activation+2.20e-02

Pos logit contributions
idd+0.122
etchup+0.108
enn+0.105
 cho+0.100
 ge+0.097
Neg logit contributions
 herself-0.080
 seem-0.069
 closely-0.067
isy-0.066
 about-0.064
Token his
Token ID465
Feature activation-1.60e-02

Pos logit contributions
idd+0.053
 cho+0.052
etchup+0.050
 hyd+0.049
mer+0.048
Neg logit contributions
 herself-0.024
 Her-0.020
 solve-0.019
 Whatever-0.019
isy-0.019
Token legs
Token ID7405
Feature activation+3.40e-02

Pos logit contributions
 herself+0.020
isy+0.018
 solve+0.017
 Her+0.017
 seem+0.016
Neg logit contributions
idd-0.035
 cho-0.034
etchup-0.033
 hyd-0.032
mer-0.031
Token and
Token ID290
Feature activation-2.84e-03

Pos logit contributions
idd+0.072
 cho+0.070
etchup+0.068
 hyd+0.066
mer+0.065
Neg logit contributions
 herself-0.048
 seem-0.040
 Her-0.040
isy-0.039
 about-0.039
Token his
Token ID465
Feature activation+2.46e-02

Pos logit contributions
 herself+0.004
 read+0.003
 Her+0.003
 how+0.003
 her+0.003
Neg logit contributions
idd-0.007
 cho-0.006
mer-0.006
etchup-0.006
enn-0.006
Token tum
Token ID11814
Feature activation+1.26e-01

Pos logit contributions
idd+0.053
 cho+0.050
etchup+0.049
 hyd+0.046
mer+0.046
Neg logit contributions
 herself-0.034
isy-0.029
 Her-0.028
 wear-0.027
 solve-0.027
Tokenmy
Token ID1820
Feature activation-1.35e-02

Pos logit contributions
 cho+0.246
idd+0.245
 hyd+0.232
etchup+0.230
mer+0.224
Neg logit contributions
 herself-0.186
isy-0.163
 Her-0.162
 solve-0.159
 Whatever-0.159
Token.
Token ID13
Feature activation-3.07e-04

Pos logit contributions
 herself+0.020
isy+0.017
 Her+0.017
 Whatever+0.017
 solve+0.017
Neg logit contributions
 cho-0.026
idd-0.026
 hyd-0.024
etchup-0.024
mer-0.023
Token He
Token ID679
Feature activation+1.20e-02

Pos logit contributions
 herself+0.000
isy+0.000
 solve+0.000
 Whatever+0.000
 Her+0.000
Neg logit contributions
 cho-0.001
idd-0.001
 hyd-0.001
etchup-0.001
 canoe-0.001
Token felt
Token ID2936
Feature activation-2.22e-02

Pos logit contributions
idd+0.024
 cho+0.024
etchup+0.023
 hyd+0.023
boats+0.022
Neg logit contributions
 herself-0.016
 read-0.014
isy-0.014
 Her-0.014
 solve-0.014
Token cold
Token ID4692
Feature activation+5.21e-02

Pos logit contributions
 herself+0.029
isy+0.025
 Her+0.025
 solve+0.025
 Whatever+0.025
Neg logit contributions
idd-0.047
 cho-0.047
etchup-0.044
 hyd-0.044
mer-0.042
Token hurts
Token ID20406
Feature activation+3.44e-02

Pos logit contributions
 herself+0.033
 seem+0.030
 solve+0.030
isy+0.030
 Her+0.030
Neg logit contributions
idd-0.039
 cho-0.038
etchup-0.036
 hyd-0.036
 canoe-0.035
Token my
Token ID616
Feature activation+9.92e-04

Pos logit contributions
idd+0.079
etchup+0.073
 canoe+0.073
 cho+0.071
mer+0.070
Neg logit contributions
 herself-0.047
 how-0.037
 about-0.037
 her-0.036
 Her-0.035
Token mouth
Token ID5422
Feature activation+4.99e-02

Pos logit contributions
idd+0.002
 cho+0.002
etchup+0.002
 hyd+0.002
boats+0.002
Neg logit contributions
 herself-0.001
isy-0.001
 Her-0.001
 solve-0.001
 seem-0.001
Token and
Token ID290
Feature activation+7.27e-04

Pos logit contributions
idd+0.112
 cho+0.108
etchup+0.103
 canoe+0.100
 ge+0.100
Neg logit contributions
 herself-0.065
 about-0.055
isy-0.054
 seem-0.053
 Her-0.052
Token my
Token ID616
Feature activation+2.85e-02

Pos logit contributions
idd+0.001
 cho+0.001
mer+0.001
etchup+0.001
 canoe+0.001
Neg logit contributions
 herself-0.001
 seem-0.001
 read-0.001
 Her-0.001
 solve-0.001
Token tum
Token ID11814
Feature activation+1.26e-01

Pos logit contributions
idd+0.055
 cho+0.053
 hyd+0.049
etchup+0.048
boats+0.048
Neg logit contributions
 herself-0.044
isy-0.039
 Her-0.038
 solve-0.038
 seem-0.037
Tokenmy
Token ID1820
Feature activation-3.07e-03

Pos logit contributions
 cho+0.240
idd+0.237
etchup+0.225
 hyd+0.224
mer+0.214
Neg logit contributions
 herself-0.186
 solve-0.169
isy-0.168
 Her-0.167
 Whatever-0.167
Token."
Token ID526
Feature activation-2.55e-02

Pos logit contributions
 herself+0.005
isy+0.004
 Her+0.004
 solve+0.004
 seem+0.004
Neg logit contributions
idd-0.006
 cho-0.006
 hyd-0.005
etchup-0.005
 canoe-0.005
Token Sam
Token ID3409
Feature activation-6.98e-04

Pos logit contributions
 herself+0.029
 Her+0.026
 Whatever+0.024
 solve+0.023
 aloud+0.023
Neg logit contributions
 cho-0.059
idd-0.058
 hyd-0.056
etchup-0.055
mer-0.054
Token was
Token ID373
Feature activation+1.47e-02

Pos logit contributions
 herself+0.001
 solve+0.001
 Whatever+0.001
isy+0.001
 Her+0.001
Neg logit contributions
 cho-0.001
idd-0.001
 hyd-0.001
etchup-0.001
mer-0.001
Token scared
Token ID12008
Feature activation+2.13e-02

Pos logit contributions
 cho+0.026
idd+0.026
 hyd+0.024
etchup+0.024
 canoe+0.023
Neg logit contributions
 herself-0.023
isy-0.022
 Whatever-0.022
 solve-0.022
 Her-0.021
Token,
Token ID11
Feature activation+3.60e-02

Pos logit contributions
 hyd+0.216
boats+0.216
idd+0.210
 canoe+0.205
mer+0.204
Neg logit contributions
 herself-0.155
 about-0.146
 read-0.142
isy-0.142
 solve-0.142
Token oh
Token ID11752
Feature activation+1.03e-02

Pos logit contributions
 cho+0.071
idd+0.071
 hyd+0.067
etchup+0.066
mer+0.065
Neg logit contributions
 herself-0.052
isy-0.046
 solve-0.046
 Whatever-0.046
 Her-0.046
Token no
Token ID645
Feature activation+1.22e-01

Pos logit contributions
 hyd+0.018
idd+0.018
etchup+0.018
mer+0.018
boats+0.018
Neg logit contributions
 herself-0.016
 seem-0.015
 aloud-0.014
 about-0.014
 solve-0.014
Token,
Token ID11
Feature activation+4.40e-02

Pos logit contributions
idd+0.234
 hyd+0.227
boats+0.224
 canoe+0.223
etchup+0.221
Neg logit contributions
 herself-0.178
isy-0.167
 solve-0.167
 seem-0.165
 about-0.163
Token oh
Token ID11752
Feature activation+1.48e-02

Pos logit contributions
idd+0.086
 cho+0.085
 hyd+0.080
etchup+0.080
mer+0.079
Neg logit contributions
 herself-0.065
isy-0.057
 Whatever-0.057
 solve-0.056
 seem-0.056
Token no
Token ID645
Feature activation+1.26e-01

Pos logit contributions
 hyd+0.025
idd+0.025
mer+0.024
etchup+0.024
 cho+0.024
Neg logit contributions
 herself-0.024
 seem-0.022
 about-0.022
 aloud-0.022
 solve-0.022
Token!"
Token ID2474
Feature activation-2.85e-02

Pos logit contributions
idd+0.240
 hyd+0.228
boats+0.226
 canoe+0.226
etchup+0.226
Neg logit contributions
 herself-0.187
 solve-0.176
isy-0.174
 seem-0.173
 about-0.171
Token Mom
Token ID11254
Feature activation+6.01e-02

Pos logit contributions
 herself+0.039
 Her+0.034
isy+0.034
 solve+0.033
 Whatever+0.033
Neg logit contributions
 cho-0.059
idd-0.058
 hyd-0.056
etchup-0.055
mer-0.053
Token said
Token ID531
Feature activation+5.00e-02

Pos logit contributions
 cho+0.118
idd+0.116
 hyd+0.115
 canoe+0.113
boats+0.111
Neg logit contributions
 herself-0.090
 Her-0.082
 read-0.080
isy-0.079
 about-0.077
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
idd+0.111
 cho+0.110
 hyd+0.104
etchup+0.104
 canoe+0.102
Neg logit contributions
 herself-0.065
 Her-0.055
 aloud-0.054
 seem-0.052
 about-0.052
Token
Token ID198
Feature activation+9.31e-03

Pos logit contributions
idd+0.030
 cho+0.030
 hyd+0.028
etchup+0.028
boats+0.028
Neg logit contributions
 herself-0.014
 Her-0.012
 aloud-0.011
 Whatever-0.011
 seem-0.011
Token,
Token ID11
Feature activation+3.89e-02

Pos logit contributions
 hyd+0.234
boats+0.227
idd+0.222
mer+0.216
boat+0.216
Neg logit contributions
 herself-0.137
 about-0.127
 solve-0.123
isy-0.121
 seem-0.121
Token oh
Token ID11752
Feature activation+1.07e-02

Pos logit contributions
idd+0.081
 cho+0.078
mer+0.077
etchup+0.075
 hyd+0.075
Neg logit contributions
 herself-0.055
 seem-0.049
 sure-0.047
 how-0.046
 Whatever-0.045
Token no
Token ID645
Feature activation+1.22e-01

Pos logit contributions
 hyd+0.020
idd+0.020
etchup+0.020
mer+0.020
 cho+0.020
Neg logit contributions
 herself-0.015
 seem-0.014
 about-0.014
 aloud-0.013
 solve-0.013
Token,
Token ID11
Feature activation+4.75e-02

Pos logit contributions
idd+0.244
 hyd+0.242
etchup+0.234
boats+0.230
 canoe+0.229
Neg logit contributions
 herself-0.156
 seem-0.154
isy-0.151
 solve-0.149
 about-0.144
Token oh
Token ID11752
Feature activation+1.57e-02

Pos logit contributions
idd+0.101
mer+0.096
enn+0.094
 cho+0.093
etchup+0.093
Neg logit contributions
 herself-0.065
 seem-0.059
 sure-0.057
 how-0.057
 Whatever-0.053
Token no
Token ID645
Feature activation+1.26e-01

Pos logit contributions
 hyd+0.029
idd+0.028
etchup+0.028
mer+0.028
 cho+0.027
Neg logit contributions
 herself-0.023
 seem-0.022
 about-0.021
 aloud-0.021
 how-0.021
Token!"
Token ID2474
Feature activation-2.51e-02

Pos logit contributions
idd+0.253
 hyd+0.245
etchup+0.241
 canoe+0.234
boats+0.234
Neg logit contributions
 herself-0.165
 seem-0.161
isy-0.159
 solve-0.157
 about-0.151
Token Mom
Token ID11254
Feature activation+6.00e-02

Pos logit contributions
 herself+0.032
 Her+0.031
 about+0.026
 seem+0.025
 her+0.025
Neg logit contributions
idd-0.060
mer-0.057
 cho-0.057
boats-0.056
 hyd-0.055
Token said
Token ID531
Feature activation+5.32e-02

Pos logit contributions
 cho+0.130
idd+0.128
 hyd+0.127
mer+0.125
 canoe+0.124
Neg logit contributions
 herself-0.077
 Her-0.070
 about-0.070
 read-0.069
 sure-0.065
Token.
Token ID13
Feature activation+2.01e-02

Pos logit contributions
idd+0.126
 cho+0.124
etchup+0.119
 hyd+0.119
 canoe+0.117
Neg logit contributions
 herself-0.062
 Her-0.052
 aloud-0.051
 about-0.050
 seem-0.049
Token
Token ID198
Feature activation+1.33e-02

Pos logit contributions
idd+0.057
boats+0.056
 cho+0.056
etchup+0.055
rim+0.054
Neg logit contributions
 Her-0.016
 herself-0.015
Her-0.011
 about-0.010
 her-0.009
Token sick
Token ID6639
Feature activation+5.36e-02

Pos logit contributions
 herself+0.036
isy+0.034
 solve+0.033
 Her+0.033
 Whatever+0.033
Neg logit contributions
 cho-0.050
idd-0.049
 hyd-0.047
etchup-0.046
 canoe-0.044
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
idd+0.139
 cho+0.130
etchup+0.129
 canoe+0.126
 ge+0.123
Neg logit contributions
 herself-0.062
 about-0.049
 seem-0.040
isy-0.039
 read-0.039
Token He
Token ID679
Feature activation+1.11e-02

Pos logit contributions
idd+0.020
 cho+0.019
etchup+0.019
 hyd+0.019
 canoe+0.017
Neg logit contributions
isy-0.024
 herself-0.022
 closely-0.021
 solve-0.021
icity-0.020
Token felt
Token ID2936
Feature activation-2.64e-02

Pos logit contributions
 cho+0.021
idd+0.020
 hyd+0.019
etchup+0.019
mer+0.018
Neg logit contributions
 herself-0.017
isy-0.015
 solve-0.015
 Whatever-0.015
 Her-0.015
Token his
Token ID465
Feature activation+3.02e-02

Pos logit contributions
 herself+0.039
isy+0.034
 Her+0.034
 solve+0.034
 Whatever+0.034
Neg logit contributions
idd-0.051
 cho-0.051
etchup-0.048
 hyd-0.048
mer-0.046
Token tum
Token ID11814
Feature activation+1.26e-01

Pos logit contributions
 cho+0.051
idd+0.051
 hyd+0.048
etchup+0.048
mer+0.046
Neg logit contributions
 herself-0.051
isy-0.046
 Her-0.046
 solve-0.046
 Whatever-0.046
Tokenmy
Token ID1820
Feature activation-3.70e-03

Pos logit contributions
 cho+0.217
idd+0.211
etchup+0.203
 hyd+0.200
boats+0.187
Neg logit contributions
 herself-0.196
 solve-0.193
 Whatever-0.191
 aloud-0.188
 Her-0.187
Token ac
Token ID936
Feature activation+2.88e-02

Pos logit contributions
 herself+0.006
 Her+0.005
isy+0.005
 Whatever+0.005
 solve+0.005
Neg logit contributions
 cho-0.007
idd-0.007
 hyd-0.006
etchup-0.006
mer-0.006
Tokenhe
Token ID258
Feature activation+5.13e-03

Pos logit contributions
 cho+0.068
idd+0.068
 hyd+0.064
 canoe+0.064
 ge+0.064
Neg logit contributions
 herself-0.030
isy-0.026
 solve-0.024
 seem-0.023
 read-0.022
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
 cho+0.010
idd+0.010
 hyd+0.009
etchup+0.009
mer+0.009
Neg logit contributions
 herself-0.007
isy-0.006
 Her-0.006
 solve-0.006
 Whatever-0.006
Token He
Token ID679
Feature activation+1.70e-02

Pos logit contributions
idd+0.025
 cho+0.024
etchup+0.023
 hyd+0.023
 canoe+0.022
Neg logit contributions
isy-0.030
 herself-0.027
 closely-0.027
 solve-0.027
 read-0.025
Token of
Token ID286
Feature activation+6.71e-02

Pos logit contributions
idd+0.019
 cho+0.019
etchup+0.018
 hyd+0.018
 canoe+0.017
Neg logit contributions
 herself-0.015
isy-0.014
 seem-0.013
 Her-0.013
 solve-0.013
Token the
Token ID262
Feature activation+3.53e-02

Pos logit contributions
idd+0.147
 cho+0.139
mer+0.136
enn+0.132
etchup+0.132
Neg logit contributions
 herself-0.093
 about-0.076
 Her-0.075
 solve-0.074
 read-0.074
Token cake
Token ID12187
Feature activation+2.30e-02

Pos logit contributions
idd+0.073
 cho+0.072
 canoe+0.067
etchup+0.067
mer+0.066
Neg logit contributions
 herself-0.047
isy-0.044
 solve-0.041
 read-0.041
 seem-0.041
Token.
Token ID13
Feature activation+4.56e-03

Pos logit contributions
idd+0.047
 cho+0.047
etchup+0.044
 hyd+0.043
 canoe+0.042
Neg logit contributions
 herself-0.032
isy-0.028
 solve-0.028
 Her-0.028
 seem-0.027
Token His
Token ID2399
Feature activation-1.10e-02

Pos logit contributions
 cho+0.010
idd+0.010
 hyd+0.009
etchup+0.009
mer+0.009
Neg logit contributions
 herself-0.006
isy-0.005
 Her-0.005
 solve-0.005
 Whatever-0.005
Token tum
Token ID11814
Feature activation+1.25e-01

Pos logit contributions
 herself+0.014
 Her+0.013
isy+0.013
 solve+0.013
 read+0.012
Neg logit contributions
 cho-0.023
idd-0.023
etchup-0.022
 hyd-0.022
mer-0.021
Tokenmy
Token ID1820
Feature activation-8.67e-03

Pos logit contributions
 cho+0.232
idd+0.230
etchup+0.217
 hyd+0.217
mer+0.207
Neg logit contributions
 herself-0.192
isy-0.173
 solve-0.173
 Her-0.172
 Whatever-0.172
Token r
Token ID374
Feature activation+4.92e-02

Pos logit contributions
 herself+0.013
isy+0.012
 Her+0.012
 solve+0.012
 Whatever+0.012
Neg logit contributions
 cho-0.016
idd-0.016
 hyd-0.015
etchup-0.015
mer-0.015
Tokenumbled
Token ID11137
Feature activation+4.48e-02

Pos logit contributions
 cho+0.071
idd+0.065
 hyd+0.065
 canoe+0.064
etchup+0.063
Neg logit contributions
 herself-0.100
 Her-0.087
isy-0.086
 about-0.085
aces-0.084
Token.
Token ID13
Feature activation+1.69e-03

Pos logit contributions
idd+0.100
etchup+0.094
 cho+0.090
 ge+0.088
mer+0.088
Neg logit contributions
 herself-0.063
 about-0.054
 aloud-0.053
 Her-0.052
 closely-0.052
Token He
Token ID679
Feature activation+5.81e-03

Pos logit contributions
 cho+0.003
idd+0.003
etchup+0.003
 hyd+0.003
 canoe+0.003
Neg logit contributions
 herself-0.002
isy-0.002
 solve-0.002
 seem-0.002
 Whatever-0.002
Token a
Token ID257
Feature activation+3.59e-02

Pos logit contributions
 herself+0.019
isy+0.016
 solve+0.016
 Her+0.016
 Whatever+0.016
Neg logit contributions
idd-0.027
 cho-0.026
 hyd-0.025
etchup-0.025
mer-0.024
Token bad
Token ID2089
Feature activation+1.01e-01

Pos logit contributions
idd+0.080
 cho+0.077
etchup+0.076
 hyd+0.075
mer+0.073
Neg logit contributions
isy-0.038
 herself-0.038
 read-0.038
 solve-0.038
 Whatever-0.035
Token cough
Token ID22094
Feature activation+7.26e-02

Pos logit contributions
idd+0.206
etchup+0.199
 cho+0.190
mer+0.186
 hyd+0.185
Neg logit contributions
 herself-0.142
isy-0.133
 about-0.128
 read-0.128
 solve-0.127
Token and
Token ID290
Feature activation+1.24e-02

Pos logit contributions
idd+0.162
etchup+0.152
 cho+0.147
 canoe+0.143
 hyd+0.142
Neg logit contributions
 herself-0.094
 about-0.083
 seem-0.083
isy-0.080
 Her-0.078
Token a
Token ID257
Feature activation+2.94e-02

Pos logit contributions
idd+0.026
 cho+0.025
mer+0.024
etchup+0.024
boats+0.023
Neg logit contributions
 herself-0.018
 read-0.016
 Her-0.015
 seem-0.015
 solve-0.015
Token hot
Token ID3024
Feature activation+1.25e-01

Pos logit contributions
idd+0.066
etchup+0.060
mer+0.059
 cho+0.059
boats+0.058
Neg logit contributions
isy-0.033
 herself-0.032
 read-0.032
 solve-0.031
 Her-0.031
Token head
Token ID1182
Feature activation+8.18e-02

Pos logit contributions
idd+0.267
boats+0.236
etchup+0.235
 cho+0.235
mer+0.232
Neg logit contributions
 herself-0.165
isy-0.155
 read-0.149
 wear-0.146
 about-0.145
Token.
Token ID13
Feature activation+9.26e-03

Pos logit contributions
idd+0.191
etchup+0.183
 cho+0.181
 canoe+0.176
 hyd+0.174
Neg logit contributions
 herself-0.090
 about-0.075
 seem-0.073
 Her-0.073
isy-0.073
Token She
Token ID1375
Feature activation+2.80e-03

Pos logit contributions
 cho+0.020
idd+0.020
 hyd+0.019
etchup+0.019
mer+0.018
Neg logit contributions
 herself-0.011
isy-0.010
 solve-0.010
 Her-0.010
 Whatever-0.010
Token did
Token ID750
Feature activation-6.87e-03

Pos logit contributions
 cho+0.005
idd+0.005
 hyd+0.005
etchup+0.005
 canoe+0.005
Neg logit contributions
 herself-0.004
isy-0.004
 Whatever-0.004
 solve-0.004
 Her-0.004
Token not
Token ID407
Feature activation+2.23e-02

Pos logit contributions
isy+0.008
 Whatever+0.008
ust+0.008
aces+0.008
 aloud+0.007
Neg logit contributions
 cho-0.014
 hyd-0.013
etchup-0.013
 ge-0.012
 canoe-0.012
Token "
Token ID366
Feature activation+4.56e-02

Pos logit contributions
idd+0.040
 cho+0.039
etchup+0.038
 hyd+0.038
mer+0.037
Neg logit contributions
 herself-0.024
 Her-0.021
isy-0.020
 Whatever-0.020
 aloud-0.020
TokenO
Token ID46
Feature activation+4.27e-03

Pos logit contributions
 cho+0.083
idd+0.081
 hyd+0.076
etchup+0.075
 canoe+0.074
Neg logit contributions
 herself-0.077
 Her-0.068
isy-0.067
Her-0.066
 Whatever-0.066
Tokenuch
Token ID794
Feature activation+4.07e-02

Pos logit contributions
 cho+0.005
idd+0.005
 hyd+0.005
 canoe+0.005
 ge+0.005
Neg logit contributions
 herself-0.009
isy-0.009
 solve-0.008
 seem-0.008
 Her-0.008
Token,
Token ID11
Feature activation+3.67e-02

Pos logit contributions
idd+0.077
etchup+0.072
boats+0.071
 canoe+0.071
 hyd+0.070
Neg logit contributions
 herself-0.062
isy-0.060
 about-0.053
 sure-0.053
 seem-0.052
Token my
Token ID616
Feature activation+2.31e-02

Pos logit contributions
idd+0.077
 cho+0.072
mer+0.072
etchup+0.071
 hyd+0.069
Neg logit contributions
 herself-0.052
isy-0.045
 seem-0.044
 Her-0.044
 Whatever-0.043
Token tum
Token ID11814
Feature activation+1.25e-01

Pos logit contributions
idd+0.044
 cho+0.043
 hyd+0.041
etchup+0.041
boats+0.041
Neg logit contributions
 herself-0.034
isy-0.031
 Her-0.030
 Whatever-0.029
 seem-0.029
Tokenmy
Token ID1820
Feature activation-8.44e-03

Pos logit contributions
idd+0.265
 cho+0.264
 hyd+0.254
 canoe+0.250
mer+0.248
Neg logit contributions
 herself-0.165
isy-0.141
 Her-0.138
 seem-0.133
 Whatever-0.131
Token hurts
Token ID20406
Feature activation+3.81e-02

Pos logit contributions
 herself+0.013
isy+0.012
 solve+0.012
 seem+0.012
 Her+0.011
Neg logit contributions
idd-0.016
 cho-0.015
 hyd-0.014
etchup-0.014
 canoe-0.014
Token!"
Token ID2474
Feature activation-2.57e-02

Pos logit contributions
idd+0.080
etchup+0.076
 canoe+0.073
boats+0.072
mer+0.072
Neg logit contributions
 herself-0.061
 her-0.049
 how-0.047
 about-0.047
isy-0.047
Token She
Token ID1375
Feature activation-3.37e-03

Pos logit contributions
 herself+0.030
 Her+0.026
 Whatever+0.024
 aloud+0.024
isy+0.024
Neg logit contributions
 cho-0.059
idd-0.059
 hyd-0.056
etchup-0.056
mer-0.056
Token repeats
Token ID29819
Feature activation-1.42e-02

Pos logit contributions
 herself+0.006
isy+0.005
 Whatever+0.005
 solve+0.005
 aloud+0.005
Neg logit contributions
 cho-0.006
idd-0.006
etchup-0.005
 hyd-0.005
 ge-0.005
Token ate
Token ID15063
Feature activation-8.01e-03

Pos logit contributions
idd+0.036
 cho+0.036
 hyd+0.034
etchup+0.034
mer+0.033
Neg logit contributions
 herself-0.023
isy-0.021
 solve-0.021
 read-0.020
 Her-0.020
Token them
Token ID606
Feature activation-3.24e-02

Pos logit contributions
 herself+0.010
isy+0.009
 solve+0.008
 Her+0.008
 Whatever+0.008
Neg logit contributions
idd-0.017
 cho-0.017
 hyd-0.016
etchup-0.016
mer-0.016
Token up
Token ID510
Feature activation-3.62e-02

Pos logit contributions
 herself+0.038
isy+0.033
 Her+0.033
 Whatever+0.033
 solve+0.033
Neg logit contributions
 cho-0.071
idd-0.070
 hyd-0.067
etchup-0.067
mer-0.065
Token and
Token ID290
Feature activation+1.08e-02

Pos logit contributions
 herself+0.048
 aloud+0.039
 closely+0.039
 about+0.039
 solve+0.037
Neg logit contributions
idd-0.082
 cho-0.080
etchup-0.075
crow-0.073
 hyd-0.072
Token his
Token ID465
Feature activation+2.21e-02

Pos logit contributions
idd+0.023
 cho+0.022
etchup+0.021
mer+0.021
 canoe+0.021
Neg logit contributions
 herself-0.015
 read-0.013
 solve-0.012
 aloud-0.012
isy-0.012
Token tum
Token ID11814
Feature activation+1.25e-01

Pos logit contributions
 cho+0.040
idd+0.039
 hyd+0.037
etchup+0.037
 canoe+0.035
Neg logit contributions
 herself-0.035
 Her-0.032
 Whatever-0.032
 solve-0.031
isy-0.031
Tokenmy
Token ID1820
Feature activation-4.00e-03

Pos logit contributions
 cho+0.234
idd+0.232
etchup+0.219
 hyd+0.218
mer+0.209
Neg logit contributions
 herself-0.189
 solve-0.170
isy-0.170
 Whatever-0.169
 Her-0.169
Token was
Token ID373
Feature activation+3.82e-02

Pos logit contributions
 herself+0.006
 Her+0.006
 Whatever+0.006
 solve+0.005
isy+0.005
Neg logit contributions
 cho-0.007
idd-0.007
 hyd-0.007
etchup-0.007
mer-0.007
Token full
Token ID1336
Feature activation+9.10e-02

Pos logit contributions
idd+0.081
 cho+0.080
etchup+0.076
 hyd+0.076
mer+0.074
Neg logit contributions
 herself-0.050
isy-0.043
 Her-0.042
 solve-0.042
 Whatever-0.041
Token.
Token ID13
Feature activation-5.56e-03

Pos logit contributions
idd+0.217
 cho+0.202
etchup+0.200
 hyd+0.193
 canoe+0.191
Neg logit contributions
 herself-0.110
 about-0.109
isy-0.097
 Her-0.094
 aloud-0.092
Token
Token ID198
Feature activation+2.29e-03

Pos logit contributions
 herself+0.008
isy+0.008
 solve+0.007
 seem+0.007
 read+0.007
Neg logit contributions
idd-0.011
 cho-0.011
 hyd-0.010
etchup-0.010
 canoe-0.010
Token who
Token ID508
Feature activation-2.95e-02

Pos logit contributions
 herself+0.020
 Her+0.019
 read+0.019
 solve+0.019
isy+0.019
Neg logit contributions
idd-0.019
 cho-0.019
boats-0.018
 canoe-0.017
etchup-0.017
Token liked
Token ID8288
Feature activation-4.85e-02

Pos logit contributions
 herself+0.041
isy+0.040
 aloud+0.039
 closely+0.038
 Whatever+0.037
Neg logit contributions
 hyd-0.058
 cho-0.057
etchup-0.054
mer-0.053
idd-0.053
Token to
Token ID284
Feature activation-3.02e-02

Pos logit contributions
 herself+0.082
isy+0.075
 Whatever+0.074
 solve+0.073
 Her+0.073
Neg logit contributions
 cho-0.083
idd-0.082
 hyd-0.077
etchup-0.077
 ge-0.072
Token collect
Token ID2824
Feature activation-8.59e-02

Pos logit contributions
ust+0.045
isy+0.041
 closely+0.040
 herself+0.040
 Whatever+0.040
Neg logit contributions
 hyd-0.051
 cho-0.050
 canoe-0.049
 mill-0.048
 ge-0.048
Token or
Token ID393
Feature activation-3.84e-02

Pos logit contributions
 herself+0.128
isy+0.114
 Her+0.113
 solve+0.113
 Whatever+0.112
Neg logit contributions
 cho-0.164
idd-0.163
 hyd-0.154
etchup-0.153
mer-0.147
Tokennam
Token ID7402
Feature activation-1.54e-01

Pos logit contributions
isy+0.039
 Whatever+0.036
 herself+0.035
 Her+0.034
 aloud+0.033
Neg logit contributions
 cho-0.086
 hyd-0.084
 ge-0.083
 canoe-0.081
idd-0.080
Tokenents
Token ID658
Feature activation-4.48e-02

Pos logit contributions
 herself+0.211
 Whatever+0.199
isy+0.198
 aloud+0.193
 seem+0.192
Neg logit contributions
idd-0.306
 cho-0.301
etchup-0.285
 hyd-0.269
boats-0.265
Token.
Token ID13
Feature activation-2.48e-02

Pos logit contributions
 herself+0.064
isy+0.057
 Her+0.056
 solve+0.056
 Whatever+0.056
Neg logit contributions
 cho-0.088
idd-0.087
 hyd-0.083
etchup-0.083
mer-0.079
Token Everywhere
Token ID48201
Feature activation-7.91e-02

Pos logit contributions
isy+0.045
 sure+0.039
ust+0.038
zed+0.038
ieve+0.038
Neg logit contributions
 cho-0.036
crow-0.034
 hyd-0.034
 mill-0.033
etchup-0.032
Token she
Token ID673
Feature activation+7.13e-03

Pos logit contributions
 herself+0.065
 Her+0.055
 read+0.050
 seem+0.048
 solve+0.048
Neg logit contributions
idd-0.212
 cho-0.204
 hyd-0.202
 canoe-0.199
etchup-0.196
Token went
Token ID1816
Feature activation+7.42e-03

Pos logit contributions
idd+0.011
 cho+0.011
 hyd+0.010
mer+0.010
etchup+0.010
Neg logit contributions
 herself-0.013
isy-0.012
 Whatever-0.012
 Her-0.012
 aloud-0.012
Token for
Token ID329
Feature activation+4.46e-02

Pos logit contributions
idd+0.064
 cho+0.063
 hyd+0.062
etchup+0.061
mer+0.060
Neg logit contributions
 herself-0.038
 Her-0.032
isy-0.032
 solve-0.032
 Whatever-0.031
Token me
Token ID502
Feature activation-3.08e-02

Pos logit contributions
idd+0.085
 cho+0.085
 hyd+0.083
etchup+0.081
mer+0.080
Neg logit contributions
 herself-0.068
 solve-0.058
 Her-0.057
isy-0.057
 seem-0.056
Token!
Token ID0
Feature activation+6.25e-03

Pos logit contributions
 herself+0.051
 Her+0.046
 about+0.046
 solve+0.046
isy+0.045
Neg logit contributions
 cho-0.054
idd-0.052
 hyd-0.051
etchup-0.049
mer-0.048
Token Aren
Token ID9843
Feature activation-7.33e-02

Pos logit contributions
 cho+0.013
idd+0.013
 hyd+0.012
mer+0.012
etchup+0.012
Neg logit contributions
 herself-0.008
 Her-0.008
 Whatever-0.008
 seem-0.007
 about-0.007
Tokenâ
Token ID22940
Feature activation-9.68e-03

Pos logit contributions
 aloud+0.115
 closely+0.106
isy+0.106
 herself+0.103
 meanings+0.102
Neg logit contributions
 cho-0.128
idd-0.126
etchup-0.115
iren-0.109
crow-0.101
Token
Token ID26391
Feature activation-1.49e-01

Pos logit contributions
 herself+0.012
isy+0.011
 solve+0.011
 seem+0.010
 Whatever+0.010
Neg logit contributions
 cho-0.021
idd-0.021
etchup-0.020
 hyd-0.020
 canoe-0.019
Token
Token ID8151
Feature activation-7.71e-02

Pos logit contributions
 herself+0.165
 Her+0.154
 read+0.153
 Whatever+0.151
 seem+0.149
Neg logit contributions
 cho-0.325
idd-0.325
 hyd-0.314
mer-0.307
etchup-0.305
Tokent
Token ID83
Feature activation-2.74e-02

Pos logit contributions
 herself+0.106
 Her+0.105
 seem+0.104
 about+0.100
isy+0.097
Neg logit contributions
idd-0.146
 hyd-0.145
 cho-0.143
 canoe-0.141
etchup-0.132
Token we
Token ID356
Feature activation+7.86e-04

Pos logit contributions
 herself+0.037
 about+0.032
 seem+0.032
 wear+0.031
 read+0.031
Neg logit contributions
idd-0.058
 cho-0.056
etchup-0.054
 hyd-0.054
mer-0.054
Token both
Token ID1111
Feature activation-2.36e-02

Pos logit contributions
idd+0.002
 cho+0.002
etchup+0.002
 hyd+0.002
mer+0.002
Neg logit contributions
 herself-0.001
 solve-0.001
 Her-0.001
 seem-0.001
isy-0.001
Token so
Token ID523
Feature activation-5.71e-02

Pos logit contributions
 herself+0.025
 seem+0.021
 solve+0.021
 about+0.021
 read+0.020
Neg logit contributions
idd-0.057
 cho-0.056
etchup-0.054
 hyd-0.053
mer-0.052
Token to
Token ID284
Feature activation+9.96e-03

Pos logit contributions
 herself+0.082
 Her+0.071
 about+0.070
 read+0.069
isy+0.069
Neg logit contributions
idd-0.121
 cho-0.119
etchup-0.112
 hyd-0.112
mer-0.109
Token help
Token ID1037
Feature activation+1.25e-02

Pos logit contributions
 cho+0.021
idd+0.020
 hyd+0.020
etchup+0.019
 canoe+0.019
Neg logit contributions
 herself-0.012
isy-0.012
 Whatever-0.011
 aloud-0.011
 Her-0.011
Token sort
Token ID3297
Feature activation-5.34e-02

Pos logit contributions
 cho+0.028
idd+0.027
 hyd+0.026
etchup+0.026
mer+0.025
Neg logit contributions
 herself-0.015
isy-0.013
 Her-0.013
 solve-0.013
 Whatever-0.013
Token the
Token ID262
Feature activation-2.32e-02

Pos logit contributions
 herself+0.051
 Her+0.033
 her+0.033
 about+0.032
 solve+0.031
Neg logit contributions
idd-0.147
 cho-0.144
 hyd-0.140
etchup-0.136
mer-0.135
Token or
Token ID393
Feature activation-2.22e-02

Pos logit contributions
 herself+0.031
 solve+0.028
 Her+0.026
 closely+0.026
 read+0.026
Neg logit contributions
 cho-0.050
idd-0.048
etchup-0.046
 hyd-0.046
mer-0.044
Tokennam
Token ID7402
Feature activation-1.47e-01

Pos logit contributions
 herself+0.022
isy+0.020
 Her+0.019
 Whatever+0.019
 aloud+0.018
Neg logit contributions
 cho-0.052
idd-0.051
 hyd-0.050
etchup-0.049
mer-0.048
Tokenents
Token ID658
Feature activation-3.76e-02

Pos logit contributions
 herself+0.191
isy+0.177
 Whatever+0.175
 seem+0.171
 solve+0.170
Neg logit contributions
idd-0.303
 cho-0.296
etchup-0.282
 hyd-0.275
mer-0.271
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
 herself+0.053
 solve+0.045
 seem+0.045
isy+0.045
 Her+0.044
Neg logit contributions
idd-0.078
 cho-0.077
etchup-0.073
 hyd-0.072
 canoe-0.070
Token She
Token ID1375
Feature activation-1.63e-02

Pos logit contributions
isy+0.021
 herself+0.020
 solve+0.019
 seem+0.018
 read+0.018
Neg logit contributions
 cho-0.035
idd-0.035
etchup-0.033
 hyd-0.032
mer-0.031
Token picked
Token ID6497
Feature activation-5.86e-02

Pos logit contributions
 herself+0.022
isy+0.020
 Her+0.019
 solve+0.019
 Whatever+0.019
Neg logit contributions
 cho-0.033
idd-0.033
 hyd-0.031
etchup-0.031
mer-0.030
Token up
Token ID510
Feature activation-7.10e-02

Pos logit contributions
isy+0.087
 solve+0.085
 herself+0.084
 Whatever+0.083
 Her+0.082
Neg logit contributions
 cho-0.106
idd-0.100
etchup-0.095
 ge-0.093
 hyd-0.091
Token at
Token ID379
Feature activation+3.24e-02

Pos logit contributions
 herself+0.098
isy+0.094
 seem+0.093
 Her+0.092
 Whatever+0.091
Neg logit contributions
 cho-0.083
idd-0.077
 hyd-0.075
etchup-0.073
mer-0.072
Token the
Token ID262
Feature activation+6.02e-03

Pos logit contributions
 cho+0.058
idd+0.055
etchup+0.053
 mill+0.051
 ge+0.051
Neg logit contributions
isy-0.047
thy-0.044
ust-0.044
 solve-0.043
 seem-0.043
Token calendar
Token ID11845
Feature activation-1.51e-02

Pos logit contributions
idd+0.011
 cho+0.011
etchup+0.011
 hyd+0.011
mer+0.010
Neg logit contributions
 herself-0.009
 solve-0.008
 Whatever-0.008
 Her-0.008
isy-0.008
Token to
Token ID284
Feature activation+8.90e-04

Pos logit contributions
 herself+0.024
isy+0.022
 Whatever+0.022
 Her+0.022
 seem+0.022
Neg logit contributions
 cho-0.026
idd-0.025
 hyd-0.024
etchup-0.024
mer-0.023
Token remind
Token ID7101
Feature activation+2.46e-04

Pos logit contributions
 cho+0.002
 hyd+0.001
 ge+0.001
idd+0.001
 mill+0.001
Neg logit contributions
ust-0.001
isy-0.001
oll-0.001
aces-0.001
 herself-0.001
Token herself
Token ID5223
Feature activation-1.37e-01

Pos logit contributions
 cho+0.001
idd+0.001
crow+0.001
 hyd+0.001
etchup+0.001
Neg logit contributions
 Whatever-0.000
isy-0.000
ust-0.000
hell-0.000
 seem-0.000
Token of
Token ID286
Feature activation-2.03e-02

Pos logit contributions
 herself+0.183
isy+0.179
 Whatever+0.175
 aloud+0.167
 Her+0.167
Neg logit contributions
 cho-0.258
idd-0.243
 hyd-0.242
etchup-0.235
 ge-0.226
Token the
Token ID262
Feature activation-3.53e-02

Pos logit contributions
 Whatever+0.022
isy+0.022
 Her+0.022
 seem+0.022
 herself+0.021
Neg logit contributions
 cho-0.044
idd-0.041
etchup-0.040
 hyd-0.039
 ge-0.039
Token day
Token ID1110
Feature activation+2.07e-02

Pos logit contributions
 herself+0.055
isy+0.050
 Her+0.050
 Whatever+0.050
 solve+0.049
Neg logit contributions
 cho-0.064
idd-0.063
 hyd-0.060
etchup-0.059
mer-0.057
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
idd+0.043
 cho+0.043
 hyd+0.041
etchup+0.040
 canoe+0.040
Neg logit contributions
 herself-0.028
 Her-0.024
 solve-0.024
 read-0.024
 about-0.024
Token
Token ID198
Feature activation-1.60e-03

Pos logit contributions
isy+0.019
 herself+0.019
icity+0.018
ust+0.018
 solve+0.018
Neg logit contributions
 cho-0.017
idd-0.017
etchup-0.016
enn-0.015
crow-0.015
Tokenily
Token ID813
Feature activation-3.41e-02

Pos logit contributions
 herself+0.017
 Whatever+0.015
isy+0.015
 solve+0.015
 aloud+0.015
Neg logit contributions
idd-0.023
 cho-0.023
etchup-0.022
 hyd-0.022
 ge-0.021
Token liked
Token ID8288
Feature activation-5.60e-02

Pos logit contributions
 herself+0.049
 read+0.046
 Her+0.045
 about+0.043
 solve+0.043
Neg logit contributions
 cho-0.067
idd-0.066
 hyd-0.062
etchup-0.061
boats-0.061
Token to
Token ID284
Feature activation-2.60e-02

Pos logit contributions
 herself+0.089
 Her+0.080
 solve+0.079
isy+0.078
 Whatever+0.078
Neg logit contributions
idd-0.101
 cho-0.101
 hyd-0.095
etchup-0.094
mer-0.092
Token keep
Token ID1394
Feature activation-2.72e-02

Pos logit contributions
isy+0.032
ust+0.030
 herself+0.030
 Whatever+0.029
 aloud+0.028
Neg logit contributions
 cho-0.051
 hyd-0.050
 canoe-0.049
idd-0.048
 ge-0.048
Token cater
Token ID20825
Feature activation-4.88e-02

Pos logit contributions
isy+0.038
 herself+0.038
 Whatever+0.036
 seem+0.035
 solve+0.035
Neg logit contributions
 cho-0.053
idd-0.051
 hyd-0.050
etchup-0.050
 ge-0.047
Tokenpill
Token ID27215
Feature activation-1.37e-01

Pos logit contributions
 herself+0.074
isy+0.069
 Her+0.066
 seem+0.065
 solve+0.064
Neg logit contributions
 cho-0.090
idd-0.088
 hyd-0.087
 canoe-0.082
etchup-0.080
Tokenars
Token ID945
Feature activation-1.83e-02

Pos logit contributions
 herself+0.299
 Whatever+0.288
 Her+0.284
 how+0.276
 aloud+0.274
Neg logit contributions
 cho-0.159
idd-0.144
etchup-0.144
 hyd-0.141
mer-0.139
Token in
Token ID287
Feature activation-5.27e-03

Pos logit contributions
 herself+0.026
isy+0.023
 Her+0.023
 solve+0.023
 Whatever+0.023
Neg logit contributions
 cho-0.036
idd-0.036
 hyd-0.034
etchup-0.034
mer-0.032
Token a
Token ID257
Feature activation-1.82e-02

Pos logit contributions
 herself+0.004
 Her+0.004
 solve+0.003
isy+0.003
 Whatever+0.003
Neg logit contributions
 cho-0.014
idd-0.014
 hyd-0.013
etchup-0.013
mer-0.013
Token jar
Token ID17379
Feature activation+1.19e-02

Pos logit contributions
 solve+0.023
 herself+0.023
 read+0.023
 closely+0.022
 Her+0.022
Neg logit contributions
idd-0.037
 cho-0.036
 hyd-0.034
etchup-0.033
boats-0.033
Token.
Token ID13
Feature activation-1.83e-03

Pos logit contributions
idd+0.024
 cho+0.024
etchup+0.023
 hyd+0.022
 ge+0.022
Neg logit contributions
 herself-0.017
isy-0.015
 solve-0.015
 Her-0.015
 aloud-0.014
Token did
Token ID750
Feature activation-2.55e-02

Pos logit contributions
 cho+0.042
idd+0.042
 hyd+0.040
etchup+0.040
mer+0.039
Neg logit contributions
 herself-0.025
isy-0.023
 Whatever-0.022
 solve-0.022
 Her-0.022
Token something
Token ID1223
Feature activation-5.85e-02

Pos logit contributions
isy+0.028
 Whatever+0.026
 aloud+0.025
 Her+0.025
 herself+0.025
Neg logit contributions
 cho-0.055
 hyd-0.051
idd-0.051
 ge-0.048
etchup-0.048
Token special
Token ID2041
Feature activation-7.33e-03

Pos logit contributions
 herself+0.072
 Whatever+0.069
isy+0.067
aces+0.066
 Her+0.063
Neg logit contributions
 cho-0.121
 hyd-0.111
idd-0.105
etchup-0.103
mer-0.103
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
 herself+0.011
isy+0.010
 Her+0.010
 Whatever+0.010
 solve+0.010
Neg logit contributions
 cho-0.014
idd-0.014
 hyd-0.013
etchup-0.013
mer-0.013
Token On
Token ID1550
Feature activation-4.81e-02

Pos logit contributions
isy+0.022
 solve+0.020
ieve+0.019
 herself+0.019
aces+0.019
Neg logit contributions
idd-0.024
etchup-0.023
 cho-0.023
crow-0.023
 canoe-0.022
Token Wed
Token ID3300
Feature activation-1.36e-01

Pos logit contributions
 herself+0.057
isy+0.050
 solve+0.049
 Whatever+0.049
 Her+0.049
Neg logit contributions
 cho-0.106
idd-0.105
 hyd-0.100
etchup-0.099
mer-0.095
Tokennes
Token ID2516
Feature activation-6.52e-02

Pos logit contributions
 herself+0.207
 Her+0.200
 Whatever+0.196
 closely+0.176
 aloud+0.175
Neg logit contributions
 cho-0.244
idd-0.241
mer-0.226
 hyd-0.223
boats-0.211
Tokendays
Token ID12545
Feature activation-7.43e-02

Pos logit contributions
 herself+0.101
isy+0.091
 Her+0.090
 solve+0.090
 Whatever+0.089
Neg logit contributions
 cho-0.120
idd-0.119
 hyd-0.112
etchup-0.112
mer-0.107
Token she
Token ID673
Feature activation-9.56e-03

Pos logit contributions
 herself+0.125
 Her+0.117
 solve+0.112
 about+0.112
 Whatever+0.111
Neg logit contributions
 cho-0.129
idd-0.128
 hyd-0.123
etchup-0.122
 canoe-0.119
Token liked
Token ID8288
Feature activation-5.91e-02

Pos logit contributions
 herself+0.012
isy+0.011
 Whatever+0.011
 solve+0.011
 Her+0.011
Neg logit contributions
 cho-0.020
idd-0.020
 hyd-0.019
etchup-0.019
mer-0.018
Token to
Token ID284
Feature activation-3.21e-02

Pos logit contributions
 herself+0.093
 Her+0.083
 solve+0.082
 Whatever+0.081
isy+0.081
Neg logit contributions
idd-0.109
 cho-0.108
 hyd-0.103
etchup-0.102
mer-0.099
Token went
Token ID1816
Feature activation+7.42e-03

Pos logit contributions
idd+0.011
 cho+0.011
 hyd+0.010
mer+0.010
etchup+0.010
Neg logit contributions
 herself-0.013
isy-0.012
 Whatever-0.012
 Her-0.012
 aloud-0.012
Token she
Token ID673
Feature activation+1.16e-02

Pos logit contributions
idd+0.014
 cho+0.013
 hyd+0.013
etchup+0.013
 canoe+0.012
Neg logit contributions
 herself-0.012
 Her-0.011
 seem-0.011
 solve-0.011
 Whatever-0.010
Token searched
Token ID16499
Feature activation-3.65e-02

Pos logit contributions
idd+0.020
 cho+0.020
 hyd+0.018
etchup+0.018
mer+0.018
Neg logit contributions
 herself-0.020
isy-0.019
 Whatever-0.018
 Her-0.018
 aloud-0.017
Token for
Token ID329
Feature activation+5.59e-02

Pos logit contributions
 herself+0.053
 solve+0.046
 Her+0.046
isy+0.045
 seem+0.045
Neg logit contributions
idd-0.073
 cho-0.071
 hyd-0.068
etchup-0.068
mer-0.065
Token or
Token ID393
Feature activation-2.66e-02

Pos logit contributions
 cho+0.107
idd+0.100
 ge+0.096
etchup+0.095
 hyd+0.095
Neg logit contributions
isy-0.076
 herself-0.076
 Whatever-0.074
 Her-0.070
 aloud-0.070
Tokennam
Token ID7402
Feature activation-1.36e-01

Pos logit contributions
isy+0.028
 Whatever+0.028
 Her+0.026
 herself+0.026
oll+0.024
Neg logit contributions
 cho-0.057
 hyd-0.055
boats-0.053
mer-0.052
idd-0.052
Tokenents
Token ID658
Feature activation-3.08e-02

Pos logit contributions
 herself+0.179
 Whatever+0.167
isy+0.166
 seem+0.162
 solve+0.161
Neg logit contributions
idd-0.276
 cho-0.272
etchup-0.258
 hyd-0.247
mer-0.241
Token,
Token ID11
Feature activation-1.87e-03

Pos logit contributions
 herself+0.047
isy+0.041
 solve+0.041
 Her+0.041
 seem+0.040
Neg logit contributions
idd-0.059
 cho-0.059
 hyd-0.055
etchup-0.055
 canoe-0.053
Token and
Token ID290
Feature activation-2.85e-02

Pos logit contributions
 herself+0.002
isy+0.002
 Whatever+0.002
 Her+0.002
 solve+0.002
Neg logit contributions
 cho-0.004
idd-0.004
 hyd-0.004
etchup-0.003
mer-0.003
Token the
Token ID262
Feature activation+2.64e-02

Pos logit contributions
isy+0.031
 herself+0.031
 Whatever+0.030
 Her+0.029
 solve+0.028
Neg logit contributions
 cho-0.063
idd-0.059
etchup-0.057
 hyd-0.057
 ge-0.056
Token more
Token ID517
Feature activation+1.58e-02

Pos logit contributions
 cho+0.047
idd+0.046
 hyd+0.044
etchup+0.043
mer+0.042
Neg logit contributions
 herself-0.041
isy-0.038
 Whatever-0.037
 Her-0.037
 solve-0.037
Token she
Token ID673
Feature activation+3.62e-02

Pos logit contributions
 herself+0.034
 Her+0.029
isy+0.028
 solve+0.028
 Whatever+0.027
Neg logit contributions
idd-0.062
 cho-0.061
 hyd-0.059
etchup-0.058
mer-0.057
Token could
Token ID714
Feature activation+5.43e-03

Pos logit contributions
 cho+0.068
idd+0.068
 hyd+0.064
etchup+0.063
mer+0.062
Neg logit contributions
 herself-0.055
isy-0.049
 Her-0.049
 Whatever-0.049
 solve-0.048
Token carry
Token ID3283
Feature activation-1.21e-02

Pos logit contributions
 cho+0.009
 hyd+0.009
idd+0.009
 canoe+0.008
mer+0.008
Neg logit contributions
 Whatever-0.008
ust-0.007
isy-0.007
 closely-0.007
 herself-0.007
Token the
Token ID262
Feature activation-1.47e-02

Pos logit contributions
 herself+0.015
 Her+0.012
 solve+0.012
isy+0.012
 Whatever+0.012
Neg logit contributions
idd-0.027
 cho-0.027
 hyd-0.026
etchup-0.026
mer-0.025
Token or
Token ID393
Feature activation-1.77e-02

Pos logit contributions
 herself+0.023
isy+0.021
 Whatever+0.021
 Her+0.021
 solve+0.021
Neg logit contributions
 cho-0.027
idd-0.026
 hyd-0.025
etchup-0.024
mer-0.024
Tokennam
Token ID7402
Feature activation-1.35e-01

Pos logit contributions
 herself+0.019
isy+0.018
 Her+0.017
 Whatever+0.017
 aloud+0.016
Neg logit contributions
 cho-0.039
 hyd-0.037
idd-0.037
etchup-0.036
mer-0.035
Tokenents
Token ID658
Feature activation-3.36e-02

Pos logit contributions
 herself+0.172
 Whatever+0.156
isy+0.156
 solve+0.154
 seem+0.152
Neg logit contributions
idd-0.281
 cho-0.278
etchup-0.264
 hyd-0.259
mer-0.252
Token to
Token ID284
Feature activation-2.60e-03

Pos logit contributions
 herself+0.050
 solve+0.043
isy+0.042
 seem+0.042
 aloud+0.041
Neg logit contributions
idd-0.068
 cho-0.066
etchup-0.063
 hyd-0.062
 canoe-0.060
Token the
Token ID262
Feature activation-5.02e-03

Pos logit contributions
isy+0.004
ust+0.004
 Whatever+0.004
oll+0.004
aces+0.004
Neg logit contributions
 cho-0.005
 hyd-0.004
 ge-0.004
 mill-0.004
idd-0.004
Token place
Token ID1295
Feature activation+1.24e-02

Pos logit contributions
 herself+0.008
isy+0.007
 Her+0.007
 solve+0.007
 Whatever+0.007
Neg logit contributions
 cho-0.009
idd-0.009
 hyd-0.008
etchup-0.008
mer-0.008
Token.
Token ID13
Feature activation-6.27e-03

Pos logit contributions
idd+0.028
 cho+0.027
etchup+0.026
 hyd+0.025
 ge+0.025
Neg logit contributions
 herself-0.017
 about-0.014
 solve-0.014
 aloud-0.014
 Her-0.014
Token better
Token ID1365
Feature activation-2.15e-02

Pos logit contributions
idd+0.048
 cho+0.047
etchup+0.045
 hyd+0.045
mer+0.044
Neg logit contributions
 herself-0.026
 Her-0.022
 about-0.021
 solve-0.021
isy-0.021
Token now
Token ID783
Feature activation-2.50e-03

Pos logit contributions
 herself+0.030
 sure+0.023
 read+0.023
 about+0.023
 wear+0.022
Neg logit contributions
 cho-0.049
idd-0.047
boats-0.046
 funnel-0.045
 canoe-0.045
Token.
Token ID13
Feature activation-1.65e-02

Pos logit contributions
 herself+0.004
 Her+0.003
 solve+0.003
isy+0.003
 seem+0.003
Neg logit contributions
idd-0.005
 cho-0.005
etchup-0.005
 hyd-0.004
mer-0.004
Token She
Token ID1375
Feature activation-2.34e-02

Pos logit contributions
 herself+0.018
isy+0.018
 solve+0.017
 seem+0.016
 aloud+0.016
Neg logit contributions
 cho-0.037
idd-0.037
etchup-0.035
 hyd-0.035
 canoe-0.033
Token reminded
Token ID14516
Feature activation+1.67e-02

Pos logit contributions
 herself+0.033
isy+0.029
 Her+0.029
 solve+0.029
 Whatever+0.029
Neg logit contributions
 cho-0.047
idd-0.046
 hyd-0.044
etchup-0.044
mer-0.042
Token herself
Token ID5223
Feature activation-1.35e-01

Pos logit contributions
 cho+0.048
idd+0.047
 hyd+0.046
etchup+0.046
 canoe+0.044
Neg logit contributions
 herself-0.008
 solve-0.006
 Whatever-0.006
isy-0.006
 seem-0.006
Token of
Token ID286
Feature activation-2.18e-02

Pos logit contributions
 herself+0.188
isy+0.168
 Whatever+0.166
 Her+0.166
 solve+0.166
Neg logit contributions
 cho-0.267
idd-0.262
 hyd-0.250
etchup-0.248
mer-0.237
Token the
Token ID262
Feature activation-3.21e-02

Pos logit contributions
 herself+0.021
 Whatever+0.019
 Her+0.019
isy+0.019
 seem+0.019
Neg logit contributions
 cho-0.052
idd-0.051
etchup-0.049
 hyd-0.049
 ge-0.046
Token promise
Token ID6991
Feature activation+5.00e-02

Pos logit contributions
 herself+0.041
isy+0.037
 solve+0.036
 Whatever+0.035
 Her+0.035
Neg logit contributions
idd-0.068
 cho-0.068
etchup-0.064
 hyd-0.064
mer-0.062
Token her
Token ID607
Feature activation-7.08e-02

Pos logit contributions
idd+0.109
 cho+0.102
etchup+0.102
 hyd+0.099
mer+0.099
Neg logit contributions
 herself-0.071
 about-0.062
 Her-0.062
 solve-0.058
 read-0.058
Token mom
Token ID1995
Feature activation+5.61e-02

Pos logit contributions
 herself+0.100
isy+0.090
 Her+0.089
 Whatever+0.089
 solve+0.089
Neg logit contributions
 cho-0.139
idd-0.136
 hyd-0.130
etchup-0.129
 canoe-0.123
Token Wed
Token ID3300
Feature activation-1.21e-01

Pos logit contributions
 herself+0.116
isy+0.115
 Whatever+0.110
 seem+0.109
 solve+0.108
Neg logit contributions
 cho-0.113
idd-0.105
etchup-0.101
 hyd-0.100
 ge-0.099
Tokennes
Token ID2516
Feature activation-6.42e-02

Pos logit contributions
 herself+0.178
 Whatever+0.175
 how+0.168
 novels+0.165
 Her+0.164
Neg logit contributions
idd-0.222
mer-0.210
 cho-0.208
 hyd-0.206
boats-0.184
Tokendays
Token ID12545
Feature activation-6.50e-02

Pos logit contributions
 herself+0.111
 Whatever+0.103
isy+0.102
 solve+0.100
 aloud+0.100
Neg logit contributions
idd-0.105
 cho-0.102
 hyd-0.100
etchup-0.096
mer-0.094
Token because
Token ID780
Feature activation-2.02e-02

Pos logit contributions
 herself+0.080
 Her+0.069
 solve+0.068
isy+0.067
 Whatever+0.067
Neg logit contributions
idd-0.142
 cho-0.141
etchup-0.135
 hyd-0.134
 canoe-0.130
Token on
Token ID319
Feature activation-2.89e-02

Pos logit contributions
 herself+0.026
isy+0.025
 solve+0.024
 Whatever+0.024
 Her+0.024
Neg logit contributions
 cho-0.041
idd-0.040
etchup-0.038
 hyd-0.038
 canoe-0.036
Token Wed
Token ID3300
Feature activation-1.34e-01

Pos logit contributions
 herself+0.028
isy+0.025
 Whatever+0.024
 solve+0.024
 Her+0.024
Neg logit contributions
 cho-0.069
idd-0.068
 hyd-0.065
etchup-0.065
 canoe-0.063
Tokennes
Token ID2516
Feature activation-5.69e-02

Pos logit contributions
 herself+0.210
 Whatever+0.206
 Her+0.201
 how+0.194
 aloud+0.190
Neg logit contributions
idd-0.234
 cho-0.226
mer-0.213
 hyd-0.206
boats-0.195
Tokendays
Token ID12545
Feature activation-6.56e-02

Pos logit contributions
 herself+0.100
isy+0.095
 Whatever+0.092
 solve+0.091
 aloud+0.091
Neg logit contributions
idd-0.093
 cho-0.088
 hyd-0.086
etchup-0.083
mer-0.081
Token she
Token ID673
Feature activation+1.99e-02

Pos logit contributions
 herself+0.091
isy+0.084
 solve+0.081
 Whatever+0.081
 Her+0.080
Neg logit contributions
 cho-0.130
idd-0.127
 hyd-0.122
etchup-0.120
mer-0.116
Token did
Token ID750
Feature activation-2.55e-02

Pos logit contributions
 cho+0.042
idd+0.042
 hyd+0.040
etchup+0.040
mer+0.039
Neg logit contributions
 herself-0.025
isy-0.023
 Whatever-0.022
 solve-0.022
 Her-0.022
Token something
Token ID1223
Feature activation-5.85e-02

Pos logit contributions
isy+0.028
 Whatever+0.026
 aloud+0.025
 Her+0.025
 herself+0.025
Neg logit contributions
 cho-0.055
 hyd-0.051
idd-0.051
 ge-0.048
etchup-0.048
Token take
Token ID1011
Feature activation-1.85e-02

Pos logit contributions
 herself+0.020
isy+0.019
 Whatever+0.018
 aloud+0.017
 Her+0.017
Neg logit contributions
 cho-0.038
 hyd-0.037
idd-0.037
 canoe-0.035
 mill-0.034
Token one
Token ID530
Feature activation+9.93e-03

Pos logit contributions
 herself+0.019
isy+0.018
 Her+0.017
 Whatever+0.017
 solve+0.017
Neg logit contributions
 cho-0.043
idd-0.042
etchup-0.040
 hyd-0.040
 canoe-0.039
Token of
Token ID286
Feature activation+1.62e-02

Pos logit contributions
 cho+0.017
idd+0.017
 hyd+0.016
etchup+0.016
mer+0.015
Neg logit contributions
 herself-0.016
isy-0.015
 Her-0.015
 solve-0.015
 Whatever-0.015
Token the
Token ID262
Feature activation-6.00e-03

Pos logit contributions
idd+0.038
 cho+0.038
 hyd+0.036
etchup+0.036
mer+0.035
Neg logit contributions
 herself-0.019
 solve-0.015
 Her-0.015
 Whatever-0.015
 aloud-0.014
Token or
Token ID393
Feature activation-1.49e-02

Pos logit contributions
 herself+0.009
isy+0.008
 solve+0.008
 Her+0.008
 Whatever+0.008
Neg logit contributions
 cho-0.012
idd-0.012
 hyd-0.011
etchup-0.011
mer-0.010
Tokennam
Token ID7402
Feature activation-1.34e-01

Pos logit contributions
 herself+0.016
isy+0.015
 Her+0.014
 Whatever+0.014
 seem+0.013
Neg logit contributions
 cho-0.034
idd-0.033
 hyd-0.032
etchup-0.032
 canoe-0.031
Tokenents
Token ID658
Feature activation-3.12e-02

Pos logit contributions
 herself+0.166
isy+0.149
 Whatever+0.148
 solve+0.146
 seem+0.145
Neg logit contributions
idd-0.287
 cho-0.283
etchup-0.270
 hyd-0.265
mer-0.258
Token outside
Token ID2354
Feature activation-3.97e-02

Pos logit contributions
 herself+0.045
 solve+0.038
 seem+0.038
 about+0.038
 aloud+0.037
Neg logit contributions
idd-0.065
 cho-0.063
etchup-0.061
 hyd-0.059
 canoe-0.057
Token for
Token ID329
Feature activation+3.15e-02

Pos logit contributions
 herself+0.046
 about+0.040
 solve+0.038
 seem+0.037
 aloud+0.037
Neg logit contributions
idd-0.097
 cho-0.094
etchup-0.092
 ge-0.087
 hyd-0.086
Token a
Token ID257
Feature activation-4.99e-02

Pos logit contributions
 cho+0.069
idd+0.066
 hyd+0.065
etchup+0.063
 canoe+0.062
Neg logit contributions
 herself-0.035
isy-0.034
 Her-0.033
 Whatever-0.033
 aloud-0.031
Token walk
Token ID2513
Feature activation-2.76e-02

Pos logit contributions
 herself+0.077
 Her+0.066
 Whatever+0.065
isy+0.065
 aloud+0.063
Neg logit contributions
 cho-0.095
idd-0.091
 hyd-0.088
etchup-0.086
mer-0.085
Token she
Token ID673
Feature activation+3.69e-02

Pos logit contributions
idd+0.006
 cho+0.006
 hyd+0.006
etchup+0.006
mer+0.005
Neg logit contributions
 herself-0.003
 solve-0.002
 Her-0.002
 seem-0.002
isy-0.002
Token could
Token ID714
Feature activation+1.49e-02

Pos logit contributions
idd+0.064
 cho+0.064
etchup+0.060
 hyd+0.060
mer+0.059
Neg logit contributions
 herself-0.061
isy-0.056
 Whatever-0.054
 Her-0.054
 aloud-0.054
Token carry
Token ID3283
Feature activation-1.11e-02

Pos logit contributions
 cho+0.028
 hyd+0.028
idd+0.028
 canoe+0.026
etchup+0.025
Neg logit contributions
isy-0.019
 Whatever-0.018
 herself-0.017
ust-0.017
 aloud-0.016
Token the
Token ID262
Feature activation-1.28e-02

Pos logit contributions
 herself+0.015
 Her+0.013
 solve+0.013
 seem+0.012
 Whatever+0.012
Neg logit contributions
idd-0.024
 cho-0.023
 hyd-0.022
etchup-0.022
mer-0.021
Token or
Token ID393
Feature activation-1.16e-02

Pos logit contributions
 herself+0.020
isy+0.018
 Her+0.018
 Whatever+0.018
 solve+0.018
Neg logit contributions
 cho-0.023
idd-0.023
 hyd-0.021
etchup-0.021
mer-0.021
Tokennam
Token ID7402
Feature activation-1.34e-01

Pos logit contributions
 herself+0.011
isy+0.010
 Her+0.010
 Whatever+0.010
 aloud+0.010
Neg logit contributions
 cho-0.027
idd-0.027
 hyd-0.026
etchup-0.026
 canoe-0.025
Tokenents
Token ID658
Feature activation-3.08e-02

Pos logit contributions
 herself+0.166
isy+0.149
 Whatever+0.148
 solve+0.146
 seem+0.145
Neg logit contributions
idd-0.286
 cho-0.283
etchup-0.270
 hyd-0.265
mer-0.258
Token back
Token ID736
Feature activation+1.89e-02

Pos logit contributions
 herself+0.044
 solve+0.037
 seem+0.036
isy+0.036
 about+0.035
Neg logit contributions
idd-0.065
 cho-0.063
etchup-0.060
 hyd-0.059
 canoe-0.058
Token with
Token ID351
Feature activation+6.83e-03

Pos logit contributions
 cho+0.034
idd+0.033
 hyd+0.031
etchup+0.031
mer+0.030
Neg logit contributions
 herself-0.031
 Her-0.028
 solve-0.028
 Whatever-0.027
isy-0.027
Token her
Token ID607
Feature activation-3.70e-02

Pos logit contributions
idd+0.017
 cho+0.017
 hyd+0.016
etchup+0.016
mer+0.016
Neg logit contributions
 herself-0.007
 Her-0.005
isy-0.005
 solve-0.005
 Whatever-0.005
Token.
Token ID13
Feature activation-1.28e-03

Pos logit contributions
 herself+0.055
isy+0.050
 Her+0.049
 Whatever+0.049
 solve+0.049
Neg logit contributions
 cho-0.070
idd-0.069
 hyd-0.065
etchup-0.065
mer-0.063
Token They
Token ID1119
Feature activation+1.84e-02

Pos logit contributions
idd+0.019
 cho+0.018
 hyd+0.018
etchup+0.017
mer+0.017
Neg logit contributions
 herself-0.013
 Her-0.012
 Whatever-0.011
 aloud-0.011
 about-0.011
Token give
Token ID1577
Feature activation-1.89e-02

Pos logit contributions
 cho+0.033
 ge+0.031
 hyd+0.030
idd+0.030
 canoe+0.029
Neg logit contributions
isy-0.026
 herself-0.026
 aloud-0.026
 Whatever-0.025
 Her-0.025
Token Spot
Token ID15899
Feature activation-4.03e-02

Pos logit contributions
 herself+0.024
 Her+0.020
 aloud+0.019
 read+0.019
 Whatever+0.019
Neg logit contributions
idd-0.043
 hyd-0.040
 cho-0.040
mer-0.039
etchup-0.038
Token more
Token ID517
Feature activation-2.31e-02

Pos logit contributions
 herself+0.048
isy+0.043
 solve+0.042
 Whatever+0.042
 Her+0.042
Neg logit contributions
 cho-0.088
idd-0.086
 hyd-0.083
etchup-0.083
 canoe-0.080
Token or
Token ID393
Feature activation-2.15e-02

Pos logit contributions
 herself+0.036
 Whatever+0.034
 solve+0.034
isy+0.033
 Her+0.033
Neg logit contributions
 cho-0.041
idd-0.039
etchup-0.038
 hyd-0.037
 ge-0.037
Tokennam
Token ID7402
Feature activation-1.33e-01

Pos logit contributions
 herself+0.028
 solve+0.025
 read+0.024
 seem+0.024
 Her+0.023
Neg logit contributions
idd-0.047
 cho-0.045
 hyd-0.043
etchup-0.043
mer-0.043
Tokenents
Token ID658
Feature activation-3.49e-02

Pos logit contributions
 herself+0.166
isy+0.148
 Whatever+0.147
 solve+0.145
 Her+0.144
Neg logit contributions
 cho-0.286
idd-0.284
etchup-0.269
 hyd-0.265
mer-0.256
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
 herself+0.050
 seem+0.042
 about+0.042
 Her+0.041
 solve+0.041
Neg logit contributions
idd-0.073
 cho-0.068
 hyd-0.066
etchup-0.066
 canoe-0.065
Token Spot
Token ID15899
Feature activation-1.17e-04

Pos logit contributions
idd+0.026
 cho+0.025
 hyd+0.025
etchup+0.024
mer+0.024
Neg logit contributions
 herself-0.017
 Her-0.016
 Whatever-0.015
 about-0.014
 aloud-0.014
Token leads
Token ID5983
Feature activation+2.28e-02

Pos logit contributions
 herself+0.000
isy+0.000
 aloud+0.000
 Whatever+0.000
 Her+0.000
Neg logit contributions
 cho-0.000
idd-0.000
etchup-0.000
 hyd-0.000
 ge-0.000
Token them
Token ID606
Feature activation-3.49e-02

Pos logit contributions
idd+0.052
 hyd+0.049
 cho+0.048
etchup+0.048
mer+0.046
Neg logit contributions
 herself-0.031
 Her-0.025
 her-0.024
 about-0.024
 aloud-0.023
Token you
Token ID345
Feature activation-6.95e-03

Pos logit contributions
idd+0.050
 cho+0.049
etchup+0.048
 hyd+0.047
mer+0.045
Neg logit contributions
 herself-0.027
 seem-0.024
 solve-0.023
 Her-0.023
isy-0.023
Token know
Token ID760
Feature activation-7.51e-02

Pos logit contributions
 herself+0.007
 seem+0.007
 solve+0.006
 Her+0.006
 wear+0.006
Neg logit contributions
idd-0.017
 cho-0.017
etchup-0.016
 hyd-0.016
boats-0.015
Token you
Token ID345
Feature activation+1.86e-03

Pos logit contributions
 herself+0.099
 about+0.086
isy+0.085
 solve+0.085
 Her+0.084
Neg logit contributions
idd-0.160
 cho-0.154
 hyd-0.153
etchup-0.152
mer-0.147
Token shouldn
Token ID6584
Feature activation-2.28e-02

Pos logit contributions
idd+0.004
 cho+0.004
etchup+0.004
 hyd+0.004
 canoe+0.003
Neg logit contributions
 herself-0.003
 solve-0.002
 seem-0.002
 Whatever-0.002
 Her-0.002
Tokenâ
Token ID22940
Feature activation+8.11e-03

Pos logit contributions
 herself+0.023
 seem+0.019
 solve+0.019
 Her+0.019
 read+0.018
Neg logit contributions
idd-0.056
 cho-0.056
etchup-0.055
 hyd-0.054
mer-0.053
Token
Token ID26391
Feature activation-1.32e-01

Pos logit contributions
 cho+0.017
idd+0.016
 hyd+0.016
mer+0.016
etchup+0.015
Neg logit contributions
 herself-0.010
 Her-0.009
 aloud-0.009
isy-0.009
 Whatever-0.009
Token
Token ID8151
Feature activation-5.72e-02

Pos logit contributions
 herself+0.174
 Her+0.161
 Whatever+0.158
 seem+0.158
 solve+0.156
Neg logit contributions
 cho-0.265
idd-0.265
 hyd-0.258
etchup-0.252
mer-0.244
Tokent
Token ID83
Feature activation-1.41e-02

Pos logit contributions
 herself+0.081
 Her+0.075
 seem+0.072
 about+0.072
 Whatever+0.071
Neg logit contributions
 hyd-0.111
 cho-0.111
idd-0.109
etchup-0.104
 canoe-0.104
Token speed
Token ID2866
Feature activation-2.30e-02

Pos logit contributions
 herself+0.018
 solve+0.016
 seem+0.016
 Her+0.016
isy+0.016
Neg logit contributions
idd-0.030
 cho-0.030
etchup-0.029
 hyd-0.028
mer-0.027
Token?
Token ID30
Feature activation-1.48e-02

Pos logit contributions
 herself+0.033
 seem+0.027
 about+0.027
 how+0.026
 wear+0.026
Neg logit contributions
idd-0.050
 hyd-0.047
etchup-0.047
 cho-0.046
 canoe-0.044
Tokenâ
Token ID22940
Feature activation+6.47e-02

Pos logit contributions
 herself+0.020
 Her+0.018
 Whatever+0.018
 seem+0.017
 solve+0.016
Neg logit contributions
 cho-0.031
idd-0.030
 hyd-0.029
etchup-0.029
mer-0.028
TokenJimmy
Token ID40335
Feature activation-2.19e-02

Pos logit contributions
 cho+0.074
idd+0.072
 hyd+0.069
etchup+0.067
 canoe+0.064
Neg logit contributions
 herself-0.086
 Her-0.079
 Whatever-0.078
 solve-0.077
 seem-0.076
Token was
Token ID373
Feature activation+3.54e-03

Pos logit contributions
 herself+0.027
 Her+0.025
 read+0.024
 solve+0.023
 wear+0.023
Neg logit contributions
 cho-0.048
idd-0.046
etchup-0.044
 hyd-0.044
boats-0.043
Token always
Token ID1464
Feature activation+1.83e-02

Pos logit contributions
 cho+0.008
idd+0.008
mer+0.007
 hyd+0.007
etchup+0.007
Neg logit contributions
 herself-0.004
 Her-0.004
 about-0.004
 solve-0.003
 read-0.003
Token excited
Token ID6568
Feature activation-2.66e-02

Pos logit contributions
 cho+0.041
idd+0.039
etchup+0.038
crow+0.038
 hyd+0.038
Neg logit contributions
 herself-0.023
 sure-0.020
 about-0.019
 read-0.017
 her-0.017
Token for
Token ID329
Feature activation+3.30e-02

Pos logit contributions
 herself+0.041
isy+0.037
 Her+0.037
 solve+0.037
 Whatever+0.036
Neg logit contributions
 cho-0.049
idd-0.049
 hyd-0.046
etchup-0.046
mer-0.044
Token Wed
Token ID3300
Feature activation-1.30e-01

Pos logit contributions
 cho+0.070
idd+0.069
 hyd+0.066
etchup+0.065
 canoe+0.063
Neg logit contributions
 herself-0.040
isy-0.038
 Whatever-0.036
 Her-0.035
 seem-0.035
Tokennes
Token ID2516
Feature activation-6.10e-02

Pos logit contributions
 herself+0.190
 Whatever+0.174
 Her+0.173
 how+0.159
isy+0.159
Neg logit contributions
idd-0.248
 cho-0.242
mer-0.236
 hyd-0.229
boats-0.216
Tokendays
Token ID12545
Feature activation-6.33e-02

Pos logit contributions
 herself+0.093
isy+0.084
 Her+0.084
 solve+0.083
 Whatever+0.082
Neg logit contributions
 cho-0.114
idd-0.113
etchup-0.106
 hyd-0.106
 canoe-0.101
Token because
Token ID780
Feature activation-1.27e-02

Pos logit contributions
 herself+0.082
 about+0.074
 Her+0.071
 solve+0.070
 aloud+0.069
Neg logit contributions
idd-0.138
 cho-0.135
etchup-0.133
 hyd-0.128
 ge-0.127
Token it
Token ID340
Feature activation-6.47e-03

Pos logit contributions
 herself+0.018
 about+0.015
 Her+0.015
 Whatever+0.015
 solve+0.015
Neg logit contributions
idd-0.026
 cho-0.025
 hyd-0.024
etchup-0.024
mer-0.024
Token was
Token ID373
Feature activation+1.95e-02

Pos logit contributions
 herself+0.008
 seem+0.007
 solve+0.007
 read+0.007
 Her+0.007
Neg logit contributions
idd-0.014
 cho-0.014
etchup-0.014
 hyd-0.013
 ge-0.013
Token to
Token ID284
Feature activation-2.48e-02

Pos logit contributions
 herself+0.094
 Her+0.084
isy+0.083
 solve+0.083
 Whatever+0.083
Neg logit contributions
 cho-0.108
idd-0.108
 hyd-0.101
etchup-0.101
mer-0.098
Token look
Token ID804
Feature activation-5.61e-02

Pos logit contributions
ust+0.032
isy+0.032
 herself+0.030
 aloud+0.029
 Whatever+0.028
Neg logit contributions
 cho-0.047
 hyd-0.045
 canoe-0.045
 ge-0.045
etchup-0.045
Token at
Token ID379
Feature activation+3.34e-02

Pos logit contributions
 herself+0.099
 seem+0.095
isy+0.094
 Whatever+0.091
 Her+0.091
Neg logit contributions
 cho-0.087
idd-0.084
 hyd-0.080
etchup-0.079
mer-0.076
Token the
Token ID262
Feature activation+8.91e-03

Pos logit contributions
 cho+0.062
idd+0.061
etchup+0.058
 hyd+0.057
 ge+0.056
Neg logit contributions
 herself-0.048
isy-0.047
 solve-0.045
 seem-0.045
 Whatever-0.044
Token or
Token ID393
Feature activation-7.38e-03

Pos logit contributions
idd+0.018
 cho+0.017
 hyd+0.017
etchup+0.016
mer+0.016
Neg logit contributions
 herself-0.012
 solve-0.011
 Whatever-0.011
 read-0.011
isy-0.011
Tokennam
Token ID7402
Feature activation-1.30e-01

Pos logit contributions
 herself+0.009
isy+0.009
 Whatever+0.008
 Her+0.008
 solve+0.008
Neg logit contributions
 cho-0.015
 hyd-0.015
idd-0.015
etchup-0.014
 ge-0.014
Tokenents
Token ID658
Feature activation-3.20e-02

Pos logit contributions
 herself+0.175
isy+0.163
 Whatever+0.163
 seem+0.161
 aloud+0.158
Neg logit contributions
idd-0.263
 cho-0.253
etchup-0.243
 hyd-0.235
mer-0.229
Token on
Token ID319
Feature activation+9.41e-03

Pos logit contributions
 herself+0.047
 Whatever+0.044
 Her+0.044
isy+0.043
 solve+0.042
Neg logit contributions
 cho-0.059
idd-0.057
 hyd-0.056
etchup-0.055
mer-0.053
Token the
Token ID262
Feature activation-8.78e-03

Pos logit contributions
 cho+0.018
idd+0.017
 hyd+0.017
etchup+0.016
 canoe+0.016
Neg logit contributions
 herself-0.013
isy-0.013
 Whatever-0.012
 solve-0.012
 Her-0.012
Token tree
Token ID5509
Feature activation-1.48e-03

Pos logit contributions
 herself+0.013
isy+0.012
 Her+0.012
 solve+0.012
 Whatever+0.012
Neg logit contributions
 cho-0.017
idd-0.016
 hyd-0.015
etchup-0.015
mer-0.015
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
 herself+0.002
isy+0.002
 Whatever+0.002
 Her+0.002
 solve+0.002
Neg logit contributions
 cho-0.003
idd-0.003
 hyd-0.003
etchup-0.003
mer-0.002
Token!
Token ID0
Feature activation-3.09e-03

Pos logit contributions
 herself+0.009
 about+0.009
 solve+0.008
 aloud+0.008
 seem+0.008
Neg logit contributions
idd-0.013
etchup-0.013
 cho-0.013
 hyd-0.012
 ge-0.012
Token
Token ID198
Feature activation-5.16e-03

Pos logit contributions
isy+0.005
 herself+0.005
 solve+0.004
 seem+0.004
 aloud+0.004
Neg logit contributions
 cho-0.006
idd-0.006
etchup-0.005
 canoe-0.005
 hyd-0.005
Token
Token ID198
Feature activation-7.03e-03

Pos logit contributions
 blush+0.009
 herself+0.009
 seem+0.009
isy+0.009
ust+0.009
Neg logit contributions
 cho-0.008
idd-0.007
enn-0.007
etchup-0.007
 canoe-0.006
TokenTom
Token ID13787
Feature activation-1.10e-02

Pos logit contributions
 herself+0.010
 Her+0.009
 seem+0.009
isy+0.009
 aloud+0.009
Neg logit contributions
 cho-0.014
idd-0.013
 ge-0.012
boats-0.012
 hyd-0.012
Token enjoyed
Token ID8359
Feature activation-4.40e-02

Pos logit contributions
 herself+0.014
 read+0.013
isy+0.012
 solve+0.012
 Her+0.012
Neg logit contributions
idd-0.023
 cho-0.023
etchup-0.022
 hyd-0.022
 canoe-0.021
Token imagining
Token ID34140
Feature activation-1.30e-01

Pos logit contributions
 herself+0.061
 solve+0.053
 Her+0.052
isy+0.052
 Whatever+0.052
Neg logit contributions
idd-0.089
 cho-0.089
 hyd-0.085
etchup-0.085
mer-0.082
Token so
Token ID523
Feature activation-5.63e-02

Pos logit contributions
 herself+0.137
isy+0.129
 Her+0.128
 Whatever+0.124
 solve+0.118
Neg logit contributions
 cho-0.297
idd-0.280
 hyd-0.273
etchup-0.266
 canoe-0.265
Token much
Token ID881
Feature activation-2.88e-02

Pos logit contributions
 herself+0.083
 Her+0.083
 blush+0.081
isy+0.078
 write+0.077
Neg logit contributions
 cho-0.107
 men-0.085
idd-0.085
 canoe-0.084
iren-0.080
Token,
Token ID11
Feature activation-7.43e-03

Pos logit contributions
 herself+0.040
 Her+0.037
isy+0.036
 Whatever+0.036
 solve+0.035
Neg logit contributions
 cho-0.057
idd-0.055
 hyd-0.053
etchup-0.052
 canoe-0.051
Token he
Token ID339
Feature activation+3.47e-02

Pos logit contributions
 herself+0.010
 solve+0.009
 Whatever+0.009
 read+0.009
 seem+0.009
Neg logit contributions
idd-0.015
 cho-0.015
etchup-0.014
 hyd-0.014
mer-0.014
Token decided
Token ID3066
Feature activation-1.02e-02

Pos logit contributions
idd+0.071
 cho+0.070
etchup+0.068
 hyd+0.067
 canoe+0.064
Neg logit contributions
 herself-0.047
isy-0.041
 solve-0.041
 read-0.041
 Her-0.040
Token,
Token ID11
Feature activation+1.99e-02

Pos logit contributions
idd+0.064
 cho+0.061
etchup+0.060
 hyd+0.060
mer+0.057
Neg logit contributions
 herself-0.067
 Her-0.062
isy-0.060
 aloud-0.060
 solve-0.060
Token "
Token ID366
Feature activation+3.85e-02

Pos logit contributions
 cho+0.037
idd+0.035
 hyd+0.034
etchup+0.033
 canoe+0.032
Neg logit contributions
 herself-0.030
 solve-0.028
isy-0.028
 Whatever-0.027
 seem-0.027
TokenI
Token ID40
Feature activation+1.24e-02

Pos logit contributions
 cho+0.065
idd+0.064
etchup+0.059
 hyd+0.059
 canoe+0.057
Neg logit contributions
 herself-0.069
 Her-0.063
isy-0.062
 Whatever-0.062
 read-0.061
Token love
Token ID1842
Feature activation-2.06e-02

Pos logit contributions
 cho+0.027
idd+0.027
etchup+0.025
 hyd+0.025
boats+0.025
Neg logit contributions
 herself-0.014
 solve-0.014
 seem-0.014
 read-0.014
 wear-0.013
Token cater
Token ID20825
Feature activation-3.72e-02

Pos logit contributions
 herself+0.031
 solve+0.026
 Her+0.026
 read+0.026
 about+0.026
Neg logit contributions
idd-0.041
 cho-0.039
 hyd-0.039
mer-0.039
etchup-0.038
Tokenpill
Token ID27215
Feature activation-1.30e-01

Pos logit contributions
 herself+0.054
isy+0.048
 Her+0.047
 solve+0.047
 Whatever+0.047
Neg logit contributions
 cho-0.073
idd-0.072
 hyd-0.069
etchup-0.067
 canoe-0.065
Tokenars
Token ID945
Feature activation-1.46e-02

Pos logit contributions
 herself+0.272
 Whatever+0.251
 Her+0.251
 aloud+0.244
 how+0.243
Neg logit contributions
 cho-0.166
idd-0.157
mer-0.155
etchup-0.153
boats-0.146
Token!
Token ID0
Feature activation-4.97e-03

Pos logit contributions
 herself+0.024
isy+0.021
 about+0.021
 Her+0.020
 read+0.020
Neg logit contributions
idd-0.027
 cho-0.026
 hyd-0.025
etchup-0.025
 canoe-0.025
Token You
Token ID921
Feature activation+8.82e-04

Pos logit contributions
 herself+0.008
 Whatever+0.008
 Her+0.008
 solve+0.007
 seem+0.007
Neg logit contributions
 cho-0.009
idd-0.009
etchup-0.008
 hyd-0.008
mer-0.008
Token can
Token ID460
Feature activation-7.13e-03

Pos logit contributions
 cho+0.002
idd+0.001
 hyd+0.001
mer+0.001
etchup+0.001
Neg logit contributions
 herself-0.001
isy-0.001
 aloud-0.001
 Her-0.001
 Whatever-0.001
Token be
Token ID307
Feature activation-2.37e-02

Pos logit contributions
 herself+0.008
 aloud+0.007
 Whatever+0.007
isy+0.007
 Her+0.007
Neg logit contributions
 cho-0.015
idd-0.015
 hyd-0.015
etchup-0.014
 canoe-0.014
Token time
Token ID640
Feature activation+3.34e-02

Pos logit contributions
 herself+0.039
isy+0.035
 solve+0.034
 Her+0.034
 Whatever+0.034
Neg logit contributions
 cho-0.062
idd-0.062
etchup-0.059
 hyd-0.059
mer-0.057
Token playing
Token ID2712
Feature activation-7.90e-02

Pos logit contributions
 cho+0.071
idd+0.068
 hyd+0.067
etchup+0.066
 canoe+0.065
Neg logit contributions
 herself-0.045
 about-0.040
 solve-0.038
 read-0.037
 aloud-0.037
Token with
Token ID351
Feature activation-2.03e-02

Pos logit contributions
 herself+0.108
 solve+0.091
 Her+0.088
 aloud+0.088
isy+0.087
Neg logit contributions
 cho-0.168
idd-0.167
etchup-0.160
 hyd-0.157
 canoe-0.152
Token her
Token ID607
Feature activation-6.09e-02

Pos logit contributions
 herself+0.012
isy+0.009
 Her+0.009
 solve+0.009
 Whatever+0.009
Neg logit contributions
 cho-0.057
idd-0.057
 hyd-0.054
etchup-0.054
mer-0.053
Token or
Token ID393
Feature activation-1.26e-02

Pos logit contributions
 herself+0.086
isy+0.076
 solve+0.076
 Her+0.075
 Whatever+0.074
Neg logit contributions
 cho-0.122
idd-0.120
 hyd-0.114
etchup-0.113
 canoe-0.109
Tokennam
Token ID7402
Feature activation-1.29e-01

Pos logit contributions
 herself+0.014
 solve+0.012
 Her+0.012
isy+0.012
 Whatever+0.012
Neg logit contributions
idd-0.029
 cho-0.029
 hyd-0.027
etchup-0.027
mer-0.027
Tokenents
Token ID658
Feature activation-1.93e-02

Pos logit contributions
 herself+0.174
 Whatever+0.165
 seem+0.164
isy+0.163
 solve+0.160
Neg logit contributions
idd-0.260
 cho-0.250
etchup-0.239
 hyd-0.227
mer-0.226
Token and
Token ID290
Feature activation-9.53e-03

Pos logit contributions
 herself+0.026
isy+0.023
 solve+0.022
 Her+0.022
 aloud+0.022
Neg logit contributions
idd-0.040
 cho-0.040
etchup-0.038
 hyd-0.038
 canoe-0.036
Token making
Token ID1642
Feature activation+9.38e-04

Pos logit contributions
 herself+0.013
 solve+0.011
 read+0.011
 Her+0.011
isy+0.011
Neg logit contributions
idd-0.020
 cho-0.020
 hyd-0.019
etchup-0.019
mer-0.019
Token her
Token ID607
Feature activation-7.24e-02

Pos logit contributions
 cho+0.002
idd+0.002
etchup+0.002
 hyd+0.002
 ge+0.002
Neg logit contributions
 herself-0.001
 Whatever-0.001
isy-0.001
 solve-0.001
 Her-0.001
Token special
Token ID2041
Feature activation-2.07e-02

Pos logit contributions
isy+0.087
 herself+0.084
 Whatever+0.080
ust+0.078
 solve+0.078
Neg logit contributions
 cho-0.140
idd-0.132
etchup-0.129
 ge-0.129
 hyd-0.128
Token here
Token ID994
Feature activation-1.44e-02

Pos logit contributions
 cho+0.007
idd+0.007
etchup+0.007
mer+0.007
boats+0.007
Neg logit contributions
 herself-0.005
 about-0.004
 sure-0.004
 seem-0.004
 Her-0.004
Token and
Token ID290
Feature activation-2.99e-02

Pos logit contributions
 herself+0.019
 about+0.018
 solve+0.017
 sure+0.016
 seem+0.016
Neg logit contributions
idd-0.033
 cho-0.031
etchup-0.031
boats-0.029
mer-0.029
Token I
Token ID314
Feature activation+2.90e-02

Pos logit contributions
 herself+0.041
 solve+0.037
 seem+0.036
 read+0.036
 Her+0.035
Neg logit contributions
idd-0.062
 cho-0.061
etchup-0.059
 hyd-0.057
mer-0.057
Token won
Token ID1839
Feature activation+5.41e-03

Pos logit contributions
 cho+0.057
idd+0.055
etchup+0.053
 hyd+0.051
boats+0.051
Neg logit contributions
 herself-0.042
 solve-0.039
 seem-0.039
 Her-0.038
 read-0.038
Tokenâ
Token ID22940
Feature activation+1.77e-02

Pos logit contributions
 cho+0.012
idd+0.012
 hyd+0.011
etchup+0.011
 canoe+0.011
Neg logit contributions
 herself-0.006
isy-0.006
 Her-0.006
 Whatever-0.006
 solve-0.006
Token
Token ID26391
Feature activation-1.29e-01

Pos logit contributions
 cho+0.037
idd+0.036
 hyd+0.035
etchup+0.034
mer+0.034
Neg logit contributions
 herself-0.022
 aloud-0.020
 Her-0.019
 Whatever-0.019
isy-0.019
Token
Token ID8151
Feature activation-6.43e-02

Pos logit contributions
 herself+0.169
 Her+0.151
 seem+0.148
 Whatever+0.148
 solve+0.148
Neg logit contributions
 cho-0.264
idd-0.264
 hyd-0.254
etchup-0.252
mer-0.245
Tokent
Token ID83
Feature activation-1.22e-02

Pos logit contributions
 herself+0.083
 Her+0.077
 seem+0.072
 about+0.070
isy+0.070
Neg logit contributions
 hyd-0.136
 cho-0.132
idd-0.131
 canoe-0.127
etchup-0.126
Token let
Token ID1309
Feature activation-9.91e-03

Pos logit contributions
 herself+0.016
 Whatever+0.015
 Her+0.015
 aloud+0.014
isy+0.014
Neg logit contributions
 cho-0.024
idd-0.023
 hyd-0.023
mer-0.022
 canoe-0.022
Token anything
Token ID1997
Feature activation-8.80e-02

Pos logit contributions
 herself+0.015
isy+0.015
 aloud+0.015
 Her+0.014
 Whatever+0.014
Neg logit contributions
 cho-0.018
idd-0.016
 hyd-0.016
etchup-0.015
 canoe-0.015
Token bad
Token ID2089
Feature activation+2.23e-02

Pos logit contributions
 herself+0.110
 solve+0.096
isy+0.095
 seem+0.095
 Her+0.093
Neg logit contributions
idd-0.191
 cho-0.189
etchup-0.180
 hyd-0.179
 canoe-0.172
Token bird
Token ID6512
Feature activation+2.90e-04

Pos logit contributions
idd+0.035
 cho+0.034
etchup+0.033
mer+0.032
 hyd+0.032
Neg logit contributions
 herself-0.027
isy-0.025
 solve-0.024
 seem-0.023
 Whatever-0.023
Token.
Token ID13
Feature activation-2.83e-03

Pos logit contributions
 cho+0.001
idd+0.001
 hyd+0.001
etchup+0.000
mer+0.000
Neg logit contributions
 herself-0.000
 Whatever-0.000
 Her-0.000
 solve-0.000
isy-0.000
Token Suddenly
Token ID24975
Feature activation-7.64e-03

Pos logit contributions
 herself+0.004
isy+0.003
 solve+0.003
 seem+0.003
 Whatever+0.003
Neg logit contributions
 cho-0.006
idd-0.006
 hyd-0.006
etchup-0.005
 canoe-0.005
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
 herself+0.010
 solve+0.009
 Her+0.008
 her+0.008
 about+0.008
Neg logit contributions
idd-0.017
mer-0.017
etchup-0.017
 hyd-0.017
rim-0.016
Token his
Token ID465
Feature activation+1.20e-02

Pos logit contributions
idd+0.037
 cho+0.036
mer+0.035
etchup+0.035
 hyd+0.034
Neg logit contributions
 herself-0.018
 solve-0.015
 Her-0.015
 read-0.015
 aloud-0.014
Token mom
Token ID1995
Feature activation+9.22e-02

Pos logit contributions
 cho+0.025
idd+0.025
etchup+0.024
 hyd+0.023
mer+0.023
Neg logit contributions
 herself-0.016
isy-0.014
 Her-0.014
 solve-0.013
 aloud-0.013
Tokenmy
Token ID1820
Feature activation-3.57e-03

Pos logit contributions
idd+0.213
mer+0.213
 cho+0.212
 hyd+0.212
boats+0.206
Neg logit contributions
 herself-0.097
 read-0.091
 Her-0.088
isy-0.086
 about-0.081
Token came
Token ID1625
Feature activation+3.30e-02

Pos logit contributions
 read+0.005
 herself+0.005
isy+0.005
 Her+0.005
 sure+0.005
Neg logit contributions
boats-0.007
idd-0.007
 hyd-0.007
mer-0.007
 cho-0.007
Token outside
Token ID2354
Feature activation-2.81e-02

Pos logit contributions
 cho+0.073
idd+0.070
etchup+0.069
boats+0.068
mer+0.068
Neg logit contributions
 herself-0.045
 solve-0.036
 Her-0.035
 read-0.035
 closely-0.035
Token and
Token ID290
Feature activation-3.74e-03

Pos logit contributions
 herself+0.037
 Her+0.033
 about+0.033
 read+0.032
 solve+0.031
Neg logit contributions
idd-0.063
 cho-0.062
etchup-0.058
boats-0.058
mer-0.057
Token saw
Token ID2497
Feature activation-1.91e-03

Pos logit contributions
 herself+0.007
 read+0.006
 Her+0.006
 solve+0.006
isy+0.006
Neg logit contributions
idd-0.006
 cho-0.006
etchup-0.006
 hyd-0.006
mer-0.005
Token counted
Token ID14789
Feature activation-1.23e-02

Pos logit contributions
 herself+0.007
 Whatever+0.007
 Her+0.006
isy+0.006
 seem+0.006
Neg logit contributions
 cho-0.008
idd-0.008
etchup-0.008
 ge-0.008
mer-0.008
Token five
Token ID1936
Feature activation+1.33e-02

Pos logit contributions
isy+0.018
 Whatever+0.018
 herself+0.017
 solve+0.017
 Her+0.017
Neg logit contributions
 cho-0.022
idd-0.020
etchup-0.019
 ge-0.019
 canoe-0.018
Token,
Token ID11
Feature activation+2.70e-02

Pos logit contributions
 cho+0.020
idd+0.019
etchup+0.019
mer+0.018
 ge+0.018
Neg logit contributions
 herself-0.024
 Her-0.023
 Whatever-0.022
 solve-0.022
 aloud-0.021
Token six
Token ID2237
Feature activation+2.87e-02

Pos logit contributions
 cho+0.051
idd+0.051
 hyd+0.048
etchup+0.048
mer+0.046
Neg logit contributions
 herself-0.040
isy-0.036
 Her-0.036
 solve-0.036
 Whatever-0.036
Token,
Token ID11
Feature activation+3.69e-02

Pos logit contributions
 cho+0.042
idd+0.040
etchup+0.039
 hyd+0.037
 ge+0.037
Neg logit contributions
 herself-0.053
 Whatever-0.050
 Her-0.050
 solve-0.049
 aloud-0.048
Token seven
Token ID3598
Feature activation+8.87e-02

Pos logit contributions
idd+0.056
 cho+0.056
 hyd+0.052
etchup+0.052
mer+0.049
Neg logit contributions
 herself-0.070
isy-0.064
 solve-0.063
 Her-0.063
 read-0.062
Token,
Token ID11
Feature activation+4.57e-02

Pos logit contributions
 cho+0.139
idd+0.137
etchup+0.128
 hyd+0.127
mer+0.121
Neg logit contributions
 herself-0.161
isy-0.147
 Her-0.147
 solve-0.146
 Whatever-0.146
Token eight
Token ID3624
Feature activation+4.93e-02

Pos logit contributions
 cho+0.090
idd+0.089
 hyd+0.085
etchup+0.084
mer+0.081
Neg logit contributions
 herself-0.066
 solve-0.058
isy-0.058
 Her-0.058
 Whatever-0.057
Token.
Token ID13
Feature activation+9.74e-03

Pos logit contributions
idd+0.089
 cho+0.087
 hyd+0.085
etchup+0.079
boats+0.079
Neg logit contributions
 herself-0.083
 solve-0.074
isy-0.073
 Her-0.073
 seem-0.073
Token He
Token ID679
Feature activation+5.72e-03

Pos logit contributions
idd+0.017
 cho+0.016
etchup+0.015
 hyd+0.015
 canoe+0.015
Neg logit contributions
isy-0.016
 herself-0.015
ust-0.014
 solve-0.014
 bracelet-0.014
Token even
Token ID772
Feature activation+4.36e-02

Pos logit contributions
idd+0.010
 cho+0.010
etchup+0.010
mer+0.009
 ge+0.009
Neg logit contributions
 herself-0.009
 Whatever-0.008
 Her-0.008
 seem-0.008
isy-0.008
Token:
Token ID25
Feature activation-4.67e-03

Pos logit contributions
idd+0.094
 cho+0.092
 hyd+0.091
etchup+0.090
mer+0.089
Neg logit contributions
 herself-0.046
 Her-0.043
 aloud-0.039
 about-0.039
isy-0.039
Token "
Token ID366
Feature activation+4.99e-02

Pos logit contributions
 herself+0.008
isy+0.008
ust+0.007
 solve+0.007
 seem+0.007
Neg logit contributions
 cho-0.008
idd-0.007
 hyd-0.007
etchup-0.007
 canoe-0.006
TokenLaw
Token ID16966
Feature activation+1.47e-02

Pos logit contributions
 cho+0.091
idd+0.090
 hyd+0.085
etchup+0.085
mer+0.081
Neg logit contributions
 herself-0.078
isy-0.071
 solve-0.070
 Her-0.070
 Whatever-0.070
Token is
Token ID318
Feature activation+3.02e-02

Pos logit contributions
idd+0.027
 cho+0.027
etchup+0.025
 hyd+0.025
 canoe+0.024
Neg logit contributions
 herself-0.023
 solve-0.021
isy-0.020
 seem-0.020
 Whatever-0.020
Token a
Token ID257
Feature activation+6.28e-04

Pos logit contributions
idd+0.071
 cho+0.070
 hyd+0.067
etchup+0.067
mer+0.066
Neg logit contributions
 herself-0.032
 solve-0.027
isy-0.027
 Her-0.027
 Whatever-0.026
Token rule
Token ID3896
Feature activation+9.52e-02

Pos logit contributions
idd+0.001
etchup+0.001
 cho+0.001
 hyd+0.001
mer+0.001
Neg logit contributions
 solve-0.001
 read-0.001
 math-0.001
 herself-0.001
 understand-0.001
Token that
Token ID326
Feature activation+5.74e-02

Pos logit contributions
idd+0.207
 cho+0.202
 hyd+0.198
etchup+0.193
 canoe+0.193
Neg logit contributions
 herself-0.120
 about-0.117
 how-0.104
 solve-0.104
 seem-0.101
Token people
Token ID661
Feature activation+2.64e-02

Pos logit contributions
idd+0.119
 cho+0.112
etchup+0.111
 hyd+0.109
mer+0.107
Neg logit contributions
 herself-0.079
 solve-0.074
 seem-0.073
 about-0.070
 read-0.069
Token make
Token ID787
Feature activation-8.38e-04

Pos logit contributions
 cho+0.057
idd+0.057
 hyd+0.054
etchup+0.053
mer+0.052
Neg logit contributions
 herself-0.032
isy-0.028
 solve-0.028
 Her-0.028
 Whatever-0.028
Token to
Token ID284
Feature activation-5.14e-03

Pos logit contributions
 herself+0.001
 solve+0.001
isy+0.001
 Her+0.001
 seem+0.001
Neg logit contributions
idd-0.002
 cho-0.002
etchup-0.001
 hyd-0.001
mer-0.001
Token keep
Token ID1394
Feature activation-1.11e-02

Pos logit contributions
 herself+0.006
isy+0.006
 Her+0.006
 Whatever+0.005
 aloud+0.005
Neg logit contributions
 cho-0.011
idd-0.011
 hyd-0.010
etchup-0.010
 canoe-0.010
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
idd+0.016
 cho+0.015
etchup+0.015
 canoe+0.014
 ge+0.014
Neg logit contributions
 herself-0.009
 about-0.008
 read-0.008
 Her-0.008
 how-0.008
Token
Token ID198
Feature activation-6.64e-03

Pos logit contributions
 herself+0.015
 Her+0.013
 Whatever+0.013
isy+0.012
 solve+0.012
Neg logit contributions
 cho-0.027
idd-0.026
 hyd-0.025
etchup-0.025
mer-0.024
Token
Token ID198
Feature activation-8.28e-03

Pos logit contributions
 herself+0.008
isy+0.007
 solve+0.007
 seem+0.007
 Whatever+0.007
Neg logit contributions
 cho-0.014
idd-0.014
etchup-0.013
 hyd-0.013
 canoe-0.013
Token"
Token ID1
Feature activation+3.70e-02

Pos logit contributions
 herself+0.011
isy+0.010
 Her+0.010
 solve+0.010
 Whatever+0.010
Neg logit contributions
 cho-0.017
idd-0.017
 hyd-0.016
etchup-0.016
mer-0.015
TokenOh
Token ID5812
Feature activation-6.86e-03

Pos logit contributions
 cho+0.062
idd+0.060
 hyd+0.057
etchup+0.056
 canoe+0.054
Neg logit contributions
 herself-0.065
isy-0.059
 Her-0.059
 Whatever-0.059
 read-0.058
Token no
Token ID645
Feature activation+9.43e-02

Pos logit contributions
 herself+0.010
 Her+0.009
 solve+0.009
 seem+0.009
 how+0.009
Neg logit contributions
idd-0.013
etchup-0.013
 hyd-0.013
mer-0.013
boats-0.013
Token,"
Token ID553
Feature activation-3.79e-02

Pos logit contributions
idd+0.168
 hyd+0.168
 cho+0.166
etchup+0.159
boats+0.159
Neg logit contributions
 herself-0.151
isy-0.142
 solve-0.141
 about-0.136
 aloud-0.135
Token Ben
Token ID3932
Feature activation-1.41e-02

Pos logit contributions
 herself+0.048
 Her+0.043
 read+0.038
 her+0.038
 aloud+0.037
Neg logit contributions
idd-0.088
 cho-0.085
 hyd-0.083
etchup-0.081
mer-0.080
Token said
Token ID531
Feature activation+3.61e-02

Pos logit contributions
 herself+0.019
 Her+0.018
 read+0.018
 sure+0.018
 seem+0.017
Neg logit contributions
idd-0.030
 cho-0.029
etchup-0.028
 hyd-0.028
boats-0.028
Token.
Token ID13
Feature activation+8.48e-03

Pos logit contributions
idd+0.080
 cho+0.079
etchup+0.077
 hyd+0.075
 canoe+0.073
Neg logit contributions
 herself-0.046
 Her-0.040
 aloud-0.040
 about-0.037
 solve-0.037
Token "
Token ID366
Feature activation+3.58e-02

Pos logit contributions
idd+0.021
 cho+0.020
etchup+0.020
 hyd+0.020
boats+0.019
Neg logit contributions
 herself-0.009
 Her-0.008
 aloud-0.007
 about-0.007
 Whatever-0.007
Token too
Token ID1165
Feature activation+2.72e-02

Pos logit contributions
 cho+0.060
idd+0.058
etchup+0.056
 hyd+0.056
 ge+0.053
Neg logit contributions
 herself-0.034
isy-0.031
 Whatever-0.031
 solve-0.031
 seem-0.030
Token much
Token ID881
Feature activation+1.84e-02

Pos logit contributions
 cho+0.057
idd+0.056
 hyd+0.053
etchup+0.053
mer+0.051
Neg logit contributions
 herself-0.036
isy-0.032
 Her-0.031
 solve-0.031
 Whatever-0.031
Token.
Token ID13
Feature activation+8.01e-03

Pos logit contributions
idd+0.039
 cho+0.037
 hyd+0.036
etchup+0.035
mer+0.035
Neg logit contributions
 herself-0.025
 solve-0.023
 about-0.023
 Her-0.022
 read-0.022
Token Mom
Token ID11254
Feature activation+4.72e-02

Pos logit contributions
idd+0.017
 cho+0.016
etchup+0.016
 hyd+0.015
 ge+0.015
Neg logit contributions
isy-0.010
 herself-0.010
 solve-0.009
 seem-0.009
 read-0.008
Token says
Token ID1139
Feature activation+3.99e-02

Pos logit contributions
 cho+0.080
idd+0.079
 hyd+0.076
etchup+0.073
mer+0.072
Neg logit contributions
 herself-0.080
 solve-0.072
 Her-0.072
isy-0.072
 Whatever-0.071
Token no
Token ID645
Feature activation+1.04e-01

Pos logit contributions
idd+0.079
 cho+0.075
 hyd+0.075
mer+0.074
etchup+0.072
Neg logit contributions
 herself-0.060
 Her-0.054
 seem-0.053
 solve-0.052
 aloud-0.052
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
 cho+0.208
idd+0.207
 hyd+0.196
etchup+0.195
mer+0.188
Neg logit contributions
 herself-0.149
 Her-0.132
isy-0.131
 solve-0.131
 Whatever-0.130
Token Lily
Token ID20037
Feature activation-2.96e-02

Pos logit contributions
 cho+0.031
idd+0.031
 hyd+0.030
etchup+0.030
mer+0.029
Neg logit contributions
 herself-0.012
 Her-0.011
 Whatever-0.010
 solve-0.010
 seem-0.009
Token is
Token ID318
Feature activation+3.08e-02

Pos logit contributions
 herself+0.047
isy+0.043
 Her+0.042
 solve+0.042
 Whatever+0.042
Neg logit contributions
 cho-0.053
idd-0.053
 hyd-0.050
etchup-0.050
mer-0.047
Token sad
Token ID6507
Feature activation-9.45e-03

Pos logit contributions
 cho+0.054
idd+0.053
etchup+0.050
 hyd+0.050
 ge+0.047
Neg logit contributions
 herself-0.049
isy-0.047
 Whatever-0.046
 solve-0.045
 Her-0.045
Token.
Token ID13
Feature activation+8.86e-03

Pos logit contributions
 herself+0.014
isy+0.013
 Whatever+0.013
 solve+0.013
 Her+0.013
Neg logit contributions
 cho-0.018
idd-0.017
etchup-0.016
 hyd-0.016
mer-0.015
Token mom
Token ID1995
Feature activation+6.90e-02

Pos logit contributions
 herself+0.072
 seem+0.070
 Whatever+0.068
isy+0.066
ust+0.065
Neg logit contributions
 cho-0.063
idd-0.061
etchup-0.057
 hyd-0.056
 ge-0.054
Token smiled
Token ID13541
Feature activation+1.49e-02

Pos logit contributions
 cho+0.127
idd+0.126
 hyd+0.119
etchup+0.119
mer+0.114
Neg logit contributions
 herself-0.107
isy-0.096
 Her-0.095
 solve-0.095
 Whatever-0.095
Token and
Token ID290
Feature activation+1.31e-02

Pos logit contributions
 cho+0.026
idd+0.025
 hyd+0.024
etchup+0.024
mer+0.023
Neg logit contributions
 herself-0.024
isy-0.022
 solve-0.022
 Whatever-0.022
 seem-0.022
Token said
Token ID531
Feature activation+4.74e-02

Pos logit contributions
 cho+0.021
idd+0.020
 hyd+0.019
etchup+0.019
 ge+0.018
Neg logit contributions
 herself-0.024
 Whatever-0.022
 solve-0.022
 seem-0.022
isy-0.022
Token,
Token ID11
Feature activation+2.61e-02

Pos logit contributions
 cho+0.071
idd+0.067
 hyd+0.062
etchup+0.062
 ge+0.058
Neg logit contributions
 herself-0.090
isy-0.084
 Whatever-0.083
 solve-0.083
 Her-0.081
Token "
Token ID366
Feature activation+4.48e-02

Pos logit contributions
 cho+0.047
idd+0.043
 hyd+0.042
etchup+0.040
 ge+0.038
Neg logit contributions
 herself-0.042
 solve-0.039
isy-0.038
 Whatever-0.037
 seem-0.037
TokenYou
Token ID1639
Feature activation+9.93e-03

Pos logit contributions
 cho+0.076
idd+0.074
etchup+0.070
 hyd+0.069
mer+0.066
Neg logit contributions
 herself-0.075
 solve-0.071
isy-0.069
 aloud-0.069
 seem-0.069
Token are
Token ID389
Feature activation+2.23e-02

Pos logit contributions
 cho+0.018
idd+0.018
 hyd+0.017
etchup+0.016
 ge+0.015
Neg logit contributions
 herself-0.015
isy-0.014
 aloud-0.014
 Her-0.014
 Whatever-0.014
Token so
Token ID523
Feature activation-3.96e-02

Pos logit contributions
 cho+0.043
idd+0.042
 hyd+0.040
etchup+0.039
 ge+0.037
Neg logit contributions
 herself-0.030
isy-0.030
 aloud-0.029
 Whatever-0.029
 Her-0.028
Token creative
Token ID7325
Feature activation-3.62e-02

Pos logit contributions
 herself+0.050
 Whatever+0.045
 aloud+0.045
isy+0.043
 Her+0.043
Neg logit contributions
idd-0.087
 cho-0.086
 hyd-0.081
etchup-0.080
mer-0.076
Token and
Token ID290
Feature activation-2.26e-02

Pos logit contributions
 herself+0.053
 about+0.050
isy+0.045
 aloud+0.044
 sure+0.043
Neg logit contributions
idd-0.073
 canoe-0.072
 cho-0.071
 hyd-0.070
 ge-0.070
Token he
Token ID339
Feature activation+1.79e-02

Pos logit contributions
idd+0.054
 cho+0.053
 hyd+0.051
etchup+0.050
mer+0.049
Neg logit contributions
 herself-0.052
 Her-0.048
 solve-0.047
 Whatever-0.046
 seem-0.046
Token decided
Token ID3066
Feature activation-1.96e-02

Pos logit contributions
 cho+0.030
idd+0.030
 hyd+0.028
etchup+0.028
mer+0.027
Neg logit contributions
 herself-0.031
 solve-0.028
 Her-0.028
isy-0.028
 read-0.027
Token to
Token ID284
Feature activation-2.70e-02

Pos logit contributions
 herself+0.029
 solve+0.026
 Her+0.025
 seem+0.025
 aloud+0.025
Neg logit contributions
idd-0.040
 cho-0.038
etchup-0.037
 hyd-0.037
mer-0.036
Token climb
Token ID12080
Feature activation-3.26e-02

Pos logit contributions
 herself+0.031
isy+0.030
 Whatever+0.028
 aloud+0.027
 Her+0.027
Neg logit contributions
 cho-0.057
idd-0.055
 hyd-0.055
 canoe-0.054
 ge-0.054
Token up
Token ID510
Feature activation-4.85e-02

Pos logit contributions
 herself+0.036
isy+0.031
 Her+0.031
 solve+0.030
 Whatever+0.030
Neg logit contributions
 cho-0.075
idd-0.074
 hyd-0.071
etchup-0.071
mer-0.068
Token the
Token ID262
Feature activation+2.47e-02

Pos logit contributions
 herself+0.056
 solve+0.047
 Her+0.047
 seem+0.046
isy+0.046
Neg logit contributions
idd-0.110
 cho-0.110
 hyd-0.104
etchup-0.104
mer-0.100
Token tallest
Token ID38760
Feature activation-2.66e-02

Pos logit contributions
 cho+0.035
idd+0.034
 hyd+0.032
etchup+0.031
 canoe+0.030
Neg logit contributions
 herself-0.049
isy-0.045
 Her-0.045
 Whatever-0.045
 solve-0.045
Token tree
Token ID5509
Feature activation+1.57e-02

Pos logit contributions
 herself+0.044
 solve+0.041
isy+0.040
 read+0.039
 seem+0.039
Neg logit contributions
idd-0.047
 cho-0.046
 hyd-0.044
etchup-0.043
boats-0.040
Token in
Token ID287
Feature activation-4.04e-03

Pos logit contributions
idd+0.034
 cho+0.034
etchup+0.032
 hyd+0.032
 ge+0.031
Neg logit contributions
 herself-0.020
 solve-0.017
 about-0.017
 seem-0.017
 aloud-0.017
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
 herself+0.006
 solve+0.005
 seem+0.005
 Her+0.005
 Whatever+0.005
Neg logit contributions
 cho-0.008
idd-0.008
 hyd-0.008
etchup-0.008
mer-0.008
Token forest
Token ID8222
Feature activation-4.50e-02

Pos logit contributions
 cho+0.027
idd+0.025
 hyd+0.024
etchup+0.023
 canoe+0.022
Neg logit contributions
 herself-0.047
isy-0.045
 aloud-0.044
 Whatever-0.044
 Her-0.044
Token and
Token ID290
Feature activation-1.12e-04

Pos logit contributions
idd+0.048
etchup+0.047
 cho+0.044
Uh+0.044
 ge+0.043
Neg logit contributions
 about-0.022
 herself-0.021
 solve-0.020
 seem-0.019
 read-0.019
Token the
Token ID262
Feature activation+4.25e-02

Pos logit contributions
 herself+0.000
 read+0.000
 solve+0.000
 how+0.000
 about+0.000
Neg logit contributions
idd-0.000
etchup-0.000
 cho-0.000
iren-0.000
mer-0.000
Token truck
Token ID7779
Feature activation+7.24e-02

Pos logit contributions
idd+0.069
etchup+0.068
enn+0.067
 cho+0.063
rim+0.059
Neg logit contributions
 solve-0.070
isy-0.066
 herself-0.064
 read-0.064
 Whatever-0.063
Token was
Token ID373
Feature activation+3.29e-02

Pos logit contributions
idd+0.149
etchup+0.148
 cho+0.133
 hyd+0.132
rim+0.130
Neg logit contributions
 herself-0.095
 about-0.093
 solve-0.092
 read-0.091
 aloud-0.088
Token happy
Token ID3772
Feature activation-3.89e-02

Pos logit contributions
idd+0.077
etchup+0.075
mer+0.074
crow+0.072
 cho+0.071
Neg logit contributions
 herself-0.036
 about-0.034
 sure-0.033
 read-0.029
 how-0.028
Token to
Token ID284
Feature activation+3.94e-02

Pos logit contributions
 herself+0.051
 about+0.048
 how+0.043
 aloud+0.043
 Her+0.041
Neg logit contributions
idd-0.087
 cho-0.083
etchup-0.082
 hyd-0.079
 canoe-0.079
Token be
Token ID307
Feature activation+2.58e-02

Pos logit contributions
idd+0.095
 cho+0.094
etchup+0.092
 hyd+0.088
mer+0.086
Neg logit contributions
 herself-0.042
 solve-0.036
 seem-0.035
 Her-0.034
 read-0.034
Token safe
Token ID3338
Feature activation+2.32e-02

Pos logit contributions
etchup+0.058
Uh+0.057
idd+0.056
ho+0.055
 cho+0.055
Neg logit contributions
 herself-0.029
 sure-0.028
 about-0.026
 solve-0.025
 read-0.023
Token.
Token ID13
Feature activation-5.51e-03

Pos logit contributions
idd+0.065
Uh+0.064
etchup+0.064
 cho+0.062
iren+0.062
Neg logit contributions
 herself-0.020
 about-0.018
 how-0.013
 sure-0.012
 aloud-0.011
Token
Token ID198
Feature activation+3.32e-04

Pos logit contributions
 herself+0.006
 Her+0.005
 Whatever+0.005
isy+0.005
 solve+0.005
Neg logit contributions
 cho-0.013
idd-0.012
 hyd-0.012
etchup-0.012
mer-0.012
Token
Token ID198
Feature activation-1.46e-03

Pos logit contributions
 cho+0.001
idd+0.001
etchup+0.001
 canoe+0.001
 hyd+0.001
Neg logit contributions
 herself-0.000
 seem-0.000
 solve-0.000
 Whatever-0.000
 Her-0.000
Token Her
Token ID2332
Feature activation-4.83e-02

Pos logit contributions
 herself+0.014
isy+0.014
 solve+0.013
 Whatever+0.012
 seem+0.012
Neg logit contributions
 cho-0.024
idd-0.024
etchup-0.023
 hyd-0.023
 ge-0.022
Token mom
Token ID1995
Feature activation+6.29e-02

Pos logit contributions
 herself+0.079
 Whatever+0.073
 seem+0.073
isy+0.072
 solve+0.071
Neg logit contributions
 cho-0.084
idd-0.083
 hyd-0.079
etchup-0.078
 ge-0.074
Token put
Token ID1234
Feature activation-9.03e-03

Pos logit contributions
 cho+0.117
idd+0.116
 hyd+0.111
etchup+0.109
 canoe+0.108
Neg logit contributions
 herself-0.096
isy-0.089
 Her-0.088
 read-0.088
 solve-0.086
Token a
Token ID257
Feature activation-1.91e-02

Pos logit contributions
 herself+0.009
 Her+0.007
isy+0.006
 aloud+0.006
 read+0.006
Neg logit contributions
idd-0.023
 cho-0.023
 hyd-0.022
mer-0.021
 canoe-0.021
Token plate
Token ID7480
Feature activation+9.15e-03

Pos logit contributions
 solve+0.023
isy+0.023
 read+0.023
 write+0.022
 herself+0.021
Neg logit contributions
idd-0.040
 cho-0.038
mer-0.037
crow-0.036
boats-0.036
Token of
Token ID286
Feature activation+1.96e-02

Pos logit contributions
idd+0.019
 cho+0.019
etchup+0.018
 ge+0.018
mer+0.018
Neg logit contributions
 herself-0.013
 read-0.011
 about-0.011
isy-0.011
 aloud-0.011
Token hot
Token ID3024
Feature activation+5.82e-02

Pos logit contributions
idd+0.038
 cho+0.037
mer+0.034
 hyd+0.034
etchup+0.033
Neg logit contributions
 herself-0.030
isy-0.027
 Her-0.027
 solve-0.026
 read-0.026
Token food
Token ID2057
Feature activation+1.42e-02

Pos logit contributions
 cho+0.124
idd+0.124
boats+0.118
mer+0.118
 canoe+0.114
Neg logit contributions
 herself-0.075
 read-0.066
 about-0.063
 wear-0.063
 Her-0.062
Token on
Token ID319
Feature activation-9.86e-05

Pos logit contributions
idd+0.034
 cho+0.033
 ge+0.030
etchup+0.030
 canoe+0.030
Neg logit contributions
 herself-0.016
 about-0.015
 read-0.014
 wear-0.013
 solve-0.013
Token the
Token ID262
Feature activation-6.95e-03

Pos logit contributions
 herself+0.000
isy+0.000
 Whatever+0.000
 solve+0.000
 Her+0.000
Neg logit contributions
 cho-0.000
idd-0.000
 hyd-0.000
etchup-0.000
 ge-0.000
Token table
Token ID3084
Feature activation-2.58e-02

Pos logit contributions
 herself+0.011
isy+0.010
 solve+0.009
 Her+0.009
 read+0.009
Neg logit contributions
idd-0.014
 cho-0.013
etchup-0.013
 hyd-0.012
boats-0.012
Token felt
Token ID2936
Feature activation-3.86e-02

Pos logit contributions
idd+0.080
 cho+0.079
 hyd+0.074
etchup+0.074
 canoe+0.069
Neg logit contributions
 herself-0.076
 solve-0.069
 seem-0.069
 Whatever-0.068
 Her-0.068
Token a
Token ID257
Feature activation+3.14e-03

Pos logit contributions
 herself+0.053
 solve+0.047
 Her+0.046
isy+0.046
 Whatever+0.046
Neg logit contributions
idd-0.078
 cho-0.078
 hyd-0.073
etchup-0.073
mer-0.070
Token little
Token ID1310
Feature activation+3.54e-03

Pos logit contributions
idd+0.007
 cho+0.006
etchup+0.006
 hyd+0.006
boats+0.006
Neg logit contributions
 herself-0.004
 solve-0.004
isy-0.004
 seem-0.004
 Whatever-0.004
Token dizz
Token ID29527
Feature activation+6.43e-02

Pos logit contributions
idd+0.008
 cho+0.007
rim+0.007
etchup+0.007
enn+0.007
Neg logit contributions
 seem-0.005
 herself-0.004
 understand-0.004
 solve-0.004
 about-0.004
Tokeny
Token ID88
Feature activation-2.67e-04

Pos logit contributions
 cho+0.113
idd+0.110
etchup+0.106
 hyd+0.104
 canoe+0.104
Neg logit contributions
 herself-0.104
 seem-0.098
 about-0.095
 Her-0.095
isy-0.095
Token from
Token ID422
Feature activation+1.63e-02

Pos logit contributions
 herself+0.000
 solve+0.000
 seem+0.000
isy+0.000
 about+0.000
Neg logit contributions
idd-0.001
 cho-0.001
etchup-0.001
 hyd-0.001
 canoe-0.000
Token all
Token ID477
Feature activation+3.31e-02

Pos logit contributions
 cho+0.032
idd+0.032
 hyd+0.030
etchup+0.030
mer+0.029
Neg logit contributions
 herself-0.024
isy-0.021
 solve-0.021
 Her-0.021
 Whatever-0.021
Token the
Token ID262
Feature activation+6.62e-03

Pos logit contributions
idd+0.074
 cho+0.072
 hyd+0.069
etchup+0.069
 canoe+0.065
Neg logit contributions
 herself-0.044
 about-0.038
 solve-0.037
 seem-0.037
 Whatever-0.035
Token excitement
Token ID14067
Feature activation-9.63e-03

Pos logit contributions
idd+0.015
 cho+0.014
 hyd+0.014
etchup+0.014
mer+0.013
Neg logit contributions
 herself-0.008
 solve-0.007
isy-0.007
 Whatever-0.007
 seem-0.007
Token,
Token ID11
Feature activation+7.08e-03

Pos logit contributions
 herself+0.014
 seem+0.012
 about+0.012
 solve+0.012
 Her+0.012
Neg logit contributions
idd-0.020
 cho-0.019
etchup-0.019
 hyd-0.018
 canoe-0.018
Token he
Token ID339
Feature activation+4.95e-02

Pos logit contributions
idd+0.015
 cho+0.015
etchup+0.014
 hyd+0.014
mer+0.014
Neg logit contributions
 herself-0.010
 solve-0.008
 seem-0.008
 Whatever-0.008
 Her-0.008
Token named
Token ID3706
Feature activation+5.18e-02

Pos logit contributions
 cho+0.004
idd+0.004
boats+0.004
 hyd+0.004
 canoe+0.004
Neg logit contributions
 herself-0.004
 Her-0.004
 read-0.004
 solve-0.003
 about-0.003
Token Lily
Token ID20037
Feature activation-3.22e-02

Pos logit contributions
 cho+0.089
idd+0.085
etchup+0.080
 hyd+0.079
 ge+0.077
Neg logit contributions
isy-0.078
 closely-0.074
 Whatever-0.074
 aloud-0.073
 herself-0.073
Token in
Token ID287
Feature activation-9.62e-03

Pos logit contributions
 herself+0.050
isy+0.048
 Whatever+0.046
 solve+0.045
 seem+0.045
Neg logit contributions
 cho-0.058
idd-0.057
 hyd-0.054
etchup-0.054
mer-0.051
Token the
Token ID262
Feature activation+2.22e-02

Pos logit contributions
isy+0.015
ust+0.014
 herself+0.013
 aloud+0.013
aces+0.013
Neg logit contributions
 cho-0.017
etchup-0.017
idd-0.016
 ge-0.015
 hyd-0.015
Token park
Token ID3952
Feature activation-2.62e-02

Pos logit contributions
 cho+0.036
idd+0.036
 hyd+0.033
etchup+0.033
mer+0.032
Neg logit contributions
 herself-0.040
isy-0.036
 Her-0.036
 solve-0.036
 Whatever-0.036
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
 herself+0.030
 Her+0.028
 read+0.026
 about+0.025
 solve+0.025
Neg logit contributions
 cho-0.062
idd-0.060
etchup-0.058
 canoe-0.057
 hyd-0.057
Token
Token ID220
Feature activation-2.28e-02

Pos logit contributions
isy+0.019
 herself+0.019
 solve+0.017
 aloud+0.017
ust+0.017
Neg logit contributions
 cho-0.026
idd-0.025
etchup-0.024
 hyd-0.023
mer-0.022
Token
Token ID198
Feature activation-5.85e-03

Pos logit contributions
isy+0.038
 solve+0.036
 herself+0.036
 read+0.035
icity+0.034
Neg logit contributions
idd-0.037
 cho-0.036
etchup-0.034
enn-0.033
 hyd-0.031
Token
Token ID198
Feature activation-8.07e-03

Pos logit contributions
 herself+0.007
 Her+0.006
isy+0.006
 solve+0.006
 Whatever+0.006
Neg logit contributions
 cho-0.013
idd-0.013
 hyd-0.012
etchup-0.012
mer-0.012
TokenL
Token ID43
Feature activation-1.63e-02

Pos logit contributions
 herself+0.013
 Her+0.012
 solve+0.012
isy+0.012
 Whatever+0.012
Neg logit contributions
idd-0.014
 cho-0.014
 hyd-0.014
etchup-0.013
 canoe-0.013
Tokenily
Token ID813
Feature activation-3.99e-02

Pos logit contributions
 meanings+0.021
 read+0.021
nam+0.021
 Whatever+0.021
 solve+0.021
Neg logit contributions
idd-0.032
crow-0.027
auc-0.027
iren-0.026
rim-0.026
Token The
Token ID383
Feature activation+2.11e-02

Pos logit contributions
isy+0.017
ust+0.017
ieve+0.016
 sure+0.016
icity+0.016
Neg logit contributions
etchup-0.014
 cho-0.014
idd-0.013
 tub-0.012
 funnel-0.012
Token little
Token ID1310
Feature activation-1.29e-02

Pos logit contributions
 cho+0.028
idd+0.028
 hyd+0.026
etchup+0.025
 ge+0.024
Neg logit contributions
 herself-0.043
isy-0.040
 Whatever-0.040
 solve-0.040
 Her-0.040
Token girl
Token ID2576
Feature activation-1.16e-02

Pos logit contributions
 herself+0.021
 Her+0.019
 her+0.018
 read+0.018
isy+0.018
Neg logit contributions
idd-0.024
 cho-0.023
 hyd-0.023
etchup-0.022
 canoe-0.022
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
 herself+0.016
 Whatever+0.015
isy+0.015
 solve+0.014
 seem+0.014
Neg logit contributions
 cho-0.023
idd-0.023
 hyd-0.021
etchup-0.021
mer-0.021
Token named
Token ID3706
Feature activation+4.35e-02

Pos logit contributions
 cho+0.031
idd+0.030
 hyd+0.029
etchup+0.029
 ge+0.028
Neg logit contributions
 herself-0.017
isy-0.015
 Whatever-0.015
 solve-0.015
 Her-0.015
Token Emma
Token ID18966
Feature activation-5.29e-02

Pos logit contributions
 cho+0.067
 ge+0.056
crow+0.055
 hyd+0.054
idd+0.054
Neg logit contributions
isy-0.074
aces-0.072
ust-0.072
oll-0.071
 aloud-0.068
Token,
Token ID11
Feature activation+1.67e-02

Pos logit contributions
 herself+0.101
isy+0.093
 Whatever+0.092
 solve+0.092
 Her+0.092
Neg logit contributions
 cho-0.079
idd-0.077
 hyd-0.072
etchup-0.071
mer-0.068
Token loved
Token ID6151
Feature activation-6.45e-02

Pos logit contributions
idd+0.034
 cho+0.034
 hyd+0.032
etchup+0.032
mer+0.031
Neg logit contributions
 herself-0.023
 Her-0.020
isy-0.020
 solve-0.020
 Whatever-0.020
Token Lily
Token ID20037
Feature activation-7.79e-02

Pos logit contributions
 herself+0.106
isy+0.104
 Whatever+0.100
 seem+0.098
 solve+0.098
Neg logit contributions
 cho-0.112
idd-0.104
 hyd-0.100
etchup-0.100
 ge-0.098
Token very
Token ID845
Feature activation-3.68e-02

Pos logit contributions
 Whatever+0.120
 herself+0.120
isy+0.116
 solve+0.116
 seem+0.112
Neg logit contributions
 cho-0.131
 hyd-0.125
idd-0.123
etchup-0.119
mer-0.114
Token much
Token ID881
Feature activation-1.39e-02

Pos logit contributions
 Whatever+0.055
 herself+0.055
isy+0.051
 solve+0.050
 aloud+0.050
Neg logit contributions
 cho-0.065
idd-0.062
etchup-0.059
 hyd-0.059
mer-0.056
Token dad
Token ID9955
Feature activation+5.63e-02

Pos logit contributions
idd+0.039
 cho+0.038
etchup+0.037
 hyd+0.036
mer+0.035
Neg logit contributions
 herself-0.022
 solve-0.020
 read-0.019
 Her-0.019
isy-0.019
Token.
Token ID13
Feature activation+6.10e-03

Pos logit contributions
 cho+0.104
idd+0.101
 hyd+0.098
etchup+0.097
mer+0.093
Neg logit contributions
 herself-0.086
isy-0.078
 Her-0.077
 Whatever-0.076
 solve-0.076
Token They
Token ID1119
Feature activation+2.19e-02

Pos logit contributions
 cho+0.013
idd+0.012
etchup+0.012
 hyd+0.012
mer+0.011
Neg logit contributions
 herself-0.008
 Her-0.007
 Whatever-0.007
 solve-0.007
isy-0.007
Token wished
Token ID16555
Feature activation-1.62e-02

Pos logit contributions
 cho+0.043
idd+0.043
etchup+0.041
 hyd+0.040
mer+0.039
Neg logit contributions
 herself-0.031
 Her-0.028
isy-0.028
 solve-0.027
 Whatever-0.027
Token they
Token ID484
Feature activation+3.84e-02

Pos logit contributions
 herself+0.027
 Her+0.024
 about+0.024
 solve+0.023
 aloud+0.023
Neg logit contributions
idd-0.031
 cho-0.028
mer-0.028
etchup-0.028
 hyd-0.027
Token could
Token ID714
Feature activation-8.37e-03

Pos logit contributions
 cho+0.075
idd+0.074
 hyd+0.071
etchup+0.069
mer+0.067
Neg logit contributions
 herself-0.055
isy-0.050
 Her-0.049
 Whatever-0.049
 solve-0.049
Token play
Token ID711
Feature activation-7.42e-02

Pos logit contributions
 herself+0.009
isy+0.008
 Whatever+0.008
 Her+0.008
 solve+0.008
Neg logit contributions
 cho-0.019
idd-0.019
 hyd-0.018
etchup-0.018
 canoe-0.017
Token and
Token ID290
Feature activation-4.15e-02

Pos logit contributions
 herself+0.100
 solve+0.090
 read+0.086
 aloud+0.085
 about+0.084
Neg logit contributions
idd-0.155
 cho-0.154
etchup-0.148
 hyd-0.143
mer-0.140
Token have
Token ID423
Feature activation-2.00e-03

Pos logit contributions
 herself+0.044
isy+0.038
 Her+0.037
 Whatever+0.037
 solve+0.036
Neg logit contributions
 cho-0.096
idd-0.095
 hyd-0.092
etchup-0.091
 canoe-0.088
Token fun
Token ID1257
Feature activation-3.91e-02

Pos logit contributions
 herself+0.002
isy+0.002
 Whatever+0.002
 Her+0.002
 solve+0.002
Neg logit contributions
 cho-0.005
idd-0.004
 hyd-0.004
etchup-0.004
 canoe-0.004
Token.
Token ID13
Feature activation-6.51e-03

Pos logit contributions
 herself+0.046
 about+0.039
 solve+0.039
 read+0.038
 aloud+0.037
Neg logit contributions
 cho-0.089
idd-0.088
etchup-0.085
 hyd-0.083
 canoe-0.082
Token thrilled
Token ID20536
Feature activation-4.06e-02

Pos logit contributions
 herself+0.062
isy+0.059
 Whatever+0.058
 aloud+0.057
 solve+0.056
Neg logit contributions
idd-0.091
 cho-0.091
 hyd-0.086
etchup-0.085
mer-0.082
Token that
Token ID326
Feature activation+4.54e-02

Pos logit contributions
 herself+0.058
isy+0.050
 Her+0.050
 solve+0.049
 aloud+0.049
Neg logit contributions
idd-0.083
 cho-0.083
etchup-0.078
 hyd-0.078
 canoe-0.076
Token she
Token ID673
Feature activation+2.57e-02

Pos logit contributions
 cho+0.099
 hyd+0.093
idd+0.093
etchup+0.091
boats+0.088
Neg logit contributions
 herself-0.049
isy-0.046
 Whatever-0.045
 solve-0.044
 Her-0.043
Token could
Token ID714
Feature activation-2.42e-03

Pos logit contributions
 cho+0.053
idd+0.053
etchup+0.050
 hyd+0.050
 canoe+0.048
Neg logit contributions
 herself-0.034
isy-0.031
 solve-0.030
 Her-0.030
 read-0.029
Token make
Token ID787
Feature activation-1.86e-02

Pos logit contributions
 Whatever+0.003
isy+0.003
 herself+0.003
ust+0.003
 aloud+0.002
Neg logit contributions
 cho-0.005
 hyd-0.005
idd-0.005
 canoe-0.004
mer-0.004
Token up
Token ID510
Feature activation-6.71e-02

Pos logit contributions
 herself+0.021
isy+0.020
 Whatever+0.019
 Her+0.019
 solve+0.018
Neg logit contributions
 cho-0.041
idd-0.040
 hyd-0.038
etchup-0.038
 ge-0.036
Token this
Token ID428
Feature activation-1.86e-02

Pos logit contributions
 herself+0.073
 solve+0.060
 Her+0.059
isy+0.058
 Whatever+0.058
Neg logit contributions
idd-0.160
 cho-0.158
etchup-0.151
 hyd-0.151
mer-0.147
Token story
Token ID1621
Feature activation-7.14e-03

Pos logit contributions
 herself+0.023
isy+0.021
 Her+0.021
 Whatever+0.021
 seem+0.020
Neg logit contributions
 cho-0.038
idd-0.038
 hyd-0.036
etchup-0.035
mer-0.035
Token with
Token ID351
Feature activation-2.15e-03

Pos logit contributions
 herself+0.011
 Whatever+0.010
isy+0.010
 Her+0.010
 solve+0.010
Neg logit contributions
 cho-0.013
idd-0.012
 hyd-0.012
mer-0.011
etchup-0.011
Token only
Token ID691
Feature activation+6.01e-02

Pos logit contributions
 herself+0.002
isy+0.002
 Whatever+0.002
 solve+0.002
 Her+0.002
Neg logit contributions
 cho-0.005
idd-0.005
etchup-0.005
 hyd-0.005
 canoe-0.005
Token a
Token ID257
Feature activation-1.27e-02

Pos logit contributions
 cho+0.126
idd+0.126
etchup+0.119
 hyd+0.119
mer+0.115
Neg logit contributions
 herself-0.079
 solve-0.069
 Her-0.068
isy-0.068
 Whatever-0.068
Tokenak
Token ID461
Feature activation-3.10e-03

Pos logit contributions
 cho+0.042
 ge+0.041
 men+0.039
 hawk+0.038
 current+0.038
Neg logit contributions
ust-0.030
 herself-0.029
 solve-0.029
isy-0.029
aces-0.028
Token by
Token ID416
Feature activation+1.98e-02

Pos logit contributions
 herself+0.004
 about+0.004
 aloud+0.003
 solve+0.003
 how+0.003
Neg logit contributions
idd-0.007
 cho-0.007
etchup-0.007
 hyd-0.006
 ge-0.006
Token themselves
Token ID2405
Feature activation-2.17e-02

Pos logit contributions
idd+0.040
 cho+0.040
etchup+0.038
 hyd+0.036
mer+0.036
Neg logit contributions
 herself-0.030
 solve-0.025
 Her-0.024
isy-0.024
 Whatever-0.024
Token.
Token ID13
Feature activation+4.20e-03

Pos logit contributions
 herself+0.030
 solve+0.026
isy+0.026
 Her+0.026
 Whatever+0.025
Neg logit contributions
 cho-0.045
idd-0.044
etchup-0.042
 hyd-0.042
mer-0.040
Token They
Token ID1119
Feature activation+1.55e-02

Pos logit contributions
 cho+0.008
idd+0.008
 hyd+0.008
etchup+0.008
mer+0.008
Neg logit contributions
 herself-0.006
 Her-0.005
 Whatever-0.005
 solve-0.005
 seem-0.005
Token thought
Token ID1807
Feature activation-3.11e-02

Pos logit contributions
idd+0.033
 cho+0.033
etchup+0.032
 hyd+0.031
mer+0.030
Neg logit contributions
 read-0.019
 herself-0.018
 solve-0.017
 wear-0.017
 seem-0.016
Token mom
Token ID1995
Feature activation+5.58e-02

Pos logit contributions
 herself+0.045
isy+0.040
 solve+0.040
 Her+0.040
 Whatever+0.039
Neg logit contributions
idd-0.061
 cho-0.061
 hyd-0.058
etchup-0.057
mer-0.055
Token would
Token ID561
Feature activation+2.95e-02

Pos logit contributions
 cho+0.117
idd+0.116
 hyd+0.111
etchup+0.110
 canoe+0.106
Neg logit contributions
 herself-0.073
isy-0.064
 Her-0.064
 solve-0.064
 Whatever-0.063
Token not
Token ID407
Feature activation+3.34e-02

Pos logit contributions
idd+0.073
 cho+0.070
etchup+0.069
mer+0.066
 hyd+0.066
Neg logit contributions
 herself-0.032
 solve-0.027
 seem-0.027
 read-0.026
isy-0.025
Token notice
Token ID4003
Feature activation-9.55e-04

Pos logit contributions
idd+0.085
 cho+0.083
etchup+0.081
 hyd+0.078
mer+0.076
Neg logit contributions
 herself-0.032
 seem-0.030
 solve-0.029
 read-0.028
 wear-0.026
Token.
Token ID13
Feature activation+3.87e-03

Pos logit contributions
 herself+0.001
 about+0.001
 how+0.001
 Her+0.001
 closely+0.001
Neg logit contributions
idd-0.002
 cho-0.002
etchup-0.002
 hyd-0.002
 canoe-0.002
Token down
Token ID866
Feature activation-6.68e-03

Pos logit contributions
idd+0.018
etchup+0.017
 hyd+0.017
 cho+0.016
 canoe+0.016
Neg logit contributions
 herself-0.010
 closely-0.008
 about-0.008
 aloud-0.008
 her-0.008
Token the
Token ID262
Feature activation+2.16e-02

Pos logit contributions
 herself+0.007
 her+0.006
 about+0.005
 how+0.005
 aloud+0.004
Neg logit contributions
idd-0.018
etchup-0.017
 hyd-0.017
rim-0.017
 cho-0.017
Token street
Token ID4675
Feature activation+1.11e-02

Pos logit contributions
idd+0.036
 cho+0.035
etchup+0.034
 hyd+0.032
 ge+0.031
Neg logit contributions
 herself-0.037
 solve-0.035
isy-0.034
 read-0.034
 closely-0.034
Token and
Token ID290
Feature activation-2.67e-03

Pos logit contributions
idd+0.026
etchup+0.026
 cho+0.025
 hyd+0.024
rim+0.023
Neg logit contributions
 herself-0.013
 about-0.012
 closely-0.011
 read-0.011
 solve-0.011
Token people
Token ID661
Feature activation+2.81e-02

Pos logit contributions
 herself+0.004
 read+0.003
 Her+0.003
 seem+0.003
 about+0.003
Neg logit contributions
idd-0.006
 cho-0.005
etchup-0.005
 hyd-0.005
mer-0.005
Token wearing
Token ID5762
Feature activation-7.57e-02

Pos logit contributions
 cho+0.053
idd+0.053
etchup+0.051
 hyd+0.051
 canoe+0.048
Neg logit contributions
 herself-0.042
 seem-0.037
 about-0.037
isy-0.037
 aloud-0.037
Token bright
Token ID6016
Feature activation-1.57e-02

Pos logit contributions
 herself+0.089
 Her+0.073
 aloud+0.073
 solve+0.072
isy+0.072
Neg logit contributions
idd-0.173
 cho-0.170
etchup-0.164
 hyd-0.163
mer-0.158
Token costumes
Token ID24138
Feature activation-5.33e-02

Pos logit contributions
 herself+0.020
 Her+0.018
 about+0.018
 wear+0.017
 closely+0.017
Neg logit contributions
idd-0.033
 cho-0.033
 hyd-0.033
etchup-0.032
enn-0.031
Token.
Token ID13
Feature activation-9.96e-03

Pos logit contributions
 herself+0.068
 aloud+0.057
 about+0.057
isy+0.055
 Her+0.054
Neg logit contributions
idd-0.119
 cho-0.118
etchup-0.115
 hyd-0.111
 ge-0.109
Token It
Token ID632
Feature activation-2.35e-02

Pos logit contributions
 herself+0.011
isy+0.010
 solve+0.010
 Whatever+0.009
 seem+0.009
Neg logit contributions
 cho-0.023
idd-0.022
etchup-0.021
 hyd-0.021
mer-0.020
Token was
Token ID373
Feature activation+1.28e-02

Pos logit contributions
 herself+0.028
 seem+0.025
 read+0.024
 Her+0.024
isy+0.024
Neg logit contributions
idd-0.053
etchup-0.051
 cho-0.051
 hyd-0.049
 canoe-0.047
Token can
Token ID460
Feature activation-2.29e-03

Pos logit contributions
 cho+0.014
idd+0.014
 hyd+0.013
etchup+0.013
mer+0.012
Neg logit contributions
 herself-0.011
isy-0.010
 Her-0.010
 Whatever-0.010
 solve-0.010
Token.
Token ID13
Feature activation-2.09e-02

Pos logit contributions
 herself+0.003
isy+0.003
 Her+0.003
 solve+0.003
 Whatever+0.003
Neg logit contributions
 cho-0.005
idd-0.005
 hyd-0.004
etchup-0.004
mer-0.004
Token But
Token ID887
Feature activation+3.90e-03

Pos logit contributions
 herself+0.030
 Her+0.027
 Whatever+0.027
 solve+0.026
 seem+0.026
Neg logit contributions
 cho-0.041
idd-0.040
 hyd-0.039
etchup-0.038
mer-0.037
Token it
Token ID340
Feature activation-2.27e-02

Pos logit contributions
idd+0.008
 cho+0.008
etchup+0.008
 hyd+0.008
mer+0.007
Neg logit contributions
 herself-0.005
 solve-0.005
isy-0.004
 seem-0.004
 Whatever-0.004
Tokenâ
Token ID22940
Feature activation+6.52e-02

Pos logit contributions
 herself+0.027
 seem+0.025
 solve+0.024
isy+0.024
 Her+0.024
Neg logit contributions
idd-0.049
 cho-0.049
 hyd-0.046
etchup-0.046
 canoe-0.045
Token
Token ID26391
Feature activation-7.57e-02

Pos logit contributions
 cho+0.145
idd+0.144
etchup+0.136
 hyd+0.136
mer+0.131
Neg logit contributions
 herself-0.077
isy-0.067
 Her-0.066
 solve-0.066
 Whatever-0.066
Token
Token ID8151
Feature activation-1.87e-02

Pos logit contributions
 herself+0.098
 Whatever+0.094
 Her+0.093
 seem+0.093
 read+0.091
Neg logit contributions
 cho-0.148
idd-0.147
 hyd-0.146
mer-0.142
boats-0.141
Tokens
Token ID82
Feature activation+1.42e-02

Pos logit contributions
 herself+0.031
 Her+0.030
 Whatever+0.028
 seem+0.028
 about+0.028
Neg logit contributions
 hyd-0.032
 cho-0.032
idd-0.031
etchup-0.029
 canoe-0.029
Token getting
Token ID1972
Feature activation-2.78e-02

Pos logit contributions
idd+0.030
 cho+0.030
etchup+0.028
 hyd+0.028
mer+0.028
Neg logit contributions
 herself-0.018
 solve-0.016
isy-0.016
 seem-0.016
 Whatever-0.016
Token late
Token ID2739
Feature activation+4.47e-02

Pos logit contributions
 herself+0.038
 how+0.031
 her+0.030
 about+0.030
 sure+0.030
Neg logit contributions
idd-0.059
mer-0.058
 cho-0.055
 hyd-0.054
 ge-0.054
Token,
Token ID11
Feature activation+7.18e-03

Pos logit contributions
idd+0.102
 cho+0.097
etchup+0.097
 hyd+0.092
 canoe+0.090
Neg logit contributions
 herself-0.061
 about-0.052
 aloud-0.049
 closely-0.047
 solve-0.046
Token nervous
Token ID10927
Feature activation-1.00e-02

Pos logit contributions
 herself+0.064
 Her+0.056
 seem+0.056
 solve+0.056
isy+0.055
Neg logit contributions
 cho-0.079
idd-0.078
etchup-0.074
 hyd-0.074
mer-0.071
Token he
Token ID339
Feature activation+4.12e-02

Pos logit contributions
 herself+0.014
 about+0.012
 seem+0.012
 solve+0.011
isy+0.011
Neg logit contributions
idd-0.022
 cho-0.021
etchup-0.020
 hyd-0.020
 canoe-0.019
Token couldn
Token ID3521
Feature activation-1.66e-02

Pos logit contributions
idd+0.087
 cho+0.087
etchup+0.083
 hyd+0.082
 canoe+0.078
Neg logit contributions
 herself-0.052
isy-0.046
 solve-0.046
 Her-0.046
 seem-0.045
Token't
Token ID470
Feature activation-4.85e-03

Pos logit contributions
 herself+0.020
isy+0.018
 Whatever+0.017
 solve+0.017
 Her+0.017
Neg logit contributions
 cho-0.036
idd-0.036
 hyd-0.034
etchup-0.034
 canoe-0.033
Token bring
Token ID2222
Feature activation-2.44e-02

Pos logit contributions
 novels+0.008
 meanings+0.007
 math+0.007
 Whatever+0.007
aces+0.007
Neg logit contributions
 canoe-0.007
mer-0.007
 hyd-0.007
 cho-0.007
 ge-0.007
Token himself
Token ID2241
Feature activation-7.95e-02

Pos logit contributions
isy+0.033
 herself+0.033
 Her+0.032
 Whatever+0.031
 solve+0.031
Neg logit contributions
 cho-0.048
idd-0.044
 hyd-0.044
 canoe-0.043
etchup-0.043
Token to
Token ID284
Feature activation-1.11e-02

Pos logit contributions
 herself+0.127
isy+0.117
 Her+0.116
 Whatever+0.115
 solve+0.115
Neg logit contributions
 cho-0.140
idd-0.137
 hyd-0.131
etchup-0.129
 canoe-0.124
Token speak
Token ID2740
Feature activation-4.19e-02

Pos logit contributions
 herself+0.015
isy+0.014
 Her+0.014
 Whatever+0.014
 aloud+0.013
Neg logit contributions
 cho-0.022
idd-0.021
 hyd-0.020
etchup-0.020
 canoe-0.020
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
 herself+0.060
isy+0.053
 Her+0.053
 solve+0.053
 Whatever+0.052
Neg logit contributions
 cho-0.082
idd-0.081
 hyd-0.077
etchup-0.077
mer-0.074
Token
Token ID220
Feature activation-1.92e-02

Pos logit contributions
isy+0.020
 herself+0.019
 solve+0.018
 seem+0.017
 read+0.017
Neg logit contributions
idd-0.023
 cho-0.023
etchup-0.022
 canoe-0.022
 hyd-0.022
Token
Token ID198
Feature activation-7.13e-04

Pos logit contributions
isy+0.030
 herself+0.029
 read+0.028
 solve+0.028
 blush+0.028
Neg logit contributions
idd-0.032
 cho-0.032
etchup-0.030
enn-0.029
 hyd-0.028
Token One
Token ID1881
Feature activation+7.61e-03

Pos logit contributions
 herself+0.012
isy+0.011
 solve+0.011
 seem+0.011
 aloud+0.010
Neg logit contributions
 cho-0.016
idd-0.016
etchup-0.015
 hyd-0.015
 canoe-0.014
Token day
Token ID1110
Feature activation+3.52e-02

Pos logit contributions
 cho+0.014
idd+0.013
 hyd+0.013
etchup+0.013
mer+0.012
Neg logit contributions
 herself-0.012
isy-0.011
 solve-0.011
 Whatever-0.011
 Her-0.011
Token,
Token ID11
Feature activation-1.26e-02

Pos logit contributions
 cho+0.059
idd+0.058
 hyd+0.055
etchup+0.055
mer+0.052
Neg logit contributions
 herself-0.061
 Her-0.055
 solve-0.055
isy-0.055
 Whatever-0.055
Token Lily
Token ID20037
Feature activation-4.04e-02

Pos logit contributions
 herself+0.019
isy+0.017
 Her+0.017
 Whatever+0.017
 seem+0.016
Neg logit contributions
 cho-0.023
idd-0.022
 hyd-0.021
etchup-0.021
mer-0.021
Token was
Token ID373
Feature activation+3.86e-03

Pos logit contributions
 herself+0.052
 Her+0.047
 read+0.046
 solve+0.045
isy+0.044
Neg logit contributions
 cho-0.086
idd-0.085
etchup-0.081
 hyd-0.080
boats-0.078
Token playing
Token ID2712
Feature activation-7.76e-02

Pos logit contributions
 cho+0.008
idd+0.008
 hyd+0.007
etchup+0.007
mer+0.007
Neg logit contributions
 herself-0.005
 Her-0.005
isy-0.005
 solve-0.005
 Whatever-0.005
Token with
Token ID351
Feature activation-2.32e-02

Pos logit contributions
 herself+0.112
 Whatever+0.107
isy+0.107
 Her+0.106
 seem+0.106
Neg logit contributions
 cho-0.145
idd-0.142
 hyd-0.135
etchup-0.132
mer-0.128
Token her
Token ID607
Feature activation-5.60e-02

Pos logit contributions
isy+0.021
ust+0.020
 Whatever+0.019
 seem+0.018
 solve+0.017
Neg logit contributions
 cho-0.056
idd-0.055
etchup-0.054
 ge-0.050
 canoe-0.050
Token toys
Token ID14958
Feature activation-5.27e-02

Pos logit contributions
 herself+0.082
 seem+0.075
 Whatever+0.075
 Her+0.074
isy+0.074
Neg logit contributions
 cho-0.107
idd-0.107
etchup-0.101
 hyd-0.100
mer-0.096
Token when
Token ID618
Feature activation-1.19e-02

Pos logit contributions
 herself+0.069
isy+0.062
 Her+0.061
 Whatever+0.061
 solve+0.060
Neg logit contributions
 cho-0.109
idd-0.108
 hyd-0.103
etchup-0.102
mer-0.099
Token she
Token ID673
Feature activation+2.43e-02

Pos logit contributions
 herself+0.012
isy+0.010
 Her+0.010
 solve+0.010
 Whatever+0.009
Neg logit contributions
 cho-0.029
idd-0.029
 hyd-0.028
etchup-0.027
mer-0.027
Token,
Token ID11
Feature activation-4.66e-03

Pos logit contributions
 hyd+0.071
idd+0.071
 cho+0.069
mer+0.067
etchup+0.066
Neg logit contributions
 herself-0.054
 Her-0.049
 about-0.049
 solve-0.049
 aloud-0.047
Token Sara
Token ID24799
Feature activation-2.10e-02

Pos logit contributions
 herself+0.007
isy+0.007
 Her+0.007
 solve+0.007
 Whatever+0.007
Neg logit contributions
 cho-0.008
idd-0.008
etchup-0.008
 hyd-0.008
mer-0.007
Token and
Token ID290
Feature activation-1.67e-02

Pos logit contributions
 herself+0.031
 read+0.029
 solve+0.028
 Her+0.028
 wear+0.026
Neg logit contributions
 cho-0.042
idd-0.040
boats-0.038
 hyd-0.038
etchup-0.038
Token Tom
Token ID4186
Feature activation+7.85e-03

Pos logit contributions
isy+0.034
 solve+0.031
 Whatever+0.031
 aloud+0.031
 closely+0.031
Neg logit contributions
 cho-0.022
 ge-0.022
etchup-0.019
 hyd-0.018
 men-0.018
Token are
Token ID389
Feature activation+2.42e-02

Pos logit contributions
 cho+0.017
idd+0.017
 hyd+0.017
etchup+0.016
mer+0.016
Neg logit contributions
 herself-0.009
isy-0.008
 Her-0.008
 Whatever-0.008
 solve-0.008
Token playing
Token ID2712
Feature activation-7.65e-02

Pos logit contributions
mer+0.056
 cho+0.055
 hyd+0.054
boats+0.054
idd+0.054
Neg logit contributions
 herself-0.028
 about-0.023
 sure-0.022
 Her-0.022
 solve-0.022
Token with
Token ID351
Feature activation-1.98e-02

Pos logit contributions
 herself+0.103
 solve+0.091
isy+0.090
 Her+0.090
 Whatever+0.089
Neg logit contributions
 cho-0.157
idd-0.156
 hyd-0.148
etchup-0.148
mer-0.142
Token yarn
Token ID21181
Feature activation-6.45e-02

Pos logit contributions
 herself+0.031
isy+0.029
 Her+0.028
 Whatever+0.028
 solve+0.028
Neg logit contributions
 cho-0.036
idd-0.036
etchup-0.034
 hyd-0.034
mer-0.032
Token in
Token ID287
Feature activation-1.37e-02

Pos logit contributions
 herself+0.082
 solve+0.070
isy+0.067
 read+0.065
 about+0.064
Neg logit contributions
idd-0.142
 cho-0.138
etchup-0.133
 hyd-0.133
 canoe-0.131
Token Sara
Token ID24799
Feature activation-2.97e-02

Pos logit contributions
 herself+0.020
 solve+0.017
 Her+0.017
 Whatever+0.017
isy+0.017
Neg logit contributions
 cho-0.027
idd-0.027
 hyd-0.025
etchup-0.025
mer-0.024
Token's
Token ID338
Feature activation+9.26e-03

Pos logit contributions
 herself+0.029
 Her+0.026
 solve+0.025
 read+0.024
isy+0.024
Neg logit contributions
 cho-0.072
idd-0.071
 hyd-0.068
etchup-0.067
 canoe-0.066