TOP ACTIVATIONS
MAX = 9.686

old man who was sun bat hing in the shed .
was a fairy or a gn ome ," Mia says .
. Lily had a magn ifying glass that she found in
you go , my d arl ings . Enjoy your lunch
the black mascara on her eyel ashes . She makes a
on dry and cozy p aj amas and sn ugg led
She had a big magn ifying glass that made things look
. They had a magn ifying glass and a small box
a flashlight and a magn ifying glass to examine their finds
a family of turtles sun bat hing in a nearby pond
and put on their p aj amas . They went to
examine it with the magn ifying glass ?" "
let out a big sn arl and the mouse knew he
and put on your p aj amas . Then we can
the necklace and the magn ifying glass and told him that
her own special slide - bat he . She
cried , dropping the magn ifying glass . "
they come out and sun bat he on the logs .
she forgot Teddy at the camps ite . She was very
in the river and sun bat he on the shore .

MIN ACTIVATIONS
MIN = -6.949

. " By e , dolphin !" Ben and
. " By e little crab !" he said
. " By e , bye , bye !"
. " By e Sally ," said Tim .
and said , " By e , sur fer ! Have
and said , " By e daddy , have a good
parents reminded him to " be have " and to play
. " By e Mom my !" she said
and said , " By e ! Stay safe and warm
" By e bye , Tommy ! See
son asked , " Where are we going today ?"
" By e bye , small girl !"
and said , " By e then ! Have a nice
. " By e , Sam and Mia .
she replied . " Let 's do it again !"
and said , " By e , it was great to
. " By e - bye , monkey !"
. " By e , goat !" Tom and
and asked , " Where are we going ?" His mom
and said , " By e Polly ! I 'll see

INTERVAL 5.527 - 9.686
CONTAINS 0.017%

on the roof and sun bat he again . <|endoftext|> Once
she could have in the <|endoftext|> Once upon a time ,
They felt sick in their tum m ies . They dropped
their unique paint mix ! <|endoftext|> Once upon a time there
She puts on her p aj amas . She hugs her

INTERVAL 1.368 - 5.527
CONTAINS 15.572%

They make you sick and sore . You need to get
L ily and Max looked for their mom and
but their favorite was a big puzzle with many pieces .
but he needed to leave a little tip for the nice
, there was a clown named Bob o . He was

INTERVAL -2.790 - 1.368
CONTAINS 82.291%

man did not give up easily . He tried to find
was especially hot and the z ebra herd was growing very
w , ow , ow ! My knee is bleeding !"
The village people used Dave 's treasure to make
. They have fun . They are happy . <|endoftext|> Sam

INTERVAL -6.949 - -2.790
CONTAINS 2.120%

and Jen . You should share and be nice ."
The moral of the story is that love is an un
them and said , " Oh , no , you went
From that day forward , the polite raven was
far away . After a few rounds , it
Token old
Token ID1468
Feature activation+5.90e-02

Pos logit contributions
 Esk+0.023
ge+0.022
 camps+0.021
rou+0.021
 polar+0.021
Neg logit contributions
Her-0.024
 Her-0.018
gen-0.018
Then-0.018
asure-0.018
Token man
Token ID582
Feature activation+1.66e+00

Pos logit contributions
ge+0.001
 Esk+0.001
 camps+0.001
rou+0.001
yt+0.001
Neg logit contributions
Her-0.001
 Her-0.001
gen-0.001
Then-0.001
 herself-0.001
Token who
Token ID508
Feature activation+2.74e+00

Pos logit contributions
ge+0.043
 Esk+0.040
rou+0.039
 camps+0.037
 upon+0.037
Neg logit contributions
Her-0.039
 Her-0.030
 herself-0.029
gen-0.029
Then-0.028
Token was
Token ID373
Feature activation-6.55e-01

Pos logit contributions
ge+0.070
 Esk+0.068
rou+0.066
 upon+0.063
grown+0.062
Neg logit contributions
Her-0.064
 Her-0.050
 herself-0.047
gen-0.046
Then-0.046
Token sun
Token ID4252
Feature activation+9.03e-01

Pos logit contributions
Her+0.016
 Her+0.013
gen+0.013
Then+0.012
asure+0.012
Neg logit contributions
 Esk-0.016
ge-0.016
rou-0.015
 camps-0.014
 upon-0.014
Tokenbat
Token ID8664
Feature activation+8.63e+00

Pos logit contributions
ge+0.022
 Esk+0.022
 camps+0.021
rou+0.020
icer+0.020
Neg logit contributions
Her-0.021
 Her-0.017
gen-0.017
 herself-0.016
Then-0.016
Tokenhing
Token ID722
Feature activation-1.84e+00

Pos logit contributions
ge+0.150
 Esk+0.142
 camps+0.127
rou+0.126
icer+0.125
Neg logit contributions
Her-0.267
 Her-0.231
gen-0.225
 herself-0.218
 counted-0.214
Token in
Token ID287
Feature activation+1.37e+00

Pos logit contributions
Her+0.045
 Her+0.038
asure+0.035
gen+0.035
Then+0.034
Neg logit contributions
ge-0.044
 Esk-0.043
 camps-0.041
rou-0.039
 polar-0.038
Token the
Token ID262
Feature activation+1.34e+00

Pos logit contributions
 Esk+0.043
ge+0.043
 camps+0.042
rou+0.040
icer+0.039
Neg logit contributions
Her-0.023
 Her-0.017
gen-0.016
 herself-0.016
asure-0.015
Token shed
Token ID14999
Feature activation+2.75e+00

Pos logit contributions
 Esk+0.031
 camps+0.030
 polar+0.028
icer+0.028
ge+0.028
Neg logit contributions
Her-0.035
 Her-0.028
gen-0.027
 herself-0.027
asure-0.027
Token.
Token ID13
Feature activation+7.94e-02

Pos logit contributions
ge+0.069
 Esk+0.068
 camps+0.064
rou+0.064
 upon+0.060
Neg logit contributions
Her-0.064
 Her-0.052
gen-0.050
 herself-0.047
asure-0.047
Token was
Token ID373
Feature activation-8.68e-01

Pos logit contributions
Her+0.045
 herself+0.035
 Her+0.034
ough+0.034
Then+0.033
Neg logit contributions
 Esk-0.038
ge-0.036
 camps-0.035
 upon-0.034
rou-0.033
Token a
Token ID257
Feature activation+3.15e+00

Pos logit contributions
Her+0.020
 Her+0.015
Then+0.015
amed+0.014
gen+0.013
Neg logit contributions
 Esk-0.024
 camps-0.022
ge-0.022
 polar-0.022
rou-0.021
Token fairy
Token ID25607
Feature activation+1.01e+00

Pos logit contributions
 Esk+0.076
 camps+0.074
ge+0.074
rou+0.070
 polar+0.069
Neg logit contributions
Her-0.076
 Her-0.063
 herself-0.061
gen-0.061
asure-0.059
Token or
Token ID393
Feature activation+5.25e-01

Pos logit contributions
 Esk+0.024
ge+0.024
 camps+0.023
rou+0.022
yt+0.021
Neg logit contributions
Her-0.024
 Her-0.020
gen-0.019
 herself-0.019
Then-0.019
Token a
Token ID257
Feature activation+3.32e+00

Pos logit contributions
 Esk+0.014
 camps+0.013
 polar+0.013
 upon+0.013
 asleep+0.012
Neg logit contributions
Her-0.011
Then-0.008
ough-0.008
amed-0.008
gen-0.008
Token gn
Token ID19967
Feature activation+8.44e+00

Pos logit contributions
 Esk+0.074
 camps+0.074
ge+0.072
 polar+0.070
rou+0.070
Neg logit contributions
Her-0.083
 Her-0.069
gen-0.068
asure-0.068
 herself-0.067
Tokenome
Token ID462
Feature activation-3.52e-02

Pos logit contributions
 Esk+0.260
ge+0.259
 camps+0.252
 upon+0.241
 polar+0.233
Neg logit contributions
Her-0.137
gen-0.104
 Her-0.102
asure-0.095
 listened-0.093
Token,"
Token ID553
Feature activation-5.09e-01

Pos logit contributions
Her+0.001
 Her+0.001
gen+0.001
 herself+0.001
asure+0.001
Neg logit contributions
 Esk-0.001
ge-0.001
 camps-0.001
rou-0.001
yt-0.001
Token Mia
Token ID32189
Feature activation-3.92e-01

Pos logit contributions
Her+0.009
gen+0.007
 Her+0.007
asure+0.007
 herself+0.006
Neg logit contributions
 Esk-0.015
ge-0.015
 camps-0.015
 upon-0.014
rou-0.014
Token says
Token ID1139
Feature activation+1.49e+00

Pos logit contributions
Her+0.010
gen+0.009
amed+0.008
ages+0.008
ough+0.008
Neg logit contributions
 Esk-0.008
 camps-0.008
ge-0.008
 upon-0.008
rou-0.007
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
ge+0.036
 Esk+0.035
rou+0.032
 camps+0.032
 upon+0.031
Neg logit contributions
Her-0.035
gen-0.029
 Her-0.029
asure-0.028
Then-0.027
Token.
Token ID13
Feature activation+5.58e-01

Pos logit contributions
ge+0.021
 camps+0.019
 upon+0.019
 Esk+0.018
yt+0.018
Neg logit contributions
Her-0.025
 Her-0.020
gen-0.019
Then-0.019
amed-0.018
Token Lily
Token ID20037
Feature activation+2.82e-01

Pos logit contributions
ge+0.021
rou+0.020
 Esk+0.019
yt+0.019
 camps+0.018
Neg logit contributions
Her-0.007
 Then-0.006
 Her-0.006
 herself-0.005
 His-0.005
Token had
Token ID550
Feature activation+1.43e-01

Pos logit contributions
 camps+0.007
ge+0.007
 Esk+0.006
 upon+0.006
yt+0.006
Neg logit contributions
Her-0.007
 Her-0.005
 herself-0.005
gen-0.005
asure-0.005
Token a
Token ID257
Feature activation+2.54e+00

Pos logit contributions
 Esk+0.004
 camps+0.004
ge+0.004
grown+0.004
 upon+0.004
Neg logit contributions
Her-0.002
 Her-0.002
gen-0.002
asure-0.002
ough-0.002
Token magn
Token ID7842
Feature activation+5.26e+00

Pos logit contributions
 Esk+0.055
ge+0.055
 camps+0.055
yt+0.051
rou+0.050
Neg logit contributions
Her-0.065
 Her-0.054
 herself-0.053
gen-0.053
asure-0.051
Tokenifying
Token ID4035
Feature activation+7.99e+00

Pos logit contributions
 Esk+0.100
ge+0.084
 camps+0.078
rou+0.076
 polar+0.076
Neg logit contributions
Her-0.155
asure-0.139
:-0.139
 Her-0.139
gen-0.135
Token glass
Token ID5405
Feature activation+1.23e+00

Pos logit contributions
 Esk+0.148
 camps+0.144
ge+0.136
icer+0.133
rou+0.128
Neg logit contributions
Her-0.235
gen-0.197
 Her-0.195
 herself-0.194
Then-0.190
Token that
Token ID326
Feature activation+5.24e+00

Pos logit contributions
ge+0.029
 Esk+0.029
 camps+0.029
rou+0.027
yt+0.027
Neg logit contributions
Her-0.030
 Her-0.024
gen-0.023
 herself-0.023
Then-0.022
Token she
Token ID673
Feature activation+5.03e+00

Pos logit contributions
 upon+0.097
cast+0.090
 Esk+0.089
 camps+0.080
yt+0.079
Neg logit contributions
Her-0.144
Then-0.120
gen-0.116
 eighteen-0.115
amed-0.114
Token found
Token ID1043
Feature activation+1.06e+00

Pos logit contributions
 Esk+0.122
ge+0.120
 camps+0.114
rou+0.112
 upon+0.109
Neg logit contributions
Her-0.120
 Her-0.095
 herself-0.088
asure-0.085
gen-0.084
Token in
Token ID287
Feature activation+8.94e-01

Pos logit contributions
 Esk+0.026
ge+0.025
 camps+0.025
rou+0.024
 upon+0.023
Neg logit contributions
Her-0.025
 Her-0.021
gen-0.019
Then-0.019
asure-0.018
Token you
Token ID345
Feature activation+1.15e+00

Pos logit contributions
 Esk+0.025
 upon+0.023
ge+0.022
 breath+0.022
 camps+0.022
Neg logit contributions
Her-0.030
gen-0.027
asure-0.027
ough-0.026
 Her-0.026
Token go
Token ID467
Feature activation+1.68e-01

Pos logit contributions
ge+0.024
 Esk+0.024
 upon+0.023
 camps+0.023
rou+0.022
Neg logit contributions
Her-0.031
 Her-0.025
gen-0.024
Then-0.024
 herself-0.023
Token,
Token ID11
Feature activation-1.80e-01

Pos logit contributions
 Esk+0.005
ge+0.005
icer+0.005
rou+0.005
yt+0.005
Neg logit contributions
Her-0.003
:-0.002
 herself-0.002
 then-0.002
 Then-0.002
Token my
Token ID616
Feature activation-6.06e-01

Pos logit contributions
Her+0.004
 Her+0.003
gen+0.003
amed+0.003
asure+0.003
Neg logit contributions
 Esk-0.005
ge-0.004
 camps-0.004
 upon-0.004
rou-0.004
Token d
Token ID288
Feature activation+1.48e+00

Pos logit contributions
Her+0.015
 Her+0.012
gen+0.012
 herself+0.011
asure+0.011
Neg logit contributions
 Esk-0.015
ge-0.014
 camps-0.014
rou-0.014
icer-0.013
Tokenarl
Token ID7063
Feature activation+7.96e+00

Pos logit contributions
ge+0.021
 Esk+0.020
rou+0.019
 camps+0.019
icer+0.017
Neg logit contributions
Her-0.051
 Her-0.044
gen-0.044
 herself-0.043
 listened-0.042
Tokenings
Token ID654
Feature activation+2.85e-01

Pos logit contributions
ge+0.148
 Esk+0.141
 camps+0.137
rou+0.130
yt+0.127
Neg logit contributions
Her-0.236
 Her-0.203
gen-0.196
 herself-0.192
 counted-0.187
Token.
Token ID13
Feature activation-7.29e-01

Pos logit contributions
 upon+0.005
ge+0.005
 camps+0.005
rou+0.005
 Esk+0.005
Neg logit contributions
Her-0.008
 Her-0.007
gen-0.006
Then-0.006
 counted-0.006
Token Enjoy
Token ID18179
Feature activation-9.27e-01

Pos logit contributions
Her+0.017
amed+0.014
 listened+0.014
 herself+0.013
gen+0.013
Neg logit contributions
 Esk-0.017
 camps-0.016
 polar-0.015
icer-0.015
rou-0.015
Token your
Token ID534
Feature activation-1.85e+00

Pos logit contributions
Her+0.019
 Her+0.016
gen+0.015
asure+0.014
 counted+0.014
Neg logit contributions
ge-0.025
 Esk-0.024
 camps-0.024
rou-0.023
icer-0.023
Token lunch
Token ID9965
Feature activation+4.88e-01

Pos logit contributions
Her+0.048
 Her+0.039
gen+0.038
 herself+0.037
Then+0.036
Neg logit contributions
 Esk-0.043
 camps-0.041
ge-0.041
rou-0.038
 polar-0.037
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
 camps+0.011
 breath+0.011
 asleep+0.011
 Esk+0.010
yt+0.010
Neg logit contributions
Her-0.019
ough-0.016
asure-0.016
gen-0.016
Then-0.016
Token black
Token ID2042
Feature activation+1.58e+00

Pos logit contributions
 Esk+0.024
ge+0.023
 camps+0.022
rou+0.022
 upon+0.022
Neg logit contributions
Her-0.026
gen-0.022
 Her-0.022
asure-0.021
Then-0.020
Token mascara
Token ID48430
Feature activation+2.92e-01

Pos logit contributions
 upon+0.031
ge+0.031
 Esk+0.031
yt+0.030
icer+0.030
Neg logit contributions
Her-0.040
gen-0.036
Then-0.034
asure-0.033
 Her-0.033
Token on
Token ID319
Feature activation+3.06e+00

Pos logit contributions
ge+0.007
 Esk+0.006
 camps+0.006
rou+0.006
yt+0.006
Neg logit contributions
Her-0.007
gen-0.006
 Her-0.006
asure-0.006
Then-0.006
Token her
Token ID607
Feature activation+3.32e+00

Pos logit contributions
 Esk+0.078
ge+0.076
yt+0.076
 breath+0.076
icer+0.075
Neg logit contributions
asure-0.049
amed-0.047
gen-0.045
Her-0.044
 counted-0.044
Token eyel
Token ID29708
Feature activation+7.90e+00

Pos logit contributions
 breath+0.067
 Esk+0.062
 upon+0.060
yt+0.058
ge+0.058
Neg logit contributions
Her-0.079
gen-0.073
asure-0.072
Then-0.071
 listened-0.069
Tokenashes
Token ID7465
Feature activation-7.00e-01

Pos logit contributions
ge+0.158
 Esk+0.157
rou+0.151
 camps+0.151
icer+0.142
Neg logit contributions
Her-0.216
gen-0.177
 Her-0.177
 herself-0.175
 listened-0.175
Token.
Token ID13
Feature activation+3.17e-01

Pos logit contributions
Her+0.020
gen+0.017
Then+0.016
asure+0.016
 Her+0.016
Neg logit contributions
ge-0.013
 Esk-0.012
 upon-0.012
rou-0.011
 camps-0.011
Token She
Token ID1375
Feature activation+1.54e+00

Pos logit contributions
 upon+0.006
 Esk+0.006
 camps+0.006
joy+0.005
rou+0.005
Neg logit contributions
gen-0.007
amed-0.007
asure-0.007
 listened-0.007
ough-0.007
Token makes
Token ID1838
Feature activation-4.96e-01

Pos logit contributions
 Esk+0.033
 camps+0.032
ge+0.032
rou+0.032
icer+0.029
Neg logit contributions
Her-0.040
gen-0.033
 Her-0.032
asure-0.031
ough-0.031
Token a
Token ID257
Feature activation+5.33e-01

Pos logit contributions
Her+0.008
gen+0.006
 Her+0.006
asure+0.006
Then+0.005
Neg logit contributions
 Esk-0.015
ge-0.015
rou-0.015
 camps-0.015
 upon-0.014
Token on
Token ID319
Feature activation-1.98e-01

Pos logit contributions
 Esk+0.029
ge+0.029
 camps+0.029
icer+0.028
rou+0.027
Neg logit contributions
Her-0.028
 Her-0.023
gen-0.023
asure-0.022
Then-0.021
Token dry
Token ID5894
Feature activation+7.79e-01

Pos logit contributions
Her+0.004
gen+0.004
asure+0.003
amed+0.003
 Her+0.003
Neg logit contributions
ge-0.005
 camps-0.005
 Esk-0.005
 upon-0.005
rou-0.005
Token and
Token ID290
Feature activation-1.52e+00

Pos logit contributions
 Esk+0.020
ge+0.020
 camps+0.019
rou+0.018
icer+0.017
Neg logit contributions
Her-0.018
 Her-0.014
gen-0.014
asure-0.013
 herself-0.013
Token cozy
Token ID37438
Feature activation+2.37e+00

Pos logit contributions
Her+0.040
 Her+0.034
gen+0.033
asure+0.032
:+0.030
Neg logit contributions
 Esk-0.034
ge-0.032
 camps-0.031
rou-0.029
 upon-0.029
Token p
Token ID279
Feature activation+1.56e+00

Pos logit contributions
ge+0.060
 Esk+0.058
rou+0.056
icer+0.054
 camps+0.053
Neg logit contributions
Her-0.054
 herself-0.046
 Her-0.045
gen-0.042
 Then-0.042
Tokenaj
Token ID1228
Feature activation+7.88e+00

Pos logit contributions
 Esk+0.024
ge+0.022
 camps+0.021
icer+0.019
 polar+0.019
Neg logit contributions
Her-0.050
 Her-0.044
 herself-0.044
ages-0.042
ough-0.041
Tokenamas
Token ID17485
Feature activation+3.67e+00

Pos logit contributions
 Esk+0.128
ge+0.122
 camps+0.116
yt+0.114
rou+0.106
Neg logit contributions
Her-0.243
gen-0.219
 Her-0.202
 Queen-0.201
 listened-0.201
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
ge+0.093
 Esk+0.092
 camps+0.087
rou+0.084
yt+0.081
Neg logit contributions
Her-0.083
 Her-0.067
gen-0.067
asure-0.063
 herself-0.062
Token sn
Token ID3013
Feature activation+2.78e+00

Pos logit contributions
Her+0.028
gen+0.023
 Her+0.022
asure+0.022
Then+0.021
Neg logit contributions
 Esk-0.028
 camps-0.027
ge-0.027
 polar-0.025
 upon-0.025
Tokenugg
Token ID6837
Feature activation+4.74e+00

Pos logit contributions
 Esk+0.063
ge+0.063
 camps+0.059
rou+0.057
icer+0.055
Neg logit contributions
Her-0.071
 Her-0.059
gen-0.058
 herself-0.057
asure-0.055
Tokenled
Token ID992
Feature activation-1.18e+00

Pos logit contributions
 Esk+0.130
ge+0.130
 camps+0.124
rou+0.120
icer+0.116
Neg logit contributions
Her-0.098
 Her-0.078
 herself-0.075
gen-0.075
 Then-0.069
Token She
Token ID1375
Feature activation+4.19e-01

Pos logit contributions
 Esk+0.005
ge+0.005
 camps+0.005
rou+0.005
icer+0.005
Neg logit contributions
Her-0.003
gen-0.003
 Her-0.003
asure-0.003
 herself-0.003
Token had
Token ID550
Feature activation-8.30e-01

Pos logit contributions
 camps+0.009
 Esk+0.009
 upon+0.008
ge+0.008
rou+0.008
Neg logit contributions
Her-0.011
asure-0.009
ough-0.009
 Her-0.009
 herself-0.009
Token a
Token ID257
Feature activation+1.75e+00

Pos logit contributions
Her+0.014
gen+0.010
 Her+0.010
asure+0.010
ough+0.008
Neg logit contributions
 Esk-0.026
 camps-0.025
ge-0.024
rou-0.024
 upon-0.023
Token big
Token ID1263
Feature activation+1.89e+00

Pos logit contributions
 Esk+0.040
ge+0.040
 camps+0.038
rou+0.036
icer+0.035
Neg logit contributions
Her-0.044
 Her-0.037
gen-0.036
 herself-0.035
asure-0.034
Token magn
Token ID7842
Feature activation+5.02e+00

Pos logit contributions
 Esk+0.042
ge+0.042
 camps+0.041
rou+0.039
icer+0.038
Neg logit contributions
Her-0.048
 Her-0.040
gen-0.039
 herself-0.038
asure-0.037
Tokenifying
Token ID4035
Feature activation+7.85e+00

Pos logit contributions
 Esk+0.095
ge+0.084
 camps+0.077
 polar+0.075
rou+0.069
Neg logit contributions
Her-0.150
 Her-0.133
asure-0.131
:-0.130
 herself-0.126
Token glass
Token ID5405
Feature activation+7.73e-01

Pos logit contributions
 Esk+0.153
 camps+0.147
ge+0.141
icer+0.136
rou+0.135
Neg logit contributions
Her-0.225
gen-0.188
 Her-0.187
 herself-0.185
Then-0.179
Token that
Token ID326
Feature activation+4.96e+00

Pos logit contributions
ge+0.019
 Esk+0.018
 camps+0.018
rou+0.017
yt+0.017
Neg logit contributions
Her-0.018
 Her-0.015
gen-0.015
 herself-0.014
Then-0.014
Token made
Token ID925
Feature activation+1.42e+00

Pos logit contributions
 upon+0.089
 Esk+0.083
cast+0.080
yt+0.073
 scaven+0.069
Neg logit contributions
Her-0.135
Then-0.116
 eighteen-0.112
His-0.111
gen-0.111
Token things
Token ID1243
Feature activation-1.53e-01

Pos logit contributions
 Esk+0.039
 upon+0.038
ge+0.038
 camps+0.036
rou+0.036
Neg logit contributions
Her-0.025
gen-0.022
Then-0.021
asure-0.020
His-0.019
Token look
Token ID804
Feature activation+1.51e+00

Pos logit contributions
Her+0.004
 Her+0.003
gen+0.003
Then+0.003
asure+0.003
Neg logit contributions
ge-0.004
 Esk-0.003
 camps-0.003
rou-0.003
yt-0.003
Token.
Token ID13
Feature activation+3.52e-01

Pos logit contributions
Her+0.003
 Her+0.002
gen+0.002
asure+0.002
 herself+0.002
Neg logit contributions
ge-0.003
 Esk-0.003
 camps-0.003
rou-0.003
yt-0.003
Token They
Token ID1119
Feature activation-3.24e-01

Pos logit contributions
ge+0.011
 Esk+0.011
rou+0.011
 camps+0.011
yt+0.010
Neg logit contributions
Her-0.006
 Her-0.005
 Then-0.004
 herself-0.004
gen-0.004
Token had
Token ID550
Feature activation-7.37e-01

Pos logit contributions
Her+0.008
gen+0.007
 Her+0.006
asure+0.006
 herself+0.006
Neg logit contributions
ge-0.007
 Esk-0.007
 camps-0.007
 upon-0.007
rou-0.007
Token a
Token ID257
Feature activation+1.72e+00

Pos logit contributions
Her+0.010
 Her+0.007
gen+0.007
asure+0.006
 herself+0.006
Neg logit contributions
 Esk-0.025
ge-0.025
 camps-0.024
rou-0.023
 upon-0.023
Token magn
Token ID7842
Feature activation+4.80e+00

Pos logit contributions
ge+0.038
 Esk+0.038
 camps+0.036
rou+0.035
icer+0.034
Neg logit contributions
Her-0.044
 Her-0.037
gen-0.036
 herself-0.035
asure-0.034
Tokenifying
Token ID4035
Feature activation+7.85e+00

Pos logit contributions
 Esk+0.101
ge+0.087
 fuss+0.081
 polar+0.079
 choice+0.078
Neg logit contributions
Her-0.130
asure-0.116
:-0.115
 Her-0.113
amed-0.111
Token glass
Token ID5405
Feature activation+9.57e-01

Pos logit contributions
 Esk+0.155
 camps+0.148
ge+0.147
rou+0.137
icer+0.136
Neg logit contributions
Her-0.223
 Her-0.187
gen-0.187
 herself-0.182
asure-0.176
Token and
Token ID290
Feature activation+1.74e-01

Pos logit contributions
ge+0.022
 Esk+0.022
 camps+0.021
rou+0.021
icer+0.020
Neg logit contributions
Her-0.023
 Her-0.019
gen-0.019
 herself-0.018
asure-0.018
Token a
Token ID257
Feature activation+1.94e+00

Pos logit contributions
 Esk+0.005
 camps+0.005
ge+0.005
grown+0.005
 upon+0.005
Neg logit contributions
Her-0.003
gen-0.002
 Her-0.002
asure-0.002
Then-0.002
Token small
Token ID1402
Feature activation+1.96e+00

Pos logit contributions
 camps+0.040
 Esk+0.040
ge+0.039
rou+0.037
yt+0.037
Neg logit contributions
Her-0.052
 Her-0.044
gen-0.043
 herself-0.043
asure-0.041
Token box
Token ID3091
Feature activation+1.35e+00

Pos logit contributions
ge+0.043
 Esk+0.043
 camps+0.042
rou+0.039
yt+0.038
Neg logit contributions
Her-0.051
 Her-0.042
gen-0.041
 herself-0.041
asure-0.039
Token a
Token ID257
Feature activation+1.27e+00

Pos logit contributions
Her+0.012
 Her+0.008
gen+0.007
 herself+0.007
 counted+0.006
Neg logit contributions
ge-0.038
 Esk-0.037
 camps-0.036
rou-0.036
icer-0.035
Token flashlight
Token ID38371
Feature activation+1.36e+00

Pos logit contributions
ge+0.030
 Esk+0.029
 camps+0.027
rou+0.027
icer+0.026
Neg logit contributions
Her-0.032
 Her-0.027
gen-0.026
 herself-0.025
asure-0.024
Token and
Token ID290
Feature activation-1.89e-01

Pos logit contributions
ge+0.039
 Esk+0.039
 camps+0.038
rou+0.037
yt+0.035
Neg logit contributions
Her-0.026
 Her-0.020
gen-0.019
 herself-0.019
asure-0.018
Token a
Token ID257
Feature activation+1.55e+00

Pos logit contributions
Her+0.003
 Her+0.002
gen+0.002
asure+0.002
Then+0.002
Neg logit contributions
 Esk-0.005
 camps-0.005
ge-0.005
 polar-0.005
 upon-0.005
Token magn
Token ID7842
Feature activation+4.99e+00

Pos logit contributions
 Esk+0.032
 camps+0.031
ge+0.031
rou+0.029
yt+0.028
Neg logit contributions
Her-0.042
 Her-0.036
gen-0.035
 herself-0.035
asure-0.034
Tokenifying
Token ID4035
Feature activation+7.80e+00

Pos logit contributions
 Esk+0.106
ge+0.094
 fuss+0.094
 choice+0.091
 polar+0.087
Neg logit contributions
Her-0.130
:-0.118
asure-0.117
 Her-0.114
gen-0.110
Token glass
Token ID5405
Feature activation+7.04e-01

Pos logit contributions
 Esk+0.162
ge+0.162
 camps+0.152
rou+0.146
icer+0.140
Neg logit contributions
Her-0.212
 Her-0.180
gen-0.176
 herself-0.172
asure-0.166
Token to
Token ID284
Feature activation+2.96e+00

Pos logit contributions
 Esk+0.018
ge+0.018
 camps+0.017
rou+0.016
icer+0.016
Neg logit contributions
Her-0.016
 Her-0.013
gen-0.013
 herself-0.012
asure-0.012
Token examine
Token ID10716
Feature activation+1.87e-02

Pos logit contributions
 Esk+0.055
ge+0.051
 camps+0.048
 polar+0.047
 breath+0.046
Neg logit contributions
Her-0.082
Then-0.071
 Her-0.070
asure-0.070
amed-0.069
Token their
Token ID511
Feature activation-1.88e-01

Pos logit contributions
 Esk+0.000
ge+0.000
 camps+0.000
rou+0.000
icer+0.000
Neg logit contributions
Her-0.000
 Her-0.000
gen-0.000
 herself-0.000
asure-0.000
Token finds
Token ID7228
Feature activation-3.77e-02

Pos logit contributions
Her+0.005
 Her+0.004
gen+0.004
 herself+0.004
asure+0.004
Neg logit contributions
 Esk-0.004
ge-0.004
 camps-0.004
rou-0.004
icer-0.004
Token a
Token ID257
Feature activation+2.02e+00

Pos logit contributions
Her+0.018
gen+0.016
ages+0.015
ough+0.014
 Her+0.014
Neg logit contributions
 Esk-0.022
 camps-0.020
rou-0.020
grown-0.020
 ig-0.020
Token family
Token ID1641
Feature activation-3.87e-01

Pos logit contributions
ge+0.050
 Esk+0.049
 camps+0.047
rou+0.045
icer+0.044
Neg logit contributions
Her-0.047
 Her-0.039
gen-0.038
 herself-0.036
asure-0.035
Token of
Token ID286
Feature activation+6.09e-01

Pos logit contributions
Her+0.008
 Her+0.007
 herself+0.007
:+0.006
gen+0.006
Neg logit contributions
 Esk-0.010
ge-0.010
 camps-0.010
icer-0.010
rou-0.009
Token turtles
Token ID36288
Feature activation+3.16e-01

Pos logit contributions
 Esk+0.016
ge+0.016
 camps+0.015
rou+0.015
icer+0.014
Neg logit contributions
Her-0.013
 Her-0.011
gen-0.010
 herself-0.010
asure-0.010
Token sun
Token ID4252
Feature activation-3.47e-01

Pos logit contributions
 Esk+0.008
ge+0.008
 camps+0.008
rou+0.008
icer+0.008
Neg logit contributions
Her-0.007
 Her-0.006
 herself-0.005
gen-0.005
:-0.005
Tokenbat
Token ID8664
Feature activation+7.79e+00

Pos logit contributions
Her+0.009
 Her+0.007
gen+0.007
 herself+0.007
Then+0.007
Neg logit contributions
ge-0.008
 Esk-0.008
 camps-0.007
rou-0.007
icer-0.007
Tokenhing
Token ID722
Feature activation-1.86e+00

Pos logit contributions
ge+0.166
 Esk+0.164
 camps+0.154
rou+0.146
icer+0.145
Neg logit contributions
Her-0.207
 Her-0.174
gen-0.170
 herself-0.165
Then-0.160
Token in
Token ID287
Feature activation+1.26e+00

Pos logit contributions
Her+0.045
 Her+0.037
gen+0.034
 herself+0.034
asure+0.034
Neg logit contributions
 Esk-0.045
ge-0.045
 camps-0.044
rou-0.040
 polar-0.039
Token a
Token ID257
Feature activation+2.46e+00

Pos logit contributions
ge+0.047
rou+0.046
 Esk+0.044
 camps+0.044
yt+0.043
Neg logit contributions
Her-0.015
 her-0.011
 herself-0.010
 Her-0.010
 a-0.008
Token nearby
Token ID6716
Feature activation+3.06e+00

Pos logit contributions
ge+0.057
 Esk+0.056
 camps+0.052
rou+0.051
icer+0.049
Neg logit contributions
Her-0.062
 Her-0.052
gen-0.050
 herself-0.048
Then-0.047
Token pond
Token ID16723
Feature activation+2.08e-01

Pos logit contributions
 Esk+0.055
 camps+0.054
ge+0.051
 polar+0.048
icer+0.047
Neg logit contributions
Her-0.093
 Her-0.078
gen-0.077
 herself-0.077
Then-0.075
Token and
Token ID290
Feature activation-2.53e-01

Pos logit contributions
ge+0.024
 camps+0.020
 upon+0.019
 asleep+0.019
rou+0.019
Neg logit contributions
Her-0.031
gen-0.028
 Fran-0.028
 Her-0.027
 Queen-0.027
Token put
Token ID1234
Feature activation+6.38e-01

Pos logit contributions
Her+0.006
asure+0.005
 Her+0.005
ough+0.005
gen+0.005
Neg logit contributions
 Esk-0.006
ge-0.006
 breath-0.005
 camps-0.005
 polar-0.005
Token on
Token ID319
Feature activation-3.67e-01

Pos logit contributions
 Esk+0.013
 camps+0.012
icer+0.012
ge+0.012
 breath+0.011
Neg logit contributions
Her-0.018
 Her-0.016
asure-0.015
gen-0.014
Then-0.014
Token their
Token ID511
Feature activation-1.38e+00

Pos logit contributions
Her+0.008
asure+0.006
 Her+0.006
gen+0.006
Then+0.006
Neg logit contributions
 camps-0.010
 Esk-0.009
ge-0.009
icer-0.009
rou-0.009
Token p
Token ID279
Feature activation+2.58e+00

Pos logit contributions
Her+0.037
gen+0.030
asure+0.030
amed+0.030
 Her+0.029
Neg logit contributions
 Esk-0.029
 camps-0.029
ge-0.027
 breath-0.026
 polar-0.026
Tokenaj
Token ID1228
Feature activation+7.69e+00

Pos logit contributions
 Esk+0.026
ge+0.026
 camps+0.023
rou+0.020
icer+0.018
Neg logit contributions
Her-0.098
 Her-0.087
gen-0.086
 herself-0.084
asure-0.083
Tokenamas
Token ID17485
Feature activation+2.46e+00

Pos logit contributions
 Esk+0.122
ge+0.117
 camps+0.116
yt+0.111
icer+0.110
Neg logit contributions
Her-0.240
gen-0.213
 listened-0.202
 Her-0.200
 Queen-0.198
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
ge+0.059
 Esk+0.057
 camps+0.054
rou+0.053
 upon+0.051
Neg logit contributions
Her-0.059
 Her-0.049
gen-0.048
asure-0.046
Then-0.045
Token They
Token ID1119
Feature activation-2.34e-01

Pos logit contributions
Her+0.004
gen+0.003
asure+0.003
 listened+0.003
amed+0.003
Neg logit contributions
 Esk-0.005
 camps-0.005
ge-0.005
 upon-0.004
rou-0.004
Token went
Token ID1816
Feature activation+6.76e-01

Pos logit contributions
Her+0.005
 Her+0.004
gen+0.004
 herself+0.004
asure+0.004
Neg logit contributions
 Esk-0.006
ge-0.006
 camps-0.006
rou-0.006
icer-0.006
Token to
Token ID284
Feature activation-2.10e-01

Pos logit contributions
 Esk+0.020
ge+0.020
icer+0.019
rou+0.018
 camps+0.018
Neg logit contributions
Her-0.013
 Her-0.010
 herself-0.010
:-0.009
 Then-0.009
Token examine
Token ID10716
Feature activation-5.72e-01

Pos logit contributions
Her+0.017
gen+0.014
 Her+0.014
 herself+0.014
Then+0.013
Neg logit contributions
 Esk-0.015
ge-0.015
 camps-0.014
rou-0.013
 upon-0.013
Token it
Token ID340
Feature activation-2.56e+00

Pos logit contributions
Her+0.015
 Her+0.013
gen+0.013
asure+0.012
Then+0.012
Neg logit contributions
 Esk-0.012
ge-0.012
 camps-0.011
icer-0.011
rou-0.011
Token with
Token ID351
Feature activation-6.47e-01

Pos logit contributions
Her+0.076
 Her+0.066
gen+0.065
ages+0.061
amed+0.061
Neg logit contributions
 Esk-0.046
ge-0.045
 camps-0.045
 upon-0.043
icer-0.040
Token the
Token ID262
Feature activation+1.28e+00

Pos logit contributions
Her+0.017
 Her+0.014
Then+0.014
amed+0.014
ages+0.014
Neg logit contributions
 Esk-0.012
 breath-0.012
 upon-0.012
 polar-0.012
 camps-0.012
Token magn
Token ID7842
Feature activation+4.65e+00

Pos logit contributions
 Esk+0.030
 camps+0.028
ge+0.028
 upon+0.028
rou+0.027
Neg logit contributions
Her-0.032
 Her-0.025
gen-0.024
amed-0.023
Then-0.023
Tokenifying
Token ID4035
Feature activation+7.68e+00

Pos logit contributions
 Esk+0.075
ge+0.071
 camps+0.063
rou+0.059
 polar+0.057
Neg logit contributions
Her-0.149
 Her-0.132
gen-0.131
asure-0.130
:-0.129
Token glass
Token ID5405
Feature activation+7.43e-01

Pos logit contributions
 Esk+0.136
 camps+0.131
ge+0.125
icer+0.120
rou+0.118
Neg logit contributions
Her-0.232
gen-0.196
 Her-0.194
 herself-0.193
Then-0.188
Token?"
Token ID1701
Feature activation-5.11e-01

Pos logit contributions
ge+0.013
 Esk+0.013
 camps+0.013
 upon+0.012
yt+0.012
Neg logit contributions
Her-0.022
 Her-0.019
gen-0.018
Then-0.018
 herself-0.018
Token\n
Token ID198
Feature activation+7.37e-01

Pos logit contributions
Her+0.009
 Her+0.007
 herself+0.006
gen+0.006
 Then+0.006
Neg logit contributions
ge-0.016
 Esk-0.016
 camps-0.016
rou-0.015
icer-0.015
Token\n
Token ID198
Feature activation+2.73e-01

Pos logit contributions
ge+0.026
 Esk+0.025
 camps+0.024
rou+0.024
icer+0.023
Neg logit contributions
Her-0.011
 Her-0.008
 Then-0.007
Then-0.007
:-0.007
Token"
Token ID1
Feature activation-1.48e-02

Pos logit contributions
 Esk+0.008
ge+0.008
 camps+0.007
 upon+0.007
rou+0.007
Neg logit contributions
Her-0.005
gen-0.004
 herself-0.004
 Her-0.004
 Queen-0.004
Token let
Token ID1309
Feature activation+2.22e+00

Pos logit contributions
ge+0.000
 Esk+0.000
rou+0.000
 upon+0.000
 polar+0.000
Neg logit contributions
Her-0.000
ough-0.000
 herself-0.000
 Her-0.000
gen-0.000
Token out
Token ID503
Feature activation-4.48e+00

Pos logit contributions
 Esk+0.053
ge+0.051
 camps+0.050
 upon+0.047
icer+0.047
Neg logit contributions
Her-0.054
 Her-0.042
asure-0.042
gen-0.042
Then-0.041
Token a
Token ID257
Feature activation+8.11e-01

Pos logit contributions
Her+0.071
gen+0.052
Then+0.049
 Her+0.048
asure+0.046
Neg logit contributions
 Esk-0.142
ge-0.142
 camps-0.136
rou-0.132
 upon-0.132
Token big
Token ID1263
Feature activation+1.35e+00

Pos logit contributions
 Esk+0.020
ge+0.020
 camps+0.019
rou+0.018
icer+0.017
Neg logit contributions
Her-0.019
 Her-0.016
gen-0.015
 herself-0.015
asure-0.014
Token sn
Token ID3013
Feature activation+3.09e+00

Pos logit contributions
 Esk+0.032
ge+0.032
 camps+0.031
rou+0.030
icer+0.029
Neg logit contributions
Her-0.033
 Her-0.027
gen-0.026
 herself-0.026
asure-0.025
Tokenarl
Token ID7063
Feature activation+7.63e+00

Pos logit contributions
ge+0.051
rou+0.051
 Esk+0.049
 camps+0.046
icer+0.046
Neg logit contributions
Her-0.095
 Her-0.081
gen-0.080
 herself-0.079
 counted-0.079
Token and
Token ID290
Feature activation-3.12e-01

Pos logit contributions
ge+0.199
 Esk+0.196
 camps+0.188
rou+0.181
yt+0.178
Neg logit contributions
Her-0.166
 Her-0.135
gen-0.131
 herself-0.126
asure-0.123
Token the
Token ID262
Feature activation-2.22e-01

Pos logit contributions
Her+0.008
gen+0.007
asure+0.007
 Her+0.007
ough+0.006
Neg logit contributions
 Esk-0.007
 camps-0.007
ge-0.007
rou-0.007
 polar-0.006
Token mouse
Token ID10211
Feature activation-2.02e-01

Pos logit contributions
Her+0.005
 Her+0.004
gen+0.004
 herself+0.004
asure+0.004
Neg logit contributions
 Esk-0.006
ge-0.006
 camps-0.005
rou-0.005
icer-0.005
Token knew
Token ID2993
Feature activation+2.47e-01

Pos logit contributions
Her+0.005
gen+0.004
 Her+0.004
asure+0.004
 herself+0.003
Neg logit contributions
ge-0.005
 Esk-0.005
 camps-0.005
 upon-0.005
rou-0.005
Token he
Token ID339
Feature activation+2.48e+00

Pos logit contributions
 Esk+0.006
ge+0.006
 camps+0.006
rou+0.006
 upon+0.006
Neg logit contributions
Her-0.006
 Her-0.005
gen-0.004
Then-0.004
asure-0.004
Token and
Token ID290
Feature activation-8.47e-01

Pos logit contributions
 Esk+0.043
ge+0.043
 camps+0.041
rou+0.039
icer+0.038
Neg logit contributions
Her-0.033
 herself-0.028
 Her-0.026
gen-0.025
:-0.024
Token put
Token ID1234
Feature activation+6.82e-01

Pos logit contributions
Her+0.021
 Her+0.017
gen+0.017
 herself+0.016
asure+0.016
Neg logit contributions
 Esk-0.020
ge-0.020
 camps-0.019
rou-0.018
icer-0.018
Token on
Token ID319
Feature activation-6.02e-01

Pos logit contributions
 Esk+0.015
icer+0.014
 camps+0.014
ge+0.014
 breath+0.013
Neg logit contributions
Her-0.018
 Her-0.015
gen-0.014
asure-0.014
Then-0.014
Token your
Token ID534
Feature activation-7.22e-01

Pos logit contributions
Her+0.013
gen+0.010
 Her+0.010
asure+0.010
 counted+0.009
Neg logit contributions
 Esk-0.016
ge-0.016
 camps-0.016
rou-0.015
icer-0.015
Token p
Token ID279
Feature activation+2.48e+00

Pos logit contributions
Her+0.019
 Her+0.016
gen+0.016
asure+0.015
Then+0.015
Neg logit contributions
 Esk-0.016
 camps-0.015
ge-0.015
 polar-0.014
icer-0.014
Tokenaj
Token ID1228
Feature activation+7.57e+00

Pos logit contributions
 Esk+0.034
ge+0.032
 camps+0.028
 polar+0.025
icer+0.024
Neg logit contributions
Her-0.084
 Her-0.075
asure-0.072
 herself-0.072
gen-0.072
Tokenamas
Token ID17485
Feature activation+2.51e+00

Pos logit contributions
 Esk+0.142
ge+0.139
 camps+0.133
rou+0.125
icer+0.125
Neg logit contributions
Her-0.222
gen-0.188
 Her-0.187
 herself-0.178
 listened-0.176
Token.
Token ID13
Feature activation-6.87e-01

Pos logit contributions
ge+0.060
 Esk+0.058
 camps+0.056
rou+0.055
 upon+0.053
Neg logit contributions
Her-0.061
 Her-0.050
gen-0.049
Then-0.046
 herself-0.046
Token Then
Token ID3244
Feature activation-1.89e+00

Pos logit contributions
Her+0.014
gen+0.011
 herself+0.011
 Her+0.011
 listened+0.011
Neg logit contributions
 Esk-0.019
ge-0.018
 camps-0.017
rou-0.017
icer-0.017
Token we
Token ID356
Feature activation-3.07e-01

Pos logit contributions
Her+0.038
gen+0.029
 Her+0.028
amed+0.028
asure+0.028
Neg logit contributions
 Esk-0.053
ge-0.051
 camps-0.050
yt-0.048
 upon-0.048
Token can
Token ID460
Feature activation-3.15e-01

Pos logit contributions
Her+0.006
gen+0.006
 listened+0.006
 Her+0.006
 counted+0.006
Neg logit contributions
ge-0.008
 Esk-0.007
rou-0.007
yt-0.007
 camps-0.007
Token the
Token ID262
Feature activation+1.34e-01

Pos logit contributions
Her+0.037
gen+0.028
 Her+0.028
Then+0.026
asure+0.025
Neg logit contributions
ge-0.067
 Esk-0.067
 camps-0.065
rou-0.063
 upon-0.061
Token necklace
Token ID38987
Feature activation-4.53e-01

Pos logit contributions
ge+0.004
 Esk+0.004
 camps+0.004
rou+0.003
icer+0.003
Neg logit contributions
Her-0.003
 herself-0.002
 Her-0.002
gen-0.002
asure-0.002
Token and
Token ID290
Feature activation-8.26e-02

Pos logit contributions
Her+0.012
Then+0.010
gen+0.010
 Her+0.009
ough+0.009
Neg logit contributions
ge-0.010
 Esk-0.009
 upon-0.009
yt-0.009
 camps-0.008
Token the
Token ID262
Feature activation+5.83e-01

Pos logit contributions
Her+0.002
gen+0.002
ough+0.002
Then+0.002
asure+0.001
Neg logit contributions
 Esk-0.002
ge-0.002
 camps-0.002
yt-0.002
 polar-0.002
Token magn
Token ID7842
Feature activation+4.84e+00

Pos logit contributions
 Esk+0.015
ge+0.015
 camps+0.014
rou+0.014
icer+0.013
Neg logit contributions
Her-0.013
 Her-0.011
gen-0.010
 herself-0.010
asure-0.010
Tokenifying
Token ID4035
Feature activation+7.52e+00

Pos logit contributions
 Esk+0.100
 camps+0.088
ge+0.085
 polar+0.082
rou+0.076
Neg logit contributions
Her-0.137
 Her-0.118
 herself-0.117
asure-0.115
gen-0.114
Token glass
Token ID5405
Feature activation+5.77e-01

Pos logit contributions
 Esk+0.138
 camps+0.131
ge+0.130
rou+0.122
icer+0.121
Neg logit contributions
Her-0.222
 Her-0.188
gen-0.187
 herself-0.184
Then-0.178
Token and
Token ID290
Feature activation+6.04e-02

Pos logit contributions
ge+0.014
 Esk+0.014
 camps+0.013
rou+0.013
yt+0.013
Neg logit contributions
Her-0.014
 Her-0.011
gen-0.011
 herself-0.010
Then-0.010
Token told
Token ID1297
Feature activation+1.57e-01

Pos logit contributions
ge+0.001
 Esk+0.001
 camps+0.001
yt+0.001
 upon+0.001
Neg logit contributions
Her-0.001
gen-0.001
Then-0.001
ough-0.001
 Her-0.001
Token him
Token ID683
Feature activation-1.98e+00

Pos logit contributions
 Esk+0.004
ge+0.004
 camps+0.004
rou+0.004
icer+0.003
Neg logit contributions
Her-0.004
gen-0.003
 Her-0.003
asure-0.003
Then-0.003
Token that
Token ID326
Feature activation+2.74e+00

Pos logit contributions
Her+0.037
 Her+0.029
gen+0.028
 herself+0.027
asure+0.025
Neg logit contributions
 Esk-0.059
ge-0.059
 camps-0.056
rou-0.054
icer-0.054
Token her
Token ID607
Feature activation+1.01e+00

Pos logit contributions
Her+0.045
gen+0.041
asure+0.038
thel+0.037
 Queen+0.037
Neg logit contributions
 upon-0.047
ge-0.046
 Esk-0.045
 camps-0.045
rou-0.041
Token own
Token ID898
Feature activation+5.45e-01

Pos logit contributions
ge+0.027
 Esk+0.027
 camps+0.025
rou+0.025
icer+0.024
Neg logit contributions
Her-0.021
 Her-0.017
gen-0.017
 herself-0.016
asure-0.016
Token special
Token ID2041
Feature activation-2.49e-01

Pos logit contributions
 Esk+0.013
ge+0.013
 camps+0.013
rou+0.012
icer+0.012
Neg logit contributions
Her-0.013
 Her-0.011
gen-0.010
 herself-0.010
asure-0.010
Token slide
Token ID10649
Feature activation-1.21e+00

Pos logit contributions
Her+0.006
 Her+0.005
gen+0.005
 herself+0.005
asure+0.005
Neg logit contributions
 Esk-0.006
ge-0.006
 camps-0.005
rou-0.005
icer-0.005
Token-
Token ID12
Feature activation-8.69e-01

Pos logit contributions
Her+0.031
gen+0.026
 Her+0.025
 Queen+0.024
asure+0.024
Neg logit contributions
ge-0.028
 Esk-0.026
rou-0.025
 camps-0.025
yt-0.024
Tokenbat
Token ID8664
Feature activation+7.46e+00

Pos logit contributions
Her+0.022
 Her+0.019
 Queen+0.019
gen+0.018
 Then+0.017
Neg logit contributions
ge-0.020
rou-0.019
 Esk-0.016
grown-0.016
 upon-0.015
Tokenhe
Token ID258
Feature activation-1.18e+00

Pos logit contributions
ge+0.114
 Esk+0.109
 camps+0.101
rou+0.096
icer+0.094
Neg logit contributions
Her-0.246
 Her-0.214
gen-0.210
 herself-0.204
Then-0.201
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
Her+0.027
gen+0.023
 Her+0.023
asure+0.022
 Queen+0.020
Neg logit contributions
ge-0.029
 Esk-0.028
 camps-0.027
yt-0.026
rou-0.026
Token
Token ID198
Feature activation-2.46e-01

Pos logit contributions
gen+0.005
Her+0.005
 listened+0.005
asure+0.005
 counted+0.005
Neg logit contributions
 upon-0.005
 Esk-0.004
 camps-0.004
icer-0.004
ge-0.004
Token
Token ID198
Feature activation-6.39e-01

Pos logit contributions
Her+0.004
 Her+0.003
 herself+0.003
gen+0.003
Then+0.003
Neg logit contributions
 Esk-0.008
ge-0.008
 camps-0.008
rou-0.007
icer-0.007
TokenShe
Token ID3347
Feature activation+1.90e-01

Pos logit contributions
gen+0.010
Her+0.009
asure+0.009
 counted+0.009
 Her+0.009
Neg logit contributions
ge-0.018
 Esk-0.017
 camps-0.017
rou-0.017
 upon-0.016
Token cried
Token ID16896
Feature activation+2.91e+00

Pos logit contributions
Her+0.015
gen+0.014
asure+0.012
Thanks+0.012
amed+0.012
Neg logit contributions
ge-0.014
 Esk-0.014
yt-0.013
 upon-0.012
 camps-0.012
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
 Esk+0.070
ge+0.069
 camps+0.064
 polar+0.060
 upon+0.059
Neg logit contributions
Her-0.066
gen-0.061
asure-0.057
Then-0.053
 Her-0.053
Token dropping
Token ID12047
Feature activation-1.54e+00

Pos logit contributions
ge+0.031
 Esk+0.031
 camps+0.028
 upon+0.027
 polar+0.026
Neg logit contributions
Her-0.033
gen-0.031
asure-0.029
ough-0.028
 Her-0.027
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
Her+0.026
gen+0.020
 Her+0.020
asure+0.020
Then+0.019
Neg logit contributions
 Esk-0.047
ge-0.046
 camps-0.044
rou-0.042
 polar-0.042
Token magn
Token ID7842
Feature activation+5.19e+00

Pos logit contributions
ge+0.032
 Esk+0.032
 camps+0.031
 upon+0.029
rou+0.029
Neg logit contributions
Her-0.038
 Her-0.032
gen-0.030
 herself-0.029
 Then-0.029
Tokenifying
Token ID4035
Feature activation+7.46e+00

Pos logit contributions
 Esk+0.095
ge+0.084
 camps+0.079
 polar+0.073
rou+0.073
Neg logit contributions
Her-0.155
 Her-0.133
gen-0.132
asure-0.132
:-0.132
Token glass
Token ID5405
Feature activation+9.71e-01

Pos logit contributions
 Esk+0.150
ge+0.145
 camps+0.142
rou+0.134
icer+0.131
Neg logit contributions
Her-0.207
 Her-0.174
gen-0.173
 herself-0.169
asure-0.164
Token.
Token ID13
Feature activation+7.39e-01

Pos logit contributions
ge+0.024
 Esk+0.023
 camps+0.022
rou+0.022
icer+0.021
Neg logit contributions
Her-0.023
 Her-0.019
gen-0.018
 herself-0.018
Then-0.017
Token
Token ID198
Feature activation+2.78e-01

Pos logit contributions
 upon+0.015
 Esk+0.014
joy+0.013
grown+0.012
ounce+0.012
Neg logit contributions
gen-0.017
amed-0.015
ough-0.015
Her-0.015
ages-0.015
Token
Token ID198
Feature activation-1.65e-01

Pos logit contributions
ge+0.005
 Esk+0.005
 upon+0.005
 camps+0.005
rou+0.004
Neg logit contributions
gen-0.007
Her-0.006
 Queen-0.006
asure-0.006
 counted-0.006
Token"
Token ID1
Feature activation+2.99e-01

Pos logit contributions
 Queen+0.004
gen+0.004
aces+0.003
 Fairy+0.003
 Her+0.003
Neg logit contributions
ge-0.003
 upon-0.003
 Esk-0.003
ounce-0.003
grown-0.003
Token they
Token ID484
Feature activation+6.00e-01

Pos logit contributions
Her+0.009
 Her+0.007
gen+0.007
amed+0.007
asure+0.007
Neg logit contributions
 Esk-0.012
 camps-0.011
ge-0.011
 upon-0.010
yt-0.010
Token come
Token ID1282
Feature activation+9.94e-01

Pos logit contributions
ge+0.013
 Esk+0.013
 camps+0.013
rou+0.012
icer+0.012
Neg logit contributions
Her-0.015
 Her-0.013
gen-0.013
 herself-0.012
asure-0.012
Token out
Token ID503
Feature activation-1.18e+00

Pos logit contributions
 Esk+0.028
ge+0.027
 camps+0.026
rou+0.026
icer+0.025
Neg logit contributions
Her-0.020
 Her-0.016
gen-0.015
 herself-0.015
asure-0.014
Token and
Token ID290
Feature activation-1.33e-02

Pos logit contributions
Her+0.025
 Her+0.020
 herself+0.020
gen+0.020
:+0.019
Neg logit contributions
 Esk-0.031
ge-0.030
 camps-0.029
rou-0.028
icer-0.028
Token sun
Token ID4252
Feature activation-5.29e-02

Pos logit contributions
Her+0.000
 Her+0.000
gen+0.000
Then+0.000
 herself+0.000
Neg logit contributions
 Esk-0.000
ge-0.000
 camps-0.000
 upon-0.000
 polar-0.000
Tokenbat
Token ID8664
Feature activation+7.46e+00

Pos logit contributions
Her+0.001
 Her+0.001
gen+0.001
Then+0.001
 herself+0.001
Neg logit contributions
 Esk-0.001
ge-0.001
 camps-0.001
 upon-0.001
rou-0.001
Tokenhe
Token ID258
Feature activation-2.98e-01

Pos logit contributions
ge+0.122
 Esk+0.122
 camps+0.112
icer+0.106
 upon+0.106
Neg logit contributions
Her-0.234
 Her-0.200
gen-0.194
 herself-0.194
Then-0.186
Token on
Token ID319
Feature activation+1.77e+00

Pos logit contributions
Her+0.008
 Her+0.007
gen+0.006
asure+0.006
Then+0.006
Neg logit contributions
ge-0.007
 Esk-0.006
 camps-0.006
 upon-0.006
yt-0.006
Token the
Token ID262
Feature activation+8.89e-01

Pos logit contributions
 Esk+0.045
 camps+0.045
ge+0.044
icer+0.042
 upon+0.040
Neg logit contributions
Her-0.039
 Her-0.032
gen-0.031
asure-0.031
 counted-0.029
Token logs
Token ID17259
Feature activation+8.80e-01

Pos logit contributions
 Esk+0.024
ge+0.023
 camps+0.023
rou+0.021
icer+0.021
Neg logit contributions
Her-0.019
 Her-0.016
gen-0.015
 herself-0.014
asure-0.014
Token.
Token ID13
Feature activation-2.45e-01

Pos logit contributions
ge+0.020
 Esk+0.019
 camps+0.019
 upon+0.017
yt+0.017
Neg logit contributions
Her-0.023
 Her-0.019
gen-0.018
Then-0.017
asure-0.017
Token she
Token ID673
Feature activation+2.69e+00

Pos logit contributions
Her+0.017
 Her+0.012
gen+0.012
 herself+0.011
asure+0.011
Neg logit contributions
ge-0.038
 Esk-0.038
 camps-0.037
rou-0.036
icer-0.035
Token forgot
Token ID16453
Feature activation-3.16e-01

Pos logit contributions
 Esk+0.089
 camps+0.088
ge+0.088
yt+0.083
rou+0.081
Neg logit contributions
Her-0.036
 listened-0.032
 Her-0.029
:-0.029
gen-0.028
Token Teddy
Token ID29345
Feature activation+2.10e+00

Pos logit contributions
Her+0.006
 Her+0.005
gen+0.005
 herself+0.005
asure+0.004
Neg logit contributions
 Esk-0.009
ge-0.009
 camps-0.008
rou-0.008
 upon-0.008
Token at
Token ID379
Feature activation+1.27e+00

Pos logit contributions
ge+0.067
 Esk+0.066
 camps+0.063
rou+0.062
 upon+0.060
Neg logit contributions
Her-0.035
 Her-0.024
gen-0.023
Then-0.023
asure-0.022
Token the
Token ID262
Feature activation-2.06e-01

Pos logit contributions
ge+0.032
rou+0.032
 Esk+0.031
yt+0.030
icer+0.029
Neg logit contributions
Her-0.026
 herself-0.024
 her-0.023
 Her-0.022
gen-0.022
Token camps
Token ID11665
Feature activation+7.40e+00

Pos logit contributions
Her+0.005
gen+0.005
 herself+0.004
 Her+0.004
:+0.004
Neg logit contributions
ge-0.005
rou-0.004
run-0.004
 Esk-0.004
yt-0.004
Tokenite
Token ID578
Feature activation-9.64e-01

Pos logit contributions
 Esk+0.169
ge+0.166
 camps+0.157
rou+0.147
yt+0.144
Neg logit contributions
Her-0.184
 Her-0.155
gen-0.152
asure-0.147
 herself-0.147
Token.
Token ID13
Feature activation+8.22e-02

Pos logit contributions
Her+0.023
 Her+0.019
gen+0.018
Then+0.018
 herself+0.017
Neg logit contributions
ge-0.024
 Esk-0.023
 camps-0.023
rou-0.021
yt-0.021
Token She
Token ID1375
Feature activation+5.74e-01

Pos logit contributions
 Esk+0.003
ge+0.003
 camps+0.003
rou+0.002
 upon+0.002
Neg logit contributions
Her-0.001
gen-0.001
asure-0.001
 Her-0.001
 herself-0.001
Token was
Token ID373
Feature activation-1.59e+00

Pos logit contributions
 Esk+0.015
ge+0.015
 camps+0.014
rou+0.014
 polar+0.014
Neg logit contributions
Her-0.013
gen-0.010
 herself-0.010
 Her-0.010
asure-0.009
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
Her+0.037
gen+0.030
 Her+0.030
Then+0.028
 herself+0.028
Neg logit contributions
 Esk-0.041
ge-0.039
 camps-0.038
rou-0.037
 polar-0.035
Token in
Token ID287
Feature activation+1.14e-01

Pos logit contributions
Her+0.042
 Her+0.032
 herself+0.032
gen+0.031
Then+0.029
Neg logit contributions
ge-0.066
 Esk-0.065
 camps-0.062
rou-0.062
icer-0.059
Token the
Token ID262
Feature activation+6.67e-01

Pos logit contributions
ge+0.005
rou+0.005
ucker+0.005
ait+0.004
ounce+0.004
Neg logit contributions
 her-0.001
 herself-0.001
Her-0.001
 a-0.001
 one-0.000
Token river
Token ID7850
Feature activation-8.50e-01

Pos logit contributions
 Esk+0.018
ge+0.018
 camps+0.017
rou+0.016
icer+0.016
Neg logit contributions
Her-0.014
 Her-0.012
gen-0.011
 herself-0.011
asure-0.010
Token and
Token ID290
Feature activation-9.64e-01

Pos logit contributions
Her+0.017
 Her+0.013
gen+0.013
 herself+0.013
asure+0.012
Neg logit contributions
 Esk-0.024
ge-0.024
 camps-0.023
rou-0.022
icer-0.022
Token sun
Token ID4252
Feature activation-1.24e-01

Pos logit contributions
Her+0.025
 Her+0.021
gen+0.021
 herself+0.020
asure+0.020
Neg logit contributions
ge-0.021
 Esk-0.021
 camps-0.020
rou-0.019
icer-0.018
Tokenbat
Token ID8664
Feature activation+7.36e+00

Pos logit contributions
Her+0.003
 Her+0.003
gen+0.003
asure+0.002
Then+0.002
Neg logit contributions
 Esk-0.003
ge-0.003
 camps-0.003
 polar-0.002
yt-0.002
Tokenhe
Token ID258
Feature activation-3.37e-01

Pos logit contributions
 Esk+0.150
ge+0.149
 camps+0.139
rou+0.132
icer+0.130
Neg logit contributions
Her-0.204
 Her-0.173
gen-0.167
 herself-0.164
asure-0.158
Token on
Token ID319
Feature activation+1.53e+00

Pos logit contributions
Her+0.009
 Her+0.007
asure+0.007
gen+0.007
amed+0.007
Neg logit contributions
 Esk-0.007
ge-0.007
 camps-0.007
 breath-0.006
yt-0.006
Token the
Token ID262
Feature activation+7.90e-01

Pos logit contributions
 Esk+0.042
 camps+0.041
ge+0.039
icer+0.038
 polar+0.037
Neg logit contributions
Her-0.031
gen-0.026
asure-0.025
 Her-0.025
amed-0.024
Token shore
Token ID15191
Feature activation-4.35e-01

Pos logit contributions
 Esk+0.020
 camps+0.019
ge+0.019
rou+0.017
icer+0.017
Neg logit contributions
Her-0.019
 Her-0.016
gen-0.015
 herself-0.014
asure-0.014
Token.
Token ID13
Feature activation-7.26e-02

Pos logit contributions
Her+0.011
 Her+0.009
gen+0.009
asure+0.008
 herself+0.008
Neg logit contributions
 Esk-0.010
ge-0.010
 camps-0.009
rou-0.009
 upon-0.009
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
Her+0.027
 Her+0.022
gen+0.022
asure+0.021
amed+0.021
Neg logit contributions
ge-0.025
 Esk-0.025
 camps-0.023
rou-0.022
 upon-0.021
Token
Token ID198
Feature activation-4.18e-01

Pos logit contributions
 Esk+0.000
 camps+0.000
ge+0.000
 upon+0.000
rou+0.000
Neg logit contributions
Her-0.000
gen-0.000
asure-0.000
 herself-0.000
 listened-0.000
Token
Token ID198
Feature activation-7.89e-01

Pos logit contributions
Her+0.009
 Her+0.007
gen+0.007
 herself+0.007
asure+0.006
Neg logit contributions
ge-0.011
 Esk-0.011
 camps-0.011
rou-0.010
icer-0.010
Token"
Token ID1
Feature activation-1.47e-01

Pos logit contributions
Her+0.013
 Her+0.012
asure+0.011
gen+0.011
 herself+0.011
Neg logit contributions
ge-0.022
 Esk-0.022
 camps-0.022
 upon-0.021
rou-0.021
TokenBy
Token ID3886
Feature activation+5.16e-01

Pos logit contributions
Her+0.003
 Her+0.003
asure+0.003
ough+0.003
:+0.003
Neg logit contributions
 Esk-0.003
 camps-0.003
ge-0.003
 polar-0.003
icer-0.003
Tokene
Token ID68
Feature activation-6.95e+00

Pos logit contributions
 Esk+0.015
ge+0.014
 camps+0.013
icer+0.013
rou+0.013
Neg logit contributions
Her-0.010
 Her-0.009
asure-0.007
gen-0.007
ages-0.007
Token,
Token ID11
Feature activation-1.41e+00

Pos logit contributions
Her+0.112
 Her+0.083
gen+0.079
 herself+0.076
asure+0.071
Neg logit contributions
 Esk-0.222
ge-0.221
 camps-0.213
rou-0.207
icer-0.202
Token dolphin
Token ID44136
Feature activation-2.60e+00

Pos logit contributions
Her+0.036
 Her+0.031
gen+0.030
 herself+0.029
asure+0.028
Neg logit contributions
ge-0.032
 Esk-0.031
 camps-0.029
rou-0.028
icer-0.028
Token!"
Token ID2474
Feature activation-2.20e+00

Pos logit contributions
Her+0.064
 Her+0.054
gen+0.052
 herself+0.051
asure+0.049
Neg logit contributions
 Esk-0.060
ge-0.060
 camps-0.057
rou-0.055
icer-0.053
Token Ben
Token ID3932
Feature activation-5.40e-01

Pos logit contributions
Her+0.044
gen+0.040
asure+0.039
amed+0.037
ough+0.036
Neg logit contributions
 Esk-0.058
 camps-0.055
 upon-0.054
 polar-0.054
ge-0.052
Token and
Token ID290
Feature activation-5.45e-01

Pos logit contributions
Her+0.014
gen+0.011
 Her+0.011
 herself+0.011
asure+0.010
Neg logit contributions
 Esk-0.012
ge-0.011
 camps-0.011
yt-0.011
 polar-0.011
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
 Esk+0.055
 camps+0.052
rou+0.052
icer+0.052
ge+0.051
Neg logit contributions
Her-0.028
:-0.022
 herself-0.022
 Her-0.021
 she-0.020
Token
Token ID220
Feature activation+1.16e+00

Pos logit contributions
 Esk+0.005
 camps+0.005
ge+0.005
 upon+0.005
rou+0.005
Neg logit contributions
Her-0.004
gen-0.004
 herself-0.003
asure-0.003
 Her-0.003
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
 Esk+0.034
ge+0.034
 camps+0.032
rou+0.031
icer+0.031
Neg logit contributions
Her-0.022
 Her-0.017
gen-0.016
 herself-0.016
asure-0.015
Token"
Token ID1
Feature activation-5.54e-01

Pos logit contributions
Her+0.003
gen+0.003
 Her+0.003
 herself+0.003
asure+0.003
Neg logit contributions
 Esk-0.005
ge-0.005
 camps-0.005
rou-0.005
 upon-0.005
TokenBy
Token ID3886
Feature activation+4.98e-01

Pos logit contributions
Her+0.013
 Her+0.013
gen+0.012
:+0.012
ough+0.012
Neg logit contributions
 Esk-0.013
ge-0.012
 upon-0.011
 camps-0.011
rou-0.011
Tokene
Token ID68
Feature activation-6.94e+00

Pos logit contributions
 Esk+0.015
ge+0.014
 camps+0.014
rou+0.013
icer+0.013
Neg logit contributions
Her-0.010
 Her-0.008
gen-0.007
asure-0.006
 herself-0.006
Token little
Token ID1310
Feature activation-2.03e+00

Pos logit contributions
Her+0.124
 Her+0.097
gen+0.093
 herself+0.089
asure+0.085
Neg logit contributions
ge-0.209
 Esk-0.207
 camps-0.199
rou-0.193
icer-0.189
Token crab
Token ID32202
Feature activation-4.36e+00

Pos logit contributions
Her+0.040
 herself+0.034
 she+0.034
 listened+0.034
 Her+0.032
Neg logit contributions
ge-0.053
 Esk-0.050
rou-0.050
icer-0.049
 camps-0.049
Token!"
Token ID2474
Feature activation-2.21e+00

Pos logit contributions
Her+0.106
 Her+0.087
gen+0.083
 herself+0.081
asure+0.080
Neg logit contributions
ge-0.105
 Esk-0.103
 camps-0.100
rou-0.096
 upon-0.091
Token he
Token ID339
Feature activation-5.47e-01

Pos logit contributions
Her+0.043
gen+0.039
asure+0.036
ough+0.033
 Her+0.032
Neg logit contributions
 Esk-0.059
ge-0.057
 upon-0.057
 camps-0.056
 polar-0.054
Token said
Token ID531
Feature activation+1.20e+00

Pos logit contributions
Her+0.013
gen+0.012
ough+0.011
Then+0.011
�+0.011
Neg logit contributions
yt-0.011
 polar-0.011
 Esk-0.011
 breath-0.011
 upon-0.011
Token.
Token ID13
Feature activation+5.54e-04

Pos logit contributions
Her+0.048
gen+0.039
 Her+0.039
Then+0.038
asure+0.038
Neg logit contributions
ge-0.047
 Esk-0.045
 camps-0.043
rou-0.042
 upon-0.042
Token
Token ID198
Feature activation-2.25e-01

Pos logit contributions
 Esk+0.000
 upon+0.000
 camps+0.000
rou+0.000
grown+0.000
Neg logit contributions
Her-0.000
gen-0.000
asure-0.000
 listened-0.000
 herself-0.000
Token
Token ID198
Feature activation-6.20e-01

Pos logit contributions
Her+0.004
 Her+0.003
gen+0.003
 herself+0.003
asure+0.003
Neg logit contributions
 Esk-0.007
ge-0.007
 camps-0.006
rou-0.006
icer-0.006
Token"
Token ID1
Feature activation-3.50e-01

Pos logit contributions
Her+0.009
 Her+0.009
gen+0.008
asure+0.008
 herself+0.008
Neg logit contributions
 Esk-0.018
ge-0.018
 upon-0.017
rou-0.017
 camps-0.017
TokenBy
Token ID3886
Feature activation+5.88e-01

Pos logit contributions
Her+0.008
:+0.008
 Her+0.008
ages+0.008
ough+0.008
Neg logit contributions
 Esk-0.008
 camps-0.007
 polar-0.006
ge-0.006
 snowball-0.006
Tokene
Token ID68
Feature activation-6.93e+00

Pos logit contributions
 Esk+0.017
ge+0.016
 camps+0.016
icer+0.016
rou+0.015
Neg logit contributions
Her-0.011
 Her-0.009
gen-0.008
asure-0.008
 herself-0.007
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
Her+0.112
 Her+0.083
gen+0.079
 herself+0.076
asure+0.071
Neg logit contributions
 Esk-0.222
ge-0.220
 camps-0.213
rou-0.207
icer-0.202
Token bye
Token ID33847
Feature activation-3.98e-01

Pos logit contributions
Her+0.040
 Her+0.034
gen+0.033
 herself+0.033
asure+0.032
Neg logit contributions
ge-0.034
 Esk-0.034
 camps-0.032
rou-0.031
icer-0.030
Token,
Token ID11
Feature activation-8.35e-01

Pos logit contributions
Her+0.005
gen+0.004
 Her+0.004
:+0.003
asure+0.003
Neg logit contributions
ge-0.014
 Esk-0.013
rou-0.013
 camps-0.013
icer-0.013
Token bye
Token ID33847
Feature activation+4.51e-02

Pos logit contributions
Her+0.021
 Her+0.018
gen+0.017
 herself+0.017
asure+0.017
Neg logit contributions
ge-0.019
 Esk-0.018
 camps-0.017
rou-0.017
icer-0.016
Token!"
Token ID2474
Feature activation-2.19e+00

Pos logit contributions
ge+0.001
 Esk+0.001
rou+0.001
 camps+0.001
icer+0.001
Neg logit contributions
Her-0.001
gen-0.001
 Her-0.001
:-0.001
asure-0.001
Token.
Token ID13
Feature activation-6.56e-01

Pos logit contributions
Her+0.026
 herself+0.020
 Her+0.020
:+0.018
gen+0.017
Neg logit contributions
 Esk-0.045
 camps-0.044
icer-0.043
ge-0.043
rou-0.042
Token
Token ID198
Feature activation-7.63e-01

Pos logit contributions
Her+0.015
gen+0.013
 listened+0.012
 herself+0.012
amed+0.012
Neg logit contributions
 Esk-0.015
 camps-0.015
 upon-0.014
rou-0.014
ge-0.014
Token
Token ID198
Feature activation-1.19e+00

Pos logit contributions
Her+0.016
 Her+0.013
 herself+0.012
gen+0.012
:+0.011
Neg logit contributions
ge-0.022
 Esk-0.022
 camps-0.021
rou-0.020
icer-0.020
Token"
Token ID1
Feature activation-1.18e+00

Pos logit contributions
Her+0.022
 Her+0.019
gen+0.019
 herself+0.018
 Queen+0.018
Neg logit contributions
ge-0.032
 Esk-0.031
 camps-0.030
rou-0.030
 upon-0.029
TokenBy
Token ID3886
Feature activation+1.82e-01

Pos logit contributions
Her+0.024
 Her+0.021
gen+0.021
:+0.020
asure+0.020
Neg logit contributions
 Esk-0.030
 camps-0.029
ge-0.028
rou-0.027
 upon-0.027
Tokene
Token ID68
Feature activation-6.92e+00

Pos logit contributions
 Esk+0.006
ge+0.005
 camps+0.005
rou+0.005
icer+0.005
Neg logit contributions
Her-0.003
 Her-0.003
gen-0.002
asure-0.002
 herself-0.002
Token Sally
Token ID25737
Feature activation-2.49e+00

Pos logit contributions
Her+0.109
 Her+0.081
gen+0.077
 herself+0.074
asure+0.069
Neg logit contributions
ge-0.224
 Esk-0.222
 camps-0.214
rou-0.208
icer-0.204
Token,"
Token ID553
Feature activation-1.28e+00

Pos logit contributions
Her+0.044
 Her+0.035
gen+0.033
 herself+0.032
:+0.031
Neg logit contributions
 Esk-0.075
ge-0.075
 camps-0.071
rou-0.069
icer-0.068
Token said
Token ID531
Feature activation+1.40e-01

Pos logit contributions
gen+0.035
Her+0.031
amed+0.030
ibrarian+0.029
ough+0.029
Neg logit contributions
 polar-0.022
 Esk-0.022
 upon-0.021
 kay-0.019
yt-0.018
Token Tim
Token ID5045
Feature activation-5.27e-01

Pos logit contributions
 Esk+0.002
 polar+0.002
ge+0.002
 breath+0.002
 upon+0.002
Neg logit contributions
Her-0.004
Then-0.004
gen-0.004
asure-0.003
amed-0.003
Token.
Token ID13
Feature activation-4.89e-01

Pos logit contributions
Her+0.013
gen+0.011
Then+0.010
 Her+0.010
asure+0.010
Neg logit contributions
ge-0.013
 Esk-0.012
rou-0.011
 camps-0.011
 upon-0.011
Token and
Token ID290
Feature activation+8.80e-01

Pos logit contributions
Her+0.026
gen+0.023
 Her+0.022
asure+0.021
 counted+0.021
Neg logit contributions
ge-0.024
 upon-0.022
 camps-0.022
rou-0.022
yt-0.021
Token said
Token ID531
Feature activation+1.55e+00

Pos logit contributions
 Esk+0.023
ge+0.022
yt+0.022
rou+0.021
 breath+0.021
Neg logit contributions
Her-0.018
gen-0.016
ough-0.015
asure-0.015
 Her-0.015
Token,
Token ID11
Feature activation+7.13e-01

Pos logit contributions
 upon+0.031
 Esk+0.029
yt+0.027
ge+0.027
rou+0.027
Neg logit contributions
her-0.034
ough-0.034
Her-0.033
gen-0.032
Bro-0.032
Token "
Token ID366
Feature activation-1.46e+00

Pos logit contributions
 upon+0.016
 Esk+0.015
ge+0.015
 asleep+0.014
rou+0.013
Neg logit contributions
Her-0.015
ough-0.014
P-0.014
gen-0.014
amed-0.014
TokenBy
Token ID3886
Feature activation+3.84e-01

Pos logit contributions
Her+0.028
 Her+0.023
gen+0.023
:+0.022
asure+0.022
Neg logit contributions
 Esk-0.040
ge-0.039
 camps-0.038
rou-0.038
icer-0.036
Tokene
Token ID68
Feature activation-6.87e+00

Pos logit contributions
 Esk+0.012
ge+0.011
 camps+0.011
rou+0.011
icer+0.010
Neg logit contributions
Her-0.007
 Her-0.006
gen-0.005
asure-0.005
 herself-0.005
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
Her+0.113
 Her+0.085
gen+0.081
 herself+0.077
asure+0.072
Neg logit contributions
ge-0.218
 Esk-0.217
 camps-0.209
rou-0.202
icer-0.199
Token sur
Token ID969
Feature activation-1.84e+00

Pos logit contributions
Her+0.034
 Her+0.028
gen+0.027
 herself+0.026
asure+0.025
Neg logit contributions
ge-0.045
 Esk-0.043
 camps-0.042
icer-0.040
rou-0.040
Tokenfer
Token ID2232
Feature activation-5.66e+00

Pos logit contributions
Her+0.034
 listened+0.026
gen+0.025
asure+0.024
 counted+0.023
Neg logit contributions
 camps-0.056
 Esk-0.054
ge-0.052
icer-0.050
rou-0.049
Token!
Token ID0
Feature activation-1.72e+00

Pos logit contributions
Her+0.142
 Her+0.118
gen+0.116
 herself+0.113
asure+0.108
Neg logit contributions
 Esk-0.129
ge-0.129
 camps-0.122
rou-0.116
icer-0.114
Token Have
Token ID8192
Feature activation-1.03e+00

Pos logit contributions
Her+0.035
 Her+0.026
 herself+0.026
gen+0.026
Then+0.024
Neg logit contributions
 Esk-0.049
ge-0.046
 camps-0.045
rou-0.045
icer-0.043
Token and
Token ID290
Feature activation-7.42e-01

Pos logit contributions
 upon+0.027
ge+0.027
 breath+0.025
 asleep+0.025
rou+0.024
Neg logit contributions
Her-0.064
P-0.055
ibrarian-0.055
gen-0.054
ily-0.054
Token said
Token ID531
Feature activation+1.35e+00

Pos logit contributions
Her+0.017
gen+0.014
asure+0.014
 Her+0.014
ough+0.014
Neg logit contributions
 Esk-0.018
 camps-0.018
ge-0.018
 breath-0.017
yt-0.017
Token,
Token ID11
Feature activation+6.02e-01

Pos logit contributions
 Esk+0.026
 upon+0.025
ge+0.024
 asleep+0.023
rou+0.023
Neg logit contributions
her-0.030
gen-0.029
Her-0.029
asure-0.027
Ble-0.027
Token "
Token ID366
Feature activation-7.54e-01

Pos logit contributions
 upon+0.014
ge+0.014
 Esk+0.013
 asleep+0.013
 camps+0.013
Neg logit contributions
Her-0.012
gen-0.011
amed-0.011
P-0.011
ily-0.011
TokenBy
Token ID3886
Feature activation+2.51e-01

Pos logit contributions
Her+0.014
 Her+0.012
gen+0.012
:+0.012
asure+0.012
Neg logit contributions
 Esk-0.020
ge-0.020
 camps-0.019
rou-0.019
icer-0.018
Tokene
Token ID68
Feature activation-6.87e+00

Pos logit contributions
 Esk+0.008
ge+0.007
 camps+0.007
rou+0.007
icer+0.007
Neg logit contributions
Her-0.005
 Her-0.004
gen-0.003
asure-0.003
 herself-0.003
Token daddy
Token ID35822
Feature activation-2.65e+00

Pos logit contributions
Her+0.116
 Her+0.088
gen+0.084
 herself+0.080
asure+0.076
Neg logit contributions
ge-0.214
 Esk-0.213
 camps-0.205
rou-0.199
icer-0.195
Token,
Token ID11
Feature activation-9.87e-01

Pos logit contributions
Her+0.078
 Her+0.066
gen+0.064
 herself+0.063
Then+0.061
Neg logit contributions
ge-0.050
 Esk-0.050
 camps-0.047
rou-0.045
icer-0.042
Token have
Token ID423
Feature activation-2.87e+00

Pos logit contributions
Her+0.018
 Her+0.014
gen+0.014
 herself+0.013
asure+0.012
Neg logit contributions
 Esk-0.029
ge-0.029
 camps-0.028
rou-0.027
icer-0.026
Token a
Token ID257
Feature activation+1.54e+00

Pos logit contributions
Her+0.061
gen+0.047
 Her+0.046
asure+0.044
Then+0.042
Neg logit contributions
 Esk-0.076
rou-0.071
 camps-0.070
 upon-0.068
grown-0.068
Token good
Token ID922
Feature activation-6.73e-01

Pos logit contributions
 Esk+0.041
ge+0.040
 camps+0.039
rou+0.038
icer+0.036
Neg logit contributions
Her-0.034
 Her-0.027
gen-0.026
 herself-0.026
asure-0.025
Token parents
Token ID3397
Feature activation+1.16e+00

Pos logit contributions
Her+0.032
 Her+0.025
gen+0.025
asure+0.024
 herself+0.024
Neg logit contributions
ge-0.037
 Esk-0.036
 camps-0.036
rou-0.035
 upon-0.033
Token reminded
Token ID14516
Feature activation+1.58e+00

Pos logit contributions
ge+0.031
 Esk+0.029
rou+0.028
 upon+0.027
 camps+0.027
Neg logit contributions
Her-0.025
gen-0.020
 Her-0.020
 herself-0.019
 Then-0.018
Token him
Token ID683
Feature activation-2.07e+00

Pos logit contributions
ge+0.051
 Esk+0.049
 upon+0.048
grown+0.046
 breath+0.046
Neg logit contributions
Her-0.020
gen-0.019
asure-0.016
 counted-0.016
ages-0.015
Token to
Token ID284
Feature activation-5.45e-01

Pos logit contributions
Her+0.046
gen+0.037
 Her+0.035
asure+0.035
Then+0.035
Neg logit contributions
ge-0.051
rou-0.048
 Esk-0.047
 upon-0.046
 camps-0.044
Token "
Token ID366
Feature activation-9.99e-01

Pos logit contributions
Her+0.012
 Her+0.010
gen+0.010
ages+0.009
 Then+0.009
Neg logit contributions
ge-0.014
 Esk-0.013
rou-0.012
 upon-0.012
 camps-0.012
Tokenbe
Token ID1350
Feature activation-6.82e+00

Pos logit contributions
Her+0.022
 Her+0.019
gen+0.019
:+0.018
 Then+0.017
Neg logit contributions
ge-0.026
 Esk-0.024
rou-0.023
 upon-0.023
icer-0.022
Tokenhave
Token ID14150
Feature activation-1.40e+00

Pos logit contributions
Her+0.179
 Her+0.159
 Queen+0.146
gen+0.145
asure+0.140
Neg logit contributions
ge-0.151
 Esk-0.139
rou-0.137
 upon-0.126
icer-0.123
Token"
Token ID1
Feature activation+1.61e+00

Pos logit contributions
Her+0.034
gen+0.029
 Her+0.028
asure+0.026
 Queen+0.026
Neg logit contributions
ge-0.032
 upon-0.030
rou-0.030
 Esk-0.028
grown-0.027
Token and
Token ID290
Feature activation-1.96e+00

Pos logit contributions
ge+0.036
 upon+0.034
 Esk+0.029
rou+0.029
grown+0.029
Neg logit contributions
gen-0.038
Her-0.038
asure-0.032
 Her-0.032
 Queen-0.031
Token to
Token ID284
Feature activation-2.16e+00

Pos logit contributions
Her+0.041
gen+0.031
Then+0.031
amed+0.030
 Her+0.030
Neg logit contributions
ge-0.052
 Esk-0.049
rou-0.048
 upon-0.047
 camps-0.045
Token play
Token ID711
Feature activation-2.25e+00

Pos logit contributions
Her+0.049
 Her+0.041
gen+0.041
ages+0.041
amed+0.039
Neg logit contributions
ge-0.053
 Esk-0.050
rou-0.047
 camps-0.044
 breath-0.044
Token.
Token ID13
Feature activation-5.65e-01

Pos logit contributions
Her+0.013
 herself+0.011
 Her+0.010
 her+0.009
 she+0.009
Neg logit contributions
 Esk-0.026
icer-0.026
ge-0.025
rou-0.025
 camps-0.024
Token
Token ID220
Feature activation+1.79e+00

Pos logit contributions
Her+0.011
gen+0.010
asure+0.009
 listened+0.009
amed+0.009
Neg logit contributions
 upon-0.014
 camps-0.014
 Esk-0.014
rou-0.013
ge-0.013
Token
Token ID198
Feature activation-4.28e-01

Pos logit contributions
ge+0.048
 Esk+0.047
 camps+0.045
rou+0.045
 upon+0.045
Neg logit contributions
Her-0.035
gen-0.029
 listened-0.027
 Her-0.027
asure-0.027
Token"
Token ID1
Feature activation-6.50e-01

Pos logit contributions
gen+0.009
asure+0.009
Her+0.008
 counted+0.008
 Queen+0.008
Neg logit contributions
 upon-0.009
ge-0.009
 Esk-0.009
 camps-0.009
rou-0.009
TokenBy
Token ID3886
Feature activation+3.84e-01

Pos logit contributions
Her+0.014
gen+0.013
 Her+0.013
:+0.012
asure+0.012
Neg logit contributions
 Esk-0.016
ge-0.015
 camps-0.015
rou-0.014
 upon-0.014
Tokene
Token ID68
Feature activation-6.82e+00

Pos logit contributions
 Esk+0.011
ge+0.011
 camps+0.011
rou+0.010
icer+0.010
Neg logit contributions
Her-0.007
 Her-0.006
gen-0.005
asure-0.005
 counted-0.005
Token Mom
Token ID11254
Feature activation-1.32e+00

Pos logit contributions
Her+0.128
 Her+0.100
gen+0.096
 herself+0.093
asure+0.088
Neg logit contributions
ge-0.199
 Esk-0.199
 camps-0.191
rou-0.185
icer-0.181
Tokenmy
Token ID1820
Feature activation-2.27e+00

Pos logit contributions
Her+0.030
 Her+0.025
 herself+0.024
gen+0.024
asure+0.023
Neg logit contributions
 Esk-0.033
ge-0.032
 camps-0.031
rou-0.030
icer-0.029
Token!"
Token ID2474
Feature activation-2.08e+00

Pos logit contributions
Her+0.055
 Her+0.046
gen+0.045
 herself+0.044
asure+0.042
Neg logit contributions
 Esk-0.053
ge-0.053
 camps-0.050
rou-0.049
icer-0.047
Token she
Token ID673
Feature activation-2.70e-01

Pos logit contributions
gen+0.049
asure+0.047
ough+0.043
her+0.039
wl+0.039
Neg logit contributions
 upon-0.045
 breath-0.042
yt-0.041
 polar-0.040
 asleep-0.037
Token said
Token ID531
Feature activation+1.44e+00

Pos logit contributions
Her+0.006
gen+0.006
asure+0.005
ough+0.005
Then+0.005
Neg logit contributions
yt-0.007
 Esk-0.006
ge-0.006
 upon-0.006
 polar-0.006
Token and
Token ID290
Feature activation-7.32e-01

Pos logit contributions
ge+0.014
 camps+0.013
 Esk+0.013
 upon+0.012
rou+0.012
Neg logit contributions
Her-0.014
gen-0.011
asure-0.010
Then-0.010
 Her-0.010
Token said
Token ID531
Feature activation+8.88e-01

Pos logit contributions
Her+0.017
ough+0.014
gen+0.014
asure+0.013
Then+0.013
Neg logit contributions
 Esk-0.017
 polar-0.017
 camps-0.016
ge-0.016
yt-0.016
Token,
Token ID11
Feature activation+3.25e-01

Pos logit contributions
 Esk+0.019
 upon+0.018
ge+0.018
yt+0.017
 camps+0.017
Neg logit contributions
Her-0.020
her-0.018
Then-0.017
asure-0.017
Bro-0.017
Token "
Token ID366
Feature activation-1.33e+00

Pos logit contributions
 upon+0.008
 Esk+0.007
ge+0.007
rou+0.007
yt+0.007
Neg logit contributions
Her-0.006
gen-0.006
ough-0.006
amed-0.006
P-0.006
TokenBy
Token ID3886
Feature activation+1.68e-01

Pos logit contributions
Her+0.025
 Her+0.022
:+0.022
gen+0.021
asure+0.020
Neg logit contributions
 Esk-0.035
ge-0.034
 camps-0.033
rou-0.032
icer-0.031
Tokene
Token ID68
Feature activation-6.81e+00

Pos logit contributions
 Esk+0.005
ge+0.005
 camps+0.005
rou+0.005
icer+0.005
Neg logit contributions
Her-0.003
 Her-0.002
gen-0.002
asure-0.002
 herself-0.002
Token!
Token ID0
Feature activation-1.75e+00

Pos logit contributions
Her+0.126
 Her+0.099
gen+0.095
 herself+0.091
asure+0.086
Neg logit contributions
ge-0.201
 Esk-0.200
 camps-0.191
rou-0.186
icer-0.182
Token Stay
Token ID16160
Feature activation-2.23e-01

Pos logit contributions
Her+0.037
 herself+0.028
 Her+0.027
gen+0.027
Then+0.026
Neg logit contributions
 Esk-0.048
ge-0.045
 camps-0.045
rou-0.044
icer-0.042
Token safe
Token ID3338
Feature activation-8.62e-01

Pos logit contributions
Her+0.004
gen+0.003
 herself+0.003
 Her+0.003
 then+0.003
Neg logit contributions
ge-0.007
 Esk-0.006
 camps-0.006
rou-0.006
icer-0.006
Token and
Token ID290
Feature activation-7.14e-01

Pos logit contributions
Her+0.021
gen+0.018
 Her+0.017
 herself+0.017
asure+0.016
Neg logit contributions
ge-0.020
 Esk-0.020
 camps-0.018
rou-0.018
icer-0.018
Token warm
Token ID5814
Feature activation+4.53e-01

Pos logit contributions
Her+0.016
 Her+0.013
gen+0.013
 herself+0.012
asure+0.012
Neg logit contributions
 Esk-0.018
ge-0.018
 camps-0.017
rou-0.017
 upon-0.016
Tokenâ
Token ID22940
Feature activation+6.28e-01

Pos logit contributions
Her+0.008
 Her+0.006
gen+0.005
 herself+0.005
Then+0.005
Neg logit contributions
 Esk-0.018
ge-0.018
 camps-0.017
rou-0.017
icer-0.016
Token€
Token ID26391
Feature activation-3.31e+00

Pos logit contributions
 Esk+0.017
ge+0.017
rou+0.016
icer+0.016
 upon+0.015
Neg logit contributions
Her-0.013
 Her-0.010
 Queen-0.009
 herself-0.009
gen-0.009
Token�
Token ID129
Feature activation-9.56e-01

Pos logit contributions
Her+0.062
 Her+0.050
gen+0.047
 herself+0.044
 listened+0.044
Neg logit contributions
ge-0.097
 Esk-0.096
 camps-0.093
rou-0.088
icer-0.086
Token�
Token ID241
Feature activation-6.82e-01

Pos logit contributions
Her+0.021
 Her+0.017
gen+0.016
 herself+0.016
 Queen+0.015
Neg logit contributions
 Esk-0.025
ge-0.024
 camps-0.023
rou-0.023
icer-0.022
TokenBy
Token ID3886
Feature activation+8.03e-01

Pos logit contributions
Her+0.010
 Her+0.007
Then+0.007
gen+0.007
 herself+0.007
Neg logit contributions
 Esk-0.023
ge-0.023
 camps-0.023
yt-0.021
icer-0.021
Tokene
Token ID68
Feature activation-6.79e+00

Pos logit contributions
 Esk+0.024
ge+0.023
 camps+0.022
rou+0.022
icer+0.022
Neg logit contributions
Her-0.015
 Her-0.012
gen-0.011
asure-0.010
 herself-0.010
Token bye
Token ID33847
Feature activation-5.51e-01

Pos logit contributions
Her+0.135
 Her+0.104
gen+0.102
 herself+0.099
Then+0.095
Neg logit contributions
 Esk-0.191
ge-0.188
 camps-0.182
rou-0.179
 polar-0.174
Token,
Token ID11
Feature activation-6.83e-01

Pos logit contributions
Her+0.011
 Her+0.008
gen+0.008
 herself+0.008
asure+0.007
Neg logit contributions
 Esk-0.016
ge-0.016
 camps-0.015
rou-0.015
yt-0.015
Token Tommy
Token ID19919
Feature activation-1.15e+00

Pos logit contributions
Her+0.015
 Her+0.012
gen+0.012
 herself+0.012
asure+0.011
Neg logit contributions
 Esk-0.018
ge-0.018
 camps-0.017
rou-0.016
icer-0.016
Token!
Token ID0
Feature activation-1.09e+00

Pos logit contributions
Her+0.031
 Her+0.026
gen+0.024
 herself+0.024
asure+0.024
Neg logit contributions
 Esk-0.025
ge-0.024
rou-0.023
 camps-0.023
yt-0.022
Token See
Token ID4091
Feature activation-1.79e+00

Pos logit contributions
Her+0.022
gen+0.016
 Her+0.016
 herself+0.016
asure+0.015
Neg logit contributions
 Esk-0.031
ge-0.030
 camps-0.029
rou-0.029
icer-0.028
Token son
Token ID3367
Feature activation-1.45e+00

Pos logit contributions
Her+0.092
 Her+0.074
 herself+0.073
gen+0.072
asure+0.069
Neg logit contributions
 Esk-0.076
ge-0.073
 camps-0.071
 polar-0.068
yt-0.067
Token asked
Token ID1965
Feature activation+1.12e+00

Pos logit contributions
Her+0.034
 Her+0.026
 herself+0.026
gen+0.026
asure+0.024
Neg logit contributions
 Esk-0.037
ge-0.036
 camps-0.036
 polar-0.034
 upon-0.033
Token,
Token ID11
Feature activation-3.92e-01

Pos logit contributions
 Esk+0.027
ge+0.027
 camps+0.025
 upon+0.025
 polar+0.024
Neg logit contributions
Her-0.026
gen-0.021
asure-0.021
Then-0.021
 Her-0.020
Token "
Token ID366
Feature activation-1.68e+00

Pos logit contributions
Her+0.007
gen+0.007
ages+0.006
ough+0.006
amed+0.006
Neg logit contributions
 upon-0.011
 Esk-0.011
ge-0.010
 polar-0.010
 camps-0.009
TokenWhere
Token ID8496
Feature activation-1.37e+00

Pos logit contributions
Her+0.039
 Her+0.033
gen+0.031
 herself+0.030
asure+0.029
Neg logit contributions
 Esk-0.042
ge-0.042
 camps-0.040
rou-0.038
icer-0.037
Token are
Token ID389
Feature activation-6.76e+00

Pos logit contributions
Her+0.032
 Her+0.027
gen+0.026
 herself+0.025
asure+0.024
Neg logit contributions
ge-0.034
 Esk-0.034
 camps-0.032
rou-0.031
icer-0.030
Token we
Token ID356
Feature activation-9.94e-01

Pos logit contributions
Her+0.134
 Her+0.104
gen+0.102
 herself+0.097
Then+0.092
Neg logit contributions
 Esk-0.194
ge-0.190
 camps-0.184
rou-0.174
icer-0.171
Token going
Token ID1016
Feature activation+6.82e-01

Pos logit contributions
Her+0.029
 Her+0.024
gen+0.023
 herself+0.023
Then+0.022
Neg logit contributions
 Esk-0.021
ge-0.020
 camps-0.019
 upon-0.018
rou-0.017
Token today
Token ID1909
Feature activation-8.40e-01

Pos logit contributions
ge+0.019
 Esk+0.019
rou+0.018
yt+0.018
icer+0.018
Neg logit contributions
Her-0.012
:-0.010
asure-0.010
 Her-0.010
gen-0.010
Token?"
Token ID1701
Feature activation-8.99e-01

Pos logit contributions
Her+0.021
 Her+0.018
gen+0.018
 herself+0.017
asure+0.017
Neg logit contributions
ge-0.019
 Esk-0.019
 camps-0.017
rou-0.017
icer-0.016
Token
Token ID220
Feature activation+1.92e+00

Pos logit contributions
amed+0.020
Her+0.020
gen+0.019
 herself+0.018
asure+0.018
Neg logit contributions
 upon-0.017
 polar-0.017
 Esk-0.017
joy-0.016
ge-0.016
Token
Token ID220
Feature activation+1.93e+00

Pos logit contributions
Her+0.007
gen+0.006
asure+0.006
 Her+0.006
 listened+0.006
Neg logit contributions
 Esk-0.010
ge-0.010
 camps-0.009
rou-0.009
 upon-0.009
Token
Token ID198
Feature activation+1.86e-01

Pos logit contributions
 Esk+0.050
ge+0.050
rou+0.047
 upon+0.047
 camps+0.046
Neg logit contributions
Her-0.040
gen-0.033
 listened-0.031
 Her-0.030
 counted-0.030
Token
Token ID198
Feature activation-3.79e-01

Pos logit contributions
 Esk+0.006
ge+0.006
 camps+0.005
rou+0.005
icer+0.005
Neg logit contributions
Her-0.003
 Her-0.003
gen-0.003
 herself-0.002
asure-0.002
Token"
Token ID1
Feature activation-9.51e-01

Pos logit contributions
Her+0.006
 Her+0.005
gen+0.005
 herself+0.004
asure+0.004
Neg logit contributions
ge-0.012
 Esk-0.012
 camps-0.011
rou-0.011
icer-0.011
TokenBy
Token ID3886
Feature activation+4.01e-01

Pos logit contributions
Her+0.021
:+0.020
 Her+0.020
gen+0.019
ages+0.018
Neg logit contributions
 Esk-0.022
ge-0.021
rou-0.020
 camps-0.019
 upon-0.019
Tokene
Token ID68
Feature activation-6.76e+00

Pos logit contributions
 Esk+0.012
ge+0.011
 camps+0.011
rou+0.011
icer+0.010
Neg logit contributions
Her-0.008
 Her-0.006
gen-0.005
asure-0.005
 counted-0.005
Token bye
Token ID33847
Feature activation-9.96e-01

Pos logit contributions
Her+0.133
 Her+0.103
gen+0.099
 herself+0.097
Then+0.094
Neg logit contributions
 Esk-0.191
ge-0.188
 camps-0.182
rou-0.179
yt-0.171
Token,
Token ID11
Feature activation-8.42e-01

Pos logit contributions
Her+0.020
 Her+0.015
gen+0.015
 herself+0.014
Then+0.013
Neg logit contributions
 Esk-0.028
ge-0.028
 camps-0.027
rou-0.026
yt-0.026
Token small
Token ID1402
Feature activation-2.05e+00

Pos logit contributions
Her+0.018
 Her+0.014
gen+0.014
 herself+0.013
asure+0.013
Neg logit contributions
 Esk-0.023
ge-0.023
 camps-0.022
rou-0.021
icer-0.020
Token girl
Token ID2576
Feature activation-3.61e+00

Pos logit contributions
Her+0.036
gen+0.028
 Her+0.028
 herself+0.027
asure+0.025
Neg logit contributions
 Esk-0.061
ge-0.061
 camps-0.058
rou-0.057
icer-0.055
Token!"
Token ID2474
Feature activation-2.01e+00

Pos logit contributions
Her+0.097
 Her+0.083
gen+0.082
 herself+0.080
asure+0.076
Neg logit contributions
ge-0.075
 Esk-0.075
 camps-0.071
rou-0.067
icer-0.066
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
 Esk+0.029
 camps+0.029
ge+0.029
rou+0.027
 upon+0.027
Neg logit contributions
Her-0.028
gen-0.024
asure-0.023
 Her-0.023
ough-0.022
Token said
Token ID531
Feature activation+1.79e+00

Pos logit contributions
 Esk+0.025
yt+0.024
 polar+0.024
 breath+0.023
 upon+0.022
Neg logit contributions
Her-0.030
gen-0.026
ough-0.026
asure-0.025
ocolate-0.025
Token,
Token ID11
Feature activation+8.02e-01

Pos logit contributions
 Esk+0.041
ge+0.038
 upon+0.037
yt+0.036
 camps+0.035
Neg logit contributions
Her-0.040
her-0.035
asure-0.035
gen-0.034
Then-0.033
Token "
Token ID366
Feature activation-9.83e-01

Pos logit contributions
 Esk+0.016
 upon+0.015
ge+0.014
 camps+0.014
 asleep+0.014
Neg logit contributions
Her-0.018
gen-0.017
wl-0.017
ough-0.016
asure-0.016
TokenBy
Token ID3886
Feature activation+4.02e-01

Pos logit contributions
Her+0.021
:+0.019
 Her+0.019
asure+0.018
gen+0.018
Neg logit contributions
 Esk-0.023
ge-0.022
rou-0.022
 camps-0.021
 upon-0.020
Tokene
Token ID68
Feature activation-6.75e+00

Pos logit contributions
 Esk+0.012
ge+0.012
 camps+0.011
rou+0.011
icer+0.011
Neg logit contributions
Her-0.007
 Her-0.006
gen-0.005
asure-0.005
 herself-0.005
Token then
Token ID788
Feature activation-1.31e+00

Pos logit contributions
Her+0.126
 Her+0.098
gen+0.094
 herself+0.091
asure+0.086
Neg logit contributions
 Esk-0.198
ge-0.198
 camps-0.189
rou-0.184
icer-0.179
Token!
Token ID0
Feature activation-1.46e+00

Pos logit contributions
Her+0.030
 Her+0.024
gen+0.023
 herself+0.023
asure+0.022
Neg logit contributions
 Esk-0.033
ge-0.033
 camps-0.032
rou-0.030
icer-0.029
Token Have
Token ID8192
Feature activation-1.23e+00

Pos logit contributions
Her+0.033
 herself+0.025
 Her+0.025
gen+0.025
asure+0.023
Neg logit contributions
 Esk-0.037
 camps-0.035
ge-0.035
rou-0.034
 upon-0.033
Token a
Token ID257
Feature activation+3.01e-02

Pos logit contributions
Her+0.018
 Her+0.013
gen+0.013
 herself+0.012
asure+0.011
Neg logit contributions
 Esk-0.041
ge-0.041
 camps-0.039
rou-0.038
icer-0.037
Token nice
Token ID3621
Feature activation-1.52e+00

Pos logit contributions
 Esk+0.001
ge+0.001
 camps+0.001
rou+0.001
yt+0.001
Neg logit contributions
Her-0.001
 Her-0.001
gen-0.001
 herself-0.001
asure-0.000
Token.
Token ID13
Feature activation-5.72e-01

Pos logit contributions
ge+0.005
 Esk+0.005
 camps+0.004
rou+0.004
yt+0.004
Neg logit contributions
Her-0.004
 Her-0.003
gen-0.003
asure-0.003
 herself-0.003
Token
Token ID198
Feature activation-5.26e-01

Pos logit contributions
Her+0.013
gen+0.011
 listened+0.011
amed+0.010
asure+0.010
Neg logit contributions
 Esk-0.013
 camps-0.013
ge-0.012
 upon-0.012
rou-0.012
Token
Token ID198
Feature activation-8.73e-01

Pos logit contributions
Her+0.010
 Her+0.008
gen+0.007
 herself+0.007
Then+0.007
Neg logit contributions
ge-0.016
 Esk-0.016
 camps-0.015
rou-0.015
icer-0.014
Token"
Token ID1
Feature activation-9.45e-01

Pos logit contributions
Her+0.014
 Her+0.013
gen+0.013
asure+0.012
 herself+0.012
Neg logit contributions
ge-0.025
 Esk-0.024
 camps-0.024
rou-0.023
 upon-0.022
TokenBy
Token ID3886
Feature activation+2.97e-01

Pos logit contributions
Her+0.020
 Her+0.019
:+0.019
ages+0.018
gen+0.018
Neg logit contributions
 Esk-0.023
 camps-0.020
ge-0.020
rou-0.019
 polar-0.018
Tokene
Token ID68
Feature activation-6.74e+00

Pos logit contributions
 Esk+0.009
ge+0.008
 camps+0.008
rou+0.008
icer+0.008
Neg logit contributions
Her-0.006
 Her-0.005
gen-0.004
asure-0.004
 herself-0.004
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
Her+0.119
 Her+0.089
gen+0.086
 herself+0.084
Then+0.079
Neg logit contributions
 Esk-0.205
ge-0.203
 camps-0.197
rou-0.192
yt-0.186
Token Sam
Token ID3409
Feature activation-1.70e+00

Pos logit contributions
Her+0.039
 Her+0.030
Then+0.029
gen+0.029
 herself+0.028
Neg logit contributions
 Esk-0.043
 camps-0.041
rou-0.039
ge-0.039
 polar-0.039
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
Her+0.034
 Her+0.027
gen+0.027
 herself+0.026
:+0.026
Neg logit contributions
 Esk-0.047
ge-0.047
 camps-0.045
rou-0.043
icer-0.043
Token Mia
Token ID32189
Feature activation-1.70e+00

Pos logit contributions
Her+0.037
Then+0.031
amed+0.029
asure+0.028
her+0.028
Neg logit contributions
 Esk-0.039
 polar-0.036
 camps-0.035
 upon-0.035
ge-0.035
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
Her+0.041
 Her+0.034
gen+0.033
 herself+0.032
asure+0.031
Neg logit contributions
 Esk-0.041
ge-0.041
 camps-0.039
rou-0.037
icer-0.036
Token she
Token ID673
Feature activation-1.50e-01

Pos logit contributions
gen+0.023
ough+0.021
asure+0.021
Her+0.020
amed+0.018
Neg logit contributions
 upon-0.024
 Esk-0.023
 breath-0.023
 polar-0.023
ge-0.023
Token replied
Token ID8712
Feature activation+1.02e+00

Pos logit contributions
Her+0.004
gen+0.004
ough+0.004
ocolate+0.004
�+0.003
Neg logit contributions
yt-0.003
 upon-0.002
 Esk-0.002
rou-0.002
ge-0.002
Token.
Token ID13
Feature activation-2.33e-01

Pos logit contributions
ge+0.021
 Esk+0.018
 breath+0.018
yt+0.017
 upon+0.016
Neg logit contributions
Her-0.026
asure-0.025
gen-0.025
Then-0.024
P-0.023
Token "
Token ID366
Feature activation-1.42e+00

Pos logit contributions
Her+0.005
gen+0.005
 listened+0.005
P+0.005
asure+0.005
Neg logit contributions
 upon-0.005
run-0.004
joy-0.004
rou-0.004
ge-0.004
TokenLet
Token ID5756
Feature activation-1.23e-01

Pos logit contributions
Her+0.029
 Her+0.027
:+0.026
asure+0.024
gen+0.024
Neg logit contributions
ge-0.033
 Esk-0.033
rou-0.033
icer-0.031
 upon-0.031
Token's
Token ID338
Feature activation-6.74e+00

Pos logit contributions
Her+0.002
asure+0.002
edly+0.002
gen+0.002
 Then+0.002
Neg logit contributions
 Esk-0.003
 upon-0.003
ge-0.003
yt-0.003
 camps-0.003
Token do
Token ID466
Feature activation-4.91e+00

Pos logit contributions
Her+0.167
 Her+0.136
gen+0.129
 herself+0.125
Then+0.124
Neg logit contributions
 Esk-0.159
ge-0.155
 camps-0.151
rou-0.141
 upon-0.137
Token it
Token ID340
Feature activation-3.05e+00

Pos logit contributions
Her+0.121
 Her+0.100
asure+0.092
gen+0.091
amed+0.087
Neg logit contributions
 Esk-0.116
ge-0.114
 camps-0.108
rou-0.108
 upon-0.101
Token again
Token ID757
Feature activation-2.06e+00

Pos logit contributions
Her+0.099
 Her+0.086
gen+0.082
asure+0.079
Then+0.078
Neg logit contributions
 Esk-0.051
ge-0.050
 camps-0.047
rou-0.044
 upon-0.044
Token!"
Token ID2474
Feature activation-1.71e+00

Pos logit contributions
Her+0.055
 Her+0.047
gen+0.044
asure+0.043
 herself+0.043
Neg logit contributions
ge-0.044
 Esk-0.044
 camps-0.042
rou-0.040
 upon-0.039
Token
Token ID198
Feature activation+5.67e-02

Pos logit contributions
gen+0.043
Her+0.041
asure+0.040
amed+0.037
ough+0.037
Neg logit contributions
 upon-0.030
ge-0.028
joy-0.025
 Esk-0.025
 camps-0.025
Token and
Token ID290
Feature activation-2.73e-01

Pos logit contributions
ge+0.045
yt+0.041
 breath+0.041
 upon+0.039
joy+0.038
Neg logit contributions
Her-0.082
asure-0.070
gen-0.069
P-0.069
vals-0.068
Token said
Token ID531
Feature activation+1.60e+00

Pos logit contributions
Her+0.007
 Her+0.005
gen+0.005
asure+0.005
ough+0.005
Neg logit contributions
 Esk-0.007
 breath-0.006
ge-0.006
 polar-0.006
 camps-0.006
Token,
Token ID11
Feature activation+4.93e-01

Pos logit contributions
 Esk+0.031
 upon+0.029
ge+0.029
yt+0.028
 asleep+0.028
Neg logit contributions
Her-0.037
her-0.035
asure-0.034
Then-0.033
wl-0.033
Token "
Token ID366
Feature activation-1.06e+00

Pos logit contributions
 upon+0.011
 Esk+0.011
ge+0.010
 asleep+0.010
 camps+0.010
Neg logit contributions
Her-0.010
gen-0.009
wl-0.009
asure-0.009
ough-0.009
TokenBy
Token ID3886
Feature activation+5.65e-01

Pos logit contributions
Her+0.022
 Her+0.020
:+0.020
asure+0.019
gen+0.019
Neg logit contributions
 Esk-0.026
ge-0.026
rou-0.024
 camps-0.024
 upon-0.023
Tokene
Token ID68
Feature activation-6.74e+00

Pos logit contributions
 Esk+0.017
ge+0.016
 camps+0.016
rou+0.015
icer+0.015
Neg logit contributions
Her-0.010
 Her-0.009
gen-0.007
asure-0.007
 herself-0.007
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
Her+0.135
 Her+0.106
gen+0.102
 herself+0.100
asure+0.095
Neg logit contributions
 Esk-0.189
ge-0.187
 camps-0.181
rou-0.176
yt-0.169
Token it
Token ID340
Feature activation-3.44e+00

Pos logit contributions
Her+0.030
 Her+0.025
gen+0.024
 herself+0.023
asure+0.023
Neg logit contributions
ge-0.035
 Esk-0.035
 camps-0.033
rou-0.032
icer-0.031
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
Her+0.082
 Her+0.066
 herself+0.062
gen+0.062
asure+0.060
Neg logit contributions
 Esk-0.086
ge-0.083
 camps-0.081
rou-0.079
icer-0.075
Token great
Token ID1049
Feature activation-1.87e+00

Pos logit contributions
Her+0.021
 Her+0.017
gen+0.017
 herself+0.016
asure+0.015
Neg logit contributions
ge-0.030
 Esk-0.029
 camps-0.028
rou-0.027
icer-0.026
Token to
Token ID284
Feature activation+1.26e+00

Pos logit contributions
Her+0.047
 Her+0.041
gen+0.040
 herself+0.040
:+0.038
Neg logit contributions
ge-0.042
 Esk-0.041
 camps-0.038
icer-0.038
rou-0.037
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
Her+0.015
 Her+0.013
gen+0.012
asure+0.012
ough+0.012
Neg logit contributions
 Esk-0.012
ge-0.011
 camps-0.011
rou-0.011
 upon-0.011
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
 Esk+0.003
ge+0.003
 camps+0.003
rou+0.003
icer+0.003
Neg logit contributions
Her-0.002
gen-0.002
 Her-0.002
 herself-0.002
asure-0.002
Token
Token ID198
Feature activation-6.11e-01

Pos logit contributions
Her+0.004
 Her+0.002
:+0.002
Then+0.002
 Then+0.002
Neg logit contributions
 Esk-0.010
ge-0.010
 camps-0.010
yt-0.010
rou-0.010
Token"
Token ID1
Feature activation-2.46e-01

Pos logit contributions
Her+0.010
 Her+0.009
gen+0.008
asure+0.008
 herself+0.008
Neg logit contributions
ge-0.019
 Esk-0.018
 camps-0.017
rou-0.017
 upon-0.017
TokenBy
Token ID3886
Feature activation+5.96e-01

Pos logit contributions
:+0.006
 Her+0.006
Her+0.006
ages+0.006
asure+0.006
Neg logit contributions
 Esk-0.005
 camps-0.004
 snowball-0.004
 polar-0.004
ge-0.004
Tokene
Token ID68
Feature activation-6.68e+00

Pos logit contributions
 Esk+0.018
ge+0.017
 camps+0.016
icer+0.016
rou+0.016
Neg logit contributions
Her-0.011
 Her-0.009
gen-0.008
asure-0.008
 herself-0.008
Token-
Token ID12
Feature activation-4.74e+00

Pos logit contributions
Her+0.109
 Her+0.080
gen+0.077
 herself+0.075
Then+0.069
Neg logit contributions
 Esk-0.212
ge-0.210
 camps-0.204
rou-0.198
icer-0.193
Tokenbye
Token ID16390
Feature activation-1.27e+00

Pos logit contributions
Her+0.084
 Her+0.069
 herself+0.063
gen+0.062
 Then+0.059
Neg logit contributions
ge-0.138
rou-0.136
 Esk-0.132
 upon-0.124
 camps-0.122
Token,
Token ID11
Feature activation-7.29e-01

Pos logit contributions
Her+0.019
 Her+0.014
gen+0.013
 herself+0.013
asure+0.012
Neg logit contributions
ge-0.041
 Esk-0.041
 camps-0.040
rou-0.039
icer-0.038
Token monkey
Token ID21657
Feature activation-4.76e+00

Pos logit contributions
Her+0.015
 Her+0.012
gen+0.012
 herself+0.011
asure+0.011
Neg logit contributions
 Esk-0.020
ge-0.020
 camps-0.019
rou-0.019
 polar-0.018
Token!"
Token ID2474
Feature activation-2.04e+00

Pos logit contributions
Her+0.135
 Her+0.116
gen+0.113
 herself+0.111
asure+0.107
Neg logit contributions
 Esk-0.093
ge-0.093
 camps-0.087
rou-0.083
icer-0.080
Token.
Token ID13
Feature activation-4.47e-02

Pos logit contributions
Her+0.006
 Her+0.005
gen+0.005
 herself+0.005
asure+0.005
Neg logit contributions
ge-0.007
 Esk-0.007
 camps-0.007
rou-0.007
yt-0.006
Token
Token ID198
Feature activation-3.69e-01

Pos logit contributions
Her+0.001
 Her+0.001
gen+0.001
 herself+0.001
asure+0.001
Neg logit contributions
ge-0.001
 Esk-0.001
 camps-0.001
rou-0.001
icer-0.001
Token
Token ID198
Feature activation-7.06e-01

Pos logit contributions
Her+0.005
 Her+0.004
 herself+0.003
 Then+0.003
:+0.003
Neg logit contributions
ge-0.013
 Esk-0.013
 camps-0.013
yt-0.012
rou-0.012
Token"
Token ID1
Feature activation-2.33e-01

Pos logit contributions
Her+0.013
 Her+0.011
gen+0.011
 herself+0.010
asure+0.010
Neg logit contributions
ge-0.019
 Esk-0.019
 camps-0.018
rou-0.018
 upon-0.018
TokenBy
Token ID3886
Feature activation+6.68e-01

Pos logit contributions
Her+0.005
 Her+0.005
gen+0.004
:+0.004
asure+0.004
Neg logit contributions
 Esk-0.006
ge-0.005
 camps-0.005
icer-0.005
rou-0.005
Tokene
Token ID68
Feature activation-6.68e+00

Pos logit contributions
 Esk+0.021
ge+0.020
 camps+0.020
rou+0.019
icer+0.019
Neg logit contributions
Her-0.011
 Her-0.009
gen-0.008
 herself-0.008
asure-0.007
Token,
Token ID11
Feature activation-1.26e+00

Pos logit contributions
Her+0.104
 Her+0.078
gen+0.074
 herself+0.069
asure+0.067
Neg logit contributions
ge-0.218
 Esk-0.215
 camps-0.207
rou-0.201
icer-0.198
Token goat
Token ID26917
Feature activation-2.18e+00

Pos logit contributions
Her+0.027
 Her+0.021
gen+0.020
 herself+0.020
Then+0.019
Neg logit contributions
 Esk-0.035
ge-0.034
 camps-0.033
rou-0.032
icer-0.031
Token!"
Token ID2474
Feature activation-2.18e+00

Pos logit contributions
Her+0.054
gen+0.047
 Her+0.046
:+0.045
 herself+0.045
Neg logit contributions
ge-0.049
 Esk-0.048
 camps-0.046
rou-0.043
icer-0.043
Token Tom
Token ID4186
Feature activation-1.42e+00

Pos logit contributions
Her+0.041
gen+0.034
 Her+0.031
asure+0.030
 herself+0.030
Neg logit contributions
 Esk-0.062
ge-0.059
 camps-0.059
 upon-0.056
rou-0.056
Token and
Token ID290
Feature activation-9.49e-01

Pos logit contributions
Her+0.028
gen+0.023
 Her+0.021
 herself+0.020
asure+0.020
Neg logit contributions
 Esk-0.039
ge-0.039
 camps-0.037
rou-0.036
 polar-0.036
Token and
Token ID290
Feature activation-4.43e-01

Pos logit contributions
 Esk+0.008
ge+0.008
 camps+0.008
rou+0.008
icer+0.007
Neg logit contributions
Her-0.006
 Her-0.005
 herself-0.004
gen-0.004
:-0.004
Token asked
Token ID1965
Feature activation+1.36e+00

Pos logit contributions
Her+0.010
gen+0.008
ough+0.008
asure+0.008
Bro+0.008
Neg logit contributions
 Esk-0.011
ge-0.010
 upon-0.010
 camps-0.010
yt-0.010
Token,
Token ID11
Feature activation-7.39e-02

Pos logit contributions
 Esk+0.034
ge+0.032
 upon+0.031
 camps+0.031
 polar+0.031
Neg logit contributions
Her-0.030
gen-0.026
Then-0.026
asure-0.025
amed-0.025
Token "
Token ID366
Feature activation-1.72e+00

Pos logit contributions
gen+0.002
Her+0.002
amed+0.001
vals+0.001
wl+0.001
Neg logit contributions
 upon-0.002
 Esk-0.002
ge-0.002
 polar-0.002
 camps-0.001
TokenWhere
Token ID8496
Feature activation-1.41e+00

Pos logit contributions
Her+0.036
 Her+0.028
gen+0.027
 herself+0.027
Then+0.026
Neg logit contributions
ge-0.047
 Esk-0.047
 camps-0.044
rou-0.044
icer-0.042
Token are
Token ID389
Feature activation-6.67e+00

Pos logit contributions
Her+0.027
 Her+0.023
 listened+0.022
gen+0.022
:+0.021
Neg logit contributions
ge-0.041
 Esk-0.039
 camps-0.038
icer-0.037
rou-0.037
Token we
Token ID356
Feature activation-9.62e-01

Pos logit contributions
Her+0.129
 Her+0.103
gen+0.099
 herself+0.096
asure+0.093
Neg logit contributions
ge-0.191
 Esk-0.187
 camps-0.180
rou-0.178
icer-0.172
Token going
Token ID1016
Feature activation+2.91e-01

Pos logit contributions
Her+0.020
gen+0.017
 Her+0.017
asure+0.017
 listened+0.017
Neg logit contributions
ge-0.025
rou-0.024
 Esk-0.023
icer-0.022
 camps-0.022
Token?"
Token ID1701
Feature activation-1.03e+00

Pos logit contributions
ge+0.009
rou+0.009
 Esk+0.008
icer+0.008
yt+0.008
Neg logit contributions
Her-0.005
:-0.004
 then-0.004
asure-0.004
 Her-0.004
Token His
Token ID2399
Feature activation-8.84e-01

Pos logit contributions
amed+0.023
jected+0.022
gen+0.022
keys+0.022
ibrarian+0.022
Neg logit contributions
 upon-0.022
 polar-0.020
 Esk-0.018
 camps-0.017
yt-0.017
Token mom
Token ID1995
Feature activation+2.23e+00

Pos logit contributions
Her+0.010
 Her+0.007
 herself+0.006
gen+0.006
:+0.005
Neg logit contributions
ge-0.033
 Esk-0.031
rou-0.031
 camps-0.030
icer-0.030
Token and
Token ID290
Feature activation-1.25e-01

Pos logit contributions
Her+0.000
gen+0.000
 Her+0.000
asure+0.000
ough+0.000
Neg logit contributions
ge-0.000
 Esk-0.000
 camps-0.000
rou-0.000
yt-0.000
Token said
Token ID531
Feature activation+9.34e-01

Pos logit contributions
Her+0.003
ough+0.003
gen+0.003
Then+0.002
 Her+0.002
Neg logit contributions
 upon-0.003
ge-0.003
yt-0.003
rou-0.003
 Esk-0.003
Token,
Token ID11
Feature activation+4.65e-01

Pos logit contributions
 upon+0.019
joy+0.018
 Esk+0.018
rou+0.018
yt+0.018
Neg logit contributions
Her-0.020
gen-0.019
ough-0.019
her-0.018
Then-0.018
Token "
Token ID366
Feature activation-1.21e+00

Pos logit contributions
 upon+0.011
ge+0.010
 Esk+0.010
rou+0.010
joy+0.010
Neg logit contributions
Her-0.010
ough-0.009
P-0.009
gen-0.009
amed-0.008
TokenBy
Token ID3886
Feature activation+1.49e-01

Pos logit contributions
Her+0.025
 Her+0.021
gen+0.021
:+0.020
asure+0.019
Neg logit contributions
ge-0.031
 Esk-0.031
 camps-0.029
rou-0.029
 upon-0.028
Tokene
Token ID68
Feature activation-6.67e+00

Pos logit contributions
 Esk+0.004
ge+0.004
 camps+0.004
rou+0.004
icer+0.004
Neg logit contributions
Her-0.003
 Her-0.002
gen-0.002
asure-0.002
 herself-0.002
Token Polly
Token ID36898
Feature activation-2.80e+00

Pos logit contributions
Her+0.124
 Her+0.095
gen+0.093
 herself+0.090
Then+0.086
Neg logit contributions
 Esk-0.195
ge-0.193
 camps-0.187
rou-0.183
yt-0.176
Token!
Token ID0
Feature activation-1.64e+00

Pos logit contributions
Her+0.068
 Her+0.055
gen+0.054
 herself+0.051
Then+0.051
Neg logit contributions
ge-0.067
 Esk-0.066
 camps-0.064
rou-0.062
yt-0.060
Token I
Token ID314
Feature activation+3.53e-01

Pos logit contributions
Her+0.035
 herself+0.026
gen+0.026
 Her+0.025
Then+0.025
Neg logit contributions
 Esk-0.044
ge-0.041
 camps-0.041
rou-0.041
 upon-0.039
Token'll
Token ID1183
Feature activation-7.67e-02

Pos logit contributions
ge+0.008
 Esk+0.008
 camps+0.008
icer+0.008
rou+0.007
Neg logit contributions
Her-0.008
gen-0.007
 Her-0.007
 listened-0.007
 counted-0.007
Token see
Token ID766
Feature activation-1.69e+00

Pos logit contributions
Her+0.002
 Her+0.001
gen+0.001
 herself+0.001
asure+0.001
Neg logit contributions
 Esk-0.002
ge-0.002
 camps-0.002
rou-0.002
icer-0.002
Token on
Token ID319
Feature activation+7.91e-01

Pos logit contributions
ge+0.044
 Esk+0.043
 camps+0.042
rou+0.040
icer+0.039
Neg logit contributions
Her-0.030
 herself-0.023
 Her-0.023
gen-0.022
 Then-0.021
Token the
Token ID262
Feature activation-9.12e-02

Pos logit contributions
 Esk+0.022
ge+0.022
 camps+0.021
icer+0.020
rou+0.020
Neg logit contributions
Her-0.016
 Her-0.013
gen-0.012
asure-0.012
amed-0.011
Token roof
Token ID9753
Feature activation-2.12e-01

Pos logit contributions
Her+0.002
 Her+0.001
gen+0.001
 herself+0.001
asure+0.001
Neg logit contributions
ge-0.003
 Esk-0.003
 camps-0.002
rou-0.002
icer-0.002
Token and
Token ID290
Feature activation-1.32e+00

Pos logit contributions
Her+0.005
 Her+0.004
gen+0.004
 herself+0.004
asure+0.004
Neg logit contributions
ge-0.005
 Esk-0.005
 camps-0.005
rou-0.005
icer-0.005
Token sun
Token ID4252
Feature activation-1.09e+00

Pos logit contributions
Her+0.034
 Her+0.028
gen+0.027
asure+0.027
Then+0.026
Neg logit contributions
 Esk-0.029
ge-0.028
 camps-0.027
rou-0.026
 upon-0.025
Tokenbat
Token ID8664
Feature activation+7.07e+00

Pos logit contributions
Her+0.026
 Her+0.021
gen+0.020
Then+0.020
asure+0.019
Neg logit contributions
 Esk-0.027
ge-0.027
 camps-0.026
rou-0.025
 upon-0.024
Tokenhe
Token ID258
Feature activation-8.18e-01

Pos logit contributions
ge+0.098
 Esk+0.095
 camps+0.085
rou+0.081
icer+0.077
Neg logit contributions
Her-0.241
 Her-0.212
gen-0.211
 herself-0.203
asure-0.200
Token again
Token ID757
Feature activation-1.40e+00

Pos logit contributions
Her+0.020
 Her+0.016
asure+0.016
gen+0.016
amed+0.014
Neg logit contributions
ge-0.020
 Esk-0.020
 camps-0.018
rou-0.017
yt-0.017
Token.
Token ID13
Feature activation-2.90e-01

Pos logit contributions
Her+0.037
 Her+0.031
gen+0.030
asure+0.030
amed+0.028
Neg logit contributions
ge-0.032
 Esk-0.030
 camps-0.029
rou-0.028
yt-0.027
Token<|endoftext|>
Token ID50256
Feature activation+4.96e+00

Pos logit contributions
Her+0.007
gen+0.005
asure+0.005
 herself+0.005
amed+0.005
Neg logit contributions
 Esk-0.007
 camps-0.007
ge-0.007
 upon-0.006
rou-0.006
TokenOnce
Token ID7454
Feature activation+3.08e+00

Pos logit contributions
ge+0.168
 Esk+0.159
rou+0.158
 camps+0.153
yt+0.152
Neg logit contributions
Her-0.073
 Her-0.061
 Then-0.057
:-0.057
 herself-0.055
Token she
Token ID673
Feature activation+4.26e+00

Pos logit contributions
ge+0.010
 Esk+0.010
 camps+0.009
rou+0.009
 upon+0.009
Neg logit contributions
Her-0.008
gen-0.006
 Her-0.006
asure-0.006
Then-0.006
Token could
Token ID714
Feature activation+3.04e+00

Pos logit contributions
 Esk+0.106
ge+0.106
 camps+0.101
rou+0.098
 upon+0.095
Neg logit contributions
Her-0.100
 Her-0.081
gen-0.078
 herself-0.076
asure-0.075
Token have
Token ID423
Feature activation-1.04e+00

Pos logit contributions
ge+0.071
 Esk+0.071
 camps+0.067
rou+0.064
 upon+0.062
Neg logit contributions
Her-0.074
 Her-0.061
gen-0.060
asure-0.057
 herself-0.057
Token in
Token ID287
Feature activation-5.89e-01

Pos logit contributions
Her+0.028
gen+0.024
 Her+0.023
Then+0.022
asure+0.022
Neg logit contributions
ge-0.021
 Esk-0.020
 upon-0.020
grown-0.019
 camps-0.019
Token the
Token ID262
Feature activation+5.84e-01

Pos logit contributions
Her+0.010
 Her+0.008
gen+0.007
 herself+0.007
asure+0.007
Neg logit contributions
ge-0.018
 Esk-0.018
 camps-0.017
rou-0.017
icer-0.016
Token<|endoftext|>
Token ID50256
Feature activation+5.55e+00

Pos logit contributions
 Esk+0.013
 camps+0.013
ge+0.012
rou+0.012
icer+0.012
Neg logit contributions
Her-0.015
 Her-0.012
gen-0.012
 herself-0.012
asure-0.012
TokenOnce
Token ID7454
Feature activation+3.65e+00

Pos logit contributions
ge+0.157
 Esk+0.149
rou+0.146
 camps+0.143
yt+0.141
Neg logit contributions
Her-0.114
 Her-0.097
 herself-0.090
:-0.089
 Then-0.089
Token upon
Token ID2402
Feature activation-2.48e-01

Pos logit contributions
 Esk+0.052
ge+0.042
 camps+0.041
 polar+0.038
rou+0.038
Neg logit contributions
Her-0.129
:-0.122
 Her-0.121
 she-0.120
 her-0.117
Token a
Token ID257
Feature activation-1.48e+00

Pos logit contributions
Her+0.002
 Her+0.001
 herself+0.001
gen+0.001
 Then+0.001
Neg logit contributions
ge-0.010
 Esk-0.010
 camps-0.010
rou-0.010
yt-0.010
Token time
Token ID640
Feature activation-4.12e-01

Pos logit contributions
Her+0.031
 Her+0.025
gen+0.023
Then+0.023
 listened+0.022
Neg logit contributions
ge-0.041
 Esk-0.041
rou-0.038
 camps-0.037
 polar-0.036
Token,
Token ID11
Feature activation+2.51e-01

Pos logit contributions
Her+0.007
gen+0.006
 Her+0.005
asure+0.005
 herself+0.005
Neg logit contributions
 Esk-0.012
ge-0.012
 upon-0.012
 camps-0.012
rou-0.011
Token They
Token ID1119
Feature activation+7.15e-01

Pos logit contributions
 Esk+0.004
ge+0.004
 camps+0.004
icer+0.004
 upon+0.004
Neg logit contributions
Her-0.005
gen-0.004
amed-0.004
 listened-0.004
asure-0.004
Token felt
Token ID2936
Feature activation+9.55e-02

Pos logit contributions
ge+0.021
 Esk+0.020
 camps+0.020
rou+0.019
icer+0.019
Neg logit contributions
Her-0.014
 Her-0.011
gen-0.010
 herself-0.010
asure-0.010
Token sick
Token ID6639
Feature activation+3.34e+00

Pos logit contributions
 Esk+0.002
ge+0.002
 camps+0.002
rou+0.002
 polar+0.002
Neg logit contributions
Her-0.002
 Her-0.002
gen-0.002
asure-0.002
 herself-0.002
Token in
Token ID287
Feature activation+8.72e-01

Pos logit contributions
 Esk+0.080
ge+0.079
 camps+0.076
rou+0.072
yt+0.070
Neg logit contributions
Her-0.080
 Her-0.066
gen-0.065
asure-0.062
Then-0.060
Token their
Token ID511
Feature activation+1.08e+00

Pos logit contributions
rou+0.033
ge+0.033
 Esk+0.032
 camps+0.032
yt+0.031
Neg logit contributions
Her-0.010
 herself-0.007
 her-0.007
 Her-0.006
gen-0.004
Token tum
Token ID11814
Feature activation+5.57e+00

Pos logit contributions
ge+0.028
 Esk+0.027
rou+0.025
 camps+0.025
yt+0.024
Neg logit contributions
Her-0.024
 Her-0.021
gen-0.020
 herself-0.019
 Then-0.018
Tokenm
Token ID76
Feature activation+1.40e+00

Pos logit contributions
ge+0.089
 Esk+0.085
rou+0.078
icer+0.077
 upon+0.074
Neg logit contributions
Her-0.174
gen-0.159
 Her-0.151
 Queen-0.147
asure-0.145
Tokenies
Token ID444
Feature activation+2.81e+00

Pos logit contributions
 Esk+0.020
ge+0.020
 camps+0.017
rou+0.017
icer+0.016
Neg logit contributions
Her-0.048
 Her-0.042
gen-0.041
 herself-0.040
asure-0.040
Token.
Token ID13
Feature activation+3.65e-01

Pos logit contributions
 Esk+0.084
ge+0.083
 camps+0.081
rou+0.080
icer+0.079
Neg logit contributions
Her-0.050
 Her-0.039
 herself-0.039
gen-0.036
:-0.035
Token They
Token ID1119
Feature activation+9.86e-01

Pos logit contributions
 Esk+0.008
ge+0.008
 camps+0.007
 upon+0.007
icer+0.007
Neg logit contributions
Her-0.008
gen-0.007
amed-0.007
asure-0.007
 counted-0.007
Token dropped
Token ID5710
Feature activation-1.39e-01

Pos logit contributions
ge+0.030
 Esk+0.030
 camps+0.029
rou+0.028
icer+0.028
Neg logit contributions
Her-0.018
 Her-0.014
gen-0.013
 herself-0.013
 listened-0.012
Token their
Token ID511
Feature activation-1.02e+00

Pos logit contributions
 Esk+0.037
ge+0.034
rou+0.033
 camps+0.033
grown+0.031
Neg logit contributions
Her-0.035
 Her-0.027
amed-0.027
asure-0.027
Then-0.027
Token unique
Token ID3748
Feature activation-1.22e+00

Pos logit contributions
Her+0.029
 Her+0.024
gen+0.022
 herself+0.022
asure+0.022
Neg logit contributions
 Esk-0.021
ge-0.020
 camps-0.020
rou-0.019
 upon-0.018
Token paint
Token ID7521
Feature activation-9.66e-01

Pos logit contributions
Her+0.032
 Her+0.026
gen+0.025
 herself+0.024
asure+0.024
Neg logit contributions
 Esk-0.028
ge-0.028
 camps-0.027
rou-0.026
yt-0.025
Token mix
Token ID5022
Feature activation-2.15e+00

Pos logit contributions
Her+0.027
 Her+0.022
gen+0.022
asure+0.021
Then+0.021
Neg logit contributions
 camps-0.019
 Esk-0.019
ge-0.019
yt-0.019
rou-0.018
Token!
Token ID0
Feature activation-3.85e-01

Pos logit contributions
Her+0.055
 Her+0.046
gen+0.043
asure+0.042
Then+0.039
Neg logit contributions
ge-0.050
 Esk-0.049
 camps-0.048
rou-0.047
yt-0.045
Token<|endoftext|>
Token ID50256
Feature activation+5.60e+00

Pos logit contributions
Her+0.009
gen+0.007
 herself+0.007
 listened+0.007
asure+0.007
Neg logit contributions
 camps-0.009
 Esk-0.009
ge-0.009
rou-0.009
icer-0.009
TokenOnce
Token ID7454
Feature activation+3.26e+00

Pos logit contributions
ge+0.190
 Esk+0.182
rou+0.178
 camps+0.177
yt+0.172
Neg logit contributions
Her-0.083
 Her-0.067
 herself-0.060
 Then-0.058
:-0.058
Token upon
Token ID2402
Feature activation-1.43e-01

Pos logit contributions
 Esk+0.041
ge+0.036
 camps+0.033
rou+0.030
 polar+0.028
Neg logit contributions
Her-0.119
 Her-0.111
:-0.111
 she-0.106
 her-0.105
Token a
Token ID257
Feature activation-1.91e+00

Pos logit contributions
Her+0.001
 herself+0.000
 Her+0.000
 her+0.000
gen+0.000
Neg logit contributions
ge-0.006
 Esk-0.006
 camps-0.006
rou-0.006
yt-0.006
Token time
Token ID640
Feature activation-8.30e-01

Pos logit contributions
Her+0.040
 Her+0.032
gen+0.030
Then+0.029
 listened+0.028
Neg logit contributions
ge-0.054
 Esk-0.053
rou-0.048
 camps-0.047
 polar-0.046
Token there
Token ID612
Feature activation+2.16e+00

Pos logit contributions
Her+0.013
gen+0.010
 Her+0.010
 herself+0.009
asure+0.008
Neg logit contributions
 Esk-0.027
ge-0.026
 camps-0.025
rou-0.025
 upon-0.024
Token She
Token ID1375
Feature activation+1.33e+00

Pos logit contributions
Her+0.001
gen+0.001
 listened+0.001
amed+0.001
asure+0.001
Neg logit contributions
 Esk-0.002
 upon-0.002
 camps-0.002
ge-0.002
grown-0.002
Token puts
Token ID7584
Feature activation+5.59e-01

Pos logit contributions
 Esk+0.029
 camps+0.028
ge+0.028
rou+0.027
icer+0.026
Neg logit contributions
Her-0.034
 Her-0.028
gen-0.028
asure-0.027
 herself-0.026
Token on
Token ID319
Feature activation+1.79e-01

Pos logit contributions
icer+0.014
 Esk+0.014
 camps+0.013
ge+0.013
 breath+0.013
Neg logit contributions
Her-0.012
gen-0.010
asure-0.010
 Her-0.010
Then-0.010
Token her
Token ID607
Feature activation+8.53e-01

Pos logit contributions
 camps+0.005
ge+0.005
 Esk+0.005
 breath+0.005
icer+0.005
Neg logit contributions
Her-0.003
asure-0.002
gen-0.002
amed-0.002
Then-0.002
Token p
Token ID279
Feature activation+2.41e+00

Pos logit contributions
 camps+0.016
 Esk+0.016
 upon+0.015
 breath+0.015
icer+0.014
Neg logit contributions
Her-0.024
gen-0.021
asure-0.020
amed-0.019
ough-0.019
Tokenaj
Token ID1228
Feature activation+7.24e+00

Pos logit contributions
ge+0.018
 Esk+0.017
 camps+0.016
rou+0.015
icer+0.013
Neg logit contributions
Her-0.097
 Her-0.087
gen-0.087
 herself-0.085
 listened-0.083
Tokenamas
Token ID17485
Feature activation+2.78e+00

Pos logit contributions
 Esk+0.102
 camps+0.096
icer+0.096
yt+0.093
ge+0.093
Neg logit contributions
Her-0.231
gen-0.209
 listened-0.202
Then-0.197
 nobody-0.196
Token.
Token ID13
Feature activation-7.25e-02

Pos logit contributions
ge+0.062
 Esk+0.059
 camps+0.057
 upon+0.055
rou+0.055
Neg logit contributions
Her-0.067
gen-0.058
asure-0.055
 Her-0.055
Then-0.055
Token She
Token ID1375
Feature activation+1.25e+00

Pos logit contributions
Her+0.001
gen+0.001
asure+0.001
amed+0.001
 listened+0.001
Neg logit contributions
 Esk-0.002
 upon-0.002
 camps-0.002
ge-0.002
grown-0.002
Token hugs
Token ID40657
Feature activation+1.76e+00

Pos logit contributions
ge+0.031
 Esk+0.030
 camps+0.029
rou+0.028
icer+0.027
Neg logit contributions
Her-0.029
 Her-0.024
gen-0.024
 herself-0.023
asure-0.022
Token her
Token ID607
Feature activation+8.24e-01

Pos logit contributions
 Esk+0.049
ge+0.047
 camps+0.047
 polar+0.046
 asleep+0.046
Neg logit contributions
gen-0.029
Her-0.028
asure-0.026
amed-0.025
Then-0.025
Token They
Token ID1119
Feature activation+9.12e-01

Pos logit contributions
 Esk+0.000
 camps+0.000
 polar+0.000
ge+0.000
rou+0.000
Neg logit contributions
Her-0.000
amed-0.000
 listened-0.000
 herself-0.000
gen-0.000
Token make
Token ID787
Feature activation-1.45e+00

Pos logit contributions
 Esk+0.019
ge+0.019
 camps+0.018
rou+0.017
icer+0.016
Neg logit contributions
Her-0.025
 Her-0.021
gen-0.020
 herself-0.019
Then-0.019
Token you
Token ID345
Feature activation+2.10e-01

Pos logit contributions
Her+0.034
 Her+0.028
gen+0.028
asure+0.027
 counted+0.027
Neg logit contributions
 Esk-0.036
ge-0.034
rou-0.032
 camps-0.032
 polar-0.031
Token sick
Token ID6639
Feature activation+2.05e+00

Pos logit contributions
 Esk+0.005
ge+0.005
 camps+0.005
rou+0.005
icer+0.004
Neg logit contributions
Her-0.005
 Her-0.004
gen-0.004
 herself-0.004
asure-0.004
Token and
Token ID290
Feature activation-5.97e-01

Pos logit contributions
 Esk+0.042
ge+0.040
 camps+0.038
 polar+0.035
rou+0.035
Neg logit contributions
Her-0.057
 Her-0.049
gen-0.047
asure-0.046
Then-0.045
Token sore
Token ID19597
Feature activation+3.73e+00

Pos logit contributions
Her+0.014
 Her+0.012
gen+0.011
 herself+0.011
asure+0.011
Neg logit contributions
 Esk-0.014
ge-0.014
 camps-0.013
rou-0.013
 upon-0.012
Token.
Token ID13
Feature activation+4.66e-02

Pos logit contributions
ge+0.066
 Esk+0.064
 camps+0.056
 polar+0.054
 upon+0.053
Neg logit contributions
Her-0.112
 Her-0.095
gen-0.094
asure-0.093
Mir-0.092
Token You
Token ID921
Feature activation-1.43e-01

Pos logit contributions
 Esk+0.001
 camps+0.001
 polar+0.001
ge+0.001
rou+0.001
Neg logit contributions
Her-0.001
amed-0.001
 listened-0.001
 herself-0.001
asure-0.001
Token need
Token ID761
Feature activation+8.55e-01

Pos logit contributions
Her+0.003
 Her+0.003
gen+0.003
 herself+0.003
 listened+0.003
Neg logit contributions
ge-0.003
 Esk-0.003
 camps-0.003
rou-0.003
yt-0.003
Token to
Token ID284
Feature activation-3.22e-01

Pos logit contributions
 Esk+0.015
 camps+0.015
ge+0.015
 polar+0.014
 upon+0.014
Neg logit contributions
Her-0.023
gen-0.020
amed-0.020
iece-0.020
 Her-0.020
Token get
Token ID651
Feature activation-8.59e-01

Pos logit contributions
Her+0.007
 Her+0.006
gen+0.006
asure+0.005
Then+0.005
Neg logit contributions
 Esk-0.008
ge-0.008
 camps-0.008
rou-0.007
 polar-0.007
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
Her+0.002
gen+0.002
 listened+0.001
asure+0.001
 counted+0.001
Neg logit contributions
 Esk-0.002
ge-0.002
 camps-0.002
 upon-0.002
rou-0.002
Token
Token ID198
Feature activation-5.93e-01

Pos logit contributions
Her+0.003
 Her+0.002
 herself+0.002
:+0.002
Then+0.001
Neg logit contributions
 Esk-0.006
ge-0.006
 camps-0.006
rou-0.006
icer-0.006
TokenL
Token ID43
Feature activation+1.13e+00

Pos logit contributions
Her+0.009
gen+0.007
 Her+0.007
asure+0.007
 herself+0.006
Neg logit contributions
ge-0.019
 Esk-0.018
 camps-0.018
rou-0.017
 upon-0.017
Tokenily
Token ID813
Feature activation-3.30e-01

Pos logit contributions
ge+0.044
 Esk+0.044
 camps+0.042
rou+0.042
icer+0.041
Neg logit contributions
Her-0.010
 Her-0.006
gen-0.006
 herself-0.004
asure-0.004
Token and
Token ID290
Feature activation+3.15e-01

Pos logit contributions
Her+0.010
gen+0.008
 Her+0.008
 herself+0.008
ough+0.008
Neg logit contributions
rou-0.006
ge-0.006
 camps-0.005
 upon-0.005
 Esk-0.005
Token Max
Token ID5436
Feature activation+1.51e+00

Pos logit contributions
ge+0.007
 camps+0.007
 Esk+0.007
 upon+0.007
icer+0.007
Neg logit contributions
Her-0.007
gen-0.006
Then-0.006
amed-0.006
edly-0.006
Token looked
Token ID3114
Feature activation+1.38e+00

Pos logit contributions
 Esk+0.033
ge+0.032
 camps+0.030
 upon+0.029
rou+0.028
Neg logit contributions
Her-0.037
gen-0.030
ough-0.029
 herself-0.028
 Her-0.028
Token for
Token ID329
Feature activation+4.35e-01

Pos logit contributions
 Esk+0.039
ge+0.039
 camps+0.037
rou+0.036
icer+0.035
Neg logit contributions
Her-0.027
 Her-0.022
gen-0.021
 herself-0.020
asure-0.019
Token their
Token ID511
Feature activation-7.18e-02

Pos logit contributions
 Esk+0.012
 camps+0.011
ge+0.011
 polar+0.011
grown+0.011
Neg logit contributions
Her-0.009
asure-0.008
gen-0.008
amed-0.007
 Her-0.007
Token mom
Token ID1995
Feature activation+1.39e+00

Pos logit contributions
Her+0.002
 Her+0.002
 herself+0.001
amed+0.001
Then+0.001
Neg logit contributions
 camps-0.002
 Esk-0.002
ge-0.002
 upon-0.001
icer-0.001
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
ge+0.037
 Esk+0.036
 camps+0.035
rou+0.034
icer+0.033
Neg logit contributions
Her-0.030
 Her-0.024
gen-0.023
 herself-0.022
Then-0.022
Token but
Token ID475
Feature activation+1.31e+00

Pos logit contributions
ge+0.007
 Esk+0.007
 camps+0.007
rou+0.006
 upon+0.006
Neg logit contributions
Her-0.009
gen-0.007
 Her-0.007
 herself-0.007
asure-0.007
Token their
Token ID511
Feature activation-1.01e+00

Pos logit contributions
 upon+0.033
ge+0.032
 Esk+0.032
 camps+0.031
grown+0.030
Neg logit contributions
Her-0.028
gen-0.023
asure-0.022
ough-0.022
ages-0.022
Token favorite
Token ID4004
Feature activation-9.14e-01

Pos logit contributions
Her+0.022
 Her+0.017
gen+0.017
 herself+0.016
asure+0.016
Neg logit contributions
 Esk-0.027
ge-0.026
 camps-0.026
rou-0.025
icer-0.024
Token was
Token ID373
Feature activation-4.92e-01

Pos logit contributions
Her+0.023
 Her+0.017
asure+0.017
gen+0.017
Then+0.017
Neg logit contributions
ge-0.022
 Esk-0.021
 camps-0.021
icer-0.019
 upon-0.019
Token a
Token ID257
Feature activation+3.05e+00

Pos logit contributions
Her+0.009
gen+0.007
 Her+0.007
Then+0.006
asure+0.006
Neg logit contributions
 Esk-0.015
 camps-0.014
ge-0.014
rou-0.014
 upon-0.013
Token big
Token ID1263
Feature activation+2.27e+00

Pos logit contributions
 Esk+0.068
 camps+0.067
ge+0.065
rou+0.062
yt+0.060
Neg logit contributions
Her-0.079
 Her-0.067
 herself-0.066
gen-0.065
asure-0.063
Token puzzle
Token ID15027
Feature activation-1.21e+00

Pos logit contributions
ge+0.063
 Esk+0.063
 camps+0.059
rou+0.058
icer+0.056
Neg logit contributions
Her-0.045
 Her-0.037
gen-0.035
 herself-0.034
asure-0.032
Token with
Token ID351
Feature activation-3.26e-01

Pos logit contributions
Her+0.028
 Her+0.023
gen+0.022
 herself+0.021
asure+0.021
Neg logit contributions
ge-0.030
 Esk-0.030
 camps-0.029
rou-0.028
yt-0.027
Token many
Token ID867
Feature activation-8.57e-01

Pos logit contributions
Her+0.008
 Her+0.006
gen+0.006
 listened+0.006
Then+0.006
Neg logit contributions
 camps-0.008
 Esk-0.008
ge-0.007
grown-0.007
rou-0.007
Token pieces
Token ID5207
Feature activation-1.02e+00

Pos logit contributions
Her+0.020
 Her+0.016
gen+0.016
 herself+0.016
asure+0.015
Neg logit contributions
ge-0.021
 Esk-0.021
 camps-0.020
rou-0.019
icer-0.019
Token.
Token ID13
Feature activation-3.58e-02

Pos logit contributions
Her+0.022
 Her+0.018
gen+0.017
 herself+0.017
asure+0.016
Neg logit contributions
 Esk-0.027
ge-0.027
 camps-0.025
rou-0.025
icer-0.024
Token but
Token ID475
Feature activation+8.50e-01

Pos logit contributions
ge+0.015
 Esk+0.015
 camps+0.014
rou+0.014
 upon+0.014
Neg logit contributions
Her-0.018
gen-0.015
 Her-0.015
asure-0.014
amed-0.014
Token he
Token ID339
Feature activation+6.08e-01

Pos logit contributions
ge+0.022
 Esk+0.022
 upon+0.021
 camps+0.021
rou+0.020
Neg logit contributions
Her-0.019
gen-0.015
 Her-0.015
amed-0.015
Then-0.014
Token needed
Token ID2622
Feature activation+1.17e-01

Pos logit contributions
ge+0.014
 Esk+0.014
rou+0.013
 upon+0.013
 camps+0.013
Neg logit contributions
Her-0.015
 Her-0.012
gen-0.012
 herself-0.011
ough-0.011
Token to
Token ID284
Feature activation+1.51e-01

Pos logit contributions
ge+0.003
 Esk+0.003
 camps+0.003
rou+0.003
grown+0.003
Neg logit contributions
Her-0.003
 Her-0.002
gen-0.002
asure-0.002
amed-0.002
Token leave
Token ID2666
Feature activation-1.23e-01

Pos logit contributions
ge+0.004
 Esk+0.004
 camps+0.003
rou+0.003
icer+0.003
Neg logit contributions
Her-0.003
 Her-0.003
gen-0.003
amed-0.003
Then-0.003
Token a
Token ID257
Feature activation+1.67e+00

Pos logit contributions
Her+0.003
 Her+0.002
gen+0.002
asure+0.002
Then+0.002
Neg logit contributions
ge-0.003
 Esk-0.003
 camps-0.003
icer-0.003
 upon-0.003
Token little
Token ID1310
Feature activation+1.61e+00

Pos logit contributions
 Esk+0.043
ge+0.041
rou+0.038
icer+0.038
 camps+0.037
Neg logit contributions
Her-0.035
 Her-0.028
gen-0.028
 Queen-0.027
Then-0.027
Token tip
Token ID8171
Feature activation-1.09e+00

Pos logit contributions
 Esk+0.044
ge+0.042
rou+0.041
icer+0.040
 camps+0.039
Neg logit contributions
Her-0.028
 then-0.024
 she-0.023
gen-0.023
 Her-0.023
Token for
Token ID329
Feature activation-1.12e+00

Pos logit contributions
Her+0.025
 Her+0.021
gen+0.021
asure+0.019
 herself+0.019
Neg logit contributions
ge-0.028
 Esk-0.027
 camps-0.026
rou-0.025
 upon-0.024
Token the
Token ID262
Feature activation-1.92e-01

Pos logit contributions
Her+0.023
 Her+0.019
gen+0.019
asure+0.018
amed+0.017
Neg logit contributions
 Esk-0.031
ge-0.031
 camps-0.028
icer-0.027
 polar-0.027
Token nice
Token ID3621
Feature activation-5.98e-01

Pos logit contributions
Her+0.004
 Her+0.004
gen+0.003
 herself+0.003
asure+0.003
Neg logit contributions
 Esk-0.005
ge-0.005
 camps-0.005
rou-0.005
icer-0.004
Token,
Token ID11
Feature activation-5.74e-01

Pos logit contributions
Her+0.018
 Her+0.013
gen+0.012
 herself+0.012
asure+0.011
Neg logit contributions
 Esk-0.040
ge-0.039
 camps-0.038
rou-0.037
icer-0.036
Token there
Token ID612
Feature activation+1.78e+00

Pos logit contributions
Her+0.015
 Her+0.012
gen+0.012
 herself+0.012
:+0.012
Neg logit contributions
ge-0.013
rou-0.013
 Esk-0.013
 camps-0.012
yt-0.012
Token was
Token ID373
Feature activation-2.87e+00

Pos logit contributions
ge+0.060
 Esk+0.058
rou+0.057
 camps+0.057
yt+0.054
Neg logit contributions
Her-0.026
 Her-0.021
:-0.019
 herself-0.018
gen-0.018
Token a
Token ID257
Feature activation+9.53e-01

Pos logit contributions
Her+0.040
gen+0.029
 Her+0.027
asure+0.025
 herself+0.023
Neg logit contributions
 Esk-0.097
 camps-0.095
ge-0.091
icer-0.089
 upon-0.088
Token clown
Token ID25573
Feature activation-2.01e+00

Pos logit contributions
ge+0.026
 Esk+0.025
 camps+0.023
rou+0.023
icer+0.023
Neg logit contributions
Her-0.020
 Her-0.016
gen-0.016
Then-0.015
 listened-0.015
Token named
Token ID3706
Feature activation+2.73e+00

Pos logit contributions
Her+0.040
 Her+0.032
gen+0.031
 herself+0.030
asure+0.028
Neg logit contributions
 Esk-0.056
ge-0.056
 camps-0.054
rou-0.052
icer-0.051
Token Bob
Token ID5811
Feature activation+1.30e+00

Pos logit contributions
 Esk+0.067
 camps+0.064
ge+0.061
 polar+0.060
icer+0.058
Neg logit contributions
Her-0.067
gen-0.053
Then-0.053
amed-0.052
 herself-0.050
Tokeno
Token ID78
Feature activation+1.51e-02

Pos logit contributions
 Esk+0.038
ge+0.038
 camps+0.037
rou+0.036
yt+0.035
Neg logit contributions
Her-0.023
gen-0.019
 Her-0.019
:-0.019
 listened-0.017
Token.
Token ID13
Feature activation-2.24e-01

Pos logit contributions
ge+0.000
 Esk+0.000
 camps+0.000
rou+0.000
yt+0.000
Neg logit contributions
Her-0.000
 Her-0.000
gen-0.000
:-0.000
 herself-0.000
Token He
Token ID679
Feature activation+1.85e-01

Pos logit contributions
Her+0.004
 Her+0.003
gen+0.003
 herself+0.003
 Then+0.003
Neg logit contributions
ge-0.007
 Esk-0.007
 camps-0.006
rou-0.006
yt-0.006
Token was
Token ID373
Feature activation-1.37e+00

Pos logit contributions
 Esk+0.005
ge+0.005
 camps+0.005
rou+0.005
 polar+0.005
Neg logit contributions
Her-0.004
 Her-0.003
 herself-0.003
gen-0.003
asure-0.003
Token man
Token ID582
Feature activation-2.75e-01

Pos logit contributions
Her+0.029
 Her+0.023
 herself+0.021
asure+0.020
gen+0.020
Neg logit contributions
 Esk-0.033
 camps-0.031
ge-0.031
 polar-0.029
yt-0.029
Token did
Token ID750
Feature activation-1.25e+00

Pos logit contributions
Her+0.007
 Her+0.006
 herself+0.006
gen+0.006
ough+0.005
Neg logit contributions
 Esk-0.006
ge-0.006
rou-0.006
 camps-0.005
 polar-0.005
Token not
Token ID407
Feature activation+1.76e+00

Pos logit contributions
Her+0.020
 Her+0.015
gen+0.014
asure+0.013
Then+0.013
Neg logit contributions
 Esk-0.040
ge-0.040
 camps-0.038
rou-0.037
 upon-0.037
Token give
Token ID1577
Feature activation+2.31e+00

Pos logit contributions
ge+0.041
 Esk+0.040
 camps+0.036
rou+0.035
yt+0.034
Neg logit contributions
Her-0.044
 Her-0.037
gen-0.035
Then-0.035
 Then-0.035
Token up
Token ID510
Feature activation-1.67e+00

Pos logit contributions
 Esk+0.054
ge+0.054
 camps+0.052
rou+0.049
 upon+0.048
Neg logit contributions
Her-0.057
 Her-0.047
gen-0.045
Then-0.044
asure-0.043
Token easily
Token ID3538
Feature activation-6.63e-02

Pos logit contributions
Her+0.038
 Her+0.031
gen+0.030
asure+0.029
 herself+0.028
Neg logit contributions
ge-0.043
 Esk-0.043
 camps-0.040
rou-0.039
 upon-0.038
Token.
Token ID13
Feature activation+7.54e-01

Pos logit contributions
Her+0.002
 Her+0.001
gen+0.001
Then+0.001
 herself+0.001
Neg logit contributions
ge-0.002
 Esk-0.002
 camps-0.002
rou-0.002
 upon-0.001
Token He
Token ID679
Feature activation+6.37e-01

Pos logit contributions
 Esk+0.013
 camps+0.012
rou+0.011
 upon+0.010
 polar+0.010
Neg logit contributions
Her-0.023
P-0.018
amed-0.018
 listened-0.018
Then-0.018
Token tried
Token ID3088
Feature activation-6.45e-01

Pos logit contributions
 Esk+0.015
 camps+0.015
ge+0.015
rou+0.014
 polar+0.014
Neg logit contributions
Her-0.016
 Her-0.012
 herself-0.012
ough-0.012
asure-0.011
Token to
Token ID284
Feature activation+8.87e-02

Pos logit contributions
Her+0.017
gen+0.014
 Her+0.014
asure+0.014
amed+0.014
Neg logit contributions
ge-0.014
 camps-0.013
 Esk-0.013
rou-0.012
 upon-0.012
Token find
Token ID1064
Feature activation-8.95e-01

Pos logit contributions
 Esk+0.002
ge+0.002
 camps+0.002
rou+0.002
 polar+0.002
Neg logit contributions
Her-0.002
 Her-0.002
gen-0.002
Then-0.002
 Then-0.002
Token was
Token ID373
Feature activation-6.24e-01

Pos logit contributions
Her+0.008
 Her+0.006
gen+0.006
 herself+0.005
asure+0.005
Neg logit contributions
 Esk-0.014
ge-0.014
 camps-0.014
rou-0.013
 upon-0.013
Token especially
Token ID2592
Feature activation-7.28e-01

Pos logit contributions
Her+0.013
 Her+0.010
gen+0.010
asure+0.009
Then+0.009
Neg logit contributions
 Esk-0.017
ge-0.017
 camps-0.016
rou-0.016
 upon-0.016
Token hot
Token ID3024
Feature activation+2.23e+00

Pos logit contributions
Her+0.018
 Her+0.016
gen+0.015
asure+0.015
 Queen+0.014
Neg logit contributions
 Esk-0.017
rou-0.016
ge-0.015
 upon-0.015
grown-0.014
Token and
Token ID290
Feature activation+5.76e-01

Pos logit contributions
 Esk+0.060
ge+0.060
 camps+0.058
rou+0.055
icer+0.054
Neg logit contributions
Her-0.048
 Her-0.038
gen-0.037
 herself-0.035
asure-0.034
Token the
Token ID262
Feature activation+8.46e-01

Pos logit contributions
 Esk+0.014
ge+0.014
 camps+0.013
rou+0.013
 upon+0.013
Neg logit contributions
Her-0.013
 Her-0.011
gen-0.011
asure-0.010
 herself-0.010
Token z
Token ID1976
Feature activation+1.36e+00

Pos logit contributions
 Esk+0.021
 camps+0.020
ge+0.020
rou+0.019
 polar+0.019
Neg logit contributions
Her-0.021
 Her-0.017
gen-0.016
asure-0.015
 herself-0.015
Tokenebra
Token ID37052
Feature activation-8.21e-02

Pos logit contributions
 Esk+0.029
ge+0.029
 camps+0.027
rou+0.026
icer+0.025
Neg logit contributions
Her-0.036
 Her-0.030
gen-0.030
 herself-0.029
asure-0.028
Token herd
Token ID27638
Feature activation+8.68e-01

Pos logit contributions
Her+0.002
 Her+0.001
gen+0.001
asure+0.001
 herself+0.001
Neg logit contributions
ge-0.002
 Esk-0.002
 camps-0.002
 upon-0.002
rou-0.002
Token was
Token ID373
Feature activation+1.71e-01

Pos logit contributions
ge+0.020
 Esk+0.020
 upon+0.019
 camps+0.018
rou+0.018
Neg logit contributions
Her-0.021
 Her-0.017
gen-0.017
Then-0.016
asure-0.016
Token growing
Token ID3957
Feature activation+2.71e+00

Pos logit contributions
 Esk+0.004
ge+0.004
 camps+0.004
rou+0.004
 upon+0.004
Neg logit contributions
Her-0.004
 Her-0.003
gen-0.003
asure-0.003
 herself-0.003
Token very
Token ID845
Feature activation-1.29e-01

Pos logit contributions
ge+0.076
 Esk+0.075
 camps+0.071
rou+0.069
icer+0.069
Neg logit contributions
Her-0.051
 herself-0.041
 Her-0.040
gen-0.040
:-0.038
Tokenw
Token ID86
Feature activation-1.40e-01

Pos logit contributions
Her+0.017
gen+0.012
 listened+0.011
 herself+0.010
 Her+0.010
Neg logit contributions
 Esk-0.047
 camps-0.046
ge-0.046
 polar-0.043
rou-0.042
Token,
Token ID11
Feature activation+4.97e-01

Pos logit contributions
Her+0.003
 Her+0.002
gen+0.002
 herself+0.002
asure+0.002
Neg logit contributions
 Esk-0.004
ge-0.004
 camps-0.004
rou-0.004
icer-0.004
Token ow
Token ID12334
Feature activation-4.84e-01

Pos logit contributions
ge+0.014
 Esk+0.014
 camps+0.014
rou+0.013
icer+0.013
Neg logit contributions
Her-0.009
 Her-0.007
gen-0.007
 herself-0.007
asure-0.007
Token,
Token ID11
Feature activation+5.09e-01

Pos logit contributions
Her+0.005
:+0.003
 Her+0.003
 herself+0.003
gen+0.003
Neg logit contributions
ge-0.018
 Esk-0.018
 camps-0.017
rou-0.017
icer-0.016
Token ow
Token ID12334
Feature activation-3.33e-01

Pos logit contributions
ge+0.015
 Esk+0.015
 camps+0.014
rou+0.014
icer+0.013
Neg logit contributions
Her-0.010
 Her-0.008
gen-0.007
 herself-0.007
asure-0.007
Token!
Token ID0
Feature activation-9.15e-01

Pos logit contributions
Her+0.007
:+0.006
 herself+0.006
gen+0.006
 Her+0.006
Neg logit contributions
ge-0.008
 Esk-0.008
 camps-0.008
rou-0.008
icer-0.007
Token My
Token ID2011
Feature activation-1.16e+00

Pos logit contributions
Her+0.017
 Her+0.013
gen+0.012
 herself+0.012
asure+0.011
Neg logit contributions
ge-0.028
 Esk-0.027
 camps-0.026
rou-0.025
yt-0.025
Token knee
Token ID10329
Feature activation+2.17e-01

Pos logit contributions
Her+0.020
 herself+0.017
 Her+0.016
gen+0.014
 she+0.014
Neg logit contributions
ge-0.035
 Esk-0.034
 camps-0.033
rou-0.031
icer-0.031
Token is
Token ID318
Feature activation-4.21e-01

Pos logit contributions
 Esk+0.006
ge+0.006
 camps+0.006
icer+0.005
rou+0.005
Neg logit contributions
Her-0.004
 Her-0.004
 herself-0.004
gen-0.004
 listened-0.003
Token bleeding
Token ID16832
Feature activation+3.27e+00

Pos logit contributions
Her+0.009
 Her+0.008
gen+0.007
 herself+0.007
asure+0.007
Neg logit contributions
ge-0.011
 Esk-0.011
 camps-0.010
rou-0.010
icer-0.010
Token!"
Token ID2474
Feature activation-9.47e-01

Pos logit contributions
ge+0.074
 Esk+0.072
 camps+0.069
rou+0.067
icer+0.065
Neg logit contributions
Her-0.081
 herself-0.069
 Her-0.066
:-0.066
gen-0.065
Token
Token ID198
Feature activation-9.64e-02

Pos logit contributions
 Esk+0.003
ge+0.003
 camps+0.003
 upon+0.003
icer+0.003
Neg logit contributions
Her-0.002
gen-0.002
 Her-0.002
 herself-0.002
asure-0.002
Token
Token ID198
Feature activation-4.79e-01

Pos logit contributions
Her+0.001
 Her+0.001
:+0.001
 herself+0.001
Then+0.001
Neg logit contributions
 Esk-0.003
ge-0.003
 camps-0.003
rou-0.003
yt-0.003
TokenThe
Token ID464
Feature activation-2.10e+00

Pos logit contributions
Her+0.010
 Her+0.008
gen+0.008
 herself+0.008
asure+0.007
Neg logit contributions
ge-0.013
 Esk-0.013
 camps-0.012
rou-0.012
icer-0.012
Token village
Token ID7404
Feature activation-2.45e+00

Pos logit contributions
Her+0.053
 Her+0.044
 herself+0.039
 Then+0.038
asure+0.037
Neg logit contributions
 Esk-0.054
 camps-0.050
 polar-0.046
ge-0.044
 cabin-0.044
Token people
Token ID661
Feature activation-1.97e+00

Pos logit contributions
Her+0.064
 herself+0.051
 Her+0.051
gen+0.049
ough+0.048
Neg logit contributions
 Esk-0.052
ge-0.052
 upon-0.051
 camps-0.049
 polar-0.048
Token used
Token ID973
Feature activation+3.61e-01

Pos logit contributions
Her+0.050
 Her+0.040
gen+0.040
 herself+0.039
ough+0.037
Neg logit contributions
ge-0.046
 Esk-0.044
 upon-0.043
rou-0.041
 camps-0.040
Token Dave
Token ID9935
Feature activation-1.08e+00

Pos logit contributions
ge+0.010
 Esk+0.010
 camps+0.009
rou+0.009
 upon+0.009
Neg logit contributions
Her-0.008
 Her-0.006
gen-0.006
Then-0.006
 herself-0.005
Token's
Token ID338
Feature activation+4.19e-02

Pos logit contributions
Her+0.028
 Her+0.021
Then+0.021
 Then+0.020
asure+0.020
Neg logit contributions
ge-0.026
 upon-0.024
 camps-0.024
rou-0.022
grown-0.022
Token treasure
Token ID14068
Feature activation-6.25e-01

Pos logit contributions
 Esk+0.001
ge+0.001
 camps+0.001
rou+0.001
 upon+0.001
Neg logit contributions
Her-0.001
 Her-0.001
 herself-0.001
gen-0.001
Then-0.001
Token to
Token ID284
Feature activation+6.98e-01

Pos logit contributions
Her+0.018
 Her+0.015
Then+0.014
Mir+0.014
Bro+0.014
Neg logit contributions
ge-0.013
 camps-0.012
 upon-0.012
 Esk-0.011
yt-0.011
Token make
Token ID787
Feature activation-1.13e+00

Pos logit contributions
 Esk+0.016
ge+0.016
 camps+0.015
rou+0.014
 polar+0.014
Neg logit contributions
Her-0.018
 Her-0.015
gen-0.014
Then-0.014
 herself-0.013
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
Her+0.014
 Her+0.012
gen+0.011
 herself+0.011
Then+0.011
Neg logit contributions
ge-0.012
 Esk-0.012
 camps-0.012
rou-0.011
yt-0.010
Token They
Token ID1119
Feature activation-1.71e-01

Pos logit contributions
Her+0.010
gen+0.008
asure+0.008
 listened+0.008
amed+0.008
Neg logit contributions
 Esk-0.010
 camps-0.010
ge-0.010
 upon-0.009
rou-0.009
Token have
Token ID423
Feature activation-2.11e+00

Pos logit contributions
Her+0.004
 Her+0.003
gen+0.003
 herself+0.003
asure+0.003
Neg logit contributions
ge-0.004
 Esk-0.004
 camps-0.004
rou-0.004
icer-0.004
Token fun
Token ID1257
Feature activation-1.85e+00

Pos logit contributions
Her+0.050
 Her+0.040
gen+0.037
asure+0.037
 herself+0.036
Neg logit contributions
 Esk-0.054
 camps-0.052
ge-0.050
rou-0.048
grown-0.046
Token.
Token ID13
Feature activation-3.62e-01

Pos logit contributions
Her+0.033
 herself+0.026
 Her+0.025
gen+0.025
:+0.024
Neg logit contributions
ge-0.055
 Esk-0.054
 camps-0.053
rou-0.051
icer-0.051
Token They
Token ID1119
Feature activation+3.27e-03

Pos logit contributions
Her+0.008
gen+0.007
asure+0.007
 listened+0.006
 herself+0.006
Neg logit contributions
 Esk-0.009
 camps-0.008
ge-0.008
 upon-0.008
rou-0.008
Token are
Token ID389
Feature activation-3.30e+00

Pos logit contributions
 Esk+0.000
ge+0.000
 camps+0.000
rou+0.000
icer+0.000
Neg logit contributions
Her-0.000
 Her-0.000
gen-0.000
 herself-0.000
asure-0.000
Token happy
Token ID3772
Feature activation-1.47e+00

Pos logit contributions
Her+0.075
 Her+0.062
gen+0.057
asure+0.054
Then+0.054
Neg logit contributions
 Esk-0.092
 camps-0.080
rou-0.080
ge-0.079
 polar-0.075
Token.
Token ID13
Feature activation+1.41e-01

Pos logit contributions
Her+0.031
 Her+0.025
gen+0.024
 herself+0.024
asure+0.023
Neg logit contributions
 Esk-0.040
ge-0.039
 camps-0.038
rou-0.037
icer-0.035
Token<|endoftext|>
Token ID50256
Feature activation+5.21e+00

Pos logit contributions
 Esk+0.003
 camps+0.003
 upon+0.003
grown+0.003
ge+0.003
Neg logit contributions
Her-0.003
asure-0.003
gen-0.003
amed-0.003
 herself-0.003
TokenSam
Token ID16305
Feature activation+1.12e+00

Pos logit contributions
ge+0.153
 Esk+0.141
rou+0.140
yt+0.135
 camps+0.135
Neg logit contributions
Her-0.104
 Her-0.088
:-0.084
 herself-0.082
 Then-0.082
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
 Esk+0.014
ge+0.014
 camps+0.013
rou+0.013
icer+0.012
Neg logit contributions
Her-0.011
 Her-0.009
gen-0.009
 herself-0.009
asure-0.008
Token Jen
Token ID13364
Feature activation-3.07e-01

Pos logit contributions
Her+0.003
 Her+0.002
gen+0.002
Then+0.002
amed+0.002
Neg logit contributions
 Esk-0.003
ge-0.003
 camps-0.003
rou-0.003
 upon-0.003
Token.
Token ID13
Feature activation-4.38e-02

Pos logit contributions
Her+0.007
 Her+0.006
gen+0.006
 herself+0.005
asure+0.005
Neg logit contributions
ge-0.008
 Esk-0.008
 camps-0.007
rou-0.007
icer-0.007
Token You
Token ID921
Feature activation-3.30e-01

Pos logit contributions
Her+0.001
 listened+0.001
amed+0.001
 counted+0.001
gen+0.001
Neg logit contributions
 Esk-0.001
icer-0.001
ge-0.001
rou-0.001
 camps-0.001
Token should
Token ID815
Feature activation-1.88e+00

Pos logit contributions
Her+0.007
gen+0.006
 Her+0.006
 listened+0.006
asure+0.006
Neg logit contributions
ge-0.008
 Esk-0.008
 camps-0.008
rou-0.008
yt-0.008
Token share
Token ID2648
Feature activation-3.39e+00

Pos logit contributions
Her+0.029
 Her+0.021
gen+0.020
Then+0.018
 herself+0.018
Neg logit contributions
 Esk-0.060
ge-0.060
 camps-0.057
rou-0.056
yt-0.054
Token and
Token ID290
Feature activation-2.45e+00

Pos logit contributions
Her+0.083
 Her+0.072
gen+0.072
amed+0.072
asure+0.069
Neg logit contributions
ge-0.076
 Esk-0.068
 upon-0.066
rou-0.065
 camps-0.065
Token be
Token ID307
Feature activation-3.19e+00

Pos logit contributions
Her+0.050
 Her+0.041
gen+0.039
amed+0.038
Then+0.037
Neg logit contributions
ge-0.064
 Esk-0.063
 breath-0.057
rou-0.057
 camps-0.056
Token nice
Token ID3621
Feature activation+2.04e-01

Pos logit contributions
Her+0.071
 Her+0.069
cake+0.065
owers+0.064
ibrarian+0.063
Neg logit contributions
 Esk-0.071
rou-0.062
grown-0.058
ge-0.058
 polar-0.055
Token."
Token ID526
Feature activation-4.00e-01

Pos logit contributions
ge+0.005
 Esk+0.005
 camps+0.005
rou+0.004
 upon+0.004
Neg logit contributions
Her-0.005
 Her-0.004
gen-0.004
 herself-0.004
asure-0.004
Token\n
Token ID198
Feature activation+2.12e-01

Pos logit contributions
Her+0.008
gen+0.006
 Her+0.006
asure+0.006
 herself+0.006
Neg logit contributions
 Esk-0.011
ge-0.011
 camps-0.011
rou-0.010
 upon-0.010
TokenThe
Token ID464
Feature activation-2.08e+00

Pos logit contributions
Her+0.009
gen+0.008
 Her+0.008
asure+0.008
 counted+0.008
Neg logit contributions
ge-0.020
 Esk-0.019
 camps-0.019
rou-0.018
 upon-0.018
Token moral
Token ID6573
Feature activation-1.67e+00

Pos logit contributions
Her+0.041
 Her+0.032
gen+0.031
 herself+0.030
asure+0.028
Neg logit contributions
 Esk-0.060
ge-0.060
 camps-0.057
rou-0.055
icer-0.054
Token of
Token ID286
Feature activation-2.06e+00

Pos logit contributions
Her+0.051
 Her+0.044
gen+0.043
 herself+0.042
asure+0.041
Neg logit contributions
ge-0.029
 Esk-0.029
 camps-0.027
rou-0.026
icer-0.024
Token the
Token ID262
Feature activation-4.03e+00

Pos logit contributions
Her+0.039
 Her+0.030
gen+0.029
asure+0.028
 herself+0.027
Neg logit contributions
ge-0.059
 Esk-0.059
 camps-0.056
rou-0.055
icer-0.054
Token story
Token ID1621
Feature activation-1.36e+00

Pos logit contributions
Her+0.084
 Her+0.071
 herself+0.070
gen+0.069
:+0.069
Neg logit contributions
ge-0.112
 Esk-0.103
rou-0.099
 camps-0.099
icer-0.095
Token is
Token ID318
Feature activation-4.06e+00

Pos logit contributions
Her+0.037
 Her+0.030
 herself+0.029
gen+0.029
Then+0.028
Neg logit contributions
ge-0.028
 Esk-0.027
rou-0.026
 camps-0.025
 upon-0.025
Token that
Token ID326
Feature activation+3.20e+00

Pos logit contributions
Her+0.078
gen+0.058
 Her+0.058
 counted+0.058
amed+0.055
Neg logit contributions
ge-0.112
 Esk-0.110
rou-0.110
 upon-0.105
 camps-0.103
Token love
Token ID1842
Feature activation+4.87e-01

Pos logit contributions
ge+0.081
 Esk+0.080
 camps+0.076
rou+0.075
 upon+0.073
Neg logit contributions
Her-0.072
 Her-0.059
asure-0.056
gen-0.056
amed-0.054
Token is
Token ID318
Feature activation-2.20e+00

Pos logit contributions
ge+0.010
rou+0.010
 upon+0.010
 Esk+0.009
 camps+0.009
Neg logit contributions
Her-0.013
 Her-0.010
gen-0.010
His-0.010
asure-0.010
Token an
Token ID281
Feature activation+2.17e+00

Pos logit contributions
Her+0.051
 Her+0.041
iece+0.040
asure+0.038
ough+0.038
Neg logit contributions
ge-0.053
rou-0.053
 Esk-0.051
 upon-0.050
grown-0.049
Token un
Token ID555
Feature activation+2.45e+00

Pos logit contributions
ge+0.065
 Esk+0.064
 camps+0.061
rou+0.059
icer+0.059
Neg logit contributions
Her-0.039
 Her-0.030
gen-0.030
 herself-0.029
asure-0.027
Token them
Token ID606
Feature activation-7.30e-01

Pos logit contributions
ge+0.013
 Esk+0.013
yt+0.013
rou+0.013
icer+0.012
Neg logit contributions
Her-0.006
 herself-0.004
 her-0.004
 Her-0.004
:-0.004
Token and
Token ID290
Feature activation+7.60e-01

Pos logit contributions
Her+0.016
 Her+0.013
gen+0.013
 herself+0.012
asure+0.012
Neg logit contributions
ge-0.019
 Esk-0.019
 camps-0.018
rou-0.018
icer-0.017
Token said
Token ID531
Feature activation+2.15e+00

Pos logit contributions
 Esk+0.021
ge+0.021
 camps+0.020
rou+0.020
 upon+0.019
Neg logit contributions
Her-0.015
gen-0.012
 Her-0.012
asure-0.011
 herself-0.011
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
 Esk+0.062
ge+0.060
rou+0.057
 camps+0.057
 upon+0.057
Neg logit contributions
Her-0.038
gen-0.031
asure-0.028
 Her-0.028
Then-0.027
Token "
Token ID366
Feature activation-8.94e-02

Pos logit contributions
 Esk+0.025
 upon+0.024
ge+0.024
rou+0.023
joy+0.021
Neg logit contributions
gen-0.020
Her-0.019
ough-0.019
bye-0.018
amed-0.018
TokenOh
Token ID5812
Feature activation-3.35e+00

Pos logit contributions
Her+0.002
 Her+0.001
gen+0.001
 herself+0.001
asure+0.001
Neg logit contributions
ge-0.003
 Esk-0.003
 camps-0.002
rou-0.002
icer-0.002
Token,
Token ID11
Feature activation+1.95e-02

Pos logit contributions
Her+0.057
gen+0.044
 Her+0.042
 herself+0.040
asure+0.040
Neg logit contributions
 Esk-0.104
ge-0.098
 camps-0.098
rou-0.094
 upon-0.093
Token no
Token ID645
Feature activation+1.48e+00

Pos logit contributions
ge+0.001
 Esk+0.001
 camps+0.001
rou+0.001
icer+0.001
Neg logit contributions
Her-0.000
 herself-0.000
 Her-0.000
 she-0.000
 listened-0.000
Token,
Token ID11
Feature activation+8.65e-01

Pos logit contributions
ge+0.046
 Esk+0.046
 camps+0.044
rou+0.043
icer+0.042
Neg logit contributions
Her-0.026
 Her-0.019
gen-0.019
 herself-0.018
asure-0.017
Token you
Token ID345
Feature activation+2.27e-01

Pos logit contributions
ge+0.027
 Esk+0.026
 camps+0.026
rou+0.025
icer+0.024
Neg logit contributions
Her-0.015
 Her-0.011
 herself-0.011
gen-0.011
 listened-0.010
Token went
Token ID1816
Feature activation+2.06e+00

Pos logit contributions
ge+0.007
 camps+0.007
 Esk+0.007
rou+0.006
icer+0.006
Neg logit contributions
Her-0.004
 listened-0.004
 counted-0.003
 Her-0.003
gen-0.003
Token\n
Token ID198
Feature activation-2.34e-01

Pos logit contributions
Her+0.006
gen+0.005
 listened+0.005
 counted+0.004
amed+0.004
Neg logit contributions
 Esk-0.006
ge-0.006
 camps-0.006
 upon-0.006
icer-0.006
Token\n
Token ID198
Feature activation-5.68e-01

Pos logit contributions
Her+0.005
 Her+0.003
 herself+0.003
gen+0.003
Then+0.003
Neg logit contributions
ge-0.007
 Esk-0.007
 camps-0.007
rou-0.006
yt-0.006
TokenFrom
Token ID4863
Feature activation-3.05e+00

Pos logit contributions
Her+0.009
 Her+0.008
gen+0.007
asure+0.007
 herself+0.007
Neg logit contributions
ge-0.017
 Esk-0.017
 camps-0.016
rou-0.016
 upon-0.016
Token that
Token ID326
Feature activation+1.44e+00

Pos logit contributions
Her+0.050
 Her+0.039
gen+0.035
ough+0.031
 counted+0.031
Neg logit contributions
 Esk-0.097
ge-0.095
 camps-0.093
rou-0.092
 upon-0.088
Token day
Token ID1110
Feature activation-9.07e-01

Pos logit contributions
 camps+0.036
ge+0.036
 Esk+0.035
 upon+0.034
icer+0.033
Neg logit contributions
Her-0.033
 Her-0.026
asure-0.025
gen-0.025
 herself-0.025
Token forward
Token ID2651
Feature activation-3.37e+00

Pos logit contributions
Her+0.027
gen+0.024
 Her+0.023
asure+0.023
her+0.022
Neg logit contributions
 Esk-0.014
ge-0.014
 upon-0.014
 camps-0.013
grown-0.012
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
Her+0.065
gen+0.046
 Her+0.046
Then+0.045
amed+0.043
Neg logit contributions
ge-0.099
 Esk-0.099
 camps-0.093
 upon-0.092
rou-0.090
Token the
Token ID262
Feature activation-1.51e+00

Pos logit contributions
Her+0.026
gen+0.020
 Her+0.020
asure+0.020
amed+0.019
Neg logit contributions
 Esk-0.029
ge-0.029
 camps-0.028
 upon-0.026
rou-0.026
Token polite
Token ID23507
Feature activation-1.09e+00

Pos logit contributions
Her+0.035
 Her+0.029
gen+0.028
asure+0.026
 herself+0.026
Neg logit contributions
 Esk-0.039
 camps-0.037
ge-0.036
rou-0.034
icer-0.034
Token raven
Token ID37735
Feature activation-1.59e+00

Pos logit contributions
Her+0.025
 Her+0.020
gen+0.019
 herself+0.019
asure+0.019
Neg logit contributions
 Esk-0.027
ge-0.027
 camps-0.027
rou-0.025
 upon-0.025
Token was
Token ID373
Feature activation-1.15e+00

Pos logit contributions
Her+0.040
 Her+0.031
asure+0.031
ough+0.031
 herself+0.030
Neg logit contributions
ge-0.035
rou-0.034
 upon-0.034
 camps-0.034
 Esk-0.033
Token far
Token ID1290
Feature activation+1.78e+00

Pos logit contributions
ge+0.077
 Esk+0.077
 camps+0.073
rou+0.071
icer+0.068
Neg logit contributions
Her-0.068
 Her-0.056
gen-0.054
 herself-0.052
asure-0.051
Token away
Token ID1497
Feature activation+4.97e-01

Pos logit contributions
 Esk+0.053
ge+0.050
 camps+0.049
rou+0.049
icer+0.048
Neg logit contributions
Her-0.032
 herself-0.025
 Her-0.024
:-0.024
gen-0.024
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
ge+0.013
 Esk+0.013
 camps+0.012
rou+0.012
icer+0.011
Neg logit contributions
Her-0.011
 Her-0.009
gen-0.009
 herself-0.008
asure-0.008
Token\n
Token ID198
Feature activation-3.45e-01

Pos logit contributions
Her+0.005
asure+0.005
amed+0.004
her+0.004
gen+0.004
Neg logit contributions
 upon-0.004
 Esk-0.004
 camps-0.004
ge-0.004
icer-0.004
Token\n
Token ID198
Feature activation-6.93e-01

Pos logit contributions
Her+0.007
gen+0.006
 Her+0.006
asure+0.006
 herself+0.005
Neg logit contributions
ge-0.009
 Esk-0.009
 camps-0.009
 upon-0.008
rou-0.008
TokenAfter
Token ID3260
Feature activation-3.84e+00

Pos logit contributions
Her+0.013
 Her+0.013
asure+0.013
 herself+0.012
gen+0.012
Neg logit contributions
ge-0.016
 Esk-0.016
 upon-0.016
 camps-0.016
grown-0.014
Token a
Token ID257
Feature activation-1.17e-01

Pos logit contributions
Her+0.082
gen+0.071
asure+0.069
 Her+0.066
rac+0.064
Neg logit contributions
 Esk-0.101
ge-0.094
 camps-0.093
 polar-0.093
grown-0.088
Token few
Token ID1178
Feature activation-4.42e+00

Pos logit contributions
Her+0.003
 herself+0.003
 Her+0.003
gen+0.003
asure+0.003
Neg logit contributions
ge-0.002
 camps-0.002
 Esk-0.002
yt-0.002
 upon-0.002
Token rounds
Token ID9196
Feature activation-3.93e+00

Pos logit contributions
Her+0.119
 Her+0.103
asure+0.102
gen+0.100
 herself+0.094
Neg logit contributions
ge-0.093
 Esk-0.092
 camps-0.092
 breath-0.083
grown-0.082
Token,
Token ID11
Feature activation-9.24e-01

Pos logit contributions
Her+0.106
gen+0.092
 Her+0.085
asure+0.084
Then+0.082
Neg logit contributions
ge-0.085
 Esk-0.078
 upon-0.078
 camps-0.076
rou-0.072
Token it
Token ID340
Feature activation-2.25e+00

Pos logit contributions
Her+0.024
asure+0.020
 Her+0.020
amed+0.019
her+0.019
Neg logit contributions
 Esk-0.020
 upon-0.020
 camps-0.019
ge-0.019
 breath-0.018